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Preface 


Politics  has  always  been  a  key  dimension  of 
human  life.  Politics  affects  everyone,  just  as  the 
weather  does,  and  since  the  time  of  the  Greek  clas¬ 
sics,  humankind  has  attempted  to  come  to  grips 
with  its  empirical  and  normative  implications. 
Politics  and  political  science  have  changed  over 
time  and  have  been  even  more  deeply  transformed 
during  the  past  decades  in  every  part  of  an  ever- 
smaller  world.  Empirical  research  and  theoretical 
reflections  on  politics  and  its  multiple  connections 
with  all  other  aspects  of  human  life  have  devel¬ 
oped  enormously  during  the  second  half  of  the 
past  century  and  now  cover  virtually  all  parts  of 
the  world,  with  their  growing  interdependence. 
These  studies  concern,  for  example,  basic  issues 
such  as  war  and  peace,  prosperity,  welfare,  and  a 
sustainable  environment,  as  well  as  issues  of  free¬ 
dom,  justice,  gender,  and  democracy  under  chang¬ 
ing  cultural  perspectives. 

At  this  point,  one  feels  the  need  for  an 
exhaustive  overview  of  the  empirical  findings 
and  the  reflections  on  politics.  The  International 
Encyclopedia  of  Political  Science  caters  to  that 
need  in  many  different  kinds  of  readers,  includ¬ 
ing  undergraduate  or  graduate  students  who 
like  to  be  informed  effectively  and  quickly  on 
their  field  of  study;  scholars  who  seek  informa¬ 
tion  on  the  relevant  empirical  findings  in  their 
area  of  specialization  or  in  related  fields;  and 
lay  readers,  who  may  not  have  a  formal  back¬ 
ground  in  political  science  but  are  attracted  to, 
and  interested  in,  politics.  Thus,  the  encyclope¬ 
dia  can  serve  both  the  diffusion  of  knowledge 
about  politics  and  the  further  development  of 
the  field. 

The  encyclopedia  is  the  result  of  collaborative 
work  by  leading  international  scholars  from  all 
over  the  world.  This  team  ensured  the  widest  pos¬ 
sible  coverage  of  key  areas,  both  regionally  and 


globally.  In  this  sense,  this  is  the  first  truly  interna¬ 
tional  encyclopedia  of  its  kind.  It  aims  to  give  a 
comprehensive  picture  of  all  aspects  of  political 
life,  recognizing  the  theoretical  and  cultural  plural¬ 
ism  of  the  approaches  and  including  findings  from 
all  parts  of  the  world.  The  eight  volumes  of  the 
encyclopedia  cover  every  field  of  politics,  from 
political  theory  and  methodology  to  political  soci¬ 
ology,  comparative  politics,  public  policies,  and 
international  relations. 

The  entries  are  arranged  in  alphabetical  order, 
and  a  list  of  entries  by  subject  area  appears  in  the 
front  of  each  volume.  The  entries  are  organized  in 
three  major  categories,  according  to  their  substan¬ 
tive  relevance,  of  roughly  1,800,  3,600,  and  8,000 
words.  In  addition,  longer  entries  of  12,000  words 
cover  the  major  subdisciplines  and  the  state  of  the 
art  in  each  field.  The  encyclopedia  contains  a 
detailed  index  as  well  as  extensive  bibliographical 
references.  Thus,  it  can  provide  an  essential  and 
authoritative  guide  to  the  state  of  political  science 
at  the  beginning  of  the  21st  century  for  decades  to 
come.  It  will  be  an  invaluable  resource  for  a  global 
readership,  including  researchers,  students,  and 
policymakers. 

The  editors  wish  to  acknowledge  the  invalu¬ 
able  assistance  and  advice  of  many  colleagues 
and  friends,  including  the  members  of  the 
Executive  Committee  of  the  International 
Political  Science  Association  (IPSA)  and  its  presi¬ 
dents  during  this  period,  Max  Kaase  and  Lourdes 
Sola.  Bertrand  Badie  wishes  to  thank  Sciences  Po, 
Paris,  and  particularly  Delphine  Placidi,  who 
assisted  him  in  the  first  steps  of  the  work.  Dirk 
Berg-Schlosser  gladly  acknowledges  the  assis¬ 
tance  from  Diana  Mai  and  Joris-Johann  Krapp  at 
the  Institute  of  Political  Science,  Philipps- 
University,  Marburg,  Germany,  in  the  initial 
phases  of  the  project.  Leonardo  Morlino  would 
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like  to  warmly  thank  Dr.  Claudius  Wagemann  for 
his  invaluable  help  in  the  starting  phase  of  the 
encyclopedia. 
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Cet  ouvrage  produira  surement  avec  le 
temps  une  revolution  dans  les  esprits,  et 
j’espere  que  les  tyrans,  les  oppresseurs, 
les  fanatiques  et  les  intolerants  n’y 
gagneront  pas.  Nous  aurons  servi 
I’humanite.  [Over  time,  this  work  will 
surely  produce  an  intellectual  revolution, 
and  I  hope  that  the  tyrants,  oppressors, 
fanatics,  and  those  without  tolerance 
will  not  benefit  from  it.  We  will  have 
served  humanity.] 

— Letter  from  Diderot  to  Sophie 
Volland,  September  26,  1762 

Political  science,  paradoxically,  is  both  a  very  old 
and  relatively  recent  discipline.  Its  origins  go  back 
to  antiquity  in  classic  European  or  Asian  thought 
as  far  as  recorded  history  goes.  As  an  independent 
and  respected  academic  field,  however,  it  came 
into  being  in  most  countries  only  after  World  War 
II.  This  is  due  in  part  to  the  fact  that  its  subject 
matter  had  been  addressed  by  neighboring  disci¬ 
plines  like  philosophy,  history,  and  public  law,  but 
also  because  it  requires,  more  than  others,  a 
“breathing  space”  of  freedom  of  thought  and 
expression  that  is  not  voluntarily  granted  by  most 
authoritarian  regimes  and  that  has  developed 
worldwide  only  with  decolonization  and  recent 
waves  of  democratization. 

This  introduction  surveys  the  discipline  of 
political  science,  beginning  with  an  analysis  of 
politics  itself.  Two  important  definitions  of  politics 
are  discussed  in  detail  here.  First,  politics  can  be 
viewed  as  a  means,  either  for  maintaining  social 
harmony  or  for  achieving  a  supreme  good,  as  in 
some  religious  conceptions  of  the  state.  Second, 
politics  has  been  understood  as  an  instrument  by 


which  a  central  institution  or  a  legitimate  author¬ 
ity  exercises  power.  The  next  section  of  the  intro¬ 
duction  examines  the  epistemological  foundations 
of  the  discipline  of  political  science.  The  third  sec¬ 
tion  traces  the  growth  of  political  science  during 
the  20th  century  and  examines  its  evolving  rela¬ 
tionships  to  other  social  sciences  and  to  other 
fields,  most  notably  law  and  philosophy.  The  final 
section  of  the  introduction  describes  more  recent 
developments  and  perspectives  in  political  science, 
as  it  has  become  a  dynamic  discipline  with  its  own 
identity  and  with  participation  from  political  sci¬ 
entists  around  the  globe. 

Politics 

Politics  as  a  Special  Sphere 

The  concept  of  politics  carries  different  mean¬ 
ings.  It  can  be  considered  to  be  an  art  (scholars  are 
“studying  politics”);  an  activity  (one  can  “play 
politics”  in  one’s  office,  in  one’s  club,  even  in  one’s 
family);  a  profession  (some  “go  into  politics”);  or 
a  function  (“local  politics,”  “national  politics”). 
Most  political  scientists  consider  the  first  meanings 
as  derived  from  the  last  one,  even  as  metaphoric, 
while  some  others  have  a  wider  and  more  abstract 
conception  that  is  broader  than  that  of  function. 
Politics  also  has  been  understood  as  both  to  polity, 
which  refers  to  an  organization  (a  state,  a  regime 
and  its  constitution)  and  to  policy,  which  refers  to 
a  system  of  political  decisions  and  specific  sub¬ 
fields  (like  health,  education,  foreign  relations) 
through  organizations  act  to  carry  out  their  func¬ 
tions.  In  a  comprehensive  perspective,  politics  is 
thus  considered  to  be  linked  to  a  function,  a 
system,  an  action,  and  a  behavior. 

All  of  them  are,  however,  connected  to  a  special 
dimension  of  the  history  of  humankind.  Even  if 
some  scholars  object  that  some  societies  ignored 
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politics  (Clastres,  1975),  most  anthropologists 
consider  politics  as  a  constant  of  the  human  condi¬ 
tion.  In  the  first  part  of  this  entry,  this  constant  will 
be  grasped  in  its  various  definitions,  and  then  it 
will  be  inserted  into  the  general  social  order.  The 
following  sections  examine  definitions  of  this  con¬ 
stant  as  a  function  and  as  an  instrument. 

Politics  as  a  Function 

Politics  can  be  conceived  as  a  contribution  to 
the  social  adventure,  as  a  function  of  the  social 
order,  or,  quite  differently,  as  a  distinctive  instru¬ 
ment,  a  special  way  of  action.  The  first  approach 
is  more  classic  and  deeply  rooted  in  the  various 
philosophical  traditions  that  were  elaborated 
around  the  world,  while  the  second  one  is  modern, 
related  to  the  rise  of  positivist  theory  in  the  social 
sciences.  In  this  section,  we  consider  two  ways  in 
which  politics  can  be  seen  as  a  function. 

Politics  as  a  Function:  Promoting  Social 

Harmony  or  Supreme  Good? 

Many  philosophers  have  located  politics  in  the 
art  of  coexistence.  If  human  beings  are  selfish  by 
nature,  as  they  are  often  conceived,  but  must  live 
and  grow  up  together,  to  create  peaceful  coexis¬ 
tence  is  obviously  one  of  the  main  functions  of  the 
polis.  As  such,  politics  should  be  considered  as  the 
permanent  invention  of  the  polis  (city),  as  the  con¬ 
struction  of  each  social  unit  that  aims  to  keep  peo¬ 
ple  together  on  a  permanent  basis.  This  point  was 
already  made  by  Plato,  who  considered  politics  as 
the  art  of  organizing  social  harmony.  We  also  find 
it  in  other  traditions.  Islam  conceives  of  politics  as 
a  weakly  differentiated  function  that  aims  to  over¬ 
come  tribal  fragmentation  through  the  principle  of 
unity  (taivbid).  As  such,  tawbid  will  be  achieved 
through  the  absolute  Unity  of  God,  and  so  politics 
cannot  be  entirely  accomplished  without  religion: 
Politics  cannot  be  conceived  as  a  differentiated  struc¬ 
ture,  but  it  is  obviously  a  social  function.  The  same 
principle  can  be  found  in  Hindu  writings.  The 
Artbasastra  (3rd  century  BC)  and  the  Manusmriti 
(2nd  century  BC)  were  written  during  periods  of 
decay  and  so-called  evil,  which  implied  starvation, 
violence,  and  chaos,  while  the  Mababarata  covered 
a  much  longer  period  (from  1st  millennium  BCE  to 
the  5th  century  BCE).  Here,  politics  is  presented  as 
an  absolute  requisite  for  keeping  peace  and  order, 


even  if  the  tradition  oscillates  between  divine 
invention  in  politics  ( Manusmriti )  and  its  contrac¬ 
tual  origin  ( Mababarata ).  In  its  turn,  the  Buddhist 
vision  stresses  the  entropy  of  the  world  that  leads 
to  inequality,  sexual  division,  property,  and  thus  to 
conflicts  and  lack  of  safety.  The  orientation  of  this 
reform  of  Hinduism  prompts  human  beings  to 
choose  a  king  as  a  guarantor  of  the  social  order. 
Similarly,  Confucius  pointed  out  that  men  need  a 
ruler  for  preventing  disorder,  disturbance,  and 
confusion. 

This  first  function,  promoting  social  harmony, 
obviously  shaped  political  philosophy  to  a  large 
extent  up  to  the  present  day.  Social  contract  theory 
clearly  emanated  from  this  postulate,  in  the  Islamic 
tradition  ( mithaq ,  bay’a)  as  well  as  during  the 
European  Enlightenment.  The  Artbasastra  antici¬ 
pated  Thomas  Hobbes’s  vision  of  the  state  of 
nature,  when  it  described  the  lack  of  politics  as 
resulting  in  evil  and  vices,  or  when  it  mentioned  the 
fable  of  the  big  fish  that  will  eat  the  small  one 
(Matsyanyaya) .  The  functional  dimension  of  poli¬ 
tics,  as  the  art  of  coexistence  and  “maintaining 
harmony,”  can,  therefore,  be  considered  as  really 
transcultural  and  common  to  the  humanity’s  differ¬ 
ent  histories.  Here,  we  can  probably  locate  the  roots 
of  a  pluralist  vision  of  politics,  as  this  first  definition 
paves  the  way  for  a  plural  conception  of  the  city 
(polis)  where  people  do  not  necessarily  share  the 
same  interests,  the  same  beliefs,  or  the  same  ethnic 
characteristics.  Quite  the  opposite,  in  this  perspec¬ 
tive,  diversity  is  the  real  raison  d’etre  of  politics. 

However,  politics  also  claims  a  second  function, 
which  is  more  demanding  and  sophisticated.  Some 
philosophers  and  thinkers  are  going  further, 
beyond  the  invention  of  the  city,  pointing  to 
another  purpose:  Politics  is  supposed  to  lead  to  the 
path  of  righteousness,  to  promote  virtue,  and  to 
enable  humans  to  achieve  the  Supreme  Good. 
Thus,  Aristotle  conceived  of  politics  as  referring  to 
welfare  and  virtue.  The  city  must  be  constructed  as 
the  good  city.  Political  science  is  elevated  then  to 
something  much  more  demanding,  namely,  the 
“science  of  the  good  politics”  or  the  “science  of 
good  government.”  This  vision  can  be  found  in 
Islam  through  the  commitment  to  divine  law 
(sharia);  when  taken  to  its  extreme,  this  concep¬ 
tion  even  becomes  a  way  of  challenging  power 
holders  and  leads  to  a  political  inversion  in  which 
protest  is  a  more  important  political  activity  even 
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than  governing.  In  the  same  way,  the  Artbasastra 
describes  politics  as  promoting  peace  and  prosper¬ 
ity,  while  Buddhism  produces  an  ethic  of  human 
behavior.  Confucianism  also  gives  the  central  role 
to  the  virtue  that  humans  naturally  possesses,  but 
that  is  achieved  through  the  ruler  and  his  norms. 

These  two  functions  of  politics  convey  the  two 
faces  of  political  theory,  one  of  which  is  positivist, 
the  other  normative.  If  politics  is  only  the  science 
of  the  city,  it  is  first  of  all  a  bebauioralist  science.  If 
it  is  the  science  of  the  Supreme  Good,  its  normative 
orientation  is  dominant.  This  tension  has  partly 
been  overcome  by  an  instrumentalist  approach  to 
politics,  according  to  which  the  distinguishing 
nature  of  politics  has  to  be  found  in  the  instru¬ 
ments  used  for  running  the  city  instead  of  its  ulti¬ 
mate  goals. 

Politics  as  an  Instrument:  The  Science  of  Power? 

Power — in  Max  Weber’s  sense  as  the  ability  to 
achieve  your  interests  even  against  someone  else’s 
will,  that  is,  as  coercion — is  understood  here  as  the 
first  instrument  through  which  politics  operates. 
All  major  thinkers  claim  that  the  city  cannot  exist 
without  power,  no  matter  how  it  is  structured. 
There  is  thus  a  long  tradition  of  connecting  power 
and  politics,  in  which  political  science  is  assigned 
to  study  how  power  is  formed,  structured,  and 
shared  (Lasswell  &  Kaplan,  1950).  However,  the 
science  of  politics  and  a  science  of  power  are  not 
synonymous:  Is  every  kind  of  power  necessarily 
political,  for  example,  in  a  firm  or  a  club?  In  a 
broader  sense,  some  authors  state  that  politics  can 
be  played  in  an  office  or  inside  a  family,  but  this 
expression  is  merely  a  metaphor  when  conceived 
in  a  micro-social  order.  Conversely,  if  we  consider 
power  as  the  essence  of  politics,  we  have  to  opt  for 
a  wide  definition  of  power  that  includes  ideology, 
social  control,  and  even  social  structures,  and  thus 
deviates  from  a  vision  of  power  as  purely  coercive. 

For  this  reason,  politics  is  commonly  defined  as 
a  specific  kind  of  power;  either  it  is  held  by  a  cen¬ 
tral  institution,  such  as  a  state,  a  government,  a 
ruling  class,  or  it  is  used  by  a  power  holder  who  is 
considered  to  be  legitimate.  The  first  perspective 
approaches  politics  as  the  science  of  the  state  and 
implies  that  traditional  and  weakly  institutional¬ 
ized  societies  lack  the  centralized  power  that  is 
necessary  for  politics  to  exist.  The  second  promotes 


the  concept  of  legitimate  authority  as  the  real 
essence  of  politics  and  suggests  that  in  this  sense 
politics  is  more  evident  in  democracies  than  in 
than  authoritarian  or  totalitarian  systems. 

Some  visions  also  link  power  and  norms. 
Pushing  the  Aristotelian  definition  further,  they 
define  politics  as  the  set  of  norms  that  lead  to  the 
City  of  Good — either  the  City  of  God  or  the  City 
of  the  Philosopher.  This  radical  conception  is  to  be 
found  in  several  traditions:  that  of  Augustine, 
rather  than  Thomas  Aquinas,  in  the  Christian  cul¬ 
ture;  Puritanism  with  the  Reformation  and 
Calvinism;  radical  Islam  in  the  perspective  of  Ibn 
Taimyya.  This  conception,  however,  runs  the  risk 
of  drifting  into  totalitarianism  or  at  the  very  least 
depreciating  the  political  debate,  as  it  make  any 
kind  of  political  choice  impossible. 

The  connection  with  territory  is  also  frequently 
used  as  an  instrumental  approach  to  politics.  The 
Greek  tradition  paved  the  way  when  Aristotle 
stressed  the  difference  between  politics,  ethnos, 
and  oikos  (household).  If  politics  is  conceived  as 
the  coexistence  of  diverse  peoples,  it  denies  that 
there  is  such  a  thing  as  “natural  territory”  and 
implies  a  socially  constructed  territory  as  its 
required  arena.  That  is  why  Max  Weber  makes 
territory  a  key  element  of  his  definition  of  politics. 
For  Weber,  a  community  has  a  political  quality 
only  if  its  rules  are  granted  inside  a  given  territory: 
this  territorial  invention  hardly  fits  nomadic  societ¬ 
ies  or  even  a  number  of  traditional  ones  (Evans- 
Pritchard,  1940).  But  even  if  it  gets  close  to  a  state 
vision  of  politics,  it  emphasizes  the  role  of  plural¬ 
ism  and  diversity  inside  the  political  order.  In  a 
more  extensive  conception  of  the  spatial  dimen¬ 
sion  of  politics,  public  and  private  spheres  are 
opposed:  the  former  is  seen  as  the  natural  back¬ 
ground  of  political  debate,  while  the  latter  is  con¬ 
ceived  as  a  resistance  against  political  power  and 
its  penetrations  (Habermas,  1975).  We  also  find 
again  the  possible  opposition  between  religious 
and  secular  spheres  (other-worldly/this-worldly), 
and  even  the  disenchantment  with  the  world  as 
one  of  the  possible  sources  of  politics. 

After  going  through  all  these  definitions,  the 
criterion  of  social  coexistence  seems  to  be  the  most 
extensive  one,  and  probably  the  least  questionable. 
If  politics  is  everywhere  around  the  world  consid¬ 
ered  more  or  less  as  managing  social  harmony,  it 
can  clearly  be  conceived  as  the  opposite  of  some 
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other  classical  spheres  of  social  action  (politics  vs. 
social  life,  military,  administration,  etc.).  As  such, 
it  is  part  of  the  general  social  arena,  as  an  ordinary 
social  fact,  but  a  very  specific  one. 

Politics  in  the  Social  Division  of  Labor 

Here  we  face  a  contradiction  that  is  shaping  a 
serious  debate  among  political  scientists.  If  the 
conception  of  politics  as  an  ordinary  social  fact 
tends  to  prevail,  political  science  merges  with 
political  sociology  (see  below).  In  the  opposite  ver¬ 
sion,  the  latter  would  be  defined  as  a  part  of 
political  science,  sometimes  with  ambiguous  bor¬ 
derlines.  The  vagueness  and  the  mobility  of  the 
borderline  stem  from  different  factors:  the  diver¬ 
sity  of  the  great  theories  in  the  social  sciences, 
which  do  not  reflect  the  same  visions  of  politics 
and  which  are  torn  between  power  and  integra¬ 
tion;  the  historical  and  cultural  background  of 
politics,  which  is  shaping  different  kinds  of  lin¬ 
eages;  and  the  present  impact  of  globalization, 
which  is  probably  fueling  a  new  definition  of  poli¬ 
tics  that  is  increasingly  detached  from  concepts  of 
ethnicity  and  territoriality. 

Two  Traditions:  Power  Versus  Integration 

Max  Weber  is  obviously  considered  as  a  “found¬ 
ing  father”  by  both  sociologists  and  political  scien¬ 
tists.  He  clearly  promoted  a  political  vision  of 
sociology  when  he  developed  his  two  major  con¬ 
cepts  of  Macht  (power  as  coercion)  and  Herrschaft 
(power  as  authority).  Both  of  them  can  be  found  in 
the  very  first  steps  of  his  sociology  where  he 
defines  power  as  the  ability  of  one  actor  in  a  social 
relationship  to  modify  the  behavior  of  another, 
through  pressure,  force,  or  other  forms  of  domina¬ 
tion.  From  a  Hobbesian  perspective,  power  plays 
the  major  role  in  structuring  social  relationships, 
while  the  social  actors  strive  to  give  meaning  to 
this  asymmetrical  relationship  in  order  to  make  it 
just  and  acceptable,  thus  establishing  the  legiti¬ 
macy  of  those  with  power. 

In  modern  society,  the  state  plays  an  important 
role  and  politics  has  an  exceptional  status,  as  it  is 
theoretically  conceived  as  the  main  basis  of  social 
order.  This  conception  is  also  strongly  rooted  in 
the  Marxist  vision,  where  the  state  is  considered 
the  instrument  by  which  the  ruling  class  maintains 


its  domination,  as  the  bourgeoisie  does  in  the 
capitalist  mode  of  production.  Carl  Schmitt  also 
starts  from  a  Weberian  presupposition  in  linking 
politics  to  enmity.  By  contrast,  in  a  Durkheimian 
vision,  integration  is  substituted  for  power  as  the 
key  concept.  Politics  is  conceived  neither  as  an 
instrument  of  domination  nor  as  a  way  of  produc¬ 
ing  social  order;  rather,  it  is  a  function  by  which 
the  social  system  is  performing  its  integration. 
Obviously,  this  function  implies  institutions  and 
then  a  political  sphere,  including  state  and  govern¬ 
ment,  but  it  is  considerably  more  diffuse  and 
appears  to  be  produced  by  the  social  community 
and  its  collective  consciousness.  From  this  perspec¬ 
tive,  a  political  society  is  made  up  of  social  groups 
coming  together  under  the  same  authority.  Such  an 
authority  derives  from  the  social  community  and 
the  collective  consciousness;  it  is  constituted  by 
rules,  norms,  and  collective  beliefs,  which  are 
assimilated  through  socialization  processes.  As 
such,  politics  is  closely  related  to  social  integration 
and  is  supposed  to  strengthen  it  further.  That  is 
why  there  is  a  strong  correlation  between  a  grow¬ 
ing  division  of  labor,  from  the  increasing  political 
functions,  and  their  differentiation  from  the  social 
structures.  “The  greater  the  development  of  soci¬ 
ety,  the  greater  the  development  of  the  state” 
(Durkheim,  1975,  3,  p.  170).  Durkheim  contrasted 
“mechanical  solidarity”  arising  because  of  per¬ 
ceived  similarities  among  people  (e.g.,  in  work  or 
education)  from  “organic  solidarity”  arising  when 
people  are  doing  different  things  but  see  them¬ 
selves  as  part  of  an  interdependent  web  of  coop¬ 
erative  associations.  This  Durkheimian  vision  is  to 
be  found  later  in  the  functionalist  and  systemic 
concepts  of  political  science  as  elaborated  by 
Talcott  Parsons,  David  Easton,  Gabriel  Almond 
and  others,  but  also  in  the  socio-historical  tradi¬ 
tions,  which  attempted  to  link  the  invention  of 
politics  to  the  sociology  of  social  changes,  as  in  the 
work  of  Charles  Tilly  or  Stein  Rokkan.  It  is  also 
congruent  with  the  social  psychological  paradigm, 
which  tries  to  capture  politics  through  its  social 
roots,  such  as  socialization,  mobilization,  and 
behavioral  analysis. 

By  contrast,  state  and  power  are  the  real  sources 
of  a  Weberian  political  science.  Politics  is  no  longer 
a  function  of  the  division  of  labor,  but  has  defi¬ 
nitely  its  own  determinants.  Quite  the  opposite, 
social  history  is  considering  the  transformations  in 
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the  mode  of  government  and  more  precisely  the 
mode  of  domination.  Power  is  thus  conceived  as 
an  explanatory  variable  of  the  transformation  of 
societies  and  political  orders.  Such  a  vision  is  com¬ 
mon  among  those  approaches  of  political  science 
that  are  centered  on  power  politics,  the  role  of  the 
state,  or  the  nature  of  political  regimes  or  that  are 
focusing  on  political  institutions  and  the  condi¬ 
tions  of  their  legitimization. 

The  Diversified  Lineages  of  Politics 

Politics  is  thus  approached  in  different  ways, 
but  is  also  intrinsically  plural.  During  the  1970s, 
when  globalization  began  to  shape  the  world  and 
when  decolonization  was  completed,  both  history 
and  anthropology  incorporated  the  perspective  of 
politics  with  respect  to  plurality.  This  perspective 
on  plurality  also  challenged  the  mono-dimensional 
vision  that  had  been  promoted  by  developmental- 
ism  a  decade  earlier.  In  anthropology,  Geertz 
(1973)  pointed  out  that  politics  covered  several 
meanings  that  are  changing  along  historical  lines 
and  according  to  specific  cultures.  These  meanings 
are  socially  constructed  as  human  actors  encounter 
different  kinds  of  events,  challenges,  or  goals  and 
as  they  are  rooted  or  embedded  in  different  sorts 
of  economic  and  social  structures.  Politics  is 
understood  as  achieving  the  will  of  God  and  his 
law  in  Islam,  while  it  aims  to  manage  the  human 
city  in  this  world  according  to  the  Roman  Christian 
culture.  The  first  conception  was  fueled  by 
Muhammad’s  hijra  when  the  Prophet  left  Mecca 
because  of  opposition  to  his  teaching  and  went  to 
Medina  to  build  up  the  City  of  God.  The  second 
conception  was  shaped  by  the  Roman  experience 
of  religion,  which  survived  during  the  centuries  of 
the  Empire  and  had  again  to  survive  when  the  lat¬ 
ter  collapsed  during  the  fifth  century.  In  this  dra¬ 
matic  contrast,  politics  does  not  cover  the  same 
meaning,  as  it  is  differentiated  from  the  public 
sphere  and  oriented  toward  individuals  in  the 
Roman  tradition,  while  it  was  more  globally  con¬ 
structed  in  the  Muslim  tradition.  In  both  cultures, 
this  diversification  continued;  as  Geertz  mentions, 
politics  does  not  have  the  same  meaning  in 
Indonesia  and  in  Morocco,  two  Muslim  societies 
that  experienced  greatly  different  histories. 

For  that  reason,  politics  can  be  properly  defined 
only  when  the  definition  includes  the  meaning  that 


the  social  actors  usually  give  to  it.  This  cultural 
background  implies  a  huge  empirical  investigation, 
which  is  all  the  more  difficult  since  the  observer 
tends  to  view  things  through  his  or  her  own  con¬ 
cepts,  which  are  obviously  culturally  oriented.  The 
risk,  therefore,  is  high  to  consider  as  universal  a 
cultural  vision  of  politics,  which  shapes  the  para¬ 
digm  of  empirical  political  science.  Translations 
can  be  particularly  misleading  and  even  fanciful. 
For  instance,  the  Arabic  word  dawla  is  often  trans¬ 
lated  as  “state,”  whereas  their  meanings  are  hardly 
equivalent.  It  is  quite  impossible  to  convey,  through 
translation,  the  deep  cultural  gap  that  really 
implies  two  competing  visions  of  politics.  The  only 
way  of  going  ahead  is  to  deepen  the  anthropologi¬ 
cal  and  the  linguistic  investigations  in  order  to 
identify  distinctive  features  of  each  conceptualiza¬ 
tion  of  politics,  following  the  “thick  description 
method”  recommended  by  Geertz.  But  is  there  any 
end  to  this  “individualization”  of  politics?  To  be 
operative,  research  must  postulate  a  minimal  uni¬ 
versality  of  its  own  concepts  and  contain  the  risks 
of  “culturalism”:  It  has  to  keep  the  connection 
with  history  and  anthropology  while  remaining  in 
a  universalist  framework. 

A  Politics  of  Globalization 

This  dilemma  is  revived  and  even  stimulated  by 
the  globalization  of  the  world.  In  the  new  global 
order,  politics  is  no  longer  limited  or  contained  by 
the  territoriality  principle.  It  becomes  reinvented 
beyond  the  classical  coexistence  of  sovereign  cities. 
Politics  cannot  be  conceived  as  a  simple  addition 
of  social  contracts,  as  it  was  in  the  Westphalian 
paradigm.  This  challenge  is  first  posed  to  the  “real¬ 
ist”  theory  of  international  relations,  questioning 
the  absolute  opposition  between  “inside”  and 
“outside,”  or  “domestic  politics”  and  “interna¬ 
tional  politics.”  The  latter  is  no  longer  confined  to 
the  dialogue  of  sovereigns  and  has  destroyed  the 
traditional  categories  and  criteria  of  politics.  After 
all,  is  there  a  “global  covenant,”  as  Robert  Jackson 
(2000)  argues,  that  totally  reshapes  the  construc¬ 
tion  of  politics? 

The  hypothesis  that  competition  among  nation¬ 
states  can  be  understood  as  parallel  to  that  within 
nation-states  supported  the  extension  of  the  con¬ 
cept  of  politics  to  the  international  sphere.  The 
idea  of  power  politics  was  projected  into  the 
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international  arena  in  order  to  stress  that  interna¬ 
tional  politics  referred  to  the  classical  grammar: 
States,  like  political  actors,  were  competing 
according  to  their  own  interests  and  were  primar¬ 
ily  concerned  with  their  ability  to  dominate  other 
states,  or,  at  least,  to  contain  the  power  of  the  oth¬ 
ers.  Morgenthau  (1948)  defined  international 
politics  as  the  “struggle  for  power,”  power  as  “the 
control  over  the  minds  and  actions  of  other  men,” 
and  political  power  as  “the  mutual  relations  of 
control  among  the  holders  of  public  authority  and 
between  the  latter  and  the  people  at  large”  (p.  27). 

Although  this  conception  is  clearly  rooted  in  a 
Weberian  approach  to  politics,  it  does  not  belong 
only  to  the  past.  But  it  neither  covers  nor  exhausts 
all  the  political  issues  at  stake  in  the  new  configu¬ 
ration  of  the  international  arena.  First  of  all,  as 
sovereignty  is  fading,  the  proliferation  of  transna¬ 
tional  actors  no  longer  restricts  international  poli¬ 
tics  to  a  juxtaposition  of  territorial  nation-states. 
Second,  power  and  coercion  are  losing  their  effi¬ 
ciency  as  influence  and  social  relationships  are 
getting  more  and  more  performance  oriented. 
Third,  globalization  and  the  growing  international 
social  community  are  shaping  common  goods, 
creating  a  kind  of  community  of  humankind; 
human  beings  are  then  creating  “a  political  dia¬ 
logue  that  can  bridge  their  differences  .  .  .  without 
having  to  suppress  them  or  obliterate  them” 
(Jackson,  2000,  p.  16).  We  are  here  rediscovering 
Aristotle  when  he  claimed  that  men  need  each 
other  for  their  own  survival. 

Nevertheless,  no  one  would  assert  as  yet  the 
complete  achievement  of  an  international  society 
or  an  international  community.  International  poli¬ 
tics  remains  an  unstable  combination  of  references 
to  power  politics  and  to  international  social  inte¬ 
gration:  It  then  confronts  the  vision  of  politics  as 
coexistence  among  diversity.  It  goes  back  to  the 
idea  of  harmony,  but  without  a  completed  con¬ 
tract,  to  the  hypothesis  of  a  global  city  without  a 
central  government,  to  the  assertion  of  common 
norms  without  binding  measures.  This  combination 
is  at  the  core  of  the  English  School  of  international 
relations  that  refers  to  the  “anarchical  society.” 
But  it  is  also  close  to  the  French  vision  of  an  inter¬ 
national  solidarity.  In  the  end,  politics  gets  closer 
and  closer  to  a  functional  vision  of  managing 
social  diversity  in  order  to  make  it  compatible  with 
the  need  for  survival. 


Epistemological  Building 
Blocks  of  Political  Science 

Some  major  building  blocks  of  political  science  can 
be  identified  that  help  characterize  some  common 
elements  in  existing  approaches,  but  also,  and  per¬ 
haps  more  important,  enable  us  to  locate  these 
positions  and  their  differences  more  precisely  with 
regard  to  the  major  epistemological  foundations. 
The  first  of  these  building  blocks  concerns  the  mul¬ 
tidimensionality  of  our  subject  matter;  the  second, 
its  plastic  and  malleable  character  and  the  result¬ 
ing  self-referential  problems;  the  third  refers  to  a 
systems  perspective  of  politics;  and  the  fourth  to 
the  linkages  between  different  levels  (micro-, 
meso-,  macro-)  of  political  (and  more  generally 
social)  analysis. 

Multi-Dimensionality 

The  most  basic  distinctions  of  our  discipline, 
which  distinguish  it  in  some  important  respects 
from  the  “natural”  sciences,  concern  the  dimen¬ 
sions  of  its  subject  matter.  As  in  nature,  there  are 
certain  hard  “objects”  such  as  political  institutions 
and  social  structures,  which  can  be  identified  and 
which  are  “tangible”  and  observable  in  certain 
ways.  In  addition,  however,  there  is  a  “subjective” 
dimension  in  which  such  objects  are  perceived  by 
individuals  and  groups  and  translated  into  concrete 
actions.  Such  perceptions  themselves  are  shaped  by 
a  number  of  psychological,  social,  or  other  factors. 
This  distinction  is  commonly  accepted  and  runs 
through  the  history  of  philosophy  from  antiquity  to 
the  present  day  and  concerns  all  sciences  of  human¬ 
kind,  including  medicine.  There,  distinctions 
between  body  and  mind  (or  consciousness)  and  the 
subsequent  divisions  into  subdisciplines  such  as 
anatomy  and  psychology  are  commonplace.  Sim¬ 
ilarly,  the  fact  that  there  are  possible  interactions 
between  these  dimensions  is  well  accepted,  even 
though  in  medicine  some  of  these  psychosomatic 
relationships  are  still  not  well  researched.  The  third 
dimension,  the  “normative”  one  that  concerns 
ethical  judgments  of  “good”  or  “bad”  actions  and 
behavior,  is  more  problematic.  In  medicine,  again, 
some  ethical  norms  have  been  generally  accepted 
since  the  time  of  Hippocrates,  but  debates  continue 
about,  for  example,  when  exactly  human  life 
begins  or  ends,  and  what  the  respective  theologi¬ 
cal  or  philosophical  justifications  are  for  such 
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positions.  In  philosophy,  this  “three-dimensional¬ 
ity”  of  human  existence  has  also  been  elaborated 
by  Immanuel  Kant,  for  example,  in  his  Critique  of 
Pure  Reason  (1787/1 956,  p.  748  ff.). 

A  graphical  representation  of  these  dimensions 
can  be  rendered  in  the  following  Figure  1  (where 
the  dotted  line  represents  a  “holistic”  position): 
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Figure  I  Dimensions  of  Human  Existence 

Source:  Dirk  Berg-Schlosser. 

The  crux  of  the  matter  really  concerns  problems 
of  distinguishing  such  dimensions  and  their  interac¬ 
tions  not  only  analytically  but  also  in  actual  prac¬ 
tice,  and  controversies  about  normative,  ontologi- 
cally  based  justifications  and  their  respective  episte¬ 
mological  and  methodological  consequences  per¬ 
sist.  Here,  we  cannot  go  into  these  debates  in  any 
detail,  but  we  find  it  useful  to  locate  the  major 
emphases  of  the  current  meta-theoretical  positions 
in  political  science  with  the  help  of  such  distinc¬ 
tions.  Thus,  the  major  ontological  approaches  have 
their  basis  in  the  normative  dimension  ranging  from 
Plato  to  Eric  Voegelin  or  Leo  Strauss,  but  also  con¬ 
cern  attempts  in  linguistic  analysis  (e.g.,  Lorenzen, 
1978),  or  communications  theory  (Habermas, 
1981).  This  also  applies  to  non-Western  traditions 
such  as  Confucian  (Shin,  1999),  Indian  (Madan, 
1992),  or  sub-Saharan  African  (Mbiti,  1969)  ones. 

Sharply  opposed  to  such  foundations  of  politi¬ 
cal  theory  are  critical-dialectical  or  historical- 
materialist  positions  in  the  tradition  of  Karl  Marx 
and  his  followers.  There,  the  object  dimension  of 
the  modes  of  production  and  re-production  of 
human  existence  is  the  basic  one  from  which  the 
others  are  derived.  Thus,  the  objective  social  exist¬ 
ence  determines  the  subjective  consciousness  and 


the  political  and  normative  superstructures.  On 
this  position,  the  teleological  theory  of  history  of 
Marx  and  his  followers  from  the  early  beginning 
until  the  classless  society  and  its  peaceful  end  is 
based  as  well. 

The  third  major  meta-theoretical  position,  a 
behavioral  or  behavioralist  one,  takes  the  subjec¬ 
tive  dimension  as  its  starting  point,  expressing  a 
position  of  methodological  or  phenomenological 
individualism  (for  the  use  of  these  terms,  see 
Goodin  &  Tilly,  2006,  p.  10ff.).  Then  subjective 
perceptions  and  subsequent  actions  of  human 
beings  are  what  really  matters.  These  shape  social 
and  political  life.  This  position  has  been  most 
influential  in  election  studies,  for  example,  but  also 
concerning  some  aspects  of  political  culture 
research.  In  a  somewhat  broader  perception,  both 
subjective  and  objective  dimensions  and  their 
interactions  are  considered  by  empirical-analytical 
approaches,  but,  from  a  positivistic  point  of  view, 
no  normative  judgments  can  be  made  on  this  basis. 
Long-lasting  controversies  concerning  this  position 
go  back  to  Max  Weber  and  his  followers  but  are 
also  reflected  in  more  recent  debates  between  Karl 
Popper  and  Jurgen  Habermas,  for  example  (see 
Adorno  et  ah,  1969). 

These  basic  meta-theoretical  positions  and  their 
variations  remain  incompatible.  Similarly,  whether 
these  dimensions  can  in  actual  fact  be  separated  or, 
by  necessity,  always  go  together  from  a  holistic 
perspective  remains  controversial.  The  latter  posi¬ 
tion,  in  contrast  to  Kant,  is,  for  example,  repre¬ 
sented  by  G.  W.  F.  Hegel,  but  also  by  Marx  and 
some  of  his  followers  (e.g.,  Lukacs,  1967).  In  the 
same  way,  epistemological  positions  based  on  reli¬ 
gion,  including  Buddhism  and  Confucianism,  per¬ 
ceive  these  dimensions  in  a  holistic  manner.  From 
a  more  pragmatic  perspective,  it  seems  that  the 
fundamentalist  debates  about  such  matters  have 
subsided  in  the  last  few  decades  and  most  political 
or  social  scientists  just  agree  to  disagree  about  such 
basic  ontological  or  religious  positions  and  their 
respective  justifications.  Nevertheless,  Figure  1 
may  help  better  locate  such  positions  and  to  put 
some  conceptual  order  into  these  controversies. 

The  Plastic  Matter  of  Political  Science 

As  a  result  of  the  change  from  Newtonian  phys¬ 
ics  with  its  deterministic  relationships  to  quantum 
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theory  and  probabilistic  relations  in  nuclear  phys¬ 
ics,  Popper  (1972)  has  coined  the  metaphor  of 
“clouds  and  clocks.”  Clocks  represent  Isaac 
Newton’s  deterministic  world,  as  in  astronomy,  for 
example,  where  the  movements  of  stars  and  plan¬ 
ets  or  the  next  solar  eclipse  can  be  predicted  (or 
retro-dicted)  with  clocklike  precision.  Clouds,  by 
contrast,  constitute  a  very  elusive  substance,  the 
structures  and  regularities  of  which  cannot  easily 
be  grasped  over  a  somewhat  longer  period  even 
today  by  the  most  advanced  computers  of  meteor¬ 
ologists  and  their  satellite-based  data.  Between 
these  two  extremes,  however,  which  should  be 
perceived  as  the  opposite  poles  of  a  continuum 
rather  than  mutually  exclusive  positions,  there  is  a 
plastic  matter  that  is  malleable  in  the  course  of 
time  and  that  is  neither  perfectly  determined  nor 
subject  to  pure  chance. 

In  an  important  essay,  Gabriel  Almond  and 
Stephen  Genco  (1977)  have  transferred  this  con¬ 
cept  to  the  social  sciences  and  politics.  They  state 
that 

the  implication  of  these  complexities  of  human 
and  social  reality  is  that  the  explanatory  strategy 
of  the  hard  sciences  has  only  a  limited  application 
to  the  social  sciences.  .  .  .  Thus,  a  simple  search 
for  regularities  and  lawful  relationships  among 
variables — a  strategy  that  has  led  to  tremendous 
success  in  the  physical  sciences — will  not  explain 
social  outcomes,  but  only  some  of  the  conditions 
affecting  those  outcomes,  (p.  493,  emphasis 
added) 

The  deductive  subsumption  of  individual  events 
under  “covering  laws”  in  Carl  Hempel’s  (1965) 
sense,  according  to  which  claims  about  individual 
events  can  be  derived  deductively  from  premises  that 
include  a  scientific  law,  thus  is  not  possible  for  the 
most  part.  In  addition,  factors  of  human  choice  and 
action  plus,  possibly,  some  elements  of  pure  chance 
in  certain  conjunctures  also  have  to  be  considered. 

As  a  consequence,  we  have  to  be  more  modest 
in  our  claims  about  the  precision  of  causal  rela¬ 
tionships,  the  generalizability  of  regularities,  and 
the  universality  of  theories.  At  best,  therefore,  only 
theories  located  more  precisely  in  time  and  space — 
what  Robert  K.  Merton  called  “medium-range 
theories” — seem  to  be  possible  for  most  practical 
purposes.  Such  a  view  also  corresponds  with  a 


position  already  expressed  by  Aristotle,  who 
located  politics  in  an  intermediate  sphere  between 
the  necessary,  where  strict  science  can  be  applied, 
and  the  realm  of  pure  chance,  which  is  not  acces¬ 
sible  for  scientific  explanations. 

Such  distinctions  are  illustrated  in  Figure  2: 


Subject  Matter  of  Social  Sciences 


Figure  2  Degree  of  Determination  ofTheories 

Source:  Dirk  Berg-Schlosser. 


Again,  the  full  implications  of  such  a  perspec¬ 
tive  cannot  be  discussed  here,  but  this  figure 
should  be  helpful,  once  more,  to  locate  some  of  the 
“harder”  and  some  of  the  “softer”  approaches  in 
our  discipline  along  this  spectrum.  On  the  whole, 
we  would  agree  with  Almond  and  Genco’s  conclu¬ 
sion  that 

the  essence  of  political  science  ...  is  the  analysis 
of  choice  in  the  context  of  constraints.  That 
would  place  the  search  for  regularities,  the  search 
for  solutions  to  problems,  and  the  evaluation  of 
these  problems  on  the  same  level.  They  would  all 
be  parts  of  a  common  effort  to  confront  man’s 
political  fate  with  rigor,  with  the  necessary 
objectivity,  and  with  an  inescapable  sense  of 
identification  with  the  subject  matter  which  the 
political  scientist  studies,  (p.  522) 

The  last  point  also  leads  to  the  next  differentia 
specified  of  the  social  sciences  as  compared  to  the 
naturalist  sciences  and  their  distinct  epistemology. 

Self-Referential  Aspects 

This  sense  of  identification  also  can  be  seen  in 
different  ways.  First  of  all,  it  means  that  as  human 
and  social  beings  we  are  inevitably  part  of  the  sub¬ 
ject  matter  we  are  studying.  Even  if  we  attempt  to 
detach  ourselves  as  much  as  possible  from  the  object 
under  consideration  some  subjective  influences  on 
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our  perception  remain.  These  can  be  analyzed  by 
psychology,  the  sociology  of  knowledge  to  discern 
our  (conscious  or  unconscious)  “interests”  in  such 
matters,  and  so  on,  but  some  individual  “coloring” 
of  our  lenses  seems  inevitable.  Therefore,  a  certain 
“hermeneutic  circle,”  which  should  be  made  con¬ 
scious  and  explicit  in  the  interactions  with  others, 
remains  (Moses  &  Knutsen,  2007,  Chapter  7). 

However,  this  limitation  can,  again  in  contrast 
to  naturalist  perceptions  of  science,  be  turned  to 
one’s  advantage.  As  human  beings  we  can  empa¬ 
thize  with  each  other  and  can  intersubjectively,  if 
not  objectively,  understand  and  interpret  the  mean¬ 
ing  of  each  other’s  thoughts  and  actions.  This  is 
even  more  the  case  when  we  are  trained  as  social 
scientists  in  a  common  methodology  and  scientific 
language.  This  latter  point  also  distinguishes  the 
perception,  level  of  information,  and  theoretical 
interpretation  of  a  political  scientist  from  the 
“man  (or  woman)  in  the  street”  talking  politics,  in 
the  same  way  that  a  meteorologist  has  a  different 
knowledge  of  what  is  happening  in  the  atmosphere 
compared  to  the  daily  small  talk  about  the  weather. 
Nevertheless,  such  inevitable  subjectivity,  which  is 
also  historically  and  culturally  conditioned,  opens 
the  way  to  more  pluralist  interpretations  and 
meanings.  Constructivist  approaches,  as  contrasted 
to  naturalist  ones,  can  dig  deeper  in  certain  ways 
into  this  subjectivity  and  the  plurality  of  meanings 
(cf.,  e.g.,  Foucault,  1970). 

Two  more  points  concerning  our  identification 
with  the  subject  matter  and  our  self-referential 
position  within  it  must  be  mentioned.  Being  part  of 
the  substance,  we  can  also,  consciously  or  uncon¬ 
sciously,  act  upon  it.  Thus,  self-fulfilling  or  self- 
defeating  prophecies  become  possible  as  feedbacks 
between  the  interpretation  or  even  just  personal 
opinion  of  an  important  actor  or  social  scientist 
whose  authority  in  a  certain  sphere  has  become 
acknowledged  in  the  matter  he  is  dealing  with. 
This  frequently  occurs  when  some  “analysts”  give 
their  opinion  on  probable  developments  of  the 
stock  exchange  or  currency  rates  and  many  people 
follow  suit.  This  also  applies  to  electoral  predic¬ 
tions  with  respective  bandwagon  and  underdog 
effects. 

Finally,  being  part  of  our  world  and  being  able, 
to  some  extent,  to  act  on  it,  also  raises  the  question 
of  social  and  political  responsibility.  This  brings  us 
back  to  the  normative  side  of  politics  with  which 


we  inevitably  have  to  deal,  self-consciously  and 
being  aware  of  possible  consequences.  In  this 
respect,  too,  a  recent  constructivist  turn  in  the 
theory  of  international  politics,  in  a  somewhat 
more  specific  sense  of  the  term,  has  led  to  the 
broader  discussion  and  possible  acceptance  of 
more  universal  norms. 

A  Systems  Perspective 

Within  this  multidimensional,  malleable,  and 
dynamic  universe  more  specific  political  elements 
can  be  identified.  One  difficulty  in  this  respect, 
again,  lies  in  the  contrasting  meta-theoretical  posi¬ 
tions  and  their  perspectives  on  politics  (see  the  first 
section  above).  In  a  more  abstract  way,  politics  can 
also  be  conceived  as  the  regulating  mechanism  in 
large-scale  modern  societies.  Easton  (1965)  thus 
defines  politics  as  “the  authoritative  allocation  of 
values”  in  society  and  the  forces  shaping  these  pro¬ 
cedures.  In  this  process,  different  elements  interact  in 
a  systemic  way  regulating  conflicts.  This  mechanism 
can  be  conceived  like  a  thermostat  with  the  respec¬ 
tive  inputs  and  outputs  connected  by  an  effective 
feedback  procedure  in  a  cybernetic  sense  (see  also 
Deutsch,  1963).  Such  relationships  can  be  illustrated 
in  a  simplified  system  model  (see  Figure  3). 

This  system  model  should  not,  however,  be 
equated  with  systems  theory  in  a  more  demanding 
sense  (e.g.,  Luhmann,  1984).  Thus,  such  systems 
need  not  necessarily  be  in  equilibrium  and  they 
may  also  explode  or  implode  as,  in  fact,  they  did 
in  Communist  Eastern  Europe. 

Nevertheless,  such  a  model  is  again  helpful 
to  locate  the  major  subdivisions  of  politics  (and 
political  science),  which  also  constitute  the  major 
subsections  of  this  encyclopedia  and,  in  fact,  many 
political  science  departments  or  national  associa¬ 
tions.  The  bottom  square  includes,  in  a  broader 
sense,  the  fields  of  political  sociology  and,  when  this 
is  treated  separately,  political  economy.  The  square 
on  the  left-hand  side  represents  political  sociology  in 
a  narrower  sense  of  the  term  (organized  interest 
groups,  political  parties,  etc.).  The  top  square 
reflects  the  institutional  side  (involving  a  possible 
separation  of  powers,  etc.)  but  also  questions 
of  governance  often  including  the  realm  of  public 
policies  and  public  administration  on  the  right-hand 
side.  All  this  is  embedded  in  the  international 
system  concerning  interactions  with  the  outside 
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Figure  3  Simplified  System  Model 

Source:  Dirk  Berg-Schlosser. 


world  both  of  state  and  society  as  the  field  of  inter¬ 
national  politics  and,  in  a  more  limited  sense, 
international  political  economy.  The  arrows  of 
such  interactions  can  go  in  both  directions.  The 
systematic  comparison  of  such  systems  or  some 
subfields  is  the  realm  of  comparative  government. 
Overall  theoretical  (and  philosophical)  implica¬ 
tions  are  the  concern  of  political  theory,  and  the 
respective  methods  and  analytic  techniques  applied 
constitute  the  subfield  of  political  methodology. 

Linkages  Between  Levels  of  Analysis 

A  final  building  block  to  be  considered  here 
concerns  the  links  between  macro-aspects  of  entire 
political  systems  and  their  relationship  with  the 
micro-world  of  individual  citizens  and  the  meso- 
level  of  organisations  in  between.  For  this  purpose, 
what  has  been  dubbed  “Coleman’s  bathtub” 
(Coleman,  1990,  p.  8)  is  most  helpful.  Here,  a 
given  objective  (structural)  situation  at  the  macro¬ 
level  (on  the  upper-left-hand  side  in  Figure  4)  can 
be  linked  to  the  micro-level  of  individual  subjective 
perceptions  and  values,  which  are  then  translated 
into  concrete  actions,  possibly  aggregated  on  the 
meso-level,  and  then  leading  to  the  outcome  on  the 
macro-level  to  be  explained  (upper-right-hand 
side).  This  relationship  is  illustrated  in  Figure  4. 


It  is  important  to  note  at  this  place  that  we  do 
not  imply  for  the  individual  actors,  as  is  done  for 
example  in  economics  and  rational  choice  theory,  a 
specific  logic  of  selection,  as,  for  example,  maxi¬ 
mizing  a  person’s  material  well-being.  Such  very 
restrictive  assumptions  of  “homo  oeconomicus”  or 
even  “homo  sociologicus”  (Dahrendorf,  1977) 
only  rarely  apply  in  political  science,  where  usually 
a  much  wider  range  of  choices  exists,  even  if  some 
of  which  may  appear  as  “irrational”  to  others  (for 
example,  strongly  felt  ethnic  or  religious  identities). 

The  purpose  here,  again,  rather  lies  in  the  pos¬ 
sibility  to  locate  various  approaches  and  their 
respective  assumptions  in  such  a  scheme  and  to 
show  the  plurality  of  concepts  that  can  be  integrated 
here,  but  keeping  them  in  a  coherent  relationship. 
Hartmut  Esser  (1993,  p.  23  f f . ) ,  for  example,  has 
extended  possible  assumptions  at  the  micro-level 
to  include  “restricted,  resourceful,  evaluating, 
expecting,  maximingmen”  (RREEMM)  or  women, 
and  even  further  assumptions  about  conflicting 
“identifying”  (with  some  collective  entities)  or 
“individualizing”  attitudes  (RREEIIMM)  or  simi¬ 
lar  ones  may  be  added.  For  the  logic  of  the  situa¬ 
tion  also  framing  procedures  play  a  role  where 
individual  perceptions  are  shaped  by  the  social 
milieus  of  one’s  childhood  and  later  environment 
(see  also  D ’Andrade,  1995).  The  point  here  is  to 
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Figure  4  Linking  Levels  of  Analysis 

Sources:  Adapted  from  Coleman  (1990)  and  Esser  (1993,  p.  98). 


show  that  in  this  way  given  historical  and  tradi¬ 
tional  constraints  at  the  macro-level  terms  can  be 
meaningfully  linked  to  individual  and,  at  the 
aggregated  level,  collective  political  action.  Which 
particular  historical,  cultural,  or  other  factors 
condition  these  choices  in  any  given  situation  can 
be  left  open  at  this  place,  leaving  room,  again,  for 
a  plurality  of  cultural  and  theoretical  perspectives. 

The  Need  for  a  Reflective  Pluralism 

As  this  overview  has  shown,  there  are  some 
basic  building  blocks,  which  can  be  usefully 
employed  in  a  variety  of  ways  for  locating  differ¬ 
ent  epistemological  positions  and  historical- 
cultural  traditions  in  political  science  and  similar 
fields.  In  this  way,  it  at  least  becomes  clearer 
where  (and  perhaps  also  why)  certain  contending 
positions  actually  differ.  We  do  not  intend  to  “har¬ 
monize”  these  positions.  They  all  have,  to  varying 
degrees,  their  respective  strengths  and  weaknesses, 
and  no  coherent,  well-integrated  theoretical  build¬ 
ing  is  constructed  here  with  these  blocks.  That 
may  even  not  be  desirable,  leaving  some  room  to 
agree  to  disagree  about  some  basic  issues  and  per¬ 
spectives.  What  is  desirable  instead  is  to  elevate 
our  consciousness  and  our  way  to  deal  with  such 
controversies  to  a  level  of  reflective  pluralism, 
where  not  just  anything  goes,  but  where  contend¬ 
ing  epistemologies  and  approaches  can  be  brought 


into  a  fruitful  interdisciplinary,  intercultural,  and, 
possibly  even,  meta-theoretical  dialogue. 

As  already  mentioned,  political  science  has 
always  been  characterized  by  a  diversity  of  con¬ 
tending  meta-theoretical  positions,  paradigms,  and 
approaches.  In  Europe,  in  the  last  century  various 
strands  of  normative-ontological,  Marxist,  and 
empirical-analytical  persuasions  have  been  at  the 
forefront  (for  such  and  similar  terms  see,  e.g., 
Easton,  Gunnell,  &  Graziano,  1991;  Quermonne, 
1996).  For  several  decades  in  the  United  States, 
“behavioralist”  positions  and,  more  recently, 
“rational”  and  “public  choice”  approaches  have 
dominated  (cf.  the  influential  volumes  by  King, 
Keohane,  &  Verba,  1994,  and  Brady  &  Collier, 
2004).  In  other  parts  of  the  world,  different  theo¬ 
logical,  philosophical,  and  epistemological  tradi¬ 
tions  have  influenced  the  (more  recent)  emergence 
of  political  science  there.  Altogether,  thus  a  great 
variety  of  contending  positions,  which  have  been 
summarized  as  “naturalist,”  “constructivist,”  and 
“realist,”  can  be  observed  (Moses  &  Knutsen, 
2007). 

Relations  With  Other  Social  Sciences 

Pluralism  and  different  traditions  in  political  sci¬ 
ence  also  emerge  when  we  change  perspective  and 
focus  more  precisely  on  its  relationships  with  other 
social  sciences.  This  section  explores  three  sources 
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of  political  science  as  it  differentiated  itself  from 
other  disciplines. 

Evolution  of  Political  Science 

When  looking  at  the  period  after  World  War  II, 
the  basic  difference  in  the  traditions  of  different 
countries  and  areas  of  the  world  is  between  a  plu¬ 
ral  form  (political  sciences)  that  is  more  common 
in  Europe  and  encompasses  the  singular  (political 
science).  Conversely,  in  the  tradition  of  United 
States  the  singular  form  (political  science)  includes 
the  plural  (political  sciences).  In  the  singular,  there 
is  a  pluralist  political  science  where  empirical 
analysis  is  dominant,  but  also  other  perspectives 
(law,  history,  philosophy)  are  present.  However,  be 
it  plural  or  singular,  during  the  last  decades  empir¬ 
ical  political  science  has  increasingly  differentiated 
itself  from  sociology,  and  above  all  from  political 
sociology,  public  law,  political  philosophy,  and 
contemporary  history.  Actually,  in  these  develop¬ 
ments  we  can  see  differences  among  disciplines  or, 
more  precisely,  among  specific  groups  of  scholars 
in  specific  countries,  but  also  overlapping  and 
mutual  influences  with  ever  stronger  interactions 
among  scholars  who  are  able  to  cross  borders  from 
Europe  to  North  and  South  America,  and  to  Africa 
and  Asia,  with  a  strong  British  tradition  still  pres¬ 
ent  in  Australia. 

When  we  trace  the  original  development  of 
empirical  political  science,  we  can  see  that  in  a 
large  number  of  European  and  American  coun¬ 
tries,  political  science  is  the  result  of  empirical 
developments  in  public  law.  Consequently,  the  first 
difference  concerns  the  difference  between  the  per¬ 
spective  of  law,  which  deals  with  “what  ought  to 
be” — with  norms  and  the  institutions  that  seek  to 
embody  them,  and  that  of  political  science  as 
transformed  by  behavioralism  into  an  empirical 
social  science,  which  is  focused  on  “what  is” — on 
the  reality  and  on  the  explanations  of  it. 

In  Europe  as  well  as  in  North  and  South 
America,  there  are  other  strong  traditions  that 
make  contemporary  history  a  parent  of  the  new, 
post-World  War  II  empirical  political  science. 
Here,  despite  all  its  ambiguities,  the  criterion  of 
differentiation  is  between  historical  idiographic 
research,  focused  on  the  analysis  of  specific 
unique  events,  and  a  political  science  character¬ 
ized  by  epistemological  and  methodological 


assumptions  of  other  social  sciences  such  as  eco¬ 
nomics,  sociology,  and  psychology,  at  least  in 
terms  of  expectations  of  empirical  findings  (nomo¬ 
thetic)  with  a  more  general  scope  (regularities, 
patterns,  laws).  Social  history  and  historical  soci¬ 
ology  as  in  the  works  of  Reinhard  Bendix, 
Barrington  Moore,  Stein  Rokkan,  Charles  Tilly, 
and  others  have  also  greatly  contributed  to  our 
understanding  of  long-term  political  processes  at 
the  macro-level.  In  this  respect  historical  studies 
and  political  analysis  can  nicely  supplement  each 
other,  as  in  the  adage  “Political  science  without 
history  has  no  root,  history  without  political  sci¬ 
ence  bears  no  fruit.” 

Within  the  European  and  North  American  tra¬ 
ditions,  sociology  is  the  third  parent  of  the  new 
empirical  science.  Here,  in  addition  to  the  com¬ 
mon  epistemology  and  possibly  methodology  of 
research,  the  overlapping  of  the  contents,  when 
political  sociology  is  considered,  makes  the  dif¬ 
ferentiation  more  difficult.  Such  a  criterion  was 
set  up  by  two  famous  sociologists  of  the  1950s, 
Bendix  and  Seymour  Lipset,  when  they  stated  that 
political  science  starts  from  the  state  and  analyzes 
how  it  influences  society,  whereas  political  sociol¬ 
ogy  starts  from  the  society  and  analyzes  how  it 
influences  the  state  (Bendix  &  Lipset,  1957,  p. 
87).  In  other  words,  the  independent  variables  of 
a  sociologist  are  the  dependent  variables  of  a 
political  scientist:  The  arrows  of  explanation  are 
going  in  opposite  directions.  Such  a  distinction 
sounds  artificial  and  unrealistic  when  the  inner 
logic  of  research  is  taken  into  account — if  we 
decide  in  advance  what  is/are  the  independent 
variable/s,  how  can  we  stop  when  no  salient 
results  come  out  and  declare  that  from  now  on 
one  becomes  a  sociologist  or  economist  or  else? 
Nevertheless,  for  years  such  a  distinction  was  the 
rule  of  thumb  used  to  stress  the  difference  between 
political  sociology  and  political  science.  However, 
such  a  rule  was  responding  more  to  the  necessities 
of  differentiation  between  academic  communities 
than  to  the  needs  of  developments  in  empirical 
research.  It  must  also  be  noted  that  political  soci¬ 
ology  can  be  understood  in  both  a  broad  and  a 
narrow  sense.  In  the  former,  it  covers  the  broad 
social-structural  and  political-cultural  bases  of 
politics  and  their  long-term  developments  over 
time  at  the  macro  level.  In  the  latter,  the  interme¬ 
diate  and  input  structures  of  politics  like  interest 
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groups,  parties,  social  movements  and  other 
aspects  of  civil  society  are  dealt  with.  These, 
undoubtedly,  belong  more  to  the  realm  of  political 
science  proper  and  have  continued  to  flourish.  In 
the  former  sense,  closer  to  historical  sociology,  a 
certain  slackening  can  be  observed.  This  is  due  to 
the  fact  that  the  consideration  of  long-term  social- 
structural  developments  had  rigidified  to  some 
extent  in  the  1970s  and  1980s  in  variants  of 
orthodox  Marxism,  or  the  political  element  had 
largely  disappeared  in  the  analysis  of  finer  social 
distinctions  in  Pierre  Bourdieu’s  sense. 

Last  but  not  least,  the  development  of  differ¬ 
ences  between  political  philosophy  and  political 
science  should  be  recalled.  Again,  there  is  much 
overlapping  of  contents,  but  epistemology  and 
methods  are  different  and  easy  to  distinguish.  As 
recalled  by  Giovanni  Sartori  (1984)  with  regard  to 
the  “language  watershed,”  first  of  all,  the  language 
is  different:  The  words  and  the  related  empirical 
concepts  of  political  science  are  operationalized, 
that  is,  translated  into  indicators  and,  when  pos¬ 
sible,  in  measures,  whereas  the  language  of  politi¬ 
cal  philosophy  is  not  necessarily  so;  it  usually 
adopts  meta-observed  concepts,  that  is,  concepts 
that  are  not  empirically  translated. 

As  discussed  in  the  next  section,  this  apparently 
simple  differentiation  covers  possible  commonali¬ 
ties,  but  leaves  unsolved  how  the  two  different 
disciplinary  perspectives  deal  with  normative 
issues.  Norberto  Bobbio  (1971,  pp.  367,  370) 
made  a  relevant  contribution  in  this  direction 
when  he  emphasized  that  political  philosophy 
focuses  mainly  on 

•  the  search  for  the  best  government; 

•  the  search  for  the  foundations  of  the  state  or  the 
justification  of  political  obligations; 

•  the  search  for  the  ‘nature’  of  politics  or  of 
‘politicness’;  and 

•  the  analysis  of  political  language. 

All  four  topics  have  an  ethical,  normative  content, 
which  is  a  characterizing  feature  of  each  political 
philosophical  activity.  At  the  same  time,  Bobbio 
recalls  that  an  empirical  analysis  of  political 
phenomena  that  are  the  objects  of  political  science 
should  satisfy  three  conditions: 


•  the  principle  of  empirical  control  as  the  main 
criterion  of  validity; 

•  explanation  as  the  main  goal;  and 

•  Wertfreiheit,  or  freedom  from  values,  as  the 
main  virtue  of  a  political  scientist. 

As  noted  in  the  discussion  of  epistemology  above, 
the  key  element  is  in  differentiating  the  speculative, 
ethically  bound  activity  of  a  philosopher  from  the 
empirical  analysis,  even  of  phenomena  that  are 
influenced  by  the  values  of  the  actors. 

The  Influences  of  Other  Disciplines 

The  obvious  conclusion  of  the  previous  subsec¬ 
tion  is  that  there  are  different  ways  of  analyzing 
political  phenomena  that  correspond  to  different 
traditions  and  come  from  different  cultural  influ¬ 
ences.  Moreover,  the  discussion  of  those  differ¬ 
ences  may  help  in  developing  a  negative  identity 
of  political  science.  This  is  the  very  first  meaning 
of  the  actual  pluralism  we  have  in  this  domain  of 
knowledge:  Pluralism  only  means  that  politics  can 
be  legitimately  studied  in  different  ways  and  with 
different  goals  that  belong,  at  least,  also  to  law, 
history,  sociology,  and  economics.  Pluralism  in 
this  sense  challenges  the  autonomy  of  political  sci¬ 
ence  and  even,  in  a  radical  version,  has  led  to  a 
denial  that  it  constitutes  a  specific  science.  This 
view,  however,  no  longer  corresponds  to  the  inter¬ 
nal  differentiation  of  the  discipline,  its  specific 
achievements,  and  its  more  general  institutional¬ 
ization  as  an  academic  field.  In  addition,  a  second 
sort  of  pluralism  inside  political  science  proper 
reveals  the  overlapping  and  the  influences  of  other 
disciplines  in  empirical  political  science.  In  this 
vein,  when  again  considering  the  period  starting 
after  World  War  II,  a  main  hypothesis  can  be  pro¬ 
posed:  Political  science  is  influenced  by  the  disci¬ 
pline  or  the  other  social  science  that  in  the  imme¬ 
diately  previous  years  has  developed  new  salient 
knowledge.  This  is  so  for  sociology,  as  can  be  seen 
in  the  analysis  of  Lipset  and  Bendix  and  other 
important  authors  since  the  end  of  World  War  II, 
who  developed  the  work  of  classic  sociologists, 
from  Weber  and  Durkheim  to  Parsons  and  others. 
This  is  so  for  the  influence  of  general  systems 
theory,  coming  from  cybernetics,  and  translated 
meaningfully  into  the  analysis  of  political  systems 
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so  that  since  the  mid-1950s,  it  has  become  a  major 
approach  in  political  science.  The  same  applies  to 
the  influence  of  functionalism,  born  with  the 
developments  of  anthropology,  and  to  rational 
choice  or  more  specifically  game  theory,  coming 
from  economics  and  becoming  more  and  more 
influential  with  several  adaptations  since  the  end 
of  the  1950s.  This  is  so,  finally,  for  cognitive  psy¬ 
chology  that  became  very  important  in  economics 
and  at  the  same  time  in  political  science  with  the 
development  of  new  ways  of  studying  electoral 
behavior. 

Moreover,  when  we  consider  more  closely  some 
of  the  subsectors  of  political  science  we  can  see 
more  specific  influences.  For  example,  in  the  field 
of  international  relations,  we  can  see  the  influence 
of  international  law.  The  public  policies  sector  of 
political  science  has  been  influenced  by  sociology, 
economics,  and  constitutional  and  administrative 
law.  Sociology  has  shaped  the  development  of 
research  on  political  communication.  The  influ¬ 
ence  of  history  can  be  seen  in  the  selection  of  spe¬ 
cific  topics  in  comparative  politics.  Thus,  we  see 
that  political  science  not  only  embodies  a  highly 
developed  pluralism  of  the  two  kinds  mentioned 
above,  but  also  requires  the  integration  of  knowl¬ 
edge  from  other  disciplines.  Political  scientists 
therefore  also  need  an  educational  background 
that  enables  them  to  draw  on  these  interdisciplin¬ 
ary  sources  of  the  field. 

Recent  Developments  and  Perspectives 

As  the  International  Political  Science  Association 
notes  on  its  website, 

it  is  hard  today  for  political  scientists  of  2011  to 
imagine  the  very  different  status  their  discipline 
in  the  world  under  reconstruction  of  1949.  In 
place  of  the  familiar,  well-structured  web  of 
national  associations  we  know  today,  there  were 
associations  only  in  the  United  States  (founded  in 
1903),  Canada  (1913),  Finland  (1935),  India 
(1938),  China  (1932),  and  Japan  (1948).  (http:// 
www.ipsa.org/history/prologue) 

Founders  of  the  International  Political  Science 
Association  met  in  1948  to  plan  for  a  new  interna¬ 
tional  organization  that  would  establish  dialogue 
among  political  scientists  throughout  the  world. 


Taking  into  account  the  views  of  political  science 
as  variously  defined  in  different  countries,  they 
identified  four  fields  as  constituting  the  discipline, 
acknowledging  “the  influence  of  the  philosophers 
with  ‘political  theory,’  the  jurists  with  ‘govern¬ 
ment,’  the  internationalists  with  ‘international 
relations,’  and  the  fledgling  behaviorist  school  of 
American  political  science  with  ‘parties,  groups 
and  political  opinion.’”  Today  IPSA  serves  as  the 
primary  international  organization  in  the  field, 
with  individual  and  institutional  members  as  well 
as  affiliations  with  national  political  science  mem¬ 
berships  across  the  globe. 

With  respect  to  the  ways  that  pluralism  and 
interdisciplinary  developments  have  taken  place  in 
political  science,  the  North  American  influence  has 
been  paramount.  The  so-called  Americanization 
affected  all  of  Europe  as  well  as  other  areas  of  the 
world  where  native  scholars,  educated  in  North 
American  universities,  went  back  to  conduct 
research  and  to  teach,  bringing  a  new  empirical 
conception  of  the  discipline  that  significantly  con¬ 
tributed  to  create  new  communities  of  political 
scientists  (Favre,  1985).  Moreover,  American  foun¬ 
dations  and  research  centers  gave  support  for 
research  in  Europe,  Latin  America,  Asia,  and 
Africa.  While  there  are  differences  in  political  sci¬ 
ence  as  it  exists  today  on  different  continents  in 
this  domain  of  knowledge — and  actually  also  in 
most  other  scientific  research  domains — the  North 
American  universities,  as  well  as  the  American 
research  centers  and  the  scholars  associated  with 
them,  had  a  great  influence  that  can  be  compared 
only  to  the  intellectual  German  influence  during 
the  50  years  between  the  end  of  the  19th  century 
and  the  first  3  decades  of  the  20th  century.  Thus, 
at  the  end  of  the  1960s,  Mackenzie  (1969,  p.  59) 
suggested  that  in  this  period  90%  of  political  sci¬ 
entists  worked  in  North  America,  and  Klaus  von 
Beyme  noted  that  the  Department  of  Political 
Science  at  the  University  of  California,  Berkeley 
had  more  professors  in  this  field  than  all  German 
universities  combined.  Moreover,  in  those  years 
and  earlier  in  the  1950s  in  all  European  countries 
and  in  Japan,  the  American  influence  had  been 
very  strong  in  all  social  sciences,  with  some  excep¬ 
tions  such  as  anthropology,  which  had  a  specific 
French  presence.  Forty  years  later,  70%  of  all 
political  scientists  are  almost  equally  present  in 
North  America  and  Western  Europe  and  the  other 
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30%  are  spread  throughout  the  rest  of  world, 
again  with  a  relatively  strong  presence  in  Japan. 

To  better  understand  the  development  of  the 
discipline  all  over  the  world  with  its  specific  con¬ 
tents,  approaches,  and  methods,  we  should  note 
that  the  American  influence  has  been  supple¬ 
mented  by  the  great  increase  of  faculty  members  in 
all  universities  of  the  world  since  the  1960s. 
During  this  period,  especially  in  Europe,  there  was 
the  so-called  transition  from  elite  universities  to 
mass  universities;  that  is,  there  was  a  significant 
growth  in  the  number  of  university  students, 
which  required  the  recruitment  of  a  large  number 
of  new  faculty  members  in  all  disciplines,  political 
science  included.  This  growth  of  the  discipline 
allowed  the  creation  of  academic  groups  who 
absorbed  and  translated  the  American  influence  in 
different  ways.  Without  that  internal  growth,  there 
would  not  have  been  even  the  possibility  of  such  a 
widespread  influence. 

This  penetrating  influence  had  a  different  impact 
in  the  various  countries  also  in  connection  with 
their  respective  traditions.  More  precisely,  on  the 
one  hand,  the  influence  of  the  way  empirical 
research  is  developed  through  quantitative  statisti¬ 
cal  analysis  and  qualitative  research  is  general  and 
fairly  homogeneously  widespread;  on  the  other 
hand,  some  approaches  that  have  a  stronger  cor¬ 
respondence  or  congruence  in  the  European  and 
Japanese  traditions,  such  as  the  different  neo¬ 
institutionalist  approaches,  have  had  more  success 
than  other  approaches,  such  as  the  rational  choice 
approach.  That  latter  has  become  very  strong  in 
North  American  political  science,  where  it  has  its 
roots  in  economics,  but  it  has  remained  much 
weaker  among  political  scientists  in  other  areas  of 
the  world.  By  its  very  nature,  political  science  in 
other  regions  of  the  world  also  has  been  more  spe¬ 
cifically  historical  and  comparative  rather  than 
just  focusing  (mostly)  on  a  single  case,  the  United 
States.  Moreover,  the  legal  traditions  of  several 
European  countries  especially  influenced  research 
in  the  subfield  of  public  policies.  At  the  same  time, 
traditions  in  political  philosophy  and  contempo¬ 
rary  history  maintained  some  influence  on  research 
that  was  predominantly  qualitative  rather  than 
quantitative.  Finally,  and  more  specifically  in 
Europe,  research  funding  from  the  European 
Union  led  to  the  development  of  a  number  of 
works  focused  on  topics  related  to  the  Union. 


In  the  most  recent  developments,  the  impact  of 
a  more  continuous  and  effective  communication 
among  scholars  through  different  modalities,  such 
as  domestic  and  international  collective  associa¬ 
tions,  research  networks,  and  initiatives  of  private 
and  public  institutions,  affected  the  discipline  as  a 
whole  mainly  in  three  directions.  The  first  one  is  a 
growing  trend  toward  blurring  national  differ¬ 
ences  and  a  consequent  convergence  between 
North  America,  or  between  North  and  South 
America,  and  Europe.  The  second  is  an  increased 
blurring  of  subdisciplinary  divides.  This  is  so  espe¬ 
cially  between  comparative  politics  and  interna¬ 
tional  relations,  traditionally  two  separate  fields  in 
the  past.  Such  a  trend  is  particularly  evident  in  the 
European  studies.  Third,  research  in  political  sci¬ 
ence  more  and  more  focuses  on  relevant,  contem¬ 
porary  realities  rather  than  confining  itself  to  an 
ivory  tower,  which  made  it  distant  and  largely 
irrelevant  and,  consequently,  created  that  “tragedy 
of  political  science”  Ricci  singled  out  years  ago 
(1987).  Contemporary  political  science  thus  has 
developed  into  a  multi-faceted,  well-established 
discipline  that  is  concerned  with  the  pressing  prob¬ 
lems  of  our  times  and  provides  sound  empirical 
analyses  and  meaningful  orientation  in  the  ever 
more  integrated  and  complex  world  of  the 
21st  century. 

Bertrand  Badie 
Professor  of  International  Relations, 
Paris  Institute  of  Political  Studies  (Sciences  Po) 

Paris,  France 

Dirk  Berg-Schlosser 
Emeritus  Professor  of  Political  Science, 
University  of  Marburg 
Marburg,  Germany 

Leonardo  Morlino 
Professor  of  Political  Science,  LUISS 

Rome,  Italy 

See  also  African  Political  Thought,  Behavioralism; 

Buddhism,  Comparative  Politics;  Concept  Formation; 
Constructivism;  Epistemological  and  Methodological 
Foundations;  Greek  Philosophy;  Hinduism;  History  of 
Political  Science;  International  Relations  as  a  Field  of 
Study;  Islam;  Marxism;  Policy  Analysis;  Political 
Sociology  as  a  Field  of  Study;  Political  Theory 


lxvi  Introduction  to  Political  Science 


Further  Readings 

Adorno,  T.,  Albert,  H.  Dahrendorf,  R.,  Habermas,  J., 
Pilot,  H.,  &  Popper,  K.  R.  (Eds.).  (1969).  Der 
Positivismusstreit  in  der  deutschen  Soziologie  [The 
positivism  debate  in  German  sociology].  Darmstadt, 
Germany:  Luchterhand. 

Almond,  G.  A.,  &  Genco,  S.  (1977).  Clouds,  clocks, 
and  the  study  of  politics.  World  Politics,  29(4), 
489-522. 

Aristotle.  (1962).  Politics.  Harmondsworth,  UK:  Penguin. 
(Original  work  350  BC) 

Bendix,  R.,  &  Lipset,  S.  M.  (1957).  Political  sociology: 

An  essay  and  bibliography.  Current  Sociology, 
2(VI):79— 99. 

Bobbio,  N.  (1971).  Considerazioni  sulla  filosofia  politica 
[Considerations  on  Political  Philosophy].  Rivista 
Italiana  di  Scienza  Politica,  1(2),  367-380. 

Brady,  H.  E.,  &  Collier,  D.  (2004).  Rethinking  social 
inquiry:  Diverse  tools,  shared  standards.  Lanham, 

MD:  Rowman  &  Littlefield. 

Clastres,  P.  (1975).  La  societe  contre  I’Etat.  [Society 
against  the  state].  Paris:  Minuit. 

Coleman,  J.  S.  (1990).  Foundations  of  social  theory. 
Cambridge,  MA:  Harvard  University  Press. 

D’Andrade,  R.  G.  (1995).  The  development  of  cognitive 
anthropology.  Cambridge,  UK:  Cambridge  University 
Press. 

Dahrendorf,  R.  (1977).  Homo  Sociologies.  Opladen, 
Germany:  Westdeutscher  Verlag. 

Deutsch,  K.  W.  (1963).  The  nerves  of  government.  New 
York:  Free  Press. 

Durkheim,  E.  (1975).  Textes.  Paris:  Editions  de  Minuit. 

Easton,  D.  (1965).  A  systems  analysis  of  political  life. 
New  York:  Wiley. 

Easton,  D.,  Gunnell,  J.  G.,  &  Graziano,  L.  (Eds.).  (1991). 
The  development  of  political  science.  London: 
Routledge. 

Esser,  H.  (1993).  Soziologie.  Allgemeine  Grundlagen 
[Sociology.  General  Foundations].  Frankfurt  am  Main, 
Germany:  Campus. 

Evans-Pritchard,  E.  (1940).  The  Nuer.  Oxford,  UK: 
Clarendon  Press. 

Favre,  P.  (1985).  Histoire  de  la  science  politique  [History 
of  political  science].  In  M.  Grawitz  &  J.  Leca  (Eds.), 
Traite  de  science  politique  [Treatise  of  political 
science]  (Vol.  1,  pp.  28-41).  Paris:  Presses 
Universitaires  de  France. 

Foucault,  M.  (1970).  The  order  of  things.  London: 
Routledge. 

Geertz,  C.  (1973).  The  interpretation  of  cultures.  New 
York:  Basic  Books. 


Goodin,  R.  E.,  &  Tilly,  C.  (Eds.).  (2006).  Contextual 
political  analysis.  Oxford,  UK:  Oxford  University  Press. 
Habermas,  J.  (1975).  Legitimation  crisis.  Boston:  Beacon 
Press. 

Habermas,  J.  (1981).  Theorie  des  kommunikativen 
Handelns  [Theory  of  Communicative  Action]  (2 
vols.).  Frankfurt  am  Main,  Germany:  Suhrkamp. 
Hempel,  C.  G.  (1965).  Aspects  of  scientific  explanation 
and  other  essays  in  the  philosophy  of  science.  New 
York:  Free  Press. 

Jackson,  R.  (2000).  The  global  covenant.  Oxford,  UK: 
Oxford  University  Press. 

Kant,  I.  (1956).  Kritik  der  reinen  Vernuft  [Critique  of 
pure  reason].  Hamburg,  Germany:  Felix  Meiner. 
(Original  work  published  1787) 

King,  G.,  Keohane,  R.  O.,  &  Verba,  S.  (1994).  Designing 
social  inquiry.  Princeton,  NJ:  Princeton  University 
Press. 

Lasswell,  H.,  &  Kaplan,  A.  (1950).  Power  and  societies. 

New  Haven,  CT:  Yale  University  Press. 

Lorenzen,  P.  (1978).  Theorie  der  teclmischen  und 

politischen  Vernunft  [Theory  of  technical  and  political 
reason].  Stuttgart,  Germany:  Reclam. 

Luhmann,  N.  (1984).  Soziale  Systeme  [Social  systems]. 

Frankfurt  am  Main,  Germany:  Suhrkamp. 

Lukacs,  G.  (1967).  Geschichte  und  Klassenbewufitsein 
[History  and  class  consciousness].  Darmstadt, 
Germany:  Luchterhand. 

Mackenzie,  W.  M.  (1969).  Politics  and  social  science. 
London:  Penguin. 

Madan,  T.  N.  (Ed.).  (1992).  Religion  in  India.  Delhi, 
India:  Oxford  University  Press. 

Mbiti,  J.  S.  (1969).  African  religions  and  philosophy. 
London:  Heinemann. 

Merton,  R.  K.  (1968).  Social  theory  and  social  structures. 
New  York:  Free  Press. 

Monroe,  K.  R.  (Ed.).  (  2005).  Perestroika!  The  raucous 
rebellion  in  political  science.  New  Haven,  CT:  Yale 
University  Press. 

Morgenthau,  H.  (1948).  Politics  among  nations.  New 
York:  Knopf. 

Moses,  J.  W.,  &  Knutsen,  T.  L.  (2007).  Ways  of  knowing: 
Competing  methodologies  and  methods  in  social  and 
political  research.  London:  Palgrave  Macmillan. 
Parsons,  T.  (1951).  The  social  system.  Cambridge,  MA: 

Harvard  University  Press. 

Plato.  (1945).  The  republic.  New  York:  Oxford 
University  Press.  (Original  work  360  BC) 

Popper,  K.  (1972).  Objektive  Erkenntnis.  Ein 

evolutiondrer  Entwurf  [Objective  knowledge:  An 
evolutionary  approach].  Hamburg,  Germany: 

Hoffmann  und  Campe. 


Introduction  to  Political  Science  lxvii 


Quermonne,  J.-L.  (Ed.).  (1996).  Political  science  in 

Europe:  Education,  co-operation,  prospects,  report  on 
the  state  of  the  discipline  in  Europe.  Paris:  Institut 
d’Etudes  Politiques. 

Ricci,  D.  (1987).  The  tragedy  of  political  science:  Politics, 
scholarship,  and  democracy.  New  Haven,  CT:  Yale 
University  Press. 


Sartori,  G.  (Ed.).  (1984).  Social  science  concepts:  A 
systematic  analysis.  Beverly  Hills,  CA:  Sage. 

Shin,  D.  C.  (1999).  Mass  politics  and  culture  in 

democratizing  Korea.  Cambridge,  UK:  Cambridge 
University  Press. 

Weber,  M.  (1968).  Economy  and  society.  New  York: 
Bedminster. 


Accountability 


Accountability  has  become  a  key  concept  in  both 
public  administration  and  democratic  theory.  Its 
meaning  is  contested,  but  the  general  definition 
“obligation  to  answer  for  the  performance  of 
duties”  would  fit  most  versions.  In  this  sense, 
accountability  is  a  relationship  between  two  par¬ 
ties — the  person  or  organization  answering  or 
being  held  to  account  (the  accountor  or  agent)  and 
the  person  or  organization  to  whom  the  account  is 
owed  (the  account  holder  or  principal).  Analysis 
of  accountability  therefore  begins  with  the  double 
question:  Who  is  accountable  to  whom ? 
Accountability  obligations  depend  on  the  terms  of 
the  relationship  and  on  its  institutional  context, 
leading  to  two  more  key  questions:  (1)  For  ivhat  is 
the  accountor  accountable  and  (2)  how}  This 
entry  covers  the  following  topics:  defining  account¬ 
ability,  typologies  of  accountability,  mechanisms  of 
public  sector  accountability,  democratic  account¬ 
ability,  accountability  in  international  relations, 
single  versus  multiple  accountability,  accountabil¬ 
ity  in  networks,  and  accountability  and  the  new 
public  management  (NPM). 

Though  the  English  word  accountability  has  a 
respectable  historical  pedigree  (the  Oxford  English 
Dictionary  records  its  use  from  the  late  18th  cen¬ 
tury),  its  prominence  in  political  science  dates  only 
from  the  1980s,  before  which  time  the  cognate 
term  responsibility  was  preferred.  (Indeed,  linguis¬ 
tic  equivalents  of  “responsibility”  are  still  domi¬ 
nant  in  other  European  languages  that  lack  a  direct 


parallel  for  “accountability.”)  The  rapid  rise  of 
“accountability”  can  be  traced  to  its  adoption  by 
public  choice  theory,  particularly  principal-agent 
theory,  and  by  management  theory,  which  in  turn 
greatly  influenced  democratic  theory  and  public 
administration.  However,  the  relative  lack  of  intel¬ 
lectual  history  and  of  cross-disciplinary  seminal 
texts  has  meant  that  academic  analysis  of  account¬ 
ability  has  proceeded  in  a  haphazard,  fragmented, 
and  repetitive  fashion.  Different  subdisciplines, 
including  comparative  politics,  international  rela¬ 
tions,  deliberative  democracy,  constitutional  law, 
and  public  management  (further  subdivided  into 
U.S.  and  European  versions)  have  each  been  devel¬ 
oping  their  own  parallel  theories  of  accountability, 
with  little  cross-fertilization  or  sense  of  common 
purpose. 

Defining  Accountability 

Though  the  core  sense  of  accountability,  the  obliga¬ 
tion  of  the  accountor  to  answer  for  the  performance 
of  duties,  is  uncontroversial,  disagreement  occurs 
over  what  should  be  added  to  that  core.  Most 
analyses  also  include  the  capacity  of  the  account 
holder  to  impose  sanctions  or  other  remedies  on  the 
accountor  as  a  necessary  complement  to  full 
accountability.  However,  some  versions  confine 
accountability  to  the  initial  informing  and  discuss¬ 
ing  stages,  omitting  the  requirement  for  any  rectifi¬ 
cation.  In  effect,  they  equate  accountability  with 
transparency ,  another  popular  term  with  which 
accountability  is  frequently  linked.  Certainly,  in 
complex  modern  systems  of  public  accountability, 
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some  accountability  mechanisms,  such  as  parlia¬ 
mentary  inquiry  or  media  investigation,  can  pro¬ 
vide  transparency  but  lack  the  capacity  for 
imposing  sanctions,  leaving  that  function  to  other 
agencies  such  as  courts  or  the  executive.  But 
transparency  on  its  own,  with  no  prospect  of  cor¬ 
rection  or  other  adverse  consequences,  falls  short 
of  full  accountability. 

In  its  core  sense,  accountability,  like  accounting 
itself,  is  essentially  retrospective  or  ex  post,  in  that 
it  is  concerned  with  information  and  explanation 
about  past  actions  of  the  accountor.  Discussion 
about  future  actions  is,  therefore,  not,  strictly 
speaking,  an  exercise  in  accountability.  However, 
in  a  continuing  relationship  between  principal  and 
agent,  ex  post  can  easily  overlap  with  ex  ante,  as 
in  election  campaigns  when  incumbent  representa¬ 
tives  not  only  defend  their  past  actions  but  also 
outline  their  future  plans.  Indeed,  some  theorists 
have  wanted  to  distinguish  two  types  or  aspects  of 
accountability,  ex  post  (retrospective)  and  ex  ante 
(prospective)  accountability,  while  others  have 
included  all  communication  between  political  lead¬ 
ers  and  the  public  as  part  of  an  ongoing  account¬ 
ability  dialogue.  Such  usage,  while  understandable 
because  of  the  obligations  of  democratic  govern¬ 
ments  to  engage  in  continuing  discussion  with 
their  citizens,  extends  accountability  beyond  its 
normal  focus  of  answering  for  past  actions. 
Similarly,  government  consultation  with  stake¬ 
holders  about  future  policy  certainly  helps  keep 
politicians  and  bureaucrats  in  touch  with  relevant 
sections  of  public  opinion,  but  it  does  not  necessar¬ 
ily  imply  accountability  in  the  strict  sense  of 
accounting  for  previous  decisions. 

Accountability  is  also  sometimes  taken  beyond 
its  retrospective  core  when  it  is  equated  with  insti¬ 
tutional  devices  for  limiting  or  constraining  power. 
For  instance,  constitutional  checks  and  balances, 
such  as  federalism  and  the  separation  of  powers, 
are  sometimes  described  as  mechanisms  of  account¬ 
ability  because  they  limit  the  legal  power  of  gov¬ 
ernments  and  prevent  them  from  abusing  the 
rights  of  citizen.  Such  constraints  may  certainly 
involve  accountability  mechanisms,  for  instance, 
when  a  government  oversteps  its  legal  powers  and 
is  called  to  account  by  the  courts.  However,  con¬ 
stitutional  laws  and  regulations  themselves  are 
essentially  prospective  in  focus,  seeking  to  limit 
and  control  future  actions  of  governments.  The 


mere  fact  that  laws  and  regulation  constrain  the 
power  of  governments  need  not  in  itself  imply  that 
they  are  instruments  of  accountability. 

Significant  disagreement  also  surrounds  how  far 
accountability  is  to  be  distinguished  from  other, 
closely  related  concepts,  namely,  responsibility  and 
responsiveness.  “Responsibility”  and  “account¬ 
ability”  share  parallel  conceptual  histories,  both 
originating  in  the  notion  of  answering  (“respond¬ 
ing,”  “giving  an  account  to”)  to  someone. 
“Accountability,”  as  the  obligation  to  answer  for 
the  performance  of  duties,  generally  implies  a  rela¬ 
tionship  between  two  or  more  parties,  in  which 
one  party  is  subject  to  external  scrutiny  from  oth¬ 
ers.  “Responsibility,”  on  the  other  hand,  typically 
refers  to  the  internal  deliberations  and  actions  of  a 
single  person  or  organization. 

This  division  of  conceptual  labor  is  by  no  means 
universally  observed.  “Responsibility”  sometimes 
includes  answering  to  others  as  well  as  making  indi¬ 
vidual  choices,  as,  for  instance,  in  the  classic  British 
conventions  of  “responsible  government”  and 
“ministerial  responsibility.”  Indeed,  before  the  rise 
of  the  term  accountability,  responsibility  regularly 
covered  both  external  and  internal  aspects  (as  its 
equivalents  still  do  in  other  European  languages). 

The  popularity  of  “accountability”  has  seen  it 
extended  in  a  similar  fashion,  but  in  the  opposite 
direction,  to  include  not  only  external  scrutiny  but 
also  the  internal  capacity  for  considered  and  con¬ 
scientious  action  usually  described  as  acting 
“responsibly.”  For  instance,  where  members  of 
caring  professions,  such  as  social  workers  and 
health  professionals,  have  a  strong  vocational  com¬ 
mitment  to  serving  the  interests  of  the  community, 
this  concern  is  sometimes  identified  as  in  itself 
constituting  accountability  to  the  community, 
regardless  of  whether  members  of  the  community 
actually  have  any  rights  of  scrutiny  or  complaint. 
Similarly,  members  of  the  nonprofit  charitable  sec¬ 
tor  commonly  see  themselves  as  accountable  to  the 
recipients  of  charity  even  though  they  are  not, 
strictly  speaking,  answerable  to  them.  Particularly 
where  professional  standards  or  values  are  publicly 
articulated,  voluntary  adherence  to  such  standards 
can  be  seen  as  a  form  of  accountability  without 
any  requirement  to  report  externally  or  without 
external  any  mechanism  for  others  to  demand 
compliance.  The  accountability  has  become  inter¬ 
nalized,  as  though  the  conscientious  professional  is 
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in  an  imaginary  dialogue  with  his  or  her  clients 
and  answering  to  them. 

The  question  of  whether  an  internalized  sense 
of  service  should  be  accepted  as  a  form  of  account¬ 
ability  recalls  the  perennial  debate  over  the  relative 
merits  of  professional  discretion  compared  with 
external  scrutiny  as  a  means  of  securing  public 
service  in  the  public  interest.  The  issue  was  classi¬ 
cally  discussed  by  Carl  Friedrich  and  Herman 
Finer  in  the  1940s  as  a  debate  over  two  contrasting 
types  of  responsibility  but  is  now  typically  described 
as  a  clash  between  two  types  of  accountability. 
Conceptual  clarity  might  be  better  served  if 
accountability  were  reserved  for  external  scrutiny 
and  rectification,  recasting  the  Friedrich-Finer 
debate  as  being  between  the  respective  merits  of 
(internal)  responsibility  and  (external)  account¬ 
ability.  However,  the  emotive  pull  of  “account¬ 
ability,”  like  that  of  “democracy,”  makes  support¬ 
ers  of  professional  discretion  unwilling  to  accept 
what  would  amount  to  an  accountability  deficit. 
Instead,  they  prefer  to  see  themselves  as  embracing 
a  special  form  of  accountability. 

Similar  extensions  beyond  the  core  meaning  of 
accountability  derive  from  the  connection  between 
accountability  and  responsiveness.  Accountability, 
as  the  obligation  to  answer  to  external  scrutiny, 
derives  its  main  justification  from  its  contribution 
to  responsiveness,  understood  as  the  readiness  of 
institutions  and  officials  to  respond  to  the  needs 
and  interests  of  those  whom  they  serve.  In  public 
administration,  for  instance,  responsiveness,  the 
alignment  between  official  action  and  public  pref¬ 
erences,  is  the  goal  to  which  the  accountability  of 
governments  is  a  key  means.  However,  account¬ 
ability  procedures  of  external  scrutiny,  such  as 
review  and  audit,  are  not  the  only  mechanisms  for 
making  governments  responsive.  For  instance, 
changing  the  organizational  culture  of  a  public 
agency  toward  a  greater  client  focus  and  concern 
for  service  quality  can  make  the  agency  more 
responsive  to  the  public  but  need  not  involve  addi¬ 
tional  accountability  procedures  of  scrutiny  or 
inquiry.  Yet  many  in  such  an  agency  would  typi¬ 
cally  claim  that  it  had  become  both  more  respon¬ 
sive  and  more  accountable,  in  effect  identifying 
accountability  with  responsiveness. 

The  link  between  accountability  and  responsive¬ 
ness  also  arises  with  market  mechanisms.  Markets 
are  unquestionably  instruments  of  responsiveness. 


They  find  their  main  rationale  in  their  capacity  to 
align  the  provision  of  goods  and  services  with  the 
preferences  of  consumers  and  citizens.  But  are  they 
therefore  instruments  of  accountability?  According 
to  a  strict  understanding,  organizations  operating 
in  a  market,  particularly  private  companies,  are 
accountable  primarily  to  their  owners  and  share¬ 
holders,  not  to  their  customers.  They  may  be 
accountable  to  individual  consumers  who  have 
purchased  goods  or  services,  and  they  may  also  be 
publicly  accountable  in  the  sense  of  being  liable  to 
scrutiny  for  complying  with  any  relevant  law  and 
regulations.  But  overall  responsiveness  to  con¬ 
sumer  preferences  comes  from  the  consumers’ 
capacity  to  choose  between  alternative  suppliers  in 
a  competitive  market.  Suppliers  adjust  to  consumer 
demand  not  because  of  any  complaints  or  scrutiny 
from  potential  customers,  who  have  no  such  rights 
against  a  supplier,  but  because  they  will  go  out  of 
business  if  they  have  no  customers. 

Markets  are  thus  primarily  “exit”  mechanisms 
for  securing  responsiveness.  Dissatisfied  customers 
vote  with  their  feet.  Accountability,  on  the  other 
hand,  as  normally  understood,  is  a  “voice”  mecha¬ 
nism,  allowing  dissatisfied  members  of  the  public 
to  complain  and  to  seek  information  and  rectifica¬ 
tion  from  an  organization.  In  this  case,  markets  do 
not  count  as  accountability  mechanisms. 
Nonetheless,  because  competitive  markets  have 
the  capacity  to  force  service  providers  to  take  note 
of  consumer  preferences  and,  in  some  sense,  to 
“answer”  to  expressed  demand,  to  describe  them 
as  instruments  of  accountability  can  have  a  certain 
plausibility. 

In  the  same  way,  organizations  that  respond  to 
peer  or  public  opinion  through  concern  for  their 
corporate  reputations  are  sometimes  said  to  be 
exhibiting  “reputational”  accountability.  Again, 
there  is  no  direct  connection  or  dialogue  between 
the  supposed  “accountor”  and  the  “account 
holder,”  no  right  of  the  public  or  the  organiza¬ 
tion’s  peers  to  call  the  organization  to  account, 
and  no  formal  obligation  of  the  accountor  to 
accept  sanctions  or  redirections  from  others. 
Again,  however,  the  emotive  force  of  the  term 
accountability  leads  sympathetic  observers  to  clas¬ 
sify  such  responsiveness  as  instances  of  account¬ 
ability.  Moreover,  in  modern  democratic  societies, 
reputational  effects  are  often  the  result  of  media 
publicity  and  scrutiny.  There  are  good  grounds  for 
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seeing  the  media  as  agents  of  accountability,  hold¬ 
ing  public  figures  and  organizations  up  to  scrutiny, 
even  though  the  media  may  have  no  formal  rights 
to  demand  information  or  to  impose  sanctions.  As 
with  transparency  in  general,  however,  such 
accountability  without  rectification  is  inchoate  and 
incomplete. 

Accountability  is  thus  a  chameleon-like  concept 
that  readily  takes  on  new,  additional  senses  from 
the  different  contexts  in  which  it  is  used.  Because 
external  scrutiny  and  sanctions,  the  core  of  account¬ 
ability,  are  so  central  to  checking  abuses  of  power 
and  to  the  processes  of  representative  democracy, 
the  term  itself  is  easily  extended  to  other  mecha¬ 
nisms  and  processes  that  secure  the  same  overall 
objectives,  including  legal  and  regulatory  con¬ 
straints,  market  competition,  and  public  service 
professionalism  and  commitment  to  the  public 
interest.  These  extensions  and  variations  have  been 
driven  by  the  emotive  power  of  the  term  combined 
with  the  lack  of  widely  recognized  academic 
authorities  on  the  topic.  To  expect  agreement  on  a 
single  concept  of  accountability  is  unrealistic.  But, 
at  least,  analysts  of  accountability  could  become 
more  aware  of  the  well-established  variations  in 
usage  and  more  willing  to  place  their  own  versions 
within  that  larger  conceptual  context. 

Typologies  of  Accountability 

Most  analyses  of  accountability  divide  it  into  dif¬ 
ferent  types,  though,  as  with  the  definition  of 
accountability,  there  is  no  agreement  on  a  typol¬ 
ogy  or  even  on  the  meaning  of  some  of  the  labels. 
Some  typologies  are  based  on  distinctions  in  the 
subject  matter  of  accountability  ( for  what),  for 
instance,  between  accountability  for  contestable 
outcomes  (“political”)  or  for  agreed  tasks  (“man¬ 
agerial”)  and  accountability  for  different  types  of 
activity,  such  as  finances,  processes,  and  perfor¬ 
mance.  Other  typologies  focus  more  on  the  insti¬ 
tutions  and  mechanisms  of  accountability,  for 
instance,  “political,”  “legal,”  “bureaucratic,”  and 
“professional.” 

Several  classifications  are  built  on  distinctions  in 
the  direction  of  accountability,  such  as  “vertical,” 
which  can  include  “upward”  accountability  within 
a  structured  hierarchy  as  well  as  downward  to  citi¬ 
zens  and  customers,  and  “horizontal”  and  “out¬ 
ward,”  which  refers  to  accountability  to  institutions 


or  individual  of  roughly  equal  status .  A  “  3  60-degr  ee  ” 
accountability  implies  accountability  in  all  direc¬ 
tions:  upward,  outward,  and  horizontal.  “Internal” 
versus  “external”  accountability  can  refer  to  the 
difference  in  to  whom  an  organization  is  account¬ 
able,  internal  referring  to  those  who  exercise  clear 
ownership  or  delegation  rights  while  “external” 
refers  to  those  who  have  no  such  rights  but  are  nev¬ 
ertheless  affected. 

Typologies  tend  to  be  context  dependent.  For 
instance,  writers  on  developmental  politics  and 
democratization  have  employed  the  contrast 
between  “horizontal”  and  “vertical”  accountabil¬ 
ity  to  stress  the  importance  of  having  executive 
governments  accountable  (horizontally)  to  other 
coequal  institutions,  such  as  courts,  legislatures, 
and  auditors.  Horizontal  accountability  has 
become  equated  with  the  rule  of  law  and  constitu¬ 
tional  government,  seen  as  prerequisites  for  suc¬ 
cessful  representative  democracy.  In  international 
politics,  on  the  other  hand,  types  of  accountability 
identified  in  one  influential  study  (hierarchical, 
supervisory,  fiscal,  legal,  market,  peer,  and  repu¬ 
tational)  reflect  the  political  realities  of  interna¬ 
tional  relations  and  the  absence  of  some  of  the 
more  robust  accountability  mechanisms,  such  as 
elections  and  legal  sanctions,  available  within 
nation-states. 

In  the  public  administration  literature,  the  struc¬ 
ture  of  accountability  typologies  tends  to  follow 
constitutional  structure,  with  a  sharp  division 
between  the  approach  in  the  United  States  and  that 
in  the  parliamentary  systems  of  the  United  Kingdom 
(UK);  other  Westminster  systems  such  as  Canada, 
Australia,  and  New  Zealand;  and  Western  Europe. 
In  the  United  States,  the  separation  of  powers, 
along  with  federalism,  makes  bureaucrats  indepen¬ 
dently  accountable  to  a  range  of  institutions, 
including  the  president,  Congress,  and  the  courts, 
forcing  them  to  juggle  between  competing  account¬ 
ability  demands.  Analyses  of  accountability  tend  to 
stress  the  discretionary  role  of  bureaucrats  and  the 
varied  types  of  accountability  forum — hierarchical, 
political,  legal,  and  professional — in  which  they 
operate. 

In  parliamentary  systems,  by  contrast,  bureau¬ 
cratic  accountability  centers  on  the  hierarchical 
chain  of  accountability  through  ministers  to  par¬ 
liament  and  the  public,  associated  with  the  tradi¬ 
tional  conventions  of  ministerial  responsibility 
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and  described  variously  as  “parliamentary”  or 
“political.”  To  this  have  been  added  other,  supple¬ 
mentary  types,  such  as  “legal,”  “judicial”  (to  cover 
accountability  through  courts  and  tribunals),  or 
“managerial,”  referring  specifically  to  the  output- 
focused  accountability  emphasized  in  the  NPM. 

Not  only  do  different  typologies  contain  differ¬ 
ent  sets  and  combinations  of  individual  types  of 
accountability  but  particular  individual  types  also 
vary  in  meaning  between  different  typologies. 
Political  accountability,  for  instance,  is  sometimes 
confined  to  politicians  and  to  processes  involving 
politicians  while  at  other  times  it  refers,  more 
broadly,  to  contestable  outcomes  involving  non- 
elected  officials  as  well  as  elected  politicians.  Legal 
and  judicial  accountability  typically  imply  the 
processes  of  courts  and  the  legal  system.  But  they 
vary  as  to  whether  they  are  confined  to  the  judi¬ 
cial  branch  of  government  or  also  include  quasi¬ 
judicial  tribunals  or  other  executive-based  institu¬ 
tions  such  as  ombudsmen  and  freedom  of  infor¬ 
mation  rules.  Legal  accountability  has  also  been 
defined  even  more  broadly  to  cover  external 
monitoring  for  compliance  with  legally  estab¬ 
lished  rules,  which  therefore  implies  that  legisla¬ 
tures  scrutinizing  compliance  with  their  own  leg¬ 
islation  are  engaging  in  legal  accountability. 

Finally,  professional  accountability  also  has  no 
settled  meaning.  Sometimes  it  refers  to  the  account¬ 
ability  that  members  of  an  expert  profession  owe  to 
each  other  (peer  accountability).  For  example, 
some  professionals,  such  as  doctors  and  lawyers, 
can  be  held  to  account  by  disciplinary  bodies 
drawn  from  the  profession  itself.  Less  formally, 
members  of  a  profession,  including  public  servants, 
may  be  monitored  and  called  to  account  through 
shared  networks  and  collegial  relationships. 
However,  professional  accountability  has  also  been 
applied  to  the  independent  exercise  of  ethical 
norms  of  a  particular  profession,  which  have  been 
internalized  through  socialization  into  the  profes¬ 
sion.  In  this  case,  professional  accountability  is 
being  identified  as  a  sense  of  professional  responsi¬ 
bility,  quite  detached  from  any  external  scrutiny. 

Mechanisms  of  Public  Sector  Accountability 

The  complex  accountability  structures  surround¬ 
ing  modern  governments  involve  a  wide  range  of 
institutional  mechanisms,  which  vary  according  to 


the  section  of  government  on  which  they  focus 
(i who ?),  the  type  of  issue  that  they  investigate  ( for 
what ?),  and  the  procedures  they  follow  (how}). 

Elections 

In  a  representative  democracy,  the  basic 
accountability  mechanism  is  the  general  election 
at  which  incumbent  executive  leaders  present 
themselves  to  the  voters  and  seek  a  renewal  of 
their  mandate  to  govern.  Elections  compel  elected 
politicians  to  explain  and  justify  their  actions  and 
give  the  citizens  the  opportunity  to  listen  and 
impose  a  verdict.  Elections  legitimize  the  control 
exercised  by  political  leaders  over  the  executive 
bureaucracy  and  thus  underpin  the  accountability 
of  agency  heads  to  ministers  and  of  junior  officials 
to  their  superiors  through  the  hierarchical  chain  of 
command.  Though  elections  may  be  held  at  infre¬ 
quent  intervals,  their  indirect  influence  on  govern¬ 
ment  accountability  is  immense,  through  the 
threat  of  future  of  retribution  imposed  on  unpop¬ 
ular  governments. 

Elections  may  be  powerful  but  they  are  also 
blunt.  Their  effectiveness  as  accountability  instru¬ 
ments  can  be  compromised  by  their  forward-looking 
function  of  selecting  an  incoming  government. 
Voters  may  be  deflected  from  sanctioning  an 
unpopular  government  by  the  perception  that  the 
alternative  leadership  is  even  less  palatable. 
Accountability  is  also  denied  where  incumbent 
leaders  do  not  stand  for  reelection  (most  notably  in 
the  case  of  second-term  United  States  presidents). 
Moreover,  because  elections  require  a  general 
judgment  over  a  whole  regime  and  its  overall  pro¬ 
gram,  they  do  not  allow  for  more  fine-grained 
accountability  on  particular  issues  or  decisions. 
Electioneering  is  dominated  by  general  slogans  and 
misleading  rhetoric  and  does  not  offer  much  scope 
for  accurate  information  or  serious  discussion. 

Elections  therefore  need  to  be  supplemented  by 
a  range  of  other  accountability  mechanisms  and 
should  not  be  seen  as  the  sole  instruments  of 
democratic  accountability,  as  is  the  tendency  in 
theories  of  democracy  built  around  the  single  act 
of  voting  and  electoral  choice.  Conversely,  regimes 
that  lack  elections,  though  undemocratic,  may  still 
offer  citizens  opportunities  to  hold  their  govern¬ 
ments  to  account,  for  instance,  through  legal  pro¬ 
cesses  or  complaints  procedures. 
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Legislative  Scrutiny 

Between  elections,  the  major  institution  of 
accountability  is  the  legislature.  Though  primarily 
defined  in  terms  of  their  lawmaking  functions, 
modern  legislatures  have  ceded  much  of  their  leg¬ 
islative  initiative  and  discretion  to  the  executive 
branch,  especially  in  parliamentary  democracies 
where  the  main  legislative  chamber  is  typically 
under  the  control  of  the  ruling  party.  In  compensa¬ 
tion,  legislatures  have  increasingly  emphasized 
their  accountability  role  as  the  main  forum  where 
the  executive  is  required  to  answer  to  the  public. 

Legislatures  hold  executives  to  account  through 
a  variety  of  avenues.  One  is  the  requirement  for 
regular  reporting  on  executive  activities.  All  gov¬ 
ernment  agencies  and  statutory  officials  are  obliged 
to  report  periodically  (usually  annually)  to  the 
legislature  on  their  general  performance. 
Information  provided  to  the  legislature  thereby 
enters  the  public  realm  and  is  available  for  general 
debate  and  discussion. 

Legislatures  also  have  the  right  to  question 
members  of  the  executive  and  subject  them  to  pub¬ 
lic  scrutiny.  In  Westminster-based  democracies, 
conventions  of  ministerial  responsibility  require 
ministers  to  answer  to  parliament  for  the  conduct 
of  their  departments,  to  provide  information  about 
decisions,  and,  where  necessary,  to  impose  reme¬ 
dies.  Similar  conventions  exist  in  most  parliamen¬ 
tary  democracies  where  ministers  are  accountable 
to  parliament.  Ministerial  responsibility  has  been  a 
topic  of  perennial  controversy,  mainly  because  of 
a  widespread  but  erroneous  belief  that  it  requires 
ministers  to  take  personal  responsibility  for  the 
actions  of  subordinates  and  resign  when  major 
mistakes  are  discovered  to  have  been  made  within 
their  departments.  Such  “sacrificial”  resignation  is 
rarely,  if  ever,  undertaken  and  the  doctrine  of 
“vicarious”  responsibility  on  which  it  is  based 
(that  ministers  are  personally  responsible  for  the 
acts  of  subordinates)  does  not  accord  with  prac¬ 
tice.  When  ministers  resign,  they  do  so  because  of 
personal  faults  of  their  own  making,  such  as  mis¬ 
leading  parliament  or  engaging  in  improper  or 
corrupt  conduct,  and  the  decision  is  usually  politi¬ 
cally  determined  in  terms  of  minimizing  damage  to 
the  government’s  popularity. 

Ministerial  resignation  is  a  side  issue  in  assess¬ 
ing  the  role  of  ministers’  responsibility  in  govern¬ 
ment  accountability.  Much  more  important,  and 


the  basis  of  the  effectiveness  of  ministerial  respon¬ 
sibility  as  an  accountability  mechanism,  are  the 
obligations  it  imposes  on  ministers  to  answer  to 
the  public  on  matters  of  public  concern,  either  in 
parliament  or  directly  through  the  media.  Though 
ministers  are  not  required  to  take  personal  respon¬ 
sibility  for  all  actions  of  their  subordinates,  they 
must  provide  information  and  justification  when 
asked.  Refusal  to  respond  publicly,  which  is  com¬ 
mon  among  leaders  of  private  organizations,  is 
politically  unacceptable  for  elected  politicians.  At 
the  same  time,  however,  while  avoiding  outright 
deceit  (which  remains  a  strong  ground  for  forced 
resignation),  ministers  can  readily  prevaricate  and 
avoid  disclosure  of  embarrassing  information. 
Moreover,  their  monopoly  of  the  right  of  public 
response  can  carry  the  corollary  that  departmental 
officials  remain  out  of  the  public  eye,  which  can 
shield  them  from  legitimate  public  scrutiny. 

To  circumvent  the  problem  of  bureaucratic  ano¬ 
nymity,  legislatures  also  question  public  officials 
directly,  usually  through  a  system  of  legislative 
committees.  The  practice  of  committee  scrutiny  is 
most  highly  developed  in  the  U.S.  Congress,  which, 
through  the  separation  of  powers,  is  an  active  part¬ 
ner  in  shaping  government  policy  and  has  a  legiti¬ 
mate  interest  in  overseeing  the  whole  conduct  of 
government  business.  In  parliamentary  systems, 
the  scope  of  questioning  is  somewhat  more  con¬ 
fined,  exempting  appointed  officials  from  answer¬ 
ing  on  issues  of  government  policy,  out  of  defer¬ 
ence  to  the  democratic  mandate  of  the  elected 
leaders.  Even  so,  restricting  questions  to  adminis¬ 
trative  matters  for  which  officials  are  more  imme¬ 
diately  responsible  still  allows  considerable  oppor¬ 
tunity  for  holding  the  bureaucracy  to  account. 

Courts 

All  governments  are  subject  to  legal  account¬ 
ability  through  the  courts  because  courts  determine 
whether  the  government  has  acted  within  the  law. 
The  operation  and  effect  of  this  power  vary  with  a 
country’s  legal  and  constitutional  structure.  One 
contrast  is  between  Anglo-American  countries, 
where  cases  involving  the  government  are  heard  in 
the  same  courts  as  civil  cases,  and  some  European 
countries,  notably  France,  where  a  completely 
separate  court  structure  is  reserved  for  cases 
involving  the  state.  Another  contrast  concerns  the 
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scope  of  judicial  review.  Where  the  constitution,  as 
in  the  United  States,  defines  and  limits  the  powers 
of  both  Congress  and  the  president,  the  courts 
become  a  forum  for  holding  the  government  gener¬ 
ally  accountable  across  a  wide  range  of  substantial 
policy  issues.  On  the  other  hand,  in  parliamentary 
democracies  such  as  the  UK,  where  few  constitu¬ 
tional  limits  are  placed  on  legislative  power, 
opportunities  for  challenging  policies  through  the 
courts  are  much  more  limited. 

Most  legal  cases  involving  the  government  are 
brought  by  individual  citizens  and  deal  with  par¬ 
ticular  decisions  affecting  them  that  have  been 
made  by  government  agencies.  It  is  open  to  the 
court  to  rule  whether  a  decision  was  taken  within 
the  powers  legally  conferred  on  the  government 
agency;  whether  the  citizen  received  natural  jus¬ 
tice,  in  terms  of  fair  procedure  and  due  process; 
and  whether  the  decision  itself  was  reasonable. 
Whether  courts  can  decide  on  the  actual  substance 
and  merits  of  a  decision  varies  according  to  the 
provisions  of  the  individual  legal  system.  Some 
systems  make  use  of  quasi-judicial  tribunals,  which 
operate  like  courts  though  with  a  generally  more 
relaxed  approach  to  procedure  and  which,  being 
technically  part  of  the  executive  branch,  are 
empowered  to  amend  executive  decisions. 

Legal  accountability,  like  litigation  generally, 
suffers  from  being  slow  and  expensive  and  is 
beyond  the  reach  of  most  people  for  most  issues. 
However,  its  availability  as  a  last  resort  is  crucial 
to  the  public’s  capacity  to  hold  governments  to 
account.  Like  the  rule  of  law  itself,  of  which  it  is  a 
key  element,  accountability  through  an  indepen¬ 
dent  and  honest  judiciary  is  the  foundation  of  all 
public  accountability. 

Auditors  and  Other  Monitoring  Agencies 

Governments  are  overseen  and  investigated  by  a 
range  of  special-purpose  accountability  agencies. 
Of  these  agencies,  the  most  long-standing  are  the 
offices  of  government  auditors  (variously  described 
as  “auditors  general”  and/or  “comptrollers  gen¬ 
eral”).  Their  traditional  function  has  been  the 
monitoring  of  government  finances  on  behalf  of 
the  legislature  to  see  whether  public  revenue  and 
expenditure  have  been  managed  according  to  legis¬ 
lative  authorization  and  according  to  standards  of 
public  probity  and  propriety.  The  historic  function 


of  “regularity  auditing”  for  financial  compliance 
has  more  recently  been  supplemented  by  “perfor¬ 
mance”  (“value  for  money,”  “efficiency,”  “com¬ 
prehensive”)  auditing  that  extends  to  assessing  the 
efficiency  and  effectiveness  of  government  pro¬ 
grams.  Performance  audits  usually  take  a  pro¬ 
gram’s  objectives  as  given  and  then  examine 
whether  these  objectives  have  been  achieved  and  at 
what  cost.  In  performance  auditing,  as  distinct 
from  regularity  auditing,  auditors  typically  lack 
any  powers  of  sanction  or  rectification  and  can 
simply  recommend  changes  and  improvement. 

In  general,  government  auditors  have  proved 
essential  in  maintaining  financial  integrity  in  gov¬ 
ernments.  Conversely,  the  absence  of  effective  audit 
is  a  key  indicator  of  weak  and  corrupt  government 
systems.  They  have  also  been  very  successful  in 
exposing  bureaucratic  waste  and  inefficiency.  Even 
though  they  cannot  mandate  any  remedies,  the 
authority  of  their  recommendations  and  the 
adverse  publicity  attached  to  the  exposure  of  seri¬ 
ous  waste  and  inefficiency  are  often  sufficient  to 
prompt  governments  to  follow  their  recommenda¬ 
tions  voluntarily. 

Besides  auditors,  other  investigating  bodies 
include  government  inspectors  and  ombudsmen. 
Inspectors  are  officials  established  within  particu¬ 
lar  government  departments  and  agencies  with  the 
function  of  improving  efficiency  and  effectiveness. 
Inspectors  have  been  employed  to  monitor  school 
and  prison  systems  as  well  as  government  depart¬ 
ments  in  areas  such  as  taxation,  defense,  and  secu¬ 
rity  where  bureaucratic  performance  is  a  matter  of 
particular  concern.  The  position  of  ombudsman, 
first  introduced  in  Scandinavia,  has  been  adopted 
worldwide  as  an  avenue  of  complaint  for  individ¬ 
ual  citizens  seeking  redress  in  connection  with 
particular  decisions.  Ombudsmen  usually  have  the 
power  to  investigate  and  recommend  but  not  to 
impose  remedies.  In  spite  of  this  limitation,  how¬ 
ever,  they  have  proved  an  effective  accountability 
mechanism  midway  between  individual  complaint 
and  full  legal  proceedings. 

Recent  decades  have  witnessed  an  “audit  explo¬ 
sion”  as  governments  and  government  agencies 
become  subject  to  increasing  supervision  by  regula¬ 
tory  agencies.  With  the  transfer  of  responsibility  for 
providing  public  goods  and  services  away  from 
government  departments  under  political  direction  to 
various  forms  of  arm’s-length  providers,  regulation 
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has  tended  to  replace  political  and  bureaucratic 
direction  as  the  means  of  making  public  service 
providers  publicly  accountable.  A  plethora  of  reg¬ 
ulatory  agencies  now  monitor  different  areas  of 
public  service  provision,  such  as  health  and  educa¬ 
tion,  or  different  aspects  of  government  activity, 
such  as  occupational  health  and  safety  or  human 
rights.  While  most  regulatory  bodies  are  public 
bodies,  some  are  privately  established  but  have 
been  granted  legal  powers — for  instance,  some 
consumers’  associations  and  animal  protection 
societies.  Other  private  monitoring  bodies  have  no 
legal  mandate  but  operate  more  informally  as 
observers  and  critics  of  government  activities. 
These  include  a  number  of  private  international 
watchdogs,  such  as  Greenpeace  and  Amnesty 
International  and  the  financial  ratings  agencies, 
Standard  and  Poor’s  and  Moody’s,  which  have  a 
great  impact  on  economic  policy  because  of  the 
impact  of  their  ratings  decisions. 

Freedom  of  Information  and  the  Media 

The  public  availability  of  information  held  by 
governments  may  be  only  the  initial  stage  of  a  full 
accountability  process  and  needs  to  be  followed  up 
by  discussion  and  then,  if  necessary,  by  rectifica¬ 
tion.  However,  once  such  information  is  released 
into  the  public  realm  it  can  readily  be  used  to 
spark  a  political  reaction  and  force  governments 
into  remedies.  Public  access  to  government  infor¬ 
mation  is  therefore  an  essential  component  of 
government  accountability  and  is  provided  by  a 
number  of  channels. 

One  such  channel  is  the  right  of  the  general 
citizen  to  seek  access  to  information,  both  personal 
information  held  about  them  as  individual  citizens 
and  general  information  about  government  policy. 
Rights  of  freedom  of  information  are  found  in 
most  established  democracies,  with  the  United 
States  having  led  the  way  and  the  UK  being  a 
reluctant  latecomer.  Certain  exemptions  usually 
apply,  on  grounds  such  as  national  security,  cabi¬ 
net  confidentiality,  commercial  confidentiality, 
and  protection  of  legal  proceedings.  Financial 
charges  can  also  be  imposed,  especially  on  matters 
of  general  interest,  with  the  result  that  information 
is  generally  sought  by  well-resourced  journalists  or 
by  organized  groups  with  political  interests  rather 
than  by  individual  citizens. 


The  various  media  outlets,  both  print  and  elec¬ 
tronic,  also  help  spread  information  and  stimulate 
debate.  In  part,  their  function  is  strictly  intermedi¬ 
ary,  relaying  to  a  wider  public  news  items  and 
arguments  supplied  by  others.  However,  they  also 
play  an  independent  role  in  instigating  inquiries 
and  conducting  investigations.  Though,  for  the 
most  part,  privately  owned  and  not  formally  part 
of  the  machinery  of  government,  the  media  are 
essential  to  effective  accountability  in  large-scale 
modern  states.  Indeed,  a  free  press,  along  with 
elections  and  an  independent  judiciary,  has  been 
acknowledged  as  one  of  the  key  institutions  in 
securing  an  accountable  government. 

Intra-Organizational  Accountability 

In  addition  to  being  externally  accountable, 
government  agencies,  like  all  organizations,  also 
exhibit  internal  structures  of  accountability 
whereby  different  members  or  sections  are  account¬ 
able  to  others  within  the  organization.  Indeed, 
from  the  perspective  of  individual  officials,  organi¬ 
zational  accountability  upward  through  the  chain 
of  bureaucratic  command  is  often  the  most  imme¬ 
diate  and  salient  form  of  accountability  in  their 
daily  activities.  To  assist  in  reinforcing  upward 
accountability,  organizations  often  impose  their 
own  in-house  versions  of  independent  scrutiny, 
such  as  internal  audit  or  inspection,  which  mirrors 
and  anticipates  the  financial  monitoring  of  exter¬ 
nal  auditors  and  inspectors. 

Accountability  of  individual  officials  to  their 
superiors  is  an  essential  element  in  the  overall 
democratic  accountability  of  executive  govern¬ 
ment.  If  elected  leaders  are  to  be  accountable  for 
the  actions  of  bureaucrats,  then  they  must  be  able 
to  rely  on  their  bureaucrats’  willingness  to  take 
direction  and  answer  for  their  actions. 
Accountability  upward  is  thus  a  corollary  of  con¬ 
trol  downward.  Conversely,  any  slippage  in  con¬ 
trol  from  above  is  typically  reflected  in  lack  of 
accountability  upward. 

Weakness  in  upward  accountability  may  be 
symptomatic  of  the  well-known  propensity  of 
bureaucrats  to  pursue  their  own  policy  agendas 
and  apply  their  own  judgment  against  the  wishes 
of  their  superiors.  It  may  also  come  from  contrary 
accountability  pressures.  “Bottom-up”  views  of 
administration  legitimate  the  direct  accountability 
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of  street-level  bureaucrats  to  their  clients  among 
the  public  and  undermine  the  authority  of  organi¬ 
zational  superiors.  From  this  perspective,  account¬ 
ability  through  the  “top-down”  chain  of  bureau¬ 
cratic  command  becomes  an  unwelcome  constraint 
to  be  managed  rather  than  a  mandate  to  be 
respected.  Bottom-up  versions  of  accountability 
can  claim  their  own  democratic  legitimacy  through 
answerability  directly  to  the  members  of  the  public 
rather  than  indirectly  through  the  public’s  elected 
representatives. 

Systems  of  Democratic  Accountability 

Democratic  accountability  can  be  viewed  from 
various  perspectives,  from  that  of  the  citizen  wish¬ 
ing  to  hold  his  or  her  government  to  account,  from 
that  of  the  elected  politicians  responding  to  the  vot¬ 
ers,  or  from  that  of  the  government  official  answer¬ 
ing  to  political  masters  and  to  the  public.  Depending 
on  one’s  position  and  interests,  different  account¬ 
ability  mechanisms  claim  the  most  attention. 
Politicians  are  focused  primarily  on  elections  and 
the  media,  whereas  public  servants  are  more  con¬ 
cerned  with  accountability  to  superiors  in  the  hier¬ 
archy.  Citizens  in  strife  with  government  agencies 
will  be  looking  to  grievance  procedures  and 
ombudsmen.  In  all  cases,  the  structure  of  available 
accountability  mechanisms  appears  complex  and 
untidy,  incapable  of  being  reduced  to  tidy  diagrams 
or  flowcharts  without  oversimplification.  Structures 
of  accountability  are  inherently  pluralistic  and  are 
better  described  as  “webs,”  or  possibly  “systems,” 
to  indicate  their  complexity  and  fluidity. 

The  various  accountability  agencies  and  pro¬ 
cesses  in  webs  of  democratic  accountability  exhibit 
widely  different  functions.  Some  accountability 
processes,  such  as  elections,  focus  on  overall  per¬ 
formance  of  government,  while  others,  such  as 
financial  audit,  concentrate  on  details  of  adminis¬ 
tration.  Politicians  and  the  media  home  in  on 
politically  controversial  and  sensational  issues. 
Auditors  and  inspectors,  by  contrast,  tend  to  delib¬ 
erately  steer  clear  of  political  controversy  and 
instead  concentrate  on  more  humdrum  areas  of 
administration,  which  may  appear  dull  but  are 
often  the  site  of  major  inefficiencies. 

Some  agencies,  such  as  courts,  exercise  the  full 
range  of  accountability  functions,  including 
information,  discussion,  and  rectification,  while 


others,  such  as  the  media,  are  limited  to  the 
transparency  functions  of  information  and  dis¬ 
cussion.  It  is  for  this  reason  that  democratic 
accountability  cannot  be  accurately  represented 
by  a  chain  of  principal-agent  relationships,  given 
such  relationships  always  imply  the  principal’s 
right  to  impose  sanctions.  Standard  structures  of 
democratic  accountability  certainly  involve 
account  holders  with  the  power  to  impose  reme¬ 
dies  and  sanctions.  This  power  may  be  located 
with  a  number  of  different  actors.  The  basic  right 
of  rectification  lies  with  citizens  as  voters  who 
have  the  capacity  to  remove  their  elected  repre¬ 
sentatives.  But  other  public  officers  and  institu¬ 
tions  can  also  compel  compliance  on  those 
accountable  to  them.  Elected  political  leaders  can 
direct  their  departments  and  agency  heads  in 
relation  to  their  subordinates.  Courts  and  tribu¬ 
nals  can  issue  binding  decisions. 

However,  not  all  agencies  of  accountability  pos¬ 
sess  the  power  of  rectification.  Indeed,  many  of  the 
most  effective  accountability  agencies,  including 
legislative  committees,  auditors,  ombudsmen,  and 
the  media,  can  only  investigate  and  recommend. 
Their  effectiveness  depends  on  their  capacity  to 
exert  pressure  on  account  holders  with  rectificatory 
powers,  particularly  executive  leaders,  to  impose 
remedies  in  response  to  adverse  publicity,  backed 
by  the  ultimate  electoral  sanction.  Democratic 
accountability  can  thus  operate  as  an  interlocking 
system  with  some  institutions  specializing  in  infor¬ 
mation  and  investigation  while  others  are  relied  on 
for  rectification.  While  there  may  be  grounds  for 
identifying  elections  as  the  defining  accountability 
mechanism  in  representative  democracies,  they  are 
far  from  being  the  only  such  mechanism. 

A  key  factor  in  any  web  of  accountability  is 
whether  each  accountability  agency  charged  with 
holding  the  government  to  account  is  itself  publicly 
accountable  for  performing  its  accountability  func¬ 
tions.  Legislators,  judges,  auditors,  ombudsmen, 
regulators,  and  so  on  need  to  be  subjected  to  scru¬ 
tiny  themselves  to  prevent  them  from  becoming  lax 
or  corrupt.  Most  officials  in  such  scrutinizing  posi¬ 
tions  are  granted  a  degree  of  statutory  autonomy 
to  help  them  maintain  an  independent  stance 
against  the  executive  power  that  they  must  call  to 
account.  At  the  same  time,  however,  this  indepen¬ 
dence  is  itself  open  to  abuse.  Transparency,  media 
scrutiny,  and  the  ultimate  power  of  arraignment 
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and  dismissal  are  essential  buttresses  of  profes¬ 
sional  integrity.  The  guardians  must  be  guarded. 

Accountability  in  International  Relations 

Most  analysis  of  government  accountability  has 
taken  place  within  the  context  of  nation-states, 
where  the  defining  element  of  legal  and  political 
sovereignty  provides  a  framework  of  effective 
sanctions  to  support  other  accountability  mecha¬ 
nisms  and  where  elections  provide  the  people  with 
the  ultimate  sanction.  In  the  international  sphere, 
however,  the  absence  of  effective  sovereign  power 
and  elections  raises  the  specter  of  an  accountability 
vacuum.  How  can  international  organizations 
such  as  the  United  Nations  (UN)  or  the  World 
Bank  be  held  accountable  for  serving  the  interests 
of  the  international  community  if  that  community 
is  not  capable  of  enforcing  sanctions  on  them? 

In  practice,  accountability  in  the  international 
sphere,  though  weaker  than  in  many  nation-states, 
is  far  from  nonexistent.  In  some  areas  such  as  trade 
policy  and  criminal  behavior,  international  law 
can  offer  effective  remedies  backed  up  by  the  coer¬ 
cive  power  of  nation-states.  International  organi¬ 
zations  are  subject  to  accountability  mechanisms, 
such  as  audit,  review,  and  media  scrutiny,  as  well 
as  being  answerable  to  the  national  governments 
that  underwrite  them.  The  UN  bureaucracy, 
though  often  incompetent  and  corrupt,  can  be 
exposed  to  detailed  investigation,  as  over  its  han¬ 
dling  of  the  Oil-for-Food  Programme,  and  can  be 
pressured  to  institute  reforms.  Rectification  tends 
to  be  ineffective,  because  of  the  apathy  and  dis¬ 
unity  among  leading  members  of  the  Security 
Council,  the  body  to  which  the  bureaucracy  is 
accountable.  But,  with  more  determination  from 
member  states,  the  accountability  deficit  could  be 
significantly  reduced.  As  it  stands,  the  accountabil¬ 
ity  deficit  in  the  UN  bureaucracy  is  probably  no 
greater  than  that  found  in  the  national  bureaucra¬ 
cies  of  most  of  the  member  states  themselves. 

The  inherent  contrast  between  accountability  in 
nation-states  and  in  the  international  arena  should 
therefore  not  be  overstated.  Both  are  pluralistic  in 
structure  and  involve  a  range  of  accountability 
mechanisms,  many  of  which,  such  as  the  media  and 
NGO  (nongovernmental  organization)  watchdogs, 
operate  at  both  levels.  The  significant  lack  of  effec¬ 
tive  legal  and  political  sanctions  at  the  international 


level  must  be  conceded.  But  so  too  must  the  poten¬ 
tial  for  improved  international  accountability  short 
of  a  united  system  of  world  government. 

Single  Versus  Multiple  Accountability 

Should  the  duties  of  public  accountability,  whether 
for  government  as  a  whole  or  for  particular  gov¬ 
ernment  agencies,  be  concentrated  in  a  single  per¬ 
son  or  dispersed  among  a  number  of  different 
people?  The  main  argument  for  concentration  lies 
in  the  value  of  having  a  designated  person,  usually 
the  leader  or  agency  head,  who  is  obliged  to  take 
collective  responsibility  and  to  answer  to  the  pub¬ 
lic,  particularly  in  times  of  crisis  or  government 
failure.  Where  responsibility  and  accountability 
are  dispersed  between  members  of  a  group  or 
between  different  agencies,  officials  and  politicians 
can  easily  shift  the  blame  to  others,  with  the  result 
that  no  one  accepts  an  obligation  to  answer  to  the 
public.  A  single  point  of  accountability,  by  con¬ 
trast,  makes  buck  passing  much  more  difficult. 

On  the  other  hand,  multiple  avenue  of  account¬ 
ability  can  provide  the  public  with  greater  opportu¬ 
nities  for  extracting  information  from  government 
and  for  holding  government  officials  up  to  scrutiny. 
A  leader  exercising  sole  powers  of  accountability, 
while  less  open  to  buck  passing,  is  better  able  to 
resist  embarrassing  inquiry  into  the  actions  of  sub¬ 
ordinates  and  to  cover  up  mistakes.  In  practice, 
most  collective  actions  involve  the  responsibility  of 
many  individuals  (“the  problem  of  many  hands”), 
and  the  attempt  to  hold  only  one  person  responsi¬ 
ble  and  accountable  obscures  the  reality  of  how 
bureaucracies  operate. 

In  general,  multiple  avenues  of  accountability 
appear  superior  for  the  initial  stages  of  account¬ 
ability,  for  revealing  information,  and  for  encour¬ 
aging  the  scrutiny  of  government.  Single  points  of 
accountability,  however,  are  more  effective  for 
taking  charge  and  imposing  remedies.  The  differ¬ 
ing  U.S.  and  UK  constitutions  illustrate  the  con¬ 
trast.  The  United  States,  with  its  separation  of 
powers  and  multiple  points  of  authority,  provides 
a  very  open  and  transparent  system  of  government 
but  one  where  solutions  are  hard  to  impose.  The 
United  Kingdom,  on  the  other  hand,  concentrates 
power  in  the  prime  minister  and  cabinet  under 
tight  conventions  of  ministerial  responsibility, 
which  have  preserved  executive  secrecy  but  allowed 
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effective  rectification  when  problems  come  to 
light.  A  similar  contrast  can  be  found  in  the 
accountability  of  federal  systems  compared  with 
that  of  unitary  systems.  Federations  provide  more 
avenues  for  inquiry  but  lack  a  single  point  of 
responsibility  and  are  prone  to  blame  shifting. 

These  dilemmas  remain  intractable.  The  best 
solution  is  to  try  to  combine  multiple  avenues  for 
scrutiny  with  a  single  authority  for  rectification, 
thus  drawing  on  the  virtues  of  each  approach. 
Indeed,  pluralistic  democratic  systems  of  account¬ 
ability  can  be  viewed  in  this  light.  Ideally,  they 
combine  many  different,  complementary  mecha¬ 
nisms  of  scrutiny  with  a  few  clear  points  of  unam¬ 
biguous  control  and  direction. 

Accountability  in  Networks 

“Networks”  are  an  increasingly  important  feature 
of  modern  government  posing  particular  problems 
of  accountability.  Networks  may  be  understood  as 
structures  of  collective  action  and  decision  making 
in  which  formally  independent  groups  or  individu¬ 
als  cooperate  for  shared  purposes.  They  are  com¬ 
monly  contrasted  both  with  hierarchies,  where  the 
members  are  linked  by  formal  control  structures 
and  in  superior-subordinate  relationships,  and 
with  markets,  where  self-interested  parties  are 
linked  through  formal  agreements  and  contracts. 
Network  members  share  the  formal  independence 
of  market  players  while  cooperating  in  shared  val¬ 
ues  and  objectives  over  a  substantial  period  of  time. 

The  concept  of  network  has  come  to  promi¬ 
nence  in  the  analysis  of  modern  systems  of  gover¬ 
nance  (to  use  a  closely  associated  term)  for  various 
reasons.  First,  it  signifies  in  part  the  long-standing 
aspects  of  all  political  systems,  particularly  the 
more  informal  cross-institutional  relationships, 
which  tended  to  be  overlooked  in  more  traditional 
institutional  analysis  and  which  now  appear  wor¬ 
thy  of  much  more  careful  study.  For  instance, 
relationships  between  different  levels  of  govern¬ 
ment,  central  and  local,  federal  and  state,  have 
always  relied  heavily  on  informal  partnerships  and 
negotiation.  Second,  government  systems  are  mak¬ 
ing  more  use  of  arms-length  institutions,  such  as 
executive  agencies  and  private  organizations,  for 
the  delivery  of  public  services.  Relations  between 
the  purchasing  government  and  the  provider  orga¬ 
nizations  are  neither  hierarchical  nor  market  based 


but  instead  depend  more  on  network  characteris¬ 
tics  of  partnerships,  trust,  and  agreed  values. 

Networks  cause  accountability  problems 
because  responsibility  for  collective  action  is  shared 
between  a  number  of  different  parties,  giving  rise 
to  the  classic  buck  passing  associated  with  multiple 
accountability.  Within  the  network  itself,  admit¬ 
tedly,  relationships  of  trust  and  common  interest 
may  lead  to  mutual  accountability  and  responsive¬ 
ness.  But  for  outsiders  wishing  to  hold  the  network 
to  account  for  its  actions,  the  blurring  of  responsi¬ 
bility  is  problematic.  Particularly  when  mistakes 
are  made,  interested  members  of  the  public  are 
often  unable  to  gain  satisfactory  answers  because 
the  various  members  of  the  network  can  shift  the 
blame  to  each  other. 

Outsourcing  of  public  services  to  stand-alone 
agencies  has  been  bedeviled  by  accountability  dif¬ 
ficulties  as  politicians  blame  providers  for  perfor¬ 
mance  failures  and  providers  respond  that  the 
government  has  given  them  insufficient  resources. 
The  accountability  deficit  often  increases  if  the  pro¬ 
viders  are  in  the  private  sector  and  not  accustomed 
to  the  level  of  public  scrutiny  applied  to  the  public 
sector.  Commercial  contractors  plead  commercial 
confidentiality  as  a  reason  for  concealing  their 
internal  operations  and  are  often  exempt  from 
investigation  by  legislative  committees  or  ombuds¬ 
men.  Nonprofit  organizations  rely  heavily  on  the 
conscientiousness  of  their  staff,  many  of  whom  are 
volunteers,  and  are  particularly  reluctant  to  face 
demands  for  information  and  rectification. 

Accountability  and  the 
New  Public  Management 

The  international  public  sector  reform  movement 
of  the  1980s  and  1990s,  known  as  “the  new  public 
management,”  though  primarily  aimed  at  improv¬ 
ing  public  sector  efficiency  and  effectiveness,  also 
included  an  accountability  agenda.  The  move¬ 
ment’s  main  assumption  was  that  the  public  sector 
was  less  efficient  than  the  commercial  private  sec¬ 
tor  and  needed  to  move  closer  to  private  sector 
management  methods.  With  respect  to  account¬ 
ability,  the  most  serious  deficiency  of  the  public 
sector  compared  with  the  commercial  private  sec¬ 
tor  is  perceived  to  be  its  lack  of  clear  objectives. 
Private  companies  have  a  clear  and  quantifiable 
“bottom  line,”  the  maximizing  of  shareholder 
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value,  which  provides  a  clear  focus  for  the  perfor¬ 
mance  and  accountability  of  managers.  Government 
agencies,  by  contrast,  often  have  little  sense  of 
their  objectives  and  lack  clear  criteria  for  judging 
the  performance  of  officials  and  their  agencies  and 
for  holding  them  to  account. 

Another  deficiency  noted  in  public  sector  account¬ 
ability  is  bureaucrats’  comparative  lack  of  concern 
for  serving  the  members  of  the  public  with  whom 
they  have  direct  dealings.  In  contrast  to  those  offer¬ 
ing  goods  and  services  for  sale  in  a  competitive 
market — who  must  focus  on  their  customers’ 
preferences — bureaucrats  can  exploit  their  monop¬ 
oly  position  and  remain  largely  unaccountable  to  the 
members  of  the  public  they  are  supposed  to  serve. 

In  other  respects,  however,  the  public  sector  is 
seen  as  laboring  under  excessive  accountability 
burdens.  For  instance,  lack  of  accountability  for 
results  is  typically  balanced  by  much  more  account¬ 
ability  for  following  set  procedures  than  is  found 
in  the  private  sector.  This  accountability  for  pro¬ 
cess  encourages  red  tape  and  discourages  manage¬ 
rial  initiative.  At  the  same  time,  the  managerial 
efficiency  of  public  sector  managers  is  also  stifled 
by  the  constant  threat  of  interference  in  their  deci¬ 
sions  by  their  political  masters  who  are  themselves 
responding  to  the  accountability  demands  of  the 
general  public.  Because  political  leaders  can  be 
held  publicly  accountable  for  any  action  taken  by 
their  departmental  officials,  the  first  imperative  on 
all  loyal  bureaucrats  is  to  save  their  masters  from 
political  embarrassment,  even  if  organizational 
efficiency  is  compromised  thereby. 

The  managerial  accountability  agenda  of  the 
NPM  reformers  therefore  includes  a  number  of 
interlocking  strategies.  First,  objectives  are  to  be 
clarified,  and  managers  are  to  be  held  accountable 
in  terms  of  achieving  measurable  outputs.  Second, 
political  accountability  through  elected  political 
leaders  is  to  be  confined  to  the  setting  of  broad 
objectives  and  outcomes,  leaving  responsibility 
and  accountability  for  outputs  with  arms-length 
managers  who  are  quarantined  from  day-to-day 
political  interference  and  are  accountable  to  inde¬ 
pendent  regulators  rather  than  to  politicians. 
Third,  service  providers  are  to  be  made  more 
directly  accountable  to  individual  citizens,  viewed 
as  clients  or  customers. 

The  reform  agenda  has  had  considerable 
impact  on  systems  of  bureaucratic  accountability. 


Most  government  agencies  now  report  perfor¬ 
mance  in  terms  of  objectives,  including  outcomes 
and  outputs,  and  much  effort  has  been  directed 
toward  designing  performance  measures,  partic¬ 
ularly  in  service  agencies  and  in  public  health  and 
education.  The  widespread  decoupling  of  service 
provision  from  direct  political  control,  through 
executive  agencies  and  outsourcing,  has  raised 
the  accountability  profile  of  many  agency  heads 
and  nongovernment  service  providers,  while 
removing  some  internal  organizational  matters 
from,  public  scrutiny.  The  “audit  explosion,” 
signified  by  the  creation  of  new  monitoring  and 
regulatory  bodies,  reflects  the  move  away  from 
political  accountability  to  independent  regula¬ 
tion.  Moves  toward  greater  client  focus,  includ¬ 
ing  the  service  charter  initiatives,  pioneered  in 
the  UK,  have  done  much  to  reorient  frontline 
bureaucrats  toward  a  more  user-friendly  service 
culture. 

At  the  same  time,  however,  traditional  public 
sector  accountability  practices  have  proved  much 
more  resistant  to  change  than  the  reformers  had 
hoped,  largely  because  of  entrenched  public  expec¬ 
tations  about  accountability.  The  managerialist 
injunction  to  restrict  the  politicians’  accountability 
for  general  objectives  and  to  delegate  accountabil¬ 
ity  for  implementation  to  agency  heads  and  man¬ 
agers  has  proved  politically  unworkable.  Members 
of  the  public  and  the  media  will  not  readily  accept 
what  they  see  as  unjustifiable  blame  shifting  from 
leaders  to  subordinates  or  contractors.  Leaders 
become  inevitably  drawn  into  discussion  of  admin¬ 
istrative  details:  making  bureaucrats,  correspond¬ 
ingly,  remain  highly  sensitive  to  political  direction. 
Private  contractors  become  similarly  adept  at 
anticipating  political  pressures  if  they  want  their 
contracts  renewed.  Hopes  that  public  servants 
would  be  less  process  driven  have  not  been  ful¬ 
filled.  In  particular,  the  continuing  role  of  courts 
and  quasi-judicial  tribunals  in  reviewing  adminis¬ 
trative  decisions  has  maintained  a  strong  demand 
for  due  process.  The  public  sector  continues  to  be 
held  to  higher  procedural  standards  through  the 
principles  of  natural  justice,  which  are  jealously 
protected  by  the  courts. 

Richard  Mulgan 
Australian  National  University 
Canberra,  Australia 
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See  also  Audit  Society;  Auditing;  Bureaucracy;  Contract 
Theory;  Networks;  New  Public  Management; 
Performance  Management;  Principal-Agent  Theory; 
Regulation;  Responsibility;  Responsiveness;  Rule  of 
Law 
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Electoral  accountability  is  best  interpreted  as  a 
three-phase  process  and  a  feedback.  Political 
actors  looking  for  election  will  first  try  to  take 
into  account  the  preferences  of  the  voters.  If 
elected,  they  will  then  keep  into  account  what 
they  have  heard  and  learned  during  the  electoral 
campaign.  Returning  to  the  voters,  they  will  give 
an  account  of  their  behavior  and  their  perfor¬ 
mance.  Even  those  representatives  who  do  not  run 
again  will  retain  some  electoral  accountability, 
because  none  of  them  would  want  to  ruin  the  elec¬ 
toral  chances  of  his  or  her  party  and  successor. 

Electoral  accountability  lies  at  the  heart  of  all 
processes  of  democratic  representation.  It  is  a  com¬ 
plex,  multilayered  phenomenon  entailing  three 
quite  distinct,  though  interrelated,  phases  and  sev¬ 
eral  individual  and  collective  actors.  It  is  signifi¬ 
cantly  affected  and,  in  some  ways,  shaped  by  the 
electoral  system  and,  more  generally,  by  the  insti¬ 
tutional  arrangements. 

Accountability  During 
the  Electoral  Campaign 

Electoral  accountability  begins  when  a  relation¬ 
ship  is  established  between  the  voters  and  their 
potential  representatives,  candidates,  and  parties. 
During  the  electoral  campaign,  the  candidates  and 
the  parties  have  an  interest  in  getting  to  know 
what  the  voters  need  and  want,  which  preferences 
and  values  they  have,  and  which  ideals  they  would 
promote.  In  this  phase,  accountability  manifests 
itself  as  the  most  conscious  effort  by  candidates 
and  parties  to  learn,  that  is,  to  take  into  account 
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what  the  voters  communicate  to  them.  Obviously, 
there  are  two  limits  to  this  process.  First,  the  voters 
may  not  have  a  very  precise  idea  about  their  prefer¬ 
ences  and  interests  and  may  not  know  exactly  how 
to  communicate  them  to  the  candidates  and  the 
parties.  Second,  the  candidates  and  the  parties  may 
discount  the  preferences  and  the  values  of  some  or 
several  groups  of  voters  or  may  be  unable  to  intro¬ 
duce  them  into  their  predisposed  programmatic 
package  and,  to  a  much  lesser  extent,  to  their  ideo¬ 
logical  views.  On  the  whole,  however,  an  effort 
will  certainly  be  made  by  them  to  take  into  account 
what  they  have  heard  and  have  come  to  know. 

Usually,  a  difference  is  drawn  between  a  pro¬ 
portional  electoral  system  used  in  relatively  large 
constituencies  with  no  possibility  of  casting  a  pref¬ 
erence  vote  and  plurality-majority  electoral  sys¬ 
tems  applied  in  single-member  constituencies. 
Understandably,  in  the  first  case,  much  or  most  of 
the  electoral  accountability  depends  on  the  ability 
and  the  willingness  of  the  political  parties  to  take 
into  account  what  they  have  heard  during  the  elec¬ 
toral  campaign,  and  much  of  the  electoral  account¬ 
ability  will  then  be  projected  to  the  national  level. 
When  plurality-majority  systems  operate  in  single¬ 
member  constituencies,  individual  candidates  are 
the  protagonists  of  electoral  accountability  (Bruce 
Cain,  John  Ferejohn,  &  Morris  Fiorina,  1987).  It 
is  up  to  them  to  interact  with  the  voters,  to  learn 
as  well  as  to  explain,  to  take  into  account  what 
they  have  been  communicated  by  the  voters,  and 
also  to  carry  that  knowledge  into  the  representa¬ 
tive  assembly.  The  candidates’  ability  to  learn 
about  the  preferences  of  the  voters  and  to  shape 
them  may  make  a  difference  in  the  outcome  of  the 
electoral  processes  and,  possibly,  in  the  type  of 
politics  and  policies  proposed  and  later  imple¬ 
mented  by  their  party  or  governmental  coalition. 

Accountability  in  the 
Implementation  of  Policies 

Electoral  accountability  does  not  end  with  the 
holding  of  the  elections  and  their  outcomes.  Once 
installed  inside  the  representative  assembly  at  any 
level  of  the  political  system,  elected  representatives 
and  their  parties  must  try  to  translate  their  propos¬ 
als  into  policies.  Again,  there  is  a  difference  in  the 
approach  taken  by  representatives  elected  through 
a  proportional  system  in  large  constituencies,  even 


more  so  when  there  is  no  preference  vote,  and  rep¬ 
resentatives  elected  in  single-member  constituencies 
through  a  plurality-majority  electoral  system.  In 
the  first  case,  party  leaders  will  decide  which  poli¬ 
cies  to  implement,  usually  claiming  to  have  received 
a  mandate  and  remaining  somewhat  indifferent  to 
positions  articulated  by  their  individual  representa¬ 
tives.  Party  leaders  will  keep  into  account  what 
they  believe  has  been  the  response  of  the  voters  to 
their  overall  programmatic  offer.  As  to  representa¬ 
tives  elected  in  single-member  constituencies,  no 
doubt  quite  a  number  of  them  will  certainly  sup¬ 
port  their  party’s  policies.  Flowever,  not  a  small 
number  of  those  elected  representatives  will  indeed 
attempt  to  introduce  into  the  political  discourse 
and  the  policy  debate  what  they  have  learned  from 
the/their  voters.  In  addition  to  some  general  prefer¬ 
ences  and  interests  expressed  by  their  voters  and 
largely  compatible  with  their  party’s  program/ 
manifesto,  elected  representatives  will  seriously 
attempt  to  keep  into  account  specific  preferences 
and  interests  concerning  their  single-member  con¬ 
stituencies  and  their  voters,  or  all  of  them.  Indeed, 
the  working  of  a  specific  representative  assembly 
will  be  significantly  affected  by  the  degree  of  polit¬ 
ical  discretionality  and  electoral  power  of  its  indi¬ 
vidual  members,  especially  those  elected  in  single¬ 
member  constituencies  (Fleinz  Eulau,  1986). 

Accountability  and  Reelection 

Neither  political  debates  nor  electoral  accountabil¬ 
ity  comes  to  an  end  in  a  representative  assembly. 
All  party  leaders  and  elected  representatives,  gov¬ 
ernmental  office  holders,  and  oppositionists  are 
fully  aware  that  public  opinion  has  its  (more  or 
less  open  and  informed)  eyes  on  them,  that  it  scru¬ 
tinizes  their  behavior,  its  conformity  to  the  prom¬ 
ises  and  the  programs,  and  its  consequences.  One 
way  or  another,  all  the  protagonists,  voters 
included,  know  perfectly  well  that  in  democratic 
regimes  the  voters  will  periodically  pass  judgment 
on  the  policies  that  they  have  approved  or  opposed 
and  on  their  behavior  while  in  office.  Flence,  when 
the  time  comes  for  new  elections,  elected  represen¬ 
tatives  and  party  leaders  cannot  escape  from  per¬ 
forming  another  task  fully  belonging  to  account¬ 
ability.  They  will  have  to  explain  to  the  voters 
what  they  have  done,  not  done,  or  poorly  done. 
They  will  be  obliged  to  give  an  account  of  their 
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performance  in  their  roles.  It  is  of  decisive  impor¬ 
tance  that  elected  representatives  and  party  leaders 
accept  full  responsibility  for  their  performance  or 
lack  of  it.  Indeed,  often,  but  inadequately,  electoral 
accountability  is  considered  mechanically  limited 
only  to  this  specific  phase:  reward  (reelection)  and 
punishment  (defeat).  No  doubt,  depending  on  the 
strength  and  the  independence  of  the  mass  media 
and  on  the  degree  of  vibrancy  and  robustness  of 
civil  society,  the  politicians  in  office  aiming  at  reac¬ 
quiring  their  positions  will  be  held  accountable. 
Technically,  the  process  through  which  representa¬ 
tives  and  party  leaders  launch  their  reelection  bid 
is  called  feedback.  It  introduces  old  and  new 
issues,  renewing  the  circuit  of  accountability  that 
depends  on  what  was  done  and  has  to  be  reformed, 
and  what  was  set  aside  but  must  be  taken  again 
into  account. 

Seen  from  this  perspective — that  is,  focused  on 
the  activities,  the  perceptions,  the  performance 
of  parties  as  well  as  individual  representatives — 
electoral  accountability  is  a  never-ending  game. 
However,  it  is  true  that  from  time  to  time  indi¬ 
vidual  representatives  do  voluntarily  retire.  It  is 
also  true  that  in  some  political  systems  for  some 
political  offices,  there  are  term  limits  that  oblige 
those  office  holders  to  exit  from  politics.  One  may 
conjecture  that  “lame  duck”  politicians  will  feel 
less  inclined  to  take  electoral  accountability  into 
account  in  their  behavior.  They  will  not  be  asked 
by  the  voters  to  explain  their  behavior.  Hence,  at 
least  in  theory,  they  will  run  few  risks  if  and  when 
they  behave  in  an  irresponsible  way.  What  is  miss¬ 
ing  in  all  the  statements  concerning  the  potential 
irresponsibility  of  office  holders  who  are  not  con¬ 
strained  by  the  imperatives  of  reelection  is  the 
relationship  between  those  representatives  and 
their  corresponding  parties.  Though  there  is  no 
specific  research  on  the  motivations  of  the  outgo¬ 
ing  representatives  or  on  the  ability  of  party  lead¬ 
ers  to  enforce  on  them  the  “ethics”  of  accountabil¬ 
ity,  it  does  not  seem  farfetched  to  suggest  that 
something  of  the  kind  is  and  always  has  been  at 
work  in  most  cases.  That  is,  practically  no  outgo¬ 
ing  representative  has  deliberately  chosen  to  behave 
in  an  indifferent  manner — more  precisely,  not  car¬ 
ing  at  all  about  the  relationship  between  his  or  her 
promises  and  his  or  her  behavior,  thus  negatively 
affecting  the  chances  of  election  of  his  or  her 
party’s  candidates. 


Accountability  in  Proportional  Versus 
Majoritarian  Electoral  Systems 

A  larger  issue  looms  with  regard  to  electoral 
accountability,  though  to  be  more  precise,  perhaps 
one  ought  to  speak  of  political  representation. 
Though  the  comparative  study  by  G.  Bingham 
Powell  (2000)  is  excellent,  unfortunately  it  does 
not  tackle  the  various  processes,  phases,  and  inter¬ 
pretations  of  electoral  accountability.  Rather,  it 
refers  to  what  kind  of  representation  is  overall 
provided  by  proportional  systems  (in  which  a 
party’s  share  of  seats  is  determined  by  its  share  of 
votes)  versus  majoritarian  electoral  systems  (in 
which  a  candidate  must  win  a  plurality  or  an  abso¬ 
lute  majority  of  votes  to  be  elected).  Assemblies 
elected  through  proportional  electoral  systems  and 
coalition  governments  may  provide  for  the  repre¬ 
sentation  of  a  wider  spectrum  of  opinions,  inter¬ 
ests,  and  preferences.  Using  the  terminology  above, 
they  have  the  possibility  of  taking  into  account  a 
greater  number  of  opinions,  interests,  and  prefer¬ 
ences  in  the  first  phase  of  the  electoral  process. 
Statically,  they  may  almost  come  to  “mirror” 
them.  Not  much  is  learned,  however,  when  it 
comes  to  the  phase  in  which  those  opinions,  inter¬ 
ests,  and  preferences  will  have  to  be  kept  into 
account.  The  representatives  of  the  various  parties 
and  their  leaders  will  attempt  to  have  their  way, 
and  the  ensuing  bargaining  process  does  not  at  all 
guarantee  better  accountability,  in  terms  of  trans¬ 
parent  and  responsible  processes  of  decision  mak¬ 
ing.  Also,  when  returning  to  the  voters  to  give 
them  an  account  of  what  has  been  done  or  not 
done,  it  is  likely  that  the  politics  of  buck  passing 
will  defeat  the  politics  of  personal  and  party 
accountability.  It  is  also  likely  that  some  represen¬ 
tatives  and  several  party  leaders  might  enact  the 
politics  of  outbidding  if  they  believe  that  they  will 
not  be  asked  to  take  on  governing  duties. 

On  the  contrary,  in  single-member  constituen¬ 
cies  and  in  one-party  governments,  one  can  legiti¬ 
mately  hypothesize  that  two  likely  processes  will 
be  at  work.  All  representatives  elected  in  single¬ 
member  constituencies  will  feel  the  need  to  avoid 
relying  exclusively  on  their  “initial”  voters.  Some 
of  those  voters  will  inevitably  be  dissatisfied  with 
the  performance  of  their  specific  representative. 
Thus,  the  representative  will  attempt  to  increase 
the  number  of  his  or  her  voters  by  taking  into 
account  a  larger  set  of  preferences  and  by  keeping 
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them  into  account  in  his  or  her  representational 
and  governing  activities  in  order  to  be  reelected.  A 
similar  process  will  take  place  where  there  is  sin¬ 
gle-party  government.  In  their  governing  activities, 
all  one-party  governments  are  bound  to  lose  some 
of  their  initial  supporters.  Hence,  party  leaders  will 
launch  their  representational  net  beyond  the  initial 
borders  of  their  electoral  consensus.  By  so  doing, 
they  will  attempt  to  take  into  account  more  inter¬ 
ests,  more  preferences,  and  more  opinions  than 
simply  those  of  the  people  who  had  voted  their 
party  into  office.  The  process  of  widening  the 
perimeter  of  their  support  seems  much  more  diffi¬ 
cult  for  representatives  elected  on  party  lines  and 
for  multiparty  governments.  Each  party  represen¬ 
tative  will  have  to  toe  the  party  line  lest  he  or  she 
jeopardizes  his  or  her  reselection.  Hence,  they  will 
all  stick  to  a  static  interpretation  of  their  account¬ 
ability:  taking  and  keeping  into  account  exclu¬ 
sively  what  was  learned  and  promised  through 
party  channels.  No  party  belonging  to  the  govern¬ 
mental  coalition  will  be  likely  to  afford  looking  for 
additional  outside  support  lest  the  political  jeal¬ 
ousy  and  envy  of  the  other  parties  making  up  the 
governmental  coalition  lead  to  its  breakup. 

All  these,  admittedly,  though  indispensably, 
hypothetical,  considerations  are  not  necessarily 
meant  to  prove  that  majoritarian  electoral  systems 
applied  in  single-member  constituencies  are  abso¬ 
lutely  preferable  to  proportional  electoral  systems 
from  the  point  of  view  of  accountability,  not  the 
reverse.  What  they  suggest  is  that  no  final  conclu¬ 
sion  concerning  the  superiority  of  proportional  sys¬ 
tems  in  terms  of  accountability  seems  justified.  Only 
empirical  research  done  countrywide  and  based  on 
different  electoral  systems  will  offer  sufficient  and 
satisfactory  material  to  understand  and  assess  the 
quantity  and  quality  of  electoral  accountability. 
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Accountability, 

Interinstitutional 


Interinstitutional  accountability  is  a  form  of  hori¬ 
zontal  accountability  in  which  the  different  insti¬ 
tutions  provide  a  check  on  one  another’s  behavior. 
The  legislature  is  supposed  to  check  the  actions  of 
the  executive  branch  of  government.  The  judi¬ 
ciary,  especially  constitutional  courts,  oversees 
the  behavior  of  governing  and  representative 
institutions.  Independent  authorities — that  is,  all 
those  agencies  with  power  of  oversight  on  spe¬ 
cifically  relevant  aspects,  such  as  central  banks, 
regulatory  agencies,  and  ombudspersons,  or  those 
authorities  who  oversee  the  regulation  of  media 
and  communication  or  of  competition — indi¬ 
rectly  check  the  government  and  can  be  con¬ 
trolled  by  it  in  their  actual  working.  Party 
government  remains  the  best  “mechanism”  for 
interinstitutional  accountability  when  party 
appointees  and  party  representatives  are  account¬ 
able  horizontally  to  their  leaders  and  vertically  to 
the  voters. 

This  entry  first  discusses  the  definition  of  inter¬ 
institutional  accountability  and  its  evolution  in 
Europe  and  the  United  States.  It  then  considers 
ways  in  which  such  accountability  has  been 
expanded  in  recent  decades  to  include  the  roles  of 
institutions  such  as  central  banks,  regulatory 
agencies,  and  ombudspersons. 

The  Nature  of  Interinstitutional 
Accountability 

Institutions  may  be  accountable  vertically  and 
horizontally.  Vertical  accountability,  that  is,  elec¬ 
toral  accountability,  exists  when  institutions  are 
exposed  to  the  control  of  the  voters  and  the  citi¬ 
zens.  Technically,  the  voters  and  the  citizens  are 
the  principals  and  the  institutions  are  the  agents. 
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The  citizen/voters  delegate  some  representatives  in 
institutions  such  as  government  and  parliament  to 
perform  some  tasks.  They  retain  the  capacity  and 
the  power  to  reward  or  punish  a  government  and / 
or  a  parliament/congress.  In  this  situation,  vertical 
accountability  is  said  to  be  at  work.  This  type  of 
accountability  is  the  hallmark  of  democratic 
regimes.  Where  vertical  accountability  does  not 
exist,  there  is  no  democracy,  though,  of  course,  a 
democratic  regime  requires  and  provides  much 
more  than  just  “vertical  accountability.”  On  the 
other  hand,  horizontal  accountability  is  a  relation¬ 
ship  established  among  institutions:  the  executive, 
the  legislative  (parliament  or  congress),  the  bureau¬ 
cracy,  the  military  organization,  the  judiciary,  and 
independent  authorities.  It  occurs  when  the  various 
institutions  in  different  ways  are  in  a  position  to 
control  the  activities  of  another  specific  institution. 
This  type  of  control  can  be  reciprocal  and  balanced, 
one-sided  and  skewed,  formal  and  informal,  accom¬ 
panied  by  predefined  sanctions,  or  largely  symbolic. 

Notwithstanding  their  often  very  significant 
differences,  all  these  relationships  can  be  legiti¬ 
mately  assigned  to  the  area  of  what  is  called 
“interinstitutional  accountability.”  In  a  way,  hor- 
izontal-interinstitutional  accountability  is  a  much 
more  complex  relationship  to  be  studied,  first  of 
all,  because  it  is  not  dyadic  (voters-leaders)  and 
may  entail  the  activities  of  more  than  two  institu¬ 
tions.  Second,  not  only  is  it  less  transparent  than 
any  electoral  process,  but  it  is  also  less  stream¬ 
lined.  Interinstitutional  accountability  is  also 
made  more  complicated  by  the  sheer  fact  that  it  is 
the  product  of  a  fair  amount  of  reciprocal  interac¬ 
tion  and  mutual  influence.  For  this  reason,  its  full 
understanding  requires  a  profound  knowledge  of 
the  dynamics  of  the  entire  political-institutional 
system.  Finally,  the  study  of  horizontal-inter- 
institutional  accountability  is  also  more  likely  to 
be  influenced  by  the  values  of  the  scholars,  who 
may  prefer  the  stability  to  the  instability  of  the 
policy  process  and  the  concentration  of  political 
decision-making  power  to  its  diffusion,  or,  more 
rarely,  vice  versa.  In  different  periods  of  time, 
some  of  these  preferences,  and  values,  advocated 
and  shared  by  scholars  and  power  holders,  have 
been  translated  into  concrete  institutional  arrange¬ 
ments.  Indeed,  a  short  exploration  of  selected 
constitutional  outcomes  is  a  good  introduction  to 
the  analysis  of  interinstitutional  accountability. 


According  to  Aristotle,  mixed  regimes  are  the 
preferred  form  of  government.  If  one  political 
actor  prevails  over  all  the  others,  it  results  in  tyr¬ 
anny.  If  few  actors  prevail  over  all  the  others,  the 
regime  is  an  oligarchy.  When  many  actors  have 
unbridled  power,  the  regime  runs  the  risk  of 
degenerating  into  what  Aristotle  called  democracy, 
the  positive  definition  of  a  well-functioning  and 
balanced  regime  being  politeia.  When  power  is 
highly  concentrated  in  the  hands  of  just  one,  a  few, 
or  many  actors,  it  is  impossible  to  control  the  exer¬ 
cise  of  that  power  and  to  impose  accountability  on 
the  power  holders.  On  the  contrary,  when  it 
became  possible  to  allocate  power  to  different 
institutions,  through  the  appearance,  for  instance, 
of  constitutional  monarchies,  the  seeds  of  interin¬ 
stitutional  accountability  were  sown. 

Evolution  of  Interinstitutional  Accountability 

Most  scholars  would  concur  that  the  famous  book 
by  Charles-Louis  de  Secondat,  Baron  de  La  Brede 
et  de  Montesquieu,  De  I’esprit  des  lois  (1748), 
represents  the  first  attempt  to  suggest  the  division 
of  institutional  power,  though  not  yet  the  creation 
of  mechanisms  capable  of  producing  interinstitu¬ 
tional  accountability.  At  the  time,  the  problem  to 
be  solved  was  how  to  take  away  from  the  king  the 
power  to  legislate  and  the  power  to  adjudicate  and 
give  them  to  two  specialized  institutions:  parlia¬ 
ment  and  the  judiciary.  In  practice,  the  king 
retained  some  legislative  and  judicial  powers,  but 
the  very  existence  of  two  institutions  enjoying 
most  of  those  powers  meant  the  first  appearance  of 
some  relatively  simple  tripartite  interinstitutional 
accountability.  The  separation  of  institutional 
powers  created  a  situation  in  which  different  insti¬ 
tutions  were  in  a  position  to  control  each  other, 
giving  birth  to  the  circuit  of  accountability  among 
the  three  most  important  institutions. 

The  next,  highly  significant  and  quite  deliber¬ 
ate,  attempt  to  construct  a  web  of  interinstitutional 
accountability — the  drafting  of  the  Constitution  of 
the  Republic  of  the  United  States  of  America — 
constituted  a  response  to  practical  problems  more 
than  a  full-blown  theoretical  formulation.  In  all 
likelihood,  James  Madison  and  Alexander 
Hamilton  thought  that  they  were  going  to  substan¬ 
tially  imitate  the  institutional  edifice  existing  in 
Great  Britain.  But  they  were  also  aware  of  the  need 
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to  decisively  innovate  and  improve  on  it. 
Montesquieu  had  fundamentally  aimed  at  subtract¬ 
ing  the  legislative  and  the  judicial  power  from  the 
executive  power  (the  king).  The  three  powers  could 
then  act  autonomously  and  independently,  giving 
birth  to  a  situation  of  positive  institutional  plural¬ 
ism.  However,  neither  autonomy  nor  independence 
was  in  itself  leading  to  interinstitutional  account¬ 
ability.  On  the  contrary,  in  Montesquieu’s  view, 
the  institutional  separation  of  the  three  “powers” 
was  positive  in  itself.  It  was  on  the  other  side  of  the 
English  Channel  that  a  somewhat  different  interin¬ 
stitutional  formula  made  its  appearance:  “the  king 
in  parliament,”  meaning  a  close  relationship  and 
some  reciprocal  accountability  between  the  chief 
executive  and  the  representative  assembly.  The 
king  could  prevent  parliament  from  passing  unde¬ 
sired  laws,  and  in  its  turn,  parliament  could  oppose 
bills  and,  above  all,  taxes  desired  by  the  king.  The 
product  of  a  prolonged  and  even  bloody  confronta¬ 
tion,  this  apparently  healthy  situation  represented  a 
most  prominent  instance  of  “checks  and  balances.” 

At  the  time  the  U.S.  Founding  Fathers  were 
drafting,  defending,  and  promoting  their  constitu¬ 
tional  package,  Madison  and  Hamilton  shared  the 
view  that  in  the  English  case  one  could  find  some 
separation  of  the  institutions  combined  with  some 
interaction  among  them.  Both  of  them,  but  espe¬ 
cially  Madison,  argued  that  the  separation  of  the 
institutions  was  a  good  formula  to  prevent  the 
accumulation  and  concentration  of  power  in  one 
specific  institution.  Though  Hamilton  wanted 
more  power  to  be  given  to  the  presidency,  in  the 
end  Congress,  being  more  representative  of  the 
American  people,  received  more  and  more  impor¬ 
tant  powers.  And  the  U.S.  political  system  parted 
forever  from  the  institutional  architecture  of  the 
British  constitutional  monarchy. 

Great  Britain 

The  development  of  the  political  and  institu¬ 
tional  dynamics  of  Britain  led  to  the  fusion  of  pow¬ 
ers,  the  secret  of  the  English  Constitution’s  effi¬ 
ciency,  according  to  Walter  Bagehot  (1867).  That 
is,  the  relationship  between  the  English  cabinet  and 
parliament  is  clearly  skewed  in  favor  of  the  prime 
minister,  who  obtains  and  retains  his  or  her  office 
because  he  or  she  is  the  leader  of  the  parliamentary 
party  that  owns  the  absolute  majority  of  seats.  As 


Max  Weber  correctly  wrote,  the  English  prime 
minister  is  “the  dictator  of  the  parliamentary  bat¬ 
tle  field.”  Hence,  the  separation  does  not  run 
neatly  between  the  prime  minister  with  the  cabinet 
and  the  parliament,  between  10  Downing  Street 
and  Westminster,  but  between  the  prime  minister 
together  with  his  or  her  parliamentary  majority 
and  the  opposition  led  by  its  shadow  government. 
In  a  way,  Winston  Churchill  was  wrong:  It  is  not 
parliament  that  is  fully  sovereign  and  omnipotent, 
but  the  parliamentary  majority  led  by  its  prime 
minister.  As  a  consequence,  the  English  cabinet  is 
accountable  not  to  parliament  as  such  but  above 
all  to  its  own  parliamentary  majority.  The  rela¬ 
tionship  of  accountability  does  not  occur  between, 
on  one  side,  government  and,  on  the  other,  parlia¬ 
ment,  but  between  the  governmental  majority  and 
the  opposition.  Still,  under  some  exceptional  cir¬ 
cumstances,  when  party  discipline  breaks  down, 
parliament  may  and  will  revive  the  circuit  of  inter¬ 
institutional  accountability  with  Her  Majesty’s 
government.  A  similar  situation  holds  in  those 
parliamentary  democracies  that  have  imported  the 
British  model,  for  example,  Australia,  Canada, 
and,  until  1993,  New  Zealand,  plus  some  former 
British  colonies  in  the  Caribbean  and  Africa.  In  all 
these  cases,  an  effective  two-party  system  seems  to 
be  the  necessary  condition  for  interinstitutional 
accountability  following  the  English  style. 

United  States 

Notwithstanding  the  oscillations  between  con¬ 
gressional  government  and  the  Imperial  presidency, 
the  definition  that  best  characterizes  the  function¬ 
ing  of  the  U.S.  political  system  was  formulated  by 
Richard  Neustadt  (1960):  “separate  institutions 
sharing  powers.”  This  widely  accepted  interpreta¬ 
tion  depicts  a  reality  that  is  quite  different  from, 
one  might  even  say  almost  the  opposite  of,  the 
English  institutional  arrangement.  In  the  United 
States,  the  executive  and  the  legislative  powers,  the 
president  and  Congress,  are  the  product  of  distinct 
electoral  processes.  The  president  does  not  have  the 
power  to  dissolve  Congress,  while  Congress  cannot 
dismiss  the  president  except  through  the  very 
extraordinary  procedure  of  impeachment,  which, 
incidentally,  does  represent  an  extreme  instance  of 
interinstitutional  accountability.  Congress  and  the 
president  are  obliged  to  live  and  work  together  lest 
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a  paralysis  of  the  political  system  follows.  They  are 
both  endowed  with  quite  powerful  instruments  to 
check  their  respective  activities  and  behavior  that, 
in  some  exceptional  instances,  for  example,  when 
they  disagree  on  the  federal  budget,  may  lead  to 
decision-making  paralysis  or  a  stalemate. 

Including  the  judiciary,  especially,  the  Supreme 
Court,  in  the  overall  framework,  the  situation  of 
“separate  institutions  sharing  powers”  is  com¬ 
plete.  The  president  holds  the  power  to  appoint  the 
justices  (as  well  as  several  other  federal  judges), 
but  the  Senate  retains  the  power  to  reject  those 
appointments  (“advice  and  consent”).  The  Senate 
has  even  the  power  to  block  the  appointment  of  all 
the  secretaries  nominated  by  the  president.  In  both 
instances,  interinstitutional  accountability  may 
also  be  at  work  in  a  rather  subtle  way:  deterrence 
and  anticipated  reactions.  Fearing  the  opposition 
of  the  Senate,  the  president  will  avoid  nominating 
someone  who  is  likely  to  be  rejected.  On  its  part, 
the  Senate  will  refrain  from  challenging  the  presi¬ 
dent  on  all  nominations.  Once  appointed,  the 
Supreme  Court  justices  will  hold  office  for  life, 
which  allows  them  to  behave  with  absolute  free¬ 
dom,  not  being  subject  to  any  kind  of  retaliation. 
In  a  way,  their  accountability  stops,  but  they  are  in 
a  position  to  impose  a  lot  of  accountability  both 
on  Congress  and  on  the  president.  The  members  of 
the  European  constitutional  courts  are  not 
appointed  for  life  but  for  fixed  terms.  Many  of 
them  have  a  career  of  their  own  after  their  experi¬ 
ence  in  the  constitutional  court.  It  is  not  rare  for 
them,  as  in  the  case  of  Italy,  to  position  themselves, 
in  some  cases  at  the  expense  of  interinstitutional 
accountability,  to  obtain  other  offices. 

As  to  the  interinstitutional  accountability  occur¬ 
ring  between  the  U.S.  president  and  Congress, 
though  Congress  enjoys  the  power  to  initiate  legis¬ 
lation,  it  is  exposed  to  presidential  vetoes,  which 
are  often  threatened  and  applied  and  which,  for 
want  of  a  three-fifth  majority,  are  rarely  overrid¬ 
den.  A  president’s  agenda  can  generally  be  achieved 
only  in  those  infrequent  cases  in  which  the  presi¬ 
dent’s  party  has  majorities  in  both  branches  of 
Congress  that  are  solid  and  disciplined.  Even  the 
formation  of  the  presidential  team  of  secretaries, 
the  so-called  administration,  needs  the  approval 
of — that  is,  confirmation  by — the  Senate.  All  these 
complex  processes  justify  Neustadt’s  definition 
and  evaluation.  Indeed,  U.S.  institutions,  though 


separate,  share  powers  and  compete  for  the  exer¬ 
cise  of  those  powers,  making  space  for  many 
opportunities  of  interinstitutional  accountability. 
This  is  even  truer  when  the  president’s  party  does 
not  have  a  majority  in  one  or  both  branches  of 
Congress.  A  divided  government  may  entail  not 
just  more  interinstitutional  accountability  but  also 
political  and  institutional  confusion,  stalemate  in 
decision  making,  and  pork  barrel  policies.  It  is  not 
at  all  farfetched  to  hypothesize  that  confusion  in 
the  interinstitutional  accountability  process  will 
negatively  affect  the  possibility  of  vertical  account¬ 
ability  as  well.  A  number  of  principals  (the  citizen- 
voters)  will  find  it  difficult  to  identify  which  agents 
(the  president  or  the  members  of  Congress)  to 
punish  or  reward.  At  this  point,  a  classic  institu¬ 
tional  problem  emerges:  How  much  are  we  willing 
to  sacrifice  in  terms  of  decision-making  efficacy  in 
order  to  obtain  and  maintain  democratic  control? 

France 

The  quite  different  institutional  arrangement  of 
the  Fifth  French  Republic,  and  more  generally  of 
semipresidential  systems,  entails  different  conse¬ 
quences  in  terms  of  interinstitutional  accountability. 
In  the  French  case,  interinstitutional  accountability 
is  a  more  complicated  phenomenon  because  it 
depends  also  on  the  changing  allocation  of  political 
power  between  the  president  and  the  Assemblee 
Nationale  (and,  eventually,  the  leader  of  its  parlia¬ 
mentary  majority).  Reacting  against  the  institu¬ 
tional  weakness  of  the  Fourth  Republic,  Charles  De 
Gaulle  favored  the  concentration  of  much  power  in 
the  hands  of  the  president  and  the  president’s  gov¬ 
ernment.  Semipresidentialism  was  meant  to  give 
more,  and,  in  a  way,  “shielded”  power  to  the  chief 
executive,  in  order  to  endow  the  chief  executive 
with  a  lot  of  (popular)  legitimacy,  certainly  not  to 
establish  a  circuit  of  interinstitutional  accountabil¬ 
ity.  De  Gaulle  never  envisaged  the  likelihood  of 
cohabitation,  that  is,  of  a  president  having  to  live 
with  a  party  of  the  opposite  political  alignment  that 
has  the  parliamentary  majority  and  a  prime  minis¬ 
ter  who  belongs  to  that  party.  Cohabitation  intro¬ 
duces  a  significant  amount  of  interinstitutional 
accountability,  though  at  a  price.  Unlike  the  U.S. 
president  (and  all  chief  executives  in  presidential 
systems),  the  French  president  has  the  option  of  dis¬ 
solving  parliament  in  the  hope  that  the  voters  will 
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return  the  majority  he  or  she  likes.  The  president 
can  dissolve  parliament  only  after  it  has  been  in 
office  for  at  least  a  year.  However,  no  president 
can  risk  tarnishing  his  or  her  image  by  having  the 
party  with  the  opposite  political  alignment  repeat¬ 
edly  returned  by  the  voters.  Hence,  the  network  of 
interinstitutional  accountability  is  enriched  by  the 
“interference”  or  the  calling  of  the  voters  into  the 
picture  rather  than  by  staying  within  the  limits  of 
interinstitutional  relationships.  A  prime  minister 
who  belongs  to  a  political  alignment  opposed  to 
the  president  will  avoid  deliberately  challenging 
the  president,  because  such  a  move  would  risk  the 
early  dissolution  of  a  parliament  in  which  the 
prime  minister’s  party  is  in  the  majority.  This 
potential  for  the  dissolution  of  parliament  thus 
obliges  both  the  president  and  the  prime  minister 
to  behave  accountably  and,  on  the  whole,  predict¬ 
ably.  Though  both  may  engage  in  buck  passing, 
the  citizen-voters  are  in  a  position  to  evaluate  spe¬ 
cific  responsibilities  and  to  punish  either  the  presi¬ 
dent  for  exaggeratedly  interfering  or  the  prime 
minister  and  his  or  her  majority  for  their  (lack  of) 
performance.  Finally,  there  is  another  important 
structure  enforcing  some  interinstitutional  account¬ 
ability:  the  Constitutional  Council,  which  can  be 
activated  by  just  10%  of  the  members  of  the 
Assemblee  Nationale  to  challenge  the  bills  approved 
by  the  majority. 

Germany 

All  parliamentary  governments  are  built  on  more 
or  less  complex  and  explicit  webs  of  interinstitu¬ 
tional  accountability.  All  governments  must  have 
and  maintain  a  relationship  of  confidence  with 
parliament,  otherwise  they  will  be  replaced  more  or 
less  smoothly.  The  German  constructive  vote  of  no 
confidence  represents  a  specific  mechanism  of  this 
type  of  accountability.  A  parliament  that  is  unable 
to  support  its  government  or  unwilling  to  enact  its 
policies  can  be  dissolved  either  by  the  head  of  the 
government  or  by  the  head  of  the  state  following  a 
request  by  the  head  of  the  government. 

German  constitution  makers  also  wanted  to  pre¬ 
vent  the  concentration  of  power.  The  semifederal 
system,  based  on  the  autonomy  of  the  Lander,  was 
the  answer,  especially  strengthened  by  the  many 
and  important  legislative  and  representative  powers 
given  to  the  second  chamber:  the  Bundesrat. 


Though  not  involved  in  the  procedure  of  voting 
confidence  in  the  chancellor  and  withdrawing  it, 
the  Bundesrat  is  an  important  actor  in  the  circuit  of 
interinstitutional  accountability.  It  has  significant 
powers  in  all  legislation  concerning  even  marginally 
the  policies  considered  in  the  domain  of  the  Lander. 
Its  power  to  prevent  the  passing  of  legislation 
approved  by  the  Bundestag  compels  this  assembly 
to  be  accountable.  More  generally,  all  types  of  fed¬ 
eralism  and  all  forms  of  political  and  administrative 
decentralization  translate  themselves  into  some 
interinstitutional  accountability  between  the  federal 
government  and  local  governments. 

Emerging  Roles  of  Other 

Agents  of  Accountability 

When  speaking  of  interinstitutional  accountability 
one  must  consider  another,  relatively  recent  phe¬ 
nomenon  that  identifies  other  authorities  and 
agencies  that  can  be  instruments  of  accountability, 
such  as  central  banks,  regulatory  agencies,  and 
ombudspersons. 

First,  many  countries  have  granted  full  indepen¬ 
dence  to  their  central  banks.  The  U.S.  Federal 
Reserve  has  enjoyed  such  independence  for  some 
time,  while  the  British  Central  Bank  acquired  it  in 
1997.  With  several  other  countries  following  suit, 
this  trend  means  that  one  can  find  many  instances 
in  which  interinstitutional  accountability  between 
the  treasury  and  the  central  banks  is  at  work. 

Second,  several  countries  have  created  indepen¬ 
dent  authorities  and  agencies  charged  with  tasks 
once  performed  by  the  executive  or,  at  least,  sup¬ 
posed  to  be  in  its  domain  or  that  of  the  legislature. 
In  the  second  case,  these  agencies  may  complement 
legislative  oversight.  Examples  are  agencies  that 
address  areas  such  as  consumer  protection,  food 
and  drug  safety,  communications,  and  privacy 
issues  raised  by  the  news  media;  enforce  regula¬ 
tions  intended  to  ensure  a  competitive  market;  and 
oversee  evolving  energy  technologies.  In  many 
cases,  those  agencies  and  authorities,  whose  chairs 
and  boards  are  appointed  either  by  government  or 
by  parliament,  report  directly  to  parliament. 

Finally,  in  many  countries  and  even  in  the 
European  Union  (EU)  there  is  an  ombudsperson 
who  deals  with  the  complaints  of  the  citizens 
against  their  state  and  its  bureaucracy  as  well  as 
the  very  decisions  of  the  institutions  of  the  EU 
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itself.  Also,  within  the  EU,  interinstitutional 
accountability  is  present  in  its  complex  circuit 
going  from  the  European  Council  to  the  European 
Commission  to  the  European  Parliament,  but  it 
also  plays  a  role  in  the  relationships  between  the 
institutions  of  the  EU  and  the  member  states. 

It  is  unclear  whether  the  proliferation  of  author¬ 
ities  and  agencies  has  actually  improved  the  work¬ 
ing  and  quality  of  interinstitutional  accountability. 
When  dealing  with  complex  processes  whose  com¬ 
ponents,  antecedents,  and  consequences  are  not  and 
cannot  be  crystal  clear,  no  attempt  at  evaluation  is 
bound  to  be  satisfactory.  One  may  want  to  point  to 
the  perils  of  too  dense  and  wide  a  network  of  inter¬ 
institutional  accountability.  The  first  peril  has  cer¬ 
tainly  to  do  with  the  (im)precise  definition  of  the 
spheres  of  action  of  several  institutions.  In  too 
many  instances,  tensions  and  conflicts  may  appear 
between  the  executive  and  the  legislative,  between 
the  federal  government  and  local  governments,  or, 
quite  often,  between  the  executive  and  the  judiciary. 
It  will  then  be  the  constitutional  court  that  decides 
where  the  respective  spheres  of  action  begin  and, 
above  all,  end.  But  there  will  always  remain  a  gray 
area.  Some  institutional  actors  will  try  to  take 
advantage  of  this  gray  area  and  engage  in  the  poli¬ 
tics  of  buck  passing,  that  is,  attempting  to  put  the 
blame  for  what  is  done,  not  done,  or  poorly  done 
on  other  institutions.  In  addition  to  tensions  and 
conflicts,  confusion  will  follow.  The  perils  of  inter¬ 
institutional  accountability  will  affect  the  demo¬ 
cratic  framework  itself.  Though  by  definition  the 
citizens  are  not  directly  involved  in  this  kind  of 
accountability,  still  they  are  more  or  less  informed 
spectators  who  can  look  at  what  goes  on  and  who 
can  form  their  own  opinion.  Not  being  directly  in 
the  position  to  provide  rewards  or  punishments, 
they  are  likely  to  reach  a  negative  judgment  on  all 
the  institutions  involved  in  this  exercise  of  political 
and  bureaucratic  buck  passing.  If  and  when  all  the 
institutions  are  considered  equally  responsible  for 
the  conflict  and  for  the  confusion,  some  political 
apathy  or  even  alienation  cannot  be  written  off. 

Technically,  the  mass  media  are  not  considered 
to  be  fully  a  component  of  the  network  of  interin¬ 
stitutional  accountability.  Nevertheless,  they  are 
often  called  the  fourth  power.  But  are  they  account¬ 
able  to  other  institutions?  Leaving  aside  their 
somewhat  peculiar  accountability  to  their  readers 
and  public  opinion  at  large,  at  the  most  the  mass 


media  are  accountable  to  the  law,  hence  indirectly 
to  the  judiciary.  They  do  play  a  political  role  that 
most  certainly  affects  the  way  interinstitutional 
accountability  is  performed  and  evaluated,  but  the 
mass  media  themselves  are  external  to  that  circuit. 

Conclusion 

In  sum,  there  is  a  vibrant  paradox  in  all  discussions 
of  interinstitutional  accountability.  Most  scholars 
have  maintained  that  this  kind  of  accountability  is 
quite  positive  for  the  working  of  democratic 
regimes.  It  seems  to  be  less  positive  for  the  power 
of  the  people.  Political  power  should  not  be  con¬ 
centrated  but  distributed  among  several  institu¬ 
tions  and  shared  by  them.  Checks  and  balances 
must  be  devised  in  such  a  way  as  to  put  a  limit  on 
the  exercise  of  power  by  all  institutions.  Some 
activities  are  so  important  that  they  have  to  be  con¬ 
trolled  by  special  bodies:  independent  authorities 
and  agencies.  Recently,  doubts  have  been  raised 
concerning  not  only  the  effectiveness  of  interinsti¬ 
tutional  accountability  but  also,  inevitably,  whether 
a  dense  network  of  interinstitutional  accountability 
may  in  practice  conspicuously  trim  all  the  occur¬ 
rences  of  political  accountability  in  which  the  citi¬ 
zens  are  the  principals.  When  accountability  escapes 
evaluation  by  the  citizens  because  some  policy 
areas  are  insulated  and  not  exposed  to  their  con¬ 
trol,  the  quality  of  democracy  may  significantly 
suffer.  Some  nostalgia  has  made  its  appearance  for 
the  period  and  the  situations  in  which  the  party 
government  and  a  partisan  opposition  faced  each 
other  in  a  visible  and  transparent  competition 
decided  by  the  voters.  There  is  more  than  a  kernel 
of  truth  in  the  statement  that  such  competition  in 
contexts  in  which  governing  parties  do  not  delegate 
power  to  outside  agencies  and  authorities  does  pro¬ 
vide  for  better  accountability.  When  the  parties  in 
government  are  many,  the  voters  will  still  encoun¬ 
ter  problems  in  their  attempt  to  punish  and  reward 
their  “agents.”  But  most  of  the  time  it  will  be  clear 
that  parties  and  their  leaders  are  accountable  for 
the  positive  and  negative  functioning  of  the  govern¬ 
ing,  representative,  and  bureaucratic  institutions. 

The  extension  of  instances  of  interinstitutional 
accountability  may  have  been  fueled  and  nour¬ 
ished  by  distrust  toward  parties  and  party  politi¬ 
cians.  It  may  have  had  as  a  goal  the  containment 
and  the  reduction  of  party  power.  But  in  many 
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cases,  politicians  seem  to  have  reacquired  their 
power  working  within  and  through  the  institu¬ 
tions,  thus  avoiding  electoral  accountability.  When 
necessary,  party  leaders  have  learned  to  practice 
and  implement  a  shrewd  politics  of  appointments 
of  loyal,  even  subservient,  collaborators,  in  all 
nongovernmental  bodies  and  agencies,  guiding 
them  to  enact  unpopular  policies.  The  prolifera¬ 
tion  of  institutions  endowed  with  some  decision¬ 
making  power  in  technical  areas  or  with  the  power 
to  check  the  activities  of  other  institutions  has 
seemed  to  some  a  democratic  conquest,  to  others 
an  expropriation  of  the  voters.  Most  of  the  time,  it 
is  still  party  politicians  who  control  the  exercise  of 
political  power  by  and  within  the  institutions, 
while  it  is  appointed  technocrats  and  career 
bureaucrats  who  come  to  bear  the  brunt  of  the 
blame  in  the  interinstitutional  web  of  accountabil¬ 
ity.  Not  all  that  glitters  is  gold  (democratic)  in  the 
circuit  of  interinstitutional  accountability. 
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Governments  in  many  Western  countries  decided 
in  the  1980s  that  traditional  administration  was 


no  longer  up  to  the  task  of  modern  government. 
They  looked  to  the  private  sector  to  define  a 
new  management  approach.  Management  intro¬ 
duces  a  new  vocabulary,  mindset,  and  culture  to 
government  bureaucracies.  The  purpose  is  to 
force  the  hand  of  bureaucrats  to  become  more 
dynamic  and  better  managers.  A  debate  has 
raged  in  the  political  science  and  public  admin¬ 
istration  literature  on  the  merits  of  the  new  pub¬ 
lic  management  (NPM)  in  relation  to  traditional 
public  administration  for  the  past  25  years.  A 
number  of  politicians  in  the  1980s  decided  that 
the  machinery  of  government  was  in  an  urgent 
need  of  repair.  The  old  ways  were  no  longer  up 
to  the  task,  and  they  packaged  a  series  of  reform 
measures  that  in  time  would  become  known  as 
NPM.  Proponents  of  NPM  deliberately  set  high 
standards  with  goals  to  “reinvent  government,” 
“get  government  right,”  and  designate  those 
that  would  “steer”  government  from  those  that 
would  be  doing  the  “rowing.” 

The  word  management  in  NPM  implies  a  deci¬ 
siveness,  a  bias  for  action,  and  a  dynamic  mindset. 
Traditional  public  administration,  meanwhile, 
conjures  up  images  of  rules,  regulations,  and 
lethargic  decision-making  processes.  Presidents 
and  prime  ministers  who  came  to  power  in  the 
1980s  concluded  that  the  problem  was  with 
bureaucracy,  not  political  institutions.  They 
accused  bureaucracy  of  being  bloated,  expensive, 
unresponsive,  a  creation  of  routine  deliberately 
resistant  to  change,  and  essentially  incapable  of 
dealing  with  new  challenges. 

It  is  one  thing  to  diagnose  the  patient,  but  it  is 
quite  another  to  come  up  with  the  remedy.  Initially, 
at  least,  political  leaders  were  left  to  try  this  or  that 
to  see  what  would  work  with  varying  degrees  of 
success.  In  time,  a  new  approach,  anchored  in  pri¬ 
vate  sector  management  practices,  began  to  take 
shape  and  a  label  was  attached  to  it — NPM. 

The  goal  was  nothing  short  of  introducing  a 
new  culture  in  government  departments  and  agen¬ 
cies.  The  old  culture  was  found  wanting  on  many 
fronts.  It  attached  too  much  importance  to  due 
process,  prudence,  probity,  and  centrally  pre¬ 
scribed  administrative  rules  and  regulations.  It  also 
encouraged  senior  civil  servants  to  focus  on  policy 
issues  rather  than  on  management.  The  old  cul¬ 
ture,  associated  with  traditional  public  administra¬ 
tion,  was  considered  not  only  outdated  but  also 
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counterproductive.  Public  administration  became 
synonymous  with  the  old  culture. 

NPM  would  introduce  a  new  vocabulary,  a  new 
way  of  thinking,  and  a  new  culture.  It  would  also 
give  rise  to  a  proliferation  of  management  tech¬ 
niques  to  force  government  operations  to  become 
more  efficient.  The  purpose  was  to  force  the  hand  of 
senior  government  officials  to  become  better  man¬ 
agers  and  to  learn  to  make  tough  management  deci¬ 
sions.  Taken  at  face  value,  the  political  rhetoric  that 
accompanied  the  arrival  of  NPM  to  government 
meant  setting  the  civil  service  at  its  own  throat. 

The  Canadian  government  published  a  report 
(see  Table  1)  designed  to  contrast  the  old  culture 
(public  administration)  with  the  new  culture 
(NPM). 

NPM  holds  important  advantages  for  politi¬ 
cians.  In  forcing  the  hand  of  senior  civil  servants  to 
become  better  managers,  politicians  would  gain 
the  upper  hand  in  shaping  policy  initiatives.  The 
thinking  was  that  civil  servants  had  too  much 


Table  I  Old  Culture  (Traditional  Public 

Administration)  Versus  New  Culture  (New 
Public  Management) 


Old  Culture 

New  Culture 

Controlling 

Empowering 

Rigid 

Flexible 

Suspicious 

Trusting 

Administrative 

Managerial 

Secret 

Open 

Power  based 

Task  based 

Input/process  oriented 

Results  oriented 

Preprogrammed  and 

Capable  of  purposeful 

repetitive 

action 

Risk  averse 

Willing  to  take  intelligent 
risks 

Mandatory 

Optional 

Communicating  poorly 

Communicating  well 

Centralized 

Decentralized 

Uniform 

Diverse 

Stifling  creativity 

Encouraging  innovation 

Reactive 

Proactive 

Source:  Government  of  Canada,  Public  Service  2000 
Secretariat,  Ottawa,  Canada. 


influence  on  policy  at  the  expense  of  politicians. 
The  message  from  politicians  to  senior  civil  ser¬ 
vants  could  not  be  clearer  under  NPM — you  worry 
about  managing  government  operations  better, 
and  we  will  worry  about  setting  policy  priorities. 

How  then  would  civil  servants  become  better 
managers?  NPM  encompasses  a  number  of  broad 
strategies  to  promote  cultural  change  in  govern¬ 
ment:  decentralization  of  decision  making,  empow¬ 
erment,  a  reduction  on  controls  on  managers, 
more  flexible  organizational  structures,  upgrading 
the  skills  of  government  managers,  and  a  stronger 
sense  of  service  to  the  public.  Government  depart¬ 
ments  and  agencies  were  also  encouraged  to 
launch  review  exercises  to  identify  “useless”  red 
tape  and  “delayer”  management  levels. 

In  time,  NPM  became  the  fashion  in  much  of 
the  Western  world.  Margaret  Thatcher  in  the 
United  Kingdom  (UK)  showed  the  way  with 
numerous  measures  designed  to  overhaul  govern¬ 
ment  operations.  She  cut  the  size  of  the  civil  service 
(from  733,000  strong  to  569,000),  restructured 
government  operations  by  creating  executive  agen¬ 
cies  and  gave  them  a  narrow  mandate  to  deliver 
public  services,  privatized  state  corporations,  del¬ 
egated  more  authority  to  frontline  managers, 
and  overhauled  the  government’s  financial  man¬ 
agement  system.  This  and  other  private  sector 
management-inspired  measures  gave  life  to  NPM 
in  the  UK.  Before  long,  Australia,  New  Zealand, 
the  United  States,  and  Canada,  among  others,  also 
introduced  numerous  NPM-type  measures  with 
varying  degrees  of  success.  Countries  that  did  not 
pursue  NPM  with  any  enthusiasm,  such  as  France, 
were  regarded  as  being  out  of  step  with  modern 
management  strategies. 

The  contrast  between  the  old  (public  administra¬ 
tion)  and  the  new  (NPM)  is  striking  and  instructive. 
However,  it  probably  came  as  a  surprise  for  retired 
civil  servants  to  discover  that  their  culture  was 
“rigid”  and  gave  rise  to  “suspicious”  and  “secre¬ 
tive”  behavior,  which  in  turn  served  to  “stifle  cre¬ 
ativity,”  which  led  them  to  “communicate  poorly.” 

Old  cultures  die  hard,  if  they  die  at  all.  NPM 
has  met  some  successes  but  also  failures.  Graham 
Allison  (1987)  went  to  the  heart  of  the  matter 
when  he  wrote, 

The  perception  that  government  performance 

lags  behind  private  business  performance  is 
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correct.  But  the  notion  that  there  is  any  significant 
body  of  private  management  practices  and  skills 
that  can  be  transferred  directly  to  public 
management  tasks  in  a  way  that  produces 
significant  improvements  is  wrong,  (p.  525) 

The  sharp  differences  between  the  public  and 
private  sectors  make  it  difficult  for  a  number  of 
NPM  measures.  The  public  sector  does  not  have 
a  clear  bottom  line  and  has  legislative  require¬ 
ments  of  reviewing  the  work  of  government  man¬ 
agers.  Indeed,  public  sector  managers  depend  on 
political  bodies  both  for  their  authority  and  for 
their  budgets,  and  they  are  subject  to  far  greater 
public  scrutiny  of  their  actions.  They  are  directly 
accountable  to  their  political  masters,  and  no 
matter  what  form  of  government  organization  is 
in  place  it  can  only  enjoy  legitimacy  through  the 
political  process.  Although  it  can  take  various 
forms,  governments  must  have  an  accountability 
process  to  make  the  exercise  of  power  responsi¬ 
ble.  This  speaks  of  the  merits  of  traditional  public 
administration. 

Traditional  public  administration  has  short¬ 
comings,  but  it  is  directly  tied  to  national  political 
institutions  and  their  accountability  requirements. 
Hierarchy,  command  and  control,  and  centrally 
prescribed  rules  and  regulations  enable  political 
leaders  to  reach  down  to  all  departments  and  agen¬ 
cies  and  determine  who  did  what  and  why  things 
went  wrong. 

NPM  had  to  compensate  somehow  for  its  call 
to  empower  managers,  to  flatten  organizations, 
and  to  import  numerous  private  sector  manage¬ 
ment  practices  to  government.  An  emphasis  on 
measuring  performance  became  another  defining 
characteristic  of  NPM.  Nothing  is  left  in  govern¬ 
ment  that  is  not  up  for  measurement  from  the 
performance  of  senior  civil  servants  to  program 
activities  in  all  sectors  whether  economic  or  social. 
The  thinking  is  that  establishing  performance 
standards  will  provide  for  more  effective  account¬ 
ability  and  also  enable  both  politicians  and  citi¬ 
zens  to  see  how  well  civil  servants  and  programs 
perform. 

The  demand  for  information  to  fuel  perfor¬ 
mance  measurement  and  evaluation  initiatives  by 
central  agency  and  senior  departmental  officials 
has  increased  substantially  in  recent  years  through¬ 
out  the  Western  world.  It  seems  that  everything  in 


government  now  needs  to  be  measured  to  have  any 
standing  in  the  expenditure  budget  process.  This 
squares  with  NPM’s  emphasis  on  outputs,  on  good 
management,  and  on  evaluating  how  well  indi¬ 
vidual  civil  servants  and  programs  perform.  This 
too  is  in  contrast  to  traditional  public  administra¬ 
tion  with  its  emphasis  on  controlling  input  costs 
and  holding  civil  servants  accountable  for  their 
administrative  decisions. 

Proponents  of  traditional  public  administration 
insist  that  politics  and  the  public  sector  do  not 
lend  themselves  to  big  answers.  Public  administra¬ 
tion  operates  in  a  political  environment  that  is 
always  on  the  lookout  for  “errors”  and  that  exhib¬ 
its  an  extremely  low  tolerance  for  mistakes.  The 
attention  of  the  national  media  and  political  oppo¬ 
nents  are  sufficient  to  explain  why  civil  servants 
are  cautious  and  why  they  strive  to  operate  in  an 
error-free  environment.  One  would  have  to  let  the 
imagination  run  wild  to  visualize  a  headline  in  the 
media  applauding  the  fine  work  of  “empowered” 
civil  servants.  Supporters  of  traditional  public 
administration  insist  that  it  is  unwise  to  think  that 
one  can  import  private  sector  management  strate¬ 
gies  to  government.  They  point  out  that  in  busi¬ 
ness  it  does  not  much  matter  if  you  get  it  wrong 
10%  of  the  time  as  long  as  you  turn  a  profit  at  the 
end  of  the  year.  In  government,  it  does  not  much 
matter  if  you  get  it  right  90%  of  the  time  because 
the  focus  will  be  on  the  10%  of  the  time  you  get  it 
wrong. 

In  the  end,  the  debate  that  truly  matters  between 
the  two  camps  centers  on  accountability.  Both  sides 
insist  that  their  approach  holds  greater  merit  in 
ensuring  that  politicians  and  civil  servants  are 
accountable  for  policies  and  the  delivery  of  govern¬ 
ment  programs.  For  politicians  and  civil  servants, 
both  approaches  have  strengths  and  drawbacks — 
traditional  public  administration  provides  more 
stability  and  predictability  while  NPM  provides 
greater  flexibility  and  a  capacity  to  measure  the  per¬ 
formance  of  both  senior  civil  servants  and  programs. 

Donald  J.  Savoie 
Universite  de  Moncton 
Moncton,  New  Brunswick,  Canada 

See  also  Administration  Theory;  Autonomy, 
Administrative;  Civil  Service;  Development 
Administration;  Performance  Management 
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There  is  no  single  theory  of  public  administration. 
Rather,  public  administration  theory  contains  a 
complex  of  statements  regarding  what  administra¬ 
tion  is,  the  meaning  of  public,  who  creates  public 
administration,  what  the  goals  of  public  adminis¬ 
tration  are,  and  how  public  administration  should 
fulfill  its  tasks.  Therefore,  administration  theories 
have  always  been  and  continue  to  be  interdisci¬ 
plinary,  influenced  by  concepts  from  history,  law, 
sociology,  political  science,  management,  and 
even  psychology.  Public  administration  theory 
also  applies  to  a  distinction  between  an  academic 
discipline  and  a  profession  and  contains  approaches 
from  organization  theory,  politics,  governance, 
and  law.  This  entry  provides  an  overview  of  the 
development  of  administrative  theories  for  the 
public  sector  and  also  examines  some  contempo¬ 
rary  challenges  in  this  area.  It  begins  with  early 
theories  that  sought  to  distinguish  between  poli¬ 
tics,  which  dealt  with  the  choice  of  the  values 
underlying  public  policy  as  articulated  through 
the  political  process,  and  administration,  which 
implemented  those  policies  in  an  efficient  way 


similar  to  the  way  that  businesses  operate.  It  then 
considers  the  impact  of  theories  of  scientific  man¬ 
agement  on  the  development  of  administrative 
theory.  Last,  it  examines  more  recent  thought  that 
sees  the  influence  of  values  on  administrative  deci¬ 
sions  and  proposes  that  the  distinction  between 
politics  and  administration  is  not  as  clear  as  it  has 
been  considered  in  the  past. 

Development  of  Administrative  Theories 

The  European  tradition  of  public  administrative 
theory  was  based  on  Max  Weber’s  conception  of 
bureaucracy  as  a  formal-legal  institution  responsi¬ 
ble  for  implementing  public  law.  This  model  was 
an  ideal  type  that  represented  the  highest  form  of 
rational-legal  development  for  the  state. 
Furthermore,  it  was  designed  to  ensure  equal  treat¬ 
ment  of  clients  through  formal  rules  and  through 
careful  maintenance  of  files.  While  bureaucracy  is 
generally  a  negative  term  in  ordinary  usage,  its 
original  intentions  were  more  positive.  In  particu¬ 
lar,  the  bureaucracy  concept  includes 

1.  division  of  labor  and  division  of  authorities, 

2.  hierarchy  of  administrative  bodies,  and 

3.  career  system  as  the  basis  for  the  organization. 

Public  administration  in  the  European  tradition 
has  been  based  on  law,  with  the  emphasis  on  the 
legal  foundations  of  all  public  actions.  Furthermore, 
executing  public  duties  defined  by  law  is  incorpo¬ 
rated  into  constitutional  formula  of  executive 
authorities  (in  countries  that  have  a  written  consti¬ 
tution),  among  legislative  and  judicature.  As  David 
Rosenbloom  and  Robert  Kravchuk  (2005)  note, 
negatively  defining  administration  emphasizes  the 
autonomy  of  administrative  agencies  from  legisla¬ 
tive  and  judicature,  also  indicating  that  adminis¬ 
trative  actions  affect  rights  and  duties  of  private 
subjects.  A  legal  approach  to  public  administra¬ 
tion,  defined  by  lawyers,  brings  out  the  legal  limits 
of  administration  and  its  executive  nature;  how¬ 
ever,  as  some  scientists  rightfully  state,  public 
administration  cannot  consist  solely  of  executing 
legal  acts — similarly,  it  cannot  be  simple  execution 
of  policies,  created  without  its  involvement. 

The  American  tradition  in  public  administra¬ 
tion  has  been  based  more  on  the  work  of  Woodrow 
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Wilson  and  his  concern  with  the  relationship 
between  administration  and  politics.  Wilson’s 
famous  essay,  titled  The  Study  of  Administration 
(1887),  concentrated  on  organization  and  methods 
of  activity  of  administrative  agencies,  particularly 
emphasizing  how  administrative  tasks  could  be 
performed  to  achieve  the  highest  possible  effi¬ 
ciency.  Wilson  also  clearly  distinguished  spheres  of 
politics  and  administration  and  initiated  the 
so-called  politico-administrative  dichotomy.  He 
argued  that  what  distinguishes  politics  from 
administration  is  that  administration  executes  the 
will  of  the  state  as  identified  through  the  political 
process.  Numerous  scholars  such  as  Frank  J. 
Goodnow,  William  F.  Willoughby,  and  Leonard 
White  built  on  Wilson’s  work  to  form  the  begin¬ 
ning  of  modern  administrative  theory. 

Following  Wilson’s  so-called  classic  contribu¬ 
tion,  public  administration  authorities  attempted 
to  develop  basic  principles  of  public  administra¬ 
tion.  The  classic  doctrine  assumed  that  democracy 
and  efficient  administration  were  compatible  and 
that  the  implementation  activities  of  government 
could  be  analyzed  scientifically.  The  early  theorists 
of  management  have  emphasized  that  organization 
and  control  can  be  recognized  almost  as  synonyms, 
because  almost  at  the  same  time  the  other  trend  of 
public  administration  patterns  named  scientific 
management  was  developing. 

Wilson  was  interested  in  management  science, 
and  Frederick  W.  Taylor,  recognized  as  the  father 
of  management  science,  began  research  that 
resulted  in  The  Principles  of  Scientific  Management 
(1911).  The  assumption  of  this  work  was  that 
there  is  a  single  best  way  to  achieve  any  assigned 
aim,  whereas  management  science  was  supposed 
to  help  achieve  the  increase  of  efficiency  by  invent¬ 
ing  the  fastest,  the  most  efficient,  and  the  least 
“tiresome”  production  methods.  This  emphasis  on 
management  led  to  an  identification  of  adminis¬ 
trative  science  with  bureaucratic  forms  of  organi¬ 
zation.  The  essential  scopes  of  interests  of  academ¬ 
ics  and  practitioners  in  the  area  of  public  adminis¬ 
tration  have  become  division  of  work,  the  scope  of 
control,  hierarchy  of  organization,  the  chain  of 
official  internal  instructions,  the  reporting  system, 
departmentalization,  development  of  standards, 
politics,  and  activity  procedures. 

The  French  engineer  Henri  Fayol  was  another 
significant  influence  on  administrative  theory  and 


scientific  management.  Fayol  studied  company 
management  seeking  to  enhance  efficiency. 
According  to  his  theory,  administration  is  recog¬ 
nized  as  the  social  science  ruled  by  similar  rules  to 
natural  sciences.  Fayol  analyzed  administration  in 
terms  of  the  functions  of  industrial  organizations, 
which  he  classified  as  planning,  organizing,  com¬ 
manding,  coordinating,  and  controlling.  Fayol  was 
a  rationalist  who  understood  management  of  both 
private  sector  (companies)  and  public  sector  (e.g., 
public  administration)  organizations  to  be  exactly 
the  same  thing.  These  rules,  which  he  identified  for 
the  private  sector,  apply  to  the  public  one  as  well. 

Classical  administrative  theory  was  developed 
further  during  the  1930s.  This  period  particularly 
emphasized  the  inside  aspects  of  public  adminis¬ 
tration:  management  methods  and  problems, 
structures  and  activities  of  organizations,  budget 
procedures,  and  staff  problems.  Luther  Gulick’s 
approach  was  contained  in  an  abbreviation — 
POSDCORB,  which  management  should  meet  as 
follows: 

Planning — planning,  elaborating  the  direction  and 
methods  of  planned  actions 

Organizing — creating  formal  structure  of  power, 
division  into  organizational  units,  which  cooperate 
for  an  assigned  aim 

Staffing — locating,  improving,  and  developing  staff 

Directing — ordering,  decision-making  process,  and 
handling  instructions 

Coordinating — combining  separate  parts  of  the 
same  action 

Reporting — informing  all  actors  in  the  process  of 
accomplishing  a  plan 

Budgeting — financial  planning,  accountancy  control 

The  classical  theory  was  challenged  through 
decision-making  approaches,  notably  those  associ¬ 
ated  with  Herbert  Simon  and  bounded  rationality. 
His  1947  book,  Administrative  Behavior,  served 
as  the  foundation  for  the  development  of  the 
so-called  Carnegie  School  that  emphasized  the  lim¬ 
its  on  rationality  and  the  organizational  basis  of 
decisions.  This  school  then  served  as  the  founda¬ 
tion  for  the  rebirth  of  institutional  analysis  in 
political  science  and  public  administration. 
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Simon  declared  that  making  administrative 
decisions  lies  at  the  heart  of  public  administration, 
emphasizing  the  feasibility  of  using  scientific  meth¬ 
ods  to  study  public  administration;  he  argued  that 
logical  positivism  was  the  right  approach  in  terms 
of  creating  administrative  policies.  He  sought  to 
develop  a  social  scientific  approach  to  administra¬ 
tion  rather  than  the  principles  associated  with  the 
then  conventional  wisdom  of  “principles  of  admin¬ 
istration.”  Simon  disapproved  Gulick’s  approach 
and  his  POSDCORB,  claiming  it  to  be  inappropri¬ 
ate  for  many  situations  that  civil  servants  face.  He 
created  the  idea  of  limited  rationality  in  the  deci¬ 
sion  making,  recognizing  that  people  are  rational 
in  making  decisions,  but  only  to  a  certain  extent. 
Simon  acknowledged  that  decisions  made  by  civil 
servants  are  rational  in  their  view,  but  at  the  same 
time  he  pointed  to  many  alternative  variants  that 
lead  to  achieving  the  most  rational  decision. 
According  to  Simon,  the  new  paradigm  for  the 
study  of  public  administration  means  that  there 
should  exist  two  trends  in  the  discipline,  harmo¬ 
nizing  and  mutually,  intellectually  stimulating: 

1.  One  trend  concentrated  on  progress  of  pure 
administrative  science,  strongly  based  on 
fundamentals  of  social  psychology  to 
understand  decision  making  more  effectively. 

2.  Another,  more  representative  trend,  aimed  at 
creating  guidelines  of  public  policies  that  could 
guide  decisions,  albeit  without  the 
understanding  of  decision-making  theory 
implied  in  the  first  principle. 

Robert  A.  Dahl  (1947),  one  of  the  most  impor¬ 
tant  early  behaviorists  of  political  science,  ana¬ 
lyzed  “state  of  the  art”  of  administrative  science. 
According  to  him,  administrative  science  should 

1 .  recognize  the  complexity  of  human  behavior, 

2.  relate  to  the  problems  of  values  eligible  in 
administrative  situations,  and 

3.  consider  the  connections  between  public 
administration  and  surrounding  environment. 

Critics  of  Simon’s  fundamentals  of  public  ad¬ 
ministration  and  political-administrative  dichot¬ 
omy  were  also  supported  by  another  scientist, 
Dwight  Waldo,  who  defined  public  administration 


from  the  perspective  of  the  surrounding  environ¬ 
ment,  emphasizing  the  importance  of  comparative 
administration.  From  Waldo’s  point  of  view,  pub¬ 
lic  administration  should  be  a  branch  of  political 
science. 

The  more  political  approach  to  public  adminis¬ 
tration  has  led  to  the  conclusion  that  public 
administration  is  not  only  accountable  to  elected 
public  officials  (members  of  parliament  and  execu¬ 
tive  authorities)  but  also  responds  to  their  needs,  in 
other  words,  being  an  instrument  in  politicians’ 
hands.  The  importance  of  public  administration 
from  this  perspective  is  stressed  by  the  fact  that  it 
has  the  discretional  authority.  B.  Guy  Peters  con¬ 
siders  public  administration  to  have  a  broader 
scope  than  a  typical  bureaucracy  (in  terms  of  ran¬ 
dom  organization  of  administrative  character)  and 
emphasizes  that  it  is  an  integral  part  of  decisive 
processes  of  executive  authorities. 

Contemporary  Challenges 

Public  administration  today  tends  to  equate  public 
administration  and  governance.  Etymologically, 
the  term  governance  derives  from  the  Greek  word 
kuberndn  (piloting,  steering)  and  was  used  by 
Plato  to  describe  the  creation  of  systems  of  rules. 
The  Greek  source  gave  birth  to  the  Latin  guber- 
nare ,  with  a  similar  meaning  (piloting,  making 
rules,  steering).  Governing,  as  we  make  a  reference 
to  it  here,  is  close  to  the  term  of  managing  (lead¬ 
ing),  conducted  by  governments  and  their  adminis¬ 
tration.  Governing  in  this  approach  is  connected  to 
the  new  forms  of  actions  of  public  administration, 
which  are  difficult  to  classify  strictly  as  executing 
the  law.  For  example,  the  World  Bank  created  a 
definition  of  “governance”  meaning  the  traditions 
and  institutions  by  which  authority  is  exercised  for 
the  common  good.  This  approach,  portraying 
“different”  governing  (with  regard  to  traditional 
administration),  is  close  to  the  new  public  manage¬ 
ment  (NPM)  conception  and  forms  the  base  for 
further  analysis  of  public  administration  and  mak¬ 
ing  new  trends,  such  as  new  public  service  (NPS). 

The  NPM  has  been  a  recent  and  pervasive  coun¬ 
terpoise  to  the  emphasis  on  law  in  most  continen¬ 
tal  (and  Latin  American)  administrative  systems. 
This  approach  to  public  administration  empha¬ 
sizes  the  economic  foundations  of  governing  and 
argues  that  public  and  private  management  are 
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essentially  the  same.  Therefore,  this  theory  of 
administration  in  the  public  sector  separates  poli¬ 
tics  and  administration  even  more  than  in  the  tra¬ 
ditional  Wilsonian  and  Weberian  formulations. 
The  NPM  approach  represents  a  recent  theory  for 
public  administration,  but  as  its  influence  wanes  at 
the  beginning  of  the  2f  st  century,  there  is  as  yet  no 
clear  replacement.  That  said,  the  political  and 
managerial  weaknesses  inherent  in  NPM  have  pro¬ 
duced  a  number  of  important  reactions  in  practice, 
if  not  in  theory. 

Patrycja  J.  Suwaj 
University  of  Bialystok 
Bialystok,  Poland 
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Advocacy 


Advocacy  is  a  kind  of  political  action  addressed  to 
a  governing  body  with  the  aim  of  influencing  pub¬ 
lic  policy  outputs.  Though  advocacy  is  often 
equated  with  lobbying,  it  is  in  fact  an  umbrella 
term  for  organized  activism  related  to  a  particular 
set  of  issues.  Lobbying,  meanwhile,  is  the  practice 
of  influencing  a  governing  body  through  direct 
communication  with  legislators.  Advocacy  groups 
strive  to  change  public  policy  according  to  their 
interests  without  seeking  political  mandates.  The 


definition  emphasizes  the  existence  of  some  kind 
of  organization,  ranging  from  loosely  coupled 
networks  to  highly  formalized  nonmembership 
organizations.  It  includes  social  movements,  vol¬ 
untary  associations,  or  membership  groups  as  well 
as  institutions  such  as  corporations,  universities, 
cities,  or  foundations.  This  entry  discusses  the 
major  actors,  targets,  and  strategies,  and  their 
respective  impacts. 

Actors 

The  increase  in  the  number  of  advocacy  groups  is 
one  of  the  most  striking  and  still  ongoing  trends  in 
Western  societies.  The  number  of  citizens  willing 
to  join  an  interest  group  has  increased  over  many 
decades.  In  the  United  States,  which  has  been  in 
the  vanguard,  this  so-called  advocacy  explosion 
started  around  1960.  Whereas  between  1920  and 
1960  the  number  of  interest  groups  represented  in 
Washington  merely  doubled,  it  quadrupled  within 
the  following  20  years.  Similarly,  at  the  end  of  the 
1980s,  the  number  of  interest  groups  represented 
at  the  European  Union  (EU)  in  Brussels  was 
10  times  higher  than  it  was  at  the  beginning  of  the 
1970s,  and  it  quadrupled  between  1985  and  1991. 
The  rapid  growth  in  number  was  accompanied  by 
a  structural  change  with  respect  to  the  scope, 
intensity,  and  shape  of  advocacy.  Today,  advocacy 
is  more  diverse  and  much  more  professional  than 
it  was  a  few  decades  ago,  including  an  increase  in 
staff  acting  as  in-house  lobbyists,  a  growth  of  dif¬ 
ferent  kinds  of  lobbying  firms  performing  surro¬ 
gate  representation,  and  the  development  of  more 
and  more  forms  of  direct  and  indirect  lobbying. 

In  the  first  half  of  the  20th  century,  the  universe 
of  advocacy  groups  in  the  Western  world  was 
dominated  by  trade,  professional,  and  labor  orga¬ 
nizations.  For  example,  before  1950  in  the  United 
States,  these  three  types  of  groups  accounted  for 
over  three  quarters  of  all  organized  interests  active 
in  politics.  Beginning  in  the  1960s,  however,  we 
can  find  a  significant  decline  in  the  rate  of  union¬ 
ization,  first  of  all  in  the  United  States,  and,  since 
the  1980s,  also  in  most  Western  European  coun¬ 
tries.  Although  the  number  of  the  other  types  of 
established  groups  continued  to  increase  even  in 
the  second  half  of  the  century,  their  share  of  the 
group  universe  declined  due  to  an  even  more  rapid 
growth  of  various  types  of  organizations  from  the 
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nonprofit  sector  concerned  with  health,  social  wel¬ 
fare,  culture,  education,  public  affairs,  consumers, 
senior  citizens,  taxpayers,  civil  rights,  women’s 
rights,  human  rights,  the  environment,  religion, 
and  other  areas.  Many  of  these  groups  originated 
in  the  social  movements  of  the  1960s  and  1970s, 
which  mobilized  their  constituencies  to  turn  the 
public’s  attention  to  new  interests  and  new  politi¬ 
cal  problems.  Parts  of  these  movements  turned 
into  associations  that  would  allow  the  groups  to 
maintain  their  capacity  to  advocate  even  in  times 
of  declining  mobilization. 

The  change  in  the  composition  of  the  group 
universe  was  paralleled  by  a  process  of  differentia¬ 
tion  in  the  mode  of  organization.  The  advocacy 
explosion  resulted  not  only  from  a  rise  in  the  num¬ 
ber  of  voluntary  associations  but  also — and  per¬ 
haps  even  more — from  the  fact  that  more  and 
more  institutions,  such  as  corporations,  govern¬ 
mental  entities,  universities,  hospitals,  and  think 
tanks,  have  begun  to  act  as  interest  groups.  Many 
advocacy  groups  thus  no  longer  represent  the 
aggregate  interests  of  individual  group  members 
but  are  lobbying  on  behalf  of  corporate  actors, 
which  do  not  have  any  members  at  all.  Today,  we 
find  a  highly  differentiated  pattern  of  advocacy  in 
almost  all  Western  countries  consisting  of  a  great 
variety  of  types  of  organizations,  including  broad- 
based  membership  groups  such  as  labor  unions, 
professional  groups,  and  citizen  groups,  as  well  as 
institutions  and  groups  with  corporate  members, 
such  as  trade  associations  and  complex  umbrella 
associations. 

The  different  developments  in  the  group  universe 
were  induced  by  several  factors.  First  of  all,  the 
rapid  social  and  economic  changes  in  Western 
countries  have  created  rising  numbers  of  potential 
members  and  an  overall  growth  in  organizational 
resources.  As  people  have  become  more  educated, 
more  articulate,  and  wealthier,  and  as  knowledge 
and  information  have  become  more  widespread, 
new  interests  have  emerged,  and  traditional  ones 
have  been  redefined.  Accelerated  by  advanced  com¬ 
munication  technologies,  the  translation  of  interests 
into  formal  group  organizations  became  a  cumula¬ 
tive  process,  stimulating  other  constituencies  to 
organize  or  to  amplify  political  activities.  Recently, 
another  determinant  of  the  expansion  of  the  advo¬ 
cacy  group  system  has  come  to  the  fore:  govern¬ 
ment  activity.  Whenever  government  activity  is 


expanded  to  new  issues  or  existing  government 
bodies  gain  new  jurisdictions,  new  groups  will 
form  or  existing  ones  will  focus  their  activities  on 
these  new  political  areas.  In  spite  of  these  driving 
forces,  advocacy  growth  may  slow  down  or  reach 
a  limit  in  the  future,  as  it  will  become  increasingly 
difficult  for  organizers  to  find  interests  not  yet 
politically  represented  by  any  group  or  to  mobilize 
for  advocacy  in  view  of  scarce  resources. 

Levels  and  Targets 

The  targets  of  advocacy  differ  depending  on  the 
institutional  settings  and  the  decision-making  pro¬ 
cedures  of  the  political  system.  Specific  structures 
such  as  the  separation  of  powers  or  the  formal  and 
informal  rules  of  policy  making  are  critical  to 
advocacy  groups’  decisions  as  to  whom  to  lobby 
and  where  to  try  to  gain  access.  Whereas  in  cen¬ 
tralized  systems  advocacy  will  be  focused  on  actors 
and  institutions  of  the  central  authority,  multilevel 
or  federated  systems  show  a  much  more  differenti¬ 
ated  pattern  of  advocacy.  For  example,  despite 
Germany  being  a  federal  state,  advocacy  groups  in 
Germany  focus  on  the  federal  level,  particularly  on 
the  ministries  and  on  leading  parliamentarians, 
such  as  the  chairs  of  committees,  due  to  the  strong 
and  centralized  party  system.  In  contrast,  the 
American  federal  system,  even  at  the  local  level 
with  its  more  than  80,000  governments,  forms  a 
significant  political  target  since  legislators  are 
heavily  dependent  on  the  financial  and  political 
support  of  local  coalitions.  Finally,  in  the  highly 
differentiated  multilevel  system  of  the  EU,  advo¬ 
cacy  groups  find  many  points  of  access  and  several 
veto  points,  offering  a  variety  of  opportunities  to 
exert  influence  on  policy  outcomes. 

The  choice  of  political  targets  also  depends  on 
the  character  and  type  of  the  advocacy  group. 
While  traditional  interest  groups  are  apt  to  focus 
their  activities  on  the  national  level,  nongovern¬ 
mental  organizations  (NGOs),  most  of  them  repre¬ 
senting  the  interests  of  disadvantaged  groups, 
focus  on  the  international  level,  where  their  con¬ 
cerns  have  a  better  chance  of  being  accepted  than 
at  the  national  level.  Thus,  Eurogroups  tend  to  be 
experts  in  multilevel  strategies,  acting  at  the 
European  as  well  as  on  the  respective  national  lev¬ 
els,  depending  on  which  of  the  two  is  most  critical 
for  the  decision-making  process  in  which  they 
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engage.  Legislators  and  governing  boards,  how¬ 
ever,  not  only  are  passive  targets  of  advocacy 
groups,  but  they  often  have  an  impact  on  group 
structures  and  resources  as  well.  The  most  com¬ 
mon  instrument  is  the  financial  support  given  to 
groups,  which  otherwise  would  not  be  able  to 
organize.  Such  support  is  intended  to  create  coun¬ 
tervailing  forces  to  the  powerful  interest  groups 
that  tend  to  dominate  the  political  process  or  to 
help  form  coalitions  of  advocacy  groups  that  will 
improve  the  legislators’  chances  of  getting  reelected 
or  foster  their  political  plans. 

Social  change  and  changes  in  technology,  insti¬ 
tutional  settings,  and  decision-making  procedures 
will  often  prompt  advocacy  groups  to  address  new 
targets  and  different  levels  of  the  political  system. 
As  the  European  Parliament  has  gained  increasing 
power  and  responsibilities,  it  has  increasingly 
become  a  target  of  lobbying  activity.  Similarly, 
globalization  and  denationalization  have  created 
new  opportunity  structures  for  social  movement 
organizations  and  NGOs.  Relying  on  new  com¬ 
munication  technologies,  they  have  developed 
transnational  ties  and  coordination  mechanisms 
that  enable  them  to  organize  protest  on  the  inter¬ 
national  level  and  to  develop  countervailing  power 
to  the  policies  of  international  institutions  such  as 
the  World  Trade  Organization  or  the  International 
Monetary  Fund.  But  these  processes  of  adaptation 
may  fail  if  the  characteristics  of  the  advocacy 
groups  do  not  correspond  to  the  institutional  con¬ 
text  in  which  a  group  wishes  to  act.  For  instance, 
the  lack  of  a  genuine  European  public  sphere  and 
the  specific  decision-making  style  of  the  European 
institutions  involve  high  transaction  costs  for  the 
mobilization  of  mass  protest  in  Brussels.  Therefore, 
the  activities  of  social  movements  have  remained 
predominantly  national  up  to  now. 

Strategies  and  Tactics 

To  begin  with,  advocacy  includes  many  activities 
that  seem  to  have  little  to  do  with  lobbying  at  all, 
such  as  gathering  information,  monitoring  the 
political  process,  and  conducting  research. 
Nevertheless,  today  we  find  a  great  variety  of  tac¬ 
tics  and  strategies  applied  by  advocacy  groups  and 
lobbyists  to  make  decision  makers  willing  to  pro¬ 
duce  policy  outputs  consistent  with  their  interests. 
The  single  group,  however,  has  to  decide  which 


tools  to  apply  and  which  not,  since  resources  are 
limited.  As  a  rule,  advocacy  is  focused  on  a  small 
number  of  subjects.  Lobbyists  spend  most  of  their 
time  on  only  a  few  critical  issues,  whereas  they 
spend  little  time  on  many  others. 

For  a  long  time,  scholars  have  agreed  that  one 
should  distinguish  between  inside  strategies  (based 
on  personal  contacts  between  lobbyists  and  policy¬ 
makers)  and  outside  strategies  (public  relations 
and  grassroots  contacts).  This  distinction  has  been 
useful  for  finding  correlations  between  group  char¬ 
acteristics  or  other  factors  and  the  selection  of 
tools  and  for  describing  shifts  in  the  strategic  ori¬ 
entations  of  advocacy  groups.  There  is  a  lot  of 
evidence  demonstrating  that  citizen  groups  will 
prefer  outside  strategies  while  established  groups, 
mainly  business,  rely  heavily  on  direct  lobbying. 
Recently,  however,  the  overall  picture  has  become 
more  multifaceted.  While  business  groups  tend  to 
apply  multivoice  strategies  that  give  them  the 
chance  to  be  present  wherever  a  channel  of  influ¬ 
ence  opens  up,  social  movements  and  citizen 
groups  have  also  tried  to  apply  inside  strategies. 
Considering  the  growing  variety  of  strategies  and 
tactics,  the  distinction  between  inside  and  outside 
strategies  seems  to  become  obsolete  and  should  be 
replaced  by  a  more  differentiated  typology  as  sug¬ 
gested  by  Jeffrey  Berry,  who  identifies  four  differ¬ 
ent  advocacy  group  strategies,  namely, 

1.  litigation  and  administrative  intervention, 

2.  confrontation  (protest,  whistle-blowing,  public 
relations,  etc.), 

3.  information  (making  personal  representations  to 
government,  releasing  research  results),  and 

4.  constituency  influence. 

One  of  the  major  factors  that  has  accounted  for 
an  expansion  of  lobbying  activities  is  advocacy 
explosion.  The  rise  of  countervailing  power  has 
strengthened  competition  between  new  and  estab¬ 
lished  groups,  forcing  the  latter  to  spend  more  on 
lobbying  and  to  expand  their  action  repertory. 
Furthermore,  technological  innovation  in  informa¬ 
tion  processing  has  created  new  opportunities  for 
lobbyists  to  gain  access  to  policymakers  and  to 
apply  new  forms  of  lobbying,  particularly  those 
that  address  the  general  public,  such  as  media-  or 
issue-based  campaigns.  These  trends  have  made  it 
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necessary  for  advocacy  groups  to  acquire  new 
skills  and  more  professional  knowledge,  required 
to  apply  the  new  tools  in  an  efficient  and  effective 
way.  The  rising  demand  for  advocacy  advice,  in 
turn,  is  the  main  reason  why  direct  representation 
by  in-house  lobbyists  has  increasingly  been  supple¬ 
mented  by  surrogate  representation  through  lob¬ 
bying  firms.  Acting  in  the  name  of  advocacy 
groups,  such  firms  can  help  manage  the  problems 
posed  by  new  developments  in  politics  and  com¬ 
munication  technology.  Lobbying  firms  are  com¬ 
mon  primarily  in  the  United  States,  where  they 
account  for  about  20%  of  all  lobbyists  present  in 
the  capital  and  where  they  form  a  special  sector 
with  different  types  of  businesses.  In  the  past  2 
decades,  we  have  also  seen  a  rapid  growth  of  lob¬ 
bying  firms  in  the  EU.  The  increasing  scope  and 
density  of  EU  regulations  have  created  a  rapidly 
rising  demand  for  professional  advice  on  how  to 
effectively  lobby  the  European  institutions. 

Structures 

Advocacy  not  only  involves  unilateral  political 
activities  by  single  groups  or  their  agents  but  usu¬ 
ally  results  in  more  or  less  intensive  and  stable 
relationships  with  other  groups  and  with  the 
groups’  counterparts  in  the  legislative  and  execu¬ 
tive  branches  of  government.  These  relationships 
come  about  if  advocacy  groups  have  at  their  dis¬ 
posal  certain  resources — be  it  power  based  on  the 
control  of  their  membership  or  information  that 
their  counterparts  are  interested  in — and,  in  turn, 
if  other  groups  or  policymakers  can  offer  them 
valuable  information  or  give  them  the  opportunity 
to  get  directly  involved  in  the  formulation  and 
implementation  of  policies.  One  basic  element  of 
the  structure  of  advocacy  is  thus  an  exchange  rela¬ 
tion  between  private  and  state  actors.  In  a  given 
policy  sector,  all  the  actors  that  are  connected  in 
this  way  form  a  specific  policy  network  character¬ 
ized  by  the  number  and  type  of  its  members,  its 
stability,  and  its  structure.  Basic  elements  of  a 
political  system,  such  as  the  characteristics  of  gov¬ 
ernment  and  the  mode  of  interest  group  participa¬ 
tion  in  decision  making,  form  a  framework  that 
promotes  the  development  of  certain  types  of 
networks  and  helps  them  persist.  By  contrast,  as 
competition  between  groups  has  increased,  the 
established  actors  have  been  forced  to  open  up 


policy  networks  and  to  integrate  new  advocacy 
groups  in  the  policy-making  process. 

Since  decision  making  in  contemporary  poli¬ 
tics  is  a  very  complex  process,  the  scope  and 
density  of  policy  networks  vary  to  a  considerable 
extent,  making  it  an  empirical  question  how  sta¬ 
ble  the  relationships  between  actors  are  and  how 
power  resources  are  distributed.  Most  scholars 
distinguish  between  several  types  of  networks 
that  can  be  arranged  in  a  continuum  ranging 
from  very  open  to  rather  closed  patterns  of  inter¬ 
action.  Whereas  in  the  American  context  scholars 
tend  to  distinguish  between  “iron  triangles”  or 
subgovernments  at  the  one  extreme  and  issue 
networks  at  the  other,  the  European  debate  on 
policy  networks  is  shaped  by  the  opposition  of 
corporatism  and  pluralism.  In  spite  of  these  dif¬ 
ferences  in  terminology,  descriptions  of  the  two 
extreme  types  are  similar.  While  the  former  is 
characterized  by  highly  restricted  participation, 
stability  of  interactions,  and  centralized  decision 
making,  the  main  features  of  the  latter  are  decen¬ 
tralization,  a  high  level  of  conflict,  dynamism, 
and  broad  participation. 

The  academic  discourse  on  pluralism  and  cor¬ 
poratism  is  characterized  by  a  typical  asymmetry. 
As  Grant  Jordan  stated,  pluralism  is  ultimately  no 
more  than  an  antitheory,  which  is  marked  by  what 
it  excludes  rather  than  by  what  it  establishes.  First 
of  all,  it  describes  a  complex  and  confusing  con¬ 
figuration  of  political  actors  that  is  open  for  new 
advocacy  groups  and  where  power  is  widely  dis¬ 
persed.  Its  core  element  is  open  competition 
between  advocacy  groups  for  political  influence, 
which  prevents  any  single  group  from  dominating 
the  political  process.  Since  pluralism  involves  a 
great  variety  of  single  acts  of  exchange  between 
private  groups  and  legislators,  the  results  of  politi¬ 
cal  decision  making  are  highly  unpredictable.  In 
the  corporatist  pattern,  however,  a  small  number 
of  advocacy  groups  have  considerable  resources  at 
their  disposal,  in  particular  technical  information 
on  the  implementation  process  that  makes  them 
essential  partners  of  the  legislators,  with  whom 
they  work  in  close  coordination.  Corporatist 
arrangements  consist  of  stable  sets  of  actors  in 
which  the  privileged  advocacy  groups  get  the 
chance  to  considerably  influence  policy  outcomes 
in  return  for  their  cooperation  in  the  implementa¬ 
tion  of  a  policy.  As  less  resourceful  societal  actors 
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are  excluded,  corporatism  tends  to  subdivide  the 
group  universe  into  insiders  and  outsiders. 

Whether  corporatist  arrangements  have  a  chance 
to  develop  depends  on  the  institutional,  political, 
and  social  context.  In  political  systems  with  a 
strong  central  state  and  encompassing  umbrella 
organizations,  as  in  several  European  countries, 
policy  processes  have  been  dominated  for  decades 
by  interelite  relations  often  based,  as  in  Germany, 
on  the  public  status  of  the  major  private  actors.  By 
contrast,  in  the  United  States,  the  fragmentation  of 
economic  interest  groups  and  the  division  of 
American  government  has  prevented  corporatist 
patterns  from  developing.  Since  labor  unions  have 
always  been  weak,  there  was  no  strong  stimulus  for 
business  organizations  to  develop  powerful  peak 
associations.  So  competition  between  groups  of 
labor  and  capital  has  remained  high  and  discour¬ 
aged  single  groups  from  becoming  privileged  part¬ 
ners  of  the  legislators.  Besides,  the  American  politi¬ 
cal  system  offers  little  opportunity  to  channel 
advocacy  group  activities  in  a  way  that  would 
favor  the  development  of  corporatist  arrangements. 
Federalism,  the  separation  of  powers,  the  fragmen¬ 
tation  of  power  within  both  Congress  and  the 
executive,  and  the  lack  of  a  strong  system  of  party 
discipline  are  factors  that  favor  pluralist  rather 
than  corporatist  patterns  of  interest  intermediation. 

Impact 

In  the  modern  world  of  politics,  the  political  activ¬ 
ity  of  advocacy  groups  is  only  one  of  a  multiplicity 
of  factors  having  an  impact  on  public  policy  deci¬ 
sions.  Therefore,  methodological  approaches  that 
attempt  to  isolate  this  factor  from  the  institutional 
and  situational  context  in  which  it  is  embedded 
have  produced  unsatisfactory  results.  Research  on 
advocacy  group  influence  came  up  with  better 
results  when  it  stopped  asking  whether  or  to  what 
extent  advocacy  groups  are  powerful  at  all  and 
instead  turned  to  the  less  fundamental  question  of 
under  which  circumstances  they  have  an  impact  on 
policy  outcomes.  Today  we  are  able  to  identify  the 
structural  and  situational  factors  that  help  explain 
why  advocacy  is  successful  in  some  cases  but  in  oth¬ 
ers  has  no  impact  at  all.  The  five  determinants  are 
as  follows: 

1.  group-related  characteristics  such  as  size  and 
type  of  membership;  resource  endowment, 


particularly  with  regard  to  money;  and  the 
tactical  and  strategic  repertory; 

2.  the  features  of  the  group  universe,  that  is,  the 
number  of  groups  in  a  policy  sector  and  the 
degree  of  cooperation  and  conflict; 

3.  legislators’  demand  for  group  resources  such  as 
information  and  political  support; 

4.  the  institutional  framework  as  a  channel  for  the 
distribution  of  opportunities  to  gain  access  to 
decision  makers  or  to  gain  legal  privileges;  and 

5.  issue  characteristics,  such  as  policy  type,  degree 
of  technicality,  and  public  salience. 

At  first,  it  seems  reasonable  to  assume  that, 
other  factors  being  equal,  advocacy  groups  that  are 
well  endowed  with  different  resources  will  be  more 
influential  than  groups  that  are  less  endowed.  The 
former  will  be  able  to  employ  more  staff  and  pur¬ 
sue  more  multifaceted  tactics  and  strategies  than 
the  latter.  Furthermore,  it  is  often  suggested  that  an 
imbalance  of  influence  is  due  to  the  unequal  distri¬ 
bution  of  money.  Indeed,  there  are  many  examples 
of  rich  groups,  particularly  corporations  and  trade 
associations,  gaining  easier  access  to  decision  mak¬ 
ers,  being  present  at  more  levels  of  government, 
and  supplying  decision  makers  with  more  valuable 
expertise  than  poorer  groups  are  able  to  do.  But 
there  are  only  few  academic  studies  that  would 
confirm  the  assumption  that  political  outcomes  are 
skewed  toward  the  interests  of  the  advocacy  groups 
that  spend  the  most  on  lobbying.  Money  has  only 
a  limited  effect,  since  there  are  other  resources, 
such  as  public  support,  a  large  number  of  members, 
or  political  credibility,  that  may  outweigh  material 
affluence  and,  still  more  important,  that  cannot  be 
bought  on  the  political  market.  Moreover,  recent 
research  results  suggest  that  group-related  factors, 
such  as  money  and  membership  and  also  tactics 
and  strategies,  all  in  all  may  have  only  a  limited 
impact  on  political  outcomes. 

Keeping  the  other  factors  in  mind,  these  find¬ 
ings  do  not  come  as  a  surprise.  First  of  all,  in  a 
competitive  group  universe  most  attempts  by 
advocacy  groups  to  exert  influence  evoke  counter¬ 
acting  activities  by  rival  groups.  For  example,  a 
sector  such  as  business  is,  in  most  cases,  unable  to 
act  as  a  unit  for  it  is  divided  by  too  many  conflict¬ 
ing  interests.  Furthermore,  the  advocacy  explosion 
has  intensified  competition  by  creating  more  and 
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more  countervailing  pressures.  Second,  advocacy 
success  is  determined  by  the  behavior  of  legisla¬ 
tors.  Whereas  legislators  are  often  considered  as 
passive  targets,  in  fact  they  are  active  players  shap¬ 
ing  their  relationship  with  advocacy  groups  accord¬ 
ing  to  their  interests.  Whether  a  decision  maker  is 
influenced  by  a  certain  group  depends  on  his  or 
her  political  goals,  strategies,  and  demand  for 
resources.  Third,  in  any  political  system,  institu¬ 
tions  form  a  screen  that  is  beneficial  to  some  advo¬ 
cacy  groups  and  discriminates  against  others. 
Generally,  institutional  mechanisms  may  preclude 
advocacy  groups  from  converting  resources  into 
power.  If  the  structures  and  strategies  of  a  group 
do  not  match  up  with  the  political  institutions,  as 
is  the  case  with  social  movements  on  the  EU  level, 
even  resourceful  groups  may  remain  politically 
ineffective.  Fourth,  the  likelihood  of  an  advocate 
prevailing  in  a  political  conflict  also  depends  on 
the  issue  at  stake,  that  is,  on  the  constellation  of 
actors  interested  in  the  issue  and  their  positions  on 
the  subject.  The  more  groups  that  are  involved  in 
the  issue  and  the  higher  the  level  of  conflict,  the 
more  interests  legislators  have  to  take  into  account. 
So  big  issues  have  an  adverse  effect  on  lobbying 
success. 

As  a  result,  the  impact  of  advocacy  is  limited, 
depending  on  factors  such  as  the  scope  of  the  pol¬ 
icy  network,  the  level  of  conflict,  the  number  of 
allies  and  opponents,  and  so  on.  Since  every  policy 
process  is  characterized  by  a  specific,  if  not  unique, 
constellation  of  structural  and  situational  ele¬ 
ments,  the  effect  of  group  tactics  and  strategies  is 
highly  uncertain.  Provided  that  the  combination  of 
factors  in  a  certain  policy  arena  is  constantly 
changing,  as  is  often  the  case,  the  results  of  public 
policy  decisions  are  highly  unpredictable,  making 
it  extremely  unlikely  for  single  groups  to  dominate 
the  political  process  over  time.  But  we  will  also 
find  political  contexts  with  a  smaller  number  of 
actors  and  more  stable  patterns  of  participation 
where  policy  outcomes  are  less  indeterminate. 
Advocacy  groups  that  are  formally  included  in 
policy-making  processes,  that  deal  with  an  issue 
that  does  not  attract  the  interest  of  the  media,  and 
that  act  in  a  political  niche  where  countervailing 
pressures  are  low  will  have  a  much  better  chance 
to  influence  policy  outcomes  than  groups  that  act 
under  largely  varying  conditions.  As  structural  and 
situational  factors  can  be  considered  as  variables, 
the  level  of  uncertainty  is  a  variable  too,  featuring 


different  contexts  that  are  sometimes  favorable  to 

some  groups  and  sometimes  to  others. 
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Advocacy  Coalition 
Framework 


The  advocacy  coalition  framework  (ACF)  was 
originally  designed  by  Paul  Sabatier  and  Hank 
Jenkins-Smith  in  the  late  1980s  to  explain  the 
political  behavior  of  actors  in  the  policy  process. 
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The  ACF  provides  researchers  a  theoretical  lens 
for  organizing  actors  into  coalitions  by  offering  a 
set  of  hypotheses  for  explaining  coalition  struc¬ 
ture  and  behavior,  the  role  of  science  and  technol¬ 
ogy  in  the  policy  process,  and  the  factors  condu¬ 
cive  to  learning  and  policy  change.  For  more  than 
2  decades,  the  ACF  has  been  applied  in  Europe, 
Canada,  the  United  States,  and  developing  coun¬ 
tries  to  a  range  of  topics  including  health  policy, 
environmental/energy  policy,  economic  policy, 
disaster  policy,  and  education  policy. 

Sabatier  and  Jenkins-Smith  created  the  ACF  in 
response  to  what  they  saw  as  three  shortcomings 
in  the  policy  process  literature.  The  first  was  their 
interpretation  of  the  stages  heuristic  as  both  an 
inadequate  causal  theory  and  an  inaccurate  por¬ 
trayal  of  the  policy  process.  The  second  was  a  lack 
of  progress  to  resolve  the  debate  about  top-down 
and  bottom-up  approaches  to  implementation 
research  and  a  need  for  system-based  theories  of 
policy  making.  The  third  was  the  apparent  lack  of 
theory  about  the  role  of  scientific  and  technical 
information  in  the  policy  process.  In  response,  the 
ACF  was  created  as  a  system-based  model,  inte¬ 
grating  most  of  the  policy  stages,  incorporating  the 
best  aspects  of  the  top-down  and  bottom-up 
approaches  to  implementation,  and  creating 
hypotheses  that  involve  the  role  of  scientific  and 
technical  information  in  learning  and  policy 
change.  This  entry  provides  a  broad  overview  of 
the  assumptions,  concepts,  and  hypotheses  in  the 
ACF  and  ends  with  directions  for  future  research. 

Foundation 

The  structural  underpinnings  that  the  ACF  begins 
with  are  a  set  of  assumptions  involving  a  central 
role  for  scientific  and  technical  information  in 
policy  processes,  a  time  perspective  of  10  years  or 
more  to  understand  policy  change,  and  the  assign¬ 
ment  of  policy  subsystems  as  the  primary  unit  of 
analysis.  Policy  subsystems  are  subunits  of  politi¬ 
cal  systems,  focused  on  a  topical  area  and  involv¬ 
ing  specialized  actors.  The  ACF  guides  researchers 
to  study  a  broad  set  of  subsystem  actors  including 
officials  from  all  levels  of  government,  nongovern¬ 
ment  actors,  consultants,  scientists,  and  members 
of  the  media.  For  these  subsystem  actors,  the  ACF 
assumes  a  mental  model  based  on  limited  abilities 
to  process  stimuli,  a  tendency  to  remember  losses 


more  than  gains,  and  a  need  to  rely  on  beliefs  as 
the  principal  heuristic  to  simplify,  filter,  and  some¬ 
times  distort  stimuli. 

The  ACF  assumes  that  actors  are  politically 
driven  by  their  beliefs  and  that  their  policies  and 
programs  are  best  thought  of  as  translations  of 
those  beliefs.  A  three-tiered  model  of  a  belief  sys¬ 
tem  for  its  actors  is  depicted.  Within  the  belief- 
systems  model,  the  broadest,  most  rigid,  and  pre¬ 
dominately  normative  are  deep  core  beliefs ,  such  as 
liberal  and  conservative  beliefs  or  the  relative  con¬ 
cern  for  the  welfare  of  present  versus  future  gen¬ 
erations.  Of  moderate  scope  and  spanning  the 
substantive  and  geographic  breadth  of  a  policy 
subsystem  are  policy  core  beliefs.  Policy  core 
beliefs  are  rigid  but  are  more  adaptive  in  response 
to  new  experiences  and  information  than  deep  core 
beliefs.  The  most  substantively  and  geographically 
narrow  and  the  most  empirical  are  secondary 
beliefs.  Compared  with  deep  core  and  policy  core 
beliefs,  secondary  beliefs  are  more  likely  to  change. 

At  the  heart  of  the  ACF  is  the  coalition  concept. 
Coalitions  consist  of  actors  with  shared  policy  core 
beliefs  who  engage  in  nontrivial  degree  of  coordi¬ 
nation,  ranging  from  developing  joint  plans  to 
sharing  information.  The  goal  of  a  coalition  is  the 
attainment  of  policy  objectives,  meaning  the  pro¬ 
tection  or  change  in  government  policies.  The 
behavior  of  a  coalition  toward  the  achievement  of 
their  policy  objectives  varies  considerably  and 
depends  on  their  resources  and  on  the  resources  of 
their  opponents,  all  of  which  is  constrained  by 
external  subsystem  affairs.  Common  coalition 
strategies  include  (a)  maintaining  a  long-term  per¬ 
spective  and  involvement  in  a  policy  subsystem, 
(b)  seeking  to  maintain  membership  and  mobilizing 
new  members,  (c)  attempting  to  shape  policies  indi¬ 
rectly  through  public  opinion  campaigns  and 
through  supporting  scientific  and  technical  informa¬ 
tion  that  reinforces  their  beliefs,  and  (d)  attempting 
to  shape  policies  directly  by  shopping  their  ideas  to 
venues  controlled  by  actors  with  authority  to 
change  policies,  such  as  the  courts,  executives,  agen¬ 
cies,  and  legislatures.  The  behavior  of  coalitions  is 
fundamentally  connected  to  the  level  of  conflict  in 
the  policy  subsystem.  In  some  subsystems,  rivals  are 
not  actively  present,  and  a  dominant  coalition 
maintains  a  monopoly  over  policy  affairs.  In  more 
adversarial  subsystems,  coalitions  behave  causti¬ 
cally  toward  their  rivals,  limiting  learning  between 
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coalitions.  In  more  collaborative  subsystems,  coali¬ 
tions  cooperate,  share  information,  and  engage  in 
cross-coalition  learning. 

While  coalition  members  share  policy  core 
beliefs  and  coordinate  their  behavior,  not  all  coali¬ 
tion  members  need  to  be  directly  tied  to  their  allies. 
Some  members  are  principal  members,  who  remain 
active  within  the  coalition  for  extended  periods  of 
time  and  are  well  connected  with  most  allies. 
Principal  members  anchor  the  coalition  by  bearing 
the  cost  of  coordination  and  by  maintaining  con¬ 
nections  with — and  hence  the  involvement  of — 
more  auxiliary  members,  whose  involvement  tends 
to  be  more  temporally  sporadic. 

A  flow  diagram  of  the  ACF  is  shown  in  Figure  1. 
On  the  right  is  a  policy  subsystem,  within  which 
coalitions  strive  to  attain  their  policy  objectives 
through  various  strategies.  Subsystems  operate 


within  a  broader  political  environment  defined  by 
relatively  stable  parameters  and  external  events 
along  with  long-term  coalition  opportunity  struc¬ 
tures  and  short-term  constraints  and  resources  of 
subsystem  actors. 

To  ease  the  application  across  political  systems, 
the  flow  diagram  in  Figure  1  depicts  three  sets  of 
variables  as  important  long-term  opportunity 
structures.  The  first  is  the  degree  of  consensus 
needed  for  major  policy  change,  which  shapes 
coalition  strategies  in  reaching  agreements.  The 
second  is  the  degree  of  openness  of  political  sys¬ 
tems.  For  example,  federalism  and  checks  and  bal¬ 
ances  in  the  United  States  create  decentralized 
processes  with  many  venues  and  encourage  entry 
participation  whereas  corporatist  systems  are  less 
open  and  more  centralized  and  restrict  participa¬ 
tion.  The  third  is  overlapping  societal  cleavages. 


Figure  I  A  Flow  Diagram  for  the  Advocacy  Coalition  Framework 

Source:  Adapted  from  Sabatier,  P.  A.,  Sc  Weible,  C.  M.  (2007).  The  advocacy  coalition  framework:  Innovations  and 
clarifications.  In  P.  A.  Sabatier  (Ed.),  Theories  of  the  policy  process  (2nd  ed.,  pp.  189-222).  Boulder,  CO:  Westview  Press. 
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Paths  to  Major  Policy  Change 

An  explanatory  focus  of  the  ACF  is  to  help  under¬ 
stand  and  explain  major  policy  change.  Four  paths 
to  major  policy  change  have  been  posited.  The  first 
is  shocks  that  occur  external  to  a  subsystem, 
including  broad  changes  in  socioeconomic  condi¬ 
tions,  public  opinion,  governing  coalitions,  and 
other  subsystems.  Among  the  factors  linking  an 
external  shock  to  major  policy  change  are  shifting 
coalition  resources  and  changing  beliefs  among 
coalition  actors. 

The  second  path  to  policy  change  is  shocks 
within  a  policy  subsystem.  An  example  might 
include  a  catastrophic  fire  in  a  forest  management 
policy  subsystem.  The  ACF  distinguishes  between 
external  shocks  and  internal  shocks  because  the 
latter  are  predicted  to  underscore  the  failures  in 
current  subsystem  practices  and  possibly  weaken¬ 
ing  a  dominant  coalition. 

The  third  path  to  policy  change  is  policy-oriented 
learning.  Policy-oriented  learning  involves  relatively 
enduring  changes  in  beliefs  and  strategies  resulting 
from  experience  and/or  new  information  as  related 
to  the  attainment,  and  possibly  the  modifications, 
of  policy  objectives.  As  coalition  members  learn 
over  time,  they  may  develop  new  ways  to  overcome 
a  rival  coalition  or  refine  their  policy  objectives, 
both  of  which  may  lead  to  major  changes  in  subsys¬ 
tem  policies.  Since  belief  systems  are  rigid  and  resis¬ 
tant  to  change,  policy-oriented  learning  primarily 
shapes  secondary  beliefs  or  secondary  aspects  of  the 
policy  subsystem  over  extended  periods  of  time. 

The  fourth  path  to  policy  change  occurs  through 
negotiated  agreements  among  competing  coalitions. 
Probably,  the  most  important  condition  enabling 
coalitions  to  negotiate  is  a  hurting  stalemate,  which 
occurs  when  members  of  rival  coalitions  view  the 
status  quo  as  unacceptable  and  perceive  no  other 
venues  to  achieve  their  policy  objectives  other  than 
direct  negotiations.  Additional  conditions  enabling 
negotiated  agreements  include  effective  leadership, 
consensus-based  decision  rules,  diverse  funding, 
duration  of  process  and  commitment  of  members,  a 
focus  on  empirical  issues,  an  emphasis  on  building 
trust,  and  lack  of  alternative  venues. 

Hypotheses 

Sabatier  and  Jenkins-Smith  provide  a  dozen  hypoth¬ 
eses  on  topics  ranging  from  belief  and  policy 


change  to  coalition  stability.  The  five  most  fre¬ 
quently  tested  hypotheses  are  the  following: 

f.  Significant  perturbations  external  to  the 
subsystem  (e.g.,  changes  in  socioeconomic 
conditions,  public  opinion,  systemwide 
governing  coalitions,  or  policy  outputs  from 
other  subsystems)  are  a  necessary,  but  not 
sufficient,  cause  of  change  in  the  policy  core 
attributes  of  a  governmental  program. 

2.  On  major  controversies  within  a  policy  sub¬ 
system  when  policy  core  beliefs  are  in  dispute, 
the  lineup  of  allies  and  opponents  tends  to  be 
rather  stable  over  periods  of  a  decade  or  so. 

3.  Policy-oriented  learning  across  belief  systems  is 
most  likely  when  there  is  an  intermediate  level 
of  informed  conflict  between  the  two  coalitions. 
This  requires  that  (a)  each  has  the  technical 
resources  to  engage  in  such  a  debate  and  (b)  the 
conflict  be  between  secondary  aspects  of  one 
belief  system  and  core  elements  of  the  other  or, 
alternatively,  between  important  secondary 
aspects  of  the  two  belief  systems. 

4.  Policy-oriented  learning  across  belief  systems  is 
most  likely  when  there  exists  a  forum  that  is 

(a)  prestigious  enough  to  force  professionals 
from  different  coalitions  to  participate  and 

(b)  dominated  by  professional  norms. 

5.  The  policy  core  attributes  of  a  governmental 
program  in  a  specific  jurisdiction  will  not  be 
significantly  revised  as  long  as  the  subsystem 
advocacy  coalition  that  instituted  the  program 
remains  in  power  within  that  jurisdiction — 
except  when  the  change  is  imposed  by  a 
hierarchically  superior  jurisdiction. 

Further  Development  and  Applications 

The  scope  of  the  ACF  continues  to  diversify  in  top¬ 
ics  and  theoretical  focus.  Areas  ripe  for  future 
research  include  (a)  explaining  coalition  structure 
and  behavior,  especially  factors  explaining  defec¬ 
tion  and  stability  of  members  and  choice  of  politi¬ 
cal  strategy;  (b)  developing  a  theory  of  policy 
subsystem  interdependencies;  (c)  explaining  the 
effects  of  scientific  and  technical  information  on 
learning  within  and  between  coalitions;  and 
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(d)  investigating  how  biased  assimilation  shapes 

coalition  behavior. 

Christopher  M.  Weible 
University  of  Colorado  Denver 
Denver,  Colorado,  United  States 
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Transnational  advocacy  networks  (TANs)  are  forms 
of  collective  action  that  cross  national  borders  and 
are  made  up  of  collaborative  connections  among  a 
myriad  of  actors  who  engage  in  joint  initiatives  on 


the  basis  of  common  goals  and  shared  values. 
These  networks  seek  to  change  states’  and/or  inter¬ 
national  organizations’  policies  and/or  public 
opinion,  at  both  a  domestic  and  an  international 
scale.  TANs  form  around  areas  of  specific  issues, 
but  these  vary  considerably,  ranging  from  human 
rights,  environmental  protection,  poverty,  gender 
equality,  and  labor  rights  to  networks  that  seek  to 
change  the  agendas  and  policies  of  international 
organizations  such  as  the  World  Bank  or  the 
International  Monetary  Fund.  Their  initiatives 
usually  have  a  limited  life  span.  Its  promoters  do 
not  seek  to  build  long-term  coalitions;  rather, 
TANs  emerge  and  die  as  new  issues  come  to  the 
fore  or  as  their  demands  gain  attention. 

The  concept  of  TANs  thus  describes  a  different 
form  of  organized  collective  action  from  the  more 
institutionalized  international  nongovernmental 
organizations  or  global  social  movements. 
Although  some  kind  of  internal  division  of  labor 
and  communication  structure  is  in  place,  TANs 
most  often  do  not  have  headquarters  or  paid  staff. 
Furthermore,  rules  for  affiliation  tend  to  be  loose. 
However,  the  internal  functioning  of  TANs  varies 
considerably,  and  it  changes  through  time  and 
across  issue  areas. 

The  literature  on  TANs  is  quite  recent.  It  flour¬ 
ished  in  the  1990s,  as  part  of  a  broader  effort  to 
understand  the  roles  of  nonstate  actors  in  interna¬ 
tional  relations.  Ever  since  the  publication  of  the 
pioneering  analyses  on  transnational  relations, 
around  the  end  of  the  1960s  and  the  beginning  of 
the  1970s,  its  study  has  gained  increased  promi¬ 
nence.  What  began  as  an  attempt  to  incorporate 
the  study  of  nonstate  actors — at  the  time,  mainly 
multinational  corporations — into  the  research 
agenda  of  international  relations  scholars  has 
become  a  vast  literature  that  takes  into  consider¬ 
ation  an  increasingly  heterogeneous  set  of  actors 
and  phenomena.  The  literature  on  TANs  has  con¬ 
tributed  to  moving  the  debate  from  the  question  of 
whether  nonstate  actors  are  relevant  in  interna¬ 
tional  relations  to  understanding  how  they  are 
relevant  and  how  to  evaluate  their  impacts. 

Studies  about  TANs  are  often  multidisciplinary. 
More  specifically,  this  is  a  literature  that  has  stimu¬ 
lated  a  fruitful  dialogue  between  international  rela¬ 
tions  and  social  movement  theories.  Although  most 
authors  will  deny  the  existence  of  a  simple  causal 
link  between  globalization  and  transnational 
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collective  action,  the  constitution  of  international 
regimes  and  the  augmented  power  of  international 
organizations  provide  a  common  political  back¬ 
ground  for  TANs.  These  are  understood  best  by 
considering  the  contributions  made  by  interna¬ 
tional  relations  theorists.  In  turn,  many  of  the  chal¬ 
lenges  that  social  movements  face  at  the  domestic 
level,  in  terms  of  coalition  building,  for  example, 
are  also  present  in  the  international  arena.  Most 
important,  social  movement  theorists  help  in  iden¬ 
tifying  the  mechanisms  by  which  actors  are  able 
(or  unable)  to  overcome  their  differences  and  con¬ 
struct  common  purpose. 

Studies  about  TANs  also  seek  to  go  beyond 
local-global  dichotomies.  Thus,  rather  than  speak 
of  the  emergence  or  existence  of  a  “global  civil 
society,”  scholars  refer  to  a  plurality  of  TANs 
springing  up  intermittently  across  a  wide-ranging 
set  of  issues.  These  networks  are  made  up  of  vari¬ 
ous  combinations  in  terms  of  participants’  national 
origins  and  the  pathways  to  transnationality  cho¬ 
sen  by  them.  While  many  actors  remain  rooted  at 
the  local  or  national  scale,  others  may  be  part  of 
global  social  movements  or  international  nongov¬ 
ernmental  organizations. 

At  least  two  issues  remain  as  key  challenges  for 
the  TAN  literature.  First,  it  is  not  always  clear  as 
to  how  to  set  the  boundaries  of  TANs,  that  is,  how 
to  identify  participants  and  differentiate  among 
actors,  given  the  internal  asymmetry  of  power  rela¬ 
tions.  Second,  although  there  has  been  a  concerted 
effort  to  evaluate  the  impacts  of  these  collective 
action  forms,  we  still  know  little  about  their  longer 
term  effects. 

Relationships  Among  Actors  in  TANs 

TANs  are  sponsored  by  a  wide  variety  of  actors 
located  in  different  social  positions  and  geographi¬ 
cal  locations,  so  much  so  that  it  is  often  hard  to 
draw  their  boundaries.  There  is  no  consensus  in  the 
literature  on  this  topic.  For  some  scholars,  TANs 
are  not  necessarily  limited  to  nonstate  agents  but 
may  include  sympathetic  government  officials  as 
well  as  international  organization  officials.  Among 
nonstate  actors,  these  may  include  nongovernmen¬ 
tal  organizations  (NGOs),  grassroots  organiza¬ 
tions,  foundations,  business  organizations,  the 
media,  churches,  and  various  other  types  of  asso¬ 
ciations  as  well  as  individuals.  For  other  authors,  it 


is  best  to  define  TANs,  more  restrictively,  as 
including  only  domestic  and  international  nonstate 
actors,  and  among  these  mostly  NGOs  and  social 
movements.  A  clearer  definition  of  the  boundaries 
of  TANs  remains  an  open  challenge. 

Whether  the  definition  of  participants  is  more  or 
less  restricted,  in  both  cases,  there  is  evidence  that 
actors  have  very  different  approaches  in  terms  of 
targets  chosen  and  strategies  used.  As  sets  of  inter¬ 
connected  nodes,  networks  cannot  be  aprioristically 
defined  as  formed  by  horizontal  or  homogeneous 
ties.  Nevertheless,  this  use  of  the  concept  of  “net¬ 
works”  has  been  common  among  activists  and 
scholars  alike.  It  has  helped  emphasize  the  novelty 
of  these  forms  of  collaboration  with  respect  to  more 
hierarchical  and  centralized  initiatives  of  the  past. 
However,  recent  scholarly  efforts  have  focused  on 
trying  to  specify  better  the  embeddedness  of  net¬ 
works  in  power  relations  as  spaces  of  negotiation 
(and  contention)  affected  by  asymmetrical  rela¬ 
tions  among  participants.  These  efforts  have  led  to 
a  more  detailed  analysis  about  the  internal  func¬ 
tioning  of  TANs,  which  focus,  for  example,  on 
questions  of  accountability,  transparency,  and 
decision-making  processes.  Asymmetries  may  fol¬ 
low  North-South  lines,  but  this  is  only  one  possi¬ 
ble  aspect.  Other  sources  of  tension  within  TANs 
refer  to  different  political  cultures  (among  coun¬ 
tries  and  also  among  sectoral  domains)  and  differ¬ 
ent  visions  in  terms  of  tactics,  both  of  which  can  be 
found  within  and  across  the  South  and  the  North. 

Impacts  of  TANs 

To  understand  under  what  domestic  and  interna¬ 
tional  circumstances  do  TANs  succeed  or  fail  to 
achieve  their  goals,  a  long  series  of  case  studies  and 
comparisons  among  them  have  been  undertaken. 
Scholars  have  focused  on  the  abilities  of  networks 
not  only  to  influence  political  actors  and  policies  at 
the  domestic  and  international  scales  but  also  to 
bring  about  shifts  in  public  opinion  on  given  issues. 
Research  has  been  done  on  campaigns  oriented 
toward  particular  political  problems  (e.g.,  debt 
relief,  use  of  land  mines)  and  policy  domains  (e.g., 
the  environment,  trade  negotiations,  gender  equal¬ 
ity).  While  there  are  numerous  case  studies  showing 
that  TANs  did  have  a  positive  impact,  this  is  often 
not  the  case.  Independent  variables  that  explain 
these  different  outcomes  in  the  literature  include 
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characteristics  of  the  networks  themselves  (e.g.,  the 
strength  of  local  grassroots  members  or  the  density 
of  ties),  characteristics  of  the  issue  chosen  (e.g.,  its 
resonance  with  existing  norms),  characteristics  of 
the  target  (e.g.,  the  vulnerability  of  states  to  lever¬ 
age  politics  or  the  access  of  activists  to  the  state), 
and  the  strategies  of  the  networks  (e.g.,  their  capac¬ 
ity  to  generate  reliable  information  quickly). 

Margaret  Keck  and  Kathryn  Sikkink  pioneered 
the  efforts  to  understand  the  impacts  of  TANs  in 
their  1998  book,  Activists  Beyond  Borders: 
Advocacy  Networks  in  International  Politics,  in 
which  they  propose  the  “boomerang  pattern”  as  a 
common  and  effective  tactic  used  by  TANs  to 
enhance  their  influence.  Most  especially  in  cases  of 
closure  of  the  domestic  political  system,  nonstate 
actors  seek  allies  in  other  countries  (or  are  sought 
by  them)  to  apply  pressure  on  other  states  and/or 
international  organizations.  These,  in  turn,  will  be 
able  to  use  their  leverage  on  the  original  target  and 
thus  help  bring  about  the  change  demanded  by 
domestic  activists.  Examples  of  the  successful 
throwing  of  “boomerangs”  include  the  well- 
known  case  of  the  Brazilian  rubber  tappers  who, 
in  the  1980s,  allied  with  U.S.  environmental 
NGOs  to  pressure  the  U.S.  Congress  to  use  its 
leverage  on  the  World  Bank.  The  World  Bank,  in 
turn,  helped  bring  about  changes  in  the  projects 
they  helped  sponsor  in  the  Amazon  region  and 
thus  bring  about  changes  in  the  Brazilian  state’s 
policies  vis-a-vis  environmental  protection  and  the 
rubber  tappers. 

Special  emphasis  has  been  given  by  TAN  schol¬ 
ars  to  impacts  that  are  not  easily  measurable,  such 
as  the  effects  of  transnational  activism  on  interna¬ 
tional  norms.  This  is  certainly  true  in  the  case  of 
scholars  who  adhere  to  a  constructivist  theoretical 
approach  to  the  study  of  transnationalism.  The 
argument  is  that  TANs  do  contribute  to  change  by 
shifting  the  norm  structure  of  global  governance. 
Norms,  understood  broadly  as  shared  expecta¬ 
tions  held  by  actors  about  what  is  their  appropri¬ 
ate  behavior,  are  affected  by  the  actions  of  TANs 
in  at  least  two  important  ways:  ( 1 )  as  references  by 
which  TANs  hold  states  and  other  actors  account¬ 
able  and  (2)  by  the  creation  of  new  norms  or  the 
initiation  of  norm  shifting.  Either  by  upholding 
existing  norms  or  by  proposing  new  ones,  TANs 
have  an  impact  on  how  people  in  general  think 
about  the  world. 


Notwithstanding  various  successful  examples  of 
norm  and  policy  shifting,  evidence  of  the  impacts 
of  the  campaigns  launched  by  TANs  is  mixed  and 
remains  an  important  topic  for  future  research. 
More  specifically,  longer  term  studies  are  needed 
to  understand  whether  these  changes  are  sustained 
through  time. 

Marisa  von  Billow 
Universidade  de  Brasilia 
Brasilia-DF,  Brazil 
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The  expression  “African  political  thought”  can  be 
considered  a  prima  facie  misunderstanding  in  that 
it  oversimplifies  the  pluralism  of  thoughts  that  has 
always  existed  in  Africa.  Deep  divergences  charac¬ 
terize  the  African  ideological  landscape.  While 
Jomo  Kenya tta,  the  Kenyan  political  thinker  who 
was  the  first  prime  minister  of  Kenya  and  its 
president  from  1964  to  1978,  positioned  precolo¬ 
nial  Africa  as  the  reference  for  the  rebuilding  of 
independent  Africa;  Leopold  Sedar  Senghor,  presi¬ 
dent  of  Senegal  from  1960  to  1980,  idealized  the 
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encounter  between  Africa  and  Europe  as  the  start¬ 
ing  point  of  any  modernity  in  Africa.  In  the  same 
way,  the  call  for  the  immediate  building  of  a 
United  States  of  Africa  was  contradicted  by  the 
advocacy  of  a  step-by-step  process  of  regional 
integration. 

Pluralism  of  political  thoughts  is  not  the  lone 
obstacle  besetting  the  emergence  of  a  unified 
school;  alongside  it,  there  is  also  the  issue  of  plu¬ 
ralism  in  meaning.  As  the  matter  of  fact,  the 
notion  of  “African  political  thought”  is  influ¬ 
enced  by  several  dialectics,  including  those 
between  the  African  continent  and  the  diaspora, 
between  North  Africa  and  sub-Saharan  Africa, 
and  between  tradition  and  modernity.  This  entry 
first  examines  each  of  these  dialectics.  It  then 
describes  the  substance  of  African  political 
thought  through  two  main  categories:  (1)  “again- 
stism”  and  (2)  “renaissance.” 

Dialectics  Influencing  African 
Political  Thought 

The  African  Continent  and  the  Black  Diaspora 

The  first  dialectic  is  that  between  the  African 
continent  and  the  Black  diaspora.  For  instance, 
without  reference  to  the  Jamaican  thinker  Marcus 
Garvey  or  the  American  thinker  William  Edward 
Burghardt  Du  Bois,  it  is  impossible  to  understand 
the  genealogy  of  Pan-Africanism;  the  Pan- 
Africanism  that  constitutes  the  main  paradigm  of 
African  political  thought  is  initially  a  Diaspora 
idea.  As  strongly  contesting  colonization  and 
neocolonialism,  African  radicalism  has  been  elab¬ 
orated  mainly  by  the  French  citizen  of  Martinique 
Frantz  Fanon.  The  relationship  between  these 
icons  of  African  political  thought  and  the  African 
continent  is  linked  to  the  color  of  their  skin. 
African  political  thought  is  to  some  extent  a 
Black  political  thought.  It  therefore  appears  that 
the  idea  of  African  political  thought  is  linked  to 
that  of  Black  political  thought.  The  debate  among 
the  diaspora  in  the  United  States,  in  the  Americas, 
and  in  the  modern  world  has  never  been  consid¬ 
ered  by  the  continental  African  thinkers  as  for¬ 
eign  to  their  concern.  Senghor  and  Kwame 
Nkrumah  were  always  in  conversation  with  Black 
American  thinkers  such  as  Du  Bois  and  Martin 
Futher  King  Jr. 


The  Relationship  Between  Sub-Saharan 
Africa  and  North  Africa 

The  second  dialectic  is  that  between  sub-Saha¬ 
ran  Africa  and  North  Africa.  This  one  concerns  the 
racial  divide  between  “Blacks”  and  “Arabs.” 
Africa  is  not  a  continent  with  ethnic  or  racial 
homogeneity.  The  impact  of  this  on  African  politi¬ 
cal  thought  is  the  cohabitation  of  “Pan- Africanism” 
and  “Pan-Arabism”  in  the  continent.  Even  if 
Pan-Africanism  is  unification’s  ideology,  some  of 
its  authors  restricted  Pan- Africanism  to  “Black 
Africa.”  The  Afrocentric  Senegalese  thinker  Cheik 
Anta  Diop  established  the  cultural  foundations  of 
a  Black  African  federal  state.  In  practical  terms,  this 
idea  was  reformulated  by  President  Mobutu  Sese 
Seko  of  Zaire  as  a  replacement  of  the  Organisation 
of  African  Unity  by  a  league  of  Black  people  and 
states.  Seko’s  proposal  was  clearly  directed  against 
the  Arabs  of  the  continent,  who  were  accused  of 
being  oriented  more  toward  Middle  East  through 
the  Arab  Teague.  At  the  same  time  that  Nkrumah  of 
Ghana  was  advocating  Pan-Africanism  from  Cape 
Town  to  Cairo,  Gamal  Nasser  of  Egypt  was  defend¬ 
ing  Pan-Arabism  from  Rabbat  to  Damascus.  The 
dialectic  of  the  African  attachment  to  the  continent 
and  their  attraction  to  what  is  going  on  in  Asia  indi¬ 
cates  the  variable  geometry  of  Pan- Africanism.  It  is 
worth  noting  that  whereas  Pan-Africanism  and 
blackness  are  intertwined,  the  interaction  between 
Pan-Arabism  and  Pan-Africanism  is  still  appre¬ 
hended  under  the  prism  of  antagonism,  instead  of 
complementarities.  The  presence  of  Arabs  in 
Northern  Africa  and  the  role  played  by  Arab  states 
such  as  Egypt,  Algeria,  and  Morocco  in  the  libera¬ 
tion  of  the  continent  seem  to  contradict  the  ongo¬ 
ing  misconception  of  Africa  as  a  Black  continent. 
At  this  stage,  South  Africa’s  episteme  of  represent¬ 
ing  an  African  rainbow  identity  is  a  revolution. 
Despite  the  influence  of  African  radicalism  that 
structured  the  African  National  Congress,  post¬ 
apartheid  South  Africa,  under  the  leadership  of 
Nelson  Mandela,  transcends  racial  cleavages:  To 
be  an  African  becomes  a  cosmopolitan  idea  that 
can  accommodate  Indians,  Whites,  and  Blacks.  It 
is  the  advent  of  a  postracial  African  that  was  envis¬ 
aged  by  Mandela’s  thought. 

Tradition  and  Modernity 

The  third  dialectic  that  operates  in  the  realm  of 
African  political  thought  is  that  between  tradition 
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and  modernity.  Traditional  African  political 
thought  is  perceived  as  political  thought  based  on 
what  is  authentic  to  Africa  through  a  precolonial 
reference  (Basil  Davidson,  1998;  Jomo  Kenyatta, 
1938).  Modern  African  political  thought  is  politi¬ 
cal  thought  generated  by  the  colonial  and  postco¬ 
lonial  eras.  From  this  artificial  dichotomy,  tradi¬ 
tion-generated  political  thought  is  given  the  label 
of  authentic  African  political  thought.  At  the 
same  time,  modern  political  thought  is  stigma¬ 
tized  for  being  Westernized.  This  is  a  false  opposi¬ 
tion,  because  tradition  and  modernity  are  relative 
notions  and  represent  intertwined  realties.  The 
search  for  an  authentic  African  political  thought 
is  an  impediment  to  the  understanding  of  the 
dynamics  of  Africa’s  ideological  landscape. 
Instead  of  initiating  a  futile  debate  on  the  criteria 
of  authenticity,  it  is  better  to  note  what  works  as 
an  African  political  thought,  be  it  modern  or  tra¬ 
ditional,  Afrocentrist  or  influenced  by  “Western” 
ideologies. 

African  Political  Thought  as  “Againstism” 

Modern  African  political  thought  is  to  be  under¬ 
stood  through  the  mediation  of  the  relation  with 
the  “intimate  enemy,”  that  is,  the  Western  world 
made  up  of  colonial  powers.  That  is  why  African 
political  thought  focuses  on  accusation.  In  this 
regard,  the  radical  discourse  dominates  the  ideo¬ 
logical  landscape.  The  Western  world  is  presented 
as  the  enemy  on  the  basis  of  the  experiences  of 
slavery  and  colonialism.  These  two  experiences 
are  seen  as  part  of  the  same  trend  of  African 
humiliation  in  the  modern  world.  If  one  agrees 
with  Carl  Schmitt  that  the  designation  of  the 
enemy  is  the  starting  point  of  politics,  then  African 
thought  is  political  and  unified  when  African 
thinkers  define  a  common  enemy  as  in  the  follow¬ 
ing  examples: 

In  the  1930s,  Kenyatta  accused  the  “Europeans”  of 
having  destroyed  the  human  dignity  of  the  African, 
denigrated  his  culture  and  religion,  and  stolen  his 
land  (Kenyatta,  1938).  This  discourse  on 
colonization  as  a  brutalization  process  is  shared  by 
the  main  radical  thinkers. 

Nkrumah’s  opposition  to  the  Western  world 
concerned  its  colonial  and  postcolonial  faces.  In  his 


analysis  of  the  European  domination  of  Africa, 
Nkrumah  (1961)  asserts, 

The  white  man  arrogated  to  himself  the  right  to 
rule  and  to  be  obeyed  by  the  non  white  .  .  .  Under 
this  cloak,  the  Europeans  robbed  the  continent  of 
vast  riches  and  inflicted  unimaginable  suffering 
of  the  African  people,  (p.  xi) 

Despite  the  independence  of  African  states, 
Nkrumah  identified  neocolonialism  as  “the  last 
stage  of  imperialism.”  Neocolonialism  is  the 
continuity  of  other  forms  of  Western  domination: 

In  place  of  colonialism  ...  we  have  today  neo¬ 
colonialism.  .  .  .  The  essence  of  neo-colonialism 
is  that  the  State  which  is  subject  to  it  is,  in  theory, 
independent.  ...  In  reality  its  economic  system 
and  thus  its  political  policy  is  directed  from 
outside.  (Nkrumah,  1965,  p.  30) 

Amilcar  Cabral’s  (1970)  denunciation  of  colonial¬ 
ism  is  based  on  the  fact  that  the  Western  colonial 
powers  had  taken  “up  arms  to  destroy  or  at  least 
to  paralyze  (the)  cultural  life  (of  Africans)”:  Foreign 
domination  can  be  maintained  “only  by  the 
permanent,  organized  repression  of  the  cultural  life 
of  the  people  concerned.”  Colonization  and  neo¬ 
colonialism  are  explained  as  negations  of  the 
history  and  culture  of  African  peoples. 

Fanon,  in  his  writings,  formulated  an  absolute 
clash  between  Africa  and  Europe:  Colonialism  is 
presented  as  “fundamentally  unforgivable”  (Fanon, 
2006,  p.  118)  and  Europe  as  having  proceeded 
with  cynicism  and  violence  against  Africa. 

These  categories  and  representations  with  their 
Marxist  background  have  exercised  a  real  ideo¬ 
logical  hegemony  in  Africa  and  still  do  so  today.  In 
the  intellectual  domain,  the  conservative  trend  of 
African  political  thought  constituted  a  minority. 
The  denunciation  of  Europe  is  common  to  both 
radical  and  conservative  African  political  thought. 
Disunity  appears  at  the  level  of  the  relationship 
with  the  colonial  power:  When  a  conservative 
African  political  thinker  such  as  Senghor  forgives 
the  former  colonial  power  and  regards  it  as  a 
friend  and  partner,  radical  thinkers  envisage  strug¬ 
gle.  The  synthesis  of  this  historic  cleavage  has  been 
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made  in  postcolonial  South  Africa  through  the 
“truth  and  reconciliation”  process  under  which  a 
radical  liberation  movement,  African  National 
Congress,  accepted  to  cooperate  with  apartheid 
elites. 

African  Political  Thought  as  a 

Renaissance  Political  Thought 

African  political  thought  is  dominated  by  the  idea 
of  renaissance.  It  is  a  normative  political  thought 
generated  by  the  traumatism  of  slavery,  colonial¬ 
ism,  and  underdevelopment.  These  historical 
events  constitute  the  implicit  or  explicit  back¬ 
ground  of  African  political  thought. 

The  renaissance  from  the  colonial  or  neocolonial 
domination  is  to  be  obtained  through  struggle  for 
national  liberation.  It  is  a  cultural  struggle.  The 
liberation  movement  defends  the  culture  of  the 
people  (Cabral,  1970).  The  struggle  is  also  a 
military  one  implying  the  use  of  revolutionary 
violence  (Fanon,  2002). 

The  renaissance  following  the  colonial  divide  of 
Africa  and  its  underdevelopment  is  to  be  made 
through  the  creation  of  United  States  of  Africa. 
Despite  its  practical  failure,  this  ideal  still  dominates 
African  political  thought  (Nkrumah,  1963). 

The  renaissance  is  a  terrain  in  which  several 
national  ideologies  situated  between  Marxism  and 
liberalism,  such  as  “African  socialism”  or 
“communal  liberalism”  flourished. 

The  renaissance  political  thought  is  also  a  thought 
of  liberty  and  equality.  Under  the  condemnation  of 
colonialism,  apartheid,  and  authoritarianism,  what 
are  at  stake  are  liberty  and  equality  (Mandela, 
1995).  African  political  thought  then  appears  in  its 
universality.  However,  as  elsewhere,  African 
political  thought  is  also  made  up  of  exclusion  of 
the  other.  The  light  of  renaissance  is  contradicted 
by  the  darkness  of  discourses  full  of  hate.  Ethnic 
exclusion  and  genocide  in  Africa  have  always  been 
justified  “ideologically.” 

Luc  Sindjoun 
University  of  Yaounde  II 
Yaounde,  Cameroon 
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Agencies 


In  this  entry,  the  concept  of  “agencies”  is  defined, 
and  a  short  review  of  its  history  and  relevance  for 
public  administration  practice  is  provided.  Finally, 
some  key  issues  that  have  emerged  from  the  ever¬ 
growing  literature  on  this  topic  will  be  discussed. 

Definition 

Agencies  are  the  organizational  emanation  of 
administrative  decentralization  in  the  public  sector 
by  which  large  monolithic  core  departmental 
bureaucracies  are  “unbundled”  into  smaller  execu¬ 
tive  organizations  that  operate  at  a  distance  from 
the  center  of  government.  There  are  many  descrip¬ 
tions  of  the  administrative  species  “agency”  in  the 
international  literature,  such  as  “nondepartmental 
public  bodies,”  “quangos,”  “nonmajoritarian 
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institutions,”  “quasi-autonomous  public  bodies,” 
and  “arm’s-length  agencies,”  only  to  name  some. 
For  reasons  of  clarity,  the  term  agency  will  be  used 
throughout  this  entry.  There  is,  however,  a  varia¬ 
tion  in  the  interpretation  of  the  concept  “agency” 
in  different  countries.  Some  argue  that  what  an 
agency  is,  and  what  functions  it  does,  may  vary 
considerably  across  national  cultures,  legal  sys¬ 
tems,  and  political  systems. 

Although  the  definition  of  an  agency,  or  the 
way  in  which  “agencies”  are  conceptualized, 
may  differ  from  country  to  country,  there  are 
some  common  features  by  which  agencies  can  be 
characterized: 

First,  in  most  cases,  agencies  are  public  law  bodies. 
This  means  that  they  are  subject  to  at  least  some 
public  law  procedures. 

Second,  agencies  have  some  capacity  for 
autonomous  decision  making  with  respect  to  their 
internal  management  and/or  policies. 

Third,  agencies  are  structurally  disaggregated  from 
the  center  of  the  governmental  apparatus  (core 
ministry).  This  means  that  in  most  cases,  agencies 
are  not  directly  subordinate  hierarchically  to  the 
oversight  authorities,  such  as  the  central 
administration  of  government  or  the  minister. 

Still,  fourth,  agencies  are  formally  under  at  least 
some  control  of  ministers  and  governmental 
departments,  as  there  often  is  a  (larger  or  smaller) 
degree  of  political  responsibility  for  what  the 
agency  does. 

Fifth,  in  most  cases,  agencies  are  (at  least  partly) 
staffed  with  civil  servants  and  (at  least  partly) 
financed  by  the  state  budget. 

In  the  literature,  the  distinction  is  made  between 
internally  autonomous  agencies  and  externally 
autonomous  agencies.  Internally  autonomous 
agencies  have  a  quasi-autonomous  status  within 
the  central  governmental  department.  These  agen¬ 
cies  are  often  labeled  as  “ministerial  agencies,”  as 
they  do  not  have  a  legal  personality  of  their  own 
(they  are  part  of  the  legal  person  of  the  govern¬ 
ment,  the  ministry,  or  the  department).  This 
implies  that  the  hierarchical  relationship  with  the 
minister  remains  unchanged  and  that  the  minister 
remains  responsible  for  the  organization.  Internally 


autonomous  agencies  do  have  extended  manage¬ 
rial  flexibilities,  however,  compared  with  the  core 
governmental  organizations  (from  which  their 
“autonomous”  status  is  derived).  Examples  of 
internally  autonomized  agencies  are  the  Next  Steps 
Agencies  (United  Kingdom  [UK]),  the 
Agentscbappen  (the  Netherlands),  or  many  federal 
agencies  in  the  United  States.  Externally  autono¬ 
mous  agencies  are  often  labeled  as  “nonministerial 
agencies”  because  of  their  own  legal  personality 
and  the  fact  that  they  are  structurally  detached 
from  the  core  of  the  governmental  apparatus.  In 
most  cases,  the  legal  personality  implies  that  agen¬ 
cies  are  controlled  and  steered  by  an  intermediary 
body  (like  a  board  of  governors).  The  existence  of 
a  board  implies  that  the  ministerial  responsibility 
toward  the  agency  decreases  and  often  is  only 
political  in  nature.  Examples  of  externally  autono¬ 
mous  agencies  are  the  so-called  nondepartmental 
public  bodies  (UK),  the  zelfstandige  bestuursor- 
ganen  (ZBOs;  the  Netherlands),  or  the  indepen¬ 
dent  administrative  institutions  (Japan). 

Historical,  Theoretical, 

and  Empirical  Context 

Agencies:  Not  Something  New  at  All 

With  the  rise  of  new  public  management  (NPM) 
as  one  of  the  dominant  administrative  doctrines  of 
the  past  decades,  many  countries  began  reorganiz¬ 
ing  their  administrative  apparatus.  One  particular 
reform  that  could  be  observed  in  many  countries 
was  the  so-called  unbundling  of  the  traditional 
large  monolithic  bureaucracies.  For  example,  in 
the  UK  under  Margaret  Thatcher,  the  Next  Steps 
Program  dissolved  the  central  bureaucracy  in  doz¬ 
ens  of  smaller  executive  agencies  that  were  respon¬ 
sible  for  the  implementation  of  public  policies. 
This  example  was  followed  by  similar  initiatives  in 
other  countries  such  as  the  Netherlands  or,  more 
recently,  Belgium  (Flanders).  However,  this  kind 
of  public  sector  reform  can  be  observed  not  only  in 
the  heartland  of  NPM  (including  many  English- 
speaking  countries)  and  in  the  broader  Western 
world  but  also  elsewhere.  Agencification  programs 
in  developing  countries  (e.g.,  Tanzania,  Jamaica), 
in  East  Asian  countries  (e.g.,  Thailand),  and  in 
Central  and  Eastern  European  countries  (e.g., 
Latvia)  have  been  documented  extensively  in  the 
literature.  Notwithstanding  the  fact  that  the  1980s 
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and  the  1990s  were  characterized  in  many  coun¬ 
tries  by  the  establishment  of  many  arm’s-length 
agencies,  the  existence  of  agency-type  administra¬ 
tive  entities  is  not  new  at  all.  For  example,  for 
centuries  the  central  government  in  Sweden  has 
consisted  of  a  small  core  bureaucracy,  responsible 
for  policy  making,  and  many  executive  agencies  at 
arm’s  length,  responsible  for  implementing  poli¬ 
cies.  As  a  second  example,  in  the  1970s  (so  before 
NPM  came  into  its  heyday),  Hong  Kong’s  central 
government  was  unbundled  into  a  lot  of  small 
arm’s-length  executive  agencies.  Also,  the  federal 
government  of  the  United  States  has  a  long  tradi¬ 
tion  of  working  with  agency-like  public  organiza¬ 
tions  that  are  structurally  disaggregated  from  the 
core  of  government. 

Why  Agencies? 

The  popularity  of  the  agency-like  administrative 
species  in  many  governmental  reforms  is  the  result 
of  the  normative  NPM-driven  belief  that  this  type 
of  public  organization  is  better  suited  to  deliver 
public  services  in  an  efficient  and  effective  way.  As 
such,  establishing  agencies  was  considered  by 
many  governments  as  part  of  the  solution  in  search 
of  an  answer  to  the  crisis  of  the  state  in  the  1970s 
and  1980s.  Agencies  are  presumed,  according  to 
NPM  advocates,  to  be  better  performers  than  tra¬ 
ditional  bureaucratic  organizations  because 

(a)  they  operate  closer  to  the  citizen/client  and 
hence  are  more  responsive  to  the  client’s  need, 

(b)  they  are  more  flexible  because  of  their  high 
levels  of  managerial  discretion,  and  (c)  their 
(quasi-)  independent  status  encourages  them  to 
behave  more  efficiently.  The  theoretical  justifica¬ 
tion  behind  this — often  normative — point  of  view 
can  be  found  in  rational  economic  theories  such  as 
property  rights  theory  and  principal-agent  theory. 
Agencies,  according  to  these  lines  of  reasoning,  are 
assumed  to  operate  more  efficiently  and  effectively 
compared  with  core  departmental  bureaucracies, 
because  their  autonomous  status  enables  them  to 
make  their  own  managerial  choices.  The  fact  that 
agencies  have  the  discretionary  power  to  handle  to 
a  certain  extent  their  own  resources  (personnel, 
finance,  etc.)  creates  an  incentive  to  use  these 
resources  efficiently  and  effectively  (“let  the  man¬ 
ager  manage”).  On  the  other  hand,  the  enlarged 
discretionary  managerial  competences  on  behalf  of 


the  agencies  create  a  need  to  control  these  agencies 
by  their  principals,  that  is,  the  political  and  admin¬ 
istrative  oversight  authorities  (“make  the  manager 
manage”). 

In  sum,  according  to  NPM  advocates,  the  cre¬ 
ation  of  agencies  is  something  rational,  under¬ 
pinned  by  rational-economic  theories,  as  agencies 
are  presumed  (due  to  their  quasi-autonomous  sta¬ 
tus)  to  be  better  deliverers  of  public  services  than 
large,  core-departmental  bureaucracies.  Recently, 
however,  some  scholars  have  argued  that  the 
agency  phenomenon  is  more  than  an  answer  in  the 
search  for  better  public  service  delivery.  These 
scholars  argue  that  the  agency  phenomenon  is  a 
trend,  a  practice  that  is  copied.  There  is  a  growing 
body  of  literature  that  shows  that  the  creation  of 
agencies,  as  part  of  public  sector  reform  processes, 
is  not  always  a  rational  decision.  In  many  coun¬ 
tries,  the  creation  of  agencies  on  a  large  scale  is  the 
result  of  processes  of  mimetic,  normative,  or  even 
coercive  isomorphism.  In  the  past  decades,  public 
sector  reforms  that  included  some  kind  of  “agenci- 
fication”  were  considered  as  a  best  practice. 
Governments  often  tend  to  copy  best  practices 
from  each  other.  Beside  that  many  international 
institutions  (e.g.,  World  Bank,  International 
Monetary  Fund  [IMF],  and  Organisation  for 
Economic  Co-operation  and  Development 
[OECD])  and  multinational  consulting  firms  have 
been  very  active  in  promoting  NPM-like  public 
sector  reforms,  including  agencification  programs. 

Diversity 

Despite  the  often  argued  fact  that  one  is  “con¬ 
verging”  toward  the  agency  form  all  over  the 
world,  which  is  also  true  to  a  certain  extent,  one 
should  recognize  that  the  agency  form  may  not  be 
that  homogeneous  as  one  might  sometimes  think. 
It  can  be  argued  that  although  the  numerous  simi¬ 
larities  among  these  organizations  have  been  cre¬ 
ated  all  over  the  world,  there  is  also  a  great  deal  of 
variation,  mainly  due  to  differences  in  national 
cultures,  legal  systems,  and  political  systems.  The 
variation  occurs  along  a  number  of  dimensions, 
such  as  their  governance  structure  and  their  rela¬ 
tion  with  ministerial  authorities,  their  degree 
of  autonomy,  and  the  variety  of  tasks  they  per¬ 
form.  In  some  models,  agencies  are  organizations 
that  are  more  or  less  directly  linked  to  the  core  of 
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government  (central  ministries).  In  English- 
speaking  countries,  such  as  the  UK,  the  chief 
executive  of  the  agency  is  responsible  for  an 
administrative  oversight  (a  ministerial  department) 
rather  than  a  corporate  body  such  as  a  board.  This 
implies  that  the  hierarchical  bonds  between  agency 
and  oversight  are  much  stronger  compared  with 
the  Scandinavian  model,  for  example.  In  the 
United  States,  the  typical  agency  is  a  public  law 
legal  person  of  its  own,  but  it  is  still  subordinate  to 
the  department  within  which  it  is  located.  Besides 
such  agencies,  established  by  a  congressional  act, 
there  are  also  agencies  that  are  established  by  and 
within  a  department.  In  other  models,  though  the 
bonds  of  agencies  with  ministries  may  be  weak,  yet 
they  have  strong  autonomy  and  independence.  In 
Scandinavian  countries  (e.g.,  Sweden),  agencies  are 
responsible  for  implementing  policies  that  are 
issued  by  small  ministries  responsible  for  policy 
formulation,  although  the  agencies  are  also  active 
in  policy  making  themselves  through  their  own 
corporate  body,  which  has  a  lot  of  autonomy  vis- 
a-vis  the  political  and  administrative  oversight 
authorities.  Also,  in  some  cases,  private  organiza¬ 
tions  performing  public  tasks  are  considered  as 
agencies.  Governments  may  move  some  functions 
out  of  their  control  (e.g.,  through  privatization), 
but  they  are  still  ultimately  responsible  for  provid¬ 
ing  basic  services  to  the  public  (e.g.,  the  privatiza¬ 
tion  and  breakup  of  British  Rail). 

Key  Issues 

Roles  and  Functions 

Agencies  typically  implement  policies.  They 
are,  in  most  cases,  responsible  for  delivering  pub¬ 
lic  services  and  public  goods,  such  as  public  trans¬ 
port,  public  infrastructure,  public  security,  and  so 
on.  This  means  that  the  role  of  agencies  can  be 
observed  in  virtually  all  policy  fields  or  areas  for 
which  government  is  responsible.  The  observation 
that  many  agencies  deliver  public  goods  and  ser¬ 
vices  is  in  line  with  some  NPM-like  doctrines  that 
advocate  a  so-called  primacy  of  politics.  According 
to  such  doctrines,  politicians  are  democratically 
legitimized  to  decide  on  (the  content  and  scope  of) 
policy,  while  the  main  role  of  administrative 
actors  (e.g.,  agencies)  is  to  implement  these  poli¬ 
cies.  This  strict  dichotomy  between  policy  (politi¬ 
cians)  and  operations  (administrative  actors) 


should  be  nuanced  on  at  least  two  points.  First,  in 
reality,  we  can  often  observe  in  many  countries 
that  agencies  have  a  large  role  to  play  in  regula¬ 
tory  policy.  In  the  policy  domains  of  many  coun¬ 
tries,  it  is  agencies  that  are  the  responsible  actors 
to  regulate  the  market  in  important  policy  fields 
such  as  energy  supply,  telecommunications,  and 
postal  services.  These  “regulatory  agencies”  have 
thus  a  strong  policy  element  in  their  task  and  role 
(setting  standards  and  norms,  controlling  market 
players,  [helping  inj  defining  the  regulatory  frame¬ 
work  in  the  policy  field,  etc.).  Second,  recent 
research  shows  that  agencies  that  are  mainly 
involved  in  implementing  policies  (delivering  pub¬ 
lic  goods  and  services)  often  have  a  large  role  to 
play  in  policy  making  too. 

Key  Variables:  Autonomy  and  Control 

Agencies  are  generally  characterized  by  two  key 
variables:  (1)  their  autonomy  vis-a-vis  the  political 
and  administrative  oversight  authorities  and  (2)  the 
way  in,  and  the  extent  to,  which  they  are  steered 
and  controlled  by  these  oversight  authorities. 

Autonomy 

Autonomy  of  agencies  refers  to  the  discretion 
these  agencies  have  to  make  decisions  of  their  own 
or  the  extent  to  which  the  agency  can  decide  itself 
about  issues  it  finds  important.  In  this  sense,  mak¬ 
ing  an  agency  more  autonomous  involves  the  shift¬ 
ing  of  decision-making  competences  from  external 
actors  (e.g.,  the  oversight  authorities)  to  the  agency 
itself.  The  level  of  organizational  autonomy  of 
agencies  (or  the  extent  to  which  the  agency  can 
make  decisions  of  its  own)  is  determined  by  the 
scope  and  the  extent  of  these  decision-making 
competences.  Two  different  scopes  can  be  dis¬ 
cerned.  First,  agencies  may  have  decision-making 
competences  that  are  delegated  to  them  from  the 
oversight  authorities  concerning  the  choice  and  use 
of  inputs  or  organizational  resources.  This  is  the 
managerial  autonomy  of  agencies.  An  agency  can 
have  managerial  autonomy  with  respect  to  several 
types  of  organizational  inputs,  resources  such  as 
financial  resources  and  human  resources,  or  other 
production  factors  such  as  logistics,  organization, 
and  housing  and  infrastructure.  The  second  scope 
of  organizational  autonomy  refers  to  decision¬ 
making  competences  of  the  agency  in  policy  issues 
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or  the  policy  autonomy  of  the  agency.  The  extent 
to  which  an  agency  enjoys  policy  autonomy  will 
depend  on  the  extent  to  which  the  agency  is  steered 
by  its  oversight  authorities  on  processes,  outputs, 
or  effects.  Having  policy  autonomy  means  that  the 
agency  can  make  decisions  about  the  choice  of 
policy  instruments,  the  outputs  that  should  be 
delivered  by  the  policy,  and  the  desirable  societal 
effects  of  the  policy.  According  to  NPM-like  doc¬ 
trines,  the  ideal-typical  agency  has  a  lot  of  manage¬ 
rial  autonomy,  to  fulfill  the  promise  of  “let  the 
managers  manage”  and  to  ensure  that  the  first  part 
of  the  equation  (more  managerial  competences  = 
better  organizational  performance)  is  guaranteed. 
Policy  autonomy,  however,  is  low  in  an  ideal-typi¬ 
cal  agency.  As  already  suggested  above,  according 
to  NPM,  agencies  are  mostly  expected  to  imple¬ 
ment  policies,  not  to  design  them  (the  exception  may 
be  regulatory  agencies,  cf.  supra).  In  reality,  how¬ 
ever,  many  agencies  that  primarily  implement  poli¬ 
cies  also  have  a  large  influence  in  designing  and 
evaluating  policy.  This  may  not  be  surprising  given 
the  fact  that  many  agencies,  as  day-to-day  policy- 
implementing  actors  in  the  policy  field,  are  experts  in 
their  field  and  have  a  clear  vision  on  what  is  happen¬ 
ing  in  the  policy  field.  A  final  issue  concerning  the 
autonomy  of  agencies  is  the  difference  between 
formal  (legal)  and  real  (empirical)  autonomy. 
Formal  or  legal  autonomy  is  the  level  of  autonomy 
the  agencies  enjoy  according  to  the  laws,  executive 
decisions,  and  decrees  that  deal  with  the  establish¬ 
ment  of  these  agencies.  In  these  (legal)  documents, 
it  is  formally  described  to  what  extent  and  under 
what  circumstances  agencies  enjoy  their  own  deci¬ 
sion-making  competences.  In  reality,  however, 
many  agencies  enjoy  more  or  less  autonomy  than  is 
formally  described.  Research  has  shown  that  this 
may  be  due  to  the  (political)  salience  of  the  task 
that  the  agency  performs.  In  a  politically  sensitive 
policy  field,  agencies  may  be  scrutinized  more 
intensely  by  politicians  compared  with  agencies  in 
a  politically  less  salient  policy  field. 

Control 

The  other  side  of  the  coin  is  the  extent  to,  and 
the  way  in,  which  agencies  are  controlled  by  their 
oversight  authorities  (which  may  be  political — e.g., 
ministers — or  administrative — e.g.,  ministries). 
The  ways  in  which  agencies  are  controlled/steered 


by  the  oversight  authorities  can  be  very  diverse. 
First,  agencies  may  be  controlled  ex  ante  (“before 
the  facts”).  Ex  ante  control  is  the  formulation  of 
rules  and  standards  that  give  direction  to  the  actor 
so  that  the  desired  output  (from  the  viewpoint  of 
the  oversight  authority)  will  be  achieved.  Oversight 
authorities  thus  control  the  agency  beforehand. 
They  can  do  so  by  setting  general  or  detailed  rules 
that  constrain  the  autonomy  of  the  agency  in  using 
the  inputs.  In  this  sense,  ex  ante  controls  take  the 
form  of  authoritative  mandates,  rules,  or  regula¬ 
tions  that  specify  what  the  agency  can  do.  Or  the 
oversight  authority  can  control  the  agency  by  tak¬ 
ing  the  major  decisions  itself  or  by  making  the 
decisions  of  the  agency  subject  to  approval  by  the 
oversight  authorities  (e.g.,  nullification  rights  or 
vetoes).  Second,  ex  post  control  or  results  control 
is  in  essence  checking  whether  the  intended  organi¬ 
zational  goals  have  been  achieved  by  the  agency 
(and  whether  there  is  a  need  for  corrective  future 
actions).  Result  control  involves  the  setting  of  the 
desired  organizational  results  (norms),  checking 
whether  or  not  the  organizational  results  meet  this 
norm,  and  eventually  the  adaptation  or  continua¬ 
tion  of  the  organizational  practice.  The  intention 
of  these  “after-the-fact  controls”  is  to  motivate  the 
agency  to  perform  well  and  to  make  good  deci¬ 
sions.  As  such,  agencies  are  made  responsible  for 
the  consequences  of  their  decisions  (“let  the  man¬ 
agers  manage”).  Some  argue  that  result  control 
can  be  regarded  as  a  cycle  that  consists  of  interre¬ 
lated  subsystems:  a  planning  system  (setting  orga¬ 
nizational  goals  and  norms  for  the  agency),  a 
monitoring  system  (to  measure  the  organizational 
results  ex  post),  and  an  evaluation  and  feedback 
system.  The  past  decades,  alongside  the  rise  of  the 
NPM,  a  shift  toward  contractualization  in  the  con¬ 
trolling  of  agencies  by  their  oversight  authorities 
can  be  observed.  In  many  instances,  newly  estab¬ 
lished  agencies  have  performance  contracts  or 
management  contracts  with  their  oversight  author¬ 
ities  in  which  the  rights  and  duties  of  both  parties 
are  agreed  on.  Generally,  the  commitment  of  the 
agency  is  to  deliver  goods  and  services,  on  behalf 
of  the  oversight  authorities,  with  a  high  level  of 
efficiency,  quality,  and  customer  friendliness. 
Indicators  are  developed  to  be  able  to  measure  the 
results  achieved  by  the  agencies.  The  commitment 
of  the  oversight  authorities  generally  has  to  do 
with  ensuring  that  the  agency  has  the  necessary 
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means  (financial,  personnel,  infrastructural)  to  be 
able  to  achieve  good  results.  Also  agencies  that 
have  existed  for  a  longer  time  often  see  their  con¬ 
trol  regime  shifting  from  ex  ante  control  on  inputs 
to  ex  post  control  on  results  and  have  to  agree  on 
performance  or  management  contracts  with  their 
oversight  authorities  too. 

Accountability  and  Performance 

Earlier  in  this  entry,  a  rather  normative  justifi¬ 
cation  for  the  trend  toward  agencification  in  the 
public  sector  was  discussed.  NPM  advocates  often 
argue  that  smaller,  single-task  executive  agencies 
are  better  vehicles  to  deliver  effective  and  efficient 
public  services  and  have  better  chances  to  behave 
innovatively  and  in  a  customer-friendly  way  com¬ 
pared  with  large  monolithic  bureaucratic  organi¬ 
zations.  The  practice  of  performance  contracts,  it 
is  often  argued,  also  enables  one  to  really  assess 
whether  or  not  agencies  are  performing  well. 
However,  it  is  not  certain  whether  the  promise  of 
better  performance  has  been  fulfilled.  Scientific 
evidence  is  at  least  inconclusive.  One  question  is 
what  “performance”  is.  In  a  public  sector  context, 
performance  is  a  broader  concept  than  just  (eco¬ 
nomical)  efficiency.  No  doubt  that  in  many  agen¬ 
cies,  financial  efficiency  may  have  increased  due 
to,  among  others,  a  better  focus  on  the  job.  But 
other  values  that  are  of  importance  in  a  public  sec¬ 
tor  context  such  as  equal  treatment  and  impartial¬ 
ity  in  service  delivery  are  much  harder  to  grasp  in 
indicators,  hence  to  measure  for  performance.  The 
point  is  that  increased  efficiency  does  not  always 
have  to  result  automatically  in  increased  perfor¬ 
mance  on  other  aspects  of  the  job  that  the  agencies 
perform.  Second,  it  is — from  a  methodological 
point  of  view — extremely  difficult  to  compare  per¬ 
formance  of  agencies  with  performance  of  other 
governmental  organizations  (e.g.,  bureaucracies). 
This  is  due  to,  among  others,  the  scarcity  of  public 
functions/tasks  that  are  in  reality  performed  by 
organizations  of  different  types  in  the  same  coun¬ 
try  (the  exceptions  are  schools,  public  and  private). 
Or  the  lack  of  null  measurement  of  the  perfor¬ 
mance  of  the  function  before  it  was  shifted  to  a 
new  organizational  form  (e.g.,  postal  services 
before  and  after  privatization).  In  sum,  most  com¬ 
mentators  agree  that  some  economical  gains  may 
be  observed  with  agencies  but  that  it  does  not 


necessarily  imply  that  gains  can  be  observed  in 
other  aspects  of  performance  such  as  democracy, 
quality,  and  impartiality  (which  are  important  too, 
especially  in  the  public  sector). 

This  brings  us  to  a  final  key  issue  to  be  dis¬ 
cussed  here.  Performance  of  an  organization  also 
has  to  do  with  the  accountability  of  that  organiza¬ 
tion  toward  its  key  stakeholders.  The  issue  of 
accountability  is  an  important  one  because  one  has 
to  be  aware  that  in  the  search  for  the  economic 
gains  of  working  with  agencies  (efficiency;  this  is 
why  they  are  created),  there  is  also  consideration 
of  the  possible  political  or  societal  costs  because  of 
the  lack  of  control  of  these  arm’s-length  agencies. 
In  the  case  of  agencies,  the  key  stakeholders  are  the 
oversight  authorities,  the  citizens  who  receive  pub¬ 
lic  services  and  goods  of  the  agency,  and  ulti¬ 
mately,  the  general  public  or  the  society  as  a 
whole.  Agencies  are  put  at  arm’s  length  of  the 
government,  which  makes  control  of  the  agency  by 
government  (the  oversight  authority)  more  diffi¬ 
cult.  Some  authors  therefore  argue  that  the  so- 
called  vertical  or  upward  accountability  (toward 
the  oversight  authorities)  may  be  weakened.  In 
turn,  it  can  be  argued  that  agencies  develop  mech¬ 
anisms  of  so-called  horizontal  or  downward 
accountability,  which  is  directed  to  the  citizen/cli- 
ent  of  the  agency  (e.g.,  customer  surveys,  reporting 
on  results,  ombudspersons).  Whether  or  not  down¬ 
ward  accountability  can  replace  upward  account¬ 
ability  remains  a  point  of  debate.  Other  observers 
argue  that  vertical  accountability  is  large,  given  the 
fact  that  many  agencies  are  subject  to  a  multitude 
of  control  practice  (nullification  rights,  perfor¬ 
mance  contracts,  audits,  parliamentary  hearings, 
etc.).  In  such  cases,  lack  of  accountability  seems  to 
be  a  lesser  problem. 
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For  any  decisions  to  be  made  in  the  political  sys¬ 
tem,  an  issue  must  be  placed  on  the  active  agenda 
of  government.  In  political  science,  agenda  setting 
is  an  important  topic  for  the  analysis  of  collective 
outcomes,  in  particular  for  the  evaluation  of  their 
democratic  nature.  Agenda  setting  concerns  the 
strategic  choice  or  selection  of  policy  alternatives 
from  a  set  of  all  possible  outcomes  on  which  indi¬ 
vidual,  corporate,  and  collective  actors  finally 
decide.  This  entry  discusses  two  alternative  con¬ 
ceptions  of  agenda  setting.  One  focuses  more  on 
the  politics  of  getting  an  issue  to  the  agenda.  The 
other  body  of  literature  focuses  on  the  impact  of 
agendas  on  the  outcomes  of  the  process  and  on  the 
power  of  agenda  setters  in  determining  outcomes. 


These  two  bodies  of  literature  are  complementary 
and  together  provide  a  full  understanding  of  the 
role  of  agendas  in  shaping  policy  choices. 

The  Politics  of  Agenda  Setting 

In  recent  years,  agenda-setting  theory  has  also 
become  a  prominent  approach  in  political  com¬ 
munication  research,  which  describes  the  transfer 
of  important  topics  to  the  public  sector  for  resolu¬ 
tion,  as  discussed  extensively  by  Frank  Baumgartner 
and  Bryan  Jones.  This  portion  of  the  literature 
discussing  agenda  setting  considers  the  political 
and  social  mechanisms  through  which  problems 
are  identified  and  then  politicized.  This  literature 
reflects  the  strong  political  role  of  agendas  in  mak¬ 
ing  policy. 

The  agenda-setting  literature  also  has  pointed  to 
the  varieties  of  different  agendas  that  exist  in  the 
public  sector  and  the  manner  in  which  issues  are 
moved  on  and  off  these  agendas.  The  systemic 
agenda  represents  all  those  issues  that  are  deemed 
appropriate  for  the  public  sector  to  take  into 
account,  whether  or  not  they  are  being  actively 
considered.  The  institutional  agendas  are  those 
that  are  being  acted  on,  or  at  least  actively  consid¬ 
ered,  at  any  point  of  time.  Furthermore,  there  are 
important  political  considerations  concerning 
which  institutions  constitute  the  foci  for  the  politi¬ 
cal  activity  around  any  issue.  Some  political  sys¬ 
tems,  for  example,  federal  systems  and  those  with 
active  court  systems,  provide  would-be  agenda 
setters  greater  opportunities  than  do  those  with 
fewer  points  of  access. 

For  an  issue  to  become  public  policy,  it  must 
pass  through  several  different  institutions.  In  most 
democratic  systems,  issues  must  go  through  the 
legislature,  and  often  committees  within  the  legis¬ 
lature,  although  the  source  of  the  idea  may  have 
been  the  bureaucracy  or  the  political  executive,  or 
perhaps  interest  groups.  Then  the  issue,  once  acted 
on,  will  have  to  go  to  the  bureaucracy  for  imple¬ 
mentation  and  then  perhaps  to  the  courts  for  adju¬ 
dication.  These  movements  may  occur  as  a  normal 
part  of  political  life  but  generally  require  some 
forms  of  political  impetus  for  them  to  occur. 

The  agenda-setting  literature  has  also  focused 
on  the  roles  of  policy  entrepreneurs  in  the  process 
of  making  policy.  Issues  do  not  move  on  and  off 
agendas  on  their  own  but  require  individual  actors 
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who  foster  their  development  and  their  adoption. 
These  entrepreneurs  have  to  be  prepared  when 
“windows  of  opportunity”  open  that  permit  issues 
that  might  otherwise  be  excluded  to  come  to  the 
active  political  agenda. 

Analytic  Models  of  Agenda 

Setting  and  Agenda  Control 

The  second  approach  to  agendas  and  agenda  set¬ 
ting  has  focused  on  the  influence  that  agendas  and 
the  manipulation  of  agendas  have  on  the  outcomes 
of  deliberative  processes.  Traditionally,  agendas 
and  agenda  formation  have  played  a  prominent 
role  since  the  20th  century  in  analytical  political 
science,  which  investigates  the  power  of  agenda 
setters  in  the  collective  decision-making  process. 
For  democratic  governance,  agenda-setting  power 
can  undermine  the  democratic  process  when  the 
agenda  setter  has  dictatorial  power  in  manipulat¬ 
ing  the  aggregation  of  individual  preferences  to 
collective  outcomes.  According  to  the  social  choice 
and  legislative  analysis  literatures,  a  major  precon¬ 
dition  for  this  threat  of  dictatorial  agenda-setting 
power  is  the  instability  of  collective  outcomes 
under  specific  democratic  voting  rules,  which  may 
allow  an  agenda  setter  to  manipulate  (or  hinder 
from  manipulating)  them.  Irrespective  of  the  insta¬ 
bility  of  collective  outcomes,  bargaining  theories 
and  logrolling  approaches  also  emphasize  the 
importance  of  agenda  setting  when  it  creates  a 
bargaining  advantage  for  the  first  mover.  In  other 
words,  actors  who  can  make  the  first  moves  in 
bargaining  are  often  the  ones  who  are  able  to 
shape  the  final  outcomes  of  the  process. 

According  to  Donald  Schon  and  Martin  Rein 
(1994),  a  more  optimistic  interpretation  comes 
from  the  deliberative  literature,  which  stresses  the 
framing  of  problems  to  find  (stable)  solutions. 
According  to  John  Dryzek  and  Christian  List 
(2003),  deliberation  has  been  argued  to  assist  in 
aggregating  individual  preferences  by  subdividing 
or  ordering  issues  in  a  way  that  induces  stability  of 
outcomes.  The  discussions  involved  in  deliberative 
decision  making  enable  the  participants  to  reach 
solutions  that  are  stable  and  reflect  the  underlying 
preferences  of  those  participants. 

According  to  Kenneth  Joseph  Arrow  (1951), 
most  theoretical  work  on  agenda-setting  power 
originates  in  social  choice  theory  that  focuses  on 


the  (in)stability  of  collective  outcomes  in  demo¬ 
cratic  systems,  which  results  from  the  aggregation 
of  actors’  individual  preferences  under  specific 
democratic  voting  rules.  With  regard  to  the  para¬ 
dox  of  voting,  Duncan  Black  (1948)  and  others, 
such  as  William  Riker,  have  found  that  collective 
outcomes  result  from  the  interaction  preferences  of 
the  actors  involved  in  decision  making  and  the 
structure  of  the  agenda,  that  is,  how  policy  alterna¬ 
tives  are  ordered  in  the  collective  decision-making 
process  of  democracies.  This  has  important  impli¬ 
cations  for  the  democratic  nature  of  collective 
outcomes  and  the  power  of  the  agenda  setter  who 
controls  the  decision-making  process  and  the  vot¬ 
ing  sequence  of  the  actors  involved.  Under  major¬ 
ity  rule,  this  power  is  restricted  only  when  a 
median  voter  guarantees  the  stability  of  collective 
outcomes  in  the  aggregation  of  actors’  individual 
preferences. 

In  many  decision-making  situations,  however, 
the  median  voter  hardly  exists.  The  existence  of  a 
median  voter  requires  that  actors  with  single- 
peaked  preferences  vote  sincerely  on  an  ordered 
single  dimension.  Put  differently,  the  existence  of  a 
median  voter  is  unlikely  when  preferences  are  not 
single  peaked,  when  more  than  one  dimension 
exists,  or  when  actors  vote  strategically.  From 
these  restrictive  conditions,  under  which  alone  the 
median  voter  theorem  holds,  early  studies  such  as 
Black  (1948)  show  that  an  agenda  setter  is  able  to 
achieve  any  outcome  under  majority  rule.  This 
insight  on  the  dictatorial  power  of  agenda  setters 
applies  more  generally  to  all  minimally  democratic 
voting  rules.  Another  insight  from  research  on 
the  (in)stability  of  collective  outcomes  is  that — in 
policy  spaces  with  more  than  one  dimension — 
outcomes  are  almost  always  influenced  by  the  char¬ 
acteristics  of  the  decision-making  processes.  When 
instability  of  the  collective  outcome  is  common 
among  the  participants,  democratic  governance  is 
highly  threatened  by  dictatorial  agenda  setting. 

This  skeptical  view  on  democratic  governance 
has  stimulated  further  discussion  and  research  on 
the  power  of  the  agenda  setters  in  collective  deci¬ 
sion  making.  More  specifically,  the  crucial  question 
has  turned  toward  the  extent  to  which  an  agenda 
setter  can  select  and  perhaps  manipulate  the  deci¬ 
sion-making  process  and,  if  so,  what  implications 
follow  from  this  manipulation  for  collective  out¬ 
comes.  Some  of  the  most  influential  developments 
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in  this  vein  of  research  come  from  studies  of  deci¬ 
sion  making  in  democratic  legislatures.  In  these 
analyses,  the  focus  is  on  who  is  able  to  draft  and 
to  amend  legislative  proposals  that  are  finally 
adopted  by  the  floor.  On  the  most  general  level, 
this  literature  distinguishes  between  different  legis¬ 
lative  procedures  by  the  level  of  institutional 
restrictions  that  define  the  power  relationship 
between  an  agenda  setter  and  the  floor.  More  pre¬ 
cisely,  they  find  that  two  provisions  often  restrict 
the  power  of  the  agenda  setter  in  democratic  legis¬ 
latures,  the  competitive  right  to  initiate  proposals 
and  the  amendment  right  on  the  floor  of  the  legis¬ 
lature.  In  many  democratic  legislatures,  every 
(individual,  collective,  or  corporate)  legislator  has 
the  right  to  draft  a  proposal,  which  can  be  adopted 
by  the  floor  under  either  closed  or  open  rule,  the 
latter  allowing  for  making  amendments. 

With  regard  to  the  findings  in  social  choice  the¬ 
ory,  the  power  of  the  agenda  setter  is  highest  in 
legislatures  without  the  competitive  right  of  initia¬ 
tive,  henceforth  referred  to  as  operating  under 
closed  rule.  Under  this  condition,  the  agenda  setter 
has  complete  control  over  drafting  legislative  pro¬ 
posals,  and  generally,  there  are  limited  options  for 
amendment.  The  sole  right  of  initiative  is  particu¬ 
larly  decisive  when  the  agenda  setter  can  “keep  the 
gate  closed,”  that  is,  he  or  she  can  also  ignore 
requests  for  drafting  proposals  or  possibly  for 
amendments.  However,  in  some  legislatures,  the 
right  of  initiative  is  indeed  in  the  hands  of  a  single 
institutional  actor  to  increase  accountability  for  tak¬ 
ing  legislative  action.  For  example,  in  the  European 
Union  (EU),  the  European  Commission  has  the 
exclusive  right  to  initiate  legislation.  However, 
scholars  debate  whether  the  Commission  is  capable 
of  keeping  the  gates  of  initiative  closed  when  the 
member  states  or  the  European  parliaments  request 
the  European  Commission  to  take  legislative  action. 
But  regardless  of  the  answer  to  this  debate,  the  pro¬ 
vision  of  an  exclusive  right  of  initiative  suggests  that 
there  is  a  trade-off  between  the  accountability  for 
the  decision  and  the  level  of  agenda-setting  power, 
which  is  sometimes  minimized  by  distributing 
agenda-setting  power  across  specialized  commit¬ 
tees.  Kenneth  Shepsle  and  Barry  Weingast  (1984) 
show  that  specialized  committees  can  influence  the 
outcome  on  the  floor  by  drafting  proposals.  Hence, 
the  competitive  right  to  initiate  proposals  may  open 
the  gates  for  alternative  proposals  and  thus  reduce 


not  only  agenda-setting  power  but  also  account¬ 
ability  for  taking  action. 

Another  institutional  provision  to  reduce 
agenda-setting  power  concerns  the  amendment 
right  on  the  floor  of  the  legislature.  Under  closed 
rules,  the  floor  can  only  adopt  or  reject  a  proposal, 
while  under  open  rules,  it  can  amend  the  propos¬ 
als.  Therefore,  the  fundamental  question  is  whether 
making  amendments  is  subject  to  (additional) 
restrictions  coming  from  internal  legislative  rules. 
For  example,  the  proposals  of  the  European 
Commission  can  formally  be  amended  by  the 
European  Council  and  sometimes  by  the  European 
Parliament  in  the  following  stages  of  the  legislative 
process,  which  would  suggest  a  lower  agenda¬ 
setting  power  for  the  European  Commission. 
However,  since  making  amendments  requires  an 
absolute  majority  in  the  Council,  and  sometimes 
the  additional  bicameral  consent  of  the  European 
Parliament,  the  European  Commission  can  exploit 
these  higher  voting  hurdles  in  a  way  that  makes 
amendments  unlikely.  To  the  extent  that  the 
European  Commission  is  able  to  exploit  this 
(bicameral)  restriction,  the  agenda  setter  remains 
de  facto  powerful  and  can  shape  the  final  outcome. 

In  powerful,  transformative  legislatures,  such  as 
the  U.S.  Congress,  the  floor  retains  a  substantial 
capacity  to  amend  proposals  coming  from  an 
agenda  setter  or  from  committees  that  have  been 
preparing  the  legislation  for  consideration  by  the 
plenary  body.  These  processes  tend  to  minimize 
the  powers  of  agenda  setters  and  make  the  deci¬ 
sion-making  process  less  predictable  than  if  there 
were  greater  control  exercised  through  some 
agenda  setter.  That  said,  such  an  open-rule  system 
may  have  more  positive  democratic  effects  than 
the  more  constrained  agenda-setting  processes. 

Another  feature  of  the  amendment  process  is 
that  legislators  often  face  the  problem  of  selecting 
among  several  amendments.  As  Bjorn  Rasch 
(2000)  notes,  theoretically,  a  large  number  of  vot¬ 
ing  procedures  exist  for  selecting  among  several 
amendments,  but  legislatures  conventionally  use 
two  procedures:  the  successive  procedure  and  the 
elimination  procedure.  In  the  successive  procedure, 
legislators  vote  successively  on  each  amendment 
and  decide  whether  or  not  it  should  be  adopted.  If 
an  amendment  is  adopted,  the  process  ends;  if  it  is 
rejected,  the  legislators  vote  on  the  next  amend¬ 
ment  in  the  specified  order.  In  the  elimination 
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procedure,  the  legislators  pairwisely  compare  and 
vote  on  amendments.  The  rejected  amendment  is 
dropped,  and  the  remaining  is  matched  against  the 
next  amendment  for  a  vote,  and  so  on.  Under  both 
procedures,  legislators  can  make  amendments,  but 
the  sequence  of  voting  (and  the  control  thereof) 
can  highly  influence  the  collective  outcome.  As 
under  closed  rule,  an  agenda  setter  can  manipulate 
the  collective  outcome  by  determining  the  voting 
sequence  when  legislators  vote  sincerely.  In  gen¬ 
eral,  amendments  that  are  introduced  late  on  the 
agenda  have  a  higher  probability  of  adoption. 
However,  when  legislators  vote  strategically  and 
anticipate  the  outcome,  the  reverse  is  also  possible. 

Although  this  research  commonly  finds  that 
agenda  setters  remain  powerful  actors  in  demo¬ 
cratic  legislatures,  some  additional  insights  into  the 
collective  instability  of  outcomes  in  legislatures 
warrant  greater  empirical  attention.  Theoretically, 
the  agenda  setter  can  produce  any  collective  out¬ 
come  when  legislators  vote  sincerely,  but  the  scope 
of  manipulation  is  often  restricted  when  extreme 
outcomes  are  much  more  difficult  to  achieve.  A 
major  reason  for  this  difficulty  is  that  the  voting 
sequence  usually  becomes  very  long  and  compli¬ 
cated  when  collective  outcomes  hardly  match  with 
the  preferences  of  the  legislative  majority.  In  prac¬ 
tice,  as  Rasch  (2000)  notes,  such  complicated 
sequences  are  hardly  observed  in  legislatures. 
Another  argument  against  the  importance  of  the 
instability  of  collective  outcomes  and  the  resulting 
dictatorial  power  of  the  agenda  setter  criticizes  the 
different  level  of  behavioral  assumptions  that  are 
made  for  the  agenda  setter  and  the  legislators,  that 
is,  when  the  former  is  assumed  to  behave  strategi¬ 
cally  while  the  latter  is  expected  to  vote  sincerely. 
When  the  same  behavioral  assumption  is  made 
and  legislators  also  behave  strategically,  they  can 
anticipate  an  outcome  from  the  decision-making 
process,  which  means  that  agenda  manipulation 
becomes  much  more  difficult.  Under  these  condi¬ 
tions,  collective  outcomes  are  located  in  the  uncov¬ 
ered  set,  which  generally  corresponds  to  the  prefer¬ 
ences  of  legislators  more  closely. 

According  to  Rasch  (2000),  in  most  (democratic) 
legislatures  the  structure  of  the  agenda  and  the 
sequence  of  voting  are  not  controlled  by  a  single 
actor.  The  responsibility  for  formulating  the  agenda 
is  often  delegated  to  the  government  of  the  day  or  to 
the  speaker,  but  proposals  can  usually  be  amended 


by  the  parliament  or  the  floor.  Hence,  when  agenda 
formation  is  de  facto  made  under  open  rules,  the 
power  of  the  agenda  setter  shrinks,  and  the  set  of 
feasible  outcomes  is  restricted  to  the  uncovered  set. 
Given  that  result,  most  empirical  research  suggests 
that  democratic  institutions  are  designed  in  a  way 
that  restricts  the  power  of  the  agenda  setter.  For 
example,  the  rules  can  make  it  possible  for  legisla¬ 
tors  to  gather  more  information  on  their  preferences 
in  the  decision-making  process  and  thereby  induce 
more  stability  by  providing  information.  As  with  the 
deliberative  processes  mentioned  above,  the  involve¬ 
ment  of  more  potential  decision  makers  can  pro¬ 
duce  more  stable  outcomes  than  more  hierarchically 
determined  outcomes  in  legislatures. 

A  highly  effective  (but  rarely  observed)  method 
of  agenda  setting  in  legislatures  is  to  employ  pro¬ 
cedures  in  which  the  intensity  of  individual  prefer¬ 
ences  is  expressed  by  the  number  of  votes  assigned 
to  an  alternative  (e.g.,  border  count  voting). 
Another  way  to  restrict  the  power  of  the  agenda 
setter  would  be  the  application  of  “backward- 
moving”  instead  of  “forward-moving”  agendas.  In 
forward-moving  agendas,  legislators  first  take  a 
vote  between  the  status  quo  and  a  single  alterna¬ 
tive,  while  backward-moving  agendas  reverse  this 
sequence  and  place  the  vote  on  the  status  quo  on 
the  final  stage.  As  described  by  Shepsle  and 
Weingast  (1984),  this  procedure  for  establishing 
agendas  usually  limits  the  ability  of  the  agenda  set¬ 
ter  to  manipulate  the  outcome  of  the  decision¬ 
making  process  vis-a-vis  the  status  quo.  Hence, 
empirical  analyses  rarely  support  the  skeptical 
view  on  the  democratic  nature  of  legislatures  when 
the  dictatorial  power  of  the  agenda  setter  is  only 
derived  from  an  inherent  instability  of  outcomes: 
Cases  in  which  such  manipulations  have  been 
observed  are  exceptionally  rare  in  the  real  world. 

Another  way  to  study  agenda  setting  under  open 
rules  is  described  by  bargaining  theories.  In  this 
approach,  the  implications  of  open  rules  often  fol¬ 
low  the  idea  that  legislators  make  alternating  offers 
to  each  other  and  decide  whether  they  should 
accept  a  proposal  for  a  final  vote  on  the  floor  or 
whether  they  should  attempt  to  amend  it.  In  this 
bargaining  process,  the  legislators  are  commonly 
assumed  to  behave  strategically  and  to  make  pro¬ 
posals  that  they  believe  are  acceptable  to  others 
involved  in  the  process.  They  still  would,  however, 
only  make  proposals  that  would  maximize  their 
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own  benefit.  A  key  feature  of  these  bargaining 
models  is  that  they  assume  that  time  is  costly;  that 
is,  the  benefits  from  reaching  an  early  agreement 
are  higher  than  those  derived  from  reaching  the 
same  decision  later.  Hence,  whenever  an  actor 
drafts  a  new  proposal,  the  benefit  is  assumed  to  be 
lower  than  that  derived  from  adopting  the  pro¬ 
posal  of  the  earlier  round,  and  so  on.  As  a  conse¬ 
quence,  actors  do  not  waste  their  time  in  infinite 
sequences  of  offers  and  counteroffers.  This  assump¬ 
tion  is  often  sufficient  to  identify  a  solution  from 
the  agenda-setting  process;  that  is,  when  no  actor 
has  an  incentive  to  amend  a  proposal,  then  some 
generally  acceptable  solution  has  been  reached. 
This  assumption  about  the  process  implies,  how¬ 
ever,  that  actors  can  benefit  from  making  early 
proposals  in  decision-making  situations.  For  the 
same  reason,  this  also  favors  patient  actors,  who 
can  better  afford  to  make  counterproposals  in  the 
long  run.  As  David  Baron  and  John  Ferejohn 
(1989)  note,  under  majority  rule,  first  movers  have 
an  additional  advantage  in  shaping  their  agenda 
because  their  proposal  can  affect  the  winning 
coalition  that  results  from  the  negotiations. 
Irrespective  of  the  instability  of  collective  out¬ 
comes,  bargaining  theories  also  support  the  view 
on  the  importance  of  agenda  setting  in  collective 
decision  making.  Apart  from  institutional  features, 
these  theories  demonstrate  that  actor-specific  char¬ 
acteristics,  such  as  the  impatience  of  legislators, 
have  an  important  impact  on  which  proposals 
come  to  a  vote  and  who  shapes  the  final  outcome 
in  the  negotiations. 

These  insights  into  agenda  setting  are  derived 
from  fixed  policy  spaces,  while  another  line  of 
research  addresses  the  question  on  how  issue  link¬ 
age  or  package  dealing  can  change  the  decision¬ 
making  process  and  collective  outcomes.  According 
to  Elmer  Schattschneider  (1960),  from  a  strategic 
perspective,  actors  may  seek  to  (de-)emphasize  par¬ 
ticular  issues  in  order  to  maximize  their  influence 
on  collective  outcomes.  With  regard  to  agenda  set¬ 
ting,  this  maximization  strategy  can  comprise  sev¬ 
eral  means:  First,  an  agenda  setter  may  bring  issues 
to  a  vote  on  which  he  or  she  has  a  winning  posi¬ 
tion,  creating  a  “bandwagon”  effect.  Another  strat¬ 
egy  might  be  to  introduce  new  issues  into  a  policy 
space  that  others  do  not  sufficiently  address. 
Hence,  issues  may  strategically  be  added  or  removed 
from  the  agenda  by  the  agenda  setter  to  reshuffle 


majorities  in  collective  decisions.  Following  this 
idea,  issues  become  prominent  when  the  agenda 
setter  is  interested  in  their  awareness. 

Conclusion 

These  analyses  of  the  literature  on  agendas  suggest 
that  analytical  political  science  and  political  com¬ 
munication  research  can  come  together  for  a 
future  common  research  agenda.  The  communica¬ 
tion  literature  emphasizes  the  role  of  entrepreneurs 
and  their  contribution  in  shaping  agendas.  The 
analytic  perspective  adds  an  even  more  strategic 
sense  and  examines  how  the  agenda  setter  can 
manipulate  the  political  process  to  produce  desired 
policy  outcomes.  These  two  perspectives  are  largely 
complementary  but  too  infrequently  are  brought 
together  in  a  more  comprehensive  analysis  of  the 
setting  and  manipulation  of  political  agendas. 
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Aggregate  data  are  quantified  attributes  of  collec¬ 
tivities  that  either  relate  to  the  body  of  interest  as 
a  whole  (e.g.,  the  level  of  democracy  for  countries) 
or  have  been  aggregated  on  the  basis  of  the  prop¬ 
erties  of  individual  members  of  the  collective. 
Aggregate  data  could  also  be  negative,  that  is,  not 
individual  data,  meaning  that  aggregate  data  refer 
to  bigger  entities  (e.g.,  nations,  regions,  or  compa¬ 
nies)  than  individual  data  do  (e.g.,  voters,  work¬ 
ers,  or  inhabitants  of  some  geographic  area).  The 
process  of  aggregation  can  be  performed  by  calcu¬ 
lating  sums  or  various  means  (e.g.,  geometric  or 
arithmetic  mean)  of  the  frequency  distribution  of 
individual  cases.  Aggregate  data  are  predomi¬ 
nantly  secondary  data;  that  is,  researchers  do  not 
usually  collect  such  data  on  their  own.  Another 
distinction  refers  to  macro-  versus  microdata, 
with  aggregate  data  always  referring  to  quantities 
on  a  higher  meso-  or  macrolevel.  In  this  entry, 
types  and  sources  of  such  data,  major  problems  of 
reliability  and  comparability,  and  the  possible 
applications  of  aggregate  data  are  discussed. 

Kinds  of  Aggregate  Data 

There  are  three  types  of  aggregate  data: 

1.  data  relating  to  a  collective  as  a  whole  (e.g., 
population  or  democracy  indices,  number  of 
veto  players  or  other  characteristics  of  political 
systems); 

2.  aggregated  individual  data  (e.g.,  the  Gini  Index 
for  measuring  income  disparities,  or  the 
unemployment  rate),  which  can  be  subdivided 
into  two  types: 

a.  census  data,  where  every  single  case  of  the 
whole  population  is  measured  (complete 
inventory  count) 


b.  data  from  samples,  which  should  fit  the 
context  of  the  research  question  and  be  as 
representative  as  possible;  and 

3.  event  data,  which  give  a  frequency  of  events 
within  a  given  period,  mostly  gathered  from 
media  sources  (event  data  have  become 
important  sources  for  analysis,  especially  in 
international  conflict  research). 

Basic  Problems  With  Aggregate  Data 

There  are  some  basic  problems  in  comparative 
studies  using  aggregate  data  that  have  to  be  taken 
seriously.  The  first  problem  is  whole-nation  bias, 
which  results  from  ignoring  subnational  variation; 
loss  of  information  and  reduced  complexity  are  the 
consequences.  Second,  aggregate  data  are  often 
measured  in  monetary  units  (e.g.,  for  expenditure, 
import  or  trade  volumes).  For  reasons  of  compara¬ 
bility  between  countries,  the  different  currencies 
are  most  often  translated  into  dollars  or  euros, 
which  poses  the  problem  of  finding  an  adequate 
exchange  rate  for  standardizing  to  the  dollar  or 
euro.  One  such  standardization  uses  purchasing 
power  parities,  which  may  be  misleading.  Another 
alternative  for  comparing  aggregated  data  is  to 
standardize  using  a  percentage  of  a  national  prod¬ 
uct  measure.  The  choice  and  correct  measurement 
of  such  indicators  (e.g.,  gross  domestic  product 
[GDPj  or  gross  national  product  [GNPJ)  are  diffi¬ 
cult  and  problematic;  official  data  do  not  account 
for  huge  differences  such  as  those  concerning  black 
markets  or  forms  of  subsistence  income. 

Intercountry  comparability  is  also  undermined 
by  the  use  of  different  bases  of  calculation  and  a 
dissimilar  inclusiveness  of  the  indicators.  An  exam¬ 
ple  makes  this  clear:  The  military  budget  is  not 
identical  with  the  budget  of  the  Ministry  of 
Defense.  There  may  be  a  number  of  military  expen¬ 
ditures  (e.g.,  for  the  suppression  of  terrorism)  that 
fall  under  the  authority  of  other  departments,  such 
as  the  Ministry  of  the  Interior.  Another  complica¬ 
tion  is  the  fact  that  the  allocation  of  funds  for  such 
expenditure  may  vary  from  one  country  to  another. 
The  varying  levels  of  quality  in  collecting  data 
among  countries  can  serve  as  a  source  of  heteroske- 
dasticity  (i.e.,  a  nonconstant  variance  where  one 
expects  a  constant  variance  in  the  data).  Therefore, 
different  methods  of  data  collection  (e.g.,  estima¬ 
tion  instead  of  more  exact  measurement)  lead  to 
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less  reliable  data  and,  in  the  end,  to  poorer  estima¬ 
tions  in  regression  analysis. 

Depending  on  which  institution  supplies  the 
data  (e.g.,  the  Organisation  for  Economic 
Co-operation  and  Development  [OECDJ  or  the 
International  Monetary  Fund  [IMF]  for  public 
finance  data),  huge  differences  for  measures  of  the 
same  variables  are  often  seen.  Also,  revisions  over 
time  can  lead  to  serious  distortions.  OECD  data, 
especially  GDP,  are  revised  every  5  years.  Apart 
from  comparisons  between  countries,  the  com¬ 
parisons  between  different  points  in  time  within 
one  country  can  be  flawed  as  well;  for  example,  if 
statistical  offices  change  their  systems  of  data 
acquisition  and  calculation,  poor  intertemporal 
comparability  can  result.  Another  problem  regards 
event  data  that  mostly  depend  on  media  sources;  in 
such  cases,  selective  media  coverage  can  lead  to 
overrepresentation  of  events  in  countries  with  a 
highly  developed  media  infrastructure. 

A  different  problem  in  interpreting  aggregate 
data  is  known  as  the  ecological  fallacy.  It  occurs 
when  inferences  drawn  about  individuals  are 
based  solely  on  data  at  a  higher  level  of  aggrega¬ 
tion.  For  example,  when  analyzing  referenda 
on  European  integration,  it  turns  out  that  rich 
countries — in  terms  of  GDP — tend  to  vote  less 
in  favor  of  Europe.  By  contrast,  Eurobarometer 
statistics  show  that  in  most  countries  individuals 
with  higher  incomes  tend  to  vote  more  pro-Europe 
than  do  those  with  lower  income.  The  ecological 
fallacy  would  be  to  falsely  conclude  from  the 
strong  correlation  at  the  highly  aggregated  level  a 
correlation  at  the  individual  level. 

Reasons  for  the  Use  of  Aggregate  Data 

So  why  should  political  scientists,  despite  all  these 
basic  problems,  still  be  interested  in  aggregate 
data?  The  answer  begins  on  a  basic  level  of  theo¬ 
retical  scientific  principles  with  a  paraphrase  of  the 
well-known  Kantian  claim  that  concepts  without 
data  are  empty,  while  percepts  without  concepts 
are  blind.  Data  without  theory  also  are  blind,  but 
theory  equally  needs  data,  because  standing  on  its 
own  it  would  be  empty.  Generally  speaking,  social 
researchers  need  to  have  access  to  data  in  order  to 
test  and  develop  hypotheses  derived  from  models 
and  theory.  Both  deductive  and  inductive 
approaches  need  data.  Aggregated  data  should  be 


used  in  particular  (a)  when  the  theory  is  already 
formulated  at  the  macrolevel  and  (b)  when  the  use 
of  primary  and  individual  data  would  be  problem¬ 
atic  in  terms  of  availability,  comparability,  usabil¬ 
ity,  and  cost. 

Time  series  of  aggregate  data  are  also  a  strong 
resource  for  detecting  causal  mechanisms.  For 
example,  a  detailed  analysis  of  the  historical  evolu¬ 
tion  of  democracies,  such  as  Samuel  Huntington’s 
“waves  of  democracy,”  would  have  been  impossible 
without  measurements  of  democracies  such  as  those 
made  in  the  Polity  IV  database.  Another  advantage 
of  aggregated  data  is  the  possibility  of  making  pre¬ 
dictions  on  the  macrolevel,  for  example,  for  eco¬ 
nomic,  demographic,  or  political  developments. 

Furthermore,  reasons  for  the  use  of  aggregate 
data  arise  from  the  fact  that  they  are  especially 
suited  for  the  use  of  statistical  methods  (from 
simple  descriptive  statistics  to  the  most  sophisti¬ 
cated  methods  of  analysis)  allowing  for  efficient 
comparisons  of  large  spatial  and  social  units  of 
analysis.  With  the  use  of  statistics,  political  scien¬ 
tists  are  also  able  to  test  reliability  and  to  replicate 
other  analyses  quite  easily. 

Sources  of  Aggregate  Data 

The  availability  of  aggregate  data  is  important 
when  it  comes  to  the  analysis  of  historical  data. 
One  such  aggregate  data  analysis,  generally  con¬ 
sidered  the  first,  was  a  study  of  suicide  rates  in 
different  countries  by  Emile  Durkheim.  Durkheim 
came  to  the  conclusion  that  suicide  is  more  com¬ 
mon  in  predominantly  Protestant  countries  than  in 
Catholic  ones.  Max  Weber  was  another  early  user 
of  aggregated  data,  using  information  provided  by 
the  national  statistical  offices,  which  were  estab¬ 
lished  in  the  late  18th  century  (1796  in  Sweden, 
1797  in  Norway,  1800  in  France,  1829  in  Austria). 
National  statistical  offices  today  are  still  one  of  the 
major  sources  for  aggregate  data.  Other  data  sup¬ 
pliers  are  international  organizations  (e.g., 
UNSTAT,  Eurostat,  the  World  Bank,  the  IMF), 
which  collect  and  prepare  national  data  for  inter¬ 
national  comparisons.  One  of  the  most  compre¬ 
hensive  archives  for  international  comparisons, 
especially  for  the  developed  countries,  is  hosted  by 
the  OECD.  Its  data  archives  developed  rapidly  (in 
part,  due  to  advances  in  computer  and  Internet 
technology)  and  provide  new  research  possibilities 
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in  terms  of  quantity,  quality,  and  access  to  data. 
The  steady  increase  in  data  availability  gives 
researchers  the  opportunity  to  tackle  new  scientific 
problems  and,  by  using  data  collected  at  different 
points  in  time,  to  employ  longitudinal  research 
designs.  National  social  science  data  archives  such 
as  the  Inter-University  Consortium  for  Political 
and  Social  Research  (ICPSR)  in  the  United  States, 
the  UK  Data  Archive  hosted  by  the  University  of 
Essex,  and  the  “Zentralarchiv”  at  the  University  of 
Cologne,  Germany,  provide  access  to  specific 
social  science  data,  often  international  as  well  as 
national  data.  There  are  also  special  data  archives 
serving  particular  purposes,  such  as  the 
Luxembourg  Income  Studies  (LIS),  which  provide 
comparable  income  data. 

Although  easy  to  use,  these  data  archives  bear 
some  potential  risks  as  well.  The  first  risk  is  empir¬ 
icism,  or  as  Jan-Eric  Lane  (1990)  put  it,  there  is  the 
“constant  hazard  that  data  considerations  govern 
the  conduct  of  comparative  inquiry  instead  of  the¬ 
oretical  considerations”  (p.  191).  Data  archives  are 
often  constructed  without  an  underlying  social  sci¬ 
ence  theory,  obstructing  a  sensible  selection  of 
information,  which  in  the  end  could  result  in  a 
boundless  gathering  of  data  that  are  inapplicable 
or  unusable.  This  problem  of  too  much,  perhaps 
irrelevant,  data  stored  in  archives  can  be  explained 
as  most  archives  have  some  bias  toward  certain 
types  of  data  that  are  readily  stored;  furthermore, 
relevant  information  that  is  more  difficult  to  mea¬ 
sure  is  thus  underrepresented.  Other  less  serious 
problems  of  data  archives  that  can  nevertheless 
result  in  a  lot  of  time-consuming  work  for  the 
social  scientist  are  the  contextual  problem  and  the 
processing  problem.  The  first  concerns  the  fact  that 
data  in  most  archives  have  been  assembled  only  for 
certain  types  of  research  problems  and  thus  may 
not  be  suitable  for  other  research  questions — one 
often  has  to  at  least  reassemble  the  data  in  a  way 
appropriate  for  testing  one’s  hypotheses.  The  latter 
refers  to  the  myriad  possibilities  of  data  storage 
and  formatting,  frequently  impeding  the  simple  use 
of  data  with  the  statistical  package  of  choice. 

In  addition  to  data  archives,  printed  sources  can 
be  used  to  construct  a  data  set:  The  classical  way 
is  to  take  handbooks  with  either  thematic  or 
regional  foci.  Examples  of  the  first  kind  are  Dieter 
Nohlen’s  compendia  of  elections  in  the  world,  the 
electoral  almanac  of  Thomas  Mackie  and  Richard 


Rose,  or  the  Political  Data  Handbook  by  Jan-Erik 
Lane,  David  McKay,  and  Kenneth  Newton. 
Furthermore,  the  revolution  in  information  tech¬ 
nology  has  cleared  the  way  for  new  possibilities  to 
gather  data,  so  that  the  Internet  has  become  the 
most  important  data  source  for  scholars  around 
the  world.  The  quantity  of  online  resources  is 
enormous,  but  from  an  academic  point  of  view  the 
quality,  especially  in  terms  of  the  validity  of  the 
data,  is  mixed.  Many  privately  run  websites  do  not 
meet  scientific  standards. 

Analyses  That  Can  Be  Done 
With  Aggregate  Data 

Existing  data  sets  can  be  used  either  for  replicating 
former  studies  (for  which  these  data  sets  have  once 
been  made)  or  for  running  further  analyses.  In  a 
metastudy  on  the  use  of  data  sets  in  comparative 
politics,  Andreas  Schedler  and  Cas  Mudde  found 
that  the  data  sets  used  in  articles  published  in  six 
of  the  leading  English-speaking  journals  of  politi¬ 
cal  science  between  1989  and  2007  show  some 
striking  characteristics: 

1.  Nearly  half  of  all  observed  studies  using  data 
sets  are  single  case  studies  and  only  a  minority 
(24.5%)  are  large-N  studies  with  more  than 
20  countries  in  the  sample.  Large-N  studies, 
however,  are  becoming  more  and  more 
common.  In  1989,  only  one  of  the  30  studies 
had  more  than  20  cases;  in  2007,  these  were 
already  41.4%. 

2.  Depending  on  data  availability,  most  studies 
center  on  the  Western  world  and  only  on  a 
small  subsample  of  all  the  thematic  fields 
comparative  politics  has  to  offer;  thus,  it  is  a 
small  world,  the  world  of  comparative  political 
statistics. 

3.  In  accordance  with  these  findings,  it  is  not 
surprising  that  the  well-known  global  political 
data  sets,  such  as  Freedom  House,  Correlates  of 
War,  or  Minorities  at  Risk,  are  not  used  as 
frequently  as  expected;  rather  it  is  the  class  of 
country-specific  data  sets  that  is  most  often 
applied. 

The  last  important  finding  of  their  study  is  that 
only  a  very  small  minority  of  researchers  are  able 
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to  perform  their  analyses  without  modifying  an 
existing  data  set.  Most  scientists  have  to  modify, 
update,  or  refine  existing  data  sets  to  fit  their 
research  questions.  For  a  considerable  number  of 
studies  (38.7%),  comparative  scholars  had  to  com¬ 
pletely  construct  new  data  sets  on  their  own. 

In  general,  each  aggregate  data  analysis  starts 
with  a  simple  description  of  the  data.  This  task  can 
be  performed  by  using  either  tables  or  graphical 
means  such  as  bar  charts  or  box  plots.  For  some 
simple  comparisons,  this  level  of  analysis  may  be 
sufficient,  but  most  researchers  have  more  specific 
research  questions  that  they  want  to  address  via 
aggregate  data.  Let  us  assume  that  a  political  scien¬ 
tist  is  searching  for  the  causes  of  government  termi¬ 
nation  in  parliamentary  democracies;  that  is,  the 
dependent  variable  is  the  time  in  office.  The  theory 
states  that  in  addition  to  other  factors,  the  polariza¬ 
tion  of  the  parliament,  the  economic  environment, 
as  well  as  the  time  already  spent  in  office  should 
play  a  significant  role  in  early  terminations.  The 
first  step  of  the  analysis  would  be  to  select  a  sample 
of  cases.  For  this  research  question,  a  most  similar 
case’s  design  seems  appropriate,  resulting  in  a  selec¬ 
tion  of  all  countries  having  a  sufficiently  long 
record  as  a  parliamentary  democracy  (say,  20 
years)  and  having  all  necessary  information  for  test¬ 
ing  the  theory  available.  This  leaves  us  most  likely 
with  the  OECD  countries  and  thus  quite  a  small 
number  of  cases  (approximately  N  =  20) — that  is, 
if  we  consider  each  country  as  one  case.  Doing  so, 
we  could  compare  the  mean  government  duration 
of  the  countries  with  the  means  of  the  explanatory 
variables,  for  example,  via  bivariate  cross-section 
regressions.  The  results  would  nevertheless  not  be 
very  strong  because  the  relationship  between  the 
independent  variables  and  the  dependent  variable  is 
likely  more  complex  than  a  simple  bivariate  rela¬ 
tion  and  the  independent  variables  are  probably 
inter  correlated.  Therefore,  it  makes  sense  to  test  the 
theory  with  multivariate  regression  methods. 

Here,  a  problem  arises  that  is  quite  common  in 
social  science.  On  the  one  hand,  there  is  a  small  N, 
and  on  the  other,  there  are  very  complex  models, 
consisting  of  many  possible  explanatory  variables. 
This  constellation  generates  a  severe  problem  for 
statistical  analyses,  as  the  degrees  of  freedom  are  in 
these  circumstances  mostly  not  sufficient  for  a 
sound  estimation  of  statistical  models.  There  are 
two  ways  to  handle  this  problem:  ( 1 )  decrease  the 


number  of  independent  variables  or  (2)  increase  N. 
One  way  to  minimize  the  explanatory  variables  is 
to  construct  indices.  In  our  example,  the  economic 
environment  measured  by  the  two  variables,  infla¬ 
tion  and  unemployment  rate ,  could  also  be  cov¬ 
ered  by  a  combination  of  both,  called  the  misery 
index.  Decreasing  the  number  of  independent 
variables  can  be  done  only  in  special  circum¬ 
stances;  in  most  instances,  researchers  will  have  to 
exclude  certain  explanatory  variables  from  their 
models.  The  other  possibility  is  to  increase  the 
number  of  cases.  This  task  is  generally  quite  diffi¬ 
cult.  A  potential  alternative  is  to  not  only  look  at 
the  national  level  but  also  include  subnational  enti¬ 
ties,  when  this  is  appropriate.  In  our  example  of 
government  duration,  this  could  be  possible  in 
some  federal  states,  but  the  independent  economic 
variables  being  defined  on  the  national  level  under¬ 
mine  a  meaningful  comparison.  Another  possibil¬ 
ity,  which  has  become  quite  popular  during  the 
past  few  decades,  has  been  the  use  of  pooled  time- 
series  cross-section  analysis.  The  pooling  of  yearly 
data  into  a  matrix  generates  more  cases  (more 
precise  observations)  for  the  analysis.  This  proce¬ 
dure  nonetheless  raises  new  problems  such  as 
autocorrelation,  which  violates  the  assumption  of 
random  distribution  of  cases  that  is  required  by  the 
standard  ordinary  least  squares  (OLS)  regression. 
Pooled  analyses,  therefore,  have  to  be  used — if  at 
all — in  an  extremely  careful  manner.  When  all 
these  procedures  do  not  result  in  a  satisfactory 
cases  to  independent  variables  ratio,  qualitative 
comparative  analysis  (QCA),  a  technique  using 
Boolean  algebra,  could  be  applied.  Fortunately,  in 
our  example,  it  is  easier  to  maximize  N,  because 
every  government  can  be  regarded  as  a  single  case, 
and  the  data  do  not  have  to  be  aggregated  further. 
Thus,  it  is  possible  to  run  multivariate  regressions 
on  a  sample  of  all  governments.  However,  the  best 
methodological  choice  is  probably  not  a  regression 
but  an  event  history  model,  which  takes,  apart 
from  the  other  independent  variables,  elapsed  time 
as  a  relevant  factor  into  account. 

Uwe  Wagschal  and  Sebastian  Jackie 
University  of  Heidelberg 
Heidelberg,  Germany 
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Political  alienation  was  introduced  in  political  and 
social  thought  in  the  19th  century.  From  the 
beginning,  the  concept  was  broadly,  ambiguously, 
and  inconsistently  used  by  scholars,  mainly 
because  of  its  ideological  connotations  and  pur¬ 
poses.  In  Hegel’s  and  Marx’s  analyses  of  civic  and 
capitalist  society,  alienation  meant  a  state  or  a 
process  of  human  disconnectedness.  According  to 
Claude  S.  Fischer,  “alienation  is  the  state  in  which 
the  actor  fails  to  perceive  a  positive  interdepen¬ 
dence  between  himself  and  social  relationships  or 
other  objectifications”  (1974,  p.  18).  According  to 
Melvin  Seeman,  the  most  important  elements  of 
this  conceptualization  are  the  following: 

1.  Alienation  refers  to  a  kind  of  “discrepancy” — 
something  that  is  only  implicitly  entailed  in  the 
definition  before. 

2.  The  individual  himself  or  herself  or  his  or  her 
social  environment  functions  as  a  target  or  a 
source  of  alienation. 

3.  Alienation  can  be  regarded  as  a  state  or  as  a 
process. 


4.  It  can  be  considered  as  a  social  situation/ 

relationship,  an  individual  mental  state,  a  social 
circumstance,  or  a  sentiment. 

These  aspects  are  discussed  below. 

In  contemporary  social  science,  alienation  was 
restated  as  an  essential  part  of  the  critique  of 
advanced  capitalism  by  authors  like  Erich  Fromm, 
Herbert  Marcuse,  and  Alain  Touraine.  Alienation, 
understood  as  an  objective  state  or  process,  refers  to 
a  lack  of  socioeconomic  resources,  an  underprivi¬ 
leged  social  status,  an  inferior  position  in  a  power 
relation,  a  lack  of  autonomy  in  labor  relationships, 
or  the  degradation  of  objective  life  conditions.  As  a 
subjective  feeling,  it  has  several  different  meanings, 
such  as  a  perceived  loss  of  something  considered 
valuable,  a  sense  of  inferiority,  feelings  of  frustra¬ 
tion,  isolation,  anomie,  lack  of  control  of  one’s 
own  life  conditions,  or  a  perception  of  being 
manipulated  by  powerful  others  or  by  social  struc¬ 
tures.  Drafted  like  this,  the  concept  found  its  way 
in  various  scientific  disciplines  such  as  philosophy, 
psychiatry,  psychology,  sociology,  and  political 
science.  Manifold  social  and  psychic/attitudinal 
states  and  processes  were  subsumed  under  the 
heading  of  alienation,  ranging  from  physical  or 
mental  illness  to  specific  attitudes  toward  politics, 
such  as  political  distrust  or  feelings  of  belonging  to 
a  discriminated  social  group  (class,  ethnicity,  etc.). 
In  modern  political  science,  alienation  was  used 
more  narrowly  and  referred  to  negative  or  critical 
political  attitudes  often  leading  to  deviant  political 
behavior.  The  topics  that  have  been  most  promi¬ 
nent  in  empirical  political  research  on  alienation 
are  analyses  of  various  dimensions  of  the  concept 
and  its  measurement  and  analyses  of  the  impact  of 
feelings  of  alienation  on  individuals’  political 
behavior,  particularly  political  apathy  and  partici¬ 
pation  in  protest  activities. 

Other  than  the  view  of  alienation  as  a  general, 
diffuse  disposition,  the  term  was  used  as  a  multidi¬ 
mensional  concept  in  empirical  research.  In  this 
regard,  Seeman  was  the  leading  scholar  proposing 
a  distinction  of  the  subsequent  six  different  dimen¬ 
sions  of  alienation,  which  were  primarily  under¬ 
stood  as  mental  states: 

1.  Powerlessness  refers  to  an  individual’s 
perception  of  low  self-esteem  or  inferiority  in 
political  life  or  to  a  sense  of  low  control  of  one’s 
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own  life  conditions  and  the  political  circumstances 
one  lives  in.  People  feeling  politically  powerless 
think  that  they  do  not  have  any  opportunity  to 
make  their  voices  heard  in  political  life  and  to 
influence  the  conduct  of  public  affairs.  Some 
scholars  distinguish  between  personal  and  political 
powerlessness,  with  the  first  set  of  attitudes 
describing  the  (lack  of)  ability  to  master  one’s  own 
life  and  the  second  set  referring  to  the  perceived 
(lack  of)  capacity  to  exert  some  degree  of  influence 
on  politics.  The  feeling  of  powerlessness  comes 
quite  close  to  the  sense  of  political  (in)efficacy  or 
subjective  (in)competence,  which  are  much  more 
familiar  concepts  in  political  science.  It  was  often 
measured  by  using  the  ANES  (American  National 
Election  Studies)  standard  items  on  political 
efficacy.  The  I-E  (Internal-External  Locus  of 
Control)  Scale  developed  by  Rotter  is  the  most 
common  alternative  measure. 

2.  Meaninglessness  has  to  do  with  the 
individuals’  sense  of  being  able  to  comprehend 
political  events,  situations,  and  decisions  and  with 
the  individuals’  assessment  of  the  impact  of  politics 
on  their  own  lives.  In  short,  people  suffering  from 
meaninglessness  are  cognitively  disconnected  from 
social  and  political  life.  This  attitude  resembles  to 
a  certain  degree  the  cognitive  component  of 
powerlessness  and  thus  is  often  interpreted  as  a 
facet  of  political  (in)efficacy. 

3.  Normlessness  as  the  third  subdimension  of 
alienation  is  largely  synonymous  to  anomie  and 
signifies  the  expectation  that  only  the  use  of 
socially  unapproved  means  will  lead  to  the 
achievement  of  given  goals.  This  expectation  is  due 
to  the  perception  that  social  norms  regulating 
individual  and  collective  behavior  do  not  perform 
their  function  any  longer.  Hence,  political  leaders 
do  not  react  in  a  calculable  and  responsive  way  to 
citizens’  demands.  Those  citizens  who  are  well 
aware  of  this  situation  will  rely  on  illegal  or 
illegitimate  forms  of  political  behavior.  In  the 
practice  of  empirical  research,  normlessness  is 
often  equated  to  distrust  and  measured  by  the 
ANES  standard  items  on  trust  in  government. 
Alternative  scales  such  as  Dean’s  scale  of 
normlessness  and  McClosky  and  Schaar’s  scale  of 
anomie  were  also  used  in  empirical  research. 

4.  Self  estrangement  seen  as  a  failure  of  self- 
realization  is  the  classical  master  theme  of 


philosophical  work  on  alienation  and  originated 
in  the  Marxian  analysis  of  working  conditions 
and  relations.  Accordingly,  a  lack  of  intrinsic 
fulfillment  in  work  or  an  incapability  of  controlling 
one’s  working  conditions  is  regarded  as  the  core 
element  of  self-estrangement.  In  widening  this 
narrow  conceptualization,  some  scholars  proposed 
regarding  self-estrangement  as  an  individual’s 
engagement  in  activities  that  are  not  appreciated 
as  intrinsically  rewarding  or  are  considered 
inauthentic.  In  empirical  political  science  research, 
this  dimension  is  not  particularly  important. 

5.  Cultural  estrangement  overlaps  to  a  certain 
degree  with  normlessness  and  self-estrangement.  It 
means  the  individual’s  rejection  of  commonly  held 
values  in  society,  while  normlessness  is  seen  as  a 
rejection  of  more  specific  social  norms.  Again,  the 
concept  does  not  play  a  major  role  in  empirical 
research,  despite  the  important  role  attributed  to 
value  consensus  as  a  source  of  social  and  political 
integration. 

6.  Social  isolation,  as  the  sixth  subdimension  of 
alienation,  was  originally  seen  as  a  lack  of  social 
relationships  rather  than  as  a  mental  state.  In  prac¬ 
tice,  however,  both  aspects  cannot  really  be  sepa¬ 
rated  from  each  other.  Lack  or  loss  of  social  ties 
does  not  become  a  serious  problem  for  individuals 
unless  it  will  be  perceived  and  assessed.  Thus, 
social  isolation  signifies  not  only  a  whole  set  of 
behaviors  and  attitudes,  a  sense  of  being  excluded 
or  lonely,  a  lack  of  identity,  a  feeling  of  being 
rejected  or  discriminated  as  a  member  of  a  particu¬ 
lar  social  group  but  also  deviating  behavior  and 
lack  of  integration  into  social  networks.  Social 
isolation  is  used  as  an  explanatory  concept  in  some 
community  studies,  but  it  is  clearly  less  influential 
in  empirical  research  than  powerlessness  and 
normlessness. 

Empirical  social  science  research  on  alienation 
reached  a  point  of  culmination  in  the  late  1960s 
and  early  1970s  when  it  was  mainly  embedded  in 
two  specific  research  contexts.  The  first  one  was 
the  theory  of  mass  society,  which  was  a  fashion¬ 
able  idea  to  many  social  scientists  in  that  period. 
The  second  impetus  came  from  empirical  research 
on  political  protest  activities  and  movements, 
which  used  the  notions  of  alienation,  cynicism, 
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and  subjective  deprivation  as  the  main  explanatory 
concepts.  The  approach  to  alienation  in  analyses 
of  mass  society  was  far  broader  than  in  empirical 
research  on  political  protest.  As  outlined  by 
Seeman,  the  general  research  question  of  the  for¬ 
mer  referred  to  the  relationship  of  social  change  to 
political  alienation,  while  the  latter  focused  on 
various  forms  of  behavior  induced  by  feelings  of 
alienation.  An  integration  of  these  three  sets  of 
variables  (structure,  attitude,  behavior)  was  largely 
missing  in  empirical  research  on  alienation.  An 
overview  of  the  assumed  links  between  the  respec¬ 
tive  variables  is  outlined  in  Table  1. 

Even  if  we  agree  to  the  idea  of  an  impact  of  social 
change  (particularly  rapid  and  deep  social  changes) 
on  the  individual’s  perception  of  society  and  his  or 
her  own  place  in  it  as  well  as  to  the  view  of  alien¬ 
ation  as  a  source  of  deviant  behavior,  the  specific 
hypotheses  implied  in  Seeman’s  scheme  are  far  from 
being  conclusive.  Political  research  focused  on  a 
smaller  set  of  behaviors  as  well  as  attitudes.  A  first 
line  of  empirical  research  evolved  in  analyses  of  par¬ 
ticipation  in  civil  rights  protest  movements  and  thus 
can  be  interpreted  as  an  extension  of  classical  studies 
of  the  behavioral  impact  of  deprivation.  Other 
research  focused  on  the  observation  of  an  increase  of 
political  protest  in  affluent  societies  and  on  political 
protest  as  a  means  of  privileged  groups.  The  under¬ 
lying  assumption  was  that  objective  deprivation  is 
not  the  only  determinant  of  protest,  even  violent 
protest;  negative  feelings  toward  the  political  environ¬ 
ment  may  also  be  a  prominent  source  of  protest  activi¬ 
ties.  Two  dimensions  of  alienation,  powerlessness 
and  normlessness,  became  particularly  prominent  in 


empirical  research.  According  to  the  efficacy-distrust 
hypothesis,  lack  of  powerlessness  (efficacy)  is  a  nec¬ 
essary  condition  for  any  type  of  political  activity, 
since  the  powerless  will  generally  abstain  from  par¬ 
ticipation  irrespective  of  its  specific  form.  Whether 
the  powerful  or  efficacious  will  use  protest  or  con¬ 
ventional  means  of  exerting  influence  will  depend  on 
the  presence  or  absence  of  normlessness  (distrust). 
While  the  combination  of  efficacy  and  distrust  leads 
to  (illegal)  protest,  the  efficacious  and  trusting  use 
conventional  forms  of  political  participation  such  as 
voting,  becoming  active  in  parties,  and  so  on. 
However,  the  empirical  evidence  on  the  efficacy- 
distrust  hypothesis  is  not  convincing,  as  is  the  case 
for  the  impact  of  alienation  in  general. 

In  assessing  the  state  of  research  on  political  alien¬ 
ation,  Seeman  came  to  a  rather  pessimistic  conclu¬ 
sion  that  still  holds  true  today.  According  to  him, 
research  was  often  characterized  by  ideological 
convictions  rather  than  by  conceptual  clarity  and 
imbalanced  by  dealing  mainly  with  powerlessness, 
normlessness,  and  social  isolation  and  by  neglecting 
the  other  components.  The  theoretical  underpin¬ 
nings  of  empirical  research  on  political  alienation 
have  remained  weak  so  far,  the  constructs  have  been 
used  in  an  inconsistent  manner,  the  measures  were 
poor,  and  the  empirical  findings  contradictory. 

Oscar  W.  Gabriel 
Universitat  Stuttgart 
Stuttgart,  Germany 

See  also  Apathy;  Efficacy,  Political;  Participation, 
Contentious;  Social  Movements 


Table  I  Factors,  Forms,  and  Consequences  of  Alienation 


Contemporary  Structural  Trends 

Forms  of  Alienation 

Behavioral  Consequences 

Kinship  to  impersonality 

Powerlessness 

Political  passivity 

Traditional  to  rational  forms 

Meaninglessness 

Wild  strikes 

Homogeneity  to  heterogeneity 

Normlessness 

Mass  movements 

Stability  to  mobility 

Value  isolation  (cultural 
estrangement) 

Ethnic  prejudice 

Enlargement  of  scale 

Self-estrangement 

Mental  disorder 

School  Absenteeism 

Social  isolation 

School  absenteeism 

Low  information  level 

Suicide 
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Alliances  are  among  the  most  studied  aspects  of 
international  cooperation  and  play  important 
roles  in  most  broad  theories  of  international  rela¬ 
tions,  particularly  theories  of  war  and  peace.  They 
are  an  important  component  of  the  security  poli¬ 
cies  of  most  states  in  the  international  system 
today.  This  entry  first  defines  and  describes  alli¬ 
ances,  then  discusses  the  motivations  of  states  to 
form  alliances  and  the  effects  of  alliances  on  inter¬ 
national  outcomes. 

An  alliance  is  a  formal  agreement  among  inde¬ 
pendent  states  in  the  international  system  to  coop¬ 
erate  militarily  in  the  event  of  militarized  conflict 
with  outside  parties.  Alliances  are  distinguished 
from  informal  alignments  by  their  codification  in  a 
written  document;  while  states  that  share  interests 
and  tend  to  coordinate  behavior  may  be  termed 
aligned,  only  states  that  have  made  an  international 
legal  commitment  to  assist  one  another  are  allied. 
Alliances  are  distinguished  from  protectorates,  uni¬ 
lateral  guarantees,  and  colonial  relationships  by 
their  joint  commitments;  all  member  states  retain 
sovereignty  and  independence  but  commit  to  adjust 
their  policies  to  meet  allied  goals.  Commitments 
need  not  be  symmetrical,  but  all  alliance  members 
must  make  some  cooperative  commitment.  Finally, 
alliances  involve  promises  of  assistance  in  the  event 
of  military  conflict  with  outside  parties.  Thus,  they 
are  distinguishable  from  other  forms  of  military 
cooperation  such  as  arms  sales  agreements  and 
intelligence-sharing  agreements . 

Formation  and  Provisions  of  Alliances 

Most  alliances  are  formed  through  treaties.  The 
specific  promises  made  in  these  treaties  can  vary 


significantly.  Some  alliances  commit  the  signato¬ 
ries  to  assist  one  another  in  the  event  a  member 
state  is  attacked;  these  are  often  referred  to  as 
defense  pacts.  An  example  of  a  defense  pact  is  the 
North  Atlantic  Treaty  Organization  (NATO). 
Other  alliances  provide  for  states  to  assist  one 
another  in  accomplishing  offensive  goals,  usually 
in  addition  to  defensive  promises.  The  Pact  of 
Steel,  signed  by  Germany  and  Italy  in  1939,  is  an 
example  of  this  sort  of  alliance.  Many  alliances, 
however,  fall  short  of  guaranteeing  active  partici¬ 
pation  in  conflicts  that  may  arise  and  instead  com¬ 
mit  the  member  states  to  remain  neutral  and  pro¬ 
vide  no  support  for  the  adversary  of  an  ally  in  the 
event  the  ally  becomes  involved  in  conflict  or  com¬ 
mit  the  members  to  consult  in  the  event  of  threat 
and  make  every  attempt  to  produce  a  coordinated 
response,  without  any  specificity  about  what  that 
coordinated  response  might  be.  The  Molotov- 
Ribbentrop  Pact  (also  known  as  the  Nazi-Soviet 
Nonaggression  Pact)  signed  in  1939  is  an  example 
of  a  neutrality  pact,  and  the  Treaty  of  Good 
Neighborly  Friendship  and  Cooperation  signed  by 
Russia  and  China  in  2001  is  an  example  of  a  con¬ 
sultation  pact  (also  known  as  an  entente). 

In  many  cases,  alliance  treaties  also  specify  the 
conditions  under  which  the  obligations  come  into 
force.  Rather  than  applying  to  any  military  con¬ 
flict,  an  alliance  may,  for  instance,  apply  only  to 
conflicts  with  specific  adversaries,  in  specific  loca¬ 
tions,  or  in  conjunction  with  a  specific  dispute. 
The  NATO  treaty,  for  instance,  specifies  that  it 
applies  only  to  the  home  territories  of  members 
and  does  not  commit  members  to  defend  their 
allies’  colonial  possessions.  Defense  pacts  signed 
by  France  with  both  Poland  and  Czechoslovakia  at 
Locarno,  Switzerland,  in  1925  commit  the  states 
to  assist  one  another  only  in  the  event  of  unpro¬ 
voked  attack  by  Germany.  And  some  treaties  (for 
instance,  the  Convention  of  London  signed  by 
France,  the  United  Kingdom,  and  Italy  during  the 
Crimean  War)  are  limited  to  a  specific  ongoing 
conflict,  committing  the  states  to  work  together 
only  through  the  conclusion  of  the  current  war. 

Alliances  also  vary  in  the  level  of  peacetime 
investment  and  coordination  they  require.  While 
some  alliance  treaties  only  require  action  in  the 
event  conflict  emerges,  others  provide  for  the 
development  of  organizations  and  military  integra¬ 
tion  during  peacetime,  for  one  state  to  place  troops 
on  the  territory  of  an  ally  during  peacetime,  and/or 
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for  shared  development  of  defense  and  foreign 
policy.  In  addition,  alliances  vary  in  size  and  length 
of  term.  Some  alliances  are  bilateral  agreements, 
while  others  are  large  multilateral  entities.  Some 
alliances  are  formed  for  temporary  purposes,  for 
instance,  for  the  duration  of  an  ongoing  war; 
whereas  others  are  designed  to  last  for  long  peri¬ 
ods  of  time,  for  instance,  20  years,  with  provisions 
allowing  for  renewal. 

Purposes  of  Alliances 

So  why  do  states  form  alliances?  The  most  basic 
reason  is  that  by  joining  their  military  forces  with 
those  of  other  states,  they  become  stronger  fighting 
forces.  Given  that  the  international  system  is  anar¬ 
chic,  all  bargaining  among  states  takes  place  in  the 
shadow  of  the  use  of  force.  States  that  can  muster 
more  military  force  can  retain  their  security  and 
compel  concessions  from  others.  States  maintain 
their  security  by  deterring  threats  against  them, 
and  successful  deterrence  requires  convincing 
potential  adversaries  that  pursuing  gains  at  a 
state’s  expense  through  the  use  of  force  would  be 
unsuccessful  and/or  prohibitively  costly  due  to  the 
state’s  ability  to  wage  a  successful  war.  Similarly, 
states  can  receive  concessions  from  other  states  by 
convincing  those  states  that  resisting  demands  will 
be  unsuccessful  and/or  prohibitively  costly  due  to 
the  superior  military  power  of  the  challenging  state 
or  coalition.  In  the  event  that  deterrence  or  com- 
pellence  fails,  and  states  find  themselves  at  war, 
they  are  more  likely  to  win  the  war  with  allied 
support  than  without. 

Alternatively,  states  may  use  alliances  to  reduce 
their  individual  defense  burdens.  To  the  extent 
that  economies  of  scale  exist  in  the  provision  of 
defense,  states  might  find  it  more  efficient  to  com¬ 
bine  their  defense  preparations  with  other  states 
rather  than  bear  the  full  burden  of  defense  provi¬ 
sions  on  their  own.  For  instance,  many  states 
found  it  more  efficient  to  rely  on  the  U.S.  or  the 
(former)  USSR  nuclear  umbrellas  during  the  Cold 
War  rather  than  to  develop  their  own  nuclear 
weapons  programs.  Similarly,  states  may  have  dif¬ 
ferent  comparative  advantages  in  defense  (e.g., 
land  power  vs.  sea  power)  and  benefit  from  com¬ 
bining  their  efforts. 

Interestingly,  however,  many  alliances  are 
formed  between  strong  states  and  weaker  states. 
Since  weaker  states  usually  cannot  offer  much 


increased  military  power  to  strong  states  (with  the 
exception  of  a  few  weak  states  that  occupy  strate¬ 
gically  important  geographic  locations),  one  might 
reasonably  ask  why  strong  states  would  see  benefit 
in  allying  with  weaker  states.  Two  explanations 
(which  are  probably  both  accurate)  have  been  pro¬ 
posed.  One  is  that  strong  states  have  a  self-interest 
in  the  security  of  some  weaker  states  due  to  factors 
such  as  valuable  economic  relationships  or  rival¬ 
ries  with  other  major  powers  and  are  willing  to 
bear  the  costs  of  defending  those  states.  The  other 
is  that  weaker  states  are  willing  to  offer  stronger 
states  other  advantages — for  instance,  foreign 
policy  control,  use  of  their  territory  and  resources, 
or  compliance  on  particular  issues  of  interest  to 
stronger  states — in  return  for  an  alliance. 

Implementation  and  Impacts  of  Alliances 

Yet  given  that  there  is  no  institution  to  provide 
external  enforcement  of  contracts  in  the  interna¬ 
tional  system,  one  might  wonder  why  states  believe 
that  alliances  will  be  reliable.  How  can  state  lead¬ 
ers  be  assured  that  their  allies  will  not  abandon 
them  in  the  event  of  attack?  In  fact,  this  is  a  sig¬ 
nificant  concern  and  almost  assuredly  prevents 
some  potentially  beneficial  alliances  from  occur¬ 
ring;  states  should  be  reluctant  to  depend  too  heav¬ 
ily  on  other  states  to  guarantee  their  security  and 
when  deciding  to  form  alliances  must  consider  the 
incentives  that  their  allies  will  have  for  fulfilling 
their  alliance  commitments  in  the  event  of  conflict. 
That  being  said,  many  allies  do  have  incentive  to 
work  with  their  partners  due  to  shared  interests, 
and  the  formalization  of  cooperation  in  an  alli¬ 
ance  treaty  enhances  the  probability  of  future 
joint  action.  Forming  and  institutionalizing  an 
alliance  are  costly.  Not  only  do  states  have  to 
negotiate  the  agreement,  they  then  have  to  imple¬ 
ment  military  coordination  clauses  and  coordinate 
their  foreign  policies  to  make  the  alliance  credible. 
In  addition,  violating  a  previous  commitment  can 
have  negative  repercussions  for  the  international 
reputation  of  a  state  and  the  domestic  reputation 
of  a  leader.  Thus,  many  scholars  believe  that  states 
that  invest  in  forming  alliances  have  increased 
incentive  to  fulfill  them  in  the  event  that  alliances 
are  invoked. 

So  how  do  alliances  affect  outcomes  in  the  inter¬ 
national  system?  The  strongest  effects  of  alliances 
are  on  the  probability  that  militarized  disputes 


62  Analysis  of  Variance 


occur  and  the  probability  that  wars  expand. 
Alliances  affect  the  probability  that  states  chal¬ 
lenge  the  current  status  quo  and  make  threats 
involving  the  use  of  military  force.  States  with 
allies  are  less  likely  to  find  themselves  challenged 
because  potential  adversaries  understand  that  to 
compel  compliance  would  involve  facing  a  joint 
military  effort  involving  the  target  and  its  allies. 
On  the  other  hand,  states  that  have  allies  commit¬ 
ted  to  help  them  may  be  more  willing  to  challenge 
the  status  quo  and  threaten  the  use  of  force  in  the 
expectation  that  their  targets  will  concede  their 
demands. 

Yet,  while  defensive  alliances  may  deter  the  ini¬ 
tiation  of  disputes,  when  deterrence  fails,  states 
with  allies  may  be  more  willing  to  resist  a  chal¬ 
lenger’s  demands  and  take  the  dispute  to  war.  If 
this  happens,  the  war  is  unlikely  to  remain  a  bilat¬ 
eral  affair.  Alliances  tend  to  diffuse  wars  beyond 
their  initial  participants  and  create  larger,  more 
severe  conflicts.  Because  defensive  alliances  both 
deter  disputes  and  make  it  more  likely  that  failures 
of  deterrence  result  in  particularly  large  and  severe 
wars,  it  is  not  easily  apparent  whether  alliances 
have  an  overall  dampening  effect  on  international 
conflict. 

Many  scholars  claim,  however,  that  alliances 
have  spillover  effects  on  other  kinds  of  interna¬ 
tional  cooperation.  Some  argue,  for  instance, 
that  allies  tend  to  trade  more  with  one  another, 
that  allies  are  more  likely  to  settle  disputes 
among  themselves  peacefully,  and  that  institu¬ 
tions  initially  formed  to  support  alliances  become 
useful  for  a  wide  range  of  other  cooperative 
activities  as  well.  A  large  number  of  alliance  trea¬ 
ties  include  specific  provisions  for  nonmilitary 
cooperation  in  addition  to  military  cooperation. 
Thus,  alliances  may  have  an  indirect  pacifying 
effect  as  well. 

In  conclusion,  scholars  today  have  considerable 
agreement  on  a  definition  for  alliances,  on  how 
alliances  can  be  distinguished  from  other  relation¬ 
ships,  and  on  the  varying  reasons  why  states  form 
alliances.  Research  continues  in  an  attempt  to 
specify  more  precisely  the  full  range  of  effects  of 
alliances  on  international  outcomes. 

Brett  Ashley  Leeds 
Rice  University 
Houston,  Texas,  United  States 
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Analysis  of  Variance 


Analysis  of  variance  (ANOVA)  is  a  method  for 
decomposing  variance  in  a  measured  outcome 
into  variance  that  can  be  explained,  such  as  by  a 
regression  model  or  an  experimental  treatment 
assignment,  and  variance  that  cannot  be 
explained,  which  is  often  attributable  to  random 
error.  Using  this  decomposition  into  component 
sums  of  squares,  certain  test  statistics  can  be  cal¬ 
culated  that  can  be  used  to  describe  the  data  or 
even  justify  model  selection.  Lab  experiments 
have  become  increasingly  popular  in  political 
science,  and  ANOVA  is  a  useful  tool  for  analyz¬ 
ing  such  experiments.  In  recent  years,  there  have 
been  a  number  of  laboratory  experiments  on  the 
effects  of  campaigning  and  media  advertising. 
Nicolas  Valentino,  Vincent  Hutchings,  and 
Ismail  White  (2002),  Diana  Mutz  and  Byron 
Reeves  (2005),  and  Ted  Brader  (2005)  have  all 
performed  lab  experiments  that  aim  to  determine 
the  effect  that  campaigning  and  advertising  in 
media  have  on  voters  views  and  decisions.  All 
three  experiments  employ  an  ANOVA  to  control 
for  observable  characteristics,  interactions 
between  treatment  regimes,  and  significance  of 
the  relative  effectiveness  of  treatments.  This 
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entry  discusses  ANOVA  and  its  applications  in 
greater  detail. 

In  the  familiar  regression  context,  the  “sum  of 
squares”  (SS)  can  be  decomposed  as  follows. 
Assuming  that  Y.  is  individual  z’s  outcome,  Yis  the 
mean  of  the  outcomes,  Y,  is  individual  /’ s  fitted 
value  based  on  the  ordinary  least  squares  (OLS) 
estimates,  and  e,  is  the  resulting  residual, 

N  N  ,  N 

Z  (r  -y)  =  S(t,-Y)-+  z  (<•?), 

f  =  1  i  =  1  i  =  1 

where 


assigns  observations  to  one  of  I  treatment 
assignments.  Each  treatment  assignment  has 
J  observations  assigned  to  it.  In  the  case  of  a 
randomized  controlled  trial  with  only  one  treat¬ 
ment  regime  and  N  subjects  randomly  assigned 
to  treatment  with  half  a  probability,  this  would 
mean  that  1  =  2,  one  treated  group  and  one  con¬ 
trol  group,  where  each  group  has  size  J.  In  this 
framework,  the  variance  decomposition  would 
be  as  follows: 

Z  Z  E(Y,-Yf 

i  =  lj  =  1  j  =  li  =  1 

+  ee  (y,i-n2, 

i  =  i ;  =  i 


SStotal  =  t  (Y,  -  Y)2 

where 


is  the  total  sum  of  squares, 


SS 


regression 


N  „  _  2 

I  (Y,-Y) 


1  1 

Y,  =  j  I 

is  defined  as  the  average  response  under  the  7th 
treatment  and 


refers  to  the  variance  explained  by  the  regression, 
and 


Y 


1 

U, 


i 

I 


E^ 


SSerror  =  t  (ef) 

i  =  1 


is  the  variance  due  to  the  error  term,  also  known 
as  the  unexplained  variance.  Commonly,  we  would 
write  this  decomposition  as 


SS 


total 


SS 


regression 


+  SS 


error  • 


The  equations  above  show  how  the  total  vari¬ 
ance  in  the  observations  can  be  decomposed 
into  variance  that  can  be  explained  by  the 
regression  equation  and  variance  that  can  be 
attributed  to  the  random  error  term  in  the 
regression  model. 

ANOVA  is  not  restricted  to  use  with  regres¬ 
sion  models.  The  concept  of  decomposing  vari¬ 
ance  can  be  applied  to  other  models  of  data,  such 
as  an  experimental  model.  The  following  is  the 
decomposition  of  a  one-way  layout  experimental 
design  in  which  an  experimenter  randomly 


is  defined  as  the  overall  average  of  all  observations, 
regardless  of  treatment  assignment.  Commonly, 
this  sum  of  squares  expression  is  written  as 


SS 


total 


SS 


between 


+  SS 


within  i 


where  SSbetWeen  refers  to  the  part  of  the  variance 
that  can  be  attributed  to  the  different  treatment 
assignments  and  YS'wlthin  refers  to  the  variance  that 
can  be  described  by  the  random  error  within  a 
treatment  assignment.  From  this,  we  can  see  that 
SSbetween  and  SSiegKSSion,  from  the  regression 
framework,  both  refer  to  the  explained  variance. 
SSwlthin  and  SSmot  both  refer  to  the  unexplained 
variance. 

Typically  an  ANOVA  table  is  constructed  to 
describe  the  variance  decomposition.  In  the  regres¬ 
sion  context,  where  p  is  defined  as  the  number  of 
independent  regressors  and  n  is  the  number  of 
observations,  the  ANOVA  table  typically  looks 
like  this: 
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Source 

df 

SS 

MS 

F 

Regression 

P 

SS 

"-'‘-'regression 

‘-^regression  /  P 

‘-^regression  /  P 

SStot3\/n  -  1 

Error 

n-p  -  1 

SS 

‘-'‘-'error 

r-H 

1 

1 

of 

03 

Total 

n  —  1 

‘-^total 

This  table  gives  us  an  idea  as  to  how  to  break 
down  our  analysis.  The  column  df  refers  to  the 
degrees  of  freedom.  In  the  regression  framework, 
the  degrees  of  freedom  for  the  regression  is  the 
number  of  parameters  in  the  regression  equation. 
The  degrees  of  freedom  for  SSemi  is  n  -  p  -  1.  The 
column  MS  refers  to  the  mean  squared  error, 
which  is  defined  as  SS/df  for  each  row  in  the  table. 
If  we  were  in  the  experimental  one-way  layout 
design,  then  the  first  row  would  refer  to  the 
between  variance,  and  the  number  of  treatments 
less  one,  that  is,  (7-1)  for  I  treatments.  The 
degrees  of  freedom  for  error  is  defined  as  the 
number  of  treatments  times  the  number  of  trials  in 
each  treatment  less  one,  or  /(/  -  1)  for  J  trials  in 
each  treatment. 

The  above  table  also  contains  a  column 
called  F,  which  refers  to  the  F-test.  The  F-test  is 
a  way  of  using  ANOVA  to  determine  if  all  the 
regressors  in  a  regression  equation  are  jointly 
zero.  In  a  one-way  experimental  analysis,  the 
F-test  determines  if  the  means  of  the  treatment 
groups  are  significantly  different.  If  we  have 
more  than  one  treatment  and  one  control 
group,  then  the  F-test  is  applied  to  see  if  any 
treatments  are  significantly  different  from  zero. 
The  null  for  an  F-test  is  that  all  coefficients  in 
our  model  are  jointly,  not  statistically,  distin¬ 
guishable  from  zero.  The  F-test  is  defined  as 
follows: 


F  = 


SS, 


regression 


Ip 


SStrror/n  -  p  -  1 ' 


Using  the  ANOVA  table,  we  can  also  determine 
the  R 2  value  of  our  treatment  or  model.  We  define 
R 2  as  follows: 


_  '-'‘-'regression  ^  ‘-'‘-'error 


SS, 


total 


SS, 


total 


The  R1  refers  to  how  much  of  the  variance  is 
explained  by  the  model.  A  high  R1  value  means 
that  much  of  the  variation  is  explained  by  the 
model,  implying  that  the  model  fits  the  data  well 
and  that  little  of  the  variance  is  explained  by  the 
random  error  term.  In  the  experimental  framework, 
a  high  R2  value  means  that  much  of  the  variation 
is  explained  by  the  treatment  assignment,  and  little 
of  the  variance  is  due  to  random  error  within  those 
treatment  assignments. 

Model  Selection  and  Analysis  of  Variance 

ANOVA  is  generally  used  with  linear  regression  to 
assess  model  selection.  When  selecting  the  best 
model,  we  seek  to  strike  a  balance  between  good¬ 
ness  of  fit  and  parsimony.  If  two  models  fit  the 
data  equally  well,  the  model  selected  should 
include  only  those  explanatory  variables  that 
explain  a  significant  degree  of  the  variance  in  the 
response  variable.  The  question  is  how  to  distin¬ 
guish  between  important  and  trivial  variables  in  a 
way  that  is  systematic.  ANOVA  is  one  method  for 
identifying  the  parameters  of  interest.  It  is  impor¬ 
tant  to  stress  that  ANOVA  makes  all  the  assump¬ 
tions  made  by  normal  linear  regression. 
Furthermore,  in  general  applications  of  ANOVA, 
all  the  explanatory  variables  must  be  mutually 
orthogonal,  although  in  some  limited  cases  this 
orthogonality  is  not  necessary  to  make  a  reason¬ 
able  justification  for  model  choice.  To  determine 
which  covariates  are  important  for  the  regression 
model,  ANOVA  can  be  run  multiple  times  in  suc¬ 
cession  to  determine  if  adding  an  additional 
covariate  contributes  any  more  to  the  explained 
variance. 
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Consider  two  normal  linear  models: 


y  =  a  +  PjXi  +  32x2  +  8, 

(1) 

y  —  a  +  +  s, 

(2) 

where  x1  and  x2  are  explanatory  variables,  a  is 
the  intercept,  and  (3j  and  p2  are  the  parameters 
of  interest.  The  second  is  obviously  a  simpler 
version  of  the  first.  We  can  think  of  Model  2  as 
the  version  of  Model  1,  in  which  p,  is  restricted 
to  zero.  For  this  reason,  we  often  refer  to  Model 
1  as  the  unrestricted  model  and  Model  2  as  the 
restricted  model.  The  question  is  which  model  is 
better.  If  the  restricted  model  fits  the  data 
equally  well,  then  adding  complexity  does  not 
improve  the  accuracy  of  the  estimation 
significantly  and  the  simpler  (restricted)  model  is 
preferable. 

In  model  selection,  ANOVA  analyzes  the 
degree  to  which  residual  variance  changes  with 
the  addition  of  explanatory  variables  to  the  basic 
model.  Note  that  the  vector  of  residuals  for  the 
restricted  model  (where  P2  =  0)  can  be  broken 
into  two  components: 

y  ~  9i  =  {y  ~  Ji}  +  {y2  ~  yJ 

yt  =  a.  +  3,Xi  5h  =  a  +  PjXi  +  p2x2 

Thus,  the  vector  of  residuals  for  the  restricted 
model  consists  of  a  vector  of  the  residuals  for  the 
unrestricted  model  plus  the  residual  difference 
between  the  two  models.  By  construction  of  OLS, 
the  vectors  (y  ~  y2)  and  (y2  —  y\)  are  orthogonal, 


and  Pythagoras’s  theorem  implies  that  the  sum  of 
squares  for  the  restricted  model  is  just  the  sum  of 
squares  for  the  unrestricted  model  plus  the 
difference  in  the  sum  of  squares  for  the  two 
models,  or  equivalently, 

SSPl  =  SSPl,fe  +  {SSfij  -  SSMl}. 

While  adding  complexity  reduces  the  amount  of 
unexplained  variance  in  the  residuals,  it  also 
reduces  the  degrees  of  freedom.  This  trade-off 
motivates  the  principal  of  parsimony  in  model 
selection.  Under  the  assumptions  of  OLS,  SSpj  and 
SSPl  ,p2  are  mutually  independent  and  have  a  chi- 
square  distribution.  The  F-test  is  therefore  the 
appropriate  test  to  determine  whether  the  degree  to 
which  inclusion  of  each  additional  explanatory 
variable  in  the  model  improves  the  precision  of 
estimation.  In  this  case,  the  F-test  would  look  as 
follows: 

p  _  {S-Sp,  -  SSp^p,}/ p  -  q 

SSpi  ,p2/ (P  ~  <7  “  1) 

where  p  and  q  represent  the  number  of  parameters 
in  the  unrestricted  and  restricted  model, 
respectively  (excluding  the  intercept).  Under  the 
null  hypothesis,  the  unrestricted  model  does  not 
provide  a  significantly  better  fit  than  the  restricted 
model;  reject  the  null  hypothesis  if  the  F  calculated 
from  the  data  is  greater  than  the  critical  value  of 
the  F  distribution  with  (p  -  q,  n  -  p)  degrees  of 
freedom.  The  models,  their  sum  of  squares,  mean 
square,  and  F-test  can  be  displayed  in  an  ANOVA 
table: 


Fitted  Model 

df 

ASS 

MS 

F 

9i  =  a 

1 

SS„mI  -  SSa 

SSerror  ~  SSA 
n  —  (n  —  1) 

sscnoi  -  ssa 

SS&/(n  -  1) 

9i  ~  a  +  Pi*! 

2 

SSA  -  SS Pj 

SSk  - 

n  —  1  —  (n  —  2) 

SS&  -  SSpj 

SSj(n  -  2) 

y\  =  Oi  +  PjXj  +  P  2X2 

3 

-  SSMl 

SSpj  -  SSp,  p2 

n  —  2  —  (n  —  3) 

-  5Spj,p2 

Total 

4 

SS 

terror 

SSami/(n  -  3) 
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Often  the  results  of  an  ANOVA  table  are  used 
to  justify  the  inclusion  of  each  individual  variable 
in  the  model.  As  the  model  expands  from  p  to 
p  +  1  explanatory  variables,  the  F- test  evaluates 
the  hypothesis  that  the  parameters  (3,, .. ,  =  0,  given 
the  assumptions  of  the  model  are  satisfied.  If  the 
explanatory  variables  that  constitute  the  design 
matrix  are  all  mutually  orthogonal  and  we  have  the 
correct  model,  then  the  ANOVA  results  can  be 
used  to  determine  whether  the  inclusion  of  xp+1 
significantly  increases  the  fitness  of  the  model. 
Without  orthogonality,  however,  we  do  not  know 
if  the  order  in  which  the  variables  are  added  mat¬ 
ters.  As  successive  variables  are  added  from  the 
model,  only  the  variance  of  the  part  of  the  variable 
that  is  orthogonal  to  the  previously  included  vari¬ 
ables  in  the  model  is  removed  from  the  variance  of 
the  error. 

This  entry  has  discussed  the  definition  of 
ANOVA  and  how  it  is  often  applied  to  regression 
and  experimental  data.  ANOVA  is  a  decomposi¬ 
tion  of  variance  into  component  parts.  There  is 
variance  that  is  attributable  to  a  model,  such  as  a 
regression  model  or  an  experimental  treatment, 
and  variance  that  is  attributable  to  random  error. 
ANOVA  is  often  used  to  construct  an  ANOVA 
table,  which  succinctly  presents  the  variance 
decomposition.  This  method  can  also  be  used  to 
justify  regression  model  selection  where  the  goal  is 
to  find  parsimony  between  fit  and  degrees  of  free¬ 
dom.  ANOVA  can  be  used  to  determine  how  much 
extra  variance  a  marginal  explanatory  variable 
explains  while  also  weighing  the  loss  of  a  degree  of 
freedom.  It  is  important  to  note,  however,  that  the 
order  in  which  variables  are  added  to  a  model  is 
important  in  these  tests  unless  the  variables  are 
orthogonal  to  one  another.  The  decomposition  of 
variance  using  the  analysis  of  variance  is  a  power¬ 
ful  tool  for  describing  data  and  the  fit  of  a  model. 

Erin  Hartman  and  Adrienne  Hosek 
University  of  California,  Berkeley 
Berkeley,  California,  United  States 

See  also  Quantitative  Methods,  Basic  Assumptions; 
Regression 
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Analytic  Narratives: 
Applications 


Often  derided  by  colleagues  for  “possessing  an  N  of 
1,”  those  who  focus  on  case  studies — Germany 
between  the  wars,  France  during  the  revolution,  or 
Kasumpa  village  (see  below) — nonetheless  aspire  to 
be  social  scientists.  While  their  commitment  to  field¬ 
work,  archival  research,  and  qualitative  methods 
runs  deep,  so  too  does  their  commitment  to  science. 
Analytic  narratives  offer  a  means  for  reconciling 
this  apparent  contradiction.  Avner  Greif,  Margaret 
Levi,  Jean-Lauran  Rosenthal,  Barry  Weingast,  and 
Robert  Bates  formulated  the  approach  while  focus¬ 
ing  on  historical  cases.  This  entry  illustrates  the 
approach  by  reviewing  its  use  in  a  study  of  Kasumpa 
village  in  the  Luapula  Valley  of  Zambia. 

In  this  essay,  “fieldwork”  is  defined  as  immer¬ 
sion  in  the  lives  of  people  who  remain  resident  in 
their  own  social  setting,  while  observing  and 
recording  their  behavior  and  discussing  with 
them  their  actions  and  the  values  and  beliefs  that 
shape  it.  This  also  includes,  of  course,  knowledge 
of  the  local  language.  “Science”  refers  to  the 
attempt  to  derive  valid  explanations,  which 
implies  two  things:  An  explanation  is  valid  if  it 
follows  logically  from  its  premises,  and  it  is  valid 
insofar  as  it  withstands  efforts  to  refute  it  through 
the  systematic  collection  and  analysis  of  data. 
The  best  test  of  logical  validity  is  formalization; 
that  of  empirical  validity  is  the  use  of  rigorous 
methods. 

Taken  together,  these  clarifications  highlight  a 
key  feature  of  the  agenda  that  underlies  this 
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method:  the  treatment  of  qualitative  research,  for¬ 
mal  theory,  and  empirical  methods  as  complemen¬ 
tary  rather  than  rival  approaches  to  social  research. 

Illustration 

Rural  Responses  to  Industrialization:  A  Study  of 
Village  Zambia  by  Bates  reports  on  a  study  of 
Kasumpa  village,  a  settlement  in  the  Luapula 
Valley  that  borders  Zambia  and  Congo  and  forms 
a  portion  of  the  hinterland  of  the  great  mining 
centers  of  those  two  countries.  The  author  of  this 
study  notes  how,  in  the  initial  stages  of  his  field¬ 
work,  he  sought  to  discern  the  values  held  by  the 
residents  of  the  village,  their  perception  of  the 
alternatives  available  to  them,  and  the  beliefs  that 
shaped  their  behavior.  The  residents  of  Kasumpa 
village,  he  learned,  quite  rightly  viewed  themselves 
as  poor  and  the  mining  towns  as  prosperous. 
Economic  life  revolved  around  the  export  of  pro¬ 
duce  and  labor  to  the  mining  towns;  political  life 
centered  on  efforts  to  persuade  the  government  to 
generate  money  and  jobs  by  investing  in  projects  in 
the  region.  The  residents  sought  to  use  the  markets 
for  produce  and  labor  to  gain  access  to  the  wealth 
of  the  mining  towns,  and  they  used  political  pro¬ 
test  to  levy  a  portion  of  it  for  themselves. 

Based  on  these  insights,  the  author  then  turned  to 
theory.  In  expositing  this  work,  this  essay  focuses 
on  his  treatment  of  migration,  a  subject  long  central 
to  research  in  the  region.  The  dominant  theory  of 
the  time,  developed  by  John  Harris  and  Michael 
Todaro,  viewed  migration  as  a  decision  made  by 
individuals  as  an  effort  to  maximize  their  expected 
incomes.  Several  implications  flowed  from  that  for¬ 
mulation,  and  the  author  randomly  selected  100  or 
so  out  of  around  1,000  villagers  to  test  them.  If  the 
theory  were  correct,  more  men  than  women  would 
migrate  to  seek  work  in  town,  because  there  were 
more  jobs  for  men  in  the  mines,  and  the  probability 
of  securing  a  job  was  therefore  greater  for  males 
than  for  females.  The  likelihood  of  departure  from 
the  village  should  be  higher  for  working-age  men: 
They  were  more  likely  to  secure  jobs  than  youths 
were  and  could  amortize  the  costs  of  job  searches 
better  than  old  men  could.  In  addition,  according  to 
this  theory,  when  people  left,  they  should  journey  to 
towns  where  others  had  settled  before  them,  thus 
lowering  the  costs  of  job  searches  and  increasing  the 
return  from  migrating.  The  demographic  structure 
of  the  village  should  therefore  be  distorted,  with  a 
gap  where  working-age  men  could  normally  be 


expected  and  a  disproportionate  number  of  females 
and  old  people  of  both  sexes.  An  implication  for 
politics  also  followed:  While  the  migrants  might 
choose  to  “exit”  as  a  way  of  escaping  poverty,  those 
who  remained  should  “give  voice,”  pressuring  the 
government  to  render  fishing  and  maize  growing 
more  profitable.  Phrased  another  way,  while 
migrants  might  employ  the  market  for  labor  as  an 
alternative  to  political  action,  those  who  remained 
in  the  village  should  treat  the  market  for  commodi¬ 
ties  as  a  complement  to  it. 

The  data  lend  support  to  each  implication,  with 
young  men  flocking  to  the  nearest  mines,  women 
and  older  men  tending  to  remain  behind,  and  fish¬ 
ermen  and  farmers  dominating  the  local  party 
branch  and  employing  their  positions  to  lobby  the 
government  for  loans  to  farmers  and  investments  in 
the  local  fisheries.  However,  while  collecting  the 
data,  the  researcher  gained  a  deeper  understanding 
of  the  migration  decision — one  that  suggested  that, 
in  fact,  the  expected  income  model  was  wrong. 
Migration,  it  became  clear,  involved  not  only  those 
who  chose  to  migrate  but  also  those  they  left  behind 
and,  in  particular,  their  elder  kin.  Nor  was  migra¬ 
tion  the  outcome  of  a  choice  made  at  one  point  of 
time;  rather,  it  was  a  portion  of  an  intertemporal 
sequence  of  moves  in  which  the  older  generation 
paid  for  the  costs  of  the  movement  of  younger, 
working-age  males  from  the  village  to  town.  Not 
only  did  they  pay  for  transport  and  advance  funds 
for  sustenance  during  the  initial  period  of  job 
search,  but  they  also  paid  for  the  costs  of  education 
so  that  the  youths  might  better  compete  for  jobs  in 
town.  It  was  made  clear  that  they  did  this  in  the 
expectation  of  later  receiving  remittances  from  the 
young,  thus  assuring  them  of  financial  support  in 
old  age.  In  effect,  rather  than  acting  as  individual 
agents,  the  residents  of  Kasumpa  village  acted  as 
members  of  families  and  organized  the  relationship 
between  generations  so  as  to  invest  in  the  formation 
of  human  capital.  By  responding  thus  to  the  oppor¬ 
tunities  offered  by  the  market  for  labor,  they  sought 
to  extract  income  from  the  towns. 

Stepping  back  for  a  moment,  note  the  sequence 
and  the  interplay  between  different  research  meth¬ 
ods.  First,  there  was  immersion  and  the  gathering  of 
qualitative  information,  then  the  formulation  of 
theory.  Next,  the  theory  was  tested,  using  survey 
methods  and  engaging  further  in  “soaking  and 
poking” — conversations  about  the  relationships 
between  parents  and  children,  discussions  about 
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family  life  in  the  United  States  and  Luapula  Valley, 
and  gossip  about  why  X  could  live  in  comfort 
(“great  kids”)  whereas  Y  lived  in  poverty  (“the 
way  he  neglected  his  children,  no  wonder”).  Then 
came  the  realization  that  the  theory  was  wrong. 
The  resultant  crisis  then  restarted  the  cycle  but  not 
at  a  random  starting  point.  Rather,  because  the 
failure  took  place  within  the  context  of  a  theory,  it 
highlighted  key  features  that  had  been  missed  the 
first  time  around. 

It  was  clear  that  the  decision  to  migrate  did  not 
result  from  a  comparison  between  rural  and  urban 
wage  rates,  made  by  individuals  who  sought  to 
maximize  expected  incomes,  as  the  Harris-Todaro 
model  might  suggest.  Rather,  it  resulted  from  the 
strategies  used  by  members  of  families  who  sought 
to  invest  by  forming  human  capital.  Indeed,  data 
from  the  surveys  confirmed  that  the  flow  of 
finances  between  generations  yielded  a  reasonable 
rate  of  return,  for  plausible  rates  of  discount.  But, 
the  researcher  realized,  if  this  revised  vision  of  the 
behavior  of  migrants  was  correct,  then  there  was  a 
lot  he  did  not  understand:  According  to  the  current 
theory,  what  was  being  observed,  in  fact,  should 
not  exist.  Less  dramatically,  it  suggested  that  the 
arrangement  was  highly  vulnerable  and  that  its 
persistence  suggested  that  additional  factors,  not 
yet  described,  must  be  at  play. 

The  theoretical  difficulty  came  from  two  fea¬ 
tures  of  the  behavior  of  the  families.  The  first  was 
that  it  involved  an  exchange;  the  second  was  that 
the  exchange  took  place  over  time.  The  elders 
expended  resources  on  the  young  at  one  point  in 
time  in  the  expectation  that  the  young  would  later 
repay.  But  the  preferences  of  the  young  should 
vary  over  time:  They  should  be  initially  willing  to 
pay  but  reluctant  to  do  so  when  the  bill  came  due 
later.  Given  this  variation  in  preferences,  the 
elders,  anticipating  the  later  actions  of  the  youths, 
should  not  be  willing  to  invest  in  the  first  place. 
The  (implicit)  contract  was  therefore  vulnerable  to 
the  problem  of  “time  consistency.”  And  yet  the 
elders  were  observed  investing  in  junior  kin. 

Highlighting  the  anomalous  nature  of  the  find¬ 
ing  was  that  local  conditions  reinforced  the  strength 
of  the  temptations  facing  the  young.  There  was  a 
long  lag  between  the  time  when  the  elders  bestowed 
resources  on  them  and  when  they  could  be  expected 
to  remit  money  from  town;  the  distance  between 
the  village  and  the  town  was  great  as  well,  making 


it  easy  for  the  young  to  dissimulate,  claiming  that 
the  loss  of  a  job,  illness,  or  unforeseen  expenses  in 
town  made  it  impossible  to  fulfill  their  obligations 
to  village  kin.  The  elders  were  of  course  aware  of 
the  impact  of  time  and  distance  on  the  ability  of  the 
young  to  elude  their  obligations.  And  yet  the  elders 
continued  to  use  their  families  in  order  to  invest. 
The  author  of  the  study  was  thus  driven  by  the 
failure  of  his  theory  to  engage  once  again  in  the 
lives  of  the  people  of  Kasumpa  and  to  seek  out 
what  was  being  missed — accounting  for  their  con¬ 
duct,  which  otherwise  appeared  so  rational. 

Once  again,  theory  had  failed.  However,  once 
again,  the  failure  proved  generative,  for  it  forced  a 
search  for  features  of  village  life  that  had  been 
overlooked  thus  far.  One  feature  was  that  the 
young  hoped  to  return  to  the  villages  and  depended 
on  their  families  when  seeking  to  do  so.  To  retire 
in  the  village,  a  worker  would  need  land  on  which 
to  build  a  house  and  plant  a  garden.  To  secure  that 
land,  he  would  need  to  be  kumwesu — that  is,  “one 
of  us.”  To  establish  his  bona  fides,  he  would  have 
to  rely  on  his  family  in  the  village  to  testify  to  his 
roots  there;  to  his  participation  in  its  affairs,  as  by 
returning  for  weddings  and  funerals;  and  to  his 
continued  support  of  the  welfare  of  its  citizens,  as 
by  remitting  funds  from  town.  Thus,  the  ability  of 
the  young  to  fulfill  their  plans  rested  on  the  will¬ 
ingness  of  their  family  to  confirm  that  they  had 
fulfilled  their  obligations  to  their  elder  kin. 

Theory  had  therefore  failed,  or  at  least  been 
exposed  as  inconsistent  with  the  known  data,  and 
its  failure  had  motivated  a  renewed  pursuit  of 
qualitative  data — one  focused  on  a  search  for  fea¬ 
tures  of  reality  that  would  fill  gaps  in  the  argu¬ 
ment.  Phrased  differently,  in  the  interplay  between 
theory  and  qualitative  research  that  characterizes 
the  generation  of  analytic  narratives,  formal  the¬ 
ory — which  is  often  viewed  as  a  deductive 
method — is  instead  used  inductively. 

Thus  far,  more  emphasis  has  been  given  to  field¬ 
work  and  theory  than  to  the  use  of  empirical  meth¬ 
ods.  Recall,  however,  their  use  when  testing  the 
implications  derived  from  the  comparative  statics 
of  the  expected  income  theory  of  migration.  As 
illustrated  in  Rural  Responses  to  Industrialization , 
they  play  an  additional  role  as  well:  They  are  used 
to  rule  out  alternative  explanations. 

Returning  to  the  contributions  of  the  elders  to 
the  young,  someone  might  offer  an  alternative 


Analytic  Narratives:  Applications 


69 


explanation:  The  elders  could  be  viewed  as  altruistic 
rather  than  self-interested.  Their  payments  to  the 
young  could  be  treated  as  gifts  made  out  of  affec¬ 
tion.  Given  that  both  accounts  are  consistent  with 
what  has  been  observed,  researchers,  when  mar¬ 
shalling  empirical  methods  to  choose  between  the 
two  accounts,  need  to  bring  them  to  bear  on  data 
drawn  from  outside  of  the  original  sample.  To 
gather  such  data,  the  author  shifted  his  attention  to 
the  plateaus  that  lay  above  the  valley.  Because  of  the 
tsetse  flies,  the  villagers  in  Luapula  could  not  raise 
livestock;  because  the  plateaus  were  free  of  tsetse 
flies,  those  who  lived  there  could.  There,  the  elders 
invested  in  cattle.  Because  cattle  reproduce,  herds 
grow  over  time,  and  as  the  urban  centers  grow,  so 
does  the  demand  for  meat.  All  else  being  equal, 
cattle  herds  therefore  increase  in  value,  enabling 
their  owners  to  recruit  clients,  bargain  for  brides, 
and  bid  for  favors.  If  the  elders  were  motivated  by 
self-interest,  then,  rather  than  investing  in  their  chil¬ 
dren,  they  would  invest  in  their  cattle;  if  they  were 
motivated  by  affection,  they  would  invest  in  junior 
kin.  The  evidence  showed  clearly  that  they  invested 
in  their  herds.  They  stinted  on  the  payment  of 
school  fees,  so  the  level  of  education  remained  low. 
The  rate  of  migration  of  cattle-keeping  peoples 
remained  low  as  well.  Gathering  empirical  data 
from  out  of  the  original  sample  thus  lend  credence 
to  the  researcher’s  interpretation  of  the  behavior  of 
the  villagers.  Had  affection  dominated  self-interest, 
then  the  flow  of  resources  between  generations  on 
the  plateau  would  have  resembled  that  in  the  valley. 
Clearly,  it  did  not.  The  analysis  of  theoretically  rel¬ 
evant  data,  collected  outside  of  the  original  sample, 
thus  dispensed  with  an  alternative  interpretation, 
leaving  the  original  intact. 

Conclusion 

The  end  product  of  this  research  was  a  “village 
study.”  But  it  was  also  a  study  that  was  the  product 
of  repeated  efforts  at  falsification.  When  ideas  were 
rendered  coherent,  then  data  were  gathered  to  test 
them;  when  reflection  exposed  weaknesses  in  inter¬ 
pretation,  then  additional  field  research  filled  in  the 
gap.  The  resultant  study  was  not  “large  N.”  Nor 
was  it  axiomatic  and  deductive.  But,  as  it  was 
shaped  by  theory  and  statistical  methods,  the  study 
“scaled  up.”  It  confirmed  that  villagers  in  the  mid¬ 
dle  of  Africa  were  rational  and  self-interested  in 


their  pursuit  of  wealth  and  their  use  of  power,  a 
controversial  claim  at  the  time  and  one  still  con¬ 
tested  by  cultural  anthropologists.  As  in  the  treat¬ 
ment  of  the  behavior  of  the  villagers  on  the  plateau, 
it  supported  those  who  contest  the  premise  of  cul¬ 
tural  conservatism  among  cattle-keeping  peoples.  It 
helped  redefine  the  understanding  of  how  migra¬ 
tion  takes  place  and  how  rural  people  react  to 
urban  opportunities.  And,  along  with  the  contribu¬ 
tions  of  others,  it  helped  alter  the  manner  in  which 
political  scientists  approach  the  study  of  agrarian 
politics. 

There  are  some  in  political  science  who  cham¬ 
pion  the  “deep  reading”  of  texts  and  “immersion” 
in  rich,  qualitative  data.  Others  label  themselves  as 
formal  theorists  and  generate  rigorous  proofs.  And 
still  others  consider  themselves  to  be  methodolo¬ 
gists  and  specialize  on  the  development  and  use 
of  systematic  empirical  methods.  Those  who 
develop  analytic  narratives  view  these  approaches 
as  complementary.  The  first  yields  Versteben  or 
comprehension;  the  second  generates  conviction, 
demonstrating  that  the  explanation  is  logically 
consistent;  and  the  third  determines  whether  it  is 
compelling — that  is,  it  can  be  demonstrated  con¬ 
vincingly  to  others.  By  combining  these  approaches, 
those  who  seek  to  produce  analytic  narratives  “do 
science”  even  while  focusing  on  unique  events  or 
particular  settings. 

Robert  H.  Bates 
Harvard  University 
Cambridge,  Massachusetts,  United  States 
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Analytic  Narratives: 
The  Method 


An  analytic  narrative  is  a  methodological  pro¬ 
posal  to  bridge  rational  choice  modeling  with 
more  traditional  narrative  explanations  of  phe¬ 
nomena  in  the  social  sciences.  Typically,  the  meth¬ 
odology  combines  models  from  game  theory  with 
historical  narrative.  The  analytic  narrative  proj¬ 
ect,  however,  does  not  represent  a  methodological 
breakthrough  in  itself.  Rather,  the  project  makes 
explicit  the  methodology  that  numerous  scholars 
have  adopted  when  combining  historical  and 
comparative  research  with  rational  choice  models. 
Influenced  by  the  work  of  Douglass  C.  North,  the 
analytic  narrative  project  started  off  with  a  com¬ 
mitment  to  understand  institutional  formation 
and  change.  In  their  book  Analytic  Narratives 
(1998),  Robert  Bates,  Avner  Greif,  Margaret  Levi, 
Jean-Lauran  Rosenthal,  and  Barry  Weingast  offer 
the  first  systematic  outline  of  the  key  elements  of 
the  methodology,  and  they  do  so  by  exploring 
institutional  change  in  a  wide  range  of  places  and 
times.  The  authors’  intent,  as  well  as  that  of  the 
majority  of  analytic  narrativists,  is  to  investigate 
enduring  questions  of  political  economy,  such  as 
political  order,  political  and  economic  gover¬ 
nance,  and  intra-  or  interstate  relations. 

Moreover,  a  distinguishing  feature  of  the  ana¬ 
lytic  narrative  project  is  that  it  underscores  the 
importance  of  the  institutional  context  within 
which  historical  events  occur.  Understanding  the 
institutional  context  helps  account  for  how  and 
why  certain  events  may  happen.  The  importance 


afforded  to  the  historical  and  institutional  detail  is 
in  fact  what  sets  analytic  narrativists  apart  from 
most  rational  choice  scholars.  Both  deduce  their 
hypotheses  from  the  assumptions  of  rational  choice 
and  the  logic  of  game  theory.  Rational  choice 
scholars,  however,  start  off  from  a  general  model 
and  then  test  their  hypothesis  with  appropriate 
data,  while  analytic  narrativists  formulate  and 
refine  the  model  itself  in  interplay  with  the  con¬ 
text-specific  institutional  elements  of  the  historical 
narrative.  A  major  objective  of  analytic  narratives, 
shared  certainly  with  social  science  research,  is  to 
successfully  identify  the  causal  factors  that  explain 
a  particular  historical  phenomenon — that  is,  to 
determine  how  some  combination  of  causes 
brought  it  about.  The  notion  of  “mechanism”  is 
commonly  used.  This  entry  discusses  the  condi¬ 
tions  under  which  analytic  narratives  are  most 
successful  in  providing  explanations  for  historical 
phenomena.  It  first  examines  the  distinctive  roles 
of  formal  theory  and  context-specific  evidence  in 
developing  a  conjecture  and  ascribing  causality. 
Second,  it  explores  the  behavioral  assumptions 
underlying  strategic  game  theory  and  their  impli¬ 
cations  for  analytic  narratives. 

Theoretical  Model  Versus  Narrative 

The  construction  of  an  analytic  narrative  proceeds, 
roughly,  as  follows.  First,  the  scholar  acquires  in- 
depth  knowledge  about  the  historical  phenomenon 
of  interest;  that  is,  a  detailed  account  of  the  con¬ 
text  and  the  historical  process,  based  on  studying 
the  past  through  primary  sources  or  reading  the 
already  existing  historical  accounts.  A  detailed 
account  is  essential  to  isolate  the  relevant  strategic 
elements  in  the  interaction:  the  key  actors,  their 
goals,  and  the  rules  that  structure  their  behavior. 
These  elements  can  then  be  formalized  in  a  model. 
The  formulation  of  the  model — generally  a  game 
theory  model — specifies  the  choices,  constraints, 
and  trade-offs  the  actors  face  in  the  phenomenon 
in  question.  The  model  is  supplemented  with  a 
narrative  that  provides  a  rich  explanation  of  the 
meaning  actors  attach  to  their  actions,  circum¬ 
stances,  and  surroundings:  the  significance  of  the 
local  culture.  The  outcomes  predicted  by  the  theo¬ 
retical  analysis  are  then  confronted  with  the  narra¬ 
tive;  the  narrative  serves  to  assess  the  predictions 
and  arbitrate  among  possible  explanations  in 
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instances  of  observational  equivalence.  Further 
refinement  of  the  model,  and  collection  of  more 
historical  detail,  can  result  from  additional  itera¬ 
tions  between  the  analytics  and  the  history. 
Analytic  narratives  stand  on  a  careful  balance 
between  context-specific  detail  and  rigorous  ana¬ 
lytic  techniques.  An  analytic  narrative,  however,  is 
problem  driven,  not  theory  driven.  Thus,  the 
explanation  of  a  particular  event  is  what  motivates 
each  study.  Nonetheless,  even  if  driven  by  a  par¬ 
ticular  case,  analytic  narratives  are  informed  by 
theoretical  modeling.  The  exercise  of  formaliza¬ 
tion,  by  isolating  the  relevant  strategic  elements  in 
an  interaction,  helps  identify  the  key  actors  and  the 
combination  of  causes  that  can  explain  the  strate¬ 
gic  situation  in  question. 

A  major  reason  for  applying  models  to  narra¬ 
tives  is  to  attain  logically  consistent  explanations; 
thus,  the  predictions  of  the  model  must  follow 
deductively.  However,  it  must  be  pointed  out  that 
the  model  does  not  need  to  do  the  bulk  of  the 
explanation.  The  theory  highlights  the  issues  to  be 
explored  and  the  general  considerations  and  evi¬ 
dence  that  need  to  be  examined,  while  the  knowl¬ 
edge  of  the  historical  context  is  used  to  develop  a 
conjecture  regarding  the  relevant  institution.  The 
theory  provides  categories  and  a  framework  in 
terms  of  which  the  conjecture  about  the  causal 
mechanism  is  formulated.  The  model,  thus,  con¬ 
strains  but  does  not  determine  the  conjecture. 

The  conjecture  is  then  evaluated,  refined,  or 
even  overhauled  through  the  interactive  use  of  a 
context-specific  model  and  the  historical  narrative. 
This  empirical  method  thereby  recognizes  and 
takes  advantage  of  the  context  specificity  and  his¬ 
torical  contingency  of  institutional  analysis.  A  suc¬ 
cessful  explanation  requires  a  well-confirmed 
causal  claim  about  why  and  how  a  certain  out¬ 
come  obtained — this  can  be  done  even  if  it  is  pri¬ 
marily  the  narrative  rather  than  the  model  that 
accounts  for  the  explanation.  The  explanation  that 
the  model  points  to  should  also  survive  competi¬ 
tion  with  other  explanations.  These  can  be  existing 
explanations  that  the  author  may  be  contesting  or 
other  potential  explanations  that  could  account 
for  the  historical  facts.  The  proposed  explanation 
must  be  confronted  with  these  other  explanations 
and  shown  to  be  superior. 

Analytic  narratives,  by  providing  a  detailed 
account  of  the  context  and  the  historical  process, 


acknowledge  the  uniqueness  of  situations  that  take 
place  in  particular  moments  and  at  specific  places. 
At  the  same  time,  analytic  narratives  seek  to  iden¬ 
tify  causal  mechanisms  that  are  generalizable  to 
other  situations,  inasmuch  as  they  explain  a  social 
phenomenon  by  identifying  how  some  combina¬ 
tion  of  causes  brought  it  about.  Analytic  narratives 
seek  to  capture  the  uniqueness  of  the  situation 
under  investigation  by  means  of  the  narrative 
while  using  the  model  to  capture  the  general  fea¬ 
tures  of  the  type  of  phenomena  under  which  this 
situation  falls.  The  use  of  rational  choice  in  ana¬ 
lytic  narratives,  by  carefully  identifying  the  causal 
mechanisms,  makes  possible  the  application  of  the 
logic  of  one  setting  to  another. 

Explanations  that  can  account  for  different  phe¬ 
nomena  using  few  arguments — that  is,  generaliz- 
able  explanations — are  of  course  desirable. 
Generalizability,  however,  is  not  a  requirement  of 
explanation,  since  a  causal  mechanism  may  be 
unique  to  a  phenomenon.  Thus,  for  an  analytic 
narrative  to  provide  a  successful  explanation,  it  is 
not  necessary  that  the  causal  mechanism  be  gener- 
alizable.  It  suffices  that  the  events  the  model  identi¬ 
fies  as  causes  and  effects  actually  take  place.  If  a 
generalizable  statement  of  the  mechanism  identi¬ 
fied  by  an  analytic  narrative  is  available,  this  is  of 
course  a  plus.  In  short,  for  analytic  narratives  to 
offer  successful  explanations,  the  formalization  at 
the  heart  of  the  narrative  must  correctly  identify 
some  of  the  causes,  effects,  and  enabling  condi¬ 
tions  in  question.  This  does  not  require  that  the 
causal  mechanism  be  generalizable.  What  is 
required  is  that  the  events  the  model  identifies  as 
causes  and  effects  actually  take  place  and  that  the 
statement  of  causal  mechanism  survives  competi¬ 
tion  with  other  explanations. 

Behavioral  Assumptions 

The  reliance  of  analytic  narratives  on  explicit  for¬ 
mal  theorizing  compels  scholars  to  take  a  system¬ 
atic  approach  and  to  reveal  the  reasoning  and 
assumptions  behind  their  explanations.  However, 
by  emphasizing  certain  aspects  of  reality,  this  reli¬ 
ance  on  models,  and  specifically  game  theory  mod¬ 
els,  highlights  certain  scope  conditions  for  which 
the  analytic  narrative  project  is  better  suited.  The 
scope  conditions  refer  to  the  situations  in  which 
game  theory  is  the  best  tool  to  explain  human 
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behavior.  In  terms  of  the  historical  phenomena  of 
interest  and  the  types  of  questions  addressed,  ana¬ 
lytic  narratives  are  a  better  tool  when  causal  expla¬ 
nation  is  found  in  strategic  interactions.  They  the¬ 
orize  strategic  interaction  and  not  the  structural 
conditions  under  which  the  interaction  takes  place. 
Macrolevel  structural  factors  are  taken  as  exoge¬ 
nous  during  modeling  interactions,  which  implies 
that  changes  in  such  factors  need  to  be  incorpo¬ 
rated  as  moves  by  “nature”  and  are  not  treated  in 
an  analytic  fashion.  This  emphasis  on  strategic 
interaction,  however,  is  not  a  weakness  of  analytic 
narratives  as  much  as  a  necessity  given  their  focus 
on  microlevel  historical  data. 

The  analytical  framework  of  classical  game 
theory  rests  on  seemingly  unrealistic  assumptions 
about  the  cognition,  information,  and  rationality 
of  the  parts  in  the  interaction.  What  are  the  costs 
of  these  assumptions?  Analytic  narratives  empha¬ 
size  the  importance  of  the  institutional  context  in 
which  situations  take  place.  This  institutional  con¬ 
text  can,  in  fact,  provide  the  analytic  narrativist 
with  what  Greif  calls  the  “socially  articulated  and 
disseminated  rules”  that  provide  individuals  with 
the  cognitive,  coordinative,  and  informational 
abilities  that  the  theory  assumes.  Thus,  the  speci¬ 
ficity  of  the  institutional  context  can  in  many  cases 
justify  the  behavioral  assumptions  (Greif,  2006, 
chap.  5). 

The  fact,  however,  that  game  theory  may  not  be 
useful  in  certain  conditions  does  not  imply  that 
analytic  narratives  must  be  abandoned.  They  can 
theorize  by  means  of  other  forms  of  game  theory 
that  rely  on  different  behavioral  assumptions — for 
instance,  evolutionary  game  theory  and  behavioral 
game  theory.  Uncertainty  about  the  possible  alter¬ 
natives  and  complexity  of  the  situation  can  lead  to 
biases  from  the  predicted  game  theoretic  behavior. 
In  these  situations,  the  scholar  needs  to  evaluate 
whether  the  theory  should  be  modified  to  incorpo¬ 
rate  uncertainty  or  the  pertinent  behavioral 
assumptions  by  making  use  of  behavioral  or  evo¬ 
lutionary  game  theoretic  models.  The  usefulness  of 
analytic  narratives  requires  an  awareness  of  the 
types  of  situations  in  which  game  theory  is  best 
applicable  and  that  can  benefit  most  from  the 
methodology. 

Analytic  narratives,  therefore,  recognize  and 
take  advantage  of  the  context  specificity  and  his¬ 
torical  contingency  of  institutional  analysis.  At  the 


same  time,  they  rely  on  explicit  formal  theorizing 
that  compels  scholars  to  take  a  systematic  approach 
and  reveal  the  reasoning  and  assumptions  behind 
their  explanations.  This  empirical  methodology 
allows  deduction  and  induction  to  complement 
each  other  and  to  be  complemented  by  a  context- 
specific  analysis. 
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Etymologically,  the  term  anarchism  derives  from 
two  ancient  Greek  words:  av,  meaning  “absence 
of,”  and  Apxr|,  meaning  “authority,”  “govern¬ 
ment,”  “ruler,”  or  “war  chief.”  Anarchism  thus 
carries  a  negative  charge:  the  negation  of  author¬ 
ity,  the  absence  of  rulers.  Anarchism  is  opposed  to 
all  forms  of  hierarchy,  including  the  state,  capital¬ 
ism,  religious  institutions,  patriarchy,  and  racism. 
Yet  the  term  also  conveys  a  positive  political 
project  of  justice,  liberty,  equality,  and  solidarity. 
According  to  the  basic  principles  of  anarchism, 
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genuine  freedom,  equality,  and  solidarity  are  logi¬ 
cal  and  political  impossibilities  in  a  system  or  a 
regime  where  some  rule  and  others  are  ruled.  All 
political  regimes  except  anarchy  are  oppressive 
and  tyrannical,  even  those  claiming  to  embody  the 
nation  or  the  sovereign  people.  That  is  why  anar¬ 
chism  can  be  seen  as  the  ultimate  revolutionary 
ideology  and  social  movement.  From  the  perspec¬ 
tive  of  all  elitist  ideologies,  including  liberalism, 
anarchism  is  fundamentally  flawed  because,  the 
argument  goes,  human  beings  cannot  find  satis¬ 
faction  unless  they  are  under  the  authority  of  rul¬ 
ers,  be  they  warlords,  priests,  nobles,  property 
owners,  or  elected  representatives. 

As  a  political  ideology  or  philosophy,  anarchism 
involves  three  elements:  (1)  a  positive  ideal  (anar¬ 
chy),  (2)  a  critical  discourse  (opposition  to  hierar¬ 
chies  and  oppression),  and  (3)  the  means  to  achieve 
this  ideal  (evolution  or  revolution,  nonviolence  or 
violence,  mass  movement,  or  affinity  groups).  In 
some  cases,  it  may  also  involve  an  ontological  con¬ 
ception  of  human  nature.  By  and  large,  anarchism 
adopts  a  neutral  sociological  outlook  posited  on  a 
binary  human  nature  with  a  positive  and  a  nega¬ 
tive  pole.  The  positive  pole  is  the  ability  to  live  with 
others  in  a  spirit  of  equality  and  solidarity,  while 
the  negative  pole  is  the  selfish  drive  for  domina¬ 
tion.  As  demonstrated  by  Peter  Kropotkin  and 
Elisee  Reclus,  it  is  the  social  structure  that  deter¬ 
mines  which  pole  prevails.  If  the  structure  is  hier¬ 
archical,  then  people  in  a  position  of  authority  will 
become  self-indulgent  and  arrogant  and  abuse 
their  authority.  Only  a  social  environment  where 
no  one  is  in  a  position  to  exert  power  over  others 
can  ensure  that  people’s  goodness  will  prevail. 

This  entry  describes  the  evolution  of  anarchism 
from  its  early  roots  to  its  development  during  the 
“anarchist  century”  from  the  mid-19th  century  to 
1939.  It  also  examines  the  resurgence  of  anarchism 
in  the  latter  part  of  the  20th  century,  influenced  by 
postmaterialist  thought. 

Historical  Development  of  Anarchism 

Some  commentators  assert  that  anarchism  as  a 
political  ideology  or  philosophy  has  existed  since 
the  dawn  of  humanity  and  can  be  found,  at  least  to 
a  degree,  in  Taoism,  stoicism,  the  works  of  Zeno, 
and  the  concrete  experiences  of  communities  on 
every  continent  that  practiced  egalitarianism 


(among  adult  males,  at  any  rate).  Kropotkin’s  book 
Mutual  Aid  is  instructive  in  this  connection.  Others 
consider  anarchism  as  belonging  essentially  to 
European  modernity,  with  its  forerunners  emerging 
in  religious  social  uprisings  (the  Anabaptists),  the 
English  Revolution  (the  diggers),  and  the  French 
Revolution  (the  enrages ).  William  Godwin  (1756- 
1836)  is  generally  regarded  as  the  first  true  anar¬ 
chist  philosopher  ( Enquiry  Concerning  Political 
Justice ,  1793).  The  Frenchmen  Anselme  Belle- 
garrigue,  editor  of  L’anarchie — Journal  de  L’ordre 
(of  which  only  two  issues  appeared,  in  1850),  and 
Pierre-Joseph  Proudhon  were  among  the  first  to 
call  themselves  “anarchists.”  They  were  followed 
by  Mikhail  Bakunin  (1814-1876),  Fouise  Michel 
(1830-1905),  Voltairine  de  Cleyre  (1866-1912), 
Peter  Kropotkin  (1842-1921),  Elisee  Reclus  (1830- 
1905),  Emma  Goldman  (1869-1940),  and  Errico 
Malatesta  (1853-1932). 

In  its  early  days,  anarchism  attracted  several 
prestigious  fellow  travelers,  such  as  the  painters 
Gustave  Courbet  and  Camille  Pissarro,  the  play¬ 
wright  Henrik  Ibsen,  the  literary  critic  Herbert 
Read,  and  the  philosophers  Emmanuel  Mounier 
and  Bertrand  Russell.  Some  famous  anarchists, 
like  the  geographers  Kropotkin  and  Reclus,  were 
eminent  scientists.  Contemporary  scholars  identi¬ 
fied  with  anarchism  include  Allan  Antliff  (art  his¬ 
tory),  Normand  Baillargeon  (education  science), 
Harold  Barclay  (anthropology),  Susan  Brown 
(political  philosophy),  John  Clark  (philosophy), 
David  Colson  (sociology),  Ronald  Creagh  (sociol¬ 
ogy),  Uri  Gordon  (social  ecology),  David  Graeber 
(anthropology),  John  Holloway  (political  philoso¬ 
phy),  Ruth  Kinna  (political  science),  and  Robert 
Wolff  (political  philosophy). 

There  is  an  anarchist  studies  network  linked 
with  the  Political  Science  Association  in  Britain 
and  a  number  of  scholarly  anarchist  journals,  such 
as  Anarchist  Studies ,  Refractions,  and  Social 
Anarchism.  Yet  it  is  in  the  nature  of  anarchism  to 
expect  intellectual  and  political  activities  to  go 
hand  in  hand  (praxism),  and  any  division  of  labor 
between  theorists  or  scholars  and  activists  is 
viewed  with  suspicion.  Bakunin  held  that  anar¬ 
chism  acknowledges  that  some  people  may  be 
more  talented  or  skilled  in  a  specific  area,  includ¬ 
ing  theorization.  However,  there  must  be  no 
monopoly  of  expertise.  To  be  free,  the  people  deal¬ 
ing  with  any  expert  should  have  the  following:  first, 
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access  to  other  experts  in  the  same  field;  second,  the 
right  to  ignore  the  advice  of  experts;  and,  finally, 
the  opportunity  to  become  experts  themselves 
through  education  and  practice.  In  other  words, 
while  expertise  may  confer  influence  on  its  pos¬ 
sessor,  it  should  be  available  to  all  and  not  be  used 
to  wield  power  over  others.  Anarchist  ideas  are  for 
the  most  part  expressed  and  disseminated  anony¬ 
mously  in  self-published  independent  journals, 
zines,  and  websites  (see  Infoshop)  or  in  the  publica¬ 
tions  of  anarchist  groups  and  organizations  (e.g., 
Federation  Anarchiste  and  Alternative  Libertaire  in 
France,  Anarchist  Black  Cross  and  Northeastern 
Federation  of  Anarchist-Communists  in  the  United 
States,  Class  War  in  Great  Britain,  and  Union 
Communiste  Libertaire  au  Quebec  in  Canada). 
Individual  artists,  like  the  composer  John  Cage,  or 
trends,  such  as  the  punk  movement  (e.g.,  Berurier 
Noir,  Crass),  are  also  identified  with  anarchism. 

Central  Elements  of  Anarchism 
The  Classical  Age  of  Anarchism 

From  a  historical  perspective,  there  are  two  dis¬ 
tinct  moments  of  anarchism  as  a  social  and  political 
movement.  First,  the  classical  age  of  anarchism — or 
the  “anarchist  century” — began  with  the  first  texts 
of  Proudhon  (who  in  1840  famously  stated, 
“Property  is  theft”)  and  ended  with  the  defeat  of 
the  Spanish  Revolution  in  1939.  The  period  was 
characterized  primarily  by  the  working-class’s 
expression  and  experience  of  anarchism,  although 
some  activists  were  already  concerned  with  issues 
such  as  women’s  emancipation  (abortion,  birth 
control),  free  love  and  bi-  or  homosexuality,  war 
and  peace,  racism  and  anticolonialism,  free  educa¬ 
tion,  and  vegetarianism.  This  epoch  encompasses 
the  Paris  Commune  of  1871,  the  Industrial  Workers 
of  the  World  in  the  United  States,  the  Makhnovist 
rebellion  during  the  Russian  civil  war,  and  the 
Spanish  Revolution  (1936-1939).  During  this  time, 
anarchism  was  influential  in  Central  and  Latin 
America,  especially  in  Argentina  from  the  late 
1800s  through  the  1920s.  Of  note  is  the  year 
1919,  particularly  the  “bloody  week,”  when  about 
1,000  people  were  killed  and  50,000  arrested  dur¬ 
ing  a  strike,  including  many  anarchists.  In  Chile, 
anarchist  organizations  boasted  50,000  members 
in  1910  out  of  a  total  population  of  3  million.  In 
Mexico,  anarchism  was  known  through  popular 


figures  such  as  Ricardo  Flores  Magon  (1874— 
1922),  while  in  Cuba  there  were  tens  of  thousands 
of  activists  in  the  1920s.  Anarchist  activists  and 
propagandists  were  very  often  at  the  forefront  of 
social  struggles  and  strike  movements,  such  as  the 
campaign  for  the  8-hour  working  day.  In  other 
parts  of  the  world,  anarchism  was  limited  to  small 
groups,  active  only  for  short  periods  of  time  but 
nevertheless  subjected  on  occasion  to  bloody 
repression.  Examples  include  the  Philippines  anti¬ 
colonialist  movement  around  1900  as  well  as  the 
Japanese  anarchist  and  antiwar  activist  Osugi 
Sakae  (1885-1923)  and  the  anarcha-feminist  Ito 
Noe  (1895-1923). 

The  classical  age  of  anarchism  is  still  widely 
associated  with  illegalism  (social  banditism)  and 
terrorism  (“propaganda  by  deeds”).  Around  1900, 
terrorists  assassinated  several  heads  of  state  (French 
President  Sadi  Carnot,  1897;  William  McKinley, 
president  of  the  United  States,  1901;  Russian  Prime 
Minister  Pyotr  Stolypin,  1911;  Spanish  Prime 
Minister  Jose  Canalejas,  1919;  King  George  I  of 
Greece,  1913),  property  owners,  and  military  offi¬ 
cers,  very  often  in  response  to  the  bloody  repres¬ 
sion  of  the  workers’  movement.  But  at  no  time  did 
so-called  anarchist  violence  even  come  close  to  the 
level  of  violence  (imprisonment,  torture,  death 
penalty,  mass  murders,  etc.)  perpetrated  against 
anarchists  by  agents  of  the  state,  private  paramili¬ 
tary  units  and  death  squads,  nationalist  and  fascist 
militias,  and  authoritarian  Marxist  militants. 
Anarchists  were  targeted  by  special  repressive  laws 
in  almost  all  Western  states  in  the  closing  decade  of 
the  19th  century.  Thousands  were  killed  by  their 
former  allies  in  Russia  (Leninists)  and  Spain 
(Stalinists),  and  they  were  among  the  first  political 
prisoners  to  enter  the  Nazi  concentration  camps  in 
Germany.  Violent  repression,  in  conjunction  with 
the  heightened  popularity  and  power  of  authoritar¬ 
ian  Marxists  following  the  1917  Russian  Revolution 
and  the  Red  Army  resistance  and  victory  against 
the  Nazi  armies  in  1941  to  1945,  accounts  for 
anarchism’s  declining  influence  within  the  progres¬ 
sive  and  revolutionary  movements  during  the  first 
half  of  the  20th  century. 

Postmaterialism  and  the  Revival  of  Anarchism 

The  second  moment  of  anarchism  relates  to  the 
so-called  postmaterialist  context  of  the  1960s. 
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With  spectacular  events  such  as  May  1968  in 
Paris,  anarchism  experienced  a  revival,  although 
authoritarian  Marxists  (Leninists,  Maoists, 
Trotskists,  Guevarists,  etc.)  remained  very  influen¬ 
tial  among  radicals.  The  anarchist  resurgence  was 
also  connected  with  environmentalism,  feminism, 
and  pacifism,  which  embodied  anarchism  not  only 
by  advocating  the  abolition  of  capitalism,  patriar¬ 
chy,  and  the  state  but  also  by  organizing  their 
militant  activities  with  neither  formal  leaders  nor 
hierarchy,  through  a  strictly  egalitarian,  participa¬ 
tive,  and  deliberative  decision-making  process. 

The  contemporary  resurgence  of  anarchism  as  a 
major  tendency  among  radical  progressive  forces 
came  with  the  collapse  of  the  Soviet  empire,  the 
intellectual  crisis  of  Marxism,  and  a  series  of 
events  at  the  turn  of  the  21st  century  that  signaled 
the  rise  of  the  so-called  antiglobalization  move¬ 
ment  and  resulted  in  part  from  anarchist  mobiliza¬ 
tion:  the  World  Social  Forum  in  Porto  Alegre, 
Brazil  (2001)  and  the  mass  protests  in  Seattle, 
Washington,  in  1999;  Washington,  D.C.,  and 
Prague,  Czech  Republic,  in  2000;  and  Quebec, 
Canada,  and  Genoa,  Italy,  in  2001.  Many  com¬ 
mentators  saw  this  as  a  “new  anarchism,”  charac¬ 
terized  by  (a)  the  innovative  tactics  of  direct  action, 
(b)  the  horizontal  structure  of  militant  organiza¬ 
tions,  and  (c)  the  inclusion  of  pluralist  concerns. 
The  direct  action  tactics  were  exemplified  by  the 
Black  Blocs,  a  street  tactic  originating  in  the 
German  Autonomen  movement  of  the  1980s; 
Reclaim  the  Streets  in  the  United  Kingdom;  and 
carnivalesque  actions  such  as  the  Pink  Blocs  or  the 
Clandestine  Insurgent  Rebel  Clown  Army.  The 
horizontal  structures  were  inherited  from  the  pre¬ 
vious  generation  of  radical  ecologists,  feminists, 
and  pacifists.  Finally,  the  pluralist  concerns, 
emphasizing  cultural  and  sexual  diversity,  were 
clearly  due  to  the  influence  of  the  feminist  and  gay 
and  lesbian  movements,  although  some  anarchists 
were  still  based  in  the  labor  movement  or  were 
actively  involved  in  labor  issues  and  anticapitalist 
mobilizations.  In  principle,  the  neo-anarchists  (or 
“postanarchists” — Hakim  Bey,  Lewis  Call,  Todd 
May,  and  Saul  Newman)  regard  as  equally  impor¬ 
tant  the  need  to  oppose  the  state,  war,  capitalism 
and  neoliberalism,  racism,  sexism  and  hetero- 
normativity,  ageism,  and  specism.  This  opposition 
must  furthermore  take  place  globally  as  well  as 
locally  and  in  everyday  life.  Indigenous  movements 


of  resistance,  such  as  the  1994  Zapatista  insurrec¬ 
tion  in  Chiapas,  Mexico,  are  viewed  as  stimulat¬ 
ing  examples  of  anarchism  in  practice,  mainly 
because  they  are  self-organized  through  participa¬ 
tive  and  deliberative  popular  assemblies  (the 
Mohawk  scholar  Taiaiake  Alfred  has  coined  the 
term  anar cho-indigenism) . 

Currents  in  Anarchist  Thought 

Historically,  there  have  been  several  anarchist  cur¬ 
rents,  each  with  its  particular  position  on  what  is 
primary  (the  individual  or  the  community,  liberty 
or  equality),  its  priorities  (the  emancipation  of 
workers  or  women,  the  environment),  and  its 
organizational  structure  (mass  movements,  unions 
or  communes,  affinity  groups).  Very  often,  they 
have  collaborated  in  solidarity,  at  times  they  have 
engaged  in  intense  debates,  and  sometimes  they 
have  clashed. 

Individual (ist)  Anarchism 

Individualist  anarchism,  according  to  Max 
Stirner,  1806-1856;  Lysander  Spooner,  1808- 
1887;  and  Benjamin  Tucker,  1854-1939,  empha¬ 
sizes  liberty  and  autonomy  and  denies  that  there  is 
something  other  than  individuals  to  which  an 
individual  must  submit  (e.g.,  nation/motherland, 
class/proletariat).  However,  such  anarchists  are  as 
a  rule  deeply  concerned  by  the  exploitation  of  the 
working  class  and  the  oppression  of  women  and 
may  very  well  take  part  in  social  struggles,  but 
through  affinity  groups  rather  than  mass  move¬ 
ments.  They  generally  believe  that  free  association 
can  be  beneficial  for  the  individual’s  happiness  and 
interests,  and  through  the  years  they  have  been 
involved  in  rural  or  urban  communes  and  in 
squats.  However,  membership  in  an  association 
must  be  totally  voluntary,  with  the  freedom  to  join 
or  leave  at  any  time. 

Libertarianism  and  anarcho-capitalism  are 
sometimes  used  as  synonyms  of  individual  anar¬ 
chism,  although  the  latter  is  in  fact  a  kind  of  radi¬ 
cal  liberalism,  advocating  free  market  capitalism 
for  property  owners  and  wageworkers,  under  the 
protection  of  either  a  state  whose  function  is 
strictly  limited  to  the  enforcement  of  law  and 
order  or,  in  the  absence  of  a  state,  of  private  secu¬ 
rity  agencies.  These  views  imply  that  property 
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owners  are  in  a  position  of  authority  and  power 
over  their  employees,  an  idea  inconsistent  with 
individual  as  well  as  collective  anarchism. 
Proprietarian  is  the  term  coined  by  Murray 
Bookchin  to  describe  libertarianism. 

Like  Marxism,  anarchism  views  capitalism  as 
an  exploitative  system,  protected  by  the  state, 
which  also  exploits  workers  to  sustain  itself.  This 
said,  anarchism  offers  a  variety  of  economic  pro¬ 
posals  for  abolishing  capitalism  and  reorganizing 
the  production  of  goods  and  services. 

Collectivist  Anarchism 

Proudhon  proposed  mutualism  (from  the 
Mutualists,  a  19th-century  secret  society  of  weav¬ 
ers  in  Lyon,  France).  He  suggested  that  just  as  the 
hive  belongs  to  all  bees,  the  means  of  production 
and  exchange  must  be  collective.  However,  each 
member  of  the  collective — a  free  association — 
should  share  to  some  degree  in  the  worst  tasks. 
This  job  rotation  would  provide  everyone  with  a 
better  knowledge  and  understanding  of  the  whole 
working  process.  Higher  functionaries,  such  as 
architects  and  engineers,  would  be  assigned 
through  election,  but  decisions  and  working  rules 
would  be  made  collectively  through  a  participative 
deliberative  process.  The  exchange  value  would  be 
assessed  and  then  converted  into  tickets  of  hours 
of  work  under  the  supervision  of  a  People’s  Bank, 
which  would  also  offer  interest-free  credit. 
Consequently,  there  is  no  need  for  a  violent  revolu¬ 
tion;  workers  need  only  organize  among  them¬ 
selves  to  be  free. 

Some  anarchists  have  identified  serious  issues 
arising  from  Proudhon’s  economic  proposal: 
Should  the  evaluation  of  hours  of  works  take  into 
account  job  intensity  and  risk,  required  training 
and  skills,  and  the  worker’s  social  responsibilities 
(such  as  dependents)?  Moreover,  this  remains  a 
very  individualistic  model  of  work  organization, 
which  draws  on  a  bygone  tradition  of  craftsman¬ 
ship  that  values  the  artisan’s  self-reliance  and  per¬ 
sonal  pride.  Finally,  according  to  some,  Proudhon 
is  not  a  true  anarchist,  because  he  devotes  consid¬ 
erable  energy  to  demonstrating  the  physical,  intel¬ 
lectual,  moral,  and  political  superiority  of  men 
over  women,  as  evidenced  by  the  more  than  500 
pages  of  blatant  misogyny  and  antifeminism  in  his 
De  la  Justice  dans  la  Revolution  et  I’Eglise  (1860). 


Anarcho-Syndicalist  Anarchism 

Anarcho-syndicalists,  identified  with  Rudolf 
Rocker  (1873-1958),  see  labor  unions  (syndicats 
in  French)  as  the  spearhead  of  social  emancipation 
and  revolution  and  the  seed  of  the  future  anarchist 
society.  Property  owners — the  bourgeoisie — and 
the  state  create  nothing;  they  are  mere  parasites  of 
workers’  productive  activity.  Hence,  if  workers 
unite  and  stop  working  for  their  exploiters,  the 
system  will  collapse  and  revolution  will  ensue.  The 
general  strike  is  therefore  the  main  instrument  for 
the  realization  of  anarchy.  Thereafter,  global  pro¬ 
duction  will  be  structured  according  to  “industrial 
groups”  in  each  sector  of  production  and  inte¬ 
grated  in  an  “economic  federation.”  However, 
collective  decision  making  would  remain  a  bot¬ 
tom-up  process  centered  on  the  general  assembly 
of  every  place  of  work.  Economic  production 
based  on  human  needs  will  ensure  that  all  receive 
the  goods  and  services  they  require. 

Anarcho-Communism 

With  anarcho-communism,  identified  with 
Peter  Kropotkin  and  Alexander  Berkman  (1870- 
1936),  among  others,  the  state  becomes  the  direct 
target  of  a  revolutionary  mass  movement.  Although 
anarcho-communism  often  stresses  the  importance 
of  the  workers’  movement,  it  is  more  open  than 
many  other  anarchist  trends  to  peasants,  women, 
and  the  lumpenproletariat.  Adopting  a  geographi¬ 
cal  perspective,  anarcho-communism  regards  the 
urban  or  rural  commune  as  the  center  of  the  new 
society.  After  the  revolution,  the  communes  may 
join  in  a  federation,  although  the  decision-making 
function  will  be  situated  at  the  local  level,  in  keep¬ 
ing  with  a  bottom-up  process.  In  the  absence  of 
exploiters,  such  as  the  bourgeoisie  and  state 
agents,  production  output  will  necessarily  surpass 
society’s  needs.  Anarchy  will  therefore  lead  to  a 
situation  of  abundance  where  all  will  receive  what¬ 
ever  goods  and  services  they  need  and  want. 

Working  Inside  Versus  Outside 
the  Political  System 

“Libertarian  municipalism”  is  a  contemporary 
version  of  anarchism,  primarily  developed  in  the 
1980s  by  Janet  Biehl  and  Murray  Bookchin. 
Libertarian  municipalism  lays  special  emphasis  on 
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ecological  issues,  contending  that  these  could  be 
dealt  with  more  effectively  at  the  local  level  but 
only  if  municipal  institutions  were  deliberative  and 
participative.  Bookchin  even  suggests  that  anar¬ 
chists  get  involved  in  municipal  politics,  a  view 
that  has  been  criticized  by  other  anarchists  as  obvi¬ 
ously  inconsistent  with  the  antistate  anarchist 
principle. 

Bookchin  is  not  the  only  self-proclaimed  anar¬ 
chist  calling  for  some  sort  of  involvement  in  offi¬ 
cial  politics.  Authors  and  activists  like  Colin  Ward 
and  Paul  Goodman,  or  Philippe  Corcuff,  in  France, 
who  coined  the  term  libertarian  social  democracy , 
argue  that  some  anarchism  in  the  official  system  is 
better  than  no  anarchism  at  all.  They  advocate 
reforms  with  regard  to  issues,  laws  and  regulations 
concerning  individual  freedom,  free  education,  free 
sexuality,  and  so  on.  According  to  Noam  Chomsky, 
it  is  the  duty  of  today’s  anarchists  to  support  the 
state  and  its  welfare  programs  against  neoliberal- 
ism  and  capitalism,  or  what  he  calls  “private  tyr¬ 
annies.”  Chomsky  is  even  in  favor  of  voting  for  the 
Democratic  Party  in  the  United  States,  though  he 
admits  that  mass  social  movements  are  still  neces¬ 
sary  to  push  the  political  elite  in  the  right  direction. 

Others  argue  that  it  is  possible  to  have  anarchy 
here  and  now  but  outside  the  official  system.  For 
instance,  the  green  anarchist  Clark  advocates 
“microcommunities.”  In  today’s  network  of 
squats,  for  instance,  activists  who  align  themselves 
with  anarchism  take  over  unoccupied  buildings  or 
land  and  try  to  live  outside  capitalism  as  much  as 
possible,  offering  free  meals  and  rooms  to  travelers 
and  traveling  to  participate  in  campaigns  of  soli¬ 
darity  with  immigrants  or  with  antiwar  and  anti¬ 
capitalist  movements. 

Feminism  and  Anarchism 

The  distinctions  between  various  forms  of  anar¬ 
chism  are  not  as  schematic  in  reality  as  what  is 
generally  presented  in  books,  articles,  and  encyclo¬ 
pedias.  Other  strands  of  anarchism  do  not  fit  per¬ 
fectly  within  the  classical  economic  and  political 
framework.  Anarcha-feminism — for  instance,  is  a 
feminist  version  of  anarchism,  devoted  to  fighting 
patriarchy  both  in  society  at  large  and  within  the 
Marxist  and  anarchist  networks,  where  misogynis- 
tic  and  antifeminist  attitudes  are  not  uncommon 
(witness  the  term  manarchy ,  recently  coined  by 


U.S.  activists  to  describe  inequalities  among  anar¬ 
chist  men  and  women).  In  the  19th  century, 
authors  and  militants  like  Goldman  and  de  Cleyre 
denounced  the  “sex  slavery”  of  marriage  and  pro¬ 
moted  free  love.  In  the  1930s,  the  anarcha-feminist 
autonomous  association  Mujeres  Fibres  boasted 
some  30,000  members  in  Spain  and  was  an  active 
force  during  the  Revolution.  It  focused  on  educa¬ 
tion  for  girls  and  women,  combat  training  for 
women,  medical  help  for  wounded  fighters,  and 
the  emancipation  of  prostitutes  through  psycho¬ 
logical  and  material  support  and  the  assassination 
of  certain  pimps.  From  the  1960s  onward,  Peggy 
Kornegger  and  Susan  Brown,  among  others,  high¬ 
lighted  the  similarities  between  anarchism  and 
feminism,  arguing  that,  to  be  consistent,  anarchists 
should  be  feminists  (although  some  feminists,  such 
as  the  liberal  or  Marxist  statist  feminists,  may  not 
be  anarchists,  i.e.,  opposing  all  forms  of  hierarchy 
and  inequality).  “Third-wave”  feminism,  with  its 
emphasis  on  individual  sexual  choice  and  queer 
identities,  is  sometimes  identified  with  anarchism. 
Nevertheless,  the  anarcha-feminist  Claire  Snyder 
warns  “choice  feminists”  that  under  patriarchy, 
women’s  sexual  life  may  not  be  totally  free  from 
the  external  control  of  men  or  from  “internal 
tyrants,”  the  name  given  by  Goldman  to  the  social¬ 
ization — internal  fears  and  false  beliefs — that 
undermines  women’s  freedom  of  will  and  choice. 
Finally,  with  regard  to  economic  production,  the 
anarchist  and  feminist  Carol  Ehrlich  notes  that  it  is 
not  only  the  bourgeoisie  as  a  class  that  exploits 
wageworkers  but  also  men  who  exploit  women’s 
work,  which  is  often  undertaken  with  no  monetary 
compensation. 

Anarchism  in  the  21st  Century 

Today,  a  large  number  of  principles  closely  linked 
to  anarchism  for  the  past  150  years  have  become, 
for  many  people,  ordinary  moral  and  political  lib¬ 
eralism:  the  8-hour  working  day;  education  for  all, 
boys  and  girls  alike;  women’s  freedom  of  choice 
with  regard  to  matters  such  as  contraception  and 
abortion;  freedom  in  sexuality  and  love,  including 
bi-  and  homosexual  rights  and  the  right  to  divorce 
or  to  cohabit  without  marrying;  freedom  of  con¬ 
sciousness  with  respect  to  religion  and  state  dogma; 
and  freedom  to  refuse  military  service.  While  not 
self-identifying  with  anarchism,  several  influential 
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contemporary  scholars,  especially  in  France  and 
Italy,  have  adopted  an  anarchist  approach  in  their 
work  on  state  power  (Giorgi  Agamben),  political 
representation  (Michel  Foucault,  Jean-Franqois 
Lyotard),  micropolitics  and  resistance  (Gilles 
Deleuze),  and  the  reorganization  of  production 
and  resistance  based  on  egalitarian  principles 
(Michael  Flardt  and  Antonio  Negri).  Others,  such 
as  Donatella  della  Porta,  are  investigating  new 
social  movements  that  practice  “direct  democ¬ 
racy,”  which  in  fact  amounts  to  studying  anar¬ 
chism  in  action. 

This  implicitly  or  explicitly  anarchist  activism 
within  the  “movement  for  global  justice”  is  the 
target  of  state  repression,  even  though  anarchism 
does  not  represent  a  significant  threat  to  social, 
political,  and  economic  liberalism,  as  might  have 
been  the  case  at  the  turn  of  the  20th  century. 
Thus,  there  were  legitimate  grounds  for  former 
British  Prime  Minister  Tony  Blair’s  jibe  about  the 
summit-hopping  “anarchist  traveling  circus.” 
Nevertheless,  anarchism  provides  the  most  vocal 
and  radical  left-wing  criticism  of  liberalism. 
During  the  2001  G8  (Group  of  Eight)  Summit  in 
Genoa,  after  violent  clashes  between  police  offi¬ 
cers  and  protesters — one  of  whom  was  shot  dead 
at  point-blank  range — Canadian  Prime  Minister 
Jean  Chretien  warned  about  the  “anarchists  who 
want  to  destroy  democracy.  ”  In  Greece  and  Italy, 
anarchist  networks  have  been  targeted  by  the 
police  in  recent  years:  Many  have  been  arrested, 
and  in  Greece,  one  person  was  killed  in  2008.  The 
Federal  Bureau  of  Investigation  and  Scotland 
Yard  have  also  been  preoccupied  since  the  late 
1990s  with  anarchists  reputedly  training  for  ter¬ 
rorist  actions  (which  never  occurred).  Even  certain 
political  scientists,  such  as  Tim  Dunne  of  Exeter 
University,  claim  that  anarchist  activists  and  A1 
Qaeda  militants  are  very  similar  in  nature.  In  sum, 
anarchism  said  goodbye  to  the  20th  century  much 
as  it  had  welcomed  it — that  is,  cast  in  the  role  of 
a  potential  “terrorist”  threat,  the  number  one 
internal  public  enemy. 
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Anarchy 


Anarchy  is  the  absence  of  government  or,  more 
generally,  political  authority  over  and  between  the 
units  of  a  political  system.  As  an  analytic  concept, 
the  term  does  not  imply  a  lack  of  political  order  or 
the  presence  of  chaos  and  thus  differs  from  infor¬ 
mal  and  colloquial  use.  The  term  also  differs  from 
anarchism,  a  normative  and  possibly  utopian 
position  that  advocates  minimizing  the  scope  of 
political  authority  to  maximize  the  domain  of 
individual  autonomy. 

The  condition  of  anarchy  is  widely  understood 
to  describe  the  modern  international  system  in 
which  states  are  the  units  of  analysis,  each  is  fully 
sovereign,  and  all  are  formally  equal.  It  is  this  con¬ 
dition  of  anarchy,  in  turn,  that  separates  interna¬ 
tional  relations  from  other  domains  of  politics  and 
renders  it,  for  many  analysts,  a  distinct  field  of 
inquiry  with  different  rules  and  patterns  of  interac¬ 
tion.  Although  other  political  arenas  may  also  be 
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anarchic,  such  as  legislatures  where  vote  trading 
between  members  cannot  be  legally  enforced,  the 
analytic  concept  has  not  been  widely  applied 
beyond  the  realm  of  international  relations  and,  in 
a  few  cases,  failed  states. 

Consequences  of  Anarchy 

For  most  scholars  of  international  relations,  anar¬ 
chy  requires  that  all  states  must  rely  only  on  their 
own  resources  and  abilities — a  practice  described 
as  the  principle  of  self-help.  Lacking  any  authority 
that  states  can  appeal  to  for  protection,  aid,  or 
binding  adjudication,  each  state  must  ultimately 
depend  on  its  own  efforts  and  wisdom.  This  fur¬ 
ther  implies  that  any  agreement  made  between 
states  must  be  self-enforcing  or  in  the  interests  of 
the  parties  to  carry  out  if  and  when  actually  called 
on  to  do  so. 

Although  the  assumption  that  the  international 
system  is  anarchic  is  widely  shared,  the  meaning 
and  consequences  of  this  assumption  are  still  vig¬ 
orously  debated.  For  realists,  anarchy  produces  a 
zero-sum,  competitive  struggle  between  states.  For 
political  realists,  anarchy  did  not  feature  promi¬ 
nently  and  was  more  a  passive  background  condi¬ 
tion;  Flans  Morgenthau — for  instance,  rooted  the 
drive  for  power  in  the  innate  character  of  political 
man,  not  in  the  nature  of  the  international  system. 
For  Neorealists  (sometimes  called  structural  real¬ 
ists),  however,  anarchy  is  one  of  the  defining  fea¬ 
tures  of  international  structure  with  significant 
causal  effects. 

Even  within  neorealism,  there  are  two  promi¬ 
nent  schools.  For  defensive  realists,  anarchy 
requires  only  that  states  seek  security,  although 
they  may  also  pursue  expansion,  glory,  or  power 
for  other  reasons.  Given  that  some  states  may  pos¬ 
sess  aggressive  tendencies,  however,  all  states  must 
be  ever  vigilant  and  prepared  to  defend  themselves. 
Uncertainty  over  the  motives  of  other  states  and 
problems  of  credible  commitment,  in  turn,  some¬ 
times  leads  to  bargaining  failures  and  war.  Offensive 
realists  believe  anarchy  is  such  a  challenging  condi¬ 
tion  that  states  must  pursue  power  at  all  times. 
Since  states  are  always  insecure,  the  fear  that  others 
will  exploit  them  forces  states  to  pursue  all  means 
necessary  to  impose  their  will  on  others.  Also,  as 
power  is  always  zero  sum,  anything  that  gives  an 
advantage  to  one  state  must  create  a  disadvantage 


for  at  least  one  other.  In  this  view,  anarchy  implies 
that  international  politics  are  a  perpetual  and 
intense  struggle  for  domination. 

Neoliberal  institutionalists  see  anarchy  as  a  con¬ 
dition  that  can  be  mitigated,  if  not  fully  resolved, 
by  voluntarily  negotiated  institutions  between 
states.  Unlike  in  neorealism,  anarchy  does  not 
define  the  goals  that  states  seek  but,  rather,  merely 
permits  dilemmas  of  collaboration  and  coordina¬ 
tion  to  arise  that  thwart  cooperation  between  util¬ 
ity-maximizing  states.  These  dilemmas,  in  turn, 
can  be  overcome  in  part  by  institutions  that  pro¬ 
vide  information  or  make  commitments  more  cred¬ 
ible.  In  a  form  of  self-organizing  order,  states  can 
avoid  some  of  the  harsher  implications  of  anarchy 
by  building  institutions  without  necessarily  subor¬ 
dinating  themselves  to  any  central  authority. 

Finally,  constructivists  see  anarchy  as  an  open- 
ended  condition  filled  by  the  social  purposes  of 
states,  which  are  themselves  constructed  of  socially 
appropriate  and  interpreted  roles  and  norms  that 
vary  over  time  and  space.  Thus,  within  the  condi¬ 
tion  of  anarchy,  states  may  represent  themselves 
and  others  in  ways  that  produce  a  Hobbesian  world 
depicted  by  realism,  a  Lockean  world  that  reflects 
important  elements  of  neoliberal  institutionalism  in 
its  focus  on  natural  rights,  or  a  Kantian  world  of 
greater  peace  and  cooperation  than  imagined  by 
either  of  the  alternatives.  In  all  of  these  different 
socially  constructed  world  orders,  there  is  a  poten¬ 
tial  for  systemic  transformation  that  is  excluded  by 
neorealism  or  neoliberal  institutionalism. 

Critiques  of  Anarchy 

Although  subject  to  continuing  controversy,  all 
three  perspectives  share  a  common  focus  on  anar¬ 
chy  even  while  they  debate  its  meaning  and  impli¬ 
cations.  Increasingly,  scholars  are  criticizing  the 
very  concept  of  anarchy  and  the  role  it  has  played 
in  the  development  of  international  relations  the¬ 
ory.  Too  numerous  to  describe  in  detail  here,  many 
of  the  specific  critiques  can  be  grouped  into  two 
larger  themes.  First,  the  common  notion  of  anar¬ 
chy  is  derived  from  an  overly  narrow  formal-legal 
conception  of  authority.  Alternative  conceptions 
of  authority  open  up  the  possibility  of  many  differ¬ 
ent  kinds  of  authority  existing  at  the  same  time 
within  the  international  system.  Second,  sover¬ 
eignty  is  not  indivisible,  as  commonly  averred,  but 
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is  a  bundle  of  different  authorities  that  can  be 
disaggregated  in  ways  that  do  not  necessarily  coin¬ 
cide  with  traditional  nation-states.  Divisible  sover¬ 
eignty  permits  a  patchwork  of  sometimes  overlap¬ 
ping,  competing,  or  complementary  authorities  to 
exist  simultaneously.  Taken  together,  these  critical 
themes  suggest  that  many  forms  of  global  gover¬ 
nance  that  have  been  ruled  out  by  the  common 
assumption  of  anarchy  may  have  actually  existed 
in  the  past  and  may  yet  expand  in  the  future. 

In  formal-legal  conceptions  of  authority,  the 
person  (or  unit)  in  authority  has  the  right  to  issue 
and  enforce  certain  commands  over  a  set  of  subor¬ 
dinates  because  of  the  lawful  position  or  office  that 
he  or  she  holds.  Authority  does  not  inhere  in  the 
individual  (or  individual  unit)  but  in  the  person  as 
an  officer  who  is  duly  appointed  or  elected  through 
some  lawful  procedure.  Elected  by  a  majority  of  the 
Electoral  College,  for  instance,  a  person  becomes 
the  president  of  the  United  States;  by  established 
rules  of  rotation,  a  country  collectively  holds  the 
presidency  of  the  Council  of  the  European  Union 
(EU).  Applied  to  international  relations,  since  there 
is  no  lawful  authority  or  procedure  above  states,  no 
state  or  other  unit  can  be  authorized  to  govern  over 
other  states.  As  a  result,  the  system  and  relations 
between  states  within  that  system  must  be  anarchic. 

Formal-legal  authority,  however,  is  simply  one 
possible  source  of  authority.  The  German  sociolo¬ 
gist  Max  Weber,  closely  identified  with  this  for¬ 
mal-legal  conception,  also  posited  that  legitimacy 
and  authority  could  derive  from  charisma,  tradi¬ 
tion,  or  religious  belief  as  well.  Others  argue  that 
legitimacy  and,  hence,  authority  can  arise  from 
psychological  principles  of  fairness  and  justice, 
socially  constructed  norms,  or  a  negotiated  social 
contact  between  ruler  and  ruled.  These  other 
sources  of  legitimacy  at  least  open  the  possibility 
of  authority  by  states,  international  organizations, 
nongovernmental  organizations  (NGOs),  and 
other  units  over  other  similar  units.  While  debate 
continues  over  who  has  authority  over  whom  for 
what,  a  growing  body  of  critical  research  suggests 
that  international  relations  are  not  entirely  anar¬ 
chic  but  better  described  as  a  variegated  system  of 
multiple  units  in  authority  drawn  from  multiple 
sources  of  authority. 

In  turn,  sovereignty  has  been  traditionally  con¬ 
ceived  as  indivisible  or  culminating  in  a  single  apex 
within  each  territorially  distinct  entity.  Sovereignty 


was  once  vested  in  a  king,  emperor,  or  “sovereign.” 
Today,  sovereignty  typically  resides  in  the  “people” 
and  has  itself  evolved  over  time  to  include  most 
citizens  of  a  country.  But  in  either  case,  there  is 
assumed  to  be  within  each  society  a  single,  ultimate 
authority  that  cannot,  even  in  part,  be  subordinate 
to  any  other  actor.  This  assumption  is  codified  in 
the  notion  of  Westphalian  or  juridical  sovereignty 
embodied  in  the  United  Nations  Charter;  today, 
states  need  not  actually  control  their  territory,  as  in 
the  past,  but  only  need  to  be  recognized  as  sover¬ 
eign  by  other  sovereigns  to  be  accorded  that  status. 

Yet  there  is  an  increasing  awareness  that  states 
have  never  fit  this  idealized  vision  of  Westphalian 
sovereignty.  Indeed,  even  historically,  sovereignty 
appears  better  described  as  a  set  of  authorities  that 
are  disaggregated  across  units  within  a  state,  long 
recognized  as  federalism,  or,  more  important, 
across  states  and  third  parties.  This  disaggregation 
is  not  only  exemplified  by  multilevel  governance  in 
the  contemporary  EU  but  also  reflected  in  numer¬ 
ous  international  restraints  on  the  freedom  of 
action  of  states,  including  rights  of  protection  or 
guarantee  (e.g.,  the  United  States  and  the  Federated 
States  of  Micronesia),  rights  of  economic  and 
financial  control  (the  United  States  and  the 
Dominican  Republic  from  1904  to  1941),  rights  of 
servitude  (the  United  States  and  Japan,  under  vari¬ 
ous  status  of  forces  agreements),  and  rights  of 
intervention  (the  United  States  and  Panama,  under 
the  neutrality  treaty  of  1977). 

Combined  with  multiple  forms  of  authority,  the 
possibility  of  multiple  sites  of  authority  suggest 
that  patterns  of  global  governance  are  likely  to  be 
more  varied,  complicated,  and  dynamic  than  once 
assumed.  Transnational  nonstate  groups  exercise 
authority  over  their  members,  whether  they  be 
religious  orders,  labor  unions,  or  other  collective 
bodies.  NGOs  earn  authority  over  firms  and  even 
states  in  standard-setting  boards  and  international 
creditor  cartels  and  regulate  behavior  through 
monitoring  and  certification  procedures.  States 
exert  authority  over  one  another  in  spheres  of 
influence,  protectorates,  or  informal  empires. 
International  organizations  possess  authority  as 
well,  including  the  World  Trade  Organization  and 
its  dispute  settlement  procedures  over  trade,  for 
instance,  and  the  United  Nations  and  the 
International  Atomic  Energy  Agency  over  nuclear 
facilities  and  programs. 
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The  assumption  of  anarchy,  critics  charge, 
blinds  scholars  of  international  relations  to  this 
variegated  system  of  global  governance.  The  cri¬ 
tique  of  anarchy  and  the  expansion  of  global  gov¬ 
ernance  implies  three  essential  challenges  for  the 
future:  ( 1 )  Analysts  must  map  the  forms  of  global 
governance,  identify  where  current  authority  is 
inadequate  or  itself  unregulated  (as  in  the  so-called 
democratic  deficit  in  the  EU),  and  propose  reforms 
to  improve  human  welfare;  (2)  policymakers  must 
accept  and  navigate  between  these  multiple  forms 
of  authority  while  harnessing  them  to  their  national 
purposes,  when  appropriate;  and  (3)  global  citi¬ 
zens  must  work  to  ensure  that  global  authorities 
act  in  the  general  interest. 

David  A.  Lake 
University  of  California,  San  Diego 
La  Jolla,  California,  United  States 
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Anomia  is  a  noun  that  comes  from  the  Greek  a 
(negative  prefix)  and  nomos  (law),  meaning 
absence  of  norms.  The  original  meaning  of  anomia 


in  the  frame  of  the  classical  Greek  polity  defined  a 
condition  of  being  against  or  outside  the  law  or  a 
situation  where  current  laws  were  not  applied, 
resulting  in  a  state  of  illegitimacy  or  lawlessness.  In 
its  modern  usage,  the  meaning  of  anomia  has  a 
much  broader  scope,  although  it  is  a  rather  recent 
concept  in  the  history  of  social  thought.  In  much 
of  the  literature,  particularly  in  the  English- 
speaking  world,  the  French  spelling  anomie  is  also 
extensively  used.  This  entry  discusses  the  use  of  the 
term  in  modern  sociology,  the  widening  of  its 
scope,  and  its  present-day  relevance. 

Anomia  in  Durkheim’s  Thought 

In  the  social  sciences,  anomia  has  been  a  key  con¬ 
cept  in  the  development  of  modern  empirical  soci¬ 
ology.  It  is  normally  associated  with  the  work  of 
Emile  Durkheim  (1858-1919),  the  French  sociolo¬ 
gist  who  introduced  the  concept  in  his  early  socio¬ 
logical  masterpieces,  The  Division  of  Labor  (1893) 
and  Suicide  (1897).  In  his  sociological  perspective, 
anomia  was  no  longer  related  to  the  rule  of  law; 
thus,  it  lost  its  original  legal  content  and  was 
instead  defined  as  the  absence  of  norms  (legal  rules 
just  being  one  class  of  social  norms).  One  of  the 
major  tenets  of  Durkheimian  sociology  is  that  a 
theoretical  explanation  of  social  agency  must  be 
given  in  terms  of  social  causal  factors  and  not  on 
the  basis  of  psychological  conditions.  In  fact,  one 
can  say  that  the  major  contribution  of  Durkheim 
to  the  foundation  of  modern  sociology  has  been 
the  possibility  of  grounding  the  explanation  of 
social  action  on  purely  social  variables.  In  the 
famous  Durkheimian  formula,  presented  in 
Chapter  5  of  Rules  of  Sociological  Method  (1895), 
social  facts  must  be  explained  by  other  antecedent 
social  facts.  In  relation  to  anomia,  this  thesis 
implies  that  the  absence  of  norms  is  not  a  subjec¬ 
tive  psychological  condition  of  the  agent  but  an 
objective  feature  of  the  social  structure  or,  in 
strictly  Durkheimian  terms,  of  the  collective  life. 

The  first  references  to  anomia  by  Durkheim 
appear  in  The  Division  of  Labor,  his  doctoral  the¬ 
sis  at  the  University  of  Bordeaux.  In  this  early 
work,  referring  to  the  social  causes  of  the  division 
of  labor  as  a  phenomenon  typical  of  modern 
industrial  society,  Durkheim  elaborates  on  the 
“abnormal”  forms  of  this  phenomenon,  one  of 
these  forms  being  the  “anomic  division  of  labor.” 
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In  his  ensuing  famous  essay  Suicide ,  in  which 
Durkheim  put  forward  a  plausible  sociological 
explanation  of  the  apparently  purely  psychological 
phenomenon  of  disposing  of  one’s  own  life,  he 
characterized  a  type  of  suicide  that  he  called 
“anomic.”  This  type  occurs  in  the  case  of  major 
social  crises  (e.g.,  an  economic  crash) — that  is, 
structural  phenomena  that  invalidate  previous  reg¬ 
ulations  to  such  a  degree  that  some  agents  are  inca¬ 
pable  of  coping  with  the  radical  ambiguity  of  the 
ensuing  state  of  normlessness,  and  thus  they  take 
their  own  life  in  a  supreme  act  of  “anomic  suicide.” 
The  common  factor  of  both  the  anomic  division  of 
labor  and  the  anomic  suicide  is  the  dissolution  of 
social  integration,  the  underlying  social  phenome¬ 
non  that  is  manifested  in  the  absence  of  norms.  The 
reformist  stance  of  Durkheim  vis-a-vis  the  major 
political  and  social  issues  of  his  time  is  a  reflection 
on  how  it  were  possible  to  confront  trends  toward 
anomic  forms  of  social  life  in  a  secularized  and 
functionally  differentiated  modern  society. 

In  one  of  his  most  politically  oriented  works, 
Legons  de  Sociologie  (1950),  published  in  English 
as  Professional  Ethics  and  Civic  Morals  (1957), 
Durkheim  analyzes  the  relationships  between  an 
individual  and  the  sphere  of  public  regulation.  In 
his  view,  the  institutionalization  of  individual 
rights  is  the  role  of  the  State;  thus,  the  latter  has 
not  been  created  to  guarantee  that  individuals  are 
able  to  exercise  their  natural  rights;  rather,  the 
State  creates  and  organizes  those  fundamental 
rights,  and  it  makes  them  a  reality.  Political  ano¬ 
mia  is  then  a  condition  where  public  regulations 
have  reached  a  point  of  extreme  weakness,  close  to 
the  absence  of  rules,  and,  therefore,  individual 
rights  cannot  be  exercised.  In  political  terms,  how¬ 
ever,  anomia  is  not  anarchy.  The  latter  refers  to  a 
lack  of  leadership,  hierarchy,  and  command, 
whereas  anomia  implies  a  weakening  of  rules, 
structures,  and  organizations.  Clearly,  the 
Durkheimian  concept  of  political  anomia  necessar¬ 
ily  implies  a  radical  weakening  of  public  regula¬ 
tions.  In  short,  anomia,  in  all  of  its  possible  specific 
manifestations,  can  be  understood  as  a  societal 
consequence  of  rapid  and  deep  social  change,  and 
it  is  in  this  sense  that  anomia  is  a  concept  that  runs 
in  parallel  with  the  notion  of  social  crisis.  According 
to  Durkheim,  without  clear  rules  individuals  can¬ 
not  find  their  place  in  society.  Such  a  state  tends  to 
occur  in  periods  of  social  disruption  (such  as  rapid 


economic  growth  or  economic  depression),  bring¬ 
ing  about  greater  anomia  and,  concurrently,  higher 
rates  of  suicide,  crime,  and  deviance. 

Anomia  and  Social  Change 

Social  change  may  generate  anomia  either  in  the 
whole  of  society  or  in  some  parts  of  it,  so  it  must 
be  stressed  that,  in  a  given  society,  social  groups 
may  be  differentially  affected  by  anomia.  As  recent 
events  in  the  world  economy  illustrate,  economic 
crises  brought  about  by  business  cycles  have  dif¬ 
ferential  impacts  on  groups  according  to  their 
location  in  the  social  pyramid.  Both  sudden  down¬ 
ward  mobility,  which  is  normally  associated  with 
economic  depression,  and  the  rapid  onset  of  mate¬ 
rial  prosperity,  leading  to  quick  upward  mobility, 
tend  to  upset  previous  networks  in  which  lifestyles 
are  embedded,  in  each  case  increasing  the  chance 
of  anomia.  Talcott  Parsons,  in  the  early  1940s, 
discussing  the  social  aspects  of  fascism,  contends 
that  an  increase  in  anomia  may  be  an  outcome  of 
almost  any  social  changes  that  upset  previous  rou¬ 
tines  of  life  or  symbolic  associations,  all  of  which 
imply  instability  of  expectations.  In  Parsons’s 
strain  theory  approach,  social  ambiguity  as  well  as 
conflicting  role  expectations  may  lead  to  increased 
anomia,  and  this,  in  turn,  may  create  a  sort  of 
anxiety-generated  pattern  of  anomic  action  in 
areas  such  as  reactionary  voting  behavior.  In 
expanding  the  use  of  the  concept  of  anomia  to 
diverse  areas  of  social  interaction,  Edward 
Tiryakian  has  made  an  elaboration  of  the  notion 
of  sexual  anomia  as  an  heuristic  tool  in  approach¬ 
ing  features  of  contemporary  society  that  involve 
transformations  in  sexual  relations  and  sexual 
identity  that  presuppose  deregulations  in  modern 
sexual  relations. 

Anomia  and  Deviant  Behavior 
in  Merton’s  Theory 

A  distinction  in  the  normative  sphere  between  regu¬ 
lation  of  goals  and  regulation  of  means,  introduced 
by  Robert  K.  Merton,  has  become  a  key  dimension 
in  current  anomia-related  theories  that  tend  to 
focus  on  “anomic”  deviant  behavior.  A  clear  exam¬ 
ple  of  this  shift  in  focus  is  the  way  in  which  the 
Columbia  Encyclopedia  (6th  ed.)  defines  anomia: 
“a  social  condition  characterized  by  instability,  the 
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breakdown  of  social  norms,  institutional  disorga¬ 
nization,  and  a  divorce  between  socially  valid  goals 
and  available  means  for  achieving  them.”  Although 
Merton,  when  referring  to  socially  accepted  goals 
(values)  and  means  (instruments  for  achieving 
those  values),  has  in  mind  the  overall  universal 
striving  for  success  in  the  American  value  system, 
his  theory  is  a  general  proposition.  In  Mertonian 
terms,  in  any  given  society,  there  is  a  range  of  cul¬ 
turally  accepted  values  (goals)  and  a  corresponding 
set  of  normatively  approved  means  of  securing 
these  goals,  but  the  structure  of  economic  resources 
in  that  society  enables  only  certain  privileged 
groups  and  classes  to  succeed. 

Disparity  between  goals  and  means  is,  accord¬ 
ing  to  Merton,  the  main  cause  of  anomia.  Merton 
identified  five  possible  reactions  to  the  different 
relationships  between  goals  and  means.  The  first 
and  most  common  reaction  is  conformity ,  which 
presupposes  a  nonanomic  relationship  of  appro¬ 
priate  means  to  reach  the  socially  accepted  goals. 
All  of  the  other  four  possible  reactions  are  more  or 
less  anomic.  The  second  possible  reaction  is  inno¬ 
vation,  which  implies  adherence  to  goals  and  rejec¬ 
tion  of  socially  accepted  means.  The  third  possible 
reaction  is  ritualism,  based  on  adherence  to  means 
but  rejection  of  socially  accepted  goals.  In  the 
fourth  reaction,  retreatism,  both  the  goals  and  the 
means  are  rejected.  Finally,  the  fifth  and  final  pos¬ 
sible  reaction  is  rebellion,  when  both  goals  and 
means  are  ambiguously  dealt  with.  In  conditions 
of  anomia  (maladjustment  between  goals  and 
means),  underprivileged,  deprived  individuals  will 
turn  to  various  forms  of  individual  deviance  that 
are  dependent  on  alternative  means  to  reach  the 
same  desired  ends.  In  other  words,  anomia  occurs 
as  the  disjunction  of  means  and  goals  that  differ 
according  to  the  profile  of  the  social  opportunity 
structure.  Merton  was  then  able  to  propose  a  the¬ 
ory  of  anomia  embedded  in  social  stratification 
and  the  class  structure,  and  it  is  in  this  sense  that 
his  theory  transcends  the  specificity  of  the  American 
case  and  becomes  a  general  theory  of  anomia.  An 
interesting  example  of  a  Merton-inspired  research 
of  anomic  behavior  in  a  non-Western  cultural  con¬ 
text  is  the  study  conducted  in  2006  by  Parviz 
Piran,  concerning  the  reactions  of  Iranian  youth  to 
anomic  conditions  created  by  the  rapid  and  exten¬ 
sive  change  in  the  economic,  political,  and  cultural 
fabric  of  their  society. 


Anomia  in  Contemporary 
Political  Economy 

The  concept  of  anomia  is  also  present  in  the  field 
of  contemporary  political  economy  in  the  work  of 
analysts  who  are  advancing  a  theory  of  “institu¬ 
tional  anomia”  that  basically  states  that  in  the 
present  globalized  context,  as  economic  institu¬ 
tions  gain  dominance  in  shaping  society,  noneco¬ 
nomic  institutions  are  weakened  and  forced  to 
accommodate  the  market,  thus  feeding  a  condi¬ 
tion  of  high  institutional  anomia  that  is  conducive 
to  higher  violent  crime  rates.  To  the  extent  that 
increased  anomia  tends  to  erode  and  dissolve 
social  networks  in  which  social  action  is  embed¬ 
ded,  there  is  also  a  clear  connection  of  this  con¬ 
cept  with  the  notion  of  social  capital  that  is  basi¬ 
cally  the  outcome  of  associative  networking.  Con¬ 
temporary  research  on  the  sources  and  the  uses  of 
social  capital  by  different  social  actors  in  promot¬ 
ing  their  own  advancement,  which  is  a  strong  field 
in  policy-oriented  studies,  is  an  interesting  area 
for  further  applications  of  the  general  theory  of 
anomia. 

In  short,  the  concept  of  anomia  has  provided 
quite  a  fertile  intellectual  ground  for  the  develop¬ 
ment  of  the  modern  disciplines  of  the  social  sciences. 

Raul  Atria 
University  of  Chile 
Santiago,  Chile 
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Apathy  looks  at  first  like  a  very  broad,  even  vague 
concept.  It  plays,  however,  a  very  important  and 
critical  role  in  political  analysis,  particularly  in  the 
analysis  of  the  decline  in  voting  turnout  and  civic 
engagement  and  more  generally  in  the  study  of 
participation  and  civic  involvement  of  citizens. 
The  concept  is  derived  from  a  long,  intellectual, 
interdisciplinary  tradition.  It  was  first  proposed  in 
philosophy  but  has  been  extensively  used  in  cul¬ 
tural  psychology,  anthropology,  sociology,  and 
political  science.  The  concept  also  has  intellectual 
connections  with  other  concepts  or  notions  com¬ 
ing  from  the  sociological  and  political  philosophi¬ 
cal  traditions  such  as  alienation  and  anomie.  In  a 
typical  use  of  the  apathy  concept,  theories  of  mass 
society  have  explained,  in  the  wake  of  World  War 
II,  that  modernization  and  urbanization  have  been 
responsible  for  detaching  individuals  from  their 
primary  groups  (community,  family,  early  social¬ 
ization  groups),  generating  withdrawal  from  the 
public  sphere  and  lack  of  interest  and  motivation 
in  public  affairs.  This  entry  discusses  the  history 
and  recent  criticisms  of  this  concept. 

The  more  recent  perspective  on  political  apathy 
started  in  the  1950s  and  1960s,  in  particular  with 
the  publication  of  the  book  Civic  Culture  by 
Gabriel  Almond  and  Sidney  Verba  (1963).  One  of 
the  issues  of  Almond’s  and  Verba’s  political  cul¬ 
ture  approach  is  that  the  stability  of  democracy 
and  democratic  institutions  depends  on  a  “civic 
culture,”  a  mixture  of  participatory,  subject,  and 
parochial  orientations  toward  politics.  The  impor¬ 
tant  point  here  is  that  this  civic  culture  seems  to 
allow  many  citizens  to  be  apathetic  about  politics. 
Many  critics  of  the  book  took  this  to  be  a  kind  of 
justification  for  apathy. 

Since  the  1960s,  the  political  sociology  perspec¬ 
tive  on  apathy  is  mainly  linked  to  the  question  of 


political  mobilization  and  political  participation. 
In  a  very  challenging  study  of  the  relationship 
between  social  class  and  voting  in  the  mid-1980s 
in  the  Netherlands,  Leo  B.  Van  Snippenburg  and 
Peer  Scheepers  recalled  that  a  positive  correlation 
between  social  protest  and  income  inequalities  at 
least  implicitly  referred  to  the  thesis  of  Verelendung 
(pauperization)  that  originated  from  Karl  Marx’s 
work.  They  also  recall  that  this  thesis  states  that 
under  (relatively)  deteriorating  socioeconomic 
conditions,  the  deprived  will  turn  to  social  protest 
and  revolt.  The  thesis  was  later  criticized,  empha¬ 
sizing  that  history  has  regularly  shown  the 
contrary:  Revolt  emerges  under  circumstances  of 
relative  social  deprivation  and  is  more  likely  to 
happen  in  middle-class  or  privileged  groups  rather 
than  in  deprived  ones.  According  to  these  alterna¬ 
tive  views,  instead  of  revolting  against  the  system 
that  generates  inequalities,  a  form  of  “collective 
apathy”  often  manifests  itself  in  deprived  groups 
(Marie  Jahoda,  Paul  Lazarsfeld,  &  Hans  Zeisel, 
1971).  This  doubt  and  criticism  of  the  pauperiza¬ 
tion  thesis  was  strongly  formulated  in  studies  by  a 
very  influential  group  of  social  scientists  in  the 
1930s  and  the  1940s,  the  so-called  Frankfurt 
School,  in  particular  their  study  of  the  “authori¬ 
tarian  personality”  (Theodor  Adorno,  Else 
Frenkel-Brunswik,  Daniel  Levinson,  &  Nevitt 
Sanford,  1950).  In  this  study,  it  was  observed  that 
during  the  economic  depression  of  the  1930s,  the 
workers  and  have-nots  in  Germany  did  not  revolt. 
They  instead  remained  outside  the  political  scene 
and  were  apathetic.  Parts  of  the  middle  class,  by 
contrast,  actively  engaged  in  politics,  as  did  the 
cultural  and  economic  elites.  In  light  of  these  facts, 
the  Frankfurt  School  criticized  the  Marxian  notion 
concerning  the  direct  effects  of  pauperization  on 
political  radicalism.  The  Frankfurt  School  intro¬ 
duced  an  original  hypothesis  by  putting  “person¬ 
ality”  as  an  intermediate  variable  between  social 
characteristics  and  political  consciousness, 
between  class  and  ideology.  In  their  view,  belong¬ 
ing  to  a  deprived  group  does  not  lead  to  active 
political  behavior  but  encourages  the  development 
of  an  authoritarian  personality  (i.e.,  subordination 
to  authority,  conformity  to  rigid  social  norms, 
rejection  of  those  not  sharing  these  norms). 
Instead,  authoritarian  personalities  show  defer¬ 
ence  to  authorities  and  remain  outside  the  politi¬ 
cal  field. 
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More  recent  interpretations  came  from  other 
studies  completing  the  traditional  authoritarian 
personality  theories  through  a  hypothesis  pro¬ 
posed  by  Leo  Festinger  (1957)  in  Theory  of 
Cognitive  Dissonance-.  Awareness  of  the  low  likeli¬ 
hood  of  benefiting  from  an  “achievement  society” 
generates  a  mental  state  of  anomie,  social  isola¬ 
tion,  lack  of  confidence  in  the  system,  and  apa¬ 
thetic  behavior.  Socioeconomic  frustration  and 
status  anxiety  thus  mediate  the  correlation  between 
class,  authoritarian  values,  and  anomie.  From  this 
tradition,  it  can  be  concluded  that  modern  societ¬ 
ies  generate  apathy  in  different  ways  but  especially 
by  creating  gaps  between  aspirations,  values,  and 
realities.  This  body  of  literature  has  an  obvious 
relationship  with  another  intellectual  tradition, 
which  is  known  as  the  relative  deprivation  or  rela¬ 
tive  frustration  hypothesis.  This  hypothesis  states 
that,  despite  the  socioeconomic  deprivation  and 
social  inequalities  experienced  by  the  popular 
classes  during  the  periods  of  economic  stagnation, 
there  was  no  political  protest.  To  explain  this 
paradox,  political  sociology  frequently  makes  ref¬ 
erence  to  a  Tocqueville  effect.  Alexis  de  Tocqueville 
proposed  a  theory  of  rising  expectations  that  may 
offer  some  clues  to  the  paradox  of  apathy  and 
mobilization  (those  who  normally  would  have 
interest  to  mobilize  are  not  doing  so).  According  to 
this  theory,  it  is  not  deprivation  but  the  obstruc¬ 
tion  of  favorable  prospects  and  rising  expectations 
that  encourage  social  protest  and  radical  political 
behavior.  As  remarked  by  Van  Snippenburg  and 
Eisinga  Scheepers  (1991),  citizens  experiencing 
such  frustrations  “become  morally  indignant, 
sometimes  socially  envious,  and  they  easily  turn  to 
political  protest  and  radicalism  as  a  consequence” 
(p.  45).  This  Tocqueville  effect  has  been  used  to 
explain  political  protest  when  social  groups  expe¬ 
rience  significant  discrepancies  between  their  rais¬ 
ing  expectations,  the  realities  of  their  occupational 
chances,  and  the  possibilities  for  participating  in 
political  decision  making. 

More  recently,  the  concept  of  apathy  has  been 
rejuvenated  and  updated  by  a  set  of  notions  pro¬ 
posed  in  the  context  of  empirical  studies  on  legiti¬ 
macy  and  political  support.  Political  apathy  has 
been  mentioned  in  the  explanation  of  the  apparent 
lack  of  political  support  and  political  trust  in  con¬ 
temporary  democracies.  This  also  used  to  be  the 
case  with  the  concept  of  democratic  deficit  that 


was  first  proposed  in  the  context  of  European 
Union  (EU)  integration  and  its  legitimacy.  Since 
the  beginning  of  the  1990s,  a  significant  body  of 
literature  has  investigated  the  democratic  gap  in 
EU  institutions.  This  democratic  deficit  is  sup¬ 
posed  to  explain  the  lack  of  interest,  motivation, 
and  civic  engagement  of  European  citizens  in  the 
process  of  EU  decision  making.  The  idea  of  a 
democratic  deficit  or  democratic  disenchantment  is 
also  related  to  a  set  of  phenomena  identified  as  the 
emergence  of  critical  citizens  (Pippa  Norris,  1999). 
According  to  Norris,  critical  citizens  aspire  to 
democracy  and  consider  it  as  their  ideal  form  of 
government,  but  at  the  same  time  they  are  deeply 
skeptical  when  evaluating  how  democracy  works 
in  their  own  country. 

Different  explanations  have  been  proposed  for 
such  phenomena  and  are  not  fully  integrated  even 
if  going  in  a  common  direction.  Since  the  mid- 
1990s,  for  instance,  many  commentators  pointed 
out  that  if  contemporary  democracies  were  not 
facing  a  system  crisis,  at  least  they  were  facing  a 
syndrome  of  disenchantment  and  democratic  mal¬ 
aise:  Citizens  in  many  industrial  societies  had 
become  disengaged  from  public  space.  It  is  worth 
noting  that  the  theory  of  critical  citizens  is  perfectly 
compatible  with  the  portrait  of  apathetic  citizens  or 
skeptical,  even  angry  or  cynical  citizens  rather  than 
with  the  image  of  apathetic  citizens  expecting  pub¬ 
lic  services  but  not  supporting  the  democratic  sys¬ 
tem  actively.  The  set  of  explanations  coming  from 
this  literature  about  the  democratic  gap  in  contem¬ 
porary  societies  claims  that  this  phenomenon  arises 
from  some  combination  of  growing  public  expecta¬ 
tions,  negative  news,  lack  of  public  trust,  and  fail¬ 
ing  government  performance.  Complementary 
explanations  have  been  proposed,  such  as  social 
capital  theory  focusing  on  a  decline  of  civic  engage¬ 
ment  and  social  trust  (Robert  Putnam,  1993)  or 
postmaterialist  theories  (Ronald  Inglehart,  1997), 
emphasizing  the  changing  value  orientations  linked 
to  processes  of  individualization.  Indeed,  the  World 
Values  Surveys  document  in  a  large  set  of  countries 
that  individualization  processes  detach  citizens 
from  the  public  sphere.  But  the  postmaterialist 
revolution  also  shows  the  capacity  of  educated  citi¬ 
zens  to  be  the  changing  force  of  a  cultural  revolu¬ 
tion  in  many  societies,  education  playing  a  crucial 
role  in  the  cognitive  mobilization  processes  under¬ 
lying  such  value  changes. 


86 


Area  Studies 


A  challenging  view  about  apathetic  citizens  has 
been  proposed  by  Pierre  Rosanvallon,  a  French 
historian  engaged  in  research  on  the  intellectual 
roots  of  contemporary  democracy.  In  the  context 
of  lamentations  about  the  low  turnout  in  European 
elections  and  the  apathy  of  citizens  regarding  EU 
issues,  Rosanvallon  contradicted  the  myth  of  a 
passive  citizen.  His  main  argument  was  that  the 
phenomenon  of  abstention  reflects  a  mutation  of, 
and  not  just  a  decline  in,  the  public  sphere. 
According  to  him,  the  decline  in  voter  turnout  has 
often  been  accompanied  by  other  democratic 
activities.  Rosanvallon  substitutes  the  view  on 
democratic  fatigue  in  contemporary  societies  (a 
view  that  too  many  elections  weaken  the  “demo¬ 
cratic  appetite”  of  citizen)  with  a  distinction 
between  three  facets  of  democracy:  (1)  expression 
of  citizens  in  making  judgments  on  the  rulers  and 
their  actions;  (2)  involvement ,  which  encompasses 
all  means  by  which  citizens  get  together  to  produce 
a  common  world;  and  (3)  intervention ,  which  con¬ 
sists  of  all  forms  of  collective  action. 

According  to  Rosanvallon,  the  characteristic  elec¬ 
tions  should  be  superimposed  on  these  different 
forms  of  civic  life.  In  other  words,  voting  is  the  most 
condensed  form  of  democracy  and  the  most  orga¬ 
nized  and  visible.  Rosanvallon  observes  that  although 
lower  participation  in  elections  has  undoubtedly 
eroded  the  democratic  expression,  involvement  and 
intervention  have  been  strengthened.  As  an  example, 
Rosanvallon  reports  the  following  data  on  the 
French  case:  In  terms  of  involvement,  participation 
has  not  declined.  The  French  are  now  more  likely  to 
have  signed  petitions  (68%  in  2000  compared  with 
53%  in  1990).  They  have  also  taken  part  in  more 
events  and  demonstrations  to  make  their  voices 
heard.  In  other  words,  citizens’  apathy  does  have 
some  empirical  foundations,  but  it  is  a  multifaceted 
and  multidimensional  phenomenon.  Depending  on 
the  definition,  the  available  empirical  indicators  can 
change  significantly  the  views  about  a  supposedly 
apathetic  citizenry. 
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Area  studies  is  the  generic  term  for  multidisci¬ 
plinary  social  research  that  focuses  on  specific 
geographic  regions  or  culturally  defined  areas. 
The  largest  scholarly  communities  in  this  respect 
are  loosely  defined  as  Asian,  African,  Latin 
American  or  Middle  Eastern  studies,  together 
with  a  variety  of  subfields  (Southeast  Asian  stud¬ 
ies,  Caribbean  studies,  etc.).  Political  science  plays 
a  major  role  in  area  studies  programs,  which  typi¬ 
cally  also  draw  on  disciplines  such  as  history, 
sociology,  ethnology,  geography,  linguistics,  liter¬ 
ature,  and  cultural  studies.  In  political  science,  the 
presence  of  area  studies  is  strongest  in  compara¬ 
tive  politics,  but  they  are  also  incorporated  in 
international  relations  (IR)  and  other  fields.  This 
entry  first  describes  the  origins  of  area  studies, 
from  early  colonial  studies  to  the  development 
and  institutionalization  of  area  studies  in  the  Cold 
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War  era.  It  then  outlines  the  not  always  easy  rela¬ 
tion  between  area  studies  and  the  discipline  of 
political  science,  highlighting  key  achievements  as 
well  as  remaining  problems.  The  final  section 
turns  to  the  current  research  frontier,  which  is 
marked  by  a  new  impetus  for  comparative  area 
studies  and  by  transnational  perspectives  that 
question,  among  other  things,  established  con¬ 
cepts  of  space  or  area. 

The  Origins  of  Area  Studies 

Today’s  area  studies  can  be  seen  as  having  their 
origins  in  the  colonial  expansion  of  European 
powers  and  the  accompanying  academic  efforts  to 
better  understand  those  “foreign”  or  “native” 
societies  that  demonstrated  a  variety  of  languages, 
cultures,  and  social  organizations  hitherto 
unknown.  In  this  sense,  area  studies  emerged  as  a 
“child  of  empire,”  often  driven  by  commercial  and 
political  interests  or  the  perceived  “civilizing  mis¬ 
sion”  of  the  colonial  powers.  At  the  same  time,  the 
study  of  ancient  civilizations,  ethnic  codes,  social 
hierarchies,  or  foreign  languages  was  part  of  the 
much  broader  process  of  the  extension  of  Western 
science  across  the  globe.  While  from  the  mid-18th 
century,  European  capitals  began  to  display  the 
treasures  and  arts  of  “exotic”  civilizations  as  much 
as  of  ancient  civilizations  in  public  museums,  the 
19th  century  saw  the  establishment  of  colonial 
studies  in  European  universities.  In  the  United 
States,  interdisciplinary  centers  for  area  studies 
first  emerged  after  World  War  I,  and  they  received 
a  strong  impulse  after  World  War  II,  parallel  to  the 
U.S.  rise  as  a  global  power.  A  better  understanding 
of  societies  in  Asia,  Africa,  the  Middle  East,  and 
Latin  America  was  seen  as  urgent  in  the  context  of 
the  Cold  War  rivalry  between  competing  super¬ 
powers  looking  for  local  clients  and  supporters, 
particularly  in  the  Third  World.  (A  similar,  secu¬ 
rity-driven  incentive  to  promote  the  study  of  for¬ 
eign  cultures  was  again  seen  after  the  terrorist 
attacks  of  September  11,  2001.) 

The  work  of  German  geographer  Alexander 
von  Humboldt  (1769-1859)  was  a  forerunner  of 
area  studies.  Von  Humboldt  was  also  one  of  the 
founding  fathers  of  multidisciplinarity,  which  then 
ranged  from  social  to  natural  sciences.  At  a  later 
stage,  a  critical  strand  of  area  studies  emerged  that 
openly  condemned  colonial  practices.  This  branch 


emphasized  respect  for  other  cultures,  challenged 
the  supposed  universality  of  the  Western  world¬ 
view  and  the  “Eurocentrism”  inherent  in  theories 
claiming  general  validity,  and  advocated  mutual 
learning  instead  of  unilaterally  copying  Western 
social  or  political  models. 

A  common  legacy  of  all  strands  of  area  studies, 
however,  is  that  they  almost  always  refer  to 
“other”  areas.  There  are  no  “German  studies”  in 
Germany  or  “U.S.  studies”  in  the  United  States. 
Until  today,  area  studies  have  focused  predomi¬ 
nantly  on  the  non-Organisation  for  Economic 
Co-operation  and  Development  (OECD)  world.  A 
remarkable  trend,  however,  has  been  the  emer¬ 
gence  and  expansion  of  “European  studies”  in  the 
wake  of  the  growing  importance  of  the  European 
Union. 

Area  Studies  and  Political  Science: 

An  Ongoing  Debate 

The  relationship  between  area  studies  and  the  dis¬ 
cipline  of  political  science  has  not  always  been  an 
easy  one.  Particularly  in  the  United  States,  the 
eclectic  use  of  changing  paradigms  by  area  special¬ 
ists  has  been  criticized.  Single-case  studies  have 
been  attacked  by  scholars  interested  in  generaliza¬ 
tion.  At  times,  area  specialists  have  been  expected 
to  simply  deliver  the  raw  data  for  universal  theo¬ 
ries.  Conversely,  area  specialists  have  accused  gen¬ 
eralists  of  superficiality,  claiming  that  an  in-depth 
knowledge  of  the  particular  language,  culture,  and 
social  context  is  needed  to  produce  adequate 
analyses  of  cases  that  do  not  conform  to  the 
OECD  “standard  model” — on  the  empirical  reali¬ 
ties  of  which  most  political  science  theory  has  been 
generated. 

Contemporary  political  science  has  developed 
most  strongly  in  the  post-World  War  II  period  and 
in  the  transatlantic  North.  This  explains  why  the 
bulk  of  existing  theories  and  methods  have  a  clear 
“transatlantic”  or  “OECD  bias.”  Democracy, 
elections,  state  and  administration,  security,  and 
so  on  are  conceptualized  in  mainstream  political 
science  using  idealized  blueprints  of  Western  soci¬ 
eties.  It  cannot  be  denied  that  these  concepts  have 
been  highly  influential  in  other  world  regions  as 
well  and  have  been  internalized  by  many  scholars, 
elites,  and  citizens  from  Argentina  to  Zimbabwe. 
However,  they  are  also  frequently  contested  and 
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compete  with  other  organizing  principles  of  soci¬ 
ety.  The  trajectories  toward  modernity  have  in  fact 
varied  enormously  around  the  globe,  with  religion, 
family  bonds,  or  industrialization  playing  diver¬ 
gent  roles  in  different  world  regions.  The  exclusive 
reliance  on  “imported”  models  without  local 
adaptation,  so  area  studies  scholars  have  argued, 
produces  systematic  errors  in  the  comprehension 
of  how  “other”  political  systems  function.  In  a 
more  and  more  globalized  world  and  for  advocates 
of  a  practice-oriented  approach  in  the  subdisci¬ 
pline  of  IR,  it  has  become  essential  to  understand 
the  motivations  and  capabilities  of  actors  in  the 
most  remote  places  of  the  globe  in  order  to  appro¬ 
priately  analyze  security  and  economic  challenges 
for  their  own  societies. 

Nonetheless,  a  balanced  view  of  the  strengths 
and  weaknesses  of  the  approaches  of  both  main¬ 
stream  political  science  and  area  studies  has  led  to  a 
more  fertile  relationship  and  compromises  between 
“nomothetic”  and  “idiographic”  worldviews.  Some 
of  the  most  prolific  strands  of  research  in  political 
science  have  been  strongly  influenced  by  area  spe¬ 
cialists;  for  instance,  the  understandings  of  corpo¬ 
ratism  as  well  as  of  transitions  to  democracy  in  the 
1980s  were  largely  developed  by  Latin  American 
studies  scholars,  before  their  concepts  “traveled”  to 
explain  European  and  other  cases.  More  recently, 
the  topic  of  the  consolidation  and  reform  of  “hybrid 
regimes”  has  been  studied  in  Africa  and  South  East 
or  Central  Asia,  and  much  of  the  study  of  persistent 
authoritarian  rule  draws  heavily  on  the  scholarship 
of  area  specialists  of  the  Middle  East  and  Asia. 
Similarly,  studies  of  clientelism  or  nationalism, 
informal  politics  or  political  culture,  and  develop¬ 
ment  models  or  revolution  would  not  be  possible 
without  the  insights  of  area  specialists.  Some  schol¬ 
ars  go  so  far  as  to  call  “area  studies”  a  misnomer, 
stating  that  area-focused  scholarship  by  political 
scientists  should  be  understood  as  an  integral  com¬ 
ponent  of  comparative  politics. 

The  “Orientalism”  Debate  and  the 
Demarcation  of  the  “Areas”  Studied 

Criticism  of  area  studies  has  also  been  raised  from 
within  the  regions  under  scrutiny,  most  prominently 
in  the  “orientalism”  debate  kicked  off  by  the  1978 
publication  of  Edward  Said’s  influential  critique  of 
Western  constructions  of  the  “Orient.”  Area  studies, 


according  to  this  critique,  expressed  an  imperialist 
and  condescending  worldview  regarding  the 
“other”;  thus,  the  object  of  research  had  to  be  rede¬ 
fined,  and  a  complete  overhaul  of  the  production  of 
academic  research  on  non-Western  societies  was 
necessary.  Postcolonial  studies  emerged  from  this 
line  of  thought  as  a  competing  paradigm  of  research 
that  sharply  criticized  mainstream  Western  aca¬ 
demic  approaches  as  being  part  of  an  international 
system  of  domination  in  continuity  with  the  colo¬ 
nial  past.  While  strongest  in  literary  theory  and 
cultural  studies,  postcolonialist  approaches  also 
concern  social  and  political  science. 

A  different  line  of  critique  has  come  from  schol¬ 
ars  in  countries  of  the  global  South,  who  consider 
themselves  political  scientists,  just  like  any  North 
American  or  European  scholar  studying  his  or  her 
own  country,  but  find  their  research  on  Brazilian 
parties,  South  African  trade  unions,  or  Indian  class 
structure  labeled  as  area  studies  rather  than  being 
accepted  as  political  science  on  an  equal  footing 
with  similar  research  in  the  OECD  world. 

A  particular  concern  in  area  studies  is  the  exact 
territorial  demarcation  of  the  “areas”  under  inves¬ 
tigation — all  the  more  so  given  the  recent  emphasis 
on  transnational  and  transregional  interrelation¬ 
ships.  Is  it  appropriate  that  African  studies  more 
often  than  not  deal  exclusively  with  the  Africa 
south  of  the  Sahara?  Put  differently,  is  North  Africa 
part  of  both  African  and  Arab  studies?  What  impli¬ 
cations  does  the  choice  between  “Arab  world”  and 
“Muslim  world” — with  an  emphasis  on  ethnicity 
rather  than  religion — have  for  the  understanding  of 
the  region?  Does  it  make  sense  to  group  Southeast 
Asian,  Central  Asian,  and  South  Asian  studies 
together  under  the  label  of  Asian  studies?  While 
Latin  American  studies  encompass  a  region  with  a 
largely  shared  historical  and  cultural  legacy,  some 
recent  initiatives  have  shifted  to  a  more  hemispheric 
approach  under  the  label  of  American  studies, 
which  includes  Canada  and  the  United  States. 

Intellectual  debates  on  these  matters  abound, 
but  the  persistence  of  the  existing  classifications  is 
a  sign  that  they  continue  to  provide  a  basis  for  the 
production  of  meaning.  Recent  research  has  in  fact 
argued  that  “regional  context”  is  one  crucial  vari¬ 
able  in  many  quantitative  research  designs,  and  in 
IR,  the  concepts  of  a  “world  of  regions”  and  an 
emerging  regional  architecture  in  world  politics 
have  become  an  important  focus  of  scholarly 
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attention,  implicitly  strengthening  the  key  assump¬ 
tions  underlying  the  concept  of  area  studies. 

Comparative  Area  Studies 

A  distinct  and  new  approach  is  “comparative  area 
studies.”  In  addition  to  intraregional  and  interre¬ 
gional  forms  of  comparison,  the  long-marginalized 
field  of  cross-regional  comparisons  has  emerged  as 
part  of  the  agenda,  for  example  the  comparison  of 
social  security  systems  in  Venezuela  and  India  or 
of  party  systems  in  Tanzania  and  Indonesia.  The 
case  for  comparative  area  studies  is  twofold:  First, 
given  that  the  emphasis  on  regional  expertise  has 
been  their  raison  d’etre,  the  different  branches  of 
area  studies  have  developed  largely  separated  from 
each  other  over  decades.  The  ability  of  concepts 
“to  travel”  from  one  regional  context  to  another 
has  yet  to  be  investigated  in  an  appropriate  way. 
While,  for  instance,  the  analysis  of  the  paradox  of 
concomitant  democratization  and  the  persistence 
of  neo-patrimonial  rule  was  pioneered  in  the 
African  context,  its  general  usefulness  can  be 
tested  on  Latin  American  and  Asian  cases.  Such 
cross-area  analysis  allows  the  detection  of  area 
specificities  as  much  as  commonalities.  A  more 
general  theory  may  emerge  more  easily  from  such 
a  research  design  than  from  single-area  approaches. 

The  second  line  of  argument  for  comparative 
area  studies  holds  that  cross-regional  small-N  com¬ 
parisons  are  particularly  well  suited  to  perform  a 
mediating  role  between  area-focused  scholarship 
and  the  general  theoretical  and  methodological 
debates  in  political  science.  While  there  will  be  few 
scholars  with  equally  profound  expertise  in  distant 
world  regions,  the  demanding  prerequisites  for 
conducting  such  cross-area  research  can  be  system¬ 
atically  fulfilled  through  collaborative  projects 
between  scholars  with  different  regional  expertise 
but  shared  research  interests.  Area  studies  are  not 
limited  to  a  particular  set  of  methods  or  schools 
but  employ  the  wide  array  of  methods  used  in  the 
disciplines  involved.  Typically,  however,  case  stud¬ 
ies  and  paired  and  small-N  comparisons  have  been 
dominant,  and — given  area  scholars’  emphasis  on 
profound  knowledge  of  the  area  under  investiga¬ 
tion — qualitative  approaches  have  traditionally 
been  stronger  than  quantitative  studies. 

Finally,  the  dynamic  development  and  increas¬ 
ing  self-assertion  of  scholarship  from  the  regions 


that  are  the  topic  of  area  studies  have  led  to  grow¬ 
ing  sensitivity  toward  ethical  considerations  voiced 
in  “the  South.”  Area  specialists  in  Northern  coun¬ 
tries  are  constantly  pressed  by  their  colleagues 
from  Africa,  Asia,  Latin  America,  or  the  Middle 
East  to  share  their  privileged  access  to  libraries, 
data,  and  funding.  One  element  of  a  more  collab¬ 
orative  approach  is,  for  example,  the  open-access 
movement  to  publish  social  science  findings  free  of 
charge  on  the  Internet  for  readers  worldwide.  This 
may  eventually  narrow  the  gap  in  the  access  to 
relevant  current  literature  between  the  “Harvards” 
and  the  “Have-nots.” 

Andreas  Mehler  and  Bert  Hoffmann 
German  Institute  of  Global  and  Area  Studies 

Hamburg,  Germany 

See  also  Comparative  Methods;  Hybrid  Regimes;  Neo- 
Patrimonialism;  Orientalism;  Postcolonialism 
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two  or  more  countries.  The  term  arms  race  is 
often  used  quite  loosely  to  refer  to  any  military 
buildup  or  spending  increases  by  a  group  of  coun¬ 
tries.  This  definition  requires  that  there  be  a  com¬ 
petitive  nature  to  this  buildup,  often  reflecting  an 
adversarial  relationship.  The  arms  race  concept  is 
also  used  fruitfully  in  other  fields,  especially  evo¬ 
lutionary  biology;  however,  this  entry  deals  only 
with  military  arms  races. 

Examples  of  Arms  Races 

One  example  of  an  arms  race  is  the  “Dreadnought” 
arms  race  between  Germany  and  Britain  prior  to 
World  War  I.  In  the  early  20th  century,  Germany 
as  a  rising  power  sought  to  challenge  the  United 
Kingdom’s  traditional  naval  dominance.  In  1906, 
Britain  launched  a  new,  more  advanced  warship, 
HMS  Dreadnought ,  triggering  a  naval  arms  race. 
Between  1909  and  the  outbreak  of  World  War  I  in 
1914,  Britain  launched  a  further  19  “dread¬ 
noughts”  and  a  further  nine  battle  cruisers,  while 
Germany  launched  13  dreadnoughts  and  five  bat¬ 
tle  cruisers.  This  arms  race  is  often  cited  as  one  of 
the  causes  of  World  War  I. 

The  Cold  War  nuclear  arms  race  between  the 
United  States  and  the  Soviet  Union  is  another 
example  of  a  20th-century  arms  race.  The  United 
States’  use  of  nuclear  weapons  to  end  World  War 
II  led  to  a  determined  effort  by  the  Soviet  Union  to 
acquire  these  weapons,  leading  to  a  long-running 
nuclear  arms  race  between  the  two  superpowers. 
The  Soviet  Union  conducted  its  first  nuclear  test  in 
1949.  At  the  end  of  1956,  the  United  States  had 
2,123  strategic  warheads  to  the  Soviet  Union’s  84. 
These  numbers  increased  rapidly  over  the  subse¬ 
quent  30  years.  The  U.S.  arsenal  peaked  in  1987  at 
13,002  warheads,  the  Soviet  Union  2  years  later  at 
11,320  (figures  from  Stockholm  International 
Peace  Research  Institute).  The  end  of  the  Cold  War 
effectively  ended  this  arms  race. 

Arms  races  may  involve  a  more  general  com¬ 
petitive  acquisition  of  military  capability.  This  is 
often  measured  by  military  expenditure,  although 
the  link  between  military  expenditure  and  capabil¬ 
ity  is  often  quite  weak.  Such  more  general  arms 
races  are  often  observed  among  countries  engaged 
in  enduring  rivalries,  which  may  sometimes  appear 
to  follow  each  other’s  military  spending  levels, 
especially  during  periods  of  heightened  tension. 


Examples  of  such  arms  races  include  India- 
Pakistan,  Israel-Arab  states,  Greece-Turkey,  and, 
since  1991,  Armenia-Azerbaijan. 

Consequences  of  Arms  Races 

Arms  races  are  frequently  regarded  as  negative 
occurrences  in  both  economic  and  security  terms. 

Large-scale  arms  acquisitions  require  consider¬ 
able  economic  resources.  If  two  countries  spend 
large  sums  of  money  just  to  cancel  out  each  other’s 
efforts,  the  expenditure  might  well  be  seen  as 
wasted.  There  is,  however,  considerable  debate 
surrounding  the  economic  effect  of  military  spend¬ 
ing.  Some  argue  that  it  provides  benefits  through 
technological  spin-offs,  job  creation,  and  infra¬ 
structure  development;  others  argue  that  it  dis¬ 
places  more  productive  forms  of  investment,  while 
its  final  output  is  not  itself  productive.  Certainly, 
countries  that  must  import  arms  will  see  more 
negative  economic  effects  of  an  arms  race,  and 
arms  imports  are  a  major  contributor  to  Third 
World  debt.  Even  for  arms-producing  countries, 
excessive  military  expenditure  is  likely  eventually 
to  have  negative  economic  consequences.  The 
Soviet  Union’s  economic  difficulties  were  certainly 
exacerbated  by  the  very  high  proportion  of  the 
gross  domestic  product  devoted  to  the  arms  race. 

The  question  of  whether  arms  races  contribute 
to  the  outbreak  of  war  is  also  the  subject  of  con¬ 
siderable  debate.  An  arms  race  may  heighten  fear 
and  hostility  on  the  part  of  the  countries  involved, 
but  whether  this  contributes  to  war  is  hard  to 
gauge.  Some  empirical  studies  do  find  that  arms 
races  are  associated  with  an  increased  likelihood  of 
war;  however,  it  is  not  possible  to  say  whether  the 
arms  race  was  itself  a  cause  of  war  or  merely  a 
symptom  of  existing  tensions. 

One  may  also  consider  the  gains  for  a  country 
that  “wins”  an  arms  race  in  the  sense  of  gaining  a 
decisive  military  advantage.  Arguably,  the  collapse 
of  the  Soviet  Union,  which  left  the  United  States  as 
the  sole  global  superpower,  was  partly  due  to  the 
cost  of  attempting  to  keep  up  with  the  United 
States. 

Modeling  Arms  Races 

There  is  an  extensive  literature  modeling  arms 
races  theoretically  and  empirically.  These 
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include  game-theoretic  models  based  on  the 
“prisoners’  dilemma”  (PD),  dynamic  mathemat¬ 
ical  models  based  on  the  Richardson  model,  and 
economic  models  frequently  based  on  a  “utility- 
maximizing”  framework.  There  is  overlap  between 
these  categories. 

Prisoners’  Dilemma  Models 

The  famous  PD  game  is  frequently  applied  to 
arms  races  between  two  countries.  It  is  assumed 
that  each  country  has  a  choice  between  a  high  or 
low  level  of  arms.  Each  country’s  most  preferred 
outcome  is  assumed  to  be  where  they  choose  high 
arms  and  their  rivals  low,  gaining  a  clear  military 
advantage.  Their  least  preferred  outcome  is  the 
reverse.  However,  the  second  best  outcome  for 
each  is  where  both  choose  low;  if  no  advantage  is 
gained,  it  is  assumed  to  be  cheaper  and  more 
secure  to  avoid  the  arms  race. 

Each  player’s  “dominant  strategy”  is  to  choose 
high  arms,  as  whichever  choice  the  rival  has  made, 
they  do  better  by  choosing  high  than  low.  This  out¬ 
come  illustrates  the  Nash  equilibrium  (named  after 
John  Nash),  in  which  each  player  has  chosen  the 
optimal  strategy  in  a  noncooperative  situation  or 
zero-sum  game,  given  the  other  player’s  strategy. 
The  Nash  equilibrium  outcome  of  the  game  is 
therefore  that  both  choose  high.  It  is  an  equilibrium 
because  neither  player  would  change  their  own 
choice,  given  the  choice  of  the  rival.  However,  this 
arms  race  outcome  is  worse  for  both  players  than  if 
both  had  chosen  low  arms.  The  logic  of  their  rivalry 
traps  them  in  a  mutually  disadvantageous  situation. 

However,  the  picture  may  not  necessarily  be  so 
bleak,  as  in  reality  the  “game”  is  not  played  once 
and  for  all  but  is  an  ongoing  series  of  decisions, 
which  can  be  modeled  by  the  “iterated  prisoners’ 
dilemma”  (IPD),  where  the  PD  game  is  played 
repeatedly  by  the  same  players.  This  opens  the  pos¬ 
sibility  for  cooperation  to  emerge  through  reward 
and  punishment  strategies  such  as  “tit  for  tat” — 
start  by  choosing  low  arms  but  then  match  the 
strategy  chosen  by  the  other  player  in  the  previous 
round.  Experiments  have  shown  this  to  be  a  highly 
successful  strategy  in  IPD  games. 

While  greatly  simplifying  real-world  situations, 
the  PD  may  be  a  useful  metaphor  to  capture  the 
essential  dilemma  facing  countries  engaged  in  an 
enduring  rivalry. 


The  Richardson  Model  and  Its  Elaborations 

In  his  seminal  work  Arms  and  Insecurity ,  Lewis 
Fry  Richardson  proposed  a  model  (which  he 
applied  to  the  Dreadnought  race)  of  an  arms  race 
between  two  countries  where  each  country  sets  its 
military  expenditure  (or  arms  acquisition)  level  in 
each  period  based  on  its  own  and  its  rival’s  level  in 
the  previous  period  in  an  “action-reaction”  pat¬ 
tern.  This  is  modeled  by  the  following  equations: 

Ml,  =  -a\M\t_x  +  +  gl, 

M2t  =  -a2M2t_1  +  b2Mlt_1  +  g2. 

Here,  Ml,  and  M2,  refer  to  the  military  spending 
levels  of  Countries  1  and  2  in  years  t  and  t  —  1, 
respectively.  The  coefficients  at  and  a2  (assumed  to 
be  positive)  are  “fatigue”  coefficients,  representing 
the  difficulty  of  maintaining  high  levels  of  military 
spending.  The  coefficients  b  1  and  b. 2,  also  positive, 
are  “reaction”  coefficients,  measuring  the  tendency 
for  each  country  to  respond  to  the  military  spending 
of  their  rival,  while  gl  and  g2  are  autonomous 
“grievance”  or  “ambition”  terms,  representing  each 
country’s  desire  for  military  capability  apart  from  the 
rivalry.  Depending  on  the  relative  size  of  the  fatigue 
and  reaction  coefficients,  the  arms  race  can  either 
reach  a  stable  equilibrium  or  spiral  out  of  control. 

The  basic  Richardson  model  has  been  extensively 
developed  by  other  authors,  both  theoretically  and 
empirically.  Developments  include  taking  into 
account  the  stock  of  weapons  of  each  country  as 
well  as  the  rate  of  spending,  introducing  explicit 
economic  criteria,  and  modeling  the  strategic  dynam¬ 
ics  of  the  relationship.  Richardson  models  can  easily 
be  applied  to  empirical  military  spending  data,  using 
regression  analysis  to  estimate  the  parameters  of  the 
equation  for  a  pair  of  countries.  The  key  question  is 
whether  the  reaction  terms  b  1  and  b2  are  signifi¬ 
cantly  greater  than  zero — if  so,  an  “action-reaction” 
or  Richardsonian  arms  race  is  said  to  exist.  A  wide 
variety  of  theoretical  models  and  statistical  tech¬ 
niques  starting  from  the  Richardson  framework 
have  been  applied  to  various  pairs  of  countries. 

Economic  Models 

A  third  approach  is  to  assume  that  countries’ 
military  expenditure  decisions  are  the  outcome  of 
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an  economic  resource  allocation  process  whereby 
the  government  seeks  to  achieve  a  set  of  economic, 
political,  and  security  objectives  by  allocating 
spending  between  military  and  civil  sectors. 
Neoclassical  rational  choice  models  are  most  fre¬ 
quently  employed,  but  others  are  also  used.  When 
two  rival  countries  are  considered,  each  country’s 
level  of  security  is  assumed  to  depend  on  both  its 
own  level  and  its  rivals’  level  of  military  spending. 
Each  country  makes  its  decision  taking  into 
account  the  likely  response  of  the  other.  The 
resulting  models  are  similar  to  certain  elaborations 
of  the  Richardson  model:  While  one  starts  from 
the  arms  race  and  builds  in  economic  allocation 
issues,  the  other  starts  with  the  allocation  problem 
and  builds  in  the  rivalry. 

Weaknesses  of  Richardsonian  Models 

While  the  Richardson  model  has  proved  extraor¬ 
dinarily  fruitful  in  generating  scholarly  analysis  of 
arms  races,  the  Richardson  approach  has  a  rather 
poor  empirical  record  in  demonstrating  the  exis¬ 
tence  of  actual  arms  races.  Some,  though  not  all, 
studies  of  India  and  Pakistan  have  found  evidence 
of  a  Richardsonian  arms  race,  but  few  other 
enduring  rivalries  have  produced  such  empirical 
results  and  none  consistently. 

A  problem  of  the  Richardson  model  is  that  it 
assumes  that  the  parameters  of  the  relationship 
(the  values  of  a,  b,  and  g)  remain  constant,  whereas 
in  reality  they  may  change  over  time  depending  on 
the  changing  relationship  between  the  countries.  A 
second  is  that  it  is  most  applicable  to  pairs  of  coun¬ 
tries  where  the  rivalry  is  the  overwhelming  factor 
for  each  country’s  security,  which  is  relatively  rare. 

One  suggestion  is  that  the  changing  levels  of  ten¬ 
sion  or  hostility  between  countries  may  be  a  better 
way  of  explaining  their  military  spending  decisions 
than  the  Richardson  action-reaction  framework — 
or  perhaps  a  combination  of  levels  of  hostility  with 
the  rival’s  military  spending.  This  would  suggest 
that  arms  races  are  characterized  more  by  short 
bursts  of  rapidly  increasing  spending  during  peri¬ 
ods  of  high  tension  than  by  long-term,  stable  rela¬ 
tionships  between  their  levels  of  spending. 

Sam  Perlo-Freeman 
Stockholm  International  Peace  Research  Institute 

Solna,  Sweden 
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Attitude  Consistency 


Three  types  of  attitude  consistency  can  be  distin¬ 
guished:  (1)  consistency  among  the  various  compo¬ 
nents  of  an  attitude,  (2)  consistency  between  differ¬ 
ent  attitudes,  and  (3)  consistency  between  attitudes 
and  behavior.  This  entry  addresses  the  relevant  con¬ 
ceptualizations,  measurements,  and  explanations. 

Intra-Attitudinal  Consistency 

An  attitude  is  said  to  be  consistent  when  all  elements 
elicit  similar  evaluative  judgments.  Intra-attitudinal 
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inconsistency  arises  when  some  of  these  elements 
are  positive  in  nature,  while  others  are  negative. 
Within  the  cognitive  component,  accessible  beliefs 
may  conflict,  cognitions  and  affects  may  also  con¬ 
flict,  and  the  various  affects  may  differ,  simultane¬ 
ously.  Intra-attitude  consistency  can  be  examined 
by  comparing  how  respondents  feel,  how  they 
think,  and  what  their  summary  evaluation  is.  The 
many  studies  that  showed  low  correlations  and 
consistency  between  the  cognitive,  affective,  and 
conative  (behavioral)  components  have  raised 
doubts  about  the  early  theorists’  multicomponent 
view  of  attitudes.  Martin  Fishbein  and  leek  Ajzen 
restricted  the  term  attitude  to  a  person’s  evaluation 
and  drew  a  clear  distinction  between  cognitions, 
attitudes,  behavioral  intentions,  and  behaviors. 
From  their  point  of  view,  an  attitude  is  a  person’s 
general  feeling  of  favorableness  or  unfavorableness 
and  an  index  of  the  degree  to  which  a  person  likes 
or  dislikes  an  object.  This  one-component  view  of 
attitude  opened  the  way  to  the  development  of 
more  sophisticated  explanatory  models  in  which 
the  summary  evaluation  is  at  the  core  of  an  atti¬ 
tude,  while  behavior  or  the  intention  thereto  is  an 
effect,  and  cognition  (knowledge  and  beliefs)  and 
affect  (general  moods  and  specific  emotions)  are 
origins  of  the  attitude.  The  affects  underlying  atti¬ 
tudes  are  more  easily  accessible  in  memory  than 
the  cognitions,  and  they  tend  to  predominate 
among  ambivalent  respondents  with  different  eval¬ 
uative-cognitive  and  evaluative-affective  consisten¬ 
cies.  Individuals  differ  in  their  tendency  to  base 
their  attitudes  on  cognition  or  affect,  as  Geoffrey 
Haddock  and  Mark  Zanna  have  shown.  Consistent 
attitudes  were  found  to  be  stronger,  more  predic¬ 
tive  of  subsequent  intentions  and  behavior,  more 
resistant  to  persuasive  communication  and  change, 
and  thus  more  stable  than  inconsistent  attitudes. 
Inconsistent  attitudes  seem  to  result  in  more  infor¬ 
mation  processing  and  knowledge  acquisition. 

Interattitudinal  Consistency 

Attitudes  toward  the  same  object  and  attitudes 
toward  different  objects  are  said  to  be  consistent 
when  they  do  not  contradict  each  other. 
Interattitudinal  consistency  with  respect  to  the 
same  object  (dual-attitude  model)  is  empirically 
examined  by  the  extent  to  which  a  respondent’s 
explicit  and  implicit  or  habitual  attitudes  are 


related  to  one  another.  Interattitudinal  consistency 
with  respect  to  different  objects  is  empirically 
examined  by  the  extent  to  which  these  attitudes 
are  related  to  one  another  and/or  the  level  of  cor¬ 
respondence  between  the  respondent’s  attitudes 
and  his  or  her  ideological  self-identification  (e.g., 
consistent  liberal  or  conservative  attitudes),  left- 
right  self-placement,  and  respondent’s  value  sys¬ 
tem  and/or  the  number  of  issues  on  which  a 
respondent  adopts  his  or  her  (liberal  or  conserva¬ 
tive)  party’s  position.  Interattitudinal  consistency 
is  considered  desirable  from  the  point  of  view  of 
“correct”  voting  in  representative  democracies, 
requiring  citizens  to  make  well-founded  voting 
decisions.  Scholars  vary  in  their  views  on  the  level 
of  interattitudinal  consistency.  Philip  Converse 
was  one  of  the  first  who  showed  that  the  political 
attitudes  that  many  people  have  are  not  consistent 
and  referred  to  this  aspect  of  public  opinion  as 
“nonattitudes.”  However,  the  evidence  presented 
by  Max  Kaase  and  Kenneth  Newton,  using 
Eurobarometer  data,  suggests  that  public  opinion 
(in  Western  Europe)  is  relatively  internally  consis¬ 
tent  (at  both  the  aggregate  and  the  individual  lev¬ 
els).  Mass  opinion  about  the  welfare  state  and  the 
scope  of  government  is  structured  into  relatively 
few  attitude  publics.  Mass  attitudes  about  spend¬ 
ing  on  government  services  and  attitudes  toward 
policy  issues  cluster  into  three  groupings  of  related 
issues  (welfare,  security,  and  quality  of  life).  There 
is  also  a  match  between  saying  that  a  particular 
issue  has  high  priority  as  a  public  service  and 
expressing  a  willingness  to  pay  taxes  for  it.  Finally, 
the  statistical  associations  with  left-right  orienta¬ 
tion  and  party  identification  are  usually  stronger 
than  with  any  of  the  social,  economic,  and  demo¬ 
graphic  variables.  However,  this  left-right  factor 
generally  explains  only  a  small  proportion  of  the 
variance  in  issue  positions.  A  factor  that  can  also 
help  us  understand  the  structure  of  political  atti¬ 
tudes  is  values;  there  is  now  a  substantial  amount 
of  evidence  that  values  are  a  major  source  of  struc¬ 
ture  for  political  attitudes,  according  to  Stanley 
Feldman. 

Explanations  of  Attitude  Consistency 

Consistency  and  inconsistency  have  both  interper¬ 
sonal  and  intrapersonal  sources.  Political  socializa¬ 
tion  theory,  represented  by,  among  others,  David 
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Easton,  Kent  Jennings,  Richard  Niemi,  Judith 
Torney,  and  Russell  Farnen,  emphasizes  interper¬ 
sonal  and  contextual  factors  and  the  level  of  con¬ 
sistency  or  inconsistency  of  the  cognitive  and  affec¬ 
tive  messages  that  people  receive  from  the  various 
socialization  agencies  (including  political  leaders, 
mass  media,  friends  and  colleagues,  religious  insti¬ 
tutions,  school,  family)  in  particular.  The  scale 
runs  from  very  high  message  consistency  in  totali¬ 
tarian  regimes  and  autocracies,  via  moderate  con¬ 
sistency  in  representative  democracies  with  weak 
elite  polarization,  to  very  low  consistency  in 
democracies  with  strong  elite  polarization. 

Various  intrapersonal  factors  have  been  identi¬ 
fied.  Individuals  differ  in  their  chronic  tendency  to 
engage  in  evaluative  responding  (which  can  be 
measured  by  a  set  of  items  developed  by  Blair 
Jarvis  and  Richard  Petty).  People  also  differ  in 
their  attempts  to  reduce  or  avoid  the  tension  that 
they  experience  when  they  become  aware  of  con¬ 
tradictions.  To  avoid  the  anticipated  tension,  peo¬ 
ple  prefer  exposure  to  consonant  media  and  mes¬ 
sages  (e.g.,  Fritz  Heider’s  balance  theory  and  Feon 
Festinger’s  cognitive  dissonance  theory).  The  level 
of  political  knowledge  is  another  important  vari¬ 
able.  In  general,  the  mean  level  is  very  low  but  the 
variance  is  very  high.  The  higher  the  level  of 
political  knowledge,  the  more  tightly  structured  or 
“constrained”  the  individual’s  system  of  attitudes, 
as  Converse  has  shown.  A  few  theoretical  frame¬ 
works  have  been  developed  as  attempts  to  explain 
the  organization  of  political  and  social  attitudes. 
Converse’s  “ideological  constraint  theory”  holds 
that  individual  attitudes  become  consistent  thanks 
to  deduction  from  higher  order  orientations  such 
as  ideology.  People  differ  in  attitude  consistency 
because  they  differ  in  the  use  of  ideological  sche¬ 
mas.  Howard  Favine  and  Cynthia  Thomsen  have 
developed  the  “shared-consequences  model”  to 
understand  the  psychological  mechanisms  through 
which  structural  organization  and  consistency 
between  attitudes  develop.  The  core  of  the  model 
is  the  strength  of  the  perceived  implicational  rela¬ 
tionship  between  a  pair  of  attitude  objects. 
Perceiving  two  policies,  for  example,  legalized 
abortion  and  capital  punishment,  as  instrumen- 
tally  influencing  a  similar  set  of  consequences,  for 
example,  the  attainment  of  similar  sets  of  value- 
related  goals,  constitutes  a  strong  implicational 
relationship  between  them,  which,  in  turn,  creates 


consistency  pressures.  Consistency,  then,  is  the 
psychological  agreement  between  the  valences  of  a 
pair  of  attitudes  and  the  valence  of  the  implica¬ 
tional  relationship  perceived  to  exist  between 
them.  The  development  of  interattitudinal  organi¬ 
zation  may  proceed  through  a  series  of  distinct 
stages:  Interattitudinal  connections  may  initially  be 
derived  from  affective  associations  that  issues  have 
in  common,  next  to  self-  and  group-interest  con¬ 
siderations,  then  from  values,  and  finally  from 
ideology. 

Attitude-Behavior  Consistency 

Attitudes  are  studied  primarily  because  of  their 
expected  effects  on  behavior.  It  is  expected  that  the 
more  favorable  an  individual’s  attitude  toward 
some  object,  the  more  likely  the  individual  will  be 
to  perform  any  given  positive  behavior  and  the  less 
likely  he  or  she  will  be  to  perform  any  negative 
behavior  with  respect  to  the  object.  In  Allan 
Wicker’s  analysis  of  45  studies,  however,  only  a 
small  minority  of  studies  showed  a  strong  or  modest 
positive  relationship  between  attitude  and  behavior. 
Hans  Benninghaus’s  analysis  of  57  studies  sup¬ 
ported  Wicker’s  findings.  Inconsistency  between 
attitude  and  behavior  is  higher  if  the  attitude  object 
is  not  related  to  one’s  self-interest,  if  intra-attitude 
inconsistency  is  high,  and  if  one’s  attitude  conflicts 
with  the  subjective  norm  or  the  perception  of  how 
relevant  others  think  one  should  behave. 

In  an  attempt  to  improve  the  attitude-behavior 
link,  the  conceptualizations  and  operationaliza¬ 
tions  of  the  two  key  concepts  have  been  reconsid¬ 
ered.  Fishbein  and  Ajzen  introduced  the  concep¬ 
tual  restriction  of  attitudes  to  a  person’s  evaluation 
or  feeling  of  favorableness  or  unfavorableness. 
With  respect  to  behavior,  distinctions  were  intro¬ 
duced,  for  instance,  between  behavioral  categories 
and  single  actions,  isolated  individual  behavior 
and  behavior  in  the  presence  of  others,  and  behav¬ 
iors  under  much  or  little  volitional  control.  In 
subsequent  behavior-explanatory  theories,  attitude 
is  still  an  independent  variable,  though  not  the  one 
and  only,  the  most  important,  or  the  most  immedi¬ 
ate  variable.  In  Fishbein  and  Ajzen’s  “theory  of 
reasoned  action,”  which  presents  a  causal  chain  to 
predict  behavior,  the  immediate  determinant  of 
behavior  is  the  individual’s  intention  to  perform 
the  behavior  in  question.  Attitudes  toward  the 
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behavior  and  the  “subjective  norm”  are  the  two 
determinants  of  the  individual’s  behavioral  inten¬ 
tion.  Thus,  behavioral  and  normative  beliefs  are 
the  antecedents  of  attitude  and  subjective  norm 
and,  as  a  result,  ultimately  determine  intention  and 
behavior.  This  reflects  a  cognitive  approach  to 
explaining  attitudes  and  a  rational  view  on  human 
behavior.  One  of  the  main  comments  on  the  rea- 
soned-action  theory  was  that  this  theory  is  only 
applicable  to  purely  volitional  behavior.  Ajzen’s 
“theory  of  planned  behavior”  extends  the  theory 
of  reasoned  action  by  adding  the  perception  of 
control  over  behavior.  Perceived  behavioral  con¬ 
trol  is  also  cognitive  and  rational  since  it  is  the 
individual’s  set  of  beliefs  about  the  presence  or 
absence  of  requisite  resources  and  opportunities. 
New  meta-analyses  by,  among  others,  Min-Sun 
Kim  and  John  Hunter  and  Christopher  Armitage 
and  Mark  Conner,  showed  stronger  attitude- 
behavior  relationships  and  a  relatively  strong  pre¬ 
dictive  power  of  the  theory  of  planned  behavior. 
Many  studies  have  demonstrated  the  applicability 
of  the  theory  in  various  content  domains,  includ¬ 
ing  the  political.  In  Marco  Perugini  and  Mark 
Conner’s  “model  of  goal-directed  behavior,”  the 
basic  variables  of  the  planned  behavior  theory 
were  added,  together  with  three  other  key  vari¬ 
ables:  desire/motivation,  emotions,  and  habit. 

In  general,  affective  variables  are  gaining  in 
importance  in  theory  and  research  aiming  at 
explaining  the  consistency  between  attitudes  and 
behavior.  Promising  political  science  theories, 
building  on  new  insights  from  political  neurosci¬ 
ence,  are  George  Marcus’s  “affective  intelligence 
theory,”  Milton  Lodge  and  Charles  Taber’s  “the¬ 
ory  of  motivated  political  reasoning,”  and  Richard 
Lau  and  David  Redlawsk’s  “model  of  motivated 
reasoning.  ” 
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Individuals  hold  attitudes — and  eventually  change 
them — in  relation  to  a  wide  diversity  of  topics. 
They  eventually  establish  connections  between 
one  topic  and  another.  Thus,  for  example,  for 
some  individuals,  attitudes  toward  a  political 
topic  may  be  based  on  attitudes  toward  other  sub¬ 
jects  not  related  to  politics.  The  idea  that  political 
attitudes  constitute,  or  should  constitute,  a  sepa¬ 
rate  field  has  two  main  roots.  One  is  the  tradition 
connecting  the  sphere  of  politics  with  the  kind  of 
well-articulated  sets  of  ideas  frequently  called 
“ideologies”;  the  other  is  the  normative  tradition 
postulating  that  human  beings  should  approach 
the  realm  of  politics  in  a  way  that  is  different  from 
how  they  normally  approach  other  spheres  of  life, 
basically  by  putting  aside  impulses,  affects,  and 
passions.  We  hear  and  record  opinions,  but  no 
one  can  hear  or  see  attitudes.  Attitudes  lie  within 
individuals’  minds;  therefore,  they  are  nonobserv¬ 
able,  or  latent,  properties.  To  speak  about  “atti¬ 
tudes,”  we  need  to  build,  analytically,  the  aspects 
of  reality  the  existence  of  which  we  postulate.  The 
concept  of  attitude  as  it  is  employed  in  various 
political  contexts  and  the  factors  shaping  them  are 
discussed  as  follows. 
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Definition  and  Measurement 

A  critical  problem,  as  ancient  as  the  notion  of 
“attitudes”  itself,  is  how  to  measure  them.  Being, 
by  definition,  unobservable  aspects  of  reality,  any 
attempt  to  measure  them  is  based  on  assump¬ 
tions.  The  first  and  best  known  attempts  to  mea¬ 
sure  attitudes  employed  projective  tools,  scales, 
attitudinal  indices,  and  mathematical  models 
allowing  inferences  about  latent  structures 
through  manifest  observations  and  data.  All  of 
these  tools  are  based  on  the  principle  of  a  stimu¬ 
lus  (a  question)  and  a  response  (a  verbal  answer) 
that  elicits  some  unobservable  mental  state.  The 
idea  that  attitudes  can  be  measured  by  recording 
manifest  verbal  expressions  from  which  they  can 
be  inferred  had  a  significant  impact  on  the  devel¬ 
opment  of  the  techniques  of  quantitative  survey 
analysis.  This  concept  provided  a  logic,  a  method¬ 
ological  and  an  empirical  support  for  the  field  of 
opinion  research. 

Let  us  think  of  voting.  How  do  we  account  for 
the  particular  decision  each  voter  makes  at  the 
ballot  box?  Is  this  act  something  determined  from 
outside  himself  or  herself,  such  as  his  or  her  class 
position,  or  is  it  derived  from  some  kind  of 
“objective  interest”?  Is  it  a  random  impulse?  Or 
is  it  a  consequence  of  the  multiple  messages  an 
individual  is  exposed  to  in  a  communication  pro¬ 
cess  called  an  electoral  campaign?  Or  is  it  to  a 
great  extent  the  result  of  media  and  public  rela¬ 
tions  manipulation?  Or  is  it  the  result  of  influ¬ 
ences  and  pressures  generated  in  the  social  groups 
the  person  belongs  to — family,  the  neighbor¬ 
hood,  friends,  colleagues,  and  peer  groups?  Is  the 
vote  the  product  of  political  convictions  and 
ideas  of  an  individual  and  his  or  her  ability  to 
match  those  ideas  with  the  ones  the  candidates 
communicate?  Or  is  it  just  the  result  of  prefer¬ 
ences  on  any  of  the  attributes,  or  issues,  the  voter 
sees  in  a  candidate? 

However,  political  convictions  or  preferences 
cannot  be  directly  observed.  They  can  only  be 
inferred  from  what  a  particular  voter  says.  This 
nonobservable,  latent,  level  is  called  attitude :  the 
propensity  to  produce  certain  behavioral  responses 
relating  to  specific  subjects,  once  the  individual  has 
been  exposed  to  a  stimulus.  To  the  extent  that  such 
attitudes  determine  opinions  and  behavior,  the  indi¬ 
vidual  is  an  autonomous  actor  rather  than  a  passive 
object  moved  by  causes  external  to  him  or  her. 


Attitudes  or  predispositions,  as  they  are  fre¬ 
quently  called,  vary  from  one  individual  to  another. 
For  some  individuals,  “ideas,”  in  the  sense  of  a  set 
of  articulated  beliefs,  matter;  to  others,  tastes  are 
more  important.  For  some  individuals,  particular 
issues  are  more  relevant  than  general  concepts;  for 
others,  moral  values  are  more  important  than  par¬ 
ticular  preferences.  So,  some  people  may  vote 
because  of  the  positions  of  candidates  or  parties  on 
specific  issues,  whereas  others  may  vote  with 
regard  to  more  general  public  policies.  Some  indi¬ 
viduals  prefer  a  candidate  just  because  they  like  or 
trust  him  or  her,  or  they  base  their  choices  on  the 
perceived  honesty  of  the  candidate  irrespective  of 
his  or  her  standing  on  public  policy  issues. 

The  idea  that  there  are  voters  who  make  their 
decisions  in  an  erratic,  random  way  has  not  been 
alien  to  the  social  sciences.  Frequently,  this  idea 
implies  the  assumption  that  individuals  are  strongly 
dependent  on  mass  media,  leaders,  or  material 
incentives,  so  they  are  not  really  autonomous 
actors.  In  any  case,  attitudes  are  important  factors 
accounting  for  individual  behavior  and  thought. 
They  constitute  a  term  in  an  equation  intended  to 
describe  why  autonomous  persons  do  the  things 
they  do,  other  terms  being  information  coming  to 
the  individual  from  society  and  pressures  and 
influences  from  the  near  environment. 

The  standard  representation  of  attitudes  postu¬ 
lates  the  existence  of  levels  of  generality  and  stabil¬ 
ity  in  the  judgments  people  are  able  to  formulate 
about  themselves  and  the  external  world.  A  higher 
degree  of  generality  means  that  an  idea  refers  not  to 
a  single  or  particular  subject  but  to  a  wider  set  of 
topics;  a  higher  degree  of  stability  means  that  ideas 
sustained  through  time  resist  the  impact  of  new 
information  the  individual  is  continually  exposed 
to.  In  these  assumed  hierarchies  of  judgments  each 
individual  holds  in  his  or  her  mind,  there  are,  on 
one  extreme  of  the  continuum,  the  more  general 
and  stable  ones — values,  values,  ideologies;  on  the 
other  pole,  judgments  refer  to  particular  issues — 
opinions;  and  in  the  middle  are  judgments  that  are 
less  general  and  stable  than  the  first  ones  but  not  as 
occasional  or  particular  as  the  latter — attitudes. 
The  expression  belief  system  is  sometimes  used  to 
refer  to  the  relationship  between  the  different  atti¬ 
tudes  a  person  can  hold  in  a  stable  manner. 

The  term  attitude  denotes  an  individual’s  pro¬ 
pensity  to  think  and  to  form  opinions  in  a  stable 
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way  when  he  or  she  refers  to  a  specific  topic. 
“Attitude”  also  refers  to  a  propensity  in  the  indi¬ 
vidual  to  mobilize  certain  mental  contents,  stable 
points  of  view,  every  time  he  or  she  formulates 
specific  judgments,  utters  opinions,  or  decides  on 
certain  conducts. 

Citizens  do  have  predispositions  toward  public 
affairs,  governments,  and  elections.  These  vary 
among  individuals.  For  example,  some  individuals 
hold  conservative  points  of  view,  others  hold  lib¬ 
eral  ones;  some  are  nationalist,  others  universalist; 
some  favor  a  certain  public  policy,  others  do  not. 

Some  dimensions  of  predispositions  are  thought 
of  as  if  they  were  a  given,  by  a  sort  of  necessity,  in 
each  individual.  Thus,  it  is  frequently  assumed  that 
individuals  are  always  either  liberal  or  conserva¬ 
tive,  either  left  wing  or  right  wing.  Empirical  evi¬ 
dence  in  many  countries  does  not  support  such 
assumptions.  Sometimes,  the  evidence  instead  sug¬ 
gests  that  there  are  individuals  who  to  some  extent 
may  be  both  liberal  and  conservative,  depending 
on  the  issues  that  are  considered.  In  some  coun¬ 
tries,  people  define  themselves  as  oriented  to  the 
left,  right,  or  center;  in  other  countries,  these  con¬ 
cepts  are  not  relevant  for  many  people. 

Attitude  Formation 

How  these  predispositions  are  formed  is  a  matter 
of  debate  among  social  scientists.  Predispositions 
may  result  from  genetic  factors,  socialization, 
learning,  or  many  other  influences.  Two  main  tra¬ 
ditions  have  developed  within  the  social  sciences. 
One  places  emphasis  on  interests :  Alexis  de 
Tocqueville  said  that  people’s  opinions  change 
with  their  fortunes.  The  other  focuses  on  member¬ 
ship  groups  each  individual  takes  part  in:  Emile 
Durkheim  talked  of  “likenesses  of  consciences”  in 
contrast  to  the  effects  of  the  division  of  labor.  The 
idea  that  each  social  position  must  correspond  to  a 
particular  set  of  attitudes  is  deeply  rooted  in  social 
theory;  it  frequently  receives  empirical  support, 
but  the  explanation  to  account  for  that  correlation 
is  far  from  being  consensual. 

The  theory  of  attitudes  postulates  an  inferred, 
nonobservable  relationship  between  a  given  attitu- 
dinal  dimension,  which  is  by  definition  unobserv¬ 
able,  and  a  manifest,  observable  one.  We  can 
conceive  of  that  relationship  as  a  probabilistic  one, 
in  the  sense  that  there  is  no  perfect  link  between 


the  real  position  of  a  person  on  the  attitudinal 
dimension  and  his  or  her  manifest  answer  to  a 
particular  question  at  a  given  time.  An  individual 
can  be  at  a  certain  latent  state  in  the  dimension  of 
“nationalism,”  but  that  position  does  not  deter¬ 
mine  a  single  answer  the  individual  would  produce 
to  a  given  question  aimed  to  elicit  that  attitude.  A 
student  who  knows  the  content  of  a  given  topic 
very  well  is  supposed  to  answer  all  the  questions  in 
an  exam  correctly,  but  sometimes  he  or  she  fails  to 
do  so. 

We  speak,  for  example,  about  attitudes  toward 
the  government.  We  could  conceive  of  these  atti¬ 
tudes  as  ranging  from  a  very  favorable  attitude  to 
a  very  unfavorable  one.  To  measure  that  attitude 
in  a  particular  individual  at  a  given  time,  we  may 
ask  him  or  her:  “What  do  you  think  about  the 
performance  of  the  present  government?”  and  we 
possibly  offer  him  a  limited  set  of  options:  “very 
well,”  “fairly  well,”  “about  average,”  “fairly 
bad,”  or  “very  bad”  or  5,  4,  3,  2,  1,  where  5  is  a 
very  favorable  and  1  a  very  unfavorable  evalua¬ 
tion.  The  relation  is  probabilistic,  because,  even  if 
the  individual  usually  is  in  favor  of  the  current 
government,  some  days  he  or  she  will  not  necessar¬ 
ily  give  the  same  answer,  assuming  we  ask  the 
same  question  in  different  ways  and  that  he  or  she 
does  not  remember  the  previous  answers. 
Moreover,  different  researchers  can  construct  the 
scale  in  different  ways:  Some  offer  five  options,  as 
in  the  example,  while  others  offer  four  or  seven 
choices.  The  answers  generate  a  variable  that  is 
statistically  dependent  on  the  latent  one,  but  that 
variable  could  also  be  dependent  on  many  other 
variables,  most  of  them  probably  unknown  to  us. 

Critics  of  this  method  of  measuring  attitudes 
may  argue  that  all  the  other  factors  that  may  influ¬ 
ence  a  response,  including  the  more  occasional  or 
situational  ones,  could  reduce  the  relationship 
between  the  latent  dimension  and  the  manifest  one 
to  zero,  making  the  measurement  irrelevant  and 
not  valid  to  measure  what  it  intends  to  measure. 
The  supporters  of  the  method  think  that  the  range 
of  variation  between  these  two  levels  of  reality  is  far 
more  restricted  and  that  there  is  no  better  way  to 
approach  attitudes — at  least  in  a  quantifiable  way. 

It  is  commonly  accepted  that  political  atti¬ 
tudes  help  the  analyst  explain  and  predict  politi¬ 
cal  preferences  and  behavior.  There  is  a  great 
deal  of  research  examining  the  effects  of  electoral 
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campaigns,  political  communication,  and  messages 
delivered  by  political  leaders  on  voters’  attitudes. 
While  there  are  no  universal  generalizations  sup¬ 
ported  by  empirical  research,  some  firm  trends  can 
be  established.  For  example,  a  voter’s  level  of  edu¬ 
cation  and  the  quantity  of  information  an  individ¬ 
ual  may  acquire  facilitate  a  higher  propensity  to 
change;  membership  groups  also  have  an  effect, 
but  this  effect  tends  to  become  weaker  the  more 
complex  societies  are.  Political  parties  in  demo¬ 
cratic  systems  used  to  have  a  strong  influence  on 
voters,  but  this  influence  is  becoming  weaker  as  the 
number  of  citizens  who  feel  independent  from  par¬ 
ties  is  increasing.  The  role  of  ideology  in  the  pro¬ 
pensity  of  individuals  to  become  exposed  to  certain 
kinds  of  messages  and  to  be  receptive  to  them  is 
also  a  matter  of  discussion.  The  influence  of  the 
print  media  is  also  a  matter  of  widely  divergent 
points  of  view — that  is,  whether  the  media  influ¬ 
ence  public  attitudes  or  whether  they  adapt  them¬ 
selves  to  the  prevailing  attitudes  of  their  publics. 

We  know  that  many  citizens  today  hold  stable 
political  predispositions  and  do  not  change  them 
easily.  They  derive  their  opinions  and  particular 
political  decisions  from  those  predispositions. 
However,  they  do  not  link  their  predispositions  to 
particular  opinions  and  decisions  in  a  strongly  lin¬ 
ear  way.  Rather,  they  afford  a  certain  degree  of 
variability  in  the  opinions  they  form  at  every 
moment — including  the  voting  decision.  So,  differ¬ 
ent  persons  eventually  may  make  the  same  voting 
decision  for  different  reasons,  while  persons  with 
similar  attitudes  may  eventually  arrive  at  different 
voting  decisions.  Electoral  campaigns  are  sources 
of  intense  stimuli,  concentrated  within  a  short 
period  of  time,  which  voters  cannot  easily  resist. 
During  a  campaign,  political  advertising,  press 
information,  and  the  messages  of  the  candidate 
and  other  leaders  are  able  to  produce  some  vari¬ 
ability  in  particular  voting  behaviors  even  while 
voters’  attitudes  may  remain  relatively  stable. 
Sometimes,  those  intense  messages  are  able  to 
modify  attitudes  as  well. 

Some  citizens  decide  their  vote  by  taking  into 
account  both  their  own  attitudes  and  the  ones  they 
attribute  to  the  candidates  and  then  matching 
them.  But  there  are  others  who  tend  to  take  into 
account  “tactical”  considerations,  such  as  defining 
their  vote  against  a  candidate  rather  than  in  sup¬ 
port  of  another  or  to  voting  for  the  presumptive 


loser  in  order  to  prevent  the  winner  from  obtaining 
a  landslide.  To  make  that  kind  of  decision,  elec¬ 
toral  polls  may  influence  some  voters. 

The  principle  that  political  behavior,  and  spe¬ 
cifically  voting  behavior,  must  be  related  to  what 
citizens  have  in  their  minds  is  at  the  root  of  demo¬ 
cratic  theory.  Quite  often,  mass  democracy  is 
judged  negatively  precisely  on  the  assumption  that 
voters  lack  the  capacity  to  make  consistent  judg¬ 
ments  anchored  in  adequate  information.  Thus, 
the  ancient  problem  of  the  relationship  between 
attitudes  and  behavior  is  central  in  the  realm  of 
politics — be  it  formulated  in  these  terms  or  not. 
Even  if  attitudes,  in  whatever  way  they  have  been 
formed,  tend  to  be  stable  and  do  not  vary  at  ran¬ 
dom,  they  are  susceptible  to  change.  One  impor¬ 
tant  question  is  whether  human  beings  are  able  to 
modify  their  attitudes  in  an  autonomous  way — 
that  is,  by  thinking  and  reasoning  themselves — or 
whether  they  change  their  attitudes  under  the 
influence  of  messages  from  the  outside  world. 

Attitude  Change 

We  are  far  from  a  theoretical  consensus  on  how 
attitudes  can  change.  While  one  theoretical  per¬ 
spective  postulates  that  better  informed,  more 
consistent  persons  are  more  stable  in  their  atti¬ 
tudes,  another  theoretical  perspective  proposes 
that  they  are  more  susceptible  to  nuances  and 
variations  in  particular  opinions.  Recent  research 
tends  to  give  support  to  the  view  that  particular 
opinions  may  vary,  to  some  extent,  independently 
from  well-established  attitudes.  Individuals  have 
many  different  ideas  in  their  minds;  so,  they  may 
hold  different  attitudes,  some  of  them  more  firmly 
rooted  than  others.  The  assumption  that  more 
general  judgments  are  more  stable  in  the  mind  of 
individuals  does  not  imply  that  specific  opinions 
and  decisions  may  vary  to  a  greater  extent.  That 
does  not  necessarily  mean  that  citizens  are  incon¬ 
sistent  but,  rather,  that  they  are  flexible. 

When  an  individual  receives  information  from 
outside,  he  can  dismiss  it  or  accept  it;  in  the  latter 
case,  he  can  process  it  and  record  it,  totally  or  par¬ 
tially,  in  his  memory.  The  new  information  may 
have  different  effects  on  already  existing  attitudes: 
It  can  modify  some  of  them,  it  can  be  conserved 
without  modifying  previous  attitudes  (sometimes 
even  if  it  produces  some  inconsistencies),  or  it  can 
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be  rejected.  The  effect  of  a  new  piece  of  informa¬ 
tion  on  previous  attitudes  can  also  be  sometimes 
delayed:  The  new  information  has  been  taken  into 
account,  but  the  mind  works  on  it  slowly.  So  there 
can  be  a  lag  between  the  moment  a  stimulus  is 
produced  and  the  moment  an  effective  change  of 
attitude  takes  place.  It  is  possible  that  those  effects 
are  more  intense  on  some  attitudes  held  by  an 
individual  than  on  others;  therefore,  as  each  indi¬ 
vidual  holds  a  collection  of  interrelated  attitudes 
rather  than  many  separate  ones,  it  is  possible  that 
some  attitudes  change  while  others  do  not,  thus 
producing  different  patterns  of  consistency  or 
inconsistency  over  time. 

Let  us  take  the  example  of  electoral  campaigns. 
Some  voters  do  not  change  their  attitudes  under 
the  effect  of  a  campaign  while  others  do;  but  to 
some  voters,  the  effect  comes  with  a  lag.  So,  the 
transition  from  one  attitudinal  property  to  another 
could  be  nil,  immediate,  or  deferred — that  is,  some 
persons  may  adopt  a  new  behavior  or  a  new  opin¬ 
ion  after  a  period  of  time.  This  leads  to  the  notion 
of  a  probability  of  transition  from  one  attitudinal 
state  to  another  as  a  function  of  a  new  stimulus  or 
new  information  received.  Much  of  public  opinion 
research  focuses  on  these  effects,  frequently  ana¬ 
lyzing  them  by  means  of  statistical  association 
coefficients  and  contingency  tables. 

In  summary,  attitudes  may  change  as  the  result 
of  changes  in  the  social  position  of  persons  or  of 
changes  in  the  sociocultural  context  or  because  of 
the  effect  of  communication — press  media,  adver¬ 
tising,  electoral  campaigns,  leaders’  messages. 

To  account  for  variations  in  voting  behavior,  it 
is  useful  to  transpose  the  electoral  market  to  any 
situation  where  individuals  must  make  a  choice 
between  different  options.  In  those  situations, 
there  are  factors  that  condition  the  individual’s 
own  perspective  and  there  are  factors  that  are 
under  the  control  of  external  sources.  When  the 
individual  who  has  to  make  a  choice  is  exposed  to 
political  communication,  he  or  she  may  or  may  not 
be  motivated  by  well-reasoned  political  purposes. 
There  is  no  rule  about  why  and  how  individuals 
process  their  own  motivations  and  the  information 
they  receive  from  outside. 

A  model  of  the  effects  that  lead  to  voting  deci¬ 
sions  in  each  individual  citizen  may  take  into 
account  three  forces:  (1)  the  initial  position  of  each 
individual,  (2)  the  various  forces  that  push  him  or 


her  toward  a  change  in  one  direction,  and  (3)  the 
various  forces  that  push  him  or  her  in  the  opposite 
direction.  The  balance  of  those  three  types  of 
forces  determines  whether  the  person  will  finally 
vote  for  the  candidate  he  or  she  would  vote  for  if 
the  campaign  did  not  exist  and  other  factors 
remain  the  same  or  whether  he  or  she  changes  his 
or  her  voting  decision.  Political  campaigns  are 
dynamic  factors  moving  against  the  other  factors 
that  keep  the  system  stable. 

Generally  speaking,  attitudes  are  at  the  base  of 
the  demand  side;  they  help  citizens  form  expecta¬ 
tions  and  respond  to  what  they  are  offered. 
Traditionally,  political  studies  have  given  more 
emphasis  to  the  supply  side:  articulated  ideologies 
and  widely  shared  systems  of  ideas,  political  par¬ 
ties,  leadership,  mass  media.  More  recently,  the 
focus  has  gradually  shifted  to  the  demand  side; 
there  has  been  increased  interest  in  understanding 
individual  predispositions  and  attitudes  and  the 
individual  sources  of  opinions  and  behavior. 
Market  research,  motivational  research,  opinion 
research,  and  communications  research  have  been 
the  engines  of  the  approaches  centered  on  the 
demand  side. 
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The  concept  of  “audit  society”  refers  primarily  to 
a  logic  at  work  in  regulatory  and  managerial  pro¬ 
cesses  and  to  tendencies,  rather  than  to  an  epochal 
state  or  condition.  In  this  context,  an  audit  refers 
to  a  systematic  assessment  of  institutional  perfor¬ 
mance  against  agreed  criteria  and  standards.  The 
processes  used  to  conduct  such  audits  reflect  the 
transformation  of  audit  and  evaluation  practices 
into  normative  principles  of  organizational  control 
with  a  wide  reach.  This  transformation  is  in  turn  a 
function  of  what  political  scientists  have  called  the 
“regulatory  state,”  meaning  in  part  a  state  with  an 
operating  philosophy  of  acting  “at  a  distance”  on 
policy  domains,  via  instruments  such  as  account¬ 
ing  and  audit.  Below,  the  role  of  the  “audit  soci¬ 
ety”  idea  as  a  critical  resource  is  developed,  fol¬ 
lowed  by  a  number  of  methodological  issues  and 
concerns  related  to  its  potential  application. 

The  idea  of  “audit  society”  provided  an  orga¬ 
nizing  motif  for  criticisms  of  the  public  service 
reforms  in  the  United  Kingdom  (UK)  and  else¬ 
where  during  the  1980s  and  1990s,  known  collec¬ 
tively  as  the  “new”  public  management.  While  the 
original  formulation  of  the  idea  relied  largely  on  a 
synthesis  of  secondary  empirical  evidence,  it  has 
provided  the  platform  for  substantive  studies  on 
health  workers,  teachers,  the  police,  social  work¬ 
ers,  and  psychoanalysts,  to  name  but  a  few.  The 
working  practices  of  these  groups  have  been  tar¬ 
geted  by  reforms,  and  they  have  found  their  pro¬ 
fessional  lifeworlds  transformed  to  a  greater  or 
lesser  extent  by  precise  demands  for  accountabil¬ 
ity.  The  critical  reaction  that  enlists  the  idea  of 
“audit  society”  has  focused  on  defending  the 
importance  of  professional  judgment,  demonstrat¬ 
ing  the  decline  of  trust  and  the  rise  of  anxiety,  and 
analyzing  the  unintended  consequence  of  auditable 
targets,  which  damage  the  very  service  quality  the 
reforms  have  been  intended  to  improve. 

The  auditing  process  is  used  in  a  variety  of 
ways  in  academia,  ranging  from  performance 
assessment  to  evaluating  the  effectiveness  of 
research  grants.  The  wide-ranging  appeal  of  the 


“audit  society”  motif  does  not  necessarily  mean 
that  it  has  been  applied  consistently  across  fields. 
It  has  often  been  taken  as  a  given,  supplying  only 
a  pretext  for  empirical  analysis.  In  the  academy, 
there  are  also  variations  in  disciplinary  sensitivi¬ 
ties.  For  example,  the  specific  methodological 
commitments  of  anthropology  make  it  particularly 
vulnerable  to  the  shorter  timescales  of  the  audit 
process  and  to  the  reductive  nature  of  performance 
measurement.  Marilyn  Strathern  and  others  have 
been  highly  critical  of  audit  and  related  forms  of 
performance  measurement  as  they  have  emerged  in 
universities  in  the  UK.  By  contrast,  economics, 
which  has  to  a  large  extent  internalized  evaluation 
mechanisms  as  part  of  its  own  “professionaliza¬ 
tion,”  has  been  relatively  silent  on  auditing  as  a 
specific  mechanism  of  control  and  oversight. 

If  the  concept  of  audit  society  is  to  be  taken  seri¬ 
ously  as  more  than  metaphor,  it  raises  a  number  of 
serious  issues  for  scholars.  First,  among  these 
issues  is  the  much  debated  boundary  between 
audit  and  evaluation.  Audit  can  be  narrowly 
defined  in  the  shadow  of  financial  auditing  largely 
as  an  exercise  in  compliance  verification.  From  this 
point  of  view,  it  addresses  the  following  question: 
Does  this  organization  comply  with  some  prees¬ 
tablished  set  of  rules  for  the  quality  of  its  account¬ 
ing?  Even  allowing  for  interpretation  and  discre¬ 
tion  in  the  application  of  such  rules,  audit  is  fun¬ 
damentally  concerned  with  conformity  with  them. 
Interesting  questions  then  concern  the  rules  of 
performance  themselves:  How  have  they  evolved 
and  how  are  they  implemented? 

Evaluation,  it  is  argued,  is  quite  different  from 
audit.  It  is  an  investigation  of  the  impact  of  public 
service  programs  for  change,  assessing  whether 
such  programs  succeed  or  fail  in  their  own  terms, 
and  what  unintended  consequences  they  may  have 
created.  Whereas  audit  produces  a  kind  of  yes/no 
answer  to  compliance  issues,  evaluation  often  pro¬ 
duces  questions  and  ambiguity.  Yet  this  ideal- 
typical  distinction  hides  a  dynamic  by  which  the 
audit  model  expands  its  influence  and  affects 
evaluation  in  subtle  forms.  It  is  easy  to  see  why 
audit  is  attractive  to  policymakers  seeking  simple 
measures  of  performance,  and  this  attraction  is 
grounded  in  a  deeper  logic  that  finds  its  purest 
expression  in  the  idea  of  financial  audit  but  also 
transcends  that  practice  and  is  at  work  in  evalua¬ 
tion  practices  as  they  are  broadly  understood.  The 
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power  and  impact  of  this  logic  in  different  institu¬ 
tional  settings  is  an  empirical  question,  but  it  is  a 
logic  that  aspires  to  transform  the  evaluated 
domain  according  to  its  own  operational  values  of 
visibility  and  traceability  (the  accountant’s  idea  of 
the  “audit  trail”)  in  the  representation  of  organiza¬ 
tional  and  individual  performance. 

Another  criticism  of  the  “audit  society”  idea  is 
that  it  is  a  case  of  mistaken  causality;  the  real 
driver  of  an  “audit  explosion”  in  the  UK  was  in 
fact  the  heightened  concern  with  performance 
improvement  across  a  range  of  public  services. 
While  this  is  a  very  reasonable  point  to  make — 
supported  by  the  observed  production  of  informa¬ 
tion  that  is  never  audited  or  checked  in  the  strict 
sense — the  concern  with  performance  has  been 
simultaneously  a  concern  with  the  production  of 
visible  and  auditable  traces  of  performance.  The 
logic  of  audit  that  requires  individuals  and  organi¬ 
zations  to  be  transparent  and  comparable  plays  a 
constitutive  role  in  shaping  conceptions  of  relevant 
performance  and  is  at  the  very  heart  of  what 
Michael  Moran  calls  the  attack  on  “club  govern¬ 
ment”  in  the  UK. 

This  means  that  “audit  society”  is  only  partly 
an  argument  about  the  rise  of  auditors,  although 
data  on  increases  in  headcount  for  oversight  activ¬ 
ity  are  suggestive.  It  is  more  a  claim  about  a  change 
process  that  is  governed  by  an  audit  logic,  which 
in  turn  shapes  the  design  of  performance  represen¬ 
tations.  This  emphasis  on  process  raises  an  impor¬ 
tant  question  for  the  scholarly  understanding  of 
“audit  society”  and  for  the  attention  given  to  the 
analysis  of  auditors  as  compared  with  those  who 
they  audit,  respectively.  Empirical  indicators  for 
the  audit  explosion  can  be  found  in  the  growth  of 
audit  institutions  and  expansion  of  their  remit  and 
scope.  However,  the  less  visible  and  more  pro¬ 
found  aspect  of  “audit  society,”  which  preoccupies 
many  studies  outside  political  science,  concerns  the 
manifest  impact  of  the  audit  on  professional  ser¬ 
vice  providers  and  the  process  by  which  the  logic 
of  audit  is  internalized  by  them. 

This  psychological  and  behavioral  level  of 
analysis  reveals  the  conflicts  and  the  range  of  strat¬ 
egies  that  individuals  use  to  cope  with  new 
auditized  demands  for  performance  reporting.  In 
turn,  it  suggests  that  the  real  “action”  of  the  audit 
society  happens  at  the  level  of  the  construction  of 
“auditees,”  who  are  not  docile  subjects  in  Michel 


Foucault’s  sense  and  may  put  up  considerable 
resistance  to  new  measures  of  what  they  do  but 
nevertheless  operate  in  the  shadow  of  a  logic  that 
values  precision  and  traceability  in  the  representa¬ 
tion  of  performance.  The  strong  form  of  “audit 
society”  analysis  goes  further  to  suggest  that  per¬ 
formance  representation  in  many  different  fields  is 
shaped  by  this  logic,  with  highly  dysfunctional 
consequences. 

Another  methodological  concern  about  “audit 
society”  is  that  it  is  a  case  of  selection  on  the 
dependent  variable,  a  consequence  of  which  is  the 
failure  to  recognize  the  UK-specific  nature  of 
the  phenomenon.  While  there  is  some  truth  in  this, 
the  argument  also  reveals  differences  between 
research  sensitivities  within  and  outside  political 
science.  Finding  new  and  puzzling  phenomena 
whose  character  and  logic  demand  analysis  is  nec¬ 
essarily  a  precursor  to  the  exploration  of  the  kind 
of  dependency  and  causality  that  provide  insight 
into  cross-sectional  variation  of  the  phenomena.  In 
short,  it  is  always  the  “dependent”  variables  that 
strike  us  first,  generate  puzzles,  and  provide  the 
“value  relevance”  for  research  into  their  indepen¬ 
dent  causes.  We  do  not  think  much  about  this 
question  when  the  dependent  variable  is  well 
formed,  for  example,  security  prices  in  liquid  capi¬ 
tal  markets,  but  when  it  is  not,  the  investigation  is 
more  like  a  “depth  hermeneutic”  that  is  explicative 
and  inductive  in  its  movement  from  phenomena  to 
their  underlying  logics.  The  audit  society  thesis  is 
more  than  a  mistaken  focus  on  a  well-formed 
dependent  variable. 

That  said,  cross-national  variations  in  audit 
society  tendencies  are  indeed  significant,  depend¬ 
ing  themselves  on  how  fiscal  and  service  perfor¬ 
mance  issues  are  problematized  and  the  sources  of 
expertise  available  in  different  national  settings 
and  fields.  The  distinctive  cultural  position  of 
accountants  in  UK  society  has  influenced  the  way 
reform  agendas  have  been  shaped,  and  there  is  also 
considerable  cross-sectional  variation.  So  audit  as 
an  institutional  logic  with  its  own  dynamic  must 
compete  with  many  others,  and  its  hegemony  in 
any  setting  is  at  best  highly  contingent. 

The  “audit  society”  idea  has  had  wide  cross- 
disciplinary  impact  because  of  its  role  in  providing 
a  label  for  the  challenges  faced  by  many  different 
professional  groups.  However,  to  suggest  that  an 
audit  logic  has  dysfunctional  side  effects  is  hardly 
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a  new  one  in  social  science.  Analyses  of  failed 
dreams  of  mastery  are  well-known  to  political  sci¬ 
entists  and  historians.  The  rise  of  audit  and  its 
underlying  logic  is  only  the  latest  version  of  this 
dream,  and  the  effects,  intended  and  unintended, 
are  not  particularly  surprising.  Behavioral  econo¬ 
mists  have  explored  the  “hidden  costs  of  control” 
and  the  paradoxes  of  transparency.  Bruno  Frey 
and  Reto  Jegen  have  provided  analyses  of  the 
“crowding”  of  intrinsic  motivation  by  extrinsic 
efforts  at  control.  This  work  overlaps  with  the 
“audit  society”  arguments  and  provides  the  poten¬ 
tial  for  a  more  formal  expansion  of  their  insights. 

Finally,  it  was  suggested  above  that  the  rise  of 
audit  is  closely  associated  with  neoliberal  reform 
agendas  and  the  rise  of  the  regulatory  state  in 
developed  economies.  And  yet  “audit  society”  is 
an  intriguing  concept  precisely  because  it  escapes 
easy  alignment  with  such  labels.  The  logic  of  audit 
embodies  values  of  visibility  and  transparency  that 
may  enable  greater  democratic  participation  in 
service  evaluation  but  may  equally  support  cen¬ 
tralist  aspirations  by  states.  Audit  has  been  associ¬ 
ated  with  a  Soviet-style  centralism  in  UK  public 
policy,  and  performance  audit  in  the  Chinese  con¬ 
text  is  an  expression  of  socialist  ideals.  The  malle¬ 
ability  of  audit  and  evaluation  in  different  cultural 
contexts,  and  the  capacity  of  their  underlying  log¬ 
ics  to  appear  in  quite  different  value  systems, 
means  that  there  is  still  some  way  to  go  in  order  to 
understand  the  complex  social  and  organizational 
dynamics  indentified  under  by  the  idea  of  “audit 
society.”  Not  least  among  these  complex  dynamics 
is  the  apparent  invulnerability  of  audit  to  its  own 
failure. 
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Auditing 


If  accountability  is  the  hallmark  of  democratic 
governance,  public  auditing  is  a  means  to  achieve 
it.  Auditing  is  a  form  of  oversight,  or  examination 
from  some  point  external  to  the  system  or  indi¬ 
vidual  in  question.  Technically,  auditing  is  a  form 
of  verification  by  an  independent  body,  which 
compares  actual  transactions  with  standard  prac¬ 
tices.  Because  it  evaluates  the  relationship  of  what 
is  against  what  ought  to  be,  auditing  is  a  norma¬ 
tive  operation.  Public  auditing  is  the  traditional 
instrument  to  hold  actors  entrusted  with  manag¬ 
ing  public  funds  accountable  by  providing  infor¬ 
mation  to  supervising  agents,  elected  officials,  and 
(sometimes)  constituents  about  compliance  with 
or  deviations  from  accepted  standards.  Although 
accountability  as  a  general  aim  of  auditing  is 
undisputed,  unavoidable  tensions  with  other  prin¬ 
ciples  of  good  governance  arise,  such  as  the  exer¬ 
cise  of  informed  discretion  by  elected  decision 
makers  or  individual  privacy  rights  when  the  cor¬ 
set  of  control  is  taken  to  an  extreme. 

The  media,  interest  groups,  private  overseers 
(e.g.,  credit-rating  agencies),  international  organi¬ 
zations,  public  audit  institutions,  and  the  general 
public  increasing  see  modern  government  as  an 
object  of  scrutiny.  At  the  same  time,  government 
regulates  society  and  delivers  services  through 
myriad  national,  international,  nonprofit,  and  pri¬ 
vate  organizations,  making  control  a  highly  com¬ 
plex  issue.  The  question  is:  Who  oversees  whom 
for  what  purpose,  under  whose  authority,  with 
what  kind  of  legitimacy,  and  how? 

The  balance  of  autonomy  and  control  is  a  clas¬ 
sic  topic  for  students  of  government.  It  touches  on 
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issues  such  as  responsibility,  trust,  functional  dif¬ 
ferentiation,  authorization  of  execution,  assur¬ 
ance,  and  accountability.  At  its  center  stands  the 
question  of  how  public  organizations  can  be  task 
efficient  and  accountable  at  the  same  time. 

This  entry  is  divided  into  four  sections:  The 
historical  traditions  of  auditing  are  traced  in  the 
first  section,  the  second  one  introduces  different 
institutional  arrangements  of  auditing,  the  third 
one  sketches  the  main  arguments  of  the  audit 
explosion  discourse,  and  the  final  one  illustrates 
the  politics  of  auditing. 

Tracing  the  Historical  Traditions  of  Auditing 

Auditing  is  one  of  the  oldest  and  most  eminent  of 
state  functions,  and  early  forms  preceded  the  rise  of 
the  modern  democratic  state.  The  Latin  word  audi- 
tus  means  “a  hearing.”  In  ancient  Rome,  the  orga¬ 
nization  of  a  “hearing  of  accounts”  was  a  way  in 
which  one  official  would  read  out  his  accounts  to 
another  who  would  compare  the  figures.  This  form 
of  oral  verification  was  intended  to  prevent  officials 
in  charge  of  funds  from  misusing  them.  Governors 
of  ancient  civilizations,  for  example,  Sumer,  Egypt, 
Phoenicia,  Greece,  and  Rome,  appointed  trustwor¬ 
thy  clerks  to  find  and  punish  employees  for  embez¬ 
zlements  and  to  protect  public  assets.  Early  Chinese 
writings  from  the  12th-century  BCE  illustrate  an 
advanced  understanding  of  the  role  of  auditing, 
describing  methods  and  practices  of  good  economic 
management  and  the  necessity  of  securing  an  inde¬ 
pendent  and  high-level  status  for  the  auditor.  One  of 
the  earliest  records  of  public  sector  financial  man¬ 
agement  in  India  is  the  Arthasbastra  written  by 
Kautilya,  who  was  a  Brahmin  minister  under 
Chandragupta  Maurya,  the  founder  of  the  Mauryan 
Empire,  about  2,300  years  ago.  Kautilya  developed 
bookkeeping  rules  to  record  and  classify  economic 
data,  emphasized  the  critical  role  of  independent 
periodic  audits,  and  proposed  the  establishment  of 
two  important  but  separate  offices — the  treasurer 
and  the  comptroller-auditor. 

The  origin  of  public  auditing  lies  therefore  in 
the  review  of  financial  transaction.  The  concept  is 
closely  related  to  the  development  of  private  sector 
accounting  techniques  in  16th-century  Venice,  but 
some  supreme  audit  institutions  date  their  origin 
back  to  the  13th  and  14th  centuries.  The  National 
Audit  Office  in  the  UK  sees  its  origin  in  the  early 


14th  century,  the  French  Cour  des  Gomptes  cites 
1318  as  the  year  of  its  inception,  professional 
forms  of  supervision  and  control  of  resources  in 
Mexico  date  back  to  the  Aztec  Empire.  These  audit 
institutions  were  not  independent  of  the  ruler  or 
crown  and  were  designed  to  control  the  financial 
conduct  of  the  crown  servants.  Their  aim  was  to 
protect  the  interests,  property,  and  wealth  of  a 
ruler  at  a  time  when  the  state  was  still  considered 
private  property.  In  concert  with  trends  of  the  18th 
and  19th  centuries — the  diminishing  role  of  family 
and  personal  loyalty,  the  increasing  division  of 
labor  in  economy  and  state  (functional  differentia¬ 
tion),  the  emergence  of  big  government,  new  ideas 
about  responsible  behavior  of  public  servants — 
audit  institutions  changed  in  scope,  quality,  and 
quantity. 

As  parliament  gained  power  to  grant  the  finan¬ 
cial  means  to  execute  public  tasks,  the  need  arose 
to  oversee  the  respective  administrations  or  institu¬ 
tions  using  the  financial  resources.  The  objective 
was  to  ensure  that  the  regularity  and  probity  of  the 
expenditures  was  in  accordance  with  the  defined 
aims  of  parliament.  The  state  budget  was  and  is 
the  prerequisite  for  this  form  of  formal  legal  ratio¬ 
nality.  Its  verification  rests  on  checking  receipts 
against  authorized  state  budget  expenditures  and 
vice  versa. 

The  clearest  transformation  was  from  internal 
to  external  auditing  and  with  it  the  emergence  of 
independent  audit  institutions,  often  considered  to 
be  the  extended  arm  of  the  parliament  via  the 
executive  government.  Over  time,  they  gained  a 
constitutional  status  in  many  countries,  which 
secured  their  statutory  and  operational  indepen¬ 
dence  from  executive  interference.  In  many  coun¬ 
tries,  however,  especially  those  under  authoritar¬ 
ian  rule,  this  was  the  exception  rather  than  the 
rule.  In  the  former  German  Democratic  Republic, 
an  independent  audit  system  did  not  exist  to  exe¬ 
cute  the  directives  of  the  Ministry  of  Finance, 
which  made  the  Central  Committee  of  the 
Communist  Party  a  political  tool  to  oversee  the 
completion  of  economic  plans  in  a  state-directed 
economy.  The  attribute  of  constitutionally  secured 
independence  of  the  auditor  from  those  holding 
and  executing  power  (relational  distance),  for 
example,  cabinet,  bureaucracy,  parliament,  and  the 
auditor’s  role  in  ensuring  good  governance  have 
become,  as  many  see  it,  the  key  to  the  auditor’s 
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legitimacy  in  democracies.  The  many  authoritar¬ 
ian  regimes  that  lack  an  autonomous  body  of 
oversight  in  effect  allow  the  instrumentalization  of 
spending  for  partisan  purposes.  The  Mexico 
Declaration  on  SAI  Independence  of  the 
International  Organization  of  Supreme  Audit 
Institutions  advocates  principles  of  good  auditing 
and  tries  to  establish  common  standards  of  audit¬ 
ing  among  its  global  member  organizations.  It 
specifies  organizational,  functional,  and  financial 
independence  as  the  core  prerequisite  to  accom¬ 
plishing  tasks  objectively,  effectively,  and  without 
external  interference. 

The  state  audit  evolved  from  a  rather  crude  tool 
to  prevent  chaos  and  ruin  in  public  finance  to  an 
advanced  instrument  for  safeguarding  responsive, 
accountable,  open,  regular,  and  efficient  govern¬ 
ment.  Compliance  auditing  (verification  of 
accounts,  legality  of  expenditures,  the  organiza¬ 
tion’s  adherence  to  regulatory  guidelines)  and 
financial  auditing  (the  overseeing  of  public 
accounts,  assessing  accuracy  and  fairness  of  an 
organization’s  financial  statements)  were  the  pri¬ 
mary  purposes  of  auditing;  performance  auditing 
(examination  of  how  government  programs  oper¬ 
ate  and  how  well  they  achieve  the  defined  objec¬ 
tives)  came  somewhat  later,  gaining  an  important 
place  in  the  auditing  realm  with  the  new  public 
management  reforms.  The  latter  is  often  referred 
to  as  value-for-money  auditing,  or  evaluating  the 
efficiency,  effectiveness,  and  economy  of  programs 
and  policies.  Such  a  focus  is  supposed  to  give  way 
to  comparisons  with  performance  criteria  of  best 
practices  from  the  private  sector.  Its  critics  claim 
that  performance  auditing  has  contributed  to  an 
overstretched  meaning  of  auditing:  For  example, 
conceptual  boundaries  with  forms  of  evaluation 
have  started  to  blur. 

Auditing  as  an  Institutional  Arrangement 

Internal  versus  external  auditing  is  the  most  basic 
differentiation  of  types  of  auditing.  It  refers  to  the 
organizational  status  of  the  auditing  entity  relative 
to  the  one  it  audits.  Internal  means  that  the  auditor 
is  part  of  the  organization  and  the  organization’s 
employee;  external  refers  to  an  external  oversight 
body  or  person  operating  independently  of  the 
entity  being  audited.  Because  internal  auditors  are 
part  of  the  organization  under  scrutiny,  they  work 


for  the  benefit  of  the  management  of  the  organiza¬ 
tion  under  examination  and  their  insights  remain 
mostly  disclosed  only  within  the  management.  In 
contrast,  external  auditors  review  financial  trans¬ 
actions  and/or  statements  and  serve  third  parties, 
for  example,  parliament,  president,  who  need  reli¬ 
able  financial  information  about  those  entrusted 
with  the  management  of  public  funds. 

Models  of  Auditing:  State,  Private,  and  People’s 

The  audit  of  public  sector  organizations  staffed 
with  civil  servants  differs.  The  most  general  clas¬ 
sification  is  based  on  the  question  of  who  conducts 
the  audit.  Here  we  can  distinguish  between  state, 
private,  and  people’s  auditing.  State  auditing  can 
be  divided  into  three  subgroups  according  to  gov¬ 
ernment  level: 

1.  supreme  audit  institutions  (e.g.,  the  British 
National  Audit  Office)  examine  the  affairs  of 
federal  government  departments; 

2.  second-  or  third-level  audit  institutions  (which 
can  be  fully  independent  of  other  government 
levels)  exist  in  states,  provinces,  and 
municipalities  of  federal  systems  to  audit  their 
own  government  entities;  and 

3.  internal  auditing  bodies  operate  within  any 
governmental  department  or  agency. 

Professional  accounting  firms  charged  with 
private  auditing  conduct  special-purpose  audits 
(such  as  reviewing  public  enterprise  accounts)  and 
have  increased  in  number  and  scope  in  recent 
decades  along  with  the  growth  of  government- 
owned  business  corporations  in  the  public  sector. 
The  third  audit  model,  people’s  auditing,  was 
exercised  in  the  former  Soviet  Union  by  the 
Narodnyi  Control  (Soviet  People’s  Control)  as 
part  of  a  combination  of  professional  auditors 
with  millions  of  volunteers  as  so-called  people’s 
inspectors  who  conducted  examinations  and 
inspections  in  their  spare  time.  This  system  was 
meant  to  educate  the  masses  and  be  a  training 
ground  in  the  transformation  of  the  country  into 
a  communist  society.  According  to  the  communist 
ideology,  the  professional  state  apparatus  would 
eventually  wither  away  and  the  ordinary  people 
would  take  over. 
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Organization  of  Auditing: 

Court  or  Administrative  System 

State  (supreme)  audit  institutions  can  be  classi¬ 
fied  into  two  broad  groups  according  to  their 
organization:  the  cameral  or  court  system  and  the 
administrative  or  monocratic  system. 

The  Cameral  System 

In  the  cameral  system,  the  audit  institution  is 
headed  by  a  group  of  auditors  whose  indepen¬ 
dence  is  ensured  by  their  judicial  status  and  who 
examine  governmental  accounts  as  well  as  the  uses 
of  public  funds.  To  this  ideal-type  model  belong 
most  of  the  Roman  law-based  countries,  such  as 
not  only  the  Corte  dei  Conti  in  Italy,  the  French 
Cour  des  Comptes,  and  the  Tribunal  de  Cuentas  in 
Spain  but  also  the  European  Court  of  Auditors,  the 
German  Bundesrechnungshof,  the  Dutch  Algemene 
Rekenkamer,  and  the  Auditoria  General  de  la 
Nacion  of  Argentina  in  Latin  America.  In  Italy, 
France,  and  Spain,  the  auditor  often  acts  as  an 
administrative  tribunal  with  quasi-judicial  powers 
in  administrative  matters,  whereas  the  variant  of 
this  collegiate  system  in  Sweden,  the  Netherlands, 
and  Germany  is  without  jurisdictional  authority. 
The  strong  legalistic  foundation  in  both  variants  of 
the  “cameral  system”  privileges  legal  and  financial 
compliance  over  performance  auditing.  Though  in 
principle  a  collegiate  model  in  which  everyone 
enjoys  the  same  rights  and  duties,  the  institution  is 
commonly  headed  by  a  kind  of  primus  inter  pares. 
As  a  “first  among  equals,”  he  or  she  can  exert,  as 
in  the  German  case,  considerable  influence  by 
selecting  the  heads  of  sections  and  tailoring  the 
areas  of  operation.  Despite  these  far-reaching 
organizational  competencies,  the  head  of  the  insti¬ 
tution  cannot  interfere  with  actual  matters  of 
auditing  or  the  content  of  inspectors’  decisions. 

The  Administrative  or  Monocratic  System 

In  the  United  Kingdom  (UK),  Canada,  and  most 
of  the  former  British  Empire  dominions  and  colo¬ 
nies,  as  well  as  countries  such  as  Mexico,  Chile, 
and  Colombia,  the  state  audit  institution  is  orga¬ 
nized  under  the  administrative  or  monocratic  sys¬ 
tem,  which  is  similar  to  a  hierarchically  structured 
government  department — that  is,  headed  by  a 
single  person  called  the  comptroller  or  auditor 


general  and  acting  as  an  auxiliary  institution  to  the 
legislature,  although  charged  with  ample  auton¬ 
omy.  The  head  of  the  institution  enjoys  an  inde¬ 
pendent  status,  but  the  institution  itself  functions 
according  to  hierarchical  principles  of  lines  of 
command.  Under  this  system,  controls  are  intended 
to  correct  rather  than  penalize.  Supreme  audit 
institutions  in  these  countries  focus  more  on  ex 
post  auditing,  rather  than  ex  ante  control,  and 
emphasize  financial  and  performance  auditing 
over  compliance  control. 

Interrelating  With  Others:  Parliamentary- 
Led  or  Executive-Led  Auditing 

Audit  institutions  are  embedded  in  political 
environments,  which  is  exhibited  by  the  way  they 
interrelate  with  other  actors.  Though  they  all  enjoy 
statutory  independence,  differences  arise  with 
regard  to  instrumental  institutions:  the  one  to 
which  they  report,  the  one  that  appoints  the  head 
of  the  Supreme  Audit  Institution,  and  the  one  that 
assigns  the  annual  budget.  Despite  their  statutory 
independence  and  regulations  to  protect  that  sta¬ 
tus,  auditors  are  dependent  on,  for  example, 
national  legislative  bodies  or  governmental  budget 
support — forms  of  leverage  that  can  be  used  to 
compromise  the  auditor’s  independence. 

In  Argentina,  Canada,  Germany,  the  United 
States,  Israel,  Mexico,  and  the  UK,  the  parliament 
or  a  special  committee  appoints  the  Auditor 
General,  acts  as  supervisor,  and  decides  on  the 
annual  budget.  This  system  has  been  labeled  par¬ 
liamentary  led,  as  the  auditor  is  considered  the 
extended  arm  of  the  legislator.  In  an  executive-led 
system  such  as  France  (also  Ecuador),  the  “Premier 
President”  and  other  magistrates  of  the  Cour  de 
Compte  are  appointed  by  a  decree  from  the 
President  of  the  Republic.  Appropriations  are  offi¬ 
cially  related  to  the  Prime  Minister  but  have  spe¬ 
cial  authorization  by  Parliament.  Reports  are 
directed  to  the  president  and  legislative  branch  of 
government.  China  also  has  an  executive-led 
arrangement:  The  auditor  reports  to  the  Premier  of 
the  State  Council  (the  head  of  the  Chinese  govern¬ 
ment)  and  the  ministry  of  finance  controls  the 
budget.  Sweden’s  Riksrevisionen,  which  until  2000 
was  similar  to  the  executive-led  model,  now  carries 
out  its  state  audit  functions  under  the  auspices  of 
Parliament. 
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Auditing  as  a  “Neurosis” 

Balancing  independence  with  control  is  a  classic 
topic  in  public  sector  research.  The  new  public 
management  (NPM)  hype  sparked  an  increased 
interest  in  performance  auditing,  primarily  under¬ 
stood  as  an  assessment  of  efficiency  goals  and  other 
indicators  of  economic  success  with  a  correspond¬ 
ing  system  of  controlling  and  reporting.  At  its  cen¬ 
ter  stands  the  question  of  how  public  organizations 
can  be  task  efficient  and  accountable  while  being 
relatively  free  of  bureaucratic  impediment. 

In  the  past  2  decades,  almost  every  local  govern¬ 
ment  and  many  national  administrations  in  OECD 
countries  have  implemented  some  kind  of  NPM 
reform.  This  “wave  of  modernization”  has  in  many 
cases  changed  the  way  governments  and  local 
authorities  operate  and  eventually  also  the  audit 
regime.  Although  the  NPM  doctrine  does  not  pre¬ 
scribe  a  well-defined  list  of  reform  steps,  the  stereo¬ 
typical  tool  box  includes  privatization,  deregulation, 
outsourcing,  service  delivery  competition,  semiau- 
tonomous  service  centers,  and — particularly  in  cen¬ 
tral  and  federal  government  reform — executive 
agencies,  performance  standards  and  measures, 
increased  consumer  power,  and  employment  of  pro¬ 
fessional  managers  and  noncareer  personnel. 
Auditing  bodies,  it  is  claimed,  respond  to  NPM-type 
reforms  by  institutional,  procedural,  and  cultural 
acclimation.  Increasing  coproduction  of  public 
goods  and  services  by  public,  private,  and  nonprofit 
organizations  seems  to  pose  additional  challenges  to 
systems  of  public  auditing  that  are  supposed  to 
counter  centrifugal  tendencies  of  decentralized  agen¬ 
cies  and  public  enterprises. 

The  conceptual  overstretching  of  auditing  is 
often  associated  with  a  debate  on  the  rise  of  the 
so-called  audit  society.  An  increasing  public  and 
academic  concern  about  unethical  behavior  of 
public  officials,  an  evaluation-obsessed  political 
and  social  context,  and  the  observation  that  mod¬ 
ern  Western  societies  would  evolve  into  low-trust 
societies  led  observers  to  conclude  that  the  “audit 
society”  is  a  way  of  dealing  with  this  type  of  neu¬ 
rosis.  This  diagnosis  had  its  origin  in  Michael 
Power’s  description  of  a  society  obsessed  with 
overseeing  almost  every  domain  of  public  activity 
and  the  paradoxical  effect  that  it  does  not  lead  to 
an  increase  in  trust  in  government.  This  audit  men¬ 
tality  is  very  often  decoupled  from  its  real  effects 


on  public  sector  accountability  and  has  become  an 
end  in  itself.  Auditing  is  not  seen  as  a  rational  reac¬ 
tion  to  the  well-defined  problem  of  accountability; 
rather,  the  very  concept  creates,  constitutes,  and 
shapes  the  public  perception  for  which  it  is  a  solu¬ 
tion.  Therefore,  by  “audit  explosion,”  Power 
refers  to  the  growth  of  audit  and  related  monitor¬ 
ing  practices  associated  with  public  management 
reform  processes.  As  a  multifaceted  concept,  the 
explosion  is  not  only  an  increment  of  auditing 
activities  in  quantitative  terms  but  also  a  shift  in 
qualitative  perspective,  mainly  toward  the  so- 
called  value-for-money  audit  and  other  internal 
control  systems.  It  also  means  that  oversight  bod¬ 
ies  offshoot  into  many  different  fields  such  as 
health  care  and  education.  The  concept  of  an  audit 
society  has  been  criticized  for  being  too  narrowly 
attributed  to  the  UK  or  other  English-speaking 
countries,  such  as  Australia  or  New  Zealand, 
meaning  that  it  does  not  sufficiently  capture  the 
reality  of  developments  in  other  countries,  for 
example,  the  United  States  or  Germany. 

The  Politics  of  Auditing 

Audit  institutions  do  not  commonly  have  the 
power  to  set  rules  or  sanction  the  deviant  behavior 
of  auditees.  In  principle,  they  lack  the  capacity  to 
exert  direct  hierarchical  control.  This  is  not  undis¬ 
puted,  as  the  Audit  Office  of  Finland,  the  Office  of 
the  Comptroller  General  of  Chile,  and  the  Court  of 
Audit  in  Spain  are  endowed  with  competencies  to 
prosecute.  The  Italian  Corte  dei  Conti  and  the 
French  Cour  de  Compte  also  enjoy  quasi-judicial 
powers  in  administrative  matters.  Despite  these 
examples,  the  foundation  of  their  authority  in  gen¬ 
eral,  however,  rests  on  their  constitutionally  guar¬ 
anteed  independence  from  political  interference, 
their  professional  expertise,  the  skill  to  informally 
cooperate  with  auditees  in  order  to  improve  pro¬ 
cesses,  and  the  ability  to  ally  themselves  with  like- 
minded  organizations  (such  as  the  public  accounts 
committee  in  parliament)  to  exercise  indirect  con¬ 
trol.  Providing  information  to  other  institutions 
about  a  specific  body’s  financial  management  is  the 
dominant  way  of  exerting  influence.  The  naming 
and  shaming  of  deviant  behavior  through  the  pub¬ 
lication  of  examination  results  must  be  seen  as 
ultimo  ratio.  On  one  hand,  publicizing  mismanage¬ 
ment  is  a  powerful  lever;  on  the  other,  it  politicizes 
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their  alleged  independence  as  the  auditor  is  drawn 
into  the  struggle  between  political  competitors. 
For  this  reason,  the  evaluation  of  government 
policies  by  auditors  is  a  critical  matter. 

Auditing  is  per  se  political  in  nature.  It  operates 
in  the  nexus  of  politics  and  policies.  The  tension 
between  politicization  and  institutional  indepen¬ 
dence  is  permanently  felt  in  the  arena  of  political 
competition  for  several  reasons.  First,  the  broad 
legal  mandate  with  the  discretion  to  look  into 
almost  any  government  activity  and  its  potential  to 
unveil,  for  example,  untoward  spending  practices 
unwittingly  makes  the  auditor  a  player  within  the 
political  game.  Second,  the  nature  of  the  auditing 
workforce — empowered,  well  positioned  in  govern¬ 
ment,  specifically  skilled — impels  political  parties  to 
strategically  place  favorites.  Third,  an  independent 
producer  of  creditable  and  presumably  apolitical 
information  is  a  highly  desirable  partner  in  politi¬ 
cal  arguments.  Auditing  as  a  political  power  or 
blaming  tool  is  most  evident  when  it  is  used  to 
detect  and  deter  deviant  behavior,  such  as  capital¬ 
istic  tendencies  in  communist  countries,  or  to 
achieve  certain  policy  or  partisan  objectives.  The 
claim  of  avoiding  policy  issues  and  focusing  on  the 
operation  of  the  apparatus  is  therefore  under¬ 
standable,  albeit  next  to  impossible,  especially  in 
the  context  of  issues  framed  not  objectively  but 
within  the  perceptions  of  political  actors. 

The  principle  of  the  auditors’  independence  is 
definitely  a  necessary  condition  for  a  well- 
functioning  system.  Other  conditions  are  more 
general  traits  of  the  political  system,  such  the  audit 
courts’  dependence  on  the  democratic  culture  of  a 
country,  the  respect  for  democratic  institutions, 
the  belief  in  a  division  of  powers,  and  respectful 
behavior  among  those  holding  power.  The  main 
challenge  in  the  development  of  auditing  in  mod¬ 
ern  states  is  therefore  the  acceptance  of  the  com¬ 
plementarities  of  the  auditor’s  role  by  relative 
political  actors  and  the  positive  use  of  its  potential 
to  improve  government  operations,  ensure  good 
public  records,  and  increase  public  trust. 

Patrick  von  Maravic 
Zeppelin  University 
Friedricbsbafen,  Germany 
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Authoritarian  Regimes 


Until  the  early  1960s,  there  was  a  fundamental 
dichotomy  in  political  science  between  democratic 
and  totalitarian  regimes.  Authoritarian  regimes 
were  often  treated  as  a  type  of  totalitarian  regime. 
However,  with  the  collapse  (Germany)  and  the 
subsequent  transformation  (in  the  former  USSR) 
of  totalitarian  regimes  after  World  War  II  and  in 
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the  1950s,  authoritarian  regimes  have  become  the 
dominant  form  of  nondemocratic  government. 

This  entry  provides  a  general  definition  of 
authoritarian  regime  as  well  as  exploring  subtypes. 
Among  the  many  subtypes  of  authoritarian 
regimes,  the  currently  most  widespread  one  is  the 
“electoral  authoritarian”  regime  that  combines  a 
facade  of  multiparty  elections  with  a  variety  of 
authoritarian  controls  that,  in  effect,  deprive  such 
elections  of  any  democratic  significance.  Therefore, 
this  entry,  while  briefly  discussing  other  subtypes 
of  authoritarian  regimes,  will  particularly  focus  on 
electoral  authoritarianism.  Because  such  regimes 
are  the  dominant  regime  type  in  the  Middle  East 
and  North  Africa,  a  special  emphasis  will  be  put 
on  this  region. 

A  Theory  of  Authoritarian  Regimes 

Juan  J.  Linz  developed  the  first  full-fledged  theory 
of  authoritarian  regimes,  as  distinct  from  both 
democracies  and  totalitarian  regimes,  as  well  as 
from  many  other  types  of  other  nondemocratic 
regimes  (traditional  autocracies,  sultanistic 
regimes,  semiconstitutional  monarchies,  oligarchic 
democracies).  As  early  as  1964,  Linz  defined 
authoritarian  regimes  on  the  basis  of  three  main 
characteristics:  (1)  limited  pluralism,  (2)  the  pres¬ 
ence  of  a  mentality  rather  than  an  ideology,  and 
(3)  the  absence  or  low  level  of  political  mobiliza¬ 
tion.  He  developed  and  refined  his  theory  in  his 
later  work  (especially  Totalitarian  and 
Authoritarian  Regimes,  2000).  More  recently, 
such  regimes  have  started  to  attract  greater  schol¬ 
arly  attention  for  several  reasons.  Lirst,  the  emer¬ 
gence  of  military-bureaucratic  authoritarian 
regimes  in  much  of  Latin  America  and  elsewhere 
as  a  new  type  of  authoritarianism  required  more 
detailed  research  into  their  dynamics.  Second, 
after  the  global  resurgence  of  democracy  or  the 
“third  wave  of  democratization”  in  the  1980s  and 
the  1990s,  there  emerged  a  still  newer  type  of 
authoritarianism  generally  called  “electoral 
authoritarianism.”  This  type  lies  in  the  gray  area 
between  liberal  democracies  and  “closed”  author¬ 
itarian  regimes,  an  area  that  they  share  with  “elec¬ 
toral  democracies.”  Hence,  in  this  entry,  while  all 
subtypes  of  authoritarian  regimes  will  be  dis¬ 
cussed,  the  emphasis  will  be  on  these  intermediary 
types. 


Limited  Pluralism 

Starting  with  the  three-dimensional  scheme  of 
Linz,  limited  pluralism  contrasts  with  the  almost 
unlimited  pluralism  of  liberal  democracies  and  the 
almost  total  absence  of  pluralism  in  totalitarian 
regimes.  The  degree  of  limited  pluralism  varies 
among  different  subtypes  of  authoritarian  regimes. 
They  may  exclude  only  political  opposition  groups 
or  may  exclude  certain  types  of  civil  society  insti¬ 
tutions  as  well.  Such  exclusion  may  be  based  on 
legal  regulations  or  de  facto  restrictions.  As  this 
observation  indicates,  the  degree  of  limited  plural¬ 
ism  is  a  critical  variable  in  distinguishing  the  sub- 
types  of  authoritarian  regimes,  as  will  be  spelled 
out  below. 

Mentality  Versus  Ideology 

A  second  defining  element  of  authoritarian 
regimes  is  the  presence  of  a  “mentality”  rather 
than  an  “ideology.”  In  Linz’s  view,  in  contrast  to 
the  more  rigid,  systematic,  rationally  developed, 
utopian,  and  comprehensive  nature  of  ideologies, 
mentalities  are  less  systematic  and  more  amor¬ 
phous  ways  of  thinking  and  feeling,  more  emo¬ 
tional  than  rational.  Although  the  distinction  is 
not  clear-cut  and  there  is  a  large  gray  area  in 
between,  “mentalities,”  as  recalled  by  the  German 
sociologist  Theodore  Geiger,  are,  actually,  more  or 
less  ambiguously  developed  values,  such  as  father- 
land,  nation,  order,  hierarchy,  tradition,  race,  eth¬ 
nicity,  and  others.  The  key  political  characteristic 
of  them  is  that,  precisely  because  they  are  ambigu¬ 
ous  and  intellectually  underdeveloped,  creating 
broader  political  coalitions  around  them  to  build 
the  bases  of  regime  legitimacy  is  much  easier  than 
with  precise,  developed  ideologies  (Linz,  2000). 

Lack  of  Political  Mobilization 

The  third  element  is  the  absence  of  extensive 
political  mobilization,  again  a  crucial  difference 
with  totalitarian  regimes,  even  though  authoritar¬ 
ian  regimes  may  also  have  followed  mobiliza- 
tional  policies  at  some  point  of  their  develop¬ 
ment.  Actually,  this  variable  is  closely  related  to 
the  second  one,  since  in  the  absence  of  an  ideol¬ 
ogy,  it  is  difficult  to  pursue  effective  mobiliza- 
tional  policies  and  to  attract  the  young,  students, 
and  intellectuals. 
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Borderline  Cases 

Admittedly,  in  terms  of  all  three  criteria,  the 
borderline  separating  authoritarian  regimes  from 
other  types  of  nondemocratic  regimes  on  the  one 
hand  and  from  pseudodemocracies  or  semidemoc¬ 
racies  on  the  other  is  a  rather  blurred  one.  Thus,  an 
authoritarian  regime  that  allows  a  fairly  large  area 
of  limited  pluralism  in  nonpolitical  fields  and  even 
tolerates  and  attempts  to  co-opt  some  of  its  politi¬ 
cal  opponents,  maintains  a  fairly  flexible  mentality 
that  leaves  room  for  different  interpretations,  and 
shows  little  interest  in  mobilizing  the  masses  in  a 
certain  ideological  direction  may  be  said  to 
approach  a  pseudodemocracy,  while  there  may  be 
cases  where  it  is  difficult  to  distinguish  authoritari¬ 
anism  from  totalitarianism.  Hence,  this  large  gray 
area  poses  a  number  of  difficult  borderline  prob¬ 
lems,  and  it  requires  an  effort  to  distinguish  sub- 
types  of  authoritarian  regimes.  Such  difficulties  of 
classification  are  further  aggravated  by  the  fact 
that  many  regimes  combine  elements  from  differ¬ 
ent  types  and  stress  different  elements  in  different 
phases  of  their  history.  Authoritarian  regimes  are 
often  complex  systems  combining  elements  of  con¬ 
tradictory  models  in  uneasy  coexistence. 

Subtypes  of  Authoritarian  Regimes 

Linz  distinguished  the  following  subtypes  of 
authoritarian  regimes  using  his  two  criteria  of  the 
degree  of  limited  pluralism  and  the  apathy  mobili¬ 
zation  axis: 

•  bureaucratic -military  authoritarian  regimes, 

•  organic  statism, 

•  mobilizational  authoritarian  regimes  in 
postdemocratic  societies, 

•  postindependence,  mobilizational  authoritarian 
regimes, 

•  racial  and  ethnic  democracies,  and 

•  posttotalitarian  authoritarian  regimes. 

Although  space  limitations  do  not  allow  for  a 
detailed  analysis  of  each  of  these  subtypes,  it  will 
be  useful  to  discuss  whether  this  typology  is  as 
meaningful  today,  many  decades  after  Linz’s 
masterful  essay. 

Certainly,  the  collapse  of  the  Soviet  type 
of  totalitarianism  has  made  the  fundamental  dis¬ 
tinction  between  totalitarian  and  authoritarian 


regimes  much  less  meaningful  at  least  for  practical 
purposes.  Probably  the  only  Soviet-type  totalitar¬ 
ian  regime  today  is  that  of  North  Korea.  Linz  clas¬ 
sifies  China  and  Cuba  as  posttotalitarian.  Certain 
former  Soviet  republics  of  Central  Asia  seem  to 
combine  posttotalitarian  and  sultanistic  elements. 
On  the  other  hand,  Central  and  Eastern  European 
countries  that  were  part  of  the  former  Soviet  bloc 
seem  to  have  made  a  successful  transition  to 
democracy,  while  some  of  the  former  Soviet 
republics  (Russia,  Belarus,  Ukraine,  Moldova, 
Transcaucasian  republics,  Kyrgyzstan)  are  still 
experiencing  serious  problems  of  democratic  con¬ 
solidation,  probably  because  of  their  much  longer 
totalitarian  past  and  difficulties  associated  with 
the  problem  of  “stateness.” 

Another  interesting  borderline  case  between 
authoritarianism  and  totalitarianism  is  Iran,  where 
an  almost  totalitarian  interpretation  of  a  religious 
ideology  is  combined  with  elements  of  limited  plu¬ 
ralism.  Under  the  Islamist  regime,  Islam  has  been 
transformed  into  a  political  ideology  with  a  totali¬ 
tarian  bent,  and  the  limited  pluralism  is  allowed 
only  among  political  groups  loyal  to  the  Islamic 
revolution.  Furthermore,  the  electoral  and  legisla¬ 
tive  processes  are  under  the  firm  control  of  tutelary 
institutions  such  as  the  Supreme  Leader  (appointed 
by  the  Assembly  of  Experts  composed  of  theolo¬ 
gians),  the  Council  of  Guardians  with  veto  powers 
over  all  laws  and  all  candidates  seeking  elective 
office,  and  the  Expediency  Council  added  to  the 
Constitution  in  1990  as  the  final  arbiter  in  cases  of 
disagreement  between  the  Majlis  (parliament)  and 
the  Council  of  Guardians,  all  inspired  by  Khomeini’s 
doctrine  of  “the  supremacy  of  the  Islamic  jurispru¬ 
dent”  (Velayet-e  Faqib).  These  tutelary  institutions 
effectively  blocked  the  moderate  reform  attempts 
of  President  Mohammad  Khatami  (elected  twice  in 
1997  and  2001,  himself  a  cleric  loyal  to  the  Islamic 
Revolution),  even  though  he  had  the  support  of  the 
Majlis  elected  in  2000  and  dominated  by  the 
reformists.  Thus,  Iran’s  limited  pluralism  appears 
very  limited  indeed. 

The  momentous  socioeconomic  and  political 
changes  in  the  past  3  or  4  decades,  at  the  top  of 
which  stands  “the  third  wave  of  democratiza¬ 
tion,”  made  some  of  Linz’s  subtypes  less  relevant 
today  than  they  used  to  be.  Thus,  with  the  transi¬ 
tion  to  democracy  in  the  South  African  Republic, 
the  leading  example  of  so-called  racial  or  ethnic 
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democracies  disappeared.  With  the  demise  of  total¬ 
itarianism,  defective  and  pretotalitarian  regimes  are 
no  longer  a  contemporary  category.  With  the  suc¬ 
cessful  transition  to  democracy  in  former  Soviet 
bloc  countries  of  Central  and  Eastern  Europe,  post- 
totalitarianism  is  limited  to  a  few  countries  such  as 
China  and  Cuba,  as  was  alluded  to  above. 
Mobilizational  authoritarian  regimes  in  postdemo- 
cratic  societies  also  seem  to  be  a  category  of  mainly 
historical  significance.  Such  regimes  emerged  in  the 
past  in  countries  with  some  democratic  experience 
and  a  degree  of  political  mobilization.  Therefore, 
the  authoritarian  regimes  had  to  use  a  mass  organi¬ 
zational  party  and  other  mass  organizations  in 
order  to  exclude  the  defeated  social  force  from 
political  participation.  However,  with  the  world¬ 
wide  preeminence  of  democratic  ideology  at  pres¬ 
ent,  it  seems  exceedingly  difficult  to  establish  such 
a  regime  in  a  country  with  some  prior  experience  in 
democratic  politics  and  mass  mobilization. 

Similar  observations  can  be  made  about  the 
subtype  called  “organic  statism”  by  Linz.  Such 
regimes  attempt  to  combine  the  Catholic  corpora- 
tivist  social  doctrine  with  fascist  elements  but  are 
distinct  from  fascist-totalitarian  regimes.  In  these 
regimes,  often  described  as  coorporatism  or  organic 
democracy,  a  variety  of  social  groups  or  institu¬ 
tions  that  are  created  or  licensed  by  the  state  are 
allowed  to  participate  in  some  degree  in  the  politi¬ 
cal  process.  Linz  admits,  however,  that  this  model, 
popular  in  the  1930s,  lost  its  attraction  for  a  num¬ 
ber  of  reasons.  Probably,  the  most  important  rea¬ 
son  was  the  disengagement  of  the  Catholic  Church 
from  its  commitment  to  essentially  undemocratic 
organic  theories  of  society.  A  number  of  “populist 
regimes”  that  emerged  especially  in  Latin  America 
in  the  1930s  and  later,  such  as  those  of  Getulio 
Vargas  in  Brazil  and  Juan  Peron  in  Argentina, 
interestingly,  combined  elements  of  corporatism 
with  those  of  “electoral  authoritarianism”  to  be 
discussed  later.  These  regimes  displayed  a  strong 
developmentalist,  nationalist,  statist,  and  elitist 
character.  With  the  failure  of  their  import  substitu¬ 
tion-based  development  strategy  and  the  emer¬ 
gence  of  the  currently  dominant  free  market 
economy,  however,  such  regimes  seem  to  have  lost 
their  appeal. 

Linally,  the  usefulness  of  another  subtype  called 
“tutelary  democracies”  by  Edward  Shils  is  also 
open  to  doubt.  Recent  research  has  suggested,  for 


example,  that  Kemalist  Turkey,  usually  presented 
as  the  prototype  of  such  regimes,  in  fact  bore  closer 
resemblance  to  corporatist  or  organic  statist  author¬ 
itarian  regimes.  Indeed,  it  is  difficult  to  conceive  of 
an  “enlightened”  leader  or  a  group  of  leaders  dedi¬ 
cated  to  “educate”  their  people  in  democratic  val¬ 
ues  and  prepared  to  lose  power  as  the  ultimate 
outcome  of  the  democratization  process. 

The  remaining  subtypes  of  authoritarian  regimes 
with  some  contemporary  relevance  seem  to  be  the 
military-bureaucratic  authoritarian  regimes  and 
the  postindependence  mobilizational  single-party 
regimes.  The  latter  are  normally  associated  with  a 
war  or  struggle  of  independence  from  colonial 
rule.  Countries  in  Black  Africa  and  the  Maghreb 
provide  examples  of  this  type.  Linz  observes  that 
these  regimes  emerged  in  countries  of  low  eco¬ 
nomic  development  where  the  colonial  rulers  had 
not  allowed  the  growth  of  an  indigenous  profes¬ 
sional  or  bureaucratic  middle  class  but  destroyed 
or  weakened  traditional  precolonial  authorities. 
Thus,  the  new  nationalist  leadership,  mostly 
trained  abroad,  successfully  mobilized  the  griev¬ 
ances  of  the  native  population  against  the  foreign 
rulers.  As  expected,  the  desire  for  independence 
obscured  the  importance  of  other  cleavages.  After 
a  short  period  of  experience  with  democratic  con¬ 
stitutions  inherited  or  inspired  by  their  former 
colonial  rulers,  most  of  these  regimes  were  trans¬ 
formed  into  single-party  regimes,  the  most  notable 
examples  of  which  were  Algeria,  Tunisia,  Ghana, 
Ivory  Coast,  Senegal,  and  Tanzania.  The  single¬ 
party  regimes  claimed  to  represent  the  entire 
nation  and  defined  their  chief  mission  as  nation 
building.  However,  their  mobilizational  and  ideo¬ 
logical  components  decayed  rapidly.  The  role  of 
the  mass  mobilizational  single-party  regime  lost  its 
preeminence.  Consequently,  they  moved  in  the 
direction  of  either  military-bureaucratic  or  organic 
statist  regimes.  This  process  can  be  compared  with 
“detotalitarianization  by  decay,”  namely,  the  loss 
of  the  revolutionary  ideological  elan  and  of  the 
mobilizational  capability  of  the  regime,  which  also 
explains  the  sudden  collapse  of  the  Soviet  Union 
probably  better  than  anything  else. 

Turkey  was  one  of  the  earliest  examples  of  sin¬ 
gle-party  regimes  and,  as  such,  attracted  the  atten¬ 
tion  of  a  great  number  of  Western  scholars,  includ¬ 
ing  Maurice  Duverger.  The  Republican  People’s 
Party  (RPP)  was  founded  by  Kemal  Atatiirk,  the 
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leader  of  the  War  of  Independence  (1919-1922)  in 
1923,  and  it  consolidated  its  monopoly  in  1925  by 
outlawing  the  only  opposition  party,  the  Progressive 
Republican  Party.  It  maintained  such  monopoly 
until  the  end  of  the  World  War  II,  when  the  party 
leadership  allowed  the  formation  of  an  opposition 
party,  the  Democratic  Party  and  lost  power  to  it  in 
the  first  free  election  of  1950.  The  RPP  (and  its 
ancestor,  the  Society  of  Defense  of  Rights  for 
Anatolia  and  Rumelia)  was  successful  in  achieving 
a  significant  degree  of  popular  mobilization  during 
the  War  of  Independence,  appealing  to  and  co¬ 
opting  Muslim  religious  orders,  Kurdish  tribes, 
radical  republicans,  and  loyal  monarchists,  and  so 
on.  After  the  victory,  however,  the  RPP  was 
quickly  transformed  into  an  “exclusionary”  single 
party,  in  the  words  of  Samuel  Huntington.  It 
showed  little  interest  in  mass  mobilization  and 
concentrated  its  efforts  on  nation  building  and 
secularizing  the  state  and  society  by  top-down 
authoritarian  methods,  almost  a  “mission  impos¬ 
sible”  in  a  deeply  divided,  pluralistic,  and  devoutly 
religious  society. 

The  military-bureaucratic  authoritarian  regimes 
emerged  as  probably  the  most  widespread  type  of 
authoritarianism  in  the  1960s,  especially  in  Latin 
America  but  also  elsewhere.  Guillermo  O’Donnell 
and  others  have  argued  that  the  emergence  of  such 
regimes  as  a  new  type  of  authoritarian  rule  in  Latin 
America  was  linked  to  the  failure  of  import  substi¬ 
tution-based  industrialization  (ISI).  The  early 
phases  of  this  industrialization  strategy  gave  rise  to 
populist  coalitions  that  brought  together  national 
industrialists,  urban  professionals,  and  the  emerg¬ 
ing  urban  working  class.  Both  components  of  the 
populist  coalition  benefited  from  this  expansionist 
policy:  Import-substituting  industrialists  were 
heavily  protected  against  foreign  competition  by 
import  quotas,  tariffs,  and  foreign  exchange  regu¬ 
lations;  the  urban  working  class,  in  turn,  obtained 
employment  opportunities,  union  rights,  welfare 
benefits,  and  relatively  higher  wages  because  the 
growth  of  the  domestic  market  was  supposed  to 
further  stimulate  the  ISI. 

The  early,  easy  phase  of  the  ISI,  however,  was 
soon  exhausted.  Once  the  limits  of  growth  of 
domestic  markets  had  been  reached,  the  ineffi¬ 
cient,  internationally  noncompetitive  national 
industries  failed  to  produce  the  necessary  foreign 
exchange  earnings.  Thus,  ISI-based  economies 


faced  severe  problems,  such  as  balance-of-payment 
deficits,  foreign  exchange  shortages,  low  or  nega¬ 
tive  growth  rates,  inflation,  and  unemployment. 
The  economic  crisis,  in  turn,  led  to  the  radicaliza- 
tion  of  the  popular  sector  (the  industrial  working 
class)  and  to  increased  political  polarization.  This 
created  fears  among  the  urban  middle  classes  of  a 
Cuban-style  radical  leftist  revolution.  With  the 
support  of  this  middle  class,  the  military  took  over 
in  many  important  countries  of  Latin  America 
such  as  Brazil,  Argentina,  Chile,  and  Uruguay  and 
established  what  is  known  as  the  bureaucratic- 
military  authoritarian  regimes.  The  fact  that  these 
regimes  emerged  in  the  most  highly  developed 
countries  of  Latin  America  lends  support  to 
O’Donnell’s  economy-based  explanation. 

Such  regimes  differ  from  earlier  types  of  military 
regimes  in  Latin  America  (poder  moderador )  in 
that  the  latter  were  of  relatively  short  duration  and 
intended  to  solve  a  particular  constitutional  crisis, 
while  the  former  were  of  much  longer  duration 
and  attempted  to  change  the  economic  structure  in 
a  more  free  market,  export-oriented  direction.  To 
accomplish  this  objective,  they  tried  to  demobilize 
the  already  activated  popular  sector  by  severely 
restricting  its  political  participation  opportuni¬ 
ties.  In  this  effort,  they  also  enlisted  the  help  of  a 
large  group  of  experts,  technocrats,  and  managers 
with  problem-solving  capabilities.  Hence,  these 
regimes  are  often  called  “military-technocratic- 
bureaucratic  authoritarian  regimes”  to 
distinguish  them  from  the  more  bureaucratic- 
military-oligarchic  authoritarian  regimes  of  inter¬ 
war  Eastern  Europe. 

Although  the  bureaucratic  authoritarian  regimes 
have  been  analyzed  mostly  within  the  Latin 
American  context,  the  applicability  of  this  model  is 
not  limited  to  that  continent.  In  fact,  the  National 
Security  Council  (NSC)  regime  in  Turkey  (1980— 
1983)  bears  strong  resemblances  to  its  Latin 
American  counterparts.  It  also  followed  a  period 
of  deep  economic  crisis  as  a  result  of  the  failure  of 
an  ISI  strategy,  accompanied  by  intense  political 
polarization,  radicalization  of  the  working  class, 
and  the  increasing  fears  of  the  middle  classes.  Like 
its  Latin  American  counterparts,  the  NSC  regime 
pursued  policies  aimed  at  shifting  the  basis  of  the 
economy  from  an  ISI  to  an  export  promotion 
model,  while  trying  to  demobilize  the  working 
class  and  depoliticize  the  society  at  large. 
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Parallel  to  the  “global  resurgence  of  democ¬ 
racy,”  pure  and  simple  military  regimes  have  virtu¬ 
ally  disappeared,  except  perhaps  for  a  transitional 
period.  As  Larry  Diamond  observes,  the  interven¬ 
tionist  armies  today  try  to  either  legitimize  their 
rule  by  running  for  president  in  contested  but 
manipulated  elections  or  carve  out  for  themselves 
a  large  autonomous  sphere  in  the  form  of  tutelary 
powers  and  reserved  domains  behind  a  veil  of  civil¬ 
ian,  multiparty  rule. 

Electoral  Authoritarianism 

While  some  of  the  subtypes  in  Linz’s  original 
typology  have  lost  their  contemporary  relevance, 
new  borderline  problems  have  arisen  as  a  result  of 
the  rise  of  democratic  ideology  as  the  almost  unri¬ 
valed  source  of  legitimacy.  Many  otherwise  quite 
authoritarian  regimes  have  been  obliged  to  accept 
at  least  some  outward  trappings  of  democratic 
regimes.  As  a  result,  while  in  the  past,  the  bound¬ 
ary  between  authoritarian  and  totalitarian  regimes 
was  not  very  clear,  as  opposed  to  the  relatively 
well-defined  boundary  between  nondemocratic 
and  democratic  systems,  today  the  reverse  seems  to 
be  true.  Indeed,  there  seems  to  be  a  large  gray  area 
between  authoritarian  regimes  and  certain  types  of 
democracies  variously  called  “electoral  democra¬ 
cies,”  “semidemocracies,”  “pseudodemocracies,” 
and  so  on.  A  newly  coined  term,  electoral  authori¬ 
tarianism ,  suggests  the  difficulty  of  arriving  at  a 
neat  classification.  Electoral  or  competitive  author¬ 
itarian  regimes  are  not  among  the  subtypes  in 
Linz’s  original  typology,  for  understandable  rea¬ 
sons.  As  Diamond  argues,  this  type  of  hybrid 
regime  is  very  much  a  product  of  the  contempo¬ 
rary  world.  The  worldwide  trend  toward  democ¬ 
racy  has  forced  many  authoritarian  regimes  to 
mimic  some  forms  of  electoral  competition. 
Andreas  Schedler  argues,  in  the  same  vein,  that  on 
the  bases  of  2001  data,  out  of  151  countries  in  the 
“developing  world,”  36  (23.8%)  are  liberal  democ¬ 
racies,  32  (21.2%)  are  electoral  democracies,  58 
(38.4%)  are  electoral  authoritarian  regimes,  and 
25  (16.5%)  are  closed  authoritarian  regimes. 
Thus,  electoral  authoritarian  regimes  make  up 
more  than  two  thirds  of  all  autocracies. 

The  breadth  of  this  gray  area  between  liberal 
democracies  and  full-scale  (closed)  authoritarian 
regimes  may  necessitate  the  search  for  further 


subdivisions  within  this  category.  Thus,  Schedler 
distinguishes  between  electoral  democracies  and 
electoral  authoritarianism,  while  Diamond,  in  a 
more  nuanced  distinction,  divides  it  into  four  sub- 
types  as  electoral  democracies,  ambiguous  regimes, 
competitive  authoritarian  regimes,  and  hegemonic 
electoral  authoritarian  regimes.  Both  authors  agree 
that  the  boundaries  between  these  regime  types  are 
“blurry”  or  “foggy”  and  that  it  is  difficult  to  fit 
elusive  realities  into  ideal  types.  The  distinction 
between  liberal  and  electoral  democracies  is  based 
on  variables  other  than  electoral  competitiveness, 
since  both  conduct  reasonably  free  and  competi¬ 
tive  elections  as  a  minimum  condition  for  democ¬ 
racy.  Electoral  democracies  fail  to  institutionalize 
the  other  vital  dimensions  of  liberal  democracies, 
such  as  the  rule  of  law,  political  accountability,  full 
protection  of  civil  and  political  rights,  and  so  on. 
Another  feature  distinguishing  electoral  democra¬ 
cies  from  liberal  democracies  may  be  the  presence 
of  tutelary  powers  and  reserved  domains  in  some 
of  them  outside  the  purview  of  elected  officials, 
most  often  in  favor  of  the  military  but  also  some¬ 
times  in  favor  of  clerical  leaders  as  in  the  case  of 
Iran.  Turkey,  where  free  and  fair  elections  have 
been  regularly  held  since  1950,  but  the  military 
have  enjoyed  significant  tutelary  powers  and 
reserved  domains  since  its  intervention  in  1960, 
and  even  more  so  after  its  intervention  in  1980,  is 
a  good  example  of  such  cases. 

On  the  other  hand,  the  difference  between  elec¬ 
toral  democracies  and  electoral  authoritarian 
regimes  centers  on  the  fairness  and  competitiveness 
of  the  electoral  process.  The  criteria  for  free  and 
fair  elections  are  well  known  and  need  not  be 
repeated  here.  However,  as  Schedler  puts  it,  his¬ 
torically,  elections  have  been  an  instrument  of 
authoritarian  control  as  well  as  a  means  of  demo¬ 
cratic  governance.  And  at  present,  there  are  many 
regimes  where  multiparty  elections  are  regularly 
held,  but  many  basic  democratic  norms  are  system¬ 
atically  violated.  Therefore,  they  should  be  classi¬ 
fied  as  authoritarian  regimes.  Indeed,  the  authori¬ 
tarian  use  of  elections  is  nothing  new,  as  attested 
by  an  old  but  influential  book,  Elections  Without 
Choice  (1978),  edited  by  Guy  Hermet,  Richard 
Rose,  and  Alain  Rouquie,  providing  earlier  exam¬ 
ples  of  such  regimes.  As  Schedler  rightly  observes, 
“the  menu  of  manipulation”  available  to  authori¬ 
tarian  rulers  is  quite  rich  and  varied,  ranging  from 
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banning  parties  and  disqualifying  candidates  to 
repressive  policies  against  dissenters  during  elec¬ 
tion  campaigns,  fraud  and  intimidation,  informal 
disenfranchisement,  unequal  use  of  state  resources, 
adoption  of  favorable  redistributive  rules,  and  so 
on.  Certainly,  one  can  further  subdivide  the  elec¬ 
toral  authoritarianism  category  into  competitive 
authoritarian  and  hegemonic  authoritarian  sub- 
types,  as  did  Diamond,  depending  on  their  degree 
of  competitiveness.  In  his  view,  competitive  author¬ 
itarian  regimes  are  defined  by  the  presence  of  a 
significant  parliamentary  opposition,  whereas  in 
the  hegemonic  subtype,  elections  are  largely  an 
authoritarian  facade:  The  ruling  or  dominant 
party  wins  almost  all  the  seats.  Similarly,  in  hege¬ 
monic  electoral  authoritarian  regimes,  the  presi¬ 
dent  is  elected  with  a  great  majority  of  the  popular 
vote.  Diamond  admits,  however,  the  difficulty  of 
distinguishing  between  the  two  types.  At  any  rate, 
the  difference  seems  to  be  one  of  degree  rather 
than  of  kind.  No  doubt,  they  both  firmly  belong  to 
the  authoritarian  type,  but  both  differ  from  full- 
fledged  or  “closed”  authoritarian  regimes. 

The  blurry  borderline  between  electoral  democ¬ 
racies  and  electoral  authoritarian  regimes  poses 
difficult  methodological  problems.  As  Schedler 
rightly  argues,  under  conditions  of  electoral  manip¬ 
ulation,  official  election  figures  do  not  faithfully 
reflect  the  actual  distribution  of  citizen  prefer¬ 
ences.  However,  it  is  difficult  to  observe  electoral 
manipulations,  since  these  are  usually  “undercover 
activities,”  constituting  “an  impenetrable  black 
box.”  Nevertheless,  available  information  on  the 
state  of  political  rights  in  general  may  give  us  clues 
about  the  extent  of  electoral  manipulation.  Thus, 
unequal  access  to  state  resources  or  mass  media, 
restrictions  on  freedom  of  expression  and  associa¬ 
tion,  bans  on  political  party  activities,  adoption  of 
electoral  rules  favorable  to  the  governing  party, 
gerrymandering,  and  so  on  may  provide  reason¬ 
ably  reliable  indicators  to  recognize  an  electoral 
authoritarian  regime.  Observations  by  interna¬ 
tional  election  observers,  if  available,  may  provide 
another  source  of  information. 

As  was  pointed  out  above,  electoral  authoritar¬ 
ian  regimes  constitute  the  largest  group  of  coun¬ 
tries  in  the  developing  world.  Certainly,  the  rise  of 
this  type  is  closely  associated  with  the  global  resur¬ 
gence  of  democracy.  With  the  collapse  of  Soviet- 
type  totalitarianism,  democracy  has  become  the 


only  legitimate  form  of  government  in  the  eyes  of 
a  much  greater  portion  of  the  world’s  population. 
As  a  result  of  a  favorable  international  climate, 
active  promotion  and  encouragement  of  demo¬ 
cratic  change  by  major  Western  powers,  the  desire 
to  gain  international  respectability,  and  domestic 
pressures  associated  with  rising  levels  of  socioeco¬ 
nomic  development,  some  forms  of  authoritarian¬ 
ism  have  become  less  sustainable.  Thus,  most 
hybrid  regimes  today  are  quite  deliberately  pseu- 
dodemocratic,  dreaming,  in  Schedler’s  words,  of 
reaping  the  fruits  of  electoral  legitimacy  without 
running  the  risks  of  democratic  uncertainty.  They 
mimic  democratic  institutions  not  only  in  the  elec¬ 
toral  field  but  also  by  creating  mimic  constitu¬ 
tional  courts  and  adopting  other  outward  trap¬ 
pings  of  a  system  based  on  the  rule  of  law. 
Conceivably,  such  institutions  create  potential 
areas  of  dissidence  and  conflict,  even  if  they  remain 
far  from  ensuring  democratic  accountability. 
Nevertheless,  the  concept  of  electoral  authoritari¬ 
anism  centers  on  the  electoral  area,  as  its  name 
indicates. 

Thus,  electoral  authoritarian  regimes  combine 
two  contradictory  dynamics  in  uneasy  coexistence. 
On  the  one  hand,  as  Schedler  points  out,  by  per¬ 
mitting  multiparty,  competitive  elections,  they 
recognize  the  principle  of  democratic  legitimacy 
instead  of  other  sources  of  legitimacy  (such  as 
revolutionary,  transcendental,  communitarian, 
charismatic,  or  substantive).  On  the  other  hand, 
they  subvert  it  in  practice  using  various  instru¬ 
ments  of  authoritarian  manipulation.  As  opposed 
to  democratic  regimes  in  which  the  rules  of  elec¬ 
toral  game  are  accepted  and  respected  by  all  par¬ 
ties,  in  electoral  authoritarian  regimes,  competi¬ 
tion  is  not  only  “within  rules”  but  also,  and  more 
important,  “over  rules.”  On  the  other  hand,  as 
opposed  to  closed  authoritarian  regimes,  the  rulers 
have  to  find  a  balance  between  “electoral  persua¬ 
sion”  and  “electoral  manipulation.” 

This  mixture  of  contradictory  elements  makes 
the  future  of  electoral  authoritarian  regimes  highly 
uncertain  and  unpredictable.  Again,  as  Schedler 
points  out,  the  game  of  authoritarian  elections 
may  lead  to  a  process  of  gradual  democratization 
as  in  Mexico  and  Senegal.  It  may  lead  to  democ¬ 
racy  through  the  sudden  collapse  of  authoritarian¬ 
ism  as  in  Peru  and  Serbia.  It  may  also  end  up  in  an 
authoritarian  regression.  Or  such  regimes  may 
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survive  for  an  extended  period,  through  various 
successful  survival  strategies  of  the  rulers. 

At  the  other  end  of  the  democracy-autocracy 
continuum,  “closed  authoritarianism”  seems  to  be 
the  relatively  smaller  category,  but  a  stable  one. 
Thus,  during  the  past  10  years,  such  a  nonfree 
regime  went  from  25%  of  all  states  (48  in  total; 
Freedom  in  the  World  2000-2001 )  to  24%  (47  in 
total;  Freedom  in  the  World  2009).  They  may  com¬ 
bine  some  elements  of  “sultanistic  regimes,”  con¬ 
sidered  a  separate  type  by  Linz.  He  has  argued  that 
the  exceedingly  personalistic  and  arbitrary  nature 
of  such  regimes  distinguishes  them  from  the  more 
institutionalized  and  predictable  behavior  of  the 
authoritarian  regimes.  In  sultanistic  regimes,  the 
ruler  exercises  his  power  unrestrained  by  legal  rules 
or  ideological  commitments.  Public  bureaucracy 
becomes  the  personal  entourage  and  servants  of  the 
ruler.  Classic  examples  of  sultanistic  regimes  are 
the  Trujillo  regime  in  the  Dominican  Republic,  the 
Batista  regime  in  Cuba,  the  Somoza  regime  in 
Nicaragua,  the  Duvalier  regime  in  Haiti,  the  Pahlavi 
regime  in  Iran,  and  the  Marcos  regime  in  the 
Philippines.  More  recent  examples  may  include  the 
Ceau§escu  regime  in  Romania,  the  Saddam  Hussein 
regime  in  Iraq,  and  the  North  Korean  regime. 

If  we  look  at  the  regional  distribution  of 
authoritarian  regimes,  we  see  its  highest  incidence 
in  the  Middle  East  and  the  North  Africa  and  its 
lowest  in  Eastern  Europe,  with  sub-Saharan  Africa 
in  between.  According  to  Schedler’s  figures  (as  of 
early  2006),  11  sub-Saharan  African  countries 
(Burkina  Faso,  Cameroon,  Chad,  Ethiopia, 
Gabon,  Gambia,  Guinea,  Mauritania,  Tanzania, 
Togo,  and  Zambia)  are  in  the  electoral  authoritar¬ 
ian  category.  According  to  his  figures  based  on 
2001  data,  52.6%  of  the  Middle  Eastern  and 
North  African  regimes  are  electoral  authoritarian 
regimes  and  42.1%  belong  to  closed  authoritari¬ 
anism  category.  Israel  is  the  only  liberal  democ¬ 
racy  in  his  classification,  with  Turkey  included  in 
the  electoral  authoritarian  category.  In  contrast, 
57.9%  of  Eastern  European  states  are  liberal 
democracies,  15.8%  electoral  democracies,  26.3% 
electoral  authoritarian  regimes;  no  Eastern 
European  regime  is  in  the  closed  authoritarianism 
category.  The  success  of  the  Eastern  European 
cases  can  be  explained  by  the  geographical  and 
cultural  proximity  of  Western  democracies  and 
many  forms  of  international  influences,  including 


the  European  Union  conditionality.  The  Middle 
East  and  North  Africa,  on  the  other  hand,  stand 
as  the  major  geographical  region  of  the  world 
least  affected  by  the  third  wave  of  democratiza¬ 
tion,  a  fact  that  requires  a  detailed  explanation. 

Islam  and  Democracy 

One  explanation  for  the  limited  democratization 
in  the  Middle  East  and  North  Africa,  based  on  the 
alleged  inherent  incompatibility  between  democ¬ 
racy  and  Islam,  the  dominant  religion  of  the 
region,  must  be  treated  with  utmost  care.  It  has 
been  argued  by  Orientalists,  for  example,  that 
many  notions  associated  with  Western  democracy, 
such  as  the  notions  of  popular  sovereignty,  repre¬ 
sentation,  elections,  secular  laws,  an  independent 
judiciary,  and  a  civil  society  composed  of  a  multi¬ 
tude  of  autonomous  groups,  are  alien  to  the 
Muslim  political  tradition.  Such  statements  abound 
in  the  Orientalist  literature  and  have  deeply  influ¬ 
enced  the  Western  conceptions  of  Islam,  so  much 
so  that  Huntington  argued  that  the  West’s  prob¬ 
lem  is  not  Islamic  fundamentalism  but  Islam.  It  is 
beyond  the  scope  of  this  entry  to  discuss  the  philo¬ 
sophical,  historical,  and  empirical  flaws  of  this 
argument.  Suffice  it  to  say  that  Islam,  like  any 
other  major  religion,  is  not  a  monolith  and  that  it 
has  given  rise  to  different  interpretations  and  prac¬ 
tices  at  different  times  and  in  different  places. 
Likewise,  historically  speaking,  it  has  been  com¬ 
patible  with  different  forms  of  government. 

Looked  at  from  a  purely  empirical  point  of 
view,  what  is  claimed  to  be  “Muslim  exceptional- 
ism”  seems  more  like  “Arab  exceptionalism.” 
Thus,  Alfred  Stepan  convincingly  argues  that  Islam 
is  sometimes  misleadingly  equated  with  Arab  cul¬ 
ture.  He  observes  that,  while  there  are  no  democ¬ 
racies  in  the  Islamic  countries  of  the  Arab  world, 
about  half  of  all  the  world’s  Muslims  live  in 
democracies,  near-democracies,  or  intermittent 
democracies.  He  then  analyzes  the  cases  of 
Indonesia,  Bangladesh,  Pakistan,  and  Turkey  as 
examples  of  such  hybrid  regimes.  Vali  Nasr 
describes  the  same  cases  (plus  Malaysia)  as  exam¬ 
ples  of  the  “rise  of  Muslim  democracy,”  which  he 
compares  with  the  early  stages  of  Christian  demo¬ 
cratic  parties  in  Europe.  He  states  that,  unlike 
hard-line  Islamists,  Muslim  democrats  have  a 
pragmatic  perspective  and  seek  to  build  governing 
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coalitions  within  a  democratic  arena  whose  bounds 
they  respect. 

Turkey,  with  its  militantly  secular  constitu¬ 
tional  and  legal  system  and  its  more  than  60  years 
of  (albeit  interrupted)  democratic  experience,  is 
perhaps  the  leading  example  in  this  category. 
However,  the  extensive  tutelary  powers  enjoyed  by 
the  military,  and  its  less  than  desirable  record  on 
political  rights  (judged  by  the  ease  and  frequency 
of  party  closures),  allow  us  to  classify  it  only  as  an 
electoral  democracy.  Thus,  according  to  the 
Freedom  in  the  World  2009  report,  it  is  an  elec¬ 
toral  democracy  in  the  “partly  free”  category  with 
a  rating  of  3  for  both  political  rights  and  civil  liber¬ 
ties,  on  a  scale  of  1  to  7,  with  7  being  the  lowest 
score.  According  to  the  same  survey,  the  major 
non- Arab,  Muslim-majority  countries  have  the  fol¬ 
lowing  ratings  (the  first  figure  refers  to  the  coun¬ 
tries’  score  on  political  rights  and  the  second  on 
civil  liberties):  Albania  (3  and  3),  Bangladesh 
(4  and  4),  Bosnia-Herzegovina  (4  and  3),  Indonesia 
(2  and  3),  Malaysia  (4  and  4),  Pakistan  (4  and  5), 
Senegal  (3  and  3),  and  Turkey  (3  and  3).  Of  these, 
Indonesia  is  placed  in  the  “free”  category  and  the 
rest  in  the  “partly  free”  category,  and  all  are  rated, 
except  for  Malaysia  and  Pakistan,  as  “electoral 
democracies.” 

Although  the  Freedom  House’s  threefold  clas¬ 
sification  of  “free,”  “partly  free,”  and  “not  free” 
categories  are  not  identical  to  Schedler’s  fourfold 
(liberal  democracy,  electoral  democracy,  electoral 
authoritarianism,  and  closed  authoritarianism)  or 
Diamond’s  sixfold  (liberal  democracy,  electoral 
democracy,  ambiguous  regimes,  competitive 
authoritarian,  hegemonic  electoral  authoritarian, 
and  politically  closed  authoritarian)  typologies, 
there  is,  no  doubt,  a  large  area  of  overlap.  Thus, 
liberal  democracy  clearly  corresponds  to  the 
“free”  category,  while  closed  authoritarianism 
corresponds  to  “not  free.”  On  the  other  hand, 
Freedom  House’s  partly  free  category  includes 
both  electoral  democracies  and  electoral  authori¬ 
tarian  regimes  (or  the  four  intermediate  regime 
types  in  Diamond’s  still  finer  typology),  and  the 
borderline  between  the  two  may  not  always  be 
clear.  It  may  be  said  as  a  rule  of  thumb,  however, 
that  countries  with  a  Freedom  House  score  of  3  or 
4  may  be  considered  electoral  democracies,  while 
those  with  a  score  of  5  or  6  belong  to  the  electoral 
authoritarian  category. 


Among  the  non- Arab,  Muslim-majority  coun¬ 
tries,  the  former  Soviet  states  occupy  a  special 
place  in  that  they  are  all  in  the  “not  free”  or 
“closed  authoritarian”  category,  with  the  excep¬ 
tion  of  Kyrgyzstan,  which  is  rated  partly  free  by 
the  Freedom  House  (with  scores  of  5  and  4). 
Thus,  Azerbaijan,  Kazakhstan,  and  Tajikistan 
have  scores  of  6  and  5,  while  Turkmenistan  and 
Uzbekistan  are  at  the  authoritarian  end  of  the 
continuum  with  scores  of  7  and  7.  This  can  be 
explained  by  their  strong  totalitarian  legacy.  Iran 
(discussed  above)  and  Afghanistan  are  the  other 
two  special  cases. 

Authoritarianism  in  the  Arab  World 

The  situation  is  not  at  all  encouraging  in  the  Arab 
core  of  the  Middle  East  and  North  Africa.  In  2000, 
among  16  countries,  10  (Algeria,  Egypt,  Iraq,  Libya, 
Oman,  Qatar,  Saudi  Arabia,  Syria,  Tunisia,  and  the 
United  Arab  Emirates) — or  12  (if  we  include  two 
border  states  Mauritania  and  Sudan  that  have  an 
Arab  majority) — were  in  the  “not  free”  category 
and  only  6  (Bahrain,  Jordan,  Kuwait,  Lebanon, 
Morocco,  and  Yemen)  in  the  “partly  free”  category. 
During  the  decade,  Bahrain,  Jordan,  and  Yemen 
came  into  the  “not  free”  category,  and  consequently, 
in  2009,  Freedom  House  scores  for  the  remaining 
partly  free  Arab  countries  were  as  follows:  Kuwait 
(4  and  4),  Lebanon  (5  and  3),  and  Morocco  (5  and 
4).  Even  more  interesting,  Stepan  observes  that, 
while  the  16  Arab  countries  are  “underachievers”  in 
electoral  competitiveness  (relative  to  what  one 
would  expect  from  their  levels  of  gross  domestic 
product  per  capita),  the  31  Muslim-majority  but 
non-Arab  countries  are  “overachievers.” 

To  explain  “Arab  exceptionalism”  requires  no 
less  an  effort  than  explaining  the  much  more 
debatable  “Muslim  exceptionalism.”  It  does  not 
seem  convincing  to  attribute  the  Arab  democracy 
gap  to  certain  inherent  characteristics  of  the  Arab 
political  culture.  Therefore,  one  should  look  for 
political,  rather  than  ethnic  or  religious,  particu¬ 
larities  of  the  region  to  explain  such  exceptional¬ 
ism.  But  exactly  what  kind  of  “political”  particu¬ 
larities?  Indeed,  authoritarianism  in  the  Arab 
world  has  given  rise  to  an  extensive  literature. 
Although  it  is  not  possible  to  give  the  subject  a  full 
treatment  here,  the  most  common  explanations 
can  be  summarized  as  follows. 
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One  such  explanation  is  related  to  the  historical 
conditions  under  which  Arab  states  emerged.  As 
Stepan  correctly  states,  many  contemporary  Arab 
states  have  relatively  new  and  arbitrary  boundaries 
cut  out  of  the  Ottoman  Empire  and  were  after¬ 
ward  occupied  and  colonized  by  European  pow¬ 
ers.  Compounding  this  problem  of  state  identity  is 
the  widespread  appeal  of  pan-Arabism  in  many 
parts  of  the  region,  based  on  a  common  language 
and  common  religion.  Against  this  background,  it 
is  remarkable  that  the  new  Arab  territorial  states 
have  shown  a  high  degree  of  resilience  and  durabil¬ 
ity.  Egypt,  with  its  strong  sense  of  identity  and 
tradition  of  statehood  going  back  to  several  mil¬ 
lennia,  may  be  considered  an  exceptional  case. 
Another  related  factor  is  the  impact  of  the  colonial 
or  semicolonial  experience  in  the  political  culture 
of  Arab  states,  where  independence  from  foreign 
rule  was  the  dominant  objective  overshadowing 
concerns  such  as  democracy  and  human  rights. 

A  second  explanation  centers  on  the  concept  of 
“rentier  state.”  Many  Arab  countries  are  rich  in  oil 
and  natural  gas  reserves.  The  vast  income  deriving 
from  these  sources  enabled  the  ruling  elites  to  buy 
or  co-opt  some  elements  of  the  potential  opposi¬ 
tion  as  well  as  satisfy  the  basic  material  needs  of  a 
large  majority  of  the  population.  This  has  had  a 
retarding  effect  on  the  emergence  of  an  indepen¬ 
dent  bourgeoisie  that  is  the  driving  force  for 
democratization  in  Muslim  as  well  as  other  coun¬ 
tries.  As  Nasr  says,  Muslim  democracy  needs  the 
bourgeoisie,  and  the  bourgeoisie  needs  Muslim 
democracy.  Nevertheless,  the  rentier  state  hypoth¬ 
esis  cannot  explain  the  prevalence  of  authoritari¬ 
anism  in  oil-poor  Arab  states. 

A  third  argument,  perhaps  the  most  convincing 
one,  provides  a  cyclical  explanation.  Thus,  the 
repressive  policies  of  Arab  authoritarian  rulers 
against  all  opposition  groups,  secular  and  Islamist 
alike,  and  their  refusal  to  even  partially  liberalize 
the  system  radicalize  the  opposition.  The  chief 
beneficiary  of  this  process  is  the  Islamist  opposi¬ 
tion,  much  stronger,  more  broadly  based,  and  bet¬ 
ter  organized  than  secular  opposition  groups.  This, 
in  turn,  increases  the  perception  of  threat  (real, 
exaggerated,  or  manipulated)  on  the  part  of  the 
ruling  groups  and  is  used  to  justify  even  more 
repressive  policies.  This  fear,  although  highly 
exaggerated  and  manipulated,  is  also  vivid  within 
the  Turkish  state  elites  (particularly  the  military 


and  the  judiciary),  and  it  constitutes  one  of  the 
most  formidable  roadblocks  on  the  way  to  further 
democratization. 

The  feeling  of  an  Islamist  threat  also  leads  to  a 
paradoxical  situation  in  that  liberal  or  secular  oppo¬ 
sition  groups  that  should  normally  have  been  on  the 
side  of  democratic  reforms  also  support  repressive 
policies  against  the  Islamist  opposition.  As  Daniel 
Brumberg  argues,  the  alliance  between  potential 
democrats  and  the  authoritarian  states  in  the  Middle 
East  is  the  reverse  of  the  political  reform  process  in 
Eastern  Europe.  In  the  latter  cases,  the  not  too 
democratic  elites  nonetheless  found  democratic  pro¬ 
cedures  useful  for  dealing  with  the  opponents  of  the 
regime,  thereby  leading  to  a  “democracy  without 
democrats.  ”  In  the  Middle  East,  by  contrast,  fear  of 
Islamist  victories  has  produced  “autocracy  with 
democrats,”  as  potential  democrats  now  actively 
support  or  at  least  tolerate  autocrats  as  a  lesser  evil 
than  an  Islamist  regime.  A  similar  process  is  under 
way  in  Turkey  in  the  ostensibly  “social  democratic” 
and  militantly  secularist  RPP’s  (Cumhuriyet  Halk 
Partisi)  support  for  the  military’s  harsh  attitudes  and 
manipulative  maneuvers  against  the  governing 
Justice  and  Development  Party  (Adalet  ve  Kalkinma 
Partisi;  AKP),  even  though  the  latter  can  in  no  way 
be  considered  an  Islamist  or  even  a  post-Islamist 
party. 

The  dilemma  of  the  authoritarian  rulers  in  the 
Middle  East  explains  the  zigzagged  course  of  liber¬ 
alization  policies.  It  has  been  observed  that  peri¬ 
ods  of  relative  liberalization  have  often  been  fol¬ 
lowed  by  those  of  deliberalization.  Although  in 
some  Arab  countries,  such  as  Jordan,  Morocco, 
Algeria,  Kuwait,  and  Egypt,  controlled  or  semi- 
competitive  elections  have  taken  place,  and  mod¬ 
erate  advances  have  been  made  in  areas  such  as  the 
freedom  of  the  press,  association,  and  assembly, 
none  of  these  regimes  has  advanced  beyond  the 
level  of  electoral  (or  competitive)  authoritarian¬ 
ism.  Many  studies  of  these  cases  are  rather  opti¬ 
mistic,  probably  reflecting  the  democratizing  bias 
of  their  authors. 

And  yet  it  seems  a  fruitful  line  of  inquiry  to 
distinguish  between  the  relatively  more  successful 
and  less  successful  of  these  cases.  Thus,  Brumberg 
distinguishes  between  what  he  calls  “dissonant” 
states  and  “harmonic”  states.  The  dissonant  states 
leave  some  room  for  competitive  or  dissonant 
politics.  The  examples  are  Morocco,  Egypt,  Jordan, 
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Kuwait,  and  Lebanon.  That  such  regimes,  however 
autocratic,  recognize  and  do  not  attempt  to  destroy 
societal  pluralism  creates  an  arena  for  competition 
and  negotiation.  The  “harmonic”  states,  on  the 
other  hand,  in  their  attempt  to  impose  an  often 
artificial  unity  through  repression  and  cooptation, 
leave  little  room  for  negotiation  and  compromise 
and  create  a  deadly  game  of  “winner  takes  all.” 
These  are  the  states  based  on  an  ideology  (be  it 
Islam,  Arab  socialism,  or  Kemalism)  and  a  strong 
sense  of  mission  that  attempt  to  mold  the  society 
according  to  their  image  of  the  “good  society.” 
John  Waterbury  characterizes  such  states  as  “ends- 
oriented”  states,  arguing  that  since  the  mission  is 
considered  sacred,  debates  over  ends  or  means  are 
seen  as  subversive  or  blasphemous. 

The  dissonant  Arab  states  and  those  in  the  non- 
Arab  parts  of  the  Muslim  world  permitted  a  degree 
of  electoral  competition  that,  in  turn,  helped  mod¬ 
erate  the  Islamist  groups  and  integrate  them  into 
the  political  system.  Turkey  is  the  leading  example 
of  this  process,  where  the  earlier  Islamist  parties 
were  transformed  into  a  moderate,  center-right, 
conservative  democratic  party  (the  AKP).  Other 
examples  include  Bangladesh,  Indonesia,  Malaysia, 
and  Pakistan.  Similar  processes  are  under  way  in 
Morocco,  Jordan,  Kuwait,  and  Lebanon.  As  Nasr 
observes,  competitive  elections  push  religious  par¬ 
ties  toward  pragmatism  and  make  other  parties 
more  sensitive  to  Islamic  values  in  a  game  to  win 
the  middle.  Electoral  competition  may  eventually 
lead  to  a  situation  of  “democracy  without  demo¬ 
crats,”  and  this  surely  is  much  more  preferable  to 
that  of  “autocracy  with  democrats.” 

Conclusion 

The  early  optimism  and  enthusiasm  about  the 
global  resurgence  of  democracy  have  been  replaced 
by  a  sense  of  greater  realism.  It  is  clear  that  author¬ 
itarian  regimes,  in  one  variety  or  other,  have 
shown  greater  resilience  and  staying  power  than 
originally  anticipated,  using  a  skillful  mixture  of 
coercion,  co-optation,  divide-and-rule  tactics,  and 
selective  democratic  openings.  However,  just  as 
the  prospects  for  sustaining  full-scale  (or  closed) 
authoritarian  regimes  in  an  age  of  democratization 
are  not  good,  so  are  the  prospects  for  electoral  (or 
competitive)  authoritarian  regimes.  The  uneasy 
mixture  of  democratic  and  authoritarian  elements 


creates  an  inherent  source  of  instability,  with  the 
possibility  of  going  in  the  direction  of  either  elec¬ 
toral  democracies  or  less  competitive  hegemonic 
authoritarian  regimes. 
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Within  democratic  systems,  public  organizations 
carry  out  a  large  number  of  functions,  ranging 
from  provision  of  education,  health,  and  national 
security  and  regulation  of  financial  markets  to 
ensuring  equal  treatment  for  all.  These  bodies 
perform  these  functions  because  delegation  is 
imperative:  Elected  politicians  lack  the  time, 
expertise,  and  resources  both  to  enact  and 
implement  laws  and  to  delegate  these  tasks  to 
public  organizations.  To  perform  their  tasks, 
these  bodies  need  a  certain  level  of  autonomy 
from  democratic  oversight.  The  laws  they  imple¬ 
ment  often  contain  goals,  such  as  “national 
security”  and  “social  welfare,”  that  need  further 
specification  before  they  can  be  attained.  Also, 
several  different  approaches  and  policy  instru¬ 
ments  to  complex  social  problems  may  exist, 
and  the  choice  of  approach  or  policy  instrument 
requires  experience  and  knowledge  of  these 
social  problems.  Public  organizations  are  staffed 
by  civil  servants  with  such  experience  and 
knowledge,  but  they  need  some  level  of  auton¬ 
omy  to  analyze  social  problems  and  to  choose 
the  best  possible  course  of  action  for  addressing 
these.  Administrative  autonomy  revolves  around 
a  delicate  balance.  On  the  one  hand,  if  public 
organizations  lack  administrative  autonomy, 
they  run  the  danger  of  being  micromanaged  by 
politicians.  On  the  other  hand,  if  there  is  too 
much  administrative  autonomy,  politicians  and 
voters  run  the  risk  of  having  created  public  orga¬ 
nizations  that  are  nonresponsive  to  democratic 
preferences. 


Administrative  autonomy  is  therefore  a  core 
concept  in  the  study  of  public  administration.  At 
its  heart  lie  democratic-theoretical  questions  con¬ 
cerning  the  relationship  between  voters,  politi¬ 
cians,  and  civil  servants  as  well  as  rational- 
instrumental  issues  concerning  the  management 
and  design  of  effective  and  efficient  public  poli¬ 
cies.  As  a  theoretical  concept,  administrative 
autonomy  represents  an  abstract  social  construct. 
Under  this  lemma,  the  background  of  the  concept 
of  administrative  autonomy  within  the  field  of 
public  administration  is  discussed  first.  Next,  the 
various  attributes  and  dimensions  of  the  concept 
of  administrative  autonomy  are  described.  Finally, 
an  overview  of  theories  of  administrative  auton¬ 
omy  is  provided. 

The  Concept  of  Autonomy  in  the 
Study  of  Public  Administration 

In  the  field  of  public  administration,  the  concept  of 
administrative  autonomy  has  been  enjoying  the 
close  attention  of  many  scholars  for  more  than 
several  decades  (see,  e.g.,  Daniel  Carpenter,  2001; 
Peter  Clark  &  James  Wilson,  1961;  Anthony 
Downs,  1967;  Philip  Selznick,  1957).  Admin¬ 
istrative  autonomy  is  considered  a  sine  qua  non  for 
nothing  less  than  the  “survival”  of  a  public  organi¬ 
zation.  To  survive  competition  with  other  public 
organizations  over  scarce  resources,  the  attaining 
of  a  substantial  degree  of  administrative  autonomy 
is  seen  as  a  critical  goal:  “Autonomy  gives  an  orga¬ 
nization  a  reasonably  stable  claim  to  resources  and 
thus  places  it  in  a  more  favorable  position  from 
which  to  compete  for  those  resources”  (Clark  & 
Wilson,  1961,  p.  158).  Administrative  autonomy 
as  the  lifeblood  of  successful  public  organizations 
is  still  a  vivid  argument  in  contemporary  public 
administration  scholarship.  In  his  study  of  the 
modernization  of  the  American  bureaucracy  dur¬ 
ing  the  late  19th  and  early  20th  centuries,  Carpenter 
(2001)  ascribes  the  success  of  various  agencies, 
among  them  the  U.S.  Postal  Service  and  the  U.S. 
Food  and  Drug  Administration,  to  their  leader¬ 
ships’  capabilities  to  forge  autonomy  for  their 
organization. 

During  the  1990s,  we  witnessed  a  renewed  inter¬ 
est  in  the  concept  of  administrative  autonomy.  Three 
developments  were  important  drivers  of  this  interest. 
First,  starting  in  the  late  1980s  and  the  beginning  of 
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the  1990s,  by  setting  up  “quasi-nongovernmental 
organizations,”  that  is,  quangos,  governments 
worldwide  embarked  on  the  path  of  restyling  their 
central  governmental  apparatus  according  to  one 
of  the  core  prescriptions  of  the  new  public  man¬ 
agement  (NPM)  paradigm.  In  traditional  Western 
parliamentary  systems,  governments  were  con¬ 
fronted  with  trade-offs  between  giving  autonomy 
to  parts  of  their  integrated  central  administrative 
apparatuses  and  to  retain  control  over  the  newly 
created  autonomous  bodies. 

The  second  driver  of  the  renewed  attention  for 
the  concept  of  autonomy  is  the  wide  diffusion  and 
creation  of  independent  regulatory  authorities 
(IRAs).  The  intellectual  foundations  for  the  cre¬ 
ation  of  IRAs  stem  from  the  same  source  as  that  of 
NPM  described  above.  The  financial  and  eco¬ 
nomic  reforms  of  the  1980s  and  1990s  resulted  in 
the  liberalization  of  markets  and  the  breaking 
down  of  public  monopolies.  Deregulation  led  to 
reregulation  as  contracts  and  property  rights 
within  these  newly  created  markets  needed  public 
supervision.  Legislatures  entrusted  the  protection 
of  property  rights  and  the  enforcement  of  con¬ 
tracts  on  these  new  markets  to  independent  regula¬ 
tory  agencies.  Their  autonomy  from  government  is 
considered  a  necessary  condition  for  guaranteeing 
the  credible  commitment  of  the  government  to  not 
renege  on  the  liberalization  of  these  markets. 
Consequently,  the  creation  of  IRAs  brought  imme¬ 
diately  to  the  fore  the  question  of  who  controlled 
these  highly  autonomous  bodies. 

The  third  and  final  driver  is  the  emergence  of 
neo-economic  theories  within  public  administration 
and  political  science.  Neo-economic  theories  entered 
the  fields  of  political  science  and  public  administra¬ 
tion  through  the  latter’s  adaptations  from  economic 
and  legal  approaches  to  contract  theories.  Neo- 
economic  theories  first  made  school  within 
American  political  science.  The  warm  reception  of 
the  principal-agency  framework  is  partly  the  result 
of  the  decades-old  American  interest  in  Congress’s 
capability  to  control  the  bureaucracy.  Principal- 
agency  and  transaction  costs  theories  introduced  a 
new  conceptual  tool  kit  into  public  administration 
and  concepts  such  as  “contract,”  “information 
asymmetry,”  “monitoring,”  and  “costs”  started  to 
dominate  our  talk  and  thinking  about  politics  and 
administration.  Neo-economic  theories  soon 
crossed  the  Atlantic  and  assumed  a  prominent  place 


within  European  studies  on  regulation, 
Europeanization,  and  quangocratization.  This 
resulted  in  the  growing  attention  on  the  concept  of 
administrative  autonomy. 

The  study  of  administrative  autonomy  is  col¬ 
ored  by  sociological  and  neo-economy-inspired 
political  science  theories.  Whereas  sociological 
theories  treat  administrative  autonomy  as  an 
important  asset  of  public  organizations  to  success¬ 
fully  function  within  their  own  environment, 
economic-political  science  theories  consider  the 
autonomy  of  public  organizations  as  a  potential 
liability.  We  will  see  that  these  perspectives  consti¬ 
tute  the  basic  view  within  two  of  the  three  main 
sets  of  theories  of  administrative  autonomy. 

The  Anatomy  of  Administrative  Autonomy 

Administrative  autonomy  is  an  abstract  concept 
that  is  not  directly  observable.  In  this  section,  we 
will  present  two  dimensions  of  administrative 
autonomy  that  are  commonly  used  in  studies  of 
this  concept. 

De  Facto  and  De  Jure  Autonomy 

The  first  distinction  is  between  formal  or  de  jure 
autonomy  and  actual  or  de  facto  autonomy  of 
public  organizations.  De  jure  autonomy  refers  to 
the  formal-legal  prescriptions  concerning  the  rela¬ 
tionship  between  politicians  and  public  organiza¬ 
tions.  The  laws  that  created  the  body  (or  the  laws 
and  regulations  that  delegate  specific  tasks  to  pub¬ 
lic  organizations)  state  how  a  political  principal 
and  its  administrative  agent  should  interact  in 
theory.  As  such,  these  laws  spell  out  the  rules  of 
the  game  that  should  be  obeyed  when  these  actors 
interact  with  each  other.  To  a  certain  extent,  the  de 
facto  or  actual  autonomy  of  a  public  organization 
is  reflected  by  its  formal  autonomy.  A  public  orga¬ 
nization  that  is  not  part  of  a  ministerial  depart¬ 
ment,  for  example,  will  have  more  formal  admin¬ 
istrative  autonomy  than  a  unit  of  a  ministerial 
department. 

However,  the  correlation  between  de  jure  and 
de  facto  administrative  autonomy  is  not  always 
strong.  George  Krause  and  James  Douglas  (2005), 
for  example,  found  that  it  was  not  the  formal 
administrative  autonomy  of  three  different  U.S. 
economic  agencies  that  affected  the  outputs  of 
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their  economic  forecasts  but  the  reputation  of  the 
economic  professionals  who  worked  within  these 
agencies.  Martino  Maggetti  (2007)  found  that  a 
higher  level  of  formal  administrative  autonomy  is 
not  a  sufficient  nor  necessary  condition  for  higher 
levels  of  de  facto  autonomy.  These  findings  point 
at  the  existence  of  a  discrepancy  between  rules  and 
competences  on  paper  and  rules  and  competences 
in  practice.  In  the  words  of  Kutsal  Yesilkagit 
(2004), 

Rules,  procedures,  and  competences  may  seem 
clear  on  paper  as  they  inform  all  actors  about  the 
formal  intentions  of  the  designers.  However, 
[after  the  body  is  created]  public  organizations 
leadership  can  interpret  the  rules  in  a  different 
way  than  politicians  had  in  mind  when  they 
designed  the  rules,  (p.  535) 

Dimensions  of  Autonomy 

Koen  Verhoest,  Guy  B.  Peters,  Geer  Bouckaert, 
and  Bram  Vermeulen  (2004)  distinguish  between 
seven  organizational  domains  and  related  dimen¬ 
sions  of  autonomy.  The  first  dimension  is  manage¬ 
rial  autonomy  and  refers  to  the  extent  in  which 
managers  may  decide  on  the  allocation  of  the  input 
(e.g.,  budget)  and  resources  (e.g.,  personnel)  of 
public  organizations  without  prior  approval  from 
a  supervisory  body  (i.e.,  a  minister).  Second,  policy 
autonomy  designates  the  discretionary  space 
within  which  managers  of  public  organizations 
can  decide  on  the  quantities  of  the  goods  their 
organizations  produce,  choose  from  among  the 
policy  instruments  they  have  to  their  avail,  and 
arrange  the  internal  procedures  within  their  orga¬ 
nizations.  Managerial  and  policy  autonomy  hence 
correspond  to  the  discretionary  authority  of  public 
managers:  a  space  within  which  managers  enjoy 
some  leeway  to  manage  their  organizations  as  they 
see  fit. 

External  stakeholder  may  actively  limit  the  dis¬ 
cretionary  space  of  the  management  of  public 
organizations.  Administrative  autonomy  is  then  a 
function  of  the  exemption  from  constraints  on  the 
actual  use  of  decision-making  competencies  that 
external  stakeholders — that  is,  the  ministers — 
grant  these  organizations.  In  democratic  systems, 
central  governments  always  retain  certain  powers 
to  fine-tune  the  amount  of  decision-making 


competences  of  public  organizations.  This  fine- 
tuning  may  take  place  through  the  adjustment  of 
an  organization’s  structural  autonomy,  legal 
autonomy,  interventional  autonomy,  and  financial 
autonomy.  Structural  autonomy  is  the  extent  to 
which  the  head  of  public  organizations  is  exempted 
from  direct  supervision  of  elected  officials — that  is, 
the  minister.  If  the  head  of  a  public  organization 
reports  to  a  governing  board  and  not  directly  to 
the  minister,  then  the  structural  autonomy  of  pub¬ 
lic  organizations  may  be  considered  as  high.  The 
financial  autonomy  of  a  public  organization  is 
high  when  it  is  entitled  to  draw  its  own  financial 
resources  from  the  revenues  of  the  services  the 
organization  delivers;  it  is  low  when  the  revenues 
of  the  organization  come  from  the  departmental 
budget.  With  regard  to  the  legal  dimension  of 
autonomy,  Verhoest  et  al.  take  the  legal  basis  of 
the  public  organization  as  the  measuring  rod.  If  a 
public  organization  lacks  legal  personality  or  it  is 
established  by  a  statutory  instrument  instead  of  a 
statute,  the  position  of  the  public  organization  vis- 
a-vis  the  legislature  or  the  cabinet  will  be  weak,  as 
the  latter  can  change  the  legal  status  of  the  public 
organization  with  lesser  (legislative)  costs  than 
when  public  organizations  was  founded  on  pri¬ 
mary  law  or  a  statute.  Finally,  interventional 
autonomy  is  the  extent  to  which  a  public  organiza¬ 
tion  is  exempt  from  ex  post  reporting,  evaluation, 
and  audit  requirements.  Table  1  gives  a  brief  sche¬ 
matic  overview  of  the  dimensions  of  autonomy 
and  the  substances  to  which  autonomy  and/or 
exemptions  refer. 

Theories  of  Administrative  Autonomy 

Until  now  we  have  discussed  the  background  and 
various  attributes  of  the  concept  of  autonomy. 
Here  we  will  focus  on  the  factors  that  affect  levels 
of  administrative  autonomy.  In  research  as  well  as 
in  practice,  we  are  primarily  interested  in  why 
some  public  organizations  have  more  administra¬ 
tive  autonomy  than  others  and  what  these  organi¬ 
zations  (can)  do  with  their  autonomy  in  public  life. 
Overlooking  academic  research  on  administrative 
autonomy  over  the  past  decades  suggests  that  there 
exist  three  sets  of  theories  of  administrative  auton¬ 
omy:  (1)  external-control  theories,  (2)  internal- 
administrative  theories,  and  (3)  dynamic-adaptive 
theories. 
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Table  1  Dimensions  of  Autonomy 

Type  of  Autonomy 

Autonomy  Dimensions 

Substance  of  Autonomy 

Level  of  decision-making 
competencies  (discretionary  powers) 

Managerial  autonomy 

Decisions  on  inputs  and  allocation 
of  resources 

Policy  autonomy 

Decisions  on  policy  instruments, 
production  of  goods 

Exemptions  from  constraints  on 
decision-making  competencies 

Structural  autonomy 

Exemptions  from  direct  hierarchical 
supervision 

Legal  autonomy 

Legal  basis  of  public  organization 

Financial  autonomy 

Source  of  agency  funding 

Interventional  autonomy 

Exemption  from  ex  post  controls 

Sources:  Adapted  from  Verhoest,  K.,  Peters,  G.  B.,  Bouckaert,  G.,  &  Vermeulen,  B.  (2004).  The  study  of  organisational 
autonomy:  A  conceptual  overview.  Public  Administration  and  Development,  24(2),  101-118;  Christensen,  J.  G.  (2001). 
Bureaucratic  autonomy  as  a  political  asset.  In  B.  G.  Peters  &  J.  Pierre  (Eds.),  Politicians,  btireaucrats  and  administrative 
reform  (pp.  119-131).  London:  Routledge. 


External-Control  Theories  of 
Administrative  Autonomy 

External-control  theories  of  administrative 
autonomy  claim  that  the  administrative  autonomy 
of  a  public  organization  is  a  function  of  the  level  of 
pressures  exerted  on  the  organization  by  political 
stakeholders  in  the  environment  of  the  public 
organization.  The  theories  that  are  grouped  under 
this  heading  have  in  common  that  they  apply  a 
(version  of)  principal-agency  theory  of  political 
control.  They  assert  that  the  degree  of  administra¬ 
tive  autonomy  of  public  organization  depends  on 
the  instruments  and  resources  that  political  princi¬ 
pals,  such  as  Congress  or  the  president  (in  U.S. 
studies)  and  parliaments  or  cabinets  and  ministers 
(in  studies  on  parliamentary  systems),  have  to  con¬ 
trol  the  actions  and  outputs  of  public  organiza¬ 
tions.  These  instruments  are  usually  divided 
between  ex  ante  controls,  such  as  agency  design, 
administrative  procedures,  and  ex  post  controls, 
such  as  audits,  reporting  requirements,  and  (threats 
of)  budgetary  cuts. 

External  control  is  not  limited  to  political  princi¬ 
pals.  Next  to  formal  political  actors,  interest  groups 
too  may  act  as  principals  of  public  organizations. 
Interest  groups  are  key  players  in  the  politics  of 
delegation  and  administrative  design,  as  they,  per¬ 
haps  more  than  elected  politicians,  care  about  the 
control  regime  of  public  organizations  as  these 
organizations  are  essential  to  the  policy  process  and 


the  distribution  of  public  services  to  their  mem¬ 
bers.  Political  control  is  therefore  not  an 

issue  ...  of  congressional  dominance  or 
presidential  control,  but  rather  of  shared  authority 
and  the  ability  to  influence  policy  in  one  direction 
or  another,  a  process  whereby  interest  groups’ 
ability  to  access  and  shape  policy  both  directly 
and  indirectly  is  evident  and  must  not  be 
overlooked.  (Scott  Furlong,  1998,  p.  61) 

In  other  words,  political  control  is  a  form  of  “joint 
custody”  of  president,  congress,  interest  groups, 
and  the  courts,  and  administrative  autonomy  is  the 
resultant  of  the  strategies  pursued  by  multiple 
interacting  actors. 

Internal-Administrative  Theories 
of  Administrative  Autonomy 

In  contrast  to  external  control  theories,  inter¬ 
nal-administrative  theories  of  administrative 
autonomy  put  public  organizations  and  their  inter¬ 
ests,  capacities,  identity,  and  mission  central  to 
their  explanatory  models.  The  main  thesis  of  this 
line  of  research  is  that  the  administrative  auton¬ 
omy  of  public  organizations  is  a  function  of  the 
organization’s  capabilities  to  forge  its  own  auton¬ 
omy  from  its  direct  environment.  Whereas  we 
could  say  that  in  the  former  perspective,  the  orga¬ 
nization  is  considered  as  a  passive  taker  of  the 


122  Autonomy,  Administrative 


discretion  that  its  environment  grants  to  itself,  in 
this  perspective,  public  organizations  are  consid¬ 
ered  as  active  seekers  or  forgers  of  autonomy 
(Carpenter,  2001). 

As  suggested  by  Wilson  (1989,  pp.  188-192), 
two  sets  of  internal-administrative  factors  are 
deemed  crucial  for  achieving  autonomy.  One  is 
“political  legitimacy”  (Carpenter,  2001) — that  is, 
the  belief  and  reputation  that  agencies  can  deliver 
the  services  that  are  “worthwhile  to  some  group 
with  influence  over  sufficient  resources  to  keep  it 
alive”  (Downs,  1967,  p.  7).  Agencies  should 
“impress  those  politicians  who  control  the  budget 
that  its  function  generate  political  support”  (p.  7) 
from  which  politicians  can  benefit  in  terms  of  their 
reelection  or  strengthening  of  their  party  political 
base.  Second,  administrative  autonomy  requires 
the  “organizational  capacities”  to  deliver  the  ben¬ 
efits  to  their  supporters:  It  must  have  the  staff  with 
the  right  skills  and  dedication  to  fulfill  the  organi¬ 
zation’s  ultimate  goal  of  securing  a  stable  and 
strong  autonomous  position  within  the  public 
space.  In  line  with  this,  Selznick  (1957)  saw  care¬ 
fully  designed  recruitment  instruments  as  a  prereq¬ 
uisite  for  autonomy.  The  leadership  of  an  organi¬ 
zation,  if  it  wanted  to  pass  the  initial  survival 
threshold  and  maneuver  through  critical  periods 
ahead,  had  to  carefully  select,  first,  the  “social 
base,”  that  is,  those  segments  of  “the  [organiza¬ 
tion’s]  environment  to  which  operations  will  be 
oriented”  (Selznick,  1957,  p.  104). 

Dynamic-Adaptive  Theories 
of  Administrative  Autonomy 

The  third  set  of  theories  of  administrative  auton¬ 
omy  commonly  claims  that  the  administrative 
autonomy  of  public  organizations  is  a  function  of 
a  complex  pattern  of  interactions  between  public 
organizations  and  its  (political)  environment.  The 
interactions  consist  of  a  mix  of  stimuli  or  signals 
and  responses  that  traverse  from  various  venues  in 
the  environment  to  a  public  organization  as  well  as 
from  agencies  to  the  political  actors  in  the  environ¬ 
ment.  “[CJausality”  between  control  and  auton¬ 
omy  “does  not  just  flow  from  each  democratic 
institution  towards  the  administrative  organization 
as  assumed  in  many  conceptualizations  of  political 
control  theories,  but .  .  .  also  flows  from  agency  to 
political  principals”  (Krause,  1996,  p.  1089). 


According  to  Dan  Wood  and  Richard  Waterman, 
stimuli  are  discrete  events  (e.g.,  a  crisis,  the  election 
of  a  new  president),  event  progressions  (e.g.,  yearly 
budgetary  appropriations,  congressional  oversight 
hearings),  and  tonal  stimuli  (i.e.,  changes  in  the 
tone  of  political-bureaucratic  relationships),  and 
the  responses  are  a  function  of  the  agencies’  techno¬ 
logical,  rational,  and  political  assets.  Unanticipated 
behavior,  a  function  of  the  general  uncertainty  in 
any  policy  environment,  also  influences  autonomy. 
Policies  are  made  in  ambiguity;  agencies  do  not 
only  often  poorly  anticipate  possible  events,  but 
they  are  also  often  not  capable  of  making  sense  of 
the  stimuli  that  reach  them,  causing  public  organi¬ 
zations  to  display  unanticipated  response  behavior. 

Conclusion 

Administrative  autonomy  is  a  core  concept  in 
the  study  of  public  organizations.  As  the  concept 
stands,  it  is  studied  in  close  relation  to  democratic- 
theoretical  notions  such  as  legitimacy,  public 
accountability,  and  democratic  control.  In  the  pre¬ 
ceding  sections,  we  have  discussed  the  various 
attributes  of  this  concept  and  theories  of  adminis¬ 
trative  autonomy.  We  have  seen  that  research  on 
this  topic  is  substantive.  However,  there  are  still 
aspects  of  administrative  autonomy  that  have  not 
yet  been  fully  studied.  Here,  three  lines  of  research 
that  are  worthy  to  be  explored  by  future  research 
are  discussed. 

In  the  studies  discussed  above,  administrative 
autonomy  is  treated  as  the  dependent  variable. 
However,  from  a  democratic  perspective,  the 
degree  of  administrative  autonomy  of  public  orga¬ 
nizations  is  particularly  important  for  the  extent  it 
enables  or  restrains  public  organizations  in  their 
operation.  Public  organizations  that  are  more  or 
less  autonomous  provide  public  services  and  regu¬ 
late  affairs  across  a  variety  of  societal  domains  and 
may  deeply  affect  the  lives  of  citizens.  So  one  line 
of  research  that  deserves  to  be  further  elaborated  is 
on  the  relationship  between  administrative  auton¬ 
omy  and  performance  or  policy  impact.  The  study 
of  administrative  autonomy  may  be  connected  to 
existing  studies  on  bureaucratic  responsiveness. 
These  studies  examine  the  impact  interest  groups 
have  on  decision-making  processes  of  public  orga¬ 
nizations  but  leave  the  relationship  with  adminis¬ 
trative  autonomy  somewhat  unexplored. 
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Second,  an  underexamined  aspect  of  adminis¬ 
trative  autonomy  is  of  a  more  normative  nature, 
namely,  the  discussion  from  which  stakeholders 
and  public  organizations  should  function  autono¬ 
mously.  Public  organizations  are  created  to  imple¬ 
ment  democratically  chosen  public  policies.  So, 
autonomy  from  government  will  never  be  absolute, 
as  public  organizations  have  been  entrusted  with 
the  implementation  of  collectively  agreed-on  pub¬ 
lic  policies.  Furthermore,  public  organizations  also 
interact  with  and  are  responsive  to  a  number  of 
nongovernmental  organizations.  So,  public  organi¬ 
zations  will  not  be  entirely  autonomous  from  non¬ 
governmental  stakeholders  as  well.  The  focus 
within  existing  research  has  predominantly  been 
on  the  varying  degrees  of  autonomy  from  govern¬ 
mental  and  nongovernmental  stakeholders,  with¬ 
out  questions  regarding  the  desirability  of  varying 
degrees  of  autonomy  from  different  stakeholders. 

Finally,  the  globalization  of  the  economy  and 
technological  developments  increasingly  blur  the 
lines  between  the  international  order  and  domestic 
administration.  We  move  toward  a  postnational 
bureaucracy  that  is  hallmarked  by  the  emergence  of 
dense  international  networks  of  civil  servants. 
Administrative  autonomy  becomes  situated  into  a 
new  context  as  the  environment  of  public  organiza¬ 
tions  becomes  more  and  more  transnational.  The 
consequences  of  globalization  of  the  executive  branch 
force  on  us  new  ways  of  thinking  about  the  relation¬ 
ship  between  politics  and  administration.  Here,  too, 
administrative  autonomy  will  be  a  core  concept. 
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Autonomy,  Subnational 


The  concept  of  autonomy  is  polysemous  in  scien¬ 
tific  literature  and  can  be  used  in  very  different 
contexts.  When  used  with  the  word  subnational, 
the  concept  refers  to  the  degree  of  autonomy  in 
governments  below  the  level  of  central  government, 
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with  the  exception  of  local  governments.  Examples 
include  the  Canadian  provinces,  the  U.S.  states, 
the  Spanish  autonomous  communities,  the  regions 
and  communities  in  Belgium,  the  German  and 
Austrian  Lander,  and  the  French  departments  and 
regions.  Autonomy  refers  to  the  degree  of  deci¬ 
sional  autonomy,  sovereignty  or  self-government 
within  a  nation-state,  enjoyed  by  these  subna¬ 
tional  governments. 

This  entry  first  discusses  subnational  autonomy 
and  the  structure  of  the  state.  Second,  it  describes 
the  power,  resources,  and  influence  of  subnational 
governments.  Regionalization  is  discussed  in  the 
third  section,  the  fourth  section  deals  with  multi¬ 
level  governance  and  subnational  autonomy,  and 
the  final  one  examines  minority  nationalism  and 
subnational  autonomy 

Subnational  Autonomy  and 
the  Structure  of  the  State 

Because  nation-states  vary  in  their  structure,  sub¬ 
national  governments  also  differ  from  country  to 
country.  In  many  cases,  vast  differences  are  found 
within  a  single  country.  This  diversity  results  in  a 
strong  asymmetry  between  subnational  govern¬ 
ments  in  terms  of  their  autonomy.  The  autonomy 
of  subnational  governments  is  based  on  several 
factors,  including  the  institutional  setting  of  a 
nation-state  and  the  constitutional  powers  devoted 
to  a  subnational  government.  The  more  a  country 
is  constitutionally  decentralized,  the  greater  is  the 
extent  to  which  subnational  governments  have 
legislative  powers  and  thus  autonomy. 

In  unitary  states  such  as  Denmark  or  Israel, 
subnational  governments  have  little  or  no  auton¬ 
omy.  A  unitary  state  is  a  state  governed  as  one 
single  unit  in  which  the  central  government  is  the 
decision  center.  Subnational  and  local  govern¬ 
ments  exercise  only  the  power  that  the  central 
governments  choose  to  delegate. 

In  the  case  of  a  devolved  or  decentralized  state, 
subnational  governments  have  more  autonomy.  A 
decentralized  state  is  generally  a  former  unitary 
state  such  as  France.  The  authority  and  responsi¬ 
bility  for  some  public  functions  have  been  trans¬ 
ferred  from  the  central  government  to  the  regional 
government.  A  devolved  state  is  a  centralized  state 
such  as  the  United  Kingdom  (UK),  where  subna¬ 
tional  governments  have  a  degree  of  autonomous 


power  devolved  from  the  central  government.  A 
devolved  government  cannot  challenge  the  consti¬ 
tutionality  of  central  government’s  law.  The  power 
given  by  the  central  government  can  be  revoked  or 
reduced.  For  example,  the  Northern  Ireland 
Assembly  has  been  suspended  many  times  by 
Fondon  since  its  creation.  In  theory,  decentralized 
and  devolved  states  are  different,  but  in  practice, 
the  differences  are  very  thin. 

A  federal  state  is  a  type  of  sovereign  state  in 
which  sovereignty  is  constitutionally  divided 
between  a  central  government  and  the  subnational 
governments  (e.g.,  the  Canadian  provinces  or  the 
German  Lander).  To  qualify  as  a  federation,  a 
minimum  requirement  is  that  the  constitutional 
powers  of  the  subnational  governments  cannot  be 
changed  unilaterally  by  the  central  government.  In 
federal  systems  such  as  Germany,  Canada,  the 
United  States,  or  Belgium,  subnational  govern¬ 
ments  have  considerable  autonomy.  In  Canada, 
for  example,  provinces  are  constitutionally  respon¬ 
sible  for  health  care,  education,  culture,  and 
municipalities.  They  can  also  act  on  economic 
development,  justice,  environment,  and  so  on.  In 
Belgium,  the  autonomy  of  communities  and  regions 
goes  even  further.  Since  1993,  Belgium  has  been, 
according  to  the  first  article  of  its  Constitution,  “a 
federal  state  composed  of  communities  and 
regions.”  The  constitutional  revision  of  1993  per¬ 
mits  the  regions  and  communities  to  become  real 
international  actors.  This  includes  the  power  of 
representation  and  power  to  sign  treaties  with  sov¬ 
ereign  states.  Since  the  revision  of  the  Constitution 
in  1993,  the  organization  of  Belgium’s  interna¬ 
tional  relations  is  fundamentally  adapted  to  the 
federal  state  structure.  The  autonomy  of  the 
Belgian  substate  actors,  with  regard  to  external 
policy,  is  unique  in  the  world.  Its  exceptional 
nature  arises  from  the  recognized  constitutional 
principle  of  in  foro  interno,  in  foro  externo.  On 
top  of  that,  there  is  an  absence  of  hierarchy 
between  different  levels  of  administration  (see 
Table  1). 

Subnational  Governments  Power, 
Resources,  and  Influence 

The  autonomy  of  subnational  governments  is  also 
related  to  the  resources  that  these  actors  can  mobi¬ 
lize.  Some  subnational  governments  such  as  those 
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Table  I  Typology  of  Democracies  and  Cases  in 
Relation  to  the  Autonomy  of  Subnational 
Governments 


Unitary  States 

Decentralized 
and  Devolved 
States 

Federal  States 

Denmark 

France 

Belgium 

Finland 

Netherlands 

Canada 

Greece 

Portugal 

Germany 

Ireland 

Great  Britain 

United  States 

Sweden 

Spain 

Austria 

Israel 

Italy 

India 

in  Quebec,  Catalonia,  Flanders,  California,  or 
Bavaria  possess  more  resources  than  many  sover¬ 
eign  states.  Let  us  compare  subnational  govern¬ 
ments  with  the  192  members  of  the  United 
Nations.  In  terms  of  population,  40  states  have  a 
population  of  less  that  1  million  and  around  13 
have  between  1  million  and  2  million.  In  the 
United  States  alone,  43  states  have  a  population 
greater  than  1  million,  and  36  have  a  population 
of  more  than  2  million,  according  to  U.S.  Census 
Bureau  data  for  2008. 

In  2008,  according  to  the  World  Bank,  nominal 
2008  gross  domestic  product  (GDP)  estimates  for 
191  nation-states  and  territories,  California  would 
rank  among  the  top  10  in  the  world.  The  top  11 
American  states  would  rank  among  the  first  25  in 
the  world,  the  top  26  in  the  United  States  are 
among  the  top  50  globally,  and  all  50  states  are 
among  the  top  86  nation-states.  The  state  of 
California  could  even  be  a  member  of  the  G8 
(Group  of  8;  or  the  G20  [Group  of  20J)  since  its 
GDP  is  greater  than  those  of  Canada  and  Spain. 
The  annual  budget  of  California  exceeds  that  of 
Mexico  and  numerous  sovereign  states.  Thus,  sub¬ 
national  governments  are  among  the  most  impor¬ 
tant  economic  actors  in  the  world. 

Regionalization 

Since  World  War  II,  authority  has  shifted  away 
from  the  central  state  to  international  organiza¬ 
tions,  such  as  the  European  Union  or  the  World 
Trade  Organization,  and  to  regions  and  munici¬ 
pal  governments.  Regionalization  represents  a 


transformation  of  regional  authority  where  the 
nation-state  creates  or  reinforces  the  mesogovern- 
ment  level  between  the  central  state  and  the  local 
level.  If  Canada  and  the  United  States  have  been 
federations  for  years,  in  Europe  regionalization 
has  gone  hand  in  hand  with  European  integration. 

After  World  War  II,  only  Germany,  Switzerland, 
and  Austria  were  considered  decentralized  federal 
states.  Since  the  1950s,  of  the  27  countries  that  are 
now  part  of  the  European  Union,  including  Belgium, 
Spain,  Italy,  the  UK,  and  France,  19  have  seen  the 
rise  of  a  stronger  and  deeper  regional  or  subna¬ 
tional  government.  These  governments  now  handle 
a  wide  variety  of  policy  responsibilities.  Since  1950, 
new  regional  governments  have  been  created  in  14 
countries  of  the  European  Union  (EU).  Eighteen 
regions  in  7  countries,  excluding  the  Belgian 
regions  and  communities  and  the  Spanish  autono¬ 
mous  communities,  have  also  been  given  a  special 
autonomous  or  different  status.  In  1950,  only  5 
countries  had  directly  elected  regional  assemblies 
covering  the  countries  as  a  whole;  by  2010,  16 
countries  had  such  assemblies.  Only  2  countries  in 
Europe  with  a  population  of  more  than  2.5  million, 
Sweden  and  Bulgaria,  have  not  followed  this  trend. 
In  Sweden,  decentralization  reinforced  the  role  of 
municipal  governments  instead. 

These  meso  or  regional  governments  are  a  fairly 
new  thing  in  many  countries  with  diverse  political 
history  and  constitutional  setting.  These  new 
regional  governments  are  reshaping  politics,  the 
distribution  of  power  between  the  central  govern¬ 
ment  and  the  subnational  units,  and  accountability. 

According  to  some  researchers,  the  driving  force 
behind  the  rise  of  these  regional  governments  is 
related,  in  part,  to  globalization.  According  to  some 
scholars,  the  nation-state  today  is  too  small  for  the 
big  problems  of  the  world  and  too  big  for  the  small 
ones.  In  this  context,  regional  governments  are  bet¬ 
ter  placed  to  deliver  important  public  goods. 

Other  scholars  identify  the  rise  of  regions  with 
the  decline  of  the  authority  of  the  nation-states.  In 
Europe,  European  integration  has  reinforced 
regionalization  and  decentralization.  European 
integration  has  noticeably  weakened  the  central¬ 
ized  uniform  states  with  the  introduction  of  the 
principle  of  subsidiarity  in  the  Treaty  of  Maastricht 
of  1992. 

There  are  also  more  technical  reasons  for 
regionalization  or  decentralization.  For  some 
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scholars,  the  virtues  of  centralization  and  decen¬ 
tralization  differ  from  one  policy  area  to  the  other. 
The  advantage  of  centralization  is  obvious:  econo¬ 
mies  of  scales  and,  in  theory,  greater  equality 
between  the  citizens  of  one  country.  The  virtues  of 
decentralization  are  different.  Some  scholars  sug¬ 
gest  that  regionalization  and  decentralization 
increase  the  efficiency  of  public  policy.  The  funda¬ 
mental  principle  of  public  goods  analysis  is  that  a 
jurisdiction  should  encompass  those  who  are  posi¬ 
tively  or  negatively  affected  by  a  specific  policy. 
Because  the  public  goods  differ,  the  scale  at  which 
they  are  most  efficient  also  differs.  Some  policies, 
such  as  urbanism  or  fire  protection,  are  best  han¬ 
dled  at  the  local  level.  Some  policies,  such  as  edu¬ 
cation  or  hospitals,  are  best  handled  at  the  regional 
level,  and  others  such  as  national  transportation, 
infrastructure,  trade,  and  national  security  are  bet¬ 
ter  handled  at  the  national  level. 

Some  believe  that  political  participation  is  bet¬ 
ter  and  more  effective  if  the  decision  center  is 
closer  to  voters.  Decentralized  governments  are 
closer  to  the  citizens  and  therefore  have  better 
knowledge  about  what  they  want  and  need. 
Regionalization  thus  contributes  to  better  democ¬ 
racy  because  it  multiplies  opportunities  for  citizens 
to  influence  governments. 

According  to  some  researchers,  regionalization 
and  decentralization  are  drivers  for  better  democ¬ 
racy.  Elected  officials  in  democratic  states  may  shift 
decision  making  away  from  the  central  state  if  it 
can  lead  to  more  efficient  decision  making  and  can 
attract  more  votes.  By  contrast,  authoritarian  states 
tend  to  centralize  the  decision-making  process.  At 
best,  authoritarian  regimes  will  deconcentrate  pow¬ 
ers  in  local  outposts  so  that  they  can  provide  more 
reliable  information  about  local  politics  and  imple¬ 
ment  more  effective  central  policy.  This  fact 
explains  why  the  democratization  of  southern, 
central,  and  eastern  Europe  went  hand  in  hand 
with  decentralization  and  subnational  autonomy. 
The  so-called  third  wave  of  democracy  generated 
more  decentralization.  Regionalization  and  decen¬ 
tralization  could  thus,  in  theory,  lead  to  more  effec¬ 
tive  public  administration  and  better  democracy. 

Finally,  some  specialists  think  that  decentraliza¬ 
tion  comes  primarily  from  the  public  finance  crisis. 
Regional  governments  are,  in  theory,  more  in 
phase  with  the  real  needs  of  the  population  and 
thus  allocate  resources  more  effectively.  With  the 


crisis  of  the  public  debt  in  numerous  countries, 
decentralization  is  a  way  for  the  central  state  to 
transfer  some  responsibility  to  regions  in  order  to 
reduce  the  pressure  on  public  finance  at  the  central 
level. 

Multilevel  Governance 

With  regionalization  and  decentralization,  the 
responsibilities  of  substate  governments  have 
expanded  considerably.  Subnational  governments 
have  more  power  concerning  economic  develop¬ 
ment,  education,  health  care,  environment,  public 
transport,  and  so  on.  Each  of  these  public  policy 
domains  requires  coordination  among  governments 
at  diverse  levels.  Because  no  government  is  an 
island,  increased  globalization  and  internationaliza¬ 
tion  have  reinforced  the  need  to  coordinate  local, 
regional,  national,  and  international  policies  as  the 
responsibilities  of  subnational  governments  expand. 

The  rise  of  regional  government  and  subna¬ 
tional  autonomy  has  a  big  impact  on  multilevel 
governance.  Decisions  taken  at  one  level  of  gov¬ 
ernment  affect  the  decisions  of  the  other  level  of 
government.  Most  policy  thus  require  some  form 
of  coordination  among  international,  European, 
national,  regional,  and  local  governments.  We 
know  that,  for  example,  according  to  a  Elnited 
Nations  estimate,  between  50%  and  70%  of  all 
adaptation  and  mitigation  measures  against  cli¬ 
mate  change  will  call  for  implementation  by  the 
regions  or  subnational  governments 

The  concept  of  multilevel  governance  was  cre¬ 
ated  within  the  framework  of  the  EE1  to  explain 
the  relation  between  the  various  levels  of  govern¬ 
ment  in  EE1  policy  making.  Multilevel  governance 
means  that  there  are  multiple  actors  from  various 
level  of  government  interacting  to  negotiate  and 
implement  public  policy  coming  from  the  EE1.  The 
multilevel  governance  approach  illuminates  the 
interdependence  between  the  local,  regional, 
national,  and  international  levels  of  authority. 

At  first,  multilevel  governance  was  developed 
for  the  study  of  the  EE1.  Now,  however,  it  is 
applied  in  various  situations  because  virtually  all 
government  activities  today  are  affected  by  the 
competence  of  at  least  one  intergovernmental 
organization,  and  frequently  many  more.  In  this 
way,  in  the  context  of  international  organizations 
and  international  conferences,  themes  are  dealt 
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with  that  relate  to  education,  public  health,  cul¬ 
tural  diversity,  the  environment,  business  subsi¬ 
dies,  the  treatment  accorded  to  investors,  the 
removal  of  nontariff  barriers,  barriers  to  agricul¬ 
ture,  to  services,  and  so  on.  This  phenomenon  is 
magnified  in  Europe  by  the  process  of  European 
integration  and  in  North  America  by  the  North 
American  Free  Trade  Agreement.  Likewise,  enlarg¬ 
ing  the  scope  of  international  issues  means  that  all 
government  departments  have  activities  that  are 
internationalized.  This  situation  makes  it  harder 
for  a  country’s  ministry  of  foreign  affairs  to  cen¬ 
tralize  the  decision-making  process. 

In  this  context,  subnational  governments  have 
become  more  aware  that  their  political  autonomy 
and  their  sovereignty — or,  in  other  words,  their 
ability  to  formulate  and  implement  policy — are 
subject  to  negotiation  in  multilateral  fora.  Thus, 
since  the  1960s,  there  has  been  a  noticeable 
increase  in  the  number  of  subnational  govern¬ 
ments  that  are  interested  and  participate  actively  in 
international  affairs. 

In  the  United  States,  for  instance,  only  4  states 
had  representative  offices  in  other  countries  in 
1970  versus  41  states  with  245  representative 
offices  in  34  countries  in  2008.  In  Germany,  the 
Lander  have  set  up  some  130  representative  offices 
since  1970,  of  which  21  are  located  in  the  United 
States.  In  2009,  Quebec,  one  of  the  pioneers  in  the 
field,  had  some  28  representative  offices  around 
the  world.  In  Spain,  the  autonomous  region  of 
Catalonia  operates  some  50  representative  offices 
abroad,  and  in  2010  the  Flemish  government  has 
more  than  100  political,  economic,  and  economic 
representations  abroad.  This  phenomenon  is  also 
evident  in  Japan  and  many  other  countries. 

When  considered  from  the  perspective  of  theo¬ 
ries  of  federalism  and  international  affairs,  the  two 
conceptions  conflict:  the  centralizing  school  and  the 
school  devoted  to  multilevel  governance.  From  the 
perspective  of  the  centralizing  approach,  some 
scholars  believe  that  a  monopoly  of  foreign  affairs 
is  a  “minimum  power”  of  all  central  governments. 
In  his  landmark  study,  Kenneth  Clinton  Wheare 
highlighted  the  negative  consequences  of  unbun¬ 
dling  central  control  over  foreign  affairs  for  the 
national  interest  and  for  the  functioning  of  the  inter¬ 
national  system.  To  some  scholars,  international 
relations  are  at  the  heart  of  federal  regimes. 
Centralization  of  the  foreign  affairs  power  is  a 


requirement  of  international  law  because  a  central¬ 
ized  political  system  is  a  necessary  condition  for 
states  to  be  able  to  play  the  role  they  are  assigned 
in  international  law  and  practice.  In  essence,  with¬ 
out  the  existence  of  a  central  government  that  has 
a  plenary  authority  on  its  territory  in  relation  to 
foreign  affairs  and  the  ability  to  participate  in 
international  relations  and  to  enforce  international 
obligations  in  the  domestic  order,  interstate  rela¬ 
tions  can  only  be  seriously  compromised.  If  the 
power  of  codecision  is  granted  regarding  treaty 
making,  there  would  be  a  risk  of  paralyzing  a 
state’s  foreign  affairs.  Every  player  would  have  a 
veto,  resulting  in  harm  to  the  state’s  image  in  the 
international  arena. 

Supporters  of  the  concept  of  multilevel  gover¬ 
nance  take  a  different  view.  According  to  some 
scholars,  diplomacy  or  foreign  policy  cannot  be 
considered  a  monopoly  of  the  central  government. 
Substate  government  will  always  have  an  impor¬ 
tant  role,  even  if  it  is  only  for  the  purposes  of 
implementing  international  agreements  concluded 
by  the  central  government.  In  addition,  giving  a 
monopoly  over  foreign  affairs  to  central  govern¬ 
ments  in  federal  regimes  puts  at  risk  the  distribu¬ 
tion  of  powers  between  the  different  orders  of 
government  for  the  benefit  of  central  authorities. 
There  are  many  examples  of  federal  regime  that 
must  operate  with  important  constitutional  limita¬ 
tions  on  their  powers  in  foreign  affairs. 

Foreign  policy  should  be  thought  of  as  a  complex 
system  where  different  actors  within  the  federal 
regime  structure  work  with  each  other.  Those  who 
favor  a  multilevel  governance  approach  thus  main¬ 
tain  that  “obligations  of  cooperation”  exist  between 
central  governments  and  substate  actors.  To  imple¬ 
ment  a  coherent  foreign  policy,  it  is  important  to 
consult  substate  actors  and,  indeed,  give  them  an 
important  role  by  means  of  intergovernmental 
mechanisms,  so  that  they  can  participate  actively  in 
the  country’s  treaty-making  process.  According  to 
this  view,  regional  integration,  multilateralism,  and 
globalization  have  thus  rendered  centralist  theses 
obsolete.  The  requirements  of  cooperation  between 
the  different  orders  of  government  are  more  and 
more  important,  and  it  is  for  this  reason  that  one 
notices  a  considerable  increase  in  executive  federal¬ 
ism  or  intergovernmental  relations  in  respect  of  the 
conclusion  of  international  treaties  in  federal 
regimes. 
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Subnational  Autonomy  and 
Minority  Nationalism 

In  multinational  states,  such  as  Canada,  Spain,  or 
Belgium,  decentralization  and  subnational  auton¬ 
omy  are  also  seen  by  many  scholars  as  a  way  to 
reduce  the  likelihood  of  secession  or  political  con¬ 
flicts.  Decentralization  is  a  way  to  accommodate 
substate  nationalist  movements  such  as  those  in 
Catalonia,  Flanders,  or  Quebec.  The  federalization 
or  decentralization  of  multinational  states  is  seen 
as  a  way  to  prevent  the  breakup  of  a  country  while 
permitting  political  and  cultural  autonomy  of 
minority  nations. 

Many  scholars  have  argued  that  some  measure 
of  autonomy  or  form  of  self-government  will  sat¬ 
isfy  the  majority  of  people  within  a  minority 
nation.  In  practice,  that  would  mean  that  a  minor¬ 
ity  nation  would,  to  some  extent,  govern  itself 
while  also  participating  in  the  nation-state  institu¬ 
tions.  Federalization  of  nation-states  does  not 
guarantee  the  end  of  the  national  question  in  mul¬ 
tinational  countries.  The  recent  experience  of 
Belgium  with  the  Flemish  movement,  that  of 
Scotland,  or  even  those  of  Catalonia  and  Quebec 
confirm  that  political  conflicts  will  remain,  but  it  is 
hard  to  see  any  form  of  successful  accommodation 
in  multinational  countries  that  does  not  include 
some  elements  of  federalism  and  subnational 
autonomy. 

Some  researchers  have  reservations  about  grant¬ 
ing  subnational  autonomy  to  accommodate  minor¬ 
ity  nationalism.  One  problem  is  that  federalism 
tends  to  accentuate  the  differences  between  the 
majority  and  the  minority  nation  within  a  nation¬ 
state.  The  creation  of  an  autonomous  subnational 
government  at  the  regional  level  gives  the  minority 
nation  new  tools  to  formulate  new  demands  for 
more  autonomy,  which  is  seen  as  a  never-ending 
story.  Granting  some  form  of  self-government  for 
minority  nations  tends  to  heighten  and  politicize 
the  minority  nation’s  self-consciousness  and  to 
facilitate  the  rise  of  a  self-conscious  intelligentsia. 
In  the  end,  federalization  is  seen  as  a  last-resort 
solution  that  risks  fueling,  rather  than  appeasing, 
minority  nationalism.  The  territorialization  of 
minority  nations  also  risks  reducing  the  cross¬ 
cutting  interactions  between  the  majority  nation 
and  the  minority  one.  Giving  some  form  of  self- 
government  to  a  minority  nation  holds  the 


potential  to  polarize  politics  at  the  national  level.  It 

is  thus  very  difficult  or  even  impossible  to  create  a 

national  consensus  on  national  issues. 

Stepbane  Paquin 
Universite  de  Sherbrooke 
Sherbrooke,  Quebec,  Canada 

See  also  Decentralization;  European  Integration; 
Federalism;  Multilevel  Analysis 
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The  balance  of  power  is,  arguably,  the  most  cen¬ 
tral  theoretical  concept  in  the  study  of  interna¬ 
tional  relations,  yet  it  is  also  one  of  the  most 
controversial  concepts  in  the  field.  It  is  a  signifi¬ 
cant  concept  because  it  underpins  how  theorists  of 
a  realist  predisposition  think  about  international 
relations,  and  realism  is  one  of  the  most  widely 
acknowledged  general  theories  in  the  discipline. 
The  concept  is  also  controversial  because  realism, 
in  general,  and  the  balance  of  power,  in  particu¬ 
lar,  have  always  been  fundamentally  challenged 
by  other  schools  of  thought.  Advocates  of  the  bal¬ 
ance  of  power  argue  that  it  helps  account  for  the 
most  fundamental  features  of  international  rela¬ 
tions  and,  in  particular,  the  survival  and  stability 
of  any  system  of  independent  states.  Critics,  by 
contrast,  attack  the  concept  by  insisting  that  it  is 
either  incoherent,  thereby  promoting  a  spurious 
understanding  of  international  relations,  or  dan¬ 
gerous,  because  it  promotes  policies  that  can  lead 
to  international  tensions  and  often  war.  This  entry 
first  examines  the  balance  of  power  metaphor  and 
describes  its  historical  development.  It  then  dis¬ 
cusses  the  roles  the  concept  has  in  contemporary 
thought,  especially  in  the  work  of  Hans  J. 
Morgenthau  and  Kenneth  N.  Waltz. 

The  controversy  that  surrounds  the  balance  of 
power  is  not  a  recent  phenomenon.  It  can  be  traced 
back  for  several  centuries,  although  there  is  no 
authoritative  genealogy  of  the  concept.  Debates 
about  the  balance  of  power  became  a  regular  feature 


of  the  theory  and  practice  of  international  rela¬ 
tions  in  Europe  from  the  16th  century  onward. 
The  balance  of  power  is  generally  regarded  as  an 
essentially  Eurocentric  concept,  although  this 
assessment  remains  somewhat  speculative  because 
so  little  is  known  about  the  theory  underpinning 
how  other  international  societies  have  operated. 

While  the  balance  of  power  is  generally  regarded 
as  a  concept  that  is  quintessentially  related  to 
international  relations,  it  is  also  a  concept  that  is 
regularly  used  across  the  social  sciences  as  well  as 
in  general  parlance.  However,  outside  interna¬ 
tional  relations,  the  balance  of  power  is  not  seen  to 
be  particularly  controversial.  The  term  is  employed 
ubiquitously  because  it  acts  as  a  metaphor  that 
effectively  transforms  our  conventional  under¬ 
standing  of  power.  Unadorned,  power  is  generally 
treated  as  an  attribute  that  enables  an  agent  pos¬ 
sessing  power  to  regulate  the  activities  of  other 
actors.  In  the  literature  on  power,  this  usage  is 
often  referred  to  as  a  behavioral  or  agential  con¬ 
ception  of  power.  The  balance  of  power  metaphor 
transforms  this  conventional  meaning  into  a  struc¬ 
tural  or  systemic  conception  of  power. 

When  treated  as  a  metaphor,  the  balance  of 
power  has  been  associated  with  a  wide  array  of 
images,  from  a  chandelier  to  the  arch  of  a  bridge. 
The  image  that  most  frequently  springs  to  mind  is 
a  set  of  scales.  From  the  perspective  of  interna¬ 
tional  relations,  the  scales  can  be  viewed  as  a  sys¬ 
tem  in  which  the  weights  that  are  placed  on  the 
pans  of  the  scales  are  equated  with  the  power 
capabilities  of  the  states  that  operate  within  the 
system.  What  the  metaphor  demonstrates  is  that  if 
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the  weight/power  is  increased  in  one  pan,  then  the 
weight/power  on  the  other  pan  is  automatically 
affected.  It  follows  that  power  must  be  treated  as  a 
relative  rather  than  an  absolute  phenomenon.  But 
the  metaphor  can  also  indicate  that  if  there  is  a 
disproportionately  heavy  weight  placed  in  one  pan 
of  the  scales,  then  it  is  possible  to  establish  an 
effective  counterbalance  by  placing  a  number  of 
lighter  weights  on  the  other  pan.  The  even  distri¬ 
bution  of  power  that  is  then  formed  is  often  repre¬ 
sented  as  the  definition  of  a  balance  of  power. 

If  the  metaphor  is  brought  into  focus,  however, 
it  is  clear  that  this  very  specific  meaning  should  be 
subsumed  under  a  much  broader  conception  that 
associates  the  balance  of  power  with  a  system  in 
which  power  is  a  structural  attribute.  Such  a  sys¬ 
tem  is,  in  effect,  constituted  by  the  distribution  of 
power  that  forms  within  the  boundaries  of  the 
system.  So  in  a  multipolar  system,  for  example, 
there  are  several  main  poles  of  power,  whereas  in 
a  bipolar  system  there  are  two  main  poles  of 
power,  and  in  a  unipolar  system  there  is  only  one 
main  pole  of  power.  This  structural  conception  of 
power  has  attracted  the  attention  of  many  histori¬ 
ans,  theorists,  and  practitioners  from  the  past  and 
the  present  who  have  wanted  to  understand  inter¬ 
national  relations,  and  it  has  led  them  to  presup¬ 
pose  that  the  behavior  of  states  is  or  at  any  rate 
should  be  significantly  influenced  by  the  structure 
or  balance  of  power  that  helps  constitute  the  inter¬ 
national  system. 

In  general  usage,  by  contrast,  the  same  kind  of 
assumption  is  not  made.  If  a  wife  goes  out  to  work 
and  earns  an  independent  income,  it  can  be  argued 
that  the  balance  of  power  within  the  marriage  will 
shift  as  a  consequence.  Or,  in  a  multiparty  system, 
if  two  opposition  parties  ally,  then  the  balance  of 
power  within  the  political  arena  will  alter.  In  both 
cases,  the  metaphor  brings  the  structure  of  the  sys¬ 
tem  into  focus.  But  there  is  no  presumption  that  a 
change  in  the  balance  of  power  will  necessarily 
strengthen  or  weaken  the  system.  The  metaphor 
simply  highlights  the  change  in  the  distribution  of 
power.  But  in  international  relations,  there  is  a 
long-standing  political  myth  that  states  will 
respond  to  changes  in  the  distribution  of  power  in 
a  way  that  ensures  that  international  stability  is 
restored.  But  the  veracity  of  this  myth  has  also 
long  been  challenged,  and  a  significant  counter¬ 
myth  has  arisen  asserting  that  the  preoccupation 


with  the  distribution  of  power  in  international 
relations  is  a  major  source  of  conflict  and  disorder. 
This  controversy  about  the  balance  of  power  has 
been  carried  forward  into  the  contemporary  disci¬ 
pline  and  accounts  for  the  very  distinctive  role  that 
the  concept  plays  in  international  relations. 

Historical  Development  of  the  Idea 

Despite  its  putative  importance,  we  still  do  not 
have  a  very  developed  understanding  of  how  the 
ideas  associated  with  the  balance  of  power  have 
evolved  across  time.  Nevertheless,  it  is  generally 
accepted  that  the  contemporary  conception  of  the 
balance  of  power  can  be  traced  back  to  the  Italian 
city-state  system  and  that  the  key  ideas  relating  to 
the  concept  began  to  take  shape  in  the  1 5th  cen¬ 
tury.  By  the  start  of  the  16th  century,  Francesco 
Guicciardini  (1483-1540),  a  practicing  diplomat 
and  an  acquaintance  of  Niccolo  Machiavelli,  was 
able  to  call  on  these  ideas  when  he  came  to  write 
The  History  of  Italy.  He  wrote  this  history  in 
response  to  the  French  invasion  of  Italy  in  1494; 
his  aim  was  to  trace  what  he  viewed  as  the  tragic 
loss  of  independence  by  the  Italian  city-states  in 
the  first  decades  of  the  16th  century.  He  shows 
how  these  states  failed  to  respond  adequately  to 
external  pressures  until  eventually  most  of  them 
were  absorbed  by  the  Hapsburg  Empire.  The  book 
provides  the  first  case  study  of  states  failing  to  fol¬ 
low  the  logic  of  a  balance  of  power  system  and,  as 
a  consequence,  succumbing  to  hegemony. 

Guicciardini’s  use  of  balancing  metaphors  is 
restricted  to  the  start  of  the  book  where  he  exam¬ 
ines  relations  among  the  city-states  in  the  final 
decades  of  the  15th  century.  The  focus  is  on 
Venice,  Naples,  Florence,  Milan,  and  Rome,  and 
he  acknowledges  that  relations  among  these  city- 
states  were  characterized  by  suspicion  and  jeal¬ 
ousy.  But  he  also  notes  that  Venice  was  by  far  the 
most  powerful  state  in  the  system  and,  moreover, 
that  it  had  hegemonic  ambitions.  But  the  logic  of 
this  situation  as  depicted  by  Guicciardini  is  that  it 
preserved  a  balance  of  power  system  because,  by 
forming  an  alliance,  the  other  city-states  were  able 
to  counterbalance  Venetian  power  and  thereby 
preserve  the  independence  of  all  the  city-states. 
Guicciardini,  however,  also  acknowledged  that 
rivalry  among  the  less  powerful  states  could  poten¬ 
tially  threaten  the  overall  stability  of  the  system. 
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But  this  stability  was  preserved,  according  to 
Guicciardini,  because  the  ruler  of  Florence,  Lorenzo 
de’  Medici,  recognized  the  importance  of  main¬ 
taining  peace  among  the  less  powerful  states  and 
preserving  the  alliance  against  Venice.  As  a  conse¬ 
quence,  he  was  willing  to  operate  as  a  balancer 
within  the  system.  In  other  words,  Guicciardini 
argues  that  Lorenzo  had  to  constantly  shift  his 
own  position  in  relation  to  the  other  states  in  the 
system  in  order  to  ensure  that  the  overall  balance 
was  sustained. 

What  began  to  destabilize  the  system  at  the  end 
of  the  15th  century  was  an  emerging  tendency  by 
the  city-states  to  invite  outside  states  to  help  settle 
disputes  among  the  city-states.  The  problem  with 
this  tactic,  as  Guicciardini  saw  it,  was  that  states 
such  as  France  outweighed  the  combined  strength 
of  all  the  Italian  city-states.  Nevertheless,  when  the 
Milanese  concluded  that  Florence  and  Naples  were 
forming  an  alliance  against  them,  they  called  on 
France  for  assistance.  But  this  move  had  the  effect 
of  destroying  the  Italian  balance  of  power  system 
and  Guicciardini  resorted  to  medical  metaphors  to 
characterize  the  subsequent  developments. 
Eschewing  the  balance  of  power  metaphor,  he 
argued  that  external  intervention  was  a  medicine 
with  effects  that  were  far  worse  than  the  original 
disease. 

The  History  of  Italy  was  translated  into  other 
European  languages,  and  the  ideas  associated  with 
the  balance  of  power  were  rapidly  diffused  across 
Europe  during  the  16th  century.  By  the  end  of  the 
century,  therefore,  the  balance  of  power  was 
viewed  as  a  Europe-wide  phenomenon.  But  during 
this  period,  the  meaning  of  the  balance  of  power 
was  also  extended  beyond  the  idea  of  shifting  alli¬ 
ances  in  a  competitive  or  adversarial  system.  For 
millennia,  there  has  been  a  metaphorical  link 
between  scales  and  justice,  and  during  the  16th 
century,  there  was  a  growing  tendency  to  discuss 
the  balance  of  power  in  terms  of  a  just  equilibrium. 
In  other  words,  the  balance  of  power  began  to  be 
associated  with  the  establishment  of  a  distribution 
of  power  that  was  not  only  regarded  as  stable  but 
also  fair.  Initially  it  was  argued  that  the  balancer 
state  must  have  this  capacity  to  establish  the  link 
between  stability  and  justice.  But,  over  time,  a 
stable  and  just  balance  of  power  was  seen  to  arise 
from  a  general  agreement  among  the  great  powers 
in  Europe,  and  so  the  concept  began  to  be  viewed 


as  a  product  of  great  power  cooperation  and 
mutual  association.  By  the  start  of  the  18th  cen¬ 
tury,  Europe-wide  peace  agreements  such  as  the 
Treaty  of  Utrecht  in  1713  began  to  make  reference 
to  a  just  equilibrium  in  their  formal  provisions.  At 
this  juncture,  therefore,  the  balance  of  power  was 
seen  to  provide  the  constitutional  basis  for  what 
was  called  the  Republic  of  Europe.  As  a  conse¬ 
quence,  it  then  became  possible  to  extend  the  idea 
of  the  balance  of  power  back  to  the  idea  developed 
by  Greek  and  Roman  theorists  of  a  mixed  or  bal¬ 
anced  constitution.  But  the  relationship  between 
these  sets  of  ideas  has  yet  to  be  teased  out  in  any 
detail. 

By  the  18th  century,  however,  there  was  also 
growing  resistance  to  the  idea  that  the  balance  of 
power  could  be  treated  as  either  some  kind  of  a 
natural  or  a  man-made  law  that  could  promote 
peace  and  stability.  On  the  contrary,  the  balance  of 
power  was  seen  to  be  irrevocably  tied  to  a  system 
where  war  was  endemic.  By  the  19th  century,  crit¬ 
ics  such  as  the  British  radicals  Richard  Cobden  and 
John  Bright  were  going  further  and  arguing  that 
the  balance  of  power  was  an  utterly  meaningless 
concept.  From  their  perspective,  only  by  moving 
beyond  the  balance  of  power  thinking  was  it  pos¬ 
sible  to  promote  a  peaceful  world.  The  battle 
between  advocates  and  critics  of  the  balance  of 
power  persisted  throughout  the  20th  century  and, 
almost  inevitably,  when  the  study  of  international 
relations  developed  as  a  formal  academic  disci¬ 
pline  during  the  present  era,  the  balance  of  power 
immediately  became  one  of  its  central  but  also 
deeply  contentious  concepts. 

Contemporary  Study  of 
the  Balance  of  Power 

It  is  not  difficult  to  demonstrate  that  the  balance  of 
power  occupies  a  central  position  in  the  contempo¬ 
rary  study  of  international  relations  because  the 
concept  lies  at  the  heart  of  two  of  the  very  most 
influential  theoretical  texts  published  since  the  end 
of  World  War  II.  The  first,  Hans  J.  Morgenthau’s 
Politics  Among  Nations ,  published  initially  in 
1948,  epitomizes  classical  realism  and  the  second, 
Kenneth  N.  Waltz’s  Theory  of  International 
Politics,  published  in  1979,  spearheaded  what  is 
now  known  as  neorealism  or  structural  realism. 
Despite  the  fact  that  these  texts  are  still  enormously 
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influential,  both  texts  and,  in  particular,  their  use 
of  the  balance  of  power  have  come  under  continu¬ 
ous  criticism  ever  since  they  were  published.  It  is 
certainly  not  the  case,  therefore,  that  either  realism 
or  the  balance  of  power  has  ever  occupied  a  hege¬ 
monic  position  in  the  field.  Instead,  it  is  more 
appropriate  to  see  them  as  occupying  the  central 
ground  but  at  the  same  time  being  engaged  in  a 
constant  debate  with  critics  coming  at  them  from 
a  number  of  very  different  directions. 

As  a  classical  realist,  Morgenthau  was  well 
aware  that  he  was  working  in  the  context  of  a 
long-standing  tradition  of  European  thought  that 
embraced  the  concept  of  the  balance  of  power,  and 
he  recognized  the  need  to  take  into  account  both 
its  adversarial  and  its  associational  dimensions. 
From  his  perspective,  the  American  Constitution 
and  other  similar  domestic  arrangements  provided 
the  most  effective  examples  of  a  functioning  bal¬ 
ance  of  power  system,  and  it  followed  that,  in  an 
international  context,  the  balance  of  power  could 
work  at  best  only  imperfectly.  For  Morgenthau, 
the  golden  age  for  the  balance  of  power  was  the 
era  of  dynastic  international  politics  in  the  18th 
century  when  states  were  governed  by  an  interna¬ 
tional  aristocracy  that  formed  a  distinctive  interna¬ 
tional  society.  Under  these  circumstances,  it  was 
possible  to  achieve  a  just  equilibrium  by  means  of 
largely  consensual  and  Europe-wide  peace  agree¬ 
ments.  But  even  when  these  agreements  broke 
down,  Morgenthau  argued  that  the  uncertainties 
associated  with  any  attempt  to  calculate  the  pre¬ 
vailing  balance  of  power  generated  restraint  on  the 
part  of  the  rival  dynasties  and  encouraged  them  to 
operate  on  the  basis  of  the  established  dynastic 
rules  of  the  game.  At  the  same  time,  however,  he 
readily  acknowledged  that  the  world  had  moved  a 
very  long  way  from  these  conditions.  In  the  after- 
math  of  World  War  II,  what  he  saw  were  two 
ideologically  driven  behemoths  operating  in  the 
absence  of  any  international  society.  The  United 
States  and  the  Soviet  Union  were  restrained  only 
by  the  crudest  kind  of  balance  of  power,  and  he 
feared  for  the  future  of  the  world.  Morgenthau 
argued  that  only  if  the  two  sides  adopted  the  tools 
of  classical  diplomacy  was  there  any  hope  that 
humans  would  survive. 

Waltz  approached  the  balance  of  power  from 
a  very  different  direction,  and  he  reached  a  very 
different  set  of  conclusions  from  those  of 


Morgenthau.  He  was  writing  in  the  context  of  a 
detente  that  marked  a  period  of  relaxation  in  the 
Cold  War  tensions  between  the  United  States  and 
the  Soviet  Union;  however,  his  aim  was  to  estab¬ 
lish  a  general  theory  of  international  relations 
that  would  transcend  historical  difference  and 
help  reveal  the  essential  differences  between 
domestic  and  international  politics  and  explain 
why  the  structure  of  the  international  system  has 
proved  so  enduring. 

In  the  first  instance,  therefore,  Waltz  argues  that 
all  political  systems  can  be  categorized  under  one 
of  two  headings:  hierarchy  or  anarchy.  In  a  hierar¬ 
chical  political  system,  actors  are  functionally  dif¬ 
ferentiated  and  power  is  distributed  on  a  vertical 
plane,  so  that  actors  can  exercise  power  over  sub¬ 
ordinate  actors  in  the  hierarchy  but  will  themselves 
be  subject  to  the  power  of  actors  that  occupy  a 
superior  position  in  the  hierarchy.  Having  made 
the  distinction,  however,  Waltz  focuses  his  atten¬ 
tion  almost  exclusively  on  anarchic  political  sys¬ 
tems  where  actors  are  seen  to  operate  on  a  hori¬ 
zontal  plane.  In  other  words,  these  actors  do  not 
consider  that  they  are  operating  in  a  hierarchy  or 
in  a  functionally  differentiated  system.  They  view 
themselves  as  independent  and  autonomous  actors 
operating  on  the  basis  of  self-help  and  so  they  are 
primarily  concerned,  in  the  first  instance,  with 
maintaining  their  independence  and  autonomy.  To 
be  able  to  do  this,  they  must,  from  the  start,  estab¬ 
lish  how  power  is  structured  in  the  system.  This 
requires  them  to  identify  the  dominant  actors  in 
the  system — those  actors  that  possess  a  dispropor¬ 
tionate  amount  of  the  overall  power  in  the  system. 
Cutting  through  complexity,  Waltz  distinguishes 
between  bipolar  and  multipolar  systems.  Then, 
contrary  to  Morgenthau,  he  aims  to  show  theo¬ 
retically  why  multipolar  systems  are  prone  to  gen¬ 
erate  an  unstable  balance  of  power,  whereas  bipo¬ 
lar  systems  are  prone  to  generate  a  stable  balance 
of  power. 

Focusing  first  on  multipolar  systems,  Waltz 
argues  that  if  the  balance  of  power  begins  to  move 
against  any  of  the  dominant  states  in  such  a  sys¬ 
tem,  they  will  be  pushed  to  respond  by  either  inter¬ 
nal  or  external  balancing.  Internal  balancing 
requires  the  actor  to  enhance  its  power  position  by 
domestic  means.  The  most  obvious  mechanisms 
are  either  to  expand  existing  military  resources  or 
to  seek  technological  improvements.  But  Waltz 
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also  acknowledges  that  in  a  self-help  or  anarchic 
political  system,  there  will  be  a  tendency  for  any 
action  that  gives  an  actor  a  potential  or  future 
power  advantage  to  be  emulated  by  the  other 
dominant  actors  in  the  system.  As  a  consequence, 
he  argues  that  in  an  anarchy  there  will  be  a  ten¬ 
dency  for  actors  to  take  the  form  of  like  units. 

In  a  multipolar  system,  however,  Waltz  argues 
that  when  the  balance  of  power  shifts,  there  is  the 
alternative  strategy  of  external  balancing.  In  other 
words,  states  can  form  alliances  with  each  other  to 
enhance  their  security.  But  Waltz  is  very  well 
aware  that  alliances  are  an  inherently  unstable 
feature  of  any  anarchic  system.  The  literature  on 
alliances  reveals  that  an  alliance  generates  at  least 
two  contradictory  fears.  On  the  one  hand,  there  is 
the  fear  of  entrapment,  as  the  result  of  being 
drawn  by  an  ally  into  an  unnecessary  or  dangerous 
conflict.  But,  on  the  other  hand,  there  is  also  the 
fear  of  abandonment.  Such  problems  are  endemic 
in  a  multipolar  system.  The  problem  persists  even 
in  the  face  of  a  rising  hegemon  when,  rather  than 
forming  an  overwhelming  alliance,  states  will 
often  choose  to  pass  the  buck  and,  at  least  in  the 
first  instance,  leave  it  to  other  states  to  confront 
the  hegemon. 

Because  of  these  sorts  of  uncertainties,  it 
becomes  extraordinarily  difficult  in  an  anarchic 
system  to  identify  the  nature  of  the  balance  of 
power  at  any  point  in  time.  Nevertheless,  despite 
these  uncertainties,  Waltz  argues  that  there  is  suf¬ 
ficient  flexibility  in  the  system  to  ensure  that  any 
potential  hegemonic  state  will  eventually  be  met  by 
an  effective  counterbalancing  alliance.  But,  by  the 
same  token,  Waltz  insists  that  it  is  very  much  eas¬ 
ier  to  identify  the  state  of  the  balance  of  power  in 
a  bipolar  system:  There  are  no  alliances  to  compli¬ 
cate  the  assessment,  and  the  two  dominant  states 
simply  have  to  monitor  each  other’s  activities.  As 
a  consequence,  not  only  are  there  fewer  sources  of 
instability,  but  it  is  also  easier  for  the  two  domi¬ 
nant  states  to  reach  mutual  agreement  and  thereby 
to  move  from  an  adversarial  balance  of  power 
through  to  an  associational  balance  of  power, 
although  Waltz  does  not  use  this  terminology. 

Waltz  has  proved  to  be  extremely  influential 
because  he  articulated  a  balance  of  power  theory 
in  such  unequivocal  terms,  and  it  has  encouraged 
other  theorists  to  either  revise  or  reject  the  theory. 
Offensive  realists  argue  that  dominant  states  aim 


to  maximize  their  power  position,  and  they  char¬ 
acterize  Waltz’s  approach  as  defensive  realism 
because  of  the  assumption  that  dominant  states 
only  aim  to  preserve  their  independence.  They  dis¬ 
miss  the  idea  of  an  associative  balance  of  power 
and  are  much  more  willing  to  entertain  the  possi¬ 
bility  of  hegemonic  success.  In  the  same  vein,  theo¬ 
rists  working  from  a  world-historical  perspective 
insist  that  anarchic  systems  transform  into  hierar¬ 
chical  or  at  least  unipolar  systems  on  a  very  regular 
basis.  This  poses  a  very  significant  problem  for 
Waltz  and  for  the  balance  of  power  theory  more 
generally.  But  a  more  immediate  problem  for 
Waltz  is  the  persistence  of  unipolarity  in  the  post- 
Cold  War  world.  Waltz  insisted  after  the  demise  of 
the  Soviet  Union  that  unipolarity  is  a  very  unstable 
structure  and  that  it  would  rapidly  give  way  to 
multipolarity.  But  realist  critics  argue  that  there 
are  few  signs  that  any  states  are  willing  to  compete 
with  the  United  States  in  the  military  arena  and 
that  in  any  event  Waltz’s  own  Neorealist  logic  can 
be  used  to  show  why  unipolarity  is  a  very  stable 
structure.  It  is  unlikely  that  these  debates  are  going 
to  be  easily  resolved,  and,  as  a  consequence,  the 
balance  of  power  will  continue  to  provide  a  theo¬ 
retical  focal  point  for  theorists  and  practitioners  in 
the  future. 
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Bargaining 


Bargaining  is  an  everyday  activity  that  pervades 
all  social  life  and,  arguably,  is  the  most  important 
mode  by  which  political  decisions  are  made.  It 
summarizes  all  social  activities  in  which  individu¬ 
als  or  corporate  actors  have  a  common  interest  in 
working  together  but  disagree  on  how  they  should 
cooperate.  Bargaining  is  therefore  a  voluntary 
process  through  which  negotiators  try  to  distrib¬ 
ute  among  themselves  a  mutual  benefit.  In  poli¬ 
tics,  the  mixture  between  cooperation  and  conflict 
that  characterizes  bargaining  situations  becomes 
apparent  at  all  levels  of  decision  making.  An 
example  of  a  recurrent,  difficult  bargaining  pro¬ 
cess  is  the  distribution  of  the  taxed  income  of  a 
polity  between  competing  interest  groups  who 
advance  contradicting  claims  to  bolster  their 
demand  for  a  large  share.  Below,  major  game 
theoretical  solutions  to  bargaining  problems  and 
their  possible  applications  in  political  science  are 
discussed. 

In  the  simplest  political  bargaining  problem, 
two  negotiators  disagree  over  how  to  divide  a  pie 
worth  one  unit  of  a  fictitious  currency  (eurodol¬ 
lar).  Any  outcome  in  the  “bargaining  zone,”  which 
is  delimited  by  the  minimal  individual  payoffs  the 
actors  expect,  can  result  from  the  negotiations. 
Trivially,  no  bargain  is  feasible  if  the  minimal 
expectations  do  not  overlap  or,  in  other  words,  if 
the  set  containing  the  possible  outcomes  is  empty. 
If  the  minimal  expectation — technically  often 
called  the  “reservation  price” — of  one  player  is 
zero,  an  extreme  solution  in  which  one  side  cashes 
in  1  eurodollar  while  the  other  side  leaves  the 
negotiation  table  empty-handed  can  occur,  since 
the  unlucky  negotiator  is  assumed  to  be  indifferent 
to  the  lack  of  a  reward  and  the  failure  of  negotia¬ 
tions.  The  “fair  solution,”  in  the  absence  of  any 
power  imbalance  or  neediness  of  a  negotiator,  is  a 


50:50  deal  that  splits  the  eurodollar  into  equal 
shares. 

In  many  bargaining  situations,  actors  demand  a 
disproportional  division  of  the  spoils.  The  other 
negotiators  then  have  to  judge  whether  a  credible 
claim  that  negotiations  would  fail  without  a  con¬ 
cession  from  the  other  side  supports  the  request.  If 
they  reject  this  demand,  bargaining  stops  or  is 
interrupted.  Another  suboptimal  outcome  materi¬ 
alizes  when  negotiators  accept  a  disproportional 
division  and  fail  to  call  the  bluff  by  the  other  side 
that  the  negotiations  would  break  down  if  no  con¬ 
cession  were  granted.  The  fate  of  a  claim  depends 
on  the  ability  of  the  negotiators  to  communicate 
credibly  that  their  claims  are  sincere.  This  creates 
in  return  an  incentive  to  bolster  the  credibility  of  a 
specific  position  through  strategic  moves. 

Social  scientists  who  try  to  understand  bargain¬ 
ing  belong  to  either  one  of  two  camps.  The  first 
approach  stems  largely  from  social  psychology  and 
is,  by  and  large,  empirical.  It  tries  to  assess  the 
importance  of  cognitive  failures  and  other  psycho¬ 
logical  features  based  on  the  performance  of  indi¬ 
vidual  negotiators.  In  political  science,  the  strategic 
theory  of  bargaining,  as  it  was  developed  by 
Francis  Edgeworth,  Arthur  Bowley,  Frederik 
Zeuthen,  and  John  Nash,  is  more  influential  as  it 
takes  into  account  that  negotiation  outcomes 
result  from  the  interaction  of  at  least  two  forward- 
looking  actors.  Technically,  the  strategic  theory  of 
bargaining  is  based  on  game  theory.  Its  applica¬ 
tions  pertain  to  any  subfield  within  political  sci¬ 
ence  and  cover  a  wide  variety  of  issues,  ranging 
from  the  analysis  of  interstate  war  to  delegation 
problems  in  public  bureaucracies. 

Bargaining  games  are  answers  to  what  is  known 
as  the  “bargaining  problem”:  Is  it  possible  to  pre¬ 
dict  the  equilibrium  outcome  of  a  social  negotiation 
in  which  all  participating  actors  expect  an  individ¬ 
ual  gain?  It  took  the  genius  of  Nobel  Prize  winner 
John  F.  Nash  to  predict  a  unique  equilibrium — that 
is,  a  single  point — to  the  bargaining  problem.  In 
the  Nash  bargaining  solution  (NBS),  this  out¬ 
come  represents  the  maximal  product  of  the 
differences  between  the  utility  a  negotiator 
attaches  to  his  or  her  share  x  and  the  utility  of 
the  reservation  price  q. 

By  way  of  illustration,  imagine  a  bilateral  bar¬ 
gain  in  which  the  reservation  price  of  two  equally 
skilful  negotiators  is  0  so  that  actor  i  receives  x  and 
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its  counterpart  j  receives  1  -  x  =  y.  We  obtain  the 
equilibrium  prediction  through  the  maximization 
of  the  Nash  product  x(l  -  x).  Setting  the  differ¬ 
enced  equation  to  zero  (1  -  2x  =  0),  x  =  1/2  and 
y  =  1/2  result.  A  power-free  bargain  situation  entices, 
in  other  words,  a  fair  division. 

As  bargaining  power  has  many  facets,  it  can 
enter  the  calculus  of  the  social  outcome  in  various 
ways.  One  possibility  is  to  equate  power  with 
capabilities  ct  that  enter  as  an  exponential  weight 
the  utility  function  of  the  individual  actor,  where 
the  sum  of  the  capabilities  add  up  to  1  within  a 
society.  Hence,  if  negotiator  i  is  twice  as  powerful 
as  counterpart  /,  the  maximization  problem  in  the 
bilateral  case  reduces  to  x2(l  —  x).  Differentiating 
x2  —  x3  and  setting  the  result  to  zero,  we  receive 
2x  —  3x2  =  0.  If  we  disregard  the  outcome  x  =  0, 
actor  V s  share  amounts  to  2/3. 

A  further  aspect  of  power  at  the  center  of  the 
strategic  theory  of  bargaining  stems  from  the 
attractiveness  of  a  so-called  outside  option.  This 
outcome  is  equivalent  to  what  an  individual  nego¬ 
tiator  could  reach  if  the  negotiations  fail  or  if  they 
are  stalled  for  some  time.  Another  Nobel  Prize 
winner,  Thomas  Schelling,  made  this  point  through 
the  counterintuitive  “paradox  of  weakness,” 
according  to  which  the  less  flexible  negotiator  pos¬ 
sesses  a  bargaining  advantage.  Generally,  the  pres¬ 
ence  of  an  attractive  outside  option  helps  negotia¬ 
tors  in  their  attempt  to  commit  themselves  credibly 
to  a  more  beneficial  bargaining  position.  Robert 
D.  Putnam  has  used  this  insight  in  his  “two-level 
games”  metaphor,  which  stands  for  negotiations 
between  governments  that  have  to  care  about 
domestic  pivot  players.  In  such  negotiations,  gov¬ 
ernments  who  face  a  strong,  isolationist  opposi¬ 
tion  and  a  supramajoritarian  ratification  hurdle 
back  home  can  convince  a  more  cooperation- 
minded  negotiation  partner  that  only  a  minimal 
agreement  is  feasible.  The  commitment  of  such 
constrained  governments  is  more  credible  precisely 
because  they  cannot  get  an  agreement  ratified  that 
does  not  find  the  support  of  certain  sections  of  the 
opposition.  A  treaty  that  is  too  close  to  the  bar¬ 
gaining  position  of  the  foreign  government  is  con¬ 
sequently  doomed  to  fail  in  the  negotiations  or  at 
the  ratification  stage. 

The  “Schelling  conjecture”  that  the  seemingly 
weak  is  in  real  life  the  strong  actor  can  be  illus¬ 
trated  within  the  NBS  framework  through  the 


assumption  that  one  of  the  actors  can  be  commit¬ 
ted  to  a  disagreement  point  of  0.5  in  the  bargain 
over  a  pie  of  unit  size.  This  credible  claim  lets  the 
“zone  of  agreement”  shrink  and  alters  the  maximi¬ 
zation  problem  to  (x  -  0.5)(1  -  x),  which  leads 
after  differentiation  to  a  division  of  x  =  0.75  and 
y  =  0.25. 

Criticisms  of  the  NBS  deal  with  its  static  nature, 
its  axiomatic  foundations,  and  its  reliance  on  coop¬ 
erative  game  theory.  The  first  objection  answers  to 
the  observation  that  most  (but  not  all)  negotiations 
consist  of  lengthy  haggling.  The  second  and  the 
third  of  the  perceived  problems  respond  to  the  gen¬ 
eral  criteria  (axioms)  that  a  bargaining  solution 
has,  in  Nash’s  view,  to  fulfill  in  order  to  qualify  as 
a  socially  rational  outcome  of  a  negotiation.  His 
model,  in  particular,  assumes  in  line  with  coopera¬ 
tive  game  theory  that  the  negotiators  agree  on 
certain  rules  of  the  game  and  do  not  try  to  out¬ 
smart  each  other  through  unilateral  moves. 

A  response  to  these  objections  is  the  develop¬ 
ment  of  noncooperative  bargaining  models  that  are 
dynamic  and  include  relevant  one-sided  actions. 
Noncooperative  bargaining  models  also  often  study 
the  impact  of  information  deficits  on  the  bargain¬ 
ing  outcomes.  The  most  prominent  analytical 
framework  that  assumes,  in  its  most  basic  form, 
fully  informed  actors  is  the  Stahl-Rubinstein  bar¬ 
gaining  model.  This  sequential  game  introduces  a 
strict  bargaining  protocol  (a  term  standing  for  the 
order  in  which  the  actors  are  allowed  to  move) 
where  the  right  to  make  an  offer  moves  back  and 
forth  between  two  players.  One  version  of  this  ana¬ 
lytical  framework  assumes  that  actors  discount  the 
future  with  the  factor  <v  (0  <  S  <  1).  Hence,  the 
longer  the  bargaining  goes  on,  the  less  attractive  it 
becomes.  The  model  predicts  again  a  unique  out¬ 
come  that  is  largely  driven  by  the  size  of  the  dis¬ 
count  parameters  and  hence  by  the  patience  of  the 
negotiators.  In  general,  the  more  patient  an  actor 
is,  the  larger  is  the  share  of  the  pie  that  she  or  he 
can  bring  home.  If  the  intervals  between  the  nego¬ 
tiation  rounds  converge  toward  zero,  the  Stahl- 
Rubinstein  bargaining  model  coincides  with  NBS. 

The  Stahl-Rubinstein  framework  of  analysis  has 
been  extended  to  negotiations  among  n  actors.  To 
make  sensible  equilibrium  predictions  in  such  a 
context,  some  additional  assumptions  about  the 
bargaining  protocol  and  the  feasibility  of  certain 
strategies  are  necessary.  The  Baron-Ferejohn  model 


136  Behavioralism 


of  legislative  bargaining,  for  instance,  assumes  that 
one  member  of  parliament  is  randomly  recognized 
as  an  agenda  setter  who  can  make  equilibrium 
proposals  to  a  majority  of  other  legislators. 

Generalizations  of  Schelling’s  insight  that  cred¬ 
ible  commitments  are  a  key  prerequisite  to  bar¬ 
gaining  success  incorporate  models  in  which  some 
negotiators  are  assumed  to  be  imperfectly  informed. 
Such  “asymmetric  information”  pertains  in  the 
bargaining  context  most  often  to  the  credibility  of 
a  negotiator’s  claim  that  an  outside  option  is  more 
attractive  than  the  proposal  made  by  the  badly 
informed  negotiator.  The  better  informed  side  tries 
in  such  a  situation  to  improve  the  credibility  of  its 
claim  through  what  is  called  “costly  signals”;  this 
is  why  such  bargaining  games  belong  to  the  cate¬ 
gory  of  “signaling  games.”  In  an  interstate  crisis, 
for  instance,  a  costly  signal  can  consist  of  a  public 
warning  or  the  mobilization  of  the  troops.  Signaling 
games  have  been  successfully  developed  to  under¬ 
stand  conventional  and  nuclear  deterrence;  crisis 
behavior  of  this  sort  belongs  to  what  is  known  as 
tacit  bargaining,  where  actors  use  nonverbal  means 
of  communication  to  bolster  their  claims  for  a  spe¬ 
cific  division  of  the  spoils.  The  disadvantage  of 
limited-information  models,  as  games  with  asym¬ 
metric  information  are  also  called,  is  the  plethora 
of  possible  equilibrium  solutions.  Oddly,  this  over¬ 
supply  of  predictions  can  only  be  trimmed  down  at 
the  cost  of  asking  the  model  agents  to  behave  even 
more  rationally. 

The  empirical  evidence  in  favor  of  the  equilib¬ 
rium  predictions  that  can  be  derived  from  bargain¬ 
ing  models  is  mixed.  In  comparison  with  other 
decision-making  modes  such  as  voting  or  delega¬ 
tion,  bargaining  models  predict  outcomes  more 
accurately.  However,  the  extreme  100/0  division 
that  some  bargaining  games  suggest  as  a  rational 
strategy  hardly  finds  any  support.  Both  laboratory 
and  field  experiments  have  shown  that  privileged 
negotiators  often  propose  fair  divisions  against 
their  own  self-interest.  The  theoretical  merit  of 
these  findings  remains,  however,  controversial, 
although  some  powerful  causal  mechanisms  that 
account  for  such  seemingly  irrational  behavior 
have  been  proposed. 

Gerald  Schneider 
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Konstanz,  Germany 
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The  behavioral  approach  may  be  characterized  as 
an  attempt  to  conduct  political  science  according  to 
the  example  of  the  natural  sciences — that  is  to 
describe,  explain,  and  predict  political  phenomena 
as  exactly  as  possible.  It  is  the  goal  of  behavioral¬ 
ism  to  gain  scientifically  valid — that  is,  methodi¬ 
cally  secured — generalizations  on  its  subject.  This 
method  is  not  unique  to  the  field  of  the  social  sci¬ 
ences;  this  application  of  the  natural  sciences  to 
political  science  must  be  interpreted  as  being  a  part 
of  a  greater  movement  of  which — despite  great 
changes — all  social  sciences  have  become  a  part. 
From  this  point  of  view,  behavioralism  is  a  phenom¬ 
enon  of  general  social  scientific  empiricism  that  has 
found  and  still  finds  many  supporters  particularly  in 
the  United  States.  This  entry  reviews  the  beginnings 
of  behavioralism  and  its  evolution  in  the  mid-20th 
century  as  a  movement  that  reshaped  the  discipline 
of  political  science.  It  traces  its  emphasis  on  expla¬ 
nation,  prediction,  verifiability,  and  quantitative 
research  and  its  growing  influence  in  academic  insti¬ 
tutions  as  well  as  professional  organizations.  The 
entry  concludes  with  a  discussion  of  the  goals  and 
methodological  assumptions  that  characterize  the 
behavioral  approach  in  political  science. 
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Origins  of  Behavioralism 

Although  behavioralism  as  a  trend  began  primarily 
after  World  War  II,  there  are  important  predeces¬ 
sors  in  the  context  of  American  political  science 
that  paved  its  way.  Beginning  in  the  1920s,  the 
Chicago  School  and  the  New  Science  of  Politics 
movement  urged  a  greater  use  of  empirical  research 
to  establish  a  scientific  identity  to  the  discipline. 
Charles  E.  Merriam,  a  leading  figure  in  political 
science  during  those  years,  and  Harold  D.  Lasswell, 
with  his  decades  of  work  in  the  interdisciplinary 
area  of  policy  studies,  were  key  figures  in  those 
times;  without  their  work,  political  science  may 
well  have  taken  a  different  course.  Early  behavior- 
alist  considerations  developed  in  the  1920s  as  a 
reaction  to  the  then  predominant  institutionalist 
approach,  which  was  called  “institutional  real¬ 
ism,”  “realistic  institutionalism,”  or  simply  “real¬ 
ism.”  The  realist  approach  focuses  on  the  analysis 
of  the  actual  relations  (in  contrast  to  relations  as 
legally  defined)  between  different  government 
institutions,  political  parties,  and  interest  groups. 
In  general,  the  traditional  political-scientific  analy¬ 
sis  of  political  institutions  aimed  less  at  the  devel¬ 
opment  of  theories  than  at  finding  facts.  A  strict 
distinction  between  factual  claims  and  value  judg¬ 
ments  was  an  exception  rather  than  the  rule. 

Originally,  behavioralism  was  a  protest  move¬ 
ment  due  to  dissatisfaction  with  traditional  politi¬ 
cal  science.  The  latter — in  contrast  to  its  neighbor¬ 
ing  sciences  of  sociology  and  economics — was 
accused  of  being  characterized  neither  by  cumula¬ 
tive  research  nor  by  reasonable  scientific  commu¬ 
nication  or  even  scientific  cooperation.  This  flaw, 
it  was  stated,  was  among  others  due  to  political 
scientists  having  no  common  language,  no  com¬ 
mon  issues,  and  no  agreement  on  methods.  This 
was  explained  by  the  lack  of  comprehensive,  gen¬ 
erally  accepted  models  or  terminology  schemes  of 
the  political  process  by  which  research  was  guided 
(David  Easton,  1969). 

This  protest  was  initiated  primarily  by  younger 
scientists  who  came  from  very  different  intellectual 
approaches  and  who  were  initially  rather  isolated, 
since  they  worked  in  different  fields  of  political 
science.  Protest  was  nourished  by  the  experience  of 
increased  cooperation  between  political  scientists 
and  government  institutions  following  the  New 
Deal  and,  most  of  all,  during  World  War  II,  when 


contact  with  representatives  of  the  other  social  sci¬ 
ences  had  also  increased.  One  result  of  this 
increased  interaction  between  political  scientists 
and  politicians  was  the  painful  insight  that  there 
was  an  almost  insurmountable  difference  between 
what  was  needed  for  political  advice  and  what 
political  science  could  offer.  This  difference 
between  theory  and  praxis  was  not  as  strong  in  the 
neighboring  disciplines,  particularly  psychology 
and  economics,  and,  to  a  lesser  extent,  in  sociol¬ 
ogy.  In  any  case,  government  institutions  were 
much  less  interested  in  advice  from  political  scien¬ 
tists  than  from  economists,  psychologists,  and 
sociologists,  whose  theoretical  orientation  and 
methodological  basis  were  generally  held  in  higher 
esteem  than  that  of  political  scientists.  In  this  con¬ 
text,  the  inability  of  traditional  political  science  to 
predict  events  was  particularly  criticized. 

The  inability  of  traditional  political  science  to 
predict  and  explain  the  rise  of  fascism  in  Europe 
was  another  reason  for  the  dissatisfaction  of 
behavioral-oriented  young  scientists.  But  fascism, 
particularly  national  socialism,  also  influenced  the 
rise  of  behavioralism  in  another  way:  in  the  sub¬ 
liminal  reorientation  of  the  philosophy  of  science 
in  American  social  sciences,  influenced  by  emi¬ 
grated  European  researchers.  The  popularization 
of  Max  Weber  by  Talcott  Parsons  and  the  publica¬ 
tion  of  some  of  Weber’s  science-theoretical  essays 
by  Hans  Gerth  and  C.  Wright  Mills  (1946)  were 
significant  because  they  left  out  some  of  the  prag¬ 
matist  philosophical  traditions.  With  many  later 
behavioralists,  the  pragmatic  philosophy,  stating 
that  value  statements  can  be  validated  by  practice, 
which  was  then  predominant  in  the  United  States, 
was  replaced  by  Weber’s  position  that  value  state¬ 
ments  cannot  be  proven  by  empirical  means. 

Another  reason  for  the  rise  of  behavioralism 
seems  to  be  the  changed  international  situation 
after  1945  and  particularly  the  leading  political 
role  played  by  the  United  States  in  the  postwar 
period.  Increasing  international  interdependence, 
decolonization,  and  finally  the  Cold  War  con¬ 
fronted  the  political-scientific  disciplines  of  inter¬ 
national  politics  and  comparative  politics  with 
tasks  that  could  not  be  solved  using  traditional 
concepts  such  as  diplomatic  history,  international 
law,  and  the  comparison  of  institutions. 
Dissatisfaction  with  traditional  political  science 
was  strongly  supported  by  the  exemplary  effect  of 
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some  social-psychological  and  sociological  publi¬ 
cations  such  as  The  People’s  Choice  (1944),  The 
American  Soldier  (1949),  and  The  Authoritarian 
Personality  (1950),  where  the  empirical  methods 
of  random  sampling,  the  questionnaire,  and  atti¬ 
tude  analysis  or  panel  analysis  were  employed  and 
techniques  of  statistical  evaluation  were  used  to 
answer  scientific  questions.  These  methods  previ¬ 
ously  seemed  to  have  been  reserved  for  economics 
and  psychological  diagnostics.  These  studies  some¬ 
how  served  as  signposts  for  a  successful  political- 
scientific  behavioral  research  as  it  was  imagined  by 
early  behavioralists. 

It  was  predominantly  the  younger  political  sci¬ 
entists  who  in  the  early  1950s  became  fierce  critics 
of  the  realist  approach,  most  of  all  Easton.  In  sev¬ 
eral  programmatic  publications,  reaching  back  as 
far  as  the  early  1950s,  he  criticized  the  theoretical 
deficit  of  realist  research,  the  lack  of  methodologi¬ 
cal  reflection  and  terminological  clarity,  the  absence 
of  modern  methods  of  data  collection  and  analysis, 
and  the  one-sided  emphasis  on  institutional  aspects. 

Realigning  the  Discipline  of  Political  Science 

In  the  beginning,  the  supporters  of  behavioralism 
were  a  small,  scattered  group  of  lonely  fighters 
who  were  not  bound  together  by  any  kind  of  orga¬ 
nization  and  who  felt  connected  to  each  other  only 
by  their  way  of  understanding  science.  They  met  at 
seminars  and  in  the  context  of  conventions  of  the 
American  Political  Science  Association  (APSA)  to 
discuss  new  methods  of  empirical  research  in  poli¬ 
tics.  It  was  their  goal  to  readjust  the  entire  disci¬ 
pline  of  political  science  according  to  the  criteria  of 
behavioralism,  not  simply  to  add  another  subdisci¬ 
pline.  It  is  not  surprising  that  such  a  demand  for  a 
fundamental  readjustment  of  political  science  was 
strictly  opposed  by  many  established  political  sci¬ 
entists  who  met  the  new  methods  and  theoretical 
perspectives  with  a  lack  of  appreciation  and  even 
reacted  with  open  dislike.  As  Dwight  Waldo 
(1975)  and  Jurgen  W.  Falter  (2001)  have  described, 
this  resulted  in  the  first  phase  of  the  so-called 
behavioralism  controversy. 

Behavioralism  has  been  characterized  by  its 
interpreters  and  proponents  as  a  “movement,”  as 
a  “mood,”  a  “conviction,”  and  a  “protest,”  as 
well  as  a  “revolution”  and  a  “renaissance.”  Easton 
distinguished  between  an  intellectual  trend  and  an 


academic  movement;  the  former,  he  stated,  had 
many  more  supporters  than  the  latter.  However, 
he  said  that  it  was  difficult  to  identify  both  the  sup¬ 
porters  of  the  movement  and  the  practitioners  of 
the  intellectual  trend,  because  membership  criteria 
were  vague  and  the  borders  of  the  behavioralist 
conviction  were  unclear.  In  particular,  he  stated 
that  it  was  almost  impossible  to  distinguish  “true” 
members  of  the  movement,  fellow  travelers,  and 
occasional  sympathizers  from  each  other. 

Easton  argued  that  behavioralism  did  not  show 
any  formal  organization  as  a  movement.  Although 
there  had  been  short-term  plans  in  this  direction, 
efforts  to  bring  the  movement  together  had  not 
been  very  fruitful.  On  the  other  hand,  there  had 
always  been  a  feeling  of  being  loosely  connected 
and  an  agreement  on  basic  assumptions  and  scien¬ 
tific  ideals,  as  well  as  lively  mutual  communication. 
Also,  certain  spokespeople  of  the  movement  had 
always  been  accepted  by  the  scientific  community. 
The  success  of  behavioralism,  he  stated,  was  due 
not  only  to  its  effective  leaders,  its  capable  support¬ 
ers,  and  the  exemplary  effect  of  important  publica¬ 
tions  but  also  to  a  number  of  favorable  conditions. 
In  addition  to  the  funding  policies  of  the  great 
foundations,  the  significance  of  the  fact  that  the 
Social  Science  Research  Council  and  its  committees 
provided  the  behavioral  movement  with  organiza¬ 
tional  vehicles  for  the  development  of  American 
political  science  cannot  be  overemphasized. 

Growing  Institutional  Influence 

In  the  1950s  and  the  early  1960s,  the  increase  in 
numbers  of  supporters  of  the  behavioral  approach 
seemed  unstoppable.  It  experienced  early,  impor¬ 
tant  success,  as  Harold  D.  Lasswell,  perhaps  the 
most  important  intellectual  pioneer  of  behavioral¬ 
ism,  was  elected  president  of  the  APSA  in  1955- 
1956.  His  successor,  V.  O.  Key  Jr.,  who  was 
president  in  1957-1958,  was  a  behavioralist  as 
well.  Under  their  presidencies,  for  the  first  time, 
the  APSA  established  a  special  team  dealing  with 
questions  of  political  behavior.  The  number  of 
behavioral-oriented  teams  at  APSA  meetings  grew 
considerably  until  1960.  The  rise  of  behavioralism 
became  manifest  in  a  dramatic  rise  of  research 
papers  after  1960.  Soon  there  was  no  longer  any 
subfield  of  political  science  without  behavioral- 
oriented  studies,  although  naturally  the  extent  to 
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which  behavioral  thought  penetrated  the  various 
subfields  varied  greatly.  The  focus  of  research  was 
the  analysis  of  decision-making  processes  and  pat¬ 
terns  of  behavior.  The  individual  served  as  a  basis 
for  collecting  data,  and  the  individual  actor,  the 
group,  the  institution,  or  even  the  political  system 
in  its  entirety,  was  the  subject  on  which  statements 
were  made.  The  core  concepts  of  behavioral 
research  took  shape  mainly  in  the  work  by  Herbert 
Simon  on  political  decision  making,  Heinz  Eulau 
on  roles,  David  Truman  on  interest  groups,  David 
Easton  on  the  political  system,  Harold  Lasswell 
and  Abraham  Kaplan  on  the  concept  of  power, 
and  others.  Political-scientific  research  now  focused 
on  theoretical  concepts  rather  than  on  the  analysis 
of  everyday  politics  as  the  realists  had  preferred. 

From  about  1960  on,  a  number  of  important 
research  papers  based  on  the  model  of  behavioral¬ 
ism  were  published,  among  them  The  American 
Voter  by  Angus  Campbell  and  Philip  E.  Converse 
(1960),  The  Civic  Culture  by  Gabriel  Almond  and 
Sidney  Verba  (1963),  and  Karl  Wolfgang  Deutsch’s 
The  Nerves  of  Government  (published  in  German 
in  1963).  In  addition  to  studies  on  elections  devel¬ 
oped  at  Columbia  University  and  the  University  of 
Michigan  ( Voting  and  The  Voter  Decides),  several 
programmatic  and  empirical  studies  were  published 
that  were  targeted  at  fundamentally  restructuring 
the  subfields  of  comparative  politics  and  interna¬ 
tional  relations.  After  1956,  the  profile  of  the 
American  Political  Science  Review  also  changed 
drastically,  indicating  a  gradual  change  in  the  disci¬ 
pline.  Clearly,  the  quantitative  element  was  gaining 
ground.  Now,  almost  all  American  universities 
appointed  behavioralists  to  their  departments  of 
political  science,  which  triggered  off  a  kind  of  chain 
reaction,  further  supporting  the  spread  of  the 
behavioral  conviction.  Studies  written  in  the  spirit 
of  behavioralism  started  to  dominate  the  appear¬ 
ance  of  the  most  important  scientific  magazines  in 
the  United  States.  With  just  a  few  exceptions,  for  a 
number  of  years  after  1964,  almost  all  APSA  presi¬ 
dents  came  from  the  behavioral  movement  or  were 
at  least  related  to  it.  In  the  course  of  only  1  Vi 
decades,  behavioralism  had  succeeded  in  fundamen¬ 
tally  changing  political  science  in  the  United  States. 

In  the  course  of  the  second  phase  of  the  struggle 
of  the  behavioral  research  program,  following  the 
students’  revolt  of  the  1960s,  a  strongly  applica¬ 
tion-oriented  dimension  was  added  to  behavioral 


political  science.  In  this,  policy  studies  tried  to  com¬ 
bine  a  strictly  scientific  way  of  proceeding  with 
practical  relevance  in  order  to  provide  political 
advice.  Although  it  has  often  been  declared  dead, 
the  behavioral  research  program  is  still  alive,  and 
contemporary  scientists  working  quantitatively  and 
empirically  would  call  themselves  behavioralists. 

There  has  always  been  a  fundamental  difference 
between  empirical  and  theoretical  behavioralism. 
Both  were  based  on  the  same  philosophy  of  sci¬ 
ence — that  is,  logical  empiricism.  However,  they 
deal  with  different  questions  and  use  different 
theoretical  analysis  units.  Whereas  a  great  part  of 
empirical  research  earlier  referred  to  and  still  refers 
to  explaining  individual  behavior,  theory-oriented 
behavioralism  was  mainly  interested  in  the  politi¬ 
cal  system.  Both  kinds  of  behavioralism  turned 
against  philosophically  normative,  purely  institu¬ 
tionalist,  or  social  criticism-oriented  approaches 
of  political  science.  As  Falter  (1982)  notes,  a  bitter 
behavioralist  struggle  in  American  political  science 
resulted,  from  which,  however,  the  behavioral 
research  discipline  survived  relatively  unscathed. 

Goals  and  Methodological  Assumptions 

In  this  section,  the  10  most  important  goals  and 
methodological  assumptions  of  behavioralism  are 
summarized  and  commented  on.  This  catalog  of 
behavioral  principles  tries  to  comprehensively 
explain  the  common  core  of  the  various  behavioral 
trends  in  the  context  of  political  science,  as  far  as 
it  is  possible  to  bring  the  variety  of  approaches 
together. 

The  Need  for  Theory 

The  goal  of  behavioralist  political  science  is  not 
merely  to  describe  political  processes  but  also  to 
explain  and  predict  them.  In  this  context,  theory  is 
seen  as  guiding  the  selection  of  research  topics  and 
the  description  and  integration  of  empirical  findings. 

Looking  for  Regularities 

According  to  the  behavioral  basic  assumption, 
one  must  reach  back  to  law  statements  in  order  to 
be  able  to  explain  and  predict  political  processes. 
One  pragmatic  precondition  for  the  search  for  reg¬ 
ularities  is  the  assumption  that  social  and  political 
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processes  are  subject  to  such  laws.  However,  with 
its  search  for  laws,  behavioralism  does  not  restrict 
itself  to  empirical  generalizations  but  tries  to  go  as 
far  as  possible  to  theoretical  statements.  This  makes 
it  different  from  the  classical  and  radical  behavior¬ 
ism  of  John  Watson  and  B.  F.  Skinner  with 
which — erroneously — it  is  often  equated.  This,  at 
first,  only  a  verbal  equation,  which,  however,  is 
often  supposed  to  disqualify  behavioralism  as  being 
“positivist”  and  thus  out-of-date,  blurs  important, 
fundamental  differences  between  the  two  trends. 
Behavioralism,  which  indeed  is  also  an  analysis  of 
behavior  using  empirical  and  quantifying  methods, 
goes  far  beyond  the  purely  descriptive  recording 
and  generalization  of  behavior.  Psychological 
behavioralism,  being  a  theory-guided  empiricism, 
(a)  looks  primarily  for  explanations  and  not  for 
empirical  generalizations  and  (b)  includes  interven¬ 
ing  variables  moderating  between  stimulation  and 
reaction,  for  example,  cognitive  consciousness  pro¬ 
cesses,  in  the  form  of  theoretical  constructions,  in 
its  analysis.  Thus,  in  contrast  to  purely  empirical 
generalizations,  the  theoretical  statements  of  behav¬ 
ioralism  deal  not  only  with  processes  that  could  be 
directly  observed  but  also  with  dispositional  factors 
such  as  personality  traits  and  attitudes  that  by 
empirical  means  can  only  be  described  indirectly — 
that  is,  with  the  help  of  indicators.  These  latent 
dimensions,  which  cannot  be  directly  observed,  are 
described  with  the  help  of  theoretical  concepts — 
that  is,  by  so-called  constructs.  An  example  of  such 
a  theoretical  construct  is  the  concept  of  identifica¬ 
tion  with  a  political  party.  Theoretical  constructs 
are  connected  by  definitions  and  lawlike  state¬ 
ments.  These  theoretical  concepts  are  connected  to 
the  observation  level  by  so-called  correspondence 
rules,  which  make  the  empirical  evaluation  of  a 
theory  possible.  According  to  the  behavioral  opin¬ 
ion,  all  theoretical  concepts  must  be  operational¬ 
ized  (or  rather  be  operationalizable). 

Striving  for  Verifiability  and  Objectivity 

It  is  typical  for  science,  in  the  behavioral  sense, 
that  all  statements  must  be  verifiable.  From  the 
empirical  point  of  view,  statements  are  verifiable 
only  if  they  either  immediately  refer  to  observable 
facts  or  can  at  least  be  based  on  statements  formu¬ 
lated  in  the  observational  language.  In  contrast  to 
the  analytical  philosophy  of  science,  which  only 


demands  that  statements  must  be  (at  least  indi¬ 
rectly)  observable  or  that  they  can  be  falsified, 
behavioralism  still  follows  the  verification  princi¬ 
ple  of  neo-positivism.  For  it,  sentences  that  cannot 
be  verified  must  be  excluded  from  the  scientific 
canon.  To  meet  the  criterion  of  verifiability,  the 
measurements  of  the  empirical  researcher  must  be 
as  objective  as  possible — that  is,  independent  of 
the  individual  scientist.  Objective  research  results 
are  characterized  by  the  possibility  of  their  being 
repeated  by  other  scientists  under  the  same  circum¬ 
stances.  To  be  verifiable,  the  applied  research  tools 
must  provide  reliable  measured  values.  A  measure¬ 
ment  tool  is  reliable  if  it  produces  the  same  results 
in  the  case  of  repeated  measurement  of  the  same 
events,  with  a  certain  leeway  for  mistakes  that 
must  be  as  small  as  possible.  This  way,  however, 
one  does  not  know  if  a  reliable  measurement  tool, 
for  example,  a  scale,  really  measures  that  what  is 
supposed  to  be  measured.  For  this  reason,  mea¬ 
surements  must  not  only  be  formally  reliable  but 
also  topically  valid.  In  this  context,  the  formal 
exactness  of  measurement  is  a  necessary  but  not 
sufficient  condition  for  the  topical  exactness  of 
measurement.  To  guarantee  the  verifiability  or 
more  generally  the  meaningfulness  of  its  results, 
empirical  research  must  also  argue  in  a  way  that  is 
without  logical  contradictions,  that  explicitly 
defines  its  terms  and  uses  them  consistently,  and 
that  excludes  all  concepts  from  the  scientific  lan¬ 
guage  that  do  not  refer  to  empirical  facts. 

Elaborated  Research  Techniques 

Empirical  social  research  employs  a  number  of 
tools  for  data  collection  and  data  evaluation.  These 
include  various  methods  of  attitude  measurement, 
techniques  such  as  the  interview,  the  methods  of 
quantitative  topical  analysis,  and  standardized 
observation  schemes.  Among  the  most  important 
research  techniques  of  empirical  political  science 
are  various  random  sample  procedures.  Statistical 
methods  of  evaluation  include  regression  analysis, 
variance  and  covariance  analysis,  as  well  as  factor 
and  path  analysis,  among  others.  It  is  the  goal  of 
the  statistical  methods  of  evaluation  to  cope  with 
extensive  amounts  of  data,  to  determine  connec¬ 
tions  between  analyzed  variables,  and  to  determine 
the  degree  of  safety  by  which  the  measured  connec¬ 
tions  can  be  accepted.  In  this  context,  the  use  of 
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constantly  improved,  refined  research  techniques  is 
not  an  end  in  itself  but  a  means  to  meet  the  previ¬ 
ously  described  quality  criteria  of  research. 

The  Trend  Toward  Quantification 

Inevitably,  the  demand  that  measurement  results 
must  be  as  exact  as  possible  results  in  the  attempt 
to  make  data  collection  and  data  evaluation  as 
precise  and  standardized  as  possible.  In  this  way, 
the  tendency  toward  verifiability  and  comparabil¬ 
ity  of  statements  is  increased.  According  to  its  own 
statement,  the  call  for  quantification  is  not  meant 
as  a  dogmatic  demand  but  as  a  useful  goal  to  strive 
for  where  it  is  reasonable  for  the  object.  Other 
goals,  such  as  the  theoretical  and  practical  rele¬ 
vance  of  research  results,  are  not  supposed  to 
retreat.  Often — and  sometimes  with  justification — 
the  opponents  of  behavioralism  object  that  its 
most  important  characteristic  is  its  quest  for  maxi¬ 
mum  quantification  while  often  losing  focus  on  the 
subject  under  analysis. 

Focusing  on  Individual  Behavior 

According  to  the  behavioral  point  of  view,  all 
statements  must  be  referred  to  as  observable  behav¬ 
ior.  On  rare  occasions,  the  behavioral  concept  of 
behavior  also  refers  to  small  groups.  However,  the 
behavioral  concept  of  behavior  is  very  comprehen¬ 
sive  and  includes  both  acts  of  behavior  in  the 
stricter  sense  and  the  expression  of  attitudes, 
intended  behavior,  or  values  as  they  are  collected 
by  using  questionnaires  or  scales.  If  at  the  begin¬ 
ning  of  the  behavioral  movement,  its  spokespeople 
assumed  that  empirical  and  theoretical  analysis 
units  could  be  made  to  coincide — that  is,  the  levels 
of  observation  and  statement  match — soon  this 
behavioral-psychological  remnant  was  abandoned. 
Political  science  deals  also  with  groups,  associa¬ 
tions,  and  institutions  that  cannot  be  excluded 
from  the  field  of  political  science  just  to  meet  a 
methodological  demand.  Otherwise,  behavioral 
political  science  would  actually  have  been  reduced 
to  the  social  psychology  of  the  political  behavior  of 
individuals,  as  is  often  claimed  by  its  critics. 

Inductivism 

Another  typical  feature  of  behavioral  research 
activity  is  its  inductive  way  of  proceeding. 


Usually,  research  happens  by  observing  empirical 
regularities — for  example,  that  Catholics  on  the 
European  continent  vote  for  conservative  parties 
more  often  than  Protestants  do.  These  empirical 
regularities  are  conceptually  expressed  by  way  of 
law  statements.  Also,  theoretical  constructions  are 
definitely  used — for  example,  that  of  the  dogmatic 
personality.  According  to  the  behavioral  point  of 
view,  these  theoretical  constructions,  which  are 
employed  to  explain  and  predict  political  behav¬ 
ior,  must  always  be  directly  operationalized, 
which,  however,  is  not  at  all  necessary  according 
to  the  point  of  view  of  the  analytical  philosophy 
of  science.  Thus,  they  stay  comparatively  close  to 
the  empirical  surface,  which  might  be  one  of  the 
reasons  why  behavioralist  theses  are  compara¬ 
tively  less  fruitful,  which,  however,  in  this  respect 
is  hardly  different  from  the  theories  of  its  neigh¬ 
boring  sciences  of  sociology  and  social  psychol¬ 
ogy.  What  exist  in  rich  quantities  are  empirical 
single  results  and  culture-specific  generalizations, 
which,  however,  are  comparatively  unconnected. 

Value  Relativism 

The  behavioral  position  and  the  philosophy  of 
science  share  the  opinion  that  it  is  not  possible  to 
give  reasons  for  value  judgments  with  the  help  of 
empirical  methods  alone.  Only  cognitive  state¬ 
ments  can  be  empirically  verified — that  is,  state¬ 
ments  on  the  world  as  it  is  or,  rather,  as  it  appears 
to  the  scientist,  but  not  statements  on  what  the 
world  should  be  like  or  how  the  scientist  would 
like  it  to  be.  Thus,  according  to  the  behavioral 
opinion,  value  judgments  are  not  capable  of  truth: 
They  cannot  be  inferred  from  statements  on  facts. 
According  to  Arnold  Brecht,  between  is  and  ought 
there  is  a  gap  that  cannot  logically  be  bridged  and 
that  makes  it  impossible  for  the  scientist  to  present 
value  judgments  as  a  result  of  his  or  her  research. 
Value  positions  that  are  included  into  empirical 
statements  will  distort  the  results  of  research. 
Thus,  the  individual  scientist  must  try  to  neutralize 
his  or  her  value  tendencies  and  prejudices.  This  is 
why  empirical  science  follows  the  ideal  of  impar¬ 
tiality  in  this  sense,  although  it  is  in  some  way  like 
the  “Blue  Flower”  for  scientists — a  goal  that  they 
will  never  fully  achieve.  Even  according  to  behav¬ 
ioral  opinion,  the  individual  will  always  be  some¬ 
what  a  prisoner  of  his  or  her  value  preferences. 
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Giving  up  on  empirically  reasoned  value  judg¬ 
ments  does  not  rule  out  that  values  might  be 
empirically  analyzed  as  dependent  or  independent 
variables.  Furthermore,  according  to  Immanuel 
Kant,  empirical  political  science  is  capable  of  ana¬ 
lyzing  systems  of  value  preferences  in  respect  of 
their  consequences  as  well  as  of  showing  inconsis¬ 
tencies  and  incompatibilities,  of  researching  the 
conditions  for  their  realization,  and  of  giving  tech¬ 
nological  recommendations  in  the  form  of  hypo¬ 
thetical  imperatives. 

Orientation  Toward  Basic  Knowledge 

The  use  of  scientific  insights  for  the  realization 
of  societal  goals  is  possible  only  if  there  is  a  suffi¬ 
cient  amount  of  basic  knowledge.  But  even  today 
this  is  not  the  case  with  most  subfields  of  political 
science.  Even  after  approximately  60  years  of 
behavioral-oriented  research,  sufficiently  con¬ 
firmed  theories  are  rather  an  exception.  On  the 
other  hand,  application-oriented  research  contrib¬ 
utes  little  to  the  growth  of  scientific  basic  knowl¬ 
edge.  To  be  able  to  make  sound  predictions, 
according  to  the  behavioral  program,  there  must 
be  knowledge  of  behavioral  regularities  as  well  as 
theories  using  general  terms,  among  others.  The 
technological  application  of  scientific  insights 
depends  on  the  existence  of  a  knowledge  stock  that 
must  be  worked  out  by  way  of  basic  scientific  ori¬ 
entation.  That  is  why  (at  least  originally)  behavior¬ 
alism  supports  a  program  of  “true  science,” 
although  as  a  consequence  of  Easton’s  postbehav- 
ioral  revolution  we  observe  that,  in  this  respect, 
points  of  view  are  not  so  strict  anymore.  Today, 
many  younger  researchers,  who  nevertheless  are 
definitely  connected  to  behavioralism  in  respect  of 
their  research  techniques,  try  to  combine  theoreti¬ 
cal  and  practical  relevance,  most  of  all  in  the  field 
of  policy  studies. 

Integration  and  Interdisciplinarity 

For  its  efforts  to  work  out  a  secured  and  grow¬ 
ing  knowledge  stock,  political  science  must  reach 
back  to  the  concepts,  procedures,  and  results  of 
other  social  sciences.  According  to  the  behavioral 
opinion,  politics  include  only  one  field  of  the 
social;  personality  traits  in  the  widest  sense  may  be 
as  significant  for  political  behavior  as  social  or 


economic  facts.  It  is  necessary  to  reach  back  to 
data  collection  and  evaluation  techniques  of 
other  social  sciences  because  there  are  only  a  few 
empirical  methods  developed  by  political  science 
itself.  Almost  all  the  research  techniques  used  by 
it  are  takeovers  or  modifications  of  sociological, 
social-psychological,  economic,  or  individual- 
psychological  methods  and  give  reason  to  the 
demand  for  an  interdisciplinary  approach  and  the 
integration  of  research  results. 

The  Lasting  Influence  of  Behavioralism 

Even  if  few  political  scientists  today  consider 
themselves  behavioralists  in  the  stricter  sense,  it  is 
a  fact  that  at  first  American  political  science  and 
later  international  political  science  have  been  char¬ 
acterized  and  penetrated  by  behavioralism  to  such 
an  extent  that  in  Thomas  Kuhn’s  sense — with  a 
grain  of  salt — we  may  speak  of  a  scientific  turn¬ 
around  whose  heuristic  possibilities  do  not  seem  to 
be  exhausted  at  all  even  after  60  years.  It  is  an 
achievement  of  the  behavioral  revolt  that  today 
empirical-quantitative  studies  are  a  matter  of 
course.  As  for  the  methodical-statistical  refinement 
of  analysis,  for  a  long  time  empirical  political  sci¬ 
ence  has  been  acting  at  the  same  level  as  its  neigh¬ 
boring  disciplines.  The  bitter  quarrels  between 
doctrines  from  the  1950s  to  the  1970s  have  been 
replaced  by  a  peaceful  coexistence  of  many 
approaches  and  theoretical  positions.  In  so  far, 
Robert  Dahl  proved  to  have  almost  prophetic 
qualities  when  he  subtitled  a  1961  article  with  an 
early  obituary  for  behavioralism,  “Epitaph  for  a 
Monument  to  a  Successful  Protest.  ” 
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Beliefs 


Political  beliefs  can  be  defined  as  individual  psy¬ 
chological  orientations  toward  objects  of  a  cogni¬ 
tive  nature  in  the  political  world  (e.g.,  polities, 
institutions,  actors)  that  consist  of  the  perceived 
likelihood  of  a  given  attribute  being  attached  to 
them.  Therefore,  a  political  belief  has  three  basic 
components:  an  object,  an  attribute,  and  the  per¬ 
ceived  subjective  probability  that  the  object  and 
the  attribute  go  together.  However,  as  with  other 
concepts  such  as  values  or  attitudes,  political 
beliefs  are  hard  to  define  properly,  having  been 
subject  to  different  and  even  contradictory  uses. 
Furthermore,  any  survey  of  the  literature  will 
show  how  political  beliefs  are  measured  in  differ¬ 
ent  and  often  imprecise  ways.  It  is,  thus,  a  concept 
still  “open”  to  full  discussion,  or,  as  Imre  Lakatos 
would  have  put  it,  it  is  a  concept  still  in  the  “mor¬ 
phological  phase.”  Therefore,  this  entry  first 
reviews  the  main  uses  and  definitions  of  the  con¬ 
cept  found  in  the  literature  to  appropriately  frame 
the  definition  advanced  above.  This  concept  of 


beliefs  is  then  discussed  in  the  wider  context  of 
belief  systems. 

Conceptual  Problems 

Generally,  one  can  think  of  political  beliefs  as  indi¬ 
vidual  orientations  toward  politics,  related  to 
other  similar  concepts  such  as  political  values  and 
political  attitudes.  But  the  terminological  jungle 
that  comes  along  with  these  concepts  makes  it  very 
hard  to  make  sense  of  such  a  definition;  as  William 
J.  McGuire  has  stated,  we  find  ourselves  with  a  set 
of  names  in  search  of  a  distinction  rather  than  with 
a  distinction  in  search  of  a  terminology,  making 
the  situation  rather  complicated  and  blurry.  An 
often  implicit  assumption  is  that  these  concepts 
(beliefs,  values,  and  attitudes)  can  be  placed  on  an 
underlying  continuum  ranging  from  the  most  basic 
or  abstract  to  the  most  specific.  The  most  abstract 
ones  would  therefore  be  the  filters  through  which 
citizens  form  their  more  specific  orientations. 
However,  this  continuum  is  often  defined  by  the 
opposition  between  attitudes  and  values.  Values 
are  commonly  placed  on  the  top  of  the  scale,  at  the 
highest  level  of  generality  or  abstraction,  and  atti¬ 
tudes  are  conceptualized  as  being  more  specific. 
But  the  place  of  political  beliefs  within  this  frame¬ 
work  is  not  so  clear.  A  review  of  the  uses  of  the 
concept  shows  that  there  is  deep  controversy  over 
how  to  locate  political  beliefs  within  this  context. 
Indeed,  on  one  hand  are  scholars  who  quasi  auto¬ 
matically  equate  political  beliefs  with  values  and 
use  both  terms  almost  interchangeably,  such  as 
Stanley  Feldman,  who  treats  “core”  beliefs  as 
roughly  equivalent  to  political  values.  On  the  other 
extreme,  there  are  many  scholars  who  consider 
beliefs  as  the  more  specific  components  of  atti¬ 
tudes.  In  this  latter  framework,  attitudes  are  con¬ 
ceptualized  as  summary  evaluations  made  of  opin¬ 
ions  and  beliefs,  and  therefore,  beliefs  would  be 
placed  at  a  lower  level  of  generality  or  abstraction. 

This  controversy  often  passes  unnoticed  because 
most  scholars  simply  do  not  pay  attention  to  the 
definitional  issue.  Nevertheless,  the  extremely 
divergent  uses  of  the  concept  must  lead  us  to  con¬ 
clude  that  the  placement  of  the  concept  in  a  latent 
(undefined)  scale  is  not  the  appropriate  path  to 
follow  if  we  aim  at  analytical  precision.  There  is  no 
clear  consensus  on  where  to  place  beliefs  in  such  a 
scale  and,  therefore,  its  usefulness  is  extremely 
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limited.  We  thus  turn  our  attention  to  other 
attempts  that  are  more  analytically  oriented. 

These  contributions  are  to  be  found  especially 
in  the  field  of  social  psychology.  Martin  Fishbein 
and  leek  Ajzen  advanced  perhaps  the  clearest  defi¬ 
nition  of  beliefs-.  They  define  the  term  as  the  per¬ 
ceived  likelihood,  or  the  subjective  probability, 
that  an  object  has  a  specific  attribute  or  character¬ 
istic  associated  with  it.  Therefore,  political  beliefs 
would  be  presented  only  by  statements  of  the  fol¬ 
lowing  nature:  “The  government  is  efficient,” 
“The  government  is  responsive,”  “The  judiciary 
system  is  fair,”  or  “Health  care  is  free  for  all,”  and 
so  on.  These  are  cognitive  thoughts  and  ideas  that 
subjectively  link  specific  attributes  to  political 
objects.  Additionally,  as  stated  by  Oskar 
Niedermayer  and  Bettina  Westle,  beliefs  can  differ 
in  strength,  certainty,  and  subjective  relevance  or 
salience.  Thus,  they  should  be  measured  on  a  sin¬ 
gle  dimension  of  subjective  probability  of  an 
object-attribute  relationship  (i.e.,  how  likely  or 
certain  it  is  that  object  X  has  the  attribute  Y),  com¬ 
monly  through  the  degree  of  agreement  or  support 
to  a  given  statement  that  must  be  both  object  and 
attribute  specific. 

This  conceptual  framework  allows  for  a  clearer 
conceptualization  of  the  term,  and  it  lays  the 
ground  for  a  distinction  between  beliefs  and  atti¬ 
tudes.  Beliefs  are  primarily  of  a  cognitive  nature, 
while  attitudes  are  evaluative.  As  Lester  Milbrath 
states,  beliefs  are  cognitions  with  a  feeling  of  cre¬ 
dulity  attached  to  them.  In  other  words,  a  belief 
contains  a  cognitive  component  of  an  object  (it  has 
to  be  known  for  the  belief  to  exist)  linked  to  “cred¬ 
ible  (believable)  or  incredible  (unbelievable)”  fea¬ 
tures  of  that  specific  object.  In  this  sense,  beliefs 
are  distinct  from  values  and  attitudes,  since  the 
latter  imply  the  attachment  of  a  feeling  of  valence 
(like/dislike)  to  the  cognition  rather  than  to  credu¬ 
lity.  (In  Milbrath’s  scheme,  both  beliefs  and  atti¬ 
tudes  have  a  cognitive  dimension  since  one  can 
only  believe  or  evaluate  known  objects.)  Both  feel¬ 
ings,  Milbrath  insists,  are  interrelated  (i.e.,  we  tend 
to  believe  what  we  like),  but  at  the  same  time,  they 
are  different  (we  can  believe  things  that  we  dislike 
and  vice  versa).  In  any  case,  the  argument  is  the 
same:  Beliefs  are  based  on  the  perceived  probabil¬ 
ity  that  an  object  has  a  given  feature  associated 
with  it.  An  implication  of  this  conceptualization  is 
that  beliefs  can  often  be  falsified  with  external, 


objective  criteria,  while  this  is  not  the  case  for  atti¬ 
tudes  and  values. 

However,  as  Stuart  Oskamp  notes,  this  way  of 
conceptualizing  beliefs  and  attitudes  raises  the 
problem  of  the  status  of  the  so-called  evaluative 
beliefs  that  combine  the  perceived  likelihood  of 
having  a  given  attribute  on  one  hand,  and  an 
evaluation  on  the  other  (e.g.,  “The  government  is 
inefficient,”  “The  authorities  are  responsive,”  or 
“The  political  system  is  democratic”).  This  is  so 
because  the  attributes  being  believed  or  disbelieved 
have  a  clear  and  widely  shared  positive  or  negative 
implication.  Even  if  these  statements  are  syntacti¬ 
cally  nonevaluative,  since  they  refer  to  the  likeli¬ 
hood  of  an  attribute  being  attached  to  an  object, 
semantically  they  are  clearly  evaluative  orienta¬ 
tions.  This  ambivalence  makes  evaluative  beliefs 
almost  indistinguishable  from  attitudes.  Evaluative 
beliefs  might  be  considered  as  an  intermediate  cat¬ 
egory,  but  what  they  reveal  is  indeed  a  continuum 
that  ranges  from  the  most  objective  (and  falsifi- 
able)  beliefs  to  the  most  evaluative  ones. 

The  inability  to  distinguish  beliefs  from  atti¬ 
tudes  in  this  kind  of  literature  points  again  to  a 
serious  flaw  in  conceptualization.  As  noted  above, 
any  meaningful  definition  of  beliefs  must,  at  least, 
allow  us  to  separate  them  from  political  attitudes. 
Scholars  using  this  theoretical  framework  try  to 
resolve  this  problem  by  going  back  to  the  same 
kind  of  scale  of  generality  or  abstraction:  Attitudes 
are  considered  to  be  generalized  and  enduring 
evaluations  of  an  object,  while  beliefs  are  singular 
and  specific  statements  (evaluative  or  not)  that 
refer  to  particular  characteristics  of  the  object. 
Therefore,  most  scholars  of  this  framework  con¬ 
sider  that  a  person’s  attitude  toward  an  object  can 
be  thought  of  as  a  summary  of  his  or  her  evalua¬ 
tive  beliefs.  For  example,  the  overall  evaluation  of 
the  political  system  would  be  an  attitude,  while  the 
evaluation  of  each  of  its  specific  features  (respon¬ 
siveness,  efficacy,  transparency,  etc. )  would,  within 
this  framework,  be  thought  of  as  a  belief. 

There  is  an  additional  empirical  problem  in  this 
conceptualization:  Cognitive  beliefs  are  the  foun¬ 
dations  for  evaluative  attitudes  and,  therefore,  we 
must  expect  a  close  empirical  relationship  among 
them.  However,  this  relationship  is  not  present  in 
many  instances.  Scholars  justify  this  lack  of  empir¬ 
ical  evidence  by  saying  that  cognitive  orientations 
and  evaluative  attitudes  may  not  be  empirically 
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related.  However,  attitudes  made  up  of  summary 
evaluations  must  show — at  least  partially — some 
congruence  with  the  set  of  beliefs  that  produces 
such  general  evaluations.  Without  such  an  empiri¬ 
cal  congruence,  therefore,  we  should  not  think  of 
those  attitudes  as  summarizing  a  set  of  beliefs. 

Ideology  and  Political  Beliefs: 

The  Belief-Systems  Debate 

Despite  the  relevance  of  the  preceding  definitional 
discussion,  we  must  acknowledge  that  the  core 
debate  in  political  science  on  political  beliefs  has 
basically  revolved  around  the  concept  of  belief 
systems — their  nature,  structure,  and  internal  con¬ 
sistency.  This  debate  was  originated  by  Philip 
Converse’s  seminal  work  The  Nature  of  Belief 
Systems  in  Mass  Publics.  The  following  section 
briefly  reviews  the  concept  and  the  terms  of  the 
debate,  which  has  been  both  conceptual  and 
empirical. 

The  key  concept  is  political  belief  systems.  In 
Converse’s  framework,  belief  systems  are  coherent 
structures  of  “attitudes  and  ideas”  whose  compo¬ 
nents  are  logically  organized.  The  internal  coherence 
of  belief  systems  stems  from  political  ideology — 
mainly  in  terms  of  left-right  ideology.  Ideology, 
therefore,  consists  of  an  overarching  set  of  inte¬ 
grated  principles  about  politics  and  the  social 
world  from  which  specific  beliefs  are  deductively 
derived.  This  concept  was  meant  to  overcome  the 
limits  of  the  term  ideology,  since  its  multiple  (and 
often  contradictory)  uses  and  its  normative  over¬ 
tones  made  it  of  limited  utility  for  research.  How 
do  belief  systems  acquire  their  structure  and  even¬ 
tually  their  internal  coherence?  Converse’s  per¬ 
spective,  which  has  its  foundation  in  physics, 
implicitly  assumes  randomness  to  be,  somehow, 
the  natural  condition  of  things.  This  underlying 
assumption  implies  that  whenever  we  find  a  given 
structure  in  citizens’  political  beliefs,  we  must 
look  for  an  external  force,  which,  in  his  frame¬ 
work,  is  called  a  “constraint.” 

What  are  these  external  forces,  or  constraints? 
Converse  discusses  two  types  of  constraints:  the 
social  and  the  psychological.  However,  his  focus, 
and  the  focus  of  the  literature  following  him,  has 
been  disproportionately  placed  on  the  first  one.  A 
social  constraint  for  the  political  belief  system 
comes  from  the  dominant  role  of  political  elites, 


who  “package”  attitudes  and  beliefs  in  coherent 
structures  for  consumption  by  the  public,  as 
expressed  by  Stanley  Feldman.  These  packages  are 
often  referred  to  as  “political  ideology.”  The  psy¬ 
chological  constraints  are  based  on  the  role  of 
superior  convictions  concerning  humans  and  soci¬ 
ety  that  bind  together  more  specific  orientations, 
such  as  attitudes  and  beliefs.  In  other  words, 
Converse  is  referring  to  what  has  otherwise  been 
called  “values”  or,  in  Stanley  Feldman’s  terminol¬ 
ogy,  “core  beliefs.”  Therefore,  the  ideological 
structure  of  mass  belief  systems  (in  terms  of  left- 
right  opposition)  might  be  thought  of  as  a  “social” 
or  elite-driven  constraint,  while  the  alternative 
perspective  of  values  and  core  beliefs,  or  even  non¬ 
political  normative  judgments,  may  fall  under  an 
alternative  psychological  constraint. 

The  actual  structure  (or  lack  of  structure)  of 
citizen  belief  systems,  as  well  as  its  main  determi¬ 
nants,  has  also  been  the  subject  of  an  enduring 
controversy.  The  empirical  findings  to  test  this 
theory  have  displayed  very  little  consistency  across 
time  and  across  issues  or  beliefs.  Only  a  reduced 
segment  of  the  publics  (the  so-called  ideologues) 
shows  a  coherent  pattern  of  beliefs  congruent  with 
the  basic  left-right  ideological  scheme.  This  low 
interitem  congruence  and  overtime  stability  may 
signify  a  limited  political  understanding  of  citizens 
in  ideological  terms,  but  it  could  also  be  an  artifact 
or  measurement  error.  Furthermore,  Converse 
himself  has  shown  that  party  identification  is  sub¬ 
stantially  more  stable  over  time  than  ideological 
self-positioning. 

More  recent  research  has  led  to  new  reviews  of 
Converse’s  findings.  For  example,  John  Zaller  has 
shown  in  The  Nature  and  Origins  of  Public 
Opinion  how  citizens’  beliefs  and  attitudes  are 
changed  and  polarized  according  to  political  elites’ 
discourses  even  within  the  group  of  citizens  in 
which  ideologies  are  not  structured  and  coherent 
belief  systems  are  not  dominant.  Additionally, 
comparative  research  has  shown  how  the  extent  to 
which  citizens  hold  ideologically  structured  belief 
systems  also  depends  on  some  contextual  features, 
such  as  the  political  party  system,  the  structure  of 
party  competition,  and  the  electoral  system. 
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Bilateralism 


Bilateralism  refers  to  any  relationship  between 
two  parties.  In  the  study  of  international  relations, 
we  normally  think  of  bilateralism  as  referring  to 
relations  between  two  states.  But  this  is  an  overly 
narrow  definition.  Bilateralism  can  denote  an 
arrangement  between  two  private  companies — 
economists,  for  instance,  write  of  a  “bilateral 
monopoly”  where  there  is  effectively  only  one 
seller  and  one  purchaser  in  a  market.  Alternatively, 
bilateralism  can  involve  two  nongovernmental 
organizations  (NGOs),  two  intergovernmental 
organizations,  or  any  two-party  combination  of 
these  actors  (including  states).  For  instance,  a 
bilateral  agreement  is  negotiated  when  an  inter¬ 
governmental  organization,  such  as  the  United 
Nations  Children’s  Fund  (UNICEF),  commissions 
a  local  NGO  to  manage  a  project  for  it. 


The  term  bilateralism  is  also  used  to  refer  to  a 
strategy  of  how  relations  are  conducted;  it  denotes 
an  approach  that  prioritizes  reaching  agreement 
with  one  other  party  rather  than  acting  by  oneself 
(unilateralism)  or  together  with  two  or  more  other 
parties  (multilateralism).  It  is  also  differentiated 
from  regionalism,  where  relations  are  conducted 
with  two  or  more  other  parties  within  the  same 
geographical  area.  Bilateralism  need  not  involve 
geographical  proximity — an  instance  is  the  U.S. 
free  trade  agreement  with  Bahrain. 

The  vast  majority  of  treaties  between  states  are 
of  a  bilateral  character.  These  cover  the  full  range 
of  diplomacy,  including  agreements  on  extradi¬ 
tion,  overseas  development  assistance,  cultural  and 
educational  exchanges,  aviation,  double  taxation, 
cooperation  on  criminal  and  terrorism  issues,  the 
exchange  of  nuclear  materials,  postal  services,  visa 
requirements,  mutual  acceptance  of  domestic  stan¬ 
dards,  cooperation  on  meteorological  services  and 
space  exploration,  as  well  as  defense,  investment, 
and  trade  issues.  It  is  in  the  areas  of  investment  and 
particularly  trade  that  strategies  of  bilateralism 
have  become  most  controversial  in  recent  years: 
The  focus  in  the  remainder  of  this  entry  is  on  these 
areas. 

Bilateralism  has  been  the  dominant  form  of 
international  trade  agreement  since  the  start  of 
international  commerce.  Such  treaties  have  a  his¬ 
tory  as  long  as  that  of  international  trade  itself — 
the  first  known  bilateral  trade  agreement  was 
between  Egypt  and  Babylonia,  dating  to  2500 
BCE.  As  international  commerce  expanded,  so  did 
the  number  of  bilateral  trade  treaties  multiply, 
driven  first  by  the  establishment  of  the  great  mari¬ 
time  empires  of  the  16th  and  17th  centuries  and 
then  by  the  Industrial  Revolution  in  the  18th  and 
19th  centuries. 

These  bilateral  agreements  customarily  assumed 
the  form  of  a  treaty  of  friendship,  commerce,  and 
navigation,  which  typically  outlined  the  terms 
under  which  trade  and  shipping  between  the  two 
states  would  take  place  and  the  rights  enjoyed  by 
individuals  and  firms  from  one  state  who  lived, 
conducted  business,  or  owned  property  in  the  part¬ 
ner  state.  The  treaty  also  detailed  the  tariff  treat¬ 
ment  to  be  applied  to  various  goods. 

In  the  1 9th  century,  in  particular,  these  bilateral 
commercial  treaties  were  used  not  just  to  protect 
existing  commerce  or  to  establish,  on  a  reciprocal 
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basis,  new  markets  for  the  two  parties  but  also  to 
impose  decidedly  unequal  arrangements  on  mili¬ 
tarily  weaker  states.  The  most  notorious  examples 
were  the  treaties  imposed  by  Britain  on  Qing 
Dynasty  China  after  the  First  Opium  War  (1839- 
1842).  The  Treaty  of  Nanking  opened  five  ports  to 
British  exports,  at  fixed  tariffs  to  be  agreed 
between  the  two  governments,  and  ceded  Hong 
Kong  to  Britain.  Other  industrializing  countries 
followed  suit,  the  United  States,  for  instance,  nego¬ 
tiated  a  commercial  treaty  with  China  in  1884  that 
was  markedly  unequal  in  the  concessions  made  by 
the  two  parties. 

Although  the  development  of  the  unconditional 
most-favored-nation  principle  by  leading  European 
trading  states  in  the  19th  century  laid  a  foundation 
for  multilateral  commercial  treaties,  bilateralism 
remained  the  norm  throughout  the  interwar  period. 
The  decline  in  commerce  following  World  War  I 
prompted  an  innovative  attempt  at  multilateral 
cooperation  on  trade  through  the  staging  of  a 
World  Economic  Conference  in  1927,  but  the 
treaty  negotiated  failed  to  gain  enough  signatories 
to  come  into  effect  when  countries  retreated  into 
protectionism  as  the  depression  deepened  at  the 
end  of  the  decade.  In  response  to  the  Great 
Depression,  governments  typically  sought  to  nego¬ 
tiate  bilateral  commercial  treaties. 

Only  with  the  establishment  of  the  General 
Agreement  on  Tariffs  and  Trade  (GATT)  in  1947 
did  a  multilateral  approach  to  trade  cooperation 
gain  precedence  over  bilateral  treaties.  The  GATT 
established  rules  intended  to  ensure  that  countries 
that  agreed  to  participate  in  multilateral  rounds  of 
tariff  reduction  negotiations  would  not  be  able  to 
act  unilaterally  to  negate  the  obligations  to  which 
they  committed.  The  GATT,  however,  did  permit 
some  exceptions  to  its  rule — that  contracting  par¬ 
ties  must  unconditionally  provide  most-favored- 
nation  status  to  other  parties.  It  thereby  opened 
the  way  for  bilateral  agreements  to  continue  to 
play  a  role  in  the  governance  of  global  trade.  First, 
it  “grandfathered”  existing  arrangements  between 
imperial  powers  and  their  current  and  former  colo¬ 
nies.  Second,  it  permitted  the  negotiation  of  free 
trade  agreements  and  customs  unions.  Although 
the  original  expectation  was  that  the  latter  provi¬ 
sion  would  apply  primarily  to  regional  arrange¬ 
ments,  it  has  increasingly  been  used  to  legitimate 
bilateral  arrangements. 


For  most  of  the  period  between  the  establish¬ 
ment  of  GATT  and  that  of  its  successor,  the  World 
Trade  Organization,  in  1995,  multilateralism  dom¬ 
inated  the  governance  of  international  trade.  All 
the  major  trading  powers  (the  United  States,  the 
European  Union  [EUJ,  and  Japan)  gave  strong  sup¬ 
port  to  multilateral  liberalization  despite  domestic 
resistance  to  the  opening  up  of  “sensitive”  sectors. 
In  the  1980s,  however,  bilateral  strategies  gained 
new  prominence  on  two  dimensions. 

First,  with  the  rapid  growth  of  exports  from 
East  Asia  that  caused  problems  for  domestic  man¬ 
ufacturers  in  North  America  and  Europe,  and  led 
to  rapidly  burgeoning  trade  imbalances,  govern¬ 
ments  increasingly  turned  to  bilateral  agreements 
(as  well  as  unilateral  measures)  to  alleviate  these 
problems.  Most  prominent  were  the  actions  taken 
by  the  U.S.  government  under  Section  301  of  the 
U.S.  Trade  Act.  The  1962  Trade  Expansion  Act 
had  contained  a  provision  granting  the  U.S.  presi¬ 
dent  authority  to  impose  retaliatory  measures 
when  foreign  governments  were  considered  to 
have  harmed  U.S.  trade  interests.  This  measure 
was  substantially  strengthened  by  Section  301  of 
the  1974  Trade  Act,  which  moved  the  primary 
responsibility  for  action  from  the  president  to  the 
U.S.  Trade  Representative  (USTR). 

Section  301  provided  that  in  the  event  of  the 
USTR  determining  that  foreign  trade  practices  were 
unfair,  negotiations  should  take  place,  and,  if  these 
failed  to  realize  a  satisfactory  outcome,  the  United 
States  could  then  impose  restrictions  on  its  partner’s 
exports  or  withdraw  existing  concessions.  Although 
the  act  allowed  for  unilateral  action,  the  outcome  of 
the  Section  301  investigations  often  took  the  form 
of  bilateral  negotiations.  These  produced  agree¬ 
ments  with  a  number  of  East  Asian  and  other  indus¬ 
trializing  economies  (most  notably  Japan,  Korea, 
and  Taiwan),  under  which  their  governments  under¬ 
took  to  limit  the  volume  of  exports  of  specific  prod¬ 
ucts  (at  various  times,  automobiles,  steel,  textiles, 
clothing,  and  footwear).  These  arrangements  cir¬ 
cumvented  GATT  regulations  in  that  the  country 
responsible  for  exports  “voluntarily”  undertook  to 
restrain  them.  In  1984,  the  EU  implemented  a  com¬ 
mercial  policy  regulation  that  had  similar  provisions 
to  the  U.S.  legislation.  “Voluntary  export  restraints” 
were  outlawed  by  the  World  Trade  Organization 
(WTO)  Treaty:  Such  bilateral  agreements,  subse¬ 
quently,  have  largely  disappeared. 
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Meanwhile,  a  new  form  of  bilateralism  gained 
prominence  from  the  mid-1980s:  preferential  trade 
agreements  (PTAs).  As  noted  above,  the  GATT 
exempted  parties  to  free  trade  areas  and  customs 
unions  from  the  requirement  of  applying  for  most- 
favored-nation  treatment.  In  the  1960s,  a  wave 
of  PTAs  had  been  negotiated  by  developing 
economies — but  most  of  them  proved  ineffective. 
Among  industrialized  economies,  only  the 
European  countries — through  the  EU  and  the 
European  Free  Trade  Association — had  pursued 
regional  strategies  that  led  to  significant  excep¬ 
tions  to  the  multilateralism  of  the  GATT.  The 
United  States  and  Japan  (and  other  East  Asian 
economies)  were  enthusiastic  supporters  of  multi¬ 
lateralism.  Their  attitude  began  to  change — in 
part  because  of  concerns  about  the  “unfair”  trad¬ 
ing  practices  of  East  Asian  economies  and  about  a 
perceived  loss  of  bargaining  power  vis-a-vis  the 
EU.  The  key  development  that  symbolized  a  U.S. 
move  away  from  a  multilateralism-only  strategy 
was  the  negotiation  of  its  first  bilateral  free  trade 
agreement — with  Israel — in  1985  (the  United  States 
had  earlier  entered  into  a  bilateral  sectoral  agree¬ 
ment  with  Canada — the  U.S.-Canada  Automotive 
Products  Agreement  of  1965 — and  had  introduced 
a  system  of  trade  preferences  for  Caribbean  coun¬ 
tries  in  1983).  The  Canada-U.S.  Free  Trade 
Agreement  followed  in  1989;  in  turn,  this  was  con¬ 
verted  into  the  North  American  Free  Trade 
Agreement  in  1993  with  its  extension  to  Mexico. 

Other  countries  quickly  followed  suit;  the  conse¬ 
quence  was  a  veritable  explosion  in  the  number  of 
bilateral  trade  agreements  in  the  years  after  the 
establishment  of  the  WTO  in  1995  (an  ironic  out¬ 
come  in  that  the  creation  of  the  WTO  had  been 
regarded  as  a  significant  advance  in  multilateralism). 
The  WTO  reported  that  it  had  been  notified  of 
nearly  300  new  PTAs  between  1995  and  2008; 
bilateral  agreements  accounted  for  more  than  three 
quarters  of  all  PTAs  notified  and  in  force.  Of  par¬ 
ticular  significance  in  this  context  was  the  abandon¬ 
ment  by  East  Asian  states  of  their  previous  commit¬ 
ment  to  nondiscrimination.  Whereas  only  one  pref¬ 
erential  trade  agreement  was  in  place  in  the  region  in 
the  mid-1990s  (the  ASEAN  Free  Trade  Agreement), 
a  decade  later  more  than  80  PTAs  either  were  being 
implemented  or  were  under  negotiation. 

Several  factors  contributed  to  this  growth  in 
bilateralism:  the  slow  progress  made  in  global 


trade  negotiations  and  perceptions  that  bringing 
the  Doha  Round  of  WTO  negotiations  to  a  suc¬ 
cessful  conclusion  would  be  difficult;  the  growth  in 
regionalism  in  other  parts  of  the  world  (especially 
the  enlargement  and  deepening  of  cooperation 
within  the  EU),  which  caused  a  “domino”  effect  as 
other  countries  sought  not  to  be  excluded  from  the 
new  preferential  agreements;  and  the  financial  cri¬ 
sis  in  East  Asia  in  1997-1998  (which  led  govern¬ 
ments  to  reconsider  prevailing  orthodoxies  on 
trade  policy). 

Proponents  of  the  new  bilateralism  believe  that 
it  offers  several  advantages  over  regional  and  mul¬ 
tilateral  negotiations:  In  principle,  bilateral  agree¬ 
ments  should  be  simpler  to  negotiate  given  the 
small  number  of  parties  involved;  they  also  afford 
an  opportunity  to  negotiate  “deeper”  cooperation 
than  in  the  WTO  and  in  doing  so  can  construct  a 
model  that  provides  states  with  an  incentive  to 
emulate  its  provisions  at  the  global  level.  Critics,  on 
the  other  hand,  argue  that  such  arrangements  are 
inherently  discriminatory,  cause  distortions  because 
partial  and  discriminatory  liberalization  can  advan¬ 
tage  actors  who  are  not  the  most  efficient  produc¬ 
ers,  politicize  trade  and  bring  power  considerations 
to  the  fore  (see  the  “Unequal”  treaties  of  the  19th 
century  or  those  between  Nazi  Germany  and  its 
Central  European  neighbors),  and  divert  attention 
and  scarce  resources  from  potentially  more  benefi¬ 
cial  negotiations  at  the  global  level. 

John  Ravenhill 
Australian  National  University 
Canberra,  Australia 
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For  most  of  the  past  century,  the  so-called  stan¬ 
dard  social  science  model  (SSSM)  has  been  the 
overwhelmingly  dominant  paradigm  in  American 
political  science.  As  we  know,  the  SSSM  holds 
three  tenets  to  be  critical: 

1.  that  humans  have  no  innate  behavioral 
tendencies; 

2.  that,  consequently,  human  nature  and  human 
behavior  are  solely  the  products  of  learning  and 
socialization;  and 

3.  that,  consequently,  human  nature  and  behavior 
are  essentially  malleable. 

The  launching  of  the  “biology  and  politics” 
movement  in  the  mid-1960s  by  a  handful  of  politi¬ 
cal  scientists  was  the  result  of  two  intellectual  con¬ 
victions  they  held  in  common.  One  was  an  acute 
dissatisfaction  with  the  state  of  the  discipline  in 
general  and  with  the  hegemonic  SSSM  in  particular; 
the  other  was  a  growing  appreciation  of  the  signifi¬ 
cance  for  political  science  of  recent  advances  in  the 
biological  sciences — especially  in  neo-Darwinian 
evolutionary  theory,  genetics,  psychopharmacology 
(the  demonstrated  ability  of  some  drugs  to  change 
behavior),  primatology,  and  ethology.  The  first 
conviction  led  to  a  profound  difference  with  their 
mainstream  colleagues  in  their  understanding  of  the 
factors  that  shape  human  nature  and  human  politi¬ 
cal  behavior.  The  second  produced  equally  sharp 
differences  with  regard  to  the  proper  scope  and 
method  of  political  science  research  and  the  third  in 
their  respective  understanding  of  what  constitutes 
sound  public  policy.  These  three,  then,  constitute 
the  major  points  of  disagreement  between  propo¬ 
nents  of  what  is  commonly  called  biopolitics  and 
the  loyal  partisans  of  the  SSSM. 

The  Wellsprings  of  Political  Behavior 

As  noted  above,  the  SSSM  tenet  that  culture  alone 
shapes  behavior  is  flatly  rejected  by  those  who 


believe  in  a  “more  biologically  oriented”  political 
science.  Yes,  culture  is  important,  they  agree,  but 
so  are  the  genetically  transmitted  behavioral  ten¬ 
dencies  that  our  species,  as  social  primates,  has 
evolved  over  literally  millions  of  years.  To  ignore 
or  even  to  slight  these  is  to  turn  a  blind  eye  to 
forces  that  often  powerfully  influence  how  we  act 
socially  and  politically.  If  the  discipline  ever  hopes 
to  understand  and  explain  political  behavior,  they 
insist,  both  nature  and  nurture  must  be  given  seri¬ 
ous  attention.  That  calls,  in  turn,  for  some  major 
changes  in  both  the  scope  of  what  we  study  and 
the  methods  employed  in  our  research. 

Research 

Expanding  the  Scope 

The  Importance  of  Genetic  Influences 

Mainstream  political  science,  to  be  sure,  rejects 
this  idea  entirely.  From  an  evolutionary  stance, 
like  it  or  not,  there  are  a  number  of  genetic  predis¬ 
positions  that  we  share  with  other  social  primate 
species.  Among  the  politically  most  important  of 
these  are  a  proclivity  for  hierarchical  social  and 
political  structures  characterized  by  dominance 
and  submission,  marked  differences  of  status 
resulting  in  unequal  access  to  the  good  things  of 
life,  aggressive  behavior  (especially  among  males), 
xenophobia,  and  nepotistic  favoritism. 

Relevance  of  Primatological  Data 

According  to  the  SSSM,  since  behavior  is  shaped 
entirely  by  culture,  knowledge  of  other  species, 
no  matter  how  akin  to  ours,  serves  no  useful  pur¬ 
pose.  From  a  biopolitical  viewpoint,  however,  the 
study  of  closely  related  species — and  especially 
the  social  primates — can  yield  valuable  clues  as  to 
the  possible  sources  of  our  own  behavior.  This 
being  the  case,  careful  attention  should  be  given  to 
the  other  apes  and,  especially,  to  the  species  with 
which  we  share  some  98%  of  our  genetic  material, 
the  chimpanzees. 

Better  Methods 

The  study  of  human  behavior  from  a  neo- 
Darwinian  perspective  emerged  from  ethology,  a 
biological  discipline  guided  by  the  dictum  that  if 
we  hope  to  understand  how  and  why  an  organism 
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acts  as  it  does  we  must  study  its  actual  behavior  in 
its  natural  setting.  This  means  that  while  there  are 
many  similarities  in  research  methodology  between 
the  two  approaches  there  are  also  some  key  differ¬ 
ences.  We  will  mention  only  two  here: 

Survey  Research 

The  strengths  and  shortcomings  of  survey 
research  as  a  means  of  ascertaining  attitudes,  pos¬ 
sible  future  behavior,  or  reporting  past  actions 
have  already  generated  an  abundant  literature,  so 
we  will  be  quite  brief.  As  John  Wahlke  (1979) 
argued  in  his  American  Political  Science  Association 
(APSA)  presidential  address,  survey  data,  which 
are  necessarily  based  on  self-report  and  with 
responses  often  possibly  colored  by  the  framing  of 
the  questions,  are  too  often  unreliable  guides  to  the 
respondents’  actual  political  behavior,  political 
beliefs,  or  the  intensity  of  those  beliefs.  Accordingly, 
biopolitics,  with  its  ethologically  derived  focus  on 
actual  behavior,  sees  surveys,  by  and  large,  as  a 
research  instrument  of  reluctant  resort. 

Experiments 

Mainstream  political  science  and  biopolitics 
alike  view  experiments  as  a  valid  means  of  inquiry. 
The  latter  insists,  however,  that  the  experiments 
mirror,  to  the  greatest  degree  possible,  the  chal¬ 
lenges,  problems,  and  environments  that  its 
subject(s)  would  encounter  under  real-life  condi¬ 
tions.  Accordingly,  its  practitioners  are  skeptical 
about  the  validity  of  experiments  where  subjects 
(often  college  students,  hardly  representative  of  the 
general  population)  are  placed  in  a  patently  artifi¬ 
cial  setting  and  asked  to  perform  tasks  that  lack 
meaningful  consequences. 

The  Formulation  of  Public  Policy 

Not  surprisingly,  the  disagreements  on  the  well- 
springs  of  human  behavior,  on  the  need  to  study 
closely  related  species,  and  on  key  aspects  of 
research  methodology  result  in  markedly  dis¬ 
similar  conceptions  of  how  to  best  shape  public 
policy.  Two  of  the  most  important  of  these  are 
the  view  that  past  behavior  can  guide  future 
behavior  and  that  it  is  able  to  be  converted  into 
policies. 


Past  Behavior  as  a  Useful 
Clue  to  Future  Action 

Evolutionary  theory  assumes  that,  barring 
mutation  and/or  some  profound  environmental 
change,  the  behaviors  that  a  species  has  evolved 
and  exhibited  in  the  past  are  likely  to  remain  essen¬ 
tially  the  same.  Thus,  as  those  in  biopolitics  see  the 
world,  when  efforts  to  alter  or  even  prohibit 
“undesirable”  social  and  political  behaviors  have 
consistently  failed  in  the  past,  similar  policies  are 
unlikely  to  be  any  more  successful  in  the  future.  In 
short,  the  leopard  does  not  readily  change  its 
spots.  On  the  contrary,  by  believing  in  the  essential 
malleability  of  human  nature,  the  SSSM  is  much 
more  prone  to  discount  past  failures.  Maybe  by 
better  manipulating  its  environment,  we  can 
induce,  or  even  better,  compel,  the  leopard  to 
change  its  coloration. 

Trade-Offs  Versus  “Solutions” 

Evolutionary  change  almost  always  entails  a 
trade-off — that  is,  a  price.  A  massive  protective 
shell  or  carapace  comes  at  the  cost  of  lessened 
mobility  and  greater  size  requires  greater  caloric 
demands;  as  we  have  seen  with  our  own  species, 
larger  brains  carry  the  cost  of  more  painful  and 
more  hazardous  childbirth.  Given  their  neo- 
Darwinian  set  of  mind,  those  in  biopolitics  tend  to 
approach  public  policy  problems  less  in  terms  of 
far-reaching  “solutions”  and  more  in  terms  of 
what  is  to  be  gained — and  what  might  possibly  be 
the  negative  political  or  social  consequences.  These 
are  matters  almost  totally  ignored,  to  take  some 
obvious  instances,  by  the  ill-fated  Eighteenth 
Amendment,  the  futile  “War  on  Drugs,”  and  the 
predictably  futile  efforts,  literally  over  the  centu¬ 
ries,  to  prohibit  prostitution. 

There  could  be  no  more  convincing  an  example 
of  the  “big-solution”  approach  than  the  decision 
to  forcibly  export  “democracy”  to  Iraq.  Given 
Homo  sapiens’  innate  inclination  to  hierarchical 
and  authoritarian  social  and  political  structures, 
an  inclination  evidenced  both  by  the  rarity  of 
democracies  historically  and  their  relative  scarcity 
today  in  a  so-called  Age  of  Democracy,  this  mode 
of  governance  requires  a  complex  of  “special 
enabling  conditions”  for  its  emergence  and  sur¬ 
vival.  But  even  to  the  most  untutored  eye,  very, 
very  few  of  these  essential,  enabling  conditions 
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were — or  even  now  are — present  in  that  unfortu¬ 
nate  nation. 

Slow  Start,  Brightening  Prospects:  Long-Term 
Implications  for  Political  Science 

The  attempt  of  the  biology  and  politics  movement 
to  reshape  the  discipline  quickly  received  formal 
recognition  from  the  International  Political  Science 
Association  in  1973  and,  a  bit  less  readily,  from 
the  American  Political  Science  Association  in 
1981.  The  task  of  achieving  a  distinct  “field”  iden¬ 
tity  was  basically  completed  with  the  establish¬ 
ment  of  a  professional  organization,  the  Association 
for  Politics  and  the  Life  Sciences  in  1980;  the 
establishment  of  a  peer-reviewed  journal,  Politics 
and  the  Life  Sciences ,  in  1982;  and  the  publication, 
several  years  later,  of  a  90-page  Master  Bibliography 
listing  more  than  1,000  “biopolitical”  papers, 
articles,  and  books. 

Nonetheless,  their  determined  evangelical  efforts 
notwithstanding,  until  quite  recently  the  propo¬ 
nents  of  a  “more  biologically  oriented”  political 
science  constituted  only  a  minuscule  fraction  of  the 
discipline,  with  the  Association  for  Politics  and  the 
Life  Sciences  having,  at  most,  only  300  or  400 
members  in  contrast  with  APSA’s  some  101,000 
members.  Understandably,  the  great  majority  of 
the  profession  was  unwilling  to  consider,  let  alone 
embrace,  a  conceptual  framework  at  loggerheads 
with  the  orthodoxy  in  which  they  had  been  trained 
and  had  labored.  Although  a  biopolitical  course  or 
two  eventually  made  its  way  into  the  graduate  cur¬ 
riculum  at  a  dozen  or  so  schools,  only  one  depart¬ 
ment  (at  Northern  Illinois  University)  initiated  a 
formal  doctoral  program  in  the  field;  very  few  of 
the  professional  journals  published  biopolitical 
articles;  with  rare  exceptions,  biopolitical  books 
somehow  escaped  the  attention  of  the  journals’ 
book  review  editors;  and  the  one  or  two  biopoliti¬ 
cal  panels  at  APSA’s  annual  meeting  were  fortu¬ 
nate  to  draw  more  than  a  dozen  or  so  attendees. 

Over  the  past  3  or  4  years,  however,  there  has 
been  a  definite  sea  change.  The  professional  journals 
have  become  much  more  receptive  to  biopolitical 
topics;  both  the  number  of  biopolitical  panels  and 
panel  attendance  at  the  APSA  meetings  have  notably 
increased;  and  a  biopolitically  oriented  Conference 
on  Experimental  Political  Science  held  its  third 
annual  meeting  in  2010. 


There  are  several  reasons  for  this  development. 
The  introduction  of  functional  magnetic  reso¬ 
nance  imaging  (MRI)  technology  has  opened  a 
new  way  to  study  how  the  brain  reacts  when 
experimental  subjects  are  presented  with  political 
questions  or  stimuli — and  this  research  has 
received  a  surprising,  if  regrettably  uncritical,  cov¬ 
erage  in  the  popular  media  (some  relevant  works 
on  the  neuropolitical  perspective  can  be  found  in 
Rose  McDermott,  2007,  and  Darren  Schreiber, 
2007).  Another  handful  of  biopolitical  papers 
have  reignited  interest  in  the  debate  about  the 
respective  roles  of  nature  and  nurture  in  shaping 
political  attitudes  (in  this  instance,  of  identical 
twins) — and  these,  too,  have  received  flattering 
media  attention. 

In  addition,  other  influences  have  also  been 
operative.  Over  the  past  couple  of  decades,  biolo¬ 
gists  have  done  a  superb  job  of  popularizing  evo¬ 
lutionary  theory  and  the  understanding  of,  and 
interest  in,  the  manner  in  which  genes  and  culture 
interact  to  shape  our  behavior.  At  the  same  time, 
an  evolutionary  approach  has  made  considerable 
headway  in  several  of  the  other  behavioral  sci¬ 
ences — anthropology,  sociology,  psychology,  law, 
and  what  is  now  called  “behavioral”  economics. 
Political  science  has  a  well-established  history  of 
borrowing  concepts  and  methods  from  our  sister 
disciplines,  and,  among  these,  psychology  and  eco¬ 
nomics  have  long  been  the  most  prestigious  and 
influential.  And  we  should  surely  mention  one 
more  possible  influence.  According  to  Thomas 
Kuhn  (1970),  new  paradigms  often  triumph  not 
simply  because  of  their  superior  scientific  merit  but 
also  because  of  the  implacable  toll  taken  by  rigor 
mortis  on  the  avatars  of  the  old  order. 

Whatever  the  relative  weight  of  these  several 
factors,  their  combined  impact  will  almost  surely 
profoundly  modify  our  discipline’s  understanding 
of  the  forces  shaping  political  behavior,  the  man¬ 
ner  in  which  it  conducts  its  research,  and  the  intel¬ 
lectual  framework  within  which  it  formulates  its 
public  policy  recommendations.  In  the  future, 
advocates  of  these  views  may  come  to  represent  a 
sizable  plurality  of  the  profession  rather  than  the 
current  small  minority. 

Albert  Somit 
Southern  Illinois  University 
Carbondale,  Illinois,  United  States 
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Bipolarity  and  Multipolarity 


Bipolarity  and  multipolarity  are  theoretical  con¬ 
cepts  that  refer  to  the  distribution  of  relative  power 
capabilities  among  the  major  powers  in  the  global 
and/or  regional  international  system.  Bipolarity 
exists  when  there  are  two  dominant  powers — 
commonly  referred  to  as  “superpowers” — in  the 
system  whose  power  capabilities  are  considerably 
greater  than  those  of  other  major  powers.  The 
prime  example  of  bipolarity  is  the  U.S.-Soviet 
competition  during  the  Cold  War.  Multipolarity 
exists  when  there  are  three  or  more  great  powers 


in  the  system.  Historically,  multipolarity  has  been 
the  dominant  power  configuration  in  the  modern 
European  states’  system. 

Polarity:  Theoretical  Approaches 

The  significance  of  the  distribution  of  relative 
power  capabilities  in  the  international  system  is 
widely  contested.  For  realists,  it  is  the  single  most 
important  determinant  of  international  politics. 
Liberals  also  accept  that  the  distribution  of  power 
is  an  important  factor  shaping  international  poli¬ 
tics,  although  they  attach  greater  significance  to 
domestic-level  factors  (such  as  regime  type)  or  the 
existence  of  international  regimes  and  institutions 
as  determinants  of  international  outcomes. 
Similarly,  the  English  School  has  traditionally  rec¬ 
ognized  the  importance  of  the  balance  of  power 
and  the  management  role  of  great  powers,  even 
though  this  approach  stresses  the  importance  of 
“international  society”  as  a  factor  that  ameliorates 
the  operation  of  power  politics. 

Critical  theorists  and  postpositivists,  on  the  other 
hand,  question  the  significance  of  polarity  as  a  theo¬ 
retical  tool  for  understanding  international  politics. 
These,  more  radical  approaches  challenge  the  very 
concept  of  “power”  as  used  by  realists  and  other 
rationalist  theories;  power,  they  argue,  cannot  be 
reduced  to  resources,  geography,  and  other  material 
factors.  Like  constructivists,  they  tend  to  emphasize 
the  significance  of  normative  and  ideational  factors 
and  stress  the  role  of  nonstate  actors.  Consequently, 
they  argue  that  concepts  of  bipolarity  and  multipo¬ 
larity  are  irredeemably  freighted  with  realist  assump¬ 
tions  and  that  focusing  on  polarity  obscures  the 
“real”  operation  of  power  in  the  international  sys¬ 
tem  that  serves  the  interests  of  privileged  elites. 

For  realists,  however,  the  distribution  of  power 
in  the  international  system  is  the  key  factor  shap¬ 
ing  any  regional  or  global  conj  uncture.  International 
political  systems  are  inherently  unequal,  diverse, 
and  pluralistic:  They  consist  of  a  variety  of  differ¬ 
ent  actors  with  wildly  differing  power  capabilities, 
with  a  relatively  small  number  of  more  powerful 
states.  Realists  argue  that  this  handful  of  powerful 
states — generally  known  as  great  powers — exert  a 
disproportionate  influence  on  international  affairs 
and  that  relations  between  them  provide  the  key 
determinants  of  the  structure  and  dynamics  of  the 
international  system. 
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Classical  Realism 

The  classical  realist  tradition  of  Hans 
Morgenthau,  E.  H.  Carr,  and  Reinhold  Niebuhr, 
as  well  as  English  School  theorists  such  as  Hedley 
Bull  and  Martin  Wight,  were  centrally  concerned 
with  the  balance  of  power.  They  explored  the  rela¬ 
tions  between  the  great  powers  from  a  perspective 
informed  by  the  study  of  history  and  philosophy, 
and  their  analyses  of  multipolarity  and  bipolarity 
drew  heavily  on  the  experience  of  the  modern 
European  states’  system.  Much  of  their  work 
explored  the  implications  of  multipolarity  and 
bipolarity  for  diplomacy  and  statecraft.  Classical 
realists  tended  to  regard  multipolar  systems  as 
more  stable  than  bipolar  systems,  because  their 
greater  uncertainty  and  unpredictability  encour¬ 
aged  foreign  policy  decision  makers  toward  cau¬ 
tion  and  circumspection. 

Waltz’s  Theory  of  International  Politics 

With  the  publication  of  Kenneth  Waltz’s  Theory 
of  International  Politics  in  1979,  the  concepts  of 
multipolarity  and  bipolarity  acquired  a  new  theo¬ 
retical  context  and  analytical  significance.  Waltz 
sought  to  introduce  a  far-reaching  “Copernican 
revolution”  into  the  study  of  international  rela¬ 
tions  by  demonstrating  how  much  of  states’ 
actions  and  interactions,  and  how  much  of  the 
outcomes  their  actions  and  interactions  produced, 
could  be  explained  by  forces  operating  at  the  level 
of  the  system  rather  than  at  the  level  of  the  units. 
His  neorealism  (also  known  as  structural  realism) 
was  a  parsimonious  theory  that  focused  almost 
exclusively  on  the  distribution  of  relative  power 
capabilities  between  states,  specifically  the  great 
powers.  Waltz  argued  that  the  determining  feature 
of  any  international  system,  and  the  basis  for  any 
theoretical  analysis  of  international  politics,  must 
be  about  how  power  is  distributed  between  the 
major  units.  If  there  were  three  or  more  great  pow¬ 
ers,  the  system  was  characterized  by  multipolarity; 
if  there  were  only  two  great  powers — or  super¬ 
powers — the  system  was  bipolar.  This  distribution 
of  relative  power  capabilities  was  thus  crucial  in 
determining  international  political  outcomes  and 
shaping  the  structural  conditions  for  war  or  peace. 

Waltz’s  Theory  of  International  Politics  was  as 
controversial  as  it  was  influential.  In  contrast  to 
many  of  his  classical  realist  predecessors,  Waltz 


argued  that  multipolar  systems  were  more  unsta¬ 
ble  and  war  prone  than  bipolar  systems  because  in 
the  former,  the  great  powers  had  a  number  of 
potential  conflict  dyads  and  defection  from  an  alli¬ 
ance  could  have  a  destabilizing  impact  on  the  bal¬ 
ance  of  power.  In  multipolar  systems,  he  argued, 
dangers  were  diffused,  responsibilities  unclear,  and 
definitions  of  vital  interests  easily  obscured. 
Moreover,  in  multipolarity,  it  was  not  always  clear 
who  was  a  danger  to  whom,  which  complicated 
the  process  of  alliance  building  and  the  establish¬ 
ment  of  a  stable  balance  of  power.  In  bipolar  sys¬ 
tems,  in  contrast,  there  were  only  two  superpowers 
and,  consequently,  the  question  of  who  was  a 
danger  to  whom  was  never  in  doubt.  This,  he 
argued,  was  the  first  big  difference  between  bipo¬ 
larity  and  multipolarity.  Moreover,  power  balanc¬ 
ing  between  them  depended  primarily  on  “internal 
balancing”  rather  than  on  “external  balancing” 
(i.e.,  on  the  generation  and  more  effective  use  of 
their  own  capabilities  and  resources  rather  than  on 
alliance  building,  which  relies  on  the  capabilities  of 
others).  Thus,  the  defection  of  allies  was  of  less 
significance  than  in  multipolar  systems. 

Waltz,  however,  also  argued  that  the  “default” 
position  of  international  systems,  whether  bipolar 
or  multipolar,  was  the  balance  of  power.  This,  he 
argued,  constituted  the  key  insight  of  his  theory  of 
international  politics.  From  the  theory,  he  argued, 
one  could  predict  that  states  will  engage  in  balanc¬ 
ing  behavior,  whether  or  not  they  consciously  seek 
a  balance  of  power.  The  expectation,  he  argued,  is 
not  that  a  balance,  once  achieved,  will  be  main¬ 
tained  but  that  a  balance,  once  disrupted,  will  be 
restored  in  one  way  or  another.  Waltz’s  argument 
that  international  political  systems  tend  toward  bal¬ 
ance,  and  that  states  were  primarily  concerned  with 
maximizing  their  security  rather  than  their  power, 
gave  his  brand  of  structural  realism  a  relatively 
benign  and  status  quo  flavor,  leading  some  of  his 
realist  critics  to  dub  his  theory  “defensive  realism.” 

Neorealist  Critiques  of  Waltz 

First  and  foremost  among  Waltz’s  Neorealist 
critics  was  John  Mearsheimer.  He  argued  that 
precisely  because  great  powers  were  security  maxi¬ 
mizers,  they  were  also  power  maximizers.  In  a 
competitive,  self-help  system,  Mearsheimer  argued, 
the  most  effective  way  for  great  powers  to  ensure 
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their  security  was  by  becoming  the  most  powerful 
state  in  their  neighborhood  and  establishing  their 
regional  hegemony.  Hence,  great  powers,  whether 
they  liked  it  or  not,  were  faced  by  a  systemic  imper¬ 
ative  to  maximize  their  power.  This  emphasis  on 
power  maximization  is  at  the  heart  of  Mearsheim- 
er’s  theory  of  “offensive  realism,”  which  presents  a 
much  more  conflict-prone  and  pessimistic  view  of 
the  “tragedy”  of  great  power  politics. 

Mearsheimer  also  introduced  an  important  dis¬ 
tinction  between  different  forms  of  multipolarity. 
He  argued  that  where  power  capabilities  are 
roughly  equivalent  between  the  great  powers,  a 
situation  of  “balanced  multipolarity”  existed  in 
which  no  one  great  power  could  make  a  bid  for 
regional  hegemony.  In  this  context,  great  power 
cooperation  was  possible,  as  occurred  in  the 
“Concert  of  Europe”  following  the  end  of  the 
Napoleonic  Wars.  However,  in  the  context  of  a 
multipolar  system  in  which  one  great  power  was 
significantly  stronger  than  the  others,  “unbalanced 
multipolarity”  existed.  Unbalanced  multipolar  sys¬ 
tems  were  inherently  war  prone,  because  the  stron¬ 
gest  great  power  would  face  a  systemic  imperative 
to  maximize  its  power  by  making  a  bid  for  regional 
hegemony.  For  Mearsheimer,  his  theory  of  “offen¬ 
sive  realism”  provided  an  explanation  for  the 
major  great  power  conflicts,  such  as  the  Napoleonic 
Wars  and  World  Wars  I  and  II. 

The  Contemporary  Debate 

With  the  end  of  Cold  War  bipolarity,  a  wide- 
ranging  debate  has  taken  place  on  the  nature  and 
significance  of  the  distribution  of  relative  power 
capabilities  in  the  international  system.  One  thread 
of  this  debate  focuses  on  the  concept  of  “unipolar¬ 
ity.”  Some  realists  have  argued  that  the  United 
States,  as  the  sole  remaining  superpower,  is  so  over¬ 
whelmingly  powerful  compared  with  other  great 
powers  that  the  system  is  best  described  as  “unipo¬ 
lar.  ”  Among  American  realists,  much  of  the  debate 
has  focused  on  whether  this  “unipolar”  world 
order  will  be  relatively  short  lived  or  long  lasting 
(i.e.,  whether  it  constitutes  a  unipolar  “moment”  or 
a  unipolar  “era”).  Waltz,  on  the  other  hand,  has 
argued  that  after  a  relatively  brief  interregnum  in 
which  former  great  powers  “relearn”  their  great 
power  roles  and  new  great  powers  emerge,  the  sys¬ 
tem  will  return  to  a  multipolar  dynamic.  In  this 


context,  the  emergence  of  the  “BRIC”  countries 
(Brazil,  Russia,  India,  and  China)  has  been  taken 
by  some  as  heralding  the  emergence  of  a  new  mul¬ 
tipolar  international  order. 

Against  these  debates  among  realists  on  the 
nature  of  contemporary  polarity,  more  radical 
approaches  have  taken  issue  with  the  realist  under¬ 
standing  of  the  nature  of  power  and  power  politics. 
Power  is  an  essentially  contested  concept  in  politi¬ 
cal  science,  and  some  have  argued  that  power  can¬ 
not  be  understood  in  terms  of  material  capabilities. 
Rather,  they  argue  that  nonmaterial  factors  such  as 
reputation,  trust,  and  the  attractiveness  of  states’ 
economic,  political,  social,  and  cultural  attributes 
constitute  a  new  form  of  power,  which  they 
describe  as  “soft  power.”  Some  critical  theorists 
even  speak  of  “normative  power”  (i.e.,  the  power 
of  norms  and  values),  which,  they  suggest,  can  have 
a  system-transforming  impact  over  the  long  term. 

These  more  radical  understandings  of  power  are 
linked  to  arguments  about  the  changed  nature  of 
international  politics.  Some  argue  that  economic 
interdependence  and  the  emergence  of  a  globally 
integrated  international  order  have  weakened  the 
significance  of  the  distribution  of  hard  power  capa¬ 
bilities  and  therefore  made  long-established  notions 
of  multipolarity  and  bipolarity  largely  redundant. 
Globalization,  some  now  argue,  has  blurred  the 
distinction  between  domestic  and  international 
politics  and  weakened  the  significance  of  sover¬ 
eignty.  In  this  postmodern  system,  borders  are 
increasingly  porous,  power  is  diffused,  and  security 
can  be  achieved  through  governance  and  interde¬ 
pendence  rather  than  through  the  balance  of  power. 
Concepts  of  bipolarity  and  multipolarity  thus  con¬ 
tinued  to  be  employed  in  international  relations, 
although  their  significance  is  deeply  contested. 
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Boolean  Algebra 


Boolean  algebra  is  the  subfield  of  mathematical 
logic  devoted  to  operations  and  functions  of  logi¬ 
cal  variables.  After  a  brief  historical  introduction, 
a  broad  idea  of  its  main  concepts,  ideas,  rules,  and 
applications  is  given  below. 

Origins 

In  his  work  about  the  laws  of  thought,  the  English 
mathematician  George  Boole  (1815-1864)  showed 
that  it  is  possible  to  represent  logical  thought  pro¬ 
cesses  using  a  system  of  binary  variables:  0-1, 
TRUE-FALSE,  and  YES-NO. 

As  early  as  3000  BCE,  the  Chinese  emperor  Fu 
Hi  used,  within  a  magical  symbol,  a  set  of  eight 
trigrams,  in  which  three  binary  variables  yield  a 
written  representation  of  integers  from  0  to  7 
(Figure  1).  Gottfried  W.  Leibniz  (1646-1716)  took 
inspiration  from  it  when  he  laid  down  the  basic 
rules  of  binary  arithmetic  ( Explanation  of  Binary 
Arithmetic ,  1703).  He  realized  that  he  had  found  a 
way  to  translate  logic  into  calculations  and  laid 
plans  for  the  first  four-operation  computing  engine. 
However,  it  needed  Boole’s  work  for  the  idea  to 
mature,  giving  birth  to  a  genuine  theory — Boole’s 
algebra — with  its  calculation  rules,  its  properties, 
and  its  theorems. 

Sets,  (Classical)  Logic,  and  Boolean  Algebra 

A  specificity  of  this  new  theory  is  that  it  may  be 
explained  under  three  completely  equivalent 
forms,  notwithstanding  their  apparent  differences, 
using  set  theory,  elementary  logic,  or  Boolean 
algebra.  In  a  way,  we  have  three  different  descrip¬ 
tions  of  the  same  theory,  in  three  different 


languages,  but  allowing  perfect  translation 
between  them. 

Sets 

“Naive  set  theory”  uses  a  set  of  well-defined 
elements — its  universe — and  considers  a  series  of 
relations  and  operations  between  those  elements 
and  subsets  of  this  universe.  For  example,  if  the 
universe  is  the  set  of  all  national  flags  at  some  spe¬ 
cific  moment  in  time,  one  can  easily  imagine  all 
kinds  of  subsets:  flags  using  red,  black,  or  both; 
those  featuring  a  cross,  a  circle,  bars,  stripes;  and 
so  forth.  Using  red  is  a  binary  variable,  with  value 
YES  (or  1)  or  NO  (or  0)  for  each  flag. 

Although  not  absolutely  needed  for  conceptual 
development,  a  visual  representation  using  Venn 
diagrams  can  show  this  relationship  (see  Figure  2). 
This  figure  shows  the  flag  Universe  and  three  of  its 
subsets,  namely  flags  using  red  (R),  those  using 
black  (B),  and  those  using  yellow  (Y).  It  immedi¬ 
ately  can  be  seen  in  the  diagram  that  each  subset 
automatically  determines  its  complement  (flags 
without  black,  without  red,  or  without  yellow). 
The  Italian  flag  (/),  for  example,  belongs  to  subset 
R  but  to  neither  B  nor  Y;  this  can  be  noted  as  i  G 
R,  i  B,  and  i  Y. 

It  is  also  possible  to  consider  operations  on  sub¬ 
sets.  The  intersection  (0)  of  R  and  B  is  the  subset 
of  all  flags  using  both  red  AND  black;  the  union 
( U )  of  R  and  B  is  the  subset  of  all  flags  using  either 
red  OR  black — or  both;  this  is  the  inclusive  variant 
of  OR.  One  can  see  that  the  French  flag  (/)  satisfies 
feRUB  and  fff  ROB.  Finally,  some  subset  may 
be  totally  included  (C)  within  another:  The  set  of 
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Figure  2  Venn  Diagram  With  Three  Subsets  of  Flags 


all  flags  using  both  red  and  black  is  included  in  the 
set  of  all  flags  that  merely  use  red.  This  is  noted  as 

R  n  B  C  R. 

Logic 

Classical  logic  is  able  to  translate  any  statement 
of  the  elementary  set  theory;  for  example,  R  fl  B 
C  R  translates  into  an  implication  (=>):  If  a  flag 
uses  both  red  and  black,  then  it  uses  red.  More 
generally,  any  statement  of  the  form  A  C  B 
becomes  A  =>  B  (if  A  is  true,  then  so  is  B).  Three 
logical  operators  are  defined  at  the  beginning: 
logical  conjunction,  “AND”;  logical  disjunction, 
“OR”;  and  negation  (or  denial),  “NOT.” 

Boolean  Algebra 

Boolean  algebra  is  based  on  the  following  con¬ 
ventions:  Each  variable  is  binary;  it  has  only  two 
possible  values;  and  those  will  be  noted  0  (corre¬ 
sponding  to  the  empty  set  and  logical  FALSE)  and 
1  (corresponding  to  the  universe  and  logical  TRUE). 

Boole  then  defined  three  operations:  Boolean 
product  (•),  Boolean  sum  (  +  ),  and  complement. 
Boolean  product  corresponds  to  logical  AND  and 
a  set-theoretical  intersection;  Boolean  sum  corre¬ 
sponds  to  OR  and  set-theoretical  union;  and 
complement  corresponds  to  set-theoretical  com¬ 
plement  and  logical  NOT. 

To  determine  how  operators  make  this  algebra 
work,  the  rules  of  the  game — that  is,  the  axioms — 
are  needed.  They  are  the  following: 

Law  of  excluded  middle:  A  +  A  —  1 

Noncontradiction  principle:  A  ■  A  —  0 


Boundedness  conditions:  A-1=A;A  +  0  =  A 

Twofold  commutativity:  A  +  B  =  B  +  A;  A  ■  B  =  B  ■  A 

Twofold  associativity:  (A  +  B)  +  C  =  A  +  (B  +  C); 
(A- B)  ■  C  =  A- (B  ■  C) 

Reciprocal  distributivity  of  “  +  ”  and  (A  +  B)  ■  C  = 
(A  -  C)  +  (B  •  C);  (A  •  B)  +  C  =  (A  +  C)  •  (B  +  C);  this 
last  identity  is  less  intuitive  than  others,  since  it  does  not 
hold  in  ordinary  (real-number)  arithmetic. 

From  there,  several  properties  may  be  deduced, 
among  which  are  the  following: 


0=1  and  1  =  0 

Idempotence:  A  ■  A  =  A;  A  +  A  =  A 
Absorbing:  A  +  1  =  1  and  A  •  0  =  0 

Successive  applications  of  these  rules  make  possible 
the  simplification  of  complex  Boolean  expressions, 
in  particular  by  using  the  following  results: 

Absorption  theorems:  A  +  (A  ■  B)  =  A;  A  ■  (A  +  B)  =  A 

Redundancy  theorems:  (A  ■  B)  +  (A  ■  C)  =  (A  ■  B) 
(A  ■  C)(B  ■  C) 

De  Morgan’s  laws:  A  +  B  =  A  B  :  A  ■  B  =  A  +  B 


Boolean  Functions  and  Truth  Tables 

Any  Boolean  function — that  is,  any  combination 
of  Boolean  variables  using  Boolean  operators — 
may  be  analyzed  using  its  truth  table.  The  truth 
table  for  a  function  of  n  variables  is  always  made 
of  2 n  lines  and  n  +  1  columns;  each  line  corre¬ 
sponds  to  one  combination  of  values  of  the 
variables  (0  or  1).  For  example,  the  truth  table  for 
function  F(A,B)  —  A  ■  B  +  A  ■  B.  B  is  as  follows 
(Table  1): 


Table  1 

Truth  Table  for  F(A,  B) 

A 

B 

F(A,B) 

0 

0 

0 

0 

1 

1 

1 

0 

1 

1 

1 

0 
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All  truth  tables  of  two  variables  will  have  the 
same  first  two  columns  as  this  one;  the  third  col¬ 
umn  completely  describes  the  function.  This  last 
column  is  made  up  of  four  binary  values;  there¬ 
fore,  there  are  24  =  16  different  functions  of  two 
variables.  Using  set-theoretical  representation, 
function  F(A,B)  =  A  B  +  A  ■  B.  B  is  shown  in 
the  shaded  part  of  Figure  3. 

Using  two  variables,  16  different  shaded 
zones  may  be  defined,  made  up  of  0  to  4  ele¬ 
mentary  zones  (the  quadrants  of  the  diagram) 
(Table  2). 

This  number  increases  rapidly  with  the  number 
n  of  variables. 

This  exponential  increase  explains  the  need 
for  using  algorithms  when  analyzing  Boolean 
functions  of  more  than  three  variables  (see 
Figure  3). 

Each  compound  zone  of  a  Venn  diagram  also 
corresponds  to  a  Boolean  function.  However,  there 
is  more  than  one  formula  for  a  given  function. 

It  is  therefore  interesting  to  find,  among  those 
logically  equivalent  formulae,  which  one  is  the 


Table  2  Number  of  Zones 

Number  of  Elementary  Compound 

Variables  Zones  Zones 

1  2  4  =  22 

2  4  16  =  24 

3  8  =  23  256  =  2s 

2”  2(2n) 


simplest.  Simplicity  is  not  a  well-defined  notion, 
but  an  intuitively  reasonable  approximation  will 
be  to  choose  the  shortest  formula. 


Reducing  a  Function’s  Algebraic  Expression 

The  longest  formula  for  a  given  function  (its 
canonical  disjunctive  form)  is  very  easy  to  write 
down:  It  is  merely  made  up  of  a  union  of  elemen¬ 
tary  zones,  therefore  of  a  sum  of  products  (mono¬ 
mials,  minterms)  using  all  variables.  For  example, 
to  the  following  truth  table  (see  Table  3)  corre¬ 
sponds  the  long  formula  F(A,£>,  C)  —  A  -B  ■  C  + 
ABC  +  ABC  +  ABC. 


This  formula  lacks  brevity.  Using  a  succession  of 
Boolean  algebra  rules  allows  us  to  simplify  it  until 
we  get  the  shortest  possible  formula: 

F{A,B ,  C)  =  ABC+ABC 
+  A-B-C+A-BC 


F^AtB,  C)  =  1BC+  1BC 


F(A,B,  C)  =  B  •  C  +  (B  •  C) 

=  (B  +  B)  ■  C  =  1  ■  C  =  C. 


This  search  for  the  shortest  form  of  the  function 
is  the  object  of  the  simplification  algorithms  men¬ 
tioned  above.  Karmaugh’s  algorithm,  a  heuristic 
method  for  simplifying  that  can  be  worked  by 
hand,  with  acceptable  results,  for  n  <  4,  may  give 
a  good  idea  of  how  such  algorithms  work;  other 
algorithms  need  a  computer  to  perform,  such  as  the 


Table  3  Truth  Table  for  F(A,  B,  Q 


A 

B 

c 

F(A,B,C) 

0 

0 

0 

0 

0 

0 

1 

1 

0 

1 

0 

0 

0 

1 

1 

1 

1 

0 

0 

0 

1 

0 

1 

1 

1 

1 

0 

0 

1 

1 

1 

1 
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Quine-McCluskey  algorithm,  used  in  qualitative 

comparative  analysis  software. 

Gisele  De  Meur 
Universite  Libre  de  Bruxelles 
Brussels,  Belgium 

See  also  Comparative  Methods;  Configurational 

Comparative  Methods;  Qualitative  Comparative  Analysis 
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Bootstrap 


See  Robust  Statistics 


Breakdown  of  Political 
Systems 


The  term  breakdown  of  political  systems  con¬ 
notes  the  collapse  of  any  rule-based  institutional 
network  engaged  in  the  allocation  of  scarce 
resources.  Though  this  rubric  could  technically 
embrace  networks  ranging  from  empires  to 
smaller,  nonstate  entities  such  as  clans,  the  vast 
majority  of  the  literature  on  the  breakdown  of 
political  systems  focuses  on  the  collapse  of  either 
regimes  or  states.  The  disintegration  of  both 
regimes  and  states  has  been  a  constant  through¬ 
out  history,  and  thus,  the  literature  on  the  break¬ 
down  of  political  systems  is  vast.  This  overview 
merely  introduces  this  literature  through  a  discus¬ 
sion  of  its  basic  terms,  its  origins,  and  some  of  its 
principal  arguments.  The  discussion  is  confined  to 
four  types  of  political  system  breakdown:  the 
breakdown  of  dictatorship,  the  breakdown  of 


democracy,  and  the  breakdown  of  states,  leading 
either  to  new,  functioning  states  or  to  failed  states. 

The  Basic  Notions  and  Its  Origins 

Drawing  on  Robert  Fishman’s  distinctions,  we  can 
define  a  regime  as  a  country-based  network  of  for¬ 
mal  and  informal  organizations  at  the  center  of 
political  decision  making  that  determines  who  has 
access  to  political  power  and  how  those  who  are  in 
power  deal  with  those  who  are  not.  Democracies 
and  dictatorships  are  the  dominant  regime  types  in 
our  literature. 

A  state  is  the  larger  institutional  context  in 
which  a  regime  exercises  its  power.  A  state  is  a 
more  permanent  structure  of  domination  and  coor¬ 
dination  in  part  because,  unlike  a  regime,  a  func¬ 
tioning  state  always  maintains  a  coercive  apparatus 
and  both  the  means  to  administer  a  society  and  to 
extract  resources  from  it.  Though  there  are  many 
state  types  in  our  literature,  the  distinction  most 
relevant  to  a  discussion  of  system  breakdowns  is 
the  one  between  functioning  states  and  failed  states. 

The  current  literature  on  the  breakdown  of  mod¬ 
ern  political  systems  has  its  roots  in  ancient  Athens. 
Plato  attributed  political  system  change  (and  differ¬ 
ence)  to  “the  dispositions  of  men”  and  argued  that 
regime  types  were  forged  from  “the  human  natures” 
of  their  citizens.  Later  classic  commentaries  linked 
the  breakdown  of  oligarchy  to  societal  crises  related 
to  taxation,  to  the  successful  uprising  against  the 
Spartan  occupation  that  followed  conflict  between 
Athenian  elites,  and  to  structural  changes  related  to 
the  Persian  Wars.  Aristotle  linked  political  system 
longevity  to  levels  of  economic  inequality. 
Arguments  relating  the  breakdown  of  political  sys¬ 
tems  to  cultural  dispositions,  fiscal  crises,  elite  divi¬ 
sions,  war,  and  economic  inequality  remain  funda¬ 
mental  to  political  science  literature  today. 

The  Ancients’  attempts  to  derive  general, 
explanatory  arguments  from  specific,  contempo¬ 
rary  political  events  remain  fundamental  to  current 
literature  as  well.  This  is  one  of  the  reasons  why 
scholarship  on  the  breakdown  of  political  systems 
has  so  often  come  in  clusters,  following  waves  of 
dramatic  political  system  change.  Four  clusters  of 
system  breakdown  stand  out  as  catalysts. 

The  Main  Explanations 

The  first  set  of  debates  about  regime  collapse  in 
the  20th  century  focused  on  the  breakdown  of 
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democracy  in  interwar  Europe.  Proponents  of  psy¬ 
chosocial  arguments  asserted  that  the  cause  lay  in 
the  nexus  of  personality  and  politics:  Individuals 
who  were  profoundly  insecure  found  the  uncer¬ 
tainty  of  democracy  intolerable  and  sought  comfort 
in  authoritarian  leaders  and  authoritarian  regimes. 
Though  this  argument  was  rooted  in  social  psychol¬ 
ogy,  it  resonated  with  scholars  of  politics  as  well. 
Hannah  Arendt  attributed  the  breakdown  of  inter¬ 
war  democracy  to  the  “negative  solidarity”  engen¬ 
dered  by  a  mass  of  “furious  individuals”  united  by 
“apprehension.”  Seymour  Martin  Lipset  attributed 
“working-class  authoritarianism”  to  economic 
insecurity  and  authoritarian  family  patterns. 

While  some  analysts  concentrated  on  citizen 
attitudes  as  a  root  cause  of  democratic  breakdown, 
other  scholars  concentrated  on  classes  instead. 
Barrington  Moore  attributed  the  breakdown  of 
interwar  democracy  to  an  alliance  between  landed 
elites  and  sectors  of  the  bourgeoisie  who  desired  a 
level  of  labor  repression  that  democracy  would  not 
allow.  Gregory  Luebbert  attributed  the  break¬ 
down  to  alliances  between  the  family  peasantry 
and  the  urban  middle  classes,  arguing  that  their 
interactions  with  socialist  parties  were  decisive. 

The  focus  on  class  actors  was  eventually  chal¬ 
lenged  by  approaches  that  focused  on  political 
institutions  and  political  (rather  than  class)  elites. 
Juan  Linz  and  Alfred  Stepan  took  the  lead  in  fos¬ 
tering  this  approach.  Though  Linz  ultimately 
blamed  the  breakdown  of  democratic  political 
systems  on  “citizens  of  weak  commitment,”  he 
attributed  citizen  defections  to  a  broad  range  of 
nonstructural  factors,  including  ineffective  leader¬ 
ship  and  problematic  institutional  design.  In  so 
doing,  he  argued  that  system  breakdown  might 
have  been  avoided  with  better  judgment  and  better 
institutions,  including  single  parliamentary  execu¬ 
tives,  constructive  votes  of  no  confidence,  and  laws 
preventing  the  proliferation  of  small  parties. 

The  breakdown  of  democracy  occurring  during 
the  Cold  War  was  the  second  major  stimulus  for 
explanatory  theory  and  Linz  intended  his  argu¬ 
ment  to  apply  to  these  cases  as  well.  Gabriel 
Almond  and  Sidney  Verba  also  sought  to  offer  a 
general  theory  of  democratic  breakdown  but  pro¬ 
moted  arguments  related  to  political  culture 
instead.  They  proposed  that  democratic  stability 
could  only  be  ensured  where  a  “civic  culture” 
based  on  moderate  attitudes,  interpersonal  trust, 
and  muted  interest  in  political  participation  existed. 


In  a  similar  vein,  Harry  Eckstein  proposed  that 
political  systems  would  be  viable  only  if  their 
authority  patterns  were  congruent  with  authority 
patterns  in  other  social  organizations. 

Suspecting  that  the  roots  of  citizen  attitudes  and 
behaviors  lay  in  economic  structures,  Lipset  pro¬ 
posed  that  a  democratic  system  breakdown  would 
be  less  likely  as  economic  development  progressed, 
arguing  that  the  prospects  for  democratic  system 
stability  improved  as  wealth,  industrialization, 
urbanization,  and  education  expanded.  A  large 
middle  class  would  mean  decreasing  class  differ¬ 
ences,  more  moderation,  and  better  odds  for 
democratic  system  stability. 

Locusing  on  the  frailty  of  democratic  political 
systems  in  South  American  countries  with  com¬ 
paratively  large  middle  classes,  Guillermo  O’Donnell 
challenged  Lipset  and  argued  that  economic  devel¬ 
opment  would  not  guarantee  democratic  stability  if 
it  took  place  in  the  context  of  economic  depen¬ 
dency.  Democracy  could  not  be  maintained  in  the 
face  of  contradictory  pressures  from  transnational 
capital  on  the  one  hand  and  a  popular  sector 
empowered  by  democratic  freedoms  on  the  other. 

Debates  about  the  cause  of  democratic  system 
breakdown  were  still  unresolved  when  authoritar¬ 
ian  regimes  began  to  break  down  in  the  1970s.  The 
theories  stimulated  by  this  third  set  of  events  were 
equally  diverse.  Philippe  Schmitter  and  Guillermo 
O’Donnell  offered  one  of  the  first  attempts  to 
make  sense  of  the  change.  They  proposed  that  the 
breakdown  of  dictatorship  is  always  the  conse¬ 
quence  of  divisions  among  regime  elites  and  that 
“elite  dispositions,  calculations,  and  pacts”  deter¬ 
mine  whether  system  breakdown  is  possible. 
Beyond  this,  they  insisted  that  the  breakdown  of 
authoritarianism  constituted  an  example  of  “under¬ 
determined”  social  change  where  class,  sectoral,  or 
institutional  analysis  had  little  predictive  utility. 

As  cases  of  authoritarian  breakdown  began  to 
accumulate,  scholars  argued  that  the  causes  of 
regime  breakdown  varied  across  countries.  Samuel 
Huntington  drew  attention  to  three  scenarios  for 
breakdown:  one  in  which  regime  elites  played  a 
major  role,  another  in  which  the  breakdown  of 
dictatorship  resulted  from  joint  action  by  regimes 
and  oppositions,  and  a  third  in  which  oppositions 
took  the  lead.  This  last  scenario  dominated  in 
Africa  and  Eastern  Europe. 

Barbara  Geddes  pointed  out  that  different  sorts 
of  authoritarian  systems  broke  down  for  different 
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reasons  and  at  different  rates.  Through  quantitative 
work,  she  found  that  military  dictatorships  were  the 
most  likely  to  break  down,  that  personalist  regimes 
were  more  resilient,  and  that  single-party  regimes 
were  the  most  resilient  of  all.  Military  regimes  were 
susceptible  whenever  the  interests  of  the  military  as 
an  institution  were  at  risk.  Personalist  regimes  were 
susceptible  when  an  equilibrium  based  on  coercion 
and  spoils  was  disturbed  by  the  death  of  the  leader 
or  by  economic  crisis.  Single-party  regimes  were  the 
least  susceptible  to  breakdown  because  their  leaders 
typically  had  no  alternative  source  of  power  and 
thus  had  stronger  incentives  to  face  crises  and 
resolve  disputes  internally. 

The  idea  that  economic  crises  or  economic  fac¬ 
tors  of  some  sort  are  the  root  cause  of  the  break¬ 
down  of  dictatorship  is  widespread  but  still  con¬ 
tested.  Studying  the  period  between  1950  and 
1990,  Adam  Przeworski  and  Fernando  Limongi 
found  that  the  relationship  between  economic 
development  and  the  breakdown  of  dictatorship 
was  bell  shaped.  Though  the  likelihood  of  regime 
breakdown  rose  with  development  in  middle- 
income  dictatorships,  dictatorships  in  countries 
with  either  very  low  or  very  high  incomes  tended 
to  be  stable  despite  growth.  Studying  the  period 
from  1800  to  1990,  Carles  Boix  and  Susan  Stokes 
drew  a  different  conclusion.  They  found  that  eco¬ 
nomic  development  always  increased  the  probabil¬ 
ity  that  dictatorships  would  become  democracies, 
though  the  impact  of  growth  on  the  likelihood  of 
regime  change  decreased  at  higher  levels  of  devel¬ 
opment  and  after  World  War  II. 

Debates  about  the  roots  of  dictatorial  political 
system  breakdown  were  enriched  by  the  wave  of 
regime  changes  associated  with  the  collapse  of  the 
Soviet  Union  in  1989.  This  fourth  stimulus  for 
explanatory  theory  brought  arguments  with  an 
international  focus  to  the  fore.  Scholars  such  as 
Marc  Beissinger  showed  how  the  breakdown  of 
dictatorships  was  affected  by  the  cross-border  dif¬ 
fusion  of  models  for  mobilization.  Huntington 
argued  that  demonstration  effects  were  so  power¬ 
ful  that  they  worked  independent  of  economic  and 
social  conditions.  Jon  Pevehouse  illustrated  that 
the  breakdown  of  dictatorship  was  linked  to 
membership  in  international  organizations  domi¬ 
nated  by  democracies.  Dankwart  Rustow  con¬ 
cluded  that  the  most  powerful  impetus  for  the 
breakdown  of  dictatorship  was  the  global  trend  of 


“intensifying  communication  and  economic  inte¬ 
gration,”  which  spread  the  awareness  of  “demo¬ 
cratic  lifestyles”  and  delegitimated  the  myths  that 
sustained  dictatorship. 

In  addition  to  invigorating  work  on  the  break¬ 
down  of  regimes,  the  disintegration  of  the  Soviet 
Union  sparked  new  interest  in  theories  explaining 
the  breakdown  of  state  systems.  Scholars  explained 
why  new  states  emerged  from  old  states  with  a 
variety  of  arguments.  Linz  and  Stepan  attributed 
the  change  to  problems  of  “stateness”:  Sub¬ 
nationalist  groups  that  never  accepted  the  bound¬ 
aries  of  the  state  moved  to  change  them  as  soon  as 
coercion  declined.  Valerie  Bunce  offered  an  institu¬ 
tional  explanation,  highlighting  the  fact  that  all  the 
European  states  that  split  apart  after  1989  were 
federal.  Carol  Skalnik  Leff  made  the  point  that 
democratization  had  a  “state-shattering”  impact 
because  it  shifted  power  to  regions  and  thereby 
changed  the  incentives  of  regional  elites. 

The  system  breakdowns  after  1989  also  sparked 
new  literature  explaining  the  emergence  of  “failed” 
states  as  places  where  state  actors  prey  on  their 
citizenry,  enjoy  no  monopoly  of  force,  and  fail  to 
control  rampant  violence.  This  form  of  system 
breakdown  is  most  likely  in  poor  countries,  but 
Robert  Bates  points  out  that  its  roots  may  lie  in 
poor  state  resources  rather  than  in  poverty  per  se. 
Robert  Rotberg  argues  that  poverty  and  institu¬ 
tional  weakness  contribute  to  state  failure  but 
accords  the  greatest  weight  to  leadership  errors  and 
the  drive  for  personal  gain.  Jack  Goldstone,  Robert 
Bates,  Ted  Robert  Gurr,  and  Monty  Marshall  con¬ 
clude  that  state  failure  is  most  rooted  in  political 
factors  and  that  hybrid  regimes  that  are  neither 
democracies  nor  dictatorships  are  most  susceptible 
to  collapse.  We  cannot  know  which  of  the  many 
explanations  for  system  breakdown  given  here  will 
prove  most  convincing  in  the  future,  but  we  can  be 
almost  certain  that  political  systems  will  continue 
to  collapse. 

Nancy  Bermeo 
Nuffield  College,  University  of  Oxford 
Oxford,  United  Kingdom 
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Buddhism 


Buddhism  is  frequently  regarded  as  a  quietist  reli¬ 
gion,  but  while  there  may  be  some  justification  for 
this  perception,  its  political  presence  and  impact  in 
many  parts  of  Asia  should  not  be  underestimated. 
Ever  since  the  period  of  the  Indian  emperor  Asoka 
(269-232  BCE),  the  religion  has  shown  itself  to  be 
a  distinctive  social  and  political  force  in  various 
countries,  having  considerable  backing  among  the 
relevant  populations.  Naturally,  where  Buddhism 
is  a  small  minority,  as  in  modern  Western  coun¬ 
tries,  its  influence  is  small  and  the  quietist  image  is 
strong,  but  in  the  countries  of  Asia,  where 
Buddhism  is  the  majority  religion,  the  situation  is 
quite  different.  Here,  both  Buddhist  monks  and 
lay  leaders  have  been  prominent  from  time  to  time 
in  elitist  and  military  maneuverings  and  more 
recently  in  revolutionary  and  electoral  situations. 
Self-consciously  Buddhist  political  activism  is  not 
a  stranger  in  India,  Burma,  Thailand,  and  Vietnam. 
Indeed,  in  most  Asian  countries,  Buddhist  leaders 
have  at  various  times  contributed  to  both  support 
the  prevailing  establishment  and  revolt  against  it. 

Characteristics  of  Buddhism 

The  underlying  reason  for  the  complex  interactions 
between  Buddhism  and  political  forces  may  be 


found  in  two  characteristic  features  of  the  religion. 
First,  we  must  look  back  to  the  story  of  the  histori¬ 
cal  Buddha’s  decision  to  proclaim  his  teaching  in 
the  first  place,  which  is  a  constitutive  narrative  both 
for  his  own  biography  and  for  the  emergence  of 
Buddhism  as  an  organized  religious  system.  The 
central  story  is  that  Gautama  (in  the  5th  century 
BCE),  though  blessed  with  many  privileges  as  a 
young  man,  passed  through  a  personal  heart¬ 
searching  experience  about  human  suffering,  illness, 
old  age,  and  death.  Recognizing  the  transience  of 
existence,  he  perceived  the  cause  of  suffering  to  lie 
in  ignorance,  desire,  and  attachment  to  transient 
things,  while  its  resolution  lay  in  knowledge  and 
detachment.  The  understanding  of  all  these  rela¬ 
tionships  is  summed  up  as  his  “enlightenment”  or 
bodhi,  which  is  why  he  was  regarded  as  a  buddha, 
an  enlightened  one.  Had  he  preferred  a  quietist, 
personal  resolution  of  these  general  questions  of 
human  existence,  which  according  to  the  narrative 
he  considered  steadily  as  an  option,  the  social 
impact  would  most  likely  have  been  nil.  As  it  was, 
the  very  decision  to  proclaim  his  system  of  teaching 
(in  Sanskrit  dharma ,  in  Pali  dhamma)  led  not  only 
to  the  first  ordinations  of  monks  but  also  to  spon¬ 
sorship  by  prosperous  laity  and  royalty.  This 
meant  that  a  strong  and  active  symbiosis  with  a 
supportive  laity  grew  up  from  the  very  beginning, 
so  that  the  social  and  political  ramifications  began 
to  extend  themselves  in  a  continuous  process. 

Second,  the  concept  of  the  “wheel  turner”  (in 
Sanskrit  cakravartin,  in  Pali  cakkavattl )  is  signifi¬ 
cant.  On  one  hand,  it  is  part  of  the  mythologiza¬ 
tion  of  the  figure  of  the  Buddha  himself,  for  he  is 
regarded  as  having  achieved  the  right  to  “turn  the 
wheel  of  dharma.”  However,  the  same  concept  is 
applied  to  kingship,  for  a  “wheel-turning  mon¬ 
arch”  is  one  who  claims  the  power  of  extending 
his  rule  to  the  four  quarters  of  the  earth,  spreading 
stability  in  support  of  a  moral  society.  When  a 
suitable  individual  is  reborn  as  a  “great  person,” 
with  excellent  qualities  and  at  an  appropriately 
high-class  of  society,  he — presumed  to  be  male — 
has  the  option  of  turning  the  wheel  of  dharma  or 
of  turning  the  wheel  of  political  power.  Ideally, 
therefore,  a  powerful  and  righteous  king  will  be 
the  contemporary  and  the  patron  of  a  living 
Buddha.  This  dual,  tandem-like  concept  was  later 
integrated  under  the  concept  of  the  “wonderful 
union  of  king  and  Buddha.”  In  some  cases,  such  as 
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the  Tibetan  model,  this  means  that  the  incarnated 
spiritual  leader  or  Dalai  Lama  is  at  the  same  time 
the  political  head  of  the  population,  which  natu¬ 
rally  causes  a  collision  with  any  modern  state  such 
as  China,  with  its  secular  ideological  base.  Tensions 
over  this  model  may  be  seen  to  a  lesser  extent  in 
Mongolia,  Bhutan,  and  Nepal.  In  a  variation  of 
this  union,  certain  kings  or  princes  have  been  given 
the  title  of  “Dharma  King”  (in  Sanskrit  dharmaraja), 
which  has  usually  implied  not  only  an  elitist 
espousal  of  Buddhism,  but  also  that  they  have 
authority,  direct  or  indirect,  over  senior  appoint¬ 
ments  in  the  monastic  hierarchy,  as  frequently  seen 
in  premodern  Japan. 

Evolution  of  Buddhism  in 
Southeast  and  East  Asia 

The  expansion  of  Buddhism  into  the  countries  of 
Southeast  and  East  Asia  took  place  largely  with  the 
help  of  political  domination  or  influence,  with  the 
notable  exception  of  China.  As  a  result,  these 
underlying  concepts  of  the  Buddha  as  both  politi¬ 
cal  and  spiritual  head  have  played  a  major  role. 
Starting  with  a  response  by  the  Indian  Emperor 
Asoka  to  the  Sri  Lankan  king  of  the  time, 
Devanampiyatissa,  the  Theravada  monastic  order 
was  set  up  in  Sri  Lanka.  While  this  order  had  royal 
patronage,  the  tradition  of  lay  support  for  the 
order  spread  to  most  levels  of  society.  As  a  result, 
the  postcolonial  independence  movement  looked 
to  Buddhism,  still  predominating  in  a  religiously 
and  culturally  plural  society  centuries  later,  to  pro¬ 
vide  an  ideological  reference  point  that  would  be 
nationally  recognizable  as  Sinhalese.  An  important 
modern  Sri  Lankan  Buddhist  was  Dharmapala 
(1864-1933),  who,  assuming  the  title  of  Anagarika 
or  “homeless  one,”  did  much  to  further  a  modern 
revival  of  Buddhism  in  India  itself,  where  it  had 
largely  died  out.  This  revival  was  massively  fur¬ 
thered  by  the  conversion  of  Dr.  Babasaheb  R. 
Ambedkar  (1891-1956),  a  major  contributor  to 
the  constitution  of  independent  India,  who  led 
some  millions  of  previous  “untouchables”  into  a 
casteless  state  under  the  banner  of  a  modernist, 
rationalist,  and  ethically  oriented  Buddhism. 

Following  the  earlier  mission  to  Sri  Lanka,  a 
transmission  of  the  Dhamma  to  Burma,  Thailand, 
Vietnam,  and  Laos  followed  in  various  stages, 
with  the  result  that  Buddhist  traditions  played  a 


major  role  in  state  formation  in  all  of  these  coun¬ 
tries.  There  were  also  establishments  in  Sumatra 
and  Java  that  had  less  long-term  influence.  While 
kingship  models  derived  from  Hinduism  were  also 
influential,  with  even  Brahmans  functioning  at  the 
Thai  court,  it  was  the  Buddhist  monastic  order 
that  has  remained  a  pillar  of  political  influence  in 
mainland  Southeast  Asia.  Thus,  regardless  of  con¬ 
temporary  politicians,  the  two  fundamental  foci  of 
authority  in  Thailand  remain  to  this  day  the  mon¬ 
archy  and  the  monastic  order.  A  key  figure  in  the 
formation  of  Thai  national  consciousness  was 
King  Mongkut  (Rama  4th,  1824-1851)  who  man¬ 
aged  the  modernization  process  of  the  whole  coun¬ 
try  while  preserving  its  independence  from  Western 
colonial  powers.  Before  becoming  king,  he  was  a 
dedicated  monk  who  played  a  leading  role  in 
reforming  and  modernizing  the  Buddhist  religion 
in  Thailand.  The  process  of  modernization  was 
continued  by  his  successor  Chulalongkorn  (Rama 
5th)  and  later  monarchs.  In  Burma,  Buddhist  lay¬ 
man  U  Nu  (1907-1995)  played  a  major  role 
in  postcolonial  political  development  under  the 
slogan  of  “Buddhist  socialism,”  while  at  the  begin¬ 
ning  of  the  21st  century  a  number  of  monks  have 
been  attempting  to  recall  the  country  to  its  rightful 
path  as  they  see  it.  In  Vietnam,  the  wars  of  inde¬ 
pendence  saw  cases  of  Buddhist  monks  burning 
themselves  alive  in  the  1960s  as  a  symbolic  act, 
protesting  against  the  repression  of  the  population. 
This  engagement  in  the  social  and  political  process 
became  widely  known  through  writings  of  the 
monk  Thich  Nhat  Hanh,  such  as  The  Lotus  in  the 
Sea  of  Fire. 

If  China,  with  its  strong  Confucian  traditions, 
did  not  really  need  the  Buddhist  religion  to  provide 
a  leading  civil  ideology,  the  importation  of 
Buddhism  from  India  via  Central  Asia  nevertheless 
presented  a  strong  challenge  to  the  incipient  reli¬ 
gious  institutionalization  of  Daoism.  Buddhism 
reached  a  high  point  of  influence  during  the  Tang 
dynasty,  but  it  suffered  greatly  as  a  consequence  of 
Daoist-inspired  persecution  in  the  mid-9th  cen¬ 
tury.  If  the  rivalry  of  the  three  religions  had  fre¬ 
quently  led  to  strife  and  persecution,  now  of 
Daoism  and  now  of  Buddhism,  there  were  also 
attempts  to  coordinate  them.  The  classic  model  for 
this  was  the  idea  of  the  “three  teachings”  (san 
jiao),  which  was  most  influentially  set  forth  in  a 
treatise  ascribed  to  the  first  Ming  emperor  shortly 
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after  he  had  taken  political  control  of  the  country. 
The  compromise  that  he  regarded  as  essential  for 
political  stability  does  not  give  a  privileged  place  to 
Buddhism  among  the  “three  teachings,”  but  it 
crystallized  a  model  that  even  today  implicitly 
underlies  the  law  of  a  secular  Chinese  regime.  Just 
as  the  Ming  solution  juxtaposed  Confucianism  on 
the  one  hand  with  Buddhism  and  Daoism  on  the 
other,  so  also  now  the  state  ideology  of  commu¬ 
nism  is  the  common  reference  point  for  the  permit¬ 
ted  religions:  Buddhism,  Daoism,  Islam, 
Catholicism,  and  Protestant  Christianity. 
Confucianism  does  not  occur  here  as  “religion,” 
having  been  displaced  by  communist  ideology  as 
the  normative  backdrop,  but  Buddhism  holds  a 
clear  position.  The  older  “three  teachings”  concept 
implied  the  exclusion  and  suppression  of  the  so- 
called  heretical  sects  on  the  assumption  that  such 
minority  sects  lead  to  instability  and  are  probably 
a  cloak  for  agitation.  This  reasoning  is  therefore 
not  a  new  communist  invention,  and  observers 
need  to  consider  contemporary  cases  carefully,  for 
example,  when  new  movements  such  as  Falungong 
claim  to  be  “Buddhist”  and  at  the  same  time  call 
for  the  overthrow  of  the  government. 

Buddhism  was  transmitted  first  to  Korea  and 
from  there  to  Japan  from  the  6th  century  onward. 
If  the  dominant  ideology  in  Korea  up  to  the 
Japanese  imperialist  period  was  Confucianist,  the 
postcolonial  and  postwar  period  has  seen  a  resur¬ 
gence  of  pride  in  the  shamanistic  traditions  of  that 
country,  thus  giving  them  a  civil  religious  function. 
The  political  and  social  influence  of  Buddhism 
arises  less  from  the  traditional  orders  in  Korea — 
strongly  established  though  these  are — and  more 
from  the  newly  founded  lay  movements  such  as  the 
self-consciously  modern  Won  Buddhism.  Japanese 
history  has  seen  immensely  complicated  variations 
of  Buddhist  influence  via  the  imperial  court  on  one 
hand  and  the  power  centers  of  major  denomina¬ 
tions  on  the  other.  While  monks  of  Mount  Hiei  to 
the  northeast  of  Kyoto  occasionally  led  armed 
raids  into  this  former  capital,  the  millions  of  fol¬ 
lowers  of  the  Shin  Buddhist  faith  and  others  in  the 
Nichirenite  tradition  were  also  a  political  force  to 
be  reckoned  with  and  occasionally  suffered  dire 
consequences.  In  modern,  postwar  times,  the  influ¬ 
ence  of  the  Buddhism-inspired  Komeito  (“Public 
Brightness  Party”)  has  been  considerable  but 
seems  to  be  receding  in  the  21st  century. 


Increasingly,  lay  Japanese  Buddhist  leaders  have 
promoted  a  “peace”  agenda,  the  Nichiren-oriented 
Rissho  Kosei-kai  and  the  Soka  Gakkai  being  prom¬ 
inent  in  this  respect.  The  strongly  established  Shin 
Buddhist  denominations  have  led  the  way  in 
reflecting  on  Japanese  war  responsibility  and  in 
educating  their  followers  accordingly.  In  Soto  Zen 
Buddhist  circles,  there  has  also  been  much  heart 
searching  while  various  influential  Buddhist  voices 
support  the  campaign  to  preserve  Clause  9  of 
Japan’s  present  constitution,  prohibiting  war  as  a 
means  of  settling  international  disputes. 
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Budgeting,  Rational  Models 


Budgeting  refers  to  decisions  about  how  much 
funding  to  provide  for  the  programs,  staffs,  and 
infrastructures  of  government,  as  well  as  about  the 
processes  used  to  arrive  at  funding  levels.  In  exam¬ 
ining  processes,  budgeting  theories  explain  how 
individuals  who  hold  budget-writing  positions 
choose  funding  levels  for  programs.  A  second  class 
of  theories  explain  how  and  why  political  elites  try 
to  manipulate  budgets  to  foster  their  goals. 

Rational  theories  of  individuals  responsible  for 
choosing  how  much  to  fund  budget  lines  offer 
bounded  rationality  explanations  of  these  decisions. 
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Due  to  limits  on  human  cognition,  vast  numbers  of 
lines  in  the  budget,  and  complex  decision-making 
environments,  budgeters’  forecasts  about  how  well 
their  choices  will  facilitate  their  goals  are  imper¬ 
fect.  Despite  these  limitations,  budgeters  use  deci¬ 
sion-making  strategies  that  help  them  foster  their 
goals.  Such  strategies  are  boundedly  rational 
because,  although  the  choices  and  behavior  result¬ 
ing  from  such  strategies  are  not  optimal,  budget¬ 
ers’  decisions  still  help  them  realize  program  goals. 
In  this  way,  budgeters  “satisfice” — they  make 
decisions  that  are  “good  enough.” 

The  first  generation  of  boundedly  rational  bud¬ 
geting  theories  was  developed  in  the  1960s  by 
Aaron  Wildavsky,  whose  contributions  to  under¬ 
standing  budgeting  continue  to  exert  a  profound 
influence  on  budgeting  scholarship.  This  perspec¬ 
tive,  developed  with  respect  to  the  federal  govern¬ 
ment  in  the  United  States,  emphasizes  that  indi¬ 
viduals  responsible  for  contributing  to  the  budget 
must  take  an  incremental  approach  to  arrive  at 
funding  levels.  Budgeters  begin  by  examining  the 
“base”  for  each  budget  line,  which  refers  to  the 
amount  that  was  spent  on  the  line  in  the  previous 
year’s  budget.  From  this  base,  budgeters  make 
small  adjustments  based  on  new  information  and 
limited  by  the  availability  of  funds.  For  example,  if 
program  advocates  can  make  an  effective  argu¬ 
ment  about  why  additional  funds  are  needed,  bud¬ 
geters  may  increase  funding  levels  slightly  from  the 
previous  year.  Nevertheless,  even  if  budgets  change 
from  year  to  year,  budgeters  do  not  make  drastic 
changes.  Budgeters  follow  this  approach  out  of 
necessity.  Examining  the  budget  holistically  would 
involve  the  impossible  task  of  investigating  each 
budget  line  to  ascertain  the  optimal  volume  of 
funds  required  to  realize  the  goals  embodied  in  the 
programs.  Unfortunately,  such  an  approach  is  not 
feasible  given  the  vastness  of  the  government  and 
the  resulting  volume  of  decisions  that  must  be 
made.  By  taking  an  incremental  approach,  how¬ 
ever,  budgeters  arrive  at  funding  levels  for  all  bud¬ 
get  lines,  ensuring  that  programs  continue  serving 
their  purposes  even  if  neither  the  funding  levels  nor 
the  achievement  of  program  goals  are  optimal. 

Additional  research,  however,  observed  that 
many  budgeting  decisions  were  not  incremental, 
spurring  refinement  of  Aaron  Wildavksy’s 
approach.  John  Padgett’s  serial  judgment  theory  of 
budgeting  stresses  continuity  with  the  incremental 


perspective  in  that  budgeters’  decisions  are  under¬ 
stood  from  a  boundedly  rational  perspective. 
Budgeters  cope  with  the  volume  and  complexity  of 
the  decisions  they  face  by  starting  with  the  level  of 
funding  from  the  previous  year  and  satisficing  to 
arrive  at  funding  levels  for  the  current  budget. 
However,  the  heuristic  employed  by  budgeters  is 
not  incremental  choice.  Rather  than  consciously 
choosing  a  funding  amount  that  differs  slightly 
from  that  disbursed  previously,  budgeters  con¬ 
sider,  one  after  another,  alternatives  for  funding 
levels  until  they  arrive  at  one  that  is  acceptable.  In 
this  way,  budgeters  satisfice  in  that  they  do  not  try 
to  discover  optimal  funding  levels  for  programs. 
Critically,  though,  budgetary  outputs  are  predicted 
to  differ  from  outputs  predicted  by  incremental 
approaches.  Because  budgeters  move  from  alterna¬ 
tive  to  alternative  until  they  reach  an  outcome  that 
is  good  enough,  it  is  possible  for  them  to  arrive  at 
funding  levels  that  are  substantially  different  from 
the  amounts  funded  in  the  previous  year’s  budget. 
Such  dramatic  changes  can  occur,  for  example, 
when  budgeters’  understandings  of  what  is  neces¬ 
sary  from  a  programmatic  standpoint  leads  them 
to  pick  a  funding  level  that  happens  to  be  a  sig¬ 
nificant  departure  from  the  previous  year’s  base. 
Research  focusing  on  the  distribution  of  year-to- 
year  changes  in  programs’  funding  levels  in  the 
context  of  U.S.  government  budgets  revealed  that 
this  perspective  explained  better  the  distribution  of 
year-to-year  changes  than  an  incremental  approach 
did.  By  specifying  more  precisely  the  decision¬ 
making  heuristics  employed  by  budgeters,  the 
serial  judgment  perspective  improved  on  the  incre¬ 
mental  model  while  remaining  anchored  firmly  to 
a  realistically  bounded  rational  view  of  human 
decision  making. 

More  recent  research  by  Frank  Baumgartner 
and  Bryan  Jones  integrates  research  on  agenda  set¬ 
ting  with  a  boundedly  rational  approach  to  bud¬ 
geting.  Because  the  set  of  issues  that  they  can 
attend  to  is  small,  budgeters  ignore  most  informa¬ 
tion  from  the  political  environment.  They  do  so 
even  in  the  face  of  political  mobilizations  by  coali¬ 
tions  favoring  substantial  funding  changes.  Under 
these  circumstances,  year-to-year  changes  in  bud¬ 
gets  are  characterized  by  incremental  shifts  in 
funding  levels.  However,  when  groups  in  favor  of 
change  are  successful  in  securing  space  for  their 
concerns  on  the  government’s  agenda,  budgeters 
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are  forced  to  attend  to  the  issues  that  these  groups 
prioritize.  In  doing  so,  budgeters  begin  to  consider 
alternative  funding  amounts  for  budget  lines  seri¬ 
ally.  Since  past  decisions  related  to  the  issues  have 
resulted  in  incremental  changes,  political  coalitions 
that  have  focused  budgeters’  attention  on  a  specific 
budget  line  are  likely  to  desire  significant  change  in 
that  line.  Therefore,  the  funding  levels  that  allow 
budgeters  to  satisfice  in  such  instances  are  likely  to 
constitute  sizable  increases/decreases.  After  all, 
substantial  changes  are  the  only  ones  likely  to  sat¬ 
isfy  the  political  forces  behind  the  mobilization.  In 
this  way,  the  heuristics  that  budgeters  employ  dif¬ 
fer  depending  on  whether  advocates  for  change 
can  secure  space  for  their  priorities  on  the  agenda. 
If  not,  budgeters  ignore  information  from  such 
groups,  leading  to  incremental  changes.  However, 
when  agenda  space  is  secured,  serial  judgment  heu¬ 
ristics  are  triggered.  What  is  more,  the  nature  of 
the  mobilization  implies  that  only  substantial 
changes  in  funding  levels  allow  budgeters  to  allay 
the  concerns  of  the  coalitions — and,  by  extension, 
allow  budgeters  to  satisfice. 

Shifting  gears,  a  separate  tradition  on  govern¬ 
ment  budgets  asks  whether  political  elites  manipu¬ 
late  budgets  to  serve  their  goals.  One  line  of  such 
research  asks  whether  ruling  political  parties  stim¬ 
ulate  the  economy  prior  to  elections  in  order  to 
maintain  their  control  of  government  and  its  deci¬ 
sions.  This  political  budget  cycle  thesis  was  spurred 
by  research  conducted  in  the  1970s  examining 
indicators  of  economic  health  in  light  of  election 
cycles  in  the  United  States  and  in  other  major 
industrial  democracies  during  the  post-World  War 
II  era,  observing  that  unemployment  dipped  and 
economic  growth  increased  in  years  during  which 
elections  occurred.  Naturally,  the  argument  behind 
the  thesis  is  that  the  ruling  party  prefers  to  remain 
in  control  of  government.  Therefore,  it  pursues 
policies  that  stimulate  the  economy  to  achieve  eco¬ 
nomic  growth  prior  to  the  election.  Voters,  the 
ruling  party  hopes,  observe  this  performance  and 
reward  it  by  returning  it  to  power. 

Subsequent  research  on  this  hypothesis  has  pro¬ 
duced  mixed  findings  regarding  the  ability  of  rul¬ 
ing  parties  to  pursue  this  strategy  successfully.  In 
recent  research,  James  Alt  and  David  Lassen  sug¬ 
gest  why:  The  ability  of  ruling  parties  to  exploit 
their  position  is  conditional  on  several  factors.  The 
first  involves  how  transparent  the  government’s 


fiscal  policy  decisions  are  to  the  public. 
Transparency  allows  the  media  and  political  oppo¬ 
sition,  and  by  extension  the  public,  to  observe  the 
government’s  intentions,  making  it  more  difficult 
for  the  ruling  party  to  try  to  create  economic 
growth  without  being  accused  of  doing  so  for  its 
own  benefit.  The  second  factor  is  the  level  of 
polarization  between  parties.  As  policy  conflict 
increases  between  the  ruling  party  and  its  rival(s), 
the  costs  to  the  party  of  losing  control  of  govern¬ 
ment  to  the  opposition  increases.  These  costs 
increase  with  policy  conflict  because  the  policies 
that  the  opposition  (as  the  future  ruling  party)  will 
adopt  will  be  more  offensive  to  the  (present)  ruling 
party  for  higher  levels  of  policy  disagreement 
between  parties.  In  summary,  while  it  may  be  fea¬ 
sible  for  the  ruling  party  in  government  to  rig  the 
economy  in  order  to  maximize  its  chances  of  con¬ 
tinuing  in  power,  the  circumstances  under  which 
this  strategy  is  feasible  may  be  limited. 

Another  line  of  research  focusing  on  elite  influ¬ 
ence  over  budgets  examines  the  participation  of 
bureaucrats  in  shaping  the  budgets  of  their  agen¬ 
cies.  One  theory  advanced  in  the  early  1970s  held 
that  bureaucrats  were  “budget  maximizers”  in 
that  they  wished  to  expand  the  size  of  their  agen¬ 
cy’s  budget  as  much  as  possible.  What  is  more, 
bureaucrats  are  well  positioned  to  do  so  because 
they  possess  more  information  about  what  is  nec¬ 
essary  to  achieve  the  policy  goals  their  agencies  are 
responsible  for  meeting  than  elected  officials  who 
create  budgets.  This  information  asymmetry  allows 
bureaucrats  to  receive  substantial  increases  in  their 
agencies’  budgets. 

Subsequent  research  in  the  tradition  of  the 
budget-shaping  perspective  by  Patrick  Dunleavy  and 
others,  however,  has  led  to  a  substantial  refinement 
of  this  view.  In  addition  to  noting  that  budgets  are 
not  characterized  by  the  growth  predicted  by  the 
maximization  perspective,  these  critics  note  that, 
because  an  agency’s  budget  is  a  collective  good, 
bureaucrats  face  a  collective  action  problem  in  advo¬ 
cating  budget  expansions.  Moreover,  because  agen¬ 
cies’  budgets  are  disaggregated,  bureaucrats  possess 
no  guarantee  that  increases  to  the  budget  will  accrue 
to  their  corner  of  the  agency,  undercutting  the  incen¬ 
tive  to  lobby.  Bureaucrats  also  face  varied  incentives 
to  lobby  for  expansions  based  on  the  type  of  agency 
they  work  for  and  their  position  within  their  agency. 
In  particular,  the  highest  ranking  officials  most 
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capable  of  lobbying  possess  the  least  incentive  to 
do  so  because,  from  a  pecuniary  standpoint,  their 
salaries  do  not  increase  with  budget  increases. 
Instead  of  budget  maximizing,  the  budget-shaping 
view  argues  that  agency  executives  prefer  to  use 
the  budget  to  construct  an  agency  such  that  its 
staff  perform  professional  functions  characterized 
by  high  levels  of  discretion  that  foster  creative 
solutions  to  problems.  Executives  in  public  agen¬ 
cies  are  thought  to  prioritize  such  “shaping” 
because  individuals  drawn  to  career  advancement 
in  such  agencies  are  likely  to  value  such  a  profes¬ 
sional  orientation.  Additionally,  and  at  any  rate, 
laws  forestall  the  ability  of  bureaucrats  to  capture 
pecuniary  rewards  from  public  service  beyond 
their  salaries. 

Jason  A.  MacDonald 
West  Virginia  University 
Morgantown,  West  Virginia,  United  States 

See  also  Agenda  Setting;  Political  Economy;  Public 
Budgeting;  Rationality,  Bounded 


Further  Readings 

Alt,  J.  E.,  &  Lassen,  D.  D.  (2006).  Transparency,  political 
polarization,  and  political  budget  cycles  in  OECD 
countries.  American  Journal  of  Political  Science,  50, 
530-550. 

Dunleavy,  P.  (1985).  Bureaucrats,  budgets,  and  the 
growth  of  the  state:  Reconstructing  an  instrumental 
model.  British  Journal  of  Political  Science,  15,  299- 
328. 

Jones,  B.  D.,  &  Baumgartner,  F.  R.  (2005).  The  politics 
of  attention:  How  government  prioritizes  problems. 
Chicago:  University  of  Chicago  Press. 

Padgett,  J.  F.  (1980).  Bounded  rationality  in  budgetary 
research.  American  Political  Science  Review,  74, 
354-372. 

Wildavsky,  A.  (1964).  The  politics  of  the  budgetary 
process.  Boston:  Little,  Brown. 


Bureaucracy 


The  term  bureaucracy  denotes  a  particular  form 
of  organization  that  is  complex  and  differentiated, 
has  defined  rules  and  procedures,  and  is  subject  to 


a  command-and-control  system  of  hierarchical 
authority.  This  entry  discusses  the  nature  and 
functions  of  bureaucracy  and  the  bureaucratic 
state,  as  well  as  the  problems  and  deficiencies 
associated  with  it. 

Definition  and  Characteristics 

The  illustrious  German  sociologist  Max  Weber 
noted  several  characteristics  of  the  bureaucratic 
form  of  organization  that  defined  it  as  an  ideal 
type. 

First,  bureaucracy  derives  its  authority  through 
law.  It  depends  neither  on  traditional  nor  on  per¬ 
sonal  modes  of  authority.  It  is  instead  based  on  a 
form  of  legal-rational  authority  that  defines  the 
foundations  of  the  organization’s  jurisdiction  and 
its  procedures  of  operation. 

Second,  bureaucracy  has  differentiated  offices 
or  units  that  have  specialized  competencies  and 
jurisdictions.  This  aspect  of  bureaucracy  provides 
it  with  some  obvious  advantages,  including  the 
ability  to  bring  technical  expertise  and  experience 
to  bear  on  particular  aspects  of  problems.  It  also 
raises  some  difficulties,  however,  in  coordination 
of  tasks,  and  it  sometimes  creates  conflicting  per¬ 
spectives  as  to  what  the  task  is  or  how  a  problem 
should  be  defined. 

Third,  bureaucracy  is  characterized  by  an  inter¬ 
nally  hierarchical  system  of  authority  required  to 
bring  its  distinctive  parts  together  as  part  of  a  uni¬ 
fied  system  of  coordination.  The  extent  to  which 
this  hierarchical  control  system  can  actually  be  put 
in  place  is  almost  always  imperfect,  though  the 
extent  to  which  it  can  be  consummated  rests  on 
many  factors  other  than  coercion.  These  include 
leadership  that  obtains  support  from  all  of  the 
organization’s  distinctive  offices  and  units,  clarity 
of  purpose  from  organizational  leadership,  and 
careful  assessments  of  feasible  paths  to  ends. 

Fourth,  bureaucracy  is  also  characterized  by 
rules  and  procedures  that  govern  its  internal  func¬ 
tions.  It  has  a  memory  and  a  set  of  procedures  for 
dealing  with  recurring  matters.  Weber  referred  to 
this  as  the  organization’s  “files,”  and  this  storage 
of  experience  and  written  rules  anchors  bureau¬ 
cracy  in  the  legal-rational  system  of  authority.  The 
files  provide  precedents  and  also  produce  standard 
operating  procedures  (SOPs)  that  enable  bureau¬ 
cracies  to  become  more  proficient  in  responding  to 
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problems  that  can  be  readily  identified  and  are 
within  the  organizational  repertoire.  The  more 
ambiguous  or  multifaceted  the  problem,  however, 
the  more  complicated  will  be  the  sorting  process 
within  a  bureaucratic  organization  and  the  more 
likely,  therefore,  that  problems  are  recognized  and 
defined  in  ways  reflecting  the  complex  differentia¬ 
tion  of  bureaucratic  organizations.  This  means 
that  there  are  likely  to  be  many  different  perspec¬ 
tives  rather  than  a  unified  one. 

Fifth,  bureaucracy  is  typically  characterized  by 
a  full-time  corps  of  officials.  In  governments,  these 
officials  are  known  as  civil  servants;  they  represent 
the  continuity  of  the  organizations  of  government, 
their  missions  and  functions,  and  the  antecedent 
ongoing  commitments  of  government.  Similarly,  in 
private  enterprise  and  even  in  nonprofit  organiza¬ 
tions  (nongovernmental  organizations),  full-time 
managers  and  subordinates  provide  the  basic 
maintenance  functions  within  an  organization, 
negotiate  its  external  environment,  and  organize 
its  production  and  marketing  functions. 

Realities  are  often  more  complex  than  this 
ideal-type  description  of  bureaucracy.  Sometimes 
bureaucratic  forms  exist  but  without  the  reputed 
benefits  of  expertise  and  efficient  allocations  of 
attention.  Bureaucratic  pathologies  may  run  deep 
reflecting  both  environmental  and  cultural  influ¬ 
ences,  shortages  of  resources,  and  incentives  for 
suboptimal  performance.  These  pathologies  are 
especially  notable  in  less  developed  countries. 
Consequently,  to  understand  the  topic  better,  this 
entry  addresses  the  following  questions:  Why  do 
we  have  bureaucracy?  What  are  the  relationships 
with  capitalism  and  market?  What  is  the  bureau¬ 
cratic  state  and  what  are  its  relationships  with 
democracy?  What  are  the  internal  and  external 
pathologies  of  bureaucratic  behavior? 

Why  Do  We  Have  Bureaucracy? 

In  Weber’s  ideal-type  description  of  bureaucracy, 
he  noted  that  bureaucracy  was  the  technically  most 
proficient  and  efficient  form  of  organization.  Far 
from  the  negative  stereotypes  that  have  arisen  in 
the  contemporary  era  from  pathologies  associated 
with  bureaucracy,  Weber  observed  that  bureau¬ 
cratic  organization  was  a  necessary  concomitant 
condition  for  the  rise  of  capitalism  based  on  eco¬ 
nomic  efficiency.  The  rise  of  the  modern  public 


stock  corporation  separated  management  and 
ownership  making  the  managers  extremely  impor¬ 
tant  to  the  direction  and  guidance  of  the  firm. 

Capitalism  necessitated  legal-rational  forms  of 
authority.  The  modern  corporation  is  itself  a  legal 
entity.  Unlike  a  person  or  even  a  partnership,  corpo¬ 
rations  have  limited  liability  for  their  officers. 
Corporations  and  other  types  of  firms  require  legally 
grounded  rules  in  a  capitalist  system.  Disputes  must 
ultimately  be  settled  through  juridical  means,  and 
agreements  must  be  carefully  and  legally  delineated 
so  that  there  is  a  record — what  Weber  referred  to  as 
“the  files” — or  a  recorded  precedent.  Impersonal 
authority  is  central  to  a  system  of  rational-legal 
authority,  and  bureaucracy  is  the  embodiment  of  an 
organization  defined  by  rules  of  procedure  and 
responsive  to  external  legal  authority. 

Above  all,  complexity  requires  organizational 
differentiation,  specialization,  and  limited  jurisdic¬ 
tion  coordinated  through  a  hierarchical  system  of 
authority.  These  crucial  aspects  of  bureaucracy  not 
only  allow  for  greater  efficiency,  economy  of 
effort,  and  expertise  but  also  lead  to  problems  of 
coordination. 

Bureaucracy  and  Markets 

On  the  face  of  it,  bureaucracy  and  capitalism  seem 
to  live  in  an  uneasy  coexistence.  One  is  structured 
by  authority  and  legal  rules,  while  the  other  in  its 
pure  form  reflects  an  uninhibited  free  exchange  of 
goods  and  services  mediated  by  the  price  mecha¬ 
nism.  The  reality,  however,  is  much  more  compli¬ 
cated.  Sellers  are  not  necessarily  numerous,  and 
where  possible,  they  seek  to  control  markets  and 
thus  influence,  if  not  dictate,  prices.  Public  stock 
corporations’  interests  lie  in  increasing  the  value  of 
their  stock.  Rarely  are  consumers  fully  armed  with 
perfect  information  as  classical  theories  of  econom¬ 
ics  assume.  The  rise  of  governmental  regulation  of 
markets  and  marketing  practices  and  of  the  safety 
of  products  provides  a  means  of  short-circuiting 
the  information  needs  that  consumers  by  them¬ 
selves  would  be  unlikely  to  obtain.  As  a  conse¬ 
quence,  the  rise,  and,  to  some  degree,  the  inevitable 
excesses  of  the  profit  motive  lead  to  the  necessity 
of  regulation  and  the  growth  of  governmental 
bureaucracy.  Newly  capitalistic  systems,  in  fact, 
often  suffer  from  a  rapid  expansion  of  production 
without  adequate  means  of  regulating  the  quality 
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or  safety  of  what  is  being  produced.  Such  mecha¬ 
nisms  tend  to  evolve  in  developed  capitalist  sys¬ 
tems  in  order  to  counter  the  adverse  impacts  of 
unregulated  capitalism. 

The  ascendency  of  capitalism  and  the  modern 
corporation  roughly  parallels  the  rise  of  profession¬ 
alism  in  government  and  the  development  of  the 
modern  state,  characterized  by  governmental 
bureaucracy.  Ironically,  while  it  is  sometimes 
claimed  that  capitalism  rests  on  the  free  exchange 
of  goods  and  services  through  the  price  mechanism 
and  is  thus  antithetical  to  bureaucracy,  the  reality  is 
that  modern  governments,  through  their  bureau¬ 
cracies,  pave  the  way  for  markets  to  function  by 
providing  infrastructure,  regulation  of  markets, 
transparency  in  market  processes,  and  regulation  of 
the  relationships  between  private  and  public  inter¬ 
ests.  In  reality,  states  and  markets  have  been  more 
often  collaborative  than  in  conflict.  Indeed,  it  is 
hard  to  imagine  the  effective  functioning  of  markets 
without  the  effective  functioning  of  government. 
The  complexity  of  the  modern  corporation  and  the 
complexity  of  modern  government  have  grown 
together  more  symbiotically  than  antagonistically. 

The  Bureaucratic  State 

It  is  hard  to  imagine  modern  government  without 
bureaucratic  capability.  Weak  states  lack  capabil¬ 
ity.  And  weak  states  are  often  characterized  as  ones 
affected  by  civil  strife,  lacking  effective  records, 
influenced  by  corruption,  and  unable  to  bring  ser¬ 
vices  to  their  populations,  especially  those  that  are 
remote  from  their  capital  cities.  They  are  also  typi¬ 
cally  incapable  of  collecting  revenues  in  an  impar¬ 
tial  fashion.  Cronyism  runs  amok.  A  professional 
corps  of  civil  servants  is  usually  absent  and  govern¬ 
ment  is  most  often  turned  into  a  spoils  system  of 
tribal  winners — or,  as  a  means  of  keeping  peace 
among  contending  factions,  sometimes  awarding 
some  of  the  spoils  to  each  of  them.  Weak  states  are 
inept  precisely  because  they  lack  the  desiderata  of 
professional  bureaucratic  organization  and  a  pro¬ 
fessional  civil  service  and  the  resources  to  ensure 
their  ability  to  function.  Consequently,  modern 
government  and  bureaucracy  are  nearly  inter¬ 
changeable.  Continuity,  specialized  expertise,  a 
professional  corps  of  civil  servants,  the  rule  of  law 
and  impersonal  authority,  and  documented  records 
are  the  essence  of  bureaucracy  and  of  modern 


states  capable  of  governing,  providing  services, 
and  collecting  revenues. 

For  the  most  part,  the  bureaucratic  state  largely 
preceded  the  arrival  of  democratic  government.  It 
served  the  interests  of  a  sovereign  crown  and  typi¬ 
cally  led  to  the  crown’s  extension  of  its  authority 
and,  eventually,  to  what  we  know  of  today  as  the 
modern  state.  As  a  result,  the  state  bureaucracy 
often  has  been  seen  as  a  conservatizing  force  in 
society — the  consolidator  of  existing  authority 
rather  than  a  challenger  to  it.  With  the  growth  of 
mass  enfranchisement,  the  bureaucracy  was  often 
viewed  as  the  upholder  of  the  existing  regime  and 
a  potential  threat  to  democracy.  Its  power  rests  on 
its  continuity  and  specialized  expertise  and  its  per¬ 
manence  and  durability,  while  the  temporarily 
mandated  leaders  of  government  are  often  in  a 
position  of  dependence  on  the  bureaucracy.  In  a 
frequently  invoked  phrase,  are  the  experts  on  tap 
or  on  top? 

One  major  exception  to  the  bureaucracy- 
democracy  sequence  was  the  United  States  where 
mass  enfranchisement  among  White  males  pre¬ 
ceded  the  development  of  a  professionalized 
bureaucracy.  Even  after  a  professional  civil  service 
emerged,  the  federal  government  had  few  respon¬ 
sibilities  until  the  1930s,  when  the  Franklin 
Roosevelt  administration  extended  the  federal  role 
into  the  realms  of  economic  and  business  regula¬ 
tion,  the  provision  of  social  insurance,  and,  ulti¬ 
mately,  the  welfare  state. 

There  has  been  much  discussion  across  many 
developed  societies  about  transforming  bureau¬ 
cracy,  lightening  its  load,  and  moving  toward 
market-oriented  incentives,  but  especially  in  the 
United  States,  the  bureaucracy  has  been  deeply 
controversial.  There,  the  identification  of  bureau¬ 
cracy  with  “big  government”  and  with  controver¬ 
sial  social  reforms  makes  it  both  a  point  of  parti¬ 
san  division  and  an  object  of  derision. 

The  essential  point  is  that  while  a  facade  of 
bureaucracy  is  no  guarantee  of  effective  govern¬ 
ment,  it  is  virtually  impossible  to  imagine  effective 
government  without  bureaucracy.  That,  however, 
does  not  mean  that  public  bureaucracy  will  be 
without  controversy  or  demands  for  reform  to 
which  it  has  been  subjected  especially  since  the 
1980s.  Nor  does  it  mean  that  the  bureaucratic 
organizational  form  is  without  contradictions  or 
that  it  can  lead  to  certain  pathologies. 
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Bureaucratic  Pathologies 

Despite  the  advantages  conferred  by  bureaucratic 
organization,  pathologies  result  from  the  internal 
contradictions  of  this  form  of  organization.  These 
we  will  call  internal  pathologies.  Another  set  of 
pathologies  results  from  the  ways  in  which  bureau¬ 
cracies  relate  to  their  environment.  These  we  will 
call  external  pathologies. 

Internal  Pathologies 

Analysts,  as  well  as  antagonists,  of  bureaucracy 
have  pointed  to  a  variety  of  problems  stemming 
from  the  logic  of  bureaucratic  organization.  These 
are  presented  in  no  particular  order  of  importance. 

One  pathology  stems  from  the  very  system  of 
impersonal  authority  and  legal  rationality  on 
which  bureaucratic  organization  is  based.  As  a 
rule-driven  form  of  organization,  bureaucracies 
must  defend  decisions  based  on  legal  rulings  and 
precedents.  Consequently,  bureaucracies  may  be 
short  on  compassion,  empathy,  or  extenuating 
circumstances  of  case  equity.  Uniform  procedures 
rather  than  case  equity  are  a  hallmark  of  bureau¬ 
cratic  organization.  A  frequent  criticism  of  this 
formalism  in  bureaucracy  is  characterized  by  accu¬ 
sations  directed  to  “faceless,”  “nameless,”  or 
“unfeeling”  bureaucrats  whose  decisions  defy 
“common  sense.”  Actually,  some  studies  of 
bureaucrats  dealing  with  clients  indicate  that 
bureaucrats  themselves  often  try  to  provide  sym¬ 
pathy,  guidance,  and  solace  within  the  constraints 
of  the  law.  But  law  is  what  guides  bureaucracy, 
and  this  particular  pathology,  in  reality,  is  a  cri¬ 
tique  of  the  rule  of  law.  By  contrast,  personal  rela¬ 
tionships  often  govern  politics,  and  these  relation¬ 
ships,  in  turn,  generate  charges  of  favoritism. 

A  second  pathology  stems  from  the  command 
and  control  system  of  hierarchy  in  bureaucracies. 
Hierarchical  authority  is  necessary  to  coordinate 
the  various  specialized  jurisdictions  as  well  as  to 
provide  an  ultimate  point  of  organizational 
accountability.  Still,  there  is  a  continuous  tension 
in  all  complex  organizations  between  strict  control 
from  the  top  and  devolution  of  authority  to  those 
at  more  operational  levels  within  an  organization. 
Paradoxically,  control  from  the  top  downward 
produces  more  definitive  accountability  while  also 
stymieing  an  organization’s  ability  to  adapt  to 
problems  it  encounters  on  the  ground.  It  also  may 


encourage  risk-averse  behavior  and  a  desire  on  the 
part  of  subordinates  to  keep  chastening  news  away 
from  their  superiors.  Studies  of  military  units 
observe  that  noncommissioned  officers  (the  bot¬ 
tom  rung  of  authority)  are  crucial  to  adaptability 
and  to  organizational  effectiveness.  How  to  appor¬ 
tion  and  allocate  authority  in  bureaucratic  organi¬ 
zations  in  ways  that  keep  it  from  flying  apart  and 
thus  adhering  to  no  common  identity  remains 
unresolved.  At  the  same  time,  how  to  imprint  lead¬ 
ership  direction  without  imposing  stultifying 
restrictions  on  operational  units  is  also  unclear. 
Organizations  need  to  be  responsive  and  adaptive 
to  their  environments  but  frequently  cannot  be 
if  control  from  the  top  is  excessively  restrictive. 
Obviously,  balance  is  needed,  but  it  is  unclear, 
certainly  in  the  abstract,  as  to  exactly  where  that 
balance  should  be  struck.  Without  sufficient 
control,  unaccountable  behaviors  may  become 
rampant — a  potential  issue  especially  for  organiza¬ 
tions  with  many  “street-level”  bureaucrats.  But 
too  much  control  is  likely  to  lead  to  cautious,  per¬ 
haps  even  timid,  behaviors  on  the  part  of  those 
responsible  for  hands-on  operations. 

A  third  pathology  of  bureaucratic  behavior 
stems  from  the  differentiated  and  specialized  juris¬ 
dictional  character  of  bureaucratic  organization. 
Especially  when  individuals  have  spent  much  of 
their  careers  in  a  given  unit,  they  are  likely  to  look 
at  and  define  problems  within  the  definitional  con¬ 
text  of  their  units.  A  number  of  studies  show  that 
each  unit  tends  to  see  the  “face  of  a  problem”  in  a 
way  consistent  with  their  unit’s  responsibilities  and 
the  subculture  of  their  unit.  Given  the  tendency  to 
search  for  problem  definition  and  responses  in  a 
cognitively  economical  fashion,  responses  are  apt 
to  rely  deeply  on  how  specific  organizational  units 
define  and  codify  responses  and  perpetuate  them. 
Does  this  produce  a  cacophony  of  voices  or  a  neces¬ 
sary  diversity  of  perspectives?  Again,  a  problem  of 
finding  the  ever-elusive  right  balance  appears. 
Organizational  subunit  responses  are  not  random. 
In  fact,  they  are  typically  predictable.  But  the  vari¬ 
ous  subunits’  responses  cannot  all  be  equally  satis¬ 
fied,  which  is,  of  course,  where  leadership  comes  in. 
Differentiation  and  the  codification  and  subcultural 
practices  that  stem  from  the  distinctive  component 
parts  of  bureaucratic  organization  provide  predict¬ 
ability  but  not  synthesis,  which  then  becomes  the 
responsibility  of  central  organizational  leadership. 
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Studies,  even  of  crises,  show  how  distinctive  “the 
face  of  the  problem”  will  appear  to  different  orga¬ 
nizational  units  in  accordance  with  their  distinct 
missions  and  their  core  cultures,  technologies,  and 
perspectives.  Can  unity  and  clarity  of  direction  be 
woven  against  the  backdrop  of  these  distinctive 
perspectives?  And  can  central  organizational  leader¬ 
ship  coax  the  losers  in  bureaucratic  battles  to  coop¬ 
erate  rather  than  to  continue  the  struggle  covertly? 

A  fourth  potential  pathology  of  bureaucratic 
organization  is  the  reliance  on  precedent.  One  of 
the  most  important  characteristics  of  bureaucracy 
is  that  it  provides  for  continuity.  Critical  to  conti¬ 
nuity  is  the  recording  of  precedents  and  the  use  of 
precedents  as  a  platform  through  which  to  assess 
current  issues.  Precedent  serves  to  economize  on 
cognitive  energy.  Basing  current  choices  on  prece¬ 
dents  reduces  the  costs  of  search  for  organizations. 
However,  reliance  on  precedents  may  lead  to  a 
highly  conservative  outlook  temperamentally,  one 
that  may  be  more  wedded  to  the  past  than  to  the 
future.  Bureaucracy  tends  to  be  thought  of  as  a 
lumbering  status  quo  form  of  organization.  That  is 
due  in  no  small  part  to  the  emphasis  on  following 
rules  and  procedures  and  the  reliance  on  prece¬ 
dents  set  by  the  organization.  Ultimately,  bureau¬ 
cracies  are  expert  at  routinizing  the  previously 
unexpected  or  contingent.  But  once  this  is  done, 
continuity  with  precedents  becomes  part  of  the 
operating  modality  and  departures  that  may  be 
required  for  adaptability,  equity,  or  even  sensibil¬ 
ity  become  harder  to  achieve.  Bureaucracy  pro¬ 
vides  ballast  and  stability  but  by  itself  is  unlikely  to 
provide  fresh  direction. 

A  fifth  possible  pathology  lies  in  the  intersection 
between  the  individual  interests  of  bureaucrats  and 
collective  well-being.  Some  analysis  based  on  eco¬ 
nomic  models,  for  example,  suggests  that  bureau¬ 
crats  have  a  stake  in  the  growth  of  their  agencies’ 
funding  as  they  themselves  are  apt  to  benefit  from 
the  growth  of  revenue.  A  more  compelling  inter¬ 
pretation,  though,  would  propose  that  where  the 
fundamental  technology  of  an  organizational  unit 
is  invested  in  significantly  by  the  bureaucrats  of 
that  unit,  changes  in  the  core  technology  puts  at 
risk  those  whose  training  has  been  part  of  a  prior 
technology.  The  more  intensive  and  specialized 
that  training,  the  more  resistance  there  may  be  to 
alterations  in  it  that  would  threaten  the  status  of 
the  current  technology.  The  usual  cliche  about  this 


is  that  all  military  forces  are  prepared  to  fight  the 
previous  war.  This  bit  of  conventional  wisdom 
may  well  be  overstated  but  there  is  a  fundamental 
truth  behind  it,  and  that  is  that  no  one  likes  to  be 
displaced  by  a  newer  or  different  technology  that 
renders  one’s  own  training  obsolete.  The  human 
being  inside  the  machinery  of  complex  bureau¬ 
cratic  organization,  therefore,  needs  to  be  taken 
into  account. 

In  summary,  the  virtues  of  bureaucratic  organi¬ 
zation  also  contain  the  seeds  of  its  deficiencies. 
Continuity,  predictability,  legality,  precedent,  and 
specialized  expertise,  accordingly,  may  lead  to  dif¬ 
ficulties  in  adapting  to  change  or  recognizing  nov¬ 
elty,  difficulties  in  applying  case  equity,  and  prob¬ 
lems  of  coordination.  Hierarchical  authority  also 
can  lead  to  a  lack  of  sensitivity  to  problems  on  the 
ground.  Yet  a  lack  of  hierarchical  control  may  lead 
to  a  lack  of  accountability.  Because  these  problems 
are  built  into  a  system  of  law  and  impersonal 
authority,  there  are  no  set  formulas  for  resolving 
them.  But  they  remain  problems  inherent  in 
bureaucracy. 

External  Pathologies 

Despite  our  dependence  on  bureaucracy  to  pro¬ 
vide  orderly  processes,  the  virtues  of  bureaucracy 
tend  to  be  hidden  to  the  unschooled  eye,  whereas  its 
deficiencies  seem  to  be  obvious.  One  of  these  defects 
lies  in  the  rule  boundedness  of  bureaucracy  resulting 
in  the  well-known  “red  tape”  problem.  In  contrast 
to  the  realm  of  personal  authority  where  one  may 
petition  to  have  a  problem  fixed  through  political 
means,  procedural  fairness  and  the  accountability  of 
bureaucracy  to  legal  and  auditing  requirements 
inevitably  produces  large  amounts  of  paperwork. 
The  objective  is  to  reduce  the  improper  use  of  funds 
and  procedural  irregularities  and  to  ensure  a  record 
of  transactions.  Few  people,  even  bureaucrats,  are 
enthralled  by  red  tape.  Most  are  frustrated  by  it.  But 
red  tape  is  the  product  of  demands  for  accountabil¬ 
ity  and  procedural  regularity,  especially  in  the  public 
sphere.  A  bureaucracy  notably  responsive  to  its  cli¬ 
enteles  might  be  regarded  as  showing  favoritism  and 
evading  lawful  procedures.  The  reality  is  that  while 
we  extol  the  rule  of  law  in  the  abstract,  we  often 
prefer  personal  relationships  concretely.  That  is,  we 
often  want  a  “fixer”  to  deal  with  the  complications 
that  regularity  of  procedures  demands.  Everyone 
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wishes  to  cut  through  the  maze  of  red  tape  to  get 
what  they  want,  but  they  also  want  to  have  proce¬ 
dures  in  place  to  deter  others  from  getting  what  is 
improper. 

Public  bureaucracies  particularly  are  expected 
to  be  both  accountable  and  responsive.  These 
terms,  however,  are  remarkably  elusive. 
Accountable  to  what  and  responsive  to  whom  are 
questions  that  immediately  arise.  There  appears  to 
be  no  definitive  agreement  as  to  what  precisely 
these  words  mean.  Often,  they  are  used  inter¬ 
changeably.  If,  however,  accountability  is  taken  to 
mean  being  answerable  to  a  legally  grounded 
authorization — a  constitution,  a  sovereign,  a  law, 
or  a  fiduciary  responsibility — and  if  responsiveness 
is  taken  to  mean  acceptance  of  the  demands  of 
temporary  political  authorities  or  of  agency  clien¬ 
teles,  one  can  see  how  these  ideas  may  come  into 
conflict.  The  great  dilemma  is  for  bureaucracies  to 
be  accountable  and  responsive  up  to  a  point — a 
point,  however,  that  rarely  can  or  even  ought  to  be 
satisfied  fully.  An  overly  responsive  bureaucracy 
will  no  longer  be  seen  as  neutral,  fair,  or  judicious. 
A  bureaucracy,  however,  that  is  unresponsive 
either  has  succeeded  in  displacing  the  elected  or,  at 
the  other  extreme,  has  lost  its  political  relevance. 
Bureaucrats  need  some  independence  from  the  cur¬ 
rent  political  authorities  if  they  are  to  function 
properly,  but  the  line  demarcating  independence 
and  unresponsiveness  is  very  murky  indeed.  To 
find  the  appropriate  balance  is  as  difficult  as  it  is 
necessary.  Studies  of  senior  civil  servants  in  many 
developed  countries  indicate  that  they  are  sensitive 
to  their  political  context  but  are  frequently  con¬ 
cerned  about  the  proper  equilibrium  between  neu¬ 
tral  competence  and  accountability  on  the  one 
hand  and  political/policy  responsiveness  to  the 
current  authorities  on  the  other. 

From  a  political  perspective,  another  criticism 
of  bureaucracy,  especially  as  a  regulator  of  busi¬ 
ness  is  that  it  is  a  “deadweight”  on  the  functioning 
of  markets  by  imposing  costs  on  firms  that  add  to 
prices.  While  regulation  by  bureaucrats  is  the 
product  of  political  decisions  made  by  duly  autho¬ 
rized  governments,  free  market  proponents  believe 
that  bureaucratic  regulation  diminishes  economic 
dynamism  and  efficiency.  Each  new  regulation 
drives  up  the  cost  of  doing  business  and  restricts 
the  options  that  businesses  may  have  available  to 
them.  When  the  regulatory  hand  is  made  lighter, 


however,  risks  often  arise  that  result  in  crises  or 
avoidable  tragic  outcomes,  and  the  call  for  regula¬ 
tion  is  renewed.  The  proper  balance  between 
regulation  and  business  flexibility,  as  with  other 
relationships  between  desirable  but  contradictory 
values,  is  hard  to  define,  and  the  proper  instru¬ 
mentation  for  overseeing  and  enforcing  regula¬ 
tory  behavior  is  equally  difficult  to  pinpoint. 

Finally,  a  significant  criticism  of  bureaucracy  is 
that  its  incentive  structures  are  all  wrong.  Civil 
servants  are  insulated  through  tenure.  This  insula¬ 
tion  may  lead  to  behavior  that  provides  little 
incentive  to  promote  efficiencies  that  could  disturb 
the  status  quo  since  there  is  little  reward  for  doing 
so.  A  contrast  is  frequently  drawn  with  private 
enterprise  where  continuous  innovation  is  believed 
to  be  necessary  for  a  firm’s  health  and  growth 
prospects.  According  to  this  perspective,  the  busi¬ 
ness  model  is  presumed  to  be  the  form  of  organiza¬ 
tion  worth  emulating,  though,  of  course,  most 
large-scale  businesses  are  also  bureaucracies. 
Administrative-reform  efforts  beginning  in  the 
1980s,  falling  under  the  rubric  of  “new  public 
management,”  essentially  emphasized  the  business 
model,  emphasizing  the  off-loading  of  many  gov¬ 
ernment  functions  to  the  private  sector  and  provid¬ 
ing  managerial  incentives  to  improve  performance 
in  a  scaled-down  public  sector.  The  assumption 
was  that  what  worked  in  the  market  place  would 
work  equally  well  in  government.  Freeing  up  the 
managers  and  holding  them  accountable  for  per¬ 
formance  was  one  reform  pathway.  Creating  agen¬ 
cies  and  managers  with  limited-term  contracts  was 
another.  In  at  least  one  case,  tenure  itself  was 
eliminated  in  the  public  sector. 

The  results  of  this  emphasis  on  making  govern¬ 
ment  more  like  business  are  still  uncertain,  and  the 
movement  toward  providing  incentives  for  pro¬ 
moting  efficiency  and  improved  performance  may 
rest  on  an  as  yet  unconfirmed  assumption  that 
officials  in  the  public  sphere  have  similar  motiva¬ 
tions  as  their  counterparts  in  the  private  sphere.  In 
fact,  what  evidence  there  is  on  this  matter  suggests 
otherwise.  In  the  business  model  critique  of  bureau¬ 
cracy,  performance  should  be  based  less  on  adher¬ 
ence  to  rules  and  more  on  results.  A  lessening  of 
rules-based  administration,  one  can  imagine, 
would  be  open  to  extensive  challenges,  and  a  fun¬ 
damental  search  for  consistent  application  of  law 
would  hardly  disappear.  Emphasizing  results,  of 
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course,  requires  agreement  on  what  the  results 
should  be  and  on  the  metrics  required  to  assess 
performance.  It  is  likely  that  some  agency  missions 
may  not  be  easily  susceptible  to  clear  metrics.  It  is 
also  likely  that  some  agencies  will  adjust  the  defini¬ 
tion  of  their  missions  in  ways  that  allow  for  favor¬ 
able  metrics  even  if  they  are  peripheral  to  their 
broader,  if  ill-defined,  missions. 

Living  With  Bureaucracy 

Not  all  organizations  are  bureaucratic  and,  indeed, 
some  of  the  most  innovative  ones  in  the  private 
sector  tend  to  be  flat  and  intermingled  to  some 
degree  in  their  functions.  However,  the  growth  of 
bureaucracy  reflects  modernity  and  a  system  ruled 
by  legal  authority  and  by  differentiated  function. 
That  produces  annoyances  such  as  the  heavy 
emphasis  on  procedures,  rules,  and  specialized 
offices  whose  understandings  may  not  be  fully 
integrated  with  one  another.  Nevertheless,  bureau¬ 
cracy  is  able  to  provide  continuity  in  government, 
consistency  in  application,  proficiency  through 
specialized  and  limited  jurisdictions,  and  profes¬ 
sionalism  through  a  full-time  civil  service  based  on 
merit. 

The  limits  and  critiques  of  bureaucracy  are 
notable.  Its  virtues  are  also  often  its  vices.  Balancing 
the  contradictory  tensions  of  the  bureaucratic 
form  of  organization  presents  a  continuing  chal¬ 
lenge.  There  is  also  no  doubt  that  much  of  the 
criticism  of  bureaucracy  is  distinctly  political  and 
has  little  to  do  with  bureaucracy  itself  and  more  to 
do  with  views  about  the  appropriate  scope  of  gov¬ 
ernment.  Finally,  to  appreciate  the  advantages  of 
bureaucracy,  it  is  useful  to  contemplate  life  with¬ 
out  it.  It  would  be  a  world  without  continuity, 
without  consistency,  and  with  a  shortage  of  com¬ 
petence.  We  may  not  like  bureaucracy,  but  we 
would  assuredly  dislike  a  world  without  it  even 
more. 
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Bureaucracy,  Rational 
Choice  Models 


This  entry  discusses  rational  choice  models  of  the 
bureaucracy.  It  first  defines  what  it  means  for  a 
theoretical  model  of  the  bureaucracy  to  employ 
rational  choice  and  discusses  two  major  applica¬ 
tions  of  these  models.  It  then  discusses  the  basic 
assumptions  underlying  most  models  of  the 
bureaucracy.  Finally,  it  presents  major  themes  and 
findings  in  the  literature  and  discusses  how  these 
are  related  to  modeling  assumptions. 

Rational  choice  models  of  the  bureaucracy  oper¬ 
ate  under  the  assumption  that  all  relevant  actors  are 
rational.  All  players  in  the  model — the  government, 
bureaucrats,  and  private  sector  entities  affected  by 
bureaucratic  decisions — have  well-defined  prefer¬ 
ence  orderings  over  possible  outcomes  and  act  in  a 
manner  that  ensures  that  their  preferred  outcomes 
are  realized. 
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To  facilitate  analysis,  scholars  in  this  tradition 
typically  assume  that  players’  preferences  satisfy 
several  properties.  It  is  generally  assumed  that 
players’  preferences  can  be  represented  by  a  utility 
function,  such  that  a  player  prefers  outcome  A  to 
outcome  B  if  and  only  if  that  player’s  utility  from 
outcome  A  is  greater  than  that  from  outcome  B.  It 
is  often  further  assumed  that  players  have  consis¬ 
tent  preferences  regarding  risk,  so  that  preferences 
over  uncertain  outcomes  may  be  represented  by 
expected  utilities.  Many  models  further  assume 
that  players  react  to  new  information  according  to 
Bayes’s  rule  (i.e.,  how  one  should  change  one’s 
existing  beliefs  in  light  of  new  evidence). 

Rational  choice  models  of  the  bureaucracy  tend 
to  be  game  theoretic.  All  players  are  assumed  to  be 
strategic  in  their  interactions — each  player  consid¬ 
ers  the  strategies  of  all  other  players  in  attempting 
to  reach  his  or  her  preferred  outcome.  Analysts 
derive  equilibrium  solutions  to  these  models, 
where  an  equilibrium  is  defined  as  a  set  of  strate¬ 
gies  wherein  each  agent’s  strategy  is  a  best  response 
to  all  other  agents’  strategies,  and  agents’  beliefs 
are  consistent  with  those  strategies. 

The  rational  choice  literature  on  bureaucracy 
has  primarily  focused  on  two,  interrelated,  prob¬ 
lems:  delegation  and  special-interest  influence. 
Models  of  delegation  involve  a  government  grant¬ 
ing  authority  to  a  bureaucratic  agent.  The  crucial 
problem  for  the  government  lies  in  the  fact  that  the 
bureaucracy  may  not  perfectly  share  its  prefer¬ 
ences.  The  government  must,  therefore,  oversee 
bureaucratic  decisions  and  provide  incentives  to 
ensure  that  the  bureaucracy  acts  in  the  desired 
manner.  This  problem  closely  relates  to  the  branch 
of  microeconomics  known  as  principal-agent  (or 
contract)  theory,  which  examines  a  similar  prob¬ 
lem  between  firms  and  their  employees. 

Models  concerned  with  the  influence  of  special 
interests  focus  on  the  interaction  between  the 
bureaucracy  and  the  private  sector.  Private  sector 
agents  may  wish  to  manipulate  the  behavior  of 
bureaucrats.  The  mechanisms  to  do  so  may  involve 
bribery,  appeals  to  bodies  charged  with  bureau¬ 
cratic  oversight,  or  the  provision  of  jobs  to  former 
members  of  the  bureaucracy  (following  the  prin¬ 
ciple  of  the  “revolving  door).  Such  models  may 
also  relate  to  delegation,  as  the  government  may 
wish  to  curtail  special-interest  influence  and  will 
conduct  oversight  accordingly. 


Model  Assumptions 

The  Preferences  of  Bureaucrats 

Since  rational  choice  models  of  bureaucracies 
assume  that  bureaucrats  act  to  achieve  their  most 
preferred  outcomes,  these  models  critically  depend 
on  assumptions  about  what  bureaucrats  prefer. 
Preferences  may  be  assumed  to  be  primitive,  or 
they  may  be  induced.  Primitive  preferences  describe 
outcomes  bureaucrats  inherently  enjoy.  Induced 
preferences  are  defined  by  actions.  Bureaucrats  are 
driven  to  prefer  certain  actions  as,  in  equilibrium, 
these  lead  to  preferable  outcomes. 

The  rational  choice  literature  on  bureaucracies 
has  generally  defined  bureaucrats’  preferences  in 
one  of  three  ways.  William  Niskanen  introduced 
the  assumption  that  bureaucrats  attempt  to  maxi¬ 
mize  their  agencies’  budget  allocations.  This  pref¬ 
erence  is  treated  as  primitive  for  modeling  pur¬ 
poses.  But  Niskanen  argues  that  the  preference  for 
budget  maximization  results  from  an  implicit  the¬ 
ory  of  the  interaction  between  bureaucrats  and  the 
government.  An  individual  bureaucrat  is  presumed 
to  attempt  to  maximize  his  or  her  salary  (both  cur¬ 
rently  and  in  the  future)  and  prestige.  Both  salary 
and  prestige  depend,  in  part,  on  the  budget  of  the 
bureaucrat’s  agency.  Since  the  bureaucrat’s  salary 
and  prestige  are  assumed  to  be  improving  in  the 
budget  allocated  to  the  bureaucrat’s  agency,  the 
bureaucrat  acts  to  increase  this  allocation. 

Other  models  of  bureaucracy  work  with  assump¬ 
tions  regarding  bureaucrats’  primitive  economic 
preferences.  Like  other  economic  agents,  bureau¬ 
crats  attempt  to  maximize  consumption  and  leisure. 
Such  models  may  be  used  to  derive  induced  prefer¬ 
ences  over  actions.  For  instance,  career  concerns 
models  predict  that  bureaucrats  act  to  maximize 
their  reputation  in  time  t  because  this  reputation 
affects  their  career  prospects,  and  thus  consumption, 
in  time  t  +  1.  This  result  is  an  equilibrium  prediction 
from  the  model — an  author  of  a  career  concerns 
model  need  not  assume  that  bureaucrats  have  prim¬ 
itive  preferences  regarding  their  reputation. 

Finally,  many  models  of  bureaucratic  politics 
treat  bureaucrats  (and  governments)  as  having  pref¬ 
erences  over  policy  outcomes — that  is,  ideological 
preferences.  Ideological  preferences  are  generally 
treated  as  primitive.  Individual  bureaucrats  may 
have  inherent  preferences  over  policy  outcomes — a 
reflection  of  their  individual  political  ideology. 
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Alternatively,  ideological  preferences  may  be  treated 
as  induced  by  an  unmodeled  interaction  between 
bureaucrats  and  other  entities.  For  instance,  many 
models  of  the  American  bureaucracy  assume  that 
agency  preferences  reflect  those  of  the  president. 
Because  the  president  is  able  to  appoint  and  remove 
many  agency  heads,  it  is  assumed  that  bureaucrats 
will  behave  as  if  they  share  the  president’s  ideology. 

Information 

The  information  available  or  not  available  to 
players  shapes  their  strategies  to  achieve  their  pre¬ 
ferred  outcomes.  In  models  of  delegation,  it  is 
typical  to  assume  that  bureaucratic  agencies  have 
an  informational  advantage  vis-a-vis  the  govern¬ 
ment.  The  government  delegates  rule-making 
authority  to  agencies  because  bureaucrats  have 
technical  knowledge  and  experience  that  are  not 
available  to  the  average  member  of  government. 
Governments  may  be  less  able  than  bureaucracies 
to  design  policies  to  achieve  desired  outcomes. 

Other  forms  of  informational  asymmetries  may 
also  be  incorporated  into  models  of  the  bureau¬ 
cracy.  For  instance,  if  the  government  cannot  per¬ 
fectly  observe  the  skill  or  ideology  of  potential 
bureaucrats,  it  is  faced  with  a  problem  of  adverse 
selection.  In  models  of  adverse  selection,  the  gov¬ 
ernment  must  conduct  oversight  and  provide 
wages  keeping  in  mind  the  type  of  agents  these 
incentives  will  attract.  For  instance,  if  some  possi¬ 
ble  recruits  are  willing  to  engage  in  corruption, 
while  others  are  not,  sufficiently  low  wages  may 
attract  corruptible  recruits  who  are  better  able  to 
supplement  their  salaries  with  illicit  income. 

If  the  actions  of  bureaucrats  in  office  are  only 
imperfectly  observable,  then  governments  are  faced 
with  a  problem  of  moral  hazard.  It  is  important  to 
note  that  moral  hazard  is  only  an  issue  if  outcomes 
are  also  imperfectly  observable  or  are  imperfectly 
determined  by  bureaucrats’  actions.  If  bureaucrats’ 
actions  perfectly  map  into  observable  outcomes, 
then  the  government  can  accurately  deduce  bureau¬ 
crats’  behavior  based  on  these  outcomes.  However, 
if  this  is  not  the  case,  then  the  government  must 
conduct  oversight  in  a  manner  that  minimizes  the 
costs  of  moral  hazard.  These  costs  arise  because  the 
government  is  not  able  to  optimally  reward  and 
sanction  agent  behavior.  Instead,  it  can  only  respond 
to  outcomes  that  imperfectly  reflect  this  behavior. 


Finally,  all  players  may  lack  information  over 
the  future  composition  of  the  government  due  to 
political  uncertainty.  Members  of  the  current  gov¬ 
ernment  may  be  replaced  by  elections,  coalition 
realignments,  retirement,  or  death.  As  new  indi¬ 
viduals  enter  government,  the  ideological  prefer¬ 
ences  of  the  government  may  change.  All  players  in 
the  model  are  therefore  uncertain  of  future  govern¬ 
ment  policy  preferences. 

Government  and  Bureaucratic  Structure 

The  final  elements  in  a  model  of  bureaucratic 
politics  are  assumptions  about  the  number  and 
composition  of  players.  Models  of  delegation  must 
include,  at  least,  a  unitary  government  and  a  single 
representative  bureaucratic  agent.  Similarly,  mod¬ 
els  of  interest-group  influence  require  at  least  a 
representative  bureaucrat  and  a  single  special- 
interest  group.  But  more  sophisticated  models  may 
incorporate  more  realistic  assumptions  about  the 
number  of  players  and  their  interactions — with 
important  effects  on  equilibrium  outcomes. 

For  instance,  models  may  vary  the  number  of 
players  considered  to  be  in  the  government.  Some 
models  may  incorporate  the  existence  of  multiple 
legislators  or  the  separation  of  powers  between 
branches  of  government.  Such  a  model  is  said  to 
include  multiple  principals. 

Similarly,  bureaucracies  may  be  modeled  as 
consisting  of  multiple  bureaucrats  or  multiple 
agencies.  Models  incorporating  more  than  one 
bureaucrat  are  said  to  feature  multiple  agents. 
Bureaucrats  or  agencies  may  be  redundant — that 
is,  assigned  to  similar  or  identical  tasks. 
Alternatively,  some  agencies  may  be  granted  over¬ 
sight  authority  over  others. 

Finally,  all  models  of  special-interest  influence — 
and  some  models  of  delegation — incorporate  pri¬ 
vate  sector  agents.  These  may  be  one  or  more  firms 
seeking  to  maximize  profits,  one  or  more  citizens 
or  groups  thereof  with  ideological  preferences,  or 
groups  of  consumers.  The  number  and  composi¬ 
tion  of  these  interests  affect  their  interactions  with 
the  government  and  the  bureaucracy. 

Models  of  Delegation 

As  noted  above,  models  of  delegation  involve  a 
government’s  decision  to  grant  authority  to  a 
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bureaucratic  agent,  who  may  not  perfectly  share 
the  government’s  preferences.  These  models  are 
used  to  derive  predictions  about  which  bureaucrats 
will  be  appointed,  how  they  will  perform  in  office, 
what  forms  of  oversight  will  be  employed,  and 
how  much  discretion  they  will  be  granted.  These 
predictions  will  depend,  in  part,  on  assumptions 
regarding  the  preferences  of — and  information 
available  to — all  players,  as  well  as  the  number  and 
composition  of  players  in  the  model. 

Appointment 

Models  of  appointment  attempt  to  deduce 
which  candidates  will  be  selected  for  bureaucratic 
office,  where  candidates  vary  on  some  dimension. 
Typically,  models  of  appointment  assume  that 
candidates  for  bureaucratic  office  have  primitive 
preferences  over  policy,  and  candidates  vary  in 
their  preferred  policies. 

In  most  settings,  models  of  bureaucratic  appoint¬ 
ment  predict  that  governments  will  appoint  officials 
whose  primitive  ideological  preferences  are  most 
proximate  to  the  government’s  own.  This  finding  is 
known  as  the  Ally  principle.  The  logic  of  this  find¬ 
ing  is  straightforward:  If  a  bureaucrat  is  granted 
discretion  to  implement  policy,  his  or  her  policy 
choice  will  be  dictated  by  his  or  her  preferences. 
The  closer  the  bureaucrat’s  ideological  preference 
to  that  of  the  government,  the  more  the  government 
can  expect  to  benefit  from  the  bureaucrat’s  policy 
choice.  This  principle  relates  closely  to  the  literature 
on  attempts  to  politicize  the  bureaucracy. 

However,  the  Ally  principle  may  be  violated 
under  conditions  of  uncertainty.  Assume  now  that 
candidates  for  office  vary  not  only  in  their  ideo¬ 
logical  preferences  but  also  in  their  technical 
knowledge.  More  precisely,  assume  that  some  can¬ 
didates  are  better  informed  of  the  mapping  from 
policies  into  outcomes  than  are  others.  In  this 
instance,  the  government  may  prefer  to  appoint  a 
more  knowledgeable  candidate  with  preferences 
more  removed  from  its  own  over  an  ill-informed 
candidate  with  proximate  preferences.  The  ideo¬ 
logically  proximate  candidate  may  be  unable  to 
ensure  that  the  policies  he  or  she  would  adopt  in 
office  would,  in  expectation,  lead  to  outcomes  that 
the  government  prefers  over  policies  that  the  more 
ideologically  distant — but  more  competent — 
candidate  would  adopt. 


The  Ally  principle  may  also  fail  to  hold  in  set¬ 
tings  where  bureaucratic  appointees  interact  with 
other  players  in  formulating  policies.  For  instance, 
a  government  may  prefer  to  appoint  a  central 
banker  who  is  more  inflation  averse  than  the  gov¬ 
ernment  itself.  This  result  holds  because  policy 
outcomes — here  inflation  and  unemployment — 
reflect  the  interaction  between  the  central  banker 
and  the  public.  Central  bankers  cannot  commit  to 
implement  policies  they  do  not  prefer.  Therefore,  if 
the  central  banker  is  not  inflation  averse,  the  pub¬ 
lic  will  expect  him  or  her  to  attempt  to  generate 
unexpected  inflation — thus  reducing  unemploy¬ 
ment.  These  inflationary  expectations  produce 
inflation — and  no  reduction  in  unemployment — in 
equilibrium.  However,  if  the  central  banker  is 
inflation  averse,  the  public  will  expect  inflation  to 
remain  low — a  more  favorable  outcome  for  the 
government. 

Bureaucratic  Slack 

A  second  major  concern  of  models  of  delegation 
is  the  danger  of  bureaucratic  slack.  Bureaucrats  are 
said  to  be  shirking — yielding  slack — when  they 
exert  little  or  no  effort  in  their  duties.  Slack  is 
typically  a  feature  of  models  wherein  bureaucrats 
are  assumed  to  have  economic  preferences — they 
maximize  consumption  and  leisure.  Slack  thus 
occurs  when  bureaucrats  face  insufficient  incen¬ 
tives  to  forgo  leisure. 

Questions  of  bureaucratic  effort  are  intimately 
linked  to  the  power  of  the  incentives  bureaucrats 
are  offered.  Incentives  are  said  to  be  high  powered 
when  rewards  and  punishments  are  heavily  condi¬ 
tioned  on  outcomes.  They  are  low  poivered  when 
rewards  and  punishments  are  largely  independent 
of  outcomes.  Low-powered  incentives  tend  to 
increase  slack. 

The  degree  of  bureaucratic  slack  will  also  depend, 
in  part,  on  the  structure  of  information  in  the 
model.  In  particular,  moral  hazard  will  tend  to 
increase  shirking.  The  rationale  behind  this  result  is 
intuitive:  If  the  government  is  unable  to  determine 
the  degree  of  effort  exerted  by  bureaucrats,  it  will  be 
difficult  to  reward  or  punish  this  effort.  It  is  there¬ 
fore  difficult  to  encourage  bureaucrats  not  to  shirk. 

Slack  will  also  be  affected  by  the  number  of 
players  with  authority  over  the  bureaucracy. 
Models  with  multiple  principals  will  tend  to  have 
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lower-powered  incentives  and  greater  slack  than 
models  with  a  single  principal.  Slack  will  also  tend 
to  increase  as  the  interests  of  the  principals  diverge. 
So,  for  instance,  one  might  expect  greater  bureau¬ 
cratic  slack  to  occur  when  powers  are  separated 
between  branches  of  government  and  when  differ¬ 
ent  parties  dominate  each  branch. 

The  presence  of  multiple  government  principals 
may  produce  bureaucratic  slack  for  one  of  two 
reasons.  First,  multiple  principals  may  all  try  to 
induce  bureaucrats  to  behave  in  their  preferred 
manner.  If  principals  have  different  interests,  these 
incentives  will  tend  to  conflict,  offsetting  one 
another.  This  type  of  model  is  most  likely  to  make 
sense  when  depicting  a  lower  level  bureaucrat  with 
multiple  overseers. 

Alternatively,  multiple  agents  within  the  gov¬ 
ernment  (e.g.,  legislators)  may  need  to  coordinate 
to  punish  or  reward  a  bureaucrat  for  his  or  her 
performance.  As  the  number  of  legislators  with 
conflicting  interests  rises,  it  grows  less  likely  that  a 
reward  or  punishment  will  be  agreed  on. 

Slack  will  also  be  affected  by  the  presence  of 
multiple  bureaucratic  agents  in  a  model. 
Redundancy — the  assignment  of  an  identical  task 
to  more  than  one  bureaucrat — will  tend  to  increase 
slack  when  bureaucrats  only  care  whether  one 
member  of  a  team  performs  a  task  successfully. 
Such  a  situation  may  result  if,  for  instance,  multi¬ 
ple  police  officers  are  assigned  to  a  given  case  and 
all  share  credit  if  the  case  is  successfully  solved.  In 
such  a  situation,  each  officer  prefers  to  shirk  while 
the  others  work  on  the  case.  He  or  she  then  might 
enjoy  leisure  time  and  benefit  from  the  rest  of  the 
team’s  efforts.  Since  all  officers  have  the  same  pref¬ 
erence  for  slacking,  no  officer  will  exert  the  opti¬ 
mal  amount  of  effort  in  solving  the  case.  As  a 
result  of  this  problem,  the  probability  that  at  least 
one  member  of  the  team  solves  the  case  may  not 
rise  with  the  number  of  officers  assigned. 

However,  in  situations  characterized  by  moral 
hazard,  redundancy  may  reduce  slack.  In  such  a 
situation,  the  government  may  compare  the  per¬ 
formance  of  one  bureaucrat  or  agency  with  that  of 
other  bureaucrats  or  agencies  charged  with  the 
same  task.  This  comparison  will  allow  the  govern¬ 
ment  to  deduce  better  the  level  of  effort  exerted  by 
the  agency  and  reward  or  punish  it  accordingly. 

Low-level  bureaucrats  may  also  slack  in  models 
where  the  bureaucracy  assumes  a  tiered  structure, 


such  that  some  agents  are  given  oversight  author¬ 
ity  over  others.  In  such  a  system,  low-level  bureau¬ 
crats  will  tend  to  face  low-powered  incentives. 
Were  they  to  face  high-powered  incentives,  low- 
level  bureaucrats  may  attempt  to  bribe  their  over¬ 
seers  to  ignore  poor  performance.  As  the  power  of 
their  incentives  increases,  low-level  bureaucrats 
have  greater  reason  to  collude  with  their  overseers 
and  are  thus  willing  to  pay  larger  bribes.  The  gov¬ 
ernment  will  thus  optimally  reduce  the  power  of 
incentives  faced  by  low-level  bureaucrats  so  as  to 
deter  such  collusion. 

Bureaucratic  Drift 

Bureaucratic  drift  refers  to  the  phenomenon 
wherein  bureaucrats  adopt  policies  that  differ  from 
those  preferred  by  the  executive  or  legislature. 
Drift  is  therefore  typically  a  concern  in  models  in 
which  bureaucrats  have  preferences  over  policy. 
Drift  results  when  bureaucrats  are  given  discretion 
to  implement  their  preferred  policy  outcomes. 

Drift  is  typically  controlled  through  one  of  two 
mechanisms:  ex  ante  limits  on  bureaucratic  discre¬ 
tion  or  ex  post  review  of  bureaucratic  decisions. 
The  former  refers  to  legislative  limits  that  prohibit 
certain  policy  positions  from  ever  being  adopted. 
The  latter  refers  to  the  ability  of  other  government 
entities  (the  legislature,  the  courts,  etc.)  to  review 
and  overturn  bureaucratic  decisions.  These  two 
mechanisms  act  as  substitutes  in  preventing  bureau¬ 
cratic  drift — as  ex  post  review  becomes  more  dif¬ 
ficult,  ex  ante  discretion  is  likely  to  decline. 

The  extent  of  bureaucratic  drift  will  be  affected 
by  the  extent  of  the  informational  asymmetry 
between  the  bureaucracy  and  the  government.  The 
more  technically  knowledgeable  the  bureaucracy, 
relative  to  the  government,  the  fewer  ex  ante  limits 
on  discretion.  As  levels  of  discretion  expand,  the 
bureaucracy  has  greater  latitude  to  implement  its 
own  ideological  preferences. 

Adverse  selection  may  also  affect  the  degree  of 
policy  drift.  A  provocative  recent  paper  demon¬ 
strates  that  the  degree  of  policy  discretion  afforded 
to  bureaucrats  may  affect  their  willingness  to  remain 
in  office  and  to  acquire  policy  relevant  expertise. 
Policy-motivated  bureaucrats  will  be  more  likely  to 
invest  in  acquiring  expertise  if  they  are  granted  dis¬ 
cretion.  To  the  extent  that  discretion  is  limited, 
policy-motivated  bureaucrats  will  value  office  and 
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expertise  less.  Such  officials  will  be  less  willing  to 
exert  effort  to  arrive  at  correct  decisions  when  dis¬ 
cretion  is  limited.  In  other  words,  policy  drift  may 
be  a  necessary  cost  for  an  informed  bureaucracy. 

The  number  of  government  principals  may 
also  play  a  role  in  bureaucratic  drift.  If  these  prin¬ 
cipals  must  coordinate  to  overturn  bureaucratic 
decisions — as  in  a  legislature — their  ability  to 
exercise  ex  post  review  may  be  limited.  The 
greater  the  number  of  principals  and  the  more 
conflicted  their  interests,  the  less  likely  ex  post 
review  will  be  exercised.  To  combat  the  resultant 
danger  of  drift,  ex  ante  restrictions  on  bureau¬ 
cratic  discretion  will  tend  to  rise. 

Finally,  the  threat  of  changes  to  the  governing 
coalition  may  affect  drift.  Suppose  that  the  govern¬ 
ment  at  time  t  fears  that  it  will  be  replaced  by  an 
alternative  government  with  divergent  preferences 
in  time  t  +  1.  To  the  extent  possible,  such  a  govern¬ 
ment  prefers  to  lock  in  current  policy  choices, 
limiting  the  policy  discretion  of  successor  govern¬ 
ments.  One  means  of  so  limiting  policy  discretion 
is  to  adopt  administrative  procedures  that  limit  the 
oversight  authority  of  future  governments.  Since 
future  governments  may  use  their  oversight  pow¬ 
ers  to  force  through  undesirable  policy  changes, 
the  government  at  time  t  will  seek  to  insulate  the 
bureaucracy  from  such  oversight. 

Models  of  Special  Interests 

Models  of  the  interaction  between  special  interests 
and  the  bureaucracy  are  centrally  concerned  with 
when,  and  whether,  special  interests  will  be  able  to 
influence  bureaucratic  decisions.  Influence  may 
emerge  through  a  number  of  mechanisms:  bribery, 
offers  of  future  employment,  and  appeals  to  the 
government  to  review  bureaucratic  decisions. 
Influence  might  also  be  exercised  through  more 
subtle  mechanisms — for  instance,  through  threat¬ 
ening  appeals  to  overseers.  Special-interest  influ¬ 
ence  may  affect  the  agency  relationship  between 
the  government  and  the  bureaucracy  and  the  deci¬ 
sion  of  whether  or  not  to  delegate  authority  to 
bureaucrats  in  the  first  place. 

Capture 

A  particular  concern  of  models  of  the  interac¬ 
tion  between  special  interests  and  the  bureaucracy 


is  whether  bureaucrats — most  particularly  regula¬ 
tors — will  collude  with  the  citizens  or  the  industry 
they  are  meant  to  regulate.  This  collusion  is  com¬ 
monly  termed  capture.  In  models  of  bureaucratic 
politics,  capture  is  typically  modeled  using  the 
assumption  that  bureaucrats  have  primitive  eco¬ 
nomic  preferences  and/or  are  influenced  by  career 
concerns.  This  assumption  ensures  that  bureau¬ 
crats  may  be  motivated  by  bribes  or  the  promise  of 
future  employment  in  the  regulated  sector. 

When  models  of  capture  incorporate  the  agency 
relationship  between  the  bureaucracy  and  govern¬ 
ment,  they  assume  that  the  government  is  faced 
with  the  problem  of  moral  hazard.  If  the  govern¬ 
ment  had  perfect  information  regarding  the  actions 
of  the  bureaucracy,  it  would  be  able  to  observe, 
capture,  and  punish  the  offenders  directly. 

Capture  may  be  deterred  by  legal  institutions 
that  investigate  the  bureaucracy.  All  else  being 
equal,  bureaucrats  will  be  less  willing  to  accept 
bribes  as  the  risk  of  detection  and  punishment 
rises.  Capture  may  also  be  deterred  by  the  payment 
of  an  efficiency  wage — a  wage  above  what  bureau¬ 
crats  would  receive  from  alternative  employment. 
The  efficiency  wage  raises  the  costs  to  the  bureau¬ 
crat  of  being  removed  from  office  and  thus  deters 
malfeasance. 

Appeals  to  Overseers 

Special  interests  need  not  resort  to  bribery  to 
influence  the  bureaucracy.  They  may  be  able  to 
influence  bureaucratic  decision  making  simply 
through  the  use — or  threatened  use — of  oversight 
mechanisms.  The  threat  of  appeals  to  the  govern¬ 
ment  or  to  the  courts  may  be  sufficient  to  dissuade 
the  bureaucracy  from  performing  its  anointed 
tasks. 

It  has  long  been  recognized  that  special  interests 
may  play  a  role  in  overseeing  bureaucratic  perfor¬ 
mance.  Interested  members  of  the  public  may 
review  bureaucratic  performance  and  report  mal¬ 
feasance  or  slacking  to  the  government.  This  form 
of  oversight  is  frequently  termed  fire  alarm  over¬ 
sight  and  has  the  advantage  of  being  costless  to  the 
government. 

However,  private  sectors  entities  may  be  self- 
interested  in  their  decisions  to  sound  fire  alarms.  A 
fire  alarm  oversight  may  therefore  result  in  biases  in 
favor  of  special  interests.  For  instance,  an  interest 
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group  may  only  report  bureaucratic  malfeasance  if 
bureaucrats  act  in  a  manner  that  harms  that  group’s 
interests.  Bureaucrats  may  thus  be  tempted  to  favor 
special  interests  to  avoid  review.  A  government 
interested  in  preventing  interest-group  influence 
may  therefore  need  to  rely  on  active  investigations 
into  bureaucratic  decisions  in  place  of,  or  in  addi¬ 
tion  to,  fire  alarm  oversight.  Such  investigations  are 
termed  police  patrol  oversight. 

It  is  possible  that  special  interests  may  exercise 
influence  without  resorting  to  the  oversight  pro¬ 
cess.  The  mere  threat  of  appeals  to  the  legislature  or 
courts  may  prove  sufficient  to  deter  regulators.  One 
model  of  such  deterrence  assumes  that  an  interest 
group  can  signal  its  willingness  to  appeal  bureau¬ 
cratic  decisions  by  making  campaign  contributions 
to  legislators.  These  contributions  act  as  costly — 
and  informative — signals  of  the  interest  group’s 
willingness  to  contest  bureaucratic  decisions.  The 
interest  group  is  thus  less  likely  to  be  regulated  as  it 
contributes  more  to  political  campaigns. 
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Bureaucracy,  Street-Level 


Throughout  the  world,  legislators  and  govern¬ 
ment  administrators  have  increasingly  called  for 
greater  accountability  and  improved  performance 
in  the  delivery  of  public  services,  reflecting  in  part 
the  continuing  influence  of  the  theory  of  new  pub¬ 
lic  management  that  stresses  more  market-based 
approaches  to  service  delivery  and  increased 
responsiveness  to  citizens  and  communities.  The 
worldwide  financial  crisis  has  further  intensified 
the  pressure  on  government  to  improve  the  effi¬ 
ciency  and  effectiveness  of  services. 

The  performance  and  accountability  movement 
puts  the  spotlight  on  the  central  role  of  street-level 
bureaucrats  in  the  development  and  implementa¬ 
tion  of  public  services.  As  defined  by  Michael 
Lipsky  in  1980,  street-level  bureaucrats  comprise 
workers  who  interact  directly  with  citizens  by  pro¬ 
viding  a  valued  public  service;  they  include  teach¬ 
ers,  police  officers,  case  workers,  physicians,  men¬ 
tal  health  counselors,  and  home  health  workers, 
and  they  typically  possess  a  substantial  degree  of 
discretion  over  their  work.  Importantly  though, 
this  discretion  is  bounded  by  the  restrictions  on 
individual  choice  placed  on  street-level  bureaucrats 
by  factors  common  in  public  and  nonprofit  agen¬ 
cies,  including  resource  constraints,  difficult  work¬ 
ing  conditions,  and  vague  and  sometimes  conflict¬ 
ing  expectations  on  performance.  To  cope  with 
these  work-related  challenges,  street-level  bureau¬ 
crats  can  sometimes  reach  decisions  that  are  at 
variance  with  the  rules  and  procedures  of  the 
agency.  Thus,  the  discretion  of  street-level  bureau¬ 
crats  can  result  in  policy  implementation  at  odds 
with  the  goals  and  priorities  of  top  administrators, 
legislators,  and  the  citizenry. 

Historically,  street-level  workers  occupied  key 
positions  in  public  agencies.  Indeed,  the  growth  of 
the  welfare  state  in  advanced  industrial  countries 
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in  the  20th  century  typically  entailed  substantial 
expansion  in  the  ranks  of  public  sector  workers, 
especially  in  regions  such  as  the  United  Kingdom 
(UK)  and  the  Scandinavian  countries.  However,  in 
some  countries  such  as  Germany  and  the 
Netherlands,  governments  worked  in  close  alliance 
with  large  nonprofit  organizations  who  provided 
social  and  health  services,  staffed  by  countless 
numbers  of  street-level  workers.  The  ranks  of 
street-levels  workers  rose  sharply  around  the  world 
in  the  1960s  and  1970s  with  the  expansion  of  the 
welfare  state  in  policy  areas  such  as  health  care, 
social  services,  criminal  justice,  and  education. 

Many  of  these  programs  in  the  United  States 
and  abroad  encountered  serious  implementation 
problems  prompting  a  subsequent  reassessment  of 
public  services  and  the  role  of  street-level  bureau¬ 
crats  in  service  provision.  As  a  result,  street-level 
practice  has  changed  profoundly  in  the  past  few 
decades.  First,  street-level  bureaucrats  in  social  and 
health  care  are  much  more  likely  to  work  in  a  non¬ 
profit  or  for-profit  service  agency  due  to  a  sharp 
rise  in  government  contracting  for  services  with 
these  agencies.  Indeed,  the  expansion  of  services 
such  as  home  care,  community  care  for  the  dis¬ 
abled  and  elderly,  substance  abuse  treatment,  and 
workforce  development  has  been  almost  entirely 
through  government  contracting  with  private  non¬ 
profit  and  for-profit  agencies.  While  this  increase 
in  contracting  is  particularly  pronounced  in  the 
United  States,  it  is  also  evident  in  many  other 
countries,  including  the  UK,  Australia,  New 
Zealand,  and  Germany. 

The  growth  of  contracting  has  been  accompa¬ 
nied  by  new  performance  management  strategies, 
including  performance  contracting,  intensified 
testing,  more  rigorous  practice  standards,  and  out¬ 
come  evaluation.  Overall,  these  strategies  are  often 
an  attempt  to  control  the  discretion  and  autonomy 
of  street-level  workers.  For  instance,  performance 
contracts  in  welfare-to-work  programs  include 
specific  expectations  for  street-level  workers  on 
the  target  program  goals  and  sanctions  for  the 
agency  if  these  goals  are  not  attained.  Similarly, 
policies  promoting  “evidence-based  practice”  seek 
to  enhance  the  extent  to  which  street-level  workers 
such  as  mental  health  counselors  or  child  welfare 
case  workers  adhere  to  professionally  recognized 
standards  of  best  practice.  Recurrent  fiscal  crises 
of  government  have  also  forced  public  and  private 


agencies  to  implement  new  policies  to  increase  the 
productivity  of  street-level  workers,  with  corre¬ 
sponding  limitations  on  their  ability  to  depart 
from  established  practice  standards. 

Importantly,  the  emphasis  on  productivity  by 
street-level  workers  is  also  a  ripple  effect  of  intensi¬ 
fied  competition  among  service  agencies.  The  par¬ 
adigmatic  street-level  bureaucrat  of  earlier  eras 
possessed  autonomy  and  discretion  and  was  largely 
immune  from  market  pressures;  teachers,  child 
welfare  workers,  and  physicians  certainly  fit  this 
model  well.  Currently,  though,  private  nonprofit 
and  for-profit  agencies  in  many  countries  and 
jurisdictions  actively  compete  with  each  other  for 
government  grants  and  contracts  as  well  as  private 
fee  income  (and  in  the  case  of  nonprofit  agencies, 
charitable  donations).  The  degree  of  competition, 
though,  varies  tremendously.  Rural  areas,  for 
example,  may  lack  a  sufficient  supply  of  providers 
to  create  a  competitive  market.  Government  rules 
and  regulations  also  may  affect  the  supply  of  pro¬ 
viders.  Nonetheless,  the  shift  toward  more  compe¬ 
tition  in  services  is  a  worldwide  trend  that  is  evi¬ 
dent  even  in  countries  with  comprehensive  welfare 
state  policies  and  long-standing  traditions  of  pri¬ 
vate  agency  and  professional  autonomy. 

Competition  in  services  is  also  fostered  by  the 
growing  trend  for  more  individual  choice  and  con¬ 
trol  in  service  delivery.  The  classic  street-level 
bureaucrat  working  in  a  large  public  or  nonprofit 
agency  existed  within  a  top-down  hierarchical 
authority  system  where  individual  clients  had  little 
control  or  choice  in  the  street-level  workers  or 
agencies  from  whom  they  received  service.  In  this 
role,  street-level  bureaucrats  played  a  key  role  in 
constructing  the  identity  of  service  recipients  as 
well  as  their  access  to  services  and  their  very  under¬ 
standing  of  their  service  rights  and  expectations. 

Changes  in  the  Bureaucrat-Client 
Relationship 

Two  developments  are  changing  this  street-level 
bureaucrat-client  relationship,  although  national, 
regional,  and  local  practices  vary  tremendously. 
First,  governments  are  employing  a  more  diverse 
set  of  policy  tools  to  address  public  problems, 
including  vouchers  and  quasi  vouchers  and  tax 
credits  and  deductions.  Vouchers  in  child  care, 
education,  housing,  and  substance  abuse  treatment 
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can  be  a  more  empowering  way  to  engage  clients 
in  key  decisions  regarding  their  public  services.  For 
instance,  in  community  care  services  for  the  dis¬ 
abled,  individual  client  accounts  have  been  estab¬ 
lished  by  government,  allowing  the  disabled  indi¬ 
vidual  to  purchase  needed  services  directly,  rather 
than  relying  on  street-level  workers.  The  wide¬ 
spread  use  of  vouchers  and  quasi  vouchers  and 
different  forms  of  payments  to  private  agencies  has 
restructured  the  contracting  relationship  so  gov¬ 
ernments  have  shifted  away  from  direct  contracts 
with  an  agency  to  funding  support  that  is  tied  to 
the  individual  citizen  users.  This  funding  relation¬ 
ship  is  inherently  more  unstable  from  the  contract 
agency’s  perspective  and  puts  further  competitive 
pressure  on  the  private  service  agency  and  its 
frontline  workers. 

The  diversification  of  policy  tools  is  also  indica¬ 
tive  of  a  second  trend:  broad  interest  and  support 
for  individual  choice  and  control  in  service  deci¬ 
sions,  evident  in  vouchers;  tax  credits  for  services 
such  as  child  care;  government-subsidized  client 
accounts  to  purchase  needed  services;  and  regula¬ 
tions  requiring  or  promoting  greater  choice.  Over 
time,  this  shift  is  likely  to  transform  the  work  of 
street-level  bureaucrats  who  will  need  to  be  respon¬ 
sive  to  client  goals  and  priorities  in  order  to  com¬ 
pete  effectively  for  client  service  dollars. 

Greater  client  choice  reflects  growing  world¬ 
wide  interest  in  “coproduction,”  or  the  idea  that 
important  services  such  as  mental  health  or  com¬ 
munity  care  are  coproduced  between  the  street- 
level  worker  and  individual  clients;  thus,  street- 
level  workers  need  to  pay  heed  to  client  motiva¬ 
tion  and  goals  in  order  to  achieve  successful  pro¬ 
grammatic  outcomes  (John  Alford,  2009). 
Coproduction  is  also  related  to  the  concept  of 
“backward  mapping,”  advanced  by  Richard 
Elmore  (1979/1980),  who  argued  that  policymak¬ 
ers  should  begin  the  formulation  of  policy  by 
examining  the  interactions  between  clients  and 
street-level  bureaucrats.  With  this  information, 
policymakers  should  be  able  to  design  more  effec¬ 
tive  programs. 

In  recent  years,  the  coproduction  approach  has 
broadened  to  incorporate  more  active  engagement 
of  clients  and  community  organizations  in  the 
work  of  street-level  bureaucrats.  Indeed,  street- 
level  bureaucrats  may  need  to  work  with  several 
different  public  and  private  organizations  on 


behalf  of  an  individual  client.  Many  service  pro¬ 
grams  are  designed  at  their  inception  as  public- 
private  partnerships  with  hybrid  governance  struc¬ 
tures  representing  an  array  of  different  local  inter¬ 
ests  and  organizations.  Thus,  the  governance  of 
street-level  work  is  vastly  more  complicated  today 
than  in  the  past,  with  street-level  bureaucrats  often 
engaged  with  multiple  internal  and  external  stake¬ 
holders  in  vertical  and  horizontal  relationships. 
Consequently,  sound  policy  design  is  even  more 
important  to  successful  policy  implementation 
today  than  in  the  past. 

This  evolving  role  of  street-level  bureaucrats  has 
profound  implications  for  citizenship.  Teachers, 
police,  case  workers,  and  emergency  shelter  staff, 
to  name  a  few,  provide  valuable  public  services 
that  may  determine  the  life  chances  of  individual 
citizens.  As  the  welfare  state  grew,  so  did  the  ranks 
of  street-level  bureaucrats  and  their  importance  in 
providing  social  benefits  such  as  health  care  and 
education  essential  to  full  citizenship.  Yet  street- 
level  bureaucrats  can  also  directly  affect  the  politi¬ 
cal  participation  and  civic  engagement  of  citizens, 
so  street-level  workers  also  can  significantly  shape 
political  rights  as  well. 

The  capacity  of  street-level  bureaucrats  to 
engage  in  effective  coproduction — and  effective 
services  in  general — may  be  severely  limited  by  the 
intensifying  market  and  accountability  pressures 
facing  public  and  private  service  agencies.  Quality 
street-level  practice  requires  an  ongoing  invest¬ 
ment  by  government  in  training  and  education.  To 
the  extent  that  street-level  workers  are  engaged  in 
contracting  and  partnerships,  new  skill  develop¬ 
ment  is  often  essential.  Moreover,  the  investment 
obligations  of  government  are  not  limited  to  sup¬ 
port  for  public  sector  workers;  the  staff  and  volun¬ 
teers  of  private  nonprofit  and  for-profit  agencies 
receiving  government  contracts  require  ongoing 
technical  assistance  and  capacity  building  to  pro¬ 
vide  responsive,  effective  services.  In  an  era  of 
scarce  resources  for  public  services,  street-level 
bureaucrats  will  need  to  work  actively  with  allies 
in  the  public  and  private  sectors  to  create  advo¬ 
cates  for  the  value  and  key  importance  of  public 
services  for  all  citizens. 

Steven  Ratbgeb  Smith 
Georgetown  University 
Washington,  D.C.,  United  States 
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Cabinets 


Cabinets  are  small  organizational  units  within 
governments  that  consist  of  politicians  who  are 
responsible  for  the  overall  policy  performance  of 
the  government.  This  entry  first  describes  the  his¬ 
tory  of  cabinets  and  their  functions  under  various 
forms  of  government.  It  then  discusses  a  frame¬ 
work  for  the  analysis  of  cabinets,  which  must 
consider  the  structural  characteristics  of  the  polit¬ 
ical  system,  the  roles  of  political  parties  and  of 
individual  ministers,  the  behavior  of  the  prime 
minister,  and  the  power  of  the  prime  minister’s 
office.  Last,  the  entry  describes  important  tasks 
for  future  research  on  cabinets  worldwide. 

Historical  Origin  and  Functions 

The  notion  of  a  cabinet  derives  from  the  designa¬ 
tion  “Cabinet  Council,”  first  given  in  Britain  by 
the  king  in  1622.  The  monarch  counted  on  this 
inner  core  of  privy  counselors  to  legitimize  his  or 
her  political  decisions  and  thereby  his  or  her  dura¬ 
tion  in  office.  With  the  emergence  of  the  modern 
state  in  the  19th  century,  monarchs  lost  power, 
and  newly  developed  political  parties  subsequently 
replaced  their  authority.  As  stressed  by  John 
Mackintosh  (1968),  after  these  mass  parties  had 
come  into  existence  in  the  1860s  and  1870s,  the 
contemporary  structure  of  the  modern  cabinet  sys¬ 
tem  became  visible.  Today,  a  cabinet  is  a  small 
group  of  politicians  (mostly  ministers)  responsible 
for  political  decision  making  in  countries  that  have 


a  form  of  government  known  as  cabinet  govern¬ 
ment  (i.e.,  in  Western  and  Central  Eastern  Europe 
and  much  of  the  Balkans;  in  many  Commonwealth 
countries  such  as  Canada,  Australia,  New  Zealand, 
India,  Malaysia,  and  Singapore;  in  most  ex-British 
Caribbean  and  Pacific  Islands;  in  Japan;  and  in 
Israel). 

Cabinets  may  also  exist  in  constitutional  presi¬ 
dential  systems  (typically  in  the  Americas),  in 
authoritarian  presidential  systems  (e.g.,  in  Africa 
and  in  parts  of  Asia,  including  the  Middle  East),  or 
in  traditional  monarchical  systems  (e.g.,  in  the 
Middle  East).  In  these  political  systems,  however, 
members  of  the  cabinet  serve  purely  as  advisors  to 
the  president  or  the  monarch.  They  are  not — as  in 
cabinet  government  systems — dependent  on  the 
majority  support  of  parliament  for  their  duration 
in  office  or  for  getting  legislative  proposals 
approved. 

In  cabinet  governments,  individual  members 
hold  a  collective  responsibility,  which  stipulates 
that  all  cabinet  members  are  bound  to  cabinet 
decisions.  They  may  disagree  with  these  decisions 
in  private  but  must  agree  in  public.  As  chairman  of 
the  cabinet,  the  prime  minister  wields  a  power  that 
is  generally  seen  as  superior  to  the  other  cabinet 
members  (“first  among  equals”).  The  cabinet 
members,  at  least  its  chair,  are  responsible  to  par¬ 
liament  and  are  expected  to  control  the  executive 
and  the  legislative  branches. 

Such  cabinets  can  function  satisfactorily  only  if 
an  equilibrium  is  found  between  two  opposite 
requirements — those  of  representativeness  and  of 
efficiency.  As  Jeffrey  Cohen  (1988)  notes,  the 
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requirement  of  representativeness  means  that  deci¬ 
sion  making  is  in  the  hands  of  the  whole  cabinet 
and  not,  as,  for  instance,  in  the  presidential  system, 
in  the  hands  of  the  leader  of  the  government  alone. 
A  cabinet  government  has  therefore  to  be  “collec¬ 
tive”:  Decisions  are  taken  “together.”  This  require¬ 
ment  of  “togetherness”  is  regarded  as  central  to  the 
cabinet  system  because  such  a  mode  of  operation  is 
felt  to  be  a  more  liberal,  more  democratic,  and 
therefore  a  superior  form  of  decision  making. 

Yet  this  representative-cum-collective  principle 
runs  directly  against  the  requirement  of  efficiency, 
as  efficiency  entails  that  decisions  be  taken  speed¬ 
ily  despite  the  continuous  increase  in  the  number 
and  complexity  of  these  decisions.  Since  represen¬ 
tativeness  entails  togetherness,  decisions  are  likely 
to  be  taken  only  after  substantial  discussions.  This 
may  mean  delays  in  elaborating  compromises  or 
even,  in  cases  of  extreme  conflict,  the  fall  of  the 
cabinet.  Bernard  Grofman  and  Peter  van  Roozen- 
daal  (1997)  note  that  because  the  fall  of  a  cabinet 
is  mostly  associated  with  political  crises,  cabinet 
stability  becomes  the  key  test  of  the  success  of  all 
cabinet  systems  and  the  number-one  indicator  of 
the  performance  of  cabinets. 

Framework  for  Analysis 

In  the  contemporary  literature,  such  as  the  work 
of  Jean  Blondel  and  Ferdinand  Miiller-Rommel 
(1997,  2001),  cabinets  are  analyzed  in  the  con¬ 
text  of  the  constitutional  setting,  of  the  cabinet 
structure,  of  the  external  and  internal  life  of  cabi¬ 
nets,  and  of  the  decision  making  among  their 
members. 

Constitutional  Setting 

The  general  setting  of  cabinets  is  provided  in 
broad  terms  by  constitutions.  Although  constitu¬ 
tions  usually  give  only  a  little  attention  to  the 
organization  of  cabinets,  they  typically  devote 
more  space  to  matters  of  appointment  and  dis¬ 
missal  of  cabinet  ministers,  in  particular  to  the  role 
of  the  head  of  state  in  this  respect.  Constitutions 
may,  for  instance,  give  the  head  of  state  the  power 
to  participate  in  the  formation  of  cabinets  (as  in 
France,  Finland,  Belgium,  Bulgaria,  and  the  Baltic 
States  and  in  Poland  until  1993).  Constitutions 
may  also  foster  cabinet  stability,  for  instance,  by 


making  it  relatively  difficult  for  parliaments  to 
pass  “no  confidence”  motions  against  the  prime 
minister  or  other  ministers.  Future  research  needs 
to  analyze  more  systematically  the  constitutional 
powers  of  presidents,  prime  ministers,  and  minis¬ 
ters  in  different  cabinet  systems. 

Internal  Operation  of  Cabinets 

The  cabinet  structure  is  foremost  determined  by 
its  size  and  by  its  organizational  subunits.  Most 
cabinet  governments  tend  to  have  around  20 
members  on  average.  Because  these  cabinets  are 
too  large  to  be  fully  involved  in  all  aspects  of  deci¬ 
sion  making,  cabinet  meetings  must  be  well  struc¬ 
tured  and  well  prepared.  In  this  context,  prime 
ministers,  cabinet  committees,  and  cabinet  secre¬ 
tariats  play  an  important  role.  Although  the  power 
of  prime  ministers  varies  significantly  across  coun¬ 
tries,  they  are  usually  more  influential  in  cabinets 
based  on  a  majority  party  than  in  coalition  cabi¬ 
nets.  Cabinet  committees  have  been  set  up  in  many 
cabinets  to  speed  up  the  decision-making  process. 
These  committees  are  different  in  size  and  status, 
but  they  usually  include  relevant  ministers,  junior 
ministers,  and  civil  servants  who  prepare  filtered 
suggestions  for  cabinet  decision.  Some  committees 
are  composed  only  of  civil  servants  who  aim  to 
iron  out  problems  that  might  occur  among  the  dif¬ 
ferent  governmental  departments.  Cabinet  secre¬ 
tariats  or  prime  minister’s  offices,  on  the  other 
hand,  ensure  efficiency  in  cabinet  decision  making. 
They  control  the  flow  of  proposals  coming  to  and 
emanating  from  cabinet  meetings  and  resolve 
problems  over  proposed  agenda  items  in  advance 
of  the  cabinet  meeting.  Studies  conducted  in  parlia¬ 
mentary  democracies  have  shown  that  the  impact 
of  prime  ministerial  staffs  on  cabinet  government 
can  be  excessive.  The  functions  of  these  staffs 
range  between  an  administrative  and  a  political 
pole.  These  officials  act  administratively  when 
organizing  meetings  of  the  cabinet  and  the  flow  of 
business  between  prime  minister  and  ministers. 
They  also  play  a  part  in  the  development  of  long¬ 
term  ideas  about  cabinet  activities  and  on  the 
improvement  of  cabinet  decision  making,  but  these 
forms  of  activities  are  never  wholly  administrative. 
In  most  cases,  they  affect  the  political  operation  of 
the  cabinet.  Future  research  must  therefore  con¬ 
sider,  in  a  systematic  and  comparative  manner,  not 
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just  personnel  and  budget  but  also  the  actual  tasks 
of  the  prime  ministers’  offices. 

Impact  of  External  Actors  on  Cabinets 

The  operation  of  cabinets  is  strongly  affected  by 
political  parties,  parliaments,  bureaucracy,  and 
other  external  actors.  Political  parties  clearly  have 
an  impact  on  the  formation,  the  composition,  and 
the  development  of  cabinets.  This  impact  depends 
on  the  overall  number  of  parties  in  the  cabinet 
(single-party  vs.  multiparty  government)  and  the 
type  of  cabinet  coalition  (e.g.,  with  or  without  a 
dominant  party,  minority  vs.  majority  party  gov¬ 
ernments).  There  is  usually  a  stronger  battle 
between  cabinets  and  parties  in  multiparty  coali¬ 
tions  than  in  one-party  majority  governments. 

The  impact  of  parliament,  civil  servants,  and 
other  political  groups  on  cabinets  is  closely  tied  to 
the  role  of  parties  and  to  the  character  of  coalitions. 
In  some  countries,  parliaments  have  substantial 
power;  for  example,  they  may  be  able  to  prevent 
political  parties  from  revolting  or  erupting.  This 
appears  to  be  particularly  true  in  coalition  govern¬ 
ments.  The  impact  of  civil  servants,  on  the  other 
hand,  depends  on  their  position  within  the  admin¬ 
istration.  The  higher  civil  servants  are  ranked,  and 
the  closer  they  are  to  the  ministers,  the  greater  their 
impact  on  the  preparation  of  cabinet  decisions.  The 
role  of  other  external  groups  in  the  life  of  the  cabi¬ 
net  can  be  seen  as  similar  to  that  of  political  parties. 
The  most  important  interest  groups  are  usually 
linked  to  specific  parties  and  thereby  exercise  an 
indirect  impact  on  cabinet  proposals. 

Decision  Making 

Cabinet  decision  making  takes  place  in  the  con¬ 
text  of  a  conflict  among  many  patterns  of  relation¬ 
ship.  There  are,  for  instance,  differences  in  the 
extent  to  which  cabinet  meetings  are  genuinely 
involved  in  decision  making  rather  than  simply 
ratifying  proposals  made  by  other  bodies.  The  effi¬ 
ciency  and  effectiveness  of  cabinet  decision  making 
to  a  large  extent  depend  on  the  ministerial  experi¬ 
ence  and  behavior  of  prime  ministers.  Studies  on 
cabinet  government  have  shown  that  ministers 
with  greater  political  experience  or  greater  expert 
knowledge  can  be  expected  to  exercise  more  influ¬ 
ence  on  cabinet  decision  making  than  other  cabinet 


members.  Future  research  on  cabinets  has  therefore 
to  collect  more  information  on  the  social  back¬ 
ground  and  the  careers  of  ministers  under  com¬ 
parative  perspective.  Further,  to  understand  minis¬ 
terial  behavior,  one  must  know  what  ministers 
think  about  their  own  work,  the  work  of  their 
colleagues,  and  the  part  played  by  the  cabinet  meet¬ 
ing  itself.  If  cabinet  decisions  matter,  cabinet  mem¬ 
bers  also  matter;  it  is  therefore  essential  to  find  out 
how  ministers  view  their  role  in  cabinet  govern¬ 
ment.  This  means  interviewing  (former)  ministers — 
an  enterprise  that  is  long  and  complex  but  is  surely 
rewarding.  Cabinet  decisions  are  also  affected  by 
the  behavior  of  prime  ministers  as  heads  of  cabi¬ 
nets.  Prime  ministers  are  “arbitrators”  or  “activ¬ 
ists”  with  respect  to  the  implementation  of  cabinet 
policy  goals.  If  they  are  arbitrators,  prime  ministers 
may  not  be  involved  personally  in  policy  proposals 
and  must  be  content  with  moving  the  cabinet 
toward  an  acceptable  solution.  If  they  are  activists, 
they  are  likely  to  be  deeply  interested  in  particular 
proposals  and  to  attempt  to  push  these  proposals 
through  cabinet.  A  systematic  comparative  investi¬ 
gation  of  cabinets  has  therefore  to  develop  a  typol¬ 
ogy  of  prime  ministerial  leadership  style  and  to 
relate  this  typology  to  the  different  phases  and 
aspects  of  cabinet  decision  making. 

Directions  for  Future  Research 

Given  the  difficulties  that  most  cabinets  in  demo¬ 
cratic  countries  currently  face,  a  common  theory 
about  the  functioning  of  cabinets  needs  to  be 
developed.  This  theory  needs  to  take  into  account 
five  key  factors  that  are  responsible  for  the  general 
functioning  of  cabinets  in  democratic  countries: 
(1)  the  structural  characteristics  of  the  political 
system,  (2)  the  role  of  the  political  parties,  (3)  the 
role  of  individual  ministers,  (4)  the  leadership  style 
of  the  prime  minister,  and  (5)  the  administrative 
characteristics  of  the  prime  minister’s  offices. 
Systematic  research  on  the  functioning  of  cabinets 
has  first  to  focus  on  these  five  aspects  in  succession 
to  obtain  reliable  and  comparable  information 
about  cabinet  governments  worldwide.  Meanwhile, 
hypotheses  linking  these  five  aspects  need  to  be 
formulated  and  tested.  This  is  a  vast  undertaking, 
especially  since  the  principles  of  classification  still 
have  to  be  elaborated  in  a  number  of  ways.  If  such 
an  approach  is  adopted,  however,  it  will  become 
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possible  to  discover  and  assess  the  explanations 
that  account  for  the  operation  of  cabinets. 
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Capitalism 


Since  the  early  20th  century,  the  concept  of  capi¬ 
talism  has  been  used  to  capture  the  structure  and 
dynamics  of  a  particular  historical  formation  of 
economy  and  society  that  first  emerged  during  the 
late  Middle  Ages  in  Southern  Europe  and  later 
spread  to  Northwestern  Europe.  Capitalism 
emphasizes  the  attainment  of  profit  through  the 
operation  of  the  market  and  private  ownership  of 
the  means  of  production.  This  entry  describes  the 
role  of  markets  and  property  rights  in  capitalism 
and  examines  the  functions  of  firms  and  institu¬ 
tions  in  its  operation.  The  characteristics  of  capi¬ 
talist  culture  are  then  discussed.  The  entry  closes 
with  a  review  of  endogenous  critiques  of  capital¬ 
ism  and  of  the  shape  it  may  take  in  the  future. 

Since  its  beginnings  in  Europe,  capitalism  has 
expanded  to  virtually  all  parts  of  the  globe. 
Capitalism  can  be  contrasted  with  subsistence 
economy,  feudalism,  socialism,  and  slave  econ¬ 
omy.  Third  World  developing  societies  may  con¬ 
tain  insular  capitalist  patterns  in  their  economy 


without  thereby  becoming  capitalist  societies. 
Comparative  social  scientists  and  historians  have 
distinguished  a  great  number  of  stages,  types, 
qualifiers,  and  variants  under  the  broad  umbrella 
concept  of  capitalism,  such  as  agrarian,  commer¬ 
cial,  industrial,  and  financial  capitalism;  state 
capitalism  and  coordinated  capitalism;  and  Nordic, 
Anglo-Saxon  (chiefly  British  and  American),  East 
Asian,  and  “Rhenish”  capitalism. 

Most  scholars  who  use  the  concept  of  capitalism 
in  a  holistic  way  view  it  not  only  as  an  economic 
system  but  also  as  comprising  a  type  of  social  struc¬ 
ture,  political  institutions,  and  specific  cultural 
norms  and  values.  The  complementarity,  goodness 
of  fit,  and  range  of  variation  that  exist  among  these 
realms — essentially  the  realms  of  capitalist  inter¬ 
ests,  institutions,  and  ideas  that  together  make  up 
capitalism — have  been  the  focus  of  social  science 
analysis  since  the  pioneering  works,  around  the 
turn  of  the  19th  century,  by  Max  Weber  and 
Werner  Sombart  to  contemporary  research  on 
comparative  capitalism.  On  the  European  conti¬ 
nent,  the  use  of  the  term  capitalism,  in  both  politi¬ 
cal  and  academic  contexts,  almost  always  had 
critical  overtones.  Authors  who  wish  to  avoid  such 
connotations  use  “social  market  economy,”  “indus¬ 
trial  society,”  or  simply  “modern  society”  instead; 
however,  such  usage  may  cause  them  occasionally 
to  lose  sight  of  the  problematiques  and  insights  of 
those  classical  authors  of  social  science. 

Defining  Features  of  Capitalism 

There  are  six  defining  features  of  capitalism: 
(1)  markets,  (2)  property  rights,  (3)  the  role  of  pri¬ 
vate  firms,  (4)  politico-economic  institutions, 

(5)  capitalist  patterns  of  the  cognitive  and  normative 
culture  (Weber’s  “spirit”  of  capitalism),  and 

(6)  reflexive  dynamics  of  critique  that  are  specific  to 
capitalist  societies.  Theorists  differ  as  to  the  emphasis 
they  attach  to  each  of  these  components  of  capital¬ 
ism.  The  study  of  capitalism  is  a  highly  inter¬ 
disciplinary  field  of  investigation  to  which  historians, 
economists,  sociologists,  lawyers,  political  scientists, 
and  philosophers  have  significantly  contributed. 

Markets 

Capitalist  societies  are  based  on  economic  sys¬ 
tems  in  which  most  goods  and  services  are  bought 
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and  sold  in  markets  for  a  monetary  price,  thus 
making  them  commodities.  The  commodification 
of  goods  makes  for  the  contingency  of  economic 
transactions,  meaning  that  the  parameters  of  these 
transactions — who  buys  from  and  sells  to  whom 
and  what  commodity  at  which  point  in  time  and  at 
what  price — become  a  matter  of  continuous  choice 
and  an  ongoing  competitive  recombination  of 
social  relations.  Market  transactions  are  governed 
by  a  regime  of  social  norms  and  legal  rules  (law  of 
contract)  that  is  enforced  by  a  neutral  state-oper¬ 
ated  court  system  and  that  specifies  the  mutual 
rights  and  liabilities  of  agents  entering  into  eco¬ 
nomic  transactions.  These  norms  and  rules  are 
designed  to  rule  out  the  use  of  openly  predatory 
practices  such  as  individual  or  organized  violence, 
fraud,  theft,  deception,  conspiracy  to  deny  poten¬ 
tial  suppliers  market  access,  to  some  extent  even 
practices  of  cartelization  and  monopolization,  and 
bribes  as  “unfair”  means  employed  in  the  pursuit 
of  economic  gain.  To  the  (historically  highly 
unlikely)  extent  that  such  rules  are  fully  imple¬ 
mented,  we  can  speak  of  a  civilizing  function  of 
market  competition,  ideally  leaving  only  prices 
and  qualities/novelty  of  goods  as  action  parame¬ 
ters  of  competitors.  Yet  as  prices  are  “given”  in 
any  (nearly)  perfect  (or  “atomistic”)  market,  sup¬ 
pliers  are  under  strong  incentives  to  innovate  both 
products  and  (technical  and  organizational)  pro¬ 
duction  processes.  Markets  determine  prices  in 
response  to  changes  in  the  volume  of  supply  of  and 
demand  for  specific  goods  and  services.  The  prom¬ 
inent  role  of  choice,  contingency,  and  civility  in 
economic  interaction  has  led  theorists  such  as 
Milton  Friedman  (1962)  to  equate  capitalism  with 
individual  freedom.  Markets  make  people  “free  to 
choose.” 

Yet  a  defining  feature  of  capitalist  market  soci¬ 
eties  consists  in  the  fact  that  not  only  goods  and 
services  manufactured  for  the  purpose  of  being 
sold  but  also  the  “factors  of  production”  employed 
in  the  process  of  manufacturing  are  subject  to  mar¬ 
ket  exchange  under  capitalism.  Even  though  these 
factors — natural  resources,  human  labor  power, 
and  money — have  not  been  “produced”  (and  cer¬ 
tainly  not  produced  with  the  intention  of  being 
marketed),  they  are  still  subsumed  under  the  com¬ 
modity  form.  This  “commodification  of  noncom¬ 
modities”  has  been  attacked,  both  in  the  Marxist 
tradition  and  famously  by  Karl  Polanyi  (1944),  as 


the  core  contradiction  of  capitalism — that  is,  a 
source  of  conflict  and  instability  that  constantly 
calls  for  (interventionist,  reformist,  revolutionary, 
authoritarian,  military,  etc.)  remedies  and  institu¬ 
tional  safeguards  to  be  installed  by  holders  of 
political  power.  The  commodification  of  money 
through  speculative  investments,  with  their  poten¬ 
tially  disastrous  effects  on  financial  markets;  the 
commodification  of  natural  resources  with  their 
associated  environmental  damages;  and  the  com¬ 
modification  of  human  labor  power  with  its  distri¬ 
butional  and  other  adverse  impacts  can  all  be  cited 
as  contemporary  instances  of  this  key  “mistake”  of 
commodifying  what  by  their  very  nature  are  non¬ 
commodities,  or  “fictitious”  commodities.  Thus, 
one  defining  feature  of  a  capitalist  society  and  its 
dynamics  is  the  existence  of  a  labor  market  in 
which  the  capacity  of  workers  to  perform  produc¬ 
tive  services  is  being  traded  under  labor  contracts. 

Such  commodification  of  noncommodities  has 
provided  for  the  enormous  gains  in  efficiency, 
growth,  and  prosperity  that  have  accompanied  the 
history  of  capitalism.  While  under  capitalism  the 
commodity  form  is  extended  to  noncommodities, 
it  is  on  the  other  hand  restricted  (compared  with 
precapitalist  monetized  exchange  relations)  to 
items  of  “economic”  value — that  is,  excluding 
items  such  as  positions  in  the  state  administration, 
court  decisions,  academic  titles,  marriage  licenses, 
or,  most  important  and  since  the  abolition  of  slav¬ 
ery,  human  beings  themselves,  who  are  instead 
governed  by  the  principle  of  inalienable  “self¬ 
ownership,”  a  concept  that  has  its  roots  in  the 
work  of  theorists  such  as  John  Locke. 

By  exposing  labor  to  market  contingency  under 
the  regulatory  regime  of  the  labor  contract,  capital¬ 
ism  inserts  workers  into  the  organizational  frame¬ 
work  of  productive  organizations  (firms),  which, 
due  to  the  division  of  labor,  organizational  control 
mechanisms,  and  efficiency-enhancing  investment 
goods,  allow  labor  power  to  be  used  much  more 
productively  than  was  the  case  in  precapitalist 
forms  of  production.  Yet  the  reverse  side  of  this 
growth  and  prosperity  lies  in  the  commodification 
of  (nominally  “free”)  contractual  labor  and  the 
distributional  patterns  as  well  as  the  contingencies 
following  from  it.  Not  only  does  the  individual 
employer  exercise  power  (as  authorized  by  the 
labor  contract)  over  the  employees  within  the  firm, 
but  the  collectivity  of  all  employers  also  exercises 


188  Capitalism 


“structural”  or  class  power  over  the  labor  force  as 
a  whole.  The  latter  arises  from  the  fact  that  labor  is 
tied  to  capital  in  a  relationship  of  asymmetrical 
dependency.  Because  wages  are  normally  workers’ 
only  source  of  subsistence,  workers  typically  depend 
more  urgently  on  being  employed  than  employers 
depend  on  employing  labor,  or  at  least  on  employ¬ 
ing  it  “here,  now,  and  continuously.”  One  impor¬ 
tant  cause  of  this  asymmetry  resides  in  the  fact  that 
employers  are  in  control  of  (as  well  as  incentivized 
through  competition  to  put  to  use)  productivity¬ 
enhancing  and  hence  of  labor-saving  technical  and 
locational  change,  whereas  workers,  by  themselves, 
can  do  little  (if  anything)  to  enhance  the  “welfare 
yield”  of  the  wages  they  earn  and  spend.  The  pro¬ 
cess  that  has  been  set  up  by  the  fundamental  capi¬ 
talist  institutions  of  the  labor  market  and  the  labor 
contract  can  thus  be  looked  on  as  a  “wealth-maxi¬ 
mizing  game”  and,  at  the  same  time,  a  “poverty- 
and  insecurity-generating  game.  ” 

After  the  end  of  the  golden  age  of  stable  growth 
and  full  employment — an  age  that  coincided 
roughly  with  the  third  quarter  of  the  20th  century 
for  the  nations  in  the  Organisation  for  Economic 
Co-operation  and  Development  (OECD) — one  of 
the  core  problems  of  open  (globalized),  advanced 
capitalist  economies  has  become  the  chronic 
imbalance  between  the  supply  of  labor  and  the 
demand  for  it,  as  was  pointed  out  by  Guy  Standing 
(2009).  This  core  problem  translates  into  the  divi¬ 
sive  dilemma  often  confronting  workers,  who 
must  either  yield  “flexibly”  to  the  pressures  to 
become — and  remain — “employable”  (which  they 
can  rarely  accomplish  through  their  own  means 
and  efforts  alone)  or  face  the  prospect  of  socioeco¬ 
nomic  insecurity,  precariousness,  unemployment, 
and  exclusion. 

Property  Rights 

Participants  in  markets  enjoy  state-enforced 
property  rights.  More  important,  such  property 
rights  do  not  imply  that  every  member  of  the  legal 
community  has  a  right  to  own  some  share  in  the 
total  of  the  material  resources  available  in  the 
community  (as  was  envisaged  by  18th-century 
revolutionary  writers  such  as  Thomas  Paine). 
Rather,  it  means  that  members  of  the  community 
who  already  happen  to  be  in  the  possession  of 
units  of  property  are  recognized  and  protected  in 


their  ownership  status  (provided  that  such  prop¬ 
erty  has  been  acquired  in  legally  permitted  ways). 
Such  protection  means  that  the  ownership  status  is 
safeguarded  against  the  loss  of  property  (e.g., 
through  theft,  destruction,  confiscation,  etc., 
although,  of  course,  not  against  losses  that  may 
result  from  unfavorable  choices  made  by  the  own¬ 
ers).  The  right  to  property  further  means  that  the 
owner  is  free  (within  limits  established  by  regula¬ 
tory  law)  to  determine  the  use  to  which  the  prop¬ 
erty  is  being  put  as  capital,  as  well  as  to  appropri¬ 
ate  the  gains  (profits)  flowing  from  its  use.  One 
highly  consequential  aspect  of  the  freedom  of 
owners  as  constituted  by  property  rights  is  to  use 
these  rights  to  hire  labor,  with  the  further  implica¬ 
tion  that  those  hired  as  workers  are  rendered  inca¬ 
pable  of  acquiring  property  themselves,  under  the 
terms  of  remuneration  under  which  they  are  hired 
and  in  the  absence  of  potentially  productive  prop¬ 
erty  of  their  own.  In  this  case,  the  availability  of 
property  rights  to  some  does  not  simply  coexist 
with  but  causes  the  denial  of  property  rights  to 
others  (particularly  as  the  latter  typically  lack  the 
collateral  that  they  would  need  to  obtain  a  com¬ 
mercial  bank  loan). 

Firms 

If  property  owners  decide  to  invest  their  mone¬ 
tary  resources  (which  is  the  only  chance  they  have 
to  make  their  property  a  durable  asset  or  even 
increase  it  through  accumulation),  this  investment 
will  show  up  as  capital  in  firms,  unless  it  is  a 
purely  speculative  investment  in  financial  markets. 
The  firm  is  the  key  capitalist  institutional  location 
where  investment  (in  buildings,  machinery,  raw 
materials,  etc.)  is  combined  with  wage  labor  for 
the  purpose  of  producing  marketable  commodi¬ 
ties.  The  two  defining  features  of  the  capitalist  firm 
are  (1)  its  distantiation  (in  space,  time,  social  func¬ 
tion,  and  accounting  principles)  from  the  house¬ 
hold  and  (2)  a  hierarchical  structure  of  command 
and  control  (with  an  entrepreneur  or  a  managing 
board  at  the  top)  and  a  formalized  vertical  and 
horizontal  division  of  labor  designed  to  promote 
the  efficiency  of  the  productive  process  and  the 
realization  of  its  results  in  markets.  In  sharp  con¬ 
trast  to  voluntary  and  highly  contingent  market 
transactions,  the  interaction  (which  Marx  called 
“despotic”)  that  occurs  within  firms  is  based  on 
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the  contractual  right  of  some  agents  to  give  orders 
and  the  duty  of  others  (as  enforced  by  some  mea¬ 
sure  of  the  right  of  superiors  to  terminate  the  labor 
contract)  to  carry  them  out.  While  the  degree  of 
authoritarianism  of  the  internal  regimes  of  capital¬ 
ist  firms  may  vary  greatly,  some  measure  of 
authoritative  coordination  seems  indispensible. 
This  is  so  because,  first,  the  labor  contract  is  (for 
good  reasons  in  terms  of  efficiency)  essentially  an 
incomplete  contract  with  gaps  that  need  to  be 
filled  by  commands  and,  second,  the  conflict  of 
interest  inherent  in  any  asymmetrical  contractual 
relation  will  make  it  highly  unlikely  that  those 
gaps  in  the  contract  will  reliably  be  filled  by  acts  of 
spontaneous  cooperation,  thus  overcoming  this 
tension.  The  result  is  friction  between  the  (nomi¬ 
nally)  voluntaristic,  egalitarian,  and  freely  chosen 
relationship  between  firms  and  their  external  mar¬ 
ket  partners,  on  the  one  hand,  and  the  authoritar¬ 
ian  and  hierarchical  relations  inside  the  firm  as  a 
formal  organization,  on  the  other. 

The  existence  of  this  relationship  of  social 
power  can  be  factually  accounted  for  in  terms  of 
the  asymmetry  that  capital  can  hire  labor;  yet 
labor,  in  the  absence  of  savings  or  a  collateral,  can¬ 
not  normally  hire  capital.  And  it  can  institutionally 
account  for  the  fact  that,  once  employees  enter 
into  the  labor  contract,  they  subject  themselves  to 
the  command  structure  of  the  enterprise.  Virtually 
all  of  the  labor  market,  labor  relations,  and  indus¬ 
trial  relations  policies  that  we  find  in  the  history  of 
capitalist  societies  (including  the  building  of  insti¬ 
tutions  for  collective  bargaining  and  codetermina¬ 
tion)  can  be  seen  as  ongoing  attempts  of  states, 
workers,  and  employers  to  regulate  and  (unbal¬ 
ance  these  two  basic  kinds  of  power  relationships. 
They  are  analytically  distinct  from  a  third  one:  the 
political  “exit  power”  of  capital  to  relocate  (or  to 
threaten  relocation  or  shifts  to  speculative  invest¬ 
ments  in  financial  markets),  to  which  states  and 
their  political  elites  are  vulnerable.  This  vulnerabil¬ 
ity  results  from  the  dependence  of  states  on  tax 
revenues  obtained,  directly  or  indirectly,  from 
investment,  growth,  and  employment,  all  of  which 
are  ultimately  controlled  by  decisions  of  capitalist 
enterprises. 

The  aggregate  effect  of  activity  in  firms  and 
labor  markets  generates  and  reproduces  specifi¬ 
cally  capitalist  patterns  of  inequality,  reflecting  the 
differential  marketability  (employability)  of  labor. 


As  Standing  (2009)  notes,  these  patterns  pertain  to 
earnings,  employment  opportunities,  income  secu¬ 
rity,  wealth,  organizational  resources,  political 
power,  and  even  life  expectancy.  These  inequalities 
unfold  in  interindividual,  intersectoral,  interre¬ 
gional,  international,  and  global  dimensions.  At 
the  bottom  of  distributional  hierarchies  are  people, 
regions,  and  even  an  entire  continent  (Africa) 
whose  survival  is  placed  in  jeopardy  by  the 
dynamic  of  capitalism  or  whom  the  dynamic  of 
capitalism  can  afford,  as  it  were,  “to  do  without.” 

Firms  are  the  institutional  location  where  a  spe¬ 
cific  kind  of  capitalist  rationality  unfolds.  At  every 
point  in  time,  the  intellectual  technique  of  rational 
capital  accounting  of  costs  and  returns  allows  a 
firm  to  assess  its  own  value  and  to  evaluate  alter¬ 
native  courses  of  action  in  terms  of  probable 
returns  to  the  capital  invested.  At  the  same  time, 
there  are  two  sources  of  irrationality,  one  at  the 
source  and  one  at  the  outcome  of  rational  calcula¬ 
tion.  As  to  the  former,  accumulation  is  seen  as 
originating  from  a  deeply  irrational  dynamic  of 
entrepreneurial  intuition  and  vision  that  can  be 
neither  taught  nor  learned — a  kind  of  imaginative 
“creativity  at  the  top”  that  guides  the  innovation 
of  products  and  processes.  As  to  the  latter,  the 
aggregate  outcome  of  the  capitalist  dynamic  trig¬ 
gers  social  (including  environmental  and  cultural) 
changes  that  just  “happen”  as  unintended  and 
unpremeditated  outcomes  and  cannot  be  attrib¬ 
uted  to  any  rational  design  or  calculation.  Even 
firms  themselves  can  never  be  certain,  given  the 
unpredictability  of  their  environment,  that  deci¬ 
sions  will  turn  out  to  have  been  rational  in  light  of 
the  outcomes.  This  latter  irrationality  of  outcomes 
is  a  point  at  which  the  dynamics  of  capitalism  and 
the  normative  ideals  of  modernization  diverge.  If 
the  project  of  modernity  means  the  effective  mas¬ 
tery  of  society  over  its  collective  fate,  this  is  clearly 
not  a  virtue  in  which  capitalism  (according  to  Max 
Weber,  arguably  still  the  “most  fateful  force  of 
social  life”)  excels. 

Institutions 

Firms  interact  not  only  with  external  market 
participants  (customers,  workers  to  be  hired,  and 
other  firms  as  suppliers  or  buyers)  but  also,  and  in 
ways  that  are  not  mediated  by  markets,  with  the 
institutional  environment  in  which  they  are 
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“embedded.”  Sociologists  and  “institutionalist” 
economists  have  explored  the  vast  field  of  non¬ 
commercial  interaction  that  both  firms  as  employ¬ 
ers  as  well  as  workers  as  employees  are  involved 
in.  These  noncommercial  relations  of  capitalist 
firms,  as  well  as  of  everyone  else  participating  in 
markets,  are  governed  by  institutions  and  the  legal 
rules,  formal  procedures,  and  social  norms  that 
institutions  consist  of.  The  institutional  environ¬ 
ment  of  market  actors  (beginning  with  private  law 
and  its  enforcement  in  courts  and  not  ending  with 
state-provided  investment  in  infrastructure)  is 
something  that  they,  on  the  one  hand,  depend  on 
to  reach  their  market  objectives;  yet  on  the  other 
hand,  the  institutional  environment  is  one  that 
market  actors  try  to  actively  shape  and  transform 
in  ways  that  best  suit  their  respective  interests. 
They  are  involved  in  a  reflexive  process  that  may 
be  termed  the  production  of  the  conditions  of  pro¬ 
duction  (as  well  as  their  distribution).  The  politi¬ 
cally  mediated  production  of  conditions  of  pro¬ 
duction  applies  to  the  areas  of  research,  develop¬ 
ment,  and  technical  change,  for  instance,  in  the 
areas  of  communication,  transportation,  and  the 
development  of  new  materials  and  sources  of 
energy.  It  also  applies  to  the  vast  policy  areas  of 
infrastructure  investment,  regulation  of  markets, 
trade  policy,  taxation,  labor  market  and  social 
policy,  and  macroeconomic  steering.  At  any  rate, 
we  would  get  a  seriously  deficient  and  distorted 
picture  if  we  were  to  model  action  in  capitalist 
societies  as  primarily  the  market  action  of  buying, 
selling,  and  investing.  To  succeed,  market  actors 
take  an  equally  strong  interest  not  only  in  comply¬ 
ing  with  but  also  in  strategically  shaping  the  non- 
commodified  environment  of  commercial  interac¬ 
tion  in  markets.  The  relationship  between  the 
actors  of  a  capitalist  economy  and  the  institutional 
environment  in  which  they  act  is  a  reciprocal  one: 
Firms,  consumers,  owners,  and  workers  would  not 
be  able  to  make  a  single  move  without  relying  on 
premises  such  as  laws,  courts,  legislatures,  regula¬ 
tory  agencies,  police  protection,  schools,  physical 
infrastructure,  systems  of  taxation  and  tariffs, 
social  insurance  systems,  central  banks,  research 
and  development  organizations,  and  many  others, 
mostly  supplied,  sponsored,  and  regulated  by  state 
agencies  and  all  kinds  of  private-public  hybrids, 
which  in  recent  literature  are  referred  to  as  agen¬ 
cies  of  “governance.”  Although  capitalism  is  a 


global  system,  the  configuration  of  capitalist  eco¬ 
nomic  actors  and  their  institutional  environment 
appears  to  be  still  largely  shaped  by  national  tradi¬ 
tions,  path  dependencies,  and  policy  approaches 
shared  by  the  political  and  economic  elites  of 
nation-states.  These  institutional-context  condi¬ 
tions  are  never  “given,”  fixed,  or  sacrosanct 
according  to  a  master  formula  of  the  “mixed 
economy”  but  are,  in  fact,  in  constant  flux  under 
the  impact  of  hegemonic  doctrines  of  social  order 
(e.g.,  neoliberalism)  as  well  as  the  strategic  efforts 
of  economic  actors  to  alter  them  in  ways  that 
allow  for  better  exploitation  of  the  emerging 
opportunities.  These  agents  depend  on  an  institu¬ 
tional  framework  of  social  order,  yet  at  the  same 
time  they  are  constantly  involved  in  strategic 
activities  designed  to  disorganize  and  reorganize  it. 
The  capacity  for  the  latter  is  derived,  in  spite  of  the 
apparent  primacy  of  political  state  power  over 
economic  exchange,  from  the  fact  that  modern 
states,  in  particular  modern,  liberal,  democratic 
states,  and  their  stability  depend  as  much  on  the 
reasonably  smooth  operation  and  growth  perfor¬ 
mance  of  the  capitalist  economy  as  the  agents  in 
the  capitalist  economy  depend  on  the  state-pro¬ 
vided  institutional  setup.  Again,  there  is  an  asym¬ 
metrical  mutual  dependency  (in  contrast  to  the 
notion  of  a  hierarchical  primacy  of  the  state  over 
the  economy),  due  to  the  fact  that  (capitalist) 
states,  in  their  turn,  depend  on  both  fiscal  resources 
and  political  support  (with  labor  market  outcomes 
as  one  of  its  important  determinants)  for  the  sake 
of  their  stability.  This  dependency  of  the  state  on 
capital  and  its  profitable  investment  is  all  the 
greater,  and  the  state’s  vulnerability  more  signifi¬ 
cant,  the  more  the  state  is  a  welfare  state  (i.e.,  a 
state  with  substantial  legal  commitments  to  the 
provision  and  maintenance  of  social  security)  and 
the  more  investors  enjoy  the  “exit  option”  that  a 
denationalized  pattern  of  trade  and  investment 
(globalization)  provides. 

What  agents  under  capitalism  actually  do  is 
thus  much  more  than  buying  and  selling  in  the 
pursuit  of  gain,  profit,  and  utility  maximization. 
Beyond  that,  they  act  reflexively  on  the  very  insti¬ 
tutional  context  conditions  under  which  they  act, 
revising,  as  it  were,  the  rules  as  the  game  goes  on 
by  constantly  reframing  what  Robert  Boyer  and 
Yves  Saillard  (2002)  call  their  “accumulation 
regimes.”  Firms  and  their  associations  are  involved 
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in  the  legal  or  managerial  design  and  ongoing 
adjustment  of  “production  regimes”  and  modes  of 
“corporate  governance.”  Moreover,  they  form 
cartels  and  alliances,  make  threats  and  promises, 
warn  and  demand,  bargain  and  negotiate,  associ¬ 
ate,  advertise,  influence,  lobby,  launch  campaigns, 
donate,  resist,  mobilize,  implicitly  blackmail  polit¬ 
ical  authority  by  virtue  of  the  fact  that  some  eco¬ 
nomic  agents  are  “too  big  to  die,”  complain  about 
state  policies  and  advocate  alternative  ones,  oppor¬ 
tunistically  evade  legal  and  contractual  obliga¬ 
tions,  strike  political  deals,  and  so  on — all  to 
shape,  reshape,  and  occasionally  also  subvert  the 
institutional  context  within  which  the  core  eco¬ 
nomic  process  of  capitalism  and  the  competitive 
pursuit  of  profits  is  going  on.  Capitalism  is  a 
political  economy  in  the  sense  that  it  can  hardly  be 
conceptualized  in  terms  of  a  durable  institutional 
equilibrium.  To  the  contrary,  rules  and  their  recog¬ 
nition  are  permanently  contested.  If  the  state  and 
its  institutions  can  be  said  to  be  devices  to  generate 
security  of  expectations  leading  to  trust  and  to 
protect  capitalist  market  society  from  its  own 
inherent  dangers  of  destabilization,  it  can  also  be 
said  that  this  device  is  itself  not  reliably  protected 
from  the  repercussions  of  such  destabilization. 
Charles  Lindblom  (1982)  has  even  compared  the 
market  to  a  “prison”  in  which  the  makers  of  pub¬ 
lic  policies  are  incarcerated.  The  assumption  that 
capitalism  is  at  all  “governable”  (as  opposed  to 
essentially  “anarchic”)  is,  in  other  words,  far  from 
axiomatic.  This  condition  of  uncertain  institu¬ 
tional  embeddedness  applies  even  to  the  overall 
political  regime  type.  For  while  it  is  true  that  all 
liberal  democracies  contain  capitalist  economies 
(in  spite  of  the  friction  that  exists  between  the 
two),  the  reverse  is  not  true:  Both  historically  and 
in  the  contemporary  world,  capitalism  has  coex¬ 
isted  with  (and  indeed  flourished  under)  various 
types  of  nondemocratic  regimes. 

Capitalist  Culture 

What  Max  Weber  has  termed  the  spirit  of 
capitalism  is  a  complex  and  multifaceted  phenom¬ 
enon  that  includes  cognitive  and  epistemic  as  well 
as  motivational  and  justificatory  elements.  It  has 
in  part  motivated  the  transition  from  precapitalist 
to  capitalist  modes  of  economic  life,  and  it  is  in 
turn  shaped  and  inculcated  by  the  realities  of  life 


under  capitalism.  It  also  applies  differently  to  dif¬ 
ferent  types  of  actors  (e.g.,  manual  workers, 
white-collar  workers,  the  self-employed,  entrepre¬ 
neurs,  managers,  and  consumers)  in  the  capitalist 
game,  as  well  as  to  different  stages  of  its  develop¬ 
ment  and  associated  “production  regimes.”  For 
example,  Boyer  and  Saillard  (2002)  contrast 
“Fordist”  mass  production  versus  post-Fordist 
“flexible  quality  production.”  Central  to  the  core 
model  of  capitalist  culture  is  the  notion  of  selfish 
and  “unfraternal”  (as  Max  Weber  put  it)  individ¬ 
uals’  pursuit  of  acquisitive  rationality  for  its  own 
sake.  These  individuals  methodically  explore  the 
physical  and  social  world  in  constant  search  of 
opportunities  for  gain.  In  doing  so,  they  follow 
their  interests,  control  their  own  passions  through 
self-imposed  discipline,  and  resist  the  passions  of 
rulers.  This  pursuit  of  interest  is  conceived  as  end¬ 
less — both  in  the  sense  that  there  is  no  end  or  state 
of  satiation  to  be  reached  where  further  efforts 
become  pointless  and  in  the  sense  that  it  can  (and 
in  fact  must)  go  on  forever,  as  any  standstill  spells 
failure  in  a  competitive  environment.  The  ratio¬ 
nality  that  governs  this  behavioral  dynamic  is 
“formal,”  “abstract,”  “self-referential,”  unend¬ 
ingly  and  relentlessly  expansive  in  time  and  space, 
and  boundless.  Everything  we  encounter  in  the 
world  is  first  of  all  being  framed  in  terms  of  costs 
and  returns,  risk  of  loss,  and  opportunity  for  gain 
alone.  At  the  same  time,  the  accounting  frame  of 
capitalist  assessment  of  costs  and  returns  is  too 
restricted  and  myopic — that  is,  insufficiently  intel¬ 
ligent — to  capture  long-term  and  collective  nega¬ 
tive  externalities,  which  therefore  tend  to  be  sys¬ 
tematically  ignored.  Weber  has  claimed  an  “elec¬ 
tive  affinity”  that  exists  between  the  urge  to 
accumulate  and  Puritan  asceticism,  which  abhors 
wealth  to  be  enjoyed  and  instead  lauds  its  being 
transformed  into  capital  to  be  invested,  with  the 
satisfaction  of  need  just  being  a  by-product  of  the 
process.  Weber,  in  his  1904  book  The  Protestant 
Ethic  and  the  Spirit  of  Capitalism ,  argued  that  the 
religious  values  of  Calvinism  provided  motivation 
for  work  and  the  accumulation  of  wealth  in  the 
secular  world.  He  saw  this  as  an  alternative  to 
Marx’s  doctrine  of  historical  materialism  as  the 
basis  of  economic  structures.  (Marx  almost  never 
used  the  term  capitalism.)  Relentless  and  often 
fear-driven  patterns  of  acquisitive  search  behavior 
have  become  common  today  not  just  among 
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entrepreneurs  and  the  self-employed  but  also 
among  a  wide  range  of  employees  who  have 
adopted  entrepreneurial  attitudes  and  values  such 
as  the  values  of  flexibility  and  mobility. 

But  this  characterization  of  capitalist  culture 
and  cognitive  style  captures  just  one  of  its  facets. 
As  Daniel  Bell  (1976)  pointed  out,  others  include 
patterns  of  hedonistic  consumerism  with  its  mind¬ 
less  shortsightedness  that  creates  ever  more  needs 
and  desires  amid  affluence.  Still  another  consists  in 
countertendencies  to  abstract  formal  rationality, 
the  spread  of  which  can  be  observed  in  the  irratio¬ 
nalities  of  superstition,  magical  thinking,  and 
prejudice  of  “authoritarian”  modal  personalities, 
or  in  tendencies  in  postmodernist  and  antiauthori¬ 
tarian  countercultures  that  challenge  dominant 
modes  of  rationality,  thus  causing  “cultural  con¬ 
tradictions  of  capitalism”  (Bell,  1976).  Whether  or 
not  we  can  speak  of  specific  cultural  prerequisites 
(as  opposed  to  formative  cultural  consequences)  of 
capitalism  is  an  issue  of  considerable  interest  for 
social  research,  given  that  East  Asian  capitalism 
has  thrived  within  the  cultural  framework  of 
Confucianism  and  also  given  that  there  was  hardly 
any  founding  generation  of  a  capitalistically  “spir¬ 
ited”  middle  class  in  some  of  the  now  capitalist 
societies  that  emerged  from  state  socialism  in  cen¬ 
tral  East  Europe,  where  capitalism  was  built 
“without  capitalists,”  as  was  stressed  by  Gil  Eyal, 
Ivan  Szelenyi,  and  Eleanor  Townsley  (1998).  As  is 
the  case  with  political  institutions,  both  high  cul¬ 
ture  and  popular  culture  and  both  the  normative 
and  the  cognitive  “mental  infrastructure”  of  capi¬ 
talist  societies  are  constantly  (re)negotiated  through 
the  joint  governance  of  private  actors  (e.g.,  much 
of  the  media  industries)  and  public  authorities. 

Critiques  of  Capitalism 

From  its  historical  beginnings,  capitalist  market 
societies  have  encountered  strong  endogenous  cri¬ 
tiques.  The  intellectual  and  political  critique  of 
capitalism  and  its  inherent  dynamics  comes  in  two 
main  variants  that  are  often  combined  by  critics. 
One  is  based  on  empirical  analysis  and  prediction 
and  focuses  on  the  observable  instability  of  the  sys¬ 
tems  and  its  built-in  self-destructive  tendencies;  this 
kind  of  critical  perspective  yields  crisis  theories 
according  to  which  the  system  will  become,  sooner 
or  later,  unsustainable.  The  other  critical  perspective 


is  normative  and  highlights  the  suffering,  depriva¬ 
tion,  exclusion,  sense  of  meaninglessness,  and 
various  kinds  of  injustice  that  are  perceived  as 
concomitant  features  of  capitalist  growth  and 
development;  in  response  to  this  experience  of 
injustice,  social  conflict,  be  it  in  the  form  of  class 
conflict  or  otherwise,  is  both  predicted  and  advo¬ 
cated  by  critics  to  overcome  capitalism  and  trans¬ 
form  it  into  a  type  of  society  that  is  both  more  just 
(at  the  level  of  “social  integration”)  as  well  as 
more  stable  as  a  viable  economic  system. 

However,  the  empirical  observation  of  cyclical 
patterns  of  crisis  that  unfold  under  capitalist  insti¬ 
tutions,  as  well  as  the  normative  focus  on  injustice, 
does  not  provide  a  robust  argument  to  the  effect 
that  we  are  entitled  to  anticipate  a  crisis  or  a  con- 
flictual  challenge  of  capitalism.  This  non  sequitur 
is  widely  recognized  today  as  an  analytically 
unwarranted  leap  of  political  faith.  For  just  as 
cyclical  crises  and  recessions  set  the  scene  for  ever 
new  rounds  of  accumulation  and  growth,  capital¬ 
ism  can  also  thrive  on  at  least  some  versions  of  its 
normative  critique,  thus  arguably  continually  con¬ 
tributing  to  the  system’s  perpetuation  rather  than 
its  demise. 

Conclusion 

It  seems  safe  to  state  that  the  notion  of  a  modern 
society  “after”  and  “without”  capitalism  and  its 
key  features — a  notion  that  has  inspired  much  of 
the  history  of  the  political  left — has  largely  been 
rendered  obsolete  today.  This  obsolescence  is 
epitomized  by  the  demise  of  European  state  social¬ 
ism  in  1989  to  1991.  Neither  the  probability  nor 
the  desirability  of  a  full-scale  historical  abolition 
(breakdown)  of  capitalist  patterns  of  socioeco¬ 
nomic  organization  is  widely  advocated  any  lon¬ 
ger.  Instead,  capitalism  is  seen  to  be  subject  to 
numerous  forces  of  endogenous  change,  leading  to 
a  great  deal  of  variation  and  institutional  diversifi¬ 
cation  of  capitalisms.  The  longitudinal  notion  of  a 
diachronic  sequence  of  types  of  social  order  has 
yielded  to  a  “synchronized”  perspective,  with 
post-  and  anticapitalist,  “decommodified,”  and 
solidaristic  patterns  of  socioeconomic  organization 
now  playing  a  role  in  ongoing  and  contingently 
reversible  modifications  of  enduring  capitalist  core 
structures  and  the  ongoing  recombination  of  its 
components.  As  stated  before,  it  is  in  the  nature  of 
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capitalism  that  it  consistently  breeds  reflexive  cri¬ 
tiques  of  capitalism.  These  aim  at  curbing  and 
holding  accountable  the  various  manifestation  of 
the  social  power  of  capital  and  propose  to  deploy 
a  variety  of  institutions  and  policies  for  its  domes¬ 
tication — be  it  social  power  over  a  firm’s  employ¬ 
ees,  power  at  the  level  of  class  relations  and  its 
distributional  consequences,  the  role  of  economic 
(veto)  power  in  the  making  of  public  policy,  the 
power  of  investors  to  inflict  vast  negative  exter¬ 
nalities  on  everyone  else  (and  even  on  themselves) 
through  economic  crises  and  environmental 
destruction,  or  the  power  of  capital  to  shape  and 
“colonize”  the  process  of  cultural  reproduction. 
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Although  much  of  what  we  know  about  the 
empirical  world  has  been  generated  by  case  stud¬ 
ies  and  case  studies  continue  to  constitute  a  large 
proportion  of  work  generated  by  the  political  sci¬ 
ence  discipline,  the  case  study  method  is  poorly 
understood.  Even  among  its  defenders,  there  is 
confusion  over  the  virtues  and  vices  of  this 
research  design.  Practitioners  continue  to  ply  their 
trade  but  have  difficulty  articulating  what  it  is 
they  are  doing,  methodologically  speaking.  The 
case  study  survives  in  a  curious  methodological 
limbo.  The  problem  of  ambiguity  begins  with  the 
term  itself.  To  refer  to  a  work  as  a  case  study 
might  mean  that  its  method  is  qualitative,  small 
N;  that  the  research  is  holistic  and  thick  (a  more 
or  less  comprehensive  examination  of  a  phenom¬ 
enon);  that  it  uses  a  particular  type  of  evidence 
(e.g.,  ethnographic,  clinical,  nonexperimental, 
non-survey  based,  participant  observation,  pro¬ 
cess  tracing,  historical,  textual,  or  field  research); 
that  its  method  of  evidence  gathering  is  naturalis¬ 
tic  (a  “real-life  context”);  that  the  research  inves¬ 
tigates  the  properties  of  a  single  observation;  or 
that  the  research  investigates  the  properties  of  a 
single  phenomenon,  instance,  or  example.  This 
entry  presents  a  more  precise  definition  of  this 
method  and  discusses  its  strengths  and  limitations. 

Evidently,  researchers  have  many  things  in  mind 
when  they  talk  about  case  study  research.  Confusion 
is  compounded  by  the  existence  of  a  large  number 
of  near  synonyms — single  unit,  single  subject,  sin¬ 
gle  case,  N  =  1,  case  based,  case-control,  case  his¬ 
tory,  case  method,  case  record,  case  work,  clinical 
research,  and  so  forth.  As  a  result  of  this  profusion 
of  terms  and  meanings,  proponents  and  opponents 
of  the  case  study  marshal  a  wide  range  of  argu¬ 
ments  but  do  not  seem  any  closer  to  agreement 
than  when  this  debate  was  first  broached  several 
decades  ago.  To  talk  about  this  subject  in  a  pro¬ 
ductive  fashion,  we  must  arrive  at  a  narrower  defi¬ 
nition.  In  this  entry,  we  stipulate  that  a  case  con¬ 
notes  a  spatially  delimited  phenomenon  (a  unit) 
observed  at  a  single  point  in  time  or  over  some 
period  of  time.  It  comprises  the  sort  of  phenomena 
that  an  inference  attempts  to  explain.  Thus,  in  a 
study  that  attempts  to  explain  certain  features  of 
nation-states,  cases  consist  of  nation-states  (across 
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some  temporal  frame).  In  a  study  that  attempts  to 
explain  the  behavior  of  individuals,  individuals 
make  up  the  cases.  And  so  forth.  Each  case  may 
provide  a  single  observation  or  multiple  (within- 
case)  observations. 

For  students  of  political  science,  the  archetypal 
case  is  the  dominant  political  unit  of  our  time,  the 
nation-state.  However,  the  study  of  smaller  social 
and  political  units  (regions,  cities,  villages,  com¬ 
munities,  social  groups,  and  families)  or  specific 
institutions  (political  parties,  interest  groups,  and 
businesses)  is  equally  common  in  other  subfields 
and,  perhaps,  increasingly  so  in  comparative  poli¬ 
tics.  Whatever  the  chosen  unit,  the  methodological 
issues  attached  to  the  case  study  have  nothing  to  do 
with  the  size  of  the  individual  cases.  A  case  may  be 
created  out  of  any  phenomenon  so  long  as  it  has 
identifiable  boundaries  and  comprises  the  primary 
object  of  an  inference.  Note  that  the  spatial  bound¬ 
aries  of  a  case  are  often  more  apparent  than  its 
temporal  boundaries.  We  know,  more  or  less, 
where  a  country  begins  and  ends,  even  though  we 
may  have  difficulty  explaining  when  a  country 
begins  and  ends.  Yet  some  temporal  boundaries 
must  be  assumed.  This  is  particularly  important 
when  cases  consist  of  discrete  events — crises,  revo¬ 
lutions,  legislative  acts,  and  so  forth — within  a  sin¬ 
gle  unit.  Occasionally,  the  temporal  boundaries  of 
a  case  are  more  obvious  than  its  spatial  boundaries. 
This  is  true  when  the  phenomena  under  study  are 
eventful  but  the  unit  undergoing  the  event  is  amor¬ 
phous.  For  example,  if  one  is  studying  terrorist 
attacks,  it  may  not  be  clear  how  the  spatial  unit  of 
analysis  should  be  understood,  but  the  events  them¬ 
selves  may  be  well  bounded.  Following  this  under¬ 
standing  of  “case,”  a  case  study  could  be  defined  as 
the  intensive  study  of  a  single  case  for  the  purpose 
of  understanding  a  larger  population  of  cases. 
Several  implications  flow  from  this  definition,  as 
applied  to  the  social  sciences. 

By  virtue  of  the  small  size  of  the  sample  and  the 
heterogeneous  nature  of  most  social  phenomena, 
the  representativeness  of  a  case  study  is  always 
rather  uncertain.  One  does  not  know  for  sure 
whether  the  chosen  case  or  cases  are  similar  (in 
relevant  ways)  to  the  larger  population  of  theo¬ 
retical  interest.  Of  course,  there  is  always  some 
uncertainty  attached  to  the  quest  for  external 
validity.  Even  so,  the  uncertainty  normal  to  this 


sort  of  inference  is  compounded  whenever  the  cho¬ 
sen  sample  is  very  small.  By  virtue  of  the  small  size 
of  the  sample,  it  can  usually  be  presumed  that  the 
evidence  at  hand  is  nonexperimental.  Note  that  if 
a  treatment  can  be  manipulated,  there  is  usually  an 
opportunity  to  generate  large  treatment  and  con¬ 
trol  groups,  moving  the  style  of  research  away 
from  an  intensive  focus  on  a  single  case.  (That 
said,  it  is  certainly  possible  to  combine  a  large-N 
cross-case  analysis  of  experimental  evidence  with 
the  in-depth  study  of  an  individual  case,  or  several 
cases,  within  that  sample — perhaps  with  the  goal 
of  shedding  light  on  causal  mechanisms.  Varying 
approaches  to  the  same  evidence  often  serve  a 
complementary  function.) 

Case  study  research  may  incorporate  several 
cases,  as  in  the  style  of  research  known  as  com¬ 
parative  historical.  However,  at  a  certain  point  it 
will  no  longer  be  possible  to  investigate  those  cases 
intensively.  At  the  point  where  the  emphasis  of  a 
study  shifts  from  the  individual  case  to  a  sample  of 
cases,  we  shall  say  that  a  study  is  a  cross-case  one. 
Thus,  the  distinction  between  a  case  study  and 
cross-case  study  must  be  understood  along  a  con¬ 
tinuum.  The  fewer  cases  there  are,  and  the  more 
intensively  they  are  studied,  the  more  a  work  merits 
the  appellation  case  study.  Although  the  case  study 
has  been  identified  thus  far  by  the  smallness  of  the 
sample,  it  is  important  to  appreciate  that  whenever 
one  is  conducting  an  intensive  analysis  of  a  single 
case  (or  a  small  number  of  cases)  one  is  gathering 
multiple  observations  from  the  case(s).  Intensive 
study  means  multiple  observations,  usually  at  a 
lower  level  of  analysis.  Insofar  as  these  observa¬ 
tions  are  comparable  with  one  another,  they  may 
be  analyzed  in  a  quantitative  fashion.  Thus,  the 
case  study  format  is  entirely  compatible  with  large- 
N  statistical  analysis  of  within-case  evidence. 

To  clarify,  what  distinguishes  a  large-N  analysis 
embedded  within  a  case  study  from  a  large-N 
cross-case  analysis  is  that  the  former  is  nested 
within  the  units  of  primary  theoretical  interest. 
Thus,  if  one’s  purpose  is  to  explain  the  process  of 
national-level  democratization,  the  units  of  theo¬ 
retical  interest  are  nation-states.  A  case  study 
approach  to  this  question  would  focus  on  a  single 
country  or  a  small  sample  of  countries.  As  part  of 
such  a  study,  one  might  conduct  a  survey  with 
thousands  of  respondents  to  determine  attitudes 
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and  behaviors  relevant  to  democratization.  This 
provides  the  fodder  for  a  quantitative  analysis, 
nested  within  the  case  study  format.  If,  on  the 
other  hand,  one’s  theoretical  purpose  is  to  explain 
individual-level  variation  in  attitudes/behavior 
related  to  democratization,  then  the  same  study  is 
probably  best  classified  as  cross-case,  rather  than 
case  study:  One  now  has  a  large  sample  of  the 
things  to  be  explained,  none  of  which  receives  a 
great  deal  of  attention. 

So  conceived,  a  case  study  analysis  may  contain 
qualitative  or  quantitative  evidence.  Typically,  it 
combines  both.  So  conceived,  the  case  study 
method  is  consonant  with  any  theoretical  frame¬ 
work  and  virtually  any  methodological  frame¬ 
work — for  example,  behavioralism,  ethnography, 
rational  choice,  institutionalism,  interpretivism, 
and,  very  occasionally,  experimental  research 
(N  =  1  experiment).  So  conceived,  it  is  possible  to 
look  dispassionately  at  the  strengths  and  weak¬ 
nesses  of  the  case  study  research  design,  in  contrast 
with  cross-case  research  designs.  These  trade-offs 
derive,  first  of  all,  from  basic  research  goals  such  as 

1.  whether  the  study  is  oriented  toward  hypothesis 
generating  or  hypothesis  testing, 

2.  whether  internal  or  external  validity  is 
prioritized, 


3.  whether  insight  into  causal  mechanisms  or 
causal  effects  is  more  valuable,  and 

4.  whether  the  scope  of  the  causal  inference  is  deep 
or  broad. 

These  trade-offs  also  hinge  on  the  shape  of  the 
empirical  universe — that  is, 

5.  whether  the  population  of  cases  under  study  is 
heterogeneous  or  homogeneous, 

6.  whether  the  causal  relationship  of  interest  is 
strong  or  weak, 

7.  whether  useful  variation  on  key  parameters 
within  that  population  is  rare  or  common,  and 

8.  whether  available  data  are  concentrated  or 
dispersed. 

Along  each  of  these  dimensions,  case  study 
research  usually  has  an  affinity  for  the  first  factor 
and  cross-case  research  has  an  affinity  for  the  sec¬ 
ond.  All  else  being  equal,  case  studies  are  more  use¬ 
ful  when  the  strategy  of  research  is  exploratory 
rather  than  confirmatory/disconfirmatory,  when 
internal  validity  is  given  preference  over  external 
validity,  when  insight  into  causal  mechanisms  is 
prioritized  over  insight  into  causal  effects,  when 
propositional  depth  is  prized  over  breadth,  when  the 


Table  I  Case  Study  and  Cross-Case  Research  Designs:  Affinities  and  Trade-Offs 


Affinity 

Case  Study 

Cross-Case  Study 

Research  goals 

1.  Hypothesis 

Generating 

Testing 

2.  Validity 

Internal 

External 

3.  Causal  insight 

Mechanisms 

Effects 

4.  Scope  of  proposition 

Deep 

Broad 

Empirical  factors 

5.  Population  of  cases 

Heterogeneous 

Homogeneous 

6.  Causal  strength 

Strong 

Weak 

7.  Useful  variation 

Rare 

Common 

8.  Data  availability 

Concentrated 

Dispersed 

Source:  Gerring,  J.  (2007),  p.  38.  Copyright  Cambridge  University  Press;  used  by  permission. 


196  Categorical  Response  Data 


population  of  interest  is  heterogeneous  rather  than 
homogeneous,  when  causal  relationships  are  strong 
rather  than  weak,  when  useful  information  about 
key  parameters  is  available  only  for  a  few  cases, 
and  when  the  available  data  is  concentrated  rather 
than  dispersed.  These  trade-offs  are  summarized  in 
Table  1.  To  clarify,  they  represent  methodological 
affinities ,  not  invariant  laws.  Exceptions  can  be 
found  to  each  one.  Even  so,  these  general  tenden¬ 
cies  are  often  noted  in  case  study  research  and  have 
been  reproduced  in  multiple  disciplines  and  sub¬ 
disciplines  over  the  course  of  many  decades. 

It  should  also  be  stressed  that  each  of  these 
trade-offs  carries  a  ceteris  paribus  caveat.  Case 
studies  are  more  useful  for  generating  new  hypoth¬ 
eses,  all  other  things  being  equal.  The  reader  must 
bear  in  mind  that  many  additional  factors  also 
rightly  influence  a  writer’s  choice  of  research 
design,  and  they  may  lean  in  the  other  direction. 
Ceteris  is  not  always  paribus.  One  should  not 
jump  to  conclusions  about  the  research  design 
appropriate  to  a  given  setting  without  considering 
the  entire  range  of  issues  involved — some  of  which 
may  be  more  important  than  others. 

In  conclusion,  it  may  be  granted  that  case  studies 
and  cross-case  studies  explore  the  world  in  different 
ways.  Yet,  properly  constituted,  there  is  no  reason 
why  case  study  results  cannot  be  synthesized  with 
the  results  gained  from  cross-case  analysis  and  vice 
versa.  Indeed,  they  are  often  complementary.  In  this 
light,  one  might  cite  the  current  popularity  of  multi¬ 
method  work  (triangulation),  which  often  combines 
large-N  cross-case  analysis  with  an  intensive  focus 
on  one  or  a  few  carefully  chosen  cases.  Although  this 
discussion  has  been  brief,  it  may  help  restore  a 
greater  sense  of  coherence,  purpose,  and  integrity  to 
the  case  study  method.  This  narrower  and  more 
carefully  bounded  definition  may  alleviate  some  of 
its  most  persistent  ambiguities.  It  is  also  hoped  that 
the  characteristic  strengths  of  this  method,  as  well  as 
its  limitations,  will  be  apparent  to  producers  and 
consumers  of  case  study  research.  The  case  study  is 
a  useful  tool  for  some  research  objectives  but  not  all. 

John  Gerring 
Boston  University 
Boston,  Massachusetts,  United  States 
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The  term  categorical  response  data  refers  to  data 
for  outcome  variables  whose  values  represent  dis¬ 
tinct  categories  as  opposed  to  continuous  quanti¬ 
ties.  Categorical  variables  are  pervasive  in  politi¬ 
cal  science.  Examples  are  as  diverse  as  party 
affiliation  (“Democrat,”  “Republican,”  “other,” 
“none”),  union  membership  (“yes,”  “no”),  form 
of  government  (“republic,”  “monarchy,”  “mili¬ 
tary  dictatorship,”  etc.),  voter  participation 
(“yes,”  “no”),  or  confidence  in  the  government 
(“a  great  deal,”  “quite  a  lot,”  “not  very  much,” 
“none  at  all”).  Although  examples  for  the  collec¬ 
tion  of  categorical  data  can  probably  be  traced 
back  as  far  as  the  ancient  censuses  in  Egypt,  it 
was  not  until  the  early  20th  century  that  a  sys¬ 
tematic  development  of  statistical  methods  for  the 
analysis  of  categorical  data  began.  The  starting 
point  was  the  work  of  Karl  Pearson  and  G.  Udny 
Yule,  who  debated  over  how  best  to  analyze  asso¬ 
ciations  between  categorical  variables.  Many 
fundamental  contributions  that  prepared  the 
ground  for  the  emergence  of  a  wide  variety  of 
approaches  in  the  subsequent  decades  were  also 
made  by  R.  A.  Fisher  in  the  1920s  and  1930s. 
This  entry  provides  a  brief  overview  of  the  rich 
collection  of  methods  for  the  analysis  of  categori¬ 
cal  data  available  today.  First,  a  description  of 
different  types  of  categorical  data  is  given.  Second, 
various  ways  to  analyze  categorical  data  are 
summarized. 
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Typology  of  Categorical  Data 

The  most  important  differentiation  of  categorical 
data  distinguishes  between  nominal  and  ordinal 
variables.  The  values  of  a  nominal  variable  identify 
categories  that  do  not  have  a  natural  rank  order. 
Examples  are  ethnicity  (“Black,”  “White,” 
“Hispanic,”  etc.)  or  marital  status  (“single,”  “mar¬ 
ried,”  “divorced,”  “widowed”).  Methods  using 
nominal  data  should  not  depend  on  the  numerical 
values  assigned  to  the  categories.  The  categories  of 
ordinal  data,  in  contrast,  possess  a  natural  rank¬ 
ing.  Examples  are  social  class  (“lower,”  “middle,” 
and  “upper”),  agreement  with  the  statement 
“Democracies  are  indecisive”  (“strongly  agree,” 
“agree,”  “disagree,”  and  “strongly  disagree”),  or 
self-reported  political  left-right  orientation  (on  a 
scale  from  1  =  left  to  10  =  right).  Methods  using 
ordinal  data  take  into  account  the  order  of  the 
values,  but  the  exact  spacing  does  not  matter.  The 
distinction  between  nominal  and  ordinal  variables 
is  not  always  clear.  For  example,  political  parties 
can  be  ranked  on  a  left-right  scale,  but  the  order 
may  be  ambiguous  for  some  parties  (because  they 
may  rank  differently  on  different  policies). 
Moreover,  whether  a  variable  is  treated  as  ordinal 
can  depend  on  the  research  question. 

A  further  distinction  is  made  between  qualitative 
and  quantitative  data.  Some  authors  use  the  former 
as  a  synonym  for  categorical  data.  However,  while 
nominal  data  are  clearly  qualitative,  ordinal  data 
are  an  intermediate  type,  often  treated  as  quantita¬ 
tive  in  practice.  Moreover,  quantitative  data  can 
sometimes  be  treated  as  categorical.  Quantitative 
data  are  either  continuous  or  discrete.  Continuous 
variables  can  take  on  any  real  value  in  a  given  inter¬ 
val,  whereas  discrete  variables  are  restricted  to  a 
fixed  set  of  distinct  values  (e.g.,  integers).  Discrete 
quantitative  variables  are  often  considered  as  cate¬ 
gorical  when  the  range  of  observed  values  is  small. 
Count  data  are  an  example. 

Categorical  variables  can  be  divided  into  dichot¬ 
omous  variables  that  only  have  two  possible  out¬ 
comes  (“yes”/“no”;  “male”/“female”)  and  polyto- 
mous  variables  with  more  than  two  categories 
(“Christian,”  “Muslim,”  “Hindu,”  “Buddhist,” 
“Jewish,”  etc.).  A  dichotomous  variable  that  is 
coded  0  and  1  is  called  an  indicator  or  binary  vari¬ 
able.  Dichotomous  variables  are  often  technically 
and  conceptually  easier  to  handle  than  polytomous 
variables. 


More  of  a  philosophical  question,  which  also 
was  the  root  of  the  dispute  between  Pearson  and 
Yule,  is  whether  categorical  variables  should  be 
treated  as  inherently  categorical  (Yule’s  view)  or 
whether  they  can  be  conceptualized  as  coarse¬ 
grained  manifestations  of  an  underlying  continuum 
(Pearson’s  position).  The  distinction,  however,  has 
practical  consequences  for  how  categorical  data 
are  modeled  and  analyzed.  Both  views  have  their 
justifications,  but  in  some  cases,  it  is  more  natural 
to  assume  fixed  categories  (for  nominal  variables 
such  as  gender  or  nationality),  whereas  in  others 
the  assumption  of  an  underlying  continuum,  or  a 
so-called  latent  variable,  is  more  applicable  (for 
ordinal  data,  in  particular). 

A  strong  case  for  the  latent  variable  approach 
can  be  made  for  hybrid  variables  where  a  continu¬ 
ous  variable  is  only  partially  observed.  For  exam¬ 
ple,  think  of  a  variable  classifying  households  by 
income;  for  each  household,  the  range,  but  not  the 
exact  value,  of  income  is  known.  For  such  interval 
data,  an  underlying  continuum  exists,  and  thus  it 
is  natural  to  model  the  data  using  a  latent  variable. 
Truncated  or  censored  variables  (sometimes  called 
limited  variables)  are  a  related  type  of  hybrid  vari¬ 
able,  where  the  continuous  data  are  only  observed 
under  certain  conditions  or  within  a  certain  range 
(e.g.,  top-coded  earnings).  A  somewhat  different 
type  of  hybrid  data  is  time-to-event  data  (survival 
data),  where  a  categorical  variable  (the  type  of 
event)  is  tied  to  a  quantitative  variable — the  dura¬ 
tion  until  the  event  occurs,  possibly  censored  for 
some  observations  (e.g.,  the  timing  of  military 
interventions  in  conflicts  or  the  career  paths  of 
politicians). 

Categorical  Data  Analysis 

Depending  on  research  interest  and  data  type,  a 
variety  of  categorical  data  measures  and  methods 
are  available. 

Describing  Distributions  of  Categorical  Variables 

One  may  first  wish  to  describe  the  relative  fre¬ 
quencies  (proportions)  of  the  different  categories 
of  a  categorical  variable.  For  example,  in  the  con¬ 
text  of  election  forecasts,  one  predicts  different 
parties’  shares  of  the  vote  by  polling  a  random 
sample  of  voters.  If  data  are  available  for  the  entire 
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population,  the  proportions  can  be  derived  by 
simple  counting.  Usually,  however,  the  shares  have 
to  be  inferred  from  a  random  sample.  The  theo¬ 
retical  distribution  used  to  model  statistical  prop¬ 
erties  of  dichotomous  variables  in  random  samples 
is  the  binomial  distribution ,  which  can  be  approx¬ 
imated  by  a  normal  distribution  for  large  samples. 
For  polytomous  variables,  the  closely  related  mul¬ 
tinomial  distribution  is  used.  Based  on  these  distri¬ 
butions,  exact  and  approximate  (i.e.,  large  sample) 
methods  have  been  developed  to  compute  confi¬ 
dence  intervals  for  proportions  and  to  test  whether 
an  observed  distribution  deviates  significantly 
from  a  given  theoretical  distribution  of  interest 
(goodness-of-fit  test).  The  most  common  (multino¬ 
mial)  goodness-of-fit  test  is  the  chi-squared  test, 
which  is  based  on  the  squared  differences  between 
the  observed  category  counts  and  the  counts  that 
are  to  be  expected  (on  average)  according  to  the 
theoretical  distribution.  For  ordinal  variables,  the 
cumulative  distribution  function  (which  is  a  step 
function)  and  tests  that  exploit  this  additional 
information,  such  as  the  discrete  Kolmogorov- 
Smirnov  test,  are  of  interest. 

For  nominal  data,  a  sensible  measure  of  loca¬ 
tion  is  the  mode  (most  frequent  category).  For 
ordinal  variables,  the  median  is  also  useful  (the 
middle  of  the  ordered  data),  as  well  as  the  arithme¬ 
tic  mean  if  the  differences  among  the  categories  are 
assumed  to  be  comparable.  A  measure  of  variation 
for  nominal  data  is,  for  example,  Simpson’s  index 
of  diversity,  which  reaches  its  maximum  if  the  dis¬ 
tribution  is  uniform  (an  equal  frequency  of  each 
category).  For  ordinal  data,  the  range  and  mea¬ 
sures  based  on  differences  between  quantiles,  such 
as  the  interquartile  range  (range  of  middle  half  of 
data),  are  also  meaningful. 

Association  Among  Categorical  Variables 

For  research  questions  examining  relationships 
between  categorical  variables,  the  joint  distribu¬ 
tion  of  the  variables  can  be  analyzed  using  a  cross- 
classification  table  ( contingency  table).  The  cells  of 
such  a  two-way  table  report  the  frequencies  or 
proportions  of  the  different  combinations  of  the 
variables’  categories.  It  is  often  useful  to  condition 
on  row  or  column  totals  so  that  the  differences 
between  conditional  distributions  can  easily  be 
identified.  For  example,  think  of  a  table  with  party 


preference  in  rows  and  ethnicity  in  columns. 
Proportions  conditioned  on  column  totals  (col¬ 
umn  percent)  would  then,  in  each  column,  reflect 
the  party  preference  distribution  for  a  specific 
ethnicity. 

To  test  whether  the  data  provide  support  for 
the  hypothesis  that  the  conditional  distributions 
across  rows  or  columns  are  different  or,  equiva¬ 
lently,  whether  the  two  variables  are  associated 
with  each  other,  researchers  often  use  the  chi- 
squared  homogeneity  or  independence  test,  which 
is  based  on  the  squared  differences  between 
observed  cell  counts  and  the  cell  counts  that 
would  be  expected  if  the  two  variables  were  unre¬ 
lated.  Since  the  chi-squared  test  may  be  biased  in 
small  samples  (especially  if  there  are  many  cells 
with  low  expected  frequency),  exact  tests  have 
also  been  developed  (Fisher’s  exact  test  for  2x2 
tables  and  its  generalizations  to  n  x  m  tables). 

The  chi-squared  test  statistic  depends  on  the 
number  of  observations  and  the  size  of  the  table 
and  is  therefore  not  a  very  useful  association  mea¬ 
sure.  However,  two  popular  chi-squared-based 
measures  that  can  be  compared  across  tables  and 
samples  of  different  size  are  Pearson’s  contingency 
coefficient  and  Cramer’s  V.  A  second  class  of  asso¬ 
ciation  measures  for  nominal  data  is  based  on  the 
concept  of  proportional  reduction  of  error  (PRE). 
PRE  measures  quantify  the  degree  to  which  knowl¬ 
edge  of  an  observation’s  value  for  one  variable  can 
be  exploited  to  predict  the  value  of  the  other  (or, 
more  precisely,  to  which  degree  the  prediction 
error  can  be  reduced).  Different  conceptualizations 
of  prediction  errors  lead  to  different  PRE  mea¬ 
sures.  Examples  include  Guttman’s  lambda, 
Goodman  and  Kruskal’s  tau,  or  the  uncertainty 
coefficient  (likelihood  ratio  index).  If  variables  are 
strongly  associated,  then  one  variable  provides  a 
lot  of  information  for  the  prediction  of  the  other, 
and  the  PRE  measures  are  larger.  In  the  case  of 
two  dichotomous  variables  (2x2  tables),  addi¬ 
tional  important  measures  are  the  odds  ratio  (the 
ratio  of  cross-  products  of  frequencies  in  the  main 
and  secondary  diagonal)  and  the  phi  coefficient 
(point  correlation  coefficient). 

The  measures  discussed  so  far  can  be  used  for 
any  type  of  categorical  variable.  For  ordinal  vari¬ 
ables,  additional  association  measures  that  indi¬ 
cate  the  direction  of  a  relation  have  been  proposed. 
Examples  are  Goodman  and  Kruskal’s  gamma, 
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Kendall’s  tau-b,  or  Somers’  d,  which  can  be 
expressed  in  terms  of  the  numbers  of  concordant 
and  discordant  pairs.  An  alternative  approach  is  to 
assign  scores  that  roughly  reflect  the  distances 
between  categories  and  then  compute  the  linear 
correlation.  Assigning  ranks  to  observations, 
instead  of  scores  (using  midranks  for  observations 
in  the  same  category),  leads  to  the  rank  correlation 
(Spearman’s  rho). 

Association  measures  are  useful  to  describe  the 
relation  between  two  variables.  A  more  general 
framework  to  uncover  association  patterns  among 
two  or  more  categorical  variables  is  provided  by 
loglinear  models.  The  basic  idea  of  loglinear  mod¬ 
els  is  to  express  the  cell  counts  in  contingency 
tables  as  a  function  of  the  categories  defining  the 
cells.  The  basic  independence  model  contains  one 
parameter  for  each  category  of  each  variable  and 
simply  models  the  marginal  distributions  of  the 
variables.  Association  patterns  are  then  captured 
by  additional  parameters  for  combinations  of  cat¬ 
egories  across  variables  (interaction  terms).  A  vast 
literature  on  loglinear  models  was  developed  from 
the  1960s  through  the  1980s,  but  the  methodology 
seems  to  be  somewhat  less  in  use  today. 

Other  multivariate  methods  for  categorical  data 
include  (multiple)  correspondence  analysis  and 
latent  class  analysis,  which  are  similar  to  principal 
component  analysis  and  factor  analysis  but  for 
categorical  data.  Latent  structure  models  with  cat¬ 
egorical  observed  and  continuous  latent  variables 
are  covered  by  latent  trait  analysis  ( item  response 
theory ). 

Further  special  topics  in  the  analysis  of  the  asso¬ 
ciation  between  categorical  variables  are  rater 
agreement  and  segregation.  The  degree  to  which 
raters  agree  with  respect  to  a  categorical  rating  or 
classification  can  be  analyzed  using  loglinear  mod¬ 
els,  but  specialized  measures  such  as  Cohen’s 
kappa  are  also  available.  Segregation  analysis  asks 
how  group  members  are  distributed  among  the 
values  of  a  categorical  variable  (occupational  sex 
segregation,  racial  residential  segregation)  and  has 
its  own  specialized  measures,  such  as  Duncan  and 
Duncan’s  index  of  dissimilarity. 

Explaining  Categorical  Outcomes 

Methods  for  contingency  tables  are  often 
exploratory  and  descriptive  in  the  sense  that  they 


do  not  distinguish  between  dependent  and  explan¬ 
atory  variables.  To  analyze  research  questions  that 
make  assertions  about  causal  relations,  regression- 
type  statistical  methods  are  required.  Explanatory 
variables  in  such  models  can  be  continuous  or 
categorical.  Regression  coefficients  for  binary 
explanatory  variables  simply  reflect  conditional 
group  differences,  and  categorical  predictors  with 
more  than  two  categories  (called  factors)  can  easily 
be  included  in  a  regression  equation  by  means  of 
indicator  variables  for  the  different  categories. 

First,  assume  the  dependent  variable  to  be 
dichotomous  (binary).  For  example,  think  of  ana¬ 
lyzing  union  membership  of  employees  (1  =  yes, 
0  =  no)  as  a  function  of  variables  such  as  sex,  age, 
or  political  orientation.  The  expected  value  of  a 
binary  variable  is  equal  to  the  probability  of 
Outcome  1.  As  such,  standard  linear  regression 
methods  that  model  the  conditional  expectation 
can  be  applied.  This  linear  probability  model 
(LPM)  has  the  advantage  that  regression  coeffi¬ 
cients  can  directly  be  interpreted  as  partial  effects 
on  the  probability  of  Outcome  1.  However,  the 
LPM  is  not  always  appropriate  (e.g.,  assuming  a 
linear  relation  between  the  explanatory  variables 
and  the  outcome  probability  is  usually  unreason¬ 
able),  and  dedicated  binary  response  models  such 
as  the  logistic  regression  (logit  model)  or  the  probit 
model  have  been  developed.  In  the  logit  model,  the 
log  odds,  the  logarithm  of  the  odds  of  Outcome  1, 
are  expressed  as  a  linear  function  of  the  explana¬ 
tory  variables.  In  the  probit  model,  the  probability 
of  Outcome  1  is  linked  to  the  linear  predictor 
through  the  cumulative  normal  distribution. 
Although  the  interpretation  of  the  coefficients  is 
somewhat  more  difficult,  these  models  are  gener¬ 
ally  considered  more  appropriate  than  the  LPM  to 
analyze  a  binary  dependent  variable. 

Multinomial  logit  or  probit  models  are  general¬ 
izations  of  these  models  to  nominal  dependent 
variables  with  more  than  two  outcome  categories. 
Multinomial  models  contain  a  separate  set  of  coef¬ 
ficients  for  each  outcome,  allowing  for  complex 
effect  patterns  of  the  explanatory  variables  on  the 
outcome  probabilities.  Likewise,  ordinal  logit  or 
probit  models  are  used  to  analyze  ordinal  depen¬ 
dent  variables,  although  these  models  usually  con¬ 
tain  only  one  set  of  coefficients  plus  a  number  of 
cutoff  parameters  that  reflect  the  latent  spacing  of 
the  categories.  More  flexible  variants  such  as  the 
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stereotype  logistic  regression  or  the  generalized 
ordered  logit  model  are  also  available.  Powerful 
extensions  to  multinomial  models  are  choice  mod¬ 
els  that  are  based  on  data  containing  alternative- 
specific  information.  The  classic  example  is  travel 
mode  choice,  with  information  on  characteristics 
such  as  costs,  comfort,  and  travel  duration  for  each 
mode.  Such  models  include  the  conditional  logit 
and  the  alternative-specific  multinomial  probit.  A 
related  model  is  the  nested  logit,  which  is  used  if  the 
alternatives  can  be  grouped  according  to  a  nested 
structure  (e.g.,  restaurants  grouped  by  cuisine). 

As  mentioned  above,  one  can  treat  count  data 
as  categorical,  particularly  when  counts  are  low. 
The  most  prominent  regression  procedure  for 
count  data  is  the  Poisson  regression  that  models 
event  counts  as  a  Poisson  process.  However, 
because  the  inherent  assumption  of  events’  inde¬ 
pendence  is  often  too  restrictive,  more  flexible 
models  such  as  the  negative  binomial  regression 
are  generally  preferred.  Further,  an  overrepresen¬ 
tation  of  zeros  is  often  observed  in  count  data, 
which  can  be  handled  by  a  zero-inflated  count 
model  or,  depending  on  the  assumed  process,  a 
hurdle  model  that  combines  a  binary  response 
model  with  a  zero-truncated  count  model.  Note 
that  regression  models  for  binary  variables  and  for 
count  variables  are  both  covered  in  the  framework 
of  generalized  linear  models  (GLMs). 

Limited  dependent  variables  are  analyzed  with 
methods  closely  related  to  binary  response  models. 
The  most  well-known  is  the  censored  normal 
regression  (tobit  model),  which  is  based  on  the 
concept  of  a  latent  variable  with  normally  distrib¬ 
uted  errors,  as  is  the  probit  model.  A  natural 
extension  are  sample  selection  models  (e.g.,  the 
Heckman  model),  which  explicitly  model  the  pro¬ 
cess  causing  the  censoring.  Sample  selection  mod¬ 
els  have  also  been  developed  for  the  case  of  a  fully 
categorical  dependent  variable. 

Another  closely  related  field  is  event  history 
analysis  (survival  analysis),  which  is  concerned 
with  the  explanation  of  the  duration  until  the 
occurrence  of  an  event.  Time-to-event  data  are 
often  modeled  in  terms  of  the  hazard  rate:  the  con¬ 
ditional  probability  of  an  event  at  a  certain  time 
point  given  that  the  event  has  not  yet  occurred. 
Accelerated  failure  time  models,  which  are  similar 
to  the  tobit  model,  can  also  be  used.  Further,  dis¬ 
crete-time  hazard  models  are  used  if  the  duration  is 


categorical.  These  models  apply  a  binary  response 
model  to  the  expanded  data  containing  one  record 
per  time  point  for  each  observation.  Another  topic 
in  event  history  analysis  is  sequence  analysis,  used 
to  identify  and  describe  typical  patterns  of  event 
sequences  (e.g.,  career  paths  of  politicians). 

Discrimination  and  classification  is  a  further 
area  in  categorical  data  analysis  that  has  some 
resemblance  to  the  regression  methods  discussed. 
A  key  difference,  however,  is  that  discriminant 
analysis  makes  no  assertions  about  direction  of 
causation.  It  is  an  exploratory  tool  to  evaluate  how 
well  groups  can  be  distinguished  with  respect  to  a 
set  of  observed  characteristics  and  to  derive  opti¬ 
mal  rules  for  classifying  new  observations  for 
which  group  membership  is  unknown. 

As  outlined  in  this  entry,  a  well-established  and 
diversified  toolbox  exists  for  the  analysis  of  cate¬ 
gorical  data.  Moreover,  special  areas  such  as  cat¬ 
egorical  panel  data  and  multilevel  models,  exact 
inference  for  binary  response  and  count  data  mod¬ 
els,  or  nonparametric  estimation  have  made  great 
advances  over  the  past  few  decades. 
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Caudillismo 


Caudillismo  refers  to  a  system  of  political-social 
domination  that  arose  after  the  independence 
wars  in  Spanish  America.  The  caudillo  (strong¬ 
man — from  the  Latin  word  capitellum ,  meaning 
head)  was  the  head  of  irregular  forces  who  ruled 
a  politically  distinctive  territory.  These  forces 
were  governed  through  an  informal  system  of  sus¬ 
tained  obedience  based  on  paternalistic  relations 
to  the  leader,  who  attained  his  position  as  a  result 
of  his  forceful  personality  and  charisma. 
Caudillismo  as  a  concept  was  first  used  in  Spanish 
America  to  describe  the  characteristics  of  leaders 
who  challenged  the  authority  of  the  governments 
arising  from  the  independence  process  after  1810 
and  to  refer  to  the  political  regimes  instated  by 
such  leaders.  In  this  limited  meaning,  the  notion 
of  caudillismo  is  a  heuristic  instrument  for  analyz¬ 
ing  a  given  historical  period  that  started  after  the 
wars  of  independence  and  concluded  with  the 
emergence  of  the  national  states  in  the  second  half 
of  the  19th  century.  John  Lynch,  one  of  the  histo¬ 
rians  who  made  the  greatest  efforts  to  define  the 
characteristics  and  attributes  of  Latin  America’s 
caudillos  and  their  political  leadership  conditions, 
stated  that  caudillismo  was  the  image  of  society 
and  that  caudillos  were  its  creatures.  This  entry  is 
a  review  of  different  interpretations  of  caudi- 
llismo,  its  origins,  and  its  forms  of  leadership. 

The  terms  caudillismo  and  caudillo  continued  to 
be  used  after  the  conditions  that  gave  rise  to  “clas¬ 
sical  caudillismo”  had  disappeared.  The  extension 
of  the  notion  to  encompass  any  kind  of  personal¬ 
ized  leadership  that  exercises  power  in  an  arbitrary 
manner  within  a  context  of  fragility  or  crisis  of  the 
political  institutions  is  not  that  relevant  to  critical 
qualitative  research.  Caudillismo  is  used  to  desig¬ 
nate  and  also  stigmatize  the  governments  of 
“strongmen”  with  no  contextual  reference. 

Caudillismo  After  Independence 
in  Spanish  America 

We  will  now  elaborate  on  the  limited  meaning  of 
caudillismo  restricted  to  the  framework  of  the 
struggles  to  control  power  that  followed  indepen¬ 
dence  in  Spanish  America.  In  1845,  the  book 
Facundo,  written  by  Domingo  Faustino  Sarmiento, 


provided  the  classical  interpretation  of  caudillismo 
in  Spanish  America  in  the  1800s.  From  that  stand¬ 
point,  caudillismo  is  the  expression  of  political 
barbarism  and  the  antithesis  of  a  government  that 
ensures  security,  freedom,  and  ownership  rights  for 
a  country’s  inhabitants.  Sarmiento’s  book  is  a  por¬ 
trait  of  Facundo  Quiroga,  the  “Tiger  of  the  Plains,” 
a  caudillo  in  the  first  half  of  the  19th  century.  In 
Quiroga,  Sarmiento  believes  that  he  sees  the  incar¬ 
nation  of  the  antinomy  of  civilization  and  barba¬ 
rism  faced  by  the  peoples  of  the  Americas  as  a  result 
of  their  revolutionary  experience  and  geography — 
the  desert — which  had  turned  violence  into  a  life¬ 
style.  Quiroga  in  the  Argentine  pampas  and  Jose 
Antonio  Paez  in  the  Venezuela  prairies  represent 
caudillismo  as  a  system  of  government  and  type  of 
political  leadership.  Physical  vigor,  spontaneous 
cruelty,  and  the  rusticity  inherent  in  the  rural  world 
they  come  from  can  account  for  the  despotism  of 
the  regimes  they  represent.  Antonio  Lopez  de  Santa 
Anna  from  Mexico  appears  as  the  “Attila  of  civili¬ 
zation”  and  Juan  Manuel  de  Rosas  as  the  “River 
Plate  Caligula”  of  Argentina. 

Latin  American  historiography  and  the  contri¬ 
butions  of  European  and  U.S.  studies  revealed 
practically  unknown  facets  of  the  caudillismo  phe¬ 
nomenon  in  the  postindependence  period,  which 
led  to  a  better  understanding  of  an  era  obscured  by 
myths  and  legends.  The  caudillo  figures  started  to 
appear  with  more  nuances  than  in  the  view  of  their 
contemporaries,  and  it  was  noted  that  their  gov¬ 
ernments,  in  many  cases,  adapted  traditional  legal¬ 
ity  within  the  emergence  of  a  new  context. 

The  origin  of  caudillismo  has  been  interpreted 
differently,  encompassing  factors  such  as  the  mili¬ 
tarization  of  politics  as  a  result  of  the  independence 
wars,  the  absence  of  formal  rules  after  the  collapse 
of  the  colonial  order,  the  ruralization  of  power,  the 
importance  of  monarchic  tradition,  the  legacy  of 
authoritarianism  and  anarchism  from  the  Spaniards, 
or  the  characteristics  of  the  village  societies.  In 
Venezuela  and  the  River  Plate  area,  where  the  war 
against  the  royal  troops  was  more  radical  and  the 
whole  of  the  population  took  arms,  caudillismo 
developed  quicker  and  in  a  more  pronounced  man¬ 
ner.  Cattle  herders  and  gauchos,  layabouts  and 
bandits  joined  the  Montoneros  (an  unruly  group  of 
horsemen  who  fought  against  Spanish  colonization 
in  the  1800s,  usually  under  the  federalist  caudillos 
of  the  provinces  outside  Buenos  Aires),  which  were 
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truly  “informal  armies”  that  got  supplies  by  pillag¬ 
ing  and  acted  under  the  subordination  of  the 
bosses’  prestige. 

The  militarization  of  politics  and  society  that 
outlived  the  battles  for  independence  linked  caudi¬ 
llismo  to  military  power  and  political  competition 
with  armed  struggles.  The  caudillo  was  first  a  war¬ 
rior,  during  wars  of  liberation,  civil  wars,  and 
national  wars;  the  caudillo  was  the  strongman  who 
could  recruit  troops  and  protect  his  people,  as  John 
Lynch  outlined.  Studies  on  Latin  American  caudi¬ 
llismo  in  the  1950s  considered  it  a  variable  of 
authoritarianism  and  militarism  in  response  to 
anarchy.  Agustin  Gamarra  of  Peru,  known  as 
“Cuzco’s  black  angel,”  was  seen  as  a  prototype  of 
military  leaders  who,  under  different  guises,  domi¬ 
nated  politics  in  postindependence  Spanish  America. 
The  militias  constituted  the  core  of  Gamarrismo  (a 
militaristic  state  based  on  the  [invented]  tradition 
of  the  Incas).  As  pointed  out  by  Charles  Walker, 
the  militias  controlled  local  society  by  monitoring 
any  threat  of  opposition,  and  they  also  served  as 
veritable  military  academies  and  vehicles  of  social 
mobility.  High-ranking  positions  in  the  militias 
represented  important  avenues  for  political  and 
economic  gain.  In  Mexico  and  Peru,  professional 
military  men  played  an  important  role  in  the  polit¬ 
ical  process  as  pressure  groups.  In  other  countries, 
the  military  organization  of  the  end  of  the  colonial 
period  was  swept  away  by  the  wars  of  indepen¬ 
dence,  although  with  different  outcomes. 
Nonetheless,  some  military  heads  were  prevailing 
figures — for  instance,  Francisco  de  Paula  Santander 
in  Nueva  Granada  (present-day  Colombia),  Juan 
Jose  Flores  in  Ecuador,  and  Paez  in  Venezuela  and 
Andres  de  Santa  Cruz  in  Bolivia.  As  Walker  notes, 
the  analysis  of  caudillismo  needs  to  shift  its  atten¬ 
tion  from  the  battlefield  and  to  reconsider  the  state, 
as  middling  officials  such  as  subprefects  and  militia 
leaders  proved  more  important  for  Gamarra  than 
military  officers  and  guerrillas. 

Caution  is  needed  with  respect  to  generalizations 
about  Spanish  America’s  politics  during  the  half- 
century  following  independence  because  of  the  dis¬ 
tinctive  institutional  legacies  of  colonialism,  as  well 
as  differences  in  geographical  and  ethnic  composi¬ 
tion  between  countries  where  the  indigenous  popu¬ 
lation  prevails  and  is  only  partly  assimilated  into 
Spanish  culture  and  countries  with  a  Creole  popula¬ 
tion  mainly  integrated  into  Spanish  culture.  The 
church  and  the  army  were  two  big  corporations 


during  colonial  times,  with  an  institutional  and  eco¬ 
nomic  preponderance  in  Mexico  not  found  in  other 
regions.  This  was  an  important  source  of  divergence 
between  Catholics  and  anticlericals  in  Mexico. 
Conflicts  between  rural  and  urban  areas,  between 
federalism  and  centralism,  and  between  rival  trends 
in  trade  policies — “protectionism”  and  “liberal¬ 
ism” — all  hindered  the  consolidation  of  a  stable 
national  policy.  The  duration  and  pace  of  the  transi¬ 
tion  to  postcolonial  order  varied  between  countries, 
as  did  the  role  of  the  regional  or  national  caudillos. 
In  all  cases,  however,  the  constant  feature  is  loyalty 
to  caudillos,  most  of  whom  had  no  defined  ideolo¬ 
gies  and  fluctuated  between  contradictory  positions, 
whether  unitary  or  federal,  conservative  or  liberal, 
or  favoring  protectionism  or  free  trade. 

Tulio  Halperfn  Donghi  explored  the  relation¬ 
ship  between  militarization  and  democratization 
that  is  at  the  heart  of  the  promotion  of  the  caudillo 
power.  Social  promotion  opportunities  arose  after 
the  rupture  of  colonial  order.  Agustin  de  Iturbide, 
the  “constitutional  emperor  of  Mexico,”  belonged 
to  a  poor  family;  Gamarra  and  Ramon  Castilla, 
both  from  Peru,  were  Creoles.  They  all  reached 
positions  that  would  have  before  been  inaccessible 
to  them,  but  this  relative  liberal  opening  was  an 
instrument  to  impede  the  excessive  dissemination 
of  popular  participation  within  a  context  in  which 
power  legitimacy  was  always  questioned. 

In  the  middle  of  the  1970s,  Eric  Wolf  and  Edgar 
Hansen  defined  caudillismo  as  a  subcategory  of 
“clientelism.”  These  authors  believe  that  caudi¬ 
llismo  refers  to  the  quest  for  power  and  wealth 
through  violent  conquest  that  establishes  a  link 
between  protector  and  protected  in  an  unstable 
society  lacking  the  institutional  channels  through 
which  people  can  compete  for  these  social  and 
financial  advantages.  John  Lynch  also  states  that 
the  patron-client  relationship  was  the  essential 
link  that  was  finally  built  into  the  state  and  became 
the  model  of  caudillismo.  Alan  Knight’s  more 
recent  view  underlines  the  relevance  of  traditional 
authority  in  village  societies,  combined  with  ele¬ 
ments  of  charisma,  to  understand  the  command- 
obedience  relationship  inherent  in  caudillismo. 

The  notion  of  caudillismo  encompasses  different 
leadership  subcategories,  for  instance,  the  “sword 
caudillos,”  the  warlords  who  dominated  during  a 
period  of  great  instability  when  popular  support 
and  force  were  decisive.  Pacification  and  progres¬ 
sive  institutionalization  of  power  relations  were 


Causality  203 


carried  out  by  new  leader  profiles,  the  “tame  caudi- 
llos,”  who  were  deemed  to  be  the  arbitrators  capa¬ 
ble  of  achieving  national  unification  and  building 
more  stable  institutions.  Rosas’s  government 
(1829-1852)  is  an  example  of  a  new  modality. 
Sarmiento  believed  that  was  the  end  of  caudillismo 
since  it  combined  elements  of  barbarism  with 
elements  of  civilization.  Cruelty  became  more 
sophisticated,  the  press  and  the  legislature  replaced 
barbarian  passion,  and  strong  personalism  allowed 
for  the  exertion  of  influence  on  the  political  orienta¬ 
tion  of  other  River  Plate  provinces,  although 
Rosas’s  powers  were  not  included  in  a  constitu¬ 
tional  system.  In  Mexico,  Benito  Juarez,  a  lawyer 
born  into  a  poor  family,  used  the  1857  Constitution 
to  take  on  emergency  powers  and  govern  in  an 
authoritarian  manner,  constitutional  but  cruel. 
Diego  Portales’s  constitutional  authoritarianism 
sought  support,  as  he  himself  defined,  in  “the 
weight  of  night,”  in  the  acquiescence  of  the  masses, 
passive  and  ignorant.  A  prevailing  political  figure  in 
Chile  in  the  1830s,  Portales  was  a  trader  and 
omnipotent  minister  but  never  took  office  as  presi¬ 
dent.  From  Sarmiento’s  viewpoint,  Portales  was  an 
example  of  an  illustrated  caudillo.  The  use  of  the 
term  caudillo  was  extended  to  encompass  authori¬ 
tarian  leaders  such  as  Juarez  or  Portales,  who  gov¬ 
erned  within  the  framework  of  constitutions. 
According  to  Lynch,  the  “impenetrable  dictator¬ 
ship”  of  Dr.  Jose  Francia  in  Paraguay  combined 
despotism  and  isolation  for  3  decades  in  which  he 
ensured  the  basic  needs  of  a  Creole  society. 

The  notion  of  caudillismo  makes  possible  a  bet¬ 
ter  understanding  of  the  political  processes  that 
took  place  during  that  period  since  interpretation 
of  the  past  cannot  ignore  the  categories  used  by 
their  contemporaries. 
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Causality 


Causality  refers  to  the  relationship  between  events 
where  one  set  of  events  (the  effects)  is  a  direct  con¬ 
sequence  of  another  set  of  events  (the  causes). 
Causal  inference  is  the  process  by  which  one  can 
use  data  to  make  claims  about  causal  relation¬ 
ships.  Since  inferring  causal  relationships  is  one  of 
the  central  tasks  of  science,  it  is  a  topic  that  has 
been  heavily  debated  in  philosophy,  statistics,  and 
the  scientific  disciplines.  This  entry  reviews  the 
models  of  causation  and  tools  for  causal  inference 
most  prominent  in  the  social  sciences,  including 
regularity  approaches,  associated  with  David 
Hume,  and  counterfactual  models,  associated  with 
Jerzy  Splawa-Neyman,  Donald  Rubin,  and  David 
Lewis,  among  many  others.  One  of  the  most 
notable  developments  in  the  study  of  causation  is 
the  increasing  unification  of  disparate  methods 
around  a  common  conceptual  and  mathematical 
language  that  treats  causality  in  counterfactual 
terms — that  is,  the  Neyman-Rubin  model.  This 
entry  discusses  how  counterfactual  models  high¬ 
light  the  deep  challenges  involved  in  making  the 
move  from  correlation  to  causation,  particularly  in 
the  social  sciences,  where  controlled  experiments 
are  relatively  rare. 

Regularity  Models  of  Causation 

Until  the  advent  of  counterfactual  models,  causa¬ 
tion  was  primarily  defined  in  terms  of  observable 
phenomena.  It  was  the  philosopher  Hume  in  the 
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18th  century  who  began  the  modern  tradition  of 
regularity  models  of  causation  by  defining  causa¬ 
tion  in  terms  of  repeated  “conjunctions”  of  events. 
In  An  Enquiry  Into  Human  Understanding  (1751), 
Hume  argued  that  the  labeling  of  two  particular 
events  as  being  causally  related  rested  on  an  untest- 
able  metaphysical  assumption.  Consequently, 
Hume  (1739)  argued  that  causality  could  be  ade¬ 
quately  defined  only  in  terms  of  empirical  regulari¬ 
ties  involving  classes  of  events.  He  asked,  “How 
could  we  know  that  a  flame  caused  heat?” — only 
by  calling  “to  mind  their  constant  conjunction  in 
all  past  instances.  Without  further  ceremony,  we 
call  the  one  cause  and  the  other  effect  and  infer  the 
existence  of  one  from  that  of  the  other”  (Treatise 
of  Human  Nature,  Book  1,  Pt.  3,  sec.  6).  In  the 
Enquiry,  Hume  argued  that  three  empirical  phe¬ 
nomena  were  necessary  for  inferring  causality: 
(1)  contiguity  (the  cause  and  effect  must  be  con¬ 
tiguous  in  time  and  space),  (2)  succession  (the 
cause  must  be  prior  to  the  effect),  and  (3)  constant 
conjunction  (there  must  be  a  constant  union 
between  the  cause  and  effect).  Under  this  frame¬ 
work,  causation  was  defined  purely  in  terms  of 
empirical  criteria,  rather  than  unobservable 
assumptions.  In  other  words,  Hume’s  definition  of 
causation  and  his  mode  of  inference  were  one  and 
the  same. 

John  Stuart  Mill,  who  shared  the  regularity 
view  of  causation  with  David  Hume,  elaborated 
basic  tools  for  causal  inference  that  were  highly 
influential  in  the  social  sciences.  For  Mill,  the  goal 
of  science  was  the  discovery  of  regular  empirical 
laws.  To  that  end,  Mill  proposed  in  his  1843  A 
System  of  Logic,  a  series  of  rules  or  “canons”  for 
inductive  inference.  These  rules  entailed  a  series  of 
research  designs  that  examined  whether  there 
existed  covariation  between  a  hypothesized  cause 
and  its  effect,  time  precedence  of  the  cause,  and  no 
plausible  alternative  explanation  of  the  effect 
under  study.  Mill  argued  that  these  research 
designs  were  effective  only  when  combined  with  a 
manipulation  in  an  experiment.  Recognizing  that 
manipulation  was  unrealistic  in  many  areas  of  the 
social  sciences,  Mill  expressed  skepticism  about 
the  possibility  of  causal  inference  for  questions  not 
amenable  to  experiments. 

The  most  widely  used  of  Mill’s  canons,  the 
“Direct  Method  of  Difference,”  entailed  the 
comparison  of  two  units  identical  in  all  respects 


except  for  some  manipulable  treatment.  The 
method  of  difference  involves  creating  a  counter- 
factual  control  unit  for  a  treated  unit  under  the 
assumption  that  the  units  are  exactly  alike  prior 
to  treatment,  an  early  example  of  counterfactual 
reasoning  applied  to  causal  inference.  Mill  (1884) 
stated  the  method  as  follows: 

If  an  instance  in  which  the  phenomenon  .  .  .  occurs 
and  an  instance  in  which  it  does  not .  .  .  have 
every  circumstance  save  one  in  common  .  .  .  [then] 
the  circumstance  [in]  which  the  two  instances 
differ  is  the  .  .  .  cause  or  a  necessary  part  of  the 
cause.  (Book  3,  chap.  8) 

The  weakness  of  this  research  design  is  that  in  prac¬ 
tice,  particularly  in  the  social  sciences,  it  is  very  dif¬ 
ficult  to  eliminate  all  heterogeneity  in  the  units 
under  study.  Even  in  the  most  controlled  environ¬ 
ments,  two  units  will  rarely  be  the  same  on  all  back¬ 
ground  conditions.  Consequently,  inferences  made 
under  this  method  require  strong  assumptions. 

Mill’s  and  related  methods  have  been  criticized 
on  a  variety  of  grounds.  His  canons  and  related 
designs  assume  that  the  relationship  between  cause 
and  effect  is  unique  and  deterministic.  These  condi¬ 
tions  allow  neither  for  more  than  one  cause  of  an 
effect  nor  for  interaction  among  causes.  The  assump¬ 
tion  that  causal  relationships  are  deterministic  or 
perfectly  regular  precludes  the  possibility  of  mea¬ 
surement  error.  If  outcomes  are  measured  with 
error,  as  they  often  are  in  the  social  sciences,  then 
methods  predicated  on  detecting  constant  conjunc¬ 
tions  will  fail.  Further,  the  causal  relationships  typi¬ 
cally  studied  in  the  social  and  biological  sciences  are 
rarely,  if  ever,  unique.  Causes  in  these  fields  are 
more  likely  to  have  highly  contingent  effects,  mak¬ 
ing  regular  causal  relationships  very  rare. 

Counterfactual  Models  of  Causation 

Regularity  models  of  causation  have  largely  been 
abandoned  in  favor  of  counterfactual  models. 
Rather  than  defining  causality  purely  in  reference 
to  observable  events,  counterfactual  models  define 
causation  in  terms  of  a  comparison  of  observable 
and  unobservable  events.  Linguistically,  counter- 
factual  statements  are  most  naturally  expressed 
using  subjunctive  conditional  statements  such  as 
“If  India  had  not  been  democratic,  periodic  famines 
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would  have  continued.”  Thus,  the  counterfactual 
approach  to  causality  begins  with  the  idea  that 
some  of  the  information  required  for  inferring 
causal  relationships  is  and  always  will  be  unob¬ 
served,  therefore  some  assumptions  must  be  made. 
In  stark  contrast  to  the  regularity  approach  of 
Hume,  the  fact  of  counterfactual  causation  is  fun¬ 
damentally  separate  from  the  tools  used  to  infer  it. 
As  a  result,  philosophers  like  Lewis  (1973)  could 
write  about  the  meaning  of  causality  with  little 
discussion  of  how  it  might  be  inferred.  It  was  stat¬ 
isticians,  beginning  with  Splawa-Neyman  in  1923 
and  continued  most  prominently  by  Rubin,  who 
began  to  clarify  the  conditions  under  which  causal 
inferences  were  possible  if  causation  was  funda¬ 
mentally  a  “missing-data  problem.” 

Counterfactual  Models  in  Philosophy 

In  philosophy,  counterfactual  models  of  causation 
were  largely  absent  until  the  1970s  due  to  Willard 
van  Orman  Quine’s  dismissal  of  the  approach  in 
his  Methods  of  Logic  (1950),  where  he  pointed  out 
that  counterfactual  statements  could  be  nonsensi¬ 
cal.  He  illustrated  this  point  by  his  famous  com¬ 
parison  of  the  conditional  statements  “If  Bizet  and 
Verdi  had  been  compatriots,  Bizet  would  have 
been  Italian”  and  “If  Bizet  and  Verdi  had  been 
compatriots,  Verdi  would  have  been  French.”  For 
Quine,  the  incoherence  of  the  two  statements 
implied  that  subjective  conditionals  lacked  clear 
and  objective  truth  conditions.  Quine’s  suspicion 
of  conditional  statements  was  also  rooted  in  his 
skepticism  of  evaluating  the  plausibility  of  coun¬ 
terfactual  “feigned  worlds,”  as  he  explained  in 
Word  and  Object  (1960): 

The  subjunctive  conditional  depends,  like  indirect 
quotation  and  more  so,  on  a  dramatic  projection: 
we  feign  belief  in  the  antecedent  and  see  how 
convincing  we  then  find  the  consequent.  What 
traits  of  the  real  world  to  suppose  preserved  in  the 
feigned  world  of  the  contrary-to-fact  antecedent 
can  only  be  guessed  from  a  sympathetic  sense  of 
the  fabulist’s  likely  purpose  in  spinning  his  fable. 

(p.  222) 

Perhaps  because  of  this  view  of  counterfactuals, 
Quine  had  a  dim  view  of  the  concept  of  causality. 
He  argued  that  as  science  advanced,  vague  notions 


of  causal  relationships  would  disappear  and  be 
replaced  by  Humean  “concomitances” — that  is, 
regularities. 

In  philosophy,  Lewis  popularized  the  counter- 
factual  approach  to  causality  50  years  after  it  first 
appeared  in  statistics  with  Splawa-Neyman’s  1923 
paper  on  agricultural  experiments.  For  Lewis, 
Quine’s  examples  revealed  problems  only  with 
vague  counterfactuals,  not  counterfactuals  in  gen¬ 
eral.  A  cause,  according  to  Lewis  in  his  1973  arti¬ 
cle  “Causation,”  was  “something  that  makes  a 
difference,  and  the  difference  it  makes  must  be  a 
difference  from  what  would  have  happened  with¬ 
out  it”  (p.  557).  More  specifically,  he  defined  cau¬ 
sality  in  terms  of  “possible”  (counterfactual) 
worlds.  He  postulated  that  one  can  order  possible 
worlds  with  respect  to  their  closeness  with  the 
actual  world.  Counterfactual  statements  can  be 
defined  as  follows: 

If  A  were  the  case,  C  would  be  the  case  is  true  in 
the  actual  world  if  and  only  if  (i)  there  are  no 
possible  A-worlds;  or  (ii)  some  A-world  where  C 
holds  is  closer  to  the  actual  world  than  is  any 
A-world  where  C  does  not  hold.  (p.  560) 

More  intuitively,  causal  inferences  arise  by  com¬ 
paring  the  actual  world  with  the  closest  possible 
world.  If  C  occurs  in  both  the  actual  and  the  clos¬ 
est  possible  world  without  A,  then,  according  to 
Lewis,  A  is  not  the  cause  of  C.  If,  on  the  other 
hand,  C  does  not  occur  in  the  closest  possible 
world  without  A,  then  A  is  a  cause  of  C.  Lewis’s 
theory  was  concerned  with  ontology,  not  episte¬ 
mology.  As  a  result,  one  might  argue  that  his  work 
has  limited  use  to  empirical  research  since  he  pro¬ 
vided  little  practical  guidance  on  how  one  could 
conjure  the  closest  possible  worlds  to  use  as  com¬ 
parison  cases.  Without  additional  assumptions, 
Lewis’s  model  suggests  that  causal  inference  is  a 
fruitless  endeavor  given  our  inability  to  observe 
nonexistent  counterfactual  worlds. 

Statistical  Models  of  Causation 

Fortunately,  statisticians  beginning  with  Splawa- 
Neyman  in  1923  elaborated  a  model  of  causation 
that  allowed  one  to  treat  causation  in  counterfactual 
terms  and  provided  guidance  on  how  empirical 
researchers  could  create  observable  counterfactuals. 
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Say  we  are  interested  in  inferring  the  effect  of  some 
cause  T  on  a  parameter  Y  of  the  distribution  of 
outcome  Y  in  population  A  relative  to  treatment  C 
(control).  Population  A  is  composed  of  a  finite 
number  of  units  and  YAT  is  simply  a  summary  of 
the  distribution  of  that  population  when  exposed 
to  T,  such  as  the  mean.  If  treatment  C  (control) 
were  to  be  applied  to  population  A,  then  we  would 
observe  YAC-  To  use  Lewis’s  terminology,  in  the 
actual  world,  we  observe  YAT,  and  in  the  counter- 
factual  world,  we  would  observe  YA  C.  The  causal 
effect  of  T  relative  to  C  for  population  A  is  a  mea¬ 
sure  of  the  difference  between  YA  T  and  Y/1C,  such 
as  Ya  t  —  Ya  c.  Of  course,  we  can  observe  only  the 
parameter  that  summarizes  the  actual  world  and 
not  the  counterfactual  world. 

The  key  insight  of  statistical  models  of  causa¬ 
tion  is  that  under  special  circumstances  we  can  use 
another  population,  B,  that  was  exposed  to  con¬ 
trol  to  act  as  the  closest  possible  world  of  A.  If  we 
believe  that  YA  C  =  YSiC,  then  we  no  longer  need  to 
rely  on  an  unobserved  counterfactual  world  to 
make  causal  inferences;  we  can  simply  look  at  the 
difference  between  the  observed  YAT  and  YBC.  In 
most  cases  YAC  /  Y/i  C.  however,  so  any  inferences 
made  by  comparing  the  two  populations  will  be 
confounded.  What  are  the  special  circumstances 
that  allow  us  to  construct  a  suitable  counterfactual 
population  and  make  unconfounded  inferences? 
As  discussed  below,  the  most  reliable  method  is 
through  randomization  of  treatment  assignment, 
but  counterfactual  inferences  with  observational 
data  are  possible — albeit  more  hazardous — as 
well.  In  either  case,  causes  are  defined  in  reference 
to  some  real  or  imagined  intervention ,  which 
makes  the  counterfactuals  well-defined. 

The  Neyman-Rubin  Model 

The  counterfactual  model  of  causation  in  statis¬ 
tics  originated  with  Splawa-Neyman’s  1923 
model,  which  is  nonparametric  for  a  finite  number 
of  treatments,  where  each  unit  has  a  potential 
outcome  for  each  possible  treatment  condition.  In 
the  simplest  case  with  two  treatment  conditions, 
each  unit  has  two  potential  outcomes,  one  if  the 
unit  is  treated  and  the  other  if  untreated.  In  this 
case,  a  causal  effect  is  defined  as  the  difference 
between  the  two  potential  outcomes,  but  only  one 
of  the  two  potential  outcomes  is  observed.  In  the 


1970s,  Rubin  developed  the  model  into  a  general 
framework  for  causal  inference  with  implications 
for  observational  research.  Paul  Holland  in 
1986  wrote  an  influential  review  article  that  high¬ 
lighted  some  of  the  philosophical  implications  of 
the  framework.  Consequently,  instead  of  the 
“Neyman-Rubin  model,”  the  model  is  often  sim¬ 
ply  called  the  Rubin  causal  model  or  sometimes 
the  Neyman-Rubin-Holland  model  or  the  Neyman- 
Holland-Rubin  model. 

The  Neyman-Rubin  model  is  more  than  just  the 
math  of  the  original  Neyman  model.  Unlike 
Splawa-Neyman’s  original  formulation,  it  does  not 
rely  on  an  urn  model  motivation  for  the  observed 
potential  outcomes  but  rather  on  the  random 
assignment  of  treatment.  For  observational  stud¬ 
ies,  one  relies  on  the  assumption  that  the  assign¬ 
ment  of  treatment  can  be  treated  as  if  it  were 
random.  In  either  case,  the  mechanism  by  which 
treatment  is  assigned  is  of  central  importance.  The 
realization  that  the  primacy  of  the  assignment 
mechanism  holds  true  for  observational  data  no 
less  than  for  experimental  data  is  due  to  Rubin. 
This  insight  has  been  turned  into  a  motto:  “No 
causation  without  manipulation.” 

Let  YiT  denote  the  potential  outcome  for  unit  i 
if  the  unit  receives  treatment,  and  let  Y,c  denote 
the  potential  outcome  for  unit  i  in  the  control 
regime.  The  treatment  effect  for  observation  i  is 
defined  by  t,  =  YiT  —  Y,c.  Causal  inference  is  a 
missing-data  problem  because  Y,r  and  YiC  are 
never  both  observed.  This  remains  true  regardless 
of  the  methodology  used  to  make  inferential 
progress — regardless  of  whether  we  use  quantita¬ 
tive  or  qualitative  methods  of  inference.  The  fact 
that  we  cannot  observe  both  potential  outcomes 
at  the  same  time  is  commonly  referred  to  as  the 
“fundamental  problem  of  causal  inference.”  Let 
T,  be  a  treatment  indicator:  1  when  i  is  in  the 
treatment  regime  and  0  otherwise.  The  observed 
outcome  for  observation  i  is  then 

Y,  -  T,Y,t  +  (1  -  T,)Y,C. 

The  average  causal  effect  t  is  the  difference 
between  the  expected  values  E(Yr)  and  E(YC).  We 
observe  only  the  conditional  expectations 
E(Yt\T  —  1)  and  E(YC|T  =  0),  not  the  uncondi¬ 
tional  expectations  required  for  obtaining  t.  Until 
we  assume  that  E(Yr|T  =  1)  =  E(YT)  and 
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£(YC|T  =  0)  =  E(YC),  we  cannot  calculate  the 
average  treatment  effect.  Note  that  the  estimand  of 
interest,  such  as  the  average  treatment  effect,  is 
conceptually  distinct  from  the  estimators  used  to 
infer  it  from  data,  such  as  difference  in  means, 
linear  regression,  or  other  techniques. 

Experiments 

To  estimate  the  average  treatment  effect,  we 
require  the  assumption  of  independence.  The  singu¬ 
lar  virtue  of  experiments  is  that  physical  random¬ 
ization  of  an  intervention  ensures  independence 
between  treatment  status  and  potential  outcomes. 
Ronald  Fisher,  in  the  1920s  and  1930s,  first 
emphasized  the  importance  of  random  assignment 
for  eliminating  bias,  calling  randomization  of  treat¬ 
ment  the  “reasoned  basis  for  inference.”  From  a 
Lewisian  perspective,  the  control  group  in  an 
experiment  functions  as  an  observable  “possible 
world.”  With  the  independence  assumption,  the 
average  treatment  effect  can  be  estimated  from 
observables  using  the  following  expression: 

t  -  E(YiT\T  =  1)  -  £(Y,C|T  -  0) 

-  £(YiT)  -  £(Y,C). 

Under  randomization,  the  assumption  that  T,  is 
independent  of  Y,T  and  Y,c  is  plausible,  making  the 
treatment  and  control  groups  exchangeable  in 
expectation. 

One  of  the  assumptions  that  randomization  by 
itself  does  not  justify  is  that  the  response  of  one 
unit  should  be  unaffected  by  the  particular  assign¬ 
ment  of  treatments  to  the  other  units.  This  “no 
interference  between  units”  is  often  called  the 
Stable  Unit  Treatment  Value  Assumption  (SUTVA). 
SUTVA  implies  that  the  potential  outcomes  for  a 
given  unit  do  not  vary  with  the  treatments  assigned 
to  any  other  unit  and  that  there  are  not  different 
versions  of  treatment. 

Observational  Data 

In  observational  data,  stronger  assumptions  are 
usually  required  to  estimate  causal  effects.  In  obser¬ 
vational  studies,  the  causal  quantity  of  interest  is 
often  the  “average  treatment  effect  on  the  treated” 
or  ATT,  which  is  the  average  effect  conditional  on 


being  in  the  treatment  regime.  The  parameter  of 
interest  is 

t|(T  -  1)  =  £(Y;T|T  =  1)  -  £(YiC|T  =  I). 

Since  the  counterfactual  control  units,  £  ( Y(C  |  T  =  1 ) , 
are  not  observed,  a  control  group  must  be  con¬ 
structed.  The  two  assumptions  required  to  construct 
a  valid  control  group  are  conditional  independence 
of  the  potential  outcomes  and  treatment  assignment 
and  overlap,  or 

1.  Y,r,  YiC±T\X, 

2.  0  <  Pr(T  =  1|X)  <  1. 

When  these  two  conditions  hold,  we  can  say  that 
treatment  assignment  is  strongly  ignorable.  Once  a 
control  group  is  constructed  that  enables  us  to 
satisfy  these  two  conditions,  the  ATT  can  be  esti¬ 
mated  as 

t\(T  =  1)  =  £[£(Y,t|T  -  1) 

-  £(Y;C|T  =  0)]|T  =  1. 

It  is  important  to  note  that  the  outer  expectation  is 
taken  over  the  distribution  of  -XI  (T  =  1),  which  is 
the  distribution  of  covariates  among  the  treated 
units. 

Note  that  the  ATT  estimator  is  changing  when 
individual  observations  are  weighted,  and  the 
observations  that  are  outside  of  common  support 
receive  zero  weights.  That  is,  if  some  covariate 
values  are  observed  only  for  control  observa¬ 
tions,  those  observations  will  be  irrelevant  for 
estimating  ATT  and  are  effectively  dropped. 
Therefore,  the  overlap  assumption  for  ATT 
requires  only  that  the  support  of  X  for  treatment 
observations  be  a  subset  of  the  support  of  X  for 
control  observations.  More  generally,  one  would 
also  want  to  drop  treatment  observations  if  they 
have  covariate  values  that  do  not  overlap  with 
control  observations.  In  such  cases,  it  is  unclear 
exactly  what  estimand  one  is  estimating  because 
it  is  no  longer  ATT  as  some  treatment  observa¬ 
tions  have  been  dropped  along  with  some  control 
observations. 

The  key  assumption  being  made  here  is  strong 
ignorability.  Even  thinking  about  this  assumption 


208  Causality 


presupposes  some  rigor  in  the  research  design.  For 
example,  is  it  clear  what  is  pretreatment  and  what 
is  posttreatment?  If  not,  one  is  unable  even  to  form 
the  relevant  questions,  the  most  useful  of  which 
may  be  the  one  suggested  by  Harold  Dorn  in  1953, 
who  proposed  that  the  designer  of  every  observa¬ 
tional  study  should  ask,  “How  would  the  study  be 
conducted  if  it  were  possible  to  do  it  by  controlled 
experimentation?”  This  clear  question  also  appears 
in  William  Cochran’s  1965  Royal  Statistical  Society 
discussion  paper  on  the  planning  of  observational 
studies  of  human  populations.  Dorn’s  question  has 
become  one  that  researchers  in  the  tradition  of  the 
Neyman-Rubin  model  ask  themselves  and  their 
students.  The  question  forces  the  researcher  to 
focus  on  a  clear  manipulation  and  then  on  the 
selection  problem  at  hand.  Only  then  can  one  even 
begin  to  think  clearly  about  how  plausible  the 
strong  ignorability  assumption  may  or  may  not  be. 

Structural  Equation  Modeling 

Another  prominent  approach  to  causal  inference 
using  counterfactuals  is  structural  equation  model¬ 
ing,  a  method  most  associated  with  the  work  of 
Judea  Pearl.  Structural  equation  modeling  is  an  old 
enterprise  that  has  a  rich  history,  including  founda¬ 
tional  work  on  causality  in  systems  of  structural 
equations  by  the  geneticist  Sewall  Wright  (1921), 
the  economist  Trygve  Haavelmo  (1943),  and  the 
political  scientist  Herbert  Simon  (1953).  Modern 
advocates  of  structural  equation  modeling  argue 
that  the  probability  calculus  approach  to  causal 
modeling  used  by  researchers  in  the  Neyman-Rubin 
tradition  is  too  narrow  in  that  it  does  not  explicitly 
take  into  account  knowledge  about  the  mechanisms 
linking  background,  independent  variables,  and 
dependent  variables.  Rather  than  modeling  causal¬ 
ity  in  relation  to  experiments,  structural  equation 
modelers  prefer  to  write  out  a  more  elaborate 
causal  model  of  the  relationships  under  investiga¬ 
tion  through  a  system  of  structural  functions.  A 
system  of  such  functions  is  said  to  be  structural  if 
the  functions  are  assumed  to  be  invariant  to  possi¬ 
ble  changes  in  the  form  of  the  other  functions. 
Under  this  framework,  the  effects  of  treatments  are 
understood  as  interventions  in  a  prespecified  struc¬ 
tural  model. 

For  structural  equation  modelers,  hypothetical 
interventions  should  be  explicitly  and  formally 


related  to  the  causal  mechanisms  under  study.  In 
Pearl’s  version  of  structural  equation  modeling,  for 
example,  the  mathematical  operator  udo(X)” is 
used  to  represent  physical  interventions  in  a  set  of 
equations  that  deletes  certain  functions  from  the 
model,  replaces  them  by  a  constant,  and  preserves 
the  rest  of  the  model.  The  counterfactual  condi¬ 
tional  “If  X  had  been  x”  is  interpreted  as  an 
instruction  to  modify  the  original  model  so  that 
some  causal  variable  X  is  set  to  x  by  some  inter¬ 
vention,  experimental  or  otherwise.  This  operator 
is  accompanied  by  a  set  of  rules  called  “do  calcu¬ 
lus,”  which  helps  a  researcher  judge  whether  or 
not  sufficient  information  exists  to  identify  the 
effect  of  the  intervention  of  interest.  Rather  than 
identifying  one  all-encompassing  assumption — 
strong  ignorability — as  in  the  Neyman-Rubin 
approach,  Pearl  proposes  that  researchers  adopt  a 
series  of  local  assumptions  about  how  an  interven¬ 
tion  interacts  with  a  prespecified  structural  model 
to  identify  causal  quantities.  Despite  the  rather 
substantial  conceptual  differences  between  these 
two  approaches,  however,  they  are  mutually  com¬ 
patible.  This  compatibility  arises  from  their  shared 
reliance  on  counterfactual  understandings  of  cau¬ 
sality. 

Causal  Mechanisms 

The  Neyman-Rubin  counterfactual  approach  is  pri¬ 
marily  concerned  with  defining  what  the  effect  of  a 
cause  is,  not  explaining  how  causes  affect  outcomes. 
The  apparatus  of  most  statistical  models  of  causa¬ 
tion  has  no  formal  role  for  social  theory,  explana¬ 
tion,  or  causal  mechanisms.  Given  social  scientists’ 
interest  in  these  issues,  a  common  critique  of  the 
Neyman-Rubin  model  and  its  cousins  are  that  they 
are  too  narrow  for  the  social  sciences.  Advocates  of 
the  statistical  approach  have  countered  that  coun¬ 
terfactual  models  of  causation  can  be  augmented  to 
take  into  account  the  causal  mechanisms. 

While  experiments  have  the  virtue  of  credibly 
identifying  the  causal  effect  of  an  intervention,  they 
are  sometimes  criticized  as  “black  boxes.”  To 
understand  the  pathways  by  which  interventions 
affect  the  outcome,  social  scientists  have  relied  on  a 
method  known  as  “mediation  analysis,”  which 
models  the  relationship  between  a  treatment,  a 
potentially  posttreatment  variable,  and  the  outcome 
ultimately  of  interest.  An  important  distinction  in 
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this  literature  is  whether  or  not  the  “mediator”  is 
treated  as  posttreatment  or  not.  If  the  mediator 
is  not  affected  by  treatment,  the  effect  of  interest  is 
how  the  manipulable  mediator  affects  or  moderates 
the  outcome  when  the  main  treatment  variable  is 
fixed,  known  as  the  “controlled  direct  effect.” 

This  controlled  direct  effect  is  not  always  the 
effect  of  interest,  however,  since  mediation  analy¬ 
sis  is  often  intended  to  shed  light  on  the  role  of 
mechanisms,  which  in  this  framework  can  be 
defined  as  a  process  that  can  transmit,  at  least  par¬ 
tially,  the  effect  of  a  treatment  on  an  outcome.  An 
important  distinction  between  a  manipulation  (a 
“treatment”)  and  a  mechanism  is  that  the  former 
involves  an  external  intervention,  while  the  latter 
does  not.  The  goal  of  this  type  of  mediation  analy¬ 
sis  is  to  estimate  what  fraction  of  a  causal  effect  is 
“indirect” — that  is,  due  to  the  treatment  changing 
the  level  of  the  mediator  and  consequently  the  out¬ 
come — and  what  fraction  is  “direct” — that  is,  due 
to  the  treatment  affecting  the  outcome  through 
other  pathways.  Expressed  in  counterfactual  lan¬ 
guage,  an  “uncontrolled”  indirect  effect  is  a  com¬ 
parison  between  the  outcome  when  the  mediator  is 
set  at  the  value  realized  in  the  treatment  condition 
and  the  outcome  when  the  mediator  is  set  to  the 
value  that  would  be  observed  under  the  control 
condition  while  holding  treatment  status  constant. 

Uncontrolled  mediation  effects  are  often  of  great 
interest,  but  unfortunately,  even  with  a  randomized 
intervention,  their  identification  rests  on  strong 
assumptions.  In  mediation  analyses,  the  level  of  the 
mediator  is  generally  assumed  to  be  independent  of 
the  counterfactual  outcomes  conditional  on  treat¬ 
ment  assignment — that  is,  the  mediator  is  assigned 
“as  if”  random.  Given  that  an  uncontrolled  media¬ 
tor  variable,  by  definition,  is  not  randomly  assigned, 
this  assumption  is  strong  indeed.  While  the  identifi¬ 
cation  assumptions  may  be  warranted  in  special 
circumstances,  the  main  lesson  of  the  statistical  lit¬ 
erature  is  that  the  quantitative  study  of  causal 
mechanisms  is  an  enterprise  fraught  with  difficul¬ 
ties,  even  in  the  context  of  randomized  experiments. 

Qualitative  Evidence  and 
Theory  Falsification 

While  the  quantitative  study  of  causality  is  well 
developed  and  increasingly  unified  under  counter- 
factual  models,  many  social  scientists  supplement 


statistical  methods  with  qualitative  reasoning  to 
aid  causal  inference.  Sometimes  called  “causal  pro¬ 
cess  observations,”  qualitative  evidence  can  be  an 
important  source  of  leverage  for  both  the  design  of 
causal  analyses  and  the  interpretation  of  their  find¬ 
ings.  In  the  social  sciences,  for  example,  most  evi¬ 
dence  for  mechanisms  is  qualitative,  not  quantita¬ 
tive.  Qualitative  researchers  argue  that  by  direct 
observation  of  causal  processes,  a  researcher  can 
discern  potentially  important  mechanisms  that  may 
have  escaped  notice.  Insight  derived  from  observa¬ 
tions  that  are  poorly  suited  for  rectangular  data 
sets  may  then  lead  to  more  formal  investigations 
using  experimental  and  observational  quantitative 
methods.  The  health  sciences,  for  example,  are 
replete  with  examples  of  qualitative  observation 
paving  the  way  for  groundbreaking  experiments. 

Given  that  most  questions  in  the  social  sciences 
are  studied  using  observational  research  designs, 
another  role  for  qualitative  insight  is  the  justifica¬ 
tion  of  the  conditional  independence  assumption. 
Although  many  and  perhaps  most  observational 
studies  pay  inadequate  attention  to  justifying  the 
adequacy  of  their  designs,  careful  observational 
research  must  identify  important  confounders  and 
uncover  fortuitous  “natural”  experiments  for  mak¬ 
ing  well-grounded  inferences.  Qualitative  evidence 
can  be  used  to  identify  appropriate  confounders  to 
adjust  for  as  well  as  to  justify  any  claim  that  treat¬ 
ment  was  allocated  “as  if”  random. 

For  many  questions  in  the  social  sciences,  how¬ 
ever,  a  research  design  guaranteeing  the  validity  of 
causal  inferences  is  difficult  to  obtain.  When  this  is 
the  case,  researchers  can  attempt  to  defend  hypoth¬ 
esized  causal  relationships  by  seeking  data  that 
subject  their  theory  to  repeated  falsification.  Karl 
Popper  famously  argued  that  the  degree  to  which 
we  have  confidence  in  a  hypothesis  is  not  necessar¬ 
ily  a  function  of  the  number  of  tests  it  has  with¬ 
stood  but  rather  the  severity  of  the  tests  to  which 
the  hypothesis  has  been  subjected.  A  test  of  a 
hypothesis  with  a  design  susceptible  to  hidden  bias 
is  not  particularly  severe  or  determinative.  If  the 
implication  is  tested  in  many  contexts,  however, 
with  different  designs  that  have  distinct  sources  of 
bias,  and  the  hypothesis  is  still  not  rejected,  then 
one  may  have  more  confidence  that  the  causal 
relationship  is  genuine.  Note  that  repeatedly  test¬ 
ing  a  hypothesis  with  research  designs  suffering 
from  similar  types  of  bias  does  not  constitute  a 
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severe  test,  since  each  repetition  will  merely  repli¬ 
cate  the  biases  of  the  original  design.  In  cases 
where  randomized  experiments  are  infeasible  or 
credible  natural  experiments  are  unavailable,  the 
inferential  difficulties  facing  researchers  are  large. 
In  such  circumstances,  only  creative  and  severe 
falsification  tests  can  make  the  move  from  correla¬ 
tion  to  causation  convincing. 
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Censored  and  Truncated  Data 


Both  censored  and  truncated  data  involve  a  lack 
of  information  about  a  random  variable  and 
occur  in  the  context  of  quantitative  analysis  of 
data,  when  one  is  using  that  variable  either  to 
estimate  a  population  mean  (or  other  population 
parameters)  or  as  the  dependent  variable  in  a 
regression  analysis.  The  key  distinction  between 
them  is  whether  one  has  information  about  miss¬ 
ing  values.  With  censored  data,  one  observes 
some  information  about  the  missing  data,  either 
in  the  form  of  a  range  of  values  that  they  might 
fall  into  or  in  the  form  of  the  knowledge  that  they 
are  missing.  With  truncated  data,  one  has  no 
information  about  the  existence  or  value  of  miss¬ 
ing  observations.  The  difference  in  the  structure 
of  information  for  two  types  of  data  determines 
how  one  approaches  censored  or  truncated  data, 
whether  in  the  context  of  a  single  random  variable 
or  as  a  dependent  variable  in  a  regression  analy¬ 
sis.  This  entry  discusses  the  consequences  of  this 
kind  of  missing  data  for  regression  analysis  and 
sample  selection. 

Concerns  about  censoring  and  truncation 
abound  in  empirical  analysis.  They  can  occur  either 
through  the  structure  of  data-gathering  efforts  or 
through  legal  requirements.  Historically,  research¬ 
ers  relied  on  assumptions  about  the  distribution  of 
the  variable  to  adjust  the  estimates  to  account  for 
truncation  or  censoring.  These  concerns  are  just  as 
important  when  censored  or  truncated  variables 
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are  the  dependent  variables  in  regression  analyses. 
Scholars  have  long  worried  about  the  consequences 
of  processes  such  as  self-selection  for  the  validity  of 
their  regression  results,  since  they  often  result  in 
biased  and  inconsistent  coefficient  estimates.  While 
estimators  to  correct  for  censoring  and  truncation 
have  been  around  for  more  than  3  decades, 
researchers  have  worried  about  their  sensitivity  to 
distributional  assumptions  and  model  specifica¬ 
tion.  While  recent  work  attempts  to  relax  some  of 
these  critical  assumptions  and  diagnose  sensitivity 
issues,  researchers  have  also  extended  previous 
work  by  designing  estimators  for  a  greater  variety 
of  data. 

Types  of  Missing  Data 

Censored  and  truncated  data  are  both  forms  of 
missing  data,  which  have  been  categorized  in  three 
ways:  (1)  missing  at  random  (MAR),  (2)  missing 
completely  at  random  (MCAR),  and  (3)  nonignor- 
able  (NI).  A  variable  that  is  MCAR  has  missing 
values  that  are  determined  randomly,  so  that  they 
occur  with  equal  probability  and  do  not  depend  on 
any  information  in  the  data  set.  A  variable  that  is 
MAR  exhibits  a  pattern  of  missingness  in  which 
the  probability  of  a  missing  value  depends  on  other 
observed  variables  for  that  same  observation.  A 
variable  exhibits  NI  missingness  when  unobserved 
information  in  its  value  helps  explain  its  missing¬ 
ness.  In  this  case,  the  pattern  of  missingness 
depends  on  information  beyond  that  which  is  con¬ 
tained  in  the  observed  variables. 

Censored  and  truncated  data  can  emerge 
through  any  of  these  three  forms  of  missing  data. 
Making  valid  inferences  requires  making  valid 
assumptions  about  the  structure  of  the  missing¬ 
ness.  Except  for  the  case  of  MCAR,  one  will  gener¬ 
ally  reach  inaccurate  conclusions,  whether  regard¬ 
ing  population  characteristics  or  regression  param¬ 
eters,  unless  one  properly  models  the  pattern  of 
missingness. 

Censored  and  Truncated  Random  Variables 

A  random  variable  is  censored  if  one  does  not 
observe  its  true  value  but  rather  observes  a  bound 
for  the  range  of  values  into  which  it  falls.  If  X  is  a 
random  variable,  then  X  is  censored  if  when  X  >  b 
the  researcher  observes  only  b.  Similarly,  X  may  be 


censored  from  below  by  a  value  a.  The  density 
function  of  the  observed  values  of  X  therefore  has 
point  masses  at  the  values  a  and  b.  This  form  of 
censoring  occurs  commonly  in  political  science  in 
the  study  of  the  duration  of  political  events.  For 
example,  if  a  researcher  observes  a  set  of  states 
that  might  adopt  a  particular  policy,  then  states 
that  have  not  adopted  the  said  policy  by  the  end  of 
the  study  period  are  said  to  be  right  censored.  For 
those  states,  the  researcher  does  not  know  exactly 
how  many  years  will  elapse  before  they  adopt  the 
policy,  only  that  it  exceeds  the  number  of  years  in 
the  study  period  so  far. 

A  random  variable  is  truncated  if  the  researcher 
has  no  information  about  missing  values,  includ¬ 
ing  their  existence.  For  example,  observations  with 
X  <  a  might  not  be  observed  at  all.  Observations 
of  truncated  data  do  not  exhibit  point  masses  at 
the  truncation  points.  Rather,  one  accounts  for  the 
missing  values  by  focusing  on  the  distribution  con¬ 
ditional  on  observation.  This  involves  normalizing 
the  distribution  of  the  underlying  variable,  which 
includes  observed  and  unobserved  values,  by  the 
probability  of  the  observed  region. 

An  example  occurs  in  the  study  of  campaign 
finance  in  the  United  States:  Political  contributions 
to  candidates  for  national  office  are  not  reported  if 
the  amount  does  not  exceed  $200.  Data  from  the 
government  therefore  list  no  contributions  less 
than  $200,  making  it  impossible  to  know  the  num¬ 
ber  of  such  contributions  or  their  amount  if  any 
such  contributions  were  made. 

Censoring  and  Truncation 
in  Regression  Analysis 

Regression  analysis  of  censored  and  truncated 
variables  often  requires  adaptations  of  the  stan¬ 
dard  regression  model.  If  the  dependent  variable  is 
censored,  then  one  must  treat  observations  that 
take  on  the  censoring  value  differently  (note  that 
the  value  at  which  censoring  occurs  and  the  value 
that  is  observed  for  censored  observations  need 
not  be  identical).  In  the  case  of  state  policy  adop¬ 
tion,  discussed  above,  the  data  are  censored  from 
above,  in  the  sense  that  the  dependent  variable  for 
states  that  have  not  adopted  the  said  policy  by  the 
end  of  the  study  period  takes  on  a  value  equal  to 
the  length  of  the  study.  In  the  context  of  duration 
analysis,  this  is  referred  to  as  right  censoring.  If 
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one  treats  these  observations  the  same  way  as 
those  observations  that  have  actually  been  adopted 
in  the  past  year,  bias  will  likely  ensue.  Rather,  one 
should  model  the  information  accurately:  One 
knows  only  that  the  observation  is  greater  than  the 
censoring  value. 

Another  common  form  of  censoring  occurs 
when  smaller  values  of  the  dependent  variable  are 
not  observed.  Often  this  corresponds  to  situations 
in  which  the  dependent  variable  takes  on  nonnega¬ 
tive  values,  such  as  one  country’s  foreign  aid  to 
other  countries.  Many  countries  will  receive  no 
foreign  aid  dollars,  while  others  will  receive  posi¬ 
tive  amounts.  The  clump  of  countries  receiving 
zero  dollars  can  be  thought  of  as  cases  in  which  the 
host  country  might  have  preferred  to  give  negative 
aid,  but  since  this  is  impossible,  the  observed  value 
is  censored  at  zero.  One  would  generally  analyze 
data  with  this  structure  with  a  Tobit  model,  which 
accounts  for  the  censoring  process. 

When  the  dependent  variable  suffers  from  trun¬ 
cation,  one  does  not  have  information  about  data 
for  which  the  dependent  variable  is  outside  the 
bound(s)  of  truncation.  To  account  for  this  struc¬ 
ture,  one  must  normalize  the  distribution  of  the 
dependent  variable  to  account  for  the  lack  of 
observations  in  certain  regions  (so  that  the  distri¬ 
bution  over  observed  values  integrates  to  one).  In 
the  case  of  campaign  contributions  discussed 
above,  one  would  observe  information  only  about 
contributors  that  donate  at  least  $200,  since  that  is 
the  legal  cutoff  for  reporting  requirements.  Thus, 
one  would  likely  have  a  data  set  with  complete 
information  for  all  individuals  who  contribute  at 
least  $200  but  no  information  and  no  observations 
for  individuals  who  contribute  less  than  that 
amount.  Unlike  the  case  of  censoring,  however, 
there  will  not  be  a  large  number  of  individuals 
clustered  at  $200. 

Censoring  and  Truncation 

Through  Sample  Selection 

A  related  form  of  censoring  and  truncation  occurs 
when  a  second  process  determines  whether  the 
dependent  variable  is  missing  for  a  given  observa¬ 
tion.  This  is  often  referred  to  as  nonrandom  sam¬ 
ple  selection  and  can  result  in  selection  bias  if  one 
ignores  the  selection  process.  One  can  think  of  the 
data-generating  process  as  occurring  in  two  stages: 


The  selection  equation  determines  whether  one 
observes  the  dependent  variable  of  interest  for  an 
observation,  and  the  equation  of  interest  deter¬ 
mines  the  value  of  this  variable  given  the  observa¬ 
tion.  When  the  unobserved  component  of  the 
selection  equation  is  correlated  with  the  unob¬ 
served  component  in  the  outcome  equation,  non- 
random  sample  selection  is  a  form  of  nonignorable 
missingness.  This  will  generally  result  in  biased 
coefficients  if  one  ignores  the  selection  process. 

Nonrandom  sample  selection  can  take  two 
forms.  If  one  observes  cases  for  which  the  depen¬ 
dent  variable  is  missing,  the  data  suffer  from  sto¬ 
chastic  censoring.  One  knows  that  the  observation 
could  have  experienced  the  outcome  of  interest, 
but  because  of  the  selection  process,  it  happened 
not  to  experience  it.  Alternatively,  one  might  have 
information  only  about  observations  with  observed 
values  of  the  outcome  of  interest.  For  example,  if  a 
survey  respondent  answers  all  questions  save  the 
dependent  variable,  then  the  data  exhibit  stochas¬ 
tic  censoring;  whereas  if  a  potential  respondent 
chooses  not  to  answer  the  entire  survey,  then  the 
data  exhibit  stochastic  truncation. 

Both  can  be  dealt  with  in  the  regression  context 
by  simultaneously  modeling  the  selection  process 
and  the  outcome  of  interest.  The  most  well-known 
estimator  of  this  type  was  developed  by  James 
Heckman  and  is  often  referred  to  as  the  Heckman 
model.  While  the  Heckman  model  is  designed  for 
a  continuous  dependent  variable  and  data  that 
suffer  from  stochastic  censoring,  similar  estima¬ 
tors  exist  for  discrete  outcomes.  Analogous  esti¬ 
mators  also  exist  for  continuous  and  discrete  out¬ 
come  variables  when  the  data  exhibit  stochastic 
truncation. 

Estimation  requires  proper  specification  of  the 
selection  equation.  Furthermore,  the  estimates 
tend  to  be  sensitive  to  the  distributional  assump¬ 
tion  for  the  error  terms  (generally  bivariate  nor¬ 
mal).  Dealing  with  stochastic  truncation  generally 
poses  a  greater  estimation  challenge  because  one 
has  no  information  about  incomplete  observa¬ 
tions,  making  the  distributional  assumption  even 
more  important,  since  it  is  the  mechanism  for  iden¬ 
tifying  the  selection  process. 
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Censorship 


Censorship  is  restriction  of  the  circulation  of 
information,  ideas,  and  images  or  of  political,  cul¬ 
tural,  religious,  and  artistic  opinions;  it  occurs 
mainly  when  the  rulers  consider  that  these  could 
weaken  or  damage  their  hold  on  power.  Char¬ 
acteristic  of  nondemocratic  systems,  and  also 
present  in  some  democracies,  censorship  is  usually 
exercised  by  an  authority  responsible  for  monitor¬ 
ing  the  dissemination  of  ideas,  including  those 
circulated  by  governmental  agencies  as  well  as 
those  from  private  individuals  and  organizations. 
The  term  censorship  has  its  origin  in  the  Roman 
Empire,  where  the  “censor”  was  the  person  who 
controlled  the  dissemination  of  political  ideas. 

Since  premodern  times,  censorship  has  existed 
under  different  political  regimes.  It  began  to  be 
questioned  in  the  17th  century,  when  critics  of  the 
period’s  absolutist  states  started  to  call  for  the 
limitation  of  state  power  and  discretion  and  for  the 
institutionalization  of  civil  and  political  rights  and, 
particularly,  freedom  of  speech. 

Censorship  has  also  existed  in  social  institutions 
such  as  the  Catholic  Church  as  well  as  in  other  reli¬ 
gions  such  as  Islam.  In  this  case,  the  upper  echelons 
of  the  religious  hierarchy  exercise  censorship  to 
ensure  that  the  content  of  writings  by  priests  and 
bishops  is  in  line  with  the  religion’s  main  tenets. 

Censorship  is  generally  at  odds  with  democracy, 
which  is  characterized  by  the  institutionalization 
of  civil  and  political  rights,  civil  liberties,  and  the 


existence  of  an  opposition.  Although  free  expres¬ 
sion  is  a  central  democratic  value,  some  opponents 
of  censorship  do  recognize  exceptions.  One  of  the 
most  commonly  proposed  exceptions  is  of  child 
pornography,  due  to  its  connection  with  individu¬ 
als  or  organizations  that  commit  crimes  such  as 
pedophilia.  Supporters  of  censorship  in  such  cases 
argue  that  the  right  to  free  expression  is  out¬ 
weighed  by  the  need  to  prevent  the  extraordinarily 
harmful  impact  on  children  caused  by  the  creation 
and  distribution  of  child  pornography.  In  contrast, 
critics  of  censorship  with  respect  to  child  pornog¬ 
raphy  argue  that  the  connection  between  pornog¬ 
raphy  and  actual  harm  to  children  has  not  been 
established.  Although  some  proponents  of  free 
speech  argue  that  it  has  precedence  over  the 
defense  of  children’s  interests,  most  Western 
democracies  ban  the  possession,  production,  and 
distribution  of  child  pornography. 

Hate  speech  is  another  controversial  subject  of 
censorship;  again,  supporters  argue  that  hate-filled 
speech  encourages  actual  violence  and  that  this 
potential  harm  outweighs  the  value  of  free  expres¬ 
sion.  For  example,  such  arguments  are  made  to 
justify  the  censorship  of  neo-Nazi  propaganda 
both  in  the  United  States  and  in  Europe. 

The  use  of  censorship  may  be  seen  not  only  in  a 
formal  way — that  is,  through  officials  entitled  to 
control  political  information  using  known  political 
means — but  also  through  informal  mechanisms, 
such  as  threatening  independent  journalists,  pun¬ 
ishing  a  critical  press  through  the  denial  of  paid 
advertisements,  and  so  forth.  These  informal  cen¬ 
sorship  mechanisms  can  be  found  in  young  democ¬ 
racies  established  after  military  or  authoritarian 
regimes,  in  which  there  are  political  constraints 
inherited  from  the  prior  political  order  limiting  the 
institutionalization  of  political  rights  or  caused  by 
the  lack  of  democratic  beliefs  among  the  governing 
elite.  Referred  to  by  Wolfgang  Merkel  as  “defective 
democracies,”  they  may  simply  not  have  repealed 
the  previous  regime’s  restrictions  on  press  freedom 
or  may  consider  that  censorship  is  necessary  in  cer¬ 
tain  cases  for  reasons  of  national  security.  In  this 
latter  case,  it  is  used  to  restrict  publication  of  infor¬ 
mation  about  issues  that  are  considered  very  sensi¬ 
tive  for  the  armed  forces  and  to  avoid  possible 
tensions  and  ensure  the  consolidation  of  democ¬ 
racy.  In  defective  democracies  or  hybrid  regimes, 
censorship  by  governing  officials  is  exercised 
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through  formal  and  informal  mechanisms,  includ¬ 
ing  threatening  journalists.  The  elimination  of  cen¬ 
sorship  is  one  of  the  priority  demands  of  an  agenda 
that  seeks  to  establish  real  democracy.  In  delegative 
democracies,  such  as  Venezuela  during  Hugo 
Chavez’s  presidency  (in  power  since  1999),  public 
officials  use  a  variety  of  methods  to  limit  freedom 
of  press,  including  not  only  censorship  but  also 
actions  such  as  the  cancellation  of  TV  and  radio 
licenses.  Although  censorship  is  formally  rejected  in 
established  democracies,  freedom  of  speech  can  be 
restricted  not  as  a  governing  decision  but  as  a  con¬ 
sequence  of  the  ownership  structure  of  the  media, 
when  business  people  seek  to  influence  public  opin¬ 
ion  with  their  own  beliefs  or  political  views,  reject¬ 
ing  the  dissemination  of  opposing  views.  In  these 
situations,  editors  may  employ  extensive  self-cen¬ 
sorship  to  block  the  dissemination  of  news  and 
ideas  that  may  damage  the  government. 

Censorship  can  be  imposed  preventively — by 
defining  conditions  for  the  dissemination  and  cir¬ 
culation  of  ideas  through  any  channel — or  ex  post 
facto,  with  the  authority  sanctioning  those  who 
infringe  the  norms  that  establish  limits  on  freedom 
of  speech. 

Although  censorship  is  incompatible  with 
democracy,  this  does  not  mean  that  public  officials 
lack  the  resources  with  which  to  attempt  to  per¬ 
suade  the  media  to  block  news  and  information 
that  may  be  damaging  or  favor  reports  that  may  be 
negative  to  the  opposition.  One  of  these  key  tools 
is  the  placement  of  advertising,  which  can  be  an 
important  source  of  media  revenues. 

With  the  development  of  modern  technology, 
particularly  the  expansion  of  Internet,  the  imposi¬ 
tion  of  censorship  is  more  difficult  than  ever,  posing 
enormous  problems  for  public  officials  seeking  to 
limit  the  dissemination  of  information  and  news 
critical  of  the  government.  The  Internet  opened  a 
new  channel  of  communication  for  dissidents  in 
nondemocratic  regimes  (e.g.,  Cuba  and  China)  to 
denounce  abuses  and  make  their  demand  for  plural¬ 
ism  and  freedom  known  to  the  world.  These  actions, 
in  turn,  trigger  support  from  NGOs  and  political 
parties  and  governments  in  democratic  countries, 
which  helps  protect  the  authors  from  reprisals. 
Large  international  corporations — notably  Google — 
have  also  emerged  as  new  players  in  favor  of  greater 
pluralism  in  nondemocratic  regimes  or  defective 
democracies. 


Censorship  plays  a  very  important  role  in 
authoritarian  regimes.  It  serves  as  a  tool  to  ensure 
that  limited  pluralism — a  central  element  in  their 
identity — remains  within  the  bounds  established 
by  the  authorities.  These  regimes  allow  certain  sec¬ 
tors  of  the  elite  access  to,  and  even  control  of,  the 
media  and  publishing  houses  through  which  they 
seek  to  gain  political  power  or  influence  while  also 
restricting  the  access  previously  enjoyed  by  other 
sectors  of  the  elite.  Self-censorship  is  widely  applied 
in  these  regimes  by  editors  of  private  media  com¬ 
panies  that  support  the  regime,  are  interested  in  its 
continuity,  and  are  particularly  strict  in  preventing 
the  publication  of  negative  news,  particularly  that 
referring  to  opposition  activities  or  conflicts  and 
divisions  within  the  governing  elite. 

Francisco  Franco’s  Spain  (1939-197 5)  provides 
an  example  of  the  evolution  of  censorship,  which 
was  not  applied  uniformly  during  his  regime  but 
underwent  a  number  of  changes  as  a  result  of  con¬ 
flicts  among  factions  of  the  ruling  elite,  which  had 
more  influence  when  the  regime  was  consolidated. 
With  the  consolidation  of  political  power  and  in  a 
context  of  economic  development  and  social  prog¬ 
ress,  there  was  a  tendency  to  relax  censorship  in 
order  to  allow  the  press  to  play  a  role  in  controlling 
excesses,  abuses  of  power,  and,  particularly,  cor¬ 
ruption  among  top  officials.  This  explained  the 
press  law  of  1966  in  Franco’s  Spain,  which  began  a 
certain  liberalization  of  the  official  press,  bringing 
greater  transparency  about  the  decisions  of  the 
authorities.  However,  the  information  concerning 
abuse  of  power,  particularly  of  economic  character, 
was  used  by  sectors  of  the  governing  elite  to  weaken 
the  power  position  of  other  factions,  provoking  a 
scandal  that  led  to  a  cabinet  reshuffle  and  the  dis¬ 
missal  of  ministers  involved  in  this  conflict. 
However,  it  also  permitted  the  appearance  of  oppo¬ 
sition  publications  and  even  magazines  (e.g., 
Cuadernos  para  el  Didlogo),  which  had  costs  for 
the  regime  in  terms  of  a  significant  increase  in  nega¬ 
tive  political  information  since  they  published  news 
about  cases  of  corruption  and  conflicts  within  the 
government  and  the  elite.  As  a  result  of  these  direct 
and  indirect  effects  of  the  liberalization  of  the  press, 
censorship  was,  in  practice,  reestablished. 

In  the  context  of  liberalization  of  authoritarian 
regimes,  the  relaxation  of  censorship  and  self¬ 
censorship  was  widely  extended,  as  in  the  Pinochet 
regime  in  Chile  (1973-1990)  during  a  period  of 
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political  instability  caused  by  the  economic  crisis 
in  1983  to  1984.  Independent  radios  and  maga¬ 
zines  could  inform  in  a  broader  way  on  several 
political  facts  that  had  been  censored  earlier, 
including  repression  against  opposition  organiza¬ 
tions.  However,  the  government  continued  threat¬ 
ening  and  punishing  journalists  with  formal  and 
informal  mechanisms  and  applying  censorship  in 
particular  cases.  The  political  space  opened  by  the 
weakening  of  censorship  was  widely  used  by  jour¬ 
nalists  and  media,  leading  to  a  significant  increase 
in  political  information,  including  adverse  infor¬ 
mation,  and  favored  the  development  of  the  oppo¬ 
sition  and,  as  a  result,  the  weakening  of  the  politi¬ 
cal  bases  of  authoritarian  rule. 

Carlos  Huneeus 
Universidad  de  Chile 
Santiago,  Chile 
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Central  Banks 


Modern  central  banks  are  public,  nonprofit,  eco¬ 
nomic,  and  political  institutions,  with  special 
functions  derived  from  their  command  over  mon¬ 
etary  resources.  They  shape  monetary  policy,  have 
a  strong  influence  on  exchange  rates,  and  are  the 
guardians  of  financial  stability.  Their  decisions 
influence  economic  variables  determining  growth, 
output,  and  national  development  policies  and 
have  a  bearing  on  the  conditions  of  international 
financial  and  monetary  cooperation.  As  a  general 
rule,  central  banks  are  regulated  by  a  mandate 
from  government(s)  specifying  their  goals  and  the 
conditions  under  which  control  over  monetary 
resources  is  exercised. 


After  situating  the  topic  in  its  theoretical  con¬ 
text,  this  entry  discusses  the  rules  and  social  insti¬ 
tutions  on  which  central  banks’  command  over 
monetary  resources  ultimately  rest;  stresses  the 
establishment  of  central  banks  as  a  trustworthy 
system  of  monetary  authority,  at  the  domestic  and 
international  levels;  and  examines  the  way  in 
which  the  institution’s  power  has  been  recast  and 
strengthened  in  the  financial  globalization  era,  in 
connection  with  the  widespread  trend  toward  cen¬ 
tral  bank  independence  from  elected  governments 
in  old  and  new  democracies. 

Central  Banks  as  Substate  Actors:  Theory 

An  updated  review  of  existing  central  banks  shows 
a  greater  variance  than  is  assumed  in  the  dominant 
rational  expectations  theories  framing  the  univer- 
salistic  prescriptions  and  the  role  models  recom¬ 
mended  over  the  past  30  years.  Influence  is  notice¬ 
able,  however,  in  two  global  trends:  (1)  in  terms  of 
goals,  the  convergence  toward  the  primacy  of  price 
stability,  and  (2)  in  terms  of  the  mandated  use  of 
monetary  resources,  the  dramatic  increase  in  the 
number  of  institutions  that  acquired  statutory 
independence  vis-a-vis  governments.  Yet  the  actual 
variance  in  the  practices  and  mandates  of  those 
institutions  across  countries  and  regions  limits  the 
scope  for  generalization.  Moreover,  central  bank 
functions  and  powers  have  been  recast  over  time 
since  the  establishment  of  the  first  institutions  of 
this  breed  in  the  mid-19th  century. 

A  central  bank’s  status  as  a  public,  nonprofit 
institution  is  the  outcome  of  an  evolutionary  pro¬ 
cess,  based  on  the  rules  and  social  institutions  that 
underpin  its  three  core  functions:  (1)  as  banker  to 
the  government,  (2)  its  monopoly  of  note  issuance, 
and  (3)  as  banker  to  the  banking  system — includ¬ 
ing  that  of  last-resort  lender.  These  functions  refer 
to  the  unique  relationships  between  the  institution 
and  its  principal  clients:  the  government  and  the 
profit-maximizing  financial  institutions,  mainly 
commercial  banks. 

The  Relationship  Between  the  Central 
Banks  and  the  Government 

As  a  public  national  institution,  the  central  bank 
derives  its  functions  from  a  state-backed  power  as 
the  sole  issuer  of  a  unique  currency  circulating  in  a 
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geographical  territory — to  which  it  has  been 
granted  the  status  of  legal  tender  {corns  force). 

Parallel  to  the  processes  of  political  delegation, 
a  contractual,  economic  relationship  with  the  gov¬ 
ernment  involves  mutual  economic  interest  and 
economic  calculation.  Lending  to  the  government, 
at  an  agreed  fixed  interest  rate,  generates  profits 
alongside  those  drawn  from  currency  issuance.  For 
the  government,  having  “its  own  bank”  ensures 
cheaper  deficit  financing;  its  power  as  principal 
borrower  is  used  to  press  for  a  minimum  rate  of 
interest.  On  the  other  hand,  central  banks  typically 
stand  out  as  major  holders  of  a  government’s  debt, 
insofar  as  they  invest  their  capital  in  government 
long-term  debt  instruments,  mainly  bonds.  Where 
a  domestic  public  debt  market  exists  or  is  being 
established,  voluntary  loans  from  private  savers 
provide  an  additional  source  of  deficit  financing.  It 
is  a  market  monitored  by  the  central  banks — add¬ 
ing  the  function  of  debt  management  to  their  other 
functions. 

While  the  government’s  preference  for  cheap 
finance  is  an  intrinsic  component  of  its  relation¬ 
ship  to  their  central  banks,  deficit  financing 
through  monetary  expansion — that  is,  through 
the  monetization  of  the  government’s  debt — was 
exceptional  in  industrialized  countries.  Long-term 
reviews  of  past  practices  supplied  by  Charles 
Goodhart’s  team  and  by  Barry  Eichengreen  indi¬ 
cate  economic  as  well  as  political  reasons  for 
central  banks  to  be  congenitally  inflation  and  risk 
averse.  Because  they  are  major  holders  of  the  gov¬ 
ernment’s  debt  and  because  they  already  profit 
from  lending  to  the  government,  they  have  an 
overriding  interest  in  keeping  the  real  value  of  its 
loans  stable.  There  is  also  a  reputation  dimen¬ 
sion  to  its  practice,  in  that  the  strength  of  the 
currency  issued  is  closely  related  to  the  credit- 
worthiness  of  the  issuer  and  to  the  commitment 
of  the  issuing  institution  to  price  stability.  Prior 
to  the  1970s,  the  commitment  of  central  banks 
to  convertibility  and/or  price  stability  was  over¬ 
ridden  in  only  one  circumstance:  when  the  sur¬ 
vival  of  the  state  was  threatened  by  wars.  Thus, 
the  “inflationary  bias”  of  economic  policy  and 
the  temptations  of  monetary  financing  of  govern¬ 
ment  spending  in  industrialized  countries  are  late 
developments  that  call  for  a  more  contextualized 
analysis  of  the  unstable  monetary  “regime”  in 
the  financial  globalization  era. 


Prompted  by  two  “structural”  factors,  some 
developing  countries,  mainly  in  Latin  America, 
resorted  to  an  “inflation  tax.”  The  first  factor  was 
a  weak  tax  system,  reflecting  the  limited  capacity 
of  the  state  to  establish  itself  as  a  widely  accepted 
fiscal  authority.  The  second  was  that  the  inflation 
tax  operated  as  a  conflict-minimizing  device  to 
socialize  the  costs  of  “catching-up”  industrializa¬ 
tion.  This  was  typically,  as  in  the  case  of  Brazil, 
where  the  state’s  commitment  to  a  proactive  strat¬ 
egy  of  rewarding  the  winners  was  based  on  the 
reiterated  distribution  of  skewed  fiscal  and  mone¬ 
tary  incentives  to  selected  economic  agents. 

Bank  to  the  Banking  System 

As  bankers  to  the  banking  system,  central  banks 
play  a  leading  role  mainly  vis-a-vis  commercial 
banks  and  less  effectively  vis-a-vis  other  profit- 
maximizing  financial  institutions.  Such  a  role  is, 
first  of  all,  one  of  guidance  and  monitoring.  Because 
commercial  banks  act  simultaneously  as  deposit 
takers  and  loan  makers,  they  are  credit  multipliers. 
Since  credit  is  money,  their  individual  micro- 
economic  decisions,  guided  by  profit-maximizing 
considerations,  have  a  direct  bearing  on  the  increase 
or  reduction  in  the  stock  of  money  available  in  the 
economy.  Central  banks’  overriding  concern  with 
financial  and  economic  stability  gradually  evolved 
toward  a  proactive  role  in  averting  this  source  of 
volatility.  Their  function  as  lender  of  last  resort  to 
the  banking  system — asserted  always  in  the  wake 
of  crises — revolves  around  the  bank’s  systemic 
responsibility,  in  exchange  for  enhanced  powers  of 
supervision  and  regulation  of  commercial  banks’ 
behavior.  While  the  necessary  function  of  lender  of 
last  resort  is  called  for  and  is  reiterated  in  crises,  it 
is  through  its  open  market  operations  that  the 
leadership  of  the  central  bank  over  profit-maxi¬ 
mizing  institutions  is  exercised.  To  maintain  day- 
to-day  control  over  the  short-term  nominal  inter¬ 
ests  rates  in  all  market  conditions,  the  central  bank 
purchases  or  sells  financial  securities  for  cash  to 
change  the  monetary  base.  Its  power  to  influence 
the  money  supply  and  other  economic  variables  in 
this  case  is  exercised  through  the  markets.  In  other 
words,  through  a  complex  system  of  incentives 
and  penalties — mainly  the  bank’s  discount  rate — it 
may  induce  commercial  banks  to  proceed  with 
their  lending  strategies  (or  dissuade  them).  Because 
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the  central  bank  can  impose  the  ultimate  condi¬ 
tions  for  granting  credit,  it  enjoys  the  “magical 
powers”  of  creating  or  destroying  money.  But  it 
does  so  by  relying  on  market  mechanisms — 
namely,  on  its  monopolistic  position  in  the  credit 
market  and  on  the  predictably  self-interested 
response  of  profit-maximizing  institutions  to  its 
interest  rate  policy. 

Set  against  this  background,  the  use  of  the  term 
central  bank  is  equivalent  to  and  interchangeable 
with  that  of  monetary  authority — a  use  that  is 
common  to  economists  of  all  persuasions.  This 
identity  relies  on  three  assumptions.  One,  wher¬ 
ever  a  market  exists,  financial  firms  and  investors 
act  independently  from  one  another.  Two,  the 
central  banks’  power  over  monetary  resources  is 
that  of  a  rule  maker — an  extension  of  the  state’s 
capacity  of  enforcement.  Three,  its  creditworthi¬ 
ness  derives  from  its  governance  capacity  in  terms 
of  price  and  financial  stability,  which  in  turn 
derives  mainly  from  its  technical  credentials. 

Political  scientists  assume  a  political  economy 
approach  to  the  international  monetary  regimes  in 
which  central  banks  operate  as  rule  based  and 
contingent.  They  take  as  a  premise  that  domestic 
and  world  markets  are  embedded  in  and  perme¬ 
ated  by  social  institutions.  From  this  perspective, 
the  history  of  the  international  monetary  order  (or 
disorder)  is  thought  of  as  “the  history  of  the 
construction  and  demolition  of  rules,  constitutive 
and  regulative,  explicit  or  tacit,  substantive  and 
procedural,”  in  the  words  of  Bruce  Hall  (2008, 
p.  10).  Central  to  this  cognitive  framing  are  the 
politico-economic  processes  underlying  the  estab¬ 
lishment  or  disruption  of  a  policy  consensus 
around  a  monetary  regime. 

The  International  Political 
Economy  and  Central  Banking 

A  major  distinctive  feature  of  the  current  monetary 
and  financial  regime  is  the  rising  power  of  central 
banks  vis-a-vis  established  governments  and  the 
redefinition  of  their  authority  vis-a-vis  cross-bor¬ 
der  market  actors.  To  situate  such  processes 
requires  a  shift  of  focus  from  its  role  as  a  predom¬ 
inantly  substate  institution  to  one  including  its 
capacities  as  a  public  transnational  actor  in  a  mul¬ 
tilevel  system  of  global  governance.  The  bank’s 
command  over  monetary  resources  has  been  recast 


in  tandem  with  new  modes  of  interaction  with 
government(s)  and  with  market  actors.  Those 
shifts  were  carried  out  within  the  cognitive  fram¬ 
ing  of  the  rational  expectations  revolution — 
centered  on  the  notion  of  central  banks’  credibility 
with  the  financial  markets.  They  are  closely  linked 
to  a  number  of  developments  that  distinguish  the 
post-Bretton  Woods  monetary  era,  extending 
from  the  1970s  to  the  present,  from  its  predeces¬ 
sors:  the  classical  gold  standard  (1844-1945),  the 
interwar  decades,  and  the  Bretton  Woods  system 
(1945-1971).  While  the  previous  systems  consti¬ 
tuted  international  monetary  regimes  proper, 
whose  rules  and  social  institutions  were  designed 
and  enforced  by  the  United  Kingdom  (UK)  and  the 
United  States,  respectively,  the  post-Bretton 
Woods  is  considered  by  many,  including  Barry 
Eichengreen,  as  a  “nonsystem.”  Successive  failures 
at  establishing  a  durable  international  monetary 
order  and  a  great  deal  of  technical  experimenta¬ 
tion  underlie  the  process  by  which  the  powers  of 
the  central  banks  were  recast. 

The  issues  around  which  the  practice  and  the¬ 
ory  of  central  banking  came  to  revolve  are  explored 
by  the  monetary  economist  Stanley  Fisher.  On  the 
one  hand,  the  inflationary  tendencies  are  brought 
about  by  the  conflict  between  the  short-term  and 
the  long-run  effects  of  monetary  expansion.  On 
the  other,  it  should  revolve  around  the  conflict 
between  the  need  to  shield  central  banks  from  the 
political  pressures  underlying  the  monetary  financ¬ 
ing  of  government  spending  and  the  principle  of 
accountability  to  the  public. 

Central  Banking  and  Governments 

Central  banks’  current  powers  are  tied  to  criti¬ 
cal  shifts  in  their  relationship  with  their  constituen¬ 
cies  and  in  the  cognitive  maps  adopted  by  central 
bankers.  A  salient  aspect  of  those  developments  is 
a  shift  in  the  priorities  of  monetary  management 
toward  the  primacy  of  price  stability  over  other 
policy  goals,  such  as  the  promotion  of  full  employ¬ 
ment  and  maximum  output.  The  underlying  polit¬ 
ico-economic  framework  is  shaped  by  the  threat  of 
runaway  inflation  in  the  dominant  capitalist 
democracies,  in  the  1970s  and  1980s,  mainly  in  the 
United  States  and  the  UK,  which  was  an  unprece¬ 
dented  experience  in  peacetime.  The  international 
political  consensus  underlying  the  Bretton  Woods 
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regime  had  already  been  shaken  in  1971  to  1973 
by  President  Nixon’s  unilateral  decision  to  detach 
the  dollar  (to  which  all  other  currencies  were 
pegged)  from  the  gold  standard  so  that  dollars 
could  no  longer  be  converted  to  gold.  Because  the 
dollar  was  the  reserve  currency,  severe  losses  in  the 
value  of  foreign  reserves  held  in  dollars  were 
inflicted  on  the  other  members  in  the  international 
monetary  system.  The  ensuing  legitimacy  crisis 
was  compounded  by  the  refusal  of  Germany  and 
France  to  import  inflation  engendered  in  the 
United  States. 

A  matter  of  interest  to  the  social  scientist  is 
brought  to  the  fore  by  this  episode.  It  highlighted 
the  capacity  of  a  powerful  state,  the  hegemon,  to 
deflect  the  costs  of  an  overdue  domestic  adjust¬ 
ment  to  trading  partners.  It  also  highlights  the 
critical  role  of  transnational  political  coordination 
in  counteracting  that  mode  of  power.  The  unique 
experience  of  Europe  is  a  case  in  point:  Both  that 
episode  and  the  crisis  of  the  European  Exchange 
Rate  System  in  the  early  1990s  propelled  the 
gradual  crafting  of  the  institutions  that  under¬ 
pinned  the  Maastricht  Agreement  provisions  aimed 
at  establishing  a  regional  monetary  order.  The 
powers  of  national  central  banks  were  redefined 
accordingly,  in  tandem  with  the  transfer  of  their 
command  over  monetary  resources  to  the  European 
Central  Bank.  Insofar  as  it  involved  a  long-term 
project,  multiple  acts  of  sovereign  political  delega¬ 
tion,  and  democratic  deliberation,  the  European 
Central  Bank  is  a  unique  case  of  establishing  a 
transnational  monetary  authority  through  state¬ 
craft,  in  the  sense  explored  by  Lourdes  Sola  and 
Laurence  Whitehead  (2005)  in  connection  with 
developing  countries. 

It  is  within  this  shifting  monetary,  financial,  and 
political  context  that  the  new  powers  and  func¬ 
tions  assigned  to  central  banks  vis-a-vis  the  gov¬ 
ernments  become  meaningful.  The  centrality  of  the 
monetary  policy  and  the  monetary  counterreaction 
to  the  dominance  of  Keynesian  thought  marked 
out  a  paradigmatic  shift  away  from  the  postwar 
economic  dirigisme  structured  around  demand 
management  and  based  on  exchange  rate,  price, 
and  interest  rate  controls.  It  brought  to  an  end  the 
subordination  of  central  banks  to  domestic  fiscal 
authorities. 

The  empowerment  of  such  institutions,  in  par¬ 
ticular  the  U.S.  Federal  Reserve  System,  highlights 


in  what  sense  the  exercise  of  monetary  authority — 
as  a  special  mode  of  political  authority — can  be 
strengthened  and  redesigned  within  the  domestic 
framework  of  liberal  constitutionalism.  Three 
dimensions  are  of  interest:  (1)  the  strengthened 
role  of  central  banks  as  rule  makers,  (2)  the  polit¬ 
ical  dimension  of  their  effective  capacity  for  mon¬ 
etary  governance,  and  (3)  the  relative  power  of  the 
hegemon — the  United  States,  as  the  issuer  of  the 
international  reserve  currency,  the  dollar — to  lead 
the  international  arrangements  that  deflected  the 
costs  of  adjustment  to  developing  countries. 

The  authority  of  the  Federal  Reserve  as  a  sub¬ 
state  actor  is  constitutionally  specified  and  limited. 
Its  autonomy  vis-a-vis  the  executive  is  constrained 
by  the  oversight  of  the  U.S.  Congress  and,  in  terms 
of  its  goals,  by  its  dual  mandate — price  stability 
and  full  employment.  Neither  these  constraints  nor 
the  prospect  of  the  huge  social  and  political  costs 
inseparable  from  recession  and  unemployment 
barred  the  option  for  a  shock  therapy  to  curb  run¬ 
away  inflation — basically  a  dramatic  increase  in 
the  U.S.  interest  rates  by  Federal  Reserve  Chairman 
Paul  Volcker.  The  political  dimensions  of  the 
decision-making  processes  underlying  a  sharp  turn¬ 
around  of  monetary  policy  remain  understudied — 
despite  the  evidences  that  it  was  premised  on 
successive  acts  of  political  delegation  from  both 
the  executive  and  the  legislative  powers. 

It  is  widely  agreed  that  the  Federal  Reserve’s 
capacity  for  monetary  governance,  in  terms  of  its 
success  in  curbing  inflationary  expectations,  is  better 
explained  by  its  political  components  than  by  its 
technical  ones.  Monetary  economists,  practitioners, 
and  political  economists  agree  that  as  a  technical 
exercise,  it  was  hardly  a  success.  Volcker’s  economic 
objective  was  achieved  thanks  to  his  decision  to 
emphasize  price  stability  and  to  de-emphasize  (also 
on  theoretical  grounds)  the  possibility  that  mone¬ 
tary  policy  could  affect  the  level  of  output.  This,  in 
turn,  relied  on  the  flexible,  dual-mandate  design  of 
the  Fed  and,  above  all,  on  Volcker’s  intellectual 
authority  and  moral  credentials.  To  many  practitio¬ 
ners,  such  as  Alan  Blinder,  this  is  a  critical  asset  to 
forge  the  near  unanimity  required  in  the  collegiate 
decisions  within  the  Fed.  Also,  insofar  as  monetary 
policy  works  through  the  markets,  the  technical 
and  moral  credibility  of  practitioners  in  the  eyes  of 
market  actors  are  critical.  The  technical  complexi¬ 
ties  of  monetary  management  make  this  mode  of 
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personal,  contingent  authority  a  powerful  instru¬ 
ment  of  persuasion  and  legitimation  in  the  delib¬ 
erative  processes  in  Congress — and  in  the  eyes  of 
the  relevant  constituencies. 

The  systemic  international  functions  of  the  Fed 
and  other  core  central  banks — both  as  rule  makers 
and  in  their  capacity  of  governance — relate  to  one 
of  the  most  consequential  implications  of  the  dra¬ 
matic  rise  in  U.S.  interest  for  developing  countries, 
particularly  for  Latin  America:  soaring  indebted¬ 
ness,  vulnerability  to  further  external  shocks,  and 
the  withdrawal  of  foreign  investors  for  9  years. 
Central  banks  in  core  countries  were  called  to  lead 
the  coordination  of  the  international  arrangements 
aimed  at  deflecting  to  debtors  the  costs  of  the  U.S. 
adjustments.  The  threat  that  their  rising  debts 
might  feed  on  the  international  banking  system 
prompted  the  emergence  of  new  institutional 
arrangements,  in  concert  with  the  International 
Monetary  Fund  (IMF)  and  the  representatives  of 
the  private  banking  system.  The  responses  to  the 
“debt  crisis”  in  Latin  America  were  designed  by 
the  committees  organized  to  this  end  and  enforced 
on  a  country-by-country  basis.  They  were  condi¬ 
tioned  on  the  implementation  of  structural  eco¬ 
nomic  reforms,  including  deregulation  of  the 
domestic  financial  systems  and,  ultimately,  central 
bank  independence.  Competition  for  credit  and 
foreign  investments  explains  the  compliance  with 
most  of  those  prescriptions.  Sylvia  Maxfield 
explains  the  move  toward  central  bank  indepen¬ 
dence  in  developing  countries  as  driven  by  the  need 
to  obtain  credibility  in  the  eyes  of  increasingly 
competitive  financial  market  actors.  However,  to 
the  extent  that  this  is  a  global  trend  across  five 
continents,  the  recasting  powers  of  the  institution 
at  the  global  level  must  be  specified  too. 

Independence  as  Global  Governance: 

The  Emergence  of  Private  Authority 

Underlying  the  problematique  of  central  bank 
independence  is  a  critical  shift  in  the  mode  of 
interaction  between  the  “monetary  authority” 
and  market  actors.  The  rising  influence  of  cross- 
border  financial  transactions  is  closely  linked  to 
the  empowerment  of  foreign  exchange  markets 
and  disintermediated  debt  and  bond  markets  in 
individual  nation-states.  The  intensified  competi¬ 
tion  among  banks  and  between  banks  and  other 


financial  intermediaries  in  the  past  3  decades  con¬ 
tinues  to  be  central  to  this  process.  Competitive 
pressures  explain  also  the  proliferation  of  “finan¬ 
cial  innovations” — that  is,  the  recurring  creation 
of  new  debt/credit  instruments  intended  to  make 
them  more  attractive  to  borrowers  and  to  lenders, 
both  corporate  and  public.  Such  innovations  lie 
outside  the  direct  control  of  domestic  central 
banks,  are  enhanced  by  information  technology, 
and  are  driven  by  the  deregulation  and  the  liberal¬ 
ization  of  domestic  financial  markets. 

In  an  international  system  where  there  is  no 
world  government,  governance  capacity  is  found  in 
multiple  sites  of  monetary  (and  fiscal)  authority — 
public  and  private,  national  and  transnational. 
Hall  hypothesizes  that  central  banks  are  integrated 
into  an  emerging  new  system  of  global  multilevel 
monetary  governance.  Public  transnational  institu¬ 
tions  endowed  with  supervisory  and  coordinating 
monetary  powers  include  the  Bank  of  International 
Settlements  (BIS)  and  the  European  Central  Bank. 
The  fact  that  adherence  to  such  rule-based  systems 
is  voluntary — and  applied  domestically  by  statu¬ 
tory  authority — indicates  that  its  implementation 
depends  also  on  domestic  political  variables.  At  the 
national  level,  the  executive  is  the  main  locus  of 
monetary  governance;  a  central  bank  may  share 
them  with  fiscal  authorities  if  it  has  not  been 
granted  independence  by  the  government. 

In  this  context,  private  market  actors  operating 
in  the  foreign  exchange  and  disintermediated  bond 
markets  are  empowered  to  adjudicate  the  credibil¬ 
ity  of  fiscal  and  monetary  policies  through  the 
operation  of  the  markets.  This  function  is  comple¬ 
mented  by  private  rating  agencies  that  grade  the 
market  of  sovereign,  corporate,  and  municipal 
debt  instruments.  All  such  shifts  reflect  the  unprec¬ 
edented  emergence  of  private  authority  conceptu¬ 
alized  by  Thomas  Biersteker  and  Bruce  Hall. 

It  is  against  this  shifting  context  of  deregulated, 
liberalized  financial  markets — where  all  money  is 
fiat  money — that  the  global  trend  toward  central 
bank  independence  becomes  meaningful.  Con¬ 
structing  a  trustworthy  monetary  authority  is 
about  making  credible  its  long-term  commitment 
to  price  stability,  ensuring  private  and  public 
wealth  holders  that  their  loans  will  be  redeemed  at 
their  real  value,  plus  a  risk  premium.  Insofar  as 
central  banks  can  directly  control  only  very  short 
nominal  rates  of  interest,  the  technical  question  of 
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how  monetary  policies  will  work  effectively  through 
the  markets  is  a  matter  of  great  concern.  Economists 
and  practitioners  of  all  persuasions  agree  that  it  has 
been  settled  technically  along  the  lines  of  the  ratio¬ 
nal  expectations  theory.  In  other  words,  monetary 
policy  can  work  through  the  markets  when  central 
banks  properly  anticipate  and  coordinate  future 
expectations  of  economic  agents.  For  this  reason, 
the  case  for  central  bank  independence  at  the 
global  level  has  been  strengthened  by  the  rational 
expectations  school  and  its  analysis  of  the  condi¬ 
tions  necessary  to  attain  credibility  with  financial 
markets.  The  central  bank’s  independence  from 
governments  and  transparency  in  communicating 
its  long-term  future  monetary  strategies  are  the 
major  conditions  enabling  the  institution  to  achieve 
credibility  with  market  actors.  In  this  intellectual 
and  economic  context,  the  economic  and  financial 
impacts  of  central  bankers’  utterances  and  commu¬ 
nicating  strategies  can  hardly  be  exaggerated. 

While  rationalists  assume  a  self-equilibrating 
free  market  of  actors  acting  independently  and 
including  in  their  calculations  the  expected  impact 
of  monetary  policies,  constructivists  assume  a  rela¬ 
tional  mode  of  interaction  between  them  and 
among  market  actors.  It  rests  on  shared  under¬ 
standings  of  the  rules,  norms,  and  values  that 
underlie  what  are  and  what  should  be  considered 
best  practices — underpinned  by  epistemic  commu¬ 
nities  of  monetary  economists  and  central  bankers. 
Hall,  a  political  economist  of  a  constructivist  cast, 
sets  the  generalized  trend  toward  central  bank 
independence  in  the  context  of  multilevel  global 
governance:  The  powers  assigned  to  the  institution 
are  related  to  its  ultimate  role  as  the  only  nominal 
anchor  to  achieve  international  stability.  Policy 
convergence,  in  this  approach,  is  meant  to  drive 
the  most  critical  structures  down  to  the  lowest  pos¬ 
sible  national  level  of  governance  so  that  a  uni¬ 
formly  effective  impetus  toward  price  stability  can 
be  provided.  Whether  or  not  this  desideratum 
implied  in  the  rational  expectations  theory  is  fea¬ 
sible  is  a  quintessentially  political  problem.  It  is 
constrained  by  the  current  shifts  in  the  axis  of 
global  monetary  power,  in  particular  the  diversity 
of  monetary  strategies  of  new,  “catching-up” 
countries.  China’s  central  bank  is  the  most  extreme 
case  of  subordination  to  the  unique  strategies  of 
the  government,  aimed  at  national  development 
and  integration  into  the  world  market. 


There  are,  however,  many  points  of  intersec¬ 
tion  between  the  two  schools.  Most  important 
among  them  is  the  unspoken  assumption  underly¬ 
ing  their  prescriptions  on  how  to  reconcile  two 
desiderata:  (1)  shielding  the  monetary  authority 
from  political  pressures  and  (2)  political  legiti¬ 
macy.  Both  schools  take  for  granted  the  political 
framework  within  which  central  bankers  operate 
in  core  democratic  governments — that  of  liberal 
constitutionalism.  The  solution  is  then  settled  by 
means  of  the  distinction  between  “full  opera¬ 
tional  independence”  and  “goals  independence.” 
The  discretion  of  central  banks,  in  the  first  case, 
would  be  limited  to  the  choice  of  monetary  tools 
to  achieve  the  goals  ultimately  set  by  elected  poli¬ 
ticians.  Basically,  these  tools  provide  free  access 
to  periodically  publicized  reports  on  the  fiscal 
practices,  the  economic  models,  the  forecasts  and 
outcomes  of  monetary  policy,  and  the  account¬ 
ability  of  elected  politicians,  in  periodic  hearings 
of  Congress. 

Emerging  market  democracies,  in  contrast, 
confront  the  task  of  reconciling  integration  into 
global  markets,  with  its  ensuing  fiscal  and  mone¬ 
tary  disciplines,  while  building  the  institutions  and 
policy  consensus  indispensable  to  legitimate  the 
construction  of  a  monetary  authority.  It  involves 
a  long-term  project,  an  open-ended  process  of 
democratic  deliberation,  leadership,  and  the  devel¬ 
opment  of  constituencies  supportive  of  price  and 
financial  stability  as  priority  goals.  In  this  regard, 
they  constitute  experiments  in  statecrafting  mon¬ 
etary  (and  fiscal)  democratic  authorities. 
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Scholars  in  political  science  have  approached  the 
problem  of  institutional  change  in  two  very  differ¬ 
ent  ways  that  can  be  explained  by  the  type  of 
institutions  on  which  they  have  focused. 
Institutions  are  most  commonly  understood  as  the 
rules  of  the  game.  As  such,  they  can  be  written 
formal  statutes,  such  as  constitutions,  electoral 
systems,  and  legal  codes.  They  can  also  be  gener¬ 
ally  established  social  norms,  such  as  routines, 
customs,  and  habits.  In  almost  all  accounts  of 
institutions,  they  are  understood  as  relatively  sta¬ 
ble  and  persisting  entities  that  do  not  change  eas¬ 
ily  or  instantly.  The  broad  notion  of  institutions 
and  the  focus  on  continuity  imply  that  under¬ 
standing  institutional  change  is  both  complicated 


and  equivalent  to  understanding  large-scale  social 
and  historical  change. 

The  reasons  why  institutions  are  usually  seen  as 
durable  and  difficult  to  change  vary.  One  is  that 
institutions  give  rise  to  self-reinforcing  “feedback” 
mechanisms  that  may  or  may  not  be  purposively 
designed.  For  example,  a  social  insurance  program 
may  be  organized  so  that  it  covers  a  majority  of 
the  population,  thus  creating  not  only  social  pro¬ 
tection  but  also  support  from  the  political  major¬ 
ity.  Or  it  may  cater  to  broadly  held  norms  about 
social  justice  in  the  population.  In  either  case,  a 
self-reinforcing  mechanism  is  at  work.  Another 
reason  for  institutional  stability  is  that  none  of  the 
actors  that  are  involved  have  an  incentive  to 
change.  For  example,  political  parties  in  a  “first- 
past-the-post”  two-party  political  system  have  lit¬ 
tle  reason  to  change  to  a  proportional  electoral 
system,  or  vice  versa.  Third,  formal  institutions 
may  become  dominating  social  conventions,  and 
agents  may  have  difficulties  imagining  a  different 
institutional  order. 

Explanations  for  institutional  change  can  be 
divided  into  three  broad  categories.  Change  may 
occur  as  a  result  of  unpredictable  and  unforeseen 
“exogenous  shocks.”  For  example,  countries  may 
radically  change  their  institutions  when  hit  by  an 
international  economic  crisis  or  when  drawn  into 
a  war.  A  second  type  is  a  change  through  an  evo¬ 
lutionary  functionalist  logic.  Institutions  that  best 
suit  the  underlying  structural  changes  survive 
through  the  operation  of  some  kind  of  selective 
mechanism,  and  institutions  that  do  not  follow 
this  functionalist  logic  are  weeded  out  by  the  com¬ 
petition  from  more  successful  institutional  orders. 
Third,  institutions  may  change  by  intentional 
design  by  strategically  acting  as  agents  that  con¬ 
struct  new  institutions  that  serve  their  future  inter¬ 
ests.  However,  the  outcome  of  such  strategic 
design  may  not  always  be  in  line  with  the  agents’ 
intentions  because  outcomes  from  institutional 
changes  are  hard  to  predict,  especially  if  there  are 
simultaneous  changes  of  other  institutions  outside 
the  control  of  the  designing  agents. 

The  Two  Schools  of  Institutional  Change 

There  are  currently  two  very  different  approaches 
to  the  problem  of  institutional  change.  In  develop¬ 
ment  research,  including  research  in  comparative 
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political  economy,  there  is  now  almost  a  consensus 
that  the  problem  of  massive  poverty  (and  the  many 
resultant  social  ills)  in  most  developing  countries  is 
due  to  their  dysfunctional  social,  legal,  and  political 
institutions  (e.g.,  Daron  Acemoglu  &  James 
Robinson,  2008).  This  problem,  also  known  as  the 
“good-governance”  or  the  “quality-of-government” 
problem,  has  been  well  captured  by  the  economist 
Dani  Rodrik  (2007,  p.  153),  who  argued  that  the 
encounter  between  neoclassical  economics  and 
developing  societies  has  revealed  the  great  extent 
to  which  market  economies  rely  on  a  complex  and 
not  well  understood  set  of  social,  legal,  and  politi¬ 
cal  institutions.  Rodrik’s  list  of  such  institutions  is 
extensive:  a  well-specified  system  of  property 
rights,  effective  regulation  that  hinders  monopolies 
from  dominating  markets,  uncorrupted  govern¬ 
ments,  the  rule  of  law,  and  social  welfare  systems 
that  can  accommodate  risks.  In  addition,  Rodrik 
adds  the  importance  of  informal  institutions  that 
foster  social  cohesion,  social  trust,  and  coopera¬ 
tion.  Following  Douglass  North,  Rodrik  criticizes 
neoclassical  economics  for  omitting  the  impor¬ 
tance  of  such  institutions  and  argues  that  the  prob¬ 
lem  is  that  most  economists  usually  take  them  for 
granted.  In  fact,  most  poor  countries  are  plagued 
by  a  very  different  set  of  institutions,  such  as  sys¬ 
temic  corruption,  systematic  infringements  on 
property  and  human  rights,  and  low  levels  of 
social  trust  and  lack  of  institutions  that  can  handle 
individual  and  collective  risks.  In  this  type  of 
analysis,  the  main  problem  is  to  explain  why  soci¬ 
eties  with  a  set  of  institutions  that  are  clearly  dys¬ 
functional  in  terms  of  general  social  well-being 
generally  seem  unable  to  adopt  the  “new  set”  of 
institutions  that  Rodrik  lists  and  that  are  known  to 
lead  to  economic  prosperity  and  social  well-being. 
As  Bo  Rothstein  (2005)  notes,  the  central  problem 
in  this  type  of  analysis  is  that  the  historical  record 
clearly  shows  that  dysfunctional  institutions,  such 
a  systemic  corruption,  clientelism,  and  patronage, 
are  very  hard  to  change.  Political  and  economic 
leaders  in  these  systems  may  be  able  to  change  the 
institutions  from  above,  but  since  they  are  the 
group  that  collects  the  most  rents  from  the  system, 
they  have  no  incentive  to  induce  change,  at  least 
not  in  a  short-term  perspective.  Change  of  these 
systems  from  below  is  difficult  because  agents  who 
want  change  face  a  formidable  collective  action 
problem.  Thus,  as  Douglass  North,  John  Wallis, 


and  Barry  Weingast  (2009)  have  shown,  in  this 
research  area,  the  main  issue  is  the  lack  of  institu¬ 
tional  change  despite  the  pressing  human  need  for 
such  change.  For  example,  systemic  corruption  not 
only  hurts  economic  growth,  causes  extreme  pov¬ 
erty,  and  induces  civil  strife,  but  it  also  has  devas¬ 
tating  humanitarian  consequences  in  areas  such  as 
provision  of  health  care,  level  of  infant  mortality, 
and  access  to  safe  water. 

However,  in  several  recent  studies  of  the  rich 
welfare  states  and  their  systems  of  industrial  rela¬ 
tions  and  social  provisions,  the  picture  is  very  dif¬ 
ferent.  Here,  change  of  important  institutions 
occurs  almost  constantly  in  a  mostly  incremental 
process,  either  because  agents  are  able  to  find  com¬ 
promises  and  viable  solutions  to  common  prob¬ 
lems  or  as  a  result  of  power  struggles.  As  James 
Mahoney  and  Kathleen  Thelen  (2009)  and  Thelen 
(2009)  have  shown,  adaptation  to  new  circum¬ 
stances  such  as  increased  international  competition 
for  goods  and  services,  globalization  of  financial 
markets,  and  new  technologies  in  the  production 
process  may  not  always  be  easy  in  these  societies 
and  may  be  the  outcome  of  difficult  political  dis¬ 
putes  and  difficult  compromises.  Still,  the  general 
picture  is  that  in  such  societies,  political  and  eco¬ 
nomic  elites  are,  after  some  initial  difficulties,  usu¬ 
ally  able  to  gradually  change  central  institutions  to 
bring  about  the  distribution  of  resources  and 
power  that  they  believe  will  follow  from  their  ideas 
of  new  institutional  designs. 

A  problem  for  our  understanding  of  institu¬ 
tional  change  is  that  these  two  research  approaches 
seldom  communicate  their  theories,  problems,  or 
results.  Why  do  the  political  and  economic  elites  in 
some  (rich,  capitalist)  societies  usually  manage  to 
change  institutions,  while  other  (poor,  developing) 
societies  seem  stuck  with  formal  as  well  as  infor¬ 
mal  institutions  that  are  fundamentally  detrimen¬ 
tal  to  their  need  for  economic  growth  and  general 
social  well-being?  One  reason  for  this  apparent 
paradox  may  be  that  these  different  approaches  to 
institutional  change  are  dealing  with  institutions 
that  are  fundamentally  different. 

Two  Logics  of  Institutional  Change 

The  idea  that  institutions  include  not  only  formal 
but  also  informal  rules  implies  that  it  is  difficult  to 
distinguish  them  from  a  society’s  basic  cultural 
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traits.  In  explaining  institutional  change,  this 
becomes  problematic  because  while  it  is  possible  to 
change  codified  systems  of  rules  and  “standard 
operating  procedures”  through  methods  such  as 
direct  and  deliberate  political  intervention,  this  is 
much  more  difficult  with  institutions  viewed  as 
shared  mental  models  and  other  such  generally 
held  basic  beliefs  that  are  rooted  in  a  society’s  his¬ 
torically  established  culture.  For  example,  one 
such  informal  institution  is  people’s  belief  that  oth¬ 
ers  in  their  society  are,  in  general,  trustworthy. 
Comparative  survey  research  shows  that  this  infor¬ 
mal  institution  varies  to  a  large  extent  among 
countries.  Clearly,  changing  such  a  system  of 
beliefs  is  not  the  same  thing  as  changing  a  piece  of 
social  insurance  legislation  or  a  system  of  wage 
negotiations.  Thus,  as  a  first  distinction,  we  can 
differentiate  between  two  basic  logics  of  institu¬ 
tional  change,  namely,  change  in  a  society’s  formal 
institutions  and  change  in  its  informal  ones. 

A  second  distinction  has  been  suggested  by 
George  Tsebelis  (1990) — namely,  between  “redis¬ 
tributive”  and  “efficient”  institutions.  Redistributive 
institutions  are  rules  that  move  resources  or  power 
from  one  group  of  agents  to  another.  A  familiar 
example  would  be  most  tax  systems  and  systems  of 
industrial  relations.  Historically,  one  can  also 
think  of  slavery  as  a  formal  redistributive  institu¬ 
tion.  Examples  of  redistributive  informal  institu¬ 
tions  would  be  professional  networks  that  give 
status  and  prestige  to  a  particular  set  of  its  mem¬ 
bers.  Another  example  is  economic  systems  in 
which  business  contracts  and  career  systems  follow 
family  lines.  Third,  many  political  systems  are 
characterized  by  informal  clan-based  nepotism  or 
various  forms  of  corruption. 

Socially  efficient  institutions,  on  the  other  hand, 
have  quite  the  opposite  character  since  their  pur¬ 
pose  is  to  improve  the  welfare  of  all  actors  in  a 
system  of  exchange.  As  such,  they  are  genuine 
public  goods,  implying  that  they  serve  a  common 
interest.  According  to  Elinor  Ostrom  (1990),  seen 
in  light  of  noncooperative  game  theory,  these  are 
institutions  that  make  it  possible  to  avoid  situa¬ 
tions  known  as  suboptimal  outcomes  in  prisoner’s 
dilemma,  collective  action,  or  tragedy  of  the  com¬ 
mons  types  of  interaction.  Rothstein  (2005)  notes 
that  in  the  closely  related  theory  about  social 
dilemmas,  socially  efficient  institutions  make  it 
possible  for  agents  to  avoid  ending  up  in  situations 


known  as  social  traps.  These  are  all  situations  in 
which  the  agents  know  that  they  would  all  be  better 
off  if  they  could  collaborate  to  establish  a  common 
set  of  institutions;  however,  because  they  do  not 
trust  that  the  other  agents  will  adhere  to  the  rules 
or  contribute  to  the  costs  of  establishing  and  run¬ 
ning  the  institution,  the  individual  agents  have  no 
incentive  to  collaborate.  This  lack  of  trust  leads  to 
a  situation  in  which  all  agents  are  worse  off.  For 
example,  a  corrupt  doctor  in  a  public  health  system 
may  realize  that  all  would  gain  by  ending  corrup¬ 
tion,  but  it  becomes  meaningless  for  the  individual 
doctor  to  stop  taking  bribes  if  she  or  he  cannot 
trust  that  most  other  doctors  would  do  the  same. 
Similarly,  it  makes  little  sense  for  the  single  judge  or 
police  officer  to  stop  taking  bribes  if  he  cannot  trust 
that  (almost)  all  his  colleagues  will  also  refrain 
from  this  practice.  In  an  emerging  democracy,  it 
makes  little  sense  to  be  the  only  political  party  that 
does  not  tamper  with  ballot  boxes.  The  list  of 
examples  illustrating  this  problem  is  endless. 

Examples  of  socially  efficient  formal  institu¬ 
tions  are  the  rule  of  law,  protection  of  property 
rights,  and  an  honest  and  impartial  civil  service.  In 
economic  terms,  such  institutions  are  public  goods 
because  they  dramatically  lower  what  are  known 
as  transaction  costs  and  thereby  contribute  greatly 
to  economic  prosperity.  Universal  social  insurance 
systems  can  also  be  categorized  as  socially  efficient 
institutions  since  they  handle  individual  risks  in  a 
more  cost-efficient  way  than  do  market-based 
social  insurance  systems. 

The  most  well-known  example  of  informal 
socially  efficient  institutions  is  the  existence  of  gen¬ 
eralized  trust  that  is  widespread  in  the  population. 
Institutions  such  as  this  increase  the  likelihood  that 
other  agents  will  not  use  opportunistic  or  treacher¬ 
ous  strategies  but  instead  will  behave  in  a  reciprocal 
manner.  Thereby,  generalized  trust  also  decreases 
transaction  costs,  but  in  a  less  costly  way  than  if 
agents  were  to  use  only  the  formal  legal  system. 

The  reason  why  it  is  so  difficult  to  change  from 
dysfunctional  to  socially  efficient  institutions  (e.g., 
establishing  honesty  in  a  corrupt  civil  service  or 
changing  political  exchange  from  clientelism  to 
impartiality)  is  that  such  a  change  necessitates  the 
trust  among  agents  that  (almost)  all  of  them  will 
play  by  the  new  rules  from  time  t1  and  that  such  a 
trust  is  not  easily  manufactured  by  political  means. 
As  Ostrom  (1990)  has  argued,  socially  efficient 
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institutions  can  be  seen  as  collective  action  prob¬ 
lems  of  the  second  order. 

It  should  be  added  that  the  distinction  between 
socially  efficient  and  redistributive  institutions  is  a 
theoretical  construct  and  that  many,  if  not  most, 
institutions  have  traits  of  both  redistribution  and 
social  efficiency.  Why  the  distinction  is  useful  may 
be  better  understood  from  the  following  parallel 
example.  Most  political  scientists  are  willing  to 
distinguish  between  democratic  and  nondemo- 
cratic  political  systems.  However,  all  democratic 
systems  contain  nondemocratic  elements,  and 
most  nondemocracies  feature  some  kind  of  system 
for  popular  influence  and  for  establishing  consent. 

The  reason  why  scholars  studying  the  rich 
democracies  and  scholars  studying  developing 
countries  have  come  to  such  different  conclusions 
about  the  possibility  for  institutional  change  is  that 
they  study  fundamentally  different  types  of  institu¬ 
tions.  The  former  usually  concentrate  on  redis¬ 
tributive  institutions  for  which  the  analysis  of 
power  constellations  explains  change.  The  latter 
study  the  lack  of  socially  efficient  institutions, 
which  are  difficult  to  establish  because  they  are 
public  goods.  According  to  Acemoglu  and  Rob¬ 
inson  (2008),  the  problem  is  that  there  is  currently 
no  generally  accepted  theory  for  how  they  can  be 
established  or  why  they  are  reproduced.  Following 
standard  assumptions  about  self-interested  ratio¬ 
nal  agency,  socially  efficient  institutions  of  the  type 
described  above  would  not  exist,  and  poor  coun¬ 
tries  that  lack  them  would  not  be  able  to  change 
from  inefficient  to  efficient  institutions. 
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Charisma 


Most  basically,  charisma  refers  to  a  rare  trait 
found  in  certain  humans  that  combines  unusual 
charm  and  an  ascribed  “magnetic”  quality  of  per¬ 
sonality  and/or  appearance.  Deriving  from  the 
Greek  word  xdpicrp.a  (charisma),  it  includes  the 
meaning  of  “gift”  or  “divine  favor,”  in  particular 
with  reference  to  innate  and  sophisticated  per¬ 
sonal  communication  and  persuasiveness. 

Charisma  as  a  concept  has  been  employed  in 
several  contexts,  as  a  religious  concept,  a  socio¬ 
logical  concept,  or  a  psychological  concept,  and 
also  in  journalistic  and  common  language  usage. 
Although  impossible  to  operationalize  accurately, 
charisma  is  often  used  to  describe,  or  rather  to 
label,  a  personality  trait  that  includes  the  seem¬ 
ingly  “supernatural”  ability  of  some  rare  persons 
to  lead,  charm,  persuade,  and  inspire  others  due  to 
their  “magnetic,”  alluring  quality,  without  using 
power  or  threat.  It  refers  to  an  ascribed  quality, 
not  to  be  achieved  purposefully,  in  certain  people 
who  draw  the  attention  and  admiration  (but  also 
hatred  if  the  attribution  of  charisma  is  perceived  to 
be  dangerous)  of  others. 

As  a  political  concept,  charisma  became  famous 
most  of  all  due  to  the  analytical  formulation  by 
Max  Weber  (1864-1920)  as  one  of  his  three  types 
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of  legitimate  rule  ( legitime  Herrschaft).  In  addi¬ 
tion  to  the  rational-legal  (-bureaucratic)  rule 
(. legale  Herrschaft)  and  the  traditional  rule  (tradi- 
tionale  Herrschaft),  the  charismatic  rule  (charis- 
matische  Herrschaft)  was  applied  by  Weber 
(1924/1947)  to  historical  forms  of  political  domi¬ 
nation  “resting  on  devotion  to  the  exceptional 
sanctity,  heroism  or  exemplary  character  of  an 
individual  person,  and  of  the  normative  patterns 
or  order  revealed  or  ordained  by  him”  (p.  215).  In 
his  writings  about  charismatic  rule,  Weber  (1946) 
applies  the  term  charisma  to 

a  certain  quality  of  an  individual  personality  by 
virtue  of  which  he  is  considered  extraordinary 
and  treated  as  endowed  with  supernatural, 
superhuman,  or  at  least  specifically  exceptional 
powers  or  qualities.  These  are  not  accessible  to 
the  ordinary  person,  but  are  regarded  to  be  of 
divine  or  similar  supernatural  origin,  and  on  the 
basis  of  which  an  individual  is  perceived  and 
accepted  as  a  “leader.”  .  .  .  How  the  quality  in 
question  is  ultimately  judged  from  any  more 
general  ethical,  aesthetic,  or  other  point  of  view 
is,  of  course,  irrelevant  for  purposes  of  definition. 
What  only  matters  is  how  the  individual  is 
actually  regarded  by  his  “adherents”  or 
“disciples.”  (p.  329) 

Charisma  thus  denotes  a  relationship  rather 
than  an  individual  personality  attribute.  Should  the 
strength  of  this  belief  fade,  the  domination  of  the 
charismatic  leader  can  also  fade  quickly,  which  is 
why  this  form  of  rule  can  be  very  unstable,  as  the 
exercise  of  charisma  is  founded  purely  on  its  recog¬ 
nition  by  the  leader’s  followers.  Although  charis¬ 
matic  domination  evolves  in  contexts  of  traditional 
and/or  rational-legal  rule,  it  tends  to  challenge 
these  forms  of  rule  and  is  therefore  interpreted  by 
Weber  as  revolutionary.  Although  charismatic 
domination  represents  personal  and  noninstitu- 
tionalized  leadership,  Weber  points  to  some  pro¬ 
cesses  of  longer  duration  for  which  he  employed 
the  term  routinization  of  charisma  ( Veralltagli  chung 
des  Charisma).  Under  the  constant  challenge  of 
traditional  and/or  rational-legal  forms  of  domina¬ 
tion,  he  recognizes  the  partial  institutionalization 
of  charisma  through  the  establishment  of  specified 
positions  open  exclusively  to  persons  who  demon¬ 
strate  special  personal  qualities.  Institutionalized 


charisma  is  also  represented  by  charisma  of  office 
(. Amtscharisma ),  which  pertains  to  beliefs  that  cer¬ 
tain  officeholders,  by  virtue  of  occupying  an  office 
(e.g.,  priesthood),  acquire  certain  special  powers 
or  qualities.  By  contrast,  the  pure  personal  cha¬ 
risma  attributed  to  revolutionaries,  prophets,  and 
sages  resists  institutional  influences;  it  is  antitheti¬ 
cal  to  stable  authority  based  on  fixed  codes  and 
customs. 

Weber  saw  the  charismatic  leader  as  a  vital 
agent  of  social,  political,  and  religious  change, 
which  led  to  some  criticism  attributing  an  idealistic 
“Great  Men”  theory  of  history  to  him.  In  any  case, 
charismatic  leaders  generally  are  said  to  arise  in 
unsettled  times  permeated  by  disorienting  socio¬ 
cultural  change.  In  such  periods,  unconventional 
political  and/or  religious  groups  arise  composed  of 
people  who  are  fearful  of  the  future,  who  hope 
that  by  placing  their  faith  in  some  charismatic 
leader  they  will  eradicate  the  past  and  protect  their 
lives  against  unknown  and  unseen  dangers.  For 
example,  the  two  most  charismatic  leaders  in  the 
20th  century  are  Mahatma  Gandhi  in  India  and 
Nelson  Mandela  in  South  Africa.  Ayatollah 
Khomeini  of  Iran  is  an  example  of  a  leader  with 
combined  personal  and  office  charisma. 

The  volatility  of  charismatic  authority  and  of 
groups  manifesting  charismatic  leadership  has 
been  a  persistent  theme.  In  essence,  charismatic 
domination  is  unstable  because  it  lacks  both  insti¬ 
tutional  restraints  and  institutional  supports.  The 
absence  of  institutional  restraints  on  charismatic 
leaders  coincides  with  the  lack  of  institutional  sup¬ 
ports  available  to  sustain  a  leader’s  authority. 
Charismatic  domination  is  fundamentally  a  pre¬ 
carious  state  of  affairs  because  the  leader’s  claims 
to  domination  rest  purely  on  subjective  factors,  as 
the  perceptions  of  the  followers  of  his  or  her 
extraordinary  qualities  may  be  situation  specific 
and  ephemeral.  The  charismatic  leader  must  con¬ 
tinually  face  the  prospect  that  her  or  his  special 
“gift”  of  “grace”  will  no  longer  be  perceived  the 
way  it  once  was  and  that  her  or  his  authority  will 
fade,  as  happened  in  the  case  of  Kwame  Nkrumah 
of  Ghana.  Charismatic  leaders  must  continually  be 
on  the  alert  for  threats  to  their  authority  from  out¬ 
siders,  dissidents,  and  rivals  within  the  movement 
as  well  as  from  their  administrative  staff  of  tradi¬ 
tional  and/or  rational-legal  domination.  The  latter 
is  generally  oriented  toward  expanding  the  scope 
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of  its  authority  and  rationalizing  administrative 
procedures  to  the  detriment  of  the  charismatic 
leader’s  freedom  of  action,  sometimes  leading  to 
his  or  her  actual  deposition. 

An  additional  consequence  of  the  lack  of  insti¬ 
tutional  supports  for  charismatic  leadership 
involves  the  absence  of  regularized  procedures  for 
the  transfer  of  authority,  that  is,  the  problem  of 
succession.  The  failure  to  effectively  institutional¬ 
ize  the  charisma  of  the  founding  leader  usually 
leads  to  intensifying  factionalism  and  ultimately  to 
the  end  of  charismatic  domination. 

The  precariousness  and  instability  of  charis¬ 
matic  leadership  and  its  consequences  in  terms  of 
group  volatility,  factionalism,  and  possible  vio¬ 
lent  episodes  represent  instances  in  which  the 
concept  of  charisma  can  facilitate  the  explanation 
of  sociohistorical  events.  In  this  context,  the  con¬ 
cept  of  charisma  has  been  criticized  as  basically  a 
descriptive  concept  that  labels  rather  than  explains 
the  power  of  leaders  and  the  submission  of  believ¬ 
ers.  It  has  been  suggested,  further,  that  the  devel¬ 
opment  of  controversial  new  movements  and 
noninstitutionalized  forms  of  charismatic  behav¬ 
ior  has  been  enhancing  the  importance  of  cha¬ 
risma.  Yet  charismatic  authority  and  its  concomi¬ 
tants  in  terms  of  the  tendency  to  view  social 
relationships  and  organizations  in  personal  terms 
and  to  envision  a  messianic  termination  of  present 
evils  is  generally  thought  to  be  associated  more 
with  traditional  rather  than  complex  modern 
societies. 
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Christian  Democratic  Parties 


Political  families  can  be  identified,  following 
Maurice  Duverger’s  approach  of  families  spiritu- 
elles,  as  parties  that  share  the  same  ideological 
inspiration,  by  adopting  Stein  Rokkan’s  structural 
approach,  which  identifies  parties  according  to 
their  roots  in  social  cleavages  or  in  terms  of  insti¬ 
tutional  international  networks,  which  connect 
ideologically  similar  parties  in  various  countries. 
In  the  case  of  the  Christian  Democratic  parties 
(hereafter  CDPs),  all  these  approaches  provide 
fruitful  hints  in  assessing  the  profile  of  this  politi¬ 
cal  family  across  countries,  since  the  evolution  of 
the  ideological  and  social  orientations  coincided 
with  changes  in  the  international  organization. 
This  entry  discusses  some  specific  features  of 
CDPs,  their  origins  and  their  evolution  over  time, 
and  their  role  in  the  present  world. 

CDP  is  a  specific  label  that  implies  a  rather  nar¬ 
row  focus — parties  sharing  the  “Christian- 
democratic  ideology.  ”  However,  the  acronym  CDP 
is  often  used  more  loosely.  If  we  were  to  follow  the 
structuralist  and  ideological  approaches  very 
strictly,  all  Christian  parties  would  be  part  of  this 
political  family.  To  classify  CDPs,  Rokkan’s  frame¬ 
work  can  be  used  to  refer  specifically  to  religious 
parties  that  either  arose  in  Catholic  countries  as  the 
result  of  a  church-state  conflict  or  were  formed 
when  independence  was  sought  by  Catholics  versus 
a  dominant  power  as  in  Belgium,  Ireland,  and  also 
partly  Poland  at  the  time  of  national  independence 
after  World  War  I.  Because  the  formation  of  CDPs 
involved  primarily  the  relationship  between  the 
Catholic  Church  (hereafter  the  Church)  and  the 
political  sphere,  Protestant  denominations  of 
Christianity  play  a  minimal  role  in  CDPs  (with  the 
partial  exception  of  the  Netherlands)  and,  thus,  are 
not  discussed  in  this  entry. 
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However,  to  provide  a  complete  picture,  other 
Christian  parties  do  exist  in  Europe — for  example, 
in  Norway  since  the  end  of  World  War  II  and  in 
Sweden  and  Finland  since  the  1960s  and  the  1970s, 
respectively.  But  these  parties  represent  only  a  very 
small  number  of  voters.  After  1989,  some  CDPs 
emerged  in  the  new  democracies  in  Central-Eastern 
Europe — for  example,  the  Christian  Democratic 
People’s  Party  (Keresztenydemokrata  Neppart, 
KDNP)  in  Hungary,  the  Slovenian  Christian 
Democrats  (Slovenski  krscanski  demokrati,  SKD) 
in  Slovenia,  the  Lithuanian  Christian  Democratic 
Party  (Lietuvos  krikscionys  demokratai  partija, 
LKDP,  which  in  2001  merged  with  the  Christian 
Democratic  Union  to  form  the  Lithuanian  Christian 
Democrats — Lietuvos  krikscionys  demokratai, 
LKD)  in  Lithuania,  the  Christian  Democractic 
Movement  (Krest’anskodemokraticke  hnutie, 
KDH)  in  Slovakia,  and  the  Christian  and  Democratic 
Union/Czech  People’s  Party  (Krest’anska  a 
demokraticka  unie/Ceskoslovenska  strana  lidova, 
KDU-CSL)  in  the  Czech  Republic.  All  these  parties 
and  the  various  subsequent  mergers  never  played  a 
pivotal  role  comparable  with  their  Western  coun¬ 
terparts.  In  most  of  these  cases,  they  did  not  even 
attain  10%  of  the  votes.  Most  striking  is  the 
absence  of  a  true  CDP  in  the  fervently  Catholic 
Poland,  except  for  a  very  short  period. 

Further,  CDPs  are  present,  in  descending  order 
of  political  influence,  in  Latin  America,  Africa,  and 
Asia.  In  Latin  America,  CDPs  played  a  significant 
role  in  various  countries  thanks  to  prominent  lead¬ 
ers  such  as  Eduardo  Frei  in  Chile  and  Rafael 
Caldera  in  Venezuela.  In  1949,  these  parties  cre¬ 
ated  a  regional  organization,  the  Christian 
Democratic  Organization  of  America  (CDOA). 
The  formation  of  CDPs  in  Latin  America  followed 
a  different  route  from  the  one  taken  in  Europe. 
Some  Catholic  parties  were  present  in  Latin 
America  in  the  19th  century,  but  they  were  much 
more  conservative  than  the  nascent  Catholic  polit¬ 
ical  organizations  in  Europe  at  the  time.  After  the 
1930s,  and  especially  after  the  1950s,  new,  more 
genuine  “Christian  Democratic”  parties  emerged 
in  Latin  America,  often  in  opposition  to  preexist¬ 
ing  Catholic  parties,  as  in  the  case  of  Ecuador  and 
Uruguay.  The  most  successful  case  among  these 
parties  was  the  Chilean  Christian  Democratic 
Party  (Partido  Democrata  Cristiano,  PDC),  which 
played  a  dominant  role  in  the  democratic  periods 


before  and  after  the  military  dictatorship  of 
Augusto  Pinochet.  Other  relevant  parties  are  the 
Social  Christian  Party  (Partido  Social  Cristiano  de 
Venezuela,  COPEI)  in  Venezuela,  the  Guatemalan 
Christian  Democracy  (Democracia  Cristiana 
Guatemalteca,  DCG)  in  Guatemala,  and  the  PDC 
in  El  Salvador,  all  of  which  dramatically  declined 
in  the  late  1990s,  and  the  Social  Christian  Unity 
Party  (Partido  de  Unidad  Socialcristiana,  PUSC)  in 
Costa  Rica  and  the  union  of  the  Christian 
Democratic  Party  with  the  Popular  Democrat  leg¬ 
islative  group  (Partido  Democrata  Cristiano/ 
Democracia  Popular,  PDC-DP)  in  Ecuador. 

Even  in  their  initial  formation,  CDPs  in  Latin 
America  had  a  much  looser  relationship  with  the 
Church  than  did  their  European  counterparts. 
However,  they  have  caught  up  with  European 
trends  both  in  their  emphasis  on  social  concerns — 
at  least  for  most  of  them — in  the  first  postwar 
decades  and  then  in  relaxing  their  religious  con¬ 
nection  following  the  European  trend  of  a  more 
secular-conservative  approach.  Given  the  instabil¬ 
ity  of  the  Latin  American  regimes  and  the  frequent 
disruption  of  democracy  (with  the  exception  of 
Costa  Rica),  this  analysis  will  focus  mainly  on  the 
European  context. 

In  Europe,  a  different  approach  to  the  study  of 
CDPs,  such  as  the  “actor-based”  one  offered  by 
Stathis  Kalyvas,  leads  to  the  same  focus  on 
Catholic  parties  only  (again  with  the  exception  of 
the  Netherlands).  According  to  Kalyvas’s  land¬ 
mark  contribution  (in  his  book  The  Rise  of 
Christian  Democracy  in  Europe)  “core  European” 
CDPs  were  the  by-product  of  Catholic  political 
entrepreneurs’  reactions  to  the  mounting  secular 
and  anticlerical  attitude  of  the  liberal  elites  at  the 
end  of  the  19th  century.  The  Church  did  not  spon¬ 
sor  the  formation  of  religious/Catholic  political 
organizations;  on  the  contrary,  it  acted  to  limit, 
discourage,  or  even  suppress  such  initiatives.  This 
interpretation  stresses  the  autonomous  role  of 
political  entrepreneurs  during  the  formation  of  the 
religious  parties  in  Western  Europe.  The  Catholic 
parties  of  the  late  19th  and  early  20th  centuries 
grounded  their  legitimacy  in  themselves;  they  did 
not  depend  on  an  external  institution  for  their 
legitimation.  Such  internal  legitimacy  is  an  impor¬ 
tant  asset  for  the  institutionalization  and  organiza¬ 
tional  development  of  a  political  party.  But  despite 
these  favorable  internal  conditions,  the  Catholic 
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parties  either  collapsed  or  underwent  profound 
modifications  after  World  War  II.  However,  even 
though  they  played  a  rather  limited  role  in  the  pre¬ 
war  party  systems,  their  role  became  pivotal  and  in 
some  cases  dominant  in  postwar  Europe.  Scholars 
disagree  on  the  question  of  whether  this  shift  in  the 
status  of  European  CDPs  is  best  understood  as  a 
complete  break  between  prewar  and  postwar  par¬ 
ties  or  whether  there  was  some  continuity  despite 
the  change.  This  academic  dispute  can  be  better 
understood  in  light  of  a  series  of  factors:  (a)  the 
attitude  of  the  Church  vis-a-vis  the  “modern 
world,”  and  of  CDPs  in  particular;  (b)  the  ideo¬ 
logical  renewal  of  CDPs;  (c)  the  novel  postwar 
international  context;  and  (d)  the  role  played  by 
CDPs  in  the  various  party  systems. 

Christian  Democratic  Parties’ 
Formation  and  Evolution 

The  Church  and  the  Modern  World 

As  the  first  Catholic  parties  in  Europe  emerged  at 
the  turn  of  the  20th  century,  they  were  in  more  or 
less  open  conflict  with  the  Church  over  the  accep¬ 
tance  of  the  principles  of  liberal  democracy.  The 
Church  still  indulged  in  its  self-representation  of  a 
“ societas  perfecta ”  with  the  Pope  at  its  apex — as  a 
sovereign  among  sovereigns — and  maintained  its 
condemnation  of  modern  society  and  liberal-demo¬ 
cratic  principles,  which  it  held  were  leading  to  a 
general  and  harmful  “social  disorder.”  The  anti¬ 
capitalist  “social  doctrine”  of  the  Church  as 
expressed  in  the  encyclical  Rerum  novarum  (1891) 
derives  from  this  vision  of  society.  However,  in  the 
same  period,  Catholic  intellectual  circles,  especially 
in  Italy  and  France,  elaborated  on  the  mundane  ide¬ 
ology  of  “Christian  democracy,”  which  advocated 
the  protection  and  promotion  of  Catholic  interests 
through  social  networks  and  communities  and  the 
fostering  of  these  positions  in  the  political  arena  as 
well.  In  other  words,  Christian  democracy  recog¬ 
nized  the  opportunity  offered  by  democracy  for 
developing  social  activities  based  on  Catholic  prin¬ 
ciples  and  defending  Catholic  values  (and  interests) 
through  democratic  institutions.  The  Church  radi¬ 
cally  condemned  this  ideology  in  the  encyclical 
Pascendi  (1907),  declaring  that  the  Christian- 
democratic  ideology  would  cause  “confusion 
between  evangelization  and  the  promotion  of 
democracy.”  In  accordance  with  this,  the  pontificate 


of  Pius  XI  (1922-1939)  aimed  at  restoring  the 
“kingdom  of  Christ”  against  the  liberal  society. 

This  intransigent  position  was  confronted  during 
World  War  I  and  its  aftermath.  The  war  involved, 
in  an  unprecedented  way,  the  religious  hierarchies 
and  the  religious  constituency  in  support  of  each 
respective  state,  and  this  relaxed  the  negative  rela¬ 
tionship  between  the  Church  and  the  liberal  state. 
For  the  first  time,  the  Catholic  world  overtly  sup¬ 
ported  the  state,  ending  that  sense  of  extraneousness 
that  had  been  growing  since  the  1880s.  Moreover, 
the  introduction  of  mass  male  suffrage  enfranchised 
millions  of  new  voters,  and  these  voters  needed  a 
guide  to  prevent  their  becoming  adherents  of  atheist 
socialism.  As  a  consequence,  the  Church  somewhat 
relaxed  its  negative  attitude  regarding  political 
activities  by  Catholics  and  Catholic  parties,  even  if 
it  avoided  any  direct  sponsorship. 

CDPs  did  not  exploit  this  “autonomy”  from  the 
Church  by  building  up  a  solid  identity  and  institu¬ 
tionalizing  the  party  organization;  on  the  contrary, 
they  suffered  from  this  distance  and  remained 
prone  to  adapting  themselves  to  whatever  position 
the  Church  took.  Lacking  open  legitimation  by  the 
Church,  they  were  constantly  seeking  the  Church’s 
approval  of  their  policies.  When  the  Church  signed 
concordats  with  fascist  Italy  and  Nazi  Germany  in 
the  1930s,  the  CDPs,  with  few  exceptions,  accom¬ 
modated  themselves  to  fascism  and  in  some 
instances  even  supported  it.  The  most  dramatic 
case  was  provided  by  the  Spanish  Civil  War 
(1936-1939).  The  radical  anticlerical  attitude  of 
the  Spanish  Republican  forces  led  the  Church  to 
sponsor,  even  with  enthusiasm,  General  Francisco 
Franco’s  restoration  and  the  intervention  of  the 
Italian  fascist  troops.  The  Church  espoused  the 
fascist  regimes  so  easily  because  of  its  long-drawn- 
out  opposition  to  modern  society,  whose  liberal 
institutions  and  worldview  the  Church  still 
despised.  In  addition,  the  economic  depression  was 
seen  as  a  failure  of  capitalism,  increasing  the  dis¬ 
satisfaction  with  liberal  society.  The  economic 
crisis  reinvigorated  Christian  social  doctrine,  which 
was  seen  now  as  a  “humanistic”  third  way 
between  capitalism  and  communism. 

The  Emergence  of  Personalism 

In  the  same  period,  however,  a  different  elabo¬ 
ration  was  emerging,  especially  in  Italian  and 
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French  intellectual  circles.  That  milieu,  where 
Emmanuel  Mounier  and  Jacques  Maritain  played 
a  crucial  role,  produced  an  original  and  quite  pow¬ 
erful  “ideology”  of  Christian  inspiration:  personal¬ 
ism.  In  a  nutshell,  personalism  emphasizes  the  role 
of  the  person  in  society:  The  person  is  not  concep¬ 
tualized  as  the  individual  of  the  liberal  tradition, 
an  “isolated  monad,”  but  rather  as  a  part  of  a 
collective  body,  intertwined  with  all  the  others 
through  a  network  of  associations  and  communi¬ 
ties.  Each  person  has  intrinsic  value  and  value  as  a 
member  of  a  community  or  an  intermediary  body: 
Persons  cannot  be  disconnected  from  their  social 
environment,  starting  with  the  family.  This  new 
ideology  played  a  dramatic  role  in  moving  CDPs 
(and  to  some  extent,  the  Church)  toward  accep¬ 
tance  of  the  modern  world  and  in  distancing  them 
from  fascism.  The  most  visible  confirmation  of  this 
change  came  with  the  Pope’s  Christmas  address  in 
1944,  when  for  the  first  time  he  acknowledged 
democracy  explicitly  and  clearly  (apertis  verbis). 
According  to  some  scholars,  however,  Pius  XII 
(1939-1958)  accepted  democracy  only  instrumen- 
tally,  as  the  antagonist  of  communism,  rather  than 
for  its  own  sake.  Thus,  according  to  Martin 
Conway  and  Tom  Buchanan,  Catholicism  in  the 
1940s  and  1950s  remained  intransigent  and  made 
few  concessions  to  the  modern  world. 

Postwar  Christian  Democratic  Parties 

At  any  rate,  the  CDPs  that  reemerged  or  were  cre¬ 
ated  ex  novo  after  World  War  II  proved  much 
stronger  than  those  of  the  prewar  years.  The  inter¬ 
national  context,  the  Church  strategy  and  profile, 
the  CDPs’  ideological-religious  references,  the 
party  systems’  new  configurations,  and  the  issue  at 
stake  are  the  factors  that  explain  this  change. 

Church  Profile  and  Strategy 

World  War  II  changed  the  image  and  the  mass 
perception  of  the  Church.  Contrary  to  the  anti¬ 
modern,  traditionalist,  and  mildly  profascist  stance 
of  the  prewar  Church,  after  the  conflict  it  emerged 
with  a  new  image  thanks  to  its  charitable,  compre¬ 
hensive,  and  universal  messages.  France  was  an 
exception  because  of  the  deep  involvement  of  the 
episcopate  with  the  Vichy  regime,  which  was  par¬ 
tially  counterbalanced  by  the  support  of  the  lower 


clergy  for  the  anti-Nazi  Maquis.  The  Church’s 
acceptance  of  democracy  and  its  more  or  less 
unequivocal  accommodation  with  the  modern 
world  facilitated  a  positive  reception  of  its  role  in 
a  larger  sector  of  the  population  than  was  the  case 
previously.  Even  more  important  was  the  Church’s 
modified  attitude  vis-a-vis  the  political  presence  of 
Catholics.  Whereas  in  the  interwar  years  the 
Church  had  kept  its  distance  from  the  Christian 
parties  and  even  despised  them,  during  the  war, 
the  Church  began  to  support  CDPs  and  even  spon¬ 
sored  their  resurgence  and  the  creation  of  new 
CDPs.  This  positive  relationship  provided  the 
CDPs  with  the  support  of  the  Church  organiza¬ 
tion:  Priests  became  quite  active,  and  parishes 
served  as  local  branches  or  centers  of  support  for 
the  CDPs. 

Anticommunism 

One  reason  for  this  change  of  attitude  lay  in  the 
new  international  setting.  With  the  descending  of 
the  Iron  Curtain  and  with  atheist  communism 
gaining  power  in  Eastern  Europe  and  making 
strides  in  Western  Europe,  too,  the  Church  called 
on  all  religious  persons  to  defend  Christian  civiliza¬ 
tion.  And  the  CDPs  followed  suit.  Anticommunism 
was  the  strongest  and  most  successful  appeal  for 
the  CDPs.  They  became  the  bastion  against  com¬ 
munist  subversion  and  its  antireligious  politics. 
This  appeal  led  to  an  enlargement  of  the  electoral 
constituency;  not  only  the  faithful  but  the  moderate 
voters  as  well  were  mobilized  by  anticommunism. 

Location  of  CDPs  in  the  Party  System 

CDPs  gained  a  pivotal  position  in  the  various 
party  systems.  They  were  almost  always  a  part  of 
coalition  governments  in  Belgium,  the  Netherlands, 
Luxembourg,  Italy,  and  France  until  1958  (and 
afterward  the  residual  component  in  the  United 
Democratic  Front  [UDFJ).  In  Austria  and  Germany, 
however,  they  had  to  be  in  the  opposition  during 
some  periods  (the  Austrian  People’s  Party 
[Osterreichische  Volkspartei,  OVPJ  in  the  1970s 
and  1980s  and  the  Christian  Democratic  Union/ 
Christian  Social  Union  [Christlich-Demokratische 
Union/Christlich-Soziale  Union,  CDU-CSUJ  in  the 
1970s  and  in  the  1998-2005  legislatures  in 
Germany).  The  CDPs’  central  position  and  broader 
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ideological  appeal  were  supported  by  the  postwar 
collapse  of  the  moderate-conservative  bourgeois 
parties.  These  parties  were  discredited  by  their 
inability  to  counteract  the  totalitarianism  of  the 
interwar  period  or  by  their  acquiescence  to  it. 
Where  they  attempted  a  resurgence  after  the  war, 
it  turned  out  to  be  a  fiasco.  The  CDPs  thus 
absorbed  the  conservative  constituency  too.  In  this 
way,  the  CDPs  went  beyond  their  traditional  rural 
constituency,  penetrating  into  the  urban  bourgeoi¬ 
sie  (and  sometimest  the  urban  proletariat,  too, 
thanks  to  the  espousal  of  social  welfare  in  the  first 
years  after  the  war,  following  the  example  of  the 
French  Popular  Republican  Movement  [Mouve- 
ment  Republicain  Populaire,  MRPJ,  the  most  left¬ 
ist  and  welfare  oriented  of  all  CDPs). 

Ideology  and  Policy 

The  postwar  CDPs’  ideological  bases  were  their 
reference  to  the  Church  on  moral  issues;  the  eleva¬ 
tion  of  personalism  as  an  encompassing  “ideol¬ 
ogy”;  the  adoption  of  welfare  systems  combined 
with  the  principle  of  subsidiarity,  placing  the  family 
at  the  center;  an  adequate  dose  of  state  intervention 
congruent  with  the  social  market  economy;  and  a 
consensual/consociational  conception  of  democ¬ 
racy,  multilateralism,  and  peaceful  coexistence.  In 
particular,  in  the  1950s  and  1960s,  CDPs  competed 
with  communists  and  socialists  on  their  own 
ground,  presenting  a  social  and  “humanist”  alter¬ 
native  to  the  Left’s  socialistic  project.  Later,  the 
CDPs  fully  endorsed  the  “social  market  economy,” 
which  became  their  distinctive  economic  policy. 
All  these  elements  have  not  changed  up  to  the  pres¬ 
ent  time,  except  that  state  intervention  has  been 
downsized  due  to  the  neoliberal  mood  of  the 
1980s  and  1990s.  Nevertheless,  CDPs  were 
affected  by  the  combination  of  the  processes  of 
modernization  and  secularization,  which  reduced 
the  social  basis  of  religion  and,  by  the  Second 
Vatican  Council,  had  distanced  many  believers 
from  active  political  support  of  CDPs.  The  Council 
had  a  tremendous  impact  on  the  Church  and  on 
most  fervent  Catholics.  As  far  as  politics  was  con¬ 
cerned,  the  Council  fully  recognized  the  principle 
of  liberal  democracy  as  such — and  not  instrumen- 
tally  as  a  bastion  against  communism — and  it 
disengaged  the  Church  from  day-to-day  politics 
and  from  open  support  of  CDPs.  These  changes 


provoked  some  criticism  and  backlash  against 
many  CDPs  but  not  all  of  them.  As  far  as  their 
electoral  fortunes  were  concerned,  in  the  Neth¬ 
erlands  and  Belgium,  the  impact  was  strong,  nega¬ 
tive,  and  immediate;  in  Austria  and  Switzerland, 
perceptible  but  limited;  and  surprisingly  quite 
limited  in  Italy.  Only  the  CDU-CSU  in  Germany, 
thanks  also  to  its  interreligious  nature,  did  not  suf¬ 
fer  any  reversal. 

The  process  of  secularization  and  the  decline  or 
collapse  of  most  “Christian  Democratic”  parties  in 
the  core  of  Europe  (the  Christian  Democrats 
[Democrazia  Cristiana,  DCJ  in  Italy,  the  Christian 
People’s  Party/Social  Christian  Party  [Christelijke 
Volkspartij/Parti  Social  Chretien,  CVP-PSCJ  in 
Belgium,  and  the  Christian  Democratic  Appeal 
[Christen-Democratisch  Appe,  CDAJ  in  the 
Netherlands)  favored  a  shift  toward  a  more  mod¬ 
erate  and  less  religious  orientation,  following  the 
imprint  of  the  CDU-CSU.  The  evolution  of  the 
European  Peoples  Party  (EPP)  in  the  European 
Union  (EU)  shows  this  tendency  quite  clearly.  EPP 
was  progressively  enlarged  by  moderate  secular 
parties,  which  relaxed  the  religious  tone  while 
keeping  some  basic  tenets  of  the  CDP  ideology: 
(a)  the  humanist  personalism,  (b)  the  centrality  of 
the  family,  (c)  the  welfare  concern  and  the  social 
market  economy,  (d)  subsidiarity,  (e)  the  pro-EU 
attitude,  and  (f)  the  multilateral  orientation  in  the 
international  sphere. 

Electoral  Evolution 

In  the  first  elections  after  World  War  II,  CDPs 
emerged  as  the  strongest  parties  in  France,  Italy, 
Belgium,  the  Netherlands,  Luxembourg,  Austria, 
and  Germany  and  as  the  third  strongest  in 
Switzerland.  With  the  exception  of  France,  where 
the  MRP  rapidly  declined  facing  a  newly  estab¬ 
lished  competitor,  the  Gaullist  Party,  and  then  col¬ 
lapsed  with  the  advent  of  the  Fifth  Republic,  the 
other  CDPs  retained  their  pivotal  position  for  at 
least  3  decades.  In  the  1970s,  their  electoral  for¬ 
tunes  and  key  positions  in  coalition  governments 
declined.  The  Belgian  and  Dutch  CDPs  suffered 
the  worst  blows,  losing  votes  and  their  dominant 
position  up  to  the  point  where  the  three  main 
Dutch  religious  parties  had  to  create  an  intercon¬ 
fessional  new  party  (the  CDA)  in  order  to  retain 
their  relevance  in  the  political  system.  In  Austria, 
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the  OVP  lost  its  primacy  to  the  Socialist  Party  in 
the  1970s  and  never  recovered  afterward.  In 
Germany,  the  CDU-CSU,  too,  scored  below  the 
Social  Democratic  Party  (Sozialdemokratische 
Partei  Deutschlands,  SPD)  in  the  1970s;  it  recov¬ 
ered  in  the  1980s  but  lost  again  in  the  1998  elec¬ 
tions.  The  most  dramatic  decline  was  experienced 
by  Italy’s  Christian  Democrats  (Democrazia 
Cristiana,  DC),  a  party  that  had  always  been  in 
government  since  1945  as  the  first  national  party; 
it  was  reduced  to  nearly  10%  in  1994,  losing 
almost  20  percentage  points  in  2  years  (from  1992 
to  1994).  From  the  DC,  a  variety  of  parties 
emerged:  The  most  relevant  were  the  Italian 
Popular  Party  (Partito  Popolare  Italiano,  PPI),  its 
more  direct  heir,  and  the  more  moderate  Christian 
Democratic  Union  (Cristiani  Democratici  Uniti, 
CDU)  and  Christian  Democratic  Center  (Centro 
Cristiano  Democratico,  CCD),  which  then  merged 
into  the  Union  of  the  Center  (Unione  dei 
Democratici  Cristiani  e  di  Centro,  UDC). 

The  crisis  of  many  CDPs  was  due  mainly  to  the 
process  of  secularization,  on  the  one  hand,  and  the 
reemergence  of  moderate-conservative  parties,  on 
the  other,  which  in  the  first  postwar  years  had  left 
the  political  space  empty.  This  reemergence  brought 
a  new  ideological  reference — neoliberalism  and 
neoconservatism — which  contributed  to  the  mar¬ 
ginalization  of  the  two  main  elements  of  the  CDP 
ideology,  personalism  and  social  concerns. 

International  Networks 

The  evolution  of  the  international  network  of 
CDPs  illustrates  quite  well  the  recent  changes  in 
this  political  family.  Immediately  after  the  war,  in 
1947,  CDP  leaders  met  informally  and  quite  dis¬ 
creetly  in  the  so-called  Geneva  Circle.  That  was  an 
occasion  for  facilitating  the  relationship  between 
the  German  Christian  Democratic  politicians  and 
the  other  European  partners,  especially  the  French 
ones.  It  was  meant  mainly  to  promote  mutual 
understanding  among  former  rivals  and  to  support 
the  German  chancellor  Konrad  Adenauer.  Parallel 
to  this  initiative  was  the  somewhat  more  formal 
Nouvelles  Equipes  Internationales  (NEI),  which 
was  a  gathering  of  Catholic  parties  and  associa¬ 
tions.  The  NEI  worked  together  with  the  confes¬ 
sional  parties  of  the  European  Coal  and  Steel 
Community  (ECSC)  and  later  in  the  European 


Economic  Community  (EEC)  member  states.  In 
1971,  the  NEI  evolved  into  the  European  Union  of 
Christian  Democrats  (EUDC). 

This  development  of  international  networks  in 
Europe  was  paralleled  by  a  more  global  initiative 
to  bring  together  all  the  CDPs  in  the  world. 
Significantly,  the  Christian  Democratic  World 
Union  (CDWU)  was  founded  in  Chile,  in  1961, 
emphasizing  in  this  way  the  relevance  of  Latin 
American  parties  and  the  truly  international  scope 
of  the  organization.  The  golden  era  of  CDPs,  as 
already  stated,  came  to  an  end  with  the  process  of 
secularization,  with  the  Church  distancing  itself 
from  daily  politics  since  the  1970s,  and  with  the 
return  with  a  vengeance  of  moderate-conservative 
ideology  and  political  forces. 

In  the  mid-1970s  under  the  auspices  of  the 
CDU-CSU,  a  new  organization  was  created,  the 
European  Democratic  Union  (EDU),  which 
brought  together  CDPs  such  as  the  CDU-CSU  and 
the  OVP,  and  also  secular  moderate-conservative 
parties.  This  move  influenced  also  the  former 
EUDC,  which  evolved  into  the  EPP  in  1978.  The 
new  name,  not  by  chance,  did  not  include  any  ref¬ 
erence  to  religious  inspiration.  The  German  CDU- 
CSU  endeavor  to  enlarge  the  former  CDP  network 
to  include  secular  moderate-conservative  parties 
proved  successful:  The  EPP  expanded  to  become 
the  strongest  parliamentary  group  in  the  European 
Parliament  in  the  1990s.  The  tendency  to  loosen 
the  religious  connection  was  not  counteracted  by 
the  core  Catholic  parties  because  of  their  growing 
marginality.  Such  trends  affected  also  the  interna¬ 
tional  organization  Christian  Democratic  World 
Union  (CDWU),  which  in  1999  changed  its  name 
to  Christian  Democratic  and  People’s  Parties 
International. 
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Christianity 


Christianity  generally  refers  to  a  system  of  belief 
that  centers  on  the  assertion  that  Jesus  of  Nazareth 
is  Christ  the  Messiah,  the  Savior  of  all  humanity. 
It  is  embodied  in  the  historical  reality  of  commu¬ 
nities  that  spread  across  the  world  through  2  mil¬ 
lennia,  with  a  wide  variety  of  expressions  in  life 
and  liturgy.  Because  of  this  earthly  embodiment  of 
their  belief  in  human  society,  Christianity  has 
been  the  fount  of  theories  and  practices  that  are  of 
particular  interest  to  those  in  the  field  of  politics. 

Source 

The  “good  news”  that  Christianity  proclaims  is 
salvation  by  the  person  and  the  work  of  Jesus  Christ 
as  attested  in  the  scriptures.  The  New  Testament 
provides  basic  accounts  of  the  advent  of  Jesus  and 
his  subsequent  reception.  Because  Jesus  announced 
that  his  advent  was  the  fulfillment  of  the  promise  of 
salvation  foreshadowed  in  the  writings  of  the 


Israelites,  the  Old  Testament  is  also  included  in  the 
authentic  canon  of  Christianity.  The  terms  Old 
Testament  and  New  Testament  therefore  suggest  a 
Christian  perspective.  In  Christian  tradition,  the 
Old  Testament  is  read  in  anticipation  of  the  com¬ 
ing  Christ  and  the  New  Testament  in  remembrance 
of  the  crucified  and  risen  Christ. 

The  Old  Testament  contains  the  same  books 
that  Judaism  views  as  scripture,  though  different  in 
sequence  and  importance.  It  has  three  parts, 
roughly  reflecting  the  Hebrew  distinction  of  law, 
prophets,  and  other  writings.  The  New  Testament 
may  be  divided  into  two  parts.  The  first  part  has 
four  narratives  of  Jesus  from  his  birth  to  his  death 
and  resurrection  and  a  record  of  the  deeds  of  the 
apostles.  The  Gospels  according  to  Matthew, 
Mark,  Tuke,  and  John  were  written  around  the 
late  1st  century,  with  different  authorships,  sources, 
and  viewpoints,  projecting  a  polyphonic  but  coher¬ 
ent  figure  of  Jesus.  The  second  part  is  composed  of 
letters,  written  by  the  Apostle  Paul  and  other  early 
disciples,  addressed  to  the  scattered  churches  and 
believers  of  the  1st  and  2nd  centuries.  Taken 
together,  the  Bible  gives  a  view  of  the  world  and  its 
history,  beginning  from  creation  to  its  apocalyptic 
consummation. 

Judaism,  Christianity,  and  Islam  share  not  only 
part  of  their  scriptural  texts  but  also  the  concept  of 
canon.  The  word  is  derived  from  the  Greek  word 
meaning  rule  and  refers  to  a  collection  of  writings 
exclusively  demarcated  as  authoritative  to  their 
faith.  The  concept  is  missing  in  Eastern  religious 
traditions,  notably  Buddhism,  and  its  absence 
results  in  ever-increasing  bodies  of  scriptures.  The 
canon  and  the  creeds  it  produces  mutually  authen¬ 
ticate  each  other  and  serve  as  the  ultimate  guaran¬ 
tor  of  “orthodoxy,”  a  concept  unique  to  the 
Abrahamic  traditions.  The  process  of  editing  and 
defining  the  canon  took  a  number  of  politically 
charged  stages  through  several  centuries,  and  its 
different  interpretations  have  often  lent  themselves 
to  controversy. 

Founding 

Historians  agree  that  a  Jew  called  Jesus  of  Nazareth 
became  the  source  of  a  distinct  religious  movement 
that  arose  from  within  Judaism.  He  was  born  in  line 
with  the  Old  Testament  prophecies,  proclaimed  the 
advent  of  the  reign  of  God,  and  was  crucified  by 
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Pontius  Pilate  sometime  between  CE  30  and  33. 
According  to  Christian  doctrine,  Jesus  was  resur¬ 
rected  from  the  dead  and  appeared  before  disbe¬ 
lieving  disciples  for  a  short  period  before  his  ascen¬ 
sion  into  heaven.  Christianity  claims  that  this  Jesus 
was  God  incarnate.  Not  merely  was  he  the  mes¬ 
senger,  he  was  the  message.  Not  merely  did  he  give 
moral  teachings  and  exemplify  them  by  his  life  and 
death,  he  was  God  himself  in  utmost  humiliation. 
His  resurrection  is  understood  as  divine  approba¬ 
tion  and  victory  over  death.  The  incarnation,  cru¬ 
cifixion,  and  resurrection  thus  constitute  the  fun¬ 
damental  basis  of  Christian  doctrine — that  God 
came  into  this  world  to  save  humanity  from  their 
sin  and  alienation  and  restored  them  to  wholeness 
by  the  sacrificial  death  of  the  sinless  Jesus  and  that 
those  who  believe  in  him  shall  be  saved. 

The  biblical  accounts  of  this  salvific  event  gave 
rise  to  a  series  of  difficult  questions  regarding  the 
nature  of  Christ  and  his  relationship  with  God. 
The  first  few  centuries  of  Christian  history  were 
spent  in  giving  definitive  forms  to  the  faith  that 
Christians  came  to  believe  and  confess.  The  first 
Council  of  Nicaea,  convened  by  the  Roman 
Emperor  Constantine  the  Great  in  CE  325,  formu¬ 
lated  the  doctrine  of  the  Trinity  to  confirm  that 
Christ  the  Son  of  God  is  of  the  same  substance  and 
coeternal  with  God  the  Father.  God  is  three  per¬ 
sons — Father,  Son,  and  Holy  Spirit — united  in  one 
eternal  essence.  The  Council  of  Chalcedon  in  CE 
45 1  dealt  with  the  issue  of  the  nature  of  Christ  and 
declared  that  he  is  truly  divine  and  truly  human  at 
the  same  time.  These  doctrinal  formulae  are  full  of 
mystery  not  amenable  to  arithmetic  reasoning,  but 
they  are  the  end  result  of  the  critical  and  often 
controversial  reflections  on  the  faith  expressions  of 
the  early  Christians. 

The  doctrinal  decision-making  process  was  natu¬ 
rally  subject  to  concerns  that  were  not  purely  reli¬ 
gious,  but  it  would  be  misleading  to  think  of  the 
development  of  Christian  doctrines  merely  in  terms 
of  political  manipulation  and  exploitation.  The 
conspiracy  theories  often  framed  for  contemporary 
entertainment  have  no  place  in  actual  history,  for 
there  is  no  individual  or  organization  that  has  the 
power  and  authority  strong  enough  to  steer  the 
voluntary  faith  of  the  multitude  of  believers  in  a 
preferred  direction.  Theologians  and  delegates  of  all 
levels,  even  the  pope  and  ecumenical  councils,  must 
give  due  recognition  to  what  the  congregations 


actually  believe  and  practice,  albeit  critically.  The 
modern  sanctioning  of  the  Marian  dogmas  by  the 
Roman  Catholic  Church  is  a  good  example.  The 
age-old  maxim  lex  orandi,  lex  credendi  (how  we 
pray  defines  the  rule  of  faith)  epitomizes  this 
organic  and  indissoluble  correspondence  of  his¬ 
torical  faith  and  its  theoretical  formulation. 

It  is  difficult  to  identify  the  political  vision  of 
early  Christianity.  While  Jesus  was  crucified  by 
Roman  authorities  for  his  allegedly  political  asser¬ 
tion  that  he  was  “the  king  of  the  Jews,”  the  com¬ 
ing  kingdom  he  proclaimed  did  not  lay  any  imme¬ 
diate  claim  beyond  individual  ethics.  “Render  to 
Caesar  the  things  that  are  Caesar’s,  and  to  God  the 
things  that  are  God’s”  (Matthew  22:21)  well  cap¬ 
tures  his  spiritual  claim.  The  teachings  of  both 
Jesus  and  Paul  are  in  most  cases  marked  with  the 
anticipation  of  the  impending  eschaton,  and  as 
such  they  offer  little  to  the  constructive  agenda  of 
temporal  politics.  Paul’s  injunction  not  to  resist 
but  to  obey  political  authorities  (Romans  13:1-7) 
should  also  be  understood  within  this  context  of 
imminent  eschatology.  Withdrawal  from  the  world 
was  the  guiding  principle  for  the  early  Christians. 

History 

For  the  sake  of  brevity,  an  overview  of  Christian 
history  is  presented  here  in  three  stages  that  com¬ 
bine  epochs  and  geography.  The  first  stage  is  called 
the  “Pan-Mediterranean”  era,  for  this  is  the  period 
in  which  Christianity  spread  along  the  European 
and  African  shorelines  of  the  Mediterranean  Sea. 
The  axiomatic  question  of  this  period  pertains  to 
their  self-definition:  “What  is  Christianity?”  Using 
the  wisdom  of  Greek  philosophy  and  Roman  law, 
Christianity  defined  orthodoxy  by  councils  and 
dogmas  and  organized  the  church  and  its  ruling 
hierarchy.  After  the  imperial  decision  to  employ 
Christianity  as  the  state  religion,  it  began  to  exer¬ 
cise  influence  in  all  realms  of  human  life  and 
extended  over  the  European  continent  to  form  a 
gigantic  compound  of  religion  and  culture  called 
corpus  Christianum.  The  early  principle  of  with¬ 
drawal  was  replaced  with  that  of  synthesis,  mutual 
interference,  and  even  domination.  The  11th  cen¬ 
tury  saw  the  official  severance  of  the  Eastern 
Church  from  the  Western,  but  both  traditions 
developed  elaborate  systems  to  balance  the  secular 
and  ecclesiastical  powers. 
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The  second  stage  is  the  “Pan-Atlantic”  era.  A 
major  split  caused  by  the  Protestant  Reformation 
in  the  16th  century  made  it  clear  that  there  could 
be  a  plurality  of  faith  in  geographical  proximity. 
For  Christianity,  this  meant  the  task  of  designing  a 
civil  society  in  which  religious  adherence  was  not 
necessarily  defined  by  region  or  clan.  The  para¬ 
digm  shift  from  parish  to  free  church  was  gradual 
on  the  European  continent,  but  a  small  number  of 
English-speaking  Puritans  took  an  epoch-making 
step  by  sailing  the  Atlantic  sea  to  the  New  World 
to  build  a  civil  society  in  accordance  with  their 
own  choice  of  religion.  The  Christian  world 
evolved  on  both  sides  of  the  Atlantic,  and  their 
common  question  was  “ How  should  Christianity 
be  woven  into  a  modern  civil  society?”  Medieval 
synthesis  was  taken  apart,  and  new  models  of  rela¬ 
tionship  between  church  and  state  were  explored. 
This  era  coincided  with  the  Enlightenment,  and 
issues  of  human  rights  began  to  command  atten¬ 
tion  through  repeated  experiences  of  persecution 
and  oppression,  providing  the  basis  for  the  claims 
of  religious  toleration  and  other  basic  rights 
including  the  freedom  of  speech,  press,  assembly, 
and  voluntary  association.  The  modern  vision  of 
Christian  politics  has  been  reshaped  accordingly, 
taking  into  account  the  distinction  between  the 
private  and  public  spheres  of  individual  life. 

Christianity  in  the  third  stage  moves  its  geo¬ 
graphical  center  further  West  to  represent  the 
“Pan-Pacific”  era.  Though  a  thread  of  early  mis¬ 
sions  had  reached  India  and  China  in  the  first  era, 
Asia  came  in  contact  with  Christianity  primarily 
via  missionaries  from  America  in  the  19th  and 
20th  centuries.  In  turn,  it  was  in  Asia  that 
Christianity  encountered  radically  different  forms 
of  religion  for  the  first  time.  Facing  traditions  such 
as  Hinduism  and  Buddhism,  the  predominant 
question  now  asked  was  “  Why  Christianity  among 
other  alternatives?”  The  presence  of  other  religions 
relativizes  the  internal  distinction  of  Catholic  and 
Protestant,  Western  and  Eastern  Churches.  As  the 
vanguard  of  Christian  history,  Asia  and  the  Pacific 
now  share  with  the  United  States  a  leading  role  in 
the  development  of  global  Christianity.  American 
Christianity  remains  exceptionally  active  in  com¬ 
parison  with  “post-Christian”  Europe,  where  it  is 
more  a  cultural  heritage  than  a  lived  reality.  In 
regions  where  Christianity  shows  a  strong  presence 
today,  the  surge  of  nationalism  and  ethnocentrism 


supported  by  conservative  Christians  has  gained 
the  attention  of  political  scientists. 

Contemporary  Outlook 

Statistics  show  that  Christianity  is  no  longer  a  reli¬ 
gion  of  the  West.  Christians  living  in  Europe  and 
North  America  now  account  for  a  mere  third  of 
the  world  Christian  population.  Among  other  reli¬ 
gions,  Christianity  continues  to  have  the  largest 
number  of  adherents,  with  Islam  having  the  second 
greatest  number.  Within  the  Christian  tradition, 
the  Roman  Catholic  Church  is  by  far  the  largest 
body,  with  more  than  1  billion  members.  The 
Eastern  tradition  is  divided  by  several  patriarchs 
including  Greek,  Russian,  and  other  Oriental 
Churches.  The  Episcopal  Church  originated  in 
England  but  now  has  the  worldwide  Anglican 
Communion,  in  which  the  English  Church  is  in 
fact  a  minority.  Among  the  Protestants,  Pres¬ 
byterian,  Methodist,  Baptist,  Lutheran,  and  other 
denominations  have  often  been  named  “main¬ 
line,”  but  in  most  parts  of  the  world  they  are  rap¬ 
idly  yielding  to  the  fast-growing  Pentecostal  or 
Charismatic  movement.  This  movement  empha¬ 
sizes  direct  and  personal  revelation  through  the 
Holy  Spirit,  with  enthusiastic  and  often  emotional 
affirmation  of  miraculous  phenomena  in  efforts  to 
imitate  the  fervency  of  the  early  church. 

The  church  is  a  human  and  earthly  entity  insti¬ 
tuted  for  the  transmission  and  promulgation  of  the 
original  revelatory  content  that  is  by  nature  diffi¬ 
cult  to  institutionalize.  As  long  as  Christianity  tries 
to  remain  faithful  to  its  contradictory  mandate, 
there  will  always  be  oscillation  between  institu¬ 
tionalism  and  the  free  spirit.  The  cycle  of  forma¬ 
tion  and  reformation  is  a  hallmark  of  life. 

Anri  Morimoto 
International  Christian  University 
Tokyo,  Japan 

See  also  Church-State  Relationships;  Religious 
Movements;  Theocracy;  Tolerance 

Further  Readings 

Barrett,  D.  B.,  Kurian,  G.  T.,  &  Johnson,  T.  M.  (Eds.). 
(2001).  World  Christian  encyclopedia  (2nd  ed.).  New 
York:  Oxford  University  Press. 


Church-State  Relationships  235 


Johnson,  P.  (1976).  A  history  of  Christianity.  London: 
Penguin. 

McGrath,  A.  E.  (Ed.).  (1993).  The  Blackwell 

encyclopedia  of  modern  Christian  thought.  Oxford, 
UK:  Blackwell. 

Niebuhr,  R.  (1953).  The  children  of  light  and  the  children 
of  darkness.  New  York:  Scribner’s. 

Troeltsch,  E.  (1992).  The  social  teaching  of  the  Christian 
churches  (O.  Wyon,  Trans.;  Vols.  1-2).  Louisville, 

KY:  Westminster  John  Knox  Press.  (Original  work 
published  1912) 


Church-State  Relationships 


Church-state  relationships  often  produce  compli¬ 
cated  political  and  constitutional  issues  even  in 
advanced,  consolidated  democracies.  Despite  the 
expectations  of  the  modernization  theory  that 
predicted  a  decline  in  the  political  importance  of 
religion  with  the  rise  of  modern  industrial  society, 
current  cross-national  studies  indicate  that  the 
relations  between  church  and  state  or  the  question 
of  secularism  are  still  major  issues  in  many  societ¬ 
ies.  Even  in  countries  where  a  basic  consensus 
exists  on  these  relationships,  religiously  driven 
moral  issues,  such  as  divorce,  abortion,  homo¬ 
sexuality,  euthanasia,  school  prayer,  and  public 
funding  of  church  schools  and  other  religious 
institutions,  are  still  hotly  debated.  This  entry 
examines  variations  among  Western  democracies 
in  their  approach  to  secularism  as  well  as  the  more 
complicated  problems  in  the  Islamic  world. 

The  common  brief  definition  of  secularism  is 
the  separation  of  state  affairs  and  religion.  It  is  also 
commonly  accepted  that  this  implies  the  absence 
of  an  official  (or  state)  religion  (established  church); 
freedom  of  religion  and  conscience  for  all  citizens 
regardless  of  religion  and  sect;  full  equality  before 
the  law  and  access  to  public  office,  again  regard¬ 
less  of  religion  and  sect;  the  absence  of  a  legal 
requirement  for  state  acts  to  conform  to  religious 
rules  and  injunctions;  and  the  separation  of  institu¬ 
tions  performing  religious  services  and  public 
(governmental)  activities. 

Not  all  Western  democracies  meet  all  these  con¬ 
ditions.  In  England,  the  king  (or  the  queen)  is  the 
head  of  the  Anglican  Church.  The  Norwegian 
Constitution  states  that 


the  Evangelical-Lutheran  religion  shall  remain 
the  official  religion  of  the  State.  The  inhabitants 
professing  it  are  bound  to  bring  up  their  children 
in  the  same.  (Article  2) 

The  King  shall  at  all  times  profess  the  Evangelical- 
Lutheran  religion,  and  uphold  and  protect  the 
same.  (Article  4) 

More  than  half  the  number  of  the  Council  of 
State  [i.e.,  Council  of  the  Ministers]  shall  profess 
the  official  religion  of  the  State.  (Article  12) 

Under  the  Danish  Constitution, 

the  Evangelical-Lutheran  Church  shall  be  the 
established  Church  of  Denmark  and  as  such  shall 
be  supported  by  the  State.  (Article  4) 

The  King  shall  be  a  member  of  the  Evangelical- 
Lutheran  Church.  (Article  6) 

The  Greek  Constitution  of  1975  states  that 

the  prevailing  religion  in  Greece  is  that  of  the 
Eastern  Orthodox  Church.  . .  .  The  text  of  the 
Holy  Scripture  shall  be  maintained  unaltered. 
Official  translation  of  the  text  into  any  other 
form  of  language,  without  prior  sanction  by  the 
Autocephalus  Church  of  Greece  and  the  Great 
Church  of  Christ  in  Constantinople,  is  prohibited. 
(Article  3) 

The  Greek  constitution  also  prohibits  proselytism 
(Article  13  [2])  and  provides  for  a  special  regime 
for  Aghion  Oros  (Mount  Athos),  under  which  the 
region  is  governed  by  its  20  Holy  Monasteries,  as 
a  self-governing  part  of  the  Greek  state  (Article 
105). 

The  preamble  of  the  Irish  Constitution  of  1999 
declares  that  the  Constitution  was  adopted 

in  the  name  of  the  Most  Holy  Trinity,  from 
Whom  is  all  authority  and  to  Whom,  as  our  final 
end,  all  actions  both  of  men  and  States  must  be 
referred  .  .  .  [and]  humbly  acknowledging  all  our 
obligations  to  our  Divine  Lord,  Jesus  Christ,  who 
sustained  our  fathers  through  centuries  of  trial. 

The  1997  Polish  Constitution  contains  the 
following: 
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all  citizens  of  the  Republic,  both  those  who 
believe  in  God  as  the  source  of  truth,  justice, 
good,  and  beauty,  as  well  as  those  not  sharing 
such  faith  but  respecting  those  universal  values 
as  arising  from  other  sources. 

It  also  refers  to  “the  Christian  heritage  of  the 
Nation”  and  “our  responsibility  before  God.” 

The  First  Amendment  to  the  U.S.  Constitution 
states  that  “  Congress  shall  make  no  law  respecting 
an  establishment  of  religion,  or  prohibiting  the  free 
exercise  thereof.”  The  U.S.  Supreme  Court  has 
issued  many  decisions  that  define  the  scope  of  this 
constitutional  provision,  known  as  the  Establish¬ 
ment  Clause. 

Despite  these  seemingly  significant  deviations 
from  the  conventional  meaning  of  secularism  in 
the  constitutional  formulas,  it  is  clear  that  religion 
no  longer  plays  a  determining  role  in  the  public  life 
of  the  advanced  industrial  democracies.  In  virtu¬ 
ally  all  these  societies,  the  principle  of  equality 
before  the  law  is  firmly  established,  discrimination 
on  the  basis  of  religion  is  forbidden,  full  freedom 
of  religion  and  conscience  is  recognized  for  all  reli¬ 
gions  and  sects  as  well  as  for  nonbelievers,  and  the 
legitimacy  of  public  (governmental)  acts  do  not 
normally  derive  from  conformity  to  religious  rules 
and  injunctions.  This  seems  to  be  the  irreducible 
core  meaning  of  secularism  and  also  an  indispens¬ 
able  prerequisite  of  democratic  government. 

Beneath  these  common  core  elements,  however, 
one  can  observe  two  different  conceptions  of,  and 
approaches  to,  secularism  in  Western  democracies. 
One,  which  may  be  termed  passive  secularism, 
essentially  means  a  state  neutral  with  respect  to,  or 
equidistant  from,  all  religions  and  sects  without 
precluding  the  public  visibility  of  religion. 
“Assertive”  or  “combative”  secularism  (laicite  de 
combat ),  on  the  other  hand,  aims  at  eliminating 
religion  from  the  public  sphere  and  confining  it  to 
the  private  domain,  that  is,  to  the  places  of  wor¬ 
ship  and  the  consciences  of  the  individuals. 

Most  Western  democracies  adhere  to  some  ver¬ 
sion  of  the  passive  secularism  model,  while  France 
and  Turkey  are  the  chief  protagonists  of  the  asser¬ 
tive  secularism  approach.  The  difference  between 
the  two  approaches  can  be  explained  by  different 
patterns  of  political  development.  The  critical  con¬ 
dition  that  affects  the  emergence  of  one  or  the 
other  model,  as  Ahmet  Kuru  argues,  is  the  absence 


or  existence  of  an  ancien  regime  that  combines 
monarchy  with  a  hegemonic  religion.  The  emer¬ 
gence  of  assertive  secularism  in  France  and  Turkey 
seems  to  be  a  response  to  such  a  combination  of 
historical  conditions.  The  assertive  character  is 
much  more  marked  in  Turkey  because,  in  contrast 
to  the  strict  separation  of  religion  and  state  in 
France,  the  Turkish  state  maintains  a  monopoly 
over  religious  services  and  religious  education. 
Therefore,  Turkey  provides  an  example  not  of 
strict  separation  but  of  a  system  where  the  indi¬ 
vidualized  version  of  religion  is  at  the  service  and 
under  the  control  of  the  state. 

The  situation  in  the  Muslim  world  is  more  com¬ 
plicated.  It  is  a  commonplace  to  argue  that,  as 
opposed  to  Christianity,  Islam  assumes  the  unity 
of  state  and  religion  and,  therefore,  is  inherently 
incompatible  with  secularism;  and  to  the  extent 
that  secularism  is  a  sine  qua  non  for  democracy, 
this  also  means  incompatibility  with  liberal  democ¬ 
racy.  Indeed,  many  Muslims  see  secularism  as  a 
creation  of  Christendom. 

The  picture  is  more  complicated  and  nuanced, 
however,  than  what  these  extreme  positions  sug¬ 
gest.  While  maintaining  the  unity  of  state  and 
religion  in  principle,  Islam  provides  almost  no 
rules  in  the  field  of  public  law.  Even  the  major 
Islamic  political  institution,  the  Caliphate,  is  not 
based  on  the  Koran  and  Sunnah  (the  Prophet’s  say¬ 
ings  and  actions)  but  on  the  consensus  of  believers 
liana)  after  the  death  of  the  Prophet,  and  it  ceased 
to  exist  with  its  abolition  by  the  Turkish  parlia¬ 
ment  in  1924.  The  Islamic  law  (sharia)  covers 
mostly  private  law  relations  (notably,  personal 
status,  marriage,  divorce,  inheritance)  and  pro¬ 
vides  some  rules  in  the  fields  of  criminal  law  and 
procedural  law.  Therefore,  Islam  has  been  consid¬ 
ered  compatible  with  very  diverse  forms  of  govern¬ 
ment  in  the  past  and  at  present.  Many  Muslim 
countries  have  secularized  their  legal  systems  to 
some  extent,  usually  leaving  areas  such  as  family 
and  inheritance  laws  subject  to  the  Islamic  rules. 
Again,  Turkey  is  one  of  the  very  few  states  with  a 
Muslim  majority  having  a  totally  secularized  legal 
system. 

In  addition,  there  remains  a  considerable  differ¬ 
ence  between  Muslim  countries  and  Western 
democracies  in  their  approaches  to  secularism. 
Even  in  the  constitutions  of  most  secularized  Arab 
countries,  it  is  stipulated  that  Islam  is  the  religion 
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of  the  state,  the  chief  of  the  state  has  to  be  a 
Muslim,  and  the  sharia  is  a  major  source  of  legisla¬ 
tion.  Further,  certain  rules  of  the  sharia,  such  as 
those  pertaining  to  inequality  between  genders, 
inequality  between  Muslims  and  non-Muslims,  the 
prohibition  of  apostasy,  and  a  number  of  rules  in 
the  field  of  criminal  law  are  clearly  incompatible 
with  liberal  democratic  norms. 

When  it  comes  to  popular  attitudes  toward 
democracy,  however,  there  does  not  seem  to  exist 
a  major  difference  between  Western  and  Muslim 
societies,  contradicting  Samuel  Huntington’s 
(1996)  clash-of-civilizations  thesis.  Pippa  Norris 
and  Ronald  Inglehart  (2004),  in  their  worldwide 
survey  including  11  societies  with  a  Muslim 
majority,  have  concluded  that  as  regards  demo¬ 
cratic  values  (support  for  democratic  ideals, 
evaluations  of  how  well  democracy  works  in 
practice,  and  disapproval  of  strong  leaders),  there 
is  minimal  difference  between  the  Muslim  world 
and  the  West.  According  to  their  findings,  the 
chief  cultural  difference  between  the  West  and  the 
Muslim  world  concerns  not  political  values  but 
social  values  toward  gender  equality  and  sexual 
liberalization. 

In  cases  of  many  advanced  democracies,  a  rea¬ 
sonable  degree  of  consensus  over  the  place  of  reli¬ 
gion  in  public  life  seems  to  have  been  constructed 
after  long  periods  of  public  debate  and  negotia¬ 
tions.  Even  after  such  a  consensus  is  crafted,  how¬ 
ever,  certain  religiously  inspired  issues  are  still  on 
the  political  agenda.  Indeed,  neither  the  passive 
nor  the  assertive  approaches  are  without  prob¬ 
lems.  Passive  secularism  presupposes  the  existence 
of  a  broad  consensus,  which  is  difficult  to  main¬ 
tain  in  countries  with  believers  of  many  religions 
as  well  as  nonbelievers.  Assertive  secularism,  on 
the  other  hand,  is  prone  to  lead  to  conflicts 
between  secularists  and  believers.  Secularists  will 
be  suspicious  of  believers’  commitment  to  a  secu¬ 
lar  democratic  system,  and  believers  will  react  to 
secularists’  efforts  to  eliminate  religion  from  the 
public  sphere. 

Perhaps  the  key  to  the  dilemma  is  the  construc¬ 
tion  of  “twin  tolerations”  (see  Stepan,  2001,  pp. 
213-253),  as  the  system  where  religious  groups  are 
allowed  to  organize  and  propagate  their  views  in 
civil  society  and  political  society  but  refrain  from 
imposing  their  views  on  the  other  members  of  soci¬ 
ety,  respecting  democratic  rules  and  procedures, 


and  where  this  boundary  line  is  carefully  guarded 
and  monitored  by  the  state. 

Ergun  Ozbudun 
Bilkent  University 
Anakara,  Turkey 
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The  concept  of  citizenship  has  a  great  elasticity  of 
meaning  because  of  its  varied  spatial  and  func¬ 
tional  dimensions  (e.g.,  we  speak  of  family  citizen¬ 
ship,  state-national  citizenship,  cosmopolitan  citi¬ 
zenship).  The  notion  is  all  the  more  difficult  to  use 
as  it  refers  to  very  different  historical  experiences, 
ranging  from  the  civil  makeup  of  ancient  Rome  to 
the  contemporary  experience  of  postnational  citi¬ 
zenship  in  Europe,  including  numerous  liberal, 
republican,  and  communitarian  declensions  from 
this  membership.  Another  difficulty  lies  within  an 
ethical  framework  that  is  an  essential  part  of  the 
scientific  debate  on  citizenship.  A  few  years  ago, 
issues  related  to  the  ethical  implications  of  citizen¬ 
ship  led  certain  theoreticians  to  believe  that  the 
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notion  had  become  theoretically  ineffective  in 
social  and  political  sciences.  Although  such  an 
opinion  may  be  somewhat  excessive,  it  highlights 
the  need  for  a  rigorous  definition  of  the  concept.  In 
this  entry,  citizenship  is  defined  as  (a)  a  juridical 
status  of  membership  in  a  political  community, 
(b)  a  status  that  is  the  condition  of  the  political 
participation  of  citizens  in  the  democratic  func¬ 
tioning  of  the  community,  and  (c)  a  status  that 
endows  individuals  with  a  sense  of  citizenship  that 
differs  from  their  other  social  and  cultural  identi¬ 
ties.  These  three  basic  elements  of  citizenship  are 
discussed  in  the  first  part  of  the  entry;  then,  recent 
transformations  of  modern  civic  configurations 
are  examined. 

Basic  Elements  of  Citizenship 

Three  basic  elements  are  necessary  to  consolidate 
the  notion  of  citizenship  and  to  outline  distinctive 
characteristics  that  make  using  the  term  heuristi- 
cally  possible  in  political  science. 

Citizenship  as  a  Juridical  Status  of  Membership 

The  first  element  identifies  the  juridical  founda¬ 
tion  of  citizenship  as  a  status  of  membership  in  a 
political  entity  (often  national  but  not  exclusively 
so)  that  is  juridically  codified.  In  this  regard,  citi¬ 
zenship  is  often  linked  to  nationality,  which  entails 
the  recognition  of  rights  and  obligations  associated 
with  this  status;  in  most  nation-states,  this  criterion 
of  nationality  is  still  the  condition  for  inclusion  in 
the  citizen  community.  With  the  development  of 
migration  trends  and  with  the  diversification  in  the 
scales  of  citizenship,  the  residency  criterion  is  also 
sometimes  taken  into  account  to  facilitate  the  civic 
integration  of  those  who  live  in  a  nation  or  other 
juridically  defined  territory  without  full  member¬ 
ship  as  a  citizen  of  that  nation-state  or  other 
group.  This  juridical  foundation  is  at  the  center  of 
the  sociological  debate  put  forward  by  Thomas 
Marshall  when,  during  his  famous  conference  in 
Cambridge  in  1949  (see  Gershon  Shafir,  1998, 
chap.  6),  he  made  citizenship  a  federative  concept 
for  political  science.  For  Marshall,  who  was  par¬ 
ticularly  concerned  about  the  coherence  of  an 
English  society  threatened  by  class  divisions,  the 
interest  in  this  concept  lies  in  the  ability  of  the 
juridical  status  to  guarantee  equal  rights  to  all 


citizens.  In  the  now  established  fashion,  Marshall 
characterizes  citizenship  as  consisting  of  three 
components:  (1)  the  civil  element,  that  is,  individ¬ 
ual  rights  such  as  freedom  of  the  person,  freedom 
of  speech,  freedom  of  thought,  freedom  of  belief, 
right  of  ownership,  right  to  sign  contracts,  right  to 
a  fair  legal  system,  and  so  on;  (2)  the  political  ele¬ 
ment,  including  the  right  to  participate  in  political 
activity,  right  to  run  for  office,  right  to  vote,  right 
to  petition,  and  so  on;  and  (3)  the  social  element, 
mainly  through  the  free  and  equal  access  to  its 
social  welfare  system.  Heavily  influenced  by  the 
British  historical  experience,  this  liberal  and  evolu¬ 
tionary  model  of  citizenship  still  reflects  an  interest 
in  putting  the  status  of  citizenship  into  a  historical 
perspective  that  includes  the  birth  of  the  constitu¬ 
tional  state,  the  advent  of  universal  suffrage,  and 
the  birth  of  the  welfare  state,  all  of  which  encour¬ 
aged  the  progressive  expansion  of  such  a  juridi¬ 
cally  codified  social  role.  Here,  citizenship  is  a  type 
of  indicator  of  political  modernity.  This  perspec¬ 
tive  on  citizenship,  contested  by  many  critics  of 
Marshall,  especially  Albert  Hirschmann,  is  based 
on  the  evolution  of  Western  societies  in  which 
these  societies  moved  away  from  feudalism  and  its 
system  of  discriminatory  orders.  Since  the  intro¬ 
duction  of  this  argument,  many  works  have  com¬ 
pleted  this  model  by  including  new  elements;  in 
particular,  scholars  such  as  Will  Kymlicka  (1995) 
have  added  an  important  cultural  dimension  that 
establishes  the  notion  of  “multicultural  citizen¬ 
ship,”  which  is  intended  to  take  into  account  cul¬ 
tural,  linguistic,  or  religious  diversity.  However,  in 
American  Citizenship ,  Judith  Shklar  (1991)  shows 
that  the  symbolic  possession  of  citizenship  status  is 
often  just  as  important  and  sometimes  even  more 
fundamental  than  the  actual  practice  of  rights 
attached  to  it.  Taking  the  example  of  American 
citizenship,  for  a  long  time  denied  to  those  of 
African  ancestry  and  linked  to  the  practice  of  slav¬ 
ery  until  the  passage  of  the  Thirteenth  Amendment 
in  1865,  Shklar  argues  that  for  many  who  have 
benefited  from  the  fight  for  civil  rights,  what  is 
important  is  not  only  the  regular  practice  of  rights 
as  the  recognition  of  a  juridical  and  a  political  sta¬ 
tus  that  give  a  minimum  of  social  dignity  but  also 
this  symbolic  recognition. 

This  first  element  of  citizenship  gives  rise  to  a 
number  of  important  debates  and  scientific  studies 
in  political  science,  including  current  discussions 
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surrounding  the  dissociation  between  citizenship 
and  nationality.  Whether  one  considers  a  postna¬ 
tional  model  of  civic  membership,  as  Rainer 
Baubock  or  Yasemin  Nuhoglu  Soysal  suggest  (see 
Shafir,  1998,  chap.  10),  or  analyzes  the  partial 
denationalization  caused  by  the  emergence  of 
European  citizenship  in  1992,  as  described  by  Yves 
Deloye  (2008),  the  debate  is  centered  on  the  perti¬ 
nent  degree  of  the  social  closure  of  the  limits  of 
citizenship  and  the  ability  today  of  this  status  to 
match  juridically  the  open  conception  of  human 
rights  that  now  contrasts  with  the  former  closed- 
minded  conception  of  national  rights. 

Citizenship  as  a  Foundation 
of  Democratic  Practice 

The  second  element  of  citizenship  is  character¬ 
ized  by  the  philosophical  and  historical  link 
between  this  membership  and  the  workings  of 
political  democracy.  Because  this  link  marks  the 
advent  of  a  new  way  of  legitimizing  power,  which 
is  completely  dissociated  from  any  aristocratic  or 
theocratic  perspective,  citizenship  entails  a  division 
of  political  work  between  the  government  and  the 
governed,  which  has  made  the  idea  of  a  change  in 
power  the  very  principle  of  representative  democ¬ 
racies.  Political  science  literature  usually  compares 
two  stances  on  this  issue.  In  an  elitist  version, 
defended  in  the  past  by  Gaetano  Mosca,  Joseph 
Schumpter,  and  more  recently  by  Samuel 
Huntington,  the  main  idea  is  to  protect  democracy 
from  excessive  interference  by  citizens,  because 
civic  apathy  is  seen  as  functional  for  the  demo¬ 
cratic  political  system,  helping  it  avoid  “govern- 
ability  crises”  such  as  those  encountered  in  Western 
democracies  in  the  1970s.  Conversely,  the  partici¬ 
patory  version  advocates  a  strong  involvement  (in 
the  sense  in  which  Benjamin  Barber  uses  this  term) 
of  citizens  in  the  public  sphere.  The  advocates  of 
this  opinion  are  therefore  inclined  to  increase  acts 
of  citizenship.  In  using  this  term,  some  theoreti¬ 
cians  have  recently  advocated  a  profound  transfor¬ 
mation  of  the  studies  dedicated  to  citizenship. 
They  leave  behind  the  perspective  of  citizenship  as 
the  possession  of  a  membership  status  representing 
the  modern  civil  freedoms  guaranteed  by  the  state, 
which  in  return  demands  the  loyalty  of  its  citizens, 
and  instead  put  forward  the  practice  of  civic  free¬ 
dom  and  the  repertoire  of  political  expression 


(e.g.,  flash  mobs,  consumer  awareness,  hijacking 
political  symbols  through  art),  which  play  a  role  in 
making  citizens  the  first  actors  of  a  citizenship  that 
escapes  the  state’s  control  and  sometimes  even 
questions  it.  The  reversal  of  this  perspective  from 
top  to  bottom  cannot  be  mentioned  without  refer¬ 
ring  to  the  old  dialogue  between  two  antagonistic 
civic  conceptions  observed  by  the  English  historian 
John  Pocock  (see  Shafir,  1998,  chap.  2).  According 
to  Pocock,  two  ideal  conceptions  of  citizenship 
have  coexisted  since  antiquity.  The  first,  expressed 
by  Aristotle,  emphasizes  the  civic  dimension  and 
defines  citizenship  as  a  mode  of  activity  specific  to 
humanity;  the  second  conception,  outlined  in  the 
work  of  the  Roman  lawyer  Gai'us,  sees  citizenship 
above  all  as  a  specific  legal  status  and  one  that 
values  its  civil  dimension.  Beyond  this  founding 
opposition,  the  contemporary  analysis  tends  to 
emphasize  the  fact  that  an  “act  of  citizenship” 
questions  the  nature  of  “civic  virtues”  expected 
from  citizens.  Usually  when  citizenship  is  dis¬ 
cussed,  this  questioning  is  related  to  the  normative 
position  of  those  who  participate  in  the  debate. 
The  argument  between  liberals  and  communitari¬ 
ans  is  largely  influential  in  political  science  and  in 
other  disciplines.  Following  John  Rawls,  liberals 
view  citizens  as  individual  rational  agents  rather 
than  as  members  of  a  community  that  helps  shape 
their  values.  Conversely,  the  communitarian  cri¬ 
tique  (as  presented  by  Michael  Walzer  and  Alasdair 
MacIntyre)  claims  that  the  status  of  citizen  presup¬ 
poses  an  engagement  in  the  public  sphere,  a  sphere 
that  constitutes  the  cement  of  the  community  and 
the  solidarity  that  characterizes  it.  The  topic  of  this 
intense  debate,  which  borders  on  normative  politi¬ 
cal  philosophy  and  political  science,  points  to  the 
third  element  of  citizenship:  the  impact  on  identity. 

The  Impact  of  Citizenship  on  Identity 

This  third  element  of  citizenship  emphasizes  the 
principle  of  separation  between  citizen  membership 
and  other  social  memberships — rival  sources  of 
identification.  As  Jean  Leca  (1990)  indicates  in  his 
classic  text,  citizenship  constitutes  “a  civil  society 
distinct  from  family,  lineage  and  seigneurial  com¬ 
munities”  (p.  148).  If  the  model  of  the  Western 
medieval  city  analyzed  by  Max  Weber  is  often 
referred  to  here  as  the  paradigm  of  this  necessary 
differentiation  between  civic  identity  and  corporate 
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or  religious  identities,  it  is,  however,  with  the 
political  experience  of  liberal  revolutions  of  the 
18th  century  that  this  illustration  of  modern  citi¬ 
zenship  develops.  It  is  then  associated  with  two 
fundamental  historical  movements:  (1)  individua¬ 
tion,  which  contributes  to  strongly  linking  indi¬ 
vidualism  to  citizenship,  to  the  extent  that  the 
individual  and  the  modern  citizen  merge  in  the 
Declaration  of  the  Rights  of  Man  and  of 
the  Citizen  of  1789,  and  (2)  the  universalization  of 
rights,  which  makes  citizenship  all  the  more  inclu¬ 
sive.  By  causing  a  reorganization  of  social  and 
religious  identities  between  the  public  and  private 
spheres,  citizenship  demonstrates  the  establish¬ 
ment  of  a  social  formation  where  the  combination 
of  status  and  affiliations  to  primary  groups  (fam¬ 
ily,  ethnic,  religious,  etc.)  is  compatible  with  the 
promotion  of  an  allegiance  to  a  political  commu¬ 
nity.  As  the  classic  sociologists  Emile  Durkheim 
and  Georg  Simmel  had  established,  the  system  of 
division  from  which  these  civic  allegiances  are  cre¬ 
ated  favors  the  emancipation  of  the  individual  and 
the  promotion  of  civil  liberty.  The  classic  works  of 
Reinhard  Bendix  {Nation-Building  and  Citizenship, 
1977)  and  Ernest  Gellner  ( Culture ,  Identity  and 
Politics,  1987)  show  that  this  civic  emancipation  is 
often  in  harmony  with  the  affirmation  of  the 
nation-state,  whose  cultural  cohesion  is  linked  to 
the  ability  of  citizenship  to  evoke  in  individuals  a 
strong  sense  of  identification  to  its  culture  and  to 
its  political  vision.  The  comparative  historical  soci¬ 
ology  of  the  nation-state,  as  described  by  Deloye 
(2008)  and  Juan  Linz  (1993),  claims,  however, 
that  this  cultural  homogeneity  is  often  questioned 
by  the  populations  and  therefore  depends  on  the 
level  of  success  in  establishing  political  and  social 
identities.  One  must  be  particularly  aware  of  the 
diversity  of  historical  paths  in  building  the  nation¬ 
state.  As  Linz  indicates,  the  process  of  nation 
building — which  aims  to  historically  complete  the 
process  of  state  building  and  to  develop  among  its 
citizens  a  strong  subjective  sense  of  belonging  to 
the  same  political  and  cultural  community — is 
often  in  the  end  thwarted  by  demands  for  cultural, 
religious,  or  linguistic  recognition.  Such  demands 
make  it  politically  very  costly  for  a  republican-type 
homogeneous  citizenship  to  emerge,  capable  of 
making  peripheral  or  social  identities  invisible — 
identities  that  individuals  hold  on  to  because  of 
their  primary  socialization.  Taking  this  historical 


fact  into  account,  Linz  puts  forward  a  conceptual 
reflection  useful  for  an  analysis  of  citizenship: 
Establishing  the  exceptional  character  of  the 
nation-state  (with  the  exception  of  the  successful 
model  of  the  Lrench  state  and  its  model  of  repub¬ 
lican  citizenship),  he  encourages  us  to  abandon 
“the  idea  according  to  which  every  state  should  try 
to  be  a  nation-state”  in  order  to  “turn  to  methods 
of  state  [and  civic]  integration  rather  than  those 
based  on  national  construction”  (Linz,  1993,  pp. 
356,  365).  At  the  heart  of  this  conceptual  shift — 
which  leads  him  to  prefer  the  notion  of  “state- 
nation”  to  “nation-state” — is  a  whole  series  of 
essential  questions  examined  by  political  theory 
and  by  the  comparative  approach  to  citizenship: 
the  recognition  of  cultural  diversity  in  a  liberal 
democracy,  the  development  of  “multicultural  citi¬ 
zenship”  models,  the  political  construction  and 
political  validity  of  multinational  states  founded 
on  the  plurality  of  civic  and  cultural  methods  of 
identification  (e.g.,  in  Canada,  India,  Belgium),  the 
“postnational”  future  of  contemporary  societies, 
and  the  transformations  of  the  different  types  of 
political  patriotism  and  allegiance  associated  with 
models  of  citizenship.  The  common  factor  here  is 
the  insistence  that  the  observer  distance  himself 
from  the  abstract  notion  of  citizenship,  that  of 
“man  born  again”  (a  favorite  expression  of  many 
Lrench  revolutionaries  of  1789),  without  the  pri¬ 
mary  social  or  cultural  characteristics,  to  recognize 
the  man  who  is  embedded,  enclosed  in  a  series  of 
social  identities  that  help  shape  his  behavior  in 
public  and  make  him  feel  like  a  “citizen.” 

This  shift  links  up  with  the  communitarian  and 
feminist  critique  formulated  in  contrast  to  a  uni- 
versalistic  conception  of  citizenship  linked  to  an 
“ideal”  separation  between  the  public  and  the  pri¬ 
vate  sphere.  Lor  the  communitarian  critique  of 
liberalism,  the  issue  is  the  atomized  conception  of 
“Oneself,”  with  no  true  meaning,  or  historical  and 
empirical  contingency,  which  is  capable  of  think¬ 
ing  in  an  independent  and  rational  way  and  capa¬ 
ble  of  implementing  civil  liberties,  which  are  recog¬ 
nized  by  one’s  status  as  citizen  in  a  purely  autono¬ 
mous  way.  Contrary  to  this  definition,  which 
Isaiah  Berlin  strongly  defends  in  his  Four  Essays  on 
Liberty  (1969),  the  communitarian  theoreticians 
developed  the  hypothesis  of  “Oneself,”  inevitably 
“embedded”  and  dependent  on  his  common  envi¬ 
ronment.  This  “communitarian  constitution  of 
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Oneself,”  to  borrow  Charles  Taylor’s  terms,  leads 
the  communitarian  critique  of  liberal  citizenship  to 
emphasize  the  fact  that  the  political  and  civic  iden¬ 
tity  of  citizens  is  constituted  on  the  basis  of  an 
attachment  to  one  or  several  communities  of  refer¬ 
ence  that  noticeably  reduces  the  degree  of  possible 
differentiation  from  the  political  city. 

This  tension  of  citizenship  between  private  man 
and  public  duty,  formerly  mentioned  by  H.  Mark 
Roelofs  (1957),  is  also  at  the  heart  of  the  feminist 
critique  of  liberal  and  republican  citizenship. 
According  to  Carole  Pateman  (1989),  quite  rightly, 
this  critique  betrays  the  androcentrism  of  the  tra¬ 
ditional  notion  of  citizenship,  which,  behind  a 
conception  of  the  abstract  universal,  tends  to 
reflect  the  civil  evolution  of  men  more  than  it  does 
of  women.  From  this  point  of  view,  it  is  obvious 
that  the  famous  typology  of  civic  rights  formulated 
by  T.  H.  Marshall  and  discussed  earlier  hardly 
concerns  women  who,  in  most  Western  countries, 
are  denied  the  civic  rights  considered  by  the  British 
sociologist.  In  the  same  way  that  universal  suffrage 
was  considered  “universal”  for  a  long  time  when 
only  men  had  the  right  to  participate  in  the  election 
of  parliamentary  elites,  the  notion  of  citizenship 
remained  for  too  long  a  prisoner  of  a  conception  of 
politics  that  overlooked  gender.  In  light  of  gender, 
studies  on  citizenship  are 
now  enriched  with  new  soci¬ 
ological  analysis  and  major 

conceptual  reappraisals;  the  Duties  (the  “we”  prevails) 
idea  that  the  public  and  the 
private  sphere  constitute  two 
hermetic  and  distinct  spheres, 
a  hypothesis  that  is  at  the 
foundation  of  liberal  and 
republican  conceptions  of 
citizenship  as  described,  for 
example,  by  Engin  Isin  and 
Bryan  Turner  (2002,  chaps.  8 
and  9),  has  been  criticized 
because  it  was  the  main 
source  of  the  exclusion  of 
women  from  the  political 
city.  In  a  similar  way,  the  uni¬ 
versal  perspective  of  citizen¬ 
ship  was  strongly  modified  to 
recognize  the  multiple  irre¬ 
ducible  differences  between 
men  and  women.  Some  of  the 


most  substantial  studies  in  gendered  political  sci¬ 
ence,  however,  deal  with  the  inclusion  of  women  in 
parliament.  Whether  in  terms  of  correcting  the 
flaws  of  representativeness,  adjusting  the  electoral 
system  to  increase  the  presence  of  women  in  elec¬ 
tive  assemblies,  or  thinking  of  the  effects  of  parity 
on  politics,  research  on  civic  representation  has 
been  greatly  expanded  as  a  result  of  a  gendered 
approach  that  most  often  combines  political  theory 
with  comparative  political  analysis. 

Transformations  in  the  Civic  Community 

Beyond  these  theoretical  debates,  the  concept  of 
citizenship  is  an  excellent  reflection  of  contempo¬ 
rary  political  metamorphoses,  as  Isin  and  Turner 
(2002)  have  shown.  The  controversies  enliven  the 
numerous  studies  on  citizenship  that  force  political 
science  to  put  the  political  actor  in  touch  with 
themes  such  as  rights  and  obligations,  political  loy¬ 
alty  in  political  configurations  made  more  complex 
by  the  globalization  and  the  uprooting  of  politics, 
and  new  methods  of  political  communalization  in 
multicultural  societies  often  marked  by  postcolo¬ 
nial  demands.  Without  trying  to  cover  every  civic 
transformation,  Figure  1  summarizes  the  two  main 
transformations  of  political  citizenship  observed  at 

19th  century 
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Figure  I  Historical  Changes  of  Civic  Configurations 
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the  beginning  of  the  21st  century.  The  first  line  of 
evolution  concerns  the  normative  impact  of  citizen¬ 
ship  (left-hand  side  of  Figure  1);  the  second  trans¬ 
formation  applies  to  the  degree  of  politicization  of 
the  contemporary  status  of  citizenship  (right-hand 
side  of  Figure  1). 

In  the  same  way  as  classic  works  approach  the 
topic  of  citizenship,  the  identification  of  individu¬ 
als  as  members  of  a  political  community  (local, 
national,  or  postnational)  is  historically  linked  to 
the  normative  integration  of  citizens.  It  is  because 
individuals  share  not  only  the  same  system  of  val¬ 
ues  and  representations  but  also  common  rules  of 
behavior  that  they  become  aware  of  their  “unity.” 
The  strength  of  the  idea  of  identification  with  a 
national  “we”  revealed  by  various  political  science 
studies  is  also  due,  in  one  part,  to  this  normative 
tonality  of  membership  in  a  nation-state  and  its 
emotional  capacity  to  unite  citizens  around  a  body 
of  central  and  shared  values  and  rules.  New  civic 
methods  experimented  with  in  Europe,  North 
America,  and  Asia  are  probably  in  line  with  a  dif¬ 
ferent  perspective.  In  many  current  political  con¬ 
figurations  (e.g.,  European  Union,  India,  Canada, 
Australia,  South  Africa),  a  plurality  of  values  coex¬ 
ist  in  the  public  sphere  and  set  the  social  actor 
against  cultural  subsystems  of  different  values.  In 
this  case,  the  question  of  political  legitimacy  is  no 
longer  asked  in  relation  to  ultimate  values  but 
often  on  an  administrative  and  a  technocratic 
level.  The  weakness  of  cultural  integration — which 
was  at  the  heart  of  this  state-national  civic  project, 
for  instance,  the  republican  one — is  compensated 
for  by  the  economic  capacity  of  the  social  welfare 
system  to  ensure  to  every  person  a  certain  equality 
of  access  to  material  well-being.  The  pragmatic 
running  of  society  occurs  here  without  reference  to 
the  founding  values  on  the  basis  of  which  citizens 
struggle  to  come  together  because  of  former  or 
present  distributive  conflicts.  This  new  paradigm 
creates  a  model  of  citizenship,  slightly  inclusive  in 
political  and  cultural  terms  but  also  capable  of 
encouraging  equal — more  or  less  fair — access  to 
economic  and  social  goods  and  services  (bottom  of 
Figure  1).  From  this  point  of  view,  it  might  seem  to 
have  an  affinity  with  historical  logics  used  in  this 
process  of  building  the  “state-nations”  referred  to 
by  Linz  (1993).  However,  such  a  conception  of 
citizenship  makes  it  very  difficult  to  develop  a 
strong  sense  of  belonging  to  contemporary  political 


spheres  whose  economy  is  also  highly  globalized 
and  increasingly  discriminatory. 

In  the  manner  of  an  ideal  type,  it  is  possible  to 
compare  these  two  models  of  civic  configuration. 
On  the  one  hand  is  the  model  of  citizenship,  which 
was  experimented  with  for  Western  countries  (top 
part  of  Figure  1),  encouraging  the  politics  of  vertical 
integration  and  a  certain  level  of  homogenization  or 
at  least  a  certain  level  of  cultural  convergence.  This 
model  is  based  on  a  common  body  of  values,  beliefs, 
and  representations,  which  conveys  norms  and 
duties  that  the  state  has  to  uphold.  At  the  other  end 
of  the  scale  is  a  model  of  multicultural  citizenship, 
whose  horizontal  inclusive  logic  is  limited  to  ensur¬ 
ing  that  every  citizen  has  equal  access  to  the  eco¬ 
nomic  and  social  welfare  of  society  and  is  free  to 
choose  his  or  her  cultural  and  identity  affiliations 
(bottom  part  of  Figure  1).  Numerous  intermediate 
positions  are  possible  between  these  two  models. 

A  second  line  of  evolution  takes  shape  in  the 
background  once  attention  is  turned  to  the  con¬ 
temporary  sources  of  civic  “communalization.” 
Three  dominant  sources  of  communalization  can 
be  identified  in  a  broad  outline,  thanks  to  Max 
Weber’s  old  typology:  a  cultural  source  (religion, 
e.g.,  for  feudal  societies),  a  political  source  (e.g., 
the  civic  loyalty  demanded  by  the  nation-state), 
and  an  economic  source  (e.g.,  access  to  the  eco¬ 
nomic  market).  Historically,  these  three  sources  of 
communalization  operate  in  a  mixed  way  but  in 
varied  proportions  and  according  to  different  con¬ 
nections.  What  is  at  stake  here  is  clarifying  the 
respective  contributions  of  the  political  order,  of 
the  cultural  order,  and  of  the  economic  order  to 
the  communalization  of  citizens.  Today,  in  Europe 
and  elsewhere,  the  market  and  economic  or  profes¬ 
sional  mobility  hold  a  decisive  place  in  the  pro¬ 
cesses  of  regional  integration,  and  these  forces  may 
lead  to  rivalry  between  groups  with  different  cul¬ 
tural  identities.  At  the  same  time,  other  social 
institutions,  such  as  religion  or  the  family,  may 
lose  some  of  their  power  to  regulate  relationships 
between  cultural  groups.  Thus,  these  contempo¬ 
rary  transformations  of  citizenship  call  into  ques¬ 
tion  the  very  status  of  the  political  with  respect  to 
its  role  as  a  source  of  integration  and  regulation. 
Whereas  citizenship  was  once  seen  as  a  vehicle  for 
distinguishing  civic  membership  from  other  social 
identities,  today  it  must  accommodate  itself  to 
societal  demands.  This  relative  “depoliticization” 
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appears  to  be  the  result  of  a  new  stability  between 
different  sources  of  civic  communalization  that 
clearly  reduces  the  importance  of  political  loyal¬ 
ties  in  favor  of  both  transnational  patterns  of 
material  consumption  and  self-assertion  on  a 
cultural  level. 

It  is  in  this  overall  historical  transformation  that 
it  seems  necessary  to  locate  present  conditions  of 
identification  and  of  political  participation  in  con¬ 
temporary  societies.  It  is  also  by  taking  into  con¬ 
sideration  these  profound  transformations  that  the 
concept  of  citizenship  can  remain  a  central  notion 
in  political  science. 
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Civic  Culture 


Civic  culture  is  a  subtype  of  political  culture,  that 
is,  a  set  of  political  attitudes,  habits,  sentiments, 
and  behavior,  related  to  the  functioning  of  a 
democratic  regime.  It  implies  that  although  citi¬ 
zens  are  not  necessarily  involved  in  politics  all  the 
time,  they  are  aware  to  a  certain  extent  of  their 
political  rights  and  also  of  the  implications  of  the 
decision-making  processes  that  affect  their  life  and 
society.  Both  political  awareness  and  participation 
are  supposed  to  be  relevant  to  the  stability  of  a 
political  regime.  By  contrast,  citizens’  withdrawal 
from  political  life  has  consequences  not  only  for 
their  ability  to  get  what  they  want  from  the  polit¬ 
ical  community  but  also  for  the  quality  of  democ¬ 
racy.  Civic  culture  involves,  therefore,  some  level 
of  perception  of  the  republican  character  of  mod¬ 
ern  politics  and  adds  a  psychological  dimension  to 
the  concept  of  citizenship.  In  the  following,  the 
evolution  of  this  concept,  its  criticism  and  limita¬ 
tions,  and  its  relevance  for  contemporary  politics 
are  discussed. 

The  concept  of  civic  culture  is  part  of  a  long 
tradition  of  thought  that  investigates  the  nature  of 
democracy  from  a  historical  perspective.  It  refers 
to  the  role  of  political  traditions,  values,  and  cul¬ 
ture  in  the  achievement  of  democratization  and  the 
stabilization  of  a  regime.  Its  rationale  goes  back  to 
the  thinking  of  ancient  political  philosophers  such 
as  Aristotle,  but  in  modern  and  contemporary 
times  also,  Niccolo  Machiavelli,  Baron  de 
Montesquieu,  Alexis  de  Tocqueville,  John  Stuart 
Mill,  Max  Weber,  and  Norberto  Bobbio,  among 
others,  have  discussed  whether  a  set  of  specific 
political  attitudes,  convictions,  and  behavior  are  a 
necessary  and/or  sufficient  condition  for  the  suc¬ 
cess  of  modern  democracies.  The  question  is  con¬ 
troversial,  but  it  has  never  disappeared  from  the 
debate  about  the  necessary  conditions  to  achieve 
the  “good  government,”  that  is,  a  political  regime 
committed  to  the  ideal  of  full  human  realization. 

While  Aristotle  and  some  Roman  philosophers 
were  mostly  concerned  with  the  excesses  of  the 
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original  models  of  democracy  and  popular  repub¬ 
lics,  Montesquieu  and  Tocqueville  focused  on  the 
links  between  the  “spirit”  of  political  institutions 
and  laws  and  the  “habits  of  the  heart”  that  drive 
people  toward  greater  public  cooperation  and 
associational  life.  Aristotle  spoke  not  only  of  the 
political  goodwill  necessary  for  a  political  regime 
to  be  able  to  fulfill  its  mission  but  also  of  public 
virtues  such  as  civic  partnership  and  political 
restraint.  Tocqueville  claimed  that  the  success  of 
modern  democratic  regimes  depended  on  an  [indi¬ 
vidual’s]  self-interest  rightly  understood”  and 
asserted  that  an  open  and  free  public  life  is  not  a 
natural  given  but  depends  on  human  will;  he  also 
emphasized  the  importance  of  a  sense  of  modera¬ 
tion  and  self-restraint  in  the  process  of  political 
participation. 

Contemporary  concern  with  civic  culture  is  due 
mainly  to  two  political  phenomena  of  modern 
times:  first,  the  excesses  of  political  revolutions 
(e.g.,  the  French,  the  Paris  Commune,  and  the 
Soviet),  and  second,  the  collapse  of  democracy  in 
Europe  in  the  period  between  World  Wars  I  and  II, 
as  was  the  case  with  the  Weimar  Republic. 
Democracy  cannot  fulfill  its  promises  if  there  are  no 
democrats;  and  if  the  nature  of  the  democratic 
regime  does  not  create  an  excess  of  political  par¬ 
ticipation — which  may  degenerate  into  violence 
against  minorities  and  opponents — neither  does  it 
predicate  a  cynical  or  apathetic  public,  which  with¬ 
draws  from  public  life  and  which  could  lead  to 
abuses  of  power.  As  a  system  of  government  both 
of  and  for  the  people,  but  not  directly  by  the  people, 
representative  democracy  demands  public  involve¬ 
ment,  political  participation,  and  vertical,  horizon¬ 
tal,  and  social  accountability.  The  existence,  inde¬ 
pendence,  and  autonomy  of  civil  society  are  integral 
parts  of  it.  Moreover,  a  widespread  tolerance  for  a 
plurality  of  views  and  interests  and  a  widely  distrib¬ 
uted  sense  of  political  efficacy,  trust  in  government, 
and  mutual  trust  among  the  citizenry  are  seen  as 
key  elements  in  effective  civic  culture. 

Civic  Culture  as  Political  Culture 

The  most  important  contemporary  contribution  to 
the  development  of  this  theme  appeared  early  in  the 
1960s  in  the  work  of  the  American  political  scien¬ 
tists  Gabriel  Almond  and  Sidney  Verba.  Their  book 
The  Civic  Culture  demonstrated  the  connection 


between  specific  political  attitudes  and  the  histori¬ 
cal  experience  of  democratic  stability  or  the 
absence  of  it  in  five  regions:  Germany,  Italy, 
Mexico,  the  United  Kingdom  (UK),  and  the  United 
States. 

In  the  context  of  the  process  of  decolonization 
and  redemocratization  that  emerged  after  World 
War  II,  Almond  and  Verba  were  concerned  with 
the  broader  question  of  the  emergence  of  a  new 
world  culture  involving  positive  and  negative 
political  tendencies  in  countries  that  had  become 
democratized  as  a  result  of  external  pressure,  such 
as  Germany,  Italy,  and  Japan,  and  also  in  Third 
World  nations  recently  freed  from  their  colonial 
status  but  not  necessarily  from  their  traditional 
political  heritage.  They  wanted  to  know  whether 
those  countries  had  the  necessary  conditions  to 
give  birth  to  effective  democratic  regimes  or 
whether  old  authoritarian  or  antidemocratic  val¬ 
ues  would  prevent  those  countries  from  establish¬ 
ing  free,  competitive,  and  effective  political  sys¬ 
tems.  The  continuing  existence  of  a  number  of 
authoritarian  regimes  in  Europe  (Spain,  Portugal, 
and  Greece)  and  similar  experiences  in  Latin 
America,  Africa,  and  Asia  in  the  1950s,  1960s,  and 
1970s  showed  the  relevance  of  their  concern. 

According  to  the  logic  of  political  culture 
theory — in  which  some  critics  found  that  psycho¬ 
logical  approaches  had  too  great  an  influence — the 
effective  performance  of  a  democratic  regime 
would  require  some  kind  of  accommodation 
between  the  elitist  conception  of  the  role  of  politi¬ 
cal  leaders  and  some  level  of  citizens’  involvement 
in  the  realm  of  politics.  While  it  was  later  defined 
in  terms  of  the  strategic  role  of  political  elites  in 
making  governments  govern — through  their  politi¬ 
cal  initiatives  and  performance — the  new  theory 
maintained  that  there  was  also  an  area  of  collective 
political  convictions  and  modes  of  participation  to 
be  taken  into  consideration.  The  latter  was  said  to 
provide  the  context  for  the  emergence  of  the  for¬ 
mer.  The  microdimension  of  politics,  perceived  at 
the  individual  level,  should  be  congruent  with  its 
macrostructures,  seen  at  the  national  level. 
Incongruence  of  political  attitudes  and  the  behavior 
of  the  masses  and  elites,  on  the  one  hand,  and  the 
functioning  of  democratic  institutions,  on  the  other, 
could  produce  a  situation  in  which  there  were 
incomplete,  untrustworthy,  or  dysfunctional  politi¬ 
cal  systems.  Looking  at  the  survival  of  democracy 
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in  Britain,  the  United  States,  and  some  European 
countries  in  the  interwar  period,  Gabriel  Almond 
and  Sidney  Verba  discussed  what  kind  of  citizenry 
was  needed  to  achieve  a  proper  balance  of  power 
and  to  enable  a  stable  democratic  regime  to  be 
responsive. 

This  theory  held  that  to  work  according  to  its 
principles,  the  democratic  process  should  involve  a 
healthy  tension  between  civic  obligation  and  actual 
civic  performance.  It  required  appreciation  by  the 
public  of  the  virtues  of  democracy  as  compared 
with  its  alternatives.  To  submit  to  the  law,  to  the 
mechanisms  of  its  enforcement,  and  to  the  author¬ 
ities,  citizens  should  give  both  formal  and  informal 
consent  to  the  functioning  of  the  political  system; 
but  at  the  same  time,  they  should  have  a  proper 
sense  of  their  own  duties.  In  this  respect,  both  the 
subjective  and  the  objective  political  competences 
of  citizens  were  considered  to  be  of  great  impor¬ 
tance.  Such  a  political  culture  depended  also  on  the 
levels  of  mutual  trust  among  citizens.  This  long- 
established  notion  referred  to  the  fact  that  inter¬ 
personal  trust  is  necessary  to  stimulate  common 
action  directed  toward  the  realization  of  collective 
goals.  Citizens  should  acknowledge  their  obliga¬ 
tion  to  be  part  of  the  system  and  also  believe  that 
political  institutions  would  be  accessible  to  their 
participation,  but  at  the  same  time,  they  should  be 
selective  if  their  performance  fell  short  of  what  was 
required.  Then,  how  could  one  balance  political 
activity  and  passivity  while  allowing  political  lead¬ 
ers  both  to  exercise  power  and  to  respond  to  the 
demands  and  preferences  of  citizens?  A  political 
division  of  labor — strongly  criticized  by  both  com¬ 
munitarian  and  republican  defenders — was  seen  as 
an  essential  condition  to  allow  the  political  system 
to  work  effectively,  to  make  political  choices,  and 
to  encourage  voters  to  evaluate  the  performance  of 
political  leaders  and  governments. 

Politics  presupposes  conflict  and  antagonism 
between  parties  and  groups  in  open  democratic 
systems,  but  these  forces  should  be  contained  by  a 
general  national  loyalty  and  support  for  the  politi¬ 
cal  system.  The  civic  culture  approach  presupposes 
that  a  political  culture  congruent  with  a  stable 
democracy  involves  a  high  degree  of  consensus 
concerning  the  legitimacy  of  democratic  institu¬ 
tions  and  the  content  of  public  policy. 

Political  culture  has  been  defined  as  a  set  of 
attitudes,  beliefs,  and  sentiments  that  give  order 


and  meaning  to  the  political  process  and  provide 
the  underlying  assumptions  and  rules  that  govern 
behavior  within  the  political  system.  Expanding  on 
this  definition,  Almond  and  Verba  distinguished 
between  three  different  types  of  political  orienta¬ 
tion:  (1)  cognitive ,  referring  to  knowledge  of  and 
belief  in  the  political  system,  its  roles,  incumbents, 
inputs,  and  outputs;  (2)  affective ,  translated  as 
feelings  and  sentiments  about  the  political  system, 
its  roles,  personnel,  and  outcomes;  and  (3)  evalua¬ 
tive,  based  on  judgments  and  opinions  about 
political  objects,  particularly,  the  input  and  output 
processes,  incumbents,  and  their  performance. 

The  political  culture  of  the  five  nations  that 
they  compared  was  characterized  in  terms  of  two 
key  attitudinal  variables:  commitment  and  involve¬ 
ment.  The  first  looks  at  individual  attitudes 
toward  the  political  system,  distinguishing  between 
allegiant,  apathetic,  and  alienated  political  orien¬ 
tations;  the  second  measures  attitudes  toward 
participation,  differentiating  between  parochial, 
subject,  and  participant  attitudinal  types.  Civic 
culture  would  combine  both  a  participatory  and  a 
deferential  perspective  of  politics,  forming  a  mixed 
political  culture  in  which  action  and  nonaction, 
obligation  and  performance,  and  conflict  and 
cooperation  would  be  balanced  and  combined. 
Citizens  would  have  to  mobilize  certain  particular 
civic  virtues  in  order  to  be  able  to  evaluate  issues 
relevant  to  them  and  their  society.  These  civic  vir¬ 
tues  would  then  help  prevent  both  the  overload  of 
the  political  system  and  an  excessively  deferential 
polity,  which  could  give  way  to  new  forms  of 
authoritarianism. 

Critical  Views  of  the  Concept 

The  concept  of  civic  culture  had  a  considerable 
influence  on  the  development  of  political  analyses 
in  the  late  1950s  and  early  1960s.  Conceptually 
linked  with  the  so-called  behavioral  revolution,  its 
proponents  wanted  to  signal  a  movement  away 
from  the  study  of  formal  institutions — typical  of  the 
so-called  old  institutionalism — toward  the  study  of 
informal  behavior  to  see  politics  in  real  life.  In  the 
1970s,  however,  the  cultural  approach  fell  out  of 
academic  fashion  and  came  under  strong  criticism 
for  being  conservative,  static,  and  tautological;  for 
ignoring  real-power  relations;  and  for  being  inca¬ 
pable  of  explaining  social,  political,  and  cultural 
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change.  The  view  of  political  culture  and  democ¬ 
racy  presented  in  The  Civic  Culture  rapidly 
became  the  subject  of  considerable  and  rather 
heated  debate.  Its  critics  said  it  was  deterministic, 
ideological,  and  inconsistent  with  the  real  nature 
of  liberal  capitalistic  societies  and  only  repre¬ 
sented  an  idealistic  (and  ideological)  version  of 
politics  in  the  United  States  and  the  United 
Kingdom,  their  prototypes  of  civic  cultures.  But 
the  strongest  charge  against  the  theory  concerned 
the  direction  of  causality  in  the  relation  between 
culture  and  political  structure.  The  British  politi¬ 
cal  philosopher  Brian  Barry  and  the  American 
political  scientist  Dankwart  Rustow  claimed  that 
Almond  and  Verba  failed  to  explain  the  assump¬ 
tion  about  the  congruence  between  political  struc¬ 
ture  and  the  culture  needed  for  democratic  regimes 
to  succeed.  The  culture  of  politics  is  a  conse¬ 
quence,  not  a  cause  of  institutional  structure,  they 
argued.  People  believe  in  a  system  through  learned 
processes  based  on  “habituation”  and  “accommo¬ 
dation”  with  the  logic  of  democratic  institutions 
and  not  the  other  way  round.  But  neither  Barry 
nor  Rustow  did  any  empirical  research  to  prove 
their  point.  At  the  time  he  was  making  his  criti¬ 
cism,  Barry  thought  that  there  was  not  enough 
research  to  allow  consistent  conclusions  about  the 
controversy  to  be  drawn  and  suggested  that  to 
advance  the  debate,  scholars  should  study  the  phe¬ 
nomenon  of  transformation  of  political  regimes  to 
be  able  to  grasp  the  real  sense  of  causality — at  the 
time,  however,  there  were  no  regimes  undergoing 
such  a  change. 

Political  values  and  orientations  certainly  can¬ 
not  exist  in  a  vacuum,  but  critics  were  missing  the 
point — as  Arendt  Lijphart  pointed  out.  Almond 
and  Verba  allowed  causality  to  work  both  ways;  in 
other  words,  political  culture  not  only  influences 
institutional  design,  but  it  is  also  influenced  by 
institutional  quality  and  its  functioning.  Moreover, 
the  critics  did  not  explain  the  normative  basis  of 
institutional  choices,  something  else  that  was  not 
to  be  taken  as  a  given.  In  other  words,  institutions 
are  chosen  to  carry  out  specific  functions  in  order 
that  societies  can  work  as  people  want  them  to, 
and  unless  one  assumes  that  institutional  choices 
are  disembodied  and  independent  of  human 
desires,  ideals,  and  objectives,  they  do  refer  to  nor¬ 
mative  goals,  which  are  part  of  political  tradition 
and  culture.  Individuals’  continuing  commitment 


to  political  principles — such  as  the  right  to  contest 
power  or  participate  in  the  political  process — is  a 
central  aspect  of  the  process  of  institutional  choice. 
Rational  motivations  involved  in  such  choices  are 
contextualized  by  power  relations,  political  cul¬ 
ture,  and  tradition. 

The  concept  has  also  been  criticized  for  its  sup¬ 
posed  inability  to  deal  with  political  and  cultural 
change.  Citizens  are  not  automatons,  passively 
receiving  and  internalizing  political  values  and 
norms,  said  the  critics,  and  although  political  cul¬ 
ture  is  transmitted  from  generation  to  generation,  it 
is  not  transmitted  unchanged,  nor  is  it  transmitted 
randomly  or  without  question.  Cultural  transmis¬ 
sion  is  an  active  and  responsive  process,  which  is 
continuously  being  negotiated  by  individuals.  The 
adult  political  experience  over  time,  rather  than  just 
political  socialization  during  childhood,  is  crucial  to 
explain  the  ways  political  views  are  shaped.  This 
means  that  experiences  with  institutions  do  matter, 
but  critics  of  The  Civic  Culture  never  acknowl¬ 
edged  that  its  authors  did  actually  draw  attention  to 
the  fact  that  previous  political  analysis  had  under¬ 
emphasized  the  crucial  role  of  experience  within 
political  systems.  The  study  was  also  criticized  for 
not  properly  considering  the  process  of  competing 
norms  and  values  within  a  society.  Marxist  critics 
or  liberals  such  as  Carole  Pateman  also  claimed  that 
they  did  not  deal  with  class  cleavages  in  their  study 
and  were  therefore  not  able  to  grasp  the  processes 
of  political  cultural  changes  occurring  as  a  conse¬ 
quence  of  the  dispute  for  power  and  political  hege¬ 
mony  in  contemporary  societies. 

Between  the  1970s  and  the  1990s  the  civic  cul¬ 
ture  approach  became  squeezed  between  two 
extremely  prestigious  and  critical  perspectives  of 
contemporary  political  science:  neo-institutional- 
ism  and  the  different  theories  that  investigate  the 
success  of  a  democratic  regime  on  the  basis  of  the 
political  and  economic  performance  of  govern¬ 
ments,  political  leaders,  and  parties.  While  the  first 
excludes  any  reference  to  a  normative  approach  to 
explain  the  design  and  functioning  of  institutions, 
the  second  emphasizes  the  effects  of  economic  and 
political  performance.  Both  approaches  contradict 
the  notion  that  an  individual’s  internalization  of 
values  and  cultural  norms  is  relevant  in  explaining 
the  outcomes  of  a  political  system. 

Consistent  responses  to  those  critics  appeared, 
however,  in  the  mid-1990s  when  the  complexity  of 
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processes  of  political  transition  in  different  coun¬ 
tries  around  the  world  demanded  more  sophisti¬ 
cated  explanations,  recognizing  factors  other  than 
only  those  relating  to  institutions  or  political  per¬ 
formance.  The  question  now  was  the  quality  of 
democracy,  not  just  its  mere  existence. 

The  Renaissance  of  Civic  Culture 

Recent  explanations  of  democratization  are  once 
more  returning  to  the  cultural  approach.  These 
studies  are  less  deterministic  and  are  based  on  a 
probabilistic  perspective  about  the  influence  of 
political  culture.  They  also  assume  that  political 
structures  and  civic  culture  have  a  mutual  influ¬ 
ence.  One  of  its  main  motivations  is  the  perception 
that  the  simple  transformation  of  institutions  that 
usually  characterizes  the  end  of  processes  of  politi¬ 
cal  transition  is  not  enough  to  explain  the  different 
outcomes.  The  fact  that  in  different  parts  of  the 
world  there  are  democratic  regimes  that  do  not 
fully  uphold  the  rule  of  law,  the  principles  of  civil 
and  political  rights,  and  the  mechanisms  of 
accountability  has  directed  researchers’  attention 
once  again  to  the  role  of  attitudes,  convictions,  and 
behavior  of  people  in  explaining  the  variance  of 
the  quality  of  democracy. 

This  renaissance  of  the  civic  culture  approach  is 
principally  due  to  two  kinds  of  contributions.  The 
first  is  the  work  of  Robert  Putnam  on  the  role  of 
social  capital  in  social  and  political  development. 
Social  capital — bonding,  bridging,  and  linking — is 
defined  as  networks  and  social  norms  of  trust  and 
reciprocity  that  strongly  connect  individuals  to  the 
common  interests  and  goals  of  their  community. 
Putnam  has  shown  with  his  work  on  Italy  and  the 
United  States  that  social  networks  based  on  mutual 
trust  and  the  desire  of  individuals  to  act  in  com¬ 
mon  is  decisive  for  the  achievement  of  social  goals 
and  for  the  development  of  a  democratic  institu¬ 
tional  environment.  While  in  the  south  of  Italy 
traditional  ties  do  not  work  toward  achieving 
openness  in  public  life  and  form  part  of  a  cycle  of 
political  abuse,  corruption,  and  governments  char¬ 
acterized  by  low  performance,  in  the  north  a  long 
and  cumulative  tradition  of  cooperation  and 
mutual  trust  has  engendered  higher  levels  of  social 
and  political  participation  and  allowed  regional 
governments  to  perform  much  better.  Social  trust 
derived  from  the  accumulation  of  social  capital 


makes  citizens  more  cooperative,  reciprocal,  and 
willing  to  associate  and  act  collectively.  Thus, 
Putnam’s  contribution  has  emphasized  how  differ¬ 
ent  objective  and  subjective  conditions  affect  insti¬ 
tutional  development. 

The  second  refers  to  the  extensive  work  of 
Ronald  Inglehart  about  the  relevance  of  postmate¬ 
rialist  values  from  a  human  development  perspec¬ 
tive,  which  involves  the  consolidation  of  attitudes 
that  contribute  toward  democratic  structures. 
Inglehart  revised  the  theory  of  modernization  and 
argued  that  social  and  economic  changes  pro¬ 
foundly  affect  the  traditional  political  culture  of 
contemporary  societies  in  such  a  way  that  indi¬ 
viduals  leave  behind,  as  time  passes,  their  attach¬ 
ment  to  survival  values  and  develop  means  of 
self-expression  that  enable  them  to  have  more 
autonomy  and  independent  patterns  of  relations 
with  political  authorities.  Basing  their  studies  on 
the  five  waves  of  the  World  Values  Survey,  Ronald 
Inglehart  and  Christian  Welzel  tested,  on  the  one 
hand,  the  impact  of  values  over  time  on  the  exis¬ 
tence  of  effective  democracies  and,  on  the  other, 
the  influence  of  democratic  institutions  and  the 
length  of  time  a  system  of  government  with  demo¬ 
cratic  values  is  in  place.  According  to  them,  the 
results  clearly  demonstrate  that  while  values  are 
decisive  in  determining  the  existence  and  duration 
of  democratic  institutions,  the  latter  have  only  a 
weak  influence  over  time  in  creating  a  civic  cul¬ 
ture;  indeed,  under  the  influence  of  other  variables 
their  effects  disappear  altogether.  They  claim  that 
studies  have  confirmed  some  basic  assumptions  of 
theories  of  political  culture  even  when  other  vari¬ 
ables  are  included  in  explanations  of  democratiza¬ 
tion  processes. 

Toward  the  end  of  the  1990s  and  at  the  begin¬ 
ning  of  this  century,  the  works  of  Pippa  Norris, 
Richard  Rose,  and  Doh  Shin  also  became  associ¬ 
ated  with  the  resurgence  of  the  civic-cultural 
approach.  Using  different  arguments,  they  reacted 
to  the  analyses  that  in  the  1970s  and  1980s  were 
deeply  concerned  with  the  decrease  of  political 
trust  in  Europe  and  the  United  States  and  also  to 
the  enormous  difficulties  faced  by  newly  democra¬ 
tized  countries  in  creating  effective  democracies.  In 
the  book  Critical  Citizens  (1999),  Norris  and  her 
collaborators  combine  the  institutionalist  approach 
with  the  recognition  that  citizens’  political  experi¬ 
ence  is  crucial  in  explaining  their  attitudes  and 
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behavior.  Shin,  Rose,  and  others  argue  that  ordi¬ 
nary  people’s  support  for  a  democratic  regime  is  a 
learning  and  relearning  process  in  which  both  their 
values  and  their  normative  perspectives  have  a  role 
to  play  as  well  as  their  rational  perception  and 
evaluation  of  the  functioning  and  performance  of 
institutions.  In  both  cases,  without  opposing  the 
perspectives  of  civic  culture  and  institutionalism, 
they  emphasize  the  importance  of  the  adult  politi¬ 
cal  experience  of  citizens  in  explaining  the  condi¬ 
tions  under  which  normative  expectations  about 
the  role  of  institutions  interact  with  rational  evalu¬ 
ations  of  institutional  and  governmental  perfor¬ 
mance.  The  critical  view  of  citizens  is  interpreted 
as  part  of  the  new  civic  culture. 

Conclusion 

The  Civic  Culture  remains  a  model  of  scientific 
inquiry,  allowing  researchers  to  test,  refine,  or 
reject  their  hypotheses.  According  to  Aaron 
Wildavsky,  Almond  and  Verba  made  their  evi¬ 
dence  speak  on  a  profound  theoretical  issue,  and 
their  empirical  work  allows  others  to  test  their 
assumptions.  Presently,  the  concept  is  undergoing 
a  renaissance,  but  this  does  not  simply  imply  going 
back  to  the  original  approach.  New  developments 
have  incorporated  a  more  flexible  interpretive  and 
less  formalistic  perspective,  integrating  both  insti¬ 
tutional  and  cultural  approaches  that  explain  how 
citizens  are  involved  in  politics  and  how  they  influ¬ 
ence  it.  Civic  culture  is  considered  to  be  relevant 
not  only  for  the  stability  of  any  democratic  regime 
but  also  for  its  quality. 
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Civic  Participation 


Civic  participation  can  be  defined  as 

citizens’  access  to  and  participation  in  information, 
decision  making,  and  implementation  of  public 
policies  broadly  construed; 

taking  part  in  democracy — individually  or  as  a  part 
of  organized  groups — through  communication  and 
public  actions  (including  electoral  campaigns  and 
elections)  where  public  interests  prevail  over 
private  ones; 

the  act  of  becoming  involved  in  the  political 
process  and  working  to  better  the  community;  and 
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a  means  to  guarantee  the  credibility  of  institutions, 
through  articulation  of  citizens’  demands  and 
holding  public  officials  accountable. 

As  the  base  concept  of  participation,  the  notion 
of  civic  participation  is  rooted  in  the  normative 
assumption  that  the  efficiency  of  any  economic, 
management,  and  other  social  system  as  well  as  the 
legitimacy  of  democratic  political  systems  depends 
on  the  involvement  and  participation  of  the  public 
and  does  not  simply  derive  from  the  output  dis¬ 
pensed  by  the  system. 

The  term  civic  participation  concentrates  on 
the  different  means  by  which  the  public  can  par¬ 
ticipate  in  the  decision-making  process  within  the 
political  community.  Democracy  is  considered  to 
be  the  political  regime  that  allows  for  the  best 
civic  participation,  because  democratic  regimes 
are  based  on  the  principle  of  interaction  and  com¬ 
munication  between  political  institutions  (parties, 
parliaments,  local  government,  etc.)  and  the  peo¬ 
ple.  Civic  participation  brings  a  great  number  of 
people  with  diverse  sources  of  information  and 
interests  into  the  process  of  public  decision  mak¬ 
ing.  Widespread  and  intensive  public  discussions 
enhance  the  critical  assessment  of  different  moral 
ideas,  informing  decision  makers.  As  a  result, 
civic  engagement  contributes  to  the  quality  of 
democracy. 

This  entry  presents  an  analytic  overview  of  the 
concept’s  academic  interpretations.  It  considers  the 
conceptualization  and  various  empirical  manifes¬ 
tations  of  civic  participation  and  evokes  several 
unresolved  theoretical  issues. 

Conceptualization 

Civic  participation  (civic  is  derived  from  the  Latin 
word  civis  [“citizen”]  and  refers  to  involvement  in 
the  public  sphere,  as  in  the  classical  Greek  polis)  is 
an  old  concept  that  was  introduced  when  civil  life 
surpassed  the  arbitrary  absolutistic  monarchies  or 
feudal  powers,  calling  for  individual  freedom  and 
the  balance  between  individual  liberty  and  social 
equality.  Alexis  de  Tocqueville  argued  that  partici¬ 
pation  in  purpose-driven  associations  socialized 
people  into  a  polity  with  more  than  merely  egotis¬ 
tic  desires,  and  he  concluded  that  democracy 
worked  through  people’s  participation  in  collec¬ 
tive  activities,  independent  of  the  state. 


The  concept  of  civic  participation  was  further 
elaborated  in  modernization  theories  (theories  of 
social  and  economic  development  and  democrati¬ 
zation)  and  became  one  of  the  main  vehicles  of 
international  (mostly  sponsored  by  the  United 
Nations)  development  policies  in  various  regions 
of  the  world.  As  Gabriel  Almond  and  Sidney 
Verba  note,  the  idea  is  that  prospects  of  socioeco¬ 
nomic  development  and  democratization  rely  on  a 
civic  culture,  participatory  public  attitudes,  and 
social  values.  Through  civic  participation,  people 
can  influence  social,  cultural,  and  economic  poli¬ 
cies,  and  their  participation  extends  beyond  regu¬ 
lar  participation  in  elections.  This  means  that 
decision  makers  must  continuously  take  into 
account  the  interests,  rights,  and  opinions  of  mem¬ 
bers  of  society.  If  participatory  culture  is  weak 
and/or  political  institutions  are  inaccessible  to 
people  (or  representation  is  biased  toward  the 
privileged),  then  the  connection  between  people 
and  politics  is  poor,  compromising  sustained 
socioeconomic  development.  Amartya  Sen  (1999) 
argues  that  “no  substantial  famine  has  ever 
occurred  in  any  independent  country  with  a  demo¬ 
cratic  form  of  government  and  a  relatively  free 
press”  (p.  152).  The  thrust  of  this  argument  is  that 
politicians  in  democratic  settings  (with  a  minimum 
of  respect  for  the  freedom  of  expression)  face  the 
demands  of  citizens  and  have  to  accommodate 
them  (including  those  of  the  most  disadvantaged 
groups). 

Robert  Dahl  presented  an  institutional  model, 
linking  participation  within  the  structure  of  politi¬ 
cal  institutions.  In  his  account,  political  actors  are 
individuals  who  rationally  calculate  if,  when,  and 
where  they  spend  their  scarce  resources  (e.g., 
money,  time,  know-how)  in  the  pursuit  of  broader 
collective  goals  in  public  affairs.  Individuals’  pub¬ 
lic  involvement  is  shaped  by  their  individual  taste 
and  preference  for  political  actions,  contrasted 
with  opportunities  for  other  social  activities  (and 
the  possibility  to  free  ride,  as  the  theory  of  collec¬ 
tive  action  emphasizes).  The  institutional  model  of 
participation  confines  the  issues  with  which  each 
individual  might  be  concerned  and  alleges  that  the 
scope  and  ways  of  people’s  participation  are  rather 
limited.  Yet  it  claims  that  conditions  of  pluralistic 
democracy  allow  people  to  participate  in  public 
affairs  roughly  as  equals.  This  approach  views 
democracy  instrumentally,  as  a  bargaining  process 
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among  different  groups,  where  interests  of  each 
group  are  not  subject  to  any  further  debate. 

In  normative  democratic  theory,  the  institu¬ 
tional  model  of  participation  is  prominently  criti¬ 
cized  by  Carole  Pateman,  who  refined  the  concept 
of  participation  and  focused  it  on  a  community  or 
a  social  group  rather  than  on  separate  individuals. 
Pateman  elaborated  the  notion  of  participation, 
which,  as  a  form  of  genuine  civic  action,  is  possible 
only  in  democracies,  contrasted  with  totalitarian 
or  authoritarian  states,  which  force  participation 
by  propaganda  or  sheer  repression.  Civic  action  is 
defined  as  being  undertaken  collectively  in  pursuit 
of  the  general  good,  in  line  with  the  principle  of 
political  morality  that  benefits  and  burdens  all 
citizens  equally.  The  individual  pursuit  of  self- 
interest  is  not  its  only  and  decisive  driving  force.  In 
her  generic  account  of  civic  participation,  Pateman 
puts  forward  public  discussions  of  common  issues 
rather  than  rational  strategies  aiming  at  individual 
or  narrow  group  goals.  In  a  way,  she  makes  a  con¬ 
ceptual  distinction  between  participation  as  an 
instrumental  action  (pursuing  mostly  protective 
functions)  and  as  an  interaction  (with  mostly  edu¬ 
cative  functions)  and  clearly  attaches  more  impor¬ 
tance  to  the  latter. 

In  the  tradition  of  Aristotelian  and  Rousseauian 
political  philosophy,  the  ideal  of  active  delibera¬ 
tion  lies  at  the  basis  of  Hannah  Arendt’s  concep¬ 
tion  of  participatory  citizenship,  which  in  demo¬ 
cratic  polities  rests  on  values  and  preferences  that 
are  discursively  examined,  acknowledged,  and 
renewed.  Active  engagement  of  citizens  in  the  pub¬ 
lic  realm  (discussing  public  issues  and  solving 
problems  of  their  communities)  provides  people 
with  the  experience  of  public  freedom  and  conveys 
a  sense  of  their  political  agency  (meaningfulness). 
In  that  respect,  it  is  important  to  underline  that 
Arendt  cherished  the  idea  of  direct  political  par¬ 
ticipation  and  claimed  that  citizenship  can  only  be 
practiced  as  a  common  action,  collective  delibera¬ 
tion,  and  the  sharing  of  power.  According  to 
Arendt,  relations  of  civility  and  solidarity  among 
citizens  can  be  established  and  maintained  only 
through  their  engaged  public  participation.  By 
contrast,  Arendt  argued  that  citizenship  should 
not  be  conceptualized  in  terms  of  intimacy, 
warmth,  authenticity,  and  communal  feelings, 
since  those  inevitably  led  to  the  loss  of  public  val¬ 
ues  such  as  impartiality  and  civic  friendship  and 


the  blurring  of  distinctions  between  public  and 
private  interests. 

Attempts  to  synthesize  these  two  apparently 
incompatible  models  of  institutional  and  commu¬ 
nal  participation  gave  birth  to  the  concept  of  civic 
engagement,  as  described  by  Robert  Putnam, 
Theda  Skocpol,  and  Morris  Fiorina.  Here,  engage¬ 
ment  means  being  interested  in  public  issues  and 
getting  and  staying  involved  in  social  activities. 
Civic  engagement  refers  to  individuals’  belonging 
and  participating  in  various  social  groupings  or 
associations.  Civic  engagement  is  grounded  in  the 
face-to-face  interaction  and  communication  among 
persons  (in  real  empirical  settings  such  as  neigh¬ 
borhoods,  clubs,  schools,  churches).  As  Putnam 
notes  (2000),  without  “face-to-face  interaction, 
without  immediate  feed-back,  without  being  forced 
to  examine  our  opinions  under  the  light  of  other 
citizens’  scrutiny,  we  find  it  easier  to  hawk  quick 
fixes  and  to  demonize  anyone  who  disagrees” 
(pp.  341-342).  However,  civic  engagement  inter¬ 
actions  usually  (and  realistically)  do  not  evolve 
into  any  elevated  political  discussion,  that  is,  they 
do  not  yield  acute  public  debate  about  the  future 
of  the  entire  community  or  about  the  totality  of 
social  activities  within  the  community.  The  con¬ 
versations  and  exchanges  here  mostly  concern 
everyday-life  issues  and  community  routines.  Civic 
engagement  generates  social  networks,  habits  of 
cooperation,  interpersonal  trust,  and  norms  of 
reciprocity,  which  further  contribute  to  the  social 
capital,  or  generalized  social  trust  (mostly  mea¬ 
sured  by  the  trust  in  public  institutions  and  in 
people  in  general).  Subsequently,  the  stock  of 
social  capital  might  be  applied  to  cooperative  dis¬ 
cussions  of  political  issues  and  joint  civic  actions, 
and  hence,  it  might  ultimately  contribute  to  better 
results  in  all  domains  of  public  life.  Putnam  (2000) 
observes  that  “in  the  language  of  economics,  social 
capital  lowers  transaction  costs  and  eases  dilem¬ 
mas  of  collective  action”  (p.  346). 

From  the  historical-institutional  perspective, 
Skocpol  claims  that  associative  activities  are  not  by 
and  of  themselves  sufficient  to  ensure  good  gover¬ 
nance.  Reminiscent  of  the  protracted  pernicious 
activities  of  various  racist  and  xenophobic  organi¬ 
zations,  Skocpol  argues  that  state  institutions 
remain  important  in  channeling  and  focusing  pub¬ 
lic  activism,  bringing  together  different  perceptions 
of  the  public  interest  as  well  as  limiting  arbitrary 
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political  power.  Along  with  Putnam’s  observation 
of  the  decline  of  social  capital  in  modern  societies, 
Fiorina  and  Skocpol  acknowledge  that  the  rise  of 
professional  organizations  and  single-issue  advo¬ 
cacy  groups  alienate  many  ordinary  citizens  from 
public  life.  The  authors  maintain  that  encouraging 
more  participation  by  ordinary  people  is  the  best 
way  to  counteract  these  destructive  tendencies  of 
constricted  participation  in  modern  democracies. 

Although  civic  engagement  places  broadly 
defined  social  participation  before  political  par¬ 
ticipation,  it  is  a  type  of  participation  that  is 
aimed  at  the  core  of  state  institutions  and  public 
affairs.  The  civic  engagement  model  helpfully 
addresses  issues  such  as  the  scope  and  types  of 
nongovernmental  (voluntary)  activities  in  society, 
the  fields  of  nonprofit  activity,  the  degree  of  pub¬ 
lic  affairs  politicization,  and  so  on.  Yet  more  ques¬ 
tions  arise  when  one  studies  the  problems  of 
defense  of  public  interests,  not  readily  concen¬ 
trated  in  some  geographical  space  and  not  united 
in  some  well-thought-out  groups.  These  include 
environmental  concerns,  consumer  awareness, 
good  governance  (transparency)  ideals,  and 
human  rights,  which  are  exceedingly  dispersed 
and  usually  opposed  by  concentrated,  entrenched 
interest  groups,  often  including  state  agencies.  The 
concept  of  social  movement  is  frequently  used  to 
address  such  research  puzzles. 

Social  movements  are  defined  as  a  series  of  con¬ 
tentious  performances,  displays,  and  campaigns  by 
which  ordinary  people  make  collective  claims. 
Charles  Tilly  enumerates  a  series  of  factors  that 
brought  about  modern  social  movements.  They 
include  urbanization  (the  massive  presence  of 
people  in  cities  facilitates  social  interaction  and  the 
formation  of  purposeful  groups),  industrialization 
(significant  groups  of  workers  are  concentrated  in 
factories),  mass  education  (higher  education  helps 
develop  and  articulate  ideologies  and  collective 
claims),  communication  (technological  progress 
makes  contacts  and  interaction  easier),  and  democ¬ 
ratization  (the  worldwide  spread  of  democratic 
regimes  legitimizes  the  quest  for  political  rights 
and  empowerment). 

In  the  contemporary  world,  given  the  uncertain¬ 
ties  caused  by  representative  democracy  and  the 
failures  of  formal  mechanisms  for  civic  participa¬ 
tion  under  conditions  of  modern  complexity,  the 
importance  of  large-scale  informal  actions  is  very 


great.  Even  though  informal  structures  and  ways 
of  participation  in  public  affairs  have  always  coex¬ 
isted  with  codified  procedures,  yet  the  recent  phe¬ 
nomenon  of  continuous  and  adaptable  civic  par¬ 
ticipation  through  social  movements  represents  a 
significant  expression  of  active  civil  society.  New 
consequential  societal  goals  (expressive  and  delib¬ 
erative  actions,  devoid  of  purely  pragmatic  aims) 
are  achieved  through  changes  brought  by  social 
movements  into  existing  ideological  and  organiza¬ 
tional  environments. 

Actually,  social  movements  extend  in  terms  of 
range  from  local  and  international  movements  to 
multilevel  and  global  movements.  In  terms  of 
methods,  violent  movements  continue  to  play  an 
important  role  and  are  often  shaped  as  semiterror¬ 
ist  and  terrorist  organizations  with  some  social 
support.  Yet  the  majority  of  social  movements 
seek  to  achieve  social  and  civic  goals  by  peaceful 
means.  Rights  and  obligations  associated  with 
civic  participation  require  the  constitutional  frame¬ 
work  that  can  be  jeopardized  by  violence.  There 
are  important  examples  of  (at  least  partially)  suc¬ 
cessful  peaceful  social  movements:  Mahatma 
Gandhi’s  civil  disobedience  movement  against  the 
British  rule  in  India  ( 1 930s— 1940s),  the  American 
civil  rights  movement  (late  1950s-1960s),  the 
Polish  Solidarnosc,  and  the  Lithuanian  Sqjiidis 
movements  against  the  Soviet-imposed  communist 
regime  (1980s-1990s).  Events  such  as  the  Orange 
Revolution  in  Ukraine  (2004)  and  the  Rose 
Revolution  in  Georgia  (2005)  can  also  be  termed 
social  movements  because  of  the  accepted  methods 
of  opposition  and  the  huge  popular  involvement. 

Concerning  the  interests  represented  in  social 
movements,  most  of  the  19th-century  movements 
had  their  roots  in  social  classes  or  cultural  com¬ 
munities  (workers,  peasants,  aristocrats,  petty 
bourgeois,  religious  [mostly  Protestant]  communi¬ 
ties,  Whites,  etc.).  They  attempted  to  advance  their 
living  conditions  or  social  status.  Often,  these 
types  of  social  movements  developed  into  political 
parties.  In  the  second  part  of  the  20th  century,  new 
social  movements  (feminist,  prolife,  pro-abortion, 
gay  rights,  antinuclear,  environmental,  etc.)  began 
to  emerge  and  reshape  the  sociopolitical  landscape 
in  many  countries  and  even  internationally.  The 
new  social  movements  present  an  opportunity  to 
reestablish  the  basic  standards,  values,  and  norms 
of  civic  participation.  They  require  continuous 
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commitment  and  personal  activity  in  the  public 
sphere. 

The  public  sphere,  in  Jurgen  Habermas’s  sense, 
is  defined  as  an  area  of  social  life  that  has  the  fol¬ 
lowing  four  primary  governing  principles:  (1)  the 
social  status  of  participants  is  disregarded,  (2)  dis¬ 
cussions  within  the  public  sphere  presuppose  the 
challenging  of  previously  unquestioned  issues, 
(3)  issues  in  the  public  sphere  are  driven  and 
decided  by  rational  arguments,  and  (4)  the  public 
is  inclusive.  Contrary  to  the  private  sphere,  where 
individuals  decide  to  participate  in  accordance 
with  their  private  interests,  and  to  the  public 
authority  sphere,  where  the  actions  of  individuals 
and  institutions  are  shaped  by  formal  decision¬ 
making  procedures  aimed  at  the  implementation 
of  public  policies  and  the  distribution  of  public 
goods,  the  public  sphere  is  a  sphere  suis  generis 
transcending  individual  interests  and  profit  maxi¬ 
mization  and  is  mostly  concerned  with  the  delib¬ 
eration  of  public  issues. 

Civic  Education  as  an  Integral 
Part  of  Civic  Participation 

In  all  these  conceptualizations,  civic  participation 
is  associated  to  some  extent  with  civic  education 
and  the  cognitive  mobilization  of  citizens.  As  early 
as  in  1861,  John  Stuart  Mill  argued  that  democ¬ 
racy  tends  to  enhance  the  moral  qualities  of  citi¬ 
zens  because  when  people  in  some  way  participate 
in  decision  making,  they  have  to  listen  to  others, 
they  are  obliged  to  justify  themselves  to  others, 
and  they  have  to  act  sensibly,  taking  into  account 
the  interests  and  concerns  of  others.  Rational 
choice  theorists  also  agree  that  when  people  find 
themselves  under  circumstances  of  participatory 
democracy,  they  genuinely  consider  the  issues  of 
the  common  good  and  justice.  Hence,  civic  par¬ 
ticipation  tends  to  enhance  the  autonomy,  ratio¬ 
nality,  and  morality  of  participants. 

In  their  analysis  of  survey  data  from  five  coun¬ 
tries,  Almond  and  Verba  considered  education 
(based  on  critical  thinking  and  analysis)  to  be  the 
most  important  factor  creating  attitudes  and  val¬ 
ues  vital  for  a  participant  culture.  Adherents  of 
deliberative  democracy  maintain  that  public  dis¬ 
cussions  represent  the  only  way  available  to 
democracies  for  the  promotion  of  the  intellectual 
and  moral  progress  of  the  citizenry,  which  is  their 


raison  d’etre.  Proponents  of  the  civic  engagement 
model  underline  the  importance  of  capacity  devel¬ 
opment:  empowering  people  to  effectively  partici¬ 
pate  in  public  life,  to  learn  the  routine  practices  of 
cooperation,  and  to  acquire  a  taste  for  public 
affairs.  Yet  it  must  be  acknowledged  that  people’s 
capacities  to  communicate,  organize,  analyze, 
negotiate,  and  take  positions  on  issues  have  to  be 
commensurate  with  adequate  capacities  of  staff  in 
government  institutions  and  other  organizations  in 
order  to  account  for  such  activities. 

Analytically,  the  extent  and  forms  of  civic  par¬ 
ticipation  depend  on  civic  dispositions  (interper¬ 
sonal  trust,  people’s  confidence  to  participate  in 
civic  life,  openness,  tolerance,  and  responsibility  of 
citizens),  civic  knowledge  (people’s  familiarity  with 
political  issues  and  their  social  context;  knowing 
one’s  political,  civil,  social,  and  economic  rights; 
understanding  the  roles,  rights,  and  responsibilities 
of  citizens  and  public  officials),  and  civic  skills  (the 
ability  to  explain,  analyze,  evaluate,  and  defend  a 
position;  using  knowledge  for  informed  participa¬ 
tion).  United  Nations  Development  Programme 
(UNDP),  for  example,  claims  that  sustainability, 
equity,  and  empowerment  are  direct  results  of  civic 
education  leading  to  inclusive  participation. 

Empirical  Measurements 
of  Civic  Participation 

Recognizing  that  democracy  is  the  prerequisite  for 
any  meaningful  civic  participation,  different  forms 
and  intensity  of  civic  participation  can  be  observed 
across  countries,  social  groups,  and  times. 
According  to  Samuel  Barnes  and  Max  Kaase 
(1979),  the  level  of  civic  participation  can  be  seen 
as  a  continuum.  At  one  extreme  are  people  who 
have  no  interest  in  politics  at  all  (they  do  not  vote, 
petition,  or  demonstrate;  do  not  read  newspapers; 
are  not  informed  and  do  not  want  to  be  informed 
about  public  issues;  and  never  volunteer);  at  the 
other  extreme  are  active  citizens  (they  are  inter¬ 
ested  in  politics;  vote;  are  involved  in  political  par¬ 
ties,  trade  unions,  nongovernmental  organizations 
and  civic  associations;  and  use  other  forms  of  con¬ 
tention).  Between  these  two  extremes,  most  citizens 
participate  through  conventional  electoral  events 
(they  get  mobilized  in  special  circumstances);  they 
more  or  less  understand  public  issues  and  occasion¬ 
ally  attempt  to  have  an  impact  on  them. 
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Political  scientists  identify  several  sets  of  incen¬ 
tives  and  disincentives  of  participation.  Pippa 
Norris  presents  an  institutional  account  of  incen¬ 
tives  of  political  participation  (defined  narrowly  as 
voting  in  elections):  a  proportional  electoral  sys¬ 
tem,  not  very  frequent  elections,  small  distances 
between  the  relevant  contenders,  and  great  elec¬ 
toral  choice  of  candidates  (parties).  These  factors 
are  intrinsically  related  to  the  cognitive  conditions 
of  democracy  and  certainly  foster  civic  participa¬ 
tion.  The  institutional  clarity  of  a  political  system 
and  party  polarization  facilitate  social  and  politi¬ 
cal  mobilization,  since  clear  ideological  conflicts 
socialize  citizens  to  comprehend  complex  political 
issues  through  simplified  and  normatively  coher¬ 
ent  discourses. 

Putnam  compiled  the  social  capital  index,  con¬ 
sisting  of  measures  of  participation  in  civic  and 
political  activities,  including  group  membership, 
attending  public  meetings  on  local  and  school 
affairs,  service  as  an  officer  or  committee  member 
of  an  organization,  attending  club  meetings,  vol¬ 
unteer  work  and  community  projects,  home  enter¬ 
taining  and  socializing  with  friends,  social  trust, 
electoral  turnout,  and  the  incidence  of  nonprofit 
organizations.  It  appears  that  these  indicators  of 
formal  and  informal  civic  participation  and  social 
trust  are  strongly  intercorrelated.  Research  shows 
that  local  differences  in  the  social  capital  index 
have  an  effect  on  child  welfare  and  education,  safe 
and  productive  neighborhoods,  economic  prosper¬ 
ity,  health  and  happiness,  and  democratic  citizen¬ 
ship  and  governance  performance. 

Comparative  research  initiated  in  the  1970s, 
such  as  that  by  Inglehart  (1997),  demonstrates  that 
citizens  not  only  want  to  hold  their  leaders  account¬ 
able  through  elections  once  every  4  or  5  years  but 
they  increasingly  intend  to  scrutinize  and  to  influ¬ 
ence  their  leaders  during  their  term  of  office.  It  is  a 
global  evolution  of  continuous  democracy,  combin¬ 
ing  electoral  rights,  freedoms  of  association,  and 
new  kinds  of  contentious  participation. 

Another  line  of  research  considers  the  idea  that 
people  understand  public  issues  and  eventually 
involve  themselves  in  public  affairs  because  of  and 
through  the  images  given  by  political  leaders  and 
the  media,  not  through  personal  experience. 
Therefore,  along  with  institutional  account  and 
civic  education  considerations,  research  on  politi¬ 
cal  communication  is  an  integral  part  of  the  study 


of  civic  participation.  Civic  journalism  dwells  on 
the  principle  that  adequate  coverage  of  politics  is 
important  for  public  life.  The  role  of  mass  media 
is  very  important  for  civic  participation:  When 
journalists,  academics,  human  rights  activists, 
intellectuals,  and  artists  publicly  contest  various 
norms  and  decisions,  they  safeguard  the  rights 
and  freedoms  of  the  entire  community,  benefiting 
from  mobilization  and  vigilance  of  its  most  active 
citizens. 

Theoretical  Tensions  Regarding 
Civic  Participation 

There  is  an  inherent  tension  between  the  universal- 
istic  (communal,  civic  engagement,  social  move¬ 
ment)  and  functional  (institutional)  concepts  of 
civic  participation.  The  universalistic  rhetoric 
claims  that  any  public  action  is  derivative  from  the 
civil  society  and  contributes  to  the  general  interest, 
common  good,  and  justice  and  not  just  to  the 
agenda  of  interest  groups.  The  idea  of  civic  par¬ 
ticipation  assumes  that  the  actions  undertaken  by 
organized  groups  (mobilized  minorities)  can  con¬ 
tribute  to  the  benefit  of  the  entire  community  and 
strengthen  the  administrative  accountability  and 
political  legitimacy  of  the  regime. 

Interest  groups  frequently  use  words  such  as 
citizens,  civil  society,  people,  and  general  public 
(practicing  pluralist,  lobbying,  or  corporatist  strat¬ 
egies).  Yet  most  concrete  proposals  concern  orga¬ 
nized  groups.  Therefore,  in  empirical  terms,  the 
logic  of  negotiation  between  the  institutions  and 
some  limited  groups  remains  at  the  core  of  active 
citizens’  participation. 

Another  problematic  issue  of  civic  participation 
has  to  do  with  its  limitation  to  the  nondecision  (the 
consultative,  predecision)  stage  or  the  deliberative 
part,  although  contemporary  democratic  govern¬ 
ments  claim  to  promote  civic  participation  through¬ 
out  the  whole  policy  chain.  Civic  participation  in 
the  implementation  and  evaluation  of  public  poli¬ 
cies  is  limited.  Considering  that  actual  public  policy 
decisions  usually  are  taken  by  public  authorities  in 
cooperation  with  selected  organized  groups,  other 
social  actors  have  to  face  the  decisions  ex  post. 

The  thrust  for  promotion  of  civic  participation 
is  derived  from  the  liberal  conception  of  democ¬ 
racy.  Openness  and  transparency,  organized  con¬ 
sultation,  better  communication,  decentralization, 
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and  so  on  can  make  institutions  more  accountable. 
However,  these  techniques  are  not  substitutes  for 
the  representative  forms  of  democracy  and  citizen¬ 
ship,  and  they  do  not  encourage  an  average  citizen 
to  become  active,  since  they  require  considerable 
intellectual  and  financial  resources. 

Finally,  the  concept  of  civic  participation  con¬ 
centrates  on  sets  of  functional  groups  with  particu¬ 
lar  ends  (trade  unions,  employers’  organizations, 
professional  associations,  community-based  orga¬ 
nizations,  charities  and  religious  groups,  etc.).  Yet 
civic  participation  research  often  overlooks  the 
general  actors  (e.g.,  political  parties)  who  represent 
a  global  view  at  the  grassroots,  while  only  the  top 
of  their  organizations  belong  to  the  public  author¬ 
ities,  acting  on  behalf  of  the  state. 

Irmina  Matonyte 
Institute  for  Social  Research 
Vilnius,  Lithuania 

See  also  Civic  Culture;  Civil  Society;  Interest  Groups; 
Participation;  Social  Capital;  Social  Movements 
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Civil  Service 


A  civil  service  is  a  system  for  managing  human 
resources  using  mechanisms  with  which  govern¬ 
ments  select,  employ,  and  promote  the  personnel 
who  act  as  public  administrators.  A  civil  service 
system  takes  into  account  the  qualifications  and 
aptitudes  of  employees.  In  other  words,  it  is  the 
institutional  framework  within  which  the  human 
resources  employed  in  the  public  sector  are  orga¬ 
nized  in  a  way  that  allows  governments  to  employ 
the  best  people.  These  employees  work  to  achieve 
the  results  of  projects  put  forward  by  their  gov¬ 
ernments,  regardless  of  their  political  affiliation  or 
party.  The  bases  of  the  civil  service  are  established 
in  statutes,  laws,  or  regulations,  and  for  its  suc¬ 
cessful  operation,  an  autonomous  structure  is 
required — a  body  that  will  oversee  its  strict  opera¬ 
tion  and  the  observation  of  the  norms  that  regu¬ 
late  the  civil  service,  so  that  its  usefulness  contrib¬ 
utes  to  strengthening  the  democratic  values  of  the 
societies  it  serves.  This  entry  describes  the  charac¬ 
teristics  of  career  civil  service,  the  forms  it  may 
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take,  and  the  subsystems  involved  in  its  operation. 
The  place  of  the  civil  service  in  public  administra¬ 
tion  and  the  benefits  associated  with  it  are  then 
examined  in  light  of  evolving  democratic  values. 

Characteristics  of  the  Civil  Service 

Given  its  autonomous  role  as  described  above,  the 
civil  service  is  not  simply  a  series  of  rules  and  tech¬ 
nical  conditions;  rather,  it  should  be  seen  as  a  type 
of  moral  code  for  public  service  that  is  designed  to 
achieve  common  objectives  and  should  be  shared 
by  all  its  functionaries.  Civil  services  are  based  on 
three  fundamental  values  for  a  democratic  public 
administration,  taken  principally  from  the  logic  of 
bureaucratic  organization  proposed  by  Max  Weber: 
(1)  equality,  (2)  merit,  and  (3)  impartiality.  Thus, 
modern  civil  services  are  based  on  the  following: 

1.  Recruitment  is  based  on  the  merits  of  the  civil 
servants  and  not  on  patronage  or  inherited 
posts. 

2.  The  idea  of  equality  allows  anyone  to 
participate  in  the  competition  to  occupy  public 
posts,  provided  that  they  meet  the  prerequisites 
established  for  the  position  in  question;  and, 
once  they  are  in  the  civil  service,  the  system  of 
evaluation  and  any  sanctions  should  be 
comparable  for  all  the  functionaries. 

3.  Impartiality  guarantees  that  the  members  of  the 
civil  service  should  maintain  a  neutral  position 
toward  political  parties,  related  to  an  internal 
logic,  to  grant  all  applicants  the  same 
opportunities  and,  when  dealing  with  the 
public,  provide  the  same  attention  and  quality 
in  the  services  offered  to  all  citizens  regardless 
of  their  gender,  religion,  or  partisan  preference. 

The  concept  of  a  career  civil  service  can  be 
given  at  least  two  meanings.  First,  it  is  used  as  a 
synonym  of  public  bureaucracy,  although  not  all 
the  government  functionaries  necessarily  form 
part  of  it.  The  Latin  American  Center  of 
Administration  for  Development  (CLAD)  main¬ 
tains  in  the  Ibero-American  Charter  of  Public 
Affairs  that,  in  Ibero-America,  the  terms  public 
function  and  civil  service  are  generally  used  as 
synonyms.  Within  this  acceptance,  there  is  an 
enormous  variation  in  the  use  of  the  term,  since 


sometimes  civil  service  may  in  fact  consist  of  any 
or  all  functionaries  who  carry  out  administrative 
work  within  a  government.  The  second  use  given 
to  the  term  civil  service  refers  specifically  to  the 
area  or  areas  of  the  government  in  charge  of  per¬ 
forming  the  recruitment  and  evaluation  of  the 
functionaries. 

Therefore,  not  all  civil  services  are  the  same;  the 
systems  of  recruitment,  sanctions,  and  recognition 
and,  above  all,  the  attributes  and  extent  of  the  civil 
service  vary  according  to  the  political  and  institu¬ 
tional  context  they  are  in.  It  is  important  to  men¬ 
tion  that  within  the  same  civil  service  there  can  be 
various  types  of  labor  relations;  this  means  that 
there  can  be  some  functionaries  appointed  by 
direct  political  nomination  and  others  by  means  of 
a  public  competition. 

Members  of  the  civil  service  are  known  as  career 
public  servants,  and  they  can  be  employed  at  any 
level  of  government  (federal,  provincial,  or  local), 
depending  on  the  size  of  the  civil  service,  or  in  the 
autonomous  institutions,  such  as  electoral  organ¬ 
isms  or  any  of  the  three  powers  of  government: 
executive,  legislative,  or  judicial.  In  some  countries, 
such  as  the  United  Kingdom  (UK),  career  civil  ser¬ 
vants  are  only  those  who  work  for  the  Crown; 
functionaries  who  serve  in  the  lower  levels  of  gov¬ 
ernment  are  not  included.  In  the  United  States,  on 
the  other  hand,  there  are  civil  servants  within  state 
and  local  governments,  and  in  France,  officials  of 
some  public  enterprises  are  also  included. 

Further,  career  civil  services  are  regulated  by 
laws  or  statutes  that  have  different  characteristics 
relative  to  the  rest  of  the  labor  legislation  for  the 
country  in  question;  as  a  result,  these  functionaries 
usually  remain  separate  from  other  public  employ¬ 
ees.  These  differences  become  apparent  in  the 
policies  relating  to  retirement  and  the  establish¬ 
ment  of  pensions,  as  well  as  the  type  and  degree  of 
sanctions,  or  in  the  capacity  to  organize  themselves 
in  unions  and  other  associations. 

As  the  name  suggests,  the  military  are  not 
included  in  career  civil  services,  although  some 
civil  servants  who  work  in  the  areas  of  national 
defense  are  generally  considered  to  be  career  civil 
servants.  There  are  also  functionaries  who  hold 
their  posts  because  they  were  political  nominees  or 
because  they  were  elected  by  popular  vote,  which 
places  them  outside  the  civil  service.  In  the  case  of 
Ireland,  for  example,  government  ministers  and 
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some  of  their  political  advisors  are  excluded  from 
the  civil  service. 

Types  of  Civil  Services 
Closed  Versus  Open  Services 

Conceptually,  a  career  civil  service  can  be  either 
open  or  closed.  A  closed  service  takes  for  granted 
that  persons  entering  the  public  service  plan  to 
develop  their  whole  career  there.  It  is  a  closed  sys¬ 
tem  because  one  can  enter  only  at  the  lowest  levels 
of  the  organizational  pyramid  and  from  there 
gradually  climb  up  to  the  higher  positions.  That  is, 
personnel  tend  to  be  hired  for  the  lower  echelons, 
and  from  there,  mobility  through  the  ranks  may  be 
possible.  To  enter,  one  has  to  have  a  certain  level  of 
education  and  submit  to  a  public  competition  in 
order  to  be  selected;  because  entrance  takes  place  at 
the  lower  levels  of  the  organization,  the  partici¬ 
pants’  experience  has  little  relevance  in  the  selection 
process.  One  great  advantage  of  this  system  is  the 
creation  of  an  ethos  in  the  public  service,  as  the 
functionaries  manage  to  accumulate  important 
knowledge  about  their  organizations.  The  following 
are  the  principal  disadvantages  of  this  model.  The 
first  is  that  upward  mobility  within  the  organization 
is  usually  linked  to  the  seniority  of  a  functionary 
and  not  to  his  or  her  performance.  The  second  is 
that  this  system  limits  the  universe  of  contestants  to 
those  who  have  accumulated  little  or  no  experience; 
finally,  this  model  can  produce  unwanted  results  by 
generating  incorrect  incentives,  and  it  can  even  lead 
to  poor  performance  by  the  functionaries,  by  not 
generating  competitiveness  with  the  outside  labor 
force.  The  oldest  civil  services  are  built  on  the  basis 
of  this  model  and  have  found  it  necessary  to  make 
important  revisions  to  their  schemes,  as  in  the  cases 
of  the  career  civil  services  in  France  and  Germany. 

An  open  civil  service,  in  contrast,  allows  for  pub¬ 
lic  competition  not  only  for  posts  at  the  lower  levels 
but  also  for  the  higher  positions  within  the  organi¬ 
zation.  The  system  is  therefore  open  to  outside 
participants,  although  there  may  be  a  clause  stipu¬ 
lating  that  preference  be  given  to  those  who  already 
belong  to  the  civil  service.  In  this  system,  the  cre¬ 
ation  of  a  spirit  of  public  service  becomes  less  evi¬ 
dent,  since  there  is  the  possibility  that  employees 
from  the  private  sector  can  become  public  function¬ 
aries  and  vice  versa.  Nevertheless,  one  evident 
advantage  of  this  assumption  is  that  by  alternating 


between  private  enterprise  and  the  public  sector  the 
number  of  participants  increases,  allowing  for  a 
broader  universe  for  recruitment  of  the  most  quali¬ 
fied  as  well  as  the  benefit  of  the  transfer  of  knowl¬ 
edge  from  one  sector  to  the  other.  These  employ¬ 
ment  civil  services  are  being  tried  out  in  countries 
that  have  recently  created  or  reformed  their  civil 
services,  such  as  New  Zealand  and  the  UK. 

Executive  Versus  Senior  Service 

Because  of  their  scope  at  the  different  levels  of 
government,  the  civil  services  are  also  classified  as 
executive  or  senior  civil  services.  This  distinction 
implies  the  separation  between  the  authority  of  the 
higher  and  middle  levels  of  command  within  the 
public  sector. 

Operational  Subsystems 

Whatever  model  is  adopted,  the  civil  service  oper¬ 
ates  with  certain  subsystems  that  are  indispens¬ 
able  for  their  proper  functioning.  The  subsystems 
that  make  up  a  “traditional”  civil  service  are  the 
following. 

Recruitment  and  Selection 

The  subsystem  for  recruitment  and  selection 
establishes  the  selection  procedures  in  the  contest 
for  the  position  (if  there  are  to  be  quotas  for  posi¬ 
tive  discrimination  or  not,  i.e.,  if  it  is  mandatory  to 
include  applicants  or  representatives  from  minority 
or  marginal  sectors  of  society).  In  addition,  it 
specifies  whether  the  contest  is  to  be  public  or 
closed,  what  type  of  exams  and  study  guides  are  to 
be  used,  and  what  interviews  are  to  be  held  and 
with  which  of  the  functionaries,  among  others.  It 
also  establishes  the  requisite  profile  and  academic 
qualifications,  experience,  samples  of  work,  and 
personal  references  that  the  candidates  for  the  post 
should  have,  as  well  as  establishing  the  conditions 
under  which  they  would  be  integrated  into  the  pub¬ 
lic  sector,  that  is,  what  type  of  contract  they  would 
subscribe  to,  what  salary  they  would  receive,  and  if 
they  would  be  evaluated  before  being  confirmed  in 
the  post  or  if  they  would  first  be  subjected  to  a  trial 
period.  For  this  procedure,  first,  there  should  be  a 
process  of  revision  of  the  applicants’  curricula 
vitae;  second,  they  should  undergo  examinations 
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and  psychometric  tests;  and  third,  they  should  sub¬ 
mit  to  an  interview,  and  on  occasion,  as  previously 
mentioned,  they  are  given  a  trial  period  in  the  post. 
Ideally,  this  subsystem  should  have  mechanisms 
allowing  rejected  candidates  to  appeal  the  decisions 
made  by  the  recruitment  team,  something  that  is  not 
always  taken  into  consideration.  Moreover,  the  sys¬ 
tem  demands  constant  revision  to  remain  adapted 
to  the  requirements  of  the  different  posts,  revising 
among  other  things  the  pertinence  of  the  evalua¬ 
tions  to  make  sure  they  correspond  to  the  catalog  of 
existing  posts. 

Education  and  Training 

The  subsystem  of  education  and  training  regu¬ 
lates  the  training  of  the  public  servants,  whether 
initial,  continuous,  or  a  combination  of  both. 
Initial  education  consists  of  generating  mechanisms 
that  allow  for  the  development  of  competence  rel¬ 
evant  to  the  post  aspired  to,  which  would  not  have 
been  acquired  during  the  professional  education  of 
the  functionary,  as  well  as  those  that  lead  to  the 
homogenization  of  the  knowledge  of  the  function¬ 
aries  to  facilitate  more  or  less  uniform  performance 
in  the  job.  It  should  be  mentioned  that  not  all  civil 
services  have  subsystems  for  initial  formation,  as 
many  demand  that  this  should  be  provided  by  a 
particular  institution,  which  could  be  internal,  such 
as  the  Civil  Service  College  of  the  UK,  or  by  an 
external  institution  such  as  the  Hautes  Ecoles  or 
the  Ecole  Nationale  d’Administration  in  the  case  of 
France.  Continuous  education,  on  the  other  hand, 
responds  in  some  cases  to  the  demands  of  the  pub¬ 
lic  and  in  others  to  the  modifications  or  updating 
of  the  forms  and  procedures  of  the  different  admin¬ 
istrations  or  the  incorporation  of  new  values  in  the 
face  of  changing  scenarios,  including  in  organiza¬ 
tional  culture.  Recently,  this  subsystem  has  greatly 
benefited  from  the  advances  in  information  tech¬ 
nology,  from  the  possibility  of  attending  courses 
and  presenting  evaluations  at  a  distance.  All  this 
has,  no  doubt,  resulted  in  better  use  being  made  of 
working  hours  and  improvement  in  the  activities 
undertaken  by  public  functionaries.  Although  one 
should  mention  that  in  many  cases  these  resources 
are  not  optimized  because  continuous  education 
has  been  focused  on  nominative  subjects  instead  of 
operative  affairs,  which  detracts  from  the  possibili¬ 
ties  of  immediate  usefulness. 


Evaluation 

The  subsystem  of  evaluation  seeks  to  identify 
the  strengths  and  weaknesses  in  the  public  manage¬ 
ment  from  individual  and  organizational  evalua¬ 
tions.  It  is  necessary  to  decide  in  what  domain  to 
conduct  an  evaluation,  how  evaluation  is  to  be 
carried  out,  what  indicators  will  be  used  and  with 
what  frequency,  and  what  is  to  be  corrected  or 
improved.  This  subsystem  can  be  linked  to  sanc¬ 
tions  applied  to  public  functionaries,  ranging  from 
verbal  reprimands  to  permanent  removal  from  the 
post.  It  also  covers  the  schemes  for  evaluating  per¬ 
formance,  where  there  is  a  relation  between  wages 
or  the  bonuses  of  the  functionaries  in  relation  to 
the  goals  set,  in  the  first  case,  and  to  an  outstand¬ 
ing  performance,  in  the  second.  Linking  the  assess¬ 
ment  of  performance  to  the  budget  may  be  advan¬ 
tageous  because  it  leads  to  increases  in  efficiency 
and  effectiveness,  better  planning,  and  a  clearer 
focus  on  results.  However,  it  can  also  have  its  dis¬ 
advantages,  by  exceeding  the  budgets  of  the  orga¬ 
nizations  or  by  cutting  resources  in  areas  where 
they  are  needed,  without  determining  accurately 
the  causes  of  a  deficient  performance,  which  may 
have  been,  for  example,  due  to  inadequate 
resources.  In  addition,  evaluations  should  become 
an  important  input  for  the  subsystem  of  continu¬ 
ous  education,  since  they  can  identify  the  areas  that 
require  a  greater  effort  in  formation  and  training 
and  the  contents  that  these  options  should  include. 

Promotions  and  Mobility 

The  subsystem  of  promotions  and  mobility 
manages  the  horizontal  and  vertical  mobility  of  the 
civil  servants.  Because  mobility  can  be  horizontal 
as  well  as  vertical  (particularly  in  the  case  of  an 
open  employment  system),  the  exchange  and 
transfer  of  knowledge  from  one  area  to  another 
are  encouraged,  although  this  situation  can  cause 
a  loss  of  experience  within  organizations.  Mobility 
can  occur  both  in  an  open  system,  where  anyone 
can  have  access  to  a  post,  and  in  the  so-called  elit¬ 
ist  systems,  in  which  only  graduates  from  certain 
universities  or  careers  can  occupy  positions  at  the 
highest  levels.  In  the  more  traditional  systems,  pro¬ 
motion  of  functionaries  has  been  related  to  senior¬ 
ity  in  the  post  and  within  the  schemes  of  the  civil 
service;  however,  as  a  result  of  reforms  arising 
from  the  adoption  of  ideas  associated  with  the  new 
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public  management  approach,  priority  is  now 
sometimes  given  to  the  performance  of  functionar¬ 
ies  rather  than  to  the  length  of  time  that  they  have 
held  the  post. 

Removal  From  Office 

Under  the  traditional  principles  of  administra¬ 
tion  of  the  subsystem  for  removal  from  office,  a 
functionary  would  leave  office  only  to  receive  a 
pension.  However,  the  stability  that  this  immobil¬ 
ity  gave  the  functionaries  is  a  negative  performance 
incentive.  Current  tendencies  indicate  that  removal 
from  office  can  occur  for  at  least  two  reasons: 
(1)  the  civil  servant’s  retirement  or  (2)  the  tempo¬ 
rary  removal  or  separation  from  office  due  to  the 
poor  performance  of  the  public  servant,  a  situation 
that  should  be  very  accurately  documented. 

Public  Administrations 
and  Career  Civil  Service 

The  successful  implementation  of  a  career  civil 
service  is  indispensable  for  improving  and  modern¬ 
izing  public  administrations,  since  this  allows  for 
greater  continuity  in  public  policies  and  also 
increases  the  efficiency  and  efficacy  of  the  adminis¬ 
tration.  In  addition,  to  set  in  motion  a  professional 
civil  service  requires  prior  institutional  action  to 
integrate  a  catalog  of  posts  that  allows  complete 
restructuring  of  public  administration  from  within 
and  in  the  medium  and  long  terms.  Detailed  job 
descriptions  of  each  post  also  favors  an  increase  in 
accountability  within  each  organization. 

The  career  civil  services  are  remarkably  effective 
in  reducing  corruption  and  increasing  the  equity  of 
the  public  services,  since  functionaries  know  that 
permanence  in  their  posts  depends  on  their  perfor¬ 
mance  and  not  on  electoral  cycles,  political  move¬ 
ments,  or  arbitrary  dismissals,  all  of  which 
discourage  them  from  creating  and  attending 
to  specific  clienteles.  The  autonomy  that  the 
administration  acquires  facilitates  continuity  in  the 
application  of  programs  and  laws,  because  the 
administration  is  evaluated  by  the  degree  of  fulfill¬ 
ment  of  the  programmed  objectives,  and  it  also 
minimizes  the  influence  of  the  politicians  on  the 
implementation  of  programs. 

Similarly,  adequate  administration  of  human 
resources  inside  government  results  in  enhanced 


transparency,  as  much  for  individuals  as  for  orga¬ 
nizations,  since  beginning  with  the  integration  of  a 
catalog  of  posts  and  a  clear  definition  of  organiza¬ 
tional  structures,  one  can  know  who  is  responsible 
for  the  results  of  programs  and  government  poli¬ 
cies.  At  first  glance,  civil  services  may  appear  rigid; 
however,  when  their  schemes  are  well  integrated, 
they  can  even  give  greater  flexibility  to  the  public 
administration  since  it  is  easier  to  make  quick  adap¬ 
tations  to  the  demands  of  changing  circumstances. 

The  professionalization  of  the  public  adminis¬ 
tration  can  also  result  in  greater  confidence  by 
citizens.  First,  by  legitimating  the  government 
through  more  efficient  administration,  and  second, 
by  being  more  flexible,  and  thus  more  successful, 
in  the  interaction  with  average  citizens.  In  fact,  a 
civil  service  system  also  tends  to  foster  values 
within  public  administration  such  as  impartiality, 
awareness  of  financial  limitations,  as  well  as  legal¬ 
ity  and  integrity  of  the  government  and  public 
officers.  These  values  contribute  to  the  collective 
construction  of  an  ethos  for  the  public  sector, 
indispensable  for  the  identification  of  the  person¬ 
nel  as  part  of  an  organization  that  aims  at  the 
collective,  consecutive  pursuit  of  objectives,  pre¬ 
serving  the  institutional  memory  and  promoting  its 
prestige  outside  the  organization.  The  public  sec¬ 
tor  ethos  also  has  a  function  within  the  public 
administration:  It  enhances  communication 
between  the  various  members  of  the  organization 
and  can  direct  efforts  toward  common  objectives. 
In  this  way,  communication  and  coordination 
between  bureaucratic  organizations  can  improve. 
Moreover,  career  professional  services  within  the 
civil  service  can  build  up  skills  and  expertise  within 
the  public  administration,  which,  in  the  long  run, 
will  enhance  not  only  the  government  institutions 
but  also  a  solid  democratic  system. 

From  their  perspective,  public  functionaries  are 
also  directly  benefited  by  forming  a  part  of  civil 
service,  since  apart  from  the  obvious  advantage  of 
not  being  arbitrarily  removed  from  their  offices, 
they  are  subjected  to  clear  regulations  and  specific 
responsibilities,  and  they  receive  a  continuous  edu¬ 
cation  that  allows  them  to  keep  their  knowledge 
up-to-date  and  to  eventually  make  a  career  within 
the  public  administration,  which  is  useful  socially. 

The  nature  and  usefulness  of  the  schemes  for  a 
career  civil  service  have  been  the  subject  of  intense 
debate  and  important  processes  of  adjustment 
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because,  among  other  reasons,  they  are  recognized 
as  a  fundamental  element  in  the  processes  of  reno¬ 
vation  and  modernization  of  public  administra¬ 
tions.  A  career  civil  service  plays  a  central  role  in 
the  transformations  favored  by  democratic 
schemes,  such  as  commitments  to  making  public 
administration  more  transparent  and  improving 
the  mechanisms  for  public  accountability  and  the 
management  of  public  resources.  Professionalizing 
the  public  function  is  an  indispensable  condition 
for  national  development. 

Maria  del  Carmen  Pardo 
El  Colegio  de  Mexico 
Mexico  City,  Mexico 

See  also  Administration;  Administration  Theory; 
Autonomy,  Administrative;  Development 
Administration;  Public  Employment 
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Civil  Society 


From  the  perspective  of  political  sociology,  the 
concept  of  civil  society  refers  to  a  certain  type  of 
society.  Its  basic  characteristic  is  the  attitudinal  capa¬ 
bility  of  its  members,  that  is,  citizens,  to  actively 
organize  themselves  to  pursue  certain  (common) 
goals,  within  a  framework  of  specific  formal  institu¬ 
tions.  In  detailed  definitions,  depending  on  particu¬ 
lar  research  goals,  civil  society  may  therefore  be 
defined  in  either  attitudinal  or  institutional  terms. 
This  entry  discusses  their  perspectives,  their  origins, 
and  their  applications  to  the  contemporary  world. 

In  the  attitudinal  perspective,  the  pivotal  concept 
is  that  of  citizenship  as  an  individual’s  attitude 
toward  the  entire  political  collectivity  and  other 
individuals,  as  well  as  to  institutions  and  proce¬ 
dures.  The  basis  of  this  attitude  is  seen  as  the  indi¬ 
vidual’s  internalization  of  the  rights  and  obligations 
associated  with  membership  in  the  collectivity,  and 
this  attitude  constitutes  the  foundation  of  the  citi¬ 
zen’s  role. 

Elements  of  citizenship  thus  defined  include,  for 
instance,  political  identity  and  self-awareness,  the 
character  and  sources  of  agency,  law  and  order, 
attitudes  toward  recognized  authorities,  attitudes 
toward  pluralism  and  diversity,  the  level  of  open¬ 
ness  toward  the  terms  of  membership  in  a  political 
community,  and,  finally,  convictions  regarding  the 
nature  of  civil  rights  (their  universality  or  particu¬ 
larity).  From  this  perspective,  a  civil  society  is  a 
state  of  democratic  political  culture,  a  conglomer¬ 
ate  of  norms  and  values  typical  of  a  community  of 
citizens  who  are  free  and  equal  before  the  law, 
which  they  shape  and  develop  through  their  active 
participation  in  a  public  life. 

In  the  institutional  approach,  civil  society  is 
seen  as  an  arena  of  citizens’  activity  embedded  in 
an  institutional  framework,  a  structurally  described 
sphere  of  social  and  individual  autonomy  that 
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extends  between  the  state  and  private  life.  This 
institutional  framework  includes,  among  others 
(constitutionally  guaranteed),  civil  rights  and  free¬ 
doms  (in  particular,  freedom  of  speech,  assembly, 
and  association  and  the  right  to  vote),  the  rules  for 
obtaining  citizenship,  the  rule  of  law,  and  welfare 
state  institutions.  In  this  context,  a  civil  society 
usually  stands  for  the  totality  of  institutions,  orga¬ 
nizations,  and  associations  operating  in  the  public 
sphere  with  relative  autonomy  from  the  state, 
established  at  the  grassroots  level  and  usually  char¬ 
acterized  by  voluntary  member  participation. 

No  matter  how  it  is  defined,  the  concept  of  civil 
society  is  inextricably  intertwined  with  the  con¬ 
cepts  of  collective  and  individual  autonomy  and 
political  sovereignty.  To  that  extent,  it  also  con¬ 
tains,  at  its  deepest  level,  a  strong  normative  pos¬ 
tulate  on  how  an  autonomous  community  of  free 
and  equal  citizens  should  be  structured.  This  nor¬ 
mative  aspect  is  manifested  in  various  theoretical 
approaches  that  present  diverse  solutions  regard¬ 
ing  the  relations  between  the  collective  and  the 
individual  and  between  the  private  and  the  public. 
The  opposing  positions  on  that  matter  are  espe¬ 
cially  clearly  displayed  in  the  disagreement  between 
classical  republican  (or  today,  communitarian) 
concepts  and  liberal  theories,  which  can  in  fact  be 
described  as  a  normative  controversy  over  civil 
society.  Theories  of  civil  society,  articulating  the 
ideal  of  a  free  and  self-governed  community, 
inevitably  refer  to  some  kind  of  democratic  social 
order.  In  the  Tocquevillean  perspective,  civil  soci¬ 
ety  is  a  space  for  associational  life  and  a  condition 
for  preserving  a  democratic  political  culture  and 
truly  free  democratic  institutions.  Hence,  this  con¬ 
cept  is  cognitively  useful  especially  in  analyzing 
both  the  theory  and  the  practice  of  democracy.  It 
also  gains  a  special  importance  in  the  era  of  global¬ 
ization,  in  the  modern  world  of  cross-border  struc¬ 
tures,  in  the  slogan  of  a  “global  civil  society.” 

Civil  Society:  Classical  Roots 
and  Modern  Developments 

The  idea  of  a  civil  society  is  deeply  rooted  in 
European  culture.  It  goes  back  to  ancient  Greek 
and  Roman  thought.  To  Aristotle,  a  civil  society 
( koinonia  politike)  was  an  ethical  and  political 
community  of  free  citizens  pursuing  and  achieving 
full  humanity  through  active  participation  in  the 
life  of  the  polis — its  tradition,  law,  morality,  and 


interest.  The  ancient  Greek  agora  as  a  space  of 
communication  for  citizens,  a  forum  for  exchange 
of  arguments,  for  sharing  visions  of  the  common 
good  and  taking  communal  decisions,  was  the 
proper  space  of  cogovernance  and,  hence,  the 
realm  of  politics.  What  the  Roman  tradition  con¬ 
tributed  to  the  idea  of  civil  society  was  the  concept 
of  natural  law,  discovered  through  rational  thought. 
According  to  Cicero,  the  civil  society  ( societas  civi- 
lis)  was  the  highest  kind  of  social  order,  reflecting 
the  eternal  and  unchangeable  natural  law,  a  com¬ 
munity  of  people  who  recognize  the  same  law  and 
work  together  for  the  common  good  (res  publica). 
A  continuation  of  such  republican  thought  on  the 
eve  of  modern  times  can  be  found  in  the  writings  of 
the  Renaissance  Italian  thinkers.  For  Niccolo 
Machiavelli,  the  civil  society  was  a  political  com¬ 
munity  bound  by  efficient  governance  that  ensured 
its  stability,  internal  order,  harmony,  security,  and 
glory.  At  the  same  time,  this  was  a  space  for  politi¬ 
cal  virtue  to  emerge  and  be  pursued.  Despite  differ¬ 
ences,  these  classical  republican  perspectives  have 
in  common  the  perception  of  civic  community  as 
valuable  in  itself,  defining  a  good  to  which  indi¬ 
viduals’  particular  interests  must  be  subordinated. 
Here,  the  freedom  and  sovereignty  of  a  political 
community  as  the  common  good  is  the  prerequisite 
for  an  individual  citizen’s  freedom,  which  is  only 
achievable  through  membership  in  this  political 
community.  What  regulates  those  freedoms  is  the 
authority  of  the  law  and  political  institutions  as 
well  as  shared  values  and  customs.  This  common 
good  is  discovered,  identified,  and,  as  a  conse¬ 
quence,  pursued  through  citizens’  participation  in 
public  life.  In  this  classical  republican  perspective, 
civil  society  is  therefore  a  moral  and  political  com¬ 
munity  pursuing  a  common  good,  where  its  citi¬ 
zens  realize  their  freedom  by  exercising  their  rights, 
fulfilling  their  duties,  and  exhibiting  civic  virtues. 

Modern  times  brought  a  dramatic  reversal  of  this 
relationship.  The  revolution  started  with  rational¬ 
ism  as  a  philosophical  trend  that  regarded  each 
human  being  as  capable  of  rational  judgment  and 
established  the  individual,  rather  than  the  commu¬ 
nity,  as  the  source  of  morality.  This  thought  was 
followed  up  by  the  pioneers  of  the  Enlightenment 
and  liberalism,  who  held  equal  rights  of  individuals 
(e.g.,  habeas  corpus,  private  property)  to  be  natural 
and  inalienable.  The  primordiality  of  those  rights 
vis-a-vis  the  political  assembly  is  expressed  in  the 
concept  of  the  “social  contract,”  according  to  which 
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a  voluntary  agreement  of  free  and  equal  individuals 
is  established  to  secure  their  natural  rights  against 
the  abuse  of  power.  The  despotism  of  nation-states 
that  emerged  after  the  Westphalian  Treaty  of  1648 
often  gave  rise  to  a  negative  definition  of  a  citizen’s 
freedom  as  a  sense  of  security  (founded  on  a  law 
that  restricts  the  arbitrariness  of  other  political 
actors).  Subsequent  political  revolutions  in  England, 
the  United  States,  and  France  adopted  and  strength¬ 
ened  this  modern  perspective  of  an  individual  as 
holding  natural  rights  and  freedoms  that  a  state,  as 
a  legal  institution  established  by  a  voluntary  politi¬ 
cal  assembly  of  its  citizens,  must  safeguard. 

The  classic  liberal  thinkers  believed  in  funda¬ 
mental  individual  rights  that  include  property 
rights  and  the  freedom  to  dispose  of  one’s  prop¬ 
erty.  According  to  liberalism,  the  basic  domain  of 
freedom  is  the  market,  through  which  individuals 
can  most  efficiently  pursue  their  particular  inter¬ 
ests.  The  clash  of  those  individual  interests  is  seen 
as  a  universal  model  shaping  the  dynamics  of  the 
free  social  space.  Hence,  in  this  liberal  perspective, 
civil  society  is  treated  as  a  universe  of  free  individu¬ 
als  who  pursue  their  own  goals  in  their  own  way 
while  respecting  agreed-on  principles.  It  is  a  sphere 
of  independence  and  freedom  extending  between 
the  family  and  the  state,  where  a  spontaneous  self¬ 
organization  of  individuals  emerges,  making  it 
pluralist  and  heterogeneous  (politically,  economi¬ 
cally,  and  culturally).  What  immanently  binds  this 
social  space  of  diversity  and  conflicting  interests  is, 
according  to  the  followers  of  the  Scottish 
Enlightenment,  either  the  inherent  social  impulse — 
the  friendship  and  altruism  of  human  beings — or 
conformism,  vanity,  and  a  desire  to  please  others. 
Even  though  distinguished  from  the  institution  of 
the  state,  the  sphere  of  civil  society  remains  in  a 
constructive  relationship  to  it,  as  citizens  use  it  to 
decide  with  others  on  matters  of  shared  interest. 

This  liberal  perspective  on  civil  society  became  a 
negative  point  of  reference  for  thinkers  such  as 
Georg  Hegel  and  Karl  Marx,  who  did  not  share  its 
optimism  as  to  the  immanent  powers  regulating  the 
conflicts  of  particular  interests  and  egotisms.  Both 
Hegel  and  Marx  explicitly  equated  the  civil  society 
(biirgerliche  Gesellschaft )  with  the  economic  soci¬ 
ety  and  the  private  sphere,  with  which  they  con¬ 
trasted  the  public  sphere  as  the  only  one  capable  of 
curbing  conflicts  and  disparities  within  civil  society. 
According  to  Hegel,  the  embodiment  and  guarantor 
of  this  superindividual  order  was  the  state.  For 


Marx,  it  was  a  political  civil  society  that  can  be  cre¬ 
ated  after  the  abolition  of  private  property,  which  is 
the  source  of  disparities,  domination,  and  social 
conflicts.  It  was  with  reference  to  these  two  thinkers 
that  Antonio  Gramsci  developed  his  concept  of  civil 
society  in  the  first  half  of  the  20th  century. 

However,  from  the  mid-  19th  century  until  the 
1970s,  the  interest  in  the  concept  of  civil  society 
virtually  disappeared,  replaced  by  a  more  detailed 
reflection  on  the  institutional  and  normative  frame¬ 
work  of  the  democratic  order.  The  renaissance  of 
this  idea  in  the  last  decades  of  the  20th  century  is 
related  to  the  birth  of  opposition  movements  in 
Latin  America  and  Central  and  Eastern  Europe, 
where  self-organizing  social  and  civic  structures 
operated  beyond  the  state  or  in  opposition  to  it. 
From  the  perspective  of  mainly  Western  analysts, 
the  category  of  a  civil  society  seemed  to  be  an 
appropriate  tool  to  describe  them.  Civic  move¬ 
ments  in  authoritarian  countries  were  undoubtedly 
a  major  impulse  for  the  renaissance  of  interest  in 
the  category  of  civil  society.  It  was  all  the  more 
important,  however,  that  they  occurred  concur¬ 
rently  with  social  and  political  phenomena  and 
changes  taking  place  in  Western  societies  them¬ 
selves  (e.g.,  the  welfare  state  crisis,  new  social 
movements).  In  the  public  debate,  criticism  was 
directed  either  at  the  excessive  welfare  state  or  at 
the  alienation  of  representative  democracies  from 
civil  society.  The  advocates  of  social  and  economic 
mobilization  or  social  and  political  democratiza¬ 
tion,  of  increasing  civic  participation  and  involve¬ 
ment  in  public  life,  and  of  combating  alienation 
and  individualism  quite  often  made  reference  to  the 
classical  republican  visions  of  the  truly  civil  society. 
At  the  same  time,  they  forced  political  thinkers  to 
revisit  this  issue  in  the  context  of  the  modern  era. 

With  respect  to  the  concept  of  civil  society,  the 
most  interesting  aspect  of  this  debate  is  the  dispute 
between  liberals  and  communitarians,  pursued 
with  varying  intensity  over  the  past  30  years  of  the 
20th  century.  Even  though  it  is  rooted  in  two  dif¬ 
ferent  visions  of  the  individual-collectivity  relation¬ 
ship  and  the  factors  constituting  the  Self,  this  dis¬ 
pute  finds  its  practical  expression  mostly  in  the 
different  understanding  of  a  civil  society  and  its 
relationship  to  the  democratic  state.  Putting  it  as 
simply  as  possible,  for  liberals,  a  state  is  one  of  the 
many  voluntary  assemblies  of  free  and  equal  citi¬ 
zens  who  agree  on  the  principles  of  justice,  which 
they  can  reflect  on,  question,  and  renegotiate.  Its 


262  Civil  Society 


basic  role  is  to  safeguard  the  agreed  procedures. 
Those  procedures  (e.g.,  the  rule  of  law)  and  the 
results  of  their  application  are  what  liberals  are 
inclined  to  consider  a  “procedural”  common  good. 
A  state,  largely  restricted  to  safeguarding  the  agreed 
rules,  would  open  even  more  space  for  the  develop¬ 
ment  of  a  civil  society.  As  a  guardian  of  procedures, 
it  should  remain  neutral  to  various  goals,  visions, 
and  ideals  of  good  life,  which  are  to  emerge  and 
compete  with  one  another  within  the  very  space  of 
the  civil  society,  in  the  “free  marketplace  of  ideas.” 
It  is  civil  society  rather  than  the  state  that  selects 
and  orders  these  procedures.  In  this  sense,  a  liberal 
civil  society,  as  distinct  from  the  state,  remains  in  a 
constructive  relationship  with  the  state  but  is  more 
significant  than  the  latter.  A  political  society  is 
important  but  has  no  absolute  priority. 

For  communitarians,  by  contrast,  the  state  is  the 
most  powerful  political  community,  defined  as  a 
particular  “community  of  experience”  in  which  a 
shared  past  and  future  are  intertwined.  The  com¬ 
munity’s  tradition  is  its  substantial  common  good, 
and  its  maintenance  is  a  common  goal  and  endeavor. 
Active  participation  and  involvement  in  the  com¬ 
munity’s  life  leads  to  identification  with  its  goals 
and  generates  a  sense  of  solidarity,  so  promoting 
the  culture  of  participation  is  a  value  worth  sup¬ 
porting.  A  state  may  remain  neutral  on  various 
matters  but  not  on  the  matter  of  patriotism.  Only 
by  discovering  the  principles,  ties,  and  loyalties  that 
bind  the  community  together  can  individuals  fully 
understand  their  social  and  civic  roles.  Civil  society 
is  not  a  chaotic  set  of  competing  or  cooperating 
associations  but  an  entity  united  by  a  shared  self- 
awareness  of  its  own  history,  tradition,  culture, 
institutions,  and  achievements;  the  goals  and  con¬ 
cepts  presented  in  the  “free  market  of  ideas”  must 
be  intelligible  to  everyone  and  must  be  embedded  in 
the  culture  or  tradition  of  the  community.  Thus,  the 
state  is  entitled  and  obliged  to  support  a  common 
understanding,  by  following  the  policy  of  prefer¬ 
ence  for  the  common  good  and  establishing  a  hier¬ 
archy  of  goals  and  values  to  inspire  its  members. 
This  policy  of  preferring  a  substantially  understood 
common  good  over  the  policy  of  neutrality  makes 
the  state  the  ultimate  entity.  In  this  sense,  the  state 
is  more  important  than  the  civil  society,  or  to  put  it 
in  other  words,  a  civil  society  is  tantamount  to 
both  the  political  community  and  the  state. 

This  controversy  over  ideas,  theories,  or  even  ide¬ 
ologies  can  often  be  found  in  daily  life  and  disputes 


in  liberal  democracies.  It  refers,  among  other  things, 
to  the  practical  dimension  of  the  relationship  between 
the  state  and  the  civil  society,  in  particular  in  differ¬ 
ent  models  of  cooperation  between  public  adminis¬ 
trative  structures  and  the  so-called  third  sector. 

State  and  Civil  Society 

Analytically,  we  can  distinguish  three  types  of  rela¬ 
tionships  between  the  civil  society  and  a  democratic 
state.  In  the  first,  a  civil  society  is  a  complement  to 
the  state.  In  the  second,  the  state  is  a  complement 
to  civil  society.  In  the  third,  the  state  and  civil  soci¬ 
ety  are  in  an  antagonistic  relationship.  The  first 
type  of  relationship  emerges  where  the  state  is  the 
major  agent  for  the  redistribution  of  national 
wealth  and  takes  responsibility  for  providing  and 
operating  many  spheres  of  collective  life  (e.g.,  edu¬ 
cation,  health  care,  pensions).  In  this  situation,  civic 
organizations  enter  only  those  areas  of  collective 
life  that  the  state  does  not  provide  with  sufficient 
social  services.  Such  a  relationship  can  be  observed 
in  welfare  states  characterized  by  an  extended 
sphere  of  social  benefits,  and  its  theoretical  grounds 
can  be  found  in  communitarian  concepts.  An  exces¬ 
sive  welfare  state  may,  however,  lead  to  the 
“learned  helplessness”  syndrome  (or  dependency 
on  social  benefits),  which  converts  some  citizens 
into  the  welfare  state’s  clients.  This  weakens  the 
vigor  of  civil  society. 

The  second  type  of  relationship  refers  to  the  situ¬ 
ation  where  the  state’s  responsibilities  are  restricted 
to  the  minimum  (e.g.,  police,  army,  courts,  diplo¬ 
macy).  The  remaining  part  of  public  life  is  taken 
care  of  by  grassroots  citizens’  groups.  Such  a  rela¬ 
tionship  stems  from  liberal  concepts  of  citizenship 
and  state.  It  is  worth  noting  that  where  a  state’s 
noninvolvement  is  too  restricted,  social  inequalities 
may  increase,  which  may  lead  to  the  marginaliza¬ 
tion  or  exclusion  of  some  segments  of  society.  This, 
in  turn,  reduces  the  vitality  of  a  civil  society  because 
for  marginalized  individuals,  dealing  with  public 
matters  ceases  to  be  of  importance,  having  to  focus 
on  survival.  The  third  type  of  relationship  echoes 
the  relation  between  individuals  and  the  authoritar¬ 
ian  state.  It  can  be  found  predominantly  in  young 
democracies  as  this  antagonistic  relationship 
between  citizens  and  the  nondemocratic  state  is  still 
present  in  the  collective  memory  and  in  common 
cognitive  structures.  In  this  case,  various  organiza¬ 
tions  of  civil  society  usually  have  little  trust  in  the 
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state  institutions  (and  vice  versa).  As  a  consequence, 
conflict  prevails  over  cooperation.  Over  time,  espe¬ 
cially  when  a  young  democracy  has  completed  the 
consolidation  phase,  this  generalized  antagonism 
may  disappear,  replaced  by  civic  defiance  focused 
on  certain  acts  of  a  democratic  state. 

Cognitive  Usefulness  of  the  Concept 

The  usefulness  of  the  concept  of  civil  society 
changes  over  time.  There  are  periods  where 
researchers’  attention  is  drawn  to  other  areas,  with 
the  civil  society  itself  being  treated  as  secondary  to 
some  more  fundamental  issues  (e.g.,  the  rule  of 
law,  functioning  of  democratic  state  institutions, 
party  system  development  and  functioning).  In 
such  cases,  sociological  or  political  science  narra¬ 
tives  push  the  concept  of  “civil  society”  to  the 
background,  if  they  refer  to  it  at  all.  This  happens 
where  the  subject  of  research  is  a  well-consolidated 
democracy  (at  least  in  some  of  its  aspects). 

There  are  times,  however,  where  researchers 
rediscover  the  cognitive  usefulness  of  the  concept. 
This  takes  place,  for  example,  where  a  stable  func¬ 
tioning  and  reproduction  of  the  democratic  order  is 
disrupted  and  the  very  existence  of  democracy  is 
jeopardized.  This  is  also  the  case  when  new  social 
movements  emerge  in  the  public  sphere,  contesting 
a  particular  part  of  reality  as,  for  example,  ecologi¬ 
cal  movements  that  have  had  an  impact  on  political 
decision  makers  and  made  the  public  opinion  sensi¬ 
tive  to  the  consequences  of  environmental  devasta¬ 
tion.  However,  the  revival  of  this  concept  can  be 
seen  most  clearly  in  times  of  great  historical  revolu¬ 
tions,  leading  to  a  democratic  order  that  emerged 
after  the  breakdown  of  authoritarian  systems,  as 
was  the  case  in  the  1980s  and  1990s. 

In  different  theories  of  democracy,  the  analyti¬ 
cal  usefulness  of  the  concept  of  civil  society  varies. 
Generally,  in  theories  that  focus  solely  on  the 
procedural  dimension  of  democracy,  where  the 
problem  of  civic  participation  in  public  life  is  of 
lesser  importance,  the  usefulness  of  this  concept  is 
less  than  in  theories  focusing  on  substantial 
aspects.  In  the  former  approach,  a  civil  society  is 
reduced  to  the  so-called  third  sector,  understood 
as  the  totality  of  voluntary  nongovernmental 
associations  that  articulate  various  interests  in  the 
public  sphere.  In  the  latter  approach,  the  useful¬ 
ness  of  civil  society  is  greater  as  the  activity  of 
citizens  in  the  public  realm  determines  the  vitality 


of  democracy.  Examples  of  such  a  perspective  are 
the  associational  democracy  model  as  well  as  the 
deliberative  democracy  model. 

Civil  Society  and  Globalization 

In  the  globalized  world  of  today,  with  the  progress 
of  technologies,  transport,  and,  most  of  all,  means 
of  mass  communication  that  cover  the  entire 
world,  problems  that  used  to  be  of  local  impact 
(humanitarian  disasters,  epidemics,  financial  cri¬ 
ses)  now  turn  into  global  issues.  These  problems 
are  addressed  by  governments  of  nation-states, 
international  economic  and  political  organizations 
(e.g.,  the  United  Nations,  the  World  Bank,  or  the 
World  Trade  Organization),  as  well  as  private 
financial  or  industrial  multinational  corporations. 
The  global  governance  of  transnational  decision¬ 
making  structures  creates  the  need  to  ensure  civil 
scrutiny  and  to  build  a  “global  civil  society.” 

However,  using  the  concept  of  civil  society  in  a 
global  context  creates  both  theoretical  and  practi¬ 
cal  difficulties.  It  is  true  that  some  institutions  of 
civil  society  are  becoming  globalized.  The  1980s 
were  a  breakthrough  decade  in  this  respect,  when 
some  civil  initiatives  (e.g.,  human  rights  and  paci¬ 
fist  movements)  transcended  the  borders  of  the 
nation-states.  They  referred  to  global  problems  and 
demanded  global  solutions  as  well.  Some  of  those 
initiatives  entered  the  institutional  phase,  which 
resulted  in  the  emergence  of  nongovernmental  civic 
institutions  of  global  scope  (e.g.,  Greenpeace, 
T ransparency  International,  Amnesty  International ) . 
However,  the  conclusion  that  we  are  dealing  with 
the  beginnings  of  a  global  civil  society  seems  to  be 
premature  for  a  number  of  reasons. 

First,  a  civil  society  consists  of  citizens  to  whom 
authorities,  legitimated  by  democratic  procedures 
(e.g.,  elections),  are  accountable  for  their  decisions. 
Global  governance  cannot  be  equated  with  political 
power  defined  in  this  way;  because  there  is  no 
global  accountability  procedure,  this  relationship 
cannot  be  directly  transposed  to  the  global  level. 
Second,  there  are  no  rules  to  legitimize  the  actions 
of  global  civil  structures,  as  a  result  of  which  their 
activity  faces  claims  of  usurpation.  Third,  the  emer¬ 
gence  of  “global  civil  society”  would  require  a  com¬ 
mon  normative  base,  on  which  even  contradictory 
civil  initiatives  could  meet  and  operate  in  mutual 
respect.  In  a  global  dimension,  there  is  no  such 
common  cultural  denominator.  Even  the  concept  of 
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basic  human  rights  is  not  generally  recognized  and 
is  sometimes  interpreted  as  a  product  of  the 
Western  culture  and  a  tool  for  its  expansion. 

The  importance  of  global  governance  is  rising, 
as  is  that  of  global  grassroots  civil  initiatives. 
However,  because  there  is  no  “social  contract”  on 
the  global  level,  both  institutions  that  execute 
global  governance  and  global  civil  organizations 
suffer  from  legitimacy  deficits.  Therefore,  their 
relationship  is  usually  antagonistic.  Hence,  the 
concept  of  civil  society  may  again  turn  out  to  be  a 
useful  analytical  category  to  describe  further  devel¬ 
opments  of  globalization. 
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Civil  War 


Civil  war  can  be  defined  as  a  violent  conflict  that 
pits  states  against  one  or  more  organized  nonstate 
actors  on  their  territory.  This  distinguishes  civil 
wars  from  interstate  conflict  (where  states  fight 
other  states),  violent  conflicts  or  riots  not  involving 
the  state  (sometimes  labeled  intercommunal  con¬ 
flicts),  and  state  repression  against  individuals  who 
cannot  be  considered  an  organized  or  cohesive 
group,  including  genocides  or  similar  violence  by 
nonstate  actors,  such  as  terrorism  or  violent  crime. 
This  entry  first  discusses  the  various  kinds  of  civil 
war  and  reviews  the  key  patterns  and  trends  in  such 
conflicts.  It  then  presents  theories  on  the  causes  of 
civil  war  and  the  current  research  agenda. 

The  above  conceptual  definition  of  civil  war 
clearly  encompasses  many  different  forms  of  con¬ 
flict.  Some  analysts  distinguish  between  civil  wars 
where  insurgents  seek  territorial  secession  or  auton¬ 
omy  and  conflicts  where  insurgents  aim  for  control 
of  the  central  government.  Conflicts  over  govern¬ 
ment  control  may  involve  insurgents  originating 
from  within  the  center  or  state  apparatus,  as  in 
military  coups,  or  challengers  may  come  from  the 
periphery ,  or  outside  the  political  establishment. 
Others  separate  between  ethnic  civil  wars,  where 
the  insurgents  and  individuals  in  control  of  the  cen¬ 
tral  government  have  separate  ethnic  identities,  and 
revolutionary  conflicts,  where  insurgents  aim  for 
major  social  transformation.  Colonial  conflicts  are 
sometimes  singled  out  as  a  set  distinct  from  civil 
wars  on  states’  core  territory.  However,  even  if 
these  suggested  categories  may  be  conceptually  dis¬ 
tinct  types  and  could  suggest  different  causes  and 
dynamics,  a  given  civil  war  will  often  combine  sev¬ 
eral  elements.  For  example,  insurgencies  may  be 
both  ethnic  and  ideologically  based,  and  aims  can 
shift  over  time  from  secession  for  a  limited  territory 
to  controlling  the  entire  state.  Thus,  most  research¬ 
ers  do  not  impose  such  strict  a  priori  distinctions 
between  varieties  to  analyze  these  as  separate  types 
but  instead  study  civil  war  as  a  common  class  of 
conflict. 

Patterns  and  Trends  in  Civil  War 

Even  though  the  modern  state  sometimes  is  defined 
by  its  alleged  monopoly  on  violence,  armed 
challenges  to  state  authority  are  as  old  as  states 


Civil  War  265 


themselves.  Despite  numerous  historical  accounts 
of  civil  wars,  there  are  few  systematic  data  sources 
on  civil  conflict  prior  to  1945,  since  so  little  is 
known  about  the  extent  of  conflicts  outside  the 
developed  countries  before  this  date.  Figure  1  dis¬ 
plays  the  number  of  ongoing  conflicts  (top)  and 
new  outbreaks  (bottom)  since  1945,  using  data 
from  the  Uppsala  Conflict  Data  Program  at  the 
Department  of  Peace  and  Conflict  Research, 
Uppsala  University  and  Centre  for  the  Study  of 
Civil  War  at  the  International  Peace  Research 
Institute,  Oslo  (UCDP/PRIO),  Armed  Conflict 
Dataset.  UCDP/PRIO  classifies  conflicts  claiming 
more  than  25  deaths  in  a  year  as  interstate  conflicts, 
extrasystemic  (or  colonial)  conflicts,  as  well  as 
internal  conflicts,  or  as  civil  wars,  including  interna¬ 
tionalized  internal  conflicts  where  other  states  fight 
on  the  side  of  the  government.  Figure  1  shows  that 
while  there  are  relatively  few  interstate  wars  since 
1945,  civil  wars  have  been  common.  Whereas  inter¬ 
state  conflicts  tend  to  be  short,  civil  wars  often 
persist  for  a  long  time.  Research  also  demonstrates 
that  civil  wars  are  less  likely  to  be  settled  by  formal 
agreements  than  interstate  wars  and  much  more 
likely  to  recur.  Many  observers  saw  the  initial  rise 
in  new  outbreaks  of  civil  conflict  after  the  end  of 
the  Cold  War  as  evidence  that  the  world  would  be 
more  turbulent  and  violent  after  a  period  of  stable 
deterrence  between  the  superpowers.  However, 
Figure  1  demonstrates  a  clear  decline  in  civil  war 
since  this  peak  after  the  Cold  War.  The  specific 
causes  that  may  underlie  this  decline  remain  dis¬ 
puted,  and  the  number  of  ongoing  civil  wars 
remains  high  in  absolute  terms. 

Civil  wars  are  generally  less  severe  than  interstate 
wars  in  terms  of  the  direct  battle  deaths.  However, 
civil  wars  have  been  more  frequent  and  often  persis¬ 
tent,  and  more  than  90%  of  the  recorded  deaths  in 
battle  since  the  Cold  War  stem  from  civil  wars. 
Further,  war  can  have  a  substantial  indirect  impact 
on  human  welfare  beyond  the  direct  loss  of  life. 
Studies  indicate  that  countries  experiencing  civil 
war  see  a  pronounced  fall  in  their  gross  domestic 
product  and  never  recover  their  earlier  growth  tra¬ 
jectory.  According  to  Paul  Collier  et  al.  (2003), 
“Civil  war  is  development  in  reverse.”  Civil  wars 
also  dispel  trade  and  investment  and  leave  large 
social  legacies  in  unemployed  former  combatants 
and  displaced  individuals.  The  negative  conse¬ 
quences  of  civil  war  are  not  limited  only  to  the 
countries  that  experience  them;  studies  find  that 


neighboring  countries  also  suffer  a  negative  eco¬ 
nomic  impact  and  may  be  more  prone  to  violence 
themselves  as  a  consequence  of  conflict  among 
neighbors. 

Causes  of  Civil  War 

The  literature  on  the  causes  of  civil  war  is  enor¬ 
mous,  and  it  is  impossible  to  provide  an  exhaustive 
review  of  the  many  arguments  that  have  been  pre¬ 
sented  here.  Contributions  have  emphasized  a  vari¬ 
ety  of  social,  economic,  and  political  factors. 
However,  theories  of  civil  war,  in  general,  point 
either  to  specific  motives  for  why  people  resort  to 
violence  or  to  specific  opportunities  that  make  vio¬ 
lence  more  or  less  feasible  or  attractive.  This  entry 
examines  theories  of  civil  war  by  the  specific  clus¬ 
ters  of  factors  emphasized,  in  an  order  roughly  fol¬ 
lowing  the  chronology  of  theory  development  and 
the  specific  events  and  cases  that  motivated  these 
arguments. 

Economic  Grievances,  Motives,  and  Opportunities 

Most  civil  wars  take  place  within  relatively 
poorer  societies,  and  many  studies  corroborate 
the  link  between  development  and  income.  Early 
contributions  to  the  study  of  violence  within  soci¬ 
eties  tended  to  focus  on  economic  deprivation 
and  grievances  as  key  motives.  In  particular,  Ted 
Gurr  highlighted  inequality  and  how  groups  may 
resort  to  rebellion  if  they  are  dissatisfied  with 
their  current  economic  status  relative  to  their 
aspirations.  The  literature  on  nationalist  conflicts 
has  emphasized  how  both  relatively  poorer  and 
wealthier  groups  are  likely  to  rebel  against  the 
center  if  they  believe  that  they  can  do  better  under 
independence.  Civil  wars  in  Latin  American 
countries  were  often  interpreted  within  a  frame¬ 
work  focusing  on  economic  grievances,  in  the 
form  of  either  unequal  land  distribution  or  high 
income  inequality.  However,  the  empirical  evi¬ 
dence  linking  individual  income  inequality  and 
conflict  is  mixed.  Older  studies  focusing  on  a 
broad  range  of  political  violence  often  found  a 
positive  effect  of  higher  inequality,  while  newer 
studies  of  civil  war  more  specifically  tend  to  find 
little  support.  However,  one  should  be  cautious  in 
making  strong  inferences  from  this,  given  the 
poor  quality  and  coverage  of  cross-national  data 
on  individual  income  inequality. 
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Figure  I  Ongoing  Armed  Conflicts  (top)  and  Onset  of  New  Conflicts  ( bottom ) 

Source:  Author.  Data  from  the  UCDP/PRIO  Armed  Conflict  Dataset. 

Note:  UCDP/PRIO  =  Uppsala  Conflict  Data  Program  at  the  Department  of  Peace  and  Conflict  Research,  Uppsala  University 
and  Centre  for  the  Study  of  Civil  War  at  the  International  Peace  Research  Institute,  Oslo. 
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More  recent  political  economy  studies  of  civil  war 
tend  to  be  very  dismissive  of  the  role  of  grievances. 
Some  researchers  have  argued  that  grievances  are 
ubiquitous  and  that  it  is  more  important  to  focus  on 
variation  in  the  opportunities  for  violence.  Collier 
and  Anke  Hoeffler  argue  that  low  income  makes  it 
easier  to  mobilize  insurgencies,  since  potential 
recruits  have  less  to  lose  in  forgone  income  from 
normal  economic  activities  and  wages  for  soldering 
will  be  much  lower  than  in  wealthier  societies.  James 
Fearon  and  David  Laitin  argue  that  civil  war  is  pri¬ 
marily  a  problem  of  weak  states,  which  in  turn  is 
largely  determined  by  economic  development. 
Researchers  in  this  tradition  also  link  mobilization 
to  the  role  of  individual  incentives.  Opportunities  for 
insurgencies  are  better  when  participants  can  do  well 
out  of  war,  for  example,  through  looting  or  rents 
derived  from  valuable  natural  resources.  Empirical 
studies  lend  some  support  to  link  between  natural 
resources  and  a  higher  risk  of  civil  war.  Civil  wars  in 
Africa  are  often  taken  to  support  these  perspectives. 

Political  Grievances,  Motives,  and  Opportunities 

Political  deprivation,  such  as  colonial  subordina¬ 
tion  or  lack  of  political  rights,  provides  another 
plausible  motivation  for  resort  to  violence.  Many 
conflicts  after  1945  first  emerged  as  groups  sought 
to  achieve  independence  for  areas  under  colonial 
rule.  The  Indochina  War  and  the  Algerian  War  of 
Independence  helped  mobilize  movements  in  other 
countries  by  showing  how  overwhelmingly  more 
powerful  colonial  powers  could  be  defeated  through 
sustained  violent  campaigns.  Although  overseas 
colonies  eventually  received  their  independence, 
many  ethnically  distinct  groups  within  contiguous 
empires  such  as  Russia  or  Ethiopia  see  themselves  in 
similar  struggles  of  national  liberation.  Researchers 
have  highlighted  how  violence  may  arise  around 
such  peripheral  ethnic  minority  groups.  The  violent 
conflicts  in  the  former  Yugoslavia  spurred  renewed 
interest  in  ethnic  conflict.  Many  researchers  postu¬ 
lated  strong  parallels  between  the  security  dilemmas 
in  an  anarchic  international  system  and  relations 
between  ethnic  groups,  where  only  full  territorial 
partition  could  provide  stable  solutions. 

Much  of  the  subsequent  empirical  literature  has 
emphasized  that  there  is  little  evidence  that  ethni¬ 
cally  more  diverse  countries  are  generally  more 
prone  to  conflict.  However,  looking  only  at  the 
number  or  relative  size  of  ethnic  groups  disregards 


their  political  status  and  the  extent  to  which  ethnic 
groups  are  systematically  excluded  from  political 
power  or  discriminated  against  by  the  state.  Studies 
looking  at  ethnic  exclusion  find  stronger  evidence 
for  a  relationship  to  violence.  Ethnically  diverse 
countries  are  not  necessarily  more  prone  to  conflict 
if  they  have  inclusive  institutions  or  grant  auton¬ 
omy  rights,  and  control  of  the  state  or  access  to 
power  does  not  always  follow  directly  from  the 
relative  size  of  ethnic  groups.  Many  ethnic  civil 
wars  see  minority  groups  controlling  political  insti¬ 
tutions  and  excluding  larger  groups  from  power, 
such  as  the  Amhara  in  Ethiopia  until  their  defeat  by 
a  coalition  of  other  groups  in  1991.  Political,  eco¬ 
nomic,  and  ethnic  grievances  are  often  linked. 
Studies  that  focus  on  inequalities  that  follow  group 
lines  rather  than  individual  income  inequalities  find 
a  stronger  relationship  to  conflict. 

Struggles  for  broader  political  rights  in  auto¬ 
cratic  systems  provide  another  context  where  vio¬ 
lence  may  occur.  Political  democracy  provides 
many  avenues  for  actors  to  express  dissent  through 
nonviolent  political  means.  Autocratic  regimes  typi¬ 
cally  deny  citizens  room  for  political  activities  and 
often  resort  to  severe  repression  of  protest,  which  in 
turn  may  motivate  resort  to  arms.  Protests  against 
autocratic  or  exclusionary  regimes  have  often 
turned  violent,  sometimes  leading  to  sustained  cam¬ 
paigns,  as,  for  example,  in  South  Africa.  Claims  for 
greater  political  rights  and  freedom  are  clearly 
important  elements  of  the  rhetoric  and  call  for 
mobilization  of  many  insurgent  movements,  even  if 
these  do  not  necessarily  move  to  implement  demo¬ 
cratic  institutions  if  successful  in  achieving  power. 

Much  of  civil  war  research  has  focused  on  the 
role  of  political  institutions  in  structuring  the 
opportunities  for  violence  rather  than  the  potential 
accommodative  effect  of  greater  democracy.  Many 
have  argued  that  although  autocratic  institutions 
provide  fewer  avenues  for  nonviolent  political 
activities  and  protest,  autocratic  regimes  are  often 
sufficiently  repressive  to  successfully  deter  dissent. 
By  contrast,  anocracies,  or  regimes  combining  auto¬ 
cratic  and  democratic  features,  are  seen  as  the  most 
prone  to  see  violent  conflict  by  combining  the  lack 
of  political  freedom  motivating  violence  with  the 
sufficient  opportunities  afforded  by  a  less  repressive 
regime.  Many  studies  have  found  evidence  for  such 
an  inverted  U-shaped  relationship  between  democ¬ 
racy  and  civil  war.  However,  there  is  no  consensus 
on  the  underlying  mechanisms  (e.g.,  reforms  in 
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autocracies  may  be  a  response  to  conflict  potential 
rather  than  a  prior  cause).  Some  hold  these  relation¬ 
ships  to  be  in  part  artifacts  resulting  from  the  defini¬ 
tion  of  empirical  democracy  measures. 

Opportunity  Structures  and  Dynamic  Events 

Most  of  the  theories  discussed  above  emphasize 
structural  factors  that  rarely  change  or  change  only 
slowly  over  time.  Such  persistent  structural  features 
do  not  provide  clear  explanations  for  why  civil 
wars  break  out  at  specific  times  and  not  others. 
Research  on  social  movements  suggests  that  certain 
events  can  create  “political  opportunity  structures” 
that  afford  groups  better  prospects  for  extracting 
concessions  from  the  state  or  center.  This  may 
include  demonstrations  of  state  weakness,  conflict 
between  elites,  or  events  that  make  it  easier  for 
groups  to  mobilize,  for  example,  by  bringing 
groups  together  or  indicating  focal  points  for  orga¬ 
nizing  protests.  The  concept  of  opportunity  struc¬ 
tures  has  so  far  not  had  much  direct  impact  on 
studies  on  civil  conflict,  but  there  are  many  existing 
arguments  and  findings  in  civil  war  research  that 
can  be  interpreted  within  this  framework.  Regime 
change  and  other  signals  of  weakened  state  author¬ 
ity  can  increase  the  perceived  chances  of  success  or 
extracting  concessions  from  a  government.  Studies 
have  shown  that  economic  crises  and  natural  disas¬ 
ters  can  increase  the  risk  of  conflict.  This  is  consis¬ 
tent  with  the  idea  that  crises  and  emergencies  can 
help  provide  a  setting  for  rallying  protest  against 
the  government.  For  example,  the  1973  earthquake 
in  Nicaragua,  and  the  massive  corruption  and  lack 
of  subsequent  reconstruction,  generated  widespread 
disillusionment  and  helped  a  long-standing  Marxist 
insurgency  dramatically  increase  recruitment. 

International  Dimensions  of  Civil  War 

Most  research  on  civil  war  has  assumed  that 
since  civil  wars  are  “internal”  conflicts  rather  than 
conflicts  between  states,  their  main  causes  must  also 
be  domestic  or  located  within  state  boundaries. 
However,  factors  outside  individual  countries  can 
play  an  important  role  in  the  outbreak  of  conflicts, 
as  well  as  how  they  evolve.  Many  actors  in  civil 
wars  are  not  necessarily  confined  within  individual 
countries.  Ethnic  groups  often  span  international 
boundaries,  and  transnational  kin  frequently  par¬ 
ticipate  in  or  provide  support  for  insurgencies  in 


other  states.  The  status  of  international  borders 
generates  different  constraints  and  opportunities  for 
governments  and  rebels.  Borders  are,  in  a  technical 
sense,  just  lines  in  the  sand  and  are  often  not  diffi¬ 
cult  to  cross  from  a  purely  military  perspective. 
However,  the  fact  that  borders  formally  delineate 
state  sovereignty  makes  it  more  difficult  for  govern¬ 
ments  to  violate  the  sovereignty  of  other  countries, 
while  such  constraints  are  less  relevant  for  rebels. 
Governments  risk  retaliation  from  neighboring 
countries  from  territorial  incursions  and  face  diffi¬ 
culties  in  targeting  transnational  support.  This  in 
turn  means  that  rebels  can  have  a  logistic  advantage 
in  operating  out  of  extraterritorial  bases,  and  trans¬ 
national  rebel  movements  can  be  more  difficult  for 
governments  to  deter  or  defeat.  The  presence  of 
conflict  in  another  state  can  help  facilitate  violent 
mobilization,  either  through  emulation  of  successful 
rebellions  or  through  the  direct  imports  of  arms  and 
combatants.  Finally,  civil  wars  are  often  closely 
linked  to  interstate  war.  Poor  relations  between 
states  may  motivate  governments  to  support  insur¬ 
gencies  in  rival  countries,  and  civil  wars  may  in  turn 
promote  military  conflict  between  states,  for  exam¬ 
ple,  as  a  result  of  conflict  over  border  violations, 
alleged  support  for  insurgents,  or  conflicts  over  the 
externalities  generated  by  conflicts.  Western  Africa 
in  the  1990s,  for  example,  provides  many  examples 
of  governments  supporting  insurgencies  in  neigh¬ 
boring  countries  and  retaliating  against  alleged  sup¬ 
port  and  border  violations.  The  fact  that  conflicts 
are  not  necessarily  limited  to  single  countries  and 
may  involve  participation  by  other  states  in  various 
forms  demonstrates  how  a  strict  dichotomy  between 
civil  and  interstate  wars  often  may  be  untenable. 

Civil  War  Termination  and  Duration 

Although  most  research  focuses  on  accounting  for 
the  original  outbreak  of  civil  war,  there  has  also 
been  growing  interests  in  understanding  the  pros¬ 
pects  for  conflict  termination  and  why  some  con¬ 
flicts  are  so  persistent.  Many  researchers  argued  that 
there  must  be  some  symmetry  between  the  original 
causes  of  civil  war  and  the  factors  that  lead  to  their 
eventual  termination.  From  this  perspective,  efforts 
to  foster  conflict  settlement  should  focus  on  address¬ 
ing  the  issues  that  gave  rise  to  the  conflict.  For 
example,  civil  wars  would  be  likely  to  end  earlier  in 
more  developed  countries,  where  the  opportunity 
costs  are  higher  and  states  are  stronger.  Events  such 
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as  political  reform  and  increasing  accommodation 
could  help  promote  the  end  of  conflicts.  However, 
other  researchers  argue  that  the  factors  that  make 
conflicts  endure  can  be  quite  different  from  those 
influencing  the  initial  outbreak  of  conflict.  While 
interstate  wars  tend  to  be  relatively  short  and  typi¬ 
cally  end  quickly  in  some  kind  of  formal  settlement 
once  the  relative  strength  of  the  actors  becomes 
clear,  civil  wars  often  persist  for  a  long  time,  as  in 
the  case  of  Southern  Sudan  or  Angola.  A  number  of 
explanations  have  been  proposed  to  account  for  this 
paradox.  Some  emphasize  how  civil  wars  tend  to  be 
low-intensity  conflicts  fought  in  the  periphery, 
where  the  features  that  facilitate  insurgency  make  it 
difficult  for  governments  to  conclusively  defeat  reb¬ 
els.  Others  point  to  how  insurgents  that  can  do  well 
during  war  through  looting  and  control  over  valu¬ 
able  resources  may  have  little  interest  in  seeking  an 
end  to  conflict.  Other  research  emphasizes  how 
conflict  termination  involves  severe  problems  of 
credible  commitment.  Even  if  it  may  be  relatively 
easy  to  reach  consensus  on  the  terms  of  an  agree¬ 
ment  in  principle,  carrying  through  with  an  agree¬ 
ment  may  be  difficult  since  parties  have  incentives 
to  make  promises  that  they  later  renege  on.  For 
rebels,  laying  down  arms  is  a  risky  choice,  since 
they  will  be  left  less  able  to  defend  themselves. 
Moreover,  it  can  be  difficult  for  governments  to 
ascertain  whether  rebels  will  uphold  their  end  of 
agreements,  for  example,  through  decommissioning 
or  controlling  violent  breakaway  factions  in  the 
aftermath  of  agreements.  Research  on  credible 
commitment  problems  after  civil  war  argues  that 
the  success  of  agreements  often  hinges  on  whether 
third  parties  can  serve  as  external  enforcers.  New 
research  on  peacekeeping  and  conflict  management 
suggests  that  involvement  by  international  organi¬ 
zations  such  as  the  United  Nations  (UN)  can 
increase  the  prospects  for  settling  conflict  and  pre¬ 
venting  wars.  However,  many  interventions  in  civil 
conflicts  have  clearly  been  less  than  fully  successful. 
There  is  little  consensus  on  how  best  the  UN  may 
contribute  or  to  what  extent  the  characteristics  of 
conflicts  or  initial  prospects  for  settlements  deter¬ 
mine  whether  external  actors  are  willing  to  be 
involved  in  the  first  place. 

Disaggregation  in  Civil  War  Research 

Civil  war  has  gone  from  being  a  peripheral  area  to  a 
major  focus  in  the  study  of  conflict.  Moreover,  the 


field  shows  clear  positive  synergies  from  the  interac¬ 
tion  between  theory  development  and  research 
design.  Older  studies  of  civil  war  were  often  quite 
descriptive,  focused  on  single  cases,  and  frequently 
reluctant  to  consider  general  theories  of  civil  war. 
The  renewed  interest  in  civil  wars  in  the  1990s  rec¬ 
ognized  the  need  to  take  into  account  nonconflict 
cases  and  turned  to  cross-national  studies,  compar¬ 
ing  the  various  characteristics  believed  to  make 
countries  more  prone  to  conflict  across  countries 
and  its  relationship  to  civil  war.  However,  recent 
research  has  highlighted  how  the  proxy  measures 
used  in  cross-country  comparative  studies  are  often 
quite  far  removed  from  the  key  theoretical  concepts. 
Moreover,  looking  at  states  and  countries  at  large 
often  downplays  the  nonstate  actors  in  civil  wars 
and  ignores  the  often  considerable  variation  within 
countries.  Conflicts  tend  to  be  localized  and  often 
involve  actors  and  regions  that  are  markedly  differ¬ 
ent  from  national  averages,  as,  for  example,  the 
Chechen  conflict  in  Russia.  Recent  contributions, 
accompanied  by  many  new  data  developments, 
often  turn  to  greater  disaggregation  within  countries 
and  a  more  specific  focus  on  actors  to  more  carefully 
evaluate  theories  of  civil  war.  A  number  of  studies 
have  looked  at  the  specific  conflictual  events  within 
individual  countries  over  time  to  better  understand 
the  microlevel  interactions  between  actors  in  con¬ 
flict.  Some  studies  have  looked  at  dyads  of  periph¬ 
eral  groups  and  the  center  to  take  into  account  how 
the  attributes  of  nonstate  actors  may  influence  the 
risk  of  conflict.  Other  studies  have  looked  at  the 
characteristics  of  specific  rebel  groups  to  better 
understand  capacity  or  opportunities  for  conflict  or 
used  individual-level  data  on  participants  in  insur¬ 
gencies  to  examine  motivations.  Some  researchers 
consider  geographically  disaggregated  data,  either 
on  the  characteristics  of  the  places  where  conflict 
occurs  or  using  smaller  units  within  countries,  for  a 
more  detailed  resolution  of  how  social,  economic, 
and  political  factors  may  be  related  to  conflict.  These 
studies  indicate  that  the  characteristics  of  conflict 
zones  tend  to  be  quite  different  from  national  aggre¬ 
gates  and  averages.  Moreover,  the  conclusions  from 
country-level  studies  on  who  participates  in  insur¬ 
gencies  and  their  motivations  are  often  not  sup¬ 
ported  by  more  direct  evidence.  Such  innovations  in 
the  study  of  civil  conflict  attest  to  the  vibrancy  of  this 
research  area.  Moreover,  they  underscore  the  impor¬ 
tant  relationship  research  design  and  the  conclusions 
that  we  reach,  and  how  theory  must  inform  the 
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former.  Although  existing  research  has  clearly  gener¬ 
ated  many  useful  insights  about  civil  war,  this  review 
illustrates  the  considerable  changes  in  the  conven¬ 
tional  wisdom  on  civil  war  over  time.  Advances  in 
theory  development  and  research  are  likely  to  con¬ 
tinue  to  spur  further  changes. 
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The  concept  of  class  is  one  of  the  most  important 
and  most  controversial  in  social  sciences.  It  is 
important  because  classes  have  been  traditionally 
considered  a  key  to  the  study  of  social  inequalities 
and  social  and  political  behavior.  It  is  highly  con¬ 
troversial  because  different  definitions  and  con¬ 
trasting  perspectives  on  the  role  and  impact  of  class 
have  been  proposed.  In  everyday  language,  class 
refers  to  social  groupings  based  on  similar  occupa¬ 
tional  conditions  that  have  the  same  access  to  eco¬ 
nomic,  social,  and  political  resources.  It  is  also 
assumed  that  experiencing  analogous  social 
inequalities  favors  the  formation  of  a  specific  cul¬ 
tural  identity  and  influences  social  and  political 
behavior.  However,  in  scientific  language,  classes 
are  to  be  considered  a  “modern”  phenomenon. 
This  means  that  whereas  social  inequalities  have 
always  been  present  in  human  history,  the  specific 
inequalities  based  on  class  require  two  conditions: 
the  equality  of  citizens  before  the  law  and  the  devel¬ 
opment  of  a  market  economy.  These  conditions 
occur  after  the  end  of  the  18th  century  with  the 
advent  of  modern  society,  shaped  by  democratic 
revolution  and  growth  of  the  market  economy  and 
industrialization. 

Therefore,  classes  can  be  defined  as  social  group¬ 
ings  that  are  based  on  similar  occupational  condi¬ 
tions  in  societies  where  a  market  economy  prevails 
and  the  formal  equality  of  citizens  before  the  law  is 
recognized.  People  belonging  to  the  same  class  de 
facto  experience  an  unequal  access  to  economic, 
social,  and  political  resources  depending  on  the 
jobs  they  occupy,  despite  the  formal  equality  of 
rights.  However,  they  can  change  their  class  through 
social  mobility.  This  possibility  clearly  distinguishes 
classes  from  other  social  groupings,  such  as  reli¬ 
gious  castes  or  medieval  estates,  for  two  main  rea¬ 
sons.  First,  in  the  case  of  castes  or  estates,  inequali¬ 
ties  are  established  by  traditional  laws.  Second,  the 
class  position  of  people  cannot  be  changed,  because 
social  mobility  is  not  allowed  (as  for  castes)  or  is 
subject  to  special  conditions  defined  by  law  (as  in 
the  case  of  medieval  estates).  In  modern  societies, 
mobility  is  a  normal  condition,  although  classes 
presuppose  a  certain  degree  of  enclosure. 

While  this  definition  of  class  is  widely  accepted 
in  the  literature,  sharp  differences  have  emerged 
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along  two  dimensions.  First,  opinions  diverge  on 
the  economic  base  to  be  used  to  analyze  the  class 
structure.  Second,  strong  disagreement  persists  on 
the  cultural,  social,  and  political  consequences  of 
classes.  On  the  one  hand,  this  is  related  to  the  spe¬ 
cific  mechanisms  that  trigger  the  transformation  of 
classes  from  mere  aggregates  of  people  into  social 
groups:  collective  actors  that  recognize  common 
values  and  interests  and  organize  themselves  to 
pursue  common  goals.  On  the  other  hand,  scholars 
differ  in  the  evaluation  of  the  impact,  over  time,  of 
classes  on  social  behavior  (lifestyle,  consumption) 
and  political  behavior  (participation,  voting). 

In  the  next  section,  the  main  approaches  to 
the  origins  and  consequences  of  social  classes  are 
discussed.  First,  the  perspectives  of  classical 
authors  such  as  Karl  Marx,  Max  Weber,  and 
Emile  Durkheim  are  contrasted.  The  influence  of 
these  authors  on  the  ensuing  research  is  exten¬ 
sive.  Contemporary  studies  are  still  affected  by 
their  different  views.  After  discussing  the  most 
important  proposals  for  reframing  class  struc¬ 
ture  in  the  light  of  economic  and  political 
changes,  attention  is  devoted  to  debate  over  the 
past  decade  on  the  decline  of  social  classes:  To 
what  extent  do  social  classes  remain  important 
as  an  instrument  for  understanding  contempo¬ 
rary  societies? 

Contrasting  Views  in  the  Classics 

Among  the  classicists,  Marx  is  the  author  who  gave 
greatest  emphasis  to  social  classes.  The  struggle 
between  classes  plays  a  fundamental  role  in  the  his¬ 
torical  development  of  human  societies.  Classes  are 
social  groupings  that  share  a  specific  position  in  the 
control  of  the  means  of  production.  Each  historical 
type  of  society  is  characterized  by  a  dominant  mode 
of  production  and  specific  classes.  The  capitalist 
mode  of  production  is  based  on  the  private  property 
of  the  means  of  production  and  on  the  role  of  the 
market  in  regulating  productive  activities  and  the 
distribution  of  incomes.  Capitalist  societies  were 
therefore  increasingly  shaped  by  the  capitalist  mode 
of  production  with  its  two  main  classes:  (1)  the 
owners  of  the  means  of  production  (capitalist  bour¬ 
geoisie)  and  (2)  the  workers,  who  are  employed  by 
the  former  and  receive  a  wage.  According  to  Marx, 
there  was  an  objective  conflict  of  interest  between 
these  two  classes,  because  the  industrial  bourgeoisie 


could  maintain,  and  strengthen,  its  position  of  eco¬ 
nomic,  social,  and  political  privilege  as  far  as  it  was 
able  to  exploit  the  workers  and  extract  from  their 
activities  a  surplus  value  as  determinant  of  profit. 
Over  time,  however,  the  growing  crises  of  the  capi¬ 
talist  economy  would  worsen  the  conditions  of  the 
working  class  and  would  encourage  its  social  and 
political  organization,  which,  in  turn,  led  to  a  revo¬ 
lutionary  change  in  the  mode  of  production  and  the 
advent  of  a  socialist  society. 

Marx  was  aware  that  the  class  structure  of 
capitalist  societies  was  more  complex.  At  the  same 
time,  he  recognized  that  the  making  of  the  working 
class  as  a  historical  actor  did  not  depend  on  eco¬ 
nomic  factors  alone.  Tie  frequently  discussed  the 
sociocultural  and  political  factors  that  could  affect 
the  formation  of  class  consciousness  (as  he  puts  it, 
the  passage  from  a  “class  in  itself”  to  a  “class  for 
itself”).  However,  over  time  the  extension  of  the 
capitalist  mode  of  production  would  lead  to  the 
increasing  polarization  of  bourgeoisie  and  workers 
and  to  an  overwhelming  trend  toward  conflict.  The 
history  of  the  past  2  centuries  has  shown  the 
importance  of  classes  and  class  struggle,  but  it  has 
also  demonstrated  that  Marx  overestimated  the 
role  of  class  conflict  and  overlooked  the  autono¬ 
mous  role  of  social  and  political  processes  and 
institutions. 

Weber  was  convinced  of  the  important  role 
played  by  class  and  class  conflict.  However,  he  saw 
class  structure  in  a  more  flexible  way,  which 
allowed  him  to  account  for  the  increasing  differen¬ 
tiation  introduced  by  the  development  of  a  modern 
capitalist  economy.  This  was  done  by  defining 
classes  as  social  groupings  based  on  a  common 
position  in  the  market  rather  than  on  the  control 
of  means  of  production,  as  Marx  asserted. 
Therefore,  a  class  is  composed  of  people  who  have 
similar  life  chances,  affected  by  the  income  they 
can  obtain  in  the  market.  This  common  power  in 
the  market  (“class  situation”)  is  determined  not 
only  by  the  ownership  of  the  means  of  production 
but  also  by  wealth,  education,  and  professional 
skills. 

Another  difference  concerns  the  passage  from  a 
common  class  situation  to  the  formation  of  self- 
aware  social  groups  that  organize  themselves  to 
pursue  common  interests.  While  Marx  was  con¬ 
vinced  that  the  economic  dynamic  of  capitalism 
would  lead,  in  the  end,  to  class  consciousness  and 
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collective  action,  Weber  was  much  more  skeptical 
about  this  possibility.  He  thought  that  people 
within  the  same  class  situation  would  share  life 
chances,  social  inequalities,  styles  of  consumption, 
social  and  even  political  behavior,  but  they  would 
usually  remain  aggregates  of  individuals  rather 
than  becoming  self-aware  social  groups  based  on 
social  interaction.  In  this  respect,  the  German  soci¬ 
ologist  distinguished  between  mass  action  and 
community  action.  A  common  class  situation  usu¬ 
ally  brings  about  mass  action.  But  the  passage  to 
community  action  requires  certain  noneconomic 
conditions:  Status  groups  (religious,  ethnic  groups) 
must  be  absent  or  marginal;  a  large  number  of 
people  must  be  in  the  same  class  situation;  because 
they  are  concentrated  in  large  firms  and  urban 
neighborhoods,  it  is  relatively  easy  to  organize 
them;  and  intellectuals  will  play  an  important  role 
in  the  diffusion  of  a  class  ideology  and  in  the  orga¬ 
nization  of  the  people  involved.  Despite  these  clear- 
cut  differences  in  their  view  of  classes  and  class 
conflict,  both  Marx  and  Weber  recognized  the 
importance  of  social  inequalities  as  a  determinant 
of  social  behavior  and  source  of  conflict  in  modern 
societies. 

A  different  perspective  was  adopted  by  another 
founding  father  of  sociology,  Durkheim.  Writing  at 
the  end  of  the  19th  century,  he  emphasized  the 
increasing  division  of  labor  as  the  main  feature  of 
modern  society.  A  fundamental  requisite  of  this 
society,  based  on  the  high  differentiation  and  inte¬ 
gration  of  specialized  roles,  concerns  the  recruit¬ 
ment  and  reward  of  individuals.  It  is  necessary  that 
each  individual  be  called  to  fulfill  the  function  he  or 
she  will  perform  best  and  receive  an  adequate 
reward  for  his  or  her  efforts.  In  other  words,  people 
must  be  motivated  to  perform  different  roles  thanks 
to  structured  social  inequalities.  This  is  the  main 
function  of  stratification  in  the  social  system  accord¬ 
ing  to  Talcott  Parsons,  the  most  influential  figure  in 
the  sociology  of  the  1950s,  who  developed  the  func¬ 
tionalist  premises  of  Durkheim’s  work.  Parsons  and 
other  scholars  of  the  functionalist  approach,  such  as 
Kingsley  Davis  and  Wilbert  Moore  (1945),  gave  a 
more  optimistic  account  of  the  role  of  social  strati¬ 
fication  in  the  stability  of  society.  This  trend — 
which  was  also  influenced  by  the  economic  and 
social  climate  of  the  postwar  period — fostered  the 
growth  of  descriptive  research  on  social  stratifica¬ 
tion,  especially  in  the  United  States.  Social  strata 


were  constructed  by  reputational  analysis  (asking 
people  to  rank  others)  or  by  a  mix  of  reputational 
evaluations  and  objective  indicators,  such  as  wealth, 
income,  and  education.  Usually,  as  in  the  well- 
known  work  by  Lloyd  Warner  (1963),  these  studies 
presented  a  classification  based  on  various  status 
groups  (upper  classes,  middle  classes,  and  lower 
classes). 

Reframing  Class  Analysis 

In  the  1950s  and  1960s,  the  stratification  approach 
largely  prevailed  in  the  study  of  social  classes.  This 
situation  changed  in  the  1970s  with  the  develop¬ 
ment  of  new  empirical  research  based  on  occupa¬ 
tional  classes.  In  this  strand  of  literature,  there  is  a 
shared  critique  of  the  descriptive  features  of  strati¬ 
fication  studies  and  an  emphasis  on  the  “rela¬ 
tional”  attributes  of  classes.  This  means  classes  are 
to  be  defined  on  the  basis  of  their  position  in  the 
market  and  in  the  productive  process,  as  in  Weber 
and  Marx,  and  therefore  in  relation  to  the  other 
classes  with  which  they  compete  for  greater 
resources.  However,  the  origins  and  aims  of  the 
new  studies  are  clearly  different.  The  component 
most  influenced  by  Marx  and  the  Marxist  tradition 
is  more  interested  in  the  analysis  of  the  middle 
classes  and  in  its  consequences  for  the  proletarian¬ 
ization  of  work,  originally  foreseen  by  Marx.  The 
other  component  is  closer  to  the  Weberian  perspec¬ 
tive  and  looks  at  the  empirical  study  of  classes  as 
the  necessary  basis  for  a  more  adequate  analysis  of 
social  mobility. 

The  followers  of  the  Marxist  perspective  tried 
to  account  for  both  the  persistence  of  the  petty 
bourgeoisie  and  the  unexpected  (in  Marxist  terms) 
growth  of  the  new,  dependent,  middle  classes 
(managers,  supervisors,  technicians).  In  designing 
an  empirical  map  of  social  classes,  Erik  Wright 
tried  to  solve  this  problem  through  the  concept  of 
“contradictory  class  locations.”  In  his  studies,  he 
distinguished  between  the  ownership  and  control 
of  the  means  of  production.  This  allows  him  to 
explain  the  new  middle-class  positions  as  based 
on  various  degrees  of  control  over  the  concrete 
use  of  the  productive  means  without  formal  own¬ 
ership.  As  a  consequence,  top  and  middle-level 
managers,  technicians,  and  supervisors  may  also 
experience  variable  degrees  of  autonomy  in  per¬ 
forming  their  role. 
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In  the  1980s  and  1990s,  another  influential  class 
map  was  provided  by  John  Goldthorpe.  He  distin¬ 
guished  between  a  labor  contract  and  a  service 
contract.  Employment  regulated  by  a  labor  contract 
entails  a  short-term  and  specific  exchange  of  money 
for  work,  while  a  service  contract  involves  a  delega¬ 
tion  of  authority,  specialized  knowledge,  and  a  cer¬ 
tain  degree  of  autonomy  on  the  part  of  employees. 
On  this  basis,  a  class  schema  for  empirical  analysis 
and  comparisons  was  worked  out.  It  included  seven 
main  classes:  (1)  service  class  (managers,  profession¬ 
als,  higher-grade  technicians),  (2)  less  skilled  white- 
collar  workers,  (3)  petty  bourgeoisie  (artisans,  small 
proprietors),  (4)  skilled  manual  workers,  (5)  un¬ 
skilled  manual  workers,  (6)  small  farmers,  and 
(7)  farm  laborers.  In  practice,  this  schema  is  similar 
to  the  map  designed  by  Wright.  Both  account  for 
the  growth  of  the  new  middle  classes  and  an 
increasing  differentiation  among  manual  workers. 
Both  of  them  can  be  used  in  diachronic  and  cross¬ 
country  comparisons.  For  example,  over  time  the 
class  structure  of  advanced  countries  shows  a 
decline  of  the  old  middle  classes  (the  petty  bour¬ 
geoisie);  an  increase  in  professionals,  technicians, 
and  white-collar  workers  (the  new,  dependent 
middle  classes);  and  a  growing  differentiation 
among  manual  workers.  Cross-country  compari¬ 
sons  may  help  analyze  the  differences  among 
advanced  countries,  which  are  related  not  only  to 
the  productive  model  and  organization  but  also  to 
the  role  of  the  state  and  the  extension  of  nonmarket 
forms  of  coordination  (“varieties  of  capitalism”). 
Interesting  insights  may  also  come  from  compari¬ 
sons  between  advanced  and  developing  countries. 
However,  the  main  application  of  the  Goldthorpe 
schema  has  been  in  the  comparative  study  of  social 
mobility.  In  this  respect,  robust  research  findings 
have  been  produced  on  the  strong  influence  of 
classes  on  social  mobility.  These  findings  do  not 
confirm  the  expectations  of  the  liberal  theory  that 
industrial  societies  tend  to  become  more  mobile 
and  more  open.  Rather,  mobility  rates  (in  particu¬ 
lar  relative  rates)  show  a  high  degree  of  temporal 
stability.  The  chances  for  the  children  of  less  privi¬ 
leged  classes  of  moving  to  the  upper  positions  of 
the  class  structure  remain  low,  without  revealing 
significant  differences  among  countries. 

In  spite  of  their  different  origins  and  goals,  both 
the  Marxist  and  Weberian  maps  achieved  similar 
results  and  shed  light  on  the  class-based  social 


inequalities  typical  of  advanced  capitalist  societies 
until  the  1970s.  However,  in  the  following  period, 
there  was  a  double  change.  On  the  one  hand,  the 
crisis  of  the  “Fordist”  model  of  production  brought 
about  a  decline  of  large  firms  and  their  organiza¬ 
tion  of  labor.  New  forms  of  productive  organiza¬ 
tion  developed,  based  on  flexible  specialization 
and  networks  linking  firms.  On  the  other  hand, 
welfare  systems  had  to  be  restructured  to  deal  with 
increasing  costs.  As  a  consequence,  important 
changes  affected  the  whole  class  structure.  New 
maps  had  to  be  designed.  “Postindustrial”  societ¬ 
ies  are  characterized  by  the  growing  role  of  the 
service  sector.  The  traditional  axis  of  stratification 
based  on  industrialism  has  shrunk,  both  because  of 
the  new  technologies  that  reduce  the  use  of  labor 
and  because  of  increasing  trends  to  outsource 
manufacturing  activities  to  developing  countries. 
In  advanced  capitalist  societies,  new  activities  have 
developed  in  business  services  (finance,  legal  and 
accounting  services,  software  programming),  in 
social  services  (health,  education,  and  welfare  ser¬ 
vices),  and  in  consumer  service  (restaurants,  laun¬ 
dries,  services  related  to  leisure).  Therefore,  as 
Gosta  Esping-Andersen  has  pointed  out,  a  new 
axis  of  stratification  has  emerged.  This  requires 
that  new  groupings  be  more  clearly  distinguished 
within  the  middle  classes  and  the  workers.  The 
roles  of  professionals  and  scientists,  together  with 
that  of  technicians  and  semiprofessionals  (school¬ 
teachers,  social  workers,  technical  designers), 
become  more  important  in  the  middle  classes  and 
have  to  be  differentiated  from  those  of  managers 
and  supervisors.  Among  the  workers,  it  is  neces¬ 
sary  to  distinguish  new  figures  such  as  skilled  ser¬ 
vice  workers  (police  officers,  hairdressers,  etc.)  and 
unskilled  service  workers  or  service  proletariat 
(cleaners,  wait  staff,  shop  assistants,  etc.).  The  new 
groupings  tend  to  experience  specific  forms  of 
career  regimes  and  life  chances  and  therefore  are 
involved  in  processes  of  class  closure  that  differen¬ 
tiate  them  from  the  traditional  classes  based  on  the 
Fordist  industrial  model. 

Persistence  or  Decline? 

Two  macrochanges  have  influenced  the  debate  on 
social  classes  in  the  past  decades:  (1)  postindustri¬ 
alism  and  (2)  globalization.  According  to  many 
scholars,  the  crisis  of  Fordism,  the  emergence  of  a 
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service-based  economy  in  advanced  capitalist  coun¬ 
tries,  and  the  increasing  delocalization  of  manufac¬ 
turing  activities  toward  the  developing  countries 
have  brought  about  a  decline  in  social  classes.  The 
basic  arguments  in  support  of  this  thesis  refer  to  two 
main  trends:  On  the  one  hand,  the  increasing  frag¬ 
mentation  of  class  structure  and  the  growing  indi¬ 
vidualization  of  life  chances,  and  on  the  other  hand, 
the  weakening  of  class-based  attitudes  and  behav¬ 
ior.  For  example,  according  to  Ulrich  Beck  (1992), 
social  inequalities  tend  to  become  “classless,” 
whereas  Anthony  Giddens  pointed  out  that — as  a 
consequence  of  fragmentation  and  globalization — 
classes  are  no  longer  experienced  as  a  significant 
source  of  collective  identity. 

Those  who  share  the  idea  of  a  class  decline  usu¬ 
ally  refer  to  both  dimensions:  the  loss  of  influence 
over  social  inequalities  and  social  behavior.  In 
contrast  to  this  thesis,  other  scholars  strongly  criti¬ 
cize  this  view  of  a  decline.  However,  they  mainly 
focus  on  the  relationship  between  classes  and 
social  inequalities.  Goldthorpe  provided  a  clear 
synthesis  of  the  latter  position,  which  makes  wide 
reference  to  empirical  findings.  He  pointed  to  the 
more  detailed  arguments  advanced  in  support  of 
the  decline:  insecurity  and  mobility.  The  theorists 
of  decline  assume  that  within  the  context  of  glo¬ 
balization,  the  threat  of  unemployment  is  not  con¬ 
fined  mainly  to  the  members  of  the  less  advantaged 
classes  but  tends  to  become  pervasive.  As  for 
mobility,  globalization  and  postindustrial  forms  of 
economic  organization  are  supposed  to  break  up 
the  continuity  of  working  careers  and  of  member¬ 
ship  in  the  traditional  occupational  classes.  The 
growth  of  flexible  and  nonstandard  work  and  the 
emergence  of  new,  more  fragmented  work  trajec¬ 
tories  reduce  the  significance  of  class  as  a  lifetime 
experience.  However,  in  contrast  to  these  expec¬ 
tations,  empirical  findings  show  the  persistence  of 
class  as  the  main  determinant  of  inequalities. 
Thus,  it  is  true  that  economic  changes  have 
brought  about  increasing  job  insecurity  and  a 
growth  of  nonstandard  employment,  but  the 
chances  of  losing  their  jobs  remain  much  higher 
for  skilled  and  especially  nonskilled  workers  than 
for  professional,  administrative,  and  managerial 
posts.  In  addition,  empirical  research  demon¬ 
strates  that  inequalities  in  important  spheres  such 
as  income,  health,  and  access  to  education  remain 
strongly  associated  with  the  class  of  people, 


defined  according  to  the  job  they  hold.  As  for 
social  mobility,  there  is  clear  evidence  that  the 
overall  association  between  class  origin  and  desti¬ 
nation  is  still  characterized  by  high  temporal 
stability  and,  therefore,  despite  changes  in  the 
organization  of  work,  classes  continue  to  provide 
lifetime  experiences. 

In  the  debate  on  class  decline,  particular  atten¬ 
tion  has  been  devoted  to  the  influence  of  class  on 
voting  and  political  participation.  According  to  the 
theories  that  point  to  the  weakening  of  class  experi¬ 
ence,  class  politics  gives  way  to  new  forms  of 
political  behavior  influenced  more  by  cultural  fac¬ 
tors,  lifestyles,  and  elective  choices.  One  of  the  most 
powerful  factors  that  feed  this  trend  is  the  dramatic 
reduction  in  size  of  the  traditional  working  class 
and  the  growth  of  service  workers  and  the  new 
middle  classes.  As  a  consequence,  class  cleavages 
become  a  less  important  basis  for  social  identity. 
Their  place  is  increasingly  taken  by  other  divisions, 
such  as  race  and  ethnicity,  gender  and  sexuality. 
Many  scholars  also  share  Ronald  Inglehart’s  (1990) 
thesis  of  the  new  “postmaterialism”  in  the  politics 
of  the  advanced  societies.  This  view  sees  politics  no 
longer  as  influenced  by  the  search  for  economic 
well-being  and  the  defense  of  material  interests  but 
as  increasingly  affected  by  a  personal  search  for 
autonomy  and  self-fulfillment  and  by  a  greater  cul¬ 
tural  interest  in  the  quality  of  life. 

As  in  the  case  of  social  inequality,  scholars 
mainly  involved  in  empirical  research  have  rejected 
the  thesis  of  a  declining  class  influence  on  politics. 
They  have  pointed  to  findings  that  clearly  question 
the  hypothesis  of  a  class  dealignment — a  reduction 
in  the  level  of  class  voting — controlling  for  changes 
in  the  relative  size  of  classes  and  the  overall  popu¬ 
larity  of  parties:  Old  relations  between  classes  and 
parties — such  as  those  of  workers  and  left-wing 
parties — still  hold.  It  has  also  been  noted  that  a 
possible  realignment  will  usually  reveal  new  links 
between  class  and  voting  patterns  rather  than  their 
disappearance.  Thus,  in  concluding  his  accurate 
comparative  study,  Geoffrey  Evans  pointed  out 
that  the  theory  of  a  generalized  decline  in  the  class 
basis  of  voting  is  wrong. 

However,  in  assessing  the  influence  of  class,  a 
more  clear-cut  distinction  has  to  be  made  between 
the  impact  on  voting  patterns  and  party  affiliation 
and  the  strength  of  social  classes  as  political  actors, 
that  is,  as  self-aware  collective  subjects,  capable  of 
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pursuing  their  own  interests  in  the  economic  field 
as  well  as  in  the  political  arena.  Empirical  findings 
show  the  persistence  of  this  impact  on  voting  and 
party  affiliations,  but  the  trend  does  not  contradict 
the  weakening  of  class  on  another  dimension:  the 
formation  of  self-aware  social  groups.  There  are 
clear  signs  of  a  decline  in  this  respect,  especially 
with  reference  to  the  role  of  the  working  class  in 
advanced  countries.  Its  decline  in  size,  the  frag¬ 
mentation  of  working  conditions,  and  the  growth 
of  the  service  economy  have  substantially  weak¬ 
ened  the  working  class  as  a  social  and  political 
actor.  This  trend  is  confirmed — with  variations 
from  one  country  to  the  next — by  a  decline  in 
trade  union  membership  and  union  strength  and 
by  the  changes  that  have  affected  left-wing  politi¬ 
cal  parties  in  their  attempt  to  adjust  to  the  trans¬ 
formations  in  class  structure.  The  opening  up  of 
income  differentials,  at  the  expense  of  the  less 
privileged  classes,  the  restructuring  of  welfare  sys¬ 
tems,  and  the  overall  weakening  of  social  protec¬ 
tion  are  clear  indicators  of  this  decline. 

Therefore,  it  may  be  said  that  class  still  clearly 
influences  social  inequalities  and  life  chances 
(income,  health,  education,  social  mobility),  but 
when  it  comes  to  action  its  impact  is  weaker.  To 
return  to  Weber’s  distinction  between  mass  action 
and  community  action,  one  can  maintain  that  the 
influence  of  class  on  social  and  political  behavior 
(i.e.,  on  some  aspects  of  consumption  and  on  vot¬ 
ing)  usually  takes  the  form  of  mass  action  involv¬ 
ing  aggregates  of  individuals.  Over  time,  however, 
the  impact  of  class  on  the  formation  of  self-aware 
social  and  political  groups  through  community 
action  has  declined. 

Carlo  Trigilia 
University  of  Florence 
Florence,  Italy 
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Cleavages,  Social  and  Political 


The  term  cleavage  identifies  social  and  political 
divisions  characterized  by  a  close  connection 
between  individuals’  positioning  in  the  social 
stratification  system,  their  beliefs  and  normative 
orientations,  and  their  behavioral  patterns.  This 
close  connection  contributes  to  the  resilience  and 
stability  of  cleavages  over  time.  In  this  entry,  the 
constitutive  elements  of  this  concept  are  discussed 
and  brought  into  a  coherent  overall  framework 
that  helps  illuminate  the  political  relevance  of 
such  cleavages. 

The  Concept 

Empirical  evidence  about  individual  actors  in  poli¬ 
tics  concerns  these  three  distinct  dimensions:  social 
structural,  attitudinal,  and  behavioral.  Individuals 
can  be  regrouped  in  social  aggregates  on  the  basis 
of  gender,  residence,  education,  property,  ethnic 
group,  religion,  competence  in  political  matters, 
available  social  relationships,  and  so  on.  Scholars 
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regroup  individuals  in  this  way  because  some  the¬ 
ory  or  previous  research  has  led  them  to  believe 
that  these  social  aggregates  have  an  impact  on  the 
other  two  kinds  of  information  we  can  collect 
about  the  same  individuals:  (1)  their  subjective 
perceptions  and  judgments  about  the  world  of 
politics  and  (2)  their  actual  political  behavior. 

Individuals  in  fact  have  opinions,  preferences, 
and  value  judgments  about  a  large  variety  of 
political  issues,  institutions,  and  actors,  as  well  as 
normative  views  on  good  government,  a  just  soci¬ 
ety,  welfare  and  redistribution,  race  and  ethnicity, 
religion,  and  so  on.  Some  of  these  beliefs  and  val¬ 
ues  are  superficial  and  unstable  and  change  rapidly 
and  frequently,  while  others  are  profound  and 
resilient  and  more  impermeable  to  environmental 
changes.  In  some  cases,  these  various  beliefs  are 
unrelated  to  each  other,  while  in  other  cases,  they 
are  strongly  correlated  and  constitute  clusters  or 
systems  of  beliefs. 

We  also  can  collect  information  about  what 
individuals  do — their  behavior.  People  establish 
wide  or  narrow  social  relationships  (they  marry, 
do  business,  establish  friendships)  in  their  environ¬ 
ment.  Some  individuals  invest  heavily  in  culture 
and  education,  while  others  have  different  con¬ 
sumption  patterns.  Some  cooperate  easily  with 
others  while  others  withdraw.  (See  Table  1.) 
People  vote  or  abstain,  join  parties  or  interest 
groups,  rebel  or  acquiesce,  and,  more  generally, 
engage  or  do  not  engage  in  a  vast  repertory  of 
political  acts. 

Most  sociopolitical  research  is  based  on  the 
attempt  to  investigate  relationships  between  social 
positions  and  aggregates,  beliefs  systems  and  value 
orientations,  and  behavioral  patterns.  While  it  is 
clear  that  objective  social  positions  are  derived 
from  theories  of  social  stratification  criteria  that 
are  independent  of  the  normative  orientations  and 
the  behavioral  patterns  of  the  individuals,  the  rela¬ 
tion  between  beliefs  and  behaviors  is  more  inti¬ 
mate  and  the  direction  of  influence  more  difficult 
to  ascertain.  These  three  sources  of  information 
should,  therefore,  remain  conceptually  distinct  as 
much  as  possible. 

The  literature  frequently  associates  these  three 
sources  of  information  about  individuals  with  dif¬ 
ferent  kinds  of  “cleavages.”’  Objective  social  posi¬ 
tions  are  often  thought  to  be  the  basis  for  “social” 
cleavages  (sometimes  called  “economic”  cleavages); 


Table  I  Types  of  Information  About  Individuals 


“Objective” 

Attribution  of  individuals  to 

social 

“relevant”  social  categories  as 

positions 

defined  by  the  observer 

Beliefs  and 

Subjective  orientations  toward 

value 

political  and  social  objects, 

orientations 

institutions,  and  actors 

Behaviors 

Repertory  of  socially  and  politically 
relevant  behaviors 

beliefs  and  values  are  considered  to  be  the  source 
of  “cultural”  cleavages  and  also  of  “political  cleav¬ 
ages”;  and  political  behavior  is  often  indicated  as 
an  expression  of  political  cleavages.  The  term 
cleavage  is  therefore  frequently  accompanied  by 
qualifying  adjectives  such  as  social,  political,  eco¬ 
nomic,  attitudinal,  behavioral,  cultural,  ideologi¬ 
cal,  structural,  segmental,  institutionalized,  and  so 
on.  This  generates  considerable  theoretical  confu¬ 
sion  and  has  led  to  a  soft  use  of  the  term  cleavage 
as  a  general  concept  that  is  largely  synonymous 
with  the  broad  notion  of  divisions,  oppositions,  or 
conflicts. 

In  fact,  the  historical  divisions  to  which  the  term 
cleavage  originally  was  applied  by  Seymour  Martin 
Lipset  and  Stein  Rokkan  in  reference  to  party  sys¬ 
tem  formation  (center-periphery,  urban-rural, 
class,  religion,  etc.)  included  socioeconomic  bases; 
attitudinal,  ideological,  or,  more  generally,  norma¬ 
tive  bases;  and  behavioral  or  organizational  bases. 
None  of  the  historical  cleavages  was  ever  purely 
“economic”  or  “demographic,”  merely  attitudinal 
or  ideological,  or  exclusively  behavioral.  On  the 
contrary,  they  represented  a  peculiar  combination 
of  these  divisions,  with  a  tendency  to  overlap  and 
to  reinforce  each  other.  Indeed,  the  peculiarity  of 
the  concept  of  “cleavage”  is  its  capacity  to  bridge 
social  with  ideological  or  normative  and  with 
behavioral/organizational  divisions.  In  short,  the 
theoretical  connotation  of  the  concept  of  cleavage 
refers  to  the  combination  of  interest  orientations 
rooted  in  social  structure,  cultural/ideological  ori¬ 
entations  rooted  in  the  normative  system,  and 
behavioral  patterns  expressed  in  organizational 
membership  and  action. 

To  properly  define  and  identify  cleavages, 
therefore,  it  is  necessary  to  locate  them  in  relation 
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to  the  various  kinds  of  “divides”  that  exist  and  are 
continuously  generated  and  redefined  in  our  soci¬ 
eties.  Cleavages  then  emerge  as  a  peculiar  combi¬ 
nation  of  different  types  of  divisions.  This  is 
achieved  by  first  discussing  separately  the  sources 
of  divisions  rooted  in  the  (a)  social  stratification 
system,  (b)  normative  system,  and  (c)  behavioral 
system. 

The  Social  Stratification  System 

Pure  divisions  of  interest  rooted  in  social  stratifica¬ 
tion,  based  on  property,  market  positions,  educa¬ 
tion,  religious  affiliation,  and  so  on,  are  unlikely  to 
constitute  a  sufficient  base  for  the  structuring  of 
cleavages  unless  they  tend  to  overlap  and  reinforce 
each  other.  The  higher  the  frequency  and  the 
degree  of  overlapping  membership  in  different 
social  contexts,  the  more  unlikely  it  is  that  any 
such  social  context  will  be  dominant. 

Individuals  play  a  range  of  different  social  roles, 
some  of  which  imply  competing  and  contradictory 
tasks,  activities,  expectations,  and  values.  Individuals’ 
group  memberships  (in  the  family,  employment, 
friendship,  communication,  and  leisure  activities) 
and  relative  social  positions  (with  respect  to  educa¬ 
tion,  property,  income,  class,  religion,  ethnic  group, 
etc.)  in  modern  societies  are  usually  highly  differen¬ 
tiated,  and  the  chances  are  low  that  many  individu¬ 
als  will  share  the  same  combination  of  stratification 
positions  and  group  membership.  At  the  extreme, 
however,  there  may  be  only  a  few  socially  relevant 
categories,  which  are  characterized  by  mutually 
exclusive  membership,  as  in  a  strongly  stratified 
caste  society  or  a  society  with  strong  ethnic,  reli¬ 
gious,  or  similar  divisions.  Then,  a  high  proportion 
of  agents  act  exclusively  in  one  category  or  social 
group.  This  second  situation  generates  and  exacer¬ 
bates  potential  divisions  among  self-containing 
groups  whose  nonoverlapping  but  reinforcing  mem¬ 
bership  facilitates  common  values  and  perceptions 
but  makes  an  exchange  of  information,  experiences, 
and  so  on  across  groups  much  more  difficult.  This 
accentuates  group  conflicts,  particularly  because 
closed  groups  will  tend  to  claim  the  complete  and 
total  involvement  of  their  members  toward  some 
collective  aspiration  or  design. 

In  low-mobility  and  high-status  contexts,  the 
social  boundaries  of  social  groups  may  be  so 
strong  and  so  institutionally  reinforced  that  exit 


options  at  the  individual  level  are  basically  unthink¬ 
able.  This  reinforces  cultural  solidarity  in  the 
group  and  reduces  individualistic  options  in  favor 
of  collective  choices.  The  more  the  empirical  evi¬ 
dence  approaches  this  situation,  the  greater  the 
likelihood  that  the  similarity  of  social  positions 
and  interests  will  give  birth  to  strong  and  cohesive 
normative  visions  of  the  group’s  role  in  society — 
that  is,  to  a  code  of  collective  identity. 

The  Normative  System 

At  the  core  of  all  codes  of  collective  identity  is  a 
distinction  between  “we”  and  “the  others.”  A 
typology  of  the  symbolic  codes  of  collective  iden¬ 
tity  distinguishes  between  codes  of  primordiality, 
civility,  and  culture. 

Primordial  elements  of  collective  identity  focus 
on  gender,  generation,  kinship,  ethnicity,  and  race 
as  bases  for  constructing  and  reinforcing  the 
boundaries  between  inside  and  outside.  This  code 
of  closure  makes  reference  to  original  and 
unchangeable  distinctions  that  are  by  definition 
exempted  from  communication,  reflexivity,  and 
exchange  among  the  members  of  the  community. 
Such  distinctions  cannot  be  changed  by  voluntary 
action;  they  relate  to  “nature”  and  as  such  provide 
a  firm  and  stable  basis  beyond  the  realm  of  volun¬ 
tary  actions  and  thus  do  not  allow  for  shifting 
involvements.  In  other  words,  primordial  types  of 
codes  basically  “naturalize”  the  constitutive 
boundary  between  insiders  and  outsiders,  and  they 
tend  to  be  exempted  from  social  definition  and 
alteration.  Therefore,  primordial  boundaries  are 
very  difficult  to  change,  and  complex  rituals  to 
cross  these  boundaries  actually  reinforce  and  reaf¬ 
firm  them. 

Identity  boundaries  built  on  the  civility  code  of 
closure  and  exclusion  are  constructed  on  the  basis 
of  familiarity  with  implicit  rules  of  conduct,  tradi¬ 
tions,  and  social  routines  that  define  and  demar¬ 
cate  the  boundary  of  the  community.  Tradition 
and  collective  civic  rituals  are  important  (the 
myths  of  the  founder;  particular  persons,  places, 
and  historical  events  such  as  revolutions;  migra¬ 
tions,  etc.).  Very  often  these  are  embedded  in  the 
constitutional  practices  of  the  public  arena.  There 
are  some  civic  rituals  but  no  peculiar  rituals  of 
initiation,  commitment,  or  confession  to  demar¬ 
cate  the  boundaries.  The  stranger  is  primarily 
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“unfamiliar.”  Civility  codes  allow  a  stronger  dif¬ 
ferentiation  between  the  “public”  and  “private” 
spheres,  between  polity  and  the  market,  and 
between  the  entitlements  to  public  goods,  which 
are  restricted  to  members  of  the  community,  and 
access  to  commodities  exchanged  in  the  market, 
which  are  open  to  everyone  on  the  basis  of  mate¬ 
rial  rewards  and  instrumental  calculation  and 
which,  therefore,  are  identity  free. 

The  third  code  of  collective  identity — the  cul¬ 
tural  code — is  based  on  the  boundary  between 
“us”  and  the  “other”  on  a  particular  relation  of 
the  collective  subject  to  some  fundamental  values. 
The  collectivity  so  defined  on  the  basis  of  values 
related  to  God,  reason,  progress,  rationality,  or  to 
some  sort  of  definition  of  the  ideal  mundane  or 
heavenly  world.  Boundaries  between  members  and 
nonmembers  can  be  crossed  by  communication, 
education,  and  conversion,  and  in  principle  every¬ 
one  is  invited  to  do  so.  The  cultural  code  usually 
represents  a  rupture  between  past  and  present,  in 
strong  opposition  to  primordiality  codes  that 
always  overemphasize  the  past  as  the  source  of 
legitimacy.  The  openness  of  the  boundary  is  com¬ 
pensated  by  stratified  access  to  the  center  by  rituals 
of  initiation. 

Primordial,  civic,  and  cultural  codes  of  identity 
can  be  ranged  in  terms  of  difficulty  in  crossing  the 
boundary  of  the  identified  community;  that  is,  to 
exit  and  to  enter  the  social  relationship.  We  go 
from  the  exclusive  codes,  based  on  primordial 
traits,  to  the  inclusive  and  cultural  codes,  while 
civility  codes  are  located  in  an  intermediate  posi¬ 
tion  between  these  two  extremes. 

The  level  of  identity,  solidarity,  and  loyalty  gen¬ 
erated  by  different  codes  entails  consequences  for 
the  allocation  of  resources  and  for  the  definition  of 
the  entitlements  of  the  members  of  the  collectivity 
as  distinct  from  those  of  outsiders.  Groups’  identi¬ 
ties  differ  with  respect  to  what  entitlements  are 
distributed  in  the  name  of  the  collectivity  to  its 
members  and  the  mode  of  distributing  those  enti¬ 
tlements  (hierarchical  vs.  egalitarian). 

The  Behavioral  System 

Collective  action  is  undertaken  by  social  move¬ 
ments,  interest  groups,  and  politico-electoral  orga¬ 
nizations  or  political  parties.  The  levels  of  rigidity/ 
permeability  of  the  social  stratification  system  and 


of  the  normative  closure  of  the  identity  code  affect 
the  predominant  form  of  political  behavior  and  the 
structure  of  political  agencies. 

There  is  no  one-to-one  relationship  between 
codes  of  closure  and  types  of  organizational  mem¬ 
bership.  However,  some  predominant  associations 
can  be  specified  on  the  basis  of  the  synthetic  scheme 
of  organizational  membership  variations  presented 
in  Table  2.  Forms  of  organized  political  action  can 
be  ranged  along  four  dimensions.  First,  they  can  be 
characterized  by  different  degrees  of  territorial  cen¬ 
tralization/decentralization  (national  vs.  regional  or 
local  forms).  Second,  forms  of  organized  political 
action  can  be  analyzed  in  terms  of  their  degree  of 
“segmentation”  along  cultural  dividing  lines  (as, 
e.g.,  in  the  case  of  religious  and  ethno-linguistic 
divisions).  Third,  they  can  be  organized  by  level  of 
“fragmentation”  along  ideological  dividing  lines 
(as,  e.g.„  the  political  Left-Right  fragmentation). 
Finally,  different  forms  of  organized  political  action 
can  be  studied  in  terms  of  the  level  of  their  “orga¬ 
nizational  interlocking”  with  other  forms — for 
instance,  the  more  or  less  interlocking  relationships 
between  political  parties,  unions,  interest  groups, 
or  sociopolitical  movements. 

The  presence  of  primordial  codes  of  closure  is 
likely  to  generate  territorially  decentralized  or  cul¬ 
turally  fragmented  political  organizations  and 
social  movements  within  a  country.  Civility  codes 
in  their  pure  form  are  more  likely  to  be  associated 
with  the  ideological  fragmentation  of  political 
action,  where  divisions  concern  primarily  the  level, 
type,  and  quality  of  resources  distributed  within 
the  collectivity  and  the  access  to  public  goods. 
However,  they  can  occasionally  be  associated  with 
decentralized  forms  of  political  action  resting  on 
the  specific  civic  traditions  of  certain  territories. 
Cultural  codes  are  more  encompassing  and  are 
conducive  to  both  cultural  segmentation  (e.g.,  reli¬ 
gious  mobilization)  and/or  ideological  fragmenta¬ 
tion  (progress  vs.  conservatism,  change,  and  tradi¬ 
tion).  Well-integrated  territorial  polities  with  no 
internal  primordial  groups  and  with  high  civility 
and  cultural  homogeneity  tend  to  produce  political 
organizations  that  more  closely  mirror  the  ideo¬ 
logical  differences  between  social  and  economic 
interest-based  groups. 

Therefore,  membership-  and  action-based  divi¬ 
sions  may  be  organizationally  expressed  by  ter¬ 
ritorial  decentralization,  cultural  segmentation, 
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Table  2  A  Map  of  Political  Structuring  Variations  in  Organizational  Membership 
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Source:  Adapted  from  Bartolini,  S.  (2005).  Restructuring  Europe  (p.  103).  Oxford,  UK:  Oxford  University  Press. 


and  ideological  fragmentation  and  by  different 
degrees  of  interlocking  relations  between  specific 
sociopolitical  movements,  corporate,  and  elec¬ 
toral  organizations. 

Types  of  Divisions  and  Cleavages 

At  this  stage,  a  general  typology  of  all  possible 
divisions  and  the  place  that  cleavages  occupy  in 
such  a  typology  can  be  achieved  by  combining  the 
elements  of  the  separate  dimensions  of  the  social 
stratification,  normative,  and  behavioral  systems. 
This  is  presented  in  Table  3,  which  identifies  the 
three  bases  for  all  sorts  of  conflicts  and  opposi¬ 
tions.  They  correspond  to  the  interest  orientation 
linked  to  the  social  stratification  system,  the  cul¬ 
tural/ideological  orientation  linked  to  the  norma¬ 
tive  system,  and  the  organizational  base  linked  to 
the  behavioral  system.  Taken  separately,  these 
oppositions  and  conflicts  identify  simple  divisions. 
They  generate  compound  divisions  when  they 
interact  and  cumulate.  In  line  with  the  definition  at 
the  beginning  of  this  entry,  cleavages  are  identified 
as  a  special  type  of  complex  division  that  combines 
these  three  bases  and  roots. 

Simple  Divisions 

Some  divisions  are  based  simply  on  processes  of 
interest  differentiation  and  perceptions  of  interest 


similarity.  Though  these  interest  divisions  are 
mainly  categorized  as  statistical  or  analytical,  they 
may  also  occasionally  generate  conflicts  and  oppo¬ 
sitions.  Cultural  divides  exist  among  social  identi¬ 
ties  linked  to  various  types  of  civility  and  cultural 
codes  (primordial  codes  are  by  definition  rooted  in 
the  social  structure  via  the  stigmata  of  the  group), 
and  they  have  proved  their  historical  resilience 
even  in  situations  unsuitable  for  their  behavioral 
expression  (e.g.,  repression).  Finally,  we  can 
observe  conflicts  and  oppositions  that  are  mainly 
related  to  the  behavioral  divisions  among  various 
types  of  voluntary  organizations  and  that  are  not 
linked  to  divisions  rooted  in  either  the  social  struc¬ 
ture  or  in  the  normative  cultural  system. 

However,  none  of  these  “simple  divides”  is  likely 
to  provide  long-lasting  sources  and  bases  for  the 
political  structuring  of  the  polity  unless  they  are 
combined.  Interest  divisions  are  unlikely  to  over¬ 
come  the  free  riding  problem  and  the  cost  of  orga¬ 
nizational  mobilization  without  normative  or  orga¬ 
nizational  foundations.  Interest  orientations  need  to 
be  “generalized”  and  somehow  made  “universal” 
through  an  idealized  appeal  to  solidarity.  Similarly, 
pure  membership  divisions  are  unlikely  to  maintain 
the  organizational  cohesion  of  the  membership 
groups  only  by  distributing  material  incentives  and 
resources.  Even  the  most  scandalous  and  corrupt 
“spoils”  party  or  organization  is  likely  to  require 
some  cultural  and/or  ideological  underpinning  to 
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Table  3  Types  of  Divisions 

Simple  Divides 
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divides 

Cultural 

divides 
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divides 

Corporate 

divides 

Social 

divides 

Political 

divides 

Cleavages 

Source:  Adapted  from  Bartolini,  S.  (2005).  La  formation  des  clivages  [The  formation  of  cleavages].  Revue  Internationale  de 
Politique  Comparee,  12,  32. 


survive  and  stabilize  over  time.  Cultural  divisions 
may  be  the  only  ones  that  are  able  to  survive 
“underground”  with  weak  interaction  bases  and 
loose  organizational  support. 

Compound  Divisions 

The  most  important  divisions  are,  therefore, 
“compound,”  and  they  are  the  only  ones  able  to 
generate  the  forms  of  organized  political  action  we 
usually  observe  as  social  and  political  movements, 
interest  organizations,  and  political  parties. 

The  combination  of  interest  orientations  with 
organizational  membership  may  be  sufficient  to 
generate  highly  contingent  and  goal-oriented  orga¬ 
nizational  cooperation  without  necessarily  being 
rooted  in  strong  cultural  orientations.  Indeed,  this 
type  of  division  may  allow  a  considerable  amount 
of  within-group  differentiation  in  many  other 
domains  different  from  the  one  in  which  the  orga¬ 
nizational  cooperation  is  generated.  This  is  the 
typical  and  elementary  form  of  interest  (or  pres¬ 
sure)  group  formation. 

Social  divisions  are  based  on  the  combination  of 
interest  orientations  rooted  in  the  social  structure 
with  stronger  normative  orientations  concerning 
the  identification  of  the  group  and  its  role  in  soci¬ 
ety.  In  this  type  of  division,  the  social  determina¬ 
tion  is  less  strong  than  in  corporate  divisions.  The 
distinctiveness  of  social  roots  may  be  more  or  less 
fundamental  but  the  normative  orientation  is  usu¬ 
ally  essential  and  decisive.  In  this  case,  cooperation 
is  not  based  on  specific  and  well-defined  interests, 


and  the  organizational  capacity  is  weak,  intermit¬ 
tent,  not  consolidated,  or  sometimes  nonexistent. 
The  normative  orientation  is  therefore  essential  to 
the  definition  of  the  group  identity.  This  seems  to 
be  the  typical  form  for  the  emergence  of  social 
movements. 

The  ideal  type  of  political  division  is  expressed 
via  a  combination  of  cultural/ideological  orienta¬ 
tions  with  a  strong  organizational  dimension.  The 
ideational  solidarity  takes  the  form  of  common  but 
general  goals  to  be  achieved  in  the  polity  through 
the  organizational-ideological  competition  for  the 
attainment  of  binding  decisions.  These  divisions 
are  more  institutionalized  and  strongly  embedded 
in  the  institutional  structure  of  the  polity.  The 
organizational  dimension  gives  them  solidity  and 
stability  over  time,  which  may  enable  them  to  sur¬ 
vive  even  when  the  original  cultural  and  ideologi¬ 
cal  bases  fade  way.  Political  parties  are  a  typical 
expression  of  such  political  divisions. 

Cleavages 

Finally,  “cleavages”  are  a  type  of  division  that 
incorporates  the  three  elements  of  a  clear  interest 
orientation,  a  clear-cut  normative  vision,  and  a 
strong  organizational  basis.  Cleavages  encompass 
corporate  and  social  divisions  with  political  ones. 
Cleavages  show  a  remarkable  degree  of  resilience 
over  time.  They  are  not  transient  and  intermittent 
but  durable,  entrenched,  and  entrapped.  Given 
their  high  degree  of  organizational  infrastructure 
and  institutionalization,  they  often  endure  over 
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and  beyond  the  interests  and  the  normative  orien¬ 
tations  that  historically  gave  birth  to  them. 
Cleavage  lines  manifest  themselves  through  an 
intense  interlocking  relationship  between  corpo¬ 
rate  interest  organizations,  social  movements,  and 
specific  electoral  organizations.  Actually,  the  dis¬ 
tinctive  characteristic  of  cleavages  as  opposed  to 
all  other  divisions  is  that  they  are  characterized  by 
a  dense  and  profound  overlapping  of  membership 
and  exchange  of  personnel  and  of  resources  among 
social  movements,  corporate  organizations,  and 
electoral  organizations.  From  the  organizational 
point  of  view,  the  specificity  of  cleavage  structures 
is  indeed  their  vertical  integration  of  different 
forms  of  organizational  action. 

As  bases  of  a  cleavage,  territorial  oppositions  are 
more  solidly  entrenched  than  cultural  segmenta¬ 
tions,  and  the  latter  are  more  solidly  entrenched 
than  ideological  fragmentations.  As  argued  previ¬ 
ously,  territorial  oppositions  are  rooted  in  either 
primordial  or  civility  codes  of  exclusion,  and  these 
have  proved  to  be  extraordinarily  resilient  and 
impermeable.  Cultural  codes  are  by  far  more  open 
and  permeable  and  therefore  both  cultural  segmen¬ 
tations  and  ideological  fragmentations  rest  on  roots 
that  are  historically  more  malleable  and  change¬ 
able.  For  example,  compared  with  their  impact  on 
ethno-linguistic  community  definitions,  it  is  more 
likely  that  socioeconomic  and  technological  changes 
make  social  class  positions  more  permeable  and 
denominational  affiliations  less  significant. 

Stefano  Bartolini 
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San  Domenico  di  Fiesole,  Italy 
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Clientelism 


Clientelism  involves  a  particularistic  exchange 
between  patrons  and  clients  in  which  resources  of 
some  kind  are  traded  in  a  mutually  beneficial  way. 
Both  parties  are  supposed  to  benefit  from  the 
exchange,  but  it  involves  actors  of  unequal  status. 
The  patrons,  for  example,  landlords  or  local 
notables,  are  the  more  powerful  actors  in  the 
exchange.  They  possess  privileged  access  to 
resources,  such  as  land,  which  is  denied  to  their 
clients,  typically  poor  peasants,  migrants,  and 
other  marginalized  groups.  The  patrons  release 
these  resources  in  a  selective  way  that  directly  ben¬ 
efits  the  clients,  and  in  return,  they  expect  their 
help,  support,  or  allegiance.  These  exchanges 
eventually  produce  an  enduring  social  relation¬ 
ship,  typical  for  traditional  societies,  governed  by 
a  sense  of  obligation  and  loyalty  that  binds  the 
client  to  the  patron.  In  modern  political  settings, 
clientelism  is  associated  with  the  selective  use  of 
public  resources  in  the  electoral  process.  The 
patrons,  who  can  be  individual  politicians  or 
political  parties,  selectively  release  various 
resources — jobs  in  state  administration,  titles,  pen¬ 
sion  schemes,  or  building  projects — to  individuals 
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or  groups  in  order  to  “purchase”  their  vote.  This 
is  an  understanding  dominant  in  the  political  sci¬ 
ence  literature  where  clientelism  is  seen  as  a  type 
of  representation  or  of  political  mobilization  that 
differs  from  both  universalistic  programmatic 
appeals  and  appeals  based  purely  on  personality 
traits  and  charisma.  This  entry  focuses  on  this 
understanding  of  clientelism.  The  first  part  dis¬ 
cusses  the  concept  of  clientelism,  in  particular  its 
relation  to  other  closely  related  concepts,  such  as 
patronage,  corruption,  and  pork  barrel.  The  sec¬ 
ond  part  discusses  the  relevance  of  clientelism  in 
contemporary  political  process. 

Identifying  Clientelism 

The  passage  from  traditional  to  fully  mobilized 
mass  democratic  societies  brought  about  substan¬ 
tial  changes  in  patron-client  relationships.  The 
traditional  clientelistic  linkage,  characterized  by 
face-to-face  contact  that  is  managed  by  a  powerful 
person,  has  been  replaced  by  exchanges  in  which 
an  organization,  the  political  party,  has  become 
the  key  distributor  of  goods  and  services  to  the 
clients.  The  individual  patron-client  ties  of  local 
notables  and  powerful  landlord  has  thus  eventu¬ 
ally  evolved  into  the  clientelism  managed  by  com¬ 
plex  political  organizations.  However,  studies  on 
the  subject  have  long  suffered  from  a  high  degree 
of  conceptual  vagueness  and  ambiguity.  This  prob¬ 
lem  is  particularly  evident  in  the  use  of  the  con¬ 
cepts  of  patronage  and  clientelism,  which  are  seen 
either  as  the  same  phenomenon  or,  in  contrast,  as 
two  different  phenomena  but  without  any  expla¬ 
nation  of  what  differentiates  the  two. 

Patronage  and  clientelism  do  fundamentally 
denote  the  same  type  of  exchange  but,  as  Simona 
Piattoni  (2001)  points  out  in  the  introduction  to 
her  edited  volume,  the  latter  is  a  far  more  penetrat¬ 
ing  phenomenon.  The  difference  between  the  two 
lies  in  the  objects  of  exchange  (i.e.,  the  type  of 
resources  that  are  used  in  the  trade)  and  in  the 
goals  of  the  exchange  (i.e.,  what  the  deal  is  sup¬ 
posed  to  achieve).  Regarding  the  object  of 
exchange,  the  key  feature  of  patronage  is  the  dis¬ 
cretional  allocation  of  public  jobs  by  (party)  politi¬ 
cians.  The  recipients  of  these  jobs  are  normally 
assumed  to  be  party  leaders,  party  activists,  influ¬ 
ential  party  backers,  or  party  voters,  though  the 
fact  that  politicians  have  power  for  discretional 


hiring  is  more  important  for  defining  patronage 
than  the  characteristics  of  appointees.  In  other 
words,  patronage  should  not  necessarily  imply 
that  appointees  are  exclusively  party  members  or 
party  supporters,  though  in  reality  it  has  always 
been  associated  with  rewarding  individuals  or 
groups  who  have  played  an  important  role  in  party 
life  and  party  strategy.  Concerning  the  goal  of  the 
exchange,  parties  or  politicians  are  typically  seen 
to  allocate  jobs  in  order  to  secure  supporters’ 
votes.  However,  given  that  most  modern  polities 
are  best  characterized  as  having  “too  few  jobs  and 
too  many  votes,”  the  use  of  patronage  as  a  way  of 
directly  securing  electoral  support  is  unlikely  to  be 
a  very  feasible  strategy.  Instead,  patronage  has 
always  served  mainly  organizational  and  govern¬ 
mental  goals,  such  as  entrenching  party  or  politi¬ 
cians’  networks  within  the  public  sector,  serving  as 
an  inducement  for  building  and  maintaining  party 
organizations  by  offering  selective  incentives  to 
party  activists  in  exchange  for  their  work  for  the 
party,  or  ensuring  the  effectiveness  of  (party)  gov¬ 
ernment  by  controlling  the  process  of  policy  mak¬ 
ing  and  policy  implementation  through  the  appoint¬ 
ment  of  like-minded  individuals  to  key  public 
positions.  This  understanding  of  patronage — as  a 
distribution  of  public  jobs — coincides  with  the 
Anglo-Saxon  use  of  the  term.  In  the  American  lit¬ 
erature,  for  example,  it  is  widely  used  to  describe 
the  politics  of  (city)  machines  in  which  bosses 
handed  out  patronage  jobs  to  shore  up  their  cam¬ 
paign  organizations;  it  is  also  employed  to  describe 
the  wholesale  changes  in  administration  following 
presidential  elections,  sometimes  referred  to  as  the 
“spoils  system.” 

In  contrast  to  patronage,  clientelism  should 
essentially  be  considered  an  electoral  tool  in  which 
benefits  are  delivered  to  obtain  the  recipients’  vote. 
In  fully  mobilized  mass  societies,  clientelism 
denotes  an  exchange  between  a  political  party  on 
the  one  hand  and  individuals  or  groups  on  the 
other  hand,  in  which  divisible  benefits  are  offered 
directly  and  on  a  wide  scale  to  tie  supporters  to  the 
party.  The  object  of  these  exchanges  can  of  course 
be  public  jobs,  but  this  is  unlikely  to  secure  any 
major  electoral  advantage  to  patrons,  as  argued 
above.  Clientelism  most  often  includes  distribution 
of  a  wide  variety  of  other  benefits,  ranging  from 
food  and  medicines  to  a  pension  or  a  low-interest 
loan,  access  to  housing,  or  award  of  selective 
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development  projects  in  a  particular  area.  There¬ 
fore,  in  contrast  to  patronage,  clientelism,  espe¬ 
cially  in  its  mass  society  incarnation,  requires  other 
methods  of  reaching  the  clients  than  just  a  distribu¬ 
tion  of  a  handful  of  state  jobs.  These  methods  are 
often  fairly  bureaucratized  and  based  on  imper¬ 
sonal  measures,  such  as  when  the  incumbents  pass 
a  decree  favoring  a  particular  category  of  persons, 
such  as  taxi  drivers,  teachers,  or  small  shopkeep¬ 
ers.  Clientelism  also  implies  an  asymmetrical  nature 
of  the  linkage,  which  takes  place  between  actors  of 
different  status  and  power.  Even  when  both  sides 
accrue  benefits  and  both  may  perceive  the  exchange 
as  mutually  beneficial,  the  clientelistic  linkage 
often  entails  an  element  of  inequality.  This  is  also 
why,  empirically,  clientelism  is  most  often  found  in 
the  context  of  widespread  urban  and  rural  poverty 
and  inequality.  In  contrast,  distribution  of  patron¬ 
age  jobs  to  party  activists  or  key  party  benefactors 
is  likely  to  involve  a  far  more  symmetrical  relation¬ 
ship  of  power,  at  least  in  that  the  recipients  will  be 
less  dependent  on  obtaining  these  jobs,  and  their 
social  status  will  often  be  equal  to  that  of  their 
political  patrons. 

Pork  barrel  politics  is  normally  subsumed  as  a 
subtype  of  either  clientelism  or  patronage.  Yet  it  is 
a  distinct  practice  that  denotes  tactical  allocation  of 
government  funds,  usually  in  the  form  of  legislation 
on  public  works  projects,  to  favor  specific  constitu¬ 
encies.  Susan  Stokes  (2007)  distinguishes  pork  bar¬ 
rel  from  clientelism  on  the  basis  of  their  different 
distributive  criterion.  While  the  distributive  crite¬ 
rion  of  clientelism  is  “Did  you  (will  you)  vote  for 
me?”,  the  implicit  criterion  in  the  distribution  of 
pork  barrel  politics  is  “Do  you  live  in  my  district?” 
In  other  words,  while  clientelism  entails  a  benefit 
for  particular  individuals  or  groups,  pork  barrel 
implies  that  a  whole  constituency  is  favored  by  a 
public  policy  decision.  Although  the  goal  of  both 
clientelism  and  pork  barrel  politics  is  to  obtain  the 
recipients’  electoral  support,  they  also  differ  in  that 
the  element  of  exploitation  and  inequality  that  char¬ 
acterizes  the  former  is  absent  in  the  latter.  That  is 
probably  the  reason  why  these  two  practices  are 
viewed  differently  in  normative  terms.  Politicians 
who  deliver  goods  and  services  on  a  clientelistic 
basis  usually  try  to  keep  it  as  a  secret  matter 
between  them  and  the  clients.  In  contrast,  politi¬ 
cians  who  manage  to  pass  pork  barrel  legislation 
are  often  eager  to  present  them  as  a  political  asset. 


Indeed,  both  patronage  and  clientelism  are 
nearly  always  seen  as  corrupt  or  morally  question¬ 
able  practices,  even  though  corruption  is  empiri¬ 
cally  and  conceptually  a  distinct  phenomenon. 
Political  corruption  denotes  illegal  public  decisions 
taken  by  parties  or  individual  politicians  to  obtain 
financial  resources,  such  as  when  parties  may 
favor  firms  by  handing  over  the  concession  of  a 
public  utility  or  permitting  the  development  of  an 
economic  activity  without  the  fulfillment  of  all 
legal  requirements  and  demand,  in  exchange, 
financial  kickbacks  to  party  coffers.  Clientelism 
and  patronage  can  undoubtedly  lead  into  corrup¬ 
tion,  such  as  when  discretionally  appointed  party 
hacks  use  the  state  office  for  personal  enrichment 
through  demanding  bribes  or  when  politicians  try 
to  obtain  resources  for  their  clientelistic  policies 
through  illegal  dealings  with  economic  firms.  In 
that  sense,  both  patronage  and  clientelism  can 
breed  corruption,  which  might  explain  why  the 
public  discourse  in  many  postcommunist  countries 
refers  to  clientelism  in  connection  with  the  pene¬ 
tration  of  the  state  by  organized  crime  and  by 
other  rent-seeking  groups.  However,  patronage 
appointments  are  often  legally  sanctioned  and 
above  board,  and  hence  not  corrupt;  similarly, 
many  of  the  clientelistic  deals  between  patrons  and 
their  voters  and  supporters  fit,  even  if  uneasily, 
within  the  legal  remit  of  policy  and  administrative 
decision  making.  Importantly,  clientelistic  ex¬ 
changes  are  never  directly  aimed  at  obtaining  mon¬ 
etary  goods  on  the  part  of  the  patrons. 

To  recap,  clientelism  and  patronage,  though 
closely  related,  can  usefully  be  distinguished  on  the 
basis  of  the  goals  and  objects  of  the  exchange. 
Clientelism — a  particularistic  form  of  political  and 
electoral  mobilization — is  by  definition  more  pene¬ 
trating  than  party  patronage,  usually  reaching 
larger  numbers  of  people  and  covering  wider  ranges 
of  exchanges.  However,  patronage — discretional 
allocation  of  jobs — is  the  necessary  (but  not  suffi¬ 
cient)  condition  for  the  emergence  of  clientelism,  for 
it  is  mainly  due  to  their  ability  to  control  public  jobs 
that  parties  and  politicians  are  able  to  manipulate 
administrative  processes  in  order  to  deliver  targeted 
divisible  benefits  to  their  clients.  In  other  words, 
insofar  as  parties  or  politicians  do  not  control  at 
least  the  key  jobs  in  the  public  administration,  they 
will  hardly  be  in  the  position  to  develop  large-scale 
“clientelistic”  exchanges.  This  is  why  patronage 
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and  clientelism  will  always  remain  closely  related  in 
empirical  reality,  in  spite  of  the  fact  that  handing 
out  state  jobs  will  lead  to  tangible  electoral  returns 
only  in  very  small  democracies  or  in  countries  with 
limited  suffrage  and  partial  mobilization. 

Clientelism  in  Contemporary  Democracies 

Traditional  clientelistic  practices  have  usually  been 
seen  as  a  product  of  early-modern  development 
and  to  be  particularly  important  in  rural  and  less 
economically  well-developed  regions  and  in  politi¬ 
cal  systems  in  developing  countries.  For  numerous 
reasons,  they  were  expected  to  disappear  in  the 
process  of  political  and  economic  modernization 
and  development  of  mass  party  organizations  that 
characterized  the  20th  century.  First,  the  individu¬ 
alized  processes  of  vertical  integration  and  mobili¬ 
zation  associated  with  patron-client  ties  were 
undermined  by  the  forms  of  horizontal  mobiliza¬ 
tion  and  integration  promoted  by  class  politics  and 
cleavage  structuring.  In  other  words,  as  a  society 
developed  mass  electoral  alignments,  the  scope  for 
individually  based  networks  of  supporters  became 
more  limited.  Second,  as  the  mass  party  itself  took 
over  from  premodern  and  cadre  parties,  and  as 
appeals  based  on  programs  and  ideology  replaced 
those  based  on  more  personalized  political  repre¬ 
sentation,  it  also  became  more  difficult  to  sustain 
patron-client  links,  and  especially  to  build  a  dis¬ 
tinctive  clientele.  Third,  as  polities  modernized  and 
professionalized,  meritocratic  systems  of  advance¬ 
ment  became  more  acceptable  and  widespread, 
and  hence  objective  rules,  exams,  and  qualifica¬ 
tions  replaced  favors,  friendships,  and  networks  in 
the  process  of  career  building.  Here  too,  then,  the 
scope  for  patronage  as  one  facet  of  clientelism 
became  limited,  in  this  case  as  a  result  of  a  short¬ 
age  of  supply.  Fourth,  as  economies  modernized, 
local  markets,  especially  in  poor  rural  areas, 
became  better  integrated  into  larger  regional, 
national,  or  even  supranational  units.  The  mem¬ 
bers  of  these  initially  isolated  communities  entered 
into  commercial  and  social  ties  with  persons  in 
other  places,  which,  in  turn,  diminished  demand 
for  patrons  to  advance  their  interests  at  the  center. 

Finally,  as  societies  also  became  richer  and  more 
educated,  the  demand  for  particularized  benefits 
or  favors  diminished,  while  citizens  also  became 
more  confident  of  their  own  ability  to  deal  with 


the  bureaucracy.  In  this  sense,  citizens  began  to 
handle  their  own  affairs  more  effectively  and  hence 
have  less  need  of  a  patron  to  work  on  their  behalf. 
As  a  result,  the  deals  between  patrons  and  clients 
also  became  increasingly  difficult  to  monitor  and 
enforce.  Now  it  was  no  longer  possible  to  scrutinize 
clients’  behavior  through  their  natural  seclusion  in 
relatively  small  and  homogeneous  communities;  in 
addition,  the  increased  mobility  and  education 
among  the  electorates  made  the  enforcement  of  cli¬ 
entelistic  deals  nearly  impossible. 

All  of  this  implies  a  more  or  less  secular  process, 
whereby  clientelism  becomes  steadily  eliminated  in 
favor  of  collective  representation  and  accountabil¬ 
ity.  The  key  actors  in  this  process  are,  of  course,  the 
emerging  mass  parties,  which  encourage  horizontal 
rather  than  vertical  integration  and  which  promote 
the  provision  of  universal  rather  than  particularis¬ 
tic  benefits.  However,  this  is  not  to  suggest  that 
clientelism  did  not  survive  the  advent  of  the  mass 
party.  In  many  countries,  for  example,  Italy, 
Austria,  Ireland,  and  Japan,  mass  parties  have  also 
been  known  to  adapt  these  practices  as  a  means  of 
ensuring  their  own  electoral  survival.  Rather  than 
making  a  full  leap  from  traditional  clientelistic  to 
programmatic  politics,  as  was  widely  expected  in 
the  1950s  and  1960s,  the  mass  parties  in  these 
countries  instituted  a  relatively  bureaucratized  and 
impersonalized  system  of  clientelism,  which  com¬ 
bined  particularistic  politics  with  the  more  univer- 
salistic  appeals.  This  was  no  longer  the  personalistic 
patronage  system  based  on  organic  ties  between 
patrons  and  clients  but  still  a  system  in  which  a 
large  proportion  of  public  services  and  jobs  was 
distributed  to  selective  clienteles  in  an  attempt  to 
build  political  support  and  win  elections. 

What  is  clear,  however,  is  that  clientelistic  prac¬ 
tices  in  general  have  gradually  become  more  diffi¬ 
cult  to  sustain  in  the  era  of  the  mass  party  and 
hence  proved  more  exceptional  than  conventional 
in  modern  democracies.  Indeed,  bureaucratic  cli- 
entelism  became  difficult  to  maintain  even  in  the 
advanced  democracies  in  which  it  functioned  well 
into  the  1970s  and  1980s.  This  happened  mainly 
for  two  reasons.  First,  parties  and  politicians  today 
face  more  and  more  constraints  on  acquiring  clien¬ 
telistic  resources.  With  the  shrinking  of  the  public 
sector  due  to  privatization  and  marketization  on 
the  one  hand,  and  increasing  exposure  of  domestic 
economies  to  pressures  of  globalized  markets  on 
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the  other  hand,  it  is  now  more  difficult  for  politi¬ 
cians  to  obtain  resources  for  greasing  the  clientelis- 
tic  system  without  compromising  performance  of 
the  economy  and  inflow  of  investments.  Second, 
the  shrinking  of  resources  is  also  related  to  the 
vigilance  of  mass  media  and  civil  society  groups  in 
reporting  clientelistic  and  other  abusive  state  prac¬ 
tices,  on  the  one  hand,  and  the  increased  willing¬ 
ness  of  judicial  authorities  to  punish  them,  on  the 
other.  These  may  not  be  universally  effective  in  all 
countries,  but  it  is  clear  that  anticorruption  pro¬ 
grams  together  with  various  measures  to  limit  the 
politicization  of  the  public  sector  have  constrained 
the  range  of  options  that  clientelistic  parties  and 
politicians  can  exploit  at  their  discretion.  All  in  all, 
therefore,  whether  one  looks  at  the  supply  or 
demand  side,  in  contemporary  modern  polities,  the 
tide  seems  to  operate  against  clientelism. 

Does  this  mean  that  clientelism  is  irrelevant  in 
contemporary  societies?  If  one  looks  at  the  upsurge 
of  works  on  clientelism  in  the  recent  decade,  such 
as  Piattoni  (2001)  and  Herbert  Kitschelt  and 
Steven  Wilkinson  (2007),  the  answer  must  be  a 
resounding  no.  These  new  studies  suggest  that  far 
from  being  dead,  clientelism  is  likely  to  be  an 
enduring  feature  of  politics  and  to  encourage 
inquiries  into  the  conditions  that  favor  or  con¬ 
strain  clientelism  in  contemporary  democracies. 
This  is  in  no  small  part  thanks  to  the  recent  waves 
of  democratization  in  Latin  America,  South  Asia, 
Eastern  Europe,  and  Africa.  In  many  of  these  emerg¬ 
ing  new  democracies,  clientelistic  practices  were 
well-entrenched  phenomena  prior  to  democratiza¬ 
tion  and  hence  were  always  likely  to  play  a  role  in 
the  quest  for  power  and  votes  after  the  (re)entry  of 
competitive  politics.  These  newly  democratized  poli¬ 
ties  are  often  marked  by  the  emergence  of  weakly 
structured  parties  and  party  systems  that  are  unlikely 
to  attract  voters  (only)  on  the  basis  of  coherent  pro¬ 
grammatic  packages.  Recent  developments  in  the 
patterns  and  processes  of  politics  in  advanced  indus¬ 
trial  democracies  also  suggest  at  least  a  partial 
rethinking  of  the  familiar  assumption  about  the  rela¬ 
tive  insignificance  of  clientelistic  practices  as  a  mode 
of  representation.  It  can,  for  example,  be  argued  that 
the  declining  intensity  of  ideological  differences 
across  parties  in  many  modern  democracies  may 
encourage  cultivation  of  clientelistic  links  at  the 
grassroots  level  at  the  same  time  as  it  encourages  a 
more  personalized  style  of  leadership  at  the  top. 


In  other  words,  whether  one  looks  at  contem¬ 
porary  new  or  long-established  democracies,  it 
seems  that  political  clientelism  has  received  a  fresh 
impetus  and  hence  might  prove  to  be,  somewhat 
paradoxically,  the  product  of  modernization  rather 
than  its  victim.  Few  case  studies  aside,  however, 
there  is  still  a  lack  of  solid  cross-national  empirical 
evidence  that  would  probe  into  these  theoretical 
expectations  concerning  the  resurgence  of  clien- 
telism  and  show  how  important  it  is  as  a  mode  of 
citizen-politician  linkage.  The  evidence  is  clearer 
with  respect  to  patronage,  which  was  distinguished 
here  from  clientelism  on  the  basis  of  the  type  of 
resources  that  are  traded  and  the  goals  that  it 
serves  to  politicians.  Like  clientelism,  patronage 
politics  can  also  prove  resilient  and  adaptable. 
Indeed,  patronage  is  still  likely  to  prove  an  effec¬ 
tive  strategy  for  dealing  with  problems  of  party 
organization  and  party  building,  particularly  as 
the  traditional  representational  links  between  par¬ 
ties  and  society  become  weaker.  In  other  words,  as 
modern  parties  become  more  entwined  within  the 
institutions  of  the  state,  and  as  they  lose  their  tra¬ 
ditional  grounding  within  the  wider  society, 
patronage  can  become  a  key  resource  in  anchoring 
the  party  presence  within  the  political  system  and 
in  controlling  flows  of  communication.  Through 
the  appointment  of  party  personnel  to  key  agencies 
and  institutions,  parties  can  hope  to  gain  an  over¬ 
sight  of  the  likely  demands  posed  to  political  lead¬ 
ers  as  well  as  of  the  likely  policies  and  programs 
that  are  needed  to  meet  these  demands.  Patronage 
can  therefore  compensate  for  otherwise  decaying 
organizational  networks. 

Several  studies  focusing  on  both  new  (e.g.,  Anna 
Grzymala-Busse,  2007;  Conor  O’Dwyer,  2006) 
and  established  democracies  (e.g.,  Guy  Peters  & 
Jon  Pierre,  2001)  have  effectively  shown  that  polit¬ 
ical  use  of  public  jobs  is  a  rampant  practice  in  many 
of  these  polities;  these  studies  also  ask  critical  ques¬ 
tions  about  factors  that  might  explain  cross¬ 
national  differences  in  the  pervasiveness  of  patron¬ 
age  practices.  However,  it  is  probably  fair  to  say 
that  political  science  literature  has  dealt  with  the 
development  of  patronage  networks  within  the 
state  usually  only  in  passing,  under  the  banner  of 
party  government  (see,  e.g.,  Jean  Blondel  & 
Maurizio  Cotta,  1996).  Moreover,  as  Dominic 
Bearfield  (2009)  correctly  points  out,  the  negative 
connotation  associated  with  patronage  has  led 
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academics  to  understand  patronage  as  corrupt  and 
pathological,  and  hence  something  either  undesir¬ 
able  or  difficult  to  study  empirically.  It  is  for  these 
reasons  that  many  excellent  studies  dealing  with 
patronage  appointments  can  be  found  in  public 
administration  rather  than  in  political  science  lit¬ 
erature,  even  though  the  specific  concern  of  that 
literature  is  usually  less  with  the  role  of  parties  and 
politicians  in  the  process  of  patronage  appoint¬ 
ments  and  more  with  the  impact  that  it  has  on  the 
performance  of  bureaucratic  organizations.  Largely 
as  a  result  of  this  disj  tincture  between  public 
administration  and  political  science  analytical 
objectives,  here  too  we  still  lack  solid  cross¬ 
national  evidence  that  would  allow  us  to  scrutinize 
the  range  and  depth  of  patronage  practices  and 
lead  us  to  appreciate  the  precise  goals  that  political 
patrons  seek  to  achieve  in  making  patronage 
appointments.  This  represents  a  significant  gap  in 
the  literature  on  the  workings  of  democracy, 
political  parties,  government,  and  institutions  of 
the  state  because  theoretical  reasons  for  the  sur¬ 
vival  and  adaptation  of  patronage  practices  in 
contemporary  democracies  are  perhaps  even  more 
compelling  than  the  expectations  concerning  the 
relevance  of  clientelism. 

Petr  Kopecky 
Leiden  University 
Leiden,  Netherlands 
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Coalitions 


A  coalition  is  the  temporary  cooperation  of  differ¬ 
ent  individuals,  groups,  or  political  parties  to 
achieve  a  common  purpose,  which  can  be  either 
short  term  or  long  term.  Almost  all  politics  can  be 
conceived  as  involving  the  formation  of  some  kind 
of  coalition.  Pressure  groups,  social  movements, 
and  political  parties  are  coalitions  of  individuals 
with  a  common  interest;  governments  can  be 
formed  by  coalitions  of  political  parties,  not  only 
in  parliamentary  regimes  but  also  in  regimes  of 
division  of  powers;  maintaining  political  stability 
or  resolving  political  or  ethnic  conflicts  can  induce 
the  formation  of  broad  coalitions  committed  to 
support  a  new  regime;  and  different  governments 
and  states  can  form  military  coalitions  unified 
under  a  single  command.  A  coalition  implies 
cooperation  among  its  members.  A  relationship  of 
conflict  usually  develops  between  different  coali¬ 
tions  whose  members  have  opposite  interests. 

The  most  usual  analyses  of  coalitions  in  poli¬ 
tics  deal  with  the  formation  of  multiparty  cabi¬ 
nets  in  parliamentary  regimes.  In  government 
coalitions,  several  political  parties  cooperate, 
usually  during  a  legislative  term  between  two 
elections.  Coalition  governments  are  the  most 
usual  form  in  most  parliamentary  regimes  using 
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electoral  rules  of  proportional  representation, 
which  typically  do  not  produce  a  single-party 
majority  of  seats.  This  includes  most  countries  in 
continental  Europe,  as  well  as  other  democratic 
countries  such  as  India,  Israel,  and  New  Zealand. 
However,  other  broad  coalitions  have  resulted 
from  political  settlements  or  conflict  resolution, 
such  as  in  South  Africa,  Zimbabwe,  Kenya,  or 
Madagascar,  often  involving  international  or 
external  moderators. 

An  example  of  government  formation  in  parlia¬ 
mentary  regimes  is  Germany,  where  all  cabinets 
since  the  end  of  World  War  II  have  been  multiparty 
coalitions.  The  chancellor  or  prime  minister  has 
always  been  either  a  Christian  Democrat  or  a  Social 
Democrat,  but  government  coalitions  include  sev¬ 
eral  formulas:  rightist  Christian  Democrats  (always 
with  their  allies,  the  Bavarian  Social  Christians) 
with  center-right  Free  Democrats,  center-left  Social 
Democrats  with  Free  Democrats,  Social  Democrats 
with  left  Greens,  and  the  so-called  “grand  coali¬ 
tion”  of  Christian  Democrats  and  Social  Democrats. 
All  these  governmental  formulas  have  had  majority 
support  in  terms  of  both  popular  votes  and  parlia¬ 
mentary  seats.  Due  to  the  long-term  participation 
of  a  few  parties  in  government  and  other  institu¬ 
tional  mechanisms,  the  degree  of  stability  in  major 
public  policies  in  Germany  is  very  high.  In  contrast, 
the  alternation  of  single-party  cabinets  when  differ¬ 
ent  parties  receive  a  majority  of  seats  in  successive 
elections,  as  in  Britain,  may  provoke  periodic  shifts 
in  major  public  policy  making. 

Within  political  science,  the  formation  and  ter¬ 
mination  of  cabinet  coalitions  have  been  analyzed 
as  cooperative  games  in  game  theory.  Both  the 
search  for  office  and  the  search  for  policy  or  ideo¬ 
logical  goals  can  be  presumed  to  be  realistic  and 
legitimate  motivations  of  politicians  when  they  try 
to  form  a  multiparty  coalition.  At  the  time  of  form¬ 
ing  government,  the  interest  of  members  of  parlia¬ 
ment  in  enjoying  as  much  power  as  possible  trans¬ 
lates  into  the  aspiration  to  accumulate  as  many 
government  portfolios  or  ministries  as  possible  for 
their  party.  This  becomes  a  criterion  to  form  a 
coalition  with  the  minimum  viable  size.  The  expla¬ 
nation  for  this  is  that  if  a  government  is  formed  of 
a  multiparty  coalition  without  superfluous  mem¬ 
bers,  it  can  give  each  party  a  relatively  high  share  of 
power  to  exert  and  enjoy.  In  a  minimum  winning 
coalition,  each  of  its  party  members  is  pivotal,  in 


the  sense  that  the  loss  of  a  party  would  render  the 
coalition  no  longer  a  winning  coalition.  In  the  par¬ 
ticular  case  when  a  party  has  an  absolute  majority 
of  seats  in  parliament,  the  minimum  winning 
“coalition”  is  the  majority  single-party  govern¬ 
ment  without  additional  partners. 

For  parties  interested  in  policy,  a  criterion  to 
select  potential  partners  in  a  government  coalition 
is  the  minimization  of  policy-ideology  distance. 
Specifically,  parties  may  try  to  form  a  coalition 
with  “connected”  parties,  for  example,  on  the  left- 
right  dimension,  that  is,  with  parties  that  are  con¬ 
tiguous  to  their  positions  and  then  devoid  of 
unnecessary  parties.  Closeness  can  facilitate  the 
negotiation  of  a  government  program  and  diminish 
internal  policy  conflicts  within  the  coalition.  For 
instance,  social-democratic,  leftist,  and  green  par¬ 
ties  are  more  likely  to  form  coalitions  among  them¬ 
selves  than  with  liberal,  Christian  Democratic,  or 
conservative  parties,  which  in  turn  can  be  prone  to 
unite  themselves  in  some  governmental  coalition. 

A  minimal  connected  winning  coalition  with 
more  than  two  parties  may  include  some  superflu¬ 
ous  partners  in  terms  of  size  that  are  located  on 
intermediate  ideological  positions  and  are  thus 
necessary  to  maintain  the  ideological  connection 
between  its  members.  But  also  in  a  minimal  con¬ 
nected  winning  coalition,  each  party  is  pivotal 
because  the  loss  of  a  party  would  render  the  coali¬ 
tion  either  no  longer  winning  or  no  longer  con¬ 
nected.  On  a  single-dimension  policy  space  such  as 
the  left-right  axis,  the  median  party  will  always  be 
included  in  a  connected  winning  coalition.  The 
median  is  the  position  having  less  than  half  posi¬ 
tions  on  each  side  and  is,  thus,  necessary  to  form  a 
consistent  majority  along  the  issue  space.  Empirical 
analyses  show  that  government  coalitions  contain¬ 
ing  the  median  party  in  parliament  are  more  likely 
to  be  formed. 

When  different  winning  coalitions  can  be 
formed  in  a  parliament,  the  party  composition  of 
the  government  may  depend  on  the  bargaining 
power  of  each  party  and  the  presence,  or  not,  of  a 
dominant  party.  Several  tools  can  be  used  to  ana¬ 
lyze  these  points.  Different  political  parties  may 
have  bargaining  power  to  form  a  government 
coalition,  which  does  not  mechanically  correspond 
to  their  numbers  of  seats.  A  relatively  minor  party, 
which  is  “pivotal”  to  form  a  majority,  that  is,  a 
party  able  to  contribute  with  the  necessary  number 
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of  seats  to  make  a  coalition  winning,  may  have 
relatively  high  power  to  negotiate  cabinet  member¬ 
ship  or  policy  decisions.  For  example,  some  center, 
agrarian,  ethnic,  radical,  or  democratic  parties 
may  be  located  in  a  central  place  able  to  form 
coalitions,  at  different  moments,  with  either  the 
parties  on  their  left  or  those  on  their  right.  In  con¬ 
trast,  a  relatively  large  party  whose  contribution 
can  be  easily  replaced  with  that  of  a  smaller  party 
may  have  relatively  low  bargaining  power  in  com¬ 
parison  to  its  size.  A  party’s  bargaining  power  in 
parliament  can  be  measured,  thus,  not  by  its  num¬ 
ber  of  seats  but  by  the  proportion  of  potential  win¬ 
ning  coalitions  in  which  the  party  is  pivotal.  There 
are  several  “power  indices”  available  to  measure  a 
party’s  bargaining  power.  They  slightly  differ  in 
their  assumptions  regarding  actors’  criteria,  coali¬ 
tion  models,  and  decision  rules,  but  most  of  them 
produce  similar  results. 

In  certain  configurations,  the  largest  party  in  par¬ 
liament,  even  if  it  does  not  have  a  majority  of  seats, 
can  be  dominant  if  its  central  position  makes  it  able 
to  block  any  coalition  cabinet  and  take  all  portfo¬ 
lios.  In  other  words,  a  party  is  dominant  if  the  other 
parties  cannot  form  a  winning  coalition  without  that 
party.  If  the  nondominant  parties  are  unable  to  form 
a  majority  coalition  among  themselves  due  to  their 
ideological  distance,  then  the  dominant  party  can 
block  any  coalition  and  form  a  minority  cabinet. 
Minority  cabinets  of  a  dominant  party  are  viable 
and  likely  to  be  formed  the  greater  the  policy-ideol¬ 
ogy  divisions  and  the  smaller  the  size  of  the  parties 
in  the  opposition.  For  example,  the  Indian  National 
Congress  party  in  India,  the  Christian  Democratic 
Party  in  Italy,  the  Liberal  Democratic  Party  in  Japan, 
and  the  Social  Democratic  parties  in  Denmark, 
Sweden,  and  Norway  were  dominant  single  parties 
for  long  periods. 

Additional  analyses  deal  not  only  with  which 
parties  are  more  likely  to  enter  a  coalition  cabinet 
but  also  with  the  allocation  of  ministries  to  the 
parties  within  the  cabinet.  The  distribution  of 
cabinet  portfolios  among  coalition  parties  tends  to 
be  proportional  to  the  number  of  seats  controlled 
by  each  party — that  is,  with  its  contribution  to 
making  a  coalition  winning.  However,  different 
parties  have  preferences  for  different  portfolios 
depending  on  the  policy  issues  they  emphasize  the 
most,  which  may  produce  varied  allocations.  The 
prime  minister’s  party  usually  controls  most  of  the 


portfolios  in  charge  of  major  policy  domains,  espe¬ 
cially  economy,  defense,  and  home  affairs.  Other 
cabinet  portfolios  can  be  allocated  to  parties  with 
a  strong  profile  on  certain  issues  on  which  they 
tend  to  campaign  and  attract  citizens’  votes,  such 
as  social  policy  for  Laborites  or  Social  Democrats, 
education  for  Christian  Democrats,  finance  for 
Liberals,  agriculture  for  Agrarians,  and  so  on. 

Different  coalition  formulas  are  associated  with 
different  durations  of  cabinets.  Regular  parliamen¬ 
tary  elections  are  usually  scheduled  at  intervals  of 
3,  4,  or  5  years,  depending  on  the  country.  But  a 
significant  number  of  parliamentary  cabinets  do 
not  last  as  long  as  they  legally  could  because  there 
are  anticipated  dissolutions  of  parliament,  elec¬ 
tions  (which  can  usually  be  called  by  the  prime 
minister),  resignations  by  prime  ministers,  and  suc¬ 
cessful  motions  of  censure  and  defeated  motions  of 
confidence.  Regarding  the  party  composition  of 
cabinets,  single-party  majority  cabinets  tend  to  last 
longer  than  multiparty  coalition  or  minority  cabi¬ 
nets.  In  single-party  governments,  conspiracies 
among  party  members  to  replace  the  incumbent 
prime  minister  are  relatively  likely,  especially  if 
party  members  expect  to  have  better  electoral 
prospects  with  a  new  candidate.  In  contrast,  in 
multiparty  coalition  governments,  internal  party 
cohesion  tends  to  increase,  but  coalition  partners 
are  more  willing  to  work  against  the  incumbent 
formula.  For  coalition  cabinets,  the  higher  the 
number  of  parties  and  the  broader  the  ideological 
distance  between  them,  the  more  vulnerable  to 
splits  and  departures  and  less  durable  they  should 
be  expected  to  be.  This  kind  of  crisis  is  relatively 
frequent  in  parliamentary  regimes.  In  15  countries 
of  Western  Europe  since  World  War  II,  about  one 
sixth  of  parliamentary  governments  have  not  con¬ 
cluded  their  term  due  to  a  change  of  prime  minis¬ 
ter,  a  change  in  the  party  composition  of  the 
government  coalition,  or  the  dissolution  of  parlia¬ 
ment  and  the  call  of  an  early  election. 

Multiparty  coalition  cabinets  in  parliamentary 
regimes  tend  to  induce  a  relatively  balanced  interin- 
stitutional  relationship  between  the  prime  minister 
and  the  parliament.  As  political  parties  need  to 
bargain  and  reach  agreements  in  order  to  make 
policy  decisions  and  pass  bills,  they  learn  to  share 
power  and  develop  negotiation  skills.  Cabinet 
members  from  different  parties  need  to  cooperate 
as  well.  The  prime  minister  cannot  prevail  over  the 
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cabinet  or  the  assembly  as  much  as  when  leading  a 
single-party  government  because,  even  if  he  or  she 
is  a  member  of  one  of  the  parties  involved,  he  or  she 
has  to  negotiate  with  the  other  parties  and  maintain 
the  coalition  united.  In  a  parliamentary  regime,  the 
institutional  role  of  the  parliament  thrives  when  no 
party  has  an  absolute  majority  of  seats. 
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A  typical  question  posed  by  researchers  of  politi¬ 
cal  socialization  is  whether  people  socialized 
under  distinct  political  and  social  conditions 
manifest  different  attitudinal  and  behavioral  pat¬ 
terns  and  the  extent  to  which  such  differences 
persist  during  their  life  span.  Addressing  this  ques¬ 
tion  at  the  individual  level  is  difficult  because  it 
requires  tracing  different  age-groups  during  their 
life  trajectories.  Since  long-term  panel  data  are  still 
scarce  in  political  science,  a  common  approach  is 
to  focus  on  aggregate  patterns  of  change.  The 
accumulation  of  cross-sectional  data  during  recent 
decades  has  enabled  the  use  of  pseudopanels,  cre¬ 
ated  by  pooling  repeated  cross-sections  over  a  long 
period  of  time.  Accordingly,  units  are  aggregated 
after  being  classified  according  to  their  age  and  the 
period  in  which  the  phenomenon  of  interest  is 


measured.  For  example,  if  the  aim  is  to  examine 
the  sources  of  variation  in  turnout  levels  over  the 
past  50  years,  25-year-olds  interviewed  in  the 
1970s  can  be  simultaneously  compared  with 
50-year-olds  interviewed  in  the  same  year  interval 
and  with  25-year-olds  interviewed  in  the  1990s.  In 
so  doing,  the  aim  of  this  type  of  analysis,  known 
as  cohort  analysis,  is  to  explore  whether  variation 
in  an  attribute  of  interest  is  mainly  accounted  for 
by  particular  political  periods,  age-groups,  or  sub¬ 
categories  defined  by  an  interaction  of  the  two. 
The  aim  of  this  entry  is  to  describe  this  analytical 
strategy  and  to  shed  some  light  on  some  of  its 
limitations. 

Period  Effects 

As  a  way  to  motivate  the  discussion,  let  us  imagine 
a  population  of  interest,  voting  for  one  of  the  two 
parties,  A  and  B,  which  compete  in  elections  that 
are  held  every  4  years.  Assuming  that  the  only 
source  of  change  in  the  composition  of  the  popula¬ 
tion  is  that  of  succession,  that  birth  and  death  rates 
are  approximately  equal,  and  that  the  structure  of 
the  party  system  remains  stable,  the  question  is 
how  can  political  change  occur  in  this  context?  It 
turns  out  that  change  regarding  a  social  attribute  of 
interest  under  these  circumstances  is  essentially 
produced  through  two  different  mechanisms,  both 
of  which  can  be  traced  in  Table  1,  which  distin¬ 
guishes  only  between  two  subgroups  of  interest, 
namely,  young  and  old  voters  (the  reason  for  using 
age  as  a  forcing  variable  will  be  clear  in  the  next 
paragraph).  The  first  mechanism  involves  a  coming 
event  or  a  sequence  of  events  that  might  induce 
people  to  change  their  minds.  This  is  referred  to  as 
a  period  effect.  For  instance,  people  might  vote 
predominantly  for  Party  A  during  Time  1.  An  event 
takes  place,  and  they  shift  to  Party  B.  If  this  shift 
occurs  at  approximately  equal  rates  between  differ¬ 
ent  age-groups,  one  should  observe  a  difference 
between  the  two  columns  of  Table  1,  whereas  no 
difference  should  be  found  between  the  two  rows 
of  each  column.  This  is  shown  in  the  upper  panel  of 
Table  1. 

Cohort  Effects 

Political  events,  however,  are  unlikely  to  affect  all 
people  in  the  same  way.  Most  of  the  time  researchers 
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Table  I  An  Illustrative  Example  of  Period,  Cohort, 
and  Life  Cycle  Effects 


Time  1 

Time  2 

Period  effects 

Young 

A 

B 

Old 

A 

B 

Cohort  effects 

Young 

A 

B 

Old 

A 

A 

Life  cycle  effects 

Young 

B 

B 

Old 

A 

A 

observe  (or  assume)  particular  cohorts,  that  is, 
groups  of  individuals  who  pass  some  crucial 
stage  of  their  lives  at  approximately  the  same 
time,  to  be  considerably  more  affected  by  politi¬ 
cal  events  than  other  cohorts.  Although  different 
cohorts  of  interest  have  been  examined,  the  most 
common  approach  is  to  distinguish  between  new 
cohorts  and  older  ones  with  the  first  deemed 
more  susceptible  to  political  stimuli.  That  this 
assumption  is  implicitly  made  in  most  cohort 
analyses  is  evident  from  the  definitions  given  to 
different  cohorts  so  as  to  distinguish  one  from  the 
other.  In  most  cases,  each  cohort  is  characterized 
by  the  salient  historical  and/or  political  features 
that  marked  its  early  adulthood  (the  “war  gen¬ 
eration,”  the  “1968  generation,”  etc.).  If  such 
shaping  socialization  processes  dominate  over 
inertial  forces  of  intergenerational  transmission 
of  political  attitudes,  young  adults  can  differ  in 
terms  of  their  value  orientations  or  any  other 
attitudinal  aspect  of  interest  from  one  generation 
to  another.  This  second  mechanism  of  change, 
known  as  a  cohort  effect,  is  presented  in  the  sec¬ 
ond  panel  of  Table  1.  Whereas  the  older  age- 
group  remains  stable  over  time,  young  cohorts 
shift  from  Party  A  to  Party  B.  According  to  this 
scenario,  change  is  driven  only  through  the 
entrance  of  new  generations. 

Life  Cycle  Effects 

Political  change  can  hardly  ever  be  attributed  to 
either  of  the  two  sources  independently.  In  most 
cases,  the  two  mechanisms  interplay  and  the 


evaluation  of  their  independent  contribution  in 
aggregate  change  is  far  from  a  straightforward 
enterprise.  This  effort  is  further  blurred  by  the  role 
of  life  cycle.  The  latter  can  incorporate  physiologi¬ 
cal,  social,  and  psychological  intracohort  changes 
taking  place  during  chronological  aging.  Phys¬ 
iological  changes  may  have  important  attitudinal 
implications  (e.g.,  participation  in  politics  becomes 
more  costly  for  the  elderly)  but  they  tend  to  vary 
only  slightly  among  different  political  contexts, 
and  thus,  they  do  not  constitute  the  primary  source 
of  interest  in  most  political  science  applications. 
Social  change  due  to  aging  involves  contextual 
influences  stemming  from  peer  groups  and  social 
roles  (getting  married,  finding  a  job,  settling  down, 
etc.)  that  most  people  assume  during  his  or  her  life 
span.  These  influences  might,  for  example,  lead 
young  people  to  opt  for  a  more  liberal  party  than 
their  older  counterparts.  Such  evidence  would  be 
the  outcome  of  an  age  gap,  largely  persistent 
through  time.  This  is  seen  in  the  final  panel  of 
Table  1,  which  indicates  an  age  difference  in  party 
choice  that  holds  at  both  points  in  time.  This  pat¬ 
tern  cannot  account  for  change  over  time  unless 
the  population  distribution  changes  in  a  consistent 
fashion.  For  instance,  if  older  people  opt  for  more 
conservative  parties  than  younger  people,  a  grad¬ 
ual  increase  of  their  relative  portion  in  the  compo¬ 
sition  of  the  population  might  reveal  an  aggregate 
difference  over  time  without  people  actually  chang¬ 
ing  their  attitudinal  profiles.  Such  a  pattern  would 
be  due  to  aging  effects. 

Psychological  change  can  manifest  itself  in  vari¬ 
ous  ways.  For  instance,  disengagement  theory 
asserts  that  some  otherwise  distinct  life  stages 
might  for  different  reasons  discourage  people  from 
participating  in  political  affairs.  More  important, 
aging  might  serve  as  a  proxy  for  the  process  of 
reinforcement  and  crystallization  of  people’s  prior 
orientations.  This  is  probably  the  most  salient 
effect  of  the  life  cycle,  especially  in  applications  of 
voting  research,  and  has  been  already  implicitly 
alluded  to  in  the  previous  section.  People  form 
their  political  predispositions  in  a  developmental 
manner,  and  thus,  change  in  attitudinal  respects  is 
increasingly  less  likely  to  occur  as  people  accumu¬ 
late  experience  with  the  political  world.  The 
example  provided  below  shows  how  this  process 
might  confound  the  observed  pattern  of  aggregate 
change. 
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An  Example 

Let  us  now  assume  that  in  our  imaginary  popula¬ 
tion  we  observe  a  trend  shown  in  Figure  1,  that  is, 
the  percentage  of  the  population  affiliated  to  a 
party  during  the  past  30  years  decreases  in  a  rather 
progressive  fashion.  What  could  we  infer  about  the 
sources  of  change?  On  the  one  hand,  people’s 
attachments  with  the  parties  might  have  declined 
as  a  consequence  of  disappointing  government 
performance.  On  the  other  hand,  the  coming  of 
new  cohorts  that  are  on  average  less  attached  to 
the  parties  might  also  account  for  this  pattern. 
Looking  only  at  Figure  1,  it  is  impossible  to  say 
which  of  the  two  explanations  is  at  work. 
Consequently,  a  typical  first  step  in  such  types  of 


analysis  is  to  disaggregate  this  general  trend  into 
different  cohorts.  Figures  2  and  3  do  that,  each  one 
giving  pride  of  place  to  one  of  the  two  mechanisms 
described  previously. 

As  can  be  seen,  the  two  figures  tell  a  completely 
different  story.  In  Figure  2,  pure  period  effects  are 
observed.  Each  new  cohort  enters  the  electorate 
from  the  exact  point  that  the  previous  cohort 
stopped  and  they  continue  together  thereafter.  No 
intercohort  difference  is  observed  and  hence  the 
cohort-specific  curves  merge  in  a  single  curve, 
making  the  graph  look  identical  to  Figure  1.  In 
other  words,  distinguishing  between  age-groups  is 
redundant  because  all  of  them  react  identically  to 
the  political  influences  they  experience.  Figure  3, 


Years 


Figure  I  Percentage  of  People  Registering  Partisanship  With  Either  Party  A  or  Party  B  in  a  30-Year  Time  Interval 
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Figure  2  Period  Effects 


in  contrast,  shows  that  all  cohorts  remain  rela¬ 
tively  stable  through  time  but  each  coming  cohort 
is  constantly  less  affiliated  to  either  of  the  two 
parties.  Consequently,  the  overall  trend  is  simply 
the  outcome  of  intercohort  variation,  although 
each  cohort  designates  remarkable  stability  in  its 
political  orientations  during  the  observed  time 
interval.  Political  change,  in  this  case,  is  simply 
due  to  the  lower  starting  level  of  each  coming 
cohort.  If  that  was  what  produced  the  average 
decline,  change  would  need  to  be  allocated  to  pure 
cohort  effects. 

Needless  to  say,  the  artificial  lines  drawn  here 
only  serve  as  idealized  benchmarks  useful  to  evalu¬ 
ate  the  role  of  each  of  the  two  mechanisms  against 


real  data.  That  said,  social  reality  is  much  more 
complicated  since  both  mechanisms,  albeit  at  dif¬ 
ferent  rates,  are  typically  at  work.  To  make  mat¬ 
ters  worse,  any  analysis  of  such  effects  needs  to 
account  for  the  role  of  aging,  which  can  confound 
either  cohort  or  period  effects.  In  the  first  case,  if 
aging  implies  a  societal  type  of  change,  not  taking 
it  into  account  might  give  room  to  spurious  cohort 
effects.  Addressing  this  possibility  requires  gather¬ 
ing  data  over  a  long  period  of  time.  This  is  the  only 
way  to  examine  whether  differences  between 
young  people  and  older  cohorts  persist  over  time. 
For  instance,  to  see  whether  the  fact  that  20-year- 
olds  in  the  1970s  vote  at  lower  levels  than  40-year- 
olds  is  due  to  aging  or  cohort  effects,  one  needs  to 
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Figure  3  Cohort  Effects 

Note:  There  is  always  some  noise  (error)  in  the  curves,  indicating  that  even  in  the  most  ideal  case,  some  fluctuation  would  still 
exist  due  to  sampling  variability. 


trace  both  groups  for  at  least  20  years.  If  both 
groups  continue  voting  at  the  same  levels  as  20 
years  ago,  we  would  observe  evidence  in  favor  of 
cohort  effects.  If  the  first  group  approaches  the 
turnout  levels  of  the  older  group,  this  would  con¬ 
stitute  evidence  for  life  cycle  effects.  The  problem, 
of  course,  is  that  on  many  occasions  these  mecha¬ 
nisms  might  coexist,  making  their  independent 
contribution  in  the  overall  pattern  of  change  diffi¬ 
cult  to  assess. 

Aging  can  also  be  a  confounder  or  a  mediator 
of  period  effects.  This  is  mainly  through  the  immu¬ 
nization  hypothesis.  To  explore  whether  aging 
implies  an  increased  resilience  to  political  change 
stimuli,  we  need  to  compare  the  trajectories  of  dif¬ 
ferent  cohorts  over  time.  This  is  done  in  Figures  4 
and  5.  In  the  first  case,  no  particular  period  shock 


is  assumed.  The  first  group,  representing  people 
with  an  already  long  record  in  political  affairs, 
seems  to  remain  stable  over  time.  Newly  coming 
cohorts,  however,  begin  from  a  low  starting  level 
but  progressively  strengthen  their  feeling  of 
belonging  to  a  particular  party  as  they  accumulate 
electoral  experience.  Failing  to  account  for  this 
intercohort  difference  might  falsely  result  in  attrib¬ 
uting  the  change  to  period  effects.  To  be  sure, 
there  are  ceiling  effects  in  this  process  that  make 
further  differences  due  to  age  evaporate.  This  pat¬ 
tern  can  be  grasped  by  the  comparison  of  Cohorts 
6  and  7  of  Figure  4,  where  initial  age-related  dif¬ 
ferences  diminish  as  both  cohorts  increase  their 
level  of  attitudinal  stability.  The  only  difference 
between  the  two  cohorts  is  that  the  more  recent 
one  began  its  trajectory  with  a  lag.  However,  their 
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Figure  4  Aging  Effects  I 


route  is  otherwise  indistinguishable.  In  the  pres¬ 
ence  of  ceiling  effects,  this  means  that  they  eventu¬ 
ally  converge  once  this  process  of  partisan  learning 
is  complete. 

Figure  5  shows  how  aging  can  work  as  a  media¬ 
tor  of  period  effects  by  approximating  the  idea 
that  period  shocks  are  more  consequential  among 
younger  cohorts.  A  temporal  shock  causing  a 
decline  in  the  level  of  partisan  affiliation  in  Year 
28  is  more  decisive  for  people  with  lower  levels  of 
prior  electoral  experience.  As  can  be  seen  from  the 
downward  spike,  the  magnitude  of  this  effect 
diminishes  in  a  monotonic  fashion  as  we  move 
from  younger  to  older  cohorts.  The  reason  for 
that,  once  again,  is  that  the  latter  experience  this 


shock  having  already  established  firmer  political 
beliefs. 

The  Age-Cohort-Period  Framework 
and  Its  Deficiencies 

A  systematic  attempt  to  simultaneously  address  all 
three  potentially  confounding  factors  is  next  to 
impossible.  Until  recently,  the  industry  standard  in 
this  subfield  was  to  disentangle  the  sources  of 
political  change  through  the  so-called  Age-Cohort- 
Period  (ACP)  framework.  Four  important  limita¬ 
tions  characterize  this  procedure.  The  first  problem 
is  the  well-known  identification  problem.  Since 
each  of  the  three  terms  is  a  linear  function  of  the 
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Cohorts  4  and  5 


Cohort  6 


Cohort  7 


Cohort  8 


Years 


Figure  5  Aging  Effects  II 

Note:  To  retain  simplicity,  each  group  is  allowed  to  return  to  previous  levels  immediately  after  the  shock,  hence  it  is  assumed 
that  there  is  no  difference  in  the  long-term  effects  of  this  shock  on  different  cohorts. 


other,  unless  one  of  them  is  assumed  to  have  a  non¬ 
linear  impact  on  change,  their  effect,  let  alone  inter¬ 
actions,  cannot  be  independently  estimated  from 
the  data.  To  see  why  this  is  so,  consider  the  follow¬ 
ing  equation,  where  A,  P ,  and  C  stand  for  age, 
period,  and  cohort,  respectively:  A  =  P  —  C;  equiv¬ 
alently,  C  —  P  —  A  or,  alternatively,  P  =  A  +  C.  In 
other  words,  any  of  the  three  variables  is  a  linear 
function  of  the  others:  Age  can  be  directly  deduced 
by  knowing  the  year  of  the  survey  and  the  cohort  in 
which  the  person  belongs.  Examining,  for  instance, 
respondents  belonging  in  the  35-year-old  cohort  in 
1987  is  equivalent  to  examining  those  born  in  1952. 
Evidently,  we  cannot  isolate  the  impact  of  belonging 


in  this  cohort  from  the  effect  attributed  to  the  role 
of  age. 

The  usual  way  out  is  to  constrain  the  impact  of 
one  of  the  three  factors  either  to  zero  or  to  a  non¬ 
linear  function.  If,  for  instance,  age  is  thought  to 
be  related  to  political  participation  in  a  curvilinear 
fashion,  or  cohort  effects  can  be  measured  in  a 
categorical  way,  a  simple  ordinary  least  squares 
(OLS)  regression  can  capture  adequately  the  inde¬ 
pendent  effects  of  each  confounding  factor.  To  be 
sure,  this  leverage  comes  only  at  the  cost  of  a  more 
parsimonious  specification,  which  assumes  that 
period  or  cohort  effects  operate  in  the  same  way 
across  cohorts  with  different  years  of  birth.  More 
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recently,  the  use  of  Bayesian  methods,  generalized 
additive  models,  nonlinear  regression  models,  dis¬ 
criminant  analysis,  or  dynamic  modeling  of  the 
effect  of  period  shocks  on  different  cohorts  has 
helped  examine  the  three  factors  independently, 
making  only  minimal  assumptions  about  the  way 
they  affect  the  phenomenon  of  interest. 

The  second  problem  is  that  even  when  a  typical 
regression  model  manages  to  produce  point 
estimates  for  the  effects  of  each  of  the  three  mecha¬ 
nisms,  the  standard  errors  will  typically  be  unreli¬ 
able.  The  reason  for  this  is  that  the  unit  of  analysis 
is  usually  an  aggregate  measure  (percentage,  mean 
values,  etc.)  of  a  group  of  individual  responses 
taken  from  a  sample  of  a  targeted  population. 
Thus,  even  in  a  case  of  a  full-probability  random 
sample,  each  cell  represents  an  estimate,  and  thus, 
the  uncertainty  associated  with  this  estimate  needs 
to  be  also  taken  into  account.  The  third  problem  is 
that  tracing  a  cohort  over  a  long  period  of  time  is 
also  subject  to  nonrandom  sampling  error.  As  a 
given  cohort  becomes  older,  the  number  of  observa¬ 
tions  representing  it  falls  drastically.  Importantly, 
the  probability  of  survival  may  well  be  nonignor- 
able  to  the  trait  of  interest. 

This  last,  less  cited  but  probably  more  serious, 
problem  of  the  ACP  framework  relates  to  its 
exploratory  rather  than  explanatory  character.  Any 
ACP  analysis  is  primarily  useful  as  an  effort  to  par¬ 
tition  change  into  each  of  its  components.  However, 
since  the  year  of  birth  and/or  the  year  of  survey 
cannot  be  logically  treated  as  autonomous  influ¬ 
ences  on  any  observed  phenomenon  of  interest, 
unless  some  further  examination  of  the  causal  pro¬ 
cesses  that  lead  to  this  observation  is  made,  any 
ACP  account  is  only  useful  as  suggestive  evidence 
for  what  might  be  considered  as  the  potential  expla¬ 
nations  for  the  underlying  phenomenon.  A  more 
elaborate  analysis,  however,  would  need  to  model 
these  processes  and  unpack  the  structural  factors 
lurking  behind  each  of  the  three  terms.  In  statistical 
parlance,  no  analysis  of  this  kind  can  account  for 
the  underlying  process  generating  the  data. 

This  last  remark  should  not  be  interpreted  as  an 
argument  against  the  use  of  cohort  analysis  alto¬ 
gether.  That  would  mean  throwing  out  the  baby 
with  the  bathwater.  In  effect,  cohort  analysis,  which 
started  in  the  1970s  as  a  powerful  tool  that  social 
scientists  adopted  from  demographers  to  study  pat¬ 
terns  of  social  change,  remains  a  creative  strategy  to 
disentangle  interrelated  sources  of  aggregate  change. 


Although  the  difficulty  of  the  ACP  framework  to 
separate  these  mechanisms  creates  limitations  in  the 
statistical  analysis,  it  is  also  useful  in  showing 
researchers  that  social  reality  is  often  more  complex 
than  assumed  by  models,  which  fail  to  take  into 
account  all  these  competing  mechanisms.  Cohort 
analysis  can  be  very  useful  either  as  a  first  explor¬ 
ative  step  accompanied  by  individual-level  panel 
analysis  or  as  a  way  to  examine  well-developed 
theories  that  mainly  focus  on  one  of  the  three 
mechanisms  of  change.  Any  such  analysis  needs  to 
provide  rigid  theoretical  argumentation  and  empir¬ 
ical  evidence  that  would  help  unpack  any  of  the 
three  terms  into  substantive  factors  that  operate 
through  this  process.  In  sum,  cohort  analysis  can 
provide  suggestive  evidence  about  the  dominant 
source  of  aggregate  change  when  such  exists  or,  as 
a  minimum,  make  it  clear  that  in  various  instances 
disentangling  these  effects  is  asking  too  much  from 
the  existing  information  that  we  have  at  hand. 
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Collaboration 


Collaboration  is  a  process  in  which  the  state  en¬ 
gages  with  business  and  civil  society  to  shape 
policy  agendas,  and  to  design  and  implement 
policy  programs.  Sometimes  this  is  interpreted  as 
meaning  that  the  public  policy  process  reflects  the 
principles  of  deliberative  democracy,  in  which 
actors  engage  in  a  rational  debate  with  the  inten¬ 
tion  of  seeking  a  consensus  solution.  Other  litera¬ 
tures  place  more  emphasis  on  collaboration  aris¬ 
ing  from  resource  interdependencies  and  complex 
problems  that  require  a  multisectoral  response.  A 
more  critical  understanding  of  collaboration  is  as 
one  of  the  mechanisms  through  which  the  state 
subjugates  populations  via  new  mechanisms  of 
self-governance. 

Despite  definitional  differences,  there  is  general 
agreement  that  collaboration  is  different  from 
cooperation  and  coordination.  All,  however, 
are  points  on  a  continuum  from  organizational 
independence  to  organizational  integration. 
Cooperation  involves  sharing  information  or  exper¬ 
tise  in  a  context  where  organizations  remain  auton¬ 
omous  from  each  other,  for  example,  in  client  refer¬ 
rals  in  social  welfare  networks.  Coordination 
involves  a  process  of  independent  organizations 
introducing  a  degree  of  integration  by  undertaking 
mutual  adjustment  to  produce  a  better  overall  out¬ 
come.  This  takes  place  when  organizations  agree  to 
redesign  their  procedures  so  that,  for  example,  cli¬ 
ents  can  use  a  single  point  of  contact  to  access  ser¬ 
vices  from  a  variety  of  agencies.  Collaboration  is 
the  next  step  in  the  continuum.  It  involves  organi¬ 
zations  recognizing  that  they  are  interdependent 
and  creating  a  greater  degree  of  integration  of  pro¬ 
cedures,  policies,  and  structures.  Such  collabora¬ 
tion  might  involve  the  creation  of  an  overarching 
board  to  make  strategic  decisions  that  are  binding 
on  the  constituent  organizations. 

Collaboration  is  often  regarded  as  an  activity 
that  takes  place  at  the  subnational  level,  and  there 


is  a  large  literature  on  partnership  and  other  forms 
of  collaboration  at  city  or  neighborhood  level,  for 
example,  in  urban  regeneration  and  sustainability. 
However,  it  would  be  incorrect  to  privilege  these 
spatial  scales.  In  an  era  of  multilevel  and  global  gov¬ 
ernance,  it  is  important  that  the  discussion  of  col¬ 
laboration  also  take  into  account  interactions  at 
national  and  transnational  level,  as  well  as  interac¬ 
tions  between  these  and  other  levels  of  governance. 
This  is  because  collaborative  mechanisms  engaging 
government  with  civil  society  and  business  have 
become  the  preferred  delivery  mechanisms  for  many 
aspects  of  European  and  national  policy,  including 
those  related  to  economic  restructuring,  social  inclu¬ 
sion,  environmental  sustainability,  and  the  knowl¬ 
edge  society.  These  policy  initiatives  stimulate  activ¬ 
ity  at  the  subnational  level  and  in  some  cases  (as  in 
a  number  of  European  Commission  initiatives) 
become  the  instrument  through  which  specific  pol¬ 
icy  intentions  are  expected  to  be  realized.  In  the  field 
of  international  relations,  the  creation  by  states  of 
instruments  of  global  governance  can  be  regarded  as 
the  outcome  of  a  process  that  is  intended  to  lead  to 
greater  collaboration,  for  example,  through  the  reg¬ 
ulation  of  standards  or  economic,  environmental,  or 
social  policies.  Collaboration,  therefore,  involves 
both  horizontal  and  vertical  forms  of  interorganiza- 
tional  engagement. 

How  Does  Collaboration 
Relate  to  Other  Concepts? 

The  concept  of  collaboration  is  closely  related  to 
other  concepts  in  common  usage.  The  first  term  to 
consider  is  governance.  In  political  science  and  pub¬ 
lic  policy,  this  typically  refers  to  a  changing  form  of 
state  rule  through  which  government  attempts  to 
engage  other  actors  in  its  political  projects  through 
cultural  rather  than  coercive  mechanisms.  However, 
the  extent  to  which  actors  from  business  and  civil 
society  are  able  or  willing  to  engage  in  collaborative 
activity  with  government  will  be  influenced  by  the 
nature  of  the  state  and  its  historical  relationships. 
So  the  experience  of  the  United  States  with  its  tradi¬ 
tion  of  “small  government”  and  extensive  private 
provision  of  public  services  will  be  very  different 
from  that  of  Western  Europe,  where  the  state  has 
played  a  much  more  prominent  role  in  service  pro¬ 
vision.  The  emergence  of  multilevel  governance  is 
especially  a  recent  consequence  of  European  inte¬ 
gration,  although  decentralization  and  different 
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levels  of  government  are  a  part  of  traditions  inside 
and  outside  Europe. 

Second,  collaboration  is  broadly  synonymous 
with  the  concept  of  “partnership.”  This  concept  is 
used  in  some  English-speaking  countries  (e.g.,  the 
United  Kingdom  [UK],  Australia,  New  Zealand)  to 
refer  to  the  quasi-formal  organizational  arrange¬ 
ments  that  provide  an  envelope  for  interaction 
between  state,  civil  society,  and  business  on  partic¬ 
ular  policy  questions.  In  these  countries,  “public- 
private  partnership”  typically  refers  to  contractual 
agreements  between  government  and  business 
related  to  the  provision  of  infrastructure  or  ser¬ 
vices,  whereas  in  Europe  and  the  United  States,  it 
tends  to  encapsulate  both  this  specific  meaning  and 
the  wider  Anglo  idea  of  partnership.  For  U.S.  schol¬ 
ars,  “collaborative  public  management”  has  a 
meaning  synonymous  with  the  idea  of  “partner¬ 
ship”  in  the  UK. 

The  third  concept  with  which  collaboration 
connects  is  that  of  “networks,”  which  develops  the 
older  idea  of  policy  networks.  Here,  the  emphasis 
is  on  an  institutionalized  pattern  of  relationships 
between  autonomous  actors  in  a  particular  policy 
realm.  Negotiated  agreements  between  actors 
could  be  understood  as  collaboration  to  the  extent 
that  the  actors  agree  to  forgo  some  of  their  auton¬ 
omy  in  order  to  realize  collective  benefits. 

Why  Is  Collaboration  Important  Today? 

The  contemporary  importance  of  this  idea  arises 
from  the  changing  role  of  the  state  in  the  late  20th 
and  early  21st  century.  The  argument  is  that  the 
state  lacks  the  necessary  capacity  and  resources  to 
deliver  public  policy  goals  in  advanced  capitalist 
economies  and  chooses  to  fill  this  gap  by  involving 
others  sectors  more  into  functions  that  would — 
under  traditional  public  administration — have 
been  reserved  to  the  state  and  its  functionaries. 
Collaboration  is  therefore  part  of  the  new  public 
management  movement,  which  includes  the  view 
that  public  services  could  be  delivered  more  effi¬ 
ciently  and  effectively  by  external  providers — 
including  the  widespread  fascination  with  the 
involvement  of  the  private  sector. 

A  related  explanation  is  that  neoliberal  ideology, 
especially  in  social  welfare  and  former  socialist 
states,  has  resulted  in  a  fragmentation  of  govern¬ 
ment  due  to  privatization,  contracting  out,  the 


creation  and  hiving  off  of  separate  agencies  from 
large  public  bureaucracies,  and  other  new  public 
management  initiatives.  Collaboration  arises  from 
the  need  to  retain  a  capacity  to  integrate  across  this 
distributed  organizational  landscape  so  that  policy 
intentions  can  be  realized  and  steering  is  possible. 
Such  integration  occurs  at  a  number  of  levels.  At 
the  discursive  level,  there  are  powerful  ideas — for 
example,  about  “partnership” — which  mobilize 
actors  around  collaborative  projects  despite  the  dif¬ 
ferences  between  them.  At  the  institutional  level, 
collaboration  involves  a  process  of  rule  creation 
where  there  is  greater  flexibility  for  actors  than  in 
representative  government.  And  at  the  behavioral 
level,  boundary  spanners  and  network  managers 
are  valued  because  of  their  capacity  to  build  coali¬ 
tions  between  organizations  and  groupings  in  an 
ambiguous  and  changing  environment. 

What  Are  the  Democratic 
Implications  of  Collaboration? 

From  a  progressive  perspective,  collaboration 
offers  a  way  of  developing  participative  forms  of 
governance  that  bypass,  or  at  least  complement, 
representative  democracy.  It  facilitates  the  engage¬ 
ment  of  a  wider  range  of  actors  in  the  policy  pro¬ 
cess  and  potentially  enables  them  to  contribute 
throughout  the  process  rather  than  just  being  con¬ 
sulted  at  the  start  and  excluded  from  the  black  box 
of  decision  making.  So  collaboration  can  be 
treated  as  synonymous  with  deliberative  and  par¬ 
ticipative  democracy,  and  the  opening  up  of  the 
state  to  other  interests  (and  especially  those  of  citi¬ 
zens  and  civil  society  actors).  This  progressive 
perspective  is  reflected  in  the  normative  tone  that 
can  be  found  in  the  literature. 

A  more  critical  reading  positions  collaboration  in 
the  light  of  Foucauldian  governmentality,  where 
close  engagement  with  the  state  introduces  a  regime 
of  discipline  on  participants.  In  this  process,  they 
become  part  of  the  mechanisms  of  governance  in  a 
complex  society,  taking  on  themselves  the  concerns 
of  the  state  and  self-regulating  their  actions  in  this 
way.  Thus,  citizens  involved  with  the  state  in  regen¬ 
erating  disadvantaged  communities  take  on  the 
mantle  of  the  state’s  discourse  of  disadvantage  and 
the  performance  management  systems  that  go  along 
with  delivering  public  policy.  This  constrains  their 
capacity  to  think  and  act  outside  this  discourse  in 
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ways  that  might  serve  their  interests  better  but  go 
against  the  state.  On  this  basis,  citizens  might  adopt 
a  political  strategy  of  exit  rather  than  remain 
trapped  by  the  false  reality  of  collaboration. 

This  discussion  reminds  us  that  collaboration — 
for  all  its  progressive  overtones — is  about  political 
action.  The  prevailing  discourse  of  collaboration 
conceals  the  darker  side  in  which  decisions  are 
removed  from  the  arena  of  representative  govern¬ 
ment  with  its  public  interest  safeguards  and  located 
instead  in  less  transparent  institutional  contexts 
whose  rules  in  use  are  evolving  and  more  mallea¬ 
ble.  If  we  agree  that  the  relations  of  power  between 
state,  business,  and  civil  society  are  structured  to 
give  advantage  to  the  former  two  actors,  and  thus 
biased  against  civil  society,  then  collaboration  is 
likely  to  do  little  to  redress  these  more  fundamen¬ 
tal  imbalances.  The  challenge  for  academics  ana¬ 
lyzing  these  new  arrangements  is  to  investigate 
beneath  the  epiphenomenon  of  political  discourse 
and  understand  the  nature  and  impact  of  collabo¬ 
ration  on  the  relations  between  actors  in  histori¬ 
cally  embedded  contexts. 
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Collective  Security 


A  collective  security  system  is  one  in  which  each 
state  in  the  system  accepts  that  the  security  of  one 


is  the  concern  of  all  and  agrees  to  join  in  a  collective 
response  to  threats  to,  and  breaches  of,  the  peace. 
A  weaker  version  posits  a  system  that  commits  gov¬ 
ernments  to  develop  and  enforce  broadly  accepted 
international  rules  in  the  area  of  international  peace 
and  security  and  to  do  so  through  collective  action 
legitimized  by  international  institutions. 

The  idea  of  collective  security  goes  back  at  least 
to  the  European  peace  plans  of  the  18th  century 
and  gained  ground  in  the  post-World  War  I  period 
as  international  society  sought  to  restrict  the  previ¬ 
ously  wide-ranging  right  of  states  to  resort  to  war 
as  an  instrument  of  state  policy,  first  in  the  League 
of  Nations  and  then  in  the  United  Nations  (UN). 
The  UN  did  not  constitute  a  pure  collective  secu¬ 
rity  system  (e.g.,  in  relation  to  the  veto  given  to 
permanent  members  of  the  Security  Council);  but 
it  contained  powerful  elements  of  collective  secu¬ 
rity,  above  all  in  terms  of  the  clear  prohibition  of 
aggressive  force  in  Article  2(4)  of  the  UN  Charter, 
and  in  the  far-reaching  responsibility  of  the  Security 
Council  for  the  maintenance  of  international  peace 
and  security,  including  the  authorization  of  man¬ 
datory  sanctions  and  military  action.  The  end  of 
the  Cold  War  appeared  to  many  to  open  new 
opportunities  for  collective  security,  with  an 
increasing  number  of  enforcement  resolutions 
adopted  under  Chapter  VII  and  a  very  significant 
expansion  in  the  number  and  coercive  character  of 
UN  peacekeeping  operation  and  of  UN-established 
international  administrations. 

The  Nature  of  Collective  Security 

Collective  security  raises  five  sets  of  questions 
about  its  nature,  with  respect  to  the  participants  in 
collective  security  arrangements,  the  kind  of  secu¬ 
rity  provided,  and  the  forms  of  collective  action  to 
be  taken. 

Security  for  Whom? 

The  traditional  conception  of  collective  security 
was  intended  to  strengthen  the  rights  of  states  to 
independence  and  to  reinforce  an  international 
legal  order  built  around  the  concepts  of  sover¬ 
eignty  and  nonintervention.  On  the  Wilsonian 
view,  one  of  its  chief  attractions  was  that,  in  con¬ 
trast  to  the  balance  of  power,  it  guaranteed  the 
independence  of  all  states,  including  small  and 
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weak  states.  Yet  the  stress  on  reinforcing  the  rights 
of  states  and  on  maintaining  the  sanctity  of  estab¬ 
lished  borders  against  forcible  change  has  given 
rise  to  one  of  the  most  enduring  dilemmas:  how  to 
accommodate  change  and  how  to  prevent  a  collec¬ 
tive  security  organization  from  becoming  an  instru¬ 
ment  for  maintaining  the  status  quo.  The  liberal 
assumption  of  a  shared  interest  in  the  peaceful 
maintenance  of  the  status  quo  has  been  repeatedly 
challenged. 

What  Kind  of  Security? 

Second,  what  kind  of  security  is  embodied  in 
the  phrase  collective  security ?  As  it  developed  in 
the  late  19th  and  early  20th  centuries,  collective 
security  was  conceived  as  a  response  to  the  dangers 
of  formal  interstate  violence  and,  in  particular,  to 
the  problem  of  the  aggressive  use  of  force  by  states. 
Faced  with  the  united  opposition  of  the  interna¬ 
tional  community,  states  would  come  to  accept 
that  aggression  simply  could  not  pay.  Although 
not  as  novel  as  sometimes  claimed,  the  post-Cold 
War  period  has  seen  an  enormous  literature  on  the 
changing  character  of  security  and  the  changing 
dynamics  of  the  global  security  landscape:  the  eas¬ 
ing  of  major  power  rivalry  and  the  emergence  of  a 
wide  range  of  new  security  challenges  connected 
with  civil  wars,  domestic  social  conflicts,  ethnic 
strife,  refugee  crises,  humanitarian  disasters,  and 
transnational  terrorist  threats.  In  addition,  many 
have  argued  that  much  greater  priority  should  be 
given  to  human  security,  rather  than  the  security  of 
states  or  of  regimes.  Such  moves  illustrate  the 
politically  contested  character  of  the  concept  of 
security,  emphasized,  in  particular,  by  critical  secu¬ 
rity  theory.  Collective  security  is  sometimes  under¬ 
stood  analytically  as  a  problem  of  capturing  a 
well-understood  and  broadly  shared  interest  (per¬ 
haps  by  viewing  security  as  a  global  public  good) 
in  the  face  of  the  twin  problems  of  defection  and 
free  riding.  Clearly,  these  problems  are  formidable 
in  a  world  in  which  states  and  government  leaders 
are  faced  with  powerful  incentives  to  protect  their 
immediate  short-term  interests.  But  such  rational¬ 
ist  logic  underplays  the  challenge  of  the  essentially 
contested  nature  of  security.  Very  different  histori¬ 
cal  circumstances  and  divergent  values  mean  that 
there  is  rarely  an  easy  answer  to  the  question  of 
whose  security  is  to  be  upheld  or  against  which 


threats  that  security  is  to  be  promoted.  Together 
with  the  deeper  intervention  required  to  deal  with 
many  new  security  challenges,  this  in  turn  increases 
the  problems  of  legitimacy  as  well  as  the  difficul¬ 
ties  of  securing  the  willingness  of  states  to  commit 
armed  forces  to  conflicts  that  are  often  seen  as 
marginal  to  core  foreign  policy  interests. 

Which  Collectivity? 

Third,  which  collectivity  is  involved  in  “collec¬ 
tive  security”?  There  have  always  been  strong 
arguments  for  the  broadest  possible  membership 
of  a  collective  security  system:  to  ensure  that  the 
power  of  the  collectivity  is  sufficient  to  deter 
aggression  and,  if  necessary,  to  enforce  its  deci¬ 
sions  against  all  states,  and  to  reduce  the  danger 
that  collective  security  will  merely  provide  a 
framework  within  which  power  political  competi¬ 
tion  and  alliance  politics  are  played  out  under  a 
different  guise.  On  the  other  hand,  there  have  also 
been  repeated  arguments  that  an  effective  collec¬ 
tive  security  system  requires  leadership  and  that  an 
effective  collectivity  will  consist  of  a  smaller  group 
of  like-minded  states  with  the  effective  (as  opposed 
to  theoretical)  power  to  enforce  their  decisions. 
Hence,  there  have  been  recurrent  arguments  that 
regionally  based  collective  security  systems  are 
most  likely  to  prove  effective,  and  the  post-Cold 
War  period  has  seen  an  expansion  of  the  role  of 
regional  security  organizations. 

What  Form  of  Collectivity? 

Fourth,  what  form  of  collectivity?  It  is  important 
to  disentangle  two  approaches  to  thinking  about 
collective  security  systems.  In  one  view,  the  domi¬ 
nant  one  in  the  recent  past,  collective  security  has 
been  seen  as  a  means  of  enforcing  order  between 
independent  political  communities,  of  achieving  a 
degree  of  centralization  that  does  not  radically 
threaten  the  independence  and  autonomy  of  states. 
An  alternative,  and  historically  deep-rooted,  con¬ 
ception  has  viewed  moves  toward  the  collective 
management  of  armed  force  as  part  of  a  broader 
process  of  reorganizing  the  political  system  and 
moving  beyond  the  state  system  toward  more  cen¬ 
tralized  or  federal  forms  of  global  political  order. 

The  other  issue  here  concerns  the  dilemma  of 
preponderant  power.  In  theory,  collective  security 
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offers  the  purest  solution  to  the  dilemma  of  pre¬ 
ponderant  power.  Inequality  is  not  to  be  feared, 
opposed,  or  balanced  against  but  is,  instead,  to  be 
harnessed  to  the  legitimate  collective  purposes  of 
the  international  community.  In  practice,  the  situ¬ 
ation  is  more  complex.  On  the  one  hand,  the  veto 
within  the  UN  Security  Council  reflects  the  reality 
of  a  power  distribution  in  which  attempts  to 
coerce  any  of  the  major  powers  of  the  system 
could  be  achieved  only  at  great  risk  and  high  cost. 
On  the  other  hand,  unable  to  command  substan¬ 
tial  military  forces  of  its  own  in  the  ways  envisaged 
in  the  Charter,  UN  enforcement  action  has  oper¬ 
ated  by  means  of  authorizing  the  use  of  force  by 
member  states,  especially  those  with  the  capacity 
to  deploy  effective  military  power.  Such  a  situation 
is  always  likely  to  create  problems  of  effective  del¬ 
egation  and  control  and  to  increase  the  risk  that  a 
system  aimed  at  collective  security  will  in  fact 
become  one  of  selective  security. 

What  Forms  of  Collective  Action ? 

Finally,  what  forms  of  collective  action  are 
envisaged  in  a  collective  security  system?  Collective 
security  involves  a  shared  acceptance  that  a  breach 
of  the  peace  threatens  the  interests  of  all  states.  It 
also  involves  a  shared  willingness  to  act  effectively 
to  enforce  the  law  and  to  protect  the  interests  of 
the  international  community.  Enforcement  has 
very  often  been  seen  as  critical,  but  not  always. 
Much  of  19th-century  liberal  thought,  for  exam¬ 
ple,  believed  fervently  that  the  clear  elaboration  of 
international  law  would  be  crucial.  Others  believed 
in  the  power  of  enlightened  international  opinion. 
The  arguments  for  effective  enforcement  grow 
through  the  century  and  are,  of  course,  given  par¬ 
ticular  force  by  the  catastrophe  of  World  War  I.  At 
the  same  time,  however,  these  older  liberal  views 
never  entirely  faded,  and  there  remained  the  hope 
that  the  paradox  of  fighting  a  war  to  ensure  peace 
could  be  avoided. 

It  should  also  be  noted  that  the  emphasis  on 
deterrence  and  enforcement  places  collective  secu¬ 
rity  analytically  close  to  mainstream  realist  writing 
on  alliances  and  the  balance  of  power.  This  can  be 
contrasted  with  constructivist  writing  on  coopera¬ 
tive  security  and  on  security  communities.  It  is  the 
possible  emergence  of  a  situation  in  which  coop¬ 
eration  goes  beyond  instrumental  calculation  and 


in  which  the  use  of  force  declines  as  a  tool  of  state¬ 
craft  that  opens  the  door  to  constructivist  theories. 
Such  theories  highlight  the  importance  of  histori¬ 
cally  constructed  interests  and  identities,  of  learn¬ 
ing  and  ideational  forces,  and  of  normative  and 
institutional  structures  within  which  state  interests 
are  constructed  and  redefined.  Instead  of  focusing 
solely  on  material  incentives,  constructivists  claim 
that  understanding  inter  subjective  structures 
allows  us  to  trace  the  ways  in  which  interests  and 
identities  change  over  time  and  new  forms  of  secu¬ 
rity  cooperation  and  community  can  emerge. 

Liberals  argue  that,  although  there  is  no  collec¬ 
tive  security  system,  the  collective  element  in  secu¬ 
rity  management  has  increased  and  even  major 
states  need  multilateral  security  institutions  both 
to  share  the  material  and  political  burdens  of  secu¬ 
rity  management  and  to  gain  the  authority  and 
legitimacy  that  the  possession  of  crude  power 
alone  can  never  secure.  Realists  continue  to  stress 
that  peace  is  not  indivisible,  that  states  and  their 
citizens  are  often  unwilling  to  bear  the  costs  of  col¬ 
lective  action  in  conflicts  in  which  their  direct 
interests  are  only  weakly  engaged,  and  that  large 
parts  of  the  global  security  system  continue  to  be 
shaped  by  the  unilateral  pursuit  of  state  interest 
and  by  practices  of  both  the  balance  of  power  and 
hierarchical  power. 
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Colonialism 


Colonialism  is  usually  understood  as  a  political 
doctrine  promoting  and  justifying  the  exploitation 
by  a  colonizing  power  of  a  territory  under  its  con¬ 
trol  either  for  its  own  benefit  or  for  the  benefit  of 
the  colons  settled  in  this  territory.  In  this  sense, 
colonialism  refers  mainly  to  the  unequal  relation¬ 
ships  developed  between  European  colonizers  and 
their  respective  colonial  empires.  This  conven¬ 
tional  definition  has  been  considered  too  restrictive 
by  some  scholars  who,  in  the  past  20  years,  have 
stretched  the  notion.  Colonialism  has  come  to 
include  many  different  kinds  of  unequal  power 
relationships  between  two  countries  (e.g.,  between 
Israel  and  Palestine)  and  between  the  West  and  the 
world  (the  concept  of  colonialism  replacing  to 
some  extent  that  of  imperialism).  Colonialism  can 
also  refer  to  unequal  relationships  between  a 
dominant  majority  group  and  a  minority  group 
that  is  an  indigenous  group  or  considered  to  be  not 
autochthonous  (internal  colonialism).  Colonialism 
has  also  been  used  in  association  with  larger  mod¬ 
ern  political  and  economic  processes  such  as  the 
economic  world  system  since  the  16th  century  or, 
more  generally,  with  a  vision  of  European  “moder¬ 
nity.”  The  polysemy  of  colonialism  is  largely  due 
to  the  renewed  interest,  since  the  1980s,  in  the 
colonial  and  postcolonial  periods  among  literary 
critics,  historians,  anthropologists,  and  political 
scientists.  In  this  entry,  colonialism  is  analyzed  only 
as  a  set  of  complex,  unequal,  and  past  relationships 
linking  European  colonizing  powers  to  colonies 
(for  other  meanings,  see  the  entry  on  postcolonial¬ 
ism).  It  is,  thus,  useful  to  dissociate  colonialism  as 
a  political  doctrine  forged  by  those  writing  during 
the  colonial  period  from  colonialism  as  a  paradigm 


reexamined  by  scholars  since  the  1980s.  In  the  last 
section  of  this  entry,  the  way  postcolonial  thinkers 
have  portrayed  colonialism  is  briefly  explored 
before  other  approaches  examining  colonialism 
from  a  more  political,  social,  and  economic  angle 
are  examined. 

Colonialism  as  a  Political  Doctrine 

The  word  colonialism  is  created  from  the  words 
colonial  and  colony ,  which  have  a  longer  history. 
English  and  French  terms  for  colonial  and  colony 
derive  from  colonus,  the  Latin  word  for  “farmer.” 
The  founding  of  colonies  was  one  of  the  strategies 
the  ancient  Romans  employed  in  establishing  their 
empire.  During  the  Renaissance  and  the  16th- 
century  European  expansion,  the  words  colon  and 
colony  gradually  took  on  their  current  meaning: 
The  word  colon  referred  to  a  person  living  in  a 
colony,  as  opposed  to  an  inhabitant  of  a  European 
colonial  power;  the  world  colony  designated  a  ter¬ 
ritory  dominated  and  administrated  by  a  foreign 
power  or  a  group  of  settlers;  the  verb  to  colonize 
gradually  came  to  mean  “to  conquer  a  territory.” 
At  different  periods  but  mainly  in  the  second  half 
of  the  18th  century,  the  words  colonial ,  colonize , 
and  colonization  appeared  in  English  and  were 
then  translated  into  French. 

The  term  colonialism  obviously  derived  from 
colonial,  but  it  appeared  later,  in  the  framework  of 
19th-century  imperialism.  It  first  appeared  in 
English  around  the  middle  of  the  19th  century  and 
was  used  to  mean  practices  or  idioms  peculiar  to, 
or  characteristic  of,  a  colony.  In  1886,  it  was  used 
to  mean  the  colonial  system  or  principle,  thus 
referring  to  colonialism  as  a  systematic  and  wide- 
ranging  phenomenon.  In  France,  it  also  followed 
the  pace  of  overseas  expansion.  The  word  colonist, 
common  after  the  conquest  of  Algeria  in  1830, 
referred  specifically  to  a  partisan  of  the  coloniza¬ 
tion  of  Algeria,  while  anticolonist  referred  to 
opponents  of  this  process  of  colonization.  This 
specific  use  did  not  last,  however,  and  colonialism/ 
anticolonialism  cam e  to  replace  the  terms  colonist/ 
anticolonist  in  the  early  20th  century. 

It  is  thus  not  really  surprising  that  colonialism 
became  a  political  doctrine  promoted  by  theorists 
and  defended  by  interest  groups  only  in  the  19th 
century,  during  the  heyday  of  imperialism. 
Colonialism  can  hardly  be  understood  without  the 
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element  of  self-legitimation  inherent  in  it.  The 
most  powerful  form  of  self-legitimating  was  the 
colonizer’s  claim  of  improving  the  conquered 
country  and  bringing  the  fruits  of  progress  and 
modernity  to  the  subject  peoples.  All  European 
powers  claimed  to  pursue  a  civilizing  project  in 
their  colonies,  but  they  used  different  terminolo¬ 
gies  to  this  end.  In  the  19th  century,  it  was  called 
improvement ,  betterment ,  or  moral  and  material 
progress.  All  these  terms  were  then  subsumed 
under  the  term  civilizing  mission,  which  became 
the  imperial  ideology  and  official  doctrine  from 
the  late  19th  century  onward.  Parallel  to  this  his¬ 
torical  trend,  there  was  an  increasing  feeling  of 
racial  superiority  in  European  nations,  along  with 
the  development  of  a  positivist  approach  in  the 
natural  and  human  sciences  associated  with  an 
increasing  obsession  with  classifying  people, 
plants,  and  animals.  Some  Western  philosophers, 
academics,  writers,  and  politicians  developed  a 
new  vision  of  the  world.  There  were  continents 
with  history  and  those  without  history  (Friedrich 
Hegel,  1770-1831);  there  were  superior  and  infe¬ 
rior  races  to  colonize  (Jules  Ferry,  1832-1893), 
fortunate  and  less  fortunate  races  to  educate 
(Rudyard  Kipling,  1865-1936),  inferior  and  supe¬ 
rior  languages  (Ernest  Renan,  1823-1892),  and  a 
primitive  or  prelogical  mode  of  thought  and  a 
logical  one  (Lucien  Levy-Bruhl,  1857-1939). 

In  the  19th  century,  however,  colonialism  as  a 
political  ideology  was  not  yet  a  unifying  body  of 
knowledge.  In  the  British  Empire,  the  aim  for 
moral  improvement  and  material  betterment  of 
society  became  common  among  a  generation  of 
administrators  guided  by  the  18th-century  legal 
reforms  of  Warren  Hastings  (1732-1818),  Lord 
Cornwallis  (1738-1805),  and  Jeremy  Bentham 
(1748-1832).  Gradually,  British  historiography 
started  to  describe  colonization  as  a  history  of 
progress.  This  is  well  exemplified  by  James  Mill’s 
History  of  British  India,  an  early  and  influential 
piece  of  colonial  historiography  that  was  pub¬ 
lished  in  1818.  In  contrast,  a  specific  doctrine  of 
modern  colonization  emerged  in  France  only  in  the 
early  years  of  the  Third  Republic  (1871-1940). 
Following  Germany’s  defeat  of  France  in  1870- 
1871  and  France’s  loss  of  the  two  provinces  of 
Alsace  and  Lorraine,  some  disparate  groups  devel¬ 
oped  a  common  set  of  arguments  to  justify  over¬ 
seas  expansion.  In  1874,  the  economist  Paul 


Leroy-Beaulieu  (1843-1916)  advanced  the  view  in 
his  famous  book  De  la  colonisation  chez  les  peu- 
ples  modernes  (Colonization  among  modern 
nations)  that  investing  in  colonies  was  the  best 
business  for  an  old  and  rich  country  such  as 
France.  The  most  influential  Republican  leaders, 
Leon  Gambetta  (1838-1882)  and  Jules  Ferry,  also 
defended  colonial  expansion  as  a  way  to  recover 
from  the  events  of  the  1870s.  While  colonization 
initially  divided  monarchists,  republicans,  and 
radicals  in  France,  no  political  parties  in  Europe 
opposed  colonization  prior  to  World  War  I. 

The  making  of  20th-century  colonialism  was  a 
process  driven  by  a  set  of  intellectuals,  writers,  and 
academics  and  by  a  new  administrative  machinery 
(the  colonial  office)  assisted  by  experts  (ethnogra¬ 
phers,  ethnologists,  and  later  on  anthropologists 
and  sociologists)  and  increasingly  after  World  War 
II  by  technicians  (medical  officers,  educationists, 
town  planners,  welfarists,  etc.).  While  the  idea  of 
progress  was  very  common  in  justifications  of 
colonial  conquest  in  the  19th  century,  a  consider¬ 
able  debate  emerged  in  the  early  colonial  days  of 
administration  concerning  what  kind  of  colonial 
rule  was  desirable.  Assimilation,  association,  and 
indirect  rule  are  generally  considered  to  be  the 
three  major  colonial  doctrines  that  emerged 
between  the  end  of  the  19th  century  and  the 
1920s.  This  debate  became  most  intense  in  France 
and  Germany;  it  was  more  restricted  in  the  United 
Kingdom,  where  there  was  general  agreement  that 
indirect  rule  should  be  the  underlying  philosophy 
of  the  system  of  government.  The  doctrine  of 
assimilation  was  based  on  the  heritage  of  the 
French  Revolution  and  the  belief  in  the  equality  of 
all  people  irrespective  of  their  racial  origin  or  cul¬ 
tural  background.  Colonized  people  had  to  prove 
themselves  worthy  of  assimilation  by  demonstrat¬ 
ing  to  the  authorities  that  they  had  the  attributes 
needed  for  citizenship.  Practically,  however,  access 
to  full  rights  of  French  citizenship  was  highly 
restricted,  mainly  to  the  inhabitants  of  four  com¬ 
munes  in  Senegal  (Dakar,  Goree,  Rufisque,  and 
Saint-Louis)  and  to  a  limited  number  of  educated 
people  (evolves)  before  World  War  II.  In  contrast, 
the  doctrine  of  indirect  rule  was  inspired  by  the 
idea  that  Europeans  and  Africans  were  culturally 
distinct,  and  indigenous  political  institutions  were 
necessary  for  the  purposes  of  local  government. 
This  was  conceptualized  by  Lord  Frederick  Lugard, 
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Governor  General  of  Nigeria  from  1914  to  1919, 
in  his  Dual  Mandate  in  Tropical  Africa,  published 
in  1922.  Rather  than  preserving  the  precolonial 
polity,  indirect  rule  largely  modified  the  public 
authority  of  traditional  chiefs  to  ensure  the  exploi¬ 
tation  of  the  country  through  means  such  as  the 
introduction  of  taxes  and  forced  labor  and  the 
modification  of  customary  law.  The  differences  in 
colonial  doctrines  should  not  be  overstated.  For 
instance,  association  was  mainly  another  word  for 
indirect  rule  in  French  colonial  theory.  In  the 
administrative  offices  within  the  colonies  them¬ 
selves,  colonial  theory  was  less  important  than  in 
European  capitals;  most  of  the  time,  colonial 
administration  used  indirect  rule  without  acknowl¬ 
edging  it  as  the  cheapest  way  to  govern  colonies. 

The  participation  of  soldiers  from  European 
colonies  in  the  two  World  Wars,  the  multiplication 
of  colonial  exhibitions  all  over  Europe,  the  increas¬ 
ing  number  of  colonial  magazines,  and  the  teach¬ 
ing  of  the  history  and  geography  of  the  empires  in 
schools  generated  popular  support  for  European 
colonization  between  the  two  wars.  The  legitimacy 
of  colonialism  was  little  contested  at  the  time. 
Anticolonialism  was  limited  to  a  few  writers  and 
to  the  extreme  left  (although  the  Communist  Party 
in  general  was  not  really  anticolonial  when  it  was 
in  government).  It  was  only  during  and  after 
World  War  II,  with  the  rise  of  nationalism  in  Asian 
and  African  countries,  that  colonialism  became 
radically  contested  both  in  colonizing  and  in  colo¬ 
nized  countries.  Providing  different  rights  to  differ¬ 
ent  peoples  on  the  basis  of  their  racial  classifica¬ 
tion  became  illegitimate  after  the  defeat  of  the 
Nazis  and  the  rise  of  a  bipolar  world.  In  addition 
to  the  revolutionary  anticolonialism  of  the  far  left, 
there  was  a  broader  moral  opposition  to  violence 
arising  in  the  colonial  context,  such  as  that  in 
Madagascar,  Kenya,  and  Algeria. 

The  various  European  colonial  powers  reacted 
differently  to  the  new  postwar  order.  Portugal 
decided  not  to  concede  independence  or  freedom 
to  its  African  colonies,  while  other  European  gov¬ 
ernments  granted  new  political  and  social  rights 
and  tried  to  reshape  a  more  legitimate  colonialism 
under  the  new  ideology  of  development  through 
technology,  public  health,  and  economic  and  local 
government  reforms.  Subjects  were  transformed 
into  citizens,  and  they  began  to  demand  the  same 
social  and  political  rights  as  those  in  the  European 


colonizing  countries.  The  cost  of  this  new  dispen¬ 
sation,  however,  was  too  high  for  postwar 
European  budgets;  these  economic  concerns, 
together  with  the  rise  of  nationalism,  convinced 
European  leaders  to  give  up  formal  political  ties 
with  their  former  colonies.  In  less  than  15  years, 
colonialism  ceased  to  be  a  legitimate  political  doc¬ 
trine  and  ultimately  disappeared  in  the  early 
1960s.  It  survived  only  in  Portugal,  in  Portuguese 
colonies  and  in  settler  colonies,  where  it  was 
described  sometimes  as  internal  colonialism  (as, 
e.g.,  in  South  Africa  or  Rhodesia).  Elsewhere,  the 
notion  of  colonialism  and  anticolonialism  was 
replaced  by  neocolonialism,  a  neologism  used  to 
describe,  and  more  often  to  denounce,  the  condi¬ 
tions  under  which  former  colonies  continued  to 
serve  the  economic,  political,  military,  and  other 
interests  of  powerful,  mostly  Western,  countries. 
The  concept  of  neocolonialism  was  popularized  by 
the  new  independent  leader  of  Ghana,  Kwame 
Nkrumah,  in  his  1964  book  Neo-Colonialism,  the 
Last  Stage  of  Imperialism. 

Colonialism  as  a  Contested  Paradigm 

When  colonialism  was  a  highly  contested  political 
issue  in  the  1950s,  the  French  sociologist  Georges 
Balandier  wrote  a  pioneering  article  on  the  colo¬ 
nial  situation  (1951),  in  which  he  explained  the 
necessity  of  understanding  colonization  as  both  a 
social  phenomenon  and  a  specific  historical  pro¬ 
cess.  Balandier’s  recommendation  was  not  fol¬ 
lowed,  however,  as  researchers  in  the  1960s  and 
1970s  were  no  longer  interested  in  a  political  ide¬ 
ology  that  had  been  banished  from  the  realm  of 
legitimate  forms  of  political  organization.  As  men¬ 
tioned  by  Frederick  Cooper,  there  is  something 
strange  about  the  writing  of  colonialism:  Scholarly 
interest  in  colonialism  arose  when  colonial  empires 
had  already  lost  their  international  legitimacy. 
This  rediscovery  is  partly  explained  by  the  rise,  in 
academic  institutions  of  the  English-speaking 
world,  of  what  has  been  labeled  as  postcolonial 
studies. 

During  the  first  decades  after  independence,  the 
ideology  of  progress  and  modernization  (then  per¬ 
ceived  to  be  Western)  was  largely  shared  by  the 
political  and  intellectual  elites  of  former  colonies. 
However,  the  various  projects  of  modernization 
of  the  state  and  the  economy  implemented  after 
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independence  were  seen  by  critics  as  inefficient  and 
socially  unjust.  It  is  against  this  background  that  a 
critique  of  colonialism  reemerged  within  a  group 
of  intellectuals  from  the  former  colonial  world, 
who  were  mostly  educated  in  Western  academic 
institutions.  Edward  Said’s  book  Orientalism ,  pub¬ 
lished  in  1978,  is  often  considered  to  be  a  starting 
point  for  the  analysis  of  colonialism  as  a  specific 
configuration  of  knowledge  and  power.  Influenced 
by  the  work  of  Michel  Foucault,  Said  argues  that 
the  intellectual  construction  of  the  Orient  by 
Western  scholars  has  served  as  an  implicit  justifica¬ 
tion  for  the  colonial  and  imperial  ambitions  of 
European  nations  and  the  United  States.  Valentin 
Mudimbe,  10  years  later,  proposed  a  similar  tex¬ 
tual  and  cultural  reading  of  the  colonial  domina¬ 
tion  of  Africa,  asserting  that  Africa  was  invented 
by  a  colonial  and  anthropological  discourse  and 
this  process  was  a  way  for  Europeans  to  build  their 
own  identity.  Also  important  in  the  critique  of  the 
European  colonial  ideology  was  the  initiative 
launched  by  a  group  of  Indian  historians  such  as 
Ranajit  Guha,  Partha  Chatterjee,  Gyanendra 
Pandey,  and  Dipesh  Chakrabarty,  who  contested 
the  nationalist  historiography  of  India.  In  1982, 
they  initiated  a  series  of  edited  books  titled 
Subaltern  Studies ,  which  focused  on  the  subaltern 
groups  (or  oppressed  people)  of  colonial  India.  The 
central  notion  of  Subaltern  Studies  is  the  notion  of 
“agency,”  which  includes  autonomy  of  the  subal¬ 
tern  in  the  political  arena  and  a  consciousness  of 
self  that  is  not  controlled  by  the  Western  elites  and 
their  nationalist  counterparts,  who  have  suppos¬ 
edly  adopted  the  values  of  their  colonizers.  This 
agency  can  be  particularly  understood  in  the  words 
and  knowledge  of  subaltern  people. 

The  major  aim  of  postcolonial  studies  was  to 
deconstruct  a  Western  epistemology  associated  with 
colonialism  that  reified  the  non-Western  world  in  an 
ahistorical  moment.  The  result  was  a  critique  of  the 
civilizing-mission  discourses;  the  works  of  Aime 
Cesaire,  Discourse  on  Colonialism  (1953);  Albert 
Memmi,  The  Colonizer  and  the  Colonized  (1957); 
and  Franz  Fanon,  The  Wretched  of  the  Earth  (1961) 
received  new  attention  from  the  1980s  onward  as 
texts  that  pioneered  the  denunciation  of  the  perni¬ 
cious  cultural  and  psychological  effects  of  colonial¬ 
ism.  This  rediscovery  has  largely  popularized  the 
idea  that  colonized  people  have  no  other  story  than 
that  of  colonial  oppression. 


According  to  the  historian  Frederick  Cooper, 
a  significant  part  of  postcolonial  thought  has 
taken  colonial  studies  out  of  its  historical  con¬ 
text,  treating  colonialism  abstractly,  generically, 
as  located  somewhere  between  1492  and  the 
1970s,  and  as  something  to  be  juxtaposed  with 
an  equally  flat  vision  of  European  “modernity.” 
Moreover,  colonial  societies  should  not  be  under¬ 
stood  merely  in  terms  of  a  “European  versus 
indigenous”  dichotomy,  and  the  actions  of  for¬ 
mer  colonized  people  should  not  be  reduced  to 
merely  resistance  to  colonialism  or  collaboration 
with  the  colonial  order.  Similarly,  when  colonial¬ 
ism  is  portrayed  as  a  power-demarcating  and 
-racializing  space,  the  complexities  of  the  colo¬ 
nial  situation  are  ignored  or  one  tends  to  neglect 
the  internal  divisions  within  European  and  indig¬ 
enous  communities  alike. 

The  analysis  of  the  relationship  between  colo¬ 
nialism  and  biomedicine  can  be  used  as  an  example 
to  explain  the  capacity  of  colonialism  to  produce 
both  a  contradictory  and  a  generic  discourse  on 
colonized  people.  According  to  Megan  Vaughan, 
who  traced  the  emergence  of  various  medical  dis¬ 
courses  in  Africa,  there  was,  on  the  one  hand,  a 
long  tradition,  elaborated  by  Christian  missionar¬ 
ies,  that  viewed  the  “primitiveness”  of  African 
societies  as  a  factor  predisposing  them  to  certain 
diseases.  On  the  other  hand,  research  on  the  rela¬ 
tionship  between  colonialism  and  biomedicine 
from  the  interwar  period  indicated  that  Africans 
got  sick  because  they  forgot  who  they  were: 
Urbanization,  industrialization,  and  deculturation 
were  seen  as  factors  leading  to  insanity,  sexually 
transmitted  diseases,  or  leprosy.  Taken  together, 
these  contradictory  viewpoints  show,  however,  a 
fundamental  difference  in  the  emergence  of  bio¬ 
medical  discourse  in  Europe:  Africans  were  always 
conceived  of  as  members  of  a  collectivity — as  colo¬ 
nial  people — and  beyond  that  as  members  of  col¬ 
lectivities — in  the  form  of  tribes  or  cultural  groups — 
and  this  led  to  different  views  in  tracing  the  rela¬ 
tionship  between  colonialism  and  biomedicine. 

This  reification  of  groups  by  colonialism  has 
been  contested  by  social  historians  and  anthro¬ 
pologists.  Groups  and  categories  under  colonial 
rule  (chiefs,  the  educated  elite,  traders,  peasants, 
workers,  and  also  women,  youth,  and  elders) 
pursued  their  own  agendas,  defended  their  own 
interests,  and  consequently  changed  the  limits  of 
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subordination  within  the  colonial  system.  In  this 
sense,  opposing  too  markedly  the  “bourgeoisie” 
and  the  “working  class”  or  the  “elite”  and  the 
“people”  includes  the  risk  of  reproducing  the  old 
colonial  dichotomy  as  well  as  creating  new  ones 
(opposing  the  modern  to  the  traditional).  A  large 
part  of  African  and  Indian  historiography  in  the 
past  3  decades  has  tried  to  overcome  this  vision 
present  in  colonialism  by  looking  at  the  agency  of 
individuals  and  groups,  the  contingent  nature  of 
their  connections  and  networks,  and  their  respective 
roles  in  shaping  power  relations.  Actually,  colonial¬ 
ism  did  not  produce  only  differences  between  colo¬ 
nizers  and  the  colonized.  It  also  produced  individu¬ 
als  with  varying  investments  in  a  range  of  identities, 
sexual  identities,  class  identities,  religious  identities, 
and  ethnic  identities.  Similarly,  colonialism  not  only 
led  to  conflicts  between  colonizers  and  the  colo¬ 
nized,  it  was  also  decisive  in  shaping  new  roles,  divi¬ 
sions,  and  conflicts  between  elders  and  youth,  men 
and  women,  and  migrants  and  urban  dwellers. 

The  understanding  of  conflicting  colonial  reper¬ 
toires  helps  explain  why  colonialism  was  not  per¬ 
vasively  efficacious  in  implementing  moderniza¬ 
tion.  According  to  Nicholas  Thomas  (1994), 
“colonialism  was  not  a  unitary  project  but  a  frac¬ 
tured  one,  riddled  with  contradictions  and 
exhausted  as  much  by  its  internal  debates  as  by  the 
resistance  of  the  colonized”  (p.  51).  Colonized 
people  were  able  to  turn  the  discourse  of  modern¬ 
ization  into  a  language  of  claims  concerning  the 
obligations  of  colonial  powers.  For  instance,  when 
development  emerged  as  a  colonial  project  with 
aid  from  the  former  colonial  power,  it  did  so  in  the 
face  of  serious  objective  challenges  from  the  West 
Indies  and  Africa  in  the  1930s  and  the  1940s. 

The  various  forms  of  present-day  oppressions  in 
some  former  colonies  have  sometimes  been  regarded 
as  a  legacy  of  colonialism.  For  instance,  Mahmood 
Mamdani  (1996)  argues  that  colonial  rule  created 
a  “bifurcated  state”  that  distinguished  between  a 
despotic  tribal  power  in  rural  areas  and  a  demo¬ 
cratic  civil  society  based  in  towns  and  cities. 
According  to  Mamdani,  this  duality  of  power  has 
been  a  major  obstacle  to  democratization  in  postin¬ 
dependence  Africa.  However,  historians  have 
shown  that  colonial  power  was  limited  by  tribal 
chiefs’  obligation  to  ensure  the  well-being  of  their 
community  to  maintain  the  legitimacy  on  which 
colonial  authorities  depended.  As  mentioned  by 


Thomas  Spear  (2003),  colonial  dependence  on 
chieftaincy  often  limited  colonial  power  as  much  as 
facilitating  it.  The  reification  by  colonialism  of  cat¬ 
egories  of  citizen/subject  and  urban/rural  overstates 
the  role  of  urban  and  elite  power  and  undervalues 
the  strategic  place  of  rural  constituencies  and  the 
importance  of  urban-rural  links  in  the  making  of 
African  politics.  Eventually,  drawing  a  direct  causal 
connection  between  the  indirect  rule  of  the  1920s 
and  1930s  and  the  politics  of  authoritarianism  and 
ethnicity  in  the  1980s  and  1990s  fails  to  see  what 
lies  in  between,  especially  the  explosion  of  citizen¬ 
ship  in  the  final  years  of  colonial  rule. 

Historians,  anthropologists,  and  political  scien¬ 
tists  interested  in  understanding  the  everyday  prac¬ 
tices  of  colonialism  have  changed  their  framework 
of  analysis  to  take  account  of  these  issues.  They 
today  insist  on  the  necessity  of  looking  simultane¬ 
ously  at  the  colonizing  countries  and  the  colonies 
and  the  process  by  which  each  entity  affected  the 
political  transformation  of  the  whole  empire.  This 
marks  an  important  break  with  former  imperial 
historiography,  which  had  long  treated  colonies  as 
marginal  to  a  history  that  remained  national  or  as 
a  projection  of  national  culture  and  power. 
Empirical  evidence  now  suggests  that  new  forms  of 
administration,  town  planning,  architecture,  polic¬ 
ing,  and  medical  practice,  to  mention  just  a  few 
examples,  were  not  only  a  one-way  imposition  of 
norms  coming  from  the  metropolis  but  were  also 
coproduced  in  the  colonies  and  contribute  to 
larger  debates  about  the  nature  of  colonialism  and 
its  impact.  Colonialism  profoundly  shaped  both 
colonized  and  colonizers’  society. 
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Social  scientists  call  goods  all  the  resources  that 
people  use  in  their  life.  A  well-established  taxon¬ 
omy  of  goods  distinguishes  them  on  the  basis  of 
two  attributes:  excludability  and  rivalry.  The  first 
attribute  refers  to  the  right  of  the  owner  to 
exclude  other  individuals  from  the  consumption 
of  the  goods.  The  second  means  that  the  con¬ 
sumption  of  a  unit  by  one  individual  prevents  the 
consumption  of  the  same  unit  by  another.  Private 
goods,  with  which  basic  microeconomics  is  con¬ 
cerned,  are  excludable  and  rivalrous.  Goods  that 
lack  one  or  both  of  these  attributes  are  called 
common  goods  by  political  analysts.  This  entry 
discusses  the  attributes  of  common  goods,  a  vari¬ 
ety  of  forms  of  these  goods,  and  ways  in  which 
they  can  be  managed. 


Three  different  types  of  common  goods  have 
been  envisaged  since  the  1950s.  Economists  have 
drawn  attention  to  public  goods,  which  are  nonex¬ 
cludable  and  nonrivalrous,  such  as  national 
defense,  traffic  lights,  air  to  breathe,  and  eradica¬ 
tion  of  contagious  diseases.  More  recently  toll 
goods  or  club  goods  have  been  considered,  that  is, 
nonrivalrous  and  excludable  goods,  such  as  high¬ 
ways,  cable  television,  shooting  or  fishing  permits, 
and  museums.  Finally,  environmental  scientists 
and  political  analysts  have  envisaged  common 
pool  resources  (CPRs)  or  commons,  which  are 
nonexcludable  but  subject  to  rivalry,  such  as  fish¬ 
eries,  free  meadows,  hunting  game,  and  ground- 
water  basins.  Table  1  illustrates  the  positions  of 
goods  in  the  taxonomy. 

Nonexcludability  and/or  nonrivalry  of  common 
goods  pose  a  series  of  problems  for  collective 
action  that  have  been  extensively  examined. 

Collective  Action  for  Public  Goods 

Political  scientists  began  to  study  the  role  of  groups 
in  politics  at  the  beginning  of  the  20th  century. 
With  time,  a  scholarly  tradition  became  estab¬ 
lished,  and  a  group  theory  was  proposed  that 
implicitly  assumes  that  whenever  all  the  individu¬ 
als  in  a  group  have  the  same  aim,  they  will  all  act 
to  achieve  that  aim.  Mancur  Olson,  in  his  book 
The  Logic  of  Collective  Action ,  was  the  first  to 
argue  that  this  apparently  obvious  assertion  is  in 
general  untrue  for  public  goods. 

Olson’s  argument  is  that  the  attribute  of  nonex¬ 
cludability  implies  that  no  individual  member  of  a 
group  has  an  incentive  to  contribute  voluntarily  to 
the  provision  of  a  public  good,  as  each  would  get 
his  or  her  part,  in  any  case,  once  it  came  into  exis¬ 
tence.  Irrespective  of  the  nature  of  the  group  (i.e.,  a 
rural  community,  a  town,  a  union,  or  a  nation),  this 
free  riding  behavior  may  spread  among  individuals, 
so  that  a  group  may  be  unable  to  give  rise  to  a 


Table  1 

Taxonomy  of  Goods 

Excludable 

Nonexcludable 

Rival 

Private  goods 

CPRs  (commons) 

Nonrival 

Toll  goods  (club  goods)  Public  goods 

Note:  CPRs  =  common  pool  resources. 
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single  unit  of  public  good,  no  matter  how  impor¬ 
tant  it  might  be  for  the  group.  Consequently,  the 
existence  of  a  public  good  depends  on  the  capacity 
of  the  group  to  prevent  free  riding,  which  means 
detecting  and  punishing  people  who  do  not  coop¬ 
erate  in  its  provision.  This  is  easiest  when  the  num¬ 
ber  of  group  affiliates  is  small,  because  they  know 
each  other  personally,  interact  frequently,  and  can 
regularly  monitor  their  respective  actions.  Because 
of  this  capacity  to  surmount  opportunistic  behav¬ 
ior,  small  groups  are  privileged  and  are  able  to 
activate  and  stabilize.  Large  groups,  on  the  con¬ 
trary,  are  anonymous,  so  that  it  may  be  difficult 
for  partners  to  detect  and  punish  free  riding.  This 
is  why  large  groups  are  difficult  to  mobilize  and 
may  remain  latent. 

The  Tragedy  of  the  Commons 

For  goods  with  the  rivalry  attribute,  a  unit  of  CPR 
taken  by  one  person  is  taken  away  from  another 
person.  For  goods  with  the  nonexcludability  attri¬ 
bute,  no  one  can  singularly  claim  property  rights. 
Because  CPRs  have  both  these  features,  they  not 
only  have  the  problem  of  free  riding,  shared  with 
nonexcludable  public  goods,  but  also  are  vulnera¬ 
ble  to  intense  exploitation,  which  may  completely 
erode  their  value.  Garrett  Hardin  drew  attention 
to  this  problem  in  a  1968  article  published  in 
Science ,  where  he  gave  the  example  of  an  open 
pasture.  Because  no  individual  property  right  can 
be  claimed  on  its  land,  each  herdsman  will  try  to 
keep  as  many  cattle  as  possible  on  the  pasture. 
This  spontaneous  arrangement  might  last  for  cen¬ 
turies,  as  long  as  wars  and  diseases  keep  the  popu¬ 
lation  of  both  men  and  cattle  below  the  highest 
level  that  the  meadow  can  sustain.  However,  were 
these  traditional  evils  to  be  defeated,  the  popula¬ 
tion  of  herdsmen  and  cattle  would  grow  beyond 
that  threshold,  and  a  new  evil,  called  by  Hardin 
the  tragedy  of  the  commons,  would  emerge. 

The  catastrophe  by  no  means  depends  on  the 
carelessness  of  the  herdsmen.  Deciding  whether  to 
introduce  a  new  unit  within  the  pasture,  a  perfectly 
rational  herdsman  will  weigh  up  expected  costs 
and  benefits:  Introducing  an  additional  beast  gives 
him  the  opportunity  to  add  nearly  the  value  of  this 
unit  to  his  income,  while  the  cost  of  overgrazing 
produced  by  that  new  unit  generates  only  a  little 
erosion  of  the  free  grass,  a  loss  that  furthermore  is 


distributed  among  all  herdsmen.  This  reasoning 
can  be  extended  to  any  one  additional  beast  and 
to  any  herdsman.  Therefore,  individual  rational¬ 
ity  pushes  each  herdsman  to  increase  his  herd 
without  limit,  progressively  reducing  the  likeli¬ 
hood  that  any  animal  will  find  enough  food  to 
survive,  and  finally,  leading  all  the  herdsmen  to 
complete  ruin. 

Managing  Common  Goods 

In  spite  of  the  tragedy  of  the  commons  and  the 
problems  with  collective  action,  groups  of  any  size 
have  always  existed  in  human  societies,  delivering 
common  goods  to  their  members  in  a  more  or  less 
efficient  manner.  Of  course,  this  is  well  known  to 
scholars,  who  have  pondered  the  various  ways  in 
which  practical  politics  has  adapted  to  cope  with 
the  problems  that  theoretical  analysis  has  envisaged. 

To  begin  with,  both  CPRs  and  public  goods  are 
nonexcludable,  which  implies  exposure  to  free  rid¬ 
ing.  Then,  the  theoretical  insights  considered  so  far 
would  be  corroborated  if  the  strategies  of  practical 
politics  were  to  be  aimed  at  increasing  the  cost  of 
free  riding  or  at  giving  to  common  goods  some 
attributes  of  excludability. 

The  first  strategy  amounts  to  making  use  of  the 
coercive  force  of  the  state — whether  or  not  legiti¬ 
mized  by  democratic  procedures  and  majority 
endorsement — in  order  to  deter  opportunistic 
behavior.  This  strategy  is  actually  largely  adopted 
by  all  public  authorities  under  all  regimes,  as  the 
reality  of  enforced  taxation  makes  clear  beyond 
any  doubt.  However,  the  alternative  strategy  of 
providing  common  goods  with  the  attribute  of 
excludability  is  no  less  important.  Convinced  that 
CPRs  generate  social  evils,  Hardin  maintained  that 
the  history  of  civilization  is  that  of  limiting  the 
“freedom  of  the  commons,”  introducing  property 
rights  by  law,  and  transforming  them  into  private 
goods,  as  happens  when  concessions  are  granted. 
Also,  public  goods  can  be  equipped  with  elements 
of  excludability.  This  is  accomplished  when  public 
authorities  introduce  fees  or  tickets  that  transform 
public  goods  into  toll  goods.  Appealing  to  this 
strategy,  Olson  is  able  to  explain  how  large  groups 
succeed  in  providing  their  members  with  public 
goods  when  no  mutual  control  is  available  to 
defeat  free  riding.  His  explanation  is  based  on  the 
role  of  selective  incentives.  These  are  benefits  that 
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the  group  offers  to  each  potential  member  as  an 
inducement  to  join  and  bear  a  share  of  the  collec¬ 
tive  enterprise.  Thus,  large  latent  groups  have  the 
opportunity  to  mobilize  and  gain  their  public 
goods  as  by-products  of  selective  incentives. 

An  Alternative  Way  to  Cope  With  CPRs 

The  strategies  so  far  considered  by  which  groups 
can  provide  their  affiliates  with  common  goods 
imply  an  institutional  change,  toward  either  cen¬ 
tralization  or  privatization.  However,  institutional 
changes  are  costly,  and  in  both  cases,  further 
analysis  of  additional  costs  is  needed.  This  has 
been  clearly  pointed  out  by  Elinor  Ostrom  in  her 
book  Governing  the  Commons ,  devoted  to  the 
analysis  of  CPRs.  Her  first  point  is  that  only  public 
authorities  are  entitled  to  introduce  property  rights 
on  CPRs.  That  means  that  creating  a  market  for 
commons  implies  the  costs  that  public  authorities 
must  bear  to  establish  an  agency  to  define  market 
rules  and  control  their  effectiveness  and  efficiency. 
On  the  other  hand,  if  property  rights  remain  in 
government  hands,  one  must  add  the  cost  of 
detecting  defectors  as  well  as  the  costs  of  errors, 
such  as  mistakenly  sanctioning  those  who  cooper¬ 
ate  or  failing  to  sanction  defectors. 

From  all  this,  the  conjecture  emerges  that  full 
privatization  or  full  centralization  imposed  by 
external  authorities  may  be  a  questionable  means 
of  coping  with  the  problems  with  CPRs  and  that 
institutional  arrangements  provided  by  the  com¬ 
moners  themselves  may  offer  better  solutions.  To 
examine  this  hypothesis,  Ostrom  makes  a  thought¬ 
ful  analysis  of  the  decision  processes  in  many 
empirical  cases  of  CPRs  ruled  by  self-governing 
communities,  such  as  forest  tenures,  irrigation  sys¬ 
tems,  or  inshore  fisheries  from  various  countries  in 
different  continents  with  different  levels  of  eco¬ 
nomic  development.  Her  scrutiny  reveals  that 
people  interacting  in  a  CPR  can  avoid  resource 
erosion  through  a  system  of  rules  contingent  on 
the  specific  situation,  monitored  by  accountable 
individuals,  and  regulated  by  graduated  sanctions. 
However,  to  realize  such  an  institutional  change, 
the  individuals  concerned  must  know  one  another 
through  other  repeated  interaction,  so  that  over 
time,  they  can  accumulate  the  social  capital  neces¬ 
sary  to  discover  shared  norms  promoting  mutual 
cooperation.  These  conditions  are  more  likely  to 


be  met  in  the  type  of  small-scale  CPRs  investigated 
in  Ostrom’s  book. 

It  is  important  to  underline  the  fact  that  Ostrom 
does  not  deny  the  relevance  of  the  problems  that 
Hardin  and  Olson  have  pointed  out  with  common 
goods.  Her  aim,  more  limited  and  yet  extremely 
important  for  political  analysis,  is  to  show  that  the 
assumption  that  problems  with  common  goods 
can  be  solved  only  by  external  authorities  impos¬ 
ing  privatization  or  centralization,  although  wide¬ 
spread  among  policy  analysts,  is  invalid. 

Social  analysts  have  scrutinized  common  goods 
using  contributions  of  increasing  complexity. 
Hardin’s  approach  exemplifies  the  environmental 
scientists’  concern  for  the  ultimate  tragic  conse¬ 
quences  of  the  commons.  Olson  enhances  Hardin’s 
analysis,  paying  due  attention  to  decision  pro¬ 
cesses  and  proposing  an  economic  model  where 
individual  incentives  interact  with  a  given  institu¬ 
tional  framework.  Ostrom  considers  many  aspects 
concerning  the  provision  and  maintenance  of  com¬ 
mon  goods.  Her  analysis  strongly  suggests  that 
designing  the  necessary  institutional  change  and 
considering  its  costs  should  not  be  omitted  when 
modeling  decisional  processes  concerning  human 
attempts  to  attain  collective  benefits.  Her  teach¬ 
ings  have  been  very  influential,  and  subsequent 
research  on  common  goods  has  followed  her  ana¬ 
lytic  framework,  focusing  on  institutional  design 
and  multilevel  governance.  Thus,  it  has  been  pos¬ 
sible  to  extend  her  method  far  beyond  the  small- 
scale  CPRs  to  research  into  the  international  gov¬ 
ernance  of  common  goods  such  as  the  Internet  and 
financial  markets  and  environmental  global  com¬ 
mons  such  as  Antarctica,  the  oceans,  and  the 
atmosphere. 
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Communism 


After  the  fall  of  the  Berlin  wall  in  1989  and  the 
collapse  of  the  Soviet  Union  in  1991,  it  seemed 
that  communism  belonged  only  to  the  past  and 
was  a  more  appropriate  subject  of  research  for 
historians  than  for  political  scientists.  This  impres¬ 
sion  is  wrong.  Communism  is  directly  relevant  to 
political  science,  not  only  because  communist 
regimes  still  exist  in  Cuba,  China,  and  elsewhere 
in  Asia,  but  also  because  an  adequate  account  of 
communism  must  include  an  explanation  of  why 
it  is  attractive  or,  on  the  contrary,  why  it  has 
failed.  An  understanding  of  communism  must  also 
include  an  examination  of  its  future  as  well  as  its 
legacy  in  former  communist  countries  that  are 
now  democracies.  This  entry  addresses  these  top¬ 
ics  by  analyzing  communism  as  a  theory,  a  type  of 
regime,  and  a  political  organization.  As  a  political 
theory,  communism  has  changed  over  the  years 
but  has  always  been  significant  to  some  philoso¬ 
phers  and  activists,  and  it  remains  an  important 
point  of  reference  in  critiques  of  capitalism  and 
current  political  debates.  In  this  respect,  it  is  nec¬ 
essary  to  distinguish  between  the  idea  and  its 
political  realizations. 

Evolution  and  Actualization 
of  the  Idea  of  Communism 

The  idea  of  communism  is  a  very  old  one  that  has 
its  roots  in  the  vision  of  a  society  based  on  the 
absolute  equality  of  human  conditions  and  the 
elimination  of  individual  enrichment.  It  is  present 
in  the  works  of  many  thinkers  from  Plato  through 
the  18th-century  Utopians,  the  Soviet  analysts,  and 
some  contemporary  theoreticians.  The  terms  com¬ 
munism  and  communist  were  first  used  in  France 


during  the  11th  century  to  designate  the  common 
practices,  interests,  and  rights  of  some  peasants. 
During  the  French  Revolution,  some  authors  used 
the  word  communism  in  its  more  modern  sense  of 
a  general  sharing  of  goods  in  a  regime  established 
by  a  revolutionary  process.  Others,  for  example, 
Robert  Owen,  Etienne  Cabet,  Wilhem  Weitling, 
and  Moses  Hess,  used  the  term  to  designate  uto¬ 
pian  projects  of  societies  based  on  a  new  system  of 
exchanges  and  distribution.  But  it  was  around  the 
mid- 19th  century  that  the  idea  of  communism 
became  more  widespread.  The  creation  of  the 
Communist  League  by  Karl  Marx  and  Friedrich 
Engels  in  1847  and  the  publication  of  their 
Communist  Manifesto  in  1848  marked  a  turning 
point  in  the  evolution  of  the  communist  idea  as 
embodied  in  a  political  structure  and  metamor¬ 
phosed  into  a  coherent  doctrine  that,  despite 
Marx’s  protests,  quickly  took  on  a  religious  and 
dogmatic  dimension.  The  word  communism  came 
to  refer  to  a  general  project,  a  political  action,  and 
a  political  ideology  presented  as  a  real  and  power¬ 
ful  science,  different  from  reformism.  For  Marx, 
communism  meant  not  only  social  equality  and  the 
end  of  private  property  but  also  the  necessity  of  a 
revolution  based  on  class  struggle  directed  by  an 
organized  proletariat  that  would  completely 
destroy  capitalism  and  the  bourgeoisie,  establish¬ 
ing  the  dictatorship  of  the  proletariat.  After  this 
first  step,  it  would  be  possible  to  establish  a  per¬ 
fected  new  order  with  collective  ownership,  a  pro¬ 
gressive  disappearance  of  the  state,  and  the  disso¬ 
lution  of  social  classes. 

Prior  to  1917,  there  was  confusion  within  the 
international  labor  movement  concerning  the  rela¬ 
tionship  between  socialism  and  communism.  After 
the  Russian  Revolution  of  1917,  Lenin  gave  the 
name  Communist  to  the  Bolshevik  party,  and  with 
the  creation  of  the  Third  International  in  1919,  the 
distinction  between  the  Communist  Party  and 
social-democratic  and  socialist  parties  was 
strengthened.  Lenin  and  his  comrades  set  them¬ 
selves  up  as  Marx’s  only  heirs,  delegitimizing  the 
numerous  other  interpretations  of  Marxism. 

Marxism-Leninism  became  the  main  and  stron¬ 
gest  codification  of  the  doctrine  of  communism 
during  the  20th  century,  with  the  Soviet  Union 
presented  as  a  model.  Thus,  the  Soviet  Union  can 
be  considered  the  first  incarnation  of  communism; 
however,  after  World  War  II,  other  nations,  such 
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as  Mao  Zedong’s  China,  Kim  Il-Sung’s  North 
Korea,  and  Fidel  Castro’s  Cuba,  created  their  own 
versions  of  communist  regimes. 

Communism  as  a  Type  of  Regime 

Because  of  the  existence  of  these  countries  who 
claimed  their  affiliation  to  communism  and 
Marxism,  communism  also  designates  a  type  of 
regime.  Usually,  they  show  important  specificities 
in  terms  of  how  they  implemented  the  communist 
program  and  in  their  respective  historical,  national, 
and  social  environments.  However,  there  were 
also  commonalities;  in  particular,  the  establish¬ 
ment  of  a  communist  regime  was  in  each  case  fol¬ 
lowed  by  an  intense  period  of  violence  and  terror 
marked  by  mass  crimes,  genocides,  trials,  deporta¬ 
tion,  and  the  creation  of  detention  and  concentra¬ 
tion  camps  for  all  political,  cultural,  and  religious 
“enemies”  as  well  as  bloody  purges  of  opponents 
inside  the  party.  The  prime  example  was  the 
Soviet  Union  under  Lenin  and  then  under  Stalin, 
until  the  latter’s  death  in  1953.  One  of  the  grim¬ 
mest  examples  of  communist  terror  was  in 
Kampuchea  (Cambodia),  where  from  1975  to 
1978,  the  communist  Khmer  Rouge  was  respon¬ 
sible  for  the  death  of  an  estimated  1  million  to  1.5 
million  people. 

Often,  a  period  of  acute  repression  in  a  com¬ 
munist  regime  was  followed  by  a  relaxation  of  the 
repression;  this  was  the  case  in  Russia  and  in  East 
and  Central  Europe  after  Stalin’s  death  in  1953 
and  Khrushchev’s  de-Stalinization  in  1956. 
However,  during  this  period,  there  remained  a 
restriction  of  liberties  and  a  general  fear  based  in 
part  on  memories  of  past  repression.  Other  com¬ 
mon  features  are  the  following: 

•  the  establishment  of  one-party  rule  or  a 
hegemonic  and  preeminent  communist  party, 
with  some  insignificant  allies  who  keep  alive  the 
fiction  of  plurality,  as  in  some  of  the  Eastern  and 
Central  European  countries  after  1945; 

•  the  monopoly  of  all  activities,  including  political, 
economic,  social,  intellectual,  and  cultural 
activities,  and  of  the  media,  who  were  used  for 
intensive  propaganda; 

•  an  emphasis  on  the  importance  of  ideology  for 
the  party  and  for  the  whole  population; 

•  the  rigorous  control  and  surveillance  of  society; 


•  the  endeavor  to  create  a  “new  man,”  who  was 
ideologically  convinced,  completely  devoted  to 
the  party,  and  willing  to  sacrifice  his  life; 

•  the  centralization  of  all  powers,  especially  in 
countries  with  great  cultural,  ethnic,  religious,  or 
geographical  diversity;  and 

•  the  preeminent  affirmation  of  the  leadership, 
which  may  lead  to  an  organized  cult  as,  for 
instance,  in  the  cases  of  Stalin,  Mao,  Kim  11 
Sung,  Ho  Chi  Minh,  or  Castro. 

During  the  years  1920  to  1930  and  after  World 
War  II,  these  features  sparked  polemics  about  the 
nature  of  communist  regimes  and  their  classifica¬ 
tion:  What  were  they  exactly?  Were  they  a  new 
version  of  classical  dictatorship  or  a  variety  of 
authoritarian  systems,  or  did  they  belong  to  totali¬ 
tarianism?  The  debate  about  this  last  notion  was 
always  fierce.  Supporters  of  the  totalitarian  con¬ 
cept  insist  that  it  is  as  valid  for  communism  as  for 
fascism  or  Nazism.  Some,  such  as  Hannah  Arendt 
and  Raymond  Aron,  thought  that,  after  1956,  the 
Soviet  Union,  for  instance,  became  a  posttotalitar- 
ian  regime  or  an  authoritarian  one  that  experi¬ 
enced  a  deep  political  and  economic  crisis,  result¬ 
ing  in  a  reduction  of  its  attraction.  Other  special¬ 
ists  emphasized  the  permanence  of  the  essential 
characteristics  of  totalitarianism  from  the  begin¬ 
ning  to  the  end  but  recognized  that  there  was  alter¬ 
nation  between  the  extreme  and  the  more  relaxed 
forms  of  totalitarianism.  In  contrast,  other  schol¬ 
ars  reject  the  pertinence  of  the  totalitarian  concept 
for  communism  and  criticize  the  validity  of  com¬ 
paring  it  with  fascism.  For  them,  totalitarianism  is 
too  wide  and  schematic  a  concept  for  comparison, 
because  it  does  not  take  into  account  the  differ¬ 
ences  between  fascism,  Nazism,  and  communism, 
as  well  as  those  between  communist  countries. 
These  critics  also  argue  that  the  concept  of  totali¬ 
tarianism  does  not  take  into  account  the  spaces  of 
autonomy  that  were  operating  despite  the  weight 
of  oppression.  In  any  case,  communist  regimes  also 
served  as  a  model  for  dictatorships  elsewhere. 

Communism  as  a  Political  Organization 

Communism  also  refers  to  an  important  mutation 
of  political  organization.  Marx  spoke  in  favor  of 
the  importance  and  the  necessity  of  a  communist 
party  but  without  giving  precise  definitions  or 
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recommendations  about  its  structure.  It  was  Lenin 
who  described  more  precisely  the  process  of  build¬ 
ing  such  a  party,  in  his  book  What  Is  to  Be  Done I 
published  in  1902.  Lenin  is  the  inventor  of  the  first 
prototype  of  an  ideological  party,  composed  of  a 
vanguard  of  professional  revolutionaries,  with  a 
military  and  secret  service  component,  publishing 
a  newspaper  conceived  as  a  key  element  of  the 
party’s  activity,  propaganda,  and  ideological  for¬ 
mation.  This  party  was  to  be  ruled  such  that  there 
was  complete  submission  of  the  inferior  echelons 
of  the  party  to  the  superior  ones,  a  strong  and 
centralized  authority  of  the  leadership,  absolute 
discipline  (whereby  the  minority  had  to  obey  the 
majority),  and,  in  democratic  countries,  total  con¬ 
trol  of  the  members  of  parliament  by  the  central 
leadership  of  the  party.  This  famous  invention, 
called  democratic  centralism  by  Lenin,  was  com¬ 
pletely  counter  to  the  classical  concept  of  a  mass 
social-democratic  party.  The  party  also  had  to 
form  specialized  mass  organizations  for  attracting 
a  variety  of  people.  Through  their  propaganda, 
the  ideas  of  communism  could  be  spread  more 
easily,  and  they  could  also  be  used  as  a  channel  of 
recruitment  for  the  party.  Trade  unions  were  con¬ 
sidered  a  decisive  influence  on  the  working  class, 
but  as  a  transmission  agent,  they  had  to  be  subject 
to  the  party’s  orders.  This  specific  communist 
party  was  supposed  to  be  adapted  to  the  condi¬ 
tions  of  political  struggle  under  tsarist  Russia.  But 
after  the  victorious  Soviet  revolution,  Lenin  and 
Trotsky  decided  to  generalize  the  model  and 
impose  this  organization  on  the  new  communist 
parties  all  over  the  world.  They  also  created 
another  important  rupture  with  the  international¬ 
ist  working  class  tradition  by  inventing  a  new 
Communist  International,  with  its  powerful  center 
in  Moscow.  The  Communist  International  was 
conceived  as  a  world  party  with  one  unique  goal: 
to  make  revolution  on  a  global  scale.  In  reality, 
with  Stalin’s  leadership  from  the  end  of  the  1920s, 
the  Communist  International  had  to  defend  the 
Soviet  Union  against  the  supposed  attacks  of 
“imperialism.” 

The  party  was  completely  Stalinized  and  became 
monolithic.  In  other  countries,  the  communist 
party  tried  to  transform  itself  into  a  mass  party 
open  to  different  social  categories,  for  example 
intellectuals,  but  with  a  prominence  of  working- 
class  members  at  all  levels.  In  some  countries,  and 


especially  in  China  under  Mao’s  authority,  priority 
was  given  to  the  peasants.  The  military  had  to 
consist  of  disciplined  members,  courageous  and 
always  ready  to  defend  the  party’s  positions  and,  if 
necessary,  to  sacrifice  their  lives,  as  was  the  case, 
for  instance,  during  World  War  II  in  the  Resistance, 
during  some  civil  wars  (in  Spain  1936-1939,  in 
Greece  1946-1949),  and  in  the  national  liberation 
movement  in  Vietnam.  The  party  was  attractive 
because  it  offered  the  only  possibility  for  political, 
social,  or  symbolic  promotion.  In  the  course  of 
time,  the  ideology  was  no  longer  an  element  of 
faith  but  just  empty  rhetoric,  with  its  sole  aim 
being  to  legitimize  those  who  were  in  power.  The 
party  was  a  party-state  where  the  state  was  only  an 
appearance;  the  reality  of  power,  leadership,  and 
decisions  was  under  its  control.  At  the  interna¬ 
tional  level,  the  communist  parties  had  been  orga¬ 
nized  from  1919  into  the  Communist  International, 
the  so-called  Komintern,  which  was  self-dissolved 
in  1943.  This  organization  was  also  Stalinized 
from  the  end  of  the  1920s  and  had  to  respect  the 
orders  from  Moscow.  The  leadership  of  the 
Communist  International,  directly  under  Stalin’s 
authority,  decided  the  strategy  of  each  party,  con¬ 
trolled  the  internal  life  of  the  party,  and  selected  its 
leaders.  After  World  War  II,  the  Kominform  was 
created  as  a  successor  organization.  The  Soviet 
party,  with  the  support  of  the  East  European  par¬ 
ties,  tried  to  maintain  the  unity  of  world  commu¬ 
nism  and  the  centrality  of  the  Soviet  Union  by 
organizing  international  communist  conferences, 
organizing  formal  meetings  among  parties,  and 
providing  material  (even  financial)  support.  The 
Soviet  party  tried  to  structure  this  world  commu¬ 
nist  system  to  achieve  its  expansion  over  all  conti¬ 
nents.  But  it  was  confronted  with  a  trend  toward 
autonomy  and,  sometimes,  a  defense  of  national 
interests  (e.g.,  with  the  Romanians  and  the  Polish). 
Some  parties  decided  to  oppose  Soviet  primacy 
and  to  split  communist  unity,  such  as  the  Yugoslavs 
in  1948  and  the  Chinese  and  Albanian  Communists 
in  the  1960s.  In  the  West,  communist  Italians  after 
1956,  the  French  in  the  1970s,  and  then  the 
Spanish  soon  after  tried  hard  to  invent  a  form  of 
Eurocommunism  that  emphasized  the  importance 
of  democracy  and  personal  freedoms  and  rejected 
dictatorship.  However,  they  always  defended 
Soviet  superiority  when  confronted  with  imperial¬ 
ism  and  capitalism. 
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Communism  as  an  Ideal 

The  theoretical,  political,  and  sociological  inter¬ 
pretation  of  communism  has  always  divided  the 
scientific  community.  Communism  has  been  inter¬ 
preted  as  a  contemporary  form  of  utopia  that 
would  fulfill  the  classical  quest  for  happiness  in  a 
perfect  and  pacified  world  and  give  expression  to 
the  “passion  of  equality”  that  Alexis  de  Tocqueville 
saw  as  inseparable  from  democratization.  Many 
thinkers  have  argued  that  although  communist 
regimes  vigorously  repressed  religion,  communism 
itself  took  the  form  of  a  secular  or  political  reli¬ 
gion.  In  Europe,  communism  had  more  success  in 
Catholic  and  Orthodox  countries  and  in  some 
Jewish  communities  than  among  Protestants. 
Where  Islam  was  the  dominant  religion,  commu¬ 
nism  generally  failed  to  take  hold;  however,  it 
prospered  in  Buddhist,  Taoist,  and  Confucian 
countries.  Thus,  communism  could  be  seen  as  a 
secular  substitute  for  traditional  religions  and  its 
establishment,  the  result  of  a  transfer  of  faith  from 
classical  religions  to  this  new  one. 

Communism  was  embraced  by  many  different 
sections  of  the  population,  from  the  uneducated  to 
intellectuals,  who  found  in  its  beliefs  a  basis  for 
political  involvement.  Even  though  it  was  closely 
associated  with  the  protection  of  Soviet  interests, 
communism  also  may  be  seen  to  involve  a  commit¬ 
ment  to  internationalism  as  the  foundation  of 
world  fraternity.  However,  communism  could  also 
strengthen  nationalistic  values,  as  it  did  in  some 
countries  that  were  fighting  for  their  indepen¬ 
dence,  such  as  Vietnam  or  Cambodia.  Such  ambiv¬ 
alence  existed  on  other  topics  as  well.  Communism 
succeeded  as  a  movement  determined  to  practice 
violence  against  dictatorships  but  also  within 
democracies  (as  the  communist  parties  did  from 
the  1920s  to  the  1940s);  yet,  at  the  same  time, 
especially  in  the  West,  communism  presented  itself 
as  a  peaceful  movement  allied  with  Catholic  and 
Protestant  associations  and  independent  personali¬ 
ties  who  were  not  properly  affiliated  to  a  party  but 
were  strongly  committed  to  leftist  values. 
Obviously,  in  theory,  communism  was  a  child  of 
the  Age  of  Enlightenment,  with  its  ideals  of  equal¬ 
ity  and  its  aspiration  to  emancipation  and  democ¬ 
racy.  But  in  practice,  although  there  were  differ¬ 
ences  between  communism  and  totalitarian  move¬ 
ments  such  as  fascism  and  Nazism,  communism 
translated  to  dictatorship.  In  Western  countries, 


communist  parties  criticized  the  “bourgeois”  and 
“formal”  democracies  in  the  name  of  a  future  and 
perfect  democracy.  After  World  War  II  and  their 
active  participation  in  antifascist  resistance  or  anti¬ 
colonialist  movements,  communist  parties — the 
most  powerful  of  which  were  in  Western  Europe — 
gradually  accepted  democratic  principles  and  made 
some  changes  in  their  own  organizations. 

But  the  main  question  is  whether  communism 
represented  an  archaic  political  phenomenon  or  a 
modern  one.  Communism,  as  an  affirmation  of  the 
values  of  a  classless  society,  can  be  seen  as  archaic 
because  of  its  rejection  of  capitalism,  the  market 
economy,  individualism,  and  liberal  and  represen¬ 
tative  democracy.  Communism  could  also  be  pre¬ 
sented  as  a  specific  authoritarian  form  of  economic 
and  social  modernization  developed  by  a  ruling 
communist  party — a  modernization  that  was  a 
complete  disaster  everywhere.  However,  in  demo¬ 
cratic  countries  where  strong  communist  parties 
existed,  as  in  France  or  Italy,  their  presence  and  the 
protection  they  gave  to  the  poorest  people  could  be 
seen  as  providing  a  foundation  for  the  modern 
welfare  state. 

Communism  and  Today’s  World 

Communist  regimes  no  longer  exist  in  Russia  and 
in  Eastern  Europe.  Efforts  to  reform  communism 
in  Hungary,  Poland,  Czechoslovakia,  and  the 
Soviet  Union  under  Gorbachev  failed.  Western 
European  communist  parties  are  in  decline  or 
have  changed  their  names  and  identities. 
Communism  as  a  whole  has  lost  its  momentum. 
But  it  is  still  alive  in  some  countries  as  a  totalitar¬ 
ian  regime,  as  in  North  Korea,  or  in  the  form  of 
coexistence  between  a  market  economy  and  the 
domination  of  a  single  party,  as  in  China. 
Communism  has  left  an  important  legacy. 
Communist  domination  did  leave  its  mark  on  wel¬ 
fare  in  countries  that  have  undergone  a  transition 
to  democracy,  which  explains  the  nostalgia  for 
equality  and  social  protection.  In  some  countries, 
Communists  have  maintained  a  good  reputation 
among  the  public  because  the  collective  memory 
associates  them  with  the  social  class  struggles  of 
the  past  and  with  antifascism.  Communist  doc¬ 
trine  itself  is  in  a  deep  crisis,  but  some  authors  find 
an  actuality  in  Marxism.  Some  elements  of  the 
communist  culture  are  still  alive  in  the  form  of 
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anticapitalism,  aspiration  to  utopian  ideals,  the 
search  for  a  radical  alternative,  contestation  of  the 
reality  of  democracy,  and  hostility  to  reformism. 
In  other  words,  communism  is  perhaps  finished  as 
a  political  centralized  and  authoritarian  organized 
movement  but  remains  alive  and  present  as  an 
aspiration. 

Marc  Lazar 
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Paris,  France 
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Communist  Parties 


Communist  parties  are  radical  Leftist  political 
organizations  that  aim  to  facilitate  either  a  coercive 


or  peaceful  transition  of  society  from  capitalism  to 
communism.  Such  parties  initially  emerged  as  fac¬ 
tions  within  many  European  socialist  and  social- 
democratic  parties  at  the  end  of  the  19th  century. 
Communist  parties  claim  to  represent  the  interests 
and  values  of  the  working  class.  The  seizure  of 
power  by  the  Russian  Social  Democratic  Labour 
Party  (Bolsheviks)  in  a  coup  d’etat  in  Tsarist 
Russia  in  November  1917  was  a  signal  for  the 
emergence  of  communist  parties  nearly  all  over 
the  globe.  After  World  War  II,  communist  parties 
seized  power  in  Eastern  Europe,  China,  and  some 
other  Asian  countries.  A  rapid  decline  of  commu¬ 
nist  parties  worldwide  began  after  Mikhail 
Gorbachev  as  General  Secretary  of  the  Communist 
Party  of  the  Soviet  Union  (CPSU)  initiated  reforms 
of  the  Soviet  regime  in  1985.  This  entry  discusses 
their  ideology,  origins,  organization,  and  world¬ 
wide  impact. 

Ideology 

The  term  communist  was  used  for  the  first  time  by 
the  utopian  socialist  Goodwyn  Barmby,  who 
introduced  the  word  into  the  English  language  as  a 
translation  of  the  French  word  commnniste  and 
founded  the  London  Communist  Propaganda 
Society  in  1841.  An  essential  distinctive  character¬ 
istic  of  communist  parties  was  the  importance 
accorded  to  ideology.  In  communist  parties,  ideol¬ 
ogy  occupies  a  much  more  fundamental  place  than 
in  other  parties  as  a  primary  concern  of  the  party 
is  to  indoctrinate  the  rank  and  file  with  commu¬ 
nism  or  Marxism. 

The  doctrine  of  communism  refers  to  the  theory 
of  society  developed  by  Karl  Marx  and  Friedrich 
Engels  in  the  1840s.  In  the  mid-1840s,  Marx  and 
Engels  became  engaged  in  politics  by  joining  the 
League  of  the  Just,  a  secret  society  consisting 
mainly  of  emigre  German  artisans.  In  February 

1846,  Marx  and  Engels  set  up  the  Communist 
Correspondence  Committee  with  the  aim  of 
reclaiming  control  of  the  League  of  the  Just.  In 
June  1847,  the  League  was  transformed  from  a 
secret  society  into  an  open  revolutionary  organiza¬ 
tion — the  Communist  League.  The  second  con¬ 
gress  of  the  Communist  League,  in  December 

1847,  asked  Marx  and  Engels  to  draw  up  a  mani¬ 
festo  stating  its  principles.  In  February  1848,  Marx 
and  Engels  published  the  Manifesto  of  the 
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Communist  Party ,  where  they  formulated  the 
basic  principles  of  communism.  The  major  author 
of  Manifest  der  Kommunistischen  Partei  was  Karl 
Marx;  it  was  originally  published  in  German  in 
London. 

A  cornerstone  of  Marxist  theory  was  the  eco¬ 
nomic  (often  called  the  materialistic)  interpretation 
of  history  and  theory  of  the  social  revolution. 
According  to  this  doctrine,  human  society  is  devel¬ 
oping  through  a  series  of  historical  stages  based  on 
changing  modes  of  production,  which  are  the  fun¬ 
damental  determinants  of  social  structures  and 
social  relations.  As  seen  by  Marx  and  Engels,  the 
history  of  human  society  is  the  history  of  class 
struggles.  According  to  Marxist  doctrine,  human 
society  has  passed  through  the  successive  phases  of 
a  primitive  society,  slavery,  feudalism,  and  capital¬ 
ism  and  must  advance  to  communism.  The  class 
struggle  under  capitalism  is  between  those  who 
own  the  means  of  production — the  bourgeoisie — 
and  those  who  labor  for  a  wage — the  working 
class.  Communism  means  an  abolition  of  private 
property  and  the  creation  of  a  classless  society  in 
which  productive  wealth  is  owned  collectively. 

Origins 

Several  predecessors  of  modern  communist  parties 
existed,  but  the  major  roots  of  revolutionary 
movements  with  a  clear  communist  leaning  and  a 
direct  involvement  of  the  father  of  the  communist 
doctrine,  that  is,  Marx,  go  back  to  the  International 
Working  Men’s  Association,  or  the  First  Inter¬ 
national.  The  First  International,  an  organization 
of  workers  that  represented  an  alliance  of  people 
from  diverse  groups,  such  as  the  French  Proud- 
honists,  Blanquists,  the  English  Owenites,  the 
Italian  Republicans,  proponents  of  individualistic 
anarchism,  and  other  socialists  of  various  persua¬ 
sions,  was  founded  in  1864.  Under  the  influence  of 
Marx,  the  Social  Democratic  Workers’  Party  of 
Germany,  the  strongest  member  of  the  First 
International,  was  founded  in  1869.  In  1872,  the 
First  International  split  between  Marxist  and  anar¬ 
chist  currents,  and  the  organization  was  disbanded 
4  years  later  at  its  1876  Philadelphia  conference. 

In  July  1889,  European  socialist  and  labor  par¬ 
ties  from  20  countries  established  the  Second 
International,  excluding  the  powerful  anarcho- 
syndicalist  movement.  One  of  the  most  prominent 


Russian  communist  leaders — Vladimir  Ilyich 
Lenin — was  a  member  of  the  Second  International 
from  1905.  The  Second  International  dissolved 
during  World  War  I,  in  1916,  as  the  national  par¬ 
ties  that  composed  it  did  not  agree  on  a  unified 
position  against  the  war. 

Marx  and  Engels  argued  in  the  Manifesto  of  the 
Communist  Party  that  among  the  immediate  goals 
of  Communists  were  the  “formation  of  the  prole¬ 
tariat  into  a  class,  the  overthrow  of  the  bourgeois 
supremacy,  and  the  conquest  of  political  power  by 
the  proletariat.”  The  ultimate  aim  of  communism 
was  the  replacement  of  the  old  bourgeoisie  by  a 
classless  communist  society. 

Until  the  collapse  of  the  Second  International, 
Marxists  worked  within  existing  social-democratic 
parties.  Marx  claimed  in  the  Manifesto  of  the 
Communist  Party  that  “the  Communists  do  not 
form  a  separate  party  opposed  to  the  other  work¬ 
ing-class  parties.”  However,  in  his  words, 
Communists  differ  from  other  working-class  par¬ 
ties  in  that  they  “bring  to  the  front  the  common 
interests  of  the  entire  proletariat,  independently  of 
all  nationality,”  and  “they  always  and  everywhere 
represent  the  interests  of  the  movement  as  a 
whole.” 

In  1902,  Lenin  wrote  a  political  pamphlet — 
What  Is  to  Be  Done f — where  he  developed  a  con¬ 
cept  of  a  “party  of  a  new  type.”  Here,  Lenin  for¬ 
mulated  the  basic  elements  of  the  concept  of  a 
revolutionary  party: 

a.  a  centralized  and  disciplined  (cadre)  party  of 
professional  revolutionaries, 

b.  a  party  organized  as  a  bureaucratic  or 
semimilitary  hierarchy,  and 

c.  a  party  able  to  communicate  socialist  ideology 
to  the  more  intellectually  developed  workers, 
who  could  act  as  the  vanguard  of  the  working 
class  and  be  capable  of  overthrowing  the  regime 
of  the  bourgeoisie  and  establishing  the 
dictatorship  of  the  proletariat. 

This  “Leninist”  type  of  party,  as  an  authoritarian 
structure,  was  to  become  the  highest  authority  of 
the  class-based  organization,  which  would  lead  all 
other  organizations  of  the  working  class  and  soci¬ 
ety  as  a  whole.  For  many  decades,  the  organiza¬ 
tion,  strategy,  and  tactics  of  the  Leninist  party 
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were  interpreted  as  providing  the  impetus  for  the 
establishment  of  the  totalitarian  regime  in  the 
Soviet  Union. 

In  1990,  however,  the  American  socialist  Hal 
Draper  published  an  article  “The  Myth  of  Lenin’s 
‘Concept  of  The  Party’  or  What  They  Did  to  What 
Is  to  Be  Done?”  where  he  argued  that  Lenin’s  con¬ 
cept  of  the  working-class  party  was  highly  misin¬ 
terpreted.  First  of  all,  Lenin  did  not  speak  about 
intellectuals  as  the  carriers  of  socialist  ideas  in  the 
political  organizations  of  the  working  class. 
Second,  Lenin’s  demand  of  centralism  did  not 
mean  a  “democratic  centralism” — that  is,  a  semi¬ 
military  hierarchy  within  the  party — but  rather  the 
aim  to  establish  an  organized  all-Russian  party  for 
the  first  time.  The  aim  of  having  a  class-based 
party  was  just  a  reaction  to  the  fact  that  through¬ 
out  history,  the  socialist  movement  was  mostly 
organized  in  sects.  Third,  Lenin  did  not  propose  a 
revolutionary  sect,  but  instead,  he  aimed  “to  orga¬ 
nize  the  revolutionary  current  as  a  political  centre 
of  some  sort  inside  the  mass  party.”  John  P. 
Plamenatz  (1954),  a  political  philosopher,  argued 
long  before  Draper  that  in  1902,  Lenin’s  ideas  of  a 
revolutionary  party  were  not  “undemocratic,  but 
merely  advice  adapted  to  the  needs  of  a  revolution¬ 
ary  party  active  in  Russia  in  the  first  decade  of  the 
twentieth  century”  (p.  225),  which,  however,  took 
an  antidemocratic  turn  in  “what  happened  after 
the  Bolshevik  Revolution.” 

In  1918,  the  Russian  Social  Democratic  Labour 
Party  (Bolsheviks)  was  renamed  the  Russian 
Communist  Party  (Bolsheviks),  and  from  1920 
until  1991,  it  was  the  only  legal  political  party  in 
the  Soviet  Union.  In  1925,  the  name  of  the  party 
was  changed  once  again,  and  it  became  the  All- 
Union  Communist  Party  (Bolsheviks)  and  eventu¬ 
ally  was  transformed  into  the  CPSU  in  1952.  After 
the  attempted  coup  d’etat  in  August  1991,  the 
CPSU  was  dissolved  by  Russia’s  President  Boris 
Yeltsin. 

Organization 

As  the  first  communist  parties  were  splinter  groups 
of  existing  social-democratic  and/or  socialist  par¬ 
ties,  initially,  they  inherited  the  traditional  organi¬ 
zation  of  mass  parties.  After  the  October  Revolution 
of  1917,  the  Russian  Communist  Party  gradually 
centralized  political  power  and  developed  a  strict 


cadre  policy.  A  high  degree  of  centralization  is  a 
distinctive  characteristic  of  communist  parties.  All 
mass-based  parties  tend  to  be  centralized,  but 
within  communist  parties,  it  was  significantly 
more  important  than  in  other  mass-based  parties. 

All  political  factions  and  dissent  were  forbidden 
within  the  Russian  Communist  Party  in  1921,  and 
afterward,  all  party  members  had  to  adhere  to  the 
decisions  that  had  been  made  by  the  central  bod¬ 
ies.  The  internal  party  organization  was  based  on 
the  principle  of  democratic  centralism,  where 
“democracy”  meant  the  freedom  of  party  mem¬ 
bers  to  discuss  matters  of  policy  and  to  pass  deci¬ 
sions  by  majority  vote,  and  “centralism”  was 
understood  as  a  duty  of  party  members  to  uphold 
decisions  taken  by  the  majority.  In  theory,  demo¬ 
cratic  centralism  also  included  elections  to  all 
party  institutions  from  the  bottom  to  the  top, 
accountability  of  party  structures  and  leaders  to 
the  rank  and  file,  strict  party  discipline  and  man¬ 
datory  decisions  of  the  upper  structures  for  the 
lower  structures,  and  personal  responsibility  of 
party  members  for  the  assignments  given  to  them 
by  the  party.  In  reality,  democratic  procedures 
were  an  empty  formality  as  leadership  at  all  levels 
of  the  party  organization  prohibited  criticism  and 
dissent  in  decisions  and  actions. 

The  Russian  Communist  Party  also  introduced 
a  nomenklatura  system  through  which  party  elites 
were  appointed  to  key  positions  in  the  administra¬ 
tive,  economic,  military,  cultural,  and  educational 
bureaucracies  as  well  as  throughout  the  party’s 
own  hierarchy.  Lenin  himself  formulated  the  first 
criteria  for  nomenklatura  appointments  as  fol¬ 
lows:  reliability,  political  loyalty,  and  administra¬ 
tive  ability.  Joseph  Stalin  finished  the  building  of 
the  party’s  patronage  system  and  used  it  to  distrib¬ 
ute  his  confidants  throughout  the  party  bureau¬ 
cracy.  Under  Stalin’s  direction  in  1922,  the  party 
created  departments  of  the  Central  Committee  and 
other  organs  at  lower  levels  that  were  responsible 
for  the  appointment  of  party  officials.  After  Leonid 
Brezhnev’s  accession  to  power  in  October  1964, 
the  party  considerably  expanded  its  appointment 
authority.  At  the  all-union  level,  the  Party  Building 
and  Cadre  Work  Department  supervised  nomen¬ 
klatura  appointments.  This  department  main¬ 
tained  records  on  party  members  throughout  the 
former  USSR,  made  appointments  to  positions  on 
the  all-union  level,  and  approved  nomenklatura 
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appointments  to  the  lower  levels  of  the  hierarchy. 
The  head  of  this  department  was  sometimes  a 
member  of  the  party  Secretariat  and  was  often  a 
protege  of  the  General  Secretary  of  the  CPSU. 

The  communist  parties  developed  a  new  struc¬ 
tural  organization.  While  other  mass-based  parties 
focused  their  organization  and  drew  their  support 
from  a  particular  geographical  or  territorial  area, 
the  communist  groups  formed  their  cells  in  the 
workplace.  Party  members  were  employed  in  the 
same  enterprise  or  professional  institution,  above 
which  were  territorial  district,  city,  regional,  and 
national  party  committees.  The  workplace  cell 
organization  proved  to  be  effective.  These  work¬ 
place  cells  were  more  numerous  than  territorial 
local  branches  in  other  mass-based  parties’  sec¬ 
tions.  Also,  this  system  led  each  cell  to  concern 
itself  with  problems  of  a  corporate  and  profes¬ 
sional  nature  rather  than  with  more  political  issues. 
Last  but  not  least,  this  organizational  system 
allowed  party  leaders  to  have  extensive  authority 
over  primary  party  cells.  At  the  end  of  the  1980s, 
the  CPSU  had  around  20  million  members  orga¬ 
nized  into  about  400,000  primary  party  cells. 

At  the  top  was  the  party  congress,  which  met 
only  every  few  years,  where  strictly  selected  dele¬ 
gates  elected  the  members  of  the  republican  and 
the  all-union  party  central  committees.  The 
Politburo  was  the  highest  political  and  executive 
institution  of  the  CPSU.  It  was  created  on  the  eve 
of  the  Bolshevik  Revolution  in  1917,  but  it 
became  fully  functional  only  in  1919  and  soon 
overshadowed  the  Central  Committee  in  power. 
Nominally,  the  Politburo  was  elected  by  the 
Central  Committee  to  direct  the  party’s  policy 
between  the  Central  Committee’s  plenary  ses¬ 
sions.  In  fact,  the  Politburo  was  the  most  impor¬ 
tant  decision-making  institution  in  the  Communist 
Party  and  has  commonly  been  seen  as  a  rough 
equivalent  to  the  cabinet  in  Western  political  sys¬ 
tems.  The  size  of  the  Politburo  of  the  CPSU  var¬ 
ied,  and  from  1952  until  1966,  it  was  called  the 
Presidium  of  the  Central  Committee. 

In  addition,  the  Communist  Party  was  sur¬ 
rounded  by  various  satellite  mass  organizations.  In 
the  Soviet  Union,  among  the  most  important  were 
the  Communist  Youth  Union  (Komsomol),  from 
which  nearly  75  %  of  party  members  were  recruited, 
and  the  All-Union  Central  Council  of  Trade 
Unions,  with  more  than  100  million  members.  The 


Communist  Party  controlled  all  facets  of  eco¬ 
nomic,  political,  military,  and  cultural  life,  and 
Article  6  of  the  1977  Soviet  Constitution  stated 
that  “the  leading  and  guiding  force  of  the  Soviet 
society  and  the  nucleus  of  its  political  system,  of  all 
state  organizations  and  public  organizations,  is  the 
Communist  Party  of  the  Soviet  Union.  ” 

Communist  Parties  Worldwide 

After  the  establishment  of  the  Communist 
International  (Comintern)  in  1919  by  representa¬ 
tives  of  34  parties,  most  emerging  communist  par¬ 
ties  worldwide  were  patterned  along  the  model  of 
the  organization  of  the  Russian  Communists.  In 
1920,  the  Second  Congress  of  the  Communist 
International  formally  adopted  the  Twenty-One 
Conditions — officially  the  Conditions  of  Admission 
to  the  Communist  International — as  prerequisites 
for  any  political  group  wanting  to  become  affili¬ 
ated  to  the  Communist  International.  These  condi¬ 
tions  encouraged  the  buildup  of  communist  orga¬ 
nizations  along  democratic  centralist  lines. 

Many  European  Socialist  parties  went  through 
splits  based  on  whether  they  adhered  to  the 
Communist  International.  The  French  Section  of 
the  Workers  International  (SFIO)  broke  away  with 
the  1920  Tours  Congress,  leading  to  the  creation 
of  the  new  French  Communist  Party.  The 
Communist  Party  of  Spain  was  created  in  1920  on 
the  basis  of  the  Federation  of  Socialist  Youth;  the 
Communist  Party  of  Italy  was  created  in  1921  fol¬ 
lowing  a  split  in  the  Italian  Socialist  Party;  the 
Belgian  Communist  Party  emerged  in  September 
1921  following  a  split  from  the  Belgian  Workers 
Party,  and  so  on.  Membership  in  the  Communist 
International  in  itself  formalized  the  split  between 
the  Communists  and  moderate  social  democracy. 

In  the  interwar  period,  the  Communist  Party  of 
Germany  was  not  only  a  major  party  in  Germany 
but  was  also  the  strongest  communist  party  in 
Europe  outside  of  the  Soviet  Union.  In  the  four 
Reichstag  elections  of  1930-1932,  the  Communist 
Party  polled  between  5  and  6  million  votes.  The 
party  was  then  suppressed  by  the  Nazi  regime.  A 
considerable  number  of  senior  German  communist 
leaders  in  exile  were  also  executed  during  Stalin’s 
Great  Purge  of  1937-1938,  and  more  than  a  thou¬ 
sand  were  handed  over  to  Nazi  Germany  by 
Stalin’s  regime.  The  Soviet-dominated  communist 
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parties  and  the  Comintern  in  the  interwar  period 
were  the  main  instruments  to  export  the  Bolsheviks 
Revolution.  By  the  end  of  the  1920s,  the  revolu¬ 
tionary  upsurge  in  Europe  was  over,  and  commu¬ 
nist  parties  were  established  in  most  countries,  but 
in  most  cases,  the  Communists  did  not  play  a  lead¬ 
ing  role  in  the  labor  movement. 

After  World  War  II,  the  Soviet  victory  over  fas¬ 
cism  contributed  to  a  surge  of  popularity  of  com¬ 
munist  parties,  even  in  Western  Europe,  which 
were  originally  established  in  1918  to  1923. 
Communist  parties  were  stronger  in  the  less  devel¬ 
oped  countries  of  Southern  Europe,  particularly  in 
Greece  and  Italy  but  also  in  France.  However,  their 
influence  significantly  decreased  from  the  1980s 
onward. 

The  Italian  Communist  Party  was  established  in 
1921  through  a  split  within  the  Italian  Socialist 
Party.  It  was  outlawed  by  the  Mussolini  regime  but 
eventually  reemerged  as  the  strongest  party  of  the 
Left  in  Italy,  and  at  its  peak,  it  was  the  largest  com¬ 
munist  party  in  Western  Europe,  with  a  member¬ 
ship  averaging  about  1.5  million  from  the  1940s 
until  the  early  1980s.  The  party  was  organized  like 
the  CPSU,  with  local  sections  forming  federations, 
which  in  turn  were  grouped  into  regional  commit¬ 
tees.  Since  the  late  1940s,  the  Italian  Communist 
Party  has  held  power  in  many  municipalities,  and 
since  the  1970s,  it  has  shared  control  of  major 
urban  centers  in  the  country  with  the  Socialist 
party.  However,  much  of  postfascist  politics  in 
Italy  was  organized  around  the  perceived  need  to 
keep  the  Italian  Communist  Party  out  of  central 
government. 

The  French  Communist  Party  was  founded  in 
1920  as  an  antisystem,  leftist,  and  secular  political 
organization  representing  French  workers  and 
became  especially  strong  in  the  industrial  areas  of 
northern  and  eastern  France  and  the  suburbs  of 
Paris,  obtaining  a  high  of  almost  29%  of  the  votes 
in  the  1946  elections.  Although  the  French 
Communist  Party  has  never  won  a  majority  of  vot¬ 
ers,  by  the  late  1970s,  it  was  the  largest  of  all 
French  parties,  with  a  membership  of  some 
700,000,  and  a  few  Communists  served  in  the  left¬ 
ist  cabinets  in  1981  to  1984  and  in  1997.  Beginning 
in  the  1970s,  some  currents  in  the  French 
Communist  Party  joined  with  those  of  Italy  and 
Spain  to  advocate  a  more  liberal  and  pluralistic 
form  of  communism,  that  is,  Eurocommunism. 


Greek  Communists,  similar  to  those  in  France 
and  Italy,  built  their  leading  role  on  the  claim  of 
anti-Nazi  resistance  during  World  War  II.  In  1944, 
Greek  Communists  participated  in  the  national 
unity  government,  holding  a  number  of  important 
portfolios,  including  the  position  of  Minister  of 
Finance.  After  the  Greek  Civil  War  in  1946  to 
1949,  the  Communist  Party  was  banned  until  the 
restoration  of  parliamentary  democracy  in  1974. 
Two  other  Southern  European  communist  parties — 
the  Spanish  and  the  Portuguese — were  politically 
strong  and  at  their  peak  after  the  transition  to 
democracy  in  the  1970s.  They  captured  about  15% 
and  20%,  respectively,  of  the  total  vote  in  national 
elections. 

In  Western  and  Northern  Europe,  with  the 
exception  of  Finland,  communist  parties  had  only 
a  marginal  political  influence.  In  Finland, 
Communists  created  the  Finnish  People’s 
Democratic  League,  which  received  around  20% 
of  the  vote  on  a  regular  basis  till  the  end  of  the 
1970s. 

Eastern  Europe,  with  the  exception  of 
Yugoslavia,  became  a  bastion  of  Soviet-style  com¬ 
munism,  as  communist  parties,  relying  in  most 
cases  on  Soviet  military  presence,  were  able  to  win 
and  consolidate  power  in  1945  to  1948. 
Nonetheless,  all  the  communist  parties  of  Eastern 
Europe,  with  the  exception  of  Albania  (the  party 
was  founded  in  1941),  had  their  origins  in  the 
period  1891  to  1921.  Communist-led  governments 
after  1945  were  formed  in  East  Germany,  Hungary, 
Czechoslovakia,  Bulgaria,  Romania,  and  Poland. 
In  some  cases,  communist  parties  cemented  their 
power  through  a  merger  with  other  socialist  or 
social-democratic  parties,  as  in  the  case  of  the 
Socialist  Unity  Party  of  Germany,  the  Hungarian 
Working  People’s  Party,  the  Romanian  Workers’ 
Party,  and  the  Polish  United  Workers’  Party.  Soon 
after  coming  to  power,  ruling  communist  parties 
abolished  free  elections  and  other  political  free¬ 
doms.  Attempts  to  reform  the  existing  regime  in 
the  Communist-ruled  countries  were  suppressed 
directly  by  the  intervention  of  Soviet-led  military 
forces — as  in  the  German  Democratic  Republic  in 
1953,  Hungary  in  1956,  and  Czechoslovakia  in 
1968 — justified  by  the  Brezhnev  Doctrine  of  the 
limited  sovereignty  of  socialist  countries  or  by  the 
repression  of  the  national  communist  government, 
as  in  Poland  in  1981  to  1983. 
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However,  the  CPSU  was  not  able  to  maintain 
unity  and  control  of  the  international  communist 
movement  and  communist  regimes.  As  early  as 
1948,  the  Yugoslav  Communists  removed  them¬ 
selves  from  Soviet  subordination.  Yugoslavia  and 
Albania  were  liberated  by  communist  partisans 
(led  by  Josip  Broz  Tito)  and  without  Soviet  mili¬ 
tary  aid.  This  was  the  main  reason  to  stay  largely 
independent  of  the  CPSU  hegemony  and  to  advo¬ 
cate  national  communism  as  an  independent  road 
to  socialism.  Albania  remained  allied  with  the 
Soviet  Union  from  1945  to  1960,  but  the  Party  of 
Labour  of  Albania  was  Stalinist  even  after  the 
CPSU  had  turned  away  from  Stalinism  in  the  late 
1950s.  After  1960,  it  broke  with  the  former  USSR 
and  joined  ranks  with  Maoist  China. 

The  supremacy  of  the  CPSU  was  most  chal¬ 
lenged  by  the  Chinese  Communist  Party  (CCP). 
The  party  was  founded  in  1921.  In  1949,  the 
Chinese  Communists  were  able  to  win  power  on 
their  own  and  established  the  People’s  Republic. 
Since  1949,  the  CCP,  initially  led  by  Mao  Zedong, 
has  been  the  country’s  only  legal  party. 
Organizationally,  the  CCP  followed  the  model  of 
the  CPSU,  having  at  the  top  a  Politburo  and 
Central  Committee,  together  with  party  commit¬ 
tees  at  the  provincial,  district,  county,  and  munici¬ 
pal  levels.  Divergence  between  the  Chinese  and 
Soviet  parties  gradually  emerged  from  the  mid- 
1950s  through  ideological  disputes  and 
Khrushchev’s  policy  of  de-Stalinization;  later,  it 
led  to  a  Cold  War-like  conflict  between  the  two 
communist  nations.  At  the  end,  the  CCP  tried  to 
win  over  to  its  side  communist  parties  throughout 
the  world.  This  was  most  successful  in  Southeast 
Asia  and  in  Albania. 

In  some  Latin  American  countries,  notably 
Chile,  El  Salvador,  Brazil,  Mexico,  and  Venezuela, 
mass  communist  parties  exercised  considerable 
political  influence  but  never  held  power  as  a  single 
party.  They  supported,  however,  various  leftist 
coalitions  and  sometimes  guerrilla  activities.  The 
only  ruling  communist  party  in  Central  America  is 
the  Cuban  Communist  Party,  which  came  to 
power  in  1959  through  a  revolutionary  movement 
led  by  Fidel  Castro.  The  Cuban  Communists  were 
heavily  dependent  on  the  former  USSR  for  finan¬ 
cial  support  during  the  1970s  and  1980s.  The 
Cuban  communist  government  was  able  to  survive 
the  collapse  of  communism  in  Eastern  Europe  after 


1989,  and  Fidel  Castro  ranked  among  the  world’s 
longest-serving  heads  of  state. 

In  the  1980s,  more  than  one  fourth  of  the 
world’s  population  lived  under  communist  rule. 
Two  of  the  world’s  most  populous  nations,  China 
and  the  former  USSR,  had  communist  govern¬ 
ments.  Communist  parties  held  power  in  Asia 
(Afghanistan,  Cambodia,  Laos,  Mongolia,  North 
Korea,  and  Vietnam),  many  countries  of  Eastern 
Europe  (Albania,  Bulgaria,  Czechoslovakia,  East 
Germany,  Hungary,  Poland,  Romania,  and  former 
Yugoslavia),  Africa  (Ethiopia),  and  Central 
America  (Cuba). 

The  communist  parties,  despite  the  same  or 
similar  principles  of  organization  and  similar  ideo¬ 
logical  beliefs,  have  never  been  a  fully  uniform  or 
monolithic  phenomenon.  It  is  possible  to  point  out 
the  following  distinct  phases  of  development  of 
communist  parties  worldwide: 

a.  a  period  of  establishment  of  these  parties  as 
autonomous  political  organizations  in  the 
second  and  third  decades  of  the  20th  century; 

b.  Bolshevikization  and  Stalinization  of  the 
communist  parties  in  the  1920s  and  1930s; 

c.  a  challenge  to  Soviet  hegemony  in  the 
communist  movement  of  the  late  1940s  and 
1950s  through  the  ideological  split  between  Tito 
and  Stalin  and  the  rise  of  Maoism; 

d.  the  de-Stalinization  of  the  CPSU  and  most 
Eastern  European  communist  parties  in  the  late 
1950s  and  early  1960s; 

e.  the  rise  of  Eurocommunism,  or  “socialism  with 
a  human  face,”  primarily  in  Southern  Europe 
since  the  1970s;  and 

f.  perestroika  in  the  Soviet  Union  and  the  end  of 
orthodox  communism  in  Eastern  Europe  in  the 
late  1980s  and  early  1990s. 

Altogether,  the  20th  century  could  be  called  the 
“age  of  shine  and  decline”  of  communist  parties 
worldwide. 

Communist  Parties  After  1989 

With  the  collapse  of  communism  in  the  Soviet 
Union  and  Eastern  Europe,  communist  parties 
were  severely  weakened  throughout  the  world. 


320  Communist  Systems 


CPSU  membership  declined  from  about  19.5  mil¬ 
lion  in  1988  to  15  million  in  1991,  and  after  the 
attempted  coup  d’etat  in  August  1991,  it  was 
finally  dissolved.  Responding  to  democratic 
upheavals  in  the  former  USSR,  European  and 
other  communist  parties  redefined  their  ideological 
positions  moving  in  a  direction  of  social  democ¬ 
racy.  Some  of  them  changed  their  names,  and  some 
disappeared  from  the  political  scene  altogether. 

However,  at  the  end  of  the  20th  century,  the 
transformation  of  the  communist  parties  was  not  a 
linear  process.  Unlike  the  communist  organiza¬ 
tions  of  Eastern  Europe,  the  CCP  was  able  to  stem 
the  democratic  protest  in  the  late  1980s  and  intro¬ 
duced  economic  reforms  from  above,  which  led  to 
an  impressive  modernization  of  the  country.  The 
CCP  is  the  largest  communist  party  in  the  world, 
with  an  estimated  membership  of  more  than  66 
million  at  the  beginning  of  the  21st  century. 

In  some  Eastern  European  countries  such  as 
Bulgaria,  Lithuania,  Poland,  and  Romania,  com¬ 
munist  organizations  were  transformed  into  social- 
democratic  parties,  and  these  successor  parties 
have  achieved  significant  electoral  success.  In 
many  post-Soviet  countries,  except  the  Baltic 
States,  communist  parties  survived  the  demise  of 
the  Soviet  Union  as  they  were  reorganized  along 
national  lines.  The  Communist  Party  of  the  Russian 
Federation  was  (re)founded  in  1993.  In  general, 
these  parties  preserved  certain  leftist  ideological 
beliefs  and  regained  reduced  political  influence 
emphasizing  nationalism  instead  of  communist 
ideology  (in  Moldova,  Ukraine,  and  Russia). 

In  recent  years,  in  a  number  of  countries  such  as 
Nepal,  Cyprus,  Brazil,  India,  Venezuela,  South 
Africa,  and  Sri  Lanka,  Communists  have  enjoyed  a 
certain  level  of  electoral  success  and  shared  power 
in  multiparty  national  and  local  governments.  In 
five  countries — China,  Laos,  Vietnam,  Cuba,  and 
North  Korea — communist  parties  remain  in  power. 
However,  three  countries — that  is,  China,  Vietnam, 
and  Laos — have  moved  toward  market  economies 
but  without  major  privatization  of  the  state  sector. 
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Communist  Systems 


Communist  systems  were  historically  shaped  in 
the  20th  century  and  based  on  oppressive  central¬ 
ized  political  power  aiming  at  control  over  a  usu¬ 
ally  nationalized  economy,  culture,  and  society. 
Such  systems  appeal  to  a  communist  ideology 
according  to  which  a  fully  communist  system 
ensures  the  liquidation  of  exploitation  and  equal¬ 
ity  in  a  classless  society.  Because  this  ideology 
remained  an  unfulfilled  utopia,  the  term  commu¬ 
nist  systems  refers  to  existing  systems  based  on  the 
characteristics  mentioned  above:  oppressiveness 
and  striving  for  control  over  a  nationalized  real¬ 
ity.  According  to  communist  ideology,  such  sys¬ 
tems  are  supposed  to  be  only  transitional  forms 
on  the  path  to  full  communism,  while  in  reality 
they  are  the  only  form  of  communism  in  exis¬ 
tence.  Although  this  entry  discusses  communism 
considered  as  a  system  rather  than  an  ideology,  it 
should  be  remembered  that  the  communist  ideol¬ 
ogy  caused  the  deaths  of  tens  of  millions  of  vic¬ 
tims  of  communist  repression.  This  ideology  was 
responsible  for  economic  waste  and  the  stifling  of 
human  energy  that  led  to  the  collapse  of  the  sys¬ 
tem  in  Europe  in  1989  and  the  fall  of  the  Soviet 
Union,  leaving  only  a  few  communist  countries  on 
the  map  of  the  world. 
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Communism  was  a  peculiar  system  in  which 
formal  solutions  generated  their  opposites,  as 
many  researchers  noted  (e.g.,  Alain  Besanqon, 
Elemer  Hankiss,  Robert  Sharlet,  Jadwiga  Stan- 
iszkis).  Centralization  gave  rise  to  spontaneous 
decentralization,  formal  regulation  resulted  in 
informality,  and  ideologically  based  unity  brought 
a  diversity  that  could  not  be  coped  with.  This  is  an 
example  of  the  dynamic  of  this  system.  What  cre¬ 
ated  communism  as  a  system  was  an  ability  to 
generate  and  absorb  these  contradictions  in  a  sin¬ 
gle  whole.  A  system  driven  by  contradictions  was 
integrated  by  the  relations  between  particular 
poles  of  opposition,  with  the  result  that  both 
opposing  poles  were  in  need  of  each  other  and 
were  strengthened.  This  created  communism  as  a 
system  and  allowed  it  to  reproduce  itself.  At  the 
same  time,  however,  these  contradictions  were 
also  a  factor  for  change  and  adaptation  of  the  sys¬ 
tem.  The  informal  economy  and  its  relation  to  the 
official  economy  can  serve  as  an  example  here. 
The  informal  economy  made  it  possible  to  func¬ 
tion  and  somehow  also  changed  the  system  but,  by 
consuming  resources,  also  contributed  to  its  ero¬ 
sion.  These  contradictions  of  the  official  form  of 
communism  were  the  result  of  its  inability  to 
adapt  to  social  needs  and  aspirations,  and  as  a 
result,  the  contradictions  were  tolerated  by  the 
official  system — a  mechanism  of  adaptation 
extending  the  life  of  the  system  and  enabling 
reproductions. 

This  entry  presents  three  basic  areas  of  function 
of  communist  systems:  politics,  the  economy,  and 
society.  Making  a  precise  separation  of  the  three 
areas  is  difficult  because  the  specificity  of  commu¬ 
nist  systems  lay  in  the  powerful  tangle  of  political 
economic  and  social  structures.  At  the  same  time, 
there  was  a  hierarchy:  Politics  was  the  prime  cause 
overriding  the  other  areas. 

From  Dominance  by  Politics 
to  a  Lack  of  Politics 

In  the  institutional  sense,  communism  was  a  sys¬ 
tem  based  on  the  domination  of  politics  over  other 
spheres  of  life,  the  organization  of  life  being  based 
on  principles  of  centralism  and  the  elimination  of 
political  and  social  pluralism.  One  consequence  of 
the  aim  to  eliminate  independent  structures  was 
an  economy  based  on  state  ownership,  but  the 


economy  is  just  one  example  of  the  attempts  to 
subordinate  social  life  to  the  political  authorities. 
The  system  sought  its  justification  in  an  ideology 
that  derived  communism  from  criticism  of  the 
structural  contradictions  of  capitalism  and  prom¬ 
ised  the  creation  of  a  society  free  from  contradic¬ 
tions.  Communism  in  reality  has  sometimes  been 
seen  as  a  system  fulfilling  the  requirements  of  the 
totalitarian  model,  something  rather  doubtful  in 
view  of  the  many  internal  contradictions  men¬ 
tioned  above.  The  totalitarian  model  then  can  be 
seen  as  a  certain  type  of  communist  system. 

An  important  characteristic  of  communism  as  it 
actually  existed  in  the  political  dimension  was  the 
paradoxical  coexistence  of  revolutionary  mecha¬ 
nisms  in  the  power  system  and  the  faqade  of  insti¬ 
tutions  of  parliamentary  democracy  remaining 
from  the  previous  system.  Although  the  commu¬ 
nist  party  was  the  dominant  actor,  the  existence  of 
a  government  and  parliament  provided  a  faqade  of 
parliamentary  democracy.  Its  main  function  was 
legitimation,  although  it  was  also  the  meeting 
point  and  place  of  negotiation  for  various  group 
interests,  above  all  at  the  local  level.  The  commu¬ 
nist  system,  based  on  the  assumption  of  a  unity  of 
interests  in  society — because  after  the  elimination 
of  antagonistic  classes,  the  basis  for  opposition  of 
interests  was  supposed  to  disappear — had,  how¬ 
ever,  to  deal  with  their  real  differentiation.  These 
interests  were  various  but  in  a  significant  way 
arose  from  the  segmentation  introduced  by  the 
system  itself,  in  various  branches  and  sectors 
throughout  the  institutional  whole.  Communism 
was  unable  to  suspend  the  working  of  the  laws  of 
economics,  which  is  why  real  if  not  overt  negotia¬ 
tions  and  peculiar  bargaining  appeared  in  the 
economy.  It  was  also  not  able  to  suspend  the  gen¬ 
eration  of  interests  by  all  organizational  struc¬ 
tures — interests  in  a  large  part  autonomous  with 
respect  to  central  decrees.  In  an  informal  way,  they 
had  therefore  to  penetrate  the  formal  structures. 
This  created  fertile  ground  for  phenomena  related 
to  corruption. 

The  necessity  of  taking  into  account  the  differ¬ 
entiation  of  interests  in  the  political  sphere  in  fact 
marks  one  of  the  institutional  contradictions  of  the 
communist  regime.  On  the  one  hand,  there  was  the 
principle  of  centralism,  while  on  the  other,  to 
maintain  its  ability  to  reproduce  and  adapt  itself,  it 
was  necessary  to  incorporate  into  its  structure  the 
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differentiated  interests,  at  least  to  an  extent  that 
enabled  it  to  continue  functioning.  The  coexistence 
of  those  incoherent  elements  constituted  both  an 
opportunity  for  reform  and  a  limitation  on  the 
reformability  of  the  institutional  system  of  com¬ 
munism.  The  opportunity  was  created  by  the 
incompletely  defined  character  of  the  boundaries, 
and  the  limit  was  the  fundamental  contradiction: 
The  more  bottom-up  representation  of  interests 
was  strengthened,  the  more  central  control  was 
weakened.  This  mechanism  was  clearly  visible  in 
the  case  of  unsuccessful  economic  reform. 

The  aim  of  total  political  control  not  only 
resulted  in  its  opposite  but  also  led  to  the  disap¬ 
pearance  of  any  meaningful  politics.  A  system  in 
which  everything  is  controlled  by  politics  in  fact 
does  not  leave  room  for  any  politics  at  all.  A  lack 
of  real  politics,  understood  as  negotiation  of  plu¬ 
ralistic  interests,  seems  obvious  in  the  case  of  com¬ 
munism — in  which,  paradoxically,  everything  was 
meant  to  be  politics. 

From  a  Centrally  Planned 
Economy  to  Disorganization 

The  economy  was  one  of  the  spheres  in  which  the 
communist  system  was  supposed  to  fulfill  ideo¬ 
logical  aims.  In  practice,  this  was  a  matter  of  sub¬ 
ordination  of  economic  choices  to  political  crite¬ 
ria.  This  mechanism  was  based  on  institutional 
structures  joining  economic  organisms  with  the 
administration  of  the  state,  which  moreover  was 
the  chief  owner  of  the  economy.  The  basis  for  eco¬ 
nomic  choices  and  decisions  was  a  system  of  eco¬ 
nomic  plans  whose  justification  was  the  overcom¬ 
ing  of  the  irrationality  and  contradictions  of  the 
capitalist  economy  and  the  ensuring  of  social  needs 
by  an  economy  of  real  socialism.  This  system  cre¬ 
ated  institutional  frameworks  for  the  so-called 
socialist  industrialization,  which  according  to  the 
Polish  sociologist  Witold  Morawski  was  charac¬ 
terized  by  preeminence  of  the  means  of  produc¬ 
tion,  central  planning,  hierarchic  administrative 
control,  strong  engagement  of  the  party  in  the 
economy,  and  a  lack  of  independent  economic 
institutions. 

The  striving  for  political  control  was  mani¬ 
fested  in  management  and  planning  carried  out 
centrally.  The  centralist  way  of  planning  and  the 
command-distribution  system  of  management  was 


justified  strictly  by  economic  necessity;  some¬ 
times,  it  was  argued  that  it  was  necessary  in  a 
situation  of  limited  resources.  In  practice,  it 
turned  out  that  linking  an  economy  of  scarcity  to 
centralization  produced  the  opposite  of  what  was 
intended.  Centralization  failed  as  a  mechanism  of 
control  precisely  when  resources  were  scarce,  and 
it  transformed  itself  into  its  own  opposite,  into 
spontaneous  decentralization.  Sociologists  such  as 
Lena  Kolarska-Bobinska  have  shown  that  in  such 
a  situation  of  an  economic  scarcity,  real  influence 
was  gained  by  suppliers  of  resources  at  the  cost  of 
the  political  authorities,  and  this  is  a  way  of 
decentralizing  without  a  precise  awareness.  As  a 
result,  the  system  of  management  of  the  economy, 
instead  of  being  a  system  of  its  hierarchical  con¬ 
trol  became  instead  the  space  for  various  games  in 
which  organizations  and  various  interest  groups 
within  the  institutions  of  management  partici¬ 
pated.  However,  that  spontaneous  decentraliza¬ 
tion  did  not  damage  the  essence  of  the  system 
because  it  still  had  at  least  two  mechanisms  of 
control.  The  first  was  control  by  personnel  policy, 
including  the  nomenklatura  mechanism,  through 
which  the  party  controlled  its  personnel.  Perhaps, 
the  economy  was  uncontrollable,  but  that  did  not 
mean  that  some  other  source  of  control  gained 
power.  The  economy  drifted  rather  than  moving 
in  another  ideologically  unacceptable  direction. 
The  economy  was  further  limited  by  the  fact  that 
due  to  communism’s  definition  of  the  preemi¬ 
nence  of  the  branch  producing  the  means  of  pro¬ 
duction,  suppliers  of  precisely  that  branch  gained 
more  influence  and  indirectly  reinforced  the  stra¬ 
tegic  aim  of  control  over  the  economy.  That  con¬ 
trol  at  best  became  more  static  than  dynamic;  the 
economy  did  not  achieve  the  set  goals. 

A  second  source  of  limitation  on  the  centraliz¬ 
ing  mechanisms  was  the  cycle  of  reform.  To  satisfy 
the  needs  of  society,  the  communist  economy  had 
to  realize  economic  aims,  not  merely  political  ones. 
The  implementation  of  communism  showed  that 
these  aims  remained  in  contradiction  with  one 
another,  and  in  the  last  instance,  political  aims 
took  precedence.  The  typical  cycle  of  unrealized 
reform  resulted  in  low  effectiveness  of  manage¬ 
ment  leading  to  the  danger  of  social  upheaval, 
which  was  quite  frequent  in  some  communist 
countries  (e.g.,  Poland).  In  response  to  this  threat 
to  its  power,  the  current  government  instituted 
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economic  reform  by  loosening  its  regulation  of  the 
economy;  however,  this  decreased  control  of  the 
economy,  which,  in  turn,  heightened  the  govern¬ 
ment’s  fear  of  losing  power,  and  therefore,  the 
government  retreated  from  this  economic  policy.  In 
this  way,  the  same  political  factor  (fear  of  loss  of 
power)  led  both  to  the  initiation  of  reform  and  to 
withdrawal  of  the  same.  This  does  not  mean,  how¬ 
ever,  that  the  reforms  had  no  effect.  Each  led  to  the 
discovery  of  successive  limits  to  the  reformability  of 
the  system.  At  different  points  of  time  during  com¬ 
munist  rule,  various  provisions  were  considered  as 
solutions  to  existing  problems,  such  as  change  of 
cabinets,  managerial  reform  (more  independence 
for  directors  of  firms),  and  a  role  for  mechanisms  of 
self-government.  None  of  these  reforms  on  its  own 
was  introduced  systematically,  but  together,  they 
had  a  cumulative  effect.  That  accumulation  of 
experience  of  unsuccessful  attempts  at  reform  led  in 
the  end  to  the  discovery  that  it  was  impossible  to 
reform  communist  economic  policy. 

The  next  characteristic  of  the  communist  model 
of  management  was  emphasis  on  formal  procedures 
and  mechanisms  that  also  generated  their  own  con¬ 
tradictions  in  the  form  of  the  role  of  informal 
mechanisms.  The  inability  to  satisfy  the  needs  of  the 
population  generated  various  informal  mechanisms 
that  can  be  regarded  as  types  of  supply  structures. 
The  informal  economy  is  compatible  with  different 
systems,  but  under  communism,  it  fulfilled  a  spe¬ 
cific  function.  This  is  because  its  essence  was  not 
avoidance  of  registration  for  reasons  such  as  taxa¬ 
tion,  as  happens  in  market  economies.  In  the  non- 
market  economy,  it  served  as  a  mechanism  compen¬ 
sating  for  the  deficit  of  market  goods.  From  this 
point  of  view,  it  can  be  said  that  it  constituted  a 
necessary  supplement  to  the  economy  of  “first  cir¬ 
culation”  to  a  certain  degree  tolerated  by  the  offi¬ 
cial  system.  Therefore,  it  is  possible  in  fact  to  talk 
about  a  powerful  entanglement  of  formal  and  infor¬ 
mal  structures  in  which  the  existence  of  the  latter 
was  an  important  element  of  the  economy. 

However,  not  every  deviation  from  the  eco¬ 
nomic  aims  of  enterprises  was  a  result  of  informality. 
Some  were  direct  results  of  ideological  assumptions 
according  to  which  the  workplace  was  supposed  to 
be  a  forum  for  achieving  not  only  pragmatic  but 
also  social  aims:  the  socialization  of  employees  to 
function  in  conditions  laid  down  and  preferred 
by  the  system.  Creation  of  the  real  communist 


economy  was  at  the  same  time  an  attempt  to  create 
a  class  of  socialist  employees.  For  this  reason,  there 
were  various  forms  of  competition,  ideological 
schooling,  and  methods  for  satisfying  the  needs  of 
employees  in  their  role  as  consumers,  not  through 
the  market  but  through  the  workplace.  The  whole 
system  of  the  communist  welfare  state  was  based 
to  a  large  extent  on  the  distribution  of  various 
goods  through  the  workplace.  The  provision  of 
health  care,  access  to  cultural  events,  and  educa¬ 
tion  were  often  tied  to  the  workplace.  This  social 
function  of  the  workplace — constituting  an  ele¬ 
ment  of  the  ideology — was  at  the  same  time  func¬ 
tional  in  addressing  the  permanent  state  of  econ¬ 
omy  of  scarcity.  Distribution  through  the  work¬ 
place  of  consumer  goods  in  short  supply  was  in 
fact  a  supply  mechanism  of  the  economy  of  scar¬ 
city.  Simultaneously,  it  found  its  justification  in  the 
ideology  of  the  system. 

Society:  From  a  Myth  of  Unity 
to  Real  Contradictions 

The  model  of  communist  society  in  its  actual  form 
never  achieved  full  unity.  This  section  first  out¬ 
lines  the  mechanisms  creating  inequality  into  a 
system  appealing  to  the  idea  of  equality  and  then 
discusses  the  rejection  of  the  legitimization  of  the 
system  that  according  to  its  ideology  rested  on 
mass  support. 

The  real  communist  principle  of  “from  each 
according  to  his  usefulness  to  the  authorities” 
infringed  the  principle  of  “from  each  according  to 
his  work,”  which  was  an  element  of  the  ideological 
basis  of  communism.  The  two  therefore  were  in 
contradiction  and,  what  is  more,  infringement  of 
either  of  them  brought  about  a  crisis  of  legitimiza¬ 
tion  with  respect  to  the  basic  principles  of  the 
identity  of  the  system.  The  mechanisms  of  the 
deviation  from  the  principle  of  “from  each  accord¬ 
ing  to  his  work  (ability,  contribution)”  were 
largely  caused  by  politics  that  by  a  system  of  pri¬ 
orities  favored  defined  institutional  segments  of 
the  system.  In  the  economic  sphere,  it  was  the 
branch  of  production  of  the  means  of  production, 
and  above  all  heavy  industry,  and  those  segments 
of  machine  building  that  were  useful  for  military 
production.  This  was  not  just  a  matter  of  ideologi¬ 
cally  motivated  segmentation.  In  the  case  of  the 
real  economy  of  scarcity,  branches  or  indeed  firms 
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producing  scarce  consumer  goods  took  on  particu¬ 
lar  importance. 

This  whole  mechanism  demolished  in  an  obvi¬ 
ous  way  the  official  vision  of  social  stratification. 
It  led  to  the  decomposition  of  status  factors 
described  in  the  literature  by  shaking  the  tie 
between  contribution  of  work  measured  in  terms 
of — for  example — qualifications  and  prestige  or 
earnings.  Instead,  in  the  definition  of  the  level  of 
income  from  work,  or  more  broadly  the  economic 
position  of  the  individual,  an  essential  role  was 
played  by  the  branch  of  industry  or  indeed  the  type 
of  organization  in  which  one  worked.  Systemic 
sources  of  social  inequality  in  communism  were 
supposed,  in  theory,  to  ensure  the  political  stability 
of  the  system  but,  in  practice,  led  to  further  erosion 
of  its  legitimization.  This  is  because  it  was  not  pos¬ 
sible  to  justify  any  of  the  legitimizing  principles — 
by  appeal  either  to  the  slogan  “from  each  accord¬ 
ing  to  his  work”  or  to  the  slogan  “from  each 
according  to  his  needs.”  This  is  another  example 
showing  how  the  adaptive  mechanisms  generated 
by  systemic  incoherence  sustained  its  further  repro¬ 
duction  in  the  short  term  while,  in  the  longer  term, 
contributing  to  its  collapse. 

All  this  revealed  that  mass  legitimization  (the 
conviction  that  subordination  to  the  system  is 
morally  justified)  was  a  myth  to  be  replaced  by 
multiple  processes  of  adaptation.  Various  authors 
have  shown  that  adaptation  rather  than  legitimiza¬ 
tion  was  the  factor  maintaining  stability,  albeit 
within  a  limited  scope  and  for  a  limited  time.  It 
lasted  for  some  time,  however,  and  those  factors  of 
accommodating  stabilization  were  varied  and 
evolving.  There  were  attempts  to  substitute  eco¬ 
nomic  achievements  for  political  legitimization, 
with  a  relative  quasi  legitimization  or  various 
forms  of  pragmatic  adaptation  to  the  system. 

Without  doubt,  the  establishment  of  the 
Solidarity  trade  union  in  Poland  in  1980  dealt  a 
vital  blow  to  the  legitimizing  myths  of  commu¬ 
nism.  This  was  because  for  the  first  time,  workers 
questioned  the  myth  of  unity  of  interests  in  an 
open  and  in  fact  institutionalized  manner.  They 
demanded  neither  joint  management  nor  partici¬ 
pation  in  decisions,  as  had  occurred  after  1956  on 
the  wave  of  popularity  of  self-management. 
Rather,  the  demand  was  for  trade  unions  as  sepa¬ 
rate  organizations  that  would  serve  their  interests 
as  employees,  interests  that  they  did  not  see  as 


identical  to  those  of  their  employers.  This  was  the 
beginning  of  the  end  of  the  system  based  on  the 
myth  of  unity  of  those  interests  because  the  state 
under  real  communism  was  supposed  to  be  pre¬ 
cisely  the  representative  of  employees’  interests 
since  that  was  why  private  ownership  of  the 
means  of  production  was  abolished.  In  this  way, 
the  legitimizing  myth  of  communism  came  to  an 
end.  Before  that  happened,  however,  communist 
systems  were  torn  by  waves  of  outbreaks  of  social 
dissatisfaction  (e.g.,  during  1956  in  Hungary  and 
in  1970,  1976,  and  1980  in  Poland)  prompted  by 
frustration  with  an  inefficient  economy  that  failed 
to  meet  workers’  interests  and  economic  needs.  By 
degrees,  however,  these  revolts  took  on  a  political 
character  and  articulated  a  demand  for  institu¬ 
tional  change,  one  example  of  which  was  the  cre¬ 
ation  of  trade  unions  independent  of  the  authori¬ 
ties,  as  noted  previously. 

Variations  in  Communist  Systems 

Communist  systems  were  not  static;  they  under¬ 
went  change  over  time.  This  dynamic  was  one  of 
the  two  fundamental  sources  of  their  variety.  The 
second  was  differentiation  resulting  from  various 
traditions  and  social  and  cultural  conditions.  The 
first  communist  state  constituting  a  sort  of  tem¬ 
plate  of  the  system  was  the  Soviet  Union.  (Formally, 
it  was  established  in  1922;  however,  Soviet  Russia 
came  into  existence  in  1917  and  was  not  dissolved 
until  1991.)  The  government  of  Joseph  Stalin 
(1878-1953),  who  led  the  Communist  Party  in  the 
USSR  from  1922  until  his  death,  can  be  seen  as  the 
formative  period  of  the  institutions  of  the  commu¬ 
nist  system.  This  period  coincided  with  the  build¬ 
ing  of  the  military  position  and  the  significance  of 
the  communist  system  that  controlled  many  coun¬ 
tries  of  Central  and  Eastern  Europe  after  World 
War  II  (1939-194 5),  realizing  the  imperial  mission 
of  the  Soviet  Union  in  the  face  of  the  passivity  of 
the  West  with  respect  to  the  expansion  of  the 
USSR,  recently  an  ally  in  the  war  against  Nazi 
Germany.  The  building  of  this  position  was 
achieved  at  the  cost  of  millions  of  victims  of  a 
whole  system  of  repression  comprising  police  ter¬ 
ror,  administration  of  justice  entirely  under  politi¬ 
cal  control,  and  a  system  of  labor  camps.  Estimates 
of  the  number  of  victims  of  the  entire  communist 
system  fall,  according  to  Jean-Louis  Margolin  and 
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Nicolas  Werth,  in  the  range  of  between  65  million 
and  93  million  (Stephen  Courtois  et  ah,  1999). 
After  the  death  of  Stalin,  the  communist  system 
underwent  gradual  liberalization,  with  a  some¬ 
what  looser  political  control  over  different  spheres 
of  life  and  a  weakening  of  the  system  of  mass 
repression  against  individuals  and  groups  seen  as 
enemies  of  the  system.  Even  though  the  wave  of 
terror  and  repression  of  Stalinist  times  did  not 
return,  nevertheless,  the  systems  never  achieved 
complete  liberalization  because  their  full  realiza¬ 
tion  would  have  simply  meant  questioning  their 
very  essence. 

Apart  from  changes  over  time,  it  is  also  worth 
noting  local  variations  in  the  communist  system. 
Communism  was  introduced  by  force  after  World 
War  II  in  many  countries  of  Eastern  and  Central 
Europe  and  was  brought  to  an  end  by  the  peaceful 
revolutions  of  1989.  Since  1949,  a  communist  sys¬ 
tem  has  prevailed  in  China,  which  today  has  a 
population  of  more  than  1.3  billion;  communist 
systems  also  exist  today  in  North  Korea  and  Cuba. 
There  is  therefore  a  group  of  communist  countries 
with  varied  traditions  and  social  and  ethnic  struc¬ 
tures.  Communist  systems  within  Europe  were 
also  quite  varied.  For  example,  the  Romanian  vari¬ 
ant,  which  maintained  a  certain  independence 
from  the  Soviet  Union,  was  much  more  repressive 
than  the  Hungarian  or  Polish  versions,  while  the 
Yugoslavian  variant  in  turn  strongly  accented  self¬ 
management  (under  the  control  of  the  Communist 
Party,  however).  Taking  the  Asian  variants  into 
account  only  deepens  the  differentiation.  The 
Chinese  model  links  a  powerful,  almost  fully  mar¬ 
ket  economy  with  political  control  by  the 
Communist  Party;  in  contrast,  the  North  Korean 
model  approaches  the  totalitarian  model  in  the 
scale  of  its  control  and  repression.  In  spite  of  the 
differences  among  communist  systems,  the  limits 
to  change  were  defined,  and  attempts  to  cross 
them  met  with  refusal.  They  were  emphatically 
manifested  in  the  military  interventions  by  the 
Soviet  army  in  Hungary  in  1956  and  in 
Czechoslovakia  in  1968  (when  the  Soviet  army 
was  accompanied  by  those  of  a  group  of  countries 
from  the  Warsaw  Pact)  and  also  the  introduction 
of  martial  law  by  the  Polish  Communists  in  1981. 
In  each  case,  these  actions  constituted  a  brutal 
reaction  to  attempts  to  liberalize  the  communist 
systems. 


Collapse  of  Communist  Systems 

The  sources  of  the  downfall  of  the  communist  sys¬ 
tem  lie  above  all  in  its  inability  to  accommodate 
itself  to  social  needs  and  aspirations.  This  brought 
about  many  manifestations  of  dissatisfaction  and 
the  establishment  of  movements  of  opposition  to 
communism  in  many  countries  (e.g.,  Sajudis  in 
Lithuania,  Charter  77  in  Czechoslovakia,  and  the 
Workers’  Defence  Committee  in  Poland).  The  cata¬ 
lyst  for  these  changes  was  without  doubt  the  disin¬ 
tegration  of  the  Soviet  Union,  being  an  unintended 
effect  of  reforms  begun  by  Gorbachev,  leader  of  the 
Soviet  Communist  Party  in  the  last  period  of  exis¬ 
tence  of  the  USSR.  Additional  factors  in  the  erosion 
of  communist  systems  were  the  opposition  to  com¬ 
munism  and  the  diplomatic  efforts  of  Pope  John 
Paul  II.  It  is  a  unique  phenomenon  in  the  history  of 
civilization  that  basically,  apart  from  certain  excep¬ 
tions,  the  collapse  of  the  Soviet  Union  and  the  com¬ 
munist  systems  in  the  European  countries  took  place 
peacefully,  mainly  as  a  result  of  negotiation  by 
opposition  elites  with  the  communist  authorities. 

Communist  systems,  then,  had  internal  sources 
of  dynamics  and  change.  The  dynamic  was  neither 
ideologically  based,  nor  one  consisting  exclusively 
of  constant  protest  under  which  communism  even¬ 
tually  folded  and  collapsed.  It  was  also  a  matter  of 
a  dynamic  resulting  from  contradictions  generated 
by  the  system  itself  that  turned  out  in  the  end  to  be 
insurmountable  and  that  led  to  its  collapse. 

Not  all  the  cracks  in  the  systems  arose  from 
contradictions  generated  by  the  dynamic  of  com¬ 
munism:  Some  of  them  were  the  result  of  the  per¬ 
sistence  of  certain  systemic  remnants  from  the 
previous  regime  (in  certain  countries,  e.g.,  in 
Poland,  a  large  role  was  played  by  private  owner¬ 
ship  of  agricultural  land  and  by  the  Roman 
Catholic  Church). 

In  this  way,  then,  the  processes  of  adaptation  that 
made  continuation  of  the  system  possible,  in  the 
end,  also  constituted  the  seed  of  its  collapse,  as  they 
led  to  erosion  of  resources  that  finally  rendered  the 
system  incapable  of  production.  They  were,  there¬ 
fore,  the  source  of  the  persistence  of  the  communist 
system,  its  modification,  and  finally  its  collapse. 
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Communitarianism 


Communitarianism  is  a  social  philosophy  that 
builds  on  the  assumption  that  the  good  should  be 
defined  socially.  This  core  assumption  is  in  sharp 
contrast  with  liberalism,  which  assumes  that 
each  person  ought  to  determine  the  good  indi¬ 
vidually.  Communitarianism  stresses  that  indi¬ 
viduals  are  socially  “embedded”  rather  than  free 
agents,  that  people  have  social  responsibilities  to 
each  other,  to  their  communities,  and  to  the  com¬ 
mon  good. 

Although  communitarianism  is  a  small  philo¬ 
sophical  school,  it  has  a  measure  of  influence  on 
public  dialogues  and  politics,  especially  as  an  anti¬ 
dote  to  the  laissez-faire  conservatism  championed 
by  Margaret  Thatcher  in  Britain  and  Ronald 
Reagan  in  the  United  States.  President  Barack 
Obama  gave  voice  to  communitarian  ideas  and 
ideals  in  his  book  The  Audacity  of  Hope  and 
repeatedly  during  the  2008  presidential  election 
campaign,  calling  on  his  fellow  citizens  to 
“ground  our  politics  in  the  notion  of  a  common 
good”  (2006,  p.  9),  for  an  “age  of  responsibility” 
and  to  forgo  identity  politics  in  favor  of  building 


community-wide  unity.  At  the  same  time,  it  should 
be  noted  that  many  in  the  West  consider  communi¬ 
tarian  an  awkward  term  that  evokes  misleading 
associations.  Hence,  Obama,  as  well  as  other  public 
leaders  who  have  embraced  communitarian  themes, 
such  as  Tony  Blair  in  the  United  Kingdom  and  Bill 
Clinton  in  the  United  States,  avoid  the  term  itself. 

Branches  of  Communitarianism 
Academic  (Philosophical)  Communitarianism 

In  the  1980s,  communitarian  thinking  was 
largely  associated  with  the  works  of  political  phi¬ 
losophers  such  as  Charles  Taylor,  Michael  Sandel, 
and  Michael  Walzer.  Others  sometimes  associated 
with  this  group  include  Alasdair  MacIntyre,  Seyla 
Benhabib,  and  Shlomo  Avineri.  These  scholars 
called  attention  to  the  mistaken  assumptions  about 
the  nature  of  the  self  on  which  liberal  philosophy, 
especially  as  espoused  by  John  Rawls,  rested. 
Liberalism,  these  communitarian  critics  pointed 
out,  views  the  person  as  divorced  from  all  his 
moral  commitments  and  communal  attachments. 
These  communitarians  challenged  this  view, 
depicting  the  self  as  fundamentally  “situated”  or 
“contextualized”  in  a  given  culture,  within  a  par¬ 
ticular  history,  and  with  a  particular  set  of  values. 
These  academic  communitarians,  and  the  sociolo¬ 
gists  who  preceded  them,  like  Emile  Durkheim  and 
Ferdinand  Tonnies,  stressed  that  individuals  in 
viable  communities  not  only  flourish  as  human 
beings  but  also  are  more  reasonable  and  produc¬ 
tive  than  isolated  individuals.  Only  if  social  pres¬ 
sures  to  conform  rise  to  excessively  high  levels  do 
they  undermine  the  development  and  expression  of 
the  self. 

Academic  communitarians  also  argued  that 
the  nature  of  the  political  community  was  mis¬ 
understood  by  liberalism.  Where  liberal  philos¬ 
ophers  described  a  neutral  framework  of  rules 
within  which  a  diversity  of  commitments  to 
moral  values  can  coexist,  communitarians 
showed  that  such  a  “thin”  conception  of  politi¬ 
cal  community  was  both  empirically  misleading 
and  normatively  dangerous.  Good  societies, 
according  to  these  authors,  rested  on  much 
more  than  “neutral”  rules  and  procedures;  they 
relied  on  shared  moral  culture. 

Some  academic  communitarians  argued  even 
more  strongly  on  behalf  of  particularistic  values, 
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suggesting  that,  indeed,  these  were  the  only  kind  of 
values  that  mattered  and  that  it  was  a  philosophical 
error  to  posit  any  universal  moral  values.  Alasdair 
MacIntyre  wrote  that  human  rights  are  as  real  as 
unicorns.  Michael  Walzer  initially  argued  that  con¬ 
crete  universal  values  were  philosophically  illusory 
and  that  societies  could  be  measured  only  according 
to  their  own  particularistic  moral  standards.  As  the 
debate  over  abstract  universal  values  gave  way  to  a 
discussion  about  cross-cultural  justifications  of 
human  rights,  the  problems  of  such  a  relativistic 
position  came  to  be  widely  (though  not  universally) 
acknowledged.  In  the  1990s,  responsive  communi¬ 
tarians  developed  a  position  that  accommodated 
both  particularistic  and  universal  values  (see  below). 

It  should  be  noted  that,  despite  being  widely 
referred  to  as  communitarians,  Taylor,  Sandel,  and 
Walzer  systematically  avoided  the  term.  Arguably, 
this  was  the  case  because  the  term  used  to  be,  and 
to  some  extent  still  is,  associated  with  authoritar¬ 
ian  communitarianism. 

Authoritarian  Communitarianism 

Often  referred  to  as  East  Asian  communitari¬ 
ans,  authoritarian  communitarians  argue  that  to 
maintain  social  order  and  harmony,  individual 
rights  and  political  liberties  must  be  curtailed. 
They  hold  that  individuals  find  their  role  and 
meaning  in  their  service  to  the  common  good,  are 
organic  parts  of  a  great  whole,  just  as  human  cells 
are  in  a  human  body. 

Some  East  Asian  communitarians  believe  in  the 
strong  arm  of  the  state  (e.g.,  former  Singaporean 
Prime  Minister  Lee  Kuan  Yew  and  Malaysian 
head  of  state  Mahathir  Bin  Mohammad)  and 
some  in  strong  social  bonds  and  the  voice  of  the 
family  and  community  (especially  the  kind  of 
society  Japan  had,  at  least  until  1990). 
Authoritarian  communitarians  argue  that  the 
Western  value  of  liberty  actually  amounts  to 
social,  political,  and  moral  anarchy  and  that  legal 
and  political  rights  are  a  distinctively  Western 
idea  that  the  West  uses  to  impose  its  own  vision 
on  other  cultures,  which  have  their  own  preferred 
values.  Over  the  years,  East  Asian  communitari¬ 
ans  have  moderated  many  of  these  claims,  have 
made  more  room  for  individual  rights,  and  have 
been  increasingly  eclipsed,  at  least  in  the  West,  by 
responsive  communitarians. 


Responsive  (Political)  Communitarianism 

Early  in  the  1990s,  a  new  group  was  founded  by 
Amitai  Etzioni,  working  with  William  A.  Galston, 
which  took  the  communitarian  philosophy  from  a 
small  academic  group,  introduced  it  into  public 
life,  and  recast  its  academic  content.  The  group, 
variously  referred  to  as  “responsive,”  “neo,”  or 
“political”  communitarians,  stressed  the  impor¬ 
tance  of  society  and  its  institutions  above  and 
beyond  that  of  the  state  and  the  market,  the  focus 
of  other  public  philosophies.  It  emphasized  the  key 
role  played  by  socialization,  moral  culture,  and 
informal  social  controls  rather  than  state  coercion 
or  market  pressures.  Responsive  communitarian¬ 
ism  served  as  a  major  correction  to  authoritarian 
communitarianism  by  stressing  that  strong  rights 
presume  strong  responsibilities  and  that  one  should 
not  be  neglected  in  the  name  of  the  other. 

The  group  started  by  forming  a  platform, 
whose  drafters  included  Mary  Ann  Glendon  (law); 
Thomas  Spragens  Jr.;  James  Fishkin  and  Benjamin 
Barber  (political  science);  Hans  Joas,  Phillip 
Selznick,  and  Robert  Bellah  (sociology);  and  Alan 
Ehrenhalt  (author).  The  platform  was  initially 
endorsed  by  more  than  150  public  leaders  from 
across  the  political  spectrum.  The  voice  the  group 
raised  was  soon  found  in  numerous  op-eds  and 
public  lectures  and  on  TV  and  radio  programs  in  a 
considerable  number  of  countries.  The  group  also 
issued  several  position  papers  on  subjects  such  as 
organ  donation,  character  education,  and  HIV 
testing.  In  the  1990s,  several  of  the  members  of 
this  group  worked  with  the  New  Democrats  and 
advocates  of  the  Third  Way  in  Europe. 

The  ideas  of  the  group  were  further  developed, 
both  on  the  public  side  and  on  the  academic  side, 
in  books  and  in  an  intellectual  quarterly,  The 
Responsive  Community.  These  works  stressed  that 
social  institutions  and  public  policies  should  reflect 
shared  values  and  the  common  good  in  addition  to 
aggregation  of  individual  preferences,  which  them¬ 
selves  are  culturally  penetrated.  Beyond  universal 
principles,  communitarianism  emphasizes  particu¬ 
larism,  the  special  moral  obligations  people  have 
to  their  families,  kin,  communities,  national  societ¬ 
ies,  and  the  nascent  global  community. 

Responsive  communitarians  showed  that  soci¬ 
ety  is  best  understood  not  as  composed  of  millions 
of  individuals,  but  as  pluralism  within  unity.  That 
is,  subcultures  and  loyalties  to  various  ethnic  and 
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regional  communities  do  not  undermine  the  integ¬ 
rity  of  society  as  long  as  a  core  of  shared  values 
and  institutions — such  as  the  Constitution  and  its 
Bill  of  Rights,  the  democratic  way  of  life,  and 
mutual  tolerance — are  respected.  These  observa¬ 
tions  are  of  special  import  today  for  societies  that 
are  in  the  process  of  coming  to  terms  with  mass 
immigration  (e.g.,  many  European  societies  and 
Japan)  and  in  which  minorities  are  questioning 
their  place  in  the  national  whole  (Quebecois, 
Scots,  Basques,  Sunnis  in  Iraq,  etc.). 

Although  this  model  of  the  good  society  is 
applicable  to  all  societies,  at  different  moments  in 
history,  a  given  society  is  likely  to  miss  the  desired 
balance  between  rights  and  the  common  good  or 
between  particularistic  loyalties  and  society-wide 
bonds  but  in  a  direction  different  from  that  of  oth¬ 
ers.  Hence,  different  societies  may  need  to  move  in 
different  directions  in  order  to  approximate  the 
same  balance.  Thus,  contemporary  East  Asian 
societies  require  moving  toward  much  greater  tol¬ 
erance  for  individualism  and  pluralism,  while  in 
American  society,  as  Robert  Bellah  and  his  col¬ 
leagues  along  with  many  others  have  shown, 
excessive  individualism  ought  to  be  reined  in. 

Applications  of  Responsive  Communitarianism 
for  Political  Science 

The  following  are  a  few  illustrative,  salient 
issues  to  which  responsive  communitarianism 
has  contributed. 

International  Relations  Theory 

Although  communitarianism  has  dealt  mainly 
with  national  societies,  it  has  also  been  applied  on 
the  international  level.  Given  the  rapid  increase  of 
pressing  transnational  issues,  such  as  environmen¬ 
tal  degradation,  illegal  immigration,  financial  cri¬ 
ses,  transnational  mafias,  and  terrorists  with  global 
reach,  the  need  for  significant  changes  in  transna¬ 
tional  governance  (sometimes  referred  to  as  global 
governance)  seems  evident.  As  a  rule,  transnational 
problems  are  still  tackled  by  the  Old  System — by 
national  governments  and  the  international  orga¬ 
nizations  managed  by  national  representatives  and 
funded  by  national  allotments.  But  in  most  cases, 
this  Old  System  has  proved  to  be  inadequate  to 
cope  with  these  problems. 


Communitarians  have  addressed  the  question  of 
how  new  transnational  institutions  may  be  con¬ 
structed.  They  argued  that  just  as  in  early  ages, 
loyalties  and  commitments  once  limited  to  local 
communities  were  transferred  in  part  to  national 
communities,  in  the  future,  some  such  transfer  will 
have  to  take  place  to  global  institutions. 
Supranationality,  the  concept  that  forms  the  back¬ 
bone  of  the  communitarian  theory  of  international 
relations  outlined  here,  characterizes  a  political 
body  that  has  acquired  some  of  the  attributes  usu¬ 
ally  associated  with  a  nation,  such  as  political  loy¬ 
alty  and  decision-making  power,  based  not  on  an 
aggregate  of  national  decisions  or  those  made  by 
representatives  of  the  member  states  but  rather  on 
those  made  by  the  supranational  bodies  themselves. 
Their  capacity  to  make  decisions  on  their  own 
terms  allows  supranational  bodies  to  move  with 
much  greater  agility  and  speed  than  international 
organizations.  At  this  stage,  few  such  institutions 
exist  and  those  that  do  face  strict  limitations 
because  nations  still  form  the  major  communitar¬ 
ian  bodies.  A  study  of  the  International  Criminal 
Court,  the  Internet  Corporation  for  Assigned 
Names  and  Numbers,  and  the  World  Trade 
Organization  illustrates  this  point.  The  European 
Union,  the  most  advanced  attempt  to  form  supra¬ 
national  institutions,  is  not  global  but  regional,  and 
it  too,  so  far,  has  had  only  limited  success  in  trans¬ 
ferring  significant  communal  loyalties  to  a  transna¬ 
tional  level. 

Homeland  Security 

Liberals  and  especially  libertarians  tend  to  seek 
to  protect  individual  rights  against  the  claims  of 
homeland  security.  For  instance,  they  call  for  treat¬ 
ing  terrorists  like  other  criminals,  according  them 
the  full  rights  of  citizens.  East  Asian  communitar¬ 
ians  tend  to  hold  that  security  is  a  common  good 
that  trumps  individual  rights  and,  hence,  support 
policies  that  give  up  civil  liberties  for  the  sake  of 
security.  Responsive  communitarianism  holds  that 
there  must  be  a  carefully  crafted  balance  between 
rights  and  security  and  that  the  point  of  balance 
changes  as  technical  and  international  conditions 
change.  They  point  out  that  courts  in  free  societies 
regularly  use  the  terminology  of  balance  between 
the  public  interest  and  individual  rights  and  allow 
the  latter  to  be  curtailed  when  they  undermine  a 
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“compelling  public  interest” — for  instance,  allow¬ 
ing  the  violation  of  privacy  of  sex  offenders  in 
order  to  protect  children  from  sex  abuse  and 
authorizing  wire  taps  for  suspected  killers. 
Responsive  communitarians  have  extended  the 
same  concept  to  homeland  security. 

On  reviewing  new  security  measures  introduced 
in  the  United  States  after  the  terrorist  attacks  of 
September  11,  2001;  in  the  United  Kingdom  after 
the  IRA’s  terrorism;  and  in  Spain  after  the  Madrid 
train  bombings,  responsive  communitarians  found 
a  somewhat  more  complex  picture  than  the  claim 
that  fear  mongering  had  led  to  wholesale  viola¬ 
tions  of  rights  (“privacy  is  dead”)  or  that  rights 
advocates  block  vital  security  measures  (indeed 
that  even  raising  such  questions  is  “aiding  and 
abetting  the  enemy”). 

Some  measures  are  fully  justified,  indeed  over¬ 
due.  These  often  entail  a  mere  adaptation  of  the 
law  for  technical  developments.  For  example,  the 
Foreign  Intelligence  Surveillance  Act  (FISA)  of 
1978  provided  guidelines  under  which  a  federal 
agent  could  obtain  authorization  to  conduct  sur¬ 
veillance  for  “foreign  intelligence  purposes.”  Prior 
to  9/11,  wiretap  warrants  were  limited  to  a  given 
phone.  Due  to  the  increasing  use  of  multiple  cell 
phones  and  e-mail  accounts  over  the  last  decades, 
federal  officials  engaged  in  court  authorized  sur¬ 
veillance  under  FISA  could  not  follow  suspects  as 
they  changed  the  instruments  they  were  using 
unless  they  got  a  new  court  order  for  each  com¬ 
munication  device.  The  USA  PATRIOT  Act, 
enacted  in  October  2001,  amended  the  FISA  law 
to  allow  what  is  called  “roving  surveillance  author¬ 
ity,”  making  it  legal  for  agents  to  follow  one  sus¬ 
pect,  once  a  warrant  is  granted,  whatever  instru¬ 
ment  he  or  she  uses.  It  seems  that  unless  one  holds 
that  terrorists  are  entitled  to  benefit  from  new 
technologies  but  law  enforcement  is  not  entitled  to 
catch  up,  this  is  a  reasonable  measure.  (Note  that 
the  American  constitution  bars  only  “unreason¬ 
able”  searches  and  was  never  absolute  in  this  mat¬ 
ter.  Although  the  PATRIOT  Act  has  become  a 
symbol  for  great  excesses  in  hasty  pursuit  of  secu¬ 
rity,  only  a  small  fraction,  about  15  of  its  more 
than  150  measures,  have  been  seriously  contested. 
Most  are  considered  reasonable.) 

Whatever  the  new  security  risks,  there  are  mea¬ 
sures  that  no  nation  should  resort  to.  These  taboo 
tactics  should  remain  so  even  if  one’s  adversaries 


employ  them.  Banning  these  measures  sets  free 
societies  apart  from  those  whose  regimes  and 
actions  are  illegitimate  and  provides  them  with  the 
moral  high  ground.  Key  examples  of  such  mea¬ 
sures  are  torture,  the  indefinite  suspension  of 
habeas  corpus,  and  mass  detention  on  the  basis  of 
race,  ethnicity,  or  national  origin. 

Further,  responsive  communitarians  hold  that 
many  measures  are  neither  inherently  justified 
because  enhanced  security  requires  them  nor  inap¬ 
propriate  because  they  wantonly  violate  rights. 
Instead,  their  status  is  conditioned  on  their  being 
subject  to  proper  oversight.  A  measure  that  may 
seem  tilted  toward  excessive  attention  to  security 
may  be  tolerated  if  closely  supervised  by  oversight 
organs,  while  a  measure  that  is  judged  as  tilting 
toward  excessive  attention  to  individual  rights  may 
be  tolerated  if  sufficient  exceptions  are  provided 
that  are  backed  up  by  second  balance  organs.  That 
is,  new  measures  can  either  be  excessively  privileged 
(undermining  either  security  or  the  regime  of  rights) 
or  excessively  discriminated  against  (leading  to  inac¬ 
tion  on  behalf  of  either  element  of  a  sound  balance). 

The  balance  sought  here  is  not  between  the  pub¬ 
lic  interest  and  rights  but  between  the  supervised 
and  the  supervisors.  Deficient  accountability  opens 
the  door  to  government  abuses  of  power,  and 
excessively  tight  controls  make  agents  reluctant  or 
incapable  to  act. 

In  effect,  the  history  of  homeland  protection  can 
be  told  in  terms  of  lack  of  accountability  being  over¬ 
corrected,  which  in  turn  is  overreduced,  in  the  quest 
for  a  sound  balance.  In  the  United  States,  the  FBI 
went  way  overboard  in  its  surveillance  of  alleged 
Communists  and  civil  rights  activists.  In  the  1970s, 
the  FBI  was  reined  in  by  the  Church  Committee  and 
additional  rules  the  FBI  itself  issued — rules  that 
in  turn  became  one  reason  why  9/11  was  not 
prevented.  In  reaction,  the  Bush  administration 
introduced  a  large  number  of  extreme  security  mea¬ 
sures — which  were  curtailed  by  the  Obama  admin¬ 
istration  as  of  2009.  Whether  the  proper  balance  has 
been  reached  between  rights  and  security  and  the 
proper  level  of  accountability  is  far  from  clear. 

Communal  Moral  Dialogues 
as  Agents  of  Change 

Thus  far,  this  entry  has  focused  on  responsive 
communitarianism’s  answer  to  the  question  of 
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what  makes  a  good  society — a  balance  between 
social  order  and  individual  rights.  The  remainder 
of  this  entry  deals  with  the  next  question  of  how 
understandings  of  the  good  are  shaped  and  spread 
in  societies. 

One  widely  shared  liberal  answer  to  the  ques¬ 
tion  just  articulated — what  are  the  main  ways  and 
processes  by  which  people’s  sense  of  what  is  legiti¬ 
mate  is  shaped  and  reshaped? — is  provided  by  the 
students  of  deliberative  democracy.  These  authors 
point  to  “reasoned  deliberations”  as  the  key  way 
in  which  citizens  of  a  democracy  come  to  change 
their  judgments  and  emphasize  “cool”  and  ratio¬ 
nal  processes  while  minimizing  the  role  played  by 
emotions  and  other  such  “hot”  factors. 

Responsive  communitarians  argue  that  such 
cool,  rational  deliberations  are  almost  impossible 
to  achieve  or  even  to  approximate  under  most  cir¬ 
cumstances.  The  examination  of  actual  processes 
of  decision  making,  especially  when  they  concern 
normative  matters,  shows  that  they  are  much  more 
impassionate  and  proceed  by  means  different  from 
those  depicted  by  the  champions  of  deliberative 
democracy.  More  important,  deliberative  democ¬ 
racy  is  the  “wrong”  model  for  determining  the 
normative  bases  of  political  acts — for  determining 
the  good.  Instead,  individual  preferences  and  judg¬ 
ments  are  shaped  largely  through  interactive  com¬ 
munications  about  values,  through  “moral  dia¬ 
logues”  that  combine  passion  with  normative 
arguments  and  rely  on  nonrational  processes  of 
persuasion,  education,  and  leadership.  According 
to  responsive  communitarians,  although  informa¬ 
tion  and  reason  have  an  appropriate  role  in  dia¬ 
logues  about  which  policies  are  legitimate,  infor¬ 
mation  and  reason  play  a  much  smaller  role  than 
is  often  asserted.  This  is  so  both  because  they  are 
much  weaker  tools  than  believed  and  because  a 
much  greater  role  is  played  by  another  factor:  the 
appeal  to  values. 

Moral  dialogues  often  have  no  clear  opening 
point  or  closing  event.  They  are  prolonged,  heated, 
and  seemingly  meandering.  However,  they  often 
lead  to  new  or  reformulated  shared  normative 
understandings  (cases  in  point  follow).  Even  very 
large  and  complex  societies  engage  in  moral  dia¬ 
logues  that  lead  to  changes  in  social  definitions  of 
the  good.  These  dialogues  take  place  by  linking 
millions  of  local  conversations  (e.g.,  between  cou¬ 
ples,  in  neighborhood  bars,  in  coffeehouses  or 


teahouses,  and  around  watercoolers  at  work)  into 
society-wide  networks  and  shared  public  focal 
points.  They  take  place  during  regional  and  national 
meetings  of  many  thousands  of  voluntary  associa¬ 
tions  in  which  local  representatives  dialogue — in 
state,  regional,  and  national  party  caucuses;  in  state 
assemblies  and  in  Congress;  and  increasingly  via 
electronic  links  (e.g.,  groups  that  meet  on  the 
Internet).  Focal  points  of  these  dialogues  are 
national  call-in  shows,  debates  on  network  televi¬ 
sion,  and  nationally  circulated  newspapers  and 
magazines. 

Society-wide  moral  dialogues  are  often  fostered, 
accelerated,  and  affected  by  public  events  such  as 
hearings  (e.g.,  the  Clarence  Thomas/Anita  Hill 
case  focused  discussion  on  sexual  harassment),  tri¬ 
als  (e.g.,  the  1925  Scopes  trial  challenged  the  nor¬ 
mative  standing  of  evolution),  and  demonstrations 
(e.g.,  those  that  highlighted  the  normative  case 
against  the  war  in  Vietnam). 

A  brief  illustration  follows.  Until  1970,  the  pro¬ 
tection  of  the  environment  was  not  considered  a 
shared  core  value  in  most  societies.  A  book,  Rachel 
Carson’s  Silent  Spring,  which  was  very  widely  read 
and  discussed,  triggered  a  society-wide  moral  dia¬ 
logue.  A  massive  oil  spill  and  the  ensuing  protests 
and  the  Three-Mile  Island  incident  further  estab¬ 
lished  the  subject  on  the  national  normative 
agenda.  An  estimated  200,000  people  gathered  on 
the  Capitol  Mall  in  1970  in  Washington,  D.C.,  to 
demonstrate  concern  for  the  environment  on  Earth 
Day.  These  are  a  few  of  many  such  incidents.  All 
these  served  as  dialogue  starters  and  were  followed 
by  billions  of  hours  of  dialogue,  culminating  in  a 
shared  sense  that  both  persons  and  communities 
had  a  moral  commitment  to  Mother  Earth. 

The  new  commitment  was  strong  enough  to 
legitimate  various  public  acts.  It  moved  a  conser¬ 
vative  president,  Richard  Nixon,  to  establish  the 
Environmental  Protection  Agency  and  pushed 
Congress  to  introduce  many  policies  favorable  to 
the  environment,  such  as  recycling  and  Corporate 
Average  Fuel  Economy  (CAFE)  standards.  Similar 
dialogues  led  to  profound  changes  in  what  is  con¬ 
sidered  just  treatment  of  minorities,  especially 
African  Americans  (dialogues  triggered  and  nur¬ 
tured  by  the  civil  rights  movement),  relationships 
between  genders  (by  the  women’s  movement),  and 
many  others.  Some  current  dialogues  in  American 
society,  concerning  same-sex  marriage  and  the 
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death  penalty,  have  already  affected  the  normative 
culture  and  public  policies,  though  they  are  less 
advanced  than  those  listed  above. 

The  opening  up  of  many  societies  (especially 
former  communist  ones),  the  spread  of  education, 
the  widening  of  people’s  attention  horizon  from 
the  local  toward  the  national  and  even  global  pub¬ 
lic  affairs,  the  rise  of  worldwide  TV  networks  (the 
“CNN  effect”),  increased  travel  and  immigration, 
and  the  World  Wide  Web — all  arising  over  the 
past  few  decades — have  led  to  the  development  of 
transnational  moral  dialogues,  in  general,  and  dia¬ 
logues  about  what  is  considered  legitimate,  in 
particular.  These  are  global  not  in  the  sense  that  all 
citizens  participate,  let  alone  agree,  but  in  that 
these  dialogues  reach  across  most  borders.  Thus, 
concern  for  the  environment  now  is  very  widely 
shared;  following  transitional  moral  dialogues,  a 
widely  shared  understanding  evolved  that  the 
2003  invasion  of  Iraq  by  the  Bush  administration 
was  not  legitimate;  and  shared  moral  understand¬ 
ings  have  been  inching  forward  on  matters  as  dif¬ 
ferent  as  the  bans  on  sex  tourism,  land  mines,  and 
whale  hunting. 
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Comparative  Methods 


At  the  most  general  level,  the  term  comparative 
methods  may  be  used  to  refer  to  any  research 
technique  that  focuses  on  patterns  of  similarities 
and  differences  across  cases.  More  commonly, 
however,  the  term  is  used  to  refer  to  a  family  of 
techniques  employed  by  small-N  researchers.  It  is 
this  family  of  techniques  that  is  the  main  focus  of 
this  entry.  Though  comparative  methods  do  have 
particular  advantages  for  small-N  researchers, 
there  is  no  inherent  connection  between  compara¬ 
tive  methods  and  small  Ns.  The  development  of 
formal  methods  of  comparative  research — specifi¬ 
cally,  qualitative  comparative  analysis  (QCA)  and 
related  techniques — has  facilitated  the  extension 
of  comparative  logic,  which  is  configurational  in 
nature,  to  large-N  studies. 

This  entry  has  six  major  sections.  The  first 
emphasizes  the  case-oriented  nature  of  compara¬ 
tive  research.  This  feature  clearly  differentiates  it 
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from  conventional  variable-oriented  research  and 
underscores  its  broad  links  to  qualitative  inquiry  in 
general.  The  second  section  considers  the  goals  of 
comparative  research,  linking  comparative  research 
to  both  inductive  theory  building  and  deductive 
theory  testing.  The  logic  of  comparative  analysis  is 
the  central  focus  of  the  third  section.  The  impor¬ 
tance  of  the  analysis  of  set-theoretic  relations  to 
comparative  research  is  emphasized,  and  the  anal¬ 
ysis  of  sufficiency  and  necessity  is  linked  to  subset 
and  superset  relations.  These  connections  are 
important  because  much  of  social  science  theory  is 
formulated  in  terms  of  set  relations,  and  it  is 
important  to  use  analytic  methods  that  are  true  to 
theoretical  formulations.  The  language  and  logic 
of  sets  underlie  both  informal  and  formal  methods 
of  comparative  analysis,  which  is  the  topic  of  the 
fourth  section.  Traditionally,  comparative  research¬ 
ers  have  made  use  of  informal  methods,  explicitly 
or  implicitly  relying  on  John  Stuart  Mill’s  methods 
of  agreement  and  difference.  Over  the  past  2 
decades,  Charles  Ragin  has  formalized  the  com¬ 
parative  method  as  QCA,  which  is  discussed  in 
detail  in  this  section.  In  the  fifth  section,  disconti¬ 
nuities  between  comparative  research  and  conven¬ 
tional  variable-oriented  research  are  addressed, 
and  in  the  closing  section,  current  development  in 
formal  methods  of  comparative  analysis  is 
described. 

The  Case-Oriented  Nature 
of  Comparative  Research 

Comparative  researchers  tend  to  conceptualize 
causality  in  terms  of  necessity  and  sufficiency.  A 
common  concern  is  the  combination  of  condi¬ 
tions  that  are  sufficient  for  an  outcome.  Often, 
comparative  researchers  find  that  different  com¬ 
binations  of  causal  conditions  generate  the  same 
outcome.  This  approach  to  causal  conditions  is 
distinct  from  that  of  variable-oriented  research 
programs,  which  seek  to  identify  the  net  effect  of 
each  independent  variable  on  the  dependent  vari¬ 
able,  isolating  its  effect  from  that  of  competing 
variables. 

While  the  sufficiency  of  causal  combinations  is  a 
major  preoccupation  of  comparativists,  necessary 
conditions  are  also  important,  especially  to  theory. 
For  example,  Barrington  Moore’s  famous  maxim, 
“No  bourgeoisie,  no  democracy,”  exemplifies  the 


necessary  condition:  The  rise  of  a  capitalist  class 
is  necessary  for — but  by  no  means  guarantees — 
the  development  of  democracy.  The  bourgeoisie 
would  also  need  to  resist  domination  by  the  aris¬ 
tocracy  and  peasantry,  promote  the  development 
of  commercial  agriculture,  and  participate  in  a 
revolutionary  break  with  the  past.  This  combina¬ 
tion  of  conditions,  Moore  found,  was  sufficient 
to  produce  a  democratic  state  in  the  cases  he 
studied. 

The  analysis  of  necessity  and  sufficiency  requires 
that  cases  be  considered  on  their  own  terms,  as 
interpretable  wholes.  To  identify  and  make  sense 
of  the  bourgeoisie’s  revolutionary  role,  Moore 
conducted  a  series  of  historical  case  studies.  For 
each  case,  Moore  examined  how  various  aspects  of 
the  case  (e.g.,  the  relationships  among  the  bour¬ 
geoisie,  aristocracy,  and  peasantry  and  the  devel¬ 
opment  of  commercial  agriculture)  interacted  with 
and  influenced  each  other.  This  is  the  defining 
characteristic  of  case-oriented  research:  A  case 
constitutes  a  coherent  whole,  and  its  various 
aspects  are  understood  relationally.  Thus,  case- 
oriented  research  is  inherently  configurational. 
The  comparative  aspect  of  comparative  research, 
therefore,  is  the  comparison  of  configurations  of 
interconnected  aspects — the  cross-case  analysis  of 
configurations  of  similarities  and  differences. 

Within  the  social  sciences,  comparative  research 
is  most  commonly  associated  with  small-N  macro¬ 
comparative  research.  But  in  fact,  its  logic  may  be 
applied  at  any  level  of  analysis.  The  same  tech¬ 
niques  that  a  macrolevel  comparative  researcher 
might  use  to  examine  the  relationship  between 
economic  development  and  the  emergence  of 
democracy  could  be  used  by  a  microlevel  compara¬ 
tive  researcher  exploring  the  relationship  between 
religious  beliefs,  socioeconomic  status,  and  divorce. 
What  matters  is  not  the  number  of  cases  under 
investigation  but  that  the  researcher  make  sense  of 
these  cases  by  treating  them  as  instances  of  inter¬ 
connected  aspects,  respecting  their  coherence  and 
integrity  as  cases. 

There  is,  of  course,  always  a  trade-off  between 
the  breadth  and  the  depth  of  an  analysis,  and  it  is 
the  researcher’s  responsibility  to  determine  whether 
a  given  research  question  is  best  answered  by 
examining  fewer  cases  in  great  detail  or  more  cases 
in  less  depth.  Generally,  this  choice  is  dictated  by 
the  goals  of  the  research. 
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The  Goals  of  Comparative  Research 

Comparative  methods  can  be  used  for  either  the¬ 
ory  development  or  evaluating  hypotheses.  When 
developing  theory,  the  researcher  conducts  a  series 
of  case  studies,  strategically  selecting  cases  so  as  to 
produce  an  accumulation  of  knowledge.  For 
example,  a  specific  type  of  evidence  that  is  not 
available  in  one  case  may  prompt  the  investigator 
to  select  for  his  or  her  next  study  a  case  that  offers 
this  evidence.  Questions  raised  by  previous  case 
studies  direct  the  investigator’s  subsequent  case 
selection.  In  this  manner,  comparative  researchers 
can  develop  their  theoretical  arguments  induc¬ 
tively,  building  their  theories  from  the  bottom  up. 
This  grounded  approach  of  theory  development 
can  be  contrasted  with  a  deductive,  hypothesis¬ 
testing  approach  that  begins  with  multiple  cases. 
Here,  the  researcher  starts  with  a  preliminary 
specification  of  the  relevant  cases.  For  example,  a 
researcher  interested  in  testing  Theda  Skocpol’s 
theory  of  social  revolution  would  begin  by  identi¬ 
fying  cases  that  fall  within  her  specified  scope 
conditions,  as  explained  by  Gary  Goertz  in  his 
book  Social  Science  Concepts:  A  User’s  Guide. 
The  researcher  would  then  examine  the  character¬ 
istics  of  these  cases  to  see  if  they  are  consistent 
with  Skocpol’s  theory — namely,  that  countries 
experiencing  social  revolutions  exhibit  the  relevant 
causal  conditions  (e.g.,  state  breakdown  combined 
with  peasant  revolts  and  some  form  of  interna¬ 
tional  pressure). 

The  inductive  approach  to  comparative  research 
seeks  to  develop  an  individual  explanation  for 
each  case,  in  relative  isolation  from  other  cases.  Of 
course,  it  is  impossible  for  researchers  to  wear 
blinders  or  forget  what  they  have  already  learned, 
and  knowledge  of  one  case  will  invariably  influ¬ 
ence  one’s  understanding  of  subsequent  cases.  Nor 
would  such  a  “clean-slate”  approach  be  desir¬ 
able — the  knowledge  gained  during  the  research 
process  assists  the  researcher  in  determining  where 
to  focus  his  or  her  attention  next.  Still,  the  goal  of 
the  inductive  approach  is  to  produce  a  series  of 
individual  case  studies,  allowing  for  maximum 
diversity.  It  is  only  on  completing  the  individual 
case  studies  that  the  researcher  turns  to  the  task  of 
making  an  encompassing  comparison.  The 
researcher  examines  each  case  against  the  others, 
sometimes  by  applying  formal  techniques,  such  as 
QCA.  The  goal  of  this  cross-fertilization  of  case 


studies  is  to  identify  what  the  cases  do  and  do  not 
have  in  common.  The  researcher  uses  these  com¬ 
parisons  to  develop  an  analytic  frame  that  clarifies 
the  nature  of  the  cases  under  investigation  and 
their  relationship(s)  to  one  another.  The  construc¬ 
tion  of  this  analytic  frame  produces  a  structured 
theoretical  account  of  the  cases  that  may  be  used 
to  identify  cross-case  patterns  of  similarity  and  dif¬ 
ference  (e.g.,  different  paths  to  the  same  outcome). 

Where  the  inductive  approach  concludes  with 
the  development  of  an  analytic  frame,  the  deduc¬ 
tive  approach  begins  with  it.  Based  on  existing 
theoretical  and  substantive  knowledge,  the 
researcher  develops  an  analytic  frame  to  guide  his 
or  her  investigation  of  the  hypothesis  that  he  or  she 
intends  to  test.  Here,  think  of  an  analytic  frame  as 
akin  to  a  survey  instrument  that  “interrogates” 
each  respondent  (case)  in  a  uniform  manner.  The 
analytic  frame  directs  the  researcher’s  investiga¬ 
tion,  identifying  plausible  cases  for  inclusion  in  the 
analysis  and  telling  him  or  her  which  characteris¬ 
tics  of  the  cases  to  examine,  what  questions  to  ask, 
and  how  to  record  the  information.  Fundamental 
to  the  deductive  approach  are  the  goals  of  unbi¬ 
asedness  and  efficiency,  both  of  which  a  clearly 
defined  analytic  frame  helps  achieve.  Having  com¬ 
pleted  data  collection,  the  researcher  then  seeks  to 
assess  the  hypothesis.  As  in  inductive  comparative 
research  projects,  a  formal  technique  such  as  QCA 
may  be  used  to  simplify  the  analysis.  In  fact,  it  is 
somewhat  more  common  for  QCA  to  be  used  in 
deductive  comparative  research  than  in  inductive 
research  because  Ns  tend  to  be  larger. 

The  inductive  and  deductive  approaches  are 
combined  when  a  researcher  seeks  to  study  a  theo¬ 
retically  specified  range  of  cases.  As  with  the  deduc¬ 
tive  approach,  the  researcher  relies  on  a  preexisting 
analytic  frame  to  guide  case  selection.  Here,  the 
analytic  frame  serves  as  a  device  for  classifying 
cases  according  to  aspects  of  theoretical  or  substan¬ 
tive  interest.  For  example,  the  researcher  might 
construct  a  contingency  table  cross-tabulating  the 
level  of  development  of  countries  with  their  level  of 
democracy.  The  researcher  then  seeks  to  fill  each 
cell  of  this  table  with  at  least  one  case.  He  or  she 
can  then  subject  these  cases  to  in-depth,  inductive 
analysis,  confident  in  the  knowledge  that  important 
case  aspects  are  well  represented  in  the  study. 

Inductive  comparative  research  proceeds  in  a 
bottom-up  fashion,  accumulating  case  knowledge 
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so  as  to  generate  new  theory.  By  contrast,  deduc¬ 
tive  comparative  research  follows  a  top-down  for¬ 
mula.  Beginning  with  theory,  the  deductive  com¬ 
parative  researcher  seeks  out  cases  to  subject  to 
some  form  of  hypothesis  testing.  Of  course,  neither 
of  these  strategies  is  practiced  in  pure  form,  and 
there  are  many  examples  where  the  two  are  explic¬ 
itly  blended.  Regardless  of  the  direction  in  which 
the  research  proceeds,  however,  the  same  basic 
analytic  logic  is  employed. 

The  Logic  of  Case-Oriented  Comparison 

As  noted  previously,  comparative  research  is  fun¬ 
damentally  case  oriented.  Cases  are  conceived  as 
configurations,  and  comparison  of  these  configu¬ 
rations  is  achieved  by  categorizing  cases  into  sets. 
Thus,  the  logic  of  comparative  research  is  funda¬ 
mentally  set  theoretic.  Set-theoretic  social  research, 
based  on  Boolean  algebra,  involves  the  analysis  of 
set  relations,  especially  the  identification  of  subset 
relationships.  Most  social  theoretical  statements 
are  framed  in  terms  of  subset  relationships.  For 
example,  when  social  researchers  observe  that 
“religious  fundamentalists  are  politically  conserva¬ 
tive,”  they  are  specifying  a  subset  relationship: 
Religious  fundamentalists  constitute  a  subset  of 
political  conservatives.  The  observation  that  eco¬ 
nomically  developed  countries  are  democratic  fol¬ 
lows  the  same  pattern;  the  conjecture  is  that  the 
former  constitute  a  subset  of  the  latter. 

Subset  relationships  such  as  these — ones  in 
which  the  cause  (e.g.,  religious  fundamentalism  or 
economic  development)  is  a  subset  of  the  outcome 
(e.g.,  political  conservatism  or  democracy) — are 
consistent  with  a  relationship  of  sufficiency.  That 
is  to  say,  membership  in  the  set  of  religious  funda¬ 
mentalists  (economically  developed  countries)  is 
sufficient  for  membership  in  the  set  of  political 
conservatives  (democratic  countries).  Conditions 
of  necessity,  by  contrast,  are  identified  by  estab¬ 
lishing  that  the  outcome  is  a  subset  of  the  cause. 
Referring  again  to  Moore’s  maxim,  which  may  be 
understood  as  the  specification  of  a  subset  rela¬ 
tionship,  democratic  countries  (outcome)  consti¬ 
tute  a  subset  of  countries  with  a  bourgeois  class 
(causal  condition). 

Of  course,  subset  relationships  need  not  be  per¬ 
fect.  For  example,  there  are  certainly  some  religious 
fundamentalists  who  are  politically  liberal.  Some 


relationships,  therefore,  might  be  more  accu¬ 
rately  characterized  as  “rough”  subsets.  Ragin  has 
proposed  a  measure  of  set-theoretic  consistency 
that  assesses  the  degree  to  which  a  given  set  of  evi¬ 
dence  conforms  to  the  subset  relationship.  Clearly 
defined  subset  relationships  produce  high  consis¬ 
tency  scores;  as  consistency  drops,  the  claim  that  a 
rough  subset  relationship  exists  is  less  tenable. 

While  having  a  measure  of  set-theoretic  consis¬ 
tency  is  useful  for  assessing  subset  relationships 
empirically,  researchers  ultimately  must  rely  on 
their  theoretical  and  substantive  knowledge  (which 
includes  their  knowledge  of  the  cases  under  inves¬ 
tigation)  when  determining  whether  the  claim  that 
a  subset  relationship  exists — even  a  rough  one — is 
justified.  Social  life  is  complex,  and  perfect  subset 
relationships  are  rare,  especially  when  the  number 
of  cases  is  large.  Individuals,  for  example,  are 
notoriously  inconsistent.  Although  it  is  highly 
unlikely  that  an  individual  with  little  education 
will  achieve  a  high-prestige  occupation,  it  has  hap¬ 
pened.  Whether  such  occurrences  are  considered 
common  enough  to  challenge  the  claim  that  a 
rough  subset  relationship  exists  between  educa¬ 
tion  and  occupational  prestige  is  a  determination 
that  the  researcher  must  make.  Is  a  high  level  of 
education  “almost  always  necessary”  for  a  high- 
prestige  occupation?  Such  questions  cannot  be 
answered  through  the  rote  execution  of  a  consis¬ 
tency  calculation  but  instead  require  that  research¬ 
ers  apply  their  theoretical  knowledge  and  interpre¬ 
tive  skills. 

The  analysis  of  set-theoretic  relations  has  other 
uses,  in  addition  to  the  assessment  of  conditions  of 
necessity  and  sufficiency.  Set-theoretic  analysis  also 
facilitates  theoretical  development  by  providing  an 
empirical  basis  for  constructing  and  evaluating 
ideal  types.  Max  Weber  conceived  of  the  ideal  type 
as  a  “one-sided  exaggeration”  that  serves  to  distin¬ 
guish  a  class  of  social  phenomena.  The  usefulness 
of  the  ideal  type,  Weber  argued,  is  that  it  is  not 
designed  to  perfectly  represent  all  aspects  of  the 
phenomenon  under  investigation  but,  rather,  its 
essential  aspects.  Once  these  essential  features  have 
been  established,  it  is  possible  to  assess,  using  set- 
theoretic  methods,  the  degree  to  which  empirical 
cases  conform  to  the  ideal-typical  formulation. 

The  categorization  of  cases  according  to  their 
degree  of  memberships  in  sets  (using  fuzzy  sets)  is 
crucial  to  assessing  their  degree  of  conformity  to 
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an  ideal  type.  G0sta  Esping-Andersen’s  distinction 
among  types  of  welfare  regimes  is  a  case  in  point. 
There  are,  of  course,  no  pure  cases  of  liberalism, 
social  democracy,  or  corporatism.  In  practice, 
states  of  each  type  borrow  from  the  others.  How, 
then,  does  one  decide  on  the  proper  categorization 
of  welfare  states?  The  first  step  is  to  conceive  of 
each  welfare  regime  as  an  ideal  type  and  to  specify 
the  essential  elements  that  constitute  it  as  a  type. 
These  elements  can  then  be  formulated  as  sets,  and 
the  degree  of  membership  of  each  empirical  case  in 
each  set  can  be  assessed  (via  fuzzy-set  membership 
scores).  The  membership  of  each  empirical  case  in 
each  ideal-typical  regime  type  is  given  by  the  mini¬ 
mum  of  its  memberships  in  the  constituent  sets  for 
each  regime  type.  (Using  the  minimum  in  this  way 
is  known  as  fuzzy-set  intersection.)  The  use  of  the 
minimum  is  based  on  the  simple  idea  that  degree 
of  membership  of  an  empirical  case  in  an  ideal  type 
is  only  as  strong  as  its  weakest  link  (lowest  mem¬ 
bership  score),  a  principle  that  follows  from  the 
theoretical  notion  of  the  ideal  type  as  a  specific 
combination  or  coalescence  of  essential  elements. 
The  analysis  of  necessary  and  sufficient  conditions 
and  the  construction  of  ideal  types  are  not  neces¬ 
sarily  independent  of  one  another.  Frequently,  the 
results  of  one  analysis  will  prompt  the  researcher 
to  reconsider  the  results  of  the  other.  Whether  they 
recognize  it  or  not,  comparative  researchers  com¬ 
monly  engage  in  both  forms  of  analysis. 

Formal  Versus  Informal  Methods 
of  Comparative  Research 

Classical  studies  in  comparative  political  economy 
such  as  Barrington  Moore’s  Social  Origins  of 
Dictatorship  and  Democracy  and  Theda  Skocpol’s 
States  and  Social  Revolutions  rely  on  informal 
methods  of  comparative  analysis,  and  to  date,  most 
comparative  researchers  continue  in  this  tradition. 
It  was  with  the  publication  of  The  Comparative 
Method  in  1987  that  formal  methods  of  compara¬ 
tive  analysis  first  gained  prominence.  Informal  and 
formal  comparative  methods  share  an  underlying 
logic  and,  in  the  hands  of  a  capable  researcher,  will 
produce  the  same  general  conclusions.  The  differ¬ 
ence  is  that  formal  methods  explicitly  make  use  of 
Boolean  algebra  to  structure  the  analytic  process. 

The  logic  of  comparative  analysis  and  inductive 
inquiry  in  general  can  be  traced  to  the  work  of 


John  Stuart  Mill  and  George  Boole.  In  A  System  of 
Logic,  Mill  proposed  a  series  of  methods  of  induc¬ 
tive  reasoning  that  would  lay  the  logical  founda¬ 
tion  of  informal  methods  of  comparative  research. 
Contemporaneously,  Boole  was  developing  his 
calculus  of  truth  values.  Boolean  algebra,  as  it 
came  to  be  known,  would  develop  into  set  algebra 
and,  ultimately,  be  integrated  into  formal  methods 
of  comparative  research,  especially  QCA. 

Mill’s  Method  of  Agreement  and 
Indirect  Method  of  Difference 

The  simplest  of  all  comparative  logics  are  Mill’s 
method  of  agreement  and  his  indirect  method  of 
difference,  both  of  which  assume  that  any  given 
outcome  has  one  and  only  one  cause.  Although 
Mill  described  these  methods  in  terms  of  individ¬ 
ual  causes,  it  is  straightforward  to  extend  these 
methods  to  accommodate  combinations  of  causal 
conditions  when  it  makes  sense  to  conceptualize 
them  as  decisive  causal  conjunctions.  A  researcher 
using  the  method  of  agreement  examines  a  set  of 
cases,  each  of  which  exhibits  the  same  outcome.  If 
the  researcher  finds  that  all  the  cases  share  one  or 
more  causal  conditions,  he  or  she  can  infer  that 
this  causal  condition  (or  combination  of  causal 
conditions)  is  linked  to  the  presence  of  the  out¬ 
come.  The  indirect  method  of  difference  operates 
under  related  premises,  except  that  both  positive 
and  negative  instances  of  the  outcome  are  exam¬ 
ined.  This  method  applies  agreement  and  differ¬ 
ence  seriatim-,  for  this  reason,  it  is  often  referred  to 
as  the  joint  method  of  agreement  and  difference. 
The  researcher  first  applies  the  method  of  agree¬ 
ment  to  cases  exhibiting  the  outcome.  On  identify¬ 
ing  a  causal  condition  (or  causal  combination) 
linked  to  the  outcome,  the  researcher  examines  the 
negative  cases  to  see  if  the  absence  of  the  outcome 
is  associated  with  the  absence  of  the  cause  or 
causal  combination.  This  double  application  pro¬ 
vides  researchers  with  stronger  evidence  on  which 
to  base  their  conclusions. 

Comparative  researchers  continue  to  employ 
these  methods  today,  albeit  under  different  moni¬ 
kers.  The  method  of  agreement  is  the  basis  of  the 
most  different  systems  design,  which  is  better  con¬ 
ceived  as  a  most  different/same  outcome  design 
(MDSO).  This  alternate  labeling  makes  it  clear  that 
in  this  design,  diverse  cases  share  the  same  outcome. 
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The  researcher  attempts  to  identify  the  decisive, 
causally  relevant  conditions  shared  by  these  diverse 
cases.  The  other  popular  design  implements  the 
indirect  method  of  difference.  It  is  commonly 
known  as  most  similar  systems  design,  but  it  is  bet¬ 
ter  understood  as  a  most  similar/different  outcome 
design  (MSDO),  making  it  clear  that  there  are  both 
positive  and  negative  cases.  Ideally,  negative  cases 
should  be  matched  as  closely  as  possible  with  posi¬ 
tive  cases. 

Mill’s  methods  also  provide  the  underlying  logic 
of  the  “most  likely,”  “least  likely,”  and  “crucial” 
case  study  research  methods.  The  most  likely  case 
study  employs  cases  that  theory  predicts  should 
strongly  exhibit  the  outcome,  while  the  least  likely 
case  study  employs  cases  that  would  do  so  weakly, 
if  at  all.  Least  likely  cases  that  conform  to  predic¬ 
tions  serve  to  marshal  support  in  favor  of  a  given 
theory.  In  contrast,  most  likely  cases  that  do  not 
behave  as  predicted  cast  doubt  on  a  theory’s  valid¬ 
ity.  The  strongest  claim  is  made  by  the  crucial  case 
study,  which  asserts  that  a  case  must  exhibit  the 
outcome  if  the  theory  is  not  to  be  falsified. 

The  limitation  of  Mill’s  methods  and  derivative 
techniques  is  that  they  are  incapable  of  addressing 
various  forms  of  causal  heterogeneity,  especially 
equifinality  and  multiple  conjunctural  causation. 
Mill  himself  noted  that  his  techniques  fail  when  an 
outcome  has  more  than  one  cause.  Researchers  are 
able  to  overcome  this  deficiency  by  immersing 
themselves  in  the  cases.  This  immersion,  however, 
necessarily  restricts  the  breadth  of  analysis.  It  is 
perhaps  for  this  reason  that  comparative  research 
has  become  so  closely  associated  with  small-N 
analysis.  QCA  challenges  this  affinity  by  formaliz¬ 
ing  many  aspects  of  the  comparative  research  pro¬ 
cess  and  enables  comparative  researchers  to  address 
larger  Ns. 

Qualitative  Comparative  Analysis 

QCA  was  originally  developed  for  use  with 
binary  data  or  “crisp  sets.”  The  logic  of  QCA, 
however,  extends  to  fuzzy  sets  in  which  set  mem¬ 
bership  scores  can  be  coded  in  the  interval  from  0.0 
to  1.0.  The  present  discussion  encompasses  both 
types  of  data.  Note  that  crisp-set  analysis  is  easier 
for  novices  to  grasp,  while  fuzzy-set  analysis  pro¬ 
vides  more  nuanced  results.  There  are  four  basic 
steps  in  most  applications  of  QCA.  In  practice,  of 
course,  these  are  intimately  connected,  and 


researchers  should  view  the  research  process  as  a 
dialogue  among  four  analytic  moments.  In  other 
words,  comparative  research  is  usually  retroductive 
rather  than  predominantly  inductive  or  deductive. 

Step  1:  Calibration 

As  has  been  emphasized,  comparative  research  is 
fundamentally  set  theoretic.  The  classification  of 
cases  into  sets,  therefore,  is  of  primary  importance. 
This  classification  process  is  referred  to  as  calibra¬ 
tion.  Calibration  is  a  routine  practice  in  the  natural 
sciences.  Simply  put,  calibration  refers  to  the  prac¬ 
tice  of  adjusting  measuring  instruments  so  that  they 
conform  to  dependable,  known  standards.  The  act 
of  adjusting  one’s  watch  is  an  act  of  calibration  as 
is  checking  the  accuracy  of  a  thermometer.  While 
calibration  is  routine  in  the  natural  sciences,  most 
social  scientists  make  use  of  uncalibrated  measures 
that  merely  show  the  positions  of  cases  relative  to 
one  another.  Uncalibrated  measures,  however,  are 
clearly  inferior  to  calibrated  measures.  With  an 
uncalibrated  thermometer,  for  example,  it  is  possi¬ 
ble  to  know  that  one  object  has  a  higher  tempera¬ 
ture  than  another,  but  one  still  does  not  know 
whether  either  object  is  hot  or  cold.  This  is  the  key: 
Calibrated  measures  provide  directly  interpretable 
values.  Calibration  permits  researchers  to  classify  a 
country  as  democratic  or  autocratic,  not  merely 
more  versus  less  democratic  than  the  next  (or  the 
average). 

The  reason  that  crisp-set  analysis  is  more  acces¬ 
sible  is  because  the  calibration  process  is  more 
straightforward.  When  calibrating  crisp  sets,  the 
researcher  need  only  determine  whether  each  case  is 
part  of  the  target  set  or  not.  So,  for  example,  a  pov¬ 
erty  researcher  might  seek  to  classify  respondents  as 
“poor”  or  “not  poor.”  When  calibrating  fuzzy  sets, 
by  contrast,  the  researcher  must  determine  the 
degree  to  which  each  case  belongs  to  the  target  set. 
The  relevant  distinctions  here  might  be  between 
respondents  who  are  poor  (membership  =  1.0), 
somewhat  poor  (membership  =  0.67),  not  that 
poor  (membership  =  0.33),  and  not  poor  (member¬ 
ship  =  0).  As  with  a  calibrated  thermometer,  a 
calibrated  fuzzy  set  integrates  both  qualitative  and 
quantitative  assessments  in  a  single  instrument. 

Regardless  of  whether  the  researcher  is  calibrat¬ 
ing  crisp  or  fuzzy  sets,  the  process  demands  that  he 
or  she  carefully  consider  the  nature  of  the  target 
set — specifically,  what  the  target  set  represents  and 
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how  it  is  labeled.  In  other  words,  conceptualiza¬ 
tion  and  calibration  are  closely  intertwined.  With 
crisp  sets,  a  case  is  either  in  or  out  of  the  target  set, 
and  sometimes,  cases  must  be  force-fitted  into  cat¬ 
egories.  Alternatively,  the  researcher  may  generate 
new  crisp  categories  to  capture  wayward  cases. 
Fuzzy-set  calibration,  by  contrast,  permits  the  use 
of  finely  grained  distinctions.  When  calibrating 
fuzzy  sets,  it  is  important  for  the  researcher  to 
specify  thresholds  for  set  membership  and  non¬ 
membership  in  the  set,  as  well  as  what  is  known  as 
the  crossover  point  (fuzzy  membership  =  0.5), 
which  distinguishes  cases  that  are  “more  in”  the 
target  set  from  those  that  are  “more  out.”  The 
calibration  process  concludes  with  the  production 
of  a  data  set.  As  with  a  conventional  data  set,  cases 
occupy  the  rows.  The  columns,  however,  do  not 
represent  conventional  variables  but  instead  repre¬ 
sent  sets,  and  the  values  in  the  cells  indicate  the 
degree  to  which  each  case  belongs  to  each  set. 

Step  2:  Necessity  Analysis 

Having  generated  the  calibrated  data  set,  the 
researcher  next  tests  for  the  presence  of  necessary 
conditions.  As  described  earlier,  the  necessity  test 
takes  the  form  of  a  set-theoretic  analysis.  A  neces¬ 
sary  causal  condition  is  one  that  is  a  superset  of  the 
outcome.  Visually,  a  set  relation  consistent  with 
necessity  is  represented  by  a  scatterplot  of  two  fuzzy 
sets  (with  outcome  membership  scores  on  the  Y  axis 
and  membership  in  the  causal  condition  on  the  X 
axis)  in  which  all  cases  reside  in  the  lower-right  tri¬ 
angular  region.  With  a  calibrated  data  set,  this  situ¬ 
ation  can  easily  be  tested,  and  a  relationship  of 
necessity  can  be  established  if,  for  each  case,  the 
value  of  the  causal  condition  is  greater  than  or  equal 
to  the  value  of  the  outcome  (i.e.,  Xt  >  Y),  within  the 
bounds  of  the  consistency  threshold.  This  test  can  be 
extended  to  combinations  of  causal  conditions,  and 
software  packages  such  as  fuzzy-set  QCA  (fsQCA) 
contain  functions  to  automate  the  procedure. 

Step  3:  Truth  Table  Construction 

Truth  tables  are  used  to  assess  the  sufficiency  of 
causal  conditions,  especially  in  combinations. 
Necessary  conditions  are  excluded  from  the  truth 
table  analysis,  which  is  why  the  researcher  tests  for 
them  first.  Sufficiency  is  indicated  when  a  causal 
condition  (or,  more  commonly,  a  combination  of 


causal  conditions)  constitutes  a  consistent  subset 
of  the  outcome  and  is  mathematically  represented 
by  the  equation,  X(  <  Y;.  Again,  the  consistency 
test  is  conducted  within  the  bounds  of  the  consis¬ 
tency  threshold  established  by  the  researcher. 

Superficially,  a  truth  table  appears  similar  to  a 
conventional  data  set  in  that  it  presents  a  cases-by- 
variables  format.  But  the  rows  of  a  truth  table  are 
not  observations  as  they  are  in  a  conventional  data 
set.  Rather,  each  row  represents  a  logically  possible 
combination  of  causal  conditions  (which,  in  turn, 
can  be  seen  as  constituting  an  ideal-typical  case  or 
configuration).  Consisting  of  one  row  per  logically 
possible  combination  of  conditions,  a  truth  table 
has  2k  rows,  where  k  equals  the  number  of  causal 
conditions.  As  the  number  of  causal  conditions 
increases,  the  size  of  the  truth  table  grows  exponen¬ 
tially,  and  analysis  becomes  increasingly  complex. 
Software  packages  help  manage  this  complexity; 
nevertheless,  most  practitioners  restrict  their  analy¬ 
sis  to  between  4  and  10  causal  conditions. 

To  construct  the  truth  table,  the  researcher  refers 
to  the  calibrated  data  set  and  uses  the  consistency 
measure  to  determine  which  combinations  of  causal 
conditions  are  linked  to  the  presence  of  the  outcome 
and  which  are  linked  to  its  absence.  With  a  large 
number  of  cases  and/or  causal  conditions,  this  pro¬ 
cess  can  become  error  prone.  Software  packages 
facilitate  the  conversion  of  a  calibrated  data  set  into 
a  truth  table  and  should  be  used  for  all  but  the  sim¬ 
plest  analyses.  Researchers  should  keep  in  mind  that 
the  process  of  converting  the  calibrated  data  set  into 
a  truth  table  will  frequently  force  them  to  reconsider 
the  measurement  and  calibration  of  both  their 
causal  conditions  and  outcome.  In  many  ways,  the 
process  of  constructing  the  truth  table  is  the  heart  of 
the  comparative  research  process,  and  it  must  not 
be  approached  mechanically.  It  is  during  this  phase 
of  research  that  the  researcher  refines  and,  ulti¬ 
mately,  finalizes  his  or  her  analytic  frame.  Indeed, 
for  a  descriptive  project,  the  researcher  may  choose 
to  conclude  simply  with  a  presentation  of  the  truth 
table.  Functionally,  the  truth  table  is  a  compact  way 
of  presenting  a  multidimensional  typology. 

Step  4:  Truth  Table  Reduction 

The  analysis  of  a  truth  table  involves  a  process 
known  as  truth  table  reduction  or  Boolean  minimi¬ 
zation.  This  process  results  in  a  Boolean  equation 
that  expresses  the  various  causal  combinations  that 
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are  linked  to  the  presence  of  the  outcome. 
Researchers  interested  in  the  mechanics  of  this  pro¬ 
cess  are  directed  to  Ragin’s  three  books  on  this 
subject:  The  Comparative  Method ,  Fuzzy-Set  Social 
Science,  and  Redesigning  Social  Inquiry.  The  fol¬ 
lowing  discussion  focuses  on  the  results  of  the 
reduction  procedure.  The  truth  table  reduction 
produces  two  main  types  of  solutions,  known  as  the 
complex  and  parsimonious  solutions,  depending  on 
whether  or  not  the  researcher  chooses  to  make  use 
of  remainders.  Remainders  are  rows  of  the  truth 
table  that  lack  cases  (empirical  instances). 
Remainders  are  not  uncommon.  The  social  world  is 
characterized  by  limited  diversity,  and  many  com¬ 
binations  of  causal  conditions  simply  do  not  exist. 

Remainders  are  best  understood  as  potential 
counterfactual  cases — combinations  of  conditions 
that  do  not  exist  empirically,  at  least  not  in  the 
investigator’s  data  set,  and  are  thus  subject  to  con¬ 
jecture.  By  including  remainders  in  the  truth  table 
reduction  process,  the  researcher  is  able  to  use 
QCA  to  conduct  counterfactual  analysis.  The  most 
common  use  of  remainders  in  QCA  is  to  produce 
a  parsimonious  solution  in  addition  to  the  usual 
complex  solution.  The  parsimonious  solution 
allows  the  use  of  any  remainder,  as  along  as  its 
inclusion  in  the  solution  results  in  a  logically  sim¬ 
pler  formula  for  the  outcome.  The  complex  solu¬ 
tion  bars  the  use  of  remainders  altogether  on  the 
assumption  that  all  the  different  combinations  that 
are  linked  to  the  outcome  in  question  are  repre¬ 
sented  in  the  investigator’s  data  set. 

QCA  also  provides  researchers  the  opportunity 
to  use  their  theoretical  and  substantive  knowledge 
to  produce  intermediate  solutions.  This  knowledge 
is  input  into  the  software  via  a  simple  table  and  is 
used  to  bar  the  inclusion  of  untenable  counterfactu- 
als  from  the  solution.  In  general,  intermediate  solu¬ 
tions  are  best  because  they  more  closely  mimic  the 
use  of  counterfactual  cases  in  traditional  case-ori¬ 
ented  comparative  research.  Of  necessity,  complex 
solutions  constitute  subsets  of  intermediate  solu¬ 
tions,  and  intermediate  solutions  constitute  subsets 
of  parsimonious  solutions.  In  other  words,  the 
three  solutions  are  logically  nested,  according  to  the 
degree  to  which  each  incorporates  counterfactual 
combinations.  Complex  solutions  incorporate  no 
counterfactual  combinations  and  thus  are  often 
needlessly  complex;  parsimonious  solutions  are  the 
simplest,  but  they  often  incorporate  counterfactual 


combinations  that  contradict  existing  theory  and 
knowledge.  Intermediate  solutions,  by  contrast, 
are  based  on  counterfactual  analysis  that  is  guided 
and  constrained  by  theoretical  and  substantive 
knowledge. 

As  has  been  repeatedly  emphasized  throughout 
this  discussion,  the  process  of  comparative  research 
is  iterative.  It  is  likely  that  the  results  of  the  truth 
table  reduction  will  motivate  the  researcher  to 
revisit  one  or  more  of  the  previous  steps.  It  is  also 
quite  common  for  researchers  to  reconsider  not 
simply  the  calibration  process  but  also  whether 
they  are,  in  fact,  capturing  the  appropriate  causal 
conditions  and  outcome.  Comparative  researchers 
regularly  reflect  on  the  nature  of  their  research 
questions  and  frequently  refine  their  questions  as 
the  research  proceeds.  QCA  does  not  eliminate  the 
necessity  of  this  process.  On  the  contrary,  it  makes 
it  explicit.  Both  overly  simplistic  solutions  and 
overly  complex  ones  serve  as  warnings,  directing 
the  researcher  to  reconsider  their  solutions,  revisit 
their  evidence,  and  sometimes  reformulate  their 
analytic  frames. 

Discontinuities  Between  Comparative 
and  Variable-Oriented  Research 

It  is  best  to  understand  comparative  analysis  as  a 
distinct  methodological  approach,  one  that  can  be 
conducted  using  either  informal  or  formal  meth¬ 
ods.  Far  too  often,  QCA  is  viewed  as  a  methodol¬ 
ogy  in  and  of  itself  when,  in  fact,  it  is  a  formaliza¬ 
tion  and  extension  of  case-oriented  comparative 
research.  As  has  been  emphasized  throughout  this 
discussion,  the  comparative  approach  is  funda¬ 
mentally  case  oriented,  and  as  such,  its  tools  are 
designed  to  answer  questions  that  differ  from 
those  of  conventional  variable-oriented  research. 
Nevertheless,  the  distinctiveness  of  comparative 
research  is  often  overlooked,  with  the  result  that 
comparative  researchers  and  variable-oriented 
researchers  often  talk  past  each  other  when  dis¬ 
cussing  their  research. 

Ironically,  the  popularity  of  QCA  has,  to  some 
degree,  exacerbated  this  issue.  Just  as  the  rise  of 
desktop  statistical  software  has  made  it  easy  to 
estimate  a  poorly  designed  regression  model,  it  is 
now  straightforward  for  naive  users  to  conduct  a 
QCA  without  recognizing  the  different  assump¬ 
tions  that  they  are  taking  on  board.  This  does  not 
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refer  to  philosophical  debates  regarding  epistemo¬ 
logical  and  ontological  differences  between  case- 
oriented  and  variable-oriented  strategies.  Rather, 
there  are  substantial  differences  between  what  the 
research  tools  themselves  produce  in  terms  of 
social-scientific  representations. 

The  importance  of  these  differences  can  be  seen 
clearly  in  the  contrasts  between  how  each  perspec¬ 
tive  establishes  causality.  As  this  entry  has  dis¬ 
cussed,  comparative  researchers  establish  causal¬ 
ity  by  identifying  subset  relationships:  “No  bour¬ 
geoisie,  no  democracy.”  But  such  a  claim  does  not 
make  sense  from  the  variable-oriented  perspective. 
Subset  relationships  typically  exhibit  little  to  no 
covariation,  which  is  the  foundation  for  the  vari¬ 
able-oriented  researcher’s  causal  inference.  Sim¬ 
ilarly,  the  covariation  of  variables  is  insensitive  to 
subset  relationships — the  type  of  explicit  connec¬ 
tion  that  is  central  to  case-oriented  comparative 
analysis.  From  each  perspective,  the  other’s  claim 
to  an  empirical  basis  for  causal  inference  appears 
flawed.  Although  both  approaches  use  theory  and 
data  to  construct  representations  of  social  life, 
they  do  so  in  ways  that  involve  sharply  contrast¬ 
ing  orientations  regarding  the  consideration  of 
cases,  the  relationship  between  causal  conditions 
and  outcomes,  and  the  criteria  of  a  successful 
analysis. 

The  Consideration  of  Cases 

From  the  perspective  of  a  comparative 
researcher,  the  primary  problem  with  variable- 
oriented  methods  is  that  they  veil  cases.  When  a 
variable-oriented  researcher,  for  example,  com¬ 
putes  a  correlation  between  two  variables,  it  is  the 
variables  that  take  center  stage.  The  cases  essen¬ 
tially  disappear.  In  contrast,  it  is  the  cases  that 
take  center  stage  in  a  comparative  analysis.  It  is  a 
simple  matter,  for  example,  to  map  cases  onto  the 
rows  of  a  truth  table,  and  some  software  packages 
automate  this  procedure.  The  two  approaches  also 
differ  in  their  methods  of  case  selection.  As  previ¬ 
ously  noted,  comparative  researchers  choose  cases 
purposefully,  based  on  either  theoretical  or  sub¬ 
stantive  criteria.  Further,  the  set  of  cases  included 
in  an  investigation  may  shift  as  the  researcher 
decides  that  one  or  more  cases  do  not  “fit”  with 
the  others  or  realizes  that  an  important  type  of 
case  is  missing  from  the  analysis.  Variable-oriented 


researchers,  by  contrast,  typically  delineate  their 
samples  of  cases  at  the  outset  of  their  research. 
What  matters  most  is  that  the  cases  are  selected  in 
a  manner  that  ensures  that  the  sample  is  represen¬ 
tative  of  the  larger  population,  however  conceived 
and  defined. 

The  Connection  Between  Causal 
Conditions  and  Outcomes 

Variable-oriented  research  is  usually  concerned 
with  explaining  variation  in  the  values  of  a  depen¬ 
dent  variable — an  outcome  that  varies  across  cases. 
The  goal  of  such  research  is  to  explain  how  changes 
in  the  values  of  independent  variables  affect  the 
value  of  the  dependent  variable.  Techniques  such 
as  linear  regression  accomplish  this  by  identifying 
the  net  contribution  of  each  independent  variable. 
Comparative  research,  in  contrast,  is  concerned 
with  how  causal  conditions  combine  to  produce  a 
particular  outcome.  The  solutions  of  a  QCA  appli¬ 
cation,  for  example,  represent  different  paths  or 
recipes  for  a  given  outcome. 

In  a  variable-oriented  study,  causation  is  inferred 
from  patterns  of  covariation.  Usually,  the  researcher 
measures  the  relative  strength  of  several  causal 
variables  simultaneously.  Frequently,  the  goal  is  to 
assess  the  relative  strength  of  competing  theories 
by  determining  which  independent  variable  (or  set 
of  independent  variables)  explains  the  most  varia¬ 
tion  in  the  outcome  variable.  In  other  instances, 
the  goal  may  be  simply  to  appraise  the  relative 
importance  of  the  different  independent  variables 
or  simply  to  establish  that  the  causal  impact  of  a 
theoretically  important  variable  is  significant  and 
can  be  estimated.  In  virtually  all  types  of  variable- 
oriented  analysis,  variables  compete  with  one 
another  to  explain  variation.  The  moniker  of  inde¬ 
pendent  is  important  here.  Each  independent  vari¬ 
able  is  considered  on  its  own  terms,  capable  of 
affecting  the  outcome  variable  regardless  of  the 
presence  or  level  of  the  other  causal  variables.  The 
comparative  researcher,  by  contrast,  does  not 
assume  that  causal  conditions  are  independent  of 
one  another.  Independence  is  the  exception  rather 
than  the  rule,  and  causation  is  understood  to  be 
conjunctural  in  nature.  The  goal  of  the  investiga¬ 
tion  is  to  identify  combinations  of  causal  condi¬ 
tions  that  are  shared  by  sets  of  cases  and,  in  tan¬ 
dem,  produce  the  outcome. 
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The  Criteria  of  a  Successful  Analysis 

It  is  useful  as  well  to  contrast  the  different  criteria 
used  to  judge  a  successful  conclusion  of  a  study.  For 
the  variable-oriented  researcher,  the  analysis  is  suc¬ 
cessful  when  a  theory  can  explain  why  changes  in 
the  values  of  the  independent  variables  affect  the 
value  of  the  dependent  variable  in  a  particular  man¬ 
ner.  For  the  comparative  researcher,  by  contrast,  a 
successful  conclusion  is  one  in  which  the  various 
paths  to  the  outcome  are  clearly  and  convincingly 
articulated  and  cases  can  be  associated  with  the  dif¬ 
ferent  paths.  In  short,  to  justify  their  conclusions, 
comparative  researchers  point  to  their  cases,  while 
variable-oriented  researchers  point  to  theory. 

From  the  perspective  of  variable-oriented 
researchers,  comparative  research  is  replete  with 
analytic  sins  and  errors.  Samples  are  purposively 
constructed,  and  their  sizes  are  small.  Researchers 
may  add  or  drop  cases  at  any  stage  of  the  analysis. 
Important  causal  factors  do  not  vary.  In  a  variable- 
oriented  analysis,  any  of  these  situations  would 
undermine  attempts  at  causal  inference.  Likewise, 
common  practices  in  variable-oriented  research 
would  seriously  undermine  the  validity  of  the  con¬ 
clusions  when  viewed  from  the  perspective  of  com¬ 
parative  research.  There  are  too  many  cases  for  an 
in-depth  analysis  to  be  conducted.  A  fixed  sam¬ 
pling  frame  assumes  a  level  of  comparability 
among  cases  that  may  be  unwarranted.  And  the 
focus  on  independent  effects  of  variables  violates 
the  configurational  assumption  of  comparative 
analysis.  Comparative  research  is  fundamentally 
case  oriented,  and  consequently,  its  practices  fun¬ 
damentally  clash  with  those  of  variable-oriented 
research.  It  is  therefore  not  surprising  that  research¬ 
ers  frequently  talk  past  one  another. 

Current  Developments  in  Comparative 
Research  Methods 

The  use  of  informal  methods  of  comparative 
research  is,  of  course,  well  established  among 
social  researchers,  as  is  the  use  of  crisp-set  QCA. 
The  extension  of  QCA  to  encompass  fuzzy-set 
analysis  has  been  welcomed  by  researchers, 
although  there  is  a  lag  between  new  developments 
in  the  approach  and  their  adoption  by  researchers. 
For  example,  the  inclusion  algorithm  (using  the 
principle  of  set  inclusion)  described  by  Ragin  in 
Fuzzy-Set  Social  Science  has  been  superseded  by  a 


hybrid  algorithm  that  uses  truth  tables  to  synthe¬ 
size  the  results  of  the  analysis  of  fuzzy-set  relation¬ 
ships.  Although  the  transition  to  the  truth  table 
algorithm  has  been  widely  documented,  a  number 
of  researchers  continue  to  rely  on  the  procedures 
discussed  in  the  2000  text.  The  publication  of 
Ragin’s  Redesigning  Social  Inquiry  establishes  the 
superiority  of  the  truth  table  algorithm. 

The  popularity  of  QCA  has  encouraged  a  num¬ 
ber  of  researchers  to  propose  various  extensions  to 
the  method.  Many  of  these  proposed  extensions 
can  be  found  in  working  papers  at  the  COMPASSS 
website,  which  serves  as  a  hub  for  researchers  inter¬ 
ested  in  the  development  of  formal  methods  of 
comparative  analysis.  Although  a  number  of  these 
proposals  aim  to  incorporate  variable-oriented 
techniques  into  QCA,  such  proposals  have  not 
gained  much  traction.  The  most  promising  of  these 
extensions  are  those  arising  from  or  inspired  by  the 
concerns  of  traditional  small-N  researchers. 
Particularly  promising  is  the  application  of  QCA  to 
the  analysis  of  scope  conditions,  which  are  used  to 
distinguish  between  cases  that  do  and  do  not  apply 
to  a  given  theory.  Also  promising  is  the  application 
of  QCA  to  possibility  analysis.  A  relatively  recent 
development,  the  analysis  of  possibility  seeks  to 
identify  those  cases  that  are  within  a  theory’s  scope 
but  for  which  the  outcome  is  nevertheless  impossi¬ 
ble  or  virtually  impossible.  The  question  of  whether 
an  individual  born  into  poverty  can  acquire  a  high- 
status  occupation,  for  example,  is  a  question  of 
possibility:  What  conditions  serve  to  make  social 
mobility  possible?  The  appeal  of  extending  QCA  to 
the  study  of  scope  and  possibility  is  easy  to  see: 
Such  questions  are  set  theoretic  in  nature.  They  are 
theoretically  important  questions  but  outside  the 
grasp  of  conventional  quantitative  methods. 

The  popularity  of  multimethod  research  sug¬ 
gests  that  researchers  will  increasingly  seek  to  use 
QCA  in  tandem  with  other  analytic  methods.  Such 
projects  have  proved  to  be  most  successful  when 
QCA  is  combined  with  both  case-oriented  and 
variable-oriented  techniques.  The  combination  of 
QCA  and  techniques  of  social  network  analysis 
have  also  proved  to  be  particularly  fruitful,  and 
development  in  this  direction  is  expected  to  con¬ 
tinue.  Initiatives  to  implement  the  QCA  techniques 
in  statistical  environments  such  as  R  and  Stata 
have  begun.  By  and  large,  these  translation  efforts 
are  incomplete,  and  the  canonical  package  remains 
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Ragin’s  fsQCA.  Cronqvist’s  Tools  for  Small-N 
Analysis  (TOSMANA)  is  also  popular.  Although 
lacking  fuzzy-set  capabilities  and  procedures  for 
counterfactual  analysis  and  intermediate  solutions, 
TOSMANA  includes  a  number  of  attractive  fea¬ 
tures  such  as  a  threshold-setting  tool — to  aid  the 
construction  of  crisp  and  multivalue  sets — and  the 
ability  to  graphically  represent  Boolean  solutions 
as  Venn  diagrams. 

The  popularity  of  formal  comparative  methods 
is  expected  to  continue  to  grow.  There  are  two 
reasons  for  this.  First,  QCA  permits  comparative 
researchers  to  include  more  cases  in  a  given  study. 
Research  projects  previously  too  complex  to  con¬ 
duct  due  to  the  limitations  of  informal  compara¬ 
tive  methods  are  now  feasible.  Second,  and  per¬ 
haps  more  important,  the  process  of  formalizing 
the  comparative  approach  served  to  highlight  the 
fact  that  despite  the  popularity  of  variable-oriented 
methods,  most  social  theory  is  actually  set  theo¬ 
retic  in  nature.  Many  research  questions  are  actu¬ 
ally  more  appropriately  answered  using  compara¬ 
tive  methods,  because  of  their  set-theoretic  nature, 
than  variable-oriented  ones.  Researchers  who  pre¬ 
viously  would  have  turned  to  variable-oriented 
techniques  are  more  likely  today  to  recognize  the 
value  of  comparative  methods. 

Charles  C.  Ragin  and  Claude  Rubinson 
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Tucson,  Arizona,  United  States 
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Comparative  politics  is  a  subdiscipline  of  political 
science.  The  goal  of  political  science  is  to  promote 
the  comparison  of  different  political  entities,  and 
comparative  politics  is  the  study  of  domestic  poli¬ 
tics  within  states.  It  differs  from  the  other  subdisci¬ 
pline  of  political  science — international  relations — 
which  instead  focuses  on  politics  between  states. 
Traditionally,  it  has  been  assumed  that  whereas 
comparative  politics  studies  politics  in  contexts 
where  there  is  an  ordering  principle  (the  sovereign 
state),  international  relations,  instead,  studies  poli¬ 
tics  in  contexts  without  such  a  principle  (the  inter¬ 
national  system).  The  first  is  interested  in  studying 
politics  in  stable  domestic  contexts,  the  second  in 
studying  politics  in  unstable,  extradomestic  con¬ 
texts.  The  first  has  concerned  itself  with  studying 
order  (because  it  is  guaranteed  by  the  sovereignty 
of  the  state),  the  second  with  studying  disorder  (an 
outcome  of  the  anarchy  of  the  relations  between 
states).  Some  have  questioned  whether  such  a  dis¬ 
tinction  between  these  subdisciplines  is  still  plausi¬ 
ble  at  the  beginning  of  the  21st  century. 

This  entry  is  divided  into  four  parts,  beginning 
with  an  analysis  of  the  main  methods  of  compara¬ 
tive  politics.  The  second  part  discusses  the  main 
theories  of  comparative  politics,  and  the  third  part 
identifies  some  of  the  issues  of  comparative  politics 
investigated  in  the  various  regions  of  the  world. 
The  entry  concludes  with  a  discussion  of  the  future 
of  comparative  politics  in  a  globalized  world. 

Methods  in  Comparative  Politics 

Although  comparative  politics  is  defined  primarily 
by  the  phenomena  it  researches,  it  is  also  character¬ 
ized  by  the  method  employed  in  that  research.  One 
cannot  engage  in  comparative  analysis  without  a 
method  for  comparing;  a  method  is  necessary  for 
testing  empirical  hypotheses  about  relations 
between  variables  in  different  cases.  Such  hypothe¬ 
ses  concern  the  relations  between  the  variables  that 
are  held  to  structure  the  political  phenomenon  one 
wishes  to  study,  investigate,  or  interpret.  In  general, 
political  research  aims  to  understand  the  links 
between  the  dependent  variable  (the  outcome  of  the 
process  one  wants  to  explain),  the  independent 
variable  (the  structure,  the  context,  the  cognitive 


frame  within  which  this  process  unfolds),  and  the 
intervening  variable  (the  factor  that  exerts  an 
influence  on  that  given  process).  As  Hans  Keman 
observed,  the  question  to  understand  is  which 
independent  variables  can  account  for  the  varia¬ 
tion  of  the  dependent  variable  across  different 
political  systems. 

Political  science  employs  several  methods — four 
in  particular — to  check  its  research  process  and 
falsify  its  results:  the  experimental  method,  the 
statistical  method,  the  case  study,  and  the  com¬ 
parative  method. 

The  Experimental  Method 

The  experimental  method  can  rarely  be 
employed  by  political  scientists.  In  contrast  to  the 
natural  scientist,  the  political  scientist  cannot  hope 
to  study  politics  in  a  laboratory,  in  which  the  inter¬ 
vening  factors  can  be  kept  constant  so  that  the 
causal  effects  of  the  independent  variable  on  the 
dependent  variable  can  be  reconstructed.  Politics  is 
rather  more  complex  than  what  is  studied  in  a  sci¬ 
ence  laboratory,  above  all  because  it  involves  fac¬ 
tors  that  cannot  be  isolated  and  is  structured  by 
interactions  that  cannot  be  separated.  Moreover, 
whereas  in  natural  science  the  objects  of  study  may 
be  inanimate,  the  same  cannot  be  said  of  politics 
and  of  those  who  engage  in  it.  Politics  is  activated 
by  actors  (heads  of  governments,  ministers,  mem¬ 
bers  of  parliament,  party  leaders  and  activists, 
members  of  movements  and  associations,  and  citi¬ 
zens)  who  continually  learn  from  their  experience, 
thus  modifying  their  behavior,  even  in  the  absence 
of  variation  in  the  independent  variables. 

The  Statistical  Method 

The  statistical  method,  however,  is  ever  more 
widely  employed  by  political  scientists.  It  obvi¬ 
ously  presupposes  the  availability  of  numerical 
data.  Those  data  are  the  product  of  a  standardized 
process  of  quantitative  measurement  of  aspects  of 
political  life — standardized  because  the  same  crite¬ 
ria  of  measurement  can  (and  should)  be  used  in 
different  contexts.  It  goes  without  saying  that,  to 
be  effective,  this  method  requires  a  large  number  of 
quantitative  and  reliable  data.  However,  whereas 
the  experimental  method  presupposes  the  existence 
of  a  cause-and-effect  relation,  the  statistical  method 
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does  not.  It  provides  information,  but  as  such,  it 
does  not  suggest  an  interpretation.  Thus,  without 
the  support  of  a  theory  concerning  relations 
between  variables  (measured  quantitatively),  the 
statistical  method  is  destined  to  be  of  little  use  in 
scientific  research.  Moreover,  when  using  this 
method,  one  is  forced  to  neglect  factors  that  are 
not  easily  quantifiable,  such  as  cultural  factors. 

The  Case  Study  Method 

The  analysis  of  the  idiosyncratic  factors  men¬ 
tioned  above,  instead,  constitutes  the  object  of  the 
third  method  at  the  disposal  of  political  research: 
the  case  study.  This  method  has  also  characterized 
historical  research  and  is  designed  to  collect  the 
largest  possible  amount  of  information  (quantita¬ 
tive  and  qualitative)  on  a  specific  country  (or  other 
political  entity).  This  method  is  also  referred  to  as 
ideographic.  A  case  study  may  have  a  purely 
descriptive  purpose  or,  instead,  may  have  an  inter¬ 
pretative  goal  or  even  be  designed  to  generate 
hypotheses  (if  not  theories)  susceptible  to  general¬ 
ization  (e.g.,  Alexis  de  Tocqueville’s  study  of  U.S. 
democracy,  from  which  several  hypotheses  as  well 
as  a  theory  on  the  tendencies  of  Western  democra¬ 
cies  have  been  derived).  This  method  is  largely  used 
in  the  United  States,  as  is  evident  even  from  the 
comparative  politics  section  of  the  book  reviews  in 
the  American  Political  Science  Association  journal 
Perspectives  on  Politics,  where  there  is  an  abun¬ 
dance  of  studies  on  single  countries.  This  is  not  so 
(or  rather  less  so)  in  Europe,  where  comparison 
generally  involves  the  study  of  several  cases. 

The  Comparative  Method 

There  can  be  no  doubt  that  the  main  method  at 
the  disposal  of  political  science  is  the  comparative 


method.  In  a  now-classic  1970  essay,  Giovanni 
Sartori  pointed  out  that  all  political  science  pre¬ 
supposes,  even  if  implicitly,  a  comparative  frame 
of  reference.  The  same  author  would  write  repeat¬ 
edly  that  he  who  knows  only  one  thing  knows 
nothing  at  all.  The  comparative  method  requires, 
first,  that  the  object  to  be  compared  be  defined, 
next  that  the  units  used  to  compare  and  the  time 
period  to  which  the  comparison  refers  be  identi¬ 
fied,  and,  finally,  that  the  properties  of  those  units 
be  specified.  Whereas  the  statistical  method  pre¬ 
supposes  that  the  variables  of  many  cases  can  be 
quantified,  the  comparative  method  is  generally 
applied  to  a  more  limited  number  of  cases.  These 
cases  may  be  very  similar — that  is,  the  strategy  of 
the  most  similar  research  design — or  they  may  be 
very  dissimilar — the  strategy  of  the  most  dissimilar 
research  design.  The  first  strategy  allows  more  in- 
depth  comparisons,  whereas  the  second  yields 
broader  comparisons.  The  choice  of  strategy 
depends  on  the  purpose  of  the  study  (see  Table  1). 

Comparative  politics  has  become  increasingly 
identified  with  the  comparative  method,  to  the 
extent  that  that  method  has  become  the  defining 
characteristic  of  the  academic  discipline.  It  has 
permitted  the  construction  of  a  particular  type  of 
scientific  explanation  based  on  correlations,  as  it 
assesses  the  validity  of  an  explanation  by  assuming 
a  correspondence  between  the  properties  of  the 
independent  variables  and  those  of  the  dependent 
variables.  But  how  and  where  do  such  correlations 
operate?  The  comparative  method  does  not  pro¬ 
vide  a  clear  answer.  Regarding  the  how,  the  cor¬ 
relations  operate  if  there  are  actors  that  activate 
them.  How  then  should  political  agency  be  con¬ 
ceptualized?  There  is  no  univocal  answer  to  this 
question.  Regarding  the  where,  the  comparative 
method  has  to  rely  on  few  cases.  The  method  of 
correlations  has  proved  to  be  satisfactory  when 


Table  I  Methods  of  Comparative  Analysis 


Method 

Strength 

Weakness 

Experimental  method 

(Few  applications) 

(Few  applications) 

Statistical  method 

Extensive  information 

Little  interpretation 

Case  study 

In-depth  knowledge 

Generic  comparison 

Comparative  method 

Hypothesis  testing 

Limited  application 
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those  cases  are  not  only  limited  but  also  homoge¬ 
neous  (e.g.,  established  democracies,  post-Soviet 
democratizing  countries).  But  if  one  wishes  to 
broaden  the  perspective  by  comparing  many  non- 
homogeneous  cases  (e.g.,  democracies,  nondemoc¬ 
racies,  and  emerging  democracies),  what  happens 
to  a  method  based  on  correlations  between  uni¬ 
form  variables?  It  is  no  coincidence  that  political 
scientists  have  “regional”  knowledge  that  cannot 
be  easily  applied  in  other  “regions.” 

Conclusions 

Certainly  the  buoyant  growth  of  available  data 
on  the  various  political  systems  of  the  world  has 
permitted  complementing  the  statistical  method 
with  the  comparative  one,  thus  arriving  at  some 
classificatory  systems  that  manage  to  comprise 
more  cases  belonging  to  different  “regions,” 
thereby  making  the  analyses  less  Western  centric. 
Yet  the  problem  raised  years  ago  by  Sartori 
remains  unsolved.  Sartori  argued  that  empirical 
concepts  are  subject  to  a  sort  of  trade-off  between 
their  extension  and  their  intensity.  If  a  concept  is 
applied  to  a  large  number  of  cases  (extension),  it 
will  display  only  a  limited  ability  to  generate  valid 
explanations  for  each  case  (intensity),  and  vice 
versa.  When  concepts  are  so  abstract  that  they  can 
be  applied  to  the  entire  world,  then,  analytical 
vagueness  is  inevitable. 

Comparison  is  a  method  used  to  test  research 
hypotheses.  The  comparative  method  is  generally 
employed  by  scholars  of  comparative  politics, 
although  it  is  increasingly  complemented  by  the 
statistical  method.  The  end  of  the  Cold  War  in  the 
early  1990s  has  brought  about  an  enormous  growth 
in  the  number  of  formally  sovereign  countries. 
Simultaneously,  the  available  information  resources 
(e.g.,  databases)  also  grew  significantly,  thus  mak¬ 
ing  it  possible  to  have  recourse  to  statistical  com¬ 
parisons.  Yet  a  method  is  a  tool  at  the  disposal  of 
researchers  who  take  a  scientific  approach.  It  is  a 
way  of  organizing  the  conceptual  relations  between 
the  factors  that  are  thought  to  structure  the  problem 
under  investigation.  Accordingly,  there  is  a  need  for 
a  theory  that  identifies  those  relations  on  the  basis 
of  a  logically  consistent  argument.  The  theory 
informs  us  what  to  research  and  why.  Bereft  of  a 
theoretical  orientation,  even  the  most  sophisticated 
method  provides  only  information  or  description. 


How  to  use  or  interpret  the  data  depends  on  the 
theory. 

Theories  of  Comparative  Politics 

There  are  many  theories  in  comparative  politics. 
The  main  theories  generally  have  a  focus  on  institu¬ 
tions;  they  are  variations  of  the  institutionalist 
approach.  Institutionalism  not  only  constitutes  the 
main  branch  of  the  theories  of  comparative  politics 
but  also  stands  at  the  origin  of  political  science  as  a 
whole.  Without  harking  back  to  Aristotle,  the 
genealogical  tree  of  comparative  politics  had  its 
roots  at  the  beginning  of  the  20th  century  in  the 
sociological  constitutionalism  of  scholars  such  as 
Max  Weber  and  Vilfredo  Pareto  and  the  legal  con¬ 
stitutionalism  of  scholars  such  as  James  Bryce  and 
Woodrow  Wilson.  From  there  developed,  after 
World  War  II,  the  comparative  historical  sociology 
of  scholars  such  as  Stein  Rokkan  and  Harry  Eckstein 
as  well  as  the  comparative  political  science  of  schol¬ 
ars  such  as  Robert  A.  Dahl,  Samuel  P.  Huntington, 
and  Giovanni  Sartori.  These  roots  obviously  have 
not  prevented  the  subsequent  emergence  of  nonin¬ 
stitutionalist  developments,  frequently  deriving 
from  political  scientists’  use  of  theoretical  constructs 
from  other  social  sciences.  Examples  include  behav- 
ioralism  (derived  from  social  psychology),  structural 
functionalism  (derived  from  anthropology  and  sub¬ 
sequently  from  sociology),  and  systems  theory 
(derived  from  the  new  cybernetic  sciences). 

All  these  new  developments  had  to  live  along¬ 
side  the  institutionalist  approach  (or  “old”  institu¬ 
tionalism),  which  never  ceased  to  exert  its  influ¬ 
ence  on  the  comparative  political  research  of  the 
post-World  War  II  period.  Since  the  1980s,  the  old 
institutionalism  has  been  superseded  by  theoretical 
developments  that  have  merged  in  the  new  institu¬ 
tionalism — new  because  it  is  distinct  from  its  pre¬ 
decessor  owing  to  its  nonformalistic  vision  of 
institutions  and  norms.  As  noted  by  Ira  Katznelson 
and  Helen  V.  Milner,  there  are  significant  differ¬ 
ences  within  the  new  or  neoinstitutionalism.  Some 
are  microlevel  institutionalist  theories,  as  they  take 
their  point  of  departure  from  the  preferences  or 
interests  of  individual  political  actors  or  collective 
political  actors  understood  as  unitary.  Others, 
instead,  are  macrolevel  institutionalist  theories,  as 
they  depart  from  supraindividual  aggregates.  Still 
others  are  mesolevel  institutionalist  theories,  as 
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they  study  the  cognitive  interactions  between 
determined  (but  not  limited)  institutional  configu¬ 
rations  as  well  as  the  actors  that  operate  within 
them.  Whereas  the  macrolevel  and  mesolevel 
approaches  treat  the  institutions  (i.e.,  their  struc¬ 
tural  characteristics  or  their  cultural  codes)  as  the 
independent  variable,  in  the  microlevel  theories, 
this  role  is  played  instead  by  the  preferences  or  the 
interests  of  the  individual  actors. 

Rational  Choice  Institutionalism 

The  Theoretical  Apparatus 

The  main  microlevel  theory  of  comparative  poli¬ 
tics  is  rational  choice  institutionalism.  It  constitutes 
an  adaptation  to  political  science  of  theories  devel¬ 
oped  in  the  field  of  economics.  The  scholars  who 
have  most  influenced  the  rational  choice  theorists 
are  Anthony  Downs,  William  Riker,  Mancur  Olson, 
and  Douglass  North.  The  point  of  reference  for 
rational  choice  theorists  is  the  neoclassical  eco¬ 
nomic  model,  which  conceptualizes  collective  action 
as  the  outcome  of  the  behavior  of  individuals  aim¬ 
ing  to  maximize  their  own  utility  (i.e.,  their  own 
egoistic  interests).  Similarly,  the  political  theory  of 
rational  choice  postulates  that  the  individuals  who 
participate  in  politics  are  rational  actors,  acting  on 
the  basis  of  strategic  considerations  in  order  to 
maximize  their  utility.  The  actors’  preferences  are 
formed  outside  the  political  process.  They  are  exog¬ 
enous  with  respect  to  the  interaction  between  the 
actors.  Accordingly,  individual  preferences  can  be 
treated  as  the  independent  variable  in  this  approach. 

The  basic  unit  of  analysis  of  the  rationalist  theo¬ 
ries  is  the  actor.  Politics  is  a  game  between  indi¬ 
vidual  actors  or  between  collective  actors  under¬ 
stood  as  unitary  subjects.  As  Riker  has  argued  in 
many  writings,  the  aim  of  this  theory  is  to  explain 
how  collective  action  emerges  in  a  multiactor  game 
and  to  examine  the  microlevel  foundations  of  pro¬ 
cesses  that  give  rise  to  macrolevel  effects.  This  does 
not  imply  that  the  macrolevel  effects  are  necessar¬ 
ily  rational  (on  the  social  level).  Rather,  as  pointed 
out  by  Margaret  Levi,  a  scholar  working  within 
this  approach,  collective  action  may  be  irrational 
even  though  individuals  act  in  a  rational  way, 
unless  they  are  subjected  to  the  constraints  of  spe¬ 
cific  rules  in  the  pursuit  of  their  interests. 

Institutions  matter  because  they  make  collective 
action  possible  by  lowering  the  transaction  costs 


between  the  actors,  by  furnishing  reliable  informa¬ 
tion  on  the  rules  of  the  transaction  itself,  and  by 
sanctioning  free  riding,  thus  making  individual 
behavior  predictable.  Rational  choice  institutional¬ 
ism  assumes  that  institutions  are  necessary  because 
they  make  possible  virtuous  interaction  (i.e.,  coop¬ 
eration)  between  the  actors.  Without  the  rules 
guaranteed  and  promoted  by  institutions,  the 
game  would  become  uncooperative.  Institutional 
equilibrium  obtains  when  none  of  the  actors  has 
an  incentive  to  question  the  status  quo  because  no 
actor  is  able  to  establish  whether  a  more  satisfac¬ 
tory  equilibrium  may  result  from  doing  so.  Such 
equilibriums  are  defined  as  Pareto  optimal,  with 
reference  to  the  well-known  Italian  economist  and 
sociologist  Vilfredo  Pareto  (1848-1923).  For  ratio¬ 
nal  choice  theory,  as  it  is  employed  in  comparative 
politics,  institutions  and  norms  are  intervening 
variables — factors  that  intervene  from  the  outside 
to  regulate  the  interactions  between  individual 
actors. 

Rational  choice  institutionalism  is  a  testable 
theory  because,  once  the  institutional  framework 
that  regulates  behavior  is  known,  it  can  yield  falsi- 
fiable  statements.  It  establishes  a  correlation  of 
events  identifying  the  causal  mechanism  that  links 
the  independent  with  the  dependent  variables.  By 
assuming  that  individual  behavior  (as  well  as  the 
behavior  of  collective  actors  whose  actions  are 
taken  to  be  unitary),  in  any  context  whatsoever,  is 
driven  by  the  maximization  of  individual  utility, 
rational  choice  theory  is  able  to  claim  that  its  ana¬ 
lytical  framework  has  universal  validity.  Moreover, 
this  scientific  claim  is  supported  by  quantifying 
variables  so  that  the  rational  choice  theory  can  be 
expressed  in  a  formal  mathematical  language. 
Indeed,  according  to  these  theorists,  political  sci¬ 
ence  may  legitimately  compete  with  economics  as 
long  as  it  adopts  the  highly  formalized  analytical 
apparatus  characteristic  of  the  latter.  Hence,  ratio¬ 
nal  choice  theory  derives  its  extraordinary  aca¬ 
demic  and  scientific  success,  especially  among  the 
community  of  U.S.  political  scientists,  where  it  still 
sustains  a  strong  hegemony  of  neoclassical  eco¬ 
nomics  within  the  social  sciences. 

Criticisms 

Rational  choice  theory  has  not  been  spared 
criticism.  First,  it  has  been  pointed  out  that  an 
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analysis  that  assumes  institutions  to  be  ahistorical 
entities  ends  up  being  overly  abstract.  Institutions 
are  not  aseptic  rules  but  structures  that  are  perme¬ 
able  to  history.  The  same  institutional  structure 
may  produce  different  effects  in  different  historical 
periods.  Second,  it  has  also  been  shown  that  insti¬ 
tutions  constitute  coherent  and  interconnected 
agglomerates  of  regulative  structures.  As  a  result, 
it  is  hardly  plausible  to  examine  a  given  institu¬ 
tional  variable  in  isolation,  assuming  that  the  other 
institutional  variables  will  remain  constant.  In 
reality,  there  is  a  reciprocal  interaction  between 
them,  since  the  institutional  variable  considered 
may  produce  different  effects  if  the  variables  with 
which  it  is  linked  are  subjected  to  specific  and 
unforeseen  influences.  Third,  it  has  been  argued 
that  rational  choice  institutionalism  runs  the  risk 
of  turning  into  a  remodeled  functionalist  theory,  to 
the  extent  that  it  explains  the  existence  of  an  insti¬ 
tution  with  reference  to  the  effects  that  it  produces. 

Fourth,  it  has  been  shown  how  an  intentionalist 
conception  of  human  agency  lurks  behind  the 
theory  of  rational  choice  because  it  assumes  that 
the  process  of  creating  an  institution  is  inten¬ 
tional — that  is,  controlled  by  actors  who  correctly 
perceive  the  effects  of  the  institutions  created  by 
them.  Indeed,  it  is  rarely  the  case  that  the  actors 
intend  to  create  institutions  or  that  the  actors  cor¬ 
rectly  foresee  the  future  impact  of  institutions. 
Moreover,  rational  choice  institutionalists  con¬ 
ceive  the  formation  of  institutions  as  a  quasi-con- 
tractual  process  marked  by  voluntary  agreements 
between  relatively  equal  and  independent  actors, 
almost  as  if  they  still  found  themselves  in  a  state  of 
nature.  Again,  this  is  rarely  the  case.  The  elegance 
of  the  formal  theory  is  insufficient  to  compensate 
for  the  weakness  of  the  hypotheses  of  rational 
choice  institutionalism  concerning  the  driving 
forces  of  individual  choice  and  the  persistence  of 
institutions. 

Rational  choice  has  been  shown  to  be  convinc¬ 
ing  in  the  analysis  of  microphenomena  (e.g.,  why  a 
certain  decision  was  made  in  a  committee  of  the 
U.S.  Congress  or  why  the  negotiations  within  the 
intergovernmental  conference  of  the  European 
heads  of  state  and  government  led  to  a  certain  out¬ 
come),  but  it  is  less  successful  in  analyzing  macro¬ 
level  processes  (e.g.,  why  the  U.S.  Congress  has 
established  itself  as  the  most  powerful  legislature 
of  the  democratic  world  or  why  the  process  of 


European  integration  has  given  rise  to  a  political 
system  unforeseen  by  the  states  that  had  launched 
this  process).  Rational  choice  theory  is  generally  of 
little  use  when  the  number  of  actors  involved  in  the 
phenomenon  under  examination  is  large  and  when 
the  timeframe  is  extended.  It  is  certainly  true  that 
rational  choice  theory  has  positively  contributed  to 
raising  the  degree  of  formalization  of  political  sci¬ 
ence.  Yet  as  Barbara  Geddes  points  out,  it  is 
equally  beyond  doubt  that  rational  choice  scholars 
have  focused  only  on  those  phenomena  that  would 
allow  such  formalization.  As  a  result,  rational 
choice  scholars  have  ended  up  studying  a  problem 
not  because  of  its  political  relevance  but  of  its  abil¬ 
ity  to  be  formalized. 

Historical  Institutionalism 

The  Theoretical  Apparatus 

The  main  macrolevel  theory  of  comparative 
politics  is  historical  institutionalism.  It  has  been 
developed  by  scholars  such  as  Paul  Pierson,  Theda 
Skocpol,  and  Kathleen  Thelen  among  others, 
elaborating  the  rich  tradition  of  the  historical 
social  sciences  of  the  1950s  and  1960s,  represented 
by  Barrington  Moore  Jr.,  Reinhard  Bendix,  and 
Seymour  M.  Lipset.  It  differs  from  rational  choice 
institutionalism  to  the  extent  that  it  understands 
institutions  not  simply  as  arrangements  that  serve 
to  regulate  an  interactive  game  but  as  historical 
structures  that  have  origins  and  develop  indepen¬ 
dently  from  those  that  operate  within  them. 
Moreover,  whereas  the  analytical  focus  of  rational 
choice  institutionalism  is  on  the  actor,  the  analyti¬ 
cal  focus  of  historical  institutionalism  lies  instead 
on  institutional  structures  and  their  evolution  over 
time. 

Historical  institutionalists  analyze  institutional 
and  organizational  configurations  rather  than  sin¬ 
gle  institutions  in  isolation,  and  they  pay  attention 
to  processes  of  long  duration.  They  show  how 
general  contexts  and  interactor  processes  give 
shape  to  the  units  that  organize  the  political  pro¬ 
cess.  For  them,  time  is  a  crucial  intervening  vari¬ 
able  in  explanations  of  specific  outcomes.  The  aim 
of  the  analysis  is  to  establish  the  sequences  and  the 
variations  of  scale  and  time  that  characterize  a 
given  political  process.  One  of  the  fundamental 
concepts  of  historical  institutionalism  is  path 
dependence.  The  theory  of  path  dependence  argues 
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that,  in  politics,  decisions  made  at  time  t  will  tend 
to  shape  the  decisions  made  at  time  t  +  1 .  Once  a 
given  institution  has  asserted  itself,  it  tends  to 
reproduce  over  time.  Contrary  to  what  occurs  in 
the  economy,  however,  in  politics,  the  marginal 
productivity  of  an  institution  increases  over  time, 
as  Paul  Pierson  has  shown. 

To  the  individualistic  outlook  of  rational  choice 
theory,  historical  institutionalism  has  placed  in 
opposition  a  vision  of  the  political  process  as  struc¬ 
tured  by  institutions  that  have  consolidated  over 
time  and  thus  shape  this  process.  Also,  windows  of 
opportunity  for  institutional  change  open  up 
under  conditions  of  institutional  crisis,  but  the 
actors,  nevertheless,  are  constrained  to  act  within 
the  bounds  inherited  from  the  previous  arrange¬ 
ments.  Here,  there  is  no  heroic  vision  of  actors  as 
in  the  voluntaristic  vision  of  agency  that  rational 
choice  institutionalism  assumes.  Simultaneously, 
regarding  historical  macro-analyses,  historical 
institutionalism  has  steered  clear  of  the  pitfalls  of 
teleology  that  have  frequently  imprisoned  histori¬ 
cal  research. 

Criticisms 

This  theory  too  has  been  subject  to  criticism. 
First,  historical  institutionalism  has  paid  less  atten¬ 
tion  than  rational  choice  institutionalism  to  the 
role  of  individual  actors,  and,  in  general,  it  has 
been  more  concerned  with  the  possibility  of  agency 
in  the  historical  evolution  of  a  given  institutional 
structure.  The  absence  of  a  theory  of  agency  has 
led  historical  institutionalists  to  emphasize  the 
inertia  of  institutions  (conceived  as  “sticky”  struc¬ 
tures),  even  though  they  were  frequently  con¬ 
strained  to  modify  or  adapt  themselves.  For  this 
reason,  they  are  unable  to  provide  precise  indica¬ 
tions  concerning  the  chains  of  cause  and  effect  that 
operate  between  the  institutional  macrostructures 
and  the  microlevel  individual  decisions.  Second, 
historical  institutionalism  does  not  have  at  its  dis¬ 
posal  an  analytical  device  to  falsify  its  conclusions. 
Once  a  particular  institution  or  policy  has  been 
reconstructed,  it  may  prove  difficult  to  imagine  an 
alternative  sequence.  If  every  historical  develop¬ 
ment  is  unique  and  if  it  is  not  possible  to  employ 
counterfactual  hypotheses,  then  the  possibility  that 
the  theory  will  fall  into  some  kind  of  determinism 
is  high. 


Third,  historical  institutionalists  have  compared 
only  a  limited  number  of  cases  (and  it  could  not 
have  been  otherwise,  given  their  need  to  examine 
each  case  in  depth),  so  how  can  valid  knowledge 
be  generated  from  these  few  cases  unless  supported 
by  an  extensive  verification  of  the  postulated 
hypotheses?  It  is  no  coincidence  that  the  scholars 
who  most  extensively  employ  statistical  methods 
criticize  historical  institutionalists  for  selecting 
their  case  studies  according  to  the  dependent  vari¬ 
able  that  they  wish  to  explain.  As  Gary  King, 
Robert  Keohane,  and  Sidney  Verba  have  pointed 
out,  historical  institutionalism,  like  rational  choice 
institutionalism,  tends  to  select  cases  that  fit  with 
the  approach  adopted  while  ignoring  those  that  do 
not.  For  example,  historical  institutionalists  com¬ 
pare  countries  in  which  a  political  revolution 
occurred  but  do  not  explain  why  this  did  not  hap¬ 
pen  in  other  countries  with  similar  economic, 
social,  and  cultural  characteristics. 

Sociological  Institutionalism 

The  Return  of  Political  Culture 

The  main  mesolevel  theory  of  comparative  poli¬ 
tics  is  sociological  institutionalism.  Among  the 
representatives  of  this  approach,  James  G.  March 
and  Johan  P.  Olsen  should  be  mentioned.  According 
to  sociological  institutionalism,  institutions  are  not 
just  the  source  of  rules  for  solving  the  problem  of 
collective  action  or  path-dependent  structures  that 
condition  future  decisions  but  configurations  of 
meanings  that  the  actors  come  to  adopt.  They 
solicit  the  formation  of  mental  maps  concerning 
the  appropriate  political  behavior  that  guide  the 
actors  operating  within  institutions.  They  are 
sources  of  meaning  independent  of  the  actors  that 
embrace  them.  Sociological  institutionalism  is  a 
mesolevel  theory  that  conceptualizes  interactions 
between  actors  within  institutional  systems  that 
are  not  limited  (unlike  limited  systems,  e.g.,  the 
committees  of  the  U.S.  Congress)  and  that  are  not 
extended  (unlike  systems  such  as  welfare  states). 

Sociological  institutionalism  exhibits  a  strategic 
interest  in  culture  (i.e.,  in  meanings,  symbols,  com¬ 
mon  sense,  ways  of  thinking,  and  cognitive  frames). 
Indeed,  it  is  a  development  of  the  rich  tradition  of 
cultural  theory  of  comparative  politics,  which  was 
rather  relevant  in  political  science  until  the  1970s. 
After  a  phase  of  decline,  during  the  1990s,  the 
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interest  in  political  culture  has  clamorously 
returned  to  the  stage  of  scientific  debate.  In  par¬ 
ticular,  with  the  development  of  large-scale 
researches  on  social  capital,  the  interest  in  political 
culture  was  and  continues  to  be  shared  not  only  by 
political  scientists  but  also  by  sociologists,  econo¬ 
mists,  and  anthropologists.  Robert  Putnam  has 
been  a  pioneer  in  this  regard,  first  conducting 
extensive  empirical  research  on  the  relations 
between  the  social  capital  of  the  various  regions  of 
Italy  and  their  institutional  performance  (Putnam, 
1993)  and  then  investigating  the  decline  of  social 
capital  in  the  United  States  and  its  effects  on  the 
associationalism  of  that  country  (Putnam,  2000). 
In  both  cases,  Putnam  has  shown  how  the  quality 
of  civic  life  forms  the  basis  for  the  development  of 
effective  institutions  in  the  context  of  a  democratic 
society.  This  model  was  thus  tested  in  a  compara¬ 
tive  research  effort,  directed  by  Putnam  and  pub¬ 
lished  in  2002,  which  analyzed  the  relation  between 
social  capital  and  institutional  performance  in  the 
main  contemporary  advanced  democracies. 

According  to  these  theorists,  political  culture 
has  to  be  treated  as  the  independent  variable  and 
institutional  performance  as  the  dependent  vari¬ 
able.  Societies  differ  because  they  are  characterized 
by  different  politico-cultural  attitudes.  Those  cul¬ 
tural  differences  are  relatively  durable,  even  if  they 
are  not  immutable.  They  can  explain,  for  example, 
why  some  countries  enjoy  a  stable  democracy 
whereas  others  do  not.  The  theorists  of  political 
culture  have  sought  to  provide  an  intersubjective 
analysis  of  politics,  in  which  the  intersubjectivity  is 
conditioned  (if  not  determined)  by  the  cultural 
context  in  which  it  develops.  This  approach  has 
been  of  considerable  relevance  for  the  contempo¬ 
rary  world  as  it  has  provided  an  explanation  of 
why  institutional  innovations  encounter  difficul¬ 
ties  in  unchanged  cultural  contexts  and  why  pro¬ 
cesses  of  democratization  find  it  difficult  to 
advance  in  certain  countries  where  the  cultural 
assumptions  of  the  previous  regime  have  not  been 
questioned.  Yet,  according  to  the  critics,  the  con¬ 
cept  of  political  culture  risks  being  tautological  in 
character.  Citizens  behave  in  a  certain  way  because 
of  the  presence  of  a  particular  culture,  but  that 
particular  culture  is  defined  by  the  fact  that  citi¬ 
zens  behave  in  a  certain  way.  As  a  consequence, 
this  concept  makes  it  hard  to  explain  the  changes 
in  social  behavior  and  individual  beliefs,  changes 


that  nevertheless  occur  regularly  in  contemporary 
political  systems. 

Sociological  institutionalists  have  sought  to 
come  to  terms  with  that  debate  by  reducing  the 
concept  of  political  culture  to  the  institutions  that 
shape  it  and  to  the  individuals  that  legitimate 
them.  According  to  these  scholars,  institutions  are 
not  only  rules  of  the  game  or  crystallized  historical 
arrangements  but  also  encompass  symbolic  sys¬ 
tems,  cognitive  maps,  and  moral  frameworks  of 
reference  that  represent  the  meanings  that  guide 
behavior.  Accordingly,  culture  is  transmitted  by 
means  of  institutions,  and  because  of  this,  they  are 
simultaneously  formative  and  constraining. 
Culture  functions  as  the  independent  variable  to 
the  extent  that  it  is  institutionalized.  Sociological 
institutionalism  has  undertaken  a  holistic  analysis 
that  seeks  to  recombine  agency  and  structure  in  a 
comprehensive  scheme  of  meanings  and  ways  of 
thinking.  As  has  been  noted,  anyone  who  has  spent 
time  waiting  at  a  traffic  light  when  nobody  else 
was  around  will  be  able  to  understand  the  impor¬ 
tance  of  internalized  ways  of  thinking.  In  stable 
institutional  contexts,  as  March  and  Olsen  have 
noted,  behavior  is  rarely  motivated  by  the  logic  of 
utility  but  rather  by  the  logic  of  appropriateness. 
Individuals  behave  in  a  certain  way  because  that  is 
what  is  expected  of  them  in  that  particular  institu¬ 
tional  context.  A  given  political  culture  is  a  com¬ 
ponent  of  the  process  that  produces  it,  thus  mak¬ 
ing  the  relation  between  the  independent  and  the 
dependent  variables  in  this  process  much  more 
interactive. 

Criticisms 

Sociological  institutionalism  has  not  been 
immune  to  criticism.  Two  points  are  of  particular 
relevance.  First,  as  Peter  A.  Hall  and  Rosemary  C.  R. 
Taylor  (1996)  have  argued,  sociological  institu¬ 
tionalism  is  based  on  the  assumption  of  stable  and 
legitimate  institutional  conditions.  But  what  hap¬ 
pens  if  one  wants  to  explain  the  emergence  or 
construction  of  new  institutions?  And  what  is  the 
appropriate  behavior  in  a  situation  of  change? 
Processes  of  institution  building  inevitably  tend  to 
involve  actors  with  diverse  and  contradictory  cog¬ 
nitive  frameworks  and  with  conflicting  interests. 
Second,  the  holistic  approach  of  sociological  insti¬ 
tutionalism  does  not  allow  for  a  conceptualization, 
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and  even  less  measurement,  of  the  contrasts 
between  cognitive  frameworks — that  is,  of  compe¬ 
tition  or  conflict  between  actors  over  how  to 
understand  and  apply  the  predominant  cultural 
frames.  Institutions  and  actors  stand  in  a  relation 
of  reciprocal  influence:  Whereas  institutions  con¬ 
tribute  to  defining  the  identity  of  the  actors,  socio¬ 
logical  institutionalism  must  also  hold  that  the 
characteristics  of  the  latter  (their  decisions,  their 
strategies,  their  visions)  tend  to  define  the  identity 
of  the  former.  Sociological  institutionalism,  thus, 
underestimates  the  impact  of  competition  and  con¬ 
flict  between  actors  (and  between  their  contrasting 
interpretations  of  the  appropriate  behavior)  on  the 
definition  of  those  same  institutions.  This  implies 
that  even  consolidated  institutions  are  cognitively 
less  determined  than  sociological  institutionalism 
assumes.  For  a  summary  of  the  discussion  on  the 
three  institutionalisms,  see  Table  2. 

Conclusion 

It  is  difficult  to  find  a  theory  of  comparative 
politics  that  does  not  refer,  in  one  way  or  another, 
to  institutions.  For  comparative  politics  scholars, 
institutions  matter.  Yet  these  theories  differ  sig¬ 
nificantly  with  respect  to  (a)  what  is  understood  to 
be  an  institution,  (b)  how  institutions  are  created, 
(c)  why  and  when  they  are  important,  and  (d)  how 
institutions  change.  The  different  viewpoints  of 
institutionalist  scholars  have  also  highlighted  the 
existence  of  different  research  programs.  The 


rationalist  approach  is  engaged  in  a  formidable 
undertaking  of  simplification  of  comparative  poli¬ 
tics,  as  these  scholars  aim  to  provide  their  research 
program  with  a  microeconomic  foundation. 
Historical  and  sociological  institutionalists, 
instead,  seem  to  be  engaged  in  an  equally  formi¬ 
dable  enterprise  of  complexification  of  compara¬ 
tive  politics,  as  they  start  from  less  limited  and  less 
restrictive  assumptions.  The  former  seek  to  con¬ 
struct  a  theory  on  the  basis  of  the  actor,  whereas 
the  latter  start  from  the  structures  or  the  meanings 
embedded  in  them. 

This  division  ultimately  refers  to  the  problem  of 
what  is  to  be  understood  as  a  theory  in  political 
science  (Charles  Ragin,  1994).  According  to  the 
rationalists,  political  science  needs  to  equip  itself 
with  an  epistemological  structure  similar  to  that  of 
the  most  formalized  social  science — namely,  eco¬ 
nomics.  For  nonrational  approaches,  its  social  util¬ 
ity  is  justified  by  the  ability  to  provide  conceptual 
frameworks  within  which  problems  of  public  rel¬ 
evance  can  be  examined  but  without  any  pretense 
of  becoming  a  positive  science.  The  first  wants  to 
predict  what  will  happen;  the  other  two  want  to 
explain  what  has  happened.  Which  one  of  the 
available  theories  to  employ  should  be  decided  by 
the  problem  under  investigation  and  not  as  a  mat¬ 
ter  of  principle.  Because  it  is  the  task  of  political 
science  to  investigate  different  problems,  then,  as 
Donatella  Della  Porta  and  Michael  Keating  sub¬ 
mit,  the  lively  theoretical  pluralism  of  the  disci¬ 
pline  should  be  welcomed. 


Table  2  Neo-Institutionalist  Theories 


Theories 

Level  of 
Analysis 

Independent 

Variable 

Intervening 

Variable 

Dependent 

Variable 

(Examples) 

Role  of  Institutions 

Rational  choice 
institutionalism 

Micro 

Individual 

preferences 

Institutions 

•  Decisions 

•  Allocation  of 
resources  and 
posts 

Make  collective 
action  possible 

Historical 

institutionalism 

Macro 

Institutional 

arrangements 

Time 

•  Systems  of 
regulation 

•  Public  policies 

Structure  the 
context  of  the 
political  process 

Sociological 

institutionalism 

Meso 

Institutionalized 

cultural 

systems 

Meanings 

•  Decisions 

•  Strategies 

Shape  the  identity  of 
the  actors 
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Issues  of  Comparative  Politics 

There  are  significant  regional  differences  in  the 
issues  investigated  by  scholars  in  Africa,  Asia,  Latin 
America,  Europe,  and  the  United  States  since  the  end 
of  the  Cold  War.  However,  those  issues  generally 
deal  with  the  implications,  functioning,  and  trans¬ 
formation  of  democracy.  This  is  perhaps  why  insti¬ 
tutional  theories  of  comparative  politics  have  become 
so  successful.  Democracy  is  a  political  regime  that 
requires  specific  institutions,  although  those  institu¬ 
tions  may  function  properly  if  legitimated  by  coher¬ 
ent  values  (or  political  culture)  diffused  among  the 
citizens.  Since  the  end  of  the  Cold  War,  democracy 
has  become  the  only  legitimate  game  in  town.  In 
1900,  only  10  countries  were  considered  democra¬ 
cies,  but  in  1975,  there  were  30  such  countries.  In 
2010,  115  out  of  194  countries  recognized  by  the 
United  Nations  were  considered  electoral  democra¬ 
cies  by  the  international  nongovernmental  organiza¬ 
tion  Freedom  House.  This  spectacular  diffusion  of 
democracy  has  inevitably  attracted  the  interest  of 
comparative  politics  scholars.  Certainly,  other  topics 
unrelated  to  democracy  have  been  investigated  in  a 
comparative  perspective  (e.g.,  revolutions,  civil  wars, 
ethnic  strife,  Islamic  regimes).  Nevertheless,  the 
issues  connected  to  democracy  have  represented  the 
operational  link  between  Western  and  non-Western 
scholars.  The  following  sections  identify  a  few  areas 
of  investigation  within  the  vast  literature. 

Democracy  and  Supranational  Developments 

The  debate  on  democratic  models  has  continued 
to  be  at  the  center  of  comparative  politics  in  the 
Western  world.  Thanks  to  the  pioneering  work  of 
Arend  Lijphart  (1999),  different  patterns  of  demo¬ 
cratic  organization  and  functioning  have  been 
detected  within  the  family  of  stable  democratic 
countries.  According  to  Lijphart,  democracies 
might  be  classified  according  to  the  two  ideal 
types — majoritarian  democracies  and  consensual 
democracies — as  a  consequence  of  the  structure  of 
their  social  cleavages  and  institutional  rules.  This 
classification  has  been  very  important  for  freeing 
the  analysis  from  the  old  normative  argument, 
which  assumed  that  there  were  more  developed 
democracies  (of  course,  the  Anglo-American  ones) 
and  less  developed  democracies  (of  course,  the  con¬ 
tinental  European  ones).  Lijphart’s  classification 
has  been  revised  by  several  authors.  For  some  of 


them,  such  as  Sergio  Fabbrini  (2008),  the  distinc¬ 
tion  between  patterns  of  democracy  concerns  more 
their  functional  logic  than  their  specific  institu¬ 
tional  properties.  What  matters  is  the  fact  that  cer¬ 
tain  democracies  function  through  an  alternation  in 
government  of  opposite  political  options,  whereas 
others  function  through  aggregation  in  government 
of  all  the  main  political  options.  Indeed,  alternation 
in  government  takes  place  regularly  in  democracies 
that  do  not  adopt  a  majoritarian  (Westminster) 
first-past-the-post  electoral  system,  such  as  Spain, 
Greece,  or  Germany.  These  democracies  are  com¬ 
petitive,  notwithstanding  their  nonmajoritarian 
electoral  systems.  The  consolidation  of  democracy 
in  Eastern  Europe,  Latin  America,  and  Asia  has 
increased  the  number  of  countries  to  be  considered 
for  the  identification  of  democratic  patterns.  In 
dealing  with  this  process,  Lijphart  (in  his  subse¬ 
quent  works)  has  gradually  (and  surprisingly) 
shifted  to  a  more  normative  approach,  arguing  that 
the  consensual  model  represents  a  better  model  for 
the  new  democracies  to  adopt. 

At  the  same  time,  the  end  of  the  Cold  War  and 
the  prospect  of  the  political  reaggregation  of  the 
continent  have  accelerated  the  process  of  European 
integration;  in  the  period  from  the  Maastricht 
Treaty  (1992)  to  the  Lisbon  Treaty  (2009),  Europe 
has  become  the  European  Union  (or  EU).  The  pro¬ 
cess  of  European  integration  was  traditionally  con¬ 
sidered  a  unique  experiment  in  international  rela¬ 
tions.  International  relations  scholars  were  interested 
in  explaining  the  process  of  integration  rather  than 
its  outcome  (i.e.,  the  community  system  and  its  insti¬ 
tutional  characteristics).  Since  the  Treaty  of 
Maastricht  (1992),  a  new  generation  of  studies  has 
started  to  investigate  the  EU  as  a  political  system. 
However,  although  the  EU  could  no  longer  be  con¬ 
sidered  an  international  regime,  it  could  not  be  com¬ 
pared  with  other  domestic  political  systems.  In  some 
cases,  its  supranational  character  has  come  to  be 
considered  exceptional,  sui  generis,  unique  by  sev¬ 
eral  observers  and  scholars.  Or,  if  compared,  the  EU 
has  been  compared  on  the  basis  of  generic  or  behav¬ 
ioral  criteria,  such  as  the  style  of  decision  making, 
ways  in  which  political  leaders  interact,  attitudes  in 
managing  public  policies,  and  relations  between 
interest  groups.  Lijphart  (in  his  book  of  1999)  con¬ 
sidered  the  EU  as  a  case  of  consensual  democracy. 

A  different  comparative  approach  has  been  taken 
by  other  authors.  Based  on  specific  institutional 
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criteria,  Fabbrini  (2010)  has  argued  that  the  EU  is 
a  political  system  organized  around  multiple  sepa¬ 
rations  of  powers.  In  the  EU,  there  is  no  govern¬ 
ment  as  such,  as  in  the  parliamentary  or  semipresi- 
dential  systems  of  its  member  states  that  are  orga¬ 
nized  according  to  the  principle  of  the  fusion  of 
powers.  Contrary  to  systems  of  fusion  of  powers, 
the  system  of  multiple  separations  of  power  func¬ 
tions  without  a  government  as  the  final  locus  of 
decision-making  power.  Such  systems  are  proper 
unions  of  states  rather  than  nation-states — in  par¬ 
ticular,  unions  of  asymmetrically  correlated  states. 
Because  of  this  (structural  and  cultural)  asymme¬ 
try,  such  unions  cannot  accommodate  the  central¬ 
ization  of  decision-making  power.  If  institutions 
matter,  then  to  classify  the  EU  as  a  consensual 
democracy  appears  highly  unconvincing.  Like 
other  democratic  unions  of  states,  such  the  United 
States  and  Switzerland,  the  EU  is  a  species  of  a  dif¬ 
ferent  democratic  genus,  and  could  be  called  a 
compound  democracy.  Asymmetrical  unions  of 
states  can  be  subsumed  neither  under  the  model  of 
consensual  democracy  nor  under  the  models  of 
majoritarian/competitive  democracy,  because  they 
have  neither  a  government  nor  an  opposition.  One 
might  argue  that  they  are  Madisonian  systems 
functioning  on  the  basis  of  checks  and  balances 
between  institutions  and  not  between  political 
options  as  in  fusion-of-powers  democracies.  The 
classification  of  democratic  patterns,  if  it  is  to  take 
into  consideration  institutional  systems,  needs  be 
enlarged  to  a  more  comprehensive  typology.  The 
development  of  the  EU  has  allowed  comparative 
politics  to  overcome  national  borders  and  apply  its 
tools,  concepts,  methods,  and  theories  to  the  study 
of  a  supranational  political  system.  At  the  same 
time,  the  EU  has  also  been  compared  with  other 
regional  organizations,  a  comparison  that  has 
shown  the  difference  between  political  and  eco¬ 
nomic  regionalism.  The  comparative  analysis  of 
politics  has  been  relaunched  by  the  development  of 
the  EU. 

Democratization  and  Consolidation 

The  end  of  the  Cold  War  also  had  dramatic 
effects  in  the  non-Western  world,  ushering  in  a  new 
democratic  era  in  Africa,  Latin  America,  and  Asia. 
In  these  regions,  a  process  of  democratization 
started  with  the  aim  of  creating  regimes  able  to 


provide,  as  it  was  claimed  in  many  quarters,  secu¬ 
rity  in  the  sense  of  protection  against  widespread 
and  arbitrary  violations  of  civil  liberties.  Many 
political  elites  of  new  democratizing  countries 
seemed  to  share  the  belief  that  a  democratic  regime 
has  an  intrinsic  value.  This  belief  was  epitomized 
by  the  introduction  of  democratic  rule  in  South 
Africa  in  1994.  On  these  empirical  bases,  the 
1990s  registered  a  diffusion  of  studies  on  democra¬ 
tization  (e.g.,  Geddes,  2007),  studies  that  benefi¬ 
ted  largely  from  the  previous  generation  of  works 
on  the  democratization  of  Southern  Europe  and 
Latin  America,  such  as  that  by  Leonardo  Morlino 
(1998). 

Democratization  has  entailed  the  introduction 
of  reforms  aimed  at  limiting  the  role  of  the  state  in 
the  political  sphere.  It  has  been  about  restoring 
political  pluralism,  whereby  different  political  and 
civic  organizations  participate  in  the  political  pro¬ 
cess  without  hindrance.  According  to  Ben  O. 
Nwabueze  (1993),  democratization  was  meant  to 
enhance  transparency  and  accountability.  Although 
most  countries  in  Africa  and  Asia  have  introduced 
political  changes  in  their  polities,  the  crucial  ques¬ 
tion,  especially  in  Africa,  has  continued  to  be 
whether  democratization  is  reversible  or  not.  As 
Larry  Diamond  and  Marc  Plattner,  among  others, 
have  pointed  out,  in  some  cases,  newly  established 
democratic  orders  have  devolved  into  pseudode¬ 
mocracies.  Despite  this,  Africa,  Asia,  and  Latin 
America  have  made  tremendous  progress  toward 
democratization,  although  North  Africa  and  the 
Middle  East  have  yet  to  make  a  major  step  in  this 
regard.  Especially  in  the  1990s,  scholars  of  com¬ 
parative  politics  devoted  their  work  to  seeking  to 
explain  why  some  countries  and  not  others  were 
successful  in  transitioning  from  nondemocratic  to 
democratic  systems.  Some  studies  focused  on  the 
strategic  role  played  by  individual  leaders  (Nelson 
Mandela  in  South  Africa  or  Mahathir  Mohamad 
in  Malaysia)  as  key  drivers  of  change  and  guaran¬ 
tors  of  political  transformation.  Others  empha¬ 
sized  the  role  of  specific  institutional  settings  for 
supporting  the  democratization  of  a  country.  Still 
others  have  investigated  the  role  of  civil  society  in 
fostering  or  contrasting  democratization,  accord¬ 
ing  to  an  approach  not  dissimilar  to  the  social 
capital  approach. 

With  the  diffusion  of  democracy  in  Africa,  Asia, 
and  Latin  America  in  the  first  decade  of  the 
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21st  century,  scholars  have  started  to  investigate 
the  conditions  that  have  helped  or  impeded  the 
consolidation  of  democracy  after  two  decades  of 
repeated  elections  and  broad  institutional  reforms. 
The  challenges  of  democratic  consolidation  have 
been  greater  in  Africa  than  in  other  parts  of  the 
world.  The  democratization  of  Eastern  European 
and  Latin  American  countries  has  been  largely  sup¬ 
ported  by  regional  organizations  (e.g.,  as  the  EU  or 
Mercosur  [Mercado  Comrin  del  SurJ).  One  of  the 
conditions  for  participating  in  these  organizations 
and  for  enjoying  their  economic  benefits  is  that 
democratic  principles  should  be  respected.  This  has 
not  been  the  case  in  Africa  because  of  the  fragility 
of  the  African  Union.  Although  there  are  still  cases 
of  conflict  and  instability  in  countries  such  as 
Venezuela,  Peru,  Timor-Leste  (East  Timor), 
Thailand,  Georgia,  and  Kyrgyzstan,  these  are  iso¬ 
lated  cases.  It  is  in  Africa  that  the  consolidation  of 
democracy  has  continued  to  be  an  open  question. 
Although  Africa  has  made  a  big  step  toward 
democratization,  democracy  is  far  from  having 
been  consolidated.  Democratic  reversal  has  contin¬ 
ued  to  be  a  likely  possibility  for  most  African  coun¬ 
tries.  The  political  crises  in  Kenya,  Zimbabwe,  and 
Madagascar  in  the  past  few  years  are  cases  in  point. 
Democratic  consolidation  requires,  among  other 
things,  a  government  turnover,  and  most  countries 
in  Africa  are  yet  to  undergo  this  crucial  test.  Even 
in  countries  that  have  maintained  stability,  such  as 
Botswana  and  Namibia,  one-party  dominance 
remains  a  key  feature  of  their  democracy. 

Quality  of  Democracy  and  Development 

Following  pioneering  works  on  the  quality  of 
democracy,  such  as  that  by  Larry  Diamond  and 
Leonardo  Morlino  (2005),  scholars  have  investi¬ 
gated  the  same  in  the  countries  of  Africa  and  Asia, 
reaching  very  critical  conclusions.  In  Africa,  due 
mainly  to  the  low  levels  of  development  and  wide¬ 
spread  poverty,  the  traditional  debate  on  democ¬ 
racy  versus  economic  development  has  been  very 
much  alive  (Adam  Przeworski,  Michael  Alvarez, 
Jose  Antonio  Cheibub,  &  Fernando  Limongi, 
2000).  In  the  1990s,  the  return  to  multiparty  dem¬ 
ocratic  systems  in  African  countries  raised  hopes 
and  expectations  for  development.  After  almost 
two  decades  of  multiparty  elections,  however, 
those  hopes  have  begun  to  dwindle  as  development 


is  slow  and  poverty  levels  have  remained  high.  A 
number  of  surveys,  such  as  those  by  the  Afro 
Barometer  Group,  United  Nations  Economic 
Commission  for  Africa  (UNECA),  and  Transparency 
International  and  several  others  based  on  regional 
and  individual  country  comparisons,  have  been 
conducted  in  Africa  on  the  subject  of  public  percep¬ 
tions  of  good  governance,  the  effective  delivery  of 
services  by  the  state,  and  levels  of  corruption  and 
performance  on  human  development  indicators.  In 
Asia,  similar  studies  on  public  perceptions  of 
democracy  and  the  effects  of  government  perfor¬ 
mance  on  development  have  been  common. 

Investigating  developments  in  Africa  and  Asia, 
comparative  politics  scholars  have  come  up  with 
indicators  that  purport  to  measure  democracy  and 
good  governance.  The  World  Bank  has  been  a 
leading  institution  in  asserting  that  good  gover¬ 
nance  is  the  basis  for  economic  success.  It  has 
argued  that  those  countries  that  have  successfully 
instituted  rule  of  law,  established  a  culture  of  regu¬ 
lar  free  and  fair  elections,  and  minimized  corrup¬ 
tion  have  been  able  to  attract  foreign  private  inves¬ 
tors  and  thereby  performed  much  better  in  devel¬ 
opment  than  those  that  have  been  unable  to  do  so. 
Other  organizations  based  mainly  in  Europe  and 
North  America,  such  as  Freedom  House, 
International  Institute  for  Democracy,  and  the 
United  Nations  Development  Program,  have  come 
up  with  criteria  for  assessing  performance  in  polit¬ 
ical  and  civil  rights,  democracy,  governance,  and 
economic  performance.  In  Africa,  the  UNECA  has 
developed  an  elaborate  survey  methodology  seek¬ 
ing  to  evaluate  public  perceptions  of  the  state  in 
various  countries,  comparing  their  performance  on 
some  key  governance  indicators,  such  as  rule  of 
law,  freedom  and  fairness  of  elections,  women’s 
participation  in  politics,  levels  of  corruption,  and 
the  effectiveness  of  the  checks  and  balances  between 
the  core  institutions  of  governance  (with  a  particu¬ 
lar  focus  on  the  independence  of  the  judiciary). 

Africa  experienced  poor  governance  and  ram¬ 
pant  corruption  in  the  decolonization  decades  of 
the  1970s  and  1980s,  in  part  because  of  the  diffuse 
corruption  of  public  officials  and  governors.  As  a 
result,  since  the  1990s,  Africa  has  been  under  pres¬ 
sure  from  international  organizations  and  local 
reformers  to  embrace  governance  reforms.  The 
African  Union  has  acknowledged  that  good  gover¬ 
nance  continues  to  be  a  challenge  in  Africa.  It  has 


Comparative  Politics  353 


therefore  introduced  the  African  Peer  Review 
Mechanism  and  the  African  Union’s  Convention 
on  Preventing  and  Combating  Corruption  and 
Related  Offences  for  improving  the  standards  of 
governance  in  the  continent.  Finally,  some  investi¬ 
gations  have  shown  a  correlation  between  human 
development  and  political  stability.  Indeed,  human 
development  has  evaded  most  countries  of  the 
developing  world  because  political  instability  has 
continued  to  be  a  major  and  recurrent  problem. 
Examples  include  Algeria,  Burundi,  Kenya, 
Nigeria,  Rwanda,  Somalia,  Sudan,  and  the  Dem¬ 
ocratic  Republic  of  the  Congo  in  Africa;  parts  of 
India,  Nepal,  and  Pakistan  in  Asia;  and  Colombia 
and  Guatemala  in  Latin  America. 

Democracy  and  Constitutionalization 

The  study  of  constitutions  and  constitution¬ 
making  processes  features  prominently  in  the  com¬ 
parative  politics  discourse,  especially  in  the  newly 
democratizing  countries.  The  return  to  democratic 
rule  has  ushered  in  new  debates  on  how  best  to 
craft  and  reform  constitutions  and  helped  ensure 
that  constitutions  facilitate  democratic  governance 
and  protect  human,  ethnic,  cultural,  and  other 
rights  that  characterize  complex  postcolonial  and 
postconflict  societies.  In  this  regard,  in  Africa,  the 
constitutions  of  Namibia  and  South  Africa  have 
been  considered  good  examples  because  of  their 
racial  and  ethnic  balancing,  power-sharing  mecha¬ 
nisms,  protection  of  basic  human  rights,  and 
enshrined  checks  and  balances.  Constitutional 
reforms  have  been  introduced  in  Kenya,  Zimbabwe, 
Swaziland,  and  Lesotho  to  deal  with  internal  con¬ 
flicts.  In  these  and  other  cases,  the  reform  of  the 
national  constitution  has  been  considered  neces¬ 
sary  for  engineering  peaceful  political  succession. 

Many  countries,  especially  in  Africa,  have  set  a 
two-term  limit  for  the  presidency.  In  practice,  this 
innovative  measure  has  sparked  controversy  and 
conflict  in  Malawi,  Namibia,  and  Zambia,  among 
others,  where  the  outgoing  president  has  sought  to 
change  the  constitution  to  ensure  that  he  could 
serve  a  third  term.  The  third-term  issue  has  thus 
become  central  in  political  debate.  Succession  is  an 
area  that  has  attracted  interest  and  controversy  in 
Africa  and  in  some  Latin  American  countries,  such 
as  Venezuela  and  Honduras.  The  succession  not 
only  to  the  state’s  presidency  but  also  to  the  party 


leadership  have  become  political  issues.  The  cases 
of  Botswana,  Kenya,  Malawi,  Namibia,  Nigeria, 
Uganda,  South  Africa,  and  Zimbabwe  show  that 
the  African  continent  has  not  yet  resolved  these 
issues.  Succession  in  the  party  has  been  a  concern 
because  it  has  been  closely  linked  to  succession  in 
the  presidency.  As  a  result,  political  scientists  such 
as  Roger  Southall  and  Henning  Melber  have 
focused  on  the  role  of  former  presidents,  the  lega¬ 
cies  of  political  power,  and  the  importance  of  term 
limits. 

The  issues  of  constitutions  and  constitution 
making  have  been  central  in  the  European  debate 
also.  Through  the  1990s,  all  the  Eastern  European 
countries  had  to  redefine  their  constitutional  set¬ 
tings  drastically.  At  the  same  time,  the  process  of 
enlargement  of  the  EU  has  accelerated  the  search 
for  a  new  constitutional  setting  able  to  guarantee 
the  functioning  of  a  regional  organization  encom¬ 
passing  (in  2010)  27  member  states  and  half  a  bil¬ 
lion  inhabitants.  The  first  decade  of  the  21st  cen¬ 
tury  has  been  the  constitutional  decade  of  the  EU, 
although  this  decade  has  witnessed  the  amendment 
of  the  existing  treaties  rather  than  the  approval  of 
a  new  and  encompassing  constitutional  treaty. 
Even  the  contrasted  process  of  constitutionaliza¬ 
tion  of  the  EU  has  appeared  less  exceptional  when 
compared  with  the  experience  of  other  compound 
democracies.  This  debate  has  also  led  to  a  vivid 
discussion  on  European  citizenship  and  more  gen¬ 
erally  on  how  to  guarantee  human  rights  in  a  mul¬ 
tilevel  supranational  system. 

Democracy  and  Representation 

Political  representation  has  changed  dramatically 
since  the  end  of  the  Cold  War.  In  established  demo¬ 
cratic  systems,  starting  with  the  United  States, 
political  parties  have  entered  a  long  phase  of  down¬ 
sizing  and  restructuring.  Mass  political  parties  have 
become  icons  of  the  past.  Political  parties  have 
become  state  agencies  in  Europe  (as  argued  by 
Richard  Katz  and  Peter  Mair)  and  electoral  commit¬ 
tees  in  the  United  States  (as  argued  by  Sandy 
Maisel).  On  both  sides  of  the  Atlantic,  they  have 
developed  as  supporting  structures  of  the  leader  at 
the  electoral  and  governmental  level.  A  vast  litera¬ 
ture  has  shown  how  parties  have  been  integrated  by 
other  actors  in  the  electoral  arena.  While  in  the  West 
scholars  have  been  investigating  the  consequences 
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on  governance  of  the  decline  and  transformation  of 
political  parties,  in  the  non-Western  world,  the 
research  issue  has  been  the  opposite:  What  role  can 
newly  founded  parties  play  in  promoting  democ¬ 
racy?  In  democratizing  countries,  strong  political 
parties  have  been  considered  a  necessary  pillar 
for  supporting  democratic  consolidation  (M.  A. 
Mohamed  Salih,  2003).  Political  parties  have  gen¬ 
erally  been  weak  and  constrained  by  a  lack  of 
resources.  This  has  also  drawn  attention  to  the  issue 
of  their  funding.  The  weakness  of  political  parties 
has  been  fed  also  by  their  tendency  to  fragment 
along  ethnic  lines  or  to  rally  around  a  founder 
patron  who  has  often  constrained  their  ability  to 
institutionalize  and  practice  internal  democracy. 
Opposition  parties,  in  particular,  have  been  conflict 
ridden  and  fragmented  throughout  the  new  democ¬ 
racies,  leading  to  their  poor  showing  at  elections. 

In  Africa,  Asia,  and  Latin  America,  comparative 
politics  has  focused,  in  particular,  on  the  function¬ 
ing  of  elections,  thus  contributing  to  identify  crite¬ 
ria  for  evaluating  the  legitimacy  of  the  electoral 
process.  The  issues  have  concerned  the  impact  of 
election  rules  on  the  legitimacy  of  emerging  democ¬ 
racies,  the  institutional  conditions  that  make  elec¬ 
tions  free  and  fair,  the  role  of  electoral  management 
bodies  as  institutions  of  governance,  and  the  impli¬ 
cations  of  observation  and  monitoring  on  the  out¬ 
come  of  the  electoral  process.  These  studies  have 
inspired  specific  regulations,  such  as  the  Southern 
African  Development  Community’s  (SADC) 
Principles  and  Guidelines  Governing  Democratic 
Elections  and  the  Electoral  Institute  of  Southern 
Africa/Electoral  Commissions  Forum’s  Principles 
for  Election  Management,  Monitoring  and 
Observation  in  the  SADC  Region,  both  made  pub¬ 
lic  in  2004.  That  notwithstanding,  in  many  African 
countries,  elections  have  not  called  into  question 
the  power  of  former  liberation  movements  or  rul¬ 
ing  parties  to  dominate  domestic  politics. 

Weakened  parties  in  the  European  parliamen¬ 
tary  democracies  have  contributed  to  the  decision¬ 
making  decline  of  legislatures,  thus  leaving  larger 
room  for  maneuvering  to  the  executives.  Indeed, 
parties  have  come  to  be  controlled  by  their  leader 
once  in  government,  thus  justifying,  as  noted  by 
Thomas  Poguntke  and  Paul  Webb,  a  process  of 
presidentialization  of  politics  in  modern  democra¬ 
cies.  This  process  has  not  concerned  separation  of 
powers  systems;  the  U.S.  Congress  has  continued 


to  be  the  most  powerful  legislature  in  the  demo¬ 
cratic  world.  Although  for  different  reasons,  in 
non-Western  democracies  also,  parliaments  have 
continued  to  remain  weak  institutions.  For 
instance,  it  has  been  argued  that  African  parlia¬ 
ments  have  been  weak  because  of  both  one-party 
dominance  of  the  country  and  an  overly  powerful 
executive,  although  other  studies  have  pointed  to  a 
lack  of  public  support  for  them. 

To  foster  such  support,  proposals  for  integrat¬ 
ing  traditional  institutions  of  leadership  and  gover¬ 
nance  into  modern  structures  of  representation 
were  advanced.  These  traditional  institutions  are 
still  influential  at  local  and  regional  levels  in  many 
countries  around  the  world.  In  Africa,  the  institu¬ 
tion  of  chieftaincy,  or  even  kingdom,  is  still  impor¬ 
tant  in  people’s  daily  existence.  Political  efforts  to 
sideline,  and  in  some  countries  to  abolish,  these 
traditional  institutions  have  not  succeeded.  Many 
countries,  in  Africa  especially,  have  molded  their 
modern  governance  institutions  around  traditional 
ones.  Botswana,  Namibia,  and  South  Africa  pres¬ 
ent  a  model  that  seems  to  have  struck  a  balance  of 
mutual  respect  between  modern  and  traditional 
institutions.  In  other  countries  such  as  Zimbabwe, 
chiefs  have  been  directly  co-opted  into  parliament, 
while  in  other  countries  such  as  Swaziland,  it  has 
been  the  traditional  monarchy  that  has  sought  to 
incorporate  modern  political  institutions. 

The  return  to  institutional  approach  in  compar¬ 
ative  politics  is  linked  to  global  support  for  democ¬ 
racy.  Many  development  agencies  (e.g.,  as  the  U.S. 
Agency  for  International  Development,  the  United 
Nations  Development  Programme,  and  other  non¬ 
governmental  organizations)  have  devoted  signifi¬ 
cant  resources  to  the  strengthening  of  the  institu¬ 
tions  of  governance.  In  particular,  across  the  for¬ 
mer  Soviet  Union,  Africa,  Asia,  and  Latin  America, 
the  focus  has  been  on  strengthening  parliamentary 
and  judiciary  institutions,  and  academics  and  prac¬ 
titioners  are  studying  these  programs  to  assess  their 
impact.  A  substantial  body  of  comparative  politics 
literature  on  development  assistance  has  thus 
emerged. 

Comparative  Policy  Analysis 

The  interest  in  the  organization  and  functioning 
of  democratic  regimes  has  inevitably  promoted 
research  on  the  latter’s  performance  by  scholars  of 
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comparative  politics.  Starting  with  the  1981  vol¬ 
ume  edited  by  Peter  Flora  and  Arnold  J. 
Heidenheimer  during  the  1980s,  the  comparative 
approach  to  policy  analysis  came  to  be  adopted  by 
many  scholars,  in  particular  for  understanding  the 
historical  development  of  different  welfare  systems 
and  for  explaining  the  different  features  of  welfare 
policies  in  Western  Europe  and  the  United  States, 
as  shown  by  the  contributions  of  Gosta  Esping- 
Andersen,  Paul  Pierson,  and  Francis  Castles.  One 
might  argue  that  the  analysis  of  Western  welfare 
policies  constitutes  the  starting  point  of  compara¬ 
tive  policy  analysis,  and  even  today,  it  represents 
its  core  business. 

However,  with  the  end  of  the  Cold  War  and  the 
development  of  the  globalization  process,  the  com¬ 
parative  study  of  welfare  systems  has  come  to 
include  the  analysis  of  new  social  risks,  risks  that 
epitomize  the  pathology  of  contemporary  postin¬ 
dustrial  societies,  as  argued  in  the  volume  edited 
by  Peter  Taylor-Gooby  in  2004.  At  the  same  time, 
the  diffusion  of  the  process  of  globalization  has 
made  the  Western  experience  with  welfare  systems 
less  peculiar  than  in  the  past.  The  problems  associ¬ 
ated  with  guaranteeing  social  security  in  market 
societies  have  come  to  be  shared  by  many  coun¬ 
tries — developing  as  well  as  developed,  democra¬ 
tizing  as  well  as  democratic.  Market  globalization 
has  generated  new  social  externalities,  bringing  to 
the  center  of  comparative  policy  analysis  new 
problems,  such  as  immigration  and  environmental 
issues.  In  this  regard,  one  has  to  consider  the  1997 
volume  edited  by  Martin  Janicke,  Helmut  Widner, 
and  Helge  Jorgens  on  national  environmental 
policies  and  the  2007  volume  by  Eytan  Meyers  on 
international  immigration  policy. 

Starting  in  the  1990s,  the  field  of  public  policy 
has  also  seen  a  dramatic  expansion.  Globalization 
and  Europeanization  have  not  only  created  new 
problems,  but  they  have  pressured  international 
institutions  to  promote  thorough  investigations  of 
the  economic,  financial,  administrative,  and  politi¬ 
cal  performance  of  the  various  countries,  investiga¬ 
tions  accompanied  by  frequent,  specific  recommen¬ 
dations  for  public  policy.  The  conspicuous  develop¬ 
ment  of  comparative  policy  analysis  has  been  made 
possible  by  an  easier  access  to  empirical  data  con¬ 
cerning  the  various  members  of  international  and 
regional  organizations.  The  latter  have  also  sup¬ 
ported  specific  policy  priorities,  introducing  new 


criteria  for  evaluating  countries’  approximation  to 
the  expectations  for  good  governance,  including  the 
call  for  the  promotion  of  gender  equality.  Indeed, 
since  the  second  half  of  the  1990s,  gender  policy 
has  emerged  as  a  significant  policy  interest,  and  not 
just  in  developed  countries,  as  shown  by  the  1995 
volume  edited  by  Dorothy  McBride  Stetson  and 
Amy  G.  Mazur.  This  policy  field  has  been  finalized 
not  only  to  investigate  the  structural  constraints  on 
women’s  participation  in  the  economic  and  social 
life  of  a  country  but  also  to  identify  resources  and 
practices  that  might  promote  women’s  interests  in 
the  various  national  contexts. 

The  gender  approach  to  public  policies  has  thus 
called  into  question  the  normative  premises  of 
many  public  policies.  In  fact,  the  latter  have  been 
based  on  a  culturally  defined  view  of  family  orga¬ 
nization,  social  needs,  and  individual  expectations. 
The  same  concept  of  security  has  been  redefined  to 
meet  new  social  preferences  and  personal  atti¬ 
tudes.  Indeed,  the  comparative  study  of  public 
policy  has  led  to  a  greater  understanding  of  the 
value  structures  of  contemporary  societies,  thereby 
helping  combine  empirical  analysis  and  normative 
assessments. 

The  Future  of  Comparative  Politics 

Comparative  politics  (as  an  academic  discipline) 
has  been  a  major  success.  In  the  post-Cold  War 
era,  our  knowledge  of  democratic,  democratizing, 
and  nondemocratic  political  systems  has  grown 
enormously.  The  methodologies,  concepts,  and 
theories  of  comparative  research  are  widely  used 
by  political  scientists.  With  the  exception  of  the 
United  States,  where  “American  politics”  scholars 
constitute  the  majority,  comparative  politics  has 
become  the  central  subdiscipline  of  world  political 
science.  Indeed,  as  noted  by  Henry  Brady  and 
David  Collier,  the  discipline  as  a  whole  has  been 
defined  epistemologically  by  the  debate  in  com¬ 
parative  politics.  However,  the  success  story  of 
comparative  politics  has  reached  a  critical  junc¬ 
ture.  Comparative  politics  is  encountering  issues 
with  confronting  the  political  problems  of  a  glo¬ 
balized  world.  Its  methods  and  theories  face  diffi¬ 
culties  when  applied  to  processes  that  transcend 
state  borders  and  undermine  the  structure  of  the 
traditional  political  relations  on  which  compara¬ 
tive  politics  is  based. 
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The  historical  transformations  that  have 
occurred  after  the  end  of  the  Cold  War  have  called 
into  question  the  concept  of  sovereignty  itself, 
which  is  the  foundation  of  the  study  of  compara¬ 
tive  politics.  With  the  processes  of  globalization 
that  have  unfolded  tempestuously  since  then,  the 
external  and  internal  sovereignty  of  the  nation¬ 
state  (the  basic  unit  of  analysis  of  comparative 
research)  has  been  eroded.  Simultaneously,  the 
complexity  of  political  systems  and  their  external 
relations  has  increased  to  such  an  unprecedented 
extent  as  to  give  rise  to  a  complex  interdepen¬ 
dence.  This  complex  interdependence  is  changing 
the  nature,  powers,  and  outlook  of  the  units  used 
by  comparative  analysis  for  the  study  of  politics.  It 
simultaneously  disarticulates  domestic  and  inter¬ 
national  politics,  creating  more  levels  of  correla¬ 
tion  between  variables,  levels  that  are  not  necessar¬ 
ily  connected  with  each  other. 

This  being  so,  it  becomes  increasingly  less  plau¬ 
sible  to  establish  what  constitutes  an  independent 
cause  of  a  dependent  outcome.  If  domestic  politi¬ 
cal  systems  are  not  independent  of  external  pro¬ 
cesses  and  if  the  actors  that  operate  within  them  do 
not  have  the  ability  to  act  as  agents  that  connect 
cause  and  effect,  then  the  fundamental  precondi¬ 
tions  of  comparative  analysis  are  being  eroded. 
Therefore,  it  is  increasingly  less  likely  to  assume 
that  the  various  political  systems  are  distinct  enti¬ 
ties,  because  in  reality,  they  are  not. 

Globalization  and  Europeanization 

Domestic  political  systems  are  embedded  in  an 
institutionalized  international  context  that  notice¬ 
ably  constrains  the  autonomy  of  their  decisions. 
The  majority  of  nation-states  are  members  of 
international  economic  organizations  (e.g.,  the 
World  Bank,  the  International  Monetary  Fund, 
and  the  World  Trade  Organization)  or  of  military 
alliances  (e.g.,  the  North  Atlantic  Treaty 
Organization  [NATO]  and  South  East  Asia  Treaty 
Organization  [SEATOJ)  where  decisions  are  made 
(with  their  participation,  of  course)  that  have  sig¬ 
nificant  domestic  implications.  Over  and  above 
this,  the  sovereignty  of  the  nation-state  is  also 
being  eroded  from  below.  At  least  in  Europe,  the 
unitary  and  centralized  state  is  being  superseded. 
New  regional  and  municipal  authorities  have 
emerged  and  have  become  institutionalized.  Not 


only  has  the  room  for  maneuver  of  the  national  or 
central  authorities  been  reduced,  but  the  cognitive 
context  itself,  within  which  their  actions  unfold, 
has  changed.  Our  conception  of  the  state  has  been 
modified. 

Globalization,  as  noted  by  Philippe  Schmitter, 
has  become  the  independent  variable  in  many 
national  contexts.  It  has  reduced  the  impermeabil¬ 
ity  of  the  nation-states  to  external  pressures.  This, 
in  turn,  has  weakened  relations  between  the  citi¬ 
zens  and  the  institutions  of  those  states,  as  soon  as 
the  former  have  become  aware  that  the  latter  are 
unable  to  respond  to  their  demands.  The  legiti¬ 
macy  of  public  institutions  has  been  further 
reduced  by  the  growing  role  that  noninstitutional 
actors  have  acquired  in  the  context  of  globaliza¬ 
tion.  These  actors  comprise  companies,  associa¬ 
tions,  and  transnational  nongovernmental  organi¬ 
zations  that  operate  outside  the  border  of  single 
states,  and  they  have  contributed  to  the  emergence 
of  new  supranational  regulative  systems  or  inter¬ 
national  regimes.  One  may  claim  that  no  nation¬ 
state  (not  even  the  most  important  ones  like  the 
United  States  or  China)  is  able  to  control  domestic 
decision-making  processes,  autonomously  steer  its 
own  economic  dynamics,  or  develop  its  own  sepa¬ 
rate  cultural  identity. 

If  globalization  has  challenged  the  assumptions 
of  comparative  politics,  this  is  even  truer  in  the 
case  of  the  Europeanization  induced  by  the  process 
of  European  integration.  There  are  many  defini¬ 
tions  of  Europeanization.  According  to  Vivien 
Schmidt,  it  consists  in  the  process  through  which 
the  political  and  economic  dynamics  of  the  EU 
have  become  part  of  the  institutional  and  cultural 
logic  of  domestic  politics.  However  one  defines 
Europeanization,  there  can  be  no  doubt  that  it 
consists  of  the  implementation,  at  the  level  of  the 
single  member  states,  of  institutional  procedures, 
public  policies,  and  cognitive  frameworks  to 
address  the  domestic  problems  deriving  from  the 
EU  level.  No  member  state  of  the  EU  (not  even 
those  most  proud  of  their  own  traditional  sover¬ 
eignty,  e.g.,  the  United  Kingdom  and  France,  or 
those  most  proud  to  have  recently  regained  sover¬ 
eignty,  e.g.,  Poland  and  the  Czech  Republic)  can 
decide  (in  myriad  areas  of  public  policy)  on  the 
basis  of  autonomous  considerations.  This  does  not 
mean  that  the  European  nation-states  have  disap¬ 
peared.  Rather,  it  is  even  plausible  to  argue,  as 
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Alan  Milward  has  done,  that  European  integration 
has  rescued  them,  taking  care  of  tasks  they  were  no 
longer  able  to  tackle.  This,  however,  means  that 
their  sovereignty  has  been  eroded  and  redefined. 
They  are  sovereign  in  some  areas  of  public  policy 
(e.g.,  defense  and,  partially,  foreign  affairs), 
whereas  they  are  entirely  bereft  of  sovereignty  in 
other  areas  (e.g.,  monetary  policy  and,  more  gener¬ 
ally,  all  areas  related  to  the  creation  of  the  com¬ 
mon  market).  In  yet  other  areas  (e.g.,  environmen¬ 
tal  and  research  policies),  they  share  sovereignty. 

It  is  clear  that  Europeanization  constitutes  a 
formidable  challenge  to  the  assumptions  of  com¬ 
parative  politics — namely,  to  conceive  of  the 
European  states  as  sovereign  and  independent 
units  that  are  autonomous  in  terms  of  decision 
making.  In  Europe  and  elsewhere,  the  dividing  line 
between  internal  (domestic)  politics  and  external 
(international)  politics  has  shifted  significantly 
(Robert  Elgie,  1999). 

Toward  an  International  Comparative  Politics 

Globalization  and  Europeanization  have  brought 
radical  transformations  to  the  states  and  to  the 
relations  between  them.  These  transformations  call 
into  question  the  traditional  distinction  between 
comparative  politics  and  international  relations. 
Regarding  comparative  politics,  the  sovereignty  of 
nation-states  has  been  fundamentally  questioned. 
Sovereignty  has  been  unpacked,  fragmented,  and 
segmented,  thereby  challenging  a  long  normative 
(and  hypocritical)  tradition  that  assumed  that  sov¬ 
ereignty  is  an  indivisible  reality.  The  same  supposed 
order  of  the  domestic  polity  is  dramatically  belied 
by  the  fact  that  most  of  the  major  conflicts  that 
occurred  during  the  1990s  have  happened  within 
states  rather  than  between  them.  Regarding  inter¬ 
national  relations,  it  is  no  longer  certain  that  the 
international  system  is  the  anarchic  world  that  has 
formed  the  basic  condition  of  interstate  relations. 
This  system  is  organized  in  several  international 
regimes  and  managed  by  several  international  orga¬ 
nizations,  each  of  them  equipped  with  tools  and 
norms  to  peacefully  adjudicate  disputes  between 
public  and  private  actors. 

Various  studies  have  recognized  that  the  episte¬ 
mological  and  methodological  boundaries  between 
the  two  subdisciplines  of  political  science  are  no 
longer  evident.  Scholars  developing  the  foreign 


policy  analysis  approach  have  included  the  domes¬ 
tic  structure  of  a  given  regime  in  their  analysis, 
showing  how  it  exerts  a  significant  influence  on 
the  decisions  and  styles  of  foreign  policy  of  a  coun¬ 
try.  Scholars  investigating  the  relations  between 
the  economic  structures  and  the  political  arrange¬ 
ments  of  various  countries  have  shown  the  interac¬ 
tions  between  international  pressures  and  domes¬ 
tic  arrangements  (with  their  effects  on  the  organi¬ 
zation  of  markets,  the  construction  of  welfare 
regimes,  the  organization  of  systems  of  representa¬ 
tion,  and  interest  intermediation).  One  could  men¬ 
tion  the  most  recent  literature  on  democratic 
peace,  which  has  sought  to  show  why  democratic 
countries  do  not  go  to  war  with  each  other, 
because  of  the  domestic  constraints  to  which  their 
decision  makers  are  subjected;  however,  this  does 
not  imply  that  they  are  not  inclined  to  go  to  war 
with  nondemocratic  countries.  Finally,  leading 
international  relations  scholars  have  continued  to 
work  with  models  connecting  international  and 
domestic  variables. 

These  and  other  studies  have  called  for  the 
development  of  an  integrated  political  science  that 
is  subdivided  by  the  topics  it  seeks  to  study  rather 
than  by  the  units  of  analysis  chosen  (the  domestic 
system  or  the  international  system).  A  political 
world  marked  by  complex  interdependence  calls 
for  a  political  science  ready  to  experiment  with 
new  methods  and  new  theories.  A  new  field  of 
study,  which  some  scholars  call  International 
Comparative  Politics,  might  be  developed  to  con¬ 
front  the  challenges  of  this  world.  However,  the 
structure  of  academic  careers,  still  rigidly  orga¬ 
nized  around  the  distinction  between  the  two  sub- 
disciplines,  will  make  such  development  difficult. 

Conclusion 

The  fundamental  transformations  induced  by  the 
processes  of  globalization  and  Europeanization 
have  ended  up  questioning  the  methodological  and 
theoretical  self-sufficiency  of  comparative  politics. 
These  processes  have  urged  scholars  of  compara¬ 
tive  politics  to  take  the  international  context  of  a 
country  into  account  as  an  essential  variable  in 
explanations  of  the  functioning  of  domestic  poli¬ 
tics.  Simultaneously,  the  effects  of  domestic  struc¬ 
tures  on  supranational  and  international  processes 
have  driven  international  relations  scholars  to 
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reexamine  the  methodological  and  theoretical  self- 
sufficiency  of  their  discipline.  Substantial  changes 
in  the  real  world  of  politics  are  urging  political 
scientists  to  develop  methods  and  theories  that  can 
come  to  terms  with  the  complex  domestic  and 
international  forces  that  shape  the  important  prob¬ 
lems  requiring  study  and  explanation.  After  all,  the 
undertaking  of  political  science,  as  of  all  other 
social  sciences,  is  justified  by  its  ability  to  furnish 
plausible  solutions  to  real  problems.  Accordingly, 
the  profession  should  not  be  afraid  to  question 
itself,  to  overcome  consolidated  divisions  between 
subdisciplines,  and  to  seek  new  perspectives.  A 
self-sufficient  political  science  serves  neither  politi¬ 
cal  scientists  themselves  nor  the  citizens  of  the 
contemporary  world. 
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Competition  is  understood  as  the  struggle  between 
two  or  more  agents  to  capture  scarce,  limited, 
and  valued  resources  within  a  defined  system  or 
context.  It  is  often  assumed  that  competition  is  a 
zero-sum  game,  in  which  one  participant’s  gain 
or  loss  is  exactly  matched  by  the  loss  or  gain  of 
another.  Political  competition  involves  the  strug¬ 
gle  for  power,  for  example,  through  elections  in  a 
democratic  regime.  It  is  a  concept  employed  in 
virtually  all  subdisciplines  of  political  science,  for 
it  is  a  universal  aspect  of  human  life.  Although  in 
one  way  or  another  everyone  is  affected  by  it,  a 
clear  definition  of  the  concept  remains  elusive. 
This  entry  is  structured  as  follows:  First,  the  term 
competition  is  isolated  from  other  concepts  with 
which  it  has  been  fused  or  mixed.  Second,  using 
an  analysis  of  measurement,  a  distinction  is 
drawn  between  the  two  most  important  areas  of 
political  competition  in  democracies:  the  electoral 
and  the  governmental.  Finally,  the  entry  differen¬ 
tiates  between  the  latest  developments  in  electoral 
and  governmental  competition  in  new  Third 
World  democracies  and  those  in  other  developed 
democracies. 


Definition  of  Competition 

One  of  the  reasons  for  the  concept’s  elusiveness  is 
that  the  term  has  often  been  conflated  with  com¬ 
petitiveness.  A  democracy  may  not  have  competi¬ 
tiveness  during  long  periods  of  time  despite  being  a 
competitive  regime  (as  in  the  example  of  predomi¬ 
nant  party  systems).  Indeed,  Robert  Dahl’s  defini¬ 
tion  of  polyarchy  allowed  for  the  free  exercise  of 
political  contestation  in — and  between — elections, 
but  it  never  implied  that  effective  competition  had 
to  occur.  Thus,  competition  comprises  a  possible 
competitiveness  but  not  otherwise.  In  the  words  of 
Giovanni  Sartori  (1976/2005),  “ Competition  is  a 
structure ,  or  a  rule  of  the  game.  Competitiveness  is 
a  particular  state  of  the  game ”  (p.  194,  emphasis 
original).  The  literature  is  rather  strong  in  claiming 
that  the  more  competitiveness  an  election  has,  the 
higher  the  probability  that  a  single  vote  can  affect 
the  outcome,  which  increases  the  expected  utility  of 
voting  and,  thereby,  voter  turnout.  Thus,  high  lev¬ 
els  of  competitiveness  are  strongly  related  to  the 
health  of  a  democracy. 

Areas  of  Competition: 

Elections  and  Government 

Contemporary  democracies  are  the  systems  in 
which  political  competition  occurs  par  excellence. 
Democracy  could  be  understood  as  a  system  of 
government  that  frames,  regulates,  and  limits 
political  competition  in  order  to  ensure  that  those 
elected  will  enjoy  broader  popular  support  and 
that  there  is  equal  opportunity  for  all  competitors. 
Of  course,  not  every  public  office  to  be  filled  in  a 
democracy  is  elected  (e.g.,  justices  of  the  highest 
court)  and  not  every  aspect  of  democratic  life  is 
subject  to  political  competition  (e.g.,  respect  for 
basic  human  rights  is  not  contingent  on  political 
competition  but  guaranteed  to  all).  The  scope  of 
political  competition’s  excluded  areas  is  still  a  fer¬ 
tile  ground  for  theoretical  discussion  (e.g.,  why  not 
elect  high  officials  through  lots  instead  of  votes 
and  why  not  select  the  best  instead  of  the  one  cho¬ 
sen  by  the  many?).  Despite  these  debates  about  the 
limits  of  political  competition  in  the  democratic 
realm,  political  competition  and  competitiveness 
are  clearly  observed  in  two  major  arenas  of  democ¬ 
racies:  the  electoral  and  the  governmental. 

Within  the  realm  of  electoral  competition,  Joseph 
Schumpeter  offered  perhaps  the  first  attempt  to 
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provide  a  working  definition  that  challenged  the 
classical,  more  normative  doctrine  of  democracy.  In 
his  view,  democracy  was  not  an  end  in  itself  but 
rather  a  political  competitive  process  or  struggle  for 
the  people’s  vote.  Schumpeter’s  “minimalist”  defini¬ 
tion  of  democracy  has  been  a  catalyst  for  innova¬ 
tion  in  this  literature.  Nonetheless,  it  is  only  after 
Anthony  Downs  that  the  formalization  of  political 
competition,  especially  among  political  parties,  was 
bolstered  by  the  analogy  to  markets.  Political  par¬ 
ties  are  crucial  organizations  in  contemporary 
democracies  as  competition  is  channeled  by  them. 
The  principal  way  to  understand  the  competition 
among  them  was  through  the  lens  of  partisanship: 
Parties  were  aggregates  of  vote-seeking  politicians. 
While  we  usually  see  political  competition  as  an 
ongoing  and  never-ending  process,  paradoxically, 
the  formalization  of  political  competition  has  almost 
the  sole  purpose  of  seeking  equilibrium — that  is,  an 
outcome  from  which  no  party  has  an  incentive  to 
deviate — something  hardly  ever  attained. 

Downs  delivered  a  model  of  political  competi¬ 
tion  of  candidates  with  respect  to  their  ideological 
position  in  a  single-issue  dimension.  Along  a  sin¬ 
gle-issue  dimension,  a  two-party  competition  is 
expected.  Parties  will  experience  strong  pressures 
to  converge  on  the  position  of  the  median  voter  if 
they  wish  to  avoid  electoral  defeat.  Important 
works  have  been  devoted  to  showing  that  this  pre¬ 
dicted  convergence  is  just  partial  in  real  life,  given 
that  Downs’s  theory  relies  on  highly  restrictive 
assumptions;  for  example,  much  of  this  literature 
has  been  centered  on  direct  democracy  or  voting  in 
committees,  where  very  few  players  are  involved. 
The  median  voter  theorem  produces  a  Nash  equi¬ 
librium,  in  which  no  player  has  anything  to  gain 
by  changing  strategies  unilaterally.  The  literature 
that  emanated  from  Downs’s  seminal  work  was 
strongly  influenced  by  Duncan  Black’s  median 
voter  model,  in  probably  what  is  one  of  the  found¬ 
ing  texts  of  social  choice  theory,  and  by  Kenneth 
Arrow’s  discussion  of  the  dilemmas  of  aggregating 
single-picked  individual  preferences  into  a  collec¬ 
tive  choice.  All  of  them  were  predated  by  Harold 
Hotelling’s  model  of  spatial  competition — the 
location  of  different  sellers  in  a  market  with 
respect  to  one  another — probably  the  pioneering 
paper  in  public  choice.  Here,  the  market  analogy  is 
more  than  an  analogy;  this  literature  really  does 
derive  from  the  idea  of  market  competition. 


Up  to  that  time,  the  literature  on  political  com¬ 
petition  was  notably  influenced,  and  inspired  by, 
the  bipartisan  background  of  the  U.S.  political 
system.  Although  U.S.  elections  have  tended  to 
exhibit  tremendous  competitiveness,  governing 
activity  exhibited  stable  patterns  of  competition 
(i.e.,  a  party  in  the  executive  and  basically  two 
large  parties  in  the  legislature  of  roughly  the  same 
size).  The  discussions  arise  concerning  how  the 
government  performs  when  the  party  that  holds 
the  executive  does  not  have  a  majority  in 
Congress — a  situation  usually  known  as  a  “divided 
government.  ” 

Yet  the  Downsean  perspective  was  not  unique 
in  observing  and  understanding  spatial  models  of 
party  competition.  George  Rabinowitz  and  Stuart 
Elaine  Macdonald,  for  example,  use  a  directional- 
spatial  model  of  electoral  competition  in  which 
utility  is  based  on  the  intensity  and  direction 
between  voters  and  candidates  in  contrast  to  the 
traditional  proximity  spatial  model,  which  repre¬ 
sents  utility  as  a  declining  function  of  distance 
between  voter  and  candidate  ideal  positions.  The 
directional-spatial  model  of  electoral  competition 
suggests  two  main  hypotheses:  The  first  is  that 
voters  prefer  candidates  who  are  on  their  side, 
and  the  second  is  that  given  two  equidistant  can¬ 
didates,  they  prefer  the  more  intense  and  more 
credible  to  their  cause.  From  the  directional  mod¬ 
els,  a  new  line  of  research  emerged — the  theory  of 
discounting — denoting  that  candidates  cannot 
fulfill  all  their  promises,  and  therefore,  voters  dis¬ 
count  the  campaign  and  judge  according  to  what 
they  expect  candidates  to  realistically  achieve  in 
government.  This  theory  emphasizes  the  role  of 
status  quo  in  setting  expectations  about  what  the 
candidates  can  really  do  and  assumes  a  program¬ 
matic  or  ideological  linkage  between  candidates’ 
and  voters’  understanding  that  the  electorate’s 
decisions  are  very  focused  on  policy  proposals  or 
ideological  stances. 

An  extremely  lucid  analysis  of  electoral  compe¬ 
tition  was  advanced  by  Adam  Przeworski  (1991). 
He  remarked  that  “democratization  is  an  act  of 
subjecting  all  interests  to  competition,  of  institu¬ 
tionalizing  uncertainty”  (p.  14).  Like  economic 
competition,  where  there  are  clear  limitations 
against  a  single  agent  imposing  a  price  or  product, 
political  competition  obeys  rules  among  relevant 
actors.  Thus,  the  institutionalization  of  uncertainty 
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has  three  main  features:  ex  ante  uncertainty  (any¬ 
one  can  win),  ex  post  irreversibility  (losers  do  not 
try  to  reverse  results),  and  repeatability.  Without 
any  of  these,  democracy  is  hardly  possible. 

Measures  and  Theory 

While  there  is  little  consensus  on  how  to  properly 
calculate  competition — students  of  democracy  and 
elections  have  developed  different  measures  of  this 
vague  concept — some  advances  are  evident  in  the 
literature.  For  example,  it  is  possible  to  measure 
political  competition  as  the  frequency  of  alterna¬ 
tion  in  power  over  a  delimited  period  of  time  or  by 
building  multidimensional  and  longitudinal  indices 
of  competition  for  a  few  political  units  (i.e.,  the 
American  states).  This  long-term  perspective  is  not 
very  useful  for  new  democracies,  in  which  just  a 
few  elections  may  have  taken  place.  Other  scholars 
have  measured  competition  as  the  winner’s  per¬ 
centage  of  the  votes,  the  percentage  margin  of  vic¬ 
tory,  or  the  raw  vote  margin  of  victory  in  elections. 
Such  measures  are  less  contingent  on  time  and 
provide  clearer  pictures  of  political  competition  in 
a  particular  moment,  yet  they  are  heavily  biased 
against  two-party  systems  because  margins  of  vic¬ 
tory  tend  to  be  smaller  in  multiparty  democracies. 

A  more  useful  measurement  of  competitiveness 
comes  from  Markku  Laakso  and  Rein  Taagepera’s 
widely  used  N  index,  which  is  a  weighted  average 
of  the  number  of  parties  that  compete  in  a  national 
political  system,  and  it  is  expressed  as  N  —  1  fYjPi , 
where  pt  is  the  share  of  seats  a  party  has  in  the 
legislature.  The  N  index  is  strongly  related  to  the 
I  Ic r f i  n d a h  1  -  H i  r sc  h  m  a n  Index,  an  indicator  of  the 
amount  of  competition  among  firms  in  relation  to 
the  industry,  which  is  denoted  as  H  —  l/^sj, 
where  si  is  the  market  share  of  firm  i  in  the  market. 
Though  both  indices,  the  N  and  the  H,  were  cre¬ 
ated  as  indices  of  diversity  (fragmentation)  of 
political  parties  or  firms,  they  are  frequently  used 
as  measurements  of  competitiveness.  The  higher 
the  index,  the  more  the  competition.  These  indices 
also  suffer  from  important  limitations,  such  as 
other  institutional  features  like  degree  of  propor¬ 
tionality  of  legislatures,  mathematical  distribu¬ 
tional  procedures  of  seat  allocation  (e.g.,  D’Hondt, 
Hare),  party  registration  limitations,  and  others. 

With  all  their  limitations,  the  Laakso  and 
Taagepera  or  Herfindahl-Hirschman  indices 


provide  an  accurate  view  of  the  correlation  of 
power  among  agents  in  a  given  arena  (the  legisla¬ 
ture  or  the  market  itself).  Still,  their  major  short¬ 
coming  is  that  little  is  said  in  regard  to  how  the 
legislature  or  the  market  would  perform.  Knowing 
that  counting  parties  is  not  sufficient  for  analyzing 
a  party  system,  Giovanni  Sartori  in  his  classic 
work  of  1976  presents  a  theory  of  the  dynamics  of 
political  competitiveness  in  regard  to  whether  cen¬ 
trifugal  or  centripetal  forces  act  in  a  given  party 
system,  which  is  an  improvement  over  the  earlier 
ones.  Recent  developments  have  tried  to  make  a 
more  subtle  connection  between  electoral  results 
and  governmental  activity.  For  instance,  David 
Altman  and  Anibal  Perez-Linan  developed  a  mea¬ 
sure  that  weights  the  sizes  of  the  “typical”  parties 
in  government,  G  —  YjSi /Yj&’i  ar*d  opposition 
O  =  X°? /X°'!  the  following  way:  C  =  1  — 
(IG  —  01/100).  The  value  of  C  tends  to  zero  when¬ 
ever  the  government  (or  the  opposition)  controls 
the  whole  legislature  and  to  one  if  there  is  balance 
between  government  and  opposition.  This  index 
provides  a  forecast  of  governmental  activity  and 
the  relationship  between  the  executive  and  legisla¬ 
tive  because  it  reflects  the  opposition’s  access  to 
the  legislative  process  rather  than  mere  electoral 
outcomes. 

In  spite  of  these  miscellaneous  indices,  a  whole 
body  of  research  was  detached  from  the  American 
model  and  jumped  the  Atlantic  in  search  of  new 
models  of  political  competition,  particularly  in  the 
context  of  governability  and  executive  coalition 
theories.  It  is  not  hard  to  understand  why  this  lit¬ 
erature  bloomed  in  the  context  of  the  study  of 
governing  coalitions  in  Western  European  parlia¬ 
mentary  democracies.  The  process  of  forming 
coalitions  in  parliamentarian  regimes  involves  a 
relatively  reduced  number  of  actors  (parties)  com¬ 
peting  in  a  closed  and  rule-bound  milieu  (parlia¬ 
ments)  for  clear  purposes  (membership  in  the 
executive  cabinet),  where  the  quantity  and  quality 
of  portfolios  are  crucial.  One  of  the  strengths  of 
this  literature  is  exactly  that  it  is  not  circumscribed 
by  political  competition,  but  it  necessarily  touches 
on  the  flip  side  of  the  coin — political  coopera¬ 
tion — and  hence  provides  a  more  comprehensive 
panorama  of  political  competition. 

Thus,  in  the  late  1950s  and  early  1960s,  in  the 
political  competition  literature,  mainly  via  the 
study  of  coalitions,  the  meaning  of  “getting  votes” 
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shifted  and  came  to  be  understood  as  a  means  to 
reach  other  objectives:  office,  and  then  policy  mak¬ 
ing.  Probably,  the  three  models  of  party  competi¬ 
tion  (votes,  offices,  and  policies)  are  relevant  to  all 
parties  at  any  instant  in  time,  although  more  so  at 
some  times  than  at  others.  Doubtless,  new  research 
on  governing  coalitions  in  presidential  regimes 
shows  that  parties  shift  their  objectives  in  synchro¬ 
nization  to  the  fixed  electoral  calendar.  Before  a 
national  election,  they  are  principally  concerned 
with  obtaining  votes.  After  elections,  party  mem¬ 
bers  strive  to  obtain  political  appointments  from 
elected  officials,  and  then  later,  policy-seeking 
behavior  ranks  first  in  their  preferences. 

This  literature  has  developed  a  great  deal  since 
William  Gamson’s  and  William  Riker’s  seminal 
works.  Gamson  theorized  that  parties  compete  for 
office  and  seek  their  payoff  (players’  utilities 
resulting  from  the  outcome  of  a  game),  which  is 
fair  as  it  is  directly  proportional  to  the  amount  of 
resources  that  they  contribute  to  a  coalition.  The 
prediction  of  his  theory  is  that  payoffs  in  any  gov¬ 
ernment  are  limited  and  therefore,  the  greater  the 
number  of  parties  in  a  coalition,  the  lower  is  the 
payoff  for  each.  Resources  are  limited  because,  for 
example,  a  governing  coalition  cannot  satisfy  all 
policy  demands  simultaneously  (e.g.,  lowering  and 
increasing  taxes).  As  a  result,  in  order  to  not  dis¬ 
tribute  unnecessary  payoffs,  Riker  predicted  the 
formation  of  winning  coalitions  of  minimum  size 
(meaning  the  number  of  parliamentary  seats  a 
coalition  of  political  parties  must  control  to  receive 
a  majority  vote  for  investiture). 

Since  then,  authors  have  devoted  themselves  to 
testing  variables  that  presumably  would  affect 
political  competition.  Robert  Axelrod  argues  that 
coalition  parties  are  likely  to  be  adjacent  in  a  one¬ 
dimensional  policy  space,  predicting  minimal  con¬ 
nected  winning  coalitions.  That  is,  parties  in  an 
executive  coalition  will  be  adjacent  to  each  other 
in  a  one-dimensional  policy  space,  and  also,  the 
coalition  will  not  include  any  more  parties  than 
necessary  to  be  connected  and  winning.  Others, 
such  as  Norman  Schofield,  have  centered  their 
arguments  on  the  importance  of  large  parties  and 
their  position  in  the  policy  space;  large  parties 
whose  policy  positions  are  located  in  the  core  of 
the  policy  space — that  is,  parties  with  policy  posi¬ 
tions  such  that  no  alternative  policy  is  preferred  by 
a  simple  majority — tend  to  form  minority  and 


surplus  majority  governments.  In  short,  most  of 
this  literature  on  political  competition  has  as  its 
hard  core  the  fragmentation  of  the  party  system 
and  each  party’s  location  and  coverage  of  the  ideo¬ 
logical  space. 

Although  a  vast  number  of  alternative  hypoth¬ 
eses  for  why  and  how  parties  compete  have  been 
put  forward,  three  facts  must  be  underlined.  First, 
little  research  has  been  done  on  how  differences  in 
party  organizations  affect  political  competition. 
Most  studies  from  a  rational  choice  perspective 
treat  parties  as  unitary  actors  that  bargain  over  a 
set  of  well-defined  gains — usually  office  or  policy. 
There  is  some  truth  in  this  assumption  but  more 
error.  Parties  are  complex  organizations,  where 
internal  divisions  and  structures  affect  how  they 
and  their  leaders  behave.  Important  works  in  the 
literature  have  already  underlined  the  problems 
attached  to  treating  parties  in  this  manner.  Despite 
the  fact  that  many  of  the  ordinary  political  phe¬ 
nomena  can  be  explained  by  treating  parties  as  if 
they  were  unitary  actors,  we  undoubtedly  need  to 
take  into  account  what  goes  on  inside  parties  if  we 
want  to  provide  an  explanation  of  party  decision 
making  in  a  model  of  some  political  course  of 
action.  While  some  scholars  stress  one  of  these 
motivations,  other  scholars  combine  them  in  one 
way  or  another. 

Second,  a  proper  specification  of  political  com¬ 
petition  requires  a  theory  whereby  parties  compete 
in  multidimensional  spaces,  because  even  if  one  is 
interested  only  in  one  narrow  particular  policy 
(say  taxes),  the  position  on  other  issues  will  affect 
the  equilibrium  in  that  narrow  particular  policy. 
Following  Downs,  most  commonly  used  theories 
of  political  competition  posit  a  unidimensional 
political  space  where  competition  occurs  or  at 
least — acknowledging  the  multidimensionality  of 
political  life — try  to  isolate  the  particular  dimen¬ 
sion  in  consideration.  Yet,  even  in  the  case  of  sin¬ 
gle-peaked  preferences  of  political  actors,  the 
results  might  be  unstable  and  inefficient  as  regards 
social  decisions,  as  was  clearly  demonstrated  by 
Condorcet’s  paradox  of  voting  and  Arrow’s  impos¬ 
sibility  theorem.  It  does  not  mean,  as  Gordon 
Tullock  shows,  that  unstable  social  decisions  gen¬ 
erate  total  chaos  or  instability  of  these  decisions 
because  institutions  play  a  crucial  role  in  suppress¬ 
ing  the  underlying  unsteadiness  (institutions  such 
as  agenda  setters,  party  discipline,  or  procedural 
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rules  in  parliaments).  Some  scholars  (e.g.,  Kenneth 
Shepsle)  call  the  equilibrium  resulting  from  these 
regulating  forces  a  structure-induced  equilibrium. 

Third,  the  regime  divide  has  been  minimized 
and  brought  into  the  discussion.  Incentives  for 
party  competition  are  quite  different  in  a  presiden¬ 
tial  system  than  in  a  parliamentary  one  for  four 
reasons.  First,  in  a  presidential  regime,  fewer  coali¬ 
tion  configurations  are  likely  because  one  party — 
the  president’s  party — is  normally  expected  to  be  a 
coalition  member.  The  rare  exceptions  to  this  rule 
so  far  have  not  undermined  this  normal  expecta¬ 
tion.  Second,  given  the  prominence  that  the  presi¬ 
dent’s  reputation  has  in  citizens’  evaluations  of 
governmental  performance  (either  positive  or  neg¬ 
ative),  the  president’s  coalition  partners  are  less 
able  to  claim  credit  for  good  government  perfor¬ 
mance.  Thus,  under  presidentialism,  players  have 
fewer  incentives  than  under  parliamentarism  to 
join  or  to  remain  in  a  coalition.  Consequently, 
third,  the  president’s  approval  rating  has  a  power¬ 
ful  effect  on  parties’  decisions  to  associate  them¬ 
selves  with,  or  to  distance  themselves  from,  the 
government.  And  fourth,  based  on  the  previous 
statement,  most  of  the  incentives  change  over  time 
in  synchronization  with  the  fixed  electoral  calen¬ 
dar.  Because  parties  know  when  the  next  elections 
will  take  place,  they  have  incentives  to  distinguish 
themselves  from  the  president  and  start  to  compete 
in  order  to  increase  their  chances  of  getting  elected 
in  the  next  election. 

Despite  important  progress  in  the  development 
of  coalition  theories,  the  jury  is  still  out  on  certain 
issues.  All  theories  of  coalition  formation  and  sur¬ 
vival  under  parliamentary  regimes  make  assump¬ 
tions,  hidden  or  explicit,  about  political  parties’ 
preferences  and  their  concomitant  logics  of  compe¬ 
tition.  Michael  Laver  provides  new  extensions  of 
policy-driven  political  competition  into  a  multi¬ 
party  environment  where  voters  constantly  evalu¬ 
ate  party  support  and  switch  parties  to  augment 
their  expectations;  parties  continually  readapt 
policy  positions  to  the  shifting  affiliations  of  vot¬ 
ers.  Political  competition  is  based  on  four  clear 
types  of  party  behaviors: 

1.  predator  (move  party  policy  toward  the  policy 
position  of  the  largest  party), 

2.  aggregator  (adapt  party  policy  to  the  ideal 
policy  positions  of  party  supporters), 


3.  hunter  (repeat  policy  moves  that  were  rewarded; 
otherwise  make  random  moves),  and 

4.  sticker  (never  change  party  policy). 

Laver’s  contribution  provides  the  first  steps 
toward  endogenizing  key  features  of  the  process  of 
competition,  including  the  birth  and  death  of  par¬ 
ties,  internal  party  decision  rules,  and  voter  ideal 
points. 

Developments  in  New 
Third  World  Democracies 

At  the  risk  of  making  a  gross  generalization,  some 
countries  of  what  is  often  called  the  Third  World 
are  lately  showing  signs  of  new  patterns  of  elec¬ 
toral  and  governmental  competition.  These  pat¬ 
terns  are  characterized  by  the  emergence  of  per- 
sonalistic  leaderships,  hegemonic  actors,  and  a 
tendency  to  institutional  innovation  rather  than 
consolidation,  all  in  the  context  of  high  levels  of 
civic  disaffection,  distrust  of  political  parties,  and 
in  general,  animosity  toward  democratic  actors 
and  institutions.  The  arena  of  electoral  competi¬ 
tion  involves  new  forms  of  citizen  mobilization 
through  a  large  dose  of  social  polarization,  often 
fed  from  above.  Once  in  power,  in  the  realm  of 
governmental  competition,  leaders  provide  com¬ 
prehensive  packages  of  institutional  reforms, 
endorsed  by  constituent  assemblies  and  usually 
ratified  by  direct  popular  votes  (plebiscites).  The 
proposed  reforms  more  often  than  not  include  the 
reelection  of  elected  officials,  particularly  of  presi¬ 
dents,  and  a  concentration  of  power  in  the  execu¬ 
tive  at  the  expense  of  others,  primarily  the  judi¬ 
ciary.  This  is  particularly  relevant  in  Latin  America 
(e.g.,  in  Venezuela,  Bolivia,  and  Ecuador)  but  is 
not  confined  to  this  region. 

In  short,  political  competition  is  probably  one 
of  the  most  complex  concepts  in  political  science, 
and  as  such,  a  clear  definition  of  it  is  likely  to 
remain  hard  to  pin  down.  In  any  case,  as  a  concept 
that  touches  on  every  aspect  of  political  life,  polit¬ 
ical  competition  will  remain  at  the  center  stage  of 
political  science. 

David  Altman 
Pontificia  Universidad  Catolica  de  Chile 

Santiago,  Chile 
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Complexity 


The  science  of  complexity  in  the  natural  and 
social  sciences  involves  the  use  of  formalized 
(mathematical  or  computational  or  both)  theo¬ 
retical  models  to  study  systems  of  interacting 
agents  where  the  following  hold: 

1.  Each  agent’s  behavior  is  governed  by  a  small 
set  of  simple  rules,  which  often  depend  on  local 
information  and  feedback  from  the  agent’s  past 
behavior  and  from  other  agents’  behavior. 


2.  Characterizing  and  understanding  the  behav¬ 
ior  of  each  of  the  agents  does  not  directly  lead  to 
predicting  or  understanding  the  behavior  of  the 
entire  system.  The  local  rules  produce  emergent 
patterns — stable  equilibria,  cycles,  unstable  equi¬ 
libria  and  long  transitions  to  new  equilibria,  and 
randomness — and  emergent  properties  such  as 
robustness. 

3.  Agents’  interactions  are  interdependent  and 
affect  others  in  the  system;  thus,  removing  an 
agent  has  consequences  for  the  system  beyond 
merely  subtracting  out  that  single  agent’s  direct 
effect  on  other  agents  with  which  it  interacts. 

This  entry  presents  the  foundations  of  complex 
system  modeling  and  points  to  diverse  applications 
in  political  science. 

Foundations 

It  is  said  that  Thomas  Schelling  proposed  the  first 
modern  complex  systems  model  in  the  social  sci¬ 
ences.  In  research  in  the  1960s  into  the  phenome¬ 
non  of  racial  segregation  in  housing,  he  modeled  a 
process  where  people  (agents)  decide  where  to  live 
based  on  the  racial  mix  of  their  neighborhood. 
Schelling  randomly  placed  nickels  and  dimes  on  a 
checkerboard,  with  each  coin  resting  on  only  one 
square  and  with  fewer  coins  than  available  squares. 
He  then  randomly  picked  one  of  the  coins,  say  a 
dime,  and  examined  its  neighborhood,  meaning 
the  squares  that  bordered  the  square  on  which  the 
dime  stood.  There  will  be  eight  squares  in  the 
neighborhood  (unless  it  rested  on  the  edge  of  the 
board,  which  we  shall  ignore  for  this  example).  If 
the  neighborhood  consisted  of  fewer  than  5  nick¬ 
els,  Schelling  did  nothing.  Otherwise,  he  moved 
the  dime  to  another  place  on  the  board  where 
there  were  fewer  than  5  nickels  in  the  neighbor¬ 
hood.  He  then  iteratively  moved  through  each  coin 
and  either  moved  it  somewhere  else  or  kept  it  in 
place,  using  the  5/8ths  rule  for  tolerance  of  neigh¬ 
bors  of  the  other  type.  Schelling  argued  that  5/8ths 
was  a  pretty  tolerant  threshold,  in  that  his  coins 
(agents)  would  stay  put  even  if  half  of  their  neigh¬ 
borhood  consisted  of  the  other  type.  He  showed 
that  even  with  this  relatively  tolerant  threshold, 
the  dimes  and  nickels  segregated  into  homoge¬ 
neous  zones  on  the  checkerboard.  He  then  started 
over  and  examined  what  happened  with  other 
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kinds  of  thresholds  (1/2,  3/4,  etc.)  and  how 
quickly,  if  at  all,  the  agents  segregated  themselves 
incrementally. 

Note  what  happens  in  the  dynamics  of  the 
Schelling  model.  When  a  coin  is  moved  to  another 
area  of  the  checkerboard,  it  changes  its  own  neigh¬ 
borhood  and  the  neighborhood  to  which  it  moves. 
A  single  move  alters  the  diversity  of  the  neighbor¬ 
hoods  of  multiple  agents  and  thus  potentially 
alters  the  decisions  of  multiple  other  agents.  This  is 
an  example  of  the  kind  of  feedback  that  occurs  in 
complex  systems.  Further,  it  is  difficult  to  predict 
simply  from  the  decision  rule  of  the  agents  how  the 
system  will  behave.  Schelling  confessed  that  he 
could  not  predict  in  advance  when  and  if  the  sys¬ 
tem  would  segregate  into  neighborhoods  of  all 
dimes  and  all  nickels. 

Today,  researchers  using  complex  systems 
methods  typically  rely  on  computer  programs  to 
simulate  the  interactions  of  artificial  agents,  exam¬ 
ining  the  aggregate  patterns  that  emerge  from 
many  instances  of  microlevel  behaviors.  (Some 
additional  terminology:  What  is  sometimes  called 
agent-based  modeling  is  the  use  of  computation  to 
study  the  links  between  microbehavior  of  agents 
and  macropatterns  in  a  complex  system.) 

As  can  be  seen  in  the  Schelling  example,  it 
would  be  misleading  to  say  that  this  branch  of  sci¬ 
ence  involves  the  study  of  highly  complex  models. 
As  with  any  theoretical  modeling  in  the  sciences, 
complex  systems  modeling  involves  making  sim¬ 
plifying  assumptions  to  craft  a  model  of  behavior, 
analyzing  the  consequences  of  those  assumptions, 
and  then  linking  the  model  to  real-world  phenom¬ 
ena.  The  use  of  the  word  complex  refers  to  the 
characteristics  of  the  social  system  under  study  and 
not  to  the  approach  of  the  modeler.  Many  com¬ 
plex  systems  models,  such  as  the  Schelling  exam¬ 
ple,  are  extremely  simple  in  their  construction. 

Applications  in  Political  Science 

A  variety  of  social  systems  have  been  characterized 
as  complex,  and  accordingly,  in  social  science  disci¬ 
plines,  the  use  of  complex  systems  modeling  has 
grown.  Within  political  science,  complex  systems 
modeling  has  been  applied  to  the  study  of  interna¬ 
tional  diplomacy  and  war,  electoral  competition, 
voting  systems,  the  spread  of  culture,  the  evolution 
of  cooperative  behavior,  criminal  behavior  and 


punishment,  the  sorting  of  people  into  communities, 
political  networks,  and  the  development  of  law. 

Researchers  studying  diplomacy  and  war,  for 
instance,  have  devised  modeling  frameworks  to 
analyze  the  sensitivity  of  international  alliances  to 
changes  in  resource  inequalities  across  alliance 
partners.  When  a  developing  country  discovers  oil, 
what  effect  does  this  have  on  military  or  trade  alli¬ 
ances?  Complex  systems  models  can  enable 
researchers  to  analyze  the  system-level,  often  non¬ 
linear,  effects  of  such  changes. 

Using  complex  systems  models  on  collective 
action  and  cooperation,  researchers  learned  of  the 
importance  of  reciprocity  and  forgiveness  in  the 
evolution  of  cooperation.  For  a  long  time,  scholars 
puzzled  over  the  question  of  how  cooperative 
behavior  arises  among  selfish  individuals.  Recently, 
researchers  have  made  new  discoveries  about  the 
importance  of  “tagging”  or  noninformative  mark¬ 
ers  on  agents  that  permit  discrimination  and  dis¬ 
cernment  in  choosing  partners.  Such  tags  can  actu¬ 
ally  promote  more  cooperation  among  all  agents 
rather  than  create  groupings  that  shut  out  others. 
And  in  a  practical  application  of  these  types  of 
models,  some  have  analyzed  social  systems  where 
some  agents  can  choose  to  commit  crimes  (citi¬ 
zens)  and  others  can  choose  to  enforce  community 
norms  (police).  These  models  have  led  to  discover¬ 
ies  about  the  trade-offs  among  different  strategies 
for  fighting  crime.  Societies  face  trade-offs  between 
devoting  resources  to  monitoring  behavior  and  to 
stricter  punishments  for  criminals.  The  conse¬ 
quences  of  particular  strategies  for  law  enforce¬ 
ment  are  difficult  to  understand  without  models 
that  specifically  incorporate  feedback  effects  from 
agent  behavior  to  the  development  of  new  agent 
strategies. 

Researchers  have  also  used  complex  systems 
modeling  to  discover  the  conditions  under  which 
a  group  of  diverse  individuals  with  different  ways 
of  framing  a  problem  can  choose  more  effective 
public  policies  than  a  group  of  homogeneous  indi¬ 
viduals  who  all  frame  the  same  problem  identi¬ 
cally.  Diversity,  these  models  tell  us,  can  in  com¬ 
mon  circumstances  improve  collective  decision 
making  in  legislatures,  councils,  organizations, 
and  committees. 

Research  into  social  and  political  networks  has 
increasingly  included  complex  systems  modeling. 
There  are  models  that  help  researchers  understand 
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better  the  conditions  in  a  social  network  that 
encourage  innovation  and  the  spread  of  innova¬ 
tions  among  agents  in  that  network. 

Complex  systems  modeling  is  sometimes  seen  as 
a  challenge  to  traditional  game-theoretical  meth¬ 
ods  of  modeling.  But  most  practitioners  of  com¬ 
plex  systems  modeling  within  political  science 
view  it  as  complementary  to  game-theoretical 
methods.  Not  only  are  quite  a  few  well-known 
complex  systems  models  within  political  science 
based  fundamentally  on  game  theoretical  tech¬ 
niques,  but  the  researchers  themselves  have  typi¬ 
cally  published  work  spanning  a  variety  of  model¬ 
ing  tools,  including  game  theory.  Well-known 
complexity  models  depict  iterated  prisoner’s 
dilemma  situations.  Agents  in  these  models  play 
repeated  prisoner’s  dilemma  games  with  each 
other,  and  researchers  seek  to  discover  the  kinds  of 
strategies  that  do  well  in  environments  with  agents 
that  can  alter  their  strategies  to  get  ahead. 

A  large  portion  of  complexity  research  in  politi¬ 
cal  science  focuses  attention  on  models  where  the 
agents  are  not  fully  rational  in  the  manner  defined 
by  economists  and  many  other  social  scientists. 
That  is,  agents  are  portrayed  as  adaptive  and 
boundedly  rational.  This  certainly  characterizes  the 
Schelling  agents.  More  generally,  this  means  that 
agents  display  a  subset  of  the  following  features: 

1.  they  do  not  fully  optimize  their  utility  functions 
given  their  information; 

2.  they  are  not  forward-looking  in  being  able  to 
predict  accurately  the  probability  of  certain 
outcomes  in  the  future,  given  their  behavior; 
and 

3.  they  are  myopic  in  that  they  do  not  have 
information  outside  of  some  defined  local 
“zone”  of  interaction  or  geography. 

Many  game-theoretical  models  in  political  sci¬ 
ence  have  agents  with  one  or  more  of  these 
features — modelers  through  various  assumptions 
end  up  restricting  the  information  or  actions  avail¬ 
able  to  agents.  Nevertheless,  one  generalization 
that  can  be  made  is  that,  at  heart,  game  theory 
relies  on  the  assumptions  following  the  work  of 
John  von  Neumann,  Oskar  Morgenstern,  and 
John  Nash  that  agents  are  rational  (optimize)  and 
intelligent  (they  know  the  game  they  are  in  as  well 


as  the  modeler  does).  Complexity  systems  models 
can  violate  both  of  these  assumptions,  especially 
the  latter.  Note  in  the  Schelling  model  how  the 
agents  are  assumed  to  make  decisions  solely  on  the 
mix  of  other  agents  in  their  limited  neighborhood. 
They  do  not  consider  the  implications  of  their 
actions  in  the  long  run  or  what  happens  outside 
their  own  neighborhood  (either  the  one  they  leave 
or  the  one  they  move  to). 

In  fact,  some  commentators  have  identified 
insight  gained  from  studying  systems  of  adaptive, 
less  than  fully  rational  agents  as  one  of  the  notewor¬ 
thy  benefits  of  using  complexity  methods.  There  is 
nothing  inherent  in  the  methodology  that  requires 
that  agents  be  boundedly  rational  or  adaptive,  but 
it  is  clearly  a  common  feature  of  such  models. 

Consider  the  recent  work  on  political  party  sys¬ 
tems  using  complex  systems  modeling  of  electoral 
competition.  In  the  work  of  one  researcher,  adap¬ 
tive  political  parties  were  assumed  to  be  one  of 
four  types,  each  boundedly  rational  but  in  a  differ¬ 
ent  way.  Hunter  parties  change  their  ideological 
positions  in  a  hill-climbing  pattern;  that  is,  they 
keep  changing  in  the  same  ideological  direction  as 
long  they  continue  to  win  more  votes.  Predator 
parties  change  their  ideological  positions  to  mimic 
the  most  popular  political  party.  Aggregator  par¬ 
ties  adopt  the  most  popular  ideological  position 
among  their  existing  supporters  in  the  electorate. 
And  Sticker  parties  do  not  change  their  ideologies. 
The  researcher  attempted  to  simulate  the  party 
politics  of  Ireland  using  this  model,  matching  up 
the  type  of  party  to  a  real-world  example.  For 
instance,  the  researcher  concluded  that  the  Fine 
Gael  party  was  like  a  Hunter,  while  the  Democratic 
Left  was  like  a  Sticker. 

What  is  innovative  about  this  work  is  the  incor¬ 
poration  of  real-world  data  on  micromotivations 
directly  into  the  assumptions  of  a  model.  Then, 
after  a  series  of  complex  systems  simulations,  the 
researcher  compared  the  distribution  of  votes 
among  the  artificial  parties  and  the  ideological 
positions  of  the  artificial  parties  with  real-world 
data  on  parties  from  Ireland.  The  methodology 
permits  researchers  to  unpack  the  dynamics  of  the 
system  by  running  the  simulations  under  various 
scenarios,  discovering  what  aspects  of  the  model 
are  driving  which  system-level  results. 

Complex  systems  models  are  best  described  as  a 
set  of  tools  for  analysis.  Researchers  consider  what 
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they  do  as  providing  a  platform  for  the  study  of 
specific  research  questions.  For  example,  what  can 
cause  the  international  system  to  fragment  into 
multiple  centers  of  power,  such  as  what  happened 
in  the  early  20th  century  prior  to  World  War  I? 
Alternatively,  what  causes  the  international  system 
to  fall  into  a  pattern  where  there  is  one  dominant, 
hegemonic  power,  such  as  what  happened  after 
1990?  The  models  enable  researchers,  equipped 
with  plausible  assumptions  about  microlevel  moti¬ 
vations  and  interactions  among  agents,  to  devise 
counterfactuals,  with  careful  attention  to  interac¬ 
tions,  feedback,  and  nonlinear  effects.  Researchers 
can  discover  the  causes  of  macrolevel  dynamics 
switching  from  one  phase  (e.g.,  multipolarity  in  the 
international  system)  to  another  (e.g.,  unipolarity). 
In  this  sense,  complex  systems  researchers  provide 
something  analogous  to  experimental  laboratories. 
(There  are  sophisticated  software  programs  that 
enable  students  and  researchers  to  develop  complex 
systems  models  even  if  they  have  only  limited  train¬ 
ing  in  writing  computer  code.)  The  models  are  plat¬ 
forms  for  researchers  to  study  a  variety  of  questions 
in  careful  detail,  especially  about  how  microlevel 
interactions  aggregate  into  macrolevel  patterns. 
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Ann  Arbor,  Michigan,  United  States 
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Compliance 


Compliance,  simply  put,  is  the  extent  to  which  an 
actor  fulfils  its  obligations.  In  the  realm  of  inter¬ 
national  relations,  scholars  are  most  concerned 
with  state  compliance  and  have  focused  their 
attention  on  the  degree  to  which  states  conform  to 
the  prescriptions  and  proscriptions  stipulated  by 
their  international  commitments. 

Interest  in  compliance  has  burgeoned  as  states 
increasingly  construct  and  enter  into  rule-based 
governance  arrangements  to  guide  behavior  and 
solve  problems  in  a  variety  of  issue  areas,  such  as 
trade,  security,  human  rights,  and  the  environ¬ 
ment.  Scholarship  on  international  cooperation, 
regimes,  law,  and  institutions  has  focused  on  why 
states  do  or  do  not  comply  with  their  international 
obligations  and  what  conditions  and  institutional 
features  facilitate  compliance. 

The  subject  of  compliance  is  often  connected  to 
the  debate  over  the  importance  and  efficacy  of 
international  institutions,  but  the  relationship 
between  compliance  and  effectiveness  is  a  murky 
one.  There  is  widespread  agreement  that  compli¬ 
ance  is  not  sufficient  for  effectiveness;  however, 
there  is  a  debate  over  whether  and  to  what  extent 
high  levels  of  compliance  are  necessary  for,  or  at 
least  associated  with,  consequential  institutions 
and  effective  agreements. 

Scholars  from  the  realist  tradition  are  generally 
skeptical  that  international  institutions  and  law 
are  significant  determinants  of  state  behavior.  In 
the  view  of  the  majority  of  realists,  power  relations 
are  the  most  reliable  predictor  of  international 
outcomes,  and  states  often  strive  to  avoid  comply¬ 
ing  with  inconvenient  obligations.  In  cases  where 
compliance  rates  are  high,  realists  tend  to  see  the 
outcome  as  the  result  of  shallow  agreements  that 
reflect  the  law  of  the  least  ambitious  party  or  as 
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merely  codifications  of  what  states  would  have 
done  even  in  the  absence  of  an  agreement. 

Other  perspectives,  such  as  neoliberal  institu¬ 
tionalism  and  social  constructivism,  are  more  san¬ 
guine  regarding  the  potential  influence  of  interna¬ 
tional  institutions  and  agreements  Neoliberal  insti¬ 
tutionalism,  with  its  rationalistic  metatheoretical 
orientation  and  focus  on  constellations  of  interests, 
sees  high  levels  of  compliance  with  international 
rules  as  a  possibility.  Moreover,  compliance  or  even 
good-faith  efforts  that  fall  short  of  full  compliance, 
together  with  other  factors  such  as  the  achievement 
of  policy  goals  and  behavioral  change,  can  be 
treated  as  an  indicator  for  evaluating  the  impact  of 
international  institutions  and  agreements.  This  per¬ 
spective  holds  that  states  operate  according  to  a 
logic  of  consequence  and  that  institutional  design 
matters  in  influencing  how  states  will  calculate 
their  interests  and  goals  and  evaluate  the  costs  and 
benefits  of  compliance  or  noncompliance. 
Institutional  properties  and  mechanisms,  such  as 
monitoring  capabilities  and  verification  systems 
(e.g.,  the  International  Atomic  Energy  Agency’s 
safeguard  system  fulfills  this  role  in  the  case  of  the 
nuclear  nonproliferation  regime),  can  result  in 
increased  transparency,  greater  information,  and 
the  creation  of  material  (sanctions,  loss  of  privi¬ 
leges)  and  social  (reputational)  costs  for  noncom- 
pliant  behavior. 

Social  constructivist  and  normative  approaches 
take  the  position  that  international  institutions  and 
regulations  are  meaningful  and  that  their  norms  and 
rules  tend  to  enjoy  widespread  compliance.  As  the 
legal  scholar  Louis  Henkin  (1979,  p.  47)  concluded, 
“Almost  all  nations  observe  almost  all  principles  of 
international  law  and  almost  all  of  their  obligations 
almost  all  of  the  time.”  Social  constructivism,  which 
has  a  sociological  metatheoretical  orientation, 
emphasizes  the  interplay  between  norms  and  iden¬ 
tity  and  stresses  knowledge  and  communication 
dynamics  to  explain  state  behavior.  Constructivists 
see  states  as  role  players  and  have  found  that  inter¬ 
national  institutions  can  play  an  important  role  in 
socializing  states  to  accept  and  internalize  particular 
norms  and  rules.  In  their  view,  decisions  about  com¬ 
pliance  are  driven  by  a  logic  of  appropriateness  that 
reflects  intersubjective  normative  understandings 
and  identity  concerns.  Behavior  is  not  strictly  deter¬ 
mined  by  rational  calculations  of  the  state’s  interests 
but  by  deeper  norms  and  shared  beliefs  about  what 
actions  and  policies  are  legitimate  and  appropriate 


in  international  relations  (Thomas  Franck,  1990, 
pp.  205-207).  Therefore,  from  this  perspective,  the 
decision  to  comply  with  international  provisions 
concerning  issues  such  as  slavery,  piracy,  human 
rights,  the  use  of  chemical  weapons,  and  the  acqui¬ 
sition  of  nuclear  weapons  serves  the  important 
function  of  both  shaping  and  reflecting  a  state’s 
identity. 

Promoting  Compliance  and 
Coping  With  Noncompliance 

As  Helmut  Breitmeier,  Oran  Young,  and  Michael 
Ziirn  (2006)  have  noted,  while  states  often  comply 
with  international  rules,  compliance  problems 
remain  a  persistent  challenge.  For  this  reason, 
scholars  have  attempted  to  identify  the  factors  that 
foster  compliance  and  have  debated  the  best  way 
to  reduce  noncompliance,  whether  it  be  deliberate, 
inadvertent,  or  due  to  lack  of  capacity.  The  man¬ 
agement  and  enforcement  approaches  provide 
contending  views  of  the  most  important  determi¬ 
nants  of  compliance. 

The  management  approach  has  stressed  the 
importance  of  transparency,  dialogue  among  the 
agreement’s  parties,  dispute  resolution  procedures, 
and  technical  and  financial  assistance  to  promote 
compliance  and  has  downplayed  the  need  for  puni¬ 
tive  enforcement  mechanisms,  such  as  sanctions. 
Advocates  of  the  managerial  school  argue  that  this 
approach  usefully  addresses  the  primary  types  of 
noncompliance.  The  provision  of  resources  or 
expertise  can  address  noncompliance  that  has 
resulted  from  a  lack  of  capacity.  Clarification  pro¬ 
cedures,  for  example,  can  be  used  to  address 
ambiguous  behavior  that  other  parties  might  per¬ 
ceive  as  possible  noncompliance.  Finally,  mecha¬ 
nisms  that  provide  transparency  can  deter  and 
expose  purposeful  breaches.  “For  a  party  deliber¬ 
ately  contemplating  violation,  the  high  probability 
of  discovery  reduces  the  expected  benefits  rather 
than  increasing  the  costs  and  would  thus  deter  vio¬ 
lation  regardless  of  the  prospects  of  sanctions” 
(Abram  Chayes  &  Antonia  Handler  Chayes,  1993, 
p.  iS). 

The  enforcement  approach  criticizes  the  manage¬ 
rial  school  for  underestimating  the  importance  of 
sanctions  and  punitive  measures.  Enforcement  schol¬ 
ars  have  argued  that  even  in  the  case  of  environmen¬ 
tal  regimes,  which  are  the  source  of  many  manageri¬ 
alist  examples  demonstrating  high  compliance 
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levels,  “enforcement  plays  a  greater  role  in  suc¬ 
cesses  than  one  is  led  to  believe  and  its  absence  is 
conspicuous  in  some  notable  failures”  (George 
Downs,  David  Rocke,  &  Peter  Barsoom,  1996, 
p.  395).  They  hold  that  in  matters  of  deep  coopera¬ 
tion,  such  as  security,  there  is  always  the  possibility 
that  the  benefits  of  cheating  might  be  too  great  to 
be  offset  solely  by  transparency.  According  to 
advocates  of  the  enforcement  approach,  elaborate 
verification  measures  attest  to  the  importance  of 
transparency,  but  to  believe  that  the  power  of 
transparency  can  be  exclusively  depended  on  to 
compel  compliance  in  the  absence  of  sanctions  is 
naive.  Therefore,  the  enforcement  approach  has 
stressed  monitoring  activities  in  conjunction  with 
potent  sanctions.  Ideally,  from  this  point  of  view, 
multilateral  agreements  should  be  equipped  with 
mechanisms  that  provide  incentives  (resource  car¬ 
rots)  for  compliance  while  applying  strong  sanc¬ 
tions  (punishing  sticks)  for  noncompliance. 

Rather  than  treating  the  management  and 
enforcement  approaches  as  diametrically  opposed 
propositions,  some  scholars  have  argued  that  well- 
designed  international  regimes  or  agreements  would 
benefit  from  including  elements  recommended  by 
both  approaches.  They  contend  that  the  provision 
of  help  and  assistance,  and  clarification  procedures, 
followed  by  investigation,  followed  by  inspection, 
followed  by  detection,  followed  by  material  and 
social  sanctions  is  better  thought  of  as  a  continuum 
or  process  to  be  carried  out  rather  than  as  a  choice 
between  preferred  extremes.  For  example,  a  study 
by  David  Victor  (1998)  of  the  noncompliance  pro¬ 
cedure  of  the  Montreal  Protocol — the  regime 
designed  to  protect  the  stratospheric  ozone  layer — 
found  that  the  protocol’s  implementation  commit¬ 
tee  was  most  effective  when  it  blended  the  two 
approaches.  “Management  avoids  the  most  severe 
and  unproductive  antagonism,  but  the  credible 
threat  of  tougher  actions,  including  sanctions,  helps 
ensure  cooperation,  especially  when  dealing  with 
parties  who  are  unswayed  by  management  alone” 
(Victor,  1998,  p.  139). 

An  important  study  that  used  a  relational  data¬ 
base  covering  23  international  environmental 
regimes  found  elements  of  both  the  management 
and  enforcement  approaches  wanting  and  con¬ 
cluded  that  neither  was  able  to  convincingly  “explain 
patterns  of  compliance  with  international  regimes” 
(Breitmeier  et  ah,  2006,  p.  110).  Although  this 
study  confirmed  that  institutionalized  enforcement 


measures  and  strong  verification  procedures  were 
indeed  crucial,  it  also  found  that  the  compliance 
pull  exerted  by  legal  rules  that  enjoyed  widespread 
legitimacy  was  more  important  than  commonly 
understood.  According  to  their  findings, 

adequate  and  even  impressive  rates  of  compliance 
with  international  environmental  rules  occur  when 
appropriate  incentive  mechanisms  are  coupled 
with  juridification,  participation  on  the  part  of 
transnational  NGOs  in  the  rule-making  process, 
and  a  responsive  approach  to  the  development  of 
compliance  mechanisms  over  time.  (p.  112) 

In  the  years  ahead,  additional  work — method¬ 
ological  and  conceptual — will  be  needed  to  help 
us  better  judge  to  what  extent  states  are  comply¬ 
ing  with  their  international  obligations  and  more 
effectively  evaluate  to  what  extent  international 
governance  arrangements  are  actually  achieving 
their  goals.  Moreover,  more  effort  needs  to  be 
expended  to  improve  our  understanding  of  the 
relationship  between  compliance  and  the  influ¬ 
ence  exerted  by  international  institutions, 
regimes,  and  agreements.  The  dense  thicket  of 
long-standing  and  emerging  transnational  prob¬ 
lems  that  demand  international  solutions  ensures 
that  the  subject  of  international  cooperation  and 
the  challenge  of  compliance  will  be  of  enduring 
relevance  and  interest  to  scholars  and  practitio¬ 
ners  alike. 

Charles  F.  Parker 
Uppsala  University 
Uppsala,  Sweden 
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Concept  Formation 


Concepts  are  central  to  the  enterprise  of  political 
science.  The  concepts  we  use  shape  the  world  we 
see.  Without  solid  conceptual  foundations,  the 
edifice  of  political  science  is  insecure.  If  we  fail  to 


develop  clear  and  precise  concepts,  our  theoretical 
insights  and  empirical  discoveries  will  fail  to  be 
clear  and  precise,  too.  This  entry  reviews  major 
pitfalls  for  conceptual  analysis  as  well  as  the  fun¬ 
damental  challenges  to  concept  formation  and 
conceptual  innovation  in  the  study  of  politics. 

In  contemporary  political  science,  concept  for¬ 
mation  is  often  regarded  as  a  distraction,  a  mere 
prelude  to  serious  research  that  is  given  scarce 
attention.  Scholars  sometimes  ignore  conceptual 
disputes,  resolve  them  by  fiat,  or  delegate  their 
resolution  to  political  philosophers.  At  the  same 
time,  a  strong  tradition  of  self-conscious  and  sys¬ 
tematic  concept  analysis,  resting  on  the  pioneering 
work  of  Giovanni  Sartori,  David  Collier,  and  oth¬ 
ers,  does  exist  in  the  discipline.  This  entry  offers  an 
analytical  synthesis  that  weaves  together  insights 
of  conceptual  debate  in  both  philosophy  and 
political  science. 

Conceptual  Commitments  Since  its  origins  in 
ancient  Greece,  Western  philosophy  has  been 
debating  the  nature  and  meaning  of  concepts.  For 
centuries,  thinkers  tried  to  resolve  one  fundamen¬ 
tal  problem:  the  relation  between  the  world  and 
the  mind,  the  objective  and  the  subjective,  things 
and  ideas.  They  conceived  the  mind  as  a  mirror 
and  concepts  as  mental  images  of  the  outside 
world,  as  cognitive  representations  of  objective 
realities  that  uphold  the  fragile  correspondence 
between  the  two  worlds.  In  the  mid-20th  century, 
the  so-called  linguistic  turn  in  modern  philosophy, 
brought  about  by  authors  like  Ludwig  Wittgenstein 
and  John  Austin,  redefined  the  basic  coordinates 
of  concept  analysis.  It  shifted  the  axis  of  concep¬ 
tual  debate  from  cognition  to  language  and  from 
language  as  a  system  of  representative  symbols 
(“Platonism”)  to  language  as  a  medium  of  social 
action  (“pragmatism”). 

Language  Acts 

According  to  the  classic  conception  of  language, 
concepts  are  our  basic  units  of  thought.  According 
to  a  pragmatic  understanding  of  language,  con¬ 
cepts  are  our  basic  units  of  (linguistic)  action.  In 
this  perspective,  concepts  are  not  interior  images 
that  correspond  to  external  realities  but  practical 
tools  that  allow  us  to  do  many  things,  many  more 
than  just  putting  vivid  labels  on  inanimate  objects. 
They  allow  us  to  threaten  and  promise,  to  bless 
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and  condemn,  to  give  orders  and  to  request  favors, 
to  express  tenderness  and  anger,  to  know  and 
believe,  to  contract  marriage  and  christen  ships, 
and  so  on.  Designating  objects  “out  there”  in  the 
external  world  (reference)  is  just  one  linguistic 
function  among  innumerable  others. 

As  practitioners  of  social  science,  too,  we  do 
more  than  offer  aseptic  statements  about  the 
world.  In  our  texts  and  speeches,  we  do  more  than 
describe  and  explain,  more  than  refer  to  facts  and 
associations  between  facts.  We  laud  and  criticize 
colleagues,  highlight  and  downplay  themes,  sup¬ 
port  and  refute  arguments,  persuade  and  dissuade 
readers,  and  so  forth.  However,  while  reference  is 
not  everything,  it  does  play  a  leading  role  in  the 
social  sciences.  All  types  of  “speech  acts”  (John 
Searle)  characteristically  contain  referential  ele¬ 
ments.  They  refer  to  something,  be  it  in  the  physi¬ 
cal  world  of  objects,  the  social  world  of  norms  and 
interaction,  or  the  subjective  world  of  emotion  and 
cognition.  Arguably,  articulating  empirical  and 
theoretical  propositions  constitutes  the  nucleus  of 
our  linguistic  activities.  It  is  what  we  are  supposed 
to  do  with  social  science  concepts:  developing 
descriptive  and  explanatory  inferences,  making 
and  breaking  claims  about  the  social  word.  In 
these  linguistic  performances — our  primary  speech 
acts  in  the  social  sciences — reference  is  key.  We 
need  our  concepts  to  perform  referential  roles.  We 
need  them  to  grasp  concrete  realities  in  abstract 
terms.  Classical  philosophy  was  centrally  con¬ 
cerned  with  one  specific  purpose  of  language:  its 
referential  role.  We  should  not  be  surprised  to  see 
that  conceptual  discussions  in  contemporary  social 
sciences,  too,  privilege  traditional  reference  over 
other  linguistic  roles. 

Meaning 

If  concepts  are  means  of  action,  their  meaning 
does  not  derive  from  their  correspondence  to 
objective  realities  but  from  their  practical  roles  in 
linguistic  communication.  In  Wittgenstein’s  famous 
dictum,  “The  meaning  of  a  word  is  its  use  in  the 
language”  (1952/1968,  sec.  43).  Tanguage  is  a 
medium  of  social  communication.  Its  rules  of 
usage  and  meaning  are  public,  not  private.  Our 
shared  knowledge  about  the  meaning  of  a  word 
derives  from  our  shared  linguistic  practices.  As 
competent  language  users,  we  know  what  others 


know  about  the  meaning  (the  conditions  of  legiti¬ 
mate  use)  of  a  concept.  As  responsible  language 
users,  we  accept  the  meaning  of  a  concept  (its  con¬ 
ditions  of  legitimate  use)  when  we  use  it  and  accept 
that  others  can  hold  us  to  account  for  using  it.  As 
in  other  realms  of  social  action,  responsibility 
means  that  we  accept  the  consequences  of  our 
deeds.  Take  the  standard  example  of  a  promise.  If 
I  promise  you  x,  I  understand  the  meaning  of 
promise  making  and  accept  its  conditions  of  valid¬ 
ity.  Among  other  things,  I  understand  and  accept 
that  x  is  a  future  action  that  benefits  you,  that  I  am 
able  to  perform  it,  that  my  promise  obliges  me  to 
perform  it,  and  that  I  actually  intend  to  perform  it 
(Searle,  1969,  chap.  3).  If  I  promise,  yet  violate  any 
of  these  conditions  of  validity  that  constitute  the 
meaning  of  promises  and  in  consequence  fail  to 
carry  out  x,  you  can  hold  me  accountable. 

When  we  employ  concepts  more  specifically  as 
means  of  propositions  (the  primary  form  of 
speech  acts  in  the  social  sciences),  we  use  them  as 
carriers  of  general  claims  about  the  empirical  phe¬ 
nomena  they  are  referring  to.  When  applying 
them  to  concrete  cases,  we  subscribe  to  these 
claims.  We  commit  ourselves  to  their  truth  (appli¬ 
cability).  If  I  call  a  man  a  friend,  a  thief,  or  a  left- 
wing  dictator,  I  articulate  (and  thus  embrace) 
certain  (contextually  understood)  claims  about 
my  relationship  to  him,  his  relationship  to  alien 
property,  or  the  form  and  substance  of  his  exer¬ 
cise  of  state  power.  In  case  of  doubt,  confusion,  or 
contestation,  I  must  be  ready  to  justify  my  concep¬ 
tual  choices  and  accept  the  consequences.  The 
bundles  of  claims  we  commit  ourselves  to  when 
employing  a  concept  comprises  its  meaning.  Often 
these  claims  and  commitments  remain  implicit. 
Formal  definitions  serve  to  make  them  explicit 
(Robert  Brandom,  2000). 

Reification 

In  political  science,  we  still  have  to  assimilate 
the  insights  of  pragmatic  philosophy.  Our  discus¬ 
sions  of  concepts,  as  far  as  they  take  place,  still 
tend  to  be  anchored  in  the  classical  distinction 
between  mental  creations  and  real  objects.  In 
addition,  we  tend  to  reify  both  sides  of  the  mind- 
world  distinction.  We  tend  to  treat  both  concepts 
and  their  referents  as  if  they  were  things.  The 
result  might  be  described  as  a  kind  of  double  false 
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consciousness.  We  tend  to  misrepresent  social 
reality  as  well  as  our  representations  of  reality. 

(a)  The  Reification  of  Reference.  Concepts  are 
abstractions,  not  proper  names.  They  do  not  serve 
to  designate  particular  objects  but  classes  of 
objects.  On  the  referential  side,  our  paradigms  of 
objects  are  still  concrete,  material  things  with 
observable  properties.  Very  few  objects  of  political 
research  correspond  to  this  model.  The  realities  we 
study  are  symbolic.  Our  concepts  are  not 
generalizations  from  observed  properties  but 
abstractions  of  symbolic  realities. 

(b)  The  Reification  of  Concepts.  On  the  conceptual 
side,  we  tend  to  treat  our  abstractions,  too,  as  if 
they  were  tangible  objects,  fixed  in  time  and  space. 
Employing  the  language  of  factual  propositions, 
we  tend  to  ask  what  a  concept  is  (and  is  not)  and 
what  its  essential  attributes  are  (and  are  not),  as  if 
comprehending  the  concept  required  discerning  its 
visible  properties. 

The  Triangle 

According  to  the  widely  used  tripartite  concep¬ 
tion  of  concepts  developed  by  Charles  Kay  Ogden 
and  Ivor  Armstrong  Richards  (the  Ogden-Richards 
triangle),  concepts  are  symbolic  entities  that  consist 
of  three  elements:  their  meaning  (connotation  or 
intension),  their  referents  (denotation  or  exten¬ 
sion),  and  a  term  or  word  (their  name).  This  con¬ 
ception  of  concepts,  introduced  by  Sartori  into 
political  science,  is  still  indebted  to  the  notion  of 
concepts  as  symbolic  intermediaries  between  mind 
(as  location  of  meaning)  and  the  objective  world 
(as  location  of  reference).  It  is  therefore  vulnerable 
to  the  reification  of  both  reference  and  meaning. 
Still,  it  serves  well  to  understand  the  contingent 
nature  of  conceptual  commitments  (the  element  of 
choice  in  the  relations  between  terms,  meaning,  and 
reference)  and  in  general  offers  a  set  of  useful  dis¬ 
tinctions  to  analyze  the  formation  and  de-forma- 
tion  of  concepts  (which  will  guide  parts  of  our 
subsequent  discussions). 

Conceptual  Disorders 

Conceptual  discussions  in  the  social  sciences  often 
carry  therapeutic  ambitions.  In  the  pursuit  of  clarity 


and  precision,  they  strive  to  cure  scientific  lan¬ 
guage  of  the  multiple  disorders  that  are  thought  to 
afflict  ordinary  language.  The  tradition  of  concep¬ 
tual  analysis  in  political  science  that  was  initiated 
by  Sartori  and  his  colleagues  in  the  1970s  and  is 
continued  today  most  prominently  by  David 
Collier  and  John  Gerring  (2009)  partakes  of  this 
therapeutic  project.  Ordinary  language  is  not  gen¬ 
erally  defective,  though.  It  is  as  clear  and  precise  as 
speakers  need  it  to  be.  Still,  scientific  language  dif¬ 
fers  from  ordinary  language  in  some  fundamental 
regards.  Among  other  things,  it  is  written  in  form 
and  literal  in  style;  it  involves  a  strong  commit¬ 
ment  to  truthful,  transparent,  and  evidence-based 
argumentation;  it  aims  at  generating  general 
knowledge;  and  it  demands  the  development  of  a 
common  specialized  vocabulary  within  the  aca¬ 
demic  community.  Most  of  these  distinctions  are 
normative,  not  empirical.  They  do  not  give  social 
scientists  a  mandate  to  remedy  the  deficiencies  of 
ordinary  language,  but  they  do  involve  the  profes¬ 
sional  obligation  to  craft  a  shared  specialized 
vocabulary  that  steers  clear  of  major  conceptual 
disorders  and  malpractices. 

Conceptual  Opacity 

Everyday  linguistic  communication  unfolds  on 
the  basis  of  implicit  meaning.  Neither  do  speakers 
offer  formal  definitions  of  the  words  they  use,  nor 
do  their  interlocutors  ask  for  them — except  when 
their  shared  understandings  turn  problematic  and 
when  communicative  irritations  arise,  which  are 
instances  of  incongruence  between  the  concrete 
application  of  concepts  and  their  social  meaning 
that  we  take  for  granted.  You  promised  to  be 
punctual  and  are  an  hour  late.  Is  this  your  notion 
of  punctuality?  You  say  we  are  friends,  but  you  left 
me  alone  in  the  face  of  danger.  Is  this  your  idea  of 
friendship? 

In  the  social  sciences,  we  run  higher  systemic 
risks  of  breaking  through  the  thin  ice  of  implicit 
understandings.  Our  key  concepts  are  often  com¬ 
plex  and  contested,  and  we  cannot  take  for  granted 
that  others  comprehend  them  in  the  same  manner 
as  we  do.  Linguistic  transparency  is,  therefore,  our 
first  obligation  in  the  social-scientific  use  of  con¬ 
cepts.  Karl  Marx  remarked  once  that  he  needed 
three  volumes  to  explain  the  concept  of  capital.  We 
need  not  go  that  far  in  explicating  the  core  concepts 
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we  use  in  our  research.  Concise  formal  definitions 
will  often  suffice.  Yet  if  we  fail  to  make  explicit  our 
central  conceptual  commitments,  our  theories  and 
findings  cannot  contribute  to  the  construction  of 
common  knowledge,  only  to  the  accumulation  of 
fragmentary  statements  whose  interrelations  are 
uncertain.  Conceptual  opacity  engenders  opaque 
research. 

Conceptual  Confusion 

As  the  descendents  of  Noah  in  ancient  Babylonia 
set  out  to  build  a  premodern  skyscraper  (“a  tower, 
whose  top  may  reach  unto  heaven”),  God,  alarmed 
by  their  capacity  of  monolingual  coordination, 
intervened  to  “confound  their  language,  that  they 
may  not  understand  one  another’s  speech” 
(Genesis  11:7).  More  than  anything  else,  the  con¬ 
cept  analytical  tradition  of  Sartori  has  been  con¬ 
cerned  about  conceptual  confusion.  According  to 
its  disciplinary  diagnosis,  the  builders  of  compara¬ 
tive  political  science  are  afflicted  by  a  similar  con¬ 
fusion  of  tongues  as  the  architects  of  the  Tower  of 
Babel.  Lacking  discipline  and  coherence,  schools 
and  scholars  are  speaking  past  each  other  with  dif¬ 
ferent,  mutually  incomprehensible  vocabularies. 

If  concepts  form  triangles  of  terms,  meanings, 
and  referents,  conceptual  confusion  may  arise 
from  three  sources:  confusing  relations  between 
terms  and  meanings  (ambiguity),  confusing  rela¬ 
tions  between  meanings  and  referents  (vagueness), 
and  confusing  stipulations  of  meaning  (definitional 
defects). 

Conceptual  Ambiguity 

To  ensure  unequivocal  associations  of  words 
and  conceptual  commitments  “the  golden  rule  is  to 
have  one  word  for  each  meaning”  (Sartori,  1984/ 
2009,  p.  113).  Confusion  may  arise  if  we  have 
several  words  for  one  concept  (synonyms)  or  one 
word  with  various  meanings  (homonyms). 

Genuine  synonyms  pose  no  problem  for  com¬ 
munication.  They  enrich  our  vocabulary  and  help 
us  avoid  tedious  repetition.  The  troubles  arise 
from  fuzzy  synonyms — neighboring  or  overlap¬ 
ping  terms  that  are  situated  in  a  disorderly  seman¬ 
tic  field  and  whose  exact  relations  remain  unspeci¬ 
fied.  Political  science  is  replete  with  such  terms. 
While  language  users  employ  them  (loosely)  as 


synonymous,  it  is  unclear  whether  they  actually  do 
carry  equivalent  conceptual  commitments.  For 
instance,  institutions  are  often  defined  as  rules , 
and  vice  versa.  Yet  we  do  not  know  to  what  extent 
we  can  treat  the  two  as  interchangeable  concepts. 
In  the  best  of  cases,  fuzzy  synonyms  share  a  recog¬ 
nizable  semantic  core  but  differ  in  their  precise 
connotations  (additional  shades  of  meaning).  For 
example,  genocide  and  ethnic  cleansing  may  refer 
to  the  same  murderous  acts,  yet  the  former  main¬ 
tains  analytic  distance,  while  the  latter  adopts  the 
hygienic  discourse  of  the  assassin. 

Although  words  frequently  carry  multiple  mean¬ 
ings,  in  ordinary  language,  homonymy  does  little 
to  disturb  our  ability  to  communicate.  In  everyday 
talk,  context  determines  meaning.  If  it  fails  to  do 
so,  we  can  always  ask  for  clarification.  In  the 
social  sciences,  by  contrast,  homonymous  terms 
are  more  corrosive  to  communication.  If  scholars 
attach  incongruent  meanings  to  key  terms  that 
define  their  fields  of  inquiry,  they  will  fail  to  create 
common  knowledge.  Instead  of  studying  one  class 
of  phenomena,  they  will  be  studying  different  sub¬ 
jects  under  the  same  name.  The  unity  of  their  field 
of  research  will  be  apparent  only,  a  nominal  delu¬ 
sion  veiling  the  substantive  fragmentation  of  their 
research.  For  instance,  if  some  hold  the  goal  of 
“democratic  consolidation”  to  be  the  prevention 
of  authoritarian  regression  while  others  take  it  to 
lie  in  the  achievement  of  democratic  deepening, 
comparative  inquiries  into  the  conditions  of  demo¬ 
cratic  consolidation  will  address  qualitatively  dif¬ 
ferent  substantive  problems. 

Conceptual  Vagueness 

Concepts  are  abstractions.  They  allow  us  to 
speak  about  empirical  phenomena  in  general  rather 
than  marvel  at  them  one  by  one,  in  puzzlement 
over  their  uniqueness  (which  we  could  not  grasp 
anyway  without  a  prior  notion  of  generality).  In 
the  social  sciences,  we  want  concepts  to  be  precise, 
to  circumscribe  clearly  the  realm  of  phenomena  to 
which  they  apply.  Vague  concepts  fail  to  do  so. 
They  leave  the  relation  between  conceptual  claims 
and  empirical  objects  indeterminate.  They  do  not 
allow  us  to  decide  which  phenomena  lie  inside  and 
which  outside  their  realm  of  application.  Political 
actors  often  apply  such  elastic  concepts  of  unclear 
denotation  as  weapons  of  political  struggle.  For 
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instance,  candidates  would  condemn  “vote  buy¬ 
ing”  by  their  adversaries  without  making  clear 
what  kind  of  acts  they  are  referring  to.  Are  they 
condemning  any  campaign  promise  that  offers  vot¬ 
ers  material  benefits  in  the  future?  Squeezed 
between  conflicting  expectations,  political  con¬ 
tenders  often  benefit  from  evasiveness.  Scholars, 
by  contrast,  cannot  leave  their  readers  guessing 
what  they  are  talking  about.  Leaving  its  empirical 
referents  in  the  dark,  vague  concepts  lead  our 
research  into  obscurity. 

Definitional  Defects 

Our  semantic  definitions  are  sources  of  confu¬ 
sion  if  their  relations  to  words  and  things  are  con¬ 
fusing.  They  are  confusing,  too,  if  their  internal 
structure  is  defective.  Definitional  defects  may  be 
manifold.  Our  definitions  may  be  contradictory  or 
tautological.  They  may  be  incomprehensible  or 
prone  to  provoke  misunderstandings.  They  may 
ignore  standard  rules  of  classification  by  offering 
classificatory  schemes  that  fail  to  be  exclusive, 
exhaustive,  and  one-dimensional.  Or  they  may 
confound  levels  of  abstraction  within  taxonomical 
orders  of  classes  and  subclasses.  As  a  matter  of 
fact,  there  is  nothing  easier  than  to  be  confusing.  A 
moment  of  distraction,  a  slip  of  the  tongue,  a  typo¬ 
graphical  mistake,  and  the  meaning  of  what  we 
meant  to  say  dissolves  in  mist. 

Conceptual  Instability 

In  the  social  sciences,  linguistic  stability  permits 
continuity  in  research.  The  diffusion  of  new  theo¬ 
retical  approaches  tends  to  involve  major  shifts  in 
our  vocabulary.  That  is  what  new  theories  often 
are:  new  vocabularies  and  redescriptions  of  the 
world.  The  waves  of  conceptual  instability  they 
induce  tend  to  reshape  collective  research  agen¬ 
das.  They  tend  to  disrupt  established  lines  of 
inquiry  even  when  substantive  concerns  remain 
essentially  the  same.  Each  time  we  replace  our 
theoretical  vocabularies,  we  tend  to  reinvent  the 
wheel  of  empirical  research.  For  instance,  the  new 
literature  on  the  political  economy  of  political 
regime  change  ignores  earlier  debates  on  capital¬ 
ism  and  democracy,  the  new  literature  on  state 
failure  ignores  the  earlier  literature  on  state  build¬ 
ing,  and  so  on.  Unless  we  develop  and  conserve 


the  ability  to  translate  between  theoretical  lan¬ 
guages,  destabilizing  our  core  categories  can  be 
deeply  damaging  to  the  much  cherished  accumula¬ 
tion  of  knowledge. 

Conceptual  Abuse 

The  tools  of  language  are  open  to  almost  limit¬ 
less  forms  of  abuse.  Two  symmetrical  strategies  of 
reality  distortion  are  of  particular  interest  to 
political  scientists:  conceptual  stretching  (Sartori) 
and  conceptual  masking.  Conceptual  stretching 
involves  the  application  of  (often  value-laden)  con¬ 
cepts  to  cases  that  lack  essential  characteristics  of 
these  concepts  (as  when  a  corrupt  politician 
describes  himself  as  an  honest  man).  Conceptual 
masking  involves  the  description  of  (morally  rele¬ 
vant)  cases  through  neutral  concepts  that  disguise 
the  essential  characteristics  of  these  cases  (as  when 
a  bank  robber  describes  himself  as  a  common  cus¬ 
tomer).  The  former  puts  forward  semantic  over¬ 
statements  in  which  concept  application  betrays 
fundamental  conceptual  commitments.  The  latter 
puts  forward  semantic  understatements  in  which 
concept  application  denies  fundamental  factual 
realities. 

These  forms  of  conceptual  abuse  are  surely 
more  frequent  and  more  severe  in  politics  than  in 
political  science.  Modern  authoritarian  regimes, 
masters  of  linguistic  abuse,  routinely  practice  both. 
Stretching  the  notion  of  popular  rule  beyond  rec¬ 
ognition,  they  tend  to  portray  themselves  as  higher 
forms  of  democracy,  as  when  Augusto  Pinochet 
described  his  form  of  military  dictatorship  in  Chile 
as  “protected  democracy.”  The  totalitarian  regimes 
of  the  20th  century  were  notorious  in  inventing 
quasi-neutral  technical  terms  and  bureaucratic 
acronyms  to  camouflage  the  unspeakable  atroci¬ 
ties  they  committed  against  humanity.  For  instance, 
the  Nazis  described  their  factories  of  assassination 
as  “concentration  camps,”  the  Soviets  under  Stalin 
their  officially  inexistent  colonies  of  slave  labor 
and  death  as  “Gulag,”  “the  zone,”  or  simply  and 
enigmatically  “the  other  side.” 

Both  malpractices  destroy  conceptual  validity. 
They  sever  the  link  between  connotation  and 
denotation,  between  conceptual  commitments  and 
factual  applications.  In  instances  of  conceptual 
stretching,  speakers  claim  too  much,  and  realities 
negate  the  essence  of  the  concepts  they  use  (often, 
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their  moral  essence).  In  instances  of  conceptual 
masking,  speakers  claim  too  little,  and  concepts 
deny  the  essence  of  the  realities  they  face  (often, 
their  moral  essence). 

Conceptual  Lumping 

Names  allow  us  to  designate  individuals,  con¬ 
cepts  classes  of  individuals.  More  general  concepts 
capture  larger  classes,  more  concrete  concepts 
smaller  ones.  According  to  the  well-known  “lad¬ 
der  of  abstraction”  introduced  by  Sartori,  the 
number  of  defining  attributes  of  a  concept  (con¬ 
notation  or  intension)  and  the  number  of  its  refer¬ 
ents  (denotation  or  extension)  are  inversely  related. 
At  a  high  level  of  abstraction,  concepts  carry  few 
defining  attributes  and  cover  many  cases.  At  low 
levels  of  generality,  they  contain  numerous  defin¬ 
ing  features  and  apply  to  few  cases. 

In  scholarly  research,  just  as  in  ordinary  lan¬ 
guage,  we  have  to  choose  the  level  of  conceptual 
abstraction  that  seems  appropriate  for  our  pur¬ 
pose.  If  we  talk  too  abstractly  in  everyday  life,  our 
interlocutors  may  get  irritated:  Come  down,  be 
concrete,  we  don’t  want  to  hear  generalities!  If  we 
are  overly  specific  and  draw  excessively  fine  dis¬ 
tinctions,  they  may  well  get  impatient,  too:  Focus 
on  the  relevant,  stop  splitting  hairs!  In  William 
Ockham’s  famous  formulation,  “You  need  no 
razor  to  cut  butter.” 

In  crafting  social-scientific  concepts,  we  have  to 
seek  a  pragmatic  balance  between  our  ambitions 
of  theoretical  generalization  and  our  needs  for 
analytical  differentiation.  As  in  everyday  interac¬ 
tions,  we  may  err  on  either  side  when  choosing  our 
levels  of  conceptual  abstraction.  If  we  aim  too  high 
and  employ  excessively  general  concepts  that  oblit¬ 
erate  “differences  that  make  a  difference”  (Gregory 
Bateson),  our  critics  will  accuse  us  of  conceptual 
lumping  (Sartori).  If  we  aim  too  low  and  choose 
excessively  specific  concepts  that  trace  irrelevant 
distinctions,  our  critics  will  reproach  us  with  con¬ 
ceptual  splitting. 

Overall,  to  the  extent  that  the  social  science  com¬ 
munity  is  amenable  to  “linguistic  therapy”  (Umberto 
Eco)  and  avoids  developing  conceptual  pathologies, 
it  strengthens  its  collective  capacity  to  communicate 
effectively.  Social-scientific  language  demands  more 
than  conceptual  health  and  discipline,  though.  It 
also  requires  theoretical  and  conceptual  creativity, 


grounded  in  linguistic  competence  and  empirical 
knowledge. 

Concept  Formation 

Concept  formation  is  the  systematic  development 
and  explication  of  the  core  claims  to  which  we  com¬ 
mit  ourselves  when  applying  a  concept.  It  requires 
us  to  understand  ordinary  and  specialized  uses  of 
the  concept,  map  its  location  within  its  semantic 
field,  situate  it  within  empirical  realities  and  ana¬ 
lytical  frames,  understand  its  structural  properties, 
choose  our  semantic  commitments,  and  choose  the 
term  that  best  resonates  with  its  meaning. 

Reconstruction 

If  the  meaning  of  a  word  lies  in  its  use,  we  need 
to  comprehend  the  usage  of  a  word  if  we  wish  to 
comprehend  the  semantic  commitments  it  involves. 
The  first  question  to  ask  concerns  usage  in  ordi¬ 
nary  language.  In  English,  this  question  has  a 
straightforward  answer:  the  Oxford  English 
Dictionary  (OED).  Initiated  well  before  the  advent 
of  20th-century  pragmatic  philosophy  of  language, 
the  OED  is  a  monument  to  a  pragmatic  under¬ 
standing  of  language  as  a  medium  of  social  prac¬ 
tice.  Alien  to  prescriptive  or  regulatory  preten¬ 
sions,  it  meticulously  registers  “the  meaning  of 
everything”  (Simon  Winchester)  by  documenting 
concrete  instances  of  word  usage  across  centuries 
of  linguistic  development. 

The  second  question  concerns  the  scientific 
usage  of  a  word.  According  to  Sartori’s  seminal 
“Guidelines  for  Concept  Analysis,”  the  semantic 
“reconstruction”  of  a  concept  starts  with  a  review 
of  the  relevant  scholarly  literature.  Unlike  the 
OED,  Sartori  does  not  recommend  tracing  con¬ 
cepts  in  usage  in  order  to  uncover  the  implicit 
claims  that  can  be  inferred  from  their  practical 
applications.  Rather,  he  directs  scholars  to  compile 
lists  of  explicit  definitions  (until  they  get  bored  by 
repetition  and  redundancy),  enumerate  the  attri¬ 
butes  included,  and  bring  them  “into  some  mean¬ 
ingful  kind  of  organization”  (Sartori,  1984/2009, 
p.  120).  Meaningful  organization,  indeed,  is  the 
central  task  of  concept  formation  and  the  most  dif¬ 
ficult  one,  as  it  escapes  rules  and  recipes.  To  begin 
with,  it  requires  us  to  place  our  concepts  in  their 
linguistic  contexts. 
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Semantic  Fields 

If  we  wish  to  understand  a  concept  properly,  we 
must  not  analyze  it  in  isolation.  We  need  to  map  the 
semantic  field  it  inhabits  and  locate  it  in  the  web  of 
relationships  that  connects  it  with  its  conceptual 
neighbors.  Unless  we  know  what  the  concept  shares 
with  proximate  concepts  and  what  separates  it 
from  them,  we  cannot  grasp  its  specificity. 
Neighboring  concepts  often  share  large  intersecting 
circles  of  meaning,  yet  carry  semantic  nuances  that 
are  relevant  for  our  research  purposes.  For  instance, 
if  we  set  out  to  study  civil  wars,  we  step  into  a  rich 
semantic  field  of  overlapping,  yet  not  identical,  con¬ 
cepts,  such  as  guerrilla  war,  revolution,  rebellion, 
ethnic  violence,  regional  violence,  organized  vio¬ 
lence,  political  violence,  state  failure,  anarchy, 
political  disorder,  and  political  fragmentation. 
Semantic  cartography,  surveying  and  mapping  the 
“systems  of  terms”  (Sartori,  1984/2009,  p.  142) 
that  constitute  semantic  fields,  provides  a  relational 
and  comparative  understanding  of  such  clusters  of 
concepts.  It  helps  us  grasp  better  not  only  the  cen¬ 
tral  connotations  of  interrelated  concepts  but  also 
their  finer  shades  of  meaning  that  may  be  decisive 
in  choosing  one  concept  over  another. 

Empirical  Boundaries 

In  shedding  comparative  light  on  the  precise 
meaning  (intension)  of  related  concepts,  the  analy¬ 
sis  of  semantic  fields  also  helps  clarify  their  precise 
referents  (extension).  To  the  extent  that  we  grasp 
the  differences  and  similarities  in  the  substantive 
claims  neighboring  concepts  contain,  we  grasp  the 
differences  and  similarities  in  the  empirical  phe¬ 
nomena  they  refer  to.  For  instance,  in  the  semantic 
field  of  civil  war,  some  concepts,  such  as  political 
violence,  are  situated  at  high  levels  of  abstraction 
and  include  violent  actions  outside  contexts  of 
societal  warfare  (e.g.,  terrorism),  while  others, 
such  as  regional  violence,  refer  to  more  narrow 
categories  of  violent  conflict.  Some  concepts,  such 
as  ethnic  violence,  involve  claims  about  the  motives 
of  violence,  while  others  do  not.  Some,  such  as 
guerrilla  war,  emphasize  the  presence  of  organized 
actors,  while  others,  such  as  political  disorder, 
emphasize  the  dissolution  of  central  authority. 
Some  concepts,  such  as  state  failure,  attribute 
agency  to  the  state,  while  others,  such  as  civil  war, 
distribute  it  among  societal  actors. 


All  these  semantic  differences  involve  empirical 
differences.  They  point  to  different  empirical  phe¬ 
nomena.  Understanding  these  differences  allows  us 
to  understand  the  empirical  scope  of  concepts — 
their  bounded  territories.  It  gives  us  an  idea  of 
what  they  include  and  exclude.  Depending  on 
what  we  want  to  see  and  what  we  decide  to  ignore, 
it  allows  us  to  choose  and  use  the  concepts  most 
akin  to  our  analytical  interests.  Concepts,  writes 
Gary  Goertz  (2006),  are  “theories  about  the  fun¬ 
damental  constitutive  elements  of  a  phenomenon” 
(p.  5).  Within  the  semantic  field  of  civil  war,  we 
will  select  our  concept  of  choice  depending  on  the 
empirical  dimensions  our  theories  designate  as  rel¬ 
evant — the  organized  nature  of  civil  wars,  their 
political  motives,  their  outcomes,  their  causal  asso¬ 
ciation  with  state  power,  or  their  membership  in 
the  general  category  of  political  violence. 
Sometimes,  our  inherited  vocabulary  does  not 
trace  the  precise  distinctions  we  are  interested  in. 
Still,  our  semantic  maps  allow  us  to  visualize  the 
configurations  of  empirical  boundaries  as  previous 
concept  users  have  found  them  relevant.  Though 
at  times  incomplete  and  inconclusive,  they  open  up 
a  first  dialogue  between  concepts  and  cases,  medi¬ 
ated  by  theory. 

Analytic  Frames 

When  scholars  reconstruct  the  scientific  usage 
of  their  core  concepts  in  relation  to  proximate  con¬ 
cepts  (semantic  fields),  they  may  come  up  with 
long  lists  of  contending  or  coexisting,  divergent  or 
convergent  terms,  definitions,  and  applications.  In 
and  of  themselves,  such  listings  are  of  little  analytic 
use.  The  key  challenge  is  to  order  them  in  a  man¬ 
ner  that  resonates  with  theoretical  traditions  and 
empirical  concerns  in  relevant  fields  of  research, 
and  this  is  easier  said  than  done.  The  standard 
recommendation  sounds  simple:  First  identify 
underlying  analytical  dimensions  and  then  show 
how  different  concepts  and  uses  of  concepts  differ 
along  these  dimensions.  But  which  are  these  under¬ 
lying  dimensions?  They  do  not  just  lie  around, 
self-evident  and  open  to  simple  inspection.  How 
then  can  we  find  them,  and  how  can  we  construct 
them? 

The  answer  is  perhaps  disappointing:  Concept 
formation  is  not  a  bureaucratic  enterprise  but  a 
constructive  one  (like  social  science  in  general).  It 


Concept  Formation  377 


is  not  about  the  mechanical  application  of  rules,  be 
it  of  logic  or  language,  but  about  the  creative  pro¬ 
cess  of  abstraction.  In  our  efforts  to  make  sense  of 
divergent  definitions  and  uses  of  concepts,  we  have 
to  renounce  the  comfort  of  rules.  This  is  the  weak 
point,  the  structural  lacuna,  regarding  guidelines 
for  concept  formation:  There  is  none  for  this  cru¬ 
cial  task. 

As  a  first  step,  it  is  always  helpful  to  ask  about 
units  of  analysis.  If  authors  talk  about  democracy, 
is  it  regimes  or  states  they  are  talking  about?  If 
they  talk  about  the  rule  of  law,  is  it  individual  deci¬ 
sions  or  judicial  systems  they  are  talking  about? 
Clarity  about  the  kinds  of  “objects”  different 
usages  of  a  concept  strive  to  grasp  often  helps  us 
locate  them  at  different  levels  of  abstraction. 
However,  it  does  not  tell  us  anything  about  the 
substantive  claims  different  usages  carry.  We  are 
not  biologists  looking  at  elephants  or  cows,  tangi¬ 
ble,  objective  phenomena  whose  observable  char¬ 
acteristics  make  them  easy  prey  to  the  classic  logic 
of  classification  per  genus  et  differentiam. 
Identifying  the  general  commonalties  that  deter¬ 
mine  their  genus  and  then  the  specific  characteris¬ 
tics  that  determine  their  species  still  involves  a 
good  deal  of  complexity  and  controversy  within 
the  biological  community  and  yet  in  principle 
requires  little  “sociological  imagination”  (C. 
Wright  Mills).  But  this  is  not  so  in  the  social  sci¬ 
ences.  If  we  have  determined,  for  example,  that 
students  of  “ideology”  alternatively  refer  to  struc¬ 
tures  of  thought,  language,  or  behavior,  we  still  do 
not  know  what  kind  of  structures  they  are  talking 
about.  We  still  have  to  discern  (somehow)  the  ana¬ 
lytical  dimensions  that  distinguish  different  claims 
about  the  defining  structures  of  ideologies,  such  as 
internal  coherence,  external  differentiation,  sophis¬ 
tication,  factual  accuracy,  abstraction,  hierarchy, 
stability,  dogmatism,  sincerity,  and  consciousness 
(Gerring,  1997). 

Scholars  do  not,  however,  enter  the  process 
empty-handed.  They  are  not  naive  observers  of 
unstructured  data.  They  know  the  literature,  they 
know  the  facts  and  the  theories,  and  they  know  the 
language  and  the  paradigmatic  cases  that  define 
their  field  of  study.  The  analytical  dimensions  they 
introduce  (as  well  as  the  analytical  dimensions  their 
predecessors  introduced  in  the  first  place)  to  bring 
“meaningful  order”  into  conglomerates  of  con¬ 
tending  definitions  are  anchored  in  their  theoretical 


and  empirical  knowledge.  It  is  such  theoretical  as 
well  as  empirical  anchorage  that  makes  the  mean¬ 
ingful,  fruitful,  useful  organization  of  semantic 
fields  possible. 

Conceptual  Structures 

Once  we  have  succeeded  in  bringing  analytical 
order  into  multiple  uses  of  a  concept,  we  can 
reconstruct  its  structural  properties.  Conceptual 
structures  are  configurations  of  conceptual  com¬ 
mitments.  Once  we  have  understood  the  claims 
authors  commit  themselves  to  when  using  a  certain 
concept,  two  questions  ensue:  How  strong  are 
their  conceptual  commitments?  And  how  do  those 
commitments  they  consider  binding  (the  “essential 
features”  of  a  concept)  relate  to  each  other? 

Conceptual  Cores 

The  strength  of  our  commitments  to  the  con¬ 
ceptual  claims  we  articulate  varies  by  degrees. 
Some  claims  we  hold  to  be  indispensable  across 
contexts.  They  constitute  the  core  meaning  of  a 
concept.  Other  claims  we  deem  to  be  secondary 
and  contingent.  They  form  the  peripheral  and  con¬ 
textual  connotations  of  a  concept.  According  to 
the  classical  approach  to  concept  analysis,  from 
Aristotle  to  Sartori,  if  we  wish  to  comprehend  a 
concept,  we  need  to  identify  the  former,  its  seman¬ 
tic  core.  In  the  face  of  multiple  uses  of  a  concept, 
the  semantic  core  is  located  at  the  intersecting  area 
of  those  claims  competent  concept  users  declare  to 
be  binding  (“necessary  and  sufficient”).  Staking 
out  a  common  ground  of  binding  conceptual  com¬ 
mitments  (“the  core  concept”)  often  allows  us  to 
distinguish  “narrow,”  “thin,”  or  “minimal”  defi¬ 
nitions  (that  limit  themselves  to  the  core)  from 
“broad,”  “thick,”  or  even  “maximal”  ones  (that 
go  beyond,  up  to  envisioning  ideal-typical  instances 
of  the  concept). 

Family  Resemblances 

According  to  the  classical  conception  of  con¬ 
cepts,  if  different  uses  of  a  term  do  not  share  com¬ 
mon  semantic  ground,  they  do  not  count  as  instances 
of  the  same  concept.  They  appear  as  instances  of 
multiple  concepts.  Modern  philosophers  of  lan¬ 
guage  (reinforced  by  more  recent  psychological 
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research  on  typicality  effects)  have  shed  doubt  on 
the  notion  that  our  ordinary  usage  of  concepts  is 
governed  by  the  strict  membership  rules  of  neces¬ 
sary  and  sufficient  conditions.  They  have  contested 
the  notion  that  concepts  carry  semantic  cores. 
Rather  than  committing  themselves  to  a  fixed  set  of 
indispensable  claims,  they  have  suggested  that  con¬ 
cept  users  often  commit  themselves  to  a  more  open 
set  of  mutually  substitutable  claims.  Wittgenstein 
(1952/1968,  sec.  66-71)  introduced  the  idea  of 
family  resemblances  to  describe  concepts  that  are 
defined  by  an  ensemble  of  alternative  attributes 
rather  than  sharing  a  core  of  necessary  attributes. 
In  set-theoretic  terms,  the  relevant  features  of  fam¬ 
ily  resemblance  or  radial  concepts  do  not  form 
intersections  but  unions.  In  terms  of  classical  logic, 
they  are  not  linked  by  the  operator  AND  (neces¬ 
sity)  but  by  the  operator  OR  (substitutability) 
(Goertz,  2006,  chap.  2). 

It  seems  indeed  to  be  the  case  that  ordinary  lan¬ 
guage  users  routinely  apply  empirical  concepts, 
such  as  fruit  and  furniture,  to  concrete  objects  on 
the  basis  of  their  closeness  to  typical  examples 
(“prototypes”).  These  intuitive  applications  are 
not  based  on  an  invariable  set  of  claims  all  compe¬ 
tent  language  users  subscribe  to.  However,  the  use 
of  family  resemblance  seems  less  frequent  (and  less 
compelling)  in  the  social  sciences.  When  students 
of  politics  use  the  notion  of  family  resemblance, 
they  commonly  apply  it  not  to  the  highest  level  of 
abstraction  (the  definition  of  general  properties  of 
a  concept)  but  to  lower  levels  of  abstraction  (the 
definition  of  constitutive  dimensions  or  the  obser¬ 
vation  of  concrete  instances  of  a  concept).  Concepts 
described  as  family  resemblances  often  do  seem  to 
share  an  abstract  semantic  core  (at  a  high  level  of 
generality),  even  if  either  their  constitutive  dimen¬ 
sions  or  their  observational  indicators  are  mutu¬ 
ally  substitutive  (at  lower  levels  of  generality). 

As  a  matter  of  fact,  the  observation  of  family 
resemblances  seems  to  be  generally  dependent  on 
the  prior  comprehension  (at  least  vaguely  and 
implicitly)  of  an  abstract  semantic  core.  It  is  only 
because  we  possess  a  general  notion  of  their  com¬ 
mon  nature  that  we  can  discern  the  elastic  obser¬ 
vational  resemblance  of  certain  classes  of  cases. 
Otherwise,  we  would  perceive  no  more  than  super¬ 
ficial  similarities  among  disjointed  phenomena. 
Arguably,  this  is  even  true  for  the  two  paradig¬ 
matic  concepts  of  “families”  and  “games”  from 


which  Wittgenstein  derived  his  notion  of  family 
resemblances. 

Consider  families.  The  sociological  literature 
offers  numerous  overlapping  definitions  of  families 
and  subtypes  of  families.  For  instance,  the  modern 
nuclear  family  typically  includes  spouses  and  their 
dependent  children.  The  notion  of  family  resem¬ 
blance,  however,  does  not  explicate  the  meaning  of 
the  concept  of  family.  It  does  not  respond  to  the 
semantic  question  of  conceptual  essence  but  to  the 
phenomenological  question  of  empirical  appear¬ 
ance.  It  does  not  pretend  to  identify  families  and 
distinguish  them  from  other  social  groups  on  the 
basis  of  their  general  characteristics  but  to  identify 
the  members  of  particular  families  on  the  basis  of 
their  physical  appearance.  Family  members  may 
look  alike,  but  they  are  not  members  of  a  family 
(neither  in  general  nor  in  particular)  because  they 
look  alike. 

Diminished  Subtypes 

In  cases  of  family  resemblance,  empirical  refer¬ 
ents  may  lack  relevant  characteristics  of  a  concept 
and  still  represent  genuine  instances  of  it.  In  cases 
of  “diminished  subtypes”  (David  Collier  &  Steven 
Levitsky,  1997),  empirical  referents  do  not  fully 
possess  the  relevant  characteristics  of  a  concept 
and  therefore  do  not  represent  genuine  instances  of 
it.  Classical  typologies  do  not  differentiate  among 
members  of  a  category.  Objects  are  either  in  or 
out.  If  they  are  in,  they  are  equal  members  of  full 
standing.  Diminished  subtypes  take  into  account 
that  all  members  are  equal — but  some  are  less 
equal  than  others. 

Diminished  subtypes  arise  from  continuous  mul¬ 
tidimensional  concepts.  When  all  constitutive 
dimensions  of  a  concept  are  held  to  be  essential,  the 
full  absence  of  any  dimension  involves  the  absence 
of  the  phenomenon  in  question.  However,  if  con¬ 
stitutive  dimensions  are  not  dichotomous,  but  con¬ 
tinuous,  cases  may  lie  somewhere  in  between  full 
presence  and  full  absence  on  any  specific  dimen¬ 
sion.  Situated  in  the  “gray  zone”  (Goertz)  of  one 
constitutive  dimension,  these  cases  are  still  recog¬ 
nizable  members  of  a  general  class  of  phenomena. 
Yet  they  are  less  than  full  members.  Due  to  their 
structural  deficiencies,  they  appear  as  distant,  dam¬ 
aged,  distorted  representatives.  By  adding  qualifi¬ 
ers  (adjectives)  to  the  original  concept,  we  can 
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avoid  conceptual  stretching  (fraudulent  claims  to 
full  membership  in  a  category)  and  point  to  their 
specific  deficiencies.  The  beauty  of  diminished  sub- 
types  lies  in  their  diagnostic  precision. 

For  example,  if  democracy  requires  competitive 
elections  under  universal  suffrage,  the  absence  of 
either  electoral  competitiveness  or  electoral  inclu¬ 
siveness  renders  a  political  regime  nondemocratic. 
Yet  governments  may  impose  partial  restrictions 
on  either  electoral  competition  or  electoral  partici¬ 
pation  that  are  not  severe  and  systematic  enough 
to  involve  the  absence  of  either  dimension.  Such 
bounded,  ambiguous  constraints  may  turn  elec¬ 
toral  regimes  into  diminished  subtypes  of  democ¬ 
racy.  In  the  face  of  bounded  restrictions  on  compe¬ 
tition,  we  might  speak  of  controlled  democracies. 
In  the  face  of  bounded  suffrage  restrictions,  we 
might  speak  of  exclusionary  democracies. 

Diminished  subtypes  of  multidimensional  con¬ 
cepts  may  arise  from  the  limited  presence  of  one  (or 
some)  of  their  essential  dimensions.  They  may  also 
arise  from  the  full  absence  of  desirable,  yet  nones¬ 
sential,  attributes.  Diminished  subtypes  of  democ¬ 
racy  often  seem  to  express  structural  deficiencies 
whose  benchmarks  are  not  the  core  of  democracy 
but  the  ideal  of  democracy.  They  refer  to  demo¬ 
cratic  regimes  that  are  in  full  accordance  with 
democratic  minimum  standards,  yet  fail  to  fulfill 
more  stringent  expectations  to  democratic  gover¬ 
nance.  For  example,  in  clientelist  democracies,  citi¬ 
zens  lack  the  programmatic  orientation,  and  in 
apathetic  democracies,  they  do  not  show  the  par¬ 
ticipatory  enthusiasm  we  expect  from  high-quality 
democracies.  In  a  symmetrical  manner,  as  empiri¬ 
cal  cases  may  be  underperforming  in  relation  to  a 
particular  standard,  they  may  also  be  overperform¬ 
ing,  thus  giving  rise  to  “enhanced  subtypes.” 

Contested  Concepts 

The  notion  of  family  resemblances  introduces 
some  degree  of  flexibility  and  fuzziness  into  the 
classical  idea  of  essential  attributes.  The  notion  of 
“essentially  contested  concepts,”  formulated  by 
British  philosopher  Walter  Bryce  Gallie  only  a 
couple  of  years  after  the  publication  of 
Wittgenstein’s  Philosophical  Investigations, 
involves  a  potentially  more  radical  objection  to  the 
classical  idea  of  conceptual  cores.  According  to 
Gallie’s  (1956)  seminal  paper,  all  concepts  are 


open  to  contestation,  yet  some  are  “essentially” 
contested  insofar  as  “their  proper  use  .  .  .  inevita¬ 
bly  involves  endless  disputes  about  their  proper 
uses”  (p.  169).  Which  is  the  source  of  such  intrin¬ 
sic  and  irresolvable  conceptual  disputes?  There  are 
two  plausible  answers.  One  points  to  the  com¬ 
plexities  of  concept  application,  the  other  to  inner 
tensions  that  lie  at  the  very  core  of  some  concepts. 

Gallie’s  (1956)  account  focuses  on  concept 
application.  For  concepts  to  become  essentially 
contested,  he  says,  they  must  be  multidimen¬ 
sional — “internally  complex” — and  normative — 
“appraisive”  (pp.  171-172).  Competent  speakers 
recognize  and  value  the  various  dimensions  of  the 
concept.  “Any  explanation  of  its  worth  must 
therefore  include  reference  to  the  respective  contri¬ 
butions”  of  each  dimension  (p.  172).  Yet,  despite 
their  abstract  consensus  on  the  fundamental  com¬ 
ponent  parts  of  the  concept,  speakers  are  likely  to 
weight  them  differently  and  apply  them  differ¬ 
ently.  They  are  likely  to  dispute  their  relative 
importance  as  well  as  their  practical  implications 
under  changing  circumstances.  For  instance,  actors 
may  agree  that  modern  democracy  rests  on  the 
principles  of  majority  rule  and  constitutional  gov¬ 
ernment  but  still  disagree  about  their  rank  order 
(which  is  more  important)  and  their  operational¬ 
ization  (how  they  are  to  be  put  into  practice). 

According  to  a  more  radical  reading,  essential 
contestation  affects  the  very  core  of  concepts,  not 
just  their  application.  It  arises  when  concepts 
involve  irresolvable  inner  contradictions  and  when 
they  are  founded  on  genuine  dilemmas,  paradoxes, 
or  impossible  ideals — “conflicting  imperatives” 
(Reinhard  Bendix).  Consider  the  ideas  of  democracy 
and  the  rule  of  law.  The  etymological  root  of 
democracy  (“rule  of  the  people”)  assigns  citizens  the 
double,  paradoxical  role  of  rulers  and  ruled.  Their 
powers  cannot  be  maximized  simultaneously.  In 
addition,  modern  democracy  arguably  embodies 
fundamental  principles  (e.g.,  popular  participation 
and  constitutional  restraints)  that  are  mutually  sup¬ 
portive  as  well  as  (at  times)  mutually  subversive. 
Balancing  the  trade-offs  they  impose  demands  more 
than  establishing  simple  orders  of  priorities.  The 
concept  of  “the  rule  of  law”  suggests  a  mode  of 
impersonal  domination  where  formal  rules  govern, 
rather  than  human  beings — the  government  of  laws, 
not  people.  Yet  the  abolishment  of  human  judgment 
in  the  exercise  of  power  constitutes  an  impossible 
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ideal.  These  inner  tensions  are  constitutive  for  con¬ 
cepts  such  as  democracy  and  the  rule  of  law.  Since 
they  are  irresolvable,  they  feed  an  inexhaustible 
stream  of  legitimate  disputes  that  are  irresolvable  as 
well.  In  this  perspective,  appraisive  concepts  are 
likely  to  be  essentially  contested  if  they  are  essen¬ 
tially  self-contradictory. 

Conceptual  Innovation 

Language  is  a  social  medium  of  communication 
and  is  public  and  restrained  by  shared  rules  of 
grammar  and  semantics.  At  the  same  time,  it  is  the 
quintessential  medium  of  personal  expression,  free 
and  open  to  variation  and  innovation  as  no  other 
societal  institution.  Ask  James  Joyce  and  Ernst 
Jandl,  for  example.  In  the  social  sciences,  our 
license  to  innovative  uses  of  language  is  more  lim¬ 
ited  than,  say,  in  poetry  or  insult.  Still,  within 
constraints,  we  do  enjoy  important  margins  of 
conceptual  innovation. 

New  Definitions 

In  the  face  of  contending  definitions,  scholars 
cannot  rely  on  the  common,  accepted  usage  of  a 
concept.  They  have  to  meddle  with  the  rules  of  the 
language  game  by  formulating  their  own  concep¬ 
tual  commitments.  The  particular  definitions  they 
develop  may  relate  in  various  ways  to  preexisting 
definitions.  They  may  privilege  one  usage  against 
others  (selective  definitions),  they  may  tie  new 
bundles  of  conceptual  claims  by  accepting  some 
but  shedding  others  (eclectic  definitions),  they  may 
strive  to  express  underlying  commonalities  among 
multiple  conceptions  (synthetic  definitions),  or 
they  may  embrace  new  claims  that  alter  the  central 
connotation  of  the  concept  (deviant  or  original 
definitions).  The  last  move  overlaps  with  the  cre¬ 
ation  of  new  concepts. 

New  Concepts 

When  do  we  need  new  concepts?  The  answer  is 
very  simple:  when  we  wish  to  draw  distinctions 
that  we  had  not  drawn  before  or  when  we  wish  to 
grasp  commonalities  that  we  had  not  grasped 
before.  Sometimes  we  wish  to  seize  new  empirical 
phenomena,  and  sometimes  we  would  like  to 
adopt  new  perspectives  on  old  phenomena.  In  the 


study  of  politics,  we  are  continually  confronted 
with  novel  realities;  continually,  we  are  trying  to 
see  together  what  political  actors  tend  to  look  at  in 
isolation;  and  continually,  we  are  developing  theo¬ 
ries  that  redefine  the  relevant  boundaries  of  the 
political  world.  Hence,  our  incessant  demand  for 
conceptual  creativity. 

Like  the  political  realities  they  try  to  capture 
and  the  political  theories  they  strive  to  express, 
new  concepts  seldom  emerge  de  novo  as  radical 
breaks  with  the  past.  More  often  than  not,  schol¬ 
ars  introduce  new  concepts  by  modifying  old  ones. 
The  repertoire  of  incremental  conceptual  innova¬ 
tion  is  broad.  Authors  may  craft  new  concepts  by 
(a)  redefining  the  substantive  meaning  of  a  given 
concept,  (b)  importing  concepts  from  other  lan¬ 
guages  or  scientific  disciplines,  (c)  remodeling  the 
ladder  of  abstraction  by  adding  new  distinctions  or 
removing  established  ones,  (d)  introducing  dimin¬ 
ished  or  enhanced  subtypes,  or  (e)  changing  the 
property  space  of  a  concept  (by  altering  the  dis¬ 
tance  between  conceptual  poles,  shifting  their  loca¬ 
tion,  or  introducing  intermediate  categories). 

New  Terms 

When  we  have  reviewed  the  various  uses  of  a 
concept  and  its  semantic  neighbors  and  when  we 
have  understood  the  configuration  of  conceptual 
claims  others  accept  as  binding  and  chosen  those 
we  do,  we  sometimes  face  the  task  of  “selecting  the 
term  that  designates  the  concept”  (Sartori, 
1984/2009,  p.  123) — sometimes  but  not  always. 
Sometimes  we  have  terminological  choices;  some¬ 
times  we  do  not. 

The  names  of  grand  concepts  in  political  science, 
such  as  justice,  power,  and  rationality,  the  terms 
we  use  to  designate  them  (and  have  used  to  desig¬ 
nate  them  for  years,  decades,  or  even  centuries),  are 
given  and  fixed  at  present.  Scholars  often  disagree 
about  the  precise  meaning  they  associate  with  cer¬ 
tain  concepts  designated  by  certain  terms.  They 
embrace  contending  “conceptions”  of  the  concepts 
under  dispute.  The  tight  coupling  between  terms 
and  concepts,  however,  prevents  their  semantic 
disagreements  from  spilling  over  into  terminologi¬ 
cal  disagreements.  Switching  the  term  under  discus¬ 
sion  would  involve  switching  the  concept  under 
discussion.  Authors  who  discuss,  for  instance,  the 
concepts  of  justice,  power,  and  rationality  either 
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discuss  these  concepts  under  these  names,  or  they 
discuss  something  else.  If  they  move  even  to  neigh¬ 
boring  terms,  such  as  fairness,  authority,  and  intel¬ 
ligence,  they  move  into  different  (albeit  contiguous) 
conceptual  terrains.  When  terms  and  concepts  are 
welded  together  through  strong  bonds  of  semantic 
history  and  when  the  former  represent  the  latter 
without  equivalent  substitutes,  terminological 
choices  precede  semantic  debates.  By  choosing  a 
term,  we  bring  the  concept  it  stands  for  into  focus, 
which  then  allows  us  to  partake  in  ongoing  dis¬ 
putes  about  disputed  aspects  of  its  meaning. 

When  concepts  are  less  deeply  anchored  in  his¬ 
tory  and  theory,  the  sequence  can  be  inverted.  We 
can  first  determine  the  substantive  claims  we  are 
interested  in  and  then  settle  on  appropriate  names, 
either  by  selecting  among  available  terms  or  by 
crafting  new  ones.  If  we  articulate  our  conceptual 
claims  in  a  precise  manner,  we  put  ourselves  in  a 
position  of  selecting  the  precise  terms  whose  con¬ 
notations  correspond  most  closely  to  the  substance 
of  our  concept.  Semantic  fields  that  are  densely 
populated  with  near-synonyms  of  similar  standing 
(with  none  of  them  dominating  all  the  others)  offer 
the  most  latitude  for  fine-tuning  our  terminologi¬ 
cal  choices.  For  instance,  if  we  study  the  consolida¬ 
tion  of  political  regimes,  we  may  choose  alterna¬ 
tive  terms  that  lie  in  its  semantic  vicinity  yet 
emphasize  diverging  substantive  concerns.  If  we 
wish  to  stress  the  duration  of  regimes  over  time, 
we  may  talk  about  continuity,  endurance,  or  per¬ 
sistence.  If  we  wish  to  stress  their  ability  to  weather 
systemic  crises,  we  may  talk  about  resilience,  via¬ 
bility,  or  sustainability.  If  we  wish  to  stress  the 
process  character  of  consolidation,  we  may  talk 
about  stabilization,  institutionalization,  or 
entrenchment. 

While  refined  concepts  ask  for  refined  vocabu¬ 
laries,  new  concepts  demand  new  terms.  Whenever 
the  diffusion  of  new  theories  (e.g.,  game  theory)  or 
the  emergence  of  new  realities  (e.g.,  electoral 
autocracies)  induces  waves  of  conceptual  creativ¬ 
ity,  they  are  accompanied  by  waves  of  termino¬ 
logical  innovation.  The  rise  of  game  theory  has 
brought  a  whole  new  vocabulary  into  political 
science  that  includes  notions  such  as  backward 
induction,  bounded  rationality,  perfect  equilib¬ 
rium,  incomplete  information,  cooperative  games, 
focal  points,  mixed  strategies,  and  so  on.  The  rise 
of  electoral  authoritarian  regimes  since  the  end  of 


the  Cold  War  has  led  comparative  scholars  to  pro¬ 
pose  a  broad  assortment  of  labels  designed  to  cap¬ 
ture  these  novel  political  systems,  such  as  hybrid 
regimes,  semiauthoritarian  regimes,  inconsistent 
regimes,  multiparty  autocracies,  competitive 
authoritarianism,  and  institutionalized  dictator¬ 
ship.  Overall,  driven  by  changing  theories  as  well 
as  changing  realities,  terminological  innovation  is 
a  pervasive  phenomenon  in  political  science. 

Semantic  Constraints 

Linguistic  innovation,  in  political  science  as  else¬ 
where,  is  always  anchored  in  linguistic  tradition. 
While  deviating  from  tradition,  it  cannot  cut  itself 
loose  from  it.  Conceptual  innovations  in  political 
science,  whether  they  involve  new  definitions,  new 
concepts,  or  new  terms,  are  therefore  inevitably 
constrained  innovations  (whether  tightly  or 
loosely).  They  are  constrained  by  the  semantic  past 
of  concepts  (etymology  and  conceptual  history)  as 
well  as  by  their  semantic  present  (ordinary  and 
specialized  usage).  They  are  also  constrained  by  the 
systemic  logic  of  semantic  fields.  Modifying  indi¬ 
vidual  nodes  in  the  web  of  interdependent  terms 
that  constitutes  a  semantic  field  reverberates 
throughout  the  entire  web.  Any  pretension  to 
remake  our  conceptual  tools  has  to  recognize  this 
twin  linguistic  reality  that  concepts  are  rooted  in 
their  semantic  past  and  present  and  embedded  in 
their  semantic  environment.  Semantic  constraints 
are  not  straitjackets.  They  leave  room  for  critique 
and  creativity,  for  selective  changes  and  selective 
continuities — some  more,  some  less.  Yet  if  concep¬ 
tual  innovations  depart  too  sharply  from  estab¬ 
lished  usage  and  create  arbitrary  associations 
between  words  and  meanings,  they  will  fail  to  serve 
as  effective  tools  for  communicating  novel  insights. 
They  will  be  outright  incomprehensible,  liable  to 
provoke  systematic  misunderstandings,  or  devoid 
of  resonance  and  thus  condemned  to  oblivion. 

Conclusion 

“Clear  thinking  requires  clear  language,”  Sartori 
(1984/2009,  p.  102)  wrote  a  quarter  of  a  century 
ago.  To  begin  with,  it  requires  clear  thinking  about 
language.  If  we  learn  to  incorporate  conceptual 
self-awareness  into  our  canon  of  methodological 
sophistication,  we  will  do  better  theory  and  better 


382  Conditionality 


research.  We  may  not  reach  the  impossible  ideal  of 
a  fully  transparent,  clear,  and  precise  technical 
language,  and  as  a  scientific  community,  we  may 
be  too  diverse  and  competitive  to  build  another 
tower  of  Babel.  Yet  we  will  put  the  edifice  of 
political  science  on  more  solid  foundations. 
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Conditionality  in  political  science  refers  to  the  use 
of  the  conditions  an  actor  attaches  to  awarding 
benefits  to — or  to  not  imposing  costs  on — another 
actor  in  order  to  influence  his  behavior.  Benefits  are 
granted  only  if  the  target  actor  has  fulfilled  certain 
requirements  (ex  ante  conditionality).  Such  exter¬ 
nal  rewards  can  also  be  suspended  or  withdrawn  if 
the  recipient  reneges  on  his  commitment  (ex  post 
conditionality).  Rather  than  withdrawing  a  prom¬ 
ised  or  granted  reward  (positive  conditionality),  the 
violation  of  predetermined  conditions  can  also  be 
punished  (negative  conditionality).  Conditionality 
targets  either  performance  and  outcomes  (policy 
conditionality)  or  the  governance  and  administra¬ 
tive  systems  (political/democratic  conditionality). 

The  term  refers  more  narrowly  to  the  Bretton 
Woods  institutions  seeking  to  impose  the 
Washington  Consensus  around  the  world  or,  more 
broadly,  to  all  explicit  and  implicit  requirements 
for  lending,  including  covenants  for  project-based 
aid,  environmental  safeguards,  and  performance- 
based  aid  allocation.  This  entry  follows  the  broader 
understanding  of  conditionality  as  a  political 
instrument  used  by  national  governments  and 
international  organizations  to  advance  their  for¬ 
eign  policy  goals.  First,  the  concept  is  linked  back 
to  the  instrumental  logic  of  social  action  in  order 
to  specify  the  causal  mechanisms  through  which 
conditionality  is  to  influence  the  behavior  of  third 
countries  (i.e.,  those  who  are  not  members  of  the 
European  Union  [EU]).  Second,  the  emergence  of 
conditionality  as  a  key  instrument  of  development 
cooperation  and  foreign  policy,  more  broadly 
speaking,  is  traced.  Finally,  critical  evaluations 
regarding  the  effectiveness  and  legitimacy  of  con¬ 
ditionality  are  discussed  to  outline  how  these  have 
led  to  readjustments  of  the  instrument. 

Conditionality  and  the  Manipulation 
of  Utility  Calculations 

Conditionality  seeks  to  influence  the  behavior  of 
actors  through  the  manipulation  of  utility  calcula¬ 
tions,  by  providing  negative  and  positive  incen¬ 
tives.  Donor  states  or  organizations  seek  to  pro¬ 
mote  structural  adjustments  in  third  countries  by 
promising  or  granting  additional  benefits,  such  as 
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financial  and  technical  assistance,  a  loan,  debt 
relief,  or  conditional  membership  in  an  organiza¬ 
tion.  Or  they  incur  costs  through  economic  and 
diplomatic  sanctions.  “Reinforcement  through 
reward”  (Frank  Schimmelfennig  &  Ulrich  Sedel- 
meier,  2006)  or  “correction  through  punishment” 
differs  from  political  instruments  based  on  a  nor¬ 
mative  logic  of  social  action,  such  as  political  dia¬ 
logue,  which  seek  to  change  the  behavior  of  actors 
through  persuasion  and  learning  (Jeffrey  T.  Chec- 
kel,  2005).  Conditionality  and  political  dialogue 
both  aim  at  influencing  the  choice  of  actors, 
whether  they  are  informed  by  cost-benefit  calcula¬ 
tions  or  guided  by  normative  concerns  about 
socially  accepted  behavior.  They  thus  contrast 
with  coercion  and  assistance.  While  the  former 
does  not  leave  actors  any  choice,  the  latter  pro¬ 
vides  unconditional  financial  and  technical  aid  to 
enable  actors  to  make  choices. 

Evolution  of  Conditionality  as  an 
Instrument  of  External  Governance 

Conditionality  emerged  in  the  field  of  development 
cooperation.  In  the  1980s,  the  International 
Monetary  Fund  and  the  World  Bank  increasingly 
linked  loans,  debt  relief,  and  financial  aid  to  certain 
conditions,  known  as  structural  adjustments,  which 
included  aid  effectiveness  (e.g.  anticorruption  mea¬ 
sures),  financial  austerity,  or  the  privatization  of 
key  public  services.  While  international  donors 
have  used  conditional  aid  to  promote  selective 
reforms  deemed  necessary  for  good  governance, 
the  EU  turned  conditionality  into  the  key  instru¬ 
ment  in  its  attempt  to  support  the  transformation 
in  postcommunist  countries  after  the  end  of  the 
Cold  War  (Heather  Grabbe,  2006).  The  Maastricht 
Treaty  had  made  any  contractual  relations  of  the 
EU  with  third  countries  conditional  on  the  respect 
for  human  rights,  the  rule  of  law,  and  democracy 
(see  Tanya  Borzel  &  Thomas  Risse,  2009).  States 
applying  for  membership  in  the  EU  have  to  fulfill 
the  so-called  Copenhagen  Criteria  (democratic 
conditionality).  Once  the  EU  agreed  to  open  acces¬ 
sion  negotiations,  accession  conditionality  gave  the 
European  Commission  a  powerful  tool  to  push 
candidate  countries  toward  downloading  the 
comprehensive  acquis  communautaire  and  intro¬ 
ducing  institutional  reforms  (see  Schimmelfennig 
&  Sedelmeier,  2006). 


Conditionality  is  also  applied  in  the  EU’s 
attempt  to  transform  the  domestic  structures  of  its 
neighbors,  which  do  not  have  a  membership  per¬ 
spective  (yet).  The  European  Neighbourhood 
Policy  offers  the  Mediterranean  countries  and  the 
Western  Newly  Independent  States  financial  and 
technical  assistance,  progressive  integration  into 
the  internal  market  and  its  regulatory  structures, 
and  closer  diplomatic  relations  (e.g.,  association) 
in  return  for  economic  and  political  reforms. 
Conversely,  it  may  (threaten  to)  suspend  bilateral 
agreements,  withhold  assistance,  and  impose  polit¬ 
ical  sanctions  (e.g.,  visa  bans).  Donor-recipient 
relationships  in  the  European  Neighbourhood 
Policy  are  less  tight  than  in  EU  enlargement  policy, 
but  association  agreements  still  create  closer  ties 
than  most  international  instruments. 

Evaluation  and  Adjustments 
of  the  Instrument 

The  efficiency  of  conditionality  has  become  a  key 
concern  for  analysts  and  practitioners.  The 
approach  as  developed  in  the  1980s  has  increas¬ 
ingly  been  called  into  question.  There  is  wide  con¬ 
sensus  over  the  limited  effectiveness  of  both  policy 
and  political  conditionality  in  development  coop¬ 
eration.  Donor  leverage  cannot  substitute  for 
domestic  political  will,  leading  to  the  call  for  stron¬ 
ger  government  ownership  and  ex  post  evaluation 
of  outcomes.  Research  on  EU  conditionality  points 
to  similar  limitations  regarding  the  impact  of  con¬ 
ditionality.  While  target  states  have  formally 
adopted  a  massive  amount  of  EU  legislation,  this  is 
often  not  properly  applied  and  enforced  and,  thus, 
has  not  changed  the  behavior  of  actors  (Gerda 
Falkner,  Oliver  Treib,  &  Elisabeth  Holzleithner, 
2008).  Weak  norm  internalization  has  given  rise  to 
concerns  about  shallow  Europeanization.  As  in 
development  cooperation,  the  limited  administra¬ 
tive  capacities  of  the  candidate  countries  have 
curbed  the  domestic  impact  of  EU  accession, 
which  accounts  for  its  differential  outcome 
(Schimmelfennig  &  Sedelmeier,  2006). 

Overall,  the  effectiveness  of  conditionality  seems 
to  depend  on  a  number  of  specific  scope  condi¬ 
tions.  The  promised  rewards  have  to  be  credible 
and  big  enough  to  pay  off  the  compliance  costs;  the 
same  applies  to  the  threat  of  sanctions.  Moreover, 
there  needs  to  be  a  coalition  of  actors  within  the 
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target  country  that  is  willing  to  use  external  incen¬ 
tives  to  push  for  compliance  domestically  (Jan 
Pronk,  2001).  These  conditions  are  seldom  met, 
even  among  the  current  EU  candidate  countries, 
including  the  Western  Balkans  and  Turkey. 

In  reaction  to  the  limited  effectiveness  of  condi¬ 
tionality,  issues  of  recipient  ownership,  greater 
selectivity  based  on  country  performance,  stronger 
results  orientation,  and  thus  a  shift  to  ex  post  con¬ 
ditionality  have  gained  relevance.  While  the  World 
Bank  and  the  International  Monetary  Fund  aim  to 
streamline  conditionality  and  rely  on  greater  selec¬ 
tivity  and  new  approaches,  such  as  programmatic 
lending,  the  European  Commission  attempts  to 
reconcile  a  predictable  budget  support  for  low- 
income  countries  with  a  stronger  performance 
orientation  that  bases  disbursements  on  outcomes. 
Other  critical  observers  call  for  the  abandonment 
of  conditionality  altogether  and  argue  instead  for 
broad  political  dialogue  and  the  allocation  of 
funds  on  the  basis  of  need. 

The  legitimacy  of  conditionality  is  equally  con¬ 
tested.  On  the  one  hand,  the  output  legitimacy  of 
external  interference  has  come  under  fire  because 
structural  adjustments  have  failed  to  produce  the 
intended  outcomes  or  have  even  been  blamed  for 
causing  negative  effects.  On  the  other  hand,  the 
input  legitimacy  of  conditionality  is  challenged  by 
international  norms  of  state  sovereignty  and  self- 
determination.  This  legitimacy  problem  is  exacer¬ 
bated  if  externally  promoted  norms  and  standards 
are  not  equally  promoted  inside  the  donor  organi¬ 
zation  or  state  (double  standards).  At  the  same 
time,  conditionality  has  a  legitimating  function 
inside  the  donor  states  to  justify  payments  to  third 
countries,  especially  under  high  uncertainty  regard¬ 
ing  the  effect  on  external  actors.  There  seems  to  be 
a  trade-off  between  legitimacy  concerns  in  recipi¬ 
ent  and  donor  countries  that  creates  a  serious  nor¬ 
mative  dilemma  and  adds  to  the  controversy  sur¬ 
rounding  the  use  of  conditionality  both  among 
policymakers  and  researchers. 
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Conditions,  Necessary 
and  Sufficient 


Social  science,  as  all  science,  is  a  continuous  quest 
for  an  explanation  and  understanding  of  the  world 
around  us.  This  search  is  carried  out  through 
Benjamin  Most  and  Harvey  Starr’s  “research  triad” 
of  theory,  logic,  and  research  design,  all  of  which 
are  central  to  both  hypotheses  and  results.  The  key 
element  of  explanation  and  understanding  is  causa¬ 
tion.  Social  scientists  are  concerned  with  causation 
as  applied  both  to  individual  events  or  cases  and  to 
classes  or  groups  of  events.  The  causal  relationship 
may  take  many  forms.  Two  of  the  most  prominent, 
important,  and  commonly  used  forms  of  the  causal 
relationship  involve  necessary  and/or  sufficient 
relationships.  As  emphasized  in  the  work  of  Most 
and  Starr  among  others,  analyses  must  be  con¬ 
cerned  with  the  form  of  the  relationship.  David 
Hume’s  classic  definition  of  cause  involves  the 
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“constant  conjunction”  of  an  object  followed  by 
another,  where  all  objects  similar  to  the  first  are 
followed  by  objects  similar  to  the  second.  Causation 
is  seen  as  including  three  elements,  the  first  being 
the  existence  of  correlation  between  two  factors  or 
variables.  That  is,  changes  or  attributes  in  one  fac¬ 
tor  are  associated  with  changes  or  attributes  in 
another.  Correlation  is  one  possible  form  of  any 
specific  relationship.  Correlation  itself  may  take 
several  forms,  such  as  linear  or  curvilinear.  The 
second  element  of  cause  involves  the  temporal 
dimension — the  proposed  causal  factor  must  take 
place  before  the  “effect”  or  the  phenomenon  to  be 
explained.  Cause  must  precede  effect.  The  third  ele¬ 
ment  is  the  most  rigorous  and  difficult — the  elimi¬ 
nation  of  other  explanatory  factors  beyond  those 
proposed  (the  central  idea  of  “control”).  Below,  the 
various  implications  of  these  relationships  and 
some  current  applications  in  political  science  are 
discussed. 

Necessity  and  sufficiency  are  themselves  differ¬ 
ent  forms  of  the  causal  relationship.  They  represent 
different  forms  of  constant  conjunction.  Scholars 
have  argued  that  research  designs  (the  nature  of  the 
theoretical  logic  and  research  hypotheses  being 
employed)  built  around  these  different  forms  of 
causation  will  be  affected  by  the  nature  and  types  of 
the  cases  selected,  the  controls  employed  for  dealing 
with  possible  other  (explanatory)  factors,  and  the 
methods  used  for  evaluating  the  theory  and  pro¬ 
posed  research  hypotheses.  All  three  elements  of  the 
research  triad — theory,  logic,  and  research  design — 
are  thus  affected  by  the  form  of  the  relationship, 
especially  looking  at  necessary  relationships  in  dis¬ 
tinction  from  sufficient  relationships  or  even  simple 
correlational  relationships.  The  differences  between 
how  necessary  and  sufficient  conditions  are  treated 
are  additionally  important  in  regard  to  the  research 
designs  needed  to  investigate  inference  in  the  small- 
N  studies  that  characterize  much  of  comparative 
politics  (and  qualitative  analyses  more  generally). 
The  logics  of  necessary  and  sufficient  conditions 
need  to  be  investigated  in  order  to  cross  the  bound¬ 
aries  between  political  science  subfields  as  well  as 
the  quantitative-qualitative  divide. 

Definitions  and  Logics 

The  most  basic  definition  of  sufficiency  states,  “if 
X  then  Y.”  That  is,  X  always  leads  to  Y,  but  Y  is 
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Figure  2  X  Is  Necessary  for  Y  (Only  if  XThen  Y) 

not  always  preceded  by  X.  For  example,  propo¬ 
nents  of  the  theory  of  democratic  peace  argue  that 
democracies  have  peaceful  relations  with  one 
another — that  is,  a  pair  of  democracies  (X)  will 
lead  to  peace  (Y),  which  is  defined  as  the  absence  of 
war.  However,  peace  may  result  from  many  other 
factors  (e.g.,  power  preponderance,  lack  of  contact 
or  opportunity),  so  that  it  may  occur  without  the 
presence  of  a  democratic  dyad.  The  presence  of 
democratic  dyads  predicts  the  presence  of  peace:  X 
“yes”  and  Y  “yes,”  as  in  Cell  A  of  Figure  1.  A  much 
less  important  prediction  (or  expectation)  is  that 
the  absence  of  a  democratic  dyad  is  followed  by  an 
absence  of  peace:  the  “no/no”  Cell  D  of  Figure  1. 
Cell  B  in  Figure  1,  where  a  democratic  dyad  is  not 
present  but  peace  is,  is  not  relevant  for  sufficiency, 
because  Y  may  occur  without  X  (i.e.,  when  X  is  a 
sufficient  but  not  necessary  condition  for  Y).  The 
key  relationship  is  in  Cell  B,  when  X  does  occur  but 
Y  does  not.  This  cell  will  be  empty  when  X  is  a  suf¬ 
ficient  condition  for  Y.  Thus,  a  research  design  that 
looks  for  sufficient  relationships  must  look  at  the 
full  range  of  Y,  the  dependent  variable — when  Y 
occurs  and  when  it  does  not.  Herein  rests  the  strong 
admonition  by  scholars  that  the  researcher  cannot 
select  on  the  dependent  variable  (which  would  toss 
out  the  ability  to  look  at  the  key  cell  that  permits 
the  investigation  of  sufficiency). 
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In  many  ways,  necessity  produces  reverse  expec¬ 
tations.  Necessity  means  the  following:  only  if  X, 
then  Y.  That  is,  Y  is  always  preceded  by  X,  but  X 
does  not  always  lead  to  Y.  Bruce  Bueno  de 
Mesquita’s  theory  of  positive  expected  utility  (EU) 
notes  that  positive  EU  is  a  necessary  condition  for 
war — that  war  (Y)  is  always  preceded  by  positive 
EU  on  the  part  of  decision  makers  going  to  war 
(X).  In  essence,  necessity  asks  us  to  look  for  pre¬ 
conditions  or  prerequisites  of  some  phenomenon 
Y.  It  should  be  noted  that  the  authors  in  Gary 
Goertz  and  Elarvey  Starr’s  edited  volume  define 
necessity  in  various  ways  and  using  different  log¬ 
ics:  set  theory,  calculus,  a  probabilistic  logic,  fuzzy 
logic/sets,  and  Aristotelian  two-valued  logic. 
Returning  to  the  current  example,  simply  having 
positive  EU  does  not  mean  that  there  will  be  war, 
for  a  variety  of  domestic  and  external  factors, 
again  including  lack  of  opportunity.  As  seen  in 
Figure  2,  Cell  A,  the  presence  of  the  causal  factor 
(positive  EU),  predicts  or  is  followed  by  the  pres¬ 
ence  of  war,  or  the  dependent  variable.  A  much 
less  important  prediction  (or  expectation)  is  that 
the  absence  of  the  independent  variable  or  causal 
factor  X  is  related  to  the  expectation  or  prediction 
that  the  effect,  or  dependent  variable,  Y  is  also 
absent  (Cell  D).  Cell  C  presents  the  condition 
where  X  is  present,  but  there  is  no  effect  on  Y.  This 
is  not  relevant  to  necessity  because  X  does  not 
always  lead  to  Y  (indeed,  this  has  been  a  major 
confusion  in  critiques  of  the  early  EU  models  of 
Bueno  de  Mesquita  and  colleagues).  For  necessity, 
the  key  relationship  is  when  the  causal  factor  X 
does  not  occur  but  the  effect  Y  still  appears — Cell 
B.  If  X  is  a  necessary  condition  for  Y,  this  cell  or 
set  will  be  empty.  Note  that  this  means  that  the 
whole  range  of  the  independent  variable  X  must  be 
included.  One  must  see  cases  where  X  does  not 
occur.  Necessity  then  can  be  tested  using  only  the 
occurrence  of  Y,  the  dependent  variable.  With 
questions  of  necessity,  therefore,  the  researcher 
may  select  on  the  dependent  variable  but  cannot 
select  only  on  the  occurrence  of  X,  the  independent 
variable.  This  distinction  between  sufficiency  and 
necessity  suggests  that  case  selection  should  be 
guided  by  the  outcome  that  the  researcher  is  inter¬ 
ested  in  and  how  the  logic  of  the  proposed  causal 
mechanism  identifies  that  outcome  (as  argued  by 
Goertz  and  colleagues).  As  noted  above,  one  evi¬ 
dent  way  to  do  this  is  with  set  theory.  Thus,  a 


strong  set-theoretic  basis  to  the  logic  of  necessary 
relationships  is  found. 

It  should  be  clear  that  the  logics  of  necessity  dif¬ 
fer  from  those  of  sufficiency.  Much  of  the  analyses 
performed  on  relationships  of  sufficiency  use 
continuous  variables,  looking  to  find  linear  rela¬ 
tionships  on  normally  distributed  values  and  to 
maximize  the  explanation  of  variance  in  the  depen¬ 
dent  variable  (these  have  been  called  frequentist 
statistical  methodologies).  Necessity  more  often 
looks  at  the  presence  or  absence  of  an  explanatory 
factor  X.  That  is,  a  necessary  analysis  looks  to  see  if 
values  on  a  measure  have  crossed  a  threshold, 
reached  a  “turning  point,”  or  have  come  to  a  “criti¬ 
cal  juncture”  (see  also  the  “powder  keg”  explana¬ 
tions  discussed  by  Gary  Goertz  and  Jack  Levy). 
Goertz  thus  argues  that  necessary  conditions  and  the 
statistical  models  related  to  them  set  out  different 
sorts  of  hypotheses  from  standard  quantitative 
analyses  of  sufficient  relationships.  Rather  than 
tracking  how  changes  in  any  X  are  related  to 
changes  in  some  Y,  most  necessary  analyses  focus 
on  whether  X  makes  Y  possible  or  more  likely. 
Goertz  contrasts  a  “necessary  condition  hypothe¬ 
sis,”  which  might  propose  that  some  minimum  level 
of  variable  X  is  necessary  for  an  outcome  variable 
Y,  with  a  “correlational  hypothesis,”  which  would 
propose  that  the  greater  the  value  of  some  X  then 
the  greater  the  value  of  some  Y.  Another  example  of 
analytical  differences  has  been  identified  by  Charles 
Ragin,  among  others.  He  has  argued  that  while 
standard  statistical  techniques  are  often  applied  to 
deal  with  (i.e.,  remove)  heteroskedasticity,  this  is  a 
central  condition  and  characteristic  of  necessary 
relationships  and  thus  needs  to  be  part  of  the  analy¬ 
sis  and  not  removed. 

Necessary  and  sufficient  hypotheses,  then,  often 
work  within  different  causal  chains.  Correlational 
hypotheses  tend  to  focus  on  classes  of  events,  while 
necessary-condition  hypotheses  tend  to  focus  on 
causation  within  a  single  case  (but  which  may  also 
be  aggregated  probabilistically  across  such  cases). 
The  latter  is  often  associated  with  process  tracing 
within  cases  to  identify  the  causal  mechanisms  at 
work  and  is  thus  of  importance  to  comparative 
politics,  qualitative  approaches,  and  comparative- 
historical  methodology.  Goertz  has  demonstrated 
the  range,  ubiquity,  and  importance  of  necessary- 
condition  hypotheses  found  throughout  the  politi¬ 
cal  science  literature.  They  often  take  the  form  of 
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counterfactuals,  especially  in  case  studies  focusing 
on  explaining  individual  events.  The  necessity  rela¬ 
tionship  claims  that  only  if  X  can  there  be  Y.  The 
counterfactual  claim  holds  that  without  X,  if  X  had 
not  happened,  Y  would  not  have  occurred.  Goertz 
and  Levy  contrast  a  necessary-conditions  counter- 
factual  approach  (looking  for  necessary  conditions 
within  individual  cases)  with  a  nomological  cover¬ 
ing-law  approach  (based  on  the  constant  conjunc¬ 
tion  of  covering  laws  stated  and  tested  with  larg e-n 
statistical/probabilistic  methods).  More  than  2 
decades  ago,  Most  and  Starr  demonstrated  that 
researchers  will  produce  meaningless  findings  when 
creating  research  designs  that  can  test  only  for  nec¬ 
essary  conditions  while  stating  their  hypotheses  in 
terms  of  sufficiency  (and  vice  versa).  That  is,  the 
cases  selected  and  the  methods  used  must  match  the 
questions  asked.  The  method  of  selecting  cases  will 
logically  preclude  some  questions  and  some  designs. 

Scholars  across  political  science  subfields  as  well 
as  different  research  traditions  have  come  more  and 
more  to  seek  designs  that  uncover  complex  causal¬ 
ity,  recognizing  that  there  may  be  multiple  causal 
paths  to  the  same  outcome.  This  is  often  called 
equifinality.  Contingency  plays  a  large  part  in  such 
causal  complexity — that  some  theory  or  hypothesis 
will  hold  only  under  certain  conditions  or  that  a 
pair  of  contending  theories  may  both  be  true  under 
different  conditions  or  contingencies  (or  what  Most 
and  Starr  call  “nice  laws”).  In  many  cases,  contin¬ 
gency  is  expressed  as  a  necessary  condition,  without 
which  some  effect,  outcome,  or  other  dependent 
condition  could  not  occur.  This  is  why  necessity  is 
a  key  component  of  a  philosophical  view  of  causal¬ 
ity  becoming  more  prominent  in  the  literature — the 
INUS  view  of  causation.  An  INUS  explanation  is  an 
“Insufficient  but  Nonredundant  [i.e.,  necessary] 
part  of  an  Unnecessary  but  Sufficient  condition,” 
for  which  James  Mahoney  and  Gary  Goertz  (2006, 
p.  232)  provide  a  succinct  explanation: 

An  INUS  cause  is  neither  individually  necessary  nor 
individually  sufficient  for  an  outcome.  Instead  it  is 
one  cause  within  a  combination  of  causes  that  are 
jointly  sufficient  for  an  outcome.  Thus,  with  this 
approach,  scholars  seek  to  identify  combinations  of 
variable  values  that  are  sufficient  for  outcomes  of 
interest.  The  approach  assumes  that  distinct 
combinations  may  each  be  sufficient,  such  that 
there  are  multiple  paths  to  the  same  outcome. 


Research  findings  with  INUS  causes  can  often  be 
formally  expressed  through  Boolean  equations 
such  as  Y  =  (A  AND  B  AND  C)  OR  (C  AND  D 
AND  E). 

The  logic  of  necessity  and  sufficiency,  and  mul¬ 
tiple  causal  paths,  may  thus  be  clearly  captured  by 
Boolean  logic  and  methods  (most  fully  introduced 
in  the  work  of  Charles  Ragin).  The  conjunctive 
Boolean  AND  denotes  necessary  conditions.  Thus, 
in  the  above  equation,  A,  B,  and  C  are  all  neces¬ 
sary  conditions  in  combination,  to  be  sufficient  for 
Y.  But  note,  so  are  C,  D,  and  E  in  combination. 
The  Boolean  disjunctive  OR  indicates  that  either 
of  these  two  combinations  is  sufficient  for  Y — mul¬ 
tiple  paths  to  the  same  outcome.  The  reader  should 
note  that  the  philosophy  of  science  dealing  with 
concepts  holds  a  similar  position — that  a  good 
definition  of  a  concept  is  one  with  necessary  condi¬ 
tions  that  are  jointly  sufficient. 

Conclusion 

This  brief  overview  has  attempted  to  present  the 
most  recent  reflections  and  findings  on  necessity 
and  sufficiency,  setting  out  the  basic  phenomena 
and  issues  that  are  currently  being  discussed  in  the 
political  science  literature.  The  reader  who  investi¬ 
gates  the  works  cited  and  suggested  below  and 
follows  the  footnotes  and  references  in  those  mate¬ 
rials  will  obtain  the  most  comprehensive  picture  of 
current  thinking  on  necessity  and  sufficiency. 
These  works  reinforce  the  observation  of  Most  and 
Starr  that  there  is  a  need  for  a  better  understanding 
of  logical  linkages  that  serve  as  alternatives  to  the 
standard  statistical  techniques  commonly  employed 
in  large-N  statistical  analyses  with  an  almost  exclu¬ 
sive  focus  on  sufficient  relationships. 

These  newer  works  provide  the  theory,  logic, 
and  findings  that  support  Mahoney’s  refutation  of 
a  set  of  “erroneous  beliefs”  about  necessity  and 
sufficiency.  Based  on  the  present  discussion  and 
the  works  cited,  one  can,  in  fact,  say  that 

1.  necessary  and  sufficient  causes  do  exist  and  are 
found  in  interesting  hypotheses; 

2.  necessary  and  sufficient  causes  are  not  trivial, 
tautological,  or  irrelevant  (see  Goertz’s  work  for 
an  in-depth  conceptualization  of  the  trivialness, 
importance,  and  relevance  of  necessity  and 
sufficiency  and  ways  to  measure  them); 
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3.  necessary  and  sufficient  causes  are  neither 
deterministic  nor  inconsistent  with  probabilistic 
analysis,  and  they  can  be  measured  continuously 
(see  especially  Ragin);  and 

4.  subsequently,  methods  do  exist  to  test 
hypotheses  of  necessity  or  sufficiency,  including 
tests  for  significance/trivialness. 

For  Points  3  and  4,  see  especially  work  by  Mahoney 
as  well  as  Goertz. 

Finally,  and  perhaps  most  important,  newer 
works  on  necessity  and  sufficiency  point  out  ways 
in  which  research  can  be  designed  to  capture  cau¬ 
sality  (in  its  various  forms)  across  small-N  and 
large-N  analyses  and  cross  boundaries  between 
so-called  qualitative  and  quantitative  analysis.  In 
so  doing,  this  work  also  enhances  boundary  cross¬ 
ing  between  international  relations  and  compara¬ 
tive  politics  at  a  time  when  greater  and  greater 
attention  is  being  directed  toward  two-level  analy¬ 
sis,  taking  into  account  the  impact  of  internal/ 
domestic  factors  and  context  along  with  external/ 
international  factors  and  context. 
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Configurational 
Comparative  Methods 


Configurational  comparative  methods  (CCMs) 
designate  both  a  methodological  approach  and  a 
set  of  specific  techniques  enabling  systematic  cross¬ 
case  comparison  while  also  taking  into  account 
within-case  complexity.  They  have  been  developed, 
initially,  for  small-  and  intermediate-N  research 
designs  in  the  social  sciences  (Benoit  Rihoux  & 
Charles  Ragin,  2009).  The  whole  CCMs  enterprise 
was  initiated  by  Ragin  (1987),  who  outlined  a 
“synthetic  strategy”  that  could  bridge  case-oriented 
and  variable-oriented  approaches  to  social  phe¬ 
nomena.  He  also  developed  a  first  technique,  QCA 
(qualitative  comparative  analysis — now  labeled 
crisp-set  QCA  [csQCAJ),  and  applied  it  to  macro¬ 
level  cases  such  as  countries.  Below,  the  basic 
assumptions,  specific  techniques,  and  applications, 
and  recent  developments  are  discussed. 

Core  Assumptions  and  Goals 

The  whole  ambition  behind  CCMs  is,  on  the  one 
hand,  to  make  qualitative  case  analysis  more  sys¬ 
tematic  and  to  offer  case-oriented  researchers  some 
tools  that  enable  them  to  systematically  compare 
thick,  complex  cases.  On  the  other  hand,  it  strives 
to  offer  an  alternative  way  to  envisage  causal  argu¬ 
ments  made  in  mainstream  quantitative  (read  sta¬ 
tistical)  social-scientific  work.  The  overarching  goal 
of  CCMs  is  to  unravel  causal  complexity  by  apply¬ 
ing  set-theoretic  methods  to  cross-case  evidence.  In 
more  concrete  terms,  the  different  QCA  techniques 
developed  within  CCMs  enable  one  to  identify  core 
combinations  of  conditions  (input  variables),  which 
explain  the  variation  of  a  given  outcome  (output 
variable)  of  interest.  Therefore,  QCA  techniques 
are  geared  toward  the  identification  of  so-called 
specific  connections  between  conditions  and  out¬ 
comes.  In  contrast  to  most  statistical  techniques, 
they  are  not  geared  toward  the  establishment  of 
general,  tendential,  or  correlational  connections 
between  each  independent  variable,  on  the  one 
hand,  and  the  dependent  variable,  on  the  other 
(Ragin,  2008;  Rihoux,  2008). 

Causality,  or  linkages  between  conditions  and 
the  outcome,  more  generally,  are  assumed  to  be 
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multiple  and  conjunctural.  This  implies  the  follow¬ 
ing  assumptions:  (a)  Most  often,  it  is  a  combina¬ 
tion  of  conditions  (rather  than  a  single  condition) 
that  generates  the  outcome;  (b)  several  different 
combinations  of  conditions  may  produce  the  same 
outcome;  and  (c)  there  is  no  fixed  effect  of  a  given 
condition  on  the  outcome — depending  on  how  it  is 
combined  with  other  conditions,  different  values 
of  this  condition  can  produce  the  outcome.  The 
concrete  goal  of  CCMs,  from  this  perspective,  is  to 
identify  those  different  causal  paths  leading  to 
some  outcome  of  interest,  each  path  being  relevant 
in  its  own  way,  regardless  of  the  number  of  cases 
it  covers.  Thus,  this  concept  of  causality  does  not 
take  on  board  most  of  the  core  assumptions  under¬ 
lying  mainstream  statistical  analysis,  such  as  per¬ 
manent  causality,  uniformity  of  causal  effects,  unit 
homogeneity,  additivity,  linearity,  and  causal  sym¬ 
metry  (Dirk  Berg-Schlosser,  Gisele  De  Meur, 
Benoit  Rihoux,  &  Charles  Ragin,  2009). 

The  logical  foundations  of  CCMs  can  be  found 
in  John  Stuart  Mill’s  “canons,”  in  particular  the 
method  of  agreement,  the  method  of  difference, 
and  the  joint  method  of  agreement  and  difference, 
which  are  all  logical  ways  to  systematically  contrast 
and  match  cases,  so  as  to  establish  common  causal 
relationships.  In  practical  terms,  in  CCMs,  this  is 
translated  into  tests  for  necessity  and  sufficiency. 

Techniques 

The  overall  rationale  behind  the  QCA  techniques 
is  that,  through  some  step-by-step  logical  opera¬ 
tions  (based  on  Boolean  algebra  or  set-theoretical 
logic),  one  is  able  to  reduce  complex  data  tables  to 
shorter  combinations  of  conditions  explaining  the 
outcome  of  interest.  Therefore,  at  the  heart  of 
CCMs  lies  the  quest  for  parsimony — but  not  in 
such  a  way  that  within-case  complexity  would  be 
sacrificed.  So  far,  three  specific  techniques  have 
been  developed  under  the  umbrella  term  of  QCA: 
crisp-set  QCA  (csQCA),  the  initial  technique  that 
uses  Boolean,  that  is,  dichotomous,  sets;  multi¬ 
value  QCA  (mvQCA),  which  allows  the  use  of 
multiple-category  conditions;  and  fuzzy-set  QCA 
(fsQCA),  which  uses  finer  grained  fuzzy-set  mem¬ 
bership  scores. 

These  techniques  can  be  exploited  in  at  least  five 
different  ways.  The  first  two  uses  are  more  descrip¬ 
tive  or  classificatory.  One  can  use  QCA  techniques, 


first,  simply  to  summarize  data  through  the  pro¬ 
duction  of  a  truth  table  (or  table  of  configura¬ 
tions),  which  can  be  examined  to  discover,  for 
instance,  how  cases  cluster  together,  or  to  elabo¬ 
rate  typologies.  Second,  QCA  techniques  can  be 
used  as  a  tool  to  assess  the  coherence  of  the  data — 
in  particular  through  the  detection  of  so-called 
contradictory  configurations — which  lead  the 
researcher  to  reflect  on  the  cases  and  the  theory. 

The  three  other,  more  full-fledged  uses  of  QCA 
techniques  imply  some  sort  of  testing  of  a  causal 
argument:  checking  hypotheses  or  existing  theo¬ 
ries,  quickly  testing  some  conjectures  made  by  the 
researcher,  and  developing  new  theoretical  argu¬ 
ments.  Thus,  QCA  techniques  can  be  used  for  both 
theory  testing  and  theory  building. 

In  a  full  QCA  application,  whatever  the  specific 
technique,  the  core  steps  are  basically  the  same 
(Benoit  Rihoux  &  Bojana  Lobe,  2009).  In  a  first 
phase,  upstream  of  QCA  proper,  a  comparative 
research  design  is  chosen,  cases  are  selected  in  a 
purposeful  way,  sufficient  intimacy  is  gained  with 
each  one  of  the  cases,  the  outcome  of  interest  is 
defined,  the  conditions  are  selected  (model  specifi¬ 
cation),  and  the  cases  and  the  model  are  visualized 
or  synthesized  in  various  ways.  These  first  opera¬ 
tions  already  result  in  some  level  of  reduction  of 
complexity  by  transforming  thick,  complex  case 
narratives  to  analytical  (i.e.,  variable-based)  data. 

The  second  core  phase  corresponds  to  the  more 
formal,  computer-run  part  of  QCA.  There  are  sev¬ 
eral  steps  to  be  performed:  operationalizing  the 
outcome  and  condition  variables  (dichotomization 
or  some  finer  grained  calibration),  producing  the 
truth  table,  solving  the  contradictory  configurations 
so  as  to  obtain  a  contradiction-free  truth  table, 
minimizing  the  truth  table  (with  and  without  the 
inclusion  of  logical  remainders,  i.e.,  nonobserved 
cases),  solving  contradictory  simplifying  assump¬ 
tions  produced  by  the  use  of  logical  remainders, 
and  finally  obtaining  minimal  formulae  that  con¬ 
tain  those  core,  most  parsimonious  combinations  of 
conditions  leading  to  the  outcome  of  interest. 

The  third  phase,  downstream,  consists  of  differ¬ 
ent  ways  of  moving  back  to  more  (case)  complex¬ 
ity:  identifying  more  crucial  conditions,  interpret¬ 
ing  the  minimal  formulae  in  a  case-by-case  man¬ 
ner,  interpreting  cross-case  patterns,  performing 
limited  historical  or  modest  generalizations,  and 
finally,  possibly,  going  through  the  next  cycle  of 
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QCA,  for  example,  through  the  examination  of 
some  more,  rather  proximate  cases. 

All  these  steps  should  be  performed  in  a  reflex¬ 
ive,  iterative  way,  and  with  frequent  loops  back  to 
theoretical  and/or  case-based  knowledge.  As  QCA 
techniques  have  been  consolidating  and  as  applica¬ 
tions  have  been  becoming  more  diverse,  a  body  of 
good  practices  has  also  been  consolidated. 

Applications 

The  applications  of  CCMs  have  become  increas¬ 
ingly  diverse.  More  than  two  thirds  of  them  are 
found  in  various  subfields  of  political  science — in 
particular,  comparative  politics  and  policy  studies. 
The  breadth  of  applications  has  expanded  in  several 
respects.  In  terms  of  number  of  cases,  though  QCA 
techniques  were  initially  geared  toward  small-  and 
intermediate-N  designs,  larger-N  applications  have 
also  proved  successful.  In  terms  of  the  nature  of 
cases  envisaged,  most  applications  so  far  envisage 
macrolevel  (e.g.,  countries,  political  systems,  and 
policy  processes)  or  mesolevel  (e.g.,  party  organiza¬ 
tions,  social  movements,  and  collective  actors) 
cases.  However,  some  researchers  are  beginning  to 
apply  QCA  on  microlevel  cases  as  well.  Another 
type  of  broadening  is  related  to  the  fact  that  some 
users  who  are  more  application  oriented  are  also 
becoming  interested  in  such  methods,  for  example, 
in  the  field  of  policy  evaluation  and  monitoring, 
beyond  the  academic  sphere.  With  regard  to  the 
number  of  conditions  included  in  the  analysis,  there 
is  also  broad  variation,  though  a  good  practice  dic¬ 
tates  that  QCA  models  be  kept  relatively  short  due 
to  the  limited-diversity  issue — that  is,  too  long  a 
model  will  not  be  in  the  interests  of  parsimony. 

Gradually,  applications  have  become  techni¬ 
cally  more  sophisticated.  Two  main  computer 
programs  have  been  developed  and  are  now  used 
for  most  applications:  TOSMANA  (Tools  for 
Small-N  Analysis;  for  csQCA  and  mvQCA)  and 
FSQCA  (fuzzy-set  qualitative  comparative  analysis 
for  csQCA  and  fsQCA).  Some  other  interfaces  are 
also  being  gradually  exploited,  such  as  R  and 
STATA. 

Debates  and  Further  Developments 

As  CCMs  are  still  rather  recent  and  as  they  are 
being  perceived  by  some  as  a  challenge  to  some 


other  methodologies,  they  are  subject  to  a  lot  of 
debate.  The  core  critiques  concentrate  in  particular 
on  the  following  points:  the  dichotomization  of  the 
data,  the  use  of  nonobserved  cases,  case  sensitivity, 
the  difficulty  in  selecting  conditions,  the  inability 
of  CCMs  to  actually  establish  causal  linkages  and 
broad  generalizations  (inference),  and  the  nonex¬ 
plicit  inclusion  of  the  time  dimension.  Naturally, 
as  with  all  methods,  there  are  limitations  and 
weaknesses  with  CCMs,  but  many  strategies  have 
been  developed  to  overcome  most  of  these  draw¬ 
backs.  One  should  also  bear  in  mind  the  specific 
goals  and  assumptions  of  CCMs.  For  instance, 
contending  that  CCMs  are  weaker  than  statistical 
techniques  because  they  do  not  produce  inference 
(generalization  from  a  sample  to  a  whole  popula¬ 
tion)  is  a  misplaced  criticism,  because  this  is  not 
the  goal  of  CCMs — they  follow  a  more  case-  and 
diversity-oriented  logic. 

Many  developments  and  improvements  of 
CCMs  are  under  way — the  launching  of  mvQCA 
and  fsQCA  is  an  example;  these  programs  were 
developed  as  a  response  to  some  of  the  limitations 
of  the  dichotomous  csQCA.  Among  the  many 
ongoing  refinements,  one  can  mention  the  follow¬ 
ing:  the  improvement  of  computer  software,  the 
inclusion  of  the  time  dimension,  the  distinction 
between  distant  and  proximate  conditions  (two- 
step  models;  see  Carsten  Schneider  &  Claudius 
Wagemann,  2006),  the  Most  Similar  Different 
Outcome/Most  Different  Same  Outcome  (MSDO/ 
MDSO)  procedure  in  the  process  of  model  build¬ 
ing  (Berg-Schlosser  &  De  Meur,  2009),  more 
elaborate  strategies  to  tackle  logical  remainders 
and  contradictory  simplifying  assumptions,  better 
visualization  of  the  cases  and  the  minimal  formu¬ 
lae,  and  so  on. 

In  another  line  of  improvements,  CCMs  have 
been  increasingly  combined  or  sequenced  with 
qualitative  and/or  quantitative  methods.  On  the 
smaller-N  side  of  the  spectrum,  the  connection 
with  qualitative,  case-oriented  work  is  self-evident, 
as  in-depth  case  knowledge  goes  along  with  the 
QCA  protocol — the  graphic  display  of  synthetic 
case  descriptions  is  one  way  to  bridge  more  explic¬ 
itly  the  thick  case  evidence  and  the  formal  QCA 
procedure  (Rihoux  &  Lobe,  2009).  In  intermedi¬ 
ate-  and  larger-N  designs,  several  works  have 
fruitfully  confronted  or  mutually  enriched  some 
QCA  techniques  with  statistical  techniques,  such 
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as  discriminant  analysis,  factor  analysis,  and  vari¬ 
ous  types  of  regression-based  analyses. 

Many  more  developments  can  be  expected 
within  the  next  few  years  as  the  community  of 
users  of  CCMs  is  enlarging  and  also  becoming 
more  diverse  in  disciplinary  terms — in  fact,  some 
of  the  cutting-edge  work  is  now  also  being  done  in 
fields  such  as  management,  organizational  studies, 
and  communication  studies.  The  speed  of  further 
improvements  of  CCMs  will  largely  depend  on  the 
ability  of  this  growing  community  to  share  experi¬ 
ences  as  well  as  innovations  and  to  make  this  a 
cumulative  enterprise. 

Benoit  Rihoux 
Universite  Catholique  de  Louvain 
Louvain-la-Neuve,  Belgium 
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Conflict  resolution  is  a  term  that  we  all  under¬ 
stand  until  we  try  to  define  it.  Synonyms  abound: 
such  as  conflict  settlement,  conflict  termination, 
conflict  management,  and  conflict  transforma¬ 
tion.  Moreover,  conflict  resolution  is  both  a  situ¬ 
ation  and  a  process.  A  conflict  is  resolved  when  all 
the  parties  to  a  dispute  agree  that  it  is  over  for 
good,  in  full  knowledge  of  the  situation,  and  with¬ 
out  any  form  of  coercion,  whether  personal, 
manifest  or  structural,  since  they  acknowledge 
that  their  respective  interests  and  values  have  been 
satisfied.  The  concerned  parties  are  those  who 
cannot  be  seen  off  or  defeated  and  without  whom 
there  can  be  no  resolution  in  the  long  run. 
Resolution  requires  a  new  relationship  to  be  self- 
sustaining  without  any  form  of  coercion. 

All  conflicts  end.  Even  bitter  and  decade-long 
disputes  involving  several  wars  have  been  resolved. 
On  the  way  to  resolution,  there  may  be  truces  or 
temporary  settlements  in  which  coercion  is  exerted, 
but  the  roots  of  the  conflict  remain  embedded  so 
that  any  weakening  or  withdrawal  of  coercive 
mechanisms  may  risk  a  new  flare-up.  Thus,  for 
example,  in  the  Franco-German  context,  the  Treaty 
of  Versailles  (1919)  was  a  conflict  settlement, 
whereas  the  Charter  of  Paris  (1990)  was  a  conflict 
resolution. 

Conflict  resolution  is  at  the  far  end  of  a  spec¬ 
trum  of  outcomes  that  begins  with  a  diktat  imposed 
by  a  victor.  But  losers  usually  have  leeway  since 
victors  are  often  dependent  on  local  cooperation. 
Very  rarely  is  the  complete  destruction  of  the 
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enemy,  or  genocide,  envisaged.  A  second  outcome 
is  a  negotiated  one  involving  compromise  and 
lengthy  negotiations  often  accompanied  by  consid¬ 
erable  violence.  Such  a  settlement  is  contingent  on 
the  balance  of  forces,  and  should  these  change,  the 
smoldering  conflict  may  again  flare  up.  A  stasis 
outcome  is  one  in  which  actual  physical  violence 
has  ended  and  its  recurrence  is  unlikely,  but  the 
protracted  conflict  continues  by  other  means  since 
major  issues  remain  unresolved.  Finally,  a  full 
resolution  is  one  in  which  all  the  parties  have  their 
values  and  interests  satisfied.  Conflict  resolution  is 
the  most  ambitious  outcome  sought  since  resolu¬ 
tion  based  on  a  noncoercive  framework  is  hard, 
but  not  impossible,  to  achieve. 

Conflicts  occur  at  all  levels  and  in  all  social  rela¬ 
tionships.  Each  has  its  special  characteristics,  but 
there  are  elements  in  common.  General  statements 
can  be  made  about  conflict  resolution,  which 
apply  to  international,  intercommunal,  industrial, 
marital,  or  other  conflict.  Many  principles  and, 
indeed  techniques,  may  be  similar,  albeit  with  due 
account  being  paid  to  the  idiosyncrasies  of  level 
and  type. 

In  political  conflict,  broadly  defined,  we  can 
identify  three  traditions  with  differing  approaches, 
conceptions  of  its  origin  and  modalities,  as  well 
as  aspirations  for  ending  it.  The  first  is  the  realist 
or  power  politics  tradition  that  lays  emphasis  on 
the  nature  of  international  relations  in  an  anar¬ 
chical  society.  Societal  elements  exist,  but  they 
are  weak  and  relations  are  close  to  the  Hobbesian 
characterization  of  life  as  nasty,  poor,  brutish, 
solitary,  and  short.  This  is  particularly  true  of  the 
international  system  where  the  only  reliable  help 
is  self-help  due  to  the  absence  of  sufficiently 
strongly  held  shared  values  to  form  the  basis  for 
a  collective  security.  The  ensuing  security  dilemma 
confronts  actors  with  a  tragic  choice  between 
guns  or  butter.  Judgments  are  required  about  the 
capabilities  and  intentions  of  potential  adversar¬ 
ies.  Moreover,  if  this  security  dilemma  is  also 
linked  to  the  notion  that  all  individuals  and 
groups  have  a  drive  to  dominate,  then  there  can 
be  no  peace  in  a  real  sense.  There  can  be  a  truce, 
a  cold  war,  order,  and  stability,  but  all  this  is 
contingent  on  a  balance  of  forces  between  those 
who  dominate  and  those  who  are  dominated.  It  is 
an  eternal  struggle  between  the  status  quo  and  the 
revisionist  powers.  Order  and  stability  are  the 


products  of  a  particular  balance  of  forces,  and 
any  major  change  will  upset  that  balance  and  a 
new  equilibrium  will  have  to  be  established  to 
reflect  those  changes,  perhaps  on  the  battlefield. 
This  is  a  classic  case  of  conflict  settlement. 

The  second  traditional  approach  points  to  the 
importance  of  structures,  with  conflict  being  the 
result  of  incompatible  interests  built  into  structures. 
Structural  violence  suggests  that  peace  is  more  than 
the  absence  of  overt  violence.  Structural  violence 
occurs  where  an  actor  may  be  a  “happy  slave,” 
unable  even  to  conceive  of  behaving  as  others  can  or 
prevented  from  so  doing  by  prejudice,  ignorance,  or 
lack  of  access  to  facilities  available  to  others.  Such 
impediments  are  often  not  of  a  legal  nature  or 
enforced  by  overt  pressure,  but  they  are  found  in  the 
mind,  history,  education,  and  culture  of  individuals 
and  the  society  within  which  they  live. 

Not  all  structures  are  instruments  of  structural 
violence.  Self-sufficiency  is  not  necessary  where  we 
can  benefit  from  comparative  advantage,  as  long 
as  the  “terms  of  trade”  are  seen  as  fair  by  all  par¬ 
ties  in  full  knowledge  of  all  aspects  of  the  relation¬ 
ship,  including  the  structural  ones.  Then  a  role 
differentiation  can  be  legitimized  and  with  it  the 
structure  within  which  the  roles  are  embedded. 
Different  roles  do  not  necessarily  imply  an  unfair 
relationship  dependent  on  coercion.  All  may  ben¬ 
efit  equitably  if  the  differentiation  is  acceptable  to 
all  with  their  eyes  open. 

Dyed-in-the-wool  structuralists  in  a  Marxist 
tradition  would  regard  the  concepts  of  legitimacy 
and  evolutionary  change  as  simply  a  means  of 
throwing  sand  in  the  eyes  of  the  masses.  They 
insist  that  structures  with  embedded  incompatible 
interests  cannot  be  reformed  and  must  be  destroyed. 
Peace,  or  conflict  resolution,  is  thus  dependent  on 
the  destruction  of  existing  exploitative  structures 
and  the  creation  of  a  revolutionary  peace,  if  neces¬ 
sary  by  violence.  Increasing  the  degree  of  confron¬ 
tation  in  society  is  thus  a  step  toward  revolution 
and,  thereafter,  conflict  resolution. 

At  all  levels,  conflict  resolution  is  very  different 
in  the  pluralist  tradition  since  it  gives  salience  to 
the  notion  of  harmony  of  interests.  Collective  secu¬ 
rity  will  work  if  all  the  actors  in  a  system  come 
together  to  agree  on  the  rules  of  behavior  and  on 
how  to  alter  the  rules  to  accommodate  change. 
They  further  accept  that  if  anybody  strays  from  the 
fold  they  shall  be  brought  back  into  it,  forcibly  if 
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necessary.  Underpinning  this  is  the  idea  that  all 
human  beings  have  a  high  degree  of  shared  values 
and  that  they  can  pursue  these  in  a  rational  man¬ 
ner,  in  the  right  circumstances. 

Rationality  in  this  context  is  primarily  con¬ 
cerned  with  the  opportunity  cost,  that  is,  what 
other  desired  goals  have  to  be  sacrificed  in  order  to 
achieve  a  particular  goal.  Parties  in  conflict  fre¬ 
quently  have  a  tunnel  vision  in  which  they  become 
obsessed  by  one  particular  goal,  paying  a  very  high 
price  in  other  desired  goals  to  attain  it.  To  see  this 
in  context,  actors  are  encouraged  to  spell  out  the 
full  range  of  their  perceived  goals  and  then  attempt 
to  minimize  the  opportunity  cost  between  those 
goals.  The  next  stage  is  to  repeat  the  exercise,  but 
this  time  in  the  context  of  others,  so  that  in  the 
long  run  all  parties  will  be  in  a  position  where  they 
are  maximizing  the  totality  of  their  values  and 
minimizing  the  overall  opportunity  cost.  Such  an 
exercise  in  rational  opportunity  costing  frequently 
leads  to  a  consideration  of  basic  human  needs  that 
must  be  satisfied  if  conflict  is  to  be  resolved. 

Basic  human  needs  include  food  and  water  as 
well  as  social  needs  such  as  security,  identity,  par¬ 
ticipation,  self-actualization,  and  esteem.  But  these 
needs  are  not  necessarily  in  short  supply.  One  per¬ 
son’s  or  group’s  identity  is  not  necessarily  at  the 
expense  of  another’s  nor  is  their  security,  their 
development,  or  their  self-actualization.  They  may 
be  seen  as  zero  sum  at  a  particular  moment,  but 
conflict  resolution  as  a  process  seeks  to  move  to  a 
win-win  position.  This  is  difficult  since  there  is  no 
panacea  and  it  often  requires  third-party  facilita¬ 
tion.  Nevertheless  conflicts  can  be  resolved  because 
the  things  that  we  all  seem  to  care  about  are  not 
inherently  in  short  supply. 

Article  33  of  the  United  Nations  Charter  says 
that  the  member  states  “shall,  first  of  all,  seek  a 
solution  by  negotiation,  enquiry,  mediation,  con¬ 
ciliation,  arbitration,  judicial  settlement,  resort  to 
regional  agencies  or  arrangements,  or  other  peace¬ 
ful  means  of  their  own  choice.”  Conflict  settle¬ 
ments  in  the  realist  tradition  rely  on  an  uncertain 
balance  of  power.  A  revolutionary  peace  requires 
the  destruction  of  exploitative  structures  and  the 
eradication  of  structural  violence.  The  pluralist 
approach  to  the  process  of  conflict  resolution  is  of 
a  different  ilk.  In  deep-rooted  conflicts  where  there 
is  little  or  seemingly  no  value  consensus,  negotia¬ 
tion  between  the  parties  very  quickly  descends  into 


a  rut  of  accusation  and  counteraccusation  with 
little  understanding  or  effective  communication. 
Thus,  recourse  to  other  measures  under  Article  33 
involving  a  third  party  is  frequent.  If  the  goal  is 
merely  conflict  settlement  then  a  biased  mediation 
from  a  major  actor  to  impose  a  settlement  may 
suffice.  However,  if  the  goal  is  conflict  resolution, 
it  is  more  likely  that  the  process  is  one  of  facilita¬ 
tion,  which  has  a  number  of  characteristics. 

Facilitators  in  the  resolution  process  are  non- 
judgmental  since  they  do  not  pronounce  guilt  or 
innocence  nor  are  they  directive.  They  do  not  con¬ 
tend  that  they  know  best.  They  are,  however, 
extremely  supportive  of  all  the  parties  since  the 
values  and  interests  of  all  must  be  met.  The  process 
is  highly  participatory  with  all  the  veto  holders 
present,  no  matter  what  their  past.  Finally,  there  is 
a  disempowerment  of  the  parties  by  emphasizing 
that  they  share  a  joint  problem  that  they  can  only 
resolve  together.  This  approach  typifies  a  more 
general  movement  at  all  conflict  levels  from  judg¬ 
mental  decision  making  to  that  of  a  supportive 
framework.  This  is  particularly  apposite  in  “new 
international  conflicts,”  where  civil  wars,  transna¬ 
tional  conflicts,  and  fragile  and  collapsed  states  are 
seemingly  endemic  in  struggles  of  greed  and  griev¬ 
ance.  Wars  and  conflicts  that  go  beyond  the  inter¬ 
national  system  require  approaches,  methods,  and 
institutions  that  do  likewise.  Second-track  diplo¬ 
macy,  especially  facilitation,  becomes  thereby  a 
major  tool  for  resolution,  involving  a  variety  of 
actors  both  governmental  and  nongovernmental. 

The  process  of  conflict  resolution  is  frequently 
the  last  resort  when  the  parties  know  that  they  can 
neither  win  nor  are  likely  to  lose.  They  cannot 
escape  from  the  conflict,  but  they  know  it  is  cost¬ 
ing  a  great  deal  in  various  ways.  The  process  of 
conflict  resolution  therefore  accumulates  many  of 
the  hard  cases.  It  is  brought  in  when  all  else  has 
failed  and  it  is  trying  to  achieve  the  most  difficult 
goal.  However,  conflict  resolution  is  relevant  ear¬ 
lier  in  the  conflict  cycle,  as  a  preventive  measure. 
Much  thought  has  been  given  to  this  due  to  the 
upsurge  of  international  interventionism  and  the 
evident  need  for  emancipatory  resolution  based  on 
social  justice  and  solidarity  in  a  local  context. 

A.  J.  R.  Groom 
University  of  Kent 
Canterbury,  United  Kingdom 
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Conflicts 


In  everyday  English-language  usage,  conflicts  refer 
to  (a)  serious  disagreements,  (b)  prolonged  strug¬ 
gles  (e.g.,  armed  clashes  resulting  in  loss  of  life,  or 
labor  disputes),  or  (c)  the  clash  of  incompatible 
claims  or  principles,  as  in  conflicts  of  interest. 

In  political  science,  conflicts  can  be  analyzed  as 
the  outcomes  of  a  certain  kind  of  relationship, 
comparing  relationships  of  cooperation  and  com¬ 
petition  with  those  of  conflict.  When  there  is  coop¬ 
eration,  humans  work  together  in  pursuit  of  com¬ 
mon  objectives.  When  there  is  competition,  they 
struggle  with  one  another  in  contests  governed  by 
rules  that  determine  who  succeeds  and  that  iden¬ 
tify  the  sanctions  to  be  imposed  for  noncompli- 
ance.  When  there  is  conflict,  either  no  rules  are 
recognized  or  any  relevant  rules  are  not  properly 
enforced.  While  rules  may  be  embodied  in  laws  or 
treaties,  there  may  be  dispute  about  their  applica¬ 
tion.  Moral  principles  may  sometimes  be  regarded 
as  rules  of  conduct.  It  should  be  noted  that  even  in 
armed  conflicts  between  states,  offenders  can  be 
punished  for  war  crimes,  while  the  Geneva 
Conventions  specify  the  ways  in  which  civilians 
are  to  be  protected  in  international  conflicts. 

Though  the  absence  of  conflict  is  not  considered 
newsworthy,  it  can  be  just  as  important  to  political 


science  as  the  analysis  of  violence.  Distinguishing 
the  dimensions  of  conflict  makes  it  possible  to 
uncover  underlying  principles,  such  as  has  been 
done  in  game  theory.  Less  abstract  lines  of  analy¬ 
sis,  such  as  field  investigations  of  the  strategies 
pursued  by  the  parties  to  conflicts,  and  the  options 
open  to  them,  can  be  equally  illuminating. 

Everyday  Language 

In  the  newspapers,  and  in  popular  discussion,  con¬ 
flicts  are  divided  into  kinds  or  classes  of  conflict, 
such  as  economic,  ethnic,  national,  racial,  and 
religious  conflicts.  Yet  conflicts  do  not  fall  into  any 
natural  classification.  If  they  are  classified,  it  can 
only  be  for  a  specific,  and  limited,  purpose. 

National  conflicts  have  been  of  particular  interest 
to  political  scientists;  they  are  commonly  associated 
with  the  formation  of  states  and  the  delimitation  of 
their  territory.  After  major  wars,  state  boundaries 
have  been  drawn  or  redrawn  at  international  confer¬ 
ences  convened  by  what  used  to  be  called  “the  Great 
Powers.  ”  Thus,  in  the  history  of  Europe,  two  long- 
running  wars  were  brought  to  an  end  in  1648  by  the 
Treaty  of  Westphalia;  this  marked  the  birth  of  the 
concept  of  the  nation-state.  After  the  Napoleonic 
Wars,  the  Congress  of  Vienna  in  1815  redrew  the 
political  map.  After  World  War  I,  the  Paris 
Conference  of  1919  revised  that  map  and  gave  birth 
to  the  League  of  Nations.  After  World  War  II,  the 
three-power  conference  at  Yalta  delimited  “spheres 
of  influence”;  the  United  Nations  (UN)  was  estab¬ 
lished,  and  its  Security  Council  provided  a  forum  for 
the  regulation  of  interstate  conflicts.  The  UN  has 
made  states  responsible  for  the  regulation  of  con¬ 
flicts  arising  within  areas  subject  to  their  jurisdiction. 

The  then-UN  secretary-general,  Boutros 
Boutros-Ghali,  in  his  1992  report  An  Agenda  for 
Peace,  stated, 

Since  the  creation  of  the  United  Nations  in  1945, 
over  100  major  conflicts  around  the  world  have 
left  some  20  million  dead.  The  United  Nations 
was  rendered  powerless  to  deal  with  many  of 
these  crises  because  of  the  vetoes — 279  of  them — 
cast  in  the  Security  Council,  which  were  a  vivid 
expression  of  the  divisions  of  that  period.  With 
the  end  of  the  cold  war  there  have  been  no  such 
vetoes  since  31  May  1990,  and  demands  on  the 
United  Nations  have  surged. 
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International  conferences,  such  as  the  Paris  Con¬ 
ference  of  1919,  have  often  changed  state  boundar¬ 
ies  or  created  new  ones  without  the  agreement  of  the 
populations  concerned.  Campaigning  for  the  resto¬ 
ration  to  a  country  of  territory  occupied  by  coeth¬ 
nics  is  known  as  irredentism.  One  of  the  challenges 
to  political  science  is  to  explain  why,  after  the  end¬ 
ing  of  the  Cold  War  in  1989,  Armenia,  Croatia,  and 
Serbia  engaged  in  expensive  irredentist  campaigns 
while  Hungary,  Romania,  and  Russia  did  not. 

The  new  international  order  has  been  unable  to 
resolve  some  interstate  conflicts.  For  example,  the 
United  Kingdom  (UK)  decided  that  it  would  with¬ 
draw  from  India  in  1947  and  that  the  territory 
should  be  divided  to  create  the  new  state  of 
Pakistan.  It  made  this  decision  in  advance  of  any 
agreement  about  the  course  of  the  boundary 
between  the  two  states  and  about  whether  Kashmir 
(i.e.,  what  was  then  the  princely  state  of  Jammu 
and  Kashmir)  would  form  part  of  India  or  Pakistan. 
With  partition,  Muslims  and  Hindus  moved  across 
the  new  border,  rioting  occurred  elsewhere,  and 
some  2  million  men,  women,  and  children  lost 
their  lives.  The  conflict  over  Kashmir  remains 
unresolved  and  is  a  threat  to  peace  in  the  region. 

The  following  year  (1948)  saw  the  creation  of 
the  state  of  Israel  and  the  intensification  of  the 
conflict  between  Israelis  and  Palestinians;  this  con¬ 
flict  has  also  remained  unresolved  and  is  a  threat 
to  peace  in  the  Middle  East. 

The  Paris  Conference  led  to  the  recognition  of 
the  Kingdom  of  Serbs,  Croats  and  Slovenes,  as  an 
extension  of  what  had  earlier  been  the  Kingdom  of 
Serbia.  In  1945,  it  was  reconstituted  as  the  Federal 
Republic  of  Yugoslavia.  The  Federation  broke  up 
after  Croatia  and  Slovenia  seceded  in  1991. 
Disputes  have  persisted  over  recognition  of  the 
state  of  Macedonia  (and  its  name)  and  over  the 
status  of  Kosovo,  traditionally  part  of  Serbia  but 
with  a  largely  Albanian  population. 

Elsewhere  there  have  been  serious  intrastate 
conflicts,  sometimes  initiated  by  elements  within 
the  government  itself,  such  as  the  attempted  geno¬ 
cide  in  Rwanda  in  which,  in  1994,  many  ethnic 
Hutus  attempted  to  murder  all  ethnic  Tutsis. 

The  Dimensions  of  Conflict 

It  is  difficult  to  classify  conflicts  because  each 
conflict  is  unique.  A  conflict  is,  by  its  very 


nature,  political,  but  it  may  also  have  other 
dimensions. 

Consider  the  case  of  Northern  Ireland,  an  entity 
that  was  created  in  1922  when  what  is  now  the 
Republic  of  Ireland  was  allowed  to  leave  the  UK 
and  become  independent.  A  narrow  majority  of 
the  population  of  Northern  Ireland  identified  as 
Protestant  and  wished  to  remain  part  of  the  UK.  A 
minority,  identified  as  Catholic  and  Republican, 
believed  that  no  boundary  should  have  been 
drawn,  because  the  island  was  one  unit.  This  was 
not  a  conflict  between  two  homogeneous  blocs,  for 
many  Republicans,  despite  believing  that  they  had 
legitimate  grievances,  would  have  been  satisfied 
without  a  united  Ireland,  and  many  in  the  majority 
disassociated  themselves  from  the  extreme  expres¬ 
sions  of  Unionism.  On  both  the  Unionist  and 
Republican  sides,  there  was  more  than  one  politi¬ 
cal  party  competing  for  votes,  quite  apart  from  the 
party  that  was  trying  to  bridge  the  divide. 

Over  the  years  there  were  issues  that  mobilized 
varying  degrees  of  support  either  for  protest  or  for 
the  repression  of  what  was  perceived  as  a  terrorist 
movement.  Some  sort  of  conflict  continued,  but  it 
varied  in  intensity.  It  had  a  national  dimension 
because,  although  the  border  had  been  guaranteed 
by  the  two  states,  it  was  still  contested.  It  had  an 
economic  dimension  because  the  Unionists  monop¬ 
olized  the  better-paid  jobs.  It  had  a  religious 
dimension  because  many  identified  themselves  by 
the  faith  they  professed.  Insofar  as  there  were  dis¬ 
tinctive  communities,  it  may  be  said  that  there  was 
a  conflict  with  an  ethnic  dimension.  Different  indi¬ 
viduals  had  different  motives  for  participating  in 
the  conflict  (or  for  evading  it  by  emigration). 
Individual  identification  with  the  opposing  parties, 
and  with  the  actions  which  expressed  the  conflict, 
varied  continually. 

If  a  broad  definition  is  employed,  it  could  be 
said  that  Northern  Ireland  provides  an  example  of 
an  ethnic  conflict,  though  ethnicity  is  scarcely  its 
distinguishing  characteristic,  and  a  classification  as 
“ethnic”  does  not  contribute  anything  of  value  to 
the  analysis  of  events  there. 

Pakistanis  are  affronted  that  Kashmir,  with  a 
majority  Muslim  population,  should  remain  a  part 
of  India,  therefore  the  conflict  there  also  has  both  a 
national  and  a  religious  dimension.  Since  the  terri¬ 
tory  is  valuable,  it  has  an  economic  one  as  well.  The 
conflict  between  Israelis  and  Palestinians  likewise 
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has  a  national  dimension,  reflecting  a  struggle 
between,  on  the  one  hand,  a  people  with  a  Zionist 
state,  and,  on  the  other,  a  people  who  want  either 
a  new  state  of  their  own  or  equality  in  a  reconsti¬ 
tuted  non-Zionist  state.  It  has  a  religious  dimen¬ 
sion  and,  because  of  the  great  inequality  of  the 
conditions  of  the  two  peoples,  an  economic  dimen¬ 
sion  as  well.  What  is  now  the  People’s  Republic  of 
China  contends  with  two  conflicts  that  are  either — 
depending  on  interpretations  of  the  historical 
record — within  or  on  its  borders.  Tibet  and 
Xinjiang  have  at  times  been  subjected  to  Chinese 
overrule;  in  both,  the  indigenous  people  practice 
their  own  religions  and  demand  a  greater  auton¬ 
omy  than  that  accorded  by  Chinese  policies.  Since 
the  parties  to  each  of  these  four  conflicts  are  dis¬ 
tinctive  peoples,  they  are  often  represented  as  hav¬ 
ing  an  ethnic  dimension.  This  suggests  that  all 
persisting  conflicts,  like  all  persisting  social  groups 
of  the  kind  called  communities,  are  almost  certain 
to  be  multidimensional. 

Theories  of  Conflict 

In  the  1960s  and  1970s,  some  sociologists  criti¬ 
cized  prevailing  assumptions  that  societies  were 
based  on  consensus  and  that  conflicts  were  there¬ 
fore  a  deviant  condition.  Such  inferences  were 
criticized  by  Ralf  Dahrendorf  and  others  who 
maintained  that,  in  industrial  societies,  there  were 
necessarily  conflicts  of  interest,  such  as  those 
between  employees  and  their  employers.  Indeed, 
according  to  the  Marxist  version  of  what  was 
sometimes  called  “conflict  theory,”  the  institu¬ 
tional  structure  of  any  capitalist  society  had  to  be 
explained  as  an  imposed  scheme  for  the  regulation 
of  class  conflict.  This  was  a  useful  correction  of  the 
approach  known  as  structural  functionalism,  but 
to  call  it  “conflict  theory”  was  to  use  the  word 
theory  to  describe  a  philosophy  of  society  rather 
than  to  build  a  set  of  testable  propositions. 

A  different  example  of  theory  in  political  science 
is  the  application  of  game  theory  to  the  study  of 
conflict.  Some  conflicts  resemble  a  zero-sum  game: 
If  one  side  makes  a  gain,  the  other  side  loses  by  an 
equal  amount.  Gains  and  losses  sum  to  zero.  Other 
conflicts  have  a  positive-sum  outcome.  For  example, 
conflicts  serve  the  common  good  if  they  bring 
underlying  disputes  to  the  surface  and  lead  to  their 
resolution.  In  industrial  relations,  a  threatened  strike 


over  pay  can  force  an  employer  to  make  changes 
(e.g.,  investing  in  new  machinery)  that  increase  pro¬ 
duction  and  improve  the  income  of  both  employers 
and  employees.  The  sum  of  gains  and  losses  on  both 
sides  is  then  more  than  zero.  Yet  other  conflicts 
either  leave  both  sides  worse  off,  or  any  gain  is  more 
than  counterbalanced  by  losses  elsewhere;  these 
then  resemble  a  negative-sum  game. 

During  the  Cold  War,  game  theory  was  used  for 
calculating  strategies  for  military  opposition 
between  East  and  West.  The  planners  reasoned,  “If 
we  do  this,  they  may  retaliate  with  Policy  A  or 
Policy  B.  What  is  then  the  best  course  for  us  to  fol¬ 
low  in  either  event?”  This  was  an  application  in 
which  the  two  superpowers  were  seen  as  playing  a 
game.  The  same  reasoning  can  picture  individuals 
as  players — for  example,  “Will  my  gain  exceed  my 
losses  to  a  greater  degree  if  I  support  one  side  or 
the  other  in  a  given  conflict?  Or  will  it  pay  me  to 
support  neither?”  This  reasoning  relies  on  rational 
choice  theory  to  calculate  the  costs  incurred  by  any 
failure  to  follow  the  course  that  maximizes  net 
advantages. 

In  recent  times,  some  major  conflicts  have  been 
resolved  by  the  intervention  of  a  third  party. 
During  the  Cold  War,  each  of  the  superpowers 
intervened  to  resolve  major  conflicts  within  its  own 
sphere  of  interest  lest  the  opposing  superpower 
exploit  them  in  its  interest.  (For  example,  the 
North  Atlantic  Treaty  Organization  [NATO]  pow¬ 
ers  could  not  allow  the  conflict  between  Greece 
and  Turkey  over  Cyprus  to  become  an  armed  con¬ 
flict.)  Since  the  ending  of  the  Cold  War,  the  United 
States  (helped  by  Canada)  has  intervened  success¬ 
fully  to  resolve  the  conflict  in  Northern  Ireland 
and,  so  far  unsuccessfully,  the  conflict  over  the 
rights  of  Palestinians.  Superpower  intervention  is 
more  potent  because  only  such  a  power  is  able  to 
promise  rewards  for  all  parties  and  thus  hold  out 
the  prospect  of  a  positive-sum  outcome. 

Those  who  apply  game  theory  liken  the  parties 
to  a  conflict  to  the  players  of  a  game,  but  they  can¬ 
not  explain  why  there  is  a  conflict  in  the  first  place. 
In  the  game  of  chess,  one  party  plays  white  and  one 
plays  black.  Game  theory  cannot  explain  which 
party  gets  to  play  white  rather  than  black.  Nor  has 
this  approach  yet  been  very  successful  in  linking 
the  considerations  of  domestic  policy  to  those  of 
foreign  policy,  though  models  of  two-level  game 
theory  take  a  step  in  this  direction.  In  domestic 
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negotiations,  the  executive  is  pictured  as  absorbing 
the  concerns  of  interest  groups  and  building  coali¬ 
tions  with  them;  at  the  international  level,  the 
executive  then  tries  to  further  these  concerns  with¬ 
out  committing  to  anything  that  will  have  deleteri¬ 
ous  effects  at  home.  When  the  objectives  of  actors 
at  both  levels  are  in  harmony,  an  international 
agreement  can  be  negotiated. 

Strategy 

That  conceptions  of  interest  may  be  overlaid  by 
other  perceptions  can  be  illustrated  from  studies  of 
conflicts  arising  within  the  Hindu  caste  system  of 
India.  That  system  institutionalized  inequality  in 
ways  that  can  be  seen  as  attempting,  with  only 
limited  success,  to  prevent  competition  between 
caste  groups  and  to  make  protest  illegitimate. 

Some  conflicts  can  be  observed  at  the  level  of 
the  village.  A  study  in  the  state  of  Orissa  by 
Frederick  George  Bailey  in  the  early  1950s  described 
a  village  in  which  the  population  was  divided  into 
caste  groups  (or  jatis ).  Almost  everyone  was  a 
Brahmin,  a  warrior,  a  distiller,  a  herdsman,  a  pot¬ 
ter,  a  washerman,  or  a  weaver,  and  there  was  no 
exit  option.  The  weavers  were  “untouchables” 
(now  often  referred  to  as  Dalits).  They  knew  that 
the  passing  of  the  Temple  Entry  Act  in  1947  had 
made  it  an  offense  to  bar  Hindus  from  temples  on 
the  grounds  of  untouchability,  and  they  decided  to 
assert  their  new  right.  They  notified  the  local  police 
headquarters  that  on  the  occasion  of  a  particular 
festival,  they  would,  as  usual,  take  their  offerings 
to  a  temple,  but  this  time  they  would,  like  the 
higher  castes,  take  them  into  the  forechamber.  The 
upper  castes  mounted  a  guard  to  prevent  their 
doing  so.  The  police  arrived.  They  apparently 
advised  the  weavers  to  seek  a  remedy  through  the 
courts  and  probably  said  that  if  there  were  any 
more  reports  of  trouble,  two  constables  would  be 
stationed  in  the  village.  The  upper  castes  punished 
the  weavers  by  ceasing  to  employ  them  as  musi¬ 
cians  on  festive  occasions,  but  the  alleged  failure  to 
observe  the  1947  Act  did  not  go  to  court,  and  the 
previous  equilibrium  was  restored. 

The  strategy  of  the  weavers  showed  calculation 
in  their  choice  of  a  particular  temple  and  of  a  non¬ 
violent  form  of  protest,  and  in  declining  the 
opportunity  to  pursue  legal  action.  They  invited 
oversight  of  their  dispute,  but  they  did  not  press 


for  intervention.  Everyone  knew  that  if  a  police 
unit  were  to  be  billeted  on  the  village  it  would  be 
a  collective  punishment,  a  negative-sum  outcome. 
By  demonstrating  a  capacity  to  press  a  shared 
interest,  they  had  presumably  gained  an  enhanced 
self-respect  at  the  price  of  a  small  material  loss. 
They  had  warned  the  higher  caste  groups  not  to 
take  traditional  distinctions  for  granted. 

Interpersonal  relations  within  the  village  were 
multidimensional.  Throughout,  the  caste  groups 
cooperated  but  on  terms  that  were  not  universally 
accepted.  Dissent  over  these  terms  led  to  the  chal¬ 
lenge  in  which  all  concerned  counted  the  cost  of 
their  actions. 

Tolerance  of  what  some  regard  as  unjustified 
inequalities  can  be  changed  by  the  eloquence  of 
mobilizers  or  by  the  messages  of  the  mass  media. 
Commentators  agree  that  the  highly  selective  use 
of  television  and  radio  to  disseminate  misleading 
accounts  of  the  conduct  of  members  of  the  national 
groups  in  the  former  Yugoslavia  did  much  to 
stimulate  violent  conflict  and  led  to  the  breakup  of 
the  federation.  In  Rwanda,  radio  messages  inciting 
Hutus  to  slaughter  Tutsis  helped  mobilize  the 
mobs  responsible  for  the  genocide  there.  The  cal¬ 
culation  of  strategy  at  the  local  level  may  be 
skewed  by  events  beyond  local  control. 

Questions  and  Answers 

Political  science  research  on  the  nature  of  conflicts 
needs  to  delimit  the  area  of  inquiry  by  examining 
a  particular  kind  of  conflict,  or  a  particular  aspect 
of  it.  Only  by  delimiting  the  question  posed  can  an 
inquiry  culminate  in  the  sort  of  answer  that  char¬ 
acterizes  social  science. 

Many  studies  have  concentrated  on  violence, 
especially  deadly  violence.  As  forms  of  collective 
violence,  riots  have  been  distinguished  from  feuds, 
violent  protests,  terrorism,  civil  war,  and  genocide. 
Some  of  the  questions  posed  have  had  a  top-down 
character:  Why  are  there  more  violent  conflicts  in 
some  regions  than  in  others?  Why  does  their  inci¬ 
dence  vary  from  one  historical  period  or  time  to 
another?  Why  do  some  localities  experience 
repeated  violence?  Most  of  the  violent  conflict  in 
India  is  not  between  caste  groups  but  centers  on 
the  hostility  between  Hindus  and  Muslims.  Why 
should  eight  cities,  containing  just  18%  of  India’s 
population,  have  accounted  for  nearly  half  of  the 


398  Conflicts 


total  deaths  from  Hindu-Muslim  urban  violence 
between  1950  and  1995? 

Ashutosh  Varshney  compared  the  conflict- 
prone  city  of  Aligarh  with  the  relatively  conflict- 
free  city  of  Calicut.  In  both  cities  there  were  con¬ 
flicts  between  Hindus  and  Muslims,  and  in  both 
cities  committees  to  prevent  violence  had  been 
established.  They  were  effective  in  Calicut  for  sev¬ 
eral  reasons,  among  them  the  counterbalancing 
effect  of  caste  divisions  among  the  Hindus.  In 
Calicut,  the  interest  of  the  political  elite  lay  in  the 
prevention  of  violence.  In  Aligarh,  by  contrast,  dif¬ 
ferent  sections  of  the  elite  could  gain  from  it.  The 
study  counted  Hindu-Muslim  violence  as  ethnic 
conflict,  which  is  questionable;  it  took  violence  as 
a  criterion  for  determining  when  an  opposition 
became  a  conflict,  which  may  not  suffice  as  a  cri¬ 
terion  of  conflict  in  other  circumstances. 

Is  there  an  association  between  the  nature  of  the 
state  and  the  nature  of  the  violence?  For  example, 
it  has  been  argued  that  whereas  a  democratic 
country  with  a  powerful  state,  such  as  Japan,  expe¬ 
riences  low  levels  of  violence,  a  less  democratic 
regime  with  a  powerful  military,  such  as  Turkey, 
experiences  more  and  that,  because  of  state  power, 
it  has  to  take  the  form  of  terrorism  rather  than  of 
rioting.  Such  an  explanation  must  be  qualified 
because,  in  Turkey,  the  actions  described  as  terror¬ 
ist  have  been  taken  on  behalf  of  a  minority  seeking 
recognition  as  an  ethnic  or  national  unit,  and  there 
is  no  comparable  minority  in  Japan. 

Some  questions  have  a  bottom-up  character. 
Why  is  one  category  of  persons  attacked  rather 
than  another?  After  the  assassination  of  Prime 
Minister  Indira  Gandhi  in  1984  by  a  Punjabi  Sikh, 
mobs  in  Delhi  attacked  Sikhs,  but  those  they 
attacked  were  Sikhs  from  parts  of  India  other  than 
the  Punjab,  and  the  victims  were  not  supporters  of 
Punjabi  separatism.  In  the  Northern  Nigerian 
town  of  Kano,  in  1953,  and  again  in  1966,  Ibo 
settlers  from  the  southern  regions  were  attacked 
but  not  Yoruba  settlers.  The  Northerners  had  just 
as  much  reason  to  suspect  the  political  and  com¬ 
mercial  ambitions  of  the  Yoruba  from  the 
Southwest  as  the  Ibos  from  the  Southeast.  Why 
should  one  group  have  been  victimized  while 
another  remained  unharmed? 

Rioters  often  justify  their  actions  as  responses  to 
provocation,  yet  sometimes  incidents  that  could 
be  represented  as  provocative  are  ignored.  Such 


differences  inspire  important  questions  about  how 
violent  incidents  are  organized  and  the  willingness, 
or  ability,  of  officials  (such  as  police  commanders), 
and  of  political  elites,  to  suppress  them. 

Why,  it  was  asked  earlier,  did  Armenia  go  to 
war  in  1991  with  Azerbaijan  in  order  to  establish 
a  corridor  linking  up  with  an  Armenian  enclave 
living  in  Azerbaijani  territory?  It  proved  an  expen¬ 
sive  venture,  and  the  result  has  not  been  acceptable 
to  other  states.  Why  did  Croatia  try  to  grab  por¬ 
tions  of  Bosnia  inhabited  by  fellow  Croats  and 
Serbia  risk  so  much  on  behalf  of  Serbs  outside 
Serbia?  Can  these  actions  be  attributed  to  nation¬ 
alism?  If  so,  why  did  Hungary  not  try  to  renegoti¬ 
ate  its  boundary  with  Romania  to  recover  some  of 
the  territory  it  lost  in  1920?  Romania  lost 
Bessarabia  in  1939;  later  it  became  the  state  of 
Moldova.  Why  did  Romania  and  Moldova  not 
reunite  after  1989?  Why,  after  the  dissolution  of 
the  USSR,  was  not  more  done  to  bring  the  25  mil¬ 
lion  people  outside  Russia  into  closer  contact  with 
their  motherland?  A  closer  analysis  of  these  cases 
shows,  for  example,  that  Armenian  nationalism 
was  exceptionally  sensitive  because  of  the  geno- 
cidal  persecution  of  Armenians  in  Turkey  in  1916. 
Nationalist  sentiment  in  itself  achieves  little  unless 
there  are  politicians  who  can  attain  power  by  incit¬ 
ing  it,  while  the  voters  do  not  want  to  carry  the 
costs  of  incorporating  a  minority  from  whom  they 
have  grown  distant. 

Apart  from  the  questions  about  the  policies  of 
states,  there  are  many  questions  about  the  behav¬ 
ior,  in  conflict  situations,  of  the  public.  Why  do 
otherwise  peaceful  neighbors  kill  each  other?  If 
they  kill  others,  why  do  they  kill  them  in  one  way 
rather  than  another?  Why  are  the  bodies  of  victims 
of  collective  violence  sometimes  mutilated?  Why 
are  certain  sorts  of  people  more  inclined  to  par¬ 
ticipate  in  collective  violence?  Within  the  same 
group,  some  prefer  peace  and  others  riot.  Why  are 
men  more  prone  to  violence  than  women?  Anyone 
who  favors  a  particular  answer  to  such  a  question 
then  has  to  ask  why  that  answer  obtains  in  certain 
circumstances  and  not  in  others. 

Conclusion 

Germany  and  France  were  at  war  with  one  another 
from  1870  to  1871,  1914  to  1918,  and  1939  to 
1945.  In  the  21st  century,  it  is  inconceivable  that 
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either  country  would  declare  war  on  the  other 
because  their  economies,  and  the  lives  of  their 
peoples,  are  now  so  closely  intertwined.  The 
sources  of  peace  are  to  be  sought  in  the  relations 
between  humans,  their  understanding  of  the  nature 
of  these  relations,  and  their  knowledge  of  other 
people.  By  seeking  these  sources,  political  science 
research  aims  to  advance  knowledge  about  the 
nature  and  regulation  of  conflicts,  and  it  has  there¬ 
fore  to  account  for  the  absence  of  conflict  as  well 
as  for  variations  in  the  intensity  and  scope  of  con¬ 
flicts.  While  the  detailed  historical  examination  of 
particular  conflicts  is  essential,  there  are  general 
issues  that  continue  to  stimulate  research. 

Michael  Banton 
University  of  Bristol 
Bristol,  United  Kingdom 
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Confucianism 


Confucianism  is  the  general  system  of  ethics  that 
the  Chinese  thinker  and  social  philosopher 
Confucius  (ca.  551-479  BCE),  his  interpreter 
Mencius  (ca.  372-289  BCE),  and  their  early  fol¬ 
lowers  advocated  to  build  a  moral  community  of 
datong  shehui,  known  as  the  Great  Harmony 
Society,  in  which  people  could  live  happy  and 
worthy  lives.  To  build  such  a  community,  the 
Confucian  ethical  system  offered  a  unique  model 
of  benevolent  government  through  moral  educa¬ 
tion  and  virtuous  leadership.  This  entry  first  intro¬ 
duces  the  central  values  and  principles  of 
Confucianism  and  then  highlights  its  differences 
from  the  Western  liberal  democratic  model  of 
government. 

Confucian  Principles  of  Good  Government 

What  makes  good  government?  Who  should  rule 
and  how?  What  should  rulers  do  to  promote  good 
government?  On  the  basis  of  a  positive  conception 
of  humans  as  moral  and  social  beings,  not  as  free 
and  autonomous  individuals,  Confucius  and 
Mencius  sought  to  address  these  questions  con¬ 
cerning  both  the  ultimate  ends  and  operational 
means  of  governance  from  the  perspective  of  build¬ 
ing  a  harmonious  moral  community  of  datong 
(grand  harmony),  where  robbers,  thieves,  rebels, 
and  traitors  had  no  place,  and  hence  the  outer 
doors  remained  open  and  were  not  shut.  Specifically, 
they  conceived  of  government  as  an  institutional 
mechanism  to  achieve  such  a  community.  They 
endorsed  economic  prosperity,  physical  security, 
and  popular  trust  in  government  as  the  three  essen¬ 
tial  substantive  components  of  good  government. 

To  create  a  harmonious  and  prosperous  com¬ 
munity,  who  should  rule?  In  the  Confucian  con¬ 
ception  of  good  government,  the  quality  of  govern¬ 
ment  depends  exclusively  on  the  quality  of  people 
in  the  government.  It  has  very  little  to  do  with  the 
way  in  which  institutions  are  organized  or  how 
authority  is  distributed  across  the  institutions 
(such  as,  for  example,  by  separation  of  powers). 
Accordingly,  Confucius  and  Mencius  advocated 
an  open  system  of  meritocracy  by  dividing  people 
into  the  two  categories  of  governors  and  the  gov¬ 
erned  (The  Analects  of  Confucius,  IV  14).  To  these 
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and  other  early  Confucians,  society  becomes  har¬ 
monious  only  when  those  who  occupy  positions  of 
responsibility  are  the  ones  with  the  ability  to  dis¬ 
charge  those  responsibilities. 

What  specific  quality  is  required  of  those  who 
should  govern?  Confucianism  endorses  the  posses¬ 
sion  of  virtue — understanding  of  and  commitment 
to  the  common  good — as  the  only  proper  basis  for 
a  claim  to  governmental  authority.  Accordingly, 
Confucius  and  his  followers  called  for  government 
by  the  virtuous  and  wise,  not  by  ordinary  people. 
Yet  they  did  not  favor  the  hereditary  system  of 
aristocracy.  Instead,  they  advocated  a  merit-based 
system  of  government,  which  would  allow  every 
citizen  to  take  competitive  examinations  that  test 
virtues  and  knowledge.  Thus,  the  most  talented 
would  rise  to  the  highest  offices.  This  system  of 
meritocracy  replaced  the  rule  by  hereditary  aris¬ 
tocracy  in  Confucian  Asia  long  before  the  same 
was  done  in  the  West. 

How  should  rulers  govern  the  state  to  build  a 
harmonious  and  prosperous  community?  Early 
Confucians  advocated  the  principle  of  minben  as 
the  most  important  principle  of  governing.  Minben, 
which  originates  from  the  pre-Confucian  period, 
means  treating  “people”  {min)  as  “roots”  {ben). 
Thus,  they  form  the  foundation  of  the  state.  To 
govern  according  to  this  principle  is,  therefore,  to 
govern  for  the  people  (min),  for  their  economic 
prosperity,  and  for  their  physical  security  just  as 
the  roots  of  trees  (ben)  should  be  tended  {Analects, 
XIII  9).  By  embracing  ordinary  citizens  as  the  roots 
of  government,  this  principle  demands  that  rulers 
seek  the  prosperity  and  welfare  of  ordinary  citizens 
as  the  ultimate  end  of  good  government. 

To  govern  according  to  the  principle  of  minben 
involves  not  only  providing  sufficient  food  and 
security  but  also  disseminating  virtue  throughout 
society  {Analects,  XII  7).  In  this  regard,  Confucius 
said  that  people  should  be  educated  as  soon  as 
their  livelihood  is  secured  {Analects,  XIII  9). 
Confucianism  gives  priority  to  the  task  of  securing 
livelihood  because  people  can  only  be  expected  to 
behave  morally  after  they  are  relieved  of  poverty. 
As  Benjamin  Schwartz  (1985)  points  out,  Mencius 
(3a.3)  emphasized  an  economic  livelihood  as  an 
indispensable  precondition  for  moral  education 
and  advocated  the  hierarchy  of  human  needs  as  a 
conceptual  tool  for  governance  more  than  2  mil¬ 
lennia  before  Abraham  Maslow  did. 


To  early  Confucians,  moral  education  was 
essential  to  the  building  of  a  harmonious  commu¬ 
nity  because  law  and  punishment  were  viewed  as 
ineffective  means  for  maintaining  order.  The  higher 
goal  of  social  harmony  could  be  attained  only 
when  people  became  virtuous  and  fulfilled  their 
duties  to  others  voluntarily.  To  build  a  nation  of 
virtues,  therefore,  Confucianism  emphasized  the 
importance  of  providing  education  to  all  segments 
of  the  population.  Confucius  himself  said  there  is 
no  class  distinction  in  education  {Analects,  VII  7). 
He  was  the  first  scholar  to  advocate  and  popular¬ 
ize  the  notion  of  mandatory  universal  education. 

The  minben  principle  also  holds  that  good  gov¬ 
ernment  depends  on  the  mutually  beneficial  rela¬ 
tionship  between  rulers  and  the  ruled  {Analects, 
XII  9).  For  this  reason,  Confucius  advocated  not 
only  good  intentions  but  also  benevolent  behavior 
on  the  part  of  rulers  and  the  avoidance  of  cruelty, 
oppression,  injury,  and  meanness  {Analects,  XX 
2).  Confucius  urged  rulers  to  treat  people  as  gener¬ 
ously  and  honestly  as  their  own  guests.  They  were 
not  to  impose  on  others  what  they  themselves  did 
not  desire  {Analects,  XII  2).  Only  when  rulers 
practice  this  golden  rule  of  ren  (benevolence)  can 
the  people  provide  a  sense  of  legitimacy  to  their 
government.  In  other  words,  rulers  can  govern 
people  only  when  they  have  acquired  their  trust. 
Otherwise,  people  will  begin  to  think  they  are 
oppressed  {Analects,  XIX  10). 

According  to  early  Confucians,  heaven  consents 
to  the  legitimacy  of  a  government  when  its  people 
consent  to  it.  According  to  Mencius  (5a. 5),  heaven 
sees  with  the  eyes  of  its  people;  heaven  hears  with 
the  ears  of  its  people.  They  further  emphasized 
popular  consent  as  the  most  essential  component 
of  good  government,  more  essential  than  sufficient 
food  and  sufficient  weapons  {Analects,  XX  2).  In 
short,  early  Confucians  proposed  an  important 
political  theory  that  holds  that  governmental 
legitimacy  depends  solely  on  popular  support  and 
trust.  This  notion  of  governmental  legitimacy  is  at 
the  heart  of  the  contemporary  theory  of  demo¬ 
cratic  regime  consolidation.  As  Juan  Linz  and 
Alfred  Stepan  (1996)  observe,  democracy 
becomes  legitimized  only  when  people  embrace  it 
unconditionally. 

To  obtain  good  government,  the  early 
Confucians  advocated  the  use  of  moral  persuasion 
rather  than  universally  applicable  penal  laws, 


Confucianism  401 


because  the  latter  often  require  coercion.  Having 
assumed  that  people  are  inherently  good  and  thus 
capable  of  learning  and  achieving  moral  develop¬ 
ment,  they  can  be  persuaded  to  overcome  their 
selfishness  and  exercise  moral  discipline  instead  of 
being  forced  to  do  so  by  law.  By  motivating  them 
to  follow  positive  examples  and  fulfill  their  duties 
voluntarily,  the  rule  of  virtue  or  rule  by  morality 
can  win  their  loyalty  to  the  government. 

Rule  by  law,  on  the  other  hand,  involves  forcing 
people  to  submit  to  government  through  punish¬ 
ments,  thus  making  them  distrust  it.  For  another 
reason,  early  Confucians  preferred  the  rule  of 
morality  and  customs  to  that  of  laws  and  regula¬ 
tions.  They  believed  that  a  sense  of  losing  face  and 
being  shamed  would  serve  as  a  more  effective 
deterrent  to  crime  than  the  sense  of  guilt  that  the 
latter  engenders. 

The  third  Confucian  principle  of  good  govern¬ 
ment  is  known  as  zbongyong,  or  the  doctrine  of 
the  Mean.  This  tells  rulers  how  to  make  decisions 
in  order  to  achieve  the  state  of  harmony  among 
those  with  different  preferences  and  interests.  To 
underline  the  importance  of  this  decision-making 
principle,  Confucius  said  that  perfect  is  the  virtue 
that  is  according  to  the  Mean.  The  core  idea  of  this 
principle  is  that  going  beyond  the  limit  is  as  bad  as 
falling  short  (A.  T.  Nuyen,  2000).  It  requires  being 
reasonable  and  moderate  without  going  to  the 
extreme  or  being  one-sided.  As  a  decision-making 
rule,  this  principle  predates  the  contemporary 
median  voter  theory,  holding  that  politicians 
should  commit  to  the  median  policy  position  pre¬ 
ferred  by  the  electorate  in  order  to  maximize  their 
votes  (Anthony  Downs,  1957). 

Conclusion 

Confucianism  equates  good  government  primarily 
with  a  benevolent  government  built  on  the  consent 
of  the  people.  It  also  equates  good  government 
with  primary  reliance  on  virtue  and  morality.  The 
Confucian  governing  model  calls  for  government 
for  the  people  by  a  virtuous  and  meritocratic  lead¬ 
ership,  not  government  by  the  people.  The 
Confucian  model  contrasts  sharply  with  the  liberal 
democratic  notion  of  government,  with  its  empha¬ 
sis  on  a  government  elected  by  the  people  and 
operated  according  to  the  rule  of  law.  Due  to  these 
differences,  there  is  contemporary  debate  among 


scholars  such  as  David  Hall  and  Roger  Ames, 
Samuel  Huntington,  and  Fareed  Zakaria  over 
whether  China  and  other  East  Asian  societies 
shaped  by  Confucianism  can  fully  accommodate 
to  liberal  democracy  as  practiced  in  the  West. 

Nonetheless,  we  should  note  that  Confucianism 
has  played  a  vital  role  in  the  process  of  socioeco¬ 
nomic  modernization  in  Asian  societies  by  inducing 
their  governments  to  promote  universal  education 
as  a  means  of  national  development.  To  the  schol¬ 
arly  community,  it  offers  the  original  notion  of  a 
human-needs  hierarchy  and  theories  of  governmen¬ 
tal  decision  making  and  legitimacy.  The  rich 
Confucian  cultural  heritage,  moreover,  provides 
the  mass  publics  of  the  Asian  nations  the  essential 
premises  on  which  they  can  construct  a  viable  bal¬ 
ance  of  individual  rights  and  interests  with  com¬ 
munal  goods  and  obligations  as  an  alternative  to 
the  liberal  democracy  model  practiced  in  the  West. 
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University  of  Missouri 
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Consensus 


Unlike  many  concepts  in  political  science  whose 
meaning  is  quite  contentious — for  example,  democ¬ 
racy,  ethnicity,  and  state — the  concept  of  consen¬ 
sus  is  among  the  simplest  and  most  transparent  in 
the  discipline.  It  is  universally  agreed  that  the  term 
denotes  agreement  or  unanimity.  It  is  often  used  in 
phrases  and  in  conjunction  with  qualifiers,  such  as 
near-consensus,  consensus  building,  international 
consensus,  moral  consensus,  consensus-based 
approach,  permissive  consensus,  and  cross-party 
consensus.  Although  there  is  strong  agreement  on 
the  meaning  of  consensus,  there  are  many  concepts 
that  refer  to  its  absence,  including  cleavage,  con¬ 
flict,  competition,  contestation,  division,  majori- 
tarianism,  pluralism,  and  polarization. 

The  concept  of  consensus  has  played  a  key  role  in 
four  areas  of  research  within  political  science:  polit¬ 
ical  philosophy,  where  John  Rawls  proposed  the 
term  overlapping  consensus  to  describe  a  frame¬ 
work  for  reconciling  fundamentally  divergent  com¬ 
prehensive  worldviews;  comparative  politics,  where 
Arend  Lijphart  proposed  the  term  consensus  democ¬ 
racy  as  an  ideal  type  within  the  genus  of  democratic 
regimes;  economic  policy,  where  the  concept  of  the 
Washington  Consensus  has  been  highly  influential 
since  the  1980s;  and  organization  theory,  where  the 
concept  of  domain  consensus  denotes  the  fact  that 
different  organizational  settings  are  characterized  by 
distinctive  norms  and  behavior.  While  the  concept 


of  consensus  has  been  centrally  important  in  these 
four  significant  research  areas,  it  has  been  used  in 
each  without  recognition  of  its  key  role  in  the  other 
three  research  programs.  This  entry  first  describes 
the  four  uses  of  the  term  and  then  identifies  some 
applications  of  consensus  in  political  practice. 

Four  Scholarly  Uses  of  the 
Concept  of  Consensus 

Overlapping  Consensus 

Rawls  proposed  the  term  overlapping  consensus 
to  provide  a  practical  solution  to  a  fundamental 
political  problem  for  liberal  democratic  theory: 
How  is  political  order  maintained  when  reason¬ 
able  people  hold  deeply  divergent  comprehensive 
worldviews?  Rawls  believed  that  there  is  no  philo¬ 
sophical  solution  to  the  problem,  that  it  is  impos¬ 
sible  to  reach  a  consensus  by  rational  discussion  on 
the  meaning  of  the  public  good.  His  solution  was 
to  propose  a  liberal  political  framework,  one 
maintained  by  overlapping  consensus  among  those 
with  fundamentally  different  comprehensive 
worldviews.  The  framework  reflects  a  “thin”  the¬ 
ory  of  the  good — that  is,  one  not  based  on  a  single 
conception  of  the  good.  However,  Rawls  claims 
that  this  liberal  framework  of  overlapping  consen¬ 
sus  is  not  a  second-best  pragmatic  compromise,  a 
mere  result  of  a  stalemate  among  competing 
worldviews,  and  a  regrettable  necessity  given  that 
no  one  worldview  is  sufficiently  powerful  to  dom¬ 
inate  all  others.  Instead,  he  claimed  that  a  liberal 
framework  is  the  best  political  solution,  one  that 
can  be  a  crucial  element  in  competing  comprehen¬ 
sive  worldviews.  (Some  critics  questioned  his  opti¬ 
mism  on  this  score.) 

Other  political  theorists,  including  Charles 
Taylor  and  Jurgen  Habermas,  have  advocated 
seeking  to  develop  consensus  through  rational  dis¬ 
cussion,  and  the  concept  known  as  deliberative 
democracy  has  highlighted  the  value  of  promoting 
extended  discussion  to  reach  consensus  rather  than 
foreclosing  deliberation  by  majority  vote.  (Political 
theorists  who  have  contributed  to  the  project  are 
James  S.  Fishkin  and  Jon  Elster.) 

From  another  vantage  point,  some  political  the¬ 
orists  have  challenged  the  normative  value  of  con¬ 
sensus.  They  claim  that  a  hallmark  of  democracy  is 
divergence,  debate,  and  conflict.  For  example, 
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Chantal  Mouffe  praises  the  value  of  what  she  terms 
antagonistic  pluralism.  At  an  extreme,  Isaiah  Berlin 
warned  decades  ago  of  the  danger  of  a  totalitarian 
temptation  implicit  in  seeking  and  imposing  a  sin¬ 
gle  consensual  understanding  of  the  common  good. 

Consensus  Democracy 

Lijphart  suggested  the  term  consensus  democ¬ 
racy  as  an  extension  and  partial  revision  of  the 
earlier  concept  that  he  developed:  consociational 
democracy.  Consociationalism  is  a  pattern  of  gov¬ 
ernance  that  Lijphart  first  identified  in  the 
Netherlands  and  later  applied,  in  both  descriptive 
and  normative  terms,  to  other  deeply  divided  soci¬ 
eties.  It  involves  a  pattern  of  decision  making  in 
which  elites  from  separate  sectors  or  pillars  reach 
accommodation  or  consensus  across  ethnic,  reli¬ 
gious,  or  ideological  divides. 

Lijphart  identified  consensus  democracy  as  a 
broader  form  of  political  accommodation.  It  is  an 
ideal  type  of  regime,  one  characterized  by  inclu¬ 
sion,  in  contrast  to  its  polar  opposite,  what 
Lijphart  termed  majoritarian  democracy.  Some 
key  institutional  features  of  consensus  democracies 
are  bicameral  legislatures,  the  use  of  proportional 
representation  to  elect  the  lower  house  of  the  leg¬ 
islature,  multiparty  systems,  multiparty  governing 
coalitions  that  typically  promote  supermajority 
support  for  the  passage  of  significant  reforms,  and 
ample  protection  for  minorities.  Consensus  democ¬ 
racies  are  associated  with  broad  governing  coali¬ 
tions.  Rather  than  reaching  decisions  by  narrow 
majorities,  consensus  democracies  promote  policy 
changes  only  if  they  enjoy  broad  political  support. 

Critics  of  Lijphart  have  challenged  what  they 
regard  as  antidemocratic  features  of  his  approach. 
They  charge  that  the  accommodations  reached  by 
elites  in  consociational  democracies  come  at  the 
expense  of  democratic  participation  and  decision 
making.  And  requiring  supermajorities  within  con¬ 
sensus  democracies  enables  minorities  to  block  the 
majority  will.  Lijphart  responded  that  comparisons 
between  the  two  regime  types  suggest  that  consen¬ 
sus  democracies  outperform  majoritarian  democ¬ 
racies  on  significant  measures  of  good  governance. 

Washington  Consensus 

The  term  Washington  Consensus  was  coined  in 
1990  to  describe  a  policy  agenda  developed  by 


officials  in  the  U.S.  government  and  the  World 
Bank  and  International  Monetary  Fund  (IMF), 
influential  international  financial  institutions  whose 
headquarters  are  in  Washington,  D.C.  Leaders  of 
these  institutions  agreed  that  neoliberal  economic 
policies  were  the  most  appropriate  and  desirable  to 
promote  economic  development  and  growth.  The 
Washington  Consensus  accorded  priority  to  eco¬ 
nomic  “fundamentals,”  including  balanced  bud¬ 
gets,  price  stability,  tax  and  spending  cuts,  trade 
liberalization,  deregulation,  and  privatization  of 
state-provided  goods  and  services  such  as  utilities, 
transport,  and  education.  These  goals  were  to  be 
achieved  by  adopting  structural  adjustment  policies 
consistent  with  neoliberal  doctrine.  In  simplest 
terms,  the  Washington  Consensus  advocated  more 
markets  and  less  state.  It  was  crafted  in  reaction  to 
the  approach  stressing  state-directed  economic 
policies,  which  was  practiced  by  less  developed 
countries,  especially  in  Latin  America,  in  the  1950s 
and  1960s.  A  key  element  of  the  statist  approach 
involved  import  substitute  industrialization — that 
is,  state  intervention  to  foster  domestic  industrial 
development,  especially  by  substituting  local  pro¬ 
duction  of  consumer  goods  for  foreign  imports. 
The  Washington  Consensus  was  extremely  influen¬ 
tial  for  several  decades.  It  informed  the  aid  policies 
of  the  U.S.  government,  the  World  Bank,  and  the 
IMF  and  shaped  the  orientation  of  many  govern¬ 
ments  throughout  the  less  developed  world. 

The  Washington  Consensus  was  far  from  com¬ 
manding  a  consensus.  Opponents  included  pro¬ 
gressive  economists  in  the  advanced  industrial 
world  and  governmental  leaders  and  popular 
movements  in  countries  such  as  Malaysia,  Russia, 
and  Argentina,  which  experienced  economic  dislo¬ 
cations  and  crises  after  adopting  the  policies  of  the 
Washington  Consensus.  Critics  charged  that  the 
Washington  Consensus  promoted  austerity  for  the 
broad  mass  of  the  population  without  generating 
adequate  economic  benefits,  reflected  a  one-size- 
fits-all  approach  that  ignored  extensive  variations 
among  countries,  emphasized  the  value  of  large 
industrialization  projects  that  often  caused  social 
dislocations  and  environmental  degradation, 
bypassed  democratic  decision  making,  produced 
financial  panics  and  economic  crises,  and  neglected 
problems  of  poverty,  corruption,  and  autocracy. 

By  the  1990s,  modifications  occurred  within  the 
Washington  Consensus.  For  example,  the  World 
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Bank  reduced  support  for  large-scale  industrial 
projects  such  as  giant  dams;  placed  priority  on 
developing  honest,  effective  political  institutions; 
and  provided  for  wider  consultation  in  developing 
policy  recommendations.  However,  as  a  general 
matter,  changes  have  occurred  within  the  Wash¬ 
ington  Consensus;  no  comparably  broad  alterna¬ 
tive  has  replaced  the  paradigm. 

Domain  Consensus 

The  concept  of  domain  consensus  has  been  an 
important  element  within  the  field  of  organization 
theory.  It  refers  to  the  fact  that  expectations  about 
what  is  appropriate  behavior  varies  for  members 
of  different  domains,  including  sectors  such  as  civil 
society,  the  market,  and  the  state;  different  regions 
and  localities;  and  specific  organizations.  The  con¬ 
cept  refers  to  expectations  about  how  members  in 
the  domain  will  interact  with  each  other  and  with 
nonmembers,  about  the  nature  of  their  respective 
roles  and  responsibilities,  and  about  what  the  col¬ 
lectivity  will  and  will  not  do. 

Political  Applications  of  Consensus 

Consensus-based  approaches  to  decision  making 
are  relatively  rare  in  nation-states.  Bureaucracies 
usually  make  decisions  by  fiat,  often  after  top- 
down  consultation.  Legislative  decision  making  is 
usually  made  on  the  basis  of  majority  rule,  although 
in  many  cases  (especially  in  Lijphart’s  consensus 
democracies)  supermajorities  are  the  rule.  The  sit¬ 
uation  is  different  in  most  international  organiza¬ 
tions,  where  decisions  typically  require  a  consen¬ 
sus  of  member  states.  Examples  include  the 
Association  of  Southeast  Asian  Nations  (ASEAN), 
Asia-Pacific  Economic  Cooperation  (APEC), 
Southern  Common  Market  Agreement  (Mercado 
Comun  del  Sur,  MERCOSUR),  and  World  Trade 
Organization  (WTO).  There  are  exceptions.  In  the 
European  Union,  voting  on  many  issues  is  by  a 
simple  majority,  decisions  on  others  is  by  a 
weighted  majority  in  which  certain  member  states 
have  plural  votes,  and  some  issues  are  of  such 
importance  (notably,  the  admission  of  new  mem¬ 
bers  or  changes  of  constitutional  importance)  that 
unanimity  by  member  states  is  required.  The 
United  Nations  Security  Council  practices  a  form 
of  supermajority  voting.  Not  only  must  a  proposal 


pass  by  majority  vote  but  all  five  permanent  mem¬ 
bers  of  the  Council  must  either  support  or  at  least 
refrain  from  vetoing  proposals. 

There  are  many  examples  of  microlevel  institu¬ 
tions  that  employ  a  consensus-based  approach 
based  on  the  principles  of  deliberative  democracy, 
including  consensus  conferences  in  Denmark,  citi¬ 
zens’  juries  in  the  United  States,  planning  cells  in 
Germany,  and  citizens’  panels  in  Britain.  These 
mechanisms  often  bring  together  technical  experts, 
government  officials,  and  rank-and-file  citizens  to 
analyze  issues  such  as  the  siting  of  waste  treatment 
plants  or  the  risks  involved  in  genetically  modified 
organisms  and  other  technological  innovations. 
The  goal  is  to  encourage  those  of  diverse  perspec¬ 
tives  to  engage  in  extended  dialogue  in  order  to 
forge  a  consensus  or  at  least  avoid  polarization. 

As  is  evident  from  this  overview,  if  the  definition 
of  consensus  is  unproblematic,  there  is  no  consen¬ 
sus  on  how  the  concept  is  applied  and  evaluated.  It 
has  played  a  role  in  highly  diverse  political  analyses 
and  practices.  This  both  testifies  to  its  utility  and 
invites  further  reflection  and  experimentation  by 
scholars,  practitioners,  and  citizens. 

Mark  Kesselman 
Columbia  University 
New  York  City,  New  York,  United  States 
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Conservatism 


Conservatism  is  one  of  the  basic  currents  of  demo¬ 
cratic  political  life,  which  is  founded  on  a  (moder¬ 
ate)  critique  of  liberalism.  Conservatism  generally 
accepts  the  political  institutions  of  liberal  democ¬ 
racy,  but  it  denounces  the  dangers  that  come  from 
the  decline  of  traditions  and  from  the  modern  eman¬ 
cipation  of  individuals.  The  study  of  the  diverse 
traditions  of  conservatism  must  start  with  the  expla¬ 
nation  of  its  complex  nature  as  philosophy  and  as 
political  attitude;  it  requires  a  historical  study  of  the 
growth  of  conservatism  from  the  Enlightenment  to 
the  birth  of  modern  democracy  through  the  demo¬ 
cratic  revolutions  of  Europe  and  America  and,  in 
the  end,  a  philosophical  examination  of  its  theoreti¬ 
cal  foundations.  This  entry  begins  with  a  definition 
of  conservatism  as  both  a  philosophy  and  a  political 
ideology,  briefly  traces  its  historical  origins  and 
development  in  British  politics,  and  concludes  with 
a  look  at  contemporary  conservative  thought. 

Conservatism  is  both  a  political  philosophy, 
with  its  concepts,  traditions  and  classical  authors, 
and  a  political  ideology,  which  is  one  of  the  main 
currents  of  political  life  in  modern  democratic 
politics.  As  a  philosophy,  it  is  founded  on  a  cri¬ 
tique  of  modern  liberalism,  which,  as  Philippe 
Beneton  has  shown,  is  directed  against  what  con¬ 
servatism  regards  as  three  great  illusions  that  are 
the  consequences  of  individual  liberalism.  The  first 
of  these  illusions  is  epistemological  and  comes 
from  one  of  the  main  tendencies  of  modern  phi¬ 
losophy,  originating  with  Rene  Descartes  and  the 
age  of  Enlightenment:  It  consists  in  identifying 
individual  reason  as  the  judge  of  truth.  The  second 
one,  more  directly  political,  makes  individual  rea¬ 
son  the  only  foundation  of  political  legitimacy. 
The  third  could  be  named  “sociological”  and  leads 
to  the  notion  of  society  as  a  mere  aggregate  of 
individuals,  to  the  detriment  of  community  and/or 
of  social  hierarchy.  To  these  illusions,  conserva¬ 
tism  objects  that  the  individual  is  never  really  inde¬ 
pendent,  that  all  his  or  her  accomplishments  have 
some  underlying  communitarian  or  traditional 
condition  that  escapes  individual  reason,  and  that 
the  neglect  of  such  a  debt  is  a  danger  for  the  com¬ 
munity  and  finally  even  for  the  individual. 

Conservatism  is  not  only  a  philosophy  but  also  a 
political  attitude  that  interacts  with  other  currents 


in  political  life.  To  understand  its  nature,  we  can 
start  with  the  usual  tripartition  that  makes  conser¬ 
vatism  one  of  the  three  basic  political  currents, 
together  with  liberalism  and  socialism:  Liberalism 
is  the  center  of  gravity  of  modern  democratic 
politics  but,  as  it  cannot  satisfy  all  the  expecta¬ 
tions  that  arise  in  an  open  society,  it  engenders 
permanent  criticisms  that  can  be  “socialist,”  if 
they  argue  for  a  new  reconstruction  of  society,  or 
“conservative,”  if  they  try  to  maintain  or  restore 
something  of  the  “world  we  have  lost.”  In  both 
definitions,  conservatism  appears  not  only  as  an 
expression  of  permanent  dispositions  of  human 
nature  but  also  as  a  paradoxical  product  of 
modernity,  which  is  dependent  on  what  it  con¬ 
tests,  namely,  the  faith  in  individual  reason  as  the 
foundation  of  authority,  and  the  philosophical 
and  political  order,  and  which  can  be  defined  by 
three  essential  features: 

1.  Conservatism  is  a  modern  criticism  of 
modernity,  which  criticizes  or  even  condemns 
some  principles  of  modernity  but  does  not  reject 
all  the  consequences  of  liberal  modernity. 

2.  Even  if  conservatism  was  strengthened  by  its 
opposition  to  the  French  Revolution,  its  main 
themes  were  born  before  it,  within  liberal 
politics,  as  an  internal  criticism  of  classical 
liberalism. 

3.  The  most  central  idea  of  conservative  thought  is 
that  of  the  risk  of  self-destruction  of  liberal 
traditions  and  institutions,  which  is  supposed  to 
be  the  consequence  of  a  violent  rupture  with  the 
past  and/or  with  the  “natural”  basis  of  human 
societies.  The  diverse  varieties  of  conservatism 
differ  in  their  interpretations  of  this  common 
idea. 

Thus,  comprehension  of  conservatism  requires  a 
historical  examination  of  the  growth  of  conserva¬ 
tive  politics  between  the  17th  and  19th  centuries 
and  then  a  philosophical  examination  of  the 
diverse  expressions  of  the  conservative  arguments. 

The  British  Sources  of  Conservatism 

The  distinction  between  liberals  and  conservatives 
emerged  within  English  politics,  with  the  opposi¬ 
tion  between  the  Tory  and  Whig  Parties,  and  it 
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took  on  a  new  significance  with  the  French 
Revolution. 

Whig  and  Tory  are  two  English  words  that  refer 
to  the  two  great  tendencies — respectively,  liberal 
and  conservative — of  British  politics  before  the  rise 
of  the  Labour  Party.  A  Whig  is  not  only  a  liberal 
but  also  a  conscious  heir  of  the  constitutional 
English  tradition,  that  of  rule  of  law  and  parlia¬ 
mentary  government;  a  Tory  is  devoted  to  a  certain 
type  of  social  relations,  in  which  the  authority  of 
“gentlemen”  goes  with  a  protective  function  and 
which  is  connected  with  High  Church  Anglicanism. 
Before  1688,  the  appellation  “Tory”  designated 
the  supporters  of  James  II  (who  the  “Whigs”  mis¬ 
trusted  because  of  his  alleged  “papism”  and  his 
commitment  to  a  French-style  “absolutist”  monar¬ 
chy),  but  after  the  Glorious  Revolution  the  signifi¬ 
cation  of  the  opposition  changed.  Most  Tories 
abandoned  the  absolutist  doctrine  and  partially 
accepted  the  Whig  interpretation  of  the  English 
regime  as  a  limited  monarchy,  but  they  remained 
faithful  to  the  institution  of  prerogative,  which 
authorizes  the  king  to  make  exceptions  to  the  law 
of  parliament  in  some  special  cases,  and,  more  gen¬ 
erally,  they  were  devoted  to  everything  that,  in  the 
English  regime,  was  favorable  to  traditional  hierar¬ 
chies.  The  Tory  Party  was  marginal  during  the 
long  period  of  Whig  supremacy,  which  begins  with 
the  accession  of  George  I  to  the  throne  (1714),  but 
the  opposition  Whig  and  Tory  became  relevant 
again  at  the  end  of  18th  century.  The  Tory  Party 
found  a  new  vigor  with  William  Pitt  the  Younger, 
who  founded  a  new  alliance  between  gentry  and 
the  commercial  elite,  while  the  New  Whigs  became 
more  democratic  thanks  to  the  leadership  of  James 
Fox.  The  French  Revolution  showed  the  signifi¬ 
cance  of  this  evolution;  it  caused  the  rupture 
between  James  Fox  and  Edmund  Burke  who, 
although  a  Whig,  denounced  the  French  experience 
as  a  radical  subversion  of  a  tradition  that  English 
and  even  American  Revolutions  had  respected. 
Whiggism  and,  more  generally,  liberalism  divided 
into  two  tendencies,  “radical”  and  “conservative,” 
which  never  disappeared  afterward. 

Has  English  conservatism  experienced  a  general 
significance  beyond  this  historical  context?  One 
may  think  so,  if  one  leans  on  the  interpretation  of 
English  politics  that  David  Hume  gave  and  on  a 
reflection  on  the  philosophical  implications 
Edmund  Burke’s  criticism  of  the  French  Revolution. 


David  Hume  (1711-1776)  was  certainly  a  “lib¬ 
eral”  with  some  “conservative”  elements  in  his 
doctrine,  but  he  was  neither  a  Whig  nor  a  Tory, 
because  he  interpreted  the  English  constitution  in  a 
sense  that  explains  the  permanent  division  of 
English  politics  between  successive  pairs  of  parties 
(first  Tory/Whig,  then  Court/Country).  According 
to  Hume,  the  English  regime  is  both  republican, 
because  liberty  is  expressed  through  the  institution 
of  Parliament,  and  monarchic,  because  of  the  exec¬ 
utive  power  of  the  king.  So  the  “parties  of  Great 
Britain”  are  for  him  the  expressions  of  the  dual 
nature  of  the  constitution.  In  political  philosophy, 
this  opposition  gave  birth  to  two  conflicting  doc¬ 
trines.  The  Whigs  consider  that  all  legitimate  gov¬ 
ernment  must  be  founded  on  explicit  consent 
(“original  contract”);  the  Tories  offer  an  extremist 
interpretation  of  loyalism  and  argue  for  the  idea  of 
“passive  obedience”  to  legal  authorities  (especially 
to  the  king).  Hume  refuses  both  principles:  “Original 
contract”  cannot  explain  the  real  nature  of  political 
obligation,  and  “passive  obedience”  is  incompatible 
with  the  real  interest  of  society.  Nevertheless,  he 
considers  that  these  two  conflicting  principles  have 
some  kind  of  truth  in  the  English  context,  as  expres¬ 
sions  of  the  two  elements  of  the  constitution  of 
Great  Britain,  and  he  shows  how  some  strong  “con¬ 
servative”  feeling  can  be  a  functional  feature  of  a 
free  government.  But,  even  if  he  is  not  himself  a 
“Tory,”  Hume  provides  a  political  philosophy  that 
is  “conservative”  in  another  sense.  Hume’s  doctrine 
of  politics  accepts  the  effects  of  revolution  without 
assuming  its  principles  because  he  refuses  the  illu¬ 
sion  of  a  rational,  deliberate  reconstruction  of 
political  order;  he  argues  for  a  sort  of  synthesis 
between  “liberalism”  and  “conservatism,”  accord¬ 
ing  to  which  the  durable  foundation  of  liberty 
requires  the  stability  of  institutions  and  a  historical 
continuity  that  makes  possible  a  translation  of 
political  innovations  into  the  language  of  tradition. 
Beyond  the  Tory  pastoral,  another  kind  of  conser¬ 
vatism  is  possible,  philosophically  skeptical  and 
open  to  political  innovation,  intended  as  a  para¬ 
doxical  mean  of  conservation  of  social  order  (the 
well-known  sentence  of  novelist  Giuseppe  di 
Lampedusa’s  hero  in  The  Leopard — “Everything 
must  change  so  that  everything  can  stay  the 
same” — could  just  as  well  be  from  Hume). 

The  French  Revolution  was  seen  as  the  critical 
turning  point  in  the  history  of  liberalism,  which 
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produced  a  division  between  two  wings:  the  radical 
or  democratic  and  the  conservative.  The  most 
important  author  is  Edmund  Burke,  who  was  the 
first  great  critic  of  the  French  Revolution,  as  early 
as  1790,  with  his  major  book  Reflections  on  the 
Revolution  in  France.  Burke  was  himself  a  genuine 
Whig,  who  had  supported  every  great  liberal  cause, 
from  the  fight  against  political  corruption  to  reli¬ 
gious  freedom,  and  who  had  even  accepted 
American  Independence;  so  his  opposition  to  the 
French  Revolution  was  especially  significant  as  an 
expression  of  the  divisions  of  liberal  opinion. 

According  to  Burke,  there  is  an  essential  differ¬ 
ence  between  the  French  Revolution  of  1789  and 
the  Glorious  Revolution  of  1688  or  even  the 
American  Revolution.  In  one  case,  people  make  an 
exception  to  the  constitutional  rule  in  order  to  save 
a  regime  at  risk  (the  Glorious  Revolution  of  1688) 
or  to  resist  tyranny  (the  American  Revolution  of 
1776);  in  the  other  one,  a  rule  is  made  of  the  excep¬ 
tion  itself,  by  the  affirmation  that  political  liberty 
and  the  capacity  to  change  the  rulers  are  human 
rights.  For  that  reason,  the  French  theory  of  droits 
de  I’homme  (human  rights)  represents  an  aggres¬ 
sion  against  the  European  political  order  and  a 
rupture  of  tradition.  On  this  foundation,  Burke 
builds  a  coherent  conservative  doctrine,  the  influ¬ 
ence  of  which  is  very  large.  The  basic  idea  is  that  of 
the  fragility  of  human  nature,  which  implies  that 
freedom  cannot  be  natural  but  emerges  only  as  a 
product  of  history  and  chance,  much  more  than  of 
deliberate  action  (such  is  the  meaning  of  the  well- 
known  opposition  between  “rights  of  man”  and 
“rights  of  English  people”).  For  this  reason,  the 
first  condition  of  liberty  is  historical  continuity, 
and  it  explains  the  ambivalence  of  the  Enlightenment: 
If,  as  in  England,  modernity  respects  tradition,  its 
effects  can  be  beneficent,  but  when  modern  reason 
revolts  against  the  Christian  and  aristocratic  sources 
of  liberty,  it  becomes  destructive.  Thus,  the  English 
regime  is  an  example  of  moderate  liberty,  in  which 
open  aristocracy  associates  itself  with  new  elites 
thanks  to  a  transaction  between  “landed  interest” 
and  “monied  interest,”  whereas  France  is  the  the¬ 
ater  of  a  mortal  conflict  between  noblesse  and 
bourgeoisie,  which  produces  the  insurrection  of  the 
mob.  This  philosophy  implies  a  sound  criticism  of 
modern  individualism,  which  constitutes  the  cen¬ 
tral  argument  of  Burke  in  favor  of  “prejudices”  and 
tradition,  which  are  more  rational  than  individual 


reason.  Burke  reasserts  many  liberal  views,  most  of 
which  come  from  English  and  Scottish  traditions, 
but  he  reinterprets  them  in  a  new  sense:  The  market 
is  a  way  not  only  of  cooperation  but  also  of  subor¬ 
dination;  the  social  contract  is  not  equalitarian  but 
is  founded  on  the  alliance  between  God  and  His 
creatures.  So  this  theory,  which,  in  Burke’s  works, 
remains  liberal  and,  in  some  way,  faithful  to  mod¬ 
erate  Enlightenment,  is  also  one  of  the  main  sources 
of  illiberal  doctrines,  as  those  of  the  French  Counter- 
Revolution  or  German  Political  Romanticism. 

If  the  legacy  of  classical  conservatism  (“Tory,” 
“Humean,”  or  “Burkean”)  is  rich  and  even  impos¬ 
ing  in  the  English-speaking  world,  it  is  commonly 
seen  as  less  impressive  in  continental  Europe  and 
especially  in  Germany  and  in  France  (even  if  politi¬ 
cians  like  Guizot  and  Bismarck  or  de  Gaulle  and 
Adenauer  can  be  considered  as  great  conservative 
statesmen).  In  this  part  of  the  world,  as  political 
liberty  comes  principally  from  the  French 
Revolution,  liberalism  is  not  traditional,  and  the 
criticism  of  liberal  and/or  radical  politics  is  often 
more  reactionary  than  conservative. 

Main  Currents  of  Conservative  Thought 

To  identify  the  main  themes  of  conservative 
thought,  one  can  either  use  a  historical  investiga¬ 
tion  (which  presupposes  some  hypothesis  about 
the  nature  of  conservatism)  or  try  to  give  a  general 
analysis  that  follows  from  a  view  of  the  uses  and 
functions  of  conservatism  in  political  life. 

The  first  way  can  be  exemplified  by  Russell 
Kirk’s  The  Conservative  Mind ,  which  is  itself  a 
classical  work  of  conservative  philosophy.  Kirk 
presented  himself  as  the  American  champion  of  a 
forgotten  tradition,  who  tried  to  make  Americans 
conscious  of  the  conservative  elements  of  their  own 
culture.  He  goes  back  to  the  origins  of  American 
tradition  and  tries  to  resuscitate  the  classical  or 
traditional  dimension  of  the  American  regime, 
which  he  interprets  as  a  mixed  constitution,  includ¬ 
ing,  thanks  to  John  Adams,  some  aristocratic  fea¬ 
tures  not  reducible  to  the  democratic  spirit  of 
Jefferson.  Referring  to  diverse  American  and 
English  thinkers,  Kirk  proposes  a  kind  of  ideal  type 
of  conservatism,  which  combines  features  such  as 
the  belief  in  a  transcendent  order,  affection  for  the 
“variety  and  mystery”  of  human  existence,  accep¬ 
tance  of  social  hierarchy,  and  devotion  to  custom 
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and  tradition  as  conditions  of  legitimate  innova¬ 
tions.  Invoking  sometimes  tradition,  sometimes 
divine  revelation  or  natural  law,  Kirk  is  a  religious 
(Christian)  and  traditionalist  thinker  whose  phi¬ 
losophy  conveys  a  kind  of  American  Toryism. 

One  can  find  a  good  example  of  a  structural 
analysis  of  conservatism  in  Albert  O.  Hirschman’s 
The  Rhetoric  of  Reaction:  Perversity,  Futility, 
Jeopardy,  which  describes  the  general  features  of 
conservative  rhetoric  when  confronted  by  various 
examples  of  liberal  or  radical  arguments  or  action. 
Conservatism  (which  could  be  called  here  a  “reac¬ 
tion”)  is  supposed  to  underlie  the  typical  argu¬ 
ments  that  can  be  opposed  to  any  attempts  to 
transform  or  ameliorate  the  social  order.  One  can 
say  the  projected  action  is  dangerous,  because  it 
can  produce  unintended  and  bad  or  “perverse” 
effects  (the  argument  of  “perversity”).  One  can 
insist  on  the  uselessness  of  reforms  or  revolutions, 
which  never  can  change  the  human  condition  or 
the  basic  social  relations  (“futility”).  One  can, 
finally,  assert  that  attractive  innovations  are  dan¬ 
gerous  for  social  goods  (“jeopardy”  vs.,  for  exam¬ 
ple,  the  criticism  of  the  welfare  state  as  a  danger 
for  civil  and  political  liberty). 

Both  theories  present  some  interest  as  descrip¬ 
tions  of  the  conservative  spirit  or  attitude,  but  to 
some  observers,  neither  of  them  seems  really  suf¬ 
ficient.  One  possible  weakness  of  Kirk’s  account  is 
a  lack  of  emphasis  on  certain  themes  that  are 
independent  of  religious  faith:  Many  “conserva¬ 
tive”  authors,  notably  in  what  Robert  Nisbet  calls 
the  “sociological  tradition,”  were  in  fact  nonbe¬ 
lievers  or  even  atheists,  whose  criticism  of  moder¬ 
nity  is  not  founded  on  nostalgia  for  the  age  of 
faith  or  on  the  idea  of  natural  law.  As  for 
Hirschman,  his  model  can  be  applied  to  any  argu¬ 
mentation  against  deliberate  action:  the  so-called 
reactionary  rhetoric  can  be  used  by  liberals  or 
progressives  (if  they  want  to  prevent  some  “reac¬ 
tionary”  decision)  and,  on  the  contrary,  conserva¬ 
tives  can  abandon  and  even  reject  it  when  they  try 
to  convince  people  to  act  with  courage  or  to 
accept  some  risks  for  the  public  good.  There  are 
probably  several  sorts  of  conservatism,  insofar  as 
the  criticism  of  modernity  can  be  made  from  dif¬ 
ferent  points  of  view.  Perhaps  the  definition  of 
conservatism  must  be  essentially  negative,  starting 
with  what  conservatives  refuse  rather  than  with 
what  they  desire.  If  one  accepts  the  hypothesis 


with  which  this  entry  starts  (conservatism  is  a 
basically  a  modern  criticism  of  modernity),  one 
can  say  that  the  main  theme  of  conservatism  lies 
in  the  refusal  of  two  liberal  and  progressive  ideas: 
that  of  the  self-sufficiency  of  the  individual  and 
that  of  a  possible  social  solution  to  the  permanent 
problems  of  humanity.  This  refusal  is  evidently 
central  in  the  thinkers  that  Kirk  admires,  but  it  is 
no  less  important  in  other  traditions  that  are  not 
committed  to  the  nostalgia  for  religious  tradition. 
In  fact,  besides  lovers  of  transcendence,  we  find 
among  conservatives  many  sober  (and  sometimes 
even  cynical)  critics  of  any  sort  of  idealism.  This  is 
notably  the  case,  in  the  sociological  tradition,  of 
the  so-called  Machiavellians,  who  contributed 
notably  to  the  growth  and  vitality  of  conservative 
thought. 

One  of  the  most  interesting  contributions  of 
social  science  to  conservative  thought  can  be  found 
in  Vilfredo  Pareto’s  Treatise  on  General  Sociology, 
which  is  at  the  same  time  very  critical  of  humani¬ 
tarian  values  and  almost  skeptical  toward  tradi¬ 
tional  ideas.  The  basic  thesis  of  the  Treatise  is  that 
of  the  eternity  of  the  division  of  society  between 
the  elite  and  “the  people”:  It  comes  from 
Machiavelli  but  without  any  of  the  democratic 
connotations  that  one  can  find  in  the  Florentine 
philosopher.  For  Pareto,  society  is  always  divided 
between  the  mass  of  individuals  and  the  ruling, 
honored  (and  generally  propertied)  minority, 
whose  definition  is  itself  value  neutral.  The  elites 
are  not  necessarily  virtuous  or  even  beneficent: 
They  include  all  the  persons  or  groups  that  are  in 
some  way  above  the  average.  Concerning  the  rul¬ 
ing  elite,  it  is  defined  by  some  capacity  for  domina¬ 
tion,  either  by  strength  or  by  cunning  (see 
Machiavelli’s  division  of  rulers  into  “lions”  and 
“foxes”).  So  this  “sociology”  is  supposed  to  refute 
the  democratic  and  socialist  idea  of  an  egalitarian 
society,  and  it  implies  some  “conservative”  or 
“reactionary”  argument  in  the  sense  of  Hirschman’s 
theory:  If  division  of  society  is  eternal,  social 
change  is  an  illusion,  and  revolutions  are,  at  best, 
futile.  Besides,  Pareto  gives  not  only  an  argument 
against  some  specific  action  but  also  a  general 
theory,  which  includes  a  pessimistic  view  of  human 
capacity  to  rationalize  society.  However,  this  the¬ 
ory  is  not  “conservative”  in  Kirk’s  sense  of  the 
word;  the  social  order  has  no  transcendent  basis  or 
value,  and  the  first  aim  of  the  ruling  class  is  not  to 
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promote  the  social  good  but  to  make  its  domina¬ 
tion  legitimate.  So  we  can  consider  Pareto’s  theory 
as  an  example  of  a  conservative  theory  that  aban¬ 
dons  the  arguments  favoring  tradition  and  legiti¬ 
mates  the  ruling  elites  without  accepting  their 
opinions  or  arguments. 

From  this  point  of  view,  the  most  important  ele¬ 
ments  of  Pareto’s  theory  are  probably,  on  the  one 
hand,  the  idea  of  the  circulation  of  elites  and,  on 
the  other  hand,  the  division  of  motives  of  actions 
between  residues  and  derivations.  The  theory  of 
circulation  of  elites  derives  from  the  elite-mass 
division:  If  that  division  is  eternal,  all  that  happens 
can  be  only  a  replacement  of  an  old,  exhausted  elite 
by  a  new  and  vigorous  one.  But  it  implies  also  some 
political  teachings  for  the  ruling  class.  If  it  wants  to 
keep  its  position,  the  elite  must  not  submit  to 
humanitarian  illusion  of  egalitarian  relations  with 
the  ruled,  but  it  must  also  be  open  to  some  elements 
from  the  ruled,  both  to  attenuate  the  conflict  with 
other  classes  and  to  regenerate  itself  thanks  to  the 
incorporation  of  talented  people.  So,  even  if  it 
includes  some  emphatic  exaltation  of  the  elite,  the 
Paretian  ideal  of  a  ruling  class  is  close  to  the  moder¬ 
ate  English  idea  of  open  aristocracy.  The  distinc¬ 
tion  between  residues  and  derivations  gives  a  socio¬ 
logical  equivalent  of  a  theory  of  human  nature.  For 
Pareto,  the  main  problem  of  sociology  is  to  explain 
what  he  calls  “nonlogical”  actions — that  is,  actions 
in  which  the  means  are  not  adapted  to  the  ends, 
even  if  the  actor  believes  that  they  are.  People  give 
many  arguments  for  such  actions,  but,  in  fact, 
experience  shows  that  these  justifications  are  less 
important  than  the  basic  feeling,  or  instinct,  that 
underlies  human  actions.  So  one  can  distinguish 
between  two  classes  of  motives,  a  and  b.  The  first 
class — a,  that  of  residues — includes  constant  and 
permanent  motives  of  action;  the  second  one — b, 
that  of  derivations — comes  from  the  “work  accom¬ 
plished  by  mind  in  order  to  explain  a ”  and  is  much 
more  variable  because  it  comes  from  “fancy” 
( Treatise ,  sec.  850-851).  Pareto  enumerates  six 
classes  of  residues,  among  which  the  two  first  ones 
have  the  greatest  importance  and  pertinence  for 
political  analysis.  The  first  class  is  the  “instinct  of 
combinations,”  which  underlies  invention  and 
innovation  and  which  is  the  basis  of  progress  or 
civilization.  The  second  class  is  “persistence  of 
aggregates,”  which  explains  the  fondness  for  past 
combinations  and  the  mistrust  of  innovations.  Each 


of  them  corresponds  to  distinct  types  of  human 
beings  and  the  predominance  of  one  class  of  resi¬ 
dues  in  a  society  explains  much  of  its  culture  and 
of  its  political  trends.  “Conservatism”  in  the  vul¬ 
gar  sense  evidently  has  a  strong  affinity  with  “per¬ 
sistence  of  aggregates,”  but,  for  Pareto,  the  conser¬ 
vation  of  society  (and  of  an  elite)  requires  an 
association  between  the  two  classes  of  residues. 
On  one  side,  we  have  ancient  Athens,  or  modern 
France,  in  which  “instinct  of  combination”  pre¬ 
dominates:  These  societies  are  brilliant  but  unsta¬ 
ble.  On  the  other  side,  one  finds  Sparta,  or  19th- 
century  Prussia,  in  which  “persistence  of  aggre¬ 
gates”  is  very  important:  They  can  seem  stronger, 
but  they  lack  the  capacity  of  adaptation  to  change. 
The  middle  between  these  extremes  could  be 
ancient  Rome  or  modern  England,  in  which  the 
combination  of  the  two  classes  is  such  that  innova¬ 
tion  does  not  jeopardize  the  conservation  of  soci¬ 
ety  or  of  political  combination.  So,  the  general 
theory  of  human  motives  (or  of  human  nature) 
confirms  the  teaching  of  the  analysis  of  social 
stratification:  The  best  way  for  the  elite  is  to  accept 
change  and  circulation  without  adoring  them  and 
this  “dynamic”  conservatism  is  founded  on  a  secu¬ 
lar  vision  without  any  faith  in  Natural  Law  or  in 
religion. 

Pareto  has  many  descendants  in  social  science 
who  could  be  defined  as  realists  and  modern  con¬ 
servatives.  The  most  prominent  of  them  is  proba¬ 
bly  Joseph  Schumpeter,  whose  theory  of  capitalism 
and  of  democracy  is  a  touchstone  of  conservative 
sociology.  In  his  political  theory,  Schumpeter 
shows  that  democracy  is  not  really  the  government 
of  the  people  by  the  people  and  for  the  people  but 
is  simply  the  free  competition  of  elites  for  power, 
with  the  people  as  arbiter.  However,  he  admits 
implicitly  that  people  count,  since  he  considers 
that  democracy  can  undermine  capitalism  and 
even  engender  “socialism.”  His  vision  of  capital¬ 
ism  includes  a  major  topic  of  conservatism. 
Capitalism  is  defined  by  its  capacity  for  innova¬ 
tion,  but  this  strength  is  at  the  same  time  its  weak¬ 
ness:  The  danger  is  that  capitalism  could  itself 
destroy  the  cultural  conditions  of  its  development. 

So  conservatism  includes  a  traditionalist  criti¬ 
cism  of  modernization  but  is  not  reducible  to  that. 
If  it  has  its  roots  in  disillusionment  with  the  effects 
of  modern  liberalism,  it  is  possible  to  designate  as 
“conservative”  some  other  premodern  doctrines, 
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when  they  are  founded  on  a  similar  skeptical  reac¬ 
tion  to  social  change.  Inversely,  one  can  consider 
that  conservatism  is  always  difficult  in  modern 
democracies,  where  innovation  is  generally  more 
appreciated  than  stability.  One  example  of  this  dif¬ 
ficulty  could  be  the  so-called  American  neoconser¬ 
vatives,  who  advocated  the  use  of  radical  means  to 
attain  conservative  ends. 
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Conservative  Parties 


Parties  are  organizations  that  seek  to  mobilize 
public  support  in  order  to  compete  for  political 


power.  In  principle,  attempts  at  support  mobiliza¬ 
tion  can  be,  and  not  infrequently  are,  based  on 
appeals  other  than  those  rooted  in  political  views 
(e.g.,  the  promise  of  patronage  benefits).  However, 
where  the  publics  to  be  mobilized  are  very  large 
(as  in  most  liberal-democratic  regimes),  parties 
cannot  avoid  having  to  compete  by  espousing 
alternative  sets  of  public  policies — in  their  turn 
linked  to,  and  informed  by,  contrasting  political 
principles  or  ideologies.  At  their  simplest,  then, 
conservative  parties  can  be  defined  as  organiza¬ 
tions  bringing  together  people  committed  to  the 
quest  for  power  in  order  to  advance  an  agenda  of 
conservatism — which  may  in  turn  be  defined  as  a 
set  of  political  principles  arising  from  and  express¬ 
ing  a  commitment  to  the  status  quo  or  the  status 
quo  ante.  Yet  on  this  definition  alone,  even  parties 
that  most  political  scientists  would  not  see  as 
belonging  to  the  category — for  example,  the  Soviet 
Communist  Party  after  1917 — would  qualify.  So 
there  is  also  a  historical  criterion  that  must  be 
fulfilled,  since  the  parties  that  have  tended  to 
attract  the  label  are  ones  that  first  emerged  in 
some  parts  of  the  world  about  200  years  ago  to 
counter  the  demands  of  liberals — here  meaning 
18th-  and  19th-century  Europeans  committed  to 
four  essential  tenets:  (1)  a  belief  in  the  ontological 
priority  of  individuals  as  against  society  and 

(2)  an  unlimited  potential  for  human  improve¬ 
ment  through  the  application  of  reason,  therefore 

(3)  a  normative  commitment  to  individual  free¬ 
dom  and  the  view  that  (4)  government  is  legiti¬ 
mate  only  insofar  as  it  rests  on  the  consent  of  the 
governed.  Conservatives  especially  opposed, 
among  the  political  positions  stemming  from  these 
tenets,  those  according  to  which  there  was  neither 
inherent  value  in  tradition  nor  any  role  for  hered¬ 
itary  or  religious  criteria  in  political  institutions  or 
decision  making.  This  entry  reviews  conservative 
political  principles  in  more  detail,  looking  at  how 
they  have  evolved  over  time.  It  also  considers  how 
the  parties  espousing  them  have  been  organized, 
who  has  supported  them  and  how  they  have  per¬ 
formed  electorally,  and  the  role  they  have  played 
in  the  party  systems  that  they  have  been  part  of. 

Political  Principles 

Conservative  parties  have,  generally,  been  reluctant 
to  call  themselves  such,  preferring  more  inclusive 
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titles  such  as  “People’s  Party”  and  sometimes  even 
eschewing  “party”  as  well.  The  reason  is  that 
“party”  and  “conservatism”  implied  that  they 
sought  to  make  a  case  for  a  specific  point  of  view  or 
interest,  whereas  they  saw  themselves  as  defending 
a  way  of  life,  society  being  viewed  as  an  organic 
whole — and  from  their  point  of  view,  this  was  not 
something  they  ought  to  have  been  obliged  to  argue 
over  in  the  first  place.  From  this  outlook  stemmed 
their  professions  of  antipathy  to  “ideology” — the 
product  of  reason — with  its  capacity  to  disarticulate 
a  body  politic  whose  parts  were  integrated,  not  by 
design  but  thanks  to  the  accumulated  wisdom  of  the 
ages.  This  did  not  mean  that  the  conservative  posi¬ 
tion  was  one  of  opposition  to  change  of  whatever 
kind:  On  the  contrary,  societies,  like  organisms,  had 
to  adjust  to  new  circumstances  if  they  were  to  sur¬ 
vive.  But  it  meant  that  change  was  to  be  adaptive, 
not  revolutionary. 

The  role  of  change  in  the  conservative  outlook 
has  meant  that  over  the  years  its  agendas  and  pro¬ 
grams  have — paradoxically — altered  much  more 
radically  than  have  liberal  and  socialist  agendas 
and  programs  with  their  greater  attachment  to 
abiding  principles.  Of  the  changes  and  trends  that 
required  this  alteration,  the  two  most  significant 
following  the  conservative  parties’  emergence 
were  the  spread  of  industrialism  and  extension  of 
the  franchise.  The  first  brought  about,  in  many 
instances,  a  gradual  reconciliation  and  integration 
of  the  interests  of  the  landed  elite  with  those  of  the 
rising  bourgeoisie — the  successful  pursuit  of  whose 
activities  required  priority  to  be  given  to  the  quint  - 
essentially  liberal  principles  of  due  process,  the 
rule  of  law,  and  meritocracy.  In  such  instances, 
the  principles  became  conservative  ones  as  well. 
The  second  trend  threatened  to  render  conserva¬ 
tism  unappealing  on  account  of  its  traditional 
opposition  to  democracy — but  also  offered  the 
means  to  its  survival.  Conservatives’  defense  of 
the  established  order  had  always  entailed  a  pater¬ 
nalistic  sensitivity  to  the  legitimate  concerns  of 
ordinary  people  and  the  poor.  The  mass  franchise 
now  enabled  and  required  a  renewed  emphasis  on 
this  social  reform  aspect  of  the  conservative 
agenda. 

A  third  significant  trend  was  the  growth  of 
nationalism.  Originally  popular  and  democratic,  it 
was  not  therefore  a  distinctively  conservative  prin¬ 
ciple.  However,  as  it  increasingly  became  an 


accomplished  fact — through  the  unification  of 
Italy  and  Germany  and  similar  events — it  offered  a 
means  of  countering  the  most  significant  threat 
posed  by  the  advent  of  the  mass  franchise:  the 
growth  of  socialist  parties  and  ideologies  of  inter¬ 
national  workers’  solidarity. 

The  final  major  adjustment  in  conservative 
thinking  took  place  in  the  period  after  World  War 
II.  Initially,  the  collapse  of  fascism  had  brought  a 
“technocratic”  kind  of  conservatism.  A  social- 
democratic  hegemony  based  around  the  principles 
of  the  welfare  state  and  Keynesian  demand  man¬ 
agement  had  gone  hand  in  hand  with  the  growth  of 
so-called  catchall  parties  seeking  votes  wherever 
they  could  find  them  and  bending  principles  and 
ideology  to  that  end.  Under  these  circumstances, 
conservatives  found  the  distinctiveness  of  their  out¬ 
look  increasingly  difficult  to  convey,  seeming  to 
many  to  stand  for  little  more  than  a  claim  to  be  able 
to  manage  politics  more  competently  than  their 
rivals.  Later,  the  increasing  difficulties  encountered 
by  social-democratic  politics  in  the  economic 
sphere,  coupled  with  the  growth  in  the  social 
sphere  of  new  movements  pressing  the  demands  of 
environmentalists,  peace  activists,  feminists,  gays, 
and  ethnic  minorities,  provided  conservatives  with 
a  renewed  opportunity  to  press  a  distinctive  agenda. 
This  gave  the  pride  of  place  not  only  to  the  old 
liberal  principles  of  free  markets  and  the  minimal 
state  but  also,  in  opposition  to  the  left  libertarian¬ 
ism  of  the  new  social  movements,  to  the  strong 
state:  a  state  that  would  enforce  free  markets 
through  firmness  in  welfare  and  trade  union  mat¬ 
ters  and  robustly  defend  laws  embodying  tradi¬ 
tional  social  values. 

Organization 

In  most  cases,  conservative  parties  originated  as 
informal  groups  within  predemocratic  legislatures, 
only  later,  with  extensions  of  the  franchise,  develop¬ 
ing  formally  organized  extraparliamentary  struc¬ 
tures  covering  the  national  territory.  Even  then,  in 
many  instances,  the  degree  of  formality  of  these 
structures  for  long  remained  incomplete  and  the 
real  power  acquired  by  them  limited.  The  British 
Conservative  Party,  for  example,  did  not  have  indi¬ 
vidual  members  before  the  20th  century,  and  until 
the  1970s,  its  local  associations  not  infrequently 
refrained  from  fielding  candidates  in  municipal  and 
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county  council  elections:  giving,  in  such  instances, 
a  free  run  to  “Independents,”  who  were 
Conservatives  at  the  national  level.  This  was  a  con¬ 
venient  way  of  ensuring  that  local  influence 
remained  in  the  hands  of  preexisting  Conservative 
elites  and  did  not  need  to  be  shared  with  less  privi¬ 
leged  groups  in  the  party.  Nowadays,  there  is  little 
that  is  especially  distinctive  about  conservative  par¬ 
ties  from  an  organizational  point  of  view.  They 
differ  between  themselves  depending  on  the  mix  of 
resources — money,  activists,  charismatic  leader¬ 
ship,  and  so  on — available  to  enable  them  to  com¬ 
pete.  Like  parties  generally,  their  penetration  of 
society  has  declined  in  the  postwar  period,  with 
internal  distributions  of  power  increasingly  con¬ 
solidated  in  the  hands  of  elected  officials,  progres¬ 
sively  more  important,  with  the  spread  of  the 
media,  for  mobilizing  a  mass  electorate. 

Social  Base  and  Electoral  Performance 

Generally  speaking,  the  likelihood  of  voting  for  a 
conservative  party  increases  with  religious  obser¬ 
vance  and  the  belief  that  religion  is  important, 
with  high  socioeconomic  status,  with  age,  and 
with  a  view  of  oneself  as  being  located  on  the  right 
of  the  left-right  spectrum.  However,  generalizing 
about  the  precise  impact  of  variables  such  as  these 
is  impossible  for  two  sets  of  reasons,  the  first  being 
the  influence  of  context  in  mediating  their  impact. 
For  example,  if  in  Britain  the  Conservatives  have 
traditionally  attracted  very  high  levels  of  support 
among  farmers,  the  same  cannot  be  said  of 
Conservatives  in  places  such  as  Norway  and 
Sweden,  whose  party  systems,  from  the  1920s 
onward,  included  influential  agrarian  parties. 
Second,  there  is  the  problem  of  the  choice  of  par¬ 
ties  to  include.  Frequently  in  comparative  analyses, 
conservative  parties  have  been  grouped  together 
with  other  parties  of  the  moderate  right,  such  as 
Christian  democratic  parties  with  which  they  have 
sometimes  shared  important  features,  for  example, 
size  and  significance  in  the  party  system,  ideologi¬ 
cal  flexibility,  and  the  conviction  that,  in  standing 
for  the  interests  of  the  entire  nation,  theirs  is  more 
than  a  political  point  of  view  on  a  par  with  the 
rest.  The  choice  makes  a  significant  difference. 
Comparing  the  British  Conservative  Party  with  the 
pre- 1990s  Italian  Christian  Democrats,  for  exam¬ 
ple,  would  reveal  not  similarities  but  enormous 
differences,  both  quantitative  and  qualitative,  in 


the  impact  of  religious  observance  and  left-right 
self-placement  on  propensities  to  vote. 

In  few  other  places  in  the  postwar  period  have 
conservatives  matched  the  electoral  success  of  the 
British  Conservative  Party.  Again,  much  depends 
on  the  choice  of  specific  parties  to  include,  with 
particular  controversy  surrounding  the  status  of 
the  U.S.  Republicans  and  Ireland’s  Fianna  Fail,  but 
if  we  include  only  the  uncontroversial  cases,  then 
the  British  Conservatives’  only  real  rival  in  Western 
Europe  is  Greece’s  New  Democracy  (which  has 
averaged  43.9%  of  the  vote  in  parliamentary  elec¬ 
tions  since  1974  as  compared  with  37.8%  for  the 
Conservatives).  Elsewhere  in  Western  Europe,  the 
best  average  performance  has  been  by  Luxembourg’s 
Christian  Social  People’s  Party  (with  an  average  of 
32.3%  since  1974).  In  some  countries,  conserva¬ 
tive  parties  are  essentially  nonexistent:  the  contrast 
with  countries  where  they  are  strong  being  due  to 
differences  of  party-system  context  inherited  from 
the  past. 

Role  in  Party  Systems 

In  Britain,  the  arrival  of  democratic  politics  enabled 
the  conservatives  to  consolidate  a  position  as  one 
of  two  large  parties  by  virtue  of  the  relative  insig¬ 
nificance  of  three  of  the  major  social  cleavages — 
that  is,  center-periphery,  state-church,  land-indus¬ 
try — which  elsewhere  spawned  more  complex 
party  systems;  and  by  virtue  of  the  fact  that  exten¬ 
sions  of  the  franchise  had  begun  early  and  taken 
place  over  a  long  period  of  time,  enabling  the  party 
to  adapt  successfully  to  the  emergence  of  the  fourth 
major  cleavage,  that  between  owners  and  workers. 

In  other  major  European  countries — Germany 
and  Italy — the  party  systems  supporting  the  reemer¬ 
gence  of  democratic  politics  after  World  War  II 
could  find  no  place  for  conservative  parties  because 
of  the  experience  of  Nazism  and  fascism:  This  made 
it  impossible  for  anyone  to  espouse  important  con¬ 
servative  themes  such  as  nation,  tradition,  order, 
and  hierarchy  without  arousing  painful  memories 
and  the  suspicion  that  they  harbored  extreme  right- 
wing  sympathies.  Consequently,  in  these  countries 
the  role  of  principal  party  of  the  center-right  came 
to  be  played  by  a  Christian  democratic  party. 

In  France,  conservatism,  though  present,  was 
able  to  make  less  headway  than  in  Britain,  for  the 
country’s  deep  divisions — between  church  and  state; 
between  monarchists,  parliamentarians,  and 
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Bonapartists;  between  land  and  industry;  and 
between  owners  and  workers — gave  it  a  highly  frag¬ 
mented  party  system  within  which  conservatives 
had  many  significant  rivals.  Meanwhile,  the  local¬ 
ism  of  party  politics  and  the  two-ballot  system 
inhibited  the  development  of  cohesive  national 
parties — and,  thus,  the  emergence  of  conservatism 
as  a  unified  force.  Though  the  constitutional  over¬ 
haul  that  ushered  in  the  Fifth  Republic  in  1958 
brought  an  initial  decline  in  polarization,  a  degree 
of  party-system  fluidity  continued,  and  today  at 
least  two  significantly  sized  entities — the  Union  for 
French  Democracy  and  the  Union  for  a  Popular 
Movement — themselves  coalitions,  can  claim  to 
provide  a  home  to  conservatives. 

At  the  European  level,  the  main  transnational 
party  organization  embracing  conservative  parties  is 
the  European  People’s  Party.  This  also  encompasses 
Christian  Democratic  parties  and  the  internal  life  of 
the  organization  is  not  without  tensions  between 
pro-European  (mainly  Christian  Democratic)  and 
Eurosceptic  (mainly  British  Conservative)  members. 
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Constitutional  Engineering 


Constitutional  engineering  refers,  generally,  to  the 
process  by  which  political  actors  devise  higher  law, 


which  is  usually — but  not  always — specified  in  a 
formal  written  document  and  labeled  the  “constitu¬ 
tion.”  Constitutional  design  is  a  close  synonym. 
Any  particular  instance  of  constitutional  engineer¬ 
ing  must  deal  with  certain  basic  questions  of  orga¬ 
nization  and  process.  These  include  designating 
who  is  to  be  involved,  when  that  involvement  takes 
place,  and  how  the  actors  are  to  proceed  in  formu¬ 
lating,  discussing,  and  approving  a  text.  Although 
there  are  conceivably  as  many  variants  in  the  pro¬ 
cess  as  there  are  constitutions,  several  common 
patterns  emerge.  This  entry  describes  the  factors 
that  distinguish  instances  of  constitutional  engi¬ 
neering  and  some  of  the  typical  patterns. 

Stages 

Constitution  making  occurs  in  discernible  stages, 
some  of  which  resemble  an  ordinary  legislative 
process  familiar  to  many  drafters  in  consolidated 
democracies.  A  schematic  design  of  these  phases 
might  include,  in  sequential  order,  the  mobiliza¬ 
tion  of  interests  (and  counterinterests),  drafting, 
consultation,  deliberation,  adoption,  and  ratifica¬ 
tion.  These  different  stages  interact  with  the  pos¬ 
sible  actors  who  might  fill  the  roles  to  create  a 
matrix  of  options  for  designers.  Afghanistan’s 
constitution  of  2004,  for  example,  was  drafted  in 
relative  secrecy  by  a  commission  with  foreign 
advice  and  then  sent  to  the  president’s  office 
before  deliberation  and  adoption  at  an  inclusive 
constituent  assembly,  the  Loya  Jirga.  In  this 
model — which  appears  to  be  relatively  common — 
each  stage  is  potentially  consequential,  although  it 
is  likely  that  inertial  forces  and  the  power  of 
agenda  setting  will  apportion  disproportionate 
influence  to  actors  involved  at  earlier  stages.  Still, 
it  is  quite  possible  that  early-stage  actors  will 
anticipate  the  preferences  and  needs  of  later  stage 
actors,  thus  mitigating  any  sequence  effects.  Jon 
Elster  has  introduced  the  vivid  distinction  between 
upstream  and  downstream  constraints  in  the  pro¬ 
cess:  Upstream  constraints  are  imposed  by  the 
powers  setting  up  the  constitution-drafting  body, 
whereas  downstream  constraints  result  from  the 
anticipation  of  preferences  of  those  involved  in 
later  stages.  Ratification  by  public  referendum,  for 
example,  is  a  downstream  constraint  that  can 
hamstring  leaders  in  an  earlier  stage  who  recog¬ 
nize  that  their  document  must  ultimately  obtain 
public  approval. 
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Actors 

Perhaps  the  most  critical  variable  in  constitution 
making  has  to  do  with  which  actors  are  included 
in  the  process.  Actors  involved  in  constitution 
making  can  include  expert  commissions,  legislative 
bodies  or  committees,  the  executive,  the  judiciary, 
national  conferences,  elite  roundtables,  transi¬ 
tional  legislatures,  specially  elected  constituent 
assemblies,  interest  groups  and  nongovernmental 
organizations,  foreign  advisors,  and  the  public 
itself.  Public  involvement  has  become  the  subject 
of  particular  attention  in  recent  years  and  is  urged 
by  scholars,  governments,  and  international  orga¬ 
nizations.  But  not  all  constitutions  involve  the 
public,  and  some  are  drafted  by  a  handful  of  lead¬ 
ers  behind  closed  doors. 

Certainly,  a  central  dimension  on  which  consti¬ 
tution-making  processes  differ  is  the  degree  of 
public  participation.  Because  the  constitution  is 
the  highest  level  of  lawmaking  and  provides  the 
ultimate  rule  of  recognition  for  lawmaking  pro¬ 
cesses,  it  arguably  requires  the  greatest  possible 
level  of  legitimation  in  democratic  theory.  In  an 
ideal  world,  one  might  desire  universal  consent 
over  the  rules  of  society,  a  standard  that  is  obvi¬ 
ously  impractical.  Higher  levels  of  participation 
are  presumed  to  function  like  supermajority  rules, 
restricting  the  adoption  of  undesirable  institutions 
and  protecting  prospective  minorities  in  the  demo¬ 
cratic  processes  that  are  established.  Participation 
thus  legitimates  and  constrains,  substituting  inclu¬ 
sive  processes  for  consent  to  make  effective  gov¬ 
ernment  possible. 

The  modal  form  of  participation  in  constitu¬ 
tional  design  is  the  power  to  approve  the  charter, 
usually  by  referendum  on  the  final  document  as  a 
whole.  Available  data  suggest  a  significant  trend, 
beginning  in  the  early  20th  century,  toward  public 
ratification.  Approval  by  referendum  may  be  an 
increasingly  popular  mode  of  public  involvement, 
but  it  is  clearly  a  limited  one  in  that  it  involves  only 
an  up  or  down  vote  over  a  package  of  provisions. 
Since  at  least  World  War  II,  however,  participation 
in  constitutional  design  has  become  more  direct 
and  has  penetrated  more  deeply  (or  at  least  earlier) 
in  the  process.  One  common  approach  is  to  involve 
the  public  in  selecting  those  who  will  draft  or 
deliberate  over  aspects  of  the  charter.  This  sort  of 
voice  is  possible  whether  the  representative  group 
is  a  constituent  assembly  elected  expressly  for  the 


purpose  or  is  a  regular  legislature  that  takes  on  the 
project  in  addition  to  other  duties.  Some  constitu¬ 
tional  processes  have  experimented  with  more 
bottom-up  methods  of  direct  democracy,  such  as 
the  citizen  initiative,  in  which  ideas  can  bubble  up 
from  civil  society. 

Still  another  mode  of  participation  involves 
direct  consultation  with  the  public  or  representa¬ 
tive  groups  at  various  stages,  which  might  occur 
before,  during,  or  after  the  drafting  of  the  initial 
text.  The  drafting  phase  seems  to  be  especially 
crucial  because  we  can  expect  a  fair  degree  of  iner¬ 
tia  in  the  later  stages  of  the  process.  But  the  phase 
is  also  likely  to  be  the  least  participatory,  given  the 
challenges  of  writing-by-committee,  much  less 
writing-by-nation.  Indeed,  in  some  well-known 
cases,  the  public  is  excluded  from  the  drafting  pro¬ 
cess  and  not  consulted  at  all. 

Of  course,  actors  and  their  accompanying  con¬ 
straints  may  come  from  outside,  as  well  as  inside, 
a  state’s  borders.  An  extreme  case  is  that  of  the 
“occupation  constitution,”  a  document  drafted 
when  a  country  is  under  the  control  of  a  foreign 
military  power.  Such  constitutions  are  usually  pre¬ 
sumed  to  have  less  involvement  on  the  part  of  local 
actors  and  hence  to  be  less  legitimate.  Some  schol¬ 
ars  believe  that  international  involvement  creates 
disincentives  to  enforce  the  constitution  locally,  as 
actors  will  strategically  acquiesce  to  conditions 
they  have  no  intention  of  fulfilling  simply  to 
remove  external  oversight. 

Of  course,  international  constraints  on  constitu¬ 
tion  making  can  range  in  their  intensity  and  degree 
of  coordination,  from  borrowing  to  imposition. 
External  influence  need  not  be  as  blatant  as  in 
occupation  constitutions.  Constitutional  drafting 
that  occurs  concurrently  with  peace  negotiations 
often  attracts  international  advisers  and  interests, 
be  they  donors,  creditors,  interested  states,  or  the 
United  Nations.  The  prospect  of  future  member¬ 
ship  in  the  European  Union,  for  example,  led  some 
Eastern  European  countries  to  make  modifications 
to  their  draft  constitutions  at  the  behest  of  the 
Council  of  Europe.  Many  accounts  of  foreign  bor¬ 
rowing  point  to  the  decisive  role  of  influential 
consultants  who  appear  to  be  part  of  a  cottage 
industry  of  constitutional  advisors. 

The  voluminous  literature  on  policy  diffusion 
reminds  us  that  any  sort  of  policy  or  institutional 
reform  will  be  a  highly  interdependent  process. 
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Constitution  making — often  undertaken  during 
moments  of  crisis  when  states  are  at  their  most 
amenable  to  foreign  models  and  suggestions — may 
be  especially  interdependent  and  networked. 
Certainly,  scholars  have  long  noted  a  high  degree 
of  similarity  across  documents,  and  nearly  anyone 
privy  to  the  details  of  a  case  of  constitution  making 
can  recount  an  episode  of  international  borrowing. 
The  persistence  of  presidentialism  in  Latin  America, 
the  use  of  French  and  Westminster  models  of  gov¬ 
ernment  in  former  colonies,  and  the  recent  use  of 
national  conferences  in  Francophone  Africa  are  all 
examples  of  diffusion  that  occurs  at  a  subglobal 
level.  Given  the  persistent  centrality  of  the  U.S. 
Constitution  to  the  American  legal  academy,  there 
has  been  a  fair  amount  of  interest  in  documenting 
its  influence  over  the  years,  but  other  constitu¬ 
tional  models  have  also  clearly  had  some  impact. 

Other  Conditions 

There  are  potentially  consequential  aspects  of  pro¬ 
cess  other  than  the  identity  of  the  actors  involved. 
Some  constraints  reflect  the  circumstances  that 
lead  to  constitution  making  in  the  first  place.  The 
conventional  wisdom  is  that  constitution  making 
is  coincident  with  a  cataclysmic  event  of  some 
kind,  such  as  war,  coup,  economic  crisis,  or  revo¬ 
lution.  In  fact,  some  evidence  suggests  that, 
although  crises  do  frequently  precede  constitu¬ 
tional  reform,  the  degree  of  noncrisis  constitution 
making  is  probably  underestimated.  Sweden’s 
1972  reform  of  its  163-year-old  constitution  is  a 
prominent  example  of  crisis-free  reform.  The  vari¬ 
ous  socialist  constitutions,  such  as  those  in  the 
Soviet  Union  (1936,  1977)  and  China  (1982), 
seem  to  follow  the  installation  of  new  leaders,  a 
practice  that  was  often  justified  by  the  Marxist 
view  of  evolution  in  stages.  These  different 
patterns,  reflecting  various  degrees  of  crisis  or 
continuity,  will  affect  the  process,  creating  an 
atmosphere  of  either  urgency  or  deliberation. 

The  process  can  also  vary  in  terms  of  time 
involved.  At  one  extreme,  the  secretive  process  that 
led  to  Myanmar’s  2008  constitution  took  17  years. 
At  the  other  extreme,  a  small  group  of  American 
bureaucrats  working  for  the  occupation  authorities 
drafted  the  basic  form  of  Japan’s  1946  constitution 
in  a  little  over  a  week,  and  the  entire  process, 
including  elections,  legislative  deliberation,  and 


approval  by  the  emperor,  took  a  mere  8  months. 
Both  of  these  cases  are  distant  outliers.  On  aver¬ 
age,  constitution  making  takes  around  16  months. 
Anecdotally,  constitution-making  processes  involv¬ 
ing  either  a  very  short  or  very  long  amount  of  time 
seem  to  occur  in  nondemocracies.  Speedy  pro¬ 
cesses  do  not  allow  sufficient  time  for  mobilization 
of  the  public  and  civil  society,  whereas  extended 
processes  are  unlikely  to  hold  public  attention  for 
the  duration. 

Another  dimension  on  which  processes  differ  is 
the  size  of  the  deliberative  body,  an  issue  that  has 
also  plagued  those  who  design  legislatures.  The 
concern  is  that  large  bodies — which  have  the 
advantages  of  minimizing  deal  making  and  assur¬ 
ing  representativeness — can  be  unwieldy  and  lead 
to  collective  action  problems. 

Constitutions  and  their  design  have  long  been 
central  to  the  work  of  political  scientists  since  at 
least  Aristotle.  For  whatever  reason,  however,  the 
scientific  study  of  the  rules  that  govern  the  process 
of  constitutional  engineering  has  lagged.  This  state 
of  affairs  is  regrettable  but,  happily,  quite  remedi¬ 
able.  Processes  of  constitutional  design  and  adop¬ 
tion  vary  widely  along  a  number  of  identifiable 
dimensions.  Speculation  about  the  effects  of  these 
differences  runs  rampant  and  evidence  will 
undoubtedly  be  close  behind. 

Zachary  Elkins 
University  of  Texas  at  Austin 
Austin,  Texas,  United  States 
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Constitutionalism 


Constitutionalism  is  sometimes  regarded  as  a  syn¬ 
onym  for  limited  government.  On  some  accounts, 
this  doctrine  is  associated  in  turn  with  minimal  or 
less  government.  But  that  is  only  one  interpretation 
and  by  no  means  the  most  prominent  historically. 
A  more  representative  general  definition  would  be 
that  constitutionalism  seeks  to  prevent  arbitrary 
government.  At  its  most  generic  level,  arbitrariness 
consists  in  the  capacity  of  rulers  to  govern  will¬ 
fully — that  is,  with  complete  discretion — and  to 
serve  their  own  interests  rather  than  those  of  the 
ruled.  Constitutionalism  attempts  to  avoid  these 
dangers  by  designing  mechanisms  that  determine 
who  can  rule,  how,  and  for  what  purposes. 
However,  constitutional  traditions  differ  as  to 
what  precisely  counts  as  an  arbitrary  act  and 
which  mechanisms  offer  the  best  defense  against 
their  occurring.  The  classical,  neo-republican  tradi¬ 
tion  of  political  constitutionalism  identifies  arbi¬ 
trariness  with  domination  of  the  ruled  by  their 
rulers  and  seeks  to  avoid  it  by  establishing  a  condi¬ 
tion  of  political  equality  characterized  by  a  balance 
of  power  between  all  the  relevant  groups  and  par¬ 
ties  within  a  polity,  so  that  no  one  can  rule  without 
consulting  the  interests  of  the  ruled.  The  more 
modern,  liberal  tradition  identifies  arbitrariness 
with  interference  with  individual  rights  and  seeks 
to  establish  protections  for  them  via  the  separation 
of  powers  and  a  judicially  protected  constitution. 

Both  traditions  are  present  within  most  democ¬ 
racies  and  can  be  found  side  by  side  in  many  con¬ 
stitutions.  The  first  tradition  focuses  on  the  design 
and  functioning  of  the  democratic  process,  includ¬ 
ing  the  selection  of  electoral  systems  and  the  choice 
between  presidential  or  parliamentary  forms  of 
government,  of  unitary  or  federal  arrangements, 


and  of  unicameralism  or  bicameralism.  Although 
the  detailing  of  these  procedural  mechanisms  and 
the  relations  between  them  usually  forms  the  bulk 
of  most  constitutional  documents,  their  constitu¬ 
tional  importance  has  come  to  be  eclipsed — in 
legal  circles  particularly — by  the  second  tradition. 
This  view  emphasizes  the  specification  and  judicial 
protection  of  the  different  competences  of  the 
political  system  and  of  constitutionally  entrenched 
rights  by  a  constitutional  court.  Political  theorists 
and  scientists  disagree,  however,  on  whether  these 
two  traditions  are  complementary,  mutually 
entailed,  or  incompatible.  The  second  is  often  seen 
as  necessary  to  ensure  the  fairness  of  the  proce¬ 
dures  and/or  the  outcomes  of  the  first.  Yet  it  lays 
itself  open  in  turn  to  doubts  that  courts  are,  or 
could  ever  be,  truly  bound  by  constitutions  so  that 
law  rather  than  judges  rule  and  if  so  whether  judi¬ 
cial  processes  are  not  more  arbitrary  and  prone  to 
error  for  deciding  constitutional  outcomes  than 
the  democratic  procedures  and  outcomes  they  are 
often  thought  legitimately  to  limit.  In  the  following 
sections,  this  entry  traces  these  two  traditions  and 
then  turns  to  exploring  their  respective  advantages 
and  disadvantages  and  any  tensions  and  comple¬ 
mentarities  that  exist  between  them. 

Two  Traditions  of  Constitutionalism 

Political  Constitutionalism:  From  Mixed 
Government  to  Representative  Democracy 

The  theory  of  mixed  government  originated  with 
ancient  thought  and  the  classification  of  political 
systems  on  the  basis  of  whether  one,  a  few,  or  many 
ruled.  According  to  this  theory,  the  three  basic  types 
of  polity — monarchy,  aristocracy,  and  democracy — 
were  liable  to  degenerate  into  tyranny,  oligarchy, 
and  anarchy,  respectively.  This  corruption  stemmed 
from  the  concentration  of  power  in  the  hands  of  a 
single  person  or  group,  which  created  a  temptation 
to  its  abuse  through  allowing  arbitrary  rule.  The 
solution  was  to  ensure  moderation  and  proportion 
by  combining  or  mixing  various  types.  As  a  result, 
the  virtues  of  each  form  of  government,  namely,  a 
strong  executive,  the  involvement  of  the  better  ele¬ 
ments  of  society,  and  popular  legitimacy,  could  be 
obtained  without  the  corresponding  vices. 

Three  elements  underlie  this  classic  theory  of 
mixed  government.  First,  arbitrary  power  was 
defined  as  the  capacity  of  one  individual  or  group 
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to  dominate  another — that  is,  to  possess  the  ability 
to  rule  them  without  consulting  their  interests.  To 
be  dominated  in  such  an  arbitrary  way  was  to  be 
reduced  to  the  condition  of  a  slave  who  must  act 
as  his  or  her  master  wills.  Overcoming  arbitrari¬ 
ness  so  conceived  requires  that  a  condition  of 
political  equality  exist  among  all  free  citizens. 
Only  then  will  no  one  person  or  group  be  able  to 
think  or  act  as  the  masters  of  others.  Second,  the 
means  to  minimize  such  domination  was  to  ensure 
none  could  rule  without  the  support  of  at  least  one 
other  individual  or  body.  The  aim  was  to  so  mix 
social  classes  and  factions  in  decision  making  to 
ensure  that  their  interests  were  given  equal  consid¬ 
eration,  with  each  being  forced  to  “hear  the  other 
side.”  To  quote  another  republican  motto,  “The 
price  of  liberty  is  eternal  vigilance,”  with  each 
group  watching  over  the  others  to  ensure  none 
dominated  them  by  ignoring  their  concerns.  Third, 
the  balance  to  be  achieved  was  one  that  aspired  to 
harmonize  different  social  interests  and  maintain 
the  stability  of  the  polity,  preventing  so  far  as  was 
possible  the  inevitable  degeneration  into  one  of  the 
corrupt  forms  of  government. 

Thus,  mixed  government  provides  a  model  of 
constitutionalism  according  to  the  institutions  that 
structure  the  way  decisions  are  taken.  Although  ele¬ 
ments  of  the  theory  can  be  found  in  Aristotle’s 
Politics,  the  locus  classicus  is  Book  VI  of  Polybius’s 
Histories.  He  underlined  its  prime  purpose  as  pro¬ 
viding  mechanisms  whereby  no  individual,  body, 
or  group  could  rule  alone,  thereby  curbing  the 
descent  into  tyranny,  oligarchy,  or  anarchy. 
Polybius  regarded  the  republican  constitution  of 
ancient  Rome  as  exemplifying  this  theory.  Thus, 
the  consuls  provided  the  monarchical  element,  the 
Senate  provided  the  aristocratic,  while  the  popular 
element  was  represented  by  the  Tribunes  of  the 
People,  the  Plebeian  Council,  and  the  electoral, 
judicial,  and  legislative  powers  the  people  could 
exercise  directly.  As  he  noted,  the  key  feature  of 
Roman  republican  government  was  that  each  of 
these  three  groups  exercised  slightly  different  pow¬ 
ers  but  required  the  cooperation  of  the  others  to  do 
so.  So  consuls  might  exercise  war  powers,  yet  they 
needed  the  Senate  to  approve  generals,  reward 
them,  and  provide  the  necessary  funds,  while  the 
people  approved  treaties  and  could  try  high  offi¬ 
cials  and  generals  for  misconduct.  Meanwhile,  the 
more-executive  roles  possessing  the  most  discretion 


were  further  weakened  by  their  power  being  shared 
among  multiple  officeholders  and  its  being  depen¬ 
dent  on  elections  and  of  short  duration.  Thus,  there 
were  two  consuls,  each  able  to  veto  the  other’s 
decisions,  10  tribunes  with  similar  countervailing 
powers,  and  so  on,  with  none  able  to  hold  office  for 
more  than  a  year. 

The  resulting  need  for  different  groups  to  work 
together  was  summarized  in  the  slogan  Senatus 
Populusque  Romanus  (The  Senate  and  the  Roman 
People,  frequently  abbreviated  to  SPQR).  In  real¬ 
ity,  though,  their  relationship  was  far  from  harmo¬ 
nious,  with  the  patrician  element  largely  predomi¬ 
nating,  except  when  factional  disputes  led  a  given 
group  among  them  to  seek  the  support  of  the  ple¬ 
beians.  The  conflict  between  social  classes  was 
given  greater  emphasis  by  Machiavelli,  who  offered 
a  radical  version  of  the  Polybian  argument  in  his 
Discorsi.  He  observed  how  all  polities  contain  two 
classes,  the  nobles  (grandi)  and  the  people  (popolo), 
whose  desires  conflict.  However,  he  claimed  that 
their  discord,  far  from  being  destructive,  actively 
promoted  “all  the  laws  made  in  favour  of  lib¬ 
erty” — each  was  led  to  promote  freedom  by  virtue 
of  seeking  ways  of  checking  the  arbitrary  power  of 
the  other.  However,  like  Polybius,  Machiavelli 
believed  that  all  systems  ultimately  became  corrupt 
and  degenerated  into  either  tyranny  or  anarchy — 
the  balance  of  power  merely  served  to  stave  off  this 
inevitable  cycle. 

The  17th  and  18th  centuries  brought  three  main 
changes  to  the  doctrine.  The  first,  explored  below, 
was  the  development  of  the  separation  of  powers 
as  a  variation  on  the  doctrine  of  mixed  govern¬ 
ment.  The  theory  of  mixed  government  involves 
no  clear  distinction  between  the  different  branches 
of  government.  Executive,  legislative,  and  espe¬ 
cially  judicial  tasks  were  shared  between  the  differ¬ 
ent  social  classes  and  exercised  by  all  the  govern¬ 
ment  bodies.  Indeed,  the  popular  element  exercised 
certain  legislative  and  judicial  functions  directly 
through  plebiscites  and  as  jurors.  The  second 
change  was  in  the  type  of  “balance”  mixed  govern¬ 
ment  was  supposed  to  achieve.  The  classic  theory 
took  the  idea  of  the  “body”  politic  literally.  Just  as 
bodily  health  was  said  to  rely  on  a  sound  physical 
constitution  and  a  balanced  diet  and  way  of  life,  so 
the  health  of  the  polity  depended  on  a  sound  con¬ 
stitution  that  achieved  a  “natural”  balance  between 
the  various  organs  and  “humors”  of  the  political 
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body.  As  we  saw,  in  line  with  this  organic  imagery, 
the  aim  was  to  hold  off  the  inevitable  degeneration 
and  corruption  of  the  system.  Balance  was  a  static 
equilibrium,  designed  to  maintain  the  status  quo. 
However,  the  17th  and  18th  centuries  saw  a  new, 
more  dynamic  notion  of  balance,  inspired  by 
Newtonian  physics  and  based  on  mechanics  and 
physical  forces.  In  this  conception,  balance  could 
involve  a  harnessing  of  opposed  forces,  holding 
them  in  a  dynamic  equilibrium  that  combined  and 
increased  their  joint  power.  The  change  can  be 
seen  in  the  notion  of  the  “balance  of  trade,”  which 
went  from  being  an  equal  exchange  of  goods 
between  states  to  become  a  competition  between 
trading  nations  that  encouraged  their  mutual  pro¬ 
ductivity  and  innovation.  In  this  account,  the 
“cycle  of  life,”  where  growth  was  followed  by 
decay,  became  replaced  by  the  idea  of  progress,  in 
which  change  and  transformation  had  positive 
connotations. 

The  third  development  drew  on  these  two.  This 
was  the  idea  that  political  balance  now  consisted 
in  the  competition  between  government  and  a 
“loyal”  opposition.  As  parties  evolved  from  simple 
factions  and  patronage  networks  among  rivals  for 
office,  and  became  electoral  machines  defined  as 
much  by  ideology  and  social  composition  as  by  the 
personal  ambitions  and  interests  of  the  political 
class,  they  became  the  organs  of  this  new  type  of 
balance.  In  keeping  with  the  older  theory  of  mixed 
government,  one  of  the  virtues  of  parties  was  their 
ability  to  mix  different  social  classes  and  interests 
and  combine  them  around  a  common  program. 
Indeed,  just  as  economic  competition  led  rival 
firms  to  compete  over  price,  innovate,  and  explore 
untapped  markets,  so  electoral  competition  led 
rival  parties  to  compete  over  policy  efficiency  and 
effectiveness,  devise  novel  forms  of  delivery,  and 
focus  on  areas  appealing  to  different  sections  of 
the  electorate.  This  modern  form  of  political  con¬ 
stitutionalism  has  proven  constitutional  in  both 
form  and  substance.  Equal  votes,  majority  rule, 
and  competitive  party  elections  offer  a  mechanism 
for  impartially  and  equitably  weighing  and  com¬ 
bining  the  views  of  millions  of  citizens  about  the 
nature  of  the  public  good.  And  in  making  politi¬ 
cians  popularly  accountable,  it  gives  them  an 
incentive  to  rule  in  nonarbitrary  ways  that  respond 
to  the  concerns  of  the  different  minorities  that 
form  any  working  majority,  thereby  upholding 


both  rights  and  the  public  interest  rather  than  their 
own  interests. 

Meanwhile,  mixed  government  has  developed 
in  new  ways  through  federal  and  convocational 
arrangements  that  likewise  seek  to  ensure  that  dif¬ 
ferent  kinds  of  interest  are  involved  in  the  policy- 
and  lawmaking  processes  on  an  equal  basis.  Yet 
nobody  would  deny  that  the  systems  of  most 
democracies  are  far  from  perfect,  and  it  has 
become  increasingly  common  to  look  to  other  con¬ 
stitutional  traditions  to  rectify  these  problems. 

Legal  Constitutionalism:  From  the  Separation 
of  Powers  to  Rights  and  Judicial  Review 

According  to  Article  16  of  the  French  Declaration 
of  the  Rights  of  Man  and  of  the  Citizen  of  1789, 
“A  society  where  rights  are  not  secured  or  the 
separation  of  powers  established  has  no  constitu¬ 
tion  at  all.”  Though  widely  accepted  today,  this 
view  was  novel  at  the  time,  shaped  by  the  experi¬ 
ence  of  the  English,  American,  and  French  revolu¬ 
tions.  The  separation  of  powers  developed  out  of 
the  theory  of  mixed  government  during  the  English 
civil  war  of  the  mid-17th  century.  In  1642,  Charles 
I  belatedly  invoked  the  doctrine  of  mixed  govern¬ 
ment  to  defend  the  joint  rule  of  Monarch,  Lords, 
and  Commons  as  implied  by  the  notion  that 
Parliament  meant  all  three  (the  doctrine  of  “King 
in  Parliament”  as  the  sovereign  body  of  the  realm). 
His  execution  posed  the  problem  of  how  to  control 
government  in  a  society  without  distinctions  of 
rank.  Dividing  the  executive,  legislative,  and  judi¬ 
cial  functions  between  three  distinct  agencies 
appeared  to  provide  a  response  to  this  dilemma. 
However,  it  took  some  time  to  evolve.  Although 
Book  11,  Chapter  6  of  Montesquieu’s  The  Spirit  of 
the  Laws  has  been  credited  with  offering  a  defini¬ 
tive  statement  of  the  doctrine,  his  account  still  bore 
the  hallmarks  of  its  origins  in  the  system  of  mixed 
government — not  least  because  of  its  being  based 
on  an  analysis  of  the  British  parliamentary  system 
and  the  respective  roles  of  monarch,  lords,  and 
commons  within  it.  The  functional  division  also 
remained  far  from  clear-cut,  with  the  judicial 
branch  and  function  still  imperfectly  differentiated 
from  the  other  two.  Only  with  the  drafting  of  the 
U.S.  Constitution  and  the  debates  surrounding  it, 
most  notably  the  Federalist  Papers,  did  the  doc¬ 
trine  emerge  in  its  mature  form. 
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The  underlying  rationale  of  this  separation  is 
that  individuals  or  groups  should  not  be  “judges  in 
their  own  cause.”  The  division  between  the  three 
branches  aims  to  ensure  that  those  who  formulate 
the  laws  are  distinct  from  those  entrusted  with 
their  interpretation,  application,  and  enforcement. 
In  this  way,  lawmakers  are  subject  to  the  same 
laws  and  so  have  an  incentive  to  avoid  self-inter¬ 
ested  legislation  and  to  frame  it  in  general  terms 
that  will  be  equally  applicable  to  all.  These  laws 
then  guide  the  decisions  of  the  executive  and  judi¬ 
ciary,  who  because  they  are  similarly  under  the  law 
also  have  good  reason  to  act  in  an  impartial  man¬ 
ner.  Separating  functions  also  brings  the  efficiency 
gains  associated  with  the  division  of  labor.  In  par¬ 
ticular,  the  activity  of  the  legislature  is  made  less 
cumbersome  through  delegating  more  short-term 
decisions  to  an  executive  branch  capable  of  acting 
with  greater  coherence  and  dispatch. 

On  its  own,  it  is  unclear  how  effective  this 
separation  is.  Not  only  are  the  four  functions  hard 
to  distinguish  clearly,  but  unless  a  different  group 
operates  each  branch,  there  is  nothing  to  prevent 
their  acting  in  concert.  However,  four  other  theo¬ 
retical  developments  accompanied  the  shift  from 
mixed  government  to  the  separation  of  powers 
that  changed  its  character.  First,  mixed  govern¬ 
ment  had  been  challenged  earlier  by  theorists  of 
sovereignty,  such  as  Jean  Bodin  and  Thomas 
Hobbes,  who  regarded  the  idea  of  dividing  power 
as  incoherent.  The  separation  of  powers  came  into 
being  in  a  context  shaped  by  the  notion  that  at 
some  level  power  had  to  be  concentrated,  and,  in 
the  context  of  the  English,  American,  and  French 
revolutions,  the  natural  assumption  was  to  shift 
the  sovereign  power  of  the  monarch  to  the  people 
as  a  whole.  Second,  the  notion  of  the  people  as  a 
whole  was  likewise  new.  Previously,  the  “people” 
had  simply  meant  the  “commons”  or  the  “many.” 
The  whole  people  became  the  authors  of  the  con¬ 
stitution,  which  as  the  embodiment  of  their  will 
became  sovereign  over  the  will  of  any  subdivision 
of  the  people,  including  the  majority.  Third,  as  a 
corollary,  constitutions  became  entrenched  writ¬ 
ten  documents  expressing  a  “higher”  law,  which 
could  be  amended  only  by  the  people  as  a  whole 
or  by  some  supermajority  that  could  plausibly  be 
said  to  represent  their  will.  Fourth,  notions  of 
rights  became  key  aspects  of  the  constitution. 
Initially  rights  were  no  more  an  intrinsic  part  of 


the  separation  of  powers  than  they  had  been  of 
mixed  government.  The  Bill  of  Rights  was  an 
appendix  to  the  U.S.  Constitution,  which  had  pre¬ 
viously  been  confined  to  describing  the  system  of 
government.  Nevertheless,  the  securing  of  indi¬ 
vidual  rights  gradually  became  the  goal  of  all 
constitutional  arrangements. 

These  four  developments,  but  particularly  the 
last  two,  had  a  tremendous  impact  on  constitu¬ 
tionalism  and  proved  crucial  in  moving  it  in  a  legal 
and  especially  a  judicial  direction.  Within  the 
“pure”  theory  of  the  separation  of  powers,  all 
three  branches  were  coequal.  As  with  the  theory  of 
mixed  government,  the  aim  was  to  prevent  any  one 
section  of  society  dominating  another  by  obliging 
each  to  collaborate  with  the  others.  If  anything, 
the  legislative  power  was  logically  prior  to  the  oth¬ 
ers — producing  in  the  U.S.  scheme  federal  and 
bicameral  arrangements  within  the  legislature  that 
harked  back  to  the  doctrine  of  mixed  government 
and  a  clear  division  between  the  legislature  and 
executive.  As  noted  earlier,  the  distinctiveness  of 
judicial  functions  was  weak  in  the  doctrine  of 
mixed  government  and  slow  to  emerge  in  the  the¬ 
ory  of  the  separation  of  powers.  However,  making 
a  legal  document  sovereign — only  challengeable  by 
the  sovereignty  of  the  people  as  a  whole — inevita¬ 
bly  empowered  the  judiciary,  particularly  given  the 
comparative  length  of  judicial  appointments  and 
their  relative  isolation  from  electoral  pressures  by 
contrast  to  the  other  branches.  The  judiciary  now 
decided  the  competences  of  the  various  branches 
of  government,  including  their  own,  and  set  limits 
not  only  to  the  processes  of  government  but  also  to 
its  goals  with  regard  to  individual  rights.  These 
features  have  come  to  define  modern  constitution¬ 
alism  and  are  reflected  in  all  the  constitutional 
arrangements  of  postwar  democracies.  Yet  they 
also  coexist  with  forms  of  political  constitutional¬ 
ism  and  mixed  government.  It  remains  to  explore 
their  respective  advantages  and  disadvantages  and 
the  tensions  between  them. 

Political  and  Legal 
Constitutionalism  Compared 

An  entrenched,  rights-based,  and  justiciable  con¬ 
stitution  is  said  to  ensure  stable  and  accountable 
government,  obliging  legislatures  and  executives 
to  operate  according  to  the  established  rules  and 
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procedures,  and  above  all  prevents  their  sacrific¬ 
ing  individual  rights  to  administrative  conve¬ 
nience,  popular  prejudices,  or  short-term  gains. 
Given  no  working  constitutional  government  has 
not  been  also  a  working  democracy,  few  analysts 
believe  constitutions  can  restrain  a  genuinely 
tyrannical  government.  Rather,  the  aim  is  to  pre¬ 
vent  democratic  governments  from  falling  below 
their  self-professed  standards  of  showing  all  equal 
concern  and  respect.  So  a  legal  constitution  is  seen 
as  a  corrective  to — even  a  foundation  for — a 
working  political  constitution.  Yet  it  remains  a 
moot  point  whether  it  performs  its  appointed  task 
any  more  effectively  or  legitimately. 

Democratic  governments  are  said  to  be  prone  to 
overreacting  to  emergency  situations,  sacrificing 
civil  rights  to  security,  and  pandering  to  either 
electorally  important,  yet  unrepresentative,  minor¬ 
ities  or  the  populist  sentiments  of  the  majority. 
Insulated  from  such  pressures,  a  court  can  be  more 
impartial  while  its  judgments  are  bound  by  consti¬ 
tutional  law.  However,  others  contend  these  sup¬ 
posed  advantages  turn  out  to  be  disadvantageous. 
Going  to  law  offers  an  alternative  to  entering  the 
political  realm,  yet  access  is  more  restricted  than 
voting  and  the  costs  of  a  case  as  prohibitive  to 
most  ordinary  citizens  as  founding  a  new  party. 
Meanwhile,  it  allows  those  with  deep  pockets  to 
fasten  on  to  a  single  issue  that  affects  their  interests 
without  the  necessity  of  winning  others  to  their 
point  of  view.  Courts  may  be  restricted  to  the  law 
in  their  judgments — but  what  does  that  mean?  Is 
the  law  to  be  found  in  the  text  of  the  constitution, 
the  original  intentions  of  those  who  drafted  it,  the 
objective  meaning  of  the  principles,  or  the  com¬ 
mon  understandings  of  the  people?  Words  are 
open  to  multiple  meanings,  so  textualism  hardly 
proves  binding  on  judgments,  while  semantics 
seems  an  odd  way  to  decide  difficult  moral  and 
political  issues.  The  intentions  of  the  drafters  are 
unlikely  to  be  consistent  or  knowable  and  may  well 
be  inappropriate  in  contemporary  conditions.  Being 
bound  by  the  past  favors  the  status  quo  and  those 
who  are  privileged  by  current  arrangements,  thereby 
hindering  progressive  reform.  If  the  principles 
behind  the  constitution  are  universal  and  timeless, 
then  it  could  be  applied  to  any  and  all  situations. 
Yet  legal  philosophers — no  less  than  citizens — 
disagree  whether  such  principles  even  exist,  let 
alone  what  they  might  require  in  particular  cases. 


Appealing  to  a  popular  consensus  will  not  resolve 
that  problem,  for  it  is  either  unlikely  or  better  pro¬ 
vided  by  a  political  constitutionalism  that  consults 
popular  views  directly.  In  all  these  respects,  judi¬ 
cial  review  risks  becoming  arbitrary  rather  than 
being  a  block  on  arbitrariness. 

As  legal  constitutionalism  has  spread,  establish¬ 
ing  itself  not  just  in  former  authoritarian  regimes 
but  also  in  the  United  Kingdom  and  Commonwealth 
countries  where  political  constitutionalism  had 
hitherto  held  sway  alone,  so  empirical  scholarship 
has  highlighted  these  drawbacks.  More  often  than 
not,  legal  constitutionalist  arrangements  have  been 
introduced  by  hegemonic  groups  fearing  political 
challenges  to  their  position,  with  the  record  of  the 
new  regimes  faring  no  better  overall  on  civil  rights 
and,  from  an  egalitarian  perspective,  rather  worse 
on  social  and  economic  rights.  Whereas  political 
constitutionalism  responds  to  majority  views  for 
enhanced  and  more  equal  public  goods,  legal  con¬ 
stitutionalism  has  invariably  inhibited  such  reforms 
on  grounds  of  their  interfering  with  individual 
property  and  other  rights.  Nor  has  it  upheld  polit¬ 
ical  constitutional  arrangements  particularly 
well — for  example,  blocking  campaign  finance 
limits  in  many  jurisdictions.  Of  course,  important 
exceptions  exist,  with  the  progressive  rulings  of  the 
Warren  Court  (1953-1969)  in  the  United  States 
offering  an  apparent  contrast  to  the  free  market 
decisions  of  the  Lochner  era  (1897-1937). 
However,  these  decisions  largely  reflected  sus¬ 
tained,  national,  majority  opinion  and  only  became 
effected  when  backed  by  legislative  rulings  and 
executive  action.  At  best,  legal  constitutionalism 
proves  only  as  good  as  the  political  constitution;  at 
worst,  it  inhibits  its  more  equitable  and  legitimate 
working. 

Richard  Bellamy 
University  College  of  London 
London,  United  Kingdom 
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Constructivism 


Constructivism  is  a  theory  according  to  which 
social  phenomena  are  constructed  through  interac¬ 
tions  among  humans,  who  interpret  one  another’s 
actions  and  define  situations  based  on  those  inter¬ 
pretations.  Thus,  constructivism  offers  a  way  of 
studying  social  phenomena,  which  people  tend  to 
treat  as  though  they  were  objective  entities. 
However,  from  the  viewpoint  of  constructivism, 
what  people  believe  to  be  objective  entities  are 
actually  accomplished  through  interactions  between 
human  actors  who  interpret  those  phenomena 
within  specific  social  and  historical  contexts. 

Constructivism  is  not  a  theory  composed  of  a 
series  of  hypotheses  but  a  perspective  that  studies 
discourse  in  order  to  analyze  phenomena.  This 
perspective  gained  prominence  following  the  pub¬ 
lication  in  1966  of  Peter  Berger’s  and  Thomas 
Luckmann’s  The  Social  Construction  of  Reality. 
Since  then  it  has  become  widely  influential  through¬ 
out  the  social  and  human  sciences.  For  example,  in 
anthropology  and  sociology,  there  were  the  debates 
between  “essentialism”  and  “constructivism”  con¬ 
cerning  sex,  race,  and  ethnicity.  The  debate  made 
it  clear  that  sex  and  race  cannot  be  differentiated 
using  only  biological  standards  nor  reduced  to 
unchanging  essences.  People  use  these  categories  in 
practical  ways,  contingent  on  the  context: 
Depending  on  the  situation,  a  certain  gender  or 
racial  category  is  attributed  to  a  particular  person. 

Thus,  it  is  impossible  to  identify  sex  using  only 
an  objective  biological  standard.  There  are  people 
who  experience  an  inconsistency  between  their 
biological  sex  and  their  subjective  consciousness  of 
the  sex  to  which  they  think  of  themselves  as 
belonging.  In  Japan,  a  law  was  passed  in  2003,  by 
which  people  who  undergo  gender  reassignment 
surgery  and  who  do  not  have  any  juvenile  children 
can  be  categorized  legally  as  being  of  their  new  sex 
by  getting  permission  from  the  family  court.  They 
can  marry  people  of  their  previous  gender.  Similar 


laws,  some  of  which  allow  more  lenient  conditions 
for  changing  one’s  legal  sex,  were  legislated  in  a 
number  of  European  countries. 

Race  (usually  understood  as  rooted  in  biology) 
and  ethnicity  (understood  as  cultural)  also  prove 
hard  to  classify.  Many  people  are  so-called  mixed 
race,  and  ascribing  racial  categories  to  them  is  not 
easy.  For  example,  the  U.S.  Census  treats  “Hispanic” 
and  “Latino”  as  ethnic  categories,  and  Hispanics 
or  Latinos  may  classify  themselves  as  belonging  to 
the  racial  category  of  White,  Two  or  More  Races, 
or  Some  Other  Race.  However,  in  informal  con¬ 
texts,  Latino  and  Hispanic  may  be  considered  to  be 
either  ethnic  or  racial  classifications. 

Ethnicity  is  characterized  by  cultural  traits  such 
as  language,  religion,  customs,  and  social  behav¬ 
ior,  but  standards  for  ascribing  ethnicity  also  are 
uncertain.  In  the  United  Kingdom  (UK),  Chinese 
are  sometimes  considered  an  independent  cate¬ 
gory,  differentiated  from  the  separate  Asian  cate¬ 
gory,  while  there  are  different  ethnic  groups 
among  Arabic  people.  Also,  as  ethnicity  has 
become  the  focus  for  many  nationalist  movements 
in  the  world,  it  becomes  apparent  that  the  concept 
of  ethnicity  itself  is  a  historical  product. 

On  Terminology 

Readers  may  have  encountered  two  terms:  con¬ 
structivism  and  constructionism.  Concerning  these 
different  terms,  Holstein  and  Gubrium,  the  editors 
of  a  comprehensive  handbook  on  the  study  of  con¬ 
structivism  published  in  2008,  point  out  that, 
although  constructivism  is  the  preferred  term  in 
science  and  technology  studies  and  construction¬ 
ism  is  more  widely  used  in  the  social  sciences,  the 
two  terms  can  be  used  interchangeably  in  most 
cases.  Joel  Best,  another  sociologist,  notes  that 
constructivism  has  high  cultural  overtones  and 
appears  to  be  favored  by  British  scholars,  although 
American  sociologists  seem  to  use  the  two  terms 
interchangeably.  Thus,  this  entry’s  use  of  construc¬ 
tivism  as  a  generic  term  in  the  social  sciences 
encompasses  constructionism ,  the  term  often  used 
in  empirical  research  by  political  scientists. 

Constructivist  Studies  of  Science 

No  one  can  dispute  that  the  roots  of  constructiv¬ 
ism  are  in  the  sociology  of  knowledge.  However, 
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the  sociology  of  social  problems  and  the  sociology 
of  science  were  the  specialties  that  took  the  lead  in 
exploring  constructivism  and  within  which  this 
perspective  became  dominant.  Using  ethnographi¬ 
cal  and/or  anthropological  studies,  sociologists 
interested  in  studying  science  from  a  constructivist 
viewpoint  described  how  scientists  reach  agree¬ 
ment  when  they  discover  new  information. 

Although  these  researchers  focused  on  the  inter¬ 
actions  among  scientists  in  the  laboratory,  they 
tended  to  neglect  outside  influences.  Constructivists 
who  study  the  sociology  of  science  often  do  lim¬ 
ited,  microsphere  ethnography  in  laboratories. 
Some  of  them  focus  narrowly  on  scientists’  conver¬ 
sations,  as  though  only  discourse  can  determine 
what  is  considered  a  scientific  finding.  These 
researchers  are  liable  to  be  committed  to  relativ¬ 
ism.  That  is,  some  of  them  seem  to  argue  that 
anything  might  be  recognized  as  true  so  long  as 
there  is  consensus  through  discourse  within  a  com¬ 
munity,  or  they  seem  to  imply  that  there  is  no  way 
to  make  sure  of  the  certainty  of  assertions.  When 
some  of  them  expanded  their  research  to  scientists’ 
networks  outside  the  laboratory,  they  discovered 
much  broader,  structural  contexts  that  influence 
the  activities  of  scientific  research. 

Constructivist  Studies  of  Social  Problems 

The  study  of  social  problems  is  the  constructivist 
work  most  relevant  to  political  science.  Research  in 
which  sociologists  escaped  from  the  trap  of  relativ¬ 
ism  can  be  seen  in  the  study  of  social  problems. 
Social  constructivist  studies  of  social  problems 
evolved  from  the  labeling  perspective  on  deviant 
behavior.  Labeling,  which  flourished  in  the  late 
1960s  and  early  1970s,  focused  less  on  the  deviant 
behavior  itself  than  on  the  process  through  which 
some  behavior  is  defined  and  treated  as  deviant.  It 
studied  how  deviant  behavior  is  socially  constructed. 

Positivists  who  study  social  problems  search  for 
the  causes  of  social  problems.  Constructivists, 
however,  point  out  that  social  problems  are  taken 
for  granted  by  positivist  scholars.  Positivists  pre¬ 
suppose  what  are  social  problems  and  treat  them  as 
though  they  are  social  conditions,  rather  than  first 
studying  how  some  phenomenon  is  constructed 
into  a  social  problem  through  the  interactions 
among  social  actors,  agencies,  groups,  organiza¬ 
tions,  and  institutions.  A  social  problem  is  what 


those  people — not  academic  researchers — define 
as  a  social  problem.  Thus,  as  Malcolm  Spector  and 
John  Kitsuse  (1977)  assert,  social  problems  are  not 
objective  conditions  but  the  activities  of  individu¬ 
als  or  groups  who  make  claims  with  respect  to 
some  “putative  conditions.”  Analysts  should  focus 
on  the  people  making  and  responding  to  claims 
and  counterclaims  and  on  the  interactions  among 
claims-making  groups  and  responding  groups  and 
institutions. 

When  this  constructivist  perspective  was  criti¬ 
cized  for  inevitably  presupposing  some  character¬ 
istics  of  the  “putative  conditions,”  it  divided  into 
two  schools:  strict  constructivism  and  contextual 
constructivism.  Strict  constructivism  claims  that 
researchers  should  refrain  from  any  presupposi¬ 
tions  about  the  characteristics  of  “putative  condi¬ 
tions”  and  aims  for  a  pure,  presupposition-free 
constructivist  position.  Contextual  constructivism 
says  that  it  is  impossible  to  have  presupposition- 
free  constructivism.  It  also  says  that  scientists  can 
sometimes  check  the  “putative  conditions”  and 
find  that  some  claims  might  be  absurd,  even 
though  the  aim  of  contextual  constructivist 
research  is  not  debunking  the  content  of  the 
claims.  At  the  beginning,  there  seemed  to  be  much 
support  for  strict  constructivism,  yet  the  great 
majority  of  published  research  adopts  the  stance  of 
contextual  constructivism. 

Even  when  the  same  behavior  is  claimed  to  be  a 
social  problem,  the  people  making  the  claims  can 
adopt  very  different  “frames,”  depending  on  his¬ 
torical  and  social  contexts.  Social  problems  that 
are  constructed  using  frames  that  are  easy  to 
accept  tend  to  become  established,  familiar  social 
problems.  Recent  examples  of  such  widely  accepted 
frames  include  “human  rights,”  “health,”  and 
“democracy.”  Consider  the  case  of  smoking. 
Selling  cigarettes  was  banned  in  15  states,  such  as 
Kansas,  Illinois,  and  Minnesota,  in  the  United 
States  by  1909.  (During  World  War  I,  people 
became  tolerant  of  cigarette  smoking,  and  these 
laws  were  later  abolished.)  This  was  accomplished 
by  defining  it  as  a  vicious  habit  and  an  immoral 
behavior  from  the  viewpoint  of  Protestant  ethics. 
More  recently,  smoking  became  regulated  because 
of  its  harmful  effects  on  health.  But  even  the  health 
frame  can  be  constructed  in  different  ways:  Initial 
claims  emphasized  the  damage  to  the  smoker’s 
own  health;  however,  more  recent  regulations 
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have  been  justified  in  terms  of  the  risks  to  others’ 
health  caused  by  passive  smoking.  Thus,  the  cur¬ 
rent  construction  invokes  a  synthesis  of  health  and 
rights  frames. 

Once  a  social  problem  gains  acceptance,  it  can 
undergo  what  Best  (2008)  calls  “domain  expan¬ 
sion.”  When  “hate  crime”  was  first  categorized,  it 
was  defined  as  a  crime  caused  by  racial  prejudice. 
However,  its  domain  has  expanded  to  include 
prejudice  against  sexual  orientation.  Similarly, 
child  abuse  initially  meant  physical  violence 
inflicted  on  children,  such  as  beatings,  but  now  it 
has  been  expanded  to  include  verbal  and  psycho¬ 
logical  abuse. 

When  the  mass  media  focus  on  some  phenom¬ 
enon  claimed  to  be  a  social  problem,  to  attract 
more  attention  and  have  a  stronger  impact,  they 
tend  to  depict  an  extremely  serious  case.  Because  it 
is  referred  to  repeatedly,  it  becomes  a  high-profile 
case,  and  people  are  liable  to  think  of  this  instance 
as  though  it  were  a  typical  case  of  that  social  prob¬ 
lem.  For  example,  claims  about  missing  children 
are  illustrated  with  cases  of  children  kidnapped 
and  murdered  by  strangers.  However,  the  reality  is 
that  most  child  abductions  are  committed  by  a 
separated  partner  in  the  course  of  a  family  dispute, 
while  the  great  majority  of  children  reported  as 
missing  have  been  runaways  who  returned  home 
safely  and  voluntarily. 

Social  problems  are  constructed  by  using  the 
language  and  narratives  of  claims  makers,  victims, 
supporters,  and  the  media  and  through  interaction 
among  them  and  their  readers  and  audiences.  First, 
a  phenomenon  should  be  recognized  and  named. 
In  that  sense,  it  is  constructed  as  being  linguisti¬ 
cally  different  from  other  phenomena.  Such  claims 
serve  as  a  kind  of  advertising  activity,  promoted  by 
mobilizing  resources  to  make  the  phenomenon 
recognized  as  a  serious  problem  to  others  and  to 
demand  some  sort  of  solution  from  among  various 
alternatives.  Rhetoric  is  crucial  to  stimulate  peo¬ 
ple’s  emotions  and  to  persuade  them.  Thus,  vic¬ 
tims  are  presented  as  innocent  and  vulnerable. 

This  discourse  of  claims  can  be  analyzed  from 
different  approaches  to  constructivism.  Historically 
sensitive  constructivism  inspired  by  Michel 
Foucault  analyzes  discourse  at  a  macrolevel. 
Constructivism  informed  by  ethnomethodology 
and  conversation  analysis  studies  discourse  at  a 
microlevel. 


Construction  of  Crime  Problems 
and  Social  Policy 

In  advanced  societies,  social  problems  and  politics 
intersect  when  deviant  behavior  is  being  con¬ 
structed  as  a  criminal  problem.  One  strategy  taken 
to  solve  such  problems  is  to  “get  tough”  and  mete 
out  harsher  punishment.  This  policy  is  most  evi¬ 
dent  in  the  United  States  and  the  UK. 

In  the  United  States,  the  “broken-windows” 
theory  has  been  put  into  practice.  According  to 
this  theory,  broken  windows,  graffiti,  and  similar 
public  displays  of  neglected  property  and  petty 
criminality  tend  to  encourage  further  criminal 
behavior.  The  strategy  in  this  theory  is  to  regulate 
such  minor  offenses  in  order  to  prevent  serious 
offenses.  In  the  UK,  the  government  has  instituted 
laws  to  control  antisocial  behavior.  Drawing  graf¬ 
fiti,  making  noise  to  disturb  neighbors,  annoying 
pedestrians,  and  other  similar  behaviors  are  defined 
as  antisocial.  When  people  commit  such  behavior 
for  the  first  time,  they  are  cautioned  and  get  an 
Anti-Social  Behavior  Order  (ASBO),  which  leads 
to  a  civil  case  before  a  magistrate’s  court.  If  they 
exhibit  antisocial  behavior  again  and  breach  the 
ASBO,  they  can  be  prosecuted  by  an  agency  of  the 
local  government  at  a  magistrate’s  court  as  a 
criminal  case  and  can  be  imprisoned. 

People  are  afraid  of  youth  crime.  By  showing 
themselves  to  be  “tough  on  crime,”  politicians 
gain  popularity  and  receive  more  votes  from  their 
constituents.  Even  though  politicians  may  know 
that  the  true  dichotomy  is  not  tough  versus  soft  on 
crime,  when  they  see  the  tabloid  newspaper  head¬ 
lines  and  articles,  which  they  think  reflect  public 
opinion,  they  hesitate  to  be  seen  as  soft,  being 
afraid  to  lose  popularity  among  voters. 

Media  coverage  can  foster  a  strong  fear  of  crime. 
Also,  it  encourages  readers  and  audiences  to  empa¬ 
thize  with  victims.  The  fear  of  being  victimized  and 
the  desire  for  revenge  against  the  perpetrators  rises, 
and  people  demand  tougher  policies  against  crime. 
Politicians  not  only  think  it  their  responsibility  to 
respond  to  the  population’s  desire  but  also  try  to 
use  the  situation  as  an  election  instrument.  This  is 
“penal  populism,”  promoted  by  distorted  public 
opinion  inspired  by  the  mass  media’s  sensational 
crime  reporting,  politicians’  posturing  so  as  not  to 
lose  popularity,  campaigns  by  interest  groups 
including  formal  social  control  agencies,  and  the 
enterprises  that  run  private  correctional  facilities. 
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In  2008,  the  International  Centre  for  Prison 
Studies  reported  that  the  number  of  people  incar¬ 
cerated  in  U.S.  prisons  and  jails  was  2,293,157 
(including  pretrial  detainees).  The  incarceration 
rate  was  756  per  100,000  of  population:  the  high¬ 
est  rate  in  the  world,  and  according  to  U.S. 
Department  of  Justice  statistics,  one  in  nine  Black 
males  between  the  ages  of  20  and  34  years  was  in 
jail.  The  International  Centre  reported  that  in 
England  and  Wales,  the  number  of  prisoners 
nearly  doubled,  from  42,000  in  1991,  to  more 
than  83,000  in  2008.  The  2008  incarceration  rate 
was  153  per  100,000  population. 

Another  aspect  of  the  social  construction  of 
policy  is  the  social  construction  of  target  popula¬ 
tions  of  social  policy.  Research  done  by  construc¬ 
tivist  policy  scholars  shows  that  people  who  are 
most  vulnerable  tend  to  participate  least  in  poli¬ 
tics,  so  their  interests  are  liable  to  be  ignored  in  the 
designs  of  social  policies.  It  is  also  the  case  that 
people  who  are  deemed  the  target  of  benevolent 
social  policies  tend  to  be  disadvantaged  and  there¬ 
fore  neglected,  as  they  suffer  from  a  scarcity  of 
resources  and  fail  to  participate  in  politics. 

Constructivism  and  International  Relations 

In  the  study  of  international  politics,  realism  has 
been  the  dominant  theory.  That  perspective  sup¬ 
poses  that  nations  unwaveringly  pursue  power  and 
wealth.  However,  the  situation  of  “the  war  of  all 
against  all”  is  not  always  a  natural  condition  nor 
universally  adaptive  to  all  situations.  If  the  leading 
nations  adopt  an  attitude  that  assumes  all  other 
nations  are  enemies,  the  situation  of  the  war  of  all 
against  all  seems  to  be  accomplished  by  the  reactions 
of  other  nations,  which  is  to  take  the  same  attitude 
to  defend  themselves  from  stronger  nations.  This 
becomes  a  self-fulfilling  prophecy.  However,  this  is 
not  an  unavoidable  or  necessary  situation.  Alexander 
Wendt  (1999)  identified  three  types  of  cultural  con¬ 
ditions  of  anarchy  and  called  this  type  “Hobbesian.” 
He  pointed  out  that  there  are  alternatives  such  as 
“Lockean,”  which  is  based  on  roles  of  rivalry, 
and  “Kantian,”  which  is  based  on  friendship  roles. 
Using  the  structuration  theory  of  Anthony  Giddens 
and  symbolic  interactionism  in  sociology,  Wendt 
postulates  that  agents  and  structures  are  mutually 
constitutive,  and  he  emphasizes  agents’  interpretive 
activities  through  which  collective  identities  are 
formed. 


Thus,  what  looks  to  be  a  general  principle  pre¬ 
supposes  some  conditions  that  are  not  universal. 
Constructivism  points  out  what  is  taken  for 
granted  by  ordinary  citizens,  diplomats,  leaders, 
groups,  organizations,  agencies,  and  states.  Thus, 
in  the  field  of  international  relations,  there  cannot 
be  objective  and  universal  rules  such  as  those  that 
govern  in  the  natural  sciences. 

From  the  viewpoint  of  constructivism,  the 
actions  of  nations  are  also  performed  according  to 
their  accounts  and  interpretations  of  what  is  con¬ 
sidered  legitimate,  appropriate,  or  authentic.  It  is 
not  only  material  power  but  also  ideas  and  norms 
that  influence  their  actions.  Actions  of  nations  are 
not  automatic  reactions  to  the  global  power  struc¬ 
ture.  Further,  they  cannot  be  predicted  using  only 
a  rational  choice  theory  based  on  the  calculation  of 
nations’  material  powers  and  interests. 

Not  only  nations  but  also  agencies,  such  as  non¬ 
governmental  organizations,  nonprofit  organiza¬ 
tions,  human  rights  organizations,  the  United 
Nations  (UN),  the  UN’s  Human  Rights  Committee, 
the  European  Union,  the  Council  of  Europe,  the 
European  Commission  of  Human  Rights  (now 
obsolete),  and  the  International  Criminal  Court, 
have  become  more  significant  in  international  rela¬ 
tions.  It  is  important  to  take  into  account  these 
groups’  own  definitions  of  the  situations  concern¬ 
ing  norms,  appropriateness,  legitimacy,  roles,  and 
identities  that  affect  their  behavior. 

Because  nations  interpret  and  define  situations, 
they  are  not  puppets  or  organs  that  produce  the 
same  outcome  confined  by  the  international  struc¬ 
ture.  Compared  with  the  adherents  of  realism, 
which  emphasizes  material  power,  constructivists 
are  interested  in  researching  international  norms 
that  affect  international  relations.  Constructivists 
explore  how  new  norms  emerge,  cascade,  and 
become  internalized  within  international  relations. 
Although  the  emergence  of  norms  in  international 
relations  is  an  interesting  theme  of  constructivism, 
in  sociology  the  control  of  human  behavior  by  rules 
has  been  studied  mainly  by  structural  functionalists. 
However,  in  international  relations,  global  situa¬ 
tions  are  much  more  fluid  than  the  social  structure 
of  a  single  state,  and  constructivists  study  all  the 
facets  of  norms,  such  as  their  emergence,  interpreta¬ 
tion,  role  taking,  and  life  cycles.  This  complexity  is 
why  constructivism,  rather  than  structural  func¬ 
tionalism,  is  needed  to  study  international  relations. 
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When  a  norm  is  applied,  we  can  expect  that 
interpretations  of  it  will  be  very  different  from 
situation  to  situation.  The  pragmatics  of  norms, 
the  rhetoric  that  is  endorsed  when  norms  are  acti¬ 
vated,  and  the  discourse  that  can  mobilize  con¬ 
cepts  such  as  legitimacy,  authenticity,  appropriate¬ 
ness,  and  conventions  are  interesting  themes  of 
constructivism.  One  of  the  main  characteristics  of 
constructivism  is  that  it  does  not  assume  that  the 
behavior  of  nations  is  objectively  predetermined 
by  material  power  or  interest;  rather,  constructiv¬ 
ism  recognizes  that  interpretations  of  ideas  and 
definitions  of  situations  by  human  agencies  shape 
international  relations. 

Conclusion 

Constructivism  has  become  too  influential  in  most 
of  the  social  sciences  to  be  ignored.  In  some  fields, 
such  as  the  study  of  ethnicity  and  race,  it  has 
become  almost  impossible  to  find  studies  that  have 
nothing  to  do  with  constructivism  or  were  not 
inspired  by  it.  The  number  of  academic  papers  in 
the  social  sciences  that  contain  terms  relevant  to 
social  construction  has  increased  (Best,  2008).  Since 
the  connotations  of  “social  construction”  or 
“socially  constructed”  have  become  so  diversified  in 
academic  papers  and  books,  it  looks  as  if  the  term 
constructivism  (or  constructionism )  might  no  lon¬ 
ger  cover  all  that  their  usages  have  come  to  signify. 

Although  the  majority  of  research  in  political 
science  and  the  social  sciences  continues  to  be 
within  the  positivist  tradition,  constructivism  has 
become  increasingly  valued  for  its  insights  and 
creativity,  and  it  will  likely  continue  to  play  a 
growing  role. 

Jun  Ayukawa 
Kwansei  Gakuin  University 
Nishinomiya,  Japan 
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Constructivism  in 
International  Relations 


Constructivism  in  international  relations  (IR)  refers 
to  a  family  of  theoretical  approaches  that  share 
three  analytical  focal  points  in  appraising  world 
politics:  inter  subjectivity,  the  mutual  constitution  of 
agents  and  structures,  and  the  double  hermeneutics 
(i.e.,  the  interpretation  of  interpretations).  Heavily 
inspired  by  sociological  thinking,  constructivism  is 
not  a  substantive  theory  of  international  politics  in 
the  same  way  that  neorealism  is,  for  instance.  In  the 
IR  discipline,  constructivists  generally  seek  to 
redress  the  lack  of  attention  given  to  social  factors 
in  political  life,  which  characterizes  rationalist  utili¬ 
tarian  models.  Starting  from  the  premise  that  world 
politics  basically  consist  of  social  relations,  con¬ 
structivist  scholars  believe  that  international  poli¬ 
tics  are  not  fundamentally  different  from  other 
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spheres  of  human  activity,  where  practices  are 
produced,  reproduced,  and  contested  inside  a 
meaningful  and  patterned  social  context. 

Although  constructivism  emerged  5  decades  ago 
in  philosophy,  sociology,  and  anthropology,  it  did 
not  reach  the  field  of  IR  until  the  late  1980s  and 
early  1990s.  Two  main  evolutions  favored  the  rise 
of  constructivism  in  IR.  First,  the  failure  of  domi¬ 
nant  theories  to  predict  or,  more  simply,  to  explain 
the  end  of  the  Cold  War,  arguably  one  of  the  most 
important  international  changes  in  decades,  led  to 
something  of  an  existential  crisis  in  the  discipline. 
Second,  the  rise  of  the  “third  debate”  in  IR  theory 
about  epistemology  shook  up  more  orthodox 
understandings  about  how  social  scientists  should 
go  about  the  study  of  world  politics.  During  the 
1990s,  constructivism  gradually  imposed  itself  as  a 
via  media,  or  middle  ground,  in  the  IR  theoretical 
landscape,  revisiting  a  number  of  disciplinary  foun¬ 
dations  with  novel  ideas  such  as  Alexander  Wendt’s 
“anarchy  is  what  states  make  of  it”  (Wendt,  1999). 

One  evocative  way  to  locate  constructivism  in 
IR  is  to  look  at  the  notion  of  interest  and  the  ana¬ 
lytical  work  that  it  does  according  to  different  theo¬ 
ries.  Like  rationalism,  constructivism  assumes  that 
agents  have  interests  and  that  they  act  on  the  basis 
of  those  interests.  But  while  rationalism  takes  pref¬ 
erences  to  be  exogenously  given  (i.e.,  the  origin  of 
interests  falls  outside  the  scope  of  the  theory),  for 
constructivism  the  question  of  where  interests  come 
from  is  front  and  center  in  the  analysis.  Just  how  do 
people  come  to  want  what  they  want?  From  a  con¬ 
structivist  perspective,  the  crucial  issue  of  interest 
formation  cannot  be  dismissed  without  losing  sight 
of  a  fundamental  dimension  of  politics.  If  it  is  true 
that  interests  drive  the  world,  then  we  must  know 
where  they  come  from  and  how  they  form.  By  con¬ 
trast  with  the  rationalist  science  of  decision  mak¬ 
ing,  focused  on  how  people  act  on  the  basis  of 
preexisting  preferences,  then,  constructivism  favors 
the  analysis  of  sense  making — that  is,  how  people 
define  and  construe  their  interests.  Instead  of  being 
magically  read  off  a  material  structure,  interests  are 
contextually  defined  (Focal  Point  1:  inter  subjectiv¬ 
ity);  they  are  not  individually  defined  by  atomized 
individuals  but  generated  in  and  through  social 
relations  (Focal  Point  2:  mutual  constitution  of 
structure  and  agency);  and  they  need  to  be  inter¬ 
preted,  at  the  levels  of  action  and  observation 
(Focal  Point  3:  double  hermeneutics). 


The  following  portions  of  this  entry  introduce 
constructivism  in  IR  in  two  different  ways.  The 
first  section  reconstructs  the  three  developmental 
stages  that  the  family  of  theories  has  gone 
through  over  the  past  20  years.  The  second  sec¬ 
tion  discusses  and  illustrates  the  three  conceptual 
building  blocks  that  constructivism  is  premised 
on.  The  conclusion  assesses  whether  constructiv¬ 
ism,  one  generation  after  its  rise,  has  lived  up  to 
its  promises  to  open  new  thinking  space  in  the  IR 
discipline. 

The  Three  Moments  of  Constructivism 
in  International  Relations 

The  word  constructivism  appeared  on  the  IR  radar 
screen  for  the  first  time  in  Nicholas  Onuf’s  1989 
book  World  of  Our  Making.  In  the  following 
decade,  the  theoretical  label  quickly  gained  in 
prominence,  experiencing  its  heyday  at  the  turn  of 
the  millennium  with  the  publication  of  Wendt’s 
seminal  Social  Theory  of  International  Politics 
(1999).  Now  a  vibrant  approach  in  the  IR  theo¬ 
retical  landscape,  constructivism  has  shifted  gears 
lately,  taking  a  break  from  metatheory  to  move 
into  questions  of  applied  research  and  methodol¬ 
ogy.  Twenty  years  after  its  rise,  constructivism  is 
not  an  exotic  approach  any  more  in  IR,  as  the 
disciplinary  aggiornamento  that  accompanied  its 
rise  came  to  pass.  To  be  sure,  as  is  always  the  case 
with  historiography,  this  narrative  simplifies  and 
rationalizes  post  hoc  a  much  more  complex  and 
contested  disciplinary  trajectory.  Its  heuristic  pur¬ 
pose  is  limited  to  understanding  better  what  con¬ 
structivism  does  differently  from  other  IR 
approaches,  by  grasping  the  evolutionary  stages 
that  the  approach  went  through  over  the  past  20 
years:  an  epistemological  moment,  an  ontological 
moment,  and  a  methodological  moment. 

An  Epistemological  Moment 

The  “third  debate”  in  IR,  which  emerged  in 
reaction  to  the  neorealist  orthodoxy  of  the  early 
1980s,  was  the  necessary  condition  for  constructiv¬ 
ism  to  emerge.  Influenced  by  continental  philoso¬ 
phy,  a  number  of  critical  scholars  started  to  ques¬ 
tion  the  rampant  positivism  of  the  IR  discipline. 
Building  on  the  insights  of  the  linguistic  turn  (a 
development  in  20th-century  thought  that  focused 
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on  the  relationship  between  philosophy  and  lan¬ 
guage),  they  interrogated  the  scientific  ambition  by 
emphasizing  the  performativity  of  language  and,  by 
implication,  of  theory  and  practice.  In  replacement 
for  the  naturalist  orthodoxy,  critics  promoted  a 
postpositivist  epistemology  premised  on  reflexivity 
and  the  plurality  of  viewpoints  and  methods.  More 
than  20  years  later,  it  is  fair  to  say  that  the  third 
debate  in  IR  was  inconclusive,  with  both  sides 
retreating  to  their  respective  trenches  with  many 
bruises  but  little  advancement  of  their  positions. 

In  this  context  of  disciplinary  upheaval,  con¬ 
structivism  emerged  as  a  form  of  epistemological 
third  way  that  had,  for  a  time  at  least,  much  to 
offer  to  both  positivists  and  postpositivists.  For  the 
former,  constructivism  facilitated  the  cooptation 
of  certain  novel  insights  into  the  IR  theoretical 
mainstream — for  instance,  about  the  role  of  ide¬ 
ational  variables.  For  the  latter,  constructivism’s 
rising  profile  in  IR  appeared  to  grant  some  legiti¬ 
macy  to  a  large  part  of  the  postpositivist  epistemo¬ 
logical  critique.  In  the  end,  however,  the  middle 
ground  proved  a  very  uneasy  position  to  occupy, 
and  fierce  debates  over  “explaining  versus  under¬ 
standing”  continued  to  rage  among  IR  scholars. 
To  this  day,  there  still  exists  a  fracture  between 
(scientific)  realist  constructivists,  whose  epistemol¬ 
ogy  can  accommodate  positivism,  and  interpretive 
constructivists,  who  are  philosophically  closer  to 
postpositivism.  Though  inconclusive,  the  episte¬ 
mological  moment  of  constructivism  came  to  pass 
during  the  1990s  as  key  proponents  of  the  approach 
moved  the  spotlight  onto  ontological  matters. 

An  Ontological  Moment 

Born  out  of  an  epistemological  controversy, 
constructivism  imposed  itself  in  IR  largely  by  shift¬ 
ing  attention  away  from  the  third  debate  and 
toward  ontology.  Building  on  scientific  realism, 
several  constructivists  argued  that  the  “stuff”  of 
reality  that  is  out  there  should  take  analytical  pre¬ 
cedence  over  the  way  that  we  get  to  know  it. 
Ontology,  in  other  words,  matters  more  than  epis¬ 
temology.  While  this  position  was  crucial  in  estab¬ 
lishing  IR  constructivism  as  a  “legitimate” 
approach  in  the  eyes  of  the  disciplinary  main¬ 
stream,  among  constructivists  it  has  been  hotly 
contested  ever  since  Wendt  and  others  put  it  for¬ 
ward.  The  majority  view  nowadays  rather  portrays 


ontology  and  epistemology  as  two  sides  of  the 
same  coin:  After  all,  as  Stefano  Guzzini  notes,  con¬ 
structivism  is  about  the  social  construction  of 
knowledge  and  the  construction  of  social  reality. 

Constructivism  brought  two  key  ontological 
innovations  to  IR,  which  are  discussed  at  greater 
length  below.  First,  building  on  the  sociology  of 
knowledge,  constructivism  emphasizes  the  social 
construction  of  reality:  Ideas,  meanings,  and  other 
forms  of  collectively  held  beliefs  matter  in  the  expla¬ 
nation  of  social  and  political  life.  In  IR,  that  argu¬ 
ment  is  a  direct  rejoinder  to  neorealism,  by  which 
ideas  are  nothing  but  epiphenomena  of  deeper 
material  forces,  as  well  as  to  cognitivism,  which 
tends  to  reduce  beliefs  to  ideas  held  by  individuals. 
Second,  taking  inspiration  from  developments  in 
sociological  theory,  constructivism  puts  the  mutual 
constitution  of  agents  and  structures  at  the  center  of 
analysis.  Against  both  individualism  and  holism, 
constructivism  argues  that  actors  and  their  environ¬ 
ments  determine  one  another  in  a  recurring  pattern 
of  coevolution.  Building  on  these  two  innovative 
insights,  IR  constructivists  have  sought  to  rejuve¬ 
nate  the  ontological  foundations  of  the  IR  discipline 
by  putting  forward  a  number  of  new  concepts  and 
by  revisiting  old  ones  such  as  norms,  identities, 
rules,  communicative  action,  culture,  and  commu¬ 
nities.  Pointing  to  the  many  socially  constructed 
realities  that  make  up  world  politics,  constructivists 
throughout  the  1990s  legitimized  their  frameworks 
by  “discovering”  novel  ontological  entities — the 
most  famous  one  being  Wendt’s  1999  reinterpreta¬ 
tion  of  anarchy  as  a  cultural  and  historically  contin¬ 
gent  structure  of  interaction. 

A  Methodological  Moment 

Despite  its  success,  the  ontological  revolution 
that  constructivism  brought  about  in  IR  did  not  go 
without  problems.  First,  it  had  been  made  possible 
by  brushing  under  the  carpet  a  number  of  important 
epistemological  controversies.  To  take  an  obvious 
one,  if  reality  is  socially  constructed,  what  are  the 
implications  for  the  development  of  social-scientific 
knowledge?  And  second,  the  success  of  constructiv¬ 
ism  sparked  a  number  of  defensive  reactions  from 
the  disciplinary  mainstream.  By  the  beginning  of  the 
21st  century,  a  metatheoretical  fatigue  had  set  in  as 
calls  for  constructivists  to  do  “real  research”  and 
study  actual  problems  in  world  politics  multiplied. 
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Since  the  turn  of  the  millennium,  IR  constructiv¬ 
ism  has  focused  largely  on  issues  of  methodology, 
operationalization,  and  empirics.  How  should  con¬ 
structivism  proceed  with  the  study  of  world  politics 
given  its  ontological  innovations  and  epistemologi¬ 
cal  diversity?  In  the  past  decade,  the  number  of  in- 
depth  empirical  analyses  of  world  political  phenom¬ 
ena  has  exploded.  That  evolution  is  paralleled  by  an 
increasingly  loud  debate  about  methods.  What  is 
discourse  analysis  and  how  should  it  be  practiced? 
Is  quantification  amenable  to  the  constructivist  style 
of  reasoning?  What  type  of  fieldwork  data  can  best 
support  a  constructivist  argument?  How  can  we 
adjudicate  between  explanations — constructivist  or 
otherwise?  Can  constructivist  findings  travel  from 
one  case  to  another?  Often  grounded  in  the  empiri¬ 
cal  analysis  of  international  politics,  these  very 
important  methodological  issues  have  received 
increasing  scrutiny  over  the  past  few  years,  allowing 
constructivism  to  enter  a  constructive  and  grounded 
dialogue  with  the  rest  of  the  IR  discipline. 

The  Three  Building  Blocks  of 

Constructivism  in  International  Relations 

What  do  constructivists  do,  as  they  research  world 
politics,  that  makes  them  distinct  from  scholars 
who  favor  other  theoretical  approaches  in  IR? 
There  are  obviously  many  ways  to  answer  this 
question.  In  the  following  paragraphs,  constructiv¬ 
ism  is  characterized  by  the  three  analytical  focal 
points  that  its  advocates  typically  put  to  work  in 
their  analyses:  (1)  intersubjectivity,  (2)  the  mutual 
constitution  of  structure  and  agency,  and  (3)  the 
double  hermeneutics. 

Intersubjectivity 

Intersubjectivity  refers  to  the  condition  of  mean¬ 
ings  that  do  not  depend  on  a  particular  point  of  view 
to  exist.  In  an  intuitive  sense,  ideas  belong  to  indi¬ 
viduals  and  are  located  between  people’s  earlobes. 
Taking  this  view,  a  number  of  IR  specialists  have 
emphasized  the  role  of  perceptions  in  shaping  world 
politics.  But  constructivism  stresses  a  different 
(though  related)  kind  of  meaning — not  subjective 
but  bzfersubjective.  Contrary  to  ideas  held  by  indi¬ 
viduals  (e.g.,  a  cognitive  bias),  intersubjective 
meanings  define  reality  as  something  independent 
of  our  volition.  The  classic  example,  used  by  John 


Searle  (1995),  is  that  of  money:  In  a  banking  sys¬ 
tem,  certain  bits  of  paper  engraved  with  specific 
markings  are  consensually  taken  to  be  worth  20 
dollars.  This  socially  constructed  meaning,  which 
is  attached  to  bits  of  paper  by  a  collectivity,  is  very 
real  in  its  effects  (e.g.,  in  buying  groceries),  regard¬ 
less  of  one’s  personal  misgivings  about  capitalism 
and  despite  the  fact  that  for  Martians  paper  money 
would  probably  be  worthless.  Conventions — just 
like  rules,  norms,  identities,  cultures,  or  lan¬ 
guages — are  intersubjective  structures,  that  is, 
objectified  sets  of  meanings  that  order  social  con¬ 
figurations  irrespective  of  what  the  specific  agents 
that  take  part  in  them  believe.  Such  is  also  the  case, 
for  instance,  in  a  football  game:  Whatever  specific 
strategies  individual  players  may  have,  the  rules  of 
the  game  will  define  their  interaction  along  ide¬ 
ational  constraints  that  do  not  depend  on  any 
player’s  individual  point  of  view  to  exist.  Social 
facts  exert  effects  on  politics  by  virtue  of  a  critical 
mass  of  relevant  agents  taking  their  reality  for 
granted.  As  such,  they  make  social  action  possible 
by  creating  elements  of  a  common  world. 

Building  on  this  insight,  IR  constructivists  have 
focused  attention  on  a  wide  variety  of  social  facts 
and  artifacts  in  international  politics.  International 
threats,  for  instance,  are  not  self-evident  physical 
facts  but  socially  constructed  realities.  Similarly, 
norms,  identities,  institutions,  rules,  cultures,  prac¬ 
tices,  languages,  ideologies,  and  narratives  are  all 
forms  of  intersubjective  meanings  that  shape  world 
politics  and  guide  action.  States’  national  identities 
constitute  foreign  policies;  international  cultures  of 
anarchy  drive  interstate  relations;  world  structures 
of  rules  and  norms  determine  appropriate  behav¬ 
ior  on  the  international  stage.  As  John  Ruggie 
argues,  for  instance,  it  is  not  simply  hegemony  or 
preponderance  in  material  power  that  explains  the 
content  of  our  contemporary  world  order  but  the 
social  purposes  and  norms  that  intersubjectively 
define  the  identity  of  the  American  hegemon.  All  in 
all,  taking  intersubjectivity  seriously  implies  not 
only  that  meanings  matter  but  also  that  they  mat¬ 
ter  as  structural  forces  in  world  politics. 

The  Mutual  Constitution 
of  Structure  and  Agency 

The  mutual  constitution  of  structure  and  agency 
refers  to  the  dynamic  processes  through  which 
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actors  and  contexts  coevolve  and  codetermine  one 
another.  Inspired  by  Anthony  Giddens’s  structura¬ 
tion  theory  and  Pierre  Bourdieu’s  theory  of  prac¬ 
tice,  the  notion  stresses  that  the  social  world  com¬ 
prises  neither  freestanding  individuals  nor  free- 
floating  structures.  Instead,  structures  are  made 
possible  by  agents’  practices,  which  are  themselves 
constituted  by  their  context.  According  to  Wendt, 
the  idea  can  be  simplified  with  two  “truisms” 
about  social  life: 

1)  Human  beings  and  their  organizations  are 
purposeful  actors  whose  actions  help  reproduce 
or  transform  the  society  in  which  they  live;  and 

2)  society  is  made  up  of  social  relationships, 
which  structure  the  interactions  between  these 
purposeful  actors.  Taken  together  these  truisms 
suggest  that  human  agents  and  social  structures 
are,  in  one  way  or  another,  theoretically 
interdependent  or  mutually  implicating  entities. 
(Wendt,  1987,  pp.  337-338) 

In  other  words,  structures  emerge  out  of  agents’ 
practices  and  vice  versa;  neither  is  reducible  to  the 
other.  Note  that  this  view  entails  a  processual 
ontology  by  which  agents  and  structures  should 
not  be  treated  as  reified  or  static  substances  but  as 
evolving  sets  of  relations  and  interplays  of  prac¬ 
tices.  As  stabilized  as  certain  patterns  may  be,  any 
practice  either  reinforces  or  undermines  the  exist¬ 
ing  order  of  things.  Agency,  the  power  to  deviate 
from  structure,  introduces  contestation  as  well  as 
contingency. 

To  grasp  what  the  mutual  constitution  of  struc¬ 
tures  and  agents  changes  for  the  study  of  world 
politics,  it  is  convenient  to  compare  constructivism 
with  rationalist  frameworks  in  IR.  The  latter 
ontology,  based  on  utilitarianism,  envisions  an 
atomistic  set  of  self-constituted  and  self-regarding 
units  whose  identity  is  given  and  fixed.  In  this 
framework,  structural  effects  are  limited  to  regu¬ 
lating  behaviors  because  units  preexist  structures. 
In  the  constructivist  view,  structures  also  affect  the 
properties  of  agents,  constituting  their  identities 
and  interests  in  the  first  place.  This  makes  for  a 
much  deeper  structural  effect:  Agents  would  not  be 
what  they  are  but  for  structural  constitution.  A 
classic  example  of  this  constitutive  effect  can  be 
found,  as  noted  by  Searle  (1995),  in  the  practice  of 
playing  chess.  The  rules  of  the  game  do  not  simply 


regulate  the  movement  of  the  pieces  on  the  board; 
they  make  the  game  possible  in  the  first  place. 
Without  these  constitutive  rules,  one  cannot  play 
chess;  at  the  same  time,  without  the  application  of 
these  rules  in  and  through  practice,  there  cannot  be 
an  intersubjective  structure  called  the  game  of 
chess.  Similarly,  the  norm  of  sovereignty  makes  the 
current  international  society  possible.  Given  the 
importance  of  constitutive  theorization,  the  next 
challenge  for  constructivists  is  methodological:  If 
intersubjective  structures  and  meaningful  practices 
make  world  politics  possible,  how  can  we  account 
for  them? 

The  Double  Hermeneutics 

Double  hermeneutics  refers  to  a  methodology 
centered  on  the  interpretation  of  interpretations.  It 
is  central  to  the  constructivist  approach  in  IR 
because  world  politics  presents  itself  as  a  reality 
that  is  already  interpreted  by  its  actors.  Since 
human  beings  are  meaning  makers,  their  interpre¬ 
tations  are  the  primary  object  of  study  for  con¬ 
structivists.  People  are  of  course  not  like  rocks: 
They  attach  meanings  to  the  different  parts  of  their 
environment  and  act  on  the  basis  of  those  mean¬ 
ings.  The  first  interpretive  moment,  then,  happens 
at  the  level  of  action.  The  second  interpretive 
moment  takes  place  at  the  level  of  observation,  as 
social  scientists  seek  to  establish  the  meanings  of 
the  practices  that  they  observe.  In  summary,  as 
Giddens  notes,  the  double  hermeneutics  captures 
the  core  of  the  social-scientific  enterprise  from  a 
constructivist  perspective,  which  is  to  develop  sci¬ 
entific  interpretations  of  lay  interpretations. 

In  the  practice  of  research,  the  notion  is  also 
reminiscent  of  the  hermeneutic  circle,  according  to 
which  making  sense  of  something  involves  relating 
the  parts  in  terms  of  the  whole  and  vice  versa. 
Hans  Georg  Gadamer  famously  compared  inter¬ 
pretation  to  a  “fusion  of  horizons,”  drawing  an 
analogy  with  the  practice  of  attaining  fluency  in  a 
second  language  to  better  translate  from  that  lan¬ 
guage  into  one’s  mother  tongue  (Mark  Neufeld, 
1993).  Because  of  interpretation,  scholars  cannot 
construe  their  task  as  simply  improving  the  match 
between  the  world  and  our  knowledge  about  it 
(i.e.,  the  correspondence  theory  of  truth).  As 
Patrick  Jackson  has  observed,  taking  the  double 
hermeneutics  seriously  entails  that  validity  does 
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not  simply  stem  from  matching  the  world  with 
words,  because  words  partake  in  constituting  the 
world.  After  all,  most  of  the  time,  it  is  language 
that  makes  intersubjectivity  possible  by  “detach¬ 
ing”  the  meaning  from  immediate  expressions  of 
subjectivity  and  representing  them  as  “facts”  exter¬ 
nal  to  agents.  If  discourse  is  constitutive  of  reality, 
then  the  social-scientific  discourse,  with  its  inter¬ 
pretation  from  afar  of  lay  interpretations,  also 
bears  tremendous  normative  consequences.  This  is 
perhaps  why  the  majority  of  IR  constructivists 
exhibit  a  strong  disposition  toward  reflexivity,  try¬ 
ing  to  make  strange  what  seems  obvious,  prob- 
lematize  the  taken-for-granted,  and  denaturalize 
alleged  universal  truths.  In  this  view,  the  task  of  IR 
scholars  is  not  to  define  which  international  threats 
are  “really  real”  and  which  ones  are  not,  for 
instance;  instead,  they  should  document  the  politi¬ 
cal  technologies  that  make  certain  threats  look 
“really  real”  to  various  publics  and  should  analyze 
the  consequences  that  ensue  for  world  politics. 

Conclusion 

This  entry  has  argued  that  constructivism  in  IR  is 
a  broad  family  of  theories  whose  distinctiveness 
stems  from  their  common  use  of  three  crucial 
notions  in  analyzing  world  politics:  inter  subjectiv¬ 
ity,  the  mutual  constitution  of  agents  and  struc¬ 
tures,  and  the  double  hermeneutics.  In  historio¬ 
graphical  terms,  constructivism  has  attained  a 
degree  of  disciplinary  legitimacy  in  IR  by  going 
through  three  stages  of  development:  (1)  an  episte¬ 
mological  moment,  which  remains  largely  incon¬ 
clusive  to  this  day;  (2)  an  ontological  moment, 
which  has  been  successfully  incorporated  by  large 
sections  of  the  discipline;  and  (3)  a  methodological 
moment,  which  is  currently  underway.  One  gen¬ 
eration  later,  the  “constructivist  turn”  has  now 
been  taken  in  IR,  and  while  the  approach  remains 
far  from  dominant  in  the  discipline,  it  cannot  be 
relegated  to  the  margins  anymore.  Constructivism 
is  evidently  here  to  stay.  So,  as  an  interim  assess¬ 
ment  20  years  on,  has  the  constructivist  project 
lived  up  to  its  promises  in  IR? 

To  be  sure,  the  fact  that  constructivism  has  not 
provided  a  unified  and  applied  theory  of  interna¬ 
tional  politics  in  the  way  that  Kenneth  Waltz’s 
neorealism  has  cannot  be  construed  as  a  failure.  In 
fact,  the  pluralism  that  characterizes  the  approach 


today  speaks  of  its  vibrancy  and  its  capacity  to 
accommodate  theoretical  debates.  Since  conversa¬ 
tions  among  scholars  hold  the  best  hopes  of  prag¬ 
matic  progress,  the  lack  of  orthodoxy  should  be 
welcome.  In  addition,  the  absence  of  one  overarch¬ 
ing  and  exclusive  theoretical  narrative  is  quite 
coherent  with  constructivist  premises,  including  its 
critical  disposition  toward  the  imposition  of  the 
order  of  things.  So  far,  so  good. 

But  that  may  well  be  where  the  good  news  ends, 
because  one  outcome  of  the  constructivist  turn  in 
IR  that  is  more  problematic,  one  could  argue,  is 
the  further  polarization  of  a  discipline  that  was 
already  suffering  from  a  particularly  debilitating 
degree  of  fragmentation.  To  caricature  a  bit,  con¬ 
structivism  has  become  so  mainstream  in  most 
parts  of  Europe  that  it  is  now  risking  redundancy: 
World  politics  is  socially  constructed — so  what? 
By  contrast,  in  many  North  American  universities, 
constructivism  has  become  eminently  suspect  and 
the  target  of  gatekeeping  strategies.  This  polariza¬ 
tion  is  obviously  very  damaging  for  interparadig- 
matic  dialogue.  While  the  responsibility  for  this 
regrettable  state  of  affairs  does  not  fall  on  con¬ 
structivism  alone,  it  is  hard  not  to  take  some  share 
of  the  blame  for  the  metatheoretical  fatigue  that 
now  plagues  the  IR  discipline.  In  this  context, 
there  may  be  a  danger,  much  to  constructivism’s 
detriment,  that  the  window  of  opportunity  for 
theoretical  rejuvenation  in  IR  that  opened  with  the 
end  of  the  Cold  War  is  now  shutting  down. 
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Containment 


The  concept  of  containment  was  presented  by  the 
American  diplomat  George  F.  Kennan  in  a  long 
telegram  to  the  U.S.  Department  of  State  on 
February  22,  1946,  and  in  an  article  published  in 
the  July  1947  issue  of  Foreign  Affairs,  signed  “X.” 
The  concept  was  intended  to  influence  the  U.S. 
policy  response  in  the  specific  strategic  context  of 
the  Cold  War,  but  with  the  collapse  of  the  Soviet 
Union  in  1991,  the  end  of  a  bipolar  world  raised 
new  questions  regarding  the  relevance  of  contain¬ 
ment — especially  as  the  George  W.  Bush  adminis¬ 
tration  began  to  target  new  kinds  of  enemies. 

The  Cold  War  and  the 

Origins  of  the  Concept 

Containment  was  adopted  by  President  Harry  S. 
Truman’s  administration  (1945-1953),  both  as  a 
doctrine  and  as  a  rationale  for  external  action,  and 
was  carried  on  by  Truman’s  successors,  especially 
Lyndon  Johnson  (1963-1969).  The  main  objective 


of  this  new  doctrine  was  to  use  military,  economic, 
and  diplomatic  means  to  oppose  what  Kennan 
depicted  as  the  Soviet  Union’s  “hegemonic”  strat¬ 
egy:  “The  main  element  of  any  United  States  policy 
toward  the  Soviet  Union  must  be  that  of  a  long¬ 
term,  patient  but  firm  and  vigilant  containment  of 
Russian  expansive  tendencies”  (X  [KennanJ,  1947, 
Part  2).  The  United  States  pursued  a  host  of  poli¬ 
cies  in  the  name  of  keeping  an  essential  commit¬ 
ment  to  prevent  the  spread  of  the  Soviet  influence 
throughout  the  world.  A  large  proportion  of  these 
actions  were  carried  out  in  Europe  and  Asia.  Some 
examples  include  support  given  to  Greece  to  fight 
against  the  “Communist  subversion”  (1947);  the 
launching  of  the  Marshall  Plan  (a  program  of 
direct  economic  aid  to  Europe);  the  promise  to 
support  “free  peoples  who  are  resisting  attempted 
subjugation  by  armed  minorities  or  by  outside 
pressures,”  as  stated  by  President  Truman  in  a 
speech  on  March  12,  1947;  the  construction  of  the 
North  Atlantic  Treaty  Organization  (NATO) 
(1949);  and  the  Korean  and  Vietnam  Wars. 

Although  declining  in  the  late  1960s  (during  the 
“detente”  promoted  by  U.S.  President  Richard 
Nixon  and  the  U.S.  Secretary  of  State  Henry 
Kissinger,  then  during  the  easing  of  tensions  with 
the  USSR  and  communist  China),  and  challenged 
by  the  more  radical  concept  of  “rollback”  (most 
notably  under  President  Eisenhower’s  secretary  of 
state  John  Foster  Dulles,  who  called  for  the  “libera¬ 
tion”  of  Eastern  Europe),  the  policy  of  containment 
continued  to  mark  the  American  foreign  policy 
landscape  until  the  end  of  the  Cold  War,  as 
President  Ronald  Reagan  (1981-1989)  sent  mili¬ 
tary  aid  to  anticommunist  movements  in 
Afghanistan,  Angola,  Cambodia,  and  Nicaragua; 
deployed  the  Pershing  II  missiles  in  Europe;  and 
promoted  the  Strategic  Defense  Initiative,  which 
would  use  ground-  and  space-based  systems  to  pro¬ 
tect  the  United  States  against  nuclear  missile  attack. 

Both  external  and  internal  factors  influenced  the 
relative  power  of  containment  as  a  doctrine.  The 
Truman  administration  had  the  advantage  of  a 
favorable  economic  context.  Soviet  Premier  Joseph 
Stalin’s  death  in  1953  allowed  U.S.  President 
Dwight  D.  Eisenhower  to  be  less  influenced  by  bal¬ 
ance  of  power  relations.  Instead,  he  focused  more 
on  cutting  government  spending  and  reducing  eco¬ 
nomic  assistance  plans  and  gave  less  priority  to 
expensive  conventional  military  deployments.  It 
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would  have  thus  been  reasonable  to  deduce  that 
the  end  of  the  Cold  War  (1989-1991)  would  have 
marked  the  official  end  of  the  United  States’  reli¬ 
ance  on  containment  policy.  Once  America’s 
40-year  enemy  disappeared,  the  expectation  was 
that  a  “brave  new  world”  would  pay  “dividends  of 
peace”  and  render  obsolete  an  expensive  contain¬ 
ment  policy.  But  20  years  after  the  fall  of  the  Berlin 
Wall,  there  are  more  than  700  operable  U.S.  mili¬ 
tary  bases,  and  approximately  370,000  American 
troops  remain  deployed  in  more  than  150  coun¬ 
tries.  Nevertheless,  fears  continue  to  be  expressed 
about  the  need  to  circumscribe  emerging  peer 
competitors,  new  global  enemies,  and  new  threats. 

Post-Bipolar  Containments? 

If  the  rhetoric  of  containment  has  officially  disap¬ 
peared  with  the  demise  of  the  Soviet  Union,  its 
spirit  can  be  found  in  the  targeting  of  new 
“threats”  by  the  United  States,  especially  in  three 
main  categories:  rogue  states,  peer  competitors, 
and  ambiguous  entities. 

Rogue  States 

Iraq  was  the  subject  of  a  containment  policy 
between  the  1991  Gulf  War  (following  its  invasion 
of  Kuwait)  and  the  attacks  on  the  United  States  in 
September  2001.  Severe  sanctions  were  applied, 
United  Nations  weapons  inspections  were  imposed, 
U.S.  troops  and  air  patrols  in  the  Persian  Gulf  were 
deployed,  and  the  economic  weakening  and  politi¬ 
cal  isolation  of  Iraq  was  put  into  place.  The  unique¬ 
ness  of  the  Iraqi  example  stems  from  the  fact  that  it 
led  to  a  radical  change  of  course  when  the  adminis¬ 
tration  of  George  W.  Bush  opted  for  a  policy  of 
regime  change  via  military  action  in  2003  and  top¬ 
pled  Saddam  Hussein.  This  raised  questions  about 
whether  a  similar  fate  awaited  Iran  and  North 
Korea,  the  two  other  states  (besides  Iraq)  character¬ 
ized  in  Bush’s  2002  State  of  the  Union  address  as 
the  “axis  of  evil”  and  potential  nuclear  threats. 

Peer  Competitors 

A  new  containment  of  China  was  hinted  at  (but 
officially  denied)  by  U.S.  Secretary  of  State 
Condoleezza  Rice  as  she  paid  tribute  to  Kennan 
(March  2005  in  Tokyo)  following  his  death.  The 


context  was  the  elaboration  of  a  new  logic  of 
maintaining  alliances,  along  with  a  mentioning  of 
the  need  for  China  to  play  “a  positive  role.”  In 
2009,  the  United  States  had  military  bases  in  South 
Korea,  Japan,  and  Afghanistan,  much  as  it  pro¬ 
vided  military  equipment  to  South  Korea,  Japan, 
India,  and  Taiwan  (whose  security  is  guaranteed 
by  Washington).  Even  after  the  election  of  Barack 
Obama  to  the  presidency  in  November  2008,  dec¬ 
larations  by  the  new  Secretary  of  State,  Hillary 
Clinton,  evoked  the  need  to  diminish  China’s 
global  economic  influence.  Russia,  after  invading 
neighboring  Georgia  in  August  2008,  was  also 
considered  to  be  somewhat  of  a  threat  requiring 
containment:  Moscow  felt  directly  targeted  by  the 
installation  of  a  U.S.  missile  defense  shield  in 
Poland  and  the  Czech  Republic  (even  though 
Washington  insisted  its  official  purpose  was  to 
counter  “rogue  states”  such  as  Iran). 

Ambiguous  Entities 

A  relevant  innovation  following  the  terrorist 
attacks  on  September  11,  2001,  was  the  effort  by 
the  United  States  to  drag  its  allies  into  a  third  type 
of  containment  against  “terror.”  Identified  by 
Washington  as  the  new  global  security  priority 
since  2001,  the  “war  on  terror”  became  a  massive 
endeavor  to  counter  political  or  religious  groups 
and  movements — even  social,  economic,  or  cultural 
practices — with  potential  links  to  terrorism.  With 
efforts  concentrated  mostly  in  the  Middle  East  and 
the  Muslim  world  (in  Iraq,  then  Afghanistan;  in  the 
Gulf  but  also  in  Central  Asia,  Southeast  Asia,  and 
Africa),  the  United  States  has  been  deploying  troops 
and  military  bases,  fighting  against  social  and 
financial  networks,  and  working  to  change  political 
regimes  and  to  control  transnational  flows. 

The  revelation  found  in  this  new  type  of  con¬ 
tainment  lies  in  the  nature  of  the  enemy  targeted. 
With  neither  a  clear  political  nucleus  nor  a  defined 
territory  (even  if  linked  to  the  Muslim  world  in 
American  rhetoric),  the  “terrorist”  entity  hardly 
conforms  to  Kennan’s  initial  containment  theory, 
which  was  proposed  to  counter  the  Komintern  and 
which  included  cutting  diplomatic  relations  and  all 
exchanges  with  the  state  in  question.  Still, 
Washington’s  insistence  on  organizing  the  eradica¬ 
tion  of  both  “terror”  and  its  state  and  nonstate 
sponsors  recalls  the  initial  containment  doctrine 
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and  raises  the  question  of  whether  the  concept  of 
containment  is  enduring,  relevant  only  to  a  specific 
time,  or  altogether  obsolete. 

Containment  Versus  Engagement 

Was  containment  policy  efficient?  And  can  it  be 
effective  today?  In  his  1947  article,  Kennan  advo¬ 
cated  “the  adroit  and  vigilant  application  of  coun¬ 
terforce  at  a  series  of  constantly  shifting  geographi¬ 
cal  and  political  points”  by  the  United  States.  He 
believed  that  a  policy  of  containment  would  add  to 
internal  pressures  on  the  Kremlin  that  would  ulti¬ 
mately  lead  to  “the  breakup  or  the  gradual  mellow¬ 
ing  of  Soviet  power.”  Because  the  Soviet  Union  col¬ 
lapsed  in  1991,  the  containment  policy  would  seem 
to  have  proven  successful.  Still,  the  approach  was 
controversial  from  the  beginning  and  remains  so. 

In  the  1940s,  the  lauded  columnist  Walter 
Lippmann  put  forth  a  harsh  criticism  against  the 
“X”  article  for  failing  to  differentiate  between  vital 
and  peripheral  American  interests.  Moreover,  the 
debate  persists  on  whether  the  end  of  the  Cold 
War  was  the  direct  result  of  U.S.  foreign  policy 
strategies  or  whether  it  owes  more  to  complex 
interaction  between  domestic  and  global  economic 
and  social  factors. 

The  containment  strategy  now  appears  to  be 
outdated  for  several  reasons.  First,  containing  one 
specific  enemy  in  a  globalized  world  is  an  illusory 
goal — the  possibilities  of  circumventing  any  attempts 
at  isolation  are  now  virtually  infinite.  Second,  con¬ 
tainment  has  been  used  almost  exclusively  by  the 
United  States  and  its  allies,  and  it  led  to  a  counter¬ 
productive  result:  States  considered  to  be  the  main 
enemies  of  the  United  States  may  gain  solidarity 
from  other  “protest  diplomacies”  and  the  support 
of  public  opinion  (mostly  in  the  South).  Third, 
political  and  economic  isolation  has  proved  useless, 
and  even  dangerous,  in  an  interdependent  world, 
notably  when  the  targeted  country  or  entity  has 
already  acquired  enough  potential  to  cause  havoc. 

In  contrast,  dialogue  and  diplomatic  commit¬ 
ment  have  emerged  as  more  serious  means  of  deal¬ 
ing  with  difficult  interlocutors.  Containment  of 
China,  Iran,  North  Korea,  Venezuela — even  Hamas 
or  Hezbollah — would  indeed  be  hazardous  from 
material,  social,  political,  and  military  perspectives. 
Finally,  the  post-bipolar  structure  of  the  interna¬ 
tional  system  no  longer  allows  for  such  an  approach; 


for  containment  was  forged  in  a  different  interna¬ 
tional  system  with  one  permanent  peer  competitor 
in  mind  and  not  conceived  for  fast-changing  and 
asymmetrical  targets  in  a  turbulent  world. 

The  formal  temptation  of  a  containment  policy  is 
still  present  and  possible.  Its  reemergence  is  possible 
in  the  context  of  a  new  global  competition  between 
the  United  States  and  China.  However,  in  material, 
political,  and  social  terms,  containment  is  no  longer 
tenable.  Classical  in  its  conception  (a  struggle  for 
survival  against  an  enemy  in  a  balance-of-power 
competition),  ambitious  in  its  implementation,  and 
influential  among  U.S.  allies,  the  concept  of  contain¬ 
ment  is  a  remnant  of  the  Cold  War  whose  transla¬ 
tion  to  modern  times  is  a  troubled  one. 
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Contingency  Theory 


Universalist  theories  of  organization  say  that  the 
highest-performing  organizations  have  maximum 
levels  on  the  variables  that  compose  their  structure, 
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such  as  maximum  decentralization  of  decision  mak¬ 
ing.  In  contrast,  contingency  theory  says  that  the 
highest-performing  organizations  have  levels  on 
their  structural  variables  that  fit  the  levels  on  their 
contingency  variables — that  is,  the  level  of  decen¬ 
tralization  that  fits  the  level  of  organizational  size. 
Thus,  the  optimal  structural  level  is  seldom  the 
maximum  level,  and  it  is  not  universal  across  all 
organizations.  Rather,  the  level  of  the  structure  that 
causes  highest  performance  is  contingent  on  the 
contingency  variables  (or  factors).  In  other  words, 
the  contingency  variable  moderates  the  effect  of 
structure  on  performance.  Performance  is  high  only 
if  structure  fits  the  contingency  variable.  The  more 
that  structure  misfits  the  contingency  variable,  the 
lower  is  performance  in  consequence.  This  entry 
discusses  the  intellectual  backgrounds  of  contin¬ 
gency  theory  and  its  application  to  the  public  sector. 

Key  Variables  for  Performance 

Specialization,  formalization,  decentralization, 
and  divisionalization  are  some  of  the  variables  of 
organizational  structure  that  have  contingent 
effects  on  performance.  Some  of  the  contingency 
variables  of  organizational  structure  are  size, 
diversification,  and  task  uncertainty.  For  instance, 
size  is  a  contingency  of  decentralization,  in  that  the 
larger  the  organization  is  (i.e.,  the  more  members 
it  has),  the  more  complex  are  the  issues  in  manag¬ 
ing  it;  therefore,  top  management  cannot  make  all 
the  decisions,  and  some  must  be  delegated  down 
the  managerial  hierarchy.  This  reduces  the  number 
of  levels  in  the  hierarchy  through  which  informa¬ 
tion  must  pass  and  brings  decision  making  closer 
to  the  bottom  level,  which  interacts  with  custom¬ 
ers  or  clients  and  which  produces  products. 

As  another  instance,  diversification  is  a  contin¬ 
gency  of  divisionalization.  Undiversified  (i.e.,  sin¬ 
gle  product  or  service)  organizations  can  be  effec¬ 
tively  organized  as  structures  in  which  the  main 
subunits  reporting  to  the  chief  executive  officer 
(CEO)  are  functions.  In  contrast,  diversified  (i.e., 
multiple  product  or  service)  organizations  can  be 
effectively  organized  as  structures  in  which  the 
main  subunits  reporting  to  the  CEO  are  divisions, 
each  containing  its  own  operating  functions  (e.g., 
sales  and  production),  so  that  each  is  an  autono¬ 
mous  organization.  In  this  way,  diversification  is 
also  a  contingency  of  decentralization. 


Hence,  both  size  and  diversification  are  contin¬ 
gencies  of  decentralization.  Thus,  some  structural 
variables  have  more  than  one  contingency  variable 
they  need  to  fit  to  have  high  performance. 

Task  Uncertainty 

Task  uncertainty  is  a  contingency  of  formalization, 
specialization,  and  decentralization.  Where  the  task 
has  high  certainty,  rules  (formalization)  can  be  for¬ 
mulated  that  provide  guidance  for  effective  deci¬ 
sions.  The  preprogramming  of  decisions  also  allows 
jobs  to  be  narrowly  defined  and  hence  highly  spe¬ 
cialized.  In  contrast,  where  tasks  are  highly  uncer¬ 
tain,  it  is  not  possible  to  formulate  rules  that  provide 
guides  for  effective  decisions,  and  jobs  cannot  be 
narrowly  defined;  hence  formalization  and  special¬ 
ization  need  to  be  low.  Instead,  more  highly  quali¬ 
fied  and  skilled  employees  need  to  be  empowered  to 
make  some  of  the  decisions,  so  that  decentralization 
is  higher.  The  higher  the  task  uncertainty,  the  lower 
the  formalization  and  specialization  and  the  higher 
the  decentralization  need  to  be  in  order  to  fit  the 
task  to  the  uncertainty  contingency. 

Uncertainty  enters  tasks  in  a  major  way  when 
organizations  seek  to  innovate,  such  as  by  creating 
new  products  or  services  or  by  using  new  technolo¬ 
gies  for  producing  its  products  or  services. 
Organizations  repeatedly  producing  the  same 
products  and  services  are  fitted  with  a  mechanistic 
structure,  which  is  high  on  formalization,  special¬ 
ization,  and  centralization.  Organizations  innovat¬ 
ing  their  products  and  services  are  fitted  with  an 
organic  structure,  which  is  low  on  formalization, 
specialization,  and  centralization.  Innovating  orga¬ 
nizations  also  need  more  interaction  between  their 
functional  departments.  This  requires  lateral  coor¬ 
dination  mechanisms  such  as  cross-functional 
project  teams  headed  by  integrators  (project  lead¬ 
ers  independent  of  the  functions  from  which  the 
team  members  are  drawn).  In  contrast,  noninno¬ 
vating  organizations  can  adequately  manage  the 
lesser  interaction  between  their  functional  depart¬ 
ments  by  hierarchy  and  planning. 

Fits  and  Misfits 

Some  form  of  contingency  theory  conceives  of  the 
fits  as  configurations  or  gestalts — that  is,  as  bun¬ 
dles  of  structural  variables  where  the  levels  of  the 
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structural  variable  are  discrete.  For  instance,  an 
organization  that  is  small  is  fitted  by  a  simple 
structure  that  is  low  on  formalization,  while  a 
large  organization  is  fitted  by  being  high  on  for¬ 
malization.  These  fitting  levels  of  formalization 
differ  considerably  on  any  scale  quantifying  for¬ 
malization.  The  intermediary  levels  of  formaliza¬ 
tion  between  low  and  high  are  all  misfits  that  cause 
low  performance.  Hence,  an  organization  will  tend 
not  to  remain  at  intermediary  levels  of  formaliza¬ 
tion.  Therefore,  organizations  make  quantum 
leaps  from  low  to  high  formalization  but  do  so 
only  infrequently.  This  is  also  the  case  for  other 
structural  variables  such  as  specialization  and 
decentralization,  so  that  organizations  are  com¬ 
posed  of  bundles  of  such  structural  variables — the 
configurations  or  gestalts.  Empirically,  analysts 
quantify  misfit  by  calculating  the  exact  score  on 
each  structural  variable  that  is  the  ideal  for  that 
configuration  and  then  measure  the  distance  of 
each  organization  from  that  ideal. 

In  contrast,  one  form  of  contingency  theory 
conceives  of  the  fits  in  a  Cartesian  manner.  The 
contingency  and  structural  variables  are  both  con- 
tinua  and  so  are  their  fits.  There  is  a  line  of  fitting 
points  such  that  each  level  of  the  contingency  vari¬ 
able  has  a  level  of  the  structural  variable  that  is  its 
fit.  For  instance,  size  and  formalization  are  both 
continuous  variables,  being  the  contingency  and 
structural  variables,  respectively.  Small  size  is  fit¬ 
ted  by  low  formalization  and  high  size  by  high 
formalization  (as  in  configuration  theory),  but  also 
medium  size  is  fitted  by  medium  formalization, 
and  all  the  intermediary  levels  of  size  are  fitted  by 
intermediary  structural  levels  (unlike  in  configura¬ 
tion  theory).  Thus,  there  are  many  fits  in  Cartesian 
contingency  theory,  forming  the  fit  line.  Therefore, 
a  growing  organization  can  readily  move  from  one 
level  of  structure  that  fits  it,  to  the  next  level.  Thus, 
the  fit  line  is  a  set  of  stepping  stones  that  allow 
organizations  to  change  incrementally  and  gradu¬ 
ally,  thereby  possibly  growing  from  small  and 
unformalized  to  medium  sized  and  medium  for¬ 
malized  or,  eventually,  to  large  and  highly  formal¬ 
ized.  Empirically,  analysts  quantify  misfit  by  locat¬ 
ing  the  fit  line,  from  theory  or  data,  and  then 
measuring  the  distance  of  each  organization  from 
that  fit  line. 

The  internal  distribution  of  power  between 
departments  at  the  same  hierarchical  level  addresses 


an  aspect  of  the  politics  within  organizations.  The 
strategic  contingencies  theory  of  intraorganiza- 
tional  power  says  that  the  department  that  success¬ 
fully  deals  with  the  key  challenge  to  the  organiza¬ 
tion  will  become  the  most  powerful  department. 
However,  departments  become  powerful  only  if 
they  are  nonsubstitutable,  so  that  the  organization 
is  dependent  on  them.  For  example,  in  brewing 
firms,  where  marketing  was  severely  constrained 
by  government  regulations  about  pricing  and  the 
like,  financial  success  came  to  those  whose  manu¬ 
facturing  departments  produced  at  low  cost. 
Consequently,  these  manufacturing  departments 
received  more  of  the  budget  in  their  firms  and  also 
controlled  decisions  pertaining  to  their  boundaries 
with  other  departments — for  example,  the  packag¬ 
ing  department.  In  other  industries,  powerful 
departments  such  as  these  have  been  shown  to  sup¬ 
ply  the  next  CEO. 

Contingency  theory  is  functionalist  in  that  it 
explains  the  structures  that  are  adopted  as  being 
those  that  fit  the  contingency  and  so  produce 
higher  performance.  It  is  also  positivist  in  that  it 
features  quantitative  evidence  and  general  models, 
in  which  factors  of  the  situation,  such  as  the  envi¬ 
ronment  or  size,  determine  the  structure,  so  that 
there  is  a  contingency  imperative.  Some  critics 
assert,  in  contradistinction,  that  managers  exercise 
free  will,  unconstrained  by  contingency  variables. 
Others,  more  moderately,  hold  that  organizations 
whose  level  of  the  contingency  variable  have 
changed  and  so  are  misfitted  by  the  existing  struc¬ 
ture  nevertheless  retain  it  until  there  is  a  crisis  of 
poor  organizational  performance,  when  they  move 
into  fit.  Again,  there  is  an  argument  that  organiza¬ 
tions  can  regain  fit  by  adjusting  the  contingency 
variable  to  fit  the  structure.  However,  empirically, 
organizations  usually  change  their  structure  to  fit 
the  contingency,  so  there  is  a  contingency  impera¬ 
tive. 

Overall,  there  has  been  much  empirical  research 
supporting  contingency  theory,  across  organiza¬ 
tions  of  many  types,  industries,  and  nationalities. 
However,  some  issues,  such  as  configuration  ver¬ 
sus  Cartesiansim,  or  contingency  imperative  versus 
free  choice,  remain  in  contention,  with  evidence 
for  and  against. 

Traditionally  in  contingency  theory,  fit  is  seen 
as  iso-performance,  meaning  that  the  high  level  of 
performance  produced  by  the  fit  of,  for  instance, 
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low  formalization  to  small  size  is  the  same  (hence 
“iso-”)  as  the  high  level  of  performance  produced 
by  the  fit  of  high  formalization  to  large  size.  But 
this  would  provide  no  incentive  for  firms  to  grow 
and  increase  their  level  of  formalization.  In  con¬ 
trast,  hetero-performance  says  that  the  level  of 
performance  produced  by  the  fit  of,  for  instance, 
high  formalization  to  large  size  is  greater  than  the 
performance  produced  by  the  fit  of  low  formaliza¬ 
tion  to  small  size.  Thus,  organizations  have  an 
incentive  to  grow  and  increase  their  level  of  for¬ 
malization.  Whereas  in  the  traditional  iso-perfor¬ 
mance  theory,  the  contingency  variable  plays  the 
role  of  only  being  a  moderator  of  the  effect  of 
structure  on  performance,  hetero-performance  the¬ 
ory  sees  the  contingency  variable  as  also  making  a 
direct,  positive  contribution  to  performance.  While 
hetero-performance  has  logical  appeal,  it  is  at  the 
present  time  only  a  theoretical  conjecture  that  has 
yet  to  be  proven.  Moreover,  some  contingency 
variables  are  beyond  managerial  control,  such  as 
uncertainty  in  the  environment,  so  that  the  envi¬ 
ronmental  contingency  can  increase  its  level  with¬ 
out  any  action  by  managers  in  an  organization  or 
any  incentive  on  them  to  do  so.  Thus,  while  some 
contingency  effects  are  probably  hetero-perfor¬ 
mance,  some  are  iso-performance. 

Traditionally  again,  contingency  theory  holds 
that  underfit  has  the  same  effect  as  overfit,  in  that 
too  little  structure  (e.g.,  formalization)  for  a  given 
level  of  the  contingency  (e.g.,  size)  decreases  per¬ 
formance  by  the  same  amount  as  too  much  struc¬ 
ture.  However,  it  has  been  argued  that  underfit 
produces  worse  performance  in  that  the  organiza¬ 
tion  is  unable  to  attain  its  goals,  whereas  in  overfit 
the  organization  can  attain  its  goals  but  just 
incurs  the  unnecessary  costs  of  an  overelaborate 
structure. 

Research  is  ongoing  in  contingency  theory,  giv¬ 
ing  rise  to  new  theoretical  developments  and 
empirical  findings.  It  represents  a  continuing  pat¬ 
tern  of  development  in  the  study  of  organizations 
in  both  the  public  and  private  sectors. 
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Contract  Theory 


The  expression  contract  theory  refers  to  a  large 
variety  of  conceptions  in  social  sciences.  In  the 
vast  literature  dedicated  to  the  topic  in  economics, 
contract  theory  consists  of  the  study  of  microeco¬ 
nomic  relations  between  agents  within  a  market. 
In  law,  contract  theory  is  equally  important  and 
pertains  to  the  analysis  of  contract  law.  But  the 
most  dominant  usage  of  the  expression  contract 
theory  is  political  and  this  is  also  the  sense  that 
will  apply  in  this  entry. 

Within  the  realm  of  political  theory,  contract 
theory  refers  to  a  set  of  philosophical  ideas,  mostly 
developed  since  the  17th  century,  the  goal  of 
which  is  to  offer  a  rational  understanding  of  the 
creation  of  society  and  political  powers.  It  also 
induces  a  normative  reflection  on  the  legitimacy  of 
powers.  A  subset  of  preoccupations  derives  from 
this  broad  consideration  and  regards  as  crucial  for 
political  theory  topics  such  as  the  place  of  religion 
in  a  collectivity,  the  question  of  the  sovereignty, 
the  respective  role  of  the  people  and  the  prince, 
and  basic  rights  and  liberties.  Beyond  the  borders 
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of  political  thought,  contractarian  theoreticians 
have  often  addressed  issues  relevant  to  psychology, 
anthropology,  law,  and  economics. 

The  core  of  contract  theory,  in  this  political 
sense,  is  the  fundamental  intuition  that  all  social 
dealings  are  based  on  an  initial  agreement  between 
individuals  to  belong  to  the  same  society  and  to 
obey  certain  designated  authorities.  Hence  the  idea 
of  a  “contract”  at  the  root  of  social  existence — 
often  known  as  the  “social  contract”  after  Jean- 
Jacques  Rousseau’s  famous  book,  On  the  Social 
Contract ,  published  in  1762.  According  to 
Rousseau  (1712-1778),  the  ties  that  bind  individu¬ 
als  within  a  society  are  the  outcome  of  a  contract 
between  them  that  was  initiated  at  the  beginning 
of  this  very  society.  Regarding  the  contract  itself,  it 
is  a  deliberate  decision  of  each  individual  who  sees 
the  value  of  being  engaged  in  society  where  rules 
limit  liberty  rather  than  being  left  alone  in  total 
freedom.  This  is  why  contract  theories  are  all 
based  on  the  idea  of  a  reasoning  individual  able  to 
recognize  the  greater  benefits  of  society. 

Whether  this  contract  is  regarded  by  authors  as 
actually  having  been  sealed  in  the  course  of  history 
or  whether  it  is  for  them  only  a  fruitful  hypothesis, 
the  basic  principle  of  contract  theory  is  to  regard 
society  as  the  result  of  such  a  pact  between  indi¬ 
viduals.  Further,  contract  theory  purports  struc¬ 
tural  consequences  such  as  individual  rights  and 
liberty.  Indeed,  if  individuals  had  no  rights,  they 
could  not  logically  engage  in  a  contract  (they 
would  literally  not  have  the  right  to  do  so).  And  if 
the  contract  they  are  engaged  in  did  not  guarantee 
some  rights  or  liberty — in  other  terms,  if  social 
existence  meant  the  total  alienation  of  individu¬ 
als — it  would  be  impossible  to  understand  why 
individuals  would  have  chosen  to  submit  to  such 
an  unfair  contract  and  on  what  basis  it  should 
subsist. 

Contract  theory  is  quintessentially  modern.  It 
reflects  the  decisive  evolution  of  Western  political 
thought  from  the  16th  century  on  toward  an  “indi¬ 
vidualistic”  understanding  of  society  and  politics  for 
which  any  collectivity  is  regarded  as  the  distant  even¬ 
tual  result  of  individual  decisions.  As  a  matter  of 
fact,  contract  theory  is  not  the  only  modern  political 
conception  able  to  accommodate  individualism.  But 
it  has  been  one  of  the  most  prominent:  If  the  rise  of 
revolutionary  doctrines  challenged  the  contractarian 
approach  in  the  19th  and  20th  centuries,  it  never 


totally  superseded  it.  It  then  gained  renewed  atten¬ 
tion  after  the  works  of  the  philosopher  John 
Rawls  (1921-2002)  in  the  1970s,  which  insisted 
on  the  necessity  of  having  common  principles  of 
justice  agreed  on  in  a  well-ordered  society.  His 
influential  Theory  of  Justice,  published  in  1971, 
claims  the  heritage  of  the  tradition  of  the  social 
contract  and  has  had  a  considerable  influence, 
which  still  continues  to  feed  contemporary  politi¬ 
cal  thought. 

Normative  Diversity  and  the 
Intellectual  Unity  of  Contract  Theory 

The  term  contract  theory  did  not  come  into  wide¬ 
spread  use  until  the  20th  century,  and  the  expres¬ 
sion  should  be  considered  as  a  flexible  label  by 
which  to  categorize  a  tradition  of  thinkers  among 
whom  influences  are  easily  spotted,  even  across 
centuries,  but  who  are  also  very  different.  Their 
very  understanding  of  what  the  “contract”  at  the 
origin  of  society  is,  the  depiction  of  its  nature,  and 
the  role  it  plays  in  their  conceptions  vary  greatly 
among  them.  This  lack  of  unity  is  obvious  with 
regard  to  the  kind  of  normative  judgments  over 
politics  it  has  supported  among  the  different  “con¬ 
tractarian”  philosophers  and  political  thinkers. 
Positions  have  ranged  from  justifying  royal  abso¬ 
lutism  (Thomas  Hobbes,  17th  century)  to  leaning 
on  a  republican  understanding  of  just  politics 
(Rousseau,  18th  century)  or  accommodating  a 
“liberal”  philosophy  (Rawls,  20th  century). 

Although  one  cannot  disregard  the  fundamental 
differences  and  oppositions  that  appear  between 
contractarian  philosophies,  it  would  be  a  serious 
mistake  to  ignore  their  underlying  unity.  This  is 
not  to  be  found  in  what  they  advocate  politically 
but  rather  in  a  relatively  homogeneous  pattern  of 
ideas.  The  most  constant  one  is  an  inquiry  into  the 
legitimacy  of  powers.  These  are  no  longer  consid¬ 
ered  as  imposed  by  nature  or  by  God.  Contractarian 
theories  provide  another  explanation:  We  live  in  a 
collectivity  because  we  want  it.  We  obey  laws 
because  we  want  them.  Society  and  authority  are 
the  reflection  of  our  will,  if  only  implicitly. 
Legitimate  powers  are  those  that  respect  the  condi¬ 
tions  of  this  will. 

This  does  not  mean  that  obeying  laws  and 
respecting  more  generally  the  constraints  that  any 
social  life  carries  are  dependent  on  the  fantasy  of 
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its  members  or  that  we  can  be  free  from  our  obli¬ 
gations  “at  will.”  Here  the  theme  of  the  contract  is 
precisely  to  signify  that  there  is  no  contradiction 
between  binding  obligations  and  the  personal  will 
to  enter  them.  We  are  bound  to  obey  the  rules  in 
society,  just  as  in  any  contract  we  engage  willingly 
to  perform  services  and  duties  that  we  are  not  free 
to  disengage  from  without  a  penalty. 

The  emergence  of  social  contract  theories  at  the 
end  of  the  1 6th  century  should  be  understood  in  its 
historical  context  as  the  progressive  awareness  of  a 
deep  contradiction  regarding  the  modern  condi¬ 
tion  of  politics.  How  is  it  that  human  beings  both 
accept  and  reject  authority,  as  shown  by  the  blood¬ 
shed  of  the  religious  wars  on  the  continent  and  the 
civil  war  in  England?  How  is  it  possible  to  have 
people  live  together,  while  wars  and  revolutions 
give  the  constant  example  of  what  the  philosopher 
Immanuel  Kant  (1724-1804)  calls,  in  his  1784 
Idea  for  a  Universal  History  From  a  Cosmopolitan 
Point  of  View  (fourth  thesis),  “the  unsociable 
sociability  of  man”?  Only  a  sound  understanding 
of  what  human  beings  should  accept  in  political 
matters — in  other  words,  only  a  rational  account 
of  the  legitimacy  of  power — could  provide  under¬ 
standing  and  perhaps  a  solution  to  the  evils  of  the 
time  and  grant  a  more  stable  future.  This  is  ulti¬ 
mately  the  promise  of  contract  theories. 

The  historical  situation  is  not  in  itself  the  sole 
cause  of  such  interrogations.  After  all,  the  course 
of  history  has  always  been  a  run  of  blood  and  mis¬ 
ery.  But  the  wakening  of  modern  times  makes  it 
more  intolerable  intellectually.  More  and  more,  a 
new  spirit  inherited  from  the  Renaissance  values 
reason  as  a  tool  for  improvement  and  insists  on  the 
equality  in  reason  of  human  beings.  This  new  sense 
of  progress  clashes  with  the  dark  situation  of 
Europe  in  the  17th  century  and  the  harshness  of 
social  existence  in  the  18th.  The  new  sense  of 
equality  is  at  odds  with  the  hierarchical  orders  of 
the  times.  To  these  contradictions,  contract  theo¬ 
ries  have  been  the  very  much  needed  intellectual 
response. 

The  sense  of  novelty  in  contract  theories  is 
expressed  in  the  constant  rejection  of  conceptions 
of  political  life  that  we  can  label  as  “traditional” 
and  that  considered  society  as  natural.  No  one 
more  explicitly  than  Hobbes  (1588-1679)  in  the 
first  pages  of  his  De  Cive  (The  Citizen)  has  rejected 
this  interpretation  of  society,  inherited  from 


Aristotle  (and  more  generally  from  the  whole  of 
antiquity),  for  whom — man  being  a  “sociable  ani¬ 
mal” — any  human  being  seeks  association  with 
others  in  a  city.  For  Hobbes,  it  is  a  delusion  or  a  lie: 

The  greatest  part  of  those  men  who  have  written 
ought  concerning  Commonwealths,  either 
suppose,  or  require  us,  or  beg  of  us  to  believe, 
That  Man  is  a  Creature  born  fit  for  Society 
[  .  .  .  J  which  Axiom,  though  received  by  most,  is 
yet  certainly  False.  (I,  chap.  1,  para.  2) 

Hobbes  thinks  that  the  prominence  of  strife  and 
conflicts  over  benevolence  and  enjoyment  in 
human  ties  belies  the  idea  that  society  is  about 
cooperation.  He  is  drawn  to  the  conclusion,  which 
is  shared  by  the  whole  contractarian  tradition,  is 
that  if  people  live  together,  it  is  not  out  of  a  natural 
instinct.  It  is  because  they  commit  themselves  to 
overcome  the  natural  difficulty  of  togetherness. 
This  commitment  takes  the  form  of  a  voluntary 
contract  at  the  root  of  social  existence. 

History 

It  has  sometimes  been  argued  that  the  ancient 
Greeks  and  Romans  recognized  some  form  of 
social  contract.  It  is  difficult  to  concur  with  this 
view.  The  idea  that  cities  had  to  start  at  some  point 
with  the  gathering  of  a  handful  people  was  not 
foreign  to  ancient  philosophers,  it  is  true.  For 
instance,  Plato  (ca.  427-ca.  347  BCE),  in  the  third 
book  of  his  Republic,  relates  the  beginning  of  a 
city.  But,  he  insists,  it  is  the  consequence  of  the  fact 
that  “human  beings  aren’t  self-sufficient”  (369b); 
like  other  philosophers  in  antiquity,  he  pictures  the 
beginning  of  societies  (or  rather  cities)  as  the  prime 
accomplishment  of  human  nature,  which  would  be 
incomplete  otherwise.  Contractarian  theorists  have 
a  radically  opposite  point  of  view.  For  them, 
human  beings  are  initially  self-sufficient.  However, 
this  autonomy  leads  to  unbearable  conflicts  and 
threatens  their  survival.  Rather  than  being  their 
natural  destiny,  society  is  their  second-best  choice 
(through  a  contract)  to  avoid  extinction.  This 
emphasis  among  contractarian  theorists  on  the 
individual  is  another  incompatibility  of  ancient 
theories  for  which  the  individualization  of  behav¬ 
iors  is  the  end  of  a  well-formed  city;  as  Plato  sees 
it,  for  instance,  the  path  to  tyranny  starts  when 
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“everyone  has  the  right  to  do  as  he  chooses” 
( Republic ,  557b).  In  contractarian  philosophy,  the 
individual  freedom  of  choice  is  the  very  basis  of  the 
contract  that  gives  way  to  collective  existence. 

It  is  not  necessary  to  multiply  quotations  and 
examples  to  realize  that  no  Greek  or  Latin  author 
would  go  so  far  as  to  consider  that  society  was 
based  on  a  contract  between  its  members.  For 
them,  society  was  part  of  the  natural  order  of  the 
universe  and  nature  provided  a  spontaneous  impe¬ 
tus  to  collective  life.  Even  when  the  idea  is  devel¬ 
oped  that  social  life  is  organized  by  conventions 
among  human  beings — an  important  claim  by  the 
sophists  that  is  vigorously  contested  by  authors 
such  as  Plato  and  Aristotle  (384-322  BCE) — the 
role  of  these  conventions  is  not  quite  similar  to  a 
proper  social  contract.  The  sophists  think  that 
conventions  organize  the  collective  life  of  human 
beings.  Unlike  contractarian  theorists  centuries 
after  them,  they  never  consider  that  they  could 
have  established  it.  Moreover,  for  the  sophists, 
social  conventions  are  mostly  arbitrary  or  delu¬ 
sional,  while  for  contractarian  theorists,  the  con¬ 
vention  between  the  members  of  a  society  is  a 
rational  one,  based  on  the  ability  of  human  beings 
to  make  informed  decisions  and  bind  themselves 
through  promises. 

If  we  are  to  consider  the  influence  of  past  theo¬ 
ries  on  the  emergence  of  contractarian  theories,  the 
legitimate  reference  should  be  not  to  antiquity  but 
to  social  and  political  practices  that  developed 
later  with  the  organization  of  feudal  societies, 
which  had,  despite  their  own  diversity,  one  remark¬ 
able  characteristic:  At  its  core,  feudalism  is  the 
contract  between  a  vassal  and  his  lord.  This  con¬ 
tract  states  reciprocal  obligations:  the  vassal  serves, 
the  lord  protects.  Eventually,  feudal  societies 
become  shaped  by  a  tight  net  of  reciprocal  obliga¬ 
tions  properly  chartered  as  contracts. 

This  conception  has  led  to  spectacular  practices, 
well  beyond  the  Middle  Ages.  For  instance,  at  the 
highest  level  of  politics,  contractualization  was 
used  in  the  16th  century  by  Henry  VIII  and 
Elizabeth  I  through  a  policy  called  “surrender  and 
regrant”:  By  contract,  the  rebel  English  lords  in 
Ireland  had  to  surrender  their  lands  to  the  crown 
and  receive  them  back  in  exchange  for  their  sub¬ 
mission,  obtaining  new  extended  rights  of  prop¬ 
erty  they  did  not  possess  previously.  This  very  idea 
will  find  an  outstanding  philosophical  extension  in 


Rousseau’s  Social  Contract:  According  to  him  (as 
it  is  for  John  Locke  before  him  and  Kant  after),  the 
effect  of  the  original  contract  is  precisely  to  turn 
factual  possession  into  legitimate  property  with 
the  agreement  of  all. 

Another  (famous)  example,  the  Mayflower 
Compact,  showcases  the  continued  taste  in  the 
17th  century  for  contractualization  as  a  gover¬ 
nance  tool,  even  at  a  microlevel.  Less  than  2 
decades  after  the  death  of  Elizabeth,  a  group  of 
settlers  navigate  to  the  new  world — they  later 
become  the  American  icon  of  the  colonization  of 
the  continent  under  the  name  “the  Pilgrim  Fathers.” 
On  the  ship,  the  Mayflower ,  dissent  provokes  a 
serious  crisis.  To  solve  it,  the  passengers  decide  to 
“Covenant  and  Combine  [them] selves  together 
into  a  Civil  Body  Politic.”  This  agreement  has 
quite  naturally  been  the  subject  of  many  interpre¬ 
tations,  which  see  in  it  a  sort  of  social  contract,  if 
not  the  “original  American  social  contract.”  Such 
a  comparison  is  flawed  with  misinterpretation.  If  a 
text  such  as  the  Mayflower  Compact  is  to  be 
regarded  as  having  some  tie  to  contract  theory,  it 
is  not  as  an  illustration  and  still  less  as  an  actual¬ 
ization  of  it.  It  is  rather  a  good  empirical  example 
showing  how  the  theme  of  the  social  contract  is  a 
highly  sophisticated  intellectual  generalization  of  a 
rather  common  practice  of  the  time  to  establish 
covenants  in  political  matters. 

It  does  not  follow  that  the  evolution  from  the 
practices  of  contract  in  the  Middle  Ages  to  con¬ 
tractarian  doctrines  is  seamless,  and  even  at  its 
limited  level  the  Mayflower  Compact  points  out 
one  of  the  most  important  differences.  Medieval 
conceptions  were  rooted  in  hierarchies,  whereas 
the  Compact  is  an  agreement  between  equal  par¬ 
ties.  Even  if  it  is  an  exaggeration  to  describe  the 
episode — as  it  has  often  been  the  case — as  “demo¬ 
cratic,”  it  is  nevertheless  an  example  of  the  greater 
sense  of  equality  in  the  17th  century.  This  sense  is 
pervasive  in  contractarian  philosophies  and  makes 
them  break  free  from  the  very  meaning  of  feudal 
contractualization,  the  goal  of  which  was  to  regu¬ 
late  a  constant  exchange  of  services  between  the 
inferior  and  the  superior.  Contract  theories,  on  the 
contrary,  state  that  inequalities  can  be  understood 
only  after  a  primary  agreement  to  live  in  society. 

Contract  theories  are  radically  distinct  from 
medieval  conceptions  on  another  crucial  instance: 
While  politics  and  their  representation  in  the 
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Middle  Ages  gravitate  around  religion,  contract 
philosophies  are  inherently  secular.  In  the  Middle 
Ages,  religion  is  the  ultimate  justification  of  the 
authority  of  the  princes  whose  power  is  regarded 
as  proceeding  from  God’s  will.  References  to  reli¬ 
gion  are  repetitive,  insistent,  and  lean  on  an  often 
quoted  passage  of  Paul’s  Letter  to  the  Romans 
(XIII  1—2):  “For  there  is  no  power  but  of  God;  and 
the  powers  that  be  are  ordained  of  God.  Therefore 
he  that  resisteth  the  power,  withstandeth  the  ordi¬ 
nance  of  God:  and  they  that  withstand  shall 
receive  to  themselves  judgment.” 

Even  if  contractarian  thinkers  such  as  Samuel 
von  Pufendorf  (1632-1694)  or  John  Locke  (1632- 
1704)  consider  that  the  idea  of  a  fundamental 
contract  coheres  with  the  plans  of  God,  they  never 
recognize  a  direct  divine  influence.  The  idea  that 
society  is  the  outcome  of  a  covenant  between 
human  beings  implies  stepping  away  from  such  an 
immediate  consecration  of  rulers  by  God.  This  is 
why,  despite  prudent  maneuvers  to  avoid  the  accu¬ 
sation,  contractarian  philosophers  have  often  been 
accused  of  being  hostile  to  religion:  Whether  they 
adopt,  like  Hobbes,  a  strategy  of  compulsive  justi¬ 
fication  of  their  positions  by  references  taken  from 
the  Bible  or  whether  they  contemptuously  dismiss 
faith  as  a  source  of  comprehension  for  society,  like 
Rousseau,  they  were  equally  considered  in  their 
time  as  impious. 

Thus,  the  real  novelty  of  contractarian  theories 
is  not  that  they  assume  the  existence  of  a  contract 
between  the  government  and  its  subjects — this  is 
the  traditional  aspect,  inherited  from  the  Middle 
Ages.  The  novelty  is  that  they  describe  the  very 
existence  of  society,  the  mere  fact  of  living  together, 
as  the  outcome  of  a  pact  between  the  members  of 
society  without  direct  reference  to  God’s  will. 
Assuming  the  idea  of  a  global  pact  at  the  founda¬ 
tion  of  society,  contract  theories  have  transformed 
what  was  a  local,  discrete  agreement  of  submission 
into  a  general  mechanism  accounting  for  all  basic 
social  and  political  relations. 

The  State  of  Nature 

The  assumption  of  contract  theories  that  societies 
were  created  by  an  initial  agreement  purports  the 
consequence  that  there  is  a  state  before  society,  a 
state  in  which  human  beings  live  without  being 
tied  by  the  requirements  of  social  life.  In  the  17th 


century,  an  expression  is  coined  to  describe  this 
moment  when  human  beings  are  not  formally 
engaged  in  the  bonds  of  society:  “the  state  of 
nature.”  The  use  of  the  word  nature  to  character¬ 
ize  what  is  “not  social”  is  remarkable:  It  is  a  direct 
translation  of  the  new  conviction  in  the  17th  cen¬ 
tury  that  politics  oppose  society  and  nature.  This 
antagonism  itself  is  complex  but  rests  on  a  convic¬ 
tion  shared  by  all  the  authors:  Society  is  about 
restraining  an  unlimited  thirst  for  freedom  and 
license  that  dominates  the  state  of  nature.  Hence, 
there  is  a  fundamental  contrast:  On  the  one  hand, 
man  (few  authors  reflect  on  the  status  of  women, 
usually  considering  that  the  division  of  genders  is 
a  natural  one)  is  naturally  free  and  wants  to  be 
infinitely  free;  on  the  other  hand,  this  freedom  is 
not  sustainable  and  ultimately  leads  to  anarchy. 
Society  is  about  the  rules  and  constraints  that  bind 
this  initial  liberty.  For  this  reason,  contractarian 
authors  have  a  repressive  conception  of  society 
generally — even  if,  along  with  Rousseau  or  Kant, 
they  explain  that  this  repression  is  true  liberty, 
liberty  within  autonomous  rules. 

In  this  perspective,  the  state  of  nature  serves  as 
a  way  to  explain  this  liberty  that  men  have  given 
up  by  entering  into  the  ties  of  society.  In  its  most 
common  versions,  it  is  described  as  an  age  only 
dominated  by  “natural  law,”  in  the  new  under¬ 
standing  that  the  expression  has  had  since  the 
16th  century.  Natural  law  is  the  law  that  deter¬ 
mines  the  behavior  of  human  beings  without  ref¬ 
erence  to  any  positive  norm.  In  his  1690  Second 
Treatise  of  Civil  Government,  Locke  has  summed 
up  what  natural  law  entails  for  most  contractar¬ 
ian  authors:  “perfect  freedom  [for  men]  to  order 
their  actions,  and  dispose  of  their  possessions  and 
persons,  as  they  think  fit  .  .  .  without  asking  leave, 
or  depending  upon  the  will  of  any  other  man” 
(chap.  2,  sec.  1). 

Nothing  but  strength  or  opportunity  seems  to 
limit  men  in  their  endeavors.  It  leads  to  a  situation 
of  at  least  potential  threat,  and  more  surely  to 
violence  and  to  anarchy.  As  Hobbes  puts  it,  the 
“state  of  nature”  is  a  “state  of  war” — “the  war  of 
everyone  against  everyone,”  to  recall  a  famous 
quote  from  his  Leviathan,  published  in  1651  (I  4). 
The  place  of  strife  in  the  state  of  nature  is  not 
always  as  pregnant  as  in  Hobbes’s  philosophy: 
Rousseau  strongly  opposes  the  idea,  considering 
that  conflict  follows  the  invention  of  society — an 
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ill-regulated  society  without  a  social  contract — 
instead  of  preceding  it.  Locke  thinks  that  while  not 
being  constant,  conflict  may  pose  a  problem  at  any 
moment.  In  all  these  cases,  despite  deep  opposi¬ 
tions  about  the  nature  of  conflict  and  its  presence 
in  the  state  of  nature,  the  theme  of  the  contract 
surfaces  as  a  response  to  a  potential  menace:  At 
some  point,  human  beings  decide  to  live  together 
under  the  rules  of  a  centralized  government  that 
will  protect  them  as  long  as  they  limit  their  liberty 
and  obey  the  rules  that  this  government  is  in 
charge  of  enforcing. 

John  Milton’s  1650  Tenure  of  Kings  and 
Magistrates  includes  a  good  summary  of  these 
main  ideas: 

All  men  naturally  were  borne  [sic]  free  .  .  .  and 
were  .  .  .  born  to  command  and  not  to  obey:  and 
that  they  liv’d  so.  Till  .  .  .  falling  among 
themselves  to  doe  wrong  and  violence,  and 
foreseeing  that  such  courses  must  needs  tend  to 
the  destruction  of  them  all,  they  agreed  by 
common  league  to  bind  each  other  from  mutual 
injury,  and  joyntly  to  defend  themselves  against 
any  that  gave  disturbance  or  opposition  to  such 
agreement.  Hence  came  Citties,  Townes  and 
Commonwealths.  And  because  no  faith  in  all  was 
found  sufficiently  binding,  they  saw  it  needfull  to 
ordaine  som  authoritie,  that  might  restrain  by 
force  and  punishment  what  was  violated  against 
peace  and  common  right. 

This  authoritie  and  power  of  self-defence  and 
preservation  being  originally  and  naturally  in 
every  one  of  them,  and  unitedly  in  them  all,  for 
ease,  for  order,  and  least  each  man  should  be  his 
own  partial  Judge,  they  communicated  and 
deriv’d  either  to  one,  whom  for  the  eminence  of 
his  wisdom  and  integritie  they  chose  above  the 
rest. 

Milton  sums  up  the  starting  point  of  contractar¬ 
ian  theories.  Beginning  with  an  anthropology  that 
claims  that  the  fate  of  man  is  liberty  and  com¬ 
mandment  and  not  obedience,  Milton  points  out 
the  danger:  If  nobody  obeys  common  rules,  human¬ 
ity  is  doomed  to  suffer  from  violence  because  there 
are  no  possible  restraints  on  whatever  action  is 
undertaken  by  whoever  wants  something.  However, 
the  response  is  the  ability  of  men  to  “foresee”  the 
trouble  and  to  surrender  their  original  freedom 


and  equality  to  secure  peace  against  potential 
troublemakers.  So  is  born  the  age  of  law,  an  age  of 
limited  freedom  under  law  and  authority.  The  sub¬ 
stance  of  what  is  called  “the  social  contract”  is 
made  of  this  agreement  disclosing  rights  and 
duties,  legitimacy  and  obedience. 

Authors  pay  more  or  less  attention  to  the  detail 
of  the  state  of  nature.  We  find  ample  depictions  in 
Hobbes,  Pufendorf,  or  Locke;  and  Rousseau  takes 
the  idea  of  the  state  of  nature  to  an  unequal  level 
of  theoretical  sophistication  in  his  Second  Discourse 
(published  in  1755).  In  contemporary  philosophy, 
the  reprise  of  Sigmund  Freud’s  narrative  about 
primitive  society  in  Carole  Pateman’s  notion  of 
sexual  contract  also  gives  a  developed  narrative 
about  the  equivalent  of  the  state  of  nature.  But 
authors  such  as  Kant  or  Rawls  mention  only 
briefly  the  “state  of  nature.”  Its  sole  goal  is  to  give 
a  sense  of  what  human  attitudes  toward  engaging 
in  political  life  could  be  if  existing  social  con¬ 
straints  had  no  influence  on  the  choices  they  make. 
Whatever  its  development  in  the  different  theories, 
the  state  of  nature  and  its  equivalent  form  an 
extraordinary  hypothesis  about  a  humanity  freed 
from  any  kind  of  social  constraint  and  depict, 
often  under  the  guise  of  fiction,  the  experience  of  a 
humanity  for  which  every  path,  from  radical  anar¬ 
chy  to  complete  social  integration,  is  open. 

Using  the  state  of  nature  for  a  heuristic  hypoth¬ 
esis  about  human  freedom  is  not  without  prob¬ 
lems.  The  first  is  its  logics.  If  humanity  has  stepped 
away  from  nature,  how  can  a  proper  understand¬ 
ing  of  the  latter  be  retrieved  since — as  Rousseau 
put  it — the  necessary  experiences  to  retrieve  the 
“original  man”  are  impossible  to  create?  Some 
authors  just  assume,  like  Hobbes,  that  the  state  of 
nature  can  still  be  perceived  in  contemporary 
human  behaviors:  As  soon  as  the  constraints  of 
rules  stop  making  their  grip  felt,  all  the  violence  of 
nature  comes  back,  with  abuses,  desire  for  vio¬ 
lence,  and  strife.  The  second  question  is  the  one  of 
the  reality  of  the  state  of  nature  and,  beyond,  of  the 
initial  agreement  that  gave  way  to  the  existence  of 
society.  For  Rousseau  it  is  just  a  “conjecture,”  a 
state  that  is  “no  more”  and  that  has  “probably 
never  been”  (as  Rousseau  puts  it  in  his  Second 
Discourse).  But  Locke,  for  instance,  never  renounces 
the  idea  that  the  initial  pact  corresponds  to  some 
extent  to  historical  events.  Pufendorf — being  care¬ 
ful  not  to  contradict  the  biblical  narrative  of  the 
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creation  of  humanity — considers  that  “the  whole 
human  race  has  never  at  one  and  the  same  time 
been  in  the  natural  state”  (Pufendorf,  II,  chap.  1): 
It  is  a  fragmentary  experience. 

From  Rousseau  on,  the  interest  in  the  historical 
accuracy  of  the  contractarian  narrative  fades. 
Rousseau  certainly  indulges  in  a  long,  detailed, 
and  forcefully  evocative  narrative  about  the  state 
of  nature  in  his  Second  Discourse.  But  he  also 
decides  to  “lay  facts  aside,  as  they  do  not  affect  the 
question”:  The  narrative  of  the  state  of  nature  is 
bound  to  remain  a  fiction.  This  renunciation  to 
ground  the  theory  of  contract  in  empirical  evi¬ 
dences  is  connected  to  the  assumption  that,  as 
Kant  puts  it,  the  notion  of  a  social  contract  is  a 
pure  a  priori  idea  of  reason.  Kant  opines  that  such 
ideas  allow  us  to  think  and  to  understand  reality 
but  are  not  derived  from  experience.  That  is  why — 
apart  from  an  explicit  fear  of  revolution — Kant 
criticizes  empirical  inquiries  about  the  origins  of 
the  society:  “It  is  vain  to  inquire  into  the  historical 
origin  of  the  political  mechanism;  for  it  is  no  lon¬ 
ger  possible  to  discover  historically  the  point  of 
time  at  which  civil  society  took  its  beginning” 
(Kant,  Eternal  Peace,  p.  146).  Rawls’s  contempo¬ 
rary  version  of  contractarianism  follows  this  tradi¬ 
tion:  His  own  radical  position  advocates  the  sup¬ 
pression  of  particular  information  in  the  “original 
position,”  which  is  the  first  step  to  determine 
principles  of  justice,  an  undertaking  that  he  pres¬ 
ents  as  an  analog  of  the  social  contract.  Here,  as 
with  Kant,  the  reality  described  by  the  social  con¬ 
tract  is  not  the  reality  of  social  facts  but  the  reality 
of  a  social  reasoning  giving  way  to  an  appropriate 
perspective  on  social  reality.  After  Rousseau,  con¬ 
tract  theories  are  no  more  about  history,  they  are 
a  rational  standpoint  to  study  society. 

An  Individualistic  Understanding 
of  Political  Life 

Considering  societies  as  owing  their  existence  to 
an  initial  agreement  involves  three  main  questions 
that  have  been  dealt  with  thoroughly  throughout 
the  history  of  contract  theories:  (1)  about  the  sta¬ 
tus  of  the  contracting  parties,  (2)  about  its  motiva¬ 
tion  of  the  covenant,  and  (3)  about  its  limit — who, 
why,  what? 

Answering  the  second  question  first  is  probably 
the  easiest,  because  there  is  a  certain  consensus 


among  authors  about  “why,”  at  a  certain  point, 
human  beings  find  it  fit  to  engage  in  the  ties  of 
society,  abandoning  their  initial  freedom  and 
equality  to  respect  common  laws  and  the  authority 
in  charge  of  enforcing  them.  All  agree  that  the  state 
of  nature  is  by  nature  unstable  and  unsustainable: 
Unlike  animals,  the  human  fate  is  not  to  stay  in 
nature.  Only  Rousseau  disagrees  with  this  view. 
He  considers  that  it  is  only  by  accident,  through 
some  unknown  catastrophe,  that  men  have  come 
to  be  in  contact  with  one  another,  while  they  could 
have  stayed  forever  in  the  animal  state.  This  con¬ 
ception  allows  him  to  avoid  the  less  gentle  expla¬ 
nation  that  other  philosophers  were  keen  on  offer¬ 
ing:  Man  cannot  remain  an  animal,  because  he  is 
worse  than  the  animals.  As  Pufendorf  writes  in  On 
the  Duties  of  Man  and  Citizen,  man  is  “more 
wretched  than  that  of  any  wild  beast”  (I  4)  and 
“no  animal  is  fiercer  or  more  untameable  than 
man,  and  more  prone  to  vices  capable  of  disturb¬ 
ing  the  peace  of  society”  (V  6).  Hobbes  coins  a 
famous  sentence  to  sum  up  this  somber  anthropol¬ 
ogy  in  a  vision  of  “the  life  of  man,  solitary,  poore, 
nasty,  brutish,  and  short”  ( Leviathan ,  chap.  13). 
Whether  by  accident  (for  Rousseau)  or  out  of  the 
continuous  strife  that  the  vices  of  men  initiate 
when  there  are  no  rules  to  tame  them,  there  is  a 
moment  when  the  natural  state  of  man  is  unten¬ 
able.  The  reason  why  man  agrees  on  a  contract 
that  forces  him  to  renounce  his  initial  liberty  for 
the  sake  of  securing  what  is  the  most  important  to 
him  (property  for  Locke,  rights  for  Rousseau,  life 
for  Hobbes,  a  bit  of  everything  for  Pufendorf)  is 
the  direct  consequence  of  this  state  of  violence  to 
which  the  initial  condition  has  evolved. 

The  reason  why  a  contract  seems  to  be  compul¬ 
sory  to  contractarians  shows  how  narrowly  it  is 
linked  with  a  conception  of  humanity — in  other 
words,  an  anthropology.  The  anthropology  at 
stake  has  one  consensual  element  among  philoso¬ 
phers  who,  otherwise,  have  very  different  views  on 
human  nature:  Man  who  is  party  to  the  contract  is 
defined  as  an  individual.  Baruch  Spinoza  (1632— 
1677)  spells  it  out  in  the  Tractatus  Theologico- 
Politicus:  “Each  individual  transfers  the  whole  of 
the  power  he  has  of  himself  to  the  community” 
(chap.  14).  Hobbes  uses  the  first  person  to  indi¬ 
vidualize  the  engagement  in  the  compact:  “I  autho¬ 
rise  and  give  up  my  right  of  governing  myself  to  this 
man,  or  to  this  assembly  of  men,  on  this  condition; 
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that  thou  give  up,  thy  right  to  him  ...”  ( Leviathan , 
II  17).  Rousseau  equally  emphasizes  the  individu¬ 
alistic  dimension  of  the  social  compact,  which  is 
for  him  “the  total  alienation  of  each  associate, 
together  with  all  his  rights,  to  the  whole  commu¬ 
nity;  for,  in  the  first  place,  as  each  gives  himself 
absolutely,  the  conditions  are  the  same  for  all” 
( On  Social  Contract,  16). 

Such  formulations  clearly  indicate  the  reorgani¬ 
zation  of  the  conception  of  society  with  contract 
doctrines:  Collectivities  are  to  be  understood  start¬ 
ing  with  the  individuals.  They  also  lead  contracta¬ 
rian  philosophers  to  develop  anthropological 
inquiries  in  order  to  determine  that  what  is  genu¬ 
inely  human  as  the  core  opposition  of  contractar¬ 
ian  theories  is  the  one  between  a  state  before  soci¬ 
ety  and  civil  society.  This  anthropology,  diverse 
thought  the  ideas  of  the  authors  can  be,  is  based  on 
some  convictions  that  they  all  share.  All  agree,  for 
instance,  that  man  is  naturally  free,  equal  to  any 
other.  The  question  of  reason  is  more  debated,  and 
usually  authors  describe  humanity  in  the  state  of 
nature  as  dominated  by  desire  and  appetite,  the 
development  of  reason  requiring  in  their  opinion 
exchanges  made  possible  only  by  the  existence  of 
society.  Once  again,  Spinoza  provides  an  accurate 
summary  of  this  anthropology  in  his  Tractatus 
Theologico-Politicus : 

The  natural  right  of  every  man  therefore  is 
determined  by  appetite  and  power,  not  by  sound 
reason.  For  all  are  not  constituted  by  nature  to 
act  according  to  the  rules  of  reason.  On  the 
contrary,  all  are  born  ignorant  of  everything;  and 
before  they  can  know  the  true  rule  of  life,  and 
acquire  virtuous  habits,  a  great  part  of  their  lives 
must  already  have  passed,  (chap.  16) 

However,  the  consequences  drawn  from  such  a 
description,  which  is  not  solely  Spinoza’s,  but  could 
be  agreed  on  by  most  authors  of  the  17th-  and 
18th-century  contractarian  thinkers,  are  often  at 
odds.  Hobbes  considers  that  the  fear  of  death,  and 
hence  the  potential  threat  that  any  human  being 
represents  to  any  other,  is  the  dominant  passion, 
while  Rousseau  notes  that  the  fear  of  death  is  too 
complex  an  idea  to  have  it  attributed  to  the  natural 
state  of  man.  Locke  insists  on  the  role  of  property 
and  considers  that  the  need  to  protect  possession  is 
the  driving  factor  for  the  establishment  of  a  civil 


society.  For  many  contractarian  philosophers, 
while  a  person  is  described  as  an  individual,  this 
individual  is  often  engaged  in  different  kinds  of 
social  intercourses.  These  do  not  lead  to  a  properly 
established  society — these  relations  are  not  formal¬ 
ized  as  laws  and  enforced  by  the  existence  of  a 
state.  As  Kant  repetitively  states  it  in  The  Science  of 
Right,  the  natural  state  of  man  can  be  a  “social 
state”  even  if  it  is  by  no  means  a  “civil  state” 
because  the  civil  state  is  characterized  by  the  estab¬ 
lishment  of  permanent  laws,  the  violation  of  which 
is  punished  by  a  recognized  authority. 

Only  Rousseau  seeks  to  build  a  philosophical 
anthropology  that  would  be  a  radically  individual¬ 
istic  one,  by  eliminating  from  his  consideration  of 
man  before  the  social  contract  any  type  of  interac¬ 
tions  between  human  beings  in  the  state  of  nature. 
It  seems  contradictory  to  him  to  establish  society 
on  the  basis  of  an  individual  agreement  if  we  are 
unable  to  consider  the  individuals  as  such,  without 
connections  to  one  another.  Even  Hobbes’s  “war 
of  all  against  all”  in  the  state  of  nature  seems  to 
him  a  contradiction:  If  one  is  on  her  or  his  own, 
why  should  there  be  a  conflict  and  how  could  two 
individuals  who  are  entirely  free  to  flee  fight  if  they 
have  no  possession  to  defend  and  no  complex  idea 
such  as  honor  and  recognition  to  drive  them?  This 
opposition  is  far  from  being  a  point  of  detail.  It  is 
a  way  for  Rousseau  to  develop  to  its  furthest  con¬ 
sequence  the  individualistic  standpoint  of  contrac¬ 
tarian  doctrines  and  allows  him  either  to  develop 
or  reformulate  them  or  to  overcome  some  of  their 
difficulties. 

For  instance,  Rousseau’s  radically  individualis¬ 
tic  stance  gives  him  the  possibility  to  address  an 
aspect  of  the  social  contract  that  is  often  carefully 
avoided  by  most  authors:  the  question  of  women. 
Contractarian  philosophers  tend  to  think  that 
female  subordination  is  natural — for  example, 
Pufendorf,  for  whom  marriage  is  part  of  the  natu¬ 
ral  law.  Hence,  women  have  hardly  a  word  to  say 
in  the  original  covenant,  and  the  social  contract  is 
indeed  gendered — Pateman  is  far  from  being  wrong 
in  reading  the  social  contract  as  a  sexual  one, 
opposing  the  social  contract,  which  is  about  free¬ 
dom,  to  the  sexual  contract,  which  is  about  subor¬ 
dination.  Even  if  Rousseau’s  claims  about  the 
legitimate  subjection  of  woman  to  man  in  society 
are  conventional  and  quite  disappointing  for  the 
modern  reader,  his  initial  intuition  is  remarkable. 
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An  individualistic  society  is  gender  neutral.  This 
claim  was  long  forgotten  before  being  rediscovered 
in  the  20th  century — for  instance,  by  Rawls, 
according  to  whom  the  decision  about  the  princi¬ 
ples  of  justice  has  been  made  under  a  veil  of  igno¬ 
rance  so  thick  that  one  does  not  even  know  the 
gender  of  the  founder. 

The  emphasis  on  the  individual  is  also  decisive 
for  the  substance  of  the  contract — the  “what  ques¬ 
tion.”  With  his  radical  insight,  Rousseau  is  able  to 
overcome  some  of  the  least  liberal  aspects  of  con¬ 
tractarian  doctrines  and  to  give  them  a  more 
democratic  undertone — though  technically 
Rousseau  was  never  in  favor  of  democracy,  a 
regime  he  imagined  only  fit  for  “a  people  of 
gods.”  Admitting  a  social  relation  of  some  sort 
among  men  before  the  founding  contract  of  soci¬ 
ety  means  that  there  is  something  wrong  with 
human  beings  in  the  state  of  nature  since  they  are 
unable  to  cope  with  it  in  a  sustainable  fashion.  For 
many  contract  philosophies,  the  place  of  the  indi¬ 
vidual  is  purely  disruptive:  Individuals  tend  to  free 
themselves  from  the  natural  law — the  rules  that 
commend  the  action  of  man  in  the  state  of 
nature — since  there  is  no  institution  to  guarantee 
retaliation  for  misdeeds.  Punishment  requires  the 
invention  of  a  developed  society  and  is  a  function 
of  the  state  that  is  created  after  the  contract  is 
agreed  on.  If  individuals  spontaneously  behaved 
according  to  the  law  of  nature,  indeed,  there 
would  be  no  need  for  a  social  contract.  To  be  pre¬ 
cise,  one  of  the  goals  of  the  contract  is  to  have 
man  abandon  natural  law  to  ensure  what  this 
natural  law  cannot  guarantee:  its  own  enforce¬ 
ment.  The  risk  is  then  to  interpret  the  contract 
that  founds  a  society  either  in  a  purely  repressive 
way — this  is  basically  the  case  with  Hobbes — or 
to  be  exposed  to  a  fallacy,  as  might  be  the  case 
with  Locke. 

Indeed,  if  the  main  issue  in  human  interactions 
in  the  state  of  nature  is  the  menace  of  violence, 
because  individuals  have  the  right  to  anything  they 
wish  under  the  natural  law,  the  scope  of  the  con¬ 
tract  can  be  strictly  limited  to  guarantee  the 
absence  of  conflict  by  all  means.  It  is  the  case  with 
Hobbes,  who  considers  that  the  only  limitation  of 
the  authority  to  which  the  contract  has  transferred 
each  individual’s  power  and  right  is  that  it  cannot 
harm  the  life  of  its  subjects.  In  such  a  context,  even 
if  it  has  been  argued  that  Hobbes  was  the  first 


“liberal”  thinker,  authoritarian  regimes  can  be 
perfectly  legitimate  ones. 

But  if,  as  Rousseau  did,  one  starts  with  an  indi¬ 
vidual  isolated  from  the  others,  then  conflict  only 
appears  when  society  is  formed  because  it  puts  into 
contact  with  each  other  human  beings  who  are  not 
meant  to  be  placed  in  such  a  situation.  Hence,  sup¬ 
pressing  conflicts  means  retrieving  the  initial  state 
by  finding  a  way — which  is  precisely  the  con¬ 
tract — to  guarantee  what  has  been  lost  with  the 
gathering  of  individuals  who  were  initially  sepa¬ 
rate  entities.  In  a  way,  Rousseau  plays  the  civil 
state  against  the  social  state:  society  without  laws 
oppresses  the  individuals;  society  with  laws  should 
enable  them  to  thrive  again  in  a  superior  way.  This 
leads  to  a  major  step  ahead  in  the  values  linked 
with  the  idea  of  a  social  contract.  For  Rousseau, 
the  aim  of  the  initial  contract  is  not  only  to  protect 
the  physical  life  of  the  individual  but  also  to  pro¬ 
tect  his  or  her  inherent  qualities:  freedom,  rights, 
and  equal  standing  with  everybody  in  society.  To 
that  extent,  he  follows  the  lesson  of  Locke  in  his 
Second  Treatise,  who  says  that  “law,  in  its  true 
notion,  is  not  so  much  the  limitation  as  the  direc¬ 
tion  of  a  free  and  intelligent  agent  to  his  proper 
interest”  (sec.  57).  And  Rousseau  extends  the  idea 
to  its  ultimate  consequence:  if  law  is  here  to  pro¬ 
tect  liberty,  as  Locke  claims  it,  then  Rousseau  feels 
that  the  logical  follow-up  is  that  “obedience  to  a 
law  which  we  prescribe  to  ourselves  is  liberty”  (On 
the  Social  Contract,  18). 

Rousseau’s  difference  with  Locke  is  precisely 
that  Locke  also  considers  that  human  beings  in  the 
state  of  nature  are  not  isolated.  He  goes  so  far  as  to 
think  that  they  have  some  common  understanding 
of  the  law  (of  nature)  they  should  respect.  But  the 
problem  with  this  view — and  the  possible  fallacy  in 
the  reasoning — is  that  it  shows  that  social  relations 
are  right  away  flawed  with  conflict  and  violations 
of  norms  that  are  however  respected.  The  forma¬ 
tion  of  civil  society  through  the  contract  only  adds 
government  and  law  to  the  problems  of  society  but 
does  not  solve  them.  By  retracing  the  origins  of 
society,  not  to  a  state  of  nature  that  would  have 
been  to  some  extent  social  but  to  the  idea  of  the 
individual  considered  in  himself  or  herself,  Rousseau 
is  in  a  better  position  to  reinforce  the  claim  of  an 
original  contract  that  is  in  place  to  preserve  indi¬ 
vidual  rights,  because  they  can  be  thought  indepen¬ 
dent  from  any  kind  of  interactions. 
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Beyond  the  Contract 

No  author  more  clearly  than  Pufendorf  in  his  Duty 
of  Man  and  Citizen  has  grasped  the  essence  of  the 
questions  raised  by  contractarian  theories.  “For  a 
state  to  coalesce  regularly,”  Pufendorf  writes,  it  is 
necessary  that  first  each  individual  agrees  “to  a 
permanent  community”  with  the  others,  or,  in 
other  words,  it  is  necessary  “that  they  mutually 
desire  to  become  fellow-citizens.”  This  is  the  first 
compact.  Then,  once  engaged  in  society,  the  col¬ 
lectivity  has  to  decide  in  “one  decree”  under  which 
regime  it  chooses  to  live.  Then, 

another  compact  is  needed,  when  the  person,  or 
persons,  upon  whom  the  government  of  the 
nascent  state  is  conferred  are  established  in 
authority.  By  this  compact  these  bind  themselves 
to  take  care  of  the  common  security  and  safety, 
the  rest  to  yield  them  their  obedience;  and  by  it 
also  all  subject  their  own  wills  to  the  will  of  that 
person  or  persons.  (II  6) 

One  compact  is  thus  about  living  together, 
another  about  submission  to  authority.  This  the¬ 
ory,  sometimes  known  as  the  “two-pacts  theory,” 
underlines  the  twofold  character  of  contractarian 
theories:  on  the  one  hand,  the  question  of  power; 
on  the  other,  the  question  of  togetherness — in 
other  terms,  the  question  of  political  legitimacy 
and  the  question  of  social  diversity.  If  Rousseau’s 
formula  “the  social  contract”  has  been  so  success¬ 
ful  in  summing  up  the  inspiration  of  contract  the¬ 
ory,  it  is  precisely  because  the  original  contract  is 
not  solely  about  government.  It  is  about  the  very 
existence  of  any  collectivity,  any  society.  The  pur¬ 
pose  of  contract  theory  is  thus  not  only  to  identify 
the  characteristics  of  legitimate  power.  It  also  deals 
with  the  mere  possibility  of  living  together  and 
raises  the  question  of  social  ties. 

We  can  safely  say  that  this  new  question  is  born 
with  contractarianism  and  that  contract  theory  has 
been  the  first  to  give  an  account  of  it.  With  the 
emergence  of  the  figure  of  the  individual,  social 
bonds  appear  frailer  because  one  realizes  the 
power  that  each  member  has  to  seclude  himself 
from  ties  that  can  be  challenged.  While  tyranny  has 
always  been  considered  as  a  political  evil,  contract 
theories  also  admit  that,  beyond  the  oppression, 
seeing  communities  torn  down  by  internal  disputes 
is  an  equal  danger.  The  times  of  revolutions  and 


civil  wars  that  saw  the  birth  of  contractarian  doc¬ 
trines  gave  ample  illustration  to  this  fact  to  authors 
who  tried  to  figure  out  what  response  to  the  trag¬ 
edies  of  the  time  could  be  offered. 

Pufendorf,  by  separating  the  will  to  live  together 
from  the  consent  to  power,  spells  out  this  new 
condition — the  social  condition — of  politics  in  the 
wake  of  modernity.  However,  if  he  is  analytically 
right,  he  has  been  seen  as  politically  wrong,  and 
the  rest  of  the  contractarian  tradition  has  rejected 
the  theory  of  the  two  pacts.  The  fundamental 
problem  of  modern  politics  might  be  twofold — liv¬ 
ing  together,  obeying  a  legitimate  authority — but  it 
does  not  consist  of  two  separate  problems. 
Determining  the  right  authority,  the  right  “princi¬ 
ple  of  justice,”  as  Rawls  puts  it  in  the  20th  century, 
is  inseparable  from  our  way  of  living  together. 
According  to  Rawls,  it  is  not  the  case  that  there  is 
our  political  existence  on  one  side  and  the  kind  of 
relations  we  have  with  other  members  of  our  soci¬ 
ety  on  the  other  side.  Politics,  relations  of  author¬ 
ity,  pervade  the  entire  network  of  social  links: 
Political  sociology  always  meshes  with  the  frame¬ 
work  of  governance.  With  contractarian  doctrines, 
political  philosophy  has  entered  the  age  of  ideol¬ 
ogy:  Social  relations  are  to  be  accounted  for  in 
terms  of  politics. 

By  the  same  token,  the  presence  of  contract 
theory  in  political  thought  has  been  abating  when¬ 
ever  those  questions  have  appeared  as  secondary 
or  even  suspect  ones  or  when  alternative  theories 
of  modernity  have  shown  a  better  way  of  giving 
explanations  of  the  facts  that  contract  theorists 
wanted  to  explain.  Another  kind  of  critique  has 
also  come  from  those  who,  as  David  Hume  (1711— 
1776)  puts  it  in  one  of  his  essays  dedicated  to  the 
critique  of  contractarian  theories,  “trace  up” 
“government  to  the  Deity.”  The  very  inspiration  of 
a  secular  doctrine  is  for  these  critics  its  very  and 
most  fundamental  flaw.  But  as  this  kind  of  norma¬ 
tive  position  has  lost  its  grip  on  Western  political 
thought,  the  real  critique  has  come  from  doctrines 
that  have  assumed  their  own  identification  with 
modernity,  showing  that  contract  theory  could 
embrace  the  whole  of  modern  politics. 

In  the  first  phase,  after  the  French  Revolution, 
one  of  the  major  problems  of  contract  theories — 
social  plurality  and  the  question  of  how  to  live 
together  in  society — loses  some  of  its  importance. 
The  19th  century  was  an  age  of  nation  building 
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and  social  reengineering.  The  national  imaginary 
favors  the  conception  of  tightly  knitted  communi¬ 
ties,  whose  main  problem  was  to  get  rid  of  external 
influence.  The  neighbor  was  not  a  problem  pro¬ 
vided  that  he  or  she  was  from  the  same  kin,  and 
the  legitimate  government  was  not  primarily  the 
one  that  grants  rights  but  the  national  one. 
Socialist  doctrines,  then  so  influential  and  whose 
influence  extended  far  into  the  20th  century,  were 
equally  at  odds  with  this  question  of  social  plural¬ 
ity.  Their  insistence  on  collective  organization 
could  not  but  repel  the  general  inspiration  of  con¬ 
tract  philosophies.  Insisting  on  class  conscience,  on 
class  organization,  meant  having  very  little  interest 
in  cultural  diversity  or  in  political  struggles  that 
would  not  be  linked  to  structural,  economic  fac¬ 
tors.  It  also  meant  a  clear  inability  to  even  consider 
the  significance  of  any  individualistic  point  of  view 
on  society,  which  contract  theories  always  involve. 
Eventually,  the  style  of  political  thought  involved 
in  contractarian  theories  also  could  be  challenged 
with  the  development  of  a  scientific  approach  of 
politics  favoring  the  collection  and  the  analysis  of 
empirical  data:  It  did  not  leave  much  room  for 
theories  that  were  highly  speculative. 

Another  type  of  critique  of  contractarian  doc¬ 
trines  deals  with  the  issue  of  liberty.  One  of  the 
most  constant  representations  of  liberty  in  con¬ 
tractarian  theories  rests  in  its  dramatic  character: 
Human  beings,  left  on  their  own  without  the 
restraints  of  a  political  order,  are  led  into  a  state  of 
violence.  But  since  the  18th  century,  liberal  and 
then  libertarian  philosophies  have  considered  that 
the  lack  of  liberty  is  the  sole  source  of  violence. 
Left  to  themselves,  human  beings  are  able  to  live  in 
harmony.  This  is  the  gist  of  the  inspiration  of  mar¬ 
ket  theories,  which  have  a  reach  that  is  initially  far 
beyond  a  limited  explanation  of  how  economics 
work  or  should  work.  To  be  free,  only  local  con¬ 
tracts  are  necessary,  and  the  government,  far  from 
being  the  outcome  of  a  collective  decision  to  pro¬ 
tect  liberty,  should  not  intervene  in  matters  regard¬ 
ing  liberty.  Safeguarding  liberty  does  not  need  the 
establishment  of  an  original  contract  but  freedom 
from  the  government. 

These  competing  doctrines  also  have  shown  their 
limits,  again  placing  contractarian  doctrines  in  the 
foreground.  The  publication  in  1971  of  John  Rawls’s 
A  Theory  of  Justice  is  largely  recognized  as  a  mile¬ 
stone  in  the  renewal  of  interest  in  contractarian 


approaches.  It  coincided  with  the  awareness  that  a 
free  market  could  not  be  a  substitute  for  a  just  soci¬ 
ety,  and  that  questions  of  principles  were  raised 
through  new  political  issues  such  as  discrimination 
among  a  population:  national  citizenship  alone  was 
not  enough  to  build  a  political  community.  The  rise 
in  the  United  States  and  the  United  Kingdom  of 
political  agendas  more  centered  on  individuals  in 
social  interactions  than  in  the  previous  decades,  as 
well  as  the  declining  influence  of  socialist  and 
Marxist  interpretations  of  society,  also  allowed  the 
contractarian  framework  to  appear  as  a  credible 
alternative.  They  have  provided,  under  a  new  guise, 
a  way  to  conciliate  the  renewed  prominence  of  the 
figure  of  the  individual  and  the  continued  necessity 
of  organizing  collective  existence.  In  societies  that 
are  becoming  more  and  more  diverse,  and  where 
governance  faces  repetitive  crisis,  contractarian 
theories  might  not  offer  the  solution  but  should  still 
count  among  the  most  powerful  tools  we  have  to 
understand  the  contemporary  world  of  politics. 
This  is  indeed  the  problem  of  what  we  have  eventu¬ 
ally  called  “democracy”  and  where  we  now  recog¬ 
nize  the  very  definition  of  our  age. 
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Many  definitions  of  politics  emphasize  the  conflic- 
tual  aspect  of  human  relations.  But  cooperation 
should  be  considered  the  essential  element  of  poli¬ 
tics  even  if  it  is  not  always  achieved  at  the  degree 
that  would  produce  optimum  results.  Cooperation 
is  action  for  the  common  benefit.  Only  human 
beings  are  able  to  cooperate  and  abide  by  collec¬ 
tive  rules  for  their  common  interests,  including 
making  exchanges  in  their  mutual  benefit,  form¬ 
ing  coalitions  and  stable  organizations,  making 
enforceable  decisions  on  collective  affairs,  and 
living  in  large  communities  under  shared  norms. 
This  entry  describes  some  of  the  aspects  of  human 
existence  in  which  cooperation  can  play  a  key 


role,  examines  ways  in  which  game  theory  can 
illuminate  the  nature  of  cooperation,  and  dis¬ 
cusses  some  of  the  factors  that  are  important  in 
developing  and  maintaining  cooperation. 

Crucial  Areas  for  Cooperation 

All  fundamental  problems  in  politics  face  the  cru¬ 
cial  question  of  how,  under  what  circumstances, 
and  to  what  extent  human  beings  can  be  moti¬ 
vated  to  cooperate  in  their  common  interest. 
Cooperation  is  at  the  core  of  the  issues  of  convivi¬ 
ality,  democracy,  peaceful  coexistence  between 
different  communities,  and  the  preservation  of 
human  life  on  Earth,  as  is  briefly  reviewed  in  the 
following  paragraphs. 

Community 

In  what  certain  classical  authors  called  “the  state 
of  nature,”  conflict  is  pervasive.  If  human  interac¬ 
tions  are  unconstrained,  anyone,  with  the  advan¬ 
tage  of  surprise,  can  try  to  impose  his  or  her  will  on 
others.  But  if  all  do,  then  people  may  find  them¬ 
selves  living  in  a  state  of  chaos  in  which,  in  Thomas 
Hobbes’s  famous  words,  life  tends  to  be  “solitary, 
poor,  nasty,  brutish,  and  short.”  In  such  an  envi¬ 
ronment,  it  is  not  reasonable  to  risk  unilateral 
cooperation,  while  cooperation  within  groups  is 
precarious.  However,  human  beings  can  do  better. 
People  can  agree  on  creating  a  government  equipped 
with  tools  of  coercion  to  enforce  rules  mandating 
cooperative  actions  that  produce  beneficial  results 
for  society  as  a  whole.  The  government  may  apply 
sanctions  against  “defectors” — that  is,  violators  of 
mutually  beneficial  rules  of  conduct — discourage 
free  riding  on  public  goods,  and  craft  incentives  for 
cooperation.  People  can  rationally  accept  condi¬ 
tional  consent.  By  an  agreed  “social  contract,”  the 
efficient  outcome  of  civilization  or  “common¬ 
wealth,”  in  which  each  can  live  in  peace  and  secu¬ 
rity,  can  be  attained. 

Democratization 

In  situations  of  institutional  regime  crisis  in 
which  authoritarian  rulers  cannot  go  on  as  they 
were  accustomed  to  do,  actors  with  opposite 
political  regime  preferences  can  generate  violent 
conflict  or  a  civil  war  in  which  both  sides  may 
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fight  to  eliminate  each  other.  Eventually,  one  of 
the  sides  can  become  a  single,  absolute  winner.  But 
choosing  confrontation,  with  an  uncertain  out¬ 
come,  also  entails  the  risk  of  becoming  an  absolute 
loser,  as  well  as  the  costs  of  significant  destruction 
on  both  sides.  In  contrast,  by  anticipating  the  fore¬ 
seeable  consequences  of  their  choices,  either  the 
rulers  or  the  opposition  leaders  can  offer  condi¬ 
tional,  retractable  cooperation.  Negotiations  can 
lead  to  a  provisional  compromise,  including  the 
calling  of  a  multiparty  election  that  does  not  secure 
an  absolute  winner,  which  may  open  further  devel¬ 
opments  in  favor  of  either  of  the  actors  involved, 
as  has  happened  in  so  many  cases  of  democratiza¬ 
tion  in  different  parts  of  the  world  since  the  last 
quarter  of  the  20th  century. 

Deterrence 

International  relations  have  traditionally  been 
dominated  by  conflict,  which  culminated  in  the 
Cold  War  between  the  United  States  and  the  Soviet 
Union  during  the  second  half  of  the  20th  century. 
However,  a  major  clash  was  prevented  by  underly¬ 
ing  cooperation.  An  arms  race  triggered  by  the 
Soviets  choosing  to  build  new  weapons  and  the 
United  States  doing  the  same,  and  vice  versa,  put 
both  countries  at  risk.  The  “balance  of  terror” 
without  an  actual  frontal  war  was  durable  because, 
with  nuclear  weapons  and  the  possibility  of  total 
destruction,  the  stakes  were  so  high.  The  United 
States  and  the  Soviet  Union  attempted  to  cooper¬ 
ate  in  order  to  reduce  and  control  arms  through 
negotiations  and  agreements.  Nevertheless,  the 
arms  race  stopped  only  with  the  dissolution  of  one 
of  the  players. 

Environment 

Climate  change  has  become  a  broad  concern, 
leading  many  people  to  call  to  stop  the  planet  from 
overheating.  Yet  some  skepticism  persists  regard¬ 
ing  human  agency  in  climate  change,  and  few 
governments  are  willing  to  deal  with  the  problem 
by  themselves.  The  Kyoto  Protocol,  placing  volun¬ 
tary  limits  on  emissions  of  greenhouse  gases,  was 
formally  accepted  by  36  developed  countries  for 
the  period  2005  to  2012.  However,  the  United 
States  and  Australia  initially  refused  to  sign.  Even 
some  of  the  protocol  signatories,  as  well  as  many 


developing  countries,  including  most  prominently 
China,  continue  to  tolerate  high  levels  of  industrial 
emissions  known  to  exacerbate  the  greenhouse 
effect.  Without  international  cooperation,  the 
world  may  be  condemned  to  an  eventual  burning, 
according  to  those  who  hold  the  global  warming 
hypothesis.  Most  countries  seem  to  have  recently 
found  some  incentive  to  avoid  being  sanctioned  for 
their  misconduct.  The  Asia-Pacific  Partnership  on 
Clean  Development  and  Climate  has  attained  new 
cooperation  on  development  and  technology  trans¬ 
fer  enabling  a  reduction  in  gas  emissions.  The 
United  Nations  held  successful  talks  with  virtually 
all  countries  in  the  world  to  replace  the  Kyoto 
Protocol  after  its  deadline  expires.  If  things  get  bad 
enough,  then  cooperation  may  flourish. 

Developing  and  Sustaining  Cooperation 

The  conditions  in  which  cooperation  among  dif¬ 
ferent  actors  for  their  common  interest  can  emerge 
and  hold  up,  such  as  in  the  cases  mentioned  above, 
can  be  illuminated  with  the  help  of  some  analytical 
tools  provided  by  game  theory.  In  particular,  com¬ 
petitive,  non-zero-sum  games,  such  as  the  famous 
“prisoners’  dilemma,”  involve  different  combina¬ 
tions  of  cooperation  and  conflict.  When  mutual 
cooperation  can  produce  gains  for  all  participants, 
it  is  said  that  the  sum  of  the  players’  benefits  is 
positive.  But  people  may  fail  to  cooperate  with 
others  even  if  cooperation  would  produce  a  better 
collective  outcome  for  all  the  participants. 

The  lessons  from  prisoners’  dilemma  can  be 
applied  to  any  group  or  community  facing  a  coop¬ 
eration  problem  among  its  members.  On  the  one 
hand,  the  previously  conditions  of  the  state  of 
nature,  civil  war,  arms  race,  or  destruction  of  natu¬ 
ral  resources  can  be  conceived  as  inefficient,  equi¬ 
librium  outcomes  of  prisoners’  dilemma  type  of 
interactions.  On  the  other  hand,  the  state  of  civili¬ 
zation,  democracy,  peaceful  coexistence,  and  con¬ 
servation  of  the  atmosphere  can  correspond  to 
alternative,  efficient,  although  somewhat  vulnera¬ 
ble,  outcomes  of  this  type  of  game  produced  by 
mutual  cooperation. 

Game  theory  models  with  “repeated”  games  sug¬ 
gest  that  the  greater  the  uncertainty  over  the  future 
relationship  and  the  higher  the  number  of  interac¬ 
tions  between  the  parties,  the  more  conflict  can 
diminish  and  mutual  cooperation  emerge  and  be 
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sustained.  If  people  are  going  to  engage  in  repeated 
interactions,  it  may  make  sense  to  try  to  cooperate 
in  order  to  receive  others’  cooperation  in  the  future. 
A  community  or  institutional  setting  in  which 
everybody  can  expect  to  keep  interacting  with  the 
same  people  regularly  for  some  time  in  the  future 
may  include  a  household  or  a  neighborhood,  the 
workplace,  a  mall  or  a  school,  a  professional  orga¬ 
nization,  or  a  political  party,  as  well  as  the  city,  the 
state,  the  empire,  or  the  world  one  is  aware  of  living 
in  and  where  one  intends  to  stay.  A  reasonable 
behavior  for  repeated  interactions  with  other  indi¬ 
viduals  with  some  common  interest  can  imply  con¬ 
ditional  cooperation  and  a  positive  response  to  the 
other’s  behavior.  In  the  long  term,  cooperation  may 
spread  and  become  the  prevailing  way  of  conduct. 

According  to  this  insight,  we  can  observe  that 
cooperation,  as  can  be  substantiated  in  the  form  of 
collective  action  or  a  joint  organization,  is  indeed 
more  intense  and  sustained  among  certain  groups 
of  people  interacting  for  long  periods  of  indefinite 
length.  Cooperation  should  be  higher,  for  exam¬ 
ple,  among  members  of  a  condominium  associa¬ 
tion  rather  than  among  motel  clients,  among  town 
residents  rather  than  among  tourists  or  occasional 
visitors,  among  fixed  employees  rather  than  among 
temporarily  unemployed  people  expecting  to  find 
a  job  soon,  among  civil  servants  rather  than 
among  seasonal  workers,  and  among  store  owners 
in  a  commercial  mall  rather  than  among  sporadic 
vendors  in  a  street  market. 

If  people  reciprocate  to  cooperation  with  coop¬ 
eration  in  repeated  interactions,  they  can  build  a 
good  reputation  for  themselves  that  may  move  oth¬ 
ers  to  cooperate  with  them.  Feelings  of  trust  may 
emerge  among  people  having  information  about 
others’  past  actions  and  among  new  participants 
obtaining  regular  positive  retribution  for  their  con¬ 
duct.  In  the  middle  or  long  term,  increasing  and 
sustained  cooperation  among  members  of  a  com¬ 
munity  may  induce  them  to  construct  institutional 
environments  that  limit  individual  competition  and 
tend  to  homogenize  the  population.  Internal  sanc¬ 
tions  against  defectors  can  go  together  with  the 
promotion  of  values  such  as  honesty  and  empathy 
with  the  distress  of  others,  thus  reinforcing  social 
cooperativeness.  Emerging  and  self-sustaining 
cooperation  in  the  long  term  can  also  help  promote 
conviviality,  democracy,  peaceful  coexistence,  and 
environmental  conservation  mentioned  earlier. 


Nevertheless,  the  development  of  cooperation 
among  members  of  a  community  requires  some 
mutual  commitment  to  stay  within  its  contours.  If, 
conversely,  people  living  within  the  same  institu¬ 
tional  setting  consider  themselves  as  belonging  to 
different  groups  with  opposite  goals,  asymmetric 
relations  of  cooperation  and  conflict  within  the 
community  can  develop.  The  difference  between 
groups  can  be  based  on  language,  race,  ethnicity, 
religion,  family  or  tribal  traditions,  contrary  eco¬ 
nomic  interests,  or  adversarial  preferences  for  the 
location  of  public  goods.  Thus,  high  degrees  of  suc¬ 
cess  or  failure  in  attaining  cooperation  within  a 
group  do  not  necessarily  correspond  to  socially  effi¬ 
cient  or  inefficient  solutions.  The  collective  strength 
of  some  groups  may  indeed  provoke  conflict  with 
other  groups  or  favor  asymmetric  and  biased  redis¬ 
tributions  of  resources,  hindering  more  satisfactory 
outcomes  for  greater  numbers  of  people. 
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Coordination 


An  initial  statement  of  what  is  typically  meant  by 
the  pursuit  of  coordination  is  the  ambition  to 
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achieve  greater  coherence  among  the  departments, 
agencies,  authorities,  services,  and  professions  of 
government,  as  well  as  those  providing  public  ser¬ 
vices  under  contract  or  license,  each  of  which  has 
some  responsibilities  for  a  given  problem  or  issue. 
This  entry  provides  a  framework  for  understand¬ 
ing  coordination  in  terms  of  the  problems  it  seeks 
to  address  and  the  means  through  which  its  goals 
are  pursued.  It  also  discusses  the  way  in  which  the 
concern  with  coordination  evolved,  the  main  the¬ 
oretical  approaches  that  have  been  taken  in 
research  on  it,  and  the  limitations  of  efforts  to 
provide  evidence  of  its  effectiveness. 

Goals  and  Means 

The  word  coordination  is  sometimes  used  to 
describe  a  class  of  policy  goals  or  outcomes, 
loosely  labeled  as  “coherence.”  This  entry  distin¬ 
guishes  several  distinct  goals  that  fall  under  this 
general  term.  Each  of  these  goals,  though,  consists 
to  some  degree  in  the  reduction  or  even  removal  of 
tensions  among  policies,  so  that  they  do  not  under¬ 
mine  each  other  and,  perhaps,  instead  even  sup¬ 
port  each  other. 

Because  coordination  covers  a  range  of  initia¬ 
tives  designed  to  combat  certain  types  of  prob¬ 
lems,  one  way  to  understand  its  meaning  is  as 
something  offered  as  the  opposite  of  any  or  sev¬ 
eral  of  those  problems.  Broadly,  the  problems 
for  which  greater  coordination  is  presented  as 
a  solution  can  be  divided  into  six  kinds,  ranked 
here  in  order  of  declining  ferocity  of  conflict 
between  the  imperatives  set  out  in  the  institu¬ 
tional  or  organizational  guidelines,  organiza¬ 
tional  or  professional  goals,  incentives,  regulatory 
requirements,  and  legal  duties  (see  Christopher 
Pollitt,  2003): 

Conflict.  Mutually  undermining  imperatives  lead 
to  conflict.  For  example,  goals  of  promoting 
economic  growth  and  limiting  the  environmental 
impact  of  human  activity  quickly  come  into 
conflict,  but  almost  every  government  is  expected 
to  promote  both  to  some  degree,  and  they  are  often 
given  to  distinct  agencies  to  pursue.  Another  such 
case  arises  when  one  regulatory  agency  demands 
adaptations  to  buildings  to  enable  wheelchair 
access,  while  another  regulatory  body  forbids  it  on 
the  ground  that  they  would  violate  the  integrity  of 
a  historic  building. 


Contradiction.  This  entails  inconsistent  imperatives 
that  have  not  actually  come  into  direct  conflict  but 
may  do  so.  For  example,  consider  the  goal  of 
designing  welfare  systems  to  get  people  who  have 
few  skills  and  some  disadvantages,  such  as  learning 
difficulties  or  mental  health  problems,  into  the 
work  force.  Such  a  program  may  conflict  with  the 
imperative  of  productivity  even  in  a  strong  economy, 
but  the  conflict  may  remain  latent.  However,  in  a 
weak  economy,  the  inconsistency  in  goals  may 
become  important  and  constitute  an  actual  conflict. 

Competition.  This  involves  imperatives  that  are 
not  strictly  inconsistent  but,  in  practice,  come  into 
conflict  or  at  least  are  in  rivalry  for  priority  or 
resources.  This  is  a  perennial  problem,  in  which  the 
competition  between  emphasizing  investment  in 
secondary  and  tertiary  education  and  between 
crime  reduction  and  reduction  of  the  fear  of  crime 
can  serve  as  examples. 

Incoherence  and  Fragmentation.  Imperatives  that 
are  neither  mutually  contradictory  nor  in 
competition  may  have  little  to  do  with  each  other, 
when  arguably  gains  could  be  achieved  if  they 
were  brought  together.  The  frequent  complaints  of 
“departmentalism,”  “silo  mentality,”  and  “territor- 
ialism”  between  neighboring  local  authorities  are 
all  examples  of  this  kind  of  problem. 

Duplication.  This  refers  to  similar  or  even  identical 
imperatives  expressed  in  different  arrangements  that 
result  in  wasteful  and  excessive  use  of  resources.  For 
example,  there  are  often  cases  where  very  similar 
programs  are  provided  by  both  health  authorities 
and  local  authorities  or  by  local  authorities  and 
probation  services;  a  similar  issue  can  arise  when 
multiple  regulators  all  conduct  on-site  inspections, 
each  looking  at  the  same  sets  of  organizational 
records  or  processes  but  for  different  purposes, 
where  there  might  be  possibilities  to  reduce  the 
burden  on  regulated  bodies  by  combining  inspections. 

Pursuit  of  coordination  as  a  goal  rests  on  the 
claim  that  better  outcomes  such  as  health,  educa¬ 
tional  achievement,  security  against  crime,  eco¬ 
nomic  development,  or  effectiveness  in  defense  are 
better  promoted  when  the  policies  pursued  by  each 
particular  agency,  department,  or  authority  within 
the  system  of  governance  stand  in  little  tension 
with  each  other.  To  many  people,  this  claim  seems 
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obviously  and  intuitively  true.  However,  as  we 
shall  see  below,  it  is  contested,  and  some  of  the 
arguments  offered  against  it  have  very  distin¬ 
guished  pedigrees  in  both  organizational  and  polit¬ 
ical  thought. 

Often,  however,  “coordination”  is  also  used  to 
describe  the  means  by  which  the  aims  of  better 
integrated  policy  are  pursued.  Indeed,  in  this 
respect,  the  term  is  used  to  refer  to 

•  general  processes,  such  as  markets,  hierarchy, 
clans,  or  clusters,  as  broad  types  of  change 
process  that  bring  people  and  organizations  into 
some  kind  of  alignment  with  and  in  adjustment 
to  each  other; 

•  policy  instruments,  such  as  incentives,  directives, 
or  the  provision  of  information  (or,  more  simply, 
sticks,  carrots,  and  sermons,  which  form  the 
canonical  trinity  of  types  of  tools  of 
government);  or 

•  very  specific  interorganizational  arrangements, 
such  as  joint  committees,  joint  budgets,  or  even 
activities  or  outputs  such  as  a  “one-stop  shop” 
or  shared  services  centers  that  are  outlets  to  the 
service  users  or  the  public. 

For  the  most  part,  governments  use  these  means 
in  combination.  Rarely  do  they  rely  exclusively  on 
one  type  of  process  or  instrument  even  in  a  single 
field  of  policy,  let  alone  across  the  whole  span  of 
public  management.  Much  of  the  recent  effort  in  the 
study  of  coordination  is  now  devoted  to  the  effort 
to  understand  the  likely  consequences,  in  different 
circumstances,  of  different  combinations  of  pro¬ 
cesses,  instruments,  and  arrangements.  A  common 
slogan  for  this  is  that  “it’s  the  mix  that  matters.” 

Just  what  relationship  these  means  of  coordina¬ 
tion  have  to  the  goals  of  coordination  is  the  subject 
of  some  disagreement  and  some  disappointment. 
For  example,  some  researchers  have  found  that  the 
very  proliferation  of  large  numbers  of  joint  com¬ 
mittees,  budgets,  and  customer  service  centers  itself 
produces  new  kinds  of  fragmentation.  Certainly,  all 
these  things  are  expensive  to  establish  and  run.  This 
raises  the  question  of  whether  the  gains  in  improved 
policy  outcomes  are  worth  the  higher  administra¬ 
tive  costs  of  introducing  and  operating  all  the  coor¬ 
dinating  machinery.  There  is  also  evidence  from  the 
study  of  mergers  that  directing  managerial  atten¬ 
tion  to  the  working  of  committees  and  budgets  and 
centers  can  distract  them,  sometimes  for  quite  long 


periods,  from  the  larger  policy  goals,  while  they 
concentrate  on  the  minutiae  of  interorganizational 
relations.  It  would  be  quite  wrong  to  say  that  coor¬ 
dination  efforts  typically  fail  or  cost  more  than  they 
are  worth.  Indeed,  the  literature  includes  many 
positive  evaluations.  But  as  yet,  we  lack  sufficient 
evidence  to  make  many  general  claims  about  when, 
where,  and  why  the  means  support  the  goals  and  to 
what  degree  they  can  be  expected  to  do  so. 

Origins 

Coordination  has  been  an  eternal  concern  of  gov¬ 
ernment  and  the  public  services.  Between  the  dif¬ 
ferent  branches  of  the  armed  forces,  it  has  been  a 
topic  for  institutional  innovation,  political  anxi¬ 
ety,  and  intellectual  work  since  antiquity.  The 
need  to  reduce  conflict  between  the  impacts  of  the 
tax-collecting  and  economic  activity-stimulating 
work  of  different  functions  of  government  was  a 
matter  of  enduring  concern  throughout  the 
Roman  Empire.  Incoherence  and  conflicting 
jurisdiction  became  a  major  problem  for  the 
absolutist  monarchs  of  early-modern  Europe, 
because  the  combination  of  the  sale  of  offices,  the 
farming  of  taxes,  and  the  conflict  of  monarchical 
laws  with  local  systems  of  laws  produced  grow¬ 
ing  contradictions. 

With  the  development  of  responsibilities  for  the 
welfare  of  citizens  in  domestic  policy  toward  the 
late  19th  and  early  20th  centuries,  the  issue  came 
to  be  of  central  importance  in  many  countries.  The 
“Progressive  Era”  reforms  in  late- 19th-century 
United  States  produced  some  systematic  thought 
on  the  question.  In  British  imperial  public  admin¬ 
istration  and  especially  in  India  after  1857,  the 
need  for  coordination  became  a  constant  litany. 

In  1918  in  Britain,  the  Haldane  Committee 
published  its  report,  examining  more  systemati¬ 
cally  than  any  previous  study  had  done  the  various 
principles  and  administrative  designs  available  by 
which  greater  coordination  might  be  pursued. 
Options  considered  included  those  of  organizing 
around  geographical  jurisdiction,  clientele,  or 
function,  each  requiring  different  approaches  to 
horizontal  interorganizational  linkage. 

In  the  decades  after  World  War  II,  many  coun¬ 
tries  experimented  with  mergers  to  produce  ever 
larger  departments  or  groups  of  departments  in  the 
hope  that  this  would  produce  greater  cohesion,  but 
the  results  were  generally  disappointing.  In  the 
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1960s,  some  came  to  pin  their  hopes  for  coordina¬ 
tion  on  techniques  of  central  control  rather  than  in 
structural  change,  emphasizing  management  by 
objectives,  program  analysis,  and  review  and  other 
forms  of  managerial  and  financial  accountability. 
Yet  by  the  mid-1970s,  disappointment  with  these 
too  had  become  widespread. 

The  1980s  and  1990s  were  characterized  in 
Britain,  New  Zealand,  Australia,  and  the  United 
States  by  a  series  of  reforms,  closely  documented  in 
the  scholarly  literature,  which  are  commonly 
labeled  the  new  public  management,  although  this 
term  probably  gives  them  greater  unity  than  they  in 
fact  exhibited.  By  the  mid-  to  late  1990s,  concern 
was  growing  that  these  reforms  had  overempha¬ 
sized  the  role  of  the  dedicated  specialist  agency, 
often  working  under  a  contract  for  the  performance 
of  its  own  principal  function  exclusively,  to  the 
detriment  of  coordination.  In  Britain,  the  aspiration 
for  a  major  initiative  at  every  level  toward  greater 
cross-government  working  was  given  the  label 
“joined-up  government”  by  the  New  Labor  admin¬ 
istration  that  came  to  power  in  1997;  in  practice, 
its  interest  in  this  waned  significantly  after  a  few 
years.  In  Australia,  the  preferred  term  was  the 
whole  of  government  approach;  political  commit¬ 
ment  there  was  sustained  for  a  longer  period.  The 
movement  had  no  national  programmatic  emphasis 
in  the  United  States,  not  least  because  many  of  the 
functions  that  would  be  candidates  for  spearhead¬ 
ing  greater  coordination  are  in  that  country  admin¬ 
istered  at  state  level.  Indeed,  it  was  at  the  state  level 
that  a  wide  range  of  efforts  in  things  described 
variously  as  integration,  horizontal  working,  and 
“network  governance”  came  to  be  pursued. 

In  practice,  different  reform  programs  have 
pursued  rather  different  things  in  the  name  of 
coherence  and  coordination.  In  some  countries, 
the  priorities  have  been  around  integrating  access 
to  specific  local  services  targeting  the  same  clien¬ 
teles,  through  “one-stop  shop”  centers  on  main 
business  streets  or  online.  In  others,  the  priority 
has  been  the  formation  of  joint  boards  between  a 
modest  number  of  particular  public  authorities  to 
manage  special  programs.  In  yet  others,  the  princi¬ 
pal  effort  has  been  put  into  designing  systems  of 
government-wide  accountability  to  sharpen  incen¬ 
tives  for  the  pursuit  of  particular  outcomes,  irre¬ 
spective  of  agency;  finally,  for  some  governments, 
improving  the  mutual  understanding  and  joint 


efforts  of  their  civil  servants  working  in  different 
departments  of  state  to  formulate  policies  is  the 
greater  priority. 

It  seems  that  the  high  watermark  of  this  most 
recent  tide  of  efforts  in  coordination  had  passed  by 
the  late  2000s,  as  governments  became  more  inter¬ 
ested  in  “modernization,”  supporting  individual 
service  user  choice  of  provider,  or  as  fiscal  posi¬ 
tions  deteriorated  in  the  wake  of  the  financial  and 
then  the  general  economic  crisis.  No  doubt,  how¬ 
ever,  in  due  course,  interest  in  aspirations  for 
enhanced  coordination  will  revive. 

Although  some  advocates  in  the  late  1990s  had 
perhaps  hoped  otherwise,  few  votes  have  been  won 
on  the  basis  of  greater  coordination.  At  best,  the 
contribution  to  electoral  success  achieved  by  coordi¬ 
nation  is  indirect.  Moreover,  many  of  those  who 
make  the  greatest  use  of  those  frontline  public  ser¬ 
vices  for  which  coordination  is  typically  a  high  pri¬ 
ority  are  often  poor  and,  therefore,  typically  less 
likely  to  vote.  But  this  measure  understates  the 
political  importance  of  the  issue.  Coordination  has 
provided  governments — and  in  particular,  center- 
left  governments — with  an  aspiration  and  ideas  for 
programs  of  reform  to  the  machinery  of  government 
with  which  to  answer  the  charge  that  government  is 
somehow  inherently  incapable  of  improving  its  per¬ 
formance.  Although  the  details  of  administrative 
arrangements  are  of  little  interest  to  most  voters,  the 
ideological  battle  between  those  who  hope  for  better 
government  and  those  who  argue  for  less  govern¬ 
ment  is  a  central  one  in  defining  political  projects, 
and  it  is  to  this  conflict  that  the  debates  about  coor¬ 
dination  make  their  political  contribution. 

Theories 

Four  kinds  of  questions  about  coordination  have 
been  the  principal  subjects  for  theoretical  work  in 
the  literature: 

1.  explaining  the  waxing  and  waning  over  time 
and  the  differences  between  countries  in 
political  commitment  to  put  into  action 
programs  of  reform  for  coordination; 

2.  explaining  the  emergence,  whether  by  conscious 
and  deliberate  choice  or  otherwise,  of  forms  of 
interorganizational  relations  that  have  prevailed 
in  pursuit  of  coordination  in  different  countries, 
policy  fields,  and  so  on; 
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3.  explaining  the  choice  of  instrumentation  made 
by  particular  governments  in  pursuit  of 
coordination;  and 

4.  explaining  the  conditions  under  which  we  can 
expect  certain  programs  to  be  successfully  put 
in  place  administratively,  or — more 
ambitiously — to  make  a  positive  difference  to 
outcomes. 

On  the  first  and  second  questions,  academic  writ¬ 
ing  about  coordination  is  a  field  where  empirical 
work  dominates  both  theory  development  and  the¬ 
ory  testing.  Certainly,  the  grand  explanatory  frame¬ 
works  of  rational  choice,  historical  institutionalism, 
structural  functionalism,  ideationalism,  and  so  forth 
that  dominate  much  of  political  science  and  interna¬ 
tional  relations  research  and  from  which  particular 
theories  are  often  derived  are  not  addressed  by 
many  scholars  working  on  coordination. 

The  most  commonly  deployed  argument  in 
answer  to  each  of  the  four  questions  is  the  broadly 
historical  institutionalist  one  that  the  prior  institu¬ 
tionalization  of  departments,  organizations,  tiers, 
and  functions  among  the  public  services  continues 
to  inhibit  the  scope  for  greater  coordination  and 
joint  working  across  boundaries.  Much  of  the  lit¬ 
erature  is  devoted  to  the  identification  of  institu¬ 
tionalized  barriers  to  cooperation.  Making  lists  of 
barriers,  obstacles,  constraints,  or,  conversely, 
facilitating  conditions  abounds.  What  is  often 
unclear  about  the  theoretical  status  and  role  of 
these  sorts  of  factors  is  whether  they  are  offered  as 
necessary  conditions  for  successful  implementation 
of  any  particular  level  and  degree  of  coordination, 
jointly  necessary  conditions  for  it  to  be  sought  at 
all,  or  simply  factors  that  influence  probabilities. 
Nor  is  it  very  often  clear  what  causal  relationship 
these  structural  factors  have  with  agency.  A  partial 
exception  is  the  attempt  by  Perri  6,  Diana  Leat, 
Kimberly  Seltzer,  and  Stoker  Gerry  (2002)  to 
apply  the  neo-Durkheimian  institutional  theoretic 
tradition  to  develop  theories  about  both  the  first 
and  the  second  questions. 

On  the  first  question,  6  and  colleagues  argue 
that  political  commitment  to  coordination  is  sus¬ 
tained  principally  during  periods  of  centrist  gov¬ 
ernment  and  that  this  pattern  can  be  traced  back 
throughout  the  20th  century.  By  contrast,  Helen 
Sullivan  and  Chris  Skelcher  argue  that  it  is  a  recent 
development  and  an  artifact  of  reaction  against  the 


excesses  of  the  reforms  of  the  1980s  and  1990s 
commonly  referred  to  as  the  “new  public  manage¬ 
ment.”  The  disagreement  here  turns  on  empirical 
questions  about  just  how  different  the  recent 
efforts  in  coordination  really  are  from  those  pur¬ 
sued  in  previous  periods  of  administrative  history. 
Whereas  6  and  colleagues  and  Christopher  Hood 
see  continuities  in  the  elementary  forms  of  organi¬ 
zation  and  interorganizational  relations,  Sullivan 
and  Skelcher  argue  that  the  range  of  tasks  with 
which  the  public  services  are  charged  now  and  the 
context  of  the  reaction  against  the  excesses  of  the 
reforms  of  the  1980s  represent  a  distinct  historical 
conjuncture. 

Most  theorizing  about  the  second  question  has 
used  the  conventional  classification  of  types  or 
forms  of  interorganizational  relations  that  parses 
them  into  hierarchies,  markets,  and  networks.  This 
trichotomy  has  its  roots  in  institutional  theories  in 
economies  descending  from  Ronald  Coase,  William 
Ouchi,  and  Oliver  Williamson.  Nonetheless,  there 
has  been  growing  recognition  of  the  fact  that  many 
arrangements  or  relationships  that  exhibit  the  for¬ 
mal  features  of  markets  or  networks  are,  at  the 
level  of  their  informal  institutions,  more  genuinely 
hierarchical.  Conversely,  it  has  long  been  recog¬ 
nized  that  many  formally  hierarchical  partnerships 
are,  in  their  informal  institutions,  highly  individu¬ 
alistic.  Finally,  according  to  Perri  6,  Nick  Goodwin, 
Edward  Peck,  and  Tim  Freeman,  it  has  been  found 
that  the  concept  of  a  “network”  increasingly  does 
not  correspond  to  any  single  institutional  form. 

Authors  have  differed  in  the  extent  to  which 
they  have  sought  to  use  the  trichotomy  for  truly 
theoretical  and  explanatory  purposes  or  the  extent 
to  which  it  has  been  used  mainly  descriptively  as  a 
coding  schema.  For  Koen  Verhoest,  Geert 
Bouckaert,  and  Guy  Peters,  the  three  institutional 
forms  achieve  their  explanatory  force  by  enabling 
the  operation  of  what  they  describe  as  mecha¬ 
nisms ,  a  term  that  in  their  usage  largely  corre¬ 
sponds  to  policy  instruments  used  for  control, 
incentive,  or  persuasion  and  for  accountability. 
Some,  such  as  Sullivan  and  Skelcher,  propose 
positive  feedback  explanations,  such  that  prior 
and  latent  institutional  patterns  are  reinforced  dur¬ 
ing  periods  of  deliberate  effort  to  cultivate  greater 
coordination:  This  sits  slightly  uneasily  with  their 
negative  feedback  theory  in  answer  to  the  first 
question,  and  reconciliation  depends  on  very  exact 
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periodization  that  has  not  yet  been  tested  with 
detailed  historical  research. 

There  have  been  many  attempts  to  engage  in 
theory  development  in  the  field  using  concepts  of 
trust  and  networks.  It  is  common  to  find  authors, 
such  as  Robert  Agranoff,  claiming  that  coordina¬ 
tion  is  best  facilitated  by  nonhierarchical,  high- 
trust  consensual  managerial  practices,  although  it 
is  rather  rare  to  find  close  comparative  analysis  of 
cases  where  more  hierarchical  approaches  are 
used,  holding  the  service  field  and  political  juris¬ 
diction  constant;  this  leaves  continuing  question 
marks  over  these  kinds  of  claims. 

It  is  common  to  find  small-N  studies  used  to 
develop  necessary-condition  hypotheses  about  the 
extent  of  prior  trust  or  prior  operation  of  certain 
patterns  of  networks  of  organizations  before  we  can 
expect  to  see  even  administrative  implementation  of 
programs  being  sustained,  let  alone  positive  impact 
on  outcomes.  Other  studies,  such  as  that  by  Jeanette 
Moore,  Robert  M.  West,  Justin  Keen,  Mary 
Godfrey,  and  Jean  Townsend,  or  by  H.  Brinton 
Milward  and  Keith  Provan,  are  concerned  with  net¬ 
works  of  referral  of  clients  between  organizations, 
and  they  draw  on  sociometric  approaches  to  net¬ 
work  analysis  to  offer  predictive  understandings  of 
these  relations  between  organizations  and  services. 

Eugene  Bardach,  presenting  his  argument  rela¬ 
tively  informally,  has  offered  a  strategy  for  answer¬ 
ing  the  second  question  by  reference  to  a  thesis  that 
assumes  a  rather  strongly  individualistic  institu¬ 
tional  character  of  organization,  arguing  that  such 
a  setting  allows  individual  entrepreneurs  to  broker 
their  way  into  fluid  patterns  of  interorganizational 
relations,  in  which  trust  is  taken  to  be  a  largely 
pragmatic  affair.  In  a  similar  vein,  there  is  a  large 
body  of  work  inferring  from  qualitative  work  on 
the  role  of  particular  individuals  that  “leadership” 
is  a  necessary  condition  for  success,  although  quite 
what  types  of  leadership  have  just  which  effects 
and  in  which  circumstances  remains  ill  specified. 

The  fourth  question,  about  conditions  for  “suc¬ 
cess,”  is  most  frequently  addressed — for  example, 
by  Bob  Hudson,  Brian  Hardy,  Melanie  Henwood, 
and  Gerald  Wistow — using  theories  of  organiza¬ 
tional  process,  ranging  from  theories  of  the  life 
cycles  of  initiatives  through  to  more  eclectic 
approaches  that  combine  institutional  constraints 
with  models  of  interaction  processes  and  organiza¬ 
tional  capability  theory. 


Opposing  Principles 

Advocates  of  coordination  argue  that  the  six  con¬ 
cerns  discussed  above  (in  the  “Goals  and  Means” 
section  of  this  entry)  are  problems  to  be  combated, 
as  a  matter  of  priority.  But  by  no  means  do  all 
governments  or  policymakers  agree.  For  many, 
conflict,  contradiction,  and  competition  may  be 
regarded  as  broadly  healthy,  and  fragmentation 
and  duplication  might  be  regarded  as  losses  that 
are  acceptable  as  the  price  of  other  important 
gains.  For,  indeed,  coordination  is  directly  opposed 
to  a  number  of  principles  for  the  organization  of 
policy  making  and  of  public  services  that  have  long 
had,  and  continue  to  have,  powerful  advocates. 

Four  of  the  commonest  principles  that  can  and 
sometimes  have  implied  contrary  principles  of 
organization  might  be  called  (1)  specialism  in 
accountability ,  (2)  specialism  in  creative  tension , 
(3)  liberal  constitutionalism ,  and  (4)  populism. 

Specialism  is  the  argument  that  organizing  the 
principal  accountabilities  of  policy  making  and 
public  services  around  functions,  by  which  we  con¬ 
ventionally  mean  the  administration  of  discrete 
services  that  call  for  particular  bodies  of  knowl¬ 
edge,  skill,  professionalism,  and  so  on,  is  the  most 
effective  way  either  to  ensure  efficient  use  of  inputs 
or,  perhaps,  to  secure  the  responsiveness  of  the 
services  to  the  will  of  the  government  of  the  day.  It 
is  generally  much  more  straightforward  to  attach 
estimates  of  costs  and  control  of  expenditure  to 
inputs  than  it  is  to  attach  them  to  outputs  or  out¬ 
comes,  though  it  is  true  that  outcomes  such  as 
upward  social  mobility,  health  status,  levels  of 
educational  attainment,  the  rate  of  unemployment, 
or  the  prevailing  level  of  the  fear  of  crime  are  each 
affected  by  the  work  of  many  agencies,  depart¬ 
ments,  and  services.  But  measuring  the  contribu¬ 
tion  of  each  to  the  outcome  in  a  way  that  enables 
accountability  or  managerial  control  is  extremely 
challenging;  indeed,  the  particular  relationships 
may  not  be  very  stable  over  time,  yet  systems  of 
managerial  control  have  to  be  kept  in  place  for 
some  time  if  we  are  to  be  able  to  tell  whether  they 
are  capable  of  being  implemented  at  all,  let  alone 
if  we  want  to  gather  evidence  about  whether  they 
might  or  might  not  be  effective.  We  can,  in  short, 
manage  and  hold  accountable  each  of  the  services 
for  what  they  can  do,  but  we  will  find  it  difficult 
to  manage  and  hold  each  to  account  for  what  they 
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contribute  to  what  all  should  care  about.  Second, 
the  specialist  position  argues  that  trying  to  manage 
and  hold  them  to  account  for  shared  outcomes  in 
practice  results  in  the  blurring  of  imperatives, 
scope  for  buck-passing,  and  the  blunting  of  the 
commitment  to  the  best  use  of  specialist  expertise, 
which  is  often  critical  to  the  best  use  of  the  profes¬ 
sionals  of  medicine,  policing,  teaching,  counseling, 
land  use  planning,  building,  and  even  administer¬ 
ing  of  payments  and  processes  that  make  up  the 
interventions  supplied  by  the  public  services.  In  the 
reforms  of  public  management  that  were  adopted 
in  the  central  government  in  Britain  and  New 
Zealand  and  many  other  administrations  during 
the  1980s,  the  argument  of  specialism  was  widely 
deployed,  often  under  the  banners  of  “managerial 
focus”  and  the  “dedicated  agency.” 

A  second  kind  of  argument  for  specialism 
directly  contests  the  claim  with  which  this  entry 
began:  that  better  outcomes  result  from  reducing 
tensions  between  the  policies  implicitly  or  explicitly 
pursued  by  agencies.  On  the  contrary,  it  could  be 
suggested  that  it  is  precisely  these  tensions  between 
the  goals  of  different  agencies  that  spur  thought, 
innovation,  adjustment,  development,  and  sophis¬ 
tication  related  to  both  means  and  ends  in  those 
agencies.  Tension,  according  to  this  argument,  can 
be  creative.  At  any  one  time,  it  may  appear  stati¬ 
cally  inefficient,  because  managing  tensions  must 
always  take  up  managerial  time  and  often  cause  no 
little  irritation.  It  is  easy  to  fall  into  the  trap  of 
imagining  that  if  the  tensions  did  not  exist,  then 
that  time  would  be  better  deployed.  However,  this 
view  would  suggest  that  it  is  not  necessarily  or  even 
typically  true.  Rather,  too  much  pressure  for  coor¬ 
dination  may  undermine  the  creativity  that,  over 
the  longer  term,  might  produce  much  greater 
dynamic  efficiency,  which  would  more  than  com¬ 
pensate  for  any  loss  of  static  efficiency. 

The  liberal  constitutionalist  argument  holds 
that  both  the  coordinators  and  the  specialists  are 
misguided  in  their  pursuit  of  either  effectiveness  or 
efficiency.  Instead,  the  liberal  constitutionalist 
fears  the  excess  and  abuse  of  governmental  power 
above  all  and  seeks  to  harness  conflict,  contradic¬ 
tion,  competition,  and  incoherence  as  sources  of 
checks  and  balances  and  as  a  mutually  countervail¬ 
ing  force  for  the  competing  imperatives.  Based  on 
this  view,  the  real  threat  is  not  that  government 
may  not  work  very  well  but  that  it  might  become 


too  powerful;  we  should  be  prepared  to  accept  the 
consequences  for  any  measures  used  to  derail  it,  in 
order  to  prevent  its  becoming  overweening. 

The  populist  argument  is  that  government 
should  be  organized  around  the  principle  that  the 
people  should  be  able  immediately  to  understand 
its  organization  and  that  the  units  to  which 
accountability,  budgets,  legal  powers,  and  other 
resources  are  granted  should  be  defined  around 
things  that  the  public  can  instantly  recognize  as 
having  to  do  with  tangible  services.  Organizing 
around  grandiose  and  abstract  outcomes  and  inter¬ 
departmental  committees,  creating  crosscutting 
matrices  of  partnerships  and  joint  budgets,  and 
combining  professions  and  services  under  headings 
that  correspond  to  no  single  category  of  work  may 
all  be  valuable  instruments  if  all  we  ever  cared 
about  were  that  government  produce  results  that 
can  be  measured  statistically  (see  below  on  instru¬ 
ments  of  coordination).  However,  for  the  populist, 
the  most  important  principle  for  the  organization 
of  government  is  not  the  achievement  of  results 
that  are  measured  in  that  way  but  the  intelligibility 
and  immediacy  of  the  organizational  system  of 
government  to  the  governed  and  the  ability  of  the 
governed  to  look  into  that  system  and  readily 
appreciate  what  it  does.  If  for  the  specialist,  loss  of 
managerial  control  over  inputs  for  efficiency  is  the 
great  risk,  and  if  for  the  liberal  constitutionalist, 
the  overweening  ambition  and  intrusion  of  govern¬ 
ment  is  the  primary  threat,  then  for  the  populist, 
the  worst  prospect  is  a  system  of  government  that 
is  opaque  and  technocratic.  In  practice,  populism 
has  often  led  to  an  emphasis  on  organizing  govern¬ 
ment  according  to  the  principles  of  clientele,  terri¬ 
tory,  or  jurisdiction  or  visible  service  function. 

Readers  of  the  history  of  the  American  Revolution 
will  recognize  the  roots  of  specialism,  liberal  consti¬ 
tutionalism,  and  populism  in  at  least  some  of  the 
ideas  associated  with  the  names  of  Alexander 
Hamilton,  James  Madison,  and  Thomas  Jefferson, 
respectively.  The  enduring  political  significance  of 
these  rival  traditions,  and  the  importance  of  media 
and  pressure  group  activity  to  further  their  claims, 
forms  part  of  the  explanation  for  the  fact  that  the 
periods  when  coordination  is  a  central  strategic 
commitment  of  reforming  governments  tend  not  to 
last  very  long.  Sooner  or  later,  however  strongly 
they  are  committed  to  coordination,  politicians 
must  begin  to  give  some  weight  to  some  of  these 
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other  considerations.  Bluntly,  politics  must  care 
about  many  things  in  government  other  than  the 
question  of  whether  or  not  it  works  very  well. 

Instruments 

The  range  of  available  policy  instruments,  mecha¬ 
nisms,  tools,  or  administrative  arrangements  by 
which  coordination  can  be  pursued  is  very  wide 
indeed.  However,  for  the  sake  of  convenience,  they 
can  be  grouped  as  follows: 

Bringing  Together  Inputs.  This  approach  to  the 
promotion  of  coordination  spans  the  creation  of 
budgets  held  jointly  by  two  or  more  departments 
or  agencies  for  a  defined  purpose:  the  creation  of 
common  information  systems,  programs  of  joint 
training,  and  individual  posts  that  report  to  two  or 
more  agencies  and  pursue  goals  for  which  each 
agency  has  some  part  of  the  overall  responsibility. 

Bringing  Together  Organizations.  Although  one 
extreme  of  the  distribution  on  this  group  of 
instruments  includes  the  full  merger  of  whole 
departments  and  agencies,  there  is  a  plethora  of 
other  ways  in  which  organizations  can  be  brought 
together.  Joint  matrix  management  is  sometimes 
possible  for  organizations  that  have  not  fully 
merged  but  where  each  is  asked  to  work  toward 
goals  previously  expressed  principally  within 
another  organization.  The  commonest  instrument 
is  the  formal  partnership,  with  its  own  joint  board, 
but  there  are  also  several  kinds  of  joint  subsidiaries 
for  joint  ventures,  all  of  which  may  be  temporary 
or  permanent  and  may  cover  all  or  only  some  of 
the  functions  of  the  participating  agencies. 

Bringing  Together  Accountability  and  Authority. 
Holding  agencies  to  account  for  their  contribution 
to  achievements  or  problems  for  which  each  has 
some  responsibility  can  be  done  in  a  variety  of 
ways.  Financially,  the  release  of  funds  can  be  made 
conditional  on  the  making  of  certain  kinds  of 
efforts  and  commitments.  Managerially,  executive 
oversight  systems  can  be  put  in  place  that  provide 
for  review  and  accountability  of  teams  and 
individual  managers  on  the  contribution  of  their 
work  to  goals  that  run  beyond  those  of  their 
immediate  function.  Parliamentary  scrutiny  can  be 
exercised  in  a  manner  that  does  not  confine  itself  to 


the  oversight  of  its  particular  departments  by  each 
committee.  Panels  of  service  users  can  be  recruited 
and  supported  to  scrutinize  the  performance  of 
many  services  to  meet  the  needs  or  achieve  the 
goals  that  they  care  and  are  informed  about. 

Types  and  Levels 

Coordination  takes  its  place  in  an  extensive  lexi¬ 
con  of  cognate  terms.  At  least  some  of  the  com¬ 
moner  ones  are  noted  below.  These  terms  can  be 
grouped  under  the  four  headings  shown  below. 
The  definitions  of  these  cognate  terms  that  are 
presented  serve  to  display  some  of  the  main  ways 
in  which  scholars  have  developed  typologies  and 
taxonomies  for  types  of  strategic  approach,  levels 
of  coordination,  and  policy  instruments  used  to 
pursue  those  different  approaches. 

Strategic  Approaches 

Horizontal  government:  a  synonym  for  coordination 

Joined-up  government:  coordination  to  combat 
inconsistency  in  activities  and  goals 

Holistic  government:  coordination  to  pursue 
settlements  in  which  potentially  rival  imperatives 
are  made  mutually  reinforcing  in  pursuit  of 
enhanced  outcomes 

Levels 

Whole-of-government  approach:  coordination 
relying  principally  on  bringing  together 
accountabilities  and  oversight  authority 

Collaboration:  any  kind  of  coordination  that 
involves  bringing  together  either  inputs  or 
organizations 

joint  working:  collaboration  on  specific  programs 
rather  than  for  the  whole  pattern  of  relations 
between  organizations 

Integration:  coordination  that  emphasizes  bringing 
together  organizations  in  especially  tight  ways — for 
example,  through  long-term,  tightly  supervised 
strategic  partnerships  or  even  mergers 

Interdepartmental  working:  coordination  at  the  tier 
of  central  government  policy  formulation  work 

Multiagency  working:  coordination  in  which  the 
participating  agencies  retain  their  distinct  identities 
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Interagency  working:  coordination  that  seeks  to 
blur  the  distinct  identities  of  the  participating 
agencies 

Interprofessional  working:  coordination  at  the  level 
of  the  professions  rather  than  the  organizations 

Instruments 

Partnership:  a  formal  structure  with  a  constitution, 
a  board,  legal  powers,  a  legal  personality,  and  a 
specific  task,  in  which  participating  organizations 
take  full  membership 

Collaborative:  a  small  team  comprising 
professionals  from  several  distinct  professions 
working  together  with  a  distinct  clientele  or  on  a 
series  of  consecutive  processes 

One-stop  shop:  a  “Main  Street”  business  area- 
based  or  web-based  facility  in  which  citizens  can 
access  a  wide  range  of  services  in  one  place 

Problems  Targeted 

Crosscutting  issue:  any  issue  addressed  by 
coordination  that  affects  a  wide  range  of  different 
agencies,  organizations,  and  professions 

Wicked  issue:  any  issue  addressed  by  coordination 
that  affects  a  wide  range  of  different  agencies, 
organizations,  or  professions;  where  the 
measurement  of  success  in  addressing  the  problem 
raises  fundamental  methodological  problems;  and 
where  the  causality  by  which  interventions  work 
appears  to  be  nonlinear 

Evidence  of  Effectiveness 

There  is  a  truly  vast  body  of  literature — including 
official  governmental,  practitioner,  and  peer- 
reviewed  research  texts — that  claims  to  offer  evi¬ 
dence  for  and  against  the  effectiveness  either  of 
some  of  the  particular  recent  programs  or  else  of 
the  uses  of  particular  instruments.  Unfortunately, 
much  of  this  literature  is  rather  unsatisfactory. 
Most  studies  are  not  conducted  to  evaluate  effec¬ 
tiveness  against  any  clear  alternatives  used  for 
comparison  or  control.  Indeed,  it  is  not  always 
clear  just  what  the  relevant  practicable  alternative 
program  or  instrument  might  be,  against  which 
comparison  ought  to  be  made.  Before-and-after 


comparisons  have  well-known  methodological 
weaknesses  for  the  purpose  of  determining  effec¬ 
tiveness.  Moreover,  most  studies  examine  very 
small  numbers  of  initiatives,  limiting  the  scope  for 
comparison.  Because  most  of  the  outcomes  for  the 
environment,  poverty,  employment,  health  status, 
fear  of  crime,  rates  of  reported  crime,  and  so  on 
are  affected  by  a  great  many  other  factors  than 
those  of  the  deployment  of  instruments  of  coordi¬ 
nation,  for  which  few  studies  can  realistically  con¬ 
trol,  it  is  very  difficult  to  place  much  confidence  in 
the  claims  made.  Recognizing  these  difficulties, 
most  authors  offer  only  process  evaluations,  rather 
than  making  claims  for  outcome  evaluations.  The 
process  evaluation  literature  is  replete  with  lists  of 
“barriers”  or  “obstacles,”  either  to  the  possibility 
of  attempting  particular  programs  of  coordination 
or  to  the  use  of  certain  instruments  at  all  or  else  to 
the  chances  of  achieving  satisfaction  in  the  self- 
reports  of  the  practitioners  studied  in  the  process. 
It  is  very  difficult  to  know  what  to  make  of  these 
many  lists.  For  many  of  the  entries  have  to  do  with 
things  such  as  apathy,  inertia,  mistrust  of  others, 
protection  of  bureaucratic  turf,  risk  aversion,  con¬ 
cern  for  professional  status,  and  fear  of  loss  of 
control  of  budgets  or  other  resources.  Many 
of  these  things  are  either  among  the  eternal  verities 
of  organizational  life  or  else  “barriers”  to  attempt¬ 
ing  anything  and  hardly  specific  to  the  instruments 
of  coordination.  At  this  stage  in  the  development 
of  the  research,  therefore,  it  is  hard  to  be  confident 
that  much  has  been  learned  from  the  studies  con¬ 
ducted  in  the  1990s  and  the  2000s  that  was  not 
already  known  from  the  many,  much  earlier  stud¬ 
ies  on  the  difficulties  experienced  in  the  joint 
implementation  of  policies  in  earlier  decades — for 
example,  from  Jeffrey  Pressman  and  Aaron 
Wildavsky’s  famous  (1973/1984)  study  of  the  dif¬ 
ficulties  of  multi-  and  interagency  working  in  U.S. 
efforts  to  implement  some  of  the  “Great  Society” 
economic  development  programs  launched  by  the 
Lyndon  Johnson  administration  in  the  1960s. 
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Corporativism 


Despite  a  centuries-long  debate  on  corporativist 
forms  of  political  representation  and  statehood,  the 
concept  of  corporativism  (corporatism)  is  still  elu¬ 
sive,  owing  to  insufficient  definition  and  ideologi¬ 
cal  controversy.  This  entry  discusses  the  definition 
and  the  basic  idea  of  corporativism  with  reference 
to  its  historical  roots  in  the  Middle  Ages  and  its 
subsequent  developments  up  to  the  19th  and  20th 
centuries.  The  more  recent  Catholic  element  and 
the  secular  one  are  also  presented.  Finally,  the 
authoritarian  features  of  corporatist  rule,  especially 
in  Latin  America  but  also  in  other  parts  of  the 
world,  are  discussed. 

The  Notion 

In  its  most  basic  meaning,  corporativism  refers  to  a 
political  power  structure  and  practice  of  consensus 
formation  based  on  the  functional  representation 
of  professional  groups.  Associations  of  farmers, 
craft  workers,  industrialists,  laborers,  lawyers,  doc¬ 
tors,  or  the  clergy  (churches)  act  as  self-governing 
bodies  on  their  own  behalf  and  as  intermediaries 
between  the  government  and  their  members. 
Political  status  and  rights  are  attached  to  occupa¬ 
tion  and  group  membership  and,  thus,  differ  from 
those  of  modern  citizenship  and  equal  representa¬ 
tion  of  individuals  in  parliaments  related  to  a  cer¬ 
tain  territory.  The  concept  of  corporativism  dates 
back  to  the  medieval  estates  and  guild  system.  It 
found  renewed  attention  among  romanticist  phi¬ 
losophers  in  the  Germanic  world  during  the  19th 
century  as  a  remedy  against  social  uprooting  in  the 
wake  of  industrialization  and  class  conflict.  The 
Catholic  social  teaching  has  drawn  inspiration 
from  corporativist  ideas,  as  have  a  number  of 
authoritarian  regimes  in  Europe  and  South 
America.  As  a  political  ideology,  corporativism 
has  been  fiercely  rejected  by  liberals  as  a  move¬ 
ment  that  would  elevate  collectivist  corporate 
bodies  to  the  cornerstones  of  politics  and  the 
economy  while  denying  individual  representation, 
civil  liberties,  free  competition,  and  democracy. 
Socialists  and  Communists  fought  the  idea  as  a 
particularly  oppressive  variant  of  capitalist  class 
rule  that  would  eliminate  the  Left  and  control  the 
working-class  masses  with  a  carrot-and-stick 
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approach.  Not  the  least,  it  was  the  modern  state 
itself  and  its  ideal  of  universal  citizenship,  territo¬ 
rial  instead  of  occupational  representation,  and 
indivisible  sovereignty  that  ran  counter  to  the 
establishment  of  corporate  power  holders  acting  as 
intermediaries  between  governments  and  segments 
of  the  society.  Nevertheless,  modern  welfare  states 
bear  some  features  of  corporativism  and  neo-cor- 
poratist  policy  making  that  have  become  manifest 
in  networks  and  negotiations  between  state  admin¬ 
istrations  and  powerful  corporate  actors  such  as 
business  associations  or  labor  unions  in  fields  such 
as  industrial  and  social  policy.  In  contrast  to  such 
current  patterns  of  corporatist  policy  making,  the 
term  corporativism  (corporatism,  corporativismo, 
corportativisme,  and  Korporativismus)  denotes  a 
specific  political  philosophy  and  controversial  ide¬ 
ology  as  well  as  a  political  regime  type. 

The  Model  and  Idea  of  Corporativism 

For  centuries,  debates  on  corporativism  referred — 
directly  or  indirectly — to  the  medieval  guild  system. 
Being  collective  bodies  (Latin  corpora),  medieval 
guilds  served  manifold  economic,  social,  cultural, 
religious,  and  not  the  least  political  functions  for 
their  professional  membership.  Among  them  one 
finds  the  setting  of  standards  for  quality,  prices  and 
wages,  education  and  work,  caring  for  widows  and 
orphans,  representation  on  town  councils,  serving 
in  courts  and  town  militia,  and  maintaining  chari¬ 
table  institutions  such  as  hospitals,  orphanages, 
poorhouses,  and  more.  With  the  rise  of  the  modern 
state  and  capitalist  economies,  those  tasks  became 
subject  to  royal  law,  state  administrations,  or  mar¬ 
kets.  The  decline  of  medieval  corporativism  began 
at  the  turn  of  the  1 4th  century.  It  was  driven  not  by 
absolutist  claims  to  sovereignty  at  first  but  by 
recurrent  pestilence,  severe  famine,  and  extreme 
weather  as  well  as  the  spread  of  Renaissance 
humanism  and  Reformationist  ideas  about  religion, 
science,  and  society  that  began  to  shake  the  old 
corporate  order  of  the  High  Middle  Ages  and 
paved  the  way  for  a  growing  emphasis  on  individu¬ 
alism,  centralized  territorial  rule,  and  secular 
supremacy. 

Medieval  Corporativism 

Elements  of  corporativism  appeared  in  many  dif¬ 
ferent  sites  of  medieval  political  philosophy, 


though  Johannes  Althusius  (1563-1638)  was  the 
first  to  formulate  a  comprehensive  theory  of  a 
corporativist  or,  in  his  words,  “consociationalist” 
constitution  where  the  medieval  order  had  already 
lost  ground  against  new  thoughts  of  monarchic 
absolutism  and  indivisible  territorial  sovereignty. 
Antony  Black  refers  to  Althusius  as  being  one  the 
“few  great  theorists  of  corporatism”  (Black, 
1984,  p.  141),  providing  us  “with  perhaps  the 
most  substantial  exposition  of  guild  ideas  ever 
known”  (p.  131).  The  universal  commonwealth 
(consociatio  universalis ),  Althusius  proclaims,  has 
to  be  understood — in  his  own  words — as  “an 
association  inclusive  of  all  other  associations 
(families,  collegia  [guilds],  cities,  and  provinces) 
within  a  determinate  large  area  and  recognizing 
no  superior  to  itself”  (Althusius,  1603/1964, 
chap.  12).  In  conceiving  the  social  contract  as  a 
real  pact  among  corporate  legal  entities  that  com¬ 
pose  society,  Althusius  differed  from  his  near¬ 
contemporary  Thomas  Hobbes,  who  thought  of  a 
single  agreement,  entered  into  by  individuals,  who 
commit  themselves  to  an  absolute  subjection  to  a 
common  power:  “one  Man  or  one  Assembly  of 
men,  that  may  reduce  all  their  Wills,  by  plurality 
of  voices,  unto  one  Will”  ( Leviathan ,  Part  II, 
chap.  17).  Althusius  had  a  notion  of  shared  sover¬ 
eignty  that  stands  in  deep  contrast  not  only  to 
Hobbes’s  unitarism  but  also  to  Jean  Bodin’s  doc¬ 
trine  of  monarchical  sovereignty.  Due  to  his 
emphasis  on  corporative  autonomy,  the  subsidiar¬ 
ity  principle,  and  the  multilevel  character  of  his 
constitutional  system,  Althusius  is  now  reputed 
for  being  an  early-modern  protagonist  and  fore¬ 
runner  of  federalism. 

Corporativism  has  been  criticized  for  its  emphasis 
on  collective,  instead  of  individual,  autonomy.  While 
this  is  true  for  authoritarian  concepts  connected  with 
various  semicorporativist  but  in  fact  centralist 
regimes  of  the  19th  and  20th  centuries,  early-modern 
“corporativist”  thinkers  like  Althusius  and  Marsilus 
of  Padua  fought  not  only  for  a  decentralized  polity 
but  also  for  popular  sovereignty.  In  formulating  that 
the  sovereign  rights  of  the  people  were  inalienable, 
unassignable,  and  imprescriptible,  Althusius  rejected 
Bodin’s  notion  of  sovereignty  according  to  which  the 
people  enter  into  the  social  contract  with  the  mon¬ 
arch  as  a  collectivity  and  not  as  free  individuals.  This 
is  not  surprising  if  one  considers  that  medieval  cor¬ 
porations,  both  in  constitutional  theory  as  in  the 
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everyday  life  of  guilds  and  communes,  was  marked 
by  a  constant  tension  and  interplay  between  the 
claims  of  individuals  and  those  to  be  found  on  the 
level  of  corporations  and  their  interrelations — a 
factually  conflict-ridden  power  structure  that  fed 
Unitarian  anticorporativist  political  sentiments  in 
the  long  run. 

In  serving  the  public  and  private  needs  not  only 
of  corporate  status  groups  but  also  of  communities 
at  large,  medieval  corporativism  ideologically 
aimed  at  an  organic  whole  and,  thus,  was  devoid 
of  modern  differentiations  such  as  those  between 
the  collective  and  the  individual,  government 
(state)  and  society,  politics  and  religion,  or  the 
public  and  the  private  sphere.  The  antimodernist 
overtone  of  corporativism  derives  from  its  organi- 
cist  idea  of  segmenting  society  into  an  articulate 
multiplicity  of  interrelated  semiautonomous  cor¬ 
porations  regulated  by  the  principle  that  men 
should  live  freely  only  in  the  narrow  sphere  of  their 
God-given  social  status. 

Challenging  the  Liberal  Paradigm 

The  medieval  model  of  a  static  organic  order 
received  new  attention  from  philosophers  and 
state  theorists  during  the  19th  century.  The  dark 
side  of  rapid  industrialization,  together  with  the 
decay  of  traditional  social  security  mechanisms 
and  an  emerging  class  conflict,  resulted  in  intel¬ 
lectual  attempts  to  find  alternatives  to  the  indi¬ 
vidualist  market-liberal  paradigm.  Corporativist 
ideas  came  up  again  after  the  French  Revolution, 
particularly  regarding  its  failures  and  disappoint¬ 
ments  of  widespread  beliefs  in  a  republican  solu¬ 
tion.  They  have  to  be  seen  as  an  attempt  to  recon¬ 
struct  intermediary  corporations  as  moral  institu¬ 
tions  to  support  communitarian  politicization  and 
individual  orientation  in  times  of  rapid  social 
change  and  as  a  barrier  against  social  uprooting. 
Corporativist  thoughts  in  the  works  of  Johann 
Gottlieb  Fichte,  Adam  Muller,  and  G.  W.  F.  Hegel 
(see  Tetsushi  Harada,  1989)  should  help  fill  the 
gap  between  a  society  of  individuals  and  the  gov¬ 
ernmental  administration  with  semiautonomous 
communal  institutions  that  would  eventually 
strengthen  the  state  as  an  embodiment  of  the  gen¬ 
eral  interest.  Social  stabilization  in  the  face  of  revo¬ 
lutionary  threats  has  been  the  one — conservative — 
facet  of  such  concepts,  whereas  the  protection  of 


craft  workers,  unskilled  workers,  and  industrial¬ 
ists  against  social  and  commercial  threats  repre¬ 
sents  a  more  constructive  if  not  humanistic  con¬ 
cern.  For  Hegel  (1770-1831),  corporativism  was 
the  solution  to  the  problem  of  an  increasingly 
atomized  society.  Starting  from  the  assumption 
that  the  rule  of  law  is  based  on  the  need  to  articu¬ 
late  modern  civil  society  as  the  realm  of  particu¬ 
larity,  on  one  hand,  and  the  state  as  the  concrete 
form  of  moral  generality,  on  the  other,  corpora¬ 
tions  are  meant  to  embrace  particularity  from 
below  and  generality  from  above  and,  thus,  to 
function  as  integrating  links  between  civil  society 
and  the  state.  In  his  critique  of  Hegel’s  Philosophy 
of  Right,  Karl  Marx  identified  the  Achilles  heel  of 
Hegel’s  corporativist  design  exactly  in  the  ambigu¬ 
ity  of  corporations  (Estates)  as  intermediate  power 
holders: 

While  the  Estates,  according  to  Hegel,  stand 
between  the  government  in  general  on  the  one 
hand  and  the  nation  broken  up  into  particulars 
(people  and  associations)  on  the  other,  the 
significance  of  their  position  ...  is  that,  in 
common  with  the  organized  executive,  they  are  a 
middle  term.  Regarding  the  first  position,  the 
Estates  represent  the  nation  over  against  the 
executive,  but  the  nation  en  miniature.  This  is 
their  oppositional  position.  Regarding  the 
second,  they  represent  the  executive  over  against 
the  nation,  but  the  amplified  executive.  This  is 
their  conservative  position.  They  are  themselves 
a  part  of  the  executive  over  against  the  people, 
but  in  such  a  way  that  they  simultaneously  have 
the  significance  of  representing  the  people  over 
against  the  executive.  (Marx,  1978,  p.  68) 

Corporatist  theories  have  always  struggled  with 
the  dilemma  of  intermediate  political  corporations 
to  represent  and  discipline  the  demands  of  their 
membership  at  the  same  time.  One  speaks  of  “soci¬ 
etal”  or  “state”  corporatism  depending  on  whether 
a  bottom-up  representative  approach  or  a  top- 
down  disciplinary  one  prevails. 

During  and  after  Hegel’s  time,  a  number  of 
scholars — Otto  von  Gierke  (1841-1921)  and 
Emile  Durkheim  (1858-1917)  in  particular — sug¬ 
gested  corporativist  associations  and  intermedia¬ 
tion  as  a  remedy  against  the  disorder,  social 
anomie,  and  isolation  resulting  from  a  growing 
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division  of  labor  and  the  crisis  of  traditional  insti¬ 
tutions  of  solidarity. 

Modern  Corporativist  Ideologies 

Besides  being  a  theoretical  concept  and  regime 
type,  corporativism  has  been  perceived  first  and 
foremost  as  a  political  ideology,  just  like  liberalism 
and  socialism — the  other  great  “ism,”  as  Howard 
J.  Wiarda  called  it.  Among  contemporary  secular 
ideologies,  it  is  not  only  the  oldest  but  also  the 
most  manifold  and  inconsistent  one.  It  is  difficult 
to  draw  a  clear  dividing  line  between  individualist 
liberalism  and  collectivist  socialism  since  corpora¬ 
tivist  thoughts  and  methods  took  up  elements  of 
market  liberalism  as  well  as  economic  planning.  It 
was  a  common  trait  of  20th-century  corporativism 
to  protect  private  property  and  simultaneously  to 
fight  against  free  competition  in  markets  and  poli¬ 
tics.  Experiences  in  Europe,  Latin  America,  and 
Asia  clearly  show  that  authoritarian  corporativism 
together  with  rightist  populism  has  served  as  an 
instrument  of  conservative  rule  in  periods  of  inten¬ 
sified  class  struggles  and  leftist  revolts. 

Papal  Encyclicals 

The  Roman  Catholic  school  of  thought  on  the 
social,  political,  and  economic  order  of  capitalism, 
as  put  down  first  in  the  Papal  encyclical  Rerum 
Novarum  (Of  New  Things,  1891)  and  supple¬ 
mented  by  Quadragesimo  Anno  (In  the  40th  Year, 
1931),  above  all  builds  on  the  corporativist  prin¬ 
ciples  of  solidarity  and  subsidiarity.  Rerum 
Novarum  has  to  be  seen  as  a  response  to  the  social 
instability  and  labor  conflicts  of  its  time.  It  advo¬ 
cates  a  “natural  right”  to  form  associations  and 
engage  in  free  collective  bargaining  and  makes  it 
the  duty  of  governments  to  protect  the  poor. 
Faced  with  strong  communist  and  fascist  move¬ 
ments  in  Europe,  Quadragesimo  Anno  also  con¬ 
centrates  on  the  dangers  to  human  freedom  and 
dignity  arising  from  unrestrained  capitalism  and 
totalitarian  communism. 

The  Catholic  plea  for  an  organic  and  hierarchi¬ 
cal  reconstruction  of  industrial  societies  was 
inspired  by,  and  in  turn  influenced  by,  the  works 
of  a  number  of  19th-century  social  philosophers 
from  Italy  (Frederic  Ozanam  and  Luigi  Taparelli 
D’Azeglio)  and  France  (Philip  Buchez,  Count 


Albert  de  Mun,  and  Count  La  Tour  du  Pin).  They 
imagined 

Catholic  Guilds  where  the  interest  of  the  profession 
is  superior  to  private  interest,  where  antagonism 
between  capitalist  and  workingman  gives  way  to 
patronage  exercised  in  a  Christian  spirit  and  freely 
accepted. ...  It  is  always  the  same  thought:  limit 
competition,  associate  common  interests,  impose 
upon  the  employer  the  duty  of  patronage,  uplift 
labor  and  the  condition  of  the  laborer.  (Albert  de 
Mun,  1847,  as  cited  in  Moon,  1921,  p.  99) 

Though  full  of  normative  ideology  and  never 
free  of  theoretical  inconsistency,  the  social  Catholic 
movement  succeeded  in  maintaining  a  fairly  coher¬ 
ent  political  reform  program  that  is  still  focused  on 
corporativist  attributes  of  solidarity  and  subsidiar¬ 
ity.  The  insistent  demand  for  subsidiary  autonomy 
of  the  social  sphere  resulted  in  a  somewhat  muted 
critique  of  totalitarian  fascism  in  Quadragesimo 
Anno.  The  Catholic  Church,  however,  never  did 
endorse — as  German  constitutional  lawyers 
feared — a  decomposition  of  statehood  in  the  name 
of  the  subsidiarity  principle. 

Secular  Corporativist  Ideologies 

Among  secular  corporativist  ideologies,  the  idea  of 
guild  socialism  found  a  number  of  supporters 
among  intellectuals  as  well  as  workers  and  labor 
unions  in  Britain.  Functional  representation  was 
one  of  the  most  distinctive  doctrines  of  British  guild 
socialism,  according  to  which  the  population  should 
be  represented  both  as  producers  and  as  consumers 
in  a  multilevel  system.  Municipalities,  regions,  and 
the  national  state  would  be  governed  by  two  cham¬ 
bers:  one  elected  by  professional  guilds  and  the 
other  by  territorial  constituencies.  George  D.  H. 
Cole,  who  formed  the  British  National  Guilds 
League  in  1915,  proposed  a  central  guild  congress 
that  should  be  the  supreme  industrial  body,  stand¬ 
ing  for  the  people  as  producers  in  the  same  way  as 
parliament  will  stand  for  the  people  as  consumers. 
In  advocating  the  cosovereignty  of  workmans’ 
guilds  and  the  representative  government  guild, 
socialism  rejected  the  traditional  notion  of  sover¬ 
eign  statehood.  The  concept  had  been  indirectly 
inspired  by  Althusius’s  corporativist-cum-federalist 
medieval  concept  through  the  reception  of  Gierke’s 


462  Corporativism 


theory  of  associations.  Part  of  Gierke’s  magnum 
opus  Das  Deutsche  Genossenschaftsrecht  (German 
Cooperative  Law)  was  published  in  Britain  in 
1990  as  Political  Theories  of  the  Middle  Ages  and 
subsequently  gave  an  impetus  to  a  newly  emerging 
pluralist  school  of  academic  political  thought,  of 
which  guild  socialist  ideologies  attained  the  most 
far-reaching  though  rather  short-lived  political 
impact.  The  sudden  decline  of  guild  socialism  after 
World  War  I  had  been  attributed  to  a  changing 
ideological  climate  that  was  no  longer  in  favor  of 
ideas  requiring  employers  and  business  owners  to 
share  or  give  up  control  over  industry. 

The  corpora tivist  wave  that  began  in  the  1920s 
after  World  War  I  was  conservative  and  ranged 
from  right-wing  to  totalitarian  political  ideologies. 
Authoritarian  nationalists  in  Franco’s  Spain, 
Salazar’s  Portugal,  Dollfuss’s  Austria,  Pilsudski’s 
Poland,  Vargas’s  Brazil,  Peron’s  Argentina,  or 
Calles’s  Mexico  made  use  of  corporativist  theories 
to  legitimize  their  claim  to  power  as  did  totalitar¬ 
ian  Fascists  throughout  Europe.  During  the  1930s, 
there  were  a  number  of  fascist  regimes  not  only  in 
Mussolini’s  Italy  and  Nazi  Germany  but  also  in 
Hungary,  Greece,  and  Romania  as  well  as  fascist 
movements  and  parties  in  almost  all  European 
countries.  The  corporativist  and  religious  elements 
of  fascism  had  been  strongest  in  Italy  and  among 
(though  also  present  within)  the  Falange  move¬ 
ment  in  Spain  and  Romania.  The  attempt  to  vio¬ 
lently  infiltrate  and  control  every  sector  of  social 
life,  together  with  their  militaristic  orientation, 
cast  doubts  over  whether  fascist  dictatorships  fall 
under  the  category  of  corporativism  at  all.  Of 
course,  there  were  a  host  of  separate  nongovern¬ 
mental  organizations  controlled  by  government 
for  all  kinds  of  purposes,  such  as  economic  plan¬ 
ning,  leisure,  sports,  science,  and  education,  but 
without  any  higher  coordinating  organs.  Gaetano 
Salvemini,  in  his  1936  book  Under  the  Axe  of 
Fascism,  noted  that  to  search  for  real  cooperation 
and  genuine  consultation  taking  place  through 
corporatist  institutions  was  like  “looking  in  a  dark 
room  for  a  black  cat  which  is  not  there.  ”  Contrary 
to  the  imagination  produced  by  their  propaganda 
machine,  fascist  regimes  generally  caused  bureau¬ 
cratic  confusion  and  provoked  the  decline  of  con¬ 
stitutional  and  administrative  order  for  the  benefit 
of  the  revolutionary  movement  and  violent  oppres¬ 
sion.  Today,  as  Juan  J.  Linz  has  pointed  out,  there 


is  widespread  consensus  not  to  subsume  totalitar¬ 
ian  fascism  under  the  corporati(vi)st  paradigm  but 
rather  to  treat  it  as  a  separate  regime  type. 

Authoritarian  Corporativism 

The  term  authoritarian  corporativism  originally 
referred  to  a  variety  of  political  regimes  in  Latin 
America.  Most  basically,  it  means  that  autocratic 
governments  seek  to  impose  a  system  of  interest 
representation  and  intermediation  on  functional 
interest  groups,  especially  labor  unions,  in  order  to 
deal  with  perpetual  threats  of  industrial  conflict 
and  popular  protest.  Different  structures  and  meth¬ 
ods  of  incorporation  across  countries  and  over 
time,  together  with  inconsistent  or  even  lacking 
principles  of  political  design,  bear  witness  to  the 
complexities  involved  in  this  political  regime  type. 

Authoritarian  corporativism  emerged  during  the 
stages  of  late  economic  development  from  situa¬ 
tions  where  weakly  integrated  societies,  widespread 
clientelism,  internal  migration,  and  organizational 
fragmentation  caused  governments  to  reorganize, 
mobilize,  and  control  socioeconomic  groups  in 
order  to  increase  their  predictability  and  eventually 
overcome  economic  stagnancy.  Whereas  the  landed 
gentry,  peasants,  rural  workers,  clerics,  and  shop¬ 
keepers  continued  to  be  grouped  territorially  along 
vertical  chains  of  patron-client  relations — irrespec¬ 
tive  of  class,  race,  caste,  or  region — up  to  the  gov¬ 
ernment  level,  new  urban  groups  associated  them¬ 
selves  along  occupational  and  class  lines  on  the 
national  level.  When,  spurred  by  economic  reces¬ 
sion,  the  traditional  landed,  export,  and  commer¬ 
cial  elite  failed  to  check  the  manifold  demands, 
political  divisions,  and  modernization  conflicts  of 
such  asynchronous  societies  “clientelism  declined 
in  favor  of  corporatism”  (Guillermo  O’Donnell, 
1977,  p.  67),  and  in  many  cases,  it  resulted  in 
hybrid  regimes  of  corporatist,  populist,  nationalist, 
and  military  authoritarian  rule. 

One  can  find  similar  sequences  of  events  in  many 
late-developing  nations,  with  similar  outcomes. 
South  Korea,  Taiwan,  Singapore,  Indonesia,  the 
Philippines,  China,  and  others  have  relied  on  vari¬ 
ous  forms  of  functional  representation  though  usu¬ 
ally  avoiding  the  label  corporati(vi)sm  because  of 
its  pejorative  connotation  of  authoritarianism. 
Most  East  Asian  developmental  states  borrowed 
heavily  from  Japan’s  earlier  experiences. 
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Government-assigned  farmer  associations  had 
been  established  there  in  the  early  1900s  for  rea¬ 
sons  of  productivity  improvement  and  political 
subordination.  During  the  1930s,  the  Japanese 
government  reorganized  the  small-  and  medium- 
business  sectors  into  peak  associations,  which 
were  sanctioned  to  control  their  memberships 
through  administrative  decrees.  Administrative 
guidance  and  cartelization  have  become  symbols 
for  the  close  state-society  relations  of  the  Japanese, 
but  in  the  meantime,  they  have  come  to  character¬ 
ize  a  number  of  late-developing  East  Asian  coun¬ 
tries.  In  Taiwan,  the  government  took  over  the 
farmers’  associations  that  the  Japanese  colonial 
regime  had  established  and  afterward  awarded  an 
official  quota  of  seats  to  functional  associations  in 
the  country’s — almost  powerless — National 
Assembly.  South  Korea  orchestrated  its  economic 
takeoff  by  means  of  government  bureaus  within  a 
system  of  state  corporatist  arrangements,  and 
China  is  about  to  top  all  previous  attempts  of  cor- 
porativist  control.  Corporativist  bodies  dating 
back  to  the  era  of  Mao  Zedong  have  been  revital¬ 
ized  as  centers  of  guidance  and  legitimacy  in  eco¬ 
nomic  and  social  matters.  New  associations  in 
fields  such  as  health,  sports,  culture,  social  welfare, 
or  science  have  been  established  on  the  govern¬ 
ment’s  own  initiative  and — for  the  politically  rele¬ 
vant — based  on  obligatory  membership. 

Apart  from  the  more  or  less  corporativist  and 
authoritarian  elements  that  can  be  located  in  cur¬ 
rent  developmental  states  in  Latin  America,  Asia, 
Southeastern  Europe,  or  Africa,  one  can  hardly 
find  countries  with  a  manifest  corporativist  consti¬ 
tutional  background — with  only  a  few  exceptions. 
Among  them  is  Croatia,  with  its  long-established 
corporativist  traditions  that  have  led  to  a  rather 
unique  constitutional  second  chamber  based  on 
the  corporative  representation  of  trade  unions, 
employers’  and  farmers’  organizations,  universities 
and  colleges,  craft  workers,  freelance  profession¬ 
als,  and  so  on.  Another  case  is  Elong  Kong,  having 
up  to  half  of  the  legislature  elected  by  functional 
constituencies  defined  by  professional  occupations 
or  economic  sectors  since  1985.  Research  on  these 
cases  suggests  that  strong  corporativist  regime  ele¬ 
ments  as  well  as  experiments  with  constitutional 
corporativism — other  than  neo-corporatist  meth¬ 
ods  of  policy  coordination — have  adverse  affects 
on  party  development  and  the  achievement  of 


universal  civil  rights  and  have  fostered  legislative 

fragmentation  through  particularistic  bargaining. 

Roland  Czada 
University  of  Osnabruek 
Osnabruek,  Germany 

See  also  Interest  Groups;  Neo-Corporatism;  Pluralism; 
Representation 

Further  Readings 

Althusius,  J.  (1964).  The  politics  of  Johannes  Althusins 
(F.  Carney,  Trans.).  Boston:  Beacon  Press.  (Original 
work  published  1603) 

Black,  A.  (1984).  Guilds  and  civil  society  in  European 
political  thought:  From  the  twelfth  century  to  the 
present.  Ithaca,  NY:  Cornell  University  Press. 

Harada,  T.  (1989).  Politische  okonomie  des  idealismus 
und  der  romantik:  Korporatismus  von  Fichte,  Muller 
und  Hegel  [Political  economy  of  idealism  and 
romanticism:  The  corporatism  of  Fichte,  Muller,  and 
Hegel].  Berlin,  Germany:  Duncker  &  Humblot. 

Linz,  J.  (2000).  Totalitarian  and  authoritarian  regimes. 
Boulder,  CO:  Lynne  Rienner. 

Malloy,  J.  M.  (Ed.).  (1977).  Authoritarianism  and 
corporatism  in  Latin  America.  Pittsburgh,  PA: 
University  of  Pittsburgh  Press. 

Marx,  K.  (1970).  Critique  of  Hegel’s  philosophy  of  right 
(J.  O’Malley,  Ed.).  Cambridge,  UK:  Cambridge 
University  Press.  (Original  work  published  1844) 

Moon,  P.  T.  (1921).  The  labor  problem  and  the  social 
Catholic  movement  in  France:  A  study  in  the  history 
of  social  politics.  New  York:  Macmillan. 

O’Donnell,  G.  A.  (1977).  Corporatism  and  the  question 
of  the  state.  In  J.  M.  Malloy  (Ed.),  Authoritarianism 
and  corporatism  in  Latin  America  (pp.  47-87). 
Pittsburgh,  PA:  University  of  Pittsburgh  Press. 

Unger,  J.,  &  Chan,  A.  (1995).  China,  corporatism,  and 
the  East  Asian  Model.  Australian  Journal  of  Chinese 
Affairs,  33,  29-53. 

Wiarda,  H.  J.  (1997).  Corporatism  and  comparative 
politics:  The  other  great  “ism”  (Comparative  Politics 
Series).  Armonk,  NY:  M.  E.  Sharpe. 


Correlation 


Correlation  is  a  statistical  measure  of  the  associa¬ 
tion  between  two  or  more  variables.  Two  or  more 
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variables  are  associated  if  they  change  together 
(covary).  In  statistics,  such  simultaneous  move¬ 
ment  is  called  dependence.  Thus,  correlation  is 
simply  an  indication  of  a  lack  of  independence 
between  variables.  Measures  of  correlation  can 
also  indicate  the  direction  of  the  relationship 
between  variables — that  is,  whether  it  is  positive  or 
negative.  In  addition,  while  correlation  alone  is 
insufficient  to  establish  a  causal  relationship,  it 
often  illuminates  the  potential  for  such.  Below,  the 
basic  properties  and  applications  of  current  corre¬ 
lation  measures  are  discussed. 

The  modern  statistical  concept  of  correlation 
was  initially  put  forward  by  Sir  Francis  Galton  in 
a  series  of  papers  on  heredity  in  the  late  1800s. 
Correlation  was  subsequently  refined  in  various 
works  by  Karl  Pearson  and  G.  Udny  Yule.  These 
initial  works  culminated  in  a  statistic  now  called 
the  Pearson  product-moment  correlation  coeffi¬ 
cient,  Pearson’s  r.  Pearson’s  r  is  perhaps  the  most 
widely  used  of  all  statistics  because  of  its  value  as 
an  indicator  of  correlation  and  its  relationship  to 
multivariate  analysis,  in  particular  linear  regres¬ 
sion.  However,  it  is  only  one  of  several  indicators 
of  statistical  correlation.  Today,  the  concept  of 
correlation  is  a  broad  term  that  comprises  a  host 
of  indices  used  to  measure  both  parametric  and 
nonparametric  association.  Of  the  latter, 
Kendall’s  t  and  t h,  Cramer’s  V,  and  Goodman 
and  Kruskal’s  y  are  especially  prevalent  in  applied 
statistics. 

Pearson’s  r  indicates  both  the  degree  and  direc¬ 
tion  of  linear  dependence  between  variables. 
Direction  in  a  linear  relationship  can  take  one  of 
two  forms:  positive  or  negative.  For  example,  take 
two  variables  X  and  Y.  If  high  scores  on  X  corre¬ 
spond  to  low  scores  on  Y,  there  is  a  negative  rela¬ 
tionship.  If  high  scores  on  X  correspond  to  high 
scores  on  Y,  there  is  a  positive  relationship. 
Similarly,  degree  measures  the  strength  of  a  rela¬ 
tionship  between  variables.  Such  a  relationship 
might  be  strong  or  weak,  where  a  strong  correla¬ 
tion  means  that  large  values  of  X  are  associated 
with  large  values  of  Y  and  vice  versa. 

Correlation  may  be  visualized  via  scatter- 
plots — that  is,  by  arranging  the  values  of  one  vari¬ 
able  on  one  axis  and  the  corresponding  values  of 
another  variable  on  another  axis.  In  scatterplots, 
points  falling  close  to  a  straight  line  imply  strong 
correlation,  whereas  a  cloud  of  points  are  more 


suggestive  of  a  weak  correlation  or  a  lack  of  one. 
Similarly,  if  most  points  are  in  the  bottom-left  and 
top-right  quadrants,  a  positive  relationship  is 
plausible,  whereas  if  most  points  are  in  the  top  left 
and  bottom  right,  a  negative  relationship  is  likely. 
A  scatterplot  may  also  detect  nonlinear  relation¬ 
ships  between  variables. 

The  Pearson  r  can  be  intuitively  derived  from  a 
standardization  of  the  covariance  between  vari¬ 
ables.  The  covariance  between  X  and  Y  is 

Z(X  -  X)(Y  -  Y) 

N 

where  N  is  the  sample  size,  and  X  and  Yare  the 
means  of  the  variables.  Covariance  combines  the 
deviations  computed  using  the  scores  from  each 
variable  and  accounts  for  the  sample  size  in  the 
denominator. 

However,  covariance  does  not  control  for  the 
potential  for  substantial  differences  in  the  amount 
of  deviation  within  each  variable.  Thus,  to  derive 
Pearson’s  r,  the  covariance  is  adjusted  by  the 
deviation  of  each  variable.  Simply  divide  the  cova¬ 
riance  by  the  product  of  standard  deviations  as 
given  below: 

Z(X  -  X)(Y  -  Y) 

_ N _ 

E(*  -  x)1 ,  /l(y  -  ’ 

N  V  N 

which,  through  some  basic  algebra,  becomes  the 
typical  formula  for  Pearson’s  r: 

I(X-Y)(Y-Y) 

yy(X-Y)2(Y-Y)2' 

Pearson’s  r  has  the  attractive  property  of  lying 
between  —1  and  1.  Positive  values  signify  positive 
relationships  and  negative  values  negative  ones. 
The  higher  the  absolute  value  of  a  measure,  the 
stronger  the  relationship.  Thus,  a  value  of  zero 
indicates  that  the  variables  are  linearly  indepen¬ 
dent  of  each  other.  A  correlation  of  zero,  however, 
does  not  indicate  that  the  variables  are  necessarily 
completely  independent.  The  relationship  must  be 
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linear  for  Pearson’s  r  to  be  an  appropriate  indica¬ 
tor  of  correlation.  Curvilinear  or  alternative 
relationship  forms  will  not  be  properly  detected 
with  Pearson’s  r.  In  addition,  caution  should  be 
used  when  applying  Pearson’s  r  to  small  sample 
sizes. 

Consider  Figure  1  (next  page)  that  presents 
scatterplots  of  a  strong  positive  and  a  weak 
negative  correlation  between  two  pairs  of  vari¬ 
ables  for  23  countries  that  belong  to  the 
Organisation  for  Economic  Co-operation  and 
Development  (OECD).  In  the  left  scatterplot,  the 
tight  grouping  of  countries’  (X,  Y)  coordinates 
along  the  best  fitting  line  suggests  a  strong  posi¬ 
tive  correlation  between  countries’  economic 
openness  and  growth  in  real  gross  domestic 
product  (GDP).  Pearson’s  r  is  .73,  which  con¬ 
firms  the  relationship  illustrated  in  the  scatter¬ 
plot.  In  the  right  scatterplot,  the  looser  cloud  of 
points  suggests  a  weaker  negative  correlation 
between  the  percentage  of  women  in  parliament 
and  the  percentage  of  right-wing  parties  in  cabi¬ 
net  posts.  In  this  case,  Pearson’s  r  is  —.51. 

Pearson’s  r  can  be  used  to  make  statistical  infer¬ 
ences  about  the  population  correlation  p.  Inference 
typically  assumes  a  bivariate  normal  distribution, 
such  that  the  relationship  between  X  and  Y  must 
be  linear,  and  X  and  Y  must  be  continuous.  Given 
a  Pearson’s  r  from  a  reasonably  large  sample,  the 
null  hypothesis  of  p  =  0  can  be  tested  with  a  t  dis¬ 
tribution,  such  that 


r\Jn  —  2 

,=  7 T^7' 

where  n  —  2  represents  the  degrees  of  freedom. 

The  formula  for  Pearson’s  r  is  both  simple  and 
versatile,  so  much  so  that  it  can  be  expanded  to 
relationships  between  variables  at  various  levels 
of  measurement.  As  a  particularly  useful  case, 
consider  the  relationship  between  two  ranked 
variables,  which  in  the  case  of  ties  between  vari¬ 
ables’  scores  can  also  be  computed  with  Pearson’s 
r.  When  there  are  no  ties,  the  formula  can  be  sim¬ 
plified  to  the  Spearman  rank-order  correlation 
coefficient: 


f; rank  1 


n(n2  —  1)  ’ 


where  d  =  X  —  Y  or  the  difference  in  the  ranked 
scores.  Testing  p  for  ranked  variables  is  also  per¬ 
formed  via  sampling  from  a  t  distribution  with 


\/(1  _  rLk)/{n  ~  2) 

and  degrees  of  freedom  n  —  2. 

Correlation  indices  can  detect  a  relationship 
between  variables  even  when  no  causal  link  exists 
between  them.  There  are  hosts  of  examples  where 
two  variables  are  perceived  to  correlate,  because  a 
third  variable  is  causing  them  both  to  change 
together.  This  is  called  a  spurious  relationship. 
Thus,  there  is  no  direct  causal  relationship  between 
the  two  variables,  but  there  is  a  common  depen¬ 
dence  on  the  third.  For  example,  when  consider¬ 
ing  the  correlation  of  partisanship  with  general 
attitudes  on  violence,  the  potential  for  a  spurious 
relationship  might  be  furthered  by  examining  the 
role  of  gender,  where  women  are  more  likely  to 
join  a  left-leaning  party  and  also  oppose  violence. 

Given  concerns  about  a  spurious  relationship, 
applied  researchers  have  taken  recourse  to  statisti¬ 
cal  control  with  partial  correlation.  Partial  corre¬ 
lation  measures  the  relationship  between  two 
variables  while  controlling  for  one  or  more 
variables.  For  example,  the  correlation  of  X 
with  Y  controlling  for  a  series  of  control  variables, 
Z,  —  {Zi, Z2, . . . ,  Zk},  can  be  estimated  as  the  cor¬ 
relation  between  the  set  of  residuals,  Rx  and  Ry, 
calculated  from  the  least  squares  regression  of  X 
on  Zj  and  of  Y  on  Z„  respectively. 

Assuming  that  a  single  Z  is  the  only  variable 
that  affects  X  and  Y,  the  partial  correlation  coef¬ 
ficient,  rxy.z,  can  be  simplified  to 


t”xy  .z 


Y xy  T xz^yz 


A  hypothesis  test  of  the  population  estimate,  p  , 
can  be  performed  with  a  t  test,  where 


1  —  r 

xy.z 


-Vn  -  3, 


where  Vn  —  3  is  the  degrees  of  freedom. 
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Figure  I  Scatterplots  of  a  Strong  Positive  Correlation  and  a  Weak  Negative  Correlation 

Source:  Data  taken  from  Armingeon,  K.,  Engler,  S.,  Gerber,  M.  Leimgruber,  P.,  &  Beyeler,  M.  (2009).  Comparative  political 
data  set  1960-2007.  Bern,  Switzerland:  Institute  of  Political  Science,  University  of  Bern. 


It  may  already  be  clear  that  correlation  is 
closely  related  to  multivariate  analysis  with  least 
squares  regression.  Pearson’s  r  essentially  mea¬ 
sures  the  direction  and  degree  to  which  observa¬ 
tions  fall  close  to  a  line,  which  indicates  a  rela¬ 
tionship  between  two  variables.  The  best  fitting 
line  in  such  a  relationship  is  the  least  squares 
regression  line. 

The  regression  line  is  arrived  at  by  minimizing 
the  squared  errors  with  the  function  Y  — 
a  +  bX  +  e  ,  where  a  and  b  are  constants  and  e  is 
the  residual.  Thus,  in  the  regression  context, 
Pearson’s  r  denotes  the  potential  for  a  stronger 
fit  in  the  regression  line.  With  a  strong  relation¬ 
ship,  the  sum  of  residuals  is  smaller,  and  the 
independent  variable,  X,  explains  more  of  the 
variance  in  the  dependent  variable,  Y.  In  fact, 
squaring  r,  we  get  the  coefficient  of  determina¬ 
tion,  r2,  in  a  regression  equation.  Therefore,  r 2  is 
an  estimate  of  the  amount  of  variance  in  Y 
explained  by  the  predicted  relationship  a  +  bX. 
Accordingly,  the  coefficient  of  nondetermina¬ 
tion,  1  —  r2,  indicates  the  amount  of  unexplained 
variance  in  the  model. 


To  make  the  connection  clear,  consider  an 
alternative  derivation  of  r2from  a  simple  linear 
regression;  r 2  is  a  ratio  of  the  expected  sum  of 
squares  to  the  total  sum  of  squares,  or 

X(Y-  Y)2' 

This  can  be  calculated  from  the  residuals  instead  of 
the  expected  squares  such  that  one  less  the  residual 
sum  of  squares  over  the  total  sum  of  squares  is  the 
coefficient  of  determination: 

^  = !  _  m  - 

HY-  Y)2' 

Because  r 1  indicates  the  ratio  of  explained  to  total 
variance,  it  can  be  considered  an  estimate  of  the  fit 
of  the  regression  model.  In  a  linear  regression 
model,  the  coefficient  of  determination  can  range 
from  0  to  1.  Returning  to  the  example  above,  on  the 
correlation  between  trade  openness  and  GDP  (see 
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Figure  1),  the  coefficient  of  determination  would  be 
simply  r 1  =  .732  =  .53.  In  a  regression  context,  this 
suggests  that  changes  in  trade  openness  explain 
about  5  3  %  of  the  variance  in  GDP  growth.  Likewise, 
according  to  the  coefficient  of  nondetermination, 
about  47%  of  the  variance  in  GDP  growth  is  unex¬ 
plained  by  this  simple  bivariate  relationship. 

Measures  of  correlation  may  also  be  nonpara- 
metric.  Nonparametric  correlation  applies  to  rela¬ 
tionships  where  the  distributional  assumptions 
above  (e.g.,  drawing  from  a  bivariate  normal  dis¬ 
tribution)  are  not  met.  Oftentimes,  this  is  the 
result  of  dealing  with  noncontinuous  variables — 
that  is,  ordinal,  nominal,  or  a  mix  of  measurement 
levels.  Similar  to  Spearman’s  rank-order  coeffi¬ 
cient  above,  Kendall’s  t  and  jh  are  performed  to 
test  the  correlation  between  ordinal  variables. 
Kendall’s  t  counts  all  pairs  of  observations  between 
two  variables  that  are  consistently  above  or  below 
the  rankings  of  correspondent  pairs  as  concor¬ 
dant,  or  C.  More  formally,  a  C  is  counted  every 
time  the  product  of  (X,  —  X;)(  Y,  —  Y;)  is  positive. 
Similarly,  all  pairs  that  are  inconsistently  above  or 
below  other  pairs  are  counted  as  discordant  and 
D.  Kendall’s  t  is  calculated  to  expose  the  differ¬ 
ence  between  discordant  and  concordant  pairs: 

=  2(C  -  D) 
n(n  —  1) 

Here,  t  ranges  from  —  1  to  1,  and  the  direction  of  the 
relationship  is  evident  from  the  formula  (C  —  D), 
such  that  negative  outcomes  suggest  a  negative  rela¬ 
tionship  and  vice  versa.  For  sample  sizes  greater 
than  10,  a  test  of  t  =  0  can  be  done  with  a  z  test: 


s/2(2n  +  5)  ' 

3 \Jn(n  -1) 

When  there  are  tied  ranks  in  the  data,  slight 
derivations  to  the  t  formula  given  above  can 
account  for  ties.  Kendall’s  and  Stuart’s  tc  require 
only  a  few  additional  terms  to  model  the  ranked 
ties.  In  the  former,  ties  among  the  X  values  are  col¬ 
lected  in  u,  which  enters  the  equation  as 

U  =  \  !«(«-!), 


while  ties  in  Y  are  collected  in  v,  which  similarly 
enters  the  equation  as 

V  =  \Yv{y-  1). 

In  the  latter,  m  is  the  smaller  number  of  ranked 
categories  in  X  and  Y.  Kendall’s  Th  is  then  a  calcu¬ 
lation  of  discrepancy  in  concordant  and  discordant 
pairs  while  controlling  for  the  number  of  ties  in 
each  variable: 

=  _ C-  D _ 

-  V 

Similarly,  Stuart’s  tc  takes  into  account  tied  ranks 
via  the  smaller  number  of  ranked  categories,  m: 

_  2 m(C  ~  D) 
rf{m  —  1) 

Note  that  when  there  are  many  ties,  both  of  the 
above  measures  will  slightly  underestimate  the  asso¬ 
ciation.  In  that  case,  Goodman  and  Kruskal’s  gamma 
does  a  better  job  of  correcting  for  several  ties: 

(C-D) 

7  (C  +  D)' 

When  there  are  no  ties,  this  equation  is  equivalent 
to  that  for  Kendall’s  t. 

The  relationship  between  multichotomous  vari¬ 
ables  can  also  be  explored  with  correlation  coeffi¬ 
cients.  Despite  the  prevalence  of  the  contingency 
coefficient,  C,  Cramer’s  V  is  a  more  practical  choice 
for  dealing  with  this  kind  of  data.  Consider  a  three- 
by-three  contingency  table  of  two  multichotomous 
variables:  partisanship  (Republican,  Independent, 
and  Democrat)  and  campaign  advertisement  expo¬ 
sure  (television,  Internet,  and  radio).  To  test  the 
independence  assumption  between  partisanship  and 
campaign  media,  one  applies  the  X  test.  However, 
the  X  test  only  provides  information  about  the 
existence  of  a  relationship,  not  the  strength  of  a 
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relationship.  To  acquire  the  degree  of  association 
between  the  variables  and  control  for  the  ratio  of 
rows  to  columns,  one  applies  Cramer’s  V: 


where  [min(r,  c)\  is  the  smaller  of  the  number  of 
rows  or  columns  in  the  contingency  table,  which 
provides  an  accurate  measure  of  association  for 
multichotomous  variables. 

The  correlation  statistics  presented  here  are  only 
a  sample  of  those  most  commonly  used  in  social  sci¬ 
ence.  Because  correlation  plays  a  central  role  in 
understanding  causal  and  spurious  relationships,  it 
has  garnered  a  great  deal  of  attention  in  the  100-plus 
years  since  its  inception.  In  modern  applied  statis¬ 
tics,  it  remains  one  of  the  foundational  concepts  and 
the  starting  point  for  most  practical  research. 

Dino  P.  Christenson 
Ohio  State  University 
Columbus,  Ohio,  United  States 

See  also  Quantitative  Methods,  Basic  Assumptions; 
Regression;  Statistics:  Overview 
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Correspondence  Analysis 

Correspondence  analysis  is  a  method  for  inter¬ 
preting  tabular  data  visually  in  the  form  of  spatial 


maps  in  which  the  rows  and  columns  of  the  table 
are  depicted  as  points.  Two  forms  of  the  method 
are  common:  simple-correspondence  analysis 
(CA)  and  multiple-correspondence  analysis 
(MCA).  Cross-tabulations  and  raw  categorical 
data  in  the  social  sciences  are  prime  examples  for 
being  visualized  by  CA.  Typically,  a  cross-tabula¬ 
tion  is  subjected  to  a  test  for  association  between 
the  row  and  column  variables  using  a  chi-square 
test,  for  example.  By  contrast,  CA  visualizes  the 
actual  structure  of  this  association,  whether  it  is 
statistically  significant  or  not.  MCA  generalizes 
this  method  to  many  variables,  usually  questions 
in  a  survey,  showing  how  the  response  categories 
interrelate.  In  this  entry,  each  approach  is 
explained  using  data  from  an  international  survey 
on  the  role  of  government. 

The  theory  underlying  CA  has  its  origins,  as 
early  as  the  1940s,  in  the  scaling  of  categorical 
variables  (i.e.,  assigning  numerical  scores  to  their 
category  levels)  to  achieve  an  objective  such  as 
maximizing  their  pairwise  correlations  or  maxi¬ 
mizing  between-row  or  between-column  variances. 
The  geometric  approach  along  with  its  name  is 
derived  from  the  French  term  analyse  des  corre- 
spondances,  developed  and  popularized  by  Jean- 
Paul  Benzecri  and  colleagues  starting  in  the  early 
1960s.  It  is  this  approach  that  is  presented  here. 

Simple  Correspondence  Analysis 

As  a  first  example,  consider  data  from  the 
International  Social  Survey  Programme’s  Role  of 
Government  survey  in  2006,  in  particular  a  ques¬ 
tion  concerning  government’s  contribution  to  the 
public  health  system.  Respondents  were  asked 
whether  government  should  pay  “much  more,” 
“more,”  “same  as  now,”  “less,”  or  “much  less”  for 
health  services,  and  the  responses  were  cross-tabu¬ 
lated  with  the  respondents’  interest  in  politics,  in 
five  levels  (from  1  =  very  much  interested  to  5  = 
not  at  all  interested ).  Table  1  shows  these  counts  for 
three  different  samples,  from  France  and  the  former 
West  and  East  Germany,  the  latter  still  being  kept 
separated  for  research  purposes.  In  this  table,  the 
response  categories  “less”  and  “much  less”  have 
been  combined  because  of  very  low  frequencies  of 
the  latter  response  to  the  question.  To  simplify  the 
present  analysis,  respondents  with  missing  values 
have  been  removed.  Missing  data  such  as  “can’t 
choose,”  “don’t  know,”  or  “refused  to  answer”  can 
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be  coded  as  an  additional  level,  either  combined  or 
separately,  but  these  nonsubstantive  responses  often 
dominate  the  results,  obscuring  the  interpretation  of 
the  substantive  responses  (Michael  Greenacre  & 
Jorg  Blasius,  2006). 

The  first  thing  to  note  about  CA  is  that  it  depicts 
the  relative  frequencies  of  response,  called  profiles. 

Table  1  Government  Spending  on  Health 

For  example,  the  profile  of  French  respondents  with 
a  high  interest  in  politics  (row  FI  of  Table  1)  is 
[73/229,  73/229,  65/229,  18/229J  -  [0.319,  0.319, 
0.284,  0.079J — hence,  a  profile  is  a  set  of  propor¬ 
tions  summing  to  1.  The  profiles  of  the  French 
sample  as  a  whole  and  the  two  German  samples,  as 
well  as  the  respondents  all  together,  called  average 

Country/ 

Much  More 

More 

Same 

Less/Much  Less 

Political  Interest 

Hel 

He2 

He3 

He45 

Sums 

FI 

73 

73 

65 

18 

229 

F2 

143 

237 

199 

53 

632 

France 

F3 

111 

204 

171 

37 

523 

F4 

45 

88 

95 

20 

248 

F5 

19 

27 

22 

1 

69 

DW1 

12 

38 

33 

14 

97 

DW2 

30 

88 

83 

20 

221 

West  Germany 

DW3 

82 

207 

159 

30 

478 

DW4 

37 

99 

44 

15 

195 

DW5 

17 

34 

22 

1 

74 

DEI 

9 

16 

13 

3 

41 

DE2 

22 

48 

28 

0 

98 

East  Germany 

DE3 

65 

112 

48 

3 

228 

DE4 

26 

68 

16 

4 

114 

DE5 

8 

23 

5 

2 

38 

F 

391 

629 

552 

129 

1701 

0.230 

0.370 

0.325 

0.076 

DW 

178 

4  66 

341 

80 

1065 

0.167 

0.438 

0.320 

0.075 

DE 

130 

267 

110 

12 

519 

0.250 

0.514 

0.212 

0.023 

ALL 

699 

1362 

1003 

221 

3285 

0.213 

0.415 

0.305 

0.067 

Source:  Created  by  the  author  Michael  Greenacre  using  data  from  International  Social  Survey  Programme,  Role  of 
Government  IV  (2006).  Distributor:  GESIS  Cologne  Germany,  ZA  No.  4700. 

Notes:  Frequencies  of  responses  to  the  question  “How  much  should  government  pay  for  health  services?”  are  cross-tabulated 
by  political  interest  (from  1  =  very  much  interested  to  5  =  not  at  all  interested)-,  samples  are  from  France  (N  =  1701),  the 
former  West  Germany  (N  =  1065),  and  the  former  East  Germany  (N  =  519).  Country  totals  and  overall  total,  with 
corresponding  relative  frequencies,  are  given  in  the  last  rows.  He  =  public  health;  F  =  French;  DW  =  former  West  Germany; 
DE  =  former  East  Germany. 
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profiles ,  are  given  in  the  last  rows  of  Table  1  in 
bold  italics,  and  the  data  for  group  FI  can  be  com¬ 
pared  with  these  averages.  Thus,  compared  with  all 
French,  those  with  a  high  interest  in  politics  are 
proportionally  higher  in  number  in  saying  that 
government  should  spend  much  more  on  health 
(0.319  compared  with  0.230);  and  they  are  pro¬ 
portionally  fewer  in  saying  that  they  should  spend 
the  same  as  now  (0.284  compared  with  0.325). 
This  pattern  is  similar  when  compared  with  the 
total  sample. 

All  the  comparisons  of  the  profiles  with  one 
another,  and  by  implication  with  the  averages  as 


well,  can  be  depicted  optimally  in  one  CA  map, 
shown  in  Figure  1.  A  distance  function,  called  the 
chi-square  distance,  is  defined  between  profiles, 
and  an  approximate  map  of  these  distances  is 
drawn.  The  chi-square  distance  recognizes  that 
differences  between  high  relative  frequencies  are 
inherently  greater  than  those  between  low  ones 
and  adjusts  for  this  disparity  using  the  average  of 
each  response  category  (given  in  the  last  row  of 
Table  1).  For  example,  the  chi-square  distance 
between  the  first  two  rows,  FI  and  F2,  of  Table  1, 
with  profiles  [0.319,  0.319,  0.284,  0.079J  and 
[0.226,  0.375,  0.315,  0.084J  is 


/ (0.319  -  0.226)2 
0.213 


(0.319  -  0.375)  (0.284  -  0.315)“ 


0.415 


0.305 


+ 


(0.079  -  0.084)2 
0.067 


=  0.227. 
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Figure  I  Correspondence  Analysis  Map  of  the  Cross-Tabulation  in  Table  I. 

Source:  Created  by  the  author  Michael  Greenacre  using  data  from  International  Social  Survey  Programme,  Role  of 
Government  IV  (2006).  Distributor:  GESIS  Cologne  Germany,  ZA  No.  4700. 

Notes:  The  average  profiles  for  F  =  France,  DW  =  former  West  Germany,  and  DE  =  former  East  Germany  are  shown  as 
supplementary  points.  The  overall  profile  (last  row  of  Table  1)  is  the  center  of  the  map. 
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The  smaller  squared  difference  (0.079  —  0.084)2 
between  the  fourth  category  is  increased  by  divid¬ 
ing  it  by  the  smaller  average  value  of  0.067,  com¬ 
pared  with  the  larger  squared  differences  in  the 
other  categories,  which  are  divided  by  their  respec¬ 
tively  larger  averages.  It  is  this  distance  that  the 
map  of  Figure  1  attempts  to  emulate  in  the  posi¬ 
tioning  of  the  points  FI  and  F2.  The  points  F,  DW, 
and  DE  in  Figure  1,  depicting  the  average  profiles 
in  each  country,  are  called  supplementary  points : 
They  are  not  included  in  the  CA  when  computing 
the  map  but  are  added  to  the  map  afterward.  The 
point  F,  for  example,  is  the  weighted  average  of  the 
five  points  FI,  F2, .  .  .,  F5,  where  these  five  politi¬ 
cal  interest  points  are  weighted  proportionally  to 
their  respective  frequencies  in  the  French  sample. 

Apart  from  the  row  profiles  and  their  averages, 
the  response  categories  of  the  spending  question 
(column  points)  are  visualized  in  the  same  map, 
and  this  provides  the  basic  interpretation  of  the 
different  regions  of  the  map.  Hence,  the  more  to 
the  right  of  the  map,  the  more  frequent  is  the  atti¬ 
tude  toward  more  spending,  with  the  upper  quad¬ 
rant  corresponding  to  “more”  and  the  lower  quad¬ 
rant  to  “much  more,”  while  the  more  to  the  left  of 
the  map,  the  more  frequent  the  attitude  is  toward 
spending  the  “same”  or  “less/much  less.”  The  cen¬ 
ter  of  the  map  corresponds  to  the  average  over  all 
the  samples  studied,  so  by  “more  frequent”  we 
really  mean  “more  frequent  compared  with  the 
average.”  Individual  profile  points  can  be  com¬ 
pared:  For  example,  the  point  FI  corresponding  to 
the  group  described  above  is  the  one  that  is  the 
most  in  the  direction  of  “much  more”  spending — 
it  can  be  verified  that  there  is  no  other  row  in  the 
table  that  has  a  higher  proportion  of  respondents 
having  this  opinion.  The  map  allows  an  easy 
comparison  of  each  of  the  five  levels  of  political 
interest  in  each  country  as  well  as  an  overall  com¬ 
parison  between  the  countries.  East  German  opin¬ 
ion  is  well  to  the  side  of  spending  “more”  and 
“much  more,”  as  opposed  to  West  German  opin¬ 
ion  on  the  other  side,  along  with  the  French.  The 
French  average,  however,  is  lower  down — all  five 
French  points  are  lower  negative  on  the  vertical 
dimension  on  the  side  of  “much  more.”  Of  the  five 
levels  of  political  interest,  the  high  level  shows  the 
greatest  differences  between  the  countries,  with  FI 
the  most  toward  “much  more,”  DW1  the  most 
toward  “less/much  less,”  and  DEI  in  between.  To 


illustrate  further  the  style  of  interpretation  of  these 
maps,  note  that  the  East  German  groups  DE4  and 
DE5  (low  levels  of  political  interest)  must  have 
higher  proportions  of  opinions  toward  “more” 
spending  than  the  West  German  group  DW4,  for 
example.  It  is  not  the  proximity  of  the  groups  to 
the  response  category  that  indicates  a  higher  pro¬ 
pensity  toward  that  category  but  rather  how  far 
out  from  the  average  (the  center)  the  points  are  in 
the  direction  indicated  by  the  response  category — 
“more”  in  this  case. 

CA  aims  to  visualize  the  relationships  between 
the  rows  and  columns  of  a  cross-tabulation.  A 
separate  issue  is  to  test  whether  this  association  is 
statistically  significant  or  not.  Chi-square  statistics, 
for  example,  can  be  computed  for  the  three  5X3 
tables  in  Table  1  and  are  equal  to  22.5  (p  —  .033), 
28.0  (p  =  .006),  and  24.1  (p  —  .020),  respectively. 
For  the  entire  15  X  3  table,  chi-square  is  equal  to 
142.9  (p  <  .0001);  so  from  a  statistical  viewpoint, 
we  are  looking  at  significant  differences  both 
within  each  country  and  among  them.  In  CA,  the 
chi-square  statistic  divided  by  the  grand  total  of  the 
table  is  called  the  inertia,  a  dimensionless  quantity 
that  measures  the  total  variation  in  the  data:  In  this 
case,  the  inertia  is  equal  to  142.9/3285  =  0.0435. 
This  inertia  is  decomposed  along  the  axes  of  the 
CA  solution  in  a  similar  way  to  the  variance 
decomposition  in  principal  component  analysis. 
These  parts  of  inertia  are  indicated  on  the  axes  of 
Figure  1,  as  also  their  percentages  of  the  total  iner¬ 
tia:  0.0263  (60.6%)  and  0.0114  (26.1%),  so  that 
86.7%  of  the  inertia  is  accounted  for,  while  13.3% 
is  not  shown  in  the  two-dimensional  solution. 

In  summary,  correspondence  analysis  of  a  cross¬ 
tabulation  maps  the  association  (measured  by  the 
inertia)  between  the  rows  and  the  columns  of  a 
cross-tabulation  into  a  low-dimensional  solution, 
facilitating  the  interpretation  of  this  association.  The 
method  maximizes  the  inertia  represented  in  the 
chosen  solution,  which  is  usually  two-dimensional. 

Multiple  Correspondence  Analysis 

The  simple  form  of  CA  described  above  is  appli¬ 
cable  to  a  single  cross-tabulation,  visualizing  the 
relationship  between  two  categorical  variables.  In 
MCA,  there  are  typically  several  variables  describ¬ 
ing  a  particular  phenomenon  of  interest,  and  the 
method’s  objective  is  to  visualize  how  all  these 
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variables  interrelate  among  each  other.  For  exam¬ 
ple,  consider  not  one  but  several  questions  about 
government  spending:  in  addition  to  spending  on 
public  health  (abbreviated  as  He),  spending  on 
environmental  protection  (En),  law  enforcement 
(La),  education  (Ed),  defense  (De),  retirement  pen¬ 
sions  (Re),  unemployment  benefits  (Un),  and  cul¬ 
ture  (Cu).  There  are  eight  questions  in  all,  and 
each  has  the  same  five  response  categories  (here 
we  can  keep  the  categories  “less”  and  “much  less” 
separated,  because  for  some  of  the  questions, 
there  are  quite  high  frequencies  of  the  response 
“much  less”).  After  the  respondents  with  missing 
values  on  any  of  the  questions  have  been  removed, 
we  have  a  total  sample  of  2,913  respondents: 
1,486  French,  957  West  Germans,  and  470  East 
Germans. 

There  are  several  equivalent  ways  to  define 
MCA:  Here,  we  describe  the  method  as  an  analysis 
of  dummy  variables  for  all  the  response  categories. 
The  original  2913  X  8  matrix  of  responses  can  be 
converted  into  a  2913  X  40  indicator  matrix  of 
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dummy  variables,  where  there  are  five  columns  per 
question  for  its  five  response  categories.  The 
responses  are  coded  as  0s  and  Is,  with  a  1  indicat¬ 
ing  the  response  category:  For  example,  the 
response  “more,”  which  is  the  second  response 
category,  is  coded  as  0  1  0  0  0.  MCA  is  the  appli¬ 
cation  of  CA  to  this  indicator  matrix,  leading  to 
points  for  all  categories  of  response  and  for  all 
respondents,  as  shown  in  separate  maps  (A  and  B) 
in  Figure  2.  The  method  then  visualizes  the  asso¬ 
ciations  between  these  categories  across  the  respon¬ 
dents:  As  in  simple  CA,  categories  that  co-occur 
much  more  frequently  than  one  would  expect  from 
their  marginal  frequencies  will  be  grouped  together, 
while  those  that  have  fewer  than  expected  respon¬ 
dents  in  common  will  be  placed  far  apart. 

Correspondingly,  respondents  with  similar 
response  patterns  will  be  located  close  to  one 
another,  and  ones  with  different  patterns  will  be 
located  far  apart.  The  substantive  questions  are 
again  used  to  interpret  the  space — thus,  it  can  be 
seen  that  the  more  the  respondents  (or  groups  of 


Figure  2  MCA  of  Responses  to  Eight  Questions  on  Government  Spending 

Source:  Created  by  the  author  Michael  Greenacre  using  data  from  International  Social  Survey  Programme  (ISSP),  Role  of 
Government  IV  (2006).  Distributor:  GESIS  Cologne  Germany,  ZA  No.  4700. 

Notes:  Responses  were  obtained  from  2,913  respondents  from  France  and  Germany  (ISSP,  2006).  Map  A  shows  the 
respondents,  and  Map  B  shows  the  corresponding  map  of  the  response  categories.  MCA  =  multiple-correspondence  analysis; 
He  =  environmental  protection;  En  =  environmental  protection;  La  =  law  enforcement;  Ed  =  education;  De  =  defense;  Re  = 
retirement  pensions;  Un  =  unemployment  benefits;  Cu  =  culture. 
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respondents)  are  located  up  on  the  vertical  axis, 
the  less  they  advocate  government  spending,  and 
the  more  they  are  to  the  left  on  the  horizontal  axis, 
the  more  they  advocate  spending.  The  most 
extreme  views  in  these  two  directions  are  “spend¬ 
ing  much  less  on  education”  (point  Ed5)  and 
“spending  much  more  on  defense”  (Del).  The 
dense  cloud  of  respondent  points  at  the  lower 
right,  however,  shows  that  the  majority  of 
responses  are  at  or  near  the  middle-response  cate¬ 
gories  “same  as  now.”  If  respondents  tend  in  the 
opposite  direction  toward  the  upper  left,  this 
would  reflect  a  type  of  polarization  in  their 
response  pattern,  combining  the  two  attitudes  cap¬ 
tured  by  the  two  axes.  For  example,  the  position  of 
La5,  which  is  pulled  somewhat  into  this  quadrant, 
away  from  the  vertical  spread  of  the  other  “spend 
much  less”  categories  toward  the  “spend  much 
more”  categories,  implies  that  there  are  several 
respondents  who — apart  from  advocating  much 
less  spending  on  law  enforcement — are  in  favor  of 
more  spending  on  several  of  the  other  aspects. 
Another  way  of  thinking  about  the  correspon¬ 
dence  between  response  categories  and  the  respon¬ 
dents  is  that  the  position  of  each  response  category 
is  the  average  of  the  respondents  who  gave  that 
particular  response.  The  relative  positions  of  the 
categories  thus  reflect  associations  in  the  responses: 
For  example,  the  category  Edl  (spend  much  more 
on  education)  is  much  closer  to  the  bunch  of  mid¬ 
dle  responses  than  Ed5  (spend  much  less  on  educa¬ 
tion),  showing  that  spending  much  more  on  educa¬ 
tion  is  not  an  extreme  view  but  associated  with  an 
attitude  generally  associated  with  middle  responses, 
while  spending  much  less  on  education  is  an 
extreme  attitude. 

In  the  same  way,  a  supplementary  categorical 
variable,  such  as  the  country  or  level  of  political 
interest,  can  be  added  to  the  map  afterward  to  see 
if  it  has  any  connection  with  the  configuration  of 
response  category  associations.  Figure  3  shows  the 
positions  of  the  countries,  genders,  levels  of  politi¬ 
cal  interest,  and  education  levels  as  mean  positions 
of  the  respondents  in  the  corresponding  category. 
In  other  words,  the  point  m  shows  the  average  of 
the  1,538  male  respondent  points  in  Figure  2  and 
f  the  average  of  the  1,375  female  respondents — so 
males  are  on  average  more  in  favor  of  spending 
less,  and  females  on  average  are  more  in  favor 
of  spending  more.  Interestingly,  the  highest  and 


Figure  3  Average  Positions  of  Groups  of 

Respondents  in  Correspondence  Analysis 
Map  of  Figure  2 

Source:  Created  by  author  Michael  Greenacre  using  data 
from  International  Social  Survey  Programme,  Role  of 
Government  IV  (2006).  Distributor:  GESIS  Cologne 
Germany,  ZA  No.  4700.. 

Notes:  F  =  France,  DW  =  former  West  Germany,  DE  = 
former  East  Germany;  m  =  male,  f  =  female;  II,  12,  ...  , 
15  =  interest  in  politics,  from  high  to  low;  E0,  El,  .  .  .  , 

E5  =  education  level,  from  low  to  high. 


lowest  categories  of  political  interest  fall  close  to 
each  other,  together  with  the  lower  categories  of 
education,  all  on  the  left  side  of  the  map  toward 
more  spending. 

Comparing  the  scale  of  Figure  3  with  that  of 
Figure  2,  one  can  see  that  these  differences  are,  in 
fact,  quite  small  and  possibly  need  to  be  qualified 
by  some  type  of  hypothesis  test  if  the  statistical 
significance  of  the  findings  is  sought. 

In  summary,  MCA  analyzes  respondent-level 
data  on  several  categorical  variables  simultane¬ 
ously  in  one  spatial  map.  Although  individual 
respondent  points  exist,  it  is  the  response  category 
points  that  are  of  main  interest.  These  category 
points  are  positioned  to  reflect  optimally  their 
associations  with  one  another  across  the  respon¬ 
dents.  Additional  categorical  variables  that  clas¬ 
sify  the  respondents  into  groups  can  be  displayed 
as  supplementary  points  after  the  map  is  con¬ 
structed,  to  assist  in  the  interpretation — these  are 
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the  average  positions  of  the  respondents  in  the 

respective  categories. 

Michael  Greenacre 
Universitat  Pompeu  Fabra 
Barcelona,  Spain 
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The  term  corruption  is  used  with  reference  not  only 
to  politics  and  to  public  administration  but  also  to 
personal  life  and  to  business.  It  may  refer  to  the  per¬ 
version  of  any  accepted  standard.  Common  usage 
also  includes  the  corruption  of  language,  as  with  the 
corruption — that  is,  unauthorized  alteration — of  a 
written  text.  The  first  section  of  this  entry  examines 
the  main  problems  of  defining  the  concept  of  corrup¬ 
tion.  A  typology  of  political  corruption  and  its  pat¬ 
terns  are  discussed  in  the  second  and  third  sections; 
measures,  explanations,  effects,  and  possible  cures 
for  corruption  are  presented  in  the  subsequent  sec¬ 
tions;  and  academic  analyses  versus  practical  actions 
is  the  final  topic  analyzed. 

Problems  of  Definition 

Moral  corruption  refers  to  the  state  of  an  individ¬ 
ual  or  an  entire  society  in  which  virtues  have  been 
lost  and  citizens  have  become  depraved  and  ruined 


by  the  pursuit  of  luxury  and  gratification.  For 
example,  what  were  seen  as  corrupt  practices 
within  the  Roman  Catholic  Church  led  to 
Protestantism.  Also,  the  ideals  of  republican  virtue 
and  the  warning  lessons  of  the  moral  corruption 
that  had  led  to  the  fall  of  the  Roman  Empire  were 
much  in  the  minds  of  the  framers  of  the  constitu¬ 
tion  of  the  United  States. 

In  the  business  sphere,  corruption  is  said  to  occur 
when  a  person  betrays  trust  and  misuses  knowledge 
or  authority  for  personal  gain,  for  example,  if  a 
member  of  a  firm’s  staff  sells  trade  secrets. 

Political  corruption  involves  the  perversion  of 
accepted  standards  of  behavior  in  political  life.  In 
particular,  it  may  be  defined  as  the  misuse  of  pub¬ 
lic  office  or  authority  for  unauthorized  private 
gain.  Nevertheless,  this  and  other  definitions  need 
to  be  treated  with  caution  since  the  meaning  of 
“political  corruption”  is  itself  a  complex  matter. 
The  difficulty  of  finding  a  clear  and  agreed-on 
definition  poses  problems  both  for  political  scien¬ 
tists  seeking  to  measure  and  compare  corruption  in 
different  countries  and  for  those  responsible  for 
drawing  up  anticorruption  laws. 

First,  not  all  forms  of  misbehavior  by  those  in 
positions  of  public  trust  constitute  corruption.  A 
politician  who  betrays  secret  information  to  an 
enemy  country  for  ideological  reasons  is  guilty  of 
treachery  but  not  corruption  since  he  or  she  acts 
from  an  ideological  motive  and  not  for  monetary 
gain.  If  that  same  politician  reveals  the  secret  infor¬ 
mation  for  payment,  the  action  can  then  be  charac¬ 
terized  as  corrupt  as  well  as  treacherous.  Political 
corruption  usually  involves  the  pursuit  of  some  form 
of  personal  gain,  particularly  material  advantage. 

Second,  the  mere  fact  that  politicians  or  public 
officials  derive  material  benefits  from  their  posi¬ 
tions  does  not  automatically  mean  that  they  are 
corrupt.  It  is  accepted  that  they  are  entitled  to 
salaries  and  benefits  for  their  work.  It  is  only 
when  they  accept  some  unauthorized  or  illegal 
benefit  that  corruption  is  involved.  However,  it  is 
not  always  clear  which  benefits  a  politician  is  or  is 
not  permitted  to  enjoy.  For  example,  a  major 
scandal  in  the  United  Kingdom  (UK)  ensued  in 
2009  from  revelations  about  allowances  used  by  a 
large  number  of  members  of  parliament.  In  many 
of  these  cases,  the  parliamentarians  had  been  per¬ 
mitted  by  the  parliamentary  authorities  to  make 
expense  claims,  which  then  were  publicly  con¬ 
demned  as  unseemly  when  they  were  revealed.  In 
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only  a  few  cases  were  the  claims  considered 
strictly  illegal. 

Third,  and  connected  with  the  previous  point, 
illegality  is  not  the  only  criterion  whereby  an  activ¬ 
ity  or  payment  may  be  judged  to  be  corrupt. 
Especially  during  times  of  changes  in  public  atti¬ 
tudes  about  expected  standards  of  behavior  of 
politicians,  the  law  may  itself  be  deemed  to  be  cor¬ 
rupt.  This  is  because  some  may  see  existing  laws  as 
contrary  to  what  they  feel  are  proper  standards  of 
morality  and  may  wish,  therefore,  to  alter  the  law. 

Not  only  are  expected  standards  of  behavior 
subject  to  change,  but  also  the  expectations  of  the 
political  class  may  differ  from  those  of  ordinary 
electors.  In  countries  ruled  by  occupying  or  colo¬ 
nial  authorities,  there  may  be  differences  between 
the  standards  imposed  by  the  authorities  and  the 
traditional  standards  within  the  society.  Sometimes 
the  clash  of  cultures  is  seen  in  the  way  officeholders 
treat  members  of  their  extended  families  or  tribes. 
According  to  laws  and  rules  drawn  up  by  colonial 
governments  or  those  enacted  under  pressure  from 
international  development  aid  agencies,  politicians 
and  officials  must  not  show  favoritism.  By  con¬ 
trast,  traditional  social  mores  may  oblige  them  to 
reserve  jobs  in  the  public  service  for  their  relatives. 
(Edward  C.  Banfield  has  referred  to  this  traditional 
view  as  “amoral  familism.”  Apparently,  this  has 
been  an  aspect  of  politics  in  Sicily.  In  Albania,  tra¬ 
ditional  familial  obligations  relating  to  honor  kill¬ 
ings  and  other  matters  are  set  out  in  the  Kanun 
[canon]  of  Lek  Dukagjin,  a  code  dating  from  the 
1 5th  century.  By  using  the  term  amoral,  Banfield  is 
reflecting  a  Western  view  of  a  local  culture.) 

Fourth,  the  term  corruption  often  refers  not 
only  to  misconduct  by  those  already  holding  elec¬ 
tive  office  but  also  to  abuses  relating  to  the  elec¬ 
toral  process  whereby  such  office  is  won. 

Fifth,  though  corruption  is  used  primarily  to 
mean  the  unsanctioned  use  of  public  office  for 
personal  gain,  it  may  apply  too  when  the  intended 
gain  is  not  strictly  “personal”  but  for  an  official’s 
political  party.  For  example,  in  a  case  in 
Philadelphia  where  lawyers  paid  bribes  to  obtain 
appointments  as  judges,  the  money  went  into  the 
coffers  of  the  ruling  party  and  was  not  used  for  the 
direct  financial  benefit  of  the  bribe  takers. 

If  it  is  accepted  that  corruption  is  involved  when 
the  gain  goes  to  a  group  (such  as  a  political  party) 
as  well  as  to  a  politician  or  to  the  close  family  mem¬ 
bers  of  a  politician,  a  further  definitional  problem 


arises.  For  it  is  a  part  of  normal  politics  that  politi¬ 
cians  represent  groups  and  not  society  as  a  whole. 
In  a  system  where  voters  in  a  particular  geographi¬ 
cal  area  elect  a  legislator,  it  is  expected  that  he  or 
she  then  will  try  to  represent  their  local  interests. 
Gaining  a  material  benefit  for  a  locality  is  a  normal 
aspect  of  pork-barrel  politics;  gaining  a  material 
benefit  for  a  political  party  is  a  corrupt  practice. 

Sixth,  corruption  need  not  involve  any  favor  by 
the  bribe  taker  beyond  a  disclosure  of  unpublished 
information.  Where  the  government  is  about  to 
make  an  announcement  that  will  affect  the  value  of 
the  shares  of  a  company  or  sector  of  the  stock  mar¬ 
ket,  prior  notice  enables  favored  persons  to  buy  and 
sell  shares  to  their  advantage.  Also,  where  a  local 
government  authority  is  about  to  make  a  decision 
about  the  route  of  a  new  road  or  rail  line,  a  tip-off 
to  party  supporters  will  permit  them  to  make  prop¬ 
erty  purchases  or  sales  guaranteed  to  be  to  their 
advantage  once  the  decision  has  been  announced. 

Seventh,  in  cases  of  bribery,  does  the  corruption 
consist  in  the  taking  of  a  bribe  or  in  the  giving  of 
an  unauthorized  favor  in  return  for  the  bribe?  If  a 
judge  accepts  a  bribe  from  a  defendant  but  then 
finds  the  defendant  guilty  as  charged,  and  the  bribe 
did  not  affect  the  judge’s  decision,  is  the  judge 
guilty  of  corruption? 

Eighth,  one  defining  test  for  whether  an  act  is  or 
is  not  corrupt  is  transparency.  Typically,  unaccept¬ 
able  transactions  must  be  conducted  in  secret.  A 
judge  who  accepts  a  bribe  will  be  foolish  to  boast 
about  it.  However,  not  all  corrupt  acts  are  secret. 
Some  politicians  are  so  powerful  that  they  feel  able 
to  flaunt  their  privileges.  The  Romanian  dictator 
Nicolae  Ceauce§cu  (1918-1989)  felt  so  confident 
in  his  immunity  from  prosecution  that  he  built 
huge  palaces  for  himself. 

Ninth,  there  are  differences  of  usage  between 
scholars  as  to  whether  “political”  corruption  includes 
the  acts  of  civil  servants  and  other  public  employees 
or  whether  it  should  be  restricted  to  elected  officials 
and  those  who  owe  their  jobs  to  political  influence. 

Types  of  Political  Corruption 

It  is  convenient,  in  the  first  place,  to  categorize  cor¬ 
ruption  by  the  position  held  by  the  corrupted  per¬ 
son,  as,  for  example,  with  the  following: 

Legislative  corruption 

Judicial  corruption 
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Police  corruption 

Administrative  corruption 

Corruption  of  local  and  national  political 
executives  (presidents,  prime  ministers,  ministers, 
and  mayors) 

When  there  is  corruption  in  obtaining  office,  it  can 
be  classified  as 

Electoral  corruption 

A  second  categorization  is  by  the  type  of  action 
involved.  According  to  the  U4  Anti-Corruption 
Resource  Centre  group  of  development  agencies,  it 
is  convenient  to  distinguish  between  corrupt 
actions  by  those  already  in  power  aimed  at  gaining 
unauthorized  benefits  (“corrupt  accumulation  and 
extraction”)  and  corrupt  actions  aimed  at  winning 
and  retaining  political  office  (“corrupt  means  of 
power  preservation”).  Corrupt  accumulation  and 
extraction  includes  the  following: 

Bribes,  “commissions,”  and  fees  taken  from  private 
sector  businesses 

Undue  extraction  through  taxation  and  customs 
Fraud  and  economic  crime 
Politically  created  rent-seeking  opportunities 
Politically  created  market  favors  benefiting 


businesses  owned  by  political  elites 

Off-budget  transfers  and  manipulated  processes  of 
privatization 

Extorting  party  and  campaign  funding  from  the 
state,  private  sector,  and  voters 

Corrupt  means  of  power  preservation  include 
actions  such  as  the  following: 

Buying  political  support  and  majorities  from  other 
parties  and  politicians 

Co-optation  and  maintenance  of  patron-client 
networks 

Buying  decisions  from  parliament,  judiciary,  and 
control  and  oversight  bodies 

Favoritism  and  patronage  in  allocation  of 
government  resources 

Buying  voters  and  votes  and  electoral  fraud 
Use  of  public  money  for  political  campaigns 
Buying  off  the  media  and  civil  society 

A  third  categorization  distinguishes  between 
national  (high-level)  grand  corruption  and  local 
(low-level  or  petty)  corruption.  Examples  of  petty 
corruption  are  small  payments  demanded  by  offi¬ 
cials  to  avoid  delays  in  processing  requests  for 
licenses,  passports,  and  other  routine  documents 
(“speed  money”)  or  small  bribes  demanded  by 


Table  I  Money  Allegedly  Embezzled  by  Leading  Politicians 


Head  of  Government 

Mohamed  Suharto,  President  of  Indonesia,  1967-1998 
Ferdinand  Marcos,  President  of  Philippines,  1972-1986 
Mobutu  Sese  Seko,  President  of  Zaire,  1965-1997 
Sani  Abacha,  President  of  Nigeria,  1993-1998 
Slobodan  Milosevic,  President  of  Serbia/Yugoslavia, 
1989-2000 


Estimates  of  Funds  Allegedly  Embezzled  (in  U.S.  dollars) 
15  billion  to  35  billion 
5  billion  to  10  billion 
5  billion 

2  billion  to  5  billion 
1  billion 


Jean-Claude  Duvalier,  President  of  Haiti,  1971-1986  300  to  800  million 

Alberto  Fujimori,  President  of  Peru,  1990-2000  600  million 


Source:  Global  Corruption  Report  2004,  Transparency  International.  Retrieved  November  9,  2010,  from  http://www.u4.no/ 
themes/political-corruption/introduction.cfm 

Note:  Estimates  of  funds  allegedly  embezzled  are  drawn  from  publicly  available  sources.  The  estimates  do  not  reflect  any  legal 
developments  that  have  occurred  since  2003. 
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traffic  police  or  customs  officials.  Grand  corrup¬ 
tion  or  “kleptocracy”  is  the  demand  for  huge 
payments  (typically,  from  those  seeking  major 
government  contracts)  for  top  politicians. 

In  its  Global  Corruption  Report  2004,  the  anti¬ 
corruption  organization  Transparency  International 
gave  estimates  of  the  money  allegedly  embezzled  by 
some  leading  politicians  (see  Table  1). 

Patterns  of  Corruption 

Corruption  not  only  takes  many  forms  and  exists 
in  different  parts  of  a  political  system,  but  it  is  usu¬ 
ally  clandestine  and  is  thus  hard  to  measure.  Thus, 
any  attempt  to  assess  which  countries  are  most 
affected  and  which  parts  of  their  political  systems 
are  especially  corruption-prone  is  problematic. 

The  following  activities  frequently  feature  in 
corruption  scandals  in  many  countries. 

Arms  Sales 

Contracts  for  the  purchase  of  guns,  military 
aircraft,  and  other  forms  of  defense  equipment 
may  involve  sums  of  money  amounting  to  billions 
of  U.S.  dollars.  Because  the  competition  for  such 
contracts  is  so  fierce  and  the  stakes  are  so  high  for 
the  providers  of  the  equipment,  massive  bribes 
may  be  given  to  secure  them.  Some  of  the  most 
spectacular  corruption  cases  of  recent  decades 
have  involved  the  arms  trade.  The  Lockheed 
Scandal  of  the  1970s  involved  bribes  by  the 
United  States  aircraft  company  to  the  then 
Japanese  prime  minister  Kakuei  Tanaka  (who  was 
sentenced  to  4  years’  imprisonment  but  remained 
free  on  appeal  at  the  time  of  his  death  in  1993); 
Prince  Bernhard  of  the  Netherlands;  and  to  lead¬ 
ing  political  figures  in  Italy,  West  Germany,  and 
elsewhere.  The  Bofors  scandal  of  the  1980s  con¬ 
cerned  bribes  reportedly  paid  by  the  Swedish  arms 
manufacturer  to  the  Indian  premier  Rajiv  Gandhi 
to  win  a  contract  for  howitzer  guns.  The 
Al-Yamama  deals  involved  contracts  for  military 
aircraft  between  the  British  manufacturer  British 
Aerospace  (BAE  Systems)  and  Saudi  Arabia. 
These  occurred  between  1985  and  2006.  Following 
press  and  television  reports  of  corrupt  payments 
to  members  of  the  Saudi  royal  family,  there  was 
an  inquiry  by  the  U.K.  Serious  Fraud  Office. 
However,  this  was  dropped  in  December  2006 


under  pressure  from  the  Saudi  authorities.  The 
U.K.  attorney  general  announced  that  the  investi¬ 
gation  was  being  halted  on  national  interest 
grounds.  The  Agusta-Dassault  Scandal  led  in 
1995  to  the  resignation  of  Belgium’s  former  for¬ 
eign  minister  as  secretary-general  of  the  North 
Atlantic  Treaty  Organization  and  to  his  later  con¬ 
viction.  Belgium’s  former  deputy  premier  was 
assassinated.  The  scandal  involved  the  purchase 
of  military  helicopters  by  the  Belgian  army. 

Illegal  Sales  of  Alcohol,  Illegal  Gambling, 
Pornography,  Prostitution,  and  Drugs 

Attempts  to  ban  forms  of  vice  in  which  large 
numbers  of  people — including  “decent”  citizens — 
indulge  have  led  regularly  to  the  corruption  of  law 
enforcement  authorities  by  gangs  of  organized 
criminals.  To  service  an  extensive  clientele,  vice 
activities  need  to  be  open  or  semiopen.  If  drug  tak¬ 
ers  can  find  the  illegal  suppliers,  so  can  members 
of  the  police.  To  safeguard  their  illegal  activities, 
the  organized  gangs  who  control  them  have  a 
strong  incentive  to  bribe  members  of  police  vice 
squads.  In  addition,  they  need  to  ensure  that  politi¬ 
cians  tolerate  such  bribery. 

Characteristically,  organized  criminals,  there¬ 
fore,  supply  campaign  contributions  to  potentially 
cooperative  politicians. 

Local  Government  Corruption 

Decisions  on  zoning  (whether  land  may  be  used 
for  building  purposes)  greatly  affect  the  value  of 
land.  Thus,  bribery  of  local  politicians  who  have 
the  authority  to  make  or  to  influence  zoning  deci¬ 
sions  is  common.  It  has  been  the  subject  of  major 
ongoing  scandals  in  Spain.  (A  similar  scandal  led 
in  the  1970s  to  the  resignation  of  the  British  dep¬ 
uty  premier.)  Another  common  form  of  local  brib¬ 
ery  concerns  kickbacks  on  local  government  con¬ 
tracts.  The  tangentopolis  (“bribe  city”)  scandal  in 
Milan,  Italy,  which  broke  in  the  early  1990s,  is  a 
major  example.  A  further  common  form  of  local 
government  corruption  is  the  assignment  of  jobs, 
housing  benefits,  and  other  forms  of  patronage  as 
spoils  to  party  activists.  Such  patronage  politics 
was  a  defining  feature  of  the  “machine  politics”  in 
cities  in  the  United  States  in  the  19th  century  and 
in  much  of  the  20th  century. 
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Corruption  Relating  to  Campaign  Contributions 

Political  donations  may  be  considered  corrupt  if 

1.  they  contravene  existing  political  finance  laws, 

2.  they  derive  from  the  proceeds  of  a  corrupt 
transaction, 

3.  they  involve  the  unauthorized  use  of  state 
resources  for  partisan  purposes, 

4.  they  are  accepted  in  exchange  for  a  promise  of 
some  unauthorized  favor  or  contract, 

5.  they  are  from  disreputable  sources  (such  as  drug 
barons  or  other  criminals),  or 

6.  the  money  is  to  be  spent  on  illegal  electoral 
practices  such  as  vote  buying. 

There  have  been  major  scandals  in  recent  years 
in  Antigua  and  Barbuda,  the  Bahamas,  Brazil,  the 
Czech  Republic,  Croatia,  Ecuador,  Germany, 
India,  Italy,  Japan,  Mexico,  Pakistan,  Papua  New 
Guinea,  Spain,  Suriname,  South  Korea,  the  UK, 
and  Venezuela. 

Electoral  Corruption 

The  stuffing  of  ballot  boxes  with  fake  votes, 
deliberate  miscounting  of  votes  (sometimes  by  offi¬ 
cials  who  have  received  bribes),  and  vote  buying 
are  some  of  the  most  common  forms  of  electoral 
corruption.  Other  corruption  techniques  include 
the  inclusion  of  fake  names  on  the  voters’  register 
and  personation  (falsely  pretending  to  be  another 
voter).  Such  practices  are  potentially  widespread  in 
elections  that  are  held  in  countries  wracked  by 
violence  or  in  developing  nations  where  it  is  almost 
impossible  to  draw  up  a  reasonably  accurate  regis¬ 
ter  of  eligible  electors.  In  advanced  democracies 
too — the  UK  is  an  example — the  phenomenon  of 
electoral  fraud  is  not  unknown,  especially  in  some 
urban  areas  with  mobile  populations. 

Measures  of  Corruption 

It  is  extremely  hard  to  measure  the  overall  extent 
of  corruption  within  a  locality  or  an  entire  coun¬ 
try.  A  count  of  the  number  of  legal  cases  may  do 
nothing  more  than  demonstrate  the  effectiveness 
(or  lack  of  it)  of  law  enforcement  authorities  or  of 


the  press  in  bringing  attention  to  scandals. 
Moreover,  cases  may  vary  in  importance,  thus 
making  a  simple  count  of  prosecutions  misleading. 
Any  attempt  to  compare  levels  of  corruption  in 
different  countries  runs  up  against  the  definitional 
problems  already  described:  Behavior  that  is  con¬ 
sidered  corrupt  (or  is  actually  illegal)  may  not  be 
seen  in  the  same  way  in  other  countries. 

Especially  since  the  1990s,  various  anticorrup¬ 
tion  organizations  therefore  have  used  surveys  or 
the  subjective  estimates  of  experts  as  surrogate 
measures  of  corruption  or  of  indicators  of  corrup¬ 
tion  (such  as  the  degree  of  transparency  within  a 
country).  There  is  considerable  competition 
between  rival  corruption  indices.  Transparency 
International  promotes  its  Corruption  Perceptions 
Index  as  well  as  a  Bribe  Payers’  Index  and  Global 
Corruption  Barometer.  The  Opacity  Index  comes 
from  the  accounting  firm  PricewaterhouseCoopers. 
The  Center  for  Public  Integrity  entered  the  field 
with  a  Public  Integrity  Index — one  of  the  best  indi¬ 
ces  in  “State  Capture,”  developed  by  World  Bank. 
A  total  of  20  measures  of  transparency  are  listed 
by  Ann  Bellver  and  Daniel  Kaufmann  of  the  World 
Bank  in  their  August  2005  paper  “Transparenting 
Transparency:  Initial  Empirics  and  Policy  Appli¬ 
cations.”  In  2009,  the  Corruption  Perceptions 
Index  rated  New  Zealand  as  the  least  corrupt  and 
Somalia  as  the  most  corrupt  of  the  180  countries 
studied. 

One  of  the  curious  by-products  of  the  rivalry 
between  different  anticorruption  organizations  is 
that  Transparency  International  treats  as  strictly 
confidential  the  base  data  on  which  it  bases  its 
Corruption  Perceptions  Index.  The  value  of  these 
well-publicized  metrics  is  open  to  question,  espe¬ 
cially  since  the  results  of  information  from  country 
experts  depend  inevitably  on  the  choice  of  experts. 
Moreover,  there  is  a  tendency  to  rely  on  the  assess¬ 
ments  of  experts  from  economically  advanced 
countries  about  the  state  of  corruption  in  poorer 
countries. 

Causes  of  Corruption 

According  to  one  view,  corruption  exists  to  evade 
governmental  regulation.  It  follows  that  the  less 
the  regulation,  the  smaller  the  scope  for  corrup¬ 
tion.  Thus,  socialist  countries — such  as  the  former 
Soviet  Union — experienced  an  exceptionally  high 
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level  of  corruption  as  members  of  the  power  elite 
sought  ways  to  evade  egalitarian  policies.  Similarly, 
the  prohibition  of  alcohol  in  the  United  States 
proved  impractical.  It  merely  produced  widespread 
corruption  and  encouraged  organized  crime. 
This — some  argue — is  the  inevitable  consequence 
of  laws  to  regulate  widespread  forms  of  private 
behavior. 

There  is  an  opposite  view  that  capitalism,  not 
socialism,  encourages  corruption.  It  is  the  essence 
of  the  capitalist  ethic — according  to  this  argu¬ 
ment — that  “greed  is  good.”  Hence,  companies 
will  do  everything  they  can  to  bend  the  rules  if  it  is 
in  the  interest  of  making  a  larger  profit.  Large  cor¬ 
porations  are  especially  rapacious  when  it  comes 
to  dealings  in  foreign,  less  developed  nations. 

A  third  view,  popular  in  the  1960s  and  1970s, 
is  that  corruption  is  especially  prevalent  during 
periods  of  social  and  economic  transition. 
Developmental  theorists  such  as  Arnold  J. 
Heidenheimer  posited  that  there  is  relatively  little 
corruption  in  socially  stable,  traditional  societies. 
When  there  is  rapid  industrialization  and  migra¬ 
tion,  the  old  norms  are  abandoned,  and  corruption 
flourishes.  However,  corruption  then  greatly 
declines  (without  the  need  for  any  anticorruption 
policies)  with  the  coming  of  an  affluent  society  in 
which  middle-class  respectability  and  suburban 
lifestyles  predominate. 

This  theory  now  seems  much  less  convincing 
than  it  was  at  the  time  Heidenheimer  published  his 
influential  work  in  1970.  The  Watergate  scandal 
in  the  United  States  during  1972  to  1974  and  the 
Poulson  Affair  in  the  UK  at  about  the  same  time 
showed  that  the  theory  of  middle-class  civic  virtue 
was  overoptimistic. 

Several  other  causes  are  sometimes  suggested: 
lack  of  democracy,  bad  governance,  low  salaries 
for  public  officials,  one-party  rule  in  a  locality,  or 
lack  of  a  free  and  active  press.  Such  explanations 
either  are  too  vague  or  they  are  (at  least  partly) 
defective  and  subject  to  notable  exceptions.  For 
example,  Singapore  is  a  country  with  low  demo¬ 
cratic  credentials  but  with  a  well-regarded  record 
in  the  field  of  anticorruption. 

Effects  of  Corruption 

In  the  1960s  and  1970s,  influential  writers  such  as 
Samuel  Huntington,  Nathaniel  Leff,  and  Colin 


Leys  argued  against  the  “moralistic”  view  that  cor¬ 
ruption  is  necessarily  damaging.  In  fact,  corrup¬ 
tion  could  oil  the  wheels  of  progress  by  enabling 
investors  to  find  ways  around  restrictive  rules. 
Especially  in  developing  countries,  corruption 
therefore  was  sometimes  to  be  welcomed.  This  was 
the  predominant  view  of  Heidenheimer  mentioned 
previously. 

There  was  a  sudden  and  massive  change  of 
opinion  following  a  World  Bank  report  in  1969  on 
corruption  in  Africa.  From  then  onward,  corrup¬ 
tion  came  to  be  seen  as  a  major  evil  and  as  an 
impediment  to  economic  growth.  The  World  Bank 
itself  became  active  in  promoting  a  campaign 
against  corruption.  Transparency  International 
was  founded  in  1993  as  a  campaigning  organiza¬ 
tion,  partly  by  former  officials  of  the  World  Bank. 

The  sweeping  statements  that  have  been  made 
first  about  the  virtues  and  then  about  the  vices  of 
corruption  have  tended  to  act  as  substitutes  for 
open-minded  studies  of  the  effects  of  corruption. 
The  problem  of  relating  corruption  and  economic 
development  is  seen  by  comparing  countries  with 
high  levels  of  corruption — such  as  South  Korea 
and  the  Philippines.  In  the  Philippines,  economic 
growth  has  been  low,  but  it  has  been  very  high  in 
South  Korea.  This  shows  that  the  effects  of  corrup¬ 
tion  on  economic  growth  are,  at  the  very  least, 
complex  and  serves  as  a  warning  against  general¬ 
izations  and  moral  exhortations. 

Cures  for  Corruption 

Many  different  cures  for  corruption  have  been  sug¬ 
gested,  though  none  of  them  has  proved  to  be 
widely  effective. 

The  proposals  include 

1 .  cutting  the  number  of  regulations  and  freeing 
markets; 

2.  international  anticorruption  conventions,  such 
as  that  of  the  United  Nations,  which  came  into 
force  in  2005; 

3.  national  laws  to  outlaw  international  bribery, 
such  as  the  U.S.  Foreign  Corrupt  Practices  Act 
of  1977; 

4.  specialized  anticorruption  agencies  such  as 
Hong  Kong’s  Independent  Commission  Against 
Corruption; 
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5.  campaigns  by  nongovernmental  organizations 
such  as  Transparency  International; 

6.  improved  accounting  procedures; 

7.  improved  procedures  to  ensure  competitive 
bidding  for  public  contracts; 

8.  improved  international  efforts  to  combat 
organized  crime; 

9.  higher  salaries  for  officials  to  reduce  incentives 
for  corruption; 

10.  better  training  for  law  enforcement  officials; 

11.  ethical  training  for  officials; 

12.  reforms  of  libel  and  other  press  laws  to  enable 
journalists  to  expose  and  report  corruption; 

13.  aid  conditionality — threatened  or  actual 
withdrawal  of  assistance  to  countries  whose 
governments  fail  to  implement  anticorruption 
measures; 

14.  measures  to  combat  international  money 
laundering;  and 

15.  reforms  of  laws  relating  to  the  funding  of 
political  parties  and  election  campaigns  to 
reduce  the  reliance  of  candidates  and  parties  on 
large  (and  possibly  corrupt)  political  donations. 

Conclusion:  Academic  Studies 
Versus  Practical  Actions 

In  the  past  20  years,  international  organizations, 
governmental  agencies,  and  pressure  groups  have 
devoted  major  attention  and  large  sums  of  money 
to  the  fight  against  corruption.  This  campaign  has 
had  a  vital  impact  on  the  academic  study  of  cor¬ 
ruption.  It  has  tended  to  divert  attention  from 
fundamental  research  into  areas  related  to  advo¬ 
cacy  and  to  the  recommendation  of  cures.  Most  of 
the  corruption  indices,  which  have  become  the 
vogue,  provide  pseudomeasurements.  They  may  be 
of  some  practical  value,  but  they  are  dangerous  if 
treated  with  too  much  reverence. 

Michael  Pinto-Duschinsky 
Brunei  University 
Uxbridge,  United  Kingdom 

See  also  Administration;  Clientelism;  Election  Research; 

Electoral  Campaigns;  Elites;  Judiciary;  Party  Finance; 

Police 


Further  Readings 

Banfield,  E.  C.  (1958).  The  moral  basis  of  a  backward 
society.  Glencoe,  IL:  Free  Press. 

Heidenheimer,  A.  J.  (Ed.).  (1970).  Political  corruption: 
Readings  in  comparative  analysis.  New  Brunswick, 

NJ:  Transaction. 

Heidenheimer,  A.  J.,  &  Johnston,  M.  (Eds.).  (2002). 
Political  corruption:  A  handbook.  New  Brunswick, 

NJ:  Transaction. 

Kaufmann,  D.,  &  Bellver,  A.  (2005).  Transparenting 
transparency:  Initial  empirics  and  policy  applications. 
World  Bank  Policy  Research  Working  Paper. 

Retrieved  March  30,  2011,  from  http://siteresources. 
worldbank.org/INTEAPREGTOPRURDEV/Resources/ 
573691-1 175901454225/seminarl_background_ 
reading.pdf 

Kotkin,  S.,  &  Sajo,  A.  (Eds.).  (2002).  Political  corruption 
in  transition:  A  sceptic’s  handbook.  Budapest, 
Hungary:  Central  European  University  Press. 

Pinto-Duschinsky,  M.  (2002).  Financing  politics:  A  global 
view.  Journal  of  Democracy,  13(4),  69-86. 

Robinson,  M.  (Ed.).  (1998).  Corruption  and 
development.  Abingdon,  UK:  Frank  Cass. 

U4  Anti-Corruption  Resource  Centre:  http://www.U4.no 


Corruption,  Administrative 


Administrative  corruption  is  the  abuse  of  roles, 
powers,  or  resources  found  within  public  bureau¬ 
cracies.  It  may  be  initiated  by  line  or  staff  officials, 
their  superiors,  or  the  agency  clients.  The  latter 
will  usually  be  private  parties  (e.g.,  applicants  for 
licenses),  but  particularly  in  large  and  centralized 
governments;  clients  might  also  be  individuals  or 
institutions  from  elsewhere  in  the  public  sector. 
This  entry  considers  the  complexities  of  defining 
administrative  corruption,  some  common  variet¬ 
ies,  major  causal  factors  and  consequences,  the 
central  role  of  administrative  corruption  and  pub¬ 
lic  administration  theories  in  reform  movements, 
and  the  effects  of  changing  relationships  between 
the  public  and  private  sectors  on  the  basic  con¬ 
cepts  of  administrative  corruption. 

Administrative  corruption  is  a  subset  of  the 
broader  phenomenon  of  corruption,  and  is  com¬ 
monly  distinguished  from  political  corruption. 
Corrupt  practices  in  the  administrative  realm,  such 
as  bribery,  extortion,  graft,  patronage,  and  official 
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theft,  to  name  a  few,  can  occur  in  the  political 
arena  too  and,  indeed,  can  be  common  in  private 
sector  administration.  Indeed,  some  varieties  of 
administrative  corruption  involve  “middlemen” 
who  station  themselves  at  the  boundaries  between 
state  and  society,  promising  to  reduce  transaction 
costs  for  citizens  and  bureaucrats  alike — even 
though  such  gains  may  be  small  or  even  illusory. 
Still,  the  term  administrative  corruption  is  usually 
reserved  for  abuses  in  the  realm  of  government 
agencies  and  programs. 

Conceptually,  administrative  corruption  shares 
some,  but  not  all,  of  the  definitional  problems  that 
plague  the  analysis  of  corruption  generally.  Formal 
rules  and  roles  are  also  usually  more  clearly  speci¬ 
fied  in  administration  than  in  political  life.  But  in 
societies  with  dominant  and  pervasive  government 
agencies,  or  with  very  weak  public  institutions,  the 
limits  of  the  administration  or  its  internal  organi¬ 
zation  may  not  be  particularly  clear.  Where  the 
state  lacks  legitimacy  or  credibility — or,  where  it  is 
dominated  by  a  dictator,  ruling  inner  circle,  the 
military,  or  one  political  party — the  formal  and  de 
facto  norms  of  administration  may  diverge  sharply. 
Moreover,  what  constitutes  “abuse”  may  well  be 
more  than  just  a  question  of  administrative  pro¬ 
cess:  Laws  and  regulations  can  be  vague  or  contra¬ 
dictory  (perhaps  deliberately  so).  Obligations  and 
preferences  flowing  from  politics  or  social  ties  may 
not  only  cause  rules  to  be  broken  but  can  also 
become  normative  systems  themselves,  rivaling 
formal  rules  in  salience  and  power. 

Like  corruption  generally,  the  causes  of  admin¬ 
istrative  corruption  may  be  personalistic,  institu¬ 
tional,  or  systemic.  In  the  first  category  are  factors 
such  as  venal,  or  poorly  recruited  and  poorly 
trained  personnel.  The  second  includes  not  only 
poor  institutional  management  but  also  adminis¬ 
trative  systems  that  encourage  or  conceal  corrup¬ 
tion  because  of  their  internal  structures  (consider  a 
tax  collection  agency  that  gives  individual  agents 
the  power  to  alter  assessments  as  well  as  to  collect 
funds)  or  incentive  systems  (e.g.,  very  low  pay). 
Systemic  causes  of  corruption  might  include  exces¬ 
sive  political  interference  by  elected  officials  or 
citizens,  widespread  poverty,  or  a  lack  of  govern¬ 
ment  legitimacy. 

The  consequences  of  administrative  corruption 
have  been  a  matter  of  much  debate.  Many  have 
portrayed  it  as  functional — as  “grease  for  the 


wheels,”  providing  incentives  to  speed  up  official 
processes — or  as  creating  informal  price  systems 
and  a  rough-and-ready  kind  of  accountability  that 
would  not  otherwise  exist.  Since  the  late  1980s, 
however,  improved  theory  and  data  have  shown 
that  administrative  corruption  is  much  more  like 
sand  in  the  gears.  There  is  not,  after  all,  a  finite 
amount  of  inefficiency  in  administrative  agencies; 
bureaucrats — particularly  the  very  poorly  paid 
officials  of  developing  states — can  create  much 
more  of  it  if  they  stand  to  gain  as  a  result.  Bribes 
are  thus  more  powerful  signals  that  money  can  be 
made  by  contriving  new  requirements,  “losing” 
records,  or  simply  doing  nothing  until  clients  pay 
up.  Regulatory  and  extractive  functions  such  as 
inspections  and  tax  collections  are  subject  to  a 
similar  logic. 

Measuring  corruption  of  any  sort  is  problematic 
because  it  is  usually  clandestine  and  lacks  an 
immediate  victim  with  a  stake  in  reporting  abuses. 
Vulnerability  to  administrative  corruption  may  be 
somewhat  easier  to  estimate;  however,  compiling 
indicators  of  government  performance  and  com¬ 
paring  them  against  realistic  benchmarks  is  a 
promising  approach.  A  licensing  process  that  takes 
7  weeks  and  involves  33  steps  in  City  A,  for  exam¬ 
ple,  likely  has  worse  corruption  than  a  similar 
process  requiring  4  days  and  5  steps  in  City  B.  The 
numerous  steps  and  long  delays  in  the  first  instance 
may  well  reflect  past  corruption — bureaucrats 
have  learned  that  they  can  make  money  by  adding 
requirements  and  delays,  as  noted  above — and 
also  tell  us  something  about  current  incentives  sus¬ 
taining  it,  since  the  longer  and  less  responsive  the 
process,  the  greater  the  temptation  becomes  to  lay 
out  some  “speed  money.” 

Administrative  corruption  has  generally  been 
the  main  focus  for  anticorruption  movements 
around  the  world.  That  is  so  for  several  reasons.  At 
a  practical  level,  many  of  the  international  and 
intergovernmental  bodies  that  have  energized 
reform  efforts  are  barred  from  involvement  in  soci¬ 
eties’  internal  politics.  Nongovernmental  organiza¬ 
tions  and  grassroots  reformers  often  find  it  prudent 
to  de-emphasize  political  corruption,  since  taking 
on  such  issues  often  means  confronting  powerful 
entrenched  interests.  Focusing  on  administrative 
corruption  is  a  way  to  get  at  the  problem  without 
directly  confronting  regimes  and  may  allow  reform¬ 
ers  to  draw  on  legal  and  administrative  sources  of 
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support,  for  which  there  may  be  few  counterparts 
in  political  life. 

Initiatives  against  administrative,  as  opposed 
to  political,  corruption  also  draw  on  more  exten¬ 
sive  bodies  of  theory  and  evidence.  In  the  former 
category  are  a  variety  of  “micro”-theoretical  ap¬ 
proaches  that  strip  processes  and  structures  down 
to  their  essentials.  Examples  include  principal- 
agent  or  principal-agent-client  models  that  sim¬ 
plify  administrative  pathologies  in  useful  ways, 
identifying  vulnerabilities  and  incentive  problems. 
Another  elegant  framework,  Klitgaard’s  widely 
applied  “equation”  suggesting  that  Corruption  = 
Monopoly  +  Discretion  —  Accountability,  con¬ 
siders  the  individual  official’s  power  in  somewhat 
greater  detail  and  compares  those  powers  against 
key  constraints.  Theories  of,  and  arrangements 
for,  queuing — that  is,  how  cases  are  assigned  to 
officials — highlight  the  importance  of  client  vul¬ 
nerabilities  and  official  collusion,  both  on  a  single 
structural  level  and  involving  dealings  with  supe¬ 
riors.  Often,  such  micro  accounts  of  administra¬ 
tive  corruption  can  be  usefully  nested  within  the 
broader  analysis  of  rent-seeking  opportunities 
and  practices.  A  shortcoming  of  some  micro  theo¬ 
ries  is  that  they  treat  administrative  corruption 
and  bribery  as  nearly  synonymous,  underestimat¬ 
ing  both  the  variety  of  corrupt  practices  and  the 
ingenuity  of  both  officials  and  clients  seeking 
illicit  benefits. 

At  the  overall  organizational  level,  public  admin¬ 
istration  theories  examine  structure  and  process, 
internal  control,  management  of  personnel  and 
resources,  and  accountability  of  several  sorts. 
Historically,  the  struggle  against  administrative 
corruption  has  helped  shape  and  establish  the  value 
of  public  administration  approaches.  Frank  Anechi- 
arico  and  James  Jacobs  point  out  that  in  the  United 
States,  an  anticorruption  movement  emerging  out 
of  abolitionism  and  determined  to  reform  corrupt 
cities  intertwined  with  emerging  theories  of  public 
administration  to  define  four  distinctive  “visions” 
of  administrative  corruption  and  reform,  each 
dominant  within  a  particular  era.  The 
“Antipatronage  Vision”  (roughly  between  1870 
and  1900)  was  aimed  not  only  at  political  reform 
but  also — critically  for  the  theories  that  followed — at 
driving  machine  patronage  and  the  influence  of 
political  bosses  out  of  public  administration. 
The  “Progressive  Vision”  (1900-1933) — like  its 


predecessor,  part  of  a  larger  political  movement — 
sought  a  nearly  total  segregation  of  administration 
from  politics  and  did  so  on  the  national  stage;  it 
also  emphasized  efficiency  in  internal  processes  as 
both  the  means  and  the  metric  for  reduced  corrup¬ 
tion  and  better  government.  The  “Scientific 
Administration  Vision”  (1933-1970)  shifted  the 
emphasis  from  political  reform  to  rigorous  internal 
management  of  government  guided  by  organiza¬ 
tion  theory,  pointing  to  a  science  of  administration 
independent  of  the  substantive  competence  of  spe¬ 
cific  agencies.  Finally,  the  “Pan optic  Vision”  (1970 
onward)  took  advantage  of  new  technology  to 
institute  internal  controls  that  were  so  pervasive  as 
to  become  problems  in  their  own  right. 

Insistence  on  the  separation  of  politics  from 
administration  has  thus  shaped  both  reform  and 
public  administration  thinking.  But  it  also  raises 
problems.  Few  would  argue  for  the  pervasive 
politicization  of  administration,  but  critics  suggest 
that  reforms,  with  their  focus  on  efficiency,  spe¬ 
cialization,  and  independent  administration,  may 
create  governments  that  are  accountable  to  no 
one — neither  voters  nor  elected  officials. 

That  separation  of  politics  and  administration 
may  come  under  increasing  strain  as  privatization, 
the  rise  of  parastatal  bodies,  and  an  increasing  (if 
not  always  appropriate)  expectation  that  public 
administration  be  carried  out  in  more  business-like 
ways  alter  state  agencies  and  their  relationships  to 
society.  Privatization  and  the  devolution  of  govern¬ 
ment  functions  onto  corporate-style  entities  are 
often  proposed  as  a  way  of  eradicating  administra¬ 
tive  corruption  by  taking  bureaucrats  out  of  the 
loop:  Public  functionaries  with  nothing  to  sell  can¬ 
not  be  corrupt.  The  result,  however,  may  be  even 
less  accountability  and  responsiveness  in  the  ways 
in  which  people  are  governed,  as  more  functions 
move  into  the  private  sector  or  are  conducted,  even 
partially,  by  its  rules.  Public-private  partnerships, 
for  their  part,  may  invite  private  influence — which 
may  well  be  nonpartisan  but  will  often  have  strong 
political  implications — into  the  halls  of  govern¬ 
ment.  Both  strategies,  in  differing  ways,  mingle 
state  with  society  and  public  with  private  enter¬ 
prise,  and  they  do  so  in  a  setting  of  weakened 
political  accountability.  Such  trends  not  only  raise 
practical  problems,  but  they  also  suggest  that  the 
basic  distinction  between  administrative  corruption 
and  other  varieties  may  be  more  an  extension  of  the 
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normative  doctrine  that  such  separation  should  be 
maintained  than  of  any  sharp  empirical  boundaries 
surrounding  the  administration  process. 

Michael  Johnston 
Colgate  University 
Hamilton,  New  York,  United  States 
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Cost-Benefit  Analysis 


Cost-benefit  analysis  (CBA)  is  an  accounting 
framework  that  compares  benefits  and  costs  asso¬ 
ciated  with  a  project  for  the  purposes  of  informa¬ 
tion  and  discussion.  As  discussed  below,  CBA  is 
widely  used  by  governments,  and  if  used  properly, 
can  reduce  unnecessary  or  inefficient  government 
spending.  Concerns  in  CBA  considered  below 
include  whose  benefits  and  costs  should  count,  on 
what  scale  the  analysis  should  be  done,  how  to 
account  for  the  value  of  future  costs  and  benefits, 
and  how  to  account  for  risk,  uncertainty,  and  non- 
market  values. 

The  Framework 

CBA  or  benefit-cost  analysis  (BCA;  these  terms 
are  used  interchangeably)  is  an  accounting  frame¬ 
work  for  government  projects.  In  this  framework, 


benefits  and  costs  associated  with  a  public  invest¬ 
ment  or  decision  are  laid  out  for  purposes  of  infor¬ 
mation  and  discussion.  A  related  framework, 
widely  used  in  health  care  but  also  in  other  areas, 
is  cost-effectiveness  analysis,  sometimes  known  as 
least  cost  planning.  In  least  cost  planning,  the  goal 
is  given,  for  example,  to  achieve  a  given  level  of 
carbon  monoxide  in  a  city,  and  the  least  expensive 
way  of  achieving  this  goal  is  sought. 

Externalities 

The  rationale  for  government  investment  is  exter¬ 
nalities.  Externalities  are  effects  that  are  not 
accounted  for  in  market  operations.  They  arise 
solely  from  the  expenses  of  operating  markets.  Air 
pollution,  for  example,  is  considered  an  externality 
as  the  polluting  firm  does  not  take  into  account  the 
damaging  effect  of  its  pollution  on  the  surrounding 
population.  In  a  market  system,  the  population 
could  pay  the  firm  to  reduce  pollution.  Or  the  firm 
could  pay  the  individuals  to  tolerate  it,  whichever 
course  was  the  cheapest.  Yet  any  such  market 
arrangement  might  be  too  expensive  to  implement 
due  to  transaction  costs.  Sometimes  the  government 
can  institute  a  quasi  market  as  in  cap-and-trade 
permits.  In  this  case,  the  government  fixes  the  quan¬ 
tity  of  pollution  allowed  and  then  sells  (or  gives 
away)  rights  to  this  quantity.  These  rights  can  then 
be  traded.  This  sort  of  arrangement  is  increasingly 
used  in  fisheries  to  prevent  overfishing.  The  total 
quota  of  fish  allowed  to  be  caught  is  set  and  rights 
are  then  distributed  among  fishermen  for  portions 
of  the  total  quota.  The  decision  about  whether  to 
use  a  quota  system,  or  to  use,  say,  pollution  taxes 
or  emissions  limits  on  the  basis  of  the  available 
technology  is  the  sort  of  decision  that  CBA  can  be 
used  to  address.  More  commonly,  CBA  is  used  to 
address  questions  in  areas  such  as  transportation, 
dam  building,  and,  more  recently,  expenditures  on 
social  programs  such  as  Head  Start. 

Government  Use 

The  U.S.  Federal  Government  widely  uses  CBA 
under  presidential  executive  orders.  Under  these 
orders,  the  Office  of  Management  and  the  Budget 
(OMB)  requires  the  use  of  CBA  by  executive  agen¬ 
cies  and  many  congressional  agencies  as  well.  OMB 
has  created  a  set  of  guidelines  for  the  use  of  CBA. 
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CBA  is  also  used,  but  less  widely,  by  state  and 
municipal  governments.  If  used  properly,  the  CBA 
framework  can  reduce  poorly  conceived  govern¬ 
ment  spending.  In  fact,  it  has  already  played  this  role 
to  a  significant  extent  but  not  to  the  extent  possible. 

What  Are  Benefits  and  Costs? 

Benefits 

Benefits  are  measured  by  the  willingness  to  pay 
(WTP)  for  gains.  Thus,  in  evaluating  the  building 
of  a  new  road,  the  government  might  calculate  the 
time  savings  to  users.  Then,  they  would  estimate 
the  value  of  users’  time  and  determine  a  total  value 
of  time  saved.  This  figure  would  then  represent  an 
estimate  of  the  users’  WTP  and  therefore  of  their 
benefits. 

Costs 

Costs  are  the  amount  that  those  bearing  them 
would  be  willing  to  accept  (WTA)  in  order  to  bear 
the  burden  of  the  project  or  decision.  Thus,  in 
building  a  new  road,  the  analyst  would  seek  to 
determine  what  payment  the  taxpayers  would  have 
to  make.  In  addition,  homeowners  who  live  near 
the  road  might  bear  certain  costs  in  terms  of  pollu¬ 
tion  and  noise.  The  amount  it  would  take  to  com¬ 
pensate  these  homeowners  would  be  an  addition  to 
the  costs.  Alternatively,  the  government  might 
build  a  barrier  to  reduce  noise  so  that  the  cost  to 
nearby  homeowners  is  small  and  might  be  ignored. 

Standing 

Not  everyone  affected  nor  all  sentiments  created 
are  included  in  practice  in  a  CBA.  The  decision 
about  whom  to  include  or  exclude  in  a  CBA  is  to 
decide  the  standing  of  those  affected.  Most  analy¬ 
ses  are  done  from  a  particular  viewpoint.  A  munic¬ 
ipality  might  consider  only  the  effects  to  municipal 
revenues  or  only  those  felt  within  the  city.  The 
state  government  might  not  consider  the  effects  of 
its  pollution  policy  on  neighboring  states,  and  the 
federal  government  might  not  consider  its  effects 
on  foreign  countries.  Many  projects  will  have  only 
minor  effects  on  certain  groups,  and  it  may  be  con¬ 
sidered  too  expensive  to  calculate  these.  In  addi¬ 
tion,  certain  sentiments  may  be  disregarded  as  not 
being  acceptable;  for  example,  one  might  suffer 


harm  from  envy,  but  this  harm  would  generally  go 
unrecognized  in  a  CBA.  A  more  useful  example 
concerns  the  value  of  stolen  goods  to  the  thief. 
Clearly,  they  have  value  to  the  thief,  but  in  consid¬ 
ering  whether  or  not  these  goods  should  be 
returned,  one  would  not  count  their  value  to  the 
thief,  as  holding  such  goods  is  illegal. 

Partial  and  General  Equilibrium  Analysis 

How  many  markets  will  a  project  affect?  Suppose 
a  buyer  is  considering  whether  or  not  to  buy  a  less 
expensive  regular  refrigerator  or  a  more  expensive 
energy-saving  refrigerator.  The  energy-saving 
model  will  save  energy  costs  each  year,  and  the 
buyer  wonders  if  the  extra  expense  is  worthwhile. 
This  can  be  determined  by  the  buyer’s  own  per¬ 
sonal  CBA.  Any  effect  on  markets  is  negligible  and 
can  be  ignored.  An  analysis  such  as  this  that  is  very 
limited  in  its  inclusion  of  markets  is  called  partial 
equilibrium  analysis.  However,  if  one  is  consider¬ 
ing  a  countrywide  policy  to  subsidize  energy¬ 
saving  models,  the  analysis  should  consider 
markets  more  broadly.  When  the  effects  in  many 
markets  are  taken  into  account,  the  analysis  is  gen¬ 
eral  equilibrium  analysis.  Consider  a  national  oil 
tax  and  all  of  the  markets  affected:  transportation 
markets;  the  price  of  products  made  with  oil,  such 
as  plastics;  labor  markets;  and  others.  One  analysis 
is  that  such  a  policy  should  be  implemented  from 
a  general-equilibrium  perspective. 

Discounting 

Consider  the  choice  between  a  more  expensive 
energy  saver  refrigerator  and  a  less  expensive  one. 
Suppose  the  buyer  faces  the  following  costs  and 
benefits  over  time.  For  simplicity,  we  will  assume 
that  the  life  of  each  refrigerator  is  5  years  and  that 
the  energy  costs  are  constant  (see  Table  1). 

The  buyer  would  spend  an  extra  $200  now  to 
save  $255  over  5  years.  Is  it  worthwhile  buying  the 
energy  saver?  The  savings  of  $45  per  year  amounts 
to  a  total  of  $225,  which  is  greater  than  the  $200 
extra  cost,  but  this  comparison  fails  to  take  into 
account  the  time  value  of  money.  Suppose  you 
know  that  you  can  obtain  a  5  %  yearly  return  from 
investing  in  a  safe  Certificate  of  Deposit  (CD).  The 
question  now  is  what  sum  of  money  you  could 
invest  today  at  5%  that  would  yield  $45  per  year 
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Table  I  Discounting:  An  Illustration 

Energy  Costs  ($) 


Choice 

Initial  Costs  ($) 

Year  1 

Year  2 

Year  3 

Year  4 

Year  5 

Cheaper  refrigerator 

1,000 

60 

60 

60 

60 

60 

Energy-saver  refrigerator 

1,200 

15 

15 

15 

15 

15 

Cost  difference 

-200 

45 

45 

45 

45 

45 

for  5  years.  The  answer  to  this  question  is  the  pres¬ 
ent  value  (PV)  of  the  stream  of  5-year  payments. 
This  PV  will  be  given  by  the  formula  for  an  annu¬ 
ity  (a  uniform  stream  of  payments),  which  is 

PV  =  A[  1  -  (1  +  r)“fJ/(l  +  rf, 

where  A  is  the  yearly  amount,  t  is  time,  and  r  is  the 
discount  or  interest  rate  (here  5%).  In  the  case  of 
the  energy  saver  refrigerator,  this  will  be 

PV  -  $45 [1  -  (1.05)sJ/(.05)  -  $194.83. 

Since  this  PV  is  less  than  the  $200  cost,  it  is  not 
worthwhile  buying  the  energy  saver.  If  one’s  dis¬ 
count  rate  were,  for  example,  1%,  then  the  PV  of 
the  energy  saving  would  be  $218.40,  and  it  would 
be  worthwhile.  The  lower  the  discount  rate,  the 
greater  is  the  PV  of  future  benefits.  The  net  present 
value  (NPV)  is  the  PV  of  benefits  minus  costs.  The 
CBA  test  is  to  determine  if  the  NPV  is  positive,  in 
which  case  it  is  said  that  the  project  is  financially 
desirable.  The  NPV  in  the  present  example  would 
be  as  follows: 

NPV  =  $194.83  -  $200  =  -$5.17. 

Since  this  is  a  negative  value,  the  project  is  not 
financially  desirable.  One  way  to  look  at  this 
example  is  that  if  one  receives  less  than  the  5% 
return,  one  can  gain  from  investing  the  $200  in  a 
safe  CD.  The  NPV  is  not  the  only  measure  used 
but  is  the  one  most  used  by  economists. 

Risk  and  Uncertainty 

CBA  almost  always  involves  considering  cash 
flows  in  the  future.  The  future  is  always  uncertain 


and  risky.  Risk  is  usually  defined  as  occurring 
when  the  probabilities  of  events  are  known. 
Uncertainty  occurs  when  the  probabilities  are 
unknown  or  subject  to  unknown  variances.  In  con¬ 
sidering  the  purchase  of  the  refrigerator,  for  exam¬ 
ple,  we  might  expect  the  energy  savings  to  increase 
over  time  with  some  certain  probability.  This 
would  be  a  question  of  risk.  As  a  simple  proce¬ 
dure,  we  might  calculate  the  expected  value  of  the 
future  energy  savings,  given  the  known  probabili¬ 
ties  of  increases  in  energy  prices.  The  expected 
value  is  the  energy  savings  times  its  probability. 
Suppose  we  expect  the  value  of  the  energy  savings 
to  be  $20  per  year  due  to  the  increase  in  energy 
costs  and  that  this  increase  in  savings  is  expected 
with  90%  probability.  The  expected  value  of  this 
in  each  year  would  be  $5.00  times  90%  or  $4.5 
dollars  per  year.  Taking  the  PV  of  this  annuity  at 
5%  for  5  years,  we  have  $19.48.  When  we  add  this 
to  the  PV  in  the  previous  scenarios  with  the  5% 
discount  rate,  we  would  have  a  total  of  NPV  = 
$f4.31.  Since  this  is  positive,  the  energy  saver 
passes  the  CBA  test,  given  that  the  energy  costs  go 
up  by  $5  per  year  with  90%  probability. 

In  a  more  sophisticated  analysis,  we  would  have 
a  distribution  of  probabilities.  The  NPV  can  also 
be  calculated  in  these  situations.  A  common  proce¬ 
dure  is  to  use  a  Monte  Carlo  simulation  program 
that  allows  us  to  account  for  the  entire  probability 
distribution.  Further  discussion  of  this  issue  lies 
beyond  this  essay. 

Nonmarket  Values 

Although  CBA  is  a  sort  of  financial  accounting,  it 
considers  many  values  not  always  counted  in  busi¬ 
ness  accounting.  Many  preferences  can  be  valued, 
although  they  are  not  always  readily  apparent.  The 
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value  of  a  house  with  a  view  compared  with  the 
same  house  without  a  view  is  readily  determined, 
for  example.  The  value  of  cleaner  air  and  water 
can  also  be  determined.  The  value  of  fishing  in  a 
less  polluted  stream  as  compared  with  a  more  pol¬ 
luted  stream  is  more  challenging.  Similarly,  the 
value  of  visiting  a  national  park  requires  more 
sophisticated  work.  Economists  have  developed  a 
number  of  techniques  for  determining  environ¬ 
mental  and  other  values.  For  example,  they  ask 
how  far  a  person  might  drive  to  arrive  at  a  park 
destination.  Valuations  can  be  made  by  looking  at 
the  cost  of  driving  to  drivers  arriving  from  various 
estimated  distances  from  the  park.  Also,  question¬ 
naires  can  be  used  to  determine  values.  Analogies 
from  market  behavior  can  be  applied  to  nonmar¬ 
ket  behavior.  Prices  used  to  value  nonmarket 
behavior  are  called  shadow  prices.  Thus,  CBA 
attempts  to  perform  a  sort  of  social  accounting  to 
determine  the  best  public  investments. 

Richard  O.  Zerbe  Jr. 
University  of  Washington 
Seattle,  Washington,  United  States 
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Credible  Commitment 


Individuals  are  generally  assumed  to  choose  ratio¬ 
nally  in  ways  that  make  themselves  better  off. 


However,  there  are  times  when  a  political  actor 
can  benefit  by  denying  himself  or  herself,  early  in 
a  political  interaction,  the  option  of  choosing  self- 
serving  actions  later  on.  A  general  may  know  that 
retreat  could  be  a  valuable  option  in  the  future, 
but  “burning  one’s  bridges”  may  elicit  more 
advantageous  behavior  from  one’s  soldiers  (or 
one’s  opponents).  Constraining  self-interested 
choice  to  induce  beneficial  actions  from  other 
players  is  called  “credible  commitment.” 

This  entry  examines  several  systematic  and 
foundational  credible  commitment  problems.  In 
international  relations,  credible  commitment  can 
strengthen  deterrence.  Further,  opportunism  by 
government  officials  can  intimidate  economic 
actors;  healthy  economic  growth  requires  the  gov¬ 
ernment  to  be  credibly  constrained  from  opportu¬ 
nistic  behavior.  Constitutions,  courts,  independent 
central  banks,  and  international  agencies  can 
facilitate  the  kind  of  credible  commitment  that 
enhances  economic  development.  In  democracies, 
credible  commitment  necessarily  entails  a  limita¬ 
tion  on  the  scope  and  power  of  majority  rule. 

Credible  Commitment, 
Programming,  and  Delegation 

At  the  heart  of  the  problem  of  credible  commit¬ 
ment  is  the  strategic  interdependence  of  two  or 
more  actors  making  sequenced  choices.  An  exam¬ 
ple  explored  in  the  early  1960s  by  Thomas 
Schelling  is  nuclear  deterrence.  A  stylized  version 
of  this  problem  is  as  follows:  Assume  that  the 
Soviet  Union  moves  first,  choosing  whether  or  not 
to  invade  Europe.  The  United  States  then  responds 
to  an  invasion  with  either  conventional  or  nuclear 
weapons.  The  Soviet  Union’s  choice  depends  on 
what  it  thinks  the  United  States  will  do  in 
response.  The  Soviet  Union’s  dominance  in  con¬ 
ventional  weaponry  would  give  it  a  victory  in 
Europe  if  the  United  States  restricts  itself  to  con¬ 
ventional  weapons.  However,  if  the  United  States 
responds  with  nuclear  weapons,  the  result  will  be 
a  nuclear  holocaust  that  is  the  worst  outcome  for 
both  sides. 

The  United  States  would  like  to  convince  the 
Soviet  Union  of  its  willingness  to  use  its  nuclear  arse¬ 
nal.  However,  after  an  invasion,  the  self-interested 
choice  for  the  United  States  is  not  to  initiate  a 
nuclear  exchange.  The  self-interest  of  the  United 
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States  diminishes  the  deterrent  effect  of  its  nuclear 
arsenal.  Based  on  its  assessment  of  the  U.S.  self- 
interested  response,  the  Soviet  Union  can  invade 
with  impunity.  Paradoxically,  the  option  of  mak¬ 
ing  a  self-interested  choice  works  against  the  U.S.’s 
own  best  interests — thus,  encouraging  the  very 
action  that  it  would  like  to  deter. 

The  United  States  would  be  better  off  if  it  could 
commit  itself,  prior  to  the  invasion,  to  a  course  of 
action  that  would  not  be  in  its  best  interests  later 
on.  If  it  could  commit  itself  to  using  the  nuclear 
deterrent,  willingly  initiating  a  nuclear  war  in 
response  to  a  Soviet  invasion,  then  the  best  choice 
for  the  Soviet  Union  would  be  not  to  invade. 

But  how  could  the  United  States  commit  itself, 
prior  to  an  invasion,  to  an  action  that  would  defy 
its  self-interest  in  the  event?  One  answer  is  pro¬ 
gramming  of  behavior.  If  the  United  States  creates 
bureaucratic  machinery  that  mechanically  imple¬ 
ments  a  nuclear  response  in  the  event  of  an  inva¬ 
sion,  then  the  Soviet  Union  would  choose  not  to 
invade. 

Closely  related  to  programming  is  delegation.  If 
the  response  to  a  Soviet  invasion  were  transpar¬ 
ently  and  credibly  delegated  to  someone  who 
could  be  expected  to  choose  a  nuclear  exchange, 
then  the  Soviet  Union  would  be  given  pause.  For 
example,  the  chairman  of  the  Joint  Chiefs  of  Staff 
from  1961  to  1965  was  General  Curtis  LeMay. 
LeMay  had  not  only  implemented  the  firebombing 
of  Tokyo  during  World  War  II,  but  he  had  also 
expressed  support  for  the  use  of  nuclear  weapons. 
The  Soviet  government’s  awareness  of  LeMay’s 
central  role  in  strategic  decision  making  might  well 
have  made  the  nuclear  deterrent  more  credible — 
not  because  he  was  representative  of  other  public 
officials  but  because  he  was  unrepresentative. 

The  delegation  solution  to  credible  commitment 
problems  defies  our  conventional  wisdom.  Con¬ 
ventional  wisdom  (and  a  good  deal  of  principal- 
agent  theory)  concludes  that  selecting  an  agent 
with  preferences  similar  to  the  principal’s  prefer¬ 
ences  is  desirable;  or  failing  this,  the  principal 
should  shape  the  agent’s  behavior  through  incen¬ 
tives  that  cause  the  agent  to  choose  as  the  principal 
would  in  any  decision  situation.  However,  delega¬ 
tion  is  an  effective  means  of  credible  commitment 
only  when  the  person  receiving  the  delegated 
authority  has  preferences  different  from  the  person 
doing  the  delegating. 


Political  Moral  Hazard 
and  Central  Banking 

In  the  case  of  nuclear  deterrence,  credible  commit¬ 
ment  had  the  potential  to  help  the  United  States 
while  depriving  the  Soviet  Union  of  a  victory.  Other 
uses  of  credible  commitment  make  all  players  better 
off.  For  instance,  consider  an  investor  who  has  hid¬ 
den  some  money  underneath  her  mattress.  If  she 
were  to  invest  in  government  bonds,  then  she  could 
increase  her  wealth,  and  the  government  would 
have  funds  necessary  to  pursue  its  agenda. 

Is  the  investor’s  money  safe  with  government 
bonds?  They  may  be  a  safe  investment  if  the  gov¬ 
ernment  pursues  a  stable,  noninflationary  mone¬ 
tary  policy.  However,  democratically  elected  pub¬ 
lic  officials  inevitably  have  an  incentive  to  give  a 
surprise  “boost”  to  the  economy  with  monetary 
policies  that  would  end  up  with  inflation — dimin¬ 
ishing  the  value  of  the  bonds.  If  the  government  is 
not  credibly  constrained  from  engaging  in  infla¬ 
tionary  policies,  then  the  investor  may  seek  a 
hedge  against  inflation  and  pass  up  the  govern¬ 
ment’s  bonds.  Both  the  investor  and  the  govern¬ 
ment  would  be  better  off  if  monetary  policy  were 
delegated  to  an  agency  that  is  guaranteed  to  be 
beyond  the  control  of  the  democratically  elected 
officials — a  central  bank. 

Creating  an  independent  central  bank  is  increas¬ 
ingly  thought  of  as  an  essential  feature  of  modern 
political  economy — precisely  because  it  denies 
public  officials  the  opportunity  to  pursue  their 
morally  hazardous  preferences.  The  evidence  has 
shown,  as  noted  by  Robert  J.  Franzese  Jr.  (1999), 
that  independence  in  central  banks  does  decrease 
inflation,  by  differing  the  amounts  depending  on 
trade  openness.  Most  interestingly,  as  Giandomen- 
ico  Majone  (1999)  observes,  to  resolve  credible 
commitment  issues,  “the  delegate  is  chosen  pre¬ 
cisely  because  his  or  her  preferences  do  not  mirror 
those  of  the  delegating  authority”  (p.  69). 

Credible  Commitment  to 
Nonmajoritarian  Institutions 

The  case  of  central  banking  is  one  in  which  we  are 
accustomed  to  thinking  of  delegation  to  a  nonma¬ 
joritarian  institution  as  a  form  of  credible  commit¬ 
ment  that  protects  the  public  from  the  dangerous 
impulses  of  elected  officials.  However,  delegation 
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may  play  a  similar  role  in  a  variety  of  policy  arenas 
in  which  elected  officials  experience  temptations  to 
use  power  opportunistically  in  ways  that  endanger 
property  rights  and  contract  enforcement  and, 
thus,  diminish  investment  and  long-run  economic 
growth. 

Many  of  a  government’s  actions  are  crucial  to 
the  strategic  thinking  of  investors,  entrepreneurs, 
homeowners,  and  other  citizens.  Citizens  may  have 
reason  to  suspect  opportunistic  behavior  on  the  part 
of  the  government;  this  is  seen  around  the  world  in 
the  concept  of  “political  risk.”  If  a  firm  invests  in  an 
expensive,  immobile  manufacturing  plant,  will  the 
government  then  use  its  taxation  powers  to  extract 
all  the  profits?  If  an  investor  embarks  on  an  invest¬ 
ment  strategy  that  assumes  low  inflation  rates,  will 
the  politicians  undermine  that  strategy  with  mone¬ 
tary  surprises?  If  an  employer  hires  workers  assum¬ 
ing  fixed  employment  regulations  and  contracts, 
will  those  be  suddenly  changed  as  a  result  of  the 
latest  election  results? 

All  of  these  may  be  thought  of  as  credible  com¬ 
mitment  problems,  in  which  the  prior  constraint  of 
political  actors  may  encourage  beneficial  economic 
growth.  Seen  through  the  lens  of  credible  commit¬ 
ment,  a  number  of  constitutional  features  can  be 
thought  of  as  aiding  the  resolution  of  credible 
commitment  problems.  A  Madisonian  separation 
of  powers  is  beneficial  because  it  diminishes  the 
ability  of  the  branches  of  government  to  unite  in 
opportunistic  schemes. 

Separation  of  powers  also  strengthens  the  inde¬ 
pendence  of  the  judiciary.  In  many  regimes,  the 
courts  may  do  a  good  job  of  arbitrating  disputes 
between  citizens,  but  in  disputes  between  citizens 
and  the  state,  the  courts  may  defer  to  the  state  and 
its  authority.  When  economic  actors  are  convinced 
that  the  judiciary  can  serve  as  a  check  on  govern¬ 
ment  officials,  they  are  more  willing  to  make  invest¬ 
ments  contributing  to  economic  development. 

The  creation  of  an  independent  regulatory 
agency  may  be  thought  of  as  an  attempt  to  limit 
the  opportunistic  actions  of  politicians.  Key  deci¬ 
sions  are  delegated  to  commissioners  and  staff  who 
have  only  tenuous  links  to  political  factions,  and 
they  are  expected  to  embed  their  decision-making 
process  in  a  fixed  procedure  that  discourages 
abrupt  changes  in  policy.  In  the  United  States,  for 
example,  the  National  Labor  Relations  Board  has 
embedded  policy  decision  making  in  a  bureaucratic 


and  legalistic  process  that  has  resulted  in  the  long¬ 
term  development  of  a  body  of  labor  laws  that 
diminishes  uncertainty  surrounding  employment 
contracts.  In  these  and  other  cases,  “Political  prop¬ 
erty  rights — the  rights  to  exercise  public  authority 
in  certain  policy  areas — have  been  altered  by  del¬ 
egating  important  powers  to  nonmajoritarian 
institutions”  (Majone,  2001,  p.  58). 

In  general,  problems  of  credible  commitment 
constitute  a  change  of  direction  for  rational  choice 
theories  of  politics.  Whereas  early  rational  choice 
theories  sought  to  explain  political  phenomena  in 
terms  of  the  self-interested  choices  of  actors,  the 
notion  of  credible  commitment  tries  to  explain 
certain  political  phenomena  (central  banks,  inde¬ 
pendent  judges,  and  autonomous  bureaucrats)  as 
institutional  manifestations  of  the  need  to  deny 
rational  actors  opportunities  to  pursue  their  mor¬ 
ally  hazardous  short-term  incentives. 

Problems  of  credible  commitment  constitute  a 
challenge  to  democratic  theory  as  well.  While 
democratic  theorists  may  think  of  accountability 
to  elected  officials  as  the  sine  qua  non  of  constitu¬ 
tional  arrangements,  empowering  democratic 
forces  to  make  monetary  and  regulatory  decisions 
can  pose  a  threat.  An  alternative  constitutionalist 
perspective  maintains  the  necessity  of  dividing, 
checking,  and  ultimately  constraining  democratic 
institutions. 

Gary  J.  Miller 
Washington  University  in  St.  Louis 
St.  Louis,  Missouri,  United  States 
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Crisis 


At  first  sight,  political  crises  appear  as  sudden  and 
acute  disruptions  in  the  “normal”  working  of 
political  systems.  This  ordinary  perception  of 
political  crises  is  hereafter  taken  as  their  provi¬ 
sional  definition,  which  offers  at  least  the  conve¬ 
nience  of  illuminating  a  paradoxical  situation.  The 
history  of  our  societies  is  extremely  rich  in  periods 
or  events  that  may  serve  as  examples.  However, 
political  crises  have  only  belatedly  been  approached 
as  phenomena  deserving  to  be  systematically  ana¬ 
lyzed  and  explained  for  their  own  sake,  worthy  of 
being  elevated  to  the  status  of  explicit  research 
objects.  Indeed,  in  most  works  dealing  with  them, 
political  crises  have  been  approached  as  a  means 
of  understanding  or  explaining  something  else. 
This  orientation  of  the  scholar’s  interest  has  con¬ 
stituted  a  long-lasting  handicap  for  the  knowledge 
of  these  critical  phenomena.  The  main  root  of  this 
handicap  lies  in  the  common  fascination  with  out¬ 
comes  of  historical  processes,  especially  with  the 
outcomes  of  critical  processes,  and  in  the  auto¬ 
matic  assumption  that  these  outcomes  should 
constitute  the  explicanda  of  these  process  analy¬ 
ses.  This  entry  examines,  first,  how  fascination 
with  outcomes  pervades  most  of  the  approaches 
that  have  dominated  this  field  of  research;  second, 
the  perspectives  that  have  posited  crises  as  particu¬ 
lar  stages  on  the  paths  of  political  development 
and  the  ones  that  have  attempted  to  insert  into 
crisis  analysis  the  activities  and  choices  of  actors; 
and  third,  the  approaches  that  have  aimed  at  iden¬ 
tifying,  within  the  historical  paths  or  sequences, 
turning  points  or  branchings  that  are  supposed  to 
explain  their  course  toward  diverse  outcomes. 

It  is  precisely  the  departure  from  this  fascination 
with  outcomes  that  has  cleared  the  way  for  a  radical 


reorientation  of  the  analysis  of  political  crises.  The 
next  part  of  the  entry  focuses  on  the  crux  of  this 
reorientation,  which  consists  in  grasping  crises  as 
particular  states  of  the  structures  of  the  societies 
affected  by  these  crises.  This  perspective  not  only 
allows  us  to  account  for  a  considerable  number  of 
the  properties  of  crises  but  also  to  understand  why 
the  mundane  idea  that  analysis  and  explanation  of 
periods  perceived  as  abnormal  would  require  excep¬ 
tional  approaches  is  misleading.  The  final  section  of 
the  entry  comprises  a  brief  discussion  of  several 
implications  of  this  reorientation  regarding  some 
central  features  of  the  processes  of  crisis,  particu¬ 
larly  the  uncertainty  that  characterizes  such  pro¬ 
cesses,  the  frequent  emergence  of  charismatic  lead¬ 
ers,  the  question  of  the  alleged  prevalence  of  hidden 
arenas  over  what  is  at  stake  in  the  open,  and, 
finally,  the  question  of  the  autonomy  of  the  pro¬ 
cesses  of  crisis  from  their  etiology. 

The  Focus  on  Outcomes 

The  intellectual  attraction  of  the  outcomes  of  crises 
is  perfectly  exemplified  by  conceptualizations  of 
political  development  or  political  modernization. 
For  instance,  in  a  seminal  set  of  studies,  Leonard 
Binder  and  associates  have  distinguished  several 
types  of  crises  of  political  development  such  as 
identity,  legitimacy,  participation,  distribution, 
and  penetration  crises.  These  crises  have  been 
thought  of  as  functional  requisites  or  necessary 
conditions  for  the  emergence  of  a  developed  politi¬ 
cal  system.  Concurrently  with  their  teleological 
biases,  such  approaches,  as  Sydney  Verba  has 
noticed,  did  not  succeed  in  distinguishing  crises  as 
processes  of  “disruption”  of  the  ordinary  working 
of  political  systems  from  problems  that  political 
elites  had  to  solve  in  order  to  modernize  their 
country — even  though  some  of  these  problems 
have  occasionally  given  birth  to  conflicts  more 
closely  fitting  the  provisional  definition  of  crises. 
This  difficulty  is  heightened  in  those  approaches 
that  tried  to  link  specific  outcomes  (typically 
democracy)  with  their  preconditions  seized  through 
global  economic  or  social  indicators  (e.g.,  literacy 
or  urbanization  rates,  gross  national  product  per 
capita),  because  the  quest  for  statistical  correla¬ 
tions  between  these  indicators  and  their  assumed 
outcomes  excludes  from  the  research  agenda  all  the 
main  components  of  political  crises — the  activity 
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of  the  protagonists  as  well  as  what  shapes  or  con¬ 
strains  it. 

In  contrast,  other  approaches,  while  assigning 
themselves  the  same  explicanda  (the  outcomes  of 
crises),  have  attempted  to  rehabilitate  actors’  activ¬ 
ity  and  strategies.  Their  major  contribution  lies  in 
the  historical  description  of  the  elites’  or  leaders’ 
moves,  of  their  tactics  and  dilemmas  of  coalition 
building,  and  of  the  bargainings  among  the  “great” 
protagonists  of  the  events  scrutinized,  particularly 
the  bargainings  of  the  contested  regimes’  incum¬ 
bents  with  their  opponents.  Often  rich  in  insights, 
such  approaches  nonetheless  analyze  the  actors’ 
tactical  activity  with  the  aim  of  identifying,  for  each 
type  of  outcome,  a  specific  historical  path  or  trajec¬ 
tory — especially  a  distinctive  sequence  of  typical 
stages  (each  type  of  outcome  being  thus  character¬ 
ized  by  a  distinctive  path).  This  affects  works  deal¬ 
ing  with  the  establishment  of  democratic  systems 
and  with  the  crises,  collapses,  and  breakdowns 
these  systems  experienced  as  well  as  works  con¬ 
cerning  the  processes  that  resulted  in  various  types 
of  authoritarian  regimes  (particularly  in  the  inter¬ 
war  period  in  Europe)  and,  later,  in  the  transitions 
to  democracy.  The  intellectual  appeal  of  outcome  is 
easy  to  understand:  Once  it  has  occurred — but  only 
then — the  outcome  (the  seizure  of  power  by  Italian 
Fascists  or  the  survival  of  democracy  in  France 
after  the  May  1958  crisis)  matters;  it  often  weighs 
heavily  on  the  fate  of  individuals  and  societies.  The 
appeal  of  outcome  constitutes,  however,  a  huge 
trap.  First,  it  determines  the  scholar’s  modus  ope- 
randi — as  a  matter  of  fact  that  of  the  natural  his¬ 
tory  of  revolutions,  according  to  Crane  Brinton: 
The  aim  being  to  explain  the  actual  outcome,  it 
seems  self-evident  to  cut  off  the  historical  sequence 
through  a  regressive  analysis,  taking  as  a  starting 
point  the  outcome  itself;  the  selection  of  facts  then 
operates  so  as  to  make  them  converge  toward  the 
outcome  that  occurred.  Second,  a  heavy  historicist 
bias  informs  such  reasoning,  leading  the  scholar  to 
ignore  de  facto  the  contingency  of  the  crises’  out¬ 
comes  and  also  implying  that  outcomes  reflect  or 
reveal  the  nature  or  essence  of  the  critical  processes 
or  events  that  brought  them  about. 

Many  scholars  have  thought  that  they  could 
avoid  these  pitfalls  by  introducing  into  historical 
paths  some  branchings ,  or  moments  when  “history 
hesitates” — that  is,  by  proposing  a  treelike  image 
of  the  course  of  history.  In  an  influential  essay  on 


the  breakdowns  of  democratic  regimes,  Juan  Finz 
has  thus  described  the  supposedly  characteristic 
path  of  this  particular  set  of  outcomes  by  propos¬ 
ing  roughly  the  following  sequence:  emergence  of 
unsolvable  problems/loss  of  incumbents’  power/ 
power  vacuum/transfer  of  power  to  the  antidemo¬ 
cratic  opposition,  civil  war,  and  re-equilibration  of 
the  democratic  system — here  lies  the  crucial 
branching  point.  Remarkably  enough,  that  way  of 
thinking  about  historical  branchings  permeates 
works  referring  to  the  most  different  sociological 
traditions,  such  as,  prior  to  Finz’s  essay,  the  struc¬ 
tural  Marxist  analysis  that  Nicos  Poulantzas 
devoted  to  the  historical  paths  that  led  to  the  fas¬ 
cist  regimes  in  Italy  and  Germany  (the  author 
combines  the  branchings  with  the  idea,  rather 
implicit  in  other  works,  that,  at  some  stage,  the 
processes  under  study  reach  points  of  no  return 
and  are  conceived  of  as  reversible  before  that 
point).  Unfortunately,  the  remedy — the  introduc¬ 
tion  of  branchings — proves  insufficient;  it  even 
leads  to  a  dead  end.  Although  these  approaches 
claim  to  emphasize  the  role  of  political  leaders  and 
their  decisions,  they  retain  the  hard  core  of  histori¬ 
cist  reasoning.  They  link  each  type  of  outcome — 
whether  the  process  led  to  the  cooptation  of  dis¬ 
loyal  opposition,  to  civil  war,  and  so  on — with 
one,  and  only  one ,  historical  path  supposed  to  be 
characteristic  of  it  and  supposed  to  be  different 
from  the  paths  or  sequences  ascribed  to  processes 
that  resulted  in  other  types  of  outcomes.  We  shall 
go  back  to  this  question  again  below  from  a  differ¬ 
ent  viewpoint.  Nevertheless,  it  is  far  more  mean¬ 
ingful  that,  commenting  on  their  own  analysis  of 
transitions  to  democracy,  Guillermo  O’Donnell 
and  Philipp  Schmitter — who  took  Finz’s  sequence 
of  stages,  supposedly  typical  of  the  breakdown  of 
democracy,  as  an  explicit  model  of  explanation — 
acknowledged  their  failure  to  uncover  a  distinctive 
historical  path  proper  to  transitions.  But  this  fail¬ 
ure  itself  constitutes  an  interesting  result  of  their 
work — probably  the  most  important  one.  This 
observation  actually  applies  to  all  the  approaches 
mentioned  previously. 

Crisis  as  Structural  Transformation 

There  is  only  one  way  to  elude  what  is  merely  a 
particular  kind  of  belief  in  laws  of  history  (laws  of 
historical  development):  namely,  to  bracket  off,  at 
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least  temporarily,  the  outcomes  of  crises  and,  more 
broadly,  the  outcomes  of  critical  processes  and  to 
break  with  the  idea  that  to  account  for  these  phe¬ 
nomena  consists  in  explaining  their  outcomes — 
that  is,  to  choose  a  different  puzzle  to  solve.  But 
what  is  at  stake  in  this  move  is  not  only  the  elimi¬ 
nation  of  historicist  biases;  it  is  above  all  to  iden¬ 
tify  and  explain  what  the  crises  are  made  of  and 
what  happens  during  crises.  Everyone  today  agrees 
on  the  idea  that  matter — physical  matter — exists  in 
different  states:  solid,  gaseous,  or  liquid.  For  the 
theory  of  fluid  conjunctures  outlined  by  Michel 
Dobry,  social  structures  and,  more  broadly,  con¬ 
figurations  of  institutions  and  social  relations, 
even  when  deeply  objectified,  are  not  necessarily 
more  structurally  stable  than  is  matter.  They  can 
undergo  short-term  transformations  of  their  state, 
and  they  can  experience  different  states  (which 
gain  from  being  termed  conjunctures ,  as  particular 
states  of  social  structures).  This  perspective  thus 
opens  the  possibility  of  bringing  to  light  the  plas¬ 
ticity  of  structures  and  their  sensitivity  to  actors’ 
mobilizations. 

All  this  must  be  specified.  First,  the  theory  of 
fluid  conjunctures  applies  exclusively  to  critical 
conjunctures  peculiar  to  complex  social  systems. 
These  systems  are  defined  by  their  high  internal 
differentiation  into  a  multiplicity  of  social  sectors, 
spheres,  or  fields,  relatively  autonomous  from  each 
other,  strongly  institutionalized,  tending  toward 
self-reference,  and  each  of  them  endowed  with  its 
own  social  logic  (a  set  of  constraints,  resources, 
stakes,  pragmatic  and  normative  rules  of  the  game, 
etc.),  a  logic  that  weighs  heavily,  in  routine  con¬ 
junctures,  on  the  perceptions,  calculations,  and 
tactical  activities  of  actors.  This  means  that  within 
these  systems — which  are  not  necessarily  demo¬ 
cratic  ones — ordinary  politics  operates  as  a  rela¬ 
tively  autonomous  sector  or  field,  with  its  own 
specific  logic.  Second,  many  of  the  empirical  regu¬ 
larities  of  the  political  crises  that  affect  these  sys¬ 
tems  have  been  already  noticed  or  observed  by 
their  actors  or  by  scholars.  The  point  is  that  these 
regularities  can  be  fully  explained  when  thought  of 
as  a  process  that  alters,  in  a  short  term,  the  struc¬ 
tural  features  of  these  systems,  which,  in  the  first 
place,  is  their  particular  form  of  differentiation — 
that  is,  more  precisely,  when  thought  of  as  a  pro¬ 
cess,  or  as  a  dynamics,  of  desectorization  of  these 
systems.  Crucially,  this  dynamics — which  occurs 


particularly  under  the  impact  of  multisectoral 
mobilizations  (those  that  spread,  on  a  competitive 
mode,  in  several  differentiated  social  sectors) — is  a 
process  that  accounts  also  for  a  lot  of  regularities 
previously  poorly  identified  or  unidentified. 

This  is  the  main  reason  why,  to  understand 
political  crises  or,  better,  fluid  conjunctures,  the 
major  features  of  this  desectorization  dynamics 
must  qualify — or  even  substitute  for — the  provi¬ 
sional  definition  sketched  above.  Desectorization 
first  involves  a  manifest  reduction,  and  sometimes 
the  complete  collapse,  of  the  autonomy  of  the  dif¬ 
ferent  sectors;  a  visible  fading  or  crumbling  of 
sectoral  frontiers;  and  a  rapidly  growing  permea¬ 
bility  to  tactical  moves  and  to  stakes  external  to 
sectoral  social  logics,  notably  external  to  those  of 
the  political  sectors  or  fields.  Among  other  fea¬ 
tures,  this  results  in  a  dramatic  and  unexpected 
mobility  of  stakes — puzzling  for  protagonists  of 
crises  as  for  observers — sudden  disruptions  of  sec¬ 
toral  institutionalized  temporalities  or  rhythms, 
rapid  alterations  of  the  ordinary  division  of  politi¬ 
cal  labor,  and  the  no  less  abrupt  outbursts  of  pro¬ 
cesses  of  de-objectification — the  loss  of  the  natu¬ 
ral,  taken-for-granted  dimension  that  is  also  the 
legitimacy — of  the  social  reality  of  institutions  and 
collective  bodies. 

Further,  the  effects  of  this  conjunctural  desec¬ 
torization  can  be  fully  checked  when  looking  at  the 
perceptions,  anticipations,  tactical  dilemmas,  cal¬ 
culations,  and  moves  of  actors.  This  deserves, 
however,  a  detour  by  one  delicate  question:  the 
belief  that  abnormal  or  exceptional  periods  require 
an  exceptional  approach,  different  from  the  view¬ 
point  of  “normal”  social  sciences.  Fet  us  revert  to 
the  historical  branchings  discussed  previously:  In 
most  works,  the  introduction  of  such  branchings 
cumulates  the  effects  of  historicism  and  the  effects, 
a  priori  logically  incompatible,  of  methodological 
exceptionalism.  Ultimately,  justified  by  the  alleged 
abnormal — in  olden  times  called  “pathological” — 
characteristic  of  crisis  phenomena,  this  exception¬ 
alism  takes  a  classical  form.  On  the  one  hand, 
periods  of  political  stability,  or  political  “equilib¬ 
rium,”  would  fall  within  the  scope  of  so-called 
normal,  that  is,  structural  (whatever  meaning  one 
gives  to  this  notion),  and  deterministic  approaches. 
On  the  other  hand,  periods  of  crisis  (or  “transi¬ 
tion,”  “revolution,”  etc.)  would  require,  by  their 
very  nature,  to  be  explained  in  terms  of  choice,  of 
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decision  making,  or  of  strategic  calculation.  Such 
an  exceptionalism,  seemingly  taken  for  granted,  is 
constitutive  of  the  heroic  fallacy.  First,  it  leads  the 
scholar  to  abandon  any  sustained  interest  in  struc¬ 
tures  and  for  what  they  become  during  critical 
periods.  Second,  it  conveys  a  conception  of  actors’ 
choices  or  decisions  as,  at  a  minimum,  underdeter¬ 
mined  and,  most  often,  undetermine — that  is,  free 
from  the  causalities  that  “normal”  social  sciences 
attempt  to  establish.  It  is  noteworthy  that  this  fal¬ 
lacy  is  not  confined  to  scholars:  It  replicates  the 
ordinary  point  of  view  of  practitioners  of  modern 
revolutions  (e.g.,  Lenin  or  Trotsky)  who,  to  cope 
with  revolutionary  situations,  oppose  the  catego¬ 
ries  of  objective  factors  and  subjective  factors  of 
revolutions  and  conceive  of  situations  such  as  the 
exclusive  realm  of  subjective  factors.  All  these 
remarks  entirely  apply  to  the  standard  conception 
of  critical  junctures:  If  these  junctures  are  sup¬ 
posed  to  constitute  the  starting  points  of  path 
dependency  processes,  they  are  usually  thought  of 
as  moments  of  choices,  choices  freed  from  any 
determination  or  structural  constraint;  that  is 
why  critical  junctures — which  can  also,  as  many 
authors  forget  and  as  Paul  Pierson  reminds  us, 
correspond  to  events  in  no  way  critical — must  not 
be  confused  with  fluid  or  critical  conjunctures. 
The  heroic  fallacy  presumably  is  an  illusion  nec¬ 
essary  to  the  practice  of  actors.  In  terms  of 
knowledge,  it  leads  to  losing  at  once  on  several 
counts:  first,  by  ignoring  what  structures  become 
in  periods  of  crisis  and,  second,  because  it 
excludes  any  possibility  of  identifying  and 
explaining  not  only  the  factors  shaping  actors’ 
perceptions  and  actions  but  also  the  differences 
distinguishing  critical  conjunctures  from  routine 
ones.  This  has  an  important  consequence:  We 
have  to  refuse  to  change  approaches  when  switch¬ 
ing  from  the  analysis  of  normal  periods  to  the 
analysis  of  abnormal  or  exceptional  periods.  In 
other  words,  the  understanding  of  political  crises 
calls  for  a  methodological  normalization  of  these 
phenomena  (i.e.,  for  a  principle  of  methodologi¬ 
cal  continuity). 

The  question  especially  matters  when  confront¬ 
ing  the  uncertainty  that  the  actors  of  crises  have  to 
deal  with.  This  uncertainty  does  not  boil  down  to 
a  narrow  uncertainty  on  the  outcome  of  crises  or 
on  the  postcrisis  course  of  possible  ensuing  path 
dependency  processes — this  is  only  a  marginal  part 


of  it.  Uncertainty,  rather,  is  an  overwhelming,  mul¬ 
tiform,  and  pervasive  component  of  the  situa¬ 
tional  constraints  stemming  from  the  desectoriza- 
tion  of  social  space,  which  affects  all  the  protago¬ 
nists  of  crises.  The  specific  social  logics  of  the 
various  differentiated  social  sectors,  spheres,  or 
fields  cease  to  be  the  reference  and  the  material  of 
actors’  calculations,  assessments  of  varied  lines  of 
action’s  efficiency,  and  definitions  of  situations. 
The  often  dramatic  collapse  of  the  sectoral  logics’ 
hold  on  actors’  calculations  goes  hand  in  hand 
with  the  emergence  of  a  widened  form  of  interde¬ 
pendence  that  tends  to  replace  social  games — par¬ 
ticularly  political  games — that,  in  routine  con¬ 
junctures,  are  characterized  by  more  local,  sector¬ 
ized,  and  compartmentalized  forms  of  interdepen¬ 
dence.  This  widened  interdependence  not  only 
tends  to  introduce  into  actors’  calculations 
resources,  actors,  and  stakes  that  are  external  to 
the  ordinary  logics  of  political  sectors  but  also 
deprives  actors  of  their  routine  means  or  tools  of 
calculation  and  anticipation  and  their  routine  ref¬ 
erences,  clues,  markers,  pragmatic  rules  of  the 
game,  and  so  forth.  Widened  interdependence  is 
the  mainspring  of  the  sudden  and  unpredictable 
variations  of  the  efficiency,  or  value,  of  various 
lines  of  action  and  resources.  It  simultaneously 
results  in  making  it  difficult  and  often  very  costly 
for  actors  to  access  information  on  that  efficiency 
as  well  as  information  on  orientations  of  other 
actors  and,  regarding  collective  actors,  on  their 
very  consistency  and  identities  (Who  exactly  are 
my  allies  today?  Who  are  the  real  hardliners?  Are 
the  moderates  still  moderates?).  That  is  why  it  is 
relevant  to  term  this  uncertainty  structural  uncer¬ 
tainty,  in  its  strong  meaning.  This  entails  that 
conjunctures  of  crisis  are  no  less  constraining  for 
actors  than  are  normal  situations;  actors  are  no 
less  rational,  but  they  are  compelled  to  manage  to 
calculate,  anticipate,  and  orient  themselves  in 
other  ways.  For  instance,  from  this  derives,  in 
these  conjunctures,  the  special  and  growing  attrac¬ 
tion  of  situational  focal  points. 

Implications  for  the  Study  of  Crises 

All  this  has  multiple  and  varied  implications.  It 
first  induces  a  reconsideration  of  several  classical 
topoi  of  political  sociology.  Two  of  them  can  illus¬ 
trate  the  point  fairly  well: 
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1.  The  Weberian  theory  of  charismatic  leader¬ 
ship  in  critical  periods  stresses  the  personal  quali¬ 
ties,  words,  claims,  and  deeds  of  the  leader; 
however,  as  in  the  case  of  the  French  May  1958 
crisis,  this  conception  should  be  questioned: 
Actually,  the  charismatic  leadership  attributed  to 
General  de  Gaulle  stems  primarily  from  the  fact 
that  he  represented  a  strictly  situational  focal  point 
for  diverse  opposing  actors,  most  of  whom  were 
not  his  proponents. 

2.  Delegitimization  processes  in  political  crises 
are  traditionally  conceived  as  necessary  precondi¬ 
tions  of  crises,  but  the  desectorization  dynamic 
peculiar  to  fluid  conjunctures  (and  the  resulting 
de-objectification,  or  de-reification,  of  institutions) 
prompts  us  to  think  of  delegitimization  processes 
also,  and  often  primarily,  as  by-products  of  desec¬ 
torization  dynamics,  and  to  observe  that  crises 
may  occur  without  any  prior  delegitimization  of 
authorities  or  regimes. 

Second,  it  permits  accounting  for  usually  misin¬ 
terpreted  empirical  phenomena — for  example,  the 
idea  that  the  hidden  overdetermines  and  the  visible 
informs  many  analyses  of  critical  processes.  In  this 
way,  while  somewhat  reticent  about  conspiracy 
theories,  Linz  nevertheless  asserts  that  in  one  of  the 
crucial  stages  of  the  breakdown  of  democratic  pro¬ 
cesses,  we  should  observe  a  narrowing  of  the 
political  arena,  marked  by  the  regime  leaders’ 
opening  up  to  the  disloyal  opposition  (e.g., 
Mussolini  in  the  Italian  case),  the  supposedly  deci¬ 
sive  role  of  secret  bargaining,  and  the  replacement 
of  the  parliamentary  arena  by  a  hidden  one,  acces¬ 
sible  only  to  a  very  restricted  number  of  individu¬ 
als.  Although  in  such  contexts — as  in  many  others, 
often  more  ordinary  ones — actors  may  be  tempted 
by  secret  negotiations  and  moves,  we  hardly  find 
this  kind  of  overdetermination  of  the  visible  or  the 
open  by  the  secret.  Since  these  critical  contexts  are 
situations  of  widened  interdependence,  hidden  are¬ 
nas,  restricted  groups,  conspiracies,  and  covert 
agreements  per  se  can  never  benefit  from  any 
causal  or  ontological  superiority  over  visible  moves 
and  events  and,  more  specifically,  over  the  expres¬ 
sive,  nonintentional  information  that  visible  moves 
carry  and  that  focus  all  actors’  attention,  whether 
they  like  it  or  not.  The  March  on  Rome,  seen  by 
some  historians  as  an  inefficient  theatrical  staging, 
nonetheless  constrained,  at  that  moment,  the 


anticipations,  calculations,  and  tactics  of  all  pro¬ 
tagonists,  even  those  operating  undercover.  More¬ 
over,  Linz  rightly  points  out  the  coming  into  play, 
during  that  stage,  of  actors  external  to  the  political 
field,  such  as  the  church,  the  unions,  and  the  army 
or  high-ranking  civil  servants;  obviously,  all  of  this 
is  much  more  consistent  with  the  desectorization 
perspective  than  with  the  narrowing  hypothesis. 

There  remains  one  last  implication  to  consider. 
According  to  Carl  von  Clausewitz,  if  war  is  the 
continuation  of  political  relations  by  other  means, 
it  nevertheless  has  the  property,  once  produced  by 
politics,  of  depending  also  on  its  own  dynamics  or 
its  own  “grammar,”  which  is  the  rise  to  the 
extremes  of  violence,  whatever  the  ends  or  will  of 
actors.  The  reader  already  knows  that  the  dynam¬ 
ics  of  political  crises,  despite  the  imagery  of  the 
intensification  of  conflict  and  of  the  rise  in  the  scale 
of  violence  haunting  many  works,  is  not  that  of 
wars.  And  yet,  like  wars,  political  crises  tend  to 
escape,  to  pull  themselves  away  from  their  condi¬ 
tions  of  genesis,  but  because  of  their  own  distinc¬ 
tive  dynamics,  that  is,  that  of  desectorization.  The 
autonomy  of  critical  processes  from  their  causes  or 
conditions  of  emergence  is  not  innocuous:  It  further 
undermines  the  premises  and  causal  imagery  of 
historicist  approaches  and  prompts  a  critical  reflec¬ 
tion  on  what  the  social  sciences  have  often  expected, 
and  still  expect,  from  the  unveiling  of  the  historical 
conditions  of  emergence  of  political  crises  and 
critical  processes — particularly  the  belief  that  the 
ultimate  sociological  truth  of  these  phenomena 
exclusively  or  primarily  lies  in  the  factors  that 
engendered  them.  It  is  easy  to  guess  that  the  target 
of  this  criticism  is  not  causality  as  such  but  rather 
some  pedestrian  forms  of  misplaced  determinism. 
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Crisis  Management 


We  speak  of  crisis  when  a  community  of  people — 
an  organization,  a  town,  or  a  nation — perceives 
an  urgent  threat  to  core  values  or  life-sustaining 
functions,  which  must  be  urgently  dealt  with 
under  conditions  of  deep  uncertainty.  Crisis  man¬ 
agement  pertains  to  all  activities  addressing  that 
threat  and  aimed  at  minimizing  its  consequences. 

Crisis  and  the  Modern  Society 

Public  authorities  face  a  variety  of  crises,  such  as 
natural  disasters  and  war,  financial  meltdowns 


and  terrorist  attacks,  epidemics  and  explosions, 
and  infrastructural  dramas  and  failures  of  infor¬ 
mation  and  communications  technology  (ICT). 
Crises  are  not  routine  events  (e.g.,  fires  or  traffic 
accidents).  Crises  are  inconceivable  events  that 
often  take  politicians,  citizens,  and  the  media  by 
complete  surprise. 

Crises  create  tough  challenges  for  public  author¬ 
ities  and  their  organizations.  Citizens  expect  public 
authorities  to  safeguard  them  from  the  threat, 
make  critical  decisions,  and  implement  them — 
under  considerable  time  pressure  and  in  the  absence 
of  essential  information  about  causes  and  conse¬ 
quences.  Two  factors  make  it  increasingly  hard  to 
meet  these  challenges. 

First,  the  very  qualities  that  increase  welfare 
and  drive  progress  in  modern  societies  make  them 
vulnerable  to  crises.  Modern  society  has  become 
increasingly  complex  and  integrated.  Complexity 
makes  it  hard  to  fully  understand  emerging  vul¬ 
nerabilities,  which,  as  a  result,  can  go  long  unrec¬ 
ognized;  attempts  to  deal  with  them  often  produce 
unintended  consequences  (fueling  rather  than 
dampening  the  crisis).  Tight  coupling  between  a 
system’s  component  parts  and  those  of  other  sys¬ 
tems  facilitates  the  rapid  proliferation  of  distur¬ 
bances  throughout  the  system.  Crises  may  thus 
have  their  roots  far  away  (in  a  geographical 
sense),  but  they  can  rapidly  snowball  through 
global  networks,  jumping  from  one  system  to 
another  and  gathering  destructive  potential  along 
the  way. 

All  this  makes  it  very  hard  to  recognize  a  crisis 
before  its  consequences  materialize.  When  a  crisis 
begins  to  unfold,  policymakers  often  do  not  see 
anything  out  of  the  ordinary.  Hidden  interactions 
eat  away  at  the  pillars  of  the  system,  but  it  is  only 
when  the  crisis  is  in  full  swing  that  policymakers 
can  recognize  it  for  what  it  is.  Once  a  crisis  has 
escalated  into  view,  authorities  can  only  try  to 
minimize  its  consequences. 

Second,  the  contested  nature  of  crisis  compli¬ 
cates  the  situation.  A  crisis  rarely,  if  ever,  “speaks 
for  itself.  ”  The  definition  of  a  situation  is,  as  social 
scientists  say,  the  outcome  of  a  subjective  process 
and  is  continuously  subjected  to  the  forces  of 
politicization.  One  person’s  crisis  is  another’s 
opportunity.  For  public  authorities,  this  spells 
trouble:  Many  seemingly  innocent  events  can  be 
transformed  into  crises.  Western  citizens  have 


Crisis  Management  495 


grown  impatient  with  imperfections;  they  have 
come  to  fear  glitches,  and  they  see  more  of  what 
they  fear.  In  this  culture  of  fear — sometimes 
referred  to  as  the  “risk  society” — the  modern  mass 
media  play  an  amplifying  role  (Ulrick  Beck,  1992). 

Challenges  of  Crisis  Management 

Crisis  management  has  two  dimensions.  The 
technical-administrative  dimension  pertains  to  the 
coping  capacity  of  governmental  institutions  and 
public  policies  in  the  face  of  emerging  threats.  But 
there  is  also  a  political  dimension:  Crisis  manage¬ 
ment  is  a  deeply  controversial  and  intensely  politi¬ 
cal  activity.  A  combination  of  these  dimensions 
translates  into  what  Arjen  Boin,  Paul  ‘t  Hart,  Eric 
Stern,  and  Bengt  Sundelius  have  identified  as  the 
five  critical  challenges  of  crisis  management:  sense 
making,  decision  making,  meaning  making,  termi¬ 
nating,  and  learning. 

Sense  Making 

Crises  seem  to  pose  a  straightforward  challenge: 
Once  a  crisis  becomes  manifest,  crisis  managers 
must  take  measures  to  deal  with  its  consequences. 
Reality  is  much  more  complex,  however.  Most 
crises  do  not  materialize  with  a  big  bang. 
Policymakers  must  recognize  from  vague  and  con¬ 
tradictory  signals  that  something  out  of  the  ordi¬ 
nary  is  developing.  They  must  appraise  the  threat 
and  decide  what  the  crisis  is  about. 

Crisis  managers  often  find  it  hard  to  meet  this 
challenge.  The  bewildering  pace,  ambiguity,  and 
complexity  of  a  crisis  can  easily  overwhelm  normal 
modes  of  situation  assessment.  Stress  may  further 
impair  sense-making  abilities.  Organizational 
pathologies  can  produce  additional  barriers  to  cri¬ 
sis  recognition. 

Some  categories  of  people  are  known  for  their 
ability  to  stay  clearheaded  under  pressure.  They 
have  developed  a  mode  of  information  processing 
that  enables  competent  performance  under  crisis 
conditions.  Veteran  military  officers,  journalists, 
and  fire  and  police  commanders  are  known  for 
this.  Some  organizations  have  developed  a  proac¬ 
tive  culture  of  “looking  for  problems”  in  their 
environment.  These  so-called  high-reliability  orga¬ 
nizations  have  somehow  developed  a  capacity  for 
thorough  yet  fast-paced  information  processing 


under  stressful  conditions.  The  unresolved  ques¬ 
tion,  as  noted  by  Karl  Weick  and  Kathleen  Sutcliffe, 
is  whether  organizations  can  design  these  features 
into  existing  organizational  cultures. 

Making  Critical  Decisions 

During  a  crisis,  critical  decisions  must  be  made. 
Scarce  resources  may  have  to  be  prioritized.  This  is 
much  like  politics  as  usual,  except  that  in  crisis 
circumstances  the  disparities  between  demand  and 
supply  of  public  resources  are  much  bigger,  the 
situation  remains  unclear  and  volatile,  and  the 
time  to  think,  consult,  and  gain  acceptance  for 
decisions  is  restricted.  Crises  confront  govern¬ 
ments  and  leaders  with  issues  they  do  not  face  on 
a  daily  basis,  for  example,  concerning  the  deploy¬ 
ment  of  the  military,  the  use  of  lethal  force,  or  the 
radical  restriction  of  civil  liberties.  Crisis  decision 
making  is  making  hard  calls,  which  involve  tough 
value  trade-offs  and  major  political  risks. 

An  effective  response  to  crisis  requires  coordi¬ 
nation.  After  all,  each  decision  must  be  imple¬ 
mented  by  a  variety  of  organizations;  effective 
implementation  requires  that  these  organizations 
work  together.  Getting  public  bureaucracies  to 
adapt  to  crisis  circumstances  is  a  daunting — some 
say  impossible — task.  Most  public  organizations 
were  originally  designed  to  conduct  routine  busi¬ 
ness  in  accordance  with  values  such  as  fairness, 
lawfulness,  and  efficiency.  The  management  of 
crisis,  however,  requires  flexibility,  improvisation, 
redundancy,  and  the  breaking  of  rules. 

Coordination  is  not  a  self-evident  feature  of 
crisis  management  operations.  The  question  of 
who  is  in  charge  typically  arouses  great  passions. 
In  disaster  studies,  the  “battle  of  the  Samaritans” 
is  a  well-documented  phenomenon:  Agencies  rep¬ 
resenting  different  technologies  of  crisis  manage¬ 
ment  find  it  difficult  to  align  their  actions. 
Moreover,  a  crisis  does  not  make  the  sensitivities 
and  conflicts  that  governed  the  daily  relations 
between  authorities  and  others  before  the  crisis 
disappear. 

An  effective  crisis  response  is  to  a  large  extent 
the  result  of  a  naturally  evolving  process.  It  cannot 
be  managed  in  a  linear,  step-by-step,  and  compre¬ 
hensive  fashion  from  a  single  crisis  center.  There 
are  simply  too  many  hurdles  that  separate  a  critical 
decision  from  its  timely  execution  in  the  field. 
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Meaning  Making 

In  a  crisis,  leaders  are  expected  to  reduce  uncer¬ 
tainty  and  provide  an  authoritative  account  of 
what  is  going  on,  why  it  is  happening,  and  what 
needs  to  be  done.  Leaders  must  get  others  to  accept 
their  definition  of  the  situation.  If  they  are  not  suc¬ 
cessful,  their  decisions  may  not  be  understood  or 
accepted. 

Public  leaders  are  not  the  only  ones  trying  to 
frame  the  crisis.  Their  messages  coincide  and  com¬ 
pete  with  those  of  other  parties,  who  hold  other 
positions  and  interests  and  who  are  likely  to 
espouse  various  alternative  definitions  of  the  situ¬ 
ation  and  advocate  different  courses  of  action.  If 
other  actors  succeed  in  dominating  the  meaning¬ 
making  process,  the  ability  of  incumbent  leaders  to 
decide  and  maneuver  is  severely  constrained. 

It  is  often  difficult  for  authorities  to  provide  cor¬ 
rect  information  right  away.  They  struggle  with 
the  mountains  of  raw  data  (reports,  rumors,  and 
pictures)  that  are  quickly  amassed  when  something 
extraordinary  happens.  Turning  these  data  into  a 
coherent  picture  of  the  situation  is  a  challenge. 
Getting  it  out  to  the  public  in  the  form  of  accurate, 
clear,  and  actionable  information  requires  a  major 
public  relations  effort.  This  effort  is  often  hindered 
by  the  aroused  state  of  the  audience:  People  whose 
lives  are  deeply  affected  tend  to  be  anxious  and 
stressed  (distressed).  Moreover,  they  do  not  neces¬ 
sarily  see  the  government  as  their  ally. 

Terminating  a  Crisis 

Crisis  termination  is  twofold.  It  is  about  shifting 
back  from  emergency  to  routine  mode.  This 
requires  some  form  of  downsizing  of  crisis  opera¬ 
tions.  At  the  strategic  level,  it  also  requires  render¬ 
ing  an  account  of  what  has  happened  and  gaining 
acceptance  for  it.  These  two  aspects  of  crisis  termi¬ 
nation  are  distinct,  but  in  practice,  they  are  often 
closely  intertwined.  The  system  of  governance — its 
rules,  its  organizations,  and  its  power  holders — 
has  to  be  (re)stabilized;  it  must  regain  the  neces¬ 
sary  legitimacy  to  perform  its  usual  functions. 
Leaders  generally  cannot  bring  this  about  by  uni¬ 
lateral  decree,  even  if  they  possess  the  formal  man¬ 
date  to  terminate  crises  in  a  legal  sense.  Formal 
termination  gestures  can  follow  but  never  lead  the 
mood  of  a  community.  Premature  closure  may 
even  backfire:  Allegations  of  underestimation  and 


cover-up  are  quick  to  emerge  in  an  opinion  climate 
that  is  still  on  edge. 

Accountability  debates  can  easily  degenerate 
into  blame  games.  Crisis  leaders  can  be  competent 
and  conscientious,  but  that  alone  says  little  about 
how  their  performance  will  be  evaluated  when  the 
crisis  is  over.  Policymakers  and  agencies  that  failed 
to  perform  their  duties  prior  to  or  during  the  criti¬ 
cal  stages  may  manage  the  crisis  aftermath  well, 
thus  preventing  losses  to  their  reputation,  auton¬ 
omy,  and  resources.  Crises  have  winners  and  los¬ 
ers.  The  political  (and  legal)  dynamics  of  the 
accountability  process  determines  which  crisis 
actors  end  up  where. 

Learning 

A  crisis  offers  a  reservoir  of  potential  lessons  for 
contingency  planning  and  training  for  future  cri¬ 
ses.  One  would  expect  all  those  involved  to  study 
these  lessons  and  feed  them  back  into  organiza¬ 
tional  practices,  policies,  and  laws.  This  does  not 
always  happen,  however.  Lesson  drawing  is  one  of 
the  most  underdeveloped  aspects  of  crisis  manage¬ 
ment.  In  addition  to  cognitive  and  institutional 
barriers  to  learning,  lesson  drawing  is  constrained 
by  the  role  of  these  lessons  in  determining  the 
impact  that  crises  have  on  a  society. 

The  depiction  of  a  crisis  as  a  product  of  preven¬ 
tion  and  foresight  failures  would  force  people  to 
rethink  the  assumptions  on  which  preexisting  poli¬ 
cies  and  rule  systems  rested.  Other  stakeholders 
might  seize  on  the  lessons  to  advocate  measures 
and  policy  reforms  that  incumbent  leaders  reject. 
Leaders  thus  have  a  large  stake  in  steering  the  les¬ 
son-drawing  process  in  the  political  and  bureau¬ 
cratic  arenas. 

Arjen  Boin 
Louisiana  State  University 
Baton  Rouge,  Louisiana,  United  States 
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Critical  Theory 


Today,  the  term  critical  theory  refers  to  a  broad 
variety  of  theoretical  approaches  in  social  and 
cultural  studies  as  well  as  social  philosophy. 
Originally,  critical  theory  was  the  programmatic 
name  of  a  German  group  of  philosophers  and 
social  scientists  who,  in  1930,  began  to  work 
together  under  the  directorship  of  Max  Hork- 
heimer  at  the  Frankfurt  Institute  of  Social  Research 
and  who  continued  their  work  after  emigrating  to 
the  United  States  in  the  early  1930s.  This  group 
has  also  been  labeled  the  “Frankfurt  School”  and 
is  the  subject  of  this  entry. 

From  1932  until  1941,  the  group  published 
Zeitschrift  fiir  Sozialforschung  (Journal  for  Social 
Research).  After  World  War  II,  Horkheimer 
returned  to  Frankfurt  and  together  with  Theodor 
Adorno,  who  during  the  1940s  had  become  his 
most  important  theoretical  companion,  reestab¬ 
lished  the  Frankfurt  Institute.  Next  to  Adorno, 
among  the  important  members  of  the  first  genera¬ 
tion,  are  the  philosophers  Herbert  Marcuse  and 
Walter  Benjamin,  the  economist  Friedrich  Pollock, 
the  psychologist  Erich  Fromm,  and  the  sociologist 
Leo  Loewenthal. 

With  Jurgen  Habermas,  who  in  1966  became 
Adorno’s  successor  at  the  Frankfurt  Institute,  not 
only  did  a  new  generation  take  over,  but  a  theo¬ 
retical  shift  within  the  Frankfurt  School  appeared 
and  soon  became  internationally  recognized  in 
political  science  and  political  theory  in  addition  to 
philosophy.  Nevertheless,  Habermas  in  his  social 
theory  continued  the  critiques  of  “late  capitalism” 
initiated  by  his  predecessors.  Today,  some  speak 
about  a  “third  generation,”  referring  to  authors 


like  Seyla  Benhabib,  Nancy  Fraser,  or  Axel 
Honneth  or  focus  primarily  on  Honneth  as  the 
successor  to  Habermas’s  chair  and  directorship  of 
the  Frankfurt  Institute.  However,  the  international 
diffusion  and  pluralization  of  the  original  ideas  of 
critical  theory  renders  questionable  any  claim  to 
continuity  of  critical  theory  by  intergenerational 
transmission.  Compared  with  the  first  generation 
and  Habermas,  no  such  coherent  school  or  theory 
exists  any  longer.  The  term  critical  theory  today 
lacks  a  single  specific  meaning,  and  critical  theory 
as  a  specific  approach  has,  arguably,  found  its  his¬ 
torical  fulfillment  with  the  life  work  of  Habermas. 

Origins  and  Early  Development 

The  origin  of  critical  theory  has  to  be  understood 
in  the  context  of  the  deep  political  and  theoretical 
crisis  of  the  Marxist  Second  International  during 
and  after  World  War  I,  the  Bolshevik  Revolution, 
and  the  founding  of  the  Soviet  Union  under  V.  I. 
Lenin.  Perry  Anderson  has  coined  the  term  Western 
Marxism  to  summarize  the  different  theoretical 
approaches  of  Georg  Lukacs,  Karl  Korsch,  and 
Antonio  Gramsci  among  others,  who  during  the 
1920s  reacted  to  this  crisis  by  creating  their  own 
rereadings  of  Marxism.  In  the  late  1920s,  this 
Marxist  revisionism  came  to  be  strongly  influ¬ 
enced  by  the  first  publication  of  Marx’s  early  so- 
called  humanistic  writings.  In  all  their  variety, 
these  new  approaches  were,  on  the  one  hand, 
opposed  to  the  deterministic  interpretation  of  his¬ 
tory  by  “Marxism”  (historical  materialism)  and, 
on  the  other,  critical  of  the  political  instrumental- 
ization  of  the  Marxist  approach  as  a  mere  power 
ideology  in  what  soon  came  to  be  called  “Marxism- 
Leninism.” 

When  Horkheimer  delivered  his  famous  inaugu¬ 
ral  speech  as  director  of  the  Institute  in  1932,  he 
introduced  a  research  program  that  has  been  called 
“interdisciplinary  materialism”  (Wolfgang  Bonis  & 
Norbert  Schindler,  1982).  Its  goal  was  to  integrate, 
in  a  new  manner,  the  Hegel-Marx  tradition  of  phi¬ 
losophy  of  history  with  empirical  social  research  in 
various  disciplines  into  a  new  paradigm  of  practical 
philosophy.  In  this  project,  the  role  of  philosophy 
was  to  integrate  empirical  research  into  a  theoreti¬ 
cal  totality  that  could  both  represent  reality  and 
provide  a  normative  perspective  on  it.  Given  the 
fact  that  in  Western  societies  the  proletariat  had 
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not  proven  to  be  the  revolutionary  class  that 
would,  according  to  Marx,  overthrow  the  capital¬ 
ist  system  of  exploitation  and  alienation,  the  new 
paradigm  of  interdisciplinary  materialism  never¬ 
theless  was  to  remain  devoted  to  the  practical  aim 
of  emancipating  society  in  totality  by  developing 
its  inherent  rational  potential  and  its  possibilities 
for  liberation.  The  critique  of  ideology  was  one 
more  traditional  way  to  demonstrate  the  norma¬ 
tive  cleavages  between  the  universal  promises  of 
the  French  Revolution — liberte,  egalite,  frater- 
nite — and  the  unjust  class  realities  of  bourgeois 
society,  based  on  a  capitalist  economy  in  which 
monopolistic  structures  had  irreversibly  under¬ 
mined  the  socioeconomic  foundations  of  liberal¬ 
ism.  Despite  the  critique  of  liberalism,  critical  the¬ 
ory  in  its  normative  perspective  always  favored 
individualism  over  collectivism.  Only  an  emanci¬ 
pated  society  could  finally  provide  the  social  basis 
of  “real  individualism,”  while  capitalism  had 
reduced  “individualism”  to  mere  market-driven 
egoism  and  class  antagonism. 

In  the  face  of  the  triumph  of  both  Nazism  and 
Stalinism,  and  with  disappearing  hope  for  a  practi¬ 
cal  solution  to  the  revolutionary  question, 
Horkheimer  reformulated  the  program  of  the 
group,  in  a  famous  article  in  1937,  as  “critical 
theory.”  Critical  theory,  as  opposed  to  “tradi¬ 
tional  theory”  (i.e.,  positivism),  should  at  least 
keep  the  consciousness  alive  in  the  absence  of  a 
revolutionary  subject — that  normative  alternatives 
beyond  the  mere  positivist  affirmation  of  the  given 
status  quo  in  the  fate  of  humankind  were  think¬ 
able,  at  least  in  principle — that  is,  critical  theory  as 
the  notorious  “message  in  a  bottle”  in  the  absence 
of  a  practical  subject  of  emancipation. 

By  the  early  1940s,  when  Horkheimer  and 
Adorno  wrote  their  famous  Dialektik  der  Auf- 
klarung  ( Dialectic  of  Enlightenment-,  published  in 
English  in  1972),  their  diagnosis  of  time  and  his¬ 
tory  had  become  even  more  pessimistic.  From  the 
beginnings  of  enlightenment  early  in  human  his¬ 
tory,  the  development  of  “reason”  in  the  domi¬ 
nant  form  of  “instrumental  rationality”  or  “pur¬ 
posive  rationality”  alienated  individuals  and  espe¬ 
cially  their  system  of  social  interaction  from 
“objective  reason”  in  constituting  the  subject — 
and  thus  their  human  potential.  The  history  of 
humankind  was  thus — against  Hegel’s  and  Marx’s 
views — perceived  as,  at  best,  an  ambivalent  story. 


In  this  pessimistic  vision  of  mankind  “logic  of 
decay”  at  present,  there  was  no  longer  a  theoreti¬ 
cal  basis  for  hope  of  change.  Later,  in  the  1960s, 
Adorno  in  his  Negative  Dialectic  (published  in 
English  in  1973)  confirmed  this  negative  diagnosis 
in  the  topos  of  a  “totally  administered  society.” 
Marcuse,  despite  some  political  and  personal  dis¬ 
tance  from  Horkheimer  and  Adorno  since  the  late 
1930s,  came  very  close  to  this  pessimistic  approach 
with  his  concept  of  the  one-dimensional  man 
totally  shaped  by  the  main  dominant  values,  in  the 
early  1960s — though  it  was  perceived  quite  differ¬ 
ently  by  the  international  student  protest  move¬ 
ment  of  the  late  1960s.  The  first  generation  of 
critical  theorists,  at  least  Horkheimer  and  Adorno, 
at  the  end  of  their  theoretical  path,  which  began 
without  the  practical  subject  of  revolutionary 
practice,  were  deprived  of  a  convincing  theoretical 
foundation  of  rationality  that  could  have  been  the 
background  of  their  critical  claim. 

Habermas’s  Contributions  to  Critical  Theory 

It  was  Habermas  who,  as  a  former  assistant  to 
Adorno  in  the  Frankfurt  Institute,  started  a  pro¬ 
found  critique  of  the  aporia  concerning  the  way  in 
which  reason  is  held  to  be  constituted.  In 
Habermas’s  proclaimed  “critical  theory  without 
praxis,”  he  searched  during  the  1960s  for  a  new 
immanent  theoretical  basis  for  the  foundation  of 
reason  or  rationality  that  could  effectively  claim  to 
critique  mere  instrumental  or  purposive  rational¬ 
ity.  According  to  Habermas,  the  way  back  to 
Kantian  transcendentalism  would  lead  to  norma¬ 
tive  philosophy  only,  which  could  not  as  such 
relate  to  the  tradition  of  “materialist”  social 
research.  He  critiqued  the  Hegelian  naturalism 
inherent  in  the  early  program  of  critical  theory; 
after  some  diversions,  he  began  to  develop  a  new 
foundation  for  critical  theory  in  a  linguistic  turn 
away  from  the  traditional  Marxist  approach.  One 
could  definitely  ask  the  question  whether  tradi¬ 
tional  critical  theory  did  not  come  to  an  end 
already  with  this  fundamental  shift  in  theoretical 
foundation,  but  Habermas  vigorously,  and  finally 
successfully,  always  claimed  the  successorship. 

Based  on  his  reception  of  the  work  of  Ludwig 
Wittgenstein,  pragmatism,  hermeneutics,  and  the 
theory  of  speech  acts  as  developed  by  John  L. 
Austin  and  John  Searle,  Habermas  proclaimed  that 
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in  the  logic  of  communicative  interaction  a  “telos 
of  understanding”  is  always  inherent.  So  he  shifted 
the  basis  of  socialization  ( Vergesellschaftung )  from 
the  Marxist  emphasis  on  labor  to  the  communica¬ 
tion  of  people  in  their  lifeworld.  Properly  analyzed 
and  turned  into  reflexive  practices,  “communica¬ 
tive  rationality”  now  could  serve  as  the  pragmatic 
fundamentum  of  various  forms  of  rationality.  This 
is  the  task  of  the  discurs  proposed  and  simulated 
at  once  in  Habermas’s  new  version  of  critical  the¬ 
ory:  In  a  discourse,  the  various  implicit  performa¬ 
tive  claims  of  participants  become  reflexive  and,  in 
Immanuel  Kant’s  phrase,  “under  the  coercion-free 
coercion  of  the  better  argument,”  accordingly 
transformative.  Thus,  in  every  real  practice  of  dis¬ 
cursive  deliberation,  participants  have  to  agree  on 
the  normative  principles  of  discursive  justification. 

Theory  of  Communicative  Action 

In  his  magnum  opus,  the  Theorie  kommunika- 
tiven  Handelns  (Theory  of  Communicative  Action), 
Habermas  elaborated  the  different  forms  of  ratio¬ 
nality,  necessarily  inherent  if  speakers  claim  the 
truth,  rightness,  or  justice  of  what  they  say.  In  this 
analysis,  he  refers  to  a  “counterf actual  ideal  speech 
situation,”  to  which  every  person  would  have 
equal  access,  in  which  no  power  relations  inter¬ 
vene  and  only  the  validity  of  arguments  counts. 
According  to  Habermas,  under  such  ideal  condi¬ 
tions,  participants  would  normally  reach  a  consen¬ 
sus  that,  thus,  could  claim  universal  validity  as 
reasonable.  As  one  can  see,  at  the  heart  of  this 
argument  lies  the  Kantian  assumption  that  gener- 
alizability  grounds  normative  universality. 

The  Public  Sphere 

While  Habermas  exposed  this  philosophical 
reconstruction  of  practical  communication,  he  at 
once  embedded  it  into  a  more  empirically  grounded 
social  theory.  Modern  society — which  he  contin¬ 
ued  to  describe  as  “late  capitalist”  (1975) — is 
functionally  differentiated  in  “systems,”  such  as 
the  “political-administrative  system”  or  the 
“economy,”  that  are  perpetuated  through  an 
abstract  logic  of  purposive  rationality,  on  the  one 
hand,  and  the  Lebenswelt  (lifeworld),  on  the 
other,  which  is  coordinated  through  communica¬ 
tive  interaction;  the  “public  sphere”  functions  in 


between,  somehow  intermediating.  As  an  actual 
critical  diagnosis,  Habermas’s  thesis  on  a  “coloni¬ 
zation  of  the  lifeworld  by  abstract  systemic  pro¬ 
cesses,”  especially  its  “economization,”  became 
very  prominent  and  was  popularized  thereafter 
and  perceived  as  the  definitive  new  commentary  of 
critical  theory  on  the  present  society  in  “late  capi¬ 
talism.”  “Colonization”  was  substituted  for  the 
Marxist  term  of  alienation  in  its  critical  function. 

Critical  theory  always  claimed  to  be  more  than 
just  a  normative  theory  by  grounding  the  norma¬ 
tive  potential  of  its  actual  critique  in  the  structures 
and  logics  of  the  society  itself.  While  Habermas 
could  claim  that  he  has  found  this  grounding  on  a 
microlevel  in  his  discursive  theory  based  on  his 
pragmatic  analysis  of  communication,  for  a  critical 
theory  of  the  present  society,  he  had  to  demon¬ 
strate  the  applicability  of  his  ideas  on  a  macrolevel 
as  well.  Already,  in  his  early  Der  Strukturwandel 
der  Offentlichkeit  (The  Structural  Transformation 
of  the  Public  Sphere ),  he  reconstructed  in  an  ideal- 
type  manner  the  development  and  function  of  the 
public  sphere  ( Offentlichkeit )  as  part  of  the  politi¬ 
cal  and  social  emancipation  of  “bourgeois  society” 
from  feudal  domination.  In  the  institutions  of  this 
“public  sphere” — journals,  parties,  associations, 
clubs,  salons,  and  finally  the  parliament — emerg¬ 
ing  and  prospering  during  the  late  18th  and  the 
19th  centuries,  the  practice  of  discursive  delibera¬ 
tion  in  public  formed  decisively  the  political  and 
normative  development  of  bourgeois  society  dur¬ 
ing  its  liberal  period.  Publicity — which  had  already 
played  a  role  in  Kant’s  philosophy — now  emerged 
as  a  normative  principle  guiding  the  development 
of  the  rule  of  law  ( Rechtsstaatlichkeit )  and  the 
legitimization  of  political  power.  The  “public 
sphere”  thus  represented  in  real  history  the  societal 
requisite  for  public  deliberation.  Unfortunately 
and  due  foremost  to  the  capitalist  structure  of  the 
developing  mass  media,  this  public  sphere,  in  the 
past  liberal  period  of  capitalism,  had  turned  into 
an  instrument  of  mass  manipulation  in  the  hands 
of  big  capital — leading  to  the  “structural  change” 
in  the  title  of  the  book.  So  the  normative  function¬ 
ing  of  the  public  sphere  is  just  a  potential  reality 
that  requires  certain  supporting  conditions  such  as 
open  and  fair  access  for  various  social  groups,  free¬ 
dom  from  censorship,  and  pluralistic  media.  In  his 
later  writings,  Habermas  in  part  also  used  the  con¬ 
cept  of  civil  society  in  which  all  sorts  of  informal 
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groups,  associations,  and  social  movements  con¬ 
tribute  to  this  public  process  of  normative  and 
practical  deliberation. 

Deliberative  Democracy 

In  recent  years,  Habermas  has  worked  on 
reconstructing  the  interplay  between  the  systemic 
reproduction  of  the  economy  (market)  and  the 
political-administrative  system,  on  the  one  hand, 
and  the  norm-generating  communication  of  life- 
world  and  civil  society,  on  the  other.  In  Faktizitdt 
und  Geltung  (Between  Facts  and  Norms),  Haber¬ 
mas  offered  a  new  account  of  the  history  of 
democracy  and  the  rule  of  law  (Rechtsstaatlichkeit) 
and  applied  it  to  the  analysis  of  present  Western 
democracies.  Compared  with  the  writings  of  criti¬ 
cal  theory  up  until  the  1960s,  Habermas  now 
finally  dropped  the  idea  of  any  revolutionary  tran- 
scendentalization  of  critical  theory  and  recon¬ 
structed  it  as  a  normative  critique  based  on  the 
practical  lifeworld  perspective  of  participants. 
Against  the  imperatives  of  the  purely  purposive 
rationality  of  systemic  reproduction  in  the  fields  of 
market  economy  and  political  administration, 
active  participants  (“citizens”)  in  civil  society 
involved  in  all  sorts  of  mobilization,  from  protest 
movements  to  political  parties,  now  have  the 
chance  to  raise  issues  and  to  impregnate  the  sys¬ 
temic  processes  with  the  normative  repertoire  of 
lifeworld  communication.  Democracy,  a  specific 
version  of  it,  now  plays  a  central  strategic  role. 
Against  the  critique  of  mere  liberal  majority  rule  a 
la  Joseph  Schumpeter,  Habermas  develops  his 
more  philosophical  discursive  justification  into  a 
model  of  “democratic  deliberation.”  According  to 
this  model,  the  democratic  process  does  not  just 
recognize  and  aggregate  the  prepolitical  prefer¬ 
ences  of  individuals  as  in  the  liberal  model  but 
views  citizens  as  active  participants  in  a  process  of 
public  deliberation  on  practical  and  normative 
questions.  This  process  is  bound  to  what  Philip 
Petitt  (1992)  has  called  “preference  laundering”: 
Participants  are  thus  required  to  accept  the  prin¬ 
ciple  of  deliberative  justification  and  change  their 
original  preferences  in  light  of  better  arguments. 
Analogously  to  the  normative  philosophical  model 
of  discourse,  the  deliberative  communications  of 
the  “public  sphere”  in  the  end  also  deserve  the  sup¬ 
position  that  the  generated  outcome  in  the  form  of 


“public  opinion”  on  a  certain  question  is  reason¬ 
able;  a  supposition,  as  Habermas  continues  to 
proclaim,  that  first  depends  on  the  approximation 
of  the  real  communications  process  to  the  norma¬ 
tive  ideal  requirements  of  discursive  justification 
and,  second,  always  remains  fallible  in  the  future. 

In  Faktizitdt  und  Geltung ,  this  theory  of  delib¬ 
erative  democracy  now  serves  as  the  normative 
foundation  of  the  legitimacy  of  law;  in  its  develop¬ 
ment,  the  system  of  law  in  a  democratic  society  at 
best  reflects  the  normative  outcome  of  the  life- 
world-based  communications  in  practical  and 
ethical  issues  in  civil  society;  if  and  when  the  chan¬ 
nels  and  institutions  of  the  public  sphere  work 
properly,  deliberation  and  communication  are  not 
distorted,  and  open  access  is  given  to  the  various 
normative  preferences  of  a  pluralist  society. 

The  evolution  of  law  under  democratic  condi¬ 
tions  can  thus  in  the  long  run  be  viewed  as  a  social 
process  of  learning  by  which  modern  societies  per¬ 
manently  adapt  their  system  of  positive  law  to  new 
requirements  in  their  internal  and  external  environ¬ 
ment.  The  “logic  of  decay,”  so  characteristic  of  the 
main  representatives  of  critical  theory  until  the 
1960s,  has  been  gradually  transformed  by  Habermas 
into  an  evolutionist  framework  of  affirming  the 
inherent  potential  of  modern  societies  for  normative 
and  reasonable  progress.  In  recent  years,  this  more 
or  less  optimistic  and  somehow  even  neo-Hegelian 
view  of  historical  progress  has  been  expanded  by 
Habermas  to  the  evaluation  of  international  law. 
While  he  vigorously  critiques  the  unilateral  inter¬ 
ventionist  policies  of  the  United  States  under  the 
George  W.  Bush  administration,  he  has  at  the  same 
time  judged  certain  “humanistic  interventions” 
based  on  moral  grounds  as  “anticipating  the  univer¬ 
sal  rights  of  the  people”  and  the  reformed  UN  sys¬ 
tem  as  the  core  of  a  global  legal  system  of  justice. 

Habermas’s  Influence  in  Political 
Science  and  Policy  Studies 

Habermas’s  philosophical  works  in  recent  years 
have  had  an  enormous  impact  on  analytical  and 
theoretical  developments  within  political  science.  In 
the  political  science  literature,  Claus  Offe  repre¬ 
sents  next  to  Habermas  the  turn  to  a  more  analyti¬ 
cally  based  approach  of  critical  theory;  especially, 
the  concepts  of  deliberation  and  justification 
through  deliberation  have  widely  spread  from  the 
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subdisciplines  of  international  relations  to  the 
empirical  field  of  policy  studies.  In  the  former, 
Habermas’s  ideas  have  backed  and  reformulated 
the  constructivist  approach  to  international  rela¬ 
tions,  such  as  that  of  Thomas  Risse,  and  even 
inspired  ideas  of  “cosmopolitan  democracy,”  as  in 
the  work  of  David  Held,  while  in  various  approaches 
to  policy  research,  “deliberation  in  networks”  is 
today  viewed  not  only  as  a  proper  analytical  tool 
but  also  as  a  practical  device  for  more  effective 
governance. 

This  cannot  be  covered  here  in  detail.  Only 
some  questions  and  fundamental  problems  related 
to  these  adoptions  of  Habermas’s  normative  the¬ 
ory  can  be  briefly  raised. 

Most  such  problems  arise  in  the  process  of 
transforming  the  “quasi-transcendental”  norma¬ 
tive  theory  of  discourse  and  discursive  justification 
of  practical  and  ethical  norms  into  a  real  model  of 
policy  making  and  democracy.  The  real  political 
process  in  pluralistic  societies  has  to  take  into 
account  not  only  arguments  but  also  particular 
interests  and  even  beliefs  and  emotions  that  are  not 
rationally  grounded.  Even  the  analytical  distinc¬ 
tion  between  processes  of  “arguing”  for  practical 
norms  and  “negotiating”  for  interests  is  not  com¬ 
plex  enough  to  cover  all  the  legitime  possibilities  of 
expression  of  individual  and  collective  preferences. 
The  normative  requirement  of  participation  in 
public  deliberation — namely,  of  preference  trans¬ 
formation  in  discourses — can  turn  into  illiberal 
coercion  against  individuals  and  groups,  which 
might  deprive  them  of  their  privacy. 

As  far  as  the  future  of  “democracy  beyond  the 
state”  (see  Michael  Greven  &  Louis  Pauly’s  2000 
book  by  that  title)  is  concerned,  another  problem 
related  to  the  deliberative  conception  of  democ¬ 
racy  becomes  even  more  evident  and,  indeed, 
already  is  on  the  level  of  nation-states:  Who  can 
and  who  should  participate  in  deliberative  gover¬ 
nance  networks  as  they  have  developed  and  are 
proposed  on  the  transnational  level?  Not  every 
kind  of  participation  in  arrangements  of  “partici¬ 
patory  governance”  is  democratic,  because  often, 
the  essential  normative  requirement  insisting  on 
the  equality  of  participants  is  violated  by  the  “par¬ 
ticipation”  of  collective  actors  and  all  kinds  of 
organizations.  The  normative  principal-agent 
problem  refers  to  collective  actors  who  also  claim 
to  be  advocates  of  public  interests. 


Confronted  with  the  ongoing  and  seemingly 
irreversible  process  of  denationalization  and  grow¬ 
ing  interdependence  of  former  state-organized 
democracies,  which  both  threaten  the  idea  of  con¬ 
gruence  between  a  democratic  demos  and  a  terri¬ 
torial-based  regime  of  governance,  critical  theory 
should  perhaps  return  to  its  original  historical 
approach  and  start  to  think  about  a  future  beyond 
traditional  democracy. 

Michael  Th.  Greven 
Universitat  Hamburg 
Hamburg,  Germany 
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Critical  Theory  in 
International  Relations 


The  development  of  explicitly  critical  approaches 
to  the  study  of  international  relations  (IR) — that  is, 
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critical  IR  theory — is  generally  dated  to  the  early 
1980s.  Today,  it  can  be  seen  as  a  pluralistic  family 
of  theories,  containing  both  modernist  and  post¬ 
modernist  variants,  including  Frankfurt  School- 
influenced  IR  theory,  Gramscian  international 
political  economy,  feminist  IR  theory,  postmodern 
IR  theory,  and  most  recently,  postcolonial  theory. 

What  these  admittedly  disparate  traditions  have 
in  common  is  a  commitment  to  the  enterprise  of 
critique.  At  its  most  general  level,  this  can  be 
understood  in  terms  of  the  distinction,  common  to 
the  work  of  many  critical  IR  theorists,  between 
two  forms  of  theorizing — “problem  solving”  (or 
“traditional”),  on  the  one  hand,  and  “critical,”  on 
the  other.  Proponents  of  critical  theory  start  from 
the  proposition  that  theory  is  never  neutral  but 
always  constructed  in  response  to  a  specific  agenda 
and  in  the  service  of  a  specific  project.  Problem¬ 
solving  and  traditional  theory — of  which  main¬ 
stream  IR  theory  stands  as  a  prime  example — is 
judged  by  the  proponents  of  critical  theory  to  be  a 
form  of  theorizing  that  stands  in  the  service  of  the 
status  quo.  The  knowledge  associated  with  main¬ 
stream  IR  theory  is  in  their  view  not  value  free; 
rather,  it  is  generated  to  help  society’s  elites  main¬ 
tain  their  positions  of  power  and  privilege,  and,  at 
its  most  extreme,  it  is  in  the  service  of  a  totalitarian 
project  on  a  global  scale. 

In  contrast,  critical  forms  of  theorizing  are  char¬ 
acterized  by  the  following  elements. 

The  first  element  common  to  these  critical 
forms  of  theorizing  is  methodological  opposition 
to  positivism  in  mainstream  IR  theory.  From  the 
perspective  of  critical  theory,  positivism,  with  its 
defining  characteristics  of  naturalism,  the  separa¬ 
tion  of  fact  and  value,  and  the  separation  of  the 
observed  and  the  observer,  is  more  than  just  an 
approach  to  the  study  of  social  phenomena. 
Rather,  it  contains  within  it  a  telos  of  domina¬ 
tion — a  telos  that  derives  from  positivism’s  interest 
in  constituting  the  world  for  the  purposes  of  con¬ 
trol  and  from  its  tendency  to  reinforce  the  status 
quo  by  treating  the  socially  constructed  as  given 
and  by  representing  that  which  is  alterable  as 
something  that  cannot  be  changed. 

It  should  be  noted  that  the  critique  of  the  highly 
normative  agenda  of  mainstream  IR  theory  is  not 
an  argument  about  the  ideological  biases  of  main¬ 
stream  theorists  and  how  they  inform  and  distort 
the  social-scientific  enterprise.  Such  biases  may 


well  exist,  but  in  the  view  of  critical  theorists,  it  is 
the  knowledge-constituting  interests  of  main¬ 
stream  IR  theory  itself,  not  the  values  of  individ¬ 
ual  researchers,  that  give  it  its  conservative  polit¬ 
ico-normative  agenda.  Therefore,  mainstream 
researchers  cannot  overcome  their  conservative 
bias  by  being  more  vigilant  in  their  research 
design  or  by  being  more  self-aware  of  their  biases. 

In  contrast,  critical  theory  integrates  so-called 
qualitative  methodologies  aimed  at  unearthing  the 
way  inter  subjective  meanings  are  both  embedded  in 
and  instantiate  social  practices.  The  objective  here  is 
not  to  deny  the  existence  of  regularities  in  the  social 
world  but  rather  to  determine  when  such  observ¬ 
able  regularities  are  regularities  tout  court  and 
when  they  can  be  understood  to  be  the  manifesta¬ 
tions  of  ideologically  petrified  relationships  of 
domination  that  could  be  challenged  and  overcome. 

A  second  element  of  critical  forms  of  theorizing 
is  the  adoption  of  a  strategy  of  immanent — as 
opposed  to  transcendent — critique.  Where  the  lat¬ 
ter  applies  transhistorical  (and  therefore,  ahistori- 
cal)  standards  to  objects  or  phenomena  under 
examination,  immanent  critique  applies  historically 
emergent  standards  for  the  purposes  of  criticism 
that  are  generated  by  the  object/phenomenon  itself. 

A  third  element  is  a  shared  commitment  to  pro¬ 
moting  self-consciously  normative  discussions 
within  the  discipline.  What  is  distinctive  about  this 
activity,  moreover,  is  that  in  light  of  the  critique  of 
the  separation  of  fact  and  value  noted  above,  criti¬ 
cal  theory’s  promotion  of  normative  critique  is 
formulated  so  as  not  to  reproduce  the  problematic 
separation  of  fact  and  value  characteristic  of  posi¬ 
tivism.  That  is,  critical  IR  theory  problematizes  the 
conventional  distinction  between  “normative”  and 
“empirical”  theory,  noting  the  inherently  norma¬ 
tive  content  of  all  forms  of  theorizing,  including 
so-called  “empirical”  efforts. 

A  fourth  element  is  the  critique  of  ideology. 
From  the  conclusion  that  neorealism  and  neoliber- 
alism  are  two  sides  of  the  same  mainstream  coin, 
to  the  critique  of  the  gendered  nature  of  the  global 
political  economy  (not  to  mention  the  theories 
developed  to  “explain”  it),  and  to  the  way  domi¬ 
nant  academic  discourse  continues  to  contribute  to 
the  view  that  the  underdeveloped  world  is  a  simple 
casualty  of  inherent  weaknesses  and  limitations, 
critical  IR  theory  carries  its  agenda  of  ideology 
critique  forward  on  a  number  of  different  levels 
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and  in  terms  of  a  range  of  distinct,  but  interrelated, 
foci  (class,  race,  gender,  sexual  orientation,  etc.). 

A  fifth  element  is  a  shared  interest  in  questions 
of  oppositional  practice  and  their  potential  to 
contribute  to  emancipatory  change.  Here,  the 
focus  is  on  social  forces — social  movements,  old 
and  new  (e.g.,  labor,  ecological  movements, 
women’s  movements,  and  human  rights) — and 
the  way  in  which  critical  theory  can  help  explain 
the  emergence  and  nature  of  these  movements  as 
well  as  how  critical  IR  theory  can  generate 
knowledge  that  might  prove  to  be  of  use  to  these 
movements. 

As  already  noted,  differences  exist  between 
critical  IR’s  subtraditions.  An  early  tension  was 
one  between  modernist — enlightenment-inspired 
forms  of  theorizing — and  postmodernist  challeng¬ 
ers.  The  latter’s  skepticism  about  metanarratives  in 
general — including  metanarratives  framed  in  terms 
of  the  telos  of  human  emancipation — raised  funda¬ 
mental  questions  both  about  the  viability  of  mod¬ 
ernist  variants  of  critical  IR  theory  and  about  the 
political  content  of  postmodernist  variants. 

Beyond  this  tension,  others  have  manifested 
themselves.  An  early  success  by  mainly  postmod¬ 
ernist  critical  theorists  in  being  given  a  special  issue 
of  the  International  Studies  Quarterly  was  marred 
by  their  failure  to  include  even  one  contribution 
from  a  feminist  theorist.  Wariness  on  the  part  of  the 
feminist  IR  community  with  regard  to  “masculin- 
ist”  theorizing — including  critical  theorizing — con¬ 
tinues  (and  not  without  justification).  More  recently, 
postcolonial  theorists  (who  include  a  number  of 
feminists)  have  suggested  that  insensitivity  to  mat¬ 
ters  of  race  as  well  as  gender  leaves  at  least  some 
critical  IR  theory  open  to  the  charge  that  it  is  not 
only  masculinist  but  “White  supremacist”  as  well. 

In  recent  years,  a  number  of  efforts  have  been 
made  to  assess  the  impact  of  the  critical  theoretical 
tradition  on  the  discipline  as  a  whole.  Some  have 
argued  that  the  critical  engagement  with  positiv¬ 
ism  and  the  openness  to  interpretive  methodolo¬ 
gies  have  helped  create  greater  “thinking  space” 
within  the  mainstream  IR  theory  (see,  e.g.,  the 
movement  within  mainstream  IR  known  as  con¬ 
structivism).  This  judgment  gives  rise  to  the 
following  question:  Having  had  a  positive  influ¬ 
ence  on  mainstream  IR  theory,  does  an  explicitly 
oppositional  critical  theory  have  any  further  use  in 
the  discipline? 


More  skeptical  critics  have  expressed  doubts 
about  whether  the  metatheoretical  discourse  of 
critical  IR  theory  has  done  more  than  waste  intel¬ 
lectual  energies  that  might  have  been  better  put  to 
use  studying  the  concrete  problems  of  world  poli¬ 
tics.  Some  have  even  gone  so  far  as  to  charge  criti¬ 
cal  theory  with  an  ideologically  motivated  “cleans¬ 
ing”  agenda,  with  mainstream  approaches  such  as 
realism  being  driven  increasingly  to  the  margins  of 
the  discipline. 

In  addition,  some  who  profess  to  share  ongoing 
sympathies  with  the  critical  project  nonetheless 
expressed  doubts  about  its  adequacy  in  terms  of 
practice — specifically,  critical  IR’s  failure  to  make 
meaningful  links  to  antisystemic  movements — 
whether  as  a  result  of  inadequacy  at  the  level  of 
conceptualization  or  in  terms  of  the  origins  and 
prejudices  of  critical  theorists  themselves. 

Finally,  the  origins  and  geographical  presence  of 
critical  IR  theory  itself  is  worth  noting.  The  disci¬ 
pline  of  IR  is  overwhelmingly  a  discipline  of  the 
English-speaking  world,  with  the  United  States 
being  the  unquestioned  center  in  which  the  bulk  of 
IR  theory  is  produced.  Significantly,  critical  IR 
theory  is  largely  a  product  of  the  academic  “semi¬ 
periphery.”  That  is,  while  some  critical  IR  theo¬ 
rists  are  to  be  found  within  the  borders  of  the 
United  States,  a  disproportionate  number  are 
products  of  and  are  academically  active  in  the 
United  Kingdom,  Canada,  Australia,  and  New 
Zealand.  A  sociology  of  knowledge  account  of  this 
aspect  of  critical  IR  theory  remains  to  be  written. 
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Trent  University 
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Cross-National  Surveys 


Comparative  research  has  been  broadly  applied 
by  the  social  sciences  research  community  for  at 
least  60  years  now.  Its  application  in  disciplines 
such  as  cultural  or  social  anthropology  counts 
among  the  most  important  advancements  in  the 
contemporary  understanding  of  cultures  and  soci¬ 
eties.  In  political  science,  cross-cultural  studies 
have  increasingly  been  based  on  cross-national 
surveys.  The  word  survey  itself  is  broad,  even  if 
many  researchers  use  it  with  a  “quantitative” 
meaning:  the  collection,  through  a  questionnaire 
survey  and  on  a  representative  sample,  of  obser¬ 
vations  coming  from  the  quantitative  coding  of 
the  responses  chosen  by  respondents.  In  the  fol¬ 
lowing,  the  use  of  the  word  survey  will  follow  this 
tradition.  This  entry  discusses  the  developments 
in  this  field  and  addresses  remaining  problems. 

Evolution  of  Cross-National  Research 

The  developments  of  European  integration  and 
the  interdependences  created  by  globalization 
have  clearly  increased  the  value  of  comparisons 
across  different  national  systems  and  across  cul¬ 
tures;  we  are  in  the  context  of  what  Anthony 
Heath  calls  the  “globalization  of  public  opinion 
research.”  This  is  reflected  in  the  greater  number 
of  available  sociopolitical  “barometers.”  Following 
the  example  of  the  Eurobarometer  since  the 
1970s,  these  are  now  combined  in  the  “Global 
Barometers  ”  proj ect,  including  also  Afrobarometer, 
Latinobarometro,  and  East  Asia  Barometer. 

Despite  these  significant  recent  developments, 
cross-national  survey  methodology  has  a  long  his¬ 
tory.  The  process  started  more  than  4  decades 
ago,  with  Gabriel  Almond’s  and  Sidney  Verba’s 
seminal  contribution  in  The  Civic  Culture  (1963), 
which  was  updated  and  revised  in  1980.  Previously, 
a  few  other  cross-national  surveys  of  political  atti¬ 
tudes  had  been  developed,  notably  William 
Buchanan  and  Hadley  Cantril’s  How  Nations  See 


Each  Other  (1953),  sponsored  by  UNESCO 
(United  Nations  Educational,  Scientific  and 
Cultural  Organization).  In  a  recent  overview  of 
available  cross-national  surveys,  the  German  data 
archive  (Zentralarchiv  in  Cologne)  found  that 
from  1948  (fieldwork  date  for  How  Nations  See 
Each  Other)  to  2004,  more  than  60  projects  of 
cross-national  surveys  have  taken  place.  Profusion 
of  data  rather  than  a  lack  of  data  is,  in  fact,  the 
case  today.  But  profusion  does  not  necessarily 
mean  availability  of  “truly”  comparative,  harmo¬ 
nized,  and  replicated  studies.  In  the  beginning  of 
the  1990s,  a  European  Science  Foundation  (ESF) 
paper  reported  that  Europe  was  rich  in  social  sci¬ 
ences  data  but  poor  in  harmonized  and  compara¬ 
ble  data.  Since  that  period,  much  progress  has 
been  made  (in  particular  due  to  the  European 
Social  Survey,  ESS),  but  a  few  important  method¬ 
ological  problems  comparing  surveys  remain. 

The  profusion  of  available  data  makes  it  impos¬ 
sible  to  present  an  exhaustive  picture  of  cross¬ 
national  surveys.  It  is  more  fruitful  to  present  some 
of  the  major  series  and  the  methodological  issues 
that  they  pose  in  terms  of  measurement,  comparison, 
and  data  analysis.  In  another  recent  survey  of  major 
comparative  studies,  Pippa  Norris  (2009)  presents  a 
quasi-exhaustive  list  of  the  most  relevant  ones.  She 
proposes  to  keep  as  most  relevant  the  studies 
“defined  as  those  covering  more  than  one  indepen¬ 
dent  nation-state  which  have  established  a  regular 
series  of  surveys  of  social  and  political  attitudes  and 
behavior”  (p.  522).  This  includes  the  Eurobarometer 
and  related  European  Union  (EU)  surveys  (which 
started  in  1970),  the  European  Elections  Study 
(1979),  the  European  Values  Survey  and  the  World 
Values  Survey  (1981),  the  International  Social  Survey 
Programme  (1985),  the  Global  Barometers  (from 
1990),  the  Comparative  National  Elections  Project 
(1990),  the  European  Voter  and  the  Comparative 
Study  of  Electoral  Systems,  the  ESS  (2002),  the 
Transatlantic  Trends  survey  (2002),  the  Pew  Global 
Attitudes  project  (2002),  and  the  Gallup  World  Poll 
(2005).  In  the  following,  only  a  few  of  these  studies 
can  be  discussed  in  greater  detail. 

The  Eurobarometer 

Among  these,  the  Eurobarometer  is  the  most 
important  with  regard  to  its  time  series  and  its  cov¬ 
erage.  The  Eurobarometers,  initiated,  developed, 
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and  organized  by  the  European  Commission,  have 
played  a  key  role  in  the  expansion  of  political  and 
social  attitudinal  measurement  across  Europe. 
Initiated  by  Jacques-Rene  Rabier,  a  high-level  civil 
servant  at  the  European  Commission  having 
visionary  ideas  about  the  role  of  citizens  in  the 
construction  of  a  legitimate  Europe,  this  series 
investigates  mainly  the  key  question  of  European 
citizens’  attitudes  toward  European  integration. 
The  series  has  been,  and  continues  to  be,  more 
than  just  a  public  communication  tool  for  the  EU 
Commission.  Its  findings  and  data  have  been 
extensively  used  by  social  scientists  in  particular 
in  two  fields  of  political  sociology:  the  analysis  of 
attitudes  toward  EU  (integration,  enlargement, 
EU  domains  of  public  policies,  citizenship  and 
national  attachment,  trust  and  knowledge  about 
EU  institutions,  and  support  for  EU)  and  the 
analysis  of  the  postmaterialist  “revolution.”  This 
research,  initiated  by  Ronald  Inglehart,  was  clearly 
linked,  at  least  in  the  first  steps,  to  Eurobarometer 
survey  items. 

Even  if  academic  researchers  have  many 
opportunities  to  use  Eurobarometer  data,  this 
project  has  its  own  agenda  and  is  largely,  if  not 
exclusively,  directed  toward  EU  matters. 
Conducted  two  times  a  year  since  1973,  the 
“regular”  Eurobarometer  series  (the  EU 
Commission  also  organizes  other  “special” 
Eurobarometer  studies  on  focused  topics)  offers 
a  unique  time  series  to  investigate  trends  in  sup¬ 
port  for  the  EU.  A  recent  development  of  the 
questionnaire  concerns  the  introduction  of  items 
related  to  globalization  and  the  attitudinal  con¬ 
nections  between  perceptions  of  the  EU  and  per¬ 
ceptions  of  globalization.  The  Eurobarometer 
covers  the  population  of  the  respective  nationali¬ 
ties  of  each  EU  member  state  aged  15  years  and 
older.  An  interesting  point  of  the  series  is  that  it 
also  samples  the  populations  of  candidate  coun¬ 
tries  (in  2010,  Croatia,  Turkey,  and  the  former 
Yugoslav  Republic  of  Macedonia;  it  also  samples 
the  Turkish  Cypriot  Community).  With  few 
exceptions  (e.g.,  Luxembourg),  the  sample  sizes 
of  national  samples  are  about  1,000. 

European  Values  Studies 

A  second  well-established  cross-national  survey 
series  is  the  European  Values  Studies  (EVS),  from 


which  the  World  Values  Study  (WVS)  developed. 
Funded  by  a  private  foundation  (the  EVS  founda¬ 
tion,  based  at  Tilburg  University  in  the  Netherlands), 
the  primary  focus  of  the  EVS  is  on  changing  moral 
and  social  values,  with  particular  emphasis  on 
Christian  values  and  their  possible  replacement  by 
alternative  “values  systems”  (see  Loek  Halman, 
2001).  The  EVS  series  is  profoundly  marked  by  this 
original  intellectual  project,  whose  goal  is  to  under¬ 
stand  the  value  changes  across  Europe,  being  inves¬ 
tigated  in  specific  domains  such  as  family,  religion, 
morality,  work,  and  politics.  Some  new  issues  were 
covered  in  the  last  EVS  wave,  such  as  environmen¬ 
tal  concerns.  The  questionnaire  of  the  EVS  series  is 
among  the  most  extensive  in  contemporary  social 
surveys,  and  the  contribution  of  the  EVS  series  to 
comparative  political  sociology  is  certainly  very 
important;  the  quality  of  the  questionnaire  and  its 
methodological  and  substantive  integration  are  of  a 
very  high  standard.  A  major  difference  with  the 
Eurobarometer  series  is  the  restriction  of  the  EVS 
questionnaire  to  a  set  of  articulated  and  intellectu¬ 
ally  integrated  research  questions  and  items.  The 
first  round  of  the  EVS  was  conducted  in  1981,  the 
second  in  1990,  the  third  in  1999/2000,  and  the 
last  one  in  2008/2009.  EVS  is  completely  social 
science  driven,  managed  by  Tilburg  University,  in 
connection  with  national  academic  teams.  It  is 
conducted  only  every  8  to  9  years  because  politi¬ 
cal  and  social  values  do  not  change  rapidly.  In 
other  words,  the  EVS  perspective  is  on  gradual 
cultural  changes  rather  than  on  the  yearly  moni¬ 
toring  of  a  public  opinion  barometer.  The  exclu¬ 
sive  academic  orientation  of  the  EVS  series  also 
makes  a  big  difference  compared  with  the 
Eurobarometer:  A  “theory  group”  and  a  “meth¬ 
odology  group”  investigate  and  monitor  closely 
the  production  of  the  questionnaire  and  the  har¬ 
monization  of  fieldwork,  translations,  and  data 
coding  protocols.  The  last  wave  of  the  EVS 
(2008/2009)  has  in  particular  been  the  object  of 
intensive  methodological  analysis  by  these  two 
committees.  The  EVS  is  now  composed  of  46 
national  representative  samples  (there  were  16 
countries  in  the  EVS  series  in  1981,  23  in  1990, 
and  33  in  1999).  The  sampling  procedure  for  EVS 
is,  for  each  country,  a  representative  (multistage 
or  stratified  random  probability)  sample  of  the 
adult  population  of  the  country  aged  18  years  and 
older.  The  sample  size  is  about  1,500  respondents 
per  country.  Although  EVS  is  rather  different 
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from  Eurobarometer,  it  is  similar  to  it  in  that  it  is 
not  a  “general  social  survey.” 

The  International  Social  Survey  Programme 

The  International  Social  Survey  Programme  series 
(ISSP)  is  also  not  a  general  social  survey,  but  it  is 
academically  oriented  and  managed  like  the  EVS.  It 
is  conducted  on  a  yearly  basis,  which  makes  it  pos¬ 
sible  to  follow  a  set  of  indicators  over  time.  From  the 
perspective  of  a  questionnaire  design,  the  particular¬ 
ity  of  ISSP  is  the  “rotating-modules”  component: 
Every  year  the  questionnaire  is  devoted  to  a  particu¬ 
lar  topic  (e.g.,  religion,  role  of  government,  and 
social  inequalities).  Each  module  is  designed  to  be 
repeated  at  nonregular  intervals  to  allow  measures 
of  change  over  time  and  between  countries.  From 
the  population  coverage  point  of  view,  ISSP  has  an 
international  perspective  with  a  broad  and  diverse 
set  of  countries:  In  2010,  44  countries  from  all  con¬ 
tinents  participated  in  ISSP,  even  if  South  Africa 
remained  the  only  African  country.  The  method¬ 
ological  rules  of  ISSP  (the  harmonization  of  sam¬ 
pling  designs,  questionnaire  translation,  and  modes 
of  interview)  are  quite  strict.  ISSP  is  organized  with 
an  annual  general  assembly  that  decides  key  issues, 
in  particular  the  choice  of  the  rotating  modules.  The 
lack  of  resources  for  permanent  management  weak¬ 
ens  the  strong  methodological  requirements  some¬ 
what,  and  ISSP  is  more  like  a  collection  of  highly 
integrated  and  harmonized  national  studies  than  a 
truly  pan-international  study.  Samples  are  represen¬ 
tative  samples  of  the  population  of  the  countries 
aged  18  years  and  older;  no  telephone  interviews  are 
permitted  (but  alternatives  to  face-to-face  interviews 
are  possible,  e.g.,  postal  surveys);  and  sample  sizes 
and  sampling  designs  may  vary  somewhat  from 
country  to  country. 

The  European  Social  Survey 

The  ESS  is  the  most  recent  cross-national  data 
enterprise  and,  surely,  with  EVS,  the  most  advanced 
one  in  terms  of  methodological  controls.  The  ESS 
found  its  original  motivation  in  a  “blueprint”  docu¬ 
ment  coming  from  a  group  of  experts  commissioned 
by  the  ESF,  following  the  ESF  program  “Beliefs  in 
Government.”  The  ESS  has  many  particularities  in 
funding  (it  has  been  funded  through  the  European 
Commission’s  Framework  Programmes,  the 


European  Science  Foundation,  and  national  funding 
bodies  in  each  country),  in  organization  (the  ESS  is 
directed  by  a  Central  Coordinating  Team,  CCT), 
and  in  methodological  excellence.  The  ESS  can  be 
considered  as  one  of  the  most  integrated  and  harmo¬ 
nized  comparative  projects  in  the  social  sciences,  and 
in  2005,  it  has  been  awarded  Europe’s  very  presti¬ 
gious  annual  social  science  award — the  Descartes 
Prize.  Five  waves  (the  fieldwork  for  the  fifth  round 
finished  in  early  2011)  have  been  completed  on  ran¬ 
dom  samples  of  resident  populations  over  the  age  of 
15  in  most  EU  countries  as  well  as  in  some  non-EU 
member  states  (e.g.,  Switzerland  and  Norway). 
Depending  on  the  waves,  a  set  of  about  30  countries 
is  covered  by  ESS,  which  runs  every  2  years.  The 
technical  specification  of  the  ESS  requires  that  a 
minimum  “effective  achieved  sample  size”  should  be 
1,500  respondents,  or  800  in  countries  with  popula¬ 
tions  of  fewer  than  2  million. 

The  ESS  represents  a  very  significant  step  forward 
in  the  methodological  controls  before,  during,  and 
after  the  fieldwork  period.  The  methodological  high 
standards  have  been  a  key  issue  for  the  ESS  from  its 
beginning,  and  the  establishment  of  a  permanent 
team  taking  decisions,  supporting,  and  controlling 
strictly  the  national  teams  is  an  important  aspect  of 
this  organization.  To  get  the  official  label  of  “ESS,” 
a  national  team  has  to  strictly  follow  and  implement 
a  methodological  and  organizational  template, 
which  covers  sampling  (strict  random  sampling), 
translation  of  the  questionnaire,  fieldwork  adminis¬ 
tration,  documenting,  and  delivering  the  different 
data  files  to  the  NSD  archive  in  Bergen,  Norway. 
Since  the  creation  of  the  ESS  in  2002,  many  scholars 
across  the  world  have  used  it,  and  very  significant 
improvements  of  survey  practices  and  data  uses  have 
been  made.  The  CCT  is  supported  by  a  number  of 
advisory  and  consultative  groups,  principal  among 
which  is  the  Scientific  Advisory  Board  under  the 
chairmanship  of  Max  Kaase.  The  Scientific  Advisory 
Board  consists  of  one  representative  each  from  the 
participating  national  funding  agencies,  one  from 
the  ESF,  and  two  from  the  Commission.  A  small 
multinational  “methods  group”  advises  on  other 
methodological  issues.  For  every  round  of  the  ESS,  a 
core  set  of  questions  is  replicated  (achieving  in  so 
doing  the  objective  of  a  regular  general  social  sur¬ 
vey),  while  two  “rotating  modules”  are  chosen  after 
an  open  and  international  competition  between 
research  teams. 
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Methodological  and  Substantive  Issues 

This  short  and  selective  overview  of  cross-national 
surveys  leads  to  a  set  of  conclusions.  Very  signifi¬ 
cant  progress  has  been  made  on  the  data  quality 
and  on  the  harmonization  of  sampling  designs, 
questionnaire  translations,  equivalence  of  mea¬ 
sures,  and  data  analysis.  The  problem  of  functional 
equivalence  persists,  however,  and  using  cross¬ 
national  surveys  requires  a  lot  of  attention  to  meth¬ 
odological  concerns.  The  long  and  sometimes  dif¬ 
ficult  road  to  the  production  of  truly  comparative 
surveys  is  certainly  not  yet  at  its  end,  even  if  the  ESS 
has  achieved  a  high  standard  of  methodological 
controls.  Methodological  and  substantive  issues  are 
closely  connected  in  achieving  comparative  cross¬ 
national  data;  comparative  analysis  is  (or  should 
be)  about  testing  if  apparent  commonalities  or  dif¬ 
ferences  are  “true”  and  to  investigate  the  role  of 
deep  (thick)  idiosyncrasies  (e.g.,  of  history,  institu¬ 
tions,  and  territories)  in  those  findings.  Even  with 
the  best  intentions  (as  in  the  studies  mentioned 
above),  some  methodological  problems  remain. 
For  example,  response  rates  have  some  variance 
across  countries,  difficulties  in  translating  some 
items  and  concepts  persist,  and  controlling  for 
fieldwork  specificities  is  critical.  Documenting  and 
reporting  about  these  issues  as  well  as  details  about 
interviewers  and  fieldwork  is  very  important. 
Knowing  the  possible  sources  of  errors  and  misin¬ 
terpretations  in  the  comparative  analysis  of  cross¬ 
national  surveys  is,  indeed,  fundamental.  The 
achievements  of  the  different  data  archives  (e.g., 
Council  of  European  Social  Science  Data  [CESSDAJ 
archives,  Interuniversity  Consortium  for  Political 
and  Social  Research  [ICPSRJ,  or  comparative  data 
labs  such  as  European  Centre  for  Analysis  in  the 
Social  Sciences  [ECASSJ  or  Network  of  Economic 
and  Social  Science  Infrastructures  in  Europe 
[NESSIEJ)  in  this  respect  are  already  remarkable 
but  not  yet  finished.  In  a  digital  world,  the  easy 
access  of  users  to  very  diverse  collections  of  data, 
but  those  that  are  sometimes  not  comparable  and 
not  fully  documented,  creates  an  even  greater  need 
for  infrastructures  with  permanent  duties  and 
resources  for  harmonizing,  documenting,  archiving, 
and  disseminating  data  and  training  the  users. 

The  profusion,  diversity,  and  availability  of 
cross-national  surveys  also  raises  a  set  of  new  meth¬ 
odological  issues.  In  increasing  the  N  (in  both  space 


and  time),  the  availability  of  new  data  sets  has 
returned  an  old  question  on  the  agenda  of  com- 
parativists:  the  challenging  intellectual  project 
defined  in  1970  by  Adam  Przeworski  and  Henry 
Teune  to  replace  the  proper  names  of  nations  (or 
states  or  cultures)  with  the  names  of  variables. 
Cross-national  surveys,  in  particular  when  they 
are  conducted  in  a  large  number  of  countries  and / 
or  repeated  through  time  on  a  regular  basis,  make 
it  possible  to  think  about  this  variable-oriented 
approach.  If  this  project  is  challenged  by  case- 
oriented  approaches,  because  cultures,  nations, 
or  states  are  not  just  “control”  variables,  the 
availability  of  cross-national  surveys  such  as  the 
ESS,  EVS,  ISSP,  or  Eurobarometer  at  least  makes 
it  possible  to  go  in  that  direction.  The  return  of 
this  old  question  also  challenges  the  statistical 
techniques  used  to  analyze  the  data,  in  particular 
to  test  for  country  effects.  Many  users  still  simply 
juxtapose  their  results  country  by  country,  not 
really  facing  the  logic  of  statistical  control  for 
country  effects,  or  they  just  introduce  countries 
as  dummy  variables  in  their  models.  But  this  is 
not  enough.  Recent  advancements  in  statistical 
techniques  make  it  possible  to  analyze  the  data 
coming  from  studies  such  as  the  ESS,  EVS,  ISSP, 
or  Eurobarometers  with  modeling  techniques 
with  more  sophisticated  contextual  analysis  tech¬ 
niques  (e.g.,  multilevel  models,  latent  class  analy¬ 
sis,  and  multivariate  techniques  coping  with 
country  effects).  A  new  challenge  is  thus  going  to 
be  the  production  of  data  sets  articulating  the 
microdata  (respondents)  with  proper  contextual 
variables  not  reduced  to  the  macrovariables  at 
the  national  level.  Availability  of  contextual  vari¬ 
ables  at  the  subnational  level  will  be  an  important 
concern  as  well  as  the  availability  of  political 
contextual  variables  “closer”  to  the  respondents 
than  the  national  ones.  The  impact  of  institu¬ 
tional  designs  on  political  attitudes  and  behaviors 
also  could  benefit  greatly  from  more  detailed 
contextual  data. 

Bruno  Cautres 
CEVIP  OF /Sciences  Po 
Paris,  France 
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Cross-Tabular  Analysis 


Cross-tabulation  is  the  statistical  technique  by 
which  two  or  more  discrete  random  variables 
(usually  variables  that  can  take  only  on  a  finite 
number  of  distinct  values)  are  cross-classified  to 
fully  characterize  their  joint  distribution  and 
thereby  the  relations  between  them.  The  analysis 
of  the  resulting  cross-classification  of  frequencies, 
or  contingency  table,  is  most  often  carried  out  to 
establish  relations  of  (in)dependence  between  the 
variables  and  to  gauge  the  strength  of  these  rela¬ 
tions,  although  several  other  objectives  are  equally 
well  served  with  the  use  of  cross-tabular  analysis. 
This  type  of  statistical  analysis  is  relatively  simple 
to  understand,  easy  to  implement,  and  well  suited 
for  a  variety  of  interesting  questions.  As  follows, 
its  major  features  using  a  simple  example  from 
political  science  are  discussed. 

To  illustrate  the  use  of  cross-tabular  analysis, 
consider  the  following  hypothetical  example.  A 
researcher  is  interested  in  understanding  the 
cosponsorship  decisions  of  legislators.  From  a  ran¬ 
dom  sample  of  legislators,  she  or  he  has  collected 
data  on  the  number  of  times  legislators  cospon¬ 
sored  with  members  of  the  parties  they  ran  against 


in  the  previous  election  (i.e.,  their  adversary  par¬ 
ties)  as  well  as  a  dichotomous  measure  of  how 
electorally  vulnerable  legislators  reported  to  have 
felt  in  the  previous  election.  She  or  he  is  interested 
in  knowing  (a)  whether  these  two  variables  are 
discernibly  related  and,  if  they  are,  (b)  how  sizable 
the  effects  of  the  relation  are. 

A  two-way  (i.e.,  including  two  variables),  two- 
by-two  (each  taking  on  two  distinct  values)  contin¬ 
gency  table  reports  the  results  of  her  or  his  data 
collection  (Table  1).  In  this  case,  variables  C  (for 
Column  and  Cosponsorship)  and  R  (for  Row  and 
Reported  vulnerability)  were  laid  out  in  this  par¬ 
ticular  order  following  the  convention  of  placing 
the  categories  of  the  response  variable  in  the  col¬ 
umns  and  the  categories  of  the  explanatory  vari¬ 
able  in  the  rows.  This  is  mere  convention,  and 
transposing  the  table  would  not  lead  to  different 
conclusions.  Three  sample  distributions  can  be 
derived  that  fully  summarize  the  component  vari¬ 
ables:  (1)  the  sample  joint  distribution  of  R  and  C, 
(2)  the  sample  marginal  distribution  of  R  (or  C), 
and  (3)  the  sample  conditional  distribution  of  R 
given  C  (or  C  given  R). 

The  joint  distribution  of  R  and  C  gives  the 
probability  of  classifying  an  observation  in  a  par¬ 
ticular  cell  of  the  table  and  can  be  approximated 
by  calculating  the  proportion  of  the  data  that  fall 
into  each  cell;  this  is  the  sample  joint  distribution. 
The  marginal  distribution  of  R  (or  C)  is  the  prob¬ 
ability  of  classifying  observations  in  a  particular 
row  (or  column)  of  the  table,  and  once  more,  it  can 
be  approximated  by  the  sample  marginal  distribu¬ 
tion  of  R  (or  C),  obtained  by  summing  across  col¬ 
umns  (or  rows)  and  dividing  by  the  total  amount 
of  observations.  Finally,  the  conditional  distribu¬ 
tion  of  R  (or  C)  comprises  the  probabilities  of  clas¬ 
sifying  an  observation  in  a  particular  row  (or  col¬ 
umn)  given  that  they  have  been  classified  in  a 
particular  column  (or  row),  and  it  can  be  approxi¬ 
mated  by  dividing  the  proportion  of  observations 
being  classified  in  a  particular  cell  over  the  overall 
proportion  of  observations  being  classified  in  the 
corresponding  column  (or  row),  yielding  the  sam¬ 
ple  conditional  probability  of  R  given  C  (or  C 
given  R). 

To  answer  question  (a)  from  the  previous  page, 
the  researcher  must  evaluate  the  proposition  of 
independence.  In  the  cross-tabular  setting,  inde¬ 
pendence  means  that  the  probability  of  classifying 
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Table  I  Contingency  Table  of  Cosponsorship  Patterns  Versus  Reported  Vulnerability 

Bills  Cosponsored  With  Adversary  Bills  Not  Cosponsored  With  Adversary 
Reported  vulnerability  568  287 

Did  not  report  vulnerability  94  246 


an  observation  in  a  particular  cell  is  equal  to  the 
product  of  the  (marginal)  probability  of  classify¬ 
ing  an  observation  in  the  corresponding  row  and 
the  (marginal)  probability  of  classifying  an  obser¬ 
vation  in  the  corresponding  column.  The  differ¬ 
ence  between  the  values  expected  under  indepen¬ 
dence  and  the  values  actually  observed  becomes 
the  basis  for  evaluating  question  (a).  More  for¬ 
mally,  the  researcher  must  calculate 

n  2  v  V  Kc  ~  ercf 

Pearson  x  =  L  L - > 

r= 1  c= 1 

where  nrc  is  the  observed  count  at  cell  rc,  and  erc 
is  the  expected  count  at  cell  rc  under  indepen¬ 
dence  (i.e.,  the  product  of  the  marginals).  The 
researcher  then  compares  this  test  statistic  with  an 
X2  distribution  with  (R  —  1)(C  —  1)  degrees  of 
freedom.  First,  the  test  of  question  (a)  (i.e.,  the 
test  of  independence)  can  be  carried  out  by 
obtaining  the  test  statistic,  choosing  a  significance 
level  for  the  test  a,  and  then  comparing  the  value 
of  the  test  statistic  with  that  of  the  quartile  of  the 
X2  table  for  (R  —  1)(C  —  1)  degrees  of  freedom 
and  the  chosen  a  level.  If  the  value  of  the  test  sta¬ 
tistic  is  higher  than  the  resulting  quartile  obtained 
from  the  table,  the  researcher  rejects  the  null 
hypothesis  of  independence. 

This  procedure  is  known  as  the  Pearson  X2  test. 
It  is,  by  far,  the  most  widely  used  test  of  indepen¬ 
dence  and,  for  a  large  number  of  observations,  is 
equivalent  to  other  tests  for  cross-tabulated  data — 
namely,  the  randomization  X2  and  the  likelihood 
ratio  X2.  As  a  rule  of  thumb,  the  expected  cell 
frequencies  under  the  null  hypothesis  of  indepen¬ 
dence  should  exceed  5  for  all  cells  for  the  test  to 
produce  reliable  results.  When  cell  sparseness  is  an 
issue,  Fisher’s  exact  test  of  independence  is  recom¬ 
mended  instead.  Also,  if  at  least  one  of  the  compo¬ 
nent  variables  is  ordinal,  more  powerful  (i.e.,  more 
likely  to  reject  a  false  null  hypothesis)  tests  are 
available,  and  therefore,  their  use  is  recommended 


whenever  appropriate  (e.g.,  the  mean  score  statis¬ 
tic  test). 

In  the  above  hypothetical  case,  the  researcher 
finds  a  test  statistic  of  148.1,  which  was  not  likely 
drawn  from  an  X2  distribution  with  1  =  (R  —  1) 
(C  —  1)  degrees  of  freedom.  Hence,  the  evidence 
supports  the  idea  that  these  two  variables  are,  in 
fact,  not  independent. 

Next,  the  researcher  will  try  to  establish  how 
big  a  substantive  effect  this  dependence  entails — 
question  (b).  The  most  commonly  used  measure  of 
effect  magnitude  is  the  odds  ratio.  Odds  are  a 
measure  of  relative  probabilities,  which  convey 
how  much  more  likely  an  outcome  is  than  its  alter¬ 
native.  Hence,  odds  ratios  are  themselves  relative 
measures  of  how  much  more  likely  an  outcome  is 
than  its  alternative  for  a  given  individual  than  it  is 
for  another  one,  defining  individuals  as  actors  with 
different  probabilities  assigned  to  each  outcome. 
These  ratios  can  take  any  nonnegative  value  and 
are  usually  expected  to  be  different  from  one — a 
ratio  of  one  implying  no  difference  in  the  odds  of 
an  event  for  the  two  different  individuals  (or  com¬ 
pared  groups).  Therefore,  they  can  be  used  to 
investigate  the  effects  of  belonging  to  different 
categories  of  the  R  variable  on  the  odds  of  belong¬ 
ing  to  a  particular  category  in  the  C  variable.  To 
calculate  odds  ratios,  the  sample  joint  probability 
distribution  is  needed  again,  and  the  proportions 
of  observations  of  the  cells  are  denoted  as  prc.  In 
general,  the  odds  ratio  in  a  two-by-two  setting  is 
calculated  by 

n  p  lip  22 

pupil ' 

The  fact  that  main  and  off-diagonal  elements 
are  multiplied  and  divided  earned  the  odds  ratio 
the  name  “cross-product”  ratio,  and  this  is  also  the 
reason  why  the  results  are  impervious  to  table 
transposition.  In  the  case  of  our  researcher,  the 
odds  of  vulnerable  legislators  sponsoring  a  bill 
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with  their  adversaries  are  estimated  to  be  5.18 
times  the  odds  of  a  safe  legislator  sponsoring  a  bill 
with  adversaries.  Finally,  a  confidence  interval  can 
be  calculated  for  this  estimate  of  the  odds  ratio 
based  on  the  normal  distribution.  By  taking  the 
natural  logarithm  of  the  odds  ratio,  its  variance  is 
defined  as 


instance,  observers  evaluating  the  same  phenome¬ 
non.  In  the  last  case,  the  preferred  statistic  is  called 
Cohen’s  kappa. 

Santiago  Olivella 
Washington  University  in  St.  Louis 
St.  Louis,  Missouri,  United  States 
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making  the  expression  for  the  (1  —  a)%  confi¬ 
dence  interval 

iXp^log(Q)  ±  2yVar(l°g(ft))): 

where  z  is  the  standard  normal  quartile  at  the  ( 1  — 
a)  cumulative  probability.  In  the  case  of  our 
researcher,  the  90%  confidence  interval  around 
the  estimated  odds  ratio  is  (4.11,  6.53). 

Once  again,  if  one  of  the  random  variables  in 
the  cross-tabulation  is  ordinal,  other  more  power¬ 
ful  tests  are  available — namely,  the  gamma  test, 
Kendall’s  tau  b,  and  the  lambda  test  (if  both  vari¬ 
ables  are  ordinal). 

These  methods  can  be  extended  to  situations 
involving  more  than  two  rows  and  more  than  two 
columns,  and  they  can  even  be  applied  when  other 
covariates  are  expected  to  mediate  the  relation 
between  a  pair  of  random  variables.  In  these  cases, 
researchers  are  interested  in  partial  associations 
between  random  variables,  conditional  on  the  val¬ 
ues  of  any  number  of  “control”  or  “stratifying” 
variables.  Failure  to  control  for  these  variables  can 
result  in  spurious  results.  In  practice,  for  cross¬ 
tabulations  with  more  than  three  variables,  these 
methods  become  increasingly  harder  to  perform, 
and  more  sophisticated  estimation  procedures 
such  as  those  involved  in  fitting  logistic  models  are 
recommended. 

Finally,  although  these  are  the  two  main  uses  of 
cross-tabulation,  there  are  other  questions  that  can 
be  addressed  via  this  technique.  In  particular,  it  is 
possible  to  perform  cross-tabular  analysis  in  order 
to  judge  homogeneity  across  populations,  to 
answer  questions  about  differences  in  proportions 
across  groups,  or  to  judge  agreement  between,  for 
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Culturalism 


Culturalism,  which  would  ground  explanations  of 
political  phenomena  in  cultures  rather  than  in 
notions  of  structure  or  agency,  is  problematic 
because  of  the  lack  of  consensus  on  how  to  define 
culture.  Moreover,  there  is  little  agreement  on  the 
added  value  of  culturalism:  Whether  culture  is  the 
main  explanatory  variable  or  a  residual  intervening 
factor  is  a  cause  of  enduring  debate  among  schol¬ 
ars.  Culturalism  may  be  seen  as  belonging  to  the 
archaeology  of  politics  as  do  structuralism,  func¬ 
tionalism,  and  behaviorism,  once  very  popular 
themselves.  Alternatively,  it  could  be  viewed  as 
living  a  new  life,  rejuvenated  within  the  field  of 
“cultural  studies,”  which  is  nevertheless  little  con¬ 
cerned  with  politics  but  tries  to  capture  all  sorts  of 
discourses  and  ways  of  life.  Finally,  it  may  be  over¬ 
lapping  with  emerging  paradigms  that  make  an 


Culturalism  511 


extensive  use  of  culture  without  claiming  it — such 
as  constructivism  and  cognitive  science. 

Discussing  culturalism  has  some  virtue.  First,  it 
brings  to  the  fore  some  particular  aspects  of  social 
and  international  life  that  are  ignored  by  “struc¬ 
tural”  or  “rationalist”  theories — such  as  identity 
issues,  which  are  still  divisive  everywhere.  Second, 
its  anthropological  stance  enriches  overly  parsimo¬ 
nious  analyses  of  decision  making  that  rely  on  a 
limited  set  of  factors  to  fit  scientific  standards,  as 
does  “rational  choice.”  Third,  whereas  its  birth 
coincides  with  the  demise  of  functionalism — whose 
self-proclaimed  biological  inspiration  from  the 
19th  century  onward  was  a  source  of  numerous 
caveats — culturalism  was  at  its  apex  when  political 
economy  failed  to  explain  change.  At  the  very 
least,  then,  it  was  instrumental  in  discrediting 
“organicism”  and  Marxism.  Fourth,  although 
excluded  from  most  comparative  research,  it 
recently  made  a  great  comeback  in  international 
studies,  where  the  topics  of  conflicts  between  civi¬ 
lizations  and  intercultural  negotiations  have 
become  popular.  Therefore,  it  is  now  taken  for 
granted  that  culturalism  has  some  merits  to  be 
properly  evaluated. 

Unfortunately,  culturalism  itself  cannot  be  eas¬ 
ily  defined,  since  culture  can  be 

•  what  makes  a  polity  distinct  (“Culture”  with  a 
capital  C); 

•  a  set  of  shared  beliefs  about  causation,  norms, 
and  values  within  a  local,  domestic,  or 
international  community  (“culture”  in  the 
singular);  or 

•  shorthand  for  a  long  list  of  things  that  people 
living  in  the  same  country  have  in  common, 
from  lifestyles  to  sources  of  national  pride,  not 
to  forget  a  common  understanding  of  what 
alternative  policy  issues  are  (“cultures”  in  the 
plural). 

As  for  the  term  culturalism ,  it  suggests 

•  awareness  of  the  possible  contribution  of  cultural 
variables  to  the  aggregated  explanation  of  a 
political  phenomenon  ( “thin”  culturalism)  or 

•  privileging  cultural  factors  over  more  political 
variables  such  as  social  stratification,  ideological 
and  partisan  cleavages,  political  attitudes, 
institutional  constraints,  and  strategic  choice 
(“thick”  culturalism). 


Because  science  is  deterministic  and  multicausal, 
the  second  and  fourth  definitions  are  adopted  here. 
That  both  lack  precision  stems  from  the  ontologi¬ 
cal  status  of  culture:  Becoming  visible  only  through 
transmission  from  one  generation  or  country  to 
the  next  (i.e.,  the  “socialization”  and  “accultura¬ 
tion”  processes),  it  can  only  be  deducted  from  its 
impact  on  receivers. 

A  Short  Genealogy 

Until  the  end  of  the  Cold  War,  sociologism  and 
functionalism  were  deeply  opposed  in  methodol¬ 
ogy  as  well  as  with  respect  to  the  goals  of  social 
science.  On  the  left,  sociologists  of  all  creeds  pre¬ 
dicted  the  end  of  capitalism,  exploitation,  alien¬ 
ation,  and  imperialism.  On  the  right,  functionalists 
assumed  that  social  change  was  not  as  effective  as 
expected:  Because  basic  functions  had  to  be  ful¬ 
filled  at  any  rate,  a  “new  class”  of  people  coming 
from  a  different  background  would  inevitably 
replace  their  predecessors.  Consequently,  there 
would  be  no  predictable  end  of  history. 

Culturalism  seemed  less  divisive  than  its  rivals. 
It  helped  explain  why  some  societies  were  more 
tolerant  than  others  toward  difference,  why  some 
were  risk  taking  and  others  risk  averse,  why  some 
were  stable  and  others  unstable,  and  why  most 
were  lagging  behind  a  minority  of  the  rest  in  eco¬ 
nomic  development  and  political  liberalization. 
For  instance,  “thin”  culturalism  claimed  that 
Africa  and  India  got  a  late  start  in  the  aftermath  of 
independence  not  because  of  colonial  rule  or  cor¬ 
rupted  elites  but  because  investment  was  not  a 
value;  “thick”  culturalism  traced  the  gap  between 
rich  and  poor  nations  in  these  regions  to  an  alleged 
incompatibility  between  religion  and  profit — an 
argument  turned  upside  down  farther  East  when 
an  incredibly  rapid  industrial  growth  was  attrib¬ 
uted  to  “Asian  values.” 

In  spite  of  numerous  caveats,  cultural  differ¬ 
ences  do  have  deep  consequences  for  institutional 
design  and  political  action.  Beliefs  have  an  impact 
on  individual  behavior,  and  the  way  peoples  solve 
their  internal  and  external  conflicts  varies  among 
countries.  Memories  of  conflict  resolution  frame 
current  decisions.  Even  false  beliefs  about  reality 
matter:  The  misperception  of  Arabs’  capabilities 
explain  the  early  defeat  in  1973  of  the  overconfi¬ 
dent  Israelis;  the  belief  that  Iraq  had  weapons  of 
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mass  destruction  contributed  to  support  for  an 
enduring  civil  war  that  Western  conceptions  of 
cost-benefit  analyses  did  not  anticipate;  and  the 
French  or  Americans  may  well  believe  in  their 
“exceptionalism.  ” 

However,  if  culture  reproduced  itself  fully  from 
one  generation  to  the  next,  change  would  be 
impossible.  Deviants,  revolutionaries,  and  reform¬ 
ists  of  all  sorts — those  who  do  not  believe  in  the 
core  values  shared  by  their  fellow  citizens — make 
history,  and  culturalism  is  little  equipped  to  exam¬ 
ine  them.  Could  neo-culturalism  address  change 
better  than  classical  culturalism  does? 

Toward  a  New  Conception  of  Culturalism 

Each  time  a  particular  explanation  of  political  and 
international  life  loses  attractiveness,  a  new  genera¬ 
tion  of  scholars  adapt  it  to  the  social  and  global 
context  of  their  time.  Two  contenders  succeeded 
behaviorism:  political  psychology  and  rational 
choice.  Functionalism  (as  in  European  studies); 
structuralism,  realism,  and  liberalism  (as  in  interna¬ 
tional  relations);  or  institutionalism  (in  policy  analy¬ 
sis)  were  rejuvenated  with  the  addition  of  the  word 
neo  to  their  traditional  denomination.  Culturalism 
did  not  follow  suit,  and  no  scholar  coined  a  hyphen¬ 
ated  “neo-culturalism”  to  save  it  from  growing 
irrelevance.  The  time  has  come  to  argue  that  rejuve¬ 
nating  culturalism  could  open  new  avenues  to  both 
“positive”  and  “meaningful”  knowledge.  This 
would  also  preempt  the  occupation  of  available 
academic  space  by  “constructivism”  (when  it  refers 
to  imagined  communities ,  identity  building,  etc.) 
and  “cognitivism”  (whose  learning  processes  look 
very  similar  to  “socialization”  and  “acculturation”). 

Neo-culturalism  could  be  shorn  of  embarrassing 
claims  that  mattered  much  in  the  demise  of  classical 
culturalism:  On  top  of  the  list  is  an  insistence  on 
“specificity”  that  did  not  bring  to  its  predecessor 
the  benefits  historians  gained  from  their  own  use  of 
“historicity.”  On  the  contrary,  classical  culturalism 
was  deeply  affected  by  critics  of  its  contradictory 
assumptions  that  values  were  either  universal  or 
not  comparable.  Every  society  is  deeply  divided 
and  multicultural  to  some  extent:  Even  the  society 
considered  as  the  most  homogeneous  in  the  world, 
Japan,  is  composed  of  several  communities  of  ori¬ 
gin  and  will  accommodate  more  migrants  in  the 
future.  As  evidenced  in  several  comparative 


surveys,  “pride”  and  “loyalty”  are  shared  less 
widely  across  nations  than  assumed  by  pollsters. 
And  it  is  a  solid  lesson  of  history  that  even  in  the 
ancient  world,  “national”  cultures  never  erased 
local  ones  despite  considerable  efforts  made  by 
writers  and  architects  to  promote  the  official  cul¬ 
ture  in  monuments,  poetry,  epics,  and  drama. 

Neo-culturalism  would  also  show  how  diverse 
were  the  answers  given  to  the  same  questions  in 
world  history  and  what  these  differences  actually 
mean.  For  instance,  public  deliberation  on  policy 
issues  is  universally  attested  in  every  documented 
case,  but  the  format  of  the  debate  differs  from  one 
case  to  another,  and  variations  matter.  The  ico¬ 
nography  of  public  deliberation  is  full  of  insight 
about  the  reasons  why  a  particular  configuration 
was  once  preferred  over  other  alternatives. 

There  is  also  a  realm  in  which  neo-culturalism 
has  no  rival:  tracing  individual  behavior  to  inter¬ 
personal  connections.  This  would  apply  first  to  the 
records  of  reciprocity  and  solidarity  obligations 
and  the  rights  and  duties  assigned  to  each  actor 
according  to  his  or  her  place  in  a  vast  system  of 
interactions.  It  would  also  show  that  lineages  are 
at  the  heart  of  such  networks,  be  they  family  links, 
kinship  ties,  or  relationships  with  neighbors.  After 
all,  the  unexpected  success  of  “social  capital” 
stems  from  its  ability  to  focus  on  networks  of 
interactions  among  people. 

It  may  even  happen  that  cultural  differences 
account  for  variations  in  social  organization,  insti¬ 
tutional  design,  and  policy  making.  The  most 
striking  example  is  the  preference  for  endogamy: 
In  Arabic-speaking  countries,  marrying  a  first 
cousin  is  an  ideal,  as  was  the  case  several  millennia 
ago  in  the  Middle  East.  Being  incompatible  with 
political  alliances,  it  is  not  conducive  to  state  for¬ 
mation,  whereas  exogamic  marriage  prevents  mili¬ 
tary  feuds,  enlarges  property,  and  makes  “foreign” 
rule  acceptable.  On  the  contrary,  breaking  with 
blood  ties  as  in  the  Japanese  adoption  system 
through  which  promising  young  adults  could 
change  family  names  (yosbi)  is  conducive  to  peace 
and  prosperity. 

Finally,  neo-culturalism  has  much  to  say  about 
languages  as  frames  of  perception  of  the  real 
world.  It  casts  new  light  on  the  meaning  of  institu¬ 
tions  that  are  usually  taken  as  essentially  uncon¬ 
tested.  For  instance,  states  (in  the  languages  derived 
from  Latin  or  borrowing  from  it)  are  presumably 


Culturalism  513 


“established,”  whereas  dawla  (in  Semitic  lan¬ 
guages)  experience  revolutionary  cycles.  The  kokul 
goku  conflation  of  nation  and  country  in  North¬ 
eastern  Asia  differs  from  the  Western  practice  of 
hyphenating  states  and  nations  and  is  quite  telling 
about  the  autochthonous  versus  nonautochtho- 
nous  conceptions  of  loyalty.  In  the  same  vein, 
symbolic  distinctions,  for  instance,  commemora¬ 
tions  and  official  narratives,  were  made  possible 
because  people  forgot  the  cleavages  and  hard  dis¬ 
putes  that  gave  birth  to  them,  their  role  being  to 
celebrate  a  current  union  carved  out  of  historical 
divisions. 

The  Added  Value  of  Culturalism 
in  Political  Science 

Cultural  variables  are  useful  when  every  other  fac¬ 
tor  fails  to  give  satisfying  explanations  for  a  phe¬ 
nomenon.  Therefore,  its  minimum  contribution  to 
science  lies  in  its  capacity  to  mobilize  residual 
factors  that  may  have  been  neglected  by  other 
paradigms. 

However,  neo-culturalism  also  underscores  the 
role  of  kinship,  heredity,  and  clienteles  in  politics. 
It  brings  to  the  fore  ethnicity  and  identity  issues 
without  being  prejudiced  against  demands  for  rec¬ 
ognition  that  may  appear  to  be  too  far-fetched  in 
normative  theory.  It  certainly  says  much  more 
about  worldviews  as  conveyed  by  language  and 
their  impact  on  domestic  and  international  politics. 

It  is  difficult  to  forecast  the  future  of  neo-cultur- 
alism  in  political  science.  Whereas  there  is  little 


chance  that  it  will  ever  benefit  from  the  kind  of 
appeal  that  boosted  classical  culturalism  in  the 
1960s  and  early  1970s,  it  may  engender  explana¬ 
tions  complementary  to  more  structural  and  ratio¬ 
nal  theories.  Moreover,  this  paradigm  may  survive 
in  other  guises,  such  as  constructivism  (with  its 
insistence  on  identity  and  recognition  issues)  and 
cognitivism  (with  its  promises  to  better  understand 
how  ideas,  inherited  attitudes,  core  principles,  and 
causal  beliefs  influence  perceptions  of  current 
issues  and  solutions  to  collective  problems). 
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Although  any  data  that  are  safely  stored  and 
retrievable  can  be  described  as  archived  or  archi¬ 
val,  here  the  term  applies  only  to  the  records  of 
people  or  organizations  that  are  created  for  their 
own  use,  whether  or  not  they  have  been  collected 
or  manipulated  by  a  researcher.  Sometimes  the 
records  are  numerical.  Then,  if  the  data  are  judged 
sufficiently  accurate,  complete,  and  relevant,  they 
can  be  compiled  and  analyzed  statistically.  The 
material  is  more  often  plain  text  (or  sometimes 
images)  originally  created  to  meet  contemporary 
personal  or  organizational  requirements.  This 
entry  discusses  challenges  in  obtaining  access  to 
materials,  some  methodological  questions,  the 
impact  of  digitization  of  information,  and  fron¬ 
tiers  in  archival  research. 

Access  to  Materials 

Because  modern  governments  and  most  nongov¬ 
ernmental  organizations  usually  produce  a  written 
record  of  their  activity,  the  variety  of  archival  data 
and  the  purposes  for  which  researchers  use  such 
data  are  too  numerous  to  count.  Probably  the  most 
common  objective  in  academic  research  on  politics 
has  been  to  analyze  high-level  government  deci¬ 
sions,  especially  foreign  policy  decisions.  Archived 
records  are  often  the  most  important  sources  of 
information  about  these  decisions,  and  sometimes 
the  only  one.  Even  though  attempts  to  perform 
process  tracing  on  policy  decisions  can  also  use 


interviews  or  oral  histories,  analyses  of  open  gov¬ 
ernmental  and  nongovernmental  sources,  statisti¬ 
cal  analyses  of  observable  actions,  and  findings 
from  other  published  research,  access  to  even  a 
small  portion  of  the  internal  records  often  pro¬ 
vides  information  not  available  by  other  means, 
and  the  opening  of  formerly  secret  records  often 
leads  to  significant  changes  in  how  decisions  are 
understood. 

Researchers  who  use  archival  data  generally  do 
not  regard  them  as  offering  an  unbiased  and  com¬ 
plete  account  of  events.  Most  governments  do  not 
operate  under  laws  that  establish  a  presumption  of 
public  access  to  government  records.  They  almost 
always  restrict  access  to  much  of  their  records, 
especially  when  they  have  been  recently  created, 
relate  to  foreign  policy,  or  are  politically  sensitive 
for  some  reason.  Although  in  some  countries  the 
law  provides  that  citizens  have  certain  legal  rights 
to  government  information  and  can  request  the 
release  of  restricted  material,  governments  can 
implement  these  laws  in  ways  that  partially  nullify 
their  effect  by  allowing  long  delays,  imposing  large 
fees  for  search  and  document  production,  constru¬ 
ing  search  requests  as  narrowly  as  possible,  and 
sometimes  simply  ignoring  their  legal  obligations. 
Although  determined  petitioners  with  the  requisite 
money,  time,  or  political  influence  can  sometimes 
overturn  these  restrictions,  they  can  frustrate  some 
researchers  and  discourage  others.  Even  when  they 
make  available  a  large  portion  of  their  records  on 
a  subject,  governments  often  hold  back  material 
when  its  release  is  deemed  to  have  significant  nega¬ 
tive  political  consequences.  Governments  also 
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destroy  a  substantial  share  of  the  documents  they 
create,  and  they  often  redact  portions  of  docu¬ 
ments  that  they  release  to  the  public.  They  might 
alter  their  policies  on  the  release  of  restricted  mate¬ 
rial  in  response  to  current  political  events,  in  some 
cases  actually  reimposing  restrictions  on  material 
that  formerly  had  been  available  (this  has  hap¬ 
pened  in  both  the  United  States  and  the  Russian 
Federation). 

Governments  can  facilitate  or  hinder  the  use  of 
archival  material  by  how  they  manage  their 
records.  Even  when  documents  have  been  depos¬ 
ited  in  an  archive,  they  must  usually  be  sorted, 
catalogued,  and  listed  in  a  finding  aid,  checked 
against  mandated  restrictions  on  their  full  or  par¬ 
tial  release,  and  stored  in  suitable  containers. 
Funding  levels  and  legal  or  donors’  restrictions  can 
greatly  affect  the  rate  at  which  processing  can  pro¬ 
ceed,  and  fees  for  photocopying  or  retrieving  mate¬ 
rial  will  constrain  those  without  the  necessary 
funding.  Archives  typically  hold  large  inventories 
of  unprocessed  materials;  how  much  value  this 
material  has  to  researchers  depends  not  only  on 
how  rapidly  it  is  processed  but  also  on  how  archi¬ 
vists  use  any  discretion  that  they  might  possess  to 
alert  researchers  to  unprocessed  material  that 
might  be  relevant  to  their  research  interests.  In  the 
United  States,  the  delays  in  governments’  response 
to  many  requests  for  declassification  are  a  signifi¬ 
cant  hindrance  to  the  research  for  books,  articles, 
theses,  and  dissertations  that  rely  on  such  material, 
as  researchers  often  face  deadlines  that  might  be 
impossible  to  meet  if  important  material  has  not 
been  promptly  declassified. 

Methodological  Questions 

Very  little  has  been  written  about  the  methodology 
of  archival  research.  A  pioneering  effort  is  Marc 
Trachtenberg’s  The  Craft  of  International  History, 
which  discusses  search  strategies,  evaluations  of 
document  quality  and  significance,  and  how  mul¬ 
tiple  sources  can  be  used  to  cross-check  document 
contents  and  researchers’  conclusions. 

In  a  country  such  as  the  United  States,  the  mas¬ 
sive  holdings  of  the  National  Archives  (as  of  2003: 
2.8  million  cubic  feet  of  paper  text  and  4.7  billion 
electronic  records)  require  that  researchers  who 
intend  to  investigate  this  material  possess  some 
framework  or  theory  that  guides  their  search. 


Relying  on  material  used  by  previous  research  or 
presented  in  government  publications  as  a  starting 
point  is  common.  If  one  views  archival  records  as 
documenting  the  flow  of  information  within  a  sin¬ 
gle  social  network,  the  choice  of  starting  point 
might  not  seem  crucial;  in  principle,  all  one  needs  to 
do  is  to  follow  the  documentary  trail  as  it  takes  the 
researcher  first  to  one  decision  maker,  then  to  oth¬ 
ers,  until  the  full  network  is  covered.  As  a  practical 
matter,  several  factors  often  make  tracing  the  infor¬ 
mation  flows  quite  difficult  and  comprehensive 
coverage  unlikely.  The  organization  of  material  is 
often  dependent  on  the  personal  practices  of  specific 
officials,  agencies,  or  presidential  administrations, 
and  learning  where  to  search  is  not  always  quick  or 
easy.  Especially  sensitive  information  might  never 
be  written  down.  Even  when  it  is,  it  might  be  stated 
in  a  blind  memo  or  worded  such  that  is  very  diffi¬ 
cult  to  understand  without  very  good  knowledge  of 
the  historical  context.  It  is  sometimes  difficult 
to  know  whether  an  official  has  actually  read  a 
document — as  opposed  to  him  merely  being  on  the 
distribution  list.  Finally,  the  time  and  resources 
available  generally  are  insufficient  to  examine  all 
material,  so  some  sampling  procedure  is  usually 
followed,  even  if  only  implicitly. 

The  interaction  of  the  sampling  method  with  the 
decision  to  begin  with  the  material  used  in  previous 
research  involves  some  risk.  For  example,  in  study¬ 
ing  diplomatic  history,  a  presupposition  and  a 
consequence  of  archival  research  that  follows  such 
a  strategy  is  that  the  information  collected  will  say 
relatively  little  about  officials’  interaction  with  pri¬ 
vate  persons.  Government  publications  of  archival 
material  ordinarily  include  very  few  items  that 
were  not  created  by  government  officials.  Although 
archives  often  contain  correspondence  files  that 
contain  letters  from  private  persons  and  less  often 
records  of  office  appointments,  telephone  calls,  or 
other  interactions  with  nongovernmental  contacts, 
it  might  require  a  fair  amount  of  contextual  knowl¬ 
edge  to  appreciate  which  contacts  are  significant 
and  which  are  not.  The  impression  that  such  inter¬ 
actions  have  little  impact  is  further  strengthened  by 
a  widely  followed  practice  of  writers  of  govern¬ 
mental  documents,  who  typically  justify  and 
explain  policies  in  terms  of  plausible  “public  inter¬ 
est”  claims.  This  strategy  is  sensible  because  such 
justifications  by  definition  will  have  a  wide  appeal, 
and  because  considerations  of  narrow  political 
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advantage  are  often  viewed  as  illegitimate  motiva¬ 
tions  for  government  policy.  While  scholarship 
that  emphasizes  the  role  of  interpretation  in  the 
reading  and  writing  of  texts  has  become  one  of  the 
main  intellectual  currents  within  the  social  sciences 
over  the  past  quarter  century,  its  concern  with 
“authorial  positioning”  and  interpretation  as  a 
choice  rather  than  a  reflex  seems  to  have  been 
focused  more  often  on  those  who  read  and  write 
about  the  documents  than  on  those  who  create 
them. 

Source  evaluation  and  interpretation  is  also  a 
challenge  that  can  typically  be  met  only  with  a 
good  understanding  of  the  context  within  which 
material  was  created.  Before  investigating  an 
archive,  one  usually  has  acquired  some  back¬ 
ground  knowledge  from  previous  research.  At  a 
minimum,  this  provides  a  list  of  the  most  visible 
participants  in  the  formal  decision-making  pro¬ 
cess,  their  most  visible  concerns,  and  some  concep¬ 
tion  of  the  flow  of  events.  Information  about 
informal  mechanisms  and  more  sensitive  (and  thus 
more  private)  concerns  and  information  is  valu¬ 
able  in  opening  a  path  to  understanding  the  less 
obvious  factors  bearing  on  a  decision.  As 
Trachtenberg  explains,  prior  understandings  held 
at  the  outset  of  archival  research  are  generally 
reshaped  by  the  encounter  with  new  material.  This 
creates  a  dynamic  process  in  which  interpretation 
of  new  material  triggers  changes  in  the  interpreta¬ 
tion  of  material  previously  collected,  which  then 
alter  search  patterns  or  the  interpretation  of  addi¬ 
tional  new  material.  As  the  amount  of  potentially 
relevant  material  is  often  vast  and  cannot  be  inter¬ 
preted  by  any  one  individual,  this  process  is  a 
social  rather  than  merely  a  personal  activity. 

Impact  of  Digitization  of  Information 

The  widespread  computerization  of  records  and 
the  shift  to  electronic  communications  such  as 
e-mail  and  texting  has  had  complex  effects  on 
archival  research.  Making  document  images  or 
files  available  on  the  Internet  makes  them  much 
more  accessible,  but  only  a  very  small  proportion 
of  government  and  nongovernment  records  are 
available  this  way.  Having  finding  aids  and  cata¬ 
logues  available  online  is  a  significant  advance, 
making  it  possible  to  learn  in  detail  about  archival 
collections  without  needing  to  visit.  If  documents 


were  originally  created  in  electronic  form,  archival 
storage  of  the  documents  is  facilitated,  provided 
durable  and  stable  storage  media  are  used.  If  text 
is  machine  readable,  then  it  is  much  easier  to 
search  it  systematically,  analyze  it  with  text  analy¬ 
sis  software,  and  excerpt  it  for  teaching  or  publica¬ 
tion  purposes.  However,  whether  governments 
will  decide  that  e-mails,  texts,  and  other  electronic 
communications  are  as  “real”  as  paper  documents, 
and  devote  substantial  effort  to  preserving  them,  is 
difficult  to  predict. 

Archival  Research 

Trachtenberg  is  the  first  scholar  to  present  system¬ 
atic  “declassification  analysis,”  which  inspects 
streams  of  declassified  material  on  a  topic  over  an 
extended  period  to  infer  the  basis  for  declassifica¬ 
tion  decisions  from  the  characteristics  of  what  was 
released  and  withheld  at  various  times.  This  practice 
also  illuminates  what  material  a  government  believed 
to  be  especially  sensitive,  though  that  understanding 
is,  of  course,  heavily  context  dependent. 

Text  analysis  software  is  now  beginning  to  be 
used  in  archival  research.  Even  ordinary  word  pro¬ 
cessors  can  facilitate  analysis  by  searching  text 
databases  for  strings  of  text.  Specialized  software 
provides  for  more  complex  search  strategies  (such 
as  Boolean  searches),  attaching  annotative  tags  to 
text,  and  relatively  easy  and  rapid  recombining  of 
document  excerpts. 

Government  information  policy  is  a  topic  that 
has  received  little  scholarly  attention  in  political 
science.  In  view  of  its  pervasive  effect  on  how  citi¬ 
zens  and  researchers  understand  government,  it  is 
a  topic  that  should  receive  significantly  more 
attention. 

Timothy  J.  McKeoiun 
University  of  North  Carolina  at  Chapel  Hill 
Chapel  Hill,  North  Carolina,  United  States 

See  also  Data,  Textual;  Discourse  Analysis;  Process 
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Missing  data  problems  arise  frequently  in  social 
science  applications.  Household  survey  respon¬ 
dents  may  fail  to  provide  responses  to  questions  for 
reasons  of  refusal,  fatigue,  enumerator  error  or 
perhaps  a  split  sample  design  is  used,  in  which  case 
the  missingness  is  an  intentional  strategy  to  con¬ 
serve  resources.  Personal  income  is  often  missing  in 
surveys  due  to  respondents’  refusal  to  reveal  what 
they  earn.  Cross-national  data  on  macroeconomic 
conditions  may  fail  to  include  entries  for  some 
countries  in  some  years  due  to  crises,  lack  of  capac¬ 
ity  to  generate  data  regularly,  or  ex  post  realization 
that  past  reporting  methods  were  flawed.  Measures 
of  national  income  inequality  are  often  missing  for 
some  countries  in  some  years  in  cross-national  data 
because  their  calculation  requires  an  income  sur¬ 
vey;  such  surveys  are  too  expensive  for  some  coun¬ 
tries  to  implement  every  year.  In  the  household 
survey  example,  the  analyst  must  decide  whether 
and  how  to  use  data  from  respondents  who  refused 
to  state  their  income.  In  the  cross-national  exam¬ 
ple,  the  same  issue  arises  for  country-years  for 
which  the  inequality  measure  is  missing.  These  are 
instances  of  what  analysts  may  label  as  “item  miss¬ 
ingness.”  This  is  the  focus  of  this  entry. 

The  “items”  are  the  individual  survey  questions 
or  bits  of  information.  The  idea  is  that  for  each 
household  or  country,  analysts  have  information 
on  some  items  but  not  for  others.  This  is  to  be 
distinguished  from  “unit  missingness,”  where 
“units”  refer  to  units  of  observation — the  house¬ 
holds  or  country-years  in  the  examples  above. 


That  is,  unit  nonresponse  refers  to  the  situation  in 
which  data  collection  failed  to  accumulate  any 
data  for  some  units,  for  example,  some  households 
were  unreachable  or  refused  to  participate  alto¬ 
gether,  or  some  country-years  contained  no  usable 
data  on  any  variables.  Fixes  for  unit  nonresponse 
in  surveys  include  ex  post  weighting  (via  post¬ 
stratification)  to  externally  available  population 
information,  such  as  a  census.  For  cross-national 
data,  the  analyst  typically  has  no  way  to  adjust  for 
unit  missingness,  and  so  he  or  she  is  required  to 
state  that  the  results  hold  only  for  the  subpopula¬ 
tion  for  which  data  are  available. 

Listwise  Deletion 

By  far  the  most  common  approach  to  missing  data 
in  political  science  has  been  to  simply  drop  incom¬ 
plete  cases,  something  known  as  “listwise  dele¬ 
tion”  in  the  methodological  literature.  This  is  tan¬ 
tamount  to  taking  cases  of  item  missingness  to  be 
instances  of  unit  missingness.  In  special  circum¬ 
stances,  ignoring  cases  exhibiting  item  or  unit 
missingness  is  a  perfectly  reasonable  thing  to  do. 
For  a  unit  i,  consider  an  outcome  variable — call  it 
y. — and  a  set  of  predictor  variables  recorded  for 
the  unit — call  them  x;,  collectively.  Suppose  that 
the  distribution  of  yt  depends  on  xt  in  a  manner 
characterized  by  a  probability  density  function  f(.). 
In  that  case,  the  analyst  can  write  the  probability 
density  of  yt  given  x  as  f(y\xt).  An  example  would 
be  a  linear  structural  model  of  the  form  y;  =  |30  + 
fhx^  +  P2xi2  +  £„  with  £,  ~  N( 0,  cr2).  In  this  case, 
xi  =  (xn>  xa)>  and 

The  goal  of  data  analysis  in  such  a  circum¬ 
stance  is  to  estimate  the  parameters  that  charac¬ 
terize  f(y,\Xj) — here,  the  (3  coefficients  and  the 
variance,  a2,  of  the  error  term,  £(.  Listwise  deletion 
allows  the  analyst  to  do  so  without  bias  as  long  as 
two  conditions  hold:  (1)  the  analyst  has  correctly 
specified  the  functional  form  of /"(y;lx,),  and  (2)  the 
probability  that  an  item  for  i  is  missing  systemati¬ 
cally  depends  only  on  the  elements  of  x,  that  are 
always  observed  for  all  units. 
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This  is  an  important  result,  but  one  should 
understand  that  Conditions  1  and  2  are  never 
exactly  true  in  social  science  data  analysis.  In 
social  science,  analysts  work  with  approximations 
(e.g.,  regressions  with  linear  or  maybe  quadratic 
terms)  of  the  complex  and  unknown  functional 
forms  that  govern  the  relationships  in  their  data. 
This  violates  Condition  1 .  The  result  for  unbiased¬ 
ness  does  not  hold  for  estimating  parameters  in  an 
approximation  model.  Missingness  may  depend 
on  the  value  of  yi  or  the  value  of  the  missing  item 
itself  (if  other  than  yt),  violating  Condition  2. 
Therefore,  using  listwise  deletion  requires  a  leap  of 
faith:  Analysts  assume  that  the  inevitable  bias  is 
small  and  thus  of  little  importance.  This  may  or 
may  not  be  true. 

Even  when  bias  is  not  a  major  concern,  listwise 
deletion  can  be  highly  inefficient.  It  can  cause 
analysts  to  discard  lots  of  information  that  could 
improve  the  certainty  of  their  inferences  about 
parameters  of  interest.  To  see  this,  suppose  that 
an  analyst  has  a  data  set  with  five  variables  that 
are  needed  for  the  analysis.  Suppose  each  variable 
has  a  5%  chance  of  being  missing.  The  analyst 
expects  to  have  recorded  information  95%  of  the 
time  for  each  variable,  but  listwise  deletion 
implies  that  about  (1  —  ,95s)  X  100%  =  23%  of 
the  observations  will  be  discarded.  This  is  a  size¬ 
able  amount  of  data  to  discard,  making  infer¬ 
ences  considerably  less  precise.  The  analyst  would 
be  throwing  away  many  cases  with  only  one  miss¬ 
ing  item,  which  is  wasteful.  There  are  better  ways 
to  proceed. 

Multiple  Imputation 

This  entry  does  not  discuss  what  is  known  as 
“selection  bias,”  where  missingness  occurs  only 
on  an  outcome  variable  that  cannot  be  observed 
or  where  missingness  is  a  result  of  actions  of 
actors  that  should  be  modeled.  Such  missingness 
occurs,  for  example,  when  analysts  do  not 
observe  the  votes  of  those  who  abstain  or  the 
outcomes  of  wars  that  states  decide  not  to  engage 
in.  The  remainder  of  this  entry  is  concerned  with 
cases  of  missing  data  for  items  that  are  either 
designated  as  “predictor”  variables  in  a  regres¬ 
sion  analysis,  can  sometimes  serve  as  predictors 
and  sometimes  as  outcomes,  or  will  be  used  for 


descriptive  analysis  (e.g.,  to  estimate  a  popula¬ 
tion  mean).  The  presentation  is  based  on  Rod 
Little  and  Donald  Rubin’s  seminal  book  Statistical 
Analysis  With  Missing  Data,  first  published  in 
1987. 

Imputation  would  seem  to  offer  a  solution. 
Imputation  refers  to  “filling  in”  (i.e.,  “imputing”) 
missing  items  with  reasonable  estimates  of  what 
those  items  should  be.  Simple  imputation  methods 
include  selecting  item  values  from  similar  units 
(what  is  called  “hot  deck”  imputation),  imputing 
the  mean  or  median  value,  or  using  a  regression  to 
predict  a  conditional  mean.  Census  bureaus  and 
other  statistical  agencies  employed  such  simple 
methods  until  recently.  As  Little  and  Rubin  have 
helped  clarify,  such  simple  imputation  methods 
can  create  problems.  Lirst,  they  distort  covariance 
relationship  between  variables  in  a  data  set,  intro¬ 
ducing  bias  into  subsequent  analysis.  Second, 
imputing  a  single  item  and  then  treating  it  as  the 
“true”  item  value  for  that  unit  ignores  one’s  uncer¬ 
tainty  about  whether  or  not  the  imputed  item 
value  is  correct. 

Lor  these  reasons,  methodologists  have  devel¬ 
oped  “multiple  imputation”  as  a  statistically  sound 
way  to  address  item  missingness  in  a  principled 
manner.  This  method  takes  care  to  preserve  the 
overall  covariance  structure  of  the  data  set  by, 
essentially,  adding  the  right  kind  of  noise  to  the 
imputations.  Uncertainty  about  the  imputed  items 
is  propagated  by  generating  multiple  data  sets  in 
which  missing  items  are  imputed.  The  substantive 
statistical  analysis  is  then  conducted  on  each  of 
these  data  sets,  and  the  results  are  averaged. 
Standard  error  estimates  incorporate  both  within- 
sample  variability  and  the  variability  across  the 
imputed  data  sets.  The  method  and  accompanying 
theory  were  developed  in  Rubin’s  1987  work, 
Multiple  Imputation  for  Nonresponse  in  Surveys. 

Multiple  imputation  requires  that  either  miss¬ 
ingness  is  completely  random  or  variables  exhib¬ 
iting  missingness  can  be  modeled  well  using 
other  variables  in  the  data  set.  This  is  an  instance 
of  what  statisticians  refer  to  as  “ignorability.” 
This  is  not  true  when  the  missingness  pattern  in 
a  variable,  y,  depends  on  the  values  of  y  itself  in 
such  a  manner  that  other  variables  in  the  data  set 
are  of  limited  use  in  modeling  this  pattern.  The 
latter  case  is  called  “nonignorable”  missingness 
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and  can  be  corrected  only  by  making  non- 
innocuous  assumptions  about  how  the  missing 
data  behave. 

Many  multiple  imputation  algorithms  have 
been  incorporated  into  commonly  used  statistical 
software  packages.  The  two  most  common  types 
of  algorithms  are  those  that  use  (1)  a  multivariate 
model  of  the  entire  data  set  or  (2)  a  series  of  regres¬ 
sions.  An  example  of  the  first  type  is  the  Amelia 
algorithm,  first  introduced  by  Gary  King,  James 
Honaker,  Anne  Joseph,  and  Kenneth  Scheve.  Their 
article,  “Analyzing  Incomplete  Political  Science 
Data,”  may  be  credited  with  doing  the  most  to 
make  political  scientists  think  about  using  multiple 
imputation  rather  than  listwise  deletion.  Amelia 
models  the  entire  data  set  with  a  multivariate  nor¬ 
mal  model.  It  uses  a  novel  iterative  approach  to 
estimate  all  of  the  mean  and  covariance  parameters 
for  the  model  and  fills  in  the  missing  data  with 
random  draws  from  the  model.  Regression-based 
algorithms  are  based  on  the  following  idea:  For  a 
variable,  y,  exhibiting  missingness,  a  regression 
model  using  the  other  variables  in  the  data  set  is 
specified.  Then,  the  parameters  of  the  regression 
model  are  estimated,  including,  importantly,  the 
variance  of  the  random  errors  and  terms  character¬ 
izing  uncertainty  about  the  parameters  of  the 
model.  Then,  missing  values  on  y  are  filled  in  with 
random  draws  based  on  the  fitted  model  and 
incorporating  all  sources  of  random  variability  in 
the  model  and  the  parameter  estimates.  Because 
the  regression  model  will  generally  include  predic¬ 
tors  that  themselves  exhibit  missingness,  an  itera¬ 
tive  sampling  algorithm  is  used  to  fit  the  models. 
Regression-based  algorithms  may  alternatively  use 
“predictive-mean  matching,”  which  allows  the 
analyst  to  avoid  having  to  model  the  data  directly. 
Predictive-mean  matching  uses  linear  approxima¬ 
tions  to  match  units  that  exhibit  missingness  to 
“donor”  units  that  do  not  exhibit  missingness. 
Missing  values  are  filled  in  with  the  values  from 
the  donors. 

Other  alternatives  exist  in  principle  for  resolv¬ 
ing  problems  of  item  missingness.  These  include 
maximum  likelihood  or  Bayesian  methods  that 
skip  the  imputation  process  and  directly  adjust  the 
substantive  analysis  by  taking  missingness  patterns 
into  account.  Such  approaches  are  rarely  used  in 
practice,  however,  because  they  require  significant 
application-specific  statistical  programming.  Such 


methods  have  not  been  shown  to  perform  any  bet¬ 
ter  than  multiple  imputation,  although,  as  men¬ 
tioned,  the  number  of  attempts  are  few. 

Cyrus  Samii 
Columbia  University 
New  York  City,  New  York,  United  States 
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Data  of  interest  to  a  political  scientist  often  refer 
to  attributes  of  units — for  example,  individuals  or 
nation-states — that  have  a  spatial  location. 
Although  analysts  typically  disregard  geographi¬ 
cal  position  and  spatial  configuration  and  treat 
their  data  as  collections  of  independent  units, 
there  are  many  reasons  why  it  may  be  essential  to 
consider  the  spatial  ordering  of  the  data.  In  par¬ 
ticular,  attention  to  the  spatial  dimension  can  help 
generate  new  insights  and  may  be  essential  for 
drawing  correct  inferences  about  the  influence  of 
other  features  in  statistical  analyses,  even  when 
spatial  patterns  are  not  the  researcher’s  primary 
concern.  In  the  following,  the  impact  of  spatial 
relations  and  their  use  in  regression  analyses  are 
discussed. 

Spatial  Relations 

One  way  to  appreciate  the  importance  of  spatial 
patterns  in  data  is  to  consider  the  analogy  to  time- 
series  data.  Researchers  are  increasingly  sensitive 
to  how  individual  data  points  collected  over  time 
often  will  not  be  independent  of  one  another,  as 
many  social  and  political  processes  display  clear 
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trends  and  persistence  over  time.  To  use  a  concrete 
example,  consider  how  people’s  evaluations  of  an 
incumbent  government  typically  change  slowly 
over  time.  Data  on  approval  taken  at  two  points 
that  are  close  in  time  are  thus  likely  to  display 
similar  values  or  serial  correlation.  It  would  be 
misleading  to  consider  the  individual  observations 
a  random  sample,  and  analysts  will  need  to  take 
into  account  their  temporal  ordering  if  they  want 
to  examine  how  other  features  such  as  economic 
performance  may  influence  changes  in  approval.  If 
economic  performance  is  believed  to  influence 
approval  and  such  data  also  display  a  trend  over 
time,  then  analysts  risk  making  spurious  inferences 
if  they  do  not  take  into  account  the  possibility  of 
serial  correlation. 

Social  science  researchers  have  been  much  less 
attuned  to  the  possibility  that  data  collected  for 
units  at  the  same  point  in  time  may  not  be  indepen¬ 
dent  of  one  another  across  space.  Francis  Gabon 
pointed  out  in  an  early  influential  comment  on  an 
article  comparing  marital  institutions  across  societ¬ 
ies  that  the  observed  outcomes  could  result  from 
diffusion  between  societies  rather  than  independent 
processes  operating  in  isolation  in  each  unit.  Stated 
differently,  common  marital  institutions  may  not 
reflect  a  functional  relationship  to  certain  shared 
social  characteristics,  as  implied  by  the  author  of 
the  article  Gabon  commented  on,  but  could  stem 
from  a  diffusion  process  in  which  societies  may  be 
likely  to  adopt  the  institutions  found  in  other  soci¬ 
eties  that  they  interact  with.  Distinguishing  between 
the  results  of  independent  outcomes  within  each 
unit  and  diffusion  may  be  difficult,  hence  the  term 
Gallon’s  problem.  For  example,  societies  that 
interact  more  with  one  another  may  be  similar  in 
other  respects,  such  as  in  economic  or  other  social 
structures.  Analyzing  the  observations  as  indepen¬ 
dent  units  and  looking  for  functional  relationships 
between  social  characteristics  and  marital  institu¬ 
tions  would  run  the  risk  of  making  unwarranted 
inferences  about  causal  relationships  from  social 
characteristics  to  marital  institutions,  unless  the 
possibility  of  diffusion  or  spatial  dependence 
among  units  is  taken  into  account. 

There  are  many  reasons  to  suspect  that  political 
and  social  phenomena  often  reflect  diffusion  or 
spatial  dependence,  in  the  sense  that  observations 
that  are  “near”  or  “connected”  in  some  manner 
tend  to  have  similar  values.  For  example,  individual 


attitudes  may  not  just  reflect  an  individual’s  own 
characteristics  but  could  also  be  influenced  by  the 
people  that  this  person  interacts  with.  As  such,  two 
individuals  who  are  connected  by  common  ties 
may  have  similar  attitudes,  even  if  they  at  a  first 
glance  would  seem  very  different  on  other  social 
characteristics.  Likewise,  policies  and  institutions 
may  be  influenced  by  the  policies  and  institutions 
adopted  by  other  states.  For  example,  the  pros¬ 
pects  for  democratic  reform  in  autocracies  may  be 
influenced  not  just  by  domestic  events  and  charac¬ 
teristics  but  also  by  international  ties  and  events  in 
other  states.  Likewise,  certain  policies  such  as 
smoking  bans  have  evolved  from  being  perceived 
as  an  extreme  form  of  legislation  to  become  a 
mainstream  policy  adopted  by  many  states  with 
very  different  characteristics.  Beth  Simmons,  Frank 
Dobbin,  and  Geoffrey  Garrett  provide  a  typology 
of  different  diffusion  mechanisms  for  policies  and 
institutions,  distinguishing  between  coercion,  com¬ 
petition,  learning,  and  emulation.  Attention  to 
spatial  dependence  can  be  very  helpful  in  learning 
new  insights  or  developing  new  hypotheses  about 
the  phenomenon  of  interest.  However,  it  is  unlikely 
that  a  single  theory  of  diffusion  or  spatial  depen¬ 
dence  will  be  generally  applicable,  and  such  theo¬ 
rizing  is  probably  best  done  on  a  case-specific 
basis. 

More  generally,  there  is  spatial  clustering  or 
dependence  if  the  values  of  a  response  variable  for 
two  units  that  are  linked  or  connected  tend  to  be 
similar,  so  that  positive/negative  deviations  from 
the  sample  average  for  one  unit  are  likely  to  go 
together  with  similar  deviations  from  the  average 
for  the  other  unit.  The  map  of  autocracies,  anocra- 
cies,  or  democracies  in  2004,  as  classified  by  the 
Polity  data  (Figure  1),  provides  an  example  of 
spatial  dependence,  in  the  sense  that  democracies 
are  much  more  likely  to  have  democratic  neigh¬ 
bors  and  autocracies  are  much  more  likely  to  be 
surrounded  by  other  autocracies. 

The  consequences  of  spatial  dependence  for 
statistical  analysis  are  similar  to  the  consequences 
of  serial  correlation  over  time.  Although  spatial 
correlations  in  principle  could  be  both  positive  or 
negative,  nontrivial  spatial  correlations  in  the 
social  sciences  tend  to  be  positive,  and  this  sec¬ 
tion  focuses  on  this  case  (although  one  could 
imagine  negative  correlation  arising,  e.g.,  through 
free-riding  behavior,  where  higher  spending  by 
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Figure  I  Political  Regimes  in  2004  According  to  Polity  Data 


one  unit  makes  other  connected  units  likely  to 
spend  less).  In  a  linear  regression  setting,  the  coef¬ 
ficient  estimates  are  no  longer  efficient  estimates  of 
the  true  parameters  if  one  ignores  spatial  depen¬ 
dence  and  could  be  substantially  off  the  correct 
values.  Moreover,  the  standard  errors  assuming 
independence  are  incorrect  and  generally  too  small 
in  the  presence  of  positive  spatial  correlation.  A 
helpful,  intuitive  way  to  understand  why  standard 
errors  are  likely  to  be  too  small  in  the  presence  of 
spatial  correlation  is  to  consider  its  impact  on 
effective  sample  size.  When  observations  are  seri¬ 
ally  correlated,  either  in  time  or  in  space,  so  that 
close  observations  are  similar  to  one  another,  the 
sample  does  not  contain  as  much  independent 
information  as  would  be  suggested  by  the  apparent 
sample  size  N.  The  normal  standard  error  estimate 
ignores  the  correlation  between  nearby  observa¬ 
tions  and  will  thus  understate  the  true  variance. 

To  detect  whether  data  display  spatial  cluster¬ 
ing  first  requires  the  specification  of  what  is  meant 
by  “space.”  In  general,  the  structure  of  spatial 
dependence  must  be  prespecified  by  the  analyst, 
and  it  is  only  possible  to  test  for  a  specific  form  of 
spatial  dependence.  Whereas  time  has  an  inherent 
ordering  and  data  are  usually  collected  at  intervals 
that  suggest  likely  temporal  dependence  struc¬ 
tures,  space  is  multidimensional  and  there  may  be 
many  possible  forms  of  dependence  between 
observations.  Since  much  interaction  tends  to  be 


geographically  determined  so  that  units  that  are 
close  to  one  another  tend  to  interact  more,  geo¬ 
graphical  distance  is  often  considered.  However, 
one  can  imagine  forms  of  dependence  other  than 
geographical  distance  being  important.  For  exam¬ 
ple,  observed  indicators  of  interactions  such  as 
trade  could  be  used  to  specify  likely  patterns  of 
dependence,  or  one  could  consider  links  between 
units  that  are  similar  on  some  other  criterion,  such 
as  a  shared  language. 

Regression  Analysis 

Given  some  criteria  for  potential  spatial  depen¬ 
dence,  the  analyst  can  represent  the  potential  struc¬ 
ture  of  dependence  as  a  graph,  specifying  how  units 
are  connected  to  one  another,  or  as  a  connectivity 
matrix  W,  where  the  value  of  an  entry  ma.  indicates 
whether  (and  to  what  extent)  one  unit  i  is  linked  or 
connected  to  another  unit  /.  Connectivity  matrices 
can  represent  a  very  wide  range  of  possible  depen¬ 
dence  structures.  The  connectivity  matrix  may  be 
based  on  a  single  threshold,  where  two  units  i  and 
/  are  considered  connected,  and  their  corresponding 
entries  w:j  acquire  nonzero  values  if  they  meet  this 
threshold.  Alternatively,  the  connections  could  be 
weighted — in  the  sense  that  units  are  “more”  con¬ 
nected  the  closer  two  units  are  on  some  criterion — 
for  example,  with  values  based  on  inverse  decay  of 
the  distance  between  units.  A  connectivity  matrix 
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could  be  symmetric,  to  reflect  a  reciprocal  relation¬ 
ship,  where  a  link  from  i  to  j  goes  together  with  a 
link  from  j  to  i;  or  it  could  be  directed  in  an  asym¬ 
metric  manner,  so  that  the  fact  that  i  influences  j 
does  not  necessarily  imply  that  there  is  a  reverse 
link  from  /  to  i. 

A  variable  y  displays  spatial  correlation  to  the 
extent  that  the  value  of  y  for  one  individual  unit  i 
is  similar  to  the  values  of  y  for  proximate  or  con¬ 
nected  observations  /.  Since  a  single  unit  usually 
has  many  connected  units,  one  must  take  into 
account  the  degree  of  similarity  across  the  con¬ 
nected  observations  to  assess  the  overall  spatial 
correlation.  For  many  purposes,  it  is  useful  to  row- 
normalize  the  connectivity  matrix  so  that  all  the 
connectivities  for  a  single  unit  sum  to  1.  This  is 
often  referred  to  as  a  standardized  spatial  weights 
matrix  W  in  the  spatial  econometrics  literature. 
This  allows  one  to  generate  a  “spatial  lag”  of  a 
variable  y  by  postmultiplying  it  with  the  weights 
matrix  W — that  is,  W y.  The  individual  elements  of 
the  resulting  vector  for  an  observation  i  can  be 
interpreted  as  the  average  of  y  among  neighbors  or 
the  other  observations  j  that  are  connected  to  i  fol¬ 
lowing  the  specification  of  the  connectivity  matrix 
W.  A  variable  displays  positive  spatial  correlation 
if  the  values  of  y  for  one  unit  i,  y;,  are  similar  to  the 
values  for  its  connected  observations  given  by  row 
i  of  the  connectivity  matrix — that  is,  ivy. 

One  useful  summary  measure  for  spatial  corre¬ 
lation  of  a  variable  y  is  Moran’s  I  correlation  coef¬ 
ficient.  This  is  based  on  comparing  the  deviations 
from  the  mean  of  y  for  a  unit  i  with  the  deviations 
from  those  of  its  neighbors — that  is, 

j  _  NLE>,>y,w-/(y-  ~  y)(y>  -  y) 

“  y )2 

A  standard  error  can  be  approximated  for  Moran’s 
I,  based  on  sampling  assumptions,  which  in  turn 
allows  testing  whether  the  spatial  clustering  is 
significant  or  deviates  from  what  would  be 
expected  if  there  were  no  spatial  pattern.  Moran’s 
I  can  be  given  a  useful  graphical  interpretation. 
Figure  2  shows  a  plot  of  a  variable  (in  this  case  the 
Polity  democracy  scale  in  2004)  on  the  horizontal 
axis,  standardized  to  have  a  mean  of  0  and  a 
standard  deviation  of  1,  against  its  spatial  lag  on 
the  vertical  axis  (here  based  on  taking  countries 
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within  500  kilometers  of  the  outer  boundaries  of  a 
state  as  connected  to  one  another).  Drawing 
vertical  and  horizontal  lines  through  the  origin 
(0,  0)  divides  the  figure  into  four  quadrants,  where 
the  values  of  the  individual  observations  variable 
are  either  below  or  above  the  mean  on  the  input 
variable  and  its  spatial  lag.  If  there  is  positive 
spatial  correlation,  one  would  expect  to  see  positive 
deviations  from  the  mean  on  the  input  variable  for 
one  unit  to  systematically  go  together  with  positive 
deviations  from  the  mean  for  its  neighbors  or  the 
spatial  lag.  There  should  be  relatively  few 
observations  far  from  the  dashed  lines  of  the 
averages  in  the  upper  left  (I)  and  lower  right  (IV) 
quadrants,  where  the  sign  of  the  two  deviations 
diverge,  and  the  bulk  of  the  individual  observations 
should  be  in  Quadrants  II  and  III,  where  positive 
and  negative  deviations  go  together  for  both 
variables.  Although  some  cases  fall  inside  the 
boundaries  of  Quadrants  I  and  IV  in  Figure  2,  the 
bulk  of  the  observations  are  clearly  in  Quadrants 
II  and  III,  reflecting  how  most  countries  tend  to 
have  levels  of  democracy  similar  to  the  weighted 
average  of  the  levels  found  among  their  neighbors. 
Moran’s  I  is  the  slope  of  the  regression  of  the 
spatial  lag  on  the  standardized  input  variable.  This 
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is  indicated  in  Figure  2  as  a  gray  dashed  line,  and 
one  can  see  that  the  slope  of  the  regression  line 
clearly  is  positive.  In  contrast,  if  there  is  no  spatial 
correlation,  one  would  not  see  any  tendency  for 
the  deviations  to  have  similar  signs,  and  the  slope 
of  the  regression  line  would  be  close  to  0.  Moran’s 
I  statistic  can  be  used  to  detect  both  spatial 
clustering  in  an  input  variable  and  spatial 
correlation  in  the  residuals  from  a  regression,  and 
this  is  implemented  in  common  software  options 
(see  Figure  2). 

When  theory  or  preliminary  analysis  suggests 
spatial  dependence,  researchers  may  wish  to  try 
to  integrate  the  spatial  component  in  a  regres¬ 
sion  model.  The  appropriate  model  specification 
depends  on  specific  theoretical  arguments  or  the 
likely  causes  of  spatial  dependence  as  well  as  on 
whether  the  dependent  variable  is  continuous  or 
categorical.  Model  estimators  for  spatial  regres¬ 
sion  models  with  continuous  dependent  vari¬ 
ables  are  relatively  well  established,  and  this 
section  focuses  only  on  this  case.  Models  where 
the  dependent  variable  is  categorical,  such  as 
binary  responses  or  counts,  are  considerably 
more  complex,  but  various  estimation  approaches 
relying  on  simulation  or  data  augmentation  have 
been  proposed. 

As  in  the  case  of  time  series,  one  might  approach 
spatial  dependence  either  as  a  nuisance  to  estimation 
that  must  be  taken  into  account  when  considering 
other  phenomena  of  interest  or  as  a  phenomenon  of 
interest  in  its  own  right.  In  the  former  case,  research¬ 
ers  may  find  it  helpful  to  consider  a  regression  model 
where  the  errors  display  a  spatially  correlated  struc¬ 
ture.  This  can  be  written  as  y  —  X(3  +  \W£  +  8, 
where  Xp  denotes  the  systematic  component  of  the 
model,  W£  indicates  a  spatially  correlated  error 
structure  as  specified  by  the  connectivity  matrix 
W,  and  8  is  a  residual  random  error.  The  parame¬ 
ter  X  reflects  the  extent  to  which  errors  are  spa¬ 
tially  clustered.  If  X  —  0,  then  the  model  reduces  to 
a  standard  linear  regression,  where  the  observa¬ 
tions  are  independent  of  one  another.  If  \  A  0, 
there  is  a  spatially  clustered  pattern,  where  a  linear 
regression  treating  the  observations  as  independent 
of  one  another  may  give  misleading  results.  This 
approach  may  be  appropriate  if  a  researcher 
believes  that  there  is  likely  to  be  spatial  dependence 
arising  from  some  spatially  clustered  omitted  vari¬ 
able.  If  the  model  is  correct,  the  consequences  of 


spatial  dependence  for  inference  can  be  addressed 
by  letting  the  errors  of  the  model  be  spatially  clus¬ 
tered,  so  that  the  sign  and  size  of  errors  for  indi¬ 
vidual  observation  covary  spatially.  This  can  be 
seen  as  analogous  to  time-series-correlated  error 
models,  in  the  sense  that  observations  are  interde¬ 
pendent  only  due  to  omitted  variables  that  are 
spatially  correlated.  Once  removed  from  the  sys¬ 
tematic  component  to  the  error,  space  no  longer 
matters. 

One  interesting  alternative  model  of  spatial 
dependence  from  a  substantive  perspective  is  the 
so-called  spatially  lagged  dependent  variable, 
which  can  be  written  as  y  —  pWy  +  X(3  +  8.  The 
dependent  variable  for  connected  units  itself  is 
here  specified  as  having  a  direct  impact  on  the 
value  of  the  dependent  variable  for  unit  i  and 
appears  on  the  right-hand  side  of  the  model.  The 
parameter  p  indicates  the  extent  to  which  the  value 
of  the  dependent  variable  y  for  a  unit  i  is  influ¬ 
enced  by  the  values  of  y  of  its  neighbors  j.  This  can 
be  seen  as  a  spatial  analogy  to  a  time-series  model 
with  a  temporally  lagged  dependent  variable.  The 
spatially  lagged  dependent  variable  model  is 
straightforward  to  estimate  as  a  standard  linear 
regression  if  the  spatial  impact  operates  with  a 
time  lag,  so  that  the  values  of  y  for  the  connected 
units  affecting  i  can  be  taken  as  predetermined  at 
time  t.  However,  if  the  spatial  impact  is  considered 
to  be  contemporaneous,  or  within  the  same  period, 
there  is  a  problem  of  simultaneity,  since  y  at  time  t 
appears  on  both  the  left-  and  the  right-hand  side  of 
the  equation  and  the  individual  values  are  jointly 
determined.  The  model  must  be  estimated  via 
either  instrumental  variables  or  a  maximum  likeli¬ 
hood  estimator,  which  is  now  implemented  in 
many  common  software  options.  Standard  signifi¬ 
cance  tests  can  be  used  to  test  whether  the  spatial 
lag  has  a  significant  impact  on  the  response  vari¬ 
able,  and  the  model  with  the  spatially  lagged 
dependent  variable  can  be  compared  with  a  model 
assuming  independent  observations  using  standard 
comparisons  and  tests  of  nested  models. 

The  interpretation  of  a  model  with  a  spatially 
lagged  dependent  variable  differs  notably  from 
that  of  a  model  where  the  units  are  considered  to 
be  independent.  In  the  latter,  the  coefficient  esti¬ 
mates  p  for  a  right-hand-side  independent  variable 
x  can  be  interpreted  as  the  expected  change  on  the 
dependent  variable  y  following  a  one  unit  change 
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in  x.  However,  this  interpretation  no  longer  holds 
in  a  model  with  a  spatially  lagged  dependent  vari¬ 
able,  since  the  model  implies  spatial  feedback.  The 
net  effect  of  a  change  in  x  for  i  will  now  depend  on 
how  the  effect  of  the  change  of  x  on  y  for  i  influ¬ 
ences  the  dependent  variable  for  other  units  j  con¬ 
nected  to  i,  which  in  turn  will  feed  back  on  to  the 
value  of  y  for  i.  Assessing  the  net  effect  of  a  change 
in  x  for  i  must  take  into  account  not  just  the  imme¬ 
diate  effect  ff  that  increasing  x  will  have  on  i  but 
also  the  long-term  effect  of  the  change  on  y  for 
unit  i  on  the  value  of  y  for  connected  observations 
and  the  implications  that  these  will  have  for  other 
units  connected  to  these  until  the  system  settles  on 
some  new  equilibrium.  If  the  spatial  feedback 
implications  are  large,  the  net  effect  of  a  change  in 
x  could  be  considerably  larger  than  the  immediate 
impact  given  by  (5.  Through  rearranging,  it  is  pos¬ 
sible  to  show  that  the  expected  effect  of  a  change 
in  x  is  given  by  (I  —  pW)_1Xp,  where  I  is  an  iden¬ 
tity  matrix.  This  tells  one  that  the  specific  impact 
depends  not  just  on  the  first-order  connectivities 
stipulated  in  the  connectivity  matrix  W  but  also  on 
the  higher-order  connectivities  implied  in  the  spa¬ 
tial  multiplier  (I  —  pW)_1.  Since  units  i  can  have 
different  degrees  of  connectives  to  other  observa¬ 
tions  and  different  degrees  of  influence  on  other 
observations,  the  effects  of  changes  in  x  will  be 
unit  specific  and  may  vary  considerably  across 
units. 

Applications  of  spatial  dependence  in  the  social 
sciences  have  until  recently  been  fairly  uncommon, 
but  the  past  decade  in  particular  has  seen  a  wealth 
of  new  applications  appearing  in  print.  These 
applications  have  been  facilitated  by  a  renewed 
interest  in  interaction  and  networks  in  social  sci¬ 
ence  theory  as  well  as  the  increased  availability  of 
software  to  conduct  spatial  analyses,  including 
geographic  information  systems  resources  and 
routines  for  general  statistical  packages  such  as 
MATLAB,  R,  or  Stata. 
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Textual  data  refer  to  systematically  collected  mate¬ 
rial  consisting  of  written,  printed,  or  electronically 
published  words,  typically  either  purposefully 
written  or  transcribed  from  speech.  Text  collected 
for  use  as  data  typically  reflects  a  conscious 
research  purpose,  motivated  by  a  design  aimed  at 
yielding  insight  on  some  feature  of  the  social  or 
political  world.  This  entry  outlines  the  purpose, 
issues,  and  challenges  involved  in  selecting,  pre¬ 
paring,  and  analyzing  textual  data. 

The  process  of  textual  data  analysis  follows 
several  main  steps.  First,  a  text  and  its  author  are 
identified  that  will  directly  inform  the  research 
question  at  hand,  for  instance,  legislative  speeches 
if  one  is  interested  in  lawmakers’  policy  agenda. 
Second,  this  text  must  be  processed,  typically 
involving  numerous  decisions  about  how  to  con¬ 
vert,  store,  edit,  and  combine  texts.  Next,  the  pro¬ 
cessed  text  is  analyzed,  a  step  that  may  take  any  of 
a  wide  variety  of  forms.  Humans  may  read  the  text 
to  classify  it,  possibly  after  dividing  texts  into 
smaller  units;  computerized  tools  may  analyze  the 
text  to  count  frequencies  of  preidentified  words  or 
phrases;  statistical  methods  may  be  applied  to  spe¬ 
cific  words,  word  patterns,  or  sentences  to  con¬ 
struct  scales,  clusters,  or  to  identify  other  patterns. 
Finally,  the  results  of  the  analysis  are  summarized, 
interpreted,  and  reported  to  inform  the  research 
question  that  motivated  the  analysis  of  the  textual 
data. 
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The  steps  in  this  process  are  not  dissimilar  to 
those  found  in  any  empirical  research  design,  but 
the  systematic  use  of  textual  data  may  present  spe¬ 
cial  challenges  in  the  stages  related  to  identifying 
the  data  to  be  collected  and  preparing  the  data  for 
analysis.  In  the  modern  era  of  near-ubiquitous 
electronic  textual  content,  this  problem  stems  not 
from  a  lack  of  availability  of  textual  data  or  tools 
for  working  with  them  but,  rather,  from  the  oppo¬ 
site:  The  staggering  quantity  and  variety  of  textual 
data  create  real  challenges  in  selecting  which  texts 
to  analyze.  Furthermore,  the  technical  challenges 
of  working  with  different  formats  for  recording 
electronic  texts,  as  well  as  converting  these  into 
formats  or  “data  sets”  that  suit  the  purpose  of 
analysis,  are  often  far  from  trivial.  Despite  such 
challenges,  however,  textual  data  remain  one  of 
the  most  promising  and  one  of  the  least  explored 
sources  of  systematic  information  about  the  politi¬ 
cal  and  social  world. 

The  Appeal  of  Textual  Data 

Textual  data  in  the  political  and  social  sciences 
have  several  principal  advantages.  The  first  can  be 
found  in  the  nature  of  text  and  the  information  it 
may  contain  about  the  authors  or  speakers  gener¬ 
ating  that  text.  If  the  task  at  hand  is  to  gauge  overt 
messages  or  signals  conveyed  through  text,  of 
course,  then  no  better  source  of  information  exists 
than  the  texts  themselves.  For  instance,  in  1951,  in 
a  well-known  study  of  articles  by  other  Politburo 
members  about  Stalin  on  the  occasion  of  his  70th 
birthday,  Nathan  Leites,  Else  Bernaut,  and 
Raymond  Garthoff  were  able  to  discern  differ¬ 
ences  in  groups  with  regard  to  communist  ideol¬ 
ogy.  In  this  instance,  the  messages  signaled  not 
only  an  underlying  orientation  but  also  a  degree  of 
political  maneuvering  with  regard  to  a  leadership 
struggle  following  the  foreseeable  event  of  Stalin’s 
death.  The  messages  are  themselves  significant, 
and  these  could  only  be  gleaned  from  the  public 
articles  authored  by  each  Politburo  member,  writ¬ 
ten  in  the  full  knowledge  that  they  would  be 
reprinted  in  the  party  and  general  Soviet  press. 
Similar  content  is  present  in  the  overt  messages 
from  advertising — although  such  analyses  fre¬ 
quently  focus  on  nontextual  content  as  well — as 
well  as  in  political  speeches  and  political  advertis¬ 
ing.  The  principal  advantage  of  textual  data  is  that 


what  political  actors  write  or  say  may  be  a  signifi¬ 
cant  political  signal  or  act,  a  form  of  purposive 
political  act  best  judged  using  the  textual  data  as 
the  best  manifestation  of  the  act  itself. 

A  second  notable  advantage  of  textual  data  for 
political  analysis  lies  in  the  possibly  unique  capac¬ 
ity  of  text  to  inform  us  about  the  qualities  of 
political  and  social  actors  that  are  unobservable 
through  direct  means.  The  most  frequently  ana¬ 
lyzed  quantities  of  this  type  in  political  science  are 
political  “ideal  points”  referring  to  an  actor’s  pref¬ 
erence  on  left-right  policy  scales,  such  as  a  relative 
preference  for  socially  and  morally  liberal  policies 
versus  conservative  ones.  Other  preferences  could 
include  being  relatively  for  or  against  a  specific 
policy,  such  as  the  repeal  of  the  Corn  Laws  in 
Britain  in  1846;  being  for  or  against  further 
European  integration  during  the  debate  over  the 
Laeken  Convention;  or  being  for  or  against  a  no 
confidence  motion.  The  preferences  are  inner  states 
of  political  actors,  whether  these  are  legislators, 
parties,  delegates,  or  candidates,  and  hence  cannot 
be  directly  observed.  The  most  common  alternative 
approach  not  based  on  texts  is  to  infer  preferences 
from  observed  voting  patterns.  Yet  it  has  been 
widely  shown  that  voting  is  a  highly  strategic 
political  act,  subject  in  particular  to  strict  party 
discipline  in  most  contexts,  and  therefore  reflects 
preferences  strategically  rather  than  sincerely.  Texts 
and  speeches,  by  contrast,  are  less  subject  to  politi¬ 
cal  monitoring  and  punishment  than  voting,  since 
their  content  in  democratic  systems  tends  to  be 
relatively  unregulated  as  well  as  having  fewer  direct 
consequences  for  policy  outcomes.  In  sum,  textual 
data  frequently  contain  important  information 
about  orientations  and  beliefs  for  which  nonverbal 
forms  of  behavior  may  serve  as  poor  indicators. 
Short  of  reading  the  minds  of  political  and  social 
actors  to  find  out  what  they  are  really  thinking — 
something  that  neither  technology  nor  ethical 
codes  of  research  make  possible — the  next  best  step 
is  to  read  what  they  are  writing  and  saying. 

A  final  advantage  of  text  is  its  ready  availability 
for  analysis.  Talk  is  cheap  not  just  in  the  political 
sense  but  also  in  the  sense  of  being  readily  and,  in 
most  cases,  freely  available  in  electronic  formats 
easily  amenable  to  systematic  analysis.  Political 
actors  leave  a  trail  of  recorded  text  behind  them  fol¬ 
lowing  almost  all  of  their  official  verbal  activities: 
speeches,  debates,  websites,  party  platforms,  press 


Data,  Textual  527 


releases,  all  easily  accessible  usually  for  minimal 
cost  and  effort.  Recorded  text  never  ages,  and 
never  forgets,  unlike,  for  instance,  participants  in  a 
panel  survey  or  an  expert  interview,  and  can  there¬ 
fore  be  used  to  analyze  historical  episodes  as  well 
as  contemporary  ones.  Text  never  gets  tired  or 
worn  out,  no  matter  how  many  times  it  is  ana¬ 
lyzed.  While  these  are  properties  shared  by  other 
behavioral  data,  such  as  voting  records,  they  are 
decidedly  not  features  involving  human  recall,  par¬ 
ticipant  or  expert  interviews,  or  surveys — other 
(indirect)  methods  frequently  used  to  assess  the 
unobservable,  inner  orientations  of  political  actors. 
For  practical  reasons  then,  textual  data  offer 
immense  advantages  of  availability  and  durability, 
provided  that  valid  and  reliable  means  can  be 
deployed  to  unlock  their  secrets. 

Substantive  Decisions  in  the 
Analysis  of  Textual  Data 

Any  research  using  textual  data  begins  with  the 
researcher  identifying  the  corpus  of  texts  relevant 
to  the  research  question  of  interest.  From  this  cor¬ 
pus,  texts  will  be  sampled  or  selected  for  analysis. 
Texts  are  generally  distinguished  from  one  another 
by  attributes  relating  to  the  author  or  speaker  of 
the  text,  perhaps  also  separated  by  time,  topic,  or 
act.  A  week’s  worth  of  lead  daily  opinion  pieces 
from  The  New  York  Times,  for  instance,  could 
form  a  set  of  seven  distinct  texts.  A  set  of  debates 
during  a  no  confidence  motion  in  the  British 
House  of  Commons  could  form  a  set  of  numerous 
distinct  texts  differentiated  by  speaker.  The  set  of 
election  manifestos  from  1948  to  1997  of  the  Irish 
Fianna  Fail  Party  could  form  a  set  of  texts  differ¬ 
entiated  by  election  date.  In  each  case,  distinguish¬ 
ing  external  attributes  attached  to  each  text  allows 
researchers  to  partition  the  verbal  data  they  con¬ 
tain,  thereby  distinguishing  one  text  from  another. 
Knowing  that  a  single  text  represents  an  identifi¬ 
able,  purposeful  verbal  act  gives  it  meaning  as  well 
as  a  set  of  variable  attributes  that  can  be  used  to 
compare  that  text  with  another.  This  does  not  nec¬ 
essarily  mean,  however,  that  in  analyzing  the  text, 
these  distinguishing  attributes  will  form  the  basis 
for  analyzing  these  data.  Here  it  is  useful  to  distin¬ 
guish  the  sampling  units  of  textual  data  from  the 
units  of  analysis  for  analyzing  these  data.  For 
example,  while  texts  are  generally  stored  as  “Bush’s 


2007  State  of  the  Union  Address”  and  “Bush’s 

2008  State  of  the  Union  Address,”  these  textual 
data  will  usually  be  analyzed  by  breaking  them 
down  into  constituent  textual  units.  The  most 
common  constituent  units  of  analysis  for  textual 
data  analysis  are  words,  consecutive  word 
sequences,  and  sentences.  Determining  which  are 
most  suitable  will  depend  on  the  research  question, 
the  nature  of  the  analytical  technique,  and  some¬ 
times  the  length  and  nature  of  the  texts  themselves. 
Knowing  how  the  sampling  procedure  for  the  tex¬ 
tual  data  selection  relates  to  the  sampling  units  and 
the  units  of  analysis  may  have  implications  for 
subsequent  inference,  given  that  the  units  of  analy¬ 
sis  are  not  randomly  sampled  textual  data,  irre¬ 
spective  of  the  sampling  units. 

In  many  political  science  applications  using  tex¬ 
tual  data,  the  “sample”  of  texts  may,  in  fact,  be  all 
possible  texts  generated  by  the  political  world  for 
that  application.  In  tracking  the  words  spoken  on 
abortion  per  day  in  the  U.S.  Congress,  for  instance, 
a  study  might  examine  every  spoken  utterance  in 
the  Senate  from  1997  to  2004.  Yet  even  in  such 
situations  where  a  researcher  may  not  confront 
overt  sampling  decisions,  such  as  how  many  news¬ 
paper  articles  to  select  from  which  set  of  days, 
from  a  larger  population  that  is  too  large  to  cover 
in  its  entirety,  it  is  still  important  to  be  aware  of 
selection  issues  that  shape  what  sort  of  text 
becomes  a  recorded  feature  of  the  social  system. 
Such  “social  bookkeeping”  has  long  been  noted  by 
historians  seeking  texts  to  gain  leverage  on  events 
long  past,  but  it  may  also  feature  in  many  forms  of 
observed  political  text,  especially  spoken  text  in 
structured  settings  such  as  legislatures.  The  key  is 
to  be  aware  of  the  mechanisms  governing  the  gen¬ 
eration  of  text,  with  an  aim  to  making  sure  that  the 
observable  text  provides  a  valid  representation  of 
the  phenomenon  that  it  will  be  used  to  investigate. 

Practical  Issues  in  Dealing 
With  Textual  Data 

Text  comes  in  many  shapes,  sizes,  and  forms,  but 
for  text  to  be  useful  as  data,  it  is  almost  always 
converted  to  electronic  form  prior  to  analysis. 
Electronic  text  has  the  advantage  of  being  easily 
stored  and  easily  manipulated.  Fortunately  for 
modern  users  of  textual  data,  new  texts  are  almost 
invariably  recorded  in  electronic  form  that  can  be 
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easily  converted  for  analysis.  Websites,  electronic 
documents,  text  stored  on  optical  media,  elec¬ 
tronic  mail,  word  processing  files,  news  feeds — all 
provide  text  that  can  be  easily  captured  and 
manipulated  for  analysis.  The  exception  tends  to 
be  text  generated  from  interactive,  spoken  activi¬ 
ties  such  as  interviews.  In  such  cases,  interviewers 
may  need  to  transcribe  textual  data  or  use  speech 
recognition  software  (combined  with  a  thorough 
human  check  on  the  software’s  transcription)  to 
convert  spoken  text  into  written,  electronic  text. 

One  potentially  tricky  practical  issue  when  deal¬ 
ing  with  textual  data  concerns  the  manner  in  which 
text  is  electronically  encoded  by  computers.  Broadly 
speaking,  “text”  means  the  representation  of  lan¬ 
guage  as  a  set  of  recorded  characters,  following 
language-specific  rules  for  syntax,  grammar,  and 
style  to  be  meaningful.  Pre-el ectronic  textual  data 
were  typically  series  of  characters  drawn  or  printed 
on  paper.  For  text  to  be  stored  electronically,  how¬ 
ever,  each  character  must  be  encoded  in  a  way  that 
corresponds  to  a  digital  format  used  by  computers. 
Morse  code,  for  instance,  encodes  characters  as  a 
series  of  dots  and  dashes.  Unfortunately — but 
hardly  surprisingly — different  computers  encode 
character  sets  in  different  ways,  and  this  can  create 
challenges  when  preparing  texts  for  processing 
together  when  these  texts  use  different  encodings. 
For  instance,  two  instances  of  democratic  may  be 
considered  as  two  different  words  by  software 
designed  to  compile  a  word  frequency  table,  when 
the  term  occurs  in  two  different  documents  stored 
with  different  text  encodings.  Encoding  issues  are 
further  complicated  when  texts  must  be  converted 
from  other  formats,  the  most  common  being 
Microsoft  Word,  HTML,  or  Adobe’s  PDF  format. 
The  same  Microsoft  Word  document  converted  to 
text  twice,  once  in  UTF-8  (an  implementation  of 
Unicode)  and  once  in  Microsoft  Windows’ 
1,252-character  set  for  Western  European  lan¬ 
guages,  will  not  be  equivalent  if  the  document 
includes  words  with  diacritic  marks  (such  as 
democratic  or  Lander)  or  even  specialized  format¬ 
ting  or  punctuation  marks — such  as  “smart  quotes” 
or  long-dashed  elements — contained  in  nearly  all 
formatted  text,  including  this  sentence. 

Many  users  of  textual  data  find  it  useful  to  pro¬ 
cess  raw  text  prior  to  analysis,  with  an  aim  to 
identifying  more  appropriate  units  of  analysis  than 
is  offered  by  the  raw  text.  The  most  common  types 


of  preprocessing  are  reduction  of  words  to  their 
word  stems  (or  “lemmas,”  which  strips  words  to 
their  stems  while  also  considering  context)  or 
elimination  of  words  through  the  use  of  “stop 
lists”  or  based  on  their  relative  infrequency.  The 
first  form  of  textual  data  preprocessing  treats 
words  as  equivalent  that  differ  only  in  their 
inflected  forms,  so  that,  for  example,  the  different 
words  taxes,  tax,  taxation,  taxing,  taxed,  and  tax¬ 
able  are  all  converted  to  the  word  stem  “tax.”  The 
second  common  textual  preprocessing  practice  is 
to  remove  words  that  are  considered  unlikely  to 
contribute  useful  information  for  analysis.  These 
words,  commonly  called  “stop”  words,  are  usually 
function  words  such  as  conjunctions,  prepositions, 
and  articles  that  occur  in  the  greatest  frequency  in 
natural  language  texts  but  add  little  specific  politi¬ 
cal  meaning  to  the  text  that  would  be  deemed  use¬ 
ful  to  analyze  from  textual  data.  These  stop  words 
may  be  identified  in  advance  in  the  form  of  a  “stop 
word  list” — on  a  language-specific  basis  of 
course — or  may  be  identified  by  their  relative  fre¬ 
quencies  across  and  within  sampling  units  (docu¬ 
ments).  The  problem  with  excluding  words  from 
an  a  priori  list,  however,  is  that  there  may  be  no 
universal,  cross-applicable  list  of  words  known  to 
contribute  nothing  useful  to  all  textual  data  uses. 
For  instance,  the  pronoun  “her,”  it  has  been  dem¬ 
onstrated,  has  a  decidedly  partisan  orientation  in 
debates  on  abortion  in  the  U.S.  Senate.  For  these 
reasons,  it  has  been  noted  that  a  general  trend 
in  preparing  textual  data  for  analysis  has  been 
gradually  to  reduce  or  eliminate  reliance  on  stop 
lists.  Another  approach  to  restricting  the  focus 
of  textual  data  analysis  from  all  words  to  only 
potentially  informative  words  is  to  filter  words  by 
indices  constructed  from  their  relative  frequency 
across  as  well  as  within  documents.  For  such 
purposes,  the  “tf.idf”  or  “term  frequency-inverse 
document”  frequency  measure  (and  its  many  vari¬ 
ants)  has  become  a  staple  of  computational  lin¬ 
guistics,  usually  serving  as  a  weight  to  assign  to 
words  rather  than  as  a  simple  inclusion  or  exclu¬ 
sion  mechanism.  The  idea  behind  using  tf.idf  to 
weight  textual  data  used  in  analysis  is  that  the 
most  informative  words  about  a  particular  text  are 
those  that  appear  many  times  in  that  text  but  in 
relatively  few  others.  At  the  extreme,  weighting 
words  by  their  tf.idf  will  exclude  words  that  occur 
equally  in  all  documents. 
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Other  methods  of  preprocessing  textual  data 
aimed  at  generating  units  of  analysis  from  sam¬ 
pling  units  include  converting  text  to  “n- grams,” 
defined  as  sequences  of  n  consecutive  items,  usu¬ 
ally  words  in  political  science  applications — 
thereby  distinguishing  “command  economy”  from 
“market  economy.”  In  an  application  of  a  political 
dictionary  to  text  fragments  where  election  mani¬ 
festos  defined  the  sampling  units,  for  instance,  one 
might  define  sampling  units  as  10- word  sequences, 
irrespective  of  punctuation.  Words  may  also  be 
tagged  by  their  part  of  speech,  using  language- 
specific  algorithms  developed  in  computational 
linguistics,  with  different  parts  of  speech  treated 
differently  in  whatever  analysis  follows. 

Identifying  units  of  analysis  may  also  be  done 
qualitatively,  based  on  reading  the  texts  and  iden¬ 
tifying  politically  relevant  units  of  text.  The  best 
known  example  in  political  research  is  the  “quasi 
sentence”  that  forms  the  unit  of  analysis  for  the 
long-running  Comparative  Manifesto  Project. 
Quasi  sentences  are  textual  units  that  express  a 
policy  proposition  and  may  be  either  a  complete 
natural  sentence  or  part  of  one.  Because  some 
authors  may  express  two  distinct  policy  statements 
within  a  single  natural  sentence,  the  use  of  “quasi” 
sentences  permits  a  more  valid  and  complete  rep¬ 
resentation  of  the  content  of  the  textual  data.  The 
trade-off,  however,  is  that  the  same  human  deci¬ 
sion  process  that  interprets  the  sentence  structure 
to  identify  text  units  causes  the  procedure  to  be 
unreliable  and  often  difficult  or  impossible  to  rep¬ 
licate.  This  trade-off  between  reliability — whether 
repetition  of  a  procedure  produces  stable  results — 
and  validity — whether  the  measurement  or  analy¬ 
sis  reflects  the  truth  of  what  is  being  measured  or 
represented  by  the  textual  data — is  a  recurrent 
theme  in  research  involving  textual  data.  This 
affects  not  just  the  identification  and  preparation 
of  units  for  analysis  but  also  the  design  of  coding 
frames  and  measurement  and  scaling  models  for 
analyzing  textual  data. 

Confronting  these  practical  issues  may  seem  like 
a  potentially  complex  matter,  but  fortunately,  for 
nearly  every  practical  need,  there  exist  several 
ready-made  software  solutions.  Stemmers,  stop 
word  lists,  word  frequency  generators,  n-gram 
parsers,  parts  of  speech  identifiers,  encoding  con¬ 
verters,  and  a  variety  of  utilities  to  translate  from 
one  format  to  another  are  readily,  and  often  freely, 


available.  The  practical  challenge  of  dealing  with 
textual  data  then  becomes  more  one  of  work  flow 
management  and  research  design  than  of  the  tech¬ 
nical  matter  of  solving  specific  practical  needs. 

Analyzing  Textual  Data 

Once  texts  have  been  identified,  selected,  and  pre¬ 
pared,  they  are  ready  for  analysis.  The  analysis  of 
textual  data  is  not  different  from  the  analysis  of 
any  other  data:  no  matter  what  its  form,  the  analy¬ 
sis  procedure  is  only  of  value  if  it  can  somehow 
produce  propositions,  measures,  scales,  or  inter¬ 
pretations  of  lesser  complexity  than  the  textual 
data  themselves.  Analyzing  textual  data  involves 
applying  some  explicit  procedure  to  units  of  analy¬ 
sis  from  the  texts,  whether  this  procedure  is  purely 
qualitative,  purely  quantitative,  or  some  hybrid  in 
between.  In  practice,  the  variety  of  ways  that  text 
may  be  analyzed  is  vast.  Some  techniques  are 
purely  qualitative,  involving  interpretation  of  tex¬ 
tual  data  by  simply  reading  them  and  producing 
some  summary  account.  Other  qualitative  meth¬ 
ods  involve  reading  a  text  and  classifying  the  text 
according  to  a  relatively  simple  coding  frame,  such 
as  prowar,  neutral,  or  antiwar.  Yet  other  qualita¬ 
tive  methods  rely  on  human  judgment  to  apply 
rules  to  units  of  textual  data  and  possibly  also  in 
identifying  those  units.  The  Comparative  Manifesto 
Project  is  an  example  of  such  a  scheme,  using 
human  coders  to  identify  and  classify  variable  tex¬ 
tual  units  using  a  5  6-fold  scheme  of  political  policy 
categories:  so  is  the  65-category  coding  scheme 
developed  by  Frank  Baumgartner,  Suzanna  DeBoef, 
and  Amber  Boydstun  in  2008  to  capture  argu¬ 
ments  for  and  against  the  death  penalty. 

Many  methods  for  analyzing  textual  data  com¬ 
bine  schemes  involving  human  decisions  and 
qualitative  judgments  with  the  use  of  computers  to 
perform  the  mechanics  of  the  processing  and 
analysis.  Tagging  text  with  a  predefined  scheme  or 
dictionary  is  an  example  of  such  a  procedure. 
Several  scaling  algorithms,  for  example  the  “word- 
scores”  method  developed  by  Michael  Laver, 
Kenneth  Benoit,  and  John  Garry,  require  human 
judgment  to  identify  reference  texts  and  to  specify 
their  relative  locations  on  the  scale  of  interest  in 
order  to  estimate  scaled  positions  for  additional 
texts  subsequently,  using  a  purely  automated  pro¬ 
cedure  requiring  no  further  human  judgment. 
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The  ultimate  quantitative  extreme  in  textual  data 
analysis  uses  scaling  procedures  borrowed  from 
item  response  theory  methods  developed  originally 
in  psychometrics.  Both  Jon  Slapin  and  Sven-Oliver 
Proksch’s  Poisson  scaling  model  and  Burt  Monroe 
and  Ko  Maeda’s  similar  scaling  method  assume 
that  word  frequencies  are  generated  by  a  probabi¬ 
listic  function  driven  by  the  author’s  position  on 
some  latent  scale  of  interest  and  can  be  used  to 
estimate  those  latent  positions  relative  to  the  posi¬ 
tions  of  other  texts.  Such  methods  may  be  applied 
to  word  frequency  matrixes  constructed  from  texts 
with  no  human  decision  making  of  any  kind.  The 
disadvantage  is  that  while  the  scaled  estimates 
resulting  from  the  procedure  represent  relative  dif¬ 
ferences  between  texts,  they  must  be  interpreted  if  a 
researcher  is  to  understand  what  politically  signifi¬ 
cant  differences  the  scaled  results  represent.  This 
interpretation  is  not  always  self-evident. 

Recent  textual  data  analysis  methods  used  in 
political  science  have  also  focused  on  classifica¬ 
tion:  determining  which  category  a  given  text 
belongs  to.  Recent  examples  include  methods  to 
categorize  the  topics  debated  in  the  U.S.  Congress 
as  a  means  of  measuring  political  agendas.  Variants 
on  classification  include  recently  developed  meth¬ 
ods  designed  to  estimate  accurately  the  propor¬ 
tions  of  categories  of  opinions  about  the  U.S. 
presidency  from  blog  postings,  even  though  the 
classifier  on  which  it  is  based  performs  poorly  for 
individual  texts.  New  methodologies  for  drawing 
more  information  from  political  texts  continue  to 
be  developed,  using  clustering  methodologies, 
more  advanced  item  response  theory  models,  sup¬ 
port  vector  machines,  and  semisupervised  and 
unsupervised  machine  learning  techniques. 

Kenneth  Benoit 
Trinity  College 

Dublin,  Ireland 
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John  W.  Tukey,  the  definer  of  the  phrase  explor¬ 
atory  data  analysis  (EDA),  made  remarkable  con¬ 
tributions  to  the  physical  and  social  sciences.  In 
the  matter  of  data  analysis,  his  groundbreaking 
contributions  included  the  fast  Fourier  transform 
algorithm  and  EDA.  He  reenergized  descriptive 
statistics  through  EDA  and  changed  the  language 
and  paradigm  of  statistics  in  doing  so.  Interestingly, 
it  is  hard,  if  not  impossible,  to  find  a  precise  defi¬ 
nition  of  EDA  in  Tukey’s  writings.  This  is  no  great 
surprise,  because  he  liked  to  work  with  vague 
concepts,  things  that  could  be  made  precise  in 
several  ways.  It  seems  that  he  introduced  EDA  by 
describing  its  characteristics  and  creating  novel 
tools.  His  descriptions  include  the  following: 

1.  “Three  of  the  main  strategies  of  data  analysis 
are:  1.  graphical  presentation.  2.  provision  of 
flexibility  in  viewpoint  and  in  facilities, 

3.  intensive  search  for  parsimony  and 
simplicity.”  (Jones,  1986,  Vol.  IV,  p.  558) 

2.  “In  exploratory  data  analysis  there  can  be  no 
substitute  for  flexibility;  for  adapting  what  is 
calculated — and  what  we  hope  plotted — both  to 
the  needs  of  the  situation  and  the  clues  that  the 
data  have  already  provided.”  (p.  736) 

3.  “I  would  like  to  convince  you  that  the 
histogram  is  old-fashioned.  .  .  .”  (p.  741) 
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4.  “Exploratory  data  analysis  .  .  .  does  not  need 
probability,  significance  or  confidence.” 

(p.  794) 

5.  “I  hope  that  I  have  shown  that  exploratory  data 
analysis  is  actively  incisive  rather  than  passively 
descriptive,  with  real  emphasis  on  the  discovery 
of  the  unexpected.”  (p.  lxii) 

6.  ‘“Exploratory  data  analysis’  is  an  attitude,  a 
state  of  flexibility,  a  willingness  to  look  for 
those  things  that  we  believe  are  not  there,  as 
well  as  those  we  believe  to  be  there.”  (p.  806) 

7.  “Exploratory  data  analysis  isolates  patterns  and 
features  of  the  data  and  reveals  these  forcefully 
to  the  analyst.”  (Hoaglin,  Mosteller,  &  Tukey, 
1983,  p.  1) 

8.  “If  we  need  a  short  suggestion  of  what 
exploratory  data  analysis  is,  I  would  suggest 
that:  1.  it  is  an  attitude,  AND  2.  a  flexibility, 
AND  3.  some  graph  paper  (or  transparencies, 
or  both).”  (Jones,  1986,  Vol.  IV,  p.  815) 

This  entry  presents  a  selection  of  EDA  tech¬ 
niques  including  tables,  five-number  summaries, 
stem-and-leaf  displays,  scatterplot  matrices,  box 
plots,  residual  plots,  outliers,  bag  plots,  smoothers, 
reexpressions,  and  median  polishing.  Graphics  are 
a  common  theme.  These  are  tools  for  looking  in 
the  data  for  structure,  or  for  the  lack  of  it. 

Some  of  these  tools  of  EDA  will  be  illustrated 
here  employing  U.S.  presidential  elections  data 


from  1952  through  2008.  Specifically,  Table  1 
displays  the  percentage  of  the  vote  that  the  Demo¬ 
crats  received  in  the  states  of  California,  Oregon, 
and  Washington  in  those  years.  The  percentages 
for  the  Republican  and  third-party  candidates  are 
not  a  present  concern.  In  EDA,  one  seeks  displays 
and  quantities  that  provide  insights,  understand¬ 
ing,  and  surprises. 

Table 

A  table  is  the  simplest  EDA  object.  It  simply 
arranges  the  data  in  a  convenient  form.  Table  1  is 
a  two-way  table. 

Five-Number  Summary 

Given  a  batch  of  numbers,  the  five-number  sum¬ 
mary  consists  of  the  largest,  smallest,  median,  and 
upper  and  lower  quartiles.  These  numbers  are  use¬ 
ful  for  auditing  a  data  set  and  for  getting  a  feel  for 
the  data.  More  complex  EDA  tools  may  be  based 
on  them.  For  the  California  data,  the  five-number 
summary  in  percents  is  shown  in  Figure  1. 

These  data  are  centered  at  47.6%  and  have  a 
spread  measured  by  the  interquartile  range  of 
8.75%.  Tukey  actually  employed  related  quanti¬ 
ties  in  a  hope  to  avoid  confusion. 

Stem-and-Leaf  Display 

The  numbers  of  Table  1  provide  all  the  informa¬ 
tion,  yet  condensations  can  prove  better.  Figure  2 


Table  I  Percentages  of  the  Votes  Cast  for  the  Democratic  Candidate  in  the  Presidential  Years  1952-2008 


State 

Year 

1952 

1956 

1960 

1964 

1968 

1972 

1976 

1980 

1984 

1988 

1992 

1996 

2000 

2004 

2008 

California 

42.7 

44.3 

49.6 

59.1 

44.7 

41.5 

47.6 

35.9 

41.3 

47.6 

46.0 

51.1 

53.4 

54.3 

61.0 

Oregon 

38.9 

44.8 

44.7 

63.7 

43.8 

42.3 

47.6 

38.7 

43.7 

51.3 

42.5 

47.2 

47.0 

51.3 

56.7 

Washington 

44.7 

45.4 

45.4 

62.0 

47.2 

38.6 

46.1 

37.3 

42.8 

50.1 

45.1 

49.8 

50.2 

52.8 

57.7 

Source:  Statistical  Abstracts  of  the  U.S.  Census  Bureau. 


Minimum 

Lower  Quartile 

Median 

Upper  Quartile 

Maximum 

35.9 

43.50 

47.6 

52.25 

61.0 

Figure  I  A  Five-Number  Summary  for  the  California  Democrat  Percentages 

Note:  The  minimum,  35.9%,  occurred  in  1980  and  the  maximum,  61%,  in  2008. 
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Figure  2  Stem-and-Leaf  Displays,  With  Scales  of  I 
and  2,  for  the  California  Democratic  Data 


provides  a  stem-and-leaf  display  for  the  data  of 
the  table.  There  are  stems  and  leafs.  The  stem  is  a 
line  with  a  value.  See  the  numbers  to  the  left  of  the 
“  I  ” .  The  leaves  are  numbers  on  a  stem,  the  right- 
hand  parts  of  the  values  displayed. 

Using  this  exhibit,  one  can  read,  off  various 
quartiles,  the  five-number  summary  approxi¬ 
mately;  see  indications  of  skewness;  and  infer  mul¬ 
tiple  modes. 


Scatterplot  Matrix 


Figure  3  Scatterplots  of  Percentages  for  the  States  Versus  Percentages  for  the  States  in  Pairs 

Notes:  A  least  squares  line  has  been  added  as  a  reference.  CA  =  California;  OR  =  Oregon;  WA  =  Washington. 


Data  Analysis,  Exploratory  533 


Scatterplot  Matrix 

Figure  3  displays  individual  scatterplots  for  the 
state  pairs  (CA,  OR),  (CA,  WA),  and  (OR,  WA). 
A  least  squares  line  has  been  added  in  each  display 
to  provide  a  reference.  One  sees  the  x  and  y  values 
staying  together.  An  advantage  of  the  figure  over 
three  individual  scatterplots  is  that  one  sees  the 
plots  simultaneously. 

Outliers 

An  outlier  is  an  observation  strikingly  far  from 
some  central  value.  It  is  an  unusual  value  relative 
to  the  bulk  of  the  data.  Commonly  computed 
quantities  such  as  averages  and  least  squares  lines 
can  be  drastically  affected  by  such  values.  Methods 
to  detect  outliers  and  to  moderate  their  effects  are 
needed.  So  far,  the  tools  discussed  in  this  entry 
have  not  found  any  clear  outliers. 

Box  Plots  A  box  plot  consists  of  a  rectangle  with 
top  and  bottom  sides  at  the  levels  of  the  quartiles, 
a  horizontal  line  added  at  the  level  of  the  median, 
and  whiskers,  of  length  1.5  times  the  interquartile 
range,  added  at  the  top  and  bottom.  It  is  based  on 
numerical  values.  Points  outside  these  limits  are 
plotted  and  are  possible  outliers.  Figure  4  presents 
three  box  plots.  When  more  than  one  box  plot  are 
present  in  a  figure,  they  are  referred  to  as  parallel 
box  plots. 

Figure  4  presents  a  parallel  box  plot  display  for 
the  presidential  data.  The  California  values  tend  to 
be  higher  than  those  of  Oregon  and  Washington. 
Those  show  a  single  outlier  each  and  a  skewing 
toward  higher  values.  Both  the  outliers  are  for  the 
1964  election. 

Residual  Plots 

A  residual  plot  is  another  tool  for  detecting  outli¬ 
ers  and  noticing  unusual  patterns.  Suppose  there  is 
a  fit  to  the  data,  say,  a  least  squares  line.  The 
residuals  are  then  the  differences  between  the  data 
and  their  corresponding  fitted  values. 

Consider  the  percentages  in  the  table  depending 
on  the  year  of  the  election — that  is,  consider  the 
data  as  a  time  series  (Figure  5). 

The  time  series  of  these  three  states  track  each 
other  very  well,  and  there  is  a  suggestion  of  an 
outlier  in  each  plot. 


Parallel  box  plots 


CA  OR  WA 


Figure  4  Parallel  Box  Plots  for  the  Percentages,  One 
for  Each  State 

Note:  CA  =  California;  OR  =  Oregon;  WA  =  Washington. 

Figure  6  shows  the  residuals  for  the  three 
states. 

Each  display  in  Figure  6  shows  an  outlier  near 
the  top.  They  all  correspond  to  year  1964.  This 
was  the  first  year  after  John  Kennedy  was  assassi¬ 
nated,  and  Lyndon  Johnson  received  a  substantial 
sympathy  vote.  There  also  is  a  suggestion  of  tem¬ 
poral  dependence. 

With  today’s  large  data  sets,  one  wishes  for 
automatic  ways  to  identify  and  handle  outliers 
and  other  unusual  values.  One  speaks  of  resis¬ 
tant/robust  methods,  resistant  methods  being 
those  not  overly  sensitive  to  the  presence  of  outli¬ 
ers  and  robust  ones  being  those  not  affected 
strongly  by  long  tails  in  the  distribution.  In  the 
case  of  bivariate  data,  one  can  consider  the  bag 
plot. 

Bag  Plots 

The  bag  plot  is  a  generalization  of  the  box  plots  of 
Figure  4.  It  is  often  a  convenient  way  to  study  the 
scatter  of  bivariate  data.  In  the  construction  of  a 
bag  plot,  one  needs  a  bivariate  median,  analogs  of 
the  quartiles,  and  whiskers.  Tukey  and  his  collabo¬ 
rators  developed  these.  The  center  of  the  bag  plot  is 
the  Tukey  median.  The  “bag”  surrounds  the  center 
and  contains  the  50%  of  the  observations  with  the 
greatest  depth.  The  “fence”  separates  inliers  from 
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CA  percentages  vs.  year 


OR  percentages  vs.  year 


WA  percentages  vs.  year 


Figure  5  Graphs  of  the  Individual  State  Democrat  Percentages  Versus  the  Election  Years 

Notes:  A  least  squares  line  has  been  added  as  a  reference.  CA  =  California;  OR  =  Oregon;  WA  =  Washington. 


CA  residuals  OR  residuals  WA  residuals 
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Figure  6  Residuals  From  Least  Squares  Line  Versus  Year  With  0-Line  Added 

Note:  CA  =  California;  OR  =  Oregon;  WA  =  Washington. 


outliers.  Lines  called  whiskers  mark  observations 
between  the  bag  and  the  fence.  The  fence  is 
obtained  by  inflating  the  bag,  from  the  center,  by 
a  factor  of  3. 

Figure  7  provides  bag  plots  for  each  of  the  pairs 
(CA,  OR),  (CA,  WA),  and  (OR,  WA). 


One  sees  an  apparent  outlier  in  both  the 
California  versus  Oregon  and  the  California  versus 
Washington  cases.  Interestingly  there  is  not  one  for 
the  Oregon  versus  Washington  case.  On  inspec¬ 
tion,  it  is  seen  that  the  outliers  correspond  to  the 
1964  election.  One  also  sees  that  the  points 
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CA  vs.  OR  CA  vs.  WA  OR  vs.  WA 


Figure  7  Bag  Plots  for  the  State  Pairs,  Percentages  Versus  Percentages 

Note:  The  bivariate  median  is  the  black  spot  within  the  bag  (darker  shading),  and  the  fence  is  the  outer  boundary. 


vaguely  surround  a  line.  Because  of  the  bag  plot’s 
resistance  to  outliers,  the  unusual  point  does  not 
affect  its  location  and  shape. 


Smoother 

Smoothers  have  as  a  goal  the  replacement  of  a 
scatter  of  points  by  a  smooth  curve.  Sometimes  the 
effect  of  smoothing  is  dramatic  and  a  signal 
appears.  The  curve  resulting  from  smoothing 


might  be  a  straight  line.  More  usefully,  a  local  least 
squares  fit  might  be  employed  with  the  local 
curves,  y  =  f(x),  a  quadratics.  The  local  character 
is  often  introduced  by  employing  a  kernel.  A  sec¬ 
ond  kernel  might  be  introduced  to  make  the  opera¬ 
tion  robust/resistant.  It  will  have  the  effect  of 
reducing  the  impact  of  points  with  large  residuals. 

Figure  8  shows  the  result  of  local  smoothing  of 
the  Democrat  percentages  as  a  function  of  election 
year.  The  loess  procedure  was  employed. 


CA  percentage  vs.  year 


OR  percentage  vs.  year 


WA  percentage  vs.  year 


Figure  8  Percents  Versus  Year  With  a  Loess  Curve  Superposed 

Note:  CA  =  California;  OR  =  Oregon;  WA  =  Washington. 
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CA:  robust  smooth 


CA:  robust  smooth 


WA:  robust  smooth 


Figure  9  The  Curves  Plotted  Are  Now  Resistant  to  Outliers 

Note:  CA  =  California;  OR  =  Oregon;  WA  =  Washington. 


Median  polish  results  for  CA,  OR,  and  WA 


i-  i-  i-  i-  i-  <M  CM 

Figure  10  The  Year  Effect  Obtained  for  the  Election 
Data  via  Median  Polishing 

Note:  CA  =  California;  OR  =  Oregon;  WA  =  Washington. 

The  curves  have  a  similar  general  appearance. 
They  are  pulled  up  by  the  outlier  at  the  1964  point. 

Robust  Variant 

The  behavior  in  1964  being  understood  to  a 
degree,  one  would  like  an  automatic  way  to  obtain 
a  curve  not  so  strongly  affected  by  this  outlier.  The 
loess  procedure  has  a  robust/resistant  variant.  The 
results  follow  in  Figure  9. 

Having  understood  that  1964  was  an  unusual 
year,  one  can  use  a  robust  curve  to  understand  the 
other  values  better.  The  plots  have  similar  shapes. 
One  sees  a  general  growth  in  the  Democrat  per¬ 
centages  starting  around  1980.  In  this  two-step 


procedure,  it  is  important  to  study  both  the  outlier 
and  the  robust/resistant  curve. 

Reexpression 

This  term  refers  to  expressing  the  same  information 
by  different  numbers,  for  example,  using  logit  = 
log(p/(l  —  p ))  instead  of  the  proportion  p.  The 
purpose  may  be  additivity,  obtaining  straightness 
or  symmetry,  or  making  variability  more  nearly 
uniform. 

The  final  method  is  a  tool  for  working  with 
two-way  tables. 

Median  Polish 

This  is  a  process  of  alternately  finding  and  sub¬ 
tracting  medians  from  rows  and  then  columns  and 
perhaps  continuing  to  do  this  until  the  results  do 
not  change  much.  One  purpose  is  to  seek  an  addi¬ 
tive  model  for  a  two-way  table,  in  the  presence  of 
outliers  in  the  data. 

The  state  percentages  in  Table  1  form  a  3  X  15 
table  and  a  candidate  for  median  polish.  The 
resulting  row  (year)  effects  are  shown  in  Figure  10. 

These  effects  are  not  meant  to  be  strongly 
affected  by  outliers.  Figure  10  shows  the  same  gen¬ 
eral  curve  as  in  Figure  5. 
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This  entry  ends  with  Tukey’s  1973  rejoinder: 

“Undoubtedly,  the  swing  to  exploratory  data 

analysis  will  go  somewhat  too  far”  (cited  in  Jones, 

Vol.  Ill,  p.  lxii). 

David  R.  Brillinger 
University  of  California,  Berkeley 
Berkeley,  California,  United  States 

See  also  Cross-Tabular  Analysis;  Data  Visualization; 
Graphics,  Statistical;  Statistics:  Overview 
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Data  Visualization 


The  basic  objective  of  data  visualization  is  to  pro¬ 
vide  an  efficient  graphical  display  for  summariz¬ 
ing  and  reasoning  about  quantitative  information. 
Data  visualization  should  be  distinguished  from 
other  types  of  visualization  used  in  political  sci¬ 
ence  (more  general  information  and  knowledge 
visualization,  concept  visualization,  strategy  and 
work  flow  visualization,  metaphor  visualization, 
etc.)  as  it  is  more  specific  to  the  representation  of 


quantitative  data  existing  in  the  form  of  numerical 
tables.  In  the  following  sections,  the  different 
types  and  methods  of  data  visualization  and  their 
application  in  political  science  are  presented. 

Chart  Types  and  Methods 

During  the  past  decades,  political  science  has  accu¬ 
mulated  a  large  corpus  of  various  kinds  of  data  such 
as  comprehensive  fact  books  and  atlases,  character¬ 
izing  all  or  most  of  existing  states  by  multiple  and 
objectively  assessed  numerical  indicators  within 
certain  time  periods  (e.g.,  OECD  Factbook  and 
Political  Atlas  of  the  Modern  World).  As  a  conse¬ 
quence,  there  exists  a  tendency  for  political  science 
to  gradually  become  a  more  quantitative  scientific 
field  and  to  use  quantitative  information  in  analysis 
and  reasoning.  Any  analysis  in  political  science 
must  be  multidimensional  and  combine  various 
sources  of  information;  however,  human  capabili¬ 
ties  for  perception  of  large  amounts  of  numerical 
information  are  limited.  Hence,  methods  and 
approaches  for  the  visualization  of  quantitative  and 
qualitative  data  (especially  multivariate  data)  are  an 
extremely  important  topic  in  political  science.  Data 
visualization  approaches  can  be  classified  into  sev¬ 
eral  groups,  starting  from  creating  informative 
charts  and  diagrams  (statistical  graphics  and  info¬ 
graphics)  and  ending  with  advanced  statistical 
methods  for  visualizing  multidimensional  tables 
containing  both  quantitative  and  qualitative  infor¬ 
mation.  Data  visualization  in  political  science  takes 
advantage  of  recent  developments  in  computer  sci¬ 
ence  and  computer  graphics,  statistical  methods, 
methods  of  information  visualization,  visual  design, 
and  psychology.  Data  visualization  in  political  sci¬ 
ence  has  certain  special  features  such  as  the  frequent 
use  of  geographical  maps,  which  creates  a  link  with 
the  well-developed  field  of  geographic  information 
systems  (GIS).  Furthermore,  numerical  tables  in 
political  science  are  often  incomplete,  which  makes 
important  the  use  of  methods  dealing  with  missing 
or  uncertainly  measured  data  entries. 

There  are  two  main  types  of  numerical  tables 
that  can  be  the  subject  of  data  visualization.  The 
first  one  is  called  an  object-feature  table,  where 
each  row  represents  an  observation  or  an  object  and 
each  column  corresponds  to  a  numerical  feature  or 
indicator  commonly  measured  for  the  whole  set  of 
objects.  An  example  of  such  an  object-feature  table 
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Figure  I  Examples  of  Data  Visualization  for  a  Small  Subset  of  Data  (14  Socioeconomic  Numerical  Indicators  for 
40  Countries) 

Source:  Organisation  for  Economic  Co-operation  and  Development.  (2006).  OECD  factbook  2006:  Economic,  environmental 
and  social  statistics.  Paris:  Author. 

Notes:  The  indicators  describe  education,  energy  consumption,  employment  situation,  macroeconomic  trends,  population  and 
migration,  prices,  and  quality-of-life  aspects  of  each  country.  The  table  itself  is  a  graphical  information  display.  The  bars  and 
arrows  in  the  table  cells  show  the  indicator  values  (arrow  up — above  the  median;  arrow  down — below  the  median;  arrow 
right — near  the  median).  The  rows  and  columns  are  reordered  so  that  the  correlated  indicators  and  statistical  country  profiles 
are  ranked  closely.  The  dendrogram  on  the  right  (or  at  the  bottom)  shows  the  separation  of  countries  into  groups  (or  grouping 
correlated  indicators)  as  suggested  by  hierarchical  clustering.  CPI  =  Corruption  Perception  Index;  GDP  =  gross  domestic  product. 


is  a  fact  book  for  a  set  of  countries  (see  Figure  1), 
where  the  objects  are  countries  and  the  features  are 
numerical  indicators  such  as  gross  domestic  product 
(GDP)  per  capita,  employment  rate,  life  expectancy, 


and  so  on.  The  second  type  of  numerical  tables  is 
called  connection  or  distance  tables,  where  both 
rows  and  columns  correspond  to  objects  and  a 
numerical  value  characterizing  a  link  between  two 
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objects  is  found  at  the  intersection  of  a  row  and  a 
column.  Alternatively,  such  tables  can  be  repre¬ 
sented  in  the  form  of  a  list  of  links.  A  typical  exam¬ 
ple  of  a  connection  table  is  the  table  representing 
migration  rates  or  the  mutual  volumes  of  exports 
and  imports  for  a  set  of  countries  (at  the  intersection 
of  Row  A  and  Column  B,  the  volume  of  export 
from  Country  A  to  County  B  is  found). 

Data  visualization  plays  several  important 
roles  in  science:  It  (a)  helps  create  informative 
illustrations  of  the  data,  recapitulating  large 
amounts  of  quantitative  information  on  a  dia¬ 
gram;  (b)  helps  formulate  new  hypotheses  or  to 
confirm  existing  hypotheses  for  quantitative  data; 
and  (c)  guides  a  statistical  analysis  of  data  and 
checks  its  validity.  One  can  also  mention  the  role 
of  infographics  in  creating  images  with  a  clear 
and  visual  message,  based  on  numbers;  thus,  data 
visualization  can  serve  as  a  powerful  public  rela¬ 
tions  or  educational  tool.  Graphical  display 
allows  not  only  visualizing  and  analyzing  the 
message  contained  in  data  but  also  remembering 
it,  since  for  most  people  visual  memory  is  more 
persistent  than  verbal  or  auditory  memory  (the 
phenomenon  of  pictorial  superiority). 

Several  groups  of  methods  for  data  visualization 
can  be  distinguished  in  political  science: 

•  statistical  graphics  and  infographics,  with 
extensive  use  of  color,  form,  size,  shape,  and 
style  to  superimpose  many  quantitative  variables 
in  the  same  chart  or  diagram; 

•  geographic  information  systems  to  visualize 
geographically  linked  data; 

•  graph  visualization  or  network  maps  for 
representing  the  relations  between  objects;  and 

•  data  cartography — that  is,  the  projection  of 
multidimensional  data  on  low-dimensional 
screens  with  further  visualization. 

As  follows,  a  description  of  these  principal  groups 
of  data  visualization  methods,  along  with  references 
to  examples  of  their  application  in  political  science, 
is  provided. 

Statistical  Graphics  and  Infographics 

Statistical  graphics  started  to  be  used  in  science 
in  the  18th  century  but  began  to  be  widely 
exploited  only  from  the  end  of  the  19th  century. 
Many  of  the  currently  used  types  of  charts  were 


introduced  by  William  Playfair,  a  Scottish  politi¬ 
cal  economist,  and  Johann  Heinrich  Lambert,  a 
Swiss  German  mathematician.  With  the  appear¬ 
ance  of  computer-based  technologies  toward  the 
end  of  the  20th  century,  the  use  of  statistical 
graphics  exploded  and  included  exploitation  of 
three-dimensional,  dynamic,  and  interactive  data 
representations. 

Ralph  Lengler  and  Martin  Eppler  suggested  a 
“periodic  table”  of  visualization  methods  with 
many  examples  of  elements  of  statistical  graphics. 
The  most  commonly  used  quantitative  data  dis¬ 
plays  can  be  classified  as  follows: 

•  Univariate  plots  are  designed  to  visualize  value 
distribution  of  a  single  variable.  The  basic  ones 
are  pie  charts,  bar  charts,  histograms,  box  plots 
(see  Figure  2). 

•  Bivariate  plots,  are  designed  to  visualize  relations 
between  two  variables.  These  include  various 
types  of  scatterplots  and  line  plots  (see  Figure  3). 
A  particularly  important  case  is  the  line  plot- 
visualizing  time  series  in  which  one  variable  is 
time. 

•  Multivariate  plots,  designed  to  visualize  the 
values  of  several  variables  at  the  same  time  and 
comparing  them.  Examples  of  these  are  area 
charts,  radar  charts,  mosaic  plots,  parallel 
coordinates  plots,  and  the  more  sophisticated 
Chernoff  faces. 

This  classification  is  elementary;  however, 
modern  statistical  graphics  usually  superimpose 
several  types  of  plots  and  use  color,  shape,  size, 
and  style  of  their  elements  to  increase  the  dimen¬ 
sionality  of  the  plot.  New  types  of  data  displays 
are  invented  continuously  and  proposed  as  parts 
of  commonly  used  software  for  data  analysis.  As 
an  example,  the  data  table  itself  can  serve  as  a 
data  display  (see  Figure  1).  Moreover,  the  table 
can  be  converted  into  the  more  elaborate  informa¬ 
tion  lens  display.  Time-series  representation  can 
be  significantly  enhanced  by  colorful  horizon 
graphs  that  allow  visualizing  hundreds  of  time 
series  simultaneously  and  identifying  patterns 
in  their  behavior  visually.  Treemaps  are  a  space¬ 
filling  approach  to  show  hierarchies  in  which  the 
rectangular  screen  space  is  divided  into  regions, 
and  then  each  region  is  divided  again  for  each 
level  in  the  hierarchy  (e.g.,  treemaps  were  used  for 
visualization  of  the  U.S.  budget).  Intersections 
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Figure  2  Examples  of  Standard  Statistical  Graphs 

Source:  Organisation  for  Economic  Co-operation  and  Development.  (2006).  OECD  factbook  2006:  Economic,  environmental 
and  social  statistics.  Paris:  Author. 


between  sets  of  objects  can  be  represented  by 
Venn  diagrams.  Many  useful  graphical  displays 
result  from  application  of  multidimensional  statis¬ 
tical  analysis  techniques  such  as  factor  analysis 
(including  principal  components  analysis)  and 
clustering  (see  Figures  1  and  4  and  the  section  on 
multidimensional  data  cartography). 

In  the  physical,  biological,  and  social  sciences, 
the  scatterplot  is  the  predominant  type  of  data  dis¬ 
play  (according  to  Edward  R.  Tufte,  approximately 
75%  of  graphs  used  in  science  are  scatterplots). 
Scatterplots  serve  two  basic  purposes:  ( 1 )  to  visually 
detect  linear  and  nonlinear  relations  between  two 
variables;  the  human  eye  can  do  this  efficiently  and 
is  extremely  robust  to  the  effects  of  anomalous 
observations  (outliers)  and  other  aberrations  in  the 


data,  and  (2)  to  create  a  map  of  data  allowing  one 
to  visualize  other  information  on  top  of  it.  In  the 
case  of  a  large  number  of  points,  the  scatterplot  can 
be  improved  by  visualizing  the  point  density,  using 
gradient  shading  (Figure  4)  or  isodensity  levels  (i.e., 
contours  connecting  points  with  the  same  density). 

In  contrast,  it  was  shown  that  the  use  of  statisti¬ 
cal  graphics  in  news  media  is  mainly  limited  to 
univariate  plots  (if  time-series  plots  are  not  consid¬ 
ered),  mainly  bar  charts  and  pie  charts.  Curiously, 
several  Japanese  news  journals  and  the  British 
Economist  create  notable  exceptions  and  provide 
up  to  9%  of  data  visualizations  in  the  form  of 
bivariate  plots.  One  of  the  first  books  on  the  his¬ 
tory  of  data  visualization,  Graphic  Presentation, 
was  written  by  Willard  Brinton  in  1939.  It  contains 
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Source:  Organisation  for  Economic  Co-operation  and  Development.  (2006).  OECD  factbook  2006:  Economic,  environmental 
and  social  statistics.  Paris:  Author. 


Notes:  The  five  indicators  are  GDP  and  population  growth  rates,  using  abscissa  and  ordinate  point  position;  employment  rate, 
using  marker  size;  GDP  per  capita,  using  marker  color;  and  population,  using  the  text  size  of  the  data  labels.  In  addition,  the 
geographical  localization  is  visualized  in  the  form  of  the  marker.  The  box  plots  on  the  axes  depict  the  median  value,  lower  and 
upper  quartiles,  extent  of  the  data,  and  outlier  values  (by  crosses).  GDP  =  gross  domestic  product. 


many  examples  of  thoughtful  data  visualization  in 
the  precomputer  era. 

Geographic  Information  Systems 

In  1854,  John  Snow  depicted  a  cholera  out¬ 
break  in  London  and  detected  the  source  of  the 
epidemic  using  points  to  localize  some  individual 
cases.  This  was  one  of  the  earliest  applications  of 
the  geographic  method — that  is,  combining  the  use 
of  a  geographical  map  and  statistical  data. 
Nowadays,  the  use  of  geographical  maps  with  sev¬ 
eral  layers  of  information  (also  called  thematic 
maps  or  semantic  layers)  is  facilitated  by  GIS.  GIS 


is  the  merging  of  cartography,  statistical  analysis, 
and  database  technology.  Geovisualization  tools 
have  been  used  extensively  in  electoral  studies, 
urbanization  studies,  and  various  geopolitical  stud¬ 
ies  of  empires,  wars,  boundaries,  and  trade  routes. 

In  the  mid-2000s,  several  publicly  available  GIS 
systems  called  virtual  globes  appeared,  with  Google 
Earth  being  the  most  popular  one.  These  systems 
allow  the  visualization  of  geographical  maps  and 
various  types  of  global  and  local  semantic  layers 
mapped  on  them  in  a  highly  dynamic,  interactive 
fashion.  Importantly,  these  systems  themselves  can 
collect  information  from  users  (Public  Participation 
GIS).  Because  of  their  wide  use  by  the  public  for 
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Figure  4  Data  Cartography  for  Multivariate  Data  Visualization 

Source:  Organisation  for  Economic  Co-operation  and  Development.  (2006).  OECD  factbook  2006:  Economic,  environmental 
and  social  statistics.  Paris:  Author. 

Notes:  The  nonlinear  data  map  is  shown  on  the  right  and  created  with  the  use  of  the  elastic  map  algorithm  implemented  in 
ViDaExpert  software  (http://bioinfo.curie.fr/projects/vidaexpert).  The  gray  shading  represents  the  density  of  points  (from  this 
semantic  map,  four  groups  of  countries  can  be  distinguished).  The  geographical  map  on  the  left  shows  geographical 
localization  of  some  of  the  points  corresponding  to  European  countries.  The  marker  gray  scale  here  represents  GDP  per  capita 
(from  white  to  black).  The  countries  close  on  the  geographical  map  are  colocalized  physically,  while  the  countries  close  on  the 
data  map  are  neighbors  in  the  multidimensional  space  of  indicators  and  have  similar  statistical  profiles.  The  axes  of  the  data 
map  themselves  do  not  have  an  explicit  interpretation;  what  matters  is  the  distance  between  points,  showing  similar  statistical 
profiles  for  points  with  a  short  distance  and  different  ones  for  those  with  a  larger  distance.  GDP  =  gross  domestic  product. 


various  purposes,  these  systems  have  a  high  poten¬ 
tial  to  be  used  in  political  and  social  studies  and 
are  the  subject  of  several  educational  courses  in 
social  science. 

Graph  (Network)  Visualization 

Graph  visualization  (where  a  graph  is  understood 
as  a  set  of  vertices  connected  by  arrows,  directed  or 
undirected),  also  known  as  network  maps,  is  a 
graphical  data  display  that  allows  one  to  visualize  a 
specific  type  of  connection  tables.  On  such  a  dis¬ 
play,  a  vertex  represents  an  object  or  an  observa¬ 
tion,  and  an  arrow  between  two  vertices  represents 
a  connection  between  two  objects.  Usually, 
only  arrows  representing  nonempty  connections  (or 


having  a  strength  exceeding  a  certain  threshold) 
are  introduced  into  the  graph.  On  the  network 
maps,  one  can  use  the  size  and  the  color  of  vertices 
to  visualize  some  features  of  the  objects  and  the 
thickness  and  the  color  of  arrows  to  represent 
some  features  of  connections  between  objects 
(typically,  the  strength  of  the  connection).  If  the 
connections  are  asymmetrical  (e.g.,  export  and 
import  flows,  migration  rates  between  countries), 
then  two  directed  arrows  connecting  the  same 
objects  can  be  used  to  represent  the  asymmetric 
flow  in  both  directions.  Network  maps  can  also 
represent  distances  between  objects  in  the  multidi¬ 
mensional  space  of  multivariate  observations. 
Network  maps  are  widely  used  for  visualizing 
social  network  structures,  trading  relations 
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between  countries,  migration  rates,  and  so  on.  A 
good  example  of  such  visualization  is  the  Mapping 
Globalization  Project  of  Princeton  University. 

The  principal  difficulty  in  using  network  map 
displays  is  that  they  easily  become  too  entangled 
with  a  growing  number  of  arrows.  This  poses 
technical  challenges  for  (a)  finding  a  good  layout 
for  placing  vertices  on  a  two-dimensional  plane  to 
minimize  the  number  of  arrow  intersections  or 
visualize  the  internal  structure  of  the  graph  better 
and  (b)  creating  handy  browsers  for  huge  network 
presentations.  For  example,  in  the  Walrus  net¬ 
work  visualization  tool,  three-dimensional  interac¬ 
tive  network  map  representations  are  combined 
with  a  hyperbolic  viewer  that  creates  a  fish  eye- 
like  distortion  to  magnify  (to  zoom)  a  particular 
part  of  the  graph  while  the  rest  of  it  remains  less 
detailed  and  serves  rather  for  orientation  and 
navigation. 

Multidimensional  Data  Cartography 

Inventors  of  statistical  graphics  thought  of  data 
displays  as  analogues  of  geographical  maps.  For 
bivariate  data  visualization  plots,  it  was  a  crucial 
invention  to  replace  geographical  latitude  and  lon¬ 
gitude  with  arbitrary  measurement  axes,  which 
required  the  notion  of  a  coordinate  system  formal¬ 
ized  in  Rene  Descartes’  La  Geometrie  written  in 
1637.  Thus,  on  a  scatterplot,  instead  of  putting 
geographically  colocated  objects  together,  one  puts 
together  the  points  corresponding  to  similar  com¬ 
binations  of  measured  x  and  y  values. 

For  multivariate  observations,  one  can  formu¬ 
late  a  similar  problem:  How  to  map  a  set  of 
objects  (vectors)  from  a  multidimensional  space 
onto  a  two-dimensional  plane  (or  into  a  three- 
dimensional  space)  such  that  the  objects  with 
similar  numerical  feature  profiles  would  be  located 
close  and  dissimilar  objects  would  be  located  at  a 
larger  distance  after  projection.  There  are  a  num¬ 
ber  of  statistical  methods  aiming  at  producing 
such  data  maps.  That  is  a  part  of  exploratory  data 
analysis. 

The  most  fundamental  method  for  mapping 
multidimensional  data  into  low-dimensional  space 
is  the  principal  component  analysis  (PCA),  invented 
by  Karl  Pearson.  The  method  can  be  applied  for 
the  numerical  tables  of  the  object-feature  type. 
The  set  of  table  rows  is  represented  as  a  cloud  of 


data  points  (vectors)  in  the  multidimensional 
space  of  features.  Then  the  PCA  constructs  an 
optimal  linear  two-dimensional  screen  (a  plane) 
embedded  into  the  multidimensional  space  of  data 
such  that  the  sum  of  squared  distances  between 
the  data  points  and  the  screen  is  minimal.  After 
that,  the  data  points  can  be  projected  onto  the 
screen  and  the  distribution  of  the  projections  is 
represented  using  standard  statistical  graphics 
techniques.  The  great  advantage  of  such  visualiza¬ 
tion  is  that  it  takes  into  account  all  numerical 
dimensions  at  the  same  time  and  not  only  two 
dimensions  as  on  the  standard  bivariate  scatter- 
plots.  The  disadvantage  of  all  multidimensional 
data  mapping  methods  is  that  the  new  axes  of  the 
resulting  scatterplot  do  not  have  explicit  meaning 
(in  general,  they  are  complex  functions  of  the 
initial  numerical  features).  What  matters  on  such 
data  visualization  displays  are  the  relative  dis¬ 
tances  between  projections  that  represent  the 
distances  between  objects  in  the  initial  multidi¬ 
mensional  space.  The  method  of  principal  compo¬ 
nents  as  a  multivariate  data  visualization  tool  was 
used  in  several  studies,  including  Political  Atlas  of 
the  Modern  World.  The  classical  PCA  method  can 
have  nonlinear  generalizations  (principal  mani¬ 
folds  approach),  and  in  this  case,  it  becomes  more 
precise  and  informative  in  visualizing  the  data 
structure.  In  this  case,  a  nonlinear  (curved)  two- 
dimensional  screen  is  constructed  in  the  multidi¬ 
mensional  space  and  used  in  the  same  way  for 
projecting  the  data  as  in  classical  PCA.  Other 
multivariate  data-mapping  techniques  such  as  cor¬ 
respondence  analysis,  self-organizing  maps,  metro 
maps  visualization  of  principal  trees,  multidimen¬ 
sional  scaling,  locally  linear  embedding,  and 
ISOMAP  have  been  recently  developed  and  applied 
in  political  and  social  science. 

Visualizing  multivariate  data  by  projection 
onto  a  low-dimensional  screen  is  a  subject  of  mul¬ 
tidimensional  data  cartography:  The  projection 
creates  a  data  map  that  is  an  alternative  to  the 
geographical  map  (see  Figure  4)  and  on  which  the 
objects  (not  necessarily  linked  to  geography)  with 
similar  feature  profiles  are  colocalized.  For  these 
data  maps,  the  methods  of  data  visualization 
developed  for  GIS  can  be  reused.  For  example,  an 
atlas  of  semantic  layers  can  be  created,  each  layer 
corresponding  to  the  values  of  a  particular  numer¬ 
ical  attribute. 
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Data  Visualization  Problems  and  Risks 

Despite  the  undeniable  role  of  data  visualization 
in  providing  an  efficient  tool  for  reasoning  on 
quantitative  data,  there  are  a  number  of  problems 
connected  with  the  possible  misuse  of  graphical 
data  displays.  Potentially,  this  can  lead  to  a  wrong 
interpretation  of  the  message  contained  in  the 
diagram. 

In  their  books,  Edward  Tufte  and  Howard 
Wainer  provided  numerous  examples  of  mistakes 
or  intentionally  introduced  distortions  in  data 
visualization  plots  that  can  result  in  a  message  that 
is  significantly  different  or  even  opposite  to  the  one 
contained  in  the  data.  Most  often  these  problems 
are  related  to  misuse  of  axis  scales,  color  palette, 
or  elements  of  design. 

Sabrina  Bresciani  and  Martin  Eppler  provided  a 
two-dimensional  classification  of  data  visualization 
problems.  First,  the  problems  can  be  induced  by  the 
designer  (intentionally  or  unintentionally)  or  by  the 
user  of  the  diagram.  Second,  these  problems  can  be 
classified  into  cognitive,  emotional,  and  social  ones. 
Cognitive  problems  can  be  connected  to  inappro¬ 
priate  use  of  graphical  elements,  lack  of  clarity,  or 
oversimplification  or  overcomplexification  of  the 
graphical  display  or  induced  by  heterogeneity  of 
target  user  groups  (e.g.,  women  have  a  more  accu¬ 
rate  perception  of  the  color  palette  than  men). 
Emotional  problems  can  be  connected  to  a  repul¬ 
sive  content  of  the  graphical  design.  Social  prob¬ 
lems  can  be  connected  to  cross-cultural  differences 
of  users  (e.g.,  in  some  Eastern  countries,  time  is 
shown  from  right  to  left,  and  the  meaning  of  red 
and  green  is  not  identical  to  that  accepted  by 
Western  countries). 

In  the  field  of  data  cartography,  the  main  prob¬ 
lem  of  data  visualization  is  the  possible  distortion 
of  mutual  distances  when  projecting  data  points 
from  a  multidimensional  to  a  low-dimensional 
space.  Distant  points  in  the  multidimensional  space 
can  be  projected  at  a  short  distance  in  the  classical 
PCA  method,  and  points  close  in  the  multidimen¬ 
sional  space  can  be  projected  at  a  large  distance  in 
applications  of  nonlinear  data-mapping  tech¬ 
niques.  There  are  visualization  methods  warning 
the  user  about  possible  distortions;  however,  in 
general,  any  conclusion  derived  from  analyzing  a 
data  map  should  be  verified  carefully  by  rigorous 
statistical  techniques  of  hypothesis  testing. 

Yet  another  source  of  problems  in  data  visu¬ 
alization  comes  from  the  use  of  categorical  or 


qualitative  measurements  for  which  no  stan¬ 
dardized  and  well-established  graphical  displays 
exist. 

Data  visualization  risks  should  be  distinguished 
from  challenges  posed  to  scientific  data  visualiza¬ 
tion  in  various  fields.  These  challenges  aim  at 
making  data  visualization  more  informative  and 
taking  advantage  of  recent  achievements  in  com¬ 
puter  graphics,  psychology,  and  computer  science. 
For  example,  10  top  data  visualization  problems 
were  formulated,  such  as  the  need  for  ameliorating 
usability  and  scalability  of  graphical  displays  as 
well  as  shifting  the  visualization  focus  from  visual¬ 
izing  static  structures  to  visualizing  dynamics. 
Several  international  conferences  (such  as  IEEE 
Visualization  and  IEEE  Information  Visualization) 
provide  a  wide  forum  for  answering  these  chal¬ 
lenges  through  international  and  interdisciplinary 
research  efforts. 

Andrei  Zinovyev 
Institut  Curie 
Paris,  France 

See  also  Correspondence  Analysis;  Data,  Missing;  Data, 
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Series  Analysis 
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Websites 

Data  visualization  portal:  http://datavisualization.ch 

The  premier  news  and  knowledge  resource  for  data 
visualization  and  infographics. 

Gapminder:  Unveiling  the  beauty  of  statistics  for  a  fact- 
based  worldview:  http://www.gapminder.org 
An  excellent  dynamic  and  interactive  data 
visualization  tool  and  a  database  of  socio- 
economical  indicators  collected  for  200  years  of 
world  history. 

Google  Public  Data  Explorer:  http://www.google.com/ 
publicdata 

Makes  large  data  sets  easy  to  explore,  visualize,  and 
communicate.  The  public  data  sets  available  contain 
several  fact  books  (such  as  OECD  Factbook)  that 
can  be  used  in  political  science  studies. 

Human  Development  Reports:  http://hdr.undp.org/en/ 
reports 

Contains  tons  of  examples  of  thoughtful  data 
visualization,  especially  in  recent  reports. 

Mapping  Globalization  Project  of  Princeton  University: 
http://qed.princeton.edu/main/MG/Data_and_Analysis 
The  aim  of  the  project  is  to  visualize  trading 
relations  between  countries. 
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The  concept  of  decentralization  refers  to  the  alloca¬ 
tion  of  power  in  organizations  or  social  structures 


usually  from  the  higher  to  the  lower-level  struc¬ 
ture!  s)/organization(s).  It  can  describe  either  an 
existing  structure  in  which  smaller  or  peripheral 
units  have  effective  powers  or  a  process  of  struc¬ 
tural  change  implying  a  shift  of  power  from  the 
center  to  these  units.  The  structure  concerned  can 
be  a  network  differentiated  into  a  center  and  a 
periphery,  a  hierarchical  organization  internally 
differentiated  into  sectors  or  subdivisions,  or  a  ter¬ 
ritorial  organization  differentiated  into  levels  of 
geographical  space. 

In  political  science,  decentralization  usually  refers 
to  multilevel  structures  of  government  or  adminis¬ 
tration.  It  results  from  (re)allocation  of  power  to 
elect  or  denominate  policymakers  of  legislative  or 
administrative  competences  or  of  fiscal  resources 
from  higher  to  lower  levels.  Given  interdependen¬ 
cies  between  levels,  decentralization  is  to  be  regarded 
as  a  relational  concept.  The  effects  of  policies  made 
by  lower-level  units  depend  on  the  power  of  the 
center,  and  shifts  of  power  toward  lower  levels  usu¬ 
ally  affect  the  degree  of  centralization  or  decentral¬ 
ization,  while  only  major  reforms  change  the  char¬ 
acter  of  a  political  system,  for  example,  by  turning 
a  unitary  into  a  federal  system.  Nevertheless, 
regardless  of  its  extent,  decentralization  affects  gov¬ 
ernance  and  democratic  legitimacy.  For  this  reason, 
it  is  not  only  a  matter  of  multilevel  power  games  but 
also  contested  for  normative  reasons. 

For  long,  decentralization  was  not  a  salient  con¬ 
cept  in  political  theory.  History  of  government  was 
about  centralization  of  power  to  manage  conflicts 
of  competing  local  elites  and  social  groups.  The 
modern  state  evolved  in  a  process  of  center  forma¬ 
tion.  The  20th  century  saw  a  trend  toward  inter-  or 
transnational  governance.  However,  these  develop¬ 
ments  never  went  in  only  one  direction.  Central¬ 
ization  was  also  often  thwarted  by  countervailing 
powers  from  below.  In  the  final  decades  of  the  20th 
century,  a  general  trend  toward  more  decentraliza¬ 
tion  gained  ground  in  many  developed  states.  At 
the  same  time,  decentralization  became  a  kind  of 
paradigm  for  governance  in  developing  countries. 

In  political  theory  and  practice,  decentraliza¬ 
tion  is  now  mostly  regarded  as  a  preferable  alter¬ 
native  to  centralization.  As  a  principle  guiding  the 
organization  of  government,  it  is  justified  by  dif¬ 
ferent  normative  theories.  All  of  them  can  be 
traced  back  to  different  political  theories  and 
ideas,  but  all  are  under  dispute,  as  is  exemplified 
by  the  writings  of  Johannes  Althusius  and  Jean 
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Bodin  or  the  controversies  between  the  American 
Federalists  and  anti-Federalists. 

The  first  reason  for  decentralization  is  derived 
from  the  principle  of  subsidiarity.  Rooted  in  the 
political  philosophy  of  reformed  Protestantism  in 
the  early  17th  century  and  in  social  theory  of  the 
Catholic  Church  of  the  19th  century,  it  defends  the 
priority  of  small  social  communities  against  large 
societies  and  a  centralized,  bureaucratic  state. 
While  this  collectivist  view  reemerged  in  communi- 
tarist  theories,  the  principle  has  a  strong  impact  in 
discourses  on  constitutional  design.  Here,  subsid¬ 
iarity  has  turned  into  a  legal  principle  stipulating  a 
prerogative  for  decentralization  in  case  of  dispute. 

To  justify  this  prerogative,  two  main  arguments 
are  relevant:  The  first  holds  that  decentralization 
improves  democracy.  If  government  is  closer  to  the 
people,  communication  between  citizens  and  their 
representatives  should  be  more  effective  and  indi¬ 
vidual  preferences  should  have  better  chances  to  be 
considered  in  political  decisions.  Moreover,  decen¬ 
tralization  is  said  to  reflect  plurality  and  regional 
differentiation  of  societal  interests.  Liberalist  theo¬ 
ries  prefer  decentralization  as  a  device  against 
dictatorship  either  of  autocrats  or  of  majorities. 
Against  these  assumptions,  empirical  studies  have 
revealed,  that  governance  at  lower  levels  is  often 
more  elitist  and  exclusive  compared  with  represen¬ 
tative  democracy  in  larger  territorial  units  and  that 
it  may,  under  particular  conditions,  support 
regional  conflicts  or  separatism. 

Economic  theories  stipulate  that  decentralization 
fosters  efficiency  of  policy  making.  If  citizens  have 
an  opportunity  to  choose  between  alternative  sup¬ 
plies  of  public  goods  or  services  provided  by  lower- 
level  governments  demanding  different  tax  burdens, 
they  force  governments  competing  for  taxpayers  to 
make  every  effort  to  find  an  optimal  ratio  of  costs 
and  benefits.  However,  to  achieve  this  positive 
effect,  two  basic  conditions  must  apply:  First,  a  suf¬ 
ficient  number  of  taxpayers  from  different  groups 
of  a  society  must  be  able  and  willing  to  move 
between  jurisdictions.  Second,  policies  should  not 
produce  external  effects  spilling  over  the  borders  of 
a  jurisdiction.  In  practice,  moving  to  change  juris¬ 
dictions  (“voting  by  feet”)  is  hardly  a  real  opportu¬ 
nity  for  ordinary  citizens,  and  external  effects  occur 
frequently.  Therefore,  to  encourage  innovation, 
economic  theory  now  emphasizes  “yardstick  com¬ 
petition,”  in  which  suppliers  of  goods  and  services 


are  evaluated  against  best  practices  in  that  economic 
sector.  But  again,  this  outcome  can  be  achieved  only 
under  particular  provisions.  Moreover,  if  policies 
have  redistributive  effects,  decentralization  is  prob¬ 
lematic.  In  regionally  divided  societies,  it  can  cause 
vicious  cycles  leading  to  inequality,  inefficiency,  and 
instability. 

Finally,  political  scientists  praise  decentralization 
as  a  device  to  avoid  a  concentration  of  power  in  a 
government.  Powers  should  be  divided  not  only 
between  executive,  legislative,  and  judiciary  but  also 
between  central  and  regional  or  local  levels. 
Although  the  general  argument  is  convincing,  it 
leaves  open  the  question  of  which  powers  should  be 
centralized  and  which  should  be  decentralized. 
Moreover,  decentralization  can  divide  powers  for 
legislation,  as  is  typical  in  a  federal  state,  or  it  can 
refer  to  executive  powers  while  legislation  remains 
to  the  center,  as  is  often  the  case  in  unitary  states. 
There  is  no  convincing  evidence  to  conclude  which 
system  creates  better  checks  and  balances  and  which 
one  is  protected  better  against  centralist  trends. 

Normative  disputes  on  the  effects  of  decentral¬ 
ization  can  be  settled  by  empirical  research. 
However,  decentralization  is  difficult  to  measure. 
First,  as  mentioned  above,  power  is  a  multidimen¬ 
sional  concept.  It  can  be  based  on  formal  compe¬ 
tences,  on  effective  capacities  to  make  or  impede 
decisions,  on  support  by  electors/voters  or  influen¬ 
tial  associations,  or  on  fiscal  resources.  Usually, 
scholars  find  no  congruence  in  the  allocation  of 
these  different  sources  of  power,  and  weighting 
their  relative  significance  is  rather  problematic. 
Second,  the  degree  of  centralization  and  decentral¬ 
ization  can  vary  between  the  legislative,  executive, 
and  judiciary  branches  of  government.  The  party 
system  reflecting  the  relations  between  government 
and  society  may  also  matter  when  measuring  decen¬ 
tralization  and  may  differ  from  other  institutional 
dimension  of  multilevel  government.  Third,  to  mea¬ 
sure  the  degree  of  decentralization,  researchers  have 
to  assess  the  power  relations  between  levels  of  gov¬ 
ernment.  Decentralization  is  often  identified  with 
“self-rule,” — that  is,  the  power  of  regional  or  local 
governments  to  decide  on  their  affairs  autono¬ 
mously.  But  usually,  central  governments  can  inter¬ 
vene  in  lower-level  policy  making  by  different 
means  such  as  regulation,  supervision,  incentives,  or 
capacity  building.  On  the  other  hand,  decentralized 
authorities  may  be  able  to  evade  central  control. 
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Also,  in  federal  systems,  regional  governments  (i.e., 
representatives  from  states,  Lander,  or  provinces) 
participate  in  decisions  of  the  central  government 
(“shared  rule”).  Finally,  the  real  power  generated 
by  decentralization  of  formal  competences  or 
resources  depends  on  governance  capacities  of 
lower  levels.  Under  certain  condition,  decentraliza¬ 
tion  may  overburden  governments  with  unsolvable 
conflicts  and  problems  and  at  the  same  time  unbur¬ 
den  the  central  government  from  pressure. 

Nevertheless,  by  constructing  indicators,  schol¬ 
ars  have  been  able  to  provide  information  about 
decentralization  in  governments.  In  contrast  to  eco¬ 
nomics,  where  fiscal  issues  are  considered,  political 
scientists  try  to  measure  relative  political  power  of 
governments.  By  applying  their  Regional  Authority 
Index,  Gary  Marks,  Liesbeth  Hooghe,  and  Arjan 
H.  Schakel  (2008)  recently  revealed  that,  among  the 
42  advanced  industrialized  countries  covered  in 
their  study,  29  saw  an  increase  of  regional  powers, 
while  only  two  countries  saw  a  decline.  Studies  on 
local  government  have  portrayed  a  similar  evolu¬ 
tion,  although  often  with  the  qualification  that 
many  cities  have  suffered  from  fiscal  constraints 
during  the  last  decades.  In  general,  decentralization 
seems  to  constitute  a  general  trend  in  government, 
at  least  in  developed  countries. 

This  trend  can  be  explained  by  different  factors. 
One  is  the  change  in  public  policies,  in  particular, 
the  shift  from  distributive  welfare  policies  to  the 
provision  of  services.  The  process  of  democratiza¬ 
tion  is  also  said  to  have  supported  decentralization. 
Many  governments  have  to  respond  to  pressure 
from  new  nationalist  or  regionalist  movements 
demanding  political  autonomy.  Economic  global¬ 
ization  has  reduced  the  effects  of  national  policies 
to  support  firms  and  increased  the  relevance  of 
regional  and  local  responsibilities  for  infrastructure 
and  administrative  services.  Discourses  about  pub¬ 
lic  sector  reform  have  been  guided  by  the  paradigm 
of  decentralization,  which  influenced  institutional 
reforms  in  many  countries.  Besides  these  forces 
toward  decentralization,  institutionalist  theories 
rightly  point  out  that  the  “stickiness,”  the  ability  to 
succeed,  of  existing  institutions  constrains  a  real- 
location  of  power.  In  multilevel  governments, 
proposals  to  decentralize  raise  conflicts  between 
central,  regional,  and  local  governments.  Even  if 
external  forces  or  approved  reform  concepts 
support  decentralization,  central  governments  try 


to  maintain  their  powers.  Therefore,  the  outcome 
of  processes  of  decentralization  usually  reflects 
compromises  between  actors  involved  in  bargain¬ 
ing  for  power,  which  is  often  perceived  as  a  zero- 
sum  game.  However,  these  explanations  do  not 
add  up  to  a  theory  and  most  of  the  individual 
hypotheses  are  still  not  sufficiently  tested  in  empir¬ 
ical  research. 

Unlike  what  is  suggested  in  normative  reason¬ 
ing,  there  is  also  no  theory  explaining  the  effects  of 
decentralization,  whichever  aspect  is  under  consid¬ 
eration.  On  the  contrary,  as  Daniel  Treisman  has 
shown  in  a  comprehensive  review  of  the  relevant 
literature,  available  empirical  studies  provide  at 
best  mixed  results.  What  decentralization  brings 
about  depends  not  only  on  the  particular  mode  of 
decentralization  but  also  on  additional  conditions 
of  governance,  such  as  the  patterns  of  democracy, 
organization  of  society,  economic  situation,  or 
political  culture.  As  a  rule,  benefits  of  decentraliza¬ 
tions  have  to  be  charged  up  against  the  costs,  and 
the  evaluation  of  the  balance  depends  on  interests, 
standards,  and  perspectives.  For  this  reason,  decen¬ 
tralization  should  not  be  taken  as  a  value  or  prin¬ 
ciple  for  organizing  government.  Rather,  it  should 
be  regarded  as  an  issue  in  institutional  or  constitu¬ 
tional  policy,  which,  from  a  normative  point  of 
view,  requires  a  careful  study  of  positive  and  nega¬ 
tive  effects  under  particular  conditions,  and,  from 
an  analytical  point  of  view,  is  a  matter  of  political 
conflicts  and  decisions  making. 
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Delegation 


Delegation  occurs  in  politics  whenever  one  actor 
or  body  grants  authority  to  another  to  act  on 
behalf  of  or  to  carry  out  a  function  for  the  first 
in  a  political  process.  In  such  general  terms,  del¬ 
egation  is  ubiquitous  and  a  defining  feature  of 
politics  beyond  direct  individual  actions.  Voters 
delegate  to  elected  officials  in  representative  gov¬ 
ernment;  governments  delegate  to  ambassadors 
in  foreign  affairs;  legislatures  delegate  to  com¬ 
mittees  the  authority  to  study  policy  issues  and 
report  bills  and  to  bureaucracies  the  authority  to 
make  policy. 

Because  of  the  breadth  of  the  topic,  this  entry 
focuses  specifically  on  delegation  from  legislatures 
to  bureaucracies  in  administrative  and  bureau¬ 
cratic  governments.  Delegation  has  become  inher¬ 
ent  in  this  mode  of  governance  as  the  reach  of 
public  policy  has  expanded  beyond  what  elected 
legislatures  can  possibly  handle.  Such  delegation 
presents  particularly  interesting  institutional  and 
political  problems  in  the  United  States  due  to  sepa¬ 
ration  of  powers,  and  thus,  this  entry  focuses  even 
more  specifically  on  the  rationale  for  and  determi¬ 
nants  of  delegation  in  this  case.  Because  of  space 
limitations,  this  entry  does  not  address  whether 
agency  use  of  delegated  authority  is  responsive  to 
the  policy  goals  of  external  political  actors 
(Congress,  the  president,  courts,  interest  groups) 
or  the  channels  by  which  this  responsiveness  is 
effected. 

Scope  of  Delegation  to  Bureaucracies 
in  the  United  States 

Delegation  to  bureaucracy  presents  important 
questions  of  political  and  democratic  legitimacy 
because  it  often  imparts  to  the  bureaucracy  some 


power  to  shape  the  law.  In  practice,  delegation 
involves  the  bureaucracy  in  making  law,  beyond 
the  nonlegislative  functions  of  executing  laws 
passed  by  Congress.  Yet  Article  1  of  the 
Constitution  explicitly  allocates  the  power  to 
make  law  only  to  Congress.  Moreover,  delegation 
of  lawmaking  power  implies  that  bureaucracies 
blend  the  formal  powers  (legislative,  executive, 
and  judicial)  that  the  Constitution  separates.  The 
Supreme  Court  held  such  delegation,  in  its  most 
expansive  form,  unconstitutional  in  its  1935  deci¬ 
sion  in  Schechter  Poultry  Corporation  v.  United 
States ,  which  invalidated  the  administrative  nerve 
center  of  Franklin  Delano  Roosevelt’s  New  Deal. 
Since  then,  the  typical  formulation  to  legitimize 
Congress’s  delegation  of  its  Article  I  power  to 
make  law  is  that  Congress  articulates  at  least  a 
“general  standard”  to  guide  agency  policy  making, 
and  agencies  merely  “fill  in  the  details.”  However, 
no  statute  has  been  invalidated  by  the  Supreme 
Court  on  delegation  grounds  since  1935;  even 
extremely  broad  guidelines  stipulating  only  that 
agencies  regulate  “in  the  public  interest”  have  been 
upheld.  Thus,  it  is  debatable  whether  Congress 
faces  any  meaningful  legal  restrictions  on  its  dele¬ 
gation  to  agencies. 

James  M.  Landis,  one  of  the  foremost  legal 
scholars  of  regulation  and  architect  of  the  Securities 
and  Exchange  Commission,  disputed  such  cri¬ 
tiques  of  delegation  on  the  grounds  that,  first,  each 
constitutional  branch  of  government  possessed 
ample  checks  over  administrative  agencies,  and 
second,  delegation  to  administrative  boards  was 
necessary  to  reconcile  democratic  government 
with  the  formidably  complex  policy  problems  cre¬ 
ated  by  economic  developments.  This  defense  has 
retained  intellectual  currency  since  Landis  first 
offered  it. 

Causes  and  Effects  of  Delegation 

The  history  of  delegation  to  bureaucracy  in  the 
United  States  is  a  history  of  Congress  working  hard 
to  give  away  a  measure  of  formal  power  allocated 
to  it  in  the  Constitution.  Since  Congress  often 
keeps  close  watch  on  its  constitutional  preroga¬ 
tives,  this  is  a  choice  that  is  interesting  to  try  to 
explain.  Scholars  have  offered  a  wide  variety  of 
theories  to  do  so.  Broadly  speaking,  the  contempo¬ 
rary  political  science  literature  avers  that  delegation 
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of  policy-making  authority  to  bureaucracies  allows 
a  legislature  to  achieve  policy  ends  that  it  could  not 
achieve  through  legislation  itself. 

First,  Congress  may  delegate  simply  because  of 
the  opportunity  cost  of  its  time  spent  on  any  one 
issue.  For  example,  in  policy  pertaining  to  the  elec¬ 
tromagnetic  spectrum  (EMS),  Congress  could  set 
technological  standards  and  allocate  spectrum 
rights  to  various  classes  of  users  itself.  Instead  it  has 
charged  the  Federal  Communications  Commission 
(FCC)  to  do  so  “in  the  public  interest.”  Congress 
can  therefore  reserve  its  attention  for  broader  and 
more  nationally  significant  policy  choices,  leaving 
the  FCC  with  the  relatively  more  arcane  choices 
over  EMS  use. 

This  rationale  suggests  that  delegation  presents 
Congress  with  a  “principal-agent  problem.”  The 
bureaucratic  agent  may  pursue  different  goals  with 
its  authority  from  those  Congress  (the  “principal”) 
would  pursue.  Several  scholars  have  noted  that  leg¬ 
islative  and  executive  policy  goals  are  more  congru¬ 
ent  under  unified  than  under  a  divided  government. 
This  reasoning  has  led  to  one  of  the  more  robust 
empirical  findings  on  delegation:  legislative  delega¬ 
tion  of  policy-making  authority  to  bureaucracies 
increases  under  a  unified  government.  David  Epstein 
and  Sharyn  O’Halloran  (1999)  established  this  at 
the  federal  level  in  a  study  of  delegation  from  1947 
to  1995.  John  Huber  and  Charles  Shipan  (2002) 
further  established  that  this  pattern  holds  at  the 
state  level  in  the  United  States  and  comparatively 
among  the  Organisation  for  Economic  Co-operation 
and  Development  countries  (where  “policy  con¬ 
flict”  between  the  legislature  and  executive  is  opera¬ 
tionalized  by  minority  coalition  governments). 

A  second  and  closely  related  explanation  for 
delegation  is  that  Congress  delegates  to  tap  into 
expertise  that  bureaucrats  have  over  some  policy 
domain.  For  instance,  both  the  FCC  and  the 
Department  of  Justice  make  policy  with  respect  to 
market  competition;  by  delegating  this  authority, 
Congress  can  leverage  the  expertise  these  organiza¬ 
tions  have  developed  in  the  economic  analysis  of 
the  competitive  effects  of  industrial  structure.  The 
implicit  presumption  is  that  Congress  does  not 
have  or  wish  to  develop  this  expertise  itself,  which 
at  some  level  is  presumably  a  result  of  the  oppor¬ 
tunity  cost  of  time  spent  doing  so. 

Formally,  delegation  of  policy-making  authority 
is  often  conceived  as  granting  a  “zone  of  discretion” 


to  an  agent,  a  set  of  policies  among  which  the  agent 
can  choose  freely  and  without  further  interference 
from  the  legislature.  The  legislature  decides  which 
policies  are  in  the  zone  and  which  are  not.  This  was 
formulated  in  1984  by  Bengt  Holmstrom  and  was 
first  applied  in  the  political  science  literature  on 
bureaucracy  by  Epstein  and  O’Halloran  (1994). 
The  critical  strategic  problem  confronted  by 
Congress  is  to  induce  the  agency  to  use  its  expertise 
as  Congress  itself  would  use  it,  if  Congress  pos¬ 
sessed  it,  by  tailoring  the  zone  of  discretion  in  par¬ 
ticular  cases.  Because  the  zone  of  discretion  is  a 
blunt  instrument,  this  problem  generally  cannot  be 
perfectly  “solved”  from  Congress’s  point  of  view;  it 
is  inherent  that  the  agent  will  sometimes  use  its 
information  to  pursue  policies  Congress  does  not 
always  prefer.  This  “agency  loss”  is  simply  a  neces¬ 
sary  cost  of  an  otherwise  desirable  process  and  does 
not  undermine  the  case  for  delegation  in  normative 
terms.  A  Congress  aware  of  this  agency  loss  would 
simply  not  delegate  if  it  were  too  large  compared 
with  the  benefits  of  delegation. 

Empirical  evidence  bears  out  that  bureaucratic 
expertise  is  a  rationale  for  delegation.  For  instance, 
in  their  comprehensive  study  of  major  postwar 
federal  legislation,  Epstein  and  O’Halloran  (1999) 
found  that  laws  pertaining  to  more  technically 
complex  policy  areas  delegated  more  authority  to 
bureaucratic  agencies,  net  of  constraints  simulta¬ 
neously  placed  on  those  agencies,  than  laws  per¬ 
taining  to  less  complex  policy  areas. 

Of  course,  agencies  cannot  always  be  presumed 
to  possess  expertise,  especially  when  they  are  ini¬ 
tially  created  to  deal  with  a  novel  policy  problem. 
Delegation  can  also  help  induce  agencies  to  develop 
expertise.  First,  agencies  have  an  incentive  to 
develop  expertise  if  they  anticipate  that  their 
policy  authority  over  tasks  they  deem  important 
will  increase  as  a  result  (Sean  Gailmard  &  John 
Patty,  2007).  Second,  delegating  authority  over  a 
task  that  an  agent  considers  important  provides  an 
incentive  for  the  agent  to  acquire  information 
about  how  best  to  perform  the  task.  Given  the 
delegated  authority  over  an  important  task,  the 
agent  remains  ignorant  at  its  peril;  acquiring 
expertise  allows  the  agent  to  obtain  private  bene¬ 
fits  from  good  performance. 

Third,  delegation  can  act  as  a  commitment  by 
Congress  to  a  future  course  of  policy.  For  instance, 
under  the  Steel  Trigger  Price  Mechanism,  the 


550  Delegation 


International  Trade  Commission  (ITC)  investigates 
possible  instances  of  “dumping”  in  the  United 
States  by  international  trading  partners.  The  policy 
commits  the  United  States  to  import  restrictions 
whenever  dumping  is  found.  If  Congress  had  not 
delegated  this  authority  to  the  ITC  but  instead 
retained  the  authority  to  determine  policy  on  a 
case-by-case  basis,  import  restrictions  in  response 
to  dumping  would  not  be  so  evident.  It  would 
depend  on  the  policy  preferences  of  members  of 
Congress  and  the  president  toward  import  restric¬ 
tions.  A  protrade  Congress  or  president  would  be 
sufficient  to  prevent  enactment  of  import  restric¬ 
tions,  even  if  illegal  dumping  were  identified.  By 
delegating  fact-finding  authority  and  program¬ 
ming,  a  Congress  at  one  time  can  preprogram  the 
policy  response  to  changing  conditions  even  after  a 
new  Congress  is  seated.  In  celebrated  analyses  of 
the  political  origins  of  administrative  institutions, 
Matthew  McCubbins,  Roger  Noll,  and  Barry 
Weingast  (1987)  offer  separate  but  related  argu¬ 
ments  that  commitment  of  future  policy  is  an 
important  rationale  for  delegation.  Such  delega¬ 
tion  as  commitment  is  only  effective  if  the  admin¬ 
istering  agency  is  able  to  hold  steady  in  changing 
political  winds;  thus,  T.  M.  Moe  predicts  that  in 
such  cases  agencies  will  be  deliberately  insulated 
from  and  unresponsive  to  future  Congresses  and 
presidents. 

Opportunity  cost  theories  suggest  implicitly  that 
delegation  inherits  whatever  normative  appeal  is 
inherent  in  congressional  policy  goals.  Commitment 
theory  is  more  ambiguous  because  it  suggests  that 
one  legislative  coalition  achieves  its  goals  by  com¬ 
promising  the  ability  of  future  coalitions  to  do  the 
same,  thus  dampening  the  responsiveness  of  public 
policy  to  shifts  in  congressional  goals. 

Besides  explanations  based  on  achieving  policy 
ends,  scholars  have  also  argued  that  delegation 
allows  legislators  to  evade  accountability  and  dif¬ 
ficult  policy  choices.  Congress  may  delegate  to 
shift  blame  if  policy  may  be  unpopular  or  fails  to 
achieve  desired  results.  In  some  policy  domains, 
success  is  expected  and  failure  is  a  high-profile 
event.  In  airline  safety,  the  public  does  not  cele¬ 
brate  flights  that  land  safely.  When  airplanes  oper¬ 
ate  safely  the  public  may  even  protest  that  safety 
and  security  measures  are  too  onerous  in  terms  of 
time  and  money.  But  when  airplanes  crash  we  can 
expect  recriminations  about  gaps  in  the  system. 
Much  the  same  story  applies  to  homeland  security 


issues.  The  blame-shifting  explanation  suggests 
that  Congress  delegates  authority  over  these  policy 
areas  so  that  Congress  will  not  be  held  responsible 
in  case  of  a  problem,  and  doing  so  may  even  allow 
Congress  to  claim  credit  for  solving  problems.  This 
explanation  implicitly  assumes  the  public  does  not 
realize  that  Congress  chose  to  arrange  the  system 
this  way  and  is  thus  still  ultimately  responsible. 
For  instance,  if  the  decision  to  delegate  is  separated 
by  a  long  interval  from  resulting  problems,  it  may 
not  make  sense  for  voters  to  hold  sitting  legislators 
answerable  for  the  delegation  and  oversight  of  the 
agency’s  initial  policy  choices. 

Conclusion 

Delegation  from  Congress  to  bureaucratic  agen¬ 
cies,  including  the  delegated  power  to  make  law,  is 
pervasive  in  the  United  States.  Indeed,  it  is  a  given 
in  most  major  legislation  passed  since  the  New 
Deal.  Delegation  to  administrative  agencies  cannot 
remove  politics  from  policy  making;  it  can  merely 
move  the  political  choices  around  in  the  policy 
process  and  push  politics  inexorably  further  into 
administration.  Delegation  is  a  choice  that  requires 
both  normative  justification  and  positive  explana¬ 
tion.  The  positive  explanations  reviewed  above  are 
not  necessarily  mutually  exclusive,  though  they 
carry  dramatically  different  normative  implica¬ 
tions  for  how  delegation  from  legislatures  to 
bureaucracies  should  be  evaluated. 
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Deliberative  Policy  Making 


Deliberative  policy  making  applies  principles  de¬ 
rived  from  the  theory  of  deliberative  democracy  to 
public  policy  making.  The  theory  of  deliberative 
democracy  responds  to  the  perceived  shortcomings 
of  a  majoritarian  democracy  where  actors  bargain 
to  defend  their  interests  (strategic  action),  bargain¬ 
ing  and  voting  procedures  do  not  change  actors’ 
preferences  while  discouraging  social  learning,  the 
strong  impose  their  will  on  the  weak,  and  con¬ 
straints  on  lying,  deception,  and  manipulation  are 
few  and  far  between.  Against  this  vision  of  democ¬ 
racy  as  a  pluralist  interest  aggregation,  deliberative 
democracy  upholds  an  alternative  model  based  on 
discussion  and  persuasion  where  actors  must 
defend  and  criticize  proposals  with  reasoning  they 
believe  others  will  accept  (communicative  action), 
public  discussion  can  transform  actors’  prefer¬ 
ences,  the  majority  prefers  to  compromise  with  the 
minority,  and  inclusion  in  public  discussion  of  all 
social  positions  and  perspectives  aims  to  maximize 
social  learning.  Deliberative  policy  making  aims  for 
genuine  preference  transformation  rather  than 
mere  preference  aggregation  by  expanding  in  gov¬ 
ernance  institutions  the  role  of  deliberation  based 
on  norms  such  as  reason  giving,  publicity,  joint 
problem  solving,  and  inclusive  participation.  This 
entry  reviews  the  aims  and  varieties  of  deliberative 
policy  making  before  assessing  its  achievements 
and  shortcomings. 

Aims 

Deliberative  policy  making  seeks  to  reach  policy 
decisions  that  are  both  democratically  legitimate 


and  technically  sound.  Especially  as  applied  to 
environmental  regulation,  theorists  such  as  Walter 
Baber  and  Robert  Bartlett  have  defended  the  sub¬ 
stantive  claim  that  expanding  deliberation  in  pol¬ 
icy  making  increases  the  ecological  rationality  of 
decisions.  Unlike  traditional,  hierarchical  forms  of 
administrative  decision  making,  deliberative,  hori¬ 
zontal  methods  are  said  to  be  more  flexible  and 
robust  in  the  face  of  the  epistemic  complexity  and 
uncertainty  that  characterizes  such  policy  prob¬ 
lems  in  the  “network  society.”  By  revitalizing  the 
role  of  nonexpert  citizens  in  policy  inquiry  and 
generating  policy-relevant  information  on  as  wide 
a  social  basis  as  possible,  deliberative  policy  mak¬ 
ing  helps  guard  against  closing  off  debate  prema¬ 
turely  and  discarding  policy  alternatives. 

A  normatively  demanding  conception  of  public 
deliberation,  which  theorists  argue  is  superior  to 
pluralist  bargaining,  is  said  to  promote  these  aims. 
Public  deliberation  as  conceived  by  theorists  such  as 
James  Bohman  is  not  merely  talk;  rather,  it  involves 
the  exchange  of  reasons  aimed  at  evaluating  alter¬ 
native  proposals  for  action.  Citizens  submit  their 
ideas  and  beliefs  for  discussion  and  criticism  by  fel¬ 
low  citizens,  making  it  less  likely  that  selfish  or 
poorly  thought-out  proposals  will  survive  the 
debate.  Note  that  public  deliberation  requires  a 
considerable  level  of  freedom  and  equality:  inclu¬ 
sion  of  everyone  affected  by  a  decision,  substantial 
political  equality  including  equal  opportunities  to 
participate  in  deliberation,  equality  in  methods  of 
decision  making  and  setting  the  agenda,  free  and 
open  exchange  of  information  and  reasons  suffi¬ 
cient  to  acquire  an  understanding  of  both  the  issue 
in  question  and  the  opinion  of  others.  These  condi¬ 
tions  are  much  more  restrictive  than  the  standards 
usually  applied  to  political  debate  and  difficult  to 
achieve  in  practice. 

Efforts  to  adapt  such  principles  of  public  delib¬ 
eration  for  practical  use  have  yielded  public  consul¬ 
tation  techniques  that  go  beyond  government  agen¬ 
cies  merely  seeking  public  input  from  individual 
citizens  or  firms.  Rather,  their  goal  is  to  generate 
high-quality  deliberation  conceived  as  dialogue 
among  the  participants.  These  techniques  include 
the  consensus  conferences  pioneered  in  Denmark, 
the  deliberative  opinion  polls  developed  by  James 
Fishkin,  collaborative  governance  schemes  involv¬ 
ing  multiple  stakeholders,  and  the  citizen  jury.  The 
next  section  classifies  these  varieties  of  deliberative 
policy  making,  using  a  simple  typology. 
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Varieties:  Hot  Versus  Cold, 

Weak  Versus  Strong 

Deliberative  policy-making  techniques  vary  along 
two  dimensions:  partisanship  and  authority. 
Deliberative  forums  differ  widely  in  how  they  con¬ 
front  the  question  of  partisanship.  Some  delibera¬ 
tive  forums  seek  to  accommodate  partisanship  by 
inviting  partisan  stakeholders  to  participate  in  the 
deliberations  and  then  structuring  the  proceedings 
so  as  to  maximize  deliberation  while  minimizing 
bargaining  (consensus  conference,  collaborative 
governance).  Stakeholders  are  included  in  such 
forums  on  the  assumption  that  decisions  reached 
without  their  participation  lack  the  broad-based 
acceptance  required  to  translate  decisions  into 
practice.  Because  they  aim  to  accommodate  plural¬ 
ism,  partisan  forums  may  be  thought  of  as  a  delib¬ 
erative  extension  of  pluralist  interest  aggregation. 
Nonpartisan  forums,  by  contrast,  seek  to  minimize 
partisanship  by  excluding  interest  groups  from  the 
deliberations  (deliberative  opinion  poll,  citizen 
jury)  in  the  expectation  that  interest  representation 
inevitably  diminishes  partisans’  ability  to  follow 
the  norms  of  deliberation  outlined  above.  Here  the 
goal  is  to  approximate  Jurgen  Habermas’s  “ideal 
speech  situation,”  which  brackets  forms  of  manip¬ 
ulation  and  coercion  external  to  the  logic  of  rea¬ 
soned  argument  and  persuasion.  Insofar  as  they 
hope  to  transcend  pluralism,  nonpartisan  forums 
may  be  thought  of  as  a  deliberative  alternative  to 
pluralist  interest  aggregation.  The  respective  vir¬ 
tues  of  “hot”  and  “cold”  forms  of  public  delibera¬ 
tion  are  subject  to  increasingly  refined  empirical 
analysis  in  the  literature. 

A  further  variation  among  forms  of  deliberative 
policy  making  is  their  degree  of  political  authority. 
This  is  expressed  as  the  difference  between  “strong” 
and  “weak”  publics  in  the  literature.  Weak  publics 
such  as  the  informal  public  sphere  engage  only  in 
opinion  formation,  while  strong  publics  such  as 
parliaments  also  make  collectively  binding  deci¬ 
sions  about  the  proposals  before  them.  Endowing 
nonelected,  nonexpert,  ad  hoc  deliberative  forums 
with  the  authority  to  make  policy  decisions  is  a 
contentious  proposition,  however.  Is  it  a  permissi¬ 
ble  delegation  of  legislative  authority,  or  does  it 
violate  basic  norms  of  representative  democracy? 
In  practice,  weak  publics  predominate.  Deliberative 
policy  making  is  typically  advisory  in  nature;  par¬ 
ticipants  are  confined  by  sponsoring  agencies  to 


preparing  recommendations  as  a  basis  for  decision 
making  by  elected  officials  and  public  managers. 

Deliberative  forums  can  be  evaluated  in  terms  of 
their  deliberative  capacity,  impact,  and  legitimacy. 
High-quality  deliberation  may  be  more  easily 
achieved  in  nonpartisan  forums,  while  partisan 
forums  may  have  a  greater  impact  on  policy  deci¬ 
sions  and  enjoy  greater  legitimacy — especially  where 
external  conditions  provide  sufficient  incentives  for 
powerful  stakeholders  to  prefer  deliberation  to 
other  ways  of  resolving  their  differences,  such  as 
litigation.  However,  given  the  tremendous  variabil¬ 
ity,  first,  among  designed  deliberative  forums  and, 
second,  among  the  political,  administrative,  and 
legal  contexts  in  which  they  must  function,  these 
empirical  relationships  are  not  particularly  robust. 
The  complex  interplay  between  deliberative  designs, 
their  internal  dynamics,  and  external  context  limits 
the  potential  for  generalizing  about  deliberative 
policy  making. 

Critiques  of  Deliberative  Policy  Making 

This  section  considers  some  popular  objections  to 
deliberative  policy  making — namely,  that  its  con¬ 
ception  of  democratic  legitimacy  is  substantively 
too  demanding,  that  it  is  too  costly,  and  that  it 
cannot  accommodate  interests. 

Deliberative  Policy  Making  Asks  Too 
Much  of  Democratic  Legitimacy 

It  is  rarely  realistic,  critics  contend,  to  expect 
reasoned  debate  to  lead  to  a  consensus  supported 
by  all  participants.  Public  deliberation,  however, 
does  not  demand  that  all  participants  support  an 
agreement  for  the  same  reasons,  or  that  they  come 
to  any  agreement  at  all.  Bohman’s  “plural  agree¬ 
ment,”  for  example,  only  requires  a  commitment 
to  continued  cooperation  in  public  deliberation, 
even  with  persistent  disagreements.  The  very  condi¬ 
tions  of  public  deliberation — publicity,  reversibility 
of  decisions,  and  social  inclusiveness — encourage 
an  ongoing  cooperation  with  others  who  disagree, 
which  is  at  least  not  unreasonable.  The  give-and- 
take  of  reasons  in  public  deliberation  makes  it  less 
likely  that  irrational  and  untenable  arguments  will 
decide  outcomes.  Decision-making  procedures 
allow  revisions  of  arguments  and  decisions  and 
even  include  procedures  that  either  take  up  features 
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of  defeated  positions  or  improve  their  chances  of 
being  heard.  And  finally,  deliberative  decision¬ 
making  procedures  are  broadly  inclusive,  so 
minorities  may  hope  to  affect  future  outcomes. 

The  condition  of  publicity  ensures  that  deci¬ 
sions  are  more  likely  to  be  made  on  the  basis  of  all 
relevant  perspectives,  interests,  and  information 
and  that  they  are  less  likely  to  exclude  legitimate 
interests,  relevant  knowledge,  or  dissenting  opin¬ 
ions.  When  proposals  are  subjected  to  a  wide 
range  of  possible  alternative  opinions  in  a  forum 
that  is  open  to  all  societal  interests  and  perspec¬ 
tives,  the  quality  of  reasons  for  decisions  is  there¬ 
fore  likely  to  improve.  Participants — and,  it  is 
hoped,  citizens  generally — should  be  able  to  accept 
decisions  reached  in  this  manner  as  reasonable 
unless  and  until  they  are  shown  to  be  otherwise  in 
future  rounds  of  deliberation. 

Deliberative  Policy  Making  Is  Inefficient 

Deliberative  policy  making  requires  additional 
resources,  increasing  administrative  costs.  Insofar 
as  decisions  reached  through  deliberation  can  be 
expected  to  enjoy  broader  public  support  than 
decisions  reached  by  nondeliberative  means,  they 
may  avoid  costly  and  time-consuming  legal  chal¬ 
lenges  and  delays  in  implementation.  At  the  very 
least,  deliberative  policy  making  provides  adminis¬ 
trators  with  better  information.  Adding  the  cost  of 
deliberation  to  an  already  complex  decision¬ 
making  process  may  improve  the  odds  that  a  given 
decision  will  survive  legal  scrutiny.  In  the  United 
States,  for  example,  most  Environmental  Protection 
Agency  regulations  face  court  challenges  at  some 
point  in  their  history.  Even  major  regulations, 
often  the  result  of  several  years  of  work,  do  not 
always  survive  legal  scrutiny.  More  and  better 
public  deliberation  in  rule  making  makes  such  fail¬ 
ures  less  likely  because  it  improves  the  odds  that  all 
relevant  perspectives,  interests,  and  relevant  objec¬ 
tions  will  be  considered  from  the  outset. 

Deliberative  Policy  Making  Cannot 
Accommodate  Interests 

Is  deliberative  policy  making  naive  about  the 
role  of  interests,  and  hence  of  negotiation  and  bar¬ 
gaining,  in  politics?  Deliberative  democracy  makes 
two  claims  about  interests,  neither  of  which  denies 


its  role  in  democratic  politics.  The  first  is  that 
interests  cannot  be  known  prior  to  public  delibera¬ 
tion.  On  this  view,  interests  are  not  fixed  or  given 
but  subject  to  discovery  and  transformation 
through  public  debate  about  joint  problems  and 
conflicts.  In  the  absence  of  public  deliberation, 
therefore,  interests  cannot  be  adequately  taken 
into  account  in  decision  making.  The  second  claim 
is  that  policy  proposals  cannot  be  defended  merely 
on  the  basis  of  self-interest  because  such  a  defense 
is  unlikely  to  survive  deliberative  scrutiny. 
Participants  will  be  suspicious  of  proposals  that 
serve  selfish  purposes  but  are  falsely  advanced  in 
the  name  of  public  interests.  There  is  no  denying  in 
deliberative  policy  making  that  politics  involves 
interests;  rather,  it  encourages  their  disclosure  and 
sometimes  their  transformation. 
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Democracy,  Consociational 


Consociational  (from  Latin,  consociatio — close 
political  and  social  interlinkage)  democracy  denotes 
a  democratic  political  order  in  which  political  deci¬ 
sions  are  reached  mainly  by  negotiations  between 
political  elites.  Majority  rule  is  not  the  dominant 
technique  in  decision  making.  Rather,  political 
actors  seek  to  find  broad  compromise  based  on 
amicable  arrangement.  Switzerland  and  the 
Netherlands  are  frequently  cited  as  examples  of 
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consociational  democracy.  There  is  a  variety  of 
conceptual  definitions  of  consociational  democ¬ 
racy.  However,  consensus  exists  about  the  anti¬ 
pode.  This  is  a  majoritarian  democracy  in  which 
elites  eschew  negotiations  and  compromise  as  soon 
they  have  a  plurality  of  50%  of  the  vote  plus  one. 
The  best  known  empirical  illustration  is  the  politi¬ 
cal  system  of  the  United  Kingdom.  Therefore,  the 
antipode  to  consociational  democracy  is  frequently 
labeled  “Westminster  democracy”  since  the  British 
parliament  resides  in  the  Palace  of  Westminster. 
The  political  system  of  postwar  New  Zealand 
comes  even  closer  to  the  ideal  type  of  a  majoritar¬ 
ian  democracy. 

The  concept  of  consociational  democracy  is 
important  for  the  comparative  analysis  of  political 
systems  since  it  emphasizes  variations  between 
forms  of  democratic  government.  This  is  in  stark 
contrast  to  the  traditional  argument  that  English- 
speaking  countries  have  a  homogeneous  political 
culture — that  is,  the  actors  share  general  orienta¬ 
tions  in  the  political  system.  There  is  consensus 
with  regard  to  general  political  ends  and  means. 
This  is  a  prerequisite  of  a  stable  democratic  order. 
The  more  homogeneous  a  political  culture  is,  the 
higher  the  stability  of  democracy,  with  the  Anglo- 
Saxon  countries  being  particularly  homogeneous. 
In  a  nutshell,  there  is  one  form  of  democratic  gov¬ 
ernment,  and  the  Anglo-Saxon  countries  are  the 
best  examples.  In  contrast,  Continental  European 
countries  have  a  heterogeneous  political  culture. 
Subgroups  such  as  Catholic  groups,  workers’ 
movements,  and  the  middle  classes  differ  consider¬ 
ably  in  ideology  and  strategy.  These  fundamental 
differences  are  the  germ  of  system  instability. 

However,  postwar  history  demonstrated  that, 
in  deeply  divided  European  societies,  heterogeneity 
in  political  culture  and  a  stable  democracy  are  not 
mutually  exclusive.  The  concept  and  underlying 
causal  hypotheses  of  consociational  democracy 
provided  explanations  for  the  political  stability  in 
segmented  societies.  Obviously,  there  are  different 
routes  to  the  Rome  of  democracy  and  consocia¬ 
tional  democracy  is  one  of  these  variants. 

Varieties  of  Consociational  Democracies 

The  term  consociational  democracy  entered  the  sci¬ 
entific  debate  in  1961,  when  it  was  used  to  describe 
African  political  systems  that  accommodated  by 


compromises  a  large  number  of  groups  with  diver¬ 
gent  ideologies  to  achieve  a  common  goal.  Arend 
Lijphart  and  Gerhard  Lehmbruch  applied  the  term 
to  Western  European  countries  at  the  end  of  the 
1960s.  In  his  book  about  the  Dutch  political  sys¬ 
tem,  Lijphart  argued  that,  in  a  critical  historical 
period  during  World  War  I,  the  leaders  of  the 
Dutch  societal  segments  agreed  on  pragmatic  solu¬ 
tions  to  deep  conflicts  over  the  relation  of  state  and 
churches  in  education  and  over  the  electoral  for¬ 
mula.  This  pacificatie  (peaceful  settlement)  of 
1917  had  three  major  characteristics:  (1)  the  elites 
negotiated  on  behalf  of  their  segments  without  any 
extensive  consultation  with  their  constituencies, 
(2)  elites  of  all  segments  participated  in  the  agree¬ 
ment,  and  (3)  the  principle  of  proportionality  in 
the  substance  of  the  settlement  was  of  utmost 
importance  (Lijphart,  1968).  A  deliberate  decision 
of  the  elites  to  cooperate  as  the  best  response  strat¬ 
egy  underlay  this  pacificatie.  Once  practiced,  this 
mode  of  top-level  compromise  became  institution¬ 
alized  in  the  Netherlands.  The  basic  sociocultural 
prerequisite  of  this  system  was  the  pillarization 
(verzuiling )  of  society.  Individuals  belong  to  clearly 
separated  societal  pillars — that  is,  sociocultural 
subgroups — and  only  the  elites  on  the  top  of  these 
pillars  had  contact  with  and  negotiated  with  elites 
from  other  pillars.  Operational  criteria  for  the 
classification  as  a  consociational  democracy  are 
grand  coalitions,  cultural  autonomy  of  subgroups, 
proportionality,  and  minority  voting  (Lijphart, 
2008). 

Lehmbruch  (1967)  analyzed  the  pattern  of  con¬ 
flict  management  in  Austria  and  Switzerland.  In 
Switzerland,  the  important  conflicts  were  those 
that  emerged  between  groups  that  are  regionally 
concentrated:  cantons  with  Catholic  or  Protestant 
orientation,  agricultural  or  industrial  structures,  or 
German-  or  French-speaking  populations.  Hence 
bargaining  between  regional  elites  was  the  domi¬ 
nant  pattern  of  conflict  regulation.  In  contrast, 
Austria  resembled  the  Netherlands  in  the  sense  that 
the  conflicts  existed  between  sociocultural  groups — 
Catholic- Conservative  versus  Socialist  camps — 
without  a  regional  identification.  Lehmbruch  used 
the  Austrian  term  of  Proporzdemokratie  (democ¬ 
racy  based  on  proportionality)  for  consociational 
democracy;  he  also  pointed  to  the  Swiss  term  of 
Konkordanzdemokratie  (democracy  based  on  con¬ 
cordance).  Hence,  consociational  democracy, 
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Konkordanzdemokratie,  or  Proporzdemokratie 
can  be  used  synonymously  to  describe  a  form  of 
government  in  which  the  majority  rule  is  not  the 
dominant  technique  of  decision  making.  The  under¬ 
lying  reason  is  a  deep  sociocultural  or  political  divi¬ 
sion  of  the  country  that  does  not  allow  for  a  system 
of  alternating  majorities.  Either  there  is  no  group 
that  could  be  a  majority  or  the  minority  groups  are 
very  suspicious  of  the  other  groups  and  see  little 
chance  of  becoming  a  majority  sometime  in  the 
future. 

Obviously,  consociational  democracy  is  hard 
to  define  in  operational  terms.  This  may  be  one 
reason  for  the  development  of  a  new  concept — 
consensus  democracy — which  is  described  as  a 
system  of  power  sharing.  Lijphart  (2008)  points 
to  four  major  differences  between  consociational 
and  consensus  democracy.  The  first  difference  is 
that  consociational  democracy  denotes  the  peace¬ 
ful  system  of  conflict  regulation  in  deeply  divided 
societies.  The  term  condenses  the  main  findings 
from  the  descriptive  analysis  of  these  political 
systems.  In  contrast,  consensus  democracy  started 
from  the  ideal  type  of  a  majoritarian  democracy, 
enumerating  all  the  major  characteristics  and 
then  defining  the  nonmajoritarian  as  a  contrast. 
This  led  to  the  following  list  of  variables.  It  can 
be  subdivided  into  the  two  dimensions  of  “Parties- 
Executives”  and  “Federal-Unitary.”  These 
dimensions  are,  however,  only  weakly  correlated 
empirically  (Lijphart,  1999). 

Parties-Executives 

1.  Concentration  of  executive  power  in  single¬ 
party  majority  cabinets  versus  executive  power¬ 
sharing  in  broad  multiparty  coalitions. 

2.  Executive-legislative  relationships  in  which  the 
executive  is  dominant  versus  executive- 
legislative  balance  of  power. 

3.  Two-party  versus  multiparty  systems. 

4.  Majoritarian  and  disproportional  electoral 
systems  versus  proportional  representation. 

5.  Pluralist  interest  group  systems  with  free-for- 
all  competition  among  groups  versus 
coordinated  and  “corporatist”  interest  group 
systems  aimed  at  compromise  and  concertation. 
(pp.  3-4) 


Federal-Unitary 

1.  Unitary  and  centralized  government  versus 
federal  and  decentralized  government. 

2.  Concentration  of  legislative  power  in 
unicameral  legislature  versus  division  of 
legislative  power  between  two  equally  strong 
but  differently  constituted  houses. 

3.  Flexible  constitutions  that  can  be  amended  by 
simple  majority  versus  rigid  constitutions  that 
can  be  changed  only  by  extraordinary 
majorities. 

4.  Systems  in  which  legislatures  have  the  final 
word  on  the  constitutionality  of  their  own 
legislation  versus  systems  in  which  laws  are 
subject  to  judicial  review  of  their 
constitutionality  by  supreme  or  constitutional 
courts. 

5.  Central  banks  that  are  dependent  on  the 
executive  versus  independent  central  banks. 

(pp.  3-4) 

The  second  difference  concerns  the  principles  of 
consociationalism,  which  are  broader  than  those 
of  consensus  democracy.  For  instance,  in  the  con¬ 
sociational  framework,  grand  coalition  could 
mean  more  than  just  a  multiparty  coalition;  grand 
coalition  could  be  created  informally  by  advisory 
arrangements  or  by  alternating  presidencies.  This 
is  connected  to  the  third  difference:  Consensus 
democracy  tends  to  emphasize  formal  institutional 
devices  while  consociational  democracy  relates 
mainly  to  informal  practices.  Finally,  consocia¬ 
tional  democracy  is  more  demanding  in  the  sense 
that  group  autonomy  or  inclusion  of  all  relevant 
groups  is  required  while  consensus  democracy  pro¬ 
vides  incentives  for  cooperative  behavior. 

To  avoid  the  normative  undertones  and  evalua¬ 
tions  of  the  term  consensus  democracy,  Andre 
Kaiser  (1997)  suggests  the  term  negotiation  democ¬ 
racy.  This  seems  to  be  particularly  appropriate  since 
consensus  democracy  combines  basically  three  dif¬ 
ferent  aspects  of  coordination:  between  political 
elites  (which  is  the  sole  focus  of  consociational 
democracy),  between  governments  and  interest 
groups  (which  is  the  focus  of  the  debate  on  corpo¬ 
ratism  versus  pluralist  interest  representation),  and 
between  veto  points  that  may  have  competitive  or 
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collective  orientations.  According  to  Vicki  Birchfeld 
and  Marcus  Crepaz  (1998),  competitive  veto 
points  occur  when  different  political  actors  oper¬ 
ate  through  separate  institutions  with  mutual  veto 
powers,  as  in  federalism  and  strong  bicameralism. 
Collective  veto  points  emerge  from  institutions 
where  the  different  political  actors  operate  in  the 
same  body  and  whose  members  interact  with  each 
other  on  a  face-to-face  basis,  as  in  proportional 
electoral  systems  and  multiparty  governments. 

Genesis  and  Determinants 

The  literature  distinguishes  between  at  least  three 
causal  explanations  of  consociational  democracy. 
The  institutional  explanation  points  to  incentives 
and  constraints  produced  by  institutions,  such  as 
electoral  laws  or  direct  democracy.  Electoral  rules 
allowing  for  proportional  representation  tend  to 
lead  to  multiparty  systems.  The  need  to  negotiate 
and  to  build  coalitions  is  larger  in  such  party  sys¬ 
tems  since  there  is  no  clear  majority  party  that  is 
able  to  dominate  all  the  other  parties.  Likewise, 
direct  democracy  creates  strong  incentives  for 
elites  to  cooperate  and  to  build  compromises.  If 
they  try  to  push  through  their  goals,  they  risk  the 
possibility  that  neglected  groups  may  trigger  a 
popular  vote  and  that  the  policy  of  the  majority 
group  in  parliament  or  government  may  be  termi¬ 
nated  by  a  popular  vote.  This  is  one  reason  why 
Switzerland  embarked  on  conflict  regulation  by 
negotiation  and  inclusion  of  all  groups  that  are 
capable  of  triggering  a  popular  vote. 

A  second  explanation  relates  to  deliberate  deci¬ 
sions  of  the  elites  (Lijphart,  1968).  According  to 
this  explanation,  in  a  critical  historical  situation, 
elites  consider  their  options.  They  then  take  a 
deliberate  decision  for  compromise  solutions — and 
waive  the  option  of  conflictual  strategies — based 
on  either  careful  cost-benefit  analyses  or  orienta¬ 
tion  to  the  common  good.  Once  the  basic  decision 
for  negotiation  is  taken,  it  becomes  institutional¬ 
ized.  Lijphart  (2008,  pp.  51-52)  identified  nine 
facilitating  conditions  that  make  this  decision 
more  likely: 

1.  the  absence  of  a  solid  majority  that  prefers  pure 
majority  rule  to  consociationalism; 

2.  the  absence  of  large  socioeconomic  differences 
between  the  subgroups  of  the  society; 


3.  a  moderate  number  of  groups,  so  that 
negotiation  is  a  feasible  option  and  is  not  made 
too  difficult  and  too  complex; 

4.  the  groups  having  roughly  the  same  size; 

5.  a  relatively  small  population — such  as  in  the 
classical  cases  of  European  consociational 
democracy  (Austria,  Belgium,  Luxembourg,  the 
Netherlands,  and  Switzerland); 

6.  an  external  economic  or  military  threat  that 
promotes  internal  unity; 

7.  overarching  loyalties,  such  as  a  common 
national  identity; 

8.  federalism  as  a  means  to  foster  group  autonomy 
(if  groups  are  regionally  concentrated),  which  is 
conducive  to  consociationalism;  and 

9.  traditions  of  compromise  and  accommodation. 

Controlling  for  these  favorable  conditions,  the 
core  argument  of  this  explanation  is  a  voluntaristic, 
deliberate  choice  by  elites.  Hence,  in  principle,  the 
option  of  consociationalism  exists  in  all  divided 
societies;  it  just  depends  on  elites  who  are  convinced 
of  this  type  of  conflict  regulation. 

A  third  explanation  emphasizes  the  role  of  his¬ 
tory  and  learning.  Consociational  democracy  is 
more  likely  when  elites  are  used  to  compromise 
and  negotiate.  This  tradition  amplifies  learning 
processes.  Elites  become  socialized  in  systems  of 
negotiations,  and  they  learn  that  negotiation  and 
accommodation  are  appropriate  techniques  of 
decision  making.  Alternative  conflictual  decision 
modes  are  normatively  inferior,  less  efficient,  and 
more  costly  in  several  respects.  Lehmbruch  has 
radicalized  this  explanation  by  arguing  that  the 
consociational  and  corporatist  European  systems 
can  be  traced  back  to  the  religious  peace  treaties  of 
the  17th  and  18th  centuries. 

Impacts  and  Consequences  of 
Consociational  Democracy 

Consociational  and  consensus  democracy  have  a 
mixed  balance  of  impacts  and  consequences. 
Manfred  Schmidt  (2008,  pp.  313-316)  identified 
six  deficits: 

1.  Consociational  techniques  depend  on  top-level 
negotiations  and  compromise  between  elites. 
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Most  of  this  accommodation  is  done  behind 
closed  doors;  otherwise,  politicians  will  not 
have  enough  flexibility  given  the  constraints  of 
electoral  competition.  By  implication,  voters 
cannot  check  whether  their  representatives  stay 
within  the  corridors  of  policy  choices  that 
conform  to  the  preferences  of  the  electorate. 
Since  the  compromises  are  considered  to  be  a 
result  of  quid  pro  quo,  it  is  hard  to  assign 
responsibility  to  particular  political  groups  and 
parties. 

2.  These  compromises  require  that  elites  can 
decide  on  behalf  of  their  constituencies  and  that 
these  constituencies  follow  passively  the  leaders 
of  their  sociocultural  group.  Politically 
interested  and  active  individual  citizens  are  a 
disturbance  variable  in  consociational 
democracies. 

3.  Minority  groups  have  the  right  to  veto  and  can 
obstruct  solutions  that  are  in  the  interest  of  the 
overwhelming  majority  of  the  nation.  While 
competitive  democracy  has  to  face  the  danger  of 
the  tyranny  of  the  majority,  consociational  and 
consensus  democracies  may  suffer  from  the 
tyranny  of  the  minority. 

4.  Policy  decisions  in  consociational  democracies 
need  considerable  time  due  to  processes  of 
inclusion  and  negotiation.  Hence,  these  forms  of 
government  may  appear  particularly 
disadvantageous  if  the  political  system  has  to 
respond  quickly  to  threats,  such  as  an  imminent 
economic  crisis. 

5.  Negotiations  and  compromise  imply  that  policy 
decisions  have  a  considerable  likelihood  of 
being  incoherent  and  hence  less  efficient  than 
policy  packages  that  are  shaped  by  a  single 
political  goal  or  ideology. 

6.  This  system  of  compromise  is  vulnerable  to 
populist  movements  on  the  right  and  the  left,  and 
it  may  frustrate  citizens  in  their  political  activity. 

These  disadvantages  used  to  be  considered  as  the 
price  for  a  stable  democracy  in  deeply  divided  soci¬ 
eties.  However,  recent  empirical  research  found 
other  benefits  and  evidence  that  the  disadvantages 
of  consociatonal  systems  are  much  lower  than  sug¬ 
gested  by  the  critics.  In  addition  to  political  stability, 
consociational  democracy  is  inclusive,  giving  smaller 


groups  (or  their  representatives)  a  much  better  say 
in  the  political  process.  This  may  outweigh  some  of 
the  democratic  deficits  of  consociationalism.  In 
addition,  since  the  policy  packages  are  based  on 
compromise,  implementation  of  policies  may  be 
quicker  and  more  efficient  than  in  competitive 
democracies.  Hence,  consensus  systems  may  make 
up  some  of  the  loss  in  time  and  efficiency  during  the 
phase  of  policy  formation.  Finally,  consensus 
democracies  tend  to  be  less  violent,  economically 
superior,  and  “kinder  and  gentler”  compared  with 
majoritarian  democracies.  In  consensus  systems, 
citizens  are  politically  more  satisfied;  the  govern¬ 
ment  is  more  redistributive;  strike  activity,  unem¬ 
ployment,  inflation,  and  inequality  are  less;  political 
activity  and  participation  are  higher;  and  policies 
are  more  “woman  friendly.”  Some  of  these  findings 
are  controversial;  however,  there  is  strong  evidence 
that  the  political  and  economic  achievements  of 
consociational  democracies  are  not  inferior  to  those 
of  majoritarian  democracies. 

Criticism 

Criticism  of  consociational  democracy  pointed  to 
conceptual  and  empirical  deficits.  While  the  litera¬ 
ture  on  consociationalism  argued  that  this  decision 
mode  allows  for  a  stable  political  order  in  deeply 
divided  societies,  Ian  Lustick  (1979)  developed  a 
control  approach  that  focuses  “on  the  emergence 
and  maintenance  of  a  relationship  in  which  the 
superior  power  of  one  segment  is  mobilized  to 
enforce  stability  by  constraining  the  political 
actions  and  opportunities  of  another  segment” 
(p.  328).  Brian  Barry  (1975)  detected  a  tautological 
argument  in  Lijphart’s  work  in  that  consociational¬ 
ism  was  described  as  a  deliberate  choice  of  elites, 
since  for  this  choice  elites  need  to  compromise  from 
the  very  beginning.  Other  authors  emphasized  that 
separated  societal  segments  as  the  sociocultural 
precondition  of  consociational  democracies  started 
to  decline  exactly  when  consociational  techniques 
were  strongly  used  in  the  1950s  and  1960s. 
Finally,  critics  identified  problems  in  mixing  nor¬ 
mative  and  empirical  typologies,  noting  that 
Lijphart’s  list  of  favorable  conditions  for  consocia- 
tionalism  changed  since  the  first  publications 
about  consociationalism. 

Rein  Taagepera  (2003)  argues  that  Lijphart’s 
indicators  of  the  first  dimension  of  consensus 
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democracy  are  mainly  output  variables,  which  are 
not  directly  amenable  to  institutional  design, 
while  the  indicators  of  the  second  dimension  are 
based  on  expert  judgments  and  measured  mostly 
on  a  nominal  level.  Likewise,  Steffen  Ganghof 
(2005)  states  that  Lijphart’s  concepts  refer  to 
institutions,  while  many  of  his  indices  relate  to 
observed  behavior  such  as  coalition  building  or 
social  partnership. 

Two  criticisms  have  strong  implications  for  the 
political  importance  of  consociational  democracy. 
Lijphart  suggests  that  elites  have  an  option  for 
consociational  democracy  and  it  is  up  to  them  to 
decide  in  favor  or  against.  Based  on  this  assump¬ 
tion,  he  recommends  to  politicians  in  deeply 
divided  societies  to  adopt  the  consensus  or,  even 
better,  the  consociational  system.  Already  early  in 
the  debate,  this  voluntaristic  approach  was  ques¬ 
tioned  by  Lehmbruch.  If  consociationalism  depends 
on  learning  processes,  long-term  elite  socialization, 
and  supporting  institutions,  it  cannot  be  inserted 
successfully  from  outside  into  a  political  system 
that  lacks  these  enabling  conditions. 

The  second  problem  may  be  the  decline  of  the 
prerequisite  of  any  system  of  compromise  in  seg¬ 
mented  societies:  Individuals  are  integrated  into 
sociocultural  segments,  and  these  segments  have 
legitimate  political  representatives.  The  processes 
of  individualization  and  modernization  tend  to 
undermine  this  precondition.  Increasingly — even  in 
formerly  strongly  segmented  societies — individuals 
do  not  build  stable  links  to  groups  and  political 
organizations.  Hence,  the  sociocultural  founda¬ 
tions  of  consociationalism  vanish,  endangering  the 
long-term  stability  of  consensus  institutions.  A  lim¬ 
ited  decline  of  consociational  democracy  has  been 
observed  in  many  countries  (see  Adrian  Vatter, 
2008).  On  the  other  hand,  one  of  the  preconditions 
of  an  ideal  type  of  majoritarian  democracy  suffers 
from  the  same  modernization  processes.  Modern 
societies  do  not  consist  any  longer  of  two  groups  of 
similar  size  whose  political  representatives  alternate 
in  government.  Rather,  political  integration  in  large 
groups  declines,  and  democracies  may  benefit  from 
institutions  of  negotiation  that  are  able  to  include 
these  numerous  small  groups  of  modern  society, 
none  of  which  is  able  to  produce  a  political 
majority.  The  obvious  problem  seems  to  be  that 
these  societies  may  be  able  to  solve  their  problems 
with  consociational  techniques;  however,  they 


lack  the  sociocultural  preconditions  to  use  these 
techniques. 
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Direct  democracy  means  forms  of  direct  participa¬ 
tion  of  citizens  in  democratic  decision  making  in 
contrast  to  indirect  or  representative  democracy, 
based  on  the  sovereignty  of  the  people.  This  can 
happen  in  the  form  of  an  assembly  democracy  or 
by  initiative  and  referendum  with  ballot  voting, 
with  direct  voting  on  issues  instead  of  for  candi¬ 
dates  or  parties.  Sometimes  the  term  is  also  used 
for  electing  representatives  in  a  direct  vote  as 
opposed  to  indirect  elections  (by  voting  for  an 
electing  body,  electoral  college,  etc.),  as  well  as  for 
recalling  elected  officeholders.  Direct  democracy 
may  be  understood  as  a  full-scale  system  of  politi¬ 
cal  institutions,  but  in  modern  times,  it  means 
most  often  specific  decision-making  institutions  in 
the  broader  system  environment  of  representative 
democracy.  The  following  sections  (a)  introduce 
some  historical  background  and  theoretical  ideas, 
(b)  elaborate  on  various  forms  of  direct  democ¬ 
racy,  (c)  describe  regulations  and  use  of  direct 
democracy  in  some  important  countries,  and 
(d)  present  main  issues  and  controversies. 

History  and  Theoretical  Context 

The  most  important  historical  reference  of  direct 
democracy  is  to  assembly  democracy  in  ancient 
Greek  city-states,  particularly  Athens,  where 


decisions  were  taken  by  people’s  assemblies  of 
some  1,000  male  citizens.  Later,  people’s  assem¬ 
blies  were  used  in  many  Swiss  cantons  and  towns, 
as  well  as  in  town  meetings  in  some  American 
colonies  and  states.  Early  U.S.  states  also  started 
using  procedures  in  which  constitutions  or  con¬ 
stitutional  amendments  were  ratified  by  referen- 
dums,  which  later  became  common  in  the  United 
States.  Popular  sovereignty,  proclaimed  in  the 
French  Revolution,  had  rather  been  distorted, 
however,  in  Napoleon’s  autocratic  plebiscites. 
Switzerland  and  many  U.S.  states  incorporated 
direct  democracy  in  their  constitutions  during  the 
19th  century,  while  Germany  and  few  other 
countries  adopted  some  elements  after  World 
War  I.  In  a  more  general  perspective,  the  ensuing 
introduction  or  practical  use  of  direct-democratic 
institutions  originated  from  three  major  types  of 
developments: 

1.  social  class  conflict  to  curb  the  political  power 
of  a  dominating  oligarchy  (e.g.,  Switzerland, 

U.S.  states); 

2.  processes  toward  political/territorial  autonomy 
or  independence  for  legitimizing  and  integrating 
the  new  state  unit  (beginning  after  World  War 
I);  and 

3.  processes  of  democratic  transformation  from 
authoritarian  rule  (e.g.,  Germany’s  regional 
states  after  1945,  some  Latin  American 
countries). 

Some  countries  show  gradual  reform  developments 
(e.g.,  Uruguay). 

Modern  democracy  most  often  developed  not 
from  the  starting  point  of  assembly  democracy 
but,  under  absolutist  or  feudal  conditions,  from 
people  gradually  claiming  a  larger  share  of 
political  representation  and  extension  of  repre¬ 
sentative  voting  rights.  Constitutions,  civil 
rights,  and  universal  suffrage,  which  had  been 
achieved  in  European  and  many  other  countries 
(generally  by  the  end  of  World  War  I),  were 
usually  identified  with  “democracy”  on  the  nor¬ 
mative  basis  of  the  principles  of  popular  sover¬ 
eignty,  freedom,  and  political  equality.  Thus,  in 
many  countries  and  theories,  these  principles 
have  been  tied  to  and  absorbed  by  a  narrow 
notion  of  representative  democracy  rather  than 


560  Democracy,  Direct 


being  used  to  support  a  more  comprehensive 
concept  of  democracy. 

Normative  theory  of  direct  democracy  still  rests 
basically  on  popular  sovereignty,  freedom,  and 
political  equality,  with  Jean-Jacques  Rousseau  as  the 
outstanding  theorist  of  unanimous  consent  of  the 
people  for  a  free  republican  constitution  and  subse¬ 
quent  forms  of  participation.  During  the  19th  cen¬ 
tury,  these  principles  were  increasingly  challenged, 
or  they  were  deprived  of  their  substance  beyond 
representative  institutions.  So,  in  many  countries, 
direct-democracy  institutions  have  not  been  estab¬ 
lished  or  implemented  since  representative  elites 
developed  a  strong  interest  in  monopolizing  power. 
In  addition,  pragmatic  theories  contended  that 
direct  democracy  could  not  work  under  space  and 
time  conditions  of  large  modern  states. 

With  this  background  of  historical  and  theo¬ 
retical  restrictions,  the  normative  theory  of  direct 
democracy  cannot  exclusively  rest  on  popular 
sovereignty,  which  is  also  claimed  by  representa¬ 
tive  democracy.  More  specific  arguments  origi¬ 
nate  from  the  participatory  theory  of  democracy 
and  the  critique  of  a  lack  of  responsiveness  and 
legitimacy  of  representative  (party)  democracy. 
The  two  sets  of  democratic  institutions  are  dis¬ 
tinguished  by  basic  features  of  direct  participa¬ 
tion:  (1)  direct  democracy  focuses  on  specific 
issues ,  in  contrast  to  voting  on  candidates  and 
general  programs  for  long  terms  of  office,  and 
(2)  citizens  themselves  act  as  decision  makers 
rather  than  delegating  these  powers.  Like  electoral 
systems,  a  variety  of  procedural  forms,  designs, 
and  regulations  are  likely  to  influence  processes 
and  outcome.  One  must  also  keep  in  mind  that 
direct-democratic  processes  cannot  operate  in  iso¬ 
lation  but  are  always  linked  to  the  structures  of  an 
overall  political  system  that  includes  major  repre¬ 
sentative  institutions.  Thus,  interactions  between 
the  two  types  of  institutions  will  be  an  important 
challenge  for  analysis.  For  instance,  as  George 
Tsebelis  notes,  referendum  voters  can  be  seen  as 
an  additional  veto  player.  Some  authors  contend 
that  direct  democracy  may  undermine  representa¬ 
tive  democracy,  while  others  focus  on  the  deliber¬ 
ative  functions  for  a  democratic  public  sphere  and 
the  capacity  for  integrating  citizens  in  the  demo¬ 
cratic  process.  One  can  also  assume  that  basic 
types  or  forms  of  direct-democracy  procedures 
may  result  in  different  consequences. 


Variety  of  Forms  and  System  Environment 

Direct  democracy  comes  in  a  variety  of  institu¬ 
tional  forms,  with  the  common  feature  of  proce¬ 
dures  focusing  on  popular  votes  on  political  issues. 
Their  main  forms  can  be  distinguished  by  the 
actors  who  start  the  procedure:  Mandatory  refer- 
endums  have  to  be  held  when  a  referendum  vote  is 
required  by  law  (e.g.,  a  constitution)  for  deciding  a 
specific  subject.  Referendums  of  governmental 
authorities  take  place  when  a  president,  cabinet,  or 
legislature  decides,  under  preregulated  conditions 
or  ad  hoc,  to  call  a  popular  vote  on  a  particular 
issue.  Sometimes,  a  minority  of  a  legislature  also  is 
entitled  to  demand  such  a  vote.  Citizens’  initiatives 
that  are  supported  by  a  required  number  of  signa¬ 
tures  allow  the  electorate  to  vote  on  political  mea¬ 
sures  proposed  by  a  group,  on  legislative  acts  by  a 
parliament  not  yet  in  force,  or  on  existing  laws 
(citizen-demanded  referendums).  A  popular  vote 
may  be  binding  according  to  the  simple  or  specific 
majority  or  turnout  requirements  for  a  valid  vote 
or  may  be  defined  as  only  consultative  or  advisory. 

Some  jurisdictions  provide  an  agenda  initiative 
that  allows  citizens  with  the  support  of  a  minimum 
number  of  signatures  to  place  a  particular  issue  on 
the  agenda  of  a  government  or  legislative  author¬ 
ity.  Such  proposals  have  to  be  considered  by  the 
authority  addressed,  but  they  do  not  lead  to  a  ref¬ 
erendum  vote. 

There  is  some  ambiguity  and  controversy  as  to 
whether  procedures  with  a  focus  on  directly  elect¬ 
ing  or  recalling  holders  of  public  office  (executive 
positions,  legislators)  may  be  meaningfully  included 
in  the  concept  of  direct  democracy.  These  proce¬ 
dures  refer,  in  fact,  to  the  institutional  system  of 
representative  democracy  and  its  typical  processes 
and,  therefore,  are  not  at  the  core  of  debates  on 
direct  democracy.  However,  there  may  be  some 
differences  in  the  degree  to  which  voters  have  a 
direct  influence  on  the  final  outcome  of  an  electoral 
procedure  (e.g.,  fixed  or  flexible  list  of  candidates, 
direct  vote,  or  vote  for  members  of  an  intermediate 
body).  In  recall  procedures,  interrupting  routine 
patterns  of  fixed  office  terms  may  stress  the  aspect 
of  citizens  reclaiming  control  of  office  functions.  In 
practice,  recall  options  of  executive  office  holders 
are  much  more  common  than  of  members  of  legis¬ 
lative  bodies  or  of  complete  legislatures. 

Procedural  types  of  direct  democracy  should  be 
distinguished  according  to  the  main  initiating 
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actor  of  a  procedure  because  they  typically  show 
different  features  regarding  the  agenda  setter,  the 
contents  and  wording  of  the  proposal,  the  function 
of  the  ballot  vote  in  terms  of  legitimation,  innova¬ 
tion,  and  so  on. 

Governmental  authorities  initiating  a  referen¬ 
dum  vote  generally  seek  legitimation  for  policies 
on  the  government  agenda,  will  regularly  advocate 
an  affirmative  vote,  and  will  have  many  ways  of 
influencing  process  and  outcome,  including  official 
communication  resources.  Therefore,  the  term 
plebiscite  is  often  used,  even  more  so  when  they 
are  employed  by  autocratic  or  dictatorial  regimes 
that  cannot  be  called  democratic  at  all. 

Mandatory  referendums  also  very  often  origi¬ 
nate  from  governmental  authorities  entitled  to 
bring  forward  proposals  for  which  ratification  by 
a  referendum  vote  is  required,  particularly  in  the 
case  of  constitutional  amendments  or  matters  of 
state  sovereignty,  territory,  or  identity.  Thus, 
whereas  a  popular  vote  on  such  specific  subjects  is 
required  by  law,  the  agenda  and  the  substance  of 
the  referendum  proposal  are  most  often  deter¬ 
mined  by  governmental  authorities.  In  some  juris¬ 
dictions,  however,  specific  issues,  again  like  consti¬ 
tutional  amendments,  may  also  be  proposed  by 
citizens’  initiative  and  lead  to  a  mandatory  ballot 
vote  (Switzerland,  the  United  States,  or  German 
states). 

In  citizens’  initiative  procedures,  the  agenda  for 
issues  and  the  proposals  generally  originate  “bot¬ 
tom  up”  from  some  opposition  or  civil  society 
groups  that  demand  new  political  measures  or 
legislation  (law-promoting  initiative)  or  object  to  a 
particular  government  project  or  legislative  act 
(law-controlling  initiative).  In  such  a  setting,  the 
political  initiative  comes  from  social  or  minority 
forces,  whereas  governmental  authorities  are  likely 
to  be  in  a  defensive  position  and  want  to  defeat  the 
proposals  in  a  referendum  vote. 

Except  for  ad  hoc  referendum  calls  by  govern¬ 
mental  authorities,  procedures  of  direct  democ¬ 
racy,  particularly  citizen-initiated  procedures,  are 
regulated  in  various  aspects.  The  area  of  admissi¬ 
ble  subject  matters  may  be  very  restrictive,  the 
number  of  signatures  required  for  qualifying  an 
initiative  for  a  ballot  vote  may  range  from  about 
1  %  to  one  third  of  eligible  voters,  the  time  allowed 
for  collecting  signatures  may  be  very  short. 
Requirements  for  the  validity  of  a  popular  vote 


may  also  vary  from  a  majority  of  voters  to  quali¬ 
fied  or  double  majorities  or  to  specific  turnout 
quorums.  Usage  will  clearly  be  restrained  by  high 
initiating  or  validity  requirements  and  initiating 
actors  with  strong  resources  will  be  privileged.  Yet 
a  higher  level  of  approval  may  support  the  legiti¬ 
macy  of  a  vote. 

Countries  and  Developmental  Background 

Provisions  for  direct-democratic  instruments  as 
well  as  their  usage  are  distributed  rather  unevenly 
across  continents,  countries,  and  different  levels  of 
states. 

National  Level 

On  the  national  level,  procedures  and  usage  are 
most  frequent  in  Europe  and  Latin  America, 
whereas  in  Africa,  Asia,  or  North  America  their 
number  is  small.  Switzerland  traditionally  has 
the  most  elaborated  system  of  direct  democracy  at 
the  national,  cantonal,  and  municipal  levels.  On 
the  national  level,  mandatory  referendums  on  con¬ 
stitutional  amendments  were  introduced  in  1848, 
citizen-demanded  rejective  referendums  on  new 
legislation  of  parliament  in  1874,  citizens’  initia¬ 
tives  on  constitutional  amendments  in  1891,  and 
mandatory  referendums  on  major  international 
treaties  in  1921.  Up  to  2008,  222  mandatory  ref¬ 
erendums  on  constitutional  and  treaty  issues,  162 
rejective  (“facultative”)  referendums,  and  165  citi¬ 
zen-initiated  referendums  on  constitutional  amend¬ 
ments  have  been  held.  On  the  lower  levels,  even 
more  instruments  are  often  available,  such  as  the 
mandatory  financial  referendum  and  legislative 
citizens’  initiative.  More  than  one  of  the  instru¬ 
ments  of  direct  democracy  are  also  provided  for 
and  practiced  on  the  national  level  in  Uruguay 
and,  more  recently,  in  some  of  the  Eastern  European 
countries  such  as  Latvia,  Lithuania,  or  Hungary. 

On  the  national  level  in  most  other  countries, 
basically  one  specific  instrument  stands  out.  In 
Italy,  for  instance,  most  of  the  popular  votes  origi¬ 
nated  from  citizens’  initiatives  for  an  “abrogative 
referendum”  to  repeal  an  existing  law  or  parts 
thereof  (some  60  cases  since  1970).  In  Australia 
and  Ireland,  only  mandatory  referendums  on  con¬ 
stitutional  amendments  are  possible.  In  Austria, 
the  national  parliament  can  call  a  vote  (used  only 
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once).  In  the  Fifth  Republic  of  France,  the  presi¬ 
dent  has  the  right  to  call  a  referendum  on  impor¬ 
tant  matters  of  sovereignty  and  state  structure 
since  2008;  one  fifth  of  the  National  Assembly  in 
combination  with  10%  of  the  electorate  can  also 
do  so.  In  many  countries,  governmental  authorities 
call  referenda  not  on  a  preregulated  but  on  an  ad 
hoc  basis.  The  subjects  of  these  referenda  have 
included  accession  to  the  European  Economic 
Community  or  to  the  European  Union  (EU),  inde¬ 
pendence  or  national  unity  in  Canada,  and  ratifi¬ 
cation  of  a  new  constitution  or  of  an  agreement  for 
conflict  or  constitutional  settlement  in  Bolivia, 
Chile,  Kenya,  Russia,  Spain,  and  South  Africa. 

Regional  Level 

The  level  of  regional  states  within  federations 
also  offers  important  examples  of  direct  democ¬ 
racy.  In  particular,  about  half  of  the  states  in  the 
United  States  provide  citizens’  initiative  rights  and 
some  other  procedures  that  quite  often  allow  refer¬ 
endum  votes  on  public  finance.  In  many  of  these 
states,  including  Oregon,  California,  Colorado, 
and  North  Dakota,  citizens’  initiatives  (introduced 
before  1914)  are  frequently  used.  In  Germany,  all 
regional  states  (Lander)  have  introduced  citizens’ 
initiatives,  which,  on  the  average,  are  used  less 
often  due  to  more  restrictive  requirements.  In  the 
United  States  and  in  Germany,  initiatives  and  ref- 
erendums  are  available  and  frequently  used  on  the 
municipal  level  as  well.  In  some  countries,  includ¬ 
ing  the  Czech  Republic,  Japan,  Norway,  and 
Poland,  initiatives  and  referendums  are  only  pos¬ 
sible  at  the  municipal  level. 

Paths  to  Direct  Democracy 

In  systems  of  representative  democracy,  the  intro¬ 
duction  of  instruments  of  direct  democracy  is  not 
very  likely  since  they  are  regarded  as  undermining 
established  power  structures.  The  historical  origins 
of  direct  democracy  institutions  can  be  distin¬ 
guished  in  typical  paths: 

1.  antioligarchic  conflict,  where  initiative  and 
referendum  are  directed  against  economic- 
political  domination,  like  in  Switzerland  and  the 
progressive  movement  in  the  Western  states  of 
the  United  States  before  1914; 


2.  system  transformation  toward  democracy,  like 
in  post-Fascist  Italy  (implementation  delayed 
until  1970),  in  German  regional  states  after 
dictatorships  in  1945  and  1990,  or  in  Latin 
America  in  the  1980s  and  1990s; 

3.  state  independence,  like  in  Ireland  (1922/1937) 
or  in  Latvia,  Lithuania,  and  Slovenia  after 
Soviet  or  Yugoslav  domination. 

From  the  first  two  paths,  initiative  instruments 
with  lower  requirements  quite  often  emerged, 
whereas  national  independence  often  coincided 
with  requirements  stressing  majoritarian  unity. 
The  origins  of  referendums  called  by  governmental 
authorities  are  much  more  dispersed  since  they  are 
mainly  regarded  as  under  government  control  and, 
thus,  less  dangerous  to  the  governmental  system. 
Issues  of  state  independence,  but  also  accession  of 
states  to  supranational  organizations  such  as  the 
EU,  very  often  are  dealt  with  in  government- 
initiated  or  in  mandatory  referendums.  The  first 
instrument  of  direct  democracy  with  a  transna¬ 
tional  character  is  the  European  Citizens’  Initiative 
of  the  Lisbon  Treaty,  entered  into  force  on 
December  1,  2009  (Article  11.4),  an  agenda  initia¬ 
tive  that  allows  1  million  European  citizens  to 
propose  legislation  to  the  European  Commission 
(without  a  referendum). 

Issues  and  Controversies 

Discussions  on  direct  democracy  institutions  deal 
with  several  issues.  The  strongest  normative  grounds 
for  direct  democracy  are  the  democratic  principles 
of  popular  sovereignty,  political  equality,  and  all 
the  arguments  for  participative  democracy  that  sup¬ 
port  the  idea  that  all  citizens  should  have  the  right 
not  only  to  elect  representatives  but  also  to  vote  on 
policy  issues  in  referendums.  Since  assembly  democ¬ 
racy  cannot  be  an  option  in  modern  societies,  direct 
democracy  institutions  are  not  regarded  as  a  full- 
scale  system  alternative  to  representative  democ¬ 
racy  but  as  a  supplement  to  or  counterweight 
within  democratic  systems  with  major  representa¬ 
tive  features.  Nevertheless,  the  institutional  differ¬ 
ence  and  competition  between  representative  and 
direct  democratic  processes  lie  at  the  core  of  the 
controversy  whether  direct  democracy  contributes 
to  undermining  representative  democracy  or  can 
offer  enrichments  of  democracy. 
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In  general,  representative  democracy  is  often 
seen  as  superior  because  general  elections  give  citi¬ 
zens  an  encompassing  choice  between  alternative 
governments  and  complex  and  coherent  programs, 
because  governments  and  parliaments  have  greater 
capacity  for  informed  decisions  including  expert 
judgment,  and  because  representatives  can  be  held 
accountable  for  their  decisions.  Arguments  in  sup¬ 
port  of  direct  democratic  instruments  refer  to  vari¬ 
ous  aspects: 

1.  Direct  democratic  issue  voting  can,  during 
terms  of  office,  deal  with  issues  that  have  not  been 
discussed  at  general  elections.  Citizens’  initiatives 
in  particular  can  enrich  the  political  agenda  and, 
thus,  contribute  to  the  function  of  political  articu¬ 
lation  and  innovation.  The  range  of  political  actors 
tends  to  be  broader  than  already  present  in  the 
party  system. 

2.  Direct  democracy  also  offers  citizens  addi¬ 
tional  and  more  specific  instruments  of  political 
control  during  terms  of  office,  particularly  initia¬ 
tive  proposals  and  citizen-demanded  referendums 
to  reject  new  legislation  or  delete  existing  laws. 

3.  One  major  area  of  controversy  deals  with 
information,  competence,  and  the  quality  of  deci¬ 
sion  making.  While  representative  institutions  may 
indeed  hold  intense  deliberations  on  many  sub¬ 
jects,  direct-democratic  decision-making  processes 
can  also  provide  for  specific  issues  the  opportunity 
of  intense  and  widespread  public  debates,  during 
which  citizens  can  become  informed  about  contro¬ 
versial  value  and  factual  considerations.  Yet  as 
voters  are  often  described  as  badly  informed  and 
incompetent,  the  danger  of  manipulation  by 
resourceful  actors  (parties,  strong  interest  organi¬ 
zations,  corporations,  media)  is  a  major  issue. 
Design  and  regulations,  however,  can  make  a  dif¬ 
ference,  for  example,  comparing  Switzerland  and 
U.S.  states  such  as  California,  since  Switzerland 
does  not  allow  TV  advertising  in  referendum  cam¬ 
paigns.  Hanspeter  Kriesi  (2005)  and  Daniel  Smith 
and  Caroline  Tolbert  (2004)  especially  found  that, 
as  a  general  trend,  referendum  debates  and  cam¬ 
paigns  provide  a  major  potential  for  dissemination 
of  information  and  for  political  education. 
Important  factors  are  a  broader  field  of  political 
actors  in  such  campaigns  and  more  intense  com¬ 
munication  of  arguments  in  the  media. 


4.  How  voters  select  their  choice  in  referendum 
voting  attracted  two  rather  opposite  assumptions. 
According  to  one,  party  orientations  might  be  sim¬ 
ply  duplicated  in  issue  voting  behavior;  the  other 
one  contends  that  interest  groups,  media,  and  even 
“demagogues”  can  influence  voters  strongly.  The 
trend  in  the  literature  seems  to  be  that  the  less 
informed  parts  of  the  electorate  look  for  party  ori¬ 
entation,  while  voters  who  are  better  informed  and 
educated  may  take  a  more  independent  choice  by 
using  more  arguments  for  forming  opinions.  Thus, 
as  Ian  Budge  (1996)  notes,  direct  democracy  must 
not  be  unmediated,  since  governments  and  parties 
can  also  play  an  important  mediating  role.  This 
may  be  particularly  true  when  government  author¬ 
ities  initiate  a  referendum  vote  in  optional  or  man¬ 
datory  referendums. 

5.  One  promise  of  direct  democracy  is  that 
more  political  participation  can  be  realized.  This  is 
surely  the  case  since  more  opportunities  and  occa¬ 
sions  to  debate  policy  issues  and  to  vote  in  referen¬ 
dums  are  offered.  Nevertheless,  some  criticism 
remains  that  the  participation  goal  is  not  realized, 
particularly  for  social  groups  that  also  participate 
little  in  electoral  politics.  It  is  argued  that  in  refer¬ 
endums  turnout  is  often  lower  than  in  general  elec¬ 
tions  and  that  referendums  lead  to  a  lower  turnout 
in  elections  also.  Yet  this  cannot  be  generalized 
since  turnout  varies  significantly  dependent  on 
issues,  for  example,  in  Switzerland  from  30%  to 
around  80%  of  registered  voters.  In  addition, 
when  ballot  votes  regularly  do  not  take  place  in 
conjunction  with  general  elections  (like  in 
Switzerland),  turnout  variation  will  be  stronger 
than  when  referendum  votes  are  mostly  held  on 
election  days  (like  in  the  United  States).  Other 
shortcomings  of  extended  participation  of  direct 
democracy  are  seen  in  a  “social  bias,”  where  lower 
social  strata  with  deficits  in  status,  income,  and 
education  tend  to  be  less  motivated  or  competent 
to  participate  in  discussions  or  in  voting.  Again,  at 
least  to  some  degree,  this  seems  to  be  balanced  by 
policy  issues  specifically  relevant  for  these  groups 
in  which  they  participate  to  a  larger  degree. 

Finally,  if  a  larger  share  of  citizens  do  not  par¬ 
ticipate,  but  abstain  from  voting,  legitimation 
problems  in  referendum  votes  may  arise.  In  some 
jurisdictions,  regulations  respond  by  requiring  a 
qualified  majority  for  a  valid  vote  in  the  form  of 
turnout  or  approval  quorums.  The  disadvantage  of 
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turnout  quorums  is  that,  in  turn,  it  invites  even 
more  abstentions  and  campaigns  to  abstain  even 
from  voting  “no,”  while  approval  quorums  at  least 
devaluate  majority  votes. 

6.  Direct  democracy  institutions  have  also  been 
reflected  in  their  relation  to  majorities  and  minori¬ 
ties.  Citizen-initiated  procedures  are  supposed  to 
serve  as  potential  instruments  of  minorities  since 
they  can  present  new  proposals  or  demand  a  refer¬ 
endum  on  new  legislation.  This  is  likely  to  be  true 
for  placing  issues  on  the  agenda.  In  the  referendum 
vote,  however,  the  majority  principle  applies, 
which  means  that  minority  rights  or  interests  can 
be  endangered.  Specific  concerns  relate  to  basic 
rights  of  minorities,  which,  however,  can  best  be 
protected  against  offensive  majority  rule  by  consti¬ 
tutional  guarantees,  courts,  and  prereferendum 
constitutionality  checks.  Sometimes,  the  validity  of 
referendum  votes  is  regulated  by  qualified  or  dou¬ 
ble  majorities  to  protect  minorities.  More  gener¬ 
ally,  referendums  can  also  support  developments 
toward  autonomy  or  even  independence  of  regional 
“minority”  populations. 

7.  Policy  impacts  of  direct  democratic  decision 
making  also  received  attention.  Research  on  eco¬ 
nomic  and  financial  effects  at  the  regional  and 
local  levels  of  Swiss  and  U.S.  direct  democratic 
institutions  found  beneficial  consequences  in  mac¬ 
roeconomic  and  fiscal  performance.  According  to 
one  thesis,  strong  interest  groups  will  gain  more 
from  initiatives  and  referendums,  whereas  empiri¬ 
cal  economic  studies,  such  as  the  one  by  John 
Matsusaka  (2004),  tend  to  find  advantages  rather 
with  the  broader  population.  In  other  policy  issues 
such  as  the  environment  or  moral  topics,  tentative 
and  controversial  evidence  prevails. 

8.  Other  consequences  attributed  to  direct 
democracy  refer  to  structural  or  system  impacts  on 
representative  democracy  or  the  overall  system  of 
democracy.  In  the  case  of  Switzerland,  particu¬ 
larly,  it  has  been  argued  that  direct  democracy  had 
a  long-term  effect  toward  a  system  of  consensus 
democracy  as  opposed  to  majoritarian  democracy. 
Mechanisms  of  consensus  governments  may  indeed 
have  developed  to  anticipate  and  integrate  as  many 
interests  as  possible,  which  otherwise  might  be 
able  to  initiate  referendums  against  new  legislation 
(instruments  such  as  citizens’  initiatives  and  man¬ 
datory  referendums  would  be  less  relevant  here). 


In  other  jurisdictions,  however  (e.g.,  in  Italy  or 
German  or  U.S.  states),  similar  effects  away  from 
majoritarian  party  competition  and  toward  con¬ 
sensus  democracy  could  not  be  observed;  in  presi¬ 
dential  systems  such  as  in  the  U.S.  states  legislative 
majorities  and  the  executive  are  disconnected  any¬ 
way.  Thus,  generalizations  from  the  Swiss  example 
on  a  developmental  logic  toward  consensus 
government  should  not  be  easily  drawn. 

One  should  keep  in  mind  that  government- 
initiated  and  government-controlled  referendums 
may  in  many  respects  show  distinct  features  from 
citizen-initiated  procedures.  Government- driven 
instruments  tend  to  be  more  influenced  by  policy 
projects  and  campaign  capacities  of  central  politi¬ 
cal  authorities.  Citizen-initiated  procedures  are 
more  open  for  minorities,  participation,  innova¬ 
tion,  and  government  control,  yet  they  are  less 
likely  to  succeed  in  the  ballot  vote.  Nevertheless,  as 
a  process,  they  tend  to  offer  a  greater  potential  for 
supplementing  and  balancing  the  institutional 
shortcomings  and  power  structures  of  representa¬ 
tive  democracy.  Particularly  in  times  of  political 
crises,  direct  democracy  can  provide  an  important 
function  in  offering  channels  for  reactivating  pop¬ 
ular  sovereignty  as  the  fundamental  value  and 
force  of  democracy.  This  power  of  preserving  the 
sources  of  popular  sovereignty  alone  makes  it 
worth  while  to  keep  direct  democracy  going  under 
routine  conditions  of  democracy. 
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In  recent  decades,  discontent,  dissatisfaction,  and 
alienation  have  been  growing  political  phenomena 
in  old,  established  democracies.  At  the  same  time, 
more  and  more  countries  have  been  defined  as 
democratic.  Democratization  has  also  brought 
about  a  large  number  of  so-called  hybrid  regimes. 
From  these  three  different  perspectives,  the  same 
question  emerges:  What  is  the  quality  of  democ¬ 
racy  in  any  specific  country,  and  what  does  this 
tell  us  in  terms  of  (a)  how  actual  and  deep  the 
satisfaction  or  dissatisfaction  is,  (b)  what  is  behind 
democratic  appearances,  and  (c)  how  ambiguous 
cases  can  be  turned  into  democracies  by  improv¬ 
ing  their  quality?  This  entry  focuses  on  the  notion 
of  quality,  its  key  dimensions  or  qualities,  and  the 
mechanism  of  quality  subversion  as  a  key  aspect 
to  explore  when  dealing  with  this  topic. 

What  Is  Quality? 

There  are  at  least  three  main  streams  of  literature 
dealing  with  this  topic.  A  first  group  of  scholars 
have  been  working  on  democratizations,  consoli¬ 
dation,  and  crisis  and  in  their  analyses  of  democ¬ 
racy  have  also  pointed  out  aspects  related,  directly 


or  indirectly,  to  the  topic.  Among  them  are 
Guillermo  O’Donnell  with  his  notion  of  “delega- 
tive  democracy,”  Arend  Lijphart  with  his  thesis  on 
the  superiority  of  consensual  democracy  vis-a-vis 
majoritarian  democracies  in  terms  of  implement¬ 
ing  democratic  quality,  and  David  Altman  and 
Anibal  Perez-Linan  who  refer  to  three  aspects  that 
draw  on  Robert  Dahl’s  concept  of  polyarchy  (civil 
rights,  participation,  and  competition).  The  second 
group  of  scholars  focused  on  established  democra¬ 
cies,  especially  those  belonging  to  the  Anglo-Saxon 
tradition — that  is,  the  United  Kingdom,  Canada, 
and  Australia.  Among  them,  David  Beetham  (see 
especially  David  Beetham,  Sarah  Bracking,  Iain 
Kearton,  &  Stuart  Weir,  2002)  developed  an 
analysis  in  terms  of  audit — that  is,  a  systematic 
assessment  of  institutional  performance  against 
agreed  criteria  and  standards.  The  auditing  proce¬ 
dure  should  follow  four  steps: 

1.  Identify  appropriate  criteria  for  assessment. 

2.  Determine  standards  of  good  or  best  practice 
that  provide  a  benchmark  for  the  assessment. 

3.  Assemble  the  relevant  evidence  from  both 
formal  rules  and  informal  practices. 

4.  Review  the  evidence  in  the  light  of  the  audit 
criteria  and  defined  standards  to  reach  a 
systematic  assessment. 

A  number  of  authors  followed  Beetham  by 
implementing  his  framework  in  other  countries, 
such  as  Canada  and  Australia  (see,  e.g.,  Marian 
Sawer,  Norman  Abjorensen,  &  Phil  Larkin,  2009; 
see  also  Todd  Landman,  2006). 

The  third  stream  is  formed  by  a  number  of  data 
banks,  such  as  those  of  Polity  IV,  Freedom  House, 
the  Economist  Intelligence  Unit,  World  Bank,  and 
also  the  Bertelsmann  Index,  which  provide  mea¬ 
sures  of  aspects  related  to  democratic  performance 
and,  more  generally,  to  the  quality  of  democracy. 
The  different  data  banks  are  massive  efforts  to  pro¬ 
vide  scores  and  rank  orders  of  a  large  number  of 
countries  or,  in  some  cases  (first  and  foremost 
Freedom  House),  of  all  existing  independent  coun¬ 
tries  on  key  aspects  such  as  rule  of  law  and  freedom. 

Thus,  phenomena  such  as  democratization  pro¬ 
cesses  and  political  science  research  paved  the  way 
for  the  development  of  a  more  systematic  theoreti¬ 
cal  analysis  of  the  quality  of  democracy,  where  a 
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key  point  is  spelling  out  a  clear  notion  of  quality. 
A  survey  of  the  use  of  the  term  in  the  industrial 
and  marketing  sectors  suggests  three  different 
meanings  of  quality. 

1.  Quality  is  defined  by  the  established  procedural 
aspects  associated  with  each  product;  a 
“quality”  product  is  the  result  of  an  exact, 
controlled  process  carried  out  according  to 
precise,  recurring  methods  and  timing;  here  the 
emphasis  is  on  the  procedure. 

2.  Quality  is  defined  by  the  structural 
characteristics  of  a  product,  be  it  the  design, 
materials,  or  functioning  of  the  good  or  other 
details  that  it  features;  here,  the  emphasis  is  on 
the  content. 

3.  The  quality  of  a  product  or  service  is  indirectly 
derived  from  the  satisfaction  expressed  by  the 
customers,  by  their  repeated  requests  for  the 
same  product  or  service,  regardless  of  how  it  is 
produced  or  what  the  actual  contents  are  or 
how  the  consumers  go  about  acquiring  the 
product  or  service;  according  to  such  a 
meaning,  the  quality  is  simply  based  on  result. 

The  three  different  notions  of  quality  are 
grounded  in  procedures,  contents,  or  results.  Each 
has  different  implications  for  empirical  research. 
Importantly,  even  with  all  the  adjustments 
demanded  by  the  complexity  of  the  object  under 
examination — democracy — it  is  still  necessary  to 
keep  these  conceptualizations  of  quality  in  mind  as 
definitions  and  models  of  democratic  quality/ies 
are  elaborated.  The  next  questions,  then,  are 
“What  is  a  quality  democracy?”  and,  more  pre¬ 
cisely,  “What  are  the  procedural,  content,  and 
result  qualities  of  a  democracy?” 

“Good”  Democracy 

A  quality  democracy  or  a  “good”  democracy  pres¬ 
ents  a  stable  institutional  structure  that  realizes  the 
liberty  and  equality  of  citizens  through  the  legiti¬ 
mate  and  correct  functioning  of  its  institutions  and 
mechanisms — that  is,  a  good  democracy  is  first 
and  foremost  a  broadly  legitimated  regime  that 
completely  satisfies  citizens  ( quality  in  terms  of 
result );  one  in  which  the  citizens,  associations,  and 
communities  of  which  it  is  composed  enjoy  liberty 


and  equality,  even  in  different  forms  and  degrees 
(i quality  in  terms  of  content)-,  and  one  in  which  the 
citizens  themselves  have  the  power  to  check  and 
evaluate  whether  the  government  pursues  the 
objectives  of  liberty  and  equality  according  to  the 
rule  of  law  ( quality  in  terms  of  procedure). 

Alternative  normative  definitions  of  democracy 
could  be  recalled,  such  as  liberal  democracy,  respon¬ 
sive  democracy,  participatory  democracy,  delibera¬ 
tive  democracy,  associative  democracy,  egalitarian 
or  social  democracy,  and  good  governance. 
However,  regardless  of  what  normative  definition 
of  democracy  is  accepted,  empirical  analysis  tends 
to  reveal  the  same  specific  features. 

What  Qualities? 

Eight  possible  dimensions  or  qualities  on  which  a 
“good”  democracy  might  vary  are  at  the  core  of 
the  empirical  analysis  to  cover  the  normative 
notions  of  democracy  mentioned  above.  The  first 
five  are  procedural  dimensions.  Though  also  rele¬ 
vant  to  the  contents,  these  dimensions  mainly  con¬ 
cern  the  rules.  The  first  procedural  quality  is  the 
rule  of  law.  The  second  and  third  procedural 
qualities  regard  the  two  forms  of  accountability 
(electoral  and  interinstitutional).  The  fourth  and 
fifth  are  the  classic  participation  and  competition, 
which,  however,  have  a  special  theoretical  status 
(see  below).  The  sixth  and  seventh  dimensions  are 
substantive  in  nature  and  deal  with  quality  as 
defined  in  terms  of  content:  The  sixth  is  full  respect 
for  rights  that  are  expanded  through  the  achieve¬ 
ment  of  a  range  of  freedoms.  The  seventh  is  the 
progressive  implementation  of  greater  political, 
social,  and  economic  equality.  The  final,  eighth, 
dimension  concerns  the  responsiveness  or  corre¬ 
spondence  of  the  system  to  the  desires  of  the  citi¬ 
zens  and  civil  society  in  general,  and  this  is  quality 
defined  in  terms  of  results.  Some  essential  consid¬ 
erations  about  these  dimensions  have  still  to  be 
added. 

The  institutions  and  mechanisms  of  representa¬ 
tive  democracies  are  actually  the  main  objects  of  the 
empirical  analysis  of  democratic  qualities.  This  is 
not  to  ignore  the  direct  democracy  as  the  highest 
expression  of  democratic  quality  but  to  acknowl¬ 
edge  the  centuries-long  empirical  experience  of  rep¬ 
resentative  democracies.  If  the  analysis  is  to  focus  on 
representative  democracies,  then  accountability — a 
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core  feature  in  the  representative  democracy — 
becomes  a  truly  central  dimension  insomuch  as  it 
grants  citizens  and  civil  society  in  general  an  effec¬ 
tive  means  of  control  over  political  institutions. 
This  feature  attenuates  the  difficulties  that  exist 
objectively  when  there  is  a  shift  from  direct  to  rep¬ 
resentative  democracy. 

The  principal  actors  of  such  a  democracy  are 
the  citizen-individuals,  the  territorial  communities, 
and  the  various  forms  of  associations  with  com¬ 
mon  values,  traditions,  or  aims.  In  this  sense,  the 
possibility  for  good  democracy  exists  not  only  in 
the  case  of  a  defined  territory  with  a  specific  popu¬ 
lation  controlled  by  state  institutions  under  a 
democratic  government  but  also  for  wider  ranging 
entities.  The  main  point  is  that  the  above-named 
subjects  are  at  the  heart  of  a  democracy  in  which 
the  most  important  processes  are  those  that  work 
from  the  bottom  up,  and  not  vice  versa.  In  this 
way,  the  transfer  of  the  analytical  dimensions  from 
the  national  level  to  the  supranational  level — 
though  not  uncomplicated  and  without  difficulty — 
is  possible. 

It  is  particularly  important  to  point  out  the  spe¬ 
cific  empirical  aspects  to  explore  for  each  of  the 
eight  qualities.  They  are  briefly  summed  up  here. 

Quality  in  Terms  of  Procedure 

Rule  of  Law 

If  the  rule  of  law  is  at  stake,  the  relevant  ele¬ 
ments  to  examine  are 

1.  individual  security  and  civil  order,  with  a  focus 
on  right  to  life,  freedom  from  fear  and  torture, 
personal  security,  and  right  to  own  property 
guaranteed  and  protected  through  the  country; 

2.  an  independent  judiciary  and  a  modern  justice 
system,  focused  on  mechanisms  establishing  an 
independent,  professional,  and  efficient  judiciary 
system  that  allows  equal  access  to  justice,  free  of 
the  undue  pressures  and  enforcement  of  decisions; 

3.  institutional  and  administrative  capacity  to 
formulate,  implement,  and  enforce  the  law, 
where  the  focus  is  on  the  governance  system 
(parliament  and  government),  the  capability  to 
ensure  the  production  of  quality  legislation  and 
the  implementation  through  the  country  of  a 
transparent  policy-making  process  allowing  for 


the  participation  of  civil  society,  and  the 
presence  of  a  professional,  neutral,  accountable, 
and  efficient  state  bureaucracy; 

4.  effective  fight  against  corruption,  illegality,  and 
abuse  of  power  by  state  agencies,  where  the 
focus  is  on  the  existence  and  implementation  of 
the  comprehensive  legislative  framework  to 
prevent  and  fight  the  corruption;  and 

5.  security  forces  that  are  respectful  of  citizen 
rights  and  are  under  civilian  control;  the  focus 
is  on  the  mechanisms  of  civilian  control  over 
security  forces  as  well  as  on  efficient, 
noncorrupted,  disciplined  police  forces 
respectful  of  human  and  political  rights. 

Electoral  Accountability 

If  the  quality  to  analyze  is  electoral  accountabil¬ 
ity,  the  best  strategy  for  detecting  it  empirically  is 
to  refer  to  its  most  immediate  conditions: 

1.  free,  fair,  and  recurrent  elections,  with  their 
specific  procedural  aspects; 

2.  plural  and  independent  information; 

3.  freedom  of  the  party  organization  and  related 
aspects;  and 

4.  the  presence  and  stability  of  alternatives. 

Interinstitutional  Accountability 

If  the  quality  to  analyze  is  interinstitutional 
accountability,  the  main  subdimensions  and  indi¬ 
cators  to  explore  refer  to 

1.  legislative-executive  relations,  with  special  focus 
on  the  parliamentary  opposition  or  the  role  of 
the  legislative  body; 

2.  constitutional  courts; 

3.  ombudsmen; 

4.  audit  courts;  and 

5.  modes  and  extent  of  decentralization. 

Political  Participation 

If  political  participation  has  to  be  considered, 
one  should  look  at 
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1.  identifying  participation,  to  become  a  “part,”  to 
revive,  or  to  restate  the  belonging  or 
identification  with  a  group  of  a  different  sort 
and 

2.  instrumental  participation,  to  try  to  achieve 
some  goal. 

So  the  basic  forms  in  the  case  of  conventional 
participation  are  referendum,  electoral  participation 
at  the  country  and  local  levels,  and  membership  and 
other  forms  of  affiliation  in  political  organizations 
and  interest  associations.  In  the  case  of 
nonconventional  participation,  the  basic  forms  are 
strikes,  demonstrations,  and  riots;  there  are  other 
forms  involving  participation,  in  addition  to  forms 
of  participation  with  regard  to  specific  policies. 

Political  Competition 

The  basic  salient  subdimensions  of  political 
competition  are  the  following: 

f.  competition  among  political  and  societal  actors, 
characterized  by  freedom  for  all  political  parties 
to  compete  with  each  other  and  by  fairness  of 
political  competition; 

2.  competition  within  political  and  societal  actors; 
and 

3.  the  output  side  of  political  competition. 

Quality  in  Terms  of  Content 
Freedom 

The  three  main  subdimensions  of  freedom  are 
the  following: 

f.  personal  dignity, 

2.  civil  rights,  and 

3.  political  rights. 

For  all  of  them,  there  is  the  existence  of  opportunity 
in  the  legal  system  of  the  country  and  the  actual 
guarantee  of  each. 

Equality 

The  guarantee  and  effective  implementation  of 
social,  economic,  and  cultural  rights  helps  reduce 


differences  in  those  domains.  In  addition,  when 
considering  equality,  other  basic  aspects  to  analyze 
include  the  following: 

1.  the  allocation  of  economic  resources  within  the 
population, 

2.  the  extent  of  poverty, 

3.  the  diffusion  of  education, 

4.  the  existence  of  gender  discrimination,  and 

5.  the  existence  of  ethnic  discrimination. 

Quality  in  Terms  of  Result 

Responsiveness 

Responsiveness  refers  to  the  legitimacy  of 
government — that  is,  the  citizens’  perception  of 
responsiveness.  In  other  words,  the  empirical 
aspects  to  consider  should  be  the  diffusion  of  atti¬ 
tudes  favorable  to  the  existing  democratic  institu¬ 
tions  and  the  approval  of  their  activities  because  of 
the  assumption  that  attitudes  of  satisfaction  show 
the  effective  perception  by  civil  society  of  existing 
responsiveness.  On  this  quality,  limited  resources 
and  economic  constraints  on  public  spending  affect 
the  responsiveness  of  even  the  wealthiest  countries. 
Likewise,  the  persistent  problems  posed  by  unem¬ 
ployment  and  immigration  are  also  illustrative  of 
the  near  impossibility  of  finding  generally  satisfac¬ 
tory,  legitimate,  and  responsive  solutions  in  con¬ 
temporary  democracies.  Indeed,  the  situation  is 
more  and  more  characterized  by  discontent,  dis¬ 
satisfaction,  fear  of  poverty,  and  general  demo¬ 
cratic  malaise. 

Recurrent  Patterns  of  Subversion 

A  different,  meaningful  perspective  on  the  analysis 
of  qualities,  above  all  a  more  realistic  one,  is  to 
look  at  all  the  recurrent  ways  in  which  elites  and 
citizens  consciously  or  otherwise  try  to  subvert 
those  qualities  for  their  political  or  private  pur¬ 
poses.  Here  the  eight  qualities  with  their  possible 
and  often  probable  subversions  are  reviewed. 

Rule  of  Law 

Starting  with  the  rule  of  law,  first  and  foremost, 
a  rigorous  application  of  laws  or,  in  certain  cases, 
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the  relationships  with  an  only  apparently  efficient 
bureaucracy  can  have  particularly  negative  conse¬ 
quences  for  the  most  socially  weak  and  vulnerable 
members  of  society.  Then,  there  is  the  possible  use 
of  the  law  as  a  genuine  political  “weapon.”  Here, 
one  can  see  a  persistent  and  diffuse  temptation  for 
politicians  to  use  the  law  against  their  adversaries 
if,  for  example,  the  opposition  is  condemned  to 
remain  as  such  for  a  long  time  and  has  no  chance 
of  electoral  victory  in  the  near  future.  Politicians 
are  also  tempted  to  use  judicial  acts  to  reinforce 
their  own  position  against  the  opposition.  In  other 
cases,  when  there  is  collusion  among  politicians, 
the  judges  themselves,  with  the  support  of  the 
media,  are  tempted  to  turn  to  the  judiciary  in 
retaliation  for  certain  political  decisions  that  they 
consider  unacceptable.  On  a  different  level,  there  is 
also  a  growing  tendency  among  individual  citizens 
or  economic  groups  to  resort  to  the  law  to  assert 
their  own  interests.  Some  scholars  note  this  phe¬ 
nomenon  as  a  “juridification”  of  contemporary 
democracy.  Finally,  and  not  altogether  different, 
there  is  the  popular  and  diffuse  cultural  attitude 
that  interprets  the  law  as  a  severe  impediment  to 
realizing  one’s  own  interests  that  should  be  cir¬ 
cumvented  in  any  way  possible.  This  attitude, 
which  is  common  in  various  countries  throughout 
the  world  from  Southern  Europe  to  Latin  America, 
Eastern  Europe,  and  also  Asian  democratic  coun¬ 
tries,  extends  from  the  popular  to  the  entrepre¬ 
neurial  classes. 

Accountability 

With  respect  to  electoral  accountability,  given 
the  well-known  opacity  of  political  processes  and 
the  complexity  of  reality,  politicians  have  ample 
opportunity  to  manipulate  their  contexts  in  such  a 
way  as  to  absolve  themselves  of  any  concrete 
responsibility.  Accountability  frequently  becomes 
a  catchphrase  more  connected  to  the  image  of  a 
politician  than  to  any  decisions  he  or  she  may 
have  taken  or  results  he  or  she  might  have  pro¬ 
duced.  Negative  outcomes  are  easily  justified  by 
making  reference  to  unforeseen  events  or  by  tak¬ 
ing  advantage  of  a  favorable  press  to  influence 
public  opinion.  At  the  same  time,  good  results, 
obtained  sometimes  at  the  cost  of  sacrifices  by  the 
governed,  might  result  in  negative  or  punitive 
judgments  for  the  governor  at  the  time  of  the  next 


elections.  Thus,  when  a  politician  supports  some¬ 
thing  that  is  unpopular  but  necessary  for  the  wel¬ 
fare  of  the  state,  he  may  lose  the  support  of  the 
electorate.  The  very  action,  often  ideological  and 
instrumental,  of  parties  or  other  components  of 
the  political  opposition  or  even  of  media  actors  in 
a  position  to  conduct  public  processes  can,  some¬ 
times  on  inconsistent  grounds,  make  the  effective 
implementation  of  electoral  accountability  more 
difficult.  The  lack  of  clear  distinctions  between 
incumbent  leaders  and  party  leaders — the  head  of 
government  often  also  controls  the  parties — 
means  that  parties,  be  they  of  the  opposition  or  of 
the  majority,  are  hindered  in  carrying  out  their 
role  as  watchdogs  for  their  constituents.  At  the 
parliamentary  level,  party  discipline  is  considered 
more  important  than  accountability  toward  the 
electors  and,  in  practice,  the  parliamentary  major¬ 
ity  supports  the  government  without  controlling 
it.  Furthermore,  there  should  also  be  a  clear  dis¬ 
tinction  between  the  responsible  leader,  either  of 
the  government  or  of  the  opposition,  and  the 
intermediate  layers  of  party  actors  that  range  from 
militants  to  sympathizers.  These  latter  should  trig¬ 
ger  a  bottom-up  process  that  gives  direction  to 
how  parties  should  control  the  government  or 
organize  their  opposition.  However,  recent  empir¬ 
ical  studies  on  party  organization  in  a  number  of 
advanced  democracies  indicate  an  opposite  trend, 
which  is  characterized  by  strong  party  leaders 
who  act  in  collusion  (instead  of  in  competition) 
with  other  parties  or  party  leaders.  The  most 
extreme  scenario  relating  to  this  phenomenon  is 
that  parties,  supported  by  public  financing,  effec¬ 
tively  form  “cartels”  where  the  political  opposi¬ 
tion  is  actually  disappeared. 

Citizens  in  European  countries  encounter  fur¬ 
ther  difficulties  in  ensuring  electoral  accountabil¬ 
ity  because  of  the  existence  of  the  supranational 
dimension  created  by  the  European  Union  (EU). 
The  most  fitting  example  of  how  governments  in 
these  countries  avoid  accountability  is  the  well- 
known  tactic  of  blame  shifting.  Here,  the  political 
responsibility  for  every  unpopular  decision  taken 
by  the  government  is  shifted  from  the  national  to 
the  European  level,  even  if  they  concern  clear-cut 
issues  such  as  streamlining  national  administra¬ 
tions  or  reorganizing  state  finances  to  meet  large 
national  deficits.  Governments  or  national  politi¬ 
cians  justify  actions  resulting  in  widespread  public 
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opposition  by  claiming  that  their  hands  were 
forced  by  opposing  coalitions  in  the  Council  of 
Ministers  of  the  EU  or  in  the  European  Council  of 
prime  ministers  and  chiefs  of  state  or  by  votes  in 
the  European  Parliament. 

As  Jose  Maravall  (1997)  has  discussed,  there 
are  many  ways  by  which  government  leaders  can 
avoid  electoral  accountability.  At  the  same  time, 
the  absence  or  extreme  weakness  of  interinstitu- 
tional  accountability  leaves  electoral  account¬ 
ability  as  the  only  instrument  for  guaranteeing 
this  dimension  of  quality  democracy.  The  chances 
of  exercising  electoral  accountability,  however, 
are  only  periodic,  and  in  some  cases  citizens 
must  wait  several  years  before  the  next  elections. 
As  O’Donnell  (1994)  notes,  the  result  is  that  we 
obtain  a  “delegative  democracy” — a  democracy 
of  poor  quality  in  which  the  citizen  casts  his  or 
her  vote  and  is  subsequently  ignored  until  the 
next  election.  Citizens  are  left  without  any 
means  of  controlling  corruption  and  bad  govern¬ 
ment,  and  there  are  no  other  institutions  really 
capable  of  guaranteeing  interinstitutional 
accountability. 

Participation 

Participation  can  be  subverted  and  constrained 
in  a  variety  of  subtle  and  overt  ways  in  democra¬ 
cies  around  the  world.  Citizen  dissatisfaction,  pas¬ 
sivity,  indifference,  and  alienation  are  key  reasons 
for  the  consequent  sharp  decline  of  voting  and 
other  forms  of  citizen  participation.  But  the  sub¬ 
version  of  the  meaning  and  consequence  of  par¬ 
ticipation  can  be  seen  when  it  is  no  longer  sponta¬ 
neous,  voluntary,  and  free  but  instead  comes  to  be 
influenced  and  even  shaped  by  a  different  sort  of 
elites.  A  key  role  in  this  can  be  played  by  television 
and  other  mass  media.  The  so-called  audience 
democracy  is  the  main  context  of  such  pseudopar¬ 
ticipation.  In  fact,  within  a  highly  personalized 
politics  where  communication  elites  are  very 
important  and  the  political  debate  is  transferred 
from  institutional  arenas  to  public  opinion,  effec¬ 
tive  participation  has  almost  no  room.  Moreover, 
there  are  attempts  to  secure  a  controlled  participa¬ 
tion  that  may  just  take  the  form  of  obedient  sup¬ 
port  for  government  actions.  That  is,  there  is  an 
effort  to  get  people  to  participate  but  only  with 
behaviors  that  support  the  incumbent  authorities. 


Other  forms  of  participation  are  discouraged,  and 
this  is  not  difficult  in  social  and  political  contexts 
with  a  poor  tradition  of  active,  autonomous  civil 
society.  Participation  compounded  by  various 
forms  of  violence  is  also  a  subverted  way  of  “tak¬ 
ing  part”  in  politics.  As  suggested  by  Dahl  some 
years  ago  (and,  more  recently,  in  1998),  a  key, 
necessary,  definitional  element  of  democracy  is  a 
firm  commitment  to  “the  peaceful  solution  of  con¬ 
flicts.”  Consequently,  the  use  of  violent  means 
twists  and  distorts  the  very  working  of  every 
democracy. 

Competition 

There  are  also  a  few  recurrent  patterns  of  sub¬ 
verting  competition.  The  first  one  is  the  attempt 
to  exclude  competition  in  some  area  where  the 
effective  working  of  competition  is  supposed  to 
have  relevant  consequences,  for  example,  by 
making  a  pact  between  two  parties  participating 
in  an  election  or  by  agreeing  to  exclude  a  priori 
a  political  actor,  person,  or  group  from  fair  par¬ 
ticipation  in  an  election.  Second,  a  distortion  of 
competition  can  be  the  end  result  of  inadequate 
implementation  of  rules  regulating  electoral  cam¬ 
paigns  and  financial  support  for  parties.  A  third, 
recurrent  way  of  distorting  the  competition  is  in 
obfuscating  the  program  and/or  policy  differ¬ 
ences  among  parties  or  party  coalitions.  Collusive 
pacts  between  government  and  opposition  may 
also  be  formed.  Especially  in  hybrid  regimes, 
competition  can  be  seriously  subverted  if  leaders 
and  parties  are  able  to  arbitrarily  control  the 
implementation  of  rules,  especially  the  electoral 
ones,  or  are  able  to  constrain  pluralism  of  infor¬ 
mation.  Finally,  use  of  violence  is  another  way  of 
subverting  the  competition  among  political 
actors. 

Freedom,  Equality,  and  Responsiveness 

Without  going  into  details,  freedom  and  equal¬ 
ity  are  subverted  when  they  are  merely  formally 
acknowledged  as  rights  but  not  put  into  practice. 
The  failure  to  allocate  the  funding  needed  for 
implementation  of  these  rights  is  one  common 
barrier  to  their  realization.  Responsiveness  can 
also  be  subverted  when  citizens  are  not  adequately 
informed  about  the  impact  of  political  actions,  as 
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such  information  is  at  the  root  of  the  formation  of 
responsiveness  perceptions. 

Empirical  research  on  the  varieties  of  ways  the 
dimensions  or  qualities  can  be  subverted  can  also 
make  it  possible  to  detect  the  democracies  with 
lesser  or  without  qualities  and  even  to  understand 
from  a  different  perspective  how  and  why  problems 
of  delegitimation  and  eventually  related  problems  of 
consolidation  can  emerge  in  the  scrutinized  country. 

Leonardo  Morlino 
LUISS  Guido  Carli 
Rome,  Italy 
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Theories  of  democracy  consist  of  definitions  and 
generalizations  used  to  describe,  explain,  and 
evaluate  existing  or  past  political  orders.  Their 
purpose  is  thus  twofold:  (1)  to  get  a  better  under¬ 
standing  of  the  preconditions  and  performance  of 
democratic  systems  and  (2)  to  try  to  judge  this  real¬ 
ity  according  to  certain  democratic  core  values. 

Democracy  means  collective  self-determination. 
Its  purpose  is  to  form  political  decisions  according 
to  the  will  of  the  citizens.  Apart  from  this  rather 
abstract  meaning,  there  is  a  great  difference  between 
ancient  and  modern  democracy.  The  first  section  of 
this  entry  recapitulates  the  mainly  negative  use  of 
democracy  in  ancient  thought  and  the  development 
from  the  ancient  to  the  modern  understanding  of 
democracy,  outlining  three  semantic  transforma¬ 
tions  from  the  ancient  to  the  modern  understand¬ 
ing.  The  second  section  focuses  on  contemporary 
theories  of  democracy,  distinguishing  three  types: 
empirical,  positive,  and  normative.  Finally,  two  dif¬ 
ferent  answers  to  the  postnational  challenge  of 
democracy  are  outlined:  the  concepts  of  global 
democracy  and  postdemocracy. 

Ancient  and  Modern  Theories  of  Democracy 
Ancient  Critics  of  Democracy 

In  ancient  political  theory,  “democracy”  was  a 
polemic  and  negative  concept.  In  the  late  5th  century 
BCE,  Pseudo-Xenophon,  the  first  Athenian  critic  of 
democracy  we  know  of,  calls  it  a  regime  in  which 
the  many  rule  in  a  selfish  and  destructive  manner. 
Democratic  men  strive  for  their  personal  gain,  and 
to  this  end,  they  not  only  suppress  the  aristocratic 
best  and  the  population  of  the  naval  colonies  but 
also  rule  without  regard  to  the  common  weal. 
According  to  Pseudo-Xenophon,  this  leads  to  an 
unjust  political  and  moral  order.  However,  he  does 
not  deny  a  certain  rationality  of  the  democratic 
regime  insofar  as  the  many  live  better  in  a  democ¬ 
racy.  Thus,  he  ends  with  the  paradox  that  from  an 
aristocratic  perspective  the  democratic  regime  is 
clearly  unjust,  whereas  the  same  order  seems  at  least 
internally  rational  from  a  democratic  point  of  view. 

It  is  in  Plato’s  work  that  this  paradox  is  resolved 
inasmuch  as  he  develops  a  metaphysical  founda¬ 
tion  of  the  political  order.  In  concurrence  with 
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Pseudo-Xenophon,  he  describes  the  supposed  short¬ 
comings  of  the  democratic  regime,  such  as  rhetorical 
betrayal  and  demagoguery.  Because  the  many 
cannot  know  the  political  arete,  they  are  merely  a 
pawn  for  the  sophists  who  are  interested  only  in 
their  personal  gain.  To  counter  these  democratic 
practices,  Plato  formulates  his  model  of  the  philoso¬ 
pher-king,  which  is  founded  in  an  epistemic  under¬ 
standing  of  politics.  For  Plato,  good  politics  is  based 
on  higher  knowledge,  which  only  the  few  with  spe¬ 
cial  philosophical  talent  and  training  can  attain. 

There  are  at  least  two  problems  with  this  fusion 
of  politics  and  philosophy.  The  first  is  its  meta¬ 
physical  character.  Plato  bases  his  political  philos¬ 
ophy  in  an  idealistic  framework  dubious  even  to 
his  contemporaries.  The  second  problem  is  the 
utopian  character  of  the  philosopher-king,  as  Plato 
himself  mentioned  in  the  Politeia.  Democratic 
practices  and  norms  are  very  widespread  in  the 
Athenian  demos,  so  his  model  is  simply  not  realiz¬ 
able.  Therefore,  Plato  argues  in  the  Laivs  for  a 
second-best  regime.  It  is  no  longer  the  philosopher- 
king  who  is  to  guarantee  the  good  order  but  a 
system  of  laws  that  regulate  even  the  smallest 
details.  To  integrate  the  many  into  the  regime,  he 
considers  democratic  modes  of  decision  making, 
such  as  the  participation  of  the  many  in  elections 
and  even  drawing  lots.  However,  these  concessions 
to  democratic  practices  do  not  mean  that  Plato  has 
abandoned  his  aristocratic  ideals.  Democratic 
institutions  are  subordinated  to  exclusive  ones, 
thereby  thwarting  the  rule  of  the  many. 

Aristotle’s  reflections  on  democracy  differ  from 
Plato’s  in  at  least  two  ways.  First,  he  develops  his 
insights  by  examining  the  empirical  world.  Second, 
he  transcends  Plato’s  dichotomies  of  philosophy 
and  democracy,  the  few  and  the  many,  and  formu¬ 
lates  a  more  integrated  and  therefore  realistic 
understanding  of  the  political  world.  Aristotle 
even  concedes  that  the  democratic  order  displays 
some  rationality  insofar  as  the  many,  if  they  delib¬ 
erate  together,  can  obtain  more  information  than 
an  oligarchic  assembly  can.  Nevertheless,  he  is  no 
friend  of  democracy.  In  Politics ,  he  criticizes  the 
democratic  order  of  Athens,  which  he  regards  as 
“extreme,”  and  as  Pseudo-Xenophon  does,  he 
criticizes  demagoguery  and  the  tendency  of  the 
demos  to  neglect  the  common  weal.  Therefore, 
Aristotle  argues  for  a  more  moderate  constitution, 
the  “polity,”  which  he  sometimes  also  calls  the 


“best”  democracy.  This  constitution  is  character¬ 
ized  by  the  rule  of  law,  and  oligarchic  and  demo¬ 
cratic  institutions  are  mixed.  In  this  regime,  a 
strong  separation  exists  between  the  political 
experts — that  is,  the  educated  few,  who  are  chosen 
mainly  by  elections  and  not  by  lot,  and  the  orderly 
demos.  With  this  conception,  Aristotle  abandons 
the  Athenian  praxis  of  direct  democracy  and 
moves  in  the  direction  of  a  modern,  representative 
understanding  of  democracy. 

In  short,  for  these  “intellectual  critics  of  popular 
role”  (Josiah  Ober,  1998),  the  democratic  praxis 
of  the  ancient  world  with  the  direct  involvement  of 
the  demos  was  unjust  and  highly  pathological. 
This  assessment  did  not  change  after  the  decline  of 
the  Grecian  city-states  at  the  end  of  the  4th  century 
BCE.  On  the  contrary,  this  change  further  sup¬ 
ported  the  antidemocratic  bias  of  political  thought. 
The  reference  point  was  no  longer  the  political 
praxis  of  the  city-states  but  the  antidemocratic 
writings  of  Thucydides,  Plato,  and  Aristotle.  In  the 
Roman  discussion  and  the  political  theory  of  the 
Middle  Ages,  democracy  was  not  only  regarded  as 
a  regime  of  the  past  but  also  as  an  illegitimate 
order.  In  the  early-modern  age,  this  critical  or  even 
negative  assessment  of  democracy  changes  only 
slightly.  Until  the  end  of  the  18th  century,  the  pejo¬ 
rative  connotation  of  “democracy”  dominated 
political  thought.  However,  after  the  “democratic” 
revolutions  of  the  late  18th  century,  there  was  a 
new  beginning  in  democratic  theory.  It  was  a 
beginning  that  fundamentally  changed  the  seman¬ 
tics  of  democracy  from  the  direct  involvement  of 
the  masses  to  a  system  in  which  elected  representa¬ 
tives  rule. 

This  shift  in  meaning  from  the  antique  to  the 
modern  concept  of  democracy  was  carried  out  in  a 
multistage  process  of  transformation.  The  basic 
semantic  changes  concern  the  evaluation,  tempor- 
alization,  and  institutionalization  of  the  concept  of 
democracy  and  can  be  described  with  the  terms 
positivation,  futurization,  and  completion. 

Three  Semantic  Transformations  of  “Democracy” 

Positivation 

In  the  ancient  theories  of  Plato  and  Aristotle  as 
well  as  Cicero  and  Polybios,  “democracy”  was  a 
negative  concept.  All  major  primary  sources  from 
which  the  ancient  concept  of  democracy  is  handed 
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down  to  us  are  by  critics,  if  not  enemies,  of  democ¬ 
racy.  Their  critique  was  vehement,  and  their  list  of 
democracy’s  shortcomings  contained,  as  shown, 
very  different  points:  It  permits  unqualified  citizens 
to  participate  in  politics,  it  complicates  political 
decision-making  processes,  it  produces  bad  deci¬ 
sions,  it  debauches  the  political  culture,  or  it  is 
simply  an  amoral  order — just  to  mention  the  most 
important  points  of  criticism.  This  negative  usage 
of  the  concept  continued  uninterrupted  from  the 
Middle  Ages  to  modern  times,  and  only  in  the 
writings  of  Spinoza  and  in  the  political  speeches  of 
some  Dutch  Republican  thinkers  in  the  1780s  can 
one  find  attempts  to  give  democracy  a  positive 
designation.  This  positive  connotation  of  democ¬ 
racy  gradually  became  accepted  after  the  French 
Revolution  and  then  in  the  course  of  the  extension 
of  suffrage  in  the  United  States,  Western  Europe, 
New  Zealand,  and  Australia  in  the  19th  century. 
This  process  was  accompanied  by  ideological  dis¬ 
putes  that  ended  in  the  mid-20th  century.  Today, 
the  transformation  to  a  positive  concept  is  com¬ 
plete  at  least  in  Western  society;  the  concept  has 
developed  into  a  category  of  self-description  in 
global  political  disputes.  While  democracy  in  mod¬ 
ern  democracies  has  many  critics,  it  no  longer  has 
any  fundamental  enemies,  at  least  in  the  Western 
world. 

Futurization 

Even  the  political  thinkers  of  Hellenism  and 
later  Roman  authors  such  as  Cicero  regarded 
democracy  as  a  form  of  government  of  a  bygone 
era.  They  considered  it  a  thing  of  the  past  and  asso¬ 
ciated  it  with  the  existence  of  small  city-states  of 
the  lost  world  of  ancient  Greece.  If  only  for  that 
reason,  and  regardless  of  its  negative  aspects, 
authors  such  as  Baron  de  Montesquieu,  John 
Locke,  or  the  writers  of  the  Federalist  Papers  did 
not  think  it  a  serious  option  as  a  concept  for  the 
political  future.  Admittedly,  Montesquieu  arrives 
at  a  positive  understanding  of  democracy  in  his 
Spirits  of  the  Laws,  but  he  binds  the  idea  of  democ¬ 
racy  so  tightly  to  the  prerequisites  of  equality  and 
rurality  that  there  is  no  room  for  a  democratic 
order  in  his  age  with  its  advancing  economical  and 
social  differentiation.  For  him,  democratic  prac¬ 
tices  could  only  be  a  subordinate  element  in  a 
mixed  regime.  Montesquieu’s  understanding  is 


typical  for  the  equation  of  democracy  with  the 
praxis  of  the  ancient  city-states.  For  him,  democ¬ 
racy  was  a  regime  of  the  past  and  not  a  realistic 
option  for  the  future.  Georg  Wilhelm  Friedrich 
Hegel  had  a  more  positive  view  of  the  contempo¬ 
rary  meaning  of  ancient  democracy,  but  since  the 
liberation  of  the  subjective  mind,  he  too  was 
unsure  of  its  future.  Even  authors  such  as  Johannes 
Althusius  and  Jean-Jacques  Rousseau,  whose  theo¬ 
ries  ventured  to  take  big  steps  toward  positivation, 
were  rather  cautious  concerning  a  realistic  future 
for  democracy. 

It  was  only  with  Alexis  de  Tocqueville’s  book 
on  America  that  a  political  rhetoric  prevailed  that 
turned  around  the  structure  of  time  dominating  in 
most  contemporaries’  minds  and  enabled  them  to 
see  democracy  as  a  project  of  the  future.  In 
Tocqueville’s  view,  North  America  was  already 
mostly  a  democracy,  and  Europe  would  soon  be 
predominantly  a  democracy  as  well,  as  problem¬ 
atic  as  he  felt  this  tendency  to  be.  This  futurization 
made  the  concept  of  democracy  a  key  term  for  the 
political  battles  of  the  19th  and  20th  centuries. 
This  perspective  electrified  enemies  as  well  as  pro¬ 
ponents  of  democracy,  the  former  because  they 
now  faced  a  challenge  that  laid  claim  to  the  future, 
the  latter  because  they  had  a  feasible  political  proj¬ 
ect  with  the  name  “democracy”  before  them. 
Today,  the  futuristic  character  of  democracy  is 
undisputed.  Democracy  is  a  project  on  perfecting 
in  which  we  all  cooperate,  in  the  hope  of  one  day 
accomplishing  it  completely. 

Completion 

Third,  the  concept  of  democracy  underwent  a 
fundamental  change  in  its  institutional  inventory. 
While  there  was  a  primacy  of  political  participa¬ 
tion  in  antiquity,  slowly  a  constitutional  usage 
prevailed  that  systematically  restricted  the  moment 
of  direct  participation.  It  is  the  transition,  so  wel¬ 
comed  by  Benjamin  Constant,  from  the  freedom 
of  the  old  to  the  freedom  of  the  new  at  the  begin¬ 
ning  of  the  1 9th  century  that  makes  this  paradig¬ 
matic  rupture  apparent.  The  change  from  a  nega¬ 
tive  to  a  positive  evaluation  of  the  concept 
of  democracy  coincides  historically  with  the  tran¬ 
sition  to  a  primacy  of  liberal  defensive  rights  and 
the  installation  of  a  representative  system. 
Democracy  is  now  regarded  as  an  institutional 
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order  that  must  be  complemented  by  a  system  of 
“checks  and  balances”  so  that  negative  freedom — 
the  protection  of  the  individual  from  decisions  by 
the  democratic  majority — is  secured.  Accordingly, 
the  list  of  proposals  of  how  the  institutions  of 
democracy  should  be  complemented  is  long  and 
bears  witness  to  a  high  level  of  institutional  cre¬ 
ativity  on  the  part  of  contemporary  authors.  The 
most  important  ones  are  constitutionalism  (e.g., 
independence  of  the  judiciary,  a  coherent  legal 
system),  different  models  for  the  separation  of 
powers,  federalism,  and  multistage  representative 
systems. 

It  is  only  because  of  these  three  semantic  trans¬ 
formations  that  a  concept  of  democracy,  which 
stands  in  such  conspicuous  discrepancy  to  its 
original  usage  in  antiquity,  could  survive.  In  view 
of  the  great  shifts  in  meaning,  it  is  hard  to  answer 
the  obvious  question  of  why  the  concept  of 
“democracy”  was  not  simply  given  up,  instead  of 
being  intricately  filled  with  new  meanings.  Its 
astounding  ability  to  survive  can  probably  be 
explained  best  by  the  attraction  of  associations 
inspired  by  the  parts  of  the  Greek  compound — 
“demos”  and  “kratein.”  The  rhetorical  reference 
to  “the  people”  and  their  “rule”  constitutes  a — 
however  weakened — reference  to  participatory 
components  in  political  systems  and  provides  them 
with  mass  legitimacy. 

Modern  Theories  of  Democracy 

Empirical,  Positive,  and  Normative 
Theories  of  Democracy 

There  are  different  ways  to  comprehend  the 
extensive  and  confusing  debates  of  modern  theo¬ 
ries  of  democracy.  Basically,  the  discussions  can  be 
grouped  in  two  approaches:  diachronic  and 
synchronic  ones.  The  first  approach  traces  the  his¬ 
torical  development  of  democratic  thought.  The 
purpose  of  this  method  is  to  detect  the  crucial  steps 
and  the  striking  changes  of  democratic  thought. 
The  “advancements”  of  modern  theories  of  democ¬ 
racy  are  shown,  for  example,  the  history  of  the 
concept  of  representative  democracy  from  John 
Locke  to  Robert  A.  Dahl.  In  the  second  approach, 
theories  of  democracy  are  condensed  to  models  or 
paradigms,  for  example,  liberal  or  republican  theo¬ 
ries.  The  purpose  of  this  method  is  to  compare  the 
different  models  and  to  rank  them.  This  ranking 


can  take  place  according  to  their  degree  of  accor¬ 
dance  with  the  institutions  of  liberal  democracy  or 
the  requirements  of  global  governance.  However, 
there  is  another  distinction  underlying  these  differ¬ 
ences.  It  concerns  the  scientific  modality  of  the 
theories,  the  way  they  look  at  democracy. 
Therefore,  one  can  distinguish  between  three 
“logical”  modes  of  democratic  theory:  the  empiri¬ 
cal,  the  positive,  and  the  normative  type. 

Empirical  Theories  of  Democracy 

Empirical  theories  of  democracy  try  to  rank 
political  systems  according  to  a  scale  of  democratic 
values  and  institutions  or  to  determine  the  neces¬ 
sary  functional  preconditions  of  democratic  sys¬ 
tems  and  measure  how  such  systems  perform. 

The  goal  of  the  first  group  is  to  construct  reli¬ 
able  and  standardized  scales  in  order  to  obtain  a 
yardstick  for  comparing  different  political  systems 
that  can  then  be  ranked  according  to  their  degree 
of  democracy.  Dahl  wrote  the  classical  study  in 
1971.  Dahl  formulates  seven  indicators  of  a 
“polyarchy,”  which  in  his  view  is  the  modern  form 
of  democracy:  (1)  the  freedom  to  form  and  join 
organizations,  (2)  freedom  of  expression,  (3)  the 
right  to  vote,  (4)  the  right  of  political  leaders  to 
compete  for  support,  (5)  alternative  sources  of 
information,  (6)  free  and  fair  elections,  and 
(7)  institutions  for  making  government  policies 
responsive.  The  points  of  reference  for  these  demo¬ 
cratic  standards  are,  on  the  one  hand,  the  norms 
and  institutions  of  Western  democracies  and,  on 
the  other,  normative  theories  of  democracy,  which 
try  to  justify  these  norms  and  institutions.  In  the 
next  step,  Dahl  measures  political  systems  accord¬ 
ing  to  these  indicators  and  then  orders  them, 
applying  a  scale  ranging  from  full  polyarchies  to 
near-polyarchies  to  nonpolyarchical  systems. 
Dahl’s  empirical  finding  is  that  the  Western  democ¬ 
racies  are  not  the  only  full  polyarchies  but  also 
countries  such  as  India  and  Costa  Rica. 

Dahl’s  work  inspired  many  empirical  studies 
based  on  different  theoretical  foundations.  Especially 
in  recent  years,  measuring  democracy  has  become  a 
burgeoning  academic  pursuit.  The  discussion  is  this 
field  is  focused  on  adequate  indicators  of  democ¬ 
racy  and  their  application  in  empirical  research.  At 
this  point,  a  fundamental  problem  of  measuring 
democracy  arises:  The  selected  indicators  as  well  as 


Democracy,  Theories  of  575 


their  operationalization  rest  on  more  or  less  care¬ 
fully  considered  normative  assumptions  about  the 
nature  of  democracy  (see  below). 

A  second  group  of  empirical  theories  tries  to 
analyze  the  functional  preconditions  and  the  perfor¬ 
mance  of  democratic  systems.  Their  starting  point 
is  the  sociological  theory  of  modernization,  which 
in  the  1950s  asserted  that  the  connection  between 
economic  development  and  the  political  system  of 
societies  was  narrow.  In  1960,  Seymour  Martin 
Lipset  wrote  one  of  the  first  studies  in  this  field.  He 
investigated  the  connection  between  the  degree  of 
democratization  and  social-economic  variables  such 
as  average  income  or  level  of  education.  His  main 
result  confirmed  the  theory  of  modernization:  The 
higher  the  level  of  economic  development,  the 
more  democratic  a  society  will  be.  Many  scholars 
have  followed  Lipset  and,  with  some  variations  in 
research  design,  have  come  to  similar  conclu¬ 
sions.  Compared  with  autocratic  systems,  demo¬ 
cratic  ones  perform  much  better  in  most  policy 
areas  (e.g.,  health  care,  education,  or  environ¬ 
mental  resource  management).  Such  research  can 
be  carried  out  contrasting  democracies  with  other 
political  orders  such  as  monarchies  or  one-party 
systems.  Another  way  is  to  estimate  the  perfor¬ 
mance  of  different  subtypes  of  democratic 
systems,  for  example,  parliamentarian  or  presi¬ 
dential  systems. 

Positive  Theories  of  Democracy 

Positive  theories  of  democracy  construct  formal 
models  of  the  democratic  process,  for  example, 
voting  behavior.  They  do  not  involve  empirical 
study  of  the  workings  of  real  democracies  but 
rather  are  deductive  theories  of  political  processes 
under  constructed  conditions,  such  as  the  rational¬ 
ity  of  agents  or  the  closed  logic  of  functional  sys¬ 
tems.  Their  starting  points  are  certain  axioms  that 
are  used  as  a  basis  for  developing  the  main  charac¬ 
teristics  of  democratic  systems.  In  contrast  to 
empirical  theories,  neither  the  basic  assumptions 
nor  the  causal  or  functional  explanations  of  posi¬ 
tive  theories  claim  to  be  normative.  Rather,  the 
authors  of  positive  theories  explicitly  do  not  want 
to  formulate  normative  statements;  they  seek  only 
to  characterize  the  democratic  process  and  to 
explain  typical  political  sequences  in  modern 
democracies. 


There  are  two  versions  of  positive  democratic 
theories,  and  they  focus  on  the  opposite  ends  of  the 
democratic  process:  One  is  based  on  the  findings 
of  rational  choice  theories;  the  other  is  Niklas 
Luhmann’s  systems  theory.  Rational  choice  theo¬ 
ries  focus  on  the  microlevel  of  society  and  start 
with  the  assumption  that  individual  actors  are 
mainly  motivated  to  maximize  their  personal 
gains.  With  the  same  theoretical  grounding,  they 
try  to  explain  the  political  actions  of  collective 
actors  such  as  political  parties,  interest  groups,  and 
even  states.  In  contrast,  systems  theory  begins  on 
the  macrolevel  of  society.  Luhmann  regards  the 
different  realms  of  society,  for  example,  the  econ¬ 
omy,  the  political  sphere,  or  the  scientific  commu¬ 
nity,  as  distinct  and  self-contained  systems  and 
analyzes  their  structures  as  well  as  their  functions 
for  the  whole  of  society.  Individual  actions  are 
outside  of  the  theory’s  focus,  they  are  simply  irrel¬ 
evant.  In  Luhmann’s  view,  systems  function 
according  to  their  own  logic,  which  is  independent 
of  the  actions  of  individual  persons. 

Even  though  rational  choice  theory  and  sys¬ 
tems  theory  try  to  avoid  normative  claims,  they 
nevertheless  possess  enormous  critical  potential 
because  they  demonstrate  two  shortcomings  of 
democracy:  the  irrationality  of  democratic  deci¬ 
sion  making  and  the  constrained  range  of  political 
actions.  One  of  the  most  important  normative 
arguments  for  democratic  modes  of  decision  mak¬ 
ing  is  the  assumption  that  one  can  determine  the 
correct  will  of  the  majority  in  this  way.  However, 
according  to  the  findings  of  Moisei  Ostrogorski, 
one  of  the  forerunners  of  rational  choice  theory, 
this  is  a  myth.  The  Ostrogorski  paradox  shows 
that  even  the  smallest  changes  in  voting  behavior 
can  lead  to  big  differences  in  voting  results.  This 
finding  is  explosive  from  a  normative  point  of 
view  because  it  casts  doubt  on  the  legitimacy  of 
majority  decisions.  Besides  that,  rational  choice 
theory  does  not  regard  irrational  decision  as 
deviations  from  democratic  norms  but  as  the 
inevitable  result  of  the  aggregation  of  individual 
votes  or  of  the  merging  of  different  forms  of  ratio¬ 
nality  in  the  political  process. 

Similar  statements  can  be  made  about  the  find¬ 
ings  of  systems  theory.  Although  Luhmann  regards 
democratic  political  systems  as  the  appropriate 
form  of  modern,  functionally  differentiated  societ¬ 
ies,  he  criticizes  the  normative  bias  of  democratic 
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practices.  In  his  view,  the  political  system  in  mod¬ 
ern  societies  is  no  longer  at  the  top  of  a  pyramid 
but  is  just  one  system  among  others,  with  a  code 
of  its  own.  In  addition,  this  code  of  government 
and  opposition  cannot  be  transferred  into  other 
functional  systems,  for  example,  the  economic  or 
the  juridical  system,  without  harm.  Therefore,  the 
effects  of  democratic  politics  are  limited.  According 
to  Luhmann,  democratic  imperatives  simply  can¬ 
not  govern  the  other  functional  spheres,  the  code 
of  government  and  opposition  can  only  irritate,  at 
worst  destroy,  the  reproduction  of  the  other  sys¬ 
tems.  The  function  of  the  political  system  and  its 
democratic  form  is  thus  only  to  reproduce  the 
necessary  illusion  that  a  society  can  be  governed  by 
collective  decisions. 

In  contrast  to  empirical  and  positive  theories, 
normative  approaches  try  to  formulate  convincing 
justifications  of  democratic  orders.  The  goal  of 
normative  approaches  is  to  deliver  criteria  for 
praising  or  criticizing  normative  and  institutional 
orders.  In  this  way,  empirical  theories  can  use  their 
findings  to  evaluate  existing  political  systems. 
Normative  theories  explicitly  do  not  strive  for 
ethical  neutrality  or  freedom  from  value  judgment. 
Far  from  it,  these  value-based  justifications  are 
seen  as  an  essentially  scientific  endeavor.  Some 
scholars  cast  doubts  on  the  scientific  character  of 
normative  approaches  due  to  this  ethical  ground¬ 
ing.  However,  as  seen,  even  positive  and  empirical 
theories  are  not  value-free,  and  it  can  be  said  that 
normative  justifications  elucidate  the  inescapable 
nature  of  any  type  of  democratic  theory. 

The  Three  Normative  Axes 
of  Theories  of  Democracy 

Normative  theories  of  democracy  differ  from 
each  other,  too,  in  regard  to  the  way  they  are  rea¬ 
soned  as  well  as  in  their  institutional  conse¬ 
quences.  In  the  past  20  years,  we  have  witnessed 
a  tremendous  differentiation  and  refinement  of 
the  traditional  models.  In  addition  to  the  classical 
approaches  of  liberal,  elitist,  conservative,  social¬ 
ist,  and  participatory  theories  of  democracy, 
deliberative,  neorepublican,  neoliberal,  communi¬ 
tarian,  cosmopolitan,  associative,  feminist, 
ecological,  experimental,  multiculturalist,  and 
postmodern  theories  have  come  along,  to  mention 
only  the  most  important  ones.  The  subjects  of  the 


normative  theories  are  by  now  all  imaginable 
aspects  of  democracy:  its  traditions,  goals,  institu¬ 
tional  settings,  and  procedures.  Normative  theo¬ 
ries  have  been  and  still  are  largely  influenced  by 
political  fashions  whose  themes  are  at  the  center  of 
scientific  debates.  Therefore,  a  listing  and  sorting 
of  these  debates  offers  only  little  orientation  in  the 
jungle  of  the  numerous  normative  theories. 

Rather  than  simply  listing  them,  it  is  more  help¬ 
ful  to  ask  how  normative  theories  proceed  in 
assessing  real  or  hypothetical  political  arrange¬ 
ments  as  “democratic.”  In  other  words,  how  do 
they  construct  the  criteria  that  are  to  generate 
normative  statements  about  political  systems  and 
procedures?  Obviously,  the  development  of  theses 
criteria  does  not  take  place  in  a  vacuum  but  is 
incorporated  in  certain  political  experiences  and 
assumptions  about  the  central  problems  of  democ¬ 
racy.  Thus,  they  have  historical  underpinnings. 
The  forms  of  those  experiences  and  assumptions 
that  have  structured  the  debates  in  the  past 
3  decades  can  be  named  the  “three  normative  axes 
of  theories  of  democracy.” 

The  Social  Object  of  Democracy 

On  the  first  axis,  the  discussions  concern  the 
social  spheres  in  which  democratic  norms  should 
and  could  apply.  The  recent  debates  about  the 
appropriate  social  object  of  democracy  focus  on 
four  issues.  Common  to  them  is  an  inclusive  per¬ 
spective — that  is,  the  integration  of  additional 
social  spheres  under  democratic  rule.  An  initial 
focus  in  the  1970s  was  the  call  to  democratize  fur¬ 
ther  realms  of  society,  especially  the  economy,  the 
workplace,  and  educational  organizations.  These 
demands  found  a  certain  resonance  in  theories  of 
“strong”  or  “radical”  democracy.  In  the  1980s 
and  1990s,  feminist  theories  of  democracy  were  a 
second  focal  point.  Starting  from  the  empirically 
based  assumption  of  a  “gender  gap”  in  modern 
democracies,  their  goal  is  to  include  females  and 
female  perspectives  in  the  political  process.  In  most 
feminist  approaches,  the  fulfillment  of  this  demand 
is  connected  with  a  qualitative  advancement  of 
politics  and  policies.  These  positive  expectations 
are  related  to  a  new,  more  communicative  style  of 
politics  and  a  political  agenda  directed  to  making 
family  life  and  the  working  environment  more 
compatible  in  modern  societies.  In  a  third  debate 
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about  inclusion,  and  in  parallel  to  the  feminist 
theories  of  democracy,  various  concepts  of  multi¬ 
cultural  democracy  have  been  developed.  Based  on 
the  observation  that  minorities  in  liberal  democra¬ 
cies  are  discriminated  in  many  ways  and  that  they 
are  not  adequately  represented,  the  purpose  of 
multiculturalist  approaches  is  to  integrate  the  dif¬ 
ferent  collective  identities  in  the  political  sphere. 
The  proposals  for  reform  are  centered  on  the  open¬ 
ing  of  existing  institutional  arrangements.  Some 
authors  propose  special  mechanisms  of  group  rep¬ 
resentation  or  quotas  that  have  to  be  balanced 
with  the  liberal-democratic  demand  for  equality. 

In  a  fourth  and  more  recent  discussion,  some 
authors  argue  for  an  inclusion  of  children,  future 
generations,  and  even  apes  in  the  democratic  pro¬ 
cess.  At  the  very  least,  the  demand  to  consider  the 
interests  of  apes  in  political  decision  making, 
which  seems  odd  at  first  glance,  highlights  the  dif¬ 
ficulties  of  obtaining  adequate  criteria  for  limiting 
the  social  object  of  democracy.  In  the  semantics  of 
“democratization,”  democracy  is  in  principle  a 
never-ending  process  during  which  its  boundaries 
and  goals  must  be  discussed  again  and  again. 

Degree  and  Ways  of  Participation 

On  the  second  axis,  normative  theories  of 
democracy  formulate  statements  about  the  optimal 
degree  of  participation  and  the  ideal  relationship 
between  the  entire  demos  and  the  political  elites. 
The  focus  of  all  relevant  controversies  about 
democracy  during  the  20th  century  was  the  degree 
and  the  forms  of  citizens’  involvement  in  the  politi¬ 
cal  process:  the  debates  between  the  advocates  of 
government  by  council  democracy  and  those  of 
parliamentarism  in  the  1 920s  and  again  in  thel  970s, 
the  political  disputes  between  the  champions  of  a 
Fascist  leader  and  their  Liberal  and  Leftist  adver¬ 
saries  in  the  1930s,  the  controversy  between  the 
advocates  of  a  representative  and  those  of  a  plebi¬ 
scitary  democracy  in  the  1960s  and  in  the  1980s, 
or  the  dispute  between  the  theorists  of  an  elite 
democracy  and  those  of  a  grassroots  democracy  in 
the  late  1960s  and  early  1970s  and  again  in  the 
1990s.  In  all  those  debates,  it  was  and  still  is  con¬ 
tested  how  the  demos  is  to  be  included  in  the  pro¬ 
cess  of  political  decision  making.  Authoritarian 
concepts  consider  the  process  of  acclamation  to  the 
charismatic  leader  to  be  the  genuine  democratic 


way  of  declaring  consent;  liberal  and  elitist  concep¬ 
tions  link  their  versions  of  democracy  with  the 
principle  of  representation  and  elections;  and  theo¬ 
rists  of  a  grassroots  democracy  or  of  a  government 
by  council  democracy  support  the  idea  of  an 
intensely  participating  citizen. 

While  the  controversy  about  the  pros  and  cons 
of  a  higher  degree  of  citizen  participation  in  the 
1970s  and  1980s  were  hot-tempered  and  ideologi¬ 
cally  based,  it  has  cooled  down  since  then.  One  of 
the  main  reasons  for  this  development  is  the  find¬ 
ings  of  empirical  research  on  direct  democracy. 
Neither  the  hopes  of  the  advocates  nor  the  fears  of 
the  adversaries  have  been  fulfilled.  The  empirical 
results  show  instead  that  procedures  of  direct 
democracy  can  lead  to  a  higher  degree  of  citizen 
satisfaction  and  rational  problem  solving  in  certain 
cultural  and  institutional  contexts,  but  they  can 
also  be  used  as  an  instrument  of  populism.  Because 
of  these  complex  and  contradictory  findings  of 
empirical  research,  the  question  about  the  forms 
of  citizen  involvement  is  no  longer  at  the  center  of 
normative  theories  of  democracy.  By  now,  the 
mainstream  literature  focuses  on  the  institutions  of 
democracy  and  the  question  of  their  performance. 
This  leads  us  to  the  third  axis. 

Degree  of  Rationality 

On  a  third  normative  axis,  theories  of  democracy 
finally  contain  certain  assumptions  about  the  degree 
of  rationality  of  democratic  decision  making.  They 
make  statements  about  the  technical  and  factual 
reasonableness  of  democratic  decisions  and  even 
about  their  moral  quality.  Thus,  conservative  and 
liberal  critics  have  frequently  accused  democracy  of 
leading  to  irrational  decisions.  Leftist  advocates  of 
democracy,  on  the  other  hand,  have  considered  the 
democratic  character  to  be  the  ultimate  condition 
that  makes  correct  and  therefore  rational  decisions 
possible.  Only  in  recent  discussions  have  these  fac¬ 
tional  struggles  faded  away. 

The  rise  of  the  theory  of  deliberative  democracy 
in  the  past  2  decades  has  demonstrated  that  the 
question  of  rationality  has  gained  relevance  sig¬ 
nificantly.  Deliberation  is  the  exhaustive  and 
reflective  debate  about  political  questions.  The 
deliberative  give-and-take  of  arguments  aims  to 
elucidate  individual  and  collective  interests.  More 
ambitious  forms  of  deliberative  democracy  do  not 
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stop  at  this  point  and  demand  of  the  participants 
that  they  transcend  their  own  wishes  in  view  of  new 
moral  insights.  This  change  of  the  guiding  interests 
is  attributed  to  the  public  nature  of  deliberative 
processes  and  is  characterized  as  a  clarification  and 
moral  bettering  of  preferences.  Although  the  goal  is 
not  to  discover  an  unchangeable  moral  truth,  the 
expectation  is  that  in  the  deliberative  process  all 
arguments  that  serve  only  private  goals  can  be 
eliminated.  For  this  reason,  advocates  of  delibera¬ 
tive  democracy  claim  that  its  results  have  a  higher 
degree  of  legitimacy  than  elections  and  voting. 

Theories  of  deliberative  democracy  stand 
against  two  alternative  theories.  On  the  one  hand, 
they  criticize  rational  choice  conceptions  of  the 
aggregation  of  preferences  and  insist  that  prefer¬ 
ences  can  change  in  communicative  processes.  On 
the  other  hand,  deliberative  approaches  fault  mod¬ 
els  of  democracy  that  demand  a  higher  degree  of 
participation  without  showing  how  the  citizen  can 
gain  the  moral  resources  required  for  this  ambi¬ 
tious  endeavor. 

The  institutional  implications  of  deliberative 
approaches  go  in  three  directions.  The  first  is 
directed  toward  the  individual  citizen.  According 
to  Robert  E.  Goodin,  each  member  of  the  political 
community  should  reflect  on  the  moral  implica¬ 
tions  of  his  or  her  preferences  (“deliberation 
within”).  The  practical  consequence  of  this 
approach  is  the  demand  that  all  citizens  receive  a 
better  education,  which  is  to  lead  to  higher  sensi¬ 
bility  for  the  moral  interests  of  the  other  citizens.  A 
second  strain  argues  in  connection  with  Jurgen 
Habermas  for  the  deliberative  character  of  political 
institutions  as  representative  assemblies  and  chan¬ 
nels  of  the  public  sphere.  These  approaches  do  not 
strive  for  a  basic  change  of  the  political  institutions 
but  for  a  widening  of  the  deliberative  character  of 
liberal  democracies,  for  example,  in  strengthening 
the  discursive  character  of  parliaments,  or  for 
enhancing  the  rationality  of  public  debates.  Other 
authors  argue  for  institutional  innovations  within 
liberal  democracy.  One  prominent  suggestion  is 
the  deliberative  opinion  poll  by  James  Fishkin, 
where  randomly  selected  citizens  discuss  political 
questions  in  a  deliberative  setting.  The  results  of 
these  deliberations  can  inform  the  elected  authori¬ 
ties  or  even  lead  directly  to  political  decisions. 

A  third  and  more  radical  version  is  not  satisfied 
with  such  a  deliberative  interpretation  of  existing 


societies  but  argues  for  changes  in  the  basic  struc¬ 
tures  of  liberal  democracies  to  achieve  the  moral 
goals  of  deliberative  democracy.  They  criticize  the 
different  forms  of  exclusion  in  modern  societies 
that  are  still  relevant  in  deliberative  settings. 
Therefore,  they  demand  a  democratization  of  all  of 
society  as  a  prerequisite  for  deliberative  proce¬ 
dures.  These  radical  approaches  have  some  obvi¬ 
ous  connections  with  theories  of  participation  and 
grassroots  theories  of  democracy,  and  they  sup¬ 
port  a  politicization  of  civil  society. 

Interplay  of  Empirical  and  Normative  Theories 
and  the  Rationalization  of  Democracy 

Empirical  and  normative  theories  of  democracy 
are  of  course  not  entirely  separated.  As  seen, 
empirical  theories  draw  their  criteria  for  assessing 
the  democratic  nature  of  political  systems  from 
theories  about  the  essential  norms  and  institutions 
of  democracy,  such  as  Dahl’s.  Empirical  democ¬ 
racy  research  is  assisted  by  systems  theory  and 
rational  choice  theory,  each  of  which  in  its  own 
way  theoretically  deduces  why  political  participa¬ 
tion  in  modern  mass  democracies  is  nearly  without 
effect  and  ultimately  makes  no  sense  at  all. 
Nevertheless,  this  is  only  one  side  of  their  influenc¬ 
ing  one  another;  contemporary  normative  theories 
in  turn  are  oriented  to  the  findings  of  empirical 
research  on  theories  of  democracy. 

All  normative  approaches  that  can  currently 
claim  scientific  relevance  follow  the  findings  of 
empirical  democratic  theory.  In  the  academic 
debate  about  democracy,  no  idea  is  taken  seriously 
if  it  does  not  demonstrate  its  ability  to  be  con¬ 
nected  to  empiricism  and  thus  its  proximity  to 
reality.  Thus,  the  findings  of  empirical  research  on 
democracy,  for  example,  on  the  degree  of  political 
interests  or  on  the  irrationality  of  most  citizens’ 
political  preferences,  produce  subliminal  but  none¬ 
theless  powerful  pressure  on  the  making  of  norma¬ 
tive  theories.  In  this  perspective,  democracy 
becomes  a  regime  type  that  produces  a  certain 
amount  of  legal  certainty,  cultural  and  educational 
goods,  welfare,  and  other  collective  goods  (recently, 
above  all,  security  from  terrorism)  but  that  has  lost 
the  active  political  participation  of  its  citizens. 

With  this  result,  normative  democracy  theory 
again  provides  the  justification  for  empirical 
democracy  research  in  which  rule  of  law  and  the 
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production  of  welfare  and  stability  constitute  the 
most  important  parameters,  more  important  than 
the  participation  of  all  citizens:  At  best,  participa¬ 
tion  is  used  as  a  dependent  variable.  The  indices  of 
empirical  democracy  measurement  stubbornly  test 
for  certain  institutionalizations,  for  example,  the 
existence  of  individual  rights  or  of  the  basic  build¬ 
ing  blocks  of  a  parliamentary  democracy  with 
separation  of  powers.  The  participatory  compo¬ 
nent  has  become  a  ballast  of  the  concept  of  democ¬ 
racy,  standing  in  the  way  of  its  continued  success. 
When  reviewing  the  most  important  approaches  in 
current  political  theory,  one  notices — despite  all 
the  differences — a  common  terminological  shift 
with  which  the  path  to  a  fourth  and  new  semantic 
transformation  in  democratic  theory  was  paved. 
Most  current  theories  use  a  concept  of  democracy 
that  discerns  deep  chasms  between  political  par¬ 
ticipation  and  “rational”  results  and,  when  in 
doubt,  argues  against  political  participation. 

This  transformation  can  be  described  as  the 
“rationalization”  of  democratic  theory.  It  means 
that  the  focus  of  modern  theories  of  democracy 
has  shifted  to  the  evaluation  of  the  quality  of  the 
results  of  politics.  Democratic  theory  is  becoming 
more  and  more  output  oriented,  and  its  normative 
efforts’  main  goal  is  to  increase  the  degree  of  ratio¬ 
nality  of  this  output.  Larger  differences  within  this 
paradigm  occur  only  where  the  following  criteria 
of  rationality  are  concerned:  effectiveness,  feasibil¬ 
ity,  representation  of  interests,  justice,  or  the  pub¬ 
lic  weal.  Political  participation  is  no  longer 
regarded  as  the  goal  but  as  one  of  several  possible 
ways  to  enhance  the  degree  of  rationality  of  col¬ 
lectively  binding  decisions. 

The  Future  of  Democratic  Theory: 

Global  Democracy  or  Postdemocracy? 

One  of  the  most  important  motivations  to  “ratio¬ 
nalize”  democracy  is  the  decline  of  the  nation-state 
in  the  era  of  globalization.  In  the  past  2  decades, 
we  have  witnessed  a  displacement  of  political  deci¬ 
sions  from  the  national  to  the  international  or 
supranational  level.  This  is  of  course  a  reaction  to 
the  growing  need  for  global  coordination  of 
economic  and  political  processes.  The  answer  to 
this  postnational  constellation  is  the  development 
and  strengthening  of  regimes  and  organizations 
as  means  for  “global  governance.”  The  most 


impressive  of  these  supranational  organizations  is 
the  European  Union  (EU)  with  a  huge  amount  of 
political  competencies.  However,  even  at  the  global 
level,  there  is  a  large  variety  of  regimes  and  orga¬ 
nizations  for  coordinating  international  politics. 
One  only  has  to  think  about  the  World  Bank,  the 
International  Monetary  Fond,  and  the  World 
Trade  Organization  to  comprehend  the  relevance 
of  international  organizations  for  the  well-being  of 
nearly  every  person  on  the  planet.  However,  these 
global  economic  multilaterals  are  only  the  best 
known  organizations  of  global  governance.  It 
seems  that  the  era  of  the  nation-state  has  gone;  at 
least  the  political  power  of  the  states  to  determine 
the  lives  of  their  citizens  has  been  weakened  drasti¬ 
cally  in  the  past  20  years.  This  raises  the  question 
as  to  whether  postnational  forms  of  democracy  are 
imaginable. 

Theories  of  Global  Democracy 

The  process  of  globalization  is  a  major  chal¬ 
lenge  for  democratic  theory  because  most  of  the 
conceptions  discussed  above — with  the  exception 
of  deliberative  democracy,  see  below — have  taken 
the  national  base  of  democracy  for  granted.  In 
addition,  the  all-important  institutions  of  modern 
democracy  such  as  elections,  parliaments,  political 
parties,  and  the  public  sphere  are  embedded  in  the 
nation-state.  Nevertheless,  the  era  of  the  demo¬ 
cratic  nation-state  seems  to  be  coming  to  an  end. 
Accordingly,  there  is  a  growing  literature  on  the 
question  of  the  democratization  of  global  gover¬ 
nance.  One  of  the  issues  discussed  most  is  how  to 
transform  the  democratic  institutions  and  proce¬ 
dures  to  the  supranational  and  even  the  global 
level.  At  least  three  strains  of  the  debate  about 
democracy  in  the  global  order  can  be  discerned. 

The  first  group  of  authors  doubts  that  the  new 
structures  of  global  governance  are  responsible  for 
the  significant  decline  of  democracy  we  are  wit¬ 
nessing  today.  In  contrast,  the  second  group  begins 
by  asserting  a  fundamental  democratic  deficit  and 
tries  to  transfer  the  values  and  institutions  of 
national  democracy  to  the  supranational  and 
global  sphere.  In  addition,  a  third  group  makes  the 
effort  to  change  the  semantics  of  democracy  in 
order  to  demonstrate  the  perspectives  global  gov¬ 
ernance  may  open  up  for  democracy.  In  the  con¬ 
text  of  the  debate  about  a  so-called  democratic 
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deficit  of  the  EU,  but  also  in  view  of  the  institu¬ 
tions  of  world  politics,  authors  such  as 
Giandomenico  Majone  and  Andrew  Moravcsik 
argue  that  this  accusation  is  misleading  for  two 
reasons.  First,  there  exists  a  kind  of  legitimating 
chain  from  the  direct  or  indirect  election  of 
national  parliaments  and  governments  up  to  the 
supranational  institutions  such  as  the  European 
Commission.  It  is  a  kind  of  political  delegation  of 
power  from  the  constituents  in  each  country  to 
their  national  representatives  and  delegates  in  the 
transnational  or  global  settings.  Therefore,  even 
the  World  Trade  Organization  is  considered  dem¬ 
ocratically  responsive.  In  a  second  and  supporting 
way,  Majone  and  Moravcsik  argue  that  many  of 
the  supranational  and  transnational  institutions  of 
world  politics  are  directed  toward  technical  issues 
that  should  be  depoliticized.  Examples  include  the 
world  finance  institutions  or  the  European  Central 
Bank  or  institutions  occupied  with  juridical  and 
economic  issues.  In  these  authors’  view,  it  is  in  the 
interest  of  the  people  that  experts  manage  these 
issues,  because  only  they  have  the  necessary 
knowledge.  It  is  a  governing  not  by  but  for  the 
people.  Countering  this  opinion,  many  scholars 
have  objected  that  they  have  overlooked  the  fact 
that  juridical  and  economic  issues  of  global  gover¬ 
nance  are  inevitably  political.  Moreover,  without 
the  involvement  of  the  people,  there  is  the  risk  that 
these  bodies  of  experts  will  consider  only  the  inter¬ 
ests  of  strong  actors. 

Thus,  in  contrast  to  Majone  and  Moravcsik,  a 
second  group  of  authors  such  as  David  Held  and 
Daniele  Archibugi  begins  by  claiming  that  there  is 
a  real  democratic  deficit  in  global  governance. 
Their  theoretical  background  is  the  norms  and 
institutions  of  national  democracy,  and  they  try  to 
ascertain  how  these  norms  and  institutions  can  be 
transformed  to  the  supranational  and  even  the 
global  sphere.  Only  if  the  global  order  can  be  sub¬ 
jected  to  democratic  values,  such  as  the  equality  of 
citizens,  the  majority  principle,  and  the  duty  of 
governments  to  act  in  the  interests  of  the  people, 
can  the  new  forms  of  governance  be  legitimate. 
The  crucial  question  in  the  global  age  therefore  is, 
“How  can  democracy  preserve  its  core  values  and 
yet  adapt  to  new  circumstances  and  issues?” 
(Archibugi,  2004,  p.  446).  The  answer  to  this 
question  is  the  concept  of  cosmopolitan  democ¬ 
racy,  which  is  conceived  as  a  multilevel  order. 


There  should  be  different  degrees  of  democratic 
participation  at  the  local,  national,  interstate, 
supranational,  and  global  levels.  In  the  concept  of 
a  cosmopolitan  democracy,  the  participation  of 
the  people  at  the  local  and  the  national  levels  is  to 
follow  the  traditional  understanding  of  democratic 
norms,  institutions,  and  practices,  but  this  is  also 
demanded  regarding  the  supranational  and  the 
global  levels.  Advocates  of  cosmopolitan  democ¬ 
racy  argue  that  there  is  an  emerging  global  public 
sphere,  consisting  of  international  nongovernmen¬ 
tal  organizations  (INGOs)  such  as  Greenpeace  and 
Amnesty  International,  as  well  as  a  global  media 
system.  Even  if  these  structures  do  not  fulfill  the 
demands  of  a  democratic  public  sphere,  they  are  at 
least  the  necessary  conditions  for  strengthening 
and  increasing  the  transparency  and  accountability 
of  global  politics.  However,  these  are  only  the  pre¬ 
conditions  of  a  cosmopolitan  democracy.  Both  at 
the  supranational  and  the  global  levels,  parliamen¬ 
tary  institutions  are  regarded  as  the  cornerstone  of 
a  democratic  order.  For  the  advocates  of  a  cosmo¬ 
politan  democracy,  a  world  parliament  is  thus  the 
sine  qua  non.  This  parliament  is  to  be  elected  by  all 
citizens  of  the  world  (at  least  those  of  democratic 
states)  and  have  the  right  to  make  legislation  valid 
worldwide.  One  of  the  logical  necessities  of  this 
model  is  therefore  a  new  understanding  of  the  role 
of  states.  They  are  no  longer  sovereign  actors  but 
only  one  vehicle  of  the  democratic  governance  of 
the  people.  Some  authors  even  foster  the  notion  of 
a  world-state,  and  therefore  some  kind  of  world 
government,  to  implement  global  justice  and  demo¬ 
cratic  demands.  Opponents  of  cosmopolitan  democ¬ 
racy  fundamentally  doubt  the  prospects  of  a 
democratic  world  order.  They  refer  mainly  to  the 
structure  of  power  in  world  politics  and  to  the  enor¬ 
mous  obstacles  to  the  participation  of  the  people  in 
global  governance.  Nearly  6  billion  voters,  for 
example,  would  elect  a  world  parliament.  How  can 
one  guarantee  the  representative  nature  of  this  par¬ 
liament?  In  addition,  what  about  the  populations  of 
nondemocratic  systems — today  nearly  three  quar¬ 
ters  of  the  world’s  population?  The  critics  of  cos¬ 
mopolitan  democracy  view  even  the  global  public 
sphere  and  INGOs  critically.  For  others,  the  project 
of  cosmopolitan  democracy  is  just  a  new  way  to 
establish  and  strengthen  the  hegemonic  power  of 
the  West  and  the  values  and  practices  of  a  new 
global  class  as  cosmopolitanism  of  the  few. 
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A  third  group  also  begins  by  claiming  a  demo¬ 
cratic  deficit  in  global  governance,  but,  in  contrast 
to  the  advocates  of  cosmopolitan  democracy,  they 
do  not  want  to  transfer  the  institutions  of  national 
democracy  to  the  global  level.  Instead,  these 
authors  try  to  reconsider  the  notion  of  democracy 
in  order  to  close  the  legitimacy  gap  in  world  poli¬ 
tics.  There  are  at  least  two  ways  of  redescribing 
democracy  along  these  lines.  The  first  consists  in 
highlighting  new  ways  of  political  engagement. 
According  to  authors  such  as  Ann-Marie  Slaughter, 
international  non-INGOs  are  the  representatives 
of  a  global  demos.  Together  with  the  postulate  of 
a  growing  global  public  sphere  and  a  growing  pro¬ 
cess  of  juridical  constitutionalization,  INGOs  are 
seen  as  an  essential  component  of  a  new  demo¬ 
cratic  world  order.  Their  inclusion  in  the  processes 
of  global  governance,  for  example,  in  the  hearings 
of  the  World  Trade  Organization,  is  seen  as  a  cru¬ 
cial  step  toward  the  democratization  of  interna¬ 
tional  politics.  Critics  have  countered  this  scenario 
by  arguing  that  a  global  public  sphere  and  a  vivid 
array  of  international  civil  actors  are  only  the  pre¬ 
condition  but  not  the  essence  of  democracy. 
Others  criticize  the  oligarchic  structure  of  most  of 
the  INGOs  and  that  they  mostly  articulate  the 
demands  of  the  rich  countries  of  the  Western 
world.  The  question  therefore  is,  Who  has  autho¬ 
rized  these  organizations?  Furthermore,  critics 
object  that  the  influence  of  the  INGOs  in  interna¬ 
tional  politics  should  not  be  overestimated. 
According  to  these  critics,  world  politics  is  a  game 
played  only  by  powerful  elites  and  their  experts.  In 
their  view,  the  participation  of  nongovernmental 
organizations  merely  functions  to  legitimize 
undemocratic  ways  of  decision  making. 

A  second  way  of  adapting  the  meaning  of 
democracy  to  the  needs  of  global  governance  is 
taken  by  the  advocates  of  a  deliberative  under¬ 
standing  of  politics.  Inspired  in  particular  by  the 
work  of  Habermas,  some  authors  rely  on  the  epis- 
temic  functions  of  transnational  decision-making 
bodies.  The  starting  point  is  Habermas’s  distinc¬ 
tion  between  a  substantial  and  a  proceduralist 
understanding  of  popular  sovereignty.  Habermas 
refers  to  an  anonymous  civil  society  without  a  con¬ 
crete  democratic  subject.  This  opens  up  the  possi¬ 
bility  of  transferring  the  concept  of  democracy  to 
the  global  sphere  where  the  demos  is  only  conceiv¬ 
able  in  form  of  the  manifold  demands  articulated 


there.  The  crucial  step  is  the  integration  of  these 
demands  into  the  global  decision-making  bodies. 
In  addition,  a  second  feature  of  deliberative  democ¬ 
racy  is  relevant  here — namely,  the  epistemic  under¬ 
standing  of  politics.  As  seen  above,  the  goal  of  the 
deliberative  process  is  to  tease  out  the  better  argu¬ 
ment  by  means  of  discussions  between  the  relevant 
groups.  But  it  is  often  difficult  to  include  all  rele¬ 
vant  groups  in  international  politics.  However, 
according  to  the  advocates  of  deliberative  democ¬ 
racy,  this  need  not  lead  to  an  undemocratic  way  of 
decision  making.  As  far  as  the  relevant  viewpoints 
are  included,  it  is  sufficient  that  political  experts 
discuss  the  relevant  topics  to  gain  “rationally 
acceptable  results.”  Critics  argue  against  this 
understanding  of  democracy,  insisting  on  the 
necessity  of  real  participation  by  the  people. 
Habermas’s  legitimation  of  the  epistemic  gains  of 
global  governance  threatens  to  erode  the  very 
meaning  of  democracy — that  is,  the  involvement 
of  all  the  people  and  not  only  the  advocates  of  the 
better  argument. 

Postdemocracy  and  Beyond 

Another  way  to  capture  the  changing  reality  of 
democracy  in  the  era  of  globalization  is  the  strat¬ 
egy  of  Colin  Crouch,  who  describes  Western 
political  systems  as  “postdemocracies.”  According 
to  Crouch,  genuinely  democratic  institutions  such 
as  parliament,  regular  elections,  party  competition, 
and  the  rule  of  law  still  exist.  Therefore,  these  soci¬ 
eties  differ  in  a  significant  way  from  autocratic 
societies.  Nevertheless,  the  processes  of  globaliza¬ 
tion  and  the  weakening  of  the  state’s  capacity  to 
regulate  the  economy  are  progressively  undermin¬ 
ing  these  institutions.  This  leads  not  only  to  a  loss 
of  importance  of  central  democratic  ways  of 
political  decision  making  but  also  to  a  shift  in 
power  relations.  According  to  Crouch,  Western 
societies  are  therefore  characterized  by  a  funda¬ 
mental  ambivalence.  On  the  one  hand,  the  forms 
of  democracy  still  exist,  and  are  even  expanding 
(e.g.,  due  to  the  establishment  of  new  forms  of 
direct  democracy),  but  on  the  other,  they  have  lost 
their  relevance  and  are  getting  wedged  in  by  new 
intransparent  forms  of  national  and  global  gover¬ 
nance,  where  powerful  elites  dictate  the  rule  of  the 
game.  The  most  significant  consequences  of  this 
new  power  structure  are  the  decline  of  egalitarian 
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politics  and  the  expansion  of  new  forms  of  politi¬ 
cal  marketing. 

Thus,  for  Crouch  the  use  of  the  term  democracy 
only  obfuscates  the  fact  that  in  Western  societies, 
too,  the  vital  circumstances  that  influence  the  lives 
of  citizens  can  be  decided  collectively  only  to  a 
very  small  extent — a  fact  that  was  conceded  in  the 
professional  discourse  of  political  science  long  ago. 
This  generates  not  only  expectations  but  more  or 
less  subliminally  also  a  deceptive  appearance. 

The  thesis  of  postdemocracy  is  a  polemical  reac¬ 
tion  resulting  from  the  desire  to  draw  attention  to 
the  fact  that  modern  Western  political  systems 
have  drifted  away  from  the  basic  democratic 
impulse.  However,  there  are  few  indications  that 
this  strategy  can  prevail  beyond  a  tight  circle  of 
scholars.  Rather,  one  can  expect  that  the  promise 
of  democracy  will  not  lose  its  political  force,  even 
in  a  world  in  which  the  national  base  of  demo¬ 
cratic  practices  is  weakened. 
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Since  the  end  of  the  Cold  War,  democracy  has 
become  the  unrivaled  form  of  government  in  the 
world.  Acceptance  of  a  country  as  a  full  partner  in 
the  global  community  of  nations  is  considerably 
facilitated  by  its  being  characterized  as  a  political 
democracy;  international  military  interventions, 
as  in  the  case  of  Iraq  and  Afghanistan,  state  their 
goal  as  the  building  of  democracy;  less  democratic 
countries  are  asked  to  improve  the  quality  of  their 
democracy  to  gain  esteem;  and  countries  that 
hardly  possess  the  attributes  of  democratic  gover¬ 
nance  claim  to  be  democratic  because  such  char¬ 
acterization  is  thought  to  bestow  prestige  on 
them. 

Among  democracies,  there  is  no  single  mode  of 
organizing  a  polity  as  a  political  democracy. 
Institutional  arrangements  of  democratic  gover¬ 
nance  have  varied  across  time  and  countries. 
Furthermore,  democratic  systems  have  evolved  and 
operated  in  countries  that  have,  among  others,  dif¬ 
ferent  histories,  cultures,  traditions,  economies, 
demographic  compositions,  and  socioeconomic 
characteristics.  These  factors  have  all  put  their 
imprint  on  how  democratic  institutions  are  orga¬ 
nized  and  operate  in  specific  countries.  Attempts  at 
presenting  and  discussing  types  and  typologies  of 
democracies  are  several.  Some  types  and  typologies, 
not  widely  employed  by  students  of  democracy 
thus  far,  have  not  been  included  in  this  entry.  But 
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before  presenting  the  most  well-known  analyses,  a 
methodological  “warning”  is  necessary. 

The  word  type  may  be  used  in  two  different 
senses.  First,  it  may  be  used  to  define  the  necessary 
set  of  characteristics  that  need  to  be  present  to 
identify  certain  phenomena  as  distinct  from  others 
and  give  it  a  name.  The  word  model  is  also  some¬ 
times  employed  to  convey  the  meaning  of  type  in 
this  sense.  Second,  it  may  be  a  category  that 
emerges  from  classifying  phenomena  according  to 
certain  criteria.  In  this  second  sense,  “type” 
emerges  as  a  subcategory — that  is,  as  the  product 
of  an  effort  at  generating  a  typology.  One  may,  for 
example,  identify  all  political  systems  that  satisfy 
the  criteria  for  being  a  democracy  (model)  but  then 
group  them  according  to  some  key  features  (vari¬ 
ables)  along  which  they  may  differ  from  each 
other.  This  exercise,  which  aims  at  developing  a 
typology,  generates  subcategories  that  are  also 
called  “types.”  Types  as  subcategories  of  a  typol¬ 
ogy  include  all  attributes  of  democracy  and  then 
some  that  make  them  unique,  helping  us  distin¬ 
guish  them  from  others. 

This  entry  is  a  discussion  of  the  types  of  democ¬ 
racy.  The  first  section  focuses  on  the  two  funda¬ 
mental  models  of  democracy,  direct  and  liberal 
representative,  as  major  concerns  of  political 
thought.  This  section  on  normative  models  intro¬ 
duces  the  fundamental  philosophical  underpin¬ 
nings  of  actual  democratic  systems  as  they  have 
developed  in  different  societies.  The  second  section 
offers  a  typology  of  democratic  systems  as  they 
operate  in  the  world  today.  Here  the  scheme  of 
classification  is  based  on  whether  minimum  or 
maximum  possible  majorities  are  sought  in  mak¬ 
ing  decisions  since  such  a  distinction  constitutes  a 
major  variable  along  which  contemporary  democ¬ 
racies  differ  from  each  other.  Some  types  based  on 
the  empirical  study  of  how  systems  function  are 
also  offered  here.  Finally,  normative  models  criti¬ 
cal  of  the  outcome  of  the  operation  of  contempo¬ 
rary  democracies  are  taken  up. 

Normative  and  Empirical  Models 

Two  different  approaches  may  be  used  in  defining 
types  and  developing  typologies.  The  first 
approach  confines  itself  to  positing  a  set  of  criteria 
or  norms,  thereby  defining  what  a  democracy  is  or 
ought  to  be.  Using  this  normative  theoretical 


approach,  one  may  identify  models  of  democracy. 
The  second  approach  focuses  on  societies  that 
already  have  systems  that  satisfy  the  norms  that 
are  depicted  in  the  definition  of  democracy  but 
then  proceed  to  classify  them,  thereby  developing 
typologies  and  then  types  as  subcategories  based 
on  how  each  differs  from  the  others,  for  example, 
in  terms  of  its  institutional  arrangements,  its  cul¬ 
tural  and  behavioral  environment,  and  its  func¬ 
tioning.  Understandably,  this  second  approach 
requires  an  empirical  examination  of  democratic 
societies.  This  approach,  in  contrast  to  normative 
democratic  theory,  constitutes  the  basis  of  empiri¬ 
cal  democratic  theory. 

Classic  Normative  Models  of  Democracy 

Dictionaries,  relying  on  the  Greek  origins  of  the 
word,  demos  (people)  and  kratos  (rule),  often 
define  democracy  as  rule  by  and  for  the  people. 
More  scholarly  definitions  have  elaborated  the 
basic  idea.  Charles  Tilly  (2007),  for  example,  has 
emphasized  “the  degree  that  political  relations 
between  the  state  and  its  citizens  feature  broad, 
equal,  protected  and  mutually  binding  consulta¬ 
tion”  (pp.  13-14).  Such  definitions  have  focused 
neither  on  who  the  people  are  nor  on  how  or 
through  what  mechanisms  they  rule  themselves. 
The  question  of  who  the  people  are,  after  historical 
and  long  struggles,  has  been  answered  by  the  real¬ 
ization  of  universal  suffrage.  How  the  people 
should  rule  themselves,  on  the  other  hand,  has 
been  given  two  different  answers,  leading  to  the 
emergence  of  two  normative  models  of  (1)  direct 
or  participatory  democracy  and  (2)  liberal  repre¬ 
sentative  democracy. 

Direct  Democracy  and  Its  Variants 

In  terms  of  historical  order,  direct  democracy 
comes  first.  Its  basic  idea  was  developed  and  prac¬ 
ticed  in  some  of  the  city-states  of  ancient  Greece, 
notably  Athens,  for  about  a  century  and  a  half  and 
then  disappeared.  The  basic  idea  of  direct  or  par¬ 
ticipatory  democracy  is  that  all  members  of  the 
political  community  take  part  in  discussion,  debate, 
and  decision  making  on  matters  belonging  to  the 
public  domain  without  the  intermediation  of 
agents.  Decisions  may  be  reached  by  the  collectiv¬ 
ity  by  voting  or  by  reaching  consensus  through 
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deliberation.  As  regards  who  qualify  as  members 
of  the  political  community,  throughout  history, 
different  criteria  for  exclusion-inclusion  have  been 
employed,  including  those  based  on  age,  gender, 
race,  being  a  free  man,  property  ownership,  and 
paying  taxes.  In  contemporary  times,  only  a  mod¬ 
erate  age  requirement  and  some  other  reasonable 
limitations  such  as  not  being  insane  constitute  the 
only  restrictions  to  being  a  citizen  with  full  rights 
to  participate  in  the  political  debate  and  decision 
making. 

Athenian  Democracy.  The  city-state  of  Athens  is 
usually  cited  as  the  place  where  democracy  was 
born.  Yet  Athenian  democracy  deviated  in  signifi¬ 
cant  ways  from  what  we  know  as  democracy 
today.  Only  free  men  of  25  years  or  older  qualified 
as  members  of  the  political  community  to  partici¬ 
pate  in  town  meetings  or  Ekklesiae  where  discus¬ 
sion  about  the  public  affairs  of  the  community 
affairs  took  place.  Decisions  were  made  by  direct 
vote.  Forty  Ekklesiae  were  hold  during  a  year.  An 
administrative  council  whose  members  were  iden¬ 
tified  by  drawing  lots  was  given  the  responsibility 
of  implementing  the  decisions  arrived  at  these 
meetings. 

As  a  normative  model  of  self  rule,  Athenian 
democracy  appears  attractive.  From  a  more  con¬ 
temporary  empirical  perspective,  it  may  be  criti¬ 
cized  for  not  being  sufficiently  inclusive  since  it 
allowed  only  free  men  of  25  years  or  older  to 
participate  in  decision  making  about  the  political 
affairs  of  the  community,  leaving  slaves  and 
women  out.  An  empirical  evaluation,  to  the  extent 
this  is  possible  in  view  of  scant  information, 
reveals  that  only  a  small  minority  of  citizens  actu¬ 
ally  attended  these  meetings.  In  fact,  meeting 
places  were  not  designed  to  accommodate  large 
crowds  of  citizens.  Leaders  with  oratorical  and 
organizational  skills  were  often  capable  of  deter¬ 
mining  both  the  agenda  and  the  decisions  that 
were  eventually  made.  Inevitably,  the  Athenian 
system  was  inefficient  because  it  took  much  time 
to  produce  a  decision.  And  finally,  since  atten¬ 
dance  was  often  limited  and  the  composition  of 
those  who  attended  the  meetings  varied,  policy 
coherence  and  continuity  were  difficult  to  achieve. 
Perhaps  it  should  not  be  surprising  that  neither 
Plato  nor  Aristotle  or  Thucydides  express  admira¬ 
tion  for  the  Athenian  system  whose  legend  has 


come  to  be  so  venerated  among  the  later  exponents 
of  democracy. 

What  have  we  inherited  from  Athenian  democ¬ 
racy?  Two  ideas  may  have  been  important:  first, 
the  idea  of  a  citizen  that  has  approximately  an 
equal  standing  and  voice  against  the  state;  and 
second,  the  rotation  of  civic  responsibilities  among 
citizens  (Tilly,  2007,  pp.  26-27).  Many  problems 
encountered  in  defining  and  implementing  democ¬ 
racy  in  contemporary  times  were  also  anticipated 
in  the  Greek  experience. 

Contemporary  Varieties  of  Direct  Democracy.  The 
Athenian  democratic  experiment  disappeared  after 
failing  to  meet  external  challenges.  The  notion  of 
direct  democracy  has  been  practiced,  however,  in 
later  times  in  two  different  ways.  First,  in  local 
government,  it  has  been  adopted  by  small  commu¬ 
nities  in  the  form  of  town  meetings.  Second,  in 
some  representative  democracies,  the  mass  elector¬ 
ates  have  been  asked  to  make  policy  decisions  by 
means  of  referenda. 

Although  the  practice  has  been  declining,  direct 
democracy,  sometimes  also  called  primary  democ¬ 
racy ,  where  all  members  of  the  political  commu¬ 
nity  come  together  to  decide  on  public  matters 
pertaining  to  their  town,  continues  to  be  practiced 
in  parts  of  New  England  (town  meetings)  and  in 
Switzerland  (Landgemeinde) .  Direct,  participatory 
democracy  as  a  comprehensive  system  of  rule 
appears  not  to  be  suited  for  communities  with 
populations  more  than  a  few  thousands  or  for 
complex  tasks  that  contemporary  governments 
need  to  address. 

Another  direct  democratic  practice  with  an 
increasing  frequency  in  recent  years  is  the  holding 
of  public  referenda,  asking  the  voters  to  determine 
policy  choices.  The  more  widely  used  form  of  ref¬ 
erendum  democracy  is  for  the  government  to  sub¬ 
mit  a  policy  question  to  a  public  referendum  either 
on  its  own  or  because  the  laws  require  it.  The  less 
widely  practiced  form,  known  as  the  citizen  initia¬ 
tive ,  allows  citizens  to  collect  a  required  number  of 
signatures  in  order  to  place  a  proposal  on  the  bal¬ 
lot.  The  initiative  is  used  at  all  levels  of  government 
in  Switzerland,  in  a  substantial  number  of  American 
states,  as  well  as  in  Italy  (Ian  Budge,  1996,  p.  85). 

Direct  Democracy  in  the  Future.  Technological 
developments  appear  to  have  opened  the  way  for 
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new  possibilities  for  practicing  direct  democracy. 
With  Internet  service  reaching  a  growing  number 
of  homes,  it  may  be  possible  to  consult  voters 
about  their  preferences  regarding  major  policy 
questions.  The  practice  of  such  tele-democracy 
may  lead  to  more  widely  approved  policies.  It  has 
also  been  suggested  it  may  even  be  possible  in  the 
future  to  realize  interactive  communications  and 
create  a  modern  version  of  Athenian  democracy,  so 
to  speak,  a  cyber-democracy. 

Liberal  Representative  Democracy 

After  the  disappearance  of  Athenian  democracy, 
there  was  an  interim  of  nearly  2,000  years  before 
the  idea  reappeared  but  this  time  in  liberal  repre¬ 
sentative  form.  The  new  system  involved  the  place¬ 
ment  of  individuals  into  public  decision-making 
roles  by  popular  elections,  thereby  conferring 
legitimacy  on  the  authority  of  those  who  were 
elected.  Liberal  democracy,  as  the  name  already 
suggests,  is  a  synthesis  of  two  underlying  concepts: 
liberalism  and  democracy.  Liberalism,  although 
attributed  changing  meanings  in  different  contexts 
and  time  points,  was  a  reaction  to  the  power  of 
absolute  monarchs  and  an  interventionist  church  in 
all  aspects  of  community  and  personal  life.  Arguing 
for  tolerance,  reason,  and  freedom  of  choice,  liber¬ 
alism  aimed  to  create  a  uniquely  private  sphere  for 
the  individual  in  which  neither  the  state  nor  the 
church  could  intervene.  Placing  the  individual  in 
the  center  of  their  thinking,  the  Liberals  advocated 
limiting  the  powers  of  the  state  through  constitu¬ 
tions  and  defended  private  property  and  the  mar¬ 
ket  economy  as  a  means  by  which  the  individual’s 
interests  would  be  served  (David  Held,  1996).  In 
liberal  representative  democratic  thought,  the  mar¬ 
ket  constituted  an  arena  where  individuals  pursue 
their  personal  gain  while  politics  served  as  the 
domain  in  which  the  interests  or  the  common  good 
of  the  community  would  prevail. 

The  idea  of  representative  government  appears 
to  be  a  natural  outcome  of  the  concern  of  liberal 
philosophers  with  the  protection  of  the  individual 
against  the  state.  This  would  be  achieved  by  giving 
the  citizens  the  power  to  choose  those  who  govern 
them  through  periodic  elections  and  by  holding 
those  elected  accountable  for  their  decisions  and 
actions  within  the  context  of  a  constitutional  sys¬ 
tem.  The  election  of  representatives  to  do  what  the 


citizens  were  expected  to  do  on  their  own  in  a 
direct  democracy  appeared  to  provide  the  solution 
to  the  question  of  how  large  number  of  citizens 
could  still  rule  themselves.  The  questions  of  who 
comprised  the  citizens  and  the  conditions  neces¬ 
sary  for  them  to  exercise  the  right  to  choose  those 
who  shall  govern  them  continue  to  constitute  criti¬ 
cal  questions  in  societies  that  are  in  the  process  of 
democratizing  and,  on  occasion,  in  societies  that 
are  already  democracies  today.  While  universal 
suffrage  appears  to  have  acquired  reasonably  uni¬ 
versal  acceptance,  the  conditions  under  which 
political  competition  is  to  take  place,  such  as  free¬ 
dom  of  association,  information,  and  expression, 
present  both  existing  democracies  and  societies 
that  aspire  to  develop  into  liberal  representative 
democracies  with  dilemmas  of  how  to  balance 
authority  and  the  coercive  capabilities  of  the  state 
and  the  collective  interests  of  the  community  with 
the  liberties  of  the  individual. 

A  multiplicity  of  social,  economic,  and  political 
processes  led  to  the  emergence  of  liberal  representa¬ 
tive  democracy  as  ideology.  By  the  19th  century, 
aided  by  growing  commerce  and  industrialization, 
European  kings  had  succeeded  in  unifying  small 
political  entities  into  nation-states,  creating  politi¬ 
cal  communities  much  larger  than  those  that  had 
existed  before.  The  industrial  revolution  had  also 
produced  social  classes,  most  notably  the  bourgeoi¬ 
sie  and  the  workers  that  rejected  the  absolute 
power  of  the  monarchs  and  searched  for  ways  to 
limit  the  scope  of  their  political  decisions  and  influ¬ 
ence  their  content.  Factors  such  as  urbanization, 
factory  production,  and  new  forms  of  communica¬ 
tion  and  transportation  provided  opportunities  for 
self-organization  not  possible  in  earlier  times.  The 
philosophy  of  liberal  representative  democracy 
developed,  in  this  historical  context,  as  the  frame¬ 
work  through  which  power  could  be  shared 
between  monarchs,  aristocrats,  and  the  mass 
publics  and  could  gradually  be  transferred  to  the 
latter.  The  size  of  nation-states,  in  terms  of  popula¬ 
tion  and  the  geographical  space,  necessitated  an 
arrangement  whereby  public  decisions  would  be 
made  by  relatively  few  people  that  represented  the 
many  who  had  elected  them. 

The  adoption  and  consolidation  of  liberal  repre¬ 
sentative  democratic  philosophy  and  arranging 
political  systems  along  democratic  lines  did  not 
become  a  fully  established  reality  even  in  Western 
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Europe  until  after  the  World  War  II.  Apart  from 
Spain  and  Portugal,  which  were  ruled  by  the  rem¬ 
nants  of  fascist  dictatorships  of  the  interwar 
period,  and  Greece  and  Turkey,  which  were 
plagued  by  occasional  military  rule,  other  mem¬ 
bers  of  the  Atlantic  Community,  calling  them¬ 
selves  the  “Free  World,”  possessed  regimes  based 
on  liberal  democracy.  The  adoption  of  liberal 
representative  democratic  systems  in  the  world 
received  an  initial  boost  from  the  replacement  of 
the  authoritarian  Spanish,  Portuguese,  and  later 
some  Latin  American  systems  by  democracies. 
But  the  most  significant  change  came  with  the 
demise  of  the  Warsaw  Pact  first  and  then 
the  Soviet  Union.  The  Baltic  States  and  many  of 
the  East  European  countries  managed  quickly  to 
convert  themselves  into  regimes  based  on  liberal 
representative  democracy.  This  opened  the  way 
for  democratization  in  other  parts  of  the  world 
and  created  pressures  on  democracies  to  improve 
their  performance. 

Empirical  Typologies  and  Types 

We  may  discuss  in  normative  terms  what  attri¬ 
butes  a  liberal  representative  democracy  ought  to 
have,  what  the  proper  basis  of  authority  is,  and 
what  purposes  it  may  serve.  But  in  determining 
whether  or  not  a  particular  system  is  a  liberal  rep¬ 
resentative  democracy  and  if  so,  what  type  of  a 
democracy  it  is,  an  empirical  and  descriptive 
approach  is  needed.  Many  scholars  have,  in  fact, 
tried  to  develop  an  empirically  testable  set  of  cri¬ 
teria — that  is,  a  procedural  or  operational  defini¬ 
tion — to  distinguish  political  democracies  from 
nondemocracies  and  then  offer  an  empirically 
derived  typology  of  democracies.  A  survey  of 
democracy  literature  shows  that  a  significant 
number  of  procedural  elements  have  been  pro¬ 
posed  that  identify  a  system  as  a  democracy. 
These  have  included  the  following: 

1.  the  right  to  vote,  equality  in  voting,  women’s 
right  to  vote,  elimination  of  property  and 
wealth  qualifications  for  voting,  and  inclusion 
of  all  adults; 

2.  the  right  to  be  elected; 

3.  the  right  of  political  leaders  and  political 
parties  to  compete  for  support  and  votes — a 
competitive  party  system; 


4.  periodic  free  and  fair  elections,  secret  ballot, 
and  absence  of  massive  fraud; 

5.  freedom  of  association; 

6.  freedom  of  expression; 

7.  freedom  of  information,  the  existence  of 
alternative  sources  of  information,  and 
opportunities  to  learn  about  different  policies; 

8.  institutions  for  making  public  policies 
depending  on  votes  and  other  expressions  of 
preference,  responsibility  of  all  power  holders 
to  the  electorate,  and  opportunities  for  effective 
participation; 

9.  constitutions  explicitly  describing  and  limiting 
the  authority  of  the  power  holder,  institutional 
checks  to  prevent  elected  leaders  from 
governing  arbitrarily  and  without  restraint,  and 
the  presence  of  accessible  procedures  for 
protecting  the  liberties  of  citizens; 

10.  control  of  the  agenda  by  the  elected  officials; 

1 1 .  the  ability  of  elected  officials  to  exercise  their 
constitutional  powers  without  being 
significantly  constrained  by  unelected  officials, 
such  as  bureaucrats  and  members  of  the 
military;  and 

12.  a  self-governing  polity  possessing  the  ability  to 
act  independently  of  the  constraints  of  an 
overarching  political  system,  and  minimum 
consensus  or  support  among  the  general  public 
for  values  such  as  respect  for  the  rights  of 
others  and  tolerance. 

As  is  evident,  other  elements  may  be  added  to 
this  list,  some  may  be  combined  under  more  gen¬ 
eral  headings,  and  some  may  be  broken  down 
further.  Some  have  also  argued  that  if  only  some  of 
these  procedural  elements  are  present,  a  system 
could  be  classified  as  a  lesser  or  diminutive  type  of 
democracy,  while  others  have  insisted  that  a  sys¬ 
tem  is  either  a  democracy  or  a  nondemocracy. 
Diminutive  types  are  covered  in  hybrid  regimes  in 
another  entry  in  this  encyclopedia. 

Classifying  Democracies 

Among  systems  that  are  classified  as  political  democ¬ 
racies,  there  are  significant  differences  in  terms  of 
how  political  institutions  are  organized  and  how 
they  operate.  Since  many  of  the  world’s  democracies 
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were  initially  located  in  Western  Europe  and  North 
America  or  in  countries  that  had  been  settled  by 
West  Europeans,  initial  attempts  at  classification 
focused  on  the  differences  between  how  democratic 
systems  operated  in  English-speaking  countries  and 
countries  in  Continental  Europe.  Such  geographi¬ 
cally  based  designations  were,  however,  gradually 
changed  since  democracies  not  only  operated  in 
other  geographies  but  also  seemed  to  be  spreading  to 
different  parts  of  the  world. 

Majoritarian  Versus  Consensus  Models 

The  departure  from  geography-based  classifica¬ 
tions  led  to  the  identification  of  two  basic  types: 
majoritarian  and  consensus  models  of  democracy. 
To  the  related  questions  of  who  shall  govern  and 
whose  interests  the  government  will  respond  to 
when  voter  preferences  diverge,  two  answers  were 
offered:  (1)  the  majority  of  the  people  or  (2)  as 
many  people  as  possible  rather  than  a  simple 
majority  (Arend  Lijphart,  1999).  If  a  majority, 
however  small,  was  considered  sufficient  for  gov¬ 
erning  and  making  rules  and  policies,  then  such  a 
democracy  was  a  majoritarian  democracy.  Since 
the  British  parliamentary  democracy  constituted 
the  foremost  example  of  this  type,  the  model  was 
frequently  referred  to  as  the  Westminster  model  of 
democracy,  though  this  was  but  one  example  of  a 
majoritarian  system. 

In  some  societies,  the  achievement  of  a  majority 
was  seen  as  a  minimum  condition  for  governing 
while  building  as  big  a  majority  as  possible  was 
accepted  as  a  political  goal.  A  democracy  in  which 
an  attempt  was  made  to  build  the  largest  possible 
majority  was  referred  to  as  the  consensus  model. 
Institutional  arrangements,  political  practices,  and 
patterns  of  interaction  among  political  actors  dif¬ 
fered  in  the  two  models.  For  example,  whereas 
executive  power  was  concentrated  in  one  party 
and  the  cabinet  in  the  Westminster  model,  leading 
to  the  domination  of  the  system  by  the  executive, 
power  was  shared  in  broad  coalition  cabinets  pro¬ 
ducing  a  government-legislature  balance  in  the 
consensus  model.  Majoritarian  democracies  were 
usually  characterized  by  a  two-party  government, 
consensus  models  by  a  multiparty  government. 
The  former  generally  used  the  first-past-the-post 
voting  system,  as  opposed  to  the  prevalence  of 
proportional  representation  in  the  latter. 


Efforts  to  develop  this  initial  classification  even¬ 
tually  led  to  the  development  of  a  more  refined 
typology  deriving  from  the  cross-tabulation  of  two 
axes:  the  structure  of  society  and  elite  behavior. 
The  structure  of  a  society  could  be  homogeneous 
or  plural,  while  the  behavior  of  the  elite  could  be 
coalescent  or  adversarial  (Lijphart,  1977).  The 
combination  of  the  two  axes  produced  four  types 
of  democracy: 

1.  plural-coalescent:  consociational  democracy, 

2.  plural-adversarial:  centrifugal  democracy, 

3.  homogeneous-coalescent:  depoliticized 
democracy,  and 

4.  homogeneous-adversarial:  centripetal 
democracy. 

Consociational  and  Centrifugal  Democracies 

In  the  case  of  societies  with  socially  heteroge¬ 
neous  populations,  whether  the  political  elite  pur¬ 
sues  coalescent  or  adversarial  behavior  appears  to 
make  a  determining  difference.  Coalescent  behav¬ 
ior  produces  a  consociational  democracy,  whereas 
adversarial  behavior  leads  to  a  centrifugal  democ¬ 
racy.  In  consociational  democracies,  political  lead¬ 
ers  strive  to  achieve  the  largest  majority  possible, 
incorporating  the  support  of  as  many  groups  and 
citizens  as  they  can  persuade.  The  preference  of  the 
political  elite  to  enlist  the  participation  of  larger 
numbers  than  needed  is  a  way  of  controlling  the 
centrifugal  tendencies  that  are  usually  present, 
even  if  not  always  inherent,  in  plural  societies.  To 
achieve  a  substantial  majority  at  the  societal  level, 
leaders  of  different  segments  of  the  population 
work  together  to  mobilize  majorities  in  their 
respective  communities.  They  also  respect  mutual 
vetoes.  In  this  way,  political  power  is  dispersed 
and  shared.  To  cement  the  system  further,  public 
funds  are  allocated  to  different  groups  comprising 
the  plural  society  on  a  proportional  basis  and  each 
group  is  granted  extensive  autonomy  to  organize 
its  own  affairs. 

If  the  political  elite  in  a  society  with  a  heteroge¬ 
neous  population  pursues  adversarial  behavior,  if 
they  insist  on  considering  the  achievement  of  a 
bare  majority  sufficient  to  produce  decisions  that 
are  binding  for  all  (i.e.,  if  they  pursue  a  policy  of 
strong  majoritarianism),  those  finding  themselves 
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frequently  or  permanently  in  the  minority  may 
find  the  outcome  unacceptable.  In  such  a  contin¬ 
gency,  the  tendency  would  be  for  each  group  to 
form  a  coalition  in  which  it  would  always  aim  to 
be  in  the  winning  camp,  directing  it  to  undo  any 
coalition  where  it  is  not  included  or  alternatively 
opt  out  of  a  system  that  consistently  fails  to  serve 
its  interests,  hence  the  centrifugal  democracy.  For 
example,  one  way  to  cope  with  inability  to  form 
stable  and  working  coalitions  in  the  face  of  cen¬ 
trifugal  tendencies  particularly  (but  not  only)  in  a 
presidential  system  is  for  an  elected  leader  to  opt 
for  delegative  democracy.  Generally,  consocia- 
tional  democracies  are  rather  stable,  while  centrif¬ 
ugal  democracies  are  exposed  to  intensive  tensions, 
instability,  and  the  possibility  of  malfunctioning 
and  breakdowns. 

Depoliticized  and  Centripetal  Democracies 

To  the  extent  societies  and  their  politics  are 
always  changing,  systems  of  specific  countries 
may  move  from  one  type  to  another.  For  example, 
during  World  War  II,  many  democracies  in 
Western  Europe  turned  into  depoliticized  democ¬ 
racies.  All  political  parties  were  united  around  the 
fundamental  goal  of  winning  the  war,  while  poli¬ 
tics  was  perceived  less  in  government  versus  oppo¬ 
sition  terms  and  more  in  terms  of  maintaining  the 
unity  of  all  political  actors  to  achieve  the  common 
objective  of  victory.  There  may  also  be  occasions 
when  temporary  conditions  encourage  the  form¬ 
ing  of  a  coalition  between  government  and  the 
opposition,  such  as  the  one  between  the  Christian 
Democratic  Union  and  the  German  Social 
Democratic  Party  under  Kurt  Georg  Kiesinger  in 
Germany  (1966-1969),  which  led  to  the  practice 
of  depoliticized  politics  for  a  limited  period  of 
time. 

Depoliticized  democracy  and  centripetal  democ¬ 
racy  are  distinguished  by  the  fact  that  in  the  latter, 
politics  is  conceptualized  much  more  as  a  competi¬ 
tive  game  between  government  and  the  opposition, 
whereas  depoliticized  democracy  is  characterized 
by  the  minimization  of  conflict  and  a  sustained 
effort  toward  achieving  a  broadly  based  consensus. 
The  British  system  is  given  as  the  example  of  a  cen¬ 
tripetal  democracy.  In  the  U.S.  system,  the  majority 
party  in  the  Senate  and  in  the  House  will  generally 
try  to  ensure  that  its  members  vote  with  the  party 


on  particular  pieces  of  legislation.  In  centripetal 
democracies,  the  competition  to  achieve  a  mini¬ 
mum  majority  (strong  majoritarianism)  does  not 
threaten  the  unity  of  the  system  or  the  integration 
of  rival  parties  into  it.  The  alternation  of  majori¬ 
ties  between  elections  reinforces  the  desire  of  com¬ 
peting  parties  to  win  the  elections,  inviting  them 
to  adopt  positions  that  will  appeal  to  large  groups 
of  voters  and  drawing  them  toward  the  center — 
hence  the  designation  as  centripetal  democracy. 
The  homogeneity  of  social  structure  stands  in  the 
way  of  political  fragmentation  and  the  consequent 
emergence  of  permanent  minorities  that  may 
sometimes  even  desire  to  break  off  from  the  sys¬ 
tem.  Strongly  adversarial  conceptualization  of 
oppositions  is  further  eroded  by  two  other  factors. 
First,  in  many  economically  advanced  societies, 
ideological,  religious,  and  to  a  more  limited  extent 
racial  tensions  have  receded  to  the  background  in 
recent  decades,  leading  voters  to  fail  to  perceive 
major  differences  between  competing  parties. 
Therefore,  a  change  in  the  governing  party  is  not 
seen  to  be  particularly  critical.  Second,  all  major 
interests  in  society  find  opportunities  to  be  repre¬ 
sented  in  decision  making  through  a  variety  of 
mechanisms.  These  points  will  be  further  elabo¬ 
rated  in  the  models  of  polyarchy  and  functional 
democracies. 

Polyarchy 

The  critical  problem  that  needs  to  be  overcome 
in  sustaining  a  democratic  system  and  ensuring  its 
stability  appears  to  be  that  no  group  be  cornered 
into  a  position  of  a  permanent  political  minority — 
that  is,  be  placed  permanently  in  an  inferior  status 
under  the  mercy  of  a  permanent  majority.  Such  an 
outcome  may  be  achieved  either  by  shifting  major¬ 
ities  or  by  ensuring  that  the  majority  is  as  large  as 
possible.  The  latter  option  is  associated  with  the 
consociational  democracy  that  is  usually  found  in 
plural  societies.  There  are  many  societies  that  are 
neither  plural  nor  consciously  subscribe  to  the  con¬ 
sensus  model.  What  is  the  mechanism  in  such  soci¬ 
eties  for  ensuring  that  the  same  majority  does  not 
prevail  over  the  same  minority  on  a  permanent 
basis?  The  simple  answer  is  that  voters  change  their 
preferences.  In  a  democratic  society  where  people 
can  form  associations,  express  their  thoughts,  and 
reach  alternative  sources  of  information  freely  and 
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where  free  and  fair  elections  are  held  regularly, 
voters  have  the  opportunity  to  replace  the  majority 
in  power  by  another  majority.  But,  this  explana¬ 
tion  does  not  rule  out  the  possibility  that  some 
groups  would  be  left  permanently  out  of  the  deci¬ 
sion-making  process. 

The  way  out  of  this  predicament,  Robert  Dahl 
argued  (1971),  is  the  recognition  of  how  contem¬ 
porary  democracies  operate  in  fact.  Although 
political  competition  appears  to  be  dominated  by 
political  parties,  there  is  also  a  proliferation  of 
organized  interest  groups.  The  political  competi¬ 
tion  in  a  democratic  society  does  not  simply  pro¬ 
duce  a  majority  and  a  minority.  Rather,  a  plurality 
of  actors  compete  and  cooperate  with  each  other, 
usually  forming  issue-based  majorities,  to  produce 
policy  decisions.  Different  coalitions  form  behind 
different  policy  choices  in  an  environment  in 
which  power  is  dispersed  among  various  actors  in 
society.  What  obtains  is  not  majority  rule  as  such 
but  a  rule  of  different  coalitions  of  minorities. 
These  minorities  negotiate,  bargain,  compromise, 
and  form  alliances  and  coalitions  to  produce  out¬ 
comes  that  they  desire.  Dahl  called  his  explanation 
of  how  democracy  works  polyarchy. 

Its  proponents  argue  that  polyarchy  is  an 
empirically  derived  description  of  how  democra¬ 
cies  function.  It  has  been  criticized  on  the  ground 
that  it  relies  to  a  large  extent  on  the  study  of 
American  experience.  It  assumes  that  organized 
interests  prevail  in  a  society,  that  these  represent  a 
significant  segment  of  society,  and  that  through 
competition  they  are  capable  of  controlling  each 
other  such  that  none  is  able  to  prevail  perma¬ 
nently  over  others.  These  assumptions  require 
empirical  scrutiny  particularly  outside  of  the 
United  States  and  specifically  in  European  and 
non-Western  environments.  Some  sample  ques¬ 
tions  would  include  whether  all  groups  in  society 
are  equally  capable  of  organizing  themselves  into 
effective  organizations,  whether  they  have  equal 
resources  to  influence  the  political  process,  and 
whether  some  groups  get  much  of  what  they  want 
much  of  the  time  while  others  get  hardly  anything 
at  all. 

Polyarchy  involves  a  permanent  process  of 
building  coalitions  among  different  groups  to  form 
majorities.  Each  majority  comprises  a  different 
configuration  of  groups.  This  would  lead  actors  to 
confine  themselves  to  the  achievement  of  bare 


majorities  rather  than  pursuing  the  consensus 
model  and  trying  to  build  as  large  a  majority  as 
possible. 

Functional  Democracy 

The  polyarchy  model  assumes  that  multifarious 
interests  will  organize  and  compete  with  each 
other  in  affecting  public  policies  in  their  favor, 
with  none  getting  all  of  what  it  wants.  There  does 
exist  a  rather  different  way  of  linking  interest 
groups  to  the  democratic  political  process:  that  of 
corporatism.  Although  corporatist  or  functional 
democracy  may  take  many  forms,  its  basic  idea  is 
to  incorporate  major  organized  interests  (usually 
each  represented  by  a  single  organizational  entity) 
into  the  policy-making  and  implementation  pro¬ 
cesses.  Corporatist  arrangements,  it  is  sometimes 
argued,  are  more  effective  in  ensuring  that  the 
interests  of  all  major  groups  are  represented  in 
politics.  Yet  a  number  of  questions  remain.  First, 
how  can  we  be  sure  that  all  major  interest  groups 
are  included  in  this  arrangement?  This  question 
becomes  all  the  more  important  since  any  corpo¬ 
ratist  system  would  tend  to  work  in  favor  of  those 
groups  that  have  existed  for  a  very  long  time  and 
at  the  expense  of  those  that  are  newly  emerging. 
Second,  we  may  ask,  are  the  interest  groups  that 
are  linked  with  the  government  run  democrati¬ 
cally  themselves?  Again,  it  is  important  to  know 
the  answer  not  only  because  there  may  exist  dif¬ 
ferent  interpretations  of  what  is  in  the  interest  of 
a  particular  group  or  what  its  interests  are,  but 
also  because  group  leaders  often  develop  an  inter¬ 
est  in  sustaining  themselves  in  office  at  the  expense 
of  the  groups  that  they  are  supposed  to  represent 
and  whose  interests  they  are  supposed  to  protect. 
Third,  what  concessions  do  the  leaders  of  orga¬ 
nized  interests  have  to  make  to  maintain  their 
favored  status  vis-a-vis  the  government?  Implicit 
in  the  question  is  the  possibility  that  the  leaders  of 
interest  groups  may  simply  ease  into  becoming 
agents  of  government  policy,  transmitting,  explain¬ 
ing,  defending,  and  even  helping  implement  it. 
Fourth,  what  effect  do  the  corporatist  arrange¬ 
ments  have  on  other  democratic  institutions?  For 
example,  do  these  arrangements  undermine 
respect  for  institutions  such  as  political  parties 
that  have  traditionally  served  as  a  channel  for 
managing  conflicting  interests?  These  questions 
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point  to  some  of  the  potential  shortcomings  of 
functional  democracy. 

Delegative  Democracy 

In  examining  how  democracies  functioned  in 
Latin  America,  Guillermo  O’Donnell  observed 
that  while  these  systems  met  the  criteria  for  being 
designated  “democratic,”  they  failed  to  display 
many  of  the  characteristics  of  liberal  representative 
democracies  because  they  operated  on  the  premise 
that  the  winner  of  the  national  election  acquired 
the  right  to  govern  as  he  or  she  sees  fit.  The  sys¬ 
tems  under  study  are  presidential  systems,  where 
apparently  the  president  feels  that  for  a  constitu¬ 
tionally  defined  period  of  time  between  elections, 
he  or  she  is  at  liberty  to  rule  mainly  according  to 
his  or  her  preferences,  only  constrained  by  the  dis¬ 
tribution  of  power  in  society.  O’Donnell  termed 
these  systems  delegative  democracies. 

In  systems  where  delegative  democracy  prevails, 
accountability  runs  vertically,  whereas  in  institu¬ 
tionalized  representative  democracies,  there  is,  in 
addition,  horizontal  accountability.  Vertical 
accountability  denotes  electoral  accountability. 
Horizontal  accountability,  on  the  other  hand, 
refers  to  the  presence  and  effectiveness  of  a  net¬ 
work  of  autonomous  institutions  that  exercise 
oversight  on  government  officials  to  ensure  that 
they  discharge  their  responsibilities  properly  or  be 
subject  to  sanctions.  Presidents  in  delegative 
democracies  treat  institutions  of  horizontal  respon¬ 
sibility  as  unwanted  impediments  to  the  perfor¬ 
mance  of  their  duties  as  the  custodian  of  national 
interest.  They  accept,  however,  the  existence  of 
rival  parties  and  a  reasonably  free  press  as  normal. 

Delegative  democracy  is  a  less  liberal  form  of 
democracy  than  a  fully  representative  liberal 
democracy.  It  is  strongly  majoritarian  with  elec¬ 
tions  that  give  powers  to  a  president  who  claims  to 
serve  as  the  “interpreter  of  the  high  interests  of  the 
nation.”  Although  they  are  not  institutionalized, 
delegative  democracies  may  be  enduring. 

Parliamentary  and  Presidential  Democracies 

When  one  focuses  on  the  institutional  arrange¬ 
ments  within  the  liberal  representative  democra¬ 
cies,  the  most  common  distinction  is  between 
parliamentary  democracies  and  presidential  ones. 


The  presidential  arrangement  was  the  product  of 
the  American  Revolution.  Not  having  a  fully  func¬ 
tioning  parliamentary  system  before  them  that 
they  could  choose  to  emulate,  the  leaders  of  the 
American  Revolution  converted  the  institution  of 
the  Absolute  Monarch  to  a  president  who  was 
elected  for  a  defined  period  of  time  and  with  lim¬ 
ited  powers.  Parliamentary  democracy,  on  the 
other  hand,  evolved  in  Britain  and  Continental 
Europe  as  a  result  of  the  rise  of  new  social  classes 
during  the  industrial  revolution  that  claimed  a 
share  in  ruling  society.  The  conversion  of  consulta¬ 
tive  mechanisms  among  the  king,  the  nobility,  and 
sometimes  the  church  into  parliaments  whose 
members  were  elected  through  a  competitive  pro¬ 
cess  was  achieved  often  through  a  combination  of 
violent  change  and  peaceful  transfer  of  power. 

Presidential  democracy  as  initially  developed  in 
the  United  States  was  part  of  a  constitutional  sys¬ 
tem  based  on  the  separation  of  powers  as  well  as 
building  checks  and  balances  between  the  three 
branches  of  government.  The  president  was 
elected  by  an  electoral  college  whose  members 
were  elected  explicitly  to  elect  a  president.  The 
members  of  the  legislature,  the  U.S.  Congress,  on 
the  other  hand,  were  elected  directly  through  gen¬ 
eral  elections.  The  president,  elected  indepen¬ 
dently,  did  not  need  to  maintain  the  confidence  of 
the  legislature  to  stay  in  power  and  served  a 
4-year  term.  Initially  by  convention,  much  later 
by  constitutional  requirement,  his  tenure  was  lim¬ 
ited  to  two  terms.  The  American  presidential 
system  became  more  democratic  over  time.  The 
adoption  of  a  Bill  of  Rights,  the  abolishment  of 
slavery  after  a  destructive  civil  war,  and  the  exten¬ 
sion  of  franchise  to  Blacks  and  later  to  women 
constituted  some  of  the  landmark  developments 
in  this  evolution. 

The  presidential  system  was  adopted  by  almost 
all  Latin  American  countries,  but  in  none  did  it 
function  as  democratically  as  in  the  United 
States.  In  many  instances,  the  presidential  system 
constituted  a  method  of  promoting  full  or  semi¬ 
dictatorships.  Many  Latin  American  presidential 
democracies  today  (not  all  are  necessarily  demo¬ 
cratic)  continue  to  exhibit  the  characteristics  of 
delegative  democracies. 

Parliamentary  democracy,  a  must  for  royal  sys¬ 
tems  in  which  the  office  of  the  chief  executive  and 
the  head  of  state  is  hereditary,  is  mainly  a  product 
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of  the  20th  century.  While  some  parliamentary 
systems  were  in  operation  intermittently  in  some 
European  countries  and  somewhat  more  regularly 
in  Britain,  they  were  neither  fully  democratic  nor 
consolidated.  Most  European  parliamentary 
democracies  did  not  extend  suffrage  to  women 
until  after  World  War  I.  Introduction  of  universal 
suffrage  destabilized  many  of  them,  leading  some, 
such  as  the  Weimar  Germany,  Spain,  Italy,  and 
Austrian  systems,  to  collapse  while  producing  deep 
political  polarization  in  such  countries  as  Belgium, 
France,  and  Finland. 

Parliamentary  democracy  is  characterized  by  a 
fusion  of  powers  in  that  the  government  is  generally 
formed  among  parties  in  the  parliament.  The  prime 
minister  is  almost  always  required  to  be  an  elected 
member  of  parliament,  while  other  ministers  are 
also  usually  deputies  even  if  that  is  not  always  a 
legal  requirement.  The  government  is  required  to 
enjoy  the  confidence  of  a  parliamentary  majority  to 
stay  in  office.  The  parliament,  on  the  other  hand, 
scrutinizes  the  government  by  questions,  inquiries, 
committee  hearings,  interpellations,  and  motions  of 
censure  or  no  confidence.  But,  not  unlike  the  presi¬ 
dential  variant,  parliamentary  democracies  exhibit 
significant  variety  among  themselves. 

Presidential  and  parliamentary  democracies 
differ  from  each  other  in  important  ways.  As 
already  said  in  part,  legislatures  may  not  dismiss  a 
president  (except  in  cases  of  corruption,  mental 
incompetence,  etc.),  while  parliaments,  although 
subject  to  some  limitations,  may  dismiss  a  govern¬ 
ment  by  a  vote  of  no  confidence.  In  parliamentary 
democracies,  governments  may  choose  the  timing 
of  the  elections,  while  election  dates  are  more 
likely  to  be  fixed  in  presidential  democracies. 
Almost  universally,  presidential  democracies  have 
term  limits  for  the  president  because  he  or  she 
wields  so  much  power.  One  is  more  likely  to  find 
multiparty  systems  in  parliamentary  than  in  presi¬ 
dential  democracies.  Although  it  has  been  sug¬ 
gested  that  presidential  democracies  enjoy  strong, 
personalized  leadership,  the  rise  of  party  leaders 
to  unequal  supremacy  in  parliamentary  democra¬ 
cies  has  also  produced  a  similar  type  of  leadership. 
Furthermore,  many  parliamentary  democracies 
have  evolved  in  the  direction  of  prime  ministerial 
government  where  the  former  many  appoint  and 
dismiss  ministers  pretty  much  as  if  he  or  she  were 
a  president. 


There  is  a  hybrid  institutional  arrangement  that 
also  deserves  brief  mention:  the  semipresidential 
democratic  system,  an  arrangement  that  is  most 
closely  associated  with  France.  In  these  systems,  the 
elections  of  the  president  and  the  members  of  par¬ 
liament  usually  take  place  in  coterminous  but  dis¬ 
tinct  elections.  The  timing  of  the  elections  is  fixed. 
In  the  French  case,  to  ensure  that  the  mandate  of 
the  president  is  backed  by  a  majority,  a  runoff  elec¬ 
tion  is  required  among  leading  contenders.  There  is 
a  division  of  responsibilities  between  the  president 
and  the  prime  minister,  not  very  precise,  regarding 
domains  in  which  they  are  to  be  active.  As  in  the 
case  of  presidential  democracies,  in  semipresiden¬ 
tial  systems,  the  majority  in  the  legislature  and  the 
partisan  affiliation  of  the  president  may  be  at  vari¬ 
ance,  necessitating  cohabitation. 

Critical  Approaches:  Democracy 
in  Theory  and  in  Practice 

The  diffusion  of  liberal  democratic  ideologies  and 
the  prevalence  of  democratic  systems  in  the  world 
have  produced  two  contradictory  outcomes.  On 
the  one  hand,  the  crumbling  of  nondemocratic 
systems  has  given  satisfaction  to  those  that  have 
been  ruled  by  democratic  regimes  as  well  as  a  sense 
of  confidence  that  democracies  have  outlasted 
what  appeared  to  be  powerful  authoritarian 
regimes.  On  the  other  hand,  democracies  have 
come  under  greater  scrutiny  as  scholars  have 
begun  to  take  a  closer  look  at  the  imperfections  of 
democracy  as  an  ideology  and  as  an  operating 
political  system. 

Criticism  of  Democracy  in  Operation 

Some  problematical  outcomes  of  democratic 
systems  in  operation  have  been  observed  and  criti¬ 
cized  by  different  observers.  One  relatively  fre¬ 
quently  made  observations  is  that  the  liberal  repre¬ 
sentative  model  has  evolved  into  party  democracy 
in  which  representatives  are  obliged  to  serve  the 
parties  under  whose  auspices  they  have  been 
elected;  therefore,  they  are  neither  able  to  express 
their  own  judgment  nor  represent  without  con¬ 
straints  the  views  of  their  own  constituency. 
Parties,  on  the  other  hand,  publicize  their  policy 
priorities  in  the  electoral  campaign  and  then  pro¬ 
ceed  to  implement  them  once  in  office.  Parties, 
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however,  are  in  fact  generally  directed  by  their  lead¬ 
ers.  In  this  way,  democracy  in  operation  has  become 
plebiscitary  leadership  democracy,  an  exercise  in 
which  the  voters  have  periodic  chances  to  express 
or  deny  confidence  in  the  political  leadership  that 
runs  the  government  (Held,  1996).  Furthermore, 
the  parties  in  power  may  represent  only  a  minority 
of  the  voters  and  may  use  their  being  in  office  as  a 
resource  for  extending  their  rule. 

Approaching  democracy  in  operation  from 
another  angle,  it  has  been  suggested  that  contem¬ 
porary  democracy  is  elitist  democracy  or  that  it  is 
characterized  by  democratic  elitism  because  key 
political  decisions  are  made  by  small  minorities. 
This  is  seen  as  a  necessity  deriving  from  the  com¬ 
plexities  of  decisions,  the  time  frame  within  which 
decisions  have  to  be  made,  and  the  voluminous 
topics  on  which  decisions  have  to  be  rendered.  In 
this  process,  the  role  of  the  voter  is  no  more  than 
choosing  among  rival  teams  of  elites.  Even  more 
problematical,  the  voters  are  often  not  well- 
informed  and  not  sufficiently  committed  to  the 
fundamental  values  of  democracy  such  that  too 
much  involvement  on  their  part  may  become  a 
source  of  instability,  a  threat  to  the  functioning  of 
the  democratic  system  (Peter  Bachrach,  1969). 

Radical  Democracy 

Radical  democracy  is  an  imprecise  term  that 
offers  criticism  of  the  contemporary  applications 
of  democracy  from  a  variety  of  perspectives.  Many 
persons  whose  names  are  associated  with  radical 
democracy  are  not  interested  in  the  replacement  of 
the  liberal  and  pluralistic  democracy  with  another 
type,  rather,  they  argue  that  it  may  be  deepened 
and  expanded  to  cover  many  areas  of  public  life. 
One  example  is  a  deliberate  effort  to  incorporate 
into  the  public  all  potential  voters  by  enfranchising 
them  so  as  to  curb  the  ability  of  majorities  to  use 
their  position  to  maintain  their  dominance  of  soci¬ 
ety.  Another  example  is  the  struggle  to  undo  the 
relations  of  subordination  deriving  from  differ¬ 
ences  in  gender,  race,  and  sexual  preference  in 
addition  to  those  deriving  from  the  political  econ¬ 
omy.  In  all  these  instances,  it  is  argued  that  the 
expansion  of  liberty  and  equality  to  all  aspects  of 
public  life  within  the  framework  of  political  liber¬ 
alism  will  enhance  democracy  and  protect  the 
rights  of  the  individual  against  the  tyranny  of  the 


majority.  To  conclude,  the  various  proponents  of 
radical  democracy  argue  that  contemporary  appli¬ 
cations  of  liberal  democracy  fail  in  being  suffi¬ 
ciently  inclusive  to  bring  in  all  segments  of  the 
population  into  the  political  process.  They  further 
argue  that  current  liberal  democracies  do  not  suf¬ 
ficiently  extend  liberal  democratic  applications  to 
all  areas  of  public  life. 

Deliberative  Democracy 

The  prevalence  of  liberal  representative  democ¬ 
racy  as  the  normative  basis  for  the  functioning 
democracies  in  the  world  today  has  led  some  ana¬ 
lysts  to  focus  on  some  of  the  problematical  aspects 
and  outcomes  and  proposals  to  remedy  them.  The 
best  known  school  of  thinking  that  has  come 
under  different  names  such  as  communicative  or 
discursive,  but  most  frequently  as  the  deliberative 
theory  of  democracy,  offers  a  normative  frame¬ 
work  for  addressing  some  of  the  problematical 
aspects  of  liberal  democratic  theory. 

Proponents  of  deliberative  democracy  point  out 
that  the  two  ideas  of  liberalism  and  democracy  that 
are  brought  together  in  liberal  democracy  in  fact 
harbor  conflicts.  While  liberalism  dwells  on  plural¬ 
ism,  individualism,  and  freedom,  critical  demo¬ 
cratic  principles  include  unity,  community,  and 
equality.  First,  there  is  a  conflict  between  the  liberal 
emphasis  on  pluralism  and  the  basic  need  for  the 
basic  social  and  political  unity  of  democratic  soci¬ 
ety.  Second,  while  individualism  is  ontological  to 
liberalism,  democracy  assumes  communitarianism. 
Finally,  freedom,  so  fundamental  to  liberal  think¬ 
ing,  and  the  principle  of  equality,  so  fundamental 
to  democratic  thought,  may  be  seen  as  being  pitted 
against  each  other. 

These  conflicts,  in  fact,  are  not  conciliated  in 
liberal  democracy  because  while  there  may  be 
argument  and  bargaining,  the  decision  is  formed 
by  voting,  which  is  an  aggregative  act,  not  one  of 
reasoning  in  which  all  members  of  the  community 
are  persuaded  by  argumentation  and  discussion. 
Therefore,  a  decision  that  emerges  as  a  result  of 
voting  does  not  reflect  a  consensus  among  all  citi¬ 
zens.  Deliberative  democracy  argues  that  free  and 
open  debate  among  equal  citizens  should  continue 
until  the  emergence  of  a  community  consensus.  In 
this  process  of  what  may  be  called  practical  rea¬ 
soning,  participants  offer  ideas  and  proposals  for 
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how  to  best  solve  problems  and  meet  the  needs 
and  so  on  of  the  community.  Each  presents  argu¬ 
ments  to  persuade  the  others  to  accept  his  or  her 
proposals,  eventually  leading  to  a  consensus.  In 
this  model,  democratic  process  is  primarily  a  dis¬ 
cussion  of  problems,  disagreements,  expressed 
needs,  and  interests  where,  through  dialogue,  each 
tests  and  challenges  the  proposals  and  arguments 
of  others. 

For  a  deliberative  democracy  to  operate,  citi¬ 
zens  must  share  or  demonstrate  a  commitment  to 
solve  their  collective  problems  through  public  rea¬ 
soning  and  accept  the  framework  for  public  delib¬ 
eration  as  legitimate.  Jurgen  Habermas  has  argued 
that  the  successful  operation  of  the  deliberative 
democratic  procedure  is  based  on  the  following 
postulates: 

1.  Deliberation  is  in  argumentative  form.  There  is 
a  regulated  exchange  of  information  among 
parties  who  introduce  and  critically  test 
proposals. 

2.  All  those  who  are  affected  by  the  decision  have 
an  opportunity  to  take  part  in  the  debate.  In 
principle,  no  one  may  be  excluded  from  the 
deliberations  that  are  public. 

3.  The  deliberations  are  free  of  external  and 
internal  coercion  that  undermine  the  equality  of 
participants. 

4.  The  deliberations  aim  to  achieve  rationally 
based  agreement.  They  may  be  discontinued  and 
resumed  any  time. 

5.  It  is  expected  that  the  predeliberation  attitudes 
and  preferences  of  parties  will  be  changed 
through  arguments. 

It  is  evident  that  deliberative  democracy  does 
not  offer  a  practical  alternative  to  the  operation 
of  contemporary  democracies.  Its  application  in 
societies  with  large  populations  where  the  gov¬ 
ernments  are  expected  to  address  many  prob¬ 
lems  simultaneously  is  not  a  realistic  possibility 
since  the  time  needed  to  implement  the  process 
would  be  unacceptably  long  and  near-full  con¬ 
sensus  might  be  impossible  to  attain  in  many 
situations.  Nevertheless,  the  proponents  of  delib¬ 
erative  democracy  point  to  some  problematical 
aspects — that  is,  contradictions  and  weaknesses 


in  contemporary  applications  of  liberal  democ¬ 
racy.  Reaching  decisions  by  majority  vote  may 
exclude  many  of  those  most  closely  affected  by  a 
decision;  majorities  and  minorities  may  display  a 
reasonable  permanent  character,  frustrating  those 
that  are  pushed  to  the  position  of  a  minority. 

Cosmopolitan  Democracy 

In  contrast  to  the  radical  and  deliberative  criti¬ 
cisms  of  contemporary  liberal  democracy  that 
emphasize  current  shortcomings,  the  proponents 
of  cosmopolitan  democracy  point  to  the  direction 
they  think  democracies  should  take  in  view  of  the 
changes  in  the  world.  Noting  that  democratic  sys¬ 
tems  get  transformed  and  become  redefined  as 
societies  evolve,  as  new  needs  appear,  and  as  some 
functions  and  institutions  no  longer  respond  to 
needs  and  become  redundant,  Daniel  Archibugi 
and  David  Held  (1995)  have  pointed  out  that 
democratic  systems  operate  under  the  highly  ques¬ 
tionable  assumption  that  a  government  exercises 
sovereignty  over  a  national  political  community 
that  inhabits  a  delineated  territory.  Yet  national, 
regional,  and  global  connectedness  belies  the  valid¬ 
ity  of  this  assumption.  Therefore,  there  is  a  need  to 
adjust  democratic  systems  and  applications  to 
global  change.  The  solution  may  be  the  develop¬ 
ment  of  a  multipronged  global  model  of  democracy 
where  innovations  such  as  regional  parliaments 
would  produce  decisions  that  would  be  recognized 
as  sources  of  law;  the  holding  of  referenda  across 
nation-states  on  issues  such  as  energy  policy,  trans¬ 
portation  systems,  and  so  on;  and  the  entrenchment 
of  a  body  of  individual  rights  with  a  political,  eco¬ 
nomic,  and  social  content.  It  may  be  noted,  in  this 
regard,  that  some  basis  has  already  evolved  for  the 
implementation  of  these  ideas  since  there  are 
already  various  products  of  regional  integration  in 
the  world  as  well  as  a  growing  acceptance  of  uni¬ 
versal  standards  for  the  observation  of  human 
rights.  These  may  well  mark  the  beginning  of  a 
process  that  will  take  us  to  a  democracy  that  tran¬ 
scends  the  borders  of  the  nation-states,  keeping  in 
mind,  however,  that  there  is  no  single  inevitable 
way  societies  respond  to  pressures  for  change. 

Social  Democracy  Versus  Liberal  Democracy 

The  discussion  of  the  types  of  democracy  has 
focused  so  far  on  institutional  arrangements.  Since 
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political  democracy  is  usually  thought  of  as  a 
political  regime,  this  is  natural.  The  content  of 
social  and  economic  policies  that  democracies 
have  pursued,  however,  has  also  constituted 
grounds  for  distinguishing  between  two  policy- 
based  types:  liberal  and  social  democracies.  As  a 
political  ideology,  liberal  democracy  and  social 
democracy  are  based  on  the  two  rival  premises  of 
equality  of  opportunity  and  equality  of  condi¬ 
tion,  respectively.  Liberal  democracy  argues  that 
the  market  constitutes  the  most  reliable  instru¬ 
ment  for  offering  equality  of  opportunity  to  all 
citizens,  leading  it  to  advocate  policies  that  rely 
on  the  market  as  the  mechanism  that  brings  solu¬ 
tions  to  social  and  economic  problems. 
Consequently,  it  favors  policies  that  promote  the 
undisturbed  operation  of  the  market,  such  as  not 
introducing  wage  and  price  controls,  maintaining 
low  taxes,  and  pursuing  free  trade.  It  expects  the 
individual  to  pay  fully  for  his  or  her  needs,  such 
as  housing  and  health  care.  Social  democracy,  on 
the  other  hand,  assuming  a  society  in  which 
members  consider  themselves  as  being  socially 
equal,  proposes  the  adoption  and  implementa¬ 
tion  of  policies  that  aim  to  enhance  the  equality 
of  conditions,  such  as  a  large  public  presence  in 
the  economy,  minimum  wages,  unemployment 
insurance,  social  security,  universal  health  ser¬ 
vice,  and  free  public  education,  as  well  as  other 
similar  functions.  Many  political  parties  that 
pursue  social-democratic  goals  and  policies  also 
bear  the  name  “Social  Democratic”  (in  capital 
letters). 

It  has  been  observed  that  originally  liberal 
democracy  evolved  in  societies  where  the  middle 
classes  were  united  and  coherent,  while  the  urban 
working  class  was  weakly  organized  and  divided. 
In  contrast,  social  democracy  emerged  in  areas 
where  the  middle  classes  were  fragmented  and 
lacked  coherence,  while  urban  working  classes 
were  capable  of  organizing  and  forcing  a  social 
compact.  As  some  of  the  policy  goals  initially 
sought  by  social  democracy  (e.g.,  social  security, 
minimum  wages,  unemployment  insurance) 
became  standard  policy  in  contemporary  demo¬ 
cratic  societies,  the  policy  content  of  both  liberal 
and  social  democracy  has  become  redefined,  and 
their  differences  have  become  less  clear. 
Nevertheless,  these  two  types  continue  to  consti¬ 
tute  one  of  the  fundamental  bases  of  political 


choice  on  which  electoral  competition  is  conducted 
in  most  societies  ruled  by  democracy. 
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governmental,  2:359 
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definition,  2:367 
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by  states,  2:367-369 
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Computational  models.  See  Models,  computational/ 
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Carlo  methods 
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Concept  formation,  2:370-382 
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diminished  subtypes,  2:378-379 
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instability,  2:374 
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Conceptual  lumping,  2:375 
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definition,  2:382 
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welfare  policies,  8:2745,  8:2751 
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3:809,  7:2203,  8:2417 
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Condorcet  efficiency,  3:754 
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8:2461 

Configurational  comparative  methods  (CCMs),  2:388-391 
applications,  2:389,  2:390 
assumptions,  2:388-389 
causality,  2:388-389 
debates  on,  2:390 
definition,  2:388 

further  developments,  2:390-391 
goals,  2:388-389 
techniques,  2:389-390 
See  also  Qualitative  comparative  analysis 
Confirmatory  factor  analysis  (CFA),  3:881,  3:882-884, 
5:1524,  8:2554 

Conflict  resolution,  2:391-394 

civil  wars,  1:268-269 
definition,  2:391 
between  democracies,  3:621 
mediation,  5:1545-1548 
participants,  2:393 
peaceful  means,  5:1545,  6:1759 
superpower  interventions,  2:396 
theoretical  approaches,  2:392-393 
third-party  roles,  6:1841 
Conflict  theory,  2:396,  6:2027-2028,  6:2029 
Conflicts,  2:394-399 
armed,  8:2723-2725,  8:2726-2729 
classifications,  2:394-395 
definition,  2:394,  8:2727 
dimensions,  2:3  95-396 
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nonstate,  8:2724,  8:2727 
origins,  2:394 

possible  outcomes,  2:391-392 
religious,  5:1351,  6:1720,  7:2262 
research,  2:397-398 
in  state  of  nature,  2:447 
strategies,  2:396,  2:397 
territorial,  8:2589,  8:2727,  8:2728 
theoretical  approaches,  2:396-397 
violent,  2:397-398 

See  also  Civil  wars;  Ethnic  conflicts;  Revolutions; 

Violence;  Wars 

Confounding  variables,  5:1581-1582,  5:1597, 
7:2181-2182,  8:2699,  8:2701 
Confucianism,  2:399-402 
capitalism  and,  1:192 
in  China,  1:163 

compatibility  with  democracy,  2:401 
good  government  principles,  2:399^401 
moral  education,  2:400 
political  statements,  6:2052 
view  of  politics,  ldiii 
Confucius,  2:399,  2:400,  2:401,  6:2051 
Congo  crisis,  8:2681 
Congress,  U.S. 

agenda-setting,  1:50 
apportionment,  3:751,  3:754 

delegation  to  bureaucracy,  2:548-550,  7:2128,  7:2203 
impeachment  process,  8:2405 
powers,  1:18-19,  8:2405 
Senate  Intelligence  Committee,  4:1057 
war  powers,  8:2406,  8:2407 
Congress  of  Vienna,  8:2450,  8:2677 
Conner,  Mark,  1:95 

Conscientious  objectors,  6:1721,  6:1758 
Consensus,  2:402-404 
definition,  2:402 

deliberative  policy  making,  2:551 

domain,  2:402,  2:404 

on  foreign  policy,  3:695 

in  international  organizations,  4:1253 
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political  applications,  2:404 
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See  also  Deliberative  democracy;  Election  by  lot; 
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Consensus  democracy,  2:402,  2:403,  2:555-556, 

2:564,  2:587 

See  also  Democracy,  consociational 
Consequentialism,  3:824 

Consequentialist  libertarianism,  5:1442,  5:1444-1445 
Consequentiality,  logic  of,  5:1475 
Conservatism,  2:405-410 
British  sources,  2:405-407 
classical,  2:407-408 
critique  of  liberalism,  2:405,  2:410 
definition,  2:405 
as  ideology,  2:405 
individualism,  4:1176-1177 


main  currents  of  thought,  2:407-410 
as  philosophy,  2:405 

See  also  Christian  Right;  Fundamentalisms; 
Neo-conservative  idealism 
Conservative  dictatorships,  3:656 
Conservative  parties,  2:410-413 
British,  2:406,  2:411-412,  6:1801,  6:1802 
definition,  2:410 
electoral  performance,  2:412 
in  Europe,  2:412-413 
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organization,  2:411-412,  6:1801 
political  principles,  2:410-411 
privatization  policies,  7:2133 
role  in  party  systems,  2:412 — 413,  5:1427 
social  bases,  2:412,  6:1794 
See  also  Christian  Democratic  parties 
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Consistency.  See  Attitude  consistency 
Consociational  democracy.  See  Democracy,  consociational 
Consolidation,  democratic.  See  Democratic  consolidation 
Constitution,  U.S. 

Bill  of  Rights,  2:419,  5:1430,  6:1720,  6:1721,  7:2319, 
7:2337 

Establishment  Clause,  1:236 

executive  branch,  3:864 

federalism,  3:896 

Fifth  Amendment,  7:2138 

First  Amendment,  1:236,  3:825,  6:1720,  6:1721 

goals,  6:1723 
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presidential  system,  7:2123 
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republic,  7:2289-2290 

separation  of  powers,  1:4,  1:6,  1:10,  1:17-19,  8:2403, 
8:2404,  8:2405-2407 
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constraints,  2:413,  2:414,  2:415 
definition,  2:413 
duration,  2:415 
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See  also  Separation  of  powers 
Constitutional  monarchy,  5:1613,  5:1614,  7:2290 
Constitutional  patriotism,  5:1649,  5:1657,  6:1833-1834 
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constraints  on  government,  6:1722-1723 
definition,  2:416 
democratic,  2:353 
goals,  2:416,  6:1722-1723 
identity  building,  5:1649 
international,  4:1243 
judiciary,  5:1383-1384 
legal,  2:416,  2:418-420,  4:1037 
legislative  voting  rules,  8:2718,  8:2719-2720 
liberalism  and,  5:1429,  5:1430,  5:1431,  5:1442 
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processes,  5:1649,  7:2340 
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rights,  2:419,  7:2337 
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in  consociational  democracies,  2:555 
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as  institutions,  4:1203 
Construct  validity,  5:1515-1517 
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Constructivism,  2:421-425 
contextual,  2:422 
definition,  2:421 
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on  ethics,  3:826-827 
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globalization,  4:976 
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international  law,  4:1241 
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methodologies,  3:798 
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sovereignty,  8:2471 
strict,  2:422 
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view  of  reality,  3:796-798,  6:1879 
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anarchic  world,  1:79,  4:1292 
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domestic  politics  and,  3:694-695 
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irredentism,  5:1348 
methodological  issues,  2:427-428 
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norms,  2:424-425 
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rise  of,  2:426 
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subdivisions,  4:1292 
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alliances,  2:432 
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during  Cold  War,  2:431-432,  2:433,  4:1280 
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objectives,  2:431 
origins  of  concept,  2:431-432 
in  post-Cold  War  period,  2:432-433 
of  rogue  states,  2:432 
Content  validity,  5:1514-1515 

Contentious  participation.  See  Participation,  contentious 
Contested  concepts,  2:379-380 
Continental  Congress,  8:2404-2405 
Contingency  theory,  2:433-436 
definition,  2:434 

explanations  of  organizational  performance,  2:434 
fits  and  misfits,  2:434-436 
task  uncertainty,  2:434 
variables,  2:434 

Continuous  variables,  8:2698-2699 
Contract  theory,  2:436-447 
consent  and,  6:1723,  6:1724 
contemporary,  2:442 
critiques  of,  2:445-446,  6:1723,  7:2334 
definition,  2:436 
gender,  2:443-444 
history,  2:438-442 
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modernity  and,  2:437,  2:438,  2:445,  2:446 
moral  dimension,  6:1723-1724 
necessity  of  contract,  2:442-443 
on  society,  2:437-438 
state  of  nature,  2:440^442,  2:444 
two  pacts,  2:445 
See  also  Social  contract 

Converse,  Philip,  1:145,  3:726,  5:1502,  6:1807,  6:2004 
Cooley,  Charles  Horton,  4:1132 
Coombs,  Clyde,  7:2355 
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areas,  2:447-448 
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complex  system  modeling,  2:365 
definition,  2:447 
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distinction  from  collaboration,  2:297 

game  theory  models,  2:448-449,  4:1251,  7:2130-2132 

international,  4:979,  4:981,  4:1012-1013,  7:2382-2383 

transaction  costs,  8:2648 
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definition,  2:449—450 
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by  executive,  2:449-457,  3:866,  4:1161,  7:2215 
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informal  governance,  4:1020 
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normative  theory,  6:2057-2058 
opposing  principles,  2:454-45 6 
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Cornell  Series  in  Political  Economy,  4:1262 
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Corporations.  See  Business  interest  groups;  Firms; 

Multinational  corporations 
Corporatism.  See  Corporativism 
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advocacy  activities,  1:31-32 
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Catholic  concepts,  2:461 
contemporary,  2:459,  5:1670-1673 
criticism  of,  2:458—459,  5:1668 
definition,  2:458 
Durkheim  on,  3:698-699 
functional  democracy,  2:589-590 
history,  2:459-461,  5:1668 
interest  intermediation,  6:1875 
medieval,  2:459—460 
model,  2:459 

modern  ideologies,  2:459,  2:461,  5:1485 
policy  formulation,  6:1920-1921 
regimes,  1:110,  4:1222,  6:2043 
secular,  2:461-462 
social  insurance,  8:2743 
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coefficients,  2:467-468 
contemporaneous,  6:1762 
definition,  2:463—464 
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Correspondence  analysis  (CA),  2:468-474 
definition,  2:468 
maps,  2:470-471,  2:473 
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Corruption,  2:474-480 
categories,  2:475-477 
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control  of,  6:1972-1973 
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cures,  2:479—480 
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electoral,  2:475,  2:478,  6:1804 
embezzlement,  2:476 
measures,  2:478,  7:2235 
in  natural  resources  management,  5:1667 
in  party  finance,  6:1804 
patronage,  1:283,  1:286 
patterns,  2:477—478 
political,  1:283,  1:286,  2:474-479 
rule  of  law  and,  7:2340-2341 
tax  evasion  and,  8:2586-2587 
See  also  Governance,  good 
Corruption,  administrative,  2:480-483 
causes,  2:481 
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definition,  2:480-481 

distinction  from  political  corruption,  2:480—481 
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Cortright,  David,  7:2353 
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Cosmopolitan  interventionists,  5:1438 
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of  foreign  aid,  3:912 
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delegation,  2:487-488 
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of  negotiated  settlements,  6:1841 
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Crimes  against  humanity,  4:968-969,  5:1336 
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See  also  Judicial  systems;  Law 
Crises,  2:489-494 
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constitutional  reforms,  2:415 
coordination  during,  2:495 
definition,  2:489,  2:494 
legitimacy,  3:623,  3:624 
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market  failures  and,  5:1491-1492 
natural  disasters,  3:668 
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Crisis  management,  2:494-497 
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cooperation,  7:2386 
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Democracy:  Chinese 
Perspectives 


As  a  universal  idea  today,  democracy  has  different 
meanings  in  various  countries.  The  Chinese  per¬ 
spectives  of  democracy  are  a  product  of  not  only 
Western  cultural  diffusion  but  also  ideas  that  were 
shaped  by  the  country’s  history  and  political  real¬ 
ity.  Indeed,  the  vision  of  the  rule  by  the  people 
came  to  China  from  abroad.  However,  the  thought 
of  the  rule  for  the  people  (minben)  has  deep  his¬ 
torical  roots  and  now  takes  a  central  position  in 
Chinese  official  ideology.  As  the  Communist  Party 
of  China  (CPC)  is  defined  as  the  only  ruling  party 
constitutionally,  multiparty  competition  is  beyond 
the  imagination  of  most  Chinese  due  to  their  value 
orientation  and  political  calculation.  In  other 
words,  Chinese  history,  ideological  context,  and 
constitutional  framework  as  well  as  Western  cul¬ 
tural  influences  have  set  up  parameters  for  the 
Chinese  perspectives  of  democracy.  In  the  follow¬ 
ing,  both  the  official  view  of  Chinese  democracy 
and  some  contending  voices  concerning  other  and 
more  recent  positions  are  examined. 

Socialist  Democracy  With  Chinese 

Characteristics:  The  Official  View 

China  officially  claims  to  unswervingly  develop 
socialist  democracy  as  an  indispensable  goal  of 
socialism  with  Chinese  characteristics.  According 
to  the  official  doctrine,  the  Chinese  socialist  democ¬ 
racy  must  integrate  “the  leadership  of  the 
Communist  Party,  the  position  of  the  people  as 
masters  of  the  country  (renmin  dangjia  zuozhu), 
and  the  rule  by  law  ( yifa  zhiguo).”  Such  a  trinity, 
reflecting  an  ideological  mix  of  guardianship  and 
populism,  demonstrates  the  basic  notion  and  prin¬ 
ciple  of  democracy  in  the  Chinese  official  view. 
From  the  perspective  of  the  Chinese  authorities,  the 
ideal  type  of  democracy  is  the  socialist  democracy 
with  Chinese  characteristics.  Its  basic  form  is  peo¬ 
ple’s  democratic  dictatorship,  a  phrase  incorpo¬ 
rated  into  the  Constitution  of  the  People’s  Republic 
of  China  (PRC)  by  Mao  Zedong,  the  founder  of  the 
republic.  Such  a  phrase,  derived  from  the  dictator¬ 
ship  of  the  proletariat,  is  notable  for  being  one  of 
the  few  cases  in  which  the  term  dictatorship  is  used 
in  a  nonpejorative  manner.  The  premise  of  the 


people’s  democratic  dictatorship  is  that  the  party 
and  state  democratically  represent  and  act  on 
behalf  of  the  people  but  possess  and  may  use  dicta¬ 
torial  powers  against  reactionary  forces. 

Such  a  form  of  democracy  means  to  ensure 
democratic  rights  for  people  while  exercising 
dictatorship  in  relationship  to  people’s  internal 
enemies.  Implicit  in  the  concept  of  the  people’s 
democratic  dictatorship  is  the  notion  that  dicta¬ 
torial  means  are  a  necessary  evil  and  that,  with¬ 
out  a  dictatorship,  the  government  may  collapse 
and  create  a  situation  that  is  worse  than  the 
dictatorship.  In  the  post-Mao  era,  however,  “dic¬ 
tatorship”  has  been  mentioned  more  rarely,  and 
by  contrast,  “socialist  democracy”  has  gained 
more  prominence.  As  the  former  Chinese  leader 
Jiang  Zemin  argued,  the  essence  of  people’s  demo¬ 
cratic  dictatorship  is  people’s  democracy.  The  17th 
CPC  National  Congress  in  2007  proclaimed  that 
people’s  democracy  is  the  lifeblood  of  socialism. 
Etymologically,  democracy  means  people’s  rule. 
By  using  the  tautology  “people’s  democracy,”  the 
CPC  attempts  to  conceptualize  socialist  democracy 
and  distinguish  it  from  liberal  democracy,  which  is 
usually  regarded  as  Western  democracy  or  capital¬ 
ist  democracy.  According  to  the  Chinese  official 
doctrine,  capitalist  democracy  or  bourgeois  democ¬ 
racy  is  only  a  democracy  for  a  few  people — the 
bourgeoisie,  while  the  people’s  democracy  or 
socialist  democracy  means  a  democracy  for  the 
overwhelming  maj  ority  of  the  people.  Theoretically, 
the  socialist  democracy  with  Chinese  characteris¬ 
tics  aims  to  develop  the  people’s  democracy  to  a 
high  degree;  as  Party  General  Secretary  Hu  Jintao 
put  it,  the  party  has  been  consistently  pursuing  the 
goal  of  developing  socialist  democracy,  and  the 
essence  and  core  of  socialist  democracy  are  that 
the  people  are  masters  of  the  country. 

The  “people’s  democracy”  in  China,  therefore, 
carries  a  strong  flavor  of  direct  democracy  as 
opposed  to  liberal  democracy.  At  least,  democracy 
in  the  Chinese  context  is,  to  some  extent,  a  mixture 
of  direct  democracy  and  representative  democracy. 
According  to  the  Chinese  official  proclamation, 
the  CPC  must  ensure  that  all  power  of  the  state 
belongs  to  the  people,  expand  the  orderly  partici¬ 
pation  of  citizens  in  political  affairs  at  each  level 
and  in  every  field,  and  mobilize  and  organize  the 
people  as  extensively  as  possible  to  manage  state 
and  social  affairs  as  well  as  economic  and  cultural 
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programs.  In  the  official  conception  of  people’s 
democracy,  democratic  election,  decision  making, 
administration,  and  oversight  are  the  four  important 
elements  featured  by  the  spirit  of  direct  democracy, 
whereas  the  ideas  of  democratic  decision  making 
and  democratic  administration  both  presume  that 
people  are  capable  of  getting  involved  in  public 
administration  and  policy  making  directly  without 
relying  on  their  representatives.  This  ideological 
inclination  is  consistent  with  the  populist  view  of 
politics,  which  assumes  that  each  person  has  an 
equal  say  on  all  policy  issues  at  all  times.  Actually, 
for  most  Chinese,  including  many  leaders,  democ¬ 
racy  means  people’s  rule,  government  by  a  mass  of 
people,  “the  position  of  the  people  as  masters  of  the 
country,”  or  even  mob  rule  and  anarchism. 

By  contrast,  the  idea  of  socialist  democracy  with 
Chinese  characteristics,  or  people’s  democracy, 
carries  a  legacy  of  people’s  democratic  dictator¬ 
ship.  Although  the  Chinese  leadership  has  down¬ 
played  the  concepts  of  class  struggle  and  social 
revolution  since  China’s  reforms  and  opening  up 
in  1978,  it  still  maintains  concepts  such  as  “antag¬ 
onistic  forces”  ( didui  shili)  and  “contradiction 
between  people  and  its  enemy”  (diivo  maodun )  in 
the  official  language.  Any  hostile  forces  are  subject 
to  people’s  dictatorship,  without  the  freedom  of 
enjoying  people’s  democracy.  A  truly  liberal 
democracy  is  applicable  to  all  citizens  (except  for 
criminals)  in  a  community. 

Systems 

To  support  people’s  democracy,  the  party  and 
state  have  established  a  series  of  political  systems, 
including  the  system  of  people’s  congresses,  the 
system  of  multiparty  cooperation  and  political  con¬ 
sultation  under  the  leadership  of  the  CPC,  the  sys¬ 
tem  of  regional  ethnic  autonomy,  and  the  system  of 
self-governance  at  the  primary  level  of  society. 

The  system  of  people’s  congresses,  as  China’s 
fundamental  political  system,  is  an  organizational 
form  of  the  state  power  in  China.  According  to 
China’s  constitution,  the  power  in  the  People’s 
Republic  of  China  belongs  to  the  people,  and  the 
organs  for  the  people  to  exercise  state  power  are 
the  National  People’s  Congress  (NPC)  and  local 
people’s  congresses  at  all  levels.  The  NPC  and  local 
people’s  congresses  are  established  through  elec¬ 
tions,  and  they  are  responsible  to  and  supervised 


by  the  people.  State  executive  and  judicial  organs 
at  the  different  levels  are  created  by,  responsible 
to,  and  supervised  by  the  people’s  congresses.  The 
NPC  is  the  organ  with  supreme  state  power,  and 
local  people’s  congresses  are  local  organs  of  state 
power.  The  formal  powers  of  the  people’s  con¬ 
gresses  stipulated  by  the  constitution,  however, 
have  been  neutralized  by  the  leading  role  of  the 
CPC,  which  is  defined  as  the  only  ruling  party  by 
the  constitution  and  operates  under  the  principle 
of  democratic  centralism.  Despite  the  growing 
assertions  of  the  NPC  and  its  standing  committee 
vis-a-vis  the  State  Council,  the  Supreme  People’s 
Court,  and  the  Supreme  People’s  Procuratorate, 
the  NPC’s  supreme  powers  in  elections,  legislation, 
and  supervision  are  constrained  by  the  Party’s  dis¬ 
cipline  and  ideology. 

Historically,  the  NPC  was  designed  according 
to  the  Marxist-Leninist  principle  of  “combining 
legislative  and  executive  into  one  organ”  (yixing 
heyi),  in  order  to  ensure  power  concentration  and 
political  efficiency.  According  to  Marx,  the  prin¬ 
ciple  of  checks  and  balances  would  leave  govern¬ 
mental  branches  outside  the  direct  control  of  the 
electorate;  all  levels  of  government  therefore  should 
be  a  working  body  fully  accountable  to  the  people. 
Informed  by  Marx’s  idea,  Lenin  adopted  the 
model  of  the  Paris  Commune  and  created  the 
Soviet  regime  in  Russia.  To  transform  an  old  bour¬ 
geois  parliament — “a  talkative  organ”  in  Lenin’s 
words — into  a  proletarian  working  body,  Lenin 
proposed  that  every  member  of  the  Soviet  should 
take  both  legislative  and  executive  jobs.  Following 
the  Leninist  principle  of  democratic  centralism  and 
the  Soviet  model  of  political  regime,  the  Chinese 
ruling  elite  denigrated  the  liberal  proposition  of 
separation  of  powers  from  the  very  beginning.  For 
the  ruling  elite,  institutional  checks  and  balances 
among  legislative,  executive,  and  judicial  branches 
would  lead  to  a  divided  sovereignty  and  therefore 
reduce  political  efficiency.  These  are  at  odds  not 
only  with  the  ruling  party’s  belief  in  democratic 
centralism  but  also  with  the  traditional  vision  of 
the  rule  by  virtue  ( dezbi ),  which  prescribes  a  uni¬ 
tary  and  harmonious  government  as  an  ideal  goal. 

The  multiparty  cooperation  and  political  con¬ 
sultation  under  the  leadership  of  the  CPC  is  a  basic 
political  system  in  China.  Besides  the  CPC,  there 
are  eight  minor  “democratic  parties”  ( minzhu  dan- 
gpai)  in  China,  which  are  neither  the  ruling  parties 
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nor  opposition  parties,  because  they  do  not  chal¬ 
lenge  the  leadership  of  the  CPC  and  demand  for 
competitive  elections  in  China.  Rather,  they  are 
called  “participatory  parties”  (canzheng  dang) 
mainly  through  the  mechanism  of  political  consul¬ 
tation,  particularly  via  the  institution  of  the  Chinese 
People’s  Political  Consultative  Conference 
(CPPCC).  The  political  powers  and  rights  enjoyed 
by  the  “democratic  parties”  are  delegated  or 
endowed  by  the  ruling  party,  rather  than  being 
based  on  their  own  electorate.  Nevertheless,  the 
people’s  congresses,  the  CPPCC,  and  government 
agencies  at  all  levels  have  members  from  the  “dem¬ 
ocratic  parties”  as  part  of  their  leaders. 
Occasionally,  some  members  of  “democratic  par¬ 
ties”  can  serve  as  ministers  in  the  State  Council. 

In  addition  to  the  system  of  people’s  congresses 
and  the  system  of  multiparty  cooperation  and 
political  consultation,  self-governance  in  some  eth¬ 
nic  areas  and  grassroots  communities  normatively 
displays  one  of  the  features  of  the  Chinese  socialist 
democracy,  which  are  called  the  system  of  regional 
ethnic  autonomy  and  the  system  of  self-governance 
at  the  primary  level  of  society.  Empirically,  self- 
governance  in  Chinese  villages  has  developed  rap¬ 
idly  in  the  past  years,  and  especially,  competitive 
elections  for  village  committees  have  been  spread¬ 
ing  all  over  the  country. 

Conceptualizing  a  Chinese  Model  of 

Democracy:  Academic  Exploration 

Although  the  Chinese  official  view  of  democracy 
illustrates  an  outline  of  the  pattern  and  framework 
for  democracy  and  democratization  in  contempo¬ 
rary  China,  it  is  far  from  a  sophisticated  theory. 
Among  others,  one  of  the  vague  issues  is  what  the 
“Chinese  characteristics”  mean  within  the  socialist 
democracy  advocated  by  the  Chinese  authorities. 
In  other  words,  what  model  could  China  develop 
for  democracy?  With  reference  to  such  a  question, 
Chinese  intellectuals  have  different  viewpoints 
under  the  umbrella  of  socialist  democracy  or 
beyond,  in  search  of  an  ideal  or  possible  demo¬ 
cratic  model  for  the  future  China. 

Direct  Democracy/Autonomous  Democracy 

The  socialist  democracy  or  people’s  democracy 
in  China,  as  mentioned  above,  has  an  orientation 


of  direct  democracy.  Accordingly,  the  official  doc¬ 
trine  insists  on  democratic  elections,  democratic 
decision  making,  democratic  administration,  and 
democratic  oversight,  which  constitute  the  basic 
framework  of  Chinese  orthodoxy  on  democracy 
with  properties  of  autonomy  or  direct  democracy. 
Furthermore,  grassroots  autonomy  in  Chinese  vil¬ 
lages  has  developed  rapidly  since  the  1990s,  giving 
an  empirical  support  to  such  orthodoxy.  Resulting 
from  this  development,  autonomous  democracy  is 
regarded  as  one  major  option  of  the  Chinese  dem¬ 
ocratic  model. 

From  a  perspective  of  the  state-society  dichot¬ 
omy,  some  scholars  write  that  the  political  system 
can  be  divided  into  state  power  and  grassroots 
communities.  Accordingly,  the  state  and  grass¬ 
roots  levels  of  democracy  can  also  be  distinguished; 
at  the  state  level,  indirect  or  representative  democ¬ 
racy  is  employed,  and  at  the  grassroots  level,  direct 
or  autonomous  democracy  is  practiced.  In  China, 
grassroots  democracy  includes  rural  autonomy  in 
villages,  urban  autonomy  in  communities,  demo¬ 
cratic  administration  in  the  enterprises,  and  urban 
and  rural  civic  participation  in  public  affairs.  What 
is  more,  some  scholars  believe  that  villagers’ 
autonomy  would  be  a  breakthrough  or  growing 
point  for  China’s  democratization.  To  enhance 
autonomous  democracy,  some  scholars  suggest 
expanding  the  scope  of  grassroots  democracy  from 
villages  and  urban  neighborhood  communities  up 
to  the  township  or  higher  levels. 

Besides  the  grassroots  democratic  autonomy  in 
the  rural  and  urban  communities,  some  other 
kinds  of  direct  democracy  have  also  emerged  in 
recent  years.  Some  leaders  and  scholars  emphasize 
the  importance  of  citizens’  direct  participation  in 
public  affairs  and  policy  making  through  public 
hearings,  participatory  budgeting  (open  discussion 
on  local  public  budgets),  Internet  discussions,  and 
so  on.  In  particular,  cyber  democracy  has  devel¬ 
oped  quite  rapidly  and  is  more  prevalent  compared 
with  other  forms  of  democracy  in  China,  even 
exceeding  its  development  in  Western  democracies 
to  some  extent.  This  is  mainly  because  in  China 
there  are  hardly  any  other  effective  channels  for 
citizens  to  articulate  their  views  except  for  the 
Internet.  More  and  more  Chinese  people  prefer  to 
use  the  Internet  as  a  means  to  express  their  opin¬ 
ions  on  public  affairs,  to  influence  public  policy, 
and  to  oversee  public  officials.  With  the  greatest 
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population  of  netizens  in  the  world,  China’s  cyber 
democracy  is  developing  ahead  of  representative 
democracy  and  is  welcomed  by  most  Chinese 
people,  who  regard  it  as  a  new  form  of  socialist 
democracy.  Although  for  most  scholars,  cyber 
democracy  does  not  necessarily  challenge  the 
prudence  of  representative  democracy,  some  intel¬ 
lectuals  consider  it  to  be  an  alternative  to  represen¬ 
tative  democracy. 

Deliberative  Democracy/Consultative  Democracy 

In  today’s  China,  various  ideas  of  democracy, 
including  direct  democracy,  autonomous  democ¬ 
racy,  participatory  democracy,  and  cyber  democ¬ 
racy,  are  becoming  increasingly  influential 
compared  with  liberal  democracy.  In  addition, 
deliberative  democracy  has  been  regarded  as  a 
possible  model  for  China’s  democracy  by  some 
officials  and  scholars.  The  concept  of  deliberative 
democracy  was  created  in  Western  society. 
Ironically,  deliberative  democracy  serves  as  a 
supplementary  mechanism  accompanying  repre¬ 
sentative  democracy  in  the  West,  but  it  is  consid¬ 
ered  by  some  Chinese  scholars  and  government 
officials  as  an  alternative  to  electoral  competition. 

The  popularity  of  deliberative  democracy  in 
China  is  partly  attributable  to  the  Chinese  advo¬ 
cacy  of  political  consultations  between  the  ruling 
party  and  other  political  actors,  which  bears  a 
resemblance  to  the  idea  of  Western  deliberative 
democracy.  According  to  the  system  of  multiparty 
cooperation  and  political  consultation  under  the 
leadership  of  the  CPC,  the  CPPCC  has  been  estab¬ 
lished  at  both  central  and  local  levels.  Although 
the  Chinese  practice  of  political  consultation  has  a 
different  implication  from  the  Western  idea  of 
democratic  deliberation,  in  Chinese,  the  term  con¬ 
sultation  ( xiesbang )  shares  the  same  characters  as 
deliberation.  Consequently,  consultative  democ¬ 
racy  in  Chinese  ( xiesbang  minzhu)  becomes  an 
equivalent  for  deliberative  democracy  in  English, 
thus  blurring  the  nuance  between  the  two  key 
words.  Within  this  linguistic  context,  some  Chinese 
scholars  further  argue  that  Western  democracy  is 
traditionally  characterized  by  electoral  democracy, 
whereas  Chinese  democracy  is  traditionally  char¬ 
acterized  by  deliberative  democracy  (actually  con¬ 
sultative  democracy)  since  China  has  already 
established  the  CPPCC  system,  which  symbolizes 


the  characteristics  of  Chinese  democracy  or  the 
Chinese  model  of  democracy. 

Other  scholars  assert  that  deliberative  democ¬ 
racy  and  liberal  democracy  can  support  each 
other.  This  perspective  is  endorsed  by  the  White 
Paper  on  China’s  Party  System,  issued  by  the 
Chinese  government  in  2007,  which  recognizes 
the  importance  of  both  electoral  democracy  and 
deliberative  democracy.  Furthermore,  some 
scholars  consider  the  two  main  Chinese  political 
institutions — people’s  congress  and  people’s 
political  consultative  conference — as  examples  of 
electoral  democracy  and  deliberative  democracy, 
respectively. 

It  is  worth  pointing  out  the  significant  differ¬ 
ence  between  political  consultation  in  China’s 
context  and  deliberative  democracy  in  Western 
backgrounds.  However,  there  are  also  eye-catch¬ 
ing  experiments  on  the  very  sense  of  deliberative 
democracy  in  a  few  Chinese  grassroots  communi¬ 
ties,  such  as  the  democratic  discussion  meetings  in 
Wenling,  Zhejiang  Province.  Other  forms  of  com¬ 
munity  discussion  councils  and  public  forums  have 
been  created  in  a  few  localities  thereafter. 

Inner-Party  Democracy 

Compared  with  other  proposed  models,  inner- 
party  democracy  is  a  widely  recognized  scheme 
for  China’s  potential  democratic  model.  Some 
scholars  believe  that  China  should  first  develop 
inner-party  democracy  since  the  CPC  is  the  only 
ruling  party  normatively  and  empirically.  Inner- 
party  democracy  is  a  legitimate  tenet  that  has 
been  accepted  by  the  ruling  party  from  the  very 
beginning,  and  is  characterized  by  criticism  and 
self-criticism  and  a  democratic  lifestyle  within  the 
party.  Since  the  1990s,  it  has  been  reinvented  by 
Chinese  scholars  as  a  strategy  for  China’s  democ¬ 
ratization,  while  searching  to  democratize  in  the 
existing  political  system  itself  with  one-party  lead¬ 
ership.  The  logical  sequence  for  political  democra¬ 
tization  in  China,  accordingly,  is  from  the  CPC  to 
the  outside  of  the  ruling  party,  from  the  elite  to 
the  masses,  and  from  intrasystem  to  extrasystem 
domains.  This  strategy  was  acknowledged  offi¬ 
cially  when  the  party  leader  Zemin  proclaimed  to 
“promote  people’s  democracy  actively  through 
the  development  of  inner-party  democracy”  in 
2001  at  the  80th  CPC  anniversary  gathering. 
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For  some  scholars,  the  basic  principle  for  inner- 
party  democracy  is  to  handle  correctly  the  rela¬ 
tionships  among  the  CPC’s  National  Congress,  the 
Central  Committee  and  its  Politburo,  and  Politburo 
Standing  Committee  in  order  to  allow  the  party’s 
National  Congress  and  the  Central  Committee  to 
perform  their  basic  roles.  The  key  approach  to 
achieve  this  goal  is  to  improve  the  electoral  func¬ 
tion  of  the  National  Congress  and  Central 
Committee  by  encouraging  multicandidate  elec¬ 
tions  (cba’e  xuanju),  which  will  create  an  institu¬ 
tionalized  channel  of  interest  aggregation  for  the 
party  elites.  These  proposals  have  been  more  or 
less  endorsed  by  the  party  since  the  beginning  of 
the  21st  century.  Major  measures  being  adopted  or 
that  are  under  consideration  include  improving  the 
function  of  party  congresses,  expanding  power 
sharing  among  party  committee  members,  increas¬ 
ing  information  sharing  among  party  members, 
developing  a  division  of  power  within  the  Party, 
and  allowing  limited  inner-party  electoral  competi¬ 
tion.  The  CPC  first  introduced  multicandidate 
elections  in  the  13th  Party  Congress  in  1987  and 
expanded  electoral  competition  in  the  17th  Party 
Congress  in  2007.  Such  a  reform  program  could 
possibly  become  the  potential  germination  point  of 
democratic  transition  and  competitive  elections  in 
China. 

Inner-party  democracy,  therefore,  does  not 
challenge  the  wisdom  of  representative  democracy 
and  electoral  competition;  rather,  it  tries  to  accom¬ 
modate  the  plurality  of  social  interests  and  the 
spirit  of  free  competition  by  continuously  expand¬ 
ing  the  scope  of  democratic  elections  within  the 
party  and  finding  a  way  to  reconcile  the  model  of 
“one-party  dominance”  with  the  model  of  “one- 
party  pluralism”  derived  from  the  experiences  of 
East  Asian  political  development.  In  other  words, 
supporters  of  inner-party  democracy  believe  that 
in  searching  for  a  democratic  model  with  Chinese 
characteristics,  they  can  take  a  third  path — an 
institutional  compromise  between  single-party 
authoritarianism  and  multiparty  democracy.  In 
the  opinion  of  some  scholars,  the  strategy  of 
building  inner-party  democracy  or  following  the 
model  of  one  dominant  party  with  pluralism 
within,  from  a  long-term  perspective,  may  result 
in  a  chain  reaction  in  the  Chinese  democratization 
process  and  would  be  an  alternative  to  a  multi¬ 
party  system. 


Democracy  in  Controversies: 

Liberal  Democracy  and  Beyond 

Though  there  is  no  lack  of  advocates  of  liberal 
(Western  style)  democracy  in  Chinese  intellectual 
circles,  more  and  more  criticisms  of  liberal  democ¬ 
racy  have  emerged  in  contemporary  China,  which 
has  even  been  conceived  as  a  wave  of  antidemo¬ 
cratic  ideology  in  a  general  sense.  Debates  on 
democratic  theories,  in  general,  and  liberal  democ¬ 
racy,  in  particular,  concern  whether  democracy  is 
a  good  thing  for  China  and  whether  China  can 
find  an  alternative  (such  as  deliberative  democ¬ 
racy)  to  liberal  democracy  featured  by  electoral 
politics. 

Democracy:  A  Good  or  a  Bad  Thing ? 

Supporters  of  democracy  believe  that  democ¬ 
racy  is  a  good  thing  for  China.  Some  scholars  con¬ 
tend  that  democracy  is  a  universal  truth,  observing 
that  a  string  of  antidemocratic  ideas,  combined 
with  the  old  thesis  of  “China  uniqueness,”  is 
gradually  developing  and  has  severely  blocked 
China’s  movement  toward  a  socialist  democracy. 
One  of  the  influential  arguments  is  that  only  demo¬ 
cratic  socialism  can  save  China,  and  only  demo¬ 
cratic  constitutionalism  can  fundamentally  resolve 
the  problem  of  political  corruption  within  the 
government.  Some  scholars  are  more  cautious  in 
supporting  democracy,  proclaiming  that  democ¬ 
racy  is  neither  a  bad  nor  a  good  thing,  but  then  is 
also  the  worst  form  of  government  except  for  all 
those  other  forms  that  have  been  tried,  as  Winston 
Churchill  once  said,  or  proclaiming  that  what 
China  needs  is  a  good  democracy,  not  a  bad  one, 
paying  more  attention  to  substantial  democracy 
rather  than  procedural  democracy. 

Opponents  of  democracy,  by  contrast,  believe 
that  democracy  will  do  harm  to  China.  Some 
argue  that  political  order  and  social  harmony  are 
more  important  than  other  values  and  what  China 
needs  is  not  democracy  but  a  limited  government 
of  checks  and  balances  with  the  rule  of  law. 
Presupposing  grassroots  mobilization,  popular 
elections,  majority  rule,  and  power  politics, 
democracy  would  result  in  the  “tyranny  of  the 
majority”  in  China’s  context.  Some  even  bluntly 
declare  that  democracy  is  a  bad  thing  and  that 
democratization  is  the  worst  choice  and  would 
cause  social  disturbance  and  disaster  in  China 
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rather  than  resolving  the  problems  of  political  cor¬ 
ruption  and  social  inequality.  Some  scholars  also 
dismiss  the  idea  that  democracy  is  a  universal 
truth  that  can  avoid  the  worst  features  of  political 
life. 

Arguments  Against  Liberal  Democracy 

Debates  on  general  democratic  theories  in 
China  focus  on  the  value  of  liberal  democracy,  the 
mainstream  of  Western  democratic  theories. 
Liberal  democracy  is  not  the  mainstream  in  Chinese 
academic  discourse.  On  the  contrary,  various  criti¬ 
cisms  of  liberal  democracy  and  electoral  competi¬ 
tion  are  popular,  particularly  in  official  media.  For 
some  Chinese  new-Leftists,  furthermore,  the  ideal 
type  of  government  is  demarchy  or  lottocracy,  a 
political  system  run  by  randomly  selected  decision 
makers  who  have  been  selected  by  lot.  They  argue 
that  economic  equality  (rather  than  freedom)  is  the 
precondition  for  political  equality,  and  representa¬ 
tive  democracy  through  the  mechanism  of  free 
elections  cannot  guarantee  an  equal  say  for  all 
citizens  in  public  affairs.  Unsatisfied  with  Western 
democracy,  one  scholar  argues  that  once  democ¬ 
racy  is  defined  by  any  other  concept,  such  as  lib¬ 
eral  democracy  or  representative  democracy,  it 
will  lose  its  true  meaning  and  value.  The  core  value 
of  pure  or  direct  democracy  is  equality;  hence 
democracy  can  be  better  guaranteed  through  the 
mechanism  of  selecting  leaders  and  making  policy 
by  casting  lots,  in  addition  to  enhancing  political 
consultation,  cyber  participation,  and  working- 
place  democracy.  One  can  find  a  link  between  such 
an  idea  and  Athenian  democratic  practice.  A  com¬ 
mon  thread  running  through  these  models  is  a 
strong  commitment  to  egalitarianism,  even  at  the 
cost  of  liberalism. 

From  the  perspective  of  liberal  democracy, 
open  and  fair  elections  are  the  bottom  line  of 
democracy,  though  it  may  be  insufficient  or 
unachievable  without  the  support  of  other  favor¬ 
able  conditions.  In  the  Chinese  context,  however, 
many  perceive  competitive  election  as  either  a  bad 
thing  or  as  being  irrelevant  for  democratic  devel¬ 
opment.  The  new  Leftists  are  not  satisfied  with 
liberal  democracy  because  they  want  people’s  rule 
or  direct  democracy;  meanwhile,  the  conservatives 
argue  that  competitive  elections  will  bring  about 
political  chaos,  money  politics,  social  discord,  and 


so  on.  For  all  the  opponents  of  liberal  democracy, 
the  best  solution  for  China’s  dilemma  regarding 
democracy  is  to  develop  a  democracy  without 
elections. 

Debates  on  Deliberative  Democracy 

Contrary  to  liberal  democracy,  deliberative 
democracy  has  gained  more  popularity  in  China 
than  in  Western  democracies,  and  deliberative 
democracy  in  China  is  usually  linked  to  multi¬ 
party  cooperation  and  political  consultation  under 
the  leadership  of  the  CPC.  As  some  scholars  point 
out,  this  is  a  misreading  or  misunderstanding  of 
deliberative  democracy  in  its  original  meaning.  As 
for  the  scholars  who  know  well  the  theory  of 
deliberative  democracy,  some  forms  of  delibera¬ 
tive  democracy,  such  as  democratic  discussion 
forums  practiced  in  local  politics  and  public  hear¬ 
ings,  are  also  heralded  by  them.  In  their  opinion, 
institutional  deliberation,  as  one  form  of  democ¬ 
racy,  can  improve  the  quality  of  China’s  local 
politics  and  bring  China  to  democracy.  Some 
scholars  expand  the  connotation  of  deliberative 
democracy  to  include  other  elements  of  democ¬ 
racy,  such  as  elections,  checks  and  balances  of 
powers,  rational  discussion,  and  direct  participa¬ 
tion,  and  advocate  the  promotion  of  deliberative 
democracy  as  a  rational  strategic  choice  for  China 
in  developing  socialist  democracy.  Some  even 
suggest  considering  deliberative  democracy  as  an 
alternative  to  electoral  competition. 

Other  scholars,  however,  take  a  more  cautious 
approach  concerning  the  feasibility  of  deliberative 
democracy  in  China.  They  argue  that  there  is  a  gap 
between  China’s  political  consultative  system  void 
of  competitive  elections  and  Western  deliberative 
democracy  based  on  representative  politics.  As  an 
ideal  and  nonmainstream  theory  in  the  West, 
deliberative  democracy  cannot  function  as  an 
alternative  to  representative  democracy.  The 
strength  of  deliberative  democracy  lies  in  its 
encouragement  of  rational  thinking,  dialogue,  and 
participation,  but  it  consumes  more  time  and 
money  than  periodical  elections.  They  argue, 
therefore,  that  the  feasible  step  for  China  to  take  is 
to  develop  representative  democracy  based  on 
elections,  and  without  the  context  of  competitive 
elections,  true  deliberation  on  an  equal  basis  can¬ 
not  take  place,  even  at  the  grassroots  level. 
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Divergence  in  the  Chinese  Model  of  Democracy 

While  the  Chinese  authorities  declare  that  they 
intend  to  build  a  socialist  democracy  with  Chinese 
characteristics  and  some  scholars  are  keen  to  illus¬ 
trate  the  Chinese  model  of  democracy  along  with 
China’s  economic  rise,  many  intellectuals  in  today’s 
China  believe  that  democracy  is  a  universal  con¬ 
cept  and  that  there  is  no  particular  model  for 
China’s  democracy.  Democracy  as  practiced  in 
China  and  other  countries  in  the  world  should 
share  some  fundamental  principles  and  institu¬ 
tional  arrangements. 

Some  scholars  believe,  by  contrast,  that  China 
has  created  a  unique  model  of  democracy,  which  is 
more  advanced  than  that  practiced  in  the  West.  As 
one  scholar  argues,  China  has  experienced  the  most 
successful  industrialization  in  the  world,  guaranteed 
people’s  rights  and  created  a  dynamic  society 
through  orderly  political  reforms,  and  enjoyed  a 
leap  forward  because  of  the  concentration  of  national 
resources.  This  suggests  that  the  Chinese  democratic 
model  is  the  best  one  in  the  world.  This  perspective 
has  been  endorsed  by  the  White  Paper  on  the 
Building  of  Political  Democracy  in  China,  issued  by 
the  Chinese  government  in  2005.  According  to  this 
white  paper,  there  is  no  single,  absolute,  and  univer¬ 
sally  applicable  democratic  model  in  the  world. 

Rather  than  highlighting  the  Chinese  character¬ 
istics  of  democracy  or  the  best  democratic  model 
in  China,  some  scholars  argue  that  China  has  not 
crossed  the  threshold  of  democracy  at  all  but 
remains  an  authoritarian  regime  or  posttotalitar- 
ian  state.  While  Chinese  elites  have  made  great 
efforts  since  the  reforms  to  promote  grassroots 
democracy  from  below  and  inner-party  democracy 
from  inside,  no  substantial  progress  in  terms  of 
democratization  has  been  made  in  the  view  of  the 
liberals.  Up  to  now,  there  is  no  consensus  on 
China’s  democracy  and  its  model.  Philosophically, 
however,  China’s  democracy  will  be  a  combina¬ 
tion  of  universalism  and  particularism,  which  is 
accepted  by  most  intellectuals. 
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Democracy:  Middle  East 
Perspectives 


Notwithstanding  the  political  change  that  has 
taken  place  to  varying  degrees  during  the  past 
2  decades  in  the  countries  of  the  Middle  East  and 
North  Africa  (MENA),  an  extensive  region  com¬ 
prising  the  22  members  of  the  Arab  League  as  well 
as  Israel,  Iran,  and  Turkey,  Middle  East  scholars 
concur  in  the  assessment  that,  with  few  exceptions 
proving  the  rule,  MENA  is  still  a  democratic  waste¬ 
land.  Although  most  of  the  region’s  regimes  run 
some  sort  of  representative  body  or  have  a  formally 
democratic  political  system,  this  statement  holds 
true  even  when  applying  a  minimalist  Schumpeterian 
definition  of  democracy  focusing  on  the  legitimiza¬ 
tion  of  governance  through  regular  free  and  fair 
elections.  The  relative  absence  of  both  democratic 
regimes  and  substantial  democratization  processes, 
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especially  in  the  Arab  region,  has  led  to  intensive 
discussions  about  why  the  region  has  largely  been 
unaffected  by  the  “Third  Wave  of  Democratization” 
(Samuel  P.  Huntington).  To  denote  the  region’s 
alleged  uniqueness  in  autocracy,  economic  misery, 
and  the  abundance  of  conflicts,  the  term  Middle 
Eastern  Exceptionalism  has  gained  wide  usage, 
even  though  many  Middle  East  scholars  feel  uneasy 
about  the  term  itself  and  its  implications.  While  the 
assumption  that  MENA  is  the  most  conflict-prone 
world  region  and  that  conflict  dynamics  there  fol¬ 
low  a  deviant  logic  as  compared  with  non-MENA 
conflicts  has  been  invalidated,  many  scholars  also 
challenge  the  core  assumption  of  Middle  Eastern 
Exceptionalism — that  is,  that  MENA  proves  to  be 
exceptionally  hostile  to  democratization.  First,  it  is 
argued  that,  given  the  sobering  record  of  persistent 
authoritarianism  and  stalled  or  aborted  democrati¬ 
zation  processes  in  a  considerable  number  of  coun¬ 
tries  in  Central  Asia,  Africa,  and  the  former  Soviet 
Union,  MENA  can  hardly  be  referred  to  as  being 
unique.  Second,  promising  democratization  pro¬ 
cesses  in  Turkey,  a  candidate  country  for  member¬ 
ship  in  the  European  Union  (EU),  and  Israel’s  con¬ 
solidated  democracy  not  only  question  the  term’s 
spatial  scope  but  also  reveal  its  lack  of  explanatory 
power  when  discussing  global  trends. 

In  order  to  approach  the  prospects  for  Middle 
Eastern  democracy,  this  entry  first  gives  a  review 
of  the  obstacles  to  democracy  in  MENA  as  dis¬ 
cussed  in  the  academic  literature.  It  then  outlines 
recent  sociopolitical  developments,  including  the 
role  of  Islamist  parties  in  the  domestic  political 
process  and  their  impact  on  democratization. 

Roadblocks  to  Democracy 

The  quest  for  the  reasons  that  account  for  the 
absence  of  democracy  in  MENA  has  spawned  an 
immense  body  of  literature  over  the  past  years.  At 
least  five  major  determinants  are  put  forward  to 
explain  why  authoritarianism  is  still  the  dominant 
type  of  regime  in  the  region:  weak  civil  societies, 
patrimonialism,  rentier  structures,  geostrategic 
location,  and  Islam.  None  of  these  determinants  is 
without  controversy,  though.  Before  elaborating 
on  the  single  determinant,  it  is  worth  noting,  how¬ 
ever,  that  the  shift  from  the  macrolevel  of  analysis 
in  democratization  theory  to  agency-based 
approaches  as  introduced  by  the  seminal  work 
Transitions  From  Authoritarian  Rule  by  Guillermo 


O’Donnell  and  Philippe  C.  Schmitter  (1986)  was 
accompanied  by  the  conceptual  break  with  the 
“precondition”  tradition.  That  is,  theories  of 
democratization  that  preceded  agency-based  tran¬ 
sition  theory  took  for  granted  that  the  establish¬ 
ment  of  democracy  would  be  a  by-product  of 
modernization  characterized  by  greater  wealth,  the 
formation  of  a  bourgeoisie,  more  tolerant  civic 
cultures,  and  the  overcoming  of  economic  depen¬ 
dency.  But  as  democratic  transitions  in  former 
authoritarian  countries  in  Latin  America  and 
Eastern  Europe  have  shown,  that  reality  somewhat 
belied  theory  since  those  countries  democratized 
without  meeting  the  preconditions  or  prerequisites 
of  democracy  in  advance.  Thus,  recent  democrati¬ 
zation  theory  argues  that  favorable  socioeconomic 
conditions  account  for  the  survivability  of  democ¬ 
racy  rather  than  the  transition  to  it.  It  is,  therefore, 
valuable  that  Eva  Beilin  (2004)  reminds  us  of  the 
pitfalls  in  the  precondition  paradigm,  which  seems 
to  have  been  extended  in  the  case  of  Middle 
Eastern  studies,  even  though  it  is  ill-founded  from 
a  theoretical  perspective. 

The  following  reasons  are  listed  to  illustrate  the 
failure  of  Middle  Eastern  democratization:  First, 
civil  society,  which  is  commonly  assumed  to  play  an 
essential  role  in  the  development  of  a  vibrant 
democracy,  is  overwhelmingly  judged  to  be  weak  in 
MENA.  However,  assessments  on  civil  societal 
structures  in  MENA  are  ambivalent.  While  some 
authors  even  wonder  whether  “there  is  any  civil 
society  at  all  in  the  Arab  Middle  East,”  others  like 
Saad  Eddin  Ibrahim  are  far  more  confident,  detect¬ 
ing  a  potential  for  grassroots  pressure  in  Middle 
Eastern  civil  societal  organizations.  To  be  sure,  a 
balanced  evaluation  of  the  strength  of  civil  society 
depends  on  the  differences  among  individual  MENA 
countries.  Concerning  Egypt,  Syria,  and  Lebanon, 
for  instance,  the  Arab  Human  Development  Report 
2005  positively  speaks  of  an  increasing  role  of  civil 
societal  organizations  in  the  political  public  sphere. 
On  the  other  hand,  some  authors  dislike  the  Islamist 
domination  of  the  civil  society  sector,  a  fact  that 
need  not  necessarily  appear  in  a  bad  light  since, 
even  among  Islamists,  there  is  broad  agreement  that 
enlarging  a  democratically  engaged  public  sphere  is 
top  priority  for  MENA  development. 

Related  to  this  argument  is  the  assumption  that 
a  patriarchal  tradition  of  power  is  deep-rooted  in 
the  Arab-Islamic  culture  and  thus  impedes  democ¬ 
ratization.  The  persistence  of  patrimonial  regimes, 
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however,  is  more  convincingly  elucidated  by  the 
rentier  state  paradigm  than  by  mentality.  As 
Giacomo  Luciani  has  noted,  rentier  states,  that  is, 
states  whose  revenue  is  predominantly  derived 
from  nonproductive  sources  of  income,  such  as  oil 
or  other  foreign  sources,  and  whose  expenditure 
constitutes  a  substantial  share  of  gross  domestic 
product  (GDP),  are  less  dependent  on  the  extrac¬ 
tion  of  wealth  from  their  populations  to  finance 
the  state.  Accordingly,  there  is  no  need  for  account¬ 
ability  on  the  part  of  the  ruling  elites  whose  actions 
remain  unchecked  for  the  reason  that,  from  an 
alleged  population’s  point  of  view,  it  is  not  imper¬ 
ative  to  stand  up  to  arbitrary  taxation  and  demand 
greater  political  participation  (“No  representation 
without  taxation”).  Political  quietism  is  further¬ 
more  evoked  by  governmental  distribution  of 
rents,  which,  in  addition,  solidifies  the  existing 
patterns  of  political  patronage.  The  share  of  the 
private  sector  in  GDP  remains  marginal  and 
accounts  for  both  the  calamitous  economic  perfor¬ 
mance  of  rentier  states  and  the  impediment  to  the 
emergence  of  a  large  middle  class,  which  is  com¬ 
monly  regarded  as  the  germ  cell  of  democracy. 

The  “oil  factor”  leads  to  another  major  road¬ 
block  for  Middle  Eastern  democracy:  the  geostrate¬ 
gic  location  of  MENA  states.  It  is  contended  that 
the  desire  of  the  United  States  and  Europe  for  stable 
Arab  regimes  outweighs  their  desire  for  democrati¬ 
zation,  and  thus  potentially  destabilization,  of  the 
region  since  only  stable  regimes  can  safeguard 
Western  energy  supply  and  contain  the  terrorist 
threat.  Western  claims  for  a  democratic  Middle 
East  are  hence  undermined  by  Western  economic 
and  military  aid  to  MENA  regimes,  in  the  eyes  of 
local  human  rights  activists.  The  geostrategic  deter¬ 
minant,  furthermore,  subsumes  the  impact  of  the 
Arab-Israeli  conflict  that  is  often  believed  to  inhibit 
democratization  as  the  regional  authoritarian 
regimes  argue  that  the  external  threat  of  Israel  ren¬ 
ders  the  maintenance  of  a  strong  military  and  secu¬ 
rity  apparatus  necessary,  which,  in  fact,  turns  out  to 
be  a  coercive  apparatus  for  internal  repressive  mea¬ 
sures  in  the  first  instance,  as  these  regimes  use  the 
Israel  threat  as  an  excuse  for  internal  repression. 

Finally,  Islam  is  probably  the  most  contentious 
point  of  debate.  Some  well-known  scholars  main¬ 
tain  that  Islam  is  utterly  inconsistent  with  Western 
notions  of  democracy  and  pluralism.  By  contrast, 
there  exists  a  voluminous  literature  demonstrating 
that  Islam  and  democracy  are  not  only  compatible 


but  can  also  be  complementary.  Here,  it  is  neces¬ 
sary  to  distinguish  between  two  levels  of  analysis. 
First,  it  is  important  not  to  confuse  Islam  with 
Islamism,  with  the  latter  being  defined  here  and 
commonly  as  the  active  assertion  and  promotion  of 
beliefs,  prescriptions,  or  policies  that  are  supposed 
to  be  Islamic  in  character — a  definition  that,  by  its 
nature,  awkwardly  encompasses  jihadi  groups, 
such  as  A1  Qaeda,  as  well  as  the  Syrian  liberal 
reformer  Muhammad  Shahrur.  The  Islamic  author¬ 
itative  scriptures  (Qur’an  and  Hadith)  leave  wide 
margins  for  interpretation,  especially  when  it  comes 
to  the  political,  social,  and  economic  relations 
among  humans  as  well  as  the  relations  between 
humans  and  governmental  organizations.  Having 
said  that  and  bearing  in  mind  the  variety  of  Islamic 
doctrinal  schools  and  traditions,  one  can  under¬ 
stand  the  existence  of  a  wide  spectrum  of  conflictive 
interpretations  by  Islamic  scholars  (ulema)  or 
Islamist  activists,  ranging  from  the  liberal  interpre¬ 
tations  of,  for  example,  the  Egyptian  Islamist  Wasat 
Party  to  the  fundamentalist  tenets  of  the  influential 
Egyptian  Islamic  scholar  Yusuf  al-Qaradawi.  In 
addition,  empirical  findings  have  shown  that  the 
support  for  democracy  as  an  ideal  is  high  among 
Muslims  and,  more  important  in  this  context,  that 
Islam  has  less  influence  on  political  attitudes  than  is 
frequently  presumed  (Steven  Hofman,  2004). 

This  list  of  reasons,  which  are  supposed  to 
account  for  the  absence  of  democracy  in  MENA,  is 
not  comprehensive.  Other  causes  associated  with 
the  major  categories  introduced  above  are  also 
highlighted,  such  as  the  low  level  of  education,  the 
weakness  of  national  identities  based  on  artificial 
Middle  Eastern  state  formation,  or  the  remoteness 
of  a  democratic  neighborhood.  While  various  rea¬ 
sons  are  commonly  cited  in  a  cumulative  way  to 
dwell  on  the  Middle  Eastern  failure  of  achieving 
democracy,  few  authors  contest  that  the  region  has 
undergone  important  sociopolitical  changes  in  the 
past  few  decades.  Rather,  it  is  now  the  scope,  depth, 
and  intention  of  political  reforms  that  are  debated. 

Recent  Trends  in  Sociopolitical  Development 

Over  the  past  few  years,  the  MENA  countries  have 
launched  a  number  of  reforms  in  response  to 
domestic  and  global  challenges.  The  dimensions  of 
political,  economic,  and  social  crises  that  the  Arab 
Middle  East  faces  are  vividly  captured  in  the 
recently  published  Arab  Human  Development 
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Reports.  These  reports,  indeed,  demonstrate  that 
Arab  regimes  lag  behind  with  regard  to  some  sig¬ 
nificant  political  and  economic  reforms.  However, 
the  steps  that  have  been  taken  in  several  MENA 
countries  to  countervail  the  present  crises  may  also 
be  considered  as  a  basis  for  notable  change  in  the 
medium  and  longer  term.  Subsequently,  the  ambiv¬ 
alent  record  of  reforms  in  MENA  again  opens  the 
door  for  scholarly  discussion. 

Regarding  Middle  Eastern  economies,  the  provi¬ 
sion  of  finance  by  the  International  Monetary  Fund 
and  the  World  Bank  has,  in  return,  led  to  structural 
reforms  in  key  Arab  states,  such  as  Tunisia,  Egypt, 
Jordan,  and  Morocco.  These  reforms  induced  pro¬ 
grams  of  economic  liberalization  that  opened  up 
recipient  economies  to  the  world  economy.  This 
so-called  infitah  policy  has  raised  hope  among  pro¬ 
ponents  of  modernization  theory  that  economic 
liberalization  will  be  followed  by  political  openings 
in  the  long  run.  In  terms  of  political  reform,  the 
MENA  region  has  witnessed  slight  improvements 
in  political  rights  and  civil  liberties.  The  Freedom 
House  Survey  2009  asserts  that  during  the  Bush 
presidency,  nine  of  the  region’s  countries  experi¬ 
enced  some  advancement  on  the  Freedom  in  the 
World  scale,  including  several  of  the  Gulf  States 
and  Saudi  Arabia.  Yet  despite  the  first  formally 
competitive  presidential  elections  in  Algeria  (2004) 
and  Egypt  (2005),  the  first  Saudi  municipal  elec¬ 
tions  in  2005,  the  introduction  of  a  new  family 
code  in  Morocco  in  2004,  and  the  appearance  of 
political  protest  movements  (e.g.,  Kifaya  in  Egypt 
and  the  Damascus  Declaration  in  Syria),  all  of 
which  have  been  interpreted  as  important  steps 
toward  more  political  liberalization,  no  major 
breakthroughs  had  been  detected  in  MENA  by 
2009.  Israel  is  still  the  only  country  in  the  region 
that  the  Freedom  in  the  World  2009  survey  (by 
Freedom  House)  classifies  as  free,  whereas  only  six 
MENA  states  are  categorized  as  partly  free  (Turkey, 
Jordan,  Morocco,  Lebanon,  Bahrain,  and  Kuwait). 
The  rest  fall  into  the  category  of  not  free. 

However,  events  in  2011  might  turn  the  political 
landscape  of  the  Arab  world  inside  out.  On  January 
14,  the  long-serving  Tunisian  President  Zayn 
Al-‘Abidin  Bin  ‘Ali  surrendered  to  public  demands 
and,  following  a  relatively  short  period  of  massive 
popular  protest,  fled  the  country.  The  seemingly 
effortless  fall  of  Bin  ‘Ali,  whose  security  apparatus 
was  deemed  one  of  the  strongest  and  most  oppres¬ 
sive  in  the  region,  came  to  be  known  as  the  Jasmine 


Revolution  and  incited  further  popular  upheaval 
across  the  Arab  world,  primarily  borne  and  engi¬ 
neered  by  the  region’s  urban,  marginalized,  and 
well-educated  youth.  Only  a  short  time  after,  on 
February  11,  Egypt’s  president,  Hosni  Mubarak, 
was  swept  away.  While  other  Arab  leaders  may  still 
meet  the  same  fate,  it  remains  to  be  seen  whether 
the  so-called  Peaceful  Youth  Revolution  (thawra 
al-sbabab  al-silmiyya )  in  the  Middle  East  will  even¬ 
tually  result  in  liberal  democracies  or  not. 

Skeptics  admit  that  today  a  rapid  and  extensive 
political  transformation  process  is  underway  in 
MENA  but  point  to  the  robustness  of  Middle 
Eastern  authoritarianism.  In  light  of  the  ambiva¬ 
lent  inventory  of  reforms  predating  the  Arab  upris¬ 
ing,  only  few  authors  saw  the  Arab  world  as  being 
on  the  path  toward  democracy.  Rather,  there  was 
widespread  agreement  that  the  region’s  authoritar¬ 
ian  regimes  had  learned  not  only  to  accommodate 
themselves  to  new  political,  economic,  and  social 
conditions  but  also  to  shape  them  according  to 
their  own  best  interests.  Steven  Heydemann  hence 
speaks  of  “Authoritarian  Upgrading”  and  illus¬ 
trates  how  Arab  regimes  have  succeeded  in  manag¬ 
ing  political  contestation,  benefiting  from  selective 
economic  reforms,  controlling  new  communica¬ 
tion  technologies,  and  stemming  as  well  as  appro¬ 
priating  civil  society.  Or,  in  other  words,  they  have 
up  to  now  skillfully  handled  Western  democracy 
promotion  by  means  of  superficial  reforms  while 
simultaneously  retaining  domestic  authority. 

The  Rise  of  Islamist  Competitors 

Yet  another  trend  characterizes  the  recent  sociopo¬ 
litical  development  in  the  region — the  rise  of 
Islamist  competitors  to  authoritarian  rule.  Within 
the  past  2  decades,  Islamist  movements  (baraka) 
such  as  the  Muslim  Brotherhood  (al-ikbwan  al- 
muslimun )  have  become  popular  grassroots  orga¬ 
nizations  with  their  associated  political  parties 
( hizb )  being  capable  of  winning  the  majority  of 
votes  in  most  MENA  countries,  if  free  and  fair 
elections  were  held.  From  Morocco  to  Iraq, 
Islamists  are  represented  in  the  national  parlia¬ 
ments,  if  granted  access  to  the  political  system,  or, 
like  the  Mouvement  pour  la  Societe  de  la  Paix 
(MSP)  in  Algeria,  even  participate  in  government 
coalitions.  This  thriving  of  Islamist  parties  is 
closely  observed  by  Western  academics  and  politi¬ 
cians  alike.  Still,  many  voices  disapprove  of  and 
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fear  the  Islamists’  political  successes.  In  the  mean¬ 
time,  however,  the  majority  of  Middle  East  scholars 
distinguish  between  moderate  and  radical  Islamists. 
While  the  latter  aim  at  the  creation  of  an  Islamic 
state  through  revolution  in  the  Qutbian  tradition  (or 
seek  to  maintain  it,  as  is  the  case,  for  instance,  with 
the  Shiite  ulema  appointed  to  the  Council  of 
Guardians  in  the  formally  democratic  Islamic 
Republic  of  Iran),  the  former  are  prepared  to  pursue 
their  political  agenda  by  peaceful  means  within  the 
existing  political  institutions.  This  is  a  more  bal¬ 
anced  approach,  which  gives  consideration  to 
Middle  Eastern  realities.  It  is  far  from  reasonable  to 
look  at  Turkey’s  ruling  Islamist  Justice  and 
Development  Party  (Adalet  ve  Kalkmma  Partisi : 
AKP)  in  the  same  way  as  at  the  fundamentalist  Hi  zb 
al-Tabrir,  which  is  banned  in  many  MENA  coun¬ 
tries.  Vali  Nasr,  for  instance,  counts  the  AKP  among 
“Muslim  democrats.”  According  to  Nasr,  Muslim 
democratic  parties,  which  are  found  mainly  in  the 
non-Arab  Muslim  world,  are  akin  to  Christian 
democratic  parties  due  to  their  commitment  to 
democratic  values,  their  moderate  religious  ideol¬ 
ogy,  and  their  systemic  modus  operandi.  Such  mod¬ 
erate  Islamist  parties,  however,  are  also  emerging  in 
the  Arab  Middle  East.  Or,  more  precisely,  as  Islamist 
parties  are  overwhelmingly  composed  of  radical  and 
moderate  wings,  moderates  there  are  getting  the 
upper  hand  over  their  more  radical  party  members. 

The  success  of  Islamist  parties  must  not  only  be 
attributed  to  the  eroding  legitimacy  of  secularist 
and  nationalist  elites  in  MENA  but  also  to  the  abil¬ 
ity  of  their  Islamist  mother  organizations  to  synthe¬ 
size  the  interests  of  the  lower  and  the  middle 
classes.  Besides,  Islamist  movements  are  popular 
because  of  their  welfare  activities,  which  provide 
broad  social  strata  with  basic  health,  nutrition,  and 
educational  services,  a  task  insufficiently  performed 
by  public  institutions.  Also,  as  a  result  of  antici¬ 
pated  electoral  successes  on  account  of  a  strong 
popular  backing,  Islamist  parties  have  turned 
toward  political  participation  within  existing  state 
institutions — all  the  more  as  former  strategies  of 
overthrowing  regimes  using  violence  have  failed. 
This  change  of  strategy  can,  in  addition,  be  sub¬ 
stantiated  by  the  rationale  of  rent  seeking:  The 
middle  classes  who  are  incorporated  into  Islamist 
movements  to  a  large  extent  do  not  engage  in  con¬ 
flicts  with  the  state  but  rather  hope  for  more 
upward  mobility  and  access  to  rents  from  within 
the  state  apparatus.  Evidently,  the  inclusion  of 


Islamists  in  the  political  process  also  presupposes 
the  preparedness  of  the  regime  to  include  rather 
than  exclude  Islamists. 

At  present,  Islamist  parties  are  on  a  learning 
curve,  and  the  moderation  of  many  Islamist  parties 
arises  from  their  inclusion  in  the  political  process. 
After  their  moving  into  parliament,  Islamists  have 
to  assume  responsibility  for  taking  action  as  well 
as  refraining  from  doing  so.  Like  every  political 
party,  Islamists  have  to  compile  party  platforms, 
which  give  answers  to  domestic  and  global  chal¬ 
lenges  and  justify  their  positions  in  public.  Once 
accountable  to  the  electorate,  they  swiftly  realize 
that  there  are  no  such  easy  remedies  for  urgent 
social  and  socioeconomic  problems,  such  as  just 
proclaiming  “Islam  is  the  Solution”  (al-Islam 
huwa  al-hall ).  The  day-to-day  work  in  parliaments 
or  in  parliamentary  committees  also  leads  to  an 
institutionalization  and  modernization  of  party 
structures.  Similarly,  it  requires  getting  along  with 
former  ideological  opponents.  To  convince  non- 
Islamist  parliamentarians,  Islamists  learn  to  argue 
on  a  political  and  economical  basis  in  different 
policy  fields  instead  of  incessantly  resorting  to  reli¬ 
gious  reasoning.  All  this  leads  to  the  professional¬ 
ization  and  de-radicalization  of  Islamist  parties,  as 
can  be  observed  in  the  cases  of  the  Moroccan 
Justice  and  Development  Party  ( Parti  de  la  Justice 
et  du  Developpement,  PJD)  or  the  Algerian 
Movement  for  the  Society  of  Peace  (MSP).  Within 
the  party  leadership,  new  technocratic  elites  are 
eclipsing  the  old  religious  elites  step  by  step,  a  pro¬ 
cess  that  involves  both  the  gradual  establishment 
of  inner-party  democracy,  including  internal  party 
strife  as  a  matter  of  course,  and  growing  indepen¬ 
dence  from  their  mother  organizations. 

Obviously,  not  all  Islamist  parties  chose  to  take 
the  path  of  political  participation  or,  as  mentioned 
before,  were  barred  by  the  regime  from  doing  so. 
However,  all  Islamist  parties  that  did  embark  on 
the  strategy  of  participation  have  been  faced  by 
what  is  best  described  as  the  “participation 
dilemma.”  That  is  to  say,  as  Islamist  parties  are 
co-opted  into  the  regime,  they  run  the  risk  of  los¬ 
ing  credibility  in  the  eyes  of  their  supporters.  Votes 
of  Islamist  parties  in  Jordan,  Algeria,  Morocco, 
and  Yemen  demonstrate  that  Islamist  parties, 
accordingly,  have  performed  badly  in  elections 
once  they  entered  parliament  or  the  government — 
all  the  more  as  the  hoped  for  exertion  of  influence 
on  political  decision  making  often  proved  to  be 
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unrealistic.  Since  the  political  system  that  they 
presently  operate  in  does  not  allow  for  serious 
policy  changes,  Islamists  often  confine  themselves 
to  playing  the  role  of  moral  watchdogs  in  parlia¬ 
ment.  Nevertheless,  in  this  way,  Islamists  have 
become  a  major  pillar  of  the  regimes  since  by  their 
very  participation  in  the  political  process  they 
bestow  legitimacy  on  the  regimes  and  foreclose  the 
emergence  of  a  more  powerful  opposition. 

Moderate  Islamists:  Partners  of  the  West? 

Even  though  there  is  no  doubt  that  there  have  been 
significant  changes  in  the  ideologies  of  many 
Islamist  parties  and  their  course  of  action,  the 
question  of  whether  this  change  is  genuine  and  can 
be  trusted  is  a  highly  controversial  one.  Many  hold 
that  even  moderate  Islamists  will  only  show  their 
true  colors  after  having  assumed  power.  While  it  is 
virtually  impossible  to  disprove  the  latter  assump¬ 
tion  ex  ante,  drawing  a  balance  sheet  of  academic 
findings  about  the  political  performance  of  moder¬ 
ate  Islamists  in  the  past  can  be  helpful.  Whether 
out  of  strategic  considerations  or  conviction,  mod¬ 
erate  Islamist  parties  have  stipulated  democracy  in 
their  party  platforms.  Here,  democracy  is  no  lon¬ 
ger  understood  as  a  Western  product,  rather  gen¬ 
eral  procedural  aspects  are  emphasized.  Islamists 
strongly  advocate  transparency  as  well  as  the 
establishment  of  and  abidance  by  democratic  stan¬ 
dards.  By  way  of  inclusion  into  the  parliamentary 
system,  Islamists  have  learnt  to  focus  on  national 
interests  and  to  put  ideology  on  hold.  In  this  way, 
Islamists  are  guided  by  increasing  pragmatism 
when  attempting  to  handle  political  challenges. 
Furthermore,  Islamists  have  shown  their  willing¬ 
ness  to  seriously  work  in  coalition  with  secular 
parties  and,  as  is  the  case  with  the  Yemeni  Joint 
Meeting  Parties,  also  with  other  denominational 
forces.  Having  said  that,  to  this  day  Islamist  par¬ 
ties  have  not  yet  completed  the  process  of  modera¬ 
tion,  and  it  is  still  uncertain  whether  their  interpre¬ 
tations  of  Islamic  law  ( shari'a )  are  flexible  enough 
to  fully  reconcile  religious  with  democratic  values. 

Islamist  parties  have  taken  a  firm  stand,  too, 
with  regard  to  another  important  policy  field,  the 
economy.  An  analysis  of  party  platforms  shows 
that  Islamists  favor  a  social  market  economy  mod¬ 
eled  on  continental  European  examples.  That  is, 
they  clearly  reject  a  state-directed  economy  while 
approving  a  state-led  regulation  of  the  public 


sector.  And  they  demand  the  strengthening  of  the 
private  sector  while  endorsing  public  welfare 
spending.  Hence,  Islamist  economic  programs  are 
directed  to  their  party  clientele,  the  middle  class. 
And  it  is,  ironically,  the  Islamist  parties  that  are 
committed  to  agreements  with  regard  to  the 
Washington  Consensus. 

Against  the  background  of  the  dismal  prospects 
for  democracy  in  MENA,  on  the  one  hand,  and  the 
rising  relevance  of  Islamist  parties,  on  the  other, 
more  and  more  Middle  East  scholars  contend  that 
democratization  in  MENA  is  impossible  to  reach 
without  the  participation  of  Islamist  parties.  Thus, 
Western  governmental  elites  and  scholars  alike 
have  begun  to  address  the  issue  of  whether  or  not 
moderate  Islamist  parties  can  be  seen  as  reliable 
partners  in  future  democratic  developments. 
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Democracy:  Russian 
Perspectives 


The  first  use  of  the  word  deMOKpamun  (demokratiia) 
in  Russian  can  be  traced  to  the  translation  of 
Samuel  Pufendorf’s  (1718)  Introduction  to  the 
History  of  the  Principal  Kingdoms  and  States  of 
Europe.  But  this  does  not  mean  that  there  was  no 
idea  of  democracy  in  Russia  before  that  time. 
Russian  chroniclers  or  statesmen  have,  centuries 
earlier,  described  some  democratic  practices  with 
words  such  as  caModepotcaeue  (samoderzhavie — 
literally  “self-consolidation”),  caMoenacmue 
(samovlastie — literally  “self-rule”),  and  ca.uoco.me 
(samovolie — literally  “self-will”),  and  they  called 
some  Cossack  communities  “a  kind  of  a  republic” 
using  a  transliterated  Polish  word  pentnocnojiuma 
( rzeczpospolita )  or  simply  called  democratic 
orders  “free”  ( ceododnuu — svobodyi)  or  “unre¬ 
stricted”  (eojibHbiu — vol’nyi).  This  entry  discusses 
some  of  the  historical  roots  and  various  interpre¬ 
tations  of  democracy  in  Russia. 

Despite  the  relatively  late  use  of  the  term, 
democracy  was  quickly  given  a  very  specific 
Russian  implication.  Democracy  was  considered 
to  be  deeply  inherent  in  the  everyday  life  of  the 
Russian  peasant  community — obschina — and 
exceptionally  specific  in  this  regard,  nothing  like 
its  plain  Western  counterpart.  The  19th  century 
was  a  time  of  profound  social  crisis  and  ongoing 
preparations  for  the  liberation  of  serfs  and  subse¬ 
quent  modernizing  reforms.  It  was  the  time  when 
talk  about  the  democratic  instincts  of  Russian 
peasants  and  the  archetypal  centrality  of  pure  and 
incorrupt  (direct)  democracy  to  Russian  obschina 
were  common  in  intellectual  circles.  In  fact,  demo¬ 
cratic  potential  can  be  traced  to  primordial  tribal 
times  in  any  tradition.  As  for  Russia,  such  demo¬ 
cratic  ways  were  fairly  well  expressed  in  history 
and  were  still  alive  in  local  communities  in  the 
19th  century  and  even  later. 

Beginning  around  the  9th  century,  the  poleis  of 
Eastern  Slavic  tribes  and  tribal  federations  interacted 


with  the  equally  rudimentary  military  democracy 
of  Varangian  (druzhina — princely  retinue,  literally 
“camaraderie”)  and  set  up  a  common  military- 
trade  infrastructure.  Their  interface  contributed  to 
the  development  of  a  patrimonial  power  structure 
of  the  Kievan  Rus ’  principality,  where  interactions 
between  the  prince  and  the  people  produced  legit¬ 
imizing  effects.  Polis  structures  with  their  demo¬ 
cratic  potential  (such  as  veche — ecclesia  and/or 
boule  in  various  cities — literally  talk,  deliberation, 
council)  were  important  elements  of  early  demo¬ 
cratic  elements  in  Russia,  as  were  ves’  grad  (gen¬ 
eral  assembly  or  polis,  as  a  partner  to  the  prince, 
literally  all  the  city),  startsy  gradskie  (city  elders), 
and  sbor  (assembly).  Equally  significant  were  con¬ 
ciliar  structures  such  as  druzhina  (early  princely 
council,  literally  camaraderie)  and  duma  (late 
princely  council,  literally  thought). 

The  Mongols  (the  “Golden  Horde”)  invaded 
Russia  during  1236  to  1242.  With  the  conquest  of 
Rus’  by  the  Horde,  princes  emerged  as  the  only 
authority  to  control  political  order  in  Russian 
dependencies  of  the  Horde  in  the  eastern  and 
southern  parts  of  the  former  Kievan  Rus’.  Their 
rule  was  granted  by  Khans  (iarlyk — mandate, 
Mongolian  jarligh — literally  order).  Actually,  the 
Khan  was  the  only  source  of  all  power.  The  power 
was  delegated  from  the  top  down.  Still,  there  was 
only  one  but  significant  exception.  Princes  also 
had  to  maintain  the  Christian  derzhaval  (literally 
something  holding  together),  which  united  them 
with  the  people  and  domestically  was  an  alterna¬ 
tive  source  of  power  and  princely  authority. 

In  ancient  and  medieval  Rus’,  ideas  of  democracy 
and  autocracy  were  not  opposed  to  each  other  but 
in  fact  closely  interwoven.  In  fact,  in  medieval 
Russian  texts,  the  terms  mentioned  earlier — samod¬ 
erzhavie  (“self-consolidation”),  samovlastie  (“self- 
rule”),  or  samovolie  (“self-will”) — were  used  to 
denote  both  democracy  and  autocracy.  The  differ¬ 
ence  was  contextual.  It  depended  on  who  was  the 
self  in  question — the  whole  community  or  the  ruling 
authority  or,  more  typically,  both,  integrated  into  a 
single  body  politic.  It  was  conceptual  substance 
(consolidation  of  power)  that  was  important, 
not  the  form  of  the  consolidation — monocentric, 
polycentric,  or  dispersed. 

The  Russian  word  samoderzhavie  is  usually 
translated  as  “autocracy.”  The  word  actually 
emerged  as  a  Slavic  counterpart  of  the  respective 
Greek  term  autokratia.  But  the  translation  was  not 
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exact.  While  the  first  components  of  both  words 
[auto-  and  samo -)  may  be  considered  fairly  equiv¬ 
alent  as  denomination  of  Self,  the  second  ones 
were  somewhat  different.  Both  implied  power  but 
of  a  different  kind.  The  Greek  kratos  denoted 
coercive  and  instrumental  power,  mingling  the 
ideas  of  “military  or  physical  supremacy”  (i.e., 
krat-)  and  “hardiness”  (i.e.,  kartu -)  (Emile 
Benveniste,  1969).  The  Slavic  derzbava  connoted 
integrative  power  of  holding  people  together 
originating  from  dher-  (“to  hold  together”).  So 
while  autocracy  suggested  coercive  power  of  an 
unrestricted  Self  over  one’s  subordinates,  samod- 
erzhavie  stood  for  the  power  of  self-integration. 
The  reason  was  that  a  fairly  primitive  tribal  power 
structure  would  not  differentiate  between  the 
power  emanating  from  the  tribal  community  and 
the  power  of  its  chief.  Conceptually,  it  implied 
that  both  authorities  and  people  are  fully  inte¬ 
grated  into  a  single  body  politic.  In  fact,  this  con¬ 
ceptual  scheme  has  been  so  strong  that  it  produced 
one  of  the  most  widespread  Soviet  slogans  “People 
and  the  Party  are  integral”  (narod  i  partiia  ediny — 
literally  “people  and  the  party  are  one,  single, 
united”).  In  fact,  any  ruling  authority  in  Russia 
has  been  obsessed  with  the  idea  of  unity  with  the 
people.  Typically,  Putin’s  dominant  party  is  called 
Edinaia  Rossia  (literally  “single  or  integral 
Russia”)  and  not  Obyedinennaia  Rossia  or 
“United  Russia”  as  it  is  inaccurately  translated. 

Authority  Versus  People 

The  emergence  of  the  Grand  Duchy  of  Muscovy  in 
the  late  1 5th  century  was  a  great  act  of  emancipa¬ 
tion  from  nearly  3  centuries  of  Mongol  external 
domination,  known  as  the  Tatar  Yoke.  Around 
that  time,  Ivan  III  Vasilevich  (Ivan  the  Great)  of 
Muscovy  and  his  brother-in-law  Stefan  cel  Mare 
the  Great  of  Moldova  introduced  a  conceptual 
innovation.  Claiming  political  independence  from 
the  Golden  Horde  and  the  Ottoman  Empire,  they 
called  themselves  gosudar’  and  gospodar’  (two 
alternative  forms  of  the  same  word),  respectively, 
and  called  the  polities  they  controlled  their  “house¬ 
holds” — gosudastvo  and  gospodarstvo,  respec¬ 
tively.  In  current  Russian,  gosudastvo  is  a  standard 
term  for  the  state.  The  word  gosudar’  is  just  a  form 
of  the  Old  Slavic  word  Gospod’ — the  Lord.  Its 
meaning  can  be  traced  back  to  the  Indo-European 


notion  of  authority  over  aliens.  In  other  words,  the 
notion  referred  to  a  very  important  function  of 
distinguishing  kin  from  alien  and  deciding  which 
aliens  could  be  treated  as  adopted  kin  and  which 
were  to  be  rejected  as  enemies. 

To  stress  his  exceptional  status,  Ivan  III  demanded 
to  be  called  not  just  gosudar’  but  velikii  (great)  or 
samoderzbavnyi  (independent,  self-integrating). 
Typically  another  Russian  polity,  the  Novgorod 
Republic,  was  often  called  Gospodin  Velikii 
Novgorod,  which  can  be  translated  as  the  Great 
Overlord  Novgorod  or  more  accurately  Great 
Sovereign  Novgorod.  In  any  case,  both  Muscovite 
and  Novgorod  claims  to  the  status  of  samoder¬ 
zbavnyi  gosudar’  implied  that  both  the  household 
lord — princely  or  republican — and  his  people  were 
integrated  by  the  same  derzbava,  which  made 
them  all  free.  The  Grand  Duchy  of  Muscovy 
reemerged  as  a  sovereign  (samoderzbavnaia) 
Russia  in  its  first  revolution  of  self-integration,  as 
Russia’s  emancipation  from  the  outside  threat  to 
its  very  existence  turned  into  subordination  to  the 
very  authority  that  was  considered  instrumental  in 
bringing  about  that  emancipation.  Thus,  the 
grand  act  of  total  emancipation  immediately 
turned  into  ultimate  subordination.  This  pattern  is 
being  repeated  in  the  later  history  of  the  country 
as  well. 

Development  of  the  Russian  polity  was  uneven 
and  contradictory.  Still,  new  important  elements 
of  a  democratic  nature  appeared  including  a  sys¬ 
tem  of  representative  bodies.  In  1549,  Ivan  IV 
(Ivan  the  Terrible)  convoked  the  so-called  Sobor 
Primirenia  (Assembly  of  Reconciliation).  It  was 
composed  of  three  bodies — Boyar  Duma  (Royal 
Council),  Osvescbennyi  Sobor  (Blessed  or  Holy 
Assembly),  and  Zemski  Sobor  (Assembly  of  the 
Land).  A  short-lived  attempt  was  made  to  create  a 
structure  of  four  “estates” — tsar,  nobility,  clergy, 
and  commoners,  with  the  last  three  having  their 
assemblies. 

Since  the  mid-  16th  century,  all  kinds  of  assem¬ 
blies  were  more  or  less  regularly  convoked. 
Zemski  Sobor  was  an  important  but  subordinate 
body  under  Ivan  the  Terrible.  It  was  essentially 
used  to  outbalance  Boyar  Duma.  Sobors  of  the 
late  16th  and  early  17th  centuries  were  instru¬ 
ments  of  political  manipulation  rubber-stamping 
adventurous  takeovers  and  conquests.  Zemski 
Sobor  of  the  “tsarless”  period  of  1610-1613 
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became  an  actual  legislative  and  even  executive 
authority.  In  1613,  it  had  carried  out  a  series  of 
regional  sobors  and  a  special  joint  session  to  elect 
Mikhail  Romanov  as  a  new  tsar  and  the  founder 
of  a  new  dynasty.  Under  Mikhail,  sobors  met 
regularly  until  1622;  their  meetings  resumed  in 
1632,  but  their  roles  changed.  They  were  only  to 
consent  to  the  tsar’s  decisions  and  ceased  to  be 
convoked  after  1684. 

The  long  and  dramatic  story  of  changing  con¬ 
cepts  and  institutions  of  Russia  (gosudarstvo,  tsar- 
stvo,  derzgava,  otecbestvo,  etc.)  arrived  at  a  critical 
point  in  the  mid-17th  century.  In  Europe,  it  was  the 
time  of  termination  of  the  epoch  of  confessionalism 
marked  by  the  Peace  of  Westphalia  (1648). 
European  sovereigns  were  emancipated  from  a 
mandatory  subordination  to  transcendental  author¬ 
ity  and  could  rely  on  ratio  status.  They  had  to 
rationalize  relations  with  their  subjects  by  develop¬ 
ing  networks  of  bureaucracies  and  representative 
bodies.  In  Russia,  the  time  was  equally  decisive.  It 
reemerged  again  as  a  sovereign,  samoderzhavnaia 
power  in  its  second  revolution  of  self-integration 
after  the  Times  of  Trouble  of  the  early  17th  cen¬ 
tury,  marked  by  famine,  the  Polish  invasion,  and 
the  end  of  the  Rurik  dynasty.  This  time,  it  was  the 
so-called  Velikaia  Russkaia  Samoderzhavnaia 
Revolutsia  (Great  Russian  Autocratic  Revolution; 
see  Yuri  Pivovarov  &  Andrei  Fursov,  2001),  in 
which  the  new  Romanov  dynasty  emerged  follow¬ 
ing  Polish  domination.  Again,  radical  emancipa¬ 
tion  ultimately  became  subordination  as  history 
repeated  itself,  and  domination  of  subjects  by  their 
rulers  continued.  In  contrast  to  Western  Europe, 
which  was  territorially  divided,  Russia  or  rather 
Northern  Eurasia  was  territorially  integrated.  In 
Europe,  sovereign  states  developed  agencies  of 
various  levels  to  circulate  the  power.  In  Russia,  all 
former  political  agencies  were  reduced  to  insignifi¬ 
cance,  and  the  tsar  alone  emerged  as  the  sole 
authority  to  his  tsarstvo  and  his  people.  With  all 
the  centralizing  effects  of  the  Great  Russian 
Authoritarian  Revolution,  a  country  as  big  as 
Russia  simply  could  not  be  governed  by  using  only 
one  simple  principle  of  integration  of  power  and 
the  populace,  which  was  the  main  feature  of  the 
Russian  system  that  ensued  after  the  Great  Russian 
Autocratic  Revolution.  Many  other  patterns  of 
rule,  management,  and  decision  making  were 
applied  particularly  at  geographical  and  social 


peripheries  and  specific  domains  of  activities.  It  is 
there  that  undeveloped  or  even  primeval  patterns 
of  democratic  governance  survived,  including  local 
village  communities  or  small  townships  typically 
called  sloboda  (literally  “free  settlement”),  military 
orders  of  Cossacks  (literally  “free  rambler”  in 
Turkic),  religious  and  ethnic  self-governing  com¬ 
munities,  companies  of  explorers  (important  for 
Russia’s  frontier  of  unexplored  land  in  Siberia,  the 
steppes,  and  the  Far  North),  merchants  and  arti¬ 
sans  (so-called  artel’),  and  so  on. 

It  took  nearly  2  centuries  (and  more  than  a  cen¬ 
tury  of  Europeanization)  to  expand  the  polity  to 
such  an  extent,  and  for  practical  reasons,  a  vast 
network  of  institutions  had  to  be  developed  to 
regulate  relations  between  individuals,  groups  of 
individuals,  and  the  imperial  superstructure.  Along 
with  democratic  patterns  of  operation  at  the 
peripheries  of  all  kinds,  these  institutions  were 
influenced  and  distorted  by  the  Russian  system  but 
were  to  a  lesser  or  greater  extent  resistant  to  its 
sway. 

During  that  period  two  important  political 
notions  emerged:  (1)  Otecbestvo — Fatherland — a 
nation  with  a  long  conceptual  history  from  collec¬ 
tive  heritage  within  lineage  through  votchina  and 
(2)  a  member  of  the  nation  typically  called  Syn 
Otecbestva — Son  of  the  Fatherland.  Those  two 
notions  were  extremely  important  alternatives 
both  to  the  power  and  to  the  populace.  Highly 
personalized  power  was  depersonalized,  and 
impersonal  populace  turned  into  a  community  of 
personalized  sons  of  the  nation. 

By  the  beginning  of  the  19th  century,  Russia 
was  ready  for  another  revolution.  It  was  acceler¬ 
ated  by  the  Napoleonic  invasion.  The  Patriotic 
War  of  1812,  called  in  Russian  Otechestvennaia 
voina  (the  war  that  sons  of  the  Fatherland  waged 
to  save  their  Fatherland),  turned  into  the  third 
revolution  of  self-integration.  Russia  reemerged 
again  as  a  sovereign,  samoderzhavnaia  power  des¬ 
tined  to  maintain  the  political  and  moral  order  of 
post-Napoleonic  Europe  as  Alexander  I  the  Blessed 
solemnly  declared  at  the  Vienna  Congress.  Another 
grand  act  of  total  liberation  from  foreign  occupa¬ 
tion  turned  into  strengthening  the  autocrat  and  his 
or  her  power. 

Still  it  was  not  all  that  simple  and  one-sided. 
Endogenous  political  developments  coupled  with 
efforts  to  Westernize  and  integrate  into  European 
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political  order  produced  significant  changes. 
Probably  the  most  decisive  factor  was  the  emer¬ 
gence  of  a  notion  of  people  that  would  cover  not 
only  the  populace  but  also  a  potential  actor  safe¬ 
guarding  Russia  itself.  Typically  it  was  “silent,”  for 
example,  in  the  symbolic  scene  of  the  election  of  the 
tsar  in  Pushkin’s  “Boris  Godunov,”  but  it  was  able 
to  act  spontaneously  and  decisively  during  crucial, 
“revolutionary”  moments  such  as  Otechestvennaia 
voina.  An  important  reflection  of  the  changes  and 
of  a  new  balance  of  essentials  of  Russian  politics 
manifested  itself  with  the  declaration  of  the  guiding 
ideological  formula — pravoslavie  (orthodoxy), 
samoderzhavie  (autocracy  but  also  sovereignty  and 
self-integrity),  and  narodnost’  (nationality  or  peo¬ 
ple  mindedness,  ability  to  think  and  act  as  the  inte¬ 
gral  Russian  people).  It  is  true  that  the  formula 
coined  by  the  Russian  minister  of  education  Sergei 
Uvarov  in  1833  was  a  counterpart  of  the  formula 
of  the  French  revolution — Liberte,  Egalite, 
Fraternite — and  to  that  extent  was  “the  Russian 
version  of  a  general  European  ideology  of  restora¬ 
tion  and  reaction”  (Nicholas  Riasanovsky,  2005, 
p.  133).  But  it  is  equally  true  that  it  redefined  the 
power  setup  within  the  Russian  System.  Pravoslavie 
reaffirmed  the  old  principle  of  sacral  derzbava  with 
all  the  power  coming  from  above.  Samoderzhavie 
provided  pragmatic  derzbava  of  the  autocrat  and 
his  Polizeistaat  administration  and  also  for  Russia’s 
sovereignty  and  integrity.  Narodnost’  indicated 
that  in  practical  terms  Russian  people  were  the  core 
frame  of  reference  for  conceptualizing  and  enacting 
politics.  Conceptually,  it  implied  that  the  Russian 
people  along  with  the  Supreme  Lord  and  the 
Autocrat  were  a  source  of  power.  The  modern  prin¬ 
ciple  of  organizing  politics  bottom-up  was  implic¬ 
itly  recognized.  As  we  can  see  it  now,  it  was  a  major 
precondition  to  anticipated  modernization  and  still 
murky  democratization. 

The  tripartite  formula  was  a  step  forward  to 
rationalize  political  integrity  not  only  conceptually 
but  also  functionally  and  structurally.  It  was  con¬ 
fronted  with  the  strong  determination  of 
Slavophiles ,  such  as  Ivan  Kireevskiy  and  Alexei 
Khomiakov,  to  work  out  a  new  holistic  idea  for 
Russia  that  would  be  essentially  based  on  the 
integrity  of  Orthodox  ecclesia  and  Russian 
obscbina.  They  introduced  the  idea  of  Sobornost’ — 
holistic  spirituality  integrating  worldly  communi¬ 
ties  of  Orthodox  Russians.  This  new  idea  was 


potentially  extremely  revolutionary  since  it  could 
imply  that  there  was  no  need  either  for  absolutist 
Polizeistaat  machinery  or  for  the  state  as  such. 
Sobornost’  would  rely  not  on  the  mundane  practi¬ 
calities  of  rule  and  subordination  but  rather 
on  traditions  of  peasant  and  boyar  conciliarism 
having  clear  connections  with  the  sobors  of  the 
patriarchal  Muscovite  past  and  the  romanticized 
spiritual  integrity  of  the  people  and  the  tsar. 

Sovereignty  of  the  People 

In  the  discourse  of,  first,  Alexander  Herzen  and  then 
Nikolay  Chernyshevskiy  and  his  associates,  samod¬ 
erzhavie  tzarei  (autocracy  of  the  tsars)  was  opposed 
to  samoderzhavie  naroda  (popular  sovereignty).  A 
clearly  modern  way  of  thinking  was  embedded  into 
the  national  tradition.  Integration  was  a  clear  prior¬ 
ity  but  could  be  achieved  only  on  the  basis  of  narod 
(the  people)  becoming  a  true  sovereign  or  samoder- 
zhets  (autocrats).  Early  Russian  democrats  were 
very  optimistic  about  the  prospects  of  peasant 
democracy  and,  hence,  socialism  in  Russia.  In  1862, 
they  created  a  revolutionary  organization  Zemlia  i 
volia — land  and  liberty,  or  rather  unrestricted  self- 
will  (Russian  eojw,  English  will,  and  Greek  fkrvAip 
all  derive  from  the  uel — “to  wish”).  The  name  of  the 
organization  clearly  demonstrated  the  objective  to 
surpass  the  moderate  results  of  the  emancipation  of 
serfs  and  to  achieve  a  far  more  radical  redistribution 
of  land  alongside  with  provision  of  individual  liber¬ 
ties  of  all  kinds.  This  venture  was  short-lived.  It  was 
brutally  crushed  a  year  later  when  the  Polish  upris¬ 
ing  provoked  harsh  political  repressions  throughout 
the  whole  country. 

Soon  thereafter,  in  the  late  1860s  and  1870s,  a 
broader  movement  of  narodniks  (usually  translated 
as  “populists,”  but  a  more  exact  rendering  would 
be  “people  minded,  concerned  with  people’s  cause”) 
emerged.  From  1873  to  1875,  many  narodniks  left 
the  cities  for  villages  on  a  mission  of  khozhdenie  v 
narod  (literally  “going  into  the  people”)  in  an 
attempt  to  reintegrate  with  the  peasantry  and  thus 
bring  about  moral  and  social  revolution.  This  cam¬ 
paign  failed.  The  radicals  among  the  narodniks 
joined  terrorist  organizations,  such  as  Narodnaya 
Volia  (People’s  Will),  which  was  active  from  1879 
to  1883.  Some  others  joined  the  anarchist  move¬ 
ment,  inspired  by  Mikhail  Bakunin  and  Petr 
Kropotkin.  More  moderate  and  liberal-minded 
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persons  accepted  a  strategy  of  “small  ventures” 
and  became  active  in  local  self-government — the 
Zemstvo  movement. 

In  practical  terms,  Zemstvo — a  name  for  a  local 
representative  body,  literally  “community  of  a 
land”) — was  one  of  the  most  significant  achieve¬ 
ments  of  the  so-called  Great  Reforms.  In  1864, 
Alexander  II  created  self-governing  bodies  in  a 
number  of  provinces  and  localities.  Their  number 
as  well  as  competence  gradually  increased.  Zemstva 
ran  educational  and  medical  establishments,  devel¬ 
oped  all  kinds  of  infrastructural  projects,  and  so 
on.  What  is  particularly  important  is  that  they 
practiced  elections  to  representative  bodies,  for 
example,  representative  councils  (zemskoie  sobra- 
nie)  and  controlled  executive  ones  (e.g.,  executive 
boards — zemskaia  uprava).  This  was  an  invaluable 
democratic  experience  for  Russia.  Glasnost’  (usu¬ 
ally  translated  as  “openness”  but  deriving  from  the 
Russian  word  for  “voice”  thus  meaning  “voiceful- 
ness”)  became  an  important  key  word.  It  implied 
accountability  of  authority  and  importance  of 
public  opinion  in  the  running  of  reforms  (Bruce 
Lincoln,  1981).  Perestroika  (“rebuilding”)  was 
another  key  word  of  the  time. 

With  all  the  validity  of  local  and  autonomous 
self-government  of  specific  communities,  the 
imperative  of  Volonte  Generate  of  the  entire  peo¬ 
ple  still  prevailed.  Despite  the  growing  diversity  of 
the  political  and  intellectual  landscape  of  postre¬ 
form  Russia,  popular  feelings  and  intellectual 
efforts  focused  on  ideas  of  political  integrity.  One 
of  the  most  significant  instances  was  a  notion  of 
samoderzhavnaya  respublika  (autocratic  republic) 
introduced  by  the  Founding  Father  of  Russian 
liberalism,  Konstantin  Kavelin  (1818-1885).  His 
point  of  departure  was  “organic  integrity  (“one¬ 
ness”)  of  the  authority  and  the  people.”  On  that 
basis,  he  concluded,  “Since  the  people  (collective 
singular)  no  doubt  in  its  very  essence  are  (literally 
is)  autocratic,  so  single  with  it  authority,  eo  ipso 
must  be  autocratic.”  He  further  developed  his 
prognosis: 

Tsar  is  the  only  and  most  secure  bulwark  of 
peasantry  against  aristocratic  and  citizen  [mesch- 
anskikh,  literary  “town  dwellers”  meaning 
“bourgeois”]  constitutions.  In  future  as  well  he  is 
the  best  security  against  emergence  of  any 
privileged  ruling  classes.  There  is  no  doubt  that  by 


all  its  mass  Russia  would  follow  only  the 
autocratic,  i.e.  free  Tsar,  who  is  independent 
either  of  boyars  or  of  plutocrats.  The  history 
itself  makes  us  create  a  new,  unprecedented  and 
unique  political  order  that  no  other  wording 
would  fit  but  autocratic  republic.  (Kavelin,  1989, 
p.  436  ff). 

A  very  similar  political  order  emerged  after  the 
October  Revolution  of  1917.  But  the  project,  as 
well  as  the  conceptualization,  was  different.  During 
the  revolution  in  1905,  Lenin  started  to  speak 
about  Soviet  Power  as  a  new  form  of  political  orga¬ 
nization  or  direct  democracy  of  the  masses.  In 
1917,  the  slogan  “All  power  to  the  soviets!”  (Vsya 
vlast  sovyetam!)  was  used  by  the  Bolsheviks  to  suc¬ 
cessfully  crush  Kerenski’s  Provisional  Government 
and  to  establish  the  new  political  order. 

Democracy  of  Soviets 

Both  sovereign  and  autocratic  (samoderzbavnaia) 
power  reemerged  again  in  its  fourth  liberation  revo¬ 
lution  of  self-integration.  This  time  it  was  the 
so-called  Velikaia  Oktiabrskaia  Sotsialisticheskaia 
Revolutsia  (Great  October  Socialist  Revolution). 
Again,  the  grand  act  of  total  emancipation  turned 
into  outright  dictatorship.  Soviet  Power  was  highly 
ambivalent:  On  the  one  hand,  it  relied  on  mass 
participation  and  thus  had  a  clear  democratic  call¬ 
ing  coupled  with  the  institutional  form  of  direct 
democracy  of  Soviets — “Councils  of  Workers  and 
Peasants  Deputies” — but  on  the  other,  the  emer¬ 
gent  system  could  only  be  run  by  a  highly  inte¬ 
grated  and  disciplined — a  new  type  of — vanguard 
party.  Lenin,  in  his  seminal  1920  book,  “Left- 
Wing”  Communism:  An  Infantile  Disorder,  clearly 
fixed  a  hierarchy  of  power:  leaders,  party,  (work¬ 
ing)  class,  and  masses. 

Democratic  centralism  was  an  answer  to  the 
practical  running  of  the  country.  Initially,  it  was  a 
set  of  principles  of  internal  organization  of  the 
nascent  Russian  social-democratic  party  advocated 
not  only  by  Bolsheviks  but  also  by  Lenin  person¬ 
ally.  In  its  fuller  version,  democratic  centralism 
implied  five  key  points: 

1.  election  of  all  party  organs  from  bottom  to  top 
and  systematic  renewal  of  their  composition,  if 
needed; 


612  Democracy:  Russian  Perspectives 


2.  responsibility  of  party  structures  to  both  lower 
and  upper  structures; 

3.  strict  and  conscious  discipline  in  the  party — that 
is,  the  minority  must  follow  the  majority 
decisions  until  such  time  as  the  policy  is  changed; 

4.  decisions  of  upper  structures  mandatory  for  the 
lower  structures;  and 

5.  cooperation  of  all  party  organs  in  a  collective 
manner  at  all  times  and,  correspondingly, 
personal  responsibility  of  party  members  for  the 
assignments  given  to  them  and  for  the 
assignments  they  themselves  create. 

With  the  consolidation  of  Soviet  Russia,  those 
five  principles  were  introduced  into  the  internal 
machinery  of  the  Soviet  system  of  rule  with  very 
minor  adjustments  of  a  purely  technical  nature. 
Democratic  centralism  was  formally  fixed  in 
Article  7  of  the  Soviet  Constitution  of  1977  as  a 
principle  for  organizing  the  state: 

The  Soviet  state  is  organized  and  functions  on  the 
principle  of  democratic  centralism,  namely  the 
electiveness  of  all  bodies  of  state  authority  from 
the  lowest  to  the  highest,  their  accountability  to 
the  people,  and  the  obligation  of  lower  bodies  to 
observe  the  decisions  of  higher  ones.  Democratic 
centralism  combines  central  leadership  with  local 
initiative  and  creative  activity  and  with  the 
responsibility  of  each  state  body  and  official  for 
the  work  entrusted  to  them. 

Another  way  to  describe  the  Soviet  rule  was 
“totalitarian  democracy”  (Jacob  Talmon,  1952). 
In  his  book,  Talmon  explores  the  reshaping  of  gov¬ 
ernment  into  one  in  which  social  utility  takes  abso¬ 
lute  precedence.  While  for  Talmon  the  origins  of 
totalitarian  democracy  are  a  merger  of  the  interests 
of  the  individual  and  the  state  into  a  single  Volonte 
Generate,  in  the  Russian  case,  Rousseauist-Marxist 
and  Jacobin  Bolshevik  lineages  were  only  comple¬ 
mentary  to  the  recurring  mode  of  the  three  succes¬ 
sive  liberation  revolutions  of  self-integration  of  the 
country  and  its  people.  This  system  of  governance 
was  developing.  During  the  Khruschevian  period, 
the  concept  of  an  “All-Peoples  State”  stressing 
the  democratic  ideals  of  the  nation  of  USSR  was 
introduced.  The  Party  was  keen  to  maintain  the 


integrity  of  the  people  and  authority.  A  common 
slogan  of  the  period  was  that  the  people  and  the 
Party  are  one. 

Perestroika:  More  Democracy, 

More  Socialism 

In  March  1985,  Mikhail  Gorbachev  became  the 
General  Secretary  of  the  Communist  Party  of  the 
Soviet  Union  (CPSU).  The  key  notions  of 
Perestroika  and  Glasnost’  entered  public  discourse 
at  the  time  of  the  27th  Congress  of  the  CPSU  in 
February  1986.  Slogans  such  as  “more  democracy, 
more  socialism”  and  “back  to  Lenin”  implied  the 
embedding  of  democratic  reforms  in  Soviet  tradi¬ 
tion.  Yegor  Ligachev,  the  party  boss  from  Siberia 
whom  Gorbachev  had  brought  into  the  Politburo, 
asserted,  “The  Party  expresses  the  profound 
essence  of  perestroika  by  the  formula  ‘more  social¬ 
ism’”  (Pravda,  November  6,  1986). 

The  terms  democracy  and  democratization, 
which  had  been  used  in  various  formulas  by  the 
Party  in  all  periods,  became  essentially  contested 
concepts.  Typical  of  perestroika  jargon  was  the 
“democratization”  of  everything.  During  the 
Congress,  Gorbachev  himself,  as  well  as  others, 
spoke  about  the  “free  revelation  (vyjavlenie)  of 
interests  and  the  will  of  all  classes  and  social 
groups”  and  about  the  “self-regulation  and  self- 
government  of  society.”  Though  far-reaching  in 
their  implications,  such  statements  gave  no  details 
of  the  institutional  designs;  a  vague  notion  of 
“democratization”  covered  everything.  When  the 
legislative  framework  did  appear,  it  was  presented 
as  a  return  to  original  Soviet  institutions.  Although 
the  reforms  marked  a  radical  departure  in  the 
country’s  political  structures,  the  constitutional 
arrangements  adopted  by  the  Supreme  Soviet  in 
November  1988  also  left  scope  for  traditional 
structures.  Citizens  would  directly  elect  the  People’s 
Deputies,  who  would  constitute  the  Congress  of 
People’s  Deputies,  which  would  in  turn  elect  a 
bicameral  Supreme  Soviet.  One  third  of  the  depu¬ 
ties  would  be  elected,  not  by  territory  but  by  “pub¬ 
lic  organizations”  such  as  the  Communist  Party, 
the  Young  Communist  League,  and  the  trade 
unions.  After  a  turbulent  campaign,  the  first  com¬ 
petitive  elections  were  held  on  March  26,  1989, 
and  more  than  30  top  party  officials  were  defeated. 
The  First  Congress  of  the  People’s  Deputies  of  the 
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USSR  met  in  May  1989.  The  Extraordinary  Fifth 
Congress  met  in  September  1991  only  to  start  the 
process  of  dismantling  the  USSR. 

From  a  Crippled  to  a  Sovereign  Democracy 

The  end  of  the  USSR  in  December  1991  marked  a 
new  stage  in  the  fifth  liberation  revolution.  It  was 
the  stage  of  consolidating  new  polities  all  claiming 
to  become  democracies.  Various  aspects  of  auto¬ 
cratic  and  democratic  traditions  as  well  as  of  their 
uneasy  symbiosis  manifested  themselves  in  each  of 
the  cases  of  the  15  new  independent  states.  In  each 
case,  the  grand  act  of  total  emancipation  turned 
into  specific  configurations  of  political  institutions 
and  practices.  For  the  most  part,  those  configura¬ 
tions  were  and  continue  to  be  highly  ambiguous 
and  contradictory.  It  was  very  symbolic  that  the  key 
word  for  the  Boris  Yeltsin  regime  was  sovereignty — 
not  samoderzhavie  this  time,  but  still,  integrity  of 
power  was  very  much  the  main  political  issue. 

In  the  Russian  Federation,  interpretations  of 
democracy  and  of  new  institutions  of  the  country 
differed  immensely,  not  only  ideologically  or 
partywise  but  also  conceptually.  There  was  a  great 
gap  between  normative  and  pragmatic  interpreta¬ 
tions  of  democracy  as  such  and  the  emerging 
democracy  of  the  new  Russia  in  particular.  Often, 
if  not  typically,  both  visions  were  intricately  inter¬ 
twined.  An  apt  example  was  the  phase  attributed 
to  Deputy  Prime  Minister  and  actual  head  of  the 
cabinet  from  1991  to  1994,  Yegor  Gadar — 
“Russia  is  a  crippled  democracy,  but  it  is  a  democ¬ 
racy  at  the  end  of  the  day.  ” 

Feelings  that  the  newborn  Russian  democracy 
was  inefficient  and/or  insufficient  were  very  wide¬ 
spread  in  all  political  quarters  of  the  country,  but 
interpretations  were  quite  different.  That  there  was 
a  great  gap  between  the  expectations  and  results 
achieved  could  not  be  denied.  Some  attributed  the 
problems  to  the  loss  of  an  independent  and  consoli¬ 
dated  power  base  of  the  regime.  For  others,  the 
problem  lay  in  the  inability  of  the  regime  to  stand  up 
to  normative  ideals  and  best  international  practice. 
While  the  first  view  seemed  more  practical  and  con¬ 
sonant  with  Russian  traditions,  the  second  one  was 
clearly  impractical,  wishful,  and  foreign.  The  first 
one  gradually  prevailed  both  in  the  minds  of  the 
authorities  and  the  populace  in  general.  The  second 
view  was  characteristic  of  dogmatic  “democrats” 


and  zealous  critics  of  the  Yeltsin  and,  later,  the 
Vladimir  Putin  regimes. 

It  was  only  natural  that  gradual  consolidation 
of  the  political  order  and  infrastructure  of  gover¬ 
nance  led  to  a  reconceptualization  of  the  constitu¬ 
tional  framework  of  the  Russian  Federation  as 
sovereign  democracy  (suveryennaya  demokratiya). 
This  formula  was  first  introduced  by  Vladislav 
Surkov,  the  deputy  head  of  the  presidential  admin¬ 
istration,  in  his  speech  to  the  trainees  of  United 
Russia’s  Center  for  Party  Personnel  Training  on 
February  7,  2006.  Soon,  the  text  was  published  as 
an  article,  “The  Nationalization  of  the  Future”  in 
Expert  magazine.  It  explained  the  notion  as  a 
proper  synonym  for  “political  competitiveness.” 
Critics  were  quick  to  interpret  the  notion  as  a 
cover  for  exceptional  democracy  that  would  not 
tolerate  any  outside  criticism.  While  this  may  be  a 
secondary  interpretation,  the  champions  of  sover¬ 
eign  democracy  in  fact  insist  that  it  highlights  the 
great  power  status  of  Russia.  Another  possible,  but 
unfortunately  not  widespread,  interpretation  is 
that  of  a  sovereign  (constitutional,  etc.)  state 
working  to  develop  democracy. 

It  is  very  indicative  that  both  Putin  and  Dmitry 
Medvedev  deliberately  refrained  from  public  sup¬ 
port  of  the  concept.  Rather,  they  voiced  their  res¬ 
ervations  but  agreed  that  the  idea  was  worth  pon¬ 
dering  about.  Still,  it  cannot  be  denied  that  with 
the  notion  of  sovereign  democracy,  traditional 
priorities  of  political  integrity  have  gained  momen¬ 
tum.  Although  institutional  and  procedural  aspects 
of  accountability  of  authorities  and  of  transpar¬ 
ency  remain  grossly  underdeveloped,  sovereign 
democracy  gives  room  for  further  promotion  and 
anchoring  of  democratic  institutions. 

This  short  outline  of  Russia’s  leanings  toward 
democracy  shows  that  it  is  wrong  to  treat  all  its 
traditions  as  outright  autocratic  and  democracy  as 
something  that  had  to  start  there  from  scratch. 
Russia’s  democratization  story  began  as  far  back  as 
the  mid-19th  century.  Its  heritage  is  rich  albeit  often 
tragic.  Temptations  to  achieve  radical  breakthroughs 
to  “pure”  or  complete  integrity  of  authority  and  the 
people  (far  older  than  the  democratization  story 
itself)  recurrently  led  to  reestablishing  the  autocratic 
“integration”  of  Russia  top-down.  The  present 
phase  of  sovereign  democracy  may  be  replaced 
by  further  moves  toward  more  consequential 
accountability  of  authorities  and  popular  political 
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participation  as  long  as  the  essential  elements  of 
the  Russian  democratic  tradition  are  adequately 
used  and  pitfalls  of  radicalism  avoided  by  the 
gradual  growth  of  moderation  and  readiness  to 
accept  compromise  solutions. 

Mikhail  Ilyin 
MGIMO  University 
Moscow,  Russian  Federation 
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Democratic  Consolidation 


Democratic  consolidation  may  be  described  as  the 
process  of  defining  and  firmly  establishing  the 


essential  characteristics  (and  adjusting  the  second¬ 
ary  ones)  of  the  structures  and  norms  inherent  in  a 
democratic  regime,  which  comes  about  also,  but 
not  exclusively,  with  the  passage  of  time.  Institutions, 
procedures,  practices,  customs,  and  routines  are 
defined  and  adapted,  and  at  the  same  time,  the 
structures  and  regulations  for  the  peaceful  resolu¬ 
tion  of  conflicts  become  accepted,  thus  strengthen¬ 
ing  the  regime’s  legitimization.  In  short,  consolida¬ 
tion  is  one  of  the  possible  outcomes  arising  out  of 
the  introduction  of  democracy:  This  may  result  not 
only  in  a  fully  functioning  democracy  but  also  in  its 
bare  survival  or  in  a  new  crisis  of  the  political  sys¬ 
tem  and  a  subsequent  shift  toward  authoritarian¬ 
ism.  This  entry  defines  democracy  and  clarifies  the 
principal  meanings  of  consolidation  used  in  this 
analysis.  After  a  brief  discussion  of  some  criticisms 
of  the  concept,  the  entry  examines  the  empirical 
dimension  and  the  mechanisms  through  which 
democracies  can  be  consolidated.  Finally,  following 
an  analysis  of  the  factors  that  favor  this  process, 
some  concluding  considerations  are  offered. 

Definitions 

Defining  the  concept  requires  first  of  all  (a)  identi¬ 
fying  when  a  government  can  claim  to  be  at  least 
minimally  democratic  and  (b)  outlining  the  differ¬ 
ent  types  of  democracy  along  a  continuum  running 
from  partial  forms  of  democracy  to  more  complete 
ones.  As  far  as  the  first  objective  is  concerned,  one 
can  start  with  the  well-accepted  notion  of  polyar¬ 
chy,  whereby  a  political  regime  is  democratic  when 
key  civil  and  political  rights  are  protected  and  elec¬ 
tions  are  free,  transparent,  and  competitive.  The 
minimum  forms  of  democracy  have  been  defined 
as  “electoral  democracies”:  In  these  democracies, 
some  of  the  procedural  requisites  of  polyarchy  are 
respected,  but  elections  are  only  partially  free,  and 
the  chances  of  an  opposition  victory  are  limited. 
For  a  regime  of  this  type  to  claim  to  be  democratic, 
however,  election  results  must  reflect  the  will  of 
the  voters,  and  in  particular,  it  must  be  possible  to 
remove  unpopular  politicians  from  power.  This 
necessitates  an  open  electoral  arena,  with  substan¬ 
tial  freedom  for  parties  and  candidates  to  publicize 
their  ideas  and  canvass  votes,  along  with  guaran¬ 
tees  that  they  may  speak,  publish,  meet,  organize, 
and  move  freely  within  the  country  to  pursue  their 
objectives  peacefully. 
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In  some  historical  and  geographical  contexts, 
democracies  have  achieved  a  certain  level  of  matu¬ 
rity,  the  government  and  public  administration 
operate  better,  the  judicial  system  is  more  indepen¬ 
dent  and  effective,  parties  and  interest  groups  are 
better  structured,  the  democratic  political  culture  is 
more  robust,  and  civil  society  is  autonomous  and 
vibrant.  Likewise,  governments  react  responsibly 
to  requests  from  voters  and  social  organizations 
and  endeavor  to  represent  the  interests  of  all — 
including  marginal  and  weaker  groups  and  ethnic, 
cultural,  and  religious  minorities.  In  this  case,  one 
can  legitimately  speak  of  “liberal  democracies.” 

To  consolidate  a  democracy  implies  strength¬ 
ening  it.  But  democracy  can  be  strengthened  in 
different  ways.  Originally,  it  was  thought  that 
strengthening  a  democracy  was  synonymous  with 
making  it  “secure,”  extending  its  “life  expec¬ 
tancy”  and  safeguarding  it  from  the  danger  of 
authoritarianism.  Subsequently,  this  interpreta¬ 
tion  was  widened  to  incorporate  a  series  of  other 
“missions.”  The  resulting  long  list  thus  includes 
neutralizing  antisystem  actors,  building  a  party 
system,  organizing  economic  and  social  interests, 
stabilizing  electoral  laws,  decentralizing  state 
power,  reforming  the  judicial  system,  and  solving 
problems  of  poverty  and  underdevelopment.  The 
new  objective,  therefore,  seems  to  be  that  of  get¬ 
ting  as  close  as  possible  to  the  organizational 
forms  and  political  and  social  performances  of 
advanced  Western  democracies.  In  this  sense, 
strengthening  a  democracy  means  deepening  its 
characteristics  and  realizing  its  full  potential. 

Seen  in  this  light,  democratic  consolidation  takes 
on  two  very  different  meanings.  The  first  is  sub¬ 
stantially  negative:  It  implies  avoiding  authoritari¬ 
anism — that  is,  moving  from  an  “electoral  democ¬ 
racy”  to  a  new  authoritarian  regime — or  prevent¬ 
ing  the  erosion  of  democracy  and  its  recession  from 
advanced  forms  to  other  more  precarious  ones.  In 
the  first  case,  the  danger  is  that  of  a  sudden  death, 
that  is,  of  democratic  breakdown;  in  the  second, 
the  risk  is  of  impoverishment  and  a  slow  death,  of 
a  gradual  unraveling  of  democracy  and  an  almost 
imperceptible  return  to  authoritarian  political 
forms.  The  second  meaning  of  consolidation  is 
positive.  Consolidating  democracy  in  this  sense 
means  bringing  it  to  fruition,  progressing  from  an 
initial  form  to  other  more  complete  forms,  from 
“electoral”  forms  of  democracy  to  a  democracy  in 


which  civil  and  political  rights  are  more  numerous, 
more  detailed,  and  better  protected.  According  to 
many,  social  and  economic  rights  should  also  be 
fully  protected,  and  consolidating  democracy  thus 
means  creating  an  “advanced  democracy,”  which 
represents  the  ideal  goal  of  more  progressive  polit¬ 
ical  practices. 

Which  of  these  interpretations  is  preferable?  It 
seems  appropriate  to  limit  the  term  to  the  first  set 
of  meanings,  the  negative  ones  and  in  particular  to 
that  of  survival:  Consolidation  should  imply  only 
the  expectation  of  the  regime’s  endurance.  Some 
find  the  other  definitions  problematic,  since  democ¬ 
racy  and  its  identity  cannot  be  statically  and 
definitively  fixed,  and  its  contents  are  in  perennial 
movement,  always  open  to  new  clarifications. 
Hence,  it  would  be  analytically  impossible  and 
futile  to  attempt  to  define  once  and  for  all  what 
constitutes  a  “good  democracy”  and  a  “finally 
consolidated”  democracy.  Given  the  confusion 
surrounding  it,  the  concept  of  positive  consolida¬ 
tion  associated  with  this  interpretation  is  equally 
empty.  Those  scholars  who  believe  that  one  can 
study  democratic  performance  and  its  potential  for 
full  realization  through  other  concepts,  such  as  the 
quality  of  democracy,  have  reached  similar  conclu¬ 
sions:  For  them,  too,  the  typical  sphere  of  consoli¬ 
dation  remains  the  original  one,  rooted  in  the 
prospects  of  democracy’s  maintenance  and  mere 
survival  rather  than  the  possibility  of  full  demo¬ 
cratic  realization,  which  would  again  involve  the 
concept  of  quality. 

Although  they  are  always  changeable  and  unde¬ 
fined,  the  specific  contents  of  democracy  are  not 
entirely  indeterminate  and  are  relevant  to  the  pros¬ 
pects  for  consolidation.  In  fact,  it  is  no  coincidence 
that  the  consolidation  of  modern  democracies 
requires  the  crafting  of  mutually  acceptable  rules 
that  concern  crucial  themes  such  as  negotiation 
between  capitalists  and  workers,  guarantees  of  dif¬ 
ferent  forms  of  property,  and  assurances  concern¬ 
ing  levels  of  social  protection  to  be  established  or 
defended.  In  more  general  terms,  democracy  enjoys 
better  consolidation  prospects  when  the  major 
political  actors  have  a  stake  in  its  survival.  In  this 
setup,  actors  outside  the  electoral  arena,  especially 
the  military  and  entrepreneurs,  receive  guarantees 
that  their  vital  interests  will  be  respected.  Likewise, 
political  parties  and  movements  not  in  government 
must  have  realistic  chances  of  taking  power 
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through  elections.  Only  when  an  agreement  of  this 
type  exists  can  they  too,  over  time,  become  accus¬ 
tomed  to  democracy.  Once  they  realize  that  their 
initial  fears  were  mainly  groundless,  they  can 
transform  lukewarm  acceptance  of  the  new  regime 
into  full  support. 

Criticism 

Some  political  scientists  have  (at  times  sharply) 
criticized  the  concept  of  democratic  consolidation 
and  deemed  it  unacceptable  on  account  of  its 
strong  teleological  flavor.  Cases  that  have  not 
achieved  full  institutionalization  are  seen  as  endur- 
ingly  unconsolidated.  Such  a  view  presupposes 
that  there  are  factors  working  in  favor  of  increased 
consolidation  or  institutionalization  but  that  coun¬ 
tervailing  obstacles  obstruct  a  process  of  change 
that  otherwise  would  operate  unfettered.  The  cur¬ 
rent  existence  of  many  countries  that  have  been  in 
a  state  of  failed  or  partial  consolidation  for 
decades,  however,  casts  the  existence  of  this  linear 
progression  into  doubt.  To  claim  that  consolida¬ 
tion  occurs  only  when  the  behavior  of  the  main 
political  actors  is  adapted  to  the  formal  rules  of 
democracy  would  be  to  ignore  the  experience  of 
many  countries,  such  as  India  or  Italy,  that  are 
widely  considered  to  be  consolidated  democracies 
but  in  which  particularism  and  patronage  remain 
widespread.  Others,  finally,  find  no  trace  of  a  spe¬ 
cific  empirical  dimension  of  the  concept,  which 
should  be  observable  if  the  chances  of  a  demo¬ 
cratic  regime’s  survival  over  time  were  increased 
simply  because  it  has  already  survived  for  a  suffi¬ 
ciently  long  period. 

Empirical  Indications  of  Consolidation 

Many  political  scientists  have  focused  their  atten¬ 
tion  on  this  dimension,  wondering,  at  first,  how  a 
consolidated  democratic  regime  can  be  empirically 
recognized.  Even  if  this  analysis  is  limited  to  the 
prospects  of  regime  survival,  as  discussed  above, 
the  answer  to  this  question  is  problematic.  First 
and  foremost,  it  is  important  to  note  that  some 
have  studied  the  behavior  of  political  actors,  and 
others  their  attitudes.  The  former  in  particular 
have  concentrated  on  antidemocratic  behavior  as  a 
symptom  of  the  weakness  of  a  democracy  and  have 
tried  to  determine  how  and  when  such  behavior 


violates  its  basic  rules.  Principally,  democracy  is 
challenged  when  political  actors  try  to  achieve 
their  goals  through  violence.  These  acts  include  a 
series  of  actions  ranging  from  the  systematic 
destruction  of  public  or  private  property  to  the 
assassination  of  political  adversaries,  and  to 
attempts  to  forcibly  overturn  the  government  and 
intimidate  voters  and  candidates.  The  refusal  to 
recognize  the  validity  of  genuinely  free  political 
elections  can  also  trigger  an  irreversible  demo¬ 
cratic  crisis,  for  example,  by  deciding  not  to  par¬ 
ticipate  in  voting,  stopping  others  from  participat¬ 
ing,  or  manipulating  electoral  results  at  will. 
Finally,  consolidation  implies  that  political  actors 
must  respect  the  laws  and  the  constitution,  and 
accept  the  agreed  rules  of  political  behavior.  When 
the  government  violates  the  established  rules  while 
legislating,  enforcing  laws,  interpreting  them,  or 
resolving  conflicts,  and  especially  when  these  vio¬ 
lations  become  customary,  the  survival  of  democ¬ 
racy  is  severely  at  risk. 

Having  clarified  these  essential  points,  many 
other  questions  nevertheless  remain  unanswered. 
For  example,  which  levels  of  antidemocratic 
behavior  are  sufficient  to  trigger  a  democratic  cri¬ 
sis  and  can  therefore  be  considered  a  symptom  of 
de-consolidation?  It  is  obvious  that  the  higher  the 
levels,  the  greater  the  risk  for  the  new  democracy, 
but  it  is  also  possible  that  relatively  low  levels  can 
accumulate  over  time  to  produce  a  crisis  that  may 
be  unexpected.  And  it  is  perfectly  possible  that  a 
democratic  crisis  will  serve  to  bolster  the  new 
democratic  regime  rather  than  destroy  it.  This  was 
the  case,  for  example,  with  Colonel  Antonio 
Tejero  Molina’s  attempted  coup  d’etat  in  Spain  in 
1981.  Finally,  consolidation  can  be  considered 
authentic  only  if  it  guarantees  democratic  survival 
when  adverse  external  conditions  are  present.  It  is 
essential,  for  example,  that  a  governing  party 
peacefully  accepts  giving  up  power  in  the  event  of 
an  opposition  election  victory.  The  turnover-in- 
government  test  reflects  this  rationale,  and  if  turn¬ 
over  takes  place  twice,  then  the  test  is  even  more 
stringent:  In  this  latter  case,  not  only  are  we  then 
certain  that  two  large  democratic  parties  have  been 
formed  but  also  that  elite  groups  and  public  opin¬ 
ion  fully  accept  the  functioning  of  the  democratic 
mechanism,  as  they  have  proved  that  they  are  will¬ 
ing  to  change  the  government,  but  not  the  demo¬ 
cratic  regime,  when  things  have  gone  badly. 
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A  second  empirical  indicator  of  consolidation 
concerns  the  attitudes  of  political  actors  as  regards 
both  normative  and  cognitive  aspects.  Democracy 
is  consolidated  normatively  when  its  citizens  con¬ 
sistently  give  it  genuine  support  and  value  it  as  an 
intrinsic  good,  not  merely  an  instrumental  one.  For 
many,  this  transformation  signals  the  conclusion 
of  the  process:  Although  democracies  can  survive 
at  relatively  low  levels  of  consensus,  it  is  undoubted 
that  marked  legitimacy  provides  significant  help  in 
tackling  and  overcoming  moments  of  crisis,  includ¬ 
ing  acute  ones.  The  most  widely  used  tool  to  deter¬ 
mine  whether,  and  to  what  extent,  the  citizens  of 
new  postauthoritarian  regimes  appreciate  democ¬ 
racy  is  opinion  polling.  Undoubtedly,  this  is  still  a 
fragile  tool,  especially  if  polls  consist  of  generic 
questions  that  are  difficult  to  interpret  with  any 
certainty,  such  as  those  concerning  public  prefer¬ 
ence  for  democracy  over  other  political  regimes. 
This  is  not  only  because  alternative  experiences  to 
democracy  in  some  cases  are  only  a  distant  mem¬ 
ory  but  also  because  the  dimensions  and  experi¬ 
ences  linked  to  democracy  are  too  numerous,  and 
in  part  contradictory,  to  allow  for  a  single  reply. 
Among  the  attitudes  to  consider  are  the  demo¬ 
cratic  expectations  of  political  actors,  that  is, 
the  extent  to  which  they  think  democracy  will 
survive — a  cognitive  aspect  relating  to  personal 
evaluations.  Again,  as  in  the  case  of  public  consen¬ 
sus,  opinion  polls  are  a  significant,  though  not 
always  complete,  tool  to  analyze  some  of  their 
empirical  features. 

Mechanisms  of  Consolidation 

Having  attempted  to  define  the  concept  in  its  theo¬ 
retical  and  empirical  components,  this  entry  now 
looks  at  how  it  operates.  Which  processes  accom¬ 
pany  the  consolidation  of  democracy?  Who  are  the 
actors  and  what  strategies  do  they  employ? 
Consolidation  implies  the  definition  of  relations 
between  representative  and  governing  institutions 
and  civil  society.  These  relationships  develop  in 
two  directions:  from  the  bottom  upwards,  that  is, 
from  society  to  the  institutions,  and  from  the  top 
downward,  in  other  words,  from  the  newly  created 
institutions  through  intermediate  structures  to 
society.  In  the  first  case,  it  is  civil  society  that  backs 
the  regime  through  a  process  of  legitimization  and 
support.  It  is  worth  noting  that  some  scholars 


claim  that  the  crucial  aspect  for  the  survival  of  a 
regime  is  not  active  support  but  rather  the  lack  of 
organized  alternatives.  Consensus,  therefore,  may 
often  be  only  acquiescence  and  passive  acceptance 
of  the  status  quo.  However,  no  matter  how  one 
defines  the  foundations  of  legitimization,  it  must 
be  stressed  that,  if  legitimization  is  widespread,  if 
all  political  organizations  are  integrated  and 
involved  in  accepting  and  supporting  democratic 
institutions,  then  an  important  step  toward  demo¬ 
cratic  consolidation  has  already  been  taken  in 
practice.  By  contrast,  where  support  is  limited, 
consolidation  is  only  possible  through  the  opposite 
mechanism,  known  as  “anchoring,”  which  oper¬ 
ates  from  the  top  down. 

The  key  “anchors”  of  democratic  consolidation 
are  political  parties  and  their  organizations  (par¬ 
ties  may  be  more  or  less  institutionalized),  their 
gate-keeping  function  (with  which  they  control  the 
access  of  interest  groups  to  the  decision-making 
arena  and  establish  priorities  among  different 
demands),  their  links  of  patronage  (particularistic 
relationships  with  large  and  small  private  entrepre¬ 
neurs,  intellectuals,  and  ordinary  citizens),  and 
neocorporatist  agreements.  In  sum,  political  par¬ 
ties  exercise  their  function  of  political  “anchors” 
by  organizing,  integrating,  and  sometimes  even 
controlling  general  civil  society  and  the  specific 
sectors  that  make  it  up,  consequently  helping 
strengthen  the  democratic  political  regime.  The 
crucial  point  is  that  the  process  of  consolidation 
and  the  particular  mix  of  its  two  components 
(legitimization  and  anchorage)  give  rise  to  differ¬ 
ent  consolidation  formulas  in  different  countries, 
thus  enabling  one  to  determine  the  type  of  democ¬ 
racy  that  derives  from  it.  For  example,  in  Italy, 
legitimization  of  the  democratic  regime  created 
after  World  War  II  was  limited,  and  the  parties 
established  a  position  of  domination  over  interest 
groups,  giving  rise  to  consolidation  of  the  party- 
based  type.  By  contrast,  in  Spain  after  Franco’s 
death,  the  parties  were  generally  weaker  than  in 
Italy  and  were  more  neutral  toward  interest 
groups.  However,  in  the  period  after  democratiza¬ 
tion,  the  new  regime  acquired  strong  legitimacy, 
partially  due  to  the  role  played  by  the  party  elites: 
Spain  is  therefore  an  example  of  consolidation 
through  the  elite.  It  should  be  noted,  though,  that 
especially  if  they  have  contributed  to  democratic 
consolidation,  the  principal  “anchors”  may  not  be 
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very  adaptable  to  the  changing  circumstances  of 
political  life,  and  a  series  of  factors — long  perma¬ 
nence  in  power,  lack  of  turnover,  absence  of  coali¬ 
tion  partners  in  government — may  throw  the  new 
democratic  regime,  and  the  prevalent  type  of  con¬ 
solidation,  into  crisis.  This  in  turn  may  lead  to 
instability  and  produce  the  contrary  phenomenon 
of  “disanchoring”  of  civil  society  from  politics. 

Factors  Influencing  the 
Success  of  Consolidation 

The  conditions  that  favor  the  success  of  demo¬ 
cratic  consolidation  remain  to  be  discussed.  The 
list  is  long;  here  a  general  outline  is  provided,  dis¬ 
tinguishing  between  political,  economic,  and  inter¬ 
national  scenario-related  factors.  Among  the  first, 
the  nature  of  the  previous  authoritarian  regime  is 
significant:  Some  scholars  argue  that  if  political 
repression  was  intense  or  if  the  authoritarian 
regime  collapsed  after  military  defeat,  the  new 
democratic  government  will  enjoy  “inverse”  legiti¬ 
mization  thanks  simply  to  having  replaced  a 
widely  discredited  government;  however,  this  effect 
does  not  last  long.  Others  are  of  the  view  that 
democratic  structures  are  more  easily  reinforced  if 
less  violence  has  been  used  and  the  attitude  of  the 
defeated  authoritarian  government  is  one  of  com¬ 
promise  and  negotiation  with  the  democratic 
forces. 

Previous  democratic  experiences  may  be  of 
more  significance:  If  some  democratic  institutions 
were  maintained  during  the  authoritarian  interval, 
consolidation  is  more  likely  to  be  successful,  com¬ 
pared  with  a  scenario  in  which  such  institutions 
have  to  be  created  ex  novo.  In  the  first  case,  the 
prior  democratic  experience  greatly  facilitates  the 
restoration  of  previous  alliances  between  parties 
and  groups  and  the  reemergence  of  associations, 
trade  unions,  and  other  organizations  that  support 
democracy.  Indeed,  during  the  20th  century,  few 
countries  succeeded  in  introducing  a  stable  and 
lasting  democracy  at  the  first  attempt.  Nonetheless, 
more  recent  experiences,  involving  the  progressive 
democratization  of  countries  with  little  or  no  pre¬ 
vious  democratic  history,  may  open  new  and  inter¬ 
esting  scenarios  for  the  survival  of  democracy  in 
such  cases. 

Certain  structural  elements,  such  as  some  degree 
of  ethnic  and  religious  homogeneity  and  the 


absence  of  deep  ideological  divisions  or  severe  eco¬ 
nomic  and  social  inequality,  appear  to  favor  the 
emergence  of  a  strong  political  culture,  which  is 
another  important  determinant  for  consolidation. 
These  characteristics  are  also  associated  with  a 
robust  civil  society.  Democratic  consolidation  is 
thus  held  to  benefit  from  associations  and  institu¬ 
tions  that  are  strongly  rooted  in  civil  society,  pro¬ 
vided  that  these  try  to  achieve  their  own  interests 
also  in  opposition  to  political  power,  in  turn 
restricting  the  desire  of  the  latter  to  expand  dispro¬ 
portionately.  In  order  to  be  effective,  these  associa¬ 
tions  must  be  strong,  centralized,  and  willing  to 
mobilize  for  political  objectives;  otherwise,  the 
government  will  find  it  easy  to  deal  with  those 
opposing  it,  whether  by  intimidating  them  or  buy¬ 
ing  their  services. 

Among  the  political  institutions  that  facilitate 
the  process,  parties  are  undoubtedly  of  central 
importance;  this  entry  has  already  mentioned 
their  role  in  the  development  of  consolidation,  as 
agents  promoting  legitimacy  bottom-up  and,  if 
this  fails,  anchoring  it  top-down.  Parliamentary 
systems  are  thought  by  some  scholars  to  be  more 
favorable  to  democratic  consolidation  than  presi¬ 
dential  ones,  since  the  former  reduce  the  harsher 
aspects  of  political  confrontation  through  the 
creation  of  party  coalitions  necessary  to  form  a 
government  and  because  they  provide  a  better  bal¬ 
ance  between  the  powers  of  the  president  and 
those  of  the  head  of  government.  Electoral  sys¬ 
tems  are  also  the  subject  of  debate,  with  the  ques¬ 
tion  posed  of  which  more  effectively  strengthens 
democracy:  a  proportional  system,  which  better 
represents  the  claims  of  economic,  social,  local, 
and  ideological  minorities,  or  a  majoritarian  sys¬ 
tem,  seen  as  more  capable  of  delivering  a  clearer 
and  more  credible  governing  alternative?  Although 
the  empirical  evidence  remains  somewhat  ambig¬ 
uous,  the  prevailing  current  of  thought  is  that 
parliamentary  systems  are  better  in  this  sense  than 
presidential  ones,  especially  if  the  latter  are  com¬ 
bined  with  a  proportional  party  system,  which 
multiplies  its  limitations. 

A  second  group  of  conditions  that  facilitate 
democratic  consolidation  are  economic  in  nature 
and  involve  two  possible  scenarios.  Reaching  cer¬ 
tain  levels  of  development  is  thought  to  reflect  the 
fact  that  a  democratic  regime  has  been  consoli¬ 
dated  and  that  it  no  longer  runs  a  significant  risk 
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of  lapsing  into  authoritarianism.  In  Argentina,  the 
per  capita  income  in  1975  was  above  $6,000,  still 
one  of  the  highest  levels  of  wealth  ever  associated 
with  a  democracy  that  was  overturned.  However, 
wealth  is  not  necessary  for  consolidation:  Even 
poorer  countries  can  maintain  a  democratic  regime 
if  they  can  generate  strong  economic  growth, 
accompanied  by  moderate  and  stable  levels  of 
inflation  and  if  they  extend  the  new  wealth  to  the 
majority  of  the  population.  The  probabilities  of  a 
democracy  dying,  in  particular,  also  appear  to  vary 
inversely  with  economic  inequality,  in  line  with 
what  was  already  suggested  in  the  1960s.  In  brief, 
it  is  reasonable  to  hypothesize  that  a  modern 
industrialized  economy,  in  a  complex  relatively 
egalitarian  society  with  an  educated  population, 
favors  the  maintenance  of  democracy.  One  of  the 
implications  of  this  analysis  is  that  while  economic 
crises  destabilize  all  democracies,  they  hit  poor 
democracies  particularly  hard  since  they  are  more 
vulnerable  to  weak  economic  performance. 
Consolidation  is  also  helped  by  the  presence  of  an 
adequate  social  welfare  system.  If  democratic  gov¬ 
ernments  succeed  in  reducing  social  and  economic 
inequality  through  appropriate  social  policies,  dis¬ 
order  and  rebellion  in  the  poorer  countries  will  be 
reduced,  and  in  all  cases,  the  sensation  of  the 
legitimacy  and  efficacy  of  the  new  regime  will  be 
strengthened.  Indeed,  recent  studies  have  docu¬ 
mented  the  growth  in  social  spending  during  the 
phases  following  the  introduction  of  democracy  in 
several  of  the  third  wave  of  democratic  transitions, 
with  taxation  often  having  been  increased  to 
finance  and  extend  welfare  systems,  reduce  pov¬ 
erty  levels,  improve  medical  assistance,  and  expand 
education. 

The  international  scenario  and  external  actors 
may  also  be  of  considerable  importance.  Think, 
for  example,  of  the  role  played  by  Federal  Germany 
in  the  democratization  and  political  consolidation 
of  East  Germany  and  that  of  the  European 
Community  (later  the  European  Union)  with 
regard  to  Southern  and  Eastern  European  coun¬ 
tries.  Especially  in  the  cases  of  more  recent  democ¬ 
ratizations,  the  international  system  has  either 
facilitated  or  hindered  existing  democratic  tenden¬ 
cies:  in  the  former  cases,  acting  to  support  them 
when  the  democratic  context  was  widespread  and 
solid,  and  in  the  latter  cases,  acting  as  an  obstacle 
when  prevalent  governments  and  political  regimes 


were  hostile  or  indifferent  to  democracy.  It  is 
worth  noting  that  some  paradoxical  and  unex¬ 
pected  effects,  however,  for  example,  in  some 
cases,  the  excessive  encouragement  of  democrati¬ 
zation  by  powerful  international  actors — before 
adequate  economic  and  social  conditions  have 
taken  root  in  the  country  in  question — have  exac¬ 
erbated  problems  of  consolidation.  There  is  also  a 
danger  that  regimes  to  which  aid  is  destined  are 
satisfied  with  acquiring  a  veneer  of  international 
legitimacy,  without  actually  modifying  the  author¬ 
itarian  nature  of  domestic  power  management. 

Conclusion 

The  concept  of  democratic  consolidation  is  com¬ 
plex,  problematic,  and  to  some  extent  contradic¬ 
tory.  This  makes  it  all  the  more  important  to 
clarify  and  circumscribe  the  meaning  of  the  term. 
It  has  been  suggested  by  some  scholars  that  the 
concept  be  limited  to  its  original  significance,  that 
is,  the  survival  of  democracy  to  avoid  the  risk  of  an 
authoritarian  regression.  As  proposed  above,  this 
implies  limiting  its  application  to  the  “negative” 
sphere:  preventing  democratic  collapse  or  protect¬ 
ing  democracy  from  a  slower  death  through  ero¬ 
sion.  First  and  foremost,  this  choice  enables  us  to 
reestablish  the  many  meanings  associated  with  the 
concept,  which  can  then  be  used  more  consciously 
and  consistently.  The  other  sense  in  which  it  is 
widely  used — that  of  completing  and  satisfying  the 
many  “unfulfilled  promises”  of  democracy,  to  use 
Norberto  Bobbio’s  well-known  expression — calls 
for  an  analysis  of  the  goals  and  criteria  held  to  be 
indispensable  for  achieving  “full”  and  desirable 
democracy.  This  type  of  analysis  appears  to  belong 
to  studies  on  the  “quality”  or  “full  achievement” 
of  democracy  and  should  be  developed  in  that 
context.  In  the  wake  of  the  more  recent  experi¬ 
ences  of  democratization,  studies  of  this  type  are 
now  increasing  sharply.  They  are  still  only  at  the 
beginning,  and  their  preliminary  results  require 
verification;  however,  they  represent  a  coherent 
means  of  completing  the  analysis  of  the  function¬ 
ing  and  performance  of  democracy  that  originated 
in  studies  of  democratic  consolidation. 

Davide  Grassi 
Universita  degli  studi  di  Torino 
Turin,  Italy 


620  Democratic  Peace 


See  also  Democracy,  Quality;  Democracy,  Theories  of; 
Democratization;  Legitimacy;  Transition 

Further  Readings 

Bobbio,  N.  (1984).  II  futuro  della  democrazia  [The  future 
of  democracy].  Torino,  Italy:  Einaudi. 

Huntington,  S.  P.  (1991).  The  third  wave: 

Democratization  in  the  late  twentieth  century. 
Norman:  University  of  Oklahoma  Press. 

Linz,  J.,  &  Stepan,  A.  (1996).  Problems  of  democratic 
transition  and  consolidation.  Baltimore:  Johns 
Hopkins  University  Press. 

Morlino,  L.  (1998).  Democracies  between  consolidation 
and  crisis:  Parties ,  groups  and  citizens  in  Southern 
Europe.  Oxford,  UK:  Oxford  University  Press. 
O’Donnell,  G.  (1996).  Illusions  about  consolidation. 

Journal  of  Democracy,  2,  34-51. 

Przeworski,  A.,  Alvarez,  M.  E.,  Cheibub,  J.  A.,  & 
Limongi,  F.  (2000).  Democracy  and  development. 
Cambridge,  UK:  Cambridge  University  Press. 

Rustow,  D.  (1990).  Democracy:  A  global  revolution? 
Foreign  Affairs,  4,  75-91. 

Schedler,  A.  (1998).  What  is  democratic  consolidation? 

Journal  of  Democracy,  2,  91-107. 

Schedler,  A.  (2001).  Measuring  democratic  consolidation. 
Studies  in  Comparative  International  Development,  1, 
66-92. 

Schmitter,  P.  (1998).  Some  basic  assumptions  about  the 
consolidation  of  democracy.  In  T.  Inoguchi,  E. 
Newman,  &  J.  Keane  (Eds.),  The  changing  nature  of 
democracy  (pp.  23-36).  New  York:  United  Nations 
University  Press. 


Democratic  Peace 


Democratic  peace  refers  most  specifically  to  the 
proposition  that  democratic  states  have  not  fought 
and  are  not  likely  to  fight  interstate  wars  against 
each  other.  It  refers  more  generally  to  the  notion 
that  democracy  has  an  important  pacifying  impact 
on  international  politics,  whether  by  making  indi¬ 
vidual  states  less  warlike,  by  creating  peaceful 
relationships  for  pairs  of  states,  or  by  making  the 
entire  globe  more  peaceful.  It  is  a  proposition  that 
lies  at  the  heart  of  the  American  academic  field  of 
international  politics,  born  in  the  wake  of  a  world 
war  ostensibly  fought  to  make  the  world  safe  for 
democracy.  Though  its  philosophical  roots  go 


back  to  Immanuel  Kant  and  Thomas  Paine,  in  its 
contemporary  form,  democratic  peace  focuses 
most  intently  on  democracy’s  impact  on  interac¬ 
tions  within  pairs  of  states.  This  entry  reviews 
research  on  that  impact,  its  theoretical  bases, 
major  criticisms  of  it,  and  the  most  important 
responses  to  those  criticisms. 

Impact  of  Democracy 

Recent  interest  evoked  by  democratic  peace  is  the 
result,  in  an  important  part,  of  the  strikingly  sim¬ 
ple  claim  that  no  two  democratic  states  have  ever 
fought  a  war  against  each  other.  This  is  not  a 
trivial  claim.  Though  interstate  wars  are  far  too 
common,  statistically  speaking,  they  are  rare 
events.  In  most  years,  99%  of  the  pairs  of  states  in 
the  international  system  avoid  fighting  wars  against 
each  other.  For  the  rate  of  warfare  among  demo¬ 
cratic  states  to  be  significantly  different  from  that 
for  states,  in  general,  the  number  of  wars  between 
democratic  states  must  be  at  least  close  to  zero. 

So  critics  of  democratic  peace  point  out  excep¬ 
tions  to  the  alleged  rule  about  democratic  states 
having  universally  peaceful  relationships  with  each 
other.  The  War  of  1812  between  the  United  States 
and  Great  Britain,  the  U.S.  Civil  War,  the  Spanish 
American  War  in  1898,  even  World  War  I,  the 
official  state  of  war  between  Great  Britain  and 
Finland  in  World  War  II,  the  war  between  Lebanon 
and  Israel  in  1948,  and  the  military  conflict 
between  India  and  Pakistan  in  1999  are  among  the 
most  frequently  mentioned  wars  on  this  list. 

Resolving  the  debate  about  whether  any  or  all  of 
these  are  actually  exceptions  to  the  democratic 
peace  rule  obviously  must  involve  definitions  of 
“war”  and  “democracy.”  A  specific  definition  of 
war  widely  adopted  by  researchers  focusing  on 
quantitative  analysis  of  evidence  regarding  its 
causes  specifies  that  an  interstate  war  involves 
military  conflict  between  independent  states  lead¬ 
ing  to  the  deaths  of  at  least  1,000  soldiers.  This 
definition  has  been  adopted  by  most  analysts  con¬ 
ducting  systematic  empirical  evaluations  of  hypoth¬ 
eses  regarding  the  causes  of  war  for  several  decades. 

Definitions  and  measures  of  democracy  are 
probably  more  contestable.  Most  of  the  research 
on  democratic  peace  has  adopted  a  numerical 
threshold  based  on  data  consisting  of  annual 
scores  rating  states  on  a  continuum  from  fully 
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autocratic  to  entirely  democratic.  These  thresholds 
are  inescapably  arbitrary  to  some  extent.  In  fact,  all 
definitions  contain  an  important  arbitrary  element. 
Furthermore,  to  evaluate  the  validity  of  the  state¬ 
ment  that  democratic  states  never  fight  wars 
against  each  other,  all  states  must  be  sorted  into 
“democratic”  or  “not  democratic”  categories. 
Obviously,  states  do  not  naturally  or  clearly  fall 
into  neatly  exclusive  or  exhaustive  categories  of 
that  kind.  So,  ultimately,  the  proposition  that  needs 
to  be  evaluated  is  “states  that  are  sufficiently  demo¬ 
cratic  never  (or  rarely)  fight  wars  against  each 
other.  ”  How  much  democracy  is  enough,  and  how 
can  that  level  of  democracy  be  identified,  are  cru¬ 
cial  questions  with  no  answers  that  will  generate 
universal  consensus.  One  characteristic  receiving 
some  attention  focuses  on  the  ability  of  a  state  to 
stage  elections  in  which  executive  leadership  of  the 
state  passes  from  one  independent  political  party  to 
another,  different  independent  political  party. 

Theoretical  Frameworks 

The  debate  about  which  attributes  commonly 
thought  of  as  “democracy”  possess  the  most  effec¬ 
tive  pacifying  impact  leads  inevitably  to  a  more 
general  issue  regarding  the  theoretical  basis  for  the 
democratic  peace  proposition.  This  debate  shifts 
the  focus  from  questions  about  whether  it  is  true 
that  democratic  states  have  fought  wars  or  become 
involved  in  militarized  disputes  at  a  lower  rate 
than  other  kinds  of  states  to  a  rather  different 
issue:  Assuming  an  acceptance  of  the  claim  that 
jointly  democratic  states  do  have  a  historical 
record  of  conflict  that  is  distinctive  in  the  way  the 
democratic  peace  theory  would  suggest,  what  has 
brought  about  this  record?  One  prominent  answer 
focuses  on  the  potential  cultural  or  psychological 
impact  of  democracy  on  states  and  their  leaders. 
Political  leaders  in  democratic  states  are  accus¬ 
tomed  to  resolving  political  debates  and  disputes 
in  a  peaceful,  rule-based  fashion.  Leaders  of  demo¬ 
cratic  states  can  be  expected  to  bring  attitudes  and 
expectations  about  conflict  resolution  cultivated  in 
domestic  political  processes  to  interactions  with 
their  counterparts  in  other  states.  For  their  own 
protection,  democratic  leaders  bring  quite  different 
attitudes  and  expectations  to  their  interactions 
with  leaders  of  autocratic  states.  They  find  such 
leaders  ethically  and  morally  suspect  from  the 


outset.  And  they  have  no  expectation  that  conflicts 
with  them  can  be  resolved  in  a  peaceful  manner. 
But  if  the  leaders  of  other  states  are  democratic, 
leaders  of  democratic  states  anticipate  that  dis¬ 
agreements  can  be  resolved  in  a  peaceful  manner. 

The  second  main  theoretical  argument  about  the 
potential  pacifying  effect  of  democracy  focuses  on 
the  structure  of  domestic  political  systems.  An 
important  starting  point  for  such  arguments  posits 
that  political  leaders  of  all  states  place  the  highest 
priority  on  staying  in  power.  Analysts  point  out 
that  to  initiate  military  interventions  or  attacks, 
leaders  of  democratic  states  need  to  persuade  vari¬ 
ous  elements  in  the  government,  interest  groups, 
and  even  the  general  population  to  support  such 
ventures.  This  process,  relatively  cumbersome  com¬ 
pared  with  that  faced  by  autocratic  leaders,  makes 
democratic  states  more  hesitant  and  cautious  about 
getting  involved  in  military  conflict.  Then,  too, 
democratic  leaders  are  more  vulnerable  to  being 
removed  from  office  if  they  become  involved  in  a 
war,  or  even  a  less  serious  militarized  conflict,  and 
they  lose  that  war  or  conflict.  Since  democratic 
states  tend  to  win  the  wars  in  which  they  become 
involved,  democratic  states  are,  everything  else 
being  equal,  opponents  to  be  avoided  in  military 
conflicts.  Autocratic  leaders  tend  to  be  not  so  cau¬ 
tious  and  not  so  anxious  to  win  the  wars  in  which 
they  are  entangled.  All  they  need  to  do  to  maintain 
themselves  in  power  is  to  satisfy  or  appease  their 
relatively  small  winning  coalitions.  And  since  those 
winning  coalitions  are  small,  supporters  of  dicta¬ 
tors  tend  to  be  loyal.  They  understand  that  they 
might  well  be  excluded  from  the  small  winning 
coalition  supporting  any  successor  regime. 

Democratic  leaders  must  satisfy  a  much  larger 
coalition.  Therefore,  they  depend  more  crucially 
on  providing  public  goods  to  that  larger  coalition, 
such  as  victories  in  war.  If  a  war  is  lost,  they  prob¬ 
ably  cannot  provide  enough  private  goods  to  a 
sufficiently  large  portion  of  their  coalition  to  main¬ 
tain  themselves  in  power.  This  is  especially  the  case 
because  supporters  of  democratic  leaders  tend  to 
be  much  less  loyal  than  supporters  of  autocratic 
leaders.  The  winning  coalition  of  any  successor  in 
a  democratic  regime  must  necessarily  be  large,  at 
least  approaching  one  half  of  the  population 
involved  in  selecting  leaders — that  is,  the  entire 
electorate.  In  other  words,  in  political  systems 
based  on  relatively  large  winning  coalitions,  those 
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who  desert  the  coalition  supporting  the  current 
leader  also  have  a  reasonable  chance  of  joining  the 
winning  coalition  of  the  successor.  All  of  these 
considerations  make  democratic  states  cautious 
about  getting  involved  in  wars  or  militarized  con¬ 
flicts,  especially  against  other  democratic  states. 

Criticism  and  Response 

Some  critics  of  both  these  approaches  to  explaining 
the  alleged  lower  rate  of  warfare  and  militarized 
conflict  among  democratic  states  acknowledge  that 
the  statistical  pattern  exists.  They  argue,  however, 
that  the  correlation  between  democracy  and  peace 
for  pairs  of  states  is  not  the  reflection  of  a  causal 
linkage.  One  argument  has  it  that  it  is  actually 
peace  that  produces  democracy.  That  is,  states  that 
for  whatever  reason  come  into  existence  and/or 
exist  for  prolonged  periods  of  time  in  an  environ¬ 
ment  of  peace  are  more  likely  to  be  democratic.  But 
this  does  not  prove  that  democracy  causes  peace. 

One  of  the  most  prominent  antidemocratic  peace 
arguments  points  out  that  most  of  the  historical 
examples  of  democratic  pairs  of  states  avoiding 
wars  against  each  other  come  from  the  decades  fol¬ 
lowing  the  onset  of  the  Cold  War.  Democratic 
states  in  that  era,  it  is  argued,  avoided  wars  against 
each  other  not  because  they  were  democratic  but 
because  they  shared  the  unifying  threat  of  a  power¬ 
ful  common  enemy  in  the  Soviet  Union  or  the  com¬ 
munist  coalition  of  the  Soviet  Union,  the  People’s 
Republic  of  China,  and  various  allies. 

However,  several  statistical  analyses  of  the  rela¬ 
tionship  between  democracy  and  peace  control  for 
the  impact  of  alliance  ties  and  still  find  that  democ¬ 
racy  apparently  leads  to  peace.  And,  it  is  not  true 
that  the  potentially  unifying  impact  of  opposition 
from  the  communist  world  was  universally  effec¬ 
tive.  That  opposition  did  not  prevent,  for  example, 
a  war  between  El  Salvador  and  Honduras  in  1969, 
military  conflict  between  Greece  and  Turkey  over 
Cyprus  in  1974,  or  the  clash  between  Great  Britain 
and  Argentina  over  the  Malvinas/Falkland  Islands 
in  1982.  In  each  of  these  cases,  at  least  one  of  the 
antagonists  was  not  democratic.  Furthermore,  the 
communist  world  was  confronted  by  a  capitalist/ 
democratic  world  even  more  imposing  than  itself 
in  terms  of  military  and  industrial  capabilities,  and 
the  allegedly  unifying  impact  of  that  opposition 
did  not  prevent  the  Soviet  Union  from  invading 


Hungary,  Czechoslovakia,  or  Afghanistan,  the 
serious  clashes  between  China  and  Vietnam,  and 
Vietnam’s  attack  on  and  occupation  of  Cambodia. 

Democratic  peace  ideas  influenced  the  foreign 
policies  of  the  Clinton  and  Bush  administrations 
during  the  2  decades  after  the  end  of  the  Cold 
War.  Especially  in  the  latter  case,  some  foreign 
policy  initiatives  ostensibly  based  on  those  ideas 
had  problematic  outcomes.  One  important  criti¬ 
cism  of  the  democratic  peace  literature  is  that  it 
has  never  laid  out  clearly  the  policy  implications  of 
its  findings,  even  though  it  is  also  true  that  few  if 
any  democratic  peace  advocates,  for  example, 
endorsed  the  U.S.  intervention  in  Iraq  in  2003.  In 
short,  democracy  may  have  an  important  pacifying 
impact  on  relationships  between  states.  How  poli¬ 
cymakers  might  best  take  advantage  of  that  impact 
is  not  always  clear. 

James  Lee  Ray 
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Democratization 


Democratization  can  be  defined  as  two  different 
political  processes: 
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1.  Democratization  is  the  transformation  from  a 
nondemocratic  regime  to  a  democratic  political 
regime.  It  is  a  transition  from  nondemocratic  to 
democratic  types  of  political  regime  and 
involves  regime  change. 

2.  Democratization  is  the  process  of  political 
transformation  from  an  electoral  or  partial 
democracy  toward  a  full  or  consolidated 
democracy.  It  is  a  transition  between  different 
degrees  of  democracy  within  a  specific 
democratic  political  system. 

This  entry  covers  both  meanings,  by  discussing 
the  actors  who  may  bring  democracy,  the  factors 
of  and  impediments  to  democracy,  and  historical 
and  future  perspectives  of  democracy  and  the 
democratic  trend. 

Agency  Factors  of  Democratization 

The  main  theory  with  emphasis  on  “agency,”  as 
developed  in  sociological  theory,  is  the  transition 
approach,  which  was  proposed  by  Guillermo 
O’Donnell  and  Philippe  C.  Schmitter  (1986)  and 
further  developed  by  Gretchen  Casper  and  Michelle 
Taylor  (1996).  This  agency  approach  analyzes  the 
process  of  transition  toward  democracy  with 
political  elites  as  the  main  actors  and  as  the  crucial 
and  critical  factor  of  democratization.  One  of  the 
conditions  that  help  initiate  a  transition  to  democ¬ 
racy  in  an  authoritarian  regime  is  if  the  ruling  elite 
splits  into  factions  with  opposing  interests.  This  is 
likelier  to  happen  in  more  developed  societies 
whose  complexity  creates  multifaceted  regime 
coalitions  that  are  not  as  easily  held  together.  Rifts 
within  the  ruling  elite  are  also  more  likely  when 
there  is  a  mounting  legitimacy  crisis  due  to  eco¬ 
nomic  setbacks,  unfulfilled  policy  promises,  and 
failures  in  crisis  management. 

In  heterogeneous  regime  coalitions,  legitimacy 
crises  encourage  elite  splits  because  they  create  an 
opportunity  for  some  elite  groups  to  try  to 
strengthen  their  position  in  the  regime  coalition  by 
pursuing  a  reform  strategy  that  they  hope  will 
bring  them  popular  support,  thus  regaining  legiti¬ 
macy.  Accordingly,  many  transitions  to  democ¬ 
racy  have  been  instigated  by  the  emergence  of  a 
reform  camp  within  the  regime.  Typically,  the 
reformers  initiate  a  liberalization  program  that 
opens  a  space  for  criticism  and  alternative  voices. 


As  a  result,  opposition  groups  surface  from  the 
underground  and,  in  many  cases,  advance  further 
claims  for  democratization.  If  the  opposition 
groups  remain  moderate  in  their  methods  (avoid¬ 
ing  violence)  and  demonstrate  their  readiness  for 
compromise  but  at  the  same  time  muster  wide¬ 
spread  public  support,  a  negotiated  transition  to 
democracy  becomes  possible. 

The  emergence  of  opposition  to  a  regime  does 
not  always  result  from  an  elite-initiated  opening 
process.  Sometimes,  policy  failures  lead  to  sponta¬ 
neous  manifestations  of  widespread  mass  opposi¬ 
tion,  launching  a  legitimacy  crisis  that  impels  an 
intra-elite  reform  camp  to  surface  and  engage  in 
negotiations  with  the  opposition.  Again,  this  con¬ 
figuration  of  events  often  leads  to  negotiated,  or 
“pacted,”  transitions. 

The  institutional  basis  of  a  given  authoritarian 
regime  is  an  important  factor  in  this  context 
because  different  types  of  authoritarian  regimes 
show  different  vulnerabilities  to  democratizing 
pressures.  For  instance,  the  weakness  of  military 
regimes  is  that  they  lack  an  ideological  mission 
that  legitimates  them  on  a  long-term  basis. 
Usually,  they  take  power  as  crisis  managers,  so 
their  justification  is — often  explicitly — only  tem¬ 
porary.  The  legitimacy  of  military  regimes  is  rela¬ 
tively  easily  questioned,  either  because  the  junta 
fails  to  manage  the  crisis,  in  which  case  its  justifi¬ 
cation  lacks  credibility,  or  because  things  run 
smoothly,  in  which  case  the  need  for  crisis  man¬ 
agement  becomes  obsolete.  One  obvious  advan¬ 
tage  of  military  regimes  is  that  they  control  the 
means  of  coercion,  so  they  can  silence  emerging 
opposition  by  brute  force.  But  confronted  with 
widespread  mass  opposition  that  proves  resilient 
even  in  the  face  of  oppression,  the  loyalty  of  the 
troops  may  erode  if  they  are  ordered  to  turn  on 
peaceful  protestors.  On  the  other  hand,  even 
though  military  regimes  sometimes  exit  quickly 
from  power,  they  also  easily  return,  as  the 
repeated  oscillations  between  military  and  civilian 
rule  in  countries  such  as  Turkey,  Pakistan,  or 
Thailand  demonstrate. 

Personalistic  regimes  put  all  their  eggs  into  the 
basket  of  the  central  ruler’s  charisma.  Accordingly, 
when  the  ruler  dies,  there  is  an  opportunity  for 
political  change,  as  has  been  clearly  demonstrated 
in  Spain.  Whether  or  not  this  opportunity  is  used 
for  a  transition  to  democracy  then  depends  on  the 
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power  balance  between  prodemocratic  and  anti¬ 
democratic  forces  and  their  relative  support  among 
the  population. 

One-party  regimes,  whether  leftist  or  rightist, 
profit  from  a  more  strongly  institutionalized  power 
basis.  These  regimes  usually  have  an  ideological 
mission  that  inspires  their  existence  and  provides 
legitimacy.  It  generally  takes  longer  and  is  a  bigger 
challenge  for  a  potential  opposition  group  to  erode 
the  ideological  basis  of  one-party  regimes.  One 
strategy  that  proved  successful  in  the  former  com¬ 
munist  bloc  is  to  demonstrate  that  a  regime  betrays 
its  very  own  ideals.  When  communist  countries 
signed  the  human  rights  declaration  in  the  Final 
Act  of  the  Conference  on  Security  and  Co-operation 
in  Europe  (CSCE)  while  refusing  to  respect  these 
rights  in  practice,  civil  rights  movements  such  as 
the  Charta  77  effectively  publicized  this  contradic¬ 
tion  and  in  doing  so  helped  erode  the  regime’s 
legitimacy.  Eventually,  the  legitimacy  crisis  went 
so  deep  that  even  within  the  communist  parties  no 
one  believed  any  longer  in  the  regime’s  ideals.  In 
this  situation,  reform  camps  surfaced  in  a  number 
of  communist  parties  (most  notably  in  the  Soviet 
Union  and  Hungary),  together  with  organizations 
outside  the  party  that  were  opposed  to  the  regime, 
once  Mikhail  Gorbachev’s  nullification  of  the 
Brezhnev  Doctrine  in  1998  made  this  seem  like  a 
viable  strategy  in  Central  and  Eastern  Europe. 

Splits  in  the  ruling  elite  are  important  because 
they  give  leverage  to  domestic  as  well  as  interna¬ 
tional  actors,  enhancing  their  bargaining  options 
to  push  a  democratization  agenda  through.  The 
leverage  that  international  actors  have  in  pushing 
for  democracy  increases  insofar  as  a  country 
depends  on  international  aid.  In  some  cases, 
dependence  on  international  assistance  can  be  so 
strong  that  external  powers  can  trigger  democrati¬ 
zation  even  in  the  absence  of  a  prodemocratic 
opposition  group  within  the  country.  In  the 
extreme  case,  democratic  powers  can  enforce 
democratic  institutions  by  military  intervention  as 
was  attempted  in  Afghanistan  and  Iraq.  But  exter¬ 
nally  triggered  processes  of  democratization  are 
unlikely  to  penetrate  very  deep  unless  there  are 
strong  domestic  forces  inside  a  country. 

When  parties  of  the  ruling  elite  are  unified  to 
sustain  an  authoritarian  system,  the  transition  to 
democracy  is  less  easily  achieved,  particularly  if  the 
regime  is  able  to  isolate  itself  from  international 


democratizing  pressures.  In  such  cases,  the  chances 
to  democratize  depend  strongly  on  whether  a 
prodemocratic  opposition  group  emerges,  how 
massive  it  grows,  and  how  skillfully  it  uses  its  rep¬ 
ertoire  of  elite-challenging  actions.  If  the  opposi¬ 
tion  can  mobilize  support  from  all  layers  of  the 
population,  if  it  is  able  to  demonstrate  this  sup¬ 
port,  and  if  it  remains  resilient  even  in  the  face  of 
oppression,  loyalty  to  the  regime  erodes,  even 
among  the  armed  forces,  undermining  the  regime’s 
repressive  capacities.  Thus,  massive,  determined, 
and  well-organized  opposition  groups  can  over¬ 
come  the  resistance  of  the  ruling  elite  to  democrati¬ 
zation.  If,  however,  the  opposition  remains  limited 
to  isolated  sectors  of  society,  is  unable  to  demon¬ 
strate  popular  support  across  the  country,  and  can¬ 
not  remain  resilient  in  the  face  of  repression,  its 
chances  of  success  will  be  limited. 

To  a  considerable  extent,  then,  democratization 
is  a  matter  of  the  skills  and  virtues  of  mass  opposi¬ 
tion  leaders.  It  matters  how  willing  and  able  they 
are  to  advance  claims  that  resonate  with  many 
people,  to  mobilize  resources  for  popular  cam¬ 
paigns,  and  to  make  use  of  the  full  set  of  ruling 
elite-challenging  actions  even  in  the  face  of  repres¬ 
sion.  Tactical  and  strategic  factors,  such  as  the 
presence  of  skilful  political  dissidents,  benevolent 
reforms  by  the  ruling  elite,  and  international  assis¬ 
tance,  are  important,  but  when  it  comes  to  full 
democratization,  these  factors  can  hardly  compen¬ 
sate  for  deficiencies  in  the  development  of  ordinary 
people’s  capabilities  and  motivation  to  practice 
democracy. 

Factors  of  Democratization:  Economic, 
Social,  and  Cultural  Modernization 

Mounting  and  sustaining  a  prodemocratic  opposi¬ 
tion  against  authoritarian  rulers  require  that  soci¬ 
eties  embark  on  a  process  of  human  empowerment 
that  develops  organizational  skills  and  resources 
among  the  population.  It  also  requires  that  sub¬ 
stantial  segments  of  the  population  give  high  prior¬ 
ity  to  democratic  freedoms,  making  them  able  and 
willing  to  struggle  for  democratic  institutions. 
Ordinary  people’s  readiness  to  struggle  for  demo¬ 
cratic  freedoms  is  necessary  for  deep  democratiza¬ 
tion  to  be  attained,  for  authoritarian  leaders  are 
unlikely  to  surrender  their  powers  unless  they  are 
pressured  to  do  so. 
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Paramount  is  a  type  of  economic  development 
that  is  knowledge  driven  and  distributes  human 
social  and  political  resources  widely  throughout 
society  rather  than  concentrating  them  in  small 
minorities  of  the  population.  Industrialization  and 
the  rise  of  the  knowledge  society  equip  growing 
segments  of  the  population  with  the  material 
means,  intellectual  skills,  and  social  opportunities 
needed  to  mount  effective  pressures  on  elites.  As  a 
consequence,  ordinary  people’s  action  repertoires 
expand  in  ways  that  make  the  value  of  democratic 
freedoms  intuitively  obvious,  giving  rise  to  eman¬ 
cipative  worldviews  that  value  freedoms  highly. 
These  long-term  developmental  factors  enhance  a 
society’s  ability  and  willingness  to  struggle  for 
democracy. 

The  crucial  and  critical  importance  of  eco¬ 
nomic,  social,  and  cultural  modernization  has 
been  at  the  core  of  a  modernization  theory  of 
democratization.  The  main  author  of  such  a  the¬ 
ory  of  modernization  as  the  most  important  factor 
of  democratization  is  Seymour  Martin  Lipset 
(1994).  This  modernization  theory  of  democrati¬ 
zation  was  expanded  and  refined  by  Larry 
Diamond  (1992).  Hence,  this  theory  of  democrati¬ 
zation  can  be  called  the  Lipset-Diamond  theory  of 
democratization. 

The  human  empowerment  approach  of  democ¬ 
ratization  is  a  hybrid  structure-agency  theory  of 
democratization  and  has  been  developed  by  Ronald 
Inglehart  and  Christian  Welzel  (2005). 

External  Threats  and  Group  Hostilities 
as  Impediments  to  Democracy 

Various  situations  can  arise  that  prevent  develop¬ 
mental  factors  from  giving  rise  to  the  prodemo- 
cratic  effects  associated  with  them.  Perceptions  of 
external  threats  and  internal  group  hostilities  tend 
to  be  detrimental  to  democracy  because  they 
diminish  tolerance  of  opposition — a  basic  princi¬ 
ple  of  democratic  organization.  External  threats 
help  leaders  use  “rally  around  the  flag”  strategies 
that  silence  inner  opposition.  Group  hostilities  do 
the  same  within  groups,  closing  ranks  around  lead¬ 
ers  and  silencing  opposing  views. 

Involvement  of  a  country  in  an  enduring  inter¬ 
national  conflict  can  undermine  democratic  insti¬ 
tutions  because  they  provide  a  sense  of  being 
threatened  that  allows  skilful  leaders  to  present 


suppression  of  the  opposition  as  crucial  to  the 
nation’s  survival.  Similarly,  internal  group  divi¬ 
sions  are  not  inherently  threatening  to  democracy, 
but  ethnic,  linguistic,  religious,  and  other  easily 
discernible  group  divisions  can  be  manipulated  to 
foment  support  for  authoritarian  leaders.  Extremist 
leaders  almost  always  mobilize  support  by  playing 
on  group  hostilities.  Thus,  democracy  has  histori¬ 
cally  been  more  easily  established  and  consoli¬ 
dated  in  societies  that  are  relatively  homogeneous 
culturally  and  relatively  egalitarian  economically. 

Regardless  of  whether  such  hindering  factors 
are  present,  deep  democratization  requires  that  a 
society’s  people  acquire  the  capability  and  motiva¬ 
tion  to  struggle  for  the  freedoms  that  define 
democracy.  This  is  because  democracy  is  a  socially 
embedded  phenomenon,  not  just  an  institutional 
machine  that  operates  in  a  vacuum.  Shallow 
democratization  involves  crafting  institutions,  but 
deep  democratization  involves  the  development  of 
supportive  values  and  skills  among  large  segments 
of  a  society. 

Democratization  and  Transition  in 

a  Long-Term  Historical  Perspective 

How  can  we  understand  the  fact  that  democratiza¬ 
tion  processes  in  separate  countries  cluster  into 
coherent  and  sweeping  international  waves,  behav¬ 
ing  as  if  they  were  centrally  coordinated  by  a  mas¬ 
ter  agent  when  in  fact  neither  that  master  agent 
nor  central  coordination  of  the  international  waves 
exists?  The  answer  is  that  evolutionary  forces  are 
at  work  that  go  beyond  the  awareness  and  control 
of  even  the  most  powerful  elites.  These  evolution¬ 
ary  forces  bestow  a  systematic  selective  advantage 
on  democracies  over  autocracies.  To  the  extent 
that  such  selective  advantages  exist,  it  is  essential 
to  understand  them  in  order  to  assess  the  future 
potential  of  democracy  and  in  order  to  understand 
the  limits  and  opportunities  within  which  agents 
pursuing  a  democratic  agenda  are  acting. 

An  evolutionary  dynamic  is  present  when  pro¬ 
cesses  are  directed,  behaving  as  if  they  were  cen¬ 
trally  coordinated,  when  in  fact  such  coordination 
does  not  exist.  Markets  behave  in  this  way  and  so 
does  the  long-term  evolution  of  democracy.  But 
why  do  levels  of  democracy  evolve  over  time? 
Higher  levels  of  democracy  emerge  as  higher  levels 
of  action  resources  become  available  to  the  average 
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person  in  a  society.  In  an  era  of  mass  politics, 
democracies  enjoy  three  distinct  selective  advan¬ 
tages  over  autocracies.  First,  there  is  a  selective 
advantage  by  international  confrontations.  States 
have  been  involved  in  international  conflicts  and 
wars,  and  often  the  winning  states’  political  regimes 
replaced  the  losing  states’  ones.  Success  in  interna¬ 
tional  confrontations  has  been  related  to  the  type 
of  regime.  Democracies  usually  won  the  wars  they 
were  engaged  in,  partly  because  in  the  long  run, 
they  could  mobilize  their  people  and  resources 
more  effectively.  Moreover,  democracies  tend  not 
to  fight  each  other,  avoiding  extinguishing  their 
own  kind.  Autocracies  do  not  have  this  tendency. 

Second,  there  is  a  selective  advantage  by  eco¬ 
nomic  performance.  Democracies  have  emerged 
and  persisted  in  technologically  and  economically 
more  advanced  and  powerful  states,  which  partly 
explains  their  superiority  in  international  confron¬ 
tations  with  autocracies.  Democracies  have  been 
established  in  more  prosperous  economies  from 
the  start.  In  addition,  democracies  continued  to 
outperform  autocracies  economically,  greatly 
increasing  their  initial  prosperity  advantage  over 
time.  Equally  important,  autocracies  repeatedly 
lost  their  more  prosperous  members  to  the  demo¬ 
cratic  camp. 

The  third  selective  advantage  of  democracies  is 
an  advantage  by  popular  support,  which  is  a  truly 
selective  force.  Because  they  grant  power  to  the 
people  and  because  their  rulers  are  selected  by  the 
population,  democracies  tend  to  have  more  popular 
support  than  autocracies,  which  enables  them  to 
mobilize  their  people  more  effectively.  This  also 
allows  them  to  limit  mass  disaffection  more  effec¬ 
tively  than  autocracies.  Even  autocracies  that  seem 
stable  on  the  surface,  lacking  obvious  signs  of  mass 
opposition,  are  vulnerable  to  the  “element  of 
surprise”  that  becomes  apparent  in  democratic 
revolutions  when  massive  opposition  to  a  regime 
suddenly  emerges  and  persists,  toppling  a  regime 
that  may  have  lasted  for  decades.  Democracies  are 
less  vulnerable  to  extinction  by  popular  revolutions. 
They  simply  change  their  rulers  through  elections. 

Democratization  and  the  Future  of  Politics 

The  selective  advantages  of  democracy  are  of  such 
a  long-term  nature  and  so  deeply  rooted  in  basic 
developmental  processes  that  there  is  no  reason  to 


assume  that  the  odds  will  fundamentally  turn 
against  democracy  in  the  foreseeable  future. 
Setbacks  will  occur  in  specific  countries,  but  the 
achievements  of  the  global  wave  of  democracy  are 
unlikely  to  be  reversed.  But  this  does  not  mean  that 
there  are  no  future  challenges.  Instead,  we  see  a 
number  of  challenges  on  the  democratic  agenda, 
which  can  be  formulated  in  terms  of  the  following 
questions: 

1.  Will  democracy  continue  to  spread 
geographically? 

2.  Will  the  deficiencies  of  new  democracies,  such 
as  those  in  the  former  Soviet  Union,  be 
overcome? 

3.  Will  the  democratic  qualities  of  established 
democracies  be  further  deepened? 

One  might  also  question  the  viability  of  the 
democratic  principle  in  an  era  in  which  the  major 
organizational  frame  of  democracy,  the  nation¬ 
state,  is  said  to  have  lost  its  significance.  And  one 
might  question  the  viability  of  the  democratic  prin¬ 
ciple  in  a  world  in  which  decisive  ecological  mea¬ 
sures  seem  to  be  unpopular,  though  they  may  be 
necessary  to  save  our  planet. 

Spreading  Democracy  to  New  Regions 

Two  important  geographical  areas  have,  so  far, 
proved  to  be  relatively  immune  to  the  democratic 
trend:  China,  and  the  predominantly  Islamic 
Middle  East  and  North  Africa.  Anchoring  democ¬ 
racy  in  these  two  areas  would  without  doubt  con¬ 
stitute  a  major  breakthrough  for  the  democratic 
principle.  As  far  as  the  Middle  East  and  North 
Africa  are  concerned,  a  sweeping  democratic  trend 
throughout  the  region  does  not  seem  likely  in  the 
near  future.  The  terror  and  violence  nurtured  by 
the  Israeli-Palestinian  conflict,  Islamic  fundamen¬ 
talism,  and  the  predominance  of  patrimonial  states 
based  on  oil  rents  all  amount  to  powerful  obstacles 
to  democratization.  In  addition,  there  is  through¬ 
out  much  of  the  Islamic  world,  but  especially  in  the 
Middle  East,  a  cultural  self-appraisal  of  Islam  as 
the  West’s  countercivilization — an  understanding 
that  is  sometimes  mirrored  in  Western  views  of 
Islam  as  its  countercivilization.  On  this  basis, 
democracy  is  considered  to  be  a  Western  product 
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in  much  of  the  Islamic  world,  which  might  dis¬ 
qualify  it  in  the  eyes  of  many  people.  Evidence 
from  the  World  Values  Surveys  indicates  that  even 
among  those  segments  of  Islamic  populations  that 
overtly  support  democracy,  there  is  often  a  funda¬ 
mental  misunderstanding  of  democratic  principles. 
Evidence  from  the  World  Values  Surveys  also  sug¬ 
gests  that  patriarchal-authoritarian  values,  which 
are  incompatible  with  democracy,  are  prevalent  in 
much  of  the  region,  particularly  the  Arab-speaking 
countries.  Although  these  factors  hinder  the  emer¬ 
gence  of  democracy,  the  general  idea  of  democracy 
is  widely  approved,  and  democratic  institutions 
seem  to  be  taking  root  in  some  historically  Islamic 
societies  such  as  Turkey. 

Consolidating  and  Improving 
New  Democracies 

Many  new  democracies  in  Latin  America,  sub- 
Saharan  Africa,  and  Central  and  Eastern  Europe 
show  serious  deficiencies  concerning  the  rule  of 
law,  accountability,  and  transparency.  Not  surpris¬ 
ingly  then,  there  is  widespread  popular  cynicism 
about  the  integrity  of  representatives,  the  trustwor¬ 
thiness  of  institutions,  and  policy  performance  in 
these  new  democracies.  This  popular  cynicism 
often  leads  to  political  apathy  rather  than  mass 
political  activism,  weakening  civil  society  and  plac¬ 
ing  corrupt  leaders  under  little  popular  pressure  to 
behave  more  responsively.  But  in  those  new  democ¬ 
racies  where  cynical  citizens  become  “critical  citi¬ 
zens”  who  sustain  a  high  level  of  elite-challenging 
mass  activities,  government  is  consistently  more 
effective,  transparent,  and  accountable.  Civic 
action  matters:  Both  within  new  and  old  democra¬ 
cies,  relatively  widespread  civic  action  helps  in 
increasing  accountable  governance.  This  insight  is 
important.  It  shows  that  the  quality  of  democracy 
is  not  solely  the  concern  of  the  ruling  elite.  It  is  also, 
and  very  markedly  so,  a  matter  that  concerns  the 
citizens.  When  they  are  motivated  to  put  the  ruling 
elite  under  popular  pressure  and  actually  do  so, 
they  can  improve  the  quality  and  effectiveness  of 
governance.  There  is  no  reason  for  civic  defeatism. 

Deepening  Old  Democracies 

The  most  obvious  aspect  of  the  global  democratic 
trend  is  the  geographical  spread  of  democracy.  But 


the  global  democratic  trend  has  a  second,  often 
forgotten  aspect:  the  deepening  of  democracy.  This 
occurs  even  where  democracy  has  been  in  place  for 
many  decades.  This  trend  is  well  documented  in 
the  book  Democracy  Transformed ?  Expanding 
Political  Opportunities  in  Advanced  Industrial 
Democracies  by  Bruce  E.  Cain,  Russell  J.  Dalton, 
and  Susan  E.  Scarrow  (2005),  showing  that  over 
the  past  25  years  most  postindustrial  democracies 
have  widened  the  elements  of  direct  democracy, 
opened  channels  of  citizen  participation  in  policy 
planning,  extended  the  scope  of  civic  rights,  and 
improved  accountability  to  the  public.  These  insti¬ 
tutional  changes  have  been  accompanied  and 
driven  by  cultural  changes  that  gave  rise  to  eman¬ 
cipative  values  and  high  levels  of  sustained  elite- 
challenging  actions.  In  fact,  a  major  reason  why 
long-established  democracies  show  high  levels  of 
accountable  governance  is  because  they  are  con¬ 
stantly  exposed  to  popular  pressure  by  increas¬ 
ingly  critical  citizens.  This  should  affect  our  views 
of  what  kind  of  citizenry  is  needed  to  consolidate 
democracies  and  keep  them  flourishing. 

In  The  Civic  Culture ,  Gabriel  Almond  and 
Sidney  Verba  (1963)  assumed  that  in  order  for 
democracy  to  flourish,  citizen  participation  should 
be  limited  to  the  institutional  channels  of  repre¬ 
sentative  democracy,  focusing  on  elections  and  the 
activities  around  them.  This  view  was  reinforced 
by  Samuel  Huntington’s  (1968)  influential  work 
Political  Order  in  Changing  Societies,  contribut¬ 
ing  to  deep-seated  suspicions  of  noninstitutional- 
ized,  assertive  citizen  action.  This  suspicion  is  so 
deeply  ingrained  in  political  science  that,  even 
today,  prevailing  concepts  of  social  capital  and 
civil  society  still  focus  on  institutionally  channeled 
participation,  emphasizing  membership  and 
participation  in  formal  associations.  By  contrast, 
noninstitutionalized  forms  of  assertive  citizen 
action  are  rarely  recognized  in  prevailing  concep¬ 
tions  of  civil  society.  The  dominant  view  of  what 
sort  of  citizenry  makes  and  keeps  countries  demo¬ 
cratic  needs  to  be  revised.  Democracy  flourishes 
with  an  uncomfortable  citizenry  that  makes  life 
difficult  for  their  rulers,  exposing  them  to 
constant  popular  pressure.  Democracy  requires  a 
citizenry  who  place  a  high  value  on  democratic 
freedoms  and  are  capable  of  struggling  for  them — 
to  attain  them  when  they  are  denied  and  to  sustain 
them  when  they  are  challenged. 
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Unfortunately,  such  a  citizenry  cannot  be 
ordered  into  existence  by  elite  decree,  nor  can  it  be 
crafted  by  institutions.  Its  emergence  reflects  a 
more  basic  process  of  human  empowerment  and 
social  and  cultural  modernization  through  which 
people  acquire  the  resources  and  skills  to  demand 
responsive  government  and  the  values  that  moti¬ 
vate  them  to  practice  democracy.  Democratic  insti¬ 
tutions  can  be  imposed  from  outside,  but  if  these 
conditions  are  absent,  it  is  likely  to  be  a  flawed 
version  of  democracy  if  it  survives  at  all.  Anchoring 
democracy  is  not  just  about  crafting  institutions.  It 
is  about  shaping  the  development  of  economic, 
cultural,  and  political  modernization  of  a  given 
society. 
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Dependency  Theory 


According  to  dependency  theory,  Third  World 
countries  in  the  periphery  of  the  global  capitalist 
system  are  poor  and  underdeveloped  because  they 
are  exploited  by  the  advanced  capitalist  countries 
in  the  core.  Their  major  problem  is  dependency  on 
the  core  countries.  That  is  the  central  claim  made 
by  dependency  theory,  a  mode  of  analysis  devel¬ 
oped  primarily  by  Latin  American  scholars  in  the 
1960s  and  1970s. 

Dependency  theory  emerged  as  a  reaction 
against  the  modernization  paradigm,  which  domi¬ 
nated  Western  liberal  approaches  to  development 
during  the  decades  after  World  War  II. 
Modernization  theory  argued  that  Third  World 
countries  should  be  expected  to  follow  the  same 
developmental  path  as  taken  earlier  by  developed 
countries  in  the  West:  a  progressive  journey  from 
a  traditional,  preindustrial,  agrarian  society, 
toward  a  modern,  industrial,  mass-consumption 
society.  Development  meant  overcoming  barriers 
of  preindustrial  production,  backward  institutions, 
and  parochial  value  systems  that  impeded  the  pro¬ 
cess  of  growth  and  modernization. 

The  theoretical  endeavors  among  moderniza¬ 
tion  theorists  concerned  identification  of  the  full 
range  of  impediments  to  modernization  as  well  as 
all  factors  that  promote  modernization.  A  famous 
economic  modernization  theory  by  Walt  Whitman 
Rostow  specifically  stressed  that  the  “takeoff,”  the 
crucial  push  in  moving  from  traditional  toward 
modern,  is  characterized  by  a  marked  increase  in 
modern  sector  investment  to  a  minimum  of  10% 
of  the  gross  national  product.  Another  critical 
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element  concerns  the  relationship  of  Third  World 
countries  to  the  world  market.  Close  market  rela¬ 
tions  with  the  developed  countries  are  seen  to  have 
a  positive  developmental  effect  on  Third  World 
economies.  Foreign  trade  is  viewed  as  a  road  to 
market  expansion  and  further  growth  of  the  mod¬ 
ern  sector.  Foreign  direct  investment  in  the  Third 
World  by  transnational  corporations  (TNCs) 
brings  in  the  much  needed  modern  technology  and 
production  skills. 

Early  Critiques  of  Liberal  Economic  Theory 

This  liberal  understanding  of  development  was 
subjected  to  increasing  criticism  during  the  1960s 
and  1970s.  That  was  partly  in  reaction  to  the  lack 
of  progress  in  many  Third  World  countries  at  the 
time.  While  growth  rates  in  the  developed  world 
reached  unprecedented  highs  in  the  postwar 
decades,  many  Third  World  countries  had  difficul¬ 
ties  in  getting  economic  development  under  way. 
Their  economies  refused  to  “take  off.”  That  natu¬ 
rally  led  to  increasing  dissatisfaction  with  modern¬ 
ization  theory. 

Already  in  the  late  1940s,  the  Argentinean 
economist  and  central  bank  director  Raul  Prebisch 
had  criticized  liberal  economic  theory  in  relation 
to  the  Third  World.  In  particular,  he  turned 
against  David  Ricardo’s  famous  notion  of  com¬ 
parative  advantage  according  to  which  countries 
in  the  periphery  would  be  better  off  specializing 
in  production  and  export  of  raw  materials  and 
agricultural  products,  leaving  advanced  indus¬ 
trial  production  to  the  core  countries.  Prebisch 
argued  that  production  from  the  periphery  was 
subjected  to  deteriorating  terms  of  trade  com¬ 
pared  with  products  from  the  advanced  coun¬ 
tries.  Two  factors  were  involved.  The  first  is 
income  elasticity,  which  states  that  raw  materials 
and  agricultural  products  are  less  in  demand 
when  incomes  increase;  consumers  turn  to  non¬ 
food  items,  and  technological  advance  decreases 
the  demand  for  raw  materials.  Second,  because  of 
strong  labor  unions  in  the  advanced  countries, 
productivity  increases  lead  to  real  wage  improve¬ 
ments  in  the  core  rather  than  to  lower  prices  for 
manufactured  products.  In  effect,  comparative 
advantage  does  not  work  for  the  periphery.  The 
core  countries  keep  the  benefits  of  exchange  for 
themselves. 


Emergence  of  Dependency  Theory 

Dependency  theorists  were  inspired  by  Prebisch. 
They  took  his  critique  and  developed  it  further,  to 
a  full-scale  attack  on  liberal  ideas  of  moderniza¬ 
tion,  specialization,  and  comparative  advantage. 
According  to  Theotonio  dos  Santos  (1970), 
dependency 

is  a  situation  in  which  the  economy  of  certain 
countries  is  conditioned  by  the  development  and 
expansion  of  another  economy  to  which  the 
former  is  subjected.  .  .  .  [Sjome  countries  (the 
dominant  ones)  can  expand  and  become  self- 
sustaining  while  other  countries  (the  dependent 
ones)  can  do  this  only  as  a  reflection  of  that 
expansion,  (p.  231) 

Based  on  this  general  notion,  scholars  primarily 
from  the  Latin  American  school  of  theory  devel¬ 
oped  their  particular  theories  of  dependency.  In 
addition  to  dos  Santos,  they  include  Fernando 
Henrique  Cardoso,  Andre  Gunder  Frank,  Ruy 
Mauro  Marini,  Osvaldo  Sunkel,  and  several  others. 
In  the  present  context,  it  is  relevant  to  focus  on 
Frank  and  Cardoso  because  they  represent  the  most 
forceful  and  popular  varieties  of  dependency  theory. 
They  are  neo-Marxist  in  the  sense  that  they  both 
draw  on  classical  Marxist  analysis  of  the  capital- 
labor  relationship,  but  they  differ  from  classical 
Marxism  in  a  basic  respect.  Unlike  Marx,  depen¬ 
dency  theorists  do  not  expect  capitalist  development 
to  take  root  and  unfold  in  the  Third  World  in  the 
same  way  that  capitalism  evolved  in  Western 
Europe  and  North  America.  At  the  same  time,  they 
disagree  about  the  prospects  for  development  in  the 
Third  World.  Radical  theorists,  such  as  Frank, 
believe  that  a  separation  from  capitalism  and  the 
creation  of  a  popular-based  socialist  model  is  the 
only  way  forward.  Moderate  theorists  such  as 
Cardoso  argue  that  some  form  of  dependent  devel¬ 
opment  can  take  place  in  the  periphery  in  context  of 
the  capitalist  system.  In  both  cases,  however,  focus 
is  less  on  future  strategies  for  development  and 
more  on  present  weaknesses  of  the  capitalist  system. 

Frank  was  born  in  Berlin  and  educated  as  an 
economist  in  Chicago  before  he  came  to  Chile  and 
Mexico  to  take  up  the  analysis  of  dependency.  His 
book  from  1967,  Capitalism  and  Underdevelop¬ 
ment  in  Latin  America,  was  a  central  factor  in  the 
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establishment  of  dependency  theory  as  a  major 
approach  to  the  study  of  the  Third  World;  it  was 
strongly  inspired  by  Paul  Baran’s  study  The 
Political  Economy  of  Growth  from  1957. 

For  modernization  theorists,  “traditional  soci¬ 
ety”  was  the  place  where  all  countries  started  their 
process  of  development  and  modernization.  Frank 
rejects  that  view.  The  starting  point  for  him  is  not 
tradition — it  is  underdevelopment.  Underdevelop¬ 
ment  is  not  a  condition  that  once  characterized  all 
countries.  It  is  a  process  within  the  framework  of 
the  global  capitalist  system  to  which  Third  World 
countries  have  been  subjected:  They  have  been 
underdeveloped  as  an  intentional  by-product  of 
the  development  of  the  advanced  core  countries. 
Earlier  forms  of  society  in  the  Third  World  may 
have  been  undeveloped;  underdevelopment  begins 
only  with  the  arrival  of  a  global  capitalist  system. 
That  is,  global  capitalism  in  one  single  process 
generates  development  and  wealth  (in  the  industri¬ 
alized  world)  and  underdevelopment  and  poverty 
(in  the  Third  World). 

This  is  due  to  what  Frank  (1967)  calls  the  three 
basic  contradictions  in  capitalism:  (1)  the  expro¬ 
priation/appropriation  of  economic  surplus,  (2)  the 
polarization  into  metropolitan  center  and  periph¬ 
eral  satellites,  and  (3)  the  continuity  of  the  capitalist 
structure  and  its  generation  of  underdevelopment. 
In  other  words,  the  core  countries  began  the  exploi¬ 
tation  of  the  periphery  early — in  the  16th  century  in 
Chile’s  case.  The  core  appropriates  economic 
surplus  from  the  satellites  and  uses  it  for  its  own 
development.  Some  industrial  development  in  the 
periphery  may  be  possible,  but  it  is  dominated  by 
external  economic  interests;  these  external  forces 
result  in  crippled  and  distorted  societal  structures 
inside  peripheral  countries.  To  overcome  underde¬ 
velopment,  a  delinking  from  external  dominance  is 
required. 

Frank’s  analysis  is  simple  and  clear.  It  is  also 
conceptually  imprecise.  “Capitalism”  is  said  to 
have  existed  since  the  16th  century;  “economic 
surplus”  has  been  appropriated  by  the  core  for 
several  centuries;  and  “underdevelopment”  char¬ 
acterizes  the  periphery  irrespective  of  the  level  of 
industrialization.  Consider  the  example  of  Brazil. 
From  the  sugar  mills  run  by  slaves  in  the  17th  cen¬ 
tury  to  the  advanced  petrochemical  industries  of 
today  where  transnational  corporations  partici¬ 
pate,  history  merely  repeats  itself,  in  that  global 


capitalism  creates  underdevelopment  because  of 
the  system’s  basic  contradictions.  In  one  sense, 
Frank’s  dependency  theory  turns  modernization 
theory  on  its  head:  What  comes  out  of  the  core  is 
always  bad  (whereas  for  modernization  theory,  it 
is  always  good),  and  capitalism  perpetuates  under¬ 
development.  As  Aidan  Foster-Carter  (1976) 
notes,  where  modernization  theory  had  difficul¬ 
ties  in  explaining  why  “traditional”  societies 
would  inevitably  move  toward  modernity,  depen¬ 
dency  theory  has  difficulties  in  explaining  why  the 
result  must  always  be  underdevelopment  rather 
than  development. 

The  Egyptian  economist  Samir  Amin  basically 
agrees  with  Frank,  but  he  proposes  an  analysis  that 
tackles  some  of  the  problems  mentioned  here. 
Amin  focuses  on  Africa.  He  analyses  the  different 
historical  phases  of  African  dependency  and  argues 
that  the  specific  structure  of  peripheral  economies 
is  based  on  export  of  raw  materials  and  import  of 
(mostly  luxury)  consumer  goods.  Yet  Amin  agrees 
with  Frank  that  any  economic  development  of  the 
periphery  is  blocked  due  to  the  subordination  of 
the  periphery  to  the  core. 

Criticism  of  the  “Blocked 
Development”  Theory 

From  early  on,  this  radical  claim  of  “blocked 
development”  in  the  periphery  had  been  met  with 
skepticism  by  several  Latin  American  scholars. 
Already  in  1969,  Cardoso  and  Enzo  Faletto  had 
published  a  book  on  dependency  and  development 
in  Latin  America  rejecting  the  soundness  of  a  gen¬ 
eral  theory  of  “blocked  development”  and  arguing 
in  favor  of  concrete  historical  analysis  of  specific 
situations  of  dependency.  In  the  mid-1970s, 
Cardoso  published  a  critique  of  the  radical  claims, 
exposing  what  he  called  “five  false  theses”  about 
capitalism  in  Latin  America.  Cardoso  argued  that 
some  form  of  “dependent-associated  develop¬ 
ment”  was  possible  in  the  periphery  through  a 
form  of  capital  accumulation  similar  to  capitalism 
in  the  advanced  countries. 

This,  then,  was  the  major  issue  of  contention 
among  dependency  theorists  in  the  1970s:  Is  the 
periphery  doomed  to  nondevelopment?  Or,  is  it 
more  relevant  to  speak  of  a  process  of  “dependent 
development”  formed  by  the  specific  national  con¬ 
ditions  in  each  country,  especially  by  the  relative 
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strength  and  alliance  of  social  classes  and  factions? 
The  case  of  Brazil  would  appear  to  support  the 
latter  view:  Industrial  development  in  that  country 
was  shaped  by  an  alliance  between  elements  of  the 
international  bourgeoisie,  the  local  bourgeoisie, 
and  public  and  entrepreneurial  bureaucratic  elites. 
The  North  American  sociologist  Peter  Evans  wrote 
a  well-known  analysis  of  Brazil  in  1979  based  on 
this  view. 

Most  of  the  critique  of  dependency  theory, 
which  emerged  in  the  1970s  and  1980s,  was 
directed  against  the  “blocked  development”  view 
of  Frank,  Amin,  and  others.  A  number  of  countries 
in  Southeast  Asia,  most  notably  the  “Four  Tigers” 
(South  Korea,  Taiwan,  Singapore,  and  Hong 
Kong),  experienced  rapid  economic  growth  com¬ 
bined  with  world  market  integration.  That  was  a 
blow  to  radical  dependency  theory’s  prediction  of 
stagnation  and  misery  in  the  periphery.  The 
“dependent  development”  view  was  less  easy  to 
reject.  At  the  same  time,  Cardoso’s  theoretical 
views  were  also  less  clear  because  they  tied  the  fate 
of  peripheral  countries  to  particular  historical  cir¬ 
cumstances.  That  is  a  difficult  starting  point  for 
attempts  at  a  general  theory  of  development  and 
underdevelopment.  (Eater,  Cardoso  entered  active 
politics  and  was  president  of  Brazil  for  two 
periods — between  1995  and  2003.) 

World  Systems  Theory 

Some  of  the  core  claims  of  radical  dependency 
theory  were  repeated  in  the  world  system  analysis 
by  Immanuel  Wallerstein.  At  the  same  time, 
Wallerstein  repaired  some  of  the  theory’s  weak¬ 
nesses.  His  focus  was  on  the  capitalist  world 
economy  established  in  “the  long  16th  century” 
(1450-1640).  It  was  based  on  an  international 
division  of  labor  that  covered  Europe  first  but 
soon  expanded  to  the  Western  hemisphere  and 
later  to  other  parts  of  the  world  as  well.  Within 
this  division  of  labor,  a  process  of  specialization 
took  place;  at  first,  this  somehow  happened  acci¬ 
dentally.  For  a  number  of  reasons,  Northwest 
Europe  was  in  a  better  position  to  diversify  its 
agriculture  and  to  connect  it  with  industrial 
advances  in  textiles  and  shipping.  So  the  capitalist 
world  economy  is  built  on  a  hierarchy  of  core 
areas,  peripheral  areas,  and  semiperipheral  areas. 
The  core  areas  contain  the  advanced  and  complex 


economic  activities  (mass  market  industries  and 
sophisticated  agriculture).  Furthermore,  these 
activities  are  controlled  by  an  indigenous  bour¬ 
geoisie.  Peripheral  areas  are  at  the  bottom  of  the 
hierarchy.  They  produce  staple  goods  such  as 
grain,  wood,  and  sugar.  They  often  employ  slavery 
or  coerced  labor.  Semiperipheral  areas  are  eco¬ 
nomically  mixed  and  constitute  a  middle  layer 
between  the  upper  stratum  of  the  core  and  the 
lower  stratum  of  the  peripheral  countries. 

A  basic  mechanism  of  the  capitalist  world 
economy  is  unequal  exchange.  Economic  surplus  is 
transferred  from  the  periphery  to  the  core.  This 
transfer  is  further  accentuated  by  the  emergence  of 
strong  state  machineries  in  the  core  and  weak  state 
machineries  in  the  periphery.  Strong  states  can 
enforce  unequal  exchange  on  weak  ones.  In  the 
process  of  unequal  exchange,  tensions  are  created 
in  the  system.  The  semiperiphery  has  an  important 
function  in  this  regard.  It  provides  an  element  of 
political  stability  because  the  core  countries  are 
not  facing  a  unified  opposition.  The  semiperiphery 
acts  as  a  buffer  or  shock  absorber.  At  the  same 
time,  the  world  economy  is  not  entirely  static;  any 
single  area  of  the  system  may  change  from  periph¬ 
ery  to  semiperiphery,  from  semiperiphery  to  core, 
and  vice  versa.  But  Wallerstein  emphasizes  that  the 
capitalist  system  as  such  does  not  change.  It 
remains  a  hierarchy  of  core,  semiperiphery,  and 
periphery,  characterized  by  unequal  exchange. 

Wallerstein  sees  the  end  of  the  Cold  War  and 
the  destruction  of  the  Soviet  bloc  as  a  consequence 
of  the  development  of  the  capitalist  world  econ¬ 
omy.  However,  the  long-term  prospect  is  the 
demise  of  the  capitalist  system  because  the  contra¬ 
dictions  of  that  system  are  now  unleashed  on  a 
world  scale.  Success,  not  failure,  is  the  real  threat 
to  global  capitalism.  When  the  possibilities  for 
expansion  are  all  used  up,  the  never-ending  quest 
for  more  profit  will  lead  to  new  crises  in  the  world 
capitalist  economy,  which  sooner  or  later  will  spell 
its  transformation. 

Wallerstein  injects  some  dynamic  elements  into 
the  picture  of  unequal  exchange  and  exploitation 
of  the  periphery  by  the  core.  Single  countries  can 
move  up  and  down  in  the  system.  The  semiperiph¬ 
ery  is  categorized  as  a  new  “buffer  zone.”  Politics 
and  conditions  in  single  countries  play  a  larger 
role.  But  still,  focus  remains  on  economics  and  the 
subordination  of  the  periphery.  Recent  analyses 
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such  as  that  of  John  Darwin  (2007)  reject  that 
view  as  one  dimensional  and  reductionist.  Already 
by  the  early  1990s,  several  areas  in  the  Third 
World  experienced  dynamics  of  development  dif¬ 
ficult  to  reconcile  with  radical  dependency  analysis 
even  in  its  more  sophisticated  Wallerstein  version. 
Instead,  more  emphasis  was  laid  on  the  moderate 
“dependent  development”  view,  which  was  now 
combined  with  new  analyses  of  the  international¬ 
ization  of  capital  or  economic  globalization. 

New  Approaches 

Cardoso  and  Faletto  studied  the  internationaliza¬ 
tion  of  capital  as  seen  from  the  periphery.  A  new 
generation  of  analyses  took  up  this  topic  as  seen 
from  the  core  countries.  This  latter  approach  had 
already  been  used  by  the  classical  theories  of  impe¬ 
rialism,  for  example,  those  of  Lenin,  Rosa 
Luxembourg,  John  Hobson,  and  others.  The  more 
recent  generation  of  studies  focuses  on  the  so- 
called  new  international  division  of  labor  where 
industrial  production  is  relocated  from  the  core  to 
the  periphery  in  search  of  cheap  labor  and  other 
favorable  conditions  of  operations  and/or  in  search 
of  local  markets  that  could  not  easily  be  accessed 
from  the  outside.  A  pioneering  study  from  the 
three  German  scholars  Folker  Frobel,  Jurgen 
Heinrichs,  and  Otto  Kreye  argued  that  already 
during  the  1960s,  a  world  market  for  industrial 
activity  emerged.  It  was  based  on  three  factors: 

(1)  a  huge  supply  of  cheap  labor  with  basic  skills 
in  the  Third  World  and  in  Eastern  Europe, 

(2)  communication  and  transport  technologies 
that  facilitate  relocation  on  a  global  scale,  and 

(3)  production  technologies  allowing  the  separa¬ 
tion  of  complex  processes  of  production  into  rela¬ 
tively  simple  component  parts.  The  analysis  points 
out  that  not  only  labor-intensive  industries,  such 
as  textiles,  clothing,  footwear,  and  toys,  but  also 
industries  that  are  associated  with  the  production 
of  electronic  parts,  machine  building,  and  com¬ 
puter  manufacturing  are  good  candidates  for 
global  scale  manufacturing. 

Other  studies  focused  on  the  attempt  to  access 
lucrative  domestic  markets  in  the  Third  World  as 
the  major  push  to  internationalization.  In  Brazil, 
foreign  investment  helped  create  a  substantial  auto¬ 
mobile  industry  and  also  participated  in  the  devel¬ 
opment  of  petrochemical,  computer,  and  aircraft 


industries.  These  developments  raised  the  issue 
anew  of  whether  foreign  capital  should  be  seen  as 
a  primarily  “constructive”  or  primarily  “destruc¬ 
tive”  factor  in  terms  of  economic  development  in 
the  periphery.  Cardoso  had  hinted  at  both:  There 
was  development,  but  there  was  also  dependency. 
The  new  generation  of  analyses  further  developed 
this  view.  It  is  argued  that  foreign  transnationals 
can  be  both  “constructive”  and  “destructive”  in 
the  periphery.  Whether  they  are  primarily  one 
or  the  other  depends  on  the  conditions  under  which 
the  companies  operate  in  the  host  countries.  A  deci¬ 
sive  element  in  this  regard  concerns  the  power  posi¬ 
tion  (economically  and  politically)  of  local  capital 
relative  to  that  of  foreign  capital.  The  stronger  the 
position  held  by  local  capital  in  an  industry,  the 
better  the  possibilities  of  benefiting  from  the  “con¬ 
structive”  effects  and  avoiding  the  “destructive” 
effects  of  transnationals  (Georg  Sorensen,  1983).  In 
other  words,  successful  modernizers,  such  as  the 
Asian  “tigers,”  China,  Brazil,  and  India,  will  be  bet¬ 
ter  positioned  to  reap  benefits  from  foreign  invest¬ 
ment  than  will  very  poor  and  weak  states  (in  sub- 
Saharan  Africa  and  elsewhere)  where  local  capital 
is  next  to  nonexistent,  and  the  state  is  incapable  of 
effective  regulation  and  control. 

In  sum,  even  if  one  can  still  find  examples  of 
radical  dependency  analysis,  it  has  gone  into 
decline.  The  1980s  and  1990s  saw  a  strong  revival 
of  liberal  economic  ideas  in  development  thinking. 
Ronald  Reagan’s  presidency  in  the  United  States 
and  Margaret  Thatcher’s  administration  in  the 
United  Kingdom  both  promoted  liberal  policies 
that  emphasized  the  role  of  free  market  forces  and 
the  downsizing  of  state  bureaucracies  and  state 
regulations.  Third  World  countries  were  encour¬ 
aged  to  pursue  similar  policies. 

Some  scholars  continue  to  employ  radical 
dependency  ideas  in  their  analysis  of  macrohistori- 
cal  developments  of  capitalism,  but  most  analyses 
of  Third  World  countries  have  moved  in  different 
directions.  The  period  dominated  by  liberal  eco¬ 
nomics  in  the  1980s  and  part  of  the  1990s  also  had 
dissenting  voices.  Some  of  these  voices  were 
inspired  by  the  moderate  “dependent  develop¬ 
ment”  position  in  their  attempt  to  evaluate  pros¬ 
pects  for  Third  World  development.  This  current 
view  is  critical  of  neoliberal  free  market  ideology. 
We  may  call  it  modern  mercantilism.  Modern  mer¬ 
cantilism  suggests  that  Third  World  countries  need 
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to  steer  between  the  extremes  of  economic  delinking 
on  the  one  hand  and  full  integration  in  the  world 
economy  on  the  other.  They  argue  that  the  success 
of  economic  development  of  East  Asian  countries  is 
precisely  due  to  their  pursuit  of  a  modern  mercantil¬ 
ist  strategy.  A  second  core  area  of  development 
where  the  modern  mercantilists  strike  a  balance 
concerns  the  market  and  the  state.  Economic  liber¬ 
als  argue  that  free  market  forces  and  a  minimal  role 
for  the  state  are  best  for  the  promotion  of  economic 
development.  Mercantilists  reply  that  there  may  be 
serious  flaws  in  the  alleged  efficiency  of  the  market. 
Furthermore,  there  is  no  firm  support  in  economic 
theory  for  maintaining  that  state  intervention  is  by 
definition  counterproductive.  The  fact  that  some 
interventions  are  flawed  does  not  constitute  a  case 
against  intervention  per  se.  South  Korea’s  and 
Taiwan’s  development  achievements  have  been 
based  on  states  actively  working  toward  building 
desire  structures  of  production. 

Modern  mercantilism,  then,  is  based  on  three 
ideas:  (1)  strike  a  balance  between  national  auton¬ 
omy  and  international  integration — that  is, 
between  incorporation  into  the  world  market  and 
self-reliance;  (2)  strike  a  balance  between  state 
and  market — that  is,  between  free  market  forces 
and  state  regulation;  (3)  foreign  direct  investment 
by  TNCs  can  be  a  strong  force  for  modernization 
but  only  provided  that  TNCs  are  counterbalanced 
by  local  industry  and  host  government  supervision. 

In  conclusion,  radical  dependency  theory  is  no 
longer  a  major  approach  to  Third  World  develop¬ 
ment  problems,  but  elements  of  moderate  depen¬ 
dency  theory  remain  centrally  relevant  for  thinking 
about  economic  development  in  the  periphery. 

Georg  Sorensen 
Aarhus  University 
Aarhus,  Denmark 

See  also  Imperialism;  International  Trade;  Modernization 
Theory;  Multinational  Corporations  (MNCs) 
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Deregulation 


In  the  broadest  sense,  deregulation  can  be  described 
as  the  nonmarginal  reduction  or  total  withdrawal 
of  the  state’s  legal  control  over  the  social  and  eco¬ 
nomic  activities  of  firms  and  citizens.  Although 
deregulation  is  often  associated  with  the  aim  of 
increasing  competition  in  economic  sectors,  such 
as  transport,  energy,  and  audiovisual  markets,  the 
concept  covers  social  and  administrative  regula¬ 
tion  too.  In  this  case,  competition  is  not  the  only 
aim  of  deregulation,  since  social  regulation  can  be 
inspired  by  the  notion  of  increasing  access  (i.e., 
via  simplification)  and  participation  or  the  aim  of 
respecting  individual  choice  in  delicate  areas  (e.g., 
the  consumption  of  recreational  drugs).  Admin¬ 
istrative  deregulation  has  triggered  ambitious 
government  programs  to  reduce  administrative 
burdens  affecting  citizens  and  firms  in  Europe, 
North  America,  and  some  developing  countries. 

However,  the  notion  of  deregulation  must  be 
put  in  context  to  identify  its  purpose  and  implica¬ 
tions  as  well  as  the  inextricable  link  with  its 
counterpart — regulation.  Indeed,  when  new  policy 
is  designed,  deregulation  is  one  of  the  instruments 
of  regulatory  choice,  together  with  self-regulation, 
regulation  by  information,  coregulation,  com¬ 
mand,  and  control  regulation. 
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Deregulation  can  be  analyzed  from  both  his¬ 
torical  and  theoretical  angles.  In  turn,  these  per¬ 
spectives  are  linked  to  three  clusters  of  questions: 
(1)  How,  where,  and  when  does  deregulation 
occur?  (2)  Why  does  it  occur?  (3)  What  (actors  or 
structural  forces)  caused  it?  In  addition,  a  geo¬ 
graphical  angle  is  also  needed,  especially  in  light  of 
debates  on  regulatory  reform  and  the  diverse  mean¬ 
ing  that  deregulation  has  in  different  continents, 
transition  economies,  and  leading  economies. 

Historical  Perspective 

Historically,  deregulation  emerged  as  a  response  to 
the  welfare  and  Keynesian  programs  of  the  1950s 
and  1960s  and  the  economic  instability  of  the 
1970s.  Conventionally,  deregulation  is  associated 
with  the  rise  to  power  of  Margaret  Thatcher  in  the 
United  Kingdom  (1979)  and  Ronald  Reagan  in  the 
United  States  (1981).  In  this  received  view,  deregu¬ 
lation  is  associated  with  the  economic  paradigm  of 
neoliberalism.  The  international  diffusion  of  dereg¬ 
ulation  was  facilitated  by  the  programs  of  the 
International  Monetary  Fund  (IMF;  e.g.,  in  Latin 
America),  the  ideas  cementing  the  Washington 
Consensus,  and  the  promotion  of  approaches  to 
deregulation,  such  as  the  ones  pursued  by  labor 
governments  in  Australia  and  New  Zealand.  When 
labor  governments  joined  the  bandwagon  of  dereg¬ 
ulation,  the  international  spread  of  the  phenome¬ 
non  seemed  to  some  observers  universal  and 
unstoppable.  The  conventional  wisdom,  however, 
ignores  that  (a)  as  a  matter  of  fact,  the  total 
amount  of  regulation  continued  to  grow  in  the 
1980s  and  is  still  on  the  rise;  (b)  the  association 
between  deregulation  and  other  so-called  neolib- 
eral  ideas  is  dubious,  as  shown  by  the  spurious 
correlation  between  deregulation  and  privatiza¬ 
tion;  (c)  there  was  strong  political  opposition  to 
deregulation  ideas  (“genuine,”  “persistent,”  and 
“vicious,”  according  to  Sam  Peltzman),  hence  they 
were  never  uncontested  and  triumphant;  with  the 
financial  crisis  triggered  by  the  credit  crunch  of 
2008-2009,  deregulatory  ideas  lost  much  of  their 
intellectual  appeal,  amid  several  calls  for  “new” 
forms  of  regulatory  capitalism;  and  (d)  in  conti¬ 
nental  Europe  at  least,  the  deregulation  of  domes¬ 
tic  utilities  and  other  crucial  markets  was  caused 
by  a  newly  emerging  regulatory  power,  that  is,  the 
European  Union  (EU)  as  “regulatory  state.” 


National  rules  were  removed  and  remodeled,  but 
this  happened  in  the  context  of  a  transfer  of  regu¬ 
latory  powers  to  the  supranational  European  level 
to  facilitate  the  creation  of  the  EU  Internal  Market. 

Theoretical  Perspectives 

The  theoretical  perspectives  address  the  question  of 
why  this  phenomenon  occurs  in  the  first  place. 
Basically,  there  are  four  theoretical  rationales  or 
explanations.  The  first  looks  at  the  politics  of  ideas. 
As  mentioned,  the  conventional  wisdom  of  deregu¬ 
lation  points  in  the  direction  of  the  ideas  of  the  New 
Right,  the  economic  policy  missionaries  of  interna¬ 
tional  organizations,  and  the  doctrines  of  economic 
rationalism.  However,  a  more  accurate  interpreta¬ 
tion  in  terms  of  politics  of  ideas  was  provided  by 
Martha  Derthick  and  Paul  J.  Quirk.  They  argue 
that  deregulation  was  the  product  of  successful, 
convincing,  evidence-based  economic  arguments 
that  demonstrated  the  large  diffuse  benefits  of 
deregulation  to  the  public  and  politicians,  thus 
overcoming  the  resistance  of  client  politics,  that  is, 
the  narrow,  concentrated  benefits  from  regulatory 
rents.  But  interest  group  theorists  propose  a  second 
explanation,  rooted  in  the  advantages  that  firms 
exposed  to  international  competition  gain  from 
deregulation  (a  variant  of  this  explanation  looks  at 
the  power  of  corporate  consumers  or  business 
consumers).  A  third  explanation  turns  to  historical 
narratives  and  sees  deregulation  as  the  effect  of  pre¬ 
vious  dynamics  of  regulatory  policy — a  case  of 
policy  self-destruction  once  regulation  reached  its 
politically  sustainable  climax.  By  focusing  on  the 
formal  analysis  of  the  rules  of  the  political  system 
(but  often  blended  with  historical  narratives  of  spe¬ 
cific  cycles  of  deregulation),  a  fourth  explanation 
sheds  light  on  the  institutional  capacity  for  reform, 
showing  when  and  how  institutions,  often  in  com¬ 
bination  with  the  emergence  of  policy  entrepreneurs 
or  reform  champions,  can  overcome  the  inertial 
dynamics  of  regulatory  agencies  and  client  politics. 
This  fourth  explanation  ties  in  with  the  question  of 
who  (de)regulates.  Comparative  research  has 
explained  the  different  incentives  and  constraints  to 
deregulation,  how  institutional  rules  and  legacies 
provide  opportunity  structures  to  reform  agents, 
and  finally  how  these  structures  can  be  changed. 

The  natural  complement  to  these  theoretical 
interrogations  is  provided  by  empirical  studies  of 
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deregulatory  policies  and  their  actual  outcome  in 
several  sectors.  Analyses  often  focus  on  the  eco¬ 
nomic  deregulation  of  specific  industries  and  show 
the  redistribution  of  the  costs  and  benefits  of 
deregulation  across  different  societal  groups.  While 
the  effects  of  (economic)  deregulation  on  prices, 
competition,  and  market  structure  are  clearly  iden¬ 
tifiable  in  several  instances,  at  times,  the  final  out¬ 
come  of  this  policy  cannot  be  fully  separated  from 
other  variables  such  as  technological  change  and, 
more  broadly,  from  other  ingredients  of  regulatory 
reform  (e.g.,  reregulation). 

Regulatory  Strategies 

Contemporary  capitalism  is  an  era  of  regulatory 
flux  where  regulatory,  deregulatory,  and  reregu- 
latory  shifts  are  occurring  simultaneously.  New 
regulatory  domains  have  emerged,  such  as  risk, 
the  regulation  of  private  security  companies, 
financial  (re)regulation,  and  corporate  gover¬ 
nance,  as  well  as  ambitious  attempts  for  forging 
international  regulatory  cooperation  in  areas 
such  as  climate  change  and  intellectual  property 
rights. 

Deregulation  was  widely  adopted  by  develop¬ 
ing  countries  with  the  aims  of  increasing  foreign 
direct  investment  and  reducing  regulatory  burdens 
on  endogenous  growth.  Yet  deregulatory  failures, 
especially  but  not  exclusively  in  developing  coun¬ 
tries,  have  triggered  a  recalibration  of  policy  strat¬ 
egies.  The  emphasis  has  shifted  from  deregulation 
to  regulatory  quality — the  question  is  not  the  total 
level  of  regulation  but  the  efficiency,  accountabil¬ 
ity,  consistency,  and  transparency  of  regulation.  In 
turn,  research  on  regulatory  quality  has  shown 
that  it  cannot  be  achieved  by  simply  clamping 
down  on  the  total  number  of  rules.  It  requires  a 
proper  institutional  design  of  regulatory  bodies 
and  regulatory  oversight  institutions.  This  explains 
the  political  attention  to  institutional  innovations 
such  as  independent  regulatory  authorities  and 
other  nonmajoritarian  bodies,  the  economic  anal¬ 
ysis  of  proposed  regulation  (also  known  as  regula¬ 
tory  impact  assessment),  and  the  role  of  central 
oversight  units  in  charge  of  regulatory  quality 
goals  (Giandomenico  Majone,  2001;  OECD, 
2002).  It  is  regulatory  quality,  rather  than  the 
total  level  of  regulation,  that  has  been  identified  as 
an  important  factor  of  growth  (see  Hossein 


Jalilian,  Colin  Kirkpatrick,  &  David  Parker, 
2007,  for  econometric  evidence  on  developing 
countries). 

Thus,  although  at  the  time  some  authors  inter¬ 
preted  deregulation  as  yet  another  sign  of  the  end 
of  history,  the  academic  analysis  and  the  policy 
debate  on  deregulation  is  very  much  alive  30  years 
after  Ronald  Reagan  famously  said  on  January  20, 
1981,  in  his  inauguration  speech:  “Government  is 
not  the  solution  to  our  problem,  government  is  the 
problem.” 

Claudio  M.  Radaelli  and  Lorna  Scbrefler 
University  of  Exeter 
Exeter,  United  Kingdom 
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Detente 


Detente  is  a  word  of  French  origin  meaning  “a 
relaxation  of  tension.”  In  its  traditional  diplomatic 
usage,  detente  has  most  commonly  been  used  to 
indicate  a  reduction  of  tension  between  two  adver¬ 
sarial  states.  Detente  has  alternately  been  used  to 
define  a  condition,  a  process,  a  policy,  and  a 
period  of  history.  The  concept  is  not  synonymous 
with  peace  or  a  natural  harmony  of  interests.  In 
fact,  a  central  element  to  any  definition  of  detente 
is  the  precondition  of  intense  tension  and  rivalry 
between  two  states  with  substantial  conflicting 
interests.  Detente,  as  Richard  W.  Stevenson  (1985) 
proposed,  might  be  best  defined  as  “the  process  of 
easing  tension  between  states  whose  interests  are 
so  radically  divergent  that  reconciliation  is  inher¬ 
ently  limited”  (p.  11).  Within  that  process,  policies 
of  engagement,  accommodation,  and  cooperation 
may  be  pursued  by  adversarial  states  to  reduce  the 
tension  created  by  their  underlying  conflict,  which 
left  unrestrained  might  eventually  lead  to  war. 

Detente  first  became  part  of  diplomatic  parlance 
in  1908  as  the  European  great  powers  attempted  to 
reduce  the  tension  created  by  the  then  crises  in 
Morocco  and  other  imperial  rivalries.  However, 
detente  is  most  famously  associated  with  the  foreign 
policy  of  former  U.S.  President  Richard  Nixon  and 
National  Security  Advisor — eventually  Secretary  of 
State — Henry  Kissinger  during  the  1970s.  When 
Nixon  came  to  power  in  1969,  constant  threat  of 
nuclear  war  and  deep  hostility  of  the  Cold  War  had 
characterized  U.S. -Soviet  relations  for  more  than 
2  decades.  To  be  sure,  the  real  fear  of  nuclear  catas¬ 
trophe  created  by  the  Cuban  Missile  Crisis  of  1962 
introduced  an  element  of  caution  into  the  Cold  War 
era  and  was  perhaps  the  impetus  for  agreements 
such  as  the  Partial  Test  Ban  Treaty  and  the  Nuclear 
Non-proliferation  Treaty,  but  U.S.-Soviet  relations 
throughout  the  late  1960s  continued  to  be  marked 
by  suspicion  and  intense  ideological  rivalry. 


International  conflict,  such  as  the  Soviet  invasion  of 
Czechoslovakia  in  1968  and  continuing  American 
war  against  communist  forces  in  Vietnam,  only 
heightened  the  danger  of  U.S.-Soviet  military  con¬ 
frontation  and  possible  escalation. 

Nixon  and  Kissinger  pursued  detente  in  this 
strategic  context.  The  first  indication  of  the  new 
U.S.  approach  to  the  Soviet  Union  and  China 
appeared  in  Nixon’s  inaugural  address  on  January 
20,  1969,  when  he  declared,  “After  a  period  of 
confrontation,  we  are  entering  an  era  of  negotia¬ 
tion.”  Furthermore,  Nixon  sketched  a  rough  out¬ 
line  of  the  framework  for  detente  when  he  invited 
the  adversaries  of  the  United  States  into  a  “peace¬ 
ful  competition”  and  proposed  cooperation  to 
“reduce  the  burden  of  arms.”  The  elements  of 
arms  control  and  restrained  competition  would 
come  to  be  central  features  of  the  Nixon-Kissinger 
detente  efforts. 

The  mechanics  of  detente  were  primarily  left  to 
Kissinger,  who  as  Nixon’s  national  security  advi¬ 
sor  centralized  control  of  foreign  policy  in  the 
White  House  under  the  auspices  of  the  National 
Security  Council  (NSC).  Circumventing  the  tradi¬ 
tional  foreign  policy  bureaucracy  of  the  State 
Department,  Kissinger  established  a  “back  chan¬ 
nel”  with  the  Soviet  ambassador  to  the  United 
States,  Anatoly  Dobrynin.  This  back  channel  pro¬ 
vided  an  outlet  for  Nixon  to  communicate  directly 
with  Communist  Party  General  Secretary  Leonid 
Brezhnev  on  sensitive  bilateral  issues  and  during 
emerging  crises.  The  origins  of  many  of  the  signifi¬ 
cant  U.S.-Soviet  agreements  and  accords  of  the 
detente  era  can  be  traced  to  the  Kissinger-Dobrynin 
back  channel. 

The  Nixon-Kissinger  approach  to  detente  can 
be  viewed  as  an  attempt  to  find  a  modus  vivendi 
with  the  ascendant  Soviet  Union.  By  the  late 
1960s,  the  Soviet  Union  had  achieved  a  rough  par¬ 
ity  with  the  United  States  in  terms  of  strategic 
military  power,  and  detente  provided  Nixon  and 
Kissinger  with  a  method  of  managing  the  Soviet 
Union’s  rise  to  true  superpower  status  while  con¬ 
currently  attempting  to  mitigate  the  negative  after- 
math  of  the  Vietnam  War  on  U.S.  power  and 
prestige.  Nixon  and  Kissinger  promoted  the  con¬ 
cept  of  linking  multiple  policy  issues  to  gain 
greater  diplomatic  leverage  over  the  Soviet  Union. 
In  their  view,  linkage  politics  provided  a  method  of 
addressing  a  broad  range  of  issues  while  trying  to 
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prevent  any  single  issue  from  derailing  the  overall 
process  and  milieu  created  by  detente.  Despite  the 
Soviet  Union’s  emergence  as  a  global  power  in  the 
military  sense,  Soviet  power  was  not  nearly  as  mul¬ 
tidimensional  as  U.S.  power.  As  a  result,  there 
were  power  asymmetries,  which  provided  oppor¬ 
tunities — in  Kissinger’s  view — for  the  United  States 
to  exploit  these  asymmetries  through  linkage  poli¬ 
tics  thereby  restraining  Soviet  actions. 

Detente  between  the  United  States  and  the 
Soviet  Union,  as  Mike  Bowker  and  Phil  Williams 
have  noted,  consisted  of  four  primary  and  inter¬ 
connected  elements:  (1)  European  normalization, 
(2)  arms  control,  (3)  codes  of  conduct,  and 
(4)  East-West  trade.  The  issue  of  normalizing 
European  relations  with  the  Soviet  Union  and 
Eastern  Bloc  countries  had  remained  unresolved 
following  the  end  of  World  War  II,  and  in  1969, 
West  German  Chancellor  Willy  Brandt  instituted  a 
policy  of  Ostpolitik  (Eastern  politics)  designed  to 
improve  and  increase  relations  between  West 
Germany,  East  Germany,  and  the  Soviet  Union. 
Detente  in  Europe  proceeded  slightly  apart  from 
superpower  detente  despite  Kissinger’s  efforts  to 
prevent  any  separation  of  the  two  diplomatic 
tracks.  In  1970,  the  Moscow  Treaty  formally  rec¬ 
ognizing  Europe’s  post-World  War  II  borders  and 
the  division  of  Germany  was  signed  by  West 
Germany  and  the  Soviet  Union.  The  historically 
thorny  issue  of  Berlin  involved  greater  participation 
by  the  superpowers,  and  the  Kissinger-Dobrynin 
back  channel  proved  to  be  a  useful  device  during 
these  negotiations.  The  formal  agreement  on  the 
status  of  Berlin  was  signed  by  the  four  occupying 
powers — (1)  the  United  States,  (2)  the  United 
Kingdom,  (3)  France,  and  (4)  the  former  USSR — in 
September  1971.  Several  other  treaties  normalizing 
the  borders  and  relations  between  West  Germany 
and  other  Eastern  Communist  states,  including 
Poland  (1970),  East  Germany  (1972),  and 
Czechoslovakia  (1973),  were  enacted  during  the 
early  years  of  European  detente.  The  normalization 
of  European  relations  through  detente  did  much  to 
relax  the  tension  on  what  had  historically  been  one 
of  the  Cold  War’s  central  fronts. 

The  issue  of  bilateral  arms  control  was  perhaps 
the  centerpiece  of  detente  between  the  United 
States  and  the  Soviet  Union.  Preliminary  negotia¬ 
tions  on  arms  control,  commonly  known  as  the 
Strategic  Arms  Limitation  Talks  (SALT),  began  in 


November  1969,  but  little  progress  was  made  until 
the  Kissinger-Dobrynin  back  channel  reportedly 
produced  a  breakthrough  in  May  1971.  Protracted 
negotiations  resulted  in  the  Anti-Ballistic  Missile 
(ABM)  Treaty  and  the  Interim  Agreement  on  the 
Limitation  of  Strategic  Offensive  Arms,  which 
were  signed  in  May  1972  by  Nixon  and  Leonid 
Brezhnev  at  a  highly  publicized  summit  meeting  in 
Moscow.  The  ABM  Treaty  limited  the  number  of 
missile  defense  sites  in  either  country  to  a  total  of 
two  and  prohibited  the  further  development,  test¬ 
ing,  and  deployment  of  various  types  of  ABM 
capabilities.  The  Interim  Agreement  limited  the 
number  of  land-based  Inter-Continental  Ballistic 
Missiles  (ICBMs),  Submarine  Launched  Ballistic 
Missiles  (SLBMs),  and  ballistic  missile  submarines. 
The  ABM  Treaty  and  the  Interim  Agreement  aris¬ 
ing  from  SALT  were  hailed  as  significant  diplo¬ 
matic  accomplishments  critical  for  sustaining 
superpower  detente. 

The  1972  Moscow  summit  also  produced  dip¬ 
lomatic  agreements  designed  to  provide  a  frame¬ 
work  for  limiting  the  possibility  of  escalation 
resulting  from  superpower  competition.  The  Basic 
Principles  Agreement  (BPA),  also  known  as  the 
Basic  Principles  of  U.S.-Soviet  Relations,  repre¬ 
sented  a  loose  code  of  conduct  for  the  continuation 
of  detente.  The  superpowers  pledged  to  avoid 
actions  that  might  increase  bilateral  tension  and 
the  risk  of  military  confrontation,  refrain  from 
threats  or  use  of  force  against  allies  or  each  other, 
urgently  consult  each  other  during  crises  featuring 
a  possibility  of  a  nuclear  conflict,  and  use  negotia¬ 
tion  to  settle  disputes  peacefully.  Additionally,  the 
Agreement  on  Avoiding  Incidents  at  Sea  was  also 
signed  at  the  Moscow  summit.  Nixon’s  desire  for 
“peaceful  competition”  and  Kissinger’s  plan  for 
using  detente  to  construct  a  more  stable  interna¬ 
tional  system  based  on  a  triangular — United  States, 
former  USSR,  and  China — balance  of  power  were 
readily  apparent  in  the  form  and  substance  of  these 
agreements.  Similarly,  the  June  1973  U.S.-Soviet 
summit  meeting  in  Washington,  D.C.,  resulted  in 
further  diplomatic  agreements  based  on  the  emerg¬ 
ing  detente.  Nixon  and  Brezhnev  signed  a  declara¬ 
tion  of  principles  designed  to  jumpstart  the  stalled 
Strategic  Arms  Reduction  Treaty  II  (START  II) 
negotiations,  and  they  also  signed  the  Agreement 
on  the  Prevention  of  Nuclear  War  that  elaborated 
on  the  Basic  Principles  Agreement. 
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Improved  U.S.-Soviet  relations  in  the  early 
1970s  also  provided  an  opportunity  to  expand 
commercial  ties.  U.S.-Soviet  trade  dramatically 
increased  during  the  early  years  of  detente,  and 
there  was  even  an  effort  to  grant  most  favored 
nation  (MFN)  status  to  the  Soviet  Union. 
However,  the  element  of  increasing  trade  with  the 
Soviet  Union  was  quite  controversial  in  U.S. 
domestic  politics,  especially  after  a  massive  Soviet 
purchase  of  U.S.  grain  at  subsidized  prices  caused 
a  sharp  increase  in  U.S.  domestic  grain  prices  and 
turned  into  a  political  debacle  for  the  Nixon 
administration. 

Detente  during  this  era  also  faced  domestic  and 
international  challenges.  Whereas  Nixon’s  rap¬ 
prochement  with  China  was  most  likely  the  key 
factor  originally  motivating  the  Soviet  Union  to 
pursue  detente,  other  international  events  under¬ 
mined  the  process.  U.S.  involvement  in  Vietnam 
was  a  constant  source  of  friction,  especially  since 
Nixon  and  Kissinger  believed  that  the  Soviet 
Union  could  exert  greater  pressure  on  North 
Vietnam  to  conclude  a  peace  agreement.  The  1973 
Arab-Israeli  War  strained  detente  to  its  limits  by 
heightening  the  risks  of  escalation — the  United 
States  increased  its  nuclear  alert  at  one  point  in  the 
crisis — and  brought  into  question  the  efficacy  of 
the  BPA  in  restraining  superpower  actions. 
Likewise,  the  Soviet  involvement  in  Angola  in 
1975  further  weakened  detente,  led  to  a  cessation 
of  U.S.-Soviet  trade,  and  prompted  President 
Gerald  Ford  to  withdraw  support  for  granting 
MFN  status  to  the  Soviets. 

The  domestic  political  challenges  to  detente  in 
the  United  States  were  no  less  severe.  By  circum¬ 
venting  the  traditional  foreign  policy  bureaucracy 
and  emphasizing  secrecy  and  executive  control,  the 
Nixon-Kissinger  approach  failed  to  build  strong 
institutional  support  for  detente.  This  lack  of 
domestic  support  was  compounded  by  presidential 
electoral  politics  and  congressional  opposition. 
Presidential  politics  led  Nixon  and  Kissinger  to 
overstate  the  diplomatic  successes  of  1972  and 
oversell  the  future  promise  of  detente,  and  the 
Watergate  scandal  only  added  to  Nixon’s  political 
difficulties.  Congressional  opposition,  especially 
from  Senator  Henry  “Scoop”  Jackson 
(D-Washington),  further  complicated  the  overall 
process.  The  Jackson-Vanik  Amendment  of  1974 
is  a  case  in  point.  Underlying  the  entire  issue  of 


domestic  support  for  detente  was  the  worldview  of 
the  average  American  citizen.  As  Paul  Lauren, 
Gordon  Craig,  and  Alexander  George  (2007)  note, 

The  pursuit  of  detente  required  a  more 
sophisticated  mental  and  emotional  analysis  than 
that  of  Cold  War  rivalry.  .  .  .  Members  of 
Congress  and  the  public  often  were  unable  or 
unwilling  to  grasp  the  subtleties  of  a  strategy  that 
combined  both  sticks  of  punishment  and  carrots 
of  incentives,  (pp.  102-103) 

An  example  of  that  is  provided  by  the  Helsinki 
Accords  of  1975,  scoffed  at  by  the  Soviets  in 
tandem  with  elements  of  the  U.S.  public  but  not 
insignificant  in  ending  the  Cold  War. 

The  last  vestiges  of  superpower  detente  perished 
in  the  wake  of  the  1979  Soviet  invasion  of 
Afghanistan.  The  Carter  administration  removed 
the  SALT  II  Treaty  from  Senate  consideration, 
increased  military  spending,  and  initiated  the 
reevaluation  of  U.S.  nuclear  war-fighting  strategy. 
With  the  election  of  Reagan  in  1980,  the  Cold  War 
returned  in  earnest  until  the  Soviet  Union’s  even¬ 
tual  collapse  in  1991. 

Matthew  Gratias  and  Patrick  James 
University  of  Southern  California 
Los  Angeles,  California,  United  States 
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Cleavages  matter  in  all  social  relations.  Given  that 
international  relations  are  a  specific  form  of  social 
relations,  it  is  normal  that  the  role  of  cleavages  is 
taken  seriously.  Cleavages  can  be  geographic, 
political,  cultural,  or  economic.  They  indicate  the 
extent  of  polarization  of  the  world  around  differ¬ 
ent  criteria.  For  instance,  cultural  cleavages  derive 
from  various  differences,  including  religion,  eth¬ 
nicity,  and  nationality.  Political  cleavages  are 
linked  to  differences  in  ideology  or  institutional 
setup,  such  as  authoritarian  regimes  versus  demo¬ 
cratic  ones.  Economic  cleavages  have  to  do  with 
the  level  of  development.  In  international  rela¬ 
tions,  they  are  translated  through  the  antagonism 
between  the  developed  world  (the  Global  North) 
and  the  developing  world  (the  Global  South).  The 
concept  of  the  developing  world  is  an  important 
one  that  has  acquired  an  enduring  meaning  in 
international  relations.  This  entry  examines  the 
concept  of  the  developing  world  through  a  discus¬ 
sion  of  the  cleavage  between  the  North  and  the 
South. 

The  developing  world  cannot  be  understood 
without  reference  to  the  developed  world — they 
form  an  antagonistic  couple.  Each  of  these  worlds 
has  specific  attributes.  It  could  be  said  that  while 
the  developed  world  is  a  world  of  high  economic 
performance  and  standard  of  living,  the  develop¬ 
ing  world  is  a  world  of  relative  or  absolute  depri¬ 
vation.  However,  it  is  important  to  bear  in  mind 
that  the  concept  of  the  developing  world  is  an 
overgeneralization  of  a  complex  reality.  Instead  of 
a  single  developing  world,  there  are  several  such 
worlds.  The  concept  of  the  developing  world  does 


not  refer  to  the  same  reality  in  Asia  and  in  Africa. 
The  level  of  poverty  and  the  lack  of  industrializa¬ 
tion  are  not  the  same.  For  example,  in  1970,  China 
was  viewed  as  an  underdeveloped  country  in  rela¬ 
tion  to  the  average  Northern  developed  country, 
not  underdeveloped  African  countries.  There  is  a 
hierarchy  within  the  developed  world  in  which 
some  countries  of  the  South  are  seen  as  nearer  to 
those  of  the  North.  Moreover,  the  developed 
world  is  not  a  monolithic  world.  This  concept  of 
development  is  dynamic:  One  country  can  be  con¬ 
sidered  part  of  the  developing  world  at  one  time 
and  of  the  developed  world  at  another.  For  exam¬ 
ple,  today,  China  is  viewed  as  much  more  eco¬ 
nomically  powerful  than  it  was  several  decades 
ago.  While  some  Arab  countries  such  as  Libya, 
Saudi  Arabia,  and  Kuwait  are  global  financial 
players  due  to  the  incomes  generated  by  oil,  they 
remain  poorly  industrialized  and  have  a  lower  lit¬ 
eracy  rate  than  other  developed  countries.  Within 
the  developing  world,  countries  can  be  classified  as 
having  relatively  low,  medium,  or  high  levels  of 
development.  Therefore,  it  appears  that  the  con¬ 
cept  of  developing  world  is  elastic,  and  it  can  easily 
be  simplified  when  those  realities  are  under  scru¬ 
tiny.  Thus,  the  notion  of  developing  world  should 
be  considered  as  relative  and  provisional.  Through 
the  mediation  of  international  institutions,  the 
dynamics  of  international  relations  has  progres¬ 
sively  defined  the  notion  of  developing  world.  For 
instance,  the  United  Nations  Development 
Programme  (UNDP)  uses  its  Human  Development 
Index  to  measure  development  in  terms  of  several 
indicators  such  as  life  expectancy  at  birth,  adult 
literacy  rate,  gross  domestic  product  (GDP)  per 
capita,  and  human  poverty.  The  UNDP’s 
Millennium  Development  Goals  are  another  entry¬ 
way  to  understanding  the  developing  world.  The 
goals,  adopted  in  2000  by  many  members  of  the 
international  community,  identify  eight  objectives 
to  be  achieved  by  2015.  For  example,  the  first  goal, 
“to  eradicate  extreme  poverty  and  hunger,” 
includes  the  target  of  reducing  by  one  half  the  pro¬ 
portion  of  people  living  on  less  than  $1  per  day. 
The  UNDP  goals  are  a  way  of  identifying  the  objec¬ 
tive  reality  that  characterizes  developing  countries. 
However,  a  country’s  recognition  and  identifica¬ 
tion  of  itself  as  underdeveloped  also  constitutes  a 
subjective  reality;  the  underdevelopment  status  is 
also  created  and  demanded  by  the  underdeveloped 
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countries.  A  supplementary  illustration  of  this 
point  is  an  African  country  that  seeks  World  Bank 
status  of  “highly  and  poorly  indebted  country”  to 
receive  international  relief. 

From  the  preceding  discussion,  it  appears  that 
the  developing  world  is  an  element  in  international 
relations.  Here,  two  meanings  of  international 
relations  are  to  be  taken  into  consideration:  first, 
the  phenomenon  of  international  relations  in 
which  the  “developing  world”  intervenes  as  an 
actor  and,  second,  the  concept  of  international 
relations  as  a  science  that  has  to  explain  the 
dynamics  of  the  “developing  world.” 

The  Phenomenon  of  International  Relations 
and  the  Developing  World 

Like  all  social  relations,  international  relations  are 
structured  by  the  balance  of  forces.  In  this  context, 
power  is  based  on  the  level  of  development.  The 
developed  world  is  the  dominant  world,  the  world 
of  the  powerful,  while  the  developing  world  is  the 
dominated  world,  the  world  of  the  powerless. 
Thus,  the  developing  world  stands  as  testimony  to 
the  inequality  in  international  relations  and  consti¬ 
tutes  a  specific  field  of  action. 

The  existence  of  an  international  hierarchy  is 
confirmed  by  the  category  of  “developing  world.” 
It  is  the  world  that,  compared  with  the  one  that  is 
developed,  is  considered  backward.  In  contempo¬ 
rary  international  relations,  the  developing  world 
has  often  been  named  the  Third  World.  This  termi¬ 
nology  comes  from  the  period  during  the  French 
Revolution,  when  Third  Estate  referred  to  the 
commoners.  According  to  this  analogy,  the  term 
Third  World  was  used  for  countries  that  were  nei¬ 
ther  Western  capitalist  countries  (First  World)  nor 
the  socialist  countries  of  Eastern  and  Central 
Europe  (Second  World).  The  term  Third  World 
indicates  the  lower  position  occupied  by  countries 
of  Africa,  Asia,  and  Latin  America.  Despite  the 
collapse  of  Marxism  and  the  Second  World,  the 
notion  of  Third  World  is  still  used  to  designate  the 
developing  world.  It  is  the  world  on  which  devel¬ 
oped  countries  wield  their  power  through  various 
means:  economic,  cultural,  political,  or  military. 

Economically,  the  lower  position  is  reflected  by 
the  weakness  of  the  national  currencies,  raw  mate¬ 
rials  dependent  on  the  trade  structure,  high  rate  of 
human  poverty,  or  insufficient  industrialization. 


These  elements,  with  variations,  are  common  to 
developing  countries  and  explain  their  dependence 
on  the  developed  world.  The  inequality  of  develop¬ 
ing  countries  is  well  captured  by  the  Millennium 
Development  Goals:  eradicate  extreme  poverty 
and  hunger,  achieve  universal  primary  education, 
promote  gender  equality  and  empower  women, 
reduce  child  mortality,  and  improve  maternal 
health.  The  fact  that  these  objectives  are  proposed 
as  goals  is  an  indication  of  the  insufficiencies  that 
characterize  the  developing  world.  Therefore,  in 
international  relations,  developing  countries  are  to 
a  large  extent  subordinate  to  developed  countries, 
which  create  explicitly  or  implicitly  international 
clientelism:  Patron  states  are  developed  states;  cli¬ 
ent  states  are  developing  states.  Clientelism  makes 
it  difficult  for  developing  states  to  have  diplomatic 
autonomy  or  to  go  beyond  vocal  affirmation  of 
sovereignty.  However,  the  economic  dependence 
of  the  developing  world  is  not  an  absolute. 
Ideological,  political,  or  religious  parameters  may 
determine  the  diplomatic  demarche  of  a  develop¬ 
ing  state.  For  example,  the  overreliance  of  Egypt 
on  U.S.  economic  aid  has  not  led  to  Egypt  aligning 
with  the  United  States  on  issues  regarding  the 
Middle  East. 

As  a  specific  space  of  action,  the  “developing 
world”  has  at  least  two  meanings.  The  “develop¬ 
ing  world”  emerges  from  the  solidarity  approach 
to  international  relations  used  by  states  and  inter¬ 
national  organizations  in  their  action.  Each  devel¬ 
oped  state  defines  its  south.  For  instance,  while  the 
United  States  had  privileged  South  Asia  and  Latin 
America  during  the  Cold  War  era,  France  was 
organizing  its  aid  policy  with  an  emphasis 
on  Africa.  International  organizations  such  as 
the  World  Bank  or  UNDP  elaborate  quantitative 
criteria  to  measure  underdevelopment  and  then 
rationalize  the  contribution  of  the  international 
community.  In  this  regard,  the  “developing  world” 
is  the  space  filled  by  world  generosity.  The  “devel¬ 
oping  world”  has  influenced  the  structuring  of 
international  relations  through  the  agencies,  funds, 
and  programs  aimed  at  the  development  of  Third 
World  countries.  Inasmuch  as  the  “developing 
world”  induces  the  creation  of  development  orga¬ 
nizations,  international  institutions  crystallize  the 
existence  of  the  “developing  world.” 

The  “developing  world”  has  also  emerged  as  a 
distinct  language  of  international  relations.  The 
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“developing  world”  was  the  world  politically  cre¬ 
ated  by  peoples  of  Africa  and  Asia  meeting  in 
Bandung  in  1955;  the  Bandung  conference  was  the 
beginning  of  the  collective  claim  of  Third  World 
countries  for  decolonization  and  development. 
From  this  conference,  a  collective  consciousness  of 
belonging  to  the  underdeveloped  world  and  a  clear 
resolution  to  change  international  relations  arose. 
For  this  reason,  the  trademark  of  developing  coun¬ 
tries  in  international  relations  has  been  (and  still  is) 
the  denunciation  of  the  international  order.  In  the 
same  way  that  Westphalia  constitutes  a  mythical 
reference  for  the  foundation  of  European  public 
order,  Bandung  is  a  reference  in  international  rela¬ 
tions.  Its  revolutionary  dimension  should  not  be 
minimized.  Even  though  notions  of  equality  and 
noninterference  in  internal  affairs  were  already 
part  of  the  language  of  international  relations, 
these  notions  were  applied  only  within  the 
European  world.  They  did  not  guide  relations  with 
non-European  states.  That  is  why  European  public 
law  has  not  outlawed  colonization  or  imperialism. 
The  Bandung  conference  was  a  Third  World 
appropriation  of  principles  of  European  public  law 
and  the  United  Nations  Charter.  The  creation  of 
the  Non-Aligned  Movement  in  1961  in  Belgrade 
was  part  of  the  same  dynamics.  The  developing 
world  is  a  world  in  rebellion  against  the  interna¬ 
tional  order:  the  Economic  International  Order 
and  the  Information  and  Communication  World 
Order. 

The  clash  between  the  “developing  world”  and 
the  “developed  world”  is  relative.  There  are  alli¬ 
ances  that  transcend  cleavages.  The  French  aid 
policy  toward  its  former  colonies  of  sub-Saharan 
Africa  had  been  an  instrument  of  political  control. 
In  return,  a  small  developing  country  such  as 
Gabon  has  influenced  France  through  oil  exporta¬ 
tion.  The  Non-Aligned  Movement  as  a  way  to 
institutionalize  the  ideological  autonomy  of  devel¬ 
oping  countries  was  counterbalanced  by  the 
Marxist  orientations  of  its  main  members  such  as 
Cuba  and  Ghana.  Therefore,  the  “developing 
world”  does  not  refer  to  an  autonomous  entity. 
There  is  even  internal  rivalry  due  to  external  inter¬ 
ference  or  local  stakes.  The  rivalry  between  the 
presidents  of  Cote  d’Ivoire  and  Guinea  during  the 
1960s  cannot  be  explained  only  by  the  dependence 
of  Felix  Houphouet-Boigny  of  Cote  d’Ivoire  on 
France  and  the  rebellion  of  Sekou  Toure  of  Guinea. 


The  regional  leadership  in  French-speaking  Africa, 
which  concerns  states  and  individuals,  was  also  at 
stake.  In  summary,  economic  cleavages  are  not 
sufficient  conditions  for  the  formation  of  two 
fighting  communities:  “them”  and  “us.” 

However,  to  some  extent,  developing  countries 
have  succeeded  in  constructing  the  “North”  as  the 
enemy  of  the  “South”  through  a  mobilization  of 
the  history  of  colonization,  the  systematization  of 
neocolonialism,  and  the  explanation  of  underde¬ 
velopment  by  exogenous  factors.  Thus,  the  recog¬ 
nition  of  the  fluidity  of  each  pole  and  the  relativity 
of  the  cleavage  between  the  “North”  and  the 
“South”  should  not  lead  to  the  denial  of  this  oppo¬ 
sition.  If  a  misunderstanding  does  exist,  it  is  a 
workable  one.  In  international  relations,  the  devel¬ 
oping  world  has  been  institutionalized  as  a  cate¬ 
gory  of  vision,  division,  and  action.  It  is  a  basis  of 
solidarities  of  both  resemblance  and  interests:  soli¬ 
darity  of  resemblance  through  the  regrouping  of 
developing  states  in  circles  such  as  the  Group  of  77 
and  solidarity  of  interests  due  to  the  relative  simi¬ 
larity  of  the  economic  problems  of  developing 
states,  as  exemplified  in  the  dialogue  between  the 
European  Union  (EU)  and  the  African,  Caribbean, 
and  Pacific  Group  of  States.  It  is  not  just  chance 
that  despite  the  differences  that  exist  between  the 
developing  states  of  Latin  America,  their  respective 
national  and  regional  diplomatic  agendas  are 
dominated  by  the  theme  of  international  democra¬ 
tization,  which  is  in  fact  a  euphemism  for  the 
decolonization  of  international  relations. 

The  Concept  of  International  Relations 
and  the  Developing  World 

The  concept  of  international  relations  refers  to  the 
science  of  the  specific  social  relations  characterized 
as  international  relations.  As  a  body  of  organized 
knowledge,  the  science  of  international  relations 
does  not  explain  the  situation  and  the  role  of  the 
states  in  the  same  way.  The  science  of  international 
relations  does  not  just  involve  general  knowledge 
but  also  particular  knowledge  of  given  situations. 
Therefore,  the  science  of  international  relations  as 
related  to  the  developing  world  involves  tension 
between  universalism  and  relativism.  As  with 
every  science,  international  relations  has  general 
concepts  such  as  those  of  the  state,  sovereignty, 
and  power  that  are  applied  to  the  whole  world 
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regardless  of  the  level  of  development.  States  in  the 
North  and  the  South  are  both  discussed  with  terri¬ 
tory,  population,  and  government  as  bases.  Such  a 
generalization  is  a  lazy  one.  Territory  does  not 
have  the  same  meaning  in  different  political  and 
cultural  contexts.  The  dialectics  of  unique  and 
multiple  meaning(s)  are  useful.  The  globalization 
of  the  state  is  paired  with  its  vernacularization. 
Therefore,  its  understanding  has  to  be  both  on 
global  and  local  levels.  A  general  science  of  inter¬ 
national  relations  is  to  some  extent  an  epistemo¬ 
logical  obstacle  for  the  study  of  the  developing 
world,  because  it  blurs  specificities  and  differences. 
It  is  in  this  context  of  local  knowledge  that  a  fertile 
science  of  international  relations  is  a  particular  sci¬ 
ence,  that  is,  a  science  that  opts  for  the  relativity  of 
general  categories  and  concepts.  In  this  regard,  it 
must  be  acknowledged  that  some  paradigms  of 
international  relations  possess  a  real  heuristic 
power  to  study  the  role  and  state  of  the  developing 
world.  However,  while  universal  concepts  may  be 
useful  in  some  contexts,  it  is  also  important  to 
recognize  that  seemingly  straightforward  concepts 
such  as  state  and  sovereignty  can  mask  inequalities 
and  impede  the  development  of  knowledge  about 
the  developing  world,  which  requires  local  as  well 
as  global  knowledge. 

The  status  of  the  underdeveloped  world  in 
international  relations  gave  birth  to  a  specific 
theorization,  and  under  this  rubric  some  theories 
of  development  and  underdevelopment,  as  Ronald 
H.  Chilcote  (1981)  pointed  out,  can  be  located. 
The  dominant  explanatory  paradigm  of  the  devel¬ 
oping  world  in  international  relations  is  Marxist 
oriented  even  though  societies  in  Africa,  Asia, 
and  Latin  America  were  not  the  main  concern  of 
Karl  Marx’s  thought;  the  developing  world  is  an 
interesting  terrain  for  critical  international  rela¬ 
tions  theories  and  more  precisely  for  the  use  of 
Marx’s  writings.  Three  main  themes  are  taken 
into  consideration  by  the  Marxist-oriented 
approach:  (1)  underdevelopment,  (2)  dependency, 
and  (3)  imperialism. 

According  to  the  interpretation  of  the  Marxist- 
oriented  approach  presented  by  Samir  Amin  (1974, 
1976),  Walter  Rodney  (1972),  and  Andre  G. 
Frank  (1966),  the  formation  of  an  underdeveloped 
world  can  only  be  understood  in  the  context  of 
international  relations  where  capitalism  evolves  as 


a  process  of  relationship  between  unequal  part¬ 
ners.  The  underdeveloped  world  is  the  product  of 
the  development  of  the  capitalist  mode  of  produc¬ 
tion  at  the  periphery.  Therefore,  the  relations 
between  developed  countries  and  underdeveloped 
ones  are  like  metropole-satellite  relations.  This 
school  of  thought  perceives  underdevelopment  as 
related  to  capitalist,  colonialist,  and  imperialist 
exploitation.  From  this  perspective,  the  notion  of 
the  developing  world  is  meaningless;  only  that  of 
an  underdeveloped  world  matters,  because  Third 
World  countries  cannot  be  liberated  from  their 
backward  status.  Underdeveloped  countries  and 
developed  ones  are  perceived  as  completely  differ¬ 
ent  in  international  relations:  The  development  is 
unequal  because  underdeveloped  countries  are 
bound  to  organize  their  economies  according  to 
the  needs  of  the  dominant  ones. 

Dependency  of  peripheral  states  on  central  ones 
is  one  of  the  main  characteristics  of  the  interna¬ 
tional  relations  structure  (Fernando  Enrique 
Cardoso,  1973).  The  argument  of  the  dependence 
of  the  underdeveloped  world  is  based  on  several 
elements  among  which  are  internal  colonialism, 
which  is  the  domination  of  the  economy  by  the 
metropolis,  and  the  extraversion  toward  central 
states  of  the  economy. 

Imperialism  is  proposed  as  an  explanation  for 
the  actions  of  some  dominant  nations  in  the  world. 
The  independence  of  the  underdeveloped  states  is 
seen  as  formal.  The  actual  reality,  however, 
involves  the  concept  of  neocolonialism.  Inspired 
by  Vladimir  Lenin,  who  considered  imperialism  to 
be  the  highest  stage  of  capitalism,  Kwame  Nkrumah 
assumed  neocolonialism  to  be  the  supreme  stage  of 
imperialism. 

As  briefly  summarized,  the  Marxist-oriented 
approach  to  international  relations  positions  the 
developing  world  in  a  specific  way  in  international 
relations.  Contrary  to  positivist  approaches  such  as 
realism,  which  take  the  position  of  underdeveloped 
states  as  given,  the  Marxist-oriented  approach  his- 
toricizes  the  weakness  of  those  states:  Where  does 
underdevelopment  come  from?  Under  this  theoreti¬ 
cal  prism,  the  classical  concepts  of  states  and  sover¬ 
eignty  appear  as  variables.  States  and  sovereignty 
do  not  have  the  same  meaning  in  metropolitan 
states  (colonizers)  and  satellite  states  (the  colonies). 
It  is  clear  that  the  sovereignty  of  the  latter  is  more 
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normative  than  that  of  the  former.  However,  the 
Marxist  critique  of  international  relations  is  to  be 
nuanced  in  light  of  the  experience  of  the  underde¬ 
veloped  states.  First,  the  fiction  of  sovereignty  and 
equality  among  states,  be  they  from  the  North  or 
the  South,  has  led  to  a  positive  reality  of  decoloni¬ 
zation  (often  not  the  product  of  the  colonial  pow¬ 
er’s  goodwill)  and  of  the  ability  of  weak  southern 
states  to  oppose  strong  northern  states  through 
diplomatic  engagement  made  possible  by  the  legal 
and  formal  superstructure  of  international  rela¬ 
tions  that  can  affect  the  balance  of  forces.  Second, 
the  excessive  focus  on  the  dependence  dimension  of 
the  relationship  between  the  North  and  the  South 
hides  the  complex  reality  of  the  interdependence 
that  leads  to  compromise  or  a  variable  sum  game. 
Third,  the  presentation  of  the  underdeveloped 
world  as  clay  in  the  hands  of  the  developed  world 
is  far  from  the  reality  of  the  relative  autonomy  of 
Third  World  leaders  and  their  capacity  to  contract 
alliances  according  to  their  interests. 

The  classical  realist  paradigm  of  international 
relations  has  indirect  consequences  in  the  explana¬ 
tion  of  the  underdeveloped  world.  It  is  known 
that,  historically,  realism  is  elaborated  from  the 
experiences  of  major  Western  states.  It  has  not 
been  easy  for  classical  realism  to  apprehend  the 
underdeveloped  states  as  they  are.  During  the  Cold 
War  period,  these  states  were  seen  under  the  prism 
and  of  their  relations  with  the  two  superpowers. 
Their  relative  autonomy  was  ignored.  However, 
under  specific  conditions,  realism  can  help  under¬ 
stand  the  underdeveloped  world  in  international 
relations. 

Classical  realism  demystifies  the  idea  of  the 
“underdeveloped  world”  as  an  actor  in  interna¬ 
tional  relations.  The  unit  of  analysis  and  level  of 
observation  are  constituted  by  the  “state.”  The 
notion  of  “underdeveloped  world”  is  de-globalized 
for  the  benefit  of  underdeveloped  states.  These 
states  are  recognized  as  being  different,  with  diver¬ 
gent  or  convergent  national  interests  and  with 
peaceful  or  conflict-ridden  relations.  Based  on  the 
universal  concepts  of  state,  power,  and  sover¬ 
eignty,  classical  realism  can  then  contribute  to  the 
de-provincialization  of  underdeveloped  states  in 
international  relations.  They  are  states  like  others. 
This  does  not  mean  that  classical  realism  disre¬ 
gards  absolutely  sets  of  states  and  recognizes  only 


peculiar  states.  Sets  of  states  are  considered  when 
they  are  political,  economic,  or  military  alliances, 
for  instance;  an  alliance  with  the  underdeveloped 
world  is  not  an  alliance  to  be  taken  for  granted: 
There  are  often  alliances  between  Northern  states 
and  Southern  states  or  among  some  Southern 
states  that  crystallize  the  heterogeneity  of  the 
underdeveloped  world.  At  the  same  time  that  real¬ 
ism  prevents  the  illusion  of  spontaneous  formation 
of  an  alliance,  it  destroys  the  illusion  of  the  similar¬ 
ity  of  underdeveloped  states:  Like  other  states, 
underdeveloped  states  have  distinct  resources. 
Those  resources  determine  their  power  position. 
Among  underdeveloped  states,  there  are  some 
states  that  are  weaker  than  the  others.  Therefore, 
the  realist  image  of  a  hierarchy  of  states  is  still 
meaningful. 

The  redefinition  of  realism  under  the  traits  of 
“ethnorealism”  (Paul  Roe,  1999)  or  “subaltern 
realism”  (Mohammed  Ayoob,  1998)  constitutes 
operating  attempts  to  adapt  realism  to  the  specific¬ 
ity  of  Third  World  states.  Both  adaptations  of  clas¬ 
sical  realism  aim  at  shedding  more  light  on  the 
intrastate  dimension  of  international  relations. 
Linked  to  the  experience  of  Western  states  where 
war  and  peace  are  supposed  to  depend  on  inter¬ 
state  relations,  classical  realism  does  not  under¬ 
stand  the  dynamics  of  internal  conflicts  that  are 
preponderant  in  the  underdeveloped  world.  To 
cross  the  boundary  that  stops  classical  realism 
from  analyzing  the  intrastate  level,  ethnorealism 
and  subaltern  realism  were  proposed.  The  concep¬ 
tual  basis  of  international  relations  is  to  be  rein¬ 
vented  from  this  perspective.  Initially,  the  great 
divide  is  legitimated  by  the  belief  that  “inside”  is 
the  place  of  order — peace — while  the  “outside”  is 
dominated  by  disorder.  In  several  states  of  the 
underdeveloped  world,  internal  life  is  character¬ 
ized  by  the  intensification  of  conflicts.  In  Africa, 
there  are  more  intrastate  wars  than  interstate  ones 
(Luc  Sindjoun,  2002).  Thus,  it  could  be  said  that 
the  study  of  the  underdeveloped  world  implies  a 
reformulation  of  classical  realism. 

The  situation  of  the  underdeveloped  world  in 
international  relations  has  been  a  pretext  for  the 
denunciation  of  international  relations  theory.  In 
fact,  this  is  based  on  the  relationship  that  is  sup¬ 
posed  to  exist  between  indigenous  culture  and 
Western  domination  (Homi  Bhabha,  2007).  An 
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epistemology  of  the  dominant  international  rela¬ 
tions  knowledge  shows  that  there  is  an  interaction 
between  realism  and  imperialism.  That  is  the  reason 
for  the  call  for  a  postcolonial  theory  of  interna¬ 
tional  relations. 

Luc  Sindjoun 
University  of  Yaounde  II 
Yaounde,  Cameroon 
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Political  development  gained  recognition  as  a  sub¬ 
field  of  comparative  politics  in  the  1960s.  This 
entry  describes  its  roots  in  modernization  theory 
and  dependency  theory,  examines  broad  themes  in 
its  subsequent  evolution,  and  concludes  with  a 
look  at  the  role  of  area  studies  in  the  discipline 
today. 

Prior  to  the  1960s,  comparative  politics  had 
tended  to  focus  on  the  forms  of  government  that 
prevailed  in  the  advanced  industrial  world,  that  is, 
Europe  and  the  former  Soviet  Union.  But  with  the 
break  up  of  Europe’s  empires  following  World 
War  II  and  the  entry  of  numerous  postcolonial 
states  into  the  global  political  system,  an  increas¬ 
ing  number  of  scholars  chose  instead  to  focus  on 
what  became  known  as  the  developing  world. 
Figure  1  shows  the  age  composition  of  the  states 
that  became  independent  in  different  periods  and 
captures  the  size  of  the  new  group  of  nations  in  the 
past  66  years:  120  out  of  194,  that  is,  almost  two 
thirds. 

The  study  of  political  development  was  fore¬ 
shadowed  by  studies  of  nationalism.  Initially,  schol¬ 
ars  treated  the  subject  as  a  branch  of  contemporary 
history:  The  rise  of  nationalism  in  the  postwar 
world  seemed  to  resemble  the  rise  of  nationalism  in 
Europe,  be  it  in  19th-century  Germany,  Greece,  or 
Italy,  or  in  Central  and  Eastern  Europe  following 
the  conference  in  Versailles.  But  many  soon  recog¬ 
nized  that  politics  in  the  newly  developed  nations 
differed  from  that  in  the  developed.  They  found  it 
necessary  to  address  new  themes:  economic  growth, 
the  rise  of  industry,  and  the  marginalization  of  agri¬ 
culture  as  well  as  the  politics  that  accompanied 
these  transformations. 

While  diverse  intellectually,  those  making  early 
contributions  to  the  field  tended  to  subscribe  to 
what  became  known  as  “modernization”  theory. 
Modernization  theory  constituted  a  claim  that  the 
breakdown  of  the  previous  agrarian  and  tradi¬ 
tional  order  made  people  available  for  political 
mobilization.  Increasing  levels  of  urbanization, 
literacy,  and  income,  its  proponents  argued,  bore  a 
close  relationship  with  the  rise  of  political  partici¬ 
pation.  They  posited  sociological  and  social  psy¬ 
chological  mechanisms — the  desire  for  community 
and  the  impact  of  anomie,  for  example — to  link 
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Figure  I  Date  of  Independence:  Four  Periods 


economic  and  social  change  to  the  growth  of  mass 
movements.  Drawing  on  the  work  of  area  special¬ 
ists,  several  stressed  the  manner  in  which  national¬ 
ist  politicians  harnessed  the  political  energies  of 
voluntary  associations  and  in  particular  groups 
formed  in  town  by  recent  migrants  from  the  coun¬ 
tryside.  Rather  than  being  highly  organized  and 
disciplined  electoral  machines,  some  nationalist 
parties,  these  specialists  found,  were  but  loose 
assemblages  of  these  organizations — something 
that  lent  credibility  to  the  reasoning  of  the  mod¬ 
ernization  theorists. 

The  following  is  a  key  question  faced  by  stu¬ 
dents  of  the  developing  nations:  What  kind  of 
polities  would  emerge  in  the  developing  world? 
Given  the  bipolar  structure  of  power  in  the  post¬ 
war  era,  this  question  was  equivalent  to  asking 
whether  these  nations  would  favor  democracy  or 
totalitarianism.  When  Seymour  Martin  Lipset 
reported  a  systematic  relationship  between  the 
level  of  per  capita  income  and  the  level  of  democ¬ 
racy  in  a  cross-sectional  regression  of  nation¬ 
states,  the  answer  seemed  clear.  Given  that  higher 
levels  of  urbanization  and  literacy  covary  with 
higher  levels  of  income  and  that — as  Lipset  found — 
higher  levels  of  income  correlate  with  democracy, 
then  the  developing  world,  as  it  modernized,  was 
destined  to  be  democratic. 

This  claim  soon  became  the  focal  point  of  heated 
debate,  however,  one  that  brought  to  the  fore  an 
alternative  form  of  political  analysis.  In  the  midst  of 
the  recession  and  debt  crisis  of  the  1970s  and 
1980s,  military  governments  overthrew  democratic 


regimes  in  countries  such  as  Brazil,  Argentina,  and 
Chile.  As  these  nations  numbered  among  the  most 
prosperous  in  the  developing  world,  when  these 
regimes  closed  newspapers,  banned  political  par¬ 
ties,  and  jailed  or  assassinated  political  opponents, 
they  made  a  mockery  of  the  claim  that  democracy 
covaried  with  development.  Rather,  their  behavior 
lent  credence  to  what  became  known  as  “depen¬ 
dency  theory.  ” 

Modernization  theory  traced  its  intellectual 
roots  to  the  writings  of  Max  Weber,  Emile 
Durkheim,  and  other  sociologists  (e.g.,  Ferdinand 
Tonnies)  who  studied  the  emergence  of  Europe  as 
a  modern  society.  The  roots  of  dependency  theory 
lay  instead  in  the  writings  of  Karl  Marx,  who 
focused  on  the  economic  forces  that  propelled 
Europe’s  19th-century  transformation.  For  impor¬ 
tant  reasons,  however,  the  dependency  theorists 
did  not  draw  on  Marx  himself  but  rather  on  his 
followers,  in  particular,  Vladimir  I.  Lenin. 

As  Lenin  recognized,  the  predictions  of  Marx 
had  been  falsified.  In  modern  industry,  wages  of 
employees  were  rising;  employers  such  as  Ford 
found  that  by  paying  higher  wages,  they  could  sell 
more  products.  As  a  result,  it  was  becoming 
increasingly  difficult  to  write  of  the  immiseration 
of  the  working  class.  Nor  could  observers  find 
evidence  of  political  conflict  between  the  bourgeoi¬ 
sie  and  the  proletariat.  Thus,  Lenin’s  acknowledg¬ 
ment  that  at  the  outbreak  of  World  War  I,  in  each 
major  European  nation,  socialist  parties  had  joined 
bourgeois  parties  in  voting  to  finance  their  militar¬ 
ies.  They  were  even  prepared  to  kill  workers  if  they 
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wore  the  uniforms  of  another  nation.  The  proletar¬ 
ian  parties  had  acted  as  if  they  had  more  interests 
in  common  with  their  own  nation’s  bourgeoisie 
than  with  another  nation’s  working  class. 

Faced  with  these  realities,  Lenin  turned  to  a  key 
question:  If  the  bourgeoisie  were  not  exploiting 
workers  in  the  advanced  industrial  countries,  then 
from  where  did  they  draw  their  capital?  The 
answer,  he  responded,  was  from  abroad.  Through 
multinational  firms,  international  bankers,  and 
global  markets,  the  rich  nations  of  Europe  were 
able  to  extract  resources  from  the  poor  nations  of 
the  periphery.  Rather  than  taking  place  in  the 
advanced  industrial  economies,  the  extraction  of 
surplus  took  place  at  the  global  level. 

For  the  dependency  theorists,  then,  the  fall  of 
elected  governments  in  the  most  prosperous 
nations  in  the  developing  world  was  not  paradoxi¬ 
cal.  For  the  purpose  of  governments  in  the  devel¬ 
oping  world,  they  believed,  was  not  to  serve  the 
interests  of  their  own  people  but  rather  the  inter¬ 
ests  of  investors  from  abroad.  That  the  military 
governments  repressed  labor,  destroyed  trade 
unions,  and  jailed — or  killed — socialist  agitators 
was  what  was  to  be  expected  of  any  government 
occupying  their  position  in  the  global  system. 

Lending  further  credence  to  the  Leninist  inter¬ 
pretation  was  the  war  in  Vietnam.  In  the  United 
States,  workers — the  so-called  hard  hats,  in  the 
argot  of  the  time — supported  the  war;  and  in 
Vietnam,  a  peasant-based  army  fought  the  army  of 
a  capitalist  nation.  Political  forces  thus  aligned  in 
a  manner  redolent  of  Lenin’s  analyses.  In  addition, 
the  war  reminded  scholars  of  Barrington  Moore’s 
claim  that  all  modern  revolutions  were  based  on 
the  peasantry  rather  than  the  proletariat — some¬ 
thing  that  would  follow  from  the  reasoning  of  the 
dependency  school. 

Despite  its  initial  appeal,  dependency  analysis 
fell  victim  to  discordant  facts.  One  was  the  rise  of 
the  industrialized  economies  in  regions  that  for¬ 
merly  had  been  underdeveloped,  most  notably  in 
Asia.  Not  only  did  these  economies  grow,  and  their 
nations  escape  the  ranks  of  the  underdeveloped, 
but  also  their  growth  was  spurred  by  foreign 
trade — an  instrument  of  immiseration,  in  most 
Marxian  accounts.  Also  significant  was  the  so- 
called  third  wave  of  democratization  that  culmi¬ 
nated  in  the  fall  of  the  Soviet  Union.  With  Russia 
reforming  politically;  Chile,  Brazil,  and  Argentina 


restored  to  the  ranks  of  democracies;  and  several  of 
the  newly  industrialized  nations  of  Asia  choosing 
their  rulers  in  contested  elections,  countries  that 
had  lain  off  Lipset’s  regression  line  snapped  back 
on  to  it.  Modernization  theory  appeared  redeemed. 

While  Marxist  political  analysis  may  have  lost 
its  appeal,  scholars  continue  to  explore  its  central 
concerns:  the  impact  of  global  trade,  international 
finance,  and  the  multinational  structure  of  firms. 
And  while  democratic  political  forces  may  have 
displaced  military  and  authoritarian  regimes,  many 
of  the  new  democracies  exhibit  authoritarian  fea¬ 
tures:  legislatures  with  but  token  powers,  limits  on 
association  and  expression,  and  the  maintenance  in 
power  of  incumbent  leaders  or  political  parties.  The 
result  is  a  rapidly  growing  literature  on  “partial 
democracies.”  Contemporaneous  with  the  spread 
of  democracy  in  the  developing  world,  moreover, 
was  the  growth  of  civil  conflict,  which  leads  to 
copious  research  into  the  causes  of  civil  war  and 
state  failure  in  the  developing  world.  As  ethnicity 
and  religion  have  played  a  major  role  in  these  con¬ 
flicts,  attention  has  refocused  on  the  cultural  roots 
of  politics  in  the  developing  world.  While  each  of 
these  themes  figured  prominently  in  the  early  litera¬ 
ture,  scholars  now  approach  them  in  new  ways. 
Although  the  importance  of  cultural  forces  had 
long  been  recognized,  for  example,  scholars  now 
seek  to  find  how,  and  under  what  conditions,  they 
affect  political  behavior  and  collect  data  and  design 
experiments  to  test  their  claims. 

The  Broad  Themes 

The  decline  of  dependency  theory  left  the  field  of 
development  bereft  of  a  “master  narrative.”  While 
accepting  many  of  the  basic  empirical  findings  of 
modernization  theory,  few  scholars  were  inclined  to 
subscribe  to  its  tacit  endorsement  of  the  cultural, 
economic,  or  political  triumph  of  the  West.  Their 
demurral  became  more  pronounced  following  the 
late-century  rise  of  a  prosperous  and  powerful  Asia. 
Rather  than  achieving  a  theoretical  consensus,  then, 
the  field  is  marked  by  themes,  which  in  turn  lead  to 
a  redefinition  of  disciplinary  boundaries. 

Political  Economy 

Modernization  and  dependency  theory 
focused  on  the  political  impact  of  economic 
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change,  linking  (variously)  democracy  or  authori¬ 
tarianism  to  capital  accumulation  and  economic 
growth.  More  recently,  scholars  have  focused  on 
the  impact  of  politics  on  the  performance  of  the 
economy.  Some  have  looked  at  the  role  of  interest 
groups  and  the  way  in  which  they  shape  policies 
toward  trade,  others  on  the  regulation  of  markets 
and  the  manner  in  which  they  are  transformed 
into  political  machines,  and  still  others  on  the 
way  in  which  the  movement  to  democratic  forms 
of  government  alter  policy  decisions,  inducing  or 
failing  to  induce  them  to  employ  public  goods 
rather  than  private  transfers  in  search  of  office. 
These  lines  of  research  have  brought  students  of 
development  into  close  contact  with  political  sci¬ 
entists  in  the  field  of  international  political  econ¬ 
omy  and  economists  studying  politics. 

Security  Studies 

Those  studying  development  have  also  forged 
closer  ties  with  those  in  the  field  of  security.  In 
recent  decades,  the  locus  of  military  security  has 
shifted  from  the  international  to  the  domestic 
level,  with  far  more  deaths  resulting  from  civil 
conflict  than  from  wars.  For  years,  students  of 
international  conflict  have  been  able  to  apply  geo¬ 
political  reasoning  to  “the  state,”  viewing  the  lat¬ 
ter  as  a  unitary  actor;  the  latter  assumption  is  no 
longer  tenable  when  studying  conflicts  at  the  sub¬ 
national  level.  Just  as  students  of  development  gain 
insights  into  the  use  of  military  power  from  their 
colleagues  in  security  studies,  those  in  security 
studies  now  find  themselves  benefiting  from  the 
insights  of  those  who  study  the  domestic  politics  of 
developing  countries.  Driven  by  the  importance  of 
this  research  theme,  the  subfields  have  come  into 
greater  contact. 

History 

While  thus  probing  deeply  into  the  politics  of  the 
developing  world,  scholars  are  simultaneously 
viewing  it  from  a  more  detached  vantage  point. 
The  recent  rise  of  Asia  reminds  many  that  not  so 
long  ago,  it  was  Asia  that  constituted  the  core 
while  the  West  languished  in  the  periphery.  Inspired 
by  this  realization,  students  of  development 
increasingly  work  in  concert  with  students  of  eco¬ 
nomic  history.  Some  return  to  the  study  of  Western 


imperialism  and  the  manner  in  which  it,  by  top¬ 
pling  the  nations  of  Asia,  generated  a  reversal  of 
economic  fortunes  on  the  one  hand  while  laying 
the  foundations  for  subsequent  and  enduring 
inequality  on  the  other.  Others  look  for  factors, 
such  as  institutions,  that  can  account  for  both  dif¬ 
ferences  in  economic  performance  over  time  and 
across  regions.  Many  look  for  lessons  from  his¬ 
tory  that  might  be  applied  to  the  developing 
world,  focusing  in  particular  on  the  origins  of 
democracy  and  the  role  of  warfare  in  rendering 
states  “developmental.” 

Area  Studies  and  the  Discipline 

A  last  major  topic  should  be  addressed:  the  rela¬ 
tionship  between  those  doing  area  studies  and 
those  who  “do”  social  science,  making  extensive 
use  of  formal  theory  and  empirical  methods. 
Rather  than  treating  the  approaches  as  rivals,  as 
has  been  the  case  in  the  past,  scholars  are  now 
more  inclined  to  treat  them  as  complements. 
Drawn  into  political  science  because  of  their  love 
of  a  region  and  their  commitment  to  its  develop¬ 
ment,  scholars  are  increasingly  inclined  to  combine 
a  deep  understanding  of  its  politics  and  culture, 
derived  from  field  research  and  the  use  of  ethno¬ 
graphic  methods,  with  the  use  of  formal  theory, 
statistical  analysis,  and  experimental  methods.  The 
clash  of  cultures  that  once  marked  the  field  now 
appears  muted;  area  studies  are  now  firmly  based 
within,  rather  than  in  opposition  to,  the  discipline. 

Robert  H.  Bates 
Harvard  University 
Cambridge,  Massachusetts,  United  States 
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Development  Administration 


Development  administration  refers  to  a  form  of 
public  administration  meant  to  be  suitable  for 
developing  countries.  The  concept  gained  wide¬ 
spread  recognition  in  the  1960s,  coinciding  with 
the  emphasis  at  the  time  on  state-led  development. 
Development  was  something  that  needed  to  be 
centrally  managed.  The  concept  has  since  lost 
much  of  its  initial  value,  and  what  many  expected 
to  be  a  new  field  of  inquiry  has  never  materialized. 
Instead,  the  concept  has  become  subsumed  under 
others  such  as  development  management  and 
governance.  This  entry  traces  the  history  of  and 
contending  perspectives  on  the  concept  before  con¬ 
cluding  with  a  discussion  of  its  demise  and  gradual 
inclusion  into  rivaling  intellectual  traditions. 

History 

Development  administration  emerged  as  a  signifi¬ 
cant  concept  in  the  field  of  public  administration 
in  the  late  1950s  and  early  1960s.  Its  promotion 
came  from  three  distinct  sources. 

First,  progressive  civil  servants  who  realized 
that  the  rigid  system  of  administration  that  they 
associated  with  colonial  rule  would  not  work  once 
these  countries  in  Africa  and  Asia  were  indepen¬ 
dent  constituted  one  constituency  in  favor  of  a 
more  developmental  form  of  administration. 
Several  of  these  civil  servants  who  themselves  had 
personal  experience  of  working  in  the  colonial 


administration  shifted  to  becoming  international 
civil  servants  working  on  technical  assistance  con¬ 
tracts  in  these  countries. 

A  second  group  was  made  up  of  philanthropic 
foundation  officers  with  an  interest  in  facilitating 
the  development  of  new  states  in  Africa  and  Asia. 
Among  U.S.  entities,  the  Ford  and  Rockefeller 
Foundations  came  to  play  an  especially  significant 
role.  The  former  was  very  active  in  India  and 
Pakistan  as  well  as  in  Anglophone  African  coun¬ 
tries.  Support  by  these  foundations  was  critical  not 
only  for  improvement  in  administrative  practices 
but  also  for  experimenting  with  new  theories  of 
administration.  The  Ford  Foundation-funded 
African  Association  for  Public  Administration  and 
Management  (AAPAM)  played  a  particularly 
important  role  in  fulfilling  this  dual  objective  in 
the  1960s  and  1970s. 

The  third  group  consisted  of  academics  with  an 
interest  in  making  a  contribution  to  the  cause  of 
development  by  inventing  new  theories  of  admin¬ 
istration  that  would  be  applicable  in  particular  to 
the  new  states  in  the  developing  regions  of  the 
world.  These  scholars  were  largely  from  developed 
countries  such  as  Australia,  the  United  Kingdom, 
and  the  United  States.  This  group  eventually  grew 
to  include  scholars  from  developing  countries. 
Onkar  P.  Dwivedi  from  India,  Jorge  Nef  from 
Brazil,  and  Dele  Olowu  from  Nigeria  are  among 
the  most  significant  contributors. 

The  enthusiasm  with  which  the  concept  was 
met  in  the  1960s  lasted  into  the  1970s  but  began 
to  wane  subsequently  for  two  main  reasons.  The 
first  was  the  shift  in  intellectual  paradigm  from  a 
state-led  to  a  market-inspired  approach  to  devel¬ 
opment.  Reforms  in  public  administration  became 
less  urgent  than  similar  actions  in  the  economy. 
The  second  was  the  dearth  of  achievements  in 
development  administration.  The  theorists  had 
largely  failed  to  translate  their  ideas  into  practical 
action.  What  had  been  presented  as  a  promising 
new  academic  field  with  practical  value  for  devel¬ 
oping  countries  had  little  to  show  for  itself  by  the 
1980s.  What  happened  thereafter  was  an  attempt 
to  redefine  reform  in  broader  terms  than  had  been 
the  case  in  the  previous  2  decades. 

Contending  Perspectives 

There  has  never  been  a  single  and  coherent  theory 
guiding  the  field  of  development  administration.  It 
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failed  to  develop  beyond  a  preparadigmatic  stage. 
There  were  four  main  theoretical  perspectives  that 
competed  for  domination  of  the  field,  although 
none  ever  reached  that  point. 

The  first  theoretical  effort  by  Victor  Thompson 
drew  its  inspiration  from  an  earlier  study  of  enter¬ 
prise  management  in  the  United  Kingdom.  The 
authors  of  this  study  made  a  distinction  between 
two  forms  of  management — mechanistic  and 
organic.  The  former  would  coincide  with  a  classi¬ 
cal  bureaucratic  form  in  which  authority  follows 
hierarchical  lines,  roles  are  clearly  defined  and 
rules  strictly  enforced.  The  latter  relies  on  a  more 
flexible  form  of  organization  in  which  relations 
among  employees  are  more  collegial,  and  rules  can 
be  bent  in  the  interest  of  achieving  a  common  over¬ 
arching  goal.  Thompson  borrowed  these  ideas  and 
developed  his  own  theory  of  organization  that 
applied  to  how  administration  should  be  struc¬ 
tured  to  be  conducive  to  development.  His  assump¬ 
tion  was  that  since  organic  management  was  espe¬ 
cially  suitable  in  conditions  of  change,  for  exam¬ 
ple,  in  technology  or  market  opportunities,  this 
could  be  applied  also  to  administration  of  develop¬ 
ment  since  the  latter  implies  change.  Thompson’s 
theory  focused  on  how  behavioral  change  could  be 
induced  by  senior  administrators.  It  assumed  that 
bureaucratic  organizations  could  be  reformed 
from  within.  His  theory,  therefore,  was  attractive 
to  practitioners  interested  in  making  public  admin¬ 
istration  in  the  new  states  more  developmental. 

A  second  approach  that  is  associated  especially 
with  Fred  Riggs  started  from  a  systems  point  of 
view  and  was  less  precise  in  terms  of  what  could  or 
should  be  done.  His  “prismatic  theory  of  society” 
was  inspired  by  modernization  theory,  according 
to  which  societies  move  from  being  “traditional” 
to  becoming  “modern.”  Riggs’s  contribution  was 
that  in  the  field  of  administration,  states  in  Asia, 
Africa,  or  Latin  America  were  caught  somewhere 
in  between,  with  influences  coming  both  from  the 
past  and  the  present,  hence  creating  a  syncretic 
combination  that  was  different  but  not  necessarily 
developmental.  His  work  attracted  interest  from 
other  scholars  as  long  as  modernization  theory  had 
its  influence  until  the  early  1970s  but  lost  it  imme¬ 
diately  thereafter. 

Whereas  Thompson  had  concentrated  on  what 
could  be  done  with  development  administration 
theory  inside  organizations,  Riggs  argued  that 
administrative  organizations  reflected  the  values 


intrinsic  in  society.  Theories  of  development 
administration  could  not  be  applied  without 
attention  to  its  “ecology”  or  societal  environment. 
These  contending  perspectives  dominated  the 
debate  in  the  1970s.  David  Leonard  made  perhaps 
the  most  original  and  empirically  based  contribu¬ 
tion  to  this  debate  with  his  pathbreaking  study  of 
the  agricultural  extension  services  in  Kenya.  He 
reached  two  important  conclusions  that  helped  set 
the  debate  into  the  1980s.  The  first  was  that 
rejecting  a  mechanistic  form  of  organization  in 
favor  of  an  organic  one  had  its  own  costs  because 
there  are  certain  basic  functions  in  bureaucracies, 
such  as  personnel  management  and  accounting, 
that  cannot  be  overlooked  if  administration  is 
going  to  be  developmental.  Preferring  the  concept 
of  “development  bureaucracy,”  Leonard  sug¬ 
gested  that  there  is  not  necessarily  a  tension 
between  mechanistic  and  organic  forms  of  organi¬ 
zation.  The  second  contribution  was  that  although 
societal  factors  matter,  organizations  can  make  a 
difference  if  properly  administered.  Government 
ministries  are  not  “hopeless  cases”  but  rather 
potential  change  agents.  Leonard,  therefore,  kept 
the  field  of  development  administration  alive  by 
merging  previously  contending  perspectives  and 
arguing  that  development  is  possible  through 
more  careful  and  systematic  attention  to  bureau¬ 
cratic  organizations. 

A  fourth  perspective  on  development  adminis¬ 
tration  was  provided  by  George  Gant,  who  made 
a  distinction  between  what  may  be  called  a  more 
down-to-earth  administration  in  specific-line  min¬ 
istries,  on  the  one  hand,  and  a  comprehensive 
development  administration  concentrated  in  a 
superior  branch  of  government,  for  example,  a 
President’s  Office  or  a  Planning  Commission,  on 
the  other.  Gant  was  skeptical  that  a  developmental 
form  of  administration  could  emerge  in  day-to-day 
bureaucratic  settings  and  advocated,  therefore,  the 
creation  of  a  “super”  administration  in  charge  of 
development,  an  idea  that  he  appears  to  have  bor¬ 
rowed  from  the  literature  and  practice  of  develop¬ 
ment  planning. 

This  account  of  significant  contributions  to  the 
field  would  not  be  complete  without  mentioning 
two  other  names.  The  first  is  Milton  Esman,  who 
through  various  contributions  followed  the  evolu¬ 
tion  of  the  field  from  its  heyday  to  its  demise. 
Together  with  colleagues  at  Cornell  University,  he 
was  also  responsible  for  applying  organizational 
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theory  to  empirical  reality  in  South  Asia.  The  other 
is  Bernard  Schaffer,  who  wrote  a  seminal  piece  in 
1969  in  which  he  was  the  first  to  criticize  the 
ambitions  of  the  field  for  being  naive  about  the 
possibility  of  making  bureaucratic  forms  of  admin¬ 
istration  more  developmental.  Drawing  especially 
on  his  studies  and  experience  in  India,  Schaffer 
argued  somewhat  along  the  same  lines  as  Riggs 
that  where  norms  in  society  are  different  from 
those  required  in  a  functioning  administrative 
organization,  development  administration  is 
unlikely  to  emerge. 

Demise 

Development  administration  never  took  off  as  a 
field  of  its  own.  It  became  a  victim  of  its  own  high 
aspirations  and  the  difficulties  of  realizing  them. 
The  shift  in  the  1980s  away  from  state-led  devel¬ 
opment  exacerbated  this  difficulty.  What  has  hap¬ 
pened  since  may  be  captured  in  two  distinct  trends. 
One  has  been  the  incorporation  of  development 
administration  into  the  field  of  public  administra¬ 
tion — a  return  to  where  it  all  came  from  in  the  first 
place.  This  manifests  itself  in  the  belief  that  for 
developing  countries  to  become  successful,  they 
require  a  public  administration  similar  to  that 
found  in  developed  countries.  The  multiple  public 
sector  reform  programs  that  have  been  launched  in 
the  1990s  and  2000s  with  generous  funding  from 
the  donor  community  are  a  confirmation  of  this 
trend.  In  this  perspective,  development  administra¬ 
tion  has  been  reduced  to  simply  being  administra¬ 
tion  in  developing  countries.  It  has  no  longer  a 
theory  of  its  own. 

The  second  trend  has  been  the  move  from  devel¬ 
opment  administration  to  governance.  This  meta¬ 
morphosis  has  subordinated  issues  of  public 
administration  to  a  broader  set  of  concerns,  both 
economic  and  political.  Thus,  rather  than  treating 
issues  of  administrative  reform  as  a  separate  con¬ 
cern,  these  issues  are  interpreted  in  a  broader  eco¬ 
nomic  and  political  reform  perspective.  Costs  of 
administration  have  become  a  central  issue  thanks 
to  its  encounter  with  economics.  Likewise,  admin¬ 
istration  and  politics  are  no  longer  two  distinct 
fields  but  brought  together  in  the  name  of  good 
governance  to  make  both  activities  more  relevant 
and  appealing  to  the  public,  an  objective  that  is 
now  applied  by  both  scholars  and  development 


practitioners  to  developed  and  developing  coun¬ 
tries  alike. 
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Diaspora 


The  word  diaspora  comes  from  the  ancient  Greek 
dia  speiro,  meaning  to  sow  over.  It  refers  to  popu¬ 
lations  that  originated  from  the  same  place  but 
have  now  scattered  to  different  locations.  The 
concept  of  diaspora  has  long  been  used  to  refer  to 
the  Greeks  in  the  Hellenic  world  and  to  the  Jews 
after  the  fall  of  Jerusalem,  and  beginning  in  the 
1950s  and  1960s,  scholars  began  to  use  it  with 
reference  to  the  African  diaspora.  However,  it  has 
come  to  be  used  more  widely  during  the  past 
2  decades.  This  entry  first  puts  the  concept  of 
diaspora  in  historical  perspective  and  discusses  the 
various  typologies  that  researchers  have  devel¬ 
oped  in  analyzing  it.  The  entry  then  focuses  on  the 
political  implications  of  both  the  phenomenon 
and  its  interpretations,  and  finally,  it  considers 
what  is  at  stake  in  the  transformation  of  the 
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nation-state  system  that  arises  out  of  the  diaspora 
experience. 

Evolution  of  the  Concept  of  the  Diaspora 

The  concept  of  diaspora  did  not  figure  promi¬ 
nently  in  the  social  sciences  until  the  late  1960s, 
and  the  use  of  the  plural  is  recent.  It  used  to  refer 
primarily  to  the  Jewish  experience,  in  Greek  ver¬ 
sions,  particularly  through  the  expulsion  of  the 
people  and  destruction  of  the  Jerusalem  temple 
under  the  Babylonian  empire.  The  Jewish  popula¬ 
tion’s  experience  of  a  dispersal  made  necessary  by 
its  loss  of  territory  shapes  a  tragic  vision  of  the 
diaspora,  long  shared  by  many  analysts. 
Nonetheless,  since  ancient  times,  the  concept  has 
been  used  in  a  positive  though  much  less  influen¬ 
tial  way  to  refer  to  the  Greek  colonization  between 
the  6th  and  4th  centuries  BCE  in  all  the 
Mediterranean  lands,  from  the  shores  of  present 
Turkey  and  Crimea  to  the  Strait  of  Gibraltar, 
spreading  civilization  in  these  territories  through 
many  Hellenic  cities. 

Both  experiences — rooted  in  the  Western  tradi¬ 
tion — have  constituted  stereotypes  of  diasporas, 
though  other  noticeable  cases  from  the  East  have 
developed  for  the  medieval  and  modern  times.  For 
instance,  the  expansion  of  China  has  been  per¬ 
ceived  as  an  acceptable  phenomenon,  described  in 
an  ancient  Chinese  poem:  “Wherever  the  ocean 
waves  touch,  there  is  an  overseas  Chinese” — thus 
naturalizing  diasporic  initiatives  along  trade  routes. 
India’s  expansion,  especially  throughout  the  Indian 
Ocean,  has  also  provided  an  example  of  the  settle¬ 
ment  of  a  population  beyond  its  own  boundaries. 
Since  the  19th  century,  the  increase  in  the  popula¬ 
tions  of  coolies  to  work  in  agricultural  or  industrial 
plants  worldwide  has  drawn  particular  attention. 

In  fact,  when  scholars  attempted  to  classify  dif¬ 
ferent  experiences,  from  the  late  1980s  onward,  in 
order  to  be  able  to  identify  diasporic  processes 
beyond  the  unique  and  restrictive  Jewish  case,  the 
Greek,  Chinese  Indian,  and  African  ones  served  to 
establish  a  typology,  along  with  the  former.  As 
Robin  Cohen  (1997)  describes,  diasporas  were 
classified  as  victim,  imperial/colonial,  trade,  or 
labor  diasporas,  according  to  the  main  motives 
that  generated  original  migration — namely,  expul¬ 
sion,  expansion,  commerce,  or  work.  Other  exam¬ 
ples,  fitting  into  such  types,  demonstrated  the 


transhistorical  relevance  of  such  a  classification. 
For  instance,  the  Armenian  exile  in  Europe,  British 
thalassocratic  empire,  Lebanese  trading  posts  in 
Africa  and  Latin  America,  and  Moroccan  cheap 
labor  settlements  in  Western  Europe. 

Other  typologies,  departing  from  functionalist 
views,  emphasize  historical  or  political  factors. 
Thus,  according  to  Gabriel  Sheffer  (2003),  they 
mention  traditional/historical  (Jewish,  Greek, 
Phoenician),  dormant  (American/U.S.),  incipient 
(Russian,  post-Soviet),  stateless  (Palestinian,  Roma/ 
gypsy),  and  state-linked  (Mexico,  China)  diasporas. 
Such  classifications  assume  that  diasporas  are  unsta¬ 
ble  social  bodies  whose  morphology  can  change 
over  time,  passing  from  one  category  to  another. 
This  evolutionary  and  flexible  approach  shows  that, 
if  diasporas  have  indeed  existed  since  ancient  times, 
some  have  formed,  others  have  disappeared,  but 
even  more  have  been  recently  created. 

Most  of  the  scholars  converge  on  the  fact  that 
massive  population  moves  since  the  middle  of  the 
19th  century  have  generated  multiple  diasporas 
that  became  especially  visible  in  the  late  20th  cen¬ 
tury.  As  a  world  map  of  the  impact  of  migrations 
would  show,  durable  expatriate  communities  have 
been  established  around  the  globe.  The  question 
that  arises  is,  Do  these  settlements  share  enough 
common  points  to  be  all  denominated  diasporas? 
The  issue  of  criteria  divides  the  academic  commu¬ 
nity.  Some  scholars  insist  on  permanence  (of  the 
settlements),  others  on  internal  community  organi¬ 
zation,  on  the  contrast  with  or  exclusion  from  host 
societies,  on  social  composition  of  expatriate  com¬ 
munities,  on  the  modes  of  communication  between 
these,  on  the  number  of  dispersed  units,  or  on  the 
will  to  come  back  to  the  place  of  origin,  and  so  on. 
According  to  the  evidence  they  collected,  they 
argue  that  one  or  several  of  these  criteria  will  pre¬ 
vail  over  the  others  and  define  a  true  diaspora. 

Political  Significance 

The  minimal  feature  of  diasporas  is,  in  fact,  the 
dispersion  from  a  common  origin.  This  origin  may 
be  partly  mythical  or  purely  symbolic,  as  in  the  case 
of  the  Black/ African  diaspora,  whose  collective 
identity  resides  more  in  a  sociocultural  experience 
of  suffering  (based  on  a  history  of  exploitation) 
than  in  an  actual  relationship  (Stephane  Dufoix, 
2003).  However,  most  of  the  diasporas  have  kept 
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relationship  with  the  place  of  origin  and  among  the 
scattered  groups  themselves.  As  Georges  Prevelakis 
(1996)  notes,  the  network  structure  that  links  the 
different  parts  is  therefore  a  general  feature. 
According  to  Sheffer  (2003)  and  William 
Berthomiere  and  Christine  Chivallon  (2006), 
because  the  origins  of  recent  diasporas  are  existing 
or  potential  nation-states,  some  authors  qualify 
these  as  ethno-national  diasporas  to  explicitly 
distinguish  them  from  transnational  networks  in 
general  that  have  developed  in  the  context  of 
globalization.  In  that  sense,  diasporas  appear  as 
transborder  extensions  of  nation-states  and  deter- 
ritorialized  countries’  populations,  in  an  increas¬ 
ingly  normal  pattern. 

Today’s  estimate  is  that  almost  1  out  of  10 
human  beings  is  living  in  a  diasporic  situation 
(between  400  and  600  million).  The  number  of 
individuals  with  dual  citizenship  has  exploded  in  a 
short  period  of  time.  In  Latin  America  alone,  from 
4  countries  allowing  it  in  the  1980s,  the  number 
had  reached  10  by  early  2000  (Brazil,  Colombia, 
Costa  Rica,  the  Dominican  Republic,  Ecuador,  El 
Salvador,  Mexico,  Panama,  Peru,  and  Uruguay; 
see  Michael  Jones-Correa,  2001). 

Surveys  show  that  dozens  of  countries  today 
have  set  up  organizations,  institutions,  procedures, 
and  devices  of  all  sorts  to  reach  and  capitalize  on 
their  expatriates.  Financial  remittances  of  migrants 
(not  only  first  generation)  have  gone  up  to  several 
hundred  billion  dollars  per  year  and  are  increas¬ 
ingly  channeled  for  productive  collective  projects, 
not  just  for  individual  consumption  purposes. 
Social  remittances  such  as  technology  transfers, 
information  or  knowledge  exchanges,  organiza¬ 
tional/management  norms,  or  democratic  values 
transmission  are  also  mentioned  as  positive  inputs 
for  development  in  home  countries.  Migrants’  and 
expatriates’  associations  are  burgeoning  in  many 
host  countries. 

The  emerging  interest  of  diaspora  populations 
in  their  countries  of  origin  leads  host  countries  to 
worry  about  whether  diaspora  populations  may 
manipulate  or  exploit  them.  These  host  countries 
fear  a  fifth  column  operating  against  national 
interest  or  suspicious  ethnic  networks  involved  in 
delinquent  or  terrorist  activities.  However,  the  host 
countries’  approach  is  generally  favorable  to  dias¬ 
poras  and  supportive  of  their  organizations.  They 
consider  that  it  is  easier  to  control  migrant  groups 


through  these  organizations  than  to  deal  with 
amorphous  and  heterogeneous  populations  with¬ 
out  proper  representation.  At  the  same  time,  coop¬ 
eration  through  diaspora  groups  creates  all  sorts  of 
opportunities  abroad  for  the  receiving  nations. 

Cooperation,  however,  cannot  be  taken  for 
granted.  In  a  number  of  cases,  diasporas  come  from 
origin  countries  where  their  members  are  not  wel¬ 
come  and  where  free  circulation  is  limited,  making 
cooperation  impossible.  On  the  other  side,  xeno¬ 
phobia  and  a  reluctance  to  accept  foreign  people 
have  not  disappeared  and  can  spread  in  crisis  situ¬ 
ations.  Diasporic  conditions  in  a  nation-state  sys¬ 
tem  remain  submitted  to  territorial  legislations  and 
decision  making,  which  may  limit  their  initiatives. 

Future  Perspectives 

The  diaspora  may  sometimes  appear  as  a  step 
toward  a  postnational  system  of  governance,  as  a 
transition  to  a  cosmopolitan  regime.  The  fact  that 
nation-states  are  no  longer  contiguous,  but  rather 
expand  in  networks  in  which  national  boundaries 
overlap,  does  change  the  landscape  of  interna¬ 
tional  relations.  So  far,  it  seems  that  this  generally 
does  not  bring  problems.  The  fear  that  double  citi¬ 
zenship,  for  instance,  would  create  a  conflict  of 
loyalty  between  the  local/host  government  and  the 
country  of  origin  has  not  materialized.  The  reason 
is  simply  that  the  probability  of  irremediable 
antagonism  between  both  poles  of  migration  is 
extremely  low.  When  it  happened  in  the  past,  in 
exceptional  cases,  expatriates  tended  to  identify 
with  the  country  where  they  live,  work,  and  raise 
children.  Generally  speaking,  today,  individuals 
and  groups  from  the  diaspora  combine  identities, 
feeling  that  they  belong  to  both  home  and  host 
countries  and  that  they  can  mix  both  easily  in  their 
daily  life  in  a  nonexclusive  and  productive  manner. 

Today,  many  people  claim  to  be  living  in  a  dias¬ 
pora,  to  be  part  of  a  minority,  or  to  have  ancestors 
from  a  different  ethnic  group.  Contrary  to  former 
pessimistic  views,  they  attach  a  positive  value  to 
this  as  if  it  added  a  premium  of  identity,  rather 
than  a  negative  stigmatization.  The  present-day 
mobility  of  individuals  no  longer  means  an  experi¬ 
ence  of  being  uprooted  and  immersed  in  a  foreign 
world;  rather,  it  is  seen  as  the  ability  to  keep  in 
touch  with  relatives  and  contacts  abroad/at  home 
and  to  remain  connected  to  cultural,  cognitive,  and 
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symbolic  values  of  remote  places.  Information  and 
communication  technologies  have  obviously  facili¬ 
tated  this  new  proximity,  but  host  countries’  evolu¬ 
tion  from  a  homogeneous  conception  of  citizenship 
toward  more  pluralistic,  multiethnic  approaches  is 
also  crucial.  Today,  more  than  in  the  past,  politi¬ 
cal,  and  socioeconomic  integration  may  be  dissoci¬ 
ated  from  cultural  and  relational  assimilation. 

Some  authors  consider  this  as  a  postmodern 
condition  in  which  individuals  have  multiple  and 
fragmented  identities  that  the  diaspora  would  be 
reflective  of.  There  are  indeed  strong  examples  of 
hybridization  through  the  diaspora — for  instance, 
expatriate  engineers  and  information  scientists 
joining  associations  of  Asian  natives  in  North 
America  to  promote  their  mutual  interests  and 
careers  through  social  networking.  Ethnic  back¬ 
ground  thus  becomes  a  means  for  achieving  eco¬ 
nomic  goals  in  a  foreign  environment,  with  an 
individual’s  multifaceted  affiliation  serving  as  a 
professional  advantage.  In  that  sense,  the  very 
term  eth no-national  diasporas  may  be  restrictive, 
today,  as  it  reduces  the  expatriates  groupings  to  a 
single  dimension.  A  social-constructivist  approach, 
in  which  identity  building  results  from  local  inter¬ 
actions  and  interests,  articulated  with  larger  deter¬ 
minants  through  network-operated  negotiations,  is 
appropriate.  Identities  are  thus  no  longer  petrified 
in  macrosocial  categories,  but  neither  have  they 
vanished  in  an  atomistic  world. 

In  such  a  situation,  the  nation-state  still  remains 
sovereign.  According  to  Sheffer  (2003),  until  now, 
there  has  been  no  case  in  which  the  diaspora  has 
imposed  its  own  decisions  on  the  country  of  origin, 
even  though  there  have  been  many  effective  inter¬ 
ventions  of  expatriates  (e.g.,  the  struggles  for 
democratization  in  South  Africa  and  Eastern 
Europe  in  the  1990s).  At  the  opposite  end,  direct 
and  decisive  involvement  in  nation-states  has  been 
seen  in  the  actions  of  the  former  Soviet  Union 
toward  incipient  Russian  diasporas.  This  reminds 
us  that  new  actors  and  relations,  such  as  diasporas, 
may  be  translated  and  interpreted  in  traditional 
politics,  where  irredentism  might  be  perceived 
again  as  a  source  of  problems,  in  a  purely  realistic 
approach  to  international  relations. 
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Dictatorship 


A  dictatorship  is  a  form  of  government  in  which  a 
person  or  a  group  has  absolute  power,  unlimited 
by  constitution  or  other  laws  and  not  based  on 
traditional  legitimacy.  The  meaning  of  the  term, 
and  the  content  of  the  phenomenon  itself,  has 
changed  over  time.  In  political  science,  the  study  of 
dictatorship  constitutes  an  important  subject. 
Understanding  it  is  vital  for  comparative  politics 
and  for  the  study  of  new  democracies  that  emerged 
from  the  ashes  of  former  dictatorial  regimes.  This 
entry  first  describes  the  history  of  dictatorships  and 
reviews  the  major  typologies  used  to  classify  them. 
It  then  discusses  the  causes  of  dictatorships  and  the 
various  ways  in  which  they  have  been  ended. 

History 

The  term  dictator  was  used  for  the  first  time  in 
ancient  Rome.  In  times  of  calamities,  consuls,  on 
the  recommendation  of  the  Senate  confirmed  by 
the  popular  assembly  (comitia  curiata),  appointed 
a  dictator  (with  the  title  of  magister  populi)  from 
among  former  consuls.  Such  dictators  had  unlim¬ 
ited  power  ( imperium  summum)  for  a  limited 
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period  of  6  months.  A  dictator  was  first  appointed 
in  501  BCE.  In  the  1st  century  BCE,  the  institution 
changed  its  character;  under  the  dictators  Lucius 
Cornelius  Sulla  (82-79  BCE)  and  Gaius  Julius 
Caesar  (49-44  BCE),  dictatorship  became  the  per¬ 
manent  rule  of  powerful  individuals.  After  the 
abolishment  of  the  Republic  in  33  BCE,  the 
Roman  Empire  became  an  early  version  of  military 
dictatorship  with  the  Praetorian  Guard  holding 
effective  power  to  appoint  and  overthrow  the 
emperors.  Late  Roman  praetorianism  became  a 
prototype  for  the  military  dictatorship  in  modern 
times.  The  collapse  of  the  Roman  Empire  resulted 
in  the  disappearance  of  the  dictatorial  system  for 
more  than  12  centuries. 

The  concept  of  dictatorship  appeared  in  politi¬ 
cal  thought  in  the  16th  century.  Niccolo 
Machiavelli,  in  the  34th  chapter  of  Discourses, 
criticized  the  Roman  dictatorship  for  having  led  to 
tyranny,  but  he  advocated  the  idea  of  a  dictator 
ruling  by  the  will  of  the  people  to  bring  unity  and 
freedom  to  Italy.  Jean  Bodin  used  the  term  dicta¬ 
torship  to  define  sovereignty,  and  Jean-Jacques 
Rousseau  defined  dictatorship  as  a  system  sus¬ 
pending  the  legitimate  government. 

Early-modern  dictatorships  were  the  result  of 
revolutions,  which  abolished  the  monarchy  but 
were  unable  to  establish  stable  democratic  govern¬ 
ments.  The  rule  of  Oliver  Cromwell  in  England 
(1649-1658)  and  Napoleon  Bonaparte  in  France 
(1799-1814)  are  the  best  known  examples  of  suc¬ 
cessful  military  commanders  becoming  absolute 
rulers  due  to  the  weakness  of  the  democratic  insti¬ 
tutions  in  their  respective  countries  and  the  sup¬ 
port  of  the  armed  forces  under  their  command. 
Following  the  national  revolutions  in  Latin 
America  in  the  early  19th  century,  dictatorial  sys¬ 
tems  headed  by  military  commanders  (caudillos) 
were  established,  and  in  many  countries  of  the 
region,  they  became  the  dominant  pattern  of  gov¬ 
ernment  until  the  second  half  of  the  20th  century. 

Europe  faced  the  strongest  wave  of  dictator¬ 
ships  in  the  first  half  of  the  20th  century.  The 
Russian  Revolution  of  1917  resulted  in  the  estab¬ 
lishment  of  the  first  dictatorship  of  the  Communist 
Party.  Its  ideological  justification  was  based  on  the 
Marxist  concept  of  the  “dictatorship  of  the  prole¬ 
tariat,”  coined  by  Karl  Marx  but  redefined  by 
Vladimir  Lenin.  According  to  the  Leninist  doc¬ 
trine,  during  the  long  process  of  constructing  the 


communist  society,  power  should  be  concentrated 
in  the  hands  of  the  Communist  Party,  defined  as 
the  avant-garde  of  the  working  class. 

The  collapse  of  the  Ottoman  Empire  resulted  in 
the  establishment  of  a  secular  one-party  regime  in 
Turkey  under  the  leadership  of  Mustafa  Kemal 
Pasha. 

During  the  interwar  period  (1918-1939)  in  all 
newly  independent  states  of  Central  Europe  with 
the  exception  of  Czechoslovakia,  democratic  gov¬ 
ernments  were  overthrown  and  dictatorships, 
mostly  military,  were  established.  In  1922,  the 
fascist  party  of  Benito  Mussolini  seized  power  in 
Italy;  in  1926,  a  dictatorial  regime  was  established 
in  Portugal;  and  in  1933,  the  National  Socialist 
Workers  Party  of  Germany  (Nationalsozialistische 
Deutsche  Arbeiterpartei,  or  NSDAP)  headed  by 
Adolf  Hitler  came  to  power  in  Germany.  After 
winning  the  Spanish  Civil  War  (1936-1939),  the 
military  under  Francesco  Franco  established  a  dic¬ 
tatorial  system  based  on  a  coalition  of  the  military, 
the  Roman  Catholic  Church,  and  the  fascist  party 
Falange.  On  the  eve  of  World  War  II,  the  majority 
of  European  states  were  ruled  by  dictatorial 
regimes.  The  Japanese  military  established  their  de 
facto  power  behind  the  facade  of  the  absolute  rule 
of  the  emperor. 

World  War  II  resulted  in  the  defeat  of  dictato¬ 
rial  regimes  in  Germany,  Italy,  and  Japan,  opening 
the  road  to  the  democratization  of  those  three 
countries.  In  East-Central  Europe  and  North 
Korea,  however,  the  hegemony  of  the  Soviet  Union 
resulted  in  the  establishment  of  dictatorial  rules  of 
the  communist  parties.  In  later  years,  communist 
dictatorships  have  been  established  through  revo¬ 
lutions  and  wars  for  independence  in  China 
(1949),  Cuba  (1959),  Vietnam  (1954  in  the  North 
and  1975  in  the  whole  country),  and  Cambodia 
(1975).  In  Western  Europe  prewar  dictatorships 
survived  only  in  Portugal  (until  1974)  and  Spain 
(until  1975).  In  Greece,  the  military  established 
dictatorship  in  1967,  only  to  abdicate  its  power  in 
1974  after  the  abortive  attempt  to  incorporate 
Cyprus. 

The  process  of  decolonization  produced  several 
dictatorial  regimes  in  the  new  states  of  Africa 
where  the  democratic  governments  established  by 
the  departing  colonial  powers  proved  to  be  weak 
and  inefficient.  The  same  process  took  place  in 
some  newly  independent  states  of  Asia  (Pakistan, 
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Burma,  and  South  Korea).  In  the  Arab  countries  of 
North  Africa  and  the  Middle  East,  nationalistic 
dictators  came  to  power  following  the  pattern  of 
the  Egyptian  revolution  of  1952.  As  the  result  of 
these  processes,  in  the  early  1970s,  dictatorships 
of  various  types  ruled  over  more  than  half  of  the 
world  population.  Of  the  122  states  with  more 
than  1  million  inhabitants,  only  30  (24.6%)  were 
democracies  in  the  1970s  (data  from  Freedom 
House  definition  with  regard  to  civil  and  political 
rights).  Democratic  governments  survived  in 
Western  Europe,  North  America,  and  Australia 
and  in  a  few  other  states. 

The  retreat  of  dictatorship  began  during  the  20 
years  of  the  third  wave  of  democratization  (1974- 
1995).  Between  1974  and  1975,  Portugal,  Greece, 
and  Spain  abolished  dictatorships.  In  the  1980s, 
democracy  was  reestablished  in  most  of  the  Latin 
American  states,  and  in  1989,  the  process  of 
democratization  started  in  the  communist  states  of 
East-Central  Europe,  followed  by  the  abolishment 
of  the  communist  rule  and  the  eventual  dissolution 
of  the  Soviet  Union.  In  the  late  20th  and  early  21st 
centuries,  there  were  also  democratic  changes  in 
some  countries  of  Africa  and  Asia.  The  net  result 
of  the  third  wave  of  democratization  was  the 
reduction  in  the  number  of  dictatorial  regimes.  In 
2008,  according  to  the  Economic  Intelligence  Unit, 
dictatorial  regimes  survived  only  in  51  (30.5%) 
states  with  34.9%  of  the  world  population.  The 
largest  and  most  powerful  contemporary  dictator¬ 
ship  is  the  rule  of  the  People’s  Republic  of  China 
by  the  Communist  Party  of  China.  Communist 
dictatorships  survived  also  in  North  Korea, 
Vietnam,  and  Cuba.  In  some  post-Soviet  republics 
in  Central  Asia  and  the  Caucasus  (Azerbaijan, 
Kazakhstan,  Tajikistan,  Turkmenistan,  and 
Uzbekistan),  authoritarian  rule  substituted  for  the 
former  communist  dictatorship.  Dictatorships  sur¬ 
vived  also  in  several  countries  of  Africa  and  Asia. 

Typologies 

Modern  dictatorships  differ  in  their  political  form, 
social  content,  and  ideological  orientation. 

Political  Form 

As  far  as  the  political  form  of  dictatorship  is 
concerned,  the  most  important  distinction 


introduced  in  the  early  1950s,  mostly  by  scholars 
such  as  Hannah  Arendt,  Carl  J.  Friedrich,  Karl 
Wittfogel,  and  Zbigniew  K.  Brzezinski,  was  between 
totalitarianism  and  other  forms  of  dictatorship. 
Later,  Juan  J.  Linz  and  Leonardo  Morlino  developed 
this  typology  by  comparing  totalitarian  and 
authoritarian  types  of  dictatorship.  According  to 
them,  totalitarian  dictatorship  differs  from  other 
forms  of  dictatorial  rule  by  the  existence  of  the 
“totalitarian  syndrome”  composed  of 

1.  an  ideology  professing  the  total  transformation 
of  social  relations, 

2.  the  rule  of  a  single  party  opposition  to  which  is 
illegal, 

3.  state  monopoly  of  the  means  of  violence  and  of 
the  media, 

4.  state  control  of  the  economy,  and 

5.  police  terror. 

Two  of  the  most  important  forms  of  totalitarian 
dictatorship  have  been  fascism  and  Nazism  on  one 
hand  and  communism  on  the  other.  Authoritarian 
regimes  make  up  a  more  diversified  category,  with 
military  dictatorships  being  the  most  common  but 
not  the  only  form  of  such  a  system.  They  also  tend 
to  be  less  stable  than  totalitarian  regimes. 

A  different  typology  related  to  the  political  form 
of  dictatorship  focuses  on  who  holds  power  in  a 
dictatorial  regime.  From  this  perspective,  dictator¬ 
ships  can  be  divided  into  three  main  categories: 
personalistic,  military,  and  party  dictatorships. 

Personalistic  dictatorship  is  characterized  by  the 
concentration  of  power  in  the  hands  of  the  supreme 
leader  who  dominates  the  institutions.  Totalitarian 
dictatorship  took  this  form  under  powerful  leaders 
such  as  Hitler,  Stalin,  Mao  Zedong,  and  Fidel 
Castro,  who  dominated  the  political  institutions. 
Military  regimes  based  on  the  domination  of 
armed  forces  were  the  most  common  form  of  dic¬ 
tatorship  in  the  19th  century,  particularly  in  Latin 
America.  Military  dictatorships  in  the  early  21st 
century  include  those  in  Myanmar  (Burma),  Fiji, 
Gambia,  Ivory  Coast,  Democratic  Republic  of 
Congo,  Libya,  Mauritania,  and  Syria,  as  well  as 
authoritarian  regimes  established  by  military  coups 
(in  Egypt,  Tunisia).  Party  dictatorships  have  had 
two  main  forms:  totalitarian  (fascist,  Nazi,  and 
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communist)  and  non-totalitarian  (Turkey  1923- 
1945,  Mexico  1929-1977).  Some  communist  dic¬ 
tatorships  (Yugoslavia  after  1949,  Poland  after 
1956)  lost  their  totalitarian  character,  remaining 
authoritarian  versions  of  party  dictatorship.  There 
have  been  several  dictatorships  based  on  a  mixture 
of  military,  party,  and  bureaucratic  characteristics. 

The  German  political  writer  Carl  Schmitt  intro¬ 
duced  a  third  typology  that  distinguished  between 
two  main  types  of  dictatorship  based  on  the  politi¬ 
cal  form  of  dictatorship:  commissariat  and  sover¬ 
eign.  The  first  is  established  to  restore  order  so  that 
the  existing  constitution  can  function  normally, 
while  the  latter  abrogates  the  existing  constitution 
to  establish  a  new  political  order. 

Social  Content 

A  typology  based  on  the  relation  between  the 
dictatorship  and  the  social  structure  distinguishes 
between  three  types  of  dictatorial  rule:  conserva¬ 
tive ,  modernizing,  and  counterrevolutionary.  The 
conservative  dictatorship  serves  the  interests  of  the 
dominant  social  classes  and  defends  the  existing 
social  order.  A  peculiar  version  of  the  conservative 
dictatorship,  subordinated  to  the  Islamic  clergy 
and  devoted  to  the  preservation  of  religious  values, 
was  established  in  Iran  in  1979  and  in  Afghanistan 
in  1992.  The  modernizing  dictatorships  attempt  to 
reform  the  society  from  above,  sometimes  with  the 
support  of  the  underprivileged  strata  of  the  soci¬ 
ety.  The  counterrevolutionary  dictatorship  came 
to  power  as  the  result  of  a  successful  use  of  force 
in  defense  of  the  interests  of  the  privileged  strata 
endangered  by  the  radical  reforms  of  a  moderniz¬ 
ing  government.  The  great  majority  of  the  military 
dictatorships  belong  to  the  conservative  category, 
but  some  have  been  modernizing  regimes  (e.g., 
Turkey  under  Mustafa  Kemal  Pasha,  1923-1938; 
Egypt  under  Abdel  Nasser,  1952-1970;  and 
Argentina  under  Juan  Peron,  1945-1955).  Spain 
under  Francesco  Franco  (1936-1975)  and  Chile 
under  Augusto  Pinochet  (1973-1989)  are  classic 
cases  of  counterrevolutionary  dictatorships.  They 
were  established  by  military  coups  to  undo  the 
socialist  reforms  carried  out  by  democratically 
elected  governments.  With  the  passing  of  time,  the 
social  role  of  the  dictatorship  often  changes,  with 
the  regime  becoming  more  the  guardian  of  the 
vested  interests  of  the  ruling  elite  rather  than  a 


champion  of  social  change.  Such  change  took 
place  in  the  communist  regimes. 

Ideological  Orientation 

Typology  based  on  the  relation  between  the 
regime  and  ideology  distinguishes  between  ideo¬ 
logical  and  pragmatic  dictatorships.  In  addition  to 
totalitarian  dictatorships,  which  have  been  ideo¬ 
logical  by  definition,  there  have  been  some  other 
ideological  dictatorships,  but  most  dictatorial 
regimes  are  based  rather  on  vaguely  defined  ideas, 
such  as  the  national  interest  or  the  political  legacy 
of  the  founder  of  the  regime  (e.g.,  Kemalism  in 
Turkey,  Peronism  in  Argentina),  than  on  more 
articulated  ideologies.  In  addition  to  communism, 
fascism,  and  Nazism,  which  were  the  ideologies  of 
the  totalitarian  regimes,  religious  ideologies  and 
nationalism  have  been  the  most  frequent  types  of 
justification  for  modern  dictatorships.  Spanish  and 
Portuguese  dictatorships  relied  on  Roman 
Catholicism  as  the  ideological  rationale  of  the 
regime,  while  Islam  has  been  the  ideological  base 
of  dictatorships  in  several  countries  in  the  Muslim 
world,  particularly  in  Iran  since  1979  and  in 
Afghanistan  from  1992  to  2001. 

Causes  and  Mechanisms  of  Dictatorship 

Explanations  of  the  emergence  and  survival  of  dic¬ 
tatorships  refer  to  (a)  political  and  societal  condi¬ 
tions  that  make  it  possible  to  abolish  the  existing 
system  and  to  establish  a  dictatorship,  (b)  motiva¬ 
tions  of  the  principal  actors,  and  (c)  techniques 
employed  by  the  architects  of  dictatorships. 

Political  and  Social  Causes 

The  main  political  cause  of  dictatorships  is  the 
weakness  of  the  former  regime.  Such  a  political 
situation  results  from  a  variety  of  causes,  such 
as  (a)  the  weakness  of  traditional  monarchy,  no 
longer  commanding  the  loyalty  of  its  subjects; 
(b)  weakness,  ineffectiveness,  and  corruption  of 
the  democratic  governments  combined  with  the 
lack  of  democratic  mechanisms  to  change  such  a 
situation;  (c)  the  lack  or  weakness  of  democratic 
political  culture,  both  at  the  elite  and  at  the  mass 
level;  (d)  humiliation  and  frustration  caused  by  the 
defeat  in  a  war;  and  (e)  imposition  of  a  dictatorial 
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regime  by  the  hegemonic  power  (as  was  the  case  in 
most  of  the  Communist  dictatorships  in  East- 
Central  Europe  after  World  War  II).  Social  causes 
of  dictatorship,  often  combined  with  the  political 
ones,  include  the  following:  (a)  sharp  social  con¬ 
flicts  endangering  the  existing  social  and  economic 
system,  (b)  economic  crisis  resulting  in  a  drastic 
deterioration  of  the  standard  of  life  of  the  large 
part  of  the  society,  and  (c)  the  emergence  of  a  large 
number  of  people  (mostly  young  males)  who  have 
lost,  or  never  gained,  a  stable  place  in  the  social 
structure  and  are  likely  to  become  foot  soldiers  of 
a  movement  to  establish  dictatorship. 

Motivations 

Motivations  to  establish  a  dictatorship  include 
ideological  or  religious  beliefs,  frustration  with  the 
existing  political  and/or  economic  situation  and 
the  feeling  that  extraordinary  means  are  necessary 
to  save  the  state,  group  interests  (particularly  the 
corporate  interests  of  the  military),  and  the  per¬ 
sonal  ambitions  of  a  strong  leader. 

Techniques 

In  most  cases,  techniques  of  the  establishment  of 
a  dictatorial  regime  are  based  on  the  use  of  force, 
particularly  military  force.  Such  a  coup  takes  vari¬ 
ous  forms:  from  the  threat  to  use  force  (pronuncia- 
mento  in  Latin  American  terminology)  through  the 
forceful  removal  of  the  existing  government  to  a 
prolonged  civil  war.  In  the  first  two  cases,  cohesion 
of  the  forces  supporting  the  coup  is  crucial  for  its 
success.  In  the  case  of  a  civil  war,  its  result  depends 
on  a  number  of  military  and  nonmilitary  factors, 
particularly  on  the  ability  to  win  the  support  of  a 
considerable  part  of  the  population. 

Newly  established  dictatorships  often  seek  a 
degree  of  legitimacy.  Some  rely  on  plebiscites  in 
which  citizens  ratify  the  results  of  the  coup. 
Napoleon  Bonaparte  first  introduced  the  plebisci¬ 
tary  form  of  legitimating  a  dictatorship.  Some 
regimes  use  a  version  of  noncompetitive  (plebisci¬ 
tary)  election  to  gain  a  degree  of  legitimization. 
Mass  propaganda  campaigns,  sometimes  with 
direct  participation  of  the  dictator,  serve  also  to 
legitimize  the  regime.  The  effectiveness  of  such 
campaigns  varies.  Some  dictatorships  have  been 
able  to  obtain  a  strong,  albeit  nondemocratic, 


mandate,  while  others  are  forced  to  rely  mostly  on 
naked  coercion. 

Dictatorships  differ  in  the  degree  to  which  they 
use  coercion.  The  common  characteristic  of  dicta¬ 
torships  is  that  all  of  them  use  coercion  to  suppress 
opposition  and/or  to  achieve  other  goals.  However, 
the  degree  and  nature  of  coercion  vary. 

In  their  formative  stage,  all  dictatorial  regimes 
tend  to  use  repression  against  real  or  alleged  ene¬ 
mies,  but  the  intensity  of  such  repression  varies. 
Thousands  of  republicans  were  killed  after  the 
Spanish  Civil  War,  and  more  than  3,000  support¬ 
ers  of  the  overthrown  government  were  extermi¬ 
nated  by  the  Chilean  military  after  the  coup  of 
1973.  There  were,  however,  relatively  benign  dic¬ 
tatorships,  which  used  coercion  rarely  and  toler¬ 
ated  independent  political  activities,  which  were 
severely  punished  in  other  dictatorships  (Poland 
after  the  military  coup  of  1926).  In  most  cases,  the 
prolonged  existence  of  the  dictatorships  leads  to 
the  reduction  of  the  intensity  of  coercion  and,  in 
many  cases,  to  the  gradual  liberalization  of  the 
regime.  However,  even  in  a  relatively  benign  dicta¬ 
torship,  the  state  of  human  rights  departs  signifi¬ 
cantly  from  the  democratic  standards. 

Extrication  From  Dictatorship 

The  great  majority  of  modern  dictatorships  have 
existed  for  relatively  short  periods  of  time,  usually 
less  than  2  decades.  Authoritarian  dictatorships  in 
Portugal  (1926-1974)  and  Spain  (1936-1975) 
were  exceptionally  durable.  In  Egypt  after  the  1952 
military  coup,  the  problem  of  succession  in  an 
authoritarian  dictatorship  was  twice  resolved  by 
the  orderly  access  to  power  of  former  vice  presi¬ 
dents,  Anwar  Sadat  in  1970  and  Hosni  Mubarak  in 
1981,  following  the  death  of  their  predecessors; 
however,  Mubarak’s  regime  was  overthrown  in 
2011.  Communist  dictatorships  lasted  much  longer 
than  most  of  the  noncommunist  dictatorial  regimes. 
The  Soviet  system  survived  74  years  and  the  com¬ 
munist  regimes  in  East- Central  Europe  for  more 
than  40  years.  The  still  existing  communist  dicta¬ 
torships  in  China,  Cuba,  North  Korea,  and  Vietnam 
belong  to  the  oldest  dictatorial  regimes  in  the  con¬ 
temporary  world.  Even  they,  however,  are  much 
younger  than  old  democracies,  which  function 
successfully  for  many  generations,  in  some  cases 
for  more  than  2  centuries  (the  United  Kingdom 
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[UKJ,  the  United  States)  and  in  many  for  more  than 
a  century. 

The  relative  instability  of  dictatorships  results 
from  several  structural  weaknesses.  Dictatorships 
usually  have  problems  in  establishing  secure  mecha¬ 
nisms  of  succession.  This  is  particularly  true  in 
personalistic  dictatorships  since  as  long  as  the 
supreme  leader  is  alive  nobody  is  allowed  to 
become  the  official  successor.  Selection  of  compe¬ 
tent  functionaries  is  difficult  because  political  loy¬ 
alty  is  considered  more  important  than  profession¬ 
alism.  Dictatorships  also  have  problems  with  han¬ 
dling  popular  dissatisfaction.  Since  there  are  few 
possibilities  of  legal  expression  of  protest,  dissatis¬ 
faction  leads  to  alienation  from  the  regime  and  to 
spontaneous  outbursts.  When  the  level  of  such  dis¬ 
satisfaction  is  too  high,  naked  force  is  not  enough 
to  secure  the  survival  of  the  regime.  After  World 
War  II,  the  international  environment  made  it  more 
difficult  to  maintain  dictatorial  systems  of  govern¬ 
ment  in  those  parts  of  the  world  where  the  influence 
of  the  democratic  powers  were  the  strongest. 

There  have  been  several  ways  of  extrication 
from  dictatorships.  Not  all  of  them,  however,  were 
fully  successful  in  the  sense  of  establishing  stable 
democracies.  In  several  cases,  departure  from  a 
dictatorial  system  led  to  establishment  of  another 
dictatorship.  In  some  others,  the  result  was  unsta¬ 
ble  democracy  or  even  anarchy  and  the  collapse  of 
state  power. 

The  main  patterns  of  extrication  from  dictator¬ 
ship  are  (a)  defeat  in  war  and  foreign  occupation, 
(b)  revolution  or  a  coup,  (c)  democratization  of  the 
regime  from  above  with  or  without  negotiations 
with  the  opposition,  and  (d)  capitulation  caused  by 
a  major  setback  for  the  existing  regime. 

Military  Defeat  and  Foreign  Occupation 

Some  dictatorships  have  been  abolished  as  a 
consequence  of  the  war  lost.  Napoleon  Bonaparte 
lost  power  after  his  defeat  in  the  anti-Napoleonic 
Wars  of  Liberation  (1813-1814).  Several  European 
dictatorships,  including  the  German  and  Italian 
ones,  as  well  as  the  military  rule  in  Japan,  were 
destroyed  as  a  consequence  of  World  War  II.  The 
dictatorship  of  the  Red  Khmers  in  Cambodia  was 
removed  from  power  by  the  invading  army  of 
Vietnam  in  1978.  In  Afghanistan  in  2001  and  in 
Iraq  in  2003,  dictatorial  regimes  were  removed  by 


the  military  intervention  of  the  United  States  and 
its  allies.  The  removal  of  a  dictatorship  by  foreign 
forces  opens  the  door  for  the  construction  of  a 
democratic  system  under  temporary  foreign  pro¬ 
tectorate,  in  some  cases  lasting  several  years.  Not 
always,  however,  was  the  process  as  successful  as 
it  had  been  in  Germany,  Italy,  and  Japan. 

Revolution  or  Coup 

Sometimes  dictatorships  are  overthrown  by  a 
revolution  or  by  a  military  coup.  Such  cases  were 
few,  due  to  the  effective  use  of  force  by  the  dictato¬ 
rial  regimes  and  the  relative  weakness  of  their 
opponents.  The  instances  of  successful  revolutions 
launched  against  the  dictatorship  regimes  were 
Cuba  in  1959  and  Portugal  in  1974.  Originally,  a 
popular  revolt  in  Hungary  in  1956  managed 
almost  to  destroy  the  communist  dictatorship,  but 
soon  it  was  crushed  by  the  military  intervention  of 
the  Soviet  Union.  Only  in  one  case — Romania  in 
1989 — a  communist  dictatorship  has  been  over¬ 
thrown  by  popular  reform,  supported  by  the 
armed  forces.  Revolts  and  coups  often  result  in  one 
dictatorship  being  substituted  by  another.  In  sev¬ 
eral  African  states,  dictators  were  overthrown  by 
armed  rebels  who  then  established  their  own  dicta¬ 
torial  rule.  In  some  Latin  American  states  (e.g., 
Argentina  in  1955),  the  military  removed  dictators 
only  to  establish  their  own  rule. 

Democratization 

In  several  cases,  dictatorships  have  been  abol¬ 
ished  through  a  process  of  democratization  initiated 
by  the  reform-oriented  wing  of  the  existing  regime, 
with  or  without  cooperation  with  the  moderate 
wing  of  the  opposition.  In  Turkey,  the  ruling  party 
initiated  the  democratization  in  1945  by  permitting 
the  formation  of  opposition  and  by  introducing 
competitive  elections.  Brazilian  democratization 
from  above  (abertura,  a  Portuguese  term  that 
means  “opening,”  used  to  describe  Brazil’s  transi¬ 
tion  from  military  to  elected  government)  began  in 
the  second  half  of  the  1970s  and  resulted  in  full 
return  to  democracy  in  about  10  years.  A  similar 
process  took  place  in  Mexico  where  the  new  law  of 
1977  opened  possibilities  for  electoral  competition 
between  the  ruling  party  and  the  opposition  and  in 
the  Republic  of  Korea  in  the  last  years  of  the  20th 
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century.  In  Spain,  after  the  death  of  General  Franco 
in  1975,  King  Juan  Carlos  opened  the  process  of 
negotiations  with  the  (then  illegal)  opposition, 
which  resulted  in  restoration  of  democracy  in  few 
years.  In  Poland  in  1989  and  in  Hungary  in  1990, 
negotiations  between  the  ruling  communist  parties 
and  the  opposition  led  to  competitive  elections  and 
to  restoration  of  democracy.  Similar  processes  of 
democratization  from  above  took  place  in  some 
republics  of  former  Yugoslavia  (Slovenia,  Croatia, 
and  Macedonia)  as  well  as  in  Bulgaria,  Albania, 
and  Mongolia.  In  the  Soviet  Union,  democratic 
reforms  were  initiated  by  the  last  leader  of  the 
Communist  Party,  Mikhail  Gorbachev,  in  the  sec¬ 
ond  half  of  the  1980s  and  resulted  in  the  collapse  of 
the  communist  system  in  1991. 

Capitulation 

Abdication  of  a  dictatorial  regime  takes  place 
when  a  serious  external  fiasco  or  a  peaceful  mass 
protest  makes  the  status  quo  untenable.  In  Greece, 
the  military  government  abdicated  its  power  after 
it  had  provoked  an  international  crisis  over  a  failed 
attempt  to  incorporate  Cyprus  in  1974,  and  in 
Argentina,  the  military  rulers  gave  up  their  power 
after  the  humiliating  defeat  in  the  short  war  with 
the  UK  over  the  Falkland  Islands  in  1983.  The 
authoritarian  regime  in  Philippines  collapsed  in 
1986  as  the  result  of  mass  protests,  supported  by 
the  Roman  Catholic  Church  and  the  armed  forces. 
Communist  governments  in  Czechoslovakia  and  in 
the  German  Democratic  Republic  capitulated  in 
late  1989  when  they  had  been  confronted  with 
mass  protests  and  were  unable  to  suppress  the  pro¬ 
test  due  to  the  lack  of  Soviet  support. 
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Diplomacy 


In  its  broadest  sense,  diplomacy  refers  to  the  con¬ 
duct  of  human  affairs  by  peaceful  means,  employ¬ 
ing  techniques  of  persuasion  and  negotiation.  In 
the  more  specific  sphere  of  international  politics, 
through  the  utilization  of  such  techniques,  it  has 
come  to  be  regarded  as  one  of  the  key  processes 
characterizing  the  international  system  and  a 
defining  institution  of  the  system  of  sovereign 
states — often  referred  to  as  the  “Westphalian” 
system  after  the  1684  Peace  of  Westphalia.  Its 
usage,  however,  embraces  some  important  distinc¬ 
tions.  First,  at  the  state  level,  it  has  frequently  been 
used  (particularly  in  studies  of  diplomatic  history) 
as  a  synonym  for  foreign  policy — as  in  “Russian,” 
“German,”  and  “Japanese”  diplomacy  (foreign 
policy).  More  commonly,  however,  it  is  used  to 
refer  to  one  means  by  which  such  policies  are 
implemented.  Second,  viewed  as  an  institution  of 
the  international  system,  a  distinction  can  be  made 
between  diplomacy  as  a  set  of  processes  and  as  a 
set  of  structures  through  which  these  processes  are 
conducted.  Debates  about  the  continuing  utility  of 
diplomacy  in  contemporary  international  politics 
frequently  reflect  confusion  between  these  mean¬ 
ings.  In  the  course  of  the  following  discussion,  the 
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origins  of  diplomacy  are  outlined,  together  with 
differing  analytical  approaches  to  its  nature  and 
significance  as  a  feature  of  international  politics. 
The  changing  nature  of  diplomatic  processes  is 
then  discussed,  followed  by  an  examination  of  the 
evolution  of  the  structures  through  which  diplo¬ 
macy  has  been  conducted  at  both  the  state  and 
international  levels. 

Theoretical  Approaches 

While  the  study  of  diplomacy  has  a  long  and  hon¬ 
orable  tradition  dating  back  to  Machiavellian 
thought,  it  is  only  in  recent  years  that  diplomatic 
practice  has  started  to  receive  detailed  theoretical 
attention.  There  is  a  notable  absence  of  conscious 
theorizing  in  much  of  the  scholarship  on  diplo¬ 
macy.  Instead,  the  bulk  of  the  scholarship  offers 
detailed  historical  accounts  of  diplomatic  events 
(diplomatic  history)  as  well  as  texts  on  diplomatic 
practice.  Most  scholars  of  diplomacy  implicitly 
choose  from  a  very  narrow  range  of  analytical 
frameworks  drawn  almost  exclusively  from  the 
realist  tradition  in  international  relations  (IR).  As 
a  consequence,  the  orthodox  study  of  diplomacy 
has  been  marked  by  a  remarkably  unified  theoreti¬ 
cal  approach — something  quite  unique  in  political 
science.  There  is  a  surprising  ontological  consensus 
about  what  diplomacy  is  and  who  the  diplomats 
are.  This  consensus  arises  from  the  dominant  influ¬ 
ence  of  rationalist  thinking.  The  upshot  of  this 
dominance  is  that  the  range  of  the  scholarship  in  a 
majority  of  studies  of  diplomacy  tends  to  be  lim¬ 
ited  to  analysis  of  the  international  realm  of  sover¬ 
eign  states  in  the  context  of  high  politics.  There  is 
little  enthusiasm  among  mainstream  scholars  to 
explore  the  diplomatic  world  beyond  interstate 
relations  or  low  politics  and  that  relating  to  the 
nonpolitical.  This  is  now  changing.  There  is  a 
growing  body  of  work  interested  in  diplomacy  not 
simply  as  a  foreign  policy  tool  of  states  but  as  a 
means  of  connecting  cultures,  polities,  economies, 
and  societies.  This  section  aims  to  highlight  briefly 
the  orthodox  and  unorthodox  approaches  to  the 
study  of  diplomacy. 

Diplomacy  and  Realism 

The  realist  core  of  orthodox  approaches  to 
diplomacy  is  undisputed  and  is  clearly  evident  in  a 


number  of  key  features  found  in  this  prevailing 
approach.  The  most  telling  is  the  focus  on  the  sov¬ 
ereign  state  as  the  primary  unit  of  analysis  in  diplo¬ 
macy  such  that  the  study  of  diplomacy  is  confined 
to  the  study  of  the  process  and  content  of  interstate 
relations — that  is,  how  sovereign  states  seek  to 
engage  with  each  other.  Prevailing  models  of 
diplomacy  focus  almost  exclusively  on  singular 
state-to-state  relations.  The  orthodoxy  defines 
diplomacy  as  processes  of  communication,  nego¬ 
tiation,  and  information  sharing  among  sovereign 
states.  Diplomatic  processes  revolve  around  the 
activities  of  professional  diplomats — that  is,  offi¬ 
cials  of  foreign  ministries  and  overseas  missions. 
More  common,  especially  in  North  American 
scholarship,  is  the  narrower  definition  of  diplo¬ 
macy  as  a  foreign  policy  tool  of  states — that  is, 
diplomacy  as  statecraft.  This  more  limited  defini¬ 
tion  has  led  to  a  great  deal  of  foreign  policy  analy¬ 
sis  passing  itself  off  as  diplomatic  studies,  despite 
the  fact  that  it  does  not  consider  the  processes  of 
interstate  relations  as  its  main  focus.  Both  defini¬ 
tions,  however,  share  the  view  that  diplomacy  has 
an  ordering  role  to  play  in  the  otherwise  anarchic 
and  unstable  international  system  of  states — a 
view  that  has  theoretical  roots  in  realism.  Successful 
diplomacy,  it  is  argued,  creates  a  system  of  states. 
It  constructs  balances  of  power,  facilitates  hege¬ 
monic  structures,  and  fashions  post-hegemonic 
regimes.  When  diplomacy  fails  or  is  absent,  con¬ 
flict  and  war  usually  follow.  Indeed,  it  is  the  very 
fact  of  conflict  between  states  (a  core  realist 
assumption)  that  warrants  the  emergence  of  diplo¬ 
matic  systems. 

Beyond  Realism 

In  recent  years,  however,  there  have  been  sig¬ 
nificant  conceptual  shifts  in  the  study  of  diplo¬ 
macy,  and  as  a  result,  those  studying  diplomacy 
are  able  to  choose  from  a  wider  range  of  analytical 
approaches.  The  customary  view  of  what  the 
proper  study  of  diplomacy  entails  is  now  contested 
by  scholars  who  apply  analytical  strategies  drawn 
from  constructivist,  postmodern,  and  critical  IR 
theory  to  draw  attention  to  the  necessity  of  under¬ 
standing  IR — and  diplomacy — beyond  the  state 
and  the  international  state  system.  As  a  conse¬ 
quence,  the  study  of  diplomacy  has  stepped  out¬ 
side  the  narrow  state-centric  security  nexus  into  a 
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world  of  diplomacy  that  is  more  varied  but  also 
more  difficult  to  specify.  It  is  perhaps  this  lack  of 
specificity  in  what  is  being  analyzed,  and  why,  that 
explains  why  unorthodox  approaches  continue  to 
be  marginalized. 

Unorthodox  approaches  are  analytically  diverse, 
yet  they  share  a  key  point  of  departure  from  ortho¬ 
dox  approaches — a  refusal  to  accept  the  state  as 
the  exclusive  unit  of  diplomatic  analysis.  Diplomacy 
is  seen  as  a  more  open-ended  process  where  diplo¬ 
matic  agency  includes  not  only  the  state  but  also  a 
range  of  nonstate  actors  such  that  a  sociological 
concept  of  diplomacy  emerges  where  diplomacy 
possesses  economic,  cultural,  social,  as  well  as 
political  forms  and  functions.  A  common  theme 
within  these  approaches  is,  therefore,  the  problem¬ 
atic  core  idea  of  the  foreign  ministry  and  its  over¬ 
seas  missions  as  the  sole  agent  of  diplomacy. 
Unorthodox  approaches  suggest  that  the  proper 
terrain  of  the  study  of  diplomacy  includes,  and 
extends  beyond,  foreign  ministries,  overseas  mis¬ 
sions,  and  the  state  officials  that  work  in  these 
government  institutions  and  international  organi¬ 
zations  to  potentially  include  diplomatic  networks 
largely  drawn  from  all  sections  of  domestic  and 
international  society  covering  any  number  of 
issues  from  the  environment  and  e-commerce  to 
avian  flu  and  land  mines.  An  important  implica¬ 
tion  of  this  is  that  diplomacy  has  many  modes, 
including  conventional  interstate  relations,  non- 
conventional  intercultural  relations  or  commercial 
relations,  and  modes  that  mix  the  two.  Moreover, 
the  study  of  diplomacy  entails  the  rejection  of  the 
simple  reproduction  of  the  status  quo  of  interstate 
power  relations  (described  as  antidiplomacy)  at 
the  heart  of  orthodox  studies  of  diplomacy  and,  in 
the  case  of  postmodern  approaches,  the  produc¬ 
tion  of  the  concept  of  “otherness,”  which,  it  is 
claimed,  is  the  core  of  all  diplomatic  modes.  In  this 
sense,  the  world  of  diplomacy  is  characterized  not 
by  the  commonality  of  the  material  and  security 
interests  of  states  but  by  differences — different 
interests,  diverse  cultures,  and  varied  identities. 

While  orthodox  approaches  import  analytical 
tools  from  realist  IR  to  develop  concepts  such  as 
summit  diplomacy,  bilateral  diplomacy,  and  multi¬ 
lateral  diplomacy,  unorthodox  approaches,  by 
contrast,  import  analytical  tools  from  other  social 
science  fields  such  as  political  economy,  business 
and  management  studies,  philosophy,  theology, 


sociology,  and  anthropology  to  explore  diplo¬ 
matic  practice.  As  a  result,  new  concepts  of  diplo¬ 
macy  such  as  catalytic  diplomacy,  network 
diplomacy,  sustainable  diplomacy,  and  multi¬ 
stakeholder  diplomacy  have  been  developed  to 
provide  analytical  means  to  explain  the  contempo¬ 
rary  diplomatic  practice  in  ways  that  draw  atten¬ 
tion  to  the  different  interests,  cultures,  and  identi¬ 
ties  represented  by  state  and  nonstate  actors  and 
the  varied  modes  of  diplomacy  that  emerge  with 
the  dual  engagement  of  this  “otherness.”  These 
new  concepts  also  draw  attention  to  the  changing 
character  of  contemporary  diplomatic  forms, 
sometimes  casting  doubt  on  the  notion  of  the  pro¬ 
gressive  development  of  diplomatic  systems  found 
in  traditional  approaches  to  diplomacy. 

In  sum,  unorthodox  approaches  to  diplomacy 
do  not  always  tie  diplomatic  practice  to  the  state 
or  to  the  problem  of  anarchy.  Instead,  diplomacy 
is  seen  as  a  means  of  connecting  individuals, 
groups,  societies,  economies,  and  states  to  build 
and  manage  social  relations  in  domestic  and  sys¬ 
temic  environments.  By  moving  beyond  traditional 
realism,  unorthodox  approaches  to  the  study  of 
diplomacy  have  promoted  greater  theoretical 
reflection  and  created  an  intellectual  multiplicity  in 
the  analysis  of  diplomatic  practices,  modes,  and 
processes.  It  is  to  these  practices,  modes,  and  pro¬ 
cesses  that  we  now  turn. 

The  Emergence  and  Development 
of  Diplomacy 

As  the  above  discussion  of  the  theories  of  diplo¬ 
macy  indicates,  those  who  study  diplomacy  remain 
divided  over  whether  it  is  essentially  a  state-based 
set  of  political  processes  or  whether  it  is  a  set  of 
network-based  political  processes.  Those  who 
maintain  that  diplomacy  is  primarily  the  pursuit  of 
the  foreign  policy  interests  of  the  state  in  the  inter¬ 
national  system  of  states  argue  that  diplomacy  is 
confined  to  a  quite  narrow  set  of  bilateral  and 
multilateral  processes  of  communication,  represen¬ 
tation,  and  mediation  focused  on  the  foreign  min¬ 
istry  and  its  overseas  missions.  Diplomatic  pro¬ 
cesses  continue  to  exhibit  some  regularity  so  that 
functions,  institutions,  codes,  conventions,  and 
cultures  of  diplomacy  are  marked  by  continuity 
and  marginal  change  and  so  that  diplomatic  rules 
and  norms  continue  to  hold  in  the  future.  The 
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obvious  casualty  in  this  approach  is  any  in-depth 
analysis  of  change  in  diplomatic  structures  and 
processes.  By  contrast,  those  who  conceptualize 
diplomacy  outside  state-centric  framework,  tend 
to  emphasize  continual  change  in  the  conduct  and 
context  of  diplomacy.  The  principle  objective  of 
network-based  approaches  is  to  highlight  and  ana¬ 
lyze  the  challenges  posed  to  diplomacy  by  contem¬ 
porary  changes  in  the  international  system. 
Scholars  turn  to  issues  of  globalization  and  region¬ 
alization  to  emphasize  the  increasingly  complex 
social,  economic,  and  political  context  of  diplo¬ 
macy  (at  domestic,  regional,  and  international 
levels).  For  these  scholars,  change  and  transforma¬ 
tion  in  diplomatic  processes  and  structures  is  the 
central  concern  of  analysis,  and  in  this  frame, 
diplomacy  is  seen  to  have  both  formal  and  infor¬ 
mal  structures.  Diplomatic  processes  are  network 
based  and  draw  in  a  range  of  public  and  private 
actors;  there  is  an  absence  of  agreed  rules  and 
norms  of  diplomatic  engagement  such  that  new 
codes  and  conventions  are  emerging  or  in  need  of 
development.  In  short,  diplomacy,  in  terms  of  both 
the  varying  processes  through  which  it  is  effected 
and  the  machinery  through  which  it  is  conducted, 
is  a  closely  linked  phenomenon  that  is  the  subject 
of  differing  interpretations.  We  will  now  examine 
how  these  have  developed  in  response  to  changes 
in  both  domestic  and  international  environments. 

The  Origins  of  Diplomacy 

The  origins  and  development  of  diplomacy  are 
frequently  equated  with  that  of  the  European  sys¬ 
tem  of  states.  In  this  view,  it  is  associated  with  the 
system  of  states  that  emerged  and  consolidated  its 
forms  and  practices  in  the  wake  of  the  1648  Peace 
of  Westphalia  marking  the  end  of  the  Thirty  Years’ 
War.  However,  diplomacy  and  its  institutions  have 
a  much  longer,  and  more  complex,  pedigree  and 
have  been  identified  as  existing  in  some  of  the  ear¬ 
liest  human  societies.  Rather  than  being  associated 
with  a  specific  historical  era,  diplomacy  has  been 
seen  as  a  response  to  a  set  of  needs  and  require¬ 
ments — namely,  the  mediation  of  separateness 
between  communities  and  the  desire  and  need  to 
establish  modes  of  communication  between  them. 
Thus,  the  earliest  documents  recording  what  we 
would  now  regard  as  formalized  diplomatic  prac¬ 
tices  are  to  be  found  in  approximately  2500  BCE 


in  what  is  now  the  Eastern  Mediterranean  and  the 
Middle  East,  and  one  of  the  most  familiar  features 
of  contemporary  diplomacy — namely,  the  practice 
of  using  resident  ambassadors — predates  its  usage 
in  the  modern  European  context  by  some  3,000 
years.  Elements  of  diplomacy  and  diplomatic  insti¬ 
tutions  can  be  identified  in  a  variety  of  interna¬ 
tional  systems,  including  those  of  Greece,  Rome, 
and  China,  but  none  of  these  possessed  what  has 
come  to  be  regarded  as  the  key  characteristics  of  a 
fully  fledged  diplomatic  system — namely,  effective 
communication,  a  set  of  procedures  and  conven¬ 
tions  governing  patterns  of  communication,  and  a 
capacity  to  mediate  between  diverse  cultures.  In 
general  terms,  the  development  of  diplomacy  has 
been  determined  by  the  character  of  the  societies 
that  it  has  sought  to  mediate,  the  international 
environment,  the  available  modes  of  communica¬ 
tion,  and  the  technologies  that  determined  them. 

Thus,  in  the  European  context,  the  medieval  era 
witnessed  the  growth  of  diplomatic  processes  as 
international  relationships  became  more  complex 
and  dense.  But  this  occurred  in  a  period  when  the 
sovereign  state  as  we  recognize  it  today  had  not 
emerged.  Against  a  background  in  which  univer- 
salist  ideas  represented  by  the  concept  of 
Christendom  underpinned  by  the  authority  of  the 
Catholic  Church  and  the  Holy  Roman  Empire 
were  a  dominant  reality,  diplomacy  was  not  yet 
associated  with  the  state,  involving  diverse  politi¬ 
cal  units.  Moreover,  while  rulers  engaged  in  the 
sending  of  missions  to  one  another,  so  too  did  other 
entities — commercial,  ecclesiastical,  and  private — 
in  the  medieval  landscape.  Relative  distance  under¬ 
scored  by  the  difficulties  of  communication  meant 
that  the  dispatch  of  diplomatic  missions  was  infre¬ 
quent,  and  their  success  marked  by  a  high  degree  of 
uncertainty  resulting  from  the  hazards  associated 
with  medieval  travel.  Furthermore,  the  precise  func¬ 
tions  of  the  representative  were  circumscribed.  In 
the  early  part  of  the  Middle  Ages,  the  most  com¬ 
mon  diplomatic  agent  was  the  nuncius ,  whose  func¬ 
tion  was  to  act  as  the  mouthpiece  of  the  principal 
on  whose  behalf  he  was  acting  and  whose  capacity 
to  negotiate  was  nonexistent  or  strictly  circum¬ 
scribed.  While  this  matched  the  requirements  of  the 
period  in  which  it  developed,  the  growing  complex¬ 
ity  of  interactions  marking  the  later  Middle  Ages 
required  the  use  of  officials  (procurators)  granted 
the  ability  to  engage  in  negotiations.  In  short,  while 
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we  can  see  the  beginnings  of  the  European  diplo¬ 
matic  environment,  this  was  the  presovereign-state 
phase  of  diplomacy  marked  by  quite  fluid  and  flex¬ 
ible  procedures  representative  of  a  period  of  major 
social,  political,  and  economic  change. 

It  was  during  the  f  5th  and  f  6th  centuries  that  a 
clear  outline  of  the  diplomatic  system  as  it  was  to 
develop  over  the  next  200  years  became  visible.  By 
this  time,  the  collapse  of  the  universal  concept  of 
Christendom  had  been  accompanied  by  the  gradual 
emergence  of  the  sovereign  state.  This  not  only 
required  a  greater  capacity  to  communicate  within 
a  changing  political  and  economic  environment 
and  thus  the  development  of  ways  in  which  this 
could  be  effected  but  also  provided  enhanced 
domestic  administrative  resources  necessary  to  its 
operations.  It  was  in  Northern  Italy  that  the  earliest 
manifestations  of  this  new  phase  of  diplomacy  were 
commonly  identified.  Here,  an  early  form  of  what 
was  to  become  the  European  system  of  states  could 
be  seen.  Significant  factors  were  the  geographical 
proximity  of  the  Italian  city-states,  their  relative 
similarity  in  terms  of  power  and  thus  an  inability  to 
exercise  hegemonic  power,  and  a  shared  cultural 
environment  that  facilitated  communication. 

Modern  Diplomacy 

Against  this  background,  the  practices  of  mod¬ 
ern  diplomacy  were  honed.  In  institutional  terms, 
the  key  development  was  the  growing  utilization 
of  the  resident  ambassador.  As  noted  above,  it  was 
not  that  this  practice  was  unknown  in  earlier  peri¬ 
ods  but  that  diplomacy  by  mission  for  specific 
purposes  was  far  more  common,  meeting  the  per¬ 
ceived  requirements  of  the  time.  Again,  it  was  a 
combination  of  political  and  social  change  and  the 
consequent  requirements  imposed  on  diplomacy 
that  underlay  this  development.  In  particular, 
while  its  ceremonial  and  symbolic  functions 
remained  significant,  a  growing  need  for  the  gath¬ 
ering  of  reliable  and  continuous  information 
replaced  the  earlier  emphasis  on  the  exchange  of 
messages.  Gradually,  the  practices  developed  in 
this  region  of  Europe  were  to  spread  across  the 
continent  and,  subsequently,  would  be  adopted  as 
key  principles  for  the  conduct  of  diplomacy  as  the 
international  system  expanded  beyond  its  shores. 

These  principles  assumed  several  forms.  On  the 
one  hand,  as  already  seen,  more  regularized  and 


permanent  structures  were  deemed  appropriate 
and  necessary.  During  the  ensuing  centuries,  the 
exchange  of  permanent  representatives  between 
national  governments  would  become  the  norm  of 
diplomatic  intercourse  as  its  structures  and  pro¬ 
cesses  were  aligned  with  the  state.  Consequently, 
the  rules  and  norms  of  diplomacy  were  refined  to 
support  the  diplomatic  system  through  the  con¬ 
solidation  of  the  principle  of  immunity  for  diplo¬ 
mats  and  the  development  of  protocol — such  as 
the  rules  of  precedent — established  at  the  1815 
Congress  of  Vienna  and  codified  in  the  1961 
Vienna  Convention  on  Diplomatic  Relations.  The 
growing  importance  attached  to  the  practice  of 
diplomacy  is  reflected  in  the  numerous  treatises  on 
the  necessary  qualities  required  of  the  ambassador, 
notably  in  the  writings  of  Bernard  du  Rosier, 
Philippe  de  Commynes,  and  Franqois  de  Callieres. 
Not  only  did  these  writings  serve  as  diplomatic 
manuals  for  ambassadors,  they  reflected  the  ways 
in  which  diplomacy  was  adapting  to  the  realities  of 
a  developing  system  of  sovereign  states.  At  the 
governmental  level,  enhanced  importance  was 
attached  to  the  capacity  to  process  the  growing 
flow  of  intelligence  generated  by  diplomatic 
networks.  France,  under  Cardinal  Richelieu,  is 
credited  with  the  creation  of  the  first  recognizable 
foreign  ministry  in  the  early  17th  century  and  the 
gradual  separation  in  the  conduct  of  domestic  and 
foreign  policy.  Thus,  by  the  18th  century,  the  pat¬ 
terns  of  diplomacy  at  both  the  international  and 
national  levels  had  assumed  the  shape  that  would 
become  a  familiar  feature  of  the  international 
order  in  the  ensuing  centuries. 

The  19th  century  witnessed  the  consolidation  of 
these  patterns  but,  at  the  same  time,  saw  consider¬ 
able  change  in  response  to  developments  at  national 
and  international  levels.  At  the  national  level,  the 
administrative  apparatus  for  the  conduct  of  diplo¬ 
macy  would  become  larger,  more  elaborate,  and 
more  professional.  This  not  only  reflected  changes 
in  the  role  and  structure  of  the  state  and  the  con¬ 
sequent  need  for  more  sophisticated  bureaucratic 
systems  but  also  mirrored  the  growing  complexity 
of  foreign  policy  and  the  demands  that  this  placed 
on  national  governments.  The  emergence  of  the 
modern  state  and  the  professionalization  of 
bureaucracy  affected  foreign  ministries,  as  did  the 
profound  social  change.  Although  a  gradual  and 
uneven  process,  the  image  of  diplomacy  as  the 
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preserve  of  the  aristocracy  was  weakened  as 
recruitment  became  less  a  matter  of  patronage  and 
more  a  matter  of  talent.  Across  Europe,  foreign 
ministries  and  their  diplomatic  services  developed 
systems  of  recruitment,  selection  by  means  of 
examination,  promotion  by  merit  rather  than 
patronage,  and  embryonic  training  programs.  An 
early  form  of  the  latter  was  the  creation  of  the 
Oriental  Academy  (later  Consular  Academy) 
established  in  the  Hapsburg  Empire  in  the  mid- 
18th  century.  From  a  focus  on  language  training, 
diplomatic  education  was  to  expand  in  scope  to 
include  aspects  such  as  commercial  diplomacy  as 
international  economic  linkages  developed.  Despite 
the  fact  that  diplomacy  had  begun  to  embrace  the 
middle  classes  and  to  lose  some  of  its  aristocratic 
connotations,  the  ambience  that  the  latter  bestowed 
on  it  was  slow  to  disappear,  particularly  in  some 
European  states.  In  France,  for  example,  by  the 
early  20th  century,  the  diplomatic  profession  was 
dominated  by  the  middle  class,  whereas  in 
Germany,  the  nobility  were  a  dominant  presence. 
One  factor  that  assisted  this  process  of  democrati¬ 
zation  was  the  recognition  that  diplomats  could 
not  be  expected  to  finance  their  activities  from 
their  own  resources,  which  had,  at  least  in  part, 
been  a  feature  of  past  practice,  and  that  unpaid 
attaches  seeking  an  opening  in  diplomacy  were  no 
longer  part  of  a  professional  service,  hence  the 
development  of  career  structures  and  the  grading 
of  salaries,  however  meager  these  might  be,  along¬ 
side.  Nevertheless,  none  of  this  was  to  take  from 
diplomacy  the  air  of  exclusivity  that,  to  a  degree,  it 
continues  to  possess  and  that  came  to  be  seen  as  a 
feature  of  what  would  be  designated  as  the  “old 
diplomacy.” 

Many  of  the  characteristic  structures  of  the  for¬ 
eign  ministry  would  also  be  established  in  this 
period.  One  of  these  was  the  distinction  between 
geographical  and  functional  organizational  princi¬ 
ples,  the  latter  a  recognition  of  the  growing  com¬ 
plexity  of  IR  that  cut  across  the  division  of  the 
world  into  geographical  regions.  It  would,  how¬ 
ever,  be  incorrect  to  assume  that  the  foreign  minis¬ 
try  was  a  focus  of  policy  making.  Much  of  its  work 
was  of  a  clerical-administrative  nature  with  for¬ 
eign  policy  being  made  at  the  political  level  with 
direct  communications  between  a  foreign  minister 
and  ambassadors.  Moreover,  despite  later  asser¬ 
tions  on  the  part  of  foreign  ministries  that  they  had 


once  been  the  gatekeeper  between  states  and  their 
international  environments,  it  was  not  always  the 
case  that  they  enjoyed  the  privileged  position  that 
this  implies.  For  some  states,  it  was  other  depart¬ 
ments — particularly  those  overseeing  commercial 
relations — that  were  regarded  as  possessing  greater 
functionality  and  prestige. 

Developments  at  the  national  level  went  hand  in 
hand  with  those  at  the  international  level.  This  was 
represented  by  the  extension  of  the  diplomatic  net¬ 
work.  By  the  latter  part  of  the  19th  century,  all  the 
great  powers  of  Europe  had  exchanged  missions, 
thus  marking  bilateral  diplomatic  relations  con¬ 
ducted  through  permanent  residential  posts  as  the 
hallmark  of  the  international  diplomatic  system. 
Furthermore,  the  practice  had  spread  beyond 
European  shores.  Despite  suspicions  of,  and  a 
reluctance  to  engage  in,  what  was  often  regarded 
as  a  manifestation  of  old  world  ills,  the  United 
States  began  to  expand  its  diplomatic  service  dur¬ 
ing  the  19th  century.  Elsewhere,  countries  as 
diverse  as  Japan,  Persia,  and  Brazil  would  develop 
the  makings  of  a  diplomatic  machinery  at  home 
and  a  diplomatic  service  overseas.  But  of  particular 
significance  was  the  gradual  spread  of  European 
diplomatic  norms — not  least  those  relating  to  dip¬ 
lomatic  privilege  and  immunities — of  great  signifi¬ 
cance  as  the  international  system  became  global  in 
its  scope  during  the  20th  century. 

Alongside  these  developments  was  the  emer¬ 
gence  of  “conference”  diplomacy  heralding  the 
growth  of  multilateral  diplomacy  in  the  ensuing 
decades.  One  manifestation  of  this  was  the  short¬ 
lived  “Congress  System”  following  the  end  of  the 
Napoleonic  Wars,  comprising  sovereigns  and  chief 
ministers.  Greater  effects  flowed  from  the  creation 
of  standing  conferences  of  ambassadors  in  major 
capitals  dealing  with  specific  issues — such  as  that 
set  up  in  London  on  the  abolition  of  the  slave  trade 
after  the  Congress  of  Vienna.  By  the  latter  half  of 
the  century,  technological  developments,  particu¬ 
larly  in  communications,  had  prompted  recogni¬ 
tion  of  the  need  for  international  cooperation  in 
areas  such  as  telegraphic  (the  International 
Telegraphic  Union  created  in  1865)  and  postal 
(Universal  Postal  Union  created  in  1874)  commu¬ 
nications.  This  not  only  generated  an  awareness  of 
the  need  for  diplomatic  activity  in  this  area,  it  also 
brought  with  it  two  effects  that  would  become 
themes  for  diplomatic  change  in  later  years.  One 
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was  the  need  for  technical  expertise  in  diplomacy, 
and  the  other  was  the  gradual  involvement  of 
“domestic”  departments  in  international  negotia¬ 
tions,  an  early  example  of  which  was  the  represen¬ 
tation  of  the  Home  Office  in  the  British  delegation 
to  the  1910  International  Aerial  Navigation 
Conference.  By  the  eve  of  the  Great  War,  then,  not 
only  had  the  structures  and  processes  of  diplomacy 
assumed  many  of  their  modern  forms,  the  chal¬ 
lenges  that  they  would  confront  in  the  coming 
decades  were  equally  identifiable. 

Diplomacy  in  the  20th  Century 

Developments  in  the  20th  century  posed  just  as 
many  challenges  to  diplomacy  as  those  in  the  pre¬ 
vious  century  and  indeed  some  developments  were 
to  exact  a  high  price  on  the  reputation  and  prestige 
of  the  diplomatic  system  and  professional  diplo¬ 
mats.  The  crisis  of  the  Great  War  (1914-1918) 
was  one  such  development,  and  it  holds  particular 
significance  since  it  led  to  widespread  condemna¬ 
tion  of  the  old  European-based  diplomacy,  which 
had  not  only  failed  to  prevent  war  but  also  had,  as 
many  concluded,  contributed  to  its  outbreak.  One 
key  failing  of  diplomacy  in  this  period  was  the 
abandonment  of  established  diplomatic  channels 
of  communication  by  resident  ambassadors  in 
favor  of  more  secretive  diplomatic  practices. 
Diplomacy  quickly  deteriorated  into  a  closed  sys¬ 
tem,  where  behind-the-scenes  bilateralism  and 
propaganda  fed  an  appetite  for  the  brutal  pursuit 
of  national  interest  in  an  atmosphere  of  mistrust 
and  rivalry  that  the  diplomats  themselves  had  done 
much  to  create.  The  old  diplomacy  that  emerged  at 
the  end  of  the  19th  century  had  created  the  very 
problems  that  drove  the  European  powers  to  all- 
out  war  within  the  first  2  decades  of  the  following 
century,  and  condemnation  of  the  old  diplomacy 
quickly  led  to  demands  for  a  new,  and  more  open, 
diplomatic  system  where  diplomats  could  be  held 
accountable  to  their  executives. 

European  states’  response  to  these  demands  was 
almost  universal.  Across  Europe,  the  semiautono- 
mous  resident  ambassador  was  replaced  by  a  cen¬ 
trally  controlled  system  of  overseas  permanent 
missions.  While  bilateralism  remained  a  core  dip¬ 
lomatic  process  around  these  new  permanent  over¬ 
seas  missions,  during  the  Great  War  period  and  in 
the  following  decade,  multilateral  diplomacy  took 


off.  A  significant  amount  of  intergovernmental 
diplomacy  was  now  taking  place  outside  the  more 
established  bilateral  diplomatic  structures  and  for¬ 
eign  ministries  in  the  form  of  intergovernmental 
conferences  of  state  leaders  and  other  government 
ministers  such  as  finance  and  trade.  Multilateral 
diplomacy  involving  officials  from  departments 
across  government  became  a  key  vehicle  for  allied 
cooperation  during  the  war  on  issues  such  as  food 
and  munitions  transportation,  as  well  as  intelli¬ 
gence  sharing  and  military  coordination. 

Important  and  influential  though  it  certainly 
was,  the  Great  War  did  not  exhaust  the  challenges 
that  the  so-called  new  diplomacy  would  face  in  the 
20th  century.  Within  just  a  few  decades,  the 
European-based  diplomatic  system  would  be  both 
overhauled  and  expanded  to  other  continents  as 
the  diplomatic  system  adapted  to  two  open-ended 
developments:  (1)  the  growing  interdependence  of 
states  in  the  international  system,  which  increased 
the  demand  for  effective  coordination  of  interna¬ 
tional  cooperation  in  an  ever-growing  number  of 
policy  areas  but  especially  in  trade  and  finance, 
and  (2)  the  onset  of  decolonization  and  indepen¬ 
dence  that  more  than  quadrupled  the  number  of 
sovereign  states  in  the  international  system  by  the 
end  of  the  20th  century. 

The  Great  Depression  of  the  1930s  demon¬ 
strated  very  starkly  the  economic  interdependence 
of  states,  and  after  the  end  of  World  War  II,  finan¬ 
cial  and  trade  integration  intensified,  creating 
demands  on,  as  well  as  opportunities  for,  diplo¬ 
mats  to  coordinate  international  economic  policy 
in  bilateral,  regional,  and  multilateral  relation¬ 
ships.  Indeed,  the  development  of  the  Bretton 
Woods  system  created  a  number  of  powerful  inter¬ 
national  economic  organizations  such  as  the 
General  Agreement  on  Tariffs  and  Trade  where 
diplomats  would  negotiate  international  trade 
policies  and  rules  governing  trade.  Economic  inter¬ 
dependence  within  regions  led  to  a  huge  expansion 
in  the  number  of  regional  organizations  in  Europe, 
Asia,  the  Americas,  and  eventually  Africa  and  the 
Caribbean.  The  vast  increase  in  the  number  of 
international  organizations  from  around  40  at  the 
beginning  of  the  20th  century  to  almost  400  at  its 
end  is  one  of  the  most  significant  developments  in 
IR  during  this  period.  These  organizations  would 
also  present  both  opportunities  for,  and  demands 
on,  diplomacy  to  work  in  new  institutional  and 
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policy  environments  and  to  develop  new  diplo¬ 
matic  methods  in,  for  example,  multilateral  bar¬ 
gaining.  With  the  development  of  nuclear  arms 
and  the  Cold  War  from  the  1950s  and  interna¬ 
tional  terrorism  from  the  1970s  onward,  strategic 
interdependence  between  states  became  world¬ 
wide.  Diplomats  would  populate  the  increasing 
number  of  multilateral  and  regional  strategic  orga¬ 
nizations  from  the  United  Nations  Security  Council 
and  the  International  Atomic  Energy  Agency  to  the 
North  Atlantic  Treaty  Organization  and  the 
Warsaw  Pact.  Diplomats  would  quickly  develop 
new  diplomatic  methods  in,  for  example,  coercive 
diplomacy,  deterrence,  and  intelligence  gathering, 
as  well  as  management  of  new  processes  such  as 
superpower  summitry  and  international  peace¬ 
keeping.  The  development  of  rapid  mass  commu¬ 
nications  in  the  20th  century  linked  domestic 
developments  in  one  country  directly  with  others, 
again  creating  opportunities  for,  and  demands  on, 
diplomats  to  develop  new  practices  in  order  to 
influence  political  and  policy  developments  in  each 
others’  countries.  These  included  public  diplo¬ 
macy.  Interdependence  essentially  brought  an 
international  dimension  to  almost  all  aspects  of 
policy  and  strategy  such  that  the  realm  and  content 
of  diplomacy  during  the  20th  century  covered  the 
entirety  of  world  governance. 

While  interdependence  between  societies  inten¬ 
sified  and  deepened  as  a  result  of  globalization  and 
regionalization  processes  throughout  the  20th  cen¬ 
tury,  it  was  especially  so  in  the  latter  decades  as 
new  technologies  expedited  and  reduced  commu¬ 
nication  and  transportation  costs.  The  pattern  of 
IR  would  change  dramatically  during  this  period 
as  transnational  relations  between  nonstate  actors 
developed  and  new  global  and  regional  actors 
from  the  private  sector  and  civil  society  emerged 
and  sought  to  influence  policy  and  processes  at  all 
levels  creating  demands  for  open  and  accountable 
global  and  regional  governance  processes.  These 
developments  posed  fundamental  questions  about 
the  role  and  influence  of  the  state — and  hence 
diplomacy — relative  to  other  actors  in  the  system 
such  as  transnational  business  and  global  civil 
society.  If  the  state  was  indeed  in  decline,  then 
state-based  diplomatic  systems  would,  it  seems, 
have  decreasing  utility. 

These  new  patterns — whether  viewed  as  the 
cause  or  response — went  hand  in  hand  with  the 


development  of  linkage  of  increasingly  complex 
and  technical  policy  issues.  Issues  of  interdepen¬ 
dence,  globalization,  and  regionalization  raised 
practical  matters  crucial  to  the  continued  effective 
practice  of  diplomacy.  How  could  the  diplomat — a 
generalist  by  nature  and  by  training — have  suffi¬ 
cient  grasp  of  such  highly  technical  policy  issues 
and  how  could  the  foreign  ministry  continue  to 
manage  policy  issues  that  cut  across  several  domes¬ 
tic  department  concerns? 

At  the  same  time  that  these  pressures  were  rais¬ 
ing  questions  about  the  effectiveness  of  the  state 
and  state-based  diplomacy,  the  processes  of  decol¬ 
onization  and  independence  in  the  mid-  and  late 
20th  century  highlighted  the  continued  relevance 
of  the  state  and  the  continuing  appeal  of  European 
diplomatic  institutions.  Decolonization  and  inde¬ 
pendence  of  the  colonial  states  and  the  former 
Soviet  states  increased  the  number  of  sovereign 
states  in  the  international  system  and,  almost  with¬ 
out  fail,  each  of  the  new  states  created  diplomatic 
institutions  in  the  image  of  the  European  model  of 
a  foreign  ministry  and  system  of  overseas  missions 
and  permanent  delegations.  The  expansion  in  the 
number  of  sovereign  states  raised  questions  about 
the  diplomatic  capacity  of  the  new  states,  the 
impact  a  flush  of  new  states  would  have  on  multi¬ 
lateral  and  regional  diplomacy,  and  the  develop¬ 
ment  of  bilateral  diplomacy  between  the  new 
states  and  the  old  states. 

With  the  exception  of  India,  which  had  for 
some  time  before  independence  in  1947  acquired  a 
quasi-diplomatic  system  to  represent  itself  in,  for 
example,  the  League  of  Nations,  most  new  states, 
and  in  particular  African  states,  had  very  limited 
resources  to  spend  on  developing  a  European-style 
diplomatic  system  of  an  extensive  network  of  over¬ 
seas  missions.  Most  relied  on  a  handful  of  diplo¬ 
mats  in  key  international  organizations  and  key 
capitals,  and  it  became  common  practice  for  diplo¬ 
mats  from  developing  countries  to  provide  diplo¬ 
matic  representation  in  multiple  arenas.  An  African 
diplomat  is  often,  for  example,  the  permanent 
delegate  to  the  UN  as  well  as  ambassador  in 
Washington  and  Ottawa.  Similarly,  the  ambassa¬ 
dor  for  Tonga  usually  fills  a  number  of  posts: 
permanent  delegate  at  the  World  Trade 
Organization  (WTO)  in  Geneva  and  European 
Union  (EU)  in  Brussels,  as  well  as  ambassador  in 
London. 
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The  extent  of  the  newly  developed  states’  involve¬ 
ment  in  multilateral  diplomacy  was,  inevitably, 
limited,  at  least  until  developing  countries  began  to 
form  strategic  alliances  such  as  the  Non-Aligned 
Movement  and  the  Group  of  77  in  the  UN.  Newly 
independent  states  also  created  their  own  intergov¬ 
ernmental  organizations,  such  as  the  Organization 
for  African  Unity  created  in  1963,  as  a  way  of 
managing  regional  integration  and  security  as  well 
as  building  collective  diplomatic  strategies. 

The  arrival  of  many  new  states  into  the  system 
raised  strategic  issues  for  the  old  states,  since  they 
did  not  always  have,  or  wish  to  spend,  the 
resources  required  to  establish  diplomatic  repre¬ 
sentation  in  so  many  new  countries.  Internal 
debates  ensued  about  how  to  keep  costs  down  and 
retain  effective  coverage  of  key  strategic  countries 
and  regions.  Many  European  governments,  for 
example,  faced  demands  to  reduce  bilateral  mis¬ 
sions  in  European  capitals  in  the  wake  of  the 
development  of  very  large  permanent  delegations 
to  the  EU. 

Contemporary  Trends  in  Diplomacy 

Three  broad  trends  in  contemporary  diplomacy 
are  now  evident:  (1)  fragmentation,  as  the  conduct 
of  diplomacy  at  the  governmental  level  now 
involves  government  departments  traditionally 
associated  with  purely  “domestic”  issues;  (2)  con¬ 
centration,  as  the  fusion  of  domestic  and  interna¬ 
tional  politics  has  been  accompanied  by  the 
expanding  involvement  of  heads  of  government  in 
international  policy;  and  (3)  diffusion,  as  profes¬ 
sional  diplomats  have  found  themselves  required 
to  engage  with  a  growing  range  of  nongovernmen¬ 
tal  stakeholders  in  complex  policy  networks. 

The  first  of  these  trends,  fragmentation,  came  to 
be  associated  from  the  1970s  onward  with  the 
development  of  an  expanded  “foreign  policy  com¬ 
munity” — that  is,  an  expanding  range  of  govern¬ 
mental  agencies  and  a  multiplicity  of  channels  in 
the  conduct  of  external  relations  supplementing 
and  often  challenging  the  role  claimed  by  foreign 
ministries.  One  consequence  of  this  has  been  a 
growing  emphasis  on  the  need  for  policy  coordina¬ 
tion  at  the  national  level,  underpinned  by  recogni¬ 
tion  that  an  uncoordinated  stance  in  international 
negotiations  reflecting  various  bureaucratic  inter¬ 
ests  has  potential  costs  in  terms  of  attainment  of 


policy  goals.  This  partly  explains  the  trend  toward 
concentration.  Awareness  of  the  potential  costs  of 
lack  of  bureaucratic  and  political  coordination  and 
politicization  of  international  policy,  combined 
with  a  growing  international  role  for  heads  of  gov¬ 
ernment,  have  resulted  in  a  tendency  to  centralize 
the  conduct  of  diplomacy  in,  for  example,  prime 
ministerial  and  presidential  offices.  Additionally, 
the  imperatives  of  coordination  have  resulted  in 
the  merging  of  departments  in  the  quest  for  greater 
efficiency  in  the  management  of  external  relations, 
notably  in  the  area  of  external  trade  relations. 
Hence,  both  Canada  and  Australia  merged  their 
foreign  ministries  and  international  trade  depart¬ 
ments  during  the  1980s. 

The  third  trend,  diffusion,  reflects  the  fact  that 
diplomatic  processes  have  increasingly  required 
the  development  of  policy  networks  as  complex 
policy  issues  demand  that  state-focused  diplomacy 
is  supplemented  by  linkages  with  civil  society  orga¬ 
nizations  and,  in  specific  contexts,  the  business 
community.  Part  of  this  development  mirrors  the 
changing  nature  of  international  negotiation  as  it 
assumes  the  character  of  a  management  process 
marked  by  its  technical  qualities,  complexity, 
uncertainty,  and  bureaucratization.  In  this  context, 
diplomacy  has  become  much  more  than  the  trad¬ 
ing  of  concessions  in  pursuit  of  a  negotiated  settle¬ 
ment.  Many  contemporary  negotiations,  such  as 
those  in  the  area  of  the  environment,  involve  pro¬ 
cesses  of  mutual  learning  and  the  creation  and 
systematizing  of  new  knowledge  and  mutual  edu¬ 
cation  among  a  group  of  interests,  each  of  which 
has  contributions  to  make  to  the  management  of 
policy  issues.  The  impact  of  this  can  be  seen  not 
only  in  the  growing  engagement  between  actors  at 
the  national  level  but  also  in  multilateral  diplo¬ 
matic  environments.  Stakeholder  engagement  has 
become  a  watchword  in  the  majority  of  interna¬ 
tional  organizations,  from  the  UN  to  the  World 
Bank  and  the  WTO.  Underpinning  these  develop¬ 
ments  at  the  national  and  international  levels  is  the 
enhanced  emphasis  on  public  diplomacy  from  the 
late  1990s  onward.  But  from  being  primarily  con¬ 
cerned  with  image  management,  public  diplomacy 
strategies  are  increasingly  founded  on  an  awareness 
that  the  routes  of  influence  within  the  international 
system  are  changing  and  that  shaping  international 
policy  outcomes  demand  strategies  for  influencing 
a  much  wider  range  of  constituencies  than  those  at 
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the  governmental  level.  Consequently,  by  the  early 
21st  century,  public  diplomacy  had  become  one  of 
the  primary  concerns  of  foreign  ministries.  One 
obvious  manifestation  of  the  awareness  of  the 
importance  of  engaging  with  an  expanded  audi¬ 
ence  at  home  and  abroad  has  been  the  attention 
paid  by  foreign  ministries  to  the  use  of  the  Internet 
and  increasingly  sophisticated  and  accessible  Web 
sites.  Recognition  of  the  growing  demand  for 
interactive  processes  means  that  these  ministries 
are  now  using  social  networking  sites  such  as 
Facebook  together  with  YouTube  and  Twitter  to 
engage  in  a  dialogue  with  audiences  at  home  and 
overseas.  Hesitant  forays  into  the  virtual  world, 
Second  Life,  have  begun.  Sweden  has  established 
an  embassy  there  and  the  U.S.  State  Department 
has  sought  to  develop  its  public  diplomacy  strate¬ 
gies  by  using  this  resource  to  engage  with  bloggers 
in  the  Middle  East. 

Change  has  not  been  limited  to  the  foreign  min¬ 
istry,  however.  The  structure  and  operations  of 
diplomatic  services  have  undergone  significant 
change,  partly  as  a  result  of  resource  constraints 
but  also  due  to  the  evolving  international  order 
itself.  Here,  the  need  to  operate  more  economically 
has  combined  with  an  awareness  that  patterns  of 
international  representation  have  failed  to  keep 
pace  with  geopolitical  and  geoeconomic  change.  In 
the  United  States,  the  announcement  by  former 
Secretary  of  State  Condoleezza  Rice  of  the  Trans¬ 
formational  Diplomacy  program  was  stimulated 
by  these  concerns.  In  addition  to  this,  the  impact 
of  the  twin  processes  of  fragmentation  and  con¬ 
solidation  has  meant  that  the  overseas  network  of 
diplomatic  posts  is  now  regarded  as  representative 
of  the  entire  governmental  apparatus  and  not  only 
of  the  foreign  ministry.  Indeed,  in  some  diplomatic 
posts,  the  number  of  traditional  diplomats  is 
dwarfed  by  the  presence  of  officials  from  a  range 
of  “domestic”  government  departments. 

A  further  significant  development  is  the  revolu¬ 
tion  in  communications  and  the  role  of  the  media 
in  international  policy.  The  adoption  of  e-mail  and 
secure  facsimile  links  between  foreign  ministries 
and  overseas  missions  (contrary  to  the  traditional 
arguments  concerning  the  impact  of  enhanced  com¬ 
munications  on  diplomacy)  has  allowed  missions  to 
play  a  more  direct  role  in  the  policy  processes. 
Simultaneously,  the  development  of  the  electronic 
mass  media  creates  pressure  on  governments  to 


respond  to  events  almost  instantaneously  and,  at 
the  same  time,  provides  opportunities  to  project 
their  policies  to  domestic  and  foreign  audiences. 
Again,  this  has  implications  for  the  respective  roles 
of  foreign  ministries  and  diplomats  in  the  field. 
The  enhanced  speed  of  events  can  often  assume  as 
great  an  importance  as  the  events  themselves. 
Nowhere  is  this  more  evident  than  in  natural  and 
man-made  disasters.  The  growing  incidence  of  ter¬ 
rorist  attacks  and  events  such  as  the  tsunami  in 
2004  has  placed  renewed  emphasis  on  the  consular 
dimension  of  diplomacy.  The  rise  of  mass  tourism 
and  a  media  ready  to  judge  diplomats  by  the 
immediacy  of  their  response  to  such  crises  has 
established  a  new  benchmark  by  which  the  diplo¬ 
matic  profession  is  judged. 

Cumulatively,  these  developments  have  had  a 
significant  impact  on  the  role  of  the  professional 
diplomat.  Increasingly,  this  is  portrayed  in  terms  of 
a  “coordinator-manager”  and  “facilitator”  in  com¬ 
plex  processes  spanning  the  boundaries  between 
the  international  and  national  domains.  They  also 
pose  questions  concerning  the  traditional  norms  of 
behavior  associated  with  diplomacy.  To  take  one 
significant  example,  the  traditional  emphasis  on 
confidentiality  and  secrecy  is  challenged  by  the 
norms  demanded  by  the  need  to  work  with  a  range 
of  “nondiplomatic”  stakeholders  in  specific  policy 
milieus.  The  latter,  working  to  their  own  codes  and 
norms  of  behavior,  do  not  always  respect  the  tradi¬ 
tions  associated  with  the  conduct  of  diplomacy, 
and  establishing  mutual  understanding  and  coop¬ 
eration  is  one  of  the  major  challenges  of  contempo¬ 
rary  diplomacy. 

Adaptation  and  Change  in  Diplomacy 

Several  clear  themes  emerge  from  this  discussion  of 
the  emergence  and  development  of  diplomacy.  The 
first  is  that  diplomacy  has  a  history  and  a  logic 
that  transcends  the  system  of  states  with  which  it 
is  often  equated.  Just  as  diplomacy  preceded  the 
emergence  of  the  sovereign  state,  so  also  it  has 
adapted  to  the  latter’s  transformation  in  response 
to  forces  associated  with  globalization  and  region¬ 
alization.  The  second  theme  relates  to  the  first: 
Diplomacy  has  a  capacity  to  adapt  to  change.  It  is 
something  of  a  truism — and  a  recurrent  topic  of 
diplomatic  memoirs — that  diplomacy  is  not  what 
it  was.  Thus,  change  runs  as  a  leitmotif  through  its 
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evolution  as  the  processes  of  diplomacy  and  the 
structures  through  which  it  has  been  conducted 
have  responded  to  transforming  environments. 
This,  however,  has  been  a  gradual  process,  so 
much  so  that  the  frequently  used  distinction 
between  “old”  and  “new”  diplomacies  is  mislead¬ 
ing,  ignoring  as  it  does  the  inherent  adaptive 
capacity  of  the  processes  on  which  diplomacy  has 
relied  over  time.  Similarly,  debates  about  the 
“decline”  of  diplomacy  are  usually  founded  on  the 
association  of  these  processes  with  particular 
structures  or  forms — such  as  the  emergence  of 
resident  bilateral  diplomacy.  To  understand  the 
nature  and  significance  of  diplomacy  in  its  histori¬ 
cal  and  contemporary  manifestations  we  must 
recognize  that  there  is  no  single  mechanism  through 
which  its  objectives  can  be  served.  Diplomacy’s 
capacity  for  change  ensures  that  in  the  early  21st 
century,  where  complex  agendas  require  evermore 
inventive  modes  of  global  governance,  the  pro¬ 
cesses  associated  with  diplomacy  remain  a  major 
component  of  international  life. 
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Disarmament 


The  term  disarmament  is  generally  used  to  describe 
a  process  of  reducing  and  eliminating  certain 
weapons  systems,  but  it  may  also  depict  an  end 
state  when  a  specific  type  of  weapon  has  been 
abolished.  The  quest  for  a  definition  is  compli¬ 
cated  because  the  concept  of  disarmament  may  be 
employed  for  different  purposes  in  diplomacy,  in 
international  relations,  and  in  national  and  inter¬ 
national  security  debates.  Disarmament  can  apply 
to  any  armaments  system,  from  nuclear  weapons 
to  land  mines,  and  depending  on  political  perspec¬ 
tive  may  be  cast  as  negative  (inducing  vulnerabil¬ 
ity)  or  positive  (promoting  peace). 
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Though  disarmament  may  be  linked  with  arms 
control,  the  two  approaches  are  different.  Arms 
control  covers  partial  measures  and  mechanisms 
for  managing  and  restricting  the  development  and 
possession  of  certain  kinds  of  armaments,  whereas 
disarmament  is  the  process  or  accomplishment  of 
the  elimination  and  abolition  of  such  weapons  sys¬ 
tems.  Arms  control  may  be  an  end  in  itself,  leaving 
some  weapons  in  the  hands  of  some  countries,  or  it 
can  provide  a  stepping  stone  toward  disarmament, 
but  the  two  concepts  are  not  interchangeable  and 
should  not  be  confused.  Concepts  of  disarmament 
are  related  not  only  to  reducing  and  eliminating 
weapons  but  also  to  challenging  traditional,  mili¬ 
tary  concepts  of  security,  defense,  and  deterrence 
and  constructing  alternative  approaches  for  peace 
and  security.  This  entry  discusses  various 
approaches  to  disarmament. 

General  and  Complete  Disarmament 

When  used  in  relation  to  nuclear  weapons,  disar¬ 
mament  and  nonproliferation  are  frequently 
described  as  two  sides  of  the  same  coin,  implying 
that  one  is  not  sustainable  without  the  other.  The 
Treaty  on  the  Non-Proliferation  of  Nuclear 
Weapons  (NPT),  which  entered  into  force  in  1970 
with  the  purpose  of  preventing  the  spread  of 
nuclear  weapons  and  technologies,  contained  in  its 
Article  VI  an  obligation  to  pursue  nuclear  disarma¬ 
ment.  Article  VI  also  referred  to  “general  and 
complete  disarmament,”  a  phrase  drawn  from 
United  Nations  (UN)  resolutions  going  back  to 
1946.  No  timetable  was  set  for  achieving  these 
objectives,  and  there  are  many  competing  interpre¬ 
tations  of  what  would  actually  need  to  be  done  to 
implement  Article  VI. 

In  the  aftermath  of  World  War  II,  “general  and 
complete  disarmament”  reflected  the  aspiration  of 
resolving  all  conflict  through  diplomacy,  without 
recourse  to  force  and  arms,  but  its  meaning  has 
never  been  clarified.  Was  it  intended  to  mean  a 
world  without  all  weapons  or  a  world  without 
nationally  controlled  war-fighting  weapons?  It 
appears  that  the  founders  of  the  UN  wanted  this 
aspiration  to  guide  international  relations  toward 
more  peaceful  directions.  Certainly,  the  world  would 
have  to  change  fundamentally  if  total  disarmament, 
peace,  and  security  were  ever  to  be  possible.  The 
inclusion  of  “general  and  complete  disarmament” 


in  Article  VI  of  the  NPT  has  more  often  been  used 
by  some  of  the  nuclear  weapon  states  (defined  in 
the  NPT  to  include  China,  France,  Russia,  the 
United  Kingdom  [UK],  and  the  United  States)  to 
justify  not  eliminating  their  nuclear  weapons  until 
there  is  general  and  complete  disarmament.  So  that 
the  laudable  but  remote  goal  of  total  disarmament 
would  not  be  used  as  an  excuse  to  hold  on  to 
nuclear  weapons,  the  Sixth  Review  Conference  of 
the  NPT,  held  in  2000,  made  clear  in  its  consensus 
final  document  that  nuclear  disarmament  is  the 
most  urgent  priority  under  the  treaty  and  that 
achieving  nuclear  disarmament  is  not  contingent 
on  general  and  complete  disarmament,  which  is 
the  “ultimate  goal.” 

Disarmament  Initiatives, 
Conferences,  and  Agreements 

While  the  aspiration  of  general  and  complete  dis¬ 
armament  is  unlikely  ever  to  be  fully  achieved, 
much  can  be  done  to  reduce  and  eliminate  many 
weapons  systems.  From  government-led  action  to 
remove  all  guns  from  Japan  in  the  I5th  and  16th 
centuries  to  negotiations  on  banning  cluster  muni¬ 
tions  in  2008,  there  have  been  a  range  of  disarma¬ 
ment  initiatives,  conferences,  and  agreements.  The 
Hague  Peace  Conferences  that  took  place  from 
1899  effectively  outlawed  dum-dum  bullets.  After 
World  War  I,  the  Geneva  Conventions  were  devel¬ 
oped  to  regulate  the  conduct  and  customs  of  war¬ 
fare,  including  aspects  of  arms  developments  and 
trade.  The  1 925  Geneva  Protocol  for  the  Prohibition 
of  the  Use  in  War  of  Asphyxiating,  Poisonous,  or 
Other  Gases,  and  of  Bacteriological  Methods  of 
Warfare  (known  as  the  Geneva  Gas  Protocol)  was 
an  early  disarmament  effort  that  focused  on  ban¬ 
ning  the  use  of  inhumane  weapons  rather  than 
mandating  physical  disarmament  of  the  actual 
weapons.  It  took  a  further  47  years  to  negotiate 
the  1972  Biological  and  Toxin  Weapons 
Convention,  which  prohibited  the  development, 
manufacture,  and  stockpiling  of  bioweapons. 

During  the  Cold  War,  arms  control  was  consid¬ 
ered  to  be  an  important  part  of  the  strategic  rela¬ 
tionship  between  the  United  States  and  the  Soviet 
Union.  This  was  particularly  true  of  nuclear  weap¬ 
ons,  where  a  number  of  bilateral  agreements 
sought  to  cap  a  spiraling  arms  race  in  which  the 
dominant  powers  developed  more  than  70,000 
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nuclear  weapons.  One  of  these,  the  1987 
Intermediate-Range  Nuclear  Forces  Treaty, 
removed  and  eliminated  all  nuclear-armed  Cruise 
and  Pershing  Missiles  from  Europe.  This  treaty  was 
brought  about  by  changing  perspectives  and  leader¬ 
ship  in  the  Soviet  Union  and  the  United  States  in 
response  to  a  massive  upsurge  of  civil  society  pro¬ 
tests  and  campaigning  for  nuclear  disarmament 
and  human  rights  in  the  1980s.  Unusually,  it  met 
the  criteria  for  both  disarmament  and  arms  control. 

In  political  discourse,  disarmament  is  often 
qualified  as  “unilateral,”  denoting  one  country’s 
decisions  and  actions  to  renounce  a  particular  type 
of  weapon,  and  “multilateral,”  denoting  decisions 
and  processes  undertaken  by  many  states,  gener¬ 
ally  through  negotiations.  Though  unilateral  and 
multilateral  approaches  may  be  different,  they  are 
not  mutually  exclusive  and  can  be  mutually  rein¬ 
forcing.  Nation-states  are  members  of  the  various 
bodies  and  institutions  that  constitute  the  world’s 
disarmament  machinery.  The  Conference  on  Dis¬ 
armament  (CD)  was  formally  constituted  in 
Geneva  following  the  First  UN  Special  Session  on 
Disarmament  in  1978,  which  had  identified  the 
need  for  a  “single  multilateral  disarmament  nego¬ 
tiating  forum  of  limited  size  taking  decisions  on 
the  basis  of  consensus.”  The  CD  accomplished 
little  until  the  Cold  War  ended.  In  1993,  it  con¬ 
cluded  negotiations  on  the  Chemical  Weapons 
Convention,  which  prohibited  the  acquisition, 
manufacture,  and  development  of  chemical  weap¬ 
ons  and  created  the  multilateral  Organization  for 
the  Prohibition  of  Chemical  Weapons  to  oversee 
and  verify  the  complete  elimination  of  such  arma¬ 
ments.  As  its  membership  went  from  38  to  66  UN 
member  states,  the  CD  negotiated  the  Compre¬ 
hensive  Test  Ban  Treaty  (CTBT),  which  was  con¬ 
cluded  and  opened  for  signature  in  1996.  By  ban¬ 
ning  nuclear  tests,  the  CTBT  contributes  to  nuclear 
disarmament  as  well  as  nonproliferation. 

The  UN  General  Assembly’s  First  Committee 
covers  international  security  and  disarmament 
issues  and  meets  annually  to  consider  between  60 
and  70  resolutions  on  these  issues,  ranging  from 
nuclear  weapon-free  zones  to  prevention  of  an 
arms  race  in  outer  space.  There  is  also  a  UN 
Disarmament  Commission,  which  is  open  to  all 
UN  member  states,  but  this  meets  only  for  2  weeks 
every  year  and  does  not  have  decision-making 
powers.  The  UN  Office  for  Disarmament  Affairs  is 


headed  by  the  Secretary-General’s  High  Rep¬ 
resentative  for  Disarmament  Affairs  and  provides 
support  for  multilateral  treaties  and  UN-related 
institutions  dealing  with  weapons  of  mass  destruc¬ 
tion  and  conventional  arms,  including  small  arms 
and  light  weapons. 

At  the  end  of  the  Cold  War,  disarmament  was 
promoted  for  humanitarian  as  well  as  strategic  or 
conflict  reduction  purposes.  In  the  field  of  conven¬ 
tional  weapons,  civil  society — notably  disarmament 
campaigners,  war  veterans,  and  humanitarian  aid 
workers — built  momentum  around  the  demand  to 
ban  land  mines  on  grounds  that  the  weapons  con¬ 
tinued  to  maim  and  kill  noncombatants  long  after 
the  military  purpose  had  ceased.  They  promoted 
conditions  for  a  group  of  concerned  middle-power 
states  to  establish  a  forum  and  begin  negotiations  to 
ban  antipersonnel  land  mines.  The  1997  Mine  Ban 
Treaty  was  negotiated  outside  the  traditional  multi¬ 
lateral  machinery  because  efforts  to  get  negotia¬ 
tions  on  this  issue  in  the  CD  or  among  parties  to 
the  1981  Inhumane  Weapons  Convention  were 
blocked  by  states  with  a  vested  interest  in  retaining 
the  weapons.  A  decade  later,  another  coalition  of 
governments  and  civil  society  created  their  own 
series  of  negotiating  conferences  to  achieve  a  ban 
on  the  use  and  manufacture  of  cluster  munitions. 
Though  some  of  the  major  governments  chose  to 
boycott  the  negotiations,  they  were  not  able  to 
block  them.  The  resulting  disarmament  treaties 
have  proved  significant  in  stigmatizing  certain 
weapons  as  inhumane  and  creating  a  normative 
framework  within  which  even  nonsignatories  find 
that  their  deployment  and  use  of  such  weapons  is 
constrained.  Once  the  treaties  establish  the  disar¬ 
mament  norm,  they  change  states’  behavior.  Most 
of  the  states  that  boycotted  the  negotiations  have 
later  signed  and  ratified. 

Public  Involvement 

In  recent  history,  it  has  taken  active  public  engage¬ 
ment  and  civil  society  pressure  to  persuade  govern¬ 
ments  to  renounce  certain  kinds  of  weapons.  Though 
lip  service  is  paid  to  the  desirability  of  disarmament, 
it  does  not  occur  in  a  vacuum.  As  one  type  of 
weapon  is  banned  and  disarmed,  there  are  vested 
interests  keen  on  developing  new  weapons.  Arms 
trading  is  big  business,  netting  huge  profits  for  arms 
manufacturers,  defense  contractors,  governments, 
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and  black  marketers.  Governments  may  support 
nonproliferation  to  disarm  others  while  retaining 
high-value  weapons  for  themselves. 

Where  traditional  concepts  of  defense  and  arms 
control  are  associated  with  national  security  para¬ 
digms,  disarmament  has  come  to  be  associated 
with  the  evolving  human  security  paradigm  and 
linked  with  development,  human  rights,  and  a 
redistribution  of  resources  away  from  military 
methods  and  expenditure. 

Considered  as  an  end  state,  disarmament  may 
never  be  complete,  though  specific  weapons  sys¬ 
tems  have  been  comprehensively  prohibited  and 
eliminated.  Viewed  as  a  process,  disarmament  is 
taking  place  on  many  levels  all  around  the  world: 
from  the  women  who  help  implement  the 
Programme  of  Action  on  Small  Arms  and  Light 
Weapons  by  coordinating  local  initiatives  to  col¬ 
lecting  and  destroying  guns  to  the  diplomats  nego¬ 
tiating  the  next  nuclear  treaties  and  from  disarma¬ 
ment  activists  halting  nuclear  weapon  convoys  to 
the  international  inspectors  verifying  that  states 
are  complying  fully  with  their  treaty  obligations. 
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Discourse  Analysis 


The  aim  of  discourse  analysis  is  to  reveal  the  onto¬ 
logical  and  epistemological  premises  that  are 
embedded  in  language  and  that  allow  a  statement 
to  be  understood  as  rational  or  interpreted  as 
meaningful.  Discourse  analysis  investigates 
whether — in  statements  or  texts — it  is  possible  to 
establish  any  regularity  in  not  only  the  objects 
that  are  discussed,  the  subjects  designated  as 
actors,  and  the  causal  relations  claimed  to  exist 
between  objects  (explanans)  and  subjects 
(explanandum)  but  also  the  expected  outcome  of 
subjects  trying  to  influence  objects,  the  goal  of 
their  action,  and  finally  the  time  dimension  by 
which  these  relations  are  framed.  Discourses  thus 
comprise  the  underlying  conditions  for  a  state¬ 
ment  to  be  interpreted  as  meaningful  and  rational. 
At  the  same  time,  discourse  analysis  is  the  study  of 
rationality  and  how  it  is  expressed  in  a  particular 
historical  context.  Discourse  analysis  is  part  of  the 
constructivist  (or  social-constructivist)  approach 
within  the  humanities  and  social  sciences.  It 
assumes  that  basic  assumptions  with  regard  to 
being,  self,  and  the  world  are  constructed  by  indi¬ 
viduals  living  in  a  historical  and  cultural  context 
that  is  produced  and  reproduced  by  their  speech 
and  acts. 

There  is  no  mainstream  definition  of  discourse 
within  the  social  sciences.  Neither  is  there  any  gen¬ 
erally  accepted  understanding  of  what  discourse 
analysis  is  or  which  method(s)  its  practitioners 
should  use.  Consequently,  it  is  difficult  to  give  a 
precise  description  of  what  characterizes  discourse 


Discourse  Analysis  673 


analysis.  This  entry  reviews  several  forms  of  dis¬ 
course  analysis  and  their  application  to  politics. 
Three  approaches  are  distinguished,  all  of  which 
are  called  discourse  analysis,  but  they  alternate  in 
their  approaches  to  what  a  discourse  is  and  what 
the  aim  of  analyzing  discourses  is.  The  first 
approach  is  the  discourse-analytical,  the  second  is 
the  discourse-theoretical,  and  the  third  is  the  criti¬ 
cal  discourse  analysis. 

Varieties  of  Discourse  Analysis 

Analyses  of  discourse  have  been  carried  out  within 
a  variety  of  social  science  disciplines,  including 
linguistics,  anthropology,  sociology,  international 
relations,  communication  studies,  and  political  sci¬ 
ence.  Although  the  concept  of  political  discourse 
has  been  used  for  centuries  to  describe  political 
debate  or  for  deliberation  in  political  theory  and 
philosophy,  it  is  only  during  the  past  40  years  or 
so  that  there  has  been  a  theoretical  and  method¬ 
ological  interest  in  how  to  study  the  relationship 
between  language  and  political  action.  This  started 
in  the  1960s  in  Europe  as  part  of  a  philosophical 
renewal  of  the  humanities  (including  the  social  sci¬ 
ences),  later  to  be  known  as  structuralism  and 
poststructuralism  or  in  more  general  terms  as  the 
linguistic  turn.  In  the  1970s,  it  spread  to  the  United 
States  with  studies  of  how  political  concepts  and 
political  news  play  a  role  in  the  construction  of 
social  problems.  Today,  there  are  several 
approaches  on  how  to  understand  the  role  of  lan¬ 
guage  in  politics.  Among  these  are  conceptual  his¬ 
toriography  ( Begriffsgescbichte ),  the  history  of 
political  ideas,  and  the  theory  of  narration.  They 
all  differ  from  discourse  analysis  by  the  fact  that 
their  object  of  study  is  concepts,  narration,  and 
ideas  and  not  discourses.  The  most  important  dif¬ 
ference  among  discourse  analytical  approaches  is 
between  those  that  seek  to  understand  discourse  as 
a  contingent  form  of  knowledge  and  use  discourse 
analysis  to  see  how  knowledge  and  the  production 
of  knowledge  have  changed  over  time  and  those 
that  take  for  granted  that  “the  world”  is  a  product 
of  how  we  categorize  it  through  our  statements 
and  therefore  look  on  discourse  as  a  universal  type 
of  social  action  and  use  discourse  analysis  to  estab¬ 
lish  a  general  theory  of  discourse.  Although  dis¬ 
course  analytic  approaches  emphasize  the  connec¬ 
tion  between  discourse  and  power,  they  differ  in 


how  they  attach  the  concept  of  discourse  to  other 
concepts  such  as  knowledge,  ideology,  ideas,  and 
truth. 

Discourse  Analysis 

This  understanding  of  discourse  and  discourse 
analysis  is  closely  connected  to  Michel  Foucault 
and  his  publications  from  1963  to  1971.  In  this 
period,  Foucault  studied  the  history  of  language 
and  how  words  (or  language)  were  placed  in  rela¬ 
tion  to  things  (or  what  is  observable)  at  various 
periods  in  history.  Foucault  was  concerned  with 
the  fact  that  from  the  mid-1800s,  the  human  sci¬ 
ences  had  begun  to  analyze  language  and  to  argue 
that  all  human  actions  and  social  formations  are 
somehow  related  to  language  or  can  even  be 
understood  as  constructed  in  (or  by  the  use  of) 
language.  According  to  Foucault,  then,  discourse 
analysis  is  not  some  independent  theory  or  method 
but  a  way  in  which  the  human  sciences  perceive 
the  world.  Foucault  shows  how  language  was 
turned  into  an  empirical  object  for  scientific  stud¬ 
ies  and  views  discourse  analysis  as  a  historically 
specific  manner  in  which  the  human  sciences  relate 
to  reality.  Hence,  Foucault  considers  that  scientific 
interest  in  language  is  a  historical  event:  the  end  of 
the  modernity  period,  where  man  stood  at  the  cen¬ 
ter  of  scientific  interest,  and  the  start  of  a  new 
period,  where  language  became  the  central  object 
of  study.  It  is  in  this  context  that  Foucault  intro¬ 
duces  the  concepts  of  archives  and  archaeology  in 
an  attempt  to  portray  discourses  as  historically 
determined  forms  of  knowledge,  which,  together 
with  other  discourses,  enters  into  a  form  of  institu¬ 
tionalized  rationality  (an  archive).  The  archive,  in 
consequence,  is  a  historically  determined  knowl¬ 
edge  horizon,  a  framework  for  how  ideas  are  pro¬ 
duced  and  sustained  and  for  how  knowledge  is 
accepted  (as  being  trustworthy)  or  not.  Ideas  are 
created  in  discursive  events,  which  subsequently — 
by  historical  analysis — can  be  understood  to  have 
added  new  positions  to  the  archive  or  to  have  trans¬ 
formed  already  existing  positions  in  the  archive.  At 
the  same  time,  the  archaeology  is  the  knowledge  we 
possess  about  the  history  of  the  various  forms  of 
knowledge  and  of  the  limitations  and  possibilities 
that  exist  for  creating  knowledge  and  generating 
ideas.  These  limitations  and  possibilities  are  not 
exclusively  linguistic.  They  are  also  extradiscursive 
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and  institutional.  Discourses  are  supported  by 
institutions,  and  together  with  various  technolo¬ 
gies  (e.g.,  disciplining  or  sanctioning),  they  consti¬ 
tute  a  historically  determined  rationality. 

Discourse  Theory 

In  contrast  to  Foucault,  discourse  theory  aims 
at  developing  a  universal  theory  of  discourses. 
Discourse  theory  sees  all  social  phenomena  as  dis¬ 
cursive  constructions  and  assumes  that  all  social 
phenomena  can  be  studied  by  discourse  analysis.  It 
is  in  this  sense  that  discourse  theory  turns  social 
phenomena  into  language  and  language  into  an 
object  for  discourse  analysis.  Ernesto  Laclau  and 
Chantal  Mouffe  are  the  best  examples.  They  have 
advanced  discourse  theory  by  deconstructing  other 
theories.  First,  with  inspiration  from  structuralist 
linguistics,  they  emphasize  that  the  smallest  unit  in 
a  discourse  (langue)  is  the  sign  and  that  discourses 
include  a  system  of  signs  characterized  by  every 
sign  being  different  from  other  signs.  Second,  with 
inspiration  from  poststructuralism,  they  emphasize 
that  signs  are  infused  with  meaning  through  artic¬ 
ulation  (signifie),  while  the  content  of  signs  (sig¬ 
nificant)  is  always  contingent  and  never  fixed. 
Finally,  with  inspiration  from  Neo-Marxism,  they 
stress  that  the  articulation  is  embedded  in  a  politi¬ 
cal  process.  In  their  definition,  discourse  is  a 
system  of  signs  that  allocates  meaning  through 
articulation.  The  articulation,  on  the  one  hand,  is 
understood  as  a  conflict  between  persons  whose 
object  is  to  achieve  political  status  by  imposing  a 
particular  taken-for-granted  understanding  of  the 
world.  On  the  other  hand,  discourse  analysis  is 
used  to  map  or  trace  this  process  as  a  political 
process.  The  task  of  discourse  analysis  is  to  find 
the  nodal  points  that  give  other  signs  their  mean¬ 
ing  and  to  observe  the  process  through  which  the 
allocation  of  meaning  is  taking  place.  In  political 
theory,  for  example,  “democracy”  is  a  nodal  point 
around  which  conflicts  are  constantly  taking  place. 
In  contrast  to  Foucault’s  discourse  analysis,  the 
concept  of  ideology  (or  objectivity)  plays  an 
important  role  in  discourse  theory.  All  discourses 
are  ideological  because  they  appear  as  objectivity- 
that  which  is  taken  for  granted — and  thus  conceal 
alternative  realities.  Also,  in  contrast  to  Foucault, 
the  concept  of  knowledge  does  not  enter  into  the 
vocabulary  of  discourse  theory.  Where  Foucault 


can  study  how  knowledge  has  become  an  archive 
with  his  archaeological  (diachronic)  approach,  the 
discourse  theoretical  (synchronic)  approach  is  ana¬ 
lyzing  how  meaning  is  created  by  politics.  And 
finally,  where  Foucault  sees  institutions  as  support¬ 
ing  knowledge  and  therefore  capable  of  having  an 
independent  (nondiscursive)  status,  in  discourse 
theory  institutions  are  understood  as  discursive 
constructions  without  any  extradiscursive  status. 

Critical  Discourse  Analysis 

Critical  discourse  analysis  emphasizes  the 
necessity  of  establishing  methods  for  empirical 
investigation  of  relations  between  discursive  and 
nondiscursive  practices.  In  this  sense,  it  distin¬ 
guishes  itself  mainly  from  discourse  theory.  The 
work  of  Norman  Fairclough  is  central  here. 
According  to  Fairclough,  discourse  is  not  only  a 
communicative  act  but  also  a  social  practice. 
Discourses  not  only  constitute  social  phenomena 
but  are  also  constituted  by  social  phenomena  in 
the  form  of  social  (or  political)  practice.  Any  use 
of  language  (a  communicative  action)  therefore 
consists  of  a  discursive  practice  where  discourses 
are  produced  or  consumed  and  a  social  practice  or 
an  institutional  context  of  which  a  communicative 
action  is  a  part.  The  communicative  action  can 
draw  on  (consume)  or  create  (produce)  discourses, 
but  it  will  always  be  part  of  an  order  of  discourse, 
where  several  discourses  are  articulated  simultane¬ 
ously.  The  communicative  act  is  linked  to  social 
practice  through  the  use  of  genres  or  conventional 
text  types;  the  news  media  can,  for  example,  draw 
on  interviews  as  a  genre,  while  the  family  can  use 
the  dinner  table  conversation  in  the  same  manner. 
Fairclough  combines  linguistic  textual  analysis 
with  macro-  and  microsociological  analysis  of 
texts  and  conversations,  using  a  comprehensive 
research  design  that  recognizes  five  components 
(problem  formulation,  choice  of  empirical  meth¬ 
ods,  transcription,  analysis,  and  results),  each 
extended  by  specific  methods  and  checklists. 

In  contrast  to  the  theory  of  discourse,  the  critical 
discourse  analysis  distinguishes  between  discourse 
and  institutions  as  two  different  types  of  social 
phenomena.  It  studies  how  discourse  and  institu¬ 
tion  interact  in  the  constitution  of  a  social  world 
and  how  discursive  practices  are  institutionalized 
or  are  moved  from  being  linguistic  utterances  to  set 
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conditions  for  stable  social  relations.  While  critical 
discourse  analysis  attempts  to  uncover  the  ideolo¬ 
gies  that  contribute  to  the  production  and  repro¬ 
duction  of  power,  it  also  has  a  political  aim:  It 
looks  not  only  for  how  discourses  limit  our  under¬ 
standing  of  the  world  (i.e.,  function  as  an  ideology) 
but  also  for  how  they  contain  several  competing 
discourses  and  therefore  the  possibility  of  domi¬ 
nant  ideologies  being  contested.  The  neo-Marxist 
concept  of  hegemony  is  used  in  this  context. 
Ideology  is  understood  to  be  embedded  in  discur¬ 
sive  practice  and  discourses  to  be  more  or  less 
ideological,  where  the  ideological  discourses  are 
those  that  contribute  to  maintaining  (or  establish¬ 
ing)  a  power  relation. 

Discourse  Analytical  Approaches  to  Politics 

Discourse  analysis  has  developed  rather  rapidly 
from  emphasizing  the  ontological  (constructivist  or 
antirealistic)  approach  to  building  up  methods  and 
tools  for  studying  language  and  texts.  Most 
recently,  the  question  of  method  has  come  to 
occupy  a  central  position,  as  has  the  requirement 
of  being  able  to  evaluate  the  validity  of  findings 
produced  by  discourse  analysis.  Still,  discourse 
analysis  remains  an  alternative — critical — approach 
to  mainstream  political  science  and  other  social 
sciences.  It  assumes  that  political  science,  like  other 
social  science  disciplines,  is  a  form  of  knowledge 
that  constitutes  a  historically  constructed  under¬ 
standing  of  what  can  be  studied  as  politics  (an 
ontology)  and  a  historically  constructed  interpreta¬ 
tion  of  what  are  assumed  to  be  true  statements 
about  politics  (an  epistemology).  It  reflects  on 
political  science  as  a  discipline  and  on  the  role  of 
the  discipline  in  constructing  a  certain  form  of 
political  order  (or  polity).  It  finally  applies  con¬ 
cepts  such  as  polity,  politics,  and  policy  in  ways 
different  from  mainstream  political  science.  In  con¬ 
trast  to  the  realism  that  characterizes  mainstream 
political  science,  discourse  analysis  is  built  on  an 
antirealistic,  or  constructivist,  approach,  perceiv¬ 
ing  any  form  of  political  order  to  be  embedded  in 
language  (and  institutions)  and  articulated  by  the 
use  of  speech  acts.  Thus,  for  discourse  analysis,  the 
conflicts  through  which  what  politics  is  (and  what 
is  nonpolitical)  is  defined,  therefore,  are  an  impor¬ 
tant  dimension  to  every  type  of  polity  and  thus  also 
an  important  object  for  political  science  to  study. 


The  most  important  contribution  of  discourse 
analysis  to  political  science  lies  in  the  theories  of 
power.  From  Thomas  Hobbes  to  Robert  A.  Dahl, 
power  has  been  seen  as  the  ability  to  affect,  to 
limit,  or  to  control  the  behavior  of  people  or  as  the 
ability  A  has  to  compel  B  to  do  something  that  B 
would  not  otherwise  have  done.  In  contrast  to  this 
view,  discourse  analysis  understands  power  as 
entrenched  in  the  taken-for-granted.  Foucault 
speaks  about  the  productivity  of  power  and  per¬ 
ceives  power  as  one  of  the  technologies  by  which 
individual  and  collective  identities  are  constructed 
and  in  which  the  understanding  of  self  and  of  com- 
munality  comes  to  be  taken-for-granted  or  to  be 
understood  as  meaningful  and  rational.  Where 
mainstream  power  theory  (e.g.,  Dahl)  takes  for 
granted  that  A  and  B  exist,  each  with  its  own  set 
of  preferences,  Foucault  sees  the  constitution  of 
subjectivity  and  of  the  relation  between  A  and  B  as 
the  productivity  of  power.  His  analysis — together 
with  the  two  other  types  of  discourse  analysis — 
looks  for  the  “hidden  power”  embedded  in  both 
individual  and  collective  identities  and  for  the 
underlying  conditions  for  interests,  expectations, 
and  interpretations  to  be  understood  as  rational. 
Apart  from  regarding  power  as  ontological,  the 
three  approaches  therefore  agree  that  it  is  also 
epistemological.  Power  is  not  only  the  ability  to 
affect  the  behavior  of  others.  It  is  also  the  produc¬ 
tive  force  by  which  A  and  B  are  constructed,  each 
with  its  set  of  interests  and  each  with  its  set  of 
expectations  and  interpretations. 

By  combining  the  critical  with  the  analytical 
approach,  discourse  analysis  is  characterized  as 
standing  at  the  crossroads  of  several  approaches  in 
the  study  of  politics.  It  opens  avenues  that  bring 
political  science  into  close  relationship  not  only 
with  history  and  institutional  theory  but  also  with 
linguistic  and  narrative  theory.  Underlying  the 
three  discourse-analytical  approaches  are  two 
approaches  to  time  and  space.  The  first  is  the  dia¬ 
chronic,  where  the  question  is  raised  about  how 
epochs  come  into  being  and  what,  over  their  long 
history  (the  longue  duree),  makes  the  various  types 
of  political  orders  different  from  each  other.  The 
diachronic  understanding  generates  a  history  of 
polities,  their  periods,  and  geography,  which  also 
allows  discourse  analysis  to  sit  at  the  crossroads  of 
history  as  a  scientific  discipline  (archaeology)  and 
political  science  as  the  history  of  political  orders. 
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Foucault  is  an  example  in  point.  He  describes  the 
history  of  bio-politics,  and  by  doing  so  also  estab¬ 
lishes  the  historiography  of  a  specific  type  of 
political  technique  called  governance.  The  second 
is  the  synchronic  understanding  of  time,  where  the 
question  is  raised  about  how  political  orders  are 
constructed  and  changed  by  the  means  of  politics. 
By  studying  how  politics  constructs  political 
orders,  the  short-term  and  long-term  perspectives 
are  connected,  and  the  history  of  political  orders  is 
combined  in  the  study  of  how  polities  are  con¬ 
structed  through  changes  in  understandings  of 
what  politics  is.  The  combination  of  the  two 
makes  it  possible  to  study  the  history  of  political 
orders  as  a  precondition  for  the  study  of  political 
change.  Fairclough  is  an  example  in  point.  He 
describes  how  an  ideology  becomes  dominant  and 
how  social  conflicts  are  taking  place  within  the 
context  of  dominant  ideologies.  He  also  studies 
how  discursive  practices  are  institutionalized  and 
how  they  proceed  within  a  context  of  institutions. 
The  synchronic  process  tracking  thus  generates 
insight  into  the  political  processes  as  conflicts  over 
dominance,  which  positions  the  discourse  analysis 
at  the  crossroads  of  institutional  theory  (and  theo¬ 
ries  of  institutional  change)  and  political  science  as 
the  theory  of  politics  (or  political  change). 

Within  the  past  few  years,  institutional  theory 
has  become  aware  of  the  role  of  ideas  in  explaining 
institutional  change.  Attempts  are  being  made  to 
understand  how  ideas  are  causally  powerful  in 
explaining  the  form  and  content  of  institutional 
change.  The  ideational  account  of  institutional 
change  is  thus  moving  institutional  theory  in  the 
direction  of  discourse  analysis.  This  applies  to 
studies  of  economic  ideas  and  their  role  in  defining 
economic  crises,  for  example,  in  the  1970s  and 
1980s.  It  applies  to  studies  of  the  role  of  socioeco¬ 
nomic  understanding  in  establishing  preconditions 
for  institutionalized  processes  of  compromising  and 
consensus  making  in  the  postwar  period  in 
the  Nordic  and  other  small  Western  European 
countries.  It  also  applies  to  policy  studies  of  how 
environment,  energy,  and  social  and  labor  market 
policies  are  established  and  change  within  epistemic 
communities.  Finally,  it  also  applies  to  studies  of 
European  integration  and  international  relations.  In 
this  way,  both  approaches  have  focused  on  the  role 
of  speech  acts  and  texts  in  analyzing  the  articulation 
of  politics;  and  while  different  concepts  are  put  to 


use  (ideas  or  discourse),  both  approaches  take  it 
for  granted  that  language  matters,  just  as  it  is 
accepted  that  politics  takes  place  through  the  con¬ 
struction  of  interpretative  frameworks.  This  finally 
places  discourse  analysis  at  the  crossroads  of  theo¬ 
ries  of  narration  and  political  science  as  a  theory  of 
political  communication.  Within  mainstream 
political  science,  there  is  a  growing  interest  in  the 
role  of  the  genres  (drama,  history,  and  the  epics)  in 
producing  political  news  or  in  how  polities  are 
framed  and  named.  The  use  of  narratives  in  politi¬ 
cal  argumentation  is  explored  in  understanding 
news  as  a  political  institution  and  public  policy  as 
agendas  for  political  discourse.  Within  political 
science,  it  is  also  becoming  common  to  study  the 
organizations  in  which  ideas  are  produced  (or  con¬ 
sumed)  and  the  epistemic  communities  through 
which  ideas  are  debated  and  disseminated.  The 
role  of  institutional  entrepreneurs  in  universities, 
think  tanks,  expert  systems,  and  the  media,  using 
narratives  to  frame  and  name  political  problems,  is 
becoming  an  important  topic. 

Ove  K.  Pedersen 
Copenhagen  Business  School 
Frederiksberg,  Denmark 

See  also  Constructivism;  Epistemological  and 

Methodological  Foundations;  Institutional  Theory; 
Institutions  and  Institutionalism;  Policy,  Discourse 
Models;  Political  Communication;  Power 

Further  Readings 

Campbell,  J.  L.,  &  Pedersen,  O.  K.  (in  press).  Knowledge 
regimes  and  comparative  political  economy.  In 
D.  Beland  &  R.  H.  Cox  (Eds.),  Ideas  and  politics  in 
social  science  research.  Oxford,  UK:  Oxford 
University  Press. 

Edelman,  M.  (1988).  Constructing  the  political  spectacle. 

Chicago:  University  of  Chicago  Press. 

Fairclough,  N.  (2003).  Analysing  discourse:  Textual 
analysis  for  social  research.  London:  Routledge. 
Foucault,  M.  (1966).  Les  Mots  et  les  choses:  Une 
archeologie  des  sciences  humaines  [The  order  of 
things:  An  archaeology  of  the  human  sciences].  Paris: 
Gallimard. 

Foucault,  M.  (1969).  L’ archeologie  du  savoir  [The 
archaeology  of  knowledge].  Paris:  Gallimard. 

Hay,  C.  (2002).  Political  analysis:  A  critical  introduction. 
Basingstoke,  UK:  Palgrave. 


Discretion  677 


Kogler,  H.  H.  (1999).  The  power  of  dialogue:  Critical 
hermeneutics  after  Gadamer  and  Foucault. 

Cambridge:  MIT  Press. 

Laclau,  E.,  &  Mouffe,  C.  (1985).  Hegemony  and  socialist 
strategy:  Towards  a  radical  democratic  politics. 
London:  Verso. 

Pedersen,  O.  K.  (1991).  Nine  questions  to  a  neo- 

institutional  theory  in  political  science.  Scandinavian 
Political  Studies,  14(2),  125-148. 

Wetherell,  M.,  Taylor,  S.,  &  Yates,  S.  J.  (2001). 

Discourse  as  data:  A  guide  for  analysis.  London:  Sage. 
Wetherell,  M.,  Taylor,  S.,  &  Yates,  S.  J.  (2001). 

Discourse  theory  and  practice:  A  reader.  London: 

Sage. 


Discretion 


Discretion  is  an  issue  of  interest  not  only  to  polit¬ 
ical  scientists  but  also  to  scholars  of  public  admin¬ 
istration,  law,  and  organizations.  In  everyday 
language,  using  one’s  discretion  refers  simply  to 
the  employment  of  good  judgment,  caution,  or 
care  in  decision  making  and,  hence,  is  normally 
juxtaposed  with  acting  according  to  formal  stan¬ 
dards  or  rules.  Political  scientists  are  interested  in 
the  exercise  of  discretion  delegated  to  politicians 
by  their  electorate,  but  it  is  normally  within  the 
administrative  realm  that  the  issue  of  discretion 
(and  the  related  matter  of  decision-making  legiti¬ 
macy)  provokes  most  debate  and  discussion.  This 
is  because  considerable  discretionary  authority 
tends  to  be  granted  to  bureaucracies  in  order  to 
make  and  implement  policy,  as  legislatures  are 
content  to  delegate  the  implementation  of  agreed 
policy  to  those  at  the  “street  level.”  For  students 
of  public  administration,  therefore,  discretion  can 
be  defined  as  the  autonomy  extended  to  bureau¬ 
crats  to  implement  or  disregard  a  law  in  accor¬ 
dance  with  the  principles  established  in  a  political 
arena  for  that  law,  based  on  values  appropriate  to 
the  public  interest.  Those  who  exercise  bureau¬ 
cratic  discretion  range  from  effective  standard 
setters  (including  regulators)  who  exercise  consid¬ 
erable  discretionary  authority  to  those  enjoying 
more  limited  authority  when  applying  established 
standards  to  individual  situations. 

Historically,  as  a  core  concept  in  law  and  lat¬ 
terly  in  public  administration,  discretion  has  been 


examined  by  way  of  consideration  of  the  different 
external  mechanisms  used  for  its  limitation  and 
control,  including  legislative  oversight,  executive 
command,  and  judicial  review.  As  the  scope  of 
state  power  has  expanded,  it  is  the  role  of  the  judi¬ 
cial  system  in  limiting  discretionary  behavior  that, 
however,  has  generated  an  almost  universal  debate. 
Indeed,  much  of  contemporary  writing  on  the  issue 
of  discretion  stems  from  legal  theory  with  a  strong 
focus  on  the  role  of  law  in  policy  implementation. 
For  lawyers  and  legal  philosophers,  there  is  consid¬ 
erable  discussion  as  to  how  and  why  the  state’s 
administrators,  from  police  “on  the  beat”  to 
judges  in  courtrooms,  determine  the  public  interest 
and  exercise  their  discretion.  This  literature  is  con¬ 
cerned  with  the  relationship  between  discretion 
and  various  conceptions  of  justice.  Key  questions 
include  how  much  discretion,  if  at  all,  should  be 
granted,  how  will  it  be  granted,  and,  more  funda¬ 
mentally,  who  will  decide  on  the  public  interest? 

Arguments  for  and  Against 
the  Use  of  Discretion 

The  arguments  for  and  against  discretionary 
behavior  in  decision  making  are  well  developed. 
For  advocates  of  discretion,  its  principal  virtue  is 
that  it  provides  decision  makers  (politicians, 
judges,  regulators,  and  civil  servants)  the  ability  to 
fill  the  gaps  emerging  between  established  rules  of 
law.  In  other  words,  for  a  policy  to  be  matched 
with  the  appropriate  type  of  implementing  mecha¬ 
nism,  forms  of  discretion  are  required  so  that  the 
spirit,  if  not  the  letter,  of  the  law  is  adhered  to.  For 
this  school  of  thought,  discretion  is  an  essential 
tool  to  help  decision  makers  resolve  problems  in 
ways  that  can  accommodate  the  parties  involved 
and  also  soften  the  often  harsh  and  indiscriminate 
impact  of  rules.  They  argue  that  rules  limiting  dis¬ 
cretion  may  formally  legitimate  decisions,  but  such 
decisions  may  not  be  fair  or  may  conflict  with  the 
public  interest.  Therefore,  and  in  line  with  the 
theory  of  representative  bureaucracy,  administra¬ 
tors  must  have  discretion  as  it  allows  them  to  pro¬ 
duce  results  that  reflect  the  public  interest — that  is, 
the  values  and  beliefs  held  by  the  public  they  serve. 
It  follows  that  officials  should  be  provided  with 
“formal”  discretionary  powers  when  options  for 
policy  implementation  are  written  into  a  law. 
Thus,  discretion  forms  an  integral  part  of  the  legal 
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process,  which,  among  other  things,  allows  for 
contexts  and  circumstances  to  be  taken  into 
account  when  assessing  breaches  of  the  law. 

The  arguments  against  discretion  are  also  well 
rehearsed,  particularly  within  legal  philosophy 
where  the  rule  of  law  constitutes  a  buffer  between 
the  citizen  and  the  arbitrary  exercise  of  power.  In 
contrast  with  those  who  advocate  the  provision  of 
discretion,  advocates  of  rules  and  rule  adherence 
(most  prominently  Kenneth  Culp  Davis)  point  to 
the  fact  that  discretion  facilitates  an  uneven  or 
subjective  implementation  of  the  law  by  providing 
decision  makers  the  ability  to  employ  illegitimate 
considerations  in  the  formulation  of  a  decision.  In 
such  circumstances,  decisions  can  be  made  using 
criteria  that  are  in  conflict  with  those  established 
and  envisaged  by  the  rules,  for  example,  nepotism 
or  gender  discrimination.  Therefore,  the  enforce¬ 
ment  of  rules  ensures  that  like  cases  are  treated 
alike  and  avoids  bringing  a  “private”  dimension 
into  decision  making.  Discretionary  powers  vary 
considerably  and  may  in  certain  circumstances 
extend  to  rule  breaking,  arbitrary  behavior,  and 
injustices.  It  follows  that  the  scope  for  inappropri¬ 
ate  discretion  (as  opposed  to  necessary  discretion 
as  in  the  case  of  regulators)  should  be  narrowed  as 
much  as  possible  by  legislation. 

The  Nature  of  Discretion 

Discretion  is  routinely  contrasted  with  the  rule  of 
law — that  is,  those  obligations  and  duties  that  con¬ 
strain  and  determine  the  choice  of  action.  There  is 
much  debate  within  public  administration  and 
legal  philosophy  as  to  whether  discretionary 
behavior  is  rule-free  or  is  in  fact  embedded  within 
and  influenced  by  rule  frameworks  (the  famous 
“doughnut”  conceptualization  of  discretion  pro¬ 
posed  by  Ronald  Dworkin  in  which  discretion 
occupies  the  hole  in  the  doughnut’s  center).  Many 
scholars  seek  compromise  and  argue  that  dichoto¬ 
mies  between  rules  and  discretion  fail  to  recognize 
the  complex  interdependence  between  the  two 
concepts,  particularly  as  a  greater  number  of  del¬ 
egated  bodies  and  actors  with  varieties  of  rule 
making  and  hence  discretionary  powers  now  exist 
in  modern  polities. 

Within  legal  texts,  there  are  two  broad  catego¬ 
ries  of  discretion.  The  first  is  the  discretion  to 
make  rules ,  which  is  normally  the  preserve  of 


political  institutions  such  as  legislatures  or  local 
authorities  (which  are  subject  to  electoral  sanction) 
or  indeed  of  other  fora  to  which  such  functions  are 
delegated.  The  second  concerns  the  discretion  to 
find  facts  and  interpret  them  as  the  law — that  is, 
the  discretion  to  decide  cases.  This  power  is 
retained  by  courts  and  administrative  agencies.  Of 
course,  the  two  are  connected,  and  the  central  issue 
in  the  analysis  of  discretion  is  the  extent  to  which 
legislatures,  in  the  process  of  making  law,  con¬ 
strain  or  provide  discretion  to  courts  and  agencies 
to  decide  on  cases.  At  its  core,  administrative  law 
centers  on  the  creation  of  rules  that  allow  govern¬ 
ment  and  its  agencies  the  freedom  to  achieve 
agreed  goals  while  discouraging  them  from  abus¬ 
ing  this  freedom. 

Courts  and  judges  tend  to  enjoy  considerable 
discretionary  powers  in  the  enforcement  and 
administration  of  the  law.  In  this  instance,  discre¬ 
tion  is  most  tangible  when  a  judge,  having  con¬ 
sulted  all  relevant  legal  material,  is  left  free  by  law 
to  decide  on  a  course  of  action.  The  type  of  public 
law  used  is  also  of  relevance  here,  as  it  is  generally 
thought  that  within  common  law  legal  systems, 
judges  are  afforded  more  discretion  than  their  civil 
law  counterparts. 

Applications  of  Discretion 

For  scholars  of  political  science  and  public  admin¬ 
istration,  since  the  mid-1930s,  research  into  admin¬ 
istrative  discretion  has  been  primarily  concerned 
with  analysis  of  how  the  use  of  discretion  by  street- 
level  bureaucrats  leads  to  varieties  of  outcomes  in 
public  service  delivery.  Within  the  study  of  public 
management,  a  well-developed  strand  of  research 
concerns  the  tensions  inherent  in  the  search  for 
norms  of  value-free  administration  and  the  exercise 
of  discretion  in  increasingly  complex  policy  envi¬ 
ronments.  More  recently,  the  debate  on  discretion 
has  become  a  core  concern  of  regulation  and  regu¬ 
latory  governance,  best  captured  by  the  literature 
on  the  “regulatory  state”  and  its  accountability.  In 
particular,  the  work  of  independent  regulatory 
agencies,  which  can  be  quasi  judicial  in  character, 
raises  important  questions  concerning  the  proper 
balance  between  formula-driven  and  discretionary 
approaches  to  regulation.  Much  recent  scholarship 
has  demonstrated  a  wide  variety  in  the  interpreta¬ 
tion  by  regulatory  agencies  of  their  discretionary 


Discretion  679 


role  and  the  principles  on  which  their  decisions  are 
based. 

The  understanding  and  use  of  discretion  depends 
on  its  context,  and  many  tasks  will  inevitably 
remain  too  complicated  to  legislate  for  in  a  detailed 
manner,  thus  requiring  the  individual  bureaucrat 
to  respond  on  a  case-by-case  basis.  It  may  also 
occur  that  bureaucrats  are  subject  to  contradictory 
rules  and  therefore  have  discretion  to  decide  on 
which  rules  to  implement.  Discretion  also  tends  to 
be  of  more  significance  in  contexts  where  resources 
are  in  short  supply  but  demands  are  unlimited, 
providing  opportunities  for  discrimination  as  well 
as  rationing.  By  way  of  contrast,  contracts  between 
public  authorities  and  private  companies  over  ser¬ 
vice  delivery  (such  as  waste  disposal  or  road  build¬ 
ing)  seek  to  ensure  that  discretion  rests  solely  with 
the  principal  rather  than  with  the  agent.  Discretion 
sharing  also  occurs  in  collaborative  forms  of  orga¬ 
nizations,  such  as  networks. 

As  street-level  bureaucrats  often  provide  the  pub¬ 
lic  face  of  government,  their  activities  have  a  sym¬ 
bolic  import  that  stretches  beyond  the  service  in 
question.  Their  decisions  can  reinforce  or  undermine 
political  objectives.  For  political  and  administrative 
managers,  therefore,  problems  of  asymmetrical 
information  mean  that  controlling  the  discretionary 
activities  of  street-level  bureaucrats  through  nor¬ 
mal  superior-subordinate  routines  is  problem¬ 
atic.  This  does  not  stop  street-level  bureaucrats 
from  being  subjected  to  ever-increasing  amounts 
of  rules.  Therefore,  apart  from  the  external 
means  of  restricting  administration  discretion 
(above),  internal  means  such  as  administrative 
ethics,  peer  review,  and  an  emphasis  on  profes¬ 
sional  norms  are  emphasized.  The  development 
of  common  public  service  values  through  induc¬ 
tion  and  training  is  also  routinely  identified  as  a 
means  of  ensuring  that  discretionary  behavior 
operates  within  commonly  shared  norms  and 
understandings. 

Organizational  theory  holds  that  wherever  work 
is  delegated,  there  is  some  loss  of  control  by  the 
delegating  person  or  institution.  Also,  the  greater 
the  degree  of  task  complexity,  the  greater  the  need 
for  discretion  by  those  involved.  Thus,  discretion  is 
closely  connected  to  levels  of  trust  within  organiza¬ 
tions,  with  managers  seeking  a  virtuous  circle  of 
high  trust  and  high  discretion  in  performance.  One 
strand  of  research  on  organizations  is  increasingly 


interested  in  the  evolution  of  rules  and  discretion 
within  hierarchical  organizations.  They  examine 
how  organizations  may  shape  discretion  by 
imposing  constraints  on  the  way  in  which  mem¬ 
bers  use  it.  The  formal  and  informal  means 
through  which  discretionary  behavior  is  toler¬ 
ated  or  sanctioned  is  also  an  emerging  field  of 
inquiry. 

Within  organizational  theory,  it  is  also  proposed 
that  culture  plays  a  role  in  the  use  of  discretion, 
and  work  from  this  field  of  inquiry  has  identified 
how  increased  bureaucratic  formalization  tends  to 
diminish  discretion.  Organizational  sociologists  are 
also  keenly  interested  in  the  concept,  proposing 
that  a  greater  understanding  of  the  determinants  of 
discretionary  behavior  requires  analysis  of  the 
social  norms  prevalent  within  an  organization’s 
operating  environment.  The  dilemmas  inherent  in 
the  exercise  of  discretion  thus  help  explain  the  ori¬ 
gins  of  trust,  behavioral  norms,  and  boundaries  of 
autonomy. 
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Discrimination 


Originating  from  the  Latin  word  discriminare  (dis¬ 
tinguish,  separate),  in  contemporary  societies,  the 
term  discrimination  is  applied  to  related  but  varying 
phenomena  about  the  content  of  which  there  is  no 
general  agreement.  The  least  controversial  meaning 
is  the  intended  or  accomplished  differential  treat¬ 
ment  of  persons  or  social  groups  for  reasons  of 
certain  generalized  traits.  The  targets  are  often 
minorities,  but  it  may  also  be  majorities,  as  Black 
people  were  under  apartheid  in  South  Africa.  For 
the  most  part,  discrimination  implicitly  entails  the 
idea  of  acts  resulting  in  a  disadvantage  for  the  target 
of  such  action.  More  precise  terms  are  intentional 
or  willful  discrimination.  An  ever-growing  number 
of  terms  has  been  coined  to  label  phenomena  of  this 
kind  related  to  groups,  traits,  or  properties,  such  as 
race  (racism),  religion  (e.g.,  anti-Semitism, 
Islamophobia),  sex  (sexism),  sexual  orientation 
(e.g.,  homophobia),  class  (classism),  age  (ageism), 
disability  (ableism),  a  person’s  stature  (heightism, 
discrimination  against  short  people),  physical 
appearance  (lookism),  overweight  (weightism),  and 
many  more.  Although  there  are  many  other  forms  of 
discrimination,  not  all  of  them  have  been  identified 
as  specific  targets  of  social  action.  In  the  following 
sections,  the  different  elements  of  such  a  definition, 
their  controversial  treatment  in  the  social  sciences, 
and  their  political  implications  are  discussed. 

Controversies  in  Defining  Discrimination 

While  the  definition  of  discrimination  as  differen¬ 
tial  treatment  on  the  basis  of  generalized  traits  is 
relatively  straightforward,  a  number  of  additional 
elements  that  might  be  added  to  the  definition  are 
controversial. 

Must  Behavior  Be  Unjustified? 

As  Michael  Banton  (1994)  notes,  “One  analyti¬ 
cal  tradition  excludes  considerations  of  immorality 
and  illegality  altogether  for  the  sake  of  conceptual 
clarity  and  to  avoid  contradicting  opinions  about 
their  justification”  (p.  1).  But  politicians  argue  that 
the  definition  should  serve  the  purpose  of  identify¬ 
ing  a  social  problem  and  thus  restrict  the  term  to 
unjustified  behavior. 


Are  There  Legitimate  Exceptions? 

Should  references  to  legitimate  exceptions  be 
included?  Many  scientific  and  political  definitions 
of  discrimination  contain  conditions  under  which 
differential  treatment  is  deemed  undue  versus 
those  under  which  it  is  deemed  admissible. 
Discrimination  here  relates  only  to  unequal  treat¬ 
ment  that  is  based  on  criteria  lacking  objective  or 
rational  justification.  Aaron  Antonovsky  (1960), 
for  example,  speaks  of  “effective  injurious  treat¬ 
ment  of  persons  on  grounds  rationally  irrelevant  to 
the  situation”  (p.  81).  The  weakness  of  this  defini¬ 
tion  lies  in  its  assumption  that  rationality  in  the 
Weberian  sense  can  define  uniform  collective  goals 
that  should  override  the  legitimacy  of  egotistic, 
individual  claims  that  run  counter  to  them. 
Conflicts  over  protectionist  discrimination  against 
foreign  companies  illustrate  that  competitors,  con¬ 
sumers,  state  economic  planners,  environmental¬ 
ists,  and  Third  World  and  workers’  rights  activists 
may  have  conflicting  particular  interests  preclud¬ 
ing  a  single  universal  rationality.  What  can  be  said 
is  that  discrimination  is  seen  to  exist  when  reasons 
for  actions  are  perceived  as  illegitimate  by  relevant 
groups.  But  concepts  of  legitimacy  vary  between 
and  within  societies  and  change  over  time.  Barring 
women  from  the  ballot  was  long  considered  to  be 
justified,  and  even  in  Western  democracies,  full 
formal  economic  rights  for  women  were  estab¬ 
lished  only  in  the  course  of  the  20th  century.  In 
feudal  societies,  it  was  generally  accepted  that 
access  to  positions  of  leadership  was  made  difficult 
for,  or  denied  to,  persons  of  lowly  origin,  regard¬ 
less  of  their  ability  or  merit.  Only  in  the  modern 
age  did  the  bourgeois  classes  question  the  nobili¬ 
ty’s  inherited  privileges.  Since  modern  societies 
consider  themselves  universalistic  in  the  sense  that 
only  merit  shall  determine  social  positions,  a  con¬ 
vergence  of  public  opinion  has  occurred  to  exclude 
ascriptive  (hereditary,  unchosen,  and  unchange¬ 
able)  criteria  in  favor  of  achievement  only. 

Human  Rights  or  Positive  Law? 

Does  any  contravention  of  the  universal  princi¬ 
ple  of  equal  rights  constitute  discrimination  regard¬ 
less  of  the  traits  involved  or  does  the  definition 
relate  to  an  inventory  of  specific  qualities? 
Brandishing  discrimination,  activists  often  invoke 
natural  law  or  general  fundamental  human  rights. 
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Wary  of  potential  liabilities  resulting  from  open 
definitions,  legislators  prefer  to  enact  positive  law 
that  defines  specific  new  rights  rather  than  relying 
on  concepts  of  natural  law. 

Are  There  Exemptions  for  Private  Actors ? 

Which  classes  of  actors  fall  under  the  scope  of 
the  concept?  While  there  is  broad  agreement  that 
governments  and  para-statal  agencies  should  not 
discriminate,  private  sector  economic  actors  often 
successfully  seek  exemption  on  the  ground  that 
nondiscrimination  will  disadvantage  them  (e.g.,  by 
resulting  in  lower  productivity,  by  overriding  cus¬ 
tomers’  preferences).  In  general,  definitions  are 
ambiguous  as  to  their  application  in  the  realm  of 
private  life  in  which  differential  treatment  based 
on  personal  preferences  is  widespread  but  consid¬ 
ered  to  be  legitimate. 

Is  Discrimination  an  Attitude  or  an  Act ? 

Does  a  mere  attitude ,  such  as  the  intention  to 
treat  someone  differentially,  constitute  discrimina¬ 
tion,  or  must  a  tangible  disadvantage  following 
accomplished  action  occur?  Much  research  in 
social  psychology  focuses  on  the  conditions  under 
which  members  of  one  group  develop  unfavorable 
attitudes  toward  members  of  other  groups,  a  pro¬ 
cess  labeled  outgroup  discrimination.  Other  disci¬ 
plines  such  as  the  economic,  political,  and  social 
sciences  have  traditions  of  observing  discrimina¬ 
tory  acts  and  their  detrimental  consequences. 
Discriminatory  attitudes  do  not  always  lead  to 
corresponding  acts. 

The  Contingency  of  Discrimination 

Discrimination  is  a  contingent  phenomenon  since 
acceptance  of  universalistic  principles  and  aware¬ 
ness  of  inequality  have  developed  historically. 
Statistical  evidence  can  demonstrate  certain  forms 
of  social  inequality  while  explanations  remain 
contested.  Demands  that  discrimination  be  recog¬ 
nized  are  made  in  contexts  of  power  relations  and 
often  succeed  only  as  a  result  of  pressure  groups 
claiming  minority  rights  against  vested  interests. 
Discrimination  thus  is  as  much  a  state  of  political 
consciousness  as  it  is  an  ontological  matter. 
This  holds  all  the  more  true  at  the  level  of  legal 


codification.  Specific  legislation  has  been  enacted 
in  some  countries,  for  example,  with  regard  to 
mandating  equal  treatment  for  women,  gays,  and 
people  with  disabilities;  however,  differential  treat¬ 
ment  on  grounds  of  foreign  citizenship  is  consid¬ 
ered  defensible  for  raisons  d’etat  and  has  failed  to 
achieve  recognition  as  discrimination.  A  glance  at 
the  fate  of  left-handed  people,  vegans,  and  others 
reveals  that  traits  and  categories  exist  that  are  con¬ 
nected  to  some  kinds  of  inequality  but  that  have 
not  yet  been  associated  with  discrimination. 

Types  of  Discrimination 

Statistical  discrimination  is  considered  to  occur 
when  distinctions  are  made  between  demographic 
groups  based  on  aggregate  characteristics  rather 
than  on  an  actor’s  individual  preference  and  preju¬ 
dice.  While  a  group  is  known  to  possess  certain 
characteristics,  an  actor  unaware  of  the  personal 
characteristics  of  an  individual  treats  him  or  her  as 
an  average  member  of  a  group.  But  the  perception 
of  similar  phenomena  of  this  type  is  inconsistent. 
Ethnic  profiling — that  is,  differential  treatment  of 
an  ethnic  or  racial  group  by  security  personnel, 
may  be  considered  discriminatory  even  if  evidence 
existed  to  support  it  as  a  rational  strategy  of  law 
enforcement,  because  members  of  certain  groups 
figure  high  in  criminal  statistics.  When  employers 
promote  equally  qualified  women  to  leading  posi¬ 
tions  less  frequently  because  experience  shows  a 
higher  likelihood  of  interruption  of  employment 
due  to  pregnancy  or  child  care,  political  reactions 
often  are  ambiguous:  This  can  be  seen  as  illegiti¬ 
mate  with  regard  to  the  individual  person,  whereas 
employers  defend  it  as  rational  behavior  maximiz¬ 
ing  productivity.  By  contrast,  it  is  less  noncontro- 
versial  that  insurance  companies  grant  preferential 
conditions  according  to  age,  sex,  occupation,  or 
other  criteria  based  on  evaluations  of  risk  statistics. 

While  intentional  discrimination  is  located  at 
the  level  of  individuals,  institutional  discrimination 
denotes  explicit  policies  of  social  institutions  that 
exclude,  impede,  or  otherwise  harm  certain  groups 
irrespective  of  adverse  attitudes  of  implementing 
agents.  Famous  examples  are  laws  restricting 
rights  of  racial  or  ethnic  minorities  or  banning 
women  from  politics.  Recently,  courts  in  Europe 
have  held  proceedings  against  laws  barring  for¬ 
eigners  from  particular  professions.  By  contrast, 
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structural  (i.e.,  indirect)  discrimination  character¬ 
izes  policies  that  are  neutral  in  intent  and  in  imple¬ 
mentation  but  still  result  in  adverse  effects  for 
minorities.  In  the  literal  sense,  structural  discrimi¬ 
nation  constitutes  the  very  opposite  of  discrimina¬ 
tion  and  it  is,  therefore,  most  controversial.  Its 
proponents  hold  that  states  based  on  universal 
principles  have  to  bring  about  equal  life  chances 
for  all,  which  includes  the  duty  on  the  part  of  insti¬ 
tutions,  for  example,  in  education,  to  compensate 
proactively  for  the  inequalities  minorities  have 
experienced  from  discrimination  suffered  in  the 
past  or  in  other  social  systems.  They  argue  that  the 
failure  to  do  so  constitutes  discrimination. 

Explanations  of  Discrimination 

Social  psychological  explanations  of  discrimina¬ 
tion  based  on  social  identity  theory  presume  that 
humans  rely  on  the  groups  they  belong  to  for  a 
part  of  their  identity.  Belonging  to  a  group  that  is 
more  prestigious  and  powerful  than  others  boosts 
one’s  sense  of  self-esteem.  Discrimination  in  the 
sense  of  debasing  and  impairing  outgroup  mem¬ 
bers  or  declining  them  access  to  resources  and 
wealth  serves  the  purpose  of  strengthening  the 
relative  position  of  one’s  in  group  and  also  indi¬ 
rectly  boosts  individual  self-esteem.  Empirical 
studies  confirm  that  persons  with  a  low  sense  of 
social  recognition  display  more  outgroup  devalua¬ 
tion  and  group-focused  enmity  based  on  an  ideol¬ 
ogy  of  human  inequality  (Wilhelm  Heitmeyer, 
2002).  Negative  attitudes  toward  different  out¬ 
groups  (ethnic  and  religious  minorities,  women, 
and  people  who  are  disabled  or  homeless)  strongly 
intercorrelate,  indicating  the  unspecific  nature  of 
discrimination.  Social  dominance  theory  argues 
that  societies  as  a  consequence  of  evolution  gener¬ 
ally  develop  hierarchical  structures  and  humans 
have  a  disposition  of  dominance  orientation  and 
willingness  to  discriminate,  but,  in  addition,  a  high 
group  status  exerts  a  reinforcing  influence  on  this 
attitude. 

Beyond  statistical  discrimination,  economic 
explanations  argue  that  a  part  of  the  difference 
between  groups  in  income,  prestige,  and  status 
achievement  is  due  to  differences  in  productivity. 
Women,  for  example,  are  assumed  to  choose  occu¬ 
pations  compatible  with  family  demands  and  akin 
to  female  role  models  that  are  on  average  paid 


worse  and  to  invest  less  in  training  since  their 
returns  would  be  smaller  than  men’s.  Differential 
treatment  based  on  productivity  gradients  is  held 
to  be  legitimate.  According  to  Gary  Becker  (1971), 
illegitimate  differences  are  explained  by  “tastes  of 
discrimination”  on  the  part  of  employers,  custom¬ 
ers,  and  landlords.  Since  such  tastes  imply  ineffi¬ 
cient  attribution  of  factors  of  production,  they 
cause  disadvantages  on  perfect  markets  and  are 
therefore  deemed  to  vanish  over  time.  It  has  been 
shown,  however,  that  in  majority-minority  situa¬ 
tions,  under  state  control,  and  in  other  market 
irregularities  discrimination  due  to  taste  may  per¬ 
sist.  Economic  explanations  have  been  criticized 
for  failing  to  address  the  root  causes  of  differential 
preferences  along  gender,  class,  and  ethnic  lines 
and  the  mechanisms  behind  discrimination. 

Sociological  analyses  have  shown  widespread 
ideological  constructions  of  superiority  of  certain 
groups,  for  example,  “gender  status  beliefs,” 
legitimizing  unequal  access  to  status  positions. 
One  area  of  research  is  dedicated  to  the  intergen- 
erational  stability  of  class  positions.  Pierre 
Bourdieu  (1980),  for  example,  argues  that  power¬ 
ful  groups  define  a  specific  taste  that,  internalized 
at  an  early  age,  serves  as  a  lifelong  means  of 
distinction  between  classes  and  regulates  status 
assignment  in  contradiction  to  meritocratic  prin¬ 
ciples  of  modernity. 

Wilhelm  Heitmeyer  and  Kurt  Salentin 
Bielefeld  University 
Bielefeld,  Germany 
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Discursive  Institutionalism 


The  term  discursive  institutionalism  is  an  umbrella 
concept  for  the  wide  range  of  works  in  political 
science  that  focus  on  the  substantive  content  of 
ideas  and  the  interactive  processes  of  discourse  by 
which  they  are  generated  and  communicated  in 
given  institutional  contexts.  This  makes  for  a 
definition  that  is  somewhat  broader  than  when 
the  same  or  similar  terms  are  used  by  those  who 
focus  primarily  on  the  substance  of  political 
agents’  ideas  in  the  ideational  turn  or  on  the  text 
in  discourse  analysis.  Below,  relations  to  other 
institutionalist  approaches,  varieties  of  this  con¬ 
cept,  and  their  applications  in  political  science  are 
discussed. 

Discursive  institutionalism  can  be  considered 
the  fourth  “new  institutionalism,”  because  it  is  a 
distinctive  approach  that  contributes  to  our  under¬ 
standing  of  political  action  in  ways  that  the  other 
three  neo-institutionalisms — rational  choice,  his¬ 
torical,  and  sociological — cannot.  It  provides 
insight  into  areas  of  politics  that  political  scientists 
working  within  the  other  neo-institutionalist  tradi¬ 
tions  have  long  neglected:  the  role  of  ideas  in  con¬ 
stituting  politics  and  in  reconstructing  political 
interests  and  values,  the  power  of  discourse  for 
persuasion  in  political  debate  and  legitimation  in 
democracy,  the  importance  of  ideas  and  discourse 
in  explaining  the  microdynamics  of  change  in 
rationalist  interests  and  structures,  historical  paths 
and  regularities,  and  cultural  frames  and  norms. 

The  word  institutionalism  in  the  term,  more¬ 
over,  suggests  that  this  approach  is  not  only  about 
the  discursive  construction  and  communication  of 
ideas  but  equally  about  the  institutional  context  in 
which  and  through  which  ideas  are  communicated 
via  discourse.  This  institutional  context  refers  first 
and  foremost  to  the  structure,  construction,  and 
communication  of  meaning.  But  it  can  also  be 
understood  as  the  background  information  pro¬ 
vided  by  the  other  three  neo-institutionalisms  in 


political  science  with  which  discursive  institution¬ 
alists  may  engage  and  from  which  they  often 
emerge. 

Varieties  of  Approaches 

Discursive  institutionalism  encompasses  a  wide 
range  of  approaches  across  fields  of  political  sci¬ 
ence,  from  constructivism  in  international  relations 
to  the  ideational  turn  in  American  political  devel¬ 
opment  and  comparative  politics,  from  the  delib¬ 
erative  democracy  of  political  theory  to  the  analy¬ 
sis  of  campaign  rhetoric  and  media  discourse  in 
American  electoral  politics,  and  from  the  discourse 
analysis  of  critical  legal  studies  or  poststructuralist 
theory  to  that  of  feminist  gender  studies.  In  such 
approaches,  when  discursive  institutionalists  focus 
on  ideas,  they  may  consider  different  types  of 
ideas — cognitive  ideas  justified  in  terms  of  interest- 
based  logics  and/or  normative  ideas  legitimated 
through  value-based  logics  of  appropriateness — at 
different  levels  of  generality,  from  policy  ideas  to 
programmatic  ideas  or  from  paradigms  to  deeper 
philosophical  ideas.  When  they  are  more  con¬ 
cerned  with  discourse,  they  may  concentrate  on  the 
representation  of  ideas  through  frames,  narratives, 
myths,  collective  memories,  stories,  scripts,  and  the 
like  or  on  the  discursive  processes  by  which  such 
ideas  are  constructed  and  communicated  in  the 
public  sphere. 

Discursive  institutionalists  may  also  be  divided 
over  which  part  of  the  public  sphere  they  ought  to 
investigate.  Some  are  more  interested  in  the  coordi- 
native  discourse  of  the  policy  sphere,  in  which 
policy  actors  construct  ideas  through  epistemic 
communities  of  loosely  connected  individuals  with 
shared  ideas  about  a  common  policy  enterprise, 
advocacy  coalitions  of  more  closely  connected 
elites  with  shared  ideas  and  access  to  policy  making 
in  local  and  national  politics,  advocacy  networks  of 
activists  contesting  ideas  in  international  politics, 
and  entrepreneurs  or  mediators  who  serve  as  cata¬ 
lysts  for  change  as  they  articulate  the  ideas  of 
discursive  communities.  Other  discursive  institu¬ 
tionalists  are  more  focused  on  the  communicative 
discourse  of  the  political  sphere,  in  which  political 
actors  discuss,  deliberate,  and  legitimate  the  ideas 
developed  in  the  coordinative  discourse.  These 
actors  include  elected  officials,  opposition  parties, 
or  spin  doctors,  civil  servants,  and/or  policy  actors 
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involved  in  the  mass  electoral  process  of  public  per¬ 
suasion.  But  they  also  involve  opinion  leaders,  pub¬ 
lic  intellectuals,  and  the  media  engaged  in  public 
debates;  organized  interests  in  specialized  policy 
forums;  members  of  civil  society  engaged  in  grass¬ 
roots  organizations  and  social  movements;  or  mini 
publics  participating  in  citizen  juries  and  issue 
forums,  as  well  as  citizens  whose  voices  are  heard 
not  only  in  opinion  polls  but  also  in  votes — where 
actions  speak  even  louder  than  words.  The  com¬ 
municative  discourse  itself  may  be  seen  as  top- 
down,  as  elites  seek  to  form  mass  public  opinion  or 
legitimize  public  policy;  as  bottom-up,  as  social 
activists  attempt  to  influence  national  and  interna¬ 
tional  debates;  at  the  top  among  policy  and  political 
actors,  as  elites  debate  ideas  in  national  or  global 
policy  forums;  or  at  the  bottom,  among  everyday 
actors  engaged  in  deliberative  democracy  or  in 
expressing  ideas  through  their  everyday  practices. 

Discursive  Institutionalism 
in  Institutionalist  Perspective 

All  discursive  institutionalists  define  institutional 
context  first  of  all  in  terms  of  the  meaning  context 
in  which  ideas  and  discourse  make  sense,  such  that 
speakers  “get  it  right”  in  terms  of  the  ideational 
rules  or  rationality  of  a  given  setting  by  addressing 
their  remarks  to  the  “right”  audiences,  at  the 
“right”  times,  and  in  the  “right”  ways.  But  beyond 
this,  they  may  differ  in  terms  of  how  they  refine 
their  definitions  of  institutional  context  and  their 
objects  of  explanation,  depending  on  the  back¬ 
ground  information  they  add  from  one  or  another 
of  the  other  neo-institutionalist  traditions. 

For  example,  discursive  institutionalists  who 
engage  with  rational  choice  institutionalism  also 
often  use  the  language  of  interest  maximization 
and  incentive  structures.  But  unlike  rational  choice 
institutionalists,  who  tend  to  put  interests  first, 
leaving  aside  ideas  as  little  more  than  mechanisms 
to  choose  among  objective  interests  or  as  focal 
points  to  choose  among  equilibria,  discursive  insti¬ 
tutionalists  put  ideas  first,  as  constituting  or  con¬ 
structing  subjective  interests  that  encompass  a 
wider  range  of  strategic  ideas  and  social  norms 
than  most  rational  choice  institutionalists  admit. 
As  for  institutions,  rather  than  the  rational  choice 
institutionalists’  “neutral”  incentive  structures, 
discursive  institutionalists  see  them  as  infused  with 


ideas  and  values  or  collective  memories  that  make 
them  objects  of  trust  or  mistrust  and  changeable 
over  time  as  actors’  ideas  and  discourse  about 
them  change  along  with  their  performance. 
Moreover,  instead  of  attributing  interests  to  “ratio¬ 
nal”  actors  who  are  assumed  to  live  in  a  world  of 
knowable  risk,  discursive  institutionalists  investi¬ 
gate  the  actual  ideas  about  interests  of  real  actors 
who  live  in  a  world  that  is  often  characterized  by 
radical  or  immeasurable  uncertainty.  Finally, 
against  the  rational  choice  institutionalists’  view  of 
talk  as  by  definition  “cheap”  and  actions  as 
“speaking  more  loudly  than  words,”  discursive 
institutionalists  take  what  people  say  about  their 
actions  as  well  as  the  exchange  of  ideas  in  public 
debates  and  discourse  as  key  to  explaining  politics, 
as  the  basis  of  leadership  through  the  top-down 
processes  of  mass  persuasion  and  of  democracy 
through  the  bottom-up  processes  of  opinion  for¬ 
mation  and  deliberation. 

Discursive  institutionalists  who  engage  with 
sociological  institutionalism  have  much  less  to  dis¬ 
agree  with  since  sociological  institutionalists 
already  use  the  language  of  ideas,  norms,  and  dis¬ 
course.  Moreover,  for  the  most  part,  they  share  a 
constructivist  ontology  about  the  nature  of  inter¬ 
ests  and  institutions.  For  both,  in  place  of  rational¬ 
ists’  assumptions  about  material  interests  as  deter¬ 
minant  factors  in  individual  action,  they  consider 
political  agents’  ideas  about  interests  as  well  as 
norms  in  response  to  material  reality.  For  discur¬ 
sive  institutionalists,  the  main  sources  of  disagree¬ 
ment  arise  with  those  sociological  institutionalists 
who  come  down  more  on  the  side  of  institutional 
stasis,  by  emphasizing  how  cultural  frames  con¬ 
strain  ideas,  discourse,  and  action,  as  opposed  to 
the  institutional  dynamics  that  comes  from  the 
reframing  of  action  through  ideas  and  discourse. 

Discursive  institutionalists  who  engage  with 
historical  institutionalism  have  similar  concerns 
about  the  static  nature  of  approaches  that  empha¬ 
size  institutional  path  dependence  and  see  change 
as  the  result  of  unexplained  exogenous  shocks. 
They  find,  moreover,  that  even  though  the  recent 
shift  in  historical  institutionalism  from  exogenous 
to  incremental  or  evolutionary  theories  has  gone  a 
long  way  toward  endogenizing  change,  this  only 
does  more  to  describe  what  changes  occur  than  to 
explain  why  change  occurs.  Any  such  explanation 
requires  a  move  from  a  focus  on  structure  to 
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agency,  to  explore  the  microfoundations  of  the 
macropatterns.  But  instead  of  continuing  to  turn 
for  such  agency  either  to  the  incentive-driven  ratio¬ 
nal  agents  of  rational  choice  institutionalism  or  to 
the  culturally  framed  social  agents  of  sociological 
institutionalism,  discursive  institutionalists  argue 
that  historical  institutionalists  would  do  better  to 
look  to  the  ideas  and  discourse  of  sentient — or 
reflexive,  meaning  self-reflective  thinking  and 
speaking — agents . 

For  discursive  institutionalists,  the  problem 
with  the  three  other  neo-institutionalisms  is  that 
the  structures — of  incentives,  culture,  or  path 
dependence — that  make  up  the  institutional  con¬ 
text  act  mainly  as  external  constraints  on  action. 
Instead,  discursive  institutionalists  treat  institu¬ 
tions  at  one  and  the  same  time  as  given  (i.e.,  as 
structures  that  are  the  context  within  which  agents 
think,  speak,  and  act)  and  as  contingent  (i.e.,  as 
constructs  that  result  from  agents’  thoughts,  words, 
and  actions.  Here,  continuity  and  change  in  insti¬ 
tutions  can  be  explained  by  the  dynamic  processes 
through  which  agents  use  not  only  their  back¬ 
ground  ideational  abilities — a  generic  term  for 
John  Searle’s  background  abilities  or  Pierre 
Bourdieu’s  habitus — to  create  and  maintain  not 
only  their  institutions  but  also  their  foreground 
discursive  abilities — akin  to  Jurgen  Habermas’s 
communicative  action — to  communicate  and 
deliberate  about  taking  action  collectively  to 
change  (or  maintain)  those  institutions. 

The  relationship  between  discursive  institution¬ 
alism  and  the  other  neo-institutionalisms  is  com¬ 
plicated,  therefore,  since  discursive  institutionalists 
may  accept  some  of  the  language  and  logic  of  the 
other  approaches  even  as  they  critique  them.  With 
regard  to  rational  choice  institutionalism,  discur¬ 
sive  institutionalists  sometimes  take  its  account  of 
interest-based  ideas  as  a  shortcut  to  that  which  can 
be  expected  (albeit  not  predicted)  although  even 
the  expected  is  always  open  to  investigation.  With 
regard  to  sociological  institutionalism,  discursive 
institutionalists  often  take  its  account  of  culture, 
norms,  and  frames  as  a  useful  starting  point  for  a 
closer  examination  of  the  dynamics  of  change 
(or  continuity).  With  regard  to  historical  institu¬ 
tionalism,  discursive  institutionalists  help  explain 
the  dynamics  of  change  in  historical  structures 
through  an  agent-centered,  meaning-based  logic 
of  communication.  By  the  same  token,  however, 


historical  institutionalism  can  lend  a  hand  to  dis¬ 
cursive  institutionalism,  by  helping  show  how 
formal  institutions  may  shape  discursive  interac¬ 
tions.  For  example,  compound  polities,  in  which 
governing  activities  are  dispersed  among  multiple 
authorities,  tend  to  produce  more  elaborate  coor- 
dinative  discourses  among  policy  actors,  whereas 
simple  polities,  in  which  governing  activities  tend 
to  be  channeled  through  a  single  authority,  facili¬ 
tate  more  elaborate  communicative  discourses  to 
the  public. 

Discursive  Institutionalist  Methods 

But  how  can  discursive  institutionalists  demon¬ 
strate  that  ideas  and  discourse  make  a  difference 
for  political  action,  let  alone  have  a  causal  influ¬ 
ence?  Political  scientists  often  use  methods  based 
on  comparative  case  studies  and  process  tracing  to 
demonstrate  how  ideas  and  discourse  are  tied  to 
action.  This  may  involve  pointing  to  the  ideas  that 
serve  as  before-the-fact  guides  for  action,  as  after- 
the-fact  sources  of  justification  and  legitimation 
for  action,  or  as  ongoing  frames  through  which 
actors  understand  as  well  as  explain  their  actions. 
But  it  may  instead  involve  considering  the  discur¬ 
sive  exchange  of  ideas,  as  actors  in  debates  or 
negotiations  change  their  ideas  as  a  result  of  the 
process  of  persuasion — something  that  Habermas 
calls  “arguing” — in  contrast  to  “bargaining.” 
Another  such  method  involves  showing  that 
ideas  and  discourse  can  be  independent  variables 
by  demonstrating  that  no  other  neo-institutional- 
ist  explanation,  whether  in  terms  of  rationalist 
interests,  historical  path  dependencies,  or  cul¬ 
tural  norms,  does  a  better  job  of  accounting  for 
the  clear  changes  (or  continuities)  in  political 
action.  Examining  matched  pairs  of  cases  in 
which  everything  is  controlled  for  except  the 
ideas  and  discourse  can  also  usefully  show  when 
these  phenomena  make  a  difference.  Finally, 
although  qualitative  methods  are  most  prevalent, 
discursive  institutionalists  also  employ  a  variety 
of  quantitative  techniques  such  as  surveys,  opin¬ 
ion  polls,  and  fuzzy  sets  to  establish  the  influence 
of  ideas  and  discourse. 

Vivien  A.  Schmidt 

Boston  University 
Boston,  Massachusetts,  United  States 
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Discursive  Policy  Analysis 


Discursive  policy  analysis  focuses  on  discussion  or 
debate  as  the  main  object  of  scrutiny.  Originally, 
the  word  discourse  was  derived  from  the  Latin 
noun  discursus,  which  means  “running  to  and 
fro.”  It  is  this  dynamic  aspect  that  we  still  find  in 
discursive  policy  analysis,  which  often  concerns 
itself  with  the  development  of  meaning.  How  do 
specific  meanings  evolve  during  the  policy  pro¬ 
cess?  Why  do  certain  terms  pop  up  more  often? 
Can  we  see  how  coalitions  are  organized  around 
these  terms?  Why  does  a  policy  process  move  first 
this  way,  then  that  way?  This  entry  first  gives  a 
short  overview  of  the  various  meanings  associated 
with  the  term  discursive  policy  analysis ,  then 
elaborates  on  the  relationship  between  discourse 


and  practice  and  language  and  power,  and  ends 
with  a  short  note  on  the  normative  or  prescriptive 
potential  of  discursive  policy  analysis. 

The  Meaning  of  Discursive  Policy  Analysis 

The  discursive  analyst  takes  discussion  as  the 
object  of  analysis  and  then  sets  out  to  trace  linguis¬ 
tic  regularities  within  those  debates.  These  linguis¬ 
tic  regularities  are  generally  called  discourses. 
However,  the  term  discourse  is  used  in  different 
ways,  ranging  from  the  speech-act  theory  to  the 
“archeology”  of  Michel  Foucault. 

What  the  approaches  share  is  an  anti-essentialist 
ontology.  Discursive  analysis  can  be  placed  within 
the  constructivist  or  interpretive  tradition  of  the 
social  sciences:  It  assumes  that  language  is  consti¬ 
tutive  of  social  realities  and  takes  a  critical  stance 
toward  a  singular  notion  of  an  objective  “truth” 
waiting  to  be  discovered.  This  is  not  to  assume 
that  phenomena  exist  only  in  language  (as  critics 
tend  to  state),  but  it  does  suppose  that  we  can  per¬ 
ceive  phenomena  only  by  linguistically  making 
sense  of  them.  This  assumption  of  multiple  social 
realities  is  of  particular  importance  for  the  policy 
analyst,  because,  in  the  words  of  Carol  Bacchi 
(2000), 

The  premise  behind  a  policy-as-discourse  approach 
is  that  it  is  inappropriate  to  see  governments  as 
responding  to  “problems”  that  exist  “out  there” 
in  the  community.  Rather  “problems”  are 
“created”  or  “given  shape”  in  the  very  policy 
proposals  that  are  offered  as  “responses.”  (p.  48) 

Societal  events  are  almost  by  definition  ambiguous 
and  so  complex  that  they  can  never  be  fully 
described.  Every  description  or  narrative  gives  a 
particular  perspective  on  the  problem;  indeed,  the 
very  perspective  will  determine  whether  a  particular 
phenomenon  will  be  seen  at  all  as  a  problem  that 
requires  political  action.  Words,  catchphrases, 
metaphors,  and  story  lines  frame  societal  problems, 
and  as  this  framing  influences  our  answer  to  the 
question,  “Who  gets  (and  who  should  get)  what, 
when,  and  how?”  it  is  inherently  political. 

As  language  is  not  transparent — we  cannot  “see 
through”  it — there  is  a  continuous  interaction 
between  the  terms  in  which  a  policy  is  described 
and  the  way  we  think  about  it.  In  their  famous 
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book  Metaphors  We  Live  By  (1980),  George 
Lakoff  and  Mark  Johnson  explained  how  describ¬ 
ing  something  in  terms  of  something  else  (what 
they  call  the  “thisness  of  a  that”)  structures  our 
thought  processes.  For  instance,  it  matters  whether 
we  describe  integration  in  terms  of  a  melting  pot, 
an  invasion,  or  even  a  war,  as  the  description  will 
shape  not  only  our  understanding  of  what  it  is  but 
also  the  ways  we  respond  to  it.  Moreover,  a  par¬ 
ticular  policy  will  always  come  with  categories  and 
target  groups  and  descriptions  of  deserving  and 
undeserving  people,  thereby  creating  its  own 
political  reality.  In  the  words  of  Norman  Fairclough 
and  Ruth  Wodak  (1997), 

discourse  is  socially  constitutive  as  well  as  socially 
shaped:  it  constitutes  situations,  objects  of 
knowledge,  and  the  social  identities  of  and 
relationships  between  people.  It  is  constitutive 
both  in  the  sense  that  it  helps  to  sustain  and 
reproduce  the  status  quo,  and  in  the  sense  that  it 
contributes  to  transforming  it.  (p.  258) 

The  central  question  then  becomes  how  certain 
meanings  are  created,  challenged,  or  sustained. 

Language  in  Use 

To  answer  that  question,  studying  language  alone 
will  often  prove  insufficient.  After  all,  language 
does  not  “float”  in  society  but  is  uttered  in  par¬ 
ticular  contexts  that  influence  its  meaning;  the 
words  “Yes,  I  do”  have  an  entirely  different  mean¬ 
ing  at  the  altar  in  a  church  than  at  the  card  table 
in  a  pub.  Likewise,  a  particular  policy  might  be 
documented  in  papers  and  reports,  but  in  most 
cases,  its  meaning  will  be  formed  and  solidified 
during  implementation,  in  the  interaction  between 
street-level  bureaucrats  and  (would-be)  target 
groups.  Meanings  are  not  only  localized  in  lan¬ 
guage  but  also  in  symbolic  artifacts  and  the  physi¬ 
cal  environment  in  which  the  interaction  takes 
place.  Discourse  analysis,  then,  is  contextual;  it  is 
the  study  of  language  in  use. 

Such  a  statement  could  be  placed  in  a  Foucaultian 
tradition  of  discursive  analysis  that  tried  to  “over¬ 
come  the  traditional  distinction  between  what  one 
says  (language)  and  what  one  does  (practice)” 
(Stuart  Hall,  2001,  p.  72).  Thus,  Maarten  Hajer 
(1995)  defined  discourse  as  a  set  of  ideas,  concepts, 


and  categories  through  which  meaning  is  given  to 
social  and  physical  phenomena.  That  “ensemble” 
is  produced  and  reproduced  through  an  identifi¬ 
able  set  of  “practices.”  There  is,  however,  a  second 
problem  of  definition,  because  practice  is  a  catch¬ 
all  term  as  well.  According  to  Etienne  Wenger 
(1998),  practice  is  “doing  in  a  historical  and  social 
context  that  gives  structure  and  meaning  to  what 
we  do”  and  includes  not  only  “the  language,  tools, 
documents,  images,  symbols,  well-defined  roles, 
specified  criteria,  codified  procedures,  regulations 
and  contracts”  but  also  “the  implicit  relations, 
tacit  conventions,  subtle  cues,  untold  rules  of 
thumb,  recognizable  intuitions,  specific  percep¬ 
tion,  well-tuned  sensitivities,  embodied  under¬ 
standings,  underlying  assumptions,  and  shared 
world-views”  (p.  47). 

This  leads  to  the  paradoxical  situation  that  dis¬ 
course  can  include  practice,  that  practice  also 
includes  discourse,  and  that  both  terms  can  be 
used  to  describe  almost  anything.  It  seems  justified 
to  ask,  paraphrasing  Aaron  Wildavsky,  “If  dis¬ 
course  is  everything,  maybe  it  is  nothing?” 

Words  and  Interests 

When  reading  the  literature  on  discursive  policy 
analysis  with  its  contradictory  and  often  implicit 
definitions,  one  might  be  tempted  to  answer  affir¬ 
matively.  By  making  the  explanandum  (social  real¬ 
ity)  into  an  explanans  (practice),  a  structure-agency 
dilemma  can  be  easily  kept  out  of  sight.  No  wonder 
then  that  Bacchi  (2000)  argues  that  discursive  ana¬ 
lysts  tend  to  be  positioned  on  the  left  of  the  politi¬ 
cal  spectrum,  to  come  in  with  an  agenda  for  change 
and  to  define  discourse  in  a  way  suiting  their 
political  purpose  as  constraints  on  change. 

While  the  truth  of  this  observation  can  hardly 
be  denied,  the  ability  to  answer  not  only  “what” 
but  also  “why”  and  “how”  questions  remains  a 
crucial  strength  of  discursive  policy  analysis,  but 
this  requires  explicit  attention  for  the  way  in  which 
language  is  related  to  power  and  vice  versa.  To 
what  extent  are  subjects  “controlled”  by  the 
unconscious  use  of  language  in  context  that  in 
itself  incorporates  particular  power  differences  or, 
on  the  contrary,  to  what  extent  do  subjects  control 
their  environment  by  strategically  using  discourse? 
And  if  subjects  are  captivated  in  the  worldview  of 
dominant  discourses,  how  would  transformation 
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be  possible?  Again,  various  discursive  policy  ana¬ 
lysts  would  answer  these  questions  in  different 
ways;  whereas  some  scholars  (e.g.,  the  political 
scientist  Murray  Edelman)  emphasize  the  strategic 
and  conscious  use  of  symbolic  language  by  those  in 
power,  practice  researchers  point  to  the  way  in 
which  the  relationship  between  actors  is  consti¬ 
tuted  by  the  contextualized  practices  in  which  they 
act. 

Both  practices  and  discourses  have  meaning 
only  to  the  extent  that  they  are  taken  up  by  the 
actors  involved  in  a  particular  process.  Whereas 
subjects  will  almost  always  draw  on  stock  texts 
and  practices  and  on  memory,  conventions,  and 
historical  references,  situations  might  arise  in 
which  these  default  narratives  are  no  longer  suffi¬ 
cient  and  can  be  challenged  by  new  discourse 
coalitions.  During  these  moments  of  dislocation, 
discursive  policy  analysis  can  be  helpful  not  only 
for  studying  how  power  is  given  shape  in  particu¬ 
lar  practices  but  also  to  understand  how  interests 
come  about  in  the  first  place.  Whereas  money  and 
power  are  constant  factors  in  politics,  as  both 
means  and  ends,  it  is  only  via  language  that  actors 
can  set  their  priorities  and  determine  what  their 
interests  are.  Vice  versa  are  the  chances  for  a  par¬ 
ticular  discourse  to  become  dominant,  or  at  least 
partly  dependent,  on  unevenly  distributed  means 
for  symbolic  distribution  (e.g.,  easy  access  to  the 
media). 

What  About  the  “Real”  Problems? 

The  adage  of  “speaking  truth  to  power”  seems  hard 
to  combine  with  the  ontological  assumptions  of 
discursive  policy  analysis,  but  this  does  not  mean 
that  all  normative  or  prescriptive  ambitions  should 
be  set  aside.  Various  analysts  have  asked  themselves 
whether  discursive  interpellations  could  also  be 
actively  promoted  by  active  discourse  management. 
Policy  problems  are  not  discovered  “out  there,”  but 
they  are  linguistically  constructed.  Yet  some  of 
these  linguistic  constructs  may  be  more  helpful  for 
understanding  complex  phenomena  and  thus  for 
developing  effective  policy  practices.  Whereas  this 
is  obviously  a  complicated  endeavor,  it  also  raises 
questions  about  the  role  of  the  policy  analyst. 
Rather  than  regarding  oneself  as  an  objective  pro¬ 
vider  of  knowledge,  the  policy  analyst  becomes  an 
active  mediator  in  the  complex  interaction  between 


politics,  science,  and  societal  groups.  The  art  of 
discursive  policy  analysis  could  then  be  considered 
as,  in  the  words  of  Piet  Hein,  “the  solving  of  prob¬ 
lems  that  cannot  be  expressed  until  they  are 
solved.” 

Maarten  Hajer  and  Wytske  Versteeg 
Universiteit  van  Amsterdam 
Amsterdam,  Netherlands 
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Dissatisfaction,  Political 


Political  dissatisfaction  is  the  attitudinal  expres¬ 
sion  of  unhappiness  or  lack  of  satisfaction  based 
on  the  belief  that  the  government  or  incumbent 
policies  and  their  outcomes  are  falling  short  of  the 
citizen’s  preferences  or  expectations,  resulting  in  a 
negative  feeling  toward  the  government  and  the 
incumbents.  This  definition  implies  the  following 
from  a  theoretical  point  of  view:  First,  it  means 
that  this  is  an  attitude  related  mostly  with  the 
evaluation  of  policies  and  their  outcomes.  Second, 
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it  implies  that  citizens  are  able  to  distinguish  and 
identify  those  policies  and  their  consequences, 
evaluate  them,  and  compare  them  with  their  given 
preferences.  Third,  the  definition  also  signifies 
that  citizens  attribute  these  policy  decisions  to 
specific  political  authorities  who  are  responsible 
for  selecting  and  implementing  them.  Fourth,  it 
means  that  citizens  think  that  there  are  alternative 
policies  that  better  suit  their  problems  and  expec¬ 
tations,  opening  the  possibility  of  giving  the  sup¬ 
port  to  other  contenders.  Below,  the  origins  and 
various  interpretations  of  this  concept,  its  rela¬ 
tionship  with  similar  ones,  and  possibilities  of  its 
operationalization  are  discussed. 

Origin  and  Historical  Evolution 
of  the  Concept 

The  political  dissatisfaction  concept  is  linked  theo¬ 
retically  to  David  Easton’s  model  of  system  sup¬ 
port.  According  to  Easton,  it  is  citizens’  evaluation 
of  political  outcomes  that  determines  the  degree  to 
which  public  authorities,  governments,  and  politi¬ 
cal  systems  as  a  whole  are  supported  by  mass  pub¬ 
lics.  Easton  considers  that  political  dissatisfaction 
is  part  of  the  political  support,  and,  more  con¬ 
cretely,  part  of  the  “specific  support”  (or  the  lack 
thereof)  that  lies  in  the  relationship  with  the  satis¬ 
faction  that  members  of  a  system  feel  they  obtain 
from  the  perceived  outputs  and  performance  of  the 
political  authorities  if  they  are  opposed  to  them, 
disquieted  by  these  policies,  and  dissatisfied  with 
their  conditions  of  life. 

This  contribution  has  led  many  scholars  to  mis¬ 
interpret  the  concept  that  remained  conceptually 
and  methodologically  undefined.  This  is  why  there 
is  a  diverse  use  of  the  concept  and  many  variations 
of  its  operationalization  in  the  literature,  leading 
to  important  theoretical  and  empirical  confusions. 
Often,  this  problem  arises  when  linking  political 
dissatisfaction  with  political  mistrust  or  cynicism 
toward  the  entire  political  system,  making  all  these 
attitudes  part  of  the  same  attitudinal  continuum 
pertaining  to  the  same  one-dimensional  concept 
of  political  support.  A  good  example  of  this 
occurred  when  William  Gamson  provided  a  gen¬ 
eral  theoretical  notion  of  the  process  underlying 
the  development  of  political  distrust  and  political 
disaffection.  He  maintained  that  dissatisfaction 
can  lead  to  a  more  general  problem  of  institutional 


and  system  distrust  when  an  undesirable  outcome 
is  seen  as  producing  a  general  distrust  vis-a-vis 
political  authorities  and  the  whole  representation 
process.  This  interpretation  also  can  be  found  in 
the  writings  by  Robert  Dahl  and,  more  recently, 
Leonardo  Morlino,  when  they  assert  that  political 
dissatisfaction  results  from  the  perception  of  gov¬ 
ernment’s  general  inability  to  deal  effectively  with 
problems  regarded  by  citizens  as  important. 

However,  these  scholars  and  other  authors  fol¬ 
lowing  this  line  of  thought  forget  that  each  politi¬ 
cal  object,  including  political  authorities,  can  be 
the  object  of  diffuse  or  specific  support.  What 
distinguishes  the  type  of  support  is  not  the  object 
of  the  support  but  the  nature  of  the  support — that 
is,  if  it  is  conditional  (instrumental)  or  uncondi¬ 
tional.  Specific  support  can  be  object  specific  in 
two  ways:  First,  people  are  assumed  to  be  aware  of 
the  political  authorities  responsible  for  decision 
making;  second,  it  takes  into  account  the  perceived 
decisions,  policies,  actions,  proclamations,  and 
style  of  the  authorities.  In  this  sense,  the  members 
of  a  political  system  can  perform  a  rational  calcu¬ 
lation  to  find  out  whether  the  authorities’  actions 
address  their  needs  and  demands.  Under  such  con¬ 
ditions,  specific  support,  which  can  only  exist  in 
societies  whose  institutions  allow  authorities  to  be 
held  accountable  for  their  actions  and  the  resulting 
policies,  will  fluctuate  according  to  people’s  per¬ 
ceived  benefits  and  preferences.  So  dissatisfaction 
with  specific  policies  and  the  authorities  approving 
and  implementing  them,  which  is  the  result  of 
instrumental  calculations  conditioned  by  people’s 
preferences,  does  not  necessarily  affect  the  diffuse 
support  accorded  to  the  governmental  institution 
as  such.  Furthermore,  this  relationship  between 
output  and  general  political  trust  is  empirically 
questionable.  Recent  literature  has  demonstrated 
why  this  may  not  happen  in  many  cases  (see,  e.g., 
Russel  J.  Dalton,  2004;  Richard  P.  Gunther  &  Jose 
Ramon  Montero,  2006).  Finally,  Gamson’s  devel¬ 
opmental  model  does  not  incorporate  a  concern 
with  cause  and  effect  or  the  intricate  set  of  interac¬ 
tions  that  may  occur  over  time  between  policy  dis¬ 
satisfaction,  distrust,  and  social  or  historical  events. 
Declining  levels  of  political  trust  may  be  the  result 
of  increased  policy  dissatisfaction,  or  it  may  be  the 
other  way  around,  or  there  might  be  an  indepen¬ 
dent  factor  responsible  for  the  decline  in  both  of 
these  variables.  Some  societal  factor  may  cause  an 
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increase  in  political  cynicism,  which  in  turn  acts  as 
an  intervening  variable  to  bring  about  a  rise  in  dis¬ 
satisfaction  with  government  policies.  Alternatively, 
cynicism  and  dissatisfaction  could  both  be  con¬ 
nected  with  some  changing  social  conditions  and, 
therefore,  be  only  spuriously  related  to  each  other. 

Some  authors,  however,  depart  from  this  mis¬ 
leading  interpretation  of  the  concept.  In  this  line  of 
thought,  Guiseppe  Di  Palma  developed  at  the 
beginning  of  the  1970s  a  definition  of  political  dis¬ 
satisfaction  very  close  to  the  one  presented  above. 
According  to  Di  Palma,  political  dissatisfaction  is 
an  expression  of  displeasure  resulting  from  the 
belief  that  the  performance  of  the  government  or 
political  system  is  falling  short  of  citizens’  wishes 
or  expectations.  This  is  an  important  clarification 
because  of  the  following  two  reasons.  First,  it 
points  out  that  political  dissatisfaction  is  the  prod¬ 
uct  of  the  difference  between  expectations  and 
policy  outcomes,  and,  as  is  well-known  by  now, 
expectations  are  endogenous  to  one’s  own  political 
system  and  the  fulfillment  of  demands  provided  by 
the  incumbent  government;  second,  it  focuses  on 
the  policies  produced  by  a  given  government  or 
incumbent  actors,  implying  some  sort  of  short- 
time  evaluation.  Still  more  important  is  the  contri¬ 
bution  by  Arthur  Miller,  one  of  the  protagonists  of 
the  well-known  Miller-Citrin  controversy  that 
took  place  during  the  mid-1970s.  Miller,  distanc¬ 
ing  himself  from  Jack  Citrin’s  (1974)  comment, 
maintains  that  satisfaction  with  the  political 
authorities  and  political  trust  are  two  theoretically 
different  things  that  should  be  separated  in  empir¬ 
ical  research.  The  contribution  by  Miller  (1974) 
has  become  crucial  over  the  years.  He  is  signaling 
that  the  applicability  of  the  concept  of  political 
support  depends  on  the  validity  of  the  assumption 
that  people  can  be  aware,  however  vaguely,  of  a 
relationship  between  their  needs,  wants,  and 
demands,  on  the  one  hand,  and  the  actions  of  the 
political  authorities  on  the  other.  The  relationship 
has  to  be  such  that  the  citizens  perceive,  whether 
correctly  in  some  objective  sense  or  not,  that  the 
fulfillment  of  their  needs  and  demands  can  be  asso¬ 
ciated  with  the  authorities  in  some  way.  Finally, 
this  way  of  understanding  and  defining  political 
support  and  political  dissatisfaction  has  been  con¬ 
tinued  by  Barbara  Farah,  Samuel  Barnes,  and  Felix 
Heunks  (1979)  when  they  affirm  that  the  concept 
of  political  support  is  based  on  the  belief  that  the 


political  order  should  be  responsive  to  citizens’ 
demands  and  that  these  demands  are  expected  to 
result  in  governmental  outputs.  It  is  the  responsibil¬ 
ity  of  the  elected  officials  to  translate  these  demands 
into  public  policy,  and  subsequently,  their  perfor¬ 
mance  can  be  evaluated  by  the  general  public. 
Dissatisfaction,  then,  emerges  when  performance  is 
viewed  negatively.  This  is  the  line  of  thought  of 
more  recent  works  by  Mariano  Torcal  and  Jose 
Ramon  Montero  (2006),  although  they  make  the 
concept  of  political  dissatisfaction  interchangeable 
with  the  concept  of  political  discontent. 

The  final  point  important  to  mention  here  is 
that  this  attitude  is  the  result  of  evaluations  of 
incumbents  and  policies.  It  tends  to  produce  a 
critical  support  for  concrete  authorities,  being 
strongly  related  to  the  electoral  support  of  these 
authorities.  So  the  most  concrete  and  conspicuous 
consequence  of  political  dissatisfaction  is  elec¬ 
toral  change,  either  approving  or  choosing  other 
parties  or  producing  electoral  abstention.  This  is 
the  most  frequent  result  of  political  dissatisfac¬ 
tion.  Another  question,  even  though  not  fully 
investigated,  concerns  the  potential  consequences 
of  political  dissatisfaction  on  other  attitudes  such 
as  political  trust  when  the  political  system  does 
not  offer  real  alternative  policies  to  those  who  are 
dissatisfied. 

Measurement  and  Operationalization 

There  have  been  different  attempts  to  measure  this 
concept.  Several  authors  attempted  to  measure  this 
concept  with  indicators  of  internal  or  external 
political  efficacy,  institutional  trust,  or  even  politi¬ 
cal  cynicism.  These  attempts  have  mostly  produced 
confusing  and  misleading  results.  Following  the 
definition  proposed  above,  measurement  of  politi¬ 
cal  dissatisfaction  should  include  indicators  of 
approval  and  evaluations  in  terms  of  the  distance 
between  the  individual’s  own  policy  or  party  pref¬ 
erence  and  the  policy  alternative  that  the  individual 
identifies  with  a  particular  party.  Underlying  here  is 
the  assumption  that  if  an  individual  completely 
agrees  with  the  policy  alternative  of  a  particular 
party  as  he  or  she  perceives  it  (i.e.,  when  a  respon¬ 
dent  places  both  himself  or  herself  and  the  party  at 
the  same  point  on  a  scale),  the  individual  is  more 
satisfied  with  the  party’s  policies  than  if  the  indi¬ 
vidual  identified  the  party  with  a  policy  some 
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distance  from  his  or  her  own  position  on  this  issue. 
In  this  line  of  thought,  Miller  emphasized  the  role 
of  policy  evaluations  and  perceptions  in  trying  to 
measure  political  dissatisfaction,  creating  an  index 
based  on  policy  preferences.  However,  as  informa¬ 
tion  on  issue  preferences  is  limited,  costly  to 
obtain,  and  not  always  very  valid  or  reliable, 
scholars  tend  to  use  other  general  indicators 
related  with  governmental  or  incumbent  approval 
and  other  political  and  economic  evaluations. 
Some  authors  have  even  shown  that  the  indicator 
of  general  satisfaction  with  the  functioning  of  a 
democracy  can  be  a  good  indicator  of  political  dis¬ 
satisfaction  since  it  is  highly  contaminated  by 
incumbent  and  government  evaluations. 
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Domestic  Politics  and 
International  Relations 


The  study  of  the  impact  of  domestic  politics  on 
international  relations  is  currently  experiencing  a 
renaissance  after  a  long  period  of  neglect.  Almost 
all  work  in  international  relations  today  deals  in 
some  way  with  the  effect  of  processes  within  state 
borders  on  events  outside  of  them.  Scholars  are 
interested,  for  instance,  in  how  different  interest 
groups  and  political  parties  formulate  their  for¬ 
eign  policy  preferences,  how  institutions  aggregate 
those  different  agendas,  and  how  the  combination 
affects  strategic  interaction  among  states  and  their 
ability  to  get  what  they  want. 

This  entry  provides  a  review  of  the  recent  con¬ 
tributions  to  the  study  of  domestic  politics  in  the 
international  relations  (IR)  literature.  As  this  liter¬ 
ature  is  vast,  the  notion  of  paradigms  of  IR  is 
used — that  is,  particular  ways  of  looking  at  world 
politics  that  are  premised  on  different  key  concepts 
and  variables.  Certainly,  not  all  the  research  being 
done  on  the  role  of  domestic  politics  fits  neatly 
into  one  of  these  approaches,  nor  is  it  necessary  for 
it  to  do  so.  However,  paradigms  do  provide  a  way 
of  categorizing  the  different  ways  in  which  domes¬ 
tic  politics  is  understood  by  IR  scholars  to  affect 
IR,  and  in  doing  so,  they  provide  a  fruitful  tool  for 
systematizing  such  research.  First,  however,  a  very 
brief  history  of  the  study  of  domestic  politics  in 
“modern”  IR  will  be  provided,  covering  roughly 
the  past  30  years,  noting  the  rise  and  fall  and  then 
the  rise  again  of  interest  in  domestic  politics. 

By  a  domestic  politics  approach,  IR  scholars 
generally  mean  analysis  that  tries  to  unpack  the 
state  into  its  constituent  parts — that  is,  opening  up 
the  “black  box”  for  closer  examination. 
Consequently,  scholars  intent  on  demonstrating 
how  states  with  different  domestic  institutions  act 
differently  in  their  IR  can  be  said  to  use  such  an 
approach.  This  should  be  distinguished  from  those 
analysts  who  focus  on  the  microlevel  or  individual 
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level,  such  as  the  foreign  policy  role  of  key  decision 
makers  in  the  executive  branch.  This  is  the  crite¬ 
rion  used  here  to  distinguish  domestic  politics 
from  the  individual-  and  systemic-level  analyses. 

A  Brief  History  of  the  Study 
of  Domestic  Politics  in  Modern 
International  Relations  Theory 

Scholarship  aiming  to  understand  how  what  occurs 
within  a  state  affects  its  dealings  with  other  states 
has  a  long  intellectual  pedigree,  generally  catego¬ 
rized  under  the  rubric  of  “liberalism”  in  IR  theory. 
Liberalism  is  sometimes  understood  as  any  argu¬ 
ment  or  approach  that  disaggregates  the  state  into 
its  component  parts.  It  finds  its  origins  in  the  plu¬ 
ralism  evident  in  liberal  political  theory  in  which 
individuals  each  pursue  their  own  destiny. 
However,  examining  the  processes  occurring 
within  a  state  is  not  confined  to  democracies. 
Autocracies  have  their  own  domestic  political  pro¬ 
cesses,  although  they  are  generally  less  obvious 
and  transparent  to  the  outside  world.  This 
approach  to  explaining  world  politics  reached  its 
apex  with  the  publication  of  Power  and 
Interdependence  in  1977  by  Robert  Keohane  and 
Joseph  Nye.  In  it,  they  not  only  described  an  ideal 
type  of  world  in  which  nation-states  were  disag¬ 
gregated  into  their  constituent  parts — branches  of 
government,  bureaucracies,  the  media,  advocacy, 
and  interest  groups — but  they  also  argued  that 
these  parts  coordinated  with  one  another  across 
country  lines,  sometimes  in  pursuit  of  a  common 
agenda  that  put  them  at  odds  with  other  elements 
in  their  own  societies.  In  this  view,  the  state  was 
not  unimportant,  but  it  was  certainly  viewed  as 
becoming  increasingly  weaker  in  an  increasingly 
interdependent  world. 

With  what  one  might  call  the  structural  revolu¬ 
tion  in  IR  theory,  approaches  focusing  on  domestic 
politics  were  placed  on  the  back  burner  for  almost 
20  years.  Just  after  Keohane  and  Nye’s  book 
appeared,  Kenneth  Waltz’s  Theory  of  International 
Politics  dealt  an  almost  deadly  blow  to  the  more 
pluralistic  approach  to  IR  that  includes  a  study  of 
domestic  politics.  Waltz  argued  that  only  by 
understanding  states  as  operating  within  a  system 
could  the  persistence  of  patterns  in  world  politics, 
most  notably  the  pursuit  of  the  national  interest 
through  the  acquisition  of  military  power,  be 


explained.  Due  to  the  anarchic  nature  of  IR,  a 
characteristic  of  the  international  system,  states  of 
all  different  shapes  and  sizes  pursue  very  similar 
policies.  Given  the  lack  of  any  enforceable  law  or 
governance  in  international  affairs,  states  need  to 
help  themselves.  Survival  requires  the  accumula¬ 
tion  of  power  and  the  subordination  of  other  ele¬ 
ments  of  state  policy,  such  as  trade  or  the  pursuit 
of  human  rights,  to  that  end. 

Under  this  logic  and  given  the  stakes  involved, 
domestic  politics  should  not  be  allowed  to  intrude 
into  the  state  decision-making  process.  Given  indi¬ 
viduals’  shared  interest  in  security  within  a  nation¬ 
state,  the  system  encourages  them  to  come  together 
in  support  of  the  national  interest.  This  justifies 
what  has  been  known  as  the  “billiard  ball”  or 
“black  box”  approach,  according  to  which,  what 
happens  inside  the  nation-state  is  seen  as  unim¬ 
portant.  For  all  intents  and  purposes,  all  states  can 
be  understood  as  being  unitary  actors  responding 
to  the  same  systemic  pressures,  with  only  their 
relative  power  left  to  distinguish  them  from  other 
states.  The  implication  is  that  it  is  the  obligation  of 
the  state  to  guard  the  interests  of  society,  even  if 
this  involves  disregarding  particular  domestic 
interests.  It  should  act  as  a  unitary  actor  protecting 
a  country’s  national  interest,  understood  as  more 
than  just  the  aggregation  of  the  preferences  of  indi¬ 
viduals  and  groups  within  society.  States  should 
judge  and  interpret  their  environment  as  objec¬ 
tively  as  possible,  since  mistakes  can  have  serious 
consequences. 

This  left  little  room  for  domestic  politics.  Only 
through  systemic  analysis  can  one  understand  the 
continuity  in  IR.  By  this,  Waltz  meant  the  big  ques¬ 
tions  such  as  why  phenomena  such  as  war  and  the 
balance  of  power  continually  occur,  which  consti¬ 
tute  the  core  of  the  discipline.  Under  Waltz’s  logic, 
domestic  politics  explains  variation  in  foreign 
policy — that  is,  the  smaller  questions  of  day-to-day 
management  of  international  affairs  in  contrast  to 
international  politics  proper.  Hence,  domestic 
politics  is  a  matter  of  foreign  policy,  not  of  IR.  A 
generation  of  neorealist  or  structural  realist  schol¬ 
ars  set  out  to  show  how  the  state  might  be  under¬ 
stood  as  a  unitary  actor,  coming  together  in  times 
of  threat  to  meet  challenges  in  the  rational  way 
dictated  by  the  system.  The  clear  influence  of  this 
structural  revolution  in  IR  theory  was  reflected  in 
the  adoption  of  the  unitary  action  assumption  by 
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Keohane  in  his  next  critique  of  realist  IR  theory, 
After  Hegemony.  Keohane  took  issue  with  the  pes¬ 
simistic  conclusions  reached  by  structural  realism 
but  not  with  its  foundational  premises. 

In  recent  years,  however,  there  has  been  an 
explosion  of  interest  in  the  influence  of  domestic 
politics  in  IR.  This  is  partially  the  product  of  two 
new  approaches,  rationalism  and  constructivism, 
which  question  realist  assumptions.  However, 
even  realists  have  begun  to  investigate  domestic 
politics,  although  generally  in  keeping  with  the 
basic  premises  and  constraints  of  structural  real¬ 
ism.  Certainly,  not  all  research  focusing  on  domes¬ 
tic  politics  explicitly  identifies  with  these  three 
approaches,  or  needs  to.  Given  the  enormous  size 
of  the  literature,  however,  looking  at  the  logic  of 
these  three  strands  helps  us  sort  through  and  make 
sense  of  the  broader  trends  in  the  discipline. 

Rationalism 

Although  rationalism  is  a  very  broad  approach 
that  may  not  constitute  a  coherent  paradigm  of  IR, 
it  is  nevertheless  possible  to  identify  central  themes 
and  a  certain  logic  to  the  approach.  It  is  based  first 
and  foremost  on  the  presumption  of  utilitarianism 
on  the  part  of  individual  political  actors. 
Rationalism  understands  its  objects  of  inquiry  as 
calculating,  egoistic  players  whose  only  possible 
bond  is  self-interest.  Focusing  on  individual  utility 
maximization  leads  rationalists  naturally  to  break 
apart  the  state  and  look  into  the  black  box  of 
structural  realism. 

While  the  premise  of  self-regarding  rational 
action  might  strike  one  as  somewhat  trivial  at  first, 
it  has  been  revolutionary  for  IR  theory  and  has  led 
scholars  to  question  what  was  often  the  received 
wisdom  of  structural  realism.  For  example,  while 
a  realist  would  stress  how  the  importance  of 
national  security  serves  as  a  powerful  incentive 
against  petty  domestic  politics  interfering  with  for¬ 
eign  and  security  policy,  rationalists  have  focused 
on  the  often  diverging  interests  within  a  country, 
for  example,  economic  interests  or  office-seeking 
politicians.  There  is  a  large  literature  that  argues 
that  major  questions  of  war  and  crisis  initiation 
are  dictated  by  the  electoral  calendar  or  the  domes¬ 
tic  popularity  of  the  government,  sometimes 
known  as  the  “diversionary  theory  of  war.”  Many 
have  found  an  economic  rationale  underlying 


grand  strategy — the  fundamental  and  guiding  prin¬ 
ciples  of  a  state’s  foreign  policy  generally  assumed 
to  be  driven  by  a  state’s  structural  position  (i.e.,  its 
relative  power  or  its  geography)  or  its  core  ideol¬ 
ogy.  In  this  view,  instead  of  reflecting  a  national 
consensus  or  security  imperatives,  security  policy 
is  actually  a  post  hoc  rationalization  of  the  eco¬ 
nomic  interests  of  powerful  constituencies  seeking 
to  secure  and  protect  export  markets. 

By  examining  domestic  politics  in  this  manner, 
rationalists  allow  us  to  see  that  there  is  often  a 
divergence  between  state  policy  and  what  is  good 
for  the  society  as  a  whole.  This  often  makes  for 
suboptimal  policy.  For  instance,  leaders  might 
continue  fighting  wars  long  past  the  point  at  which 
victory  is  likely  because  they  fear  the  domestic 
political  consequences  of  defeat.  The  same  coun¬ 
terintuitive  logic  is  seen  in  international  political 
economy,  as  states  embrace  protectionism  that 
does  not  serve  their  interest  in  terms  of  collective 
welfare  but  rather  serves  a  narrow  parochial  eco¬ 
nomic  interest  with  a  greater  stake  in  the  outcome. 

Institutions  are  generally  crucial  for  explaining 
this  gap  between  the  aggregate  welfare  of  a  nation¬ 
state  and  its  actual  policy.  Depending  on  their 
configuration,  institutions  privilege  some  actors 
over  others,  giving  a  few  the  opportunity  to  direct 
policy  in  a  direction  that  serves  their  parochial 
interests  rather  than  the  common  good.  This  dis- 
juncture,  however,  is  something  that  varies. 
Democratic  foreign  policy  is  thought  to  reflect 
more  accurately  the  aggregate  welfare  of  society  as 
a  whole  inasmuch  as  democracies  have  more  inclu¬ 
sive  representation.  However,  given  the  asymme¬ 
try  of  the  stakes  and  information,  there  is  often  a 
wide  gap  between  collective  welfare  and  policy  in 
democracies  as  well. 

The  sum  total  of  this  view  is  a  very  pluralistic 
approach  of  “boxes  within  boxes.”  Individuals 
aggregate  into  groups  on  the  basis  of  common 
interests  and  engage  in  a  process  of  bargaining  and 
strategic  interaction  to  realize  their  preferred  ends. 
Domestic  institutions  are  a  key  mediating  influ¬ 
ence  as  they  privilege  some  groups  over  others. 
Once  a  particular  policy  preference  emerges  among 
competing  groups,  say  political  parties  in  a  democ¬ 
racy,  the  broader  group  to  which  they  both 
belong,  say  the  state,  then  engages  in  a  process  of 
bargaining  and  strategic  interaction  with  other 
like  units. 
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Domestic  institutions  are  also  important  in  stra¬ 
tegic  interaction  in  that  they  allow  for  credible 
signaling  in  bargaining  situations,  a  primary  con¬ 
cern  of  rationalism  given  its  utilitarian  core.  For 
instance,  in  the  case  of  a  deterrence  dispute  or  a 
trade  negotiation,  a  state  must  undertake  an  action 
that  states  that  lacked  resolve  would  or  could  not 
pursue.  This  can  be  done  through  “costly  signal¬ 
ing.”  As  the  accountability  of  democratic  institu¬ 
tions  means  that  leaders  face  significant  costs  in 
case  of  failure,  military  escalation  by  democratic 
countries  during  crises  is  therefore  a  more  credible 
signal  of  resolve  than  similar  action  by  nondemoc¬ 
racies.  Another  option  is  to  make  private  informa¬ 
tion  about  a  country’s  commitment  and  capability 
more  public  to  demonstrate  that  it  has  the  power 
and  the  resolve  to  prevail.  Democracies  are  also 
more  transparent  and  information  rich,  allowing 
them  to  convey  more  persuasively  their  interests 
and  will  when  resolute  or  their  peaceful  intentions 
when  they  want  to  reassure.  However,  the  nature 
of  democracies  can  also  prove  negative  in  negotia¬ 
tions.  Given  the  presence  of  elections  in  demo¬ 
cratic  societies,  negotiating  partners  might  fear 
that  democracies  are  less  able  to  commit  long  term 
to  their  agreements,  such  as  an  agreement  to  liber¬ 
alize  trade. 

Even  while  rationalists  embrace  the  domestic 
level  of  analysis,  its  emphasis  on  institutions  often 
comes  at  the  neglect  of  domestic  politics  per  se. 
Domestic  institutions  structure  the  political  envi¬ 
ronment,  acting  as  constraints  on  self-interested 
political  actors.  Rationalists  focus  on  structural 
features,  albeit  at  the  domestic  level,  but  do  not 
devote  great  attention  to  the  political  process  that 
plays  out  within  them.  As  in  the  case  of  neoreal- 
ism,  the  preferences  of  political  actors  are  more 
often  assumed  than  explained.  A  common  assump¬ 
tion  is  that  politicians  are  “office  seekers”  who 
design  their  political  agenda,  in  foreign  as  well  as 
domestic  affairs,  around  the  goal  of  obtaining  or 
maintaining  office.  Using  these  universally  appli¬ 
cable  assumptions,  rationalists  can  explain  varia¬ 
tion  in  terms  of  the  differing  types  of  domestic 
institutions.  These  are  structural  arguments  that 
do  not  problematize  the  origin  of  political  prefer¬ 
ences  and  the  political  process  that  brings  about 
policy.  Interestingly  enough,  rationalism  can  be 
thought  in  some  ways  to  apply  structural  realist 
assumptions  at  the  domestic  level  of  analysis. 


Politicians,  like  states,  seek  to  survive,  with  sur¬ 
vival  meaning  in  this  case  staying  in  office  and 
maintaining  power.  Rationalism  does  not  truly 
eliminate  the  black  box  assumption.  It  merely  adds 
boxes  of  different  colors  and  sizes. 

Constructivism 

The  constructivist  research  project  is  a  powerful 
critique  of  structural  realism.  At  its  core  is  the  idea 
that  the  nature  of  IR  is  social  and  as  such  is  a  con¬ 
stant  project  of  construction.  If  foreign  affairs  are 
marked  by  mistrust,  balance  of  power,  and  war, 
this  is  because  states  in  the  system  have  embraced 
a  particular  culture  of  anarchy ,  to  use  Alexander 
Wendt’s  term.  Given  the  social  nature  of  IR,  how¬ 
ever,  it  is  capable  of  being  reconstituted  on  the 
basis  of  different  social  practices.  States  can  adopt 
different  norms  and  identities  that  lead  to  different 
behaviors  and  transform  the  character  of  interna¬ 
tional  interaction.  The  nature  of  anarchy  is  not 
determinate  or  given;  it  is  mutable.  Constructivism’s 
logic  leads  to  a  natural  focus  on  “ideational  fac¬ 
tors”  such  as  norms,  values,  culture,  and  ideology. 

While  this  opens  up  the  potential  for  political 
agency  and  therefore  domestic  politics,  the  ten¬ 
dency  among  IR  constructivists  has  been  to  focus 
on  social  interaction  at  the  international  level  rather 
than  the  domestic  level.  The  norms  it  uncovers  and 
explains  are  generally  systemic  ones  that  lead  to 
transnational  convergence  in  behavior.  The  empiri¬ 
cal  fact  that  states  of  very  different  kinds  adopt  the 
same  norms  is  a  tool  for  showing,  theoretically,  that 
norms  are  powerful.  Given  that  the  primary  task  of 
the  paradigm  is  to  illustrate  that  what  are  regarded 
as  immutable  features  of  international  politics,  such 
as  self-help  or  sovereignty,  are,  in  fact,  social  con¬ 
structions,  this  is  not  surprising. 

Demonstrating  that  IR  are  social  also  involves 
showing  that  these  cultures  of  anarchy  become  rei¬ 
fied,  accepted  as  a  given  on  the  part  of  those 
embracing  the  norms  undergirding  the  system. 
That  is,  constructivists  emphasize  the  structure  in 
the  mutual  constitution  of  structure  and  agency. 
Constructivism’s  interest  tends  to  be  directed  at 
these  types  of  major  macro  outcomes  or  constitu¬ 
tive  principles  of  the  international  system  as  a 
whole. 

While  this  focus  on  reification  and  systemic 
norms  is  intrinsic  to  their  efforts  to  show  that 
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norms  matter,  constructivists  sometimes  lose  sight 
of  why  the  social  nature  of  IR  means  that  there 
should  be  great  domestic  political  contestation  on 
issues  of  IR.  Reification  means  the  absence  of 
agency,  as  political  actors  do  not  question  but 
rather  take  these  institutions  for  granted. 

There  are  some  exceptions,  however.  Con¬ 
structivist  literature  has  focused  on  the  effect  of  a 
state’s  political  culture,  embodied  in  its  political 
institutions.  Democratic  institutions  lead  to  differ¬ 
ent  types  of  foreign  policy  on  the  part  of  democra¬ 
cies  not  because  of  the  incentives  they  provide  to 
self-serving  politicians  but  because  they  are  the 
instantiation  of  a  set  of  values  externalized  in  for¬ 
eign  policy.  Democracies  build  stronger  alliances 
based  on  consultation,  mutual  identity,  and  trust. 

Constructivists  show  that  countries  are  marked 
by  particular  foreign  policy  cultures,  an  ideational 
consensus  on  the  principles  that  guide  a  country’s 
relations  with  others,  and  that  these  are  generally 
based  on  a  unique  conception  of  self  or  identity. 
These  cultures  are  not  objectively  given  by  a  state’s 
strategic  setting  but  socially  constructed  on  the 
basis  of  collective  experience  and  the  interpreta¬ 
tion  given  to  those  events. 

Both  the  focus  on  culturally  revealing  institu¬ 
tions  and  foreign  policy  cultures  tends  to  imply 
political  consensus  at  home.  However,  constructiv¬ 
ists  are  also  interested  in  ideological  contestation 
over  foreign  policy,  often  expressed  by  political 
parties  that  serve  as  vehicles  for  different  concep¬ 
tions  of  a  state’s  role  in  the  world  consistent  with 
the  set  of  values  they  represent  in  domestic  affairs. 
Such  considerations  may  affect  a  state’s  alignment 
choices,  steering  them  in  the  direction  of  like- 
minded  actors  abroad.  It  facilitates  analogies 
between  domestic  and  foreign  policy  situations 
that  allow  political  parties  to  develop  unique 
stands  on  foreign  policy  issues  that  contrast  with 
their  domestic  political  competitors  at  home. 

Neoclassical  Realism 

Ironically,  some  of  the  most  innovative  and  inter¬ 
esting  work  on  domestic  politics  and  IR  is  being 
done  by  neoclassical  realists,  heirs  to  Waltz’s  leg¬ 
acy  of  structural  realism.  While  Waltz  stressed  that 
the  anarchic  nature  of  the  international  system 
provided  incentives  to  states  to  help  themselves,  he 
was  careful  to  note  that  it  was  ultimately  up  to 


states  themselves  to  understand  their  environment 
and  heed  its  lessons.  The  system  could  only  punish 
after  the  fact,  not  force  states  to  act  in  a  particular 
way.  Waltz  does  not  necessarily  expect  that  all 
states  will  act  in  ways  that  suit  the  system’s  imper¬ 
atives,  only  that  those  that  do  not  will  be  punished. 
Domestic  politics,  by  privileging  parochial  inter¬ 
ests  over  that  of  the  collective  whole,  can  divert 
states  from  their  optimal  path.  When  they  do  so, 
the  system  will  punish  them.  Waltz  thus  offers  an 
observation  and  a  caution — where  we  see  exces¬ 
sive  intrusion  of  domestic  politics  into  foreign 
policy  making,  the  state  will  not  act  as  a  unitary 
actor  and  foreign  policy  will  be  suboptimal. 

Neoclassical  realists  demonstrate  that  foreign 
policy  yields  the  best  outcomes  when  the  state  acts 
as  a  coherent  and  unified  political  entity.  For 
instance,  strong  states  better  able  to  extract 
resources  from  society  are  more  powerful  than 
those  that  cannot  and  better  able  to  innovate  mili¬ 
tarily  to  meet  security  challenges  by  emulating  the 
practices  of  the  best  performing  states.  More  frag¬ 
mented  states,  which  resemble  liberalism’s  plural¬ 
ist  model,  are  less  adapted  to  this  environment. 
When  states  are  captured  by  parochial  interest 
groups  serving  their  own  narrow  agenda,  states  are 
led  in  disastrous  directions. 

Social  and  elite  cleavages  inhibit  group  feeling 
and  cohesion,  which  makes  states  less  able  to  act  as 
unitary  actors.  These  divisions  reflect  the  prioriti¬ 
zation  of  more  parochial  interests  such  as  ethnic, 
cultural,  ideological,  religious,  class,  bureaucratic, 
regional,  or  party  over  the  national  interest. 
Governments  presiding  over  fractured  societies  are 
weak  and  are  unable  to  take  the  steps  necessary  to 
counter  real  threats.  These  factors  interfere  with 
the  proper  operation  of  self-help  and  the  balance  of 
power.  Institutions  also  matter.  Strong  legislatures 
representing  smaller  sectional  interests  interfere 
with  the  making  of  a  rational  foreign  policy  by 
policy  executives.  Legislatures  provide  an  access 
point  into  foreign  policy  for  narrow  constituencies 
that  pull  the  settlement  away  from  an  ideal  point 
that  the  state  would  otherwise  prefer. 

Unlike  in  rationalism,  however,  the  state  is  a 
genuine  entity  with  a  life  of  its  own  that  acts  as  the 
true  representative  of  national  interests.  It  serves  as 
the  perspicacious  and  sagacious  agent  of  a  country’s 
true  interests,  threatened  by  smaller,  sectional 
interests.  Society  might  be  powerful  enough  to 
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sometimes  trump  the  state,  but  the  latter  is  gener¬ 
ally  not  merely  the  reflection  of  aggregation  of  the 
former’s  interests,  as  is  the  case  with  the  more 
pluralist  theory  of  the  state  adopted  by  liberals. 
This  concept  of  the  state  separates  the  neoclassical 
realist  literature  from  others  that  stress  how  state 
representatives  are  primarily  motivated  by  their 
own  egoistic  and  parochial  interests  in  staying  in 
office.  Where  this  does  occur,  neoclassical  realism 
endeavors  to  show  that  there  are  severe  conse¬ 
quences.  The  system  punishes. 

Current  Trends 

These  three  paradigms  offer  three  unique  ways  in 
which  processes  at  the  domestic  level  of  analysis 
influence  IR.  In  rationalism,  domestic  politics  are 
the  battleground  for  competing  individual  and 
group  interests  over  the  state.  In  constructivism, 
the  domestic  level  is  the  place  where  we  locate 
culture  and  identity  conceptions  that  guide  a  coun¬ 
try’s  foreign  relations  in  a  unique  way  or  a  place  in 
which  this  is  contested  on  ideological  grounds.  In 
realism,  domestic  politics  is  the  source  of  obstacles 
to  a  state’s  pursuit  of  the  genuine  national  interest. 
Currently,  rationalism  is  the  starting  point  for 
most  who  take  domestic  politics  seriously. 
Constructivism  has  dealt  less  with  the  subject. 
Ironically,  neoclassical  realists  provide  some  of  the 
most  exciting  new  thinking  on  the  topic. 

Brian  Rathbun 
University  of  Southern  California 
Los  Angeles,  California,  United  States 
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Durkheim,  Emile  (1858-1917) 


Emile  Durkheim  was  a  French  sociologist  who 
taught  at  the  University  of  Bourdeaux  and  succes¬ 
sively  at  the  Paris  Sorbonne.  His  main  works 
include  De  la  Division  du  Travail  Social  (The 
Division  of  Labor  in  Society,  1893),  Les  Regies  de 
la  Methode  Sociologique  (Rules  of  Sociological 
Method,  1895),  Le  Suicide:  Etude  de  Sociologie 
(Suicide:  A  Study  in  Sociology,  1897),  and  Les 
Formes  Elementaires  de  la  Vie  Religieuse  (The 
Elementary  Forms  of  Religious  Life,  1912). 
Durkheim  committed  his  whole  professional  life, 
within  the  French  university  system,  to  the  task  of 
establishing  sociology  as  a  scientific  discipline  dis¬ 
tinct  from  philosophy,  psychology,  and  other 
social  sciences.  His  intent  (one  might  label  it 
“positivistic”)  was  to  ground  sociological  dis¬ 
course  on  an  objective  approach  to  social  facts 
(i.e.,  collective  “manners  of  thinking,  acting,  and 
feeling”).  In  pursuing  this  goal,  he  gave  relatively 
little  attention  in  his  writings  to  expressly  political 
phenomena,  the  intellectual  discussion  of  which, 
in  the  fin  de  siecle  French  context,  was  chiefly  the 
concern  of  philosophical,  juridical,  and  ideologi¬ 
cal  discourses. 

It  can  plausibly  be  said  that  Durkheim’s  work 
was  chiefly  aimed  at  providing  a  scientific  solution 
to  the  problem  of  social  order.  Aware  as  he  was 
that  the  problem  had  been  posed  in  the  most  com¬ 
pelling  manner  by  Thomas  Hobbes,  a  political 
philosopher,  Durkheim  did  not  accept  Hobbes’s 
own,  expressly  political  solution  of  that  problem: 
the  fear-motivated  submission  of  all  individuals  to 
an  unchallengeable  sovereign  brought  into  being 
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artificially  by  the  social  contract.  According  to 
Durkheim,  humans  are  by  nature  intensely  social 
beings,  and  the  individuals’  ability  and  willingness 
to  coordinate  their  reciprocal  activities  could  not 
be  reliably  and  durably  sustained  by  the  coercive 
enforcement  of  the  sovereign’s  commands. 
Normally,  instead,  it  expressed  a  condition  of  soli¬ 
darity  between  individuals. 

However,  in  the  course  of  social  evolution,  soli¬ 
darity,  while  always  called  on  to  ground  and  guar¬ 
antee  the  social  order,  presents  itself  in  varying 
forms.  In  Division  (the  first  and  perhaps  the  great¬ 
est  of  his  works),  Durkheim  sharply  simplified 
such  variation  by  subsuming  all  forms  of  solidarity 
under  one  or  the  other  of  two  basic,  sharply  con¬ 
trasted  types:  mechanical  and  organic. 

Mechanical  solidarity  is  associated  with  the  ear¬ 
liest  stages  of  social  evolution.  Here,  societies  that 
might  be  called  “primitive”  have  small  popula¬ 
tions  operating  over  relatively  large  territories 
(thus,  characterized  by  low  densities).  Their  sub¬ 
units  differ  very  little  from  one  another,  enjoy 
rather  poor  command  over  natural  resources,  and 
are  sustained  only  by  rudimentary  technology  and 
by  minimal  division  of  labor;  thus,  only  to  a  very 
limited  extent  do  they  exchange  their  products 
with  one  another.  In  turn,  within  each  of  these 
subunits,  the  individuals  themselves,  the  ultimate 
components  of  the  society,  again  barely  differ  from 
one  another,  for  the  existence  of  all  is  managed 
with  reference  to  universally  shared  bodies  of  prac¬ 
tical  knowledge.  They  all  harbor  within  their 
minds  the  same  cultural  patrimony,  which  all  sub¬ 
units  periodically  revisit  and  celebrate  on  ceremo¬ 
nial  occasions.  Thus,  in  a  society  of  this  nature, 
solidarity  rests  on  the  extent  to  which  individuals 
are  like  one  another,  for  they  all  share  understand¬ 
ings  of  what  is  true  and  proper,  which  cover  all  the 
contingencies  of  the  society’s  simple,  tradition- 
based  mode  of  existence. 

The  second  type  of  solidarity,  the  organic,  is 
common  to  all  other  societies,  which  may  be 
labeled  “advanced.”  These  have  evolved  out  of 
“primitive”  ones  and  are  characterized  chiefly  by  a 
more  or  less  advanced  degree  of  division  of  labor, 
and  thus  by  their  complexity.  Typically,  these  soci¬ 
eties  have  large  populations,  densely  settled  over 
the  territory,  and  their  subunits  are  differentiated 
from  one  another;  part  of  the  population,  for 
instance,  inhabits  towns;  the  rest  inhabits  the 


countryside.  Different  regions  specialize  in  the  pro¬ 
duction  of  different  goods  and  services  and 
exchange  these  with  one  another.  Furthermore,  the 
individuals  inhabiting  each  locale  are  in  turn  dif¬ 
ferentiated  from  one  another  with  regard  to  their 
occupation,  their  skills,  and  the  techniques  they 
employ  in  performing  their  specialized  tasks. 
Inevitably,  they  share  only  a  small  portion  of  the 
increasingly  wide,  diverse,  and  changing  cultural 
patrimony  of  society  as  a  whole  while  they  make 
their  own  and  identify  with  a  narrow  subset  of  it 
that  varies  according  to  their  differentiated  condi¬ 
tions.  Put  otherwise,  individuals  are  increasingly 
individualized  and  expected  and  authorized  to 
operate  as  self-conscious,  self-activated,  and  self- 
seeking  units. 

The  solidarity  typical  of  such  a  society  can  no 
longer  be  grounded  on  the  universal  sharing  of 
beliefs,  values,  and  norms;  rather,  it  rests  on  the 
extent  to  which  society’s  subunits  (and  within 
these  the  individuals  themselves)  have  become  dif¬ 
ferentiated,  for  this  process  makes  them  interde¬ 
pendent  and  generates  wide  and  flexible  networks 
of  exchange.  Individuals,  in  particular,  are  induced 
to  relate  to  one  another  largely  by  means  of  con¬ 
tractual  arrangements,  where  each  party’s  legiti¬ 
mate  concern  to  foster  its  own  well-being  finds 
expression  in  its  efforts  to  provide  for  the  other’s 
needs  and  to  contribute  to  the  other’s  well-being. 

One  could  say  that  according  to  Durkheim,  in 
the  course  of  social  evolution,  the  basis  of  solidar¬ 
ity  (thus,  the  solution  to  the  problem  of  social 
order)  shifted  from  cultural  homogeneity  to  mate¬ 
rial  interdependence.  Neither  basis,  on  the  face  of 
it,  is  expressly  political  in  nature.  However, 
Durkheim’s  profound,  sophisticated  sociological 
discourse  could  not  fail  to  confront  at  different 
points,  in  a  more  or  less  explicit  and  elaborate 
manner,  the  political  dimension  of  social  affairs, 
and  as  it  did  so,  it  produced  results  worthy  of 
attention,  including  some  of  marked  originality. 

Perhaps  Durkheim’s  awareness  of  political 
phenomena  was  inspired  by  some  traits  of  his  per¬ 
sonality.  He  was  an  austere  man  and  rather  force¬ 
ful  man,  whose  strong  sense  of  hierarchy  and 
authority  found  expression,  among  other  things,  in 
the  way  he  selected  and  trained  his  intellectual 
followers  and  monitored  their  performance.  Also, 
in  the  pursuit  of  his  lifelong  mission — again,  estab¬ 
lishing  the  discipline  of  sociology  in  the  French 
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academic  system — he  found  it  useful  to  associate 
himself  in  the  capacity  of  a  privileged  consultant 
with  the  ministerial  authorities  who  managed  that 
system  with  a  maximum  of  central  direction  and 
control  from  above.  He  took  the  same  approach  in 
attempting  to  accomplish  a  broader  task — having 
sociology  (which  to  him  meant  his  own  sociology) 
provide  an  intellectual  inspiration  for  the  structur¬ 
ing  of  the  educational  system  at  large  and  for  the 
shaping  of  its  programs  specifically.  Finally,  he 
was  a  committed  republican,  convinced  of  the 
superior  merits  (not  just  for  France)  of  the  “prin¬ 
ciples  of  1789” — egalite,  liberte,  fraternite — the 
last  of  which  inspired  his  concern  for  solidarity. 

In  any  case,  Durkheim  also  acquired  a  sense  for 
the  significance  of  political  phenomena  from  the 
substance  of  his  own  sociological  discourse,  begin¬ 
ning  with  Division  and  other  early  works. 
According  to  Regies  de  la  Metbode  Sociologique, 
for  instance,  only  “manners  of  thinking,  acting 
and  feeling”  that  are  “sanctioned”  can  be  qualified 
as  proper  “social  facts.”  That  is,  there  have  to  be 
arrangements,  no  matter  how  varied  across  time 
and  space,  for  rewarding  those  observing  some 
manners  (positive  sanctions)  and/or  for  taking  to 
task  those  disrespecting  and  violating  them  (nega¬ 
tive  sanctions).  Now,  the  idea  of  sanction  itself, 
and  especially  of  negative  sanction,  imparts,  so  to 
speak,  a  vertical  dimension  to  the  collective  unit 
(a  society  or  a  group),  which  privileges  some 
“manners”  and  makes  them  its  own;  it  suggests, 
however,  a  structured  and  justified  command-and- 
obedience  relationship  between  one  part  of  the 
unit  and  the  others,  and  such  a  relationship  has 
traditionally  been  seen  as  intrinsically  political. 

In  Division,  Durkheim  used  as  an  indicator  of 
the  changing  nature  of  solidarity  the  laws  and  in 
particular  the  prevailing  nature  of  the  sanctions 
applied  to  violators  of  legal  expectations.  In  primi¬ 
tive  societies  (where  the  laws  were  mostly  custom¬ 
ary),  punitive  sanctions  prevailed,  for  society’s 
intense  reprobation  of  universally  shared  and  cher¬ 
ished  norms  found  expression  in  the  infliction  of 
suffering  on  violators.  In  advanced  societies,  pun¬ 
ishment  became  less  significant  than  restitutive 
sanctions.  Here,  whenever  a  party  to  a  contract  fails 
to  fulfill  its  obligations  toward  the  other,  it  allows 
the  latter,  after  fulfilling  its  own  obligations,  to 
recover,  in  some  way  or  other,  what  it  legitimately 
expected.  Again,  this  way  of  conceptualizing  the 


critical  difference  points  up,  however  implicitly, 
the  vertical  structuring  of  both  societies;  for  even 
the  application  of  restitutive  sanction,  no  less  than 
that  of  punitive  ones,  requires  some  authoritative, 
power -ful  arrangements  for  identifying  and  deal¬ 
ing  with  violators.  And  it  seems  appropriate  to 
think  of  such  arrangements  as  political  in  nature. 

Furthermore,  in  Division,  Durkheim  opposes 
the  excessive  emphasis  placed  by  some  authors 
(both  in  his  time  and  ours)  on  contract  (and  thus 
on  the  market)  as  the  distinctive,  superior  way  of 
coordinating  the  actions  of  individuals  in  advanced 
society.  Durkheim  agrees  that  contract  is  central  to 
such  a  society  and  is  associated  with  significant 
moral  values  (such  as  individual  autonomy  and 
responsibility)  but  insists  that  it  should  not  dis¬ 
place  authoritative,  public  institutions  as  regula¬ 
tors  of  private,  self-interested  activities.  For  him, 
“not  everything  is  contractual  about  the  contract.” 
Individuals  can  enter  binding  contracts  only  by 
availing  themselves  of  the  institution  of  contract,  a 
set  of  preexistent  authoritative  arrangements  estab¬ 
lishing  who  can  make  contracts  about  what  mat¬ 
ters,  in  what  forms,  and  with  what  consequences. 

The  notion  of  institution  is  of  utmost  signifi¬ 
cance  for  Durkheim,  who  at  one  point  defined 
sociology  itself  as  the  science  of  institutions.  These 
are  arrangements  (bodies  of  personnel,  distinctive 
practices,  resources,  etc.)  focused  on  matters  of 
central  concern  to  the  whole  society  and  backed  by 
a  diffuse  moral  appreciation  of  them  as  morally 
valid.  But  one  can  also  see  institutions  as  operating 
“vertically” — operating  on  society,  as  it  were, 
from  above. 

This  aspect  of  institutions  is  apparent  when 
Durkheim  raises  political  matters — developing  and 
applying  expressly  sociological  insights  and  eschew¬ 
ing  any  involvement  in  the  politics  of  parties — and 
proposes  institutional  changes  he  obviously  hoped 
would  be  realized  via  legislative  acts. 

The  characteristic  dynamism  of  modern  society, 
propelled  by  the  competition  between  economic 
actors  and  by  technological  innovation,  often  gen¬ 
erates  conflicts  and  crises  and  atomizes  individuals, 
depriving  them  of  the  normative  guidance  previ¬ 
ously  provided  by  tradition  and  by  local  member¬ 
ships.  The  machinery  for  regulation  operated  by 
the  state  and  activated  by  the  competition  of  parties 
mostly  cannot  deal  effectively  with  these  new  cir¬ 
cumstances.  To  remedy  this,  the  state  itself  should 
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empower  “corporations” — public  bodies  identified 
with  specific  sectors  of  the  economy — to  monitor 
closely  the  economic  process,  discipline  the  unruly 
relations  between  employers  and  employees,  and 
instill  in  both  a  sense  of  responsibility  toward  cus¬ 
tomers  and  toward  the  public  at  large. 

In  a  lecture  course  Durkheim  taught  repeatedly 
at  Bordeaux  and  at  the  Sorbonne,  whose  content 
was  made  available  after  World  War  II,  this  corpo- 
ratist  proposal  (put  forward  also  in  other  writings) 
complements  a  sophisticated  and  original  argu¬ 
ment  on  expressly  political  affairs.  According  to  an 
insight  already  present  in  Division,  a  critical  step 
in  the  advance  of  the  division  of  labor  was  consti¬ 
tuted  by  the  emergence  of  a  part  of  society 
entrusted  with  the  task  of  controlling  the  whole  by 
making  and  implementing  specific  decisions  con¬ 
cerning  its  relations  with  the  (natural  and  social) 
environment. 

In  modern  society,  this  task  is  performed  in  a 
particularly  purposeful  and  self-conscious  manner 
by  the  state.  This  stands  to  the  rest  of  society  as  the 
brain  stands  to  the  organism  of  advanced  animal 
species;  its  essential  function  is  thus  the  acquisition 
and  processing  of  information  about  the  relevant 
environments,  leading  to  the  formation  and  imple¬ 
mentation  of  collective  decisions. 

This  bold,  “cybernetic”  view  of  the  state 
connects  in  Durkheim’s  views  with  a  particular 
understanding  of  democracy.  Durkheim  consid¬ 
ered  democratic  arrangements  for  political  repre¬ 
sentation  via  parties,  coupled  with  bureaucratic 
administration,  as  particularly  effective  ways  of 
generating  and  accumulating  information  on  social 
affairs  and  communicating  it  upward  and  down¬ 
ward,  thus  allowing  decisions  to  be  made  in  a 
particularly  deliberate  and  sophisticated  manner. 
Thus,  according  to  Durkheim,  democracy  was  a 


particularly  apt  mechanism  for,  as  it  were,  putting 
and  keeping  state  and  society  in  touch  with  one 
another.  By  the  same  token,  democratization  was 
the  most  appropriate  political  expression  of  the 
advance  of  modernity. 

Although  here  Durkheim  confronts  expressly 
political  themes,  and  imaginatively  emphasizes 
their  informational  and  communicational  aspects, 
he  articulates,  once  more,  a  “vertical”  understand¬ 
ing  of  them,  attending  chiefly  to  political  institu¬ 
tions  and  paying  little  attention  to  what  could  be 
considered  the  “horizontal”  aspect  of  political 
experience — that  is,  to  politics.  This  may  express, 
again,  his  prevalent  concern  with  social  order, 
which  induced  him  to  consider  with  distaste  social, 
and  thus  also  political,  conflict. 

Gianfranco  Poggi 
University  of  Trento 
Trento,  Italy 
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Economic  Policy 


Economic  policy  is  one  of  the  central  activities  of 
government.  This  entry  describes  the  changing 
nature  of  economic  policy,  discusses  the  impor¬ 
tant  forms  of  government  intervention,  and 
describes  the  emerging  trends  in  public  sector 
interventions  in  the  economy.  The  decision  mak¬ 
ers,  content,  and  policy  instruments  of  economic 
policy  have  changed  substantially  over  the  past 
50  years.  It  was  once  a  more  straightforward  mat¬ 
ter  to  define  economic  policy  than  it  is  in  the  early 
21st  century.  At  one  time,  economic  policy  was 
about  macroeconomic  policy — the  targeting  of 
policy  objectives  such  as  growth,  employment, 
and  inflation  through  measures  to  influence  the 
demand  side  of  the  economy  principally  using  the 
policy  instruments  of  fiscal  policy,  taxation,  and 
government  spending.  National  governments  were 
the  principal  decision  makers  and  were  held 
accountable  by  their  electorates  in  democratic 
societies  for  the  conduct  of  economic  policy.  In 
communist  societies,  the  economic  plan  was  a 
central  device  for  achieving  socialism. 

Internationalization  of  Economic  Policy 

At  the  domestic  level,  governments  now  share  their 
economic  decision-making  authority  with  central 
banks  and  a  range  of  bodies  that  seek  to  regulate 
the  financial  sector  in  particular.  However,  eco¬ 
nomic  policy  has  become  much  more  international¬ 
ized  with  a  complex  system  of  multilevel  decision 


making.  Bodies  such  as  the  World  Trade  Organiza¬ 
tion  (WTO)  set  the  rules  within  which  economic 
activity  takes  place,  while  the  G-8  and,  more 
recently,  the  G-20  seek  to  coordinate  the  economic 
policies  of  major  governments.  The  27  member 
states  of  the  European  Elnion  (ELI),  particularly 
those  using  the  euro,  face  another  regional  level  of 
decision  making  that  shapes  that  economic  policy. 

This  more  internationalized  system  of  decision 
making  reflects  the  increasing  internationalization 
of  economic  activity.  In  the  period  between  World 
War  I  and  World  War  II,  nation-states  pursued 
policies  of  autarchy  in  which  they  aimed  for  self- 
sufficiency  either  within  their  own  national  bor¬ 
ders  or  in  economic  systems  that  embraced  their 
colonies.  These  policies  of  protectionism,  com¬ 
bined  in  some  countries  with  the  denomination  of 
their  currency  in  gold — the  gold  standard — which 
produced  overvalued  currencies,  undermined  inter¬ 
national  trade  and  the  efficiency  gains  it  could 
provide  through  comparative  advantage. 

After  World  War  II,  the  world,  under  the  lead¬ 
ership  of  the  Elnited  States,  sought  to  move  toward 
a  more  liberal  international  economic  system,  at 
least  within  the  Western  bloc.  The  so-called 
Bretton  Woods  institutions  and  agreements — the 
International  Monetary  Fund  (IMF)  and  the  World 
Bank,  along  with  the  General  Agreement  on 
Tariffs  and  Trade  (GATT) — sought  to  create  the 
economic  and  political  space  within  which  this 
system  could  operate.  In  the  period  between  the 
end  of  World  War  II  and  the  1970s,  this  system 
was  built  around  fixed  exchange  rates,  pegged 
against  the  dollar  backed  by  gold  and  by  relative 
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capital  immobility.  These  arrangements  were  not 
without  costs  for  the  United  States,  but  it  was 
often  prepared  to  bear  them  in  the  pursuit  of 
broader  policy  objectives  such  as  underpinning 
Western  economies  in  the  context  of  the  Cold 
War.  Through  this  period,  international  trade  was 
stimulated  by  the  reduction  of  tariffs  through  the 
GATT  negotiating  rounds,  although  one  unin¬ 
tended  consequence  was  that  nontariff  barriers  to 
trade  started  to  assume  a  greater  importance. 

This  system  came  under  increasing  strain  in  the 
late  1960s.  In  part,  this  was  a  consequence  of  the 
long-run  decline  in  the  economic  dominance  of 
the  United  States,  exacerbated  in  the  short  run  by 
the  economic  strains  arising  from  the  Vietnam 
War  and  the  costs  of  programs,  including  Medicare, 
enacted  as  part  of  President  Lyndon  Johnson’s 
Great  Society  program.  In  the  early  1970s,  the 
Bretton  Woods  system  collapsed  when  the  United 
States  removed  the  link  between  the  dollar  and 
gold.  There  was  no  longer  a  fixed  rate  system  but 
only  currencies  that  fluctuated  substantially  against 
each  other.  One  consequence  was  that  the  European 
Economic  Community  started  moving  toward  a 
zone  of  regional  currency  stability,  initially  through 
the  creation  of  the  exchange  rate  mechanism  of  the 
European  Monetary  System  in  1979.  Another  was 
the  need  for  a  system  of  economic  policy  coordina¬ 
tion  between  the  major  economic  powers  to  com¬ 
pensate  for  the  inability  of  the  United  States  to 
take  a  leadership  role  by  the  establishment  of  eco¬ 
nomic  summits  in  1975  that  became  institutional¬ 
ized  as  the  G-7  (later  G-8)  process. 

The  industrial  dominance  of  the  United  States 
and  the  leading  European  countries  was  being 
challenged  by  the  emergence  of  newly  industrial¬ 
izing  countries  (NICs).  The  first  of  these  countries 
was  Japan.  The  economic  success  of  the  United 
States  had  been  built  on  Fordist  systems  of  the 
mass  production  and  consumption  of  standardized 
economic  goods,  which  relied  on  securing  efficien¬ 
cies  that  drove  down  unit  costs  to  produce  price- 
competitive  goods  that  could  first  be  sold  to  the 
large  American  market  and  then  through  branch 
plants  overseas.  A  major  motive  for  establishing 
the  common  market  in  Europe  was  to  achieve 
comparable  economies  of  scale  there. 

Japan  had  initially  competed  with  the  West  by 
producing  relatively  standard  industrial  goods  such 
as  steel  and  ships  using  Western  technology.  Among 


the  first  motor  vehicles  that  Japan  sold  to  the 
United  States  were  pick-up  trucks.  Over  time,  how¬ 
ever,  Japan  came  to  increasingly  emphasize  quality 
in  production  and  produce  more  sophisticated 
goods  that  challenged  Western  goods  in  their  home 
markets.  As  Japan  moved  upmarket,  a  number  of 
countries,  initially  “Asian  tigers”  such  as  South 
Korea  and  Taiwan,  moved  into  more  standardized 
goods  production  using  “off-the-shelf”  technology. 
They  were  able  to  sell  these  goods  into  Western 
markets  because  of  the  lowering  of  tariff  barriers. 
This  led  to  the  development  of  new  policy  instru¬ 
ments  in  the  form  of  nontariff  barriers  to  restrain 
the  penetration  of  Western  domestic  markets  by 
these  NICs. 

The  sharp  increase  in  oil  and  other  commodity 
prices  in  the  early  1970s  triggered  a  process  known 
as  stagflation.  Since  1945,  the  Western  countries 
had  enjoyed  a  long  postwar  boom  with  high 
growth  rates  and  low  levels  of  unemployment, 
although  this  was  accompanied  by  increasing  prob¬ 
lems  in  restraining  inflation.  This  had  been  made 
possible  by  a  variety  of  factors:  technological 
breakthroughs  (e.g.,  the  widespread  use  of  new 
materials  such  as  plastics),  postwar  reconstruction, 
demographic  shifts,  and,  more  controversially,  new 
forms  of  economic  policy.  In  the  1970s,  growth 
slowed,  unemployment  rose,  and  inflation  increased. 
These  problems  were  compounded  by  the  second 
oil  crisis  or  shock  at  the  end  of  the  1970s,  which 
triggered  a  recession  in  the  early  1980s. 

Information  Technology 
and  Economic  Policy 

Toward  the  end  of  the  1980s,  the  world  began  to 
experience  the  benefits  of  an  industrial  revolution 
comparable  to  the  harnessing  of  steam  power  and 
the  development  of  the  factory  system  at  the  end  of 
the  1 8th  century  and  the  science-led  industrial  revo¬ 
lution  at  the  end  of  the  19th  century  based  on  the 
application  of  electricity  to  industrial  production 
and  the  development  of  science-based  industries 
such  as  chemicals.  This  third  industrial  revolution 
was  based  around  information  technology. 
Mainframe  computers  had  been  available  for  some 
time  and  had  been  used  for  tasks  such  as  processing 
inventory  and  wages.  However,  it  became  possible 
to  network  micro  or  desk  computers,  and  this 
opened  up  new  possibilities  in  information 
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management  in  companies.  Advances  in  computer- 
aided  manufacturing  and  computer-aided  design 
also  facilitated  automation  and  precision  produc¬ 
tion  in  manufacturing.  This  in  turn  made  possible 
more  variations  in  forms  of  production,  thus  facili¬ 
tating  differentiated  batches  of  production  to  serve 
niche  markets.  The  Fordist  world  of  production  was 
to  become  a  post-Fordist  one,  in  terms  of  both  sys¬ 
tems  of  production  and  consumer  preferences. 

By  creating  new,  faster,  and  more  reliable  sys¬ 
tems  of  communication,  information  and  commu¬ 
nication  technology  (ICT)  also  facilitated  the 
increasing  integration  of  the  world  economy.  This 
had  been  a  long-term  process,  driven  by  technol¬ 
ogy,  market  organization,  and  policy.  It  increas¬ 
ingly  became  referred  to  as  “globalization,” 
although  this  was  a  highly  contested  term.  It 
implied  that  there  had  been  a  step  change  in  the 
organization  of  the  world  economy,  and  not  all 
analysts  accepted  that  this  had  been  the  case. 
There  was  also  considerable  normative  dispute 
about  whether  globalization  was  beneficial  or  not, 
and  this  conditioned  policy  responses  to  it. 

Trends  in  Economic  Policy 

It  is  possible  to  identify  a  general  trend  in  the  world 
economy  with  the  following  characteristics.  Except 
in  intermittent  recessions,  trade  tended  to  grow 
faster  than  national  output,  and  this  upward  trend 
line  was  invariably  resumed  once  a  recession  had 
ended.  Foreign  investment  tended  to  grow  faster 
than  trade  as  multinational  companies  became  more 
dominant  in  the  international  economy.  Indeed, 
trade  increasingly  took  the  form  of  trade  within 
multinational  companies  as  they  integrated  produc¬ 
tion.  The  development  of  these  multinational 
companies  created  a  need  for  a  more  integrated 
international  financial  system  with  a  wider  and 
more  sophisticated  range  of  financial  instruments. 
These  financial  instruments  increasingly  developed 
a  momentum  of  their  own  with  a  particular  devel¬ 
opment  of  trading  in  debt.  Regulators  seeking  to 
supervise  these  developments  faced  asymmetric 
information  problems:  There  were  too  many  instru¬ 
ments  to  supervise  effectively  with  the  resources 
available,  particularly  given  their  complexity,  and  in 
any  case  the  prevailing  philosophy  was  one  of  “light 
touch”  regulation.  The  risks  associated  with  this 
approach  became  apparent  in  the  financial  crisis  of 


2008,  which  was  in  large  part  triggered  by  the  issu¬ 
ance  of  subprime  mortgages  to  borrowers  with 
poor  credit  histories.  These  subprime  mortgages 
were  then  bundled  into  packages  of  debt  that  were 
traded. 

The  fundamental  problem  was  that  economic 
policy  was  still  conducted  largely  at  a  national 
level,  yet  the  economic  structure  was  increasingly 
internationalized.  What  institutions  were  available 
to  coordinate  economic  policy  at  an  international 
level,  and  how  effective  were  they?  The  IMF  had 
been  created  after  World  War  II  to  resolve  the 
problem  of  global  imbalances.  These  problems 
remained  in  the  first  decade  of  the  21st  century, 
although  now  the  imbalance  was  principally 
between  the  G-2  countries  of  China  and  the  United 
States,  an  economic  relationship  that  was  compli¬ 
cated  by  tensions  over  security  and  human  rights 
issues.  In  economic  terms,  China  played  a  major 
role  in  funding  U.S.  government  debt,  enabling  the 
United  States  to  run  both  a  budget  deficit  and  a 
balance  of  payments  deficit,  the  latter  enabling 
China  to  sell  its  exports  into  the  U.S.  market. 

The  IMF  had  a  clearer  role  in  the  era  of  fixed 
currencies  and  relative  capital  immobility  in  the 
period  up  to  the  early  1970s.  After  that,  its  role 
became  less  certain,  although  it  was  able  to  make 
a  significant  intervention  when  British  economic 
policy  ran  into  trouble  in  1976,  imposing  restraint 
on  public  expenditure.  However,  as  the  story  of 
those  events  has  become  better  known  as  the 
archives  opened  up,  it  has  become  apparent  that 
domestic  actors  were  using  IMF  intervention  to 
impose  greater  restraint  than  was  strictly  neces¬ 
sary,  in  part  as  an  effort  to  restore  government 
authority,  which  had  been  undermined  by  the  role 
of  the  trade  unions  as  veto  groups  on  economic 
policy.  As  larger  sums  of  money  started  to  move 
around  the  international  financial  system,  the 
sums  available  to  the  IMF  became  less  significant 
as  leverage  on  government  policy.  In  the  financial 
crisis  of  2008,  the  IMF  was  able  to  bail  out 
smaller  economies  in  serious  trouble,  such  as 
Iceland  and  Latvia,  but  was  not  able  to  come  up 
with  any  general  solution  to  the  world’s  economic 
problems. 

That  role  would  have  at  one  time  fallen  on  the 
G-7,  which  became  the  G-8  with  the  addition  of 
Russia.  In  the  1970s,  the  G-7  had  a  clear  focus  on 
economic  issues,  but  it  increasingly  became  a  general 
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summit  covering  topics  such  as  terrorism.  Even 
when  it  did  make  pronouncements  on  economic 
issues,  such  as  international  trade  negotiations,  it 
seemed  to  have  little  impact.  The  leaders,  particu¬ 
larly  the  host  country,  seemed  to  treat  the  event  as 
a  major  profile-raising  exercise  for  domestic  audi¬ 
ences,  and  if  any  effective  work  was  done,  it  was  by 
the  “sherpas” — officials  who  prepared  the  ground 
for  the  meetings  through  bilateral  and  multilateral 
contacts.  Moreover,  the  G-7/G-8  appeared  to  be 
increasingly  irrelevant  to  the  structure  of  the  world 
economy.  It  did  not  include  China  and  other 
emerging  economies.  The  G-7  included  Canada 
and  four  EU  member  states:  Britain,  France,  Italy, 
and  Germany.  The  inability  of  the  EU  to  have  one 
representative  demonstrates  its  lack  of  unity  on 
economic  issues,  with  Britain,  in  any  case,  outside 
the  eurozone.  The  EU’s  initial  response  to  the 
financial  crisis  was  sluggish,  with  no  clear  central 
policy  making  on  economic  policy,  despite  the 
increasing  prominence  of  the  Economic  and 
Financial  Council  (Ecofin).  Perhaps  the  most 
important  set  of  policy  instruments,  certainly  in  the 
context  of  a  recession — namely,  those  of  fiscal 
policy — remained  under  the  control  of  the  individ¬ 
ual  member  states. 

The  international  response  was  to  establish  a 
new  body,  the  G-20,  which  included  emerging 
countries  such  as  Brazil,  China,  and  India,  reflecting 
the  shift  in  balance  of  economic  power  in  the  world. 
Some  commentators  were  concerned  that  the  new 
body  had  gone  too  far  in  the  direction  of  inclusive¬ 
ness  at  the  expense  of  effectiveness  by  bringing  in 
countries  such  as  Mexico,  Saudi  Arabia,  and  South 
Africa.  Nevertheless,  it  made  a  relatively  effective 
start  to  its  work,  eclipsing  the  G-7/G-8.  This  body 
was  left  with  an  uncertain  future  role.  However,  it 
was  important  not  to  exaggerate  the  extent  to 
which  power  had  shifted  away  from  the  traditional 
power  holders.  Even  if  the  United  States  could  no 
longer  impose  its  preferred  policies  on  others,  an 
economic  policy  solution  that  did  not  have  U.S.  sup¬ 
port  was  unlikely  to  meet  with  much  success. 

The  international  architecture  of  economic  pol¬ 
icy  making  has  become  increasingly  important,  but 
the  domestic  level  remains  significant.  It  is  impor¬ 
tant  to  appreciate  how  domestic  policy  has  evolved 
over  time  and  policy  instruments  have  changed. 
This  in  turn  needs  an  appreciation  of  the  impor¬ 
tance  of  economic  ideas,  the  way  in  which  economic 
orthodoxies  have  changed,  and  the  significance  of 


economists,  within  government,  in  business,  in  the 
media,  and  as  academics  in  influencing  policy. 

Although  the  precise  origins  of  the  term  macro- 
economic  policy  are  the  subject  of  some  dispute,  it 
was  not  in  common  use  before  World  War  II.  This 
was  because  governments  did  not  seek  to  influence 
the  macroeconomic  aggregates  beyond  pursuing 
policies  of  sound  money  through  a  balanced  bud¬ 
get  and  a  stable  currency.  In  the  1930s,  govern¬ 
ments  in  the  United  States,  Britain,  and  elsewhere 
did  become  involved  in  seeking  to  rationalize  and 
revive  depressed  economic  sectors,  but  these  were 
sector-specific  policies. 

World  War  II  led  to  a  step  change  in  the  involve¬ 
ment  of  governments  in  economic  management, 
particularly  as  a  consequence  of  a  widespread  com¬ 
mitment  to  the  maintenance  of  full  employment, 
although  what  exactly  constituted  full  employment 
was  left  ambiguous.  In  his  General  Theory  (1936), 
John  Maynard  Keynes  had  set  out  an  approach 
that  sought  to  run  the  economy  at  a  higher  equilib¬ 
rium  employment  level.  Because  Keynes  wrote  so 
extensively,  and  his  work  was  interpreted  and 
developed  by  his  disciples  after  his  death,  there  is 
some  controversy  about  what  exactly  may  be 
called  “Keynesianism.”  In  essence,  it  advocated  the 
use  of  fiscal  policy  measures  to  avert  high  levels  of 
unemployment  by  stimulating  investment  and  con¬ 
sumption  by  reducing  taxes  and/or  increasing  pub¬ 
lic  expenditure.  Spending  on  public  works  was 
seen  to  be  especially  effective,  because  through  the 
“multiplier”  effect  originally  identified  by  Richard 
Kahn,  an  initial  investment  could  have  a  far  greater 
effect  on  levels  of  economic  activity.  To  prevent  the 
economy  from  “overheating”  as  a  result  of  produc¬ 
tive  capacity  not  keeping  pace  with  growing  aggre¬ 
gate  demand  (as,  e.g.,  in  an  economic  boom),  taxes 
could  be  increased  and  public  expenditure  reduced. 
The  existence  of  an  inverse  relationship  between 
unemployment  and  inflation  rates,  as  described  by 
the  Phillips  curve,  offered  a  range  of  choices  to 
policymakers.  Keynesian  ideas  spread  more  rapidly 
in  some  countries  than  in  others,  but  by  1971,  even 
an  economic  conservative  like  President  Nixon  was 
able  to  state  to  an  ABC  News  reporter,  “I  am  now 
a  Keynesian  in  economics.” 

These  new  techniques  of  economic  manage¬ 
ment  seemed  to  be  associated  with  a  prolonged 
period  of  relatively  high  growth  and  high  levels  of 
employment  after  World  War  II.  However,  it  is 
difficult  to  disentangle  how  much  of  this  good 
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performance  was  due  to  Keynesianism  and  how 
much  due  to  other  factors  such  as  the  deployment 
of  technological  advances  made  during  the  war, 
the  needs  of  postwar  reconstruction,  and  demog¬ 
raphy.  Keynesianism  also  brought  problems  in  its 
train.  It  tended  to  produce  stop-go  cycles  as  the 
economy  was  accelerated  and  decelerated.  This 
was  particularly  a  problem  in  Britain,  and  it  was 
thought  that  one  solution  was  the  adoption  of  the 
French  indicative  planning  model,  but  a  watered- 
down  version  was  applied  in  the  United  Kingdom. 

As  Keynes  himself  had  admitted,  he  had  no  solu¬ 
tion  to  the  inflation  problem  in  a  full-employment 
economy.  Countries  resorted  to  various  forms  of 
income  policies,  either  agreements  between  central¬ 
ized  employers’  organizations  and  trade  unions, 
which  held  back  wage  growth,  or  interventions  by 
government  to  impose  restraint.  The  more  volun¬ 
tary  agreements,  provided  they  were  negotiated  by 
centrally  controlled  trade  unions,  tended  to  be 
more  effective,  especially  in  countries  such  as 
Austria  and  Sweden,  and  came  to  be  known  as 
“corporatism”  in  the  academic  literature,  although 
many  policymakers  did  not  like  this  term  because 
of  its  fascist  associations.  These  arrangements 
worked  well  enough  in  a  period  of  rapid  growth 
when  no  one  left  the  bargaining  table  empty- 
handed,  but  they  became  more  challenged  when 
growth  rates  fell  in  the  1970s,  which  led  to  the 
phenomenon  known  as  stagflation:  low  growth 
and  high  inflation  (and  increasing  unemployment). 
Barry  Eichengreen  argues  that  the  institutional 
arrangements  that  worked  so  successfully  up  to  the 
1970s  were  less  successful  afterward  and  slowed 
down  the  ability  of  the  corporatist  economies  to 
readjust  to  changed  conditions. 

The  apparent  failure  of  Keynesianism  led  to  a 
search  for  a  new  economic  orthodoxy,  and  this  was 
found  in  the  work  of  Milton  Friedman  and  his 
advocacy  of  monetarism.  Even  in  Britain,  money 
supply  indicators  proved  to  be  too  volatile  and 
unreliable  to  provide  a  decision  rule  for  the  econ¬ 
omy,  but  they  ushered  in  a  period  when  there  was  a 
substantial  reduction  in  government  involvement  in 
the  economy.  Controlling  inflation  was  prioritized 
over  maintaining  employment,  and  the  accepted 
economic  orthodoxy  was  reversed  so  that  micro- 
economic,  supply-side  policy  was  seen  as  the  mech¬ 
anism  for  achieving  growth,  while  macroeconomic 
policy  focused  on  the  control  of  inflation.  Monetary 
policy  instruments,  principally  the  control  of  the 


interest  rate,  became  more  important  relative  to  fis¬ 
cal  policy  instruments,  and  emphasis  was  placed  on 
targeting  the  inflation  rate  as  the  principal  eco¬ 
nomic  objective. 

One  consequence  was  an  increase  in  the  impor¬ 
tance  of  independent  or  quasi-independent  central 
banks.  The  Bundesbank  as  a  constitutionally  guar¬ 
anteed  autonomous  actor  had  been  central  to  the 
success  of  postwar  German  economic  policy.  In 
Britain,  from  1997  onward,  the  New  Labour  Gov¬ 
ernment  had  handed  over  operational  control  of 
interest  rates  to  the  Bank  of  England.  The  reality 
of  “all  power  to  the  central  bankers”  was  reflected 
in  the  importance  of  the  Federal  Reserve  to  U.S. 
economic  policy.  In  the  EU,  the  European  Central 
Bank  faced  a  disj uncture  in  the  eurozone:  It  con¬ 
trolled  monetary  policy,  but  fiscal  policy  remained 
the  responsibility  of  the  member  states.  The  Stabil¬ 
ity  and  Growth  Pact  was  supposed  to  restrain 
excessive  deficit  spending  by  member  states  but  in 
practice  failed  to  do  so  as  it  was  defied  by  France 
and  Germany. 

Colin  Crouch  and  others  have  traced  the  evolu¬ 
tion  of  a  system  of  “privatized  Keynesianism”  in 
which  an  economic  stimulus  was  provided,  par¬ 
ticularly  in  Britain  and  the  United  States,  by  large 
amounts  of  consumer  and  corporate  debt.  This 
increasingly  became  linked  to  the  housing  market, 
with  Britain  using  an  inflation  measure  as  a  policy 
guide  that  omitted  the  cost  of  housing.  However, 
the  provision  of  increasingly  risky  loans  and  the 
increased  selling  of  financial  instruments  made  up 
of  packages  of  debt  undermined  the  foundations 
of  the  financial  system.  It  led  to  the  subprime  cri¬ 
sis  in  the  United  States  triggered  by  defaults  on 
loans  to  poor-quality  borrowers,  which  developed 
into  a  serious  global  financial  crisis  and  contrib¬ 
uted  to  the  onset  of  a  severe  economic  recession 
by  2008. 

A  range  of  new  policy  instruments  had  been 
deployed  to  cope  with  this  crisis,  with  renewed 
efforts  to  achieve  effective  international  coordina¬ 
tion.  Some  saw  the  events  as  vindicating  Keynes¬ 
ianism,  although  not  all  countries,  particularly 
Germany,  were  enthusiastic  about  a  fiscal  stimulus 
based  on  government  debt.  New  techniques  of 
“quantitative  easing”  involved  central  banks  buying 
up  corporate  and  government  debt,  so  the  central 
banks  remained  key  actors  alongside  government 
economic  ministries.  The  result  of  these  measures 
was  a  massive  increase  in  public  debt.  Although  low 
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interest  rates  reduced  the  cost  of  servicing  it,  taxes 
would  have  to  be  raised  and  public  expenditure  cut 
in  order  to  repay  it. 

These  events  triggered  a  renewed  debate  about 
the  role  of  financial  services  regulation.  Although 
there  was  a  general  view  that  financial  markets 
regulation  needed  to  be  tighter,  it  was  less  clear 
how  that  might  be  achieved.  Some  argued  that  risk 
was  central  to  a  market  economy  and  reducing 
it  too  much  would  stifle  the  economy,  whereas 
others  argued  that  reckless  risk  taking  had  been 
encouraged  and  needed  to  be  curtailed.  After  a 
period  when  economic  policy  had  been  “depoliti- 
cized”  and  automatic  rules  supervised  by  central 
bankers  had  replaced  political  discretion,  eco¬ 
nomic  policy  was  suddenly  no  longer  a  matter  of 
purely  technical  debates  but  moved  to  the  center  of 
the  political  stage  once  again. 

Wyn  Grant 
University  of  Warwick 
Coventry,  United  Kingdom 
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Economic  Statecraft 


Economic  statecraft  is  the  use  of  economic  means 
to  pursue  foreign  policy  goals.  Foreign  aid,  trade, 


and  policies  governing  the  international  flow  of 
capital  can  be  used  as  foreign  policy  tools  and  are 
considered  the  most  common  forms  of  economic 
statecraft.  In  principle,  policies  governing  the 
international  movement  of  labor  could  also  be 
considered  instances  of  economic  statecraft  if  they 
are  intended  to  promote  foreign  policy  goals;  but 
such  measures  are  not  usually  included  under  the 
rubric  of  economic  statecraft  and  are  not  consid¬ 
ered  here.  This  entry  discusses  the  forms  of  eco¬ 
nomic  statecraft,  the  uses  of  such  instruments,  and 
approaches  to  the  study  of  economic  statecraft. 

Economic  techniques  of  statecraft  are  distin¬ 
guished  from  other  foreign  policy  tools,  such  as  the 
following:  (a)  military  statecraft,  which  concerns 
the  use  or  threat  of  military  force;  (b)  diplomacy, 
which  concerns  negotiation;  and  (c)  propaganda, 
which  concerns  manipulating  verbal  or  visual  sym¬ 
bols.  Most  foreign  policies  consist  of  some  combi¬ 
nation  of  these  techniques. 

Economic  statecraft  takes  many  different  forms, 
including  both  positive  and  negative  sanctions. 
Negative  sanctions  are  actual  or  threatened  pun¬ 
ishments,  while  positive  sanctions  are  actual  or 
promised  rewards.  Examples  of  negative  sanc¬ 
tions  include  the  following:  refusing  to  export 
(embargos),  refusing  to  import  (boycotts),  covert 
refusals  to  trade  (blacklists),  purchases  intended 
to  keep  goods  out  of  the  hands  of  target  countries 
(preclusive  buying),  deprivation  of  ownership 
(expropriation),  punitive  taxation,  aid  suspen¬ 
sions,  and  asset  freezes.  Examples  of  positive 
sanctions  include  preferential  tariffs,  subsidies, 
foreign  aid,  investment  guarantees,  and  preferen¬ 
tial  taxation  of  foreign  investment. 

Neither  the  study  nor  the  practice  of  economic 
statecraft  is  of  recent  origin.  Although  the  Athenian 
use  of  the  Megarian  Decree  may  be  the  most 
famous  example  from  ancient  times,  it  was  surely 
not  the  first.  Examples  of  various  types  of  eco¬ 
nomic  statecraft  can  be  found  throughout  recorded 
history.  The  use  of  economic  means  to  pursue  for¬ 
eign  policy  goals  has  been  discussed  by  a  number 
of  thinkers  through  the  ages,  including  Plato, 
Aristotle,  John  Locke,  Francis  Bacon,  Montesquieu, 
David  Hume,  Adam  Smith,  Immanuel  Kant, 
Alexander  Hamilton,  Friedrich  List,  John  Stuart 
Mill,  Woodrow  Wilson,  and  John  Maynard  Keynes. 

Tools  of  economic  statecraft  have  been  used  to 
pursue  a  wide  variety  of  foreign  policy  goals. 
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These  include  preparing  for  war,  preventing  war, 
fighting  a  war,  promoting  democracy,  punishing 
human  rights  violators,  promoting  communism, 
opposing  communism,  promoting  economic  devel¬ 
opment,  discouraging  economic  development,  pre¬ 
venting  regime  change,  encouraging  regime  change, 
and  many  other  goals.  Pericles,  Theodore  and 
Franklin  Roosevelt,  Woodrow  Wilson,  Adolf 
Hitler,  Joseph  Stalin,  Winston  Churchill,  and  the 
United  Nations  (UN)  have  used  various  forms  of 
economic  statecraft  to  pursue  goals  that  were 
sometimes  noble  and  sometimes  nefarious.  Like 
other  techniques  of  statecraft,  economic  tools  can 
be  used  wisely  or  unwisely,  justly  or  unjustly, 
depending  on  the  situation. 

In  the  first  part  of  the  20th  century,  the  League 
of  Nations  generated  hopes  that  warfare  could,  to 
some  extent,  be  prevented  or  replaced  by  economic 
sanctions.  The  League  of  Nations  imposed  sanc¬ 
tions  on  Italy  in  response  to  its  invasion  of  Ethiopia 
in  1935  partly  to  punish  Italy  but  also  to  warn 
Hitler  of  its  members’  determination  to  resist 
aggression.  The  failure  of  the  sanctions  with 
respect  to  both  goals  generated  a  widespread  belief 
that  such  measures  do  not  work.  Much  of  the  last 
half  of  the  20th  century  was  dominated  by  the 
acceptance  of  this  belief  as  conventional  wisdom. 

Answering  the  question  of  whether  economic 
sanctions  “work,”  however,  is  more  complicated 
than  it  seems.  There  is  not  even  a  consensus  on 
what  “work”  means  in  this  context.  Does  it  mean 
complete  achievement  of  the  primary  goal?  Of  all 
goals?  At  what  cost?  And  in  comparison  with 
which  alternative  techniques?  These  are  just  some 
of  the  questions  begged  by  the  misleading  ques¬ 
tion:  “Do  economic  sanctions  work?” 

The  first — and  most  important — step  in  evaluat¬ 
ing  the  utility  of  any  technique  of  statecraft,  includ¬ 
ing  economic  sanctions,  is  identifying  what  goals 
were  being  pursued  with  respect  to  which  targets.  In 
the  case  of  the  League  of  Nations  sanctions  against 
Italy,  for  example,  impressing  Hitler  was  probably 
more  important  than  stopping  Italian  aggression.  In 
addition,  there  was  a  desire  not  to  impose  such 
hardship  on  Italy  that  it  might  undermine  the  fascist 
regime  and  bring  the  communists  to  power. 

Human  beings  in  general — and  nation-states  in 
particular — rarely,  if  ever,  pursue  only  one  goal  at  a 
time  with  respect  to  only  one  other  individual  or 
group.  When  a  country  imposes  economic  sanctions 


on  another  country,  it  is  usually  pursuing  multiple 
goals  of  varying  degrees  of  importance  with 
respect  to  a  multiplicity  of  other  actors  in  the  inter¬ 
national  arena.  Although  no  assessment  of  the 
overall  success  of  an  influence  attempt  based  on 
economic  means  could  be  expected  to  consider  all 
goals  and  targets,  it  is  reasonable  to  expect  such 
assessments  to  consider  the  most  important  goals 
and  targets.  Most  attempts  to  assess  the  success  of 
economic  sanctions,  however,  consider  only  one 
goal  with  respect  to  one  target. 

An  additional  complication  is  that  success  is 
almost  always  a  matter  of  degree.  In  statecraft,  as  in 
everyday  life,  complete  success  in  goal  attainment 
occurs  rarely — if  ever.  The  potential  number  of 
degrees  of  success  (or  failure),  of  course,  is  infinite; 
thus,  any  attempt  to  measure  degrees  of  success 
must  involve  simplification.  Whether  a  3-point  scale 
or  a  16-point  scale  is  more  appropriate  is  a  matter 
for  reasonable  dispute,  but  a  dichotomous  concep¬ 
tion,  which  fails  to  allow  for  any  degree  of  success, 
is  difficult  to  justify.  Most  influence  attempts  are 
likely  to  result  in  some  degree  of  success  with 
respect  to  at  least  some  goals  and  some  targets. 

In  addition  to  estimating  goal  attainment,  deter¬ 
mining  the  success  of  economic  sanctions  involves 
estimating  the  costs  of  the  undertaking.  Measuring 
the  costs  of  economic  sanctions  is  usually  easier 
than  measuring  the  noneconomic  costs.  Political 
costs  are  likely  to  be  both  important  and  difficult 
to  measure.  This  does  not  mean  that  reasonable 
estimates  are  impossible. 

Perhaps  the  most  difficult  step  in  assessing  the 
utility  of  economic  techniques  of  statecraft  is  set¬ 
ting  such  assessments  in  the  context  of  the  logic  of 
choice.  Without  comparable  evaluations  of  the 
costs  and  benefits  of  alternative  techniques  of 
statecraft,  estimates  of  the  likely  costs  and  benefits 
of  economic  sanctions  are  of  little  or  no  interest. 
For  policymakers,  it  is  the  relative  utility  of  a 
policy  option  that  matters.  This  is  especially 
important  when  the  alternative  to  economic  sanc¬ 
tions  is  military  force.  In  such  cases,  economic 
sanctions  often  provide  a  policy  alternative  with 
modest  expectations  of  goal  attainment  at  rela¬ 
tively  low  cost,  while  military  force  involves  a 
higher  probability  of  goal  attainment  accompa¬ 
nied  by  much  higher  costs. 

If  the  goal  is  to  choose  the  policy  alternative 
with  the  most  utility,  the  rational  choice  may  be 
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one  that  accomplishes  fewer  goals  at  much  lower 
cost.  The  U.S.-led  invasion  of  Iraq  in  2003  pro¬ 
vides  an  example.  The  economic  sanctions  against 
Iraq  were  not  providing  the  United  States  with  the 
degree  of  assurance  that  Iraq  was  not  building 
weapons  of  mass  destruction  that  it  desired,  and 
sanctions  were  not  bringing  about  regime  change 
even  though  several  of  the  goals  specified  by  the 
UN  had  been  accomplished.  Advocates  of  war 
argued  that  the  costs  of  maintaining  the  sanctions 
could  go  on  for  years,  whereas  the  costs  of  war 
would  be  limited  in  both  time  and  magnitude.  As 
is  often  the  case,  the  costs  of  war  were  significantly 
underestimated.  Thus,  to  make  a  policy-relevant 
assessment  of  the  utility  of  economic  sanctions,  it 
is  necessary  not  only  to  estimate  the  costs  and  ben¬ 
efits  of  sanctions  but  also  the  costs  and  benefits  of 
alternative  courses  of  action,  such  as  war. 

The  critics  of  economic  sanctions  often  dis¬ 
missed  them  as  “merely  symbolic,”  implying  that 
they  were  empty  gestures  aimed  at  giving  domestic 
audiences  the  false  impression  that  “something 
was  being  done.”  Although  some  economic  sanc¬ 
tions  may  well  fit  this  description,  not  all  do.  As 
scholarly  work  in  foreign  policy  and  international 
politics  has  come  to  recognize  the  importance  of 
signaling,  the  symbolic  uses  of  economic  sanctions 
have  received  serious  scholarly  attention.  Economic 
sanctions  can  serve  as  costly  signals  that  increase 
the  credibility  of  foreign  policy  stances.  Thus,  the 
symbolic  use  of  economic  sanctions  can  be  an 
important  way  to  influence  the  actions  and  atti¬ 
tudes  of  other  countries.  Game-theoretical  models 
have  been  especially  useful  in  the  study  of  this 
aspect  of  economic  statecraft. 

In  sum,  economic  statecraft  is  the  use  of  eco¬ 
nomic  means  in  pursuit  of  foreign  policy  goals.  As 
with  other  foreign  policy  tools,  economic  state¬ 
craft  can  be  used  to  make  either  threats  or  prom¬ 
ises  and  to  either  punish  or  reward.  And  as  with 
other  foreign  policy  tools,  success  is  often  difficult 
to  evaluate. 

David  A.  Baldwin 
Princeton  University 
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Economic  Theories  of  Politics 


Economic  theories  of  politics  use  theoretical 
approaches  from  economics  to  analyze  political 
phenomena.  This  entry  examines  assumptions 
about  rational  choice  that  are  shared  by  the  two 
fields  and  discusses  the  application  of  rational 
choice  approaches,  including  game  theory,  to  the 
analysis  of  individual  choice  and  strategic  interac¬ 
tions.  It  then  considers  the  way  in  which  social 
choice  theory  can  illuminate  the  aggregation  of 
individual  choices  in  collective  decision  making. 
The  relationship  of  such  processes  to  voting,  par¬ 
ticularly  in  the  context  of  democracy,  is  described. 
Finally,  the  entry  looks  at  the  application  of  eco¬ 
nomic  theories  to  international  relations  with 
respect  to  issues  such  as  arms  races,  war,  develop¬ 
ment,  and  trade. 

Rational  Choice  and  Strategic  Interactions 

Most  political  scientists  would  agree  that  politics 
involves  “control,  influence,  power,  or  authority.” 
If  one  adds  Max  Weber’s  concerns  about  govern¬ 
ment,  legitimacy,  and  the  state,  together  with 
Aristotle’s  more  normative  focus  on  issues  of  par¬ 
ticipation,  citizenship,  and  justice,  one  has  a  fairly 
complete  picture  of  what  Robert  Dahl  calls  the 
“political  aspect.”  One  can  see  immediately  how 
politics  touches  every  dimension  of  human  activity, 
including  the  procedural  or  distributional  dimen¬ 
sion  (who  gets  what,  when,  how,  why,  and  at 
whose  cost?);  the  legal  or  statist  dimension,  which 
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involves  issues  of  governance  and  legitimacy; 
and  the  ethical  or  normative  dimension,  which 
revolves  around  questions  of  citizenship,  justice, 
and  participation. 

The  study  of  politics,  like  economics,  also 
involves  preferences,  interests,  and  trade-offs.  But 
unlike  economics,  where  the  emphasis  is  on  scar¬ 
city  and  efficiency,  in  politics  the  primary  emphasis 
is  on  power,  influence,  and  authority,  with  strong 
ethical  and  normative  overtones,  concerning  jus¬ 
tice,  membership,  and  citizenship.  In  a  free  market, 
the  allocation  of  scarce  goods  and  resources  takes 
place  according  to  the  logic  of  the  marketplace  (the 
price  mechanism) — that  is,  the  interaction  of  sup¬ 
ply  and  demand.  The  exercise  of  power,  however, 
takes  place  in  the  ideational,  legal,  and  institutional 
confines  of  political  systems. 

Then,  what  have  economic  theories  added  to 
the  study  of  politics?  Political  scientists  know  that 
politics,  unlike  economics,  is  not  interested  nar¬ 
rowly  in  the  allocation  of  scarce  goods  and 
resources.  Although  politics  affects  markets 
through  policies,  laws,  and  rules  that  regulate 
competition,  in  a  mixed  capitalist  system  politics  is 
not  directly  concerned  with  the  individual  eco¬ 
nomic  decisions  of  consumers  and  producers  or 
the  optimal  allocation  of  scarce  resources.  But 
politics,  like  economics,  does  involve  choices  and 
strategic  interactions.  This  is  where  those  who 
advocate  a  positive  approach  to  the  study  of  poli¬ 
tics  join  forces  with  economists  to  lay  the  micro¬ 
foundations  of  political  analysis.  So-called  rational 
choice  approaches  in  political  science  share  com¬ 
mon  assumptions  with  economics  about  human 
rationality  and  strategic  decision  making,  and 
economists  seek  to  construct  economic  theories  to 
explain  political  behavior. 

The  most  common  economic  theories  of  politics 
take  rational  choice  approaches.  They  assume  that 
individuals  are  rational  in  the  sense  that  they  will 
make  choices  to  “maximize  their  chances  of  achiev¬ 
ing  their  goals”  (Barbara  Geddes,  2003,  p.  177). 
They  give  priority  to  agency  (individual  rational 
actors)  over  structure  (institutions  and  other  politi¬ 
cal  constraints).  They  assume  that  individuals  have 
goals  and  that  institutions  and  other  factors  affect 
individual  strategies  and  preferences.  In  this  frame¬ 
work,  utility-maximizing  individuals  will  do  what 
they  can  to  achieve  their  goals,  engaging  in  strategies 
to  anticipate  the  actions  of  others  (their  opponents), 


who  will  in  turn  anticipate  the  actions  of  the  other 
side.  Strategic  interactions  therefore  refer  to  the 
ways  in  which  each  individual  not  only  looks  out 
for  his  or  her  own  interests  but  also  takes  into 
account  the  interests  and  strategies  of  others.  In 
this  rational  choice  framework,  conflict,  coopera¬ 
tion,  and  the  give  and  take  of  political  life  are  out¬ 
comes  of  myriad  strategic  interactions. 

Rational  choice  approaches  often  use  game  the¬ 
ory  to  understand  the  complexity  of  strategic  inter¬ 
actions  in  situations  of  conflict.  Developed  by 
applied  mathematicians  in  the  mid-20th  century, 
game  theory  is  widely  used  by  economists  and,  to 
an  increasing  extent,  by  political  scientists.  Game 
theory  is  not  a  “theory”  in  the  sense  of  a  set  of 
claims,  laws,  or  propositions  about  the  way  the 
world  works.  It  is  rather  a  method  of  constructing 
theories,  and  it  offers  the  analyst  a  set  of  concepts 
and  tools  that  enable  him  or  her  to  formalize  argu¬ 
ments.  Game-theoretic  analysis  requires  careful 
specification  of  the  beliefs,  wants,  and  needs  of 
individuals  and  a  clear  understanding  of  what  strat¬ 
egies  are  available  to  them.  The  need  for  specificity 
makes  game  theory  less  useful  as  a  tool  for  applied 
political  and  social  science;  nonetheless,  it  helps 
political  and  social  scientists  understand  the  logic 
and  structure  of  politics,  whether  they  are  studying 
voting,  international  relations,  or  policy  making. 

Strategic  Interactions  and  Collective  Choice 

Rational  choice  approaches  or  “economic  theo¬ 
ries”  of  politics  begin  with  assumptions  about 
individual  rationality.  But  how  do  collective  deci¬ 
sions  relate  to  individual  choices?  To  answer  this 
question,  social  choice  theory  focuses  on  how 
individual  preferences  are  aggregated  and  what 
roles  institutions  play  in  “engineering”  social  (or 
collective)  choices.  The  disjuncture  between  indi¬ 
vidual  and  collective  preferences  is  summed  up  in 
Condorcet’s  paradox — the  puzzle  that  calls  into 
question  the  notion  of  majority  rule,  which  is 
incapable  of  producing  a  stable  relationship 
between  individual  preferences  and  collective 
decisions.  Kenneth  Arrow  sought  to  address  the 
puzzle  of  how  individual  preferences  affect  collec¬ 
tive  choices  and  concluded  that  there  is  no  mecha¬ 
nism  short  of  a  dictatorship  that  can  achieve  col¬ 
lective  rationality.  This  is  known  as  Arrow’s 
impossibility  theorem. 
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As  an  economic  theory  of  politics,  Arrow’s 
theorem  helped  explain  how  democracies  work. 
Arrow  does  not  prove  that  collective  rationality — 
which  would  guarantee  the  aggregation  of  individ¬ 
ual  preferences  transitively — requires  concentration 
of  power  in  the  hands  of  a  single  decision  maker  or 
dictator.  The  main  implication  of  Arrow’s  theorem 
is  that  institutions  are  the  critical  link  in  under¬ 
standing  how  radically  divergent  individual  prefer¬ 
ences  are  translated  into  collective  action  in  rather 
stable  ways. 

Building  on  Arrow’s  social  choice  theory, 
William  Riker  (1980)  argues  that 

politics  is  the  dismal  science  because  we  have 
learned  from  it  that  there  are  no  fundamental 
equilibria  to  predict.  In  the  absence  of  such 
equilibria  we  cannot  know  much  about  the 
future  at  all,  whether  it  is  likely  to  be  palatable 
or  unpalatable,  and  in  that  sense  our  future  is 
subject  to  the  tricks  and  accidents  of  the  way  in 
which  questions  are  posed  and  alternatives  are 
offered  and  eliminated,  (p.  443) 

While  Riker  recognized  the  failure  to  find  equilib¬ 
ria  for  collective  rationality,  he  stressed  the  impor¬ 
tance  of  institutions  for  the  smooth  functioning  of 
a  democracy.  His  answer  to  the  question  of  how 
democracies  can  reach  collective  decisions  despite 
the  lack  of  equilibria  was 

[in  a  democracy]  the  way  .  .  .  tastes  and  values 
are  brought  forward  for  consideration, 
eliminated,  and  finally  selected  is  controlled 
by  .  .  .  institutions.  And  institutions  may  have 
systematic  biases  in  them  so  that  they  regularly 
produce  one  kind  of  outcome  rather  than  another. 

(p.  443) 

Following  Riker’s  expansion  of  Arrow’s  theo¬ 
rem,  political  and  social  scientists  must  ask  how 
institutions  matter  in  democratic  decision  making. 
Many  studies  have  debated  whether  and  how  insti¬ 
tutional  structures  determine  the  existence  and 
location  of  equilibria  for  collective  choice.  Anthony 
Downs  points  out  that  governments  are  not  really 
interested  in  maximizing  individual  voters’  prefer¬ 
ences  but  in  maximizing  votes.  In  his  analysis,  the 
sole  point  of  politics  is  to  gain  and  hold  power;  in 
a  two-party  system,  politicians  are  forced  to  take 


positions  as  close  as  possible  to  the  median  voter. 
Thomas  Romer  and  Howard  Rosenthal  (1978), 
however,  showed  that  once  agenda  control  is  taken 
into  consideration,  the  collective  outcome  of  policy 
making  is  often  not  the  median  voter’s  ideal  point. 
Kenneth  Shepsle  (1979),  in  a  more  generalized 
way,  shows  that  institutions  could  induce  equilib¬ 
rium  for  collective  choice  where  individual  prefer¬ 
ences  alone  would  not  yield  equilibrium. 

Voting  has  long  been  a  principal  subject  of 
political  analysis,  from  the  so-called  paradox  of 
voting,  first  identified  by  Downs — why  should 
individuals  bother  to  vote  when  the  costs  of  voting 
(time  and  effort)  far  outweigh  the  benefits  (the  like¬ 
lihood  that  a  single  vote  will  influence  the  out¬ 
come)? — to  strategic  voting.  Maurice  Duverger 
points  out  that  the  strategic  behavior  of  voters  and 
candidates  is  heavily  influenced  by  electoral  institu¬ 
tions.  Duverger’s  law  holds  that  plurality  voting  in 
single-member  districts  tends  to  produce  two-party 
systems,  whereas  voting  based  on  proportional 
representation  or  multimember  districts  leads  to 
multiparty  systems  and  coalition  governments.  In 
two-party  systems,  candidates  have  an  incentive  to 
obfuscate  and  avoid  taking  strong  stands  on  key 
issues  as  they  jockey  for  position  vis-a-vis  the 
median  voter;  in  multiparty  systems,  however,  can¬ 
didates  have  an  incentive  to  take  stronger  positions 
to  attract  a  significant  minority  of  voters,  but  their 
positions  may  change  once  they  enter  a  coalition 
government.  In  retrospect,  these  arguments  may 
seem  obvious  or  almost  self-evident,  but  they  are  in 
fact  early  examples  of  economic  reasoning  in  the 
study  of  politics. 

Following  Duverger  and  others,  one  can  see 
that  political  parties  and  electoral  systems  are  the 
most  important  institutions  for  the  smooth  func¬ 
tioning  of  a  democracy.  They  translate  and  aggre¬ 
gate  individual  preferences  into  policy.  John 
Aldrich  (1995)  argues  that  democracies  would  not 
work  without  parties.  He  shows  how  parties  regu¬ 
late  the  number  of  people  seeking  office  and  how 
they  mobilize  voters  to  achieve  and  maintain  the 
majorities  needed  to  implement  policy  once  they 
have  gained  power.  By  aggregating  individual  pref¬ 
erences,  parties  help  solve  (or  at  least  side  step) 
Arrow’s  impossibility  theorem,  and  they  move 
society  toward  equilibria  that  make  governing  pos¬ 
sible.  This  argument  for  the  importance  of  parties 
relies  heavily  on  economic  reasoning,  and  it 
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assumes  that  voters  are  able  to  make  informed 
choices. 

To  take  a  concrete  example,  the  founders  of  the 
American  Republic  believed  that  voters  would 
have  enough  information  to  make  the  “right 
choices.”  In  the  Federalist  Papers,  James  Madison 
argued  that  people  should  have  the  “virtue  and 
intelligence  to  select  men  of  virtue  and  wisdom.” 
Studies  of  democratic  elections,  however,  have 
found  that  individuals  lack  detailed  information 
about  politics.  Voters  make  their  choices  by  trust¬ 
ing  expert  opinion,  by  accepting  consensus  judg¬ 
ments,  and  by  using  what  little  information  they 
have  to  make  their  choices  at  the  ballot  box.  In 
short,  voters  vote  as  if  they  are  well  informed  and 
using  the  limited  information  available  to  them. 

Strategic  Interactions  and 
International  Relations 

Economic  theories  of  politics  abound  in  the  study 
of  international  relations.  At  first  glance,  it  is  not 
surprising  to  see  that  states  are  more  often  than  not 
in  conflict,  because  each  state  must  pursue  its  own 
interests,  maximizing  its  power  and  wealth  to  pro¬ 
vide  security.  States  are  trapped  in  a  security 
dilemma;  if  they  do  not  correctly  assess  threats 
from  other  states,  they  risk  their  very  survival.  The 
logic  of  the  security  dilemma  is  often  explained  in 
game-theoretic  terms,  using  the  so-called  prisoner’s 
dilemma,  whereby  states  are  compared  with  two 
prisoners,  accused  of  a  crime,  jailed,  and  isolated 
by  prosecutors.  Each  prisoner  is  offered  a  deal  if  he 
or  she  will  confess.  If  both  confess,  each  will 
receive  a  moderate  sentence;  if  neither  confesses, 
both  will  receive  a  reduced  sentence;  and  if  one 
confesses  and  the  other  remains  silent,  the  one  who 
confesses  goes  free  while  the  other  receives  a  life 
sentence.  The  smart  play  is  to  keep  quiet,  but  by 
not  knowing  what  his  or  her  partner  in  crime  will 
say  to  the  authorities,  each  prisoner’s  temptation  to 
confess  (defect)  is  overwhelming.  The  point  is  that 
actors  have  an  individual  incentive  to  defect,  which 
leads  to  an  outcome  of  mutual  defection  even 
though  both  would  be  better  off  with  mutual  coop¬ 
eration.  With  respect  to  the  security  situation,  it 
may  seem  irrational  for  states  to  engage  in  an  arms 
race,  thereby  increasing  the  propensity  to  go  to  war 
and  making  it  more  difficult  to  resolve  their  differ¬ 
ences  through  negotiations.  However,  military 


expansion  yields  a  better  outcome  than  disarming 
when  the  other  side  arms,  even  though  the  best 
outcome  would  be  for  both  states  to  disarm.  Thus, 
the  states  engage  in  an  arms  race,  which  is  costly 
and  may  lead  to  war. 

Motivated  by  this  puzzle,  James  Fearon  (1995) 
offers  several  “rationalist  explanations  for  war.”  He 
postulated  three  mechanisms  by  which  conflict  can 
lead  to  war.  First,  war  may  occur  if  there  is  uncer¬ 
tainty  about  the  adversary’s  capabilities,  such  as  the 
size  of  the  military,  the  effectiveness  of  military 
technology,  the  quality  of  leadership,  and  the  con¬ 
tribution,  if  any,  of  allies  to  the  cause.  There  may 
also  be  uncertainty  about  the  adversary’s  resolve  to 
fight  wars,  which  raises  questions  about  how  much 
each  side  values  the  “good”  that  is  in  dispute  and 
what  the  ultimate  cost  of  war  will  be  in  terms  of 
blood  (casualties)  and  treasure  (wealth).  Second, 
Fearon  argues  that  war  can  occur  if  states  are 
unable  or  unwilling  to  honor  bargains — versions  of 
the  prisoner’s  dilemma.  If  there  are  advantages  in 
striking  first,  if  the  relative  power  of  one  side  is 
expected  to  grow  rapidly,  or  if  the  bargaining  is 
over  goods  that  affect  future  bargaining  power, 
then  states  may  face  a  choice  between  war  today  on 
favorable  terms  and  the  threat  of  war  tomorrow  on 
unfavorable  terms  (the  shadow  of  the  future). 
Finally,  bargaining  may  not  help  avoid  war  if  the 
good  in  dispute  is  indivisible. 

Hence,  Fearon’s  economic  theory  of  war  shows 
us  that  a  mutual  (and  rational)  preference  for  peace 
is  not  sufficient  for  states  to  overcome  the  “asym¬ 
metric  information  problem.”  As  a  result,  states 
generally  have  incentives  to  exaggerate  and  misrep¬ 
resent  their  capabilities  and  their  resolve  to  fight — 
recall  the  bluster  of  Saddam  Hussein  in  the  run-up 
to  the  2003  invasion  of  Iraq  by  the  United  States 
(also  known  as  the  second  Gulf  War).  In  such  a 
situation,  the  danger  is  that  brinksmanship — the 
act  of  sending  credible  signals  about  capabilities  or 
resolve — may  raise  the  risk  of  war. 

One  of  the  enduring  issues  in  the  study  of  inter¬ 
national  relations  is  the  so-called  democratic  peace 
argument,  with  many  competing  arguments  about 
why  democracies  rarely,  if  ever,  fight  wars  against 
one  another.  One  explanation  is  that  democracy 
increases  the  political  costs  of  war  by  making 
elected  leaders  accountable  to  people  who  ulti¬ 
mately  must  pay  the  costs  of  war.  Another  expla¬ 
nation  is  that  democracy  increases  transparency, 
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which  may  help  overcome  the  asymmetric  infor¬ 
mation  and  commitment  problems  in  strategic 
interactions  between  states,  thereby  reducing  the 
severity  of  the  security  dilemma.  An  alternative 
argument  is  that  democratic  states  never  go  to  war 
with  each  other  because  they  almost  always  have  a 
common  enemy — an  example  would  be  the  Soviet 
Union  versus  the  Western  democracies  during  the 
Cold  War  period. 

Some  have  argued  that  international  institutions 
reduce  the  risks  of  war  and  help  alleviate  the  secu¬ 
rity  dilemma.  The  United  Nations  (UN),  for  exam¬ 
ple,  provides  a  set  of  rules  that  structures  interna¬ 
tional  relations  in  such  a  way  as  to  promote  collec¬ 
tive  security.  However,  the  UN,  like  other  interna¬ 
tional  institutions,  lacks  the  capacity  to  enforce  its 
rulings.  States  have  the  right  to  defend  themselves 
when  attacked — the  self-help  rule — but  there  is  no 
guarantee  that  other  states  will  come  to  their  aid. 
Still,  some  would  argue  that  international  institu¬ 
tions  mitigate  the  effects  of  anarchy  and  promote 
cooperation.  By  setting  up  norms  and  rules  and 
through  creating  repeated  interactions  (iterative 
games),  international  institutions  help  states  over¬ 
come  the  security  dilemma,  and  cooperation  rather 
than  conflict  becomes  the  norm  of  international 
relations — this  is  the  so-called  shadow  of  the 
future.  Institutions  offer  a  set  of  norms  and  stan¬ 
dards  of  acceptable  behavior;  they  provide  infor¬ 
mation  about  activities  of  other  states  (potential 
adversaries),  thereby  creating  linkages  across  issues 
and  raising  the  costs  of  free  riding  and  defection 
(Lisa  Martin  &  Beth  Simmons,  2001). 

Economic  theories  of  politics  not  only  help  us 
understand  the  classic  issues  of  war  and  peace  in 
international  relations,  but  they  are  essential  to 
understanding  trade  policy.  If  economists  are  cor¬ 
rect  in  arguing  that  trade  is  beneficial  for  all  trad¬ 
ing  partners,  then  why  do  some  states  insist  on 
protecting  their  markets,  restricting  trade,  and 
damaging  the  prospects  for  world  economic  growth 
and  welfare?  Is  this  not  irrational  behavior?  The 
theory  of  comparative  advantage  shows  that  trade 
is  beneficial  for  society  as  a  whole,  but  it  is  known 
that  trade  creates  winners  and  losers.  Using  the 
simple  economic  logic  of  the  Stolper-Samuelson 
theorem,  which  postulates  that  protectionism  leads 
to  an  increase  in  the  income  of  a  state’s  scarce  fac¬ 
tor,  Ronald  Rogowski  (1989)  developed  a  theory 
to  explain  how  political  coalitions  form  to  support 


or  oppose  free  trade.  The  logic  of  collective  action 
suggests  that  when  the  benefits  of  protectionism 
are  concentrated  and  the  costs  are  diffuse,  the  state 
may  be  captured  by  powerful  interests  who  stand 
to  win  from  protectionism.  Conversely,  if  a  state 
pursues  free  trade  and  opens  its  markets,  the  ben¬ 
efits  will  accrue  to  the  exporting  sectors,  in  which 
case  the  state  is  able  to  overcome  collective  action 
problems  and  pursue  a  trade  liberalization  policy. 
This  is  a  classic  example  of  an  economic  theory  of 
politics,  which  helps  us  understand  the  politics  of 
trade. 

The  same  reasoning  can  be  applied  to  problems 
of  development.  At  a  meeting  of  the  World  Trade 
Organization  (WTO)  in  Cancun  in  2003,  talks 
aimed  at  reducing  protectionism  and  liberalizing 
trade  broke  down  due  to  a  conflict  between  rich 
and  poor  states.  Here,  one  can  see  how  the  decline 
of  U.S.  economic  power  has  made  it  more  difficult 
to  achieve  multilateral  trade  agreements.  In  the 
absence  of  a  leader  (hegemon),  there  is  a  greater 
propensity  for  states  to  pursue  beggar-thy-neighbor 
policies  and  to  free  ride — gain  the  benefits  of  trade 
but  without  paying  the  costs  (lower  tariffs  and  less 
protectionism)  needed  to  maintain  an  open  interna¬ 
tional  economy.  More  important,  the  integration 
of  developing  countries  into  the  global  market  has 
generated  a  backlash  against  trade  liberalization. 
As  Geoffrey  Garrett  (1998)  notes,  unskilled  work¬ 
ers  in  developed  countries — losers  from  free  trade 
according  to  the  Stolper-Samuelson  theorem — have 
opposed  globalization  because  of  fear  of  down¬ 
ward  pressure  on  wages,  while  people  in  develop¬ 
ing  countries  (and  the  activists  who  speak  for  them) 
fear  vulnerabilities  to  global  markets.  Though 
economists  argue  that  social  welfare  policies  to 
support  the  losers  of  free  trade  and  the  recent  trend 
of  globalization  (such  as  income  compensations) 
are  more  efficient  than  trade  protectionism,  the 
formal  model  of  Daren  Acemoglu  and  James 
Robinson  (2006,  chap.  10)  shows  that  global 
capital  mobility  makes  it  more  difficult  for  a 
democracy  to  provide  social  welfare  policies  that 
the  majority  of  the  citizens  may  want. 

Conclusion 

Supporters  of  economic  theories  of  politics,  spe¬ 
cifically  rational  choice  approaches,  argue  that  it  is 
the  most  scientific  (hypothetico-deductive)  way  of 
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studying  politics,  and  as  such  they  contend  that  it 
is  high  time  that  the  discipline  of  political  science 
embraces  a  more  rationalist  and  utilitarian  theory 
of  politics.  Some  would  go  so  far  as  to  say  that 
rational  choice  theory  constitutes  a  “scientific 
revolution”  a  la  Thomas  Kuhn,  moving  the  study 
of  politics  away  from  its  formal-legal  and  social- 
psychological  roots  and  in  the  direction  of  more 
systematic  and  falsifiable  propositions.  The  irony 
is  that  economics  seems  to  be  moving  in  the  oppo¬ 
site  direction,  with  a  renewed  emphasis  on  socio- 
psychological  approaches  to  the  study  of  markets 
and  economic  behavior.  Perhaps  the  two  disci¬ 
plines  will  meet  halfway  and  a  true  political  econ¬ 
omy  will  emerge;  but  this  seems  unlikely  because 
the  objects  and  the  subjects  of  inquiry  are  quite 
different.  This  entry  cannot  begin  to  resolve  the 
dispute  between  rationalists  and  social  psycholo¬ 
gists  and  institutionalists;  instead,  it  falls  back  on 
Weber  who  leaves  ample  room  for  rationalist  and 
interpretivist  approaches  to  the  study  of  politics. 
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Effectiveness,  Bureaucratic 


This  entry  discusses  bureaucratic  effectiveness  and 
its  meanings.  These  various  meanings  include  an 
agency’s  political  capabilities  and  effectiveness  at 
acquiring  resources,  its  ability  to  preserve  its 
autonomy,  and  ultimately  its  ability  to  produce 
public  value  and  to  measure  that  value.  The  valid¬ 
ity  of  metrics,  however,  is  related  to  clarity  of 
goals,  and  some  goals  and  functions  are  inherently 
clearer  than  others.  In  addition,  political  contro¬ 
versy  makes  clear  goal  definition  problematic. 

Varieties  of  Bureaucratic  Effectiveness 

It  is  difficult  to  find  a  single  definition  of  bureau¬ 
cratic  effectiveness.  A  big  part  of  the  explanation 
for  that  lies  in  the  many  distinct  functions  that 
bureaucratic  agencies  have.  Some  have  very  defined 
customer  service  characteristics.  An  automobile 
licensing  agency,  for  example,  may  be  able  to  cal¬ 
culate  the  wait  time  for  its  customers  and  seek  to 
cut  that  through  better  management.  Taxing  agen¬ 
cies  might  well  calculate  their  effectiveness  in 
terms  of  recovering  revenue.  But  what,  some  have 
asked,  does  a  foreign  ministry  do  and  how  does 
one  measure  whether  or  not  it  is  meeting  its  goals, 
assuming  it  could  define  them? 

It  is  necessary  to  distinguish  between  bureaucra¬ 
cies  in  the  private  and  public  spheres.  Generally, 
private  firms  measure  their  success,  if  they  are  pub¬ 
lic  stock  corporations,  in  the  confidence  of  their 
investors.  This  is  typically  reflected  in  the  value  of 
its  stock  price  and  its  trajectory.  In  the  public 
sphere,  however,  bureaucratic  agencies’  missions 
are  defined  by  legal  and  political  authority,  and 
usually  these  missions  are  articulated  in  ways  that 
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are  highly  general,  aspirational,  vague,  and  poten¬ 
tially  even  contradictory. 

Bureaucracies  may  be  effective  or  ineffective 
across  many  different  dimensions.  Some  dimen¬ 
sions  consider  their  prowess  in  politics  and  gaining 
power,  others  with  maintaining  their  autonomy 
and  organizational  culture,  and  yet  others  with 
procuring  resources  sufficient  for  the  tasks  to 
which  they  have  been  assigned.  Additional  dimen¬ 
sions  have  to  do  with  defining  and  gaining  accep¬ 
tance  for  favorable  accounts  of  success,  gaining 
reputation  for  innovation  and  efficient  manage¬ 
ment,  and  other  measurements  of  organizational 
outputs.  And  sometimes  bureaucratic  agencies  have 
the  good  (or  bad)  fortune  to  have  outcomes  move 
in  a  favorable  (or  unfavorable)  direction  despite 
their  own  activity.  The  perceived  quality  of  police 
agencies,  for  example,  is  often  related  to  the  rise 
and  fall  of  crime  rates  whether  or  not  the  police 
actually  are  directly  responsible  for  these  trends. 
Ironically,  the  higher  the  crime  rate,  the  more  the 
police  may  be  valued  whereas  the  lower  the  crime 
rate,  the  more  the  public  and  its  politicians  may 
find  downsizing  the  police  acceptable.  This  pro¬ 
duces  perverse  incentives  for  the  police  agencies, 
although  they  are  not  the  only  agencies  for  whom 
less  demand  threatens  their  resources. 

Resources  and  Alliances 

As  individuals,  we  generally  prefer  more  to  less. 
Organizations  are  no  different.  They  too  prefer 
having  greater  resources  to  work  with  than  fewer. 
For  one  thing,  it  makes  it  more  likely  that  agencies 
can  achieve  more  of  their  goals  than  not.  It  is  also 
a  sign  of  an  agency’s  power,  raising  the  stakes  for 
those  who  may  be  trying  to  cut  it  down  to  size. 
Having  more  resources  rather  than  fewer  also 
helps  maintain  morale  and  makes  it  more  possible 
to  retain  highly  qualified  employees. 

There  is  a  need  to  distinguish,  however,  between 
resources  that  are  mandated  under  law,  such  as 
pension  benefits,  and  discretionary  or  appropri¬ 
ated  resources.  Transfer  payments,  for  example, 
may  account  for  a  high  level  of  mandated  expendi¬ 
tures,  whereas  increasing  agency  personnel  or  giv¬ 
ing  more  discretion  as  to  how  personnel  may  be 
used  is  a  discretionary,  and  highly  valued,  resource. 

Ensuring  the  availability  of  slack  resources  (suf¬ 
ficiency  plus  a  reserve)  is  often  an  indicator  of  an 


organization’s  political  prowess.  Resource  availabil¬ 
ity  or  lack  thereof  is  also  a  function  of  the  priorities 
of  political  leadership  outside  the  organization.  In 
view  of  these  shifting  priorities  from  the  outside,  the 
ability  to  generate  support  and  allies  of  a  long¬ 
standing  nature  and  to  be  buffeted  from  political 
turbulence  is  the  mark  of  a  politically  effective  orga¬ 
nization.  Effective  engagement  with  stakeholders 
and  sources  of  political  support  and  the  ability  to 
make  its  functions  seem  indispensable  are  essential 
to  the  political  effectiveness  of  bureaucracy. 

Capability  and  Instrumentation 

No  organization  wants  to  be  caught  lacking  the 
capability  or  instruments  to  do  effectively  the  jobs 
for  which  it  is  tasked.  Bureaucracies  with  capital 
and  labor-intensive  functions — notably  in  defense, 
transportation,  infrastructure,  and  other  high- 
capital  investment  sectors,  including  science  and 
technology — often  have  two  competing  incentives. 
One  is  technical  efficiency,  defined  here  as  the 
most  effective  route  for  the  best  marginal  return  on 
resources.  Another  is  political  support,  emphasiz¬ 
ing  the  extent  to  which  capital  projects  can  spread 
largesse  sufficient  to  attract  and  sustain  political 
support  or  be  able  to  provide  a  compelling  narra¬ 
tive.  Sending  astronauts  to  Mars,  for  example, 
provides  a  more  compelling  narrative  than  sending 
robots,  though  the  scientific  merits  of  doing  so  are 
at  least  debatable.  Nearly  all  public  agencies  and 
especially  those  that  can  disperse  capital  projects 
face  the  choice  between  efficiency  and  political 
support.  It  is  necessary  to  distinguish  between  the 
instruments  and  capabilities  that  are  needed  to 
effectively  perform  an  agency’s  tasks  and  those 
needed  to  attract  substantial  political  support.  Too 
much  emphasis  on  the  first  likely  will  lead  to  insu¬ 
larity  and  even  irrelevance.  But  excessive  emphasis 
on  the  latter  frequently  leads  to  the  displacement 
of  goals  and  certainly  to  inefficiencies. 

Preserving  Organizational  Definition 

All  bureaucratic  organizations  face  conflicts 
between  their  missions  and  the  political  needs 
related  to  their  budgets  and  resources.  This  con¬ 
flict  can  be  viewed  as  a  struggle,  on  the  one  hand, 
between  the  desire  for  organizational  autonomy 
and  securing  the  integrity  of  an  organization’s 
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mission  from  outside  interference  and,  on  the 
other,  the  need  to  gain  broad  legitimacy  in  order  to 
sustain  organizational  functions.  As  organizations 
accept  more  functions  and  diversify  their  stake¬ 
holders,  they  run  the  risk  of  losing  control  over 
mission  definition.  Not  all  bureaucracies  desire  to 
absorb  new  functions  for  precisely  these  reasons, 
but  others,  perhaps  most,  prefer  more  to  less  and 
will  adapt  their  missions  to  gain  political  support. 

Typically,  bureaucracies  dealing  with  finance, 
treasury,  and  overhead  functions  in  financial 
accountability  have  been  able  to  minimize  external 
clientele  interference  in  their  affairs  and  even,  to 
some  degree,  political  interference.  Some  special¬ 
ized  scientific  and  technological  functions  as  well  as 
similarly  situated  military  or  intelligence  units  have 
also  managed  to  preserve  significant  autonomy 
partly  because  their  activities  seem  sufficiently  eso¬ 
teric  and  technical.  The  ability  of  bureaucracies  to 
sustain  autonomy  depends  on  creating  an  aura  of 
professionalism  and  success  on  matters  that  every¬ 
one  cares  about.  An  organization  whose  initial 
leadership  has  played  an  outsized  role  in  establish¬ 
ing  an  aura  of  invincibility  and  indispensability  can 
help  sustain  substantial  organizational  autonomy — 
and  resources.  From  an  agency’s  standpoint,  the 
combination  of  resources  and  autonomy  is  the  goal. 

Innovation  and  Adaptation 

So  far,  the  discussion  of  bureaucratic  effective¬ 
ness  has  focused  on  political  problems — procuring 
support,  gaining  resources,  ensuring  that  capabili¬ 
ties  exist  for  the  functions  that  have  been  assigned, 
and  preserving  organizational  autonomy.  These  are, 
however,  means  to  ends.  From  an  agency’s  stand¬ 
point,  those  means  provide  advantages  in  power 
struggles  and  enable  the  bureaucracy  to  control  its 
destiny.  Public  bureaucracies,  however,  are  sup¬ 
posed  to  do  more  than  harbor  resources.  Presumably, 
their  purpose  is  to  achieve  public  goals  efficiently. 

Thus,  another  way  of  examining  organizational 
improvements  is  to  assess  how  well  a  bureaucracy 
innovates  or  adapts  to  new  conditions.  Non- 
incremental  change  does  not  come  naturally  to 
bureaucracies  weighted  down  by  inertia.  Both 
experts  and  leaders  may  account  for  organizations’ 
adaptive  and  innovative  capabilities.  Technical 
experts  within  bureaucracies  can  play  an  important 
role  in  policy  innovations.  Organizational  leaders 


may  emphasize  the  importance  of  continuous  adap¬ 
tation.  Slogans  abound  to  call  attention  to  leaders 
who  claim  to  find  ways  to  innovate,  for  example, 
“doing  more  with  less.”  Whether  there  is  actual 
innovation  or  mere  cost-cutting  is  another  ques¬ 
tion.  Organizational  leaders  acquire  a  reputation 
by  appearing  to  be  innovative,  and  that  becomes 
useful  currency  to  the  leaders,  if  not  to  their  orga¬ 
nizations.  The  language  and  symbols  of  innovation, 
however,  usually  outpace  its  actual  occurrence. 

Innovation  is  itself  a  challenging  concept  in  the 
literature  on  bureaucratic  effectiveness.  It  is  not 
easily  defined.  One  thing  that  can  be  said  about  an 
innovation  is  that  it  represents  a  parametric  shift  in 
the  way  in  which  some  activity  is  conducted  or  in 
the  way  it  is  defined.  The  standard  assumption  is 
that  it  is  better  to  innovate  than  enervate. 

From  Goals  to  Metrics 

Increasingly,  bureaucracies  have  to  operationally 
define  their  goals  and  assess  their  progress  through 
quantitative  indicators.  These  procedures  may  be 
related  to  an  organization’s  actual  goals,  but  the 
goals  may  be  redefined  so  as  to  be  measurable.  To 
the  extent  that  metrics  are  outcomes  related  rather 
than  outputs  related,  there  likely  are  many  uncon- 
trollables  present. 

Policy  areas  especially  subject  to  extensive 
analysis  of  metrics  are  education,  health,  and 
crime.  An  analysis  of  outcomes  requires  distinc¬ 
tions  to  be  drawn  between  manipulable  matters 
such  as  program  design,  or  what  agencies  should 
emphasize,  and  nonmanipulable  matters,  such  as 
demographics  and  social  backgrounds.  Policies 
may  be  responsible  for  only  a  small  part  of  positive 
or  negative  change.  Bureaucratic  innovations  to 
improve  operations  and  outputs  can  be  taken  only 
within  the  existing  policy  parameters.  Ultimately, 
assessing  bureaucratic  activities  through  metrics 
requires  agreement  on  goals  and  on  what  the  met¬ 
rics  should  be.  In  areas  such  as  education,  debate 
continues  on  what  the  purpose  of  education  is  and 
how  narrowly  or  broadly  it  should  be  pursued. 

There  is  no  doubt  that  there  is  more  self-assess¬ 
ment  undertaken  by  bureaucratic  organizations 
and  more  assessments  demanded  by  leadership. 
Performance  evaluation  is  ongoing,  and  as  with 
any  measurable  activity,  much  depends  on  the 
validity  of  what  is  being  measured  and  how  the 
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outcomes  can  best  be  assessed.  All  assessments  are 
imperfect,  but  that  is  still  better  than  operating 
blindly.  As  agencies  demand  impact  assessments  in 
some  cases  from  those  whom  they  regulate,  their 
own  regulations  also  have  become  increasingly 
subject  to  cost-benefit  analytics. 

The  most  difficult  element  to  measure  in  an 
agency  often  is  management.  Outside  experts  and 
agencies  have  collaborated  in  the  United  States  to 
try  to  assess  organizational  goals  for  each  federal 
agency.  It  is  probably  not  terribly  surprising  to  learn 
that  less  controversial  agencies  tend  to  do  better  at 
clarifying  their  goals  or  that  agencies  with  higher 
proportions  of  civil  servants  among  their  senior 
executives  tend  to  be  managed  better.  Agencies  that 
are  politically  controversial  have  more  difficulties 
defining  goals  since  these  are  likely  to  change  with 
the  political  context.  These  agencies  also  tend  to 
have  a  higher  proportion  of  politically  appointed 
officials.  In  most  governments,  some  agencies  are 
regarded  as  too  important  to  fail,  and  party  politics 
is  minimized.  But  equally  in  most  systems,  other 
agencies  are  seen  as  offshoots  of  the  policies  of  cur¬ 
rent  governments  or  administrations,  and  some  are 
seen  as  indulging  in  pork  barrel  politics. 

In  sum,  performance  evaluation  is  an  imperfect, 
but  necessary,  science,  and  there  is  simply  no  ques¬ 
tion  that  organizations  are  being  required  to  know 
more  about  potential  problems,  continually  assess 
improvement  efforts,  and  evaluate  the  costs  of 
alternatives.  This  is  the  essence  of  a  scientific- 
rational  model  of  effectiveness  evaluation  in 
bureaucracies.  But  metrics  demand  clarity,  and  it  is 
easier  to  distill  an  appropriate  set  of  metrics  when 
there  is  clarity  about  goals.  That  tends  to  push 
against  the  world  of  politics,  which  in  turn  limits 
bureaucratic  effectiveness. 
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Purdue  University 
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Political  efficacy  is  one  of  the  most  important  top¬ 
ics  in  research  on  political  attitudes  and  political 
culture.  A  large  number  of  terms  such  as  ego 
strength,  self-esteem,  self-reliance,  subjective  or 
civic  political  competence,  or  (lack  of)  powerless¬ 
ness/futility  can  be  found  in  the  relevant  literature 
as  equivalents.  The  concept,  which  was  first  intro¬ 
duced  into  empirical  research  by  Angus  Campbell, 
Gerald  Gurin,  and  Warren  Edward  Miller  (1954), 
referred  to  the  citizen’s  self-perception  as  a  knowl¬ 
edgeable,  active,  and  self-confident  participant  in 
political  life.  According  to  Campbell  et  ah,  the 
sense  of  political  efficacy  refers  to  the  feeling  that 
individual  political  action  has  or  can  have  an 
impact  on  the  political  process.  Political  efficacy 
includes  cognitive  and  behavioral  components.  The 
cognitive  dimension  refers  to  an  individual’s  sub¬ 
jective  belief  that  he  or  she  is  able  to  understand 
what  is  going  on  in  politics,  while  the  behavioral 
component  taps  the  citizen’s  conviction  that  he  or 
she  can  play  an  active  role  in  politics,  exert  political 
influence,  and  thus  cause  political  leaders  and  the 
political  system  to  eventually  react  responsively  to 
the  citizen’s  actions.  The  following  describes  the 
development  and  measurement  of  the  concept  and 
then  turns  to  the  way  in  which  political  efficacy  is 
used  in  political  research. 

Development  and  Measurement 

In  the  first  stage  of  research,  political  efficacy  was 
regarded  as  a  unidimensional  concept  important  to 
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the  explanation  of  electoral  participation.  A  cumu¬ 
lative  scale  to  measure  feelings  of  political  efficacy 
consisting  of  five  items  was  developed  and  included 
in  the  American  National  Election  Survey  (ANES) 
of  1952.  The  items  were  worded  as  follows: 

1.  “I  don’t  think  public  officials  care  much  what 
people  like  me  think.” 

2.  “The  way  people  vote  is  the  main  thing  that 
decides  how  things  are  run  in  this  country.” 

3.  “Voting  is  the  only  way  that  people  like  me  can 
have  any  say  about  how  government  runs 
things.” 

4.  “People  like  me  don’t  have  any  say  about  what 
the  government  does.” 

5.  “Sometimes  politics  and  government  seem  so 
complicated  that  a  person  like  me  can’t  really 
understand  what’s  going  on.” 

Rejecting  Items  1,  3,  4,  and  5  and  accepting  Item  2 
indicated  feelings  of  political  efficacy. 

The  second  stage  of  research  was  characterized 
by  refining  the  theoretical  concept  of  political  effi¬ 
cacy  gradually  and  improving  the  measurement. 
From  a  theoretical  perspective,  Robert  E.  Lane’s 
study  Political  Life:  Why  People  Get  Involved  in 
Politics  (1959)  was  an  important  step  forward  in 
the  analysis  of  political  efficacy.  Lane  regarded  the 
prevailing  unidimensional  concept  as  too  simplis¬ 
tic  because  it  neglected  to  distinguish  the  individu¬ 
al’s  political  self-image  as  a  political  actor  from  the 
individual’s  orientation  toward  the  political  envi¬ 
ronment.  Accordingly,  he  proposed  a  distinction 
between  the  two  separate  dimensions  of  internal 
and  external  efficacy.  The  first  component,  inter¬ 
nal  political  efficacy,  entailed  the  belief  that  the 
normal  citizen  is  able  to  understand  and  influence 
politics.  External  political  efficacy  referred  to  the 
perceived  openness  and  responsiveness  of  the 
political  system  and  the  political  elites. 

Lane  himself  did  not  validate  the  distinction 
between  internal  and  external  efficacy.  But  his  con¬ 
tribution  served  as  a  starting  point  for  a  broad  and 
intense  methodological  debate  on  the  meaning  and 
the  measurement  of  the  concept.  Several  studies 
published  from  the  mid-1970s  on  dealt  with  the 
validity  of  the  concept  and  the  quality  of  the  items 
developed  to  measure  feelings  of  political  efficacy. 


Research  carried  out  in  this  context  followed  two 
different  lines:  on  the  one  hand,  validating  the  dis¬ 
tinction  of  internal  from  external  efficacy  and,  on 
the  other  hand,  improving  the  quality  of  the  items 
used  as  measures  of  these  two  facets  of  the  concept. 

A  first  important  contribution  to  the  validation 
of  the  concepts  of  internal  and  external  efficacy 
was  made  in  an  article  published  by  Balch  in  1974. 
In  a  construct  validation  based  on  external  criteria, 
Balch  showed  convincingly  that  internal  and  exter¬ 
nal  efficacy  correlated  with  different  indicators  of 
political  attitudes  and  behaviors:  While  internal 
efficacy  items  were  related  to  political  interest, 
knowledge,  and  conventional  forms  of  political 
participation,  external  efficacy  was  closely  linked 
to  trust  in  government  and  participation  in  uncon¬ 
ventional  political  activities,  particularly  civil  dis¬ 
obedience.  The  construct  of  internal  efficacy  was 
represented  by  the  “Voting  is  the  only  way”  and 
“Politics  is  too  complex”  items,  while  “Officials 
don’t  care”  and  “People  have  no  influence”  turned 
out  to  be  measures  of  external  efficacy.  One  of  the 
original  items  (“Vote  decides  how  things  are  run”) 
was  dropped  from  the  ANES  surveys  after  1952. 

During  the  1970s  and  1980s,  efficacy  items 
were  included  in  a  considerable  number  of  surveys. 
The  improvement  of  the  data  situation  encouraged 
a  considerable  number  of  methodological  studies 
aimed  to  validate  the  concept  further.  They  not 
only  developed  and  tested  additional  items  but 
they  also  used  more  sophisticated  strategies  of 
validation,  particularly  structural  equation  mod¬ 
els.  All  studies  confirmed  the  theoretical  and 
empirical  relevance  of  the  distinction  between 
internal  and  external  efficacy.  However,  they  did 
not  convey  clear  results  regarding  the  quality  of 
the  items  and  their  relationship  toward  the  con¬ 
structs  of  external  and  internal  efficacy.  To  improve 
the  quality  of  the  scales,  two  additional  measures 
of  external  efficacy  were  included  in  1968  in  the 
ANES:  (1)  “Generally  speaking,  those  we  elected 
to  Congress  in  Washington  lose  touch  with  the 
people  pretty  quickly”  and  (2)  “Parties  are  only 
interested  in  people’s  votes  but  not  in  their  opin¬ 
ions.”  These  latter  two,  together  with  the  item 
“Public  officials  don’t  care,”  were  mostly  used  to 
measure  external  political  efficacy,  while  the 
remaining  three  items  served  as  indicators  of  inter¬ 
nal  efficacy.  These  six  items  also  became  part  of 
opinion  surveys  conducted  outside  the  United 
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States  and  in  several  cross-national  surveys.  In 
most  recent  studies,  the  “voting”  item  is  no  longer 
used  as  a  measure  of  internal  efficacy  due  to  its 
ambiguity  but  replaced  by  some  new  items  better 
suited  for  measuring  the  citizens’  cognitive  capa¬ 
bility  and  feelings  of  competence.  The  distinction 
of  these  two  latter  subdimensions  of  feelings  of 
internal  efficacy  was  also  confirmed  empirically  by 
several  German  studies  published  in  the  1990s. 

Use  in  Political  Research 

The  concern  with  developing  appropriate  mea¬ 
sures  played  an  important,  if  not  dominant,  role  in 
research  on  political  efficacy.  However,  the  con¬ 
cept  was  also  applied  in  several  fields  of  substan¬ 
tive  research  on  peoples’  political  attitudes  and 
behaviors.  In  the  initial  stages  of  research,  feelings 
of  political  efficacy  were  not  so  much  regarded  as 
a  research  topic  in  its  own  right  but  rather  as  an 
explanatory  variable  in  analyses  of  political  par¬ 
ticipation.  It  was  assumed  and  confirmed  by 
empirical  research  that  feelings  of  political  efficacy 
were  positively  related  to  electoral  turnout  and 
other  forms  of  political  participation,  even  if  edu¬ 
cation  and  political  interest  were  held  constant.  In 
some  of  these  national  and  cross-national  empiri¬ 
cal  studies,  feelings  of  (internal)  political  efficacy 
were  introduced  as  independent  explanatory  con¬ 
cepts,  and  in  others,  they  were  considered  as  being 
part  of  a  broader  class  of  attitudes,  labeled  as 
“psychological  involvements  in  politics.”  The  tra¬ 
ditional  assumptions  on  the  direction  of  the  causal 
link  of  (internal)  political  efficacy  to  political  par¬ 
ticipation  were  challenged  by  some  empirical  stud¬ 
ies  in  the  1980s  and  1990s,  which  showed  a  recip¬ 
rocal  relationship  between  political  efficacy  and 
participation.  Accordingly,  feelings  of  political 
efficacy  not  only  encourage  people  to  become 
active  participants,  but  active  participation  in 
political  life  also  consolidates  and  strengthens  the 
citizens’  self-perception  as  competent  and  influen¬ 
tial  political  actors. 

Most  of  the  studies  mentioned  so  far  referred  to 
conventional  forms  of  political  participation.  As 
political  protest  became  more  widespread  in  the 
Western  world,  some  attempts  to  explain  the  new 
phenomenon  also  referred  to  political  efficacy. 
Some  studies  emphasized  the  interaction  of  politi¬ 
cal  distrust  and  political  efficacy  as  determinants 


of  different  types  of  political  behavior. 
Accordingly,  the  coincidence  of  low  efficacy  and 
low  trust  was  assumed  to  be  a  source  of  extreme 
political  disengagement,  because  people  charac¬ 
terized  by  a  likewise  combination  of  attitudes  did 
not  feel  sufficiently  competent  and  at  the  same 
time  doubted  that  government  officials  would 
behave  in  an  appropriate  way  if  in  the  face  of 
civic  activity.  On  the  other  hand,  a  combination 
of  high  efficacy  and  low  trust  was  seen  as  leading 
to  political  protest,  civil  disobedience,  and  some¬ 
times  even  political  aggression  or  violence,  because 
people  felt  influential  but  did  not  trust  the  gov¬ 
ernment  to  behave  appropriately.  People  who 
primarily  relied  on  conventional  or  reformist 
styles  of  political  influence  were  characterized  by 
a  strong  sense  of  political  efficacy  going  along 
with  strong  trust  in  politics.  Explanations  of  this 
type  became  familiar  under  the  heading  of  the 
“efficacy-distrust  hypothesis.”  However,  empiri¬ 
cal  support  for  this  assumption  was  rather  weak. 

Apart  from  research  on  political  participation, 
the  analysis  of  civic  self-consciousness  developed 
as  an  independent  field  of  research  in  the  1950s. 
Scholars  such  as  Gabriel  Almond  and  Sidney 
Verba,  Robert  Lane,  Paul  Sniderman,  and  Alex 
Inkeles  introduced  political  efficacy  and  related 
concepts  as  parts  of  the  democratic  personality  or 
of  a  broader  syndrome  called  “civic  culture”  or 
“participant  citizenship,”  which  was  considered 
an  important  cultural  prerequisite  of  a  modern, 
particularly  democratic,  political  community.  In 
their  study  The  Civic  Culture:  Political  Attitudes 
and  Democracy  in  Five  Nations ,  Almond  and 
Verba  (1963)  introduced  subjective  civic  compe¬ 
tence  as  an  indispensable  element  of  a  democratic 
political  culture.  According  to  them,  the  civic  cul¬ 
ture  is  characterized  by  a  pattern  of  political  atti¬ 
tudes  contributing  to  two  different  requirements  of 
democratic  politics:  the  power  to  govern  effectively 
on  the  one  hand  and  responsiveness  to  the  people’s 
demands  on  the  other.  While  they  conceived  polit¬ 
ical  trust  as  the  attitude  providing  power  to  elites 
to  make  and  implement  authoritative  decisions 
and  policies,  subjective  political  competence  func¬ 
tions  as  a  counterbalance  to  power  in  their  view. 
Only  competent  citizens  become  active  to  make 
their  voices  heard  in  political  life.  The  mere  per¬ 
ception  of  the  potentiality  of  civic  activity — and 
less  the  activity  in  itself — was  regarded  by  Almond 
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and  Verba  as  a  sufficient  condition  of  elite  and 
systemic  responsiveness.  The  idea  of  a  necessary 
balance  of  political  power  and  responsiveness 
(structure),  and  political  trust  and  efficacy  (cul¬ 
ture)  is  emphasized  by  several  theoretical  and 
empirical  contributions  to  research  on  political 
culture  and  democratic  citizenship.  Other  contexts 
in  which  political  efficacy  or  concepts  similar  to 
political  efficacy  can  be  found  are  studies  of  politi¬ 
cal  apathy  and  political  alienation.  In  research  on 
political  apathy  (or  lack  of  political  involvement), 
lack  of  political  efficacy  is  analyzed  together  with 
low  political  interest  and  political  knowledge,  and 
sometimes  political  participation.  Research  on 
alienation  distinguishes  between  several  compo¬ 
nents  of  the  respective  syndrome.  Feelings  of  pow¬ 
erlessness,  which  are  the  most  broadly  analyzed 
component  of  alienation,  are  measured  by  efficacy 
items  in  most  instances. 

Oscar  W.  Gabriel 
Universitat  Stuttgart 
Stuttgart,  Germany 
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Election  by  Lot 


The  election  of  public  officers  by  lots,  or  lottery, 
is  a  procedure  that  can  prevent  the  formation  of  a 


permanent  leading  group  and  diffuse  knowledge  of 
public  affairs  among  the  members  of  the  commu¬ 
nity.  It  can  be  an  appropriate  formula  in  settings  in 
which  an  assembly  of  members  or  representative 
council  makes  decisions  by  broad  consensus  or 
unanimity,  and  public  jobs  do  not  require  high 
technical  skills.  The  selection  of  delegates  by  turns 
and  the  subsequent  rotation  of  people  in  public 
offices  can  have  about  the  same  effects  as  lotteries. 
There  is  an  old  tradition  of  choosing  public  officers 
by  drawing  lots  that  can  be  found  in  ancient  and 
medieval  local  democracies,  modern  private  set¬ 
tings,  and  some  international  organizations.  In 
most  cases,  it  goes  together  with  the  central  role  of 
the  assembly  to  make  decisions  on  the  most  rele¬ 
vant  issues,  typically  by  consensual  agreement,  on 
the  assumption  that  the  identification  of  a  com¬ 
mon  interest  should  not  be  too  difficult  a  task. 

The  most  relevant  historical  experience  of  selec¬ 
tion  of  delegates,  representatives,  or  public  officers 
by  lots  was  developed  in  Athens  during  the  demo¬ 
cratic  period  from  the  mid-5th  century  to  the  end 
of  the  4th  century  BCE.  On  the  basis  of  this  experi¬ 
ence,  the  philosopher  Aristotle  built  his  concept  of 
democracy,  which  included  the  possibility  of  “rul¬ 
ing  and  being  ruled  by  turns.”  In  Book  IV,  Chapter 
15,  of  his  Politics,  Aristotle  introduced  a  sharp 
distinction  between  the  appointment  of  magistrates 
by  lot,  which  was  thought  to  be  democratic,  and 
their  election,  which  was  considered  oligarchic.  By 
democracy,  he  meant  government  by  all  the  peo¬ 
ple,  while  election  entailed  government  by  an  aris¬ 
tocracy  consisting  of  the  few  best,  which  could 
lead  to  a  perverse  form  of  oligarchy.  This  classical 
criterion  was  taken  up  in  the  18th  century  by  the 
French  provincial  Charles-Louis  de  Secondat, 
baron  de  la  Brede  et  de  Montesquieu,  who  regarded 
suffrage  by  lot  as  “natural  to  democracy.” 
According  to  Montesquieu,  the  advantages  of  mak¬ 
ing  choices  by  lot  are,  first,  that  it  “is  unfair  to 
nobody,  and  [second,  that]  it  leaves  each  citizen  a 
reasonable  hope  of  serving  his  country”  (The  Spirit 
of  Laws,  Book  2,  chap.  2).  Likewise,  in  Book  IV  of 
The  Social  Contract,  the  Genevan  philosopher 
Jean-Jacques  Rousseau  highlighted  the  role  of  lots 
in  an  ideal  democracy,  in  which,  according  to  his 
Athens-inspired,  assembly-based  model,  public 
offices  should  be  considered  “a  burdensome 
charge,”  and  administrative  acts  should  be  reduced 
as  much  as  possible. 
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The  choosing  of  public  officers  by  lots  may 
have,  thus,  two  types  of  advantages.  First,  by  hav¬ 
ing  this  done  frequently  and  establishing  short 
terms  of  office,  it  can  produce  a  high  rotation  of 
members  in  administrative  or  arbitral  posts,  thus 
preventing  the  formation  and  self-reproduction  of 
permanent  and  closed  elite,  whether  in  the  form  of 
an  economic  oligarchy  or  a  class  of  professional 
politicians.  By  replacing  public  officials  frequently 
and  opening  public  jobs  to  wide  layers  of  society, 
no  one  can  be  blamed  for  making  or  implementing 
unpleasant  decisions,  but  no  one  can  be  praised 
either.  Only  the  assembly  members  remain  ulti¬ 
mately  responsible  for  the  consequences  of  collec¬ 
tive  decisions. 

The  second  type  of  advantage  to  choosing  by  lots 
and  the  subsequent  rotation  in  public  offices  is  the 
production  of  wide  dispersal  of  knowledge  of 
political  and  administrative  affairs  among  the  citi¬ 
zens.  The  experience  of  learning  and  becoming 
familiar  with  the  problems  of  satisfying  collective 
common  interests  can  be  a  good  platform  for  fur¬ 
ther  occasions  of  participation  in  voting  and  elec¬ 
tions,  such  as  the  assembly’s  decision  making  and 
the  choice  of  some  other  public  officials. 

Thus,  a  lottery  can  be  an  appropriate  procedure 
to  select  public  officers  where  there  is  a  clear  iden¬ 
tification  of  the  common  interest  of  the  members 
of  the  community,  there  are  relatively  low  techni¬ 
cal  requirements  to  fill  some  public  jobs,  and  there 
are  alternative  solid  instruments,  such  as  the 
assembly  of  members,  through  which  the  commu¬ 
nity  can  make  other  important  decisions,  including 
control  of  those  appointed  by  lots. 

In  the  Athenian  democracy,  in  order  to  preserve 
the  central  role  of  the  assembly  to  make  decisions 
by  acclamation  or  assent,  the  Council  of  500  mem¬ 
bers  in  charge  of  setting  the  agenda  for  the  assembly 
was  formed  of  50  members  selected  by  lot  by  each 
of  the  10  tribes,  which  had  evolved  from  military 
into  basic  administrative  units.  The  permanent 
committee  of  the  Council  and  its  president  were 
also  selected  by  lots.  About  600  of  the  other 
700  public  officers  were  also  selected  by  lots  from 
among  candidates  previously  presented,  including 
the  following:  the  10  members  of  the  archonship, 
approximately  equivalent  to  the  post  of  the  modern 
attorney  general,  as  well  as  the  body  in  charge  of 
organizing  religious  ceremonies,  who  were  appointed 
by  lots  from  a  pool  of  candidates  previously  selected 


by  each  tribe,  also  by  lots;  the  tribunal  members, 
chosen  by  lots  from  a  pool  of  all  adult  citizens,  who 
were  in  charge  of  passing  judgment  on  the  legality 
of  the  conduct  of  public  officials;  and  those  holding 
a  number  of  administrative  jobs,  encompassing 
treasurers,  those  in  charge  of  settling  public  con¬ 
tracts  and  collecting  public  revenues,  and  those 
supervising  streets  or  inspecting  markets.  The  pro¬ 
cedure  of  selecting  candidates  for  public  offices  by 
lots  was  based,  initially,  on  candidates  drawing 
white  and  black  beans  from  a  container  with  an 
open  top.  In  a  further  development,  Athenians  also 
used  allotment  machines,  usually  a  tube  in  which 
balls  could  be  inserted  at  random  and  released  at 
the  other  end. 

Among  further  occurrences,  the  first  apostles  of 
Jesus  drew  lots  to  select  the  replacement  for  the 
traitor  Judas  Iscariot,  according  to  the  Acts  of  the 
Apostles.  On  the  basis  of  this  precedent,  some 
early  nonorthodox  Gnostic  Christians  drew  lots  at 
each  of  their  meetings  to  elect  priests,  bishops,  and 
other  officers.  This  device  could  also  be  aimed  at 
preventing  a  sacerdotal  oligarchy  from  developing. 
But  the  Christian  Church  condemned  such  a  prac¬ 
tice  as  blasphemy  and  solemnly  forbade  the  choice 
of  priests,  bishops,  or  other  prelates  by  lot,  more 
formally  after  the  1 3th  century. 

A  number  of  late-medieval  and  renaissance  city- 
republics  and  communes  around  the  Mediterranean 
Sea  used  lots  for  choosing  magistrates  and  allocat¬ 
ing  officers  in  charge  of  implementing  assembly 
decisions.  This  was,  in  particular,  the  case  in  Venice 
for  the  indirect  election  of  the  Duke  (Doge)  from 
the  13th  century  following  direct  election  by  the 
people’s  assembly  from  the  end  of  the  7th  century. 
The  popularly  elected  Great  Council  adopted  an 
increasingly  complicated,  5-day-long  procedure  to 
choose  the  Duke  with  up  to  nine  stages  of  approval 
ballots  and  lots,  which  was  conceived  with  the  aim 
of  making  manipulative  maneuvers  impossible. 
Likewise,  the  Florentine  republics  during  the  14th 
and  15th  centuries,  and  again  in  the  late  15th  and 
early  16th  centuries,  elected  their  main  governmen¬ 
tal  body,  the  Lordship  (Signoria),  chaired  by  the 
standard  bearer  of  justice  (Gonfaloniere),  by  means 
of  a  complex  system  of  approval  ballots  and  very 
frequent  lots.  Again,  the  aim  was  to  prevent  fraud, 
manipulation,  and  the  commune’s  domination  by  a 
few  powerful  families.  Also,  in  Barcelona,  at  least 
from  the  15th  century,  the  popularly  elected 
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Council  of  100  chose  the  members  of  the  Consulate 
of  the  Sea,  the  judicial  body  for  commercial  and 
maritime  affairs,  by  an  indirect  procedure  involv¬ 
ing  lots. 

Finally,  lots  were  still  being  used,  in  combination 
with  several  stages  of  indirect  elections,  in  Spain 
and  the  Spanish  colonies  in  the  Americas  in  the 
early  19th  century.  First,  elections  called  by  the 
central  junta  formed  to  organize  the  resistance 
against  Napoleon’s  troops  were  held  in  1809,  with 
municipalities  electing  candidates  for  deputies  that 
were  finally  selected  by  lot.  New  elections  in  1810 
to  form  an  extraordinary  assembly  (“Cortes”), 
which  gathered  in  Cadiz  and  produced  a  new  con¬ 
stitution,  also  involved  a  combination  of  indirect 
elections  in  three  stages  and  a  final  selection  of  one 
deputy  in  each  district  by  lots  among  the  two  or 
three  candidates  previously  chosen.  After  the 
approval  of  the  so-called  constitution  of  Cadiz  in 
1812,  this  type  of  procedure  was  not  used  again  in 
Spain,  but  it  was  followed  in  some  further  elections 
in  Spanish  America.  Specifically,  in  Buenos  Aires, 
indirect  elections  of  colleges  (usually  called  “jun¬ 
tas”)  led  to  the  final  selection  of  members  of  the 
provincial  assembly  by  lots  in  1811,  while  a  mixed 
procedure  of  voting,  lots,  and  final  popular  vote  by 
plurality  was  used  for  the  election  of  governors  in 
1815.  In  Mexico,  local  elections  in  1812  involved 
some  stage  of  selection  of  candidates  by  lots.  In 
Chile,  a  combination  of  lots  and  plurality  voting 
was  still  being  used  in  1822.  Lots  remained  the 
usual  practice  in  indigenous  communities  that  were 
not  politically  integrated  into  the  new  independent 
states’  political  institutions.  Ironically,  they  became 
part  of  the  supposedly  traditional  “usages  and  cus¬ 
toms”  of  the  indigenous  people  to  be  preserved  in 
the  21st  century — although  the  use  of  lots  was  actu¬ 
ally  the  most  visible  legacy  of  Spanish  colonial  rule. 

The  choice  of  public  officers  by  lots  was  replaced 
by  election  of  representatives  based  on  popular 
votes  as  new,  increasingly  large  communities  and 
modern  states  addressed  collective  issues  of  higher 
complexity  and  as  different  interests  and  values 
developed  among  their  citizens.  In  current  times, 
lots  are  used  as  a  method  to  distribute  goods  and 
responsibilities  in  some  private  corporations  as  well 
as  for  allocating  temporary  jobs,  vacation  periods, 
or  household  tasks  in  other  private  settings.  For 
public  affairs,  they  are  used  in  some  countries  for 
certain  sectoral  or  relatively  minor  tasks,  such  as 


selecting  jurors  for  jury  trials,  appointing  election 
administrators,  breaking  election  ties,  or  selecting 
candidates  for  military  service. 

At  higher  political  levels,  similar  procedures  are 
used  in  certain  institutional  settings  in  confederal 
or  international  organizations  when  members 
require  that  decisions  be  made  only  by  near  una¬ 
nimity  or  a  broad  consensus.  Some  of  them  use,  in 
particular,  procedures  of  rotation  by  turns  of  high 
public  offices,  which,  a  priori  and  in  the  long  term, 
produce  the  same  effect  of  random  selection  as  lot¬ 
teries.  Amajor  example  is  the  Flelvetic  Confederation 
of  Switzerland,  which  is  still  mainly  an  instrument 
for  preserving  popular  self-government  of  the  com¬ 
munes  and  the  cantons,  where  the  presidency  of  the 
Federal  Council  is  filled  in  rotation  among  the 
Council  members  by  turns.  Likewise,  for  a  long 
period,  the  chairmanship  of  the  European  Council 
was  held  in  6-month  turns  by  the  member  states. 
The  United  Nations  Organization  also  distributes 
some  high  offices  by  informal  rotation  and  turns 
among  its  member  states.  The  presidency  of  the 
General  Assembly  is  filled  by  “symmetric  rotation” 
among  countries  of  the  five  regional  groups  (Africa, 
Asia,  Latin  America  and  the  Caribbean,  Western 
Europe  and  other  developed  countries,  and  Eastern 
Europe).  The  Security  Council,  which  works  by 
near-unanimity  decisions  of  the  five  permanent 
members,  is  also  formed  by  a  number  of  temporary 
members  rotating  in  post  for  periods  of  2  years. 
They  are  formally  elected  by  the  Assembly,  but 
they  must  also  be  distributed  fairly  from  among  the 
different  world  regions,  including  two  for  Latin 
America  and  the  Caribbean,  two  for  Western 
Europe,  one  for  Eastern  Europe,  and  five  for  Africa 
and  Asia.  Similar  proportions  are  used  for  filling, 
by  informal  turns,  the  posts  in  the  Economic  and 
Social  Council  and  other  committees. 

Josep  M.  Colomer 
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Election  Observation 


Election  observation  entails  “the  purposeful  gath¬ 
ering  of  information  regarding  an  electoral  process, 
and  the  making  of  informed  judgments  on  the  con¬ 
duct  of  such  a  process  on  the  basis  of  the  informa¬ 
tion  collected,  by  persons  who  are  not  inherently 
authorized  to  intervene  in  the  process”  (UN  Code 
of  Practice  quoted  in  Harris,  1997,  p.  27). 

Thus,  election  observation  is  not  only  central  to 
but  also  has  become  part  of  the  electoral  process 
worldwide,  especially  in  emerging  democracies. 
The  purposes  of  electoral  observation  are  to  pro¬ 
mote  transparency  and  encourage  free  and  fair 
elections.  Openness  is  an  essential  element  of 
ensuring  confidence  in  the  election  process.  Election 
observation  helps  strengthen  the  democratic  pro¬ 
cess  and  institutions,  enhances  the  value  of  elec¬ 
tions,  and  instills  confidence  in  the  outcome  of  the 
election  (John  Dugard,  1998).  Election  observation 
is  expected  to  be  nonpartisan,  so  that  all  interested 
parties  can  accept  the  election  observers’  findings 
on  the  observed  elections.  For  an  election  observa¬ 
tion  to  thrive,  it  has  to  be  credible  in  the  eyes  of 
contesting  parties.  This  trustworthiness  comes 
from  the  election  observers’  refusal  to  take  sides 
and  the  impartial  and  fair-minded  nature  of  the 
observation  endeavor. 

Transparency  in  the  electoral  process  is  of  criti¬ 
cal  importance  in  ensuring  public  confidence  in  the 
election  system  and  recognition  of  the  election 
outcome  (Carl  Dundas,  1994).  Election  observers 
might  persuade  some  citizens  and  politicians  to 
take  part  in  the  electoral  process  rather  than  to 
resort  to  violence,  as  was  the  case  in  Angola  and 
Mozambique.  In  Zambia,  in  1991,  observers 
calmed  intense  disagreements  over  electoral  rules 


and  regulations  by  asking  for  concessions  from  gov¬ 
ernment  and  electoral  officials  on  areas  of  disagree¬ 
ment  (National  Democratic  Institute  for 
International  Affairs,  1992).  Thus,  as  Gerhard 
Totemeyer  and  Denis  Kadima  (2000)  note,  election 
observation  can  perform  a  constructive  function  in 
generating  an  environment  favorable  for  everyone 
to  take  part  in  general  elections.  Election  observa¬ 
tion  adds  value  to  the  electoral  process  and  thereby 
gives  it  legitimacy.  Moreover,  the  involvement  of 
observers  may  also  encourage  a  government  that 
has  lost  power  to  recognize  the  outcome  and  there¬ 
fore  step  down.  Nicaragua  in  1990  under  President 
Daniel  Ortega  and  Zambia  in  1991  under  President 
Kenneth  Kaunda  are  cases  in  point.  In  this  way, 
election  observation  helps  promote  stability. 
International  observers  may  be  asked  to  arbitrate 
disagreements  among  contesting  political  organiza¬ 
tions  in  an  attempt  to  lessen  hostilities  prior  to, 
during,  and  in  the  postelection  periods.  According 
to  the  National  Democratic  Institute  for  Intern¬ 
ational  Affairs  (1992),  for  an  election  outcome  to 
be  acceptable  to  all  interested  parties,  it  should  not 
only  be  seen  as  free  and  fair  but  should  also  not  be 
marred  by  allegations  of  fraud.  Thus,  although  free 
and  fair  elections  are  not  a  sufficient  condition 
toward  democratic  consolidation,  they  are  central 
to  democracy  development  because  of  “their  ability 
to  jump-start  the  process  of  democratisation  and 
boost  the  morale  of  prodemocracy  forces”  (Neil 
Nevitte  &  Santiago  Canton,  1997,  p.  51). 

Historical  Origins 

Previously,  journalists,  academics,  and  embassy 
staff  observed  elections  in  foreign  countries. 
However,  following  World  War  I,  political  partici¬ 
pation  in  government  came  to  be  accepted  as  a 
fundamental  right  of  all  citizens;  since  then,  elec¬ 
tion  observation  has  been  institutionalized  interna¬ 
tionally.  The  United  Nations  initially  took  part  in 
election  observation  in  South  Korea  in  1948, 
because  monitoring  the  elections  in  countries  com¬ 
ing  out  of  dictatorial  military  regimes  or  authori¬ 
tarian  rules  was  considered  essential.  Since  then, 
election  observation  has  become  common  and  is 
used  in  developed  and  developing  countries. 
Subsequently,  a  number  of  organizations,  such  as 
the  Commonwealth  in  1971  and  the  Conference 
on  Security  and  Cooperation  in  Europe  in  1990, 
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became  involved  in  election  observation.  There  has 
been  an  increasing  acceptance  of  the  contribution 
observation  makes  to  the  election  process  not 
withstanding  its  limitations. 

As  M.  Kupe  (2000)  has  noted,  participation  of 
observers  could  help  decrease  propensities  toward 
improper  practices  by  overzealous  political  parties 
or  government  administrators  in  charge  of  elec¬ 
tions.  The  involvement  of  observers  during  voting 
and  at  the  count  can  help  calm  down  volatile  elec¬ 
tion  environments  and  is  thought  to  encourage  the 
openness  of  both  the  polling  and  counting  of  the 
ballot  papers  (Dundas,  1994).  The  presence  of 
international  observers  is  intended  to  promote 
confidence  in  the  process,  to  discourage  electoral 
irregularities,  and  to  give  an  account  on  the  impar¬ 
tiality  of  the  elections  to  the  international  commu¬ 
nity.  Election  observation  ensures  that  the  electoral 
outcome  is  credible  internationally  in  the  light  of 
specific  democratic  conditions  imposed  by  donors 
who  provide  economic  assistance  and  other  help. 
This  explains  why  election  observation  has  become 
so  important  in  recent  years,  especially  in  countries 
undergoing  political  change. 

In  established  democracies,  election  observation 
seeks  to  act  as  an  example  to  emerging  democracies. 
Election  observation  is  not  merely  about  verifying 
the  fairness  of  an  election;  instead,  and  more  cru¬ 
cially,  observation  efforts  strengthen  the  legitimacy 
of  democratic  norms  and  procedures  to  thousands  of 
people  and,  thus,  help  in  building  and  strengthening 
a  culture  of  democracy.  It  is  in  this  context  that  elec¬ 
tion  observation  has  come  to  be  accepted  as  a  key 
part  of  the  democratic  process  because  such  efforts 
seek  to  promote  transparency  and  accountability — 
the  central  tenets  of  democracy. 

Election  observation  missions  take  different 
shapes.  Their  diversity  depends  on  the  type  and 
size  of  the  organization  involved,  the  organiza¬ 
tion’s  period  of  stay  in  the  country  holding  elec¬ 
tions,  and  the  kind  of  report  the  organization 
produces  after  the  elections.  Nevertheless,  there 
are  certain  experienced  and  credible  election 
observers — the  Commonwealth,  the  European 
Union,  and  the  Carter  Center  for  Democracy  in 
Atlanta,  Georgia — that  have  greater  resources  for 
observation  than  others.  Generally,  election 
observers  oversee  the  administrative  arrangements; 
preparations  of  the  electoral  authority;  the  behav¬ 
ior  of  election  officials,  police,  party  agents,  and 


voters;  the  sealing  of  ballot  boxes  before  and  after 
voting;  the  escorting  of  boxes  to  the  counting  cen¬ 
ter;  and  counting  (The  Catholic  Justice  and  Peace 
Commission,  1994).  The  observers  are  expected  to 
produce  a  report,  which  highlights  the  problems 
noted,  and  make  recommendations.  They  are  also 
expected  to  determine  whether  the  elections  were 
free  and  fair.  As  Kupe  (2000)  states, 

Election  observers  are  “watchdogs  working  for 
the  electorate  and  the  political  parties  involved  in 
the  elections.  It  is  their  business  to  make  sure  that 
elections  are  conducted  properly  during  the 
prescribed  times  and  at  the  designated  venues.  It 
is  the  observer  who  reports  directly  or  indirectly 
to  the  outside  world  regarding  the  fairness,  or 
otherwise,  of  the  election  process,  (p.  102) 

Types  of  Election  Observers 

There  are  two  main  types  of  observer  groups:  local 
and  international  groups.  Local  observers,  who  are 
normally  citizens  of  the  country  whose  elections 
are  being  monitored,  have  an  advantage  over  inter¬ 
national  observers  in  that  they  are  familiar  with 
the  country  concerned.  However,  although  local 
observers  promote  transparency  in  the  electoral 
process,  they  are  often  treated  with  suspicion  as 
having  a  different  agenda  and  thus  not  seen  as 
impartial.  On  the  other  hand,  international  observ¬ 
ers  “have  added  a  new  dimension  to  election  trans¬ 
parency”  (Dundas,  1994,  p.  45).  Nevitte  and 
Canton  (1997)  observed  that  “public  confidence  in 
internationally  driven  [observation]  efforts  charac¬ 
teristically  hinges  on  the  reputation  and  legitimacy 
of  the  international  or  regional  organization 
involved,  and  derives  in  large  part  from  the  multi¬ 
national  membership  of  the  observation  team  in 
place”  (p.  50).  For  Phiroshaw  Camay  and  Anne 
Gordon  (1999)  “international  observers  bring  an 
added  credibility  to  the  monitoring  and  assessment 
of  elections,  in  that  they  are  able  to  refer  to  their 
experience  elsewhere  and  apply  international  stan¬ 
dards  of  good  practice  wherever  they  go”  (p.  259). 
Such  observations  demonstrate  that  international 
observers  act  as  a  stamp  of  approval. 

Moreover,  observer  groups  can  use  different 
approaches  to  observe  elections.  These  can  take 
the  form  of  regular  short  visits,  permanent  groups, 
mobile  teams,  or  stationary  teams.  Nevertheless, 
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election  observers  face  a  number  of  limitations. 
First,  as  Dundas  (1994)  noted,  “there  are  limita¬ 
tions  to  the  extent  to  which  observers  generally 
can  impact  on  the  transparency  of  the  system, 
since  they  are  not  in  charge  of  the  machinery 
which  runs  the  election.  ”  Second,  the  other  limita¬ 
tion  faced  by  election  observers  is  that  “they  have 
a  limited  time  to  see  only  the  final  days  of  the 
campaign  leading  up  to  the  polls,  and  sometimes 
many  leave  before  the  final  results  are  known” 
(p.  45).  This  limit  on  the  time  available  for  obser¬ 
vation  compromises  the  role  of  election  observers 
in  promoting  transparency  and  accountability  in 
the  electoral  process. 

Election  observation  is  common  in  countries 
undergoing  political  transformation  from  auto¬ 
cratic  regimes  to  multiparty  systems.  The  uncer¬ 
tainty  and  confusion  in  these  countries  create 
concerns  about  whether  election  administrations 
can  deliver  accurate  and  impartial  electoral  results. 
International  observation  of  elections  in  Africa 
initially  occurred  as  nations  made  the  transition 
from  colonial  rule  to  independence  (in  Zimbabwe 
in  1980,  Namibia  in  1989)  or  when  there  was  no 
centrally  controlled  authority,  as  in  Uganda  in 
1980.  In  1991,  Zambia  became  the  first  indepen¬ 
dent  African  country  to  ask  for  international  elec¬ 
tion  observers.  According  to  the  National 
Democratic  Institute  for  International  Affairs, 
local  and  external  election  observers  can  help 
ensure  that  elections  are  free  and  fair,  which  in 
turn  can  contribute  to  the  development  of  account¬ 
able,  effective  governance.  International  election 
observation  provides  evidence  of  the  extent  to 
which  a  regime  is  committed  to  a  variety  of  demo¬ 
cratic  values  and  procedural  norms.  It  also  gives 
voters  a  critical  opportunity  to  challenge  authori¬ 
tarian  governments  (Nevitte  &  Canton,  1997). 
Such  observation  occurs  primarily  in  cases  where 
there  are  doubts  that  a  free  and  fair  election  can  be 
conducted  (Ananias  Elago,  1999).  As  Bojosi 
Otlhogile  (1994)  notes,  these  transitional  countries 
are  attempting  to  reconstruct  themselves  politically 
and  seek  affirmation  from  observers  that  elections 
were  fairly  conducted  and  that  they  meet  the  mini¬ 
mum  requirements  of  an  international  standard. 
Such  validation  can  help  these  countries  regain 
their  places  in  the  international  community. 

As  Otlhogile  notes,  countries  undergoing  trans¬ 
formation  attract  international  observers  because 


there  is  unease  surrounding  an  election  in  such  a 
country.  Observation  is  meant  to  reestablish  confi¬ 
dence  among  those  contesting  such  an  election  as 
well  as  among  foreign  investors  and  the  inter¬ 
national  community  generally. 

Notwithstanding  the  importance  of  election 
observation  in  promoting  transparency  and 
accountability  in  the  electoral  process,  a  few  obser¬ 
vations  can  be  made  about  election  observers. 
Although  election  observers  are  widely  appreciated 
and  at  times  overemphasized,  their  findings  have 
never  resulted  in  a  reelection  in  any  country, 
regardless  of  the  assessment.  The  April  and 
December  2003  elections  in  Nigeria  and  Russia, 
respectively,  are  cases  in  point.  In  fact,  election 
observation  has  become  a  standard  institution  that 
is  expected  to  contribute  to  the  democratic  process, 
especially  for  the  reasons  discussed  above. 
Moreover,  even  if  they  are  expected  to  be  impartial, 
an  element  of  bias  cannot  be  ruled  out.  However, 
their  reports  might  indirectly  prompt  people  to 
revolt,  resulting  in  a  new  election. 

David  Sebudubudu 
University  of  Botswana 
Gaborone,  Botswana 
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Election  Research 


As  the  defining  institution  of  democracy,  elections 
have  always  been  a  major  focus  of  attention  both 
for  political  theorists  and  analysts — and  for  jour¬ 
nalists  and  practicing  politicians.  Indeed,  it  is  hard 
to  talk  about  democracy,  either  generally  or  at  a 
national  level,  without  referring  to  elections.  This 
entry  focuses  on  specialized  research,  mostly  quan¬ 
titative,  into  voting  at  general  elections.  It  thus 
passes  over  the  interesting  and  growing  literature 
on  direct  policy  elections  (referendums  and  initia¬ 
tives)  since  they  are  still  not  a  major  feature  of 
contemporary  democracies  except  in  Switzerland 
and,  at  the  state  level,  in  the  United  States.  On  the 
other  hand,  the  choices  voters  make  are  framed  by 
political  parties,  so  the  entry  has  to  consider  their 
activities,  particularly  as  they  impinge  on  voting. 

Demographics  and  Political  Geography 

While  analyses  of  party  organization  and  cam¬ 
paigning  appeared  from  the  latter  part  of  the  19th 
century  onward,  the  tradition  of  systematic  quan¬ 
titative  studies  of  voting  statistics  began  with 
Andre  Siegfried  in  France,  Harold  Gosnell  in  the 
American  Midwest,  and  V.  O.  Key  in  the  American 
South  from  1900  onward.  Such  analyses  used 
demographic  data — social  and  voting  statistics, 
generally  from  the  smallest  constituency  unit  avail¬ 
able — to  make  sophisticated  inferences  about  indi¬ 
vidual  voting  behavior  and  its  influence  on  both 
personal  characteristics  and  political  history. 

This  tradition  of  research,  however,  passed  out 
of  the  mainstream  under  the  impact  of  two  devel¬ 
opments.  The  first  was  the  discovery  of  the  eco¬ 
logical  fallacy  involved  in  inferring  individual 


behavior  from  aggregate  statistics.  For  example, 
the  finding  that  constituencies  in  the  United 
Kingdom  with  a  large  Black  population  voted  dis¬ 
proportionately  for  the  racist  British  National 
Party  (BNP)  does  not  imply  that  the  BNP  attracts 
non-White  votes.  On  the  contrary,  it  shows  that 
Whites  in  such  constituencies  vote  for  it  dispropor¬ 
tionately,  under  the  impact  of  high  immigration.  It 
is  hard  to  tell  from  aggregate  statistics  what 
exactly  is  going  on  at  the  individual  voter  level. 
The  second  development  relates  to  the  use  of  mass 
survey  techniques. 

Survey-Based  Research  and 
Social  Group  Theory 

In  addition  to  difficulties  involving  the  ecological 
fallacy,  the  problem  for  the  statistical  foundations 
of  demographic  research  was  compounded  by  the 
application  of  mass  survey  techniques  to  study 
individual  voters  directly.  This  approach  was  pio¬ 
neered  by  Paul  Lazarsfeld  and  his  associates  in  Erie 
County,  New  York,  for  the  U.S.  presidential  elec¬ 
tion  of  1940,  and  replicated  in  Elmira,  New  York, 
in  1948.  Not  only  did  these  studies  demonstrate 
the  feasibility  of  applying  statistical  techniques  of 
sampling  and  quantitative  analysis  to  voting,  but 
they  also  demonstrated  the  way  in  which  data  col¬ 
lection  and  analysis  had  to  be  driven  by  theory  to 
produce  relevant  conclusions  or  even  to  develop  a 
questionnaire  in  the  first  place. 

Coming  from  a  market  research  background, 
Lazarsfeld  envisaged  voters  as  behaving  like  con¬ 
sumers  choosing  between  tins  of  beans  on  the 
supermarket  shelves.  Relying  solely  on  advertising 
for  their  information,  brands  promoted  more  in 
the  media  would  sell  better  than  ones  promoted 
less.  The  1940  questionnaire  thus  featured  many 
questions  on  voter  media  exposure  (with  parallel 
content  analyses  of  actual  media  coverage  of  can¬ 
didates),  with  a  few  questions  on  social  character¬ 
istics  thrown  in  for  classification  purposes.  To 
their  surprise,  the  research  team  found  few  and 
weak  correlations  between  media  coverage  and 
voting  behavior  but  strong  correlations  with  class, 
religion,  and  urban  versus  rural  residence. 

In  response,  they  jettisoned  the  media  and 
adopted  instead  a  social  group  theory  of  voting 
behavior,  measured  through  an  Index  of  Political 
Predisposition  (IPP),  which  very  roughly  measured 
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probabilities  for  individuals  with  varying  combina¬ 
tions  of  social  characteristics  voting  Democrat  or 
Republican.  On  this  basis,  the  questionnaire  for  the 
1948  U.S.  presidential  election  sought  to  understand 
how  voters’  flow  of  information  and  voting  prefer¬ 
ences  were  structured  by  their  social  group  member¬ 
ship,  pulling  them  back  to  the  historical  group 
choice  in  the  course  of  the  campaign.  Social  group 
theory  reached  its  apotheosis,  with  the  help  of  com¬ 
puters,  when  Ithiel  de  Sola  Pool,  Samuel  Popkin, 
and  Paul  Abelson  pooled  all  existing  public  opinion 
surveys  of  American  voters  to  create  demographic 
profiles  of  the  U.S.  states  and  the  impact  of  current 
issues  on  small  subgroups  of  voters.  On  the  basis  of 
these  simulations,  they  correctly  predicted  the  result 
of  the  1960  and  1964  presidential  elections  in 
advance,  achieving  high  correlations  with  the  actual 
electoral  college  votes  of  the  non-Southern  states. 

The  Michigan  School:  Party 
Identification  and  Issue  Impacts 

The  publication  by  Angus  Campbell,  Philip 
Converse,  Warren  Miller,  and  Donald  Stokes  of 
The  American  Voter  in  1960  not  only  substituted 
national  mass  surveys  for  local  ones  but  also  pro¬ 
posed  an  alternative  conceptualization  of  voting 
choice  to  social  group  theory.  It  was  not  that  this 
“Michigan  School”  ignored  the  influence  of  social 
factors.  They  saw  them,  however,  as  lying  rather 
far  back  in  the  “funnel  of  causality”  preceding  the 
voting  act.  Their  influence  was  channeled  through 
an  individual’s  party  identification — their  affective 
orientation  to  an  important  group  object  in  their 
environment.  Social  influences  were  thus  trans¬ 
muted  into  individual  psychology.  Party  loyalty 
then  colored  the  voters’  reactions  to  immediate 
influences  on  how  they  viewed  the  candidates  and 
issues  of  the  campaign.  The  change  in  these  from 
the  end  of  the  1940s  to  the  1950s  explained  the 
landslide  election  of  the  Republican  Dwight 
Eisenhower  after  the  Democratic  rally  of  Harry 
Truman  in  1948. 

This  conceptualization  of  voting  as  essentially 
individual  and  psychological  in  nature  dominated 
empirical  research  on  elections  in  both  the  United 
States  and  the  rest  of  the  world  for  the  next 
30  years.  In  a  modified  form,  it  continues  to  this 
day.  Its  dominance  is  only  partly  explained  by  its 
comprehensive  reach  and  intellectual  power.  It 


was  also  embedded  in  the  design  of  questionnaires 
for  the  mass  election  surveys  now  carried  out  regu¬ 
larly  in  almost  all  the  developed  democracies  of  the 
world.  The  Michigan  authors  themselves  were 
active  in  stimulating  and  organizing  such  surveys 
in  Europe,  from  whence  they  spread  to  Asia,  Latin 
America,  and  Australasia. 

For  the  first  time,  such  surveys  provided  an 
insight  into  the  political  thinking  and  behavior  of 
ordinary  individuals,  complemented  by  the  mas¬ 
sive  increase  in  both  the  number  and  sophistica¬ 
tion  of  public  opinion  polls.  In  addition,  for  the 
first  time,  successive  surveys  provided  information 
about  the  dynamics  of  opinion  change  between 
elections  and  the  effect  this  had  on  party  support. 
Such  over-time  analyses  required  continuity  in  the 
questions  asked  at  each  election,  simply  to  provide 
comparable  information  over  long  periods.  This 
embedded  the  Michigan  School’s  conceptualiza¬ 
tions  and  measures  even  in  studies  that  did  not 
necessarily  share  their  preconceptions  or  that  even 
set  out  to  challenge  them. 

Party  Identification  and 

Voting  Predispositions 

The  major  challenge  was  to  understand  the  con¬ 
cept  of  party  identification  and  whether  it  really 
differed,  methodologically  or  conceptually,  from 
voting  intention.  Michigan-style  surveys  carried 
out  in  Europe  seemed  to  indicate  that  the  two  were 
correlated  and  fluctuated  together.  If  so,  what  was 
the  point  of  distinguishing  the  two? 

One  answer  was  to  generalize  from  the  specific 
concept  of  party  identification  to  the  idea  of 
enduring  predispositions  toward  voting  in  a  cer¬ 
tain  way.  The  majority  of  electors  vote  in  the  same 
way  from  election  to  election.  At  the  same  time, 
some  change  their  vote,  providing  the  dynamics 
for  election  change  and  party  competition.  It  there¬ 
fore  seems  that  both  predispositions  and  specific 
campaign  influences  such  as  issues  and  candidates 
have  to  be  brought  into  the  explanation  of  voting 
choice.  A  group  of  largely  European  scholars  in 
Party  Identification  and  Beyond  (2009)  argued 
that  both  Campbell  and  his  associates  and 
Lazarsfeld  and  his  associates  had  been  trying  to  do 
just  that.  Both  party  identification  (PI)  and  the  IPP 
were  trying  to  measure  predispositions,  while  both 
allowed  for  the  differentiated  impacts  on  strong 
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partisans  and  weaker  ones  of  short-term  campaign 
effects  or  election  cues. 

Issue  Voting  and  Rational  Choice  Theory 

This  recognition  of  short-term  effects  rendered  the 
debate  of  the  1970s  on  the  “replacement”  of  PI  by 
“issue  effects”  from  new  issues,  such  as  Vietnam, 
gender  bias,  and  the  environment,  rather  superflu¬ 
ous.  It  did,  however,  mark  the  emergence  for  the 
first  time  of  purely  theoretical  influences  on  voting 
research  as  distinct  from  the  theory-driven  data 
analysis  that  had  dominated  the  field  up  to  then. 
These  influences  stemmed  from  the  seminal  pre¬ 
sentation  by  Anthony  Downs  of  An  Economic 
Theory  of  Democracy  (1957) — a  collection  of 
many  theories  and  models,  not  generally  reconcil¬ 
able  with  each  other,  of  how  office-seeking  parties 
competed  for  votes  and  electors  cast  votes  for  the 
party  closest  to  them  on  policy. 

The  most  influential  of  Downs’s  models  (because 
it  seemed  to  fit  Anglo-American  politics  so  well) 
was  where  two  parties  competed  with  each  other 
on  policy  in  a  one-dimensional  left-right  space, 
along  which  electors  ranged  themselves  in  a  sym¬ 
metrical  unimodal  distribution,  with  the  central 
peak  coinciding  with  the  median  (middle)  voter 
position.  To  form  a  majority  of  50%  plus  one,  the 
median  vote  was  crucial,  so  both  parties  would 
move  toward  the  more  moderate  position  of  the 
median  voter  to  secure  a  majority.  This  accounted 
for  the  consensual  bargaining  and  policy  compro¬ 
mises  of  the  Anglo-American  democracies,  as  con¬ 
trasted  with  the  immobilisme  and  ideological 
rigidity  of  multiparty  governments. 

Turnout 

Downs’s  two-party  spatial  model  had  an  increas¬ 
ing  influence  on  election  research  from  the  1970s 
onward,  in  line  with  the  growing  dominance 
within  political  science  of  rational  choice  reason¬ 
ing  and  mathematical  modeling.  One  aspect  of  his 
ideas  was  not  closely  pursued,  however — his  dem¬ 
onstration  that  it  was  not  rational  to  vote.  He 
assumed  that  electors  wish  to  consider  their  policy 
preference  and  decide  whether  to  vote  on  the  basis 
of  their  utility  from  this,  discounted  by  the  proba¬ 
bility  of  their  vote  changing  the  election  outcome. 
Since  this  probability  is  always  vanishingly  small, 


nobody  should  vote.  As  the  majority  do  (in  general 
elections  at  least),  other  factors,  principally  a  sense 
of  duty,  have  to  be  drafted  in  as  explanations — 
supported  so  far  as  they  go  by  the  survey  data. 

Economic  Voting 

Rational  choice  theorists  seemed  to  have  little  dif¬ 
ficulty  with  incorporating  duty  or  altruism  into 
their  equations  on  turnout.  On  the  other  hand, 
voting  choice  seemed  potentially  explainable  by 
self-interested,  primarily  economic,  calculations. 
Government’s  ability  to  sustain  prosperity  and 
growth,  increase  employment,  and  avoid  inflation 
were  increasingly  seen  as  the  main  drivers  of  vote. 
Conceptually,  theorists  also  seemed  capable  of 
linking  political  science  more  closely  with  econom¬ 
ics  in  the  study  of  political  economy. 

While  such  analyses  could  be  and  were  carried 
through  at  the  individual  level,  on  survey  data  both 
economic  indicators  and  voting  statistics  were  read¬ 
ily  available  in  aggregated  form.  Where  necessary, 
survey  responses  themselves  could  be  aggregated  to 
give  preference  and  policy  time  series.  As  a  result,  the 
focus  of  empirical  research  shifted — from  surveys 
and  individual-level  analysis  to  the  aggregate  level — 
in  the  1980s.  As  the  major  objective  was  to  explain 
(and  preferably  predict)  the  overall  election  outcome, 
aggregate  equations  and  econometric  methods 
formed  a  more  direct  way  to  approach  this.  Moreover, 
the  general  availability  of  aggregate  indicators  lent  a 
comparative  dimension  to  research — all  govern¬ 
ments  published  them,  whereas  election  surveys  were 
not  always  carried  out  in  the  countries  of  interest. 

Despite  2  decades  of  intensive  analysis,  how¬ 
ever,  little  progress  was  made  toward  the  ultimate 
goal  of  a  unified  theory  of  economic  voting.  A 
review  of  around  2,000  concluded  that  economic 
conditions  produced  contradictory  and  conflicting 
political  results.  The  main  generalization  to  emerge 
from  numerous  studies  over  different  countries 
was  that  governments  generally  lost  votes  at  the 
rate  of  2.2%  of  their  previous  vote.  The  loss  was 
the  cost  of  governing,  expressed  in  economic  fash¬ 
ion  but  hardly  due  to  exclusively  economic  factors. 

Predispositions  and  Cues 
at  the  Aggregate  Level 

One  reaction  to  the  failure  of  economic  trends  to 
produce  consistent  voting  responses  was  to  regard 
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them  as  one  type  of  issue  among  others.  A  compre¬ 
hensive  classification  of  issues  could  be  made  on 
the  basis  of  surveys  and  newspaper  reports,  and  a 
universal  direction  and  weight  can  be  assigned  to 
each  issue  type.  By  seeing  what  issues  were  promi¬ 
nent  in  each  campaign,  the  result  could  be  pre¬ 
dicted  and  compared  with  the  actual  outcome. 

The  idea  that  certain  issues  always  favor  certain 
parties  stemmed  from  a  salience  theory  of  party 
competition  and  elections,  whereby  parties  always 
strove  to  make  their  own  issues  prominent  and 
downgrade  those  of  their  opponents.  However,  the 
number  of  votes  affected  by  issues  prominent  in 
the  campaign  was  always  less  than  the  basic  vote 
of  the  party — the  average  vote  it  received  indepen¬ 
dently  of  issue  effects.  This  vote  was  explained  as 
being  cast  by  core  supporters  or  party  loyalists — 
party  identifiers  or  core  social  group  voters  under 
another  name.  In  this  way,  the  old  survey-based 
frameworks  reemerged  at  aggregate  level  under 
other  names — still  necessary  concepts,  however,  to 
cope  with  the  stability  and  change  simultaneously 
present  in  election  results. 

Ian  Budge  and  Dennis  Farlie’s  attempt  to  explain 
and  predict  elections  in  this  way  remained  an  iso¬ 
lated  venture.  They  used  the  same  logic  in  ad  hoc 
models  designed  to  predict  the  outcome  of  a  general 
election.  In  Britain,  these  took  the  form  of  the  Essex 
model.  This  was  a  multivariate  regression  equation, 
which  allowed  previous  voting  support,  economic 
expectations,  and  tax  levels  to  explain  all  the 
month-to-month  fluctuation  in  voting  intentions 
that  they  could.  Sharp  falls  or  increases  in  voting 
support  were  identified  with  events  that  occurred  at 
that  time  (e.g.,  the  Blair  effect  following  Tony 
Blair’s  election  as  Labor  leader  in  1994).  The  com¬ 
bined  equation,  modified  for  each  election,  was 
used  to  predict  votes  in  the  1997  and  2001  general 
elections  successfully.  This  predictive  success  was 
bought  at  some  cost  in  terms  of  generalizability  and 
explanatory  power  (why  should  Blair  have  this 
effect?).  However,  the  overall  approach  to  predic¬ 
tion  is  itself  generalizable  to  other  contexts. 

The  Macro  Polity:  An  Update  of  the 
American  Voter  at  the  Aggregate  Level 

The  same  approach  involving  econometric  model¬ 
ing  and  aggregate  data  was  used  in  adapting  the 
concerns  of  The  American  Voter  to  modern  times. 


Using  basically  the  same  framework  as  The 
American  Voter,  The  Macro  Polity  (2002),  by 
Robert  Erikson,  Michael  McKuen,  and  James 
Stimson,  models  voting  at  the  aggregate  level  as  the 
effect  of  aggregate  predispositions  to  vote  for  a 
party  and  the  effects  of  issues,  particularly  economic 
issues,  in  changing  votes.  Reflecting  a  growing  rec¬ 
ognition  that  voting  has  to  be  put  in  context,  Macro 
Polity  also  concerns  itself  with  the  end  result  of  the 
process — how  election  outcomes  affect  government 
policy  intentions  and  enacted  policy — and  even 
with  how  these  match  up  with  voter  preferences, 
thus  addressing  the  central  concern  of  democracy.  A 
major  technical  achievement  of  the  project  has  been 
to  devise  a  measure  of  the  (left-right)  policy  mood 
of  the  electorate  from  hundreds  of  different  policy 
questions  asked  in  opinion  polls  over  the  past  50 
years.  This  not  only  provides  a  basis  for  explaining 
voting  behavior  but  also  for  investigating  the  extent 
to  which  public  policy  reflects  popular  preferences. 

The  limitations  of  The  Macro  Policy  are  also 
those  of  its  predecessor — it  focused  on  the  politics 
of  one  country  over  a  limited  period  of  time.  This 
limits  the  number  of  cases  to  12  (postwar  presiden¬ 
tial  elections).  It  also  runs  the  danger  of  tying  its 
investigation  too  much  to  the  specifics  of  U.S. 
politics,  which,  as  The  American  Voter  showed, 
have  their  peculiarities.  Parallel  studies  are  cur¬ 
rently  under  way  in  Britain  and  France.  One  may 
hope  after  40  years  that  they  do  not  stir  up  the 
same  controversy  over  concepts  and  measures  as 
party  identification  did.  So  far,  policy  mood  seems 
to  apply  as  well  in  Europe  as  in  the  United  States. 

Multimotivated  Voting 

When  reading  through  long  lists  of  detailed  and 
variable  findings  from  the  election  studies,  one  is 
sometimes  tempted  to  ask  just  what  the  detailed 
and  expensive  analyses  of  elections  over  the  past  5 
decades  have  shown.  There  is  much  confusion 
about  this.  One  consistent  lesson,  however,  from 
the  earliest  to  the  latest  studies  is  that  voters  are 
multimotivated.  That  is,  they  do  not  vote  on  their 
policy  preferences  alone,  and  some  votes  in  fact 
may  be  cast  on  grounds  unrelated  to  policy.  This 
is  not  solely  due  to  electors  themselves  having  non¬ 
policy  concerns.  To  a  considerable  extent,  the 
institutional  structure  of  general  elections  requires 
them  to  settle  nonpolicy  questions:  Who  is  best  at 
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governing  us?  Which  party  has  the  most  appealing 
and  trustworthy  candidate?  Which  has  best  ridden 
out  the  most  recent  scandal?  All  these  factors  may 
enter  quite  legitimately  into  the  voting  decision. 

The  fact  that  voters  are  multimotivated  puts 
current  rational  choice  models  largely  out  of  court 
so  far  as  explaining  elections  are  concerned.  They 
all  postulate  that  voters  are  purely  policy  orien¬ 
tated.  If  they  are  not,  parties  have  no  need  to  con¬ 
verge,  and  the  median  preference  (or  seeming 
median  preference,  as  the  declared  distribution  of 
votes  cannot  be  unambiguously  interpreted  as  a 
policy  distribution)  loses  its  preeminent  position. 

Representational  Consequences 

Elections  are  at  the  heart  of  the  democratic  repre¬ 
sentational  process  and  are  thus  expected  to  reveal 
and  enforce  popular  preferences.  As  Bingham 
Powell  (2000)  notes,  recognition  of  their  role  has 
prompted  the  broadening  of  election  research 
noted  above,  to  the  extent  that  elections  are 
increasingly  being  studied  in  their  full  representa¬ 
tional  context  as  “instruments  of  democracy.” 

According  to  Joseph  Schumpeter  (1942),  here 
the  recognition  of  voters  as  multimotivated  and  not 
driven  by  pure  policy  concerns  poses  an  acute 
dilemma  for  representative  democracy.  If  elections 
do  not  clearly  reveal  the  popular  will  so  far  as  pub¬ 
lic  policy  is  concerned,  how  can  the  democracy  be 
directed  to  effect  it?  Or  do  we  simply  define  democ¬ 
racy  as  a  struggle  for  power  between  competing 
teams  of  leaders  decided  by  voters  in  elections — 
who  have,  however,  to  put  up  with  whatever  the 
policy  leaders  decide  on  subsequently? 

The  solution  may  be  to  recognize,  with  Powell, 
that  elections  express  the  popular  will  in  two  ways: 

1.  the  proportional  way,  operationalized  in  the 
preference  of  the  median  voter  (even  though  the 
median  position,  which  emerges  from  the 
declared  distribution  of  party  votes,  may  be  a 
bit  unstable  and  subject  to  challenge),  and 

2.  the  majoritarian  way,  in  which  the  popular  will 
is  identified  with  the  policy  preference  of 
plurality  party  voters. 

The  plurality  voter  position  may  also  be  subject 
to  challenge  since  nobody  knows  how  many  votes 
cast  for  the  largest  party  were  actually  cast  on 


nonpolicy  grounds.  In  the  end,  however,  the 
declared  distribution  of  votes  has  to  be  accepted  as 
providing  authoritative  guidance  on  policy  prefer¬ 
ences  as  on  other  matters  as  well.  With  no  way  of 
deciding  between  median  and  plurality  claims  to 
represent  the  popular  will,  researchers  have  to 
accept  both  and  arrange  compromises  between 
them.  How  this  is  done,  however,  is  more  a  matter 
for  research  into  governments  than  into  elections. 

Election  Data 

While  election  research  has  generated  and  tested 
many  theories  of  the  determinants  of  voting 
choice,  its  most  significant  achievement  to  date 
could  well  be  described  as  the  generation  of  vast 
amounts  of  information,  primarily  survey  data, 
but  broadening  out  to  time  series  and  comparative 
collections  of  economic  indicators,  voting  statis¬ 
tics,  government  expenditures  and  policy  indices, 
personnel  and  structures  of  governments,  text- 
based  counts  of  policy  emphases,  campaign  issues, 
manifestos,  and  speeches — the  range  and  quantity 
of  the  information  is  truly  staggering  and  still 
underexplored.  The  combination  of  information  at 
the  several  levels  of  electors,  parties,  governments, 
and  policies  gives  political  scientists  the  ability  to 
settle  many  of  the  unanswered  questions  raised  in 
the  previous  discussion.  If  there  is  not  yet  a  com¬ 
plete  theory  of  democratic  decision  making  and  of 
the  role  of  elections  within  it,  researchers  certainly 
now  have  the  ability  to  formulate  and  check  one 
against  the  material  now  available — not  just  for 
any  one  country  but  for  the  whole  range  of  democ¬ 
racies.  This  is  perhaps  the  major  product  of  elec¬ 
tion  research  over  the  past  50  years,  which  has 
contributed  cumulatively  to  the  understanding  of 
voting  choice  and  of  the  democratic  electorate. 

Ian  Budge 
University  of  Essex 
Colchester,  United  Kingdom 
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Primary  elections — or  preliminary,  earlier  elections — 
can  come  first  both  in  order  and  in  importance. 
They  come  first  in  order  because  selecting  candi¬ 
dates  is  one  of  the  first  things  that  political  parties 
must  do  before  an  election.  They  can  be  first  in 
importance  as  well  because  those  who  are  eventu¬ 
ally  elected  to  office  are  the  successful  candidates 
whom  the  parties  previously  selected;  and  they  are 
the  ones  who  will  determine  what  the  party,  the 
parliament,  and  the  country’s  politics  will  look  like 
until  the  next  elections. 

In  the  study  of  preliminary  elections,  the  unit  of 
analysis  is  the  single  party  in  a  particular  country  at 
a  specific  time.  There  are  very  few  established 
democracies  where  the  legal  system  specifies  criteria 
for  candidate  selection — for  example,  Finland,  Ger¬ 
many,  New  Zealand,  and  Norway  (until  2002) — 
and  only  in  the  United  States  does  the  legal  system 


extensively  regulate  the  primary  elections.  Prelimin¬ 
ary  elections,  or  candidate  selection  methods,  are 
thus  the  nonstandardized  and  predominantly  unreg¬ 
imented  party  mechanisms  by  which  political  par¬ 
ties  choose  their  candidates  for  general  elections. 
The  result  of  this  process  is  the  designation  of  a 
candidate,  or  list  of  candidates,  as  the  candidate(s) 
of  the  party.  The  party  then  becomes  effectively 
committed  to  the  candidate(s)  and  to  mobilizing  its 
strength  behind  the  chosen  Candida te(s). 

This  entry  delineates  the  most  important  distin¬ 
guishing  variable  for  explaining  how  political  par¬ 
ties  choose  their  candidates — the  selectorate,  that 
is,  who  is  allowed  to  take  part  in  the  selection  of 
the  candidates.  Along  with  another  important 
variable,  that  of  candidacy  requirements,  these 
two  criteria  are  akin  to  the  supply  and  demand 
sides  of  candidate  selection.  While  candidacy  nar¬ 
rows  the  supply  of  contestants  who  can  be  selected, 
it  is  the  selectorate  that  decides  who  will  eventually 
face  the  voters  in  the  general  election.  Due  to  the 
scope  of  its  political  consequences,  the  selectorate 
is  the  most  significant  and  far-reaching  explana¬ 
tory  variable  when  it  comes  to  primary  elections. 

The  selectorate  continuum  starts  with  the  most 
inclusive  body — all  voters — and  ends  with  the  most 
exclusive  one — the  single  leader.  Each  part  of  the 
continuum,  and  the  variations  along  it,  is  illus¬ 
trated  by  empirical  examples  from  both  established 
and  new  democracies.  This  entry  begins  by  focus¬ 
ing  on  the  well-known  case  of  primary  elections  in 
the  United  States,  which  reflect  the  most  inclusive 
of  selectorates.  It  then  expands  to  cover  other 
democracies  and  their  use  of  preliminary  elections. 
The  entry  concludes  with  a  short  analysis  of  the 
political  consequences  of  different  selectorates. 

Classifying  Primary  Elections 

Primary  elections  can  be  classified  by  distinguish¬ 
ing  the  five  archetypical  kinds  of  selectorates, 
from  the  most  inclusive  to  the  most  exclusive: 


Voters 

Party 

Party 

Party 

Single 

Members 

Delegates 

Elite 

Leader 

Inclusive  • - 

- •  Exclusive 

Figure  I  Party  Selectorates  in  Primary  Elections 
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1.  The  most  inclusive  selector  ate:  voters.  This 
selectorate  includes  the  entire  electorate  that  has 
the  right  to  vote  in  the  general  elections. 

2.  The  highly  inclusive  selectorate:  party  members. 
This  refers  to  dues-paying  party  membership  in 
its  European  sense,  implemented  and  controlled 
by  the  party  itself,  rather  than  simply 
registration  as  a  party  supporter  as  in  the 
United  States,  which  belongs  to  the  above 
category. 

3.  The  in-between  selectorate:  party  delegates.  This 
selectorate  is  composed  of  representatives 
selected  by  the  party  members.  They  can  be 
members  of  party  agencies  (e.g.,  conventions, 
central  committees,  congresses)  or  delegate 
bodies  that  were  selected  for  this  purpose  alone. 

4.  The  highly  exclusive  selectorate:  the  party  elite. 
This  includes  small  party  agencies  and 
committees  that  were  indirectly  selected  or 
whose  composition  was  ratified  by  wider  party 
agencies  and  other  less  formal  groups. 

5.  The  most  exclusive  selectorate:  a  nominating 
entity  of  a  single  leader. 

Voters 

The  primary  elections  in  the  50  states  of  the  United 
States  provide  most  of  the  examples  on  the  inclu¬ 
sive  end  of  the  continuum.  The  exact  location  of 
American  primaries  depends  on  the  restrictions 
defined  by  the  different  state  laws.  At  the  extreme 
end  are  the  American  nonparty  primary  elections, 
used  in  Louisiana  from  f978  to  select  candidates 
for  Congress.  In  these  primary  elections,  every  reg¬ 
istered  voter  could  vote  for  candidates  from  any 
party.  Blanket  primary  elections — used  in 
Washington  (since  1938),  Alaska  (since  1968),  and 
California  (1998,  2000) — are  also  at  the  extreme 
inclusiveness  pole.  Here,  voters  receive  a  single 
ballot  listing  all  the  candidates  from  all  the  parties 
and  decide,  for  each  post  separately,  which  party 
candidate  to  vote  for.  In  both  the  nonparty  and 
blanket  primaries,  participants  do  not  need  to 
declare  their  party  affiliation  in  order  to  take  part 
in  candidate  selection. 

Open  primary  elections — used  in  Hawaii, 
Michigan,  Vermont,  and  other  states — are  slightly 
less  inclusive  than  the  two  previous  kinds.  As  in 


nonparty  and  blanket  primaries,  the  voters  are 
allowed  to  decide  in  which  primaries  they  want  to 
take  part  without  the  need  to  announce  their  par¬ 
tisan  preference.  Yet,  unlike  these  previous  types, 
voters  can  take  part  in  the  primaries  of  only  one 
party. 

Semiclosed  primary  elections — also  used  by  sev¬ 
eral  states  such  as  Arizona,  Indiana,  and  Rhode 
Island — require  participants  to  declare  their  party 
affiliation  only  on  the  selection  day  and/or  allow 
independents  to  take  part  in  the  party  primary  they 
announce  their  wish  to  vote  in.  Semiclosed  pri¬ 
mary  elections  move  ever  so  slightly  away  from  the 
inclusive  end  of  the  selectorate  continuum  because 
voters  need  to  publicly  affiliate  with  a  political 
party. 

The  inclusive  end  of  the  continuum  also  includes 
examples  from  Iceland,  Taiwan,  and  Spain.  From 
1971  on,  parties  in  Iceland  adopted  primary  elec¬ 
tions  in  some,  and  sometimes  all,  electoral  dis¬ 
tricts,  where  every  citizen  in  a  particular  electoral 
district  could  participate.  In  Taiwan,  the  National 
Party  adopted  inclusive  primary  elections  in  1998. 
The  Catalan  Socialist  Party  in  Spain  opened  its 
candidate  selection  to  “registered  sympathizers” — 
nonmembers  who  could  register  as  party  support¬ 
ers  without  paying  a  membership  fee. 

American  closed  primary  elections,  which 
demand  that  voters  register  according  to  their 
party  affiliation  before  the  day  of  the  primaries, 
are  located  further  away  from  the  inclusive  end. 
The  level  of  inclusiveness  of  the  selectorate  of  the 
Democratic  Progressive  Party  in  Taiwan  (1998- 
2001)  also  places  it  at  this  point.  It  used  a  unique 
weighted  selectorate  that  combined  the  results  of 
both  a  vote  of  the  party  members  and  a  public 
opinion  poll. 

The  open  convention,  which  allows  any  voter  to 
take  part  in  a  candidate  selection  meeting,  is  prob¬ 
ably  a  less  inclusive  version  of  the  most  inclusive 
family  of  selectorates.  While  it  allows  any  voter  to 
take  part  in  a  selection  meeting  without  the  need 
to  prove  party  affiliation  or  even  to  preregister,  it 
is  still  quite  a  demanding  system  as  it  requires  the 
voter  to  attend  a  meeting  at  a  certain  date,  time, 
and  location.  This  kind  of  selection — which  is 
similar  to  the  American  caucus — was  used  in  the 
past  in  Canada,  in  the  1920s  to  1950s  period, 
before  the  parties  began  institutionalizing  their 
membership. 
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Party  Members 

This  discussion  now  moves  into  the  party  mem¬ 
bers’  zone.  At  its  inclusive  end,  it  comes  close  to 
including  all  the  voters — as  in  the  case  of  the 
Dutch  Democrats  66,  which  allowed  party  mem¬ 
bers  to  vote  via  postal  ballots  but  then  held  meet¬ 
ings  where  both  party  members  and  all  voters 
could  participate.  Another  example  is  the  Argentine 
Peronist  and  Radical  parties,  which  in  some  dis¬ 
tricts  (1983-2001)  allowed  party  members  and 
independents  to  participate  in  their  primaries. 

In  the  middle  of  the  party  members’  zone  is  the 
typical  European  closed  primary  elections,  which — 
as  opposed  to  American  closed  primary  elections — 
usually  mean  “party  primaries”  in  which  the 
selectors  are  party  members,  not  just  supporters. 
From  this  point  on  in  the  selectorate  continuum, 
mere  party  supporters  are  excluded.  The  purest 
type  of  party  primary  is  where  the  party  mem¬ 
bers’  votes  alone  determine  the  candidates.  A 
number  of  parties  have  used  such  party  primaries, 
albeit  not  consistently  over  time  and  not  necessar¬ 
ily  in  all  the  districts.  Among  them  are  the  follow¬ 
ing  examples:  the  Australian  Labor  Party;  the 
Belgian  ECOLO;  the  German  Social  Democrats, 
Christian  Democrats,  and  Greens  at  the  single¬ 
member  district  level;  the  Icelandic  Independence, 
Social  Democratic,  Progressive  and  People’s 
Alliance  parties;  the  Israeli  Labor,  Likud,  and 
Kadima  parties;  and  the  Mexican  Institutional 
Revolutionary  and  the  Democratic  Revolution 
parties. 

When  party  members  have  a  dominant  or  sig¬ 
nificant  role  in  candidate  selection — but  are  not 
the  sole  selectors  and  other,  more  exclusive  party 
actors  take  part  in  the  selection  of  candidates — the 
selectorate  remains  in  the  party  members’  zone  but 
is  moving  toward  the  party  delegates  area.  For 
example,  in  several  of  the  Danish  parties,  from  the 
1970s  until  recently,  central  party  agencies  could 
veto  or  change  the  selection  made  by  the  party 
members.  In  Finland,  the  election  law  states  that 
party  organizations  have  the  right  to  change  up  to 
one  fourth  of  the  candidates  selected  by  the  mem¬ 
bers’  vote.  In  Canada,  national  party  leaders  have 
some  power  over  candidate  selection,  although 
they  usually  refrain  from  exercising  it,  while  in 
Ireland  the  national  party  leadership  kept  and  even 
enhanced  its  power  in  those  parties  that  adopted  a 
membership  ballot. 


Cases  where  party  agencies  first  filter  the  candi¬ 
dates  who  are  then  put  to  a  membership  vote  are 
close  to  the  middle  between  the  party  members  and 
the  party  delegates  zones.  If  the  screening  process 
still  leaves  a  large  and  viable  pool  of  candidates 
from  whom  the  party  members  can  make  the  final 
decision,  then  the  selectorate  is  still  on  the  party 
members’  side.  Since  the  mid-1990s,  the  Israeli 
Meretz  party  has  produced  a  sizeable  “panel”  of 
candidates  from  which  the  members  choose  the 
final  list.  The  Social  Democrats  and  the  Liberals  in 
Britain  did  much  the  same,  as  did  several  of  the  par¬ 
ties  in  Belgium  in  some  districts,  particularly  the 
Belgian  Socialist  Party  during  the  1960s. 

When  equal  weight  is  given  to  the  party  delegates 
and  to  the  party  members,  the  selectorate  is  in  the 
middle  between  these  two  categories.  In  Taiwan,  in 
1995-1996,  the  Democratic  Progressive  Party  used 
a  weighted  method  in  which  the  vote  of  the  mem¬ 
bers  was  equally  weighed  to  the  vote  of  the  party 
representatives.  British  Labour’s  use  of  a  multistage 
method  since  1997 — where  candidates  are  screened 
by  party  agencies,  selected  by  party  members,  and 
can  then  be  vetoed  by  the  National  Executive 
Committee — can  also  be  seen  as  a  middle-of-the- 
road  example.  The  same  seems  to  be  the  case  for 
the  Democratic  Party  of  Botswana  in  2002,  where 
party  members  selected  candidates  after  a  national 
party  agency  screening,  and  for  both  the  Labor 
(1960-1964)  and  the  Pacifist  Socialist  (1957-1973) 
parties  of  the  Netherlands,  which  allowed  their 
national  executives  to  propose  the  list  of  candidates 
but  let  the  members  then  vote  and  alter  both  the 
rank  and  the  composition  of  these  lists. 

It  should  be  noted  that  the  party  members’ 
selectorate  can  be  further  distinguished  according 
to  the  restrictions  on  party  membership.  For  exam¬ 
ple,  one  rule  that  could  restrict  membership — or 
the  right  to  participate  in  primary  elections — is  the 
rate  of  membership  dues.  Members’  participation 
may  also  be  restricted  by  the  requirement  of  a 
minimal  party  membership  period  prior  to  candi¬ 
date  selection,  proof  of  party  activity,  and  so  on. 

Party  Delegates 

When  the  party  members  have  less  of  an  impact 
than  selected  party  delegates,  the  selectorate  is  still 
located  between  these  two  zones  but  is  closer  to 
the  latter.  Here,  for  example,  the  members  can 
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ratify  or  reject  a  list  of  candidates  drawn  up  by  the 
party  agency,  as  was  done  by  the  French  Socialist 
Party  in  1986.  Since  the  1980s,  the  British  Con¬ 
servatives  use  a  multistage  method,  which  starts 
with  a  screening  by  a  nonselected  national  party 
agency,  followed  by  a  locally  selected  party  agency 
screening,  and  finally  a  party  members’  selection 
meeting. 

The  selectorate  moves  slightly  closer  to  exclusiv¬ 
ity  when  the  candidates  are  produced  by  a  wide 
delegate  convention,  as  was  the  case  in  Ireland  with 
Fianna  Fail,  Fine  Gael  (until  it  adopted  member¬ 
ship  ballots  in  the  1990s),  and  Labour.  Another 
example  is  selection  by  a  party  agency  that  might 
be  followed  (or  preceded)  by  a  membership  ballot, 
as  implemented  by  the  Swedish  Communist/Left 
Party. 

When  the  selectorate  is  an  agency  of  the  party, 
the  selectorate  is  in  the  middle  of  the  selectorate 
continuum.  Inside  the  party,  the  relative  size  of 
each  agency  is  a  sign  of  its  inclusiveness:  Con¬ 
ventions  are  usually  larger  than  central  commit¬ 
tees,  which  in  turn  are  usually  larger  than  executive 
bodies,  such  as  bureaus.  As  the  size  of  the  particu¬ 
lar  party  agency  gets  smaller,  the  selectorate  moves 
toward  the  exclusive  pole  of  the  continuum.  The 
terminology  used  in  each  country  is  not  necessarily 
equivalent,  and  hence  one  must  be  cautious  when 
inferring  the  extent  of  inclusiveness  based  solely  on 
what  a  particular  party  calls  a  specific  agency.  The 
use  of  party  delegates  is  widespread.  Since  the 
1950s,  the  major  German  parties  have  used  dele¬ 
gate  conventions  at  the  single-member  district 
level.  This  was  also  the  typical  selectorate  in 
Australia.  Several  Israeli  parties  use  their  central 
committees  to  select  their  candidates.  This  was  also 
one  of  the  methods  used  by  the  Canadian  parties 
60  years  ago  (1920-1950),  which  have  since 
adopted  the  more  inclusive  membership  ballot. 

When  a  nonselected  party  agency — such  as  a 
nominating  committee — has  an  influence  on  the 
selection  of  candidates  alongside  the  selected  party 
agency,  the  selectorate  moves  toward  more  inclu¬ 
siveness  but  remains  within  the  party  agency  zone. 
One  example  is  the  assorted  system  used  by  both 
the  National  Liberation  and  the  United  Social 
Christian  parties  in  Costa  Rica,  where  most  candi¬ 
dates  were  chosen  by  a  selected  party  agency,  but 
several  were  nominated  by  the  party  president. 
There  are  many  examples  of  both  directly  and 


indirectly  selected  party  agencies  taking  part  in 
candidate  selection,  as  in  the  Austrian  Socialist 
Party  (1945-1990);  the  Dutch  Christian  Democrats 
(1986),  Christian  Historical  Union  (1960-1979), 
and  Radical  Political  Party  (1973-1989);  and  the 
British  Conservatives  from  the  1950s  to  the  1970s 
and  British  Labour  from  the  1950s  until  1987.  Yet 
another  example  is  the  New  Zealand  Labour 
Party,  which  has  used  a  method  since  the  1950s 
where  both  nominated  and  selected  delegates 
choose  the  candidates. 

The  selectorate  is  between  the  party  delegates 
and  the  party  elite  when  there  is  a  relative  balance 
of  power  between  the  selected  and  the  nonselected 
party  agencies.  This  was  the  case  with  the  methods 
used  in  the  1980s  in  the  Union  for  French 
Democracy,  the  Dutch  People’s  Party  for  Freedom 
and  Democracy  from  the  1960s  to  the  1990s,  and 
the  Spanish  Socialists  from  1979  to  1998.  The 
Norwegian  parties,  in  most  cases,  from  the  1920s 
and  until  2002,  followed  such  a  system  by  virtue 
of  a  law  that  allocated  funding  to  political  parties 
that  selected  their  candidates  in  a  (selected)  dele¬ 
gate  convention  at  the  level  of  the  multimember 
constituency.  Candidate  selection  at  the  lander 
level  in  Germany  also  belongs  here — although 
party  delegates  are  the  final  decision  makers  at  this 
level,  the  selection  is  made  on  the  basis  of  a  recom¬ 
mended  list  that  is  designed  by  the  land  party  elite. 

Party  Elite 

This  category  includes  nominating  committees 
that  are  formed  for  the  sole  purpose  of  selecting 
the  party’s  candidates  as  well  as  nonselected  party 
agencies  (typically  small  executive  boards)  that  are 
entrusted  with  several  different  tasks,  including  the 
selection  of  candidates.  The  composition  of  both 
nominating  committees  and  nonselected  party 
agencies  can  be  regarded  as  slightly  more  inclusive 
if  it  is  indirectly  selected  or  somewhat  more  exclu¬ 
sive  if  it  is  not. 

When  the  selection  power  of  the  party  elite  is 
stronger  than  that  of  the  selected  party  agency,  the 
selectorate  moves  into  the  party  elite  zone  and  is 
within  the  exclusive  part  of  the  continuum.  There 
are  cases  of  methods  that  involve  various  selected 
and  nonselected  (or  highly  indirectly  selected) 
party  agencies,  with  more  influence  given  to  the 
latter.  For  example,  the  French  Rally  for  the 
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Republic  in  the  1980s  allowed  a  nominating  com¬ 
mittee  to  select  its  candidates  but  afforded  some 
influence  to  other  directly  and  indirectly  selected 
party  agencies.  The  Italian  Communist  Party  from 
1956  to  1986  allowed  for  the  involvement  of  a 
selected  party  agency,  but  the  final  word  on  candi¬ 
date  selection  was  given  to  an  indirectly  selected 
party  agency.  The  Chilean  Party  for  Democracy 
and  the  National  Renovation  Party  allowed  their 
national  councils  to  be  involved,  but  the  final  deci¬ 
sion  was  made — because  of  the  constraints  set  by 
the  binominal  electoral  system — by  negotiations 
between  the  party  leaders.  When,  for  example,  the 
selected  party  agency  is  only  asked  to  ratify  a  deci¬ 
sion  made  by  the  nominating  committee,  the  selec- 
torate  is  still  leaning  slightly  toward  the  inclusive 
side  of  the  party  elite  zone.  This  was  the  case  in  the 
Japanese  Liberal  Democratic  Party  in  the  period 
between  1950  and  1990. 

When  only  a  nonselected  party  agency/group  is 
involved  in  candidate  selection,  then  the  selector- 
ate  is  squarely  in  the  middle  of  the  party  elite  zone. 
This  was  the  case  in  the  Venezuelan  Democratic 
Action  Party  in  the  1990s;  the  Italian  Christian 
Democrats  from  1957  to  1984;  the  Mexican  Insti¬ 
tutional  Revolution  Party  before  2000;  the  Chilean 
Independent  Democratic  Union  from  1989  to 
2001;  in  a  significant  number  of  constituencies  in 
several  of  the  main  Belgian  parties,  especially  since 
1968;  the  Indian  Congress  Party  in  the  1950s  and 
1960s;  and  the  Danish  Peoples  Party  in  1998. 

The  selectorate  moves  toward  the  single-leader 
pole  with  such  exclusive  selectorates  as  a  gathering 
of  the  party  founders  in  a  new  party  or  an  informal 
group  of  factional  leaders  in  older  parties.  Israel’s 
ultrareligious  parties  serve  as  an  example  of  such 
highly  exclusive  selectorates. 

Single  Leader 

The  extreme  end  of  the  exclusive  pole  is  defined  by 
a  selectorate  composed  of  a  single  individual.  If  the 
leader  does  not  have  complete  control  over  candi¬ 
date  selection,  then  the  selectorate  is  close  to  the 
exclusive  end  of  the  selectorate  continuum  but  not 
at  its  pole.  In  Forza  Italia,  in  the  1990s,  the  found¬ 
ing  leader  Silvio  Berlusconi  chose  the  candidates  in 
cooperation  with  the  party’s  regional  coordina¬ 
tors.  The  French  National  Front  leader,  Jean- 
Marie  Le  Pen,  together  with  the  party’s  general 


secretary,  chose  the  party’s  candidates,  with  some 
influence  given  to  nonselected  forces.  Winston 
Peters,  New  Zealand  First’s  founder,  was  also 
given  almost  complete  control  over  candidate 
selection. 

Like  the  party  elite,  a  single  leader  will  lean 
more  toward  the  inclusive  side  if  the  leader  is 
selected  and  more  toward  the  exclusive  pole  if  the 
leader  is  not  a  selected  leader.  In  2005,  Israeli 
Prime  Minister  Ariel  Sharon  quit  his  party  (Likud) 
and  formed  a  new  one  (Kadima).  Since  he  also  quit 
his  position  as  prime  minister  and  called  for  early 
elections,  time  constraints  and  a  new  party  allowed 
him  to  choose  the  list  of  candidates. 

Political  Consequences 

Different  levels  of  selectorate  inclusiveness  pro¬ 
duce  different  political  consequences.  For  example, 
in  terms  of  quantity,  the  more  inclusive  selector¬ 
ates  are  the  more  participatory  ones.  In  terms  of 
quality,  though,  the  picture  becomes  less  clear 
because  most  of  those  who  do  join  parties  do  not 
participate  in  their  party’s  candidate  selection  pro¬ 
cess  and  are  not  affiliated  for  more  than  a  short 
period. 

Smaller,  exclusive  selectorates  are  able  to  bal¬ 
ance  representation,  which  is  more  difficult  when 
the  aggregation  of  a  large  number  of  votes  pro¬ 
duces  the  party  candidates.  Party  agencies  are 
actually  more  competitive  than  primaries  because 
of  the  shorter  “distance”  between  the  candidates 
and  their  selectors. 

Inclusiveness  also  influences  responsiveness. 
Legislators  selected  by  a  small  selectorate  owe 
their  positions  to  that  selectorate  and  are  likely  to 
be  party  players.  Legislators  selected  in  primaries 
need  to  reach  a  massive,  fluid  audience  and  will 
behave  more  like  individuals  than  team  players. 

Conclusion 

The  consequences  of  primary  elections  can  be 
more  or  less  significant  as  a  result  of  the  character¬ 
istics  of  a  particular  nation’s  politics.  For  example, 
in  countries  with  single-member  districts,  if  the 
number  of  safe  seats  is  either  large  or  growing  then 
the  selection  process  of  the  winning  party  could  be 
more  decisive  than  the  election  itself.  In  both  the 
United  States  and  Great  Britain,  more  than  one 
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half  of  the  constituencies  are  safe  for  one  party  or 
the  other,  with  majorities  of  greater  than  10%, 
which  means  that  the  effective  choice  of  who  will 
become  a  legislator  is  made  not  by  the  voters  in  the 
general  election  but  by  the  selectorate  in  the  pri¬ 
mary  elections.  Moreover,  even  in  marginal  seats, 
the  correct  choice  of  a  candidate  by  a  party  could 
make  the  decisive  difference  between  winning  the 
seat  and  losing  it.  In  countries  using  proportional 
electoral  systems,  the  selection  of  candidates  at  the 
top  of  a  party  list  can  virtually  guarantee  election, 
particularly  in  the  major  parties,  practically  regard¬ 
less  of  the  results  of  the  general  election.  In  short, 
in  the  majority  of  democratic  nations,  in  a  major¬ 
ity  of  the  parties,  the  preliminary  selection  inside 
the  party  is  as  important  as  the  subsequent  election 
by  the  general  population. 
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Elections,  Volatility 


A  recurring  exercise  in  the  field  of  electoral  behav¬ 
ior  is  assessing  to  what  extent  parties’  share  of  the 
vote  and  voter  choice  remain  stable  over  time. 
Voters  may  remain  loyal  to  the  same  party  they 
voted  for  in  the  previous  election  or  change  pref¬ 
erence  by  voting  for  another  party,  abstaining,  or 
spoiling  their  ballots.  A  significant  number  of 
studies  have  been  devoted  to  understanding  the 
dynamic  of  such  changes  in  contemporary  democ¬ 
racies.  The  change  of  voters  from  one  party  to 
another  in  the  election  is  called  election  volatility. 

Possible  changes  in  voters’  preferences  over  time 
may  be  measured  on  the  individual  level  or  on  the 
aggregate  level  of  the  electorate.  A  voter  may 
change  preferences  during  an  election  campaign, 
or  over  a  longer  period  involving  at  least  two  elec¬ 
tions.  The  traditional  way  of  identifying  the  level 
of  permanence/change  in  electoral  preference  on 
the  individual  level  is  quite  simply  asking  voters. 
For  this  reason,  election  surveys  are  the  fundamen¬ 
tal  source  for  picking  up  permanence/change  in 
preference  on  a  microlevel. 

For  electoral  studies  on  the  aggregate  level,  the 
source  of  data  is  the  final  result  of  the  race  (parties’ 
vote,  turnout,  and  spoiled  votes)  for  a  certain  elec¬ 
toral  unit  (national  or  subnational)  in  at  least  two 
elections.  Easy  access  to  election  results  throughout 
the  world  may  be  the  main  reason  why  studies  about 
electoral  changes  on  the  macrolevel  have  been  con¬ 
ducted  with  such  frequency  over  the  past  few  years. 
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Many  indices  have  been  proposed  to  measure 
permanence/change  in  electoral  preferences.  The 
most  fortunate,  and  one  widely  used  at  present  in 
studies  on  comparative  party  systems,  is  the  volatil¬ 
ity  index,  created  by  Mogens  Pedersen  (1979, 1980). 
The  term  volatility  first  appeared  in  chemistry  and 
refers  to  the  tendency  of  a  substance  to  vaporize.  It 
has  also  been  defined  as  a  measure  of  how  readily  a 
substance  vaporizes.  The  notion  of  a  change  in 
physical  state  captures  the  fundamental  idea  of  the 
index,  which  is  to  quantify  the  intensity  of  change  in 
party  preferences  when  two  elections  are  compared. 

The  Index 

The  volatility  index  (or  total  volatility,  TV)  mea¬ 
sures  the  level  of  aggregate  electoral  change  over 
two  consecutive  elections.  Although  the  index  is 
traditionally  calculated  for  the  total  number  of  votes 
received  by  parties,  it  can  also  be  calculated  includ¬ 
ing  total  turnout  and  spoiled  votes.  Its  formula  is 

TV  =  j(\PaV\  +  \PbV\  +  \PcV\), 

where  V  represents  the  percentage  difference  of 
votes  (or  seats)  for  each  party  in  two  consecutive 
elections. 

The  volatility  index  is  calculated  as  follows: 
The  percentage  of  votes  (or  seats)  that  a  party 
received  in  an  election  is  subtracted  from  the  per¬ 
centage  of  votes  obtained  by  this  same  party  in  the 
preceding  election;  the  difference  indicates  the 
change,  and  the  ( — )  or  ( + )  signs  reveal  the  decline 
or  growth  of  a  party,  respectively.  The  next  step  is 
to  add  the  result  of  this  operation  (not  considering 
the  sign)  and  divide  by  two.  The  value  of  the  index 
expresses  the  total  number  of  votes  lost  by  the  par¬ 
ties  whose  votes  fell  from  one  election  to  the  next 
or  the  total  number  of  votes  gained  by  the  parties 
whose  votes  grew  in  the  same  period.  For  exam¬ 
ple,  a  volatility  index  of  10.0  indicates  that  the 
combined  share  of  the  vote  of  all  the  parties  whose 
vote  fell  was  10  percentage  points  and,  conse¬ 
quently,  that  the  parties  whose  vote  grew,  taken 
together,  increased  their  share  of  the  vote  by  the 
same  figure. 

Theoretically  the  index  can  vary  from  0  to  100, 
although  these  values  could  only  occur  in  empiri¬ 
cally  improbable  cases.  The  index  is  zero  when  the 


parties  have  identical  percentages  of  votes  (or  seats) 
in  two  successive  elections.  The  index  is  100  when 
a  radical  revolution  in  voters’  preferences  takes 
place  between  two  elections:  The  parties  that  ran  in 
Election  1  receive  no  votes  in  Election  2. 

One  must  be  careful  on  two  counts  when  inter¬ 
preting  the  volatility  index.  The  first  is  that  even  in 
a  situation  in  which  there  are  no  changes  in  voters’ 
preferences  there  may  be  a  certain  degree  of  volatil¬ 
ity.  The  reason  is  simple:  New  voters  may  have  a 
voting  pattern  that  is  different  from  that  of  tradi¬ 
tional  voters.  In  such  a  situation,  volatility  is  derived 
from  the  difference  between  the  votes  cast  by  new 
voters  and  those  that  have  been  cast  by  voters  who 
have  since  died  (see  Bartolini  &  Mair,  1990,  p.  21). 

The  second  count  is  regarding  the  relation 
between  changes  in  the  sphere  of  individual  prefer¬ 
ences  and  aggregate  volatility.  A  party  might  main¬ 
tain  the  same  percentage  of  votes  in  two  consecutive 
elections  on  aggregate  but  not  on  the  microlevel.  For 
example,  imagine  that  two  parties  (A  and  B)  have 
each  received  50%  of  the  votes  in  two  consecutive 
elections  but  that  all  the  voters  of  A  in  Election  1 
voted  for  B  in  Election  2  and  vice  versa.  In  spite  of 
the  fact  that  all  the  voters  changed  their  vote  on  the 
microlevel,  total  volatility  was  equal  to  zero. 

Scott  Mainwaring  and  Edurne  Zoco  (2007)  car¬ 
ried  out  a  wide-ranging  comparative  study  on  the 
subject  of  electoral  volatility  in  47  countries  in  the 
post-1945  period.  The  mean  values  found  for  each 
country  varied  significantly:  The  lowest  volatility 
found  was  that  of  the  United  States  (3.3),  while  the 
highest  was  for  Ukraine  (59.2).  The  mean  volatility 
of  the  countries  studied  is  23.2;  the  median  is  17.0. 
The  authors  investigated  whether  the  variation  in 
volatility  is  associated  with  a  set  of  variables.  Two 
of  them  have  some  statistically  significant  associa¬ 
tion  with  volatility:  fragmentation  of  the  party  sys¬ 
tem  and  duration  of  the  democratic  regime.  They 
argue  that  fragmentation  is  positively  associated 
with  volatility,  while  duration  of  the  democratic 
regime  is  negatively  associated  with  it.  Put  differ¬ 
ently,  countries  with  low  fragmentation  and  a 
democracy  of  long  duration  would  tend  to  have  less 
volatile  party  systems  (Mainwaring  &  Zoco,  2007, 
pp.  164-165). 

Bloc  and  Intrabloc  Volatility 

Stefano  Bartolini  and  Peter  Mair  (1990)  pro¬ 
posed  the  calculation  of  bloc  volatility  (BV)  and 
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intrabloc  volatility  (IV).  Unlike  the  total  volatil¬ 
ity  index,  which  is  based  on  gains  and  losses  on 
the  part  of  individual  parties,  BV  and  IV  take  as 
their  starting  point  a  set  of  parties,  such  as  gov¬ 
ernment  parties  X  opposition  parties,  right  X 
left,  religious  parties  X  secular  parties,  and  so  on. 
Bloc  volatility  has  the  following  mathematical 
expression: 

BV  =  ^£p(aV  +  bV  +  cV) 

+  £P(dV  +  eV  +  /V), 

where  P(aV  +  bV  +  cV)  represents  the  percent¬ 
age  of  votes  gained  (or  lost)  by  the  parties  of  this 
bloc  in  relation  to  the  bloc  P(dV  +  eV  +  /V). 
Bloc  volatility  is  part  of  the  total  volatility  and 
evinces  the  exchange  that  occurs  between  the  two 
blocs. 

Intrabloc  volatility  (IV)  is  equal  to  the  difference 
between  the  total  volatility  and  the  bloc  volatility. 
Hence, 

TV  =  BV  +  IV. 

Intrabloc  volatility  can  be  measured  by  the  sum 
total,  for  each  bloc,  of  the  parties  with  an  algebraic 
sign  different  from  that  of  the  bloc  as  a  whole.  For 
example,  let  us  imagine  two  blocs  with  three  par¬ 
ties  and  the  following  individual  volatilities:  (  +  6, 
—2,  —1),  (—5,  —2,  +4)  and  TV  =  10.  In  this  case, 
BV  and  IV  would  be,  respectively, 


and 

IV  =  |  — 2|  +  |-1|  +  |+4|  -  7(10  -  3). 

An  Index  in  Search  of  a  Concept? 

There  is  not  much  controversy  about  the  capacity 
of  the  volatility  index  to  detect  changes  in  the 
party  system  on  the  aggregate  level.  The  index  is 
good  both  for  diachronic  comparisons  within  a 
country  and  for  comparisons  between  countries. 
Hence,  it  can  be  said  that  a  country  has  a  higher 
volatility  index  than  another  or  that  volatility  in  a 
certain  country  fell,  for  example. 


As  Gary  Goertz  (2005)  notes,  a  classic  effort  of 
the  social  sciences  is  producing  indices  that  may 
make  certain  concepts  operational  in  the  best  way 
possible.  But  the  incorporation  of  the  term  volatil¬ 
ity  into  political  science  did  not  follow  this  path. 
The  term  began  to  be  used  to  designate  a  recently 
created  index.  In  other  words,  the  term  volatility  is 
more  associated  with  a  measurement  than  with  a 
concept.  This  brand  of  origin  becomes  clear  when 
one  observes  the  definition  of  volatility,  which  may 
be  found  in  several  texts  on  the  theme:  Volatility  is 
the  net  change  within  the  electoral  party  system 
resulting  from  individual  vote  transfers. 

Bartolini  and  Mair  (1990)  have  pointed  out  the 
conceptual  fragility  of  the  volatility  index: 

A  variety  of  different  theoretical  meanings  have 
recently  been  attributed  to  the  concept  and 
measure  of  electoral  volatility.  For  example, 
evidence  of  volatility  has  been  read  as  an  indicator 
of  the  “integration”  or  “catchall”  nature  of  party 
support,  as  a  symptom  of  realignment  and/or 
dealignment;  as  an  indicator  of  cleavage  strength 
and  persistence;  as  expressing  an  “issue-effect” 
vis-a-vis  the  basic  vote  of  parties,  and  so  on. 
(pp.  23-24) 

For  this  reason,  the  volatility  index  has  been  used 
to  measure  different  political  processes:  electoral 
dealignment/alignment  (Raul  Madrid,  2005;  Peder¬ 
sen,  1980),  the  freezing  of  political  cleavages 
(Bartolini  &  Mair,  1990),  and  the  institutionaliza¬ 
tion  of  the  party  system  (Michelle  Kuenzi  &  Gina 
Lambright,  2001;  Mainwaring  &  Zoco,  2007). 

This  conceptual  ambiguity,  added  to  some  other 
difficulties  inherent  to  any  interval  measurement — 
when  used  for  the  analysis  of  institutions — has  left 
open  a  series  of  questions.  What  level  of  volatility 
would  allow  one  to  consider  a  party  system  to  be 
institutionalized  (or  not)?  How  should  one  inter¬ 
pret  low  volatility  in  a  party  system  with  low  insti¬ 
tutionalization?  And,  conversely,  how  should  one 
interpret  high  volatility  in  a  consolidated  party 
system?  In  substance,  what  does  it  mean  to  say,  for 
instance,  that  the  average  volatility  for  Colombia 
is  12.5,  while  that  of  Canada  is  11.9? 

In  spite  of  the  limits  pointed  out,  the  volatility 
index  is  one  of  the  most  widely  known  and  used 
indicator  in  political  science.  It  has  helped  in  iden¬ 
tifying  tendencies  in  longitudinal  studies  of  a  single 
country  and  contributed  to  the  effort  of  comparing 
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several  party  systems — a  rare  success  story  for  an 

indicator. 
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Electoral  Behavior 


In  representative  democracies,  voting  is  one  of  the 
fundamental  acts  that  allow  citizens  to  express 
their  belonging  to  a  political  community  and  to 
make  decisions  about  their  own  future.  Analyzing 
electoral  behavior  is  one  of  the  oldest  and  most 
productive  domains  of  research  in  political  sci¬ 
ence.  Research  on  electoral  behavior  contributes 
to  our  understanding  of  how  democratic  systems 
function  and  develop  over  time.  It  does  this  by 


analyzing  the  answers  to  the  following  questions: 
Whom  do  people  vote  for?  Why  do  they  do  so? 
What  do  they  want  to  achieve  by  doing  so? 

This  research  has  developed  over  many  years 
using  three  main  approaches:  sociological,  psycho- 
sociological,  and  rational  choice.  It  should  be 
remembered  nonetheless,  in  France  particularly, 
that  the  first  studies  concentrated  on  electoral  geog¬ 
raphy  and  were  based  on  analysis  of  aggregate  elec¬ 
toral  data  at  different  geographical  levels  and  that 
this  subdiscipline  continues  to  be  of  great  interest. 

The  sociological  approach  endeavors  to  explain 
the  electoral  behavior  of  individuals  according  to 
their  position  in  society  and  their  belonging  to 
particular  social  groups.  It  studies  the  social  deter¬ 
minants  of  voting  behavior.  The  psychosociologi- 
cal  approach  studies  the  individual  attitudes  of 
voters  on  which  they  base  their  electoral  choices. 
The  rational  choice  approach  developed  from  eco¬ 
nomic  theories  of  rationality.  Voters  decide  which 
way  to  vote  using  a  cost-benefit  calculation, 
depending  on  their  preferences  and  what  the  dif¬ 
ferent  parties  and  candidates  have  to  offer.  These 
approaches  are  discussed  in  detail  as  follows. 

The  Sociological  Approach 

Based  on  Social  Cleavages 

Sociological  research  on  voting  behavior  has  been 
developed  since  the  1950s  as  a  result  of  the  increas¬ 
ing  presence  of  major  opinion  poll  surveys  based 
on  large  samples  of  voters.  These  surveys  allowed 
a  great  deal  of  individual  data  to  be  gathered  and 
submitted  to  complex  quantitative  processing.  In 
many  countries,  strong  statistical  relationships 
appeared  between  the  social  characteristics  of  indi¬ 
viduals  and  the  way  they  voted.  Among  the  main 
social  variables  used  and  whose  effect  has  been 
measured,  class  and  religious  belonging  were  ini¬ 
tially  the  most  frequently  analyzed,  and  they  also 
appeared  to  be  the  factors  that  had  the  most  deter¬ 
mining  effect. 

Seymour  Lipset  and  Stein  Rokkan’s  theory  of 
cleavages  looked  at  the  history  of  European  societ¬ 
ies  to  identify  the  main  stable  social  cleavages  at 
the  roots  of  political  and  partisan  cleavages.  They 
exerted  a  huge  influence  on  research  into  voting 
behavior  by  showing  the  profound  and  lasting 
nature  of  the  major  social  determinants  of  voting. 
Their  research  has  provided  solid  results  on  this 
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question.  The  study  of  class  voting  was  thus  well 
developed,  as  was  the  study  of  the  relationship 
between  religious  belonging  and  voting  behavior. 
Thus,  for  example,  while  class  constituted  the 
most  important  electoral  cleavage  in  Great  Britain, 
religion  seemed  to  be  the  variable  with  the  greatest 
impact  in  both  Germany  and  France,  though  class 
also  played  a  role  but  to  a  lesser  extent. 

More  recently,  this  approach  has  been  criticized 
in  several  ways.  It  was  observed  that  the  ability  of 
these  two  variables  to  explain  voting  behavior  was 
becoming  weaker  in  accordance  with  a  long-term 
trend.  Thus,  the  permanence  of  class  voting,  par¬ 
ticularly  in  Great  Britain,  but  also  more  broadly 
speaking  in  Europe,  became  debatable  as  a  result 
of  analyses  that  showed  an  underlying  decrease  of 
the  left-wing  working-class  vote  and  the  right- 
wing  middle-class  vote.  Many  explanations  for  the 
decline  of  class  voting  have  been  suggested.  Some 
of  these  point  to  changes  in  social  structures  in 
Western  societies  and  to  both  the  decrease  and 
fragmentation  of  the  working  class.  Others  stress 
the  individualization  of  these  societies  and  the 
increasing  autonomy  of  individuals  in  relation  to 
the  groups  they  belong  to.  This  hypothesis  is  also 
suggested  for  the  religious  factor,  as  European 
societies  have  become  increasingly  secular. 

Others  again  stress  changes  in  the  political  par¬ 
ties  themselves  rather  than  changes  among  the 
electorate.  According  to  this  hypothesis,  it  is  no 
longer  in  the  interest  of  the  parties  to  present  them¬ 
selves  as  “class-based  parties”  for  the  purpose  of 
winning  an  election.  This  has  contributed  to  weak¬ 
ening  the  coherence  of  the  different  classes  as  self- 
conscious  collective  actors.  As  a  response  to  these 
criticisms,  other  researchers  have  relativized  or 
denied  the  “end  of  class  voting.”  According  to 
them,  current  surveys  show  that  the  decrease  of 
class  voting  does  not  imply  its  disappearance.  The 
operationalizations  designed  to  measure  it  have 
also  been  criticized.  It  is  argued  that  the  use  of 
more  detailed  and  more  complex  social  and  occu¬ 
pational  classifications  would  reveal  that  the  rela¬ 
tionship  between  belonging  to  a  social  group — and 
not  necessarily  to  one  of  the  two  major  classes 
conceptualized  by  Marxism — and  voting  behavior 
remain  significant.  Finally,  the  ideological  supposi¬ 
tion  according  to  which  the  working-class  vote,  for 
example,  can  only  be  classified  as  a  class  vote  if  it 
is  a  left-wing  vote  can  be  called  into  question.  For 


example,  in  France  in  2002,  working-class  voters 
voted  more  than  any  other  social  group  for  the 
extreme  Right.  Flow  can  such  a  vote  be  qualified 
according  to  the  class  voting  theory? 

The  same  is  true  for  the  religious  variable. 
Although  surveys  (notably  in  Europe)  have 
revealed  a  decrease  in  both  religious  attachment 
and  the  relationship  between  religious  belonging 
and  voting  behavior,  this  relationship  is  far  from 
having  disappeared.  Furthermore,  the  increasing 
importance  of  Muslims  in  Europe  has  made  this 
variable  relevant  once  again  in  the  analysis  of  vot¬ 
ing  behavior.  In  France,  for  example,  Muslims  are 
far  more  likely  to  vote  left  than  right.  In  the  same 
way,  in  the  United  States,  presidential  elections  in 
this  century  still  show  the  permanence  of  the 
strong  relationship  between  religion  and  how 
people  vote. 

Finally,  even  if  the  hypothesis  according  to 
which  the  vote  has  become  individualized  should 
be  taken  seriously,  it  nonetheless  merits  a  certain 
degree  of  relativity.  In  particular,  although  the  role 
played  by  certain  social  variables  in  voting  behav¬ 
ior  is  diminishing,  the  role  played  by  other  social 
variables  is  emerging,  reappearing,  or  indeed  gain¬ 
ing  importance.  This  applies,  for  example,  for  gen¬ 
der  and  age.  Furthermore,  in  a  number  of  European 
countries,  the  regional  vote,  which  is  a  vote  based 
on  a  sense  of  identity,  has  been  strongly  expressed 
for  several  years,  such  as  in  Scotland,  in  Catalonia, 
in  the  Basque  country,  and  in  Flanders.  In  the  same 
way,  the  ethnic  or  racial  vote  is  not  receding  either 
and  may,  indeed,  be  increasing.  In  the  2008 
American  presidential  elections,  95%  of  African 
Americans  voted  for  Barack  Obama  as  compared 
to  43%  of  Americans  of  European  descent. 

Whatever  the  tendency  toward  the  individual¬ 
ization  of  Western  societies,  voters  remain  indi¬ 
viduals  belonging  to  different  social  groups  and 
networks.  Individuals  have  multiple  identities,  and 
one  or  another  of  these  may  more  or  less  dominate 
and  be  mobilized  depending  on  the  period,  the 
events,  and  what  the  parties  and  candidates  are 
offering.  Although  the  sociological  approach  has 
real  limits — and  in  particular  it  has  not  always  suf¬ 
ficiently  concerned  itself  with  what  politics  has  to 
offer,  at  times  reifying  a  given  determinant  and 
overestimating  its  permanence  or  its  centrality — it 
remains  nonetheless  an  essential  approach  in  the 
analysis  of  electoral  behavior. 
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The  Sociopsychological  Approach 

In  the  1950s,  the  publication  of  The  American 
Voter  brought  about  a  turning  point  in  the  study 
of  electoral  behavior.  The  starting  point  for  the 
model  used — known  as  the  Michigan  model — was, 
on  the  one  hand,  the  apparent  contradiction 
between  the  absence  of  political  sophistication 
among  the  electorate  and  the  instability  of  their 
opinions  on  different  issues  and,  on  the  other 
hand,  the  fact  that  the  vast  majority  of  voters  con¬ 
stantly  voted  for  the  same  party  throughout  their 
lifetime.  This  model,  which  was  founded  on  the 
importance  of  political  socialization  (familial,  pro¬ 
fessional,  ethnic,  regional,  etc.)  and  on  the  way  in 
which  political  choices  and  preferences  develop 
among  individuals,  has  produced  the  concept  of 
party  identification  to  explain  electoral  behavior. 
Partisan  identification  is  of  an  affective  nature.  It  is 
politically  stable,  and  this  stability  may  even 
become  stronger  with  age.  Individuals  often  repro¬ 
duce  the  same  partisan  preferences  of  their  par¬ 
ents.  This  partisanship  creates  a  basis  for  political 
identity  and  provides  cues  for  evaluating  political 
events,  candidates,  issues,  and  subsequently  for 
voting  preferences. 

This  model  also  has  a  strong  heuristic  value. 
However,  it  has  nonetheless  been  criticized  for 
many  reasons.  It  has  been  judged  to  be  tautological 
and  nonexplicative — voters  who  vote  for  a  given 
party  say  they  feel  close  to  the  party  as  a  result  of 
voting  for  it  and  not  the  other  way  around. 
Furthermore,  more  recent  studies  have  shown  a 
strong  decrease  in  partisan  identification  in  several 
countries.  Various  theses  on  the  modernization 
and  individualization  of  our  societies  have  high¬ 
lighted  the  fact  that  it  is  less  useful  for  a  more  and 
more  sophisticated  electorate  to  use  partisan  iden¬ 
tification  as  a  shortcut  in  deciding  how  to  vote. 
Other  studies  have  measured  increasing  levels  of 
individual  electoral  volatility  and  have  highlighted 
the  fact  that  the  partisan  de-alignment  phenome¬ 
non  is  part  of  a  general  process  of  political  change 
that  is  transforming  the  relationship  between  vot¬ 
ers  and  parties.  Finally,  it  has  been  noted  that  this 
concept  has  always  been  more  relevant  in  two- 
party  systems  than  in  countries  where  the  system  is 
multipartisan  and  unstable. 

The  psychological  and,  in  particular,  the  affec¬ 
tive  dimension  of  the  concept  should  be  put  into 
perspective  although  it  does  still  exist  for  a  certain 


category  of  voters.  However,  a  little  vaguer  but 
perhaps  more  heuristic  notion  of  partisan  proxim¬ 
ity,  but  also  possibly  of  partisan  distance,  remains 
useful.  For  many  voters,  it  represents  an  important 
way  to  locate  themselves  in  the  political  landscape. 
The  general  dynamic  of  political  modernization 
tends  to  individualize  electoral  behavior.  However, 
this  does  not  mean  that  all  citizens  have  the  same 
relationship  to  politics,  the  same  level  of  sophisti¬ 
cation,  and  the  same  interest  in  or  information 
about  politics.  Some  people  have  a  greater  need  of 
shortcuts  and  symbolic  clues  to  help  them  decide 
how  to  vote.  In  continental  Europe,  where  party 
systems  are  frequently  multipartisan,  the  notions 
of  left  and  right  are  often  more  useful  tools  to 
study  the  orientation  and  stability  of  political  pref¬ 
erences  than  the  notion  of  party  identification,  as 
used  in  Great  Britain  or  the  United  States.  The 
notions  of  Left  and  Right  can  play  a  comparable 
role.  Studies  have  shown,  furthermore,  that  the 
importance  of  the  voter’s  relationship  to  the 
party — or  to  the  Right  or  the  Left — is  not  elimi¬ 
nated  by  the  increasing  importance  of  the  leading 
candidates’  personalities.  This  is  all  the  more  true 
as  the  major  candidates  are  designated  and  sup¬ 
ported  by  political  parties. 

Attitudes  of  an  affective  nature  in  the  relation¬ 
ship  between  the  voter  and  the  political  system  do 
not,  however,  only  concern  the  relationship  to 
political  parties.  They  also  concern  the  candidates 
and  other  political  personalities.  The  dynamic  at 
work  in  most  democracies  shows  the  increasing 
importance  of  the  personalization  of  politics,  in 
particular  in  presidential  or  semipresidential  sys¬ 
tems.  The  role  played  by  the  media,  by  opinion 
polls,  and  by  expert  opinion  is  in  strong  competi¬ 
tion  with  the  role  played  by  the  parties  and  favors 
the  personalization  of  politics.  The  choice  of  a 
leader  includes  an  affective  dimension  to  the  extent 
that,  at  a  given  moment,  the  leader  incarnates 
responsibility  for  the  destiny  of  the  political  com¬ 
munity.  Studies  have  shown  that  the  governmental 
credibility  of  a  leader  plays  an  important  role  in 
decisions  on  how  to  vote.  This  is  particularly  true 
at  times  of  serious  national  crisis,  whether  it  be 
economic,  international,  or  political.  The  notion  of 
the  leader’s  credibility,  which  is  an  important 
dimension  for  electoral  choice,  cannot  be  sepa¬ 
rated  from  the  notion  of  confidence,  which  is 
partly  of  an  affective  nature.  This  is  why,  when 
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designating  candidates,  the  major  parties  must 
take  the  charismatic  aspect  of  their  personalities 
into  account.  Admittedly,  studies  have  undermined 
the  hypothesis  according  to  which  voters  essen¬ 
tially  make  a  decision  based  on  the  personal  image 
of  a  candidate  and  independently  of  their  personal 
political  orientations  and  of  those  of  the  candidate. 
Conversely,  however,  the  candidate’s  personal 
image  cannot  be  considered  to  be  secondary. 
Political  leaders  have  become  electorally  important 
in  their  own  right  by  personifying  the  policy  plat¬ 
forms  of  their  respective  parties.  This  personifica¬ 
tion  necessarily  contains  a  psychological  dimen¬ 
sion  even  if  this  is  not  the  only  dimension. 

The  psychosociological  approach  to  voting 
behavior  does  not  merely  concern  the  study  of  the 
effects  of  partisan  identification  and  personaliza¬ 
tion.  It  also  takes  the  voter’s  ideological  orienta¬ 
tions  and  value  systems  into  account.  Even  though 
these  systems  may  be  complex  and  modifiable, 
they  are  nonetheless  consistent  and  long  lasting.  In 
the  political  landscape,  some  of  the  major  value 
cleavages  reflect  cleavages  in  the  politico-ideologi¬ 
cal  field.  From  this  point  of  view,  the  role  of  the 
left-right  dimension  is  also  important.  In  many 
countries,  studies  have  shown  that  two  main  sub¬ 
dimensions  of  attitudes  provide  structure  to  the 
left-right  dimension.  These  are  the  economic 
dimension,  on  the  one  hand  (economic  liberalism 
vs.  state  interventionism),  and  the  societal  dimen¬ 
sion,  on  the  other  hand  (cultural  liberalism  or  lib¬ 
ertarianism,  or  in  a  certain  sense,  postmaterial- 
ism),  versus  cultural  conservatism.  A  voter  who 
subscribes  to  both  economic  liberalism  and  cul¬ 
tural  conservatism  is  more  likely  to  vote  for  the 
Right,  whereas  hostility  to  economic  liberalism 
and  support  for  cultural  liberalism  is  more  likely  to 
foster  a  vote  for  the  Left.  To  take  this  further, 
France  is  a  country  where  there  is  great  diversity  in 
what  is  on  offer  politically.  It  has  been  noted  in 
recent  elections  that  the  existence  of  a  supply  of 
centrist  and  extreme-right  parties  has  favored  the 
political  expression  of  economic  antiliberals  and 
cultural  antiliberals  who  voted  for  the  extreme 
Right  and  economic  liberals  and  cultural  liberals 
who  voted  for  the  centrist  parties.  It  seems,  there¬ 
fore,  that  the  left-right  dimension  is  in  reality  not 
a  continuum  but  a  complex  structure  made  up  of 
different  ideological  dimensions  that  are  expressed 
in  different  ways  depending  on  the  period  and  on 


the  country.  Thus,  in  many  European  countries, 
the  increasing  importance  attributed  by  the  social¬ 
ist  left  to  the  values  of  cultural  liberalism  and  anti¬ 
racism,  which  is  widely  shared  by  the  well-educated 
middle  classes,  encouraged  a  certain  number  of 
working-class  voters  to  change  their  electoral 
behavior.  These  were  voters  who,  up  until  then, 
had  been  attached  to  the  left  because  of  their 
economic  antiliberalism  positions  but  whose  xeno¬ 
phobic  tendencies  were  increasing.  They  availed 
themselves  of  the  presence  of  an  extreme-right 
party  that  combined  the  two  types  of  antiliberalism 
in  its  ideology  and  voted  for  it. 

Similar  to  the  sociological  approach,  the  psy¬ 
chological  and  sociopsychological  approaches 
have  kept  heuristic  power.  Furthermore,  it  should 
be  noted,  as  this  last  example  shows,  that  the  two 
approaches  are  complementary  to  the  extent  that 
there  are  strong  relationships  between  social 
groups  and  value  systems. 

The  Rational  Choice  Approach 
and  Short-Term  Elements 

The  rational  choice  approach  has  more  recently 
influenced  the  study  of  voting  behavior.  Its  point  of 
departure  is  the  notion  of  a  cost-benefit  calculation. 
Voters  vote  in  accordance  with  their  preferences  for 
the  party  that  will  best  satisfy  them.  Based  on  a 
microeconomic  decision-making  model,  illustrated 
notably  by  Anthony  Downs,  this  approach  differs 
from  the  sociological  and  psychological  approach 
in  that  it  considers  the  voter  to  be  a  rational  indi¬ 
vidual.  According  to  this  model,  citizens  vote  for 
the  party  they  believe  will  provide  them  with  more 
benefits  than  the  other  parties.  Parties  offer  policies; 
voters  look  at  these  policies  and  decide  which  of 
them  will  provide  maximum  benefits  for  them  and 
vote  accordingly.  Parties  and  candidates,  on  the  one 
hand,  and  voters,  on  the  other,  are  in  a  constant 
state  of  interaction.  Parties  constantly  adapt  to  the 
demands  of  the  voters  in  an  attempt  to  conquer  or 
maintain  power,  which,  according  to  this  theory,  is 
their  essential  and  indeed  only  aim. 

This  approach  has  been  very  strongly  criticized 
in  different  ways.  It  has  been  argued  that  the  notion 
of  interest  or  utility  is  not  directly  applicable  to  the 
relationship  of  citizens  to  politics,  as  their  expecta¬ 
tions  are  expressed  in  a  complex  way  both  in  terms 
of  material  and  symbolic  retributions  and  also  in 
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terms  of  precise  demands  and  attachment  to  values. 
It  becomes  necessary  to  attribute  such  a  broad 
meaning  to  the  notion  of  preference  that  the  use  of 
the  notion  of  utility  becomes  problematic.  Certain 
critics  have  gone  so  far  as  to  claim  that  this 
approach  has  no  real  value.  The  result  has  been  a 
veritable  battle  between  two  schools  of  thought. 

And  yet  this  model  holds  many  advantages.  It 
highlights  the  importance  of  the  process  of  interac¬ 
tion  between  voters  and  parties.  This  approach 
stresses  that  any  study  of  voting  behavior  cannot 
take  only  the  voter  into  account  without  also 
including  the  political  system,  the  party  system, 
and  the  set  of  alternative  propositions  (set  of  poli¬ 
cies)  proposed  by  the  parties  and  candidates.  Here, 
the  role  played  by  what  the  parties  have  to  offer  is 
essential.  The  parties  frame  what  is  on  offer  politi¬ 
cally,  and  the  choice  made  by  the  voter  is  largely 
constrained  by  the  issues  that  the  parties  them¬ 
selves  have  chosen  to  highlight. 

Another  advantage  of  this  model  is  that  it  places 
the  voter/individual  at  the  heart  of  the  process  and 
considers  that,  whatever  their  position  in  society, 
voters  are  capable  of  working  out  their  electoral 
choices  themselves.  What  is  involved  is  an  indi¬ 
vidual  decision-making  process.  A  significant  con¬ 
tribution  offered  by  this  approach  is  that  it  makes 
a  real  attempt  to  build  complex  statistical  models 
to  enhance  understanding  of  electoral  behavior. 
The  development  of  this  approach  has  contributed 
in  a  certain  sense  to  the  current  tendency  in  elec¬ 
toral  analysis  to  show  a  general  decline  of  the  long¬ 
term  determinants,  a  growth  in  the  importance  of 
short-term  attitudes,  and  a  fragmentation  of  voter 
choice  (diversity  of  decision-making  processes). 
This  tendency  notably  benefits  research  on  issue 
voting  and  economic  voting. 

Analyses  of  issue  voting  partly  stem  from  the 
rational  choice  approach  to  the  extent  that  they 
deal  with  the  interactive  political  relation  between 
supply  and  demand  and  presuppose  that  the  voter 
is  relatively  sophisticated.  Two  types  of  issues  are 
usually  identified:  position  issues,  where  the  parties 
give  priority  to  different  issues,  each  in  relation  to 
the  other,  with  these  issues  thus  being  in  competi¬ 
tion  with  each  other,  and  performance  issues 
(valence  issues),  where  the  parties  pursue  objectives 
aspired  to  by  all  (e.g.,  a  reduction  in  the  rate  of 
unemployment)  and  try  to  demonstrate  their  com¬ 
petence  in  this  respect  as  a  means  to  reach  them. 


In  terms  of  issue  voting,  the  rational  choice 
approach  has  produced  spatial  theories  on  voting 
behavior.  The  proximity  theories  of  voting  place 
voters  and  parties  in  a  political  landscape.  According 
to  these  theories,  voters  feel  close  to  the  parties  they 
vote  for  and  distant  from  those  they  don’t  vote  for, 
and  there  is  a  measurable  distance  between  the 
voter’s  position  on  policies  and  issues  and  the  par¬ 
ty’s  equivalent  positions.  There  is  a  distance  between 
the  voter  and  the  parties  on  offer,  the  function  of 
this  distance  being  to  identify  which  party  a  voter 
will  choose  in  a  given  election.  The  “proximity” 
function,  which  has  long  been  accepted  for  model¬ 
ing  party  and  self-placement  data,  has  been  chal¬ 
lenged  by  directional  theories.  Based  on  an  analysis 
of  the  notion  of  proximity,  the  rational  choice  mod¬ 
els  concluded  that  it  was  in  the  interests  of  the  par¬ 
ties  to  adopt  the  position  of  the  median  voter. 
Supporters  of  the  directional  thesis  bring  not  only 
the  choice  of  an  issue  by  the  voter  into  play  but  also 
the  (psychological)  intensity  of  the  preference  for  the 
issue  in  question.  Here,  the  rational  choice  model 
includes  recourse  to  the  psychosociological  one. 
Indeed,  within  this  model,  voters  may  prefer  a  party 
that  has  a  very  strong  position  on  the  issue  in  ques¬ 
tion,  even  if  it  is  not  the  party  the  voter  feels  closest 
to.  By  the  same  token,  it  may  be  in  the  party’s  inter¬ 
est  to  adopt  a  strong  position  on  the  issue  and  not 
necessarily  to  attempt  to  position  itself  in  relation 
to  the  median  voter.  Under  these  conditions,  parties 
do  not  necessarily  converge  toward  the  center  of 
the  political  landscape.  Today,  this  family  of  mod¬ 
els  represents  the  formalized  attempt  to  draw  up  a 
synthesis  between  the  different  approaches  encom¬ 
passing  both  the  rational  choice  elements  of  the 
proximity  model  and  the  psychological  processes  of 
the  directional  equivalent. 

All  analyses  of  issue  voting  pose  problems  of  a 
more  general  nature.  First,  their  specific  contribu¬ 
tion  cannot  be  evaluated  without,  at  the  same  time, 
taking  the  contributions  made  by  other  types  of 
variables  that  affect  voting  behavior  into  account. 
For  example,  a  voter  who  feels  very  close  to  a  given 
party  will  tend  to  attribute  the  best  mark  to  this 
party  on  all  the  issues  proposed.  Thus,  in  this  model, 
the  question  of  causality  (what  comes  first?)  has  to 
be  addressed.  Second  and  more  important,  prefer¬ 
ences  on  issues  are  not  independent  of  the  more 
clearly  articulated  and  long-lasting  value  systems  of 
voters  or  of  their  social  position  and  socialization. 
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Therefore,  both  the  short-term  (new  issues)  and  the 
long-term  (stable  and  consistent  ideological  align¬ 
ments)  factors  must  be  taken  into  account.  From 
this  point  of  view,  the  analysis  of  issue  voting  can¬ 
not  be  carried  out  independent  of  the  analysis  of 
values  and  attitudes.  However,  a  new  issue  can,  for 
example,  disturb  or  contribute  to  the  reconfigura¬ 
tion  of  different  dimensions  of  a  voter’s  value  sys¬ 
tem  or  indeed  reverse  the  priority  given  by  the  voter 
to  one  dimension  over  another  within  the  system.  It 
can  update  the  voter’s  long-term  predispositions 
(concept  of  funnel  of  causality). 

Short-term  analyses  also  cover  economic  voting. 
They  are  based  on  the  hypothesis  according  to 
which  an  analysis  by  the  voter  of  the  economic 
performance  of  those  in  power  or  simply  the  vot¬ 
er’s  perception  of  the  economic  situation  and  its 
development  has  an  influence  on  his  or  her  vote. 
Analysis  of  economic  voting  poses  several  prob¬ 
lems  even  if  certain  results  remain  convincing. 
First,  the  results  obtained  are  not  always  consis¬ 
tent,  depending  on  the  country  and  period  of  time 
involved.  Second,  retrospective  judgment  analyses 
on  policies  implemented,  particularly  in  the  eco¬ 
nomic  field,  show  that,  similar  to  judgments  on 
issue  voting,  they  are  not  independent  of  voters’ 
partisan  proximities  and  their  political  orienta¬ 
tions  and  therefore  of  the  political  orientation  of 
the  incumbent  government. 

Conclusion 

The  study  of  voting  behavior  is  an  extremely  com¬ 
plex  affair  for  all  the  reasons  described  above  but 
also  because  political  systems  evolve  over  time  and 
according  to  country,  and  voters  adapt  to  the 
changes  in  these  systems.  Whatever  the  attempts  to 
universalize  the  different  theories,  the  significance 
of  the  vote  differs  from  one  society  to  another, 
from  one  historical  phase  to  another,  and  from  one 
voter  to  another.  The  studies  available  on  societies 
other  than  North  American  and  European  ones 
(which  are  still  too  scarce)  show  that  the  major 
approaches  used  here  do  not  always  work  very 
well  in  other  systems.  Thus,  in  democratic  societies 
in  the  making,  with  embryonic  party  systems  and 
without  any  real  unity  or  national  awareness,  and 
in  societies  that  are  still  traditional,  the  social 
determinants  that  produce  the  vote  may  be  quite 
different.  Many  factors  can  strongly  influence  the 


vote,  such  as  belonging  to  an  ethnic,  religious,  or 
village  community;  belonging  to  a  client-type  net¬ 
work;  the  existence  of  mafia-type  systems;  and  so 
on.  Research  must  therefore  extend  itself  in  the 
future  to  include  the  whole  range  of  electoral 
behaviors. 

Finally,  research  on  institutional  aspects  and  the 
rules  that  organize  voting  must  also  be  developed. 
This  is  notably  the  case  for  the  effects  on  electoral 
behavior  of  voting  systems,  the  size  of  constituen¬ 
cies,  the  political  and  institutional  issues  at  stake 
during  a  given  election,  and  the  material  organiza¬ 
tion  itself  of  the  vote. 

Gerard  Grunberg 
CNRS/Sciences  Po  Paris 
Paris,  France 
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Electoral  Campaigns 


Electoral  campaigns  are  the  means  by  which 
political  parties,  alliances,  coalitions,  and  candi¬ 
dates  convey  their  policies  and  election  programs 
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to  the  electorate  in  an  election  contest.  Electoral 
campaigns  are  essential  in  a  democratic  society 
that  holds  national  and  local  elections  on  a  peri¬ 
odic  basis.  International  standards  governing  the 
environment,  conduct,  and  principles  involved  in 
electoral  campaigns  in  democratic  elections  have 
evolved  over  the  past  several  decades. 

In  order  to  create  an  environment  conducive  to 
the  organization  and  conduct  of  free  and  fair  elec¬ 
tions,  a  number  of  checks  and  balances  have  been 
developed  over  time  to  assist  parties  and  candidates 
in  ensuring  a  level  playing  field  during  the  campaign 
period.  Similarly,  measures  have  been  developed  to 
avoid  or  reduce  the  threat  of  intimidation  or  vio¬ 
lence  against  candidates,  parties,  or  their  support¬ 
ers.  In  addition  to  provisions  dealing  with  election 
campaigns  in  the  given  legal  framework,  a  code  of 
conduct  aimed  at  parties,  candidates,  and  their  sup¬ 
porters  is  often  recommended.  In  a  similar  vein,  a 
code  of  conduct  to  guide  the  media,  particularly  the 
publicly  owned  media,  is  always  recommended. 

Nature  of  Electoral  Campaigns 

Many  electoral  legislative  schemes  stipulate  a  fixed 
period  during  which  electoral  campaigning  should 
take  place.  The  stipulated  period  may  range  from 
14  days  to  11  weeks  or  even  longer.  The  campaign 
period  may  be  required  to  end  24  or  48  hours,  or 
even  some  days,  before  polling  day.  The  electoral 
law  or  the  code  of  conduct  for  parties,  alliances/ 
coalitions,  and  candidates  sometimes  impose  pen¬ 
alties  for  campaigning  outside  the  official  desig¬ 
nated  period. 

Electoral  campaigns  flourish  best  where  there 
are  constitutional  guarantees  of  freedom  of  asso¬ 
ciation  and  freedom  of  speech.  These  fundamental 
rights  are  essential  to  the  electoral  campaign  envi¬ 
ronment,  as  was  vividly  shown  to  the  world  in 
Zimbabwe’s  run-off  election  in  June  2008  when 
the  opposition  candidate,  Morgan  Tsvangirai,  had 
to  withdraw  from  the  contest  because  of  disrup¬ 
tion  to  his  campaign  rallies,  intimidation,  and 
violence  against  his  supporters. 

Methods  of  Electoral  Campaigns 

There  are  a  number  of  electoral  campaign  methods, 
including  rallies,  motorcades,  radio  broadcasts, 
television  broadcasts,  debates,  party  manifestos, 


advertisements,  house-to-house  canvassing,  and  the 
use  of  banners,  fliers,  buntings,  billboards,  shop¬ 
ping  bags,  clothing,  and  theater  plays.  In  many 
countries,  the  issuance  of  permits  to  hold  rallies  and 
motorcades  resides  with  the  police  authorities,  who 
can  act  in  favor  of  the  ruling  party.  The  issuance  of 
permits  to  hold  rallies  or  motorcades  often  operates 
more  smoothly  when  it  is  done  by  the  electoral 
management  body  in  cooperation  with  the  police 
authorities  as  in  Gambia  and  Sierra  Leone. 

Media  access,  particularly  the  publicly  owned 
media — electronic  and  print — is  sometimes  regu¬ 
lated  by  specific  legislation  and/or  by  a  media  code 
designed  to  regulate  election  campaign  reporting. 
The  publicly  owned  electronic  media  may  allow 
free  airtime  in  accordance  with  an  agreed  formula 
(based  on  equal  or  equitable  or  fair  allocation  free 
time).  In  many  new  and  emerging  democracies,  the 
public  media  find  it  difficult  to  follow  the  rules 
relating  to  the  allocation  of  free  time  to  the  opposi¬ 
tion.  The  private  media  sometimes  enter  into  vol¬ 
untary  conduct  aimed  at  developing  balanced 
reporting  during  an  electoral  campaign,  but  this 
approach  is  not  widespread. 

The  behavior  of  the  media  can  have  a  signifi¬ 
cant  impact  on  leveling  the  playing  field  for  the 
contestants  in  an  electoral  campaign.  Fair  access  is 
only  one  dimension  of  the  media  environment. 
There  is  also  the  issue  of  whether  or  not  the  media 
reports  campaign  news  in  a  balanced  and  nonpar¬ 
tisan  way  and  whether  misstatements  are  corrected 
with  equal  prominence  as  the  original  report. 

Manifestos 

It  is  customary  for  political  parties,  alliances,  coali¬ 
tions,  and  candidates,  including  independent  can¬ 
didates,  to  publish  an  election  manifesto  or  plat¬ 
form  during  the  electoral  campaign.  Such  an  elec¬ 
tion  manifesto  or  platform  sets  out  the  program 
that  the  parties  or  candidates  intend  to  put  across 
to  the  voters  as  the  one  they  would  implement  if 
they  were  elected.  This  proposition  suggests  that 
manifestos  in  national  elections  are  about  issues 
affecting  voters  and  the  country.  However,  often, 
perhaps  too  often,  the  campaigns  are  run  on  the 
personality  of  individual  candidates  rather  than 
the  issues  raised  by  the  manifestos. 

The  extent  to  which  a  party’s  or  candidate’s 
manifesto  or  platform  is  binding  may  be  political 
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rather  than  legal.  Parties  or  candidates  who  do  not 
honor  their  manifestos’  pledges  may  pay  a  political 
price  at  the  next  election.  Of  course,  these  pro¬ 
grams  may  be  modified  to  meet  changed  circum¬ 
stances,  but  such  changes  should  be  properly 
explained  to  the  electorate.  There  have  been  cases 
where  elections  have  been  held  without  political 
parties,  as  in  Uganda  until  recently  and  Swaziland, 
and  there  are  a  number  of  countries  where  inde¬ 
pendent  candidates  are  not  allowed  to  contest  elec¬ 
tions,  as  in  Tanzania. 

Incumbency 

The  term  incumbency  refers  to  the  party  in  power, 
that  is,  a  party  or  the  parties  forming  a  government 
at  some  level  or  to  an  individual  holding  an  office. 
An  incumbent  authority  usually  attracts  advan¬ 
tages  in  a  campaign  by  virtue  of  the  office  held, 
unless  steps  are  taken  to  reduce  the  incidence  of 
abuse  of  such  an  office.  To  avoid  incumbency 
abuses,  some  countries,  most  notably  India,  have 
introduced  strict  measures  by  way  of  a  code  of 
conduct  governing  incumbent  parties  and  candi¬ 
dates  during  electoral  campaigns.  These  measures, 
for  example,  restrict  the  combination  of  ministe¬ 
rial  functions  with  party  political  campaigning  and 
forbid  the  use  of  public  resources  for  campaign 
purposes.  Bangladesh’s  attempt  to  reduce  the 
influence  of  incumbency  on  general  elections  was 
to  require  the  government  to  leave  office  at  the  end 
of  its  term  and  turn  over  the  role  of  government  to 
a  caretaker  government  to  oversee  the  preparation 
and  conduct  of  the  elections.  Regrettably,  that 
approach  has  not  worked  well  and  not  only  has  it 
led  to  the  disintegration  of  the  democratic  path  to 
holding  elections  but  also  in  part  has  brought 
down  the  entire  system  of  democracy  and  brought 
in  an  army-backed  regime. 

Level  Playing  Field 

The  phrase  level  playing  field  has  taken  hold  with 
respect  to  electoral  campaigns  and  is  used  to 
describe  the  treatment  meted  out  to  competing 
contestants  during  an  electoral  campaign.  Often, 
complaints  are  triggered  by  stakeholders  when 
they  consider  that  the  election  environment  does 
not  give  rise  to  a  level  playing  field.  This  situation 
arises  frequently  with  access  to  the  media  and  to 


the  use  of  public  resources.  The  negative  aspect  of 
a  level  playing  field  sometimes  comes  into  play 
when  the  police  authorities  delay  or  refuse  to  grant 
permits  for  election  rallies. 

Campaign  Finance 

Dealing  with  campaign  financing  may  range  from 
no  regulation  by  the  state  or  the  election  manage¬ 
ment  body  (EMB)  concerned  to  tight  controls 
either  by  the  state  or  by  the  EMB.  Such  controls 
may  include  limits  on  maximum  expenditure  by 
parties  and/or  candidates,  party-sponsored  or 
independent  maximum  public  contribution,  or 
maximum  contribution  by  individuals  or  compa¬ 
nies.  There  may  also  be  strict  rules  governing  dis¬ 
closure  of  the  amount  of  private  contributions 
received  during  the  electoral  campaign.  Such  dis¬ 
closure  is  often  open  to  public  scrutiny,  and  the 
contributions  are  subject  to  audit  by  reputable 
auditors.  Campaign  finance  reporting  should  be 
required  at  a  stipulated  period  before  and/or  after 
polling  day.  Failure  to  meet  the  stipulated  deadline 
usually  incurs  significant  sanctions,  and  any  mate¬ 
rial  breach  of  the  campaign  finance  rules  may 
result  in  forfeiture  of  the  elected  office  by  a  suc¬ 
cessful  candidate. 

Role  of  the  Media 

The  media  play  the  most  important  role  in  convey¬ 
ing  to  the  electorate  the  message  of  parties  and 
candidates.  The  electronic  media  has  been  playing 
an  increasingly  prominent  role  through  television, 
radio,  and  the  Internet  (mainly  in  the  developed 
world).  In  many  new  and  emerging  democracies, 
the  radio  has  the  greatest  reach  with  respect  to 
comprehensive  countrywide  coverage  (although  in 
a  few  countries  even  the  use  of  radio  batteries  dur¬ 
ing  a  campaign  period  have  proven  to  be  beyond 
the  means  of  rural  dwellers). 

Often,  a  distinction  is  made  for  electoral  cam¬ 
paign  purposes  between  publicly  and  privately 
owned  electronic  media.  This  is  because  the  ruling 
party  frequently  influences  those  public  bodies  to 
operate  in  a  partisan  manner.  This  behavior  is 
often  mitigated  by  codes  of  conduct  for  the  media 
whereby  a  schedule  for  free  time  broadcast  by  par¬ 
ties  and  candidates  is  regulated  by  statute  or  vol¬ 
untarily.  Free  time  allocation  is  done  by  a  formula 
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based  on  fair,  equal,  or  equitable  distribution.  The 
basic  rule  of  good  practice  is  that,  in  addition  to 
fair  access  to  the  electronic  media,  the  reporting  of 
election  news  and  the  coverage  thereof  should  be 
balanced. 

The  print  media  also  play  an  important  role  in 
many  campaigns.  A  similar  distinction  between 
public  and  privately  owned  print  media  and  code 
of  conduct  may  be  useful  in  creating  a  level  playing 
field  with  respect  to  access  between  the  ruling 
party  and  others.  However,  in  addition  to  fair 
access  to  publicly  owned  print  media,  the  privately 
owned  media  should  have  access  to  print  paper 
and  to  distribution  outlets  as  in  normal  times  out¬ 
side  campaign  periods. 

Security 

The  security  of  party  functionaries  and  candidates 
is  pivotal  during  the  period  of  election  campaign¬ 
ing.  Some  electoral  legislative  schemes,  supported 
by  codes  of  conduct  for  parties,  their  supporters, 
and  candidates,  provide  the  security  framework 
for  election  campaigns.  Occasionally,  security 
measures  that  were  designed  for  election  cam¬ 
paigns  do  not  follow  through  to  the  immediate 
postpolling  period  as  happened  during  the  post¬ 
election  violence  in  Kenya  in  2007-2008.  Equally 
disturbing,  the  election  campaign  security  may 
break  down  through  the  partisan  behavior  of  the 
security  forces  either  at  the  instigation  of  the  gov¬ 
erning  party  or  with  their  complicity,  as  in  the 
run-off  presidential  election  in  Zimbabwe  in  June 
2008  between  Robert  Mugabe  and  Morgan 
Tsvangirai.  Inadequate  campaign  security  in  the 
past,  and  even  recently,  as,  for  example,  in 
December  2007,  when  Benazir  Bhutto  was  assas¬ 
sinated  at  a  campaign  rally  in  Pakistan,  can  pro¬ 
foundly  and  adversely  affect  election  campaigns 
and  may  even  influence  the  results.  The  security 
forces  employed  during  election  campaigns  should 
be  properly  trained  and  ensure  adequate  security 
to  all  contestants  without  partisanship  during  an 
election  campaign. 

Intimidation  and  Violence 

When  perpetrated  during  an  election  campaign 
and  designed  to  interfere  in  any  way  with  an  oppo¬ 
nent’s  legitimate  campaign  activities,  intimidation 


and  violence  should  be  condemned  and  sanctioned 
strongly  under  the  election  law  and/or  the  code  of 
conduct  for  parties,  their  supporters,  and  candi¬ 
dates.  Credible  reports  by  independent  election 
observers,  international  and  domestic,  and  other 
stakeholders  described  a  campaign  environment  in 
Zimbabwe  for  the  June  2008  run-off  presidential 
elections  of  relentless  and  continuing  violence  and 
intimidation  that  resulted  in  many  deaths  and  inju¬ 
ries.  According  to  many  stakeholders,  the  sus¬ 
tained  vicious  intimidation  and  violence  directed 
mainly  at  supporters  of  the  opposition  could  be 
attributed  to  premeditated  planning  by  the  ruling- 
party  to  prevent  the  opposition  contesting  the  run¬ 
off  elections.  Rallies  planned  by  the  opposition 
were  either  denied  the  required  police  permit  or, 
when  the  required  permit  was  granted,  disrupted 
by  ruling-party  activities.  The  consequence  of  the 
unlawful  acts  against  the  opposition  was  the  with¬ 
drawal  of  their  presidential  candidate,  leaving  the 
incumbent  presidential  candidate  unchallenged  at 
the  polls. 

Conclusion 

In  recent  years,  the  spotlight  in  election  organiza¬ 
tion  has  been  very  much  on  campaigns.  This  has 
led  to  more  political  parties  adhering  to  codes  of 
conduct  and  subscribing  to  greater  transparency  in 
campaign  financing.  The  growing  acceptance  of 
domestic  and  international  observation  at  national 
elections  campaigns  is  enhancing  the  transparency 
in  election  planning  and  conduct.  Notwithstanding 
the  notable  improvements  being  made,  certain 
undesirable  practices  linger  on  in  emerging  democ¬ 
racies  in  the  African  Union  countries  and  in  coun¬ 
tries  in  Eastern  Europe  (including  Russia)  and  in 
central  Asia.  These  undesirable  campaign  features 
include  undue  influence  on  EMBs,  abuse  of  incum¬ 
bent  advantages,  not  having  a  level  playing  field  in 
respect  of  access  to  publicly  owned  media,  and  use 
of  public  resources  by  ruling  parties. 
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Electoral  Geography 


The  electoral  geography  hypothesis  is  that  elec¬ 
toral  behavior  is  related  to  place — neighborhood, 
community,  town  or  city,  region,  or  country — and 
cannot  entirely  be  explained  by  considering  indi¬ 
vidual  voters,  and  nonvoters,  as  possessing  char¬ 
acteristics  on  an  atomized  and  isolated  basis.  It 
claims  that  geographical  and  contextual  effects 
are  significant  and  that  the  process  of  day-to-day 
personal  communication  and  social  interaction  in 
the  community — or  in  a  variety  of  communities — 
influences  voting  patterns.  As  explained  by  John 
Agnew  (2002),  localities  are  the  arena  for  political 
socialization  and  produce  the  settings  for  much 
interpersonal  social  interaction.  The  general  ten¬ 
sions  within  society  are  actualized  within  locali¬ 
ties,  influencing  the  relative  weight  and  various 
meanings  attached  to  social  divisions  and  hence 
the  appeal  of  different  ideologies — which  are  not 
the  same  everywhere.  Political  movements  then 
use  geography  in  their  search  for  electoral  and 
other  support,  using  claims  relating  to  localities, 
regions,  and  so  forth  to  mobilize  that  support. 

Geographical  Effects  on  Electoral  Behavior 

There  is  a  long  history  of  debate  about  whether 
such  geographical  effects  on  electoral  behavior  do 


in  fact  exist.  Opponents  of  the  electoral  geography 
hypothesis  have  contended  that  once  all  the  vari¬ 
ables  affecting  individual  electoral  behavior  have 
been  fully  identified  and  properly  specified,  there  is 
no  role  for  geographical  and  contextual  variables 
to  have  any  independent  effect.  Furthermore, 
research  on  the  existence  and  nature  of  neighbor¬ 
hood  effects  depends  on  the  existence  of  usable 
neighborhood  data:  As  a  result,  it  is  not  necessarily 
easy  to  investigate  the  various  levels  at  which  they 
may  exist. 

Neighborhood  effects  in  voting  patterns  were 
first  demonstrated  in  1937  in  Sweden  by  Herbert 
Tingsten,  who  showed  that  the  more  working  class 
the  area,  the  more  likely  the  working-class  voter 
was  to  cast  a  socialist  vote.  Much  subsequent  work 
in  this  area  has  focused  on  the  United  States  and 
the  United  Kingdom,  perhaps  because  the  political 
effects  of  the  links  between  electoral  geography 
and  political  representation  are  more  immediately 
evident  under  majoritarian  electoral  systems. 
Support  for  the  electoral  geography  hypothesis  has 
been  more  extensively  demonstrated  in  the  work  in 
the  UK,  over  many  years,  of  Ron  Johnston  and 
Charles  Pattie,  demonstrating  that,  in  the  words  of 
William  Miller  (1977),  “people  who  talk  together 
vote  together.”  These  effects  are  not,  however, 
uniform.  They  are,  for  example,  more  powerful  in 
“working-class”  neighborhoods  than  in  “middle- 
class”  neighborhoods,  probably  because  the  social 
circles  of  salaried  workers  are  likely  to  be  wider 
and  more  geographically  varied  than  those  of  resi¬ 
dents  of  poorer  neighborhoods. 

The  impact  of  neighborhood  effects  on  patterns 
of  political  representation  is  affected  not  only  by 
the  formation  and  evolution  of  social  communities 
but  also  by  the  geographical  definition  of  electoral 
communities  achieved  by  the  delimitation  of  elec¬ 
toral  boundaries.  The  legislation,  which  defines 
how  delimitation  and  redistribution  takes  place; 
the  independence  or  otherwise  of  the  authority  that 
undertakes  it;  and  the  attempts  of  political  parties 
and  movements  to  influence  it  in  their  favor  are  all 
relevant  factors.  It  is  not  possible  for  redistribution 
to  produce  outcomes  without  a  bias  of  some  form. 
The  inherence  of  geographic  factors  in  redistribu¬ 
tion  is  shown  in  the  analysis  of  Bernard  Grofman 
and  others  (1997),  breaking  down  bias  into  three 
component  parts:  (1)  bias  resulting  from  malap¬ 
portionment,  (2)  bias  resulting  from  different  levels 
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of  turnout,  and  (3)  bias  that  results  from  the  geo¬ 
graphic  distribution  of  party  vote  shares.  Although 
requirements  on  a  delimitation  authority  to  con¬ 
trol  or  eliminate  malapportionment  through  a 
stipulation  that  the  electorates  of  all  seats  be  equal 
within  a  stated  small  tolerance  level  are  now  com¬ 
mon,  statutory  exceptions  exist  for  sparsely  popu¬ 
lated  areas,  for  example,  Norway  and  Western 
Australia.  In  South  Australia,  boundary  revisions 
are  also  required  to  take  into  account  the  geogra¬ 
phy  of  party  or  grouping  vote  shares  and  ensure 
that  the  party  or  grouping  with  more  than  half  the 
two-party  preferred  vote  sees  this  reflected  by  an 
elected  majority  in  the  state  legislative  council. 

The  effect  of  demographic  changes  and  bound¬ 
ary  delimitation  can  be  accentuated  by  party  cam¬ 
paigning  activity,  as  UK  Liberals  and  later  Liberal 
Democrats  demonstrated  through  successes  gener¬ 
ated  by  intensive  campaigning  in  individual  locali¬ 
ties  in  local  elections  from  the  1970s  onward.  On  a 
larger  scale,  the  1997  and  subsequent  general  elec¬ 
tion  victories  of  the  UK  Labour  Party  also  owed 
much  to  the  party’s  efforts  to  “create  its  own  geog¬ 
raphy,”  in  Johnston  and  Pattie’s  words,  through 
targeted  geographical  focus  of  its  efforts  linked  to 
an  overall  increase  in  the  effectiveness  of  the  party 
machine.  Electoral  geography  is  not  only  deter¬ 
mined  by  changing  demographics  and  community 
formation  and  by  the  outcomes  of  boundary  delim¬ 
itation  exercises  but  also  by  the  response  of  political 
parties  in  targeting  their  resources,  the  intensity  of 
campaigning,  and  promoting  or  implicitly  accept¬ 
ing  tactical  voting.  Furthermore,  effective  party 
management  recognizes  that  changes  in  electoral 
geography  can  affect  the  motivation  and  activity  of 
party  organization  and  responds  accordingly. 

The  Salience  of  the  Electoral  System 

At  a  broader  level,  the  overall  electoral  geography 
of  a  country  and  hence  the  dynamic  not  only  of  its 
electoral  politics  but  of  its  political  system  as  a 
whole  are  linked  to  the  form  of  electoral  system 
chosen.  Cleavages  can  be  accentuated  or  weakened 
by  electoral  system  choice.  Majoritarian  systems 
usually  have  a  strong  inbuilt  tendency  to  produce 
a  seat  bonus  for  the  winning  party,  but  there  are 
occasions  in  practice  where  this  does  not  happen. 
In  Malawi  in  1999,  the  three  leading  parties, 
United  Democratic  Front  (UDF),  Malawi  Congress 


Party  (MCP),  and  Alliance  for  Democracy  (AforD), 
received  47%,  34%,  and  11%  of  the  national 
vote,  respectively,  which  converted  into  48%, 
34%,  and  15%  of  the  seats.  However,  82%  of  the 
UDF  seats  were  in  the  southern  region,  82  %  of  the 
MCP  seats  were  in  the  central  region,  and  97%  of 
the  AforD  seats  were  in  the  northern  region — 
which  inevitably  produced  a  dynamic  of  regional 
cleavage  in  the  parliament. 

By  contrast,  Indonesia  has  used  list  propor¬ 
tional  representation  (with  differences  in  detail)  in 
the  democratic  elections  of  the  reform  era  from 
1999  onward.  In  a  political  climate  where  the 
maintenance  of  the  unity  of  the  state  was  both  a 
domestic  political  and  a  societal  imperative  and  a 
priority  of  the  international  community,  this  type 
of  electoral  system  ensured  that  each  of  the  major 
parties  secured  representation  from  across 
Indonesia.  Probably  as  a  consequence,  political 
divisions  since  1999  have  hardly  demonstrated  a 
regional  dimension.  However,  mapping  of  the 
leading  party  in  each  local  authority  area  shows  a 
clustering  of  party  support  in  different  parts  of  the 
country,  which  would  almost  certainly  have  led  to 
party  groups  in  the  legislature  with  a  strong 
regional  origin  and  identity.  Had  this  happened, 
the  likely  consequence  would  have  been  the  emer¬ 
gence  of  regionalism  as  a  much  stronger  cleavage 
at  the  national  level,  with  probable  significant 
negative  effects  on  national  unity  and  the  process 
of  democratic  consolidation.  It  is  not  only  the  pro¬ 
portion  of  seats  gained  by  each  party  that  influ¬ 
ences  the  way  a  political  system  works  in  practice, 
for  the  geographical  distribution  of  the  leadership 
and  elected  members  can  be  just  as  significant. 

Future  Research 

While  there  no  longer  appears  to  be  significant  con¬ 
test  about  the  contribution  of  geographic  effects  to 
explain  electoral  behavior,  there  is  still  much  scope 
for  research  into  how  they  work.  The  study  of  elec¬ 
toral  geography  brings  together  political  scientists 
and  geographers,  and  there  may  be  rich  fields  for 
future  exploration  both  in  the  comparative  study  of 
electoral  geographies  and  in  the  change  of  electoral 
geographies  over  time,  booking  forward,  it  is  a  fair 
hypothesis  that  continuing  changes  in  the  way 
people  form  communities  of  different  kinds  will 
affect  the  working  of  the  electoral  geography 
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hypothesis  in  practice.  The  spread  of  access  to  tele¬ 
communications  and  personal  transport  in  many 
parts  of  the  world  has  clearly  affected  the  context 
of  the  evolution  of  communities.  The  growth  of 
Internet-based  social  networking  and  the  conse¬ 
quent  advent  of  virtual  communities  may  be 
expected  also  to  generate  new  impacts  on  the  way 
in  which  personal  interaction  leads  to  electoral 
decision. 
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Electoral  Systems 


One  may  view  electoral  rules  for  a  given  office  as 
having  six  basic  components:  (1)  determination  of 
who  is  eligible  to  be  on  the  ballot  (e.g.,  parties 
only  or  also  individual  candidates),  (2)  internal 
party  rules  for  determining  who  are  to  be  a  given 
party’s  candidates  and/or  for  specifying  candidate 
rankings  within  a  party  list,  (3)  specification  of 
ballot  type,  (4)  specification  of  constituencies  (dis¬ 
tricts),  (5)  determination  of  election  timing,  and 
(6)  rules  for  ballot  aggregation  (tallying  rules).  In 
addition,  what  is  not  to  be  forgotten  is  the  seventh 
component  of  all  elections — the  voter — whose 
preferences  and  beliefs  about  how  voting  rules 
will  translate  those  preferences  into  outcomes  as 


well  as  the  ways  in  which  voter  preferences  are 
geographically  distributed  will  critically  affect 
how  electoral  rules  operate  in  the  real  world. 

Sometimes  the  term  electoral  system  is  used 
more  broadly  to  include  other  aspects  of  elections 
and  their  regulation,  such  as  rules  for  voter  suf¬ 
frage,  campaign  finance,  campaign  advertising, 
location  of  and  times  of  access  to  polling  stations, 
and  so  on.  While  all  these  elements  are  interrelated 
and  part  of  the  study  of  rules  for  conducting  an 
election,  in  this  entry,  because  of  space  constraints, 
the  focus  is  primarily  on  ballot  aggregation  mecha¬ 
nisms.  The  entry  examines  the  nature  of  the  rules 
that  determine  electoral  outcomes,  identifies  the 
most  important  rules  used  worldwide,  considers 
alternative  approaches  to  the  study  of  electoral  sys¬ 
tems,  and  discusses  domains  of  electoral  system 
impact. 

Typologies  of  Electoral  Rules 

Elections  and  the  political  parties  that  compete  in 
them  are  at  the  heart  of  modern  democracy.  The 
study  of  elections  and  of  the  voting  rules  that  are 
their  engines  has  attracted  the  attention  of  many 
scholars  and  led  to  a  literature,  which,  in  the 
4  decades  since  the  publication  of  Douglas  Rae’s 
Political  Consequences  of  Electoral  Laws  in  1967 
(second  edition,  1971),  has  become  well  devel¬ 
oped,  both  empirically  and  theoretically.  The 
study  of  electoral  rules  and  their  effects  is  made 
easier  by  the  existence  of  variables  that  can  be  pre¬ 
cisely  defined  and  usually  made  quantifiable,  for 
example,  votes  and  seats.  Because  of  the  close  con¬ 
nections  between  the  study  of  electoral  rules  and 
the  study  of  topics  such  as  party  systems,  represen¬ 
tation,  and  constitutional  design,  success  in  under¬ 
standing  electoral  rules  and  their  effects  can  have 
what  Rein  Taagepera  has  called  a  “Rosetta  stone” 
linkage  function  for  more  general  theory  building 
in  political  science. 

But  there  are  also  very  practical  reasons  for  try¬ 
ing  to  understand  electoral  system  effects.  Of  the 
fundamental  aspects  of  constitutional  design,  for 
example,  along  continua  such  as  unitary  versus 
federal  systems  or  parliamentary  versus  presiden¬ 
tial  systems,  electoral  rules  are  the  easiest  to 
change.  In  most  countries,  electoral  rules  are  not 
constitutionally  embedded,  or  at  least  many  key 
details  are  left  for  legislative  determination.  There 
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are  myriad  different  ways  of  conducting  elections, 
and  there  is  widespread  belief,  backed  by  empirical 
evidence,  that  choice  of  electoral  rules  matters  a 
great  deal  for  who  gets  what,  when,  and  how — the 
questions  that  lie  at  the  heart  of  all  politics.  Thus, 
manipulation  of  electoral  rules  seems  an  obvious 
tool  of  institutional  engineering. 

Candidate  Eligibility/Designation  of  Candidates 

In  most  polities,  complex  rules  govern  which 
parties  and  which  candidate  names  can  appear  on 
the  official  ballot;  for  example,  there  may  be  signa¬ 
ture  requirements  for  nominating  petitions,  which 
require  a  certain  number  of  names  or  a  certain 
number  of  names  from  a  certain  number  of  differ¬ 
ent  geographic  regions,  but  some  rules  may  be 
waived  for  parties  that  have  demonstrated  a  sub¬ 
stantial  level  of  support  via  their  vote  share  in  the 
previous  election(s).  In  most  election  settings, 
however,  it  is  parties  that  designate  the  candidates 
entitled  to  appear  on  the  ballot  under  their  party 
label,  sometimes  choosing  them  by  mechanisms 
purely  internal  to  the  party’s  formal  organization, 
sometimes  by  “primary”  elections  with  voting 
restricted  to  party  members  or  those  registered  as 
party  affiliates,  sometimes  in  so-called  open  prima¬ 
ries  where  voters  can  choose  the  party  primary  in 
which  they  participate.  Thus,  it  is  often  sensible  to 
talk  not  about  the  electoral  rules  but  about  elec¬ 
toral  rules  for  a  general  election,  on  the  one  hand, 
and  electoral  rules  for  party  nominations,  on  the 
other,  especially  if  different  parties  handle  nomina¬ 
tions  in  different  ways. 

Ballot  Type 

There  are  three  main  ways  to  distinguish  types 
of  ballots:  (1)  ballot  complexity,  (2)  number  of 
rounds  of  balloting,  and  (3)  types  of  alternatives. 

Ballot  complexity  refers  to  the  kind  of  informa¬ 
tion  that  voters  are  required  to  provide.  The  sim¬ 
plest  ballot  is  one  where  voters  mark  an  X  for 
some  prespecified  number  of  alternatives,  or  for  up 
to  some  prespecified  number  of  alternatives — for 
example,  approval  voting,  where  every  voter  may 
indicate  that  up  to  M  alternatives  are  “satisfac¬ 
tory,”  with  the  M  alternatives  receiving  the  most 
“approval”  votes  being  the  ones  that  win;  or  lim¬ 
ited  voting,  where  each  voter  has  a  fixed  number, 


k,  of  X  ballots  to  cast,  where  k  is  less  than  the 
number  of  seats  to  be  filled,  M;  and  the  most  com¬ 
mon  case  where  voters  have  but  a  single  X  to  cast, 
for  example,  plurality  voting  in  a  single-seat  dis¬ 
trict.  Another  important  type  of  X  ballot  is  pure 
list  PR,  where  there  is  a  list  prepared  by  each 
party,  and  a  certain  number  of  the  top  candidates 
on  each  list  are  elected,  with  that  number  deter¬ 
mined  by  the  proportion  of  (viable)  votes  cast  for 
that  party. 

More  complex  ballots  require  voters  to  rank 
order  alternatives,  for  example,  the  Bor  da  count  or 
the  single  transferable  vote,  or  to  assign  cardinal 
numbers  to  alternatives,  for  example,  cumulative 
voting.  If  one  neglects  the  possibility  of  ties,  one 
may  think  of  the  Borda  method  as  involving  each 
voter  ranking  all  candidates  and  giving  C  —  k  votes 
to  the  candidate  the  voter  puts  in  kth  place,  where 
C  is  the  number  of  candidates  in  the  contest.  For 
each  candidate,  their  Borda  score  is  the  sum  (over 
all  voters)  of  these  Borda  counts,  with  the  candi¬ 
date  with  the  highest  Borda  score  to  be  elected. 
Under  single  transferable  vote,  voters  also  rank 
order  the  candidates.  If  there  are  M  seats  to  be 
filled,  any  candidate  who  receives  at  least  a  Droop 
quota  of  votes  (— £/[M  +1J,  where  E  is  the  size  of 
the  actual  electorate)  is  elected,  exactly  E/(M  +  1) 
of  the  ballots  in  which  that  candidate  is  at  the  top 
of  the  preference  rankings  are  removed  from  fur¬ 
ther  consideration,  and  the  votes  on  the  remaining 
ballots  on  which  that  candidate  is  at  the  top  of  the 
preference  ranking  are  reallocated  to  the  next  high¬ 
est  ranked  (still  eligible)  candidate  on  that  ballot.  If 
that  reallocation  now  gives  some  additional  candi¬ 
date  a  Droop  quota,  then  that  candidate  is  elected, 
and  we  continue  in  this  fashion  as  long  as  we  can. 
If  there  are  still  unfilled  seats,  the  candidate  with 
the  fewest  first-place  votes  is  dropped  from  eligibil¬ 
ity,  and  his  or  her  votes  are  reallocated  to  the  next 
highest  ranked  (still  eligible)  candidate  on  the  bal¬ 
lots  of  those  who  had  the  dropped  candidate  at  the 
top  of  their  preference  ranking.  This  process  con¬ 
tinues  until  all  seats  are  filled.  (If  there  are  some 
voters  who  do  not  rank  sufficiently  many  candi¬ 
dates,  their  ballots  may  never  come  into  play,  and 
so  sometimes  it  may  be  necessary  to  elect  the  last 
candidate[sj  with  less  than  a  Droop  quota.)  There 
are  many  variants  of  cumulative  voting,  with  per¬ 
haps  the  most  common  involving  the  requirement 
that  all  components  of  the  allocation  vector  that 
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must  sum  to  the  number  of  votes  each  voter  is 
entitled  to  cast  must  be  integers. 

The  two  most  common  options  for  numbers  of 
rounds  of  balloting  are  one-round  and  two-round 
ballots,  but  the  exact  number  of  rounds  required  in 
some  forms  of  election  (e.g.,  multiround  sequential 
elimination  elections  )  can  vary  with  the  distribu¬ 
tion  of  votes  among  the  various  candidates.  In  a 
multiround  sequential  elimination  system,  there  is 
some  rule  for  dropping  candidates  at  each  round  if 
no  candidate  has  received  a  majority  of  first-place 
preferences  among  the  set  of  still-viable  candi¬ 
dates,  for  example,  dropping  the  candidate  with 
the  fewest  first-place  votes  from  eligibility  and 
then  revoting  among  the  remaining  candidates. 

Type  of  alternatives  is  meant  to  distinguish 
among  the  situations  where  voters  choose  among 
party  lists  versus  voting  for  individual  candidates 
versus  some  mix  of  the  two — for  example,  open 
list  PR  systems  or  mixed  electoral  systems.  In  pure 
list  PR,  only  parties  are  objects  of  choice,  and  the 
parties  determine  and  rank  their  own  candidates 
so  that  a  party  that  wins  r  seats  will  elect  the  top  r 
candidates  on  its  list  of  candidates;  in  open  list  PR, 
in  contrast,  voters  may  also  affect  by  their  vote 
choices  (even  if  not  fully  determine)  which  indi¬ 
vidual  candidates  from  a  given  party  will  be 
elected.  A  mixed  electoral  system  is,  technically, 
simply  one  in  which  the  electoral  rule  is  not  con¬ 
stant  across  all  constituencies,  but  the  term  is  more 
commonly  used  to  refer  to  electoral  systems  that 
include  both  constituencies  in  which  voters  vote 
for  a  single  candidate  and  those  in  which  candi¬ 
dates  are  elected  by  some  form  of  proportional 
representation  (see  Matthew  Shugart  &  Martin 
Wattenberg,  2001). 

But  a  further  distinction  is  necessary  as  to  what 
information  about  the  candidate’s  party  affiliation 
voters  have  available  on  the  ballot.  Even  where 
voters  are  choosing  among  candidates  rather  than 
parties,  voters  usually  know  the  party  affiliation  of 
each  candidate  or  whether  a  candidate  is  running 
without  a  party  attachment  (i.e.,  as  an  “indepen¬ 
dent”).  Nonpartisan  elections,  that  is,  elections 
without  party  labels  for  any  of  the  candidates  on 
the  ballot,  are  relatively  rare  outside  the  United 
States,  but  in  the  United  States  such  elections  are 
the  most  common  mechanism  for  electing  local 
officials.  Such  nonpartisan  elections  are  usually 
single-round,  plurality-based  elections,  but  there  is 


a  two-round  variant  called  a  blanket  primary,  used 
in  a  few  U.S.  states,  where  the  top  two  vote  getters 
enter  into  a  second-round  runoff  if  no  candidate 
receives  an  actual  majority  of  the  votes  cast. 
However,  nonpartisanship  often  is  merely  a  form 
of  disguised  partisanship,  with  the  party  affilia¬ 
tions  of  nominees  being  an  open  secret  to  knowl¬ 
edgeable  voters. 

Constituencies 

Worldwide,  most  elections  take  place  within 
geographically  defined  constituencies.  As  Bernard 
Grofman  and  Lisa  Handley  (2008)  note, 

There  are  two  kinds  of  exceptions  to  districted 
elections,  the  at-large  election  and  the  communal 
roll.  In  at-large  elections,  the  entire  polity  is  used 
as  the  district  and  thus  there  is  no  need  ever  to 
redraw  constituency  boundaries.  Netherlands 
and  Israel,  for  example,  elect  their  national 
parliaments  using  List  PR  from  the  nation  as  a 
whole;  while  in  the  United  States,  the  majority  of 
cities  elect  city  council  representatives  city-wide. 

In  communal  rolls,  the  fundamental  basis  of 
representation  is  non-geographic:  choices  are 
made  from  candidacies  drawn  from  the  members 
of  a  given  race  or  ethnicity  or  religion.  Usually, 
but  not  always,  only  members  of  a  given 
community  will  be  eligible  to  vote  for 
representatives  from  that  community,  (p.  6, 
Footnote  3) 

But,  of  course,  even  at-large  elections,  when  they 
take  place  below  the  national  level,  are  thought  of 
as  geographically  based,  and  in  many  cases  com¬ 
munal  rolls  may  also  be  geographically  based. 

With  respect  to  districted  constituencies,  a  key 
distinction  is  between  single-member  districts 
(SMDs)  and  multimember  districts  (MMDs).  The 
specification  of  constituency  boundaries,  called 
redistricting  in  the  United  States  and  boundary 
delimitation  in  much  of  the  rest  of  the  English- 
speaking  world,  is  an  important  topic  from  both  a 
legal  and  a  theoretical  point  of  view.  For  example, 
rules  about  the  degree  of  population  equality 
required  across  constituencies  can  be  instrumental 
in  permitting  or  preventing  malapportionment, 
which,  whether  deliberate  or  unintended,  can  have 
substantial  consequences  for  the  translation  of 
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votes  into  seats  and  the  representation  of  groups 
that  differ  in  their  geographical  locations  and 
degree  of  geographic  concentration,  such  as  blocs 
of  ethnic  voters  or  party  supporters. 

Election  Timing 

In  terms  of  election  timing  there  are  two  impor¬ 
tant  distinctions.  The  first  is  between  elections  that 
take  place  at  predetermined  intervals  versus  elec¬ 
tions  that  may  be  “called”  by  those  in  power  or 
that  result  from  the  failure  of  a  vote  of  confidence 
in  the  parliament.  When  election  dates  are  not  pre¬ 
determined,  there  is  scope  for  strategy  in  the  timing 
of  elections,  and  parties  within  a  governing  multi¬ 
party  coalition  may  have  credible  threats  to  with¬ 
draw  from  the  cabinet  so  as  to  force  a  new  election. 
Such  threat  potential  can  create  policy  uncertainties 
and  ultimately  destabilize  governments. 

The  second  distinction,  involving  election  tim¬ 
ing,  is  about  the  degree  to  which  elections  of  vari¬ 
ous  types  and  at  various  levels  of  government  are 
held  concurrently.  Generally  speaking,  when  elec¬ 
tions  are  not  held  concurrently,  this  increases  the 
likelihood  that  different  policies  will  be  pursued  at 
different  levels  of  government  and  that  different 
parties  will  be  in  power  for  different  offices. 

Ballot  Aggregation 

The  process  by  which  ballots  are  turned  into 
outcomes  is  referred  to  as  ballot  aggregation  in  the 
social  choice  literature. 

The  most  frequently  made  distinction  among  bal¬ 
lot  aggregation  methods  is  between  majoritarian/ 
plurality  methods  and  methods  of  representation 
intended  to  allow  unified  blocs  of  voters  of  a  certain 
size  that  do  not  constitute  an  overall  majority  to 
nonetheless  gain  representation.  The  best  known 
form  of  minoritarian  representation  is  proportional 
representation.  The  proportionality  principle  refers 
to  the  idea  that  seat  shares  in  a  legislature  should 
reflect  vote  shares. 

The  distinction  between  SMDs  and  MMDs  is 
linked  to  this  distinction,  in  that  proportional  rep¬ 
resentation  methods  can  only  be  used  in  MMDs; 
however,  use  of  MMDs  does  not  imply  the  use  of 
proportional  representation,  since  there  can  be 
plurality  voting  in  MMDs  (sometimes  called  plu¬ 
rality  bloc  voting ),  where  each  voter  has  M  (single) 


votes  to  cast  and  where  the  M  candidates  with  the 
most  votes  are  elected,  or  limited  voting  in  MMDs, 
which,  for  k  >  1,  is  a  semiproportional  system  (see 
below). 

The  standard  way  to  conceptualize  the  contin¬ 
uum  anchored  by  majoritarian/plurality  methods, 
at  one  end,  and  proportional  representation  meth¬ 
ods,  at  the  other  end,  is  in  terms  of  the  threshold 
of  exclusion ,  which  Douglas  Rae,  Victor  Hanby, 
and  John  Loosemore  define  as  the  largest  vote 
share  a  party  can  receive  and  still  be  denied  any 
seats  in  the  district  (see  Rae,  1967/1971).  The 
threshold  of  exclusion  involves  a  hypothetical 
worst-case  scenario  in  terms  of  the  distribution  of 
votes  among  the  other  parties  that  will  minimize 
the  given  party’s  seat  share.  For  M  seat  districts,  in 
pure  PR  systems,  the  threshold  of  exclusion  is 
either  exactly  or  approximately  1/(M  +  1).  In 
other  words,  if  it  coordinates  its  votes,  a  group 
that  has  only  a  1/(M  +  l)th  share  of  the  district’s 
population  can  guarantee  to  elect  a  candidate  of  its 
choice.  As  M  gets  larger,  smaller  and  smaller 
minorities  become  a  large  enough  share  of  the  elec¬ 
torate  to  win  a  seat  for  a  candidate  of  their  choice. 
The  minimum  feasible  threshold  of  exclusion, 
which  occurs  for  PR  systems  with  the  whole  polity 
as  the  single  constituency,  is  1/(5  +  1),  where  S  is 
the  number  of  seats  in  the  parliament.  At  the  other 
end  of  the  continuum,  for  simple  plurality  and 
majority  systems,  the  threshold  of  exclusion  is 
(infinitesimally  under)  1/2,  regardless  of  M. 
Intermediate  cases  arise  for  limited  voting,  where 
voters  have  k  (single)  votes  to  cast,  and  there  are 
M  seats  to  be  filled  (k  <  M),  with  the  M  highest 
vote  getters  elected.  Here,  the  threshold  of  exclu¬ 
sion  is  k/(k  +  M).  The  closer  k  is  to  M,  the  less 
proportional  is  the  limited  voting  rule.  The  thresh¬ 
old  of  exclusion  for  SNTV,  the  single  nontransfer- 
able  vote,  which  is  another  name  for  limited  voting 
in  M  seat  districts  (M  >  1)  where  voters  have  but 
a  single  vote  to  cast,  is  identical  to  that  of  the 
d’Hondt  form  of  list  PR,  namely  1/(M  +  1). 

Within  any  given  country,  an  m-seat  district  can 
be  expected  to  have  roughly  m  times  the  popula¬ 
tion  of  a  single-seat  district.  Thus,  if  one  is  inter¬ 
ested  in  the  actual  number  of  voters  who  must 
change  their  mind  to  affect  election  outcomes,  then 
threshold  of  exclusion  values  need  to  be  adjusted 
to  take  population  differences  into  account  across 
constituencies  of  different  sizes. 
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Paralleling  the  threshold  of  exclusion,  there  is 
also  a  threshold  of  representation ,  which  is  the 
smallest  vote  share  a  party  can  receive  and  still  win 
a  seat  in  the  district  (see  Rae,  1967/1971).  The 
threshold  of  representation  involves  a  hypothetical 
best  case  scenario  in  terms  of  the  distribution  of 
votes  among  the  other  parties  that  will  maximize 
the  given  party’s  seat  share. 

The  thresholds  of  exclusion  and  representation 
are  theoretical  concepts.  In  any  given  electoral  sys¬ 
tem,  the  actual  vote  share  that  a  party  will  need  to 
be  assured  of  to  gain  representation  will  often  be 
much  less  than  the  theoretical  maximum.  Relatedly, 
as  Rein  Taagepera  (personal  communication,  June 
4,  2004)  observes,  “The  same  electoral  rules  can 
lead  to  vastly  different  disproportionality,  even  in 
the  same  country  and  even  in  consecutive  elections.” 
To  deal  with  this  problem,  scholars  commonly  cal¬ 
culate  empirical  indices  of  disproportionality. 

For  partisan  elections ,  the  two  most  common 
measures  of  overall  proportionality  are  the 
Loosemore-Hanby  Index  of  Distortion  (see  Rae, 
1967/1971), 
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with  vi  being  the  vote  share  of  the  z'th  party  and  sj 
the  seat  share  of  that  same  party;  and  the  Gallagher 
Index  (Michael  Gallagher,  1991), 
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For  two-party  competition,  another  approach 
to  measuring  disproportionality  empirically  is  in 
terms  of  what  is  called  the  swing  ratio.  Edward 
Tufte  (1973)  proposed  that,  in  two-party  legisla¬ 
tive  competition,  a  party  can  expect  to  receive  a 
share  of  seats  such  that 


log  s  _  k  log  v 
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Here,  the  closer  k  is  to  1,  the  closer  we  are,  empir¬ 
ically,  to  proportionality. 

While  the  PR  versus  plurality/majority  distinc¬ 
tion  is  certainly  important,  there  are  equally 


important  distinctions  among  the  members  of  each 
of  the  two  groupings. 

Among  PR  methods,  the  most  important  three¬ 
fold  distinction  is  between  list  PR  methods  and 
rules  that  are  based  on  preference  rankings,  most 
notably  the  single  transferable  vote  (STV,  aka  the 
Hare  system),  on  the  one  hand,  and  those  that 
involve  point  schemes,  most  notably  the  simple 
cumulative  vote  (CV),  where  each  voter  has  M 
votes  to  be  divided  (in  whole  integers)  among  can¬ 
didates  in  any  way  that  the  voter  wishes,  on  the 
other  hand.  While  CV,  like  SNTV,  is  sometimes 
called  semiproportional  since  it  requires  voter  coor¬ 
dination  to  ensure  proportionality,  this  is  a  quanti¬ 
tative  rather  than  a  qualitative  difference  between 
them  and,  say,  list  PR  methods,  and  so  both  are 
treated  under  the  general  rubric  of  PR.  The  inter¬ 
mediate  category  of  semiproportional  systems  is 
reserved  for  limited  voting,  with  1  <  k  <  M. 

There  are  also  important  distinctions  among  list 
PR  systems.  This  entry  has  already  mentioned  the 
issue  of  whether  the  system  allows  voters  to  choose 
among  candidates  rather  than  parties.  A  second, 
rather  more  technical,  distinction  is  between  meth¬ 
ods  that  can  be  characterized  as  divisor  methods 
and  methods  that  cannot  be  so  characterized,  most 
notably  the  greatest-remainder  method.  Divisor 
methods  are  methods  where  one  can  determine 
outcomes  by  taking  the  party  vote  shares  pt,  and 
dividing  each  in  turn  by  a  particular  vector  of  divi¬ 
sors  (d j,  d2, ,  dM)  and  then  allocating  seats  to 
each  party  corresponding  to  their  having  one  of  the 
M  highest  quotients  from  among  the  n  x  M  mem¬ 
ber  set  {pjdf.  For  example,  d’Hondt  (also  some¬ 
times  known  as  the  Hagenbach-Bischoff  method) 
uses  the  vector  of  divisors  (1,  2,  3,  4,  ...  ,  n),  while 
Sainte-Lague  uses  (1,  3,  5,  7, .  . . ,  2n  —  1),  and 
the  modified  Sainte-Lague  uses  (1.4,  3,  5,  7,  ... , 
In  -  1). 

For  example,  if  there  were  three  parties  of  sizes 
0.40,  0.35,  and  0.25,  respectively,  and  six  seats  to 
be  filled,  under  d’Hondt  one  would  look  at  the  set 
of  numbers  {0.40,  0.35,  0.25;  0.20,  0.175,  0.125; 
0.133,  0.117,  0.083;  etc.}  and  find  the  six  highest, 
which  would  result  in  three  seats  to  the  largest  par¬ 
ties  (from  quotients  of  0.40,  0.20,  and  0.133),  two 
seats  to  the  second  largest  party  (from  quotients  of 
0.35  and  0.175),  and  one  seat  to  the  smallest  party 
(from  a  quotient  of  0.25).  Since  the  quotient  of 
0.40/3  (-0.133)  is  higher  than  0.25/2  (=0.125), 
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the  sixth  seat  goes  to  the  largest  party  rather  than 
the  smallest  party.  To  calculate  the  greatest- 
remainder  results,  one  would  look  at  pt  X  M  and 
first  allocate  whole  quotients  to  the  parties  and 
then  allocate  the  remaining  seats  in  order  of  the 
magnitude  of  the  remainder.  For  the  same  six-seat 
example  with  three  parties  of  sizes  0.40,  0.35,  and 
0.25,  one  gets  2.4,  2.1,  and  1.5,  respectively.  Since 
the  greatest  remainder  is  0.5,  each  party  ends  up 
getting  two  seats,  despite  the  disparities  in  size 
among  the  three  parties. 

Note  that  even  though  both  d’Hondt  and  the 
greatest  remainder  are  PR  systems,  they  do  not 
always  yield  identical  allocations.  In  general, 
although  most  often  the  allocations  under  the  two 
rules  will  be  the  same,  sometimes  d’Hondt  will  be 
more  favorable  to  the  largest/larger  parties  than 
will  the  greatest  remainder.  But  not  all  divisor 
rules  produce  the  same  results,  either.  In  particu¬ 
lar,  Sainte-Lague  is  likely  to  be  in  between  the 
other  two  rules  in  terms  of  favoring  larger  over 
smaller  parties,  while  the  modified  Sainte-Lague 
was  specifically  designed  to  be  more  favorable  to 
moderately  sized  parties  than  the  unmodified 
Sainte-Lague. 

Mathematically,  divisor  methods  for  allocating 
seats  to  parties  on  the  basis  of  party  vote  shares  are 
identical  to  divisor  methods  for  allocating  seats  to 
geographic  units  on  the  basis  of  the  unit’s  share  of 
the  total  population.  The  same  method  may  go 
under  different  names  in  the  two  different  con¬ 
texts.  Thus,  the  d’Hondt  method  was  actually  first 
invented  by  Thomas  Jefferson  to  partition  seats  in 
the  U.S.  House  of  Representatives  among  the 
states.  Similarly,  the  Sainte  Lague  method  is  identi¬ 
cal  to  an  apportionment  method  devised  by  the 
American  legislator  Daniel  Webster.  Five  divisor 
methods  have  particular  centrality  because  they 
each  correspond  to  a  particular  way  of  rounding 
values  to  create  integer  allocations  of  seats  to  the 
various  parties  that  sum  up  to  M.  The  Adams 
method  corresponds  to  rounding  up;  the  Jefferson 
method  (d’Hondt)  corresponds  to  rounding  down; 
the  Webster  method  (Sainte-Lague)  corresponds  to 
rounding  up  if  the  remainder  is  greater  than  0.5; 
the  Dean  method  corresponds  to  taking  the  har¬ 
monic  mean;  and  the  Hill-Huntington  method  cor¬ 
responds  to  taking  the  geometric  mean  and  is  the 
method  that  is  currently  in  use  in  the  U.S.  House 
of  Representatives  to  determine  how  many  seats 


each  state  will  get  (subject  to  the  constraint  that 
each  state  must  have  at  least  one  seat  in  the 
House).  There  are  also  alternative  ways  to  define 
divisor  methods,  for  example,  in  terms  of  quotient 
maximization  functions.  It  should  also  be  noted 
that  the  greatest-remainder  method  is  also  called 
the  Hamilton  method  because  Alexander  Hamilton 
proposed  it  as  way  of  allocating  seats  to  the  states. 

While  the  distinctions  among  types  of  ballot 
aggregation  identified  above  are  the  ones  most 
often  studied — generating  what  Rein  Taagepera 
(2007)  refers  to  as  the  set  of  simple  electoral 
systems — they  are  only  the  tip  of  the  iceberg.  The 
devil  is  in  the  details,  and  there  are  many  other 
features  of  electoral  rules  that  can  affect  vote 
aggregation.  For  example,  there  may  be  rules  that 
allow  parties  to  combine  their  vote  share  by  enter¬ 
ing  into  preelection  alliances;  and/or  there  may  be 
specific  national,  regional,  or  district  vote  share 
thresholds  or  rules  for  geographic  distribution  that 
must  be  surmounted  before  a  party  can  receive  any 
seats;  and/or  there  may  be  complex  mixtures  of 
electoral  rules,  with  different  types  of  rules  apply¬ 
ing  in  different  constituencies  or  in  multiple  tiers  of 
representation  involving  different  levels  of  propor¬ 
tionality  and  “topping  up”  across  the  various  tiers. 

Electoral  Rule  Effects 

There  are  five  major  theoretical  perspectives  on 
how  to  estimate  the  likely  consequences  of  elec¬ 
toral  rules  or  changes  in  electoral  rules:  (1)  social 
choice  theory,  (2)  mainstream  empirical  research, 

(3)  rational  choice  and  game-theoretic  models, 

(4)  the  social  physics  perspective,  and  (5)  the 
embedded  systems  approach. 

Social  choice  approaches  are  those  that  (a)  in 
the  spirit  of  Kenneth  Arrow  (1951/1963)  study 
the  normative  properties  of  voting  rules,  for 
example,  responsiveness  to  changes  in  underlying 
voter  preferences,  the  likelihood  of  choosing  a 
candidate  who  can  receive  a  majority  over  all 
other  candidates  in  a  head-on-head  contest  (known 
in  this  literature  as  Condorcet  efficiency ),  and/or 
seek  to  characterize  voting  rules  in  axiomatic 
form;  (b)  are  concerned  about  the  structure  of 
individual  preferences,  for  example,  the  extent  to 
which  preferences  have  a  particular  form  of  unidi¬ 
mensionality  known  as  single  peakedness  or  sat¬ 
isfy  other  domain  restrictions;  and/or  (c)  for  a 


Electoral  Systems  755 


fixed  set  of  voter  preferences,  assess  the  likelihood 
that  different  voting  methods  will  yield  the  same 
outcome. 

Mainstream  empirical  research  seeks  to  measure 
the  effects  of  particular  electoral  rules,  cross-nation- 
ally  or  across  different  units  in  the  same  polity,  by 
techniques  such  as  regressing  an  outcome  variable — 
for  example,  (effective)  number  of  political  parties 
or  measures  of  proportionality  of  seats-votes  rela¬ 
tionships — against  electoral  system  features  and 
some  set  of  control  variables  (see,  e.g.,  Arend 
Lijphart,  1994).  It  is  characterized  by  large-scale 
data  analysis  of  both  survey  and  aggregate  data. 

Rational  choice  modeling  emphasizes  strategic 
behavior  on  the  part  of  voters  and  candidates/ 
parties  and  makes  extensive  use  of  game  theory. 
Customarily,  results  are  in  the  form  of  theorems 
about  how  electoral  system  effects,  in  equilibrium, 
are  determined  by  the  incentives  that  different 
rules  provide  for  the  behavior  of  voters  and  par¬ 
ties/candidates  under  different  assumptions  about 
the  motivations  ascribed  to  voters  and  the  utility 
functions  (office  seeking,  policy  seeking,  or  some 
combination  thereof)  that  are  ascribed  to  parties/ 
candidates  (see,  e.g.,  Gary  Cox,  1997).  While  the 
formal  properties  of  electoral  rules  determine  how 
“inputs,”  that  is,  completed  ballots,  will  be  con¬ 
verted  into  electoral  outcomes,  it  is  the  structure  of 
electoral  incentives  that  helps  determine  both  what 
options  will  actually  be  available  to  the  voters  on 
the  ballot  and  how  voters  will  decide  among  those 
options.  In  particular,  when  there  is  strategic  vot¬ 
ing,  such  that  voters  do  not  always  support  the 
candidate/party  they  most  prefer  if  they  do  not 
believe  that  that  candidate/party  has  a  realistic 
chance  to  be  elected,  then  great  care  must  be  taken 
in  projecting  outcomes  from  underlying  prefer¬ 
ences  and  the  nature  of  the  voting  rule. 

The  social  physics  approach,  associated  with  the 
work  of  Rein  Taagepera  (see  Taagepera  &  Matthew 
Shugart,  1989;  also  especially  Taagepera,  2007),  is 
inspired  by  the  principles  of  statistical  thermody¬ 
namics  in  physics.  It  does  not  attempt  to  predict  the 
effects  of  electoral  rules  in  individual  political  units 
but  seeks  instead  to  precisely  model  effects  on  aver¬ 
age.  Also,  it  (a)  makes  use  of  only  a  handful  of  key 
variables;  (b)  uses  functional  forms,  which  must 
yield  results  consistent  with  the  boundary  condi¬ 
tions  determining  the  range  of  feasible  outcomes; 
and  (c)  requires  that  left-hand  (dependent)  and 


right-hand  (independent)  variables  be  stated  in  a 
fashion  that  yields  dimensional  comparability. 

The  term  embedded  systems  was  introduced  in 
Grofman  (1999)  and  used  in  subsequent  work  by 
Grofman  and  coauthors,  but  many  others  use  the 
same  approach  without  calling  it  by  that  name. 
The  hallmark  of  the  embedded  systems  approach 
to  electoral  systems  is  the  notion  that  electoral  sys¬ 
tem  effects  can  be  fully  understood  only  within  the 
context  of  the  overall  constitutional,  social,  and 
party  systems  in  which  they  are  embedded.  Of  spe¬ 
cial  concern  are  (a)  seeing  how  similar  systems  can 
yield  different  outcomes  in  different  contexts; 
(b)  the  need  for  care  in  attributing  causality  to  elec¬ 
toral  system  effects  when  the  choice  of  electoral 
rules  may  be  endogenously  determined,  which 
leads  to  an  interest  in  experiments  on  voting  rules 
and  “natural  experiments”  such  as  those  that 
occur  when  different  methods  are  used  within  the 
same  polity  in  different  geographic  areas  or  at  dif¬ 
ferent  time  points;  (c)  attention  to  how  the  seem¬ 
ingly  trivial  differences  in  electoral  rules,  for 
example,  different  rules  for  nominating  candidates, 
can  have  major  consequences;  and  (d)  attention  to 
the  fact  that  it  will  take  time  until  voters,  candi¬ 
dates,  and  parties  come  to  understand  how  a  given 
set  of  electoral  rules,  embedded  as  they  are  within 
a  particular  political  context,  will  affect  outcomes. 

While  social  choice  theory  has,  historically, 
largely  limited  itself  to  studying  the  properties  of 
voting  rules  in  the  abstract,  many  political  scien¬ 
tists  and  economists  have  been  more  interested  in 
who/what  is  being  affected  by  the  choice  of  rules. 
Scholars  have  looked  at  the  implications/conse¬ 
quences  of  electoral  rules  for  voters,  parties,  and 
interest  groups  as  well  as  at  general  effects  on  gov¬ 
ernance  and  effects  on  specific  policy  outputs. 

For  voters,  there  is  an  extensive  literature  on 
how  electoral  systems  may  affect  representation  of 
geographic  areas  and  of  racial/ethnic/social  group¬ 
ings,  and  voter  turnout,  as  well  as  on  spillover 
effects,  such  as  on  voter  perceptions  of  transpar¬ 
ency  and  voter  efficacy.  For  parties,  there  is  work 
on  the  consequence  of  electoral  rules  for  party 
proliferation,  party  ideological  dispersal,  strength 
of  party  organization,  party  discipline  in  the  legis¬ 
lature,  party  coalitional  strategies,  and  of  course, 
the  relationships  between  party  vote  shares  and 
party  seat  shares  in  terms  of  the  extent  of  propor¬ 
tionality  and  of  partisan  bias.  For  governance, 
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important  questions  have  to  do  with  the  link 
between  electoral  rules  and  issues  such  as  the 
incentives  for  corruption,  the  incentives  for  local¬ 
ism  and  pork-barrel  politics,  the  durability  of  par¬ 
liamentary  governments,  and  for  ethnically  divided 
societies,  the  links  between  electoral  rules  and  civic 
unrest  and  the  strength  of  secessionist  movements. 
For  specific  policy  arenas,  hypotheses  have  been 
proposed  about  the  (independent)  effects  of  elec¬ 
toral  rules  on  the  size  of  the  welfare  system  and  the 
degree  of  economic  inequality  (see,  e.g.,  Torsten 
Persson  &  Guido  Tabellini,  2003). 

The  electoral  systems  literature  is  now  so  vast 
that  no  single  essay  can  possibly  do  justice  to  it. 
Readers  are  reminded  of  two  points:  First,  one 
must  be  attentive  to  issues  of  endogeneity  and 
interaction  effects.  Voting  rules  do  not  exist  in  a 
vacuum;  on  the  one  hand,  they  are  determined  by 
past  history  and  maintained  only  in  some  type  of 
political  equilibrium,  and  on  the  other,  their  con¬ 
sequences  are  shaped  by  the  present  political  con¬ 
text  in  which  they  are  embedded  even  though  they 
also  affect  that  context.  Second,  and  relatedly,  to 
understand  electoral  system  effects,  one  must 
understand  both  the  formal  properties  of  electoral 
rules  (including  prosaic  facts  such  as  the  maximum 
number  of  parties  elected  in  a  constituency  being 
capped  by  the  number  of  seats  that  are  up  for  elec¬ 
tion),  on  the  one  hand,  and  the  (short-run  and 
long-run)  incentives  for  voters,  candidates,  parties, 
and  interest  groups  created  by  those  rules,  on  the 
other. 

Bernard  Grofman 
University  of  California,  Irvine 
Irvine,  California,  United  States 
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Electoral  Turnout 


Electoral  turnout  is  called  an  “aggregate-level” 
concept  because  it  only  makes  sense  to  talk  of 
turnout  when  talking  of  aggregates  of  individuals. 
Yet  because  turnout  derives  from  cumulating 
individual-level  decisions,  one  must  start  with  an 
understanding  of  why  people  vote — something 
about  which  there  is  little  agreement  in  the  schol¬ 
arly  literature.  Proposed  theories  have  been  based 
on  rational,  expressive,  mobilized,  habitual,  duti¬ 
ful,  altruistic,  and  even  genetic  explanations;  see, 
for  example,  Benny  Geys  (2006)  and  James 
Fowler  and  Christopher  Dawes  (2008).  From  a 
rational  choice  perspective,  the  decision  to  vote  is 
conceptualized  as  the  result  of  a  cost-benefit  “cal¬ 
culus  of  voting”  in  which  difficulties  of  becoming 
informed  and  getting  to  the  polling  booth,  and  so 
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on,  are  compared  with  the  benefits  a  voter  expects 
if  his  or  her  preferred  party  is  enabled  to  imple¬ 
ment  promised  policies.  The  calculus  must  take 
account  of  the  probability  that  one  vote  will  affect 
an  election’s  outcome.  In  a  large  electorate,  this 
probability  is  vanishingly  small,  so  a  truly  rational 
voter  would  “free  ride”  on  the  efforts  of  others. 
The  fact  that  in  reality  many  citizens  do  vote  has 
come  to  be  called  the  “paradox  of  voting.” 

Electoral  Participation  as  Social  Behavior 

The  recognition  of  the  paradox  of  voting  led  some 
scholars  to  reinterpret  benefits  and  costs  in  terms 
that  are  unrelated  to  outcomes  of  elections  by 
assuming  that  people  are  motivated  by  social, 
ethical,  or  expressive  considerations  rather  than 
narrow  self-interest.  In  brief,  ethical  explanations 
claim  that  citizens  vote  out  of  a  sense  of  civic  duty, 
mobilization  theories  argue  that  people  are  moti¬ 
vated  by  political  campaigns,  while  expressive 
explanations  suggest  that  individuals  are  affirming 
their  (group)  identity.  Proponents  of  these  theories 
have  not,  however,  explained  why,  if  motivations 
are  unrelated  to  election  outcomes,  people  would 
be  more  likely  to  vote  when  their  vote  has  more 
impact  on  the  outcome  (e.g.,  in  a  close  race)  or 
when  more  is  at  stake  (see  below). 

An  approach  with  more  promise  of  overcoming 
the  paradox  of  voting  recognizes  that  voting  ben¬ 
efits  more  people  than  just  the  individual  voter  and 
sees  voters  as  social  beings  motivated  in  part — 
especially  if  they  are  of  an  altruistic  disposition — 
by  their  obligations  to  groups  with  which  they 
identify  and  responsive  to  the  expectations  and 
mobilizing  efforts  of  members  of  those  groups. 
This  approach  has  gained  prominence  with  recent 
research  demonstrating  that  turnout  is  subject  to 
socialization,  learning,  and  habit  formation,  as 
shown  by  Eric  Plutzer  (2002).  In  addition,  the 
value  of  this  approach  is  in  demonstrating  that 
most  voters  acquire  a  habit  of  voting  over  time  and 
that  contextual  factors  such  as  the  closeness  of  the 
race  have  a  stronger  impact  on  voters  who  have 
not  yet  acquired  the  habit  of  voting,  as  demon¬ 
strated  by  Mark  Franklin  (2004). 

The  fact  that  people  differ  in  terms  of  whether 
their  behavior  is  based  on  habit  suggests  that  much 
earlier  research  into  individual-level  explanations 
of  turnout  variations,  demonstrating  the  influence 


of  sociodemographic  factors  (such  as  education, 
income,  or  age)  and  social  factors  (such  as  home 
ownership  or  church  attendance),  were  focusing 
on  factors  that  facilitated  this  learning  process.  A 
reconceptualization  of  the  decision  to  vote  into 
long-term  (habitual)  and  short-term  (variable) 
behavior  shows  promise  of  clarifying  much  that  is 
currently  confusing  in  the  turnout  literature. 

Short-  and  Long-Term  Forces 
in  Turnout  Change 

The  same  distinction  between  short-  and  long-term 
forces  helps  explain  a  puzzle  that  has  not  received 
much  attention.  Turnout  differences  between  coun¬ 
tries  are  considerable.  In  some  countries  (e.g., 
Australia,  Belgium,  or  Malta),  virtually  everyone 
votes.  In  other  countries  (Switzerland  or  the  United 
States),  turnout  rarely  reaches  50%.  These  differ¬ 
ences  are  huge,  yet  in  established  democracies  the 
turnout  seldom  varies  by  as  much  as  5%  between 
successive  elections  of  the  same  type  (presidential 
elections  in  the  United  States,  parliamentary  elec¬ 
tions  elsewhere).  Only  over  a  longer  period 
(20  years  or  more)  do  some  countries  see  marked 
changes  in  their  turnout  levels.  This  puzzle  is 
resolved  if  one  understands  that  most  people  in 
established  democracies  at  any  given  time  have 
acquired  established  habits  of  voting  or  not  vot¬ 
ing — habits  that  condition  the  level  of  turnout 
according  to  the  ratio  of  habitual  voters  to  habitual 
nonvoters.  An  electorate  consisting  mainly  of 
habitual  voters  will  regularly  produce  a  high  turn¬ 
out,  whereas  an  electorate  with  more  nonvoters 
will  produce  lower  turnout,  and  these  levels  of 
turnout  will  be  quite  stable  in  the  short  run.  Any 
long-term  change  in  turnout  levels  must  come  from 
new  voters  whose  behavior  is  not  yet  habitual.  If 
the  proportion  of  new  voters  who  acquire  the  habit 
of  voting  changes  and  remains  at  a  different  level 
(higher  or  lower),  then  long-term  change  will  arise 
from  natural  processes  of  generational  replace¬ 
ment.  Such  processes  are  slow  but  inexorable 
because  an  electorate  will  be  entirely  replaced  with 
new  voters  over  the  course  of  about  50  or  60  years. 

What  would  lead  new  voters  to  acquire  the  habit 
of  voting  at  a  different  rate  than  their  elders  did?  To 
answer  this  question,  one  needs  to  turn  to  explana¬ 
tions  of  turnout  variations  that  focus  on  things 
about  elections  rather  than  things  about  individuals. 
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Election-Level  Explanations 

Changes  in  the  level  of  turnout  between  countries 
and  over  time  are  not  due  to  changes  in  individ¬ 
ual-level  characteristics.  Increases  in  the  propor¬ 
tion  of  university-educated  or  high-income  indi¬ 
viduals  (two  characteristics  strongly  associated 
with  the  acquisition  of  habitual  voting)  do  not 
result  in  rising  turnout.  The  United  States  and 
Switzerland  are  both  rich  countries  with  very  high 
levels  of  education,  yet  their  turnout  is  lowest  of 
all  established  democracies;  and  in  both  countries, 
turnout  fell  even  as  wealth  and  education  increased. 
Moreover,  countries  that  see  high  turnout  do  not 
have  markedly  different  distributions  of  these  and 
other  social  characteristics  in  their  electorates.  So 
there  must  be  something  else  that  can  motivate 
everyone — even  the  poorest  and  least  educated — 
to  vote  in  certain  circumstances,  quite  overrid¬ 
ing  the  influence  of  individual-level  factors — 
something  that  must  be  weaker  or  even  absent  in 
other  circumstances. 

Scholars  agree  that  this  “something”  relates  to 
what  is  at  stake  in  an  election.  In  countries  where 
much  is  at  stake,  everybody  votes.  Where  the 
stakes  are  lower,  so  is  turnout.  Research  has 
pointed  to  several  factors  that  raise  the  stakes  of 
an  election — primarily,  the  institutional  and  com¬ 
petitive  context  (see  Andre  Blais,  2006;  Benny 


Geys,  2006).  To  indicate  the  nature  and  impor¬ 
tance  of  the  more  potent  contextual  factors, 
Table  I  displays  their  effects  in  22  established 
democracies  between  1945  and  1999.  The  right- 
hand  column  shows  the  maximum  extent  of  turn¬ 
out  differences  found  to  be  associated  with  each 
factor,  comparing  elections  with  the  lowest  stakes 
in  terms  of  that  factor  to  elections  with  the  highest 
stakes. 

These  influences  are  ordered  in  terms  of  their 
importance  in  the  long-term  perspective  (i.e.,  after 
allowing  time  for  generational  replacement  to 
reveal  their  full  effects),  with  the  most  influential 
characteristic  being  “policy  responsiveness.”  This 
influence  is  emblematic  because  it  relates  so  clearly 
to  the  policy  stakes  of  an  election:  Some  political 
systems  have  separated  powers  or  other  obstacles 
that  stand  in  the  way  of  a  majority  enacting  its 
policies,  reducing  the  extent  to  which  voters  can 
easily  estimate  the  policy  consequences  of  an  elec¬ 
tion.  “Electorate  size”  is  almost  as  important, 
picking  up  additional  aspects  of  policy  responsive¬ 
ness  since  larger  countries  are  more  often  subdi¬ 
vided  by  means  of  federal  or  quasi-federal  arrange¬ 
ments.  Electorate  size  also  dilutes  the  power  of 
each  individual  vote.  “Largest  party  size”  is  a 
measure  of  legislative  power.  A  party  that  reaches 
the  50%  threshold  can  govern  alone;  one  that 
comes  close  can  dominate  any  coalition  of  which  it 


Table  I  Maximum  Effect  of  Differences  in  the  Stakes  of  an  Election 


Influence 

Range  of  Values 

Maximum  Turnout  Difference 

Due  to  This  Influence  (%) 

Policy  responsiveness 

0—3  units 

16.0 

Electorate  size 

1—280  million 

12.3 

Compulsory  voting 

0,  1  =  yes 

11.3 

Largest  party  size 

33-50%a 

7.0 

Mean  margin  across  districts 

0-10% 

5.7  b 

Margin  of  victory  nationwide 

0-10% 

5.3  b 

Party  cohesiveness 

0,  1  =  yes 

4.2 

Sum  of  differences 

56.5  b 

Source:  Adapted  from  Table  5.2  in  Franklin,  M.  N.  (2004).  Voter  turnout  and  the  dynamics  of  electoral  competition  in 
established  democracies  since  1945.  New  York:  Cambridge  University  Press. 

a.  Maximum  limited  to  50%,  the  threshold  for  majority  status. 

b.  Margin  and  mean  margin  operate  in  different  countries,  so  one  is  zero  when  the  other  is  not. 
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forms  a  part,  yielding  confidence  that  it  will  enact 
its  policy  promises  (other  factors  permitting).  One 
obvious  prerequisite  is  “party  cohesiveness”  at  the 
foot  of  the  table — that  is,  the  extent  to  which  par¬ 
ties  enjoy  disciplined  support  in  the  country’s  leg¬ 
islature.  Two  alternative  measures  of  margin  of 
victory  are  shown,  depending  on  whether  votes  are 
cast  in  districts  where  the  “winner  takes  all”  or 
seats  are  assigned  proportionately  to  votes.  Both 
measures  tap  the  extent  to  which  an  election  is  a 
close  one,  in  which  every  vote  counts.  “Compulsory 
voting”  has  a  rather  different  influence,  increasing 
the  stakes  of  an  election  by  threatening  sanctions 
(real  or  imagined)  for  failure  to  vote.  In  practice, 
this  influence  has  been  shown  to  have  most  effect 
where  turnout  would  otherwise  be  lowest,  effec¬ 
tively  trumping  all  other  influences  on  turnout 
(Arend  Lijphart,  1997). 

The  final  coefficient  at  the  bottom  of  the  table 
shows  the  long-term  difference  in  turnout  that 
might  be  observed  empirically  between  a  hypo¬ 
thetical  election  in  which  all  stakes  are  at  their 
maximum  and  one  where  all  stakes  are  at  their 
minimum.  It  is  something  of  an  underestimate 
because  several  small  influences  have  been  omitted 
for  simplicity.  However,  no  established  democracy 
in  practice  has  elections  characterized  by  minimum 
possible  stakes,  so  the  hypothetical  total  difference 
of  56%  is  not  an  unrealistic  maximum. 

In  new  or  transitional  democracies,  greater 
swings  in  turnout  can  occur  than  generally  seen  in 
established  democracies — presumably  because  in 
new  democracies  no  one  has  yet  had  time  to 
acquire  habitual  behavior  patterns.  Indeed,  the 
more  volatile  turnout  seen  in  such  countries 
serves  to  reinforce  our  conclusions  about  the  role 
of  habitual  behavior  in  turnout  change. 
Nevertheless,  little  is  known  about  how  the 
mechanics  of  turnout  change  in  established 
democracies  play  out  in  new  democracies  (Andre 
Blais  &  Agnieska  Dobrzynska,  1998;  Pippa 
Norris,  2002).  So  the  study  of  new  and  transi¬ 
tional  democracies  remains  an  important  avenue 
for  future  research. 

Further  research  is  also  needed  to  test  compet¬ 
ing  explanations  of  individual-level  behavior.  At 
the  aggregate  level,  more  research  is  needed  on 
new  democracies  as  well  as  research  connecting 
the  aggregate  context  of  elections  to  individual 
voter  behavior.  The  methodological  advances 


being  made  in  time  series  and  multilevel  analysis 

should  help  answer  the  remaining  puzzles. 
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See  also  Civic  Participation;  Democratization;  Electoral 
Behavior;  Electoral  Systems;  Multilevel  Analysis; 
Participation;  Rational  Choice;  Time-Series  Analysis 

Further  Readings 

Blais,  A.  (2006).  What  affects  voter  turnout?  Annual 
Review  of  Political  Science ,  9,  111-125. 

Blais,  A.,  &  Dobrzynska,  A.  (1998).  Turnout  in  electoral 
democracies.  European  Journal  of  Political  Research, 
33,  239-261. 

Fowler,  J.  H.,  &  Dawes,  C.  T.  (2008).  Two  genes  predict 
voter  turnout.  Journal  of  Politics,  70(3),  579-594. 

Franklin,  M.  N.  (2004).  Voter  turnout  and  the  dynamics 
of  electoral  competition  in  established  democracies 
since  1945.  New  York:  Cambridge  University  Press. 

Geys,  B.  (2006).  “Rational”  theories  of  voter  turnout:  A 
review.  Political  Studies  Review,  4,  16-35. 

Lijphart,  A.  (1997).  Unequal  participation:  Democracy’s 
unresolved  dilemma.  American  Political  Science 
Review,  91,  1-14. 

Norris,  P.  (2002).  Electoral  engineering:  Voting  rules  and 
political  behavior.  New  York:  Cambridge  University 
Press. 

Plutzer,  E.  (2002).  Becoming  a  habitual  voter:  Inertia, 
resources,  and  growth  in  young  adulthood.  American 
Political  Science  Review,  96,  41-56. 


Elites 


Elites  are  small  groups  of  persons  who  exercise 
disproportionate  power  and  influence  in  social 
domains.  It  is  customary  to  distinguish  between 
political  elites ,  whose  locations  in  powerful  insti¬ 
tutions,  organizations,  and  movements  enable 
them  to  shape  or  influence  political  outcomes, 
often  decisively,  and  cultural  elites ,  who  enjoy  a 
high  status  and  influence  in  nonpolitical  spheres 
such  as  arts  and  letters,  philanthropy,  professions, 
and  civic  associations.  At  the  national  level,  polit¬ 
ical  elites  number  only  a  few  thousand  persons  in 
all  but  the  largest  countries,  whereas  the  makeup 


760  Elites 


of  cultural  elites  is  more  indeterminate  and  turns 
on  the  nonpolitical  spheres  regarded  as  conse¬ 
quential  in  a  society.  Taking  their  lead  from 
Gaetano  Mosca  (1856-1941),  Vilfredo  Pareto 
(1848-1923),  and  Robert  Michels  (1876-1936), 
political  scientists  focus  on  political  elites  (simply 
elites  hereafter),  investigating  how  their  actions 
and  beliefs,  social  profiles,  and  overall  configura¬ 
tions  affect  political  regimes  and  policies.  Below, 
various  aspects  of  elite  formation,  autonomy, 
structure,  transformation,  and  their  analysis  in 
political  science  are  discussed. 

Mosca,  Pareto,  and  Michels  portrayed  elite  for¬ 
mation  as  inescapable  in  modern  societies  and  as 
imposing  limits  on  what  is  possible  in  politics. 
They  maintained,  for  example,  that  genuine  demo¬ 
cratic  systems  are  impossible  because  there  are 
always  self-interested  elites  who  out-organize  and 
outwit  the  demos.  The  most  that  can  be  hoped  for, 
in  their  view,  is  a  relatively  liberal  but  still  quite 
unequal  order  led  by  capable,  cooperative,  and 
enlightened  elites.  But,  they  noted,  elites  in  most 
societies  and  times  fall  well  short  of  these  attri¬ 
butes,  so  politics  usually  involve  fierce  power 
struggles  between  ambitious,  blinkered,  and  inse¬ 
cure  elites. 

The  research  literature  on  elites  is  large. 
Especially  prominent  are  studies  of  parliamentary 
and  cabinet-level  politicians;  high  civil  servants; 
and  the  owners,  CEOs,  and  board  members  of 
major  business  firms.  Studies  of  military,  trade 
union,  mass  media,  pressure  group,  religious,  and 
social  movement  elites  are  also  numerous.  There  is, 
in  addition,  a  substantial  body  of  historical  research 
uncovering  the  makeup  and  role  of  elites  in  pivotal 
events  and  processes  of  economic,  political,  and 
social  change.  Researchers  identify  elites  in  terms 
of  who  holds  the  most  important  formal  positions 
in  a  society,  who  has  pronounced  reputations  for 
political  power  and  influence,  or  who  participates 
in  making  key  decisions  and  policies.  Each  identifi¬ 
cation  method  tends  to  produce  a  distinctive  elite 
configuration.  Concentrating  on  positions  high¬ 
lights  the  differentiation  of  elites  by  sectors  (poli¬ 
tics,  business,  government  administration,  etc.); 
focusing  on  reputations  often  identifies  a  cohesive 
and  small  “power  elite”  sitting  astride  diverse  sec¬ 
tors;  and  studying  participation  in  decisions  and 
policies  most  often  yields  an  array  of  specialized 
elite  groups.  To  overcome  these  identification 


peculiarities,  researchers  frequently  use  more  than 
one  method  and  combine  the  results. 

Once  elites  are  identified,  how  their  positions 
overlap  and  interlock  can  be  analyzed,  and  data  on 
their  social  background,  education,  and  careers  can 
be  compiled  from  public  registers  and  question¬ 
naires;  also,  elite  persons’  opinions  and  perceptions, 
as  well  as  their  informal  interactions  and  networks, 
may  be  solicited  through  interviews.  A  perennial 
question  is  the  degree  to  which  collective  elite  pro¬ 
files  mirror  those  of  nonelite  publics  and,  thus,  the 
extent  to  which  elites  can  be  supposed  to  reflect  and 
represent  such  publics.  Other  important  questions 
include  the  extent  to  which  elites  are  autonomous 
actors,  how  their  compositions  and  behavior  vary 
among  societies,  and  how  elites  change,  gradually 
or  suddenly,  in  these  and  other  respects. 

Elites  and  Nonelites 

Research  shows  that  in  their  social  background, 
education,  and  occupations,  elites  are  almost 
always  more  privileged  than  cross  sections  of  non¬ 
elite  populations.  They  come  prevailingly  from 
well-off  families,  some  of  whose  members  may 
have  held  elite  positions  in  preceding  generations. 
The  frequency  with  which  they  hold  university 
degrees — often  from  “elite”  institutions — far 
exceeds  the  distribution  of  such  education  among 
nonelite  publics.  Elites  also  come  disproportion¬ 
ately  from  high-status  occupations — lawyers, 
teachers,  and  public  or  private  sector  managers. 
There  are  exceptions,  of  course.  Significant  pro¬ 
portions  of  civil  service  elites  have  career  graphs 
that  begin  in  lowly  positions  and  involve  long 
climbs  to  the  top.  Self-made  entrepreneurs  are  cer¬ 
tainly  not  unknown  among  business  elites. 
Historically,  the  elites  of  trade  unions  and  various 
social  movements  exhibited  a  modest  background 
and  education,  although  today  in  industrialized 
countries,  they  are  predominantly  middle  class  in 
background  and  university  educated.  Elite-level 
politicians  are  increasingly  involved  in  full-time 
careers  in  or  close  to  politics,  often  starting  as  stu¬ 
dent  political  leaders  and  then  serving  on  leading 
politicians’  staffs  or  holding  paid  positions  in  par¬ 
ties,  or  they  come  from  politically  relevant  careers 
in  journalism,  public  relations,  and  policy  think 
tanks.  Historically,  elites  have  consisted  almost 
entirely  of  men,  and  men  continue  to  outnumber 
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women  greatly  in  most  elite  sectors.  In  multi-ethnic 
or  multiracial  societies,  elite  persons  usually  belong 
to  the  largest  or  otherwise  dominant  ethnic  or 
racial  population  segment.  Finally,  in  average  age, 
elites  tend  to  be  significantly  older  than  nonelites. 

Research  shows,  however,  that  these  differences 
between  elites  and  nonelites  are  gradually  being 
reduced.  For  example,  research  on  the  social,  edu¬ 
cational,  and  occupational  profiles  of  parliamen¬ 
tary  elites  in  11  European  countries  from  the  mid¬ 
dle  of  the  19th  century  to  the  present  reveals  a  long 
trend  toward  less  exclusive  and  privileged  profiles. 
Changes  in  the  gender  makeups  of  elites  are  also 
now  quite  evident.  It  is  possible,  moreover,  that  the 
preponderance  of  bureaucratic  and  service  work  in 
many  contemporary  societies  is  intermingling  elites 
and  nonelites  in  important  ways.  Both  perform 
essentially  similar  nonmanual  work  tasks  and  not 
infrequently  rub  shoulders  in  offices.  Because  elites 
now  more  frequently  ascend  to  their  positions 
from  nonelite  origins  than  in  the  rigidly  stratified 
societies  of  earlier  times,  not  a  few  see  themselves 
as  one  of  a  kind  with  nonelites,  among  whom  they 
have  intimate  personal  associates  and  for  whom 
they  have  considerable  empathy.  Such  closer  and 
more  empathetic  ties  may  dispose  elites  toward 
actions  that  better  reflect  and  represent  nonelite 
desires  and  interests.  There  is,  furthermore,  a 
debate  about  the  degree  to  which  the  backgrounds 
of  elites  translate  into  opinions  and  actions  that 
disregard  nonelites.  Some  scholars  argue  that  there 
is  at  most  a  weak  linkage  between  elite  back¬ 
grounds  and  elite  opinions  and  actions.  Faced  with 
pressing  and  constantly  changing  organizational 
and  political  imperatives,  elites  hold  views  and 
take  actions  that  resonate  poorly,  if  at  all,  with 
their  life  histories.  Elite  backgrounds  may  reveal 
more  about  the  distribution  of  opportunities  in  a 
society  than  why  its  elites  think  and  act  as  they  do. 

Elite  Autonomy 

Elites  seldom  enjoy  complete  autonomy.  To  carry 
out  major  initiatives  and  perpetuate  their  holds  on 
power,  elites  need  nonelite  support.  To  win  it, 
elites  frame  appeals  that  accord,  at  least  roughly, 
with  nonelite  interests  and  political  orientations. 
It  is  reasonable  to  suppose  that  nonelite  interests 
and  orientations  are  not  created  by  elites  but 
derive  from  combinations  of  human  nature,  life 


circumstances,  and  beliefs,  although  elites  often 
activate  or  muffle  nonelite  interests  and  orienta¬ 
tions  through  well-couched  appeals.  Failure  to 
win  nonelite  support  frequently  shortens  elite 
tenures  or  undermines  elite  effectiveness.  Still, 
elites  normally  have  a  range  of  choices  that  are 
decisive  for  political  outcomes.  In  political  crises, 
for  example,  opposing  elite  camps  and  factions 
may  choose  to  enter  into  accommodations  or  per¬ 
sist  in  lethal  oppositions.  During  transitions  to 
democracy,  ascendant  factions  may  choose  to 
embrace  or  merely  feign  democratic  practices.  In 
well-instituted  democracies,  elites  may  proffer 
outcomes  they  know  to  be  impossible  or  resort  to 
obfuscations  that  enable  them  to  avoid  responsi¬ 
bility  for  what  happens. 

How  much  autonomy  elites  have — and  should 
have — are  questions  that  have  long  dogged  discus¬ 
sions  of  “democratic  elitism.”  In  his  seminal  book 
Capitalism,  Socialism  and  Democracy,  Joseph 
Schumpeter  (1883-1950)  held  that  democracy  is 
simply  a  method  by  which  voters  select  governing 
leaders  and  elites,  who  should  then  be  left  alone  to 
get  on  with  the  business  of  governing.  For 
Schumpeter,  in  other  words,  democracy  combines 
autonomous  governance  by  leaders  and  elites  with 
time -limited  mandates  to  govern  given  by  the 
demos  in  periodic  elections.  However,  many  critics 
of  democratic  elitism  claim  that  this  too  blithely 
assumes  leaders  and  elites  to  be  creative  and 
responsible  actors  who  can  safely  be  entrusted 
with  autonomy.  Whether  the  autonomy  of  leaders 
and  elites  is  increasing  or  decreasing  in  contempo¬ 
rary  democracies  is  a  major  empirical  question  in 
political  science,  and  its  desirability  is  a  bone  of 
contention  in  normative  democratic  theory. 

Variation  in  Elite  Structure  and  Values 

It  is  obvious  in  modern  history  and  today’s  world 
that  elites  differ  importantly  in  their  structures  and 
values.  First,  elites  located  in  functionally  differen¬ 
tiated  societal  sectors  display  different  modes  of 
organization  and  different  aims.  Much  research  on 
the  careers,  outlooks,  and  skills  peculiar  to  elites  in 
specific  sectors,  as  well  as  the  ease  or  difficulty 
with  which  elite  persons  traverse  sector  boundaries 
or  hold  interlocked  positions  in  two  or  more  sec¬ 
tors,  assesses  the  extent  to  which  elites  are  plural¬ 
istic  or  monolithic  in  their  structures  and  values. 
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Second,  there  are  efforts  to  capture  different  types 
of  national  elites  when  considered  as  wholes. 
These  efforts  risk  being  too  simplistic,  yet  the  com¬ 
parative  study  of  elites  cannot  escape  the  need  for 
a  broad  typology.  Typologies  that  have  been  put 
forth  concentrate  on  the  extent  of  structural  inte¬ 
gration  and  the  extent  of  value  consensus. 
Structural  integration  involves  the  relative  inclu¬ 
siveness  of  formal  and  informal  networks  of  com¬ 
munication  and  influence  among  elite  members. 
Value  consensus  involves  their  relative  agreement 
about  the  worth  of  existing  political  institutions 
and  a  code  of  political  behavior. 

Comparing  national  elites  along  these  dimen¬ 
sions,  a  disunited  type  and  two  types  of  united 
elites  can  be  distinguished,  although  there  are  dif¬ 
ficult  borderline  cases.  In  the  disunited  type,  elite 
groups  are  clearly  divided  and  separated  from  each 
other,  they  disagree  fundamentally  about  the 
worth  of  existing  institutions,  and  they  adhere  to 
no  single  code  of  behavior.  This  is  still  the  most 
common  type  of  elite  in  the  modern  world.  It  is 
usually  associated  with  authoritarian  regimes  in 
which  one  elite  group  or  camp  has  the  upper  hand 
and  governs  ruthlessly.  A  disunited  elite  may  also 
be  associated  with  illiberal  democracies,  in  which 
competitive  elections  take  place  but  are  so  ridden 
with  fraudulent  practices  that  few  elites  and  citi¬ 
zens  accept  their  outcomes  as  legitimate. 

Disunited  elites  have  originated  principally  in 
processes  of  national  state  formation,  during  which 
elites  in  one  territory  forcibly  annex  and  subjugate 
other  territories  and  elites.  The  resulting  conflicts 
and  hatreds  have  most  often  produced  deeply  dis¬ 
united  elites  in  new  national  states.  There  are  many 
examples:  the  animosities  and  bitter  memories  that 
followed  the  formation  of  national  states  in  Europe 
between  the  16th  and  19th  centuries,  the  “man  on 
horseback’s”  (caudillos)  control  of  Latin  American 
states  from  their  formations  early  in  the  19th  cen¬ 
tury  until  long  into  the  20th  century,  nearly  all 
African  states  under  alternating  military  and  civil¬ 
ian  rule  since  their  emergence  in  the  1950s  and 
1960s,  and  the  creation  of  postcolonial  states  by 
dynastic  ruling  elites  in  the  Middle  East. 

In  one  of  the  two  types  of  united  elites,  by  con¬ 
trast,  elite  structure  is  sharply  centralized  in  a 
party,  theocratic,  or  ethno-nationalist  state.  There 
is  a  secular  ideology,  religious  doctrine,  or  ethno- 
nationalist  creed  that  is  defined  by  the  uppermost 


political  leaders  and  professed  publicly  by  all  or 
nearly  all  elite  persons.  Much  less  common  than 
the  disunited  type  of  elite,  examples  of  this  “ideo¬ 
logically  united”  type  are  the  Soviet  elite  from 
1922  until  the  mid-1980s,  Germany’s  elite  during 
the  Third  Reich,  the  North  Korean  elite  under  Kim 
Il-sung  and  his  son  Kim  Jong-il,  China’s  elite  dur¬ 
ing  Mao  Zedong’s  rule  and  after,  the  Cuban  elite 
under  Fidel  Castro,  the  Viet  Minh  elite  since  gain¬ 
ing  control  of  North  Vietnam  in  1954  and  all  of 
Vietnam  in  1975,  the  Khmer  Rouge  elite  in 
Cambodia  (until  1978),  and  the  Iranian  elite  from 
the  time  of  Ayatollah  Khomeini’s  ascendancy 
(1979)  and,  arguably,  until  the  present.  As  these 
examples  indicate,  the  ideologically  united  type  of 
elite  most  often  originates  in  a  revolutionary  or 
anticolonial  war  that  culminates  in  the  victory  and 
dominance  of  an  especially  doctrinaire  elite  group. 

Disunited  and  ideologically  united  elites  are 
clearly  incompatible  with  stable  liberal  democra¬ 
cies,  whereas  a  second  type  of  united  elite — what 
might  be  labeled  a  “consensually  united”  elite — 
appears  to  be  a  precondition  for  them.  In  this  sec¬ 
ond  type,  elite  structure  consists  of  formal  and 
informal  networks  of  influence  and  acquaintance 
that  are  most  dense  within  the  functionally  differ¬ 
entiated  sectors  mentioned  above.  But  these  net¬ 
works  overlap  and  interlock  to  form  web  works 
and  “inner”  or  “central”  circles  through  which 
sector  elites  are  tied  together  and  obtain  access  to 
key  political  decision  makers.  As  well,  the  mem¬ 
bers  of  this  second  type  of  united  elite  share  a 
voluntary,  mostly  tacit  consensus  about  political 
norms  and  practices,  the  hallmark  of  which  is 
keeping  political  competition  restrained  and  non¬ 
violent.  Factions  recognize  each  other’s  right  to  be 
heard,  they  agree  to  disagree  when  decisions  can¬ 
not  be  reached,  they  emphasize  technical  and  pro¬ 
cedural  feasibilities  rather  than  ultimate  rights  and 
wrongs,  and  they  practice  enough  secrecy  to  have 
flexibility  when  bargaining  and  fashioning  com¬ 
promises  on  difficult  policy  issues. 

Like  ideologically  united  elites,  consensually 
united  elites  are  relative  rarities  in  the  modern 
world.  They  appear  to  originate  in  only  two  or 
three  ways.  One  is  through  a  deliberate,  sudden, 
and  basic  “settlement”  of  costly  and  long-stand¬ 
ing  oppositions,  as  occurred  between  Tory  and 
Whig  elite  camps  in  England’s  Glorious  Revolution 
during  1688  to  1689  and  between  Franquist  and 
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anti -Franquist  elite  camps  in  Spain  during  1977  to 
1978.  A  second  way  has  been  through  a  long  expe¬ 
rience  of  relatively  benign  colonial  home  rule  dur¬ 
ing  which  settler  or  “native”  elites  learn  to  practice 
cautious  and  restrained  politics  and  eventually 
make  a  unifying  effort  to  gain  independence — the 
origin  of  consensually  united  elites  in  the  former 
British  colonies  of  Australia,  Canada,  India, 
Jamaica,  New  Zealand,  and,  not  least,  the  United 
States.  A  third  way  has  been  the  gradual  conver¬ 
gence  of  camps  in  a  disunited  elite  toward  con¬ 
sensual  unity  when  spreading  prosperity  and 
agreement  about  basic  democratic  rights  and 
principles,  which  erodes  the  postures  taken  by 
radically  dissident  camps,  as  happened  in  France 
and  Italy  during  the  1960s  and  1970s  and 
Germany  and  Japan  during  the  1970s  and  1980s. 
It  must  be  noted,  however,  that  basic  settlements 
between  opposed  elites  are  highly  contingent  and 
have  been  historically  relatively  rare  occurrences. 
The  further  consolidation  and  formation  of  elite 
consensus  in  the  more  recent  “Third  Wave” 
(Huntington)  democracies  remains  to  be  seen. 

Elite  Circulation  and  Transformation 

How  elites  change  in  their  composition  and 
behavior  is  a  major  question.  Mosca  and  Pareto 
emphasized  elites’  need  to  replenish  their  ranks 
through  constant  but  gradual  and  peaceful  inflows 
and  outflows  of  members.  Pareto  also  toyed  with 
a  more  dramatic  process,  in  which  cunning  and 
foxlike  elites  are  replaced,  gradually  or  suddenly, 
by  belligerent  and  lion-like  elites,  who  are,  how¬ 
ever,  in  turn  eventually  displaced  by  new  foxlike 
elites  in  an  endless  cycle.  In  this  context,  Pareto 
famously  depicted  revolutions  as  amounting  to 
little  more  than  circulations  of  ruling  elites. 
Processes  of  elite  circulation  and  transformation 
appear  to  vary  according  to  the  basic  type  of  elite 
that  exists  in  a  society.  Where  this  is  a  disunited 
elite,  sudden  and  violent  displacements  of  ruling 
elite  camps  and  cliques  tend  to  occur  frequently. 
Through  a  coup  or  elite-led  popular  uprising,  one 
camp  or  clique  seizes  and  holds  power  until  the 
next  coup  or  uprising.  Modern  history  is  replete 
with  such  violent  but  essentially  garden-variety 
elite  circulations.  More  rarely,  the  ascendant 
camp  in  a  disunited  elite  may  so  mismanage  its 
rule  or  suffer  so  many  misfortunes  that  the  state 


and  its  coercive  forces  collapse,  opening  the  door 
to  a  revolutionary  interregnum  and  the  victory  of 
a  previously  peripheral  but  tightly  organized  and 
doctrinaire  elite  group.  Where  this  happens,  one 
may  speak  of  a  transformation  from  the  disunited 
to  the  ideologically  united  type.  The  classic 
example  is  the  collapse  of  the  war-ravaged  Tsarist 
state  and  the  ensuing  revolutionary  triumph  of 
the  tiny  Bolshevik  elite  in  Russia  between  1917 
and  1921. 

Where,  as  in  the  Soviet  Union  from  1922,  an 
ideologically  united  elite  holds  sway,  elite  circula¬ 
tion  occurs  more  or  less  exclusively  by  jockeying 
for  high  positions  in  a  crushingly  dominant  party 
or  movement.  Eventually,  however,  the  imposed 
and  “official”  ideology  or  other  dogma  of  that 
party  or  movement  may  become  so  threadbare  and 
at  odds  with  societal  developments  that  its  elite 
unravels  or  implodes.  The  ideologically  united  elite 
is  transformed  into  a  disunited  elite,  different  in 
makeup  but  not  in  basic  type  from  the  disunited 
elite  that  preceded  the  earlier  revolutionary  trans¬ 
formation.  This  was  the  fate  of  the  Soviet  Union’s 
ruling  elite  some  70  years  after  it  took  power.  Its 
implosion  created  disunited  elites  in  Russia  and  all 
of  the  newly  independent  states  that  had  consti¬ 
tuted  the  Soviet  Union.  A  question  at  present  is 
whether  similar  implosions  of  ideologically  united 
elites  will  occur  in  China,  Cuba,  Iran,  North 
Korea,  Vietnam,  and  perhaps  Iraq. 

Consensually  united  elites  appear  to  be  immune 
to  such  dramatic  circulations  and  transforma¬ 
tions.  Sudden  and  irregular  seizures  of  governing 
power  by  competing  elite  camps  and  factions  do 
not  occur,  and  the  historical  record  indicates  that, 
once  formed,  a  consensually  united  elite  strongly 
tends  to  persist  over  long  periods:  in  England/ 
Britain  since  1689,  the  United  States  since  1789, 
Sweden  since  1809,  Canada  since  1867,  Australia 
since  1901,  India  since  1947,  inter  alia.  Elite  cir¬ 
culation  occurs  gradually  through  periodic  elec¬ 
tion  contests  and  complex  career  and  other  hier¬ 
archical  recruitment  structures  in  which  “merit” 
is  emphasized  and  rewarded,  even  if  mainly  in  the 
eyes  of  those  who  are  successful.  This  circulation 
is  not  inconsequential,  however.  Modern  com¬ 
munication  technologies  so  heighten  elite  organi¬ 
zation  and  mobilization  of  pubic  support  that 
electoral  competition  and  policy  disputes  increas¬ 
ingly  approximate  battles  between  armada-like 
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coalitions  of  elites,  whose  alternating  victories 
bring  sharp  changes  in  governing  style  and  policy 
content. 

Elites  and  Political  Science 

The  study  of  elites  confronts  political  science  with 
difficult  problems.  First,  elite  behavior  always  con¬ 
tains  enough  arbitrary  and  capricious  elements  to 
defy  deterministic  explanation.  A  theory  of  elites 
and  politics  must  allow  considerable  leeway  for 
unpredictable  elite  choice,  and  its  explicative 
claims  must,  therefore,  be  modest.  This  means, 
second,  that  a  theory  should  focus  principally  on 
what  is  unlikely  or  impossible  in  politics — on  what 
the  inescapability  of  elites  and  their  relatively 
autonomous  actions  rule  out.  Yet  much  political 
science  regards  political  possibilities — especially 
possibilities  for  making  people  more  free  and 
equal,  more  enlightened  and  participative — as 
essentially  open-ended. 

Third,  the  secrecy  that  elites  routinely  practice 
is  at  odds  with  the  strong  empirical  bent  of  politi¬ 
cal  science.  Compiling  more  data  on  elite  back¬ 
grounds  and  other  aspects  of  their  profiles,  con¬ 
ducting  more  surveys  of  usually  evasive  elite 
opinions  and  momentary  network  locations,  and 
manipulating  such  data  with  rigorous  statistical 
techniques  are  unlikely  to  bring  political  scientists 
appreciably  closer  to  a  robust  understanding  of 
elites  and  politics.  In  short,  elites  often  do  not  lend 
themselves  to  empirical  and  inductive  research. 
Capturing  what  elites  do  and  why  they  do  it 
requires  a  deductive  and  rationalistic  theory  that 
must  be  tested  against  observed  events.  Do  events 
occur  as  they  would  if  the  theory  were  true?  Does 
the  theory  make  better  sense  of  what  happens 
politically  than  other  theories?  If  it  does,  are  there 
nevertheless  clear  and  undoubted  facts  that  show 
the  theory  to  be  at  least  in  part  mistaken?  These 
are  legitimate  and  necessary  questions.  “Can  each 
of  the  theory’s  postulates  about  elite  behavior  and 
motivation  be  demonstrated  and  measured  empir¬ 
ically?”  is  not,  however,  a  legitimate  question.  No 
seriously  explicative  theory,  even  in  the  most 
experimental  and  reputedly  scientific  fields,  can 
meet  such  a  test.  Things  that  are  observable 
merely  happen  as  they  would  if  the  theory  were 
true  and  no  more  elegant  and  parsimonious  expla¬ 
nation  is  available.  While  barriers  to  empirical 


research  and  inductive  explanation  exist  in  many 

areas  of  political  science,  they  are  especially  pro¬ 
nounced  in  the  study  of  elites. 

John  Higley 
University  of  Texas  at  Austin 
Austin,  Texas,  United  States 
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Elitism 


The  etymology  of  the  word  elite  (occasionally 
spelled  elite)  comes  from  the  Latin  eligere  (to 
elect).  In  general  usage,  an  elite  is  a  relatively  small 
dominant  group  or  social  category  of  people  who 
occupy  a  position  of  privilege  or  dominance  and 
have  a  privileged  status  within  a  larger  society.  To 
examine  the  “elite”  category  in  political  science  is 
to  take  up  a  fundamental  question — that  of  defin¬ 
ing  the  group  or  groups  that  hold  “power”  in  a 
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social  system,  that  is,  those  who  have  a  dominating 
influence  on  the  definition  and  the  carrying  out  of 
public  decisions.  The  question  of  who  holds  power 
in  modern  societies  is  generally  dealt  with  from 
three  different  theoretical  standpoints: 
(1)  elitism,  (2)  pluralism,  and  (3)  Marxism.  From  an 
elitist  perspective,  power  belongs  to  an  elite  that  is 
rather  united.  Pluralists,  on  the  other  hand,  consider 
that  there  is  no  single  ruling  class  but  rather  a  plu¬ 
rality  of  ruling  groups  that  alternate  between  coop¬ 
eration  and  confrontation.  According  to  Marxists, 
power  in  capitalist  societies  is  monopolized  by  a 
dominant  class — the  bourgeoisie — and  its  auxilia¬ 
ries.  Here,  domination  is  concealed  by  ideology. 

Theories  of  Elitism 

The  concept  of  the  elite  was  first  introduced  by 
Vilfredo  Pareto  (1848-1923),  an  Italian  econo¬ 
mist,  sociologist,  and  senator.  A  firm  believer  in 
economic  liberalism  and  an  adversary  of  socialism, 
he  aimed  at  challenging  the  Marxist  conception  of 
class  struggle.  In  1901,  he  published  Rivista  itali- 
ana  di  sociologica,  a  famous  text  later  translated  as 
The  Rise  and  Fall  of  the  Elites:  An  Application  of 
Theoretical  Sociology  (1968),  and  in  1916  he  pub¬ 
lished  Trattato  di  Sociologia  Generate,  later  trans¬ 
lated  as  The  Mind  and  Society  (1935).  He  defined 
members  of  the  elite  as  people  with  exceptional 
virtues  who  show  distinguished  abilities  in  any 
domain.  Being  part  of  the  elite  therefore  depends 
on  individual  capacities  and  natural  talent  that 
lead  to  above-average  success. 

Gaetano  Mosca  (1858-1941)  had  previously 
worked  on  a  very  similar  theory,  although  not 
explicitly  using  the  concept  of  elite.  A  professor  of 
public  law,  and  deputy  and  then  life  senator  of  the 
former  Kingdom  of  Italy,  he  wrote  the  first  version 
of  Elementi  di  scienza  politica  in  1896  (The  Ruling 
Class,  1939).  He  further  developed  the  notion  of 
the  “ruling  class”  and  the  idea  that  all  societies  are 
based  on  a  distinction  between  the  ruled  and  rulers. 
According  to  Mosca,  the  former  is  always  an  orga¬ 
nized  minority  that  takes  on  all  political  responsi¬ 
bilities,  monopolizes  power,  and  enjoys  all  the 
subsequent  advantages.  The  latter,  always  greater 
in  number,  is  led  and  controlled  by  the  rulers,  who 
act  more  or  less  legally  and  sometimes  violently. 

The  German  socialist  Roberto  Michels  (1876- 
1936)  is  also  among  the  elitist  authors.  A  pupil  of 


Max  Weber,  he  studied  in  England,  France, 
Germany,  and  Italy,  where  he  taught  economics 
and  political  science.  After  World  War  I,  he  joined 
Mussolini’s  Fascist  party,  whose  ideals  he  per¬ 
ceived  as  a  more  democratic  form  of  socialism.  In 
1911,  he  published  Z ur  Soziologie  des  Parteiwesens 
in  der  modernen  Demokratie.  Untersichungen  iiber 
die  oligarchischen  Tendenzen  des  Gruppenlebens 
{Political  Parties:  A  Sociological  Study  of  the 
Oligarchical  Tendencies  of  Modem  Democracies, 
1915).  His  premise  was  that  the  masses  lacked  the 
“mechanical  and  technical”  capacity  to  govern 
themselves.  According  to  Michels,  a  society  cannot 
exist  without  a  dominant  class  because  leadership 
is  technically  indispensable  to  the  survival  of  orga¬ 
nizations.  The  principle  of  his  “iron  law  of  oligar¬ 
chy”  is  that  one  dominant  class  inevitably 
succeeds  to  leadership  in  an  organization,  as  the 
collective  psychological  characteristics  of  both 
masses  and  elites  make  the  leaders  of  any  organi¬ 
zation  become  concerned  above  all  with  remaining 
in  office  and  strengthening  their  control  over  the 
organization. 

In  1941,  when  his  seminal  work  The  Managerial 
Revolution:  What  Is  Happening  in  the  World  was 
published,  James  Burnham  (1905-1987)  had 
already  left  the  American  Socialist  Workers  Party 
and  was  soon  to  serve  as  a  public  intellectual  of  the 
American  conservative  movement.  Burnham 
argued  that  the  19th-century  bourgeoisie  has  been 
replaced  since  World  War  I  by  a  “social  group  or 
class,”  which  he  called  “managers.”  They  control 
the  management,  the  leadership,  and  the  coordina¬ 
tion  of  the  economy.  But  these  managers  are  not 
capitalists  as  they  do  not  own,  finance,  or  have  a 
major  share  in  the  company.  Thus,  the  main  differ¬ 
ence  between  the  ruling  elite  and  the  masses  does 
not  lie  in  ownership  as  much  as  in  the  control  of 
the  means  of  production,  in  particular  through 
knowledge  and  technical  competence.  According 
to  Burnham,  managers  do  not  seek  profits  so  much 
as  they  seek  power  itself,  their  incentive  being 
more  political  than  economic.  They  do  not  distin¬ 
guish  between  corporate  and  state  ownership,  and 
their  oligarchy  holds  power  in  both  capitalist  and 
socialist  regimes. 

These  four  authors  are  sometimes  called  the 
“Machiavellians,”  as  they  are  deemed  to  convey  a 
pessimistic  and  cynical  view  of  the  world.  They 
defend  the  idea  of  the  autonomy  of  politics  toward 
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economic  infrastructure  and  give  a  prominent 
position  to  political  factors  (and  not  economic 
ones),  as  their  main  interest  lies  in  power,  not  in 
ownership. 

Usually  defined  as  one  of  the  elitist  authors,  the 
American  sociologist  C.  Wright  Mills  (1916-1962) 
opposed  both  Marxism  (and  its  principle  that 
power  is  based  on  the  ownership  of  the  means  of 
production)  and  the  liberal  version  defended  by 
the  previous  authors  (in  particular,  the  idea  of  an 
autonomous  political  order).  Mills  (1956)  showed 
that  in  the  United  States,  the  main  decisions  are 
made  by  an  elite  composed  of  the  people  at  the  top 
of  the  three  big  institutional  hierarchies  (political, 
military,  and  economic  elites)  that  form  a  “triangle 
of  power.”  These  major  institutional  orders  are 
interdependent  and  interlocked,  and  their  mem¬ 
bers  share  a  common  worldview  and  a  class  iden¬ 
tity  (they  consider  themselves  as  being  separate 
and  superior  to  the  rest  of  the  society). 

Similarly,  the  American  professor  of  psychology 
and  sociology  G.  William  Domhoff  (1936-  ) 
argued  in  his  1967  book  Who  Rules  America ?  that 
there  exists  a  ruling  class  in  the  United  States  that 
is  the  elite  and  in  it  is  concentrated  the  major  part 
of  national  wealth;  it  controls  the  major  compa¬ 
nies  and  the  main  banks;  runs  the  foundations, 
universities,  and  media;  and  therefore  leads  the 
country’s  economy.  According  to  Domhoff,  those 
who  own  income-producing  property  (corpora¬ 
tions,  real  estate,  and  agribusinesses)  set  the  rules 
within  which  policy  battles  are  waged.  For  him, 
economy  is  the  only  power  network  of  any  conse¬ 
quence  in  the  history  of  the  United  States.  Floyd 
Hunter  (1953)  reached  similar  conclusions  about 
power  at  the  local  level.  He  examined  in  detail  the 
power  relationships  in  Atlanta,  Georgia,  where  he 
found  that  the  city  was  led  by  a  small  and  homo¬ 
geneous  elite  and  that  “real”  decisions  were  made 
by  a  corporate  oligarchy. 

Elites’  Circulation 

The  notion  of  elites’  circulation  is  central  to  elitist 
theories.  According  to  Pareto,  because  being  part 
of  the  elite  is  not  hereditary,  there  is  a  constant 
replacement  of  old  elites  by  new  ones  via  the  circu¬ 
lation  of  individuals,  a  process  that  enables  a  cer¬ 
tain  social  equilibrium  (and  prevents  revolution). 
This  continuous  replacement  of  one  elite  by 


another  is  the  driving  force  behind  history.  Mosca 
further  developed  Pareto’s  theory  of  elites’  circula¬ 
tion,  claiming  that  modern  democratic  societies 
enabled  a  high  degree  of  upward  social  mobility. 
For  these  authors,  hereditary  inequalities  are  sec¬ 
ondary  and  affect  elites’  circulation  only  margin¬ 
ally.  Individual  capacities  are  indeed  necessary  to 
remain  part  of  the  elite,  given  the  fierce  individual 
competition  at  stake. 

According  to  Mills,  the  interlocking  nature  of 
the  power  elite  allows  for  interchange  between  dif¬ 
ferent  sorts  of  institutions,  maintaining  and 
strengthening  the  power  of  each.  But  circulation  is 
confined  to  the  “three  bigs,”  among  which  it  is 
easy  to  move  frequently  given  the  shared  interests 
and  origins  of  the  rulers.  In  the  same  vein,  the 
French  political  sociologist  Pierre  Birnbaum 
(1940-  )  holds  that  there  is  interpenetration  and  a 
never-ending  circulation  within  this  socially  and 
culturally  homogeneous  ruling  space  (Birnbaum, 
1978). 

Criticisms  of  Theories  on  Elitism 
The  Pluralist  View 

Strongly  inspired  by  Weber’s  work,  supporters 
of  pluralist  analysis  believe  that  public  policy  is  the 
result  of  interactions  between  the  players  and  vari¬ 
ous  competing  groups  situated  both  inside  and 
outside  the  sphere  of  the  state.  No  particular 
group,  or  elite,  can  occupy  a  constant  and  far- 
reaching  dominant  position. 

Robert  Alan  Dahl  (1915-  )  strongly  opposes 
Mills’s  and  Domhoff’ s  views  on  the  nature  of  poli¬ 
tics  in  the  United  States.  Dahl  holds  that  far  from 
being  unitary  and  demographically  narrow,  the 
elites  are  numerous  and  autonomous,  and  alter¬ 
nate  between  competition  and  compromises. 
According  to  his  polyarchy  model,  no  elite  domi¬ 
nates,  which  makes  compromises  and  negotiations 
necessary.  His  1961  study  of  the  formal  and  infor¬ 
mal  power  structures  in  New  Haven,  Connecticut, 
where  the  University  of  Yale  is  located,  supports 
this  view.  This  analysis  can  also  be  found  in  Aaron 
Wildavsky  (1930-1993)  who  examined  local  deci¬ 
sions  made  in  Oberlin,  Ohio.  In  his  1964  book, 
Leadership  in  a  Small  Town,  he  concludes  that  the 
structure  of  power  is  pluralistic  and  that  according 
to  the  nature  of  the  problems  at  stake,  different 
groups  will  lead  and  rule  at  the  local  level. 
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According  to  the  French  sociologist,  political 
scientist,  philosopher,  and  journalist  Raymond 
Aron  (1905-1983),  power  is  structured  in  a  very 
complex  way  in  modern  societies  (Aron,  1950). 
He  distinguishes  the  political  elite  from  five  rul¬ 
ing  categories:  those  who  hold  “spiritual  power” 
and  influence  the  ways  of  thinking  and  believing 
(priests,  intellectuals,  scientists  and  scholars,  and 
ideologists),  army  and  police  chiefs,  managers  of 
collective  work  (those  who  own  or  control  the 
means  of  production),  mass  leaders  (at  the  head 
of  trade  unions  or  political  parties),  and  senior 
civil  servants  who  hold  administrative  power.  In 
modern  societies,  these  categories  are  highly 
competitive. 

The  Marxist  Thesis 

Marxist  theory  is  based  on  an  economic  analy¬ 
sis  according  to  which  power  is  held  by  the  owners 
of  the  means  of  production,  the  bourgeoisie,  who 
also  monopolize  political  power  and  control  the 
state  apparatus.  According  to  Marxists,  economic 
factors  play  a  prominent  role;  the  state,  far  from 
being  neutral,  is  a  means  of  political  domination  in 
the  hands  of  the  bourgeoisie. 

According  to  Marxists,  pluralists  conceal  real¬ 
ity  by  substantiating  the  liberal  illusion  of  an 
autonomous  political  order.  They  give  credence  to 
the  idea  of  state  neutrality  and  mask  the  true 
nature  of  state  power  as  well  as  class  struggle. 
Marxists  also  criticize  Mills’s  elitism  for  not  being 
based  on  the  ownership  of  the  means  of  produc¬ 
tion.  Elitists  in  general  are  criticized  for  focusing 
on  political  domination  and  failing  to  identify  it 
with  the  economic  class  that  owns  the  means  of 
production.  This  criticism  is  in  particular  sup¬ 
ported  by  the  Greco-French  political  sociologist 
Nicos  Poulantzas  (1937-1979),  who  rejected  the 
very  notion  of  elite  in  Fouvoir  politique  et  classes 
sociales  (1968),  published  in  English  as  Political 
Power  and  Social  Classes  (1973).  He  disagreed 
with  another  Marxist  political  theorist  and  soci¬ 
ologist,  Ralph  Miliband  (1924-1994),  a  friend  of 
Mills,  who  claimed  that  the  notion  of  elite  was 
relevant  and  that  Marxists  could  even  admit  the 
plurality  of  elites.  Besides  being  diverse,  these 
elites  all  belong  to  the  same  dominant  social  class 
and  share  the  same  social  origins,  the  same  net¬ 
works,  and  the  same  ideology. 


Elites  and  Socialization 

For  a  theorist  such  as  Mills,  leaders  share  the  same 
origins,  the  same  interests,  and  the  same  education 
and  training.  Their  solidarity  stems  from  their 
“social  similarities  and  psychological  affinities,”  in 
which  their  common  training  plays  a  critical  role 
as  the  basis  for  permanent  alliances  of  power  and 
for  strategic  marriages  and  other  affiliations. 
According  to  Domhoff  (1967),  educational  institu¬ 
tions  participate  in  the  reproduction  of  the  power 
of  the  wealthy  few.  Schools  and  universities  are 
indeed  developed  by  the  upper  class  and  are  filled 
with  their  children,  who  are  socialized  in  an  upper- 
class  worldview  along  with  the  newly  wealthy 
people  who  are  assimilated  there.  The  primarily 
White  and  Protestant  elite  of  the  Ivy  League  that 
he  identifies  as  being  dominant  in  all  fields  hold  a 
common  vision  of  society  and  “what  must  be 
done”  to  govern.  This  homogeneity  is  linked  to  a 
homologous  socialization  (same  schools,  same 
social  circles,  etc.)  and  to  their  easy  access  to  posi¬ 
tions  of  power  both  in  and  out  of  the  government. 

Education  is  also  critical  in  the  reproduction  of 
the  social  order,  according  to  the  French  sociologist 
Pierre  Bourdieu  (1930-2002).  He  showed  that  elites 
are  able  to  reproduce  themselves,  that  is,  pass  on 
their  privileges  to  their  offspring,  even  in  the  osten¬ 
sibly  most  meritocratic  social  fields,  such  as  educa¬ 
tion.  The  educational  system  enables  children  from 
the  dominant  class  to  obtain  the  best  diplomas  and 
thus  gain  access  to  dominant  social  positions.  At  the 
same  time,  the  system  legitimizes  their  academic 
achievement  by  linking  it  to  individual  and  natural 
gifts,  concealing  its  social  origins  (Bourdieu,  1996). 

Education  and  Monopolization 
of  Power  Positions 

According  to  Birnbaum  (in  Madeleine  Grawitz  & 
Jean  Leca,  1985),  the  differentiation  between  state 
and  society,  which  is  greater  or  smaller  depending 
on  the  country  and  the  time  period,  may  account 
for  both  the  contours  of  the  elite  or  elites  involved 
in  the  making  of  public  decisions  and  the  extent  of 
their  power.  Thus,  for  example,  in  the  case  of 
France,  at  least  since  1945,  the  combination  of  a 
protean  state  intervention  and  networks  for  the 
education  and  selection  of  elites  in  a  few  presti¬ 
gious  state  ugrandes  ecoles ”  have  contributed  to 
the  quasi-monopolization  of  ruling  positions  by 
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those  belonging  to  the  highest  levels  of  the  engi¬ 
neering  and  administrative  fields  (Ezra  Suleiman, 
1978).  This  tendency  was  mainly  reinforced  by  the 
relative  weakness  of  intermediary  bodies,  employ¬ 
ers’  organizations,  and  trade  unions  and,  from  the 
1960s  onward,  by  a  process  of  political  “technoc¬ 
ratization,”  that  is,  the  access  to  positions  of 
political  responsibility  by  high-ranking  civil  ser¬ 
vants.  The  fact  that  these  situations  were  not  called 
into  question  undoubtedly  contributed  to  the 
legitimization  of  an  ultimately  hegemonic  organi¬ 
zation  of  power  distribution. 

If  one  believes  that  the  place  occupied  by  the 
elite(s)  varies  according  to  the  problems  presented 
and  the  time  period  concerned,  then  the  beginning 
of  the  crisis  of  the  French  model  of  public  policies 
in  the  1970s  had  a  significant  effect  on  the  elitist 
structures.  Members  of  the  state  nobility  still 
occupy  the  majority  of  the  dominant  positions  in 
the  field  of  public  service  and  continue  to  be  the 
main  players.  The  ability  of  the  administrative  elite 
to  move  easily  from  one  social  group  to  another 
(e.g.,  to  hold  directorial  positions  in  large  private 
firms)  allows  them  to  continue  to  capture  a  lot  of 
power,  even  if  they  are  not  in  power.  But  a  state’s 
fall  into  a  crisis,  on  both  ideological  and  practical 
levels,  is  also  accompanied  by  the  promotion  of 
new  elites  who  are  supposed  to  embody  new  ways 
of  defining  public  policy.  The  more  competitive 
nature  of  the  decision-making  networks,  the  intro¬ 
duction  of  private  sector-inspired  management 
models  into  public  administration,  Europeanization, 
and  even  the  globalization  of  issues,  and  thus  of 
procedures,  are  directly  contrary  to  a  unified, 
monistic  conception  of  power.  Administrative  elites 
are  thus  beginning  to  be  seriously  challenged  by 
corporate  actors  such  as  private  consultants,  who 
have  been  educated  in  the  same  “ grandes  ecoles” 
and  impose  themselves  as  experts  on  public  sector 
matters,  as  in  other  countries  (Canada,  the  United 
States,  or  the  United  Kingdom)  where  their  success 
has  been  much  quicker  (Denis  Saint-Martin,  2001). 

The  Concept  of  Elite  and 
the  Issue  of  Democracy 

Most  authors  agree  on  the  fact  that  democracy  is 
an  illusion  and  impossible  to  achieve.  According  to 
Roberto  Michels,  history  is  led  by  elites’  struggle 
for  power.  His  idea  of  the  iron  law  of  oligarchy 


was  the  result  of  his  analysis  of  leadership  patterns 
in  democratic  governments  and  in  other  organiza¬ 
tions,  such  as  trade  unions.  According  to  this  law, 
the  overthrow  of  any  elite  by  the  masses  will  even¬ 
tually  lead  to  such  oligarchical  leadership.  Michels, 
therefore,  refutes  the  possibility  of  the  existence  of 
an  ideal  democracy.  For  the  supporters  of  classical 
elitist  theories,  the  existence  of  elites  at  all  times 
and  places  is  a  natural  phenomenon  because  of 
unequal  distribution  of  talent,  wealth,  and  politi¬ 
cal  influence  in  society. 

According  to  Mills,  the  great  masses  of  people 
are  largely  unorganized,  ill  informed,  virtually 
powerless,  and  controlled  and  manipulated  by  the 
powerful  elite  who  exploit  them  both  economically 
and  politically.  Their  dependence  and  disorganiza¬ 
tion  prevent  the  masses  from  participating  in  dem¬ 
ocratic  life.  Above  the  masses,  Mills  saw  a  middle 
level  of  power  (composed  of  local  opinion  leaders 
and  special  interest  groups)  that  is  reflected  in  the 
U.S.  Congress  and  in  U.S.  political  parties.  But  they 
not  only  fail  to  represent  the  masses  but  also  have 
no  real  effect  on  the  elites,  who  leave  them  to 
debate  and  decide  some  minor  issues,  as  long  as 
they  do  not  represent  a  serious  challenge  to  their 
authority.  The  political  directorate  itself  is  described 
as  undemocratic  in  both  the  process  of  its  selection 
and  its  maintenance.  Moreover,  Mills  believed  that 
widespread  alienation  of  the  masses,  their  political 
indifference,  and  the  economic  and  political  con¬ 
centration  of  power  are  serious  threats  to  democ¬ 
racy.  The  U.S.  political  scientist  Harold  Dwight 
Lasswell  (1902-1978),  who  also  used  both  power 
and  control  of  hierarchies  as  elements  in  identifying 
elites,  insisted  on  the  critical  importance  of  legiti¬ 
macy.  In  his  1930  book  Psychopathology  and 
Politics,  he  held  that  the  extent  of  elites’  positions 
of  authority  is  determined  by  the  reactions  of  the 
public  toward  those  who  hold  power.  The  exis¬ 
tence  of  political  elites  depends  on  the  legitimacy  of 
their  authority  “in  the  eyes  of  the  people.” 

Pluralists  do  not  believe  in  the  ideal  of  democ¬ 
racy  either.  For  Dahl  (1989),  it  is  a  theoretical  uto¬ 
pia.  But  Dahl  sees  polyarchical  institutions  as  a 
major  improvement  as  they  create  multiple  and 
autonomous  centers  of  political  powers  where  deci¬ 
sions  can  be  made  through  competition,  bargaining, 
and  compromises. 

Marxists  also  view  liberal  democracy  as  an 
unrealistic  utopia.  According  to  them,  the  capitalist 
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state  is  inherently  undemocratic  as  it  represents  the 
dictatorship  of  the  bourgeoisie.  The  fact  that  inter¬ 
mediary  powers  such  as  the  media  and  most 
political  parties  need  the  support  of  the  bourgeoisie 
to  win  elections  makes  the  system  intrinsically 
undemocratic.  Vladimir  Lenin  asserted  in  The 
State  and  Revolution  (1917)  that  the  dictatorship 
of  the  proletariat  would  be  the  highest  possible 
form  of  democracy. 

Broadening  and  Completing  the  Analysis 

In  other  respects,  the  development  of  research  on 
the  structuring  of  public  decisions  and  their  practi¬ 
cal  implementation  has  led  to  a  better  understand¬ 
ing  of  the  role  played  by  the  elite.  The  classic  idea 
that  it  is  the  position  held  that  infers  the  power  now 
appears  to  be  a  necessary  but  insufficient  condition. 
Research  on  the  sociology  of  elites  conducted  in  the 
1960s  and  1970s  remains  quite  relevant  when  it 
comes  to  describing  the  social  and  educational 
resources  required  to  hold  a  position  of  influence 
and/or  power.  On  the  other  hand,  since  the  1980s, 
developments  in  sociology  of  public  policy  have 
allowed  for  the  reexamination  of  at  least  three 
avenues  of  research: 

1.  Public  decisions  are  made  in  an  often 
competitive  and  uncertain  context.  Holding  a 
position  objectively  seen  as  being  dominant 
does  not  guarantee  the  ability  to  effectively 
develop  a  given  public  policy.  If  the 
“dominators”  remain  dominant,  they  must 
exercise  their  authority  more  often  than  in  the 
past  and  bargain  to  keep  it. 

2.  Exerting  a  direct  influence  on  the  definition  of  a 
public  policy  does  not  necessarily  mean  that  the 
policy,  once  implemented,  will  not  be  the  object 
of  various  and  sometimes  unexpected 
reappropriations  that  can  ultimately  produce 
concrete  results  that  are  quite  different  from 
what  the  producers  intended. 

3.  To  understand  public  policies  through  the  eyes 
of  an  elite(s),  one  must  go  beyond  the  question 
of  specific  interests  to  take  into  account  the 
values,  world  visions,  and,  more  generally,  the 
knowledge  that  together  play  a  role  in  the 
framing  of  political  exchanges  and  decision 
making. 


One  of  the  major  contributions  of  the  sociology 
of  the  players  and  the  studies  of  “street-level 
bureaucracy”  (Michael  Lipsky,  1980)  or  middle¬ 
ranking  officials  (Herbert  Kauffman,  1960;  Page, 
2003)  who  carry  out  the  wishes  of  the  administra¬ 
tive  elite  is  to  show  that  public  policies  are  shaped 
and  designed  during  the  implementation  process 
by  people  with  low-  or  middle-range  powers.  In 
short,  the  elitist  approach  to  public  policy,  since  it 
expresses  a  global  point  of  view,  can  be  supple¬ 
mented  and  enriched  by  a  more  microsociological 
perspective  that  provides  a  better  explanation 
of  public  policies  in  action  and  thus  proposes  a 
re-reading  of  elitist  theories  in  the  negative. 
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Empire 


An  empire  is  a  type  of  political  organization  in 
which  the  metropolis  exercises  control  over  diverse 
peripheral  actors  through  formal  annexations 
and/or  various  forms  of  informal  domination. 
This  short  definition  may  well  be  uncontroversial, 
but  when  one  tries  to  make  it  more  specific  con¬ 
sensus  is  lost.  This  is  partly  because  most  social 
science  categorizations  reflect  abstract  models 
that  appear  in  a  variety  of  shapes,  forms,  times, 
and  places.  In  this  sense,  the  term  empire  is  as 
fuzzy  and  contestable  as  terms  such  as  democracy, 
class,  power,  or  nation.  But  unlike  these  other 
categories,  empire  is  laden  with  negative  historical 
and  ideological  connotations  and  is  often  used  as 


a  synonym  for  colonial  rule  based  on  oppression 
and  exploitation.  This  entry  examines  the  concept 
of  empire,  the  historical  evolution  of  empires,  and 
the  ways  in  which  a  study  of  empire  can  shed  light 
on  international  politics. 

The  Nature  and  Evolution  of  Empire 

Such  an  understanding  of  empire  is  ahistorical  and 
biased.  Empire  has  been  a  characteristic  form  of 
political  organization  since  early  antiquity  and 
predates  colonial  rule  by  several  centuries.  The 
colonial  era  may  be  over,  but  this  does  not  imply 
the  end  of  empire  (although  it  is  important  to 
emphasize  that  the  colonial  legacy  still  haunts  for¬ 
mer  colonial  empires  and  their  erstwhile  colonies). 
Moreover,  many  empires  were  symbols  of  peace 
and  prosperity,  rather  than  of  oppression  and 
exploitation.  Studies  of  empires  show  that  control 
within  them  can  be  based  on  incentives  rather  than 
coercion  or  on  a  combination  of  both.  It  can  be 
exercised  through  a  variety  of  military,  economic, 
and  cultural  means.  It  can  be  formal  or  informal  to 
varying  degrees.  Periphery  status  within  an  empire 
can  also  differ.  Some  peripheral  actors  are  given 
access  to  the  decision  making  and  resources  of  the 
metropolis,  while  others  are  kept  at  a  distance  or 
even  subject  to  open  discrimination  and  exploita¬ 
tion.  The  relationship  between  metropolis  and 
periphery  can  certainly  be  hierarchical  and  conflict 
ridden,  but  it  can  also  be  harmonious  and  based 
on  mutual  dependency.  Some  empires  are  orga¬ 
nized  in  concentric  circles  or  even  form  quite  loose 
multiple  independencies. 

The  nature  of  both  metropolis  and  peripheral 
actors  can  also  differ.  In  most  cases,  the  metropolis 
has  a  centralized  government,  differentiated  econ¬ 
omy,  and  shared  political  loyalties,  while  the 
“imperializable  peripheries”  (to  use  Michael 
Doyle’s  term)  have  weak  government,  undifferenti¬ 
ated  economies,  and  highly  divided  political  loyal¬ 
ties  (Doyle,  1986,  p.  19).  However,  the  imperial 
metropolis  can  also  have  a  relatively  weak,  limited, 
and  decentralized  government;  an  inefficient  eco¬ 
nomic  system;  and  multiple  cultural  identities.  For 
instance,  medieval  empires  are  known  for  having 
limited  and  decentralized  government  performing 
only  a  few  basic  governmental  functions.  They 
were  ridden  by  internal  conflicts  between  king  (or 
emperor)  and  the  lower  aristocracy  (whether  feudal 
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or  bureaucratic),  while  the  persistent  divergence  of 
local  cultures,  religions,  and  traditions  implied  a 
highly  divided  political  loyalty. 

The  metropolis  does  not  always  have  a  master 
plan  of  imperial  conquest.  States  can  become 
empires  by  default  because  they  try  to  bring  some 
order  to  unstable  neighbors  or  try  to  convert  bar¬ 
barians  into  “good”  citizens  or  into  a  specific  reli¬ 
gion.  Likewise,  an  empire  does  not  necessarily 
come  into  being  through  outright  aggression. 
Some  empires  emerge  quietly  or  even  surrepti¬ 
tiously  through  uneven  modernization  and  social 
differentiation.  They  may  not  even  see  themselves 
as  empires. 

Different  historical  periods  have  generated  dif¬ 
ferent  types  of  empires.  Alexander  the  Great  built 
his  empire  and  brought  a  multitude  of  nationalities 
under  his  central  authority  largely  by  the  use  of 
military  force.  The  Roman  Empire  relied  far  more 
on  peaceful  methods,  using  language  and  law  in 
the  service  of  its  civilizing  effort.  The  Chinese  Han 
Empire  was  renowned  for  its  administrative  skills, 
enabling  it  to  keep  diverse  and  autonomous  prov¬ 
inces  under  a  single  rule.  The  British  Empire,  like 
those  of  France,  Portugal,  and  Spain,  used  its 
maritime  power  and  supremacy  in  global  trade. 
The  Soviet  Empire  skillfully  applied  ideological 
penetration  alongside  the  older  military,  political, 
and  economic  techniques  of  empire  building. 

The  “end  of  empire”  has  been  declared  several 
times  throughout  history,  but  at  the  beginning  of 
the  21st  century  it  looks  as  if  empires  are  on  the  rise 
rather  than  in  decline.  The  United  States,  China, 
the  European  Union  (EU),  and  Russia  are  most 
frequently  described  as  empires.  All  four  represent 
vast  territorial  units  with  global  influence  in  mate¬ 
rial,  institutional,  and  ideological  terms.  They  pos¬ 
sess  not  only  global  economic  and  military  reach 
but  also  the  ability  to  influence  the  global  institu¬ 
tional  agenda  and  shape  the  notion  of  legitimacy  (if 
not  normality)  in  various  parts  of  the  world.  In 
addition,  all  four  have  a  record  of  acting  in  a  way 
that  imposes  significant  domestic  constraints  on  a 
variety  of  formally  sovereign  actors.  These  actors 
are  seen  as  a  kind  of  periphery  to  be  governed  by 
the  imperial  center.  (China  and  Russia  also  contain 
peripheries  within  their  own  borders  and  without 
formal  sovereignty,  such  as  Tibet  or  Chechnya.) 
Moreover,  the  policies  of  the  United  States,  China, 
the  EU,  and  Russia  are  guided  not  merely  by  their 


selfish  interests  but  also,  if  not  primarily,  by  a  uni¬ 
versal  normative  vocation  reminiscent  of  the  Pax 
Romana  or  the  old  Chinese  Mandate  of  Heaven.  In 
other  words,  they  all  pursue  a  civilizing  mission  to 
benefit  their  region  if  not  the  entire  world.  John 
Ikenberry  notes  that  the  United  States  claims  to  be 
an  “indispensable  nation”  maintaining  global  order 
and  possessing  an  economic  project  “congruent 
with  the  deeper  forces  of  modernization.”  The  EU 
sees  itself  as  a  pole  of  attraction  for  its  neighbors, 
contributing  to  what  the  EU  High  Representative 
has  described  as  a  fairer,  safer,  and  more  united 
world.  Even  contemporary  Russia  sees  itself  as  a 
bastion  against  the  barbarian  forces  of  chaos, 
nationalism,  and  religious  fundamentalism  flour¬ 
ishing  in  its  backyard. 

However,  there  are  significant,  and  in  some 
cases  striking,  differences  among  these  four  con¬ 
temporary  empires.  Consider,  for  instance,  the 
difference  between  an  autocratic,  nationalistic,  and 
militaristic  Russia  and  the  peculiar,  postmodern 
polity  that  is  the  EU,  with  no  single  center  of  gov¬ 
ernment,  no  single  demos  (let  alone  nation),  and 
no  army.  Nor  is  there  an  American  equivalent  of  a 
subjected  inner  periphery  like  Tibet,  Xinjiang,  and 
what  China  calls  Inner  Mongolia.  However,  many 
of  those  who  study  empire  argue  that  the  United 
States  tends  to  treat  the  entire  world  as  its  periph¬ 
ery.  Its  scope  of  territorial  control  is  truly  global. 
Although  control  by  the  United  States  is  usually 
informal,  it  is  backed  up  by  enormous  military  and 
economic  might. 

Of  course,  empires  may  rise  and  fall  even  while 
the  criteria  for  defining  them  are  under  discussion. 
Japan,  India,  Brazil,  South  Africa,  and  Germany 
may  be  seen  as  empires  of  a  sort  or  may  become 
empires  in  the  next  few  decades.  For  instance, 
Japan’s  reluctance  to  reassume  an  imperial  posture 
is  more  a  function  of  historical  memory  than  of 
military  capability.  Contemporary  Japan,  like 
China,  shows  signs  of  a  “middle  kingdom”  mental¬ 
ity — a  belief  that  the  world  revolves  around  it.  This 
is  clearly  an  imperial  feature.  And  Japanese  military 
expenditure  is  nearly  as  large  as  the  Chinese  and 
much  higher  than  the  Russian. 

India  also  shows  signs  of  a  middle-kingdom  men¬ 
tality.  Moreover,  India  exercises  significant  influ¬ 
ence  over  Sri  Lanka,  Nepal,  and  Bangladesh.  Time 
and  again,  India  has  been  able  to  use  its  power  quite 
effectively  in  global  trade  talks,  environmental 
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negotiations,  and  interstate  disputes.  Many  Indians 
believe  that  their  democratic  and  economic  profile, 
together  with  their  language  abilities,  give  them  the 
edge  over  Russia  or  China.  But  the  strength  of 
Indian  society  has  yet  to  translate  into  strong  insti¬ 
tutions,  and  this  hampers  India’s  global  rise  and 
contains  its  imperial  aspirations. 

Empire  and  the  Analysis 

of  International  Politics 

If  the  list  of  empires  is  open-ended  and  the  differ¬ 
ences  between  them  are  so  striking,  does  a  study  of 
empire  help  our  understanding  of  international 
politics?  Is  the  state  paradigm  not  sufficient  and, 
indeed,  more  suitable?  The  problem  is  that  the 
notion  of  a  state  is  as  fuzzy  as  the  notion  of  empire. 
The  United  States  is  a  state;  but  so  too  are  Somalia, 
Luxembourg,  and  Costa  Rica.  Moreover,  the  state 
paradigm,  with  its  emphasis  on  territoriality  and 
sovereignty,  seems  ill  suited  for  understanding  the 
21st  century  with  its  cascading  economic  interde¬ 
pendence,  transnational  jurisdictions,  and  multiple 
cultural  loyalties.  Besides,  the  reemergence  of 
empires  does  not  herald  the  end  of  nation  states. 
As  Herfried  Miinkler  (2007)  pointed  out,  contem¬ 
porary  imperial  structures  are  superimposed  on 
the  state  order,  but  they  no  longer  replace  it: 
“Whoever  thinks  of  imperiality  as  simply  an  alter¬ 
native  to  statehood  will  come  to  the  conclusion 
that  no  empires  exist  today”  (p.  6). 

Some  authors  prefer  to  use  the  term  hegemony 
in  place  of  empire.  However,  hegemony  is  not  nec¬ 
essarily  a  synonym  for  empire.  According  to 
Alejandro  Colas  (2007,  p.  165),  for  example,  hege¬ 
mony,  unlike  empire,  has  no  final  instance,  no 
decisive  location  or  moment  of  authority,  but  is 
reproduced  instead  through  a  sequence  of  coordi¬ 
nated  and  generally  consensual  actions  by  a  multi¬ 
plicity  of  actors,  albeit  orchestrated  by  a  leading 
state.  True,  the  concept  of  hegemony,  like  the  con¬ 
cept  of  empire,  underlines  the  inequality  among 
individual  actors  and  the  ensuing  hierarchy  in  the 
international  system.  However,  the  literature  on 
hegemony  tells  us  less  about  the  nature  of  the 
actors  exercising  hegemony  than  does  the  literature 
on  empire.  Power  is  often  detached  from  agency  in 
the  study  of  hegemony.  Hegemony  discourse  is 
more  about  the  symptoms  of  power  politics  than  its 


The  study  of  empire  is  not  only  about  power 
but  also  about  the  actors  engaged  in  power  poli¬ 
tics.  As  such,  it  helps  us  understand  the  nature  of 
the  magnetic  poles  within  the  current  global  sys¬ 
tem.  Moreover,  it  forces  us  to  examine  power  not 
only  in  crude  economic  or  military  terms  but  also 
in  institutional  and  ideological  ones.  As  noted 
above,  most  empires  try  to  “civilize”  and  “institu¬ 
tionalize”  their  peripheries,  not  merely  to  conquer 
them.  The  study  of  empire  also  underlines  the  role 
of  order,  hierarchy,  and  structure  in  international 
politics.  It  is  not  chiefly  about  exploitation  and 
inequality,  as  is  often  the  case  in  the  study  of  hege¬ 
mony.  In  fact,  the  study  of  empire  shows  that  some 
peripheries  can  display  an  impressive  ability  to 
determine  the  politics  of  the  center,  despite  all  the 
material  or  normative  asymmetries.  This  leads  to 
another  advantage  that  a  focus  on  empire  offers:  It 
helps  overcome  the  often  misleading  distinction 
between  domestic  and  foreign  affairs.  Empires  sel¬ 
dom  possess  fixed  and  hard  borders,  and  this 
fuzziness  or  permeability  of  borders  largely  deter¬ 
mines  the  politics  of  the  imperial  center.  For 
instance,  the  nature  of  European  integration  is 
constantly  being  reshaped  by  its  successive  waves 
of  enlargement.  Russian  policy  is  largely  deter¬ 
mined  by  the  fate  of  ethnic  Russians  living  outside 
its  formal  frontiers.  The  one-way  permeability  of 
U.S.  borders  allows  Washington  to  intervene  in  the 
politics  of  various  states  in  Central  America  and 
the  Caribbean,  without  having  seriously  to  con¬ 
sider  the  possibility  that  they  will  intervene  in  the 
U.S.  sphere. 

Finally,  the  study  of  empire  helps  us  understand 
the  evolving  pattern  of  cooperation  and  conflict 
among  major  power  poles.  History  shows  that 
empires  can  coexist  in  parallel  if  they  do  not  con¬ 
test  similar  spheres  and  spaces  and  if  they  respect 
each  other’s  claims  to  legitimacy.  The  parallel 
functioning  of  the  Roman  and  Chinese  empires  is 
a  good  example  of  such  coexistence.  Contemporary 
empires  often  contest  similar  spheres  and  spaces. 
Moreover,  the  dominance  of  the  United  States  is 
likely  to  generate  conflict  even  though  other  con¬ 
temporary  empires  have  not  yet  tried  to  balance  or 
deter  the  United  States. 

Jan  Zielonka 
University  of  Oxford 
Oxford,  United  Kingdom 
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Empowerment 


Empowerment  is  a  concept  that  has  received  much 
interest  in  the  generic  management  literature  dur¬ 
ing  recent  years.  There  are  numerous  definitions 
of  this  concept,  but  the  common  denominator  of 
all  these  deals  with  the  distribution  of  power  to 
lower  levels  of  a  hierarchy.  The  meaning  of 
empowerment  for  the  public  sector  is,  however, 
an  arena  that  calls  for  more  elaboration  and 
revivification.  This  short  entry  aims  at  three  major 
goals;  (1)  to  provide  an  up-to-date  review  and 
explanation  of  the  generic  meaning  of  empower¬ 
ment,  (2)  to  explore  the  importance  of  empower¬ 
ment  for  public  sector  domains,  and  (3)  to  provide 
a  short  direction  to  different  foci  of  empowerment 
in  the  context  of  the  public  sector. 

The  Essence  of  Empowerment:  A  Broad 
Organizational  and  Managerial  View 

Empowerment  is  a  multifaceted  construct,  and 
there  is  much  deliberation  with  regard  to  its  mean¬ 
ing  and  focus.  Discussions  regarding  its  meaning 
include  questions  of  whether  it  is  merely  the  shar¬ 
ing  of  power,  a  more  general  concept  of  decentral¬ 
ization,  or  a  psychological  motivational  concept 
regarding  the  person’s  feelings  of  worth.  The  focus 


or  target  of  the  empowerment  process  may  be 
employees,  the  end  users/clients/public,  or  institu¬ 
tions/organizations/subnational  governments. 

As  control  and  power  are  regarded  in  the  man¬ 
agement  literature  from  either  a  relational  perspec¬ 
tive  or  a  motivational  perspective,  the  concept  of 
empowerment  too  can  be  seen  from  these  two 
different  points  of  view.  From  the  relational  per¬ 
spective,  power  is  a  function  of  dependence  and/or 
interdependence  of  parties,  and  thus  empower¬ 
ment  is  a  process  in  which  a  leader  or  manager 
shares  his  or  her  power  with  subordinates,  making 
them  less  dependent  on  him  or  her.  In  accordance, 
empowerment  is  practiced,  by  those  adopting  this 
perspective,  through  participative  management 
techniques  such  as  goal  setting  by  subordinates 
and  quality  circles.  From  a  motivational  perspec¬ 
tive,  power  is  an  intrinsic  need  for  self-determina¬ 
tion  or  a  belief  in  personal  self-efficacy.  Thus, 
empowerment  is  the  creation  of  conditions  enhanc¬ 
ing  a  strong  sense  of  self-efficacy.  Kenneth  Thomas 
and  Betty  Velthouse  (1990)  built  on  this  psycho- 
logical-motivational  definition  and  argued  that 
empowerment  is  a  multifaceted  concept  of  four 
cognitions:  (1)  meaning,  (2)  competence,  (3)  self- 
determination,  and  (4)  impact.  Meaning  is  a  match 
between  the  job  requirements  and  the  person’s 
beliefs,  values,  and  behaviors.  Competence  is  the 
person’s  belief  in  his  or  her  own  capability  to  per¬ 
form  the  job  well.  Self-determination  is  related  to 
autonomy  in  initiating  and  continuing  work 
behaviors  and  processes.  Finally,  impact  is  the 
extent  to  which  a  person  can  influence  strategic, 
administrative,  or  operational  outcomes.  These 
cognitions  are  shaped  by  the  person’s  work  con¬ 
text  and  personality  characteristics  and  in  turn 
motivate  individual  behavior.  Thus,  empower¬ 
ment  processes  under  this  perspective  will  encour¬ 
age  self-managing  or  cross-functional  teams  and 
will  provide  information  about  the  organization’s 
goals  and  strategy  to  give  the  workers  a  sense  of 
ownership.  Similarly,  according  to  David  Bowen 
and  Edward  Fawler  (1995),  empowerment  entails 
a  high-involvement  approach.  Thus,  organiza¬ 
tions  are  encouraged  to  change  their  policies, 
practices,  and  structure  in  order  to  create  and 
sustain  empowerment,  and  this  change  must  dis¬ 
tribute  power,  authority,  responsibility,  informa¬ 
tion,  knowledge,  and  rewards  throughout  the 
organization. 
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The  Meaning  of  Empowerment 
in  the  Public  Sector 

The  generic  psychological  understanding  of 
empowerment  in  organizations  is  useful  and  mean¬ 
ingful  when  state  agencies  and  social  governmental 
environments  are  discussed.  Essentially,  the  discus¬ 
sion  of  empowerment  in  such  neighborhoods  calls 
for  integration  of  other  concepts  such  as  democ¬ 
racy,  citizens’  involvement,  public  sector  reforms, 
and  multiple  stakeholders’  engagement.  As  will  be 
explained  below,  such  a  discussion  must  also  con¬ 
sider  both  the  personal  and  the  institutional  impli¬ 
cations  of  empowerment.  Putting  it  another  way, 
what  is  the  contribution  of  empowerment  to  the 
theory  and  practice  of  public  administration,  to 
public  organizations,  and  to  the  overall  under¬ 
standing  of  government  and  governance? 

Most  important,  empowerment  can  be  seen  as  a 
democratic  strategy  that  involves  multiple  stake¬ 
holders  in  the  action  of  governance  or  in  the  action 
of  managing  governmental  agencies.  As  such, 
empowerment  allows  greater  self-contributing 
behavior;  it  encourages  individuals’  engagement  in 
public  actions  and  advances  the  production  of 
public  goods  and  services.  It  can  also  contribute  to 
commitment  to  public  service  and  to  public  sector 
motivation  (PSM)  by  all  those  who  are  part  of  the 
process,  be  it  public  personnel,  citizens,  residents 
of  cities  and  communities,  or  other  stakeholders 
such  as  the  private  sector  and  the  third  sector. 

There  are  two  major  perspectives  of  empower¬ 
ment  in  this  regard.  One  perspective  sees  democ¬ 
racy  as  a  predominantly  aggregative  theory,  and 
the  other  perspective  sees  democracy  as  a  predom¬ 
inantly  integrative  theory.  Predominantly  aggrega¬ 
tive  theories  see  democracy  as  a  means  of  distribut¬ 
ing  political  power  and  influence  and  of  regulating 
conflict  within  a  society.  Thus,  empowerment 
refers  to  equal  influence  over  processes  of  collec¬ 
tive  governance  and  individual  autonomy. 
According  to  Albert  Hirschman,  citizens  have  two 
means  of  empowerment  available  to  them  when 
they  are  not  satisfied  with  the  consequences  of  a 
collective  action:  exit  and  voice.  In  an  aggregative 
strategy  of  empowerment,  exit  by  voting  for  a  new 
political  party  is  a  way  of  punishing  politicians  and 
political  organizations  that  were  unsatisfactory 
and  a  way  of  escaping  unacceptable  majority  deci¬ 
sions.  Voice,  in  this  perspective,  is  a  supplementary 
means  of  empowerment  as  it  gives  the  decision 


makers  information  about  the  reasons  exit  was 
taken. 

Predominantly  integrative  theories  see  democ¬ 
racy  as  a  means  for  producing  democratic  citizens. 
According  to  this  view,  society  is  a  social  construc¬ 
tion  that  is  more  than  the  sum  of  the  preferences  of 
individuals.  Empowerment  is  therefore  the  trans¬ 
formation  of  individuals  into  citizens  by  participa¬ 
tion.  Thus,  in  this  perspective,  voice  is  the  main 
vein  of  empowering  individuals  as  it  gives  them  a 
chance  to  participate  in  decision-making  processes. 
As  Danny  Burns,  Robin  Hambleton,  and  Paul 
Hoggett  (1994)  point  out,  voice  empowers  as  it 
informs,  organizes,  and  gives  confidence  to  the 
citizens.  They  argue  that  a  strategy  of  empower¬ 
ment  under  this  perspective  must  include  decen¬ 
tralization  of  authority  to  local  governments, 
institutionalization  of  communication  channels 
between  citizens  and  politicians,  and  encourage¬ 
ment  of  citizen  participation  in  general. 

The  Target  of  Empowerment: 

A  Triple  Meaning 

Based  on  the  above,  empowerment  in  public 
administration  employs  at  least  three  major  targets: 
(1)  public  officers/employees,  (2)  citizens  as  cus¬ 
tomers,  and  (3)  institutions  or  subsectors.  Whereas 
the  first  relate  to  the  individual  level,  the  latter 
refers  to  the  institutional  and  subgovernment  level. 

First,  stemming  from  the  generic  managerial 
literature,  the  most  classical  orientation  of  empow¬ 
erment  in  the  public  sector  is  with  public  sector 
employees.  These  internal  “agents”  of  policy  and 
public  action  are  the  primary  mechanism  through 
which  public  strategies,  policies,  and  actions  are 
implemented.  Public  employees  and  public  officers 
can  be  trained  and  encouraged  to  take  greater  con¬ 
trol  over  their  jobs,  tasks,  and  decisions  in  the 
organizations.  This  process  of  empowering  people 
should  go  hand  in  hand  with  formal  rules  and  pro¬ 
cedures  that  characterize  the  public  bureaucracy. 
Although  they  cannot  go  against  them,  they  must 
also  give  employees  more  flexibility  in  doing  the 
job.  Flexibility  does  not  mean  taking  individual 
decisions  that  contradict  the  formal  rules  that 
bureaucracies  so  typically  maintain  and  safeguard. 
Hence,  although  bureaucracy  may  become  some 
barrier  to  empowerment,  there  is  much  room  for 
building  innovative  mechanisms  that  allow  healthy 
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involvement,  intra-organizational  democracy,  and 
self-motivation  that  works  for  the  good  of  many. 
The  restraints  and  control  of  bureaucracy  also 
somewhat  limit  the  voice  of  employees,  but  mod¬ 
ern  managerial  techniques  such  as  team  building, 
self-managed  teams,  job  enrichment,  and  even 
quality  circles  or  goal-setting  strategies  may 
increase  commitment  and  build  a  sense  of  recre¬ 
ation  in  public  sector  agencies. 

Next  is  the  citizenry  level.  While  in  the  manage¬ 
ment  literature  the  focus  or  target  of  empowerment 
are  the  employees,  in  the  public  administration 
literature  the  reference  is  the  citizens  themselves. 
When  the  focus  is  the  citizens  themselves,  empow¬ 
erment  is  the  government  acknowledging  that 
people  should  be  involved  in  decisions  regarding 
their  own  lives  and  acting  accordingly.  As  Eran 
Vigoda  (2002)  notes,  empowerment  is  seen  when 
governments  treat  the  citizens  as  customers  or  as 
partners,  take  their  complaints  seriously,  encour¬ 
age  them  to  participate  in  open  hearings,  and,  at 
more  extreme  levels,  let  them  manage  public  insti¬ 
tutions  such  as  schools  or  housing  estates. 
According  to  Eran  Vigoda-Gadot  and  Shlomo 
Mizrahi  (2008),  this  view  has  gained  major  atten¬ 
tion  in  recent  decades,  and  a  rich  discussion  of 
public  sector  reforms  has  evolved,  which  has  been 
heavily  influenced  by  the  new  public  management 
(NPM)  doctrine  that  highlighted  the  role  of  citizens 
in  governance  and  the  meaning  of  customer  orien¬ 
tation  in  public  service. 

Finally,  the  literature  mentions  empowerment 
in  regard  to  subnational  governments.  When  the 
focus  of  empowerment  is  on  subnational  govern¬ 
ment,  the  discussion  is  around  “decentralization” 
or  “devolution”  of  power,  with  the  notion  that 
local  or  regional  decision  makers  know  better 
what  matches  the  needs  and  capabilities  of  their 
respective  public.  Elowever,  such  empowerment  is 
much  debated  as  it  produces  little  coherence  in 
decision  making  and  does  not  necessarily  result  in 
superior  public  policy.  In  addition,  governments 
may  empower  social  movements  to  carry  out 
actions  and  legislation  relevant  to  a  specific  social 
movement  or  to  represent  specific  ideas  in  govern¬ 
ment  deliberations.  In  recent  years,  the  question  of 
social  movements  and  third-sector  empowerment 
has  intensified.  Should  government  encourage 
more  empowerment  of  those  bodies  and  by  so 
doing  improve  services  to  citizens?  Or,  could  this 


be  a  sign  of  the  hollow  state  that  is  practically 
incapable  of  meeting  the  expectations  of  the  peo¬ 
ple  with  its  own  resources  and  bureaucratic  agen¬ 
cies?  This  question  remains  moot. 

Conclusion 

Taking  both  the  aggregative  and  integrative  theo¬ 
ries  together,  it  is  easier  to  understand  the  complex 
meaning  of  empowerment  in  public  sector  domains. 
Beyond  the  conventional  meaning  of  engagement 
in  organizational  life,  empowerment  in  the  public 
sector  has  deeper  and  wider  implications.  It  needs 
to  refer  to  more  stakeholders.  It  must  consider  the 
effect  on  larger  sections  of  society,  and  it  deals  with 
forming  and  shaping  the  nature  of  our  democratic 
values.  Therefore,  public  sector  empowerment  has 
a  widespread  effect  beyond  the  organizational 
arena.  Its  echo  is  heard  far  beyond  the  realms  of 
public  organization  and  affects  the  inner  fabric  of 
government  and  governance  in  modern  societies. 

Empowerment  is  thus  a  concept  that  is  of  much 
relevance  in  modern  times  to  organizations,  mar¬ 
kets,  governments,  and  people.  Its  generic  manage¬ 
rial  exploration  diffuses  into  the  public  sector  and 
receives  additional  interpretations  that  reflect  the 
complexity  of  public  organizations  and  the  need  to 
involve  multiple  stakeholders  and  subcontract 
with  institutions  on  the  federal,  national,  and  local 
levels.  Empowerment  in  the  public  sector  means 
continuous  interactions  with  the  values  of  democ¬ 
racy  and  citizens’  involvement  and  with  the  need 
to  decentralize  activities  from  state  agencies  to  oth¬ 
ers.  In  the  coming  years,  the  public  sector  will 
undoubtedly  make  use  of  the  ethos  of  empower¬ 
ment  to  move  forward  with  reforms  and  change 
and  to  improve  services  using  self-mechanisms  of 
public  employees  and  citizens  alike. 
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Environmental  issues  are  concerned  with  human 
actions  that  affect  the  biosphere  that  humans  and 
other  species  inhabit.  Interest  in  the  conservation  of 
particular  species,  habitats,  or  landscapes,  in  par¬ 
ticular  localities  or  countries,  dates  back  to  the  19th 
century,  but  a  concern  with  international  environ¬ 
mental  issues  that  transcend  national  boundaries 
may  be  dated  back  to  the  beginning  of  the  1970s. 
The  European  Conservation  Year  in  1970,  the  first 
Earth  Day  in  1970,  and  the  United  Nations  (UN) 
Conference  on  the  Human  Environment  held  in 
Stockholm  in  1972,  which  initiated  the  UN 
Environmental  Programme,  raised  and  focused  the 
level  of  awareness  and  interest.  A  number  of  insti¬ 
tutional  developments  also  took  place  around  this 
time.  The  Environmental  Protection  Agency  in  the 
United  States  was  established  in  1970,  as  was  the 
Department  of  the  Environment  in  the  United 
Kingdom.  In  the  European  Community,  the  1972 
Paris  summit  of  heads  of  government  called  on  the 
Commission  to  draw  up  an  environmental  policy 
and  set  up  a  directorate  responsible  for  environ¬ 
mental  protection.  The  Community  published  its 


first  Environmental  Action  Programme  in  1973. 
New  nongovernmental  organizations  with  a 
broader  agenda  than  older  conservationist  bodies 
and  with  a  greater  reliance  on  methods  of  direct 
action  were  founded  around  this  time,  the  most 
important  being  Friends  of  the  Earth  and 
Greenpeace,  both  in  1971. 

Why  did  this  new  interest  in  environmental 
issues  emerge  at  this  time?  A  more  affluent  and 
better  educated  society  was  becoming  less  inter¬ 
ested  in  prosperity  per  se  and  more  interested  in  a 
broader  definition  of  the  quality  of  life.  Rachel 
Carson’s  best-selling  book  Silent  Spring  published 
in  1962,  which  dealt  with  the  effects  of  pesticide 
use  on  the  countryside,  raised  public  awareness  of 
the  fact  that  modern  technology  had  costs  as  well 
as  benefits.  The  fragility,  vulnerability,  and  beauty 
of  the  Earth  were  emphasized  by  the  first  pictures 
of  the  planet  taken  from  deep  space  in  1968.  This 
initial  period  of  the  environmental  debate  was  also 
influenced  by  a  concern  about  world  population 
growth  and  its  impact  on  scarce  natural  resources. 
Incidents  such  as  the  mercury  poisoning  at 
Minamata  Bay  in  Japan  in  1959  showed  that  pol¬ 
lution  could  have  serious  public  health  conse¬ 
quences,  a  lesson  reinforced  at  the  end  of  the 
1970s  by  the  Love  Canal  incident  in  the  United 
States  where  leaking  toxic  waste  affected  the 
health  of  children,  among  others. 

This  early  phase  of  the  environmental  debate 
was  characterized  by  a  more  integrated  ecological 
approach  to  the  problems  being  encountered,  in 
which  the  ecosphere  was  conceived  as  a  whole, 
with  changes  in  one  part  of  the  system  seen  as  hav¬ 
ing  effects  elsewhere.  This  approach  was  exempli¬ 
fied  by  the  1972  Club  of  Rome  Limits  to  Growth 
report.  The  analysis  carried  out  for  this  report  by 
researchers  at  the  Massachusetts  Institute  of 
Technology  (MIT)  used  computer  modeling  tech¬ 
niques  to  explore  the  relationships  between  a 
number  of  variables  such  as  industrialization, 
resource  depletion,  pollution,  food  production, 
and  population.  Seen  in  retrospect,  the  results 
were  far  too  pessimistic  and  highlight  the  dangers 
of  projecting  current  trends  into  the  future,  but  the 
report  was  highly  influential  at  the  time  and 
encouraged  consideration  of  the  environment  as 
an  interacting  system  rather  than  just  particular 
pollutants  that  had  been  the  focus  of  earlier  ad  hoc 
policy  interventions. 
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As  policy  measures  developed,  the  debate  became 
much  more  fragmented.  It  became  focused  on  par¬ 
ticular  pieces  of  legislation  concerned  with  various 
forms  of  pollution  (air,  water,  and  soil)  and  with 
habitats  and  biodiversity.  One  of  the  reasons  for 
this  fragmentation  was  that  devising  effective  pol¬ 
icy  measures  created  a  necessity  for  very  specific 
policy  expertise  that  was  not  readily  transferable 
from  one  arena  to  another.  An  expert  on  river  pol¬ 
lution  might  be  of  little  help  in  tackling  problems  of 
marine  pollution  and  certainly  could  not  contribute 
very  much  to  the  debate  on  substratospheric  air 
pollution.  One  risk  associated  with  this  more  frag¬ 
mented  approach  was  an  emphasis  on  legislative 
outputs  rather  than  a  focus  on  their  likely  impact 
on  outcomes  in  terms  of  the  state  of  the  environ¬ 
ment.  The  development  of  the  climate  change 
debate  has  given  a  central  organizing  narrative  to 
environmental  policy  once  again  but  perhaps  at  the 
expense  of  subordinating  other  aspects  of  environ¬ 
mental  policy  to  this  central  goal.  Thus,  the  protec¬ 
tion  of  biodiversity  may  be  viewed  through  the  lens 
of  the  effects  of  climate  change. 

The  Internationalization  of 
Environmental  Issues 

Initial  environmental  policy  responses  were  at  a 
national  or  European  Union  (EU)  level.  It  was  evi¬ 
dent  that  many  environmental  problems  crossed 
national  boundaries.  Within  Europe,  examples 
included  the  pollution  of  major  rivers  that  passed 
through  a  number  of  countries  such  as  the  Rhine 
or  the  effect  of  acid  rain  from  British  power  sta¬ 
tions  on  Norwegian  forests.  However,  the  problem 
of  the  thinning  of  the  ozone  layer  could  not  be 
tackled  at  a  regional  level  but  required  effective 
global  collective  action.  By  the  1980s,  the  threat 
from  ozone-depleting  chemicals  was  generally 
accepted.  There  was  evidence  of  substantial  thin¬ 
ning  of  the  ozone  layer  in  the  polar  regions. 
Associated  risks  included  an  increased  incidence  of 
skin  cancer,  crop  failure,  and  a  possible  contribu¬ 
tion  to  global  warming.  The  1985  Vienna 
Convention  provided  for  controls  on  the  produc¬ 
tion  of  ozone-depleting  chemicals — chlorofluorcar- 
bons  (CFCs).  The  1987  Montreal  Protocol  froze 
production  of  numerous  substances  believed  to  be 
responsible  for  ozone  depletion,  to  be  followed  by 
a  50%  reduction  by  the  end  of  the  century.  The 


1989  London  Amendment  agreed  to  phase  out  the 
most  dangerous  CFCs. 

These  agreements  were  an  example  of  success¬ 
ful,  effective  global  collective  action  to  tackle  an 
environmental  problem,  but  there  were  a  number 
of  specific  conditions  that  facilitated  success.  A 
single  major  cause  of  the  problem  was  identified 
and  accepted,  which  was  not  the  case  for  climate 
change.  This  made  it  relatively  easy  to  focus  on 
what  had  to  be  done.  Substitutes  were  available  for 
the  withdrawn  products  and  were  relatively  inex¬ 
pensive.  Action  did  not  require  cuts  in  industrial 
production  or  economic  growth. 

New  Concepts  in  the  Debate 

The  debate  on  environmental  issues  was  being 
driven  increasingly  by  international  reports  and 
events,  even  if  the  required  policies  needed  action  at 
national  and  local  levels,  often  supported  by 
increasingly  successful  Green  parties.  A  particularly 
influential  report  was  produced  by  the  UN  World 
Commission  on  Environment  and  Development  on 
Our  Common  Future ,  often  referred  to  as  the 
Brundtland  Report  after  its  chair.  The  report  intro¬ 
duced  the  concept  of  “sustainable  development,” 
which  focused  on  intergenera tional  justice.  It  was 
defined  as  meeting  “the  needs  of  the  present  with¬ 
out  compromising  the  ability  of  future  generations 
to  meet  their  own  needs.”  It  became  a  key  defining 
objective  of  many  environmental  policies. 

Another  concept  that  became  increasingly  influ¬ 
ential  was  that  of  the  “precautionary  principle.” 
This  originally  emerged  in  Germany  as  the  Vor- 
sorgeprinzip  or  principle  of  precaution.  Although 
it  initially  had  a  number  of  definitions,  it  became 
increasingly  influential  in  international  discourse. 
The  principle  was  first  incorporated  in  an  interna¬ 
tional  treaty  in  the  1987  Montreal  Protocol  and 
was  also  reflected  in  the  1992  Rio  Declaration  on 
Environment  and  Development.  At  the  center  of 
the  concept  is  the  idea  of  taking  anticipatory  action 
in  the  absence  of  complete  proof  of  harm  when 
there  is  scientific  uncertainty  about  causal  links. 
The  1998  Wingspread  Statement  produced  by  a 
group  of  lawyers,  scientists,  and  environmental 
activists  summarizes  the  principle  in  this  way: 
“When  an  activity  raises  threats  to  the  environ¬ 
ment  or  human  health,  precautionary  measures 
should  be  taken  even  if  some  causes  and  effect 


778  Environmental  Issues 


relationships  are  not  fully  established  scientifi¬ 
cally.”  A  communication  from  the  European 
Commission  on  the  precautionary  principle  in 
2001  emphasized  that  any  measures  based  on  it 
should  be  proportional  to  the  chosen  level  of  pro¬ 
tection,  nondiscriminatory  in  their  application, 
and  consistent  with  similar  measures  already  taken. 

The  assumption  that  there  was  a  zero-sum 
trade-off  between  environmental  protection  and 
economic  growth  was  challenged  by  the  concept  of 
“ecological  modernization.”  Following  the 
Brundtland  Report,  it  was  argued  that  a  high  level 
of  environmental  protection  was  essential  to  the 
success  of  a  modern  economy.  Increasing  energy 
efficiency,  reducing  pollution,  and  using  waste 
products  could  boost  the  profits  of  firms  and  the 
success  of  the  economy  as  a  whole.  The  environ¬ 
mental  technology  industry  could  itself  make  an 
important  contribution  to  economic  success,  par¬ 
ticularly  for  the  first  companies  to  enter  the  mar¬ 
ket,  as  has  happened  in  Germany  with  regard  to 
solar  technology.  Politicians  began  to  refer  to  the 
benefits  of  “green-collar”  jobs.  One  consequence 
of  this  new  school  of  thinking  was  less  emphasis 
on  “command  and  control”  forms  of  environmen¬ 
tal  regulation  and  more  emphasis  on  a  range  of 
“new  environmental  policy  instruments,”  such  as 
eco  audits,  voluntary  agreements,  and  labeling. 

Climate  Change 

Climate  change  refers  to  the  warming  of  the  planet 
as  the  result  of  human-made  greenhouse  gases, 
principally  carbon  dioxide,  but  also  with  signifi¬ 
cant  impacts  from  gases  such  as  methane,  nitrous 
oxide,  and  halocarbons.  Considerable  uncertainty 
attaches  to  the  consequences  of  these  changes  in 
the  atmosphere,  but  they  are  likely  to  include  a  rise 
in  sea  levels,  changes  in  weather  patterns,  and 
more  extreme  weather  events.  There  is  a  global 
distributional  issue  attached  to  climate  change  as 
those  most  likely  to  lose  out  as  a  result  are  poor 
low-lying  countries  that  have  low  emissions  them¬ 
selves  and  countries  in  sub-Saharan  Africa  that 
may  experience  an  increased  incidence  of  drought, 
affecting  their  agriculture. 

Devising  an  effective  set  of  policy  solutions  to 
the  challenge  of  climate  change  is  difficult.  One  is 
dealing  not  only  with  considerable  uncertainty  but 
with  future  uncertainty.  The  problem  is  caused  by 


everyone  and  affects  everyone  on  the  planet,  so 
there  is  an  evident  public  issue  that  needs  to  be 
tackled,  but  there  are  also  collective  action  and 
free-rider  problems.  The  scale  of  international 
cooperation  required  is  unprecedented.  Much  of 
business  was  initially  opposed  to  any  action  being 
taken,  trying  to  argue  that  climate  change  was  not 
occurring,  or  if  it  was,  it  was  not  caused  by  human 
agency.  However,  being  a  “climate  change  denier” 
has  become  a  less  tenable  and  respectable  position, 
while  emissions  trading  represents  a  policy 
approach  to  the  problem  that  business  generally 
finds  acceptable.  The  2006  Stern  Report  high¬ 
lighted  the  economic  costs  of  inaction.  The  United 
States  refused  to  sign  the  1997  Kyoto  protocol,  but 
both  candidates  in  the  2008  presidential  election 
supported  action  on  climate  change.  Emerging 
countries  such  as  China  and  India  objected  to  hav¬ 
ing  to  reduce  their  emissions  when  the  problem 
had  been  caused  by  countries  that  developed 
before  them.  However,  even  if  an  effective  interna¬ 
tional  agreement  can  be  reached,  greenhouse  gas 
emissions  result  from  billions  of  independent  deci¬ 
sions  by  households  and  firms.  The  policy  instru¬ 
ments  available  to  governments,  such  as  informa¬ 
tion,  incentives,  and  regulations,  may  prove  to  be 
inadequate. 
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Environmental  Policy 


For  as  long  as  humans  have  existed  on  planet 
Earth,  they  have  sought  to  alter  it  to  suit  their  pur¬ 
poses.  In  its  most  basic  sense,  environmental  policy 
seeks  to  govern  the  relationship  between  humans 
and  their  natural  environment.  A  relatively  new 
area  of  state  activity,  it  has  grown  enormously, 
particularly  since  the  1960s.  Indeed,  its  spectacular 
rate  and  extent  of  growth  has  been  such  that  it  is 
no  longer  a  small  and  fairly  discrete  area  of  policy 
but  one  that  increasingly  intrudes  into  virtually  all 
other  policy  areas.  At  its  core  is  an  identifiable 
“environmental  state”  comprising  specific  minis¬ 
tries,  agencies,  and  organizations  whose  mandate 
is  to  secure  environmental  improvements. 
Underpinning  environmental  politics  is  the  tense 
relationship  between  this  state  and  its  opposite 
numbers  representing  the  social  and  economic 
realms.  As  the  following  sections  show,  this  inher¬ 
ent  tension  can  be  better  understood  first  through 
tracing  the  historical  evolution  of  environmental 
policy,  then  outlining  its  distinctive  characteristics, 
and  finally  by  identifying  its  underlying  rationales. 
While  this  overview  would  suggest  that  environ¬ 
mental  policy  development  has  resulted  in  a  great 
deal  of  state  building,  the  overall  impacts  on  the 
state  of  the  environment  are  not  so  clear. 

The  Main  Phases  of  Environmental 
Policy  Making 

One  way  to  understand  environmental  policy  is  to 
trace  its  historical  evolution.  Modern  environmen¬ 
tal  thought  dates  back  to  the  preindustrial  period. 
Before  the  1960s,  environmental  policy  making 
was  primarily  geared  toward  protecting  human 
health  from  pollution  and  establishing  designated 
areas  of  green  space  for  leisure  activities.  These 
ends  were  mainly  achieved  by  limiting  pollution 


from  point  sources  such  as  factories  and  establish¬ 
ing  protected  areas  and  national  parks. 

In  a  second  phase  broadly  encompassing  the 
1960s  and  1970s,  environmental  policy  really  took 
off  and  environmental  pressure  groups  boomed.  It 
is  often  said  that  the  first  evocative  images  of  the 
Earth  transmitted  from  deep  space  in  1968  were 
what  really  catalyzed  public  concern  for  what 
became  known  as  “the  environment.”  The  envi¬ 
ronmental  state  was  born  in  this  period  as  many 
jurisdictions  created  environmental  ministries  and 
agencies.  However,  these  responses  remained  iso¬ 
lated,  uncoordinated,  and  generally  reactive. 
Policies  adopted  were  mainly  regulatory  in  nature, 
specifying  process  and  emission  standards;  many 
were  very  poorly  implemented. 

In  the  1980s  and  1990s,  environmental  policy 
entered  a  third  phase.  The  sector  witnessed  a  huge 
expansion  in  the  scale  of  the  new  environmental 
states,  which  sought  to  address  problems  in  a  more 
preventive  manner  at  source.  Old  ways  of  thinking 
(such  as  “pollution  control”)  duly  gave  way  to 
new  ones  (“ecological  modernization”  and  “sus¬ 
tainable  development”).  Rather  than  promoting 
end-of-pipe  standards,  these  sought  to  embed  envi¬ 
ronmental  concerns  within  wider  systems  of  human 
production  and  consumption.  More  specifically, 
they  supplemented  “command  and  control”  regu¬ 
lations  with  new  environmental  policy  instruments 
such  as  eco-taxes,  voluntary  agreements,  and 
product-labeling  devices.  The  new  environmental 
states  also  began  seeking  innovative  ways  to  coop¬ 
erate  on  addressing  cross-border  pollution  and 
resource  overexploitation.  A  number  of  interna¬ 
tional  environmental  agreements  mushroomed  in 
this  phase,  and  supranational  bodies  such  as  the 
European  Union  began  to  assume  an  ever-greater 
role  in  environmental  problem  solving,  as  shown 
by  Andrew  Jordan  (2005). 

Environmental  policy  in  the  industrialized  world 
is  currently  in  a  fourth  phase,  roughly  covering  the 
period  since  2000.  This  phase  has  witnessed  another 
set  of  policy  developments  as  the  implications  of 
environmental  policy  have  ramified  into  ever  more 
cognate  policy  domains.  In  this  phase,  it  has  become 
even  more  obvious  that  environmental  problems 
arise  not  from  single-point  sources  such  as  factories 
and  power  stations  but  from  countless  everyday 
social  practices  such  as  travelling,  shopping,  and 
even  using  the  Internet.  State  action,  by  implication, 
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is  a  necessary  precondition  for  progress,  but  soci¬ 
ety  too  must  play  its  part  in  nurturing  and  deliver¬ 
ing  environmental  policy. 

The  issue  that  most  clearly  encapsulates  these 
challenges  is  that  of  climate  change — arguably  the 
greatest  environmental  threat  the  world  has  faced 
(Andrew  Jordan,  Dave  Huitema,  Harro  van  Asselt, 
Tim  Rayner,  &  Frans  Berkhout,  2010).  Crucially, 
it  links  into  many  other  social  and  economic  chal¬ 
lenges  such  as  poverty  reduction,  energy  insecu¬ 
rity,  and  massive  biodiversity  loss  and  is  stub¬ 
bornly  resistant  to  simple  “techno-fixes.”  Climate 
change  cannot,  in  short,  be  left  entirely  to  environ¬ 
mental  policymakers,  but  it  demands  long-term 
and  socially  transformative  adjustments — a  sys¬ 
tematic  “carbon  revolution”  no  less — which  mir¬ 
rors  the  Industrial  Revolution  of  the  19th  century. 

It  is  hugely  important  to  be  aware  of  the  differ¬ 
ences  between  these  phases  while  at  the  same  time 
remaining  cognizant  of  the  underlying  continu¬ 
ities.  The  best  way  to  appreciate  the  importance  of 
both  is  to  explore  what  distinguishes  the  environ¬ 
ment  from  other  policy  areas. 

What  Is  Distinctive  About 
Environmental  Policy? 

Environmental  quality  exemplifies  what  econo¬ 
mists  term  a  nonexcludable  or  “public”  good.  In 
other  words,  the  benefits  of  higher  environmental 
quality  often  accrue  to  the  wider  public  rather  than 
to  specific  individuals.  By  contrast,  the  costs  of 
protection  (e.g.,  the  fitting  of  abatement  equip¬ 
ment)  tend  to  be  borne  by  specific  groups  (princi¬ 
pally  those  that  pollute  or  who  are  prevented  from 
developing  their  land).  Consequently,  there  is  a 
basic  asymmetry  between  those  that  pollute  (in  the 
initial  phases  noted  above,  generally  a  minority) 
and  those  (the  wider  public)  suffering  the  conse¬ 
quences.  Polluters  have  strong  incentives  to  unite 
and  fight  to  protect  their  “rights”  to  pollute  and 
have  therefore  tended  to  enjoy  advantages  over 
those  enduring  its  disbenefits.  The  latter  are  often 
too  widely  dispersed  or  individually  suffer  too 
little  to  mobilize  into  a  coherent  group. 

In  the  third  and  fourth  phases,  this  asymmetry 
has  arguably  become  less  stark  as  mass  consump¬ 
tion  has  gradually  eclipsed  dirty  “point  sources”  as 
the  main  driver  of  environmental  change.  Nonethe¬ 
less,  the  state’s  role  is  still  to  mediate  between  those 


supporting  and  those  opposing  development. 
Because  environmental  damage  has  its  origins  in 
otherwise  socially  legitimate  activities  such  as  driv¬ 
ing,  using  electricity,  and  farming,  the  state  is  rou¬ 
tinely  called  on  to  limit  any  damage  imposed  by 
one  section  on  another.  Initially,  this  was  achieved 
by  regulating  polluting  activities.  As  regulating  is  a 
technically  demanding  process,  the  state  found 
itself  relying  heavily  on  polluters  and  scientists  for 
information  and,  eventually,  political  support. 
Environmentalists  routinely  claimed  that  this 
skewed  regulation  was  in  favor  of  producer  inter¬ 
ests.  In  Phases  3  and  4,  the  state  has  therefore 
come  under  pressure  to  explore  alternative  instru¬ 
ments  to  promote  more  sustainable  forms  of  mass 
consumption  (W.  Neil  Adger  &  Andrew  Jordan, 
2009). 

Environmental  policy  tends  to  be  heavily  reliant 
on  science,  both  to  establish  the  extent  of  environ¬ 
mental  problems  and  to  determine  how  to  resolve 
them.  However,  science  is  rarely  capable  of  deliv¬ 
ering  clear-cut  advice  to  policymakers  in  this 
regard.  On  the  contrary,  different  interests,  includ¬ 
ing  governments,  tend  to  choose  and  interpret  sci¬ 
entific  findings  selectively  in  order  to  justify  their 
preconceived  views.  These  tensions  are  most  acute 
when  policymakers  are  required  to  go  “beyond 
science”  and  address  problems,  such  as  climate 
change,  whose  causes  and  consequences  are  not 
precisely  understood. 

Environmental  problems  are  highly  expan¬ 
sive — they  spill  across  political  borders  and  create 
problems  extending  well  into  the  future.  The  pre¬ 
cise  spatial  and  temporal  distribution  of  costs  and 
benefits  produces  alliances  of  self-interest,  which 
then  battle  to  determine  the  scope  and  content  of 
policy.  This  is  particularly  visible  in  cross-national 
settings.  International  environmental  policy  bur¬ 
geoned  in  the  third  and  fourth  phases  noted 
above. 

Balancing  these  various  factors  is  the  core  policy 
challenge  in  the  environmental  policy  sector. 
According  to  widely  adopted  interpretations  of 
sustainable  development  (see  above),  policymakers 
should  address  the  development  needs  of  current 
generations  of  humans  without  compromising  the 
ability  of  future  generations  to  meet  theirs.  While 
there  is  general  agreement  that  sustainable  devel¬ 
opment  is  preferential  to  unsustainable  develop¬ 
ment,  moving  from  the  latter  to  the  former  raises 
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difficult  questions  centered  on  justice  and  moral¬ 
ity,  to  say  nothing  of  due  democratic  process. 

Fifth,  environmental  policy  is  probably  unique  in 
the  extent  to  which  it  cuts  across  traditional  sectors 
of  policy  making.  One  great  unmet  challenge  identi¬ 
fied  in  the  third  phase  has  been  how  to  integrate 
environmental  concerns  into  and  across  all  policy 
areas.  The  irony  is  that  were  this  to  happen,  there 
would  be  no  need  for  an  identifiable  “environmen¬ 
tal”  policy.  For  the  time  being,  this  seems  an  unlikely 
prospect:  Although  environmental  arguments  tend 
to  find  a  more  receptive  audience  in  policy  networks 
surrounding  environment  departments,  they  clash 
powerfully  with  producer-oriented  networks  found 
in  the  agriculture,  transport,  and  energy  sectors. 
Environmental  policymakers  are  attracted  to  inte¬ 
gration  because  it  potentially  helps  tackle  problems 
at  source.  As  Andrew  J.  Jordan  and  Andrea 
Lenschow  (2008)  note,  the  danger,  however,  is  that 
the  sectors  may  only  accept  integration  on  their 
own  terms  or,  worse  still,  take  active  steps  to 
“reverse  integrate”  environmental  policy  with  their 
preferred  (sectoral)  objectives.  Integration  in  other 
words,  is  a  two-way,  not  a  one-way,  street. 

Sixth,  environmental  policy  is  probably  the  only 
policy  domain  that  seeks  to  actively  protect  non¬ 
human  entities.  But  to  what  extent  should  moral 
significance  be  ascribed  to  plants  and  animals, 
especially  if  it  involves  limiting  human  choice  and 
raising  the  economic  costs  of  development?  This 
has  opened  up  many  ethically  charged  debates 
between  those  adopting  broadly  anthropocentric 
positions  and  those  favoring  more  ecocentric  ones. 
The  former  regards  nature  as  primarily  a  source  of 
resources  and  amenity;  the  latter  holds  that  the 
nonhuman  world  should  enjoy  much  greater  (and 
in  some  cases  equivalent)  moral  significance. 
Although  there  is  plenty  of  evidence  to  suggest  that 
people  are  more  ecocentric  than  they  were  in  the 
1960s,  many  (if  not  all)  contemporary  environ¬ 
mental  conflicts  are  still  driven  by  the  clash 
between  these  two  worldviews. 

What  Drives  Environmental  Policy? 

Ultimately,  the  main  source  of  environmental  pol¬ 
icy  making  is  environmentalism,  a  broad  social 
movement  that  emerged  in  the  late  1960s  and 
loudly  demanded  that  “quality  of  life”  issues  should 
receive  more  attention.  In  the  past,  environmental 


concern  has  tended  to  exhibit  a  cyclical  pattern, 
with  particularly  pronounced  peaks  in  support  in 
the  late  1960s  and  the  late  1980s.  Closer  scrutiny 
reveals  that  these  short-term  “pulses”  often  coin¬ 
cided  with  periods  of  economic  growth  and  social 
introspection.  Conversely,  concern  has  tended  to 
tail  off  during  economic  recession.  It  is  not  nor¬ 
mally  as  pronounced  in  poorer  sections  of  Western 
society  or  in  developing  countries,  although  aca¬ 
demic  views  differ  on  this  matter.  Looked  at  over 
longer  time  periods,  an  underlying  effect  is  evi¬ 
dent — each  policy  phase  has  provided  the  founda¬ 
tions  for  the  next.  During  the  third  and  fourth 
phases  in  particular,  environmental  policymakers 
sought  new  ways  to  institutionalize  environmental 
concern  and  escape  what  Anthony  Downs  termed 
the  episodic  “issue  attention  cycle.” 

Environmental  Policy: 

Retrospect  and  Prospect 

Environmental  policy  addresses  the  complex  and 
reciprocal  links  between  the  underlying  structure 
of  modern  economies  and  societies  and  the  quality 
of  their  natural  environment.  As  such,  it  is  proba¬ 
bly  unique  in  its  breadth  and  its  political  complex¬ 
ity.  Since  the  1960s,  a  new  “environmental  state” 
has  evolved  to  produce  and  implement  it,  parallel¬ 
ing  the  emergence  of  the  welfare  state  in  the  19th 
and  20th  centuries.  Both  represent  lagged  and 
socially  mediated  responses  to  the  negative  effects 
of  industrialization.  And  naturally,  both  are  highly 
fertile  grounds  for  political  science  and  public 
policy  research.  But  what  has  all  this  activity  pro¬ 
duced?  While  environmental  policy  has  witnessed 
a  great  deal  of  state  building,  its  overall  impacts 
are  still  far  from  clear.  For  sure,  the  most  damag¬ 
ing  forms  of  air  and  water  pollution  have  largely 
been  contained  in  the  most  industrialized  states. 
Yet  efforts  to  address  the  creeping  decline  in  eco¬ 
system  health  reported  by  the  Millennium 
Ecosystem  Assessment  (2005)  as  well  as  to  stabi¬ 
lize  Earth’s  climatic  system  have  been  continually 
overwhelmed  by  the  ever-growing  scale  and  speed 
of  human  activity.  Consequently,  environmental 
policy  remains,  at  best,  a  work  in  progress. 

Andrew  Jordan  and  David  Benson 
University  of  East  Anglia 
Norwich,  United  Kingdom 
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Environmental  Security 
Studies 


Environmental  security  studies  analyze  both  the 
confluence  of  foreign  policy  and  environmental 
security,  particularly  in  the  aftermath  of  the  Cold 
War,  and  the  linkages  between  security  coopera¬ 
tion  and  global  environmental  risks.  They  result 
from  an  opening  in  the  fields  of  foreign  policy, 
defense  studies,  and  security  cooperation,  wherein 
researchers  have  tended  to  move  away  from  a  nar¬ 
rowly  defined  military  and  strategic  understand¬ 
ing  of  threat,  vulnerability,  risk,  and  national 
interest.  In  a  post-Cold  War  era  characterized  by 
growing  transnational  flows,  emerging  environ¬ 
mental  movements,  and  a  global  political  process 
of  ecological  awareness  building,  the  changes  pro¬ 
duced  in  the  regional  and  global  environment 
have  become  a  critical  area  on  the  security  agenda. 
However,  the  term  environmental  security  has 
acquired  multiple  meanings  and  has  gone  through 
contentious  academic  and  policy  debates  about 
how  environment  and  security  can  be  connected. 


Neither  scientific  programs  nor  policy  agendas 
have  so  far  been  able  to  produce  a  commonly 
agreed  definition  of  environmental  security,  which 
also  shows  the  academic  relevance  and  the  policy¬ 
building  interest  of  continuous  reflection  in  this 
research  field.  However,  the  same  goes  for  the 
concept  of  security  itself;  as  Edward  Page  (2000) 
affirms,  environmental  security  appears  to  be  a 
clear  example  of  a  concept  that  is  prone  to  end¬ 
less,  irresolvable  dispute  regarding  its  meaning 
and  policy  application.  This  entry  discusses  how 
concerns  with  environmental  security  have  evolved 
since  the  Cold  War  era  and  then  examines  its 
place  in  foreign  policy  today. 

Origin  and  Evolution  of  Concerns 
With  Environmental  Security 

In  the  1960s,  the  first  efforts  were  directed  toward 
articulating  issues  about  the  environment  and 
security.  They  were  generally  concerned  with  the 
impact  produced  by  humans  on  the  security  of 
nature,  animals,  and  plants  and  also  with  the  links 
between  development  and  quality  of  life.  These 
first  interpretations  of  environmental  security  have 
shed  light  on  the  misuse  of  pesticides  and  fertilizers 
and  the  threats  to  the  environment  associated  with 
nuclear  energy.  As  in  the  cases  of  Rachel  Carson’s 
book  Silent  Spring  (1962),  Aldous  Huxley’s  article 
“The  Politics  of  Ecology:  The  Question  of  Survival” 
(1963),  and  Kenneth  E.  Boulding’s  notion  of 
spaceship  Earth  (1966),  these  perspectives  have 
emphasized  the  need  to  secure  the  integrity  of  eco¬ 
systems  in  order  to  ensure  mankind’s  survival. 
Mercury  poisoning  associated  with  the  Chisso- 
Minamata  disease  (1959)  and  the  Torrey  Canyon 
shipwreck  off  the  western  coast  of  Cornwall  in 
England  (1967),  among  other  environmental  disas¬ 
ters,  have  played  a  key  role  in  this  context. 

In  the  1970s,  other  readings  analyzing  the  envi¬ 
ronmental  effects  of  war  and  refugee  movements 
developed,  and  research  also  tackled  issues  related 
to  the  link  between  the  environment  and  develop¬ 
ment.  This  was  the  case  with  the  “Only  One 
Earth”  report  written  by  Barbara  Ward  and  Rene 
Dubos,  Edward  Goldsmith’s  “Blueprint  for 
Survival,”  and  the  “Limits  to  Growth  Meadows” 
report,  all  of  them  published  in  1972.  They  all 
expressed  an  overreaction  to  some  of  the  supposed 
dangers  of  environmental  degradation  to  human 
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security.  The  United  Nations  (UN)  Conference  on 
Human  Environment  (Stockholm,  1972),  despite 
the  fact  that  it  ended  with  a  largely  dead  minimal¬ 
ist  consensus  among  states  on  26  principles  and 
109  nonbinding  recommendations,  has  paved  the 
way  for  future  environmental  cooperation,  led  to 
the  establishment  of  global  and  regional  environ¬ 
mental  monitoring  networks,  and  allowed  the 
creation  of  the  UN  Environmental  Program  in 
Nairobi.  In  its  final  declaration,  there  is  not  a  sin¬ 
gle  reference  to  environmental  security,  although 
its  Principle  21  states  that  “States  have  .  .  .  the 
responsibility  to  ensure  that  activities  within  their 
jurisdiction  or  control  do  not  cause  damage  to  the 
environment  of  other  states  or  of  areas  beyond 
the  limits  of  national  jurisdiction.”  Herein  resides 
the  idea  of  environmental  threats  to  the  national 
interest  of  a  state,  which  should  eventually  pro¬ 
duce  a  change  in  the  conception  of  foreign  and 
defense  politics. 

Since  the  1992  UN  Conference  on  Environment 
and  Development  held  in  Rio  de  Janeiro,  most  of 
the  academic  debate  has  been  conceptual,  leading 
to  the  creation  of  several  schools  of  thought.  Some 
researchers  have  analyzed  the  significance  of  envi¬ 
ronmental  stress  as  a  contributing  factor  to  con¬ 
flict  in  many  developing  countries,  generally  in  an 
attempt  to  relate  environmental  scarcity  (environ¬ 
mental  degradation  or  the  depletion  of  renewable 
resources)  to  violent  conflict.  Research  has  also 
developed  on  environmental  change  produced  by 
human  activities  and  conflicts  and  shifted  to  exam¬ 
ining  how  environmental  stress  may  contribute  to 
conflict  in  combination  with  other  relevant  fac¬ 
tors.  Since  1989,  for  instance,  Thomas  Homer- 
Dixon  and  his  team  have  produced  a  significant 
body  of  research  on  the  connections  between 
large-scale  human-induced  environmental  stress 
and  mass  violence  in  developing  countries.  This 
team  of  researchers  has  analyzed  the  links  between 
water  scarcity  and  conflict,  rapid  urbanization  and 
urban  violence,  and  environmentally  induced 
migration  and  ethnic  violence,  among  other  issues, 
having  conducted  case  studies  of  environmental 
stress  and  violence  in  Rwanda,  South  Africa, 
Bangladesh,  India,  Pakistan,  China,  Nicaragua, 
Mexico,  and  so  on. 

More  recently,  research  has  highlighted  the 
importance  of  disputes  arising  from  access  to  natu¬ 
ral  resources  such  as  water,  oil,  or  minerals  for 


strategic  purposes.  Some  researchers  call  them  new 
wars.  It  is  clear  that  for  some,  what  needs  to  be 
secured  remains  predominantly  the  survival  of  the 
state,  and  in  this  sense  environmental  insecurity 
can  be  read  as  synonymous  with  environmental 
threats  to  the  nation-state  (e.g.,  military  operations 
may  have  an  impact  on  the  environment  or  the 
environment  may  be  manipulated  or  contaminated 
as  a  target  during  wars).  For  other  researchers, 
environmental  security  has  a  much  broader  scope, 
including  the  security  of  individuals  and  communi¬ 
ties  and  the  security  of  nature  itself.  Some  more 
critical  researchers  associate  environmental  secu¬ 
rity  with  issues  of  inequality  in  North-South  rela¬ 
tions,  social  class,  and  revenue  distribution.  To 
them,  the  central  research  questions  are  “Whose 
security  is  at  stake?”  and  “Who  defines  the  need  to 
protect  what  and  for  whom?” 

These  theoretical  debates  also  show  clearly  that 
the  relationship  between  the  environment  and 
security  is  historically  and  politically  complex  and 
controversial  and  that  many  factors  play  a  role  in 
it.  The  cause-and-effect  relationships  between  ten¬ 
sions  and  vulnerabilities  are  multidimensional,  and 
the  links  between  the  various  political  and  eco¬ 
nomic  components  may  be  direct  or  indirect. 
These  vibrant  discussions  also  reflect  different 
concepts  of  nature  and  the  very  distinct  social  rep¬ 
resentations  of  what  should  get  counted  as  part  of 
the  environment.  One  of  the  dangers  of  some 
approaches  is  that  they  often  dichotomize  human 
beings  and  nature.  On  the  one  hand,  deep  ecology 
may  often  see  nature  as  a  subject  in  itself — a  sub¬ 
ject  that  requires  protection  and  for  which  techni¬ 
cal  norms  should  be  set  up.  On  the  other  hand, 
anthropocentric  visions  tend  to  promote  human 
interests  over  those  of  the  environment.  If  one 
understands  the  notion  of  environment  as  includ¬ 
ing  humans,  then  the  way  one  defines  environmen¬ 
tal  problems  alters,  and  one  may  come  to  reformu¬ 
late  the  concept  of  environmental  security  in  terms 
of  broader  human  security.  What  constitutes  the 
environment  is,  in  this  respect,  a  key  conceptual 
matter  for  our  understanding  of  environmental 
security. 

Environmental  Security  and  Foreign  Policy 

Foreign  policy  has  traditionally  referred  to  a  series 
of  actions  followed  by  one  nation  to  deal  with 
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another  nation,  another  region,  or  an  international 
issue  in  a  multilateral  setting.  In  this  connection,  a 
state’s  foreign  policy  would  be  based  on  values 
(democracy,  promotion  of  peace  and  human  rights, 
rule  of  law,  etc.)  and  national  interest  and  would 
usually  be  aimed  at  preserving  the  state’s  economic 
and  political  goals  abroad  as  well  as  its  position  in 
the  world.  Foreign  policy  analysis  has  classically 
oriented  its  lenses  to  the  actions  of  national  gov¬ 
ernments,  principally  those  related  to  diplomatic 
or  military  issues,  thus  reflecting  an  overly  simpli¬ 
fied  view  of  a  world  justified  in  the  name  of  a  well- 
established  sense  of  national  interest  and  threats  to 
national  security.  Today,  the  media,  the  corporate 
sector,  lobbyists,  think  tanks,  environmental  and 
human  rights  nongovernmental  organizations,  eth¬ 
nic  groups,  and  social  movements,  among  other 
actors,  play  an  increasingly  important  role  in  try¬ 
ing  to  influence  foreign  policy  in  many  countries. 
Furthermore,  with  the  end  of  the  Cold  War,  the  in/ 
out  divide  has  lost  its  absolute  relevance,  and  there 
has  been  a  keener  interest  in  domestic  policy 
issues,  such  as  energy  policies,  ecological  and 
health  issues,  the  use  of  chemicals  in  agriculture, 
and  the  impact  of  genetically  modified  organisms 
in  the  environment.  These  developments  have 
deeply  changed  the  analytic  field  of  foreign  policy, 
its  agendas,  decision-making  processes,  and  the 
definition  of  the  social  practices  of  its  stakehold¬ 
ers.  Taking  into  account  environmental  security  is 
but  another  intellectual  and  political  challenge  put 
forward  to  both  the  epistemic  community  and  the 
world  of  practitioners  of  foreign  policy. 

In  fact,  Braden  Allenby  (2000)  suggests  that 
environmental  security  may  be  an  important  evolu¬ 
tion  in  the  definition  of  what  constitutes  the 
national  interest  of  states  in  international  policy 
systems.  Because  the  previous  apparent  stability 
generated  by  the  ideological  confrontation  between 
capitalism  and  communism  has  broken  down, 
global  environmental  changes  and  the  need  to  pro¬ 
mote  environmental  security  worldwide  potentially 
affect  both  foreign  policy  and  security  agendas 
more  easily.  Environmental  issues  can  no  longer  be 
thought  of  as  merely  subsidiary  components  of 
industrial,  social,  political,  and  economic  systems. 
There  is  a  movement  of  environmental  problems 
from  an  “overhead”  to  a  “strategic”  position  on 
the  global  agenda.  This  movement  from  “periph¬ 
ery”  to  the  “center”  occurs  at  many  different  scales 


within  firms,  industrial  sectors,  society  itself,  and 
governments.  They  may  differ  in  detail  and  time¬ 
table  but  have  many  fundamental  similarities  in 
each  case  and  across  regions — both  North-South 
and  East-West. 

This  happens  also  because  there  is  an  important 
constellation  of  environmental  issues  (energy, 
access  to  natural  resources,  climate  change  and 
sea-level  increases,  biodiversity-related  wealth, 
dangerous  waste  management,  etc.)  that  support 
the  fabric  of  a  country’s  security  and  its  territorial 
integrity.  They  may  also  threaten  and  be  of  most 
concern  for  foreign  policy  analysts  and  decision 
makers.  They  challenge  two  comfortable  classical 
assumptions  that  have  been  at  the  basis  of  tradi¬ 
tional  views  of  foreign  policy:  (1)  that  the  nation¬ 
state  is  relatively  absolute  in  the  definition  of  its 
sovereign  security  interests  and  that  (2)  only  inter¬ 
state  relations  (particularly  within  Cold  War  con¬ 
flicts  between  capitalism  and  communism)  define 
global  geopolitics.  From  a  state-centric  perspec¬ 
tive,  security  is  national;  thus,  environmental 
threats  to  national  security  should  be  dealt  with  by 
national  authorities. 

The  integration  of  the  notion  of  environmental 
security  (and  insecurity)  into  the  field  of  foreign 
policy  makes  these  assumptions,  at  least  in  their 
absolute  form,  much  less  valid. 

It  is  in  this  connection  that  the  Worldwatch 
Institute  had  already  demanded,  in  1977,  a  broad¬ 
ening  of  the  sense  of  national  security.  Lester  R. 
Brown,  in  “Redefining  National  Security,”  had 
reaffirmed  that  threats  to  countries  and  societies 
should  stem  less  from  relations  with  nations  and 
more  from  relations  with  nature.  Jessica  Mathews 
makes  the  same  argument  in  her  1989  article  on 
“Redefining  Security.”  Even  within  the  more  tradi¬ 
tional  realist  scholarship,  there  is  an  emphasis  on 
this  link;  for  example,  George  Kennan  claimed  (in 
1985)  that  the  degradation  of  the  environment  is 
one  of  two  threats  facing  mankind,  the  other  being 
military  and  terrorist  attacks.  Extreme  versions  of 
realism,  such  as  the  geopolitical  theories  of  major- 
general  Karl  Haushofer,  look  at  the  security  impli¬ 
cations  of  strategic  raw  materials.  Both  German 
and  Japanese  expansion  in  the  1930s  would  partly 
have  been  a  search  for  raw  materials.  Some 
researchers  see  President  George  Bush’s  interven¬ 
tion  in  Iraq  in  2003  as  an  attempt  to  secure  the  oil 
resources  of  the  Middle  East. 
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Indeed,  the  foreign  policy  of  the  United  States  of 
America  has  begun  to  evolve  in  response  to  these 
recent  environmental  security  challenges,  in  part 
by  recognizing  the  need  to  manage  a  new  set  of 
issues,  broadening  the  generally  accepted  and  cozy 
concept  of  security  as  part  of  a  larger  national 
security  mission.  In  a  speech  at  Stanford  University 
in  1996,  the  then  U.S.  Secretary  of  State  Warren 
Christopher  had  explicitly  recognized  the  need  to 
include  additional  dimensions  in  American  foreign 
policy,  noting  that  “our  Administration  has  recog¬ 
nized  from  the  beginning  that  our  ability  to 
advance  our  global  interests  is  inextricably  linked 
to  how  we  manage  the  Earth’s  natural  resources,” 
and  the  great  relevance  to  “contend  with  the  vast 
new  danger  posed  to  our  national  interests  by 
damage  to  the  environment  and  resulting  global 
and  regional  instability.”  Irrespective  of  the  rhe¬ 
torical  dimension  of  such  a  statement,  the  U.S. 
secretary  of  state  then  concluded  on  a  strong  note: 
“That  is  why  we  are  determined  to  put  environ¬ 
mental  issues  where  they  belong:  in  the  main¬ 
stream  of  American  foreign  policy.  ”  In  the  case  of 
Japan,  in  his  1980  “Report  on  Comprehensive 
National  Security,”  the  then  prime  minister  of 
Japan  asserted  that  security  would  encompass  eco¬ 
nomic  vulnerabilities,  natural  disasters,  environ¬ 
mental  degradation,  and  ecological  imbalances. 

These  developments  confirm  what  Thomas 
Homer-Dixon  and  Jay  Blitt,  in  their  1998  book 
Ecoviolence ,  had  pointed  to  as  their  key  findings  in 
the  analysis  of  linkages  between  environmental 
scarcity  and  foreign  policy: 

1.  Scarcities  of  renewable  resources  produce  civil 
violence,  instability,  and  conflict. 

2.  The  degradation  and  depletion  of  renewable 
resources  causes  environmental  scarcity. 

3.  Powerful  groups  capture  valuable  resources, 
while  marginal  groups  migrate  to  ecologically 
sensitive  areas. 

4.  Environmental  scarcity  constrains  economic 
development  and  produces  migration. 

5.  Existing  distinctions  between  social  groups  may 
be  sharpened  by  environmental  scarcity. 

6.  Environmental  scarcity  weakens  governmental 
institutions  and  states. 


7.  Environmental  scarcity  can  also  cause  ethnic 
conflicts,  insurgencies,  and  coups  d’etat. 

8.  The  international  community  can  be  indirectly 
affected  by  these  conflicts  generated  by 
environmental  scarcity. 

These  discoveries  support  the  assumptions  of 
political  ecology  researchers  who  argue  that  envi¬ 
ronmental  degradation  often  leads  to  economic 
underdevelopment,  which  can  in  turn  diminish 
political  stability  in  countries  and  societies  in  the 
South. 

Global  Environmental  Risks 
and  Security  Cooperation 

There  is  no  doubt  that  today’s  human  activities 
constitute  a  significant  and  frequently  destructive 
feature  of  Earth’s  ecosystem.  Interpretations  may 
vary,  but  UN  sources  suggest  that  around  40%  of 
the  entire  natural  photosynthetic  product  of  the 
biosphere  is  appropriated  by  human  beings.  It  is 
clear  that  the  world  faces  a  global  environmental 
crisis  and,  inseparable  from  this,  a  coincidental 
crisis  of  growing  social  inequality  and  poverty. 
Unprecedented  environmental  and  social  changes 
pose  huge  challenges  to  the  interstate  system,  one 
key  question  being  if  and  how  a  global  set  of  local 
sovereign  states  would  be  capable  of  saving  the 
biosphere  from  environmental  degradation  and 
pollution.  Many  signs  indicate  a  need  for  society’s 
cross-sectoral  attention  to  the  environment  as  an 
underlying  security  issue.  This  sentiment  had 
echoed  through  the  literature  on  environmental 
and  ecological  problems  and  also  through  the 
opening  words  of  the  Brundtland  Report  in  1987: 
“The  Earth  is  one  but  the  world  is  not.  We  all 
depend  on  our  biosphere  for  sustaining  our  lives. 
Yet  each  community,  each  country,  strives  for  sur¬ 
vival  and  prosperity  with  little  regard  for  its  impact 
on  others”  (World  Commission  on  Environment 
and  Development,  1987). 

Although  security  tends  to  be  defined  only  from 
a  military  perspective  and  is  thus  not  open  to  envi¬ 
ronmental  concerns,  since  the  end  of  the  1980s 
some  debate  has  been  devoted  to  global  environ¬ 
mental  risks  and  to  the  threats  that  they  represent 
to  collective  security.  A1  Gore,  in  his  1992  book, 
Earth  in  the  Balance,  had  pointed  to  global  risks 
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such  as  the  production  and  trade  of  highly  toxic 
chemicals,  the  loss  of  biodiversity,  ozone  deple¬ 
tion,  climate  change,  marine  degradation,  and 
desertification  and  deforestation.  The  environ¬ 
ment,  peace,  and  security  chapter  of  Our  Common 
Future  was  not  subject  to  much  negotiation  at  the 
1992  United  Nations  Conference  on  Environment 
and  Development  in  Rio  de  Janeiro. 

Developing  countries  did  not  endorse  a  global 
dialogue  on  environmental  issues  within  the  context 
of  conflict  and  security,  reacting  negatively  to  any 
multilateral  treatment  of  environmental  security 
proposals  within  UN  forums.  Coalitions  of  develop¬ 
ing  nations,  mainly  the  Group  of  77,  perceived  the 
security  frame  as  a  Pandora’s  box  that,  once 
opened,  could  formally  menace  their  claims  of  sov¬ 
ereign  control  over  their  resources.  Moreover,  lead¬ 
ing  developing  nations  (Brazil,  China,  India,  and 
Malaysia,  among  others)  tend  to  consider  that  pre¬ 
dominantly  Northern  geopolitical  concerns  about 
the  impacts  of  environmental  conflicts  on  national 
sovereignty  and  the  destruction  of  the  global  com¬ 
mons  associated  with  comprehensive  security  have 
largely  dominated  environmental  security  negotia¬ 
tions  and  policy  making  to  date.  Their  criticisms 
focus  on  the  failure  of  this  security  approach  to 
address  the  causes  of  inequality  and  injustice  that 
underpin  poverty  and  environmental  degradation  at 
local  and  global  levels.  Representatives  from  devel¬ 
oping  nations  support  that  global  change  has  more 
impact  on  those  least  able  to  protect  themselves 
from  its  effects,  such  as  the  poor  and  coastal  com¬ 
munities  at  risk  from  severe  flooding  and  climate 
change  spillovers. 

Therefore,  the  world  of  interstate  relations  has 
not  welcomed  the  ideal  of  environmental  responsi¬ 
bility  as  another  normative  principle  in  interna¬ 
tional  relations,  arguing  that  it  would  justify 
breaches  in  the  state’s  sovereignty  and  legitimize 
foreign  interventions  by  stronger  countries.  That  is 
why  some  authors  such  as  Norman  Myers  tend  to 
defend  the  individual  level  of  analysis;  that  is,  secu¬ 
rity  should  apply  mostly  at  the  level  of  the  indi¬ 
vidual  citizen  as  a  basis  of  political  stability.  In  his 
view,  security  should  not  be  a  function  of  stable 
macrosocial  processes  (such  as  sociopolitical,  eco¬ 
nomic,  or  even  military  systems);  rather,  it  should 
be  seen  as  a  goal  to  be  achieved  at  the  microlevel 
of  human  existence.  Another  type  of  definition  is 
provided  by  ecological  security  approaches,  which 


tend  to  consider  that  environmental  degradation, 
by  undermining  mankind’s  means  of  subsistence, 
threatens  the  security  of  ecosystems.  Ecological 
security  takes  into  account  the  need  to  provide  for 
the  physical  circumstances  and  conditions  of  the 
needs  of  a  community  without  diminishing  natural 
stocks.  Other  approaches — for  instance,  the  one 
supported  by  the  United  Nations  Development 
Program — are  inclined  to  privilege  human  security 
as  an  overarching  concept.  As  a  consequence,  envi¬ 
ronmental  security  and  human  rights  would  be 
inextricably  linked.  In  this  case,  security  should 
become  an  all-encompassing  term  relating  to  the 
social,  economic,  political,  and  ecological  well¬ 
being  of  individual  human  beings. 

Irrespective  of  the  level  of  analysis,  all  these  per¬ 
spectives  agree  that  environmental  threats  pose 
long-term  dangers  to  security  and  that  the  causes  of 
environmental  insecurity  are  related  to  population 
growth,  degradation,  overuse  and  abuse  of  resources, 
depletion  of  finite  global  resources,  transborder 
effects  of  environmental  degradation,  or  the  ener¬ 
getic  impact  of  unsustainable  consumption  patterns. 
Apart  from  the  intrinsic  shortcomings  associated 
with  the  interstate  system  to  deal  with  global  envi¬ 
ronmental  security  issues,  another  major  obstacle 
that  may  stand  in  the  way  of  effective  global  gover¬ 
nance  mechanisms  and  structures  relates  to  the 
extraordinary  power  of  transnational  corporations. 
Of  the  100  largest  economies  in  the  world,  51  are 
corporations;  and  the  top  500  corporations  now 
control  almost  two  thirds  of  world  trade. 

The  literature  on  environmental  security  is  very 
much  shaped  by  the  assumption  that  the  behavior  of 
states  is  determined  by  the  structure  of  power  rela¬ 
tions  in  international  relations.  Particularly  in  the 
1990s,  this  literature  has  also  been  under  the  influ¬ 
ence  of  a  credo  that  no  regulatory  mechanisms 
should  guide  or  control  corporate  behavior.  In  this 
sense,  security  would  be  restricted  to  the  need  to 
protect  a  population  and  a  territory  against  foreign 
threats  while  at  the  same  time  defending  the  interests 
of  the  state.  Questions  related  to  scope  and  content 
would  be  withdrawn  from  the  policy  debate,  but 
some  scholars  insist  on  retaining  them:  What  is  the 
nature  of  the  societies,  organizations,  and  institu¬ 
tions  whose  security  matters  from  the  point  of  view 
of  security  discourse?  What  environmental  phenom¬ 
ena  put  them  at  risk  and  render  them  insecure?  Who 
is  threatened  by  environmental  insecurity? 
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According  to  Steven  Ney  (1999),  it  is  necessary 
to  recall  that  evaluating  different  models,  or  concep¬ 
tual  maps,  of  environmental  security  is  not  a 
straightforward  task.  Using  Graham  Allison’s  ter¬ 
minology,  models  can  be  considered  perceptual 
lenses  that  allow  us  to  make  sense  of  the  world:  A 
theory  is  a  way  of  seeing  as  well  as  a  way  of  not 
seeing.  As  such,  models  are  tools  with  which  policy 
actors  may  construct  arguments  in  the  course  of 
policy  negotiation  and  deliberation.  Models  can 
have  a  pragmatic  impact  on  policy  making,  and  by 
extension  they  may  be  designed  solely  to  affect  the 
policy  process.  They  may  perform  as  well  as  inform. 
That  is  why  it  does  not  make  sense  to  assess  models 
in  isolation  of  their  argumentative  contexts  and 
concrete  policy  settings  in  which  they  are  to  be 
employed.  As  constructivist  and  critical  researchers 
argue,  the  often  implicit  normative  assumptions 
that  underpin  the  different  approaches  to  environ¬ 
mental  security  should  also  be  analyzed.  For 
instance,  one  of  the  explanatory  powers  of  many 
environmental  security  approaches  is  based  on  the 
assumption  that  increased  population  growth  inevi¬ 
tably  leads  to  environmental  degradation.  By  the 
same  token,  however,  this  argument  does  not  take 
into  consideration  how  individual  and  social  con¬ 
sumption  patterns  are  structured  and  how  some  of 
them  may  have  a  stronger  impact  on  the  environ¬ 
ment.  It  is  true  that  the  various  theoretical  approaches 
to  environmental  security  contribute  to  promoting 
global  environmental  issues  into  policy  agendas 
worldwide,  but  the  question  that  environmental 
security  studies  should  also  pose  relates  to  how 
these  issues  are  being  brought  into  the  agendas. 

Carlos  R.  S.  Milani 
Universidade  Federal  da  Bahia 
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Epistemic  Communities 


Epistemic  communities  are  networks  of  knowledge- 
based  communities  with  an  authoritative  claim  to 
policy-relevant  knowledge  within  their  domain  of 
expertise.  Their  members  share  knowledge  about 
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the  causation  of  social  or  physical  phenomena  in  an 
area  for  which  they  have  a  reputation  for  compe¬ 
tence  and  a  common  set  of  normative  beliefs  about 
what  actions  will  benefit  human  welfare  in  such  a 
domain.  In  particular,  they  are  a  group  of  profes¬ 
sionals,  often  from  a  number  of  different  disciplines, 
who  share  the  following  set  of  characteristics: 

1.  Shared  principled  beliefs:  Such  beliefs  provide  a 
value-based  rationale  for  social  action  of  the 
members  of  the  community. 

2.  Shared  causal  beliefs  or  professional  judgment: 
Such  beliefs  provide  analytic  reasons  and 
explanations  of  behavior,  offering  causal 
explanations  for  the  multiple  linkages  between 
possible  policy  actions  and  desired  outcomes. 

3.  Common  notions  of  validity — intersubjective, 
internally  defined  criteria  for  validating 
knowledge:  These  allow  community  members  to 
confidently  differentiate  between  warranted  and 
unwarranted  claims  about  states  of  the  world 
and  policies  to  change  those  states. 

4.  A  common  policy  enterprise:  This  entails 
tackling  a  set  of  practices  associated  with  a 
central  set  of  problems,  presumably  out  of  a 
conviction  that  human  welfare  will  be  enhanced 
as  a  consequence. 

Epistemic  communities  is  a  concept  invoked  by 
constructivist  scholars  of  international  relations  to 
focus  analytic  attention  on  the  process  by  which 
states  formulate  interests  and  reconcile  differences 
of  interest.  Epistemic  communities  are  a  principal 
channel  by  which  consensual  knowledge  about 
causal  understandings  is  applied  to  international 
policy  coordination  and  by  which  states  may  learn 
through  processes  of  international  cooperation. 

John  Ruggie  introduced  the  term  episteme,  bor¬ 
rowed  in  turn  from  Michel  Foucault,  to  describe 
the  overarching  perspective  through  which  politi¬ 
cal  relationships  are  visualized  and  understood 
during  historical  eras.  The  1992  winter  issue  of 
International  Organization  on  “Knowledge, 
Power,  and  International  Policy  Coordination” 
focused  on  the  actors  responsible  for  articulating 
and  aggregating  knowledge-based  understanding 
in  areas  of  security,  environment,  and  interna¬ 
tional  political  economy  (IPE).  It  also  developed 


the  now  standard  four-element  definition  of  an 
epistemic  community  provided  above. 

By  analyzing  epistemic  communities,  construc¬ 
tivist  scholars  gained  leverage  on  understanding 
the  processes  of  social  construction  and  collective 
learning.  If  contemporary  international  relations  is 
characterized  as  a  setting  of  complexity  and  uncer¬ 
tainty,  as  argued  by  constructivist  scholars — par¬ 
ticularly  under  contemporary  circumstances  of 
complex  interdependence,  increasing  globaliza¬ 
tion,  and  the  emergence  of  new  technical  issues  on 
the  international  agenda  with  which  traditional 
decision  makers  are  habitually  unfamiliar — then 
state  interests  are  often  unknown  or  incompletely 
specified.  International  relations  then  becomes  a 
matter  of  applying  embedded  and  institutionalized 
beliefs  about  the  nature  of  problems  and  the 
appropriate  means  of  collective  response  rather 
than  the  process  of  resolving  rationally  formulated 
state  preferences  as  argued  by  many  current  theo¬ 
rists  of  international  relations.  Changes  in  informa¬ 
tion  processing  are  likely  to  follow  well-publicized 
shocks  or  crises.  Only  at  such  times  are  decision 
makers  likely  to  recognize  major  anomalies  and 
pursue  new  policy  patterns.  During  subsequent, 
less  revolutionary  periods,  these  new  doctrines  or 
orthodoxies  assume  the  status  of  taken-for-granted 
assumptions  or  dogma,  which  persist  until  called 
into  question  again  by  external  stimuli.  Because  of 
the  disjointed  equilibrium  nature  of  policy  change, 
an  evolutionary  focus  on  institutional  learning  and 
path  dependence  may  provide  an  appropriate 
model  by  which  to  understand  the  international 
recognition  and  response  to  global  change.  Such  a 
research  program  may  provide  a  better  understand¬ 
ing  of  factors  that  influence  the  introduction  of 
new  policy  frames,  collective  understandings,  or 
doctrines  as  well  as  illuminate  mechanisms  of  lock- 
in  and  identify  those  factors  that  may  influence  the 
degree  of  irreversibility  of  national  and  collective 
actions. 

Epistemic  communities  are  one  of  the  principal 
actors  responsible  for  aggregating  and  articulating 
knowledge  in  terms  of  state  interests  for  decision 
makers  and  disseminating  those  beliefs  internation¬ 
ally.  In  a  broader  political  context,  epistemic  com¬ 
munities  provide  one  of  the  major  channels  by 
which  overarching  regime  principles,  norms,  and 
rules  are  articulated  for  the  international  community 
and  disseminated  internationally.  While  epistemic 
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communities  are  the  principal  agents  responsible 
for  articulating  such  principles,  norms,  and  rules, 
the  extent  to  which  they  become  more  deeply  dif¬ 
fused  and  embedded  internationally  has  to  do  with 
the  political  influence  of  epistemic  community 
members:  their  ability  to  persuade  others,  their 
ability  to  consolidate  bureaucratic  influence  in 
important  institutional  venues,  and  their  ability  to 
retain  influence  over  time.  State  interests  and  deci¬ 
sions  to  deploy  state  power  are  thus  identified 
subject  to  consensual  knowledge. 

Research  on  Epistemic  Communities 

The  epistemic  community  literature  is  now  nearly 
20  years  old.  The  epistemic  communities  concept 
was  initially  favorably  received  in  international 
relations  because  it  provided  a  means  for  focusing 
on  the  ideational  component  of  politics  and  also 
allowed  for  agency  in  theorizing  about  governance 
and  policy  making.  Studies  of  the  European  Union 
(EU)  in  particular  have  analyzed  the  role  of  various 
epistemic  communities  in  shaping  EU  directives  as 
well  as  in  building  a  broader  sense  of  European 
identity.  Since  the  initial  response  to  epistemic 
communities,  analysts  have  developed  a  variety  of 
clarifications,  refined  hypotheses,  carried  out  fur¬ 
ther  empirical  work,  and  empirically  confirmed 
the  broad  predicted  social  patterns  associated  with 
epistemic  communities. 

Initial  critiques  of  epistemic  communities  called 
for  a  clearer  theory  of  the  state  as  well  as  a  clearer 
metric  by  which  epistemic  communities  could  be 
recognized  and  consensus  within  an  epistemic 
community  could  be  measured. 

Critical  voices  from  the  literature  were  sounded 
on  social  studies  of  knowledge  regarding  the  degree 
of  political  autonomy  enjoyed  by  epistemic  commu¬ 
nities  and  science-based  arguments  in  general  for 
public  policy,  questioning  the  political  consequences 
of  such  ideas  and  also  the  potential  implicit  political 
bias  in  the  research  programs  pursued  by  epistemic 
community  members.  They  feared  that  by  depoliti- 
cizing  expertise  and  the  consequences  of  expert- 
based  advice,  epistemic  communities  may  become 
antidemocratic  and  antiparticipatory.  However, 
such  points  were  explicitly  acknowledged  in  the  ini¬ 
tial  formulation  of  the  epistemic  community  research 
program — namely,  the  political  nature  of  all  policy 
debate.  The  epistemic  community  argument  was 


that,  normatively,  epistemic  communities  ultimately 
provided  more  impartial  advice  than  other  modes  of 
policy  advice  and,  analytically,  their  study  offered  a 
clear  causal  pathway  by  which  ideas  came  to  inform 
political  practices  that  was  superior  to  conventional 
approaches  to  the  study  of  politics,  which  are  unable 
to  provide  credible  explanations  for  how  ideas  influ¬ 
enced  politics  or  the  conditions  under  which  ideas 
were  likely  to  be  influential. 

Epistemic  community  members  can  be  identi¬ 
fied  through  interviews,  from  secondary  literature 
(especially  by  journalists),  and  by  cross-checking 
membership  lists  for  international  negotiations 
and  consultations  over  time.  Recurrent  names  are 
eligible  candidates.  Refining  an  identification  of 
the  consensual  knowledge  shared  by  the  epistemic 
community  members  can  be  done  by  reading  their 
publications  (especially  scientific  ones  with  equa¬ 
tions  that  force  precision)  and  open-ended  inter¬ 
views.  The  extent  to  which  they  feel  that  they  are 
members  of  a  common  community  can  be  ascer¬ 
tained  through  interviews  and  through  snowball 
techniques,  in  which  prospective  members  identify 
others  with  whom  they  believe  they  share  beliefs. 
The  combination  of  self-identified  traits  and  exter¬ 
nally  confirmed  traits  (such  as  the  application  of 
consensus  approaches  to  truth)  define  the  epis¬ 
temic  community.  The  causal  mechanisms  by 
which  they  exercise  influence — learning — can 
effectively  be  pursued  by  process  tracing,  focused 
comparative  case  studies,  counterfactuals,  and 
alternative  hypothesis  testing. 

Stronger  hypotheses  about  the  mechanisms, 
effects,  and  variation  of  epistemic  communities’ 
influence  were  also  developed.  Epistemic  commu¬ 
nities  are  likely  to  be  found  in  substantive  issues 
where  methods  from  scientific  disciplines  have 
been  applied  to  policy-oriented  work  and  in  coun¬ 
tries  with  well-established  institutional  capacities 
for  administration,  and  science  and  technology. 
Governments  with  such  capacities  are  likely  to  see 
the  need  for  the  technical  skills  that  epistemic  com¬ 
munity  members  command,  and  such  professionals 
may  be  attracted  to  governmental  service  only 
when  they  believe  that  their  policy  enterprise  can 
be  advanced.  Crises  or  widely  publicized  shocks 
are  probably  necessary  precipitants  of  environmen¬ 
tal  regime  creation,  but  crises  alone  are  insufficient 
to  explain  how  or  which  collective  responses  to  a 
perceived  joint  problem  are  likely  to  develop. 
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Epistemic  communities  then  help  identify  cause- 
and-effect  relationships,  elucidate  interlinkages 
between  problems,  help  define  the  consulting 
state’s  or  organization’s  interests,  and  help  formu¬ 
late  policy.  Their  aggregate  effect  depends  on  the 
extent  to  which  their  ideas  become  embedded  in 
influential  multilateral  institutions  more  generally: 
powerful  countries  and  international  institutions 
and,  possibly,  nongovernmental  organizations 
(NGOs),  which  will  then  deploy  their  own  influ¬ 
ence  to  disseminate  the  shared  ideas  enunciated 
within  the  epistemic  community.  Overall,  learning 
will  occur  in  the  policy  system  as  new  policy- 
relevant  knowledge  is  identified  and  applied  to  a 
common  problem. 

The  initial  reception  of  their  ideas  is  likely  to 
occur  in  rich  democratic  societies,  where  scientific 
capacity  and  the  free  flow  of  information  within 
society  are  high.  However,  other  countries  may 
also  be  receptive  to  epistemic  communities’  ideas 
but  not  as  quickly. 

Treaties  developed  with  a  strong  contribution 
from  epistemic  communities  are  likely  to  enter  into 
force  more  rapidly  than  those  developed  through 
other  political  mechanisms,  as  the  social  influence 
of  epistemic  community  members  will  accelerate 
ratification  in  domestic  legislative  venues. 

Multiple  mechanisms  of  diffusion  are  possible 
through  various  international  channels  at  different 
political  scales.  For  instance,  in  stratospheric 
ozone  protection,  the  very  regulatory  scheme  arose 
from  atmospheric  chemists,  but  the  new  technolo¬ 
gies  to  satisfy  those  regulations  emerged  through 
the  interplay  of  multinational  corporations 
(MNCs,  e.g.,  DuPont,  ATOCHEM,  and  ICI), 
international  institutions  (the  Montreal  Ozone 
Protocol’s  technical  advisory  panels  and  the 
Montreal  Ozone  Fund),  and  NGOs  (most  notably, 
Greenpeace).  The  appropriate  factors  by  which 
international  institutions  may  induce  other  actors 
to  accept  the  epistemic  ideas  will  vary  because  of 
the  political  and  economic  conditions  in  the  target 
country;  for  instance,  advanced  industrialized  soci¬ 
eties  are  more  responsive  to  public  education  cam¬ 
paigns,  whereas  poor  developing  countries  respond 
to  material  incentives  from  capacity  building.  The 
key  analytic  point,  though,  is  that  without  the 
involvement  of  epistemic  communities  to  set  the 
agenda  and  impart  new  ideas,  the  political  dynam¬ 
ics  of  collective  action  are  likely  to  be  far  more 


conventional  and  lack  the  potential  for  reflective 
learning  by  policymakers. 

Subsequent  empirical  work  has  found  that  epis¬ 
temic  community  members  have  played  a  strong 
role  in  negotiating  international  environmental 
agreements.  Members  of  the  epistemic  community 
that  have  dominated  technical  discussions  in  envi¬ 
ronmental  regimes  have  subscribed  to  holistic 
ecological  beliefs  about  the  need  for  policy  coordi¬ 
nation  subject  to  ecosystemic  laws.  Thus,  they 
promote  international  environmental  regimes  that 
are  grounded  on  policies  that  offer  coherent  plans 
for  the  management  of  entire  ecosystems  and  are 
sensitive  to  interactions  among  environmental 
media  (such  as  air  and  water),  sources  of  pollu¬ 
tion,  and  contending  uses  of  the  common  property 
resource,  rather  than  being  limited  to  more  tradi¬ 
tional  policies  for  managing  discrete  activities  or 
physical  resources  spaces  within  fairly  short-term 
time  horizons.  Epistemically  informed  collective 
action  has  a  distinctively  comprehensive  form  and 
is  politically  resilient,  yielding  patterns  that  are  not 
associated  with  political  dynamics  involving  any 
other  sets  of  political  actors.  Moreover,  epistemi¬ 
cally  informed  treaties  are  more  likely  to  be  effec¬ 
tive  because  the  regulatory  standards  fit  the  behav¬ 
ior  of  the  socio-ecosystem  being  managed. 

Further  studies  of  international  institutions 
refined  the  characteristics  of  formal  international 
organizations  that  are  likely  to  absorb  the  lessons 
of  epistemic  communities  and  disseminate  them. 
Design  principles  for  international  scientific  panels 
were  also  identified. 

The  hypotheses  about  the  distinctive  patterns  of 
epistemic  collective  action  were  confirmed  by 
numerous  focused  comparative  case  studies  of  a 
wide  array  of  multilateral  environmental  regimes. 
In  IPE  (international  political  economy),  similar 
dynamics  are  associated  with  the  popularization 
and  dissemination  of  major  economic  doctrines, 
including  the  Washington  Consensus,  Keynesianism 
and  structuralism,  and  development  economics. 

The  ideational  focus  was  absorbed  into  the 
broader  constructivist  research  program  developed 
in  international  relations  and  comparative  politics, 
which  looked  at  the  role  of  beliefs  and  ideas  in  shap¬ 
ing  state  interests  and  practices,  with  epistemic  com¬ 
munities  serving  as  one  of  the  mechanisms  by  which 
new  ideas  are  developed  and  circulated.  Analyzing 
epistemic  communities  continues  to  provide  a  key 
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group  of  actors  associated  with  a  distinctive  set  of 
processes  by  which  causal  understandings  shape 
actor  interests  and  regime  dynamic  amid  construc¬ 
tivists’  broader  focus  on  other  ideational  forces, 
including  norms,  linguistic  usage,  and  the  like. 

More  recent  publications  in  the  epistemic  com¬ 
munities  research  program  have  been  looking  more 
carefully  at  the  context  in  which  epistemic  commu¬ 
nities  arise  and  operate.  Current  works  try  to  clar¬ 
ify  the  institutional  factors  that  shape  or  amplify 
ideational  consensus  and  dissemination  and  that 
influence  the  creation  of  relatively  impartial  and 
usable  scientific  advice,  and  they  try  to  separate  the 
causal  mechanisms  of  social  learning  from  other 
causal  mechanisms  that  drive  collective  action.  For 
instance,  how  can  one  explain  which  epistemic 
community  will  prevail  and  when?  One  could  ask 
why  environmental  scientists  have  tended  to  prevail 
over  economists  in  all  environmental  issues  other 
than  climate  change.  Additional  research  frontiers 
include  the  mechanisms  of  lock-in,  the  domestic 
dynamics  that  influence  the  authority,  the  identity 
and  influence  of  epistemic  communities,  and  the 
mechanisms  by  which  decision  makers  learn  more 
about  the  nature  of  the  international  political  sys¬ 
tem  as  well  as  the  management  of  isolated  issues. 
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Epistemological  and 
Methodological  Foundations 


Understood  literally,  method-ology  means  the 
study  and/or  use  of  methods.  This  understanding 
implies  the  existence  of  more  than  one  method 
and  that  the  utility  of  a  particular  method  can 
vary  across  contexts.  For  many  practitioners  in 
political  science,  the  method  of  choice  is  deter¬ 
mined  by  the  number  of  observations:  Students  of 
relatively  abundant  political  phenomena  rely  on 
statistical  techniques,  students  of  unique  political 
phenomena  use  case  study  techniques,  and  those 
in  the  middle  sample  choose  from  a  wide  array  of 
qualitative  or  small-N  comparative  approaches. 
This  methodological  rule  of  thumb  is  derived  from 
an  implicit  hierarchy  of  methods,  which  is  the 
product  of  a  particular,  and  very  influential, 
research  tradition. 

This  entry  examines  the  ontological  and  episte¬ 
mological  foundations  of  this  research  tradition  to 
explain  the  reasoning  behind  its  implicit  hierarchy  of 
methods  and  to  show  some  of  its  inherent  limita¬ 
tions.  There  are  two  objectives  for  this.  The  first 
objective  is  to  broaden  the  reader’s  methodological 
horizons  by  showing  how  one’s  choice  of  methods 
should  reflect  the  nature  of  the  subject  one  studies  as 
well  as  the  way  in  which  one  can  acquire  knowledge 
about  that  subject.  By  examining  these  method¬ 
ological  foundations,  political  scientists  can  become 
more  aware  of  both  the  potential  and  the  shortcom¬ 
ings  of  the  methods  they  use.  Just  as  important, 
methodological  reflection  can  encourage  political 
scientists  to  use  a  broader  range  of  methods.  In  rec¬ 
ognizing  how  the  usefulness  of  any  particular 
method  will  depend  on  the  nature  of  the  subject 
under  study  as  well  as  on  the  mode  of  explanation 
that  the  analyst  deems  most  appropriate,  political 
scientists  may  find  it  necessary  to  adapt  their  meth¬ 
ods,  or  adopt  new  ones,  as  the  nature  of  the  political 
world  (and  political  scientists’  interest  in  it)  evolves. 
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To  stimulate  this  sort  of  reflection,  this  entry 
focuses  on  the  philosophical  underpinnings  of  the 
most  common  approaches.  For  empirically  minded 
political  scientists,  these  philosophical  concerns 
are  twofold:  (1)  they  need  to  be  aware  of  the 
nature  of  the  things  they  study,  and  (2)  they  need 
to  concern  themselves  with  the  various  means  by 
which  they  can  secure  reliable  knowledge  about 
those  things.  These  two  concerns  provide  the  basic 
framework  for  this  entry.  The  first  section  exam¬ 
ines  the  way  in  which  mainstream  approaches 
depend  on  a  particular  understanding  of  the  polit¬ 
ical  world  to  produce  the  theories  and  generaliza¬ 
tions  that  the  political  science  profession  admires. 
The  most  common  methods  are  designed  to  exploit 
specific  ontological  contexts  and  epistemological 
standards;  as  a  result,  they  do  not  always  work  as 
designed  when  transplanted  to  new  and  different 
contexts.  The  second  section  introduces  an  alter¬ 
native  ontological  approach  and  its  relationship  to 
the  same,  commonplace  methods.  In  short,  the 
relationship  among  ontology,  epistemology,  and 
the  methods  is  discussed  in  the  first  two  sections 
that  follow.  The  third  section  examines  some  of 
the  most  common  difficulties  facing  researchers 
who  are  trying  to  secure  reliable  knowledge  about 
that  political  world.  In  their  attempt  to  secure  and 
understand  knowledge  about  the  world,  they 
inevitably  impose  their  own  ways  of  ordering  it. 
Recognizing  and  correcting  these  inherent  biases  is 
the  subject  of  the  third  section,  which  harks  back 
to  Francis  Bacon  for  the  task. 

The  Political  World 

Political  science — as  an  academic  discipline — has 
grown  out  of  several  fields,  which  vary  signifi¬ 
cantly  from  one  national  tradition  to  the  next.  In 
some  contexts,  the  weight  of  history  has  been 
strongest;  in  others,  politics  grew  from  the  study  of 
law;  in  still  others,  economics  or  philosophy  were 
the  host  communities.  As  a  result,  modern  political 
science  can  draw  from  a  very  deep  methodological 
well,  if  only  one  is  willing  to  tap  into  it.  Throughout 
most  of  the  history  of  European  political  thought, 
one  can  find  a  remarkable  diversity  and  openness 
about  how  to  address  political  affairs.  The  found¬ 
ing  thinkers  of  the  political  science  discipline — 
Plato  and  his  student  Aristotle — could  not  agree 
about  the  nature  of  the  world  they  studied,  the 


utility  of  different  sources  of  knowledge  about  that 
world,  or  on  any  particular  method  for  studying 
that  world.  After  all,  Plato  was  not  especially  inter¬ 
ested  in  experience  or  empirical  detail;  he  empha¬ 
sized  the  importance  of  reason  in  discovering 
truths  about  an  ideal  world.  Aristotle,  by  contrast, 
favored  a  more  down-to-earth  approach,  where 
observation  of  events  in  the  world  and  careful 
comparisons  could  show  us  true  from  false. 

In  these  foundational  thinkers,  one  can  find  an 
open  discussion  about  the  importance  of  linking 
the  approach  of  study  to  the  world  of  study.  In  the 
centuries  since,  there  has  been  a  broad  consensus 
that  the  nature  of  the  social  and  political  world  is 
relatively  similar  to  that  of  the  natural  world. 
Indeed,  this  consensus  is  one  reason  why  many 
methods  developed  for  studying  the  natural  world 
have  successfully  migrated  to  the  social  sciences, 
and  it  may  be  an  important  reason  why  political 
scientists  spend  so  little  time  discussing  ontological 
issues.  As  will  be  seen  in  the  next  section,  this  is 
not  the  only  ontological  perspective  embraced  by 
political  scientists — but  it  remains  the  most  domi¬ 
nant.  Most  political  scientists  view  the  political 
world  in  a  way  that  is  strongly  influenced  by  a 
long  and  illustrious  tradition  in  the  natural  sci¬ 
ences.  This  tradition  builds  on  two  important 
assumptions.  The  first  holds  that  the  phenomena 
studied  are  independent  of  the  observer,  so  that 
observations  do  not  affect  them  in  any  significant 
way.  In  this  tradition,  political  scientists  can 
approach  their  world  of  study  confident  that  there 
is  an  objective  subject  of  study — a  Real  World — 
and  that  this  subject  will  reveal  itself  consistently 
to  each  observer.  Because  the  Real  World  exists 
independent  of  us,  and  because  that  World  is  sta¬ 
ble  and  unchanging,  it  is  possible  to  arrive  at  sin¬ 
gular  truths  about  the  nature  of  this  Real  World, 
given  the  appropriate  techniques. 

The  second  common  assumption  holds  that  this 
Real  World  is  patterned — that  it  obeys  a  certain 
logic  of  its  own — and  that  these  patterns  are  also 
independent  of  their  observers.  Because  these  pat¬ 
terns  are  a  natural  part  of  the  world,  this  can  be 
called  a  naturalist  ontology,  perspective,  or  world¬ 
view.  While  some  of  these  patterns  are  patently 
obvious  (e.g.,  heavy  items,  when  dropped,  fall  to 
the  ground;  the  sun  rises  in  the  east  and  sets  in  the 
west),  others  are  concealed  by  the  complexities  of 
nature  and  life.  The  latter  is  especially  true  in  the 
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political  world.  It  is  the  fixed  existence  and  inde¬ 
pendent  nature  of  these  underlying  patterns  that 
allows  us  to  generalize  about  the  nature  of  social 
relationships.  In  recognizing  these  fundamental 
components  of  this  ontology,  researchers  have 
developed  methods  that  can  search  for,  uncover, 
and  eventually  explain  the  systematic  patterns  that 
they  believe  exist  but  that  are  often  hidden  from 
the  casual  observer. 

Belief  in  the  existence  of  these  natural  and  stable 
patterns  can  explain  the  predominance  of  two  main 
sources  of  knowledge  in  political  science:  reason 
and  experience  (or  sensory  perception).  These  are 
not  the  only  sources  of  knowledge  available  to 
political  scientists,  nor  are  they  exclusive  to  the 
naturalist’s  world  view.  Still,  rationalism  and 
empiricism  remain  political  scientists’  most  com¬ 
mon  and  dominant  sources  of  knowledge.  The  first 
of  these,  rational  knowledge,  acquires  understand¬ 
ing  about  the  world  through  strict  adherence  to 
rigorous  thinking  (e.g.,  by  using  syllogisms  or  for¬ 
mal  logic).  Because  we  believe  the  world  to  be  sta¬ 
ble  and  patterned,  we  can  use  reason  and  logic  to 
uncover  the  underlying  form,  ideal,  or  model  on 
which  the  Real  World  is  based.  While  the  original 
motivation  for  this  way  of  interpreting  the  world 
was  religious  (that  an  all-powerful  God  created  the 
world  and  imbued  it  with  meaning),  it  enjoys  a  long 
and  august  pedigree:  one  that  extends  from  Plato, 
through  17th-century  continental  European  phi¬ 
losophy  (e.g.,  Descartes,  Spinoza,  and  Leibniz),  and 
continues  today  in  mathematical  and/or  rational 
choice  approaches  to  studying  the  political  world. 

In  the  rationalist  tradition,  the  patterned  nature 
of  the  Real  World  can  be  uncovered  and  under¬ 
stood  by  creating  consistent  and  logical  models  to 
show  how  the  world  really  is  beneath  the  confus¬ 
ing  complexity  of  everyday  life.  By  filtering  out  the 
particular  irregularities  of  experience  and  by  focus¬ 
ing  on  the  underlying  concepts  and  relationships, 
rationalist  approaches  are  able  to  create  pristine 
models  of  social  behavior.  Thus,  for  example, 
economists  can  generate  models  of  what  a  true 
market,  in  equilibrium,  should  look  and  behave 
like  (e.g.,  in  the  absence  of  organized  interests  and 
government  regulations).  Similarly,  social  choice 
theorists  can  show  (under  a  number  of  very  rea¬ 
sonable  conditions)  that  no  voting  system  is  fair. 

Implicitly,  the  authors  of  these  models  believe 
that  they  correspond  to  the  true  nature  of  the 


(underlying)  Real  World.  To  complicate  matters, 
this  Real  World  might  differ  from  the  world  of 
experience.  When  models  do  not  actually  corre¬ 
spond  to  observable  actions  or  patterns,  supporters 
can  argue  (like  Plato)  that  the  particular  experiences 
of  everyday  life  (or  our  description  of  these  experi¬ 
ences)  get  in  the  way  of,  or  obscure,  our  perception 
of  the  underlying  patterns  and  relationships. 
Consequently,  the  standard  of  proof  in  this  tradition 
tends  to  be  reason,  not  a  reference  to  how  things 
actually  are  in  the  world  of  everyday  experience. 

The  second  main  source  of  knowledge — empiricist 
knowledge — is  arguably  the  most  dominant  in  con¬ 
temporary  political  science.  The  power  of  this  type 
of  knowledge  is  derived  from  the  belief  that  the  most 
secure  means  of  obtaining  knowledge  about  the 
world  is  through  sensory  perception  and  the  clear 
description  of  these  sensory  experiences.  Although  it 
may  be  harmful  to  depict  the  history  of  science  in 
simple — dichotomous — terms,  it  is  common  and 
convenient  to  depict  the  empiricist  tradition  as  a 
response  to — or  in  dialogue  with — the  rationalist 
tradition.  Thus,  Aristotle  is  seen  to  have  reacted  to 
Plato’s  rationalism  in  beginning  his  treatise  on 
Politics  with  the  simple  claim  that  “observation 
shows  us,”  and  British  philosophers  such  as  Francis 
Bacon,  John  Locke,  and  David  Hume  are  said  to  be 
responding  to  the  continental  philosophers  associ¬ 
ated  with  the  rationalist  tradition. 

Here,  too,  the  messiness  of  everyday  life  can 
cloud  the  underlying  patterns: 

The  universe  to  the  eye  of  the  human  understanding 
is  framed  like  a  labyrinth,  presenting  as  it  does  on 
every  side  so  many  ambiguities  of  way,  such 
deceitful  resemblances  of  objects  and  signs,  natures 
so  irregular  in  their  lines  and  so  knotted  and 
entangled.  And  then  the  way  is  still  to  be  made  by 
the  uncertain  light  of  the  sense,  sometimes  shining 
out,  sometimes  clouded  over,  through  the  woods 
of  experience  and  particulars;  while  those  who 
offer  themselves  for  guides  are  (as  was  said) 
themselves  also  puzzled,  and  increase  the  number 
of  errors  and  wanderers.  (Bacon,  1620/1863a, 
Preface) 

Although  the  true  patterns  of  a  Real  World  lie  hid¬ 
den  in  this  labyrinth  of  experience,  empiricists 
believe  that  the  use  of  appropriate  methods  and 
careful  study  can  uncover  them  and  reveal  the 
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truth.  Following  Karl  Popper,  naturalists  in  the 
empiricist  tradition  believe  that  a  statement  is  true 
if  it  accurately  corresponds  to  a  state  of  affairs  in 
the  Real  World. 

For  the  world  of  political  science,  as  for  the 
world  of  nature  (or  perhaps  even  more  so),  the 
complexity  of  overlapping  vocabularies  and  per¬ 
spectives  can  make  it  very  difficult  to  see  the 
underlying  and  stable  patterns  that  make  up  the 
Real  World.  To  uncover  these  patterns,  research¬ 
ers  must  choose  their  approaches  carefully  and 
dissect  the  world  systematically.  Modern  empiri¬ 
cist  approaches  draw  from  the  “experimental 
methods”  in  John  Stuart  Mill’s  (1891)  A  System  of 
Logic  to  develop  inductive  approaches  that  can 
identify  the  lawlike  relationship  in  the  underlying 
(political)  world.  Modern  political  science — in  all 
its  variants — is  remarkably  indebted  to  Mill’s 
methods  of  experimentation.  His  method  of  differ¬ 
ence,  method  of  agreement,  indirect  method  of 
difference,  and  method  of  concomitant  variation 
have  become  the  building  blocks  of  modern  com¬ 
parative  and  statistical  analyses:  tools  that  are  used 
to  systematically  control  and  compare  our  experi¬ 
ences  of  the  political  world  in  ways  that  allow  us 
to  discover  the  stable  connections  and  causal  regu¬ 
larities  that  are  hidden  beneath  the  surface. 

Both  these  mainstream  sources  of  knowledge 
(rationalism  and  empiricism)  assume  that  the  Real 
World  not  only  exists  but  that  it  is  naturally  struc¬ 
tured  or  patterned  in  a  logical,  meaningful  way. 
Methods  that  rest  atop  both  rationalist  and  empir¬ 
icist  foundations  are  designed  to  uncover  singular 
truths  about  the  nature  of  the  Real  World  and  to 
do  so  in  an  objective,  disinterested  manner.  It  is 
this  pattern  (or  logic)  that  allows  us  to  use  system¬ 
atic  tools  of  induction  (or  methods  of  experimen¬ 
tation,  to  use  Mill’s  terms)  to  tease  out  the  patterns 
that  lie  just  beneath  the  surface  or  to  generate 
logical  and  rational  models  of  how  the  world  actu¬ 
ally  is.  In  this  mainstream  (naturalist)  tradition, 
political  scientists  can  transcend  their  personal, 
temporal,  and  cultural  perspectives,  and  beliefs 
and  prejudices,  to  arrive  at  valid  knowledge  and 
objective  truths. 

Most  contemporary  political  scientists  have 
inherited  approaches  that  have  grown  out  of  this 
tradition:  They  have  found  utility  and  meaning  in 
methods  that  have  been  carefully  adapted  in  ways 
that  make  them  more  suitable  for  studying  the 


political  world.  Indeed,  scholars  in  this  tradition 
subscribe  to  a  strong  (if  implicit)  hierarchy  of 
methods  for  uncovering  truths  about  the  world, 
and  they  do  so  by  combining  elements  from  both 
the  rationalist  and  empiricist  traditions. 

At  the  top  of  this  hierarchy  lies  the  experimental 
method.  This  method  is  prized  because  it  allows 
researchers  to  manipulate  the  relevant  variables,  in 
a  context  that  controls  for  all  other  sources  of 
influence,  in  order  to  secure  firm  knowledge  about 
posited  relationships.  This  method  works  because 
we  assume  the  world  to  be  made  up  of  discrete  and 
independent  parts  (not  unlike  those  in  a  clock), 
which  can  be  mixed  and  matched  in  different  ways 
to  reveal  (and  test)  causal  relationships.  Indeed, 
this  capacity  to  control  and  compare  components 
of  the  real  world  is  a  hallmark  of  all  good  science. 

When  experimentation  is  not  a  realistic 
alternative — and  many  areas  of  political  life  do 
not  lend  themselves  to  experimental  design — 
naturalists  tend  toward  statistical  approaches. 
Here  too,  manipulation,  comparison,  and  control 
are  central  to  uncovering  real-world  patterns.  But 
instead  of  manipulating  the  contexts  in  which 
these  variables  lie,  the  statistician’s  computer 
manipulates  the  data  in  ways  that  mimic  experi¬ 
mental  control.  Because  statistics  do  not  involve 
the  physical  manipulation  of  data,  it  is  a  method 
that  lends  itself  to  the  study  of  social  phenomena, 
where  the  tendency  is  to  study  events  that  have 
already  occurred. 

The  naturalist’s  third  best  alternative  is  small-N 
comparative  approaches.  These  are  used  when  one 
is  interested  in  questions  that  have  too  few  obser¬ 
vations  to  run  reliable  statistical  enquiries.  As  both 
statistical  and  small-N  comparative  methods  use 
experimental  designs,  they  share  much  in  common. 
The  most  important  difference  is  that  the  compara- 
tivist  selects  the  cases  of  study  in  a  strategic  way, 
so  as  to  maximize  the  variance  of  the  independent 
variables  and  to  minimize  the  influence  of  other 
(control)  variables.  In  other  words,  unlike  statisti¬ 
cal  and  experimental  designs,  case  selection  in 
small-N  comparative  projects  is  anything  but  ran¬ 
dom.  Because  of  this,  researchers  need  to  rely  on 
theory  to  avoid  major  problems  associated  with 
this  method  and  its  control  strategy  (such  as  over¬ 
determination  and  a  sampling  bias). 

Finally,  case  studies  constitute  the  bottom  rung 
of  this  methods  hierarchy.  This  method  is  used 


Epistemological  and  Methodological  Foundations  795 


only  when  researchers  are  faced  with  a  paucity  of 
data  or  relevant  comparisons.  Case  studies  are 
histories  with  a  point:  They  gather  information  on 
cases  that  are  interesting  or  relevant  in  light  of  a 
larger  theoretical  concern  or  a  specific  research 
design.  By  collecting  more  data  on  such  cases, 
researchers  might  eventually  bump  the  study  up  to 
a  higher  rung  on  the  hierarchy.  On  their  own, 
however,  case  studies  have  little  value,  in  that 
studying  a  single  case  can  provide  little  grounds  for 
generalization  about  the  patterns  that  motivate  the 
research. 

These  methods  work  remarkably  well  when 
used  in  the  right  contexts  and  when  the  limitations 
of  each  are  recognized  and  respected.  These  con¬ 
texts  draw  on  a  naturalist  ontology  that  sees  the 
world  as  something  independent  of  the  observer, 
patterned  in  nature,  and  accessible  by  methods 
that  use  observation,  control,  and  comparison. 
The  methods  depend  on  a  world  that  can  be  bro¬ 
ken  down  into  its  component  parts  (variables), 
which  are  themselves  independent  and  autono¬ 
mous.  This  allows  the  researcher  to  cut  up  a 
political  phenomenon,  manipulate  its  component 
parts,  and  use  this  structured  manipulation  (of 
either  the  world  or  the  data)  to  interrogate  the 
phenomenon  in  a  way  that  will  reveal  its  logic  or 
pattern.  However,  when  these  methods  are  used  in 
a  methodologically  naive  manner — when  they  are 
used  in  contexts  that  do  not  correspond  to  the 
assumptions  that  gird  them — the  results  can  be 
misleading,  biased,  and  wrong. 

A  Few  Examples 

There  is  little  point  in  humiliating  those  who 
have  ignored  the  ontological  assumptions  that 
underlie  their  methods.  This  problem  is  common 
enough  for  the  shame  to  be  shared  by  many.  Still, 
this  point  can  be  made  with  reference  to  a  couple  of 
recent  (positive)  examples  of  colleagues  who  have 
reflected  on  (and  responded  to)  the  shortcomings  of 
a  particularly  popular  method.  Although  both 
these  examples  are  critical  of  cross-national  regres¬ 
sion  analyses,  it  is  not  meant  to  suggest  that  this 
problem  is  confined  to  statistical  studies.  These 
examples  are  used  because  they  highlight  the  sort  of 
difficulties  that  can  be  found,  even  among  the  most 
refined,  explicit,  and  sophisticated  of  contemporary 
methods.  A  fundamental  assumption  of  most 


comparative  research  is  that  a  representative  sample 
of  independent  observations,  drawn  from  the  popu¬ 
lation  of  interest,  is  used.  The  independence  of 
observations  is  important  as  it  affects  the  effective 
number  of  observations  (e.g.,  if  there  are  four  peo¬ 
ple  in  a  household  and  they  generate  an  opinion 
together,  the  number  of  independent  opinions  is  one 
not  four).  Samples  based  on  observations  that  are 
not  independent  will  generate  findings  that  misesti¬ 
mate  the  amount  of  variability  in  the  population, 
producing  biased  sample  statistics  and  erroneous 
conclusions.  This  assumption  of  (observational) 
independence  meshes  well  with  the  ontological 
assumptions  girding  many  of  the  fields  of  research 
that  use  regression  techniques.  When  applied  on 
international  samples,  this  assumption  is  also  in 
tune  with  the  logic  of  the  Westphalian  state  system 
(at  least  an  idealized  view  of  that  system),  where 
states  are  understood  to  be  autonomous  and  inde¬ 
pendent.  But  in  a  world  that  is  increasingly  charac¬ 
terized  by  international  integration  and  diffusion, 
this  assumption  is  increasingly  tenuous. 

This  problem  is  nothing  new,  and  it  is  usually 
traced  back  to  the  critical  comments  of  Sir  Francis 
Gabon,  who — late  in  the  19th  century — ques¬ 
tioned  the  results  of  a  paper  that  used  data  from  a 
cross-cultural  sample  to  test  the  relationship 
between  marriage  laws  and  descent  patterns  in 
tribal  cultures.  “Gabon’s  problem”  was  born 
when  he  noted  that  the  observed  correlation  might 
have  been  a  result  of  contacts  between  the  cultures 
in  the  sample  (and  not  the  result  of  a  test  of  truly 
independent  cases).  While  anthropologists  have 
been  struggling  with  Gabon’s  problem  for  some 
time,  political  scientists  have  been  remarkably 
slow  to  realize  (and  address)  the  problem.  In  recent 
years,  however,  there  has  been  an  explosion  of 
activity  on  this  front,  as  many  statisticians  attempt 
to  deal  with  the  effects  of  globalization.  Detlef 
Jahn  (2006)  provides  political  scientists  with  a 
good  introduction  to  the  nature  of  the  problem 
and  suggests  one  way  to  try  to  deal  with  it.  In  an 
honest  attempt  to  align  method  and  ontology, 
Jahn  proposes  a  simple  modification  of  ordinary 
least-squares  (OLS)  regression  techniques  that 
allow  him  to  measure  the  strength  of  the  diffusion 
associated  with  globalization. 

This  is  one  way  to  resolve  the  growing  tension 
between  a  changing  world  and  the  most  popular 
method  for  its  study.  Jahn  provides  us  with  a 
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“patch”  to  an  approach  that  is  not  particularly 
well  suited  for  dealing  with  the  changing  world  of 
international  politics.  These  sorts  of  patches  work 
if  they  are  few  in  number  and  do  not  overwhelm 
the  underlying  approach  (the  result,  then,  would  be 
like  trying  to  run  Windows  XP  on  an  old  386  pro¬ 
cessor).  In  the  long  run,  it  is  likely  that  an  approach 
that  is  better  designed,  from  the  bottom  up,  will 
need  to  be  developed  to  deal  with  this  new  context. 
In  the  meantime,  however,  Jahn  gets  us  thinking. 

A  second  example  comes  from  an  article  by 
Peter  Hall  (2008).  Like  Jahn,  Hall  is  critical  of 
mainstream  regression  techniques  but  for  different 
reasons.  His  concern  is  not  so  much  in  the  changing 
nature  of  the  things  researchers  study  but  on  the 
different  kinds  of  things  they  study.  In  particular, 
Hall  is  interested  in  how  researchers  secure  causal 
explanations  of  political  phenomena.  More  to  the 
point,  Hall  is  critical  of  the  (common)  assumption 
that  large-N  statistical  approaches  provide  the  best 
grounds  for  causal  inference.  After  reviewing  three 
different  modes  of  explanation  (historically  spe¬ 
cific,  multivariate,  and  theory  oriented)  Hall  shows 
how  small-N  comparative  designs  can  be  valuable 
for  causal  inference.  In  particular,  an  intensive 
examination  of  the  causal  chain  provides  research¬ 
ers  with  a  new  and  different  basis  for  causal  infer¬ 
ence — one  that  is  especially  well  suited  for  assessing 
the  sort  of  complex  causal  theories  that  are  so 
prominent  in  the  social  sciences.  When  they  use 
systematic  process  analyses  (Hall’s  term),  small-N 
comparisons  can  be  especially  useful  for  developing 
theory-oriented  explanations. 

Hall’s  reasoning  rests  on  ontological  founda¬ 
tions.  In  addition  to  traditional  arguments  (such  as 
the  need  to  be  aware  of  the  state  of  the  literature), 
Hall  notes  how  the  researcher’s  methodology 
should  be  conditioned  on  the  state  of  the  world,  as 
he  or  she  perceives  it  (Hall,  2008;  see  also  Hall, 
2003).  Because  standard  regression  models  build  on 
assumptions  that  are  unlikely  to  hold  in  situations 
where  researchers  are  trying  to  develop  theory- 
oriented  explanations,  he  questions  the  (common) 
claim  that  a  single  (statistical)  method  can  ade¬ 
quately  assess  the  validity  of  causal  inferences  in  the 
social  sciences.  While  the  underlying  assumptions 
of  standard  regression  analyses 

were  usefully  employed  to  assess  the  conditions 

conducive  to  securing  stable  democracy,  when 


those  conditions  were  thought  to  include  basic 
socioeconomic  factors,  such  as  the  level  of 
economic  development,  related  levels  of  literacy 
and  the  correlates  of  “modernisation”  .  .  .  when 
theorists  began  to  see  stable  democracy  as  the 
product  of  an  intricate  strategic  interaction 
among  reforms,  extremists  and  defenders  of  the 
old  regime,  statistical  methods  were  no  longer 
appropriate  for  assessing  the  causal  chain.  (Hall, 
2008,  p.  307) 

Both  these  examples  show  how  greater  aware¬ 
ness  of  ontological  assumptions  should  be  impor¬ 
tant  for  empirically  minded  political  scientists.  In 
the  first  example,  this  sort  of  awareness  was  used 
to  tweak  an  existing  approach  in  a  way  that  makes 
more  sense  in  a  changed  (and  changing)  world  of 
study.  In  the  second  example,  explicit  reflection  on 
the  different  types  of  causal  arguments  was  used  to 
argue  for  the  inclusion  of  small-N  methods,  more 
suitable  for  some  of  the  tasks  at  hand.  Both  exam¬ 
ples  illustrate  the  importance  of  embracing  differ¬ 
ent  methods  and  thinking  carefully  about  how  our 
methods  engage  the  world  we  study. 

Competing  Worldviews 

As  with  just  about  everything  else  in  political  sci¬ 
ence,  there  is  more  than  one  way  to  look  at  the 
political  world.  Indeed,  there  are  many  who  are 
unwilling  to  commit  to  a  single  Real  World  of 
political  study,  and  this  has  important  conse¬ 
quences  for  the  way  that  methods  (even  familiar 
methods)  are  used.  While  the  focus  of  this  entry  is 
based  on  mainstream,  naturalist  approaches  to 
political  science,  it  can  be  useful  to  glance  at  an 
alternative  ontological  perspective  to  show  how 
standard  methods  play  different  roles  when  used  in 
different  contexts. 

This  alternative  ontological  perspective  can  be 
called  constructivism.  Its  followers  question  the 
existence  of  a  single  world  of  study,  independent 
of  its  examiner  and  capable  of  revealing  objective 
and  singular  truths.  Instead,  constructivists 
believe  that  truth  lies  in  the  eyes  of  the  observer 
and  in  the  constellation  of  power  and  forces  that 
support  that  truth.  As  our  observations  and 
descriptions  of  events  cannot  be  shielded  from  the 
biases  that  surround  us,  constructivists  do  not 
believe  that  it  is  possible  to  secure  an  absolute 
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truth  that  corresponds  to  a  single  Real  World. 
Their  objectives  are  more  modest:  They  point  to 
the  ways  in  which  our  contexts  (and  those  of  our 
subject  matter)  frame  the  way  we  come  to  develop 
an  understanding. 

This  constructivist  tradition  is  often  marginal¬ 
ized  by  mainstream  political  science  for  being 
unreliable  and  even  serendipitous.  Some  think 
mainstream  political  science  does  so  at  its  own 
peril.  This  is  because  constructivism  shares  a  num¬ 
ber  of  ontological  and  epistemological  features 
with  the  naturalist  tradition,  and  these  similarities 
allow  each  tradition  to  borrow  from  the  other.  In 
some  respects,  the  two  perspectives  can  comple¬ 
ment  one  another,  and  the  lessons  learned  from 
one  tradition  can  be  amended,  or  modified,  to  be 
applied  in  contexts  more  familiar  to  the  other. 
Indeed,  this  useful  collaboration  is  evident  in  the 
growing  movement  for  scientific  realism.  Most 
important,  constructivists  and  naturalists  share  a 
willingness  to  see  the  world  in  terms  of  interpre¬ 
table  patterns,  and  both  traditions  rely  heavily  on 
rationalist  and  empiricist  sources  of  knowledge  for 
understanding  these  patterns. 

How  these  perspectives  differ  is  in  their  view  of 
the  source  of  these  patterns  and  the  ends  to  which 
they  use  rationalist  and  empiricist  sources  of 
knowledge.  For  constructivists,  the  patterns  of 
interest  are  not  firmly  rooted  in  nature,  but  they 
are  the  products  of  our  making:  Each  of  us  sees 
and  experiences  different  things,  and  what  we  see 
(or  experience)  is  determined  by  a  complicated 
mix  of  social  and  contextual  influences  and/or 
presuppositions. 

The  modern  intellectual  roots  of  this  tradition 
can  be  traced  to  Immanuel  Kant,  who  was  strug¬ 
gling  with  the  implication  of  David  Hume’s  diffi¬ 
culty  in  securing  firm  knowledge  about  causality. 
Hume’s  approach  to  causation  rested  on  a  rather 
simple  theory  of  sensory  perception:  The  human 
mind  (straightforwardly,  even  mechanistically) 
absorbs  impressions  through  the  senses.  Kant 
seems  to  have  appreciated  Hume’s  general  theory 
of  sensory  perception,  but  he  did  not  think  that  the 
human  mind  should  be  understood  as  an  empty 
vessel  into  which  sensory  perceptions  were  merely 
dumped.  For  Kant,  our  senses  only  bring  percep¬ 
tions  to  the  doorstep  of  the  mind.  It  was  then  up 
to  the  mind  to  reorganize,  characterize,  and  then 
store  these  perceptions  for  later  use.  To  perform 


this  task,  Kant  believed  that  the  human  mind  was 
prewired  with  basic  preconditioning  concepts,  or 
forms  of  understanding.  Thus,  in  thinking  about 
how  we  experience  and  store  sense  perceptions, 
Kant  shifted  the  ontological  terrain  of  patterned 
(lawlike)  behavior  from  nature  to  the  human 
mind.  In  other  words,  Kant  believed  that  the  pat¬ 
terns  we  see  in  nature  are  the  result  of  our  mental 
manipulation  of  sensory  perceptions — not  a  char¬ 
acteristic  of  nature  itself. 

Kant  did  not  suggest  that  the  social  world  was 
ontologically  different  from  the  natural  world. 
Neither  did  he  question  the  existence  of  a  Real 
World.  Rather,  Kant  argued  that  we  cannot  know 
anything  about  the  Real  World  ( noumena ,  in  his 
vocabulary).  All  we  can  know  is  that  our  percep¬ 
tions  (phenoumena)  of  the  Real  World  are  some¬ 
how  related  to  it — and  that  the  nature  of  this 
relationship  is  complex  and  ambiguous.  The  result 
of  this  complex  and  ambiguous  process  is  that  sci¬ 
entists  (both  natural  and  social)  find  themselves 
facing  what  appear  to  be  different,  even  compet¬ 
ing,  worlds. 

There  are  several  other  reasons  to  question  the 
naturalist  ontology.  Many  of  these  responses 
emphasize  the  unique  qualities  of  human  agency  to 
argue  that  the  nature  of  human  relations  must  be 
different  from  those  that  we  find  in  the  natural 
world.  “We  are  cultural  beings,  endowed  with  the 
capacity  and  the  will  to  take  a  deliberate  attitude 
toward  the  world  and  to  lend  it  significance” 
(Weber,  1949,  p.  81).  Constructivists  take  obser¬ 
vations  such  as  these  to  show  how  human  agency, 
itself,  can  create  objects  that  have  a  different  onto¬ 
logical  status  than  those  found  in  the  natural 
world:  where  the  very  existence  of  social  facts 
(e.g.,  money,  property  rights,  and  sovereignty) 
depends  on  human  agreement  and  institutions. 
Given  this  different  ontological  context,  construc¬ 
tivists  question  the  utility  of  employing  methods  to 
search  for  naturally  occurring  (independent)  pat¬ 
terns  of  behavior. 

Because  constructivists  embrace  a  world  that  is 
ontologically  diverse  and  complex,  they  tend  to 
draw  on  more  and  different  types  of  evidence  and 
proof.  While  constructivists  draw  heavily  on  ratio¬ 
nalist  and  empiricist  sources  of  knowledge,  they  are 
often  willing  to  employ  other,  more  radical,  sources 
of  knowledge  as  well  (including,  e.g.,  empathy, 
revelation,  and  myths).  While  these  alternative 
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sources  of  knowledge  may  be  less  reliable,  they  are 
firmly  anchored  in  human  experience.  For  this  rea¬ 
son,  they  can  provide  important  keys  for  unlocking 
the  (constructed)  patterns  researchers  study.  As 
constructivists  are  not  searching  for  objective  and 
verifiable  patterns  of  phenomena  existing  naturally 
in  the  social  world,  they  do  not  subscribe  to  the 
naturalist’s  hierarchy  of  methods.  After  all,  this 
tradition  doesn’t  believe  that  truth  is  just  “out 
there”;  knowledge  about  the  social  world  is  always 
knowledge  in  context;  it  is  socially  situated  and  it 
has  social  consequences.  While  it  is  difficult  to 
argue  that  constructivists  subscribe  to  an  explicit 
methods’  hierarchy  of  their  own,  they  clearly  pri¬ 
oritize  methods  that  protect  and  nurture  the  con¬ 
texts  that  can  explain  the  source  of  the  patterns 
they  see.  As  these  patterns  are  understood  to  be 
socially  (or  individually)  constructed,  their  meth¬ 
ods  aim  to  uncover  the  motivations  and  presuppo¬ 
sitions  that  generate  the  underlying  patterns. 

For  this  reason,  constructivists  have  a  soft  spot 
for  narrative  approaches,  as  these  provide  research¬ 
ers  with  a  nearness  to  the  data  and  context  that  is 
necessary  and  desirable  in  order  to  gain  insight. 
This  emphasis  on  narration  and  the  importance  of 
context  and  contingency  is  extended  to  the  way  in 
which  constructivists  employ  comparisons — as  a 
tool  for  developing  associations  that  can  help 
establish  meaning.  While  constructivists  seldom 
use  statistical  and  experimental  approaches,  there 
is  nothing  inherent  to  either  of  these  methods  that 
need  alienate  constructivist  scholars.  With  a  little 
thought  and  imagination  (two  traits  usually 
embraced  by  constructivists),  statistical  and  exper¬ 
imental  approaches  could  be  designed  and  used  in 
ways  that  can  exploit  contextual  familiarity  and 
appeal  to  constructivists.  In  short,  constructivists 
tend  to  use  the  same  basic  methods,  but  they  do  so 
in  different  ways,  toward  different  objectives. 

In  a  nutshell,  naturalists  tend  to  exhibit  little 
regard  for  the  surrounding  context.  Their  methods 
are  designed  to  cut  into  contexts  in  order  to  gain 
access  to  the  component  parts.  These  are  then 
manipulated  to  control  for,  and  capture,  relevant 
variation.  Because  the  patterns  they  seek  are 
assumed  to  exist  independently  of  the  component 
agents  and  their  observers,  preserving  the  interpre¬ 
tive  context  is  not  a  high  priority  for  naturalists. 
Constructivists,  on  the  other  hand,  want  to  use 
comparisons  to  better  understand  the  nature  or 


source  of  these  patterns.  As  the  patterns  they  see 
are  a  function  of  the  agents  they  study,  or  the 
observers  themselves,  familiar  methods  are  often 
turned  inside  out  to  look  critically  at  how  the  con¬ 
texts  themselves  provide  meaning  to  the  patterns 
under  study. 

As  in  the  naturalist  tradition,  constructivists 
can  draw  from  an  impressive  intellectual  legacy, 
which  stretches  back  to  antiquity.  More  surpris¬ 
ing,  perhaps,  is  that  this  perspective  finds  support 
among  some  rather  unlikely  allies.  For  example, 
this  entry  has  already  noted  how  indebted  main¬ 
stream  (naturalist)  political  science  is  to  the  exper¬ 
imental  methods  developed  by  J.  S.  Mill.  What 
was  not  mentioned  previously  was  that  Mill 
himself  (e.g.,  Mill,  2002)  was  skeptical  of  their 
application  to  the  world  of  politics,  and  his  skepti¬ 
cism  rested  on  ontological  foundations. 

This  sort  of  skepticism  has  been  voiced  by  many 
others.  Indeed,  these  shaky  ontological  foundations 
may  be  the  reason  why  political  science  knowledge 
doesn’t  measure  up  (in  terms  of,  say,  the  ability  to 
generalize  and/or  predict)  to  the  knowledge  pro¬ 
duced  in  the  other  branches  of  (natural)  science. 
While  such  failures  might  be  explained  by  other 
factors  (e.g.,  the  youth  of  the  discipline,  the  com¬ 
plexity  of  human  action),  there  are  formidable 
arguments — authored  by  reputable  thinkers  such 
as  Wilhelm  Dilthey,  John  Stuart  Mill,  Martin 
Heidegger,  Jurgen  Habermas,  and  Charles  Taylor — 
for  believing  that  the  shortcomings  of  this  disci¬ 
pline  are  the  result  of  relying  on  methods  that  rest 
on  faulty  ontological  assumptions. 

These  sorts  of  challenges  are  important,  if  only 
because  they  force  political  scientists  to  consider 
ontological  alternatives.  The  very  existence,  and 
growing  popularity,  of  these  alternatives  should 
encourage  political  scientists  to  consider  the  degree 
to  which  their  approaches  and  methods  rely  on 
assumptions  about  the  nature  of  the  real  world. 
After  all,  if  the  political  world  is  not  really  as 
political  scientists  assume — if  its  patterns  are  not 
constant,  universal,  and  independent  of  us — then 
the  workhorse  methods  they  have  developed  will 
lose  much  of  their  pull. 

Overcoming  Bias 

The  previous  two  sections  argued  for  the  impor¬ 
tance  of  thinking  carefully  about  the  nature  of  the 


Epistemological  and  Methodological  Foundations  799 


political  world  researchers  study  and  how  that 
nature  affects  the  way  they  can  and  should  study 
political  phenomena.  This  section  sets  aside  these 
larger  ontological  issues  and  narrows  the  focus  to 
the  most  common  approaches  in  contemporary 
political  science:  approaches  that  assume  a  natu¬ 
ralist  ontology  and  rely  on  empiricist  sources  of 
knowledge.  But  even  here,  in  this  familiar  onto¬ 
logical  and  epistemological  context,  political  scien¬ 
tists  face  a  number  of  important  obstacles  that  can 
stand  in  the  way  of  securing  objective  truths  about 
the  world  they  study.  Overcoming  these  obstacles 
is  another  role  to  consider  when  choosing  a  par¬ 
ticular  method  or  methods. 

While  there  are  many  ways  to  think  about  the 
practice  of  securing  unbiased  and  true  knowledge 
from  the  Real  World,  most  of  them  draw  from  a 
common  and  distinguished  intellectual  legacy. 
This  section  highlights  the  contribution  of  a  foun¬ 
dational  thinker  in  that  legacy:  Francis  Bacon. 
Bacon’s  approach  to  studying  the  political  world  is 
similar  to  that  of  contemporary  political  science:  It 
begins  by  assuming  that  there  is  a  real  world  out 
there  and  that  it  is  possible  to  obtain  true  knowl¬ 
edge  about  that  world.  Also,  Bacon  was  not  par¬ 
ticularly  troubled  by  the  possibility  that  our  senses 
might  deceive  us  (or  any  of  the  other  arguments 
advanced  by  those  who  are  skeptical  of  our  ability 
to  secure  truth).  Rather,  Bacon  was  concerned 
with  unmasking  the  underlying  sources  of  miscon¬ 
ception,  irrationality,  and  error  that  bar  us  from 
reaching  the  truth.  He  seems  to  have  believed  that 
any  impediment  to  knowledge  could  be  overcome 
by  the  careful  and  considered  means  of  dealing 
with  the  subtleties  of  nature: 

The  mind  of  man  is  far  from  the  nature  of  a  clear 
and  equal  glass,  wherein  the  beams  of  things 
should  reflect  according  to  their  true  incidence; 
nay,  it  is  rather  like  an  enchanted  glass,  full  of 
superstition  and  imposture,  if  it  not  be  delivered 
and  reduced  [corrected].  (Bacon,  1605,  Book  2, 
chap  14,  p.  9) 

To  deliver  and  reduce  this  enchanted  glass, 
Bacon  provided  us  with  a  typology  for  thinking 
about  the  most  common  hindrances  to  seeing 
truth.  In  Novum  Organum  ( 1620/1 863b),  Bacon 
introduces  the  idols  of  the  mind  that  had  hitherto 
blocked  the  progress  of  mankind’s  knowledge  of 


nature.  For  Bacon,  these  idols  are  like  deceitful 
pictures — they  are  images  that  stand  in  the  way  of 
our  grasping  the  truth,  even  though  they  may  be 
grounded  in  human  reason.  Together,  these  idols 
capture  the  range  of  mental,  psychological,  and 
social  dispositions  that  Bacon  believed  were  respon¬ 
sible  for  distortion  and  error.  The  first  of  these  are 
the  idols  of  the  tribe.  These  idols  are  intellectual 
errors  that  result  from  our  common  (i.e.,  human) 
weaknesses.  For  example,  Bacon  tells  us  that  we 
tend  to  assume  more  order  and  regularity  in  the 
world  than  is  actually  the  case;  we  have  a  soft  spot 
for  common  sense — that  is,  we  rely  on  limited 
assumptions  without  trying  to  verify  them  as  estab¬ 
lished  truths;  we  tenaciously  hold  on  to  our  beliefs 
by  means  of  continual  rationalization,  even  in  the 
face  of  countervailing  evidence;  we  are  attracted  to 
positive  examples  more  than  to  negative  examples; 
we  pursue  wishful  thinking  (i.e.,  we  believe  what 
we  want  to  believe);  and  we  have  a  tendency  to 
overgeneralize  and  to  believe  in  ultimate  causes. 

In  short,  Bacon’s  idols  of  the  tribe  refer  to  our 
tendency  to  project  our  own  (human)  patterns  of 
instinctive  thinking  onto  nature.  This  tendency  has 
not  died  with  Bacon,  nor  has  it  been  permanently 
corrected  by  subsequent  methodological  develop¬ 
ments.  Indeed,  these  idols  can  be  seen  at  the  fore¬ 
front  of  all  the  social  sciences.  For  example,  the 
idols  of  the  tribe  might  be  used  to  explain  why 
political  scientists  search  for  (and  find!)  the  demo¬ 
cratic  peace,  why  economists  find  it  fruitful  to 
borrow  the  concept  and  mathematics  of  “energy” 
from  19th-century  physics  and  apply  it  to  the  con¬ 
cept  of  utility,  and  why  social  science  journals  do 
not  appear  to  be  interested  in  publishing  negative 
findings  (and  why  it  has  been  so  difficult  to  launch 
a  journal  of  negative  results  in  the  social  sciences). 
More  to  the  point,  these  idols  may  explain  our  will¬ 
ingness  to  assume  that  nature  is  constant,  indepen¬ 
dent,  and  patterned,  even  in  the  absence  of  an 
almighty  God  that  was  once  said  to  have  made  it  so. 
Bacon  himself  would  have  political  scientists 
continually  question  the  underlying  ontological 
assumptions  of  their  discipline — not  because  those 
assumptions  are  unique  to  their  discipline  but 
because  those  assumptions  might  reflect  a  deeper, 
human  need  for  a  world  that  is  constant  and  pat¬ 
terned,  as  if  designed  by  a  god. 

This  entry  is  not  suggesting  that  the  democratic 
peace,  neoclassical  economics,  or  all  of  the  positive 
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examples  in  professional  journals  are  either  false 
or  illusive.  (Nor  does  this  entry  intend  to  argue 
for — or  against — the  existence  of  God.)  The  point, 
like  Bacon’s,  is  more  modest:  Researchers  need  to 
be  extra  diligent  and  skeptical  when  they  discover 
anthropocentric  patterns  and  arrive  at  convenient 
truths. 

The  second  category  of  Bacon’s  idols  concerns 
those  of  the  cave.  Idols  of  the  cave  represent  the 
peculiarities  of  each  individual’s  temperament  and 
limitations.  These  individual  biases  can  include, 
for  example,  tendencies  toward  insularity,  conser¬ 
vatism,  or  novelty;  acceptance  of  authoritative 
propositions;  or  a  willingness  to  bypass  intellectu¬ 
ally  difficult  positions.  While  the  idols  of  the  tribe 
are  derived  from  the  human  condition,  the  idols  of 
the  cave  reflect  one’s  own  individual  biases.  These 
biases  are  particularly  difficult  to  shed,  as  they  are 
derived  from  (and  forged  in)  individual  experience. 
But  they  are  biases,  nonetheless.  Consider  how 
practitioners  of  political  science  often  become 
beholden  to  a  particular  method.  As  methods 
become  more  specialized  and  technical,  this  sort  of 
method’s  monogamy  is  increasingly  common.  This 
specialization  is,  in  itself,  a  welcome  development, 
as  a  division  of  specialized  labor  can  boost  produc¬ 
tivity  and  the  quality  of  its  research.  In  doing  so, 
however,  political  scientists  reverse  the  way  in 
which  scientists  have  traditionally  understood  the 
relationship  between  the  world,  the  methods,  and 
the  understanding.  Rather  than  let  the  nature  of 
the  question  determine  the  proper  method  and 
means  of  understanding,  we  see  the  world  as  we 
would  have  it  from  the  comfort  of  our  cave. 

This  method’s  monogamy  can  have  serious  con¬ 
sequences  for  the  way  we  come  to  interpret  and 
understand  the  world.  Political  scientists  may 
remember  their  own  training  in  regression  tech¬ 
niques  and  how  exciting  (and  challenging)  it  was 
to  interpret  the  world  anew  in  terms  of  dependent 
and  independent  variables.  This  new  perspective 
can  be  liberating,  but  it  is  also  blinkered:  “Someone 
with  a  new  hammer  thinks  the  world  as  a  nail.” As 
with  the  idols  of  the  tribe,  Bacon  teaches  us  that  it 
is  possible  to  overcome  idols  of  the  cave  by  being 
aware  of  our  individual  biases  and  confronting 
them  directly  in  a  critical,  balanced,  and  clear  way. 
It  is  for  this  reason  (rather  than  a  need  for  a  clear 
demarcation  principle)  that  researchers  should 
strive  to  falsify  their  findings. 


The  next  two  types  of  idols  are  not  so  easily 
corrected.  Bacon’s  third  category,  the  idols  of  the 
marketplace ,  is  derived  from  the  shortcomings  of 
language  and  intercourse,  as  words  often  betray 
their  own  purpose  and  obscure  the  very  thoughts 
they  are  designed  to  express.  At  the  most  general 
level,  Bacon’s  idols  of  the  marketplace  contain  two 
types  of  misunderstandings.  First,  we  often  give 
confusing,  even  ill-defined,  names  to  things  that  do 
exist.  After  all,  the  world  is  full  of  meaningless 
words,  words  with  double  meanings,  personalized 
meanings,  jargon,  and  so  on.  Second,  names  are 
sometimes  given  to  unreal  things  (e.g.,  Fortune, 
Prime  Mover).  While  the  first  type  of  errors  is  dif¬ 
ficult  to  eliminate,  as  they  are  deep-seated  and 
quite  complicated,  Bacon  believed  that  the  latter 
errors  can  be  easily  thrown  out  along  with  the 
theories  that  inform  them. 

Most  social  scientists  are  aware  of  the  slippery 
and  inaccurate  ways  in  which  language  and  con¬ 
cepts  can  be  used  in  the  social  sciences.  Consider, 
for  example,  the  different  ways  in  which  “signifi¬ 
cance”  is  often  used  in  statistical  papers  or  the 
varied  and  overlapping  ways  that  political  scien¬ 
tists  refer  to  democracy,  political  parties,  global¬ 
ization,  or  even  justice.  As  Emile  Durkheim  noted, 

In  the  present  state  of  knowledge,  we  cannot  be 
certain  of  the  exact  nature  of  the  state,  of 
sovereignty,  political  liberty,  democracy,  socialism, 
communism,  etc.  Our  methods  should,  then, 
require  our  avoidance  of  all  use  of  these  concepts 
so  long  as  they  have  not  been  scientifically 
established.  And  yet  the  words  which  express 
them  recur  constantly  in  the  discussions  of 
sociologists  [and  political  scientists].  They  are 
freely  employed  with  great  assurance,  as  though 
they  correspond  to  things  well  known  and  precisely 
defined,  where  as  they  awake  in  us  nothing  but 
confused  ideas,  a  tangle  of  impressions,  prejudices 
and  emotions.  (Durkheim,  1895/1964,  pp.  65-66) 

The  fourth  and  final  of  Bacon’s  categories  con¬ 
tains  the  idols  of  the  theater.  These  are  idols 
derived  from  things  learned  in  the  past,  on  the 
basis  of  poorly  conceived  experiments  and  super¬ 
stitions.  This  makes  them  particularly  difficult  to 
avoid  or  overcome,  as  false  learnings — when  culti¬ 
vated — gain  wide  influence  and  become  unques¬ 
tioned  by  their  followers.  It  is  because  of  this  that 
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Bacon  dedicated  most  energy  to  deal  with  these 
idols,  criticizing  the  habits  of  earlier  philosophers 
and  their  approaches. 

Presently,  the  political  world  (and  its  most  influ¬ 
ential  actors)  has  begun  to  recognize  the  idolatry 
of  a  once  dominant  model  of  political  and  eco¬ 
nomic  exchange.  The  idea  of  a  self-regulating, 
equilibrating  marketplace — one  that  could  usefully 
and  peacefully  coexist  within  democratic  poli¬ 
ties — had  become  ingrained  in  the  minds  of  men 
and  is  supported  by  the  sort  of  poorly  conceived 
experiments  and  superstitions  to  which  Bacon 
referred.  As  the  curtain  appears  to  be  falling  on 
this  idol  of  the  theater,  political  scientists  will  have 
to  rely  on  new  understandings  of  the  political 
world — understandings  that  are  more  aware  of 
their  debt  to  the  “various  dogmas  of  philosophy” 
and  the  “wrong  laws  of  demonstration.” 

For  Bacon,  these  idols  did  not  challenge  his 
underlying  (naturalist)  ontology.  Rather,  he 
sought  to  undermine  the  authority  of  accepted 
sources  of  knowledge;  he  wanted  to  point  out  the 
pitfalls  that  lie  along  our  path  to  more  useful 
knowledge,  as  only  then  do  we  have  a  chance  of 
avoiding  them.  Bacon  believed  that  the  mind  of 
man  is  like  a  circus  mirror;  it  reflects  what  is  going 
on  in  the  world,  but  it  distorts  it  in  the  process.  As 
a  result,  our  mind  produces  false  images  or  idols. 
While  the  mind  will  always  create  these  false 
images,  Bacon  believed  that  it  was  possible  to 
compensate  for  the  distortion  by  building  a  cor¬ 
recting  lens.  Broad-based  methodological  reflec¬ 
tion  provides  that  lens. 

Conclusion 

Bacon’s  Novum  Organum  is  divided  into  two 
books.  The  first  book  introduces  the  idols  of  the 
human  mind,  to  show  the  sort  of  obstacles,  biases, 
prejudices,  and  limitations  of  perspective  that  can 
stand  in  the  way  of  our  quest  for  knowledge. 
These  idols  warn  of  the  dangers  inherent  to  both 
inductive  and  deductive  approaches. 

The  second  book  provides  the  sort  of  lens  that 
he  believed  was  necessary  to  correct  for  the 
maligned  images.  In  doing  so,  he  did  not  argue 
that  all  the  idols  could  be  overcome.  Indeed,  the 
latter  two  types  of  idols  (those  of  the  marketplace 
and  theater)  could  not  be  eradicated,  as  they 
denote  social  and  ideological  biases  that  are 


imposed  from  the  outside.  For  these  idols,  we  can 
only  be  aware  of  their  existence  and  hope  that  this 
awareness  will  protect  us  from  the  insidious  effects 
they  have  on  our  minds. 

The  first  two  idols,  by  contrast,  “inhere  in  the 
nature  of  the  intellect.”  For  this  reason,  Bacon 
believed  that  these  idols  of  the  tribe  and  the  cave 
could  be  overcome  by  confronting  them  in  a  criti¬ 
cal,  balanced,  and  clear  way.  It  is  in  this  light  that 
methodological  reflection  is  so  important  for  good 
science — whatever  its  ontological  foundations. 
Bacon’s  solution  to  these  problems  was  to  develop 
a  single  method,  called  induction,  which  could  cor¬ 
rect  these  many  distortions.  This  name  for  his 
method  has  fooled  many  into  seeing  Bacon  as  some 
sort  of  poster  boy  for  atheoretical  empiricist 
approaches.  It  is  hoped  that  by  reviewing  his  idols 
of  the  mind,  readers  will  see  how  critical  Bacon  was 
of  the  idea  that  simple  observation  alone  could  con¬ 
vey  accurate  knowledge  of  the  Real  World.  Bacon’s 
( 1620/1 863b)  approach  combined  rational  and 
empiricist  elements  in  a  middle-course  approach: 

Those  who  have  handled  sciences  have  been 
either  men  of  experiment  or  men  of  dogmas.  The 
men  of  experiment  are  like  the  ant,  they  only 
collect  and  use;  the  reasoners  resemble  spiders, 
who  make  cobwebs  out  of  their  own  substance. 
But  the  bee  takes  a  middle  course:  it  gathers  its 
material  from  the  flowers  of  the  garden  and  of 
the  field,  but  transforms  and  digests  it  by  a 
power  of  its  own.  Not  unlike  this  is  the  true 
business  of  philosophy;  for  it  neither  relies  solely 
or  chiefly  on  the  powers  of  the  mind,  nor  does  it 
take  the  matter  which  it  gathers  from  natural 
history  and  mechanical  experiments  and  lay  it  up 
in  the  memory  whole,  as  it  finds  it,  but  lays  it  up 
in  the  understanding  altered  and  digested. 
Therefore  from  a  closer  and  purer  league  between 
these  two  faculties,  the  experimental  and  the 
rational  (such  as  has  never  yet  been  made),  much 
may  be  hoped.  (Book  1,  p.  95/xcv) 

In  the  same  way  that  Bacon  counsels  us  to  bor¬ 
row  from  the  ways  of  the  ant  and  those  of  the 
spider,  good  methodologists  need  to  be  aware 
of — and  draw  from — competing  worldviews, 
sources  of  knowledge,  and  methods.  Choosing  the 
right  method  requires  critical  reflection  on  how  a 
given  method  can  interrogate  the  world  under 
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study  and  generate  the  type  of  knowledge  we  hope 

to  obtain. 

Jonathon  W.  Moses 
Norwegian  University  of  Science  &  Technology 

Trondheim,  Norway 

See  also  Causality;  Constructivism;  Quantitative 
Methods,  Basic  Assumptions;  Quantitative  Versus 
Qualitative  Methods 

Further  Readings 

Bacon,  F.  (1605).  The  advancement  of  learning.  New 
York:  Dent  &  Sons.  Retrieved  from  http://www 
.uoregon.edu/~rbear/adv2.htm 

Bacon,  F.  (1863a).  The  great  installation.  In  J.  Spedding, 
R.  L.  Ellis,  &  D.  D.  Heath  (Trans.),  The  works 
(Vol.  8).  Boston:  Taggard  and  Thompson.  Retrieved 
November  27,  2010,  from  http://www.constitution 
.org/bacon/instauration.htm  (Original  work 
published  1620) 

Bacon,  F.  (1863b).  Novum  Organum.  In  J.  Spedding, 

R.  L.  Ellis,  &  D.  D.  Heath  (Trans.),  The  works 
(Vol.  8).  Boston:  Taggard  and  Thompson.  Retrieved 
November  27,  2010,  from  http://www.constitution 
.org/bacon/nov_org.htm  (Original  work  published 
1620) 

Durkheim,  E.  (1964).  The  rules  of  sociological  method 
and  selected  texts  on  sociology  and  its  method 
(G.  E.  G.  Catlin,  Ed.;  S.  A.  Solovay  &  J.  H.  Mueller, 
Trans.).  New  York:  Free  Press.  (Original  work 
published  1895) 

Hall,  P.  A.  (2003).  Aligning  ontology  and  methodology  in 
comparative  research.  In  J.  Mahoney  & 

D.  Rueschemeyer  (Eds.),  Comparative  historical 
analysis  in  the  social  sciences  (pp.  373-406). 

New  York:  Cambridge  University  Press. 

Hall,  P.  A.  (2008).  Systematic  process  analysis:  When  and 
how  to  use  it?  European  Political  Studies,  7,  304-317. 

Jahn,  D.  (2006).  Globalization  as  “Gabon’s  problem”: 
The  missing  link  in  the  analysis  of  diffusion  patterns 
in  welfare  state  development.  International 
Organization,  60,  401-431. 

Mill,  J.  S.  (2002).  A  system  of  logic.  Honolulu,  HI: 
University  Press  of  the  Pacific.  (Original  work 
published  1891) 

Moses,  J.  W.,  &  Knutsen,  T.  L.  (2007).  Ways  of 
knowing:  Competing  methodologies  in  social  and 
political  research.  Hampshire,  UK:  Palgrave. 

Shapiro,  I.  (2005).  The  flight  from  reality  in  the  human 
sciences.  Princeton,  NJ:  Princeton  University  Press. 


Topper,  K.  (2005).  The  disorder  of  political  inquiry. 

Cambridge,  MA:  Harvard  University  Press. 

Weber,  M.  (1949).  Objectivity  in  social  science  and  social 
policy.  In  E.  A.  Shils  &  H.  A.  Finch  (Ed.  &  Trans.), 

The  methodology  of  the  social  sciences  (pp.  50-112). 
Glencoe,  IL:  Free  Press.  (Original  work  published  1904) 


Equality 


Throughout  human  history,  equality  as  an  ideal 
has  evoked  powerful  emotions.  It  has  fueled  revo¬ 
lutions  and  social  movements,  and  people  have 
died  and  have  killed  for  it.  In  addition,  it  has  been 
the  subject  of  formal  analysis  that  has  sought  to 
analyze,  compare,  and  measure  inequalities  in 
goods,  liberties,  and  opportunities.  This  entry 
examines  the  concept  of  equality,  beginning  with 
the  two  fundamental  questions:  “why  equality?” 
and  “equality  of  what?”  The  second  issue  con¬ 
cerns  the  varied  domains  in  which  equality  may  be 
sought.  This  distinguishes  the  various  social  ethi¬ 
cal  conceptions,  but  it  seems  a  priori  that  such  a 
moral  question  can  be  answered  only  if  the  former 
question  is:  why  equality  in  the  first  place?  There 
turns  out  to  be  various  types  of  reasons  for  equal¬ 
ity,  and  in  particular  the  trivial  equality  as  gener¬ 
ality  or  universality,  the  basic  equality  as  logic  or 
rationality,  and  the  socially  essential  equality  as 
the  absence  of  subjection  and  domination  (hence 
as  this  freedom).  The  discussion  of  equality  from 
rationality  introduces  an  analysis  in  terms  of  rule 
equality. 

The  entry  then  explores  alternative  conceptual¬ 
izations  of  distributive  justice,  which  can  be  consid¬ 
ered  with  respect  to  both  their  substance  (the  goods 
to  be  distributed)  and  their  structure  (the  principles 
according  to  which  distribution  should  occur).  The 
concept  of  equality  of  liberty  is  then  explored  with 
respect  to  the  equality  of  basic  rights;  of  real  liberty 
(including  means  for  the  possibility  of  free  action); 
and  of  opportunity.  The  relationship  between 
equality  and  responsibility,  the  concept  of  equal 
hypothetical  liberty,  and  the  notion  of  comparative 
egalitarianism  are  also  considered. 

The  concept  of  equality  is  an  important  concern 
for  political  science,  and  the  entry  considers  the 
relationship  between  democracy  and  political  equal¬ 
ity.  For  philosophers,  the  concept  of  impartiality 
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has  been  a  central  one,  and  the  entry  discusses  its 
elaboration  through  the  theory  of  the  original 
position.  The  work  of  social  scientists  on  the  mea¬ 
surement  and  comparison  of  inequality  is  also 
noted.  Finally,  equality  in  social  relations  can  be 
seen  as  grounded  in  reciprocity — the  fundamental 
tendency  of  human  beings  to  treat  and  relate  to 
others  as  they  treat  and  relate  to  you. 

The  Problem  of  Equality 

Equality  as  the  First  Virtue  of  Society 

Human  history  is  intrinsically  linked  to  inequal¬ 
ity,  in  such  forms  as  slavery,  racism,  apartheid, 
sexism,  class  and  caste  structures,  and  other  forms 
of  domination  and  discrimination.  A  vast  array  of 
facts,  emotions,  and  reasons  establishes  the  over¬ 
whelming  importance  of  both  the  question  of 
equality  and  of  its  necessary  conceptual  clarifica¬ 
tion.  Equality  may  sometimes  be  so  bad  that  only 
one  thing  can  be  worse:  its  absence.  “Inequality  is 
the  source  of  all  evil”  is  Jean-Jacques  Rousseau’s 
(1755)  clear-cut  conclusion  of  a  nevertheless  elab¬ 
orate  investigation.  Aristotle  and  John  Rawls  see 
justice  as  actual  or  ideal  equality  and  find  it  to  be 
the  first  virtue  of  society.  Indeed,  “Justice  is  equal¬ 
ity,  as  everybody  thinks  it  is,  quite  apart  from 
other  considerations”  is  Aristotle’s  teaching  to  the 
king’s  son  in  Nicomachean  Ethics.  Social  ethical 
equality,  the  topic  here,  is  almost  consubstantial 
with  the  concepts  of  justice  in  the  same  field  (social 
justice,  distributive  justice,  compensatory  justice, 
restorative  justice,  commutative  justice,  diorthic 
justice,  etc.),  but  this  entry  first  considers  the  issues 
from  the  equality  angle. 

Equality  of  What? 

Of  course,  equality  can  a  priori  be  of  many 
things,  with  often  opposite  actual  consequences. 
Equality  may  be  of  various  types,  including 

equality  of  income  or  goods; 

equality  of  basic  rights  or  liberties; 

equal  right  to  the  product  of  one’s  labor  or 
capacities; 

equality  in  psychological  welfare  (happiness, 
satisfaction); 


equality  in  “real”  freedom  of  choice; 
equality  of  opportunity; 
the  political  equalities; 
relational  equality; 
equality  in  dignity; 
equality  of  consideration; 
a  society  of  equals;  and 
ontological  equality. 

Equality  is  commonly  thought  to  mean  equality 
in  incomes  or  goods.  It  can  also  be  in  liberty,  how¬ 
ever.  Historically,  in  fact,  the  first  and  main 
demand  for  general  equality  was  equality  in  rights 
and  notably  in  basic  rights,  which  are  essentially 
liberties  (“Men  are  free  and  equal  in  rights”  is  the 
opening  statement  of  the  1789  Declaration  of  the 
Rights  of  Man  and  of  the  Citizen).  This  freedom 
from  forceful  interference  has  been  seen  as  forbid¬ 
ding  income  redistribution,  thus  as  meaning  equal 
full  self-ownership  (i.e.,  each  person  is  entitled  to 
the  effects  and  products  of  her  own  capacities  to 
work  and  earn,  and  to  enjoy)  and  hence  implying 
a  precise  opposite  of  income  equality.  Equality 
may  be  not  in  goods  but  in  the  (psychological) 
welfare  or  “happiness”  people  derive  from  them 
thanks  to  their  capacities  to  enjoy.  On  the  con¬ 
trary,  it  can  be  in  the  (other)  resources  given  to 
society,  and  therefore  in  the  “real  liberty”  of  using 
them,  thus  complementing  the  “formal  liberty” 
provided  by  the  basic  rights.  If  these  resources  are 
attached  to  the  individuals,  as  their  earning  capac¬ 
ities  or  social  conditions  are,  transfers  or  specific 
policies  achieve  this  equalization.  This  can  give 
various  equalities  of  opportunity.  Equality  can 
also  be  in  the  variety  of  social  relations,  processes, 
statuses,  situations,  or  conditions.  Of  particular 
importance  is  political  equality,  equality  in  politi¬ 
cal  power  and  civic  duty,  and  its  manifestation  in 
democracy.  Finally,  one  kind  of  equality  is  particu¬ 
larly  fundamental  in  the  ethics  of  modernity:  that 
of  the  basic  moral  worth  of  humans  as  such,  with 
the  attached  respect,  consideration,  dignity,  and 
social  and  material  consequences.  This  “ontologi¬ 
cal  equality”  refers  to  our  common  humanity, 
which  should  be  respected  in  all  its  instances  (basic 
moral  equality).  In  Immanuel  Kant’s  words,  all 
humans  are  equal  in  the  kingdom  of  ends,  and  no 
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one  should  consider  any  other  as  a  means  only. 
Equality  can  also  appear  in  different  types  of  rules 
that  permit  to  determine  individual  situations.  It  is, 
for  instance,  an  equality  of  weights  in  utilitarian¬ 
ism  or  in  the  highest  social  income  (highest  sums 
of  individuals’  utilities  or  incomes).  Equality  is  also 
sometimes  rule-equality  (or  functional  equality), 
that  is,  the  items  of  individuals  are  derived  from 
their  specific  given  or  chosen  characteristics  by  the 
same  rule  or  function.  As  is  noted  below,  this  is  the 
very  structure  of  rationality  in  the  sense  of  provid¬ 
ing  a  reason,  with  important  consequences. 

Equality  and  Modernity: 

Formal  and  Real  Equalities 

The  equalities  considered  here  are  results  of 
choices  by  society,  often  by  institutions  but  some¬ 
times  by  individuals.  In  almost  all  societies  there 
are  peer  groups  with  some  values  of  equality 
between  their  members,  and,  often,  equalities  of 
certain  types  with  larger  extensions.  Elowever,  we 
are  also  particularly  interested  in  equality  in  the 
ethics  of  modernity.  The  logical  analysis  of  equality 
will  apply  to  all  cases.  The  ethics  of  modernity  is 
characterized  by  the  acceptance  or  demand,  by 
large  majorities  of  populations,  of  certain  equali¬ 
ties  for  large  populations,  universally  for  some 
equalities.  These  ideal  values  are,  first,  moral  basic 
worth,  classical  basic  rights,  and  some  sort  of 
democracy.  Respect,  and  basic  rights  when  the  dis¬ 
tribution  of  resources  is  given,  are  nonrival  (that  is, 
one  individual’s  benefit  from  an  item  does  not  pre¬ 
vent  or  impair  similar  benefits  for  others)  and 
therefore  the  demands  may  simply  be  that  each 
person  should  have  them,  which  implies  their 
equality.  In  contrast  with  these  consensual  values 
of  the  ethic  of  modernity,  this  ethic  is  deeply 
divided  with  regard  to  the  distribution  of  goods — 
the  economic  values.  The  polar  positions  are,  on 
the  one  hand,  a  divided  family  of  “egalitarians” 
who  favor  equality  in  incomes,  goods,  resources,  or 
welfare  and,  on  the  other  hand,  “classical  liberals” 
who  advocate  self-ownership  of  all  personal  capac¬ 
ities  (to  earn  and  enjoy) — and  hence,  by  the  way, 
equal  self-ownership  for  all.  This  issue  and  the 
resulting  structure  of  the  optimum  distributions  are 
discussed  below.  Note  that  since  (prima  facie) 
equal  treatment  of  equals  in  the  relevant  character¬ 
istics  turns  out  to  be  a  logically  necessary  property 


of  a  determinate  social  choice  with  minimal  ratio¬ 
nality  (cf.  below),  equality  appears  in  two  different 
ways  in  social  choice:  (1)  as  this  rationally  neces¬ 
sary  property  of  all  social  ethics  that  applies  to  the 
particular  equalizand  and  scope  of  this  ethics 
whatever  they  are — it  can  in  particular  be  an  equal 
freedom,  for  instance;  and  (2)  as  the  particular 
values  of  the  noted  family  of  distributional  “egali¬ 
tarians”  (in  goods,  incomes,  resources,  or  welfare). 

The  object  of  equality  has  various  possible  struc¬ 
tural  properties,  discussed  below.  For  now,  simply 
note  that  equality  can  be  between  individuals  but 
also  between  groups  or  institutions  variously 
defined  (with,  possibly,  the  problem  of  relating  the 
situation  of  the  group  to  that  of  its  members). 
For  simplicity  in  presentation,  this  entry  uses 
expressions  of  equality  between  persons  or  indi¬ 
viduals  only. 

Why  Equality? 

Equality  raises  two  classical  questions:  “of  what?” 
(including  between  whom  and  in  what  circum¬ 
stances)  and  “why?”  The  operational  question  is 
“of  what?”  However,  it  seems  that  it  can  be 
answered  if  and  only  if  we  first  have  the  answer  to 
the  other,  apparently  deeper  question,  “why?”  The 
issue  is  more  subtle,  however.  Consider,  for 
instance,  very  common  claims  such  as  that  all 
humans  should  equally  have,  for  instance,  any¬ 
thing  such  as  “the  basic  rights,”  or  “at  least  the 
food  needed  for  survival.”  This  “of  what,”  what¬ 
ever  its  own  reason,  constitutes  the  reason  for  this 
equality,  it  explains  it.  Then,  the  answer  to  “equal¬ 
ity  of  what”  entails  the  answer  to  “why  equality?” 
In  this  case,  the  mention  of  equality  is  in  fact 
redundant.  Yet  it  is  often  emphasized  for  reasons 
noted  below.  In  another  example,  the  very  com¬ 
monly  given  reason  “I  divide  this  cake  equally 
because  I  see  no  reason  to  divide  it  otherwise”  has 
a  puzzling  logic  analyzed  shortly.  In  other  cases, 
equality  and  its  reason  or  value  are  just  two  differ¬ 
ent  names  for  the  same  thing,  as  with  the  most 
important  equality  as  nondomination  or  nonsub¬ 
jection  (the  two  faces  of  the  same  relational  coin). 

When  reasons  for  equality  are  considered,  the 
striking  fact  is  that  there  is  not  one  reason  or 
motive  for  equality  but  many  of  them,  of  very 
different  and  often  unrelated  kinds.  The  two 
most  important  types  of  reason  for  equality  are 
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of  totally  different  natures.  One  is  equality  as 
logic  or  rationality.  It  concerns  the  reason  for 
“equal  treatment  of  equals,”  the  logic  of  justifi¬ 
cation,  the  property  of  “permutability,”  and  the 
meanings  of  justifying  equality  by  the  absence  of 
a  sufficient  reason  for  inequality.  The  other  type 
is  social.  It  is  equality  as  nonsubjection  and  non¬ 
domination,  a  protective  or  negative  relational 
equality,  justified  by  this  type  of  liberty  and  of 
dignity,  and  extending  to  the  general  properties 
of  relations  between  equals.  Equality  as  rational¬ 
ity  can  apply  to  all  issues — economic,  social, 
political. 

Logic — if  one  dare  say — provides  also  another 
reason  for  equality  that  is  trivial  from  its  view¬ 
point,  a  tautology,  and  is  nevertheless  often 
repeated,  sometimes  with  great  emphasis  and  a 
great  importance  attached  to  it.  This  is  equality  as 
generality  or  universality,  meaning  that  each 
member  of  a  given  group  has  or  should  have  some 
given  property  of  any  nature.  This  is  extended 
into  a  comparison:  each  member  has,  all  members 
have,  all  members  equally  have.  This  property 
then  is  general  to  the  members  of  the  group.  It  is 
“universal”  in  this  group,  but  the  term  “univer¬ 
sal”  is  often  reserved  for  cases  in  which  the  group 
is  all  mankind.  Logically,  this  mention  of  equality 
is  redundant.  Its  presence  may  have  two  reasons 
aiming  at  reinforcing  the  claim  or  value.  One  is  to 
draw  attention  on  the  fact  that,  in  the  present  or 
past  states,  some  members  only  have  or  had  the 
property.  Another  may  be  to  appeal  to  other  rea¬ 
sons  for  equality,  namely  comparative  fairness 
based  on  the  logical  reasons  mobilized  by  the 
emphasis  that  the  persons  in  question  have  the 
same  relevant  characteristics. 

Comparative  equality  results  from  the  compari¬ 
son  of  persons’  endowments  of  the  items  relevant 
in  nature  and  in  measure  (e.g.,  perhaps  the  appro¬ 
priate  concepts  relative  to  some  specific  character¬ 
istic  of  the  person).  Equality  then  results  from 
sentiments  of  relative  fairness,  and  it  prevents  the 
various  social  sentiments  that  may  be  aroused  by 
inequality  (such  as  envy  or  sentiments  of  injustice, 
unfairness,  inferiority,  or  superiority).  This  fair¬ 
ness,  however,  is  based  on  the  notion  that  the 
persons  have  the  same  relevant  characteristics  (no 
one  deserves,  needs,  or  is  entitled  to  or  accountable 
for  more  than  the  other)  and  on  the  logical  reasons 
presented  in  the  section  dealing  with  rationality  as 


a  reason  for  equality.  Relatedly,  the  principle  of 
“equity-no-envy”  saying  that  each  person  prefers 
to  have  her  own  allocation  or  situation  rather  than 
that  of  any  other  person  holds  a  central  place  in 
equality  analyses. 

Equality,  therefore,  is  essentially  a  derived 
value.  It  derives  from  direct  (end-)  values  by  impli¬ 
cations  that  are  varied  and  opposite  in  type  and 
direction.  In  the  various  cases,  it  is  a  condition,  a 
cause,  or  a  consequence.  For  nondomination, 
equality  is  factually  identical  with  it  and  hence 
morally  a  consequence  of  it.  Directly  comparative 
approval  of  equality  results  from  some  sentiment 
of  propriety  perhaps  supported  by  the  justification 
from  rationality.  However,  it  is  not  sure  that 
equality  is  or  can  be  valued  as  an  end  in  itself, 
directly,  although  it  may  look  like  this  in  some 
egalitarian  judgments  that  appear  as  gut  feelings 
or  flashes  of  moral  intuition,  previous  to  consid¬ 
ered  analysis  (the  opposite  of  the  search  for  a 
good  reason).  This  may  concern,  in  particular,  the 
basic  worth  of  humans,  relational  equality  in  itself 
(relation  between  equals),  comparative  fairness, 
the  absence  or  impossibility  of  a  reason  for 
inequality,  and  the  pure  quasi-aesthetic  value  of 
balance  and  symmetry. 

When  the  relevant  equality  is  impossible  or 
costly  on  other  grounds,  some  reasons  for  it  or 
judgments  favoring  it  can  extend  to  preferring 
lower  corresponding  inequalities.  This  extends 
considerably  the  complexity  of  the  problem  and 
constitutes  a  vast  field  of  studies.  When  what  is 
wrong  with  inequality  is  that  people  who  have 
the  least  have  too  little,  and  if  another  situation 
can  improve  their  situation  sufficiently  without 
costs  in  the  other  people’s  endowments  of  this 
item  or  otherwise  that  would  make  the  overall 
situation  worse,  the  solution  may  be  to  maximize 
the  lowest  endowments  or  “maximin”  (“practi¬ 
cal  justice”  for  interpersonally  comparable 
ordinal  utilities  in  Serge  Kolm  [1971 J,  the  “dif¬ 
ference  principle”  for  an  index  of  “primary 
goods”  in  Rawls  [1 97 1 J ,  or  Derek  Parfit’s  [1 995 J 
“prioritarianism”). 

Finally,  some  equalities  induce,  entail,  or  require 
others.  This  can  result  from  the  existence  of  strictly 
complementary  goods.  For  instance,  enjoying  some 
right  or  liberty  may  require  some  condition  such  as 
the  access  to  some  amount  of  some  good.  But  the 
most  famous  and  classical  example  is  Arthur 
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Pigou’s  derivation  of  equal  income  from  the  utili¬ 
tarian  highest  sum — hence  with  equal  weights — of 
identical  concave  individual  utility  functions.  A 
more  elaborate  similar  property  is  the  basis  of  the 
present-day  welfarist  theory  of  measures  of 
inequality. 

The  essential  question  of  the  relations  between 
equality  and  liberty  will  be  split  in  two:  equality  as 
liberty,  the  historically  most  important  defensive 
relational  equality  of  nonsubjection  and  nondomi¬ 
nation,  and  equality  of  liberty,  including  the  basic 
rights  and  the  various  cases  of  equality  of  freedom 
of  choice  and  of  opportunity. 

Equality  as  Liberty:  Nondomination 
and  Nonsubjection 

Today,  equality  is  commonly  considered  as 
opposed  to  liberty.  This  usually  refers  to  inequali¬ 
ties  in  income  and  wealth  resulting  from  free 
exchange,  and  to  interferences  by  public  redistri¬ 
butions  tending  to  reduce  these  inequalities.  It 
sometimes  also  refers  more  philosophically  to 
freedom  permitting  the  manifestation  of  differ¬ 
ences  in  preferences  in  a  diversity  seen  as  an 
inequality.  However,  liberty  and  equality 
entered — and  founded — the  modern  world  not  as 
enemies  but  as  associates,  or,  rather,  as  identical 
situations.  Such  a  radical  change  as  overthrowing 
the  “feudal”  order  required  the  association  of 
these  two  powerful  values.  The  principle  that 
“men  are  free  and  equal  in  rights”  (the  1789 
Declaration)  transmutes  dominated  subjects  (and 
their  masters)  into  equal  and  free  citizens. 

The  absence  of  the  relation  of  subjection  and 
domination  is,  indeed,  in  a  society,  both  the  most 
basic  equality  and  the  most  basic  liberty.  Relations 
are  more  intrinsic  to  society  than  comparisons  are, 
and,  in  a  relation,  freedom  from  the  other’s  com¬ 
mand  and  equality  are  practically  synonyms. 
Domination  is  a  person’s  power  to  compel  another 
do  something,  notably  by  force  or  threat.  By 
nature,  the  corresponding  subjection  is  the  most 
vicious  of  unfreedoms  since,  in  it,  a  person’s  will 
determines  another’s  acts.  It  is  in  essence  worse 
than  a  simple  constraint,  not  only  because  of  the  a 
priori  uncertainty,  but,  much  more  basically, 
because  it  constitutes  a  kind  of  amputation  of  part 
of  the  dominated  self,  and  this  substitution  of  wills, 
this  occupation  of  the  other’s  command  center  by 


force  (or  ruse),  is  the  annihilation  of  the  condition 
for  agency,  autonomy,  self-respect,  and  dignity. 
Domination  is  usually  maintained  by  force,  but  it 
may  be  worse  when  the  subject  endorses  the  situa¬ 
tion  in  “voluntary  serfdom”  as  Etienne  de  La 
Boetie  put  it.  The  situation  admits  of  degrees,  how¬ 
ever,  depending  on  possibilities  and  costs  of  avoid¬ 
ing  the  domination.  Slavery  is  one  extreme,  and 
there  are  many  forms  of  it.  Avoiding  subjection  is 
sometimes  prevented  by  a  status  of  lower  caste  one 
is  born  in.  Serfdom  of  diverse  types  also  exists,  as 
do  lifetime  servants  of  the  same  master.  Domination 
sometimes  masquerades  as  free  exchange,  which  is 
fictitious  when  the  alternative  is  starvation  or  the 
lack  of  satisfaction  of  some  essential  need.  The 
wage  relationship  differs  from  an  exchange  of  ser¬ 
vices  by  its  being  subjection  to  the  boss’s  orders 
within  some  limits,  and  the  wage  earner  may  have 
no  real  alternative  or,  perhaps,  has  the  only  choice 
to  replace  one  boss  by  another.  This  limited  pos¬ 
sibility  to  leave  the  relation  also  results  in  a  low 
wage,  hence  inequality  in  this  respect  too,  and  situ¬ 
ations  of  unequal  exchange  and  exploitation. 
Intrafamily  domination  and  emancipation  toward 
equal  status,  power,  and  rights  and  duties  are 
major  problems  of  humankind.  The  domination 
can  also  be  group-wise,  as  with  colonial  situations, 
and  equal  status  obtained  by  independence  or  lib¬ 
eration.  All  this  covers,  of  course,  a  large  variety  of 
situations  according  to  cases,  places,  and  historical 
periods. 

The  absence  of  subjection,  or  of  strong  forms  of 
it,  is  jointly  an  equality  in  itself,  relational,  and,  if 
all  members  of  a  group  (or  of  mankind)  have  to  be 
free  from  the  corresponding  domination,  an  equal¬ 
ity  of  liberty  and  an  equality  as  generality  (or 
universality). 

Equality  From  Logic 

Overview 

The  basic  property  of  equal  treatment  of  equals 
in  the  relevant  characteristics  results  from  logic  for 
two  different  reasons.  In  one,  equality  as  rational¬ 
ity,  it  results  from  rationality  in  the  relevant  and 
most  common  sense  of  providing  a  reason — justi¬ 
fying.  This  holds  whatever  the  reason,  and  even 
from  simply  being  favorable  to  provide  a  reason 
since  the  equal  treatment  of  equals  turns  out  to  be 
a  necessary  condition  for  all  reasons.  The  second 
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way  in  which  logic  requires  equal  treatment  of 
equals  is  the  property — shortly  explained — of 
“permutable  treatment  of  equals”  plus  the  require¬ 
ment  of  full  determination  (uniqueness)  of  the 
result.  However,  this  equality  of  equals  is  some¬ 
times  an  inferior  solution  and,  then,  “permutable 
treatment”  is  the  second-best  logically  egalitarian 
concept.  The  next  two  sections  discuss  these  top¬ 
ics.  Then,  the  famous  principle  of  “insufficient 
reason”  for  inequality  is  examined  and  shown  to 
be  either  fallacious,  tautological  in  two  possible 
ways,  or  any  of  the  two  above  reasons. 

The  relevant  characteristics  may  include,  nota¬ 
bly,  a  description  of  the  relations  to  possibilities. 
At  any  rate,  this  equality  can  be  prima  facie,  that 
is,  in  the  absence  of  an  overriding  reason  that  may 
be  impossibility  or  the  joint  relevance  of  some 
other  value  (which  may  be  the  ideal  equality  of 
something  else,  a  unanimous  benefit  from  leaving 
equality,  and  so  on). 

Equality  From  Rationality 

Equal  is  rational,  rational  is  equal.  Indeed, 
rational,  in  its  most  common  sense,  used  here, 
means  to  give  a  reason,  to  justify,  or  to  begin  to  do 
it  or  at  least  to  intend  to.  It  opposes  the  irrational, 
unjustified,  or  arbitrary.  Assume  individual  i 
receives  xt  of  the  relevant  item  of  any  nature 
(goods,  income,  wealth,  position,  right,  freedom, 
power,  respect,  honor,  reputation,  consideration, 
bundles  of  these,  interpersonally  comparable  level 
of  satisfaction,  etc.;  the  item  may  even  be  a  rule 
providing  something  to  an  individual  as  a  function 
of  some  facts  possibly  including  some  characteris¬ 
tics  of  hers,  and  the  equality  is  that  the  same  rule 
is  used  for  various  persons,  a  derived  rule-equality 
that  will  shortly  appear  to  be  the  very  form  of 
rationality  itself).  If  this  specific  x,  is  intrinsically 
justified,  given  a  reason  for,  this  reason  a  priori 
refers  to  a  number  of  relevant  characteristics  of 
individual  i,  of  any  nature.  The  set  of  these  rele¬ 
vant  characteristics  is  denoted  as  yt.  The  reason 
that  leads  to  choose  xt  because  of  y,  is  described  by 
a  function 

x,  =  r(y,).  (1) 

Note  that  we  write  equation  (1)  rather  than  x,  = 
rfy,)  with  a  function  rt  proper  to  individual  i 


because,  in  this  case,  the  reasons,  a  priori  proper  to 
individual  i,  that  leads  one  to  write  ri  should  be 
included  in  the  set  of  relevant  characteristics  yt  and 
the  function  takes  form  with  equation  (1). 
Moreover,  a  complete  social  choice  determines  a 
unique  x„  and  then  r  is  a  proper  function.  Then,  if 
another  person,  /,  has  an  identical  (equal)  set  of 
relevant  characteristics,  y.  =  yt,  relation  ( 1 )  implies 
that  she  receives  x.  =  xr  This  equality  is  derived 
from  the  simple  requirement  of  justifying,  giving  a 
reason,  that  is,  from  rationality  applied  to  this 
social  issue. 

Note  that  this  rationality  provides,  in  fact,  two 
(equivalent)  types  of  equality:  a  conditional  equal¬ 
ity,  xt  —  x.  if  y,  =  y,  and  a  functional  equality 
meaning  that  the  same  function  r  is  used  for  all 
individuals,  which  manifests  the  universality  of 
rationality  (giving  a  reason)  fully  applied.  The  for¬ 
mer  is  also  substitutability,  that  is,  if  another  indi¬ 
vidual  j  than  i,  for  which  y.  =  y;,  is  substituted  to 
individual  i,  then  x.  =  xr  The  latter  is  also  called 
rule-equality,  that  is,  the  same  rule  r,  rather  than 
specific  rules  r,  possibly  different  for  different  i, 
relates  y,  to  x(;  rationality  (in  this  most  common 
sense)  implies  rule-equality.  (The  converse  is  not 
true,  although  it  generally  holds;  most  rules  describe 
reasons;  logically,  however,  there  can  be  rules  not 
justifiable  from  a  “reason.”)  The  equality  x,  =  x. 
results  from  a  requirement  of  rationality  when  y .  =  y . 
(if  direct  comparisons  are  furthermore  introduced, 
function  r  may  also  depend  on  x-  for  j  A  i  for 
describing  these  comparisons;  then,  it  should  also 
depend  on  y.  ,  and  y.  =  y.  entails  the  comparison 
between  x,  and  x,  which  favors  x,  —  x,). 

In  this  pure  rational  equality,  there  is  no  direct 
comparison  between  xt  and  x;.  Sentiments  of  jus¬ 
tice  or  fairness  refer  in  particular  to  the  choice  of 
the  relevant  characteristics  yr  This  choice  implies 
the  answer  to  the  question  “equality  among 
whom?”  A  particular  form  of  characteristics  y,  is 
simply  “belonging  to  a  certain  set  of  individuals 
I”;  then  the  x(  of  all  these  individuals  should  be 
prima  facie  equal  (“equality  from  generality”). 

The  property  of  equal  x(  for  equal  y.  holds  irre¬ 
spective  of  the  specific  reason  r.  The  simple  fact  of 
giving  a  reason,  justifying,  suffices  for  this  result. 
This  is  common  grounds  of  all  reasons  and  a  nec¬ 
essary  property  for  the  existence  of  a  reason. 
Hence,  the  mere  a  priori  posture  or  intention  to 
provide  a  reason  whatever  it  is  suffices  for  the 
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result  uXj  =  xj  if  yi  =  yr  ”  This  is  strictly  minimal 
rationality.  A  reason  that  yields  unequal  results 
(Xj  A  x^  is  applied  to  different  sets  of  relevant 
characteristics  (y,  #  yt). 

A  remarkable  consequence  is  that  if  one  has  to 
share  something  perfectly  divisible  between  a  num¬ 
ber  of  persons  who  have  no  other  relevant  differ¬ 
ent  characteristic,  their  yt  is — or  amounts  to — 
belonging  to  this  group  and  hence  is  the  same  for 
all,  and  general  a  priori  rationality  (and  more 
generally  any  particular  rule  consistent  with  the 
constraint)  requires  equal  sharing.  No  reason  can 
give  another  choice:  any  other  choice  is  necessarily 
without  a  rule  and  hence  without  a  reason — that 
is,  irrational.  Equal  sharing  is  the  only  rational 
(and  rule-following)  solution  (a  unique  one  if  all 
the  good  is  distributed).  This  is,  of  course,  what  is 
usually  done.  An  example  can  be  drawing  lots 
between  these  persons:  rationality  requires  allocat¬ 
ing  equal  probabilities  to  them  (actually,  this  hap¬ 
pens  to  be  the  basic  Condorcet-Laplace  axiom  of 
the  theory  of  probabilities). 

Permutable  Treatment  of  Equals 

Denote  as  zt  —  (xp  y,)  the  pair  of  xt  and  y,. 
Choose  the  set  of  characteristics  yt  as  being  suffi¬ 
ciently  encompassing  for  z,  to  include  all  that 
concerns  person  i  for  the  judgment  under  consider¬ 
ation.  Then,  if  individual  i  is  attributed  zf  instead 
of  Zj  whereas  individual  j  is  attributed  z,  instead  of 
zp  the  two  social  states  are  not  relevantly  discern¬ 
ible  and  are  equivalent  for  this  evaluation.  Hence, 
any  permutation  of  the  z,  between  the  persons  i 
creates  equivalent  social  states.  Consider  now  that 
all  the  individuals  i  belonging  to  a  subset  I  have  the 
same  sets  of  characteristics  y,  =  y.  Then  permuting 
the  z,  —  (Xj,  y)  between  persons  i  of  I  is  identical  to 
the  same  permutation  of  the  xt  only  between  them. 
Hence,  these  permutations  of  the  x,  give  equivalent 
social  states.  This  is  permutable  treatment  of 
equals  in  the  relevant  characteristics.  Consider 
three  applications  of  this  property. 

Full  Determination 

If  some  of  these  x,  differ  from  one  another,  these 
permuted  social  states  are  not  all  identical  since  at 
least  one  individual  has  different  x.  in  some  of 
these  states.  However,  a  virtue  of  a  principle  of 


social  choice  is  that  it  be  complete,  providing  full 
determination,  that  is,  it  designates  one  of  the 
alternative  social  states  only  rather  than  several 
equivalent  ones.  Indeed,  notably,  action  and  imple¬ 
mentation  are  the  realization  of  one  of  these  mutu¬ 
ally  exclusive  possible  alternatives  only,  and  the 
principle  fully  plays  its  role  of  guiding  the  choice 
solely  if  it  has  this  property. 

Now  the  states  derived  by  the  permutations  of 
the  Xj  between  the  persons  i  of  I  (with  yt  —  y)  are 
one  and  the  same  state  if  and  only  if  all  these  x,  are 
the  same.  This  is  equal  treatment  of  equals  in  the 
relevant  characteristics.  Therefore,  permutable 
treatment  of  equals  plus  full  determination  implies 
equal  treatment  of  equals. 

Permutability  as  Second-Best  Equality 

However,  it  may  be  that,  actually,  some  unequal 
treatment  of  equals  is  better  than  equal  treatment 
of  equals.  For  instance,  some  collective  tasks  are 
better  performed  with  a  hierarchical  organization 
of  the  people,  even  if  they  a  priori  have  the  same 
capacities.  This  is  conspicuous  for  the  military 
defense  of  society,  but  it  is  also  the  case  for  many 
productive  or  administrative  tasks:  firms  and 
administrations  have  everywhere  a  hierarchical 
organization.  Then,  people  have  different  powers, 
which  usually  entails  different  statuses  (and 
unequal  pays).  Society  (and  all  its  members)  may 
also  benefit  from  a  differentiated  education  of 
people,  even  if  their  abilities  in  all  respects  are  a 
priori  identical.  Savings  provides  another  example. 
Aggregate  savings  become  investment  and  provide 
growth.  Since  people  usually  save  a  larger  fraction 
of  their  income  when  this  income  is  larger,  an 
unequal  distribution  of  income  provides  higher 
aggregate  savings  even  if  people  have  the  same 
propensity  to  save  (as  a  function  of  their  income), 
hence  a  priori  a  higher  growth  rate.  For  a  similar 
reason,  private  support  of  the  arts  benefits  from 
unequal  income  distributions  (with  rich  sponsors). 
There  may  also  simply  be  a  limited  number  of  non- 
divisible  consumption  goods  or  tools,  and  it  is  usu¬ 
ally  better  that  they  be  actually  distributed  and 
used  rather  than  not  using  them  at  all,  which  is  the 
feasible  equality.  In  all  such  cases,  unequal  treat¬ 
ment  of  a  priori  equals  is  generally  better  than 
possible  equal  treatment.  It  may  be  that  everybody 
benefits  from  it. 
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In  such  situations,  permutations  of  the  different 
x- — ranks,  education,  incomes,  or  items — between 
individuals  i  with  identical  relevant  characteristics 
yi  =  y  provide  social  states  that  cannot  be  judged 
otherwise  than  “equally  good”  from  an  external 
standpoint  although  they  are  not  so  for  each  indi¬ 
vidual.  This  permutable  treatment  of  equals  is  the 
“egalitarian”  property  of  such  cases.  The  property 
it  keeps  from  equal  treatment  of  equals  is  the  equal 
social  value — in  some  sense — of  permutations  of 
individuals’  allocations.  It  is  a  kind  of  second-best 
egalitarianism.  The  drawback  is  that  the  corre¬ 
sponding  social  choice  is  no  longer  fully  deter¬ 
mined  by  the  problem  alone,  since  one  of  the 
socially  equivalent  permuted  states  has  to  be  cho¬ 
sen.  A  strictly  egalitarian  desire  to  equalize  the 
individual  situations  leads  to  an  overall  worsening. 
Using  lotteries  or  rotation  are  classical  means  to 
face  such  situations  (both  were  used,  for  instance, 
by  the  Athenian  democracy  to  fill  official  posi¬ 
tions).  Lottery  provides  a  choice  with  the  possibil¬ 
ity  of  ex  ante  equality,  but  it  leaves  the  actual, 
ex  post,  inequality. 

The  Principle  of  Insufficient  Reason 

Answering  the  question  “Why  equality?”  by  the 
trivial  “Why  not?”  seems  hardly  serious.  However, 
“if  there  is  no  reason  for  inequality,  choose  equal¬ 
ity”  (or  “if  there  is  no  good,  valid,  or  sufficient 
reason  for  it”)  is  the  “reason”  for  equality  pro¬ 
posed  by  innumerable  people,  including  some  of 
the  best  minds  from  Aristotle  in  the  Nicomacbean 
Ethics,  Thomas  Hobbes  in  Leviathan,  John  Locke 
in  the  Second  Treatise  on  Government,  and 
Condorcet  in  his  1789  proposal  for  a  Declaration 
of  Rights,  to  many  contemporary  thinkers,  includ¬ 
ing  Rawls,  who  proposes  that  a  good  reason  would 
be  that  everybody  or  the  poorest  benefit  from 
inequality,  and  so  on.  Consider,  however,  the  fol¬ 
lowing  properties  of  this  most  famous  position  for 
equality. 

1.  Indeed,  if  there  is  no  reason,  or  good  or  suf¬ 
ficient  reason,  for  inequality,  what  else  can  one 
advocate  but  equality?  This  seems  to  be  a  tautol¬ 
ogy  about  providing  reasons.  Any  other  choice 
would  be  irrational  or  arbitrary. 

2.  However,  if  this  argument  in  favor  of  equal¬ 
ity  is  of  any  use,  this  implies  that  there  is  no  other 


sufficient  reason  for  equality  either.  Then,  consider 
any  state  with  inequality.  There  is  no  reason  for 
any  other  state,  with  equality  or  inequality. 
Therefore,  the  same  argument  leads  one  to  advo¬ 
cate  this  specific  unequal  state.  Finally,  this  argu¬ 
ment  leads  one  to  choose  any  state,  equal  or 
unequal.  This  apparent  tautology  is  in  fact  worse: 
a  fallacy. 

3.  The  same  reasoning  is  the  “principle  of  non- 
sufficient  reason,”  which  is  the  basis  of  the  axiom¬ 
atic  epistemic  foundation  of  the  theory  of 
probability,  introduced  by  Laplace  and  Condorcet: 
if  there  is  no  reason  for  an  event  to  be  more  likely 
than  another,  attribute  equal  probabilities  to  them. 
However,  the  principle  is,  in  this  context,  actually 
an  axiom.  This  suggests  that,  in  social  ethics,  this 
statement  could  just  express  a  “moral  taste,”  an  a 
priori  preference  for  equality.  What  it  adds  to  just 
expressing  this  is  open-mindedness:  if  there  is  a 
reason,  a  fortiori  a  good  or  valid  reason,  and 
unavoidably  a  sufficient  reason,  for  states  with 
inequality,  one  is  ready  to  abandon  this  preference. 
However,  equality  and  inequality  are  a  priori 
unevenly  treated:  a  reason  is  required  for  inequal¬ 
ity,  not  for  equality.  This  is  a  prima  facie  prefer¬ 
ence  for  equality.  But  not  a  justified  one,  so  far. 
Why  this  unequal  treatment  of  equality  and 
inequality,  this  asymmetrical  status  of  symmetry 
and  asymmetry? 

4.  However,  preferences  do  or  may  also  inter¬ 
vene  for  deciding  what  counts  as  a  good,  valid, 
acceptable,  and  in  the  end  sufficient  reason  for 
inequality,  that  is,  one  that  can  override  the  choice 
of  equality.  Then,  the  statement  just  is:  “I  choose 
equality  if  I  do  not  prefer  something  else  to  equal¬ 
ity.”  However,  this  can  mean  two  things,  depend¬ 
ing  on  whether  preference  alone  is  considered  or 
the  necessity  of  choosing  also  is.  First,  we  have 
pointed  out  that  this  choice  of  equality  is  to  be 
seen  as  resulting  from  a  preference.  Hence,  the 
statement  just  says:  “I  prefer  equality  if  I  do  not 
prefer  something  else  to  equality.  ”  This  is  a  strange 
preference  structure  that  omits  indifference. 
Second,  in  fact,  a  choice  has  to  be  made  between 
mutually  exclusive  alternatives.  Then,  the  state¬ 
ment  becomes:  “I  prefer  to  choose  equality  rather 
than  anything  else  if  I  do  not  prefer  to  choose  any¬ 
thing  else  rather  than  equality,”  which  now  is  a 
tautology. 
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5.  Nevertheless,  the  absence  of  reason  for 
inequality  may  also  mean  two  other  things.  One  is 
that  no  imaginable  reason  for  an  overall  allocation 
gives  inequality.  This  certainly  implies  that  all  pos¬ 
sibly  relevant  characteristics  y- — which  could  a 
priori  be  used  for  such  a  reason — are  identical  for 
all  i.  Then,  for  any  reason  r,  the  definite  x,  =  r(yt) 
are  also  identical  for  all  i.  This  simply  is  the  general 
a  priori  equal  treatment  of  equals. 

6.  In  particular,  we  may  have  to  choose  the  allo¬ 
cations  x,  to  the  individuals  i  who  belong  to  a  cer¬ 
tain  set  I,  while  we  have  no  (other)  reason  for  this 
choice.  Hence,  the  relevant  characteristic  of  these 
individuals  is  only  that  they  belong  to  the  set  I. 
This  is  yt  for  all  these  i.  Hence,  these  yj  are  identi¬ 
cal.  Then  any  reason  based  on  them  gives  identical 
x,  for  all  i.  Note  that,  here,  there  is  no  a  priori  other 
reason  either  for  equality  or  for  inequality. 

7.  A  different  type  of  reason  can  justify  the 
principle.  With  sufficient  sets  of  characteristics  yp 
permutations  of  the  individuals’  pairs  zt  —  (x(,  y;) 
among  the  individuals  are  not  discernible.  Then,  if 
all  these  y(  are  equal  (perhaps  just  for  i  belonging 
to  the  set  I),  this  permutation  is  identical  to  a  per¬ 
mutation  of  the  individual  allocations  x;  only 
among  the  individuals.  These  permutations  are 
therefore  equivalent  for  any  evaluation  of  the 
social  state.  If  one  is  a  solution,  so  are  the  others. 
Yet,  when  the  x,  are  not  all  equal,  some  of  these 
permuted  states  differ  from  one  another  since  at 
least  one  individual  receives  different  xr  However, 
a  complete  social  choice  consists  of  a  unique  solu¬ 
tion.  Then,  this  can  only  happen  if  the  x,  are  all 
equal.  Equality  results  from  permutable  treatment 
of  equals  and  the  requirement  of  full  determina¬ 
tion  of  the  choice.  Sharing  the  cake  between  two 
equal  individuals  in  proportions  (1/3,  2/3)  or  (2/3, 
1/3)  is  equivalent  in  moral  terms  although  it  is  not 
equivalent  for  each  individual.  For  the  proportion 
(1/2,  1/2)  only  this  multiplicity  is  avoided. 
Although  there  are  cases  in  which  equal  treatment 
of  equals  is  less  good  than  unequal  permutable 
allocations,  the  outcome  then  is  not  uniquely 
determined. 

Finally,  the  insufficient  reason  for  equality  is 
either  a  fallacy,  one  of  two  tautologies,  or  any  of 
the  two  basic  logical  requirements  of  prima  facie 
equality. 


Equalities  Determining 
the  Overall  Distribution 

The  Five  Alternative  Equalities 
of  Distributive  Justice 

Besides  the  equalities  protecting  against  force  in 
nonsubjection,  basic  rights,  and  democracy,  the 
most  important  role  of  equalities  may  concern  the 
overall  distribution  of  the  resources  of  society. 
Equalities  are  used  in  many  types  of  relations. 
Inspired  by  Max  Weber’s  remarks  about  people’s 
ethical  judgments,  Michael  Walzer  (1983)  argues 
that  this  is  how  it  should  be  with  equality  in  each 
of  a  variety  of  “spheres  of  justice.”  One  sphere, 
however,  is  much  more  important  than  others  in 
volume:  that  in  which  income  distribution  is  deter¬ 
mined  (especially  since  various  services  can  option¬ 
ally  be  bought  with  disposable  income — i.e.  put  in 
the  market  sphere).  This  overall  distribution  of  the 
resources  of  society  through  income  belongs  to  the 
domain  of  “macrojustice” — which  Rawls  calls 
“social  justice,”  but  the  common  use  of  this  term  is 
sometimes  more  extensive  (including,  for  instance, 
the  question  of  handicaps).  This  contrasts  with  the 
multifarious  issues  of  “microjustice”  specific  as 
regards  goods,  people,  or  circumstances,  and  with 
issues  of  “mesojustice”  concerned  with  specific 
goods  but  important  ones  that  concern  everybody 
(e.g.,  education  and  health). 

For  macrojustice,  five  polar  theories  of  the 
appropriate  distribution  are  classical  and  impor¬ 
tant  claims.  As  for  all  theories  of  justice,  they  are 
characterized  by  what  they  hold  should  be  equal. 
These  equalizands  are  characterized  by  two  aspects. 
One  is  their  substance  (material,  currency,  metric) 
such  as  income  or  resources,  welfare  as  happiness, 
or,  in  an  equality  from  generality,  self-ownership. 
The  second  aspect  is  their  structure,  as  with  an 
ideal  equality  in  individuals’  income,  resource 
endowment  or  welfare,  or  an  equal  weight  in  the 
highest  sum  of  welfare  (utilitarianism)  or  of 
incomes.  Figure  1  shows  this  overall  structure  of 
the  issues.  The  values  of  liberties,  responsibility, 
entitlement,  happiness,  needs,  deserts,  and  merits 
are  implicit,  as  shortly  seen. 

These  five  polar  equalities  of  social  justice  are 
very  different  in  nature.  The  most  tangible  of 
these  equalities  is  that  of  incomes.  Welfare  classi¬ 
cally  means,  in  this  context,  psychological  wel¬ 
fare,  for  instance  satisfaction  or  happiness. 
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Equal  self-ownership 


Equal  welfare 


Welfarism 


Equal  income 
Equal  resources 

Resourcism 


Utilitarianism  Highest  social  income 


egalitarian  aspect  of  utilitarianism  due  to  equal 
weight  is  classically  forcefully  (and  redundantly) 
emphasized  by  Jeremy  Bentham  and  quoted  by 
John  Stuart  Mill:  “each  is  to  count  for  one  and 
nobody  for  more  than  one”  ( Utilitarianism , 
Chapter  5).  It  is  the  basis  of  Richard  Hare’s  (1981) 
defense  of  this  philosophy  as  an  interpretation  of 
Kant’s  view  that  each  individual  should  be  given 
equal  consideration  (it  seems,  however,  that  the 
product  of  individual  utilities  would  not  give  them 
less  equal  consideration  than  their  sum — it  amounts 
to  comparing  relative  variations  in  utilities  rather 
than  their  absolute  variations). 


Figure  I  The  Topology  of  Equality 


Economists  represent  it  by  individuals’  utilities. 
Concepts  of  equality,  addition,  or  other  opera¬ 
tions  concerning  such  notions  are,  of  course, 
problematic,  but  classical  theories  consider  them 
and  this  may  more  or  less  provide  rough  guide¬ 
lines  for  policies.  Income  egalitarians  differ  from 
welfare  egalitarians  by  their  holding  that  individu¬ 
als  are  accountable  for  their  own  different  capaci¬ 
ties  to  enjoy  (utility  functions).  If,  in  addition, 
people  are  also  entitled  to  their  own  capacities  to 
produce  and  earn  (and  hence  to  their  resulting 
income),  the  result  is  equal  self-ownership.  It  suf¬ 
fices,  for  it,  to  say  that  each  individual  has  self¬ 
ownership — hence  it  is  also  an  equality  from 
redundancy,  generality,  or  universality. 

Equalities  in  welfare  or  income  from  which  it  is 
not  possible  that  everyone  has  more  (whereas 
income  transfers  are  a  priori  possible)  can  also  be 
described  as  maximizing  the  lowest  individual 
endowment  of  these  items,  or  “maximin.”  If 
equality  is  desired  because  individuals  who  have 
little  of  the  item  have  too  little,  and  if  some  situa¬ 
tion  with  inequality  can  give  more  to  everybody 
than  situations  with  equality,  equality  is  to  be 
replaced  by  maximin  (e.g.,  Rawls’s  [1971 J  “differ¬ 
ence  principle”  for  “primary  goods”  and  Kolm’s 
[1971 J  “practical  justice”  for  interpersonally  com¬ 
parable  welfare).  This  assumes  that  policy  can 
improve  the  lowest  endowment  without  excessive 
cost  (notably  in  terms  of  lowering  those  of  other 
people). 

Equality  in  weights  is  a  priori  anterior — more 
“upstream” — in  the  evaluation.  Nevertheless,  the 


The  Multiple  Equalities  of 
the  Overall  Distribution 

If  everybody,  which  includes  voters  and  offi¬ 
cials,  holds  that  some  social  principle  is  irrelevant 
for  a  problem,  this  principle  cannot  be  imple¬ 
mented  for  this  question  on  social  grounds.  Now 
people  actually  hold  that  the  comparison  of  indi¬ 
viduals’  capacities  to  enjoy  (hedonistic  capacities) 
or  their  variations,  and  of  their  tastes,  is  relevant 
for  allocative  choices  in  two  types  of  cases:  when 
they  refer  to  suffering  and  when  the  distribution 
is  between  people  who  sufficiently  know  one 
another  to  feel  empathy  towards  the  others. 
Allocations  in  a  hospital  or  in  a  family  are  typical 
cases.  If  national  fraternity  actually  ruled  the 
minds,  or  in  case  of  national  disasters  creating 
general  suffering,  the  principle  of  overall  national 
distribution  (for  instance  for  the  income  tax) 
would  be  in  the  welfarist  family.  In  the  other 
cases,  people’s  opinions  about  income  distribu¬ 
tion  are  instances  and  associations  of  the  other 
two  cases  only,  income  egalitarianism  on  the  one 
hand,  and  the  self-ownership  of  classical  liberal¬ 
ism  on  the  other.  The  resulting  social  and  politi¬ 
cal  synthesis  or  compromise  is  a  mix  of  these  two 
values. 

The  normal  way  of  representing  the  resulting 
incomes  is  that  they  are  the  sum  of  two  parts,  an 
egalitarian  income  and  a  classical  liberal  one.  For 
clarity,  denote  as  i  one  of  the  n  individuals,  yi  her 
income,  t,  her  labor,  wt  her  wage  rate.  Her  earned 
income  is  The  average  wage  is  w .  The  egali¬ 
tarian  income  is  the  equal  sharing  of  individuals’ 
earnings  during  an  equal  labor  k,  kwi  for  individ¬ 
ual  i.  This  egalitarian  income  is  ku).  Above  that, 
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however,  individuals  are  free  to  work  l,  and  keep 
their  earning  from  the  extra  labor,  (/:,  —  k )  w, . 
Their  disposable  income  is 

Ji  —  kw  +  (f;  —  k)W;. 

For  individuals  participating  in  this  redistribution, 
the  equalization  labor  should  be  such  that  k  ^ 
because  people  do  not  accept  taxing  leisure  k  —  i, 
at  the  value  of  labor  (if  w,  >  tv),  and  providing  a 
wage  supplement  (of  (w  —  wf)  if  tv,  <w)  to  hours 
k  —  which  provide  no  wage  seems  absurd. 

This  redistribution  is  egalitarian  in  various 
respects.  On  tangible  grounds,  it  is  the  more  egali¬ 
tarian  the  higher  the  equalization  labor  k  is  (it  is 
not  at  all  for  k  —  0,  the  pure  self-ownership  of 
classical  liberalism).  On  rational  grounds,  it  has  a 
number  of  remarkable  egalitarian  structures.  It 
transfers  from  each  equally  in  labor  (or  in  equal 
proportion  of  her  capacities),  and  to  each  equally 
in  income.  It  implies  an  equal  minimum  income 
kw.lt  amounts  to  each  receiving  an  equal  univer¬ 
sal  basic  income  kw  financed  by  an  equal  labor  k 
of  each  or  in  equal  proportion  k  of  each  capacity 
wt.  It  also  amounts  to  each  individual  equally 
yielding  to  each  other  the  product  w,  •  ( k/n )  of  an 
equal  labor  k/n  in  a  kind  of  general  equal  labor 
reciprocity.  It  is  also  equal  free  exchange  (of  labor) 
from  an  equal  allocation  (kw  in  income  and  k  in 
labor  or  the  complementary  leisure).  The  two  parts 
of  income  are  equality  according  to  deserts  (equal 
income  kw  for  the  equal  labor  k)  and  to  merit  (i.e. 
including  the  effects  of  personal  capacities  w,  for 
labor  f,  ~  k),  respectively.  Individual  /’ s  leisure  is 
A,  =  1  —  lt  if  total  time  is  measured  as  1 .  Individual 
? s  total  income ,  including  the  value  of  her  leisure, 

Wj A,,  is  Yj  =  y i  +  w, A,  =  w,  +  k(w  —  wf)  —  kw 
+  (I  -  k)Wj.  But  Pj  —  kw  +  (1  —  k)wj  is  the 
value  of  a  price  index  for  the  two  prices  of  income 
or  consumption  goods,  1,  and  leisure,  wt  for  indi¬ 
vidual  i,  with  respective  weights  kw  and  1  —  k. 
Hence,  Y  IP,  is  the  same  for  all  i.  But  Y  IP,  is 
by  definition  the  corresponding  “real  income”  or 
“purchasing  power”  of  individual  i.  Therefore,  the 
distribution  in  question  also  amounts  to  equal  pur¬ 
chasing  power ,  which  is  a  kind  of  equal  real  liberty, 
for  individuals’  choices  of  income  (or  consumption) 
and  leisure  (or  labor).  This  equal  liberty  is  not  an 
identity  of  domains  of  choice  (with  different  wf). 


Note  that  basing  a  tax  on  the  wage  rate  can  be 
done  as  in  the  present  French  tax  law,  by  exempt¬ 
ing  overtime  labor  earnings  from  the  income  tax, 
over  a  low  benchmark.  There  is  de  facto  no  cheat¬ 
ing  (because  it  would  be  too  complicated  to  hide 
this  basis  from  the  possible  controls).  The  full 
theory  adds  other  dimensions  of  labor  than  dura¬ 
tion,  notably  formation. 

Equality  of  Liberty 

Basic  Rights  or  Equal  Negative, 

Protective,  or  Civic  Liberty 

The  use  or  threat  of  force  may  be  steady  or  occa¬ 
sional.  A  person  may  incur  it  from  others  as  indi¬ 
viduals,  in  groups,  or  through  institutions.  The 
absence  of  such  forceful  interference  defines  a  free¬ 
dom  called  social,  protective,  negative  (a  term  used 
by  Kant,  Mill,  and  Isaiah  Berlin),  or  civic  (Mill).  Its 
application  to  various  specific  issues  constitutes  the 
basic  rights  or  basic  liberties.  With  this  freedom,  a 
forceful  constraint  on  someone  can  only  implement 
a  previous  acceptation  of  it  and  notably  a  previous 
agreement  (possibly  an  implicit  one)  of  this  person. 
This  absence  of  force  in  inter-individual  relations  is 
an  equality,  and  a  general  basic  demand  of  modern 
society  is  that  all  individuals  benefit  equally  from 
such  liberty  (equality  as  generality  or  universality). 
This  demand  is  even  that  this  liberty  has  priority 
(this  is  the  first  and  main  statement  of  the  corre¬ 
sponding  constitutions). 

Is  this  general  equal  liberty  with  priority  possi¬ 
ble,  however?  This  raises  an  essential  conceptual 
issue  with  important  consequences.  The  text  of  the 
French  Declaration  of  the  Rights  of  Man,  and 
many  thinkers  such  as  Rousseau,  Condorcet,  Mill, 
and  Rawls  have  held  that  these  basic  liberties  or 
rights  should  be,  with  priority,  “equal  for  all  and, 
then,  maximal”  (Rawls  even  admits  inequality  if  it 
permits  each  of  the  least  endowed  to  have  more,  as 
he  does  about  “primary  goods”).  However,  they 
consider  jointly  these  rights  and  liberties  plus  some 
means  to  make  them  actual  possibilities  or  “real” 
(as  Marx  puts  it).  They  want  to  face  Anatole 
France’s  objection  that:  “rich  and  poor  people 
alike  have  an  equal  right  to  sleep  under  the 
bridges”  (the  traditional  shelter  of  Paris  tramps). 
However,  there  is  no  a  priori  limit  to  these  means 
(to  the  number  of  private  planes  and  airports  for 
freedom  to  move,  the  size  of  private  cathedrals  for 
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freedom  of  worship,  the  privately  owned  media  for 
freedom  of  expression).  Then,  this  principle  uses  all 
the  resources  of  society  without  even  a  principle  for 
choosing  between  these  various  real  liberties.  The 
solution  of  defining  some  amount  of  means  for 
each  right  is  a  priori  arbitrary.  Moreover,  any 
equality  of  these  means  for  all  would  be  found 
worse  by  everybody  than  some  other,  unequal  solu¬ 
tion  because  people  make  different  uses  of  these 
rights  and  have  different  preferences  about  them. 

The  rational  solution  consists  in  distinguishing 
the  formal  rights  from  the  means  of  benefiting 
from  them,  and  in  putting  the  question  of  the 
means  in  that  of  the  general  distribution  and  of  the 
free  exchanges  of  goods  resulting  from  it  (with  the 
possibility  of  some  minimum  income).  Then,  when 
actions  or  intentions  of  different  individuals  oppose 
one  another  and  cannot  be  implemented  jointly, 
this  opposition  can  be  attributed  to  the  means  and 
is  solved  by  the  property  rights  concerning  them 
(for  instance,  the  occupation  of  the  same  place  at 
some  time)  and  not  to  the  “formal”  rights  in  them¬ 
selves.  Then  these  rights  are  nonrival  between 
themselves  and  can  be  equally  held  in  full  and  used 
at  satiety  by  everybody. 

Equal  Real  Liberty 

The  next  issue  about  liberty  concerns  people’s 
means  of  free  action,  which  was  the  topic  of  the 
section  on  the  overall  distribution.  The  necessary 
distinction  between  general  “macrojustice”  and 
more  specific  issues  of  “microjustice”  and  “meso- 
justice”  is  explained  there.  The  basic  liberties 
imply  equal  freedom  of  exchange,  given  the  over¬ 
all  income  allocation.  The  theory  of  macrojustice 
obtains  a  structure  of  distribution  that  can  be 
defined  in  various  ways  as  equal  liberty  (although 
with  different  domains  of  choice  as  a  result  of  the 
different  earning  and  productive  capacities  of  the 
individuals — see  Kolm,  2004,  2008a,  2010). 

Equality  of  Opportunity 

Equality  of  opportunity  describes  a  set  of  cases  of 
equality  of  liberty  to  be  found  in  various  issues  of 
justice  at  all  levels.  A  priori  it  means  the  identity,  for 
various  agents,  of  a  set  of  alternatives  among  which 
each  can  choose.  In  the  standard  and  most  common 
meaning,  this  refers  to  social  conditions  of  access  to 


certain  benefits,  positions,  situations,  jobs  (for 
instance,  with  regard  to  various  types  of  discrimina¬ 
tion  or  family  influence),  or  possibilities  (such  as 
access  to  various  types  of  education).  The  concept 
has  been  extended  to  transform  these  formal  free¬ 
doms  into  more  “real”  ones,  and  these  direct  choices 
into  their  outcomes,  by  adding  the  effects  of  per¬ 
sonal  capacities  and  social  settings  and  thus  consid¬ 
ering  opportunities  for  income,  achieved  level  of 
education,  or  the  actual  performance  permitted  by 
jobs  or  positions  (perhaps  for  given  levels  of  effort). 
One  could  also  consider  equality  of  opportunity  for 
welfare  as  happiness  or  satisfaction  by  adding  the 
consideration  of  capacities  to  enjoy  or  be  satisfied. 
Equality  of  opportunity  thus  describes  cases  that  are 
different  and  sometimes  opposed.  This  explains  why 
politicians  of  all  kinds  love  the  concept  whereas 
practically  all  philosophers  criticize  it  severely. 

The  initial  motives  for  equality  of  opportunity 
arose  or  arise  from  two  different  and  opposed  per¬ 
spectives,  one  for  realizing  an  equality  and  the  other 
for  criticizing  another  equality.  The  equality  to  be 
realized  is  that  of  some  possibility  of  choice,  as  the 
name  indicates  (for  example,  one  wants  some  peo¬ 
ple  to  have  access  to  certain  positions  without  dis¬ 
crimination  or  other  obstacle).  The  other  motive  is 
the  objection  to  policies  of  equalization  or  uni- 
formization,  for  different  people,  of  outcomes  due 
in  part  to  their  actions,  and  the  demand  to  replace 
this  equality  by  that  of  domains  of  choice  in  which 
these  agents  choose,  thus  replacing  an  equal  out¬ 
come  by  an  equal  liberty  to  pursue  this  end.  Since 
this  change  generally  leads  to  unequal  results  of 
actions,  this  stance  is  anti-egalitarian  in  this  sense. 
The  emphasis  is  often  not  only  on  the  comparison 
between  the  agents  but  also  on  some  competition 
between  them,  for  which  the  equality  of  opportu¬ 
nity  is  supposed  to  provide  fair  conditions.  This 
elicits  the  classical  leftist  judgment,  preferring  not 
only  the  “actual”  outcome  equality  but  also  or 
mainly  the  possibly  convivial  relationships  jeopar¬ 
dized  by  the  competition,  as  expressed  in  the  state¬ 
ment  that  “equality  of  opportunity  is  good  for 
horse  races  but  not  for  humans.”  However,  equal¬ 
ity  of  opportunity  includes  both  liberty  and  equality 
in  possibilities;  the  absence  of  such  equality  includes 
cases  that  are  generally  considered  to  constitute  the 
most  viciously  unjust  features  of  societies. 

The  basic  feature  and  difference  between  the 
cases  are  in  the  definition  of  equal  versus  unequal 
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opportunities.  The  main  one  refers  to  discrimina¬ 
tion  that  limits  choices  on  the  basis  of  such  criteria 
as  race/ethnicity,  family,  caste,  feudal-like  order, 
gender,  and  religion.  Apart  from  formal  discrimina¬ 
tion,  the  principle  often  refers  to  advantages  pro¬ 
vided  by  family  relations,  including  favoritism, 
nepotism,  social  networks,  information,  direct  sup¬ 
port,  and  the  role  of  families  in  education  at  home 
or  school.  A  basic  issue  is  whether  personal  capaci¬ 
ties,  innate  or  due  to  family  influence,  notably  in 
childhood,  are  counted  among  the  sources  of  the 
opportunities  in  question.  The  famous  meritocratic 
slogan  “the  career  opened  to  talents”  refers  to  a 
situation  in  which  positions  are  allocated  according 
to  talents  alone,  banning  other  social  discrimina¬ 
tion.  Policies  described  as  “positive  discrimination” 
often  are  attempts  to  compensate  for  disadvantages 
resulting  from  social  setting  and  family  influence. 

The  simple  slogan  “equality  of  opportunity” 
thus  covers  a  number  of  cases  that  are  quite  differ¬ 
ent  and  can  belong  to  opposing  ethical  positions. 
Formally,  there  are  several  types  of  equality  of 
opportunity. 

Negative  equal  opportunity  in  action  bans  formal 
social  discrimination  of  all  kinds. 

Positive  equal  opportunity  in  action  helps  people 
who  cannot  perform  some  relevant  action  to 
actually  perform  it. 

Equal  opportunity  in  action  and  result  implies  that 
if  some  people  choose  to  perform  the  same  action, 
they  will  obtain  the  same  outcome,  possibly  with 
help  for  those  who  are  disadvantaged  as  a  result  of 
unfavorable  personal  capacities  or  circumstances 
(notably  social  environment). 

The  next  step  would  simply  be  equality  in  out¬ 
come,  which  is  not  equality  of  opportunity  from 
the  point  of  view  of  its  causes,  but  which  can  be 
equality  of  opportunity  for  the  further  use  of  the 
outcome  in  so  far  as  it  is  an  intermediate  prod¬ 
uct — such  as  wealth  or  education  or  health  as  used 
for  further  choices  and  actions. 

Equality  and  Responsibility:  Responsibility-Free 
Equality  and  Equal  Joint  Responsibility 

Responsibility  is  the  assignment  of  the  effects  of 
an  action  to  the  actor.  It  requires  freedom,  which 


conversely  implies  responsibility  if  this  issue  is 
raised.  The  allocative  principle  of  responsibility 
holds  that  this  should  determine  the  allocation  of 
benefits  and  costs  (and  not  only  blame  or  praise). 
This  principle  can  be  equally  applied  to  several  or 
all  actors.  If  this  is  the  only  principle,  facts  of  soci¬ 
ety  that  do  not  result  from  members’  actions  have 
no  particular  reason  for  their  allocation  and  there¬ 
fore  their  value  should  be  equally  shared  (from 
rationality).  However,  beneficial  or  detrimental 
aspects  of  society  are  generally  joint  products  of  the 
actions  of  several  people  and  of  given  facts.  Apart 
from  particular  structures  of  the  effects  of  the  acts 
(separability,  additivity,  symmetry),  there  is  no  a 
priori  solution  to  the  corresponding  allocation  of 
costs  or  benefits.  However,  when  several  agents’ 
acts  and  the  allocation  of  the  resulting  benefits  or 
liabilities  are  collectively  chosen  by  a  required 
unanimous  agreement  between  these  people,  each 
of  them  has  a  veto  power  on  the  realization;  there¬ 
fore  she  is  fully  responsible  for  it,  and  hence  in 
particular  for  her  own  act  and  her  benefit  or  liabil¬ 
ity.  When  there  is  some  impediment  to  this  general 
agreement  (in  the  nature  of  a  “transaction  cost”),  a 
solution  is  to  estimate  what  this  agreement  would 
have  been  were  this  impediment  absent  and  to 
impose  the  obtained  sharings  and  acts  (this  is  a 
“liberal  social  contract” — see  Kolm,  1985). 
Recently,  the  responsibility-free  equality  has  been 
emphasized  by  Gerald  Cohen  (1989),  and  closely 
analyzed  on  philosophical  grounds  by  Matt 
Matravers  (2007)  and  on  economic  grounds  by 
Marc  Fleurbaey  (2008).  A  particular  important 
application  on  a  delicate  point  differentiates  two 
main  theories  of  what  should  be  equal.  Rawls 
(1971)  and  Ronald  Dworkin  (1981)  hold  people 
fully  responsible  for  their  own  tastes,  preferences, 
capacities  to  enjoy  (and  hence  “utility  functions”), 
which  is  a  priori  rather  strange  but  can  be  defended 
by  considering  people’s  ends,  desires  or  values  as 
“ambitions”  or  “life  plans”  chosen  by  them.  In 
contrast,  the  ordinary  “welfarist”  theories  include 
compensations  derived  from  differences  between 
individuals’  hedonic  capacities. 

Fundamental  Insurance:  Equal 
Hypothetical  Liberty 

A  “fundamental  insurance”  is  a  hypothetical 
mutual  insurance  undertaken  by  people  against  the 
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risk  of  some  disadvantage  that  in  fact  they  already 
have.  This  may  be,  for  instance,  having  poor  health 
or  having  received  a  poor  education  because  of 
family  status.  This  theory  provides  a  rationale  for 
corresponding  compensating  transfers  that  mitigate 
the  inequality.  It  rests  on  a  putative  free  choice  (an 
exchange,  which  makes  it  be  a  kind  of  “social  con¬ 
tract”  in  the  technical  sense),  and  it  also  is  a  “par¬ 
tial  original  position”  with  a  “partial  veil  of  igno¬ 
rance.”  Its  assimilation  of  a  choice  concerning  jus¬ 
tice  to  a  choice  in  uncertainty  is  a  priori  problem¬ 
atic  but  is  to  be  accepted  if  this  is  general  opinion. 
This  is  the  case,  for  instance,  for  the  European 
system  of  public  health  insurance;  the  fact  that 
what  people  pay  does  not  depend  on  their  given 
propensities  to  be  sick  implies  a  “fundamental 
insurance”  of  these  handicaps.  This  particular  kind 
of  “liberal  social  contracts”  (the  impediment  to  the 
insurance  agreement  is  the  “arrow  of  time”)  has 
also  been  directly  suggested  by  Dworkin  (1981). 

Comparative  Egalitarianism 

Sentiments  favoring  equality  are  often  the  result  of 
direct,  intuition-like  comparative  judgments. 
However,  the  logical  “equal  treatment  of  equals  in 
the  relevant  characteristics”  certainly  lurks  behind 
such  emotions.  Nevertheless,  such  judgments  seem 
close  to  aesthetic  ones,  as  remarked  by  Kant  and 
confirmed  by  location  on  brain  imagery  ( fair 
comes  from  a  Germanic  word  referring  to  beauty, 
and  the  Greek  and  Latin  concepts  of  beauty,  kalon 
and  pulchrum ,  were  never  neatly  and  consistently 
distinguished  from  the  moral  good). 

When  the  individual  items  that  are  compared 
have  several  dimensions  about  which  the  individu¬ 
als  can  have  different  preferences  (this  is  the  case, 
for  instance,  for  bundles  of  commodities),  it  is  pos¬ 
sible  that  no  individual  prefers  any  other’s  “alloca¬ 
tion”  to  her  own  without  these  individual  alloca¬ 
tions  being  identical.  This  principle,  called  equity- 
no-envy ,  is  one  of  the  most  commonly  used  in 
egalitarian  studies  since  the  early  1970s.  Its  egali¬ 
tarian  properties  are  readily  seen.  If  there  is  one 
(desired)  dimension  only,  the  principle  implies 
equality.  If  the  individuals  have  identical  prefer¬ 
ences,  the  principle  implies  that  they  are  indifferent 
between  all  individual  allocations.  However,  the 
most  important  egalitarian  property  of  this  prin¬ 
ciple  is  that  it  amounts  to  an  equality  in  liberty.  It 


is  satisfied  if  and  only  if  there  exists  a  domain  of 
choice  such  that  each  individual’s  allocation  can  be 
chosen  by  this  individual  (with  her  given  prefer¬ 
ences)  in  a  domain  identical  to  it.  The  analysis  of 
this  principle  in  Kolm  (1971),  after  mentions  by 
Jan  Tinbergen  and  Duncan  Foley,  was  followed  by 
a  large  number  of  applications  and  variants 
reviewed  by  William  Thomson  (2008).  It  cannot 
be  called  “no  envy”  by  itself  because  the  sentiment 
of  envy  arises  from  the  joint  presence  of  the  other’s 
and  one’s  own  allocation  in  one’s  “utility  func¬ 
tion,”  but  it  is  formally  related  to  structural  prop¬ 
erties  of  a  genuine  theory  of  envy  (Kolm,  1995). 

Political  Equality:  Democracy 

One  of  the  most  important  applications  of  equality 
is  to  politics,  in  the  realm  of  democracy.  The 
Athenian  four  equalities  of  democracy  still  provide 
the  basic  framework: 

equality  in  the  eyes  of  the  law,  or  isonomia; 

equality  in  voting,  “one  man  one  vote,”  or 
democracy  stricto  sensu; 

equality  in  public  expression  for  influencing  others, 
or  isegoria,  applied  in  Athens  as  a  right  to  equal 
time  of  speech  in  the  assembly  of  citizen; 

equality  in  the  access  to  official  positions, 
implemented  there  by  drawing  lots  or  by  rotation. 

This  was  for  a  middle-sized  society,  with  officials 
but  a  priori  the  possibility  of  mutual  influence 
between  citizens.  Women,  slaves,  and  foreigners 
were  excluded,  and  official  positions  soon  became 
the  privilege  of  members  of  influent  families. 

Later  democracies  had  a  variety  of  restrictions  to 
voting  rights  or  access  to  positions.  In  present-day 
mass  societies,  the  most  violated  democratic  equal¬ 
ity  is  isegoria,  since  the  flow  of  public  messages  is 
that  of  the  mass  media  in  which  a  tiny  aristocracy 
of  journalists,  media  owners,  and  politicians  send 
views,  values,  information  (and  entertainment)  to 
the  mass  of  the  people  who  are  gagged  in  this 
respect  (the  Internet  opens  new  possibilities  that 
already  has  had  some  important  political  effects, 
but  it  is  not  clear  whether  this  will  or  can  develop  at 
the  level  of  the  overall  problem).  Although  democ¬ 
racy  has  led  to  important  inequality-lowering 
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redistributive  transfers  in  many  countries,  it  also 
allows  corporate  and  other  interest  groups  to  buy 
favorable  laws  with  contributions  to  campaigns. 
The  various  specific  rules  of  particular  democratic 
systems  also  raise  innumerable  issues  with  respect 
to  equality.  In  the  end,  the  main  egalitarian  virtue  of 
democracy  is  as  a  barrage  against  dictatorship,  the 
harshest  inequality. 

Equality  and  Impartiality 

From  a  social  point  of  view,  humans  manage  to 
shelter  two  opposed  selves  “in  their  breast”  as 
Adam  Smith  puts  it.  Their  self-centered  and  partial 
self  favors  themselves  and  the  people  they  like  or 
have  particular  relations  with  only.  However,  they 
also  have  an  impartial  self  able  to  take  an  objective 
view  putting  everybody — themselves  included — on 
the  same  footing:  This  is  what  Thomas  Nagel 
(1986,  1991)  calls  “the  view  from  nowhere” — but 
is  there  such  a  place  as  nowhere? 

A  priori,  the  distribution  most  favorable  to 
someone’s  strict  interest  is  without  equality.  More¬ 
over,  among  situations  with  equality  of  some  kind, 
one  (in  general)  is  most  favorable  to  this  person’s 
interest  than  the  others.  For  instance  it  is  equal 
incomes  (barring  incentive  effects)  for  people  with 
low  earning  power,  and  equal  self-ownership  in  the 
free  market  for  people  with  high  earning  capacities. 
The  person’s  impartial  self,  however,  will  generally 
make  some  other  choice  with  some  sort  of  equality. 
But  will  this  latter  choice  be  the  same  for  every¬ 
body?  This  is  often  believed,  for  instance,  by  Adam 
Smith;  by  Mill,  who  takes  the  equality  to  be  that  of 
the  weights  in  an  egalitarian  sum  of  utilities;  and 
probably  by  Thomas  Nagel.  However,  the  a  priori 
only  logical  structural  requirement  of  an  impartial 
judgment  is  that  it  respects  equality  of  some  sort. 
Hence,  there  is  a  priori  a  very  large  choice  for  such 
a  judgment.  Moreover,  the  impartial  individual 
evaluation  uses  some  psychological  characteristics 
of  this  person  that  a  priori  differ  from  one  person  to 
the  other.  Therefore,  there  tends  a  priori  to  be  dif¬ 
ferent  impartial  evaluations  (hence  with  different 
equalities)  for  the  various  individuals — other  phe¬ 
nomena  such  as  a  common  moral  culture  or  mutual 
influence  through  dialog  can  change  this. 

However,  since  an  individual’s  impartial  judg¬ 
ment  of  society  does  not  depend  on  her  own  specific 
interests,  she  produces  such  a  judgment  if  she  is 


asked  to  judge  when  she  does  not  know  which  spe¬ 
cific  person  she  is,  which  specific  situation  is  hers. 
Each  individual  may  still  be  purely  self-interested, 
but  her  ignorance  prevents  her  from  favoring  her 
own  actual  self-interest.  On  can  think  of  this  as  a 
time  sequence,  with  evaluations  “before”  the  indi¬ 
viduals  know  which  specific  person  they  will  be, 
before  the  actual  individual  situations — with  the 
specific  interests  attached  to  them — are  assigned. 
Rawls  (1971)  calls  this  state  the  “original  position,” 
in  which  individuals  evaluate  “behind  the  veil  of 
ignorance,”  and  this  way  of  thinking  is  the  most 
famous  modern  theory  of  impartiality.  A  first  result 
appears  straightforwardly:  Since  the  individuals  in 
the  original  position  face  the  same  (uncertain)  pros¬ 
pect,  their  evaluation  there  are  the  same:  They  make 
an  equal,  unanimous  “original  social  choice.” 
Rawls  then  assumes  that  some  possible  risks  faced 
by  these  individuals  are  so  severe  that  they  choose 
maximal  protection  thanks  to  three  principles  in 
order  of  priority:  (1)  the  basic  liberties  (equal  and 
maximal);  (2)  nondiscrimination;  (3)  the  “difference 
principle,”  that  is,  the  highest  possible  level  of  the 
lowest  individual  endowment  of  “primary  goods” 
(one  of  which  is  income  or  wealth),  which  is  not  an 
equality  because  incentive  and  disincentive  effects 
permit  unequal  allocations  to  induce  a  higher  pro¬ 
duction,  hence  to  provide  more  to  the  least  endowed. 
This  implies,  however,  that  the  individuals  in  the 
original  position,  who  are  assumed  to  make  a  social 
contract  for  the  adoption  of  the  “principles  of  jus¬ 
tice,”  do  not  also  agree  not  to  follow  their  self- 
interested  impulses  when  working  in  real  life. 

Rawls  built  this  theory  in  order  to  provide  an 
alternative  to  utilitarianism,  then  the  prevailing 
view  in  English-language  philosophy.  However, 
John  Harsanyi  (1976)  produced  another  theory  of 
the  original  position,  which  leads  to  a  utilitarian¬ 
like  structure!  He  finds  that  the  individuals  in  the 
original  position  want  to  maximize  a  sum  of  indi¬ 
viduals’  utilities  obtained  from  the  mathematical 
expectation  of  utility  with  equal  probabilities  of 
being  all  the  actual  individuals.  Yet  the  various 
individuals,  in  the  original  position,  should  in  fact 
have  different  maximands  because  they  a  priori  dif¬ 
fer  with  respect  to  ( 1 )  their  preferences  for  being  the 
actual  individuals  (some  prefer  to  be  rich,  others 
prefer  to  be  liked  or  famous,  still  others  prefer  to  be 
good),  and  (2)  their  risk-aversion.  For  both  theo¬ 
ries,  moreover,  the  reduction  of  a  choice  of  justice 
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or  social  optimality  to  a  self-interested  choice  in 
uncertainty  is  problematic:  the  former  is  accounta¬ 
ble  to  society  and  ethics,  as  the  second  is  not.  The 
individuals  may  also  be  risk-seekers,  which  leads  to 
the  most  unegalitarian  result.  The  theory  of  “moral 
time-sharing,”  which  asks  the  individuals  to  con¬ 
sider  that  they  are  the  various  actual  individuals 
successively  in  time  is  another  impartiality  theory, 
which,  however,  raises  similar  problems. 

Equality  as  Lower  Inequality 

Philosophers  discuss  equality,  but  since  large  equal¬ 
ities  do  not  exist  in  real  life,  sociologists  study  ine¬ 
quality  (Melvin  Tumin,  1967,  provides  a  good 
overview)  and  economists  compare  and  measure 
inequalities.  This  comparison  and  measure  of  ine¬ 
quality  has  developed  into  a  very  large  field  of  stud¬ 
ies  since  the  mid-1960s.  Various  questions  are  ana¬ 
lyzed:  Does  income  inequality  increase  or  decrease 
when  all  incomes  vary  in  the  same  proportion  or  by 
the  same  amount?  Does  a  transfer  from  a  richer 
person  to  a  poorer  one  diminish  inequality  (it  aug¬ 
ments  the  inequalities  between  the  poorer  and  the 
equally  poor  and  still  poorer,  and  between  the 
richer  and  the  equally  rich  and  still  richer,  yet  it  may 
be  favored  on  the  grounds  of  welfare  if  the  poorer 
suffers  from  this  poverty)?  Are  the  relevant  inequal¬ 
ities  relative  or  absolute?  And  so  on  (Kolm,  1966). 
Multidimensional  inequalities  and  inequalities  in 
liberty  are  also  studied.  The  Handbook  of  Income 
Inequality  Measurement,  edited  by  Jacques  Silber 
(2000),  gathers  reflections  of  most  of  the  experts. 

Multidimensional  equalities,  that  is,  equalities 
in  each  of  several  goods,  are  in  general  such  that 
other,  unequal  distributions  are  preferred  by  eve¬ 
rybody,  because  people  a  priori  have  different 
tastes.  However,  among  allocations  of  these  goods 
that  are  not  so  dominated,  some  can  be  defined  as 
“more  equal”  than  the  others.  But  if  each  individ¬ 
ual  consumes  some  of  each  good,  these  solutions 
amount  to  equal  incomes  (Kolm,  1977,  1996b). 
One  famous  proposal  of  such  multidimensional 
equality  is  Sen’s  (1985)  for  individuals’  “capabili¬ 
ties”;  the  noted  result  applies  to  it. 

Positive  Relational  Equalities,  Reciprocity 

Equality  in  social  relations  does  not  solely  involve 
the  noted  absences  of  domination,  too  unequal 


distribution,  or  envy.  It  has  many  other  dimen¬ 
sions.  Equality  can  also  be  with  respect  to  status, 
respect,  and  consideration,  with,  notably,  mutual 
respect  and  consideration.  In  such  a  society,  people 
relate  to  one  another  on  an  equal  footing  and 
interact  with  others  as  alter  ego.  They  are  knights 
of  the  round  table  of  society.  Such  a  society  of 
equals  is  something  other  than  an  egalitarian  soci¬ 
ety,  although  it  certainly  limits  inequalities  of  vari¬ 
ous  types.  It  adds  a  requirement  of  liberty  in  the 
consideration  of  others,  which  situates  these  rela¬ 
tions  on  the  verge  of  fraternity.  Relations  between 
equals  are  described  in  particular  by  Marcel  Mauss 
(1924)  and  David  Miller  and  Wayne  Norman,  in 
Mason  (1998). 

These  positive  relational  equalities  can  be  sup¬ 
ported  by  a  basic  sociopsychological  property  of 
humans:  the  tendency  to  treat  others  and  relate  to 
them  as  they  treat  you  and  relate  to  you;  such 
reciprocity  is  a  relational  egalitarian  reaction  that 
is  a  main  cement  of  society  (Kolm  1984,  2008b). 
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Equality,  Political 


Political  equality  is  understood  as  a  condition  in 
which  all  citizens  have  equal  influence  on  the  col¬ 
lective  decision  making  of  a  political  community. 
Political  equality  can  never  be  fully  realized.  It 


stands  in  tension  with  social  and  economic  power 
as  well  as  cultural  hegemony,  but  countries  and 
communities  differ  greatly  in  the  degrees  and 
forms  of  political  inequality.  Democracy  puts  a 
premium  on  reducing  political  inequality,  and  for 
many  the  measure  of  political  equality  points  to  a 
critical  dimension  along  which  the  quality  of 
democracy  varies.  Yet  there  are  significant  contro¬ 
versies  on  whether  and  how  public  policy  should 
try  to  reduce  political  inequality.  Whatever  one’s 
normative  position,  it  is  important  to  recognize  the 
reality  of  major  disparities  in  the  power  resources 
of  “the  few”  and  “the  many.”  These  disparities 
explain  the  difficulties  of  coming  closer  to  political 
equality.  Sheer  numbers  aside,  collective  organiza¬ 
tion  is  the  main  form  of  power  available  to  the 
many,  counterbalancing  the  advantages  of  domi¬ 
nant  groups. 

During  most  of  recorded  human  history,  claim¬ 
ing  political  equality  as  an  ideal  was  an  incompre¬ 
hensible  notion,  except  for  a  few  who  retained  in 
subordinate  positions  sufficient  independence  of 
thought.  Political  equality  became  conceivable  as  a 
goal  with  the  rise  of  democracy.  Instigated  by 
excluded  groups  who  increasingly  could  make  their 
voice  heard,  even  rather  elementary  forms  of 
democracy  created  a  zone  of  relative  autonomy 
within  which  political  decisions  are  taken. 
Democratic  politics  must  to  some  extent  be  insu¬ 
lated  from  the  overall  structures  of  power  and  the 
system  of  social  inequality  as  a  whole  if  citizens  are 
to  be  able  to  exercise  democratic  freedoms. 
Conventionally  defined  by  freedoms  of  expression 
and  association,  regular  elections  with  comprehen¬ 
sive  suffrage,  and  the  government’s  responsibility  to 
those  elected  as  the  people’s  representatives,  a 
democratic  polity  gives  citizens  equal  political 
rights — to  vote,  to  equal  protection  of  political  free¬ 
doms,  and  to  equality  before  the  law.  Critics  have 
deprecated  the  conventional  definition  of  democ¬ 
racy  as  merely  “formal”  because  it  works  to  even 
out  advantages  only  within  a  circumscribed  politi¬ 
cal  sphere  while  overlooking  broader  and  subtler 
ways  in  which  social  inequality  shapes  politics.  Yet 
even  formal  democracy  opens  the  possibility  of 
political  contest  about  the  acceptable  degree  of 
political  inequality. 

Dominant  groups  can  use  their  social  and  eco¬ 
nomic  power  resources  in  the  political  sphere. 
Specifically,  they  can  use  their  influence  over 


Equality,  Political  819 


education,  cultural  productions,  and  mass  com¬ 
munications — their  cultural  hegemony — to  shape 
in  less  than  obvious  ways  the  views,  values,  and 
preferences  of  subordinate  groups.  If  these  effects 
of  social  and  economic  equality  are  not  substan¬ 
tially  contained,  political  equality  will  be 
extremely  limited. 

The  distance  between  the  ideal  and  the  reality  of 
democratic  equality  has  varied  greatly  over  time  and 
continues  to  vary  widely  from  one  country  to 
another.  In  each  instance,  this  equality  gap  repre¬ 
sents  a  compromise  between  dominant  groups  and 
the  many,  often  shaped  by  political  and  social  insti¬ 
tutions  of  long  standing.  The  underlying  factors 
include  the  power  balance  within  society,  the  rela¬ 
tions  between  the  state  and  civil  society,  interna¬ 
tional  power  constellations,  the  organization  and  the 
degree  of  cultural  autonomy  that  subordinate  groups 
enjoy,  and  the  extent  to  which  dominant  groups  see 
democratization  as  a  threat  to  their  interests. 

Competing  Concerns 

Advancing  political  equality  is  clearly  not  an  uncon¬ 
tested  goal.  It  is  at  odds  with  major  interests,  and  it 
may  involve  the  sacrifice  of  competing  values. 
Opposition  to  greater  political  equality,  whether 
based  on  interest  or  principle,  is  likely  to  vary 
depending  on  whether  the  topic  is  the  political  realm 
narrowly  conceived,  the  conversion  of  socioeco¬ 
nomic  conditions  to  political  advantage,  or  the  over¬ 
all  impact  of  social  inequality  on  the  political  sphere. 

Does  increasing  political  equality  constitute  an 
unqualified  good?  Or  does  it  involve  sacrificing 
other  important  values?  If  there  are  trade-offs,  how 
do  we  judge  different  outcomes?  These  are  norma¬ 
tive  issues,  but  their  discussion  often  intertwines 
the  empirical  with  normative  claims.  Thus,  it  seems 
reasonable  to  sketch  some  of  these  contested  issues 
and  to  indicate  alternative  views  of  them. 

Even  if  we  stay  purely  within  the  political  realm, 
there  are  two  major  objections  to  promoting  politi¬ 
cal  equality  as  much  as  possible.  The  first  is  that 
“the  many”  are  not  competent  to  choose  reasonable 
policies.  The  second  is  that  increasing  the  number  of 
participants  can  worsen  coordination  problems 
to  the  point  of  system  overload  and  ungovernability. 
The  conventional  form  of  democratic  rule — 
representative  democracy — answers  most  of  these 
objections  by  interposing  a  professional  state 


apparatus  (responsible  to  elected  officials) 
between  voters  and  policy  outcomes,  though  the 
issues  of  voter  competence  and  coordination 
problems  will  vary  widely  across  countries  and 
historical  situations. 

Both  the  objection  from  competence  and  the 
objection  from  numbers  raise  issues  of  reduced 
efficiency  and  its  costs.  These  seem  outweighed  in 
principle  by  three  considerations  of  a  different 
sort.  First,  “efficiency”  takes  on  substantive  mean¬ 
ing  only  in  light  of  the  aims  being  pursued,  and 
these  in  turn  depend  on  material  and  immaterial 
interests  on  which  people  are  often  divided.  Second, 
a  reasonable  principle  holds  that  all  members  of  a 
political  community  whose  interests  are  affected 
by  collective  decisions  should  have  a  say  in  them; 
affected  interests  trump  limited  competence. 
Finally,  political  equality  is  an  acknowledgment  of 
the  decisively  similar  dignity  of  all  citizens  as 
human  beings  who  are  entitled  to  the  rule  of  reci¬ 
procity.  In  practice,  broader  measures  aimed  at 
reducing  political  inequality — among  them  high- 
quality  general  education,  multiple  sources  of 
information  and  analysis,  and  trustworthy  organi¬ 
zations  to  represent  interests  that  are  otherwise  at 
a  disadvantage — may  in  effect  also  cut  these  costs 
in  efficiency. 

Fimiting  the  conversion  of  assets  from  other 
spheres  into  political  advantages  is  far  more  con¬ 
troversial.  And  objections  mount  even  further 
against  policy  proposals  to  level  differential  assets 
outside  the  sphere  of  politics  because  spillover 
effects  are  too  difficult  to  contain. 

There  is  first  the  issue  of  using  economic  assets 
in  electoral  campaigns.  In  Citizens  United  v. 
Federal  Election  Commission  (2010),  the  U.S. 
Supreme  Court  has  ruled,  much  to  the  outrage  of 
some,  that  spending  money  in  campaigns  for 
political  office  enjoys  the  same  protection  as  free 
speech.  Critics  of  the  decision  note  that  the  Court, 
in  effect,  dealt  with  economic  inequality  as  if  it 
were  irrelevant  to  the  integrity  of  the  democratic 
political  process.  While  there  may  be  other  ways  of 
evening  out  the  financial  aspects  of  electoral  com¬ 
petition,  none  is  likely  to  shut  off  the  flow  of 
money  into  politics  completely. 

Economic  inequality  has  consequences  for  polit¬ 
ical  equality  that  go  far  beyond  the  issues  of  a  level 
playing  field  in  electoral  campaigns.  The  wealthy 
can  disproportionately  influence  how  policies  are 
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made  and  implemented.  Monopolistic  and  oligop¬ 
olistic  market  power  can  easily  be  turned  into 
political  bargaining  advantage.  Different  levels  of 
government — from  local  to  national — are  depen¬ 
dent  on  investment  decisions  of  large  corporations. 
In  addition,  most  voters  in  industrial  and  knowl¬ 
edge-based  societies  depend  for  their  economic 
security  on  employment.  The  threat  of  unemploy¬ 
ment  is  taken  for  granted  in  capitalist  societies,  but 
it  has  a  major  impact  on  their  political  dynamics. 
All  advanced  capitalist  countries  have  legislation 
that  seeks  to  constrain  the  creation  and  use  of 
monopolistic  economic  power.  But  such  measures 
have  at  best  slowed  the  trend  toward  concentra¬ 
tion  of  corporate  ownership,  and  few  have  focused 
on  blunting  their  impact  on  politics.  Measures  to 
soften  the  impact  of  unemployment  enjoy  widely 
varying  support  across  countries. 

Attempts  to  go  further  in  limiting  the  conver¬ 
sion  of  economic  power  into  political  gain  are 
subject  to  fundamental  ideological  objections  as 
well  as  to  pragmatic  arguments  that  the  pursuit  of 
equality  will  harm  economic  growth.  While  the 
value  of  economic  freedom  for  entrepreneurs  and 
capital  owners  is  in  principle  hardly  a  match  for 
the  value  of  increased  political  equality,  claims 
that  measures  compromising  the  functioning  of 
markets  also  impair  growth  have  persuasive  power 
in  Western  countries.  The  more  comprehensive 
welfare  states  in  advanced  capitalist  countries 
insist,  however,  that  this  protection  of  the  market 
mechanism  has  to  be  combined  with  relatively 
generous  compensatory  policies,  which  make  the 
functioning  of  the  market  socially  sustainable. 

If  measures  to  limit  the  conversion  of  economic 
advantage  into  political  power  have  only  a  partial 
effect,  does  the  pursuit  of  political  equality  warrant 
policies  that  reduce  inequalities  of  income  and 
wealth  more  directly?  There  can  be  little  doubt  that 
differences  in  the  distribution  of  income  and  wealth 
across  countries  and  over  time  within  countries 
make  for  significant  variations  in  political  equality. 

Competing  value  claims  dominate  the  discussion 
of  policies  meant  to  change  the  distribution  of 
income  and  wealth.  It  is  on  the  issue  of  direct  polit¬ 
ical  action  to  reduce  income  and  wealth  differentials 
that  the  claims  of  an  inherent  contradiction  between 
equality  and  freedom  have  concentrated.  These 
claims  are  plausible  when  one  thinks  of  sudden 
policies  of  increased  taxation  and  expropriation 


that  are  aimed  at  leveling  incomes  and  wealth. 
Such  policies  would  be  fought  by  the  privileged 
with  all  means  at  their  disposal,  potentially  endan¬ 
gering  democratic  rule.  In  turn,  breaking  this 
opposition  could  indeed  lead  to  ruthless  dictator¬ 
ship.  Reduced  levels  of  economic  inequality  would 
then  coincide  with  the  destruction  of  democratic 
equality.  On  the  other  hand,  we  know  of  exam¬ 
ples — the  Scandinavian  countries  are  the  most 
prominent — where  popular  policies  pursued  over 
decades  have  resulted  in  significant  reductions  of 
after-tax  and  after-transfer  income  inequalities. 
Yet  political  freedom  remained  unimpaired,  and 
no  long-run  drop  in  economic  growth  has  accom¬ 
panied  these  policies. 

The  other  major  issues  beset  by  value  controver¬ 
sies  concern  cultural  hegemony.  There  is  little  ques¬ 
tion  that  relatively  small  elites  have  a  dispropor¬ 
tionate  influence  on  the  production  of  culture  as 
well  as  on  its  diffusion  through  education  and  the 
mass  media.  Yet  this  influence  is  in  most  advanced 
capitalist  countries  quite  heterogeneous  in  charac¬ 
ter.  It  is  pluralistic;  but  it  is  unequal  nevertheless. 
Cultural  hegemony,  however  manifold  and  com¬ 
plex,  creates  substantial  political  inequality. 

Policy  responses  to  issues  of  cultural  hegemony 
are  limited.  Apart  from  some  attempts  at  hindering 
if  not  obstructing  the  trend  toward  concentrated 
ownership  of  the  media  and  support  for  public 
education  and  public  broadcasting  networks,  the 
standards  of  news  journalism  and  of  academic 
freedom  and  analytic  universalism  constitute  an 
important  protection  against  direct  conversions  of 
wealth  and  political  influence  into  control  over 
news  and  research. 

At  the  same  time,  measures  that  would  go 
beyond  these  broadly  accepted  policies  of  insulating 
certain  cultural  institutions  and  spheres  against 
direct  partisan  influence  face  nearly  insuperable  dif¬ 
ficulties.  Most  people  who  care  about  undue  influ¬ 
ence  in  cultural  production  and  diffusion  are  part  of 
the  dominant,  though  internally  heterogeneous, 
mainstream.  Efforts  to  curtail  such  influence,  fur¬ 
thermore,  quickly  run  up  against  two  important 
and  commonly  accepted  principles — freedom  of 
expression  and  respect  for  the  autonomy  of  differ¬ 
ent  cultural  spheres  such  as  art,  news  reporting,  or 
academic  research.  Furthermore,  since  some  protec¬ 
tions  against  the  conversion  of  economic  and 
political  clout  into  cultural  influence  rely  precisely 
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on  these  principles,  policies  at  odds  with  them 
would  at  least  partly  be  counterproductive. 

Power  Resources  and  Political  Equality 

If  competing  values  argue  against  reining  in  pro¬ 
cesses  that  have  inegalitarian  implications,  the  effects 
of  these  processes  are  not  thereby  rendered  any  less 
real.  A  clearer  picture  of  the  chances  of  decreasing, 
maintaining,  or  increasing  political  inequality  can 
therefore  benefit  from  a  brief  overview  of  the  main 
power  resources  of  the  few  and  the  many. 

Coercive  power  is  a  fundamental  resource  that 
is  inherently  distributed  unequally.  If  the  state  suc¬ 
ceeds  in  monopolizing  coercive  power — as  most 
states  strive  to  do — democratic  equality  is  pro¬ 
tected  only  if  the  use  of  that  power  is  regulated  by 
law  and  if  equality  before  the  law  is  sufficiently 
realized  to  rule  out  differential  intimidation  for 
political  gain.  While  this  can  roughly  be  taken  for 
granted  in  most  rich  democracies  today,  pockets  of 
private  coercive  power  and  gross  imperfections  in 
equality  before  the  law  are  major  factors  subvert¬ 
ing  political  equality  in  a  number  of  countries  in 
the  developing  and  postcommunist  worlds. 

The  state  administrative  apparatus  represents 
another  power  resource  of  great  reach,  combining 
expertise,  tax-based  funding,  readymade  organi¬ 
zational  capacity,  and  ultimate  recourse  to  coer¬ 
cion.  The  administrative  organizations  of  modern 
states  preempt  a  good  deal  of  decision  making  and 
can  create  substantial  imbalances  in  political 
equality.  Supranational  administrative  bodies  such 
as  the  European  Union  or  the  World  Trade 
Organization  wield  great  power  unchecked  by 
democratic  controls. 

Economic  advantage  is  a  major  obstacle  stand¬ 
ing  in  the  way  of  realistic  political  equality.  Capital 
ownership,  as  already  indicated,  carries  great 
power,  the  more  so  the  greater  the  mobility  of 
capital.  Again,  transnational  corporations,  much 
as  transnational  public  agencies,  wield  power  that 
is  unconstrained  by  democratic  controls,  except  in 
conditions  of  severe  crisis. 

Income  differences  are  less  concentrated  than 
capital  ownership,  but  they  vary  greatly  from  one 
country  to  another  as  well  as  across  time  periods. 
The  two  ends  of  a  distribution  are  most  significant 
for  democratic  equality.  The  poor  never  attain 
influence  commensurate  with  their  numbers,  and 


the  very  rich  nearly  inevitably  have  disproportion¬ 
ate  political  clout. 

Social  status  represents  more  than  a  point  on  a 
scale  of  esteem;  it  involves  social  attachments  and 
aversions.  It  shapes  interaction  patterns,  offering 
entrance  to,  and  imposing  exclusion  from,  different 
social  circles.  Politically,  it  defines  the  chance  to  be 
heard  and  to  be  trusted;  it  increases  or  diminishes 
one’s  political  “voice.”  The  importance  of  status 
differentials  varies  considerably  across  countries, 
but  status  hierarchies  tend  to  persist  over  time. 
However,  some  major  correlates  of  status,  includ¬ 
ing  economic  position  and  level  of  education,  are 
not  policy  independent.  Broadening  access  to 
advanced  education  and  flattening  the  distribution 
of  posttax  and  posttransfer  incomes  reduce  even 
persistent  differences  grounded  in  ethnic,  racial, 
and  gender  status. 

The  unequal  influence  on  the  production  and 
diffusion  of  culture  exerted  by  varied  but  small 
cultural,  religious,  and  economic  elites  constitutes, 
as  noted,  a  major  problem  for  political  equality. 
Cultural  hegemony  represents  inequality  of  a  pecu¬ 
liar  kind:  Its  effects  remain  largely  invisible  to 
those  whose  views,  values,  and  preferences  it 
shapes,  and  they  are  taken  for  granted  by  those 
whose  material  and  immaterial  interests  find 
expression  under  the  hegemonic  status  quo. 

Many  regard  arguments  about  cultural  hege¬ 
mony  as  spurious  expressions  of  Marxist  ideol¬ 
ogy — the  claims  of  “false  consciousness”  in  a  new 
guise.  But  the  conception  of  cultural  hegemony 
sketched  here  does  not  claim  superior  knowledge 
of  people’s  objective  interests;  it  simply  argues  that 
elite  views  and  interests  tend  to  shape  the  views 
and  preferences  of  subordinate  groups  more  than 
vice  versa. 

It  is  useful  to  explore  these  problems  of  cultural 
inequality  in  regard  to  the  specific  issue  of  politi¬ 
cally  relevant  knowledge.  Political  knowledge  is  a 
major  power  resource,  and  its  distribution  is  pro¬ 
foundly  unequal.  This  is  true  most  obviously  in 
regard  to  information.  Those  who  are  favored  by 
education  and  their  position  in  networks  of  infor¬ 
mation  have  clear  advantages  over  those  less 
favored.  Furthermore,  for  judging  and  absorbing 
the  flow  of  information  and  for  the  ability  to  resist 
spin,  complex  background  knowledge  is  necessary. 
This  is  largely  a  function  of  education  and  of  the 
repeated  exercise  of  such  assessments. 
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Beyond  these  cognitive  advantages,  there  is 
another  more  basic  kind.  It  concerns  the  knowl¬ 
edge  that  is  generated  by  systematic  inquiry.  Here, 
the  question  is  not  how  this  newly  generated 
knowledge  is  disseminated  but  whether  the  prob¬ 
lem  formulations  and  specific  questions  asked  in 
research  are  shaped  by  concerns  and  presupposi¬ 
tions  as  well  as  by  blind  spots  that  correspond  to 
the  concerns  and  blind  spots  of  select  groups  while 
neglecting  the  interests  and  perspectives  of  others. 

The  establishment  of  feminist  and  African 
American  scholarly  niches  may  have  gone  against 
the  norms,  but  it  certainly  is  an  indication  that  for 
a  long  time  these  norms  of  universalism  had  rather 
disappointing  results.  The  problem  may  well  be 
equally  serious  for  class-based  cognitive  interests. 
Researchers  from  a  lower  class  background  acquire 
a  new  social  and  economic  status  in  the  process  of 
their  training  and  career,  and  they  have  better 
career  chances  if  they  stay  within  established  theo¬ 
retical  frameworks  of  basic  assumptions  and  prob¬ 
lem  formulation. 

That  the  orientation  of  research  is  of  great 
importance  for  long-term  political  gains  is  also 
reflected  in  the  proliferation  of  privately  financed 
(though  tax  supported)  think  tanks  in  the  United 
States,  many  of  which  have  a  right-of-center  out¬ 
look.  In  many  other  Western  countries,  the  orien¬ 
tation  of  similar  centers  is  less  weighted  toward 
one  side  and  often  balanced  by  publicly  supported 
institutions. 

Organization  for  collective  action  in  voluntary 
associations,  unions,  and  parties  is  the  most  prom¬ 
ising  power  resource  of  “the  many.”  Collective 
organization  can  compensate  for  the  impact  of 
social  and  economic  inequality  on  democratic 
politics.  It  can  mobilize  voters  and  campaigners,  it 
can  raise  substantial  funds  if  membership  and 
small  contributions  are  numerous  enough,  it  can 
represent  otherwise  dispersed  interests  between 
elections,  and  it  can  compensate  to  some  extent  for 
the  cultural  hegemony  of  the  most  influential 
groups  and  institutions,  advancing  its  own  views 
and  symbols  and  shielding  the  followers  against 
the  dominant  influences. 

This  compensation  for  the  impact  of  social  and 
economic  inequality  requires,  however,  that  these 
organizations  are  relatively  autonomous  from 
dominant  groups  and  responsive  to  their  constitu¬ 
encies.  Responsiveness  may  be  endangered  by 


oligarchic  tendencies  that  are  hard  to  control. 
Furthermore,  dominant  interests  quite  often  seek 
to  protect  themselves  by  sponsoring  sympathetic 
organizations  and  parties  with  broad  appeal. 
Many  voluntary  associations  rely  on  private 
(though  often  tax  supported)  funding  from  a  rela¬ 
tively  small  number  of  wealthy  patrons.  A  simple 
measure  of  participation  in  civil  society,  then,  is 
not  sufficient  to  gauge  the  compensatory  potential 
of  collective  organization.  What  is  decisive  is  that 
relatively  autonomous  organizations  protect  oth¬ 
erwise  disadvantaged  interests. 

The  importance  of  autonomous  collective 
organization  in  parties  and  politically  relevant 
voluntary  associations  for  leveling  political 
inequality  can  hardly  be  exaggerated.  If  the  eco¬ 
nomic  and  political  power  of  concentrated 
wealth,  the  cultural  hegemony  of  a  limited  set  of 
elites,  and  the  decision-making  power  of  an 
imperfectly  controlled  state  apparatus  are  the 
most  important  factors  underlying  political 
inequality,  strong  and  autonomous  organization 
of  subordinate  interests  is  the  most  important 
counterbalancing  factor.  It  offers  political  com¬ 
petition  to  the  influence  of  wealth  and  high  sta¬ 
tus  and  limits  the  relative  autonomy  of  the  state 
vis-a-vis  subordinate  interests.  It  gives  subordi¬ 
nate  groups  some  protection  against  hegemonic 
influence  by  offering  alternative  views  and  orien¬ 
tations.  And  it  can  strengthen  the  universalistic 
norms  of  academic,  social,  and  political  analysis 
as  well  as  of  news  reporting  by  sponsoring 
research  informed  by  the  concerns  and  interests 
of  subordinate  groups. 

The  conditions  favoring  or  hindering  the  devel¬ 
opment  of  such  forms  of  collective  organization 
are  closely  related  to  the  conditions  shaping  politi¬ 
cal  participation  generally.  Prominent  among  them 
are  mutually  reinforcing  processes  in  the  political 
sphere:  The  prospect  of  political  success  stimulates 
political  participation,  while  its  absence  stifles  it; 
this  seems  to  be  a  major  reason  why,  in  many 
countries,  politically  oriented  participation  is  more 
prevalent  among  the  middle  classes  than  among 
the  working  class  or  the  poor.  Gaining  or  losing 
political  influence  on  decisions  about  legislation 
and  implementation  can  in  turn  favor  or  hinder 
collective  organization. 

In  countries  whose  institutional  setting  has  been 
shaped  by  strong  unions,  strong  parties  of  the  left, 
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and  a  significant  participation  of  these  parties  in 
government,  class  and  status  differences  in  social 
and  political  participation  are  much  reduced  or 
eliminated.  This  claim  has  empirical  support,  and 
on  simple  reflection  it  makes  good  theoretical  sense. 
It  is  no  real  surprise  that  the  sustained  impact  of 
successful  self-organization  of  subordinate  interests 
and  of  the  representation  of  these  interests  in  the 
governance  of  societies  diminishes  the  social,  eco¬ 
nomic,  and  cultural  factors  maintaining  political 
inequality  and  leads  to  a  leveling  in  the  social  and 
political  participation  of  different  socioeconomic 
strata.  By  contrast,  the  disappointing  performance 
of  recently  democratized  political  systems  in  Latin 
America  seems  largely  due  to  a  renewed  weakness 
in  the  organized  representation  of  subordinate 
interests. 

These  interactions  between  collective  organiza¬ 
tion  and  political  success  can  create  stable  paths  of 
advance  or  decline  of  equality  in  a  political  system, 
once  critical  turning  points  are  passed.  Therefore, 
the  chances  of  a  political  self-organization  of  sub¬ 
ordinate  interests  and  its  potential  effects  on 
political  equality  may  well  be  dependent  on  his¬ 
torical  conditions  that  are  not  easily  changed. 

Democratic  Equality  Versus 

Social  and  Economic  Power 

Political  equality  is  a  critical  dimension  of  the 
quality  of  any  system  of  democratic  rule.  It  stands 
in  tension  with  the  structure  of  social,  economic, 
and  cultural  inequality.  Even  democracy  as  mini¬ 
mally  conceived  is  only  possible  if  political  deci¬ 
sions  are  to  some  extent  separated  from  the 
system  of  class,  status,  and  power.  But  since  struc¬ 
tured  inequality  can  never  be  entirely  eradicated 
and  political  decision  making  can  never  be  fully 
emancipated  from  the  inequality  in  power 
resources,  democratic  equality  is  a  goal  that  can 
only  be  approximated  at  a  considerable  distance. 
At  the  same  time,  political  equality  is  a  value  that 
is  grounded  in  many  practices  and  commitments 
common  in  democracies.  The  principle  of  an 
equal  vote  is  only  one  but  not  the  least  among 
these. 

The  tension  between  democratic  equality  and 
the  impact  of  differential  economic,  cultural,  and 
social  power  is  not  a  constant  across  modern 
democracies.  There  exists  a  great  deal  of  variation, 


which  ranges  from  democracies  that  remain 
extremely  shallow  in  their  egalitarian  substance  to 
realistic,  if  still  limited  approximations  to  demo¬ 
cratic  equality  that  are  built  on  a  significant 
empowerment  of  socially,  economically,  and  cul¬ 
turally  subordinate  groups.  To  deepen  democracy 
in  the  direction  of  greater  political  equality  requires 
strong  and  sustained  policies  promoting  social  and 
economic  equality. 

Dietrich  Ruescbemeyer 
Brown  University 
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Ethics 


Ethics  is  the  most  general  term  used  to  character¬ 
ize  the  different  conceptions  of  the  good  as 
opposed  to  the  evil  and  the  just  as  opposed  to  the 
unjust.  These  abstract  conceptions  vary  from  one 
school  of  thought  to  the  other.  Kantianism  favors 
an  approach  that  focuses  on  principles,  insisting 
on  the  role  of  duty  and  the  universality  of  moral 
law.  It  builds  a  liberal  ethics  stressing  the  necessity 
of  a  public  debate  on  the  universality  of  core 
moral  and  political  choices.  In  its  Benthamian  ver¬ 
sion,  consequentialism  and  utilitarianism  plead  in 
favor  of  the  maximization  of  pleasure  and  the 
minimization  of  pain,  while  John  Stuart  Mill 
argues  in  favor  of  the  maximization  of  happiness 
and  the  minimization  of  unhappiness.  Aristotelian 
ethics  focuses  on  the  importance  of  virtues  that 
characterize  the  citizen  of  the  polis.  When  ethics  is 
contextualized,  it  is  most  often  referred  to  as 
morality  and  designates  a  less  abstract  phenome¬ 
non.  The  different  sets  of  values  to  which  the  good 
and  the  just  are  associated  are  then  rooted  in  his¬ 
torical  and  social  preferences.  Finally,  ethics  is 
also  used  as  a  synonym  of  morals  that  character¬ 
ize  the  domain  of  habits  that  includes  ethos  and 
mores.  The  etymological  root  of  the  word  ethics — 
ethos — already  includes  this  double  meaning:  eth¬ 
ics  and  habits.  This  threefold  dimension  of  this 
notion — ethics,  morality,  and  ethos — is  of  great 
importance  for  political  scientists.  It  parallels  the 
division  between  different  approaches  used  in  the 
discipline:  normative  accounts,  causal  explana¬ 
tory  analyses  of  social  phenomena  or  correlation 
analyses,  and  interpretative  analyses. 

Political  science  echoes  a  strong  interest  for  eth¬ 
ics  and  its  related  themes  that  has  prevailed  in  the 
humanities  since  the  early  1990s.  However,  the 
degree  to  which  ethics  is  accepted  as  a  legitimate 
theme  of  research  differs  from  one  subfield  to  the 
other.  Some  political  scientists  are  still  reluctant  to 
use  this  terminology.  It  is  more  common  for  politi¬ 
cal  theorists  to  do  so  because  at  least  some  of  them 
engage  in  debates  on  normative  issues.  Political 
sociologists  are  more  skeptical  about  using  ethics 
as  a  concept.  Since  they  usually  maintain  a  position 
of  axiological  neutrality,  they  are  more  inclined  to 
study  the  functional  uses  of  morality.  In  compara¬ 
tive  politics,  the  use  of  ethics  as  a  terminology  and 


as  an  abstract  and  absolute  concept  can  also  be  a 
controversial  issue,  because  ethics,  some  political 
scientists  argue,  has  different  cultural  meanings. 
For  similar  reasons,  ethics  is  also  a  controversial 
question  in  the  field  of  international  relations;  it 
has,  however,  attracted  significant  attention  and 
has  become  more  and  more  popular  in  the  litera¬ 
ture  over  the  past  2  decades. 

The  social  sciences  have  been  primarily  inter¬ 
ested  in  explaining  and  understanding  ethics  as  a 
social  and  political  phenomenon.  Political  scien¬ 
tists  have  drawn  their  inspiration  from  the  works 
of  major  authors  who  have  studied  the  genesis  of 
ethics  and  its  functions.  They  have  used  this 
knowledge  to  elaborate  a  new  thinking  on  contem¬ 
porary  issues  in  which  norms  and  values  play  a 
significant  role.  This  has  created  a  debate  that  is  of 
great  significance  for  all  the  branches  of  political 
science  and  has  had  a  deep  epistemological  reso¬ 
nance.  It  also  unveils  tensions  and  quarrels  between 
social  sciences  and  normative  theories,  primarily 
those  proposed  in  philosophy,  that  need  to  be 
elaborated.  This  development  should  encourage 
political  scientists  to  engage  in  an  interdisciplinar¬ 
ity  debate  and  be  open  to  views  that  lie  at  the  close 
borders  of  their  field. 

The  Social  History  of  Ethics 

There  is  a  long-standing  debate  on  the  historical 
origins  of  ethics  and  the  mechanisms  that  have  led  to 
the  emergence  and  the  development  of  ethics,  moral¬ 
ity,  or  morals.  The  work  of  Emile  Durkheim  serves 
as  a  reference  for  those  who  wish  to  ground  moral¬ 
ity  in  the  community.  Morality  is  rooted  in  ancient 
religious  and  ethnic  communities,  and  religion 
serves  as  a  pillar  of  moral  values.  Durkheim  and  his 
followers  see  morality  as  a  collective  phenomenon 
that  includes  looser  conceptions  of  morality,  such  as 
an  ethos,  around  which  the  members  of  the  group 
coalesce.  To  Paul  Fauconnet,  one  of  Durkheim’s 
disciples,  who  has  made  a  significant  contribution  to 
the  sociological  study  of  responsibility,  moral  and 
legal  sanctions  are  the  cement  of  society. 

Ethnic  groups  and  the  political  communities 
such  as  nation-states  produce  values  and  moral 
codes,  creating  a  web  of  meaning  that  has  deep 
political  consequences.  Republicanism  is  a  form  of 
secular  religion  that  brings  social  cohesion  to  wide 
collective  entities  that  otherwise  face  the  risk  of 
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anomy.  As  a  collective,  the  French  Republique  is 
nourished  by  this  sense  of  belonging  to  a  national 
community.  The  French  nation-state  is  sustained 
by  the  efforts  of  those  who  strongly  believe  in 
republican  values.  This  community  has  its  equiva¬ 
lents  of  cardinals  and  priests:  high-ranking  civil 
servants  who  serve  as  role  models,  who  demon¬ 
strate  a  commitment  to  the  well-being  of  the  com¬ 
munity  and  to  strong  ideals  of  freedom,  justice, 
and  brotherhood.  The  comparison  with  the  United 
States  is  indeed  very  illuminating.  The  First 
Amendment  to  the  U.S.  Constitution  prohibits 
Congress  from  establishing  religion  and  from  pre¬ 
venting  the  free  exercise  of  religion,  a  prohibition 
that  Thomas  Jefferson  described  as  “building  a 
wall  of  separation”  between  church  and  state. 
Various  U.S.  Supreme  Court  decisions  have  pro¬ 
vided  guidelines  for  what  the  establishment  and 
free  exercise  clauses  prohibit.  Thus,  unlike  in  some 
European  countries,  there  is  no  official  (i.e.,  state) 
religion  in  the  United  States. 

Ethics  can  be  a  powerful  driving  force  for  the 
development  of  societies.  The  development  of 
capitalism  relies,  according  to  the  German  sociolo¬ 
gist  Max  Weber,  on  puritan  values  that  were  pre¬ 
dominant  in  America  since  its  origins.  An  individu¬ 
alistic  approach  best  captures  the  mechanisms  that 
prevail  in  the  emergence  and  then  the  development 
of  capitalism.  According  to  Weber,  individuals 
forge  their  own  capitalistic  ethos  grounded  on  the 
belief  of  predestination.  The  puritan  has  to  show 
evidence  of  his  or  her  worldly  success  to  dispel  any 
doubt  about  his  or  her  salvation  by  God.  Social 
scientists  such  as  Peter  Berger  and  David  Martin 
have  been  tempted  by  this  mode  of  analysis  when 
explaining  the  role  of  Protestantism — and  its  new 
churches — in  Latin  America  or  in  countries  such  as 
South  Korea,  where  the  growth  of  capitalism  has 
paralleled  the  spread  of  this  new  creed.  This  phe¬ 
nomenon  is  explained  in  terms  of  correlation. 

In  his  work  on  the  court  society,  Norbert  Elias 
has  drawn  our  attention  to  another  aspect  of  the 
development  of  mores.  Individuals  are  part  of  a 
“civilizational  process”  that  finds  its  origins  in  the 
postmedieval  European  court  society  and  that  leads 
them  to  greater  self-restraint.  This  psychological 
phenomenon  is  nurtured  by  a  historical  dynamic 
that  has  important  social  consequences  for  the 
functioning  of  social  organizations  and  state 
bureaucracies.  Of  course,  several  major  events  in 


the  history  of  modern  society — and  most  particu¬ 
larly  World  War  II — serve  as  strong  counterexam¬ 
ples  that  weaken  the  theory.  However,  the  work  of 
Elias  is  an  interesting  contribution  to  the  study  of 
progress  and  the  euphemization  of  violence  in  con¬ 
temporary  democracies,  for  example,  when  study¬ 
ing  the  attitudes  toward  casualties  in  war,  at  a  time 
when  the  number  of  casualties  is  changing  more 
dramatically.  Another  dynamic  that  is  of  great 
importance  in  the  study  of  ethics  is  the  moralization 
of  societies  and  the  measure  of  moral  progress. 

Michel  Foucault  has  brought  to  light  another 
aspect  of  morality.  Although  the  term  itself  is  con¬ 
troversial  when  it  is  used  in  reference  to  Foucault’s 
thought,  morality  is  part  of  the  first  phase  of  his 
work,  and  individual  ethics  is  at  the  center  of  his 
study  of  sexuality.  Foucault  explains  in  genealogical 
terms  the  role  of  utilitarianism  in  power  relations  in 
18th-  and  19th-century  England.  Utilitarianism  is 
used  when  considering  an  architectural  project  for 
building  a  prison.  Utilitarianism  is  also  at  the  center 
of  a  critique  of  political  power  addressed  by  the 
agents  of  the  market;  Foucault  refers  to  this  phe¬ 
nomenon  as  the  “critique  of  governmental  reason” 
(i critique  de  la  raison  gouvernementale).  Utility 
becomes  the  cornerstone  of  their  critique  of  politi¬ 
cal  and  particularly  state  decision.  Political  leaders 
and  politics  as  a  system  are  still  being  challenged  by 
market-oriented  agents  and  or  utilitarians  who 
frame  their  discourse  in  similar  terms.  In  the  second 
part  of  his  work,  Foucault  discusses  the  modes  of 
sexual  identities  and  relationships  in  ancient  Greece 
and  highlights  one  of  its  principal  aspects:  the  “care 
of  the  self”  [le  souci  de  soi).  This  self-knowledge 
reflects  the  virtues  of  the  Greek  citizen;  it  also  has  a 
political  dimension  to  the  extent  that  these  personal 
traits  characterize  those  who  belong  to  the  polis. 

These  social  science  approaches  all  provide  an 
explanation  of  the  emergence  and  development  of 
ethics.  As  in  the  case  of  Weber,  Elias,  and  Foucault, 
they  also  favor  a  study  of  the  understanding  of  eth¬ 
ics  as  a  language  or  as  an  element  of  discourse. 
Explaining  and  understanding  are  also  intermingled 
with  normative  views.  Durkheim’s  philosophical 
background  is  Kantianism.  Weber  makes  a  distinc¬ 
tion  between  the  scholar’s  principled  approach  to 
ethics — an  ethics  of  conviction — and  a  consequen- 
tialist  framework  that  belongs  to  the  sphere  of  poli¬ 
tics.  Foucault  gives  an  illuminating  interpretation  of 
utilitarianism  and  also  puts  into  perspective  the  role 
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of  virtue  that  lies  at  the  core  of  Greek  philosophy 
and  especially  Aristotelian  ethics.  Elias  also  gives  an 
example  of  the  use  of  a  virtue,  restraint,  as  a  trait  of 
moral  character  rooted  in  the  psyche. 

The  Tensions  Between  Ethics  and  Politics 

Those  who  successfully  promote  ethical  ideas 
often  have  access  to  power  and  to  material 
resources.  This  fact  has  led  some  major  philoso¬ 
phers  to  see  ethics  as  an  ideology  used  to  subordi¬ 
nate  the  weak.  The  disillusionment  expressed  in 
this  ontological  and  political  vision  is  shared  by 
many  political  scientists.  Thomas  Elobbes  argues 
in  favor  of  the  use  of  religion  as  a  source  of  legiti¬ 
macy  for  the  Leviathan  that  exercises  his  control 
over  the  church.  Marxism  points  at  the  predomi¬ 
nant  role  of  economic  and  political  power  over 
ideas.  Political  scientists  have  mostly  focused  on 
one  social  dimension  of  ethics  and  its  instrumen- 
talization  of  ethics  by  the  rulers. 

Realists  and  neo-Marxists  consider  that  mate¬ 
rial  power  prevails  and  that  ideas  play  a  political 
role  when  they  are  not  an  obstacle  to  the  pursuit 
and  maximization  of  the  power  of  social  and 
political  elites.  The  founding  fathers  of  classical 
realism  have  argued  along  those  lines.  Neo-Marxist 
approaches  or  theoretical  frameworks  inspired  by 
the  work  of  Pierre  Bourdieu  also  tend  to  discard 
the  role  of  ethics  in  the  fields  of  politics  and  eco¬ 
nomics.  When  they  refer  to  ethics,  rulers  use  this 
discourse  of  legitimation  to  pursue  their  ends;  they 
only  play  lip  service  to  ethics  and  only  when  it 
does  not  constrain  their  action.  On  the  contrary,  it 
is  a  resource  used  to  maximize  their  power.  When 
discussing  the  role  of  lawyers  or  moral  entrepre¬ 
neurs,  Bourdieu’s  followers  often  refer  to  a  “mar¬ 
ket  of  virtue,”  widely  dominated  by  those  who 
have  the  resources  to  set  the  rules  of  the  game. 

However,  there  can  be  other  more  interesting 
and  nuanced  approaches  to  this  question.  Depending 
on  the  historical  and  cultural  context,  morality  and 
the  legitimacy  derived  from  its  sources  can  be  a 
powerful  resource.  This,  however,  does  not  neces¬ 
sarily  imply  the  structural  subordination  of  ethical 
agents  to  power.  Religious  actors  can  be  crucial 
players  that  engage  in  bargaining  relations  with 
princes.  Indeed,  over  its  history,  the  Papacy  has 
often  been  courted  by  leaders  willing  to  be  endorsed 
by  the  Church.  In  contemporary  politics,  the  role  of 


religious  groups  has  been  highlighted  when  they  use 
their  bargaining  powers  in  democratic  elections. 

Even  classical  realism  is  more  nuanced  than  the 
general  materialistic  approach  that  is  generally 
attributed  to  it,  in  which  ethics  is  subordinated  to 
power.  Indeed,  at  the  individual  level,  Hans 
Morgenthau  points  out  the  virtues  of  political 
leaders,  and  in  doing  so  he  rehabilitates  the  role  of 
ethics  in  politics.  Wisdom  as  a  moral  trait  is  an 
important  resource  for  the  prince  or  political  head. 
The  art  of  politics  lies  on  some  moral  capacities 
and  virtues  in  an  Aristotelian  sense. 

Interests  and  Ideas 

The  interaction  between  interests  and  ideas — which 
include  moral  visions  or  beliefs — is  one  of  the  cen¬ 
tral  conceptual  dimensions  of  the  question  of  moral 
authority.  When  interests  determine  the  content  of 
ideas,  material  power  has  the  capacity  to  dictate  the 
moral  message  of  moral  entrepreneurs.  However, 
as  Weber  has  brought  to  light,  ideas  do  matter. 
Moral  agents  and  the  moral  visions  that  might 
emanate  from  them  can  in  specific  contexts  orient 
the  definition  of  the  interests  of  and,  therefore,  the 
decisions  made  by  rulers. 

Constructivists  describe,  interpret,  and  explain 
the  trajectories  of  certain  norms.  Contingent  visions 
of  morality  can  lie  at  the  origins  of  the  norm  trajec¬ 
tory  as  Martha  Finnemore,  Nina  Tannenwald,  and 
Richard  Price  indicate  in  their  work  on  the  cultures 
of  national  security.  This  genealogical  analysis  high¬ 
lights  the  social  origins  of  certain  prohibitions.  A 
taboo  can  be  a  source  of  inaction,  as  in  the  case  of 
the  nuclear  weapons  not  used  since  World  War  II; 
such  a  taboo  also  parallels  other  abstract  visions  of 
the  ethics  of  nuclear  weapons  that  set  a  legal  and 
ethical  framework  banning  their  use. 

There  are  numerous  critiques  to  this  approach. 
Critical  sociologists  and  realists  first  consider  that 
liberal  theory  and  constructivism  overestimate  the 
role  of  ethics  in  politics  and  the  power  of  norms 
and  moral  entrepreneurs.  Critical  sociologists  also 
strongly  criticize  liberal  theorists  and  constructiv¬ 
ists  for  their  political  agenda.  These  critics  argue 
that  liberals  and  constructivists  are  too  complacent 
about  the  relationship  between  states  and  power 
elites  and  thus  reproduce  their  own  discourse  of 
legitimation,  whereas  the  task  of  the  social  scien¬ 
tist  would  be  to  deconstruct  this  discourse.  Their 
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epistemological  flaws  are  said  to  reflect  their 
political  biases  and  their  naivete. 

There  are  more  convincing  critiques  that  can  be 
addressed  to  constructivist  analysis  of  morality.  The 
constructivist  focus  on  moral  progress  can  induce  a 
bias  in  their  vision  of  international  relations.  There 
is  a  need  to  differentiate  between  the  source  of  the 
emergence  of  a  norm — a  taboo,  for  instance,  or  a 
natural  instinct  of  the  just  and  the  good — and  the 
consequence  of  the  application  or  nonapplication  of 
the  norm.  There  is  also  a  need  to  take  into  account 
the  indirect  consequences  of  the  implementation  of 
a  norm.  A  norm  might  have  some  indirect  negative 
consequences  that  are  sometimes  unforeseeable  or 
remain  unforeseen;  thus,  such  a  norm  may  not  be 
considered  as  a  cause  of  moral  progress. 

Universalism  and  Relativism 

When  political  scientists  study  ethics,  willingly  or 
not,  they  engage  in  an  epistemological  debate  with 
normativists,  primarily  philosophers.  Indeed,  sev¬ 
eral  key  epistemological  issues  have  recently 
emerged  and  have  created  lines  of  debate  and  front 
lines  in  the  subfields  of  political  science.  There 
have  been  some  interesting  discussions  that  all  tes¬ 
tify  to  the  necessity  of  interdisciplinarity  in  the 
social  sciences  and  in  the  humanities. 

The  question  of  universalism  versus  relativism  is 
one  of  the  most  common  issues  that  has  created  ten¬ 
sions  both  within  the  political  science  community 
and  also  in  the  discussions  between  social  scientists 
and  philosophers,  as,  for  example,  in  the  debates 
between  Clifford  Geertz  and  Richard  Rorty  on  rela¬ 
tivism,  antirelativism,  and  anti-antirelativism.  The 
position  that  power  determines  what  is  just  and 
good,  advocated  by  some  political  scientists,  implies 
a  strong  relativist  stance.  The  debate  about  democ¬ 
racy  and  democratization  is  an  interesting  case  in 
this  regard.  Liberal  political  theorists  implicitly 
engage  in  a  normative  debate  about  the  moral  supe¬ 
riority  of  democracies  over  other  types  of  regimes. 
Liberal  theorists  argue  that  respecting  the  rule  of 
law  and  ensuring  pluralism  are  duties  that  a  politi¬ 
cal  regime  must  fulfill.  Consensus  does  not  prevail. 
Political  sociologists  or  anthropologists  would  make 
the  case  that  societies  should  have  a  certain  degree 
of  autonomy,  with  each  following  its  own  path. 
They  therefore  de  facto  create  their  own  norms, 
values,  and  ethos.  In  the  long  run,  do  societies  tend 


to  be  democratic?  In  the  1970s,  developmental 
studies  argued  that  there  is  a  political  trajectory  of 
development  leading  to  democracy.  This  mode  of 
explanation  has  strong  normative  implications  and 
has  been  severely  criticized  for  being  Western¬ 
centric,  paternalistic,  and  hegemonic. 

Are  rules  of  justice  universal?  If  there  is  such  a 
universalism,  to  what  extent  ought  it  be  promoted 
or  imposed  by  coercive  measures?  This  question  is 
of  crucial  importance  for  the  various  branches  of 
political  science.  Political  sociologists  argue  about 
the  reasons  that  motivate  the  imposition  of  moral 
norms  and  about  their  consequences.  They  also 
discuss  the  universality  of  norms  and  their  social 
and  historical  contextualization.  Philosophers  too 
are  deeply  concerned  with  these  issues.  The  theory 
of  justice  presented  by  philosopher  John  Rawls  has 
been  one  of  the  major  contributions  to  political 
theory  over  the  past  decades.  Rawls’s  vision  of 
justice  as  fairness  and  his  concept  of  the  veil  of 
ignorance  have  greatly  influenced  the  literature  in 
this  field  and  also  points  at  other  questions  that  go 
beyond  political  theory.  Indeed,  Rawls’s  theory  of 
justice  was  elaborated  in  a  context  where  individu¬ 
als  gather  in  one  closed  political  community.  How 
is  this  question  to  be  elaborated  at  the  interna¬ 
tional  level?  Would  a  Rawlsian  approach  favor  the 
imposition  of  principles  of  justice  on  nonliberal 
states?  His  views  on  this  question  are  nuanced. 
Rawls  argues  in  The  Law  of  Peoples  that  although 
liberal  regimes  are  superior  to  nonliberal  ones,  tol¬ 
erance  should  prevail  so  long  as  a  regime  does  not 
violate  human  rights;  however,  international  inter¬ 
vention  is  justified  when  human  rights  are  violated. 

The  tendency  toward  an  increasing  social  and 
cultural  interdependence  is  a  strong  incentive  to 
develop  a  new  thinking  in  this  area.  “Thick”  and 
“thin”  are  the  two  categories  that  Michael  Walzer 
uses  to  distinguish  a  strong,  abstract,  and  a  priori 
normative  judgment  from  one  in  which  flexibility 
prevails  and  where  moral  judgment  is  more  contex¬ 
tualized.  In  this  second  case,  there  would  be  a  core 
set  of  principles  that  should  never  be  violated;  the 
avoidance  of  inflicting  unnecessary  pain  could  be 
such  a  principle.  Different  societies  may  have  differ¬ 
ent  interpretations  of  the  same  principle,  and  their 
members  would  be  free  to  preserve  their  cultural 
identity.  The  core  principles  of  a  democracy,  such 
as  public  liberties  and  the  right  not  to  be  discrimi¬ 
nated  against,  are  inalienable,  whereas  moral  and 
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religious  precepts  that  locally  regulate  marriage  or 
education  would  be  untouched  as  far  as  they  do  not 
violate  core  principles  such  as  physical  integrity  or 
respect  for  human  dignity. 

Ethics  as  a  Discussion 

This  phenomenon  of  ethics  as  a  discussion  is  best 
captured  by  pragmatist  philosophers.  In  the  long¬ 
standing  tradition  of  American  philosophy  that  goes 
back  at  least  to  John  Dewey,  authors  such  as 
Richard  Rorty  argue  in  favor  of  a  “relativism  of 
justification.”  Ethics  is  thought  of  as  a  forum  for 
debate,  a  public  space  where  the  most  plausible, 
morally  correct,  and  politically  appropriate  justifica¬ 
tion  prevails.  This  implies  the  use  of  power — mate¬ 
rial  power  and  or  knowledge — to  achieve  this  end. 
Ethics  should  also  be  scrutinized  by  the  wider  audi¬ 
ence  that  includes  other  Western  democracies. 
Therefore,  ethics  is  a  dialectical  mechanism  of  justi¬ 
fication.  It  is  a  justification  used  by  people  and  insti¬ 
tutions  to  rally  the  greatest  number  of  people  to  their 
cause  and  to  anticipate  the  critique  they  would  face 
ex  ante  and  ex  post.  It  is  also  a  weapon  used  by 
competitors  who  wish  to  block  or  modify  the  deci¬ 
sions  taken  by  the  majority.  Finally,  ethics  is  the 
output  of  such  a  dialectic.  Its  realm  is  also  pluridi- 
mensional;  it  can  take  place  at  the  local,  national, 
international  (the  arena  of  interstate  relations), 
transnational  (the  realm  of  transnational  actors),  or 
global  levels  (where  states  interact  with  transna¬ 
tional  actors). 

Ariel  Colonomos 
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Paris,  France 
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Ethnicity 


Understanding  ethnicity  is  a  challenge  for  political 
science.  How  is  the  good  polity  to  deal  with  cultural 
differences:  Should  it  insist  that  each  individual  be 
treated  “equally,”  or  should  it  take  note  of  cultural 
variation  and  ethnic  diversity?  These  questions  pose 
challenges  for  normative  theorists.  Equally  compel¬ 
ling  are  the  challenges  posed  for  students  of  devel¬ 
opment,  a  subfield  in  which  the  study  of  ethnicity 
looms  large.  Although  there  are  many  things  that 
“development”  can  denote,  this  discussion  limits  it 
to  two:  economic  prosperity  and  political  security. 
Ethnicity  plays  an  ambivalent  role  with  respect  to 
each,  it  would  appear,  promoting  private  accumula¬ 
tion  but  inhibiting  public  investment  and  multiply¬ 
ing  opportunities  for  peaceful  bargaining  while  also 
provoking  bloodshed  and  violence.  Even  within  a 
discussion  so  narrowly  framed,  the  relationships 
between  ethnicity  and  development  are  complex. 
This  entry  discusses  these  relationships  from  a 
political  economy  perspective. 

Prosperity 

The  formation  of  capital  underpins  the  growth  of 
prosperity.  When  people  form  capital,  they 
refrain  from  consumption  and  set  a  portion  of 
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their  earnings  aside;  by  investing  that  portion 
today,  they  seek  to  secure  higher  levels  of  consump¬ 
tion  tomorrow.  When  the  gains  from  the  invest¬ 
ment  outweigh  the  losses  from  the  initial  sacrifice, 
then  the  result  is  an  increase  in  welfare  for  the 
individual  and  economic  growth  for  the  society. 

When  we  speak  of  investment,  we  are  speaking 
of  capital  formation.  Families  play  a  major  role  in 
capital  formation  and,  in  particular,  in  the  forma¬ 
tion  of  human  capital.  They  do  so  by  structuring 
relationships  between  generations  such  that  the 
older  generation,  rather  than  consuming  all  its 
income,  instead  devotes  a  portion  of  its  resources 
to  the  younger,  who  in  turn  will  prosper  and, 
while  doing  so,  offer  support  to  those  who  had 
previously  invested  in  them. 

This  intergenerational  flow  of  funds  underpins 
two  forms  of  capital  formation,  and  ethnic  groups 
play  a  major  role  in  both.  One  is  education,  another 
migration.  The  two  are  closely  related,  of  course,  as 
by  educating  their  children,  parents  seek  to  prepare 
them  to  compete  in  urban  labor  markets. 

To  illustrate,  consider  Victor  Uchendu,  a  noted 
anthropologist,  who  writes  of  his  childhood  in  east¬ 
ern  Nigeria.  Uchendu  recalls  the  pride  he  felt  as  a 
child  as  his  academic  abilities  became  apparent.  He 
writes,  too,  of  the  sacrifices  his  parents  made  to  pay 
for  his  schooling  and  of  the  contributions  collected 
by  his  community  to  send  him  for  further  education 
abroad.  Each  step  of  the  way — from  village  school 
room,  to  secondary  school,  and  thence  to  the  uni¬ 
versity — opened  up  a  broader  range  of  opportuni¬ 
ties.  Uchendu  also  writes  of  the  burden  of  his  suc¬ 
cess;  for  those  who  made  it  possible  then  looked  to 
him  for  leadership,  hoping  that  he,  as  an  “Ibo  son 
abroad,  ”  would  facilitate  their  own  transition  from 
villager  to  a  member  of  the  global  community. 

The  pressures  placed  on  Uchendu  find  their  par¬ 
allel  in  other  upwardly  mobile  communities.  The 
Kikuyu  of  Kenya,  the  Chinese  of  Malaysia,  the 
Jews  in  Europe,  and  the  Bengali  in  India:  These 
and  other  ethnic  groups  have  pursued  a  strategy  of 
collective  improvement,  based  on  the  transfer  of 
resources  between  generations  and  the  formation 
of  human  capital. 

Already  suggested  is  the  close  link  between  edu¬ 
cation  and  migration:  Investments  in  the  former 
are  often  made  to  ensure  success  in  the  latter,  as 
the  skills  acquired  in  school  translate  into  better 
jobs  in  urban  labor  markets.  Migration,  it  has  been 


found,  like  education,  represents  a  strategy  formu¬ 
lated  by  families  rather  than  a  decision  made  by 
individuals.  With  globalization,  moreover,  the 
urban  destination  need  not  be  an  adjacent  city  or 
the  national  capital,  but  rather  Houston,  London, 
or  Paris.  Reflecting  the  investment-like  character 
of  the  process,  the  Northern  flow  of  people  is  mir¬ 
rored  by  a  Southern  flow  of  remittances.  But  why 
do  these  assume  an  ethnic  character? 

One  reason  is  that  as  educated  persons  from  the 
countryside  flow  into  labor  markets  in  towns,  their 
skills  confer  a  competitive  advantage.  Locals  often 
respond  by  demanding  that  places  in  the  classroom 
be  reserved  for  them  and  that  the  government 
guarantee  them  a  “fair  share”  of  the  jobs.  As  “sons 
of  the  soil,”  they  demand  preferential  treatment 
(Myron  Weiner,  1978).  Such  responses  character¬ 
ized  those  of  the  Yoruba  tribe  in  the  face  of  Ibo 
immigration,  the  Malays  in  response  to  the 
Chinese,  and  the  local  population  of  Assam  when 
challenged  by  the  Bengali — the  list  is  long. 

Less  apparent  but  no  less  powerful  is  the  role  of 
ethnicity  in  strengthening  the  expectations  that 
shape  the  relationships  between  generations.  It  is 
the  adults  who  sacrifice  and  invest  and  the  youths 
who  benefit  and  repay.  The  sacrifice  is  immediate; 
the  repayment  is  necessarily  delayed.  While  both 
parents  and  offspring  should  benefit  from  the 
investment,  it  is  the  parents  who  bear  the  risk; 
should  they  fear  the  youths’  defection,  the  pact 
between  generations  will  break  down.  The  role  of 
the  ethnic  group  is  to  reduce  such  fears. 

One  way  is  by  celebrating  the  family  bonds.  The 
groups  celebrate  the  blood  their  members  share  and 
the  ties  of  kinship  that  bind  them.  They  honor  the 
contributions  of  previous  generations  and  the  debt 
that  is  owed.  Inculcating  such  values  in  the  young 
helps  assure  the  old  that  the  youths  will  fulfill  the 
intergenerational  contract.  In  addition,  in  some  set¬ 
tings,  ethnic  groups  possess  power.  If  a  group  pos¬ 
sesses  a  homeland,  the  elders  are  likely  to  occupy 
political  and  religious  offices.  Being  in  a  position  to 
sanction  youths  who  abscond,  the  elders  may  then 
have  reason  to  believe  that  few  will  actually  do  so 
(Robert  Bates  &  Irene  Yackolev,  2002). 

Public  Goods 

While  ethnic  groups  may  thus  contribute  to  prosper¬ 
ity  by  promoting  the  formation  of  human  capital, 
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they  also  appear  to  impede  it.  They  do  so  when  the 
diversity  of  preferences  among  ethnic  groups 
reduces  a  community’s  ability  to  supply  public 
goods  or  to  share  in  the  costs  of  their  provision.  A 
transport  system,  the  supply  of  electricity  or  water, 
or  a  harbor  or  airport,  each  requires  large  public 
investments.  And  the  costs  per  capita — that  is,  the 
tax  burden — will  be  lower  the  larger  the  size  of  the 
community  that  provides  them.  But  as  the  size  of 
the  population  increases,  so  too  will  the  heteroge¬ 
neity  of  preferences.  One  group  might  prefer  that 
a  road  run  through  its  territory  rather  than  else¬ 
where;  another  may  prefer  that  the  airport  not  be 
located  in  “its  back  yard.”  The  ability  to  apportion 
the  costs  and  benefits  declines  as  the  number  and 
intensity  of  such  disagreements  rises,  making  it 
more  difficult  for  multiethnic  societies  to  contract 
for  the  supply  of  public  services.  Ethnic  diversity  is 
an  important  source  of  such  heterogeneity,  and  a 
great  deal  of  research  has  been  devoted  to  its 
impact.  The  relationship  between  diversity  and 
public  goods  provision  is  “one  of  the  most  power¬ 
ful  hypotheses  in  political  economy”  (Abhijit  Ban- 
erjee,  Lakshmi  Iyer,  &  Rohini  Somanathan,  2005, 
p.  629). 

James  Habyarimana,  Macartan  Humphreys, 
Daniel  N.  Posner,  and  Jeremy  M.  Weinstein 
(2009)  deepen  our  understanding  of  the  relation¬ 
ship  between  ethnicity  and  public  goods  provision. 
Using  experimental  methods  in  a  field  setting,  they 
probe  the  microlevel  linkages  that  underlie  the 
macrolevel  patterns  described  in  this  entry.  While 
Bates  believes  that  Habyarimana  et  al.  (2009)  fail 
sufficiently  to  replicate  conditions  of  external 
threat — conditions  commonly  associated  with  eth¬ 
nic  politics — they  significantly  advance  our  under¬ 
standing  of  the  mechanisms  that  impart  causal 
force  to  ethnic  relations. 

Ethnicity  is  thus  Janus-like.  On  the  one  hand, 
ethnic  groups  provide  governance  structures  that 
promote  private  investments  in  education  and 
migration.  For  individual  families,  then,  they 
strengthen  the  prospects  for  prosperity;  and  for 
societies,  they  strengthen  the  forces  of  economic 
modernization.  On  the  other  hand,  where  there  is 
ethnic  diversity,  this  tends  to  weaken  the  ability  of 
the  state  to  supply  public  goods.  Insofar  as  these 
services  are  themselves  valuable  or  underpin  pri¬ 
vate  production,  ethnic  diversity  lowers  the  level  of 
diversity  and  the  level  of  development. 


Conflict 

In  addition  to  raising  the  costs  of  providing  public 
goods,  ethnic  diversity — it  is  held — poses  an  addi¬ 
tional  threat:  the  prospect  of  conflict.  While  intui¬ 
tively  appealing,  there  is  little  systematic  evidence 
to  support  this  claim.  Rather,  once  again,  a  more 
nuanced  view  emerges  from  the  literature. 

The  equation  of  ethnic  diversity  and  conflict 
stems  from  an  obvious  reality:  In  much  of  the 
developing  world,  where  there  is  conflict,  ethnic 
groups  are  likely  to  be  involved.  But,  as  noted  by 
Bates  and  Yackolev  (2002)  and  James  Fearon  and 
David  Laitin  (1996),  while  ethnic  diversity  may 
characterize  developing  polities,  conflict  remains  a 
rare  event:  Even  where  there  is  a  high  level  of  eth¬ 
nic  diversity,  there  is  often  little  fighting.  The 
implication  is  clear:  As  a  rule,  ethnic  groups 
cohabit  peacefully.  And,  indeed,  cross-national 
regressions  find  no  systematic  relationship  between 
measures  of  ethnic  diversity  and  the  likelihood  of 
civil  conflict. 

In  general,  ethnic  politics  in  the  developing 
world  resembles  that  in  the  major  metropolitan 
areas  in  the  developed  world.  Politicians  draw  on 
ethnic  blocs  of  voters,  extract  enough  resources  to 
offer  enough  patronage  to  retain  their  positions, 
and  bargain  with  others  similarly  positioned. 

Having  set  aside  the  conventional  wisdom  and 
clarified  the  basic  reality,  the  fact  remains  that  eth¬ 
nic  groups  do  fight  each  other  and  that  ethnic 
clashes  number  among  the  most  deadly  forms  of 
civil  conflict.  It  is  important  to  determine,  then,  the 
conditions  under  which  ethnic  politics  turn  violent. 

They  do  so,  it  would  appear,  when  a  group  faces 
the  prospect  of  political  exclusion.  When  in  1962 
politicians  from  the  northern  region  of  Nigeria 
allied  with  those  in  the  western  region,  for  exam¬ 
ple,  the  Ibo  in  the  eastern  region  “stood  alone,”  in 
the  words  of  their  leading  newspaper;  their  isola¬ 
tion  marked  an  important  step  toward  the  civil  war 
that  resulted  from  their  attempted  succession. 
More  data  come  from  India,  where  Steven 
Wilkinson  (2004)  finds  a  systematic  relationship 
between  the  threat  of  exclusion  and  political  vio¬ 
lence  in  municipal  elections.  In  constituencies  that 
offer  multiple  ways  of  forming  majority  coalitions, 
he  finds,  politicians  face  incentives  to  bargain  in 
order  to  construct  a  majority  coalition,  and  the 
composition  of  that  coalition  may  alter  between 
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elections;  when  the  constituency  is  evenly  divided 
and  the  divisions  likely  to  be  long-standing,  how¬ 
ever,  political  leaders  may  seek  to  provoke  inci¬ 
dents  that  will  politicize  and  rally  their  backers. 
The  expected  value  of  the  risky  outcome  from 
violence  may  exceed  the  losses  certain  to  follow 
from  the  exclusion  from  power. 

Some  have  noted  and  stressed  the  spatial  as  well 
as  the  size  distribution  of  ethnic  groups;  others 
have  emphasized  the  extent  to  which  they  differ 
from  each  other,  not  only  culturally  and  linguisti¬ 
cally  but  also  in  their  choice  of  religion  and  their 
incomes.  Such  features  are  difficult  to  summarize 
numerically,  which  hinders  efforts  to  test  their  sig¬ 
nificance  for  ethnic  conflict.  Thus  far,  only  indices 
of  ethnic  polarization  have  been  computed,  and 
they  have  in  fact  been  found  to  be  positively 
related  to  political  conflict. 

Politicians  may  also  provoke  ethnic  violence 
when  the  loss  of  office  would  leave  them  vulnera¬ 
ble  to  reprisals.  This  is  particularly  true,  of  course, 
of  those  who  have  themselves  used  violence  while 
in  office.  Examples  can  be  taken  from  the  period  of 
redemocratization  in  Africa  (roughly  between  1989 
and  1994).  Some,  such  as  Samuel  Doe  of  Nigeria, 
purposively  attacked  the  traditional  rivals  of  his 
own  ethnic  group  so  that  the  latter  would  be 
unable  to  defect  from  his  coalition  and  thus  lose 
the  protection  he,  as  president,  could  offer.  A  simi¬ 
lar  situation  was  seen  in  Rwanda  when  the  Hutu 
elite  appeared  to  have  provoked  retaliatory  attacks 
by  Tutsi  in  an  effort  to  obviate  the  formation  of  a 
cross-ethnic  coalition  that  would  have  threatened 
their  rule.  Statistical  support  is  offered  by  Paul 
Collier  and  Robert  Bates  (2008),  who  find  that 
when  rulers  have  used  violence  to  consolidate  their 
power,  ethnic  diversity  increases  the  likelihood  of 
conflict.  While  there  is  thus  no  universal  relation¬ 
ship  between  ethnic  diversity  and  political  conflict, 
there  appear  to  be  particular  conditions  that  render 
ethnic  violence  significantly  more  likely. 

Conclusion 

Recent  work  has  thus  deepened  our  understand¬ 
ing  of  the  relationship  between  ethnicity  and 
development.  In  light  of  this  work,  the  sweeping 
claims  of  earlier  scholars  (David  Apter,  1963)  and 
contemporary  commentators  (Robert  Kaplan, 
1994)  give  way  to  a  more  nuanced  assessment. 


We  now  know  that  ethnicity  can  both  help  and 
hinder  the  process  of  development  and  also  help 
one  to  have  a  clear  idea  of  the  conditions  under 
which  it  will  do  the  one  or  the  other. 

Robert  H.  Bates 
Harvard  University 
Cambridge,  Massachusetts,  United  States 
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Ethnographic  Methods 


Ethnography  is  a  methodology  with  more  than 
100  years  of  history.  It  originated  in  the  Western 
world  as  a  particular  form  of  knowledge  about 
distant  cultures  (typically  non-Western  ones)  that 
were  previously  largely  unknown  to  the  outside 
world,  having  had  only  fleeting  contacts  or  brief 
conversations  with  some  persons.  After  having 
entered  several  disciplines  of  the  social  sciences 
(anthropology,  sociology,  and  psychology),  in 
recent  times  ethnography  has  become  an  impor¬ 
tant  tool  in  political  science  as  well  (where  it  takes 
the  name  of  political  ethnography ),  as  a  part  of 
interpretive  methodologies  and  research.  This 
entry  discusses  the  origins  and  definitions  of  this 
method,  its  strengths  and  weaknesses,  and  its 
increasing  relevance  for  political  science. 

Competing  Definitions  of  Ethnography 

Defining  a  term  is  always  difficult  because  there  are 
as  many  definitions  as  there  are  different  points  of 
view.  Paul  Atkinson  and  Martyn  Hammersley 
observe  that  the  definition  of  the  term  ethnography 
has  been  subject  to  controversy.  For  some  scholars, 
it  refers  to  a  philosophical  paradigm  to  which  one 
makes  a  total  commitment;  for  others  it  designates 
a  method  that  one  uses  as  and  when  appropriate. 
But  the  controversy  extends  further.  Since  the 
1980s,  the  meaning  of  ethnography  has  been 
expanded  to  such  an  extent  that  it  encompasses 
forms  of  research  that  are  extremely  diverse  from  a 
methodological  point  of  view.  Everything  is  now 
ethnography:  from  life  stories  to  analysis  of  letters 
and  questionnaires,  from  autobiography  to  narra¬ 
tive  analysis,  and  from  action  research  to  perfor¬ 
mance  to  field  research  lasting  from  a  few  days  to 
several  years.  For  example,  leading  scholars  such  as 
James  Lull  and  David  Morley  have  pointed  out  that 


what  passes  as  ethnography  in  cultural  studies  fails 
to  fulfill  the  fundamental  requirements  for  data  col¬ 
lection  and  reporting  typical  of  most  anthropologi¬ 
cal  and  sociological  ethnographic  research. 
Ethnography  has  become  an  abused  buzzword  and 
has  been  diluted  into  a  multitude  of  sometimes  con¬ 
trasting  and  contradictory  meanings,  becoming 
synonymous  with  qualitative  studies. 

Amid  this  increasing  polysemy  of  meanings, 
there  are  at  least  three  terms  that  are  related  to 
ethnography:  participant  observation,  fieldwork, 
and  case  study.  However,  these  should  not  be  con¬ 
founded.  The  term  case  study  denotes  research  on 
a  system  bounded  in  space  and  time  and  embedded 
in  a  particular  physical  and  sociocultural  context. 
Research  is  conducted  using  diverse  methodolo¬ 
gies,  methods,  and  data  sources,  such  as  participant 
observation,  interviews,  audiovisual  materials, 
documents,  and  so  on.  The  term  fieldwork  empha¬ 
sizes  the  continuous  presence  of  the  researcher  in 
the  field,  as  opposed  to  the  grab-it-and-run  meth¬ 
odologies,  such  as  surveys,  in-depth  interviews,  or 
analysis  of  documents  and  recordings.  In  this  case, 
too,  diverse  methodologies  and  methods  may  be 
used.  Finally,  participant  observation  is  a  distinct 
research  strategy.  Participant  observation  and  field¬ 
work  treat  observation  mostly  as  a  mere  technique, 
whereas  the  term  ethnography  stresses  the  theo¬ 
retical  basis  of  such  work  stemming  from  a  particu¬ 
lar  history  and  tradition. 

An  Updated  Definition 

The  stretching  of  the  term  ethnography  has  emp¬ 
tied  it  of  its  original  meaning.  Ethnography  was 
born  as  a  technique  based  on  direct  observation. 
By  contrast,  interviews  and  surveys  are  mainly 
based  on  listening  and  asking  questions.  Of  course, 
it  is  also  essential  in  ethnography  to  listen  to  the 
conversations  of  the  actors  “on  stage,”  read  the 
documents  produced  by  the  organization  under 
study,  ask  people  questions,  and  so  on.  Yet  what 
most  distinguishes  ethnography  from  other  meth¬ 
odologies  is  the  role  of  the  “protagonist”  assigned 
to  observation.  Ethnographic  methodology  com¬ 
prises  two  research  strategies:  nonparticipant 
observation  and  participant  observation.  In  the 
former  case,  the  researcher  observes  the  subjects 
“from  a  distance”  without  interacting  with  them. 
Those  who  use  this  strategy  are  uninterested  in 
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investigating  the  symbolic  sphere,  and  they  make 
sure  not  to  interfere  with  the  subjects’  actions  so 
as  not  to  influence  their  behavior.  Of  course, 
there  are  many  intermediate  situations  between 
the  two  extremes  of  participant  and  nonpartici¬ 
pant  observation. 

Participant  observation  has  the  following  char¬ 
acteristics:  (a)  the  researcher  establishes  a  direct 
relationship  with  the  social  actors,  (b)  staying  in 
their  natural  environment,  (c)  with  the  purpose  of 
observing  and  describing  their  behavior,  (d)  by 
interacting  with  them  and  participating  in  their 
everyday  ceremonials  and  rituals,  and  (e)  learning 
their  code  (or  at  least  parts  of  it)  to  understand  the 
meaning  of  their  actions. 

A  Historical  Sketch 

The  birth  of  ethnographic  methodology  is  com¬ 
monly  dated  to  the  period  between  the  late  19th 
and  early  20th  centuries.  It  was  developed  in  ethnol¬ 
ogy,  a  discipline  that  in  the  first  half  of  the  1800s 
split  from  traditional  anthropology,  which  was  then 
dominated  by  physical  and  biological  concepts. 
Ethnology  was  more  concerned  with  studying  peo¬ 
ples  (through  comparison  of  their  material  artifacts) 
and  their  cultures  and  classifying  their  salient  fea¬ 
tures.  Before  the  advent  of  ethnographic  methodol¬ 
ogy,  ethnologists  did  not  collect  information  by 
means  of  direct  observation;  instead,  they  examined 
statistics,  the  archives  of  government  offices  and 
missions,  documentation  centers,  accounts  of  jour¬ 
neys,  archaeological  finds,  and  native  manufactures 
or  objects  furnished  by  collectors  of  exotic  art,  or 
they  conversed  with  travelers,  missionaries,  and 
explorers.  These  anthropologists  considered  the 
members  of  native  peoples  to  be  “primitives”:  They 
were  savages  to  be  educated,  and  they  could  not  be 
used  as  direct  informants  because  they  could  not  be 
trusted  to  furnish  objective  information. 

Ethnographic  methodology  did  not  suddenly 
erupt  in  anthropology;  rather,  it  developed  gradu¬ 
ally  through  the  work  of  various  authors,  among 
them  the  English  anthropologist  of  Polish  origin 
Bronislaw  K.  Malinowski  (1884-1942),  and  the 
English  anthropologist  Alfred  R.  Radcliffe-Brown 
(1881-1955).  British  social  anthropology  assimi¬ 
lated  the  positivist  intellectual  climate  of  its  time 
and  put  itself  forward,  according  to  Radcliffe- 
Brown  (1948),  as  a  “natural  science  of  society” 


that  was  better  able  to  furnish  an  objective  descrip¬ 
tion  of  a  culture  than  the  other  methods  used  by 
anthropologists  at  the  time.  Radcliffe-Brown’s 
polemic  was  directed  against  the  then  dominant 
speculative  or  “desk”  anthropology,  which  pre¬ 
ferred  to  rely  on  secondary  sources  rather  than  to 
conduct  direct  observation  of  social  facts  (cus¬ 
toms,  rituals,  and  ceremonies)  in  order  to  uncover 
the  “laws”  that  govern  a  society. 

Malinowski  is  commonly  regarded  as  being  the 
first  to  systematize  ethnographic  methodology.  In 
his  famous  Introduction  to  Argonauts  of  the 
Western  Pacific — the  book  that  sets  out  his  research 
conducted  in  the  Trobriand  Islands  of  the 
Melanesian  archipelago  off  eastern  New  Guinea — 
Malinowski  described  the  methodological  princi¬ 
ples  underpinning  the  main  goal  of  ethnography, 
which  is  to  grasp  the  native’s  point  of  view  and  his 
relation  to  life  and  to  realize  his  vision  of  his 
world.  For  this  purpose,  Malinowski  lived  for  sev¬ 
eral  years  during  World  War  I  among  the  Kula  of 
the  Trobriand  Islands.  He  learned  their  language 
(Kiriwinian),  used  natives  as  informants,  and 
directly  observed  the  social  life  of  a  village,  par¬ 
ticipating  in  its  everyday  activities.  Malinowski 
inaugurated  a  view  “from  within”  that  American 
anthropologists  of  the  1950s  would  call  the  “emic” 
perspective — as  opposed  to  the  “etic”  or  compara¬ 
tive  perspective,  which  instead  seeks  to  establish 
categories  useful  for  the  analyst  but  not  necessarily 
important  for  the  members  of  the  culture  studied. 

From  the  1920s  onward,  ethnographic  method¬ 
ology  was  incorporated  into  sociology — where  it 
was  adopted  by  researchers  who  mostly  belonged  to 
the  Chicago  School — and  then  into  psychology  and 
(recently)  political  science.  It  was  imported  from 
anthropology,  however,  for  70  years  earlier  the 
French  mining  engineer  and  later  sociologist  Pierre 
Le  Play  (1806-1892)  had  used  primitive  forms  of 
participant  observation  when  he  had  stayed  with 
the  working-class  families  that  he  was  studying.  The 
English  philanthropist  Seebohm  B.  Rowntree 
(1871-1954)  also  used  early  forms  of  participant 
observation  (after  1886)  for  his  inquiries  into  pov¬ 
erty  and  living  conditions  in  the  London  slums. 

Ethnography  and  Political  Science 

Ethnographic  methods  have  recently  also  entered 
political  science,  long  dominated  by  official  statistics, 
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surveys,  and  comparative  methods.  The  entry  of 
ethnography  has  been  favored  by  two  cultural  and 
theoretical  changes  in  the  discipline:  an  interest  in 
the  “micro”  dimensions  of  political  phenomena 
and  an  openness  to  the  insights  of  qualitative 
research.  These  two  changes  have  been  carried 
forward  by  important  scholars  such  as  Sidney 
Tarrow  (1995)  and  Charles  Tilly  (2001),  who  pro¬ 
posed  bridging  qualitative  and  quantitative 
approaches  by  means  of  “triangulation”  to  explore 
the  micro-macro  link,  considering  that  a  purely 
quantitative  approach  fails  to  uncover  the  causal 
mechanisms  of  political  processes. 

As  said  with  regard  to  other  disciplines,  the  term 
ethnography  has  assumed  a  variety  of  meanings  in 
political  science,  and  it  has  become  largely  synony¬ 
mous  with  fieldwork.  The  work  of  Jeremy  Weinstein 
(2007)  on  political  violence  and  civil  wars  and 
Elizabeth  Jean  Wood  (2000)  on  democratic  transi¬ 
tions  in  South  Africa  and  El  Salvador  are  examples 
of  this  kind.  Drawing  mainly  on  narrative  inter¬ 
views  (with  the  addition  of  some  observations, 
official  statistics,  and  government  documents), 
these  authors  have  sought  to  reach  the  experiences, 
the  subjective  perspectives,  and  the  points  of  view 
of  people  involved  in  violent  actions  and  demo¬ 
cratic  transitions. 

Weinstein  attempted  to  uncover  specific  causal 
mechanisms  (why  some  rebel  groups  decide  to  use 
indiscriminate  violence  against  civilians)  by  going 
beyond  traditional  quantitative  studies,  which 
explain  this  phenomenon  of  violence  through  mac¬ 
rovariables  such  as  income  and  so  on.  He  analyzed 
the  inner  dynamics  (the  recruitment  of  members 
and  the  inner  hierarchical  structure)  of  rebel  groups 
in  Peru,  Mozambique,  and  Elganda.  He  found  that 
indiscriminate  violence  against  civilians  was  com¬ 
mitted  mainly  by  rebel  groups  with  external  finan¬ 
cial  resources,  such  as  those  deriving  from  drug 
trafficking,  foreign  aid,  and  so  on.  Consequently, 
these  groups  did  not  need  or  seek  civilians’  involve¬ 
ment  in  and  consensus  on  their  political  actions. 

Wood  noted  that  South  Africa  and  El  Salvador 
are  very  different  when  structural  variables  (eco¬ 
nomic  development,  race  composition,  etc.)  are 
considered  but  quite  similar  when  the  link  between 
the  political  elite  and  the  economic  elite  is  exam¬ 
ined.  She  also  focused  on  the  bottom-up,  violent 
mobilization  of  workers,  pointing  out  that  this 
produces  an  increase  in  costs  affecting  the  entire 


production  system  (and,  consequently,  the  eco¬ 
nomic  elite) — strikes,  vandalism,  economic  uncer¬ 
tainty,  a  decrease  in  foreign  capital  inflows,  and  so 
on.  To  resolve  the  situation,  the  economic  elites 
push  for  reform  of  the  authoritarian  regime  and 
the  economic  and  productive  system  that  supports 
this  form  of  government.  Wood  pinpoints  every¬ 
day  political  processes,  the  impact  of  local  actions 
on  national  politics,  and  the  effects  of  micro-events 
on  macrophenomena. 

In  addition  to  studies  based  on  fieldwork,  now 
appearing  in  political  science  are  real  ethnogra¬ 
phies,  such  as  the  work  of  Adam  Ashforth  (2005) 
on  violence  and  democracy  in  South  Africa. 
Ashforth,  a  White  American,  spent  3  years  as  a 
guest  of  a  family  in  Soweto,  the  well-known  Black 
township  on  the  outskirts  of  Johannesburg.  During 
this  time,  Ashforth  realized  that  it  would  be 
extremely  difficult  to  understand  local  politics 
(macro)  without  considering  witchcraft  (micro) 
and  its  role  in  social  relations  (meso).  Through 
participation  in  the  everyday  life  of  the  community, 
he  acquired  the  conceptual  categories,  the  constel¬ 
lation  of  meanings,  and  the  culture  of  Soweto’s 
residents.  He  learned  that  witchcraft  beliefs  were 
remedies  for  the  uncertainty  and  insecurity  of 
everyday  life;  that  envy  and  jealousy  produced  the 
social  conditions  for  the  success  of  witchcraft;  and 
that  the  latter  shaped  relations  among  individuals, 
social  groups,  and  political  institutions  on  issues 
such  as  the  spread  of  AIDS,  its  social  consequences, 
and  health  policies.  He  also  discovered  the  effects 
of  the  building  of  democracy  on  community  mem¬ 
bers,  their  acceptance  of  violence,  the  shape  of  the 
concept  of  social  justice,  and  the  affirmation  of  a 
modern  democratic  and  liberal  state. 

The  Added  Value  of  Ethnography 

As  mentioned  before,  ethnographic  methodology 
gives  priority  to  observation  as  its  primary  source 
of  information.  This  purpose  is  also  served,  in  a 
secondary  and  ancillary  manner,  by  other  sources 
of  information  used  by  the  ethnographer  in  the 
field:  informal  conversations,  individual  or  group 
interviews,  and  documentary  materials  (diaries,  let¬ 
ters,  class  essays,  organizational  documents,  newspa¬ 
pers,  photographs,  and  audiovisual  aids).  However, 
the  overriding  concern  is  always  to  observe  actions 
as  they  are  performed  in  concrete  settings.  As  John 
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Heritage  notes,  if  one  is  interested  in  action,  the 
statements  made  by  social  actors  during  interviews 
cannot  be  treated  as  an  appropriate  substitute  for  the 
observation  of  actual  behavior.  In  fact,  there  is  an 
often  documented  gap  between  attitudes  and  behav¬ 
iors,  between  what  people  say  and  what  they  do. 
Ethnography  is  therefore  the  privileged  methodology 
with  which  to  analyze  and  understand  social  and 
political  actions  in  their  making. 

The  presence  of  the  researcher  in  the  field  enables 
him  or  her  to  gain  better  understanding  of  the  con¬ 
ceptual  categories  of  social  actors,  their  points  of 
view  (emic),  the  meanings  of  their  actions  and 
behavior,  and  social  and  political  processes.  As 
Javier  Auyero  (2006)  maintains,  political  ethnogra¬ 
phy  highlights  the  aspects  neglected  by  quantitative 
analysis,  such  as  the  impact  of  micropolitics  on 
macrophenomena,  the  complexity  of  everyday  life, 
the  network  of  participants’  meanings,  their  motiva¬ 
tions,  the  making  of  political  action,  and  the  prac¬ 
tices  of  politics.  As  Charles  Tilly  (2006)  notes,  eth¬ 
nographic  methods  are  useful  for  the  analysis  of 
political  phenomena  consisting  not  of  macrostruc¬ 
tures  and  fixed  roles  but  of  interactions  among 
participants,  families,  and  small  groups,  emancipat¬ 
ing  an  inquiry  from  the  ethnocentrism  (etic)  that  still 
characterizes  many  explanations  of  political  science. 

Ethnography  and  Its  Enemies 

Notwithstanding  the  acknowledged  usefulness  of 
ethnography,  however,  it  is  still  subject  to  several 
well-known  stereotypes  and  prejudices. 

Is  Ethnography  a  Highly  Subjective  Method ? 

It  is  often  argued  that  ethnography  is  a  purely 
subjective  method,  in  the  sense  that  it  is  very  sensi¬ 
tive  to  the  researcher’s  attitudes  and  perceptions.  In 
other  words,  if  different  researchers  visit  the  same 
setting,  they  will  see  different  things,  and  their  eth¬ 
nographic  notes  will  record  different  aspects. 
Instead,  a  questionnaire  or  a  discursive  interview,  if 
conducted  correctly,  is  more  likely  to  obtain  similar 
replies  (reliability)  regardless  of  who  the  interviewer 
is.  And  yet  experience  shows  that  this  idea  has  weak 
empirical  foundations.  For  example,  some  time 
ago,  Giampietro  Gobo’s  students  conducted  an  eth¬ 
nographic  study  in  a  pub.  Two  groups  (formed  of 
three  students  each)  visited  the  same  pub  a  few  days 


apart.  The  fact  that  they  had  chosen  the  same  pub 
was  absolutely  coincidental,  in  the  sense  that  they 
had  not  agreed  on  it  beforehand.  Nevertheless,  the 
two  groups  had  a  specific  research  design:  to  study 
the  rituals,  ceremonials,  and  behaviors  of  consump¬ 
tion  in  pubs.  They  then  produced  a  report.  Gobo 
discovered,  to  his  great  surprise,  that  they  had 
observed  and  discovered  practically  the  same  things. 
Therefore,  the  research  design  makes  a  greater  con¬ 
tribution  to  discovery  (or  construction  of  data)  than 
does  the  researcher  himself  or  herself.  Ethnography, 
therefore,  is  anything  but  a  highly  subjective  meth¬ 
odology  (even  if  subjectivity  is  ever  present,  as  in  all 
methodologies). 

Behaviors  Are  More  Consistent 
Than  Attitudes  and  Opinions 

What  does  the  experience  just  described  tell  us 
theoretically?  In  other  words,  why  did  six  different 
observers  in  the  same  pub  notice  practically  the 
same  things?  Because  what  ethnography  mainly 
observes  are  behaviors  (rituals,  routines,  and  cere¬ 
monials),  and  these  are  much  more  stable  over  time 
than  are  attitudes  and  opinions  (the  privileged  fields 
of  inquiry  for  discursive  interviews  and  surveys),  as 
shown  by  Richard  La  Piere’s  well-known  experi¬ 
ments  in  the  early  1930s.  Those  who  deal  with 
organizations  know  very  well  that  altering  a  behav¬ 
ior  requires  more  time  than  altering  an  attitude,  not 
to  mention  opinions,  which  are  sometimes  so  vola¬ 
tile  that  they  change  from  one  day  to  the  next. 

Can  Ethnographic  Research 
Be  Replicated,  Reproduced ? 

From  this  it  follows  that,  because  behaviors  are 
temporally  rather  stable,  the  results  of  ethno¬ 
graphic  research  can  be  repeated  and  reproduced, 
with  two  requirements: 

1.  the  presence  of  a  precise  research  design  that 
has  guided  the  research  and 

2.  no  significant  changes  between  one  study  and 
the  next. 

Ethnography  and  Generalization 

A  recurrent  criticism  made  of  ethnographic  meth¬ 
ods  is  that  their  results  are  impossible  to  generalize 
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because  they  are  based  on  few  cases,  sometimes  on 
only  one.  However,  there  are  numerous  disciplines 
that  work  on  a  limited  number  of  cases:  for 
instance,  paleontology,  archaeology,  geology, 
ethology,  biology,  astrophysics,  history,  genetics, 
anthropology,  linguistics,  cognitive  science,  and 
psychology  (whose  theories  are  largely  based  on 
experiments  and  therefore  on  research  conducted 
on  nonprobabilistic  samples  and  on  few  cases). 
According  to  Howard  Becker,  these  disciplines  are 
unconcerned  about  their  use  of  only  a  handful  of 
cases  to  draw  inferences  and  generalizations  about 
thousands  of  people,  animals,  plants,  and  other 
objects.  Moreover,  science  studies  the  individual 
object/phenomenon  not  in  itself  but  as  a  member 
of  a  broader  class  of  objects/phenomena  with  par¬ 
ticular  characteristics/properties  (Geoff  Payne  & 
Malcolm  Williams,  2005;  Williams,  2000). 

For  these  reasons,  it  is  anything  but  odd  to  think 
that  the  results  of  ethnographic  research  cannot  be 
generalized.  As  Randall  Collins  stated,  much  of  the 
best  work  in  sociology  has  been  carried  out  using 
qualitative  methods  without  statistical  tests. 
This  has  been  true  of  research  areas  ranging  from 
organizational  and  community  studies  to 
microstudies  of  face-to-face  interaction  and  mac¬ 
rostudies  of  the  world  system.  In  addition,  if  the 
focus  of  ethnography  is  on  behaviors,  and  given 
that  these  are  stable  in  time,  then  it  is  likely  that 
generalizations  are  possible.  Obviously,  precise 
criteria  must  be  followed  in  the  choice  of  samples. 
Nevertheless,  ethnography  is  not  precluded  from 
making  generalizations. 

Sampling  Cases  or  Instances 

It  will  by  now  be  clear  that  the  term  case  is  used 
ambiguously  in  ethnographic  research.  In  surveys 
and  discursive  interviews,  the  cases  correspond  to 
the  number  of  persons  interviewed  (the  sample)  and 
who  are  usually  interviewed  only  once.  Indeed,  it  is 
rather  rare  for  several  interviews  to  be  conducted 
with  the  same  person  (during  a  single  piece  of 
research).  Then,  statistical  calculations  and  analyses 
of  the  interviews  are  performed  on  these  cases. 

Ethnographic  research  is  very  different.  What  is 
usually  referred  to  as  the  “case”  (the  organization 
or  the  group  studied)  is  in  fact  the  setting.  The 
cases  are  instead  the  hundreds  of  instances  (per¬ 
taining  to  rituals,  ceremonials,  and  routines)  that 
the  researcher  observes  or  the  dozens  of  individuals 


that  he  or  she  meets  dozens  of  times  during  his  or 
her  presence  in  the  field.  The  researcher  is  not 
interested  in  the  organization  (or  the  group)  per  se 
but  rather  in  the  behaviors  that  take  place  within 
it.  Consequently,  to  not  create  confusion  with  the 
other  methodologies,  it  would  be  better  in  ethno¬ 
graphic  research  to  abandon  the  term  case  and 
replace  it  with  that  of  instance. 

The  Future  of  Ethnography 

Even  if  ethnography  is  a  methodology  with  more 
than  100  years  of  history,  it  is  perceived  (especially 
in  political  science)  as  a  new  method.  The  reason 
why  ethnography  is  now  becoming  so  fashionable, 
probably,  concerns  not  its  inner  features  but  the 
sociohistorical  period  in  which  we  live.  If  the  rela¬ 
tionship  between  science  and  society  (on  how  social 
beliefs  have  influenced  knowledge  and  scientific 
theories  and  vice  versa)  has  been  widely  proven  and 
if  also  the  relationship  between  technology  and 
society  (on  how  the  birth  of  certain  artifacts,  e.g., 
the  bicycle  or  the  personal  computer,  are  related  to 
the  type  of  society  that  has  produced  them)  has  been 
tested,  it  seems  necessary  to  enlarge  these  perspec¬ 
tives  by  acknowledging  the  relationship  between 
society  and  research  methods — the  interdependence 
among  social  beliefs  and  methodologies. 
Ethnography  and  society  are  mutually  constitutive: 
On  the  one  hand,  ethnography  (like  surveys  and  in- 
depth  interviewing  in  a  previous  age)  requires  a 
particular  type  of  society  for  it  to  come  into  being 
and  develop;  on  the  other  hand,  these  research 
methods  strengthen  the  society  that  has  produced 
them.  We  are  probably  entering  the  “observation 
society” — a  social  formation  in  which  watching  and 
scrutinizing  are  becoming  the  dominant  cognitive 
modes  alongside  the  others,  such  as  listening,  feel¬ 
ing,  hearing,  and  eavesdropping,  typical  of  the 
“interview  society”  from  which  survey  methodol¬ 
ogy  originated.  This  phenomenon  also  accounts 
for  the  increasing  demand  in  various  sectors  of 
society — from  marketing  to  security  and  from  tele¬ 
vision  to  the  fashion  industry — for  observation  and 
ethnography.  Ours  are  becoming  observation  soci¬ 
eties.  Amid  the  increasing  popularity  of  interpretive 
methods,  driven  by  epistemological  approaches 
such  as  constructivism,  postmodernism,  feminism, 
and  relativism,  political  ethnography  is  in  a  position 
to  play  a  major  role.  The  emic  perspective,  which 
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aims  to  capture  the  social  actors’  points  of  view, 

perspectives,  meanings,  motivations,  and  emotions, 

has  an  important  place  in  political  science  methods. 

Giampietro  Gobo 
Universita  degli  Studi  di  Milano 
Milan,  Italy 
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Europe  as  an  International 
Actor 


Considering  Europe  as  an  international  actor  con¬ 
tinues  to  be  a  contested  perspective  in  the  study  of 
European  politics  and  international  relations,  even 
though  scholars  and  decision  makers  increasingly 
see  Europe  as  an  important  player  within  the 
global  political  system.  Thus,  the  meaning  of  the 
phrase  Europe  as  an  international  actor  needs  a 
few  clarifications  to  avoid  misunderstandings  and 
disputes.  Actor  is  not  a  controversial  concept  in 
political  science.  Broadly  speaking,  it  means  a  unit 


(person,  group,  or  organization)  able  to  select  and 
carry  out  actions  aimed  at  influencing  the  subjects, 
institutions,  and  policies  of  the  political  system. 
Accordingly,  international  actor  is  the  unit  capable 
of  making  decisions  aimed  at  influencing  other 
international  actors  and  the  institutions  and  poli¬ 
cies  of  the  international  political  system.  States  as 
well  as  international  organizations,  nongovern¬ 
mental  organizations,  and  associations  of  various 
kinds  are  considered  to  be  international  actors — 
that  is,  units  having  attributes  such  as  rationality 
and  goal  orientation,  autonomy,  and  the  control  of 
resources  and  instruments  for  acting  internation¬ 
ally.  Consequently,  considering  Europe  as  an  inter¬ 
national  actor  means  recognizing  that  Europe  has 
the  attributes  necessary  to  influence  by  intention 
the  actors,  institutions,  and  policies  of  the  interna¬ 
tional  system.  However,  Europe  is  an  international 
actor  as  far  as  the  European  countries  agree,  volun¬ 
tarily  and  by  using  institutionalized  mechanisms  of 
decision,  to  act  cohesively  for  influencing  the  inter¬ 
national  system.  However,  this  condition  applies 
only  to  the  27  countries  that  are  members  of  the 
European  Union  (EU).  Therefore,  in  this  discourse, 
Europe  is  the  same  as  the  EU.  In  fact,  the  EU  mem¬ 
ber  states  are  able  to  make  decisions  on  common 
actions  toward  the  world  system  because  they  have 
the  same  image  of  the  external  world,  agree  on  a 
cluster  of  common  international  values,  share 
important  international  interests,  and  pursue  com¬ 
mon  external  goals.  Furthermore,  they  have  cre¬ 
ated  policy-making  offices  and  mechanisms  in  the 
areas  of  international  politics  and  economic  exter¬ 
nal  relations  and  are  determined  to  improve  these 
offices  and  mechanisms,  expand  the  international 
action  capabilities  of  the  common  institutions,  and 
make  efforts  for  achieving  better  external  policies. 
Finally,  the  international  role  and  actions  of  the  EU 
can  count  on  the  broad  support  of  international 
actors,  such  as  states,  international  organizations, 
and  nongovernmental  organizations. 

A  Europe  of  Common  Values  and  Interests 

How  can  one  explain  this  common  pursuit  of  val¬ 
ues  and  interests  on  the  part  of  the  states  belong¬ 
ing  to  the  EU?  Interdependence  forces  states  to 
cooperate  in  order  to  produce  compatible  rules 
and  policies  regarding  common  problems  such  as 
environment  degradation,  transnational  crime, 
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migration,  public  security,  and  so  on.  This  pres¬ 
sure  is  especially  urgent  in  the  case  of  states  that, 
like  the  European  ones,  are  in  general  small-  and 
medium-sized  countries  placed  in  a  rather  small 
continental  space;  have  an  economy  highly  depen¬ 
dent  on  the  world  market;  and  share  a  democratic 
regime  controlled  by  people  and  the  media.  This 
pressure  drives  governments  to  coordinate  domes¬ 
tic  and  foreign  policies  in  order  to  implement  the 
logic  of  scale.  Generally  speaking,  the  larger  the 
scale  of  execution  of  political  decisions,  the  more 
appropriate  and  efficient  will  be  the  policy  solu¬ 
tions  of  the  common  problems.  The  European 
governments  react  rather  swiftly  to  the  requisites 
of  the  politics  of  scale  because  they  know  well  the 
benefits  of  such  a  strategy  due  to  the  long  process 
of  economic  coordination  and  integration  that  they 
have  passed  through  in  the  past  50  years.  Thus,  the 
European  resolve  to  become  an  international  actor 
is  founded  on  an  awareness  of  international  inter¬ 
dependence  and  the  perceived  success  of  joint  cus¬ 
tom,  trade,  monetary,  and  other  economic  and 
sectoral  policies. 

However,  not  all  analysts  agree  on  this  interpre¬ 
tation  of  the  European  ambition  of  becoming  an 
international  actor.  Some  of  them  underline  the 
complexity  and  consequent  slowness  of  the  build¬ 
ing  process  of  the  common  foreign  and  security 
policy  (CFSP).  These  analysts  concede  only  that  the 
EU  is  an  international  actor  in  the  making,  not  an 
existing  one.  Others  question  the  capacity  of  mem¬ 
ber  states  in  building  a  common  foreign  policy  and 
point  to  the  national  differences  that  make  it  almost 
impossible  for  the  EU  to  solve  existing  policy  differ¬ 
ences  viewed  in  the  short  and  medium  term.  To 
other  analysts,  the  success  of  an  international  actor 
depends  on  its  economic  strength  as  well  as  on  the 
appropriateness  and  consistency  of  its  strategy  in 
global  economic  negotiations.  These  analysts  hold 
that  the  EU’s  role  as  an  international  actor  is  mostly 
dependent  on  the  strength  of  the  external  economic 
relations  of  the  EU  and  is  restricted  to  this  area  of 
international  relations.  Finally,  some  analysts  claim 
that  Europe’s  attempt  to  be  an  actor  in  interna¬ 
tional  politics  has  been  linked  to  Europe’s  ability  to 
perform  as  a  civilian  power  in  the  diffusion  and 
defense  of  new  values  such  as  environmentalism, 
cultural  pluralism,  and  human  rights.  These  ana¬ 
lysts  point  to  the  potentially  negative  consequences 
of  current  attempts  to  establish  a  common  security 


and  defense  policy  (ESDP)  for  the  international 
prestige  and  leverage  that  the  EU  has  achieved 
thanks  to  its  role  as  a  civilian  power,  advising 
European  leaders  that  the  best  way  to  exert  influ¬ 
ence  as  an  international  actor  is  to  limit  themselves 
to  exerting  normative  power. 

This  criticism  notwithstanding,  the  convergence 
of  the  EU  member  states  toward  an  important  set 
of  international  values  and  goals  is  apparent  in 
recent  EU  treaties.  This  convergence  has  made  pos¬ 
sible  the  construction  of  a  European  conception  of 
the  world  system  and  the  consequent  ascription  to 
the  EU  of  the  role  of  promoter  of  a  definite  set  of 
international  values  and  interests.  In  the  treaties  of 
Nice  and  Lisbon,  the  signing  governments  proceed 
from  a  pluralist  and  communitarian  image  of  the 
world  system  and  represent  the  EU  as  an  actor 
engaged  in  defending  values  related  to  this  image. 
Europe’s  image  of  the  world  is  a  pluralist  one 
because,  in  addition  to  states,  entities  of  a  different 
kind — that  is,  individuals,  peoples,  and  nongovern¬ 
mental  organizations  and  associations — are  pre¬ 
sented  as  important  actors  within  the  world  system. 
Furthermore,  the  defense  of  new,  nonstate  values  is 
considered  as  equal  in  importance  to  the  preserva¬ 
tion  of  state  values.  In  a  world  encompassing  indi¬ 
viduals,  organizations,  peoples,  and  states  as  equals, 
communitarian  solidarity  and  mutual  respect 
among  all  the  subjects  must  be  promoted.  States,  in 
particular,  are  called  on  to  respect  the  communitar¬ 
ian  principle  of  mutual  recognition  by  all  the  sub¬ 
jects  and  therefore  to  rigorously  adhere  and  con¬ 
tribute  to  the  development  of  international  law  and 
the  principles  of  the  United  Nations  (UN)  Charter. 
In  harmony  with  this  pluralist  and  communitarian 
view,  the  European  governments  see  the  EU  as  a 
legitimate  international  actor  that  wants  to  defend 
different  values  such  as  peace  and  security,  sustain¬ 
able  development,  free  and  just  trade,  elimination 
of  poverty,  and  the  defense  of  all  human  rights. 

Furthermore,  the  promotion  and  defense  of  the 
European  identity,  territorial  integrity,  and 
EU-specific  values  and  interests  are  also  claimed  by 
the  European  leaders  to  be  goals  of  European 
international  action.  In  other  words,  the  EU  wants 
to  participate  in  a  world  of  peoples,  individuals, 
and  states  also  in  order  to  promote  its  own  values 
by  all  the  means  at  its  disposal.  Hence,  on  the  one 
hand,  the  EU’s  pluralist  and  communitarian  image 
of  the  world  is  close  to  the  conception  of  Europe 
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as  a  civilian  and  normative  power.  On  the  other 
hand,  this  does  not  exclude  a  vigorous  defense  of 
European  interests  and  specific  values.  Consequently, 
the  formal  image  of  the  world  contained  in  the  trea¬ 
ties  must  be  complemented  by  adding  the  position 
the  EU  leaders  affirmed  in  the  document  A  Secure 
Europe  in  a  Better  World,  the  so-called  European 
Security  Strategy,  issued  in  2003,  as  a  result  of  the 
agreed  decision  to  build  the  common  defense  policy. 
In  this  document,  the  EU  forcefully  declared  a  pref¬ 
erence  for  effective  multilateralism  as  the  funda¬ 
mental  instrument  of  any  international  action  aimed 
at  fostering  the  development  of  a  stronger  interna¬ 
tional  society,  the  efficient  operation  of  interna¬ 
tional  institutions,  and  an  international  order  based 
on  international  law.  As  will  be  seen  below,  multi¬ 
lateralism  is,  indeed,  the  blueprint  for  European 
strategies  of  action  in  the  international  system. 

Europe  as  a  Security  Actor 

The  process  of  convergence  toward  shared  interna¬ 
tional  interests  is  less  definite  and  narrower  in 
scope  than  the  convergence  of  values.  Strengthening 
international  stability  and  containing  conflict  are 
interests  shared  by  all  the  European  states.  In  this 
view,  creating  conditions  for  the  security  of  all  the 
member  states  contributes  to  consolidating  the 
common  economic  growth  that  continues  to  be 
the  paramount  objective  of  European  unification. 
In  addition,  European  states  accept  that  acting 
together  as  a  single  international  actor,  especially  in 
areas  close  to  Europe,  is  more  efficient  and  produc¬ 
tive  than  acting  individually;  this  not  only  provides 
for  more  resources,  but  neighboring  countries  are 
also  less  apprehensive  of  joint  external  EU  actions 
than  of  the  actions  of  individual  states.  Finally, 
convergent  interests  are  promoted  by  globalization 
and  interdependence  in  as  much  as  these  processes 
extend  the  international  public  space  well  beyond 
diplomatic  and  economic  matters.  Contemporary 
international  relations  encompass  a  plurality  of 
dimensions  such  as  the  physical  environment, 
human  rights,  transnational  labor,  and  crime  orga¬ 
nizations;  these  cannot  be  dealt  with  efficaciously 
by  means  of  independent  national  policies.  Thus, 
the  logic  of  scale  is  clearly  perceived  by  all  the 
European  countries  also  in  the  area  of  foreign  pol¬ 
icy.  Consequently,  the  EU  governments  normally 
agree  to  discuss  and  negotiate  among  themselves 


first  in  order  to  reach  a  joint  position  on  issues  of 
world  politics.  In  short,  having  a  joint  European 
position  at  international  conferences  is  viewed  as 
the  most  feasible  way  for  advancing  the  interests  of 
each  EU  member  state. 

However,  this  pressure  for  interest  convergence 
notwithstanding,  the  administrative  and  diplo¬ 
matic  structures  of  foreign  policy  making  of  the 
member  states  have  not  been  dismantled  and  are 
far  from  being  dismantled,  for  the  reason  that 
European  governments  want  to  preserve  their  own 
instruments  for  promoting  and  defending  those 
national  interests  that  they  consider  as  not  fully 
compatible  with  the  common  interests  as  defined 
by  the  EU  itself.  Consequently,  in  the  medium  to 
long  term,  EU  international  actions  will  be  the 
output  of  what  has  been  called  European  foreign 
policy.  This,  indeed,  is  the  sum  of  three  elements: 
(1)  the  common  policy  of  external  economic  rela¬ 
tions,  (2)  the  formal  dimension  of  the  CFSP  and 
ESDP,  and  (3)  the  national  foreign  policies  that,  at 
the  same  time,  are  independent  from  one  another 
but  under  the  institutional  influence  of  the  CFSP/ 
ESDP  and  the  views  of  other  member  states. 

Europe  as  an  International  Actor  in  Practice 

Within  this  framework  that  encompasses  shared 
international  values,  a  common  image  of  the 
world,  and  the  converging  international  interests 
of  the  European  countries,  this  entry  now  discusses 
the  presence  of  Europe  as  an  international  actor  in 
the  world  system  by  reviewing  Europe’s  strategy 
and  actions  in  crucial  areas  of  contemporary  inter¬ 
national  relations,  such  as  the  politics  of  regions 
and  interregional  affairs,  the  UN,  and  world  eco¬ 
nomic  affairs. 

The  process  of  integration  that  sustained  the 
development  of  the  EU  over  the  past  50  years  and 
the  pan-European  cooperation  process  known  as  the 
Helsinki  Process,  lasting  for  about  30  years,  from 
1971  to  the  end  of  the  past  century,  is  the  seminal 
experience  that  created  a  European  preference  for 
the  so-called  global  approach  to  the  solution  of 
problems  at  the  regional  and  region-to-region  levels. 
Over  the  past  50  years,  European  countries  have 
recognized  that  peaceful  relations  and  cooperation 
among  neighboring  countries  can  arise  from  the  bal¬ 
anced  and  multidimensional  management  of  politi¬ 
cal,  economic,  and  social  interactions.  On  this 
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premise,  European  governments  constructed  the 
principle  that  any  cooperative  program  must  be 
“global” — that  is,  it  must  encompass  the  political 
and  security  dimensions;  the  economic  and  finan¬ 
cial  dimensions;  and  the  social,  cultural,  and  human 
dimensions.  The  myth  of  European  civil  and  nor¬ 
mative  power — articulated  by  academics  and 
encouraged  by  the  EU  institutions  and  officers — is 
also  at  the  root  of  the  vision  of  what  European 
foreign  policy  values  are  and  what  drives  the  EU’s 
role  as  an  international  actor.  Indeed,  the  global 
approach  lies  at  the  core  of  the  EU  grand  strategy 
as  an  international  actor.  The  late  Euro- 
Mediterranean  Partnership,  now  superseded  by 
the  Union  for  the  Mediterranean,  the  European 
Neighbourhood  Policy,  and  the  region-to-region 
cooperation  programs,  such  as  ASEM  (Asia-Europe 
Meeting)  and  the  EU-Gulf  Cooperation  Council 
relations,  are  all  examples  of  this  style  of  interna¬ 
tional  behavior.  Although  the  results  of  this  strat¬ 
egy  remain  meager,  Europe  can  present  itself  as  a 
new  kind  of  actor,  to  distinguish  it  from  the  United 
States.  In  fact,  the  American  government  has  a 
preference  for  the  strategy  of  bilateral  relations  and 
the  use  of  specific  problem  measures  to  manage 
international  problems,  while  Europe  has  a  strong 
preference  for  multilateral  agreements  and  the 
global  approach  to  manage  cooperation  programs 
with  other  countries.  In  the  past  10  years,  however, 
European  governments  have  recognized  that  change 
in  the  area  of  international  security,  and  also  in  the 
European  region,  demands  the  adoption  of  an 
assertive  policy  extending  to  military  capabilities. 
The  experience  of  the  relations  with  Serbia  on  the 
occasion  of  the  Kosovo  crisis  in  1995,  after  the 
negative  experience  of  the  European  failure  to 
manage  the  collapse  of  the  Yugoslav  federation, 
caused  Europe  to  face  security  problems  provoked 
by  low-intensity  conflicts  that  have  had  important 
destabilizing  consequences  in  nearby  areas.  For  this 
reason,  Europe  accelerated  the  formation  of  the 
common  defense  policy,  launched  the  European 
security  strategy,  and  established  military  and  civil 
capabilities  of  crisis  and  conflict  management  as 
additional  instruments  of  European  international 
action. 

This  European  resolve  to  act  militarily  in  the 
form  of  crisis  management  and  peace  support 
operations  is  relevant  also  to  the  European  pres¬ 
ence  in  the  UN.  The  European  Community 


obtained  observer  status  in  the  UN  General 
Assembly  in  1974.  In  subsequent  years,  this  status 
was  obtained  also  in  many  UN  agencies,  such  as 
the  Office  of  the  High  Commissioner  for  Human 
Rights  (OHCHR),  Office  of  the  United  Nations 
High  Commissioner  for  Refugees  (UNHCR), 
United  Nations  Conference  on  Trade  and  Develop¬ 
ment  (UNCTAD),  International  Labour  Organiza¬ 
tion  (ILO),  World  Health  Organization  (WHO), 
United  Nations  Educational,  Scientific  and  Cultural 
Organization  (UNESCO),  United  Nations  Environ¬ 
ment  Programme  (UNEP),  United  Nations  Devel¬ 
opment  Programme  (UNDP),  United  Nations 
World  Food  Programme  (WFP),  United  Nations 
Office  on  Drugs  and  Crime  (UNODC),  and  Inter¬ 
national  Atomic  Energy  Agency  (IAEA).  Thanks  to 
the  exclusive  competence  the  European  treaties 
gave  to  the  EU  in  various  policy  areas,  the  EU 
became  a  member  of  the  Food  and  Agriculture 
Organization  of  the  UN  (FAO;  as  the  European 
Community)  in  1991,  participates  in  many  UN- 
specialized  conferences,  and  has  signed  more  than 
50  conventions  of  the  UN.  On  the  whole,  the 
importance  of  the  EU  and  its  member  states  in  the 
UN  system  is  remarkable.  Although  the  27  votes  of 
the  EU  member  states  are  one  eighth  of  all  the 
General  Assembly  votes,  the  contribution  rate  of 
the  EU  states  to  many  areas  of  the  UN  system  is 
larger  than  the  vote  rate.  In  the  financial  sector,  for 
instance,  EU  states  contribute  about  38%  of  the 
UN  ordinary  budget,  50%  of  the  contributions  to 
special  funds  and  programs,  and  40%  of  the  UN 
peace  operations  costs. 

The  EU’s  proactive  attitude  toward  the  UN 
showed  a  significant  upward  trend  in  the  early 
2000s.  In  2001,  the  European  Commission 
released  the  Communication  on  Building,  an  effec¬ 
tive  partnership  with  the  UN  in  the  fields  of  devel¬ 
opment  and  humanitarian  affairs.  In  May  2003,  a 
financial  and  administrative  framework  agree¬ 
ment  was  signed  by  the  EU  and  UN.  In  2003,  the 
Commission  released  the  communication  on  the 
EU  and  the  UN:  the  Choice  of  Multilateralism, 
which  made  public  the  areas  in  which  the  Union 
was  ready  to  engage  itself  as  a  UN  partner.  In 
September  2003,  the  European  Council  and  the 
UN  signed  the  Joint  Declaration  on  cooperation  in 
crisis  management.  This  document  is  proof  of  the 
importance  the  EU  governments  give  to  ESDP  as 
an  instrument  of  European  action  in  world  affairs 
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and  multilateralism.  Finally,  it  is  worth  remember¬ 
ing  that  in  addition  to  the  EU’s  role  in  economic  and 
trade  multilateralism,  there  is  one  other  area  in 
which  European  action  in  the  UN  and  multilateral 
governance  is  very  important.  This  is  the  area  of 
environment  protection,  within  which  the  EU  has 
taken  on  a  leadership  role,  even  in  confrontation 
with  the  United  States,  in  many  negotiations  on  cru¬ 
cial  matters  such  as  those  relating  to  the  ozone  layer, 
climate  change,  desertification,  and  biodiversity. 

In  the  UN  agencies  and  programs,  the  EU  pres¬ 
ence  has  been  the  responsibility  of  the  European 
Commission,  but  the  role  of  the  presidency  has 
been  very  important  in  coordinating  the  position  of 
the  member  states  and  in  representing  the  EU  in 
various  forms,  such  as  issuing  official  declarations 
and  making  demarches  toward  third  countries. 
Actually,  change  in  the  responsibility  of  driving  the 
European  presence  in  the  UN  is  an  important 
example  of  the  institutionalization  of  the  EU  trans¬ 
formation  into  an  international  actor.  In  fact, 
important  reforms  of  this  responsibility  are  con¬ 
tained  in  the  Lisbon  Treaty.  The  High  Representative 
of  the  Union  for  Foreign  Affairs  and  Security  Policy 
will  be  given  the  function  of  coordinating  the 
action  of  the  member  states  in  international  orga¬ 
nizations  and  conferences.  The  High  Representative 
will  have  the  right  to  be  informed  about  matters  of 
common  interest  by  the  EU  member  states,  which 
are  also  members  of  the  Security  Council.  In  addi¬ 
tion,  when  the  EU  has  formed  a  common  position 
on  an  object  on  the  agenda  of  the  UN  Security 
Council,  those  member  states  that  sit  in  the  Security 
Council  shall  request  that  the  High  Representative 
be  invited  to  present  the  EU’s  position. 

The  strategy  of  global  approach  of  the  EU  as  an 
international  actor  has  not  changed  the  impor¬ 
tance  of  Europe  in  the  world  economic  market  and 
in  other  interactions.  The  EU  is  the  area  of  the 
world  with  the  largest  internal  market  in  terms  of 
gross  product  and  the  largest  actor  in  world  trade. 
The  European  import-export  rate  is  higher  than 
that  of  the  United  States.  Export  contributes  to  the 
gross  domestic  product  of  the  member  states  to  an 
extent  larger  than  that  of  the  United  States.  These 
achievements,  which  are  the  result  of  the  European 
economic  integration  process,  cannot  be  safe¬ 
guarded  without  appropriate  actions  in  the  world 
economic  system.  Therefore,  in  harmony  with  the 
liberal  character  of  the  European  economy,  the 


European  external  economic  policy  is  aimed  at 
preserving  the  solidity  of  the  European  economy 
within  the  free  international  trade  regime.  But  pre¬ 
serving  the  strength  of  the  European  economy 
depends  also  on  protectionist  measures  such  as  the 
external  common  tariff,  the  imposition  of  import 
quotas  and  antidumping  measures,  and  the  adop¬ 
tion  of  voluntary  restrictions  on  exports. 
Accordingly,  in  the  World  Trade  Organization,  the 
EU  supports  the  liberalization  of  world  trade  and 
the  abolition  of  tariff  restrictions  to  commerce. 
However,  at  the  same  time,  it  compromises  on  and 
delays  the  implementation  of  measures  of  trade 
liberalization  when  these  measures  may  put 
European  industries  in  difficult  positions.  Political 
and  economic  opportunity  reasons  also  determine 
the  definition  of  economic  and  trade  agreements 
with  single  countries  and  groups  of  countries. 
Indeed,  some  economic  agreements  signed  by  the 
EU  distort  the  rules  of  free  trade  for  the  sake  of 
defending  the  European  industries.  This  is  the  case, 
for  instance,  with  the  free  exchange  agreements 
with  Israel  and  Switzerland;  the  mutual  recogni¬ 
tion  agreements  with  the  United  States  and  Canada; 
and  the  trade  agreements  signed  by  the  EU  and 
countries  such  as  Australia,  Argentina,  Brazil, 
Pakistan,  and  South  Korea.  Other  trade  agreements 
are  aimed  at  developing  good  relations  with  nearby 
states — for  instance,  the  European  Economic  Area 
agreement  with  Norway,  Iceland,  and  Liechtenstein 
and  the  custom  union  with  Turkey.  Other  commer¬ 
cial  agreements  aim  at  giving  to  the  EU  a  politically 
rather  than  an  economically  favorable  position  in 
the  dialogue  with  disadvantaged  countries — for 
example,  the  Lome  Convention  that  grants  prefer¬ 
ential  trade  measures  to  70  countries  of  Africa,  the 
Caribbean,  and  the  Pacific  and  the  Generalized 
System  of  Preferences  that  gives  to  products  com¬ 
ing  from  145  developing  countries  free  access  to 
the  European  market.  Other  agreements  aim  at 
building  good  political  relations  with  neighboring 
countries  and  giving  preferential  measures  to  eco¬ 
nomically  weak  countries,  such  as  the  Euro- 
Mediterranean  Agreements  with  Algeria,  Egypt, 
Jordan,  Lebanon,  Morocco,  Palestine,  Syria,  and 
Tunisia,  and  the  Partnership  and  Cooperation 
Agreements  with  10  former  Soviet  Union  countries. 
Finally,  some  agreements  respond  to  the  logic  of 
regionalism  of  the  contemporary  world  economy — 
for  instance,  the  agreements  with  the  Andean  Pact, 
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Southern  Common  Market  (Mercado  Comun  del 
Sur;  MERCOSUR),  the  Central  America  custom 
union,  the  Gulf  Cooperation  Council,  and  the 
Association  of  Southeast  Asian  Nations  (ASEAN). 
Briefly,  the  external  economic  action  of  the  EU  is 
driven  by  a  policy  that  combines  the  strategy  for 
consolidating  the  European  position  in  the  world 
trade  system  with  strategies  of  a  political  nature 
aimed  at  either  preserving  existing  political  rela¬ 
tions  or  creating  new  political  solidarity  with 
countries  of  interest  to  Europe.  Both  strategies 
interact  with  the  sphere  of  the  international  politi¬ 
cal  relations  between  the  EU  and  external  actors 
and  are  dependent  on  the  ability  of  the  EU  to  act 
as  an  international  actor. 

Conclusion 

The  EU  is  an  international  actor  by  ascription  and 
practice.  It  has  emerged  to  such  a  role  thanks  to 
interdependence  and  globalization  and  the  self¬ 
change  process  of  economic  integration.  In  the 
1980s  and  1990s,  European  transformations  in  the 
form  of  the  Helsinki  Process  and,  later,  the  fall  of 
the  Communist  bloc  and  the  unification  of  regional 
politics  promoted  the  search  for  a  common  security 
policy.  Finally,  the  current  reconfiguration  of  world 
politics  has  created  the  conditions  for  a  self- 
conscious  ascription  of  specific  roles  in  the 
EU-constructed  image  of  the  world  system.  At  the 
same  time,  it  is  true  that  the  EU  is  something  like  an 
infant  international  actor  that  has  to  drive  itself  by 
trial  and  error  in  world  affairs  while  facing  the 
problems  of  building  its  own  mechanisms  of  foreign 
policy  making. 
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European  Integration 


European  integration  is  the  voluntary  creation  and 
maintenance  of  regional  institutions  by  states  in 
Europe.  It  is  legally  binding  and  involves  intense 
and  varied  patterns  of  interaction  in  areas  such  as 
economics,  agriculture,  justice,  immigration,  and 
foreign  affairs.  The  creation  of  the  European  Coal 
and  Steel  Community  (ECSC)  in  1951  and  the  sub¬ 
sequent  establishment  of  both  the  European  Atomic 
Energy  Community  (EURATOM)  and  the 
European  Economic  Community  (EEC)  in  1957 
marked  the  initial  phase  of  European  integration. 
Today,  the  European  Union  (EU)  is  the  key  pillar 
of  European  integration:  It  has  27  member  states, 
and  its  constitutional  basis  lies  in  the  2009  Lisbon 
Treaty.  EU  institutions  have  binding  powers  in  sev¬ 
eral  fields,  chiefly  economic,  and  are  burdened  with 
corresponding  levels  of  public  expectation. 
However,  the  transfer  of  financial  resources  and 
political  loyalty  to  the  EU  headquarters  in  Brussels, 
Belgium,  has  been  limited.  While  the  EU  represents 
a  new  political  community,  largely  autonomous 
from  its  individual  member  states,  it  has  not  super¬ 
imposed  itself  on,  or  replaced,  these  states. 

This  entry  discusses  the  entire  process  of  inte¬ 
gration,  first,  by  giving  the  historical  background 
at  the  origin  of  the  very  decision  of  creating  the 
ECSC  and  the  related  complexities  of  the  enter¬ 
prise;  second,  by  examining  the  key  features  of 
widening  and  deepening  the  process;  third,  by  sin¬ 
gling  out  the  aspect  of  EU  policy  making;  and 
fourth,  by  analyzing  the  role  of  the  EU  in  world 
politics.  A  few  remarks  on  the  future  perspective  of 
EU  conclude  the  entry. 
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Introduction 

European  history  over  the  past  2  centuries  was 
dominated  by  the  forces  of  nationalism,  left-right 
ideological  battles,  and  the  militarism  accompany¬ 
ing  these  clashes  of  belief.  The  principal  vehicle  of 
modern  European  history  up  until  1945  was  the 
nation-state:  a  territorially  delimited  entity  whose 
predominant  collective  self-understanding  was  the 
nation.  In  his  seminal  history  of  Europe,  A.  J.  P. 
Taylor  treated  European  states  as  personified  enti¬ 
ties  with  human  characteristics:  French  arrogance, 
British  aloofness,  and  German  frustration.  One  of 
the  driving  forces  behind  European  integration 
since  the  early  1950s  has  been  transformation  of 
the  nation-state  in  Europe.  Looking  at  how  states 
of  the  EU  relate  to  each  other  today,  we  see  that 
nation-states  have  been  replaced  by  member 
states.  These  member  states  are  committed  to 
cooperation  within  the  institutions  and  frame¬ 
works  of  European  integration,  a  commitment 
that  raises  questions  about  their  own  sovereignty. 
Certainly,  national  self-interest  has  not  disap¬ 
peared,  and  states  still  bargain  hard  with  each 
other  over  matters  of  policy.  However,  the  27 
member  states  are  committed  to  finding  an  agree¬ 
ment  often  at  considerable  cost  to  themselves. 
They  also  delegate  to  the  EU  powers  that  were 
traditionally  thought  of  as  the  prerogative  of  sov¬ 
ereign  states. 

The  very  purpose  and  character  of  this  exercise 
has  never  been  clearly  spelled  out,  and  the  end 
goals  of  the  integration  process  have  remained 
shrouded  in  mystery.  From  the  very  start  of  the 
European  integration  project,  ambiguity,  obfusca¬ 
tion,  and  the  prevalence  of  competing  sets  of 
underlying  principles  has  been  the  norm.  Successive 
generations  of  European  leaders  have  been  faced 
with  multiple  and  ever-changing  agendas  for 
which  there  were  no  readymade  solutions. 
Persistent  differences  among  European  actors  con¬ 
cern  the  very  nature  of  integration  (federalism  vs. 
intergovernmentalism)  and  the  functional  scope  of 
integration  (high  politics  vs.  low  politics),  and  at 
each  stage  a  compromise  between  competing 
national  agendas  has  had  to  be  found.  In  this  situ¬ 
ation,  pragmatism,  incrementalism,  and  vague¬ 
ness  become  operative  principles,  with  the  result 
that  ambitious  and  straightforwardly  cooperative 
projects  have  had  a  fairly  good  chance  of  being 


shot  down.  A  good  example  is  the  European 
Defense  Community  (EDC)  launched  by  the 
French  government  in  the  early  1950s.  Opposed 
by  both  Gaullists  and  Communists  in  the  name  of 
a  dangerous  federalism  that  would  see  French  sol¬ 
diers  come  under  the  authority  of  German  offi¬ 
cers,  the  EDC  was  voted  down  in  the  French 
National  Assembly  in  1954.  Unable  or  unwilling 
to  overcome  these  residual  currents  of  national 
sentiment,  European  policymakers  were  faced 
with  a  choice:  integration  in  disguise  or  no  inte¬ 
gration.  The  language  of  successive  cooperative 
arrangements  had  to  be  vague,  and  no  specific 
destination  point  for  the  European  project  was 
ever  officially  proclaimed.  Although  some  schol¬ 
ars  and  politicians  talked  about  constructing  a 
European  federation  resembling  the  United  States 
of  Europe,  most  were  happy  to  see  the  EU  as  an 
“unidentified  political  object,”  to  use  Jacques 
Delors’s  famous  expression.  Major  decisions 
taken  in  the  EEC  and  later  the  EU  were  thus  based 
on  a  search  for  compromise  and  consensus  rather 
than  on  any  clearly  articulated  political  principles 
or  strategic  end  goals.  The  implications  of  this  for 
democracy  at  the  national  level  were  never  prop¬ 
erly  defined. 

Deepening  and  Widening 

In  addition  to  such  ambiguity  pertaining  to  the 
ends  of  integration,  the  process  itself  has  been  an 
uneven  mixture  of  deepening  and  widening. 
Extending  cooperation  to  ever-new  fields  required 
different  legal  and  procedural  arrangements  given 
the  unwillingness  of  member  states  to  delegate 
powers  to  the  European  Commission  in  certain 
areas.  The  same  rules  governing  European  compe¬ 
tition  policy  or  common  agriculture  policy  could 
hardly  be  applied  to  the  European  justice  and 
home  affairs,  let  alone  to  the  European  Common 
Foreign  and  Defense  Policy.  Territorial  widening 
of  the  integration  project  has  generated  yet  further 
complexity.  The  EU  has  enlarged  several  times 
since  the  United  Kingdom  (UK),  Denmark,  and 
Ireland  joined  the  original  six  member  states  of  the 
EEC  in  1973.  With  each  new  enlargement,  there 
have  been  institutional  and  procedural  changes  in 
the  EU  system.  New  members  have  imported  into 
the  EU  their  diverse  cultures,  political  concerns, 
and  legal  procedures.  This  has  especially  been  the 
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case  with  the  “big  bang”  enlargement  in  2004  that 
saw  the  entry  of  eight  new  member  states  from 
Central  and  Eastern  Europe  (followed  by  the  entry 
of  Romania  and  Bulgaria  in  2007).  These  new 
member  states  are  significantly  poorer  than  their 
Western  counterparts  and  are  unlikely  to  close  the 
wealth  gap  in  the  following  decades.  Their  demo¬ 
cratic  institutions  are  still  relatively  unstable,  and 
their  economic,  legal,  and  administrative  struc¬ 
tures  are  undeveloped  when  compared  with  those 
in  the  older  member  states.  Central  and  Eastern 
European  members  also  have  their  own  distinct 
histories,  societies,  and  cultures.  For  instance, 
while  “de-communization”  is  a  central  preoccupa¬ 
tion  of  political  life  in  Eastern  Europe,  it  plays  no 
role  in  Western  European  politics.  Eastern  Europe 
still  has  relatively  few  immigrants  from  Third 
World  countries  compared  with  Western  Europe, 
but  it  is  struggling  to  come  to  terms  with  its  own 
diverse  and  often  sizable  national  minorities. 
These  include  Russians  in  Latvia  and  Estonia  and 
Hungarians  in  Romania  and  Slovakia.  New  mem¬ 
bers  have  also  different  foreign  and  security  pri¬ 
orities,  especially  regarding  relations  with  the 
United  States  and  Russia. 

The  EU’s  present  complexity,  however,  is  not 
the  result  of  the  enlargement  process  alone.  The 
EU’s  diversity  goes  well  beyond  the  simple  East- 
West  divide.  It  also  corresponds  to  the  different 
levels  of  development  and  of  national  experience 
across  Europe  as  a  whole.  Countries  such  as 
Portugal,  Greece,  Spain,  Ireland,  and  even  Finland 
have  all  undergone  significant  modernization  and 
adaptation  after  joining  the  European  Commun- 
ities/EU.  The  difficulties  that  arose  in  2010  in 
Greece,  Ireland,  and  Spain  suggest  that  significant 
levels  of  unevenness  pertain  within  the  EU.  In 
addition,  Western  European  countries  are  strug¬ 
gling  to  overcome  their  own  internal  problems 
related  to  the  incomplete  integration  of  their  large 
immigrant  populations.  Given  the  nature  of  the 
political  responses  generated  by  these  challenges — 
from  the  emergence  of  populist  rightwing  parties 
to  the  spread  of  Euroscepticism  across  the  conti¬ 
nent — it  comes  as  no  surprise  that  the  enlargement 
process  has  stalled.  Though  membership  for 
Bosnia,  Serbia,  or  Turkey  might  make  good  geo¬ 
strategic  sense,  given  how  effective  enlargement 
has  been  as  a  tool  for  stabilizing  the  EU’s  external 
environment,  the  entry  of  these  countries  remains 


in  doubt.  Burdened  by  the  EU’s  own  institutional, 
political,  and  social  complexity,  the  future  of  its 
enlargement  process  is  far  from  guaranteed. 

Challenges  to  EU  Policy  Making 

Such  complexities  indicate  how  far  we  have  come 
since  the  days  of  the  traditional  mode  of  European 
governance,  the  so-called  community  method. 
This  method  rested  on  the  presumption  of  equal 
rights  and  duties  of  all  member  states  and  the  cen¬ 
tral  role  of  the  European  Commission  as  guardian 
and  agent  of  the  common  European  interest.  This 
method  has  repeatedly  clashed  with  the  forces  of 
modernization  and  globalization,  which  generate 
increasing  levels  of  interdependence  in  the  fields  of 
economic  exchange  and  migration  and  the  con¬ 
comitant  demand  for  discretionary  and  flexible 
responses  by  powerful  national  actors  within  the 
EU.  That  different  fields  of  cooperation  had 
evolved  along  with  different  memberships  and  dif¬ 
ferent  internal  rules  has  made  adaptation  to 
changes  all  the  more  difficult.  For  instance,  Norway 
and  Iceland  are  parties  to  the  Schengen  Agreement, 
while  the  UK,  Ireland,  and  Bulgaria  are  not,  albeit 
for  different  reasons.  Only  some  EU  member  states 
are  members  of  the  Euro  Zone,  but  their  decisions 
directly  affect  the  entire  EU.  More  flexible,  diversi¬ 
fied,  and  decentralized  modes  of  governance  have 
therefore  been  suggested  for  the  EU,  but  it  is  far 
from  clear  whether  they  will  prevail.  This  is  partly 
because  the  experiments  with  these  nonhierarchi- 
cal  types  of  governance  have  not  been  very  success¬ 
ful.  (The  so-called  open  method  of  coordination, 
which  relies  on  benchmarking  of  national  prog¬ 
ress,  organized  mutual  learning,  and  deliberative 
problem  solving  across  the  EU,  has  failed  to  pro¬ 
duce  any  tangible  results.)  This  is  also  because  the 
economic  crisis  that  started  in  2008  has  reinforced 
the  need  for  regulation  and  enforcement  of  the 
agreed  rules. 

As  these  challenges  have  grown,  and  alongside 
them  the  need  to  make  crucial  decisions  pertaining 
to  people’s  well-being,  the  dissociation  between 
politics  and  policy  making  that  was  the  hallmark  of 
the  Community  method  has  become  a  major  prob¬ 
lem.  The  peculiar  relationship  between  politics  and 
policy  making  is  at  the  heart  of  the  “comitology” 
system.  This  system  consists  of  some  250-plus 
committees  set  up  to  regulate  the  implementation 
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of  European  Commission  policies.  The  committees 
are  chaired  by  the  Commission,  are  composed  of 
national  officials  from  the  member  states,  and  are 
the  site  for  detailed  and  highly  technical  delibera¬ 
tions  over  the  secondary  rules  that  govern  how 
policies  are  to  be  implemented.  In  all  political  sys¬ 
tems,  negotiations  over  the  implementation  of 
policy  agendas  exist  as  a  fundamental  part  of 
political  life.  What  is  distinctive  about  comitology 
is  what  Vivien  Schmidt  calls  “policy  without  poli¬ 
tics.”  Because  the  work  of  these  committees  is 
highly  technical,  they  are  staffed  mainly  by  experts. 
The  committees  thus  see  themselves  not  as  fora  for 
political  debates  but  as  problem-solving  bodies. 
Even  when  political  questions  intrude  into  the 
workings  of  these  committees,  this  issue  is  mainly 
about  the  balance  between  the  power  of  the 
Commission  and  the  power  of  member  states. 
Political  battles  amount  to  institutional  turf  wars 
rather  than  debates  over  political  principles  or  eth¬ 
ics.  Notwithstanding  its  central  role  in  the  organi¬ 
zation  of  contemporary  European  society,  the  EU’s 
mode  of  functioning  thus  excises  the  question  of 
the  “good  life”  from  its  deliberations.  As  a  result, 
comitology  is  a  system  of  policy  making  where  the 
political  questions  about  the  EU’s  purpose  and  end 
goals  are  not  addressed.  This  system  has  found 
itself  under  increasing  pressure  as  the  evidently 
political  questions  dealt  with  by  the  EU  imply 
answers  that  have  important  consequences  for 
people’s  daily  lives. 

At  issue  here  is  the  question  of  democracy  and 
public  participation  within  the  European  integra¬ 
tion  project.  Striking  rejections  of  the  EU’s 
Constitutional  Treaty  in  France  and  in  the 
Netherlands  in  2005  have  been  accompanied  by  a 
rise  in  Eurosceptic  sentiment.  Gone  are  the  days  of 
the  “permissive  consensus,”  where  citizens  of  EU 
member  states  passively  accepted  the  inevitability 
of  an  “ever-closer  Union.”  Central  to  the  debate 
about  the  “democratic  deficit”  of  the  EU  is  the 
question  of  standards.  Against  which  standards 
should  the  EU  be  judged?  Critics  of  the  EU  often 
imply  that  we  should  expect  the  EU  to  be  as  demo¬ 
cratic  as  our  own  national  political  systems. 
Supporters  of  the  EU  respond  that  the  EU  is  not  a 
state  and  as  such  should  be  assessed  on  a  different 
basis.  As  long  as  those  decision-making  powers 
delegated  to  the  European  Commission  are  ade¬ 
quately  monitored  (jointly  by  the  Council  of 


Ministers  and  the  European  Parliament),  there  is 
little  cause  for  concern.  Notwithstanding  this 
debate  about  standards,  certain  facts  remain.  The 
European  elections  to  the  European  Parliament  are 
striking  for  their  demonstration  of  the  growing 
disinterest  of  European  citizens.  In  the  first  direct 
elections  to  the  European  Parliament  in  1979, 
turnout  across  the  member  states  was  61.99%. 
Since  then,  there  has  been  a  steady  decline:  58.98% 
in  1984,  58.41%  in  1989,  56.67%  in  1994, 
49.51%  in  1999,  45.47%  in  2004,  and  43% 
in  2009.  Alongside  the  disinterest,  doubts  have 
been  raised  about  the  democratic  quality  of  pan- 
European  initiatives.  In  Germany,  there  was  great 
reluctance  to  support  Greece  when  the  latter  ran 
into  economic  difficulties  in  2010.  If  national  sen¬ 
timent  no  longer  corresponds  to  the  processes  and 
procedures  of  European  integration,  then  realign¬ 
ment  is  necessary  in  order  that  the  EU  remain 
consistent  with  European  public  opinion. 

From  Peace  in  Europe  to  Peace 
Building  Around  the  World 

Throughout  its  history,  European  integration  has 
been  seen  as  an  engine  behind  Europe’s  economic 
prosperity.  Lifting  exchange  controls,  eliminat¬ 
ing  quotas,  reducing  tariffs,  and  combating 
monopolist  practices  associated  with  the  creation 
of  the  European  Common  Market  have  been  gen¬ 
erating  economic  competition  and  growth  for 
some  decades.  However,  as  Giandomenico 
Majone  and  others  have  pointed  out,  growth  has 
stagnated,  or  even  regressed,  since  the  launch  of 
the  two  most  important  economic  projects  of  the 
past  2  decades:  the  Single  Market  Programme 
and  the  European  Monetary  Union.  European 
integration  has  not  necessarily  been  responsible 
for  this  negative  development.  However,  the  lack 
of  economic  success  has  prompted  European 
leaders  to  search  for  another  rationale  of 
European  integration,  and  many  have  alighted 
on  the  role  the  EU  can  play  in  world  politics. 
From  tackling  climate  change  to  crisis  manage¬ 
ment  in  conflict  zones  around  the  world,  the  EU 
is  identified  as  a  potential  heavyweight.  As  China 
and  India  take  their  place  on  the  world  stage,  it 
has  become  commonplace  to  repeat  Jacques 
Delors’s  phrase,  pronounced  at  a  time  when 
France  had  to  decide  on  its  European  or  national 


846  European  Integration 


vocation:  The  choice  is  only  between  a  United 
Europe  and  decline. 

The  EU  is  indeed,  by  all  measures,  a  very  pow¬ 
erful  global  actor.  With  its  27  member  states, 
nearly  500  million  inhabitants,  a  quarter  of  the 
world’s  GNP,  around  40%  of  the  world  merchan¬ 
dise  exports,  and  a  comprehensive  array  of  eco¬ 
nomic,  legal,  diplomatic,  and  military  instruments 
at  its  disposal,  the  EU  is  able  to  exercise  significant 
influence  in  various  parts  of  the  world.  The  euro  is 
now  the  world’s  second  most  important  interna¬ 
tional  reserve  and  trade  currency  and  as  such 
serves  as  a  conduit  for  EU  influence  across  the 
global  arena.  Owing  to  the  size  of  its  internal  mar¬ 
ket,  European  norms  and  regulations  are  progres¬ 
sively  being  adopted  by  others  in  the  global  econ¬ 
omy.  Leverage  in  this  field  allows  the  EU  to  shape 
the  multilateral  agenda  on  issues  such  as  trade, 
investment,  competition,  banking,  accounting, 
government  procurement,  and  other  types  of  ser¬ 
vices.  Together,  the  EU  and  the  United  States  pro¬ 
duce  around  80%  of  the  international  norms  and 
standards  that  regulate  global  markets.  The  EU 
also  campaigns  for  universal  labor  standards,  envi¬ 
ronmental  protection,  eradication  of  poverty,  and 
sustainable  development.  It  champions  the  cause 
of  international  law  and  multilateral  cooperation 
by  supporting  international  endeavors  such  as  the 
Convention  on  the  Rights  of  the  Child  or  the 
International  Criminal  Court.  Democracy  and 
human  rights  are  also  high  on  its  agenda,  even 
though  the  EU  has  eschewed  the  United  States’ 
more  interventionist  approach  to  democracy  pro¬ 
motion  and  regime  change.  The  EU,  if  one  adds  up 
national  and  EU-level  funds,  is  also  the  largest 
provider  of  developmental  aid. 

There  are  other  good  reasons  to  take  the 
European  foreign  policy  project  seriously.  Dip¬ 
lomats  from  EU  countries  meet  about  100  times  a 
year  and  adopt  more  than  100  joint  statements, 
communiques,  and  declarations.  Following  the 
adoption  of  the  Lisbon  Treaty,  the  EU  is  now  in 
the  process  of  developing  its  own  diplomatic  ser¬ 
vice.  The  EU’s  institutional  framework  has  become 
the  focal  point  for  European  foreign  policy 
debates,  and  it  is  the  preferred  point  of  departure 
for  most  national  diplomacies.  Today,  all  EU 
member  states  try  to  speak  and  act  “in  the  name 
of  Europe,”  even  if  that  does  not  mean  acting 
through  Europe  itself.  In  recent  years,  the  EU’s 


contribution  to  international  peace  and  security 
has  seen  it  dispatch  missions  to  far-flung  places, 
from  East  Timor,  Congo,  Sudan,  and  Afghanistan 
to  Iraq,  Lebanon,  Bosnia,  and  Georgia. 

However,  the  EU’s  global  policy  is  confronted 
with  numerous  problems.  Europe’s  external  trade 
relations  are  largely  divorced  from  Europe’s  for¬ 
eign  policy.  Responsibility  over  external  trade  is 
shared  or  split  between  the  European  Commission, 
the  European  Central  Bank,  the  Council  of  Min¬ 
isters,  the  Eurogroup,  or  the  member  states. 
Foreign  policy  proper,  unlike  external  trade,  is 
subject  to  individual  countries’  veto.  Moreover, 
EU  member  states  are  allowed  to  act  outside  the 
EU  framework,  and,  in  fact,  they  frequently  do  so, 
either  within  the  United  Nations  framework  or  via 
the  Organization  for  Security  and  Co-operation  in 
Europe,  the  Council  of  Europe,  or  NATO.  Euro¬ 
pean  foreign  and  security  policies  are  often  carried 
out  by  formal  or  informal  coalitions  of  the  willing, 
by  contact  groups  or  bilateral  initiatives.  And  the 
EU  has  practically  no  military  standing  force  at  its 
disposal  and  has  to  rely  on  member  states’  volun¬ 
tary  contributions. 

Export  of  EU  laws  and  norms  has  proved  suc¬ 
cessful  chiefly  in  the  EU’s  immediate  neighbor¬ 
hood  where  its  power  was  overwhelming  and  its 
norms  were  shared,  as  was  the  case  in  Central  and 
Eastern  Europe.  In  regions  farther  away,  the  EU’s 
efforts  to  promote  its  norms  in  fields  such  as  trade, 
environment,  or  social  protection  are  often  seen  as 
disguised  forms  of  protectionism,  of  benefit  mainly 
to  those  European  producers  and  consumers  wish¬ 
ing  to  be  shielded  from  the  competitive  pressures 
of  less  developed  countries.  In  fact,  the  opposition 
of  a  group  of  developing  countries  led  by  India  and 
Brazil  has  made  it  impossible  for  the  EU  to  include 
as  part  of  the  multilateral  trade  agenda  the  so- 
called  Singapore  issues:  competition  policy,  invest¬ 
ment,  government  procurement,  and  trade  facili¬ 
ties.  Moreover,  major  players  such  as  the  United 
States  or  China  are  able  to  play  individual  EU 
member  states  against  each  other,  effectively 
undermining  the  EU’s  aggregate  potential.  This  is 
because  the  EU  lacks  a  European  equivalent  of  the 
raison  d’etat :  As  the  president  of  the  European 
Council,  Herman  Van  Rompuy,  put  it,  the  EU’s 
strategic  partnerships  with  rising  powers  such  as 
China  lack  any  real  strategic  vision.  Moreover,  the 
EU’s  mechanisms  for  identifying  common  interests 
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are  weak  and  unable  to  overcome  national  particu¬ 
larities,  leaving  it  susceptible  to  divide-and-rule 
practices  by  rivals  such  as  Russia. 

Conclusion:  Twilight  of  the  EU? 

European  integration  has  never  been  a  smooth  or 
linear  process,  and  it  has  always  had  its  critics. 
Nevertheless,  in  recent  years  doubts  have  been 
raised  about  the  very  viability  of  the  EU.  Its  politi¬ 
cal  legitimacy  was  profoundly  shaken  by  the  fail¬ 
ure  of  the  European  Constitution  in  2005  and  the 
arm-twisting  that  occurred  at  the  time  of  Ireland’s 
ratification  of  the  Lisbon  Treaty  in  2008.  Its  exis¬ 
tence  as  a  stable  and  unified  monetary  community 
is  in  doubt  as  weaker  member  states — from  Ireland 
to  Portugal  and  Greece — struggle  to  service  their 
debts  without  external  help.  The  EU’s  foreign  and 
security  policy  is  a  continual  source  of  frustration 
for  those  who  believe  in  the  EU’s  vocation  as  a 
power  with  global  reach;  from  the  Balkans  to  the 
Middle  East,  the  EU  is  still  unable  to  credibly 
answer  Henry  Kissinger’s  often-cited  question 
about  which  number  to  call  when  wanting  to  get 
through  to  “Europe.” 

These  crises  and  disappointments  have  clearly 
undermined  the  EU’s  credibility  and  stalled  indi¬ 
vidual  projects  either  in  the  domain  of  widening 
or  of  deepening.  And  yet  as  in  previous  moments 
of  instability  and  crisis,  it  would  be  premature  to 
talk  of  the  swing  of  the  pendulum  from  integra¬ 
tion  to  disintegration.  As  politicians  of  EU  mem¬ 
ber  state  governments  themselves  admit,  European 
integration  is  the  context  within  which  they  all 
think  and  act.  Calls  to  exit  the  EU  are  still  viewed 
as  quixotic;  it  is  rather  like  wanting  to  retreat 
from  a  globalized  and  interconnected  world  in 
favor  of  a  premodern  idyll.  The  EU’s  central  func¬ 
tion  as  a  mediator  of  conflict  between  its  members 
suggests  that  in  the  years  to  come  it  may  be  more 
necessary  than  ever.  Moreover,  the  structural  shift 
from  nation-states  to  member  states  implies  that 
Europe  itself  has  changed  and  that  European  inte¬ 
gration  has  become  an  integral  feature  of  contem¬ 
porary  statehood  today  in  Europe.  The  end  of 
European  integration  also  runs  against  the  play  of 
international  politics  as  it  would  make  it  difficult 
for  European  states  to  aggregate  their  power  when 
faced  with  global  competitors  such  as  China, 
India,  Russia,  or  the  United  States  of  America. 


There  may  be  much  to  criticize  about  European 
integration,  but  alternatives  to  it  seem  either  uto¬ 
pian  or  outdated.  Adjustment  to  the  rapidly 
changing  external  environment  and  the  manage¬ 
ment  of  internal  public  expectations  was  and  still 
is  the  greatest  challenge  for  the  EU  in  the  years  to 
come. 
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Europeanization  of  Policy 


Europeanization  most  often  refers  to  domestic 
adaptation  to  regional  integration.  It  can  also 
denote  the  diffusion  of  European  values  and 
forms  of  governance  to  other  parts  of  the  world. 
In  a  third  meaning,  the  concept  may  refer  to  the 
emergence  of  common  political  standards  and 
patterns  throughout  Europe.  Studies  on 
Europeanization  focus  on  different  aspects  of  the 
political  system:  values,  institutions,  policies,  pol¬ 
icy  styles,  and  policy  procedures.  There  is  no 
consensus  among  scholars  as  to  what,  more  spe¬ 
cifically,  Europeanization  is  and  where  it  takes 
place.  This  entry  focuses  on  two  different  ways  of 
defining  Europeanization  as  adaptation  to  regional 
integration,  because  these  are  the  most  frequently 
used  definitions.  One  popular  definition  is  that 
Europeanization  is  the  impact  of  the  European 
Union  (EU)  on  domestic  policy,  polity,  and  poli¬ 
tics.  This  implies  that  Europeanization  is  some¬ 
thing  that  happens  within  EU  member  states  and 
others,  and  Europeanization  is  thus  seen  as  an 
outcome  of  European  integration.  Another  defini¬ 
tion  derives  Europeanization  from  the  interaction 
between  European  and  domestic  policy  processes. 
In  this  view,  Europeanization  is  seen  as  a  process 
embedded  in  the  evolution  of  European  integra¬ 
tion.  These  two  different  ways  of  defining 
Europeanization  are  accompanied  by  different 
sets  of  analytical  concepts  and  generate  different 
research  questions. 

Europeanization  is  not  a  theory  of  its  own,  but 
the  increasing  number  of  studies  of  Europeanization 
since  the  end  of  the  1990s  signals  a  growing  inter¬ 
est  in  the  theoretical  aspects  of  the  integration 
process.  Even  though  the  intellectual  development 
of  this  concept  has  not  been  particularly  tidy,  it 
shows  a  shift  in  focus  by  students  interested  in 
theories  of  European  integration  from  the  develop¬ 
ment  of  laws,  regulations,  and  institutions  on  the 
EU  level  to  changes  at  domestic  levels.  In  this  way, 
the  concept  has  been  used  not  solely  for  studies  of 
EU  member  states  but  also  for  states  that  are  not 


EU  members  (both  future  members  and  other 
states). 

Europeanization  as  an  Outcome 

When  Europeanization  is  seen  as  the  impact  of  the 
EU  on  domestic  policy,  polity,  and  politics,  it  con¬ 
ceptualizes  Europeanization  as  taking  place  on  the 
domestic  level.  In  this  model,  Europeanization  is 
defined  as  national  or  regional  changes  caused  by 
European  integration.  Through  this  prism,  one  can 
study  Europeanization  as  the  consequence  of  a 
formal  or  informal  European  integration  process. 
In  this  model,  the  main  research  question  is  fre¬ 
quently  whether,  or  to  what  extent,  something  has 
been  Europeanized. 

In  this  understanding  of  Europeanization — as 
used,  for  example,  in  an  anthology  edited  by  Maria 
Green  Cowles,  James  Caporaso,  and  Thomas  Risse 
and  in  a  much-cited  article  by  Robert  Ladrech — 
one  needs  to  specify  which  specific  domestic 
changes  in  policy,  politics,  and  polity  can  be  caused 
by  European  integration  and  can  count  as  Euro¬ 
peanization.  The  literature  on  the  concept  does  not 
agree  on  the  relevant  effects  of  the  process.  Analyses 
of  transposition  and  implementation  examine 
changes  in  legislation  and  legal  practice,  whereas 
other  studies  focus  on  policy  change,  institutional 
persistence,  and  policy  discourse. 

The  literature  also  examines  the  mechanisms 
by  which  this  impact  comes  about,  and  important 
efforts  have  been  made,  for  example,  by  Claudio 
Radaelli,  Christoph  Knill,  and  Dirk  Lehmkuhl  to 
systematize  how  domestic  changes  take  place, 
drawing  on  the  conceptual  toolboxes  of  various 
disciplines.  In  some  cases,  there  are  sequences  of 
voluntary  Europeanization  in  one  area  as  a  spill¬ 
over  of  coercive  Europeanization  in  another 
policy  area. 

Another  distinction  has  been  made  between 
vertical  and  horizontal  mechanisms  of  Euro¬ 
peanization.  Vertical  mechanisms  function  where 
there  is  a  European  model  that  a  state  should 
adhere  to,  resulting  in  direct  adaptational  pres¬ 
sure.  The  horizontal  mechanisms  involve  different 
kinds  of  framing  processes,  such  as  learning,  inspi¬ 
ration,  mimicry,  and  copying.  From  this  point  of 
departure,  the  mode  of  governance  in  a  policy  area 
may  help  predict  the  character  and  efficiency  of 
the  Europeanization  drivers.  This  means  that  the 
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organizational  placement  of  a  policy  area  as  either 
a  supranational  policy  area  (where  law  is  made)  or 
an  intergovernmental  policy  area  (where  policy 
primarily  is  coordinated)  may  influence  the  levers 
of  Europeanization. 

Europeanization  as  a  Process 

Even  in  policy  areas  where  there  is  no  suprana¬ 
tional  policy  process  or  where  its  impact  is  very 
limited,  domestic  policy  is  influenced  by  what 
happens  in  other  European  states  and  more  gener¬ 
ally  in  the  European  context.  In  this  way,  the 
interaction  between  European  and  domestic  pol¬ 
icy  processes  can  be  thought  of  as  generating  new 
policy  areas. 

The  novelty  in  these  policy  areas  lies  less  in  sub¬ 
stance  than  in  shape  and  framing.  When  previ¬ 
ously  domestic  policy  areas  are  affected  by 
European  policy  making  and  jurisprudence,  they 
often  change  names  and  demarcation  lines  to  other 
policy  areas.  Such  shifts  occur  not  only  in  fields 
where  the  EU  has  legislative  competence  but  also 
in  many  other  spheres.  It  might  be  more  accurate 
to  speak  of  partially  Europeanized  policy  areas. 
Both  state  and  nonstate  actors  within  such  areas 
have  a  growing  number  of  international  contacts 
and  are  increasingly  exposed  to  European  rules 
and  other  extraneous  influences.  The  process  of 
adaptation  is  seldom  unidirectional  and  sequen¬ 
tial.  More  often,  it  takes  the  form  of  gradual  and 
mutual  adaptation,  with  increased  integration  and 
domestic  changes  affecting  each  other  and  taking 
place  in  parallel. 

While  some  studies  treat  Europeanization  as 
an  objective  phenomenon,  others  emphasize  its 
rhetorical  and  discursive  dimensions  by  treating  it 
as  a  construct  with  particular  political  functions. 
A  popular  research  design  makes  Europeanization 
a  dual  process  in  which  European  and  domestic 
political  patterns  are  intertwined.  Some  scholars, 
for  example,  Ulrika  Morth,  argue  that  it  is  impos¬ 
sible  to  study  the  European  and  domestic 
processes  in  separation  as  they  are  increasingly 
interwoven  and  dependent  on  each  other.  Others, 
for  example,  Ian  Bache,  Simon  Bulmer,  and 
Martin  Burch,  have  examined  Europeanization  as 
a  two-way  process  and  made  efforts  to  distin¬ 
guish  the  different  streams  from  each  other.  From 
this  point  of  view,  Europeanization  not  only 


consists  of  the  downloading  of  EU  policies  that 
affect  member  states’  policy,  politics,  and  polity 
but  also  of  its  uploading. 

Conceptualizing  Europeanization  as  an  interac¬ 
tion  between  domestic  and  European  levels  makes 
it  difficult  to  measure  the  effects  of  the  integration 
process.  Recent  researchers  tend  to  distance  them¬ 
selves  from  the  sequential  conception  of  policy 
making  in  which  decision  makers  were  seen  as 
identifying  problems,  setting  goals,  proceeding 
then  to  compare  alternative  strategies,  and  choos¬ 
ing  those  with  a  favorable  ratio  between  costs  and 
expected  results.  Instead,  following  John  Kingdon 
(building  on  the  garbage  can  model),  they  operate 
at  the  intersection  of  problems,  policies,  and  solu¬ 
tions  floating  around  in  the  partially  Europeanized 
policy  processes. 

The  question  of  when  Europeanization  takes 
place  is  also  complicated  by  the  temporal  aspects 
of  the  European  integration  process  that  in  several 
ways  affect  domestic  planning  horizons.  Within 
this  model,  the  important  research  question  is  not 
whether  Europeanization  has  taken  place  but  how 
it  proceeds  and  manifests  itself  in  various  modes  of 
interaction  between  European  and  domestic  policy 
processes. 

Of  crucial  importance  for  this  line  of  research  is 
a  closer  look  at  the  specific  linkages  and  ligatures 
of  multilevel  governance.  Understanding  the  nature 
of  Europeanization  requires  notions  illuminating 
the  particular  ways  and  forms  in  which  European 
and  domestic  policy  processes  are  related.  Some 
concepts  in  the  literature  on  Europeanization  that 
might  guide  such  an  analysis  are  customizing, 
framing,  the  idea  of  social  representations,  and 
mediated  cross-load. 

Customizing  refers  to  efforts  undertaken  by 
national  politicians  to  make  the  EU  resemble  their 
own  state.  When  participating  in  the  EU  policy 
process,  member  states  emphasize  features  of  the 
European  policy  that  makes  it  more  similar  to  the 
domestic  policy,  thereby  making  the  EU  more 
familiar  to  various  national  constituencies.  This  is 
a  way  of  strengthening  the  legitimacy  of  the  EU 
within  the  national  political  culture. 

A  related  notion  is  that  of  framing,  which 
stands  for  the  context  in  which  a  policy  is  pre¬ 
sented.  Again,  the  purpose  is  to  gain  acceptance 
for  specific  proposals  or  lines  of  action.  A  similar 
idea  is  that  of  “social  representations,”  which 
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shape  the  institutionalization  of  the  policy  area 
and  define  frameworks  for  possible  cooperation. 

An  idea  related  to  that  of  upload  and  download 
discussed  earlier  is  that  of  cross-load,  where  local 
practices  are  disseminated  from  one  state  to  another 
without  necessarily  passing  the  central  EU  level  and 
its  institutions  in  Brussels.  However,  even  if  prac¬ 
tice  is  not  disseminated  via  the  EU  institutions,  they 
are  not  necessarily  isolated  policy  transfers,  but 
they  can  be  expected  to  be  mediated  by  other  pro¬ 
cesses  on  the  European  level,  which  would  be  medi¬ 
ated  cross-load.  Such  processes  might  take  place  in 
transnational  networks.  In  addition,  policy  transfer 
in  a  policy  area  may  take  place  between  some  coun¬ 
tries  and  might  then  be  pushed  “upward”  to  the 
European  policy  process.  Such  conceptual  tools 
have  been  employed  in  the  analyses  of  European¬ 
ization  encompassing  domestic  changes  that  pre¬ 
cede  and  facilitate  further  formal  integration. 
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Evaluation  research  is  about  assessing  the  conse¬ 
quences  of  public  policy.  The  emphasis  can  be  either 
on  the  policy  process  or  on  the  outputs  and  the  out¬ 
comes  of  the  policies.  Evaluation  research  thus  has  a 
clear  democratic  and  learning  goal:  to  inform  the 
decision  makers,  citizens,  and  interest  groups 
whether  public  policies  have  or  have  not  had  the 
intended  consequences.  Whether  public  policies 
result  in  alleviating  public  problems  is  a  central  ques¬ 
tion  for  many  societal  stakeholders.  Governments 
are  keen  on  showing  that  the  practiced  policy  mak¬ 
ing  has  been  effective  and  that  promises  made  during 
an  election  campaign  have  been  kept.  In  addition, 
evaluation  is  an  important  tool  of  monitoring;  gov¬ 
ernments  as  principals  use  evaluation  to  keep  the 
agents  and  the  bureaucracy  accountable.  Opposition 
parties,  in  principle,  are  as  keen  on  showing  that 
government  policies  have  not  had  the  intended 
effects.  The  media  and  think  tanks  inform  citizens, 
and  to  do  so,  they  need  information  on  activities. 
This  definition  implies,  first,  that  evaluation  research 
is  usually  empirical  and,  second,  that  evaluation  is 
usually  bound  to  the  goals  of  public  policy  making. 
This  is,  however,  not  the  only  option.  Seen  against 
the  multitude  of  benefits  it  serves,  evaluation  research 
is  not  without  problems.  The  academic  element  of 
evaluation  research  is  to  develop  and  further  improve 
the  tools  with  which  to  evaluate.  The  problems  of 
evaluation  concern  both  the  practice  of  evaluation — 
how  to  do  it — and  the  usage  of  evaluation.  This 
entry  focuses  on  the  how  question,  and  thereafter 
the  focus  moves  on  to  the  usage  of  evaluation. 

Conducting  Evaluation 

Researchers  engaged  in  the  evaluation  of  public 
policies  face  a  number  of  choices  related  to  approach, 
focus,  and  criteria.  First,  various  forms  of  evalua¬ 
tion  research  differ  from  each  other  on  the  basis  of 
the  selected  approach.  The  principal  approaches  are 
formative  and  summative  evaluation.  Formative 
evaluation  monitors  the  way  in  which  a  program  is 
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being  administered  or  managed  so  as  to  provide 
feedback  that  may  serve  to  improve  the  implemen¬ 
tation  process.  Summative  evaluation  measures 
how  a  program  has  actually  affected  the  problems  it 
was  designed  to  address.  Second,  evaluation  can 
focus  on  various  parts  in  a  policy  process:  on  the 
process  or  implementation  at  various  levels  of  gov¬ 
ernment,  ranging  from  the  top  of  the  hierarchy  to 
the  grassroots  bureaucrats,  or  on  the  outputs  and 
outcomes.  Third,  evaluation  can  focus  on  various 
aspects  of  the  process  or  outputs.  The  approach  and 
focus  used  determine  the  criteria  for  evaluating  a 
program’s  success.  Usually,  evaluation  involves  a 
comparison  of  the  goals  and  impacts  of  a  program, 
with  the  program’s  goals  often  selected  as  the  yard¬ 
stick  for  measuring  or  evaluating  success.  But  effi¬ 
ciency,  organizational  changes,  stakeholder  con¬ 
cerns,  and  democracy  can  also  be  used  as  criteria.  In 
other  words,  if  a  new  policy  aims  at  diminishing 
traffic  accidents,  the  evaluator  assesses  whether  this 
indeed  has  been  the  result.  Other  options  include 
focusing  on  economy  (Has  money  been  wasted?), 
democracy  (Did  the  program  respond  to  popular 
will?),  or  impacts.  In  recent  decades,  public  admin¬ 
istration  has  moved  into  using  performance  indica¬ 
tors  (PI)  to  guide  agencies  and  lower  rank  officials 
to  aim  at  the  negotiated  targets.  For  example,  in 
academia,  PI  may  include  the  number  of  publica¬ 
tions  and  student  achievement  scores;  for  health 
care,  PI  may  include  the  number  of  operations  and 
patient  calls.  The  debate  concerning  the  use  of  PI 
has  pointed  out  that  the  choice  of  indicators  is 
important  so  as  not  to  narrow  the  target  field  too 
much  and  has  warned  that  top-down  set  indicators 
can  have  a  demoralizing  effect  on  lower  rank  offi¬ 
cials’  professional  drive  for  work  improvement. 

The  formal  goals  of  policies  form  an  important 
starting  point  for  evaluators.  This  is  because  govern¬ 
ment  ought  to  respond  to  the  needs  and  expecta¬ 
tions  of  the  people.  If  evaluation  shows  that  the 
results  are  poor  or  that  the  way  in  which  programs 
are  carried  out  is  not  optimal,  government  should  be 
concerned.  At  the  same  time,  there  are  a  number  of 
reasons  to  defend  the  replacement  of  formal  goals  as 
an  evaluation  criterion.  First,  political  goals  are  not 
always  very  precise.  Public  programs  can  contain 
multiple  goals,  some  of  which  may  be  meant  to  sat¬ 
isfy  the  voters  rather  than  target  a  problem.  Second, 
concentrating  solely  on  monitoring  the  realization 
of  formal  goals  may  restrict  the  identification  of 


various  side  effects  that  policies  may  have.  These  are 
usually  not  evaluated  as  thoroughly  as  the  formal 
goals.  Yet  unintentional  side  effects  can  replace  and 
disturb  the  formal  goals  in  the  long  run. 

An  extreme  option  for  criteria  is  to  select  a  goal- 
free  approach — that  is,  to  study  a  policy  process 
from  all  possible  angles.  A  more  recent  approach, 
a  realistic  one,  asks  what  works  for  whom  in  what 
circumstances,  emphasizing  thus  the  importance  of 
the  context  in  which  a  policy  is  making  an  impact. 
All  in  all,  the  various  evaluation  models  or 
approaches  have  various  emphases  and  hence  may 
lead  to  different  sorts  of  conclusions  as  to  what 
constitutes  a  successful  program.  An  evaluation 
researcher  usually  either  chooses  a  particular  per¬ 
spective,  say  how  a  regional  policy  affects  the  par¬ 
ticipation  of  the  local  citizens,  or  can  aim  at  recon¬ 
structing  a  comprehensive  account  of  the  merits  of 
a  program.  The  latter  choice  inevitably  involves  a 
selection  of  multiple  criteria,  data,  and  perspectives 
in  a  program.  Finally,  as  in  social  science  in  gen¬ 
eral,  some  researchers  prefer  a  deductive  approach 
and  others  an  inductive  one;  some  prefer  a  quanti¬ 
tative  analysis,  others  a  qualitative  case  approach. 

Whereas  evaluating  the  administrative  process 
whereby  policies  are  fulfilled  can  be  relatively  easy, 
emphasizing  the  impact  of  public  policies  is  loaded 
with  difficulties.  This  is  mainly  because  public  poli¬ 
cies  are  not  the  only  factor  that  affects  societal  pro¬ 
cesses.  A  standard  example  is  education.  Although 
schools  play  a  vital  role  in  education  by  providing 
the  institutional  framework  (schools,  teachers,  and 
resources)  for  learning,  education  is  also  dependent 
on  the  pupils  themselves,  their  parents,  and  the 
underlying  culture.  In  other  words,  a  number  of 
other  factors  may  intervene  once  a  public  policy  is 
implemented.  Regional  policy  offers  a  good  exam¬ 
ple:  A  tax  relief  program  may  be  proposed  to 
encourage  private  sector  enterprises  to  add  jobs. 
Once  the  program  is  implemented,  it  is  but  a  single 
factor  when  such  enterprises  consider  whether  or 
not  to  invest  in  hiring  additional  staff.  In  education, 
similarly,  good  education  is  a  necessary  but  not  a 
sufficient  ingredient  of  learning.  Full  impacts  take 
time  to  be  realized.  In  other  words,  when  public 
policies  are  put  into  effect,  as  time  passes,  it  becomes 
difficult  to  distinguish  between  the  net  effect  of  the 
policy  and  other  effects.  Furthermore,  the  task  is  not 
made  easier  by  the  fact  that  the  grassroots  officials 
often  cooperate  with  other  officials.  Environmental 
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health  inspectors,  for  example,  talk  and  meet  with 
representatives  of  other  municipal  branches  (gar¬ 
bage,  health,  fire,  and  planning),  other  public  offi¬ 
cials  and  relevant  private  shopkeepers,  producers, 
and  so  on,  and  all  this  adds  up  to  environmental 
health.  One  particular  policy  constitutes  only  a  frac¬ 
tion  of  the  total  impact;  alone  it  does  not 
suffice  to  make  much  difference.  A  bottom-up  per¬ 
spective  into  a  policy  process  opens  up  a  different 
scenario  compared  with  a  top-down  perspective. 

Use  of  Evaluation  Results 

The  usage  of  evaluation  results  is  also  not  without 
problems.  First,  the  aforementioned  methodological 
constraints  mean  that  evaluation  results  are  not 
totally  reliable.  One  can,  for  example,  argue  for  or 
against  the  application  of  one  specific  criterion  or 
approach.  The  evaluation  of  economic  efficiency, 
for  example,  can  be  criticized  if  it  does  not  include 
in  the  analysis  an  indicator  for  the  quality  of  services 
too.  The  same  applies  to  other  criteria,  and  hence, 
one  solution  is  to  apply  several  criteria.  Also,  it  may 
simply  be  impossible  to  sample  all  the  necessary 
data.  Evaluation  as  a  part  of  monitoring  usually  has 
to  rely  on  quantitative  data.  European  Union  pro¬ 
grams,  for  example,  are  assessed  on  the  basis  of  the 
number  of  new  jobs  created.  There  is,  however,  no 
certainty  that  these  jobs  will  be  lasting.  Qualitative 
data  may,  however,  be  more  difficult  and  costly  to 
obtain  in  a  day-to-day  routine  evaluation.  Second, 
the  difficulties  of  impact  assessment  are  serious  but 
not  impossible.  It  is  evident  that  public  policy  out¬ 
puts  need  to  be  balanced  in  relation  to  other  inter¬ 
vening  factors.  In  the  recent  Programme  for 
International  Student  Assessment  (PISA)  evalua¬ 
tions,  which  compare  international  education 
results,  reading,  for  example,  is  defined  as  deter¬ 
mined  by  a  combination  of  teaching,  the  pupil’s 
own  activity,  and  support  by  parents.  Impacts  are 
also  complex  in  the  modern  world.  Globalization, 
crowded  policy  space,  and  postmodern  citizen  atti¬ 
tudes  mean  that  the  way  individual  policies  influ¬ 
ence  the  target  problems  is  more  and  more  uncer¬ 
tain.  However,  full  impacts  can  hardly  be  assessed 
on  a  routine  basis.  Also,  as  the  realistic  argument 
says,  the  contextual  nature  of  policy  implementa¬ 
tion  implies  that  the  process  and  hence  also  the 
impact  varies,  and  explanations  do  not  rest  on  cer¬ 
tainty  but  on  varying  degrees  of  plausibility. 


Irrespective  of  how  skillfully  and  comprehen¬ 
sively  an  evaluation  is  conducted,  its  results  may  be 
poorly  used.  This  has  to  do  with  power.  Governments 
and  other  policymakers  may  not  be  willing  to  pub¬ 
lish  news  when  the  impact  of  government  programs 
has  been  poor,  nor  will  the  responsible  agency 
applaud  if  the  results  of  its  activities  are  criticized. 
After  all,  public  agencies  are  interested  in  both  good 
results  and  maintaining  their  status.  Evaluation 
results  are,  in  other  words,  treated  strategically,  and 
this  can  take  various  forms.  Evaluation  can,  for 
example,  be  designed  and  conducted  in  concert  with 
the  individual  or  organization  who  commissions  it. 
Thus,  the  selection  of  criteria  can  be  affected  by  fac¬ 
tors  other  than  the  principles  of  good  evaluation.  A 
second  possibility  is  simply  that  evaluation  results 
may  be  rejected  if  they  do  not  correspond  with  the 
wishes  of  those  who  authorized  the  evaluation. 
Evaluation  is  a  part  of  democratic  governance.  Feed¬ 
back  and  openly  discussing  impartial  evidence  are 
signs  of  transparency  and  openness.  A  perfect  evalu¬ 
ation  is  seldom  possible  due  to  the  aforementioned 
restrictions.  Evaluation  should  not  be  abandoned, 
and  the  effort  to  see  how  public  programs  are  work¬ 
ing  should  not  be  given  up.  Evaluation  is  just  one 
element  in  the  debate  over  policies  and  programs.  It 
may  have  an  influence  but  so  do  other  factors, 
including  political  commitments,  the  preservation  of 
institutional  relationships,  public  opinion,  and  so 
on.  Politics  in  democratic  societies,  on  the  other 
hand,  is  about  the  belief  that  policy  making  can  and 
will  make  differences,  and  hence  public  policy  mak¬ 
ing  needs  feedback.  There  is  a  lot  of  uncertainty 
involved.  Successful  evaluations  are  based  on  clear, 
explicated  criteria.  A  particular  debate  within  evalu¬ 
ation  research  concerns  the  underlying  theoretical 
assumptions  of  the  various  approaches  and  to  what 
extent  these  affect  the  evaluation  results.  Evaluation 
research,  in  other  words,  cannot  escape  the  common 
basis  of  social  science  to  which  it  belongs. 

Pekka  Kettunen 
University  of  Jyvdskyld 
Jyvdskyld,  Finland 
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Event  Counts 


Many  subjects  of  interest  to  political  scientists 
take  the  form  of  event  counts:  nonnegative,  inte¬ 
ger  counts  of  the  number  of  times  that  a  particular 
event  has  occurred.  Event  count  data  are  found  in 
American  politics  (e.g.,  counts  of  the  number  of 
policy  adoptions,  presidential  vetoes,  and  judicial 
decision  reversals),  international  relations  (e.g., 
interstate  conflicts,  militarized  disputes,  and  treaty 
violations),  and  comparative  politics  (e.g.,  protest 
activities,  cabinet  changes,  and  regime  transi¬ 
tions).  Just  as  scholars  are  often  interested  in  esti¬ 
mating  the  time  of  event  occurrence,  via  event 
history  analysis,  so  also  are  they  often  interested  in 
modeling  counts  of  events  that  are  observed,  via 
event  count  models.  Count  data  present  problems 
for  the  linear  regression  model  developed  for  con¬ 
tinuous  dependent  variables,  which  can  produce 
biased,  inconsistent,  and  inefficient  estimates  when 
applied  to  count  data.  As  a  consequence,  scholars 
have  developed  several  alternative  event  count 
models  for  count  data.  These  are  discussed  below. 


Poisson  Regression  Model 

The  most  basic  event  count  model  is  the  Poisson 
regression  model,  which  assumes  that  the  count  data 
are  produced  by  a  Poisson  process,  with  a  condi¬ 
tional  mean  captured  by  the  covariates  in  the  model: 


Pr(y,|x,) 


exp(-p,V,y' 


(1) 


where  |xL  =  exp(xJ3).  Because  the  Poisson  regres¬ 
sion  model  is  a  loglinear  model,  the  expected  count 
changes  by  a  factor  of  expOJ  for  a  one-unit 
change  in  xk.  The  Poisson  regression  model  can  be 


estimated  by  maximum  likelihood.  Although  it  is 
natural  to  conceive  of  count  processes  as  following 
a  Poisson  process,  the  Poisson  regression  model 
actually  contains  three  quite  restrictive  limitations 
that  limit  its  applicability.  First,  the  Poisson  process 
is  premised  on  events  being  independent  of  each 
other — the  occurrence  of  an  event  does  not  increase 
the  probability  of  the  unit  experiencing  the  same 
type  of  event  in  the  future.  This  assumption  will 
often  be  violated  in  practice.  Second,  related  to 
this,  the  Poisson  regression  model  also  assumes 
that  the  conditional  variance  is  equal  to  the  condi¬ 
tional  mean,  a  condition  known  as  equidispersion. 
In  many  applications,  the  conditional  variance  will 
exceed  the  conditional  mean,  a  condition  known  as 
overdispersion.  Third,  the  Poisson  regression  model 
underpredicts  the  number  of  zeros  (absences  of 
events)  that  are  observed  in  many  applications. 
More  sophisticated  count  models  than  the  Poisson 
regression  model  account  for  these  divergences 
from  the  assumptions  of  the  Poisson  model. 

Overdispersion 

In  an  event  count  model,  the  dependent  variable  is 
the  number  of  events,  not  the  occurrence  of  an 
event.  As  a  consequence,  the  researcher  needs  to 
make  assumptions  about  the  process  that  aggregates 
to  the  event  count.  When  overdispersion  is  present, 
estimates  from  the  Poisson  regression  model  gener¬ 
ally  will  still  be  consistent  if  the  conditional  mean  is 
properly  specified.  (Note,  however,  that  truncated 
or  censored  data  can  result  in  inconsistency,  even  if 
the  conditional  mean  is  properly  specified).  Standard 
errors  from  Poisson  regressions,  however,  will  be 
biased  downward,  leading  to  Type  I  errors  (rejecting 
the  null  hypothesis  when  it  is  true)  in  inference.  As  a 
consequence,  overdispersion  must  be  incorporated 
into  the  model  specification  when  present. 

There  are  two  principal  sources  of  overdisper¬ 
sion  in  count  data.  On  the  one  hand,  unmodeled 
sources  of  heterogeneity  may  produce  a  condi¬ 
tional  variance  that  is  larger  than  the  conditional 
mean.  On  the  other  hand,  the  assumption  of  inde¬ 
pendence  of  events  inherent  in  the  Poisson  process 
may  be  violated.  Typically,  researchers  conceive  of 
this  nonindependence  as  a  form  of  contagion 
within  units — the  experience  of  an  initial  event 
increases  the  probability  of  the  same  unit  experi¬ 
encing  subsequent  events.  For  example,  a  country’s 
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experience  of  an  interstate  conflict  may  increase 
the  probability  of  the  country  experiencing  subse¬ 
quent  interstate  conflicts. 

Given  the  tendency  of  event  count  data  to 
exhibit  overdispersion,  it  is  important  to  test  for 
this  violation  of  the  Poisson  regression  assumption 
rather  than  simply  accepting  the  estimates  from  a 
Poisson  model.  A  variety  of  tests  for  overdisper¬ 
sion  have  been  developed.  A  common  regression- 
based  test  for  overdispersion  typically  assumes 
that  the  overdispersion  takes  the  following  form: 


Typically,  a  gamma  distribution  with  parameter 
0,  is  assumed  for  the  8(,  resulting  in  the  following 
negative  binomial  probability  distribution: 


Pr(y,|x;) 


r(y,  +  e,)/  e,  W  h  V'  (5) 
y,!r(e,)  \e,  +  |x,y  \e,  + 


The  expected  value,  |x(,  remains  the  same  for  the 
negative  binomial  distribution  as  in  the  Poisson. 
The  conditional  variance  in  the  negative  binomial, 
however,  is  larger  than  the  conditional  mean: 


=  M-;  +  “g(M-i), 

where  a  is  the  dispersion  parameter  and  g  is  a 
known  function.  Under  the  null  of  equidispersion, 
a  =  0.  A  test  of  H0:  a  =  0,  then  proceeds  by  esti¬ 
mating  a  Poisson  regression  model  and  running  an 
auxiliary  ordinary  least  squares  (OLS)  regression. 
If  the  null  of  equidispersion  can  be  rejected,  the 
researcher  will  then  wish  to  estimate  a  model 
capable  of  capturing  overdispersion. 


Negative  Binomial  Regression  Model 

The  most  commonly  employed  model  for  overdis¬ 
persion  is  the  negative  binomial  regression  model. 
The  mean  p  =  exp(x(3)  from  the  Poisson  regres¬ 
sion  model  is  replaced  in  the  negative  binomial 
regression  model  with  a  random  variable: 


P-,  =  exp(x,3  +  8,) 

=  exp(x,0)  exp(e,'),>  (3) 

=  p,  exp(e,), 

where  s;  is  an  error  term  reflecting  unobserved 
heterogeneity.  In  other  words,  the  heterogeneity  in 
unmodeled  covariates  that  produces  a  variance 
that  exceeds  the  mean  is  modeled  in  the  negative 
binomial  regression  through  the  random  variable 
pi.  With  8;  —  exp(E(),  the  distribution  of  the  yi  con¬ 
ditional  on  xt  and  8(  remains  Poisson: 


Pr(y,|x,',  8,-) 


exp(  — (Xjil/- 


(4) 


Var(y,|x,)  =  exp(x,P)  ^1  +  exp^Y'P^  .  (6) 

The  negative  binomial  regression  model  can  be 
estimated  by  maximum  likelihood. 


Excess  Zeros 

A  variance  larger  than  the  mean,  overdispersion,  is 
evidenced  by  a  larger  number  of  zero  counts  of 
events  (the  absence  of  any  events)  than  is  predicted 
by  the  Poisson  regression  model.  The  negative  bino¬ 
mial  regression  model  accounts  for  this  excess  num¬ 
ber  of  zeros  by  modifying  the  conditional  variance 
from  the  standard  Poisson  model.  Heterogeneity  and 
contagion,  however,  are  not  the  only  possible  sources 
of  excess  zeros.  An  excessive  number  of  zeros  may 
also  be  produced  by  a  dual-regime  process,  in  which 
the  data-generating  processes  producing  zero  counts 
and  producing  event  counts  once  events  are  possible 
are  qualitatively  different.  The  first  of  the  two  stages 
is  a  transition  stage,  which  governs  the  probability  of 
transitioning  from  experiencing  no  event  to  experi¬ 
encing  events.  The  second  stage  is  the  events  stage, 
which  produces  the  number  of  events  that  are  expe¬ 
rienced  once  the  unit  has  transitioned  into  the  stage 
in  which  events  are  possible. 

If  a  dual-regime  process  is  suspected,  rather  than 
modifying  the  conditional  variance  and  leaving  the 
conditional  mean  unchanged  from  the  Poisson 
model,  the  researcher  will  instead  wish  to  modify 
the  model  for  the  conditional  mean.  This  is  done  by 
recognizing  the  dual  regime  nature  of  the  data- 
generating  process.  Two  principal  dual-regime 
alternatives  have  been  proposed:  the  zero-inflated 
model  and  the  hurdle  model.  The  two  alternative 
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dual-regime  models  differ  in  their  conception  of 
the  possibility  of  zero  counts  once  a  unit  has  exited 
the  transition  process. 

Zero-Inflated  Models 

In  zero-inflated  models,  zero  counts  can  be 
observed  in  either  of  two  distinct  regimes.  In  the 
first  regime,  the  outcome  is  always  zero.  In  the  sec¬ 
ond  regime,  events  or  nonevents  may  be  observed. 
Although  developed  for  the  application  of  identify¬ 
ing  defects  in  manufacturing  processes  (in  a  perfect 
manufacturing  process,  defects  are  never  observed 
in  the  first  regime),  it  is  easy  to  think  of  political 
science  applications  in  which  these  two  distinct 
types  of  regimes  are  possible.  For  example,  under 
certain  conditions,  states  may  never  go  to  war; 
however,  if  these  conditions  are  not  met,  then 
either  war  or  the  absence  of  war  may  be  observed. 

The  probability  of  exiting  the  regime  in  which 
only  zeros  are  observed  can  be  parameterized  as  a 
logit  or  probit.  Poisson  and  negative  binomial  mod¬ 
els  have  been  used  to  parameterize  the  second  events 
stage.  But  whether  one  uses  a  Poisson  or  negative 
binomial  specification,  the  commonality  of  these 
zero-inflated  approaches  is  that  zero  counts  are  pos¬ 
sible  for  either  of  the  two  regimes  in  the  model.  This 
latter  feature  sets  these  models  apart  from  another 
common  event  count  model,  the  hurdle  model. 

Hurdle  Models 

Like  their  zero-inflated  counterparts,  hurdle  mod¬ 
els  also  assume  a  dual  regime  process.  Hurdle 
models,  however,  depart  from  zero-inflated  mod¬ 
els  in  assuming  that  zero  counts  are  not  possible  in 
the  events  stage.  Instead,  once  the  unit  has  crossed 
the  hurdle  and  departed  the  first  stage,  at  least  one 
event  will  be  observed.  The  second-stage  process  is 
thus  truncated  at  zero.  As  a  consequence,  the  prob¬ 
ability  of  excessive  zeros  is  determined  solely  by 
the  first-stage  process.  As  with  zero-inflated  mod¬ 
els,  both  Poisson  and  negative  binomial  versions  of 
hurdle  models  are  used  by  researchers,  depending 
on  their  conception  of  the  events  stage  process. 

Conclusion 

Political  scientists  are  increasingly  interested  in 
modeling  phenomena  for  which  the  standard  linear 


regression  model  is  not  appropriate.  One  of  these 
phenomena  is  count  processes,  which  arise  in  each 
of  the  empirical  subfields  of  the  discipline.  A  vari¬ 
ety  of  specialized  models  have  been  developed  to 
model  count  processes,  and  research  continues  on 
these  models.  Important  recent  developments  in 
event  count  models  include  correlated  event  count 
models,  panel  data  models,  and  Bayesian  models. 
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Event  History  Analysis 


Event  history  analysis  allows  researchers  to  inves¬ 
tigate  not  only  occurrences  of  certain  political 
events  but  also  the  processes  or  histories  of  those 
events.  In  essence,  event  history  analysis  estimates 
how  histories  of  events  or  durations  leading  up  to 
the  events  (if  events  occur  at  all)  change  with  dif¬ 
ferent  values  of  independent  variables  (or  covari- 
ates).  Such  estimates  allow  researchers  to  discuss 
how  included  covariates  increase  or  decrease  the 
probability  of  the  event  occurrence  at  any  given 
time.  Since  event  history  analyzes  the  duration 
between  the  start  when  the  event  is  possible  to  the 
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time  when  the  event  actually  occurs  (if  it  occurs), 
event  history  analysis  is  often  called  duration 
analysis.  Alternatively,  as  event  history  analysis 
models  the  probability  of  an  observation  surviving 
(not  experiencing  the  event),  event  history  analysis 
is  sometimes  called  survival  analysis.  Below,  this 
entry  discusses  some  major  features  and  possible 
applications  of  event  history  analysis,  taking  into 
account  both  parametric  and  nonparametric  mod¬ 
els  and  some  recent  extensions. 

Many  political  science  studies  ask  questions 
regarding  occurrences  of  significant  political 
events.  For  instance,  scholars  in  comparative  poli¬ 
tics  are  interested  in  why  some  countries  democra¬ 
tize  while  others  fail  to  do  so  (democratization),  or 
why  some  countries  are  able  to  economically 
develop  while  others  stay  in  poverty  (economic 
development).  Likewise,  scholars  in  international 
relations  study  why  some  countries  join  many 
international  organizations,  while  others  join  only 
a  few  (international  organization  [IOJ  member¬ 
ship)  and  why  some  pairs  of  states  are  more  likely 
to  engage  in  militarized  interstate  disputes  than 
others  (militarized  disputes).  Scholars  in  American 
politics  ask  why  some  court  nominees  are  approved, 
while  others  are  not  (legislative  and  judicial  branch 
relations),  and  what  makes  an  incumbent  more 
likely  to  face  a  challenger  (reelection).  By  answer¬ 
ing  these  questions,  political  scientists  are  able  to 
identify  systematic  dynamics  influencing  occur¬ 
rences  of  these  political  events. 

While  these  inquiries  certainly  enhance  our 
understanding  of  interesting  political  phenomena, 
the  timing  of  occurrences  of  such  events  can  provide 
even  richer  information  and  thus  help  further 
enhance  our  understanding  of  the  underlying 
dynamics.  For  instance,  not  only  identifying  under 
what  conditions  a  country  is  able  to  democratize 
but  also  finding  out  under  what  conditions  a  coun¬ 
try  is  able  to  do  so  faster  than  other  countries  would 
help  promote  our  understanding  of  democratiza¬ 
tion.  Event  history  analysis  provides  researchers 
with  a  unique  opportunity  to  exploit  richer  infor¬ 
mation  of  the  histories  of  events. 

Since  event  history  analysis  has  been  introduced 
to  political  science,  there  have  been  a  growing 
number  of  studies  that  use  it  in  empirical  investiga¬ 
tions.  With  this  powerful  new  tool,  many  old 
research  questions  have  been  revisited,  and  many 
new  research  questions  have  been  developed  and 


empirically  tested.  These  questions  cut  across  sub- 
disciplines  of  political  science,  and  examples  of 
them  are  numerous.  Why  do  some  coalition  gov¬ 
ernments  last  longer  than  others?  Why  do  some 
militarized  disputes  last  longer  than  others?  Why 
do  some  periods  of  peace  last  longer  than  others? 
Why  do  some  economic  sanctions  last  longer  than 
others?  Why  do  some  drugs  take  a  longer  time  to 
get  approved  by  the  Food  and  Drug  Administration 
than  other  drugs?  And  why  can  some  representa¬ 
tives  maintain  their  seats  longer  than  others?  The 
popularity  of  the  event  history  analysis  is  due  to  its 
usefulness  in  understanding  the  underlying  politi¬ 
cal  dynamics  of  events  and  their  histories. 
Specifically,  event  history  analysis  has  a  few 
advantages  over  traditional  binary  logit  or  probit 
analysis,  on  the  one  hand,  and  over  ordinary  least 
squares  regression  (OLS),  on  the  other. 

Event  history  analysis  has  many  advantages 
over  binary  logit  or  probit  models.  First  of  all, 
event  history  analysis  takes  the  history  of  an  event 
seriously  by  taking  duration  into  account.  As  how 
long  it  takes  for  an  event  to  occur  provides  more 
information  than  just  whether  the  event  occurs  or 
not,  it  helps  better  understand  the  dynamics  of  the 
events  under  investigation.  Returning  to  a  prior 
example,  some  countries  may  successfully  democ¬ 
ratize  less  than  a  few  decades  after  independence, 
others  may  democratize  after  more  than  a  few 
decades,  while  still  others  may  not  be  able  to 
democratize  at  all.  Conventional  logit  or  probit 
models  cannot  differentiate  between  those  cases 
democratizing  in  less  than  a  few  decades  and  those 
in  more  than  a  few  decades.  In  contrast,  event  his¬ 
tory  analysis  allows  researchers  not  only  to  inves¬ 
tigate  factors  that  make  democratization  more 
likely  but  also  to  investigate  factors  that  hasten 
democratization  by  exploiting  richer  data. 

Likewise,  one  may  think  that  OLS  regression 
might  be  able  to  capture  factors  influencing  dura¬ 
tion  quite  nicely,  as  it  examines  continuous  depen¬ 
dent  variables.  But  event  history/duration  data 
pose  several  challenges  for  traditional  OLS  estima¬ 
tion.  For  one,  duration  data  are  often  right  skewed 
and  the  OLS  approach  requires  an  arbitrary  trans¬ 
formation  of  data  (most  commonly  taking  a  log)  to 
deal  with  a  right-skewed  data  set.  A  more  serious 
problem  is  data  truncation.  Data  truncation  hap¬ 
pens  when  researchers  do  not  know  either  the 
exact  entry  time  of  an  observation  (left  truncated) 
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or  the  end  time — this  is  the  case  when  events  have 
not  yet  occurred  when  the  data  collection  is  done 
(right  censoring).  That  is,  there  could  be  some 
observations  that,  at  the  time  of  data  collection, 
are  still  ongoing.  OLS  treats  left-truncated  obser¬ 
vations  as  if  they  have  an  equivalent  entry  time 
with  other  observations  and  copes  with  right- 
censoring  problems  either  by  dropping  all  the 
observations  that  have  not  experienced  the  event 
or  by  capping  the  history  by  assuming  that  the 
event  has  occurred  at  the  conclusion  of  the  period 
of  data  collection.  Both  these  arbitrary  assump¬ 
tions  can  cause  biases.  In  contrast,  event  history 
analysis  takes  into  account  these  cases  of  left  trun¬ 
cation  and  right  censoring.  Finally,  event  history 
analysis  can  accommodate  time-varying  covariates 
(TVCs),  which  are  independent  variables  with  dif¬ 
ferent  values  over  time. 

In  event  history,  an  “event”  is  the  primary  phe¬ 
nomenon  of  interest,  and  the  “history  of  such  an 
event”  refers  to  the  duration  leading  up  to  the 
event.  If  we  think  that  history  can  span  across  time 
T,  then  the  probability  of  the  event  occurring  at 
any  given  time  point  t  (t  is  an  element  of  T)  can  be 
written  as  a  probability  density  function  f(t).  Then, 
the  cumulative  probability  at  time  t  is  given  by  F(t) 
and  the  integration  of  f(t)  from  0  to  t.  This  is  the 
probability  of  the  event  having  occurred  between 
time  0  and  t.  Then,  the  probability  of  survival,  or 
the  event  not  having  occurred  up  to  time  t,  can  be 
simply  written  as  S(t)  —  1  —  F(t).  Given  f(t)  and 
S(t),  the  hazard  rate  is  the  probability  of  the  event 
happening  at  time  t  given  the  observation  has  not 
experienced  the  event  until  time  t.  With  the  simple 
notations  outlined  above,  the  hazard  rate  h(t)  is 
equal  to  f(t)/S(t ),  the  conditional  probability  of  an 
event  occurring,  given  that  it  has  not  happened  up 
until  time  t. 

Going  back  to  the  democratization  example,  the 
hazard  rate  at  10  years  after  a  country’s  indepen¬ 
dence  would  be  the  probability  of  democratization 
at  Year  10  divided  by  the  probability  of  democra¬ 
tization  not  having  happened  for  the  past  decade 
in  this  country.  Theoretically,  various  factors 
would  presumably  influence  the  hazard  rate.  For 
instance,  one  may  think  that  the  level  of  economic 
development,  former  colonial  experience,  and 
strength  of  military  force  all  affect  the  hazard  rate, 
as  well  as  the  duration  itself,  and  build  the  event 
history  model  accordingly.  Event  history  analysis 


then  estimates  the  hazard  rate  for  democratization 
as  the  function  of  both  these  covariates  and  time. 
A  baseline  hazard  rate  refers  to  the  hazard  rate  as 
only  a  function  of  time  t. 

Parametric  and  Nonparametric  Approaches 

There  are  various  modeling  options  for  researchers 
who  seek  to  use  event  history  analysis  based  on 
appropriateness  to  the  data  and  background  theo¬ 
ries.  First,  the  choice  between  parametric  and  non¬ 
parametric  approaches  depends  on  how  confident 
researchers  are  of  the  shape  of  the  baseline  hazard, 
which  ideally  is  guided  by  theory.  In  essence,  all 
parametric  models  make  explicit  assumptions 
about  the  shape  of  the  baseline  hazard,  while  non¬ 
parametric  models  do  not  make  such  assumptions. 

Parametric  models  make  assumptions  for  the 
baseline  hazard  rate  once  covariates  are  included  in 
the  model.  There  are  a  wide  variety  of  models,  and 
models  may  be  nested  in  others.  For  instance,  the 
exponential  model  assumes  that  the  baseline  hazard 
is  flat  across  time.  This  would  mean  that  the  prob¬ 
ability  of  an  event  occurring  at  time  t  conditional 
on  the  event  not  having  occurred  is  constant  over 
time.  The  exact  value  then  depends  on  included 
covariates.  The  Weibull  model  is  more  flexible  than 
the  exponential  model,  and  it  allows  that  the  base¬ 
line  hazard  rate  may  be  monotonically  increasing, 
monotonically  decreasing,  or  flat  over  time.  Thus, 
the  Weibull  model  nests  the  exponential  model. 
Both  the  Weibull  and  exponential  models  assume 
proportional  hazards  as  the  changes  of  hazard  rates 
with  changes  of  covariates  are  proportional  to  the 
baseline  hazard  rate.  The  proportional  hazards 
assumption  should  be  tested.  Since  the  Weibull 
model  can  be  flexible,  there  have  been  many  studies 
that  have  used  it.  Yet,  in  some  settings,  the  mono¬ 
tonicity  assumption  may  not  be  appropriate,  and 
for  those  cases  parametric  models  without  a  mono¬ 
tonicity  assumption  can  be  used.  These  models 
include  the  log-logistic  and  the  lognormal  models. 
These  models  allow  hazard  rates  to  first  increase 
and  then  decrease  as  t  passes.  Neither  of  these  mod¬ 
els  has  the  proportional  hazards  property. 

The  generalized  gamma  model  can  be  useful  to 
adjudicate  among  different  parametric  models,  as 
several  parametric  models  are  nested  within  the 
generalized  gamma  model.  The  exponential,  the 
Weibull,  the  lognormal,  and  the  gamma  models 
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are  all  special  forms  of  the  generalized  gamma 
model.  When  one  has  no  a  priori  theoretical  justi¬ 
fication  about  how  the  baseline  hazard  rate  varies 
across  time,  the  generalized  gamma  model  is  likely 
to  be  particularly  useful.  If  the  fit  is  correct,  para¬ 
metric  models  generally  have  smaller  standard 
errors  than  their  semiparametric  counterparts. 
However,  as  parametric  models  require  assump¬ 
tions  about  the  shape  of  the  baseline  hazard,  when 
assumptions  are  not  correct,  the  estimation  will  be 
biased. 

In  comparison  with  the  parametric  models,  non- 
parametric  approaches  do  not  make  assumptions 
about  the  shape  of  the  baseline  hazard.  While  it  is 
true  that  some  parametric  models  are  more  flexible 
than  others,  they  all  still  make  assumptions  about 
the  shape  of  the  baseline  hazard.  Moreover,  more 
flexible  models  require  more  parameters  to  be  esti¬ 
mated,  and  inclusion  of  such  parameters  may  be 
tenuous  and  cumbersome.  Finally,  nonparametric 
models  will  approximate  a  correct  parametric 
model;  if  the  correct  parametric  model  is  the 
Weibull,  nonparametric  estimates  will  closely 
approximate  the  Weibull;  if  the  correct  parametric 
model  is  the  exponential,  nonparametric  estimates 
will  approximate  the  exponential  without  specify¬ 
ing  the  baseline  hazard.  In  general,  given  that  there 
is  little  theoretical  justification  for  the  shape  of  the 
baseline  hazard  in  most  political  science  inquiries, 
there  seems  to  be  a  rising  consensus  that  there  is 
little  to  lose  and  much  to  gain  in  using  nonparamet¬ 
ric  models  in  social  science  applications.  Thus,  it 
may  be  better  to  leave  the  baseline  hazard  unspeci¬ 
fied  and  instead  focus  on  how  covariates  of  interest 
influence  the  hazard  rate.  This  is  the  basic  justifica¬ 
tion  of  the  Cox  proportional  hazards  model,  or 
simply  the  Cox  model.  Time-varying  covariates  are 
easily  handled  in  the  Cox  model  as  well. 

The  Cox  model  estimates  the  hazard  rate  of  the 
/th  observation  at  time  t  as  a  function  of  both  the 
unspecified  baseline  hazard  and  the  covariates.  As 
the  hazard  rate  is  a  product  of  the  function  of  the 
covariates  and  the  baseline  hazard,  the  Cox  model 
is  also  a  proportional  hazards  model,  and  this 
assumption  needs  to  be  checked.  The  Cox  model 
uses  partial  likelihood  methods  to  estimate  coeffi¬ 
cients  for  each  covariate.  The  partial  likelihood 
function  for  some  given  data  is  the  product  of  the 
hazard  rate  of  the  observation  that  experiences  the 
22th  event  over  the  sum  of  all  the  hazard  rates  of 


observations  that  have  not  experienced  the  event, 
22  to  K,  when  n  varies  from  1  to  N,  where  N  is  the 
number  of  observations  that  experience  the  event 
in  the  data,  and  K  is  the  total  number  of  observa¬ 
tions.  Then,  the  partial  likelihood  function  is 
maximized,  and  the  coefficients  are  obtained.  For 
instance,  imagine  a  data  set  with  three  observa¬ 
tions  (K  —  3),  where  only  two  of  them  (N  =  2) 
experience  the  event.  The  partial  likelihood  is  then 
given  by  the  hazard  rate  of  the  observation  that 
experiences  the  event  first  over  the  sum  of  the  haz¬ 
ard  rates  of  all  three  observations  multiplied  by  the 
hazard  rate  of  the  observation  that  experiences  the 
event  second  over  the  sum  of  the  hazard  rates  of 
the  two  remaining  observations. 

As  illustrated,  the  Cox  model  uses  information 
about  which  observation  experiences  the  event 
sooner  than  others.  Thus,  the  interval  between  two 
observations  experiencing  events  does  not  provide 
any  additional  information.  In  this  sense,  the  Cox 
model  is  an  ordered  events  model.  Important 
advances  have  been  made  in  handling  ties,  which 
are  observations  that  experience  the  event  at  the 
same  time,  such  as  the  Breslow,  the  Efron,  and  the 
exact  discrete  approximations. 

Discrete  and  Continuous  Time  Approaches 

Thus  far,  the  discussion  of  event  history  analysis 
implicitly  assumes  continuous  time.  While  time  is 
continuous  in  nature,  in  reality  the  data  sets  that 
researchers  use  are  often  discrete.  Event  history  data 
in  discrete  time  contain  the  same  information  in  dif¬ 
ferent  forms — a  series  of  binary  observations  in 
which  zeros  are  recorded  for  a  nonevent,  and  ones 
are  recorded  for  an  event.  In  such  a  case,  research¬ 
ers  need  to  select  a  probability  distribution  to  cap¬ 
ture  the  binary  decision.  The  logistic  distribution  (as 
in  a  logit  model)  and  the  standard  normal  distribu¬ 
tion  (as  in  a  probit  model)  are  typically  used. 

Duration  dependence  still  needs  to  be  directly 
accounted  for  in  the  discrete  time  model,  which  is 
generally  handled  by  fitting  smoothing  functions 
such  as  spline  functions  or  lowess.  Some  argue  for 
discrete  time  due  to  greater  familiarity  by  most 
social  scientists  with  logit/probit  models.  Others 
argue  for  continuous  time  approaches  since  model¬ 
ing  duration  dependency  does  not  require  extra 
modeling  steps.  Both  discrete  and  continuous  time 
models  are  appropriate  and  adequate. 


Event  History  Analysis  859 


Diagnostics 

Model  fit  and  assumptions  should  be  tested  for 
event  history  models,  just  as  they  are  for  any  other 
statistical  model.  Residual  plots  show  the  differ¬ 
ence  between  the  observed  values  and  the  predicted 
values  from  the  estimated  model,  and  residual  mea¬ 
sures  can  be  used  as  graphical  tests  for  various 
purposes.  Cox- Snell  residuals  are  often  used  to 
evaluate  the  overall  fit  of  the  estimated  model. 
When  Cox- Snell  residuals  plot  roughly  around  the 
45°  reference  line,  the  Cox  model  performs  reason¬ 
ably  well.  When  there  is  a  systematic  deviation  of 
the  plotted  residuals  from  the  reference  line,  then 
there  may  be  some  omitted  variables  or  problems 
with  model  specifications.  But  one  should  also  note 
that  the  Cox- Snell  residuals  are  not  a  definite  evi¬ 
dence  of  how  the  estimated  model  fits. 

Since  the  Cox  model  and  some  parametric  mod¬ 
els  assume  proportional  hazards,  it  is  important  to 
test  this  assumption.  If  it  does  not  hold,  it  is  not 
possible  to  interpret  the  hazard  rate  or  ratio  con¬ 
stant  across  time,  and  additional  modeling  steps 
are  required.  Graphically,  scaled  Schoenfeld  resid¬ 
uals  can  be  used  in  testing  the  proportional  hazards 
assumption.  Scaled  Schoenfeld  residuals  can  be 
thought  of  as  the  difference  between  the  expected 
values  of  the  covariates  and  the  observed  values  of 
the  covariates.  Thus,  when  scaled  Schoenfeld 
residuals  are  plotted  across  time,  one  can  see  if  any 
variable  shows  violations  of  the  proportional  haz¬ 
ards  assumption.  When  scaled  Schoenfeld  residuals 
are  consistently  gathered  around  the  zero  line,  the 
proportional  hazards  assumption  is  likely  to  hold. 
When  residuals  systematically  deviate  from  the 
zero  line  or  show  heterogeneity  across  time,  it  is 
possible  that  the  model  is  misspecified  and  the  pro¬ 
portional  hazards  assumption  might  not  hold. 
There  is  a  global  statistical  test  to  examine  the 
proportional  assumption  as  well.  If  the  global  test 
indicates  that  the  model  may  have  a  violation, 
Harrell’s  rho  can  be  used  to  focus  on  offending 
covariates  individually.  Typically,  an  interaction 
with  time  and  the  offending  covariate  is  included  in 
the  model  to  account  for  the  nonproportionality. 

Extensions 

Additional  extensions  make  event  history  analysis 
even  more  useful.  For  instance,  an  observation 
may  experience  an  event  more  than  once.  It  is  also 


possible  that  an  observation  can  exit  in  more  than 
one  way.  The  model  can  also  be  extended  to 
include  unobserved  heterogeneity  among  observa¬ 
tions,  event  dependence,  or  spatial  dependence. 

Repeated  Events 

Repeated  events  are  simply  those  when  the  obser¬ 
vation  can  experience  the  event  more  than  once, 
such  as  occurs  in  the  study  of  repeated  interstate 
disputes  between  certain  pairs  of  countries.  The 
analysis  of  repeated  events  takes  into  account  the 
fact  that  the  first,  second,  third,  and  such  other 
events  are  not  independent.  Pairs  of  countries  with 
two  prior  interstate  disputes  will  likely  have  a  dif¬ 
ferent  probability  of  having  another  dispute  than 
pairs  of  countries  with  no  such  prior  experiences 
of  having  an  interstate  dispute.  Stratification  by 
event  number  is  an  appropriate  way  to  model 
repeated  events  to  allow  the  hazard  rate  to  vary  by 
event  number. 

Competing  Risks 

Competing  risks  extensions  to  the  basic  Cox  model 
allow  analysts  to  consider  different  types  of  events. 
For  example,  a  member  of  Congress  may  leave 
office  due  to  defeat  in  the  primary  election,  defeat 
in  the  general  election,  running  for  higher  office,  or 
retiring.  Economic  sanctions  may  end  with  either  a 
sanctioning  failure — a  sanctioner  lifting  sanctions 
without  achieving  desired  policy  goals — or  a  sanc¬ 
tioning  success — a  target  accepting  the  demand 
from  the  sanctioner.  These  events  are  distinct,  and 
the  covariate  effects  are  likely  to  differ.  The  usual 
modeling  strategy  for  a  competing  risks  model  is  to 
estimate  separate  models  for  each  event  while 
treating  observations  experiencing  other  events  as 
censored.  One  drawback  of  this  approach  is  that  it 
assumes  independence  between  different  types  of 
events.  Yet,  often,  this  assumption  is  questionable. 
For  instance,  the  hazard  rate  of  sanctioning  failure 
may  depend  on  the  hazard  rate  of  sanctioning  suc¬ 
cess.  The  competing  risks  model  does  not  assume  a 
time  order  for  events  to  occur. 

Unobserved  Heterogeneity 

In  an  ideal  world,  researchers  would  include  all  rel¬ 
evant  covariates  in  the  statistical  model  to  estimate 
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the  precise  effects  of  each  covariate.  But  it  is  often 
not  the  case  that  all  the  relevant  variables  are  avail¬ 
able.  Some  may  be  unmeasurable,  and  others  may 
be  unobservable.  The  frailty  model  extension  of  the 
Cox  model  provides  one  way  to  account  for  unob¬ 
served  heterogeneity  across  observations.  The  frailty 
term  can  be  thought  of  as  an  additional  factor  that 
measures  how  prone  or  “frail”  observations  are  to 
experiencing  the  event.  The  frailty  can  be  modeled 
as  either  a  group-  or  individual-specific  term. 

Event  Dependence 

The  conditional  frailty  model  separates  and 
accounts  for  both  event  dependence  and  heteroge¬ 
neity  in  repeated  events  models.  Event  dependence 
exists  when  the  occurrence  of  one  event  makes 
further  events  more  or  less  likely.  For  example, 
learning  effects  or  damaging  effects  may  make  an 
event  more  or  less  likely  to  occur.  In  short,  the  risk 
of  an  event  may  be  a  function  of  a  prior  event 
occurring.  Separating  out  event  dependence  and 
heterogeneity  is  helpful  to  analysts  methodologi¬ 
cally  in  order  to  draw  more  accurate  substantive 
conclusions. 

Spatial  Dependence 

Finally,  recent  extensions  incorporate  spatial 
dependence  into  event  history  models.  This  is  an 
exciting  development  because  it  offers  an  effective 
approach  to  account  for  spatial  dependence  in 
political  event  processes.  The  model  allows  for 
spatially  correlated  random  effects  at  neighboring 
locations.  For  example,  scholars  have  looked  at 
incorporating  the  proximity  of  civil  unrest  on  the 
timing  of  outbreaks  of  additional  violence  and  the 
diffusion  of  state  policy  adoption  across  states  on 
the  timing  of  those  policy  adoptions. 
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Evidence-based  policy  making  (EBP)  has  developed 
over  the  past  40  years  from  conceptual  roots  that 
lie  in  the  health  care  profession.  Its  deeper  intellec¬ 
tual  roots  can  be  traced  back  even  further  to  the 
Enlightenment,  the  policy  sciences  movement  in  the 
mid-20th  century,  and  efforts  to  address  societal 
ills  through  informed  and  activist  government  in 
the  1960s.  Definitions  of  EBP  may  vary,  but  most 
observers  agree  that  EBP  requires  rational  and 
systematic  processes  for  producing  evidence  to  be 
applied  in  the  policy  making  process.  While  differ¬ 
ences  in  definitions  of  EBP  tend  to  focus  on  varia¬ 
tions  on  this  idea,  there  is  debate  about  the  kinds  of 
evidence  that  should  be  collected,  EBP’s  feasibility 
as  an  actual  guide  for  policy  making,  and  the  stan¬ 
dards  of  evidence  that  it  requires.  In  this  sense,  EBP 
shares  challenges  that  are  similar  to  earlier  move¬ 
ments  that  sought  to  rationalize  policy  making  and 
policy-making  processes. 
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EBP’s  more  recent  conceptual  roots  lie  in  research 
on  health  care  practices.  In  1971,  Archibald 
Cochrane,  a  British  epidemiologist,  highlighted  gaps 
between  the  state  of  medical  knowledge  and  the 
realities  of  medical  practice.  He  advocated  more 
systematic  research  on  patient  care  as  well  as  on  the 
systematic  application  of  knowledge  gained  through 
this  research  to  clinical  practice.  In  the  early  1990s, 
his  hope  for  more  systematic  assessments  of  clinical 
practices  and  their  use  in  guiding  patient  care  was 
realized  when  the  international  Cochrane  Collab¬ 
oration  was  established  in  Oxford,  United  Kingdom 
(UK),  to  execute  systematic  reviews  of  medical  evi¬ 
dence.  The  work  of  the  Cochrane  Collaboration 
and  the  nearby  Centre  for  Evidence-Based  Medicine 
at  John  Radcliffe  Hospital  in  Oxford  caught  the 
attention  of  policymakers  in  the  UK,  the  United 
States,  and  elsewhere. 

Research  on  evidence-based  medicine  gave  rise  to 
practical  applications  in  both  health  care  and  other 
policy  sectors.  As  Sue  Dopson,  Louise  Locock,  John 
Gabbay,  Ewan  Ferlie,  and  Louise  Fitzgerald  (2003) 
note,  in  the  health  care  sector,  the  UK  and  the 
United  States  promoted  the  application  of  evidence- 
based  research  and  practice  in  various  ways,  includ¬ 
ing  the  establishment  of  guidelines  and  research 
centers  to  encourage  the  use  of  systematic  medical 
research  findings  in  clinical  practices.  In  the  UK,  the 
Labour  government  of  the  1990s  embraced  the  EBP 
concept  and  expanded  it  to  other  policy  areas  as  a 
part  of  its  efforts  to  modernize  government.  The 
Labour  Government’s  1999  Cabinet  Office  White 
Paper  on  Modernizing  Government,  for  example, 
highlighted  the  importance  of  evidence  and  research 
in  policy  making  across  the  board,  not  just  in  the 
health  care  arena.  In  an  effort  to  ensure  that  public 
policies  actually  work  to  achieve  their  objectives, 
the  UK  also  increased  funding  for  social  research 
during  this  time  period. 

In  recent  years,  the  phrase  evidence-based  pol¬ 
icy  making  has  been  used  with  increasing  fre¬ 
quency.  For  example,  the  title  of  the  Association 
of  Public  Policy  Analysis  and  Management’s 
(APPAM)  Annual  Research  Conference  in  Wash¬ 
ington,  D.C.,  during  the  fall  of  2009,  was 
“Evidence-Based  Policy  Making  in  the  Post  Bush/ 
Clinton  Era.”  This  title  reflects  recognition  of  the 
importance  of  the  phrase  not  only  in  the  UK, 
where  it  has  been  commonly  used,  but  also  in 
North  America  and  elsewhere  as  well. 


However,  while  the  EBP  terminology  has  devel¬ 
oped  only  in  recent  decades,  the  ideas  underlying  it 
have  been  in  evidence  for  many  years.  Some  trace 
the  roots  of  EBP  to  the  Enlightenment  and  its 
emphasis  on  applying  science  and  knowledge  for 
purposes  of  human  betterment  (Carolyn  Heinrich, 
2007).  In  the  1940s  and  1950s,  Harold  Lasswell’s 
efforts  to  build  a  science  of  policy  through  policy 
analysis  sought  to  encourage  analytically  based 
policy  improvement  efforts  that  are  similar  to 
those  targeted  by  EBP.  And,  in  the  1960s,  govern¬ 
mental  efforts  to  create  a  “Great  Society”  in  the 
United  States  were  predicated  on  the  idea  that 
rationally  constructed  public  policies  could  be 
implemented  in  ways  that  alleviate  societal  ills. 
These  policies,  in  turn,  were  accompanied  by  the 
establishment  of  public  policy  think  tanks  in  uni¬ 
versities  and  elsewhere  that  were  designed  to  pro¬ 
vide  objective  information  to  help  guide  efforts  to 
improve  policy  making  on  an  ongoing  basis. 

The  attention  that  EBP  has  received  is  in  part 
traceable  to  its  grounding  in  basic  concerns  about 
the  application  of  human  knowledge  to  improve 
public  policies,  but  it  also  has  roots  in  the  unique 
circumstances  of  the  past  several  decades.  During 
the  1980s,  growing  skepticism  regarding  the  effec¬ 
tiveness  of  government  programs  led  to  a  search 
for  new  alternatives  that  took  better  account  of 
their  costs  and  the  cost-effectiveness.  These  con¬ 
cerns  contributed  to  the  performance  measurement 
movement  of  the  1 990s  and  a  need  for  government 
programs  that  “work  better  and  cost  less,”  to  bor¬ 
row  a  phrase  from  the  famous  1993  Gore  report  in 
the  United  States.  The  EBP  movement  fit  in  nicely 
with  these  concerns  and  added  a  specific  focus  on 
the  role  of  research  and  evidence  to  the  perfor¬ 
mance  measurement  interests  that  came  to  domi¬ 
nate  public  sector  management  in  the  1990s  and 
the  early  years  of  the  21st  century. 

Challenges  for  Evidence-Based  Policy 

However,  while  the  EBP  movement  coincided  well 
with  the  times  in  which  it  developed,  it  did  not 
escape  the  issues  that  had  plagued  similar  move¬ 
ments  in  the  past.  At  least  three  issues  are  worth 
addressing  in  this  regard. 

First,  the  concept  of  EBP  assumes  that  research¬ 
ers  and/or  policymakers  can  define  what  evidence 
is  important  and  apply  it  appropriately  to  achieve 
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desired  policy  outcomes.  However,  public  sector 
programs  may  seek  to  achieve  more  than  one  goal, 
and  choices  regarding  the  most  appropriate  evi¬ 
dence  to  collect  are  not  always  clear.  The  evidence 
one  needs  to  garner  depends  on  the  goals  one 
chooses  to  pursue  or  assess.  Where  goals  are 
numerous  and  differ  with  respect  to  the  availability 
of  evidence  and  the  ease  with  which  it  can  be  devel¬ 
oped,  the  evidence  produced  may  be  inherently 
biased  toward  goals  that  are  easily  measured.  As  a 
result,  some  argue  that  EBP  and  performance  mea¬ 
surement  more  broadly  are  inherently  biased 
toward  measures  of  efficiency  that  are  more  easily 
quantified  in  monetary  terms  than  measures  of 
other  values,  such  as  those  relating  to  equity,  fair¬ 
ness,  and  democratic  participation.  This  concern  is 
exacerbated  by  the  fact  that  producing  evidence 
often  requires  money,  which  may  be  available  in 
less  abundance  to  those  in  society  who  are  most 
concerned  about  these  other  values. 

Second,  there  are  debates  about  whether  evi¬ 
dence  produced  by  research  efforts  can  be  incorpo¬ 
rated  effectively  into  policy-making  processes. 
Research  takes  time  to  be  done  in  quality  fashion, 
and  time  is  often  in  short  supply  as  policymakers 
make  decisions  to  address  both  perceived  emergen¬ 
cies  and  ambitious  policy  agendas.  There  are  also 
issues  relating  to  the  ability  of  policymakers  to 
access  the  right  or  best  information  as  they  make 
decisions.  Because  policymakers  are  often  general¬ 
ists,  they  may  have  limited  ability  to  grasp  all 
aspects  of  the  evidence  they  receive.  As  a  result, 
they  may  depend  heavily  on  those  who  provide 
them  with  evidence,  and  they  may  ultimately  rely 
more  on  the  perceptions  of  those  who  provide  the 
evidence  than  on  the  evidence  itself — particularly 
when  the  evidence  is  complex  and  subject  to  inter¬ 
pretation.  Finally,  because  policymakers  in  demo¬ 
cratic  political  systems  are  (at  least  theoretically) 
held  accountable  to  their  electorates,  they  may 
perceive  a  need  to  react  to  public  perceptions  of 
evidence  and  risk  rather  than  to  the  evidence  itself. 
And  where  public  perceptions  of  risk  and  evidence 
differ  from  the  actual  evidence  provided  by  experts, 
systems  of  democratic  accountability  may  encour¬ 
age  decisions  based  on  public  perceptions  rather 
than  on  objective  evidence. 

A  third  difficulty  that  is  inherent  in  EBP  relates 
to  decisions  regarding  the  adequacy  of  evidence 
that  is  provided.  Even  within  scientific  disciplines, 


there  is  often  disagreement  among  experts  regard¬ 
ing  the  kinds  of  evidence  that  are  most  appropriate 
for  answering  particular  kinds  of  questions.  This  is 
true  in  the  social  sciences  as  well  as  in  the  natural 
sciences,  and  it  is  particularly  true  when  evidence 
is  being  gathered  to  help  guide  policy  making 
because  of  varying  values  that  may  come  into  play 
in  these  cases.  Some  researchers  argue  that  quanti¬ 
tative  information  is  critically  important  if  one  is 
to  assess  policy  effectiveness  in  objective  fashion, 
while  others  argue  that  more  qualitative  assess¬ 
ments  are  also  required  if  one  is  to  reach  balanced 
conclusions  that  address  a  range  of  legitimately 
held  values  in  society.  As  a  result,  even  where  evi¬ 
dence  is  timely,  appropriate,  and  clear  to  all 
involved,  there  are  likely  to  be  cases  where  differ¬ 
ing  kinds  of  evidence  lead  one  to  differing  conclu¬ 
sions.  And  there  is  certainly  room  for  debate  over 
what  kinds  of  evidence  are  most  appropriate  to 
believe  in  these  contexts. 

In  all  three  of  the  aforementioned  areas,  EBP 
faces  challenges  that  parallel  the  challenges  faced 
by  similar  movements  in  the  past.  While  EBP  has 
the  benefit  of  past  experiences  and  appears  to  be 
particularly  cognizant  of  these  challenges  as  a 
result,  it  is  still  working  to  address  these  questions. 
What  kind  of  evidence  should  be  collected?  How 
should  evidence  be  incorporated  into  policy¬ 
making  processes?  And  what  standards  of  eviden¬ 
tial  quality  should  be  applied?  These  are  not  simple 
questions,  but  they  are  important  ones.  There  is 
reason  to  be  skeptical  of  the  extent  to  which  these 
questions  will  be  answered  adequately  once  and 
for  all,  but — if  one  believes  that  human  intelligence 
and  the  rigors  of  systematic  inquiry  can  yield  ben¬ 
eficial  impacts — there  appears  to  be  little  alterna¬ 
tive  to  addressing  them  in  some  fashion.  In  the 
end,  EBP  and  its  predecessor  movements  focus  on 
bringing  systematic  evidence  to  bear  on  the  policy¬ 
making  process  and — despite  the  questions  raised 
above — this  may  be  the  best  kind  of  process  that  is 
available. 

John  A.  Hoornbeek 
Kent  State  University 
Kent,  Ohio,  United  States 
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The  national  executive — more  usually  referred  to 
as  the  “government”  or,  in  the  United  States,  as 
the  “administration” — is  the  only  political  institu¬ 
tion  that  exists  in  every  country.  There  cannot  be 
a  state  without  a  national  executive,  while  bodies 
such  as  legislatures,  parties,  or  interest  groups 
exist  only  where  the  polity  is,  at  least  relatively 
speaking,  liberal-democratic.  This  entry  concen¬ 
trates  exclusively  on  liberal-democratic  polities, 
which  have  become,  by  the  early  years  of  the  21st 
century,  a  substantial  majority  of  the  200  or  so 
polities  of  the  world;  yet  there  are  still  countries 
without  legislatures,  without  parties,  or  even 
without  interest  groups,  either  because  they  are 
very  traditional,  as  some  of  the  Middle  Eastern 
states,  or  because  a  coup,  typically  of  a  military 
character,  replaced  a  liberal-democratic  political 
system  that  had  been  in  place  for  only  a  few  years. 

Both  the  composition  and  the  power  of  national 
executives  need  to  be  examined  with  care — the 


composition  because  it  varies  appreciably  in  terms 
of  the  relationship  among  government  members 
and  because  of  the  role  of  the  leader  of  the  govern¬ 
ment,  and  the  power  because  of  the  variations  and 
indeed  even  marked  disagreements  over  what  gov¬ 
ernments  can  effectively  achieve  with  whatever 
powers  they  may  formally  have.  This  entry  there¬ 
fore  looks  successively  at  both  aspects;  before 
doing  so,  however,  it  briefly  describes  the  way  in 
which  national  executives  gradually  developed. 

Evolution  of  the  National  Executive 

While  the  proportion  of  liberal-democratic  national 
executives  is  large  in  the  contemporary  world,  as 
noted,  this  was  not  the  case  at  all  in  the  past,  even 
in  much  of  the  20th  century,  let  alone  earlier.  It  is 
worth  noting  that,  in  the  classical  Roman  Republic, 
there  were  a  number  of  separate  executive  positions 
rather  than  a  government — for  instance,  the  posi¬ 
tions  of  consul,  pretor,  or  edile.  A  similar  frame¬ 
work  was  adopted  by  the  few  republics  that  came 
to  exist  during  the  Middle  Ages  and  later,  princi¬ 
pally  in  Italy.  The  idea  of  a  government  prevailed  in 
monarchies,  as  kings  or  queens  came  to  appoint 
“secretaries”  or  “ministers”  to  oversee  the  fields 
that  had  to  be  covered,  such  as  those  of  finance, 
foreign  affairs,  or  defense.  This  model  was  widely 
adopted  in  the  strong — or  absolute — monarchies  of 
16th-,  17th-,  and  18th-century  Europe.  As  against 
this  trend,  the  first  modern — and  liberal — republic, 
that  of  the  United  States,  was  established  by  a  for¬ 
mal  constitution  that  came  into  force  in  1789  and 
proved  to  be  a  great  success;  that  is,  it  endured  and 
was  imitated  early  in  the  19th  century,  as  a  variety 
of  Latin  American  states  became  independent  from 
Spanish  rule. 

On  the  other  hand,  the  French  revolutionary 
republic,  set  up  in  1792,  was  not  a  success:  It  was 
replaced  after  a  few  years  by  Napoleon’s  empire, 
and  the  monarchy  was  reestablished  in  1815.  Yet 
that  monarchy  was  different  in  character  from  the 
one  that  had  ruled  France  for  centuries;  this  was 
the  result  of  the  profound  changes  in  the  direction 
of  liberal  government  that  had  begun  to  occur  in 
Britain  over  a  century  earlier,  from  1688.  Such  a 
move  toward  liberal  government  in  Britain  was 
followed  gradually  after  1815  through  the  course 
of  the  1 9th  century  in  parts  of  Western  Europe  and 
beyond,  including  Japan.  The  evolution  toward  a 
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liberal-democratic  government  was  to  lead  to  the 
emergence  of  a  parliamentary-cabinet  system, 
which  was  to  characterize  much  of  Western 
Europe  and  the  “Old  Commonwealth”  countries 
from  the  end  of  the  19th  century  in  contrast  to  the 
American  presidential  system. 

The  move  toward  liberal  democracy  in  Europe 
was  interrupted  between  the  two  World  Wars, 
however,  as  the  liberal-democratic  model  was 
shaken  and  indeed  replaced  by  dictatorships. 
Revolutionary  Russia,  renamed  the  Soviet  Union, 
became  the  first  Communist  state  from  the  1920s 
to  the  1990s;  in  the  1920s  and  1930s,  right-wing 
dictatorships  were  installed,  by  coups  or  even  as  a 
result  of  elections,  in  many  parts  of  continental 
Europe,  especially  in  Italy,  Germany,  and  Spain,  as 
well  as  in  most  of  the  states  that  had  been  created 
after  World  War  I  in  East-Central  and  Eastern 
Europe.  Moreover,  although  World  War  II  ended 
with  the  victory  of  democracies  in  1945,  only  in 
the  1980s  and  1990s  did  the  trend  toward  liberal 
democracy  become  widespread.  Many  of  the  new 
states  of  Asia  and  Africa  had  indeed  quickly 
become  dictatorships  after  having  acquired  inde¬ 
pendence  from  European  colonial  powers  in  the 
1950s,  1960s,  and  1970s;  the  major  exception 
being  India  that  was  to  remain,  from  1948  onward, 
by  far  the  most  populous  liberal  democracy. 

Structure  and  Composition  of 
the  National  Executive 

The  national  executive  is  the  body  that  governs  the 
state,  both  in  democratic  and  in  nondemocratic 
countries;  exactly  who  composes  that  executive  is 
not  always  entirely  clear,  and  the  relationship 
among  its  members  is  both  varied  and  sometimes 
rather  obscure.  As  was  already  suggested,  liberal- 
democratic  executives  are  often  said  to  be  divided 
into  two  types:  the  presidential  and  the  parliamen¬ 
tary-cabinet  type.  While  that  dichotomy  does  cor¬ 
respond  to  a  major  distinction,  there  are  variations 
within  each  of  the  two  groups.  There  is  also  an 
intermediate  type  known  as  semipresidential,  sin¬ 
gled  out  and  theoretically  developed  by  Maurice 
Duverger  (1980)  and  Robert  Elgie  (1999). 

The  “ideal-type”  presidential  model  is  based  on  a 
sharp  division  among  three  state  powers — executive, 
legislative,  and  judiciary — the  assumption  being  that 
none  of  these  powers  can  encroach  on  the  others,  a 


distinction  that  is  particularly  relevant  with  respect 
to  the  relationship  between  the  executive  and  legis¬ 
lative  powers.  The  model  originated  in  the  United 
States  at  the  time  of  independence;  while  the 
American  constitution  of  1787  (ratified  in  1789) 
has  been  amended  many  times,  it  has  kept  its  main 
original  features.  The  cornerstone  of  the  executive 
is  naturally  the  presidency,  whose  holder  is  elected 
for  4  years  by  the  people  (albeit  indirectly,  as  elec¬ 
tors  vote  for  the  members  of  an  electoral  college 
that  has  as  its  only  function  the  election  of  the 
president).  The  Twenty-Second  Amendment  (1947) 
further  mandated  that  the  president  can  be  reelected 
only  once,  while  in  some  Latin  American  states 
(most  of  which  adopted  a  similar  model)  the  presi¬ 
dent  cannot  serve  two  successive  terms  and  in  some 
cases  cannot  be  reelected  at  all. 

The  U.S.  president  is  free  to  appoint  the  mem¬ 
bers  of  the  administration  at  will,  except  that  the 
upper  house  of  the  legislature — the  Senate — has  to 
approve  these  appointments.  The  top  echelon  of  the 
executive  is  constituted  by  what  is  formally  described 
as  a  cabinet  composed  of  secretaries  in  charge  of  the 
various  departments  of  the  government,  under 
whom  are  assistant  secretaries,  also  appointed  by 
the  president.  There  is  also  an  Executive  Office  of 
the  President,  while  the  closest  advisers  are  part  of 
the  White  House  Office.  All  these  are  appointed  at 
the  “pleasure”  of  the  president  and  rarely  remain  in 
their  position  from  one  presidential  administration 
to  the  next,  even  if  the  incoming  and  outgoing 
presidents  are  members  of  the  same  party. 

Perhaps  the  key  characteristic  of  the  American 
executive  (and  probably  of  at  least  many  of  those 
presidential  systems  modeled  on  the  American  sys¬ 
tem)  is  that  its  members  are  only  loosely  connected 
together  and  often  do  not  know  each  other  well  at 
all;  indeed,  the  president  may  not  have  had  a  close 
relationship  with  at  least  a  number  of  them, 
although  some  of  the  positions  may  be  filled  by 
friends  who  are  rewarded  for  their  help,  in  particu¬ 
lar  during  the  presidential  election  campaign.  This 
does  create  a  situation  that  Hugh  Heclo  (1977)  has 
described  as  a  “government  of  strangers.”  It  also 
follows  that  one  does  not  really  know  precisely  who 
is  part  of  the  government.  Problems  are  particularly 
serious  with  respect  to  the  distinction  between  the 
secretaries  and  their  assistants,  on  the  one  hand, 
and  White  House  staff  members,  on  the  other,  the 
latter  being  often  effectively  in  charge  of  some  of 
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the  functions  of  the  secretaries,  particularly  in  the 
context  of  foreign  affairs,  as  when  Henry  Kissinger 
was  adviser  to  President  Richard  Nixon  before  he 
became  Secretary  of  State  in  the  early  1970s. 

The  structure  and  composition  of  the  national 
executive  are  very  different  in  the  other  models  of 
liberal-democratic  executives;  for  example,  that  of 
the  parliamentary-cabinet  system,  which,  following 
British  developments  since  the  late  17th  century, 
has  been  adopted  widely  in  Europe  and  in  many 
Commonwealth  countries,  as  well  as,  among  oth¬ 
ers,  in  Japan,  Thailand,  Israel,  and  Turkey.  Orig¬ 
inally,  as  was  suggested  earlier,  the  system  allowed 
the  members  of  the  government — typically  a  cabi¬ 
net — to  run  the  affairs  of  the  state,  at  least  nomi¬ 
nally,  on  behalf  of  the  monarch. 

The  cabinet  system  differs  in  two  fundamental 
ways  from  the  presidential  model,  first  in  that  the 
cabinet  must  enjoy  the  “confidence”  of  parliament 
to  be  able  to  be  and  remain  in  office  and,  second, 
in  that  the  cabinet  takes  its  decisions  “collegially”: 
The  prime  minister  is  the  head  of  the  cabinet  and 
therefore  a  part  of  it,  while  American  presidents 
are  not  strictly  speaking  a  part  of  their  cabinet.  As 
a  result,  members  of  the  cabinet  form  a  team:  They 
stand  or  fall  together;  moreover,  given  the  part 
played  by  parliament  in  the  whole  structure,  cabi¬ 
net  members  are  typically  drawn  from  among 
members  of  parliament.  A  further  indirect  conse¬ 
quence  is  that  the  duration  of  a  cabinet  is  not  fixed 
in  advance  but  depends  on  the  maintenance  of 
parliamentary  support. 

Furthermore,  and  again  largely  because  the 
cabinet  needs  the  support  of  a  majority  in  parlia¬ 
ment,  cabinets  may  be — and  often  are — based  on 
coalitions  among  two  or  more  parties.  There  can 
be  single-party  governments  when  one  party  enjoys 
an  absolute  majority  in  parliament  or  when,  as 
occurs  fairly  frequently  in  Scandinavian  countries, 
the  government  commands  minority  party  support 
only  but  has  negotiated  (typically  in  advance)  a 
promise  from  some  of  the  other  parties  that  they 
will  vote  for  the  impending  government.  In  this 
case,  but  also  in  cases  of  coalitions,  formal  agree¬ 
ments  that  aim  at  determining  in  advance  what  the 
policy  of  the  government  will  be  are  needed. 

While  the  president  is  the  head  of  the  executive 
that  he  or  she  has  set  up,  in  parliamentary  govern¬ 
ment  the  prime  minister  does  not  have,  as  a  result 
of  his  or  her  position  alone,  a  formally  dominant 


position  in  the  parliamentary-cabinet  executive. 
This  may  happen,  but  many  situations  prevent  the 
prime  minister  from  exercising  the  kind  of  supreme 
leadership  that  the  president  is  often  said  to  have. 
Yet  the  president  is  also  constrained  in  many  ways, 
despite  the  fact  that  the  expression  presidentializa- 
tion  is  sometimes  used  to  refer  to  very  powerful 
prime  ministers. 

Because  the  presidential  system  is  regarded  as 
somewhat  rigid  (Juan  Linz,  1994),  an  intermediate 
system,  known  as  semipresidentialism,  has  been 
adopted  in  some  countries,  France  in  particular, 
which  is  where  it  has  lasted  the  longest,  in  effect  for 
half  a  century  since  1958;  but  it  has  also  prevailed, 
in  a  less  clear-cut  manner,  admittedly,  in  Finland 
and  in  Portugal,  as  well  as  outside  Europe,  in  Sri 
Lanka,  Taiwan,  and  South  Korea.  The  detailed 
organization  and  practice  of  semipresidential  sys¬ 
tems  varies  appreciably,  but  the  general  principle  is 
that,  alongside  the  president,  who  is  popularly 
elected,  a  cabinet  headed  by  a  prime  minister  is,  as 
in  parliamentary-cabinet  systems,  responsible  to 
parliament.  Finally,  to  be  truly  comprehensive,  the 
case  of  Switzerland  must  be  mentioned,  since  this  is 
the  only  country  in  which  the  national  executive, 
known  as  the  Federal  Council,  which  is  composed 
of  seven  members  elected  for  4  years  by  parliament, 
is  truly  collective. 

To  be  able  to  compare  effectively  these  forms  of 
government,  and  in  particular  presidential  and 
parliamentary-cabinet  systems,  this  entry  turns  to 
an  examination  of  the  power  of  these  bodies,  both 
formally  and  in  practice,  to  direct  the  policies  of  the 
nations  at  the  head  of  which  they  are. 

Power  of  National  Executives 

Executive  power  is  difficult  to  define.  It  is  not 
based  only  on  the  powers  that,  for  example,  con¬ 
stitutions  give  to  these  bodies.  As  a  matter  of  fact, 
most  constitutions  are  somewhat  vague  or  even 
misleading  in  this  respect,  to  the  extent  that  they 
may  suggest  that  legislatures  are  in  charge  of  the 
lawmaking  process.  This  may  be  true  from  a 
purely  constitutional  point  of  view,  but  it  is  rarely 
true  in  practice.  Moreover,  the  word  executives 
conveys  the  impression  that  executives  merely 
execute  policies  or  ideas  that  have  originated  else¬ 
where.  In  reality,  both  in  parliamentary-cabinet 
systems  and,  albeit  to  a  more  limited  extent,  in 
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liberal-democratic  presidential  systems,  legislation 
is  prepared  by  the  government. 

Executives  have  in  reality  to  fulfill  three  different 
functions:  those  of  initiation,  coordination,  and 
implementation.  Initiation  means  that  the  ideas  that 
may  have  been  put  forward  previously  by  parties, 
especially  by  those  that  are  represented  in  the 
government,  as  well  as  by  interest  groups  or  non¬ 
governmental  organizations  (NGOs)  and  indeed  by 
members  of  the  legislature  have  to  be  imple- 
mentable,  both  administratively  and  politically. 
They  must  not  arouse  conflicts  such  that  the  ability 
of  the  government  to  achieve  its  overall  goals  will  be 
affected.  Second,  executives  must  coordinate  the 
policies  that  have  been  initiated,  as  some  of  these 
may  be  closely  connected  to  others;  moreover, 
the  policy-making  process  as  a  whole  has  to  be 
financially  sustainable:  Proposals  may  have  to  be 
postponed,  despite  the  fact  that  they  may  have 
previously  been  announced  as  being  part  of  the  gov¬ 
ernment  program.  Finally,  policies  have  to  be  imple¬ 
mented.  It  is  often  believed  that  implementation  is 
simple  and  can  be  left  to  subordinates.  This  is  true 
only  up  to  a  point,  as  implementation  may  be  diffi¬ 
cult,  for  instance,  if  a  new  organization  has  to  be  set 
up  or  if  a  matter  is  regarded  as  particularly  urgent. 

The  skills  required  of  ministers  to  fulfill  satisfac¬ 
torily  these  three  governmental  functions  are  very 
different.  Initiation  implies  imagination  to  ensure 
that  the  new  policy  will  be  attractive  and  effective, 
coordination  entails  being  able  to  work  closely 
together  with  other  members  of  the  government,  and 
implementation  means  managing  public  service  bod¬ 
ies  that  are  often  very  large  and  whose  permanent 
officials  may  prefer  to  postpone  new  developments. 
Many  members  of  the  executive  may  find  it  difficult 
to  carry  out  all  three  sets  of  activities  well.  The  prob¬ 
lem  is  particularly  serious  in  parliamentary-cabinet 
systems  as  these  are  officially  collective  and  at  least 
collegial.  It  is  in  this  type  of  executive  that  the  mem¬ 
bers  of  the  cabinet — the  ministers — are  expected  to 
have  the  skills  required  to  initiate,  coordinate,  and 
implement  policies. 

The  following  discussion  first  examines  the  case 
of  members  of  the  executive  in  presidential  systems 
and,  specifically,  of  liberal-democratic  presidential 
systems,  such  as  the  American  polity.  As  was  noted 
earlier,  there  is  a  cabinet  in  the  United  States,  but 
that  cabinet  is  not  in  charge  of  the  overall  policy 
making;  the  president  is  in  charge.  The  members  of 


the  cabinet — the  secretaries — are  essentially 
regarded  as  managers  of  policies  that  have  been 
previously  adopted  in  principle  by  the  president: 
They  may  have  helped  frame  these  policies,  together 
with  their  officials  of  the  federal  civil  service,  but 
the  decision  to  proceed  has  to  be  given  by  the 
president.  Coordination  is  also  likely  to  be  the  prov¬ 
ince  of  the  Executive  Office  of  the  President,  where 
the  Office  of  Management  and  Budget  is  located. 

Yet  each  president  tends  to  hold  different  views 
about  how  the  whole  executive  is  to  be  organized 
as  well  as  about  the  ways  in  which  the  various 
offices  are  to  relate  to  each  other;  moreover,  as  we 
noted,  the  secretaries  are  likely  to  be  strangers  to 
each  other  and  to  the  departments  that  they  run. 
As  Shirley  Warshaw  (1996)  has  noted,  there  have 
therefore  been  substantial  variations  from  presi¬ 
dent  to  president  about  the  process  of  policy  mak¬ 
ing,  and  the  results  have  often  been  unsatisfactory. 

These  kinds  of  difficulties  seem  to  be  in  the 
nature  of  the  liberal-democratic  presidential  sys¬ 
tem,  especially  when  that  executive  is  very  large,  as 
it  is  in  the  case  of  the  American  federal  govern¬ 
ment;  yet  two  further  sets  of  problems  arise  almost 
inevitably.  The  first  has  to  do  with  the  fact  that 
there  is  clearly  some  ambiguity  about  the  effective 
power  of  the  executive  with  respect  to  the  initia¬ 
tion  of  policies,  especially  when  these  require  new 
legislation.  As  in  liberal-democratic  presidential 
systems,  the  legislature  is  elected  independently 
from  the  executive,  and  it  tends  to  maintain  sub¬ 
stantial  control  over  the  proposed  bills  that  are 
submitted,  including  those  that  the  president  sends; 
it  also  exercises  much  independent  control  over  the 
budget,  in  terms  of  both  income  and  expenditure. 
There  is  therefore  a  clear  limit  to  the  influence  that 
presidents  can  exercise.  They  may  cajole  and 
induce;  they  cannot  threaten. 

Second,  at  the  other  end  of  the  chain,  there  is 
also  a  limit,  in  practice,  if  not  formally,  to  what  the 
president  and  the  secretaries  can  extract,  so  to 
speak,  from  the  departments.  Most  departments 
cherish  some  kinds  of  policy  developments  more 
than  others;  meanwhile,  they  cultivate  close  rela¬ 
tionships  with  some  members  and  in  particular 
some  committees  of  the  Congress;  this  enables  the 
officials  of  these  departments  to  obtain  financial 
support  for  their  requests.  As  a  result,  the  top  mem¬ 
bers  of  the  executive — the  secretaries — are  often 
likely  to  find  it  difficult  to  change  the  direction  of 
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the  policies  of  their  departments.  In  the  late  1960s 
and  early  1970s,  for  instance,  President  Lyndon 
Johnson,  who  was  a  policy  expansionist,  and 
President  Richard  Nixon,  who  was  in  favor  of 
retrenchment,  experienced  problems  in  convincing 
departments  to  achieve  their  own  goals  in  view  of 
what  can  be  regarded  as  the  policy  “habits”  of  the 
departments  of  the  federal  government. 

Thus,  in  practice,  there  are  clear  limitations  to 
the  power  that  the  president  and  the  executive  in 
general  can  exercise  in  a  liberal-democratic  presi¬ 
dential  system  such  as  that  of  the  United  States. 
Such  limitations  do  not  occur  to  the  same  extent  in 
those  presidential  systems  in  which  the  presidential 
powers  are  less  constrained  by  the  constitution,  as 
stressed  by  Mathew  Shugart  and  John  Carey 
(1992).  Most  political  scientists  believe  that  there 
were  fewer  limitations  to  the  effective  power  of  the 
executive,  seemingly,  in  the  few  liberal-democratic 
semipresidential  systems  that  existed  in  the  begin¬ 
ning  of  the  21st  century,  of  which  France  is  often 
regarded  as  the  best  example.  Finally,  the  execu¬ 
tive  appears  to  be  least  constrained  of  all  in  parlia¬ 
mentary-cabinet  systems  principally  because  that 
executive,  more  than  the  American  executive,  usu¬ 
ally  controls  the  legislature. 

In  parliamentary-cabinet  systems,  the  govern¬ 
ment  does  indeed  need  majority  support  of  the 
legislature  to  remain  in  existence  and,  by  and 
large,  that  support  exists  because  it  is  sustained  by 
one  or  more  parties  that  constitute  the  majority. 
According  to  Jean  Blondel  and  Maurizio  Cotta 
(2000),  this  is  why  that  type  of  executive  has  often 
been  described  as  being  based  on  party  govern¬ 
ment.  Since  there  is  a  close  relationship  between 
the  executive  and  the  legislature  in  the  parliamen¬ 
tary-cabinet  system,  the  cabinet  is  able  to  exercise 
enough  pressure  on  the  legislature  to  obtain  sup¬ 
port  at  any  rate  for  a  very  large  proportion  of  the 
bills  that  it  introduces  in  parliament  as  well  as  for 
a  very  large  proportion  of  the  financial  require¬ 
ments  that  it  submits  for  approval.  This  does  not 
mean  that  parliament  can  never  have  its  way  on 
any  issue  or  on  any  budgetary  provisions;  there  are 
indeed  differences  from  country  to  country  about 
the  extent  to  which  parliament  can  induce  the 
executive  to  modify  its  policies,  but  by  and  large, 
across  Western  and  even  East-Central  Europe,  as 
well  as  in  those  countries  of  the  Commonwealth 
that  are  liberal-democratic  parliamentary  systems, 


the  executive  is  broadly  speaking  able  to  achieve 
its  goals.  If  this  is  not  achieved,  there  will  be  a 
governmental  “crisis”:  The  prime  minister  is  likely 
at  least  to  threaten  to  resign  and,  indeed,  in  some 
cases,  the  threat  is  implemented. 

The  power  of  the  executive  in  parliamentary 
cabinets  of  a  liberal-democratic  kind  is  thus  large; 
yet  it  does  not  follow  that  all  the  members  of  these 
cabinets,  let  alone  all  the  members  of  the  execu¬ 
tive,  including  those  who  are  below  cabinet  rank, 
can  exercise  an  equal  fraction  of  that  overall 
power.  It  has  been  pointed  out  by  Michael  Laver 
and  Kenneth  Shepsle  (1996)  that  ministers  are 
often  more  concerned  about  what  goes  on  in  the 
department  that  they  run  than  about  overall  gov¬ 
ernmental  policy.  Although  all  members  of  the 
cabinet  formally  share  the  overall  power  of  initia¬ 
tion  that  was  referred  to  earlier,  many  do  not  exer¬ 
cise  that  power  in  fact  at  cabinet  meetings.  In 
parliamentary-cabinet  systems,  these  meetings  take 
place  frequently  (typically,  once  a  week);  yet  they 
would  be  overburdened  if  every  one  of  the  15  to 
20  members  of  the  cabinet  were  to  participate 
actively  on  every  issue.  The  result  is  that,  by  and 
large,  the  prime  minister  exercises  more  than  the 
usually  recognized  role  of  “first  among  equals” 
and  indeed  that  some  prime  ministers,  in  Britain 
but  also  elsewhere,  are  regarded  as  being  “presi¬ 
dential,”  wrongly  in  a  sense  because,  as  was  noted 
earlier,  presidents  of  liberal-democratic  systems 
are  confronted  with  the  substantial  power  of  the 
legislature  and  with  the  tendency  of  departments 
to  be  somewhat  autonomous  in  the  way  they  con¬ 
duct  their  affairs.  The  power  and  strength  of  prime 
ministers  do  indeed  vary  appreciably,  partly  across 
countries,  but  partly  also  as  a  result  of  the  extent 
to  which  they  wish  to  be  activists,  in  leading, 
indeed  possibly  in  forcing,  members  of  their  cabi¬ 
net  to  follow  their  line  (Brian  Farrell,  1971; 
Anthony  King,  1994).  It  is  nonetheless  usually 
recognized  that  prime  ministers  of  parliamentary 
cabinets  enjoy  a  degree  of  power  over  political 
developments  in  their  countries  that  exceeds  that 
of  presidents  of  liberal-democratic  systems. 

Conclusion 

Executives  in  the  contemporary  world  are,  by 
and  large,  very  powerful,  even  if  they  head 
liberal-democratic  systems  in  which  there  are 
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constitutional  and  customary  limitations  to  what 
governments  can  do — that  is  to  say,  their  members 
are  constrained  by  “veto  players,”  as  George 
Tsebelis  (2002)  called  them.  Admittedly,  some 
decisions  may  be  slow  to  take  because  of  the  influ¬ 
ence  that  interest  groups  and  NGOs  are  able  to 
exert  and  of  the  consequential  unwillingness  of 
governments  to  attempt,  or  attempt  quickly,  to 
override  the  proposals  of  these  bodies. 

Yet  it  is  not  the  case  that  liberal-democratic 
governments  cannot  achieve  what  they  propose  to 
do,  especially  if  these  proposals  are  part  of  the 
program  that  was  presented  to  the  electorate  at  the 
previous  election.  While  presidents  in  presidential 
systems  of  the  liberal-democratic  variety  are  typi¬ 
cally,  especially  in  the  American  case,  not  closely 
constrained  by  these  election  programs,  parliamen¬ 
tary-cabinet  governments  are;  this  constraint  limits 
effectively  the  power  that  these  executives  would 
be  able  to  exercise  if  they  were  not  party  govern¬ 
ments.  There  is  thus  little  danger  that  governments 
should  become  unable  to  act  because  of  the  consti¬ 
tutional  or  customary  limitations  under  which  they 
operate.  The  limitations  to  which  they  have  to  pay 
strong  attention  stem  from  the  part  that  the  civil 
society  at  large  is  able  to  take  in  a  democratic  soci¬ 
ety,  and  it  is  surely  right  that  it  should  be  so  if 
liberal-democratic  systems  are  to  deserve  their 
name. 
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Experiments,  Field 


Field  experimentation  brings  the  rigorous  estima¬ 
tion  of  causal  effects  that  is  the  hallmark  of  exper¬ 
imentation  to  more  naturalistic  settings.  This  entry 
begins  by  defining  field  experimentation.  It  then 
describes  the  early  history  of  political  science  field 
experimentation  and  discusses  more  recent  devel¬ 
opments.  It  concludes  with  a  discussion  of  some  of 
the  weaknesses  of  field  experimentation. 

Definition 

Experimentation  is  a  method  for  obtaining  unbi¬ 
ased  estimates  of  causal  effects.  In  social  science 
experiments,  the  unit  of  observation  is  randomly 
assigned  to  different  interventions,  and  the  effect  of 
the  intervention  is  measured  by  comparing  out¬ 
comes  of  interest  across  the  randomly  assigned 
groups.  Field  experiments  are  experiments  that  take 
place  in  real-world  settings  and  attempt  to  repro¬ 
duce  the  environment  in  which  the  phenomenon  of 
interest  naturally  occurs.  In  contrast,  laboratory 
experiments  are  often  highly  stylized  representa¬ 
tions  of  the  social  behaviors  the  investigators  wish 
to  learn  about  through  their  experiments,  leading 
to  concern  that  the  results  from  the  laboratory  will 
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not  apply  to  behavior  more  generally.  The  realism 
of  field  experiments  is  intended  to  minimize  con¬ 
cerns  about  the  external  validity,  or  the  generaliz- 
ability,  of  the  experimental  results. 

Experiments  have  many  components,  which  may 
differ  in  their  degree  of  realism,  leading  to  a  blur¬ 
ring  of  the  distinction  between  what  is  or  is  not  a 
field  experiment.  Economists  Glenn  Elarrison  and 
John  List  propose  a  four-category  taxonomy  to 
distinguish  field  experiments  from  laboratory 
experiments.  According  to  this  taxonomy,  a  con¬ 
ventional  laboratory  experiment  uses  a  standard 
subject  pool  (i.e.,  drawn  from  the  university)  and  an 
abstract  framing  of  the  problem  being  investigated. 
Examples  of  this  would  be  laboratory  studies  of 
play  in  dictator  and  ultimatum  games.  Two  inter¬ 
mediate  categories  of  experiments,  which  move  in 
the  direction  of  greater  realism,  are  the  artifactual 
and  the  framed  field  experiments.  An  artifactual 
field  experiment  deviates  from  the  conventional 
laboratory  experiment  through  the  use  of  nonstan¬ 
dard  subjects.  For  example,  researchers  have  inves¬ 
tigated  ethnic  conflict  by  performing  standard  labo¬ 
ratory  games,  such  as  the  dictator  game,  in  Africa, 
with  subjects  drawn  from  groups  with  a  history  of 
ethnic  rivalry.  The  framed  field  experiment  moves 
further  away  from  the  conventional  laboratory 
experiment.  This  type  of  experiment  uses  realistic 
subjects  (as  does  the  artifactual  experiment);  how¬ 
ever,  in  contrast  to  the  artifactual  experiment,  the 
task  and  context  is  also  more  realistic.  An  example 
of  a  framed  field  experiment  in  political  science  is 
research  on  the  effect  of  money  on  legislative  access, 
in  which  congressional  staffers  make  hypothetical 
scheduling  decisions  based  on  information  they  are 
provided  about  the  individual  seeking  time  with  the 
representative.  A  natural  field  experiment,  which  is 
the  design  often  referred  to  simply  as  a  “field  exper¬ 
iment”  in  political  science,  is  the  same  as  a  framed 
field  experiment,  except  that  the  environment  is  the 
one  where  the  subjects  naturally  perform  the  task  in 
question  and  the  subjects  are  unaware  that  they  are 
in  an  experiment.  Research  in  which  political  cam¬ 
paigns  randomly  assign  households  to  receive  dif¬ 
ferent  campaign  mailings  to  test  the  effect  of  alter¬ 
native  communications  on  voter  turnout  is  an 
example  of  a  natural  field  experiment. 

This  entry  focuses  on  natural  field  experiments. 
Although  the  degree  of  naturalism  in  field  experi¬ 
ments  is  the  greatest  strength  of  the  method,  it  is 


important  to  remember  that  the  goal  of  experi¬ 
mental  interventions  is  to  estimate  a  causal  effect, 
not  to  achieve  “realism.”  If  the  researcher  aims  to 
capture  basic  psychological  processes  that  do  not 
vary  across  populations,  experimental  contexts,  or 
subject  awareness  of  the  experiment,  then  there  is 
no  problem  with  conventional  laboratory  studies. 
That  said,  understanding  behavior  in  natural  envi¬ 
ronments  is  frequently  the  ultimate  goal  of  social 
science  research,  and  it  is  hard  if  not  impossible  to 
even  recognize  the  full  set  of  threats  to  external 
validity  present  in  artificial  contexts,  let  alone 
adjust  the  measured  experimental  effects  and  mea¬ 
sures  of  uncertainty  to  account  for  these  threats. 

Field  Experiments  in  Political  Science 

The  earliest  field  experiments  in  political  science 
were  performed  in  the  1920s  by  Harold  Gosnell, 
who  sought  to  measure  the  effect  of  get-out-the- 
vote  appeals  in  the  1924  presidential  and  1925 
mayoral  elections  in  Chicago.  Over  the  next 
30  years,  researchers  conducted  related  field  exper¬ 
imental  work  on  the  relative  effectiveness  of  emo¬ 
tional  versus  rational  political  appeals  and  the 
turnout  effects  of  different  modes  of  communica¬ 
tion  (mail,  phone,  and  visits).  This  early  literature 
appears  to  have  been  inspired  by  the  development 
and  application  of  experimental  methodology  in 
agriculture,  medicine,  and  other  fields.  These  pio¬ 
neering  field  experiments,  however,  had  limited 
impact  on  subsequent  research  strategies.  With 
rare  exceptions,  there  was  no  follow-up  work 
using  the  experimental  method  in  real-world  set¬ 
tings  to  replicate  the  earlier  experiments,  to  extend 
the  voter  mobilization  studies  to  new  settings  or 
communications  methods,  or  to  apply  the  method 
to  new  questions.  Rather,  when  considering  alter¬ 
native  research  designs,  field  experimentation  was 
either  ignored  or  dismissed  due  to  skepticism 
about  whether  real-world  experiments  were  feasi¬ 
ble.  Indeed,  no  field  experiment  appeared  in  a 
major  political  science  journal  in  the  1990s. 

In  recent  years,  there  has  been  a  significant 
revival  of  field  experimentation  in  several  disci¬ 
plines,  most  notably  in  economics  and  political 
science.  This  turn  to  field  experimentation  follows 
important  intellectual  developments  across  the 
social  sciences.  During  the  1980s,  there  was 
increasing  appreciation,  especially  in  economics,  of 
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the  difficulty  in  estimating  causal  effects  from  stan¬ 
dard  observational  data  due  to  bias  caused  by 
unobservable  factors.  Researchers  were  increasingly 
attracted  to  naturally  occurring  randomizations  or 
near-randomized  applications  of  a  “treatment”  that 
could  provide  data  where,  as  in  a  planned  experi¬ 
ment,  there  is  no  expected  association  between  the 
treatment  and  the  unobservables.  Examples  of  this 
work  include  Joshua  Angrist’s  study  of  the  effect 
of  Vietnam  War  service  on  wages,  where  an  indi¬ 
vidual’s  draft  lottery  number  altered  the  likelihood 
of  service,  and  Steven  Levitt’s  use  of  variation  in 
policing  levels  associated  with  election  cycles  to 
estimate  the  effect  of  police  on  crime.  This  brand 
of  research  was  characterized  by  enormous  cre¬ 
ativity.  However,  naturally  occurring  randomiza¬ 
tions  or  near  randomizations  are  rare,  perhaps  as 
rare  as  the  talent  required  to  spot  them,  and  valid 
instrumental  variables  must  satisfy  rigorous 
requirements.  In  contrast,  randomized  experimen¬ 
tal  interventions  permit  scholars  to  produce  exog¬ 
enous  variation  in  the  treatment  variable  by 
design,  avoiding  the  need  to  discover  a  fortuitous 
naturally  occurring  randomization  or  make  the 
often  questionable  assumptions  required  by  other 
statistical  designs.  A  further  advantage  of  planned 
experiments  is  that  the  researcher  knows  which 
subjects  are  assigned  to  be  treated.  This  is  fre¬ 
quently  not  the  case  in  observational  studies.  In 
particular,  for  studies  based  on  survey  data,  treat¬ 
ment  assignment  is  typically  measured  through 
self-reports,  introducing  the  possibility  of  bias  due 
to  measurement  error. 

The  modern  revival  of  field  experiments  in 
political  science  can  be  traced  to  Alan  Gerber  and 
Donald  Green,  who  in  2000  examined  the  effect  of 
voter  mobilization  in  New  Haven,  Connecticut,  on 
the  November  1998  elections.  Gerber  and  Green 
measured  the  turnout  effects  of  nonpartisan  face- 
to-face  canvassing,  phone  calls,  and  mailings. 
Since  this  study,  numerous  field  experiments  have 
measured  the  effects  of  political  communications 
on  voter  turnout.  Several  replicate  the  basic  design 
of  the  New  Haven  study  and  measure  the  effect  of 
canvassing,  phone,  or  mail  in  new  political  con¬ 
texts,  including  other  countries.  Others  examine 
new  modes  of  communication  or  variations  on  the 
simple  programs  used  in  New  Haven,  such  as  an 
analysis  of  the  effect  of  phone  calls  or  contacts  by 
communicators  matched  to  the  ethnicity  of  the 


household,  repeat  phone  calls,  or  partisan  televi¬ 
sion  and  radio  broadcasts.  Recent  work  has  begun 
to  consider  the  effects  of  newer  technologies,  such 
as  e-mail  and  text  messaging,  or  novel  approaches 
to  mobilization,  such  as  election  day  parties  at 
polling  places.  Meta-analysis  of  the  results  of  doz¬ 
ens  of  canvassing,  mail,  and  phone  studies  is 
reported  in  a  quadrennial  literature  review,  “Get 
Out  the  Vote.”  Overall,  the  results  have  been  con¬ 
sistent  with  the  initial  New  Haven  findings. 
Canvassing  has  a  much  larger  effect  than  less  per¬ 
sonal  modes  of  communication,  such  as  phone  and 
mail.  The  effect  of  brief  commercial  calls,  such  as 
those  studied  in  New  Haven,  and  nonpartisan 
mailings  appear  to  be  less  than  1  percentage  point 
while  canvassing  boosts  turnout  by  about  7  per¬ 
centage  points  in  a  typical  election. 

In  recent  years,  field  experimentation  has  spread 
beyond  measuring  the  effects  of  various  interven¬ 
tions  on  voter  turnout  to  address  other  aspects  of 
political  behavior,  to  assess  institutional  perfor¬ 
mance,  and  to  consider  broader  theoretical  issues 
such  as  social  influence,  norm  compliance,  collec¬ 
tive  action,  and  interpersonal  influence.  For 
instance,  one  recent  application  of  field  experi¬ 
ments  to  investigate  theoretical  constructs  in  politi¬ 
cal  behavior  uses  a  field  experiment  to  measure  the 
causal  effect  of  partisanship.  Researchers  sent  mail¬ 
ings  informing  unaffiliated,  registered  voters  of  the 
need  to  affiliate  with  a  party  to  participate  in  the 
upcoming  closed  primary  in  Connecticut.  They 
found  that  the  mailings  increased  formal  party 
affiliation,  and  a  posttreatment  survey  found  a  shift 
in  partisan  identification  as  well  as  a  shift  in  politi¬ 
cal  attitudes.  Another  study  measures  social 
spillover  effects  of  political  communications  and 
finds  that  spouses  and  roommates  of  those  who  are 
contacted  during  a  voter  mobilization  drive  are  also 
more  likely  to  vote.  Other  scholars  have  investi¬ 
gated  the  effect  of  social  pressure  on  norm  compli¬ 
ance  by  measuring  the  effects  of  alternative  mailings 
that  exert  varying  degrees  of  social  pressure.  These 
findings  indicate  that  a  pre-election  mailing  listing 
the  recipient’s  own  voting  record  and  a  mailing  list¬ 
ing  the  voting  record  of  the  recipient  and  their 
neighbors  cause  a  dramatic  increase  in  turnout. 
Follow-up  research  uses  a  postexperiment  survey  to 
measure  whether  this  treatment  response  varies 
with  subject  personality.  A  novel  study  of  elite 
behavior  examines  the  effect  of  a  lobbying  effort  on 
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a  bill  in  the  New  Hampshire  legislature  and  finds 
that  a  random  e-mail  from  an  interest  group  causes 
a  statistically  significant  increase  in  roll  call  voting 
for  the  sponsor’s  measure. 

Although  the  initial  studies  in  the  modern  wave 
of  field  experiments  were  entirely  by  American 
politics  specialists,  comparative  politics  and  inter¬ 
national  relations  scholars  are  now  applying  the 
method  to  important  questions  in  these  fields.  For 
example,  one  study  compares  broad  policy  versus 
narrow  clientelistic  campaign  messages  in  a  2001 
Benin  election.  Another  study  examines  the  effect 
of  election  monitors  on  vote  fraud  levels.  In  a 
study  of  collective  action  and  social  capital  con¬ 
ducted  following  civil  war  in  Liberia,  researchers 
found  that  foreign  aid  for  community-driven 
reconstruction  projects  increases  social  cohesion  as 
measured  by  behavior  in  public  goods  games.  A 
recent  example  of  work  that  examines  institutional 
performance  compares  the  performance  of  alterna¬ 
tive  institutions  for  the  selection  of  a  public  good 
in  Indonesia.  This  research  finds  that  while  more 
participatory  institutions  do  not  change  the  set  of 
projects  approved,  participants  are  more  satisfied 
with  the  decision-making  process.  A  recent  study 
examined  the  effect  of  legislative  performance 
report  cards  on  representatives’  attendance  records 
in  Uganda.  They  report  that  instituting  this  method 
of  showing  legislators’  attendance  to  their  constit¬ 
uents  results  in  higher  levels  of  parliamentary 
attendance  rates.  This  work  highlights  an  impor¬ 
tant  contrast  between  field  experimentation  and 
observational  methods:  Field  experiments  may  be 
used  to  study  both  common  real-world  phenom¬ 
ena  (such  as  campaign  television  commercials  or 
the  effect  of  election  monitors)  as  well  as  novel 
interventions  for  which  there  are  no  observational 
counterparts  (unusual  mailings  or  legislative  report 
cards  in  developing  countries).  For  novel  interven¬ 
tions,  there  are  no  naturally  occurring  parallels  to 
the  intervention,  and  so  no  observational  study  is 
even  possible. 

In  addition  to  these  and  other  substantive  appli¬ 
cations,  field  experiments  have  increasingly  been 
used  to  assess  the  performance  of  nonexperimental 
methods.  Following  the  example  of  Robert 
LaLonde’s  study  of  job  training  programs,  data 
from  experimental  studies  can  be  reanalyzed  using 
observational  techniques.  The  performance  of  the 
observational  method  can  then  be  evaluated  by 


comparing  the  results  with  the  unbiased  estimates 
obtained  when  the  analyst  uses  knowledge  of  the 
randomized  treatment  assignment.  Applying  this 
approach  to  an  experimental  study  of  voter  mobi¬ 
lization  phone  calls,  researchers  compared  the 
experimental  and  observational  estimates  of  the 
effect  of  the  intervention.  They  find  that  regression 
analysis  and  exact  matching  both  dramatically 
overestimate  the  treatment  effect. 

Some  Weaknesses  of  Field  Experimentation 

The  combination  of  unbiased  estimation  and  a 
natural  context  makes  field  experimentation  very 
attractive,  but  the  method  has  some  important 
drawbacks.  Among  the  various  criticisms,  three 
are  focused  on  here.  First,  field  experiments  are 
typically  more  costly  and  difficult  to  implement 
than  conventional  laboratory  experiments  or  stan¬ 
dard  observational  studies.  A  field  experiment 
designed  to  test  the  effects  of  face-to-face  canvass¬ 
ing  involves  recruiting  and  training  canvassers  and 
then  sending  out  canvassers  to  deliver  mobilization 
messages,  perhaps  in  the  rain,  to  subjects  who  may 
or  may  not  be  home.  Following  the  election,  the 
data  must  be  merged  with  the  postelection  voter 
file.  It  is  logistically  more  challenging  than  inviting 
subjects  into  your  university  laboratory  to  listen  to 
a  script  and  then  state  their  vote  intention.  Scholars 
have  analyzed  the  design  of  experiments  to  improve 
their  statistical  efficiency,  an  especially  important 
concern  for  field  experiments  facing  low  compli¬ 
ance  rates. 

Second,  in  contrast  to  the  laboratory  environ¬ 
ment,  interaction  between  subjects  cannot  be  con¬ 
trolled  in  the  field  environment,  which  leads  to  the 
possibility  that  the  outcomes  for  subjects  will  be 
influenced  by  the  treatment  status  of  other  sub¬ 
jects.  For  example,  a  study  comparing  treatment 
group  and  control  group  households  will  be  biased 
if  those  who  are  treated  interact  with  those 
assigned  to  the  control  group  and  in  turn  alter  the 
behavior  of  those  in  untreated  households.  These 
contamination  effects  can  be  minimized  by  mea¬ 
suring  the  outcome  soon  after  treatment  or  treat¬ 
ing  a  small  fraction  of  the  subject  pool.  Furthermore, 
contamination  effects  can  themselves  become  the 
object  of  study,  either  by  asking  about  interactions 
across  subjects  in  a  posttreatment  survey  or  by 
measuring  the  effects  of  the  proximity  of  control 
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group  subjects  to  those  randomly  treated.  These 
spillover  effects  are  of  course  endemic  in  the  data 
routinely  used  in  observational  studies,  though  this 
complication  is  rarely  noted. 

Third,  the  set  of  field  experiments  that  make  it 
into  print  may  suffer  from  selection  bias. 
Publication  bias  is  a  general  danger,  but  there  are 
several  reasons  for  special  concern  regarding  field 
experiments.  To  offset  the  costs  of  field  experi¬ 
ments,  researchers  may  choose  to  collaborate 
with  nonacademic  organizations  or  groups  that 
undertake  the  activities  being  studied.  This  poses 
two  dangers.  First,  the  subset  of  organizations 
wishing  to  participate  and,  therefore,  the  results 
of  a  study  of  their  activities  may  be  unrepresenta¬ 
tive  of  all  active  organizations.  Bias  will  follow 
when  the  willingness  to  be  studied  is  a  function  of 
the  expected  efficacy  of  the  group’s  efforts. 
Second,  and  perhaps  more  significantly,  after  the 
experiment  has  been  performed  and  analyzed, 
organizations  may  wish  to  keep  the  results  as 
trade  secrets  or  to  suppress  negative  findings.  A 
final,  and  related  concern,  is  that  field  experi¬ 
ments  typically  require  more  effort  to  replicate 
than  the  usual  laboratory  experiment.  In  princi¬ 
ple,  when  a  researcher  reads  an  interesting  social 
psychology  paper,  he  or  she  can  easily  attempt  to 
reproduce  the  same  finding  in  his  or  her  psychol¬ 
ogy  lab  at  a  small  cost  and  with  little  effort.  In 
contrast,  if  field  experiments  appear  only  inter¬ 
mittently,  it  may  be  some  years  before  a  mislead¬ 
ing  finding  is  corrected  by  the  accumulation  of 
contrary  evidence. 

Alan  S.  Gerber 
Yale  University 
New  Haven,  Connecticut,  United  States 
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Laboratory  experiments  are  used  in  political  sci¬ 
ence  to  empirically  test  the  results  and  assump¬ 
tions  of  formal  models.  These  types  of  experi¬ 
ments  are  conducted  using  human  subjects  in  a 
specific  location,  usually  in  a  computer  labora¬ 
tory,  where  the  experimenter  has  complete  control 
over  the  experimental  environment.  These  types 
of  experiments  often  use  a  computer  interface  to 
deliver  the  experimental  stimuli  to  subjects.  For 
example,  laboratory  experiments  are  used  to 
empirically  test  formal  models  of  elections,  com¬ 
mittees,  bargaining,  and  other  political  topics. 
This  entry  presents  some  of  the  special  features 
and  advantages  of  this  method. 

Formal  models  are  mathematically  defined  and 
derived  by  assumptions  where  the  assumptions 
(sometimes)  result  in  an  equilibrium  prediction.  In 
formal  models,  the  assumptions  are  of  two  types: 
assumptions  about  institutional  and  other  exoge¬ 
nous  factors  such  as  voting  rules,  bargaining  pro¬ 
cedures,  preference  distributions,  and  so  forth,  and 
assumptions  about  the  behavioral  choices  of  the 
political  actors  in  the  context  of  these  exogenous 
factors.  The  model  makes  assumptions  about  both 
electoral  institutions  and  voters’  rationality,  and 
the  predictions  are  behavioral. 

Laboratory  experiments  are  ideal  to  test  formal 
models  since  the  assumptions  of  the  model  can  be 
operationalized  in  the  laboratory,  and  human  sub¬ 
jects  can  be  used  to  play  the  role  of  actors  within 
the  model.  Hence,  researchers  can  empirically 
determine  how  the  assumptions  and  predictions  of 
the  model  hold  up  when  human  subjects  play  the 
game.  In  laboratory  experiments,  researchers  are 
able  to  establish  controls  over  the  experimental 
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environment  that  allow  them  to  establish  and  mea¬ 
sure  causality,  which  is  how  one  variable(s)  affects 
another  variable(s).  In  experiments,  causality  is 
established  by  specifying  different  control  vari¬ 
ables  and  then  having  subjects  randomly  assigned 
to  various  treatments.  For  instance,  if  a  researcher 
is  interested  in  how  the  cost  of  voting  affects 
voting  behavior,  the  researcher  can  specify  two 
treatments — one  with  a  voting  cost  and  another 
without  a  voting  cost.  Then,  holding  all  other 
parameters  of  the  experiment  constant,  it  is  possi¬ 
ble  to  compare  the  two  treatments  and  isolate  the 
effects  of  voting  cost,  given  the  parameters  intro¬ 
duced  in  the  experiment. 

Most  laboratory  experiments  use  a  within- 
subject  design  as  opposed  to  a  between-subject 
design.  In  a  within-subject  design,  subjects  in  the 
experiment  experience  all  treatments;  in  a  between- 
subject  design,  subjects  experience  only  one  treat¬ 
ment.  In  a  within-subject  design,  subjects  are  ran¬ 
domly  assigned  to  all  the  treatments  over  multiple 
periods.  One  advantage  of  a  within-subject  design 
is  that  it  greatly  increases  the  data  that  are  col¬ 
lected.  Another  advantage  as  opposed  to  a  between- 
subject  design  is  that  it  reduces  the  error  variance 
associated  with  differences  in  subject  behavior  (i.e., 
in  a  between-subject  design,  subjects  may  not  be 
randomly  mixed  and  might  have  shared  factors 
that  are  unknown  to  the  experimenter).  Third,  a 
within-subject  design  reduces  repeated  game  effects, 
which  means  that  most  models  are  a  one-shot 
game,  but  experiments  usually  test  the  model  over 
multiple  periods.  To  keep  the  one-shot  nature  of 
the  model  intact,  subjects  are  assigned  to  different 
treatments  over  multiple  periods,  so  that  they  will 
experience  a  different  experimental  environment  in 
each  period.  Hence,  each  period  will  be  a  new 
experience,  and  subjects  should  not  feel  like  they 
are  playing  the  same  experiment  in  each  period. 

As  mentioned  above,  causality  is  established  by 
experimental  controls  and  randomization  of  sub¬ 
jects  to  different  treatments,  allowing  the  researcher 
to  directly  observe  the  impact  of  one  variable  on 
another  without  the  use  of  sophisticated  statistical 
methods.  One  important  control  is  the  motivation 
of  subjects.  Laboratory  experiments  follow  the  dic¬ 
tates  of  induced  value  theory.  This  means  that  sub¬ 
jects  in  the  experiment  have  monetary  incentives, 
and  they  are  paid  earnings  based  on  their  perfor¬ 
mance  during  the  experiment.  With  this  payment 


structure  then,  one  subject  might  take  an  action 
that  earns  him  $1,  while  another  subject  might 
take  a  different  action  that  earns  her  $2.  Induced 
value  theory  specifies  that  as  long  as  subjects  are 
sufficiently  rewarded  monetarily,  subject  choices 
in  the  experiment  should  be  natural. 

To  enhance  a  subject’s  monetary  incentive  and 
to  control  factors  that  are  not  controlled  for  in  the 
experiment  (e.g.,  psychological  factors),  labora¬ 
tory  experiments  use  a  sterile  environment  as 
opposed  to  a  contextual  environment.  For  instance, 
in  an  election  experiment,  instead  of  having  voters 
vote  for  two  candidates  labeled  the  left  and  the 
right  candidate,  candidates  are  given  generic  labels, 
such  as  A  or  B,  or  Green  and  Red.  The  reason  is 
that  a  subject  might  vote  for  a  right  candidate 
because  the  subject  is  conservative  and  thinks  the 
right  candidate  is  more  aligned  with  his  or  her 
preferences.  Hence,  the  subject  might  vote  for  a 
candidate  based  on  ideological  preferences  and 
mistakenly  not  vote  for  a  candidate  based  on  the 
monetary  structure  induced  within  the  experiment. 

Using  monetary  incentives,  the  experimenter  is 
able  to  establish  controls  over  the  preferences  of 
subjects.  Why  do  researchers  need  to  control  the 
preferences  of  subjects?  Because  this  allows  a  sub¬ 
ject’s  behavior  to  be  measured.  Generally,  subjects 
are  assigned  a  utility  function  that  maps  monetary 
amounts  to  various  outcomes.  For  instance,  assume 
that  there  are  three  alternatives  that  a  subject  con¬ 
fronts  during  the  experiment — A,  B,  C — where  for 
a  subject,  A  is  worth  $2,  B  is  worth  $1,  and  C  is 
worth  $0.  Hence,  a  subject’s  preferences  over  the 
alternatives  are  A  >  B  >  C.  In  the  experiment,  a 
subject’s  motivation  is  to  make  choices  that  yield 
the  highest  possible  payoff  for  the  subject.  Subjects 
in  the  experiment  can  have  conflicting  interests, 
such  that  another  subject  might  have  the  prefer¬ 
ences  C  >  B  >  A. 

By  controlling  preferences,  it  is  possible  to 
examine  a  wide  range  of  topics,  but  let  us  consider 
one:  strategic  and  coordinated  voting  in  three- 
candidate  elections.  To  illustrate,  suppose  a  group 
of  four  subjects  (Group  1)  has  the  preferences  A  > 
B  >  C,  another  group  of  four  subjects  (Group  2) 
has  the  preferences  B  >  A  >  C,  and  a  third  group 
of  six  subjects  (Group  3)  prefers  C  and  is  indiffer¬ 
ent  between  A  and  B  (i.e.,  A  =  B).  If  the  experi¬ 
mental  election  assumed  plurality  voting  and  all 
subjects  voted  for  their  most  preferred  alternative, 
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then  C  would  win  the  election.  But  note  that  this 
is  the  least  preferred  outcome  of  Groups  1  and  2. 
Hence,  members  of  Groups  1  and  2  have  an  incen¬ 
tive  to  vote  strategically — that  is  to  vote  for  a  least 
preferred  outcome  to  get  a  better  one.  To  accom¬ 
plish  this,  either  members  of  Group  1  would  have 
to  strategically  vote  for  B  or  members  of  Group  2 
would  have  to  strategically  vote  for  A.  Hence,  it  is 
a  coordination  problem.  In  experiments  Robert 
Forsythe  and  colleagues  found  that  without  any 
type  of  coordination  device,  voters  were  unable  to 
coordinate. 

Within  this  framework,  it  is  possible  to  vary 
other  control  variables  to  determine  the  impact 
that  other  variables  have  on  the  ability  of  voters  to 
coordinate  their  vote  when  strategic  voting  will 
increase  their  payoffs.  One  variable  is  the  type  of 
election  rule  subjects  vote  under.  Experiments  that 
varied  election  rules,  such  as  the  Borda  rule  and 
approval  voting,  found  that  as  compared  with  plu¬ 
rality  voting,  the  Borda  rule  and  approval  voting 
allowed  subjects  to  coordinate  more  easily. 

Another  important  control  variable  that  is  estab¬ 
lished  in  the  laboratory  is  the  information  that 
subjects  have  about  the  experimental  environment. 
For  instance,  in  election  experiments,  it  is  possible 
to  vary  the  information  that  subjects  have  about 
candidates  (i.e.,  a  voter  may  know  that  A  is  her 
most  preferred  candidate  but  does  not  know  if  she 
prefers  B  or  C).  Controlling  subject  information 
has  been  used  to  experimentally  study  the  classic 
Downsian  model  of  elections.  In  this  model,  politi¬ 
cal  candidates  adopt  policy  positions  over  a  single¬ 
issue  dimension  (i.e.,  a  single  line  that  ranges  from 
0  to  100).  Voters  in  this  setup  have  single-peaked 
preferences  over  the  dimension  (single-peaked  pref¬ 
erences  means  that  voters  have  an  ideal  point  over 
the  dimension  or  a  best  location).  Assuming  an  odd 
number  of  voters  who  know  where  the  candidates 
are  located  on  the  dimension  and  of  candidates 
who  know  where  voters  are  located  on  the  dimen¬ 
sion,  candidates’  optimal  location  or  equilibrium  is 
at  the  position  of  the  median  voter — which  is  the 
well-known  median  voter  theorem. 

Experiments  were  conducted  that  relaxed  the 
full-information  condition  of  the  model  to  deter¬ 
mine  if  the  median  voter  result  would  still  hold. 
Richard  McKelvey  and  Peter  Ordeshook  limited 
the  information  that  voters  and  candidate  subjects 
had  about  the  election  environment.  They  found 


that  when  voters  only  knew  the  relative  location  of 
candidates  and  candidates  had  no  knowledge  of 
the  voters’  distribution  of  preferences,  then  out¬ 
comes  generally  conformed  to  the  median  voter’s 
ideal  point. 

In  addition  to  the  controls  over  the  experimen¬ 
tal  environment,  laboratory  experiments  allow 
researchers  to  study  political  phenomena  that 
would  be  difficult  to  study  using  real-world  data. 
For  instance,  Marco  Battaglini,  Rebecca  Morton, 
and  Thomas  Palfrey  examined  a  “swing  voter’s 
curse”  that  relates  to  the  paradox  of  why  people 
vote,  since  the  probability  of  a  single  vote  affecting 
the  outcome  is  very  small.  If  there  is  a  cost  to  vot¬ 
ing,  then  there  is  no  rational  reason  to  vote,  but 
what  if  a  voter  pays  the  cost  of  voting  and  goes  to 
the  polling  station  but  then  does  not  vote  for  cer¬ 
tain  items  on  the  ballot?  The  swing  voter’s  curse 
happens  when  a  voter  forgoes  voting  for  an  item 
on  a  ballot  and  delegates  this  decision  to  other  vot¬ 
ers  whom  he  thinks  are  more  informed.  The  results 
showed  that  uninformed  voters  do  in  fact  delegate 
their  vote  or  abstain  most  of  the  time  in  these 
cases.  Hence,  knowledge  of  why  voters  behave  in 
this  manner  would  be  difficult  to  discern  without 
using  laboratory  experiments. 

One  concern  that  has  been  expressed  about 
laboratory  experiments  among  nonexperimental 
researchers  is  related  to  validity  issues.  The  reason 
that  experimental  researchers  face  this  question  is 
that  nonexperimental  researchers  feel  that  a  com¬ 
puter  laboratory  is  an  artificial  environment  and 
does  not  replicate  the  real  world.  However,  the 
artificiality  of  the  environment  is  a  desirable  prop¬ 
erty,  and  it  allows  the  researcher  to  make  precise 
measurements  of  behavior  within  the  laboratory. 
The  results  in  the  laboratory  can  be  compared  with 
other  studies  using  different  types  of  data  to  fur¬ 
ther  enhance  validity. 

In  sum,  laboratory  experiments  are  used  to  test 
the  assumptions  and  predictions  of  formal  mod¬ 
els.  Using  this  methodology,  it  is  easy  to  measure 
causality  by  specifying  different  control  variables 
and  randomly  assigning  subjects  to  different  treat¬ 
ments.  Control  over  motivation  in  the  experiment 
allows  for  the  control  of  preferences  and  allows 
for  the  measurement  of  behavior.  Also,  using  this 
method,  a  researcher  is  able  to  examine  the  prop¬ 
erties  of  political  mechanisms  that  have  not  been 
instituted  or  used  very  little  in  the  real  world. 
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Finally,  laboratory  experiments  allow  the  researcher 
to  empirically  examine  all  types  of  political  mecha¬ 
nisms  and  collect  relevant  data,  which  are  difficult 
to  obtain  using  nonexperimental  methods. 
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The  importance  of  natural  experiments — analyses  of 
real-world  situations  that  attempt  to  employ  some 
type  of  random  assignment — is  in  their  contribution 
to  addressing  confounding,  a  pervasive  problem  in 


the  social  sciences.  Consider  the  obstacles  to  address¬ 
ing  the  following  hypothesis:  Extending  property 
titles  to  poor  land  squatters  boosts  access  to  credit 
markets  and  promotes  beliefs  in  individual  politi¬ 
cal  efficacy,  thereby  fostering  socioeconomic 
development.  To  test  this  idea,  researchers  might 
compare  poor  squatters  who  possess  land  titles 
with  those  who  do  not.  Yet  confounding  may  be  a 
problem  because  differences  in  individual  attitudes 
and  behaviors  could  in  part  be  due  to  factors — 
such  as  family  background — that  also  make  cer¬ 
tain  poor  squatters  more  likely  to  acquire  titles  to 
their  property.  This  entry  examines  both  the 
strengths  and  potential  limitations  of  natural 
experiments  and  how  they  fit  among  the  various 
research  strategies  available  to  political  scientists. 

It  is  useful  to  contrast  natural  experiments  with 
both  conventional  observational  studies  and  true 
experiments.  In  the  first  approach,  investigators 
seek  to  control  for  potential  confounders  in  obser¬ 
vational  (nonexperimental)  data.  For  instance, 
they  may  compare  titled  and  untitled  squatters 
within  strata  defined  by  measures  of  family  back¬ 
ground.  At  the  core  of  conventional  quantitative 
methods  is  the  hope  that  such  confounders  can  be 
identified,  measured,  and  controlled.  Yet  this  is 
not  easy  to  do.  Even  within  the  strata  defined  by 
family  background  and  intelligence,  there  may  be 
other  difficult-to-measure  confounders — say, 
determination — that  are  associated  with  obtaining 
titles  and  that  also  influence  economic  and  politi¬ 
cal  behaviors. 

A  second  approach  would  be  to  use  a  random¬ 
ized  controlled  experiment  to  estimate  the  effects 
of  land  titling.  Subjects  could  be  randomly  assigned 
to  receive  titles  or  not;  family  background,  deter¬ 
mination,  and  other  possible  confounders  would 
then  be  equivalent,  on  average  and  up  to  random 
error,  across  these  two  groups.  Thus,  large  postti- 
tling  differences  would  then  be  credible  evidence 
for  a  causal  effect  of  land  titles.  However,  experi¬ 
mental  research  in  such  contexts  may  be  expen¬ 
sive,  impractical,  or  unethical. 

Scholars  therefore  increasingly  make  use  of 
natural  experiments — attempting  to  identify  and 
analyze  real-world  situations  in  which  some  pro¬ 
cess  of  random  or  as-if  random  assignment  places 
cases  in  alternative  categories  of  the  key  indepen¬ 
dent  variable.  In  the  social  sciences,  this  approach 
has  been  used  to  study  the  relationship  between 
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lottery  winnings  and  political  attitudes,  the  effect 
of  voting  costs  on  turnout,  the  impact  of  quotas 
for  women  village  councilors  on  public  goods  pro¬ 
vision  in  India,  and  many  other  topics.  In  the 
health  sciences,  a  paradigmatic  example  comes 
from  John  Snow’s  19th-century  tests  of  the  hypoth¬ 
esis  that  cholera  is  waterborne. 

Natural  experiments  share  one  crucial  attribute 
with  true  experiments  and  partially  share  a  second 
attribute.  First,  outcomes  are  typically  compared 
across  subjects  exposed  to  a  treatment  and  those 
exposed  to  a  control  condition  (or  a  different  treat¬ 
ment).  Second,  in  partial  contrast  with  true  exper¬ 
iments,  subjects  are  often  assigned  to  the  treatment 
not  at  random  but  rather  as-if  at  random  (though 
sometimes  true  randomization  occurs,  as  in  lottery 
studies).  Given  that  the  data  come  from  naturally 
occurring  phenomena  that  often  entail  social  and 
political  processes,  the  manipulation  of  the  treat¬ 
ment  is  not  under  the  control  of  the  analyst;  thus, 
the  study  is  observational. 

However,  a  researcher  carrying  out  this  type  of 
study  can  often  make  a  credible  claim  that  the 
assignment  of  nonexperimental  subjects  to  treat¬ 
ment  and  control  conditions  is  as  good  as  random. 
This  distinguishes  natural  experiments  from  “quasi 
experiments,”  in  which  comparisons  are  also  made 
across  treatment  and  control  groups,  but  nonran¬ 
dom  assignment  to  treatment  is  a  key  feature  of  the 
designs. 

Yet  how  can  the  claim  of  as-if  random  assign¬ 
ment  in  natural  experiments  be  validated?  And 
how  much  leverage  do  natural  experiments  in  fact 
provide  for  causal  inference?  These  questions  are 
discussed  below,  after  the  initial  example  on  land 
titling  is  discussed  at  greater  length. 

How  Do  Property  Rights  Affect  the  Poor? 

An  interesting  social-scientific  example  comes 
from  a  study  of  how  land  titles  influence  the  socio¬ 
economic  development  of  poor  communities.  In 
1981,  urban  squatters  organized  by  the  Catholic 
Church  in  Argentina  occupied  open  land  in  the 
province  of  Buenos  Aires,  dividing  the  land  into 
parcels  that  were  allocated  to  individual  families. 
A  1984  law,  adopted  after  the  return  to  democracy 
in  1983,  expropriated  this  land  with  the  intention 
of  transferring  titles  to  the  squatters.  However, 


some  of  the  original  landowners  challenged  the 
expropriation  in  court,  leading  to  long  delays  in 
the  transfer  of  titles  to  some  of  the  squatters.  By 
contrast,  for  other  squatters,  titles  were  granted 
immediately. 

The  legal  action  therefore  created  a  (treatment) 
group  of  squatters  to  whom  titles  were  granted 
promptly  and  a  (control)  group  to  whom  titles  were 
not  granted.  The  authors  of  the  study  found  subse¬ 
quent  differences  across  the  two  groups  in  standard 
social  development  indicators:  average  housing 
investment,  household  structure,  and  educational 
attainment  of  children.  In  contrast,  the  authors  do 
not  find  a  difference  in  access  to  credit  markets, 
which  contradicts  a  well-known  theory  that  the 
poor  will  use  titled  property  to  collateralize  debt. 
They  also  found  a  positive  effect  of  property  rights 
on  self-perceptions  of  individual  efficacy.  For 
instance,  squatters  who  were  granted  land  titles — 
for  reasons  over  which  they  apparently  had  no 
control — disproportionately  agreed  with  statements 
that  people  get  ahead  in  life  due  to  hard  work. 

Is  this  a  valid  natural  experiment?  The  key 
claim  is  that  land  titles  were  assigned  to  the  squat¬ 
ters  as-if  at  random,  and  the  authors  present  vari¬ 
ous  kinds  of  evidence  to  support  this  assertion.  In 
1981,  for  example,  the  eventual  expropriation  of 
land  by  the  state  and  the  transfer  of  titles  to  squat¬ 
ters  could  not  have  been  predicted.  Moreover, 
there  was  little  basis  for  successful  prediction  by 
squatters  or  the  Catholic  Church  organizers  of 
which  particular  parcels  would  eventually  have 
their  titles  transferred  in  1984.  Titled  and  untitled 
parcels  sat  side-by-side  in  the  occupied  area,  and 
the  parcels  had  similar  characteristics,  such  as  dis¬ 
tance  from  polluted  creeks.  The  authors  also  show 
that  the  squatters’  characteristics,  such  as  age  and 
sex,  were  statistically  unrelated  to  whether  they 
received  titles — as  should  be  the  case  if  titles  were 
assigned  at  random.  Finally,  the  government 
offered  equivalent  compensation — based  on  the 
size  of  the  lot — to  the  original  owners  in  both 
groups,  suggesting  that  the  value  of  the  parcels 
does  not  explain  which  owners  challenged  expro¬ 
priation  and  which  did  not.  On  the  basis  of  exten¬ 
sive  interviews  and  other  qualitative  fieldwork, 
the  authors  argue  convincingly  that  idiosyncratic 
factors  explain  some  owners’  decisions  to  chal¬ 
lenge  expropriation  and  that  these  factors  were 
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unrelated  to  the  characteristics  of  squatters  or 
their  parcels. 

The  authors  thus  present  compelling  evidence 
for  the  equivalence  of  treated  and  untreated  units. 
Along  with  qualitative  evidence  on  the  process  by 
which  the  squatting  took  place,  this  evidence  helps 
bolster  the  assertion  that  assignment  is  as-if  ran¬ 
dom.  Of  course,  assignment  was  not  randomized, 
so  the  possibility  of  unobserved  confounders  can¬ 
not  be  entirely  ruled  out.  Yet  the  argument  for  as- 
good-as-random  assignment  appears  compelling. 
Note  that  the  natural  experiment  plays  a  crucial 
role.  Without  it,  the  intriguing  findings  about  the 
self-reinforcing  (not  to  mention  self-deluding) 
beliefs  of  the  squatters  could  have  been  explained 
as  a  result  of  unobserved  characteristics  of  those 
squatters  who  did  or  did  not  successfully  gain  titles. 

A  Framework  for  Evaluating 
Natural  Experiments 

Flow  much  leverage  for  causal  inference  do  natural 
experiments  in  fact  provide?  To  address  this  ques¬ 
tion,  it  is  helpful  to  discuss  three  dimensions  along 
which  natural  experiments  may  vary: 

1.  plausibility  of  an  as-if  random  assignment; 

2.  credibility  of  the  statistical  models,  which  is 
closely  connected  with  the  simplicity  and 
transparency  of  the  data  analysis;  and 

3.  substantive  relevance  of  the  intervention — that 
is,  whether  and  in  what  ways  the  specific 
contrast  between  treatment  and  control 
provides  insight  into  a  wider  range  of  important 
issues  and  contexts. 

First,  the  key  claim — and  the  definitional  crite¬ 
rion — for  a  natural  experiment  is  that  treatment 
assignment  is  as  good  as  random.  Yet  one  finds 
marked  variation  in  the  plausibility  of  this  claim 
among  studies  that  claim  to  use  natural  experiments. 

How  can  the  assertion  of  as-if  random  be  at 
least  partially  validated?  First,  it  should  be  sup¬ 
ported  by  the  available  empirical  evidence — for 
example,  by  showing  equivalence  on  relevant  pre¬ 
treatment  variables  (those  whose  values  were 
determined  before  the  intervention  took  place) 
across  treatment  and  control  groups,  as  would 


occur  on  average  with  true  randomization. 
Qualitative  knowledge  about  the  process  by  which 
treatment  assignment  takes  place  can  also  play  a 
key  role  in  validating  a  natural  experiment.  The 
authors  of  the  studies  on  land  titling  use  both 
quantitative  comparisons  and  qualitative  evidence 
about  the  process  of  organizing  squatters’  settle¬ 
ments  to  validate  the  claim  of  an  as-if  random 
assignment. 

A  second  dimension  along  which  natural  exper¬ 
iments  may  vary  is  in  the  credibility  of  the  statisti¬ 
cal  models  used  to  analyze  the  data.  As  with  true 
experiments,  as-if  random  assignment  implies  that 
both  known  and  unknown  confounders  are  bal¬ 
anced  (in  expectation)  across  treatment  and  con¬ 
trol  groups,  obviating  the  need  to  measure  and 
control  for  confounding  variables.  This  has  the 
great  advantage  of  permitting  the  use  of  simple 
analytic  tools — for  example,  comparisons  of 
means  or  percentages  across  the  treatment  and 
control  groups — to  make  causal  inferences.  In 
principle,  the  simplicity  and  transparency  of  the 
statistical  analysis  provides  natural  experiments 
with  an  important  advantage,  relative  to  the  con¬ 
ventional  quantitative  methods  that  have  in  recent 
years  incurred  substantial  criticism  from  leading 
methodologists. 

In  practice,  greater  credibility  of  statistical  mod¬ 
els  is  not  inherent  in  all  studies  that  claim  to  use 
natural  experiments.  One  recent  survey  of  leading 
examples  from  political  science  and  economics 
found  that  about  half  of  the  studies  failed  to  pres¬ 
ent  simple,  unadjusted  difference-of-means  tests 
(in  addition  to  any  auxiliary  analyses).  Of  course, 
in  less-than-perfect  natural  experiments,  in  which 
the  plausibility  of  as-if  random  is  perhaps  impeach¬ 
able,  researchers  may  feel  compelled  to  control  for 
potential  confounders  they  can  measure.  Yet  any 
substantial  changes  after  adjustment  likely  point  to 
a  lack  of  as-if  random  assignment — because  ran¬ 
domization  would  ensure  that  control  variables  are 
independent  of  treatment  assignment.  Post  hoc  sta¬ 
tistical  fixes  can  also  lead  to  data  mining,  with  only 
“significant”  estimates  of  causal  effects  making 
their  way  into  published  reports.  (Researchers  also 
sometimes  use  multivariate  regression  to  reduce  the 
variability  of  treatment  effect  estimators;  yet  vari¬ 
ance  may  be  higher  or  lower  after  adjustment,  and 
the  standard  errors  calculated  using  the  usual 
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regression  formulas  do  not  apply.)  Thus,  to  bolster 
the  credibility  of  the  statistical  models  used  in  natu¬ 
ral-experimental  designs,  analysts  should  report 
unadjusted  difference-of-means  tests,  in  addition 
to  any  auxiliary  analyses. 

A  third  dimension  along  which  natural  experi¬ 
ments  can  be  classified  is  the  substantive  relevance 
of  the  intervention.  Here  we  ask,  “To  what  extent 
does  as-if  random  assignment  shed  light  on  the 
wider  social-scientific,  substantive,  and/or  policy 
issues  that  motivate  the  study?”  Answers  to  this 
question  might  be  a  cause  for  concern,  for  a  num¬ 
ber  of  reasons.  For  instance,  the  type  of  subjects 
or  units  exposed  to  the  intervention  might  be 
more  or  less  like  the  populations  in  which  we  are 
most  interested.  In  lottery  studies  of  electoral 
behavior,  for  example,  levels  of  lottery  winnings 
may  be  randomly  assigned  among  lottery  players, 
but  we  might  doubt  whether  lottery  players  are 
like  other  populations  (say,  all  voters).  Next,  the 
particular  treatment  might  have  idiosyncratic 
effects  that  are  distinct  from  the  effects  of  greatest 
interest.  To  continue  the  same  example,  levels  of 
lottery  winnings  may  or  may  not  have  similar 
effects  on,  say,  political  attitudes  as  income  earned 
through  work.  Finally,  natural  experimental  inter¬ 
ventions  (like  the  interventions  in  some  true 
experiments)  may  “bundle”  many  distinct  treat¬ 
ments  or  components  of  treatments,  which  may 
limit  the  extent  to  which  this  approach  isolates  the 
effect  of  the  explanatory  variable  about  which  the 
researcher  cares  most.  Such  ideas  are  often  dis¬ 
cussed  under  the  rubric  of  “external  validity,”  but 
the  issue  of  substantive  relevance  involves  a 
broader  question — that  is,  whether  the  interven¬ 
tion  in  fact  yields  causal  inferences  about  the  real 
causal  hypothesis  of  concern. 

To  tie  this  discussion  together,  we  can  imagine 
a  cube,  in  which  the  three  axes  are  defined  by  these 
three  dimensions: 

f .  plausibility  of  as-if  random  assignment, 

2.  credibility  of  statistical  models,  and 

3.  substantive  relevance  of  intervention. 

In  the  front  lower-left  corner  of  the  cube,  we 
find  those  natural  experiments  that  offer  the  least 
plausibility,  credibility,  and  substantive  relevance. 


In  the  back  upper-right  corner  of  the  cube,  we  find 
those  studies  with  the  most  plausibility,  credibil¬ 
ity,  and  relevance.  The  process  of  achieving  a 
strong  research  design — in  which  natural  experi¬ 
ments  provide  substantial  leverage  for  causal 
inference — may  be  understood  as  the  process  of 
moving  from  the  front  lower-left,  “weak  research 
design”  corner  of  the  cube,  to  the  back  upper- 
right,  “strong  research  design”  corner  of  the  cube. 

Two  points  are  important  to  make  in  closing. 
First,  there  may  be  trade-offs  in  seeking  to  design 
a  strong  natural  experiment — that  is,  in  moving  to 
the  back  upper-right  corner  of  the  cube.  Different 
studies  may  manage  the  trade-off  among  these 
three  dimensions  in  different  ways,  and  which 
trade-offs  are  acceptable  (or  unavoidable)  may 
depend  on  the  question  being  asked. 

Second,  deep  substantive  knowledge,  and  a  com¬ 
bination  of  quantitative  and  qualitative  analyses, 
can  help  analysts  better  achieve  success  along  all 
the  three  dimensions  of  the  framework  described 
above.  Consider  the  studies  of  squatters  in  Argen¬ 
tina.  There,  substantive  knowledge  was  necessary 
to  recognize  the  potential  to  use  a  natural  experi¬ 
ment  to  study  the  effect  of  land  titling,  and  many 
field  interviews  were  required  to  probe  the  plausi¬ 
bility  of  as-if  randomness — that  is,  to  validate  the 
research  design.  Fieldwork  can  also  enrich  analysts’ 
understanding  and  interpretation  of  the  causal 
effects  they  estimate. 

In  sum,  many  modes  of  inquiry  may  contribute 
to  successful  causal  inference  using  natural  experi¬ 
ments;  ultimately,  the  right  mix  of  methods  sub¬ 
stantially  depends  on  the  research  question 
involved.  Natural  experiments,  like  regression 
analysis,  do  not  provide  a  technical  quick  fix  to 
the  challenges  of  causal  inference.  In  every  study, 
analysts  are  challenged  to  think  critically  about 
the  match  between  the  assumptions  of  models  and 
the  empirical  reality  they  are  studying.  Natural 
experiments  are  valuable  to  the  extent  that  they 
build  on  real  substantive  knowledge  and  appropri¬ 
ate  methodological  craftsmanship  and  a  full 
awareness  of  the  trade-offs  inherent  in  this  style  of 
research. 
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Yale  University 
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Factor  Analysis 


Factor  analysis  is  a  well-established  method  that 
attempts  to  measure  latent  constructs  such  as  atti¬ 
tudes  or  values,  which  cannot  be  observed  directly. 
The  standard  model  assumes  that  the  measured 
variables  (items  or  indicators)  are  linear  additive 
functions  of  the  unobserved  (latent)  factors  and  the 
error  component.  This  is  called  the  common-factor 
model.  The  general  system  of  equations  for  this 
model  is 

Ji  =  A/1 T1 ,  +  \/2in2  +  •  •  •  +  \jmT\m  +  Zh 

where  y-  represents  the  1, ,  j  observed  vari¬ 
ables  measured  on  a  sample  of  n  independent 
subjects;  rq,  .  .  .  ,  r\m  represent  the  1 ,  ...  ,m  latent 
constructs  (factors)  in  the  model;  \;1,  .  .  .  ,  \jm  rep¬ 
resent  the  factor  loading  (partialized  regression 
coefficient)  relating  variables  1,  .  .  .  ,  /  to  the  first 
to  rath  factors;  and  e15 .  .  .  ,  e;  stands  for  the  error 
component  (uniqueness).  It  is  assumed  that  the 
error  component  of  one  indicator  or  item  is  inde¬ 
pendent  of  all  factors  and  of  all  error  components 
of  the  other  items.  In  this  entry,  the  two  major 
forms — exploratory  factor  analysis  (EFA)  and  con¬ 
firmatory  factor  analysis  (CFA) — are  presented. 

An  example  with  five  items  (yv  .  .  . ,  ys),  two  fac¬ 
tors  (rp,  r|2),  and  five  random  measurement  errors 
(s15 .  .  .  ,  e5)  is  given  in  Figure  1;  <f)12  is  the  symbol 
for  the  covariance  between  the  Factors  1  and  2. 

The  variance  of  the  unique  component  can  be 
further  decomposed  as  s  =  es  +  se,  where  cs  is  the 


Figure  I  Visualization  of  a  Factor  Model  With  Five 
Observed  and  Two  Latent  Variables 

specific  error  variance  due  to  the  particular  item 
(e.g.,  a  specific  item  wording)  and  £e  the  random 
measurement  error.  In  most  cross-sectional  stud¬ 
ies,  one  cannot  distinguish  between  these  two 
kinds  of  errors  because  they  are  not  separately 
identified.  It  is  possible  to  separate  these  compo¬ 
nents  only  by  using  special  designs  like  multitrait- 
multimethod  or  panel  studies. 

Generally,  one  has  to  differentiate  between  EFA 
and  CFA.  EFA  is  used  when  there  is  no  or  not  suffi¬ 
cient  a  priori  knowledge  about  the  number  of  factors 
and  the  relationship  between  items  and  constructs. 
CFA,  in  contrast,  is  used  when  researchers  have  con¬ 
crete  assumptions  about  the  measurement  model. 

Exploratory  Factor  Analysis 

EFA  is  a  method  used  to  detect  the  optimal  num¬ 
ber  of  factors  that  accounts  for  the  correlation  of 
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items.  Each  factor  is  interpreted  and  named  based 
on  the  items  that  have  high  loadings  on  this  factor. 
Its  character  is  inductive  as  the  number  of  factors, 
the  amount  of  correlations  between  factors,  and 
the  assignments  between  items  and  factors  are  per¬ 
formed  empirically  without  a  precise  deductive 
theoretical  model.  There  is  no  unique  solution  for 
the  relation  between  observed  items  and  latent  fac¬ 
tors.  There  are  two  different  approaches  to  the 
factor  model — (a)  common-factor  model  and 

(b)  principal  component  analysis — and  each 
implies  different  substantive  assumptions.  In  the 
common-factor  model,  it  is  assumed  that  every 
item  is  measured  with  some  error.  Therefore,  the 
common-factor  model  is  most  appropriate  in  sur¬ 
vey  research,  where  items  are  always  measured 
with  some  error.  However,  in  the  principal  compo¬ 
nent  model,  the  researcher  assumes  that  there  is  no 
measurement  error  in  the  items.  All  analyses  are 
based  on  correlation  matrices  as  input. 

EFA  is  performed  in  five  steps: 

1.  Factor  extraction:  The  method  that  is  most 
often  used  is  the  method  of  principal  axis  factoring 
and  is  based  on  the  computation  of  the  eigenvalues 
and  vectors  of  the  correlation  matrix  between  all 
measured  variables.  Its  goal  is  the  maximization  of 
the  variance  of  each  successively  extracted  factor. 

2.  Number  of  factors:  Different  procedures 
determine  the  number  of  factors:  (a)  According  to 
the  Kaiser-Guttman  criterion,  the  number  of  fac¬ 
tors  should  correspond  to  the  number  of  eigenval¬ 
ues  of  the  full  input  correlation  matrix  that  exceed 
one.  (b)  The  scree  plot  is  a  graph  of  each  eigenvalue 
plotted  in  descending  order.  The  number  of  factors 
is  determined  by  visual  inspection  by  observing 
whether,  suddenly,  the  plot  shows  no  real  differ¬ 
ence  between  the  last  two  eigenvalues  in  the  scree 
plot,  (c)  In  parallel  analysis,  a  second  factor  analy¬ 
sis  is  calculated  with  a  random  data  set  with  the 
same  numbers  of  variables  and  cases  as  in  the 
original  analysis.  Only  the  factors  with  higher 
eigenvalues  than  the  factors  of  the  random  data  are 
accepted,  (d)  In  maximum  likelihood  estimation, 
one  studies  the  amount  of  residuals  left  over  after 
introducing  more  factors  and  evaluates  them  by 
goodness-of-fit  tests  of  the  whole  model. 

3.  Communality  estimation  as  a  measure  for  the 
common  variance  of  every  item:  The  most  often 
used  method  for  an  initial  estimate  is  the  squared 


multiple  correlation  of  one  item  with  all  other 
items.  Depending  on  the  number  of  factors 
extracted,  the  final  solution  contains  the  explained 
variance  of  every  item. 

4.  Factor  rotation:  The  goal  of  rotation  is  to 
achieve  “a  simple  structure”  that  allows  a  good 
interpretation  of  the  resulting  solution.  Rotations 
can  be  orthogonal,  assuming  that  there  are  no  cor¬ 
relations  between  factors  (e.g.,  varimax,  quarti- 
max,  and  equimax),  or  oblique  (e.g.,  promax, 
oblimin),  assuming  that  there  exist  correlations 
between  factors.  In  the  case  of  correlated  factors, 
the  factor  loadings  of  items  on  factors  contained  in 
the  pattern  factor  matrix  represent  standardized 
partialized  regression  coefficients.  In  the  case  of 
noncorrelated  factors,  these  coefficients  are  simply 
correlation  coefficients  between  items  and  factors. 
The  factor  structure  matrix  contains  the  simple 
bivariate  correlations  between  items  and  con¬ 
structs.  The  values  differ  from  the  factor  pattern 
matrix  only  if  factors  are  correlated. 

5.  Factor  score  estimation:  As  a  final  step,  one 
may  be  interested  in  estimating  the  factor  scores. 
The  factor  scores  are  often  used  as  weights  for  raw 
scores  in  the  computation  of  indices  that  are  used 
in  subsequent  analyses. 

Confirmatory  Factor  Analysis 

In  contrast  to  EFA,  in  CFA,  a  theoretical  model  is 
needed  that  contains  a  series  of  a  priori  hypotheses. 
These  hypotheses  should  be  drawn  from  the  sub¬ 
stantive  literature.  For  example,  in  the  theory  of 
values  of  Shalom  Schwartz,  10  values  are  postulated 
and  theoretically  derived.  Therefore,  the  10  theo¬ 
retical  constructs  can  be  formalized  and  tested  as 
latent  variables  in  a  CFA  model.  However,  it  is  not 
definitely  determined  in  the  literature  which  and 
how  many  dimensions  and  items  are  adequate  to 
measure  the  concepts  of  nationalism  and  patriotism 
and  whether  one  can  differentiate  between  them. 
The  hypotheses  in  a  CFA  model  refer  to  the  follow¬ 
ing  aspects  of  a  measurement  model:  (a)  determina¬ 
tion  of  the  number  of  factors,  (b)  assumption 
regarding  whether  the  factors  are  correlated  or  not, 

(c)  determination  of  which  items  load  on  which  fac¬ 
tor  and  where  to  set  loadings  a  priori  to  zero,  and 

(d)  assumptions  about  the  correlations  of  the  errors. 
CFA  allows  the  specification  and  testing  of  these 
assumptions.  It  is  possible  to  fix  parameters  to  a 
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certain  value  or  to  constrain  parameters  that  are  set 
equal  to  other  parameters.  One  assumes  that  the 
expected  value  of  random  measurement  error  is 
zero  (es  =  0)  and  the  correlation  of  the  random 
measurement  error  with  the  latent  variable  (factor) 
is  also  zero.  Furthermore,  it  is  necessary  to  fix  one 
of  the  loadings  to  one  or  to  standardize  the  variance 
of  the  latent  variable  to  reach  identification  of  the 
model,  that  is,  to  achieve  a  unique  solution. 

To  illustrate  this,  in  this  entry,  we  use  data  from 
the  International  Social  Survey  Programme  (ISSP) 
National  Identity  module  collected  in  2003,  which 
contains  items  to  measure  nationalism  and  con¬ 
structive  patriotism.  Theoretically,  it  has  been  dis¬ 
cussed  whether  there  are  two  different  latent  vari¬ 
ables  (factors) — namely,  nationalism  and  construc¬ 
tive  patriotism.  Nationalism  was  measured  on  a 
5-point  scale  ranging  from  I  ( strongly  disagree )  to 
5  ( strongly  agree).  The  questions  were  worded  as 
follows:  (1)  “The  world  would  be  a  better  place 
if  people  from  other  countries  were  more  like 
the  [country  nationality  of  the  respondent].” 
(2)  “Generally  speaking,  [the  respondent’s  coun¬ 
try]  is  a  better  country  than  most  other  countries.” 
A  high  score  indicates  a  high  degree  of  national¬ 
ism.  Constructive  patriotism  was  measured  using 
the  following  three  items:  “How  proud  are  you  of 


[respondent’s  country]  in  each  of  the  following:” 
(3)  “The  way  democracy  works.”  (4)  “Its  social 
security  system.”  (5)  “Its  fair  and  equal  treatment 
of  all  groups  in  society.”  Responses  were  given  on 
a  4-point  scale  ranging  from  I  [not  proud  at  all)  to 
4  (very  proud).  The  Pearson  correlation  matrix 
and  standard  deviations  are  provided  for  the 
United  States  in  Table  1.  Pairwise  deletion  of  miss¬ 
ing  values  was  used  because  the  number  of  missing 
values  was  low  (less  than  5%). 

The  standardized  and  unstandardized  coeffi¬ 
cients  were  computed  using  the  maximum  likeli¬ 
hood  estimation  procedure  in  AMOS  version 
18  software  with  graphical  input  (see  Figure  2). 

Figure  2  shows  that  the  two  latent  variables  (fac¬ 
tors) — constructive  patriotism  and  nationalism — are 
positively  correlated  (r  =  .48).  The  standardized 
factor  loadings  are  all  equal  to  or  higher  than  .50, 
which  is  an  expression  of  the  sufficient  formal 
validity  of  the  items  in  the  model.  Furthermore, 
“demo,”  “security,”  and  “fair”  are  only  explained 
by  patriotism,  whereas  “like  us”  and  “better”  are 
only  explained  by  nationalism.  There  are  no 
cross-loadings.  Therefore,  both  convergent  validity 
by  sufficient  factor  loadings  of  the  items  on  their 
respective  factors  and  divergent  validity  by  having 
no  cross-loadings  and  a  correlation  between  the 


Table  I  Means,  Standard  Deviations,  and  Correlations  Between  Items  (U.S.  Sample) 


Like  Us 

Better 

Demo 

Security 

Fair 

Mean 

2.74 

1.87 

1.80 

2.44 

2.06 

Standard 

deviation 

1.11 

0.90 

0.68 

0.85 

0.81 

Like  us 

r 

N 

1 

1,197 

Better 

r 

N 

.48 

1,187 

1 

1,197 

Demo 

r 

N 

.16 

1,138 

.23 

1,140 

1 

1,150 

Security 

r 

N 

.14 

1,156 

.15 

1,154 

.31 

1,118 

1 

1,167 

Fair 

r 

N 

.23 

1,153 

.21 

1,151 

.35 

1,116 

.31 

1,129 

1 

1,162 

Source:  Data  from  International  Social  Survey  Programme  (ISSP),  National  Identity  II  (2003).  Distributor:  GESIS  Cologne 
Germany,  ZA  No.  3910. 


Note:  r  =  Pearson’s  correlation  coefficient;  correlations  between  indicators  of  the  same  construct  are  shaded. 
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Figure  2  Results  From  a  Confirmatory  Factor 

Analysis  for  the  U.S.  Sample  (Standardized 
Solution) 

Source:  Data  from  International  Social  Survey  Programme 
(ISSP),  National  Identity  II  (2003).  Distributor:  GESIS 
Cologne  Germany,  ZA  No.  3910. 

Note:  PAT  =  patriotism;  NAT  =  nationalism. 

concepts  that  is  not  too  high  have  been  established. 
The  explained  variances  of  the  observed  variables 
(given  at  the  edge  of  the  rectangles)  are  between  .25 
for  “security”  and  .53  for  “better.”  The  unstan¬ 
dardized  factor  loadings  and  their  standard  errors 
are  reported  in  Table  2.  The  model  fit  (not  reported) 
is  satisfactory  according  to  standard  criteria. 

One  important  generalization  of  CFA  is  multi¬ 
ple  group  confirmatory  factor  analysis  (MGCFA), 
which  allows  us  to  test  the  invariance  of  factor 
loadings,  factor  covariances  and  variances,  mea¬ 
surement  errors,  intercepts  of  observed  variables, 
and  latent  means.  This  procedure  is  especially 
relevant  for  comparative  social  research  and 
politics — for  example,  in  cross-national  compari¬ 
sons  to  establish  measurement  equivalence  across 
countries.  The  invariance  of  the  factor  model 
presented  above  for  nationalism  and  constructive 
patriotism  was  tested  in  34  countries.  According 
to  the  global  fit  measures,  the  factor  loadings 
were  not  substantially  different  between  the  coun¬ 
tries  in  the  ISSP  2003.  Therefore,  one  can  con¬ 
clude  that  the  necessary  condition  for  the  equality 
of  meaning  of  the  constructs  in  the  34  countries  is 
given,  and  the  relations  can  thus  be  compared  on 
the  theoretical  level. 

Furthermore,  in  both  EFA  and  CFA,  higher 
order  factors  can  be  estimated.  This  means  that  the 
correlation  between  factors  is  explained  with  one  or 


Table  2  Unstandardized  Factor  Loadings  and 
Standard  Errors  (U.S.  Sample) 


PAT 

NAT 

X 

0 

1.000 

Like  us 

SE 

0 

a 

X 

0 

1.106 

Better 

SE 

0 

0.130 

X 

0.822 

0 

Demo 

SE 

0.078 

0 

X 

0.873 

0 

Security 

SE 

0.085 

0 

X 

1.000 

0 

Fair 

SE 

a 

0 

Source:  Data  from  International  Social  Survey  Programme 
(ISSP),  National  Identity  II  (2003).  Distributor:  GESIS 
Cologne  Germany,  ZA  No.  3910. 

Notes:  X  =  unstandardized  factor  loadings;  PAT  = 
patriotism;  NAT  =  nationalism;  SE  =  standard  error  of  X. 

a.  No  SE  is  estimated  because  1.0  is  a  fixed  parameter, 
which  is  necessary  for  identification  of  the  model. 

several  higher  order  factors.  Further  topics  are  non¬ 
linear  CFA,  CFA  with  categorical  and  nonnormal 
indicators,  the  relationship  to  item-response  theory, 
application  of  Bayesian  estimation,  CFA  with  longi¬ 
tudinal  data,  multilevel  CFA,  and  mixture  CFA. 
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Fair  division  analyzes  how  to  divide  divisible  or 
indivisible  goods  among  two  or  more  players  in 
order  to  satisfy  certain  criteria  of  fairness.  The 
problem  of  fair  division  goes  back  at  least  to  the 
Bible.  Abraham  and  Lot  had  to  decide  who 
would  get  Canaan  and  who  Jordan;  Solomon 
had  to  decide  which  of  two  women  was  the 
mother  of  a  disputed  baby.  What  was  a  fair  solu¬ 
tion  in  each  case?  This  entry  first  discusses  the 
criteria  of  fair  division,  followed  by  specific  pro¬ 
cedures  and  some  applications  in  political  science 
and  everyday  life. 

The  oldest  known  procedure  for  dividing  a  single 
divisible  good,  such  as  a  cake  or  land,  between  two 
players  is  “I  cut,  you  choose,”  or  divide-and-choose. 
The  same  procedure  can  be  used  if  there  are  multi¬ 
ple  indivisible  goods:  The  divider  partitions  the 
items  into  two  piles,  and  the  chooser  selects  one 
pile.  If  the  divider  has  no  knowledge  of  the  choos¬ 
er’s  preferences,  the  divider  should  divide  the  items 
50/50.  That  way,  whichever  pile  the  chooser  selects, 
the  divider  is  assured  of  getting  50%.  The  chooser, 
by  contrast,  will  get  more  than  50%  if  she  thinks 
that  the  two  piles  are  unequal  and  selects  the  one 
that  she  thinks  is  more  valuable.  If,  however,  the 
divider  knows  the  chooser’s  preferences,  he  can 
exploit  this  information  to  make  one  pile  slightly 
more  valuable  than  50%  for  the  chooser,  so  the 
chooser  will  select  that,  keeping  for  the  divider  the 
pile  he  values  more  (assuming  that  he  values  items 
different  from  what  the  chooser  values).  It  is  impor¬ 
tant  to  note  here  that  fairness  does  not  refer  to  an 
“objective”  criterion  of  equality  but  agreement 
according  to  one’s  subjective  preferences. 


Criteria  of  Fair  Division 

Is  divide-and-choose  fair?  To  make  an  assessment, 
consider  the  following  criteria  for  determining 
what  a  fair  share  is: 

Proportionality:  If  there  are  n  players  and  they  are 
each  equally  entitled  to  the  items,  a  division  is  pro¬ 
portional  if  each  player  thinks  that  he  or  she 
received  at  least  1  In  of  the  total  value.  Proportionality 
can  be  traced  back  to  Aristotle,  who  argued  in  his 
book  Ethics  that  goods  should  be  divided  in  pro¬ 
portion  to  each  player’s  contribution. 

Envy  Freeness:  If  no  player  is  willing  to  give  up 
his  portion  in  exchange  for  the  portion  another 
player  receives,  this  player  will  not  envy  any  other 
player.  In  two-player  disputes,  a  division  is  envy 
free  if  and  only  if  it  is  proportional.  To  see  why 
this  is  so,  assume  that  a  settlement  is  propor¬ 
tional,  so  you  think  you  are  getting  at  least  1/2  of 
the  total  value  of  all  the  items.  Will  you  envy  me? 
Not  if  you  think  that  you  have  at  least  1/2, 
because  then  you  must  think  that  I  have  at  most 
1/2.  Symmetrically,  if  I  think  I  have  at  least  1/2, 
then  I  will  not  envy  you,  so  the  settlement  is  envy 
free.  Conversely,  if  the  settlement  is  envy  free, 
then  both  of  us  think  that  each  is  getting  at  least 
1/2;  otherwise,  at  least  one  of  us  will  envy  the 
other  for  getting  more  than  1/2.  Thus,  if  there  are 
only  two  players,  proportionality  and  envy  free¬ 
ness  are  equivalent. 

In  the  case  of  three  players,  however,  envy  free¬ 
ness  is  more  demanding  than  proportionality.  For 
example,  I  may  think  I’m  getting  1/3,  but  if  I  think 
you’re  getting  1/2  (e.g.,  because  the  third  player,  in 
my  eyes,  is  getting  only  1/6),  then  I  will  envy  you. 
By  contrast,  if  an  allocation  among  three  players  is 
envy  free,  then  I  must  think  I  received  at  least  1/3 
(otherwise,  I  would  think  the  others  together 
received  more  than  2/3,  and  I  would  envy  the  one  or 
both  players  who  received  more  than  1/3).  Hence, 
an  envy-free  allocation  is  always  proportional,  even 
if  there  are  more  than  two  players,  but  a  propor¬ 
tional  allocation  is  not  necessarily  envy  free. 

Equitability:  A  division  is  equitable  if  all  players 
think  that  they  receive  exactly  the  same  value 
above  1  In.  Equitability  is  an  aspect  of  satisfaction 
more  subtle  than  envy  freeness.  To  illustrate  it  in  a 
divorce  settlement,  suppose  you  think  you  got  5 1  % 


886  Fair  Division 


of  the  marital  property  but  your  spouse  thinks  she 
got  90  %  (because  she  had  little  interest  in  what  you 
got).  While  you  do  not  envy  the  portion  of  your 
spouse — in  your  eyes,  she  got  only  49% — you  may 
envy  her  happiness:  She  got  more  of  what  she 
wanted  than  you  did  of  what  you  wanted. 

Equitability  means  that  both  players  think  that 
they  receive  the  same  fraction  of  the  total  as  each 
of  them  values  different  items.  Coupled  with  envy 
freeness,  it  means  not  only  that  both  get  more  than 
50%  but  also  that  both  exceed  50%  by  the  same 
amount.  Thus,  if  both  spouses  think  that  they 
receive  70%  of  the  total  value,  they  are  equally 
pleased  by  their  allocations.  Equitability  may  be  a 
difficult  property  to  ascertain.  Elow  does  one  mea¬ 
sure  whether  both  players  are  equally  happy  with 
their  allocations?  In  fact,  it  generally  requires  an 
interpersonal  comparison  of  utilities. 

Efficiency:  An  allocation  is  efficient  if  there  is  no 
other  allocation  that  is  better  for  some  player  with¬ 
out  being  worse  for  some  other  player.  Efficiency  by 
itself — that  is,  when  not  linked  with  properties  such 
as  proportionality,  envy  freeness,  or  equitability — is 
no  guarantee  that  an  allocation  will  be  fair.  For 
example,  an  allocation  that  gives  everything  to  me 
and  nothing  to  you  is  efficient:  Any  other  alloca¬ 
tion  will  make  me  worse  off  when  it  makes  you 
better  off.  It  is  the  other  properties  of  fairness, 
combined  with  efficiency,  that  ensure  that  the 
maximum  possible  value  is  distributed  according 
to  the  aforementioned  properties. 

Fair-Division  Procedures 

The  modern  mathematical  theory  of  fair  division 
has  its  roots  in  the  1940s,  particularly  in  the  work 
of  Polish  mathematicians.  They  proposed  different 
procedures  for  dividing  both  divisible  goods  such 
as  a  piece  of  land  and  indivisible  goods  such  as 
a  car,  boat,  or  house.  Procedures  that  involve 
dividing  divisible  goods  include  a  plethora  of  cake¬ 
cutting  schemes,  some  of  which  use  “moving 
knives”  that  parties  stop  when  they  believe  they 
have  a  fair  share;  others  involve  trimming  pieces 
from  a  cake,  or  a  set  of  divisible  goods,  to  con¬ 
struct  fair  shares.  Many  of  these  procedures  are 
quite  esoteric  and,  therefore,  impractical  to  apply. 
Besides  divide-and-choose,  probably  the  two  most 
feasible  procedures  are  the  following: 


1.  Strict  and  balanced  alternation:  Strict  alterna¬ 
tion  is  simply  taking  turns:  You  pick  an  item,  then  I 
pick  one,  you  choose  again,  and  so  on.  Of  course, 
going  first  can  be  a  huge  advantage;  giving  extra 
choices  to  compensate  for  going  second  can  reduce, 
if  not  eliminate,  this  advantage.  A  specific  way  of 
balancing  choices  out  yields  a  procedure  called  bal¬ 
anced  alternation.  If  there  are,  say,  four  items  to  be 
divided  among  Players  A  and  B,  the  order  of  choice 
under  balanced  alternation  is  ABBA.  Balanced  alter¬ 
nation  can  be  extended  to  more  than  two  players 
and  any  number  of  items  they  may  wish  to  divide. 

2.  Adjusted  winner:  Two  players  begin  by  inde¬ 
pendently  distributing  a  total  of  100  points  across 
all  the  items  to  be  divided,  depending  on  the  rela¬ 
tive  value  they  attach  to  each.  Thus,  if  you  consider 
a  certain  item  to  be  worth  1/4  of  the  total  value  of 
everything  to  be  divided,  then  you  would  put  25 
points  on  it.  The  term  winner  in  “adjusted  winner” 
comes  from  the  next  step:  Each  player  is  temporar¬ 
ily  given  the  items  on  which  it  placed  more  points 
than  its  opponent.  Then,  the  adjusted  part  comes 
in:  Items  are  transferred  from  the  initial  winner  to 
the  initial  loser  in  a  certain  order  until  the  point 
totals  of  the  two  players  are  equal.  This  order  of 
transfer,  which  requires  splitting  at  most  one  item 
(it  could  also  be  shared  or  sold  and  the  proceeds 
divided),  guarantees  that  the  final  allocation  will 
be  envy  free,  equitable,  and  efficient  if  the  players 
are  truthful.  To  be  sure,  players  may  try  to  manip¬ 
ulate  a  procedure  to  their  advantage  by  giving  false 
information  about  their  preferences.  However, 
most  fair-division  procedures  provide  certain  guar¬ 
antees,  even  against  players  that  are  exploitative. 

To  return  to  divide-and-choose,  it  was  shown 
that  the  divider  can  guarantee  that  he  receives  a 
tied-for-largest  portion,  whatever  the  chooser 
selects,  by  dividing  the  items  50/50  for  himself. 
Moreover,  the  chooser  can  do  at  least  as  well  by 
selecting  what  she  considers  the  larger  pile,  so  this 
procedure  is  envy  free.  However,  divide-and-choose 
may  be  neither  efficient  nor  equitable.  Thus,  both 
players  may  do  better  with  some  other  allocation, 
making  the  division  inefficient.  In  the  case  of  a 
chocolate  and  vanilla  cake,  for  example,  if  the 
divider  does  not  know  which  flavor  the  chooser 
favors,  the  divider  will  cut  it  so  each  pile  comprises 
50%  chocolate  and  50%  vanilla.  But  if,  in  fact,  the 
divider  likes  chocolate  and  the  chooser  likes  vanilla, 
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it  would  be  better  for  both  players  if  the  cut  were 
made  so  as  to  separate  the  chocolate  from  the 
vanilla,  giving  each  player  an  opportunity  to  get  the 
preferred  flavor.  Divide-and-choose  may  be  inequi¬ 
table  because  the  chooser  could  receive  a  piece  she 
considers  larger  than  50%  when  the  cutter  receives 
only  50%.  Of  course,  the  tables  can  be  turned  if  the 
cutter  has  information  about  the  chooser’s  prefer¬ 
ences  and  uses  it  to  get  a  substantially  greater  than 
50%  portion  for  himself  when  the  chooser  must 
settle  for  only  slightly  more  than  50%. 

Applications 

The  procedures  discussed  so  far  offer  an  algorith¬ 
mic  approach  to  fair  division:  A  set  of  rules  is 
specified;  subject  to  the  rules,  the  players  select 
strategies  (e.g.,  the  divider  cuts  the  cake  into  two 
equal  portions)  that  give  each  certain  guarantees 
about  the  resulting  allocation  (e.g.,  it  will  be  envy 
free).  Political  scientists,  sociologists,  and  applied 
economists  have  taken  a  more  empirical  tack, 
seeking  to  determine  the  conditions  under  which 
fairness  and  departures  from  it  occur  in  the  world 
and  what  consequences  they  have  for  people  and 
institutions.  Psychologists,  especially,  have  paid 
heed  to  how  perceptions  of  fairness  impinge  on 
people’s  attitudes  and  affect  their  behavior. 

Theoretical  economists  and  game  theorists  have 
been  more  interested  in  finding  axioms  that  charac¬ 
terize  fairness.  Their  models  are  often  nonconstruc¬ 
tive  in  the  sense  that  they  establish  the  existence — or 
sometimes  the  nonexistence — of  an  allocation  satis¬ 
fying  certain  properties.  (The  problem  of  nonexis¬ 
tence  is  particularly  acute  in  the  case  of  indivisible 
goods.)  But  even  models  that  show  existence  may 
provide  no  clue  as  to  how  to  construct  the  desired 
allocation.  Many  analysts  are  interested  in  fairness 
because  they  desire  to  help  people  settle  their  differ¬ 
ences  amicably.  Numerous  books  have  been  written 
that  purport  to  show  how  to  achieve  win-win  solu¬ 
tions,  whereby  players  win — or  at  least  do  not 
lose — on  the  issues  they  consider  most  important. 

Ultimately,  players  will  want  to  know  on  which 
issues  they  will  win,  on  which  they  will  lose,  and  on 
which  they  will  have  to  compromise.  For  this  pur¬ 
pose,  fair-division  procedures  that  can  guarantee 
that  the  disputants  do  as  well  as  possible  in  realiz¬ 
ing  all  the  win-win  potential  that  is  available  can 
be  extremely  useful.  From  a  normative  perspective, 


fair-division  procedures  can  help  disputants  reduce 
the  frustration,  anger,  and  occasional  violence  that 
accompany  escalating  demands  and  endless  hag¬ 
gling.  By  facilitating  the  disputants’  bringing  their 
own  closure  to  a  dispute,  fair-division  procedures 
help  them  avoid  the  arbitrary  imposition  of  a  settle¬ 
ment  or  continuing  impasse.  Thereby,  these  proce¬ 
dures  tend  to  level  the  playing  field,  so  players  with 
greater  resources  or  capabilities  are  prevented  from 
gaining  unduly  large  shares.  In  this  sense,  fair- 
division  procedures  offer  a  means  to  prevent 
exploitation  of  the  weak  by  the  strong  and  to 
minimize  the  effects  of  power  differences. 
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Fascism 


Fascism  has  become  a  generic  term  used  to  repre¬ 
sent  a  political  movement  that  developed  between 
the  two  world  wars  principally  in  Europe,  with  a 
few  extensions  notably  in  Latin  America.  This  term 
was  employed  for  the  first  time  as  a  partisan  label 
in  Italy  in  1919  by  Benito  Mussolini,  who  appealed 
to  younger  veterans  to  establish  Fasci  di  combatti- 
mento.  In  Italy,  a  new  type  of  political  organization 
emerged,  developed,  and  was  eventually  imposed — 
one  that  more  or  less  indirectly  foreshadowed  and 
inspired  a  set  of  movements  and  political  regimes, 
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including  Adolf  Hitler’s  National  Socialism  in 
Germany,  Leon  Degrelle’s  Rexisme  in  Belgium, 
Oswald  Mosley’s  Fascist  Union  in  Great  Britain, 
Corneliu  Codreanu’s  Iron  Guard  in  Romania, 
Ferenc  Szalasi’s  Arrow  Cross  Party  in  Hungary, 
Jose  Antonio  Primo  de  Rivera’s  Falange  in  Spain, 
Plinio  Salgado’s  Integralist  Action  in  Brazil,  and 
Jacques  Doriot’s  French  Popular  Party  in  France. 
Within  this  group,  important  differences  persisted 
among  movements.  Some  of  them  remained  minor¬ 
ity  opposition  forces,  while  others  formed  political 
regimes;  some  ruled  in  coalition,  while  others  gave 
rise  to  dictatorships  based  on  a  single  party.  There 
are  also  important  differences  in  the  ideology  or  the 
social  basis  of  these  fascist-inspired  movements  as 
these  emerged  and  took  shape  in  very  different 
socioeconomic  and  cultural  contexts.  Important 
variables  include  the  degree  to  which  their  setting 
was  industrialized  and  secular  and  the  extent  of 
nationalization  and  politicization  of  the  masses.  It  is 
nevertheless  possible  to  classify  a  group  of  move¬ 
ments  and  regimes  into  a  category  labeled  fascism. 
These  share  some  major  typical  traits,  formed  in  the 
context  of  the  political  laboratory  represented  by 
the  Blackshirts  in  Italy.  This  entry  analyzes  fascism 
through  its  connections  with  World  War  I,  its  ideol¬ 
ogy,  and  finally  the  way  in  which  it  produced  a  new 
kind  of  regime. 

The  Decisive  Influence  of  World  War  I 

The  first  common  feature  lies  in  relation  to  war.  In 
Europe,  World  War  I  is  indeed  the  cradle  of  fascism. 
It  represents  a  critical  element  of  context,  for  both 
the  birth  of  fascist  leaders  and  the  organizational 
and  ideological  forms  that  characterized  this  politi¬ 
cal  movement.  Whether  in  Italy  or  in  Germany,  the 
most  committed  elements  of  fascism  and  National 
Socialism  were,  for  the  most  part,  young  veterans 
for  whom  the  war  proved  to  be  a  fundamental  ini¬ 
tiation  experience.  Like  their  respective  leaders, 
Mussolini  and  Hitler,  they  supported  the  war  and, 
above  all,  gloried  in  the  “spirit  of  the  trenches”;  they 
also  suffered  from  postwar  demobilization.  Their 
political  trajectory,  their  values,  the  ideology  they 
claimed,  the  political  organizations  they  would 
frame,  and  the  regimes  they  sometimes  succeeded  in 
creating  cannot  be  understood  outside  of  this  initial 
context:  The  Great  War  led  to  the  brutalization  of 
societies. 


The  influence  of  the  war  could  already  be  seen  in 
the  type  of  partisan  organizations  set  up  by  those 
who  would  become  fascist  leaders.  First  in  Italy, 
then  in  Germany  and  other  European  countries, 
they  forged  a  new  type  of  organization:  the  militia 
party.  This  organizational  model  was  a  synthesis  of 
the  mass  party,  previously  unique  to  the  labor 
movement,  and  of  the  military  organization  systems 
developed  in  the  front  lines  of  World  War  I.  The 
result  was  a  form  of  political  organization  unlike 
any  other,  driven  by  agents  who,  in  the  case  of  Italy, 
came  mainly  from  the  middle  class,  often  fought  in 
the  ranks  of  the  extreme  Left  (left-wing  Socialist 
Party,  revolutionary  syndicalism),  and  were  most 
often  veterans  of  the  Italian  Army  elite  troops 
(Arditi).  This  militia  party  was  designed  to  assemble 
an  avant-garde  group  organized  around  a  military 
model:  uniforms,  strict  hierarchical  organization, 
inculcation  of  discipline,  and  maintenance  of  mili¬ 
tary  sociability  modeled  on  the  camaraderie  of  the 
front.  Thanks  to  this  organization,  the  fascists 
introduced  methods  of  war  into  the  political  arena. 
In  the  context  of  latent  civil  war  that  characterized 
immediate  postwar  Italy  and  parts  of  Europe  in  the 
1920s  as  well  as  the  climate  of  political  radicaliza- 
tion  caused  by  the  crisis  of  the  early  1930s,  the 
fascist  militias  (including  death  squads,  Storm 
Troopers,  green  shirts,  etc.)  carried  out  violent 
attacks  on  labor  movement  organizations,  which 
included  attacks  on  the  Socialist  and  Communist 
parties’  public  meetings,  ransacking  of  left-wing 
municipalities,  burning  down  local  trade  union 
offices,  and  intimidating  the  red  league  of  farmers. 
During  this  fierce  struggle  against  labor  organiza¬ 
tions,  the  fascists  won  the  sympathy  and  even  the 
active  support  of  some  economic  elites  and  ruling 
circles,  a  development  that  would  greatly  facilitate 
their  coming  to  power  in  a  coalition  government 
with  Center  and  Right  traditional  parties. 

Fascist  Ideology 

These  militia  parties  mobilized  their  members 
around  a  set  of  values,  creeds,  and  myths  that 
formed  the  common  ideological  basis  of  fascism. 
This  was  primarily  an  ideology  of  action  that  found 
its  immediate  extension  in  a  style  and  in  a  quite 
singular  aesthetic.  As  in  organizational  matters, 
fascisms  practiced  hybridization  in  ideological  mat¬ 
ters.  They  created  a  synthesis  of  influences  from  the 
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ultra-Right  (the  most  radical  nationalist  and  racist 
standards)  and  influences  from  the  ultra-Left  (e.g., 
revolutionary  syndicalism  inspired  by  Sorel).  France 
was  one  of  the  first  laboratories  of  this  new  ideo¬ 
logical  alchemy.  The  result  was  a  cult  of  action, 
violence,  and  youth  and  a  revolutionary  rupture 
with  the  parliamentary  system  and  liberal  society. 
Fascist  ideology  initially  involved  hostility  to  the 
political  currents  of  the  Enlightenment;  liberalism 
and  Marxism  were  its  targets  of  choice.  It  was  based 
on  a  radical  denunciation  of  social  egalitarianism  as 
well  as  of  bourgeois  society,  presenting  itself  as  a 
third-force  ideology,  different  from  both  capitalism 
and  communism.  On  economic  and  social  develop¬ 
ment,  this  third  position  would  find  its  solution  in 
the  corporatist  principle  of  harmonious  cooperation 
between  labor  and  capital  for  the  benefit  of  the 
nation;  nationalism — not  class  consciousness — was 
the  backbone  of  the  fascist  ideology.  This  ideology, 
accordingly,  was  organized  around  the  myth  of  the 
nation  conceived  as  an  organic  and  compact  com¬ 
munity.  Fascism’s  mission  was  to  purify  the  nation 
politically,  anthropologically,  and  even  racially  with 
a  view  to  the  assertion  of  its  power.  Most  fascist 
movements,  beginning  of  course  with  National 
Socialism,  put  racism  and  anti-Semitism  at  the  cen¬ 
ter  of  their  doctrine  and  political  plan.  Even  Italian 
fascism,  which  was  initially  based  on  a  more  politi¬ 
cal  conception  of  the  nation,  ended  in  the  mid- 
1930s  by  putting  the  issue  of  race  in  the  foreground 
and  adopting  anti-Semitic  legislation  in  1938.  This 
homogeneous  national  community  was  destined  to 
be  permanently  mobilized  for  the  goal  of  conquest. 
The  fascist  ideology  was  guided  by  an  imperialist 
teleology:  conquest  of  Lebensraum  in  the  case  of 
National  Socialism  and  construction  of  a  new  world 
order  in  the  case  of  Italian  fascism.  Fascism  was  not 
just  born  from  war,  it  was  also  born  for  war:  Its 
intention  was  to  ideologically  mobilize  the  popula¬ 
tion,  with  a  view  to  forming  a  combatant  commu¬ 
nity  fit  for  military  conquest. 

A  Regime  of  Mass  Mobilization 

In  the  Italian  and  German  cases,  fascism  as  a  ris¬ 
ing  power  aimed  objectively  to  establish  a  system 
of  permanent  mobilization  in  which  society  was 
seen  as  an  army  and  the  citizen  as  a  soldier.  Along 
with  this  initiative  of  a  militarized  society,  the 
development  of  fascism  into  a  regime  started  a 


political  sanctification  process;  its  ideology  was 
introduced  into  “civil  religion,”  which  citizens 
were  called  to  practice  with  ardor.  This  was 
accompanied  by  a  massive  production  of  liturgies, 
ceremonies,  and  political  customs.  Accordingly, 
fascism’s  goal  of  the  “New  Man”  included  both  a 
believer  and  a  soldier:  an  ideal  summed  up  by  one 
of  the  most  famous  programmatic  slogans  of  the 
Italian  fascist  regime — “believe,  obey,  fight.” 

This  mobilizing  regime  is  based  on  a  very  specific 
mechanism.  The  mass  party  was  the  essential  ele¬ 
ment,  the  central  institution  of  fascist  society. 
Fascist  movements  came  to  power  in  coalition,  for¬ 
mally  respecting  constitutional  executives.  But  very 
quickly,  they  implemented  strategies  to  monopolize 
power  by  violently  destroying  labor  movement 
organizations  and  causing  dissolution  or  taking 
over  the  traditional  right-wing  parties.  This  process 
quickly  led  to  a  situation  of  a  one-party  dictator¬ 
ship.  The  dynamics  of  fascism  then  operated  as  a 
dual  process  of  further  state  and  civil  society  pene¬ 
tration.  The  capture  of  power  set  in  motion  a  sym¬ 
biotic  process  of  nationalization  of  the  party  and 
partisan  penetration  of  the  state.  The  state  party 
proved  to  be  an  effective  tool  for  the  regime’s  objec¬ 
tives.  In  particular,  it  allowed  deep  penetration  into 
a  civil  society  that  was  heavily  supervised  by  youth 
organizations,  unions,  associations,  and  govern¬ 
mental  agencies  in  areas  such  as  sports,  culture,  and 
recreation.  Thanks  to  the  millions  of  members  in 
fascist  organizations,  the  system  could  organize  the 
ultrasupervised  mobilization  of  the  population, 
ensure  the  dissemination  of  fascist  values  and  ideol¬ 
ogy,  and  organize  the  faith  in  the  regime  and  its 
leader.  This  leader  (Duce,  Fiibrer ),  “Almighty”  and 
“infallible,”  was  also  one  of  the  typical  elements  of 
fascist  domination.  His  seemingly  limitless  power 
was  inextricably  linked  to  the  strength  of  the  party. 
This  is  because  the  leader  had  quasi-absolute  con¬ 
trol  over  a  party  that  itself  had  an  unrestricted  influ¬ 
ence  over  the  population.  The  leader’s  power  was 
thus  a  direct  reflection  of  the  power  of  the  fascist 
movement  as  it  became  a  political  regime.  Fascist 
organizations  were  also  formidable  instruments 
that  were  used  to  control  political  conformity. 
Spread  over  the  whole  territory,  present  in  all 
spheres  of  social  life — in  the  workplace  as  well  as  in 
recreational  activities,  in  factories  as  well  as  in 
families — they  contributed  to  the  establishment  of  a 
social  and  political  control  exerted  on  the  entire 
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population  with  intensity  unlike  any  other,  finding 
an  equivalent  only  in  Stalin’s  Soviet  Union. 

This  situation  leads  to  the  question  of  the  nature 
of  fascism  and,  more  precisely,  its  place  within  the 
typologies  forged  by  political  science  to  situate  the 
political  regimes  of  the  20th  century.  The  answer  to 
the  question  of  whether  fascism  should  be  classified 
as  an  authoritarian  regime  or  a  totalitarian  regime 
depends,  in  part,  on  what  principles  of  logic  and 
restrictive  definitions  are  implemented.  If  we  adopt 
a  restrictive  definition  of  fascism  and  say  that  only 
Italian  fascism  and  Nazism  fully  qualify,  we  can 
consider  fascism  to  be  one  component  of  the  totali¬ 
tarian  phenomenon.  This  is  indeed  a  form  of  domi¬ 
nation  based  on  the  mobilization  of  the  population 
through  a  mass  political  organization  around  a  civil 
religion,  generating  a  level  of  control  of  individuals 
that  is  unprecedented  in  its  intensity,  resulting  in  the 
eradication  of  all  forms  of  organized  political  oppo¬ 
sition  and  functioning  even  in  tandem  with  any 
dissenting  public  expression.  Nazism  and,  to  a 
lesser  extent,  Italian  fascism  largely  corresponded 
to  this  totalitarian  type  by  the  time  they  collapsed 
under  the  weight  of  their  military  defeats. 
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University  of  Montpellier 
Montpellier,  France 
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Fascist  Movements 


In  the  years  between  World  Wars  I  and  II,  there 
were  violent  fascist  movements  all  over  Europe. 
Nearly  every  country  had  at  least  one;  France  had 


six  and  Switzerland  three.  The  fascist  category  was 
quite  vague  and  included,  among  several  compet¬ 
ing  groups,  militant  nationalists  and  veterans  of 
World  War  I.  Their  chief  antagonists,  the  socialists 
and  communists,  called  them  fascists — from  Benito 
Mussolini’s  fasci  di  combattimento  (combat 
squads) — but  the  criteria  for  such  a  label  to  be  used 
were  rather  hazy  at  first.  Even  after  nearly  a  decade 
in  power,  Mussolini  himself  was  reluctant  to  define 
the  faith  of  his  movement  until  he  allowed  Giovanni 
Gentile,  a  fascist  philosopher,  to  write  his  famous 
article  on  fascism  for  the  Italian  Encyclopedia 
Treccani  in  1931.  A  once-prominent  revolutionary 
socialist  leader,  Mussolini  forged  the  character  of 
his  movement  as  he  went  along,  seeking  violent 
confrontations  with  his  erstwhile  party  comrades 
and  accepting  the  support  of  whatever  group 
would  back  him  at  the  time,  including  former  ene¬ 
mies  such  as  big  industry,  the  state,  and  the 
Catholic  Church.  During  his  struggle  for  power,  he 
had  learned  tactics  from  Vladimir  Lenin’s  violent 
takeover  of  the  mighty  Tsarist  Empire.  Adolf  Hitler 
in  turn  sought  to  imitate  Mussolini’s  methods,  and 
finally,  scores  of  other  European  fascist  movements 
began  to  imitate  and  curry  favor  with  the  trium¬ 
phant  Third  Reich.  Opportunism  and  constantly 
changing  profiles  were  typical  features  of  the  early 
fascist  movements.  Their  virulence  and  relative 
“purity,”  however,  were  quickly  watered  down 
and  compromised  once  they  participated  in  govern¬ 
ments.  Fascist  regimes  usually  sidelined  their  “old 
fighters”  and  left  policy  to  the  leadership,  the  army, 
and  the  bureaucracy. 

The  most  important  European  fascist  move¬ 
ments  were  the  Italian  National  Fascist  party 
( Partito  Nazionale  Fascista,  or  PNF),  the  National 
Socialist  (NS)  German  Workers’  Party  (National- 
sozialistiscbe  Deutsche  Arbeiterpartei,  or  NSDAP), 
the  Hungarian  Arrow  Cross,  the  Romanian  Iron 
Guard,  the  Belgian  Rexists,  and  the  Spanish  JONS/ 
Falange,  but  there  were  dozens  of  others,  including 
many  imitators  of  the  most  successful  fascist  move¬ 
ments.  Some  were  more  on  the  side  of  labor,  some 
with  employers.  Some  claimed  to  defend  the  church; 
most  were  stridently  anti-Semitic  and  opposed  to 
minorities.  Imitation  also  characterized  their  use  of 
quasi-military  uniforms  and  ranks,  their  flags,  and 
their  internal  organization.  The  larger  movements 
developed  specialized  organizations,  for  example, 
for  youth  and  women  and  also  for  quasi-military 
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enforcement  operations  and  subversion  abroad. 
During  World  War  II,  the  Third  Reich  armed  forces 
even  established  elite  Waffen-SS  divisions — the  SS 
(originally  Hitler’s  body  guard)  being  otherwise 
employed  for  enforcement  and  concentration  camp 
guard  duty.  Some  movements,  such  as  the  NSDAP, 
also  strove  to  recruit  into  their  ranks  corporatist 
groups,  such  as  the  German  Labour  Front  and  pro¬ 
fessional  NS  associations,  for  example,  a  NS  Doctors 
Federation,  NS  Farmers,  and  many  others  to  which 
all  German  workers,  doctors,  and  farmers,  and  so 
on  had  to  belong. 

Historical  Causes  of  Fascism 

Major  historical  issues  and  mid-20th-century  con¬ 
flicts  gave  rise  to  most  fascist  movements.  Among 
the  most  important  was  a  country’s  participation 
in  World  War  I,  especially  on  the  losing  side.  The 
defeated  of  the  war — the  Central  Powers  (Germany, 
Austro-Hungary,  and  Turkey)  and  others — who 
felt  left  out  of  the  distribution  of  the  spoils  were 
fertile  ground  for  organizations  of  resistance  and 
revenge.  It  is  impossible  to  imagine  the  atmosphere 
surrounding  the  birth  of  Italian  fascism  without 
mentioning  the  600,000  Italian  casualties,  the 
sense  of  having  been  cheated  out  of  the  fruits  of 
victory,  and  the  territorial  ambitions  of  many 
middle-class  Italians.  Germans  felt  similarly 
stricken  by  vast  casualties  and  economic  losses  and 
obsessed  with  the  loss  of  German  colonies  and 
lands.  The  war  had  destroyed  four  empires  (Austro- 
Hungarian,  German,  Russian,  and  Turkish)  and 
led  to  vast  transfers  of  territory,  leaving  behind 
seething  resentments  and  unfulfilled  aspirations, 
sowing  hatred  and  the  seed  for  future  conflicts,  for 
example,  in  the  Balkans,  and  still  generating 
bloodshed  and  ethnic  cleansing  today.  The  so- 
called  axis  of  fascist  powers  and  movements  after 
World  War  I  was  made  up  of  countries  and  ele¬ 
ments  violently  opposed  to  the  peace  settlements 
of  the  war,  such  as  the  Treaties  of  Versailles  and 
Trianon.  The  war  also  explains  the  extreme  degree 
of  militarization  of  these  fascist  movements,  the 
prominent  presence  of  veterans,  and  the  wide¬ 
spread  belief  among  them  that  quasi-military 
violence  could  overpower  democratic  majority 
decisions  and  parliamentary  rule.  For  this  purpose, 
they  all  sported  private  armies  like  the  Nazi  storm 
troopers  and  the  Italian  Blackshirts  and  trumpeted 


their  belief  in  political  violence.  Fascism  thus  left  a 
legacy  that  was  most  inhospitable  to  the  develop¬ 
ment  of  constitutional  democracy. 

The  second  reason  for  the  formation  of  fascist 
movements  was  the  significant  Left-Right  schism 
that  had  been  developing  in  various  European 
countries  for  decades.  Industrialization,  capitalism, 
and  organized  labor  at  different  stages  of  political 
modernization  had  reached  a  point  of  all-out  con¬ 
frontation  by  the  time  of  the  war.  In  highly  indus¬ 
trialized  areas  such  as  Germany  and  Northern  Italy, 
a  vanguard  of  socialist  and  communist  parties 
emerged  with  electoral  majorities  and  was  stirred  to 
action  by  the  Russian  Revolution  of  1917.  Tsarist 
Russia  was  still  mostly  agrarian,  but  the  war  had 
brought  it  to  its  knees  with  the  onslaught  of  paci¬ 
fism,  anarchism,  and  Lenin’s  socialist  forces.  The 
Whites,  the  military  defenders  of  the  monarchy, 
nobility,  and  traditional  order  in  the  bloody  Russian 
civil  war,  in  many  ways  resembled  the  fascist  move¬ 
ments  of  Central  and  Eastern  Europe.  In  republican 
Spain,  rebellious  workers,  anarchists,  socialists,  and 
communists  faced  off  against  capitalists,  state 
forces,  the  Church,  and  General  Franco’s  army.  In 
semi-industrial  Italy,  when  the  rising  socialist  chal¬ 
lenge  culminated  in  the  seizure  of  factories  and 
agrarian  estates,  the  fascists  spearheaded  a  counter¬ 
revolution  that  was  really  aimed  at  establishing 
their  own  dictatorship.  Everywhere,  fascist-like 
movements  were  defined  by  their  violent  opposition 
to  the  inexorable  rise  of  socialist  or  communist 
movements,  and  the  brutal  use  of  violence  became 
their  trademark. 

There  were  other,  later,  causes  for  the  fascist 
eruptions  in  mid-20th-century  Europe:  The  Great 
Depression  of  the  1930s  mostly  aggravated  and 
deepened  the  Left-Right  schism  rather  than  being 
the  primary  cause.  Similarly,  the  rising  power  of 
the  fascist  Axis  and  its  interventions  into  the 
domestic  politics  of  small  neighboring  states — for 
example,  by  Mussolini’s  Italy  into  Austrian  or 
Hungarian  politics — created  incentives  for  fascist 
movements  or  profascist  realignments.  Eventually, 
fascist  puppet  regimes  came  into  power  in  the 
Balkans  and  in  German-occupied  Northern  Europe 
during  World  War  II.  Native  fascist  movements 
like  the  Hungarian  Arrow  Cross,  the  Romanian 
Iron  Guard,  and  the  Norwegian  Nasjonal  Samling 
thus  briefly  achieved  participation  in  government, 
although  with  little  popular  support. 
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After  World  War  II,  the  more  prominent  inter¬ 
war  fascist  movements  generated  small  successor 
movements  such  as  the  Italian  Social  Movement 
(. Movimento  Sociale  Italiano,  or  MSI)  and  the  half 
dozen  German  neo-Nazi  groups.  Their  members 
generally  were  nostalgic  “old  fighters,”  and  their 
children  were  constrained  by  West  German  anti- 
Nazi  legislation  to  refrain  from  publicly  repeating 
old  Nazi  propaganda.  There  were  several  waves  of 
such  successor  organizations,  each  progressively 
less  authentic.  They  even  appeared  in  Eastern 
Europe,  following  the  collapse  of  the  communist 
empire,  consisting  mostly  of  violent,  anti-immigrant 
youth  gangs  and  nativist  skinhead  bands. 

Membership 

The  focus  on  fascist  movements  naturally  puts  the 
emphasis  on  the  membership.  How  large  were 
these  movements?  What  was  the  typical  age  of  the 
members?  At  what  point  did  they  join  up  and  why? 
The  largest  among  them — the  PNF  and  the 
NSDAP — approached  the  magical  1  -million  mark 
at  about  the  time  they  came  to  power  or  shortly 
thereafter,  but  their  membership  was  notoriously 
volatile,  as  if  motivated  only  by  short-term  bouts  of 
passion.  After  taking  power,  both  tended  to  absorb 
other  existing  groups  such  as  youth  organizations 
or  other  parties.  Fascist  movements,  like  most 
political  parties,  were  inclined  to  brag  about  their 
membership  figures  or  to  keep  poor  records.  Hectic 
membership  growth  was  also  related  closely  to  the 
postwar  demobilization  from  military  service,  though 
fascist  movements  were  in  competition  with  veterans 
groups  and,  in  Germany  and  Austria,  with  large 
right-wing  vigilante  organizations  (Einwohnerwehr). 
A  large  influx  also  occurred  after  election  triumphs 
and  the  takeover  of  power. 

At  the  very  beginning,  of  course,  the  member¬ 
ship,  say  of  the  Italian  “fascists  of  the  first  hour”  or 
the  Nazi  “old  fighters”  was  very  small.  Their  first 
goal,  in  terms  of  membership,  had  to  be  to  acquire 
a  mass  following  and  audience  amid  the  huge  fol¬ 
lowings  of  their  left-wing  rivals,  given  the  burgeon¬ 
ing  mass  politics  of  most  interwar  countries. 

Social  Composition 

Although  there  is  little  and  often  unreliable  informa¬ 
tion  about  the  social  composition  of  most  interwar 


European  fascist  movements,  the  question  of  their 
socioeconomic  origins,  from  their  beginnings, 
took  on  great  significance.  Given  the  rising  chal¬ 
lenge  of  labor  movements  and  the  prominence  of 
Marxist  interpretations  in  advanced  industrial 
countries  throughout  much  of  the  20th  century, 
many  analysts  equated  socialist  and  communist 
parties  with  the  rising  industrial  proletariat  while 
associating  fascist  movements  with  the  militant 
defense  of  the  interests  of  the  bourgeois  middle 
class  and  old  aristocratic  elites.  Fascist  antisocial¬ 
ism  was  thus  interpreted  as  the  bourgeois  “class 
struggle  a  rebours”  (the  class  struggle  in  reverse) 
against  the  advance  of  working-class  parties  and 
trade  unions. 

There  was  of  course  much  evidence  for  this 
antisocialist  behavior  of  fascists,  although  they 
themselves  resisted  such  an  interpretation  and  fre¬ 
quently  directed  their  polemics  also  against  capi¬ 
talism  and  the  “reactionary”  social  elites.  The 
NSDAP  even  called  itself  National  Socialist 
German  Workers  Party  and  often  pointed  out  its 
worker-friendly  program.  Nearly  a  third  of  its 
rank-and-file  membership  in  1935,  indeed,  were 
blue-collar  workers,  and  one  fifth  were  low-level 
white-collar  employees.  To  be  sure,  the  German 
Socialists  (Sozialdemokratische  Partei  Deutscb- 
lands;  SPD)  and  Communists  (Kommunistische 
Partei  Deutscblands-,  KPD)  had  a  far  larger  work¬ 
ing-class  component,  and  the  blue-collar  share  of 
the  total  German  population  at  the  time  was 
46.3%  (Peter  Merkl,  1980a,  Tables  3-5,  pp.  765- 
767;  Tables  9-11,  pp.  774-776).  The  guiding  for¬ 
mula  of  most  contemporary  fascist  movements 
was  a  third-force  formula  combining  strident 
nationalism  with  a  form  of  non-Marxist  socialism. 

But  the  crux  of  the  matter  is  that  it  makes  little 
sense  to  equate  Karl  Marx’s  sweeping  concept  of  the 
rising  proletariat  with  the  fragmented  occupational 
statistics  of  socialist  and  Nazi  movements.  Occu¬ 
pational  status  does  not  necessarily  imply  a  class  role 
and  ideology.  It  makes  a  difference,  also,  whether  a 
worker  is  skilled  or  unskilled,  a  foreman  or  just  an 
ordinary  worker;  whether  he  or  she  belongs  to  a 
well-established,  traditional  trade,  is  a  union  mem¬ 
ber  or  not,  and  so  on.  The  Nazis  and  the  Italian 
fascists  even  had  their  own,  large  trade  unions  and 
so  did  various  conservative  parties,  such  as  the 
Catholic  Center  Party  in  Germany  and  the  Christian 
trade  unions  in  several  other  European  nations. 
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The  status  differences  within  fascist  parties  also 
matter:  The  blue-collar  share  among  all  NSDAP 
party  office  holders,  for  example,  was  much 
smaller — only  23%  in  1935  (Merkl,  1980a) — than 
among  the  rank  and  file.  Neither  Hitler  nor 
Mussolini  nor  comparable  European  fascist  lead¬ 
ers  were  workers,  although  at  the  start  of  his 
political  career,  Mussolini  was  a  revolutionary 
socialist  intellectual  but  hardly  a  worker. 

Failing  to  clearly  associate  Nazis  and  other  fas¬ 
cists,  social  analysts  then  tried  to  concentrate  on 
exactly  how  the  NSDAP  differed  from  the  popula¬ 
tion  of  Germany  in  1935:  According  to  Merkl 
(1980a),  the  party  had  far  higher  percentages  of 
white-collar  employees  (20.6%),  civil  servants 
(9.6%),  teachers  (3.4%),  professionals  (3.2%), 
business  owners  (7.5%),  and  independent  artisans 
(19.5%). 

Unable  to  attribute  fascism  to  bourgeois  and 
upper-class  interests  alone,  the  analysts  tried  to 
label  the  NSDAP  a  “lower-middle-class”  or  “petty 
bourgeois”  movement.  But  although,  the  German 
white-collar  occupations,  for  example,  had  grown 
prodigiously  since  1900,  the  concept  of  a  lower 
middle  class  united  by  a  common  awareness  of  its 
interests  is  quite  vague.  Does  the  professional  mili¬ 
tary,  the  civil  service,  or  the  large  farm  population 
belong  to  it  or  not?  It  was  never  a  part  of  any  over¬ 
arching  class  struggle,  which  would  have  assigned 
it  a  role  in  the  dynamic  of  the  larger  society. 

Change  in  the  Movements 

In  the  dozen  or  so  years  since  its  beginnings,  fur¬ 
thermore,  the  NSDAP  changed  radically,  and  the 
disproportionate  numbers  of  demobilized,  militant 
veterans  and  university  students  of  the  early 
(Bavarian)  phase  were  replaced  by  white-collar 
and  blue-collar  elements  in  the  party  of  the  1930s. 
There  were  similar  groups  outside  Bavaria  before 
1925,  such  as  the  Freikorps  (Free  Corps)  and  the 
Deutscbvolkiscbe  Scbutz-  und  Trutzbund,  but 
there  is  a  lack  of  sufficient  data  for  comparison.  By 
the  early  1930s,  farmers,  blue-  and  white-collar 
employees,  and  victims  of  foreclosure,  bankruptcy, 
and  mass  unemployment  swelled  the  ranks. 

Once  in  power  and  able  to  dispense  favors,  like 
jobs  to  the  unemployed,  the  NSDAP  grew  prodi¬ 
giously.  Particularly  civil  servants  (earlier  barred 
from  joining)  and  professional  and  business  people 


in  small  towns  and  rural  areas  joined  up  so  that,  by 
1937,  its  membership  stood  at  about  2.5  million. 
But  what  had  been  dangerous  rebels  before  the 
Beer  Hall  Putsch  of  1923,  when  their  demonstra¬ 
tion  was  shot  at  by  the  police,  had  become  a  rather 
tame  collection  of  conformists.  Only  the  leader¬ 
ship,  not  the  movement  as  such,  had  taken  over  the 
state,  which  continued  to  be  run  by  its  specialized 
services.  The  Italian  PNF  experienced  similar  rapid 
growth  after  coming  into  power  with  the  bold 
March  on  Rome  in  1922  and  accumulated  well 
over  a  million  members  by  1929,  when  even  the 
Vatican  was  ready  to  conclude  a  concordat  with 
Mussolini  that  became  the  1929  Fateran  Treaty. 

Statistical  gaps  and  differences  in  national  pop¬ 
ulation  statistics  make  it  harder  to  compare  the 
social  composition  of  the  20-some  other  fascist 
movements  of  interwar  Europe.  Many  of  the 
smaller  ones  kept  no  records,  but  many  had  their 
own  historical  interpretations  of  themselves,  usu¬ 
ally  outside  the  Marxist  scheme  and  not  derived 
from  the  judgment  of  their  antagonists.  As  in  many 
revolutionary  situations,  university  students  often 
played  a  big  role  among  the  various  fascist  move¬ 
ments,  whether  they  were  the  prewar  French 
camelots  du  roi  with  their  integral  nationalism  and 
anti-Semitism,  the  anti-Russian  Finnish  Academic 
Karelian  Society  (AKS),  or  the  Moldavian  student 
disciples  of  the  anti-Semite  Alexander  Cuza  from 
whom  the  Romanian  Iron  Guard  of  Corneliu 
Codreanu  eventually  sprang  with  a  nativist  and 
frenetically  religious  fascism  found  otherwise  only 
among  the  Belgian  Rexists  and  the  Slovakian 
Hlinka  fascists. 

Marginal  Fascist  Movements 

Were  there  other  fascist  movements,  perhaps  in 
India,  Argentina,  or  the  Middle  East,  far  from  ris¬ 
ing  Marxist  labor  parties?  For  many  Balkan  or 
East  European  successor  states,  left  in  the  wake  of 
falling  empires  where  power  gravitated  toward 
old  imperial  elites,  a  Marxist  interpretation  seems 
beside  the  point.  Fighting  off  ethnic  secession 
movements  with  socialist  overtones  was  not  the 
same  as  facing  the  challenge  of  a  rising  labor 
movement.  Even  the  post-Tsarist  Whites  in  the 
Russian  civil  war  and  post-Hapsburg  Hungary  fit 
this  category:  The  old  aristocratic,  landowning 
elites  supported  the  growth  of  ex-military  and 


894  Fascist  Movements 


fascist  movements  against  the  urban  Bolsheviks  of 
St.  Petersburg  and  Budapest  and  ethnic-nationalist 
rebellions.  The  Bolsheviks  won  in  the  end  in 
Russia,  though  hardly  because  of  the  support  of 
an  industrial  working  class.  The  violence  of  the 
first  war  and  upheavals  involving  “dictatorships 
of  workers  and  soldiers”  from  Russia  and  Finland 
to  Budapest,  Vienna,  Munich,  and  Berlin  made 
for  civil  war-like  confrontations  and  attitudes: 
Revolution  and  violence  create  militants. 

But  the  farther  west  and  north  one  goes,  if  not 
always  toward  greater  industrial  development,  the 
greater  is  the  role  of  the  Marxist  image  of  capital 
versus  workers  and,  therefore,  the  significance  of 
the  social  composition  of  self-styled  fascist  move¬ 
ments.  In  the  newborn  Austrian  republic,  the 
Heimivehr  veterans  were  perhaps  more  reliable 
supporters  of  the  aristocrats  and  landowners  of  the 
old  order  than  the  German-supported  Austrian 
NSDAP  of  1934  and  1938.  Its  fight  against  the 
socialists  and  organized  workers  was  hardly  a 
defense  of  capital  that  could  be  exhaustively 
described  by  the  Marxist  formula.  In  Italy,  the 
separate  developments  in  the  north — which  was 
closer  to  the  model — and  the  Nationalists  of  the 
south  who  were  eventually  co-opted  by  Mussolini 
also  did  not  quite  add  up  to  a  victory  of  the  bour¬ 
geoisie  in  a  class  struggle  an  rebours. 

The  role  of  the  Catholic  Church  in  Southeastern 
and  Southern  European  fascism  was  nowhere  stron¬ 
ger  than  in  the  complex  fronts  of  the  Spanish  Civil 
War.  Spain  had  not  been  part  of  World  War  I,  but 
the  external  intervention  of  General  Franco’s  colo¬ 
nial  army  in  defense  of  a  mix  of  well-established 
interests  (monarchists,  landowners,  capital,  and  the 
Church)  was  decisive  in  beating  down  the  republi¬ 
can  coalition,  not  to  mention  secessionist  challenges. 
The  real  fascists,  the  Falange/JONS,  were  castrated, 
forced  into  a  merger  with  monarchists,  and  sidelined 
as  a  kind  of  popular  ideological  auxiliary  by  the  real 
winner,  the  military  dictator  Franco.  The  resulting 
Falange/Movimiento  became  another  toothless  fas¬ 
cist  faqade  not  unlike  the  Nazi  storm  troopers 
[Sturm  Abteilung,  or  SA)  after  1934. 

A  Generational  Explosion? 

These  interwar  movements  mostly  started  in  a 
grandiose  manner  and,  then,  even  though  they 
reached  amazing  numbers,  began  to  decline  in 


militant  spirit.  Could  their  age  composition  explain 
their  counterrevolutionary  power?  Their  earliest 
years,  whether  right  after  the  Great  War  or  at  the 
onset  of  the  Great  Depression,  were  often  marked 
by  the  surprising  youth  of  most  of  the  militants. 
Fascist  leaders,  though  not  particularly  young 
themselves,  tended  to  take  advantage  of  the  enthu¬ 
siasm  and  energy  of  the  young  with  slogans  like 
“Move  over,  you  old  ones!”  or  by  celebrating 
youth  in  their  songs — Giovinezza,  Giovinezza. 
They  often  pointed  to  the  generational  abyss 
between  their  own  parties  and  their  more  estab¬ 
lished  and  gerontocratic  rivals  (except  the  equally 
young  Communists). 

The  long  period  of  peaceful  prewar  develop¬ 
ment,  in  particular,  and  the  war  itself  often  led  to 
a  kind  of  sclerosis  within  progressive  movements 
and  then  an  explosion  of  generational  conflict. 
Young  and  old  clashed  over  their  goals,  which 
helped  produce  the  fatal  wartime  splits  among 
socialists  or  between  them  and  the  communists.  In 
some  battlegrounds,  such  as  Berlin  between  1928 
and  1933,  the  two  ferocious  young  armies — the 
young  Nazi  storm  troopers  and  the  Communist 
Red  Front  Fighters — were  mirror  images  of  each 
other,  both  from  proletarian  backgrounds  but  one 
nationalistic  and  the  other  socialist.  Right  at  the 
end  of  the  war,  Mussolini’s  squadristi  boasted  as 
their  antecedents  the  equally  young  arditi  shock 
troops  of  the  Italian  army,  while  the  brown-shirted 
Ordner  (bouncers)  of  the  NSDAP  rallies  claimed 
the  youthful  Sturmbatallione  (elite  battalions)  of 
the  German  army. 

The  actual  ages  at  which  would-be  fascists  were 
recruited  may  have  been  even  lower  than  that  of 
the  soldiers:  Comparative  statistics  are  limited,  but 
they  suggest  that  nearly  half  of  the  Bavarian  Nazis 
of  1923,  37%  of  the  NSDAP  of  1930,  and  one 
fourth  of  the  Austrian  NSDAP  of  1934  (the  year  of 
the  Dollfuss  Putsch)  were  under  24  years  of  age. 
One  third  of  Danish,  Norwegian,  and  British  fas¬ 
cists  of  the  1930s  were  under  26  (Merkl,  1980a, 
Tables  6-7,  pp.  768-770).  One  empirical  study, 
based  on  the  Abel  Collection  of  autobiographies  of 
early  Nazis,  emphasizes  the  large  number  of 
German  youth  groups  of  the  1920s,  many  of  a 
political  sort  from  Left  to  Right,  that  the  young 
would-be  Nazis  had  briefly  joined  before  homing 
in  on  the  storm  troopers  or  the  party  itself.  The 
youngest  to  join  militant  Nazi  organizations  were 
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barely  teenagers.  Like  the  neo-Nazi  skinheads  of 
60-some  years  later,  many  already  had  records  of 
criminal  violence  and  confrontations  with  author¬ 
ity.  The  very  young  were  also  the  most  violent  and 
the  most  likely  to  participate  soon  in  marching  in 
demonstrations  and  proselytizing  for  the  move¬ 
ment.  It  is  generally  accepted  that  what  attracted 
these  youngsters  most  was  largely  the  sheer  impe¬ 
tus  of  an  extremist  populist  movement  and  the 
constant  action  and  comradeship  and  to  a  lesser 
extent  the  ideological  content  of  what  they  were 
fighting,  marching,  and  proselytizing  for,  including 
anti-Semitism.  There  was  quite  a  difference 
between  them  and  the  relatively  less  violent  but 
also  more  racist  or  anti-Semitic  adult  party  mem¬ 
bers  who  never  joined  the  storm  troopers. 

Investigations  of  contemporary  neo-Nazi  and 
skinhead  violence  thus  reveal  major  differences 
between  them  and  their  interwar  predecessors. 
First  of  all,  there  appears  to  be  a  broad  range  of 
very  young  perpetrators  of  antiforeigner  violence, 
including  soccer  rowdies,  among  whom  real  politi¬ 
cal  affiliation  with  neo-Nazi  groups  is  rare  though 
many  wear  pins  and  insignia  that  suggest  other¬ 
wise.  They  are  part  of  today’s  very  violent  European 
youth  scene,  especially  those  15  to  20  years  old, 
and  resemble  youth  gangs  and  their  activities  all 
over  the  world  far  more  than  their  counterparts 
who  took  part  in  political  violence  of  the  interwar 
period  (see,  e.g.,  Merkl,  1995;  Willems,  1993). 
Older  skinheads  tend  to  be  more  political  and  often 
anti-Semitic  and  affiliated  with  neo-Nazi  parties 
that  vainly  struggle  to  attract  the  young  hooligans. 
There  is  little  in  common  with  the  quasi-military, 
disciplined  organizations  of  the  fascist  past  that 
aimed  at  major  political  goals  such  as  overturning 
the  peace  settlements  of  World  War  I. 
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Federalism 


Federalism  is  often  referred  to  in  current  writings 
on  politics  as  an  old  political  idea  whose  time  has 
finally  come  in  the  21st  century.  In  this  respect,  a 
distinguished  scholar  of  this  subject,  Ronald  L. 
Watts  (2008),  has  noted  “there  are  at  present 
some  25  countries  encompassing  over  40%  of  the 
world’s  population,  and  each  exhibiting  the  fun¬ 
damental  characteristics  of  a  functioning  federa¬ 
tion”  (p.  1).  And  he  points  out  that  “a  distinctive 
feature  about  this  current  popularity  of  federalism 
in  the  world  is  that  the  application  of  the  federal 
idea  has  taken  a  great  variety  of  forms”  (p.  7). 

The  initial  framing  of  “federalism”  in  its  mod¬ 
ern  manifestation,  both  as  an  idea  and  as  an  insti¬ 
tutional  form,  is  generally  attributed  to  two 
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prominent  American  Constitutional  Fathers, 
Alexander  Hamilton  and  James  Madison,  the  prin¬ 
cipal  authors  of  The  Federalist,  in  October  1787. 
In  promoting  this  concept,  they  sought  to  defend  a 
new  hybrid  regime  form  that  they  and  other  fram¬ 
ers  of  the  American  Constitution  had  proposed  at 
the  founding  Philadelphia  Convention  on  Septem¬ 
ber  17,  1787,  for  later  ratification.  It  was  intended 
to  strike  a  balance  between  the  decentralized  and 
centralized  forms  of  confederal  and  unitary 
government.  The  former  had  been  applied  to  the 
much  more  decentralized  political  union  that  had 
operated  from  1781  to  1787  in  the  13  newly  inde¬ 
pendent  colonies  of  British  North  America  under 
the  Articles  of  Confederation.  In  this  earlier  short¬ 
lived  and  highly  unstable  political  arrangement, 
the  original  constituent  units  were  essentially  in 
control  of  the  newly  formed  and  fragile  central 
government.  A  similar  pattern  of  union  had  oper¬ 
ated  in  the  German,  Dutch,  and  Swiss  Confed¬ 
erations  of  the  16th  and  17th  centuries.  Unitary 
government,  on  the  other  hand,  was  the  pattern  of 
rule  that  operated  in  most  other  states  in  which 
political  decision-making  power  and  authority 
were  vested  in  one  unit  or  center.  The  American 
Constitutional  Fathers  favored  federalism  as  an 
intermediate  form  between  centralized  and  decen¬ 
tralized  government  because  they  believed  that  by 
dividing  sovereignty  in  this  balanced  way,  they 
could  prevent  the  concentration  of  authority  and 
promote  liberty  and  political  pluralism  in  a  new 
union.  They  considered  federalism  to  be  a  vertical 
extension  of  the  concept  of  separation  of  powers 
between  the  executive,  legislature,  and  judiciary, 
which  they  had  adapted  from  the  noted  18th- 
century  French  political  philosopher,  the  Baron  de 
Montesquieu. 

Defining  Principles  and  Characteristics 

The  major  and  defining  principle  of  this  new 
regime  form,  according  to  the  British  constitutional 
thinker  Kenneth  C.  Wheare  in  his  pioneering  study 
Federal  Government  (1964),  consisted  of  a  division 
of  power  and  authority  between  two  different 
political  decision-making  units — a  central  one  exer¬ 
cising  jurisdiction  over  the  entire  nation-state  and  a 
constituent  or  regional  one  having  responsibility 
for  a  part  of  that  nation-state.  These  two  govern¬ 
mental  units  would  be  recognized  as  having  equal 


status  in  law,  so  that  neither  one  could  abolish  or 
subordinate  the  other.  Other  characteristics  of  feder¬ 
alism,  such  as  the  capacity  to  exercise  authority  and 
decision-making  power  directly  on  their  respective 
populations  and  the  right  of  each  unit  to  legislate  or 
regulate  policy  within  its  assigned  sphere  of  author¬ 
ity,  flowed  directly  from  this  principle.  The  same 
was  true  of  the  various  institutional  elements  or 
manifestations  of  this  basic  idea.  These  included  the 
institution  of  two  complementary  legislative  bod¬ 
ies — one  to  represent  the  center  on  the  basis  of 
population  and  the  other  to  represent  the  constitu¬ 
ent  units  in  terms  of  territoriality — and  the  establish¬ 
ment  of  courts  as  judicial  arbiters  in  jurisdictional 
disputes  between  the  two  levels  of  government.  Both 
levels  would  also  be  expected  to  participate  in  a 
formal  and  fundamental  way  in  any  future  constitu¬ 
tional  amendments. 

Pre-1945  Theoretical  Contributions 

This  general  concept  of  federalism  gradually  gained 
prominence  and  approval  by  some  leading  19th- 
and  early-20th-century  political  thinkers,  includ¬ 
ing  luminaries  like  Alexis  de  Tocqueville,  Albert  V. 
Dicey,  James  Bryce,  Pierre-Joseph  Proudhon,  and 
Harold  Laski  (Sobei  Mogi,  1931).  Like  the  authors 
of  The  Federalist,  they  viewed  federalism  in  both 
normative  and  descriptive  terms  largely  as  an  insti¬ 
tutional  device  designed  to  divide  sovereignty  and 
prevent  the  concentration  of  authority  and  power 
in  a  single  decision-making  center,  promote  politi¬ 
cal  pluralism,  and  maximize  liberty.  It  was 
entrenched  as  a  regime  form  in  the  19th  and  early 
20th  centuries  in  several  countries  besides  the 
United  States,  including  Switzerland,  Canada,  and 
Australia.  It  was  also  embraced  by  the  new  post¬ 
war  regimes  in  Germany  and  Austria  and,  at  least 
in  a  formal  sense,  by  a  number  of  newly  indepen¬ 
dent  Third  World  countries,  such  as  Brazil, 
Argentina,  Venezuela,  and  Mexico  in  Latin 
America;  South  Africa  and  Nigeria  in  Africa;  and 
India  and  Malaysia  in  Asia.  Still,  its  adoption,  both 
intellectually  and  institutionally,  was  limited  and 
slow.  Moreover,  it  faced  widespread  opposition 
globally  from  critics  for  what  was  perceived  to  be 
its  predisposition  to  cumbersome  and  inefficient 
governmental  decision-making  processes,  perva¬ 
sive  internal  conflict  and  division,  and  regime 
instability  and  breakdown. 
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Post-World  War  II  Theoretical  Trends 

After  World  War  II,  the  core  ideas  and  writings  on 
federalism  began  to  diverge  in  political  science  and 
to  coalesce  around  two  distinct  traditions,  which 
are  generally  designated  as  Anglo-American  and 
continental  European.  The  former  (the  Anglo- 
American  tradition)  encouraged  the  formulation 
of  new  analytical  approaches  to  federalism,  which 
paralleled  conceptual  and  paradigmatic  changes  in 
the  discipline  as  a  whole.  The  proponents  of  these 
new  approaches  to  federalism,  like  the  advocates 
of  the  reform-oriented  behavioral  movement  in 
political  science  during  this  period,  generally 
embraced  positivist,  empirically  oriented,  system¬ 
atic,  and  individualistic  intellectual  perspectives  in 
the  study  of  politics.  They  also  sought  to  minimize 
and  downplay  the  significance  of  values  in  their 
analyses. 

The  Postwar  Anglo-American  Tradition 

An  early  and  seminal  contribution  to  this  post¬ 
war  tradition  was  made  by  Wheare  (1964),  who 
defined  federalism  both  constitutionally  and  in 
governmental  practice  as  that  system  of  govern¬ 
ment  in  which  authority  is  divided  between 
national  and  regional  governments,  so  that  each, 
within  a  sphere,  is  coordinate  (i.e.,  co-equal)  and 
independent.  For  Wheare,  a  principal  focus  of 
analysis  for  identifying  the  degree  of  federalism 
that  existed  in  the  constitutions  and  governmental 
practices  of  nation-states  was  the  division  of  legis¬ 
lative  powers  between  its  levels  of  government. 
Wheare  applied  this  analytical  device  to  a  com¬ 
parison  of  the  institutions  and  working  practices 
of  four  “mature  federations”  ofthat  time:  Australia, 
Canada,  Switzerland,  and  the  United  States.  His 
approach  was  rapidly  and  widely  accepted  as  an 
appropriate  template  for  identifying  and  analyzing 
federalism. 

But  as  occurred  in  post-World  War  II  political 
science  in  general,  criticism  by  federalism  special¬ 
ists  began  to  be  directed  at  what  was  seen  as  the 
excessive  formal-legalism  of  Wheare’s  approach. 
Their  critique  encouraged  other  Anglo-American 
scholars  to  redefine  federalism  in  different  terms 
and  contribute  to  its  further  theoretical  develop¬ 
ment.  They  included  William  S.  Livingston,  who 
viewed  federalism  from  a  sociological  standpoint 
as  “a  device  by  which  the  federal  qualities  of  a 


society  are  articulated  and  protected”  (Livingston, 
1952/1971,  p.  22);  William  Riker  (1964),  who 
defined  federalism  in  political  and  rational  choice 
terms  as  a  “bargain”;  Carl  Friedrich  (1968),  who 
envisaged  federalism  as  an  ongoing  “process”;  and 
Daniel  Elazar  (1962),  who  first  analyzed  federal¬ 
ism  as  a  cooperative  “partnership”  between  the 
two  major  levels  of  government.  Later,  in  conjunc¬ 
tion  with  Vincent  Ostrom  (1970),  he  redefined  the 
concept  of  partnership  as  a  “covenant”  or  legal 
and  moral  contract  between  these  governmental 
partners  (Elazar,  1987). 

There  were  also  some  strong  criticisms  directed 
at  these  new  postwar  approaches  to  federalism  by 
William  Riker  and  S.  Rufus  Davis.  Riker  ques¬ 
tioned  whether  federalism  as  a  regime  form  even 
existed  and,  if  it  did,  whether  it  mattered  at  all 
(Riker,  1969).  Davis  attacked  the  post-World  War 
II  predilection  for  redefining  and  applying  differ¬ 
ent  analytical  constructs  of  federalism  as  an  artifi¬ 
cial  doctoring  of  what  he  considered  to  be  an 
ambiguous  and  value-laden  concept  consisting  of 
many  different,  noncomparable  concrete  forms  in 
different  societal  settings  (Davis,  1978).  After  this 
rather  fertile  period  in  the  theoretical  evolution  of 
federalism  from  the  late  1940s  to  the  1970s,  there 
was  a  dearth  of  innovative  and  influential  Anglo- 
American  contributions  to  federal  theory. 

The  Postwar  Continental  European  Tradition 

Early  Mentors 

The  proponents  of  the  postwar  continental 
European  tradition  of  federalism  generally  looked 
to  different  mentors  or  guides  for  their  intellectual 
inspiration  (Michael  Burgess,  2000).  For  many  of 
them,  the  originators  and  intellectual  founders  of 
modern  federalism  were  not  the  principal  authors 
of  The  Federalist.  Rather,  they  considered  them¬ 
selves  to  be  the  theoretical  disciples  of  Johannes 
Althusius,  a  German,  Protestant  (Reformed)  local 
councilor  from  the  city  of  Endic,  who  remained  a 
relatively  obscure  social  and  political  thinker  during 
his  lifetime  and  for  the  ensuing  two  and  a  half  cen¬ 
turies.  Even  though  he  lived  and  wrote  almost  200 
years  prior  to  the  drafting  of  the  American  constitu¬ 
tion,  Althusius’s  ideas  are  often  regarded  today  as  a 
bridge  between  premodern  and  postmodern 
Western  political  thought.  More  precisely,  they  are 
viewed  as  a  precursor  of  the  pluralist  critique  of 
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statist  and  sovereigntist  ideas  that  have  dominated 
Western  thinking  since  the  establishment  of  the 
Westphalian  state  system  in  the  mid-  17th  century. 
He  viewed  federalism  in  broad  terms  as  a  manifes¬ 
tation  of  plural  governance,  popular  consent,  and 
a  normative  commitment  to  social  solidarity.  His 
concept  of  federalism  was  multilevel  and  societal 
rather  than  two-tiered  and  narrowly  political.  It 
was  derived  from  his  understanding  of  politics  as  a 
“consociation”  or  social  network  of  generically 
equivalent  associations  linking  family  and  kinship 
groups  to  social  and  economic  guilds  or  colleges, 
cities,  rural  districts  and  provinces,  and  ultimately 
a  universal  commonwealth  (Thomas  Hueglin, 
1999,  pp.  114-116). 

Another  early  inspiration  for  this  postwar  conti¬ 
nental  tradition  of  federalism  can  be  found  in  Du 
principe  federatif  (The  Federal  Principle)  published 
in  1863,  one  of  the  last  works  of  the  influential 
19th-century  anarcho-socialist  French  philosopher 
Pierre-Joseph  Proudhon.  Like  Althusius,  he  ad¬ 
vanced  a  notion  of  federalism  embodying  an  organic 
view  of  state  and  society,  in  which  emphasis  is 
placed  on  the  whole  person  in  social  interaction 
with  others,  rather  than  on  the  more  socially  iso¬ 
lated  individual.  He  likewise  incorporated  into  his 
federal  idea  the  medieval  concepts  of  corporatism 
(i.e.,  representation  of  functional  associations  in 
public  policy  making)  and  subsidiarity  (i.e.,  invest¬ 
ing  decision-making  authority  in  a  governing  unit 
closest  and  most  attuned  to  the  needs  of  the  mass  of 
the  population,  wherever  feasible).  Also,  he  espoused 
a  normative  and  secular  rather  than  religious  con¬ 
ception  of  federalism. 

Proudhon’s  federalist  thought  also  resembled 
that  of  Althusius  in  that  he  viewed  federalism  as  a 
series  of  agreements  struck  by  the  associational  rep¬ 
resentatives  of  socially  engaged  individuals.  These 
associations  were  likewise  multitiered,  linking  the 
smallest  and  most  primary  group,  the  family,  to  the 
largest  and  most  remote  of  secondary  associations, 
the  state.  But  unlike  in  Althusius’s  vision,  his  corpo¬ 
ratism  embodied  the  free  association  of  economic 
producers  and  workers  interacting  in  what  he  con¬ 
sidered  to  be  the  socialist  ideal  of  a  “mutualist” 
society.  In  this  type  of  socioeconomic  order,  goods 
are  exchanged  between  individuals  on  the  basis  of  a 
“just  value”  defined  in  terms  of  labor  input. 

Both  of  these  earlier  philosophers  had  a  signifi¬ 
cant  impact  on  post-World  War  II  continental 


European  intellectuals  and  political  activists  who 
advocated  the  formation  of  some  type  of  loose 
supranational  federal  structure  for  Europe.  This 
was  also  the  case  for  some  leading  social  Catholic 
thinkers  of  late- 19th-  and  early-20th-century  Eur¬ 
ope,  who  derived  many  of  their  ideas  from  the 
papal  encyclicals  Rerum  Novarum  (1891)  and 
Quadragesimo  Anno  (1931).  While  generally 
omitting  any  direct  reference  to  the  concept  of  fed¬ 
eralism  itself,  they  helped  popularize  some  of  the 
central  concepts  previously  identified  with  the 
early  antecedents  of  continental  European  federal 
thought.  These  included  an  organic  view  of  state 
and  society  based  on  corporatism  and  subsidiarity, 
an  emphasis  on  the  socially  interactive  person 
rather  than  the  atomized  individual,  and  the  fos¬ 
tering  of  a  normative  view  of  politics  based  on 
ethical-religious  principles. 

Postwar  Manifestations 

There  have  been  two  distinct  post-World  War 
II  manifestations  of  the  continental  European  fed¬ 
eral  tradition.  The  first  was  that  of  the  integral 
federalists,  a  mixture  of  institutionally  unattached 
intellectuals  and  political  activists,  who  advocated 
a  so-called  maximalist  revolutionary  concept  of 
federalism  for  postwar  Europe.  The  second  is  a 
more  mainstream  and  traditional  intellectual  group 
made  up  primarily  of  university-affiliated  social 
and  political  scientists,  who  promote  a  more  flex¬ 
ible,  inclusive,  and  looser  form  of  federalism  for 
the  postwar  European  Union  (EU),  which  is  widely 
referred  to  as  multilevel  governance  (MLG). 

Integral  federalism  was  initially  established  in 
France  in  the  1930s  by  a  group  of  continental 
European  thinkers  who  called  for  a  pan-European 
federalism  based  on  Proudhonian  principles,  includ¬ 
ing  antinationalism,  decentralization,  and  mutual¬ 
ism,  designed  to  combat  the  growing  conflict 
between  France  and  Germany  at  that  time.  However, 
it  was  forced  to  dissolve  in  1939  in  the  face  of  the 
looming  global  conflict.  Its  original  members 
regrouped  and  formed  a  more  broadly  based  intel¬ 
lectual  and  political  group  in  1944  that  included 
some  prominent  social  Catholic  thinkers  who  were 
in  the  vanguard  of  the  movement  for  a  pan-EU  in 
the  post-World  War  II  period.  Their  ideas  included 
reorganization  of  European  society  from  the  ground 
up  according  to  organic  and  corporatist  principles, 
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transnational  integration  of  different  branches  of 
the  economy  as  a  precondition  for  the  creation  of 
supranational  economic  structures,  and  the  estab¬ 
lishment  of  new  political  structures  that  involved 
regional  decentralization,  devolution  of  power  to 
functional  economic  agencies,  and  the  creation  of 
a  constituent  assembly  to  fashion  a  new  pan-Euro¬ 
pean  legal-political  order  along  federalist  lines. 
Although  the  integral  federalists  attempted  to  give 
these  ideas  more  concrete  meaning  by  applying 
them  to  the  particular  conditions  of  postwar 
Europe,  most  advocates  of  a  supranational 
European  community  rejected  their  views  as  too 
radical  and  utopian  and  as  lacking  in  an  accompa¬ 
nying  strategy  for  their  practical  realization.  As  a 
result,  the  direct  influence  of  the  integral  federal¬ 
ists  on  the  evolution  of  the  European  Community 
began  to  wane  by  the  late  1940s.  Since  then,  the 
handful  of  remaining  members  have  focused  more 
on  political  education  than  on  practical  politics 
(Lutz  Roemheld,  1990). 

Those  who  are  committed  to  applying  the  con¬ 
cept  of  MLG  to  the  political  analysis  of  the  EU 
today  constitute  the  most  recent  and  probably  the 
most  influential  global  proponents  of  new  federal 
theoretical  ideas.  They  comprise,  primarily,  uni¬ 
versity-based  social  and  political  analysts.  They 
first  began  to  promote  the  use  of  the  concept  of 
MLG  rather  than  federalism  to  describe  the  EU  in 
response  to  what  they  considered  to  be  the  unique 
character  of  the  new  supranational  governance 
form  that  had  emerged  after  the  passage  of  the 
Single  European  Act  in  1985  and  the  negotiation 
and  implementation  of  the  Maastricht  Treaty  in 
the  early  1990s.  Their  writings  on  MLG  have  been 
numerous  and  wide-ranging,  and  their  definitions 
of  the  concept,  although  manifesting  some  com¬ 
mon  or  shared  characteristics,  are  highly  diver¬ 
gent.  They  have  sought  to  describe  and  analyze  the 
new  multilayered  European  political  entity  consist¬ 
ing  of  overlapping  jurisdictions  by  first  broadening 
the  idea  of  federalism  to  encompass  more  than  two 
levels  of  government  and  more  than  two  autono¬ 
mous  policy-making  structures;  they  now  include 
both  regional/supranational  and  local  government 
tiers.  Their  concepts  have  other  common  strands — 
most  notably,  accepting  or  even  promoting  the 
participation  of  nonstate  actors,  such  as  nongov¬ 
ernmental  organizations  (NGOs),  corporations, 
and  unions,  in  intergovernmental  decision  making; 


observing  and  applauding  the  proliferation  of 
overlapping  decision-making  networks  engaged  in 
these  functions;  pointing  to  the  change  in  the  style 
of  state  decision  making  from  that  of  command 
and  control  to  steering,  coordination,  and  net¬ 
working;  and  exposing  new  and  difficult  chal¬ 
lenges  facing  MLG  structures,  such  as  how  to 
assign  governing  responsibility  and  how  to  foster 
democratic  accountability  (Ian  Bache  &  Matthew 
Flinders,  2004). 

The  concept  of  MLG  has  since  been  extended 
by  its  proponents  to  other  areas  of  systematic 
political  analysis  besides  the  decision-making 
structures  and  functions  of  the  EU,  among  them 
regionalism  and  urban  governance.  It  has  also 
gained  followers  in  other  continents  and  geo¬ 
graphic  areas,  including  North  America,  South 
America,  Asia,  and  Africa,  and  it  is  increasingly 
applied  as  a  tool  of  political  analysis,  largely  due 
to  its  flexibility  and  adaptability  to  the  existing  but 
rapidly  changing  global  economic  and  political 
conditions. 

However,  its  adherents  have  also  been  subjected 
to  a  broad  range  of  criticisms,  including  that  of 
being  too  descriptive  and  insufficiently  predictive; 
exaggerating  the  importance  of  subnational  actors 
in  decision  making;  neglecting  the  implementation 
and  outcome  stage  of  policy  making;  overempha¬ 
sizing  what  they  describe  as  the  postconstitutional 
and  extraconstitutional  nature  of  the  patterns  of 
governance  that  they  try  to  encompass;  giving  pri¬ 
ority  to  accommodation,  consensus,  and  efficiency 
in  governance  over  the  core  values  of  democratic 
government;  and  above  all,  acceding  too  readily  to 
definitional  imprecision  and  conceptual  stretching 
by  applying  MLG  indiscriminately  “to  any  com¬ 
plex  and  multifaceted  political  process”  (Bache, 
1998,  pp.  153-154). 

Since  the  mid-1990s,  there  has  been  an  increas¬ 
ing  blurring  of  the  distinction  between  the  con¬ 
cepts  of  federalism  and  MLG,  again  primarily  as  a 
result  of  the  growing  complexity  in  patterns  of 
intergovernmental  relations  in  federal  systems 
emanating  from  the  forces  of  globalization.  The 
historical  and  analytical  relationship  between  these 
two  concepts  tends  to  operate  in  a  multidirec¬ 
tional,  interactive,  and  interdependent  manner,  so 
that  their  respective  theoretical  developments  are 
mutually  intertwined  and  overlapping  in  both  con¬ 
tent  and  direction.  Thus,  MLG  may  be  regarded  as 
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a  useful  addition  to  the  conceptual  toolbox  of 
scholars  of  federalism. 

Future  Theoretical  Directions 
and  Applications 

If  the  concept  of  federalism  continues  to  evolve  in 
this  flexible  manner  in  the  face  of  intensifying  glo¬ 
balization,  it  is  likely  that  it  will  be  increasingly 
embraced  in  positive  terms  by  both  political  ana¬ 
lysts  and  practitioners  across  the  globe.  This  is 
particularly  the  case  if  the  two  post-World  War  II 
theoretical  traditions  of  federalism — the  Anglo- 
American  and  Continental  European — persist  in 
their  current  patterns  of  mutual  interaction,  influ¬ 
ence,  and  convergence.  And  it  is  also  likely  to 
manifest  itself  in  future  political  studies  that  draw 
on  either  or  both  of  the  prevailing  definitions  of 
federalism  and  MLG. 

Whether  the  same  pattern  of  convergence  will 
develop  in  future  institutional  applications  of  these 
two  concepts  is  more  difficult  to  predict.  A  few 
developed  and  developing  countries,  such  as  Spain, 
South  Africa,  Russia,  Ethiopia,  and  Nigeria,  are 
currently  experimenting  with  or  are  still  consider¬ 
ing  using  some  type  of  federal  institutional  form 
for  their  regimes.  Others,  such  as  Yugoslavia, 
Czechoslovakia,  the  Sudan,  Iraq,  and  Sri  Lanka, 
have  previously  embraced  or  have  seriously  con¬ 
templated  adopting  federal  structures  and  found 
them  to  be  inapplicable  to  conditions  in  their 
countries.  Therefore,  it  is  uncertain  whether  the 
recent  global  trend  toward  expanding  federalism 
as  an  institutional  device  in  the  21st  century  will 
persist.  But  there  is  little  doubt  that  this  concept,  in 
one  form  or  another,  will  continue  to  influence  in 
significant  ways  the  political  thought  and  analysis 
of  future  generations  of  political  scientists. 

Michael  B.  Stein 
University  of  Toronto 
Toronto,  Ontario,  Canada 
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Feminism 


The  20th  century  was  a  momentous  one  for  femi¬ 
nism  and  politics.  Beginning  with  the  spread  of 
women’s  voting  rights,  the  political  participation 
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and  engagement  of  women  expanded  during  the 
decades  that  followed.  Links  were  forged  between 
women’s  workforce  participation  and  their  active 
involvement  in  public  affairs.  Women’s  civic  asso- 
ciational  activity,  often  voluntary,  brought  the 
public  and  private  worlds  of  citizenship  together. 
Women’s  liberation  movements  reached  across 
age,  class,  and  racial  barriers  to  forge  new  policy 
demands.  Gender  inequalities  in  representation, 
too,  mobilized  coalitions  of  women  determined  to 
open  up  formal  politics.  Women  played  active 
roles  in  the  democracy-seeking  political  move¬ 
ments  of  the  latter  part  of  the  century.  Although 
the  adoption  of  feminist  insights  for  making  sense 
of  the  world  of  politics  was  relatively  slow  in  the 
field  of  political  studies,  gender  and  politics  schol¬ 
arship  is  now  a  coherent  subfield  within  the  wider 
discipline.  Over  the  past  few  years,  this  body  of 
scholarly  research  has  significantly  evolved.  New 
research  questions,  concepts,  theories,  and  meth¬ 
ods  have  emerged  in  response  to  new  developments 
in  the  feminist  struggle  for  women’s  empower¬ 
ment,  together  with  important  developments  in  the 
social  sciences  and  in  the  wider  context  of  world 
politics.  Against  this  background,  this  entry  dwells 
on  the  legacy  of  the  20th  century  for  women’s 
advancement  in  politics,  maps  out  the  main  meth¬ 
odological  approaches  used  by  feminist  political 
scientists,  and  outlines  the  main  thematic  areas  in 
the  study  of  politics  and  gender. 

Women  and  Politics:  The  Legacy 
of  the  20th  Century 

In  1906,  Finland  became  the  first  country  in  the 
world  to  give  women  the  right  to  vote  and  to  stand 
for  election.  One  year  later,  19  women  were  sent  to 
the  Finnish  parliament.  This  is  a  momentous  land¬ 
mark  in  the  story  of  women’s  participation  in  public 
affairs  since  it  was  the  first  time  in  history  when 
involvement  in  democratic  politics  became  a  right 
for  all  women  and  not  the  privilege  of  some.  In  addi¬ 
tion,  this  also  represents  a  first  milestone  in  a  cen¬ 
tury  of  unprecedented  activism  on  gender  equality. 

The  right  to  vote  and  be  elected  to  parliament 
was  an  agenda  around  which  the  first  great  wave 
of  women’s  mobilization  rallied — a  movement 
that  soon  became  a  global  one  as  it  spread  rapidly, 
with  mass  demonstrations,  rallies,  and  marches. 
Winning  the  vote  created  a  demand  for  women  to 


become  political  party  activists,  speakers  at  public 
meetings,  and  candidates,  though  their  latter  role 
was  largely  tokenist  at  the  time.  It  realized,  in  some 
measure,  the  dreams  of  18th-  and  19th-century 
feminist  political  philosophers  and  women’s  rights 
advocates  such  as  Olympe  de  Gouge,  Mary 
Woolstonecraft,  and  Harriet  Taylor  Mill.  However, 
not  all  countries  accorded  women  the  right  to  vote 
at  the  turn  of  the  20th  century  as  in  many  of  them, 
women  did  not  have  access  to  this  political  right 
until  well  into  the  middle  of  the  century.  Thus, 
Spain  accorded  women  voting  rights  in  1931, 
France  in  1944,  and  Italy  in  1946.  Indonesian 
women  won  the  vote  in  1945,  while  Peruvian 
women  followed  a  decade  later  in  1 955 .  Switzerland 
finally  extended  the  franchise  to  women  in  1971, 
while  the  Kuwaiti  parliament  passed  a  law  enfran¬ 
chising  women  in  2005. 

As  the  century  moved  on,  women’s  sense  of  civic 
identity  became  linked  with  their  ambitions  for 
personal  fulfillment.  The  years  of  World  War  II 
were  defining  in  this  regard  as  the  armaments 
industry,  and  the  various  services  supporting  the 
war  effort  drew  increasing  numbers  of  women  into 
the  workplace.  War-induced  employment,  pre¬ 
sented  as  a  patriotic  act,  provided  low-paid  women 
already  in  work  with  better  jobs  and  conditions 
and  brought  married  women  into  the  workforce 
(although  most  women  of  working  age  remained  in 
the  home).  After  the  war,  working  women  either 
returned  to  the  home  or  were  squeezed  back  into 
lower  paid  positions.  Women’s  status  declined,  the 
gender  wage  gap  increased,  and  as  Wilma  Rule 
noted  (1974,  p.  1715)  “many  left  [employmentj 
voluntarily  in  the  ideological  milieu  of  a  new  famil- 
ialism  buttressed  by  a  popularized  Freudianism 
which  portrayed  the  non-homebound  woman  as 
psychologically  afflicted.”  The  fact  that  women  in 
the  1950s  were  expected  by  society  to  marry  and  be 
full-time  homemakers  means  that  they  lost  out  on 
the  orientation  to  civic  and  public  affairs  that  the 
world  of  employment  tends  to  offer,  although  it  did 
not  mean  that  they  were  not  civically  engaged.  On 
the  contrary,  women’s  associational  life  during  the 
1950s  was,  by  all  accounts,  active  and  vibrant,  as 
hundreds  of  thousands  of  home-based  women 
joined  locally  based  voluntary  women’s  organiza¬ 
tions  in  Britain  and  the  United  States.  Yet  because 
these  organizations  focused  on  church  and  charity- 
related  volunteering,  improving  homemaking  skills, 
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civic  engagement,  and  education,  associational 
activity  was  engaged  in  for  its  own  sake,  not  as  a 
political  enterprise,  although  some  scholars  have 
suggested  that  by  encouraging  the  citizen  house¬ 
wife,  these  associational  groups  contributed  to  the 
blurring  of  the  distinction  between  the  public  and 
the  private  worlds  of  women. 

By  the  1960s,  women’s  membership  in  these 
associations  had  suffered  a  severe  decline.  The 
decade  of  housewives  was  followed  by  the  decade 
of  women’s  liberationists.  Many  of  these  wom¬ 
en’s  associations  became  redefined  as  locally 
based  civic  bodies  with  national  and  international 
advocacy  roles  coordinated  by  a  professionalized 
leadership.  This  is  a  decade  that  ushered  in  a  new 
phase  of  economic  and  social  development  in 
industrialized  societies,  which  in  turn  had  an 
impact  on  women’s  social  roles.  Women’s  educa¬ 
tional  attainment,  an  important  contributor  to 
social  and  civic  empowerment,  had  risen  and,  in 
the  United  States  and  Europe,  began  to  outstrip 
that  of  men.  Employment  opportunities,  a  tech¬ 
nological  revolution  that  drove  a  new  phase  in 
consumerism  and  communications,  and  the 
expansion  of  middle-class  aspirations  began  to 
chip  away  at  traditional  gender  role  contracts  in 
both  public  and  private  spheres.  The  antiwar,  civil 
rights,  and  student  protest  movements  of  the 
1960s  in  advanced  democracies  provided  the 
impulse  for  the  second  wave  of  feminism.  The 
popularizing  of  authors  such  as  Betty  Friedan, 
Simone  de  Beauvoir,  and  Germaine  Greer  through 
the  new  media  of  television  brought  the  issue  of 
women’s  oppression  into  homes  across  industrial¬ 
ized  postwar  democracies.  At  around  the  same 
time,  legislation  in  the  United  Kingdom  and  the 
United  States  had  begun  to  tackle  pay  and  other 
discriminations  against  women  in  the  workplace. 
Government  reports  in  many  countries  docu¬ 
mented  women’s  disadvantaged  legal  and  social 
status  and  made  recommendations  for  reform. 
Women’s  liberation  movements  in  Britain,  the 
United  States,  and  other  countries,  engaged  in  a 
period  of  intense  activity.  Regional  and  national 
meetings  that  attracted  hundreds  of  women,  local 
consciousness-raising  groups,  marches  on  parlia¬ 
ment,  and  other  high-visibility  protests  character¬ 
ized  the  movement.  Connections  were  made  across 
class  and  race  lines,  and  for  a  brief  time,  all 
women  were  united  in  their  commitment  to  end 


discrimination.  The  sisterly  solidarity  fragmented 
shortly  afterward,  with  class,  race,  and  other 
deep-seated  social  and  political  divisions  reassert¬ 
ing  themselves.  The  lasting  legacy  of  the  women’s 
movement,  though,  was  to  change  social  attitudes 
toward  women’s  role  and  status.  It  had  a  long¬ 
term  effect  on  public  policy,  on  sociocultural 
norms,  and  on  women’s  aspirations.  What  was 
just  as  important  was  that  it  mobilized  and 
engaged  a  generation  of  women  and  affected  the 
attitudes  of  a  generation  of  men. 

Following  the  women’s  liberation  movement  of 
the  1960s  and  1970s,  the  last  part  of  the  century 
saw  the  global  diffusion  of  gender  equality  norms. 
This  was  supported  by  the  United  Nations  (UN) 
Commission  on  the  Status  of  Women,  which  in 
turn  is  dedicated  to  gender  equality  and  the 
advancement  of  women.  Responding  to  the 
momentum  generated  by  the  women’s  movement 
in  the  1970s,  the  General  Assembly  declared  1975 
as  International  Women’s  Year  and  organized  the 
first  World  Conference  on  Women  in  Mexico  City, 
followed  by  others  in  Copenhagen  (1980),  Nairobi 
(1985),  and  Beijing  (1995).  In  1979,  the  UN 
adopted  the  Convention  on  the  Elimination  of 
Discrimination  Against  Women,  seen  as  the  inter¬ 
national  bill  of  women’s  human  rights.  In  1995,  it 
adopted  the  Beijing  commitments  and  Platform  for 
Action  to  secure  gender  equality  and  peace,  and  in 
2000,  it  adopted  Resolution  1325  on  women, 
peace,  and  security,  calling  for  women’s  inclusion 
in  peace-building  processes.  In  2009,  it  added  to 
this  resolution  by  explicitly  mandating  peacekeep¬ 
ing  forces  to  protect  women  and  children  from 
sexual  violence  during  armed  conflict. 

None  of  these  policies  and  events  would  have 
occurred  in  the  absence  of  a  concerted,  focused, 
transnational  impetus  from  women.  Transnational 
activism  was  the  chief  characteristic  of  women’s 
movements  of  the  end  of  the  century.  It  linked 
local  issues  with  global  concerns,  built  connections 
across  borders,  and  created  dialogue  between 
feminists  in  the  Global  North  and  the  Global 
South.  It  began  in  the  mid-1980s,  as  preparations 
for  the  Nairobi  World  Conference  on  Women  got 
under  way,  although  it  took  some  time  before 
women  from  the  Global  North  and  Global  South 
could  agree  on  a  common  agenda.  Two  paradigm 
shifts  in  the  economic  and  cultural  world  order 
prompted  consensus  building  among  First  World 
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and  Third  World  women:  the  assertion  of  transna¬ 
tional  neo-liberal  markets  dependent  on  cheap 
labor  (often  provided  by  women)  and  the  rise  of 
Islamic  fundamentalism.  Both  were  seen  as  global 
threats  to  women’s  equality  and  empowerment.  A 
more  united  transnational  feminist  activism  quickly 
proliferated,  particularly  after  the  Beijing  confer¬ 
ence.  In  parallel  with  this  transnational  civic  and 
political  engagement  of  women,  awareness  of  the 
absence  of  women  from  elite  decision  making  grew. 
The  importance  of  their  presence  in  this  arena  for 
bringing  about  gender  equality  and  for  representing 
women’s  concerns  resulted  in  the  widespread  adop¬ 
tion  of  gender  quotas  in  political  and  public  life.  By 
2000,  many  European  and  Latin  American  coun¬ 
tries  had  adopted  some  form  of  gender  quota, 
along  with  a  range  of  countries  in  Asia,  largely  due 
to  women’s  engagement  with  party  and  political 
systems.  However,  even  currently,  women’s  equal¬ 
ity  with  men  in  political  decision  making  has  some 
considerable  way  to  go.  Their  representation  stood 
at  18%  worldwide  at  the  end  of  the  century. 

Feminism  in  Political  Science: 

Perspectives  and  Methods 

Despite  the  important  achievements  in  women’s 
political  empowerment  that  took  place  over  the 
course  of  the  20th  century,  the  incorporation  of 
feminist  perspectives  into  the  discipline  of  political 
science  was  a  much  slower  process.  Thus,  it  was  not 
until  the  1980s  that  research  on  gender  and  politics 
took  off  and  not  until  the  1990s  that  it  consolidated 
itself  as  a  subfield  within  the  wider  discipline  of 
politics — with  dedicated  journals,  research  net¬ 
works,  projects  and  events,  specialized  research 
centers  and  institutions,  postgraduate  programs, 
and  so  on.  However,  to  claim  that  gender  and  poli¬ 
tics  has  become  a  coherent  subfield  does  not  mean 
that  it  is  a  uniform  subfield — quite  the  opposite,  in 
fact,  as  we  can  find  a  diversity  of  research  agendas 
that  are  informed  by  manifold  theoretical  perspec¬ 
tives  and  methods  and  that  are  as  diverse  as  the 
feminisms  on  which  they  draw.  This  section  pro¬ 
vides  a  summary  overview  of  the  variety  of  feminist 
perspectives  underpinning  most  of  the  gender  and 
politics  research  that  is  being  conducted  today. 

Studies  on  gender  and  politics  are  framed  within 
one  or  more  of  the  main  feminist  perspectives  that 
have  been  developed  up  to  the  present.  These  are 


liberal,  radical,  poststructuralist,  and  postcolonial 
feminisms.  Categorizing  studies  into  one  or  another 
perspective  is  not  always  straightforward  since 
these  perspectives  are  used  to  address  a  variety  of 
questions  specifically  concerned  with  gender  and 
politics;  many  opt  to  develop  a  modified  version  of 
a  particular  one  while  incorporating  valuable 
insights  from  the  others. 

Liberal  Feminism 

The  first  perspective  is  liberal  feminism.  Research 
on  gender  and  politics  informed  by  this  perspective 
is  characterized  by  a  number  of  features.  The  first 
one  is  that  the  structures  of  the  liberal  state  are 
assumed  rather  than  problematized.  In  other  words, 
the  problem  is  not  the  structures  of  the  liberal  state 
but  the  lack  of  women  and  women’s  representatives 
within  its  structures.  Therefore,  for  gender  and 
politics  scholars  influenced  by  liberal  feminism,  the 
task  of  feminizing  politics  consists  in  bringing  more 
women  into  the  political  realm.  It  is  a  reformist, 
rather  than  a  transformative,  agenda.  The  second 
feature  of  gender  and  politics  research  influence  by 
this  perspective  is  its  acceptance  of  mainstream 
understandings  of  the  political  as  the  world  of  gov¬ 
ernment  and  public  affairs.  Consequently,  their 
subject  matter  is  constructed  on  a  strict  distinction 
between  the  public  and  the  private  realms,  where 
the  latter  is  considered  to  fall  outside  the  scope  of 
research  and,  therefore,  to  be  irrelevant  to  the  cen¬ 
tral  research  questions.  The  third  feature  of  this 
research  relates  to  the  methodologies  adopted,  as  it 
operates  with  a  sharp  distinction  between  fact  and 
value  statements,  a  strong  reliance  on  positivist  epis¬ 
temologies,  and  a  favoring  of  quantitative  methods. 
Finally,  the  fourth  feature  of  this  research  is  that  it 
is,  to  a  large  extent,  focused  on  individuals  and 
agency  for  understanding  our  social  reality  (e.g., 
lack  of  individual  women  in  parliaments)  and  as  a 
main  source  of  social  change  (e.g.,  women  political 
leaders,  women’s  and  feminist  interest  groups). 

Radical  Feminism 

The  second  perspective  is  radical  feminism.  A 
key  feature  of  gender  and  politics  research  influ¬ 
enced  by  this  perspective  is  its  questioning  of  the 
public-private  split  that  has  traditionally  defined 
the  feminist  study  of  political  science.  The  blurring 
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of  the  boundaries  between  the  public  and  the  pri¬ 
vate  derives  from  one  of  the  central  premises  of 
radical  feminism — namely,  the  idea  that  women 
have  been  traditionally  oppressed  by  a  power  sys¬ 
tem  of  male  supremacy  (patriarchy)  that  permeates 
all  social  structures  and  relations.  This  male  hege¬ 
mony  not  only  applies  in  public  institutions  but 
also  in  those  areas  belonging  to  what  has  been 
traditionally  understood  as  the  private  sphere  (i.e., 
the  body,  sexuality  and  reproduction,  nurturing 
and  intimacy,  the  emotions,  etc.).  Consequently, 
according  to  this  perspective,  there  is  nothing  in  the 
entire  social  system  that  is  exempted  from  becom¬ 
ing  an  object  of  political  struggle.  In  contrast  to 
research  influenced  by  liberal  feminism,  gender  and 
politics  scholars  influenced  by  radical  feminism  do 
not  see  the  project  of  bringing  more  women  into 
state  structures  (legislatures,  executives,  and  judi¬ 
ciaries)  as  emancipatory  unless  it  is  accompanied 
by  other  changes  that  imply  a  deeper  transforma¬ 
tion  of  those  existing  structures  themselves.  Another 
characteristic  feature  of  this  research  is  a  shift  of 
attention  from  the  concept  of  sex  to  that  of  gender. 
Thus,  instead  of  focusing  on  individual  women  (or 
the  lack  of  them)  in  political  institutions,  this 
research  is  mainly  interested  in  the  gendered  nature 
of  social  structures,  including  formal  political  insti¬ 
tutions.  Finally,  in  relation  to  the  methodologies 
adopted  by  this  kind  of  research,  it  should  be  noted 
that  although  scholars  influenced  by  the  tenets  of 
radical  feminism  may  be  suspicious  of  positivism, 
no  specific  feminist  methodology  has  been  devel¬ 
oped  for  the  study  of  the  political  world.  Instead, 
these  researchers  opt  for  a  diversity  of  qualitative 
methods  such  as  discourse  analysis,  ethnographic 
analysis,  and  case  studies. 

Poststructuralism 

The  third  feminist  perspective  that  has  influ¬ 
enced  gender  and  politics  scholars  is  poststructural¬ 
ism.  A  distinctive  feature  of  this  strand  of  research 
is  the  idea  that  traditional  concepts  in  this  field  of 
research — such  as  gender,  representation,  or  the 
state — are  discursive  constructions  rather  than 
objectively  or  structurally  given  realities.  Using  this 
perspective,  the  main  focus  of  the  gender  and  poli¬ 
tics  researcher  is  not  on  individual  women  and 
their  lack  of  political  representation  and  participa¬ 
tion  or  on  social  structures  and  the  gender-power 


relations  embedded  in  formal  and  informal  politi¬ 
cal  institutions  but,  rather,  on  the  discourses  and 
practices  that  construct  those  concepts.  A  second 
feature  of  this  research  is  that  it  operates  with  an 
idea  of  power  understood  as  a  relational,  rather 
than  a  static,  concept.  Hence,  in  contrast  to  an  idea 
of  power  as  something  that  individual  men  or 
patriarchal  structures  own  and  try  to  retain,  from 
a  feminist  poststructuralist  perspective,  power  is 
understood  as  an  unstable  and  ever-shifting  expres¬ 
sion  of  relations  in  all  aspects  of  every  life.  A  third 
distinctive  feature  of  gender  and  politics  research 
influenced  by  poststructuralism  is  the  abandon¬ 
ment  of  a  normative  standpoint  from  which  value 
judgments  are  made  to  appraise  or  condemn  a 
reality  that  is  taken  as  objectively  given,  such  as 
the  patriarchal  or  the  women-friendly  state.  In 
other  words,  the  aim  of  research  informed  by  this 
perspective  is  not  to  provide  answers  but  to  show 
how  gender  power  is  constructed  through  rela¬ 
tional  discursive  practices  in  specific  social  and 
political  contexts.  For  example,  recent  studies  on 
gender  and  the  state  argue  that  the  idea  that  the 
state  is  discursively  constructed  can  explain  not 
only  why  there  are  patriarchal  states  as  well  as 
women-friendly  states  but  also  why  it  is  possible  to 
find  significant  differences  in  this  regard  between 
different  parts  of  the  same  state.  Poststructural 
feminist  perspectives  have  also  influenced  recent 
work  on  gender  and  political  representation. 

Postcolonial  Feminism 

The  postcolonial  feminist  perspective  (mainly  in 
the  field  of  international  relations)  challenges  the 
epistemological  and  ontological  bases  of  liberal 
and  radical  feminist  research  in  two  important 
ways.  First,  it  argues  that  gender  oppression  is 
perpetrated  by  colonialism  as  well  as  patriarchy 
and  that  the  combination  of  both  forms  of  oppres¬ 
sion  shape  women’s  roles  and  their  status  in  non- 
Western  societies.  It  is  critical  of  the  universalist 
assumptions  of  women’s  shared  experiences  that 
liberal  and  radical  feminism  presume.  This  leads 
postcolonial  scholars  to  question  the  Western  dis¬ 
cursive  construction  of  Third  World  women  as  a 
singular  entity,  its  inbuilt  tendency  to  homogenize 
women’s  experiences,  and  its  unstated  valuing  of 
Western  women’s  empowerment.  Second,  it  chal¬ 
lenges  the  inherent  racism  in  the  colonial  project 
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in  historical  and  contemporary  contexts. 
Postcolonial  feminism  as  a  research  approach  uses 
the  deconstruction  of  the  Euro/Westcrn-centric 
legacy  to  recognize  the  multiple  forms  of  women’s 
activism  and  empowerment  in  Third  World  societ¬ 
ies.  More  recent  studies  have  sought  to  uncover  the 
shared  sites  of  oppression  among  women — such  as 
global  capitalism — while  recognizing  the  diversity 
of  their  traditions,  standpoints,  and  experiences. 
Postcolonial  feminist  political  research  has  much 
in  common  with  postmodern  studies:  Both  seek  to 
uncover  the  multiple  layers  of  female  oppression 
that  are  created  through  discourse  and  narrative. 
Both  are  attentive  to  the  intersections  of  race, 
class,  gender,  and  sexuality  with  national,  cultural, 
political,  and  other  identities. 

Established  and  Emerging  Research 
Agendas  in  Feminism  and  Politics 

The  brief  reviews  above  of  women’s  political 
empowerment  and  scholarly  approaches  to  studying 
gender  and  politics  reveal  a  close  connection  between 
the  worlds  of  activism  and  research.  This  section 
outlines  some  important  thematic  areas  of  research 
that  unite  the  study  of  feminism  and  politics. 

The  first  theme  is  one  that  is  fundamental  to 
political  science — democracy  and  political  repre¬ 
sentation.  Feminist  scholars  have  paid  consider¬ 
able  attention  to  this  theme,  grounding  their 
work  in  case  and  comparative  studies  of  women’s 
political  participation  and  legislative  seat  holding 
and  electoral  and  party  politics.  This  research 
also  encompasses  extensive  studies  on  gender  dif¬ 
ferences  in  political  attitudes  and  behavior.  An 
extension  of  this  research  examines  women’s 
contributions  to  democracy  and  democratization 
processes.  The  cultural  turn  in  feminist  studies  is 
also  reflected  in  this  scholarship,  with  a  focus  on 
the  role  of  the  media  in  shaping  gender  percep¬ 
tions  of  politics.  The  studies  in  this  thematic  area 
are  generally  conducted  within  the  tradition  of 
liberal  feminism  and  employ  positivist  and  his¬ 
torical  methods  of  scholarship.  The  concern  in 
this  field  of  research  is  to  identify  and  explore 
women’s  participation  in  civic  and  political  activ¬ 
ity  and  the  sites  and  issues  on  which  women  chal¬ 
lenge  the  state  and  seek  to  represent  interests  and 
perspectives  that  may  be  unacknowledged  in  the 
existing  political  order. 


A  related  thematic  research  field  is  that  of  the 
relationship  between  gender  politics  and  the  state. 
This  disparate  area  addresses  the  many  and  varied 
sites  of  feminist  political  contestation,  including 
women’s  movements,  nationalism  and  feminism, 
policy  issues,  gender  mainstreaming,  and  the  male- 
gendered  nature  of  political  institutions.  There  are 
two  common  characteristics  of  this  research,  espe¬ 
cially  among  most  recent  scholarship.  One  is  a 
shift  of  attention  from  women  in  politics  to  gender 
in  politics.  This  highlights  the  way  in  which  politi¬ 
cal  institutions  are  gendered  and  how  they  act  to 
construct  and  reinforce  gender-power  relations, 
both  in  the  political  world  and  in  the  wider  society. 
The  second  characteristic  is  the  marked  influence  on 
gender  and  politics  research  of  the  institutional  turn 
that  has  taken  place  in  the  wider  field  of  political 
science.  Emerging  research  agendas  in  this  area 
focus  on  changes  in  the  modern  nation-state  with  a 
view  to  exploring  the  differential  impact  of  a  variety 
of  state  architectures  (federalism,  regionalism,  etc.) 
on  gender  relations  as  well  as  examining  how  recent 
trends  in  globalization,  supranationalization,  and 
decentralization  call  into  question  established 
assumptions  about  gender  politics  and  the  state. 
Again,  producing  a  prolific  output  of  scholarly 
analysis,  this  body  of  work  uses  mainly  qualitative 
methodologies  in  both  liberal  and  radical  feminist 
traditions. 

Women’s  advocacy  in  transnational  and  global 
civil  society  constitutes  a  more  recent  thematic 
area  in  feminism  and  politics  research.  Prompted 
by  observations  on  the  globalizing  nature  of  polit¬ 
ical  life  in  the  21st  century  and  its  effects  on 
women’s  opportunities  for  civic  and  political 
expression,  this  scholarship  examines  the  gendered 
effects  of  the  global  political  economy  and  interna¬ 
tional  political  institutions.  This  research  strand  is 
firmly  grounded  in  feminist  international  relations 
research,  where  questions  of  transnational  activ¬ 
ism  and  the  gendered  nature  of  international 
political  institutions,  linkages,  and  practices  are 
opened  to  critical  review.  This  thematic  field  offers 
the  most  eclectic  use  of  methodologies  in  conduct¬ 
ing  research,  and  the  influence  of  critical  social  and 
political  theory  on  scholarship  is  apparent. 

Yvonne  Galligan  and  Sara  Clavero 
Queen’s  University  Belfast 
Belfast,  United  Kingdom 
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Feminist  Movements 


Feminist  movements  have  become  ubiquitous  in 
the  modern  world  over  the  last  half  century,  and 
today,  there  are  few  countries  in  which  such  orga¬ 
nizations  do  not  exist.  Feminist  movements  have 
mobilized  on  a  vast  number  of  issues  ranging  from 
civil  and  equal  rights  to  violence  against  women 
and  reproductive  rights.  They  have  also  organized 
around  issues  of  war  and  peace,  the  environment, 
racism,  and  world  poverty.  Feminist  movements 
were  part  of  the  struggles  for  national  liberation 
in  the  20th  century  and  played  an  important  part 
in  recent  transitions  to  democracy.  They  have 
always  had  international  connections,  but  since 
the  United  Nations  (UN)  Women’s  World  Con¬ 
ferences  at  Nairobi  (1986)  and  Beijing  (1995)  and 
the  UN  World  Population  Conference  at  Cairo 
(1994),  there  has  been  an  exponential  growth  of 
transnational  women’s  groups.  In  their  aims, 
feminist  movements  have  never  been  solely  con¬ 
cerned  with  political  change;  sociocultural  change 
has  always  been  seen  as  essential  to  challenging 
gender  hierarchies. 

Given  the  plethora  of  issues  on  which  women 
have  mobilized,  distinguishing  feminist  movements 


from  women’s  movements  in  general  has  been  con¬ 
tentious  in  the  literature.  Women’s  movements 
mobilize  women  as  women ,  as  a  constituency, 
regardless  of  the  issue  that  they  address.  The  term 
feminist  movement  is  best  reserved  for  those  wom¬ 
en’s  movements  challenging  gender  hierarchies 
that  subordinate  women  to  men.  This  debate  has 
been  fueled  by  historical  records  showing  that 
both  have  often  intermingled  and  influenced  each 
other.  In  recent  studies,  the  definition  of  feminist 
and  women’s  movements  is  now  regarded  as  an 
empirical  question,  to  be  answered  by  contextual 
analysis  of  the  discourses  and  activities  of  the 
movements  in  question.  Among  scholars,  it  has 
become  common,  given  the  varieties  in  feminism, 
to  speak  of  feminist  movements  in  the  plural. 

The  term  feminism  was  first  used  in  19th- 
century  Europe,  but  the  discourse  itself  can  be 
traced  to  the  early  15th  century.  Its  principal  thesis 
is  that  the  condition  of  women  is  not  ordained  by 
nature  or  supernatural  order  but  can  and  should 
be  changed.  During  the  era  of  colonization, 
Western  feminist  ideas  became  familiar  in  the  East 
and  the  South,  which  led  to  early  reforms  of  wom¬ 
en’s  rights  in  the  first  half  of  the  20th  century  in  a 
number  of  countries  as  diverse  as  China,  Turkey, 
and  much  of  Latin  America  as  well  as  in  North 
America  and  Europe. 

Up  to  the  1980s,  women’s  movements  in  the 
South  had  been  critical  of  the  concept  of  feminism, 
regarding  it  as  a  limited  Western  approach  to 
women’s  issues  and  arguing  that  social-economic 
development  is  of  greater  importance  for  improving 
women’s  status  in  the  developing  world.  The  critique 
was  shared  by  many  from  the  former  communist 
world  who  depicted  feminism  as  a  bourgeois  ideol¬ 
ogy  and  claimed  women’s  equality  had  been  achieved 
under  communism.  However,  autonomous  feminist 
movements  have  since  then  emerged  in  many  coun¬ 
tries  in  the  South  and  East,  with  their  own  feminist 
discourses  and  agendas  of  gender  justice.  The 
increasing  international  engagement  of  feminist 
movements  has  led  to  a  consensus  in  the  2000s  on  a 
broad  social  feminism  engaging  with  global  inequal¬ 
ities  and  social  justice  as  well  as  including  reproduc¬ 
tive  rights  and  violence  against  women. 

Social  movement  scholars  have  argued  that  the 
emergence  of  social  movements  is  tied  to  the  devel¬ 
opment  of  the  national  state,  which  created  a  public 
sphere  for  addressing  grievances  and  protest. 
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Mobilization  occurs  in  periods  of  crisis,  nearly 
always  generating  feminist-inspired  protest  along¬ 
side  other  social  movements.  In  the  literature  on 
feminist  movements,  these  periods  of  heightened 
activities  have  been  denoted  by  the  metaphor  of 
waves.  Usually  the  term  first  wave  is  reserved  for  the 
feminist  movements  that  arose  in  the  second  half  of 
the  1 9th  century,  the  period  of  industrialization  in 
Europe  and  Northern  America,  and  the  second 
wave  for  those  originating  in  the  major  social 
changes  of  the  1960s  and  1970s.  Historians  have 
made  a  strong  case,  however,  for  earlier  waves  of 
feminist  activity  in  the  period  of  the  English  Civil 
War  and  the  French  Enlightenment  as  well  as  during 
the  French  Revolution,  when  numerous  treatises 
and  pamphlets  on  the  status  of  women  were  pub¬ 
lished.  The  metaphor  is  also  somewhat  misleading 
as  there  is  ample  evidence  of  sustained  contention 
by  women  between  the  waves.  Moreover,  the  timing 
of  protests  is  not  synchronical  across  the  world. 
Women’s  suffrage  and  changes  in  patriarchal  family 
law  were  gained  in  the  1920s  and  1930s  in  Latin 
America,  while  in  many  European  countries,  such 
reforms  had  taken  place  in  the  previous  decades. 

The  early  social  movements  of  the  mid-  19th 
century  are  the  precursors  of  feminist  movements 
in  the  Western  world.  These  were  the  antislavery 
(abolitionist)  and  temperance  movements  in  the 
United  States  and  Britain,  and  many  social  reform 
and  religious  revival  movements  in  Europe.  By  the 
late  19th  century,  feminist  movements  had  devel¬ 
oped,  either  arguing  that  women  were  equal  to 
men,  and  therefore  were  entitled  to  equal  rights,  or 
different  from  men,  requiring  the  recognition  of 
women’s  unique  contribution  to  society.  The  first 
wave  has  often  been  reduced  to  the  struggle  for 
suffrage,  but  it  had  a  much  broader  agenda  of 
social  and  legal  rights.  The  campaign  for  suffrage 
emerged  in  the  late  19th  century;  today,  the  only 
countries  where  women  cannot  vote  on  a  par  with 
men  are  several  Islamic  states  in  the  Middle  East. 

Other  issues  of  the  first  wave  were  the  reform  of 
family  law,  access  to  education,  and  the  right  to 
work.  The  first  wave  retained  its  social  reform 
character,  organizing  on  issues  as  diverse  as  public 
health,  social  work,  temperance,  peace,  prostitu¬ 
tion,  and  the  trafficking  of  women.  Many  femi¬ 
nists  also  campaigned  against  informal  social 
codes  limiting  women’s  activities,  such  as  dress 
reform,  access  to  public  spaces,  and  bars  on  female 


sports.  Antiwar  activism  and  campaigns  against 
trafficking  led  to  the  first  transnational  feminist 
organizations  in  the  1890s.  Often,  it  is  held  that 
the  first  wave  ended  in  the  West  after  World  War 
I,  but  in  many  countries  the  feminist  movement 
remained  active  in  social  reform  issues  beneficial 
to  women,  such  as  maternal  leave.  Such  demands 
were  able  to  cross  class  divides.  One  of  the  major 
contentious  issues  has  always  been  that  of  how  far 
feminist  movements  were  speaking  on  behalf  of  all 
women  and  not  just  for  the  middle  class  and 
whether  working  class  women  stood  to  gain  by 
suffrage  and  equal  rights. 

Demobilization  occurred  in  the  1940s  and 
1950s,  but  the  revival  of  feminism  owes  much  to 
entrenched  feminist  networks  and  women’s  institu¬ 
tions  during  this  period.  The  second  wave  emerged 
in  the  wake  of  the  early  protest  movements  of  the 
1960s,  such  as  the  civil  rights,  student,  and  antiwar 
movements  in  the  West  and  the  rising  tide  of 
national  independence  and  social  revolutionary 
movements  in  Latin  America  and  the  decolonizing 
world.  Often  called  the  new  or  the  autonomous 
women’s  movement,  its  emergence  galvanized 
existing  women’s  organizations  across  much  of  the 
world.  Differences  in  organizational  style,  action 
repertoire,  and  ideology  distinguished  the  second 
wave  from  the  older  branches  of  feminisms  in 
many  countries.  The  new  feminist  groups  were 
highly  critical  of  formal  politics,  organized  in  loose 
networks,  and  used  street  protest  instead  of  the 
traditional  lobby  to  reach  their  goals.  They  created 
a  new  radical  feminist  ideology  that  emphasized 
consciousness  raising  and  developed  a  far-reaching 
critique  of  patriarchy,  culture,  and  men. 

Politicizing  the  body,  by  setting  abortion  and 
reproductive  rights,  sexuality  (including  lesbianism), 
and  (sexual)  violence  on  the  political  agenda,  also 
originated  in  the  new  feminist  groups.  These  issues 
have  become  the  distinctive  marks  of  second-wave 
feminism,  leading  to  worldwide  reforms  and  to 
strong  opposition  from  (religious)  conservatives  in 
general.  But  the  second  wave  also  set  out  to  achieve 
time-honored  goals,  such  as  equal  political  represen¬ 
tation,  access  to  education  and  work,  and  decent 
living  conditions  in  the  developing  world.  The  cre¬ 
ation  of  a  vibrant  women’s  culture,  with  its  own 
media,  women’s  centers,  and  networks,  has  also 
been  integral  to  most  feminist  movements.  In  coun¬ 
tries  faced  with  authoritarian  or  military  regimes,  a 
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constituency  of  women  originated  who  fought  for 
democratization  and  rights  for  women  in  new  con¬ 
stitutions,  as  in  South  Africa  and  many  Latin 
American  states.  This  was  much  less  the  case  in  the 
former  communist  states,  where  feminism  became 
discredited  by  its  connection  to  the  former  regimes. 

One  of  the  crucial  issues  the  feminist  movement 
has  been  confronting  is  the  diversity  of  women:  No 
organization  can  claim  to  speak  on  behalf  of  all 
women.  Differences  of  class,  ethnicity,  sexual  ori¬ 
entation,  age,  and  disabilities  have  challenged 
unity  and  the  choice  of  priorities.  These  debates 
occurred  in  different  sequences  in  most  countries 
and  in  the  international  arena.  Scholars  generally 
agree  that  the  feminist  movement  is  one  of  the 
most  successful  of  the  1960s  social  movements  and 
the  one  with  the  greatest  longevity.  This  is  testi¬ 
mony  to  the  increasing  success  of  feminist  move¬ 
ments  in  bridging  diverse  communities  and  coping 
with  the  changing  global  environment.  Scholars 
disagree  about  the  end  of  the  second  wave.  Some 
scholars  have  contended  that  the  new  autonomous 
movement  disappeared  by  the  early  1980s.  Others 
have  pointed  out  that  many  second-wave  groups 
have  turned  into  professional  lobby  organizations 
still  targeting  government  and  parliament  on 
women’s  issues  into  the  2000s,  a  phenomenon  also 
well  documented  at  the  supranational  level. 

The  study  of  feminist  and  women’s  movements, 
like  social  movements  in  general,  has  fallen  victim 
to  the  division  of  labor  between  political  scientists, 
historians,  and  sociologists.  Political  science,  with 
its  strong  distinction  between  the  public  and  the 
private,  has  emphasized  formal  institutions  and 
political  behavior.  Sociology  tackled  social  move¬ 
ments  but  has  often  paid  only  nominal  attention  to 
feminist  and  women’s  movements,  raising  ques¬ 
tions  about  the  validity  of  its  theories.  Women’s 
studies  and  women’s  history,  both  legacies  of  sec¬ 
ond-wave  feminism,  have  produced  most  of  the 
current  knowledge;  as  of  now,  these  findings  have 
been  only  incompletely  incorporated  in  the  other 
disciplines,  and  they  do  provide  the  most  promis¬ 
ing  material  for  further  study. 

Joyce  Outsboorn 

Leiden  University 
Leiden,  Netherlands 
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Feminist  Theory  in 
International  Relations 


Feminist  theory  has  developed  in  the  context  of  the 
feminist  movement  and  seeks  to  explain  women’s 
subordination,  its  causes,  and  the  ways  in  which  it 
is  perpetuated.  It  employs  gender  as  a  key  concept, 
that  is,  the  social  relation  between  the  sexes,  argu¬ 
ing  that  this  relation  is  not  naturally  given  but 
socially  constructed.  Many  feminists  today  think 
of  gender  as  informing  multiple  dimensions  of 
human  experience,  from  individual  identities  to 
social  and  political  institutions.  Thus,  gender  oper¬ 
ates  through  individual  identifications,  shared  nar¬ 
ratives,  and  symbols.  As  a  social  construct,  gender 
not  only  produces  a  social  reality  but  also  signifies 
a  relationship  of  power — the  predominance  of 
things  and  people  gendered  masculine  over  those 
gendered  feminine.  Thus,  feminists  also  use  gender 
as  a  methodological  tool,  a  lens  through  which  to 
look  at  the  world  in  order  to  reveal  gendered  rela¬ 
tionships  of  power.  In  international  relations, 
feminist  theory  has  made  three  distinct  contribu¬ 
tions:  First,  using  gender  as  a  lens,  it  has  served  to 
inform  critiques  of  the  field  as  gendered.  Second, 
approaching  gender  as  a  social  construct,  it  has 
described  the  reproduction  of  masculinity  and 
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femininity  in  the  conduct  of  international  rela¬ 
tions.  Finally,  it  has  introduced  a  unique  method¬ 
ology  that  requires  researchers  to  think  about  their 
own  position  in  the  research  enterprise.  This  entry 
describes  the  three  distinct  contributions. 

Compared  with  other  disciplines,  feminist  theory 
entered  the  field  of  international  relations  relatively 
recently,  with  seminal  writings  by  Cynthia  Enloe 
and  Jean  Elshtain  in  the  late  1980s  and  J.  Ann 
Tickner,  V.  Spike  Peterson,  and  Christine  Sylvester 
in  the  early  1990s.  While  this  first  generation  of 
feminist  research  in  international  relations  focused 
on  critiquing  concepts,  a  second  generation  has 
focused  on  empirical  investigations,  which  have 
resulted  in  a  broader  questioning  of  methodologies. 
International  relations  scholarship  employing  femi¬ 
nist  theory  has  become  institutionalized  through  the 
formation  of  the  Feminist  Theory  and  Gender 
Studies  section  of  the  International  Studies 
Association  in  1990  and  the  founding  of  a  new  jour¬ 
nal  in  1999,  the  International  Feminist  Journal  of 
Politics. 

Critiquing  the  Canon 

Feminists  have  argued  that  standard  concepts  of 
international  relations  orthodoxy  are  gendered,  that 
is,  that  they  carry  associations  of  masculinity  and 
femininity.  Classical  realism  imagines  individuals 
seeking  domination  and  engaged  in  a  struggle  for 
power.  Feminists  have  suggested  that  this  concep¬ 
tion  of  individuals  reflects  masculine  experiences. 
What  realists  consider  human  nature  (following  the 
theories  of  Thomas  Hobbes)  does  not  reflect  the 
experiences  of  mothers  and  other  caretakers  who 
know  themselves  less  as  autonomous  individuals 
separate  from  others  but  as  connected  in  their  caring 
relationships.  Feminists  point  out  that  Thomas 
Hobbes’s  state  of  nature  is  populated  by  individuals 
who  are  self-generated,  not  born  of  or  nurtured  by 
women,  but  who — in  his  own  words — shot  out  of 
the  ground  like  mushrooms.  This  image  has  informed 
political  theories  that  presume  atomistic  human 
subjects.  Projected  onto  the  state,  it  has  become  the 
starting  point  for  a  realist  imagining  of  the  interna¬ 
tional  world  as  made  up  of  autonomous  and  self- 
interested  actors  devoid  of  social  connections. 

In  addition  to  constructing  humans  as  essen¬ 
tially  male,  realist  theories  celebrate  a  militarized 
version  of  masculinity.  They  construct  this  form  of 


masculinity  as  hegemonic,  implicitly  denigrating 
other  forms  of  masculinity  and  any  type  of 
femininity.  The  opposition  between  hegemonic 
masculinity  and  subordinated  masculinities  and 
femininities  drives  narratives  of  war  and  security. 
For  example,  the  opposition  between  a  masculine 
protector  and  women  in  need  of  protection  is  a 
basic  structuring  device  that  justifies  and  thus  per¬ 
petuates  war  making.  In  her  examination  of 
women  and  war,  Elshtain  gives  further  content  to 
this  opposition,  finding  that  Western  philosophy 
constructs  the  opposition  as  one  between  mascu¬ 
line  “just  warriors”  and  feminine  “beautiful  souls” 
imagined  as  innocent  about  the  ways  of  the  world. 
The  opposition  drives  a  basic  narrative  that  has 
drawn  women  and  men  into  war  while  silencing 
the  stories  of  women  and  men  that  do  not  fit  the 
stereotypes.  The  opposition  also  informs  a  multi¬ 
plicity  of  stories  around  war  fighting,  from  male 
courage  and  heroism  to  feminine  nobility  and  sac¬ 
rifice.  By  privileging  an  understanding  of  a  milita¬ 
rized  male  as  the  prototypical  human,  theories  of 
international  relations  produce  an  intrinsically 
biased,  highly  militarized  understanding  of  the 
international  world,  while  at  the  same  time  repro¬ 
ducing  a  thoroughly  stereotypical  understanding 
of  what  it  means  to  be  a  woman  or  man. 

In  addition  to  defining  gendered  actors  in  inter¬ 
national  relations,  international  relations  theory 
employs  the  gendered  opposition  between  public 
and  private  in  order  to  explain  and  prescribe  par¬ 
ticular  types  of  conduct.  For  example,  the  separa¬ 
tion  of  public  morality  from  private  disorder  fuels 
Machiavelli’s  gendered  conception  of  politics. 
Virtii,  the  character  trait  Machiavelli  demanded 
from  a  leader,  emerges  as  a  masculine  principle  and 
is  opposed  to  a  feminized  fortuna,  that  is,  fate, 
which  he  thought  needed  to  be  conquered  by  male 
virtue.  In  political  realism,  fortuna  reappears  as  the 
disorder  of  anarchy  in  the  international  system  and 
the  task  of  the  policymaker  is  to  tame  this  femi¬ 
nized  disorder.  A  highly  misogynist  construction  of 
gender  relations  thus  cements  the  prescriptions  of 
power  politics. 

If  the  citizen  warrior  reigns  supreme  in  classical 
security  studies,  notions  of  rational  man  replace 
him  in  liberal  international  political  economy  (IPE). 
Feminists  have  criticized  the  instrumental  rational¬ 
ity  reflected  in  notions  of  economic  man  because  it 
denies  the  neediness  of  human  beings  and  their 
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mutual  dependence  on  each  other.  Presumptions  of 
self-interested  cost-benefit  calculations  in  economic 
conduct  neglect  this  dependence  and  underestimate 
the  role  of  caring  in  the  production  of  values. 

As  in  security  studies,  feminists  have  also  found 
explanations  in  IPE  to  be  based  on  the  denigration 
of  the  private  sphere.  This  devaluing  leads  theories 
of  political  economy — in  both  the  liberal  and 
Marxist  variety — to  ignore  the  broad  realm  of 
social  reproduction,  the  value  generated,  mostly  by 
women,  through  the  unpaid  work  of  subsistence 
production,  caring,  and  housekeeping. 

Women  and  Men,  Femininity  and 
Masculinity  in  International  Relations 

The  feminist  critique  of  the  international  relations 
canon  has  directed  scholars  toward  exploring  areas 
left  out  of  orthodox  international  relations  schol¬ 
arship.  Arguably,  feminist  scholars  operate  within 
a  different  ontology  of  international  relations — 
one  that  includes  spaces  and  practices  of  women 
and  one  in  which  gender  produces  a  diverse  range 
of  outcomes.  A  major  focus  of  inquiries  thus  has 
been  to  make  visible  the  practices  of  gendered 
agents.  Another  focus  has  been  to  analyze  the  situ¬ 
ated  constructions  of  femininity  and  masculinity 
that  are  produced  in  such  practices. 

In  security  studies,  feminists  have  followed 
Enloe’s  lead  and  argued  that  power  operates  in  a 
wide  variety  of  everyday  activities  to  sustain  milita¬ 
rized  security  arrangements.  Camouflage  clothing, 
Rambo  movies,  the  training  programs  of  the  U.S. 
military  in  schools  and  colleges,  the  military’s  regu¬ 
lation  of  prostitution  around  foreign  army  bases, 
the  employment  of  sexualized  and  homely  imagery 
when  defense  intellectuals  talk  about  weapons  sys¬ 
tems — all  function  to  construct  the  military  and  its 
often  horrific  weapons  systems  as  a  benign  force, 
part  of  the  normal  order  of  things.  Women  gain  a 
militarized  role  in  this  order  as  much  as  men — as 
mothers  who  encourage  their  sons  to  go  to  war,  as 
pinup  girls,  nurses,  prostitutes,  and  soldiers’  wives 
who  divert  and  sustain  them  and  who  remind  them 
of  the  purpose  of  their  fighting.  Because  war  in  this 
way  depends  on  constructions  of  gender,  the  post¬ 
war  reconstruction  of  countries  needs  to  include  a 
reconstruction  of  gender  relations  if  it  wants  to 
prevent  wars  in  the  future. 

Feminists  also  have  explored  the  kinds  of  mas¬ 
culinities  being  constructed  in  particular  wars  and 


militaries,  finding  the  warrior  citizen  model 
hegemonic  but  also  tamed  in  the  first  Gulf  War  by 
a  combination  of  toughness  and  tenderness.  They 
have  sought  to  destabilize  the  model  of  militarized 
masculinity  by  exploring  the  resistance  against 
gays  in  the  military  and  the  military’s  refusal  to 
acknowledge  posttraumatic  stress  disorder. 
Denying  differences  in  sexual  orientation  and 
denying  the  humanity  of  soldiers  in  the  face  of  the 
horrors  they  encounter  are  efforts  to  preserve  a 
militarized  version  of  tough  and  fearless  masculin¬ 
ity  that  thrives  on  the  fiction  of  being  the  masculine 
protector  of  feminized  “womenandchildren” 
(Enloe’s  formulation). 

In  IPE,  feminist  researchers  have  sought  to  make 
visible  the  work  of  women  engaged  in  “reproduc¬ 
tion,”  including  subsistence  farmers,  maids,  home¬ 
workers,  and  sex  workers.  They  have  identified  a 
different  type  of  masculinity  as  hegemonic — that  is, 
a  bourgeois-rationalist  model  that  celebrates  com¬ 
petition,  reason,  and  self-control.  This  model  has 
become  particularly  emphasized  in  the  context  of 
neoliberal  restructurings  and  globalization.  Fem¬ 
inists  have  explored  the  construction  of  a  class  of 
global  jet-setting  managers  empowered  by  forms  of 
hegemonic  masculinity  and  have  shown  that  this 
has  been  accompanied  by  the  development  of  a 
global  low-pay  feminized  services  economy.  They 
have  described  the  reconstructions  of  gender  rela¬ 
tions  in  postsocialist  transition  economies,  includ¬ 
ing  patterns  that  pushed  women  out  of  workforces 
and  parliaments  and  re-enabled  their  sexual  objec¬ 
tification.  And  they  have  interpreted  the  newly 
emerging  East  Asian  developmentalism  as  a  rever¬ 
sal  of  colonial  hypermasculinity  and  as  made 
possible  by  an  inflated  traditional  paternalism  in 
newly  industrializing  countries.  Finally,  they  have 
explored  feminist  efforts  to  mainstream  consider¬ 
ations  of  gender  in  institutions  of  global  economic 
governance,  such  as  the  World  Bank,  showing  how 
feminism  becomes  integrated  into  existing  arrange¬ 
ments,  sometimes  improving  the  situation  of 
women  at  the  margins  while  also  continuing  to 
deny  the  relevance  of  the  reproductive  economy. 

Feminist  Methodology 

Most  feminists  in  international  relations  have  argued 
for  postpositivist  methodologies  that  allow  them  to 
critically  interrogate  the  field  and  its  practices. 
Having  grown  out  of  a  political  movement,  feminist 
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research  has  often  sought  to  ask  questions  relevant 
to  women’s  experiences  and  produce  knowledge 
useful  to  the  movement.  Rather  than  insisting  on  the 
researcher  keeping  a  distance  from  her  topic,  femi¬ 
nists  have  sought  to  specify  their  relationship  to  the 
topic.  For  some,  this  requires  defining  a  feminist 
standpoint — that  is,  specifying  that  they  speak  from 
the  perspective  of  a  specific  movement  or  a  specific 
community.  For  others,  it  has  meant  making  into  a 
subject  matter  their  own  position  or  experience  dur¬ 
ing  the  research  process.  Yet  others  employ  positiv¬ 
ist  methods  that  keep  the  researcher’s  distance  from 
the  object  of  inquiry  but  make  sure  to  ask  feminist 
questions.  All  share  an  element  of  reflexivity  toward 
the  research  enterprise  that  distinguishes  feminism 
from  most  other  approaches  in  international  rela¬ 
tions.  It  allows  feminists  to  problematize  the  politi¬ 
cal  effects  of  the  knowledge  produced  and  arguably 
abide  by  a  more  robust  version  of  objectivity  than 
approaches  that  fail  to  keep  in  view  the  relationship 
between  the  researcher  and  the  researched. 

Elisabeth  Friigl 
Florida  International  University 
Miami,  Florida,  United  States 
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The  Development  Assistance  Committee  (DAC)  of 
the  Organisation  for  Economic  Co-operation  and 


Development  (OECD)  defines  “development  assis¬ 
tance”  (or  foreign  aid)  as  financial  flows  that 
qualify  as  official  development  assistance  (ODA). 
ODA  is  calculated  as  the  sum  of  grants  and  loans 
to  aid  recipients  that  are  (a)  undertaken  by  the  offi¬ 
cial  sector  of  the  donor  country,  (b)  with  promo¬ 
tion  of  economic  development  and  welfare  in 
recipient  countries  as  the  main  objective,  (c)  at 
concessional  financial  terms,  where  the  grant  ele¬ 
ment  is  equal  to  at  least  25  % .  In  addition  to  finan¬ 
cial  flows,  technical  cooperation  costs  are  included 
in  ODA;  but  grants,  loans,  and  credits  for  military 
purposes  are  excluded,  and  transfer  payments  to 
private  individuals  are  in  general  not  counted.  The 
same  goes  for  donations  from  the  public,  commer¬ 
cial  loans,  and  foreign  direct  investment  (FDI).  It  is 
common  to  treat  ODA  and  foreign  aid  as  equiva¬ 
lent,  but  this  can  be  misleading.  Assistance  funded 
by  nongovernmental  organizations  (NGOs)  includ¬ 
ing  church-based  agencies  has  grown  significantly 
in  the  past  25  years  and  now  amounts  to  about  one 
third  of  ODA. 

Only  official  aid  to  “traditional”  developing 
countries  counted  as  ODA  until  2005.  For  these 
(Part  I)  countries,  there  is  a  long-standing  United 
Nations  (UN)  target  from  1970  that  they  should 
receive  0.7%  of  donors’  gross  national  income 
(GNI)  as  aid.  Assistance  to  the  “more  advanced” 
Eastern  European  and  “more  advanced”  develop¬ 
ing  (Part  II)  countries  was  recorded  separately  by 
DAC  as  “official  aid,”  not  included  as  part  of 
ODA.  DAC  countries  have  over  the  years  accounted 
for  some  95  %  of  all  ODA  flows,  but  the  distinction 
between  Part  I  and  Part  II  countries  is  no  longer 
used.  All  flows  that  fulfill  the  established  criteria  are 
now  included  in  the  aggregate  measure  of  ODA, 
but  NGO-funded  contributions  are  not  added. 

In  2009,  the  total  amount  of  ODA  disbursed  by 
donors  to  developing  countries  and  multilateral 
organizations  reached  US$123.1  billion  according 
to  the  OECD/DAC  2010  statistics.  (For  the  data  in 
this  and  subsequent  paragraphs,  see  OECD,  2010). 

This  means  that  the  average  citizen  in  the  donor 
countries  contributed  around  US$149  as  ODA. 
This  can  be  compared  with  a  figure  of  around 
US$64  in  1960-1973  and  US$99  in  1992. 
However,  the  UN  target  of  0.7%  of  GNI  is  with 
few  exceptions  far  from  being  reached.  Donors 
disbursed  30.4%  of  total  foreign  aid  to  multilat¬ 
eral  organizations  in  2006,  and  some  70%  of  this 
flow  was  disbursed  to  developing  countries,  with 
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the  European  Union  and  the  International 
Monetary  Fund  and  World  Bank  as  the  dominat¬ 
ing  sources  followed  by  the  UN  and  the  Regional 
Development  Banks. 

It  is  a  widespread  perception  that  foreign  aid 
amounts  to  a  very  significant  resource,  in  both 
absolute  and  relative  terms,  and  that  aid  is  not 
insignificant  measured  relative  to  developing  coun¬ 
try  production  and  income.  At  the  same  time,  aid 
is  much  less  sizeable  when  measured  in  relation  to 
GNI  or  government  budgets  in  the  donor  countries 
or  in  comparison  with  population  size  of  aid¬ 
receiving  countries.  Moreover,  aid  has  been  on  a 
declining  trend  since  the  early  1990s  as  a  share  of 
GNI  in  recipient  countries.  Most  recipient  coun¬ 
tries  receive  aid  to  the  order  of  1.8%  of  their  GNI 
per  year  with  a  median  of  3.2%.  This  corresponds 
to  a  distribution  of  aid  per  capita  with  a  mode  of 
US$17.9  per  year  and  a  median  of  US$31. 
Accordingly,  the  relative  size  of  the  aid  inflow  var¬ 
ies  significantly  among  recipients,  and  while  the 
13.2%  size  of  the  aid  to  GNI  ratio  in,  for  example, 
Tanzania  may  seem  high,  this  share  reflects  not 
only  the  size  of  the  aid  flow  but  also  the  very  low 
level  of  income.  With  this  background,  modest 
expectations  are  advisable  when  analyzing  the 
overall  impact  of  past  aid  on  development. 

In  subsequent  paragraphs,  the  historical  and 
theoretical  context  of  foreign  aid  is  reviewed  first. 
This  is  followed  by  overviews  of  the  empirical  evi¬ 
dence  on  the  allocation  of  aid  and  its  impact  on 
furthering  growth  and  development  in  aid-receiving 
countries.  The  conclusion  provides  discussion  and 
summary  remarks. 

Historical  and  Theoretical  Context 

Foreign  aid  in  its  modern  form  has  roots  back  to 
the  early  1940s  and  intensified  after  the  disruption 
that  followed  World  War  II.  The  international 
economic  system  had  collapsed,  and  war-ravaged 
Europe  faced  a  critical  shortage  of  capital  and  an 
acute  need  for  physical  reconstruction.  The 
response  was  the  European  Recovery  Program, 
commonly  known  as  the  Marshall  Plan.  During 
the  peak  years,  the  United  States  transferred  some 
2%  to  3%  of  its  national  income  to  help  restore 
Europe.  The  Marshall  Plan,  which  was  adminis¬ 
tered  by  the  Organisation  for  European  Economic 
Co-operation  (OEEC),  the  predecessor  of  the 


OECD,  was  implemented  on  schedule,  and  its  suc¬ 
cess  fuelled  highly  optimistic  expectations  about 
the  future  effectiveness  of  foreign  aid. 

After  the  success  of  the  Marshall  Plan,  the 
attention  of  industrialized  nations  turned  to  the 
developing  countries,  many  of  which  became  inde¬ 
pendent  around  1960.  Economic  growth  in  a  state- 
led  planning  tradition  became  a  key  objective 
during  the  1950s  and  1960s,  and  it  was  widely 
believed  that  poverty  and  inequality  would  be 
quickly  eliminated  through  growth  and  moderniza¬ 
tion  (“trickle-down”).  Theoretical  thinking  on  eco¬ 
nomic  development  at  the  macrolevel  was  informed 
by  both  the  Harrod-Domar  growth  model  (which 
was  extended  into  the  two-gap  model  of  Chenery 
and  Strout)  and  the  Lewis  dual-sector  model  frame¬ 
work;  whereas  standard  microeconomic  cost- 
benefit  analysis  (CBA)  was  relied  on  in  project 
analysis.  The  Harrod-Domar  model  is  based  on  the 
assumption  that  investment  is  the  key  constraint  on 
growth,  whereas  the  Lewis  model  is  concerned  with 
the  transfer  of  labor  from  the  “traditional”  (rural) 
to  the  “modern”  (industrial)  sector.  CBA,  on  the 
other  hand,  assesses  cost  and  benefits  of  individual 
microeconomic  project  interventions  based  on 
shadow  prices  for  inputs  and  outputs  and  derives 
net  present  values  and  internal  rates  of  return. 

The  major  part  of  the  rapidly  increasing  bilat¬ 
eral  flows  during  the  1950s  came  from  the  United 
States.  New  bilateral  donor  agencies  (other  than 
the  United  States)  were  mainly  established  in  the 
1960s,  with  the  Commonwealth-inspired  Colombo 
Plan  from  1950  being  an  exception.  A  transition 
toward  more  independent,  multilateral  relations 
also  began  to  emerge  during  the  1960s.  This  cre¬ 
ated  a  constituency  for  foreign  aid,  and  the  non- 
aligned  movement  gave  an  articulated  developing 
country  focus  to  this  voice,  as  did  the  various 
organs  of  the  UN.  The  International  Bank  for 
Reconstruction  and  Development,  or  World  Bank, 
established  at  the  Bretton  Woods  Conference  in 
1944,  came  to  play  a  central  role  in  development 
assistance  and  international  policy  formulation, 
especially  following  the  creation  of  the  International 
Development  Association  in  1960. 

The  original  Marshall  Plan  was  built  around 
support  to  finance  general  categories  of  imports 
and  strengthen  the  balance  of  payments  (i.e.,  pro¬ 
gram  aid),  but  from  the  early  1950s,  project  aid 
became  the  dominating  aid  modality.  Some  donors 
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continued  to  supply  program  aid,  but  aid  was 
increasingly  disbursed  for  the  implementation  of 
specific  capital  investment  projects  and  associated 
technical  assistance. 

The  multilateralism  of  aid  became  somewhat 
more  pronounced  after  the  mid-1970s.  Multilateral 
channels  were  at  the  time  seen  as  more  efficient 
and  less  political  than  bilateral  aid,  so  the  UN, 
World  Bank,  and  other  multilateral  agencies 
expanded  their  activities  considerably.  The  1970s 
also  saw  an  increased  focus  on  employment, 
income  distribution,  and  poverty  alleviation  as 
essential  objectives  of  development  and  foreign 
aid.  The  effectiveness  of  trickle-down  was  widely 
questioned,  and  new  strategies  referred  to  as  basic 
human  needs  and  redistribution  with  growth  were 
formulated  and  propagated  alongside  more  radical 
dependency  theories  of  development.  Nevertheless, 
the  typical  project  aid  modality  remained  largely 
unchanged. 

The  golden  era  of  the  1960s  and  1970s  came  to 
an  abrupt  end  at  the  beginning  of  the  1980s.  The 
second  oil  shock  in  1979  reversed  economic  condi¬ 
tions,  and  there  was  a  huge  increase  in  interest 
rates  due  to  the  economic  stabilization  policies  in 
the  developed  countries.  The  international  debt 
crisis  erupted,  and  macroeconomic  imbalance 
became  characteristic.  On  the  political  scene, 
Ronald  Reagan  and  Margaret  Thatcher  came  to 
power  in  the  United  States  and  the  United  Kingdom, 
respectively,  and  at  the  World  Bank,  Anne  Krueger 
became  vice  president  and  chief  economist,  replac¬ 
ing  Hollis  Chenery.  Economic  circumstances  in  the 
developing  countries  and  relations  between  the 
North  and  South  changed  radically.  The  crisis  hit 
hard,  especially  in  many  African  countries.  Focus 
in  development  strategy  and  policy  shifted  to  inter¬ 
nal  domestic  policy  failure,  and  achieving  macro- 
economic  balance  (externally  and  internally) 
became  widely  perceived  as  an  essential  prerequi¬ 
site  for  renewed  development. 

“Rolling  back  the  state”  turned  into  a  rallying 
call  in  the  subsequent  structural  adjustment  efforts, 
and  reliance  on  market  forces,  outward  orienta¬ 
tion,  and  the  role  of  the  private  sector,  including 
NGOs,  was  emphasized  by  the  World  Bank  and 
others.  In  parallel,  poverty  alleviation  slipped  out 
of  view  in  mainstream  agendas  for  economic 
reform  but  remained  at  the  center  of  attention  in 
more  unorthodox  thinking  such  as  the  “adjustment 


with  a  human  face”  approach  of  the  United 
Nations  Children’s  Fund  (UNICEF).  At  the  same 
time,  bilateral  donors  and  international  agencies 
struggled  with  how  to  channel  resources  to  the 
developing  world.  Quick-disbursing  macroeco¬ 
nomic  program  assistance,  such  as  balance  of  pay¬ 
ments  support  and  sector  budget  support  (which 
were  not  tied  to  investment  projects  and  which 
could  be  justified  under  the  headings  of  stabiliza¬ 
tion  and  adjustment),  appeared  an  ideal  solution  to 
the  dilemma  of  maintaining  the  resource  flow  and 
the  desire  to  promote  policy  reform.  Financial  pro¬ 
gram  aid  and  adjustment  loans  (and  eventually 
debt  relief)  became  fashionable  and  policy  condi¬ 
tionality  more  widespread.  In  other  words,  a  ratio¬ 
nale  that  corresponded  well  with  the  orthodox 
guidelines  for  good  policy  summarized  by  the 
“Washington  Consensus”  had  been  found  for 
maintaining  the  aid  flow. 

Accordingly,  total  aid  continued  to  grow 
steadily  in  real  terms  until  the  early  1990s,  but 
after  1992,  total  aid  flows  started  to  decline  in 
absolute  terms  until  the  turn  of  the  millennium. 
Many  reasons  account  for  the  fall  in  aggregate 
flows  after  1992,  including  first  of  all  the  end  of 
the  Cold  War.  The  same  can  be  said  for  the  weak¬ 
ening  patron-client  relationships  among  the  devel¬ 
oping  countries  and  the  former  colonial  powers. 
The  traditional  support  of  foreign  aid  by  vocal 
interest  groups  in  the  industrial  countries  receded. 
Bilateral  and  multilateral  aid  institutions  were  sub¬ 
jected  to  criticism  and  characterized  at  times  as 
blunt  instruments  of  commercial  interests  in  the 
industrial  world  or  as  self-interested,  inefficient, 
rent-seeking  bureaucracies.  Moreover,  acute 
awareness  in  donor  countries  of  cases  of  bad  gov¬ 
ernance,  corruption,  and  “crony  capitalism”  led  to 
skepticism  about  the  credibility  of  governments 
receiving  aid. 

The  potential  role  of  foreign  aid  in  all  this 
attracted  attention,  and  the  fear  that  aid  can  gener¬ 
ate  undesirable  dependency  relationships  became 
clear  during  the  second  part  of  the  1990s  and 
persisted  into  the  21st  century.  In  parallel,  the 
perception  that  policy  conditionality  was  failing  to 
promote  policy  reform  started  to  assert  itself.  This 
assessment  prompted  World  Bank  and  independent 
academic  researchers  to  start  digging  into  the  aid- 
growth  relationship  using  modern  econometric 
techniques.  Even  more  recently,  efforts  to  develop 
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randomized  program  evaluation  techniques 
(including  a  variety  of  experimental  approaches) 
appeared  based  both  on  micro-CBA  studies  from 
the  past  and  an  increasing  understanding  of  the 
need  for  developing  proper  counterfactuals  in  eco¬ 
nomic  analysis  of  foreign  aid  and  associated  pol¬ 
icy  interventions. 

Aid  Allocation 

Over  the  years,  foreign  aid  has  been  justified  in 
public  policy  pronouncements  in  widely  differing 
ways,  ranging  from  pure  altruism  to  the  shared 
benefits  of  economic  development  in  poor  coun¬ 
tries  and  further  on  to  the  political  ideology,  for¬ 
eign  policy,  and  commercial  interests  of  the  donor 
country.  Few  would  dispute  that  humanitarian 
sentiments  have  also  motivated  donors.  Action  fol¬ 
lowing  severe  natural  calamities,  which  continue 
to  be  endemic  in  poor  countries,  is  an  example. 
Food  and  emergency  relief  also  remains  an  impor¬ 
tant  form  of  aid.  In  addition,  existing  data  and 
analyses  confirm  that  donors  allocate  relatively 
more  ODA  to  the  poorest  countries. 

Emphasis  on  the  needs  of  poor  countries  was  a 
particularly  prominent  characteristic — and  the 
underlying  economic  rationale — in  much  of  the 
policy  literature  on  foreign  aid  in  the  1950s  and 
1960s.  Here  focus  was  on  estimating  aid  require¬ 
ments  in  the  tradition  of  the  two-gap  model.  While 
still  influential  in  practice,  the  two-gap  model  has 
been  subjected  to  a  variety  of  criticisms,  and  in 
parallel,  the  role  of  aid  has  changed  to  a  much 
more  multidimensional  set  of  concerns.  Economic 
return  is  by  no  means  the  only  goal  of  aid. 
Nevertheless,  growth  and  economic  development 
in  aid-receiving  countries  have  continued  as  a 
yardstick  for  the  effectiveness  of  aid  both  in  their 
own  right  and  as  necessary  conditions  for  the  real¬ 
ization  of  other  development  aims. 

It  is  not  new  that  selfish  motives  are  critical  in 
bilateral  donor  decisions.  Moreover,  bilateral 
donors  do  indeed  behave  very  differently  among 
themselves.  Up  to  about  1990,  the  Cold  War  was 
used  as  a  powerful  justification  for  providing  aid 
to  developing  countries  to  stem  the  spread  of  com¬ 
munism.  Similarly,  aid  from  socialist  governments 
was  motivated  to  promote  socialist  political  and 
economic  systems.  Other  strategic  interests  play  a 
role  as  well.  The  United  States  has  over  the  years 


earmarked  substantial  amounts  of  aid  to  Egypt 
and  Israel;  being  a  former  colony  is  an  important 
determinant  in  getting  access  to  French  aid;  and 
voting  behavior  in  the  UN  can  affect  aid  allocation 
both  bilaterally  and  through  the  multilateral  sys¬ 
tem.  The  same  goes  for  how  bilateral  donors  are 
influenced  in  aid  allocations  by  their  own  strategic 
and  commercial  interest  versus  the  development 
motives  of  aid  recipients. 

It  is  widely  accepted  that  not  all  donors  behave 
the  same;  but  the  donor  community  has  as  a  whole 
failed  to  meet  the  established  international  target 
of  contributing  0.7%  of  their  national  income  as 
ODA.  This  is  so  in  spite  of  widespread  endorse¬ 
ment  of  the  recommendations  for  a  large  scaling 
up  in  the  context  of  the  Millennium  Development 
Goals.  Only  the  group  of  Scandinavian  countries 
and  the  Netherlands  (according  to  OECD/D  AC 
statistics)  have  consistently  met  the  0.7%  target 
since  the  mid-1970s,  while  the  United  States  con¬ 
tributed  around  0.2%  of  its  GNI  in  2009. 

Aid  Impact 

To  measure  the  effect  of  aid  properly,  the  analyst 
must  in  principle  be  able  to  compare  the  value  of  a 
chosen  indicator  (such  as  growth  or  poverty  reduc¬ 
tion)  in  two  strictly  independent  situations — with 
and  without  aid.  To  establish  the  “true”  measure 
of  aid  impact,  the  importance  of  all  other  circum¬ 
stances  that  have  affected  a  given  country  over 
time  needs  to  be  properly  accounted  for.  Comparing 
what  actually  happened  with  an  appropriate  coun- 
terfactual  is  the  fundamental  evaluation  challenge. 
Yet  there  is  in  social  science  no  way  of  addressing 
this  problem  (i.e.,  the  challenge  of  establishing  an 
appropriate  counterfactual)  in  a  broadly  accept¬ 
able  way  without  making  assumptions  that  are 
bound  to  be  debatable,  in  theory  and  in  practice. 

Accordingly,  the  past  decades  have  witnessed  a 
massive  outpouring  of  studies  on  the  effectiveness 
of  foreign  aid.  This  topic  has  been  a  central  and 
recurring  theme  with  which  many  development 
experts,  subscribing  to  the  different  paradigms  of 
development  thinking,  have  grappled,  and  meth¬ 
odologies  have  varied.  More  specifically,  (a)  the 
impact  of  aid  has  been  evaluated  at  the  micro-  and 
macroeconomic  level,  (b)  cross-country  compari¬ 
sons  as  well  as  single-country  case  studies  have 
been  relied  on,  and  (c)  aid  effectiveness  research 
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includes  broad  surveys  of  a  qualitative  and  inter¬ 
disciplinary  nature  as  well  as  more  hard-core 
quantitative  work. 

One  key  point  on  which  there  is  at  least  some 
agreement  in  the  literature  is  that  aid  has  in  many 
cases  been  highly  successful  at  the  microeconomic 
level.  The  most  rigorous  project  evaluations  in  this 
area  are  done  by  the  World  Bank,  and  reports 
from  the  Independent  Evaluation  Group  of  the 
World  Bank  are  generally  encouraging.  Average 
rates  of  return  are  generally  above  20%,  and 
decent  project  rates  of  return  have  been  reported 
regularly  over  the  years  in  one  survey  after  the 
other.  Overall,  a  mass  of  project-based  evidence 
has  been  collected,  and  few  dispute  that  aid  inter¬ 
ventions  have  worked  in  helping  improve  social 
outcomes  through  better  health  and  in  helping 
promote  and  develop  appropriate  technology  (i.e., 
the  green  revolution)  and  so  on.  Yet  it  remains  less 
clear  what  works  in  more  concrete  terms  and 
what  does  not  work,  and  doubts  about  aid’s  over¬ 
all  impact  on  growth  and  development  linger  on. 
The  question  is  regularly  raised  whether  all  this 
adds  up  at  the  macrolevel. 

It  is  easy  to  arrive  at  a  negative  association 
between  aid  and  growth  in  simple  aid-growth  cor¬ 
relation  analysis.  There  is,  however,  no  logical 
inconsistency  in  development  terms  between  little 
growth  and  aid  inflows  of  the  size  experienced  in  the 
past.  Donors  allocate  more  aid  to  poorer  countries, 
which  are  subject  to  difficulties  and  shocks  of  many 
kinds,  including  natural  and  man-made  calamities. 
When  countries  have  done  well  for  a  while  average 
income  has  gone  up,  donors  tend  to  transfer  less  aid, 
and  eventually  they  withdraw.  While  such  “gradua¬ 
tion”  may  take  a  while,  simple  correlations  are  on 
this  background  likely  to  show  a  negative  relation¬ 
ship;  but  they  do  not  reveal  the  “true”  impact  of  aid. 
Aid  allocation  matters,  as  do  the  major  changes  that 
have  taken  place  in  the  global  economy  and  affected 
the  environment  in  which  aid  is  implemented. 
Targets  for  aid  have  also  been  changing  from  one 
decade  to  the  next.  Thus,  simple  correlation  analysis 
or  storytelling  cannot — and  should  not — be  allowed 
to  settle  the  causality  debate  about  aid’s  potential 
impact  on  development. 

It  is,  however,  never  straightforward  to  general¬ 
ize  from  case  studies,  and  this  helps  explain  why 
macroeconomic  cross-country  (panel  data)  studies 
of  the  aid-growth  link  became  so  popular  from 


around  1995.  Such  an  approach  makes  it  possible 
in  principle  to  move  beyond  simplistic  aid-growth 
correlation  analysis,  where  the  analysis  of  causal 
effects  is  rather  primitive.  Much  of  the  modern 
empirical  aid  effectiveness  literature  has  focused 
on  whether  the  impact  of  aid  is  conditional  on 
policy  or  whether  aid  can  be  expected  to  have  a 
separate  and  positive  impact,  independent  of  pol¬ 
icy.  This  has  involved  a  mixture  of  concerns.  They 
range  from  technically  demanding  econometric 
modeling  issues  to  fundamentally  different 
approaches  to  the  design  and  implementation  of 
development  strategy  and  policy. 

Overall,  it  has  become  clear  that  coming  up 
with  the  “true”  aid-growth  relationship  is  far  from 
easy,  and  aid  is  in  any  case  of  much  too  limited  a 
size  to  turn  the  wheels  of  history.  Yet,  while  aid  is 
controversial,  few  reject  aid  as  a  potentially  useful 
instrument  in  the  fight  against  poverty.  Careful, 
nuanced,  and  subtle  assessments  are  advisable  with 
the  empirical  evidence  in  hand;  and  the  single  most 
common  result  in  the  modern  aid-growth  litera¬ 
ture  is  that  aid  seems  to  have  had  a  positive  impact 
on  per  capita  growth.  No  excessive  claims  about 
aid  impact  on  development  should  be  made  on  this 
basis,  and  the  empirical  evidence  remains  an  area 
of  dispute,  leaving  the  door  open  for  conflicting 
interpretations  and  policy  recommendations. 

Conclusion 

Foreign  aid  has  been  associated  with  development 
successes  and  failures,  and  the  fundamental  ana¬ 
lytical  problem  in  assessing  its  impact  is  that 
nobody  has  to  date  successfully  identified  the 
underlying  development  model.  Analysts  therefore 
continue  to  work  with  reduced-form  models, 
which  are  bound  to  be  debatable.  In  parallel,  exist¬ 
ing  data  suggest  that  foreign  aid  is  far  from  equally 
effective  everywhere.  The  necessary  and  sufficient 
conditions  for  aid  to  have  a  positive  contribution 
on  the  development  process  remain  elusive.  In 
other  words,  how  to  come  to  grips  better  with 
what  actually  drives  existing  differences  in  the 
impact  of  foreign  aid  is  a  challenge  in  theory  and 
practice.  This  is  so,  for  example,  in  relation  to 
potential  interaction  with  economic  policy,  but  the 
same  goes  for  deeper  structural  characteristics. 

The  lack  of  generalized  understanding  of  the 
complex  links  in  particular  country  circumstances 
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between  aid,  growth,  and  development  objectives, 
such  as  poverty  reduction,  means  that  selectivity  in 
aid  allocation,  based  on  simple  macroeconomic 
criteria,  is  hard  to  defend.  Few  would  argue  that 
old-fashioned  conditionality  should  be  brought 
back  to  rule  the  way;  but  a  better  understanding  of 
the  intricacies  of  the  donor-recipient  relationship 
in  theory  and  in  practice  would  be  valuable.  This 
would  as  key  elements  include  addressing  issues 
such  as  (a)  the  best  way  to  channel  resources  to  the 
poor,  when  national  governments  are  not  capable 
and/or  willing  to  take  on  this  task;  (b)  how  to 
ensure  that  aid  delivered  directly  to  national  gov¬ 
ernments  does  not  undermine  local  accountabil¬ 
ity;  and  (c)  establishing  the  appropriate  balance 
between  aid  going  to  the  government  vis-a-vis 
individuals  and  others  in  the  private  sector. 
Accordingly,  how  best  to  strengthen  incentives  in 
support  of  genuine  domestic  policy  leadership  is  a 
challenge.  The  same  goes  for  the  fundamental  task 
of  furthering  accountability  and  transparency  vis- 
a-vis  local  populations. 

Recent  years  have  seen  a  drive  to  scale  up  aid. 
In  this  context,  it  is  critically  important  to  avoid 
making  the  mistake  of  the  past  of  promising  too 
much — that  is,  of  contributing  to  the  misconcep¬ 
tion  that  aid  can  on  its  own  turn  history.  It  would, 
based  on  history,  appear  that  aid  has  much  to 
offer,  but  managing  expectations  is  far  from  easy. 
It  is  demanding  to  determine  how  best  to  make 
sure  that  promises  made  are  kept.  There  are  many 
unresolved  issues  here,  including  deciding  how 
best  to  design  incentives  in  aid  agencies  to  meet 
this  challenge  alongside  topics  such  as  the  role  of 
independent  evaluation,  of  coordination  among 
multiple  donors,  and  of  the  need  to  consider  polit¬ 
ical  economy  issues,  including  the  need  to  sharpen 
the  incentives  for  recipients  to  use  aid  effectively  in 
promoting  development. 
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Foreign  Policy  Analysis 


Foreign  Policy  Analysis  (upper  case,  abbreviated 
as  FPA)  is  a  subfield  of  the  academic  subject  of 
international  relations  (IR)  that  has  developed 
since  the  1950s,  broadly  in  parallel  with  IR  itself. 
Foreign  policy  analysis  (lower  case,  abbreviated  as 
fpa)  is  what  all  commentators  on  matters  of  for¬ 
eign  policy  do  as  a  matter  of  routine,  for  one 
country  or  many.  (This  distinction  between  FPA 
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and  fpa  will  be  used  here  for  clarity.)  Whether 
academics,  journalists,  or  think  tank  researchers, 
they  dissect  the  sources,  goals,  instruments,  and 
feasibility  of  foreign  policies  and  have  thus  gained 
over  the  years  the  sobriquet  of  foreign  policy  ana¬ 
lysts.  The  term  itself,  however,  is  relatively  mod¬ 
ern  in  origin  and  owes  much  to  the  emergence  of 
the  academic  field  of  FPA,  which  is  the  main  focus 
of  this  entry,  given  its  importance  for  IR  and  for 
political  science  more  widely. 

FPA  has  predominantly  become  established  in 
the  Anglo-Saxon  world  (i.e.,  England  and  her  for¬ 
mer  colonies),  including  the  United  States,  the 
United  Kingdom,  together  with  Scandinavia  and 
some  countries  of  the  Commonwealth,  such  as 
Australia,  Canada,  India,  Nigeria,  and  South 
Africa.  Foreign  policy  has  been  long  studied  else¬ 
where  but  more  in  the  fpa  tradition  and  influenced 
more  by  history  and  international  law  than  by 
political  science.  Even  within  the  relatively  narrow 
world  of  FPA,  there  are  important  differences  of 
approach.  The  U.S.  scholarly  community  has  pro¬ 
duced  most  of  the  important  work  in  all  areas  of 
the  subject,  but  it  has  also  historically  placed  much 
more  emphasis  on  large-scale  data  sets  associated 
with  the  specialization  known  as  comparative  for¬ 
eign  policy  (CFP),  which  has  not  attracted  much 
interest  (or  the  necessary  resources)  outside  the 
United  States.  Elsewhere  the  focus  has  been  more 
on  case  studies,  on  the  microaspects  of  decision 
making,  and  on  the  ideas  of  foreign  policy — 
particularly  in  the  past  decade,  with  the  rise  of 
constructivism  and  its  concern  with  identity  poli¬ 
tics.  But  wherever  the  subject  is  studied,  it  is  under¬ 
pinned  by  the  preoccupation  with  the  relationship 
between  process  and  outcome.  Even  those  primar¬ 
ily  concerned  to  generate  events  data  have  been 
motivated  by  the  wish  to  discover  correlations 
between  certain  kinds  of  foreign  policy  events  and 
key  variables,  such  as  domestic  upheaval  or  degrees 
of  pluralism. 

FPA  can  operate  with  a  zoom  or  a  wide-angle 
lens,  on  individual  decisions,  and  on  a  class  of 
actions — behavior  in  crises,  for  instance.  Yet  even 
close-textured  analysis  is  set  up  in  such  a  way  as  to 
enable  comparisons  to  take  place.  In  this,  it  is  simi¬ 
lar  to  the  study  of  comparative  politics,  which  can 
be  regarded  as  its  equivalent,  or  parallel,  within 
mainstream  political  science.  Yoked  together,  the 
two  subjects  constitute  a  solid  bridge  between 


political  science  and  IR.  Each  also  draws  on  history 
and  area  studies,  as  one  cannot  understand  political 
culture,  whether  in  the  singular  or  the  plural,  with¬ 
out  a  grounding  in  its  geographical  and  historical 
particularities.  Theory  is  central  to  FPA,  but  it  is  not, 
by  itself,  a  powerful  tool  in  this  particular  context. 
FPA  also  looks  to  the  study  of  international  law  and 
organization  (more  properly  to  the  sociology  of  law 
and  organization)  for  insights  into  the  constraints 
under  which  foreign  policymakers  operate.  In  fact 
the  literature  of  IR  as  a  whole  is  useful  in  this 
respect.  Even  Kenneth  Waltz’s  neorealism,  which 
explicitly  disavows  having  anything  to  say  about 
foreign  policy  (it  being  a  different  level  of  analysis 
from  his  own),  provides  a  clear  picture  of  the  nature 
of  the  international  system,  with  implied  guidance 
for  the  actors  within  it. 

It  might  be  thought  that  the  comparison  between 
FPA  and  comparative  politics  is  misplaced  because 
the  former  focuses  on  policy  and  the  latter  on  the 
overall  nature  of  a  political  system.  But  this  is  only 
superficially  true.  FPA  has  evolved  beyond  both 
policy  and  decision  making  to  include  all  the 
domestic  sources  of  foreign  policy — and  to  some 
extent  also  the  foreign  sources  of  domestic  policy. 
It  thus  provides  scholars  with  a  way  of  comparing 
how  different  polities  behave  outside  their  own 
borders  and  of  analyzing  the  various  ways  in 
which  domestic  society  becomes  involved  with  IR. 
At  one  level,  it  is  certainly  the  study  of  the  external 
dimension  of  public  policy  and  therefore  directly 
comparable  with  that  particular  branch  of  political 
science.  But  it  goes  beyond  that  to  engage  with  the 
nature  of  the  political  process  as  revealed  through 
arguments  about  foreign  policy. 

If  one  looks  for  the  intellectual  origins  of  FPA, 
one  needs  to  go  back  further  than  its  actual  begin¬ 
nings,  with  the  work  of  Richard  Snyder  and  his 
colleagues  at  Princeton  in  the  mid-1950s.  Their 
work  was  rooted  in  the  burgeoning  belief  in  the 
value  of  social  science,  following  in  the  powerful 
tracks  of  natural  science,  which  had  changed  the 
world  so  dramatically  over  the  previous  century. 
This  meant  both  that  foreign  policy  should  be  ana¬ 
lyzed  in  as  dispassionate  a  manner  as  possible, 
avoiding  the  unproductive  polemics  that  had 
marked  the  discussion  of  Munich,  or  McCarthyism, 
so  as  to  probe  beneath  the  surface  of  what  is  now 
called  the  “spin”  from  all  sides,  to  reveal  “what 
really  happened.”  On  this  basis,  governments  and 
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citizens  could  gain  a  better  grasp  of  what  was 
involved  in  improving  the  quality  of  what  was 
from  then  on  to  be  known  as  decision  making — 
that  is,  a  neutral  term,  divorced  from  the  particu¬ 
larities  of  time  or  place,  by  comparison  with  public 
administration,  which  studied  specific  institutions 
such  as  Congress  or  Whitehall  in  their  own  terms. 
Given  the  dangers  of  the  just  commenced  nuclear 
age  this  was  an  understandable,  indeed  noble,  aim. 

Beyond  this  general  trend  of  the  growth  of 
social  science,  and  its  application  to  decisions 
about  war  and  peace,  lay  also  work  in  economics, 
politics,  sociology,  and  philosophy  going  back  to 
Aristotle,  which  had  produced  in  various  ways 
attempts  to  think  systematically  about  how  those 
in  positions  of  political  responsibility  should — and 
did — calculate  their  best  strategy  under  conditions 
of  conflict,  scarcity,  and  danger.  Political  theorists 
(as  they  are  now  called)  had  focused  largely  on 
how  to  promote  the  good  life  (or  for  whom)  and 
how  to  manage  power,  often  including  speculation 
about  relations  with  foreigners.  Economists  from 
the  18th  century  on  had  constructed  theories  of  the 
efficient  use  of  resources,  while  sociologists  had 
begun  to  identify  elites  whose  work  extended 
beyond  that  of  formal  officeholders.  The  interest 
in  psychology  sparked  by  Sigmund  Freud  (together 
with  the  evident  psychopathology  of  Adolf  Hitler 
and  his  associates)  was  starting  to  turn  a  search¬ 
light  onto  the  behavior  and  motivations  of  all 
those  in  positions  of  power.  Key  individuals  such 
as  Max  Weber  were  capable  of  making  connec¬ 
tions  between  these  strands  and  thus  laying  down 
the  interdisciplinary  foundations  necessary  for  the 
study  of  political  decision  making.  It  was  therefore 
virtually  inevitable  that  at  some  point  the  long 
tradition  of  concern  over  war  and  its  consequences 
would  fuse  with  wider  developments  in  social  sci¬ 
ence  to  produce  a  systematic  approach  to  the  study 
of  foreign  policy — in  short,  to  produce  Foreign 
Policy  Analysis ,  even  if  the  term  itself  took  some 
time  to  become  routine.  In  the  following,  the  theo¬ 
retical  concerns,  subfields,  and  contemporary  sig¬ 
nificance  of  FPA  are  discussed. 

Theoretical  Concerns 

As  FPA  has  evolved,  it  has  acquired  certain  defin¬ 
ing  characteristics,  or  key  assumptions.  These  may 
not  be  shared  by  everyone  working  in  the  field  to 


the  same  extent,  but  they  are  certainly  points  of 
reference  that  all  must  take  into  account.  This  sec¬ 
tion  identifies  five  such  assumptions  and  adds 
three  more  that  may  be  in  the  process  of  attracting 
consensus  as  the  field  progresses  and  adapts  to  its 
changing  intellectual  and  real-world  environments. 
Between  them,  these  eight  assumptions  delineate 
the  central  content  of  FPA  while  also  helping  mark 
its  external  boundaries. 

The  first,  and  most  fundamental,  assumption 
has  already  been  referred  to — namely,  that  process 
affects  outcomes.  This  is  why  decision  making  is 
such  an  important  part  of  FPA,  even  if  it  should 
never  be  confused  with  the  field  as  a  whole.  The 
making  of  decisions  and  policies  is  studied  because 
it  is  taken  for  granted  that  the  way  in  which  a 
problem  is  handled,  and  by  whom,  will  have  a 
considerable  effect  on  the  quality  of  the  subse¬ 
quent  decisions.  Perhaps  surprisingly,  before 
World  War  II,  this  point,  which  now  seems  a  tru¬ 
ism,  was  not  explicitly  formulated  in  such  a  way, 
even  if  much  attention  was  given  to  the  personality 
and  foibles  of  individual  leaders.  But  the  structures 
in  which  they  worked,  to  say  nothing  of  their  gov¬ 
ernmental  cultures,  was  not  given  formal  atten¬ 
tion.  FPA,  however,  soon  made  it  clear  (using 
among  other  tools,  systems  theory)  that  not  only 
did  decision  making  have  certain  common  features 
across  national  contexts  but  also  that  feedback 
loops  existed  between  policy  outputs,  reactions 
from  outsiders,  and  a  policy  system.  Thus,  policy 
processes  shape  the  outcomes  in  State  A,  but  out¬ 
comes  are  also  determined  by  interaction  with 
States  B,  C,  and  so  on,  which  in  turn  may  modify 
the  original  structures  for  taking  decisions  in  State 
A — and  so  on.  One  straightforward  example  is  the 
setting  up  of  the  hotline  between  Washington  and 
Moscow  after  the  perilous  missile  crisis  of  October 
1962 — an  innovation  designed  to  improve  the  cri¬ 
sis  management  machinery  in  both  capitals. 

A  second  central  characteristic  of  FPA  is  the 
acceptance  that  foreign  policy  as  an  area  is  incapa¬ 
ble  of  generating  an  overall  theory  ofIR  in  the  way 
that  work  pitched  at  the  level  of  the  international 
system  is.  The  latter  may  be  deemed  (especially  by 
Foreign  Policy  Analysts)  as  parsimonious  to  the 
point  of  abstraction,  telling  us  very  little  about  the 
actual  dilemmas  of  states  and  citizens  beyond  what 
can  be  deduced  a  priori  from  the  overarching  the¬ 
ory  (as  with  neorealism),  but  at  least  it  has  a  set  of 
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propositions  about  how  the  world  as  a  whole 
functions.  A  focus  on  foreign  policy  cannot  do 
this.  Contrary  to  the  view  commonly  expressed  by 
Neorealists,  foreign  policy  theories  are  not  inher¬ 
ently  reductionist,  because  they  rarely  claim  to 
generate  explanations  of  the  international  system. 
But  they  do  focus  on  agency,  that  is,  on  how 
agents,  or  rather  actors  (the  former  being  a  more 
ambiguous  term)  cope  with  the  various  domestic 
and  international  structures  in  which  they  are 
embedded.  This  coping  will  be  of  a  variable 
nature,  according  to  the  actor  in  question,  even  if 
it  takes  place  within  certain  limits,  dictated  by 
what  Edmund  Burke  called  “the  empire  of  circum¬ 
stance.”  We  may  thus  draw  certain  conclusions 
about  what  kind  of  behavior,  whether  by  states  or 
by  other  kinds  of  actor,  meets  with  greater  or 
lesser  success,  and  therefore  about  what  kind  of 
world  we  all  live  in,  but  this  is  not  the  main  pur¬ 
pose  of  FPA,  and  it  is  difficult  to  be  rigorous 
about  it  unless  we  shift  level  to  that  of  the  system 
overall. 

A  related  assumption  to  that  just  described  is 
the  consensus  that  has  emerged  over  the  past 
2  decades  in  FPA  that,  even  at  the  level  of  com¬ 
parative  foreign  policy,  it  will  not  be  possible  to 
construct  a  convincing  overarching  theory — in 
other  words,  a  single  theory  of  foreign  policy  is  the 
chimera.  The  reason  for  this  is  clear  enough. 
Finding  a  single  model  of  how  foreign  policy  is 
made  and  conducted  that  would  fit  the  192  mem¬ 
ber  states  of  the  United  Nations  (to  say  nothing  of 
nonstate  actors)  would  entail  such  a  high  level  of 
generality  as  to  be  banal  in  the  extreme.  When  one 
allows  for  the  importance  of  the  vast  range  of  dif¬ 
ferent  policy-making  arrangements,  degrees  of 
interest  group  involvement,  national  cultures,  and 
historical  traditions,  together  with  the  mosaic  of 
distinctive  geopolitical  locations,  resource  distribu¬ 
tion,  cultural  patrimonies,  and  degrees  of  develop¬ 
ment  of  the  states  concerned,  it  is  clear  that  an 
umbrella  theory  of  foreign  policy  must  be  reduced 
to  the  same  kind  of  lofty  generalization  as  that 
imposed  by  realism  or  Marxism — indeed  the  latter 
can  be  justly  accused  of  the  reverse  sin  of  reduc- 
tionism,  an  error  that  might  be  labeled  “holism” 
because  of  its  flattening  out  of  the  variety  of  world 
politics  in  the  interests  of  parsimony.  That  said, 
the  flowcharts  and  taxonomies  of  the  decision¬ 
making  process  that  have  been  produced  by  such 


figures  as  Richard  Snyder  and  Michael  Brecher  are 
extremely  useful  in  revealing  the  wiring  diagrams 
of  what  occurs  in  most  foreign  policy  systems,  to  a 
greater  or  lesser  extent — but  they  are  not  theories, 
that  is,  sets  of  propositions  capable  of  explaining 
why  certain  patterns  of  behavior  emerge  and/or  on 
giving  guidance  on  how  we  should  behave  in  any 
given  situation. 

The  third  point  of  reference  for  most  FPA  schol¬ 
ars  follows  from  the  skepticism  over  grand  theory, 
namely  the  emphasis  on  analysis.  But  what  is 
meant  by  this  common  term?  It  is  less  than  theory 
and  more  than  description,  to  be  sure.  It  refers  to 
breaking  down  the  process  of  foreign  policy  mak¬ 
ing  and  execution  into  its  component  parts,  to 
understand  better  what  lies  beneath  the  surface  of 
events.  To  do  this,  it  uses  concepts  and  theories  of 
a  particular  kind,  as  will  be  explained  below,  in  a 
process  of  deconstruction  and  reconstruction.  In 
part,  this  involves  locating  foreign  policy  in  its 
own  specific  environments,  identifying  the  con¬ 
straints  under  which  it  thus  operates,  and  also  the 
interactions  between  action  and  environments  that 
result.  “Analysis”  here  thus  has  its  routine,  scien¬ 
tific  associations,  but  it  also  has  some  of  the  con¬ 
notations  of  psychoanalysis,  in  the  sense  of  the 
wish  to  probe  beneath  the  surface  of  events  to 
uncover  a  semiconcealed,  barely  understood  set  of 
forces  that  shape  behavior  just  as  much  as  the  for¬ 
mal  roles  of  the  actors  or  the  reasons  they  give  for 
their  actions  do.  Most  foreign  policy  analysts, 
whatever  their  own  designation,  present  them¬ 
selves  as  having  this  capacity  to  see  beyond  the 
public,  or  perhaps  more  exactly  the  public  rela¬ 
tions,  dimension  of  international  affairs. 

The  fourth  underpinning  assumption  of  FPA  is 
that  it  generates  middle-range  theories.  Middle- 
range  theories  are  those  that  attempt  to  explain 
certain  delimited  and  specified  aspects  of  human 
behavior.  They  are  self-evidently  neither  grand 
theory  nor  microtheory — the  latter  tending  to 
attach  to  specific  phenomena  (or  “phenotypes”), 
as  with,  for  example,  a  theory  that  tried  to  explain 
Russian  foreign  policy  in  terms  of  the  fear  of 
encirclement.  By  contrast,  middle-range  theory 
addresses  classes  of  phenomena  (“genotypes”)  but 
within  limits,  so  as  to  not  overload  the  activity 
with  too  many  variables.  FPA  has  been  notably 
successful  at  generating  this  kind  of  theory,  because 
by  focusing  on,  say,  the  operation  of  bureaucracy 
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it  inherently  limits  the  scope  of  explanation  to  one 
set  of  players  (bureaucrats)  and  to  one  aspect  of 
politics  (foreign  policy).  This  allows  for  manage¬ 
ability  without  restricting  intellectual  ambition. 
While  reality  is  complex,  our  understanding  of  it 
has  to  simplify,  by  theorizing  the  critical  factors 
but  not  at  such  a  high  level  of  generality  as  to 
become  meaningless — hence  middle-range  theory. 
There  is  no  doubt,  however,  that  this  still  leaves 
open  the  big  issue  of  how  to  aggregate  the  various 
insights  from  diverse  middle-range  theories,  just  as 
we  have  to  find  some  way  of  putting  fruitfully 
together  agent-based  theories  derived  from  FPA 
with  structural  theories  pitched  at  the  level  of  the 
international  system.  There  is  no  easy  answer  to 
this  dilemma.  The  most  that  can  be  said  is  that  it 
is  better  to  make  sense  of  certain  dimensions  of 
reality  and  then  to  be  faced  with  the  problem  of 
commensurability  than  to  take  refuge  in  excessive 
detail  on  the  one  hand  or  empirically  thin  general¬ 
ization  on  the  other. 

The  last  of  the  conceptual  anchor  points  for 
FPA  has  been  implicit  in  much  of  what  has  gone 
before — namely,  that  the  subject’s  principal  focus 
is  on  agency,  that  is,  the  business  of  deciding  and 
acting  in  the  world.  In  one  sense  this  seems  a 
straightforward  job  description:  Focus  on  what 
states  and  other  actors  do,  rather  than  on  the 
structures  within  which  they  operate.  But  the  sub¬ 
ject’s  own  evolution,  together  with  lessons  learned 
from  the  philosophy  of  social  science,  has  brought 
us  to  understand  three  important  caveats.  First, 
actors  and  structures  are  engaged  in  a  continual 
process  of  mutual  interaction  and  mutual  consti¬ 
tution  and  therefore  cannot  be  regarded  as  fixed 
points;  second,  some  actors  have  rather  more 
capacity  to  have  an  impact  on  structures  than  oth¬ 
ers  do,  just  as  not  all  structures  are  relevant  to  all 
actors  in  the  same  way;  third,  it  is,  indeed,  not 
always  possible  to  make  clear  distinctions  between 
actors  and  structures.  In  part,  this  is  because  the 
latter  are  multiple  and  nest  inside  each  other  like 
Chinese  boxes — for  example,  the  French  state 
exists  within  the  structure  of  the  European  Union 
(EU)  but  is  itself  the  main  structure  that  French 
political  parties  relate  to,  while  they  in  turn  repre¬ 
sent  key  structures  for  individual  politicians  and 
interest  groups.  But  it  is  also  because,  while  most 
actors  are  readily  identifiable  empirically,  struc¬ 
tures  are  often  conceptual  entities  whose  existence 


cannot  be  falsified,  such  as  “the  international 
community”  or  “globalization.”  Accordingly,  it  is 
more  common  for  structures  to  be  identified  in 
terms  of  actors — whether  dominant  groups  such 
as  the  United  Nations  Security  Council,  or  “the 
West,”  in  the  case  of  the  international  community; 
or  the  aggregate  of  states,  companies,  and  indi¬ 
viduals,  in  the  case  of  globalization.  Waltz  made  a 
strenuous  effort  to  avoid  this  trap  in  his  neorealist 
theory  by  stressing  that  it  was  the  pattern  of  the 
balance  of  power  that  determined  the  structure  of 
international  anarchy,  even  if  this  is  all  too  easily 
reducible  to  the  activities  of  major  players. 

The  result  of  this  problem  so  far  as  FPA  is  con¬ 
cerned  is  that  it  is  all  too  easy  to  get  drawn  into 
wider  issues  than  the  problem  of  action  and  espe¬ 
cially  to  neglect  the  viewpoint  of  the  actor — when 
it  is  indispensable  from  the  viewpoint  of  good 
scholarship  to  be  able  to  reconstruct  the  perspec¬ 
tive  of  decision  makers  themselves,  rather  than  to 
impose  on  them  assumptions  drawn  from  a  univer¬ 
sal  model.  The  other  side  of  the  coin  is  that  foreign 
policy  analysts  may,  in  the  very  attempt  to  distin¬ 
guish  actors  from  structures,  unnecessarily  reify 
the  state  (or  even  actors  like  A1  Qaeda),  when  they 
are  more  intelligently  understood  as  a  series  of 
interconnected  points  of  agency,  overlapping  with 
other  apparent  actors.  This  is  clearly  the  case 
within  the  EU,  where  national  foreign  policy  occu¬ 
pies  a  still  important  but  nonetheless  ambiguous 
position  in  relation  to  the  Union’s  own  collective 
activity.  Ultimately,  individual  human  beings  make 
foreign  policy  decisions  but  always  within  organi¬ 
zational  structures,  some  of  which  will  have  for¬ 
mal  responsibilities  and  are  accountable  to  various 
types  of  constituency.  The  nature  of  the  agency 
that  is  the  key  interest  of  FPA  therefore  wobbles 
between  and  across  the  levels  of  individual  and 
collective  action,  while  the  latter  is  itself  divided 
between  different  political  and  bureaucratic  for¬ 
mations  and  sometimes  between  a  state  and  the 
alliances/organizations  to  which  it  belongs. 

Beyond  these  five  pillars  of  FPA  research  are 
three  further  assumptions  that,  while  more  con¬ 
testable,  are  increasingly  widely  shared.  The  first 
of  these  is  that  foreign  policy  is  not  only  the  busi¬ 
ness  of  states.  While  it  may  seem  something  of  a 
category  mistake  to  talk  of  “foreign”  policy  in 
relation  to  actors  that  do  not  have  sovereignty,  or 
even  a  clear  territorial  expression,  there  can  be 
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little  doubt  that,  among  those  described  generi- 
cally  as  nonstate  actors  (the  use  of  the  negative 
tells  us  of  the  difficulty  in  identifying  any  other 
common  characteristics),  many  have  political  strat¬ 
egies  for  influencing  IR.  It  is  not  unreasonable  to 
conclude  that  they  too  possess  international  strate¬ 
gies  that  may  be  identified,  with  some  qualifica¬ 
tion,  as  “private  foreign  policies.”  Certainly  any 
organization,  from  the  smallest  firm  to  the  United 
Nations,  makes  a  distinction  between  its  internal 
affairs  and  its  external  environment  and  has  to 
formulate  strategies  for  dealing  with  the  latter.  The 
British  Academy,  for  example,  which  exists  to  pro¬ 
mote  scholarship,  uses  the  term  Foreign  Secretary 
to  describe  the  Fellow  appointed  to  liaise  with 
similar  bodies  in  other  countries. 

The  category  of  nonstate  actors  contains  bodies 
as  diverse  as  these  civil  society  groupings  at  one 
end  of  the  spectrum  and  A1  Qaeda  at  the  other.  In 
between  come  transnational  churches,  corpora¬ 
tions,  sports  organizations,  and  charities,  many  of 
which  are  taking  on  more  of  a  political  role,  in 
terms  of  both  the  protection  of  their  own  interests 
internationally  and  the  wish  to  influence  the 
nature  of  what  they  perceive  as  a  single,  global¬ 
ized,  environment.  Given  that  FPA  is  the  main  site 
of  the  concern  with  agency  within  IR,  it  must  pay 
attention  to  the  significance  of  these  actors,  their 
aims,  instruments,  and  effectiveness.  This  is 
because  of  their  inherent  importance  and  also 
because  of  their  interaction  with  states,  shaping 
the  latter’s  choices.  Just  as  transnational  relations 
has  added  another  dimension  to  the  traditional 
interstate  focus  of  IR,  so  transnational  actors  have 
expanded  the  scope  of  FPA. 

The  proliferation  of  transnational  actors  has 
helped  reinforce  a  further  common  preoccupation 
of  FPA  in  recent  years,  that  over  the  increasing  dif¬ 
ficulty  of  defining  foreign  policy.  Apart  from  the 
fact  that  there  is  an  inherent  problem  in  deciding 
whether  an  aspect  of  public  policy  is  by  and  large 
inwardly  or  outwardly  directed,  the  development 
of  the  world  economy  since  1945,  with  the  freer 
movement  of  goods,  people,  and  capital,  has 
meant  that  foreign  and  domestic  policies  increas¬ 
ingly  merge  into  each  other.  Indeed,  most  minis¬ 
tries  in  developed  states  now  work  with  their 
equivalents  in  other  states,  while  classical  foreign 
policy  has  to  be  concerned  not  simply  with  nego¬ 
tiation  and  the  calibration  of  power  but  also  (and 


quite  legitimately)  with  the  domestic  politics  of 
other  states. 

All  this  calls  into  question  the  boundaries  of 
FPA  and  possibly  its  whole  raison  d’etre.  If  the 
explanandum  is  only  vaguely  discernible,  then 
how  can  a  systematic  analysis  be  possible,  let  alone 
an  explanation?  The  answer  to  this  is  twofold:  On 
the  one  hand,  practitioners  and  citizens  continue 
to  use  the  term  foreign  policy  as  central  to  their 
political  vocabulary.  To  put  it  crudely,  if  enough 
people  think  foreign  policy  exists,  then  it  does  exist 
or,  at  least,  it  is  a  sufficiently  important  idea  to  be 
worth  serious  investigation.  On  the  other,  what¬ 
ever  the  blurring  that  has  occurred  between  inside 
and  outside,  the  two  dimensions  of  actors’  activity 
cannot  be  reduced  to  the  same  thing.  Rather,  and 
this  proposition  is  likely  to  attract  general  support 
within  the  field,  policy  now  exists  on  a  single  con¬ 
tinuum,  with  matters  that  are  almost  entirely 
domestic  (such  as  local  government)  at  one  end 
and  those  that  are  almost  entirely  foreign,  such  as 
a  civil  war  in  a  third  country  where  the  given  state 
has  no  particular  interests,  at  the  other.  The  task 
of  FPA  is  then  to  interrogate  areas  of  policy  to  see 
whether  there  are  still  important  differences 
according  to  the  extent  of  their  external  ramifica¬ 
tions  and  also  to  show  how  international  factors 
play  into  domestic  politics  (the  “second  image 
reversed”).  In  fact,  the  very  enquiry  into  the  nature 
and  boundaries  of  foreign  policy  is  an  important 
scholarly  task  that  many  inside  FPA  have  been 
engaged  in. 

The  last  of  the  more  recent  common  reference 
points  that  underpin  FPA  derives  from  constructiv¬ 
ism,  and  more  from  “thick”  than  from  “thin” 
constructivism  at  that.  This  is  the  proposition  that 
identity  is  central  to  our  understanding  of  foreign 
policy,  and  (often)  vice  versa.  Identity  is  an  impre¬ 
cise  notion,  even  by  comparison  with  those  essen¬ 
tially  contested  concepts  (e.g.,  national  interest) 
that  political  scientists  work  with  regularly.  But  its 
use  does  extend  FPA’s  standard  concern  with  the 
domestic  sources  of  foreign  policy  to  the  area  of 
culture,  including  nationalism,  tradition,  memory, 
and  self-understanding.  At  first  sight  this  simply 
adds  another  set  of  variables  to  the  equation:  now, 
if  one  wants  a  full  understanding  of  a  country’s 
foreign  relations,  one  has  to  take  into  account  its 
sense  of  itself  and  its  culture.  Outsiders’  percep¬ 
tions  of  that  identity  will  also  be  important.  That 
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this  is  hardly  a  new  insight,  indeed  one  that  would 
have  been  familiar  to  Alexis  de  Tocqueville,  mat¬ 
ters  little.  The  boost  that  constructivism  has  given 
to  the  study  of  culture  and  language  has  certainly 
affected  FPA,  both  giving  it  a  new  lease  of  life  and 
posing  some  new  challenges  to  its  orthodoxies. 

The  challenges  seem  largely  to  be  at  the  level  of 
methodology,  in  stressing  discourse  analysis,  inter¬ 
views,  and  texts  more  than  foreign  policy  “events” 
or  institutional  data.  More  significantly,  however, 
the  “cultural  turn”  represents  an  epistemological 
challenge  in  the  sense  of  the  renewed  criticism  of 
positivism  that  it  implies — not  all  FPA  has  been 
positivist  in  orientation,  but  it  was  undoubtedly  a 
product  of  the  “scientific”  wave  of  IR  work  in  the 
postwar  period.  It  suggests  that  foreign  policy  can 
both  arise  from  a  constructed  national  identity  and 
be  constitutive  of  that  identity.  The  circular  ten¬ 
dency  of  this  reasoning,  not  to  mention  its  essen- 
tialism,  is  naturally  not  to  the  taste  of  everyone 
working  in  FPA.  But  a  certain  consensus  is  merg¬ 
ing,  as  in  IR  more  generally,  to  the  effect  not  only 
that  issues  of  identity  and  culture  cannot  be 
ignored  but  also  that  they  may  also  be  central 
to  our  understanding  of  much  foreign  policy 
behavior — not  least  in  an  era  when  religious  strife 
has  reemerged,  and  public  diplomacy  has  become 
a  more  prominent  instrument  of  state  policy.  Most 
would  certainly  concede  that  foreign  policies  such 
as  that  of  Wilhelmine  Germany,  or  Brezhnev’s 
USSR,  were  profoundly  important  in  determining 
not  only  the  external  image  of  those  countries  but 
their  very  conformation  at  the  time. 

The  Subfields  of  Foreign  Policy  Analysis 

Over  its  50-year  history,  FPA  has  produced  a  num¬ 
ber  of  well-researched  subfields,  sometimes  draw¬ 
ing  on  work  already  done  elsewhere  in  IR  and 
political  science  but  also  often  deriving  from 
advances  in  the  other  social  sciences.  Taken 
together,  they  represent  a  massive  advance  on  the 
understanding  of  foreign  policy  making  that 
existed  in  the  days  of  Edward  Carr  and  Hans 
Morgenthau — which  is  not  to  say  that  the  keen 
insights  of  figures  such  as  these  were  misplaced; 
they  were  simply  not  cumulative,  or  part  of  a  sys¬ 
tematic  schema.  They  also  depended  on  the  accep¬ 
tance  of  a  few  top-heavy  assumptions.  Now,  for 
anyone  who  cares  to  look,  there  is  a  vast  array  of 


analytic  and  comparative  work  on  the  machinery 
of  foreign  policy,  on  its  conceptual  aspects,  on  its 
implementation,  and  on  its  relationship  to  differ¬ 
ent  kinds  of  polity.  What  is  more,  most  of  this 
work  is  accessible  to  intelligent  general  readers  and 
is  certainly  capable  of  being  translated  into  a  form 
that  enables  it  to  enter  political  and  popular 
debate.  What  follows  outlines  five  important  clus¬ 
ters  of  activity  in  the  research  agenda  of  FPA.  No 
pretence  at  comprehensive  coverage  can  be  made, 
given  the  limited  space  available.  But  there  cannot 
be  much  work  of  major  interest  that  would  not  fit 
under  one  of  the  subheads  that  follow. 

Leadership  and  Decision  Making 

FPA  tends  to  start  at  the  center  and  work  out¬ 
ward.  The  center  consists  of  the  high  officeholders, 
formally  responsible  for  policy,  and  their  intimate 
counselors.  If  we  were  living  in  the  time  of  Louis 
XIV  the  focus  would  be  on  the  king  and  his  court. 
Now  it  is  on  the  head  of  government,  often  but  not 
always  elected,  together  with  his  or  her  political 
advisers,  the  security  council  or  its  equivalent,  and 
the  heads  of  the  key  relevant  departments,  which 
means  (at  a  minimum)  the  foreign  and  defense 
ministries,  and  the  intelligence  service.  But  what 
are  the  questions  that  FPA  asks  about  this  inner 
executive?  The  most  straightforward  is  that  famil¬ 
iar  to  political  analysts  over  3  millennia:  What  is 
the  impact  of  an  individual  leader,  extended  in  the 
post-Freudian  age  to  include  the  impact  of  the 
leader’s  personality  and  psychological  quirks,  on 
foreign  policy?  An  important  line  of  thought  that 
has  emerged  in  FPA  is  the  difficulty  of  considering 
such  questions  outside  a  relational  context — that 
is,  not  only  how  does  the  leader  behave  and  why, 
but  how  are  his  or  her  actions  received  by  others? 
In  short,  this  entails  the  analysis  of  leadership  as 
such,  which  is  central  to  political  science  as  a 
whole,  but  has  certain  specific  dimensions  in  IR, 
given  that  a  good  part  of  foreign  policy  making  is 
conducted  away  from  public  scrutiny,  while  at  the 
same  time  much  of  a  leader’s  constituency  actually 
consists  of  foreigners,  whether  other  leaders  or 
their  publics. 

Perhaps  the  most  well-known  product  of  FPA 
research  relates  to  the  psychology  of  small  groups 
at  the  top  level  of  the  policy  process — namely 
groupthink.  This  theory,  which  was  validated  by 
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the  various  enquiries  in  the  United  States  and 
Britain  on  the  decision  to  invade  Iraq  in  2003, 
looks  at  the  dynamics  of  leadership  within  the  for¬ 
eign  policy  executive,  with  particular  reference  to 
its  psychological  dimension  and  impact  on  the 
quality  of  decisions.  It  is  closely  related  to  our  by 
now  extensive  understanding  of  how  political  per¬ 
ception  works,  with  distortions  caused  by  both 
emotional  factors  and  formal  roles  within  a  group. 
FPA  has  learned  much  from  its  sister  subject  of 
political  psychology  but  has  also  made  a  contribu¬ 
tion  of  its  own,  notably  in  relation  to  groupthink, 
images  of  the  foreign,  and  the  concept  of  misper¬ 
ception,  the  consequences  of  which  can  be  particu¬ 
larly  dramatic  in  IR. 

One  interesting  new  strand  of  work  that  is 
emerging  from  this  well-grounded  tradition  is  that 
relating  to  neuroscience.  It  stresses  the  role  of  emo¬ 
tion  and  of  preconscious  factors  in  determining 
choice.  Rationality  is  thus  bounded  not  only  by  the 
limited  ability  of  the  human  mind  to  process  infor¬ 
mation  or  to  trade  off  a  large  number  of  variables 
but  also  by  emotional  predisposition — some  have 
said  that  we  are  hardwired,  for  example,  to  defend 
our  existing  beliefs.  On  the  other  hand,  whatever 
the  physiological  and  neurological  sources  of  emo¬ 
tional  preferences,  it  has  been  known  from  the 
early  days  of  FPA  that  personal  as  well  as  political 
forms  of  bias  affect  decision  making — indeed,  that 
the  two  sources  are  difficult  to  distinguish.  What 
the  recent  wave  of  interest  in  neuroscience  has 
brought  to  the  party  is  the  insight  that  reason  itself 
is  intimately  connected  to  emotion  and  that  the 
two  phenomena  are  not  separated  in  the  brain. 
The  problem  remains  of  how  such  distortions 
work  at  the  collective  level:  Are  they  cumulative  as 
in  groupthink  or  reined  in  through  the  checks  and 
balances  of  argument  and  interinstitutional  con¬ 
flict?  This  issue  is  at  the  heart  of  the  subject  as  a 
whole  and  opens  the  possibility  of  advising  practi¬ 
tioners  on  how  best  to  avoid  the  wrong  implica¬ 
tions  that  may  arise  as  a  result  of  errors  of  such 
distortions. 

It  also  connects  up  with  the  other  main  avenue 
of  research  on  top-level  decision  making,  that  of 
bureaucratic  politics.  This  came  originally  from 
practical  insights  under  the  Kennedy  presidency, 
spawning  a  large  and  fruitful  academic  literature 
that  has  fed  back  into  policy  making  in  the  form  of 
direct  advice  but  also  as  warnings  to  the  citizenry 


that  policy  making  is  much  more  a  matter  of  turf 
wars  than  they  are  led  to  believe.  As  a  result,  the 
insights  of  bureaucratic  politics  are  now  routinely 
used  in  case  studies,  in  historical  writing,  and  in 
think  tank  policy  briefings.  They  do,  nonetheless, 
cut  both  ways:  On  the  one  hand,  they  alert  us  to 
the  way  in  which  most  outputs  are  the  result  of 
pulling  and  hauling,  leading  to  arbitrary  compro¬ 
mises,  even  in  nondemocratic  systems.  This  makes 
it  virtually  impossible  to  live  up  to  the  rational/ 
realist  ideal  type  of  the  consistent  pursuit  of  a 
clearly  identifiable  national  interest.  On  the  other 
hand,  they  point  up  the  advantages  of  collective 
debate,  even  on  the  crude  basis  of  competing  insti¬ 
tutional  interests.  They  can  counter  the  personal 
biases  of  the  leader  and  the  tendencies  to  group 
conformity,  which  have  been  noted  by  the  psycho¬ 
logical  approaches  referred  to  above.  This  is  both 
a  matter  of  fact  and  one  of  preference,  in  that 
while  some  would  wish  to  give  a  leader  a  free  hand 
when  dealing  with  foreign  threats,  others  are  con¬ 
vinced  by  the  arguments  for  pluralism  in  decision 
making,  in  foreign  policy  as  in  other  areas  of  pub¬ 
lic  policy. 

The  Domestic  Sources  of  Foreign  Policy 

One  sign  of  progress  in  a  social  science  is  when 
its  insights  change  from  seeming  marginal,  or  even 
outlandish,  to  being  an  accepted  way  of  looking  at 
a  problem.  This  has  been  the  case  with  FPA’s 
emphasis  on  the  domestic  sources  of  foreign  pol¬ 
icy.  Of  course  any  claim  has  to  be  subject  to  the 
caveat  of  post  hoc,  ergo  propter  hoc,  in  that  a 
change  in  general  attitudes  might  have  occurred 
anyway,  through  events,  rather  than  because  of 
any  academic  research.  But  FPA  has  certainly  edu¬ 
cated  generations  of  students,  against  the  still  pow¬ 
erful  realist  skepticism  over  domestic  factors, 
including  the  vagaries  of  public  opinion,  having 
the  capacity  to  complicate,  even  shape,  foreign 
policy.  To  a  degree  this  has  been  a  matter  of  life 
imitating  art,  as  the  more  that  an  understanding  of 
the  foreign  policy  process  has  been  disseminated, 
the  more  pressure  groups  have  sprung  up  to  insert 
themselves  into  the  interstices  of  the  process. 

There  have  long  been  both  hopes  and  fears 
about  the  way  in  which  the  public’s  opinion  could 
disrupt  or  hold  to  account  the  best  laid  plans  of 
their  more  expert  leaders.  On  the  pessimistic  side 
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were  Alexis  de  Tocqueville  and  Walter  Lippmann, 
while  the  optimists  are  liberals  such  as  Woodrow 
Wilson  or  Senator  Henry  “Scoop”  Jackson.  Work 
done  in  FPA,  however,  has  helped  disentangle  the 
analytical  from  the  normative  elements  in  this 
inherently  political  debate  while  distinguishing  the 
new  dimensions  brought  to  elite-mass  relations  by 
the  rise  of  the  mass  media,  information  technol¬ 
ogy,  and  the  24/7  news  cycle.  The  activities  of 
interest  groups  and  social  movements  in  relation  to 
foreign  policy  have  also  been  excavated,  including 
the  previously  neglected  gender  dimension.  Equally 
neglected  for  many  years  was  the  measurement  of 
public  attitudes  on  foreign  policy  questions,  espe¬ 
cially  outside  the  United  States.  The  creation  of 
scientific  polling  by  the  Gallup  Organization  in 
1937  led  for  the  most  part  to  the  generation  of 
data  on  elections,  economics,  and  domestic  politi¬ 
cal  attitudes.  When  views  were  sought  on  interna¬ 
tional  politics  they  were  usually  at  a  high  level  of 
generality  or  in  the  midst  of  major  crises.  But  in 
recent  years,  it  has  become  accepted  both  that  the 
public  can  have  a  serious  interest  in  foreign  policy 
(with  quite  consistent  views)  and  that  the  blurring 
of  the  line  between  domestic  and  foreign  policy  has 
ended  the  presumption  that  politics  (and  therefore 
public  participation)  stops  at  the  national  frontier. 

Although  Marxists  and  other  political  econo¬ 
mists,  together  with  some  liberals,  have  always 
stressed  the  connection  between  domestic  and 
international  politics,  within  academic  IR  it  has 
been  primarily  FPA  that  has  insisted  on  its  impor¬ 
tance,  not  least  in  the  face  of  the  powerful  neoreal- 
ist  movement  of  the  1980s  and  the  rather  different 
globalization  paradigm  of  the  1990s,  for  both  of 
which  domestic  events  were  subordinate  to  the 
logic  of  the  international  system.  The  constructiv¬ 
ist  wave  has  reinforced  this  research  tradition  by 
legitimizing  a  focus  on  culture  and  identity  that 
had  previously  been  seen  as  unscientific.  Taken  as 
a  whole,  FPA  is  a  powerful  demonstration  of  how 
realist  attempts  to  abstract  foreign  policy  from  the 
political  and  social  context  in  which  it  is  embedded 
are  misguided,  even  dangerously  simplistic. 

Organizational  Logics 

The  theory  of  bureaucratic  politics  expounded 
by  Graham  Allison  collapsed  into  one  of  his  previ¬ 
ously  separate  models  of  governmental  politics 


and  organizational  behavior.  There  are,  however, 
good  reasons  for  keeping  the  two  separate,  as 
indeed  the  second  edition  of  Allison’s  Essence  of 
Decision  (with  Philip  Zelikow,  1999)  recognized. 
The  institutional  competition  associated  with  the 
former  has  its  most  significant  effects  at  the  top 
levels  of  decision  making.  It  also  rests  on  the 
assumption  that  as  individuals  identify  with  their 
particular  organizational  home,  they  will  pursue 
its  particular  interests  or  view  of  the  world.  By 
contrast,  the  organizational  process  approach 
identifies  processes  and  mind-sets  that  derive  from 
the  nature  of  the  complex  state  in  the  age  of  the 
masses.  By  contrast  with  the  ideal  type  of  Max 
Weber,  the  research  conducted  by  and  drawn  on 
by  FPA  sees  modern  bureaucracy  as  developing 
pathologies  that  often  inhibit  the  efficient  and  spe¬ 
cialized  divisions  of  labor  it  exists  to  promote. 
That  is,  the  very  conduct  of  government  necessi¬ 
tates  complex  and  often  large  organizations  that 
inevitably  fail  to  live  up  to  their  terms  of  reference. 
A  century  ago  this  would  not  have  applied  to  for¬ 
eign  policy,  with  small  and  cohesive  diplomatic 
services.  Now  it  does,  with  many  departments 
involved  in  policy  making,  defense  budgets  usually 
representing  a  major  claim  on  the  public  purse, 
and  the  amount  of  information  available  about 
global  politics  growing  exponentially. 

The  management  of  information,  indeed,  as  of 
the  increasingly  large  organizations  themselves,  is 
one  of  the  major  challenges  facing  the  modern  for¬ 
eign  policymaker  and  one  on  which  FPA  has  much 
to  offer.  The  study  of  intelligence  is  a  case  in  point. 
For  many  years  after  1945,  intelligence  was  seen  as 
both  too  sensitive  and  too  difficult  a  subject  for 
academic  researchers.  Over  the  past  2  to  3  decades, 
however,  this  assumption  has  changed,  thanks  to 
both  historians  and  political  scientists.  The  critical 
role  played  by  secret  intelligence — political,  mili¬ 
tary,  and  economic — has  come  under  scrutiny  by  all 
those  interested  in  decision  making.  In  part  this  has 
involved  the  study  of  turf  disputes — for  example,  as 
between  the  numerous  U.S.  intelligence  agencies — 
but  it  has  also  involved  the  more  complex  issue  of 
how  an  issue  gets  framed  and  by  whom,  how  to 
evaluate  different  sources  of  information,  how  to 
weigh  capabilities  against  intentions,  and — most 
critically  in  the  case  of  Iraq  in  2002  to  2003 — what 
constitutes  reliable  professional  judgment  in  a 
highly  politicized  environment.  As  the  intelligence 
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community  has  come  to  emerge  from  the  shadows, 
it  has  become  more  willing  to  accept  the  value  of 
dispassionate  outside  analysis  of  its  role  in  foreign 
policy  making. 

Another  benefit  of  the  organizational  approach 
comes  via  the  focus  on  the  decisional  flow — 
namely,  the  stages  through  which  major  decisions 
pass.  The  orthodox  approach  in  the  political  world 
is  to  focus  on  individuals,  agencies,  and  the  nature 
of  public  debate.  But  this  needs  complementing  by 
the  perspective  to  be  found  in  FPA  that  peels  back 
the  surface  layers  to  reveal  the  degree  to  which  a 
particular  decision,  or  policy,  arose  out  of  a  linear 
process  of  information  gathering  and  consultation 
or  sprang  from  various  forms  of  short-circuiting, 
whereby  some  parts  of  the  government  machine 
were  cut  out  of  the  process  and/or  informal  groups 
formed  across  organizational  boundaries  to  hasten 
matters  to  the  point  of  action.  The  linear  process 
starts  with  information  gathering  or  political  inputs, 
moves  to  an  assessment  of  the  issue  and  the  framing 
of  the  problem,  which  in  turn  lead  to  the  delinea¬ 
tion  of  options,  the  point  of  authoritative  decision, 
and  finally  the  complexities  of  implementation. 
Sometimes  this  model  is  actually  borne  out  by 
events,  but  more  often  it  represents  an  ideal  type 
against  which  derogations  can  be  measured.  All  too 
often  the  formal  structure,  on  which  democratic 
accountability  depends  (where  possible  in  the  first 
place),  dissolves  into  a  set  of  arrangements  that  is 
half  constitutional  and  half  ad  hoc,  with  some 
stages  of  the  process  missed  out  or  accelerated. 
Only  careful  analysis  based  on  rigorous  enquiry 
can  reveal  the  true  nature  of  the  process.  This  will 
almost  inevitably  be  after  the  event,  with  academics 
sometimes  helped  by  the  high-level  enquiries  often 
convened  after  the  fiascoes  that  a  high-handed 
approach  to  policy  making  can  produce. 

Goals,  Purposes,  and  Values 

In  all  agency-focused  social  science,  the  issue  of 
the  objectives  of  action  is  always  at  stake.  What 
specific  goals  does  an  actor  have,  what  broad  pur¬ 
poses,  and  what  underlying  values?  How  far, 
indeed,  is  behavior  purposive?  We  have  seen  that 
the  study  of  both  political  psychology  and  organi¬ 
zational  behavior  places  big  question  marks  against 
any  assumption  of  rationality,  which  may  be 
extended  to  the  idea  of  purposive  behavior  itself. 


Foreign  policy  is  still  a  sufficiently  discrete  area  for 
evidence  on  this  question  to  be  gathered  system¬ 
atically.  Because  goals  for  the  most  part  have  to  be 
outwardly  directed,  against  specific  “others,”  they 
are  more  visible — or  more  notable  when  absent — 
than  the  objectives  of  domestic  politics,  which  so 
often  blur  electoral  considerations  with  those  of 
economics  and  welfare.  On  the  other  hand,  FPA 
has  shown  how  foreign  policy  does  not  usually 
require  legislation  and  can  often  fall  back  on  main¬ 
taining  a  small  number  of  parameters  within 
acceptable  tolerances. 

The  classic  way  of  explaining  the  distinctive 
nature  of  foreign  policy  goals  was  through  the 
concept  of  the  national  interest.  It  was  one  of  the 
early  achievements  of  FPA  to  blow  holes  in  that 
concept.  It  has  not  sunk  without  trace,  given  the 
predilection  of  politicians  for  rhetoric  and  linguis¬ 
tic  shorthands,  but  few  serious  observers  would 
now  regard  it  as  unproblematic.  In  its  place,  the 
subject  has  emphasized  the  importance  of  under¬ 
standing  the  ends-means  relationship,  the  diverse 
and  sometimes  contradictory  targets  that  decision 
makers  pursue,  and  the  partiality  of  many  of  the 
diverse  interests  that  are  pursued  in  the  name  of 
the  state.  The  domestic  dimension  has  been  brought 
even  more  into  play  by  showing  how  foreign  pol¬ 
icy  relates  to  nation  building  and  is  infected  by 
ideologies  (including  nationalism  and  religion) 
even  in  states  that  stress  their  pragmatism.  In  other 
words,  the  sources  of  both  goal-directed  and  more 
impulsive  behavior  in  foreign  policy  are  regularly 
interrogated  within  FPA,  both  theoretically  and  by 
country  specialists. 

Closely  related  to  this  line  of  enquiry  is  the 
effectiveness  of  foreign  policies,  for  success  can 
hardly  be  measured  without  reference  to  the  goals 
that  are  at  stake.  Research  here  has  recently  clus¬ 
tered  around  the  concept  of  “soft  power”  (“civil¬ 
ian”  or  “normative”  power  in  the  case  of  the  EU), 
but  this  is  simply  the  latest  stage  in  a  long-running 
and  cumulative  program  of  enquiry.  It  derived 
from  the  realist  preoccupation  with  exercise  of 
power  and  steadily  became  more  refined,  variously 
seeing  power  as  currency  or  on  a  spectrum  between 
Machtpolitik  and  isolationism.  It  differentiated 
between  the  various  instruments  of  foreign  policy, 
with  much  attention  being  given  to  economic 
statecraft,  to  the  use  and  threat  of  force,  and  to 
diplomacy  in  its  various  guises.  Much  of  the  most 
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fruitful  work  has  been  done  on  the  interplay  and 
overlaps  between  these  various  categories,  such  as 
coercive  diplomacy,  sanctions,  or  the  projection  of 
a  national  image  abroad  (public  diplomacy). 
Practitioners  are  particularly  interested  in  aca¬ 
demic  findings  on  such  subjects  and  frequently  call 
on  them  during  political  debate. 

For  too  long  FPA  had  a  tendency  to  concentrate 
on  process  and  to  avoid  the  issue  of  the  content  of 
foreign  policy.  It  was  forced  out  of  this  ostrich 
posture  by  the  democratic  peace  debate,  which  had 
its  origins  more  in  philosophical  approaches  to  IR 
than  in  the  empirical  study  of  state  behavior.  It 
did,  however,  quickly  attract  the  empiricists  and 
gave  a  further  lease  on  life  to  the  CFP  school 
within  FPA,  whose  large  data  sets  on  the  behavior 
of  classes  of  states  seemed  immediately  relevant. 
The  revival  proved  temporary,  as  once  again  CFP’s 
combination  of  broad  brush  and  methodological 
introversion  proved  insufficient  to  cope  with  the 
subtleties  of  the  argument  over  what  kind  of 
democracies  went  to  war  in  what  kinds  of  circum¬ 
stance.  But  there  was  a  valuable  payoff  for  FPA  in 
that  a  new  bridge  was  created  between  the  empiri¬ 
cal  study  of  foreign  policy  and  the  renewed  interest 
in  ethics  within  IR.  The  work  of  the  Carnegie 
Council  and  other  bodies  had  long  emphasized  the 
need  for  a  sophisticated  discussion  on  the  ethical 
dimension  of  international  behavior.  Now,  follow¬ 
ing  the  lead  of  Stanley  Hoffmann  and  Michael 
Walzer,  this  need  began  to  be  fulfilled,  to  the  point 
where  international  theorists  such  as  Chris  Brown 
or  Nicholas  Wheeler  regularly  work  on  the  edges 
of  political  theory  and  FPA,  in  their  discussions  of 
topics  such  as  the  motives  for  humanitarian  inter¬ 
ventionism  or  the  priority  to  be  given  to  overseas 
development  aid.  This  has  proved  timely  in  that 
the  post-Cold  War  era  has  led  some  governments 
to  raise  the  ethical  dimension  of  foreign  policy 
without  a  trace  of  the  defensiveness  that  would 
have  been  apparent  in  the  years  of  Cold  War  real¬ 
ism  and  has  liberated  the  forces  in  (usually  Western) 
public  opinion  that  wished  to  see  foreign  policy 
consistent  with  their  own  domestic  values.  Yet 
foreign  policy  analysts,  as  opposed  to  philoso¬ 
phers,  bring  an  important  practical  perspective  to 
bear  on  these  complex  problems.  They  relate 
moral  questions  to  those  of  policy  practice  without 
falling  back  either  on  the  simplistic  realist  assump¬ 
tion  that  everything  ultimately  reduces  to  the 


national  interest  or  on  the  common  liberal  view 
that  interests  and  ethics  can  be  made  to  coincide. 
Rather,  an  analytical  approach  attempts  to  iden¬ 
tify  the  multiple  constituencies  that  a  given  state  is 
trying  to  serve  and  to  deconstruct  the  priorities 
that  are  revealed  by  practice.  It  can  then  go  on  to 
assess  the  feasibility  of  expecting  X  or  Y  of  any 
given  actor,  given  that  actor’s  values  and  circum¬ 
stances. 

Environments  and  Structures 

The  external  circumstances  in  which  foreign 
policy  is  made  constitute  the  last  main  area  of 
work  within  FPA  described  here.  From  its  early 
years  the  politically  minded  geographers  Harold 
and  Margaret  Sprout  had  set  the  parameters  for 
thinking  about  foreign  policy  within  the  frame  of 
both  material  and  political  factors.  In  so  doing  they 
also  raised  the  fundamental  questions  of  causation 
and  determinism — namely,  how  much  freedom  of 
action  do  foreign  policymakers  enjoy,  given  the 
difficulties  of  operating  on  the  international  level, 
and  how  much  difference  do  size,  location,  and 
power  make  to  the  margins  of  maneuver.  It  might 
be  thought,  for  example,  that  the  great  powers 
must  enjoy  more  freedom  than  others,  but  much 
work  within  the  context  of  FPA  has  been  devoted 
to  demonstrating  the  paradoxical  freedoms  of 
small  states  and  the  burdens  of  overextension  on 
the  part  of  large  ones. 

The  Sprouts’  emphasis  was  on  the  environments 
of  foreign  policy,  and  this  has  produced  the  useful 
orthodoxy  of  a  division  between  the  domestic,  the 
external,  and  the  psychological  environments.  They 
were  also  remarkably  prescient  in  developing  an 
ecological  perspective  on  human  affairs.  To  some 
extent,  however,  this  perspective  has  been  over¬ 
taken  theoretically  by  the  agent-structure  debate. 
Structures  are  multiple  and  cross  the  internal/ 
external  frontier,  but  for  foreign  policymakers  the 
most  obvious  structures  in  which  they  have  to 
work  are  the  international.  These  are  material,  in 
terms  of  the  distribution  of  resources  and  climatic 
advantage;  political  but  tangible,  in  terms  of 
international  law  and  institutions;  and  abstract/ 
normative,  in  terms  of  the  values  of  international 
society.  The  different  ways  in  which  they  bear 
down  on  the  great  variety  of  kinds  of  states  in  the 
world  are  grist  for  the  mill  for  FPA  researchers, 
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who  can  thereby  work  both  from  the  outside 
inward  and  from  the  inside  outward,  showing  how 
participation  in  the  international  system  to  some 
degree  shapes  the  actors  that  operate  within  it  and 
how  the  actors,  both  state  and  nonstate,  singly  and 
in  groups,  themselves  determine  the  way  in  which 
the  system  evolves. 

The  Contemporary  Significance 
of  Foreign  Policy  Analysis 

FPA  as  a  field  has  three  broad  paths  by  which  the 
problems  outlined  in  this  entry  may  be  approached: 
theory,  comparisons,  and  country  studies.  The 
theoretical  activity  revolves  mainly  around  deci¬ 
sion  making,  the  problems  of  agency  and  rational¬ 
ity,  the  relationship  between  the  state  and  the 
system  in  which  it  is  embedded,  and  ethical  ques¬ 
tions.  This  is  hardly  a  trivial  agenda.  In  pursuing 
it,  FPA  borrows  much  from  other  subjects  where 
advances  have  already  been  made  but  its  own  dis¬ 
tinctive  problematique  requires  original  work  that 
then  makes  its  own  contribution  to  social  sci¬ 
ence — as  we  have  seen  with  the  now  ubiquitous 
concepts  of  groupthink  and  bureaucratic  politics. 

The  comparative  perspective  in  FPA  can  operate 
on  the  grand  scale,  as  with  CFP,  in  an  attempt  to 
use  all  foreign  policy  events  to  identify  patterns  of 
behavior,  and  with  correlations  with  types  of  state 
and/or  types  of  internal  events.  More  frequently, 
and  more  fruitfully,  it  now  concentrates  on  limited 
classes  of  actor,  such  as  small  states,  democracies, 
or  specified  types  of  events  such  as  crises  or  eco¬ 
nomic  sanctions.  Even  with  such  a  restriction,  the 
task  of  producing  nonbanal  generalizations  is  for¬ 
midable  and  then  most  useful  at  a  heuristic  level. 
But  despite  the  obstacles,  all  work  in  FPA  is  by 
definition  comparative,  if  only  at  times  implicitly, 
in  that  even  single  events  are  approached  using 
common  concepts  and  methodologies,  so  that  they 
may  be  related  to  other  similar  instances.  In  fact, 
most  case  studies  are  influenced  by  the  method  of 
structured,  focused  comparison  designed  by 
Alexander  George  to  minimize  the  randomness. 
Where  processes  are  the  focus  rather  than  case 
studies  the  method  is  inherently  comparative.  The 
discussion  of  organizational  process,  or  leadership, 
for  example,  begins  with  propositions  of  a  univer¬ 
sal  character,  which  are  then  usually  qualified 
according  to  the  context  in  which  they  are  applied. 


For  example,  is  the  domination  of  foreign  policy 
by  charismatic  leaders,  say,  more  likely  in  a  devel¬ 
oping  country  or  in  those  with  access  to  the  sophis¬ 
ticated  techniques  of  the  mass  media? 

Country  studies  are  the  most  conventional  path 
for  analysts  to  take.  They  are  unfashionable  because 
they  are  all  too  often  seen  as  intellectually  less 
interesting  than  theory  or  comparison,  rather  like 
the  prejudice  within  social  science  against  area 
studies.  But  this  is  a  simplistic  view.  There  are,  after 
all,  many  ways  of  studying  an  individual  country’s 
foreign  policy,  some  highly  analytical  and  incisive. 
It  is,  in  fact,  rare  these  days  to  find  a  work  that 
does  not  rise  above  the  descriptive.  Indeed,  when 
significant  change  occurs  in  a  state’s  international 
position,  the  natural  response  is  a  flood  of  work 
within  the  broad  FPA  tradition — as  was  the  case 
with  Germany  and  the  United  States  after  1989 
and  China  after  its  rise  to  economic  power.  This 
tradition  is  indispensable,  for  however  insightful  a 
theory,  or  however  rigorous  and  wide-ranging  a 
work  of  comparison,  they  cannot  provide  the  rich, 
in-depth  understanding  of  a  particular  country  and 
its  foreign  policy  choices.  Any  attempt  to  deduce 
the  strategy  of  Burma  or  Saudi  Arabia,  for  exam¬ 
ple,  from  neorealism  or  some  other  parsimonious 
theory,  is  doomed  to  superficiality.  Such  states  are 
comparable  only  up  to  a  very  limited  point,  qua 
states.  Individual  cases  might  fit  a  generalization  in 
some  broad  sense,  but  the  resulting  knowledge 
cannot  have  the  thickness  achieved  by  combining 
theoretically  based  analysis  with  expertise  in  the 
country  or  particular  issue  at  stake. 

This  is  why  FPA,  like  all  other  aspects  of  social 
science,  cannot  be  a  subject  unto  itself.  It  relies  on 
collaboration  with  others.  If  IR  and  political  sci¬ 
ence  are  the  most  immediate  points  of  contact, 
other  social  sciences,  notably  psychology,  geogra¬ 
phy,  and  political  economy,  are  equally  important. 
What  is  more,  FPA  needs  to  be  close  to  history, 
philosophy,  and  area  studies,  which  may  not  be 
social  sciences  themselves  but  provide  much  foun¬ 
dational  material  for  the  latter.  But  FPA  is  not  just 
a  matter  of  scope,  a  field  where  the  master  disci¬ 
plines  play  out  their  games.  Over  half  a  century,  it 
has  focused  attention  on  one  of  the  most  impor¬ 
tant  areas  of  human  activity,  if  judged  by  the 
potential  consequences  for  the  mass  of  humanity. 
And  it  has  accumulated  a  considerable  body  of 
knowledge,  and  techniques,  for  throwing  light  on 
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foreign  policy  wherever  it  may  be  practiced.  For 
that  reason,  it  is  important  in  itself,  as  a  source  of 
intellectual  progress,  and  for  the  help  it  can  give  all 
of  us,  practitioners  and  citizens,  in  the  search  to 
understand  the  marvelous  but  often  dangerous 
world  we  live  in. 

Christopher  Hill 
University  of  Cambridge 
Cambridge,  United  Kingdom 
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Formal  Theory 


See  Game  Theory;  Social  Choice  Theory 


Freedom 


See  Liberty 


Functionalism 


Functionalism  refers  to  the  methodological  concept 
of  explaining  social  phenomena  by  specifying  an 
asymmetrical  relationship  between  two  objects 
under  consideration.  Functionalism  rests  on  the 
view  that  phenomena  can  be  best  explained  in  terms 
of  what  they  do  and  what  their  impact  is  on  other 
phenomena;  thus,  it  considers  systems  of  interaction 
among  individuals  and  groups.  In  political  science, 
functionalism  emphasizes  the  functions  of  social 
institutions.  After  providing  a  general  definition  of 
functional  explanation,  this  entry  outlines  the  ideas 
of  some  of  the  most  influential  originators  of  func¬ 
tionalism.  Next,  it  describes  the  main  stages  in  the 
development — structural  functionalism,  equiva¬ 
lence  functionalism,  and  neo-functionalism.  The 
entry  ends  with  a  critical  summary  of  the  status  of 
functionalism  today. 

Definition 

A  functional  explanation  can  take  this  general 
form: 

Given  a  system  S  in  a  certain  state  s  with  a  structure 
T,  there  is  an  activity  a  from  the  point  of  view  of 
the  observer,  regularly  coming  from  an  element  E 
of  T,  and  having  an  effect  on  S  or  its  environment. 
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The  theoretical  status  of  s  can  either  be  stationary 
(the  most  frequent  case)  or  dynamic.  If  one  can, 
with  respect  to  a  certain  theoretical  point  of  refer¬ 
ence,  claim  a  systematic  relationship  between  a  and 
s,  then  E  can  be  interpreted  as  a  “function”  for,  or 
a  as  a  “functional  contribution”  to,  the  mainte¬ 
nance  of  S,  whether  it  means  stability,  identity, 
equilibrium,  or  changing  of  S.  This  explanation  of 
s  neither  involves  an  explication  of  a’s  origin  nor 
the  causal  nexus  of  its  effect  on  s.  For  example,  a 
political  party  can  be  considered  as  contributing  to 
the  working  democracy  without  constructing  this 
function  as  the  cause  of  its  creation. 

Functional  explanations  are  based  on  empirical 
evidence:  After  s  and  a  have  been  observed,  the 
functional  character  of  the  relationship  between 
them  can  be  deduced  from  the  features  of  T  and  E. 
Of  course,  these  explanations  can  be  formulated 
with  a  teleological  orientation:  as  a  projection  of  a 
future  s,  derived  from  T  as  a  goal-orientated  agent, 
under  the  assumption  that  a’s  effect  on  .S'  is  a  func¬ 
tional  prerequisite  for  the  maintenance  of  it.  In 
case  there  is  a  deliberately  role-differentiated  orga¬ 
nization  under  consideration,  such  a  prognosis  will 
become  normative.  It  will  express  the  expectation 
that  S,  as  a  whole,  will  function  in  the  way  pre¬ 
scribed  by  its  blueprint. 

However,  functional  explanations  make  sense 
only  under  several  conditions.  For  example,  if  one 
states  that  general  and  free  elections  in  parliamen¬ 
tary  systems  have  the  “function”  of  maintaining 
the  circulation  of  political  elites,  then  a  bottom-up 
nomination  of  candidates  is  assumed,  but  other 
functions  (such  as  lawmaking  or  representation) 
are  not,  however,  excluded.  Functional  analyses 
are  based  on  theoretical  assumptions,  but  they 
hardly  suggest  new  ones  by  themselves.  According 
to  the  widespread  usage  of  functional  explana¬ 
tions,  systems  can  be  represented  as  institutions, 
religious  behavior  patterns,  cultures,  societies,  and 
so  on,  whereas  a  may  operate  as  a  kind  of  activity, 
exchange,  information,  sanction,  service,  coercion, 
production,  and  other  forms  of  output. 

Origins 

Functionalism,  as  a  tradition  or  “school,”  culmi¬ 
nated  in  the  1950s  and  1960s.  However,  its  ori¬ 
gins  can  be  found  in  the  works  of  many  of  the 
classic  social  and  political  theorists.  Charles  de 


Montesquieu  may  be  considered  such  an  early 
proto-functionalist  thinker.  In  his  chief  work 
Spirit  of  Laws  (1748),  he  classified  the  compo¬ 
nents  of  political  systems  in  terms  of  their  physi¬ 
cal,  geographic,  climatic,  mental,  and  cultural 
“nature” — that  is,  the  structure  of  their  external 
systems.  The  respective  legal  principles  activate 
the  “natural”  structures  toward  an  outcome  of 
political  order  by  which  the  subjects  are  reminded 
of  their  duties  and  rights.  The  privileges  of  nobil¬ 
ity,  for  instance,  are  a  function  of  its  freedom, 
whereas  parliamentary  power  is  one  of  the  prin¬ 
ciples  of  constitutional  monarchy.  At  the  same 
time,  Montesquieu  tried  to  make  it  clear  that 
important  political  functions  can  often  hide  behind 
ostensibly  incidental  or  useless  symbols  and  ways 
of  acting. 

An  important  predecessor  of  classical  function¬ 
alism  was  Herbert  Spencer,  who,  as  a  leading 
social  Darwinist,  tried  to  explain  the  development 
of  industrial  society  in  an  evolutionist  way. 
According  to  Spencer,  the  law  underlying  societal 
evolution  proceeds  from  homogeneity  of  equal  and 
independent  parts  to  heterogeneity  of  unequal  and 
dependent  ones.  These  parts — institutions,  groups, 
technologies,  ideas,  and  so  on — tend  to  become 
increasingly  specialized  and,  as  a  result,  fulfill 
increasingly  special  functions  for  society.  The  start¬ 
ing  point  of  this  process  is  that  the  management  of 
population  growth  implies  prerequisites  such  as 
advances  of  productivity,  distribution,  or  regulation. 
Social  differentiation,  functional  specialization,  and 
interdependence  of  parts  feed  back  to  one  another  as 
mechanisms  of  the  same  movement. 

One  of  the  founders  of  modern  sociology,  Emile 
Durkheim,  then  adopted  functional  analysis,  logi¬ 
cally  distinguishing  it  from  genetic-causal  explana¬ 
tion,  as  a  methodological  program.  In  his  works 
The  Division  of  Labor  in  Society  (1892)  and  Rules 
of  the  Sociological  Method  (1895),  he  argued: 

1.  In  explaining  a  social  phenomenon,  one  has 
to  distinguish  its  generating  cause  from  the  func¬ 
tion  actually  fulfilled  by  it.  It  is  inadmissible  to 
explain  collective  phenomena  in  a  utilitarian  way 
because  their  respective  functions  can  originate 
from  or  serve  different  purposes.  At  the  same  time, 
teleological-causal  explanation  is  not  promising 
because  a  collective  phenomenon  such  as  society 
does  not  constitute  a  consistently  acting  whole. 
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2.  With  this,  Durkheim  suggested  that  we 
understand  social  phenomena  as  emergents:  They 
come  into  being  through  an  accumulation  of 
immense  courses  of  aggregated  individual  actions 
or  of  unintended  effects  of  purposeful  action. 

3.  A  social  phenomenon  emerges  and  tends  to 
enhance  the  importance  of  its  function  when  indi¬ 
viduals  recognize  the  advantage  gained  from  it  and 
develop  an  interest  in  optimizing  profit.  Durkheim 
was  one  of  the  first  to  conceptualize  function  as  a 
self-reinforcing  mechanism  and  a  feedback  system. 
Durkheim  demonstrated  this  in  the  light  of  divi¬ 
sion  of  labor.  Specialization  of  occupations  and 
social  roles  lead  actors  into  growing  dependence. 
With  this,  human  networks  become  closer,  making 
the  exchange  of  specialized  goods  easier.  As  a 
result,  a  social  order  that  Durkheim  called  organic 
solidarity  is  established,  as  a  kind  of  moral  sui 
generis.  In  the  course  of  this  process,  people  also 
experience  the  secondary  benefits  of  division  of 
labor,  such  as  the  growth  of  economic  productivity 
or  differentiation  of  the  legal  system.  Finally,  mod¬ 
ern  division  of  labor  derives  from  needs  that  have 
emerged  at  a  preceding  state.  Thus,  the  theoretical 
meaning  of  social  progress  is  a  growing  complexity 
of  functions. 

In  the  1930s  through  the  1950s,  functionalist 
thinking  spread,  especially  in  anthropology.  From 
an  ethnological  point  of  view  in  the  Western  per¬ 
spective,  there  was  a  strong  temptation  to  interpret 
“strange”  phenomena  by  taking  the  stability  of  the 
social  and  cultural  systems  as  a  point  of  reference. 
Bronislaw  Malinowski  ( 1 944 )  and  Alfred  Radcliffe- 
Brown  (1952),  for  instance,  tried  to  explain  the 
performance  of  magic  rites  by  Pacific  Islanders  in 
this  way.  While  Malinowski  emphasized  the 
physio-psychological  functions  of  need  satisfaction 
and  fear  reduction  by  magic,  Radcliffe-Brown 
claimed  that  magic  served  to  promote  the  survival 
of  the  group  and  the  maintenance  of  the  system’s 
structure.  It  was  George  Homans  who  presented 
an  interesting  proposition  for  “reconciling”  these 
approaches  by  overcoming  the  one-sidedness  of 
both  of  them:  Magic  initially  was  performed  to 
soothe  ghosts  or  to  release  stress  and  anxiety  in 
dangerous  situations,  such  as  sea  fishing  or  child¬ 
bearing,  and  to  bring  about  confidence,  provided 
that  the  rites  are  performed  in  the  “right”  way.  At 
the  same  time,  this  procedure  involves  conformity 


with  the  norms  that  are  in  force,  the  violation  of 
which  would  have  serious  consequences  for  the 
group.  Therefore,  society  instills  in  its  members  a 
fear  of  punishment  for  noncompliance  through 
rites  that  serve  as  a  means  of  enforcing  compliance 
with  such  norms.  At  the  same  time,  magic  rites  are 
important  group  activities,  strengthening  the  bond 
and  solidarity  of  the  group  and  promoting  the  sur¬ 
vival  of  the  system  thereby.  Homans,  by  combin¬ 
ing  the  psychological  and  the  structural  aspects  of 
a  functional  explanation,  demonstrated  how  to 
take  the  hybrid  nature  of  functions  into  account — 
following  Durkheim’s  basic  idea. 

Structural  Functionalism 

Classical  functionalism  culminated  in  Talcott 
Parsons’s  (1951)  systems  theory.  Parsons  called  his 
approach  structural  functionalism ,  later  on  giving 
up  this  term  at  the  zenith  of  his  work.  Social  sys¬ 
tems,  as  the  point  of  reference  for  his  theory,  have 
to  solve  a  limited  number  of  general  problems  for 
their  maintenance  or  keeping  their  balance.  When 
modern  society  is  being  considered,  “function” 
refers  to  the  solutions  to  such  general  problems. 
Some  functions  are  externally  directed:  They 
provide  for  the  adaptation  of  the  system  to  its 
environment.  Others  are  internally  directed,  prereq¬ 
uisites  for  the  integration  of  parts  or  for  actors’ 
motivations.  In  his  famous  AGIL  (adaptation,  goal 
attainment,  integration,  latency)  scheme,  Parsons 
systematically  specified  the  four  main  system  prob¬ 
lems  and  related  them  to  the  four  subsystems,  from 
which  the  corresponding  functions  of  problem 
solution  proceed  (see  Table  1).  The  functions  are 
interrelated  by  processes  of  interchanging  special 
classes  of  “media”  that  define  their  respective  out¬ 
puts,  as  if  they  were  material,  psychic,  and  social 
goods  or  energies  in  the  metabolism  of  society. 

Along  these  lines,  adaptation — that  is,  mainte¬ 
nance  within  environment — is  the  function  of 
economy,  including  technology,  labor,  and  con¬ 
sumption,  and  the  output  of  it  flows  in  the  general 
form  of  (equivalences  of)  money.  Attainment  of 
goals  fundamental  to  society  is  guaranteed  by  the 
subsystem  of  polity  bringing  about  effects  of  con¬ 
trol  in  the  form  of  power.  The  interchange  between 
A  and  G,  for  instance,  can  be  manifested  in  the 
way  laws  and  sanctions  (polity)  regulate  the  func¬ 
tioning  of  markets,  whereas  the  economy  enables 
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Table  1  Talcott  Parsons’s  AGIL  Model 

Function  for  Solution  of  System  Problems 

Subsystem 

Media  of  Interchange 

(A)  Adaptation 

Economy 

Money 

(G)  Goal  attainment 

Polity 

Power 

(I)  Integration 

Societal  community 

Moral  appeals 

(L)  Latency  (pattern  maintenance) 

Fiduciary  system 

Influence 

Note:  AGIL  =  adaptation,  goal  attainment,  integration,  latency. 


the  functioning  of  the  political  system  by  providing 
revenue  from  taxes.  Societal  communities  contrib¬ 
ute  to  system  integration  by  providing  for  an  opti¬ 
mal  climate  of  morals,  solidarity,  and  socialization 
as  well  as  sanctioning  conformity  with  values  and 
norms  (e.g.,  by  awarding  prestige).  Finally,  pattern 
maintenance  typically  arises  from  the  fiduciary 
subsystem,  the  actors  of  which  are  sources  of  influ¬ 
ence  toward  ensuring  common  interests,  standards 
of  professionalism,  and  other  grounds  of  trust. 
Within  the  subsystems,  social  roles — the  intersec¬ 
tions  of  structure  and  actor — are  interlinking.  They 
represent  the  institutionalized  expectations  toward 
the  occupants  of  social  positions  derived  from  the 
given  structure  of  social  differentiation.  Therefore, 
the  classification  of  functional  parts  according  to 
the  four  system  problems  is  repeated  on  every  sub- 
level:  Subsystems  consist  in  functionally  structured 
subsystems  of  the  same  nature.  The  system  as  a 
whole  is  vaulted  by  the  cultural  system  as  the  total 
of  legitimate  values,  norms,  and  symbols. 

For  several  decades,  Parsons’s  functionalist 
approach,  as  a  heuristic  scheme,  has  been  used  in 
various  research  applications.  Flowever,  to  apply 
the  AGIL  model  to  empirical  systems  involved 
some  compromises.  For  example,  some  overlap¬ 
ping  of  the  subsystem-function  dimensions  had  to 
be  accepted  in  matching  a  specific  function  with  a 
given  part  of  the  system. 

Most  criticism  of  Parsons’s  approach  focused  on 
its  inherent  tendency  of  holism  to  consider  society  as 
an  “actor”  without  any  microfoundations,  assum¬ 
ing  the  fiction  of  a  system’s  tendency  toward  an 
equilibrium,  the  fiction  of  common  goals  of  society, 
and  their  teleological  character.  Finally,  Parsons  was 
said  to  have  explained  social  change  too  simply  by 
referring  to  interference  from  environment  or  inter¬ 
nal  accident,  instead  of  taking  into  consideration 
dynamic  features  in  the  system’s  structure  itself. 


Robert  K.  Merton  was  the  most  prominent  critic 
of  structural  functionalism.  Not  rejecting  functional¬ 
ism  in  total,  he  suggested  revising  it  on  many  points. 
First  of  all,  he  distinguished  between  “manifest”  and 
“latent”  functions.  The  former  arise  as  cumulated 
effects  from  individual,  intended  action,  mostly  a 
case  of  individual  conformity  to  collective  rational¬ 
ity.  The  latter  means  the  type  of  collective  effects 
that  emerge  beyond  the  control  of  individuals,  when 
there  is  no  matching  of  goals  and  consequences.  This 
distinction  corresponds  to  the  difference  between 
“functional  systems”  and  “systems  of  interdepen¬ 
dence”  suggested  by  Raymond  Boudon  (1981).  The 
first  are  goal-orientated  systems  of  interaction  regu¬ 
lated  on  the  base  of  reciprocal,  specialized  role 
expectations  as  well  as  controlling  by  means  of 
input-output  assessments.  Business  organizations, 
universities,  political  parties,  bureaucracies,  and  so 
on  are  of  this  kind.  On  the  other  hand,  systems  of 
interdependence  emerge  wherever  actors  are  inter¬ 
related,  losing  (parts  of)  control  over  their  actions, 
ultimately  becoming  involved  in  a  complex  struc¬ 
ture  that  is  nontransparent  both  in  its  nature  and  in 
its  consequences.  If,  for  instance,  at  an  election 
with  three  contesting  parties,  a  clear  ranking  of 
votes  is  the  result,  one  usually  would  expect  that 
the  leader  of  the  most  successful  Party  A  is  likely  to 
become  the  next  prime  minister.  Yet  in  case  of  a 
proportional  representation,  together  with  a  proved 
culture  of  forming  coalitions,  the  next  prime  minis¬ 
ter  could  come  from  the  secondary  Party  B  as  well 
if  Party  B  successfully  negotiates  with  Party  C, 
which  has  the  least  number  of  votes,  and  together 
Party  B’s  and  Party  C’s  votes  total  more  than  50%. 
The  main  feature  of  this  system  of  interdependence 
is  that  cabinet  formation  does  not  derive  directly 
from  the  ranking  of  votes.  Thus,  the  ideal  function 
of  democratic  elections  to  provide  for  a  freely 
selected  government  is  not  achieved.  To  understand 


932  Functionalism 


this  structure  of  emergence,  it  is  necessary  to  find 
out  the  statistical  as  well  as  institutional  and  proce¬ 
dural  rules  that  are  interdependently  responsible 
for  the  composition  of  collective  consequences  out 
of  the  individual  effects  of  action. 

Merton  had  already  approached  such  a  skepti¬ 
cism  of  the  fruitfulness  of  trying  to  functionally 
analyze  systems  of  interdependence,  including 
society.  He  turned  away  from  “big  theory”  (as  a 
global  explanation),  pleading  for  progress  in  devel¬ 
oping  less  abstract,  and  less  general,  “theories  of 
the  middle  range,”  which  target  a  more  modest 
explanation.  As  a  consequence,  he  proposed 
deconstructing  the  general  concept  of  function  and 
making  a  distinction  between  function  (in  a  sense 
of  adaptation  or  adjustment  of  a  system),  dysfunc¬ 
tion  (as  a  consequence  of  action  destabilizing  a 
system),  and  nonfunctional  consequences  (irrele¬ 
vant  to  the  state  of  a  system).  For  a  functional 
analysis  with  higher  concreteness  and  specifica¬ 
tion,  he  proposed  a  number  of  conditions  or 
restrictions  such  as  the  following: 

•  the  phenomena  under  consideration  should  have 
an  institutional  character  and  be  standardized; 

•  subjective  moments  of  action  should  be  taken 
into  account  as  well  as  the  distinction  of 
positive,  negative,  and  neutral  consequences; 

•  the  functional  affected  units  should  be  specified 
as  well  as  the  criteria  for  system  maintenance; 

•  functional  alternatives  should  be  specified,  with 
respect  to  their  scope  of  application; 

•  dysfunctions  producing  change  should  be 
identified; 

•  comparative  studies  to  better  validate  approaches 
of  explanation  should  be  performed; 

•  features  of  actors,  meanings,  places, 
motivational,  and  overt  behavior  that  are 
involved  in  pattern  of  action  should  be  specified. 

It  is  hard  to  imagine  that  this  demanding  pro¬ 
gram  would  have  been  executed  in  a  more  or  less 
exhaustive  way.  The  usual  restrictions  on  per¬ 
sonal,  material,  and  temporal  resources  and  on 
theoretical  potentialities  allowed  only  for  selective 
implementations  in  a  few  studies. 

Equivalence  Functionalism 

Along  with  claiming  to  have  established  a  universal 
theory  of  social  systems  as  modern,  Niklas  Luhmann 


(1995)  proposed  a  new  definition  of  “function” 
that  allowed  for  a  larger  scope  of  interpretations. 
In  his  approach,  functions  have  both  a  multiple 
sense,  as  with  Durkheim,  and  also  an  “equiva¬ 
lent”  one:  Functions  serve  as  criteria  for  compari¬ 
son  of  the  actually  identified  functions  with 
virtual  ones — that  is,  some  of  the  theoretically 
possible  relationships  in  a  system  under  consider¬ 
ation.  They  are  viewed  as  an  expression  of  “unity 
and  difference,”  indicating  contingent  structures 
of  the  system.  Luhmann  used  the  research  ques¬ 
tion  of  what  mechanisms  regulate  the  shortness  of 
goods  to  illustrate  his  approach.  At  first  sight,  it  is 
easy  to  come  to  a  function  as  a  combination  of 
moral  rules  and  economic  mechanisms,  the  par¬ 
ticular  effects  of  which  can  be  compared  with 
each  other.  Yet  the  question  of  why  shortness 
actually  shall  be  regulated  “can  only  be  answered 
with  respect  to  system/environment  differences” 
(Luhmann,  1995,  p.  404,  German  edition).  The 
openness,  indefiniteness,  and  complexity  of  this 
concept  of  function  involve  a  comparative 
approach  for  functional  explanations,  distinguish¬ 
ing  particular  system  levels  as  well  as  system- 
environment  differences. 

Neo-F  unctionalism 

Jeffrey  Alexander  (1998)  and  others  made  the  next 
step  in  the  further  development  of  functionalism, 
calling  their  approach  neo-functionalism.  The 
main  aspects  of  their  revisions,  in  accord  with 
some  of  Merton’s  suggestions,  are  as  follows: 

•  Parsons’s  analytic  model  shall  be  preserved, 
however,  there  were  no  explications  given  on  the 
methodological  status  of  the  concept  “function” 
or  a  functional  explanation. 

•  Parsons’s  view  of  society  as  a  network  of 
relatively  independent  parts  tending  toward  an 
equilibrium  is  not  empirically  substantiated. 

•  The  inflexibility  of  the  systems  theory  approach 
must  be  loosened  up  by  the  inclusion  of 
“contingencies”  in  individual  action:  The 
interrelations  between  cultural  order,  as  the 
frame  of  action  and  the  contingent  moments  in 
themselves,  are  to  be  empirically  analyzed. 
Thereby,  knowledge  from  micropolitics  or 
sociology,  ethnomethodology,  symbolic 
interactionism,  and  so  on  must  become 
integrated. 
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•  Parsons’s  “idealistic”  view,  that  is,  the 

predomination  of  culture  and  values  in  modeling 
social  order,  is  widened  by  emphasizing  the 
instrumental-rational  dimensions  of  action  and 
by  incorporating  conflict  theories. 

Alexander  gave  an  example  of  analyzing  social 
change  based  on  diese  criteria.  The  general  accep¬ 
tance  of  new  institutional  structures  that,  on  the 
basis  of  approved  principles,  are  better  able  to 
resolve  an  acute  problem  will  emerge  at  the  end  of 
a  multistage,  conflictual  process.  As  a  result  of  the 
pressure  from  this  process,  competing,  contrasting 
interest  groups  will  form  and  be  stimulated  to 
action.  Then  it  becomes  functional  to  optimize  an 
internal  division  of  labor  so  the  numbers  of  alter¬ 
natives  are  reduced  step  by  step,  and  a  generaliza¬ 
tion  of  public  discussion  emerges.  If,  farther  on, 
under  the  pressure  of  “institutional  entrepre¬ 
neurs,”  it  happens  that  an  established  pattern  be¬ 
comes  illegitimate,  then  a  certain  alternative  can 
win  if  it  is  accepted  as  final  by  the  actors.  At  the 
same  time,  the  general  value  system  was  never 
jeopardized. 

Recent  Variants  in  Political  Science 

Variants  of  (neo-)functionalism  are  still  taking 
effect  in  parts  of  comparative  politics.  Here,  the 
concept  of  function  mostly  refers  to  the  meaning  of 
institutional  task,  concern,  responsibility,  or  another 
aspect  of  purposeful  action  carried  out  by  or 
expected  from  actors,  systems,  or  strategies.  Thus, 
functions  are  understood  as  intentional  rather  than 
emergent  processes  or  collective  effects  of  action. 
Furthermore,  approaches  in  political  functionalism 
are  based  on  empirical  generalizations  rather  than 
on  explicit  theoretical  or  methodological  principles. 
Common  to  them  is  the  conception  that  political 
systems  or  processes,  such  as  international  affairs  or 
European  unification,  can  be  controlled  better  by 
frontier-crossing  institutions,  nongovernmental 
organizations,  experts,  or  supraregional  net¬ 
works  of  scientists  than  by  governments  within 
their  territorial  sovereignty.  Processes  of  political 
decision  chiefly  run  along  the  actual  activities 
(functions)  of  these  agents  instead  of  those  done 
by  formally  legitimated  authorities,  who  only 
validate  the  results.  Therefore,  certain  needs  or 
interests  of  groups,  who  are  the  truly  operating 
agents,  have  become  the  focal  point  of  research, 


so  far  making  use  of  a  bottom-up  method  of 
analysis. 

If  the  functional  output  of  an  institution  over¬ 
steps  a  certain  threshold  value,  then  a  spillover 
effect  may  emerge,  for  instance,  when  the  outcome 
of  international  economic  cooperation  involves  an 
enhanced  political  cooperation  (A.  J.  R.  Groom  & 
Paul  Taylor,  1975;  David  Mitrany,  1975). 

A  reversing  of  functional  thinking  can  be  found 
in  the  theoretical  framework  for  comparative  poli¬ 
tics  by  Gabriel  Almond,  Bingham  Powell,  Russell 
Dalton,  and  Kaare  Strom  (2008).  After  having 
subjected  a  great  number  of  states  to  comparative 
analysis,  a  limited  typology  of  functions,  taken  as 
inherent  processes  of  system  maintenance,  is  gener¬ 
ated  and  standardized  for  all  of  the  cases.  Then  the 
same  procedure  is  done  with  respect  to  the  system 
units  exercising  functions,  such  as  parliaments, 
executives,  courts,  bureaucracies,  and  so  on.  Due  to 
their  origin  and  direction,  the  functions  are  distin¬ 
guished  in  the  following  way.  System  functions  are 
settled  in  the  domestic  environment:  socialization, 
recruitment,  and  communication.  They  are  inter¬ 
vening  conditions  for  the  process  functions:  interest 
articulation  and  aggregation,  policy  making,  and 
policy  implementation  and  adjustment.  Policy 
functions,  as  a  result,  operate  as  the  essential  out¬ 
puts:  extraction,  regulation,  and  distribution.  They 
again  react  to  the  process  functions,  thus  closing  a 
dynamic  control  loop,  which,  beyond  that,  is  inter¬ 
changed  with  the  political  systems  of  other  states. 
In  empirical  practice,  after  having  operationalized 
these  concepts  for  the  empirical  application,  com¬ 
parative  analyses  are  carried  out  to  find  out  how, 
in  different  real  systems,  the  functions  under  con¬ 
sideration  are  fulfilled  and  what  this  means  with 
respect  to  policy  production  as  well  as  to  the  kinds 
of  relationships  between  the  states  under  observa¬ 
tion.  To  state  it  once  more:  As  a  feature  of  this 
approach,  functions  do  not  mean  ex  post  facto 
interpretations  of  relations  having  been  observed. 
Rather,  they  are  outcomes  from  institutions  and 
agents,  a  priori  specified  as  indicators  for  data  col¬ 
lection,  that  on  the  ground  of  foreknowledge  are 
available  to  the  researchers. 

Conclusion 

Functionalism  as  a  scientific  school  or  continu¬ 
ously  developing  research  program  has  had  its  day. 
The  main  arguments  against  it  were  too  serious  to 


934  Fundamentalism 


permit  further  promising  development.  Among  the 
problems  were  the  following: 

•  an  inclination  to  holism,  with  a  neglect  of  effects 
of  action  on  the  micro-  and  mesolevel; 

•  an  overemphasis  on  system  stability  or 
equilibrium,  without  explicating  potentials  of 
change  as  a  feature  of  structure  itself; 

•  a  temptation  to  look  at  systems  by  analogy  with 
organisms — that  is,  institutions  may  function  in 
principle  like  the  organs  of  a  living  being; 

•  a  tendency  to  make  teleological  assumptions, 
even  with  respect  to  systems  whose  structures 
are  not  created  purposefully;  overabstractedness 
of  functions  having  been  discovered,  for  lack  of 
specification  of  their  origins,  places,  and  ranges 
of  manifestation;  lack  of  specification  of  the 
actors  involved;  or  lack  of  efficacy;  and 

•  a  lack  of  comparative  studies  for  the  purpose 
of  better  validation  of  function  detected  as 
well  as  the  identification  of  functional 
equivalences. 

Nevertheless,  many  suggestions  from  classical 
functionalism  entered  the  mainstream  of  social 
and  political  science,  such  as  the  basic  distinction 
between  genetic-causal  and  functional  explana¬ 
tions,  focusing  one’s  attention  to  undesirable  col¬ 
lective  consequences  of  individual  actions,  finally 
sharpening  the  view  to  a  complexity  and  interac¬ 
tion  that  defy  simple  interpretations. 
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Fundamentalism 


Fundamentalism  is  an  inclusive  term  for  modern 
religious  movements  that  appeal  to  past  events, 
texts,  and  authoritative  figures  and  project  into  the 
future  a  variety  of  doctrines,  stories,  or  laws  that 
protect  the  group  that  is  devoted  to  them  and  that 
serve  to  motivate  and  influence  attitudes  and 
actions,  more  or  less  aggressive,  toward  those  out¬ 
side  the  group. 

The  term  may  not  be  acceptable  to  adherents  in 
all  religions  and  cultures  because  it  was  first  identi¬ 
fied  with  20th-century  American  Protestantism. 
However,  the  concept  may  be  described  as  portable 
and  is  easily  translated  to  cognate  movements  in 
the  various  religions  and  cultures.  Sometimes  it  is 
appropriate  to  speak  of  these  as  “fundamentalist¬ 
like”  movements,  or  movements  that  bear  “family 
resemblances”  to  fundamentalism. 

Historical  Background 

The  term  fundamentalism  did  not  appear  in  dic¬ 
tionaries  or  encyclopedias  before  the  20th  century. 
Scholars  trace  it  to  usage  by  partisans  in  denomi¬ 
national  conflicts  in  the  United  States,  debates  that 
divided  especially  the  Baptist  and  Presbyterian 
groups.  Thus,  among  the  Northern  Baptists,  an 
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editor  complained  that  great  numbers  in  the 
church  body  claimed  to  be  and  wanted  to  be  called 
“conservative,”  but  their  conservatism  was  pas¬ 
sive.  As  such,  it  was  not  useful  in  the  decisive 
struggles  of  the  period.  He  and  those  who  sided 
with  him  described  a  situation  in  which  there  were 
theological  and  moral  threats  to  the  religious  iden¬ 
tity  and  integrity  of  the  adherents. 

In  the  case  of  the  American  Protestants,  the 
threats  came  from  some  “modern”  or  “modernist” 
claims  and  innovations.  These  threats,  which 
included  the  challenge  posed  by  Darwinism  to  the 
inherited  biblical  understandings  of  the  creation  of 
the  universe  and  of  humans,  were  seen  to  be  an 
assault  on  human  dignity;  to  undercut  faith  in  the 
authority  of  the  scriptures,  which  were  described 
as  inerrant  in  all  detail;  and  to  erode  the  psycho¬ 
logical  and  theological  boundaries  of  the  members 
of  the  group.  The  leaders  who  rallied  those  who 
would,  in  the  terms  of  the  day,  “do  battle  for  the 
Lord”  named  their  cause  “fundamentalism.” 
When  the  term,  which  they  bannered  in  the  begin¬ 
ning,  invited  stigmatization  by  their  churchly  foes 
and  in  the  public  media  and  consciousness,  some 
of  them  chose  other  terms  to  signal  that  they  were 
less  aggressive.  In  Protestantism,  the  term  evan¬ 
gelical  was  sometimes  used. 

From  the  beginning,  it  was  clear  that  “funda¬ 
mentalism”  was  not  the  same  as  “conservatism,” 
though  both  may  have  started  from  the  same  base. 
Fundamentalism  was  a  specific  response  to  modern 
challenges;  the  word  reaction  best  serves  to  describe 
the  impulse  and  strategy  of  fundamentalists.  The 
modern  assaults  on  this  reaction  were  corrosive 
and  erosive  of  boundaries  around  the  group  and 
took  the  form  of  increased  openness  to  and  toler¬ 
ance  of  other  beliefs  on  the  part  of  adherents. 

It  could  be  said  that  all  through  history  religious 
groups  have  felt  threats  to  their  beliefs,  practices, 
and  identities,  so  one  must  ask  why  the  early-20th- 
century  challenges  met  the  particular  forms  of 
response  that  they  did.  While  it  is  never  possible  to 
substantiate  exact  claims  regarding  cause-and- 
effect  relations,  some  elements  stand  out.  One  of 
the  most  notable  is  the  sophistication  of  mass 
media  of  communication,  notably  the  radio.  Such 
media  made  it  possible  to  reach  beyond  immediate 
locales  to  recruit,  minister  to,  and  draw  on  the 
discontents  and  aspirations  of  the  like-minded 
who  lived  far  away.  Such  employment  of  media 


suggests  one  of  the  paradoxes  of  fundamentalisms: 
their  use  of  the  most  modern  instruments  to  prop¬ 
agate  messages  that  they  advertise  as  and  may 
believe  to  be  what  some  Americans  called  “the 
old-time  religion.” 

In  the  context  of  other  religions,  there  has  been 
an  identification  of  fundamentalism  with  anticolo¬ 
nial  and  nationalist  programs.  Thus,  in  India, 
around  1925,  the  Rashtriya  Svayamesevak  Sangh 
(RSS)  was  formed  to  link  religious  nationalist 
movements  with  mainly  Hindu  bases;  this  was 
extended  by  the  creation  of  the  Bharatiya  Janata 
Sangh  (now  the  Bharatiya  Janata  Party,  BJP)  in 
1951.  In  Egypt  in  1928,  the  Muslim  Brotherhood 
organized  to  resist  what  its  leaders  saw  as  secular¬ 
izing  forces  and  was  promoted  as  a  fusion  of  the 
Islamic  religion  and  nationalist  goals.  These  lead¬ 
ers  revived  classic  originating  stories  from  the 
Muslim  past,  building  on  the  Koran,  the  revelatory 
text  uttered  by  Allah  through  the  prophet 
Mohammed.  Those  who  followed  the  founders  of 
the  Muslim  Brotherhood  set  out  to  restore, 
enhance,  and  enforce  laws  from  the  sharia,  the 
code  of  Islamic  laws.  From  Egypt,  Wahabi  Muslim 
movements  fanned  out,  for  instance,  to  the  Arabian 
Peninsula,  where  they  found  ready  followers. 

Characteristics  of  Fundamentalisms 

Scholars  who  view  fundamentalisms  synoptically 
and  comparatively  find  characteristics  common  to 
most  of  them.  When  such  scholars  point  to  similar 
motives  and  expressions  in,  for  example,  American 
Protestantism  and  Shiite  Islam,  they  are  not  saying 
that  fundamentalisms  are  all  the  same.  To  suggest 
that  would  be  foolish,  because  all  fundamental¬ 
isms  must  have  something  to  be  fundamentalist 
about.  The  myths  that  animate  Hindu  movements 
differ  vastly  from  those  of  Judaism  or  Islam.  These 
myths  are  usually  autochthonous,  which  means 
that  they  grew  up  out  of  their  own  soil  and  were 
not  transplanted. 

Devotion  to  Scriptures 

The  host  documents  in  most  fundamentalisms 
are  ancient  scriptures,  almost  always  believed  to  be 
a  revelation  of  God  or  the  gods,  as  in  Hinduism, 
and  thus  carrying  divine  authority.  When  some 
religious  scholars  began  to  interpret  scriptures  as 
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they  would  other  ancient  writings,  they  appeared 
to  some  believers  to  be  the  ultimate  threats,  and 
their  questioning  and  assaults  inspired  more  precise 
defining  defense  than  was  needed  when  there  was 
no  challenge.  When  various  schools  of  interpreta¬ 
tion  emerge,  often  competitively,  it  is  necessary  for 
the  movements  to  make  claims  for  one  uniquely. 
Since  these  texts  have  been  cherished  and  guarded 
by  the  community  that  gave  rise  to  them,  they  also 
come  to  be  used  as  definers  of  exclusivity.  All  other 
sacred  or  putatively  sacred  texts  are  repudiated.  No 
measure  or  bases  for  tolerance  or  synthesis,  atti¬ 
tudes  typically  born  in  the  Enlightenment  era,  is 
permitted. 

Group  Exclusivism 

If  scripturalism  is  the  first  mark,  then  the  exclu¬ 
sivism  just  mentioned  is  a  necessary  correlate. 
Psychologists  who  study  fundamentalisms  note 
that,  across  the  board,  fundamentalisms  draw 
people  who  have  no  tolerance  for  ambiguity,  con¬ 
tradiction,  or  paradox.  If  those  who  have  nothing 
at  stake  in  the  question  of  their  authenticity  or 
who  are  from  another  religion  find  such  phenom¬ 
ena,  fundamentalist  scholars  immediately  offer 
explanations  that  appear  to  define  away  any 
apparent  contradictions.  The  drama  of  interpreta¬ 
tion  that  excites  many  in  religion  is  closed  down  in 
the  interest  of  a  monopolistic,  unitary  interpreta¬ 
tion  within  a  community,  where  only  one  version 
of  one  story  from  one  text  is  seen  to  be  “funda¬ 
mental”  and  inerrant. 

Rejection  of  Moderates 

These  texts  may  have  vastly  different  characteris¬ 
tics.  In  Judaism  and  Islam,  fundamentalisms  are 
legalistic;  an  adherent  lives  by  the  divinely  revealed 
law.  In  Christianity,  in  part  retaining  its  Jewish 
base,  communities  thrive  on  stories  and  legalistic 
application  of  norms  inspired  by  the  texts.  When 
critics  challenge  the  integrity  of  a  sacred  text,  they 
cause  distress  within  the  groups  that  revere  the  text 
in  question.  Couple  this  with  intolerance  of  ambigu¬ 
ity,  and  it  can  be  seen  that  the  texts  impart  rules  and 
meanings  that  cannot  be  questioned.  For  this  rea¬ 
son,  in  many  fundamentalisms,  moderates  are  more 
despised  than  modernists.  The  modernist  is  an  obvi¬ 
ous  enemy.  The  moderate  knows  the  norms  of  the 


group  but  is  open  to  listening  to  other  sources  and, 
in  a  sense,  is  thus  traitorous,  which  means  giving  the 
game  away.  The  apostate,  the  questioner,  and  the 
person  at  the  margin  demand  the  most  defensive 
response  and  inspire  positive  counteraction. 

“The  Fundamentals ”  as  Weapons 

Such  action  includes  a  closing  of  the  ranks  to 
keep  those  enclosed  within  and  to  protect  against 
influences  from  without.  Fundamentalisms  can  be 
seen  as  enclaves — self-reinforcing  and  bounded 
communities.  Those  committed  to  each  other 
around  the  fundamentals  need  spiritual  ammuni¬ 
tion  to  use  against  the  enemy,  the  “other.”  This 
includes  key  doctrines,  stories,  or  descriptions  that 
are  important  to  the  believing  community  and  a 
negation  of  all  others.  To  further  the  chosen  causes, 
fundamentalisms  need  and  have  rationales  for  set¬ 
backs  and  postponements  of  various  consumma¬ 
tions,  such  as  military  victory,  the  conversion  of 
multitudes  to  the  faith,  or  the  assurance  that  God 
is  on  one’s  side. 

Confidence  in  Victory 

However  the  goals  of  the  community  are  defined, 
no  matter  how  great  the  odds  against  success  may 
be,  and  despite  setbacks  and  even  the  martyrdom 
of  some  members,  the  group  has  the  assurance  of 
ultimate  victory:  God  or  the  forces  behind  the  texts 
and  the  movement  may  not  always  be  visible  or 
clear,  but  they  cannot  withdraw  the  promises 
extended  to  the  group  any  more  than  they  could 
withhold  the  truth  in  the  texts  and  usually  the 
ancient  laws  and  meanings. 

Conversion  of  Others 

If  fundamentalists  define  “the  other”  negatively 
and  are  ready  for  defensive  and  aggressive  action, 
sometimes  lethal  in  character,  they  do  not  ordinar¬ 
ily  allow  their  exclusivism  to  be  absolute;  there  is 
an  appeal  to  others  to  whom  members  witness  and 
who  come  to  see  the  assets  that  are  enjoyed  by 
those  within  the  group,  an  appeal  that  leads  to  calls 
for  conversion.  The  concept  of  being  “born  again,” 
which  is  one  of  the  marks  of  Protestant  fundamen¬ 
talism,  is  enriched  and  enhanced  as  members 
spread  the  message,  utter  threats  to  the  future  of  the 
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resistant,  and  offer  those  who  join  rewards  that 
may  be  realized  in  this  life  or  in  a  life  to  come. 

The  Politics  of  Fundamentalisms 

Most  fundamentalisms  expect  an  eschatological 
consummation;  that  is,  their  adherents  await  an 
end  to  history  and  the  world  as  it  is  now  known 
and,  more  immediately,  an  end  to  their  present 
earthly  existence.  For  Jews,  the  ancient  expecta¬ 
tion  of  a  Messiah — the  anointed  one  who  would 
rescue  Israel  and  give  it  an  exalted  place — gets 
translated  in  the  contemporary  world  into  political 
actions  that  justify  and  impel  actions  in  the  mod¬ 
ern  state  of  Israel.  The  ultratraditionalist  Flaredim, 
orthodox  Jews  who  are  extreme  in  their  attempts 
to  fulfill  the  law,  do  not  support  the  state  of  Israel 
as  a  fulfillment  of  Messianic  dreams  and  are  not 
considered  by  friends,  foes,  or  scholars  to  be  fun¬ 
damentalists,  to  whom  they,  nevertheless,  bear 
some  likeness.  Some  version  of  rewards  or  fulfill¬ 
ment  in  the  life  to  come,  usually  offered  to  the 
community  of  Israel,  is  another  version  of  this 
eschatological  hope. 

Islamic  fundamentalism  adheres  fervently  to 
calls  to  serve  Allah  in  ways  that  will  ensure  a 
future  to  Muslims  as  individuals  and  within  the 
community.  One’s  personal  death,  particularly  if  it 
occurs  in  such  service,  is  followed  by  a  fulfillment 
in  the  life  to  come.  In  fundamentalisms,  that  ser¬ 
vice  acquires  political  and  often  military  involve¬ 
ment,  so  that  the  values  and  meanings  of  the 
Islamic  community  are  realized. 

Christian  fundamentalists  may  hold  different 
views  of  the  future  beyond  history,  but  they  do 
have  distinctive  views.  Most  of  these  are  related  to 
the  prophecy  of  a  millennium,  a  1,000-year  reign 
of  Christ,  who  will  return  to  the  earth.  Christian 
fundamentalists  also  share  with  other  Christians  a 
belief  in  a  personal  resurrection  in  the  life  to  come 
and  proclaim  one  or  another  of  the  biblical  pic¬ 
tures  of  what  such  a  realized  life  involves. 

Accenting  both  personal  and  communal  con¬ 
summation  beyond  history,  as  nonfundamentalists 
in  these  faiths  also  may  do,  led  and  leads  many  to 
be  classified  as  “otherworldly.”  That  is,  the  true 
drama  of  their  existence  transcended  an  ordinary 
this-worldly  life.  The  main  task  in  proclaiming  the 
truth  of  the  religion  was  to  convert  others,  which 
meant  to  rescue  them  from  the  travails  of  earthly 


life.  In  this  stage  of  their  development,  such  fun¬ 
damentalisms  do  not  seem  to  have  much  to  do 
with  politics.  Earlier  histories  of  their  movements 
tended  to  pick  up  on  their  otherworldly  emphasis 
and  show  them  to  be  unconcerned  with  the 
drama  of  politics.  A  typical  case  was  that  of  a 
founder  of  American  Protestant  fundamentalism, 
Dwight  Moody,  who  announced  his  program 
thus:  The  world,  he  said,  was  a  flood.  God  gave 
him  a  boat  and  told  him,  “Save  all  you  can.” 
That  was  it. 

Of  course,  that  was  not  wholly  it,  in  Protestant 
or  other  fundamentalisms.  Most  of  these  move¬ 
ments  spent  much  energy  in  inculcating  personal 
virtues  that  can  serve  as  benefits  in  the  polis,  the 
human  city,  and  the  political  order.  All  things  being 
equal,  one  would  observe  them  to  be  law-abiding 
and  conforming  to  approved  norms  of  society. 
Most  of  them  have  the  reputation  for  being  rigor¬ 
ous,  in  Christian  terms  “puritanical”  and  in  Muslim 
versions  more  so.  Fundamentalisms  stipulate  rules 
of  behavior  and  expect  and  enforce  adherence  to 
them.  They  engage  in  the  critique  of  violations  of 
what  they  take  to  be  the  norms  of  good  civil  soci¬ 
ety.  Where  the  regimes  within  which  they  live  have 
what  they  consider  to  be  positive  characteristics, 
they  are  supportive  of  civil  goals.  Most  of  them  are 
patriotic  and  quite  characteristically  can  be  mili¬ 
tant  in  support  of  their  state.  Few  fundamentalists 
are  pacifist,  and  they  believe  that  their  God  placed 
the  sword  in  the  hands  of  the  state,  so  it  is  to  be 
used  against  offenders  from  other  tribes  and  states 
and  nations.  God  is  a  warrior  God,  justifying  the 
actions  of  God’s  people  when  their  ways  are  con¬ 
gruent  with  divine  commands. 

If  it  were  possible  to  quantify  types  within 
observed  fundamentalist  groups,  it  is  safe  to  say 
that  most  of  them  concentrate  on  individual  vices 
and  virtues  just  as  they  focus  on  otherworldly  real¬ 
izations.  Yet  there  are  enormous  and  dramatic 
exceptions,  and  late  in  the  20th  century,  most  were 
no  longer  perceived  as  simply  otherworldly.  Flow 
the  turn  came  about  differs  in  various  religions  and 
nations,  but  it  is  informative  to  concentrate  on  vivid 
examples.  Thus,  American  Protestant  fundamental¬ 
ists,  living  generally  peaceful  ethical  lives  on  the 
sidelines,  rather  suddenly  changed  and,  in  alliance 
with  conservative  evangelicals,  Roman  Catholics, 
and  Jews,  formed  “moral  majorities”  or  coalitions 
that  advanced  very  pointed  political  programs  and 
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garnered  support  for  them.  One  notable  Protestant 
fundamentalist,  Jerry  Falwell,  expressed  a  ratio¬ 
nale  for  the  change.  Years  before  he  turned  politi¬ 
cal,  his  camp  criticized  Christians  who  were 
political  and  thus  were  sinful,  since  politics  was 
tainting  and  distracting.  Now,  however,  for  the 
churches  not  to  be  in  politics  was  the  sin. 

Why  the  change?  In  the  political  order  and  cul¬ 
tural  ethos,  threats  had  come  against  which  private 
fundamentalists  had  no  defense;  the  mass  media  in 
particular  were  universally  intrusive.  Fundamental¬ 
ists  responded  to  these  cultural  changes  by  urging 
those  who  shared  their  beliefs  to  work  together  to 
change  that  “corrupt”  order  through  politics. 
Fundamentalists  produced  some  political  leaders  or 
gravitated  toward  them,  exploiting  them  just  as 
they  were  often  being  exploited.  Fundamentalists 
were  not  unique  among  religious  groups  in  this 
respect.  What  was  remarkable  was  that  the  believ¬ 
ers  who  had  first  and  foremost  opposed  political 
involvement  now  mastered  many  techniques,  forms 
of  organization,  and  patterns  of  rhetoric  to  make 
gains  for  their  causes  and  themselves. 

This  turn  to  the  political  occurred  in  part  in 
response  to  specific  public  actions.  To  follow  the 
American  Protestant  example,  the  threat  of  com¬ 
munism  during  the  Cold  War  called  them  to  mili¬ 
tancy.  U.S.  Supreme  Court  rulings  on  the  legality 
of  abortion  and  the  illegality  of  state-sponsored 
prayer  in  public  schools  did  much  to  solidify  and 
motivate  their  base  and  attract  converts.  Their 
political  action  was  generally  compatible  with  the 
goals  and  means  of  other  religious  groups  in  a 
republic:  One  garnered  votes,  promoted  legisla¬ 
tion,  sought  constitutional  change  where  neces¬ 
sary,  and  “won  some  and  lost  some,”  as  is  the 
manner  in  politics,  which  involves  compromise 
and  partial  victories  or  defeats. 

Where  the  terms  of  governmental  life  are  not 
those  of  republics,  fundamentalists  have  had  to 
resort  to  other  kinds  of  tactics.  Most  visible  in  the 
final  2  decades  of  the  20th  century  and  the  begin¬ 
ning  of  the  21st  were  the  stirrings  in  Islam. 
Concurrently,  Hindu-based  fundamentalist  move¬ 
ments  in  India  engaged  in  military  and  terrorist 
action  against  Muslims,  and  vice  versa.  Still,  Islam 
produced  movements  of  such  size  and  scope  com¬ 
pared  with  other  fundamentalisms  and  attracted  so 
much  media  attention  around  the  world  that 
“Islamic  fundamentalism”  came  to  be  the  best 


known — probably  the  most  feared  by  its  enemies — 
and  the  most  inspiring  of  all  to  its  constituencies. 
Many  of  these  fundamentalists  were  theocratic,  like 
the  fundamentalist  Jews,  who  promoted  settlement 
in  the  West  Bank  to  fulfill  their  vision  of  a  Biblically 
based  map  of  the  state  of  Israel:  They  believed  that 
God  ruled  and  had  revealed  that  divine  purposes 
were  to  be  fulfilled  through  their  actions,  directed 
against  “the  infidel”  or  the  “heretic.” 

The  goal  of  some  Muslim  fundamentalists  was 
to  realize  a  dream  they  had  nurtured  for  centuries 
when  Muslims  were  not  rulers  or  where  they  had 
once  ruled  but  now  had  been  pushed  back.  What 
one  scholar,  Marshall  Hodgson,  called 
“Islamdom” — on  the  model  of  “Christendom,” 
when  and  where  Christians  had  a  domain  (-dom) — 
was  something  to  be  retrieved  from  the  infidels, 
who  had  conquered  parts  of  it  or  corrupted  all  of  it. 
The  Iranian  Revolution  under  Ayatollah  Khomeini 
in  1979  to  1980  was  of  this  character:  The  United 
States  must  be  attacked  when  and  because  this 
“Great  Satan”  supported  the  enemies  of  those  who 
would  rule  wherever  Muslims  had  once  ruled. 

In  radical  Islamism,  such  terrorism  is  justified  as 
following  in  the  light  of  divine  commands  and 
promises,  mediated  through  texts  rich  in  story,  or 
doctrine,  or  law.  Terrorism  is  thus  an  extreme 
form  of  action  beyond  politics,  since  politics 
always  involves  some  compromise,  and  fundamen¬ 
talisms  are  absolutist  and  uncompromising.  That 
is  the  characteristic  of  fundamentalism  that  is 
unsettling  to  nonfundamentalists  in  government  or 
to  those  who  inhabit  other  nations,  hold  on  to 
other  religions  and  customs,  and  would  like  to  find 
ways  to  live  with  people  of  other  religions. 

Not  all  fundamentalisms  can  be  or  need  be  pic¬ 
tured  as  static  and  unchangeable.  Some  movements 
among  them  move  on,  thanks  to  the  company  they 
keep  in  coalitions,  the  realism  that  sets  in  when 
goals  are  frustrated,  or  their  devotion  to  texts  other 
than  the  militant  or  terrorist  texts  in  Hebrew 
Scriptures,  the  Koran,  the  Christian  New  Testament, 
or  a  variety  of  Hindu  texts.  Using  them,  they  would 
preach  and  promise  peace  and  understanding. 
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Fundamentalist  Movements, 
Islamic 


The  term  fundamentalism  has  its  origin  in  a  strug¬ 
gle  that  developed  between  conservative  and  lib¬ 
eral  Protestant  Christian  theologians  in  the  United 
States  around  the  turn  of  the  20th  century.  This 
struggle  was  epitomized  in  the  publication  by  the 
Bible  Institute  of  Los  Angeles  in  the  years  between 
1910  and  1915  of  the  series  The  Fundamentals:  A 
Testimony  to  Truth,  which  in  12  volumes  detailed 
90  fundamental  beliefs  on  which  there  could  be 
no  compromise  for  a  true  Christian  believer. 
Later,  the  term  took  on  a  broader,  more  general 
meaning  of  strict  and  uncompromising  adherence 
to  and  advocacy  of  certain  basic  principles  or  a  set 
of  beliefs,  though  it  is  most  commonly  used  in 
reference  to  religious  movements  that  are  presum¬ 
ably  characterized  by  such  an  attitude  and  that 
serve  to  motivate  and  effect  aggressive  attitudes 
and  actions  toward  those  outside  the  group.  These 
include  evangelical  movements  in  the  United 
States  and  elsewhere,  fundamentalist  Jewish  set¬ 
tlers  on  the  West  Bank,  and  similar  contemporary 
developments.  In  the  international  public  sphere, 
and  not  the  least  in  the  media,  fundamentalism  is 


currently  very  often  used  to  describe  political 
movements  among  Muslim  populations  that 
legitimize  their  ideology  and  politics  through  a 
reference  to  Islam.  Other  terms  in  use  for  this 
phenomenon  are  “political  Islam”  and  “Islamism,” 
both  of  which  have  the  analytical  advantage  that 
they  carry  a  somewhat  lesser  baggage  of  precon¬ 
ceptions.  The  following,  therefore,  mostly  focuses 
on  recent  religious  and  political  developments  in 
the  Islamic  sphere. 

What  is  involved  in  the  political  utilization  of 
the  mobilizing  resources  of  the  Muslim  culture  and 
religion?  In  understanding  the  phenomenon,  it  is 
important  to  distinguish  clearly  between  two  levels 
of  analysis:  on  the  one  hand,  the  reasons  why 
political  actors  have  recourse  to  framing  their  ide¬ 
ological  and  political  discourse  with  reference  to 
semantically  potent  symbols  from  Muslim  culture, 
history,  and  religion,  sometimes  to  the  extent  of 
insisting  on  using  a  distinctly  Islamic  vocabulary, 
and,  on  the  other,  the  extremely  varied  uses  to 
which  these  same  actors  put  those  Islamic  refer¬ 
ences  in  national  and  international  politics  and  the 
variables  that  affect  their  choices. 

Reasons  for  the  Use  of  Islam  as  a 
Reference  for  Guidance  and  Meaning 

At  the  first  level,  there  are  two  central  driving 
forces  behind  the  recourse  to  Islam  as  a  reference 
for  guidance  and  meaning:  One  is  related  to  the 
defense  of  Muslim  identity  against  a  perceived  for¬ 
eign  onslaught;  the  other  is  linked  to  an  attempt  to 
resurrect  a  moral  framework  for  a  rapidly  chang¬ 
ing  society. 

Maintaining  Muslim  Identity 

In  the  context  of  the  colonial  expansion  in  the 
1 9th  and  20th  centuries,  the  symbolic  categories  and 
signs  of  the  imposed  colonial  culture  came  to  replace 
the  institutional  and  normative,  and  even  the  aes¬ 
thetic,  expression  of  the  local  Muslim  culture.  The 
“losing”  culture  found  itself  progressively  marginal¬ 
ized  and  “indigenized”  or  “folklorized.”  The  exclu¬ 
sion  of  its  symbolic  attributes  from  the  production  of 
meaning  and  more  specifically  from  expressing  val¬ 
ues  perceived  as  universal  served  dramatically  to 
underline  the  humiliating  centrality  of  the  colonizing 
culture.  While  confining  the  expressions  of  religious 
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culture  to  the  sphere  of  personal  status,  the  impor¬ 
tation  of  secularization  also  initiated  a  very  differ¬ 
ent  process  in  “the  lands  of  Islam.”  In  the 
European  context  in  general,  secularism  often 
appeared  as  a  liberation  from  religious  power  and 
as  part  of  a  modernization  that  facilitated  the  coex¬ 
istence  of  potentially  conflicting  loyalties  and  iden¬ 
tities.  In  the  Muslim  world,  where  secularism  was 
imported  by  the  colonizing  power  or  by  elites  that 
were  associated  with  it,  it  has  carried  the  sense  of 
a  denial  of  the  local  culture  and  its  right  to  satisfy 
the  normative  needs  of  the  society  in  the  direction 
of  public  affairs.  This  privilege  has  been  brutally 
transferred  to  a  foreign  culture,  not  the  least 
through  a  wholesale  importation  of  judicial  codes. 

It  is  against  this  background  of  a  symbolic  dis¬ 
possession  resulting  from  the  hegemonic  drive  of 
the  culture  of  the  colonizers  that  a  generation  of 
Muslims  in  the  period  between  the  two  World  Wars 
committed  themselves  to  restoring  the  visibility  of 
the  inherited  Muslim  culture  and  reaffirming  the 
centrality  of  its  norms.  It  seemed  to  this  generation 
that  modernization  should  no  more  be  constructed 
around  the  opposition  between  “progress”  and  “reac¬ 
tion”  or  “dominant  class”  and  “popular  masses”  but 
between  “endogenous”  and  “exogenous”  and 
“inherited”  and  “imported”  cultural  traits.  With  an 
indigenous  vocabulary,  the  Islamist  rhetoric  brings 
to  the  dynamic  of  nationalist  resistance  to  foreign 
domination  a  new  symbolic  resource.  In  rehabili¬ 
tating  the  use  of  references  that  had  for  a  time  been 
excluded  from  political  discourse,  this  rhetoric  at 
the  same  time  allows  for  the  reconciliation  of  this 
discourse  with  a  more  intuitive  culture.  This  cul¬ 
ture  is  above  all  perceived  of  as  that  of  the  ances¬ 
tors  (real  or  mythical),  with  whom  a  symbolic 
affiliation  now  again  becomes  possible.  The  resto¬ 
ration  of  the  categories  of  the  Muslim  culture  does 
not  produce  more  than  a  relative  rupture.  What  it 
does  is  reaffirm  the  ambition  of  the  endogenous 
culture  of  adding  its  contribution  to  the  expression 
of  universality  on  an  equal  footing  with  other  cul¬ 
tures  of  the  world  and  in  particular  with  that  of  the 
former  colonizers. 

At  the  start,  in  the  face  of  what  a  part  of  society 
perceived  as  a  wave  of  rampant  Westernization, 
the  Islamist  mobilization  consisted  of  reaffirming 
the  political  role  of  the  religious  endogenous  cul¬ 
ture  in  the  resistance  to  colonization.  In  1928, 
10  years  after  the  dismembering  of  the  Ottoman 


Empire,  4  years  after  the  dissolution  of  the 
Caliphate,  and  8  years  before  the  recognition  of  a 
precarious  independence  for  Egypt  (Treaty  of 
London,  1936),  the  founding  in  Ismailiyya  at  the 
Suez  Canal  of  the  Muslim  Brotherhood  by 
the  schoolteacher  Hasan  al-Banna  may  be  seen  as 
the  beginning  of  the  organization  of  this  reaction. 
In  the  context  of  the  anticolonial  struggle,  the 
Muslim  Brothers  devoted  themselves  to  reaffirm¬ 
ing  the  centrality  of  the  religious  lexicon  in  the 
fight  for  independence.  In  Egypt,  they  engaged 
simultaneously  the  internal  political  debate  and  the 
field  of  social  and  educational  work.  Adept  at 
modern  modes  of  mobilization  such  as  public 
meetings  and  the  publication  of  magazines,  they 
also  very  rapidly  developed  regional  ambitions.  In 
1948,  the  Brothers  in  Egypt  mobilized  volunteers 
to  counter  the  creation  of  Israel,  and  this  became 
the  beginning  of  a  rapid  spread  of  resistance 
throughout  the  countries  of  the  Arab  Middle  East. 
This  first  Islamist  generation,  however,  failed  to 
achieve  control  of  the  state  apparatuses  that  were 
eventually  left  vacant  by  the  colonizers.  While 
some  old  royal  dynasties  managed  to  reassert 
themselves  in  many  central  countries  of  the  Middle 
East,  the  reins  of  power  were  grasped  by  national¬ 
ists  of  a  more  secular  leaning,  typically  through 
military  coups. 

From  the  period  when  most  countries  in  the 
Middle  East  gained  independence  (1945-1970) 
until  the  1990s,  two  overlapping  processes 
unfolded:  first,  the  affirmation  and  then  the  begin¬ 
ning  of  the  challenging  of  the  indigenous  elites  that 
had  come  to  power.  The  primary  critique  the 
Islamists  directed  toward  the  Nasserists  and 
Baathists  in  power  was  that  they  suffered  from  a 
deficit  of  cultural  authenticity.  The  Islamists  called 
for  the  continuation  in  the  ideological  and  sym¬ 
bolic  sphere  of  the  distancing  from  the  colonizers 
that  had  until  then  been  focused  on  achieving  inde¬ 
pendence  in  political  and  economic  terms.  In  the 
Maghreb,  the  persistence  of  the  official  and  public 
use  of  the  French  language  made  this  rupture 
explicit:  The  secular  elites  progressively  became 
denounced  as  Hizb  Fransa,  “the  party  of  France.” 

Except  on  the  Arab  peninsula  and  in  Jordan,  the 
first  generation  of  Islamist  mobilization  was  even¬ 
tually  not  only  denied  access  to  the  legal  political 
scene  but  was  systematically  suppressed.  Even 
more  than  at  the  time  of  its  reaction  to  colonial 
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violence,  it  was  faced  with  suppression  when  the 
first  radical  ideological  and  political  offshoots  saw 
the  light  and  won  their  first  followers.  The  crush¬ 
ing  of  the  Muslim  Brothers  by  Nasser  in  Egypt  in 
1954,  after  he  had  realized  the  strength  of  their 
mobilizing  potential,  is  the  archetype  of  this 
repression.  The  “theology  of  war”  that  Sayyid 
Qutb — imprisoned  continuously  from  1954  until 
he  was  hanged  in  1966 — developed  in  response  to 
Nasser’s  torture  is  central  to  the  revolutionary  pos¬ 
ture  that  from  the  1970s  onward  was  adopted  by 
radical  groups.  To  respond  to  the  injustice  of  the 
tyrant  who  oppressed  him,  Qutb  declared  him  an 
infidel,  a  kafir,  and  thus  deprived  him  of  the  pro¬ 
tection  afforded  to  members  of  the  Muslim  com¬ 
munity.  He  further  denounced  the  tyrant  as  the 
standard  bearer  of  a  new  jahiliyya  (a  term  usually 
referring  to  the  state  of  ignorance  that  prevailed  in 
Arabia  before  Muhammad).  Then,  he  expanded 
the  scope  of  his  excommunication  and  did  the 
same  to  the  society  that  refused  to  reject  such  a 
ruler.  His  rhetoric  was  picked  up  in  diverse  forms 
by  those  who  some  years  later  (October  6,  1981) 
assassinated  the  Egyptian  president  Anwar  al- 
Sadat,  and  to  this  day,  it  continues  to  mobilize  the 
devotees  of  “direct  action”  simultaneously  against 
the  infidel  rulers  and  against  their  foreign  protec¬ 
tors.  The  Muslim  Brotherhood  also  joined  the 
opposition  forces,  which  led  to  the  downfall  of 
President  Mubarak  in  2011.  The  emergence  of  the 
radical  Qutb-inspired  groups  formed  part  of  a 
much  wider  phenomenon:  the  great  wave  of 
Islamic  awakening  that  swept  the  Muslim  world 
from  the  mid-1970s  onward  and  that  greatly  rein¬ 
vigorated  the  Islamist  movements.  The  vastly 
expanded  numbers  of  high  school  and  university 
students  were  the  main  basis  of  the  early  stages  of 
this  awakening.  Some  of  these  students  were 
attracted  to  the  violent  groups,  but  most  eventu¬ 
ally  contributed  to  swelling  again  the  ranks  of  the 
Muslim  Brothers,  who  were  by  now  staunchly 
settled  on  a  course  of  peaceful  work  for  political 
and  social  change,  except  in  the  fight  against  a 
foreign  occupier,  as  in  Palestine.  As  for  the  radical- 
ization  that  would  lead  a  fringe  of  the  wider  move¬ 
ment  to  commit  acts  such  as  the  September  11, 
2001,  attacks,  the  ideological  variable  is  most 
often  invoked  in  explaining  it,  and  emphasis  is  put 
on  the  diffusion  of  the  sectarian  theology  of  Qutb 
among  the  activists  trained  in  Afghanistan.  But  this 


inverses  the  causal  order  and  obscures  the  centrality 
of  simple  political  variables.  The  radical  drive  is  a 
result  of  the  unilateralism  of  the  existing  political 
order.  First,  there  is  that  of  the  Arab  regimes.  For  a 
whole  generation  of  opposition  members,  the  prom¬ 
ises  of  the  1980s  of  a  democratic  transition  have 
brutally  given  way  to  a  repressive  backlash.  Despite 
some  hesitant  opening  of  the  institutional  political 
sphere  in  some  countries  for  a  while,  the  generaliza¬ 
tion  of  the  use  of  torture  at  the  national  level  and  of 
the  recourse  to  “hard  power”  by  the  regional 
(Israeli)  and  global  (U.S.)  powers  worked  to  dis¬ 
credit  reformist  and  legalist  strategies  for  change 
and  thus  reinforced  the  credibility  of  the  sectarian 
currents  and  their  revolutionary  strategies. 

From  1990,  Islamist  parties  here  and  there 
found  doors  being  tentatively  and  partially  opened 
to  the  legal  political  arena.  Some  (in  Jordan, 
Algeria,  Palestine,  Febanon,  Kuwait,  Sudan,  and 
Morocco)  established  more  or  less  permanent 
footholds  in  the  parliaments — occupying  ministe¬ 
rial  posts  (Yemen)  or  even  directing  governments 
(Palestine).  But  in  the  global  arena  as  well  as  in  the 
terrain  of  the  Israeli-Palestinian  conflict  and  in 
most  national  frameworks,  these  openings  turned 
out  to  be  limited  indeed.  Almost  everywhere,  elec¬ 
tions  proved  the  fragility  of  popular  support  for 
the  regimes  and  incited  them  (in  Morocco,  Algeria, 
Tunisia,  and  Egypt  in  particular)  to  narrow  the 
limited  openings  that  had  been  granted  and  to 
define  political  competition  as  a  threat  to  security. 

The  prevailing  unilateralism  also  operated  at 
the  global  scale,  where  the  militarization  of  the 
foreign  policy  of  the  United  States  echoed  the 
regional  generalization  of  repression  and  torture. 
At  the  Sharm  al-Shaykh  summit  in  March  1996, 
under  cover  of  the  “fight  against  terrorism,”  the 
cooperation  between  the  U.S.,  European,  and 
Israeli  security  apparatuses  on  one  side  and  the 
repressive  apparatuses  of  the  Arab  regimes  on  the 
other  reinforced  the  ideological  criminalization  of 
all  resistance  to  the  existing  local  and  global  order. 

Providing  a  Moral  Framework 

But  Islamism  is  a  multifaceted  movement  rooted 
in  the  legitimate  concerns  of  broad  layers  of  the 
population.  A  sort  of  identity  politics  mobilized  to 
counter  foreign  domination  and  repressive  regimes 
allied  with  the  foreigners  is  a  central  driving  force. 


942  Fundamentalist  Movements,  Islamic 


Yet  the  success  of  Islamism  in  dominating  the 
political  and  ideological  landscape  of  popular 
activism  is  also  linked  to  another  strong  dynamic 
at  work,  which  may  be  termed  the  promotion  of 
modernization  under  pious  discipline.  Looking  at 
the  leading  activists  of  the  movements  in  question, 
one  finds  that  they  are  representative  of  dynamic 
elements  within  the  middle  and  lower-middle 
classes,  for  whom  modernization  opens  the  way  to 
upward  social  mobility.  They  have  a  positive  atti¬ 
tude  toward  economic  and  technological  modern¬ 
ization.  But  the  ensuing  social  change  is  rapidly 
breaking  up  the  old  framework  of  town  quarter, 
village,  kinship  group,  and  patron-client  relations 
in  which  people  led  their  lives.  To  counter  the 
uncertainty  and  fear  of  social  chaos  springing  from 
these  dissolving  effects  of  modernization,  it  is  not 
unnatural  that  people  from  traditional  religious 
families  look  to  Islam  to  provide  both  a  personal 
feeling  of  safety  and  confidence  and  at  the  same 
time  social  cohesion.  However,  this  call  for  re- 
Islamization  is  a  sign  that  the  traditional  Islam  of 
the  ulema  has  not  kept  up  with  the  changes.  What 
the  Islamists  do  is  to  consciously  or  unconsciously 
create  a  new  version  of  Islam  suited  to  give  moral 
guidance  to  a  changing  society  while  promoting 
economic  progress.  Their  Islam  is  marked  by  a 
stress  on  individual  responsibility,  on  the  duty  of 
the  true  believer  to  engage  in  a  constant  struggle 
against  evil  within  oneself  and  in  society,  and  to 
work  hard  to  increase  prosperity  and  welfare.  At 
the  same  time,  it  involves  new  organizational  prac¬ 
tices,  where  lay  people  collect  outside  the  tradi¬ 
tional  village  or  kinship  groups  in  associations  that 
study  the  scriptures  to  edify  the  soul  and  engage  in 
social  and  political  activism  for  the  perceived  cause 
of  God. 

Islamic  Reference  in  Social 

and  Political  Mobilization 

On  both  counts,  the  relation  between  the  dynamic 
of  re-Islamization,  on  the  one  hand,  and  the 
dynamics  of  social  modernization  and  political  lib¬ 
eralization,  on  the  other,  is  far  from  simply  one  of 
opposition.  If  a  reactive  rejectionist  approach  has 
sometimes  been  prominent,  the  complex  process  of 
rehabilitation  of  the  vocabulary  of  Islamic  culture 
seems  on  the  whole  rather  to  extend  the  field  of 
application  of  the  dynamics  of  modernization  and 


to  accelerate  its  rhythm  than  to  interrupt  it  or  to 
reverse  its  progress.  A  mobilization  centered  on 
questions  of  identity  and  morality  is  rarely  con¬ 
fined  to  particular  social  groups.  And,  taken 
together,  existing  studies  of  the  social  basis  of 
Islamist  movements  do  in  fact  show  a  great  diver¬ 
sity.  Urban  workers  figure  alongside  dignitaries  of 
the  various  bourgeoisies  (oil,  merchant,  military, 
and  tribal),  the  whole  spectrum  of  middle  classes, 
people  from  different  generations,  and  those  from 
all  spheres  of  work  and  levels  of  education.  The 
factor  of  gender  is  often  invoked,  yet  women  are 
well  represented  along  with  men  in  the  ranks  of 
those  mobilized  by  Islamism.  Within  the  social 
diversity  involved,  the  main  social  stronghold  of 
Islamists,  in  particular  where  they  remain  in  oppo¬ 
sition,  has  been  and  remains  among  the  socially 
mobile  lower-middle  and  middle  classes,  especially 
among  students  and  educated  elites.  The  leading 
activists  have  typically  been  engineers,  doctors, 
teachers,  lawyers,  and  other  members  of  the  mod¬ 
ern  professions  along  with  an  important  segment 
of  the  upcoming  business  elites.  Even  if  they  pre¬ 
serve  a  “reactive”  component  inherent  in  the  iden¬ 
tity  affirmation  that  is  part  of  the  driving  force  of 
the  movement,  the  discourse  of  Islamists  and  the 
practices  that  it  legitimates  are  not  only  varied  but 
also  continuously  evolving.  Depending  on  the 
political  context — local  or  regional — and  on  the 
practices  of  their  opponents,  they  come  forth  as 
“literalist”  or  “liberal,”  “democrats”  or  “authori¬ 
tarian,”  or  “legalist”  or  “revolutionary.” 

From  the  deposed  Taliban  regime  to  the  Turkish 
Prime  Minister  Recep  Teyyip  Erdogan,  the  same 
term  Islamist  has  served  to  designate  actors  who 
express  themselves  within  extremely  varying  doc¬ 
trinal  and  political  forms.  While  armed  operations 
are  part  of  the  picture  for  some  movements  and  in 
some  circumstances,  parliamentary  action  has  for 
some  time  massively  been  part  of  the  Islamist  rep¬ 
ertoire  of  practices.  Besides  the  developments  in 
non- Arab  Iran  and  Turkey,  electoral  success  has 
led  Islamists  to  leave  the  pure  oppositional  posi¬ 
tion  to  participate  in  and  even  lead  governments  in 
Yemen  (1993),  Jordan  (1989),  Iraq  (2005),  and 
Palestine  (2006).  For  those  involved  directly  in 
political  struggle,  it  is  a  distinctly  noticeable  trend 
over  the  past  decades  and,  in  spite  of  the  fluctuat¬ 
ing  levels  of  repression,  toward  a  more  principled 
stand  against  the  use  of  violent  means  to  further 
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the  cause  of  Islamic  reform  and  in  favor  of  a 
democratic  political  system.  Yet  again,  under  the 
identical  label  “Islamists,”  those  who  are  adept  at 
“speaking  Muslim”  may  equally  well  be  members 
of  pietistic  movements  or  new  converts  who  stay 
mostly  outside  the  political  arena,  whether  of  the 
revolutionary  or  the  legalist  kind. 

A  wide  range  of  discourses,  attitudes,  and  prac¬ 
tices  may  be  observed  as  well  in  oppositional  pos¬ 
tures  as  in  the  practices  of  “Islamic”  regimes  in 
power.  At  one  extreme  of  the  doctrinal  spectrum, 
the  expression  of  an  automated  and  indiscriminate 
rejection  of  any  Western  democratic  institution 
persists.  In  that  corner,  popular  sovereignty  is  con¬ 
sidered  as  strictly  incompatible  with  its  divine 
competitor.  At  the  opposite  end,  movements  that 
have  demonstrated  repeatedly  their  capacity  to 
win  a  majority  of  votes  (notably  in  Algeria, 
Palestine,  and  in  large  parts  of  Egypt)  have  under¬ 
taken  a  kind  of  cultural  reappropriation  of  the 
same  popular  sovereignty.  The  rewriting  of  the 
modern  categories  of  politics  with  references  from 
the  Islamic  repertoire  works  little  by  little  to  heal 
the  symbolic  fracture  caused  by  the  colonial  inter¬ 
ruption.  Religious  affirmation  and  ethical  reaction 
will  certainly  not  go  away.  But  the  religious 
resource  is  no  longer  used  to  deny  the  legitimacy  of 
the  democratic  principle,  but  on  the  contrary,  it  is 
used  more  and  more  to  affirm  it.  The  same  goes 
for  the  defense  of  human  rights  (even  increasingly, 
if  often  ambiguously,  women’s  rights),  which  have 
earlier  tended  to  be  seen  as  solely  linked  to  Western 
culture  and  thereby  disqualified. 

Two  major  movements,  the  Muslim  Brothers 
and  the  salafis ,  provide  the  often  diffuse  and  fluc¬ 
tuating  landscape  of  Islamism  with  a  structuring 
axis.  According  to  varying  national  configura¬ 
tions,  the  fluctuating  borderline  between  the  two 
does  not  follow  exactly  the  same  course  in  every 
country  or  society.  The  Muslim  Brothers  and  the 
organizations  that  have  emerged  from  it,  which 
vary  greatly  from  Tunisia  to  Kuwait,  have  for  sev¬ 
eral  decades  been  engaged,  where  they  are  given 
the  opportunity,  in  legal-political  action  and  elec¬ 
toral  competition.  It  is  the  Muslim  Brothers  that 
have  contributed  the  most  to  popularizing  the 
evolution  of  classical  Islamic  thinking  and,  nota¬ 
bly,  to  the  acceptance  of  an  autonomous  space  for 
politics  and  the  secularization  that  is  implicit  in  the 
recourse  to  the  techniques  and  the  juridical  and 


political  categories  of  the  modern  parliamentary 
scene.  Where  legal  recognition  continues  to  be 
denied  to  them  (as  in  Tunisia,  Libya,  and  Syria), 
the  activists  of  the  Muslim  Brothers  engage  in 
educational  efforts  and  devote  themselves  to 
work  in  the  associations  of  civil  society  and  in  the 
antechambers  of  parliamentary  life  (professional 
associations,  human  rights  organizations,  and 
charitable  institutions).  The  principal  alternative 
to  the  Muslim  Brothers  is  the  so-called  salafi 
movement,  a  reference  to  the  “pious  ancestors” 
who  are  seen  to  have  never  deviated  from  the 
straight  path  and  by  whose  example  this  trend 
portends  to  be  guided.  The  salafis  have  in  particu¬ 
lar  affirmed  themselves  through  a  virulent 
critique  of  the  Muslim  Brothers,  the  main  coun¬ 
terimage  of  their  own  Islamist  identity. 

Even  if  some  of  the  recruits  for  jihadi  direct 
action  come  from  salafi  ranks,  the  great  majority  of 
salafis  are  pietist  and  quietist.  It  is  outside  the  field 
of  armed  or  even  political  struggle  that  they  mani¬ 
fest  their  rupture  with  society,  through  a  cultural 
practice  by  which  they  wish  to  be  cleansed  of  all 
recent  or  ancient  exegetic  influences.  The  salafis 
radically  refuse  all  ideas  of  a  contextualization  of 
the  religious  norms.  They  consider  the  stages  in  the 
development  of  the  Muslim  Brothers  with  regard  to 
political  liberation  and  social  modernization  (and 
notably  the  formation  of  parties  or,  indeed,  organi¬ 
zational  structures  as  such)  as  so  many  illegitimate 
alterations  of  orthodoxy  (bid’ a).  Their  doctrine  and 
their  practices,  less  systematically  studied  than 
those  of  the  Brothers,  who  preceded  them  histori¬ 
cally,  are  neither  homogeneous  nor  static.  Con¬ 
versely,  the  Muslim  Brothers  are  themselves  the 
object  of  a  modernizing  critique  coming  from  more 
modernist  Islamists,  who  seek  to  accelerate  the  very 
reforming  dynamics  that  the  salafis  want  to  put  a 
brake  on  and  who  denounce  the  inflexibility  and 
doctrinal  rigidity  of  the  Muslim  Brothers. 

The  Islamist  regimes,  for  their  part,  have  con¬ 
trasting  but  nevertheless  converging  performances. 
Even  if  the  pace  is  slow,  the  development  of  an 
autonomous  political  space,  the  limitation  of 
recourse  to  repressive  violence  by  the  state,  and 
the  affirmation  of  the  place  of  women  in  the  pub¬ 
lic,  professional,  and  political  spheres  is  hardly 
moving  slower  in  Sudan,  Iran,  and  especially  in 
the  Turkey  of  the  Justice  and  Development  Party 
than  in  Algeria,  Egypt,  or  the  Tunisia  of  the 
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“defenders  of  secularism.”  In  Iran,  even  if  the  ref¬ 
erence  to  a  double  legitimacy,  popular  and  reli¬ 
gious,  has  continued  to  limit  the  real  importance 
of  the  democratic  game  and  the  mechanism  of 
parliamentary  and  presidential  elections  is  far 
from  being  definitively  sheltered  from  the  author¬ 
itarianism  of  the  Supreme  Leader  and  the  Guardian 
Council,  the  lively  debates  indicate  the  formation 
of  a  veritable  public  space.  And  the  possibility 
offered  to  the  electors  of  influencing  the  relation 
of  forces  at  the  top  layers  of  the  state  is,  even  if 
seen  against  the  turbulent  aftermath  of  the  “sto¬ 
len”  presidential  elections  of  2009,  rather  more 
credible  than  that  offered  to  their  Algerian, 
Tunisian,  or  Egyptian  counterparts. 

On  the  scale  of  the  past  half-century,  it  is  in  fact 
difficult  to  demonstrate  that  the  Islamists  have  in 
any  significant  way  broken  with  the  norm  of  insti¬ 
tutional  and  political  practices  of  their  secular  com¬ 
petitors.  Nowhere  have  they  imported  violence  into 
a  democratic  universe,  for  the  simple  reason  that,  in 
the  region  under  discussion,  such  a  universe  has  not 
yet  really  emerged.  Their  occasional  recourse  to 
political  violence  may,  most  often,  be  correlated 
with  the  violence  that  is  exercised  against  them  by 
forces  in  the  national  or  international  environment. 
The  process  of  domination  seems  thus  to  be  at  the 
heart  of  the  principal  ideological  and  strategic  tra¬ 
jectories  of  this  category  of  actors. 

Francois  Bur  gat 
IFPO  CNRS  ( Damascus ) 
Aix  en  Provence,  France 
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University  of  Oslo 
Oslo,  Norway 
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Fuzzy-Set  Analysis 


Fuzzy-set  analysis  (FSA)  is  one  branch  of  qualita¬ 
tive  comparative  analysis  (QCA).  The  idea  of 
fuzzy  sets  goes  back  to  earlier  insights  from  com¬ 
puter  science  and  formal  logic,  which  had  led  to 
the  development  of  a  fuzzy-set  approach  already 
in  the  1960s.  As  George  Klir,  Ute  St.  Clair,  and  Bo 
Yuan  (1997)  have  shown,  the  difference  between 
fuzzy  sets  and  conventional  (“crisp”)  sets  is  that 
set  memberships  are  unequivocal  in  crisp  sets.  For 
example,  the  element  “Sunday”  is  a  member  in  the 
set  of  all  days  of  the  week  (membership  value  of 
1);  whereas  the  element  “January”  is  not  a  mem¬ 
ber  of  that  same  set  (membership  value  of  0).  By 
contrast,  fuzzy  sets  have  the  characteristic  that 
nonperfect  set  membership  can  exist.  This  is  espe¬ 
cially  relevant  considering  that  social  science 
concepts  very  often  have  fuzzy  boundaries. 
Democracies  are  usually  not  perfect  democracies 
but  just  democracies  to  a  certain  degree.  Following 
this  reasoning,  most  countries  have  both  a  partial 
membership  in  the  set  of  “democracies”  as  well  as 
in  the  set  of  “nondemocracies.”  This  is  also  a  fun¬ 
damental  difference  to  crisp  sets:  In  fuzzy  sets,  one 
and  the  same  case  can  have  a  (partial)  membership 
both  in  the  original  set  and  in  the  complementary 
set.  A  dichotomy  is  implicitly  maintained,  but 
membership  degrees  render  it  possible  to  differen¬ 
tiate  for  the  degree.  QCA  as  a  technique  is  based 
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on  the  notion  of  a  set.  Since  crisp-set  qualitative 
comparative  analysis  (csQCA)  can  be  criticized  for 
requiring  dichotomous  conditions  and  outcomes, 
and  since  many  phenomena  in  the  social  sciences 
are  not  simply  dichotomous,  it  is  understandable 
why  fuzzy  sets  are  attractive  for  an  inclusion  in 
QCA.  Below,  this  entry  discusses  some  of  its 
major  features  and  current  developments. 

Both  the  dichotomous  version  csQCA  and 
fuzzy-set  qualitative  comparative  analysis  (fsQCA) 
are  based  on  an  algebra  that  is  different  from 
mainstream  linear  algebra,  as  it  is  applied  in  con¬ 
ventional  statistical  methods.  csQCA  is  based  on 
Boolean  algebra  and  fsQCA  on  fuzzy  algebra. 
However,  it  is  wrong  to  assume  that  fuzzy  algebra 
would  be  an  extension  of  Boolean  algebra;  rather, 
Boolean  algebra  should  be  seen  as  a  special  case  of 
fuzzy  algebra.  Indeed,  all  rules  for  fuzzy  algebra 
are  also  valid  for  Boolean  algebra  but  not  vice 
versa.  Dichotomies  (as  used  in  csQCA  and  for 
Boolean  algebra)  are  nothing  else  than  very 
restricted  versions  of  fuzzy  scales,  containing  just 
the  values  of  0  and  l.One  of  the  most  difficult  (and 
also  most  contested)  points  with  regard  to  FSA  is 
the  decision  about  the  attribution  of  fuzzy  values 
to  individual  cases.  More  concretely,  if  the  level  of 
democracy  of  a  given  country  has  to  be  coded,  the 
question  is  how  the  fuzzy  values  (between  0  and  1) 
are  attributed  to  single  countries,  or,  in  other 
words,  which  indicators  have  to  be  chosen  to  attri¬ 
bute  a  lower  fuzzy  value  to  one  country  and  a 
higher  value  to  another.  One  possibility  is  to  use 
already  existing  quantitative  indicators  and  to 
standardize  them  on  a  range  between  0  and  1. 
However,  as  Charles  Ragin  (2000)  notes,  such  a 
strategy  would  overlook  the  central  aspect  of  the 
coding,  namely,  that  fuzzy  values  represent  the 
degree  of  presence  of  a  concept  and  that  they  are 
based  on  theoretical  knowledge.  Furthermore, 
most  social  science  concepts  suffer  from  not  hav¬ 
ing  easily  definable  quantitative  equivalents. 
Therefore,  an  important  part  of  FSA  deals  with  the 
translation  of  qualitative  degrees  of  the  presence/ 
absence  of  a  concept  into  quantitative  measures. 
Indeed,  “[ijn  the  hands  of  a  social  scientist  [ .  .  .  J, 
a  fuzzy  set  can  be  seen  as  a  fine-grained,  continu¬ 
ous  measure  that  has  been  carefully  calibrated 
using  substantive  and  theoretical  knowledge” 
(p.  7).  This  also  means  that  in-depth  case  knowl¬ 
edge  is  the  undeniable  prerequisite  for  a  meaningful 


FSA.  Therefore,  the  coding  process  usually  comes 
more  toward  the  end  of  the  research  process. 
Recently,  Ragin  has  proposed  two  more  methods 
to  calibrate  fuzzy  values:  a  so-called  direct  and  an 
indirect  method  that  both  combine  the  (theory 
driven)  setting  of  qualitative  anchors  with  quanti¬ 
tative  standardizations  in  between  these  anchors 
(for  details,  see  Ragin,  2008,  7 1  ff ) . 

As  in  conventional  csQCA,  the  main  interest  of 
fsQCA  is  to  find  necessary  and  sufficient  condi¬ 
tions  (or  combinations  of  conditions)  to  account 
for  a  certain  outcome.  In  fsQCA,  the  rule  is  that  a 
condition  is  necessary  if  its  fuzzy  value  is  always 
greater  than  or  equal  to  the  fuzzy  value  of  the  out¬ 
come  to  be  explained.  This  goes  back  to  the  insight 
that  in  the  dichotomous  world  a  condition  is  nec¬ 
essary  if  it  exists  whenever  the  outcome  is  observed. 
The  condition  can  exist  without  the  outcome  being 
present,  but  the  outcome  must  not  exist  without 
the  (necessary)  condition  being  present.  This 
insight  from  the  dichotomous  version,  namely, 
that  the  value  of  the  condition  must  not  be  0  when 
the  value  of  the  outcome  is  1,  is  translated  to  the 
fuzzy  situation  in  which  values  between  0  and  1 
are  also  permitted:  A  condition  is  necessary  if  its 
fuzzy  value  is  not  smaller  than  the  fuzzy  value  of 
the  outcome.  Note  that  “conditions”  in  this  sense 
also  can  mean  combinations  of  conditions  that  can 
be  created  following  the  rules  of  fuzzy  algebra. 
Similarly,  sufficiency  can  be  defined:  A  condition  is 
sufficient  if  its  fuzzy  value  is  always  smaller  than 
or  equal  to  (but  not  greater  than)  the  fuzzy  value 
of  the  outcome  to  be  explained. 

Since  FSAs  very  often  comprise  data  for  a  con¬ 
siderable  number  of  cases  and  also  examine  very 
different  conditions,  a  thorough  assessment  of  suf¬ 
ficiency  and  necessity  cannot  be  made  “by  hand.” 
Therefore,  a  software  program  has  been  developed 
that  can  be  downloaded  from  the  Fuzzy  Set/ 
Qualitative  Comparative  Analysis  website.  It  is 
based  on  an  algorithm  that  attributes  the  individ¬ 
ual  cases  to  their  (dichotomous)  ideal  types  within 
which  they  have  various  degrees  of  fuzzy  member¬ 
ship  values  (remember  that  in  fsQCA  a  case  can  be 
a  member  in  more  than  just  one  set).  These  ideal 
types  are  then  assessed  for  representing  sufficient 
and  necessary  conditions  and  are  finally  mini¬ 
mized,  following  the  rules  of  Boolean  algebra  that 
are  at  the  heart  of  each  QCA  analysis.  This  analy¬ 
sis  is  summarized  in  the  so-called  Truth  Table 
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Algorithm.  In  the  end,  a  Boolean  equation  results 
(identical  to  conventional  QCA  results),  from 
which  sufficient  and  necessary  conditions  can  be 
extracted.  The  consistency  measure  indicates  how 
well  the  resulting  equation  corresponds  to  the 
empirical  data.  The  overall  coverage  measure  is  a 
measure  to  indicate  which  portion  of  the  outcome 
is  explained;  this  is  comparable  with  the  function 
of  an  R 2  in  quantitative  analysis.  Raw  coverage 
values  show  how  much  each  single  component  of 
the  solution  contributes  to  the  explanation  of  the 
outcome;  unique  coverage  values  indicate  which 
portion  of  the  explanation  of  the  outcome  is 
unique  to  each  single  component  of  the  solution 
term.  Consistency  and  coverage  are  in  a  trade-off 
relationship.  It  is  hard  to  have  optimal  values  for 
both  of  them.  Usually,  values  around  0.7  or  (even 
better)  0.8  are  considered  acceptable. 

As  Ragin  (1987)  notes,  another  problem  of  QCA 
analyses  is  how  to  deal  with  limited  diversity — that 
is,  the  fact  that  for  some  theoretical  combinations  of 
conditions  no  empirical  representations  (cases) 
exist.  If,  for  example,  a  comparative  study  among 
Western  European  countries  includes  the  conditions 
of  a  “strong  welfare  state”  and  “European  Union 
(EU)  membership,”  we  may  not  find  a  country  that 
does  not  have  a  strong  welfare  state  and  is  not  a 
member  in  the  EU.  It  can  be  argued  that  limited 
diversity  is  an  even  stronger  problem  in  fsQCA  than 
in  csQCA,  since  not  only  the  two  values  0  and  1  are 
permitted  for  fuzzy  scales  but  potentially  an  infinite 
number  of  values  and  therefore  also  an  infinite 
number  of  theoretically  possible  combinations. 
However,  in  this  case,  ideal  types  are  examined.  If 
there  are  any  ideal  types  to  which  no  single  empiri¬ 
cal  case  can  be  ideally  attributed  (because  the  case 
would  be  better  attributed  to  another  ideal  type), 
then  this  ideal  type  is  considered  a  remainder  (i.e., 
not  existing  in  the  empirical  data),  and  limited 
diversity  is  affirmed.  However,  it  is  not  limited 
diversity  in  the  same  sense  as  in  conventional 
csQCA,  since  each  case  is  a  member  in  all  the  ideal 
types  up  to  a  certain  extent.  Too  much  limited 
diversity  may  affect  the  validity  of  fsQCA  results. 
Indeed,  if  a  result  is  based  on  not  enough  empirical 
information,  its  usefulness  can  be  questionable. 
There  are  a  number  of  proposals  how  to  deal  with 
the  problem  of  limited  diversity,  but  of  course,  no 
definite  answer  can  be  given,  since  nonexisting  data 
cannot  simply  be  recuperated. 


fsQCA  can  be  applied  in  all  social  sciences  dis¬ 
ciplines.  An  overview  is  offered  on  the  homepage 
of  the  COMPASSS  group  website.  Although  it  is 
often  considered  a  small-N  method,  a  certain 
minimum  of  cases  is  required.  Otherwise,  the 
problem  of  limited  diversity  becomes  too  severe. 
By  contrast,  also  not  too  many  cases  should  be 
examined,  since  otherwise  the  intimacy  with  sin¬ 
gle  cases,  which  is  needed  for  the  calibration  of 
the  fuzzy  values,  is  lost.  Nevertheless,  fsQCA  has 
also  been  successfully  applied  to  large-N  data  sets. 

It  is  important  to  underline  that  fsQCA  is  not 
just  a  technique  of  data  manipulation  (the  “ana¬ 
lytical  moment”)  but  that  its  logic  of  set-theoretic 
relationships  influences  the  research  process  right 
from  the  beginning.  Therefore,  fsQCA  should 
never  be  applied  mechanically  but  be  integrated  in 
a  well-prepared  research  design.  Currently,  fsQCA 
scholars  work  on  the  improvement  of  the  algo¬ 
rithm  and  implement  ever  more  features.  Ad¬ 
ditionally,  a  code  of  good  practice  is  being  devel¬ 
oped  that  should  improve  advanced  applications 
of  the  method.  As  a  relatively  new  technique, 
fsQCA  risks  being  misinterpreted  in  many  ways. 

Claudius  Wagemann 
Istituto  Italiano  di  Scienze  Umane 
Florence,  Italy 
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See  also  Case  studies;  Cross-national  surveys 
Comparative  policy  analysis,  2:354-355 
Comparative  politics,  2:342-359 
definition,  2:342 

distinction  from  international  relations,  2:342,  2:357 
future  of,  2:355-357 

historical  institutionalism,  2:346-347,  2:349,  4:1037 
human  rights,  4:1104-8 
issues,  2:350-355 

legal  constitutionalism,  2:416,  2:418-420,  4:1037 

methods,  2:342-344 

political  development  field,  3:644-648 

public  office  rewards,  7:2163,  7:2166 

rational  choice  institutionalism,  2:345-346,  4:1037 

social  movements,  8:2435-2436 

sociological  institutionalism,  2:347-349 

structural  functionalism,  4:996 

success,  2:355 

theories,  2:344-349 
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typologies,  6:2040-2041,  6:2042-2050 
See  also  Regimes 

Competition,  political,  2:359-364 
alternation  in  government,  6:1828-1829 
among  parties,  2:360,  2:361,  2:363,  3:728,  6:1792, 
6:1828-1829 
definition,  2:359 

in  democracies,  2:359-361,  2:363,  2:589 
democratic  quality,  2:568 
electoral,  2:359-361,  2:363 
governmental,  2:359 
measures,  2:361 

median  voter  theorem,  2:360,  6:1981-1982,  6:2032, 
8:2417,  8:2716 

political  economy  models,  6:1981-1983 
subversion,  2:570 
theory,  2:361-363 
veto  players,  8:2706 
Competitive  democracy,  6:1956 
Complex  system  modeling,  2:364-367 
compared  to  game  theory,  2:366 
foundations,  2:364-365 
political  science  applications,  2:365-367 
Complexity,  2:364-367,  8:2581 
Compliance,  2:367-370 
definition,  2:367 
enforcement,  2:368-369 
management  approach,  2:368,  2:369 
by  states,  2:367-369 

Comprehensive  Nuclear  Test  Ban  Treaty,  3:671,  4:1242 
Compromises,  in  consociational  democracies,  2:556,  2:557 
Computational  models.  See  Models,  computational/ 
agent-based 

Computer  simulations.  See  Information  technology;  Monte 
Carlo  methods 

Comte,  Auguste,  6:1951-1952,  7:2080-2081 
Concept  formation,  2:370-382 
abstraction  levels,  2:375 
analytic  frames,  2:376-377 
definition,  2:375 
disorders,  2:372-375 
empirical  boundaries,  2:376 
innovation,  2:380-381 
language  acts,  2:370-371 
Ogden-Richards  triangle,  2:372 
in  political  science,  2:370,  2:380-381 
reconstruction,  2:375 
semantic  fields,  2:376 
structures,  2:377-378 
Concepts 

abstraction  levels,  2:375 
ambiguity,  2:373 
contested,  2:379-380 
cores,  2:377 

definitional  defects,  2:374 
diminished  subtypes,  2:378-379 
family  resemblances,  2:377-378 
instability,  2:374 
meanings,  2:371,  2:375,  2:380 


opacity,  2:372-373 
reification,  2:371-372 
semantic  constraints,  2:381 
semantic  fields,  2:381 
structures,  2:377-378 
vagueness,  2:373-374 
Conceptual  abuse,  2:374-375 
Conceptual  confusion,  2:373-374 
Conceptual  disorders,  2:372-375 
Conceptual  lumping,  2:375 

Concert  of  Europe,  4:1011,  5:1632,  7:2386,  8:2677 
Conditionality,  2:382-384 
aims,  2:382-383 
definition,  2:382 
effectiveness,  2:383-384 
evolution,  2:383 
foreign  aid,  2:383,  4:1040 
legitimacy,  2:384 

structural  adjustment  packages  and  loans,  2:382-383, 
2:384,  4:1016,  8:2756-2757 
utility  calculations  and,  2:382-383 
welfare  policies,  8:2745,  8:2751 
Conditions,  necessary  and  sufficient,  2:384-388,  7:2175 
Condorcet,  Jean-Antoine-Nicolas  de  Caritat,  Marquis  de, 
3:809,  7:2203,  8:2417 
Condorcet  cycle,  8:2416 
Condorcet  efficiency,  3:754 
Condorcet’s  paradox,  3:709 
Conference  on  Disarmament  (CD),  3:671 
Conference  on  Security  and  Cooperation  in  Europe  (CSCE), 
8:2461 

Configurational  comparative  methods  (CCMs),  2:388-391 
applications,  2:389,  2:390 
assumptions,  2:388-389 
causality,  2:388-389 
debates  on,  2:390 
definition,  2:388 

further  developments,  2:390-391 
goals,  2:388-389 
techniques,  2:389-390 
See  also  Qualitative  comparative  analysis 
Confirmatory  factor  analysis  (CFA),  3:881,  3:882-884, 
5:1524,  8:2554 

Conflict  resolution,  2:391-394 

civil  wars,  1:268-269 
definition,  2:391 
between  democracies,  3:621 
mediation,  5:1545-1548 
participants,  2:393 
peaceful  means,  5:1545,  6:1759 
superpower  interventions,  2:396 
theoretical  approaches,  2:392-393 
third-party  roles,  6:1841 
Conflict  theory,  2:396,  6:2027-2028,  6:2029 
Conflicts,  2:394-399 
armed,  8:2723-2725,  8:2726-2729 
classifications,  2:394-395 
definition,  2:394,  8:2727 
dimensions,  2:3  95-396 
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nonstate,  8:2724,  8:2727 
origins,  2:394 

possible  outcomes,  2:391-392 
religious,  5:1351,  6:1720,  7:2262 
research,  2:397-398 
in  state  of  nature,  2:447 
strategies,  2:396,  2:397 
territorial,  8:2589,  8:2727,  8:2728 
theoretical  approaches,  2:396-397 
violent,  2:397-398 

See  also  Civil  wars;  Ethnic  conflicts;  Revolutions; 

Violence;  Wars 

Confounding  variables,  5:1581-1582,  5:1597, 
7:2181-2182,  8:2699,  8:2701 
Confucianism,  2:399-402 
capitalism  and,  1:192 
in  China,  1:163 

compatibility  with  democracy,  2:401 
good  government  principles,  2:399^401 
moral  education,  2:400 
political  statements,  6:2052 
view  of  politics,  ldiii 
Confucius,  2:399,  2:400,  2:401,  6:2051 
Congo  crisis,  8:2681 
Congress,  U.S. 

agenda-setting,  1:50 
apportionment,  3:751,  3:754 

delegation  to  bureaucracy,  2:548-550,  7:2128,  7:2203 
impeachment  process,  8:2405 
powers,  1:18-19,  8:2405 
Senate  Intelligence  Committee,  4:1057 
war  powers,  8:2406,  8:2407 
Congress  of  Vienna,  8:2450,  8:2677 
Conner,  Mark,  1:95 

Conscientious  objectors,  6:1721,  6:1758 
Consensus,  2:402-404 
definition,  2:402 

deliberative  policy  making,  2:551 

domain,  2:402,  2:404 

on  foreign  policy,  3:695 

in  international  organizations,  4:1253 

overlapping,  2:402-403 

political  applications,  2:404 

public  opinion  and,  7:2168 

See  also  Deliberative  democracy;  Election  by  lot; 

Groupthink 

Consensus  democracy,  2:402,  2:403,  2:555-556, 

2:564,  2:587 

See  also  Democracy,  consociational 
Consequentialism,  3:824 

Consequentialist  libertarianism,  5:1442,  5:1444-1445 
Consequentiality,  logic  of,  5:1475 
Conservatism,  2:405-410 
British  sources,  2:405-407 
classical,  2:407-408 
critique  of  liberalism,  2:405,  2:410 
definition,  2:405 
as  ideology,  2:405 
individualism,  4:1176-1177 


main  currents  of  thought,  2:407-410 
as  philosophy,  2:405 

See  also  Christian  Right;  Fundamentalisms; 
Neo-conservative  idealism 
Conservative  dictatorships,  3:656 
Conservative  parties,  2:410-413 
British,  2:406,  2:411-412,  6:1801,  6:1802 
definition,  2:410 
electoral  performance,  2:412 
in  Europe,  2:412-413 
evolution,  2:411 

organization,  2:411-412,  6:1801 
political  principles,  2:410-411 
privatization  policies,  7:2133 
role  in  party  systems,  2:412 — 413,  5:1427 
social  bases,  2:412,  6:1794 
See  also  Christian  Democratic  parties 
Considerant,  Victor,  8:2450 
Consistency.  See  Attitude  consistency 
Consociational  democracy.  See  Democracy,  consociational 
Consolidation,  democratic.  See  Democratic  consolidation 
Constitution,  U.S. 

Bill  of  Rights,  2:419,  5:1430,  6:1720,  6:1721,  7:2319, 
7:2337 

Establishment  Clause,  1:236 

executive  branch,  3:864 

federalism,  3:896 

Fifth  Amendment,  7:2138 

First  Amendment,  1:236,  3:825,  6:1720,  6:1721 

goals,  6:1723 

judicial  review,  5:1373 

liberalism,  5:1440 

presidential  system,  7:2123 

representation,  7:2278-2279 

republic,  7:2289-2290 

separation  of  powers,  1:4,  1:6,  1:10,  1:17-19,  8:2403, 
8:2404,  8:2405-2407 
Twenty-Second  Amendment,  3:864 
Constitutional  engineering,  2:413-416 
amendments,  7:2231 
constraints,  2:413,  2:414,  2:415 
definition,  2:413 
duration,  2:415 
participants,  2:414-415 
stages,  2:413 

See  also  Separation  of  powers 
Constitutional  monarchy,  5:1613,  5:1614,  7:2290 
Constitutional  patriotism,  5:1649,  5:1657,  6:1833-1834 
Constitutionalism,  2:416-421 
constraints  on  government,  6:1722-1723 
definition,  2:416 
democratic,  2:353 
goals,  2:416,  6:1722-1723 
identity  building,  5:1649 
international,  4:1243 
judiciary,  5:1383-1384 
legal,  2:416,  2:418-420,  4:1037 
legislative  voting  rules,  8:2718,  8:2719-2720 
liberalism  and,  5:1429,  5:1430,  5:1431,  5:1442 
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liberty  and,  6:1720 
in  multiethnic  states,  5:1649 
normative  claims,  6:1722-1723 
party  systems,  2:418 
political,  2:416-418,  2:419-420 
processes,  5:1649,  7:2340 
republican,  2:417 
rights,  2:419,  7:2337 

rule  of  law  and,  7:2336-2337,  7:2338,  7:2340 
statism  and,  8:2513 
See  also  Separation  of  powers 
Constitutions 
cabinets,  1:184 
Chinese,  3:595,  3:596 
church-state  relationships,  1:235-236 
in  consociational  democracies,  2:555 
English,  1:10,  1:18,  5:1614,  8:2404 
French,  7:2227,  7:2337,  8:2403 
as  institutions,  4:1203 
Construct  validity,  5:1515-1517 
Constructionism,  2:421 
Constructivism,  2:421-425 
contextual,  2:422 
definition,  2:421 

discursive  institutionalism  and,  3:684 
on  ethics,  3:826-827 
frames,  2:422^123,  6:1924 
globalization,  4:976 
identity,  4:1136,  7:2113 
influence,  2:425 
international  law,  4:1241 
intersubjectivity,  2:428 
methodologies,  3:798 

mutual  constitution  of  structures  and  agents,  2:428-429 

perspective  on  crime,  2:423-424 

policy  analysis,  6:1898-1899 

policy  models,  6:1879-1881 

religion,  7:2259 

in  social  sciences,  2:421-424,  6:1879 
sovereignty,  8:2471 
strict,  2:422 

strong  and  weak,  6:1879 
terminology,  2:421 
view  of  reality,  3:796-798,  6:1879 
See  also  Discourse  analysis 
Constructivism  in  international  relations,  2:425-431 
anarchic  world,  1:79,  4:1292 
building  blocks,  2:428-430 
contrast  to  realism,  2:424 
development,  2:426^128,  2:430 
domestic  politics  and,  3:694-695 
double  hermeneutics,  2:425,  2:429^130 
epistemic  communities,  3:787-791 
interests,  2:426 

international  political  economy,  4:1262 
international  regimes,  4:1273,  4:1274 
irredentism,  5:1348 
methodological  issues,  2:427-428 
national  interests,  5:1651-1652 


norms,  2:424-425 
ontological  innovations,  2:427 
peace,  6:1837 

power,  7:2110,  7:2112-2113 
premises,  2:425-426 
rise  of,  2:426 
security  studies,  8:2550 
subdivisions,  4:1292 
transatlantic  relations,  8:2655 
Containment,  2:431-433 
alliances,  2:432 
alternatives,  2:433 

during  Cold  War,  2:431-432,  2:433,  4:1280 
engagement  and,  2:433 
objectives,  2:431 
origins  of  concept,  2:431-432 
in  post-Cold  War  period,  2:432-433 
of  rogue  states,  2:432 
Content  validity,  5:1514-1515 

Contentious  participation.  See  Participation,  contentious 
Contested  concepts,  2:379-380 
Continental  Congress,  8:2404-2405 
Contingency  theory,  2:433-436 
definition,  2:434 

explanations  of  organizational  performance,  2:434 
fits  and  misfits,  2:434-436 
task  uncertainty,  2:434 
variables,  2:434 

Continuous  variables,  8:2698-2699 
Contract  theory,  2:436-447 
consent  and,  6:1723,  6:1724 
contemporary,  2:442 
critiques  of,  2:445-446,  6:1723,  7:2334 
definition,  2:436 
gender,  2:443-444 
history,  2:438-442 
individualism  and,  2:443^144 
modernity  and,  2:437,  2:438,  2:445,  2:446 
moral  dimension,  6:1723-1724 
necessity  of  contract,  2:442-443 
on  society,  2:437-438 
state  of  nature,  2:440^442,  2:444 
two  pacts,  2:445 
See  also  Social  contract 

Converse,  Philip,  1:145,  3:726,  5:1502,  6:1807,  6:2004 
Cooley,  Charles  Horton,  4:1132 
Coombs,  Clyde,  7:2355 
Cooperation,  2:447-449 
areas,  2:447-448 
in  communities,  2:447,  2:449 
complex  system  modeling,  2:365 
definition,  2:447 

developing  and  sustaining,  2:448-449 

distinction  from  collaboration,  2:297 

game  theory  models,  2:448-449,  4:1251,  7:2130-2132 

international,  4:979,  4:981,  4:1012-1013,  7:2382-2383 

transaction  costs,  8:2648 

See  also  Democracy,  consociational;  Intergovernmentalism; 

Multilateralism;  Security  cooperation 
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Coordination,  2:449-458 
during  crises,  2:495 
definition,  2:449—450 
distinction  from  collaboration,  2:297 
of  economic  policy,  3:703-704 
effectiveness,  2:457 

by  executive,  2:449-457,  3:866,  4:1161,  7:2215 

goals,  2:450-451 

history,  2:45 1-452 

of  implementation,  4:1161-1162 

informal  governance,  4:1020 

instruments,  2:451,  2:456,  2:457,  7:2205-2206 

means,  2:45 1 

normative  theory,  6:2057-2058 
opposing  principles,  2:454-45 6 
organization  theory,  2:449-457 
political  importance,  2:452 
rational  choice  perspective,  7:2205-2206 
strategic  approaches,  2:456-457 
theories,  2:452-454 
types  and  levels,  2:456-457 
See  also  Cooperation;  Multilateralism 
Copenhagen  School,  4:1302,  4:1303,  7:2376,  8:2549 
Coproduction,  1:180,  4:1020 
Copyright,  7:2141-2142 
Corbin,  Juliet,  4:1057 

Core  executive  models  of  government,  4:1039 
Cornell  Series  in  Political  Economy,  4:1262 
Cornes,  Richard,  7:2159 
Corporate  oligarchy,  3:766 

Corporate  social  responsibility,  5:1643,  8:2463,  8:2576 
Corporations.  See  Business  interest  groups;  Firms; 

Multinational  corporations 
Corporatism.  See  Corporativism 
Corporativism,  2:458-463 
advocacy  activities,  1:31-32 
authoritarian,  2:462—463 
Catholic  concepts,  2:461 
contemporary,  2:459,  5:1670-1673 
criticism  of,  2:458—459,  5:1668 
definition,  2:458 
Durkheim  on,  3:698-699 
functional  democracy,  2:589-590 
history,  2:459-461,  5:1668 
interest  intermediation,  6:1875 
medieval,  2:459—460 
model,  2:459 

modern  ideologies,  2:459,  2:461,  5:1485 
policy  formulation,  6:1920-1921 
regimes,  1:110,  4:1222,  6:2043 
secular,  2:461-462 
social  insurance,  8:2743 

See  also  Governance  networks;  Neo-corporatism 
Correlation,  2:463-468 
causation  and,  2:385,  2:464,  2:465,  7:2116 
coefficients,  2:467-468 
contemporaneous,  6:1762 
definition,  2:463—464 
history  of  concept,  2:464 


nonparametric,  2:467 
partial,  2:465 

Pearson’s  r,  2:464-465,  2:466 
scatterplots,  2:464,  2:465 
sensitivity  analysis,  5:1596 
serial,  6:1761-1762,  8:2617-2618 
spatial,  2:522-525 
spurious  relationships,  2:465 
See  also  Model  specification 
Correspondence  analysis  (CA),  2:468-474 
definition,  2:468 
maps,  2:470-471,  2:473 
multiple,  2:471—474 
profiles,  2:469-471 
simple,  2:468-471 
Corruption,  2:474-480 
categories,  2:475-477 
causes,  2:478-479,  4:1017-1018 
clientelism,  1:283 
control  of,  6:1972-1973 
country  rankings,  2:478 
cures,  2:479—480 
definition,  2:474-475 
effects,  2:479 

electoral,  2:475,  2:478,  6:1804 
embezzlement,  2:476 
measures,  2:478,  7:2235 
in  natural  resources  management,  5:1667 
in  party  finance,  6:1804 
patronage,  1:283,  1:286 
patterns,  2:477—478 
political,  1:283,  1:286,  2:474-479 
rule  of  law  and,  7:2340-2341 
tax  evasion  and,  8:2586-2587 
See  also  Governance,  good 
Corruption,  administrative,  2:480-483 
causes,  2:481 
consequences,  2:481 
definition,  2:480-481 

distinction  from  political  corruption,  2:480—481 
measurement,  2:481 
reduction  efforts,  2:481—483 
theories,  2:482 
Cortright,  David,  7:2353 

Cosmopolitan  democracy,  2:580,  2:593,  5:1424, 
6:1734-173 6,  6:2060 
Cosmopolitan  interventionists,  5:1438 
Cosmopolitanism,  4:1276,  5:1436,  6:1733,  6:1834,  6:1994, 
8:2466 

Cost-benefit  analysis  (CBA),  2:483-486 
definition,  2:483 
discounting,  2:484-485 
of  foreign  aid,  3:912 
framework,  2:483 
government  use,  2:483-484 
nonmarket  values,  2:485-486 
partial  and  general  equilibrium  analysis.,  2:484 
process,  2:484-486 
public  goods  and,  7:2160 
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risk  and  uncertainty,  2:485,  7:2324 
standing,  2:484 

Council  for  Mutual  Economic  Assistance  (CMEA,  or 
Comecon),  8:2458 
Council  of  Europe,  5:1374 
Council  of  Financial  Stability,  4:1247 
Counterespionage  (CE),  4:1212 
Counterfactual  models,  1:204-205,  2:387 
Counterintelligence  (Cl),  4:1212,  4:1213 
Counterrevolutionary  dictatorships,  3:656 
Counterterrorism 

international  conventions,  4:1240 
international  coordination,  8:2593 
network  analysis,  5:1690 
policies,  8:2593 

See  also  Security  apparatus;  Terrorism,  international;  War 
on  terror 

Country  risk,  6:2012 

Coups,  3:657,  3:658,  3:763,  5:1574,  5:1576,  5:1577 
Courts.  See  International  courts;  Judicial  systems;  Judiciary; 
Supreme  Court,  U.S. 

Cox,  Gary  W.,  8:2713,  8:2715-2716,  8:2717 
Cox,  Gray,  7:2080 

Cox,  Robert,  4:1072,  4:1263,  4:1264,  4:1292 
CPC.  See  Chinese  Communist  Party 
CPRs.  See  Common  pool  resources 
CPSU.  See  Communist  Party  of  the  Soviet  Union 
CR.  See  Christian  Right 
Credible  commitment,  2:486-489 
definition,  2:486 
delegation,  2:487-488 
deterrence,  2:486-487 
of  negotiated  settlements,  6:1841 
nonmajoritarian  institutions,  2:487-488 
principal-agent  theory  and,  7:2129 
programming  behavior,  2:487 
Crime 

constructivist  view,  2:423-424 
corruption  and,  2:477,  2:479 
election  fraud,  3:722,  3:723 
organized,  7:2379,  7:2380,  8:2596 
political  issues,  6:1878 
by  terrorist  groups,  8:2594,  8:2596 
war  crimes,  4:1110,  4:1111,  4:1112,  8:2665 
See  also  Genocide;  Police 
Crimes  against  humanity,  4:968-969,  5:1336 
Criminal  law,  4:1239-1240 
See  also  Judicial  systems;  Law 
Crises,  2:489-494 
in  capitalism,  1:192 
constitutional  reforms,  2:415 
coordination  during,  2:495 
definition,  2:489,  2:494 
legitimacy,  3:623,  3:624 
lessons,  2:496 

market  failures  and,  5:1491-1492 
natural  disasters,  3:668 
opportunities  for  conflict,  1:268 
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Game  Theory 


Game  theory  is  the  study  of  rational  behavior  in 
situations  of  interactive  decision  making — that  is, 
situations  in  which  two  or  more  individuals  make 
decisions  that  jointly  determine  an  outcome  about 
which  the  participants  have  differing  preferences 
or  information.  As  a  theory,  it  aims  to  articulate 
the  criteria  and  implications  of  such  decision  mak¬ 
ing,  defining  principles  of  idealized  strategic  choice. 
These  principles  can  then  be  used  to  clarify  explan¬ 
atory  and  normative  concepts  of  human  interac¬ 
tion.  An  experimental  branch  examines  the  extent 
to  which  real  decision  makers  in  a  laboratory  set¬ 
ting  realize  those  principles  and  identifies  regular 
patterns  of  departure  from  them.  As  an  empirical 
tool,  game  theoretic  models  of  social  phenomena 
yield  empirical  predictions  suitable  for  testing  with 
observational  data.  Originally  applied  to  econom¬ 
ics,  game  theory  has  found  application  across  the 
social  sciences,  as  well  as  in  biology  (where  in 
effect  the  “selfish  gene”  and  natural  selection 
replace  individual  strategic  decision  making)  and 
literary  theory  (where  game  theory  principles  have 
been  used  to  explicate  portrayals  of  conflict  and 
cooperation,  as  well  as  the  interaction  between 
author  and  reader). 

Following  scattered  preliminaries  during  the 
period  from  f  910  to  1927,  game  theory  was  essen¬ 
tially  created  by  the  mathematician  John  von 
Neumann  in  a  paper  published  in  1928  defining 
strategic  play  in  finite  games  of  pure  conflict  with 
two  players.  In  collaboration  with  the  economist 


Oskar  Morgenstern,  von  Neumann  authored 
Games  and  Economic  Behavior  (published  in 
1944),  further  developing  the  theory  and  beginning 
its  application  to  problems  of  economic  interaction. 
Rapid  theoretical  development  followed  World 
War  II,  and  the  theory  made  its  first  scattered 
appearances  in  the  political  science  journals  in  the 
1950s.  By  the  1980s,  game  theory  had  become  the 
primary  model-building  tool  of  rational  choice 
political  science,  with  applications  in  virtually  all 
subfields.  International  relations  saw  some  of  the 
earliest  use  of  game  theory,  which  came  to  be  espe¬ 
cially  common  in  the  study  of  crisis  bargaining  and 
the  inception  of  war.  The  scholars  of  U.S.  politics 
were  also  early  adopters,  applying  game  theory 
extensively  to  legislative  processes,  electoral  compe¬ 
tition,  and  even  school  budget  referendum  politics. 
Many  of  the  American  politics  applications  were 
generalized  to  similar  questions  concerning  legisla¬ 
tive  and  electoral  politics  in  comparative  perspec¬ 
tive,  but  game  theoretic  applications  to  coalition 
formation,  class  relations,  and  ethnic  politics  were 
pioneered  in  comparative  politics.  In  political  the¬ 
ory,  game  theoretic  tools  were  proving  useful  for 
gaining  new  normative  understanding  in  areas  such 
as  collective  action  and  constitutionalism. 

Game  theory  itself  consists  of  two  main 
branches:  (1)  noncooperative  and  (2)  cooperative. 
Noncooperative  game  theory  focuses  on  individ¬ 
ual  decisions,  dwelling  on  problems  of  strategic 
moves  and  incentives,  and  attempting  to  explain 
not  only  conflict  and  competition  but  also  the 
attainment  of  commitment,  coordination,  or  coop¬ 
eration  in  groups.  This  entry  dwells  mostly  on  the 
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noncooperative  branch,  constructing  in  detail  the 
two  major  formulations  used  there,  namely,  the 
strategic  and  sequential  forms;  the  major  theories 
of  how  each  form  of  game  is  solved;  and  the  cor¬ 
respondences  between  them.  Cooperative  game 
theory  focuses  on  the  abilities  and  actions  of  coali¬ 
tions,  often  assuming  that  coordination  and  coop¬ 
eration  can  be  attained  as  needed  and  asking  how 
they  will  be  employed  to  realize  and  distribute 
gains  or  losses.  Cooperative  game  theory  may  be 
taken  to  subsume  not  only  coalition  games  but 
also  the  foundations  of  social  choice  theory  and 
voting  theory,  although  the  latter  have  important 
overlaps  with  noncooperative  theory  as  well.  The 
final  section  summarizes  the  basic  features  of 
the  fundamental  cooperative  game  formulation, 
the  characteristic  function  form. 

Strategic  Games 

The  strategic  game  formulation  portrays  two  or 
more  players  choosing  simultaneously  and  inde¬ 
pendently  among  (possibly  different)  sets  of  feasible 
strategies.  Most  simply,  a  strategy  may  represent  a 
simple  action:  the  casting  of  a  vote,  for  example. 
More  generally,  however,  each  strategy  may  encom¬ 
pass  a  complex  scheme  of  conditional  actions,  reac¬ 
tions,  and  randomizations,  as  if  chosen  entirely  in 
advance  and  then  automatically  implemented.  The 
techniques  involved  in  solving  strategic  games  play 
important  roles  in  analyzing  sequential  games  as 
well.  Political  scientists  have  applied  strategic 
games  to  a  wide  variety  of  political  interactions.  In 
some,  the  choices  are,  for  all  intents  and  purposes, 
literally  simultaneous — secret-ballot  voting,  for 
example.  More  often,  the  model  is  used  to  under¬ 
stand  choices  with  an  important  simultaneous 
aspect,  such  as  candidate  or  party  electoral  compe¬ 
tition  and  international  military  confrontations. 
Strategic  games  have  also  proven  highly  useful  in 
the  conceptualization  of  problems  important  in 
political  theory  and  political  economy,  such  as 
commons  problems,  free  rider  problems,  and  the 
role  of  coordination  in  the  formation  and  interpre¬ 
tation  of  constitutions. 

General  Formulation 

Three  elements  specify  an  elementary  strategic 
game: 


f.  a  set  of  players,  designated  for  convenience  as 
N  or  {1,  2, .  .  . ,  n],  with  individual  players 
labeled  i,  j,  and  so  on; 

2.  a  number  of  strategy  sets,  one  for  each  player, 
with  /’ s  strategy  set  denoted  S-;  and 

3.  for  each  player  i,  a  real-valued  utility  function  ui 
defined  on  the  set  of  all  outcomes  of  the  game. 

For  games  of  perfect  information,  all  players 
are  assumed  to  know  the  full  description  of  the 
game:  who  the  players  are,  what  their  strategic 
opportunities  are,  and  what  their  utility  functions 
are.  The  players  are  each  to  choose,  simultane¬ 
ously  and  independently,  one  strategy  from  their 
respective  strategy  sets.  One  thinks  of  the  resulting 
strategy  profile  s  —  (s15  s2,  .  .  .  ,  sn )  G  S  as  identify¬ 
ing  an  outcome,  and  the  utility  function  u:  on  S 
represents  the  preferences  of  player  i.  Traditionally, 
the  utility  derived  by  player  i  when  the  players 
choose  strategy  profile  s,  ufs),  is  called  player  V s 
payoff  from  s.  (In  many  concrete  applications, 
distinct  strategy  profiles  may  lead  to  identical  sub¬ 
stantive  “outcomes,”  such  as  “Candidate  A  wins 
the  election”).  Finally,  it  is  often  convenient  to 
write  s_f  for  (sv  s2,  .  .  .  ,  s,_„  s,+1,  .  .  .  ,  sj,  the  pro¬ 
file  of  all  players  other  than  i,  and  to  let  (s; ,  s_,) 
denote  s  as  well.  The  utility  function,  by  defini¬ 
tion,  is  meant  to  capture  completely  the  prefer¬ 
ences  of  a  player.  This  includes  any  considerations 
the  player  may  give  to  attaining  some  sort  of  win¬ 
ning  margin  over  other  players,  to  avoiding  or 
seeking  risk,  or  to  such  considerations  as  altruism, 
revenge  seeking,  and  the  like.  Game  theory  is  not 
inherently  a  theory  of  selfish  preferences  or  mate¬ 
rial  preferences;  however,  its  applications  have 
traction  only  insofar  as  correct  assumptions  about 
player  preferences  are  employed. 

The  analysis  of  some  games  involves  the  use  of 
probabilities  and  expected  utility,  sources  of  which 
are  examined  below.  In  the  absence  of  such  fea¬ 
tures,  only  the  players’  preference  orderings  over 
outcomes  matters  in  the  analysis  of  strategic  play. 
If  probability  is  involved,  however,  then  payoffs 
must  be  cardinal,  in  the  sense  of  being  von 
Neumann-Morgenstern  utility  functions  based  on 
preferences  over  “lotteries”  of  outcomes.  Even  so, 
applying  any  positive  linear  transformation  of  any 
player’s  utility  function  does  not  affect  the  analysis 
of  strategic  play.  Usually,  game  theory  avoids  the 
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comparison  of  payoff  values  across  players  for 
exactly  the  same  reason  that  such  interpersonal 
comparisons  of  utilities  are  eschewed  in  microeco¬ 
nomic  theory:  The  absolute  numbers  are  taken  to 
have  no  objective  meaning,  apart  from  the  ordinal 
preferences  (whether  over  outcomes  or  lotteries) 
from  which  they  are  derived.  When  the  number  of 
players  and  the  number  of  strategies  are  small,  a 
strategic  game  can  be  conveniently  described  with 
a  payoff  matrix,  which  often  also  gives  meaningful 
labels  for  the  players  and  strategies  in  a  particular 
model.  For  example,  Prisoners’  Dilemma  is  often 
represented  as  shown  in  Table  1  where,  for  exam¬ 
ple,  if  Player  1  chooses  “Defect”  and  Player  2 
chooses  “Cooperate,”  the  payoffs  are  indicated  by 
the  two  numbers  in  the  bottom-left  cell:  5  for 
Player  1  and  0  for  Player  2.  The  payoff  result  in  a 
cell  is  often  written  as  (5,  0). 


Rational  Play  in  Strategic  Games 

The  main  concern  of  game  theory  is  to  define 
what  it  means  for  players  to  choose  rationally  in 
such  an  environment  of  interactive  decision  mak¬ 
ing.  A  basic  concept  for  defining  rational  choice  is 
the  best  response-.  Given  that  the  players  other 
than  i  choose  the  strategies  s_„  a  best  response  to 
s_i  is  a  strategy  s;  in  St  that  maximizes  ufst,  s_i).  If 
S:  is  not  finite  (as  is  common  in  many  games  of 
bargaining  and  of  electoral  and  legislative  compe¬ 
tition),  additional  assumptions  are  needed  to 
ensure  existence  of  a  best  response.  Notice  that 
there  may  be  more  than  one  best  response,  each 
giving  the  same  utility.  In  that  case,  each  best 
response  is  called  a  iveak  but  not  a  strict  best 
response.  (Note,  however,  when  authors  speak  of 
the  set  of  player  V s  weak  best  responses  to  s_(,  they 
mean  to  include  the  strict  best  responses  as  well: 
that  set  is  a  singleton  when  there  is  only  one  best 
response  to  s_p  a  multiple-element  set  when  there 

Table  I  A  Typical  Payoff  Matrix:  Prisoners’  Dilemma 


Cooperate 

Player  1 


Player  2 


Cooperate  Defect 


3 

3 

5 

0 

0 

5 

1 

1 

is  more  than  one  best  response — and  the  empty  set 
if  there  is  no  best  response  due  to  an  infinite  Sr ) 

The  standard  formulation  of  rational  strategic 
play  is  Nash  equilibrium.  A  strategy  profile  s'  is  a 
Nash  equilibrium  if  and  only  if  for  each  i,  s,  =  s' 
maximizes  ui(si,  s’_:)  over  Sr  In  general,  Nash  equi¬ 
libria  can  be  identified  by  deriving  the  correspon¬ 
dence  of  best  responses  of  each  i  to  all  possible  s_p 

Bfs-j)  —  [s,  £  S,ls,  maximizes  uj  (sp  s^,)} 

and  then  finding  the  intersection  of  those  corre¬ 
spondences,  that  is, 

{(s15  s2,  .  .  . ,  sjlfor  each  i,  s,  £  BfsJ}. 

This  definition  satisfies  two  criteria  for  strategic 
rationality.  First,  each  player  chooses  a  best 
response  to  the  choices  he  or  she  anticipates  the 
other  players  will  make,  while  anticipating  that  the 
other  players  too  are  strategically  rational.  This 
criterion  alone  defines  a  weaker  solution  concept, 
a  rationalizable  strategy  profile.  As  we  will  see, 
however,  this  may  or  may  not  result  in  outcomes 
in  which  each  player  has  chosen  a  best  response  to 
the  actual  choices  of  all  other  players. 

Second,  Nash  equilibrium  satisfies  a  criterion 
of  what  one  might  call  rational  experienced  play. 
This  criterion  is  embodied  in  the  assumption  that 
each  player’s  anticipation  of  the  others’  choices 
actually  matches  those  choices.  Suppose  groups 
of  n  individuals  in  a  population  of  much  larger 
size  are  randomly  combined  from  time  to  time  to 
play  a  single  iteration  of  the  original  game  and 
that  all  have  information  about  how  the  game  is 
being  played.  Each  individual  has  a  contingent 
strategy  plan — that  is,  a  plan  of  which  strategy  to 
choose  if  placed  in  the  role  of  Player  1,  Player 
2,  .  .  .  ,  or  Player  n.  Suppose  further  that  indi¬ 
viduals  revise  their  plans  when  they  observe  that 
an  alternative  strategy  could  have  yielded  a 
higher  payoff  to  profiles  recently  played  than  did 
the  player’s  previous  intended  choice  and  that 
those  revisions  tend  to  adopt  higher  payoff  strat¬ 
egies  against  the  observed  actions  of  others.  In 
this  setting,  any  population  strategy  profile  that 
does  not  induce  a  Nash  equilibrium  for  every 
possible  assignment  of  n  individuals  to  the  player 
roles  in  the  game  will  be  subject  to  such  change 
by  some  player.  That  is,  only  Nash  equilibria  can 
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be  stable.  (The  literature  on  learning  in  games 
formalizes  this  intuition.) 

Coalitionally  Rational  Play 

The  ideas  of  best  response  and  hence  of  Nash 
equilibrium  imagine,  in  effect,  that  every  player 
considers  only  individual  deviations  from  a  profile. 
For  equilibrium  profiles  in  some  games,  however, 
it  may  be  the  case  that  two  or  more  players  could 
all  have  achieved  higher  payoffs  had  they  all  cho¬ 
sen  differently.  The  analysis  of  possible  coalitional 
action  in  a  strategic  game  can  sometimes  be 
accomplished  through  a  version  of  Nash  equilib¬ 
rium  that  inquires  not  only  about  individual 
departures  from  a  strategy  profile  but  also  about 
simultaneous  departures  by  several  players.  For 
any  strategy  profile  s  and  any  subset  or  coalition  of 
players  C  Q  N,  write  sc  for  the  list  of  strategies 
chosen  by  players  in  C,  s_c  for  the  list  of  strategies 
chosen  by  players  not  in  C,  and  let  (sc,  s_c)  also 
denote  s.  A  strategy  profile  s'  =  (s0  s’_c)  is  a  strong 
Nash  equilibrium  if  for  every  coalition  C  and  for 
every  feasible  alternative  list  of  strategies  sc  (where 
for  each  i  €  C,  si  €  St),  there  is  some  member  j  of 
C  for  whom 

UMc>  S-c)  -  uj(sc>  s-c)‘ 

That  is,  for  no  coalition  is  it  possible  to  find  an 
alternative  profile  for  coalition  members  such  that 
all  would  have  been  strictly  better  off  had  all  cho¬ 
sen  those  alternative  strategies,  given  the  strategies 
of  nonmembers.  Note  that  a  strong  Nash  equilib¬ 
rium  is  a  Nash  equilibrium,  since  the  coalitions  in 
the  definition  must  include  single-element  sets  of 
players,  C  =  {/}.  However,  not  every  Nash  equilib¬ 
rium  is  a  strong  Nash  equilibrium.  Moreover,  no 
strongly  inefficient  strategy  profile  can  be  a  strong 
Nash  equilibrium,  since  the  coalitions  in  the  defini¬ 
tion  must  include  the  universal  coalition,  C  —  N. 

For  example,  in  the  Prisoners’  Dilemma,  (Defect, 
Defect)  is  a  Nash  equilibrium  but  not  a  strong 
Nash  equilibrium,  since  all  members  of  the  univer¬ 
sal  coalition  C  =  {Player  1,  Player  2}  could  have 
achieved  a  higher  payoff  had  they  instead  chosen 
(Cooperate,  Cooperate).  The  latter  profile,  how¬ 
ever,  is  not  a  strong  Nash  equilibrium  either,  since 
the  singleton  coalition  C  =  {Player  1 }  could  strictly 
improve  its  members’  payoffs  by  instead  choosing 


Defect.  The  Prisoners’  Dilemma  has  no  strong 
Nash  equilibrium. 

Mixed  Strategies  and  Existence 
of  Nash  Equilibrium 

A  game  may  not  have  any  Nash  equilibria  in 
such  deterministic  (or  “pure”)  strategies.  A  classic 
example  is  the  game  usually  called  “Matching 
Pennies,”  shown  in  Table  2.  More  generally,  this 
happens  in  games  in  which  one  player  wants,  in 
some  sense,  to  match  the  other  but  not  vice  versa: 
Rock-Paper-Scissors  and  Soccer  Penalty  Kicks  are 
also  common  examples.  Such  examples  are  not 
confined  to  games  of  pure  conflict,  however.  In 
such  situations,  it  behooves  a  player  to  be  as 
unpredictable  as  possible  to  his  or  her  opponent. 
Game  theorists  denote  this  through  the  device  of  a 
mixed  strategy,  in  which  player  i  randomizes  over 
Sr  In  an  analysis  allowing  mixed  strategies,  a  strat¬ 
egy  is  denoted  by  a  probability  distribution  over  Sr 
(The  set  of  all  mixed  strategies  includes  all  the 
deterministic  strategies  as  well,  which  are  denoted 
by  degenerate  probability  distributions  placing  all 
weight  on  one  element  of  S,.)  And  in  that  setting, 
players  are  assumed  to  choose  strategies  so  as  to 
maximize  expected  payoff. 

With  the  addition  of  mixed  strategies,  this 
seminal  existence  result  known  as  Nash’s  theorem 
holds:  Every  finite  game  has  at  least  one  Nash 
equilibrium  in  mixed  strategies. 

Multiple  Nash  Equilibria 

Nash  equilibrium  eliminates  some  rationaliz- 
able  strategy  profiles,  but  a  game  may  still  have 
many  Nash  equilibria.  The  canonical  example  is 
the  game  of  pure  coordination,  shown  in  Table  3, 
which  has  two  pure-strategy  (and  one  mixed 


Table  2  A  Game  With  No  Pure-Strategy  Equilibria: 
Matching  Pennies 
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Table  3  A  Game  With  Two  Pure-Strategy  Equilibria: 
Pure  Coordination 


Player  2 


A 

B 

A 

1 

0 

Player  1 

1 

0 

B 

0 

1 

0 

1 

strategy)  equilibria.  Without  additional  assump¬ 
tions — concerning,  for  example,  iterated  play  and 
dynamics — game  theory  proper  has  nothing  fur¬ 
ther  to  say  about  which  equilibrium  would  be 
chosen.  In  applications,  the  historical  or  cultural 
context  in  which  a  game  is  played  might  afford  the 
players  additional  information  about  what  each 
other  will  anticipate,  resulting  in  a  focal  point ,  a 
choice  among  equilibria  that  players  can  arrive  at 
more  often  than  random  chance  or  symmetric 
mixed  strategies  would  predict.  Games  with  mul¬ 
tiple  equilibria  demonstrate  the  difference  between 
rationalizable  and  Nash  strategy  profiles:  If  Player 
1  chooses  A  based  on  the  belief  that  2  will  choose 
A,  and  Player  2  chooses  B  based  on  the  belief  1  will 
choose  B,  this  is  a  rationalizable  profile  of  strate¬ 
gies  and  beliefs.  But  the  result  disconfirms  the 
players’  beliefs  and  disappoints  their  expectations. 

Evolutionary  Game  Theory 
and  Learning  in  Games 

Evolutionary  game  theory  offers  an  additional 
set  of  assumptions  and  tools  to  say  more  about  the 
conditions  under  which  various  Nash  equilibria 
will  be  selected.  These  tools  are  used  in  biology  to 
study  actual  evolution  through  natural  selection 
(usually  with  “selfish  genes”  filling  the  role  of 
rational  players).  Individuals  whose  phenotypes — 
strategies — yield  them  higher  reproductive  fitness 
in  one  generation  have  more  progeny  in  the  next. 
A  distribution  of  genotypes  stable  in  the  resulting 
dynamic  system  corresponds  roughly  to  equilib¬ 
rium  in  the  game. 

In  the  social  sciences,  the  tools  of  evolutionary 
game  theory  are  more  commonly  applied  to  mod¬ 
els  of  “learning  in  games,”  in  which  individuals 
play  games  many  times  against  different  oppo¬ 
nents,  occasionally  switching  strategies  to  adopt  a 


new  one  observed  to  have  been  more  successful. 
The  abandoning  of  one  strategy  in  favor  of  another 
takes  the  place  of  the  biologist’s  process  of  natural 
selection.  This  learning  process  can  be  quite  myo¬ 
pic  and  error  ridden,  but  generally  the  result  is  that 
a  population  of  individuals  may  reach  a  stable 
point,  a  whole-population  profile  of  strategies  in 
which  little  further  systematic  improvement  is  pos¬ 
sible.  Stable  points  correspond  under  general  con¬ 
ditions  to  a  subset  of  Nash  equilibria.  Models  of 
learning  in  games  are  significant  to  game  theory 
overall  because  they  demonstrate  that  the  strong 
assumptions  used  to  construct  basic  games — 
perfect  and  common  information,  perfect  ability  to 
select  optimal  strategies,  and  an  a  priori  coinci¬ 
dence  in  the  selection  among  multiple  equilibria — 
are  not  remotely  necessary  to  produce  choices  that 
correspond  to  classic  Nash  equilibrium.  What  is 
required  is  a  sufficient  opportunity  for  players  to 
gain  experience  in  playing. 

Imperfect  Information  in  Strategic  Games 

The  assumption  of  perfect  information  is  obvi¬ 
ously  an  extreme  idealization  of  strategic  interac¬ 
tion.  Even  in  classical  game  theory  (as  opposed  to 
evolutionary  or  population  learning  models),  a 
major  preoccupation  is  the  generalization  of  this 
ideal  case.  This  is  accomplished  by  specifying 
probability  distributions  that  describe  the  state  of 
a  player’s  knowledge,  including  knowledge  about 
the  state  of  other  players’  knowledge.  In  general, 
imperfect  information  about  the  set  of  players,  the 
strategy  sets,  or  the  utility  functions  in  a  strategic 
game  can  all  be  expressed  in  terms  of  uncertainty 
about  the  utility  functions  alone:  An  unavailable 
strategy  can  be  represented  by  an  extremely  low 
utility  for  any  profile  in  which  that  strategy  is  used, 
and  an  absent  player  can  be  assigned  a  singleton 
strategy  set.  Often,  the  true  payoff  function  of  a 
player  is  called  the  player’s  type. 

Once  each  player’s  imperfect  information  is 
expressed  in  the  form  of  a  probability  distribution 
over  payoffs,  the  criterion  of  best  response  can  be 
expressed  in  terms  of  expected-payoff  maximiza¬ 
tion.  Nash  equilibrium  generalizes  directly  to  this 
form  of  best  response.  An  important  complication 
is  that,  in  calculating  best  responses,  a  player’s 
subjective  probabilities  about  other  player  types 
must  be  updated  to  account  for  the  circumstances 
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under  which  the  first  player’s  action  will  affect  his 
or  her  payoff.  For  example,  in  models  of  instru¬ 
mental  voting,  only  the  cases  in  which  a  player’s 
vote  will  be  pivotal  have  any  importance  for 
maximizing  payoff;  but  the  fact  of  being  pivotal 
carries  implications  about  what  the  other  players’ 
types  might  be  in  those  circumstances. 

Sequential  Games 

Strategic  games  abstract  away  players’  opportuni¬ 
ties  to  react  to  other  players  or  to  anticipate  their 
future  decisions,  burying  such  activity  implicitly 
within  each  player’s  strategy.  Sequential  games,  by 
contrast,  offer  an  alternative  model  designed  to 
emphasize  this  aspect  of  interactive  decision  mak¬ 
ing.  Political  processes  are  usually  sequential  deci¬ 
sion  processes,  and  political  scientists  have 
exploited  this  in  numerous  ways.  Crisis  bargaining 
and  the  inception  of  war  were  early  and  important 
objects  of  study  through  sequential  games;  later 
models  of  ethnic  conflict  and  cooperation  involve 
an  approach  similar  in  spirit.  In  the  study  of  policy 
formation  through  voting,  agenda-setting  models 
such  as  committee  gatekeeping  have  been  widely 
employed,  and  the  theory  of  legislative  bargaining 
has  achieved  the  status  of  a  general-purpose  tool 
for  legislative  politics,  political  parties,  and  group 
choice  generally.  Sequential  game  models  are 
prominent  also  in  the  study  of  delegation  of  pow¬ 
ers  to  an  agency  or  committee  and  of  decision 
making  within  a  system  of  separated  powers. 
Games  of  sequentially  repeated  interaction  are 
often  applied  to  the  formation  and  maintenance  of 
community  or  of  collective  action,  in  settings  rang¬ 
ing  from  the  theoretical  study  of  polities  to  the 
stability  of  international  organizations.  With  the 
incorporation  of  incomplete  information,  sequen¬ 
tial  games  have  also  found  important  applications 
to  all  manner  of  political  communication  phenom¬ 
ena,  such  as  policy  advice  from  biased  experts, 
lobbying,  and  informative  political  debate. 

General  Formulation 

The  following  elements  are  necessary  to  specify 
a  basic  sequential  game;  for  special  purposes,  addi¬ 
tional  elements  may  be  needed,  and  these  will  be 
described  in  the  subsequent  sections.  Just  as  with 
strategic  games,  every  sequential  game  has  a  set  N 


of  players  and,  for  each  player  i,  a  utility  function 
ut  over  outcomes.  The  difference  lies  in  the  descrip¬ 
tion  of  player  decisions  and,  thus,  of  outcomes.  A 
sequential  game  also  consists  of  a  description  of 
turns  on  which  a  given  player  has  the  chance  to  act 
and  the  actions  among  which  that  player  may 
choose.  A  game  need  not  simply  describe  the  turns 
as  alternating  regularly  among  players;  in  general, 
the  coming  of  a  player’s  turn,  as  well  as  the  actions 
available  to  that  player,  may  depend  on  the  prior 
actions  of  all  players.  The  description  of  the  game 
must  also  fully  specify  the  sequences  of  player 
actions  that  determine  the  various  outcomes  of  the 
game.  As  with  strategic  games  using  this  descrip¬ 
tion,  any  outcome  may  be  simply  specified  by  the 
sequence  of  actions  that  led  to  it,  although  in 
applications  the  outcomes  usually  correspond  to 
substantive  results. 

Traditionally,  and  in  most  small  games,  the 
whole  description  of  turns,  actions,  and  outcomes 
is  accomplished  through  the  specification  of  a 
game  tree.  The  “tree”  terminology  refers  to  a 
rooted  tree  in  the  mathematical  theory  of  graphs. 
In  that  theory,  a  graph  consists  of  a  set  of  nodes 
and  the  edges  that  connect  some  nodes  directly  to 
other  nodes;  a  tree  is  any  graph  that  is  both  con¬ 
nected  (any  node  can  be  reached  from  any  other 
node  via  a  sequence  of  edges  that  does  not  retrace 
itself,  called  a  path)  and  acyclic  (has  no  cycles,  i.e., 
no  path  from  a  node  back  to  itself);  and  a  rooted 
tree  has  one  node  designated  as  the  initial  node  (or 
root).  The  properties  of  connectedness  and  acyclic¬ 
ity  then  imply  that  for  any  two  paths  leading  away 
from  the  initial  node,  once  those  two  paths  diverge 
onto  different  edges  they  never  reconnect — giving 
the  graph  the  shape,  schematically,  of  the  branches 
of  a  tree. 

The  initial  node  represents  the  beginning  of  the 
game.  The  path  leading  from  the  initial  node  to 
any  particular  node  represents  a  partial  or  com¬ 
plete  history  of  play  of  the  game  (including  the 
“null  history”  at  the  initial  node);  at  each  node 
along  any  path  from  the  initial  node,  either  it  is 
one  player’s  turn  to  move  or  else  the  path  outward 
from  the  root  ends.  These  nodes  along  any  path 
are  called  nonterminal  and  terminal  histories  of 
the  game,  respectively.  The  edge  connecting  the 
node  of  a  nonterminal  history,  h,  to  the  next  node 
outward  from  the  root  represents  an  action  chosen 
by  the  player  P(h)  whose  turn  came  at  h.  Each 
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terminal  history  represents  an  outcome  of  the 
game,  the  result  of  a  particular  path  of  play. 

This  description  of  a  sequential  game  as  a  game 
tree  is  just  a  way  of  forcing  the  specification  of 
turns,  action  choices,  potential  histories,  and  out¬ 
comes  to  satisfy  appropriate  regularity  conditions, 
making  the  description  unambiguous  and  inter¬ 
nally  consistent.  With  the  addition,  described 
below,  of  chance  moves,  simultaneous  moves,  and 
information  sets,  the  structure  is  remarkably  versa¬ 
tile  and  yet  amenable  to  powerful  proofs  concern¬ 
ing  the  general  properties  of  types  of  games  and 
their  solutions.  Figure  1  shows  an  example  of  a 
simple  game  tree,  a  so-called  entry  or  incumbency 
game.  In  the  figure,  the  payoffs  of  the  two  players 
are  shown  by  a  label  on  each  terminal  history  x, 
with  ut(x)  written  above  u2(x).  This  is  sometimes 
written  instead  as  (zq(x),  u2(x)). 

Note,  however,  that  the  general  form  of  a  sequen¬ 
tial  game  may  involve  trees  that  cannot  be  literally 
portrayed  in  a  tree  diagram  such  as  Figure  1.  For 
many  applications,  analytic  tractability  is  improved 
by  allowing  a  player  i  at  a  given  history  h  to  have  a 
set  of  available  actions  Afh)  that  is  infinite,  as  in 
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Figure  I  A  Typical  Game  Tree:  The  Entry  Game 


games  of  bargaining  over  the  division  of  a  stock  or 
in  a  spatial  model  of  electoral  competition.  Also,  it 
is  often  important  that  there  be  no  a  priori  limit  on 
the  possible  length  of  a  path  of  play,  so  that  some 
terminal  histories  are  infinitely  long,  as  in  alternat¬ 
ing-offer  bargaining  processes  or  in  repeated  games. 
A  well-specified  model  must  somehow  limit  the 
payoff  to  such  an  infinite  “terminal”  history  (e.g., 
by  a  low  reversion  payoff  in  bargaining  or  by  dis¬ 
counting  in  a  repeated  game).  An  infinitely  long 
path  of  play  is  nevertheless,  of  course,  unrealistic  as 
an  outcome;  but  the  ever-present  possibility  of  con¬ 
tinuing  the  game  is  strategically  critical.  A  game 
having  at  least  one  terminal  history  of  infinite  length 
is  said  to  have  “infinite  horizon.”  A  game  having  no 
such  history,  and  in  which  the  set  of  available 
actions  at  every  turn  is  finite,  is  a  “finite  game.” 

Strategies  and  Solutions  in 
Basic  Sequential  Games 

A  strategy  for  player  i  in  a  sequential  game  con¬ 
sists  of  a  function  s,  specifying,  for  each  h  having 
P(h)  =  i,  an  action  sfh)  from  z’s  set  of  available 
actions  at  that  history.  A  strategy  profile  is  a  list  of 
such  functions,  one  for  each  player.  As  with  strate¬ 
gic  games,  a  strategy  profile  is  written  equivalently 
as  s,  (Sj,  s2,  .  .  . ,  sj,  or  (s,.,  sj. 

Backward  Induction 

For  any  finite  basic  sequential  game,  a  simple 
and  appealing  solution  concept  is  available:  the 
method  of  backward  induction.  Consider  an  “ulti¬ 
mate”  nonterminal  history,  that  is,  a  history  h  such 
that  every  action  taken  by  player  P(h )  will  end  the 
game.  Since  there  is  a  finite  number  of  such  actions, 
at  least  one  of  them  yields  a  maximum  payoff. 
Here  at  the  end  of  the  game,  the  only  relevant 
implication  of  this  player’s  choice  is  to  determine 
his  or  her  own  payoff;  clearly  the  rational  choice  is 
one  yielding  that  maximum.  Backward  induction 
begins  by  identifying  one  such  payoff-maximizing 
action  for  every  ultimate  nonterminal  history. 
Now  consider  a  “penultimate”  history:  a  history  h' 
for  which  (h',  a')  is  an  “ultimate”  history  no  mat¬ 
ter  which  action  a'  is  chosen  by  player  P(h').  For  a 
given  a',  let  a  be  the  action  that  will  then  be  taken 
at  the  resulting  ultimate  history  (h1,  a').  In  that 
case,  using  our  analysis  of  ultimate  histories,  player 
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i  =  P(b')  will  receive  a  predictable  payoff  from  a', 
namely,  ut  (h',  a',  a).  Proceeding  in  this  fashion, 
backing  up  the  tree  to  the  initial  node  specifies  an 
action  choice  for  the  choosing  player  at  every  non¬ 
terminal  history,  defining  a  strategy  for  every 
player.  This  strategy  profile  defines  the  sequence  of 
actions  that  will  transpire  when  the  game  is 
played — a  terminal  history  or  complete  path  of 
play  through  the  game  tree — as  well  as  the  actions 
that  would  be  chosen  following  any  arbitrary 
departure  from  that  path.  This  strategy  has  the 
property  that  all  actions  chosen,  both  on  and  off 
the  path,  are  payoff  maximizing,  given  all  the  sub¬ 
sequent  choices  by  all  players.  Call  this  strategy 
profile  the  solution  by  backward  induction. 

Nash  Equilibrium:  The  Strategic 

Form  of  a  Sequential  Game 

Alternatively,  a  sequential  game  can  be  con¬ 
verted  to  an  essentially  equivalent  strategic  form 
and  solved  by  specifying  a  Nash  equilibrium.  For  a 
finite  basic  sequential  game,  consider  the  strategic 
game  defined  by  (a)  the  same  set  of  players  N; 

(b)  for  each  player  i,  the  strategy  set  consisting  of 
all  full  sequential-game  strategies  for  player  i;  and 

(c)  for  every  profile  of  such  strategies,  a  payoff 
equal  to  the  sequential  game  payoff  corresponding 
to  the  terminal  node  that  would  result  from  apply¬ 
ing  that  strategy  profile  to  the  sequential  game.  A 
Nash  equilibrium  of  this  strategic  form  is  defined 
as  a  Nash  equilibrium  of  the  original  sequential 
game;  the  strategic  form  is  also  known  as  the  nor¬ 
mal  form  or  occasionally  the  matrix  form.  The 
sequential  form  is  also  known  as  the  extensive 
form  or  occasionally  the  tree  form  of  the  game. 

Notice  that  every  solution  by  backward  induc¬ 
tion  is  a  Nash  equilibrium  of  the  game.  In  a 
backward-induction  solution,  no  player  has  any 
incentive  to  change  both  an  ultimate  and  any  pen¬ 
ultimate  moves  because  all  the  ultimate  moves  are 
optimal  regardless  of  how  those  ultimate  histories 
were  arrived  at,  and  the  penultimate  moves  were 
optimal  given  those.  Backing  up  through  the  game 
tree,  similar  reasoning  establishes  that  no  player 
would  want  to  change  any  subset  of  his  or  her 
moves  in  the  backward-induction  solution;  hence 
the  profile  is  a  Nash  equilibrium. 

Not  every  Nash  equilibrium  is  a  backward- 
induction  solution,  however.  Consider  the  Challenge 


Game  shown  in  Figure  1.  Backward  induction 
yields  a  profile  in  which  Incumbent  Acquiesces,  and 
Challenger  Enters,  which  is  also  a  Nash  equilib¬ 
rium.  Consider  another  strategy  profile  in  which 
Challenger  chooses  Stay  Out  and  Incumbent 
chooses  Fight.  Clearly,  Challenger  would  not  prefer 
to  Enter  if  Incumbent  will  Fight;  moreover,  given 
that  Challenger  will  Stay  Out,  Incumbent  has  no 
incentive  to  Acquiesce.  This  too  is  a  Nash  equilib¬ 
rium,  but  it  violates  backward  induction.  Such  a 
flawed  Nash  equilibrium  is  sometimes  called 
“imperfect.” 

Subgame-Perfect  Equilibrium 

A  subgame  of  a  sequential  game  G  is  the  part  of 
the  game  that  follows  any  nonterminal  node  h,  and 
thus,  it  is  a  game  in  its  own  right.  Its  set  of  histories 
corresponds  to  the  set  of  all  histories  of  G  that 
begin  with  k-,  its  payoffs  are  the  corresponding 
payoffs  from  G;  and  its  player  set  is  the  same, 
although  some  may  have  no  moves  to  make  in  the 
subgame.  (For  games  of  imperfect  information, 
defined  below,  this  definition  is  modified  to 
exclude  putative  subgames  whose  set  of  nontermi¬ 
nal  histories  would  contain  an  incomplete  portion 
of  any  information  set. )  When  a  game  is  solved  by 
backward  induction,  a  backward-induction  solu¬ 
tion  on  every  subgame  is  induced  as  well.  Thus, 
backward  induction  induces  a  Nash  equilibrium 
on  every  subgame. 

This  property  can  be  applied  as  well  to  a  game 
with  infinite  horizon,  which  cannot  be  solved  via 
backward  induction.  For  any  sequential  game,  a 
Nash  equilibrium  that  induces  a  Nash  equilibrium 
on  every  subgame  avoids  the  “imperfection” 
described  above;  hence,  it  is  called  a  subgame- 
perfect  equilibrium.  Since  not  every  Nash  equilib¬ 
rium  is  subgame  perfect,  subgame  perfection  is 
also  referred  to  as  a  refinement  of  Nash  equilib¬ 
rium.  (The  qualification  “subgame”  perfect  distin¬ 
guishes  this  refinement  from  other  refinements 
treating  alternative  notions  of  imperfection.) 
Where  backward  induction  is  not  possible  or  fea¬ 
sible,  other  methods  may  be  employed  to  derive 
subgame-perfect  equilibria.  Such  methods  are 
often  based  in  one  way  or  another  on  Bellman’s 
principle  of  dynamic  programming:  An  optimal 
strategy  is  identified  by  always  taking  the  optimal 
current  action,  assuming  that  the  subsequent 
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actions  are  optimal.  Other  Nash  equilibrium 
refinements  in  a  similar  spirit  use  criteria  based  on 
rational  behavior  when  choices  are  known  to  be 
subject  to  small  errors  or  trembles,  both  in  sequen¬ 
tial  and  in  strategic  games.  Examples  include 
trembling-hand  perfection  and  proper  equilibrium. 

Extending  the  Basic  Formulation 

The  basic  formulation  of  a  sequential  game, 
whether  finite  or  infinite,  can  be  extended  in 
several  ways.  Backward-induction  and  subgame- 
perfect  equilibrium  extend  as  well,  making  the 
sequential  game  a  powerful  all-purpose  modeling 
tool.  This  section  examines  how  to  generalize  the 
formulation  of  sequential  games  to  include  simul¬ 
taneous  moves,  chance  occurrences,  randomized 
actions,  and  imperfect  information. 

Simultaneous  Moves 

Suppose  that,  following  some  histories,  two  or 
more  players  are  to  choose  actions  simultaneously, 
after  which  the  game  may  continue.  This  can  be 
represented  by  letting  the  player  function  P  desig¬ 
nate  a  set  of  players,  P(h )  ^  N,  rather  than  a  single 
player.  For  each  i  G  P(h),  we  still  let  Afh)  represent 
the  set  of  actions  among  which  i  is  to  choose;  this 
may,  in  general,  be  a  different  set  of  actions  for 
each  player  in  P(h).  The  description  of  a  history 
accommodates  such  simultaneous  moves  by  allow¬ 
ing  the  sequence  of  actions  defining  h  to  include, 
where  appropriate,  a  profile  of  actions  by  a  set 
P(h)  of  players,  in  place  of  the  move  by  a  single 
player.  For  example,  if  action  a 1  by  Player  1  is  fol¬ 
lowed  by  a  simultaneous  choice  by  Players  2  and 
3,  who  choose  a 2  and  a 3,  respectively,  then  result¬ 
ing  history  may  be  written  unambiguously  as  ( a 1, 
(a2,  a3)).  Player  V s  strategy  in  a  sequential  game  can 
still  be  represented  in  the  same  form  as  before,  let¬ 
ting  Sj(h)  represent  the  action  i  would  take  at  his¬ 
tory  h  regardless  of  whether  other  players  are 
simultaneously  making  choices  at  h.  A  basic 
sequential  game  is  one  in  which  P(h)  is  always  a 
singleton  {/}.  A  strategic  game  now  becomes  a  spe¬ 
cial  case  of  a  sequential  game,  in  which  the  null 
history  0  is  the  only  nonterminal  history,  with 
P(0)  =  N. 

Finally,  it  is  natural  to  extend  backward  induc¬ 
tion  to  simultaneous-choice  situations  as  follows: 


when,  in  the  process  of  backward  induction,  a  his¬ 
tory  h  is  reached  at  which  P(h)  has  more  than  one 
player,  then  construct  a  special  strategic  game  in 
which  the  set  of  players  is  P(h );  player  V s  strategy 
set  is  Afh);  and  player  i’s  payoff  to  strategy  profile 
s(h)  in  the  special  strategic  game  is  identical  to  i’s 
payoff  in  the  overall  sequential  game  from  the  ter¬ 
minal  history  defined  by  h,  followed  by  s(h), 
followed  by  continued  play  according  to  the  back¬ 
ward-induction  steps  already  completed.  The 
backward  induction  is  continued,  then,  by  desig¬ 
nating  a  Nash  equilibrium  s"(h)  in  the  special  stra¬ 
tegic  game  at  h,  and  then  continuing  up  the  tree 
under  the  assumption  that  if  h  is  reached,  the  play¬ 
ers  in  P(h)  will  choose  according  to  s  (h).  As  in 
backward  induction  in  a  basic  sequential  game,  if 
a  simultaneous-move  history  has  more  than  one 
Nash  equilibrium,  the  implications  of  each  must  be 
traced  out  for  the  remaining  steps  of  backward 
induction,  yielding  several  different  backward- 
induction  solutions.  Extending  the  method  of 
backward  induction  in  this  manner  preserves  the 
definition  of  strategic  form,  the  definition  of  Nash 
equilibrium  for  sequential  games,  and  the  corre¬ 
spondence  between  backward-induction  and  sub- 
game-perfect  equilibrium  all  in  the  same  form  as 
defined  for  basic  sequential  games. 

Example:  Repeated  Games 

An  important  application  of  sequential  games 
with  simultaneous  moves  is  the  case  of  repeated 
games.  Fet  G  be  a  strategic  game  (players  N,  strat¬ 
egy  sets  St,  and  utility  functions  u).  In  a  repeated 
game,  the  same  set  of  players  plays  G,  known  as 
the  stage  game-,  all  learn  the  outcome;  and  all  play 
G  again,  and  so  on.  The  players  know  that  G  will 
be  repeated  a  specific  number  of  times  or  indefi¬ 
nitely.  The  payoff  to  this  entire  sequence  is  the  sum 
of  the  payoffs  to  the  successive  instances  of  G, 
perhaps  discounted  (i.e.,  multiplied  by  some  “dis¬ 
count  factor”  between  0  and  1  for  each  repetition) 
due  to  time  preferences  or  to  the  probabilistic  ter¬ 
mination  of  the  sequence. 

Consider  for  instance  the  Prisoners’  Dilemma 
shown  in  Table  1  as  the  stage  game,  to  be  repeated 
infinitely  with  discount  factor  w.  This  forms  an 
infinite-horizon  sequential  game,  in  which  at 
every  nonterminal  (i.e.,  every  finite)  history  h, 
P(h)  =  {Player  1,  Player  2}  and  Afh)  —  {Cooperate, 
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Defect}.  A  terminal  history  is  an  infinite  sequence 
of  the  form  ((a10,  a20),  (an,  a21),  (a12,  a22),  ..  .), 
where  each  ait  €  {Cooperate,  Defect}.  The  payoff 
to  player  i  from  such  a  terminal  history  is 

oo 

Y  u/ui{au,alt), 

t= 0 

where  t  represents  time. 

Finally,  a  strategy  takes  the  form  of  a  rule  tell¬ 
ing  what  action  would  be  chosen  at  every  possible 
nonterminal  history.  Actual  strategies  of  interest 
are  usually  expressed  in  the  form  of  a  relatively 
simple  rule.  For  example,  the  strategy  of  Tit  for 
Tat  in  the  repeated  Prisoners’  Dilemma  can  be 
described  as  follows:  Cooperate  at  t  —  0;  and  for 
all  t  >  0,  imitate  the  other  player’s  action  at  t  —  1. 

A  repeated-game  strategy  profile  can  be  proven 
to  be  a  subgame-perfect  equilibrium  by  showing 
that  at  no  possible  history  (on  or  off  its  path  of 
play)  could  payoff  be  increased  by  changing  the 
action  taken  at  that  history  and  nowhere  else.  This 
method  is  derived  from  Bellman’s  principle.  Notice 
that  the  profile  in  which  both  players  use  Tit  for 
Tat  is  a  Nash  equilibrium  for  sufficiently  large  w 
but  is  not  a  subgame-perfect  equilibrium  for  any 
value  of  tv.  This  is  because  if  one  player  ever  com¬ 
mits  an  unexplained  Defection,  then,  under  the 
assumption  that  both  players  would  adhere  to  Tit 
for  Tat  in  the  future,  it  would  be  better  for  the 
other  player  to  forgo  punishing  the  defection 
(incurring  a  punishment  in  turn).  If  the  strategy  for 
both  players  is  modified,  however,  to  include  an 
“apology” — namely,  Cooperate  twice  regardless 
of  the  other  player’s  action — for  any  unprovoked 
Defection,  it  becomes  a  subgame-perfect  equilib¬ 
rium  (again,  for  sufficiently  large  w).  Thus, 
although  Defect  is  a  dominant  strategy  in  the  unre¬ 
peated  game,  full  cooperation  can  be  maintained 
in  equilibrium  in  the  indefinitely  repeated  game. 

Moves  by  Chance 

The  interactions  we  subject  to  game  theoretic 
analysis  sometimes  depend  in  part  on  intervening 
events  best  understood  by  the  players  as  being  ran¬ 
dom.  Such  phenomena  can  be  included  by  allow¬ 
ing  the  player  function  to  specify  that  at  some 
histories  h,  instead  of  a  player  or  players  choosing 
an  action,  a  random  event  occurs  before  subse¬ 
quent  player  actions  or  payoff  determinations  take 


place.  This  is  denoted  by  writing  P(h)  —  Chance  or 
P(h)  —  {Chance}  and  such  an  h  is  casually  referred 
to  as  a  “move  by  Chance”  or  by  “Nature.”  In 
describing  the  game,  any  such  instance  must  be 
accompanied  by  a  set  of  possible  outcomes  of  the 
random  event,  which  we  might  as  well  write 
A  chance  ( ^ )  3  and  a  probability  distribution  p  on 
Achmcf.(b).  The  outcome  of  a  Chance  move  is  part 
of  the  sequence  of  actions  defining  any  history. 

If  Chance  moves  are  encountered  in  the  path  of 
play  under  a  profile  s,  then  s  determines  not  a  par¬ 
ticular  terminal  history  but  rather  a  probability 
distribution  over  terminal  histories.  For  a  game 
involving  moves  by  Chance,  then,  the  backward- 
induction  process  must  be  modified  by  maximizing 
expected  payoffs,  computed  in  the  appropriate 
way  using  the  probability  distributions  for  the 
Chance  moves  to  determine  the  probabilities  of 
various  payoffs  given  a  strategy  profile. 

Behaviorally  Mixed  Strategies 

Mixed  strategies  are  usually  treated  in  a  differ¬ 
ent  way  for  sequential  games  than  for  strategic 
games.  Game  theorists  use  the  term  behavioral  (or 
behaviorally)  mixed  strategy  to  refer  to  a  strategy 
in  a  sequential  game  in  which  the  player  random¬ 
izes  over  the  actions.  (Where  the  context  is  clear, 
the  modifier  “behavioral”  is  usually  dropped.) 
Unless  otherwise  specified,  these  randomizations 
are  always  taken  to  be  independent  across  players; 
they  are  represented  as  independent  across  histo¬ 
ries  too,  although  of  course  different  actions  at  an 
earlier  history  lead  to  different  subsequent  histo¬ 
ries,  at  which  the  player  may  use  different  random¬ 
izing  probabilities. 

Obviously,  a  strategy  profile  involving  random¬ 
ization  will  determine  a  probability  distribution 
over  outcomes,  and  expected  utility  is  the  criterion 
for  backward  induction.  Analogous  to  deriving 
mixed-strategy  equilibria  in  strategic  games,  the 
procedure  in  a  sequential  game  for  finding  sub- 
game-perfect  equilibria  involving  behavioral  mixed 
strategies  depends  on  the  fact  that  a  player  who 
maximizes  expected  utility  can  rationally  random¬ 
ize  if  and  only  if  each  of  the  actions  accorded  posi¬ 
tive  probability  would,  if  used  alone,  also  maximize 
expected  utility.  In  the  backward-induction  pro¬ 
cess,  this  means  that  a  single  player  P(h)  could  use 
either  no  randomized  action  or  a  continuum  of 
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randomized  actions  at  h.  However,  many  of  these 
possibilities  will  often  be  ruled  out  of  the  eventual 
subgame-perfect  equilibrium,  since  randomiza¬ 
tions  by  more  than  one  player  or  at  more  than  one 
history  will  require  multiple  indifference  condi¬ 
tions  that  (again  as  in  strategic  games)  constrain 
the  feasible  values  of  the  probabilities  used. 

Imperfect  Information 

Perhaps  the  single  most  important  extension  of 
the  basic  formulation  is  to  represent  games  in 
which  players  do  not  have  full  information  about 
the  previous  actions  of  other  players.  This  is,  after 
all,  an  important  feature  of  many  parlor  games  as 
well  as  a  key  feature  of  real-world  interactions. 
The  state  of  player  /’ s  knowledge  about  previous 
moves  in  a  sequential  game  is  represented  by  dis¬ 
tinguishing,  as  an  information  set  of  player  i,  any 
subset  of  histories  at  which  player  i  is  required  to 
move  but  among  which  i  is  unable  to  distinguish  at 
the  time  they  are  reached.  Thus,  for  all  histories 
within  the  same  information  set,  the  player  is 
required  to  choose  the  same  action.  The  whole  col¬ 
lection  of  such  information  sets  for  a  player  is  that 
player’s  information  structure.  To  define  equilib¬ 
rium  in  a  game  with  imperfect  information  requires 
that  we  incorporate  a  notion  of  player  V s  probabi¬ 
listic  beliefs  about  previous  actions.  These  notions 
are  formalized  below  to  facilitate  defining  the  rel¬ 
evant  equilibrium  concepts. 

For  each  player  i,  let  H(i)  be  the  set  of  histories 
for  which  i  is  in  the  player  set.  Let  I,  be  a  partition 
of  H(i) — that  is,  a  set  of  nonempty  subsets  of  H(i) 
such  that  each  history  in  H(i)  is  an  element  of 
exactly  one  of  those  subsets.  Denote  by  Ifh)  the 
partition  element  containing  b,  called  V s  informa¬ 
tion  set  at  h.  In  what  follows,  the  symbol  Z  denotes 
a  typical  information  set  without  reference  to  a 
particular  history  in  it;  but  for  any  h  in  Z,  Ifh)  = 
Z.  7,  is  a  valid  information  structure  for  i  if  and 
only  if,  for  each  information  set  Z  in  7;,  player  i  has 
exactly  the  same  action  choices  at  every  h  in  Z.  (In 
any  application,  the  names  or  labels  of  the  actions 
for  each  history  in  an  information  set  should  be 
identical  as  well.  From  a  theoretical  standpoint, 
however,  the  labels  on  the  actions  are  irrelevant; 
hence,  the  only  real  restriction  is  that  the  action 
sets  be  of  the  same  size.)  This  restriction  is  needed 
since  i  must  be  aware  of  the  available  choices  at 


the  present  history;  it  must  be  impossible  for  the 
player  to  infer  from  this  whether  that  history  is  h 
or  h' .  Thus,  we  may  as  well  write  Aflfh ))  for  the 
actions  available  at  every  b  in  information  set  Ifh). 
Finally,  for  any  history  b  at  which  player  i  €  P(h) 
does  know  the  entire  sequence  of  previous  actions, 
Ifh)  —  {h},  just  that  one  history. 

This  approach  requires  a  slight  redefinition  of  a 
strategy.  Since  player  i  cannot  distinguish  between 
b  and  h'  in  an  information  set  Z,  a  strategy  must 
specify  the  same  action  at  both  histories.  Thus,  in 
general,  a  strategy  for  player  i  can  be  fully  described 
as  a  mapping  not  from  each  history  into  its  action 
set  but  simply  from  each  information  set  Z  6  I:  into 
its  action  set;  thus,  instead  of  sfh),  we  keep  track 
of  the  accounting  by  writing  sfZ ),  which  auto¬ 
matically  reminds  us  that  b,  h'  €  Z  implies  sfh)  — 
sfh').  As  noted  above,  the  presence  of  information 
sets  restricts  the  applicability  of  the  notion  of  sub- 
game-perfect  equilibrium.  In  a  proper  subgame, 
every  h  within  the  subgame  must  have  the  property 
that  the  subgame  contains  all  the  histories  in  Ifh). 
That  is,  no  information  set  can  be  “shared” 
between  two  subgames.  Thus,  in  a  game  of  imper¬ 
fect  information,  not  every  nonterminal  h  gives  rise 
to  a  subgame.  With  this  restriction  on  what  quali¬ 
fies  as  a  subgame,  the  definition  of  subgame- 
perfect  equilibrium  remains  the  same  as  before. 

To  go  further  in  assessing  the  choice  of  a  ratio¬ 
nal  strategy  requires  that  we  devise  some  method 
of  handling  a  player’s  uncertainty  at  any  nontriv¬ 
ial  information  set  Z.  As  usual,  we  represent  the 
state  of  the  player’s  knowledge  about  what  is  the 
actual  history  in  Z  with  a  probability  distribution 
over  Z.  Now,  the  payoff  from  choosing  action  a 
from  AfZ)  is  the  expectation  over  the  histories  in 
Z  of  the  payoffs  from  choosing  a.  Each  of  these 
payoffs,  in  turn,  may  be  an  expectation  based  on 
subsequent  moves  by  nature  or  randomized 
actions  to  be  chosen  by  i  or  by  other  players  sub¬ 
sequent  to  that.  We  thus  wish  to  equip  each  player 
i  with  a  probability  distribution  over  each  non¬ 
trivial  Z  in  f.  Let  | xz(b)  represent  V s  subjective 
probability  that  the  current  history  is  h  given  that 
the  current  information  set  is  Z  6  I;.  The  distribu¬ 
tion  |xz  is  known  as  player  /’ s  beliefs  at  Z,  and  |x 
as  defined  over  all  Z  for  all  players  is  a  system  of 
beliefs.  Using  this  notion  of  beliefs,  we  can  gener¬ 
alize  subgame-perfect  equilibrium  to  the  imper¬ 
fect-information  setting.  There  are  several  such 
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extended  definitions,  each  having  the  following 
form:  An  equilibrium  consists  of  an  assessment , 
that  is,  a  profile  of  strategies  s  and  a  system  of 
beliefs  |x,  obeying  two  conditions: 

f.  Sequential  rationality:  For  each  i  and  each  Z  6 
f,  s,(Z)  maximizes  the  expected  value  of  u;  given 
s_.  and  jjl. 

2.  Consistency:  Each  pz  is  consistent  with  s  and 
with  the  probabilities  placed  on  Chance  moves, 
given  the  laws  of  probability  and  other 
considerations. 

The  sequential  rationality  condition  is  just  a 
restatement  of  the  criterion  we  have  been  using  all 
along,  that  each  action  maximizes  expected  pay¬ 
off.  The  consistency  condition  varies  according  to 
the  “other  considerations”  imposed  as  conditions, 
and  this  gives  rise  to  different  solution  concepts. 

The  simplest  consistency  condition — and  the 
least  restrictive  while  still  being  consistent  with  the 
laws  of  probability — is  that  beliefs  must  comport 
with  Bayes’s  rule  wherever  it  is  directly  applicable. 
That  is,  for  every  information  set  Z  that  can  actu¬ 
ally  be  reached  with  positive  probability  given  s,  for 
every  history  h'  in  Z,  we  must  define  | ±z(h')  accord¬ 
ing  to  Bayes’s  rule,  namely,  as  the  conditional  prob¬ 
ability  of  reaching  h'  given  that  Z  is  reached,  given 
the  players’  strategies  and  the  game’s  Chance-move 
probabilities.  This  amounts  to 

M>')  =  Pr(fc')/Pr(Z)  -  Vr(h')/ZheZVr(h), 

for  all  h'  in  Z.  Each  marginal  probability  Pr(^)  is 
calculated  by  multiplying  all  the  mixed-strategy 
action  probabilities  and  Chance-move  probabili¬ 
ties  along  the  sequence  of  actions  that  make  up  h. 
An  assessment  (s*,  |x*)  satisfying  this  form  of  con¬ 
sistency,  as  well  as  sequential  rationality,  is  a  weak 
sequential  equilibrium  (WSE). 

Game  theoretic  models  frequently  invoke  a 
related  equilibrium  concept  called  perfect  Bayesian 
equilibrium  (PBE).  A  sequentially  rational  assess¬ 
ment  (s*,  (x*)  is  a  PBE  if  and  only  if  it  induces  a 
WSE  on  every  subgame.  The  only  difference 
between  PBE  and  WSE  occurs  when  there  is  an 
information  set  Z  that  is  not  reached  by  s  but  for 
which  other  off-path  actions  or  chance  moves 
intervene  between  the  point  where  s  diverges  from 


Z  and  Z  itself.  In  such  cases,  WSE  consistency 
imposes  no  restriction  on  beliefs  in  Z,  but  PBE 
consistency  requires  that  the  probabilities  in  Z 
comport  with  the  probabilities  of  reaching  Z  con¬ 
ditional  on  the  initial  departure  from  the  path  of  s. 
Other,  stronger  consistency  conditions  produce 
further  belief-based  refinements  of  WSE,  such  as 
sequential  equilibrium  and  divine  equilibrium.  In 
general,  such  refinements  are  used  to  rule  out  equi¬ 
libria  thought  to  be  based  on  “unreasonable” 
beliefs. 

Example:  Signaling  Games 

The  important  example  of  “semipooling”  equi¬ 
librium  in  certain  signaling  games  provides  impor¬ 
tant  applications  of  chance  moves,  behavioral 
mixed  strategies,  and  imperfect  information.  In  a 
signaling  game,  a  Sender  learns  the  value  of  an  ini¬ 
tial  Chance  move  and  then  sends  one  of  a  set  of 
feasible  signals  to  a  Receiver,  who  must  choose  an 
action.  The  players’  payoffs  depend  in  different 
ways  on  the  combination  of  the  true  chance  move 
and  the  Receiver’s  move.  If  the  Sender’s  payoff  does 
not  depend  directly  on  the  Sender’s  signal,  the  game 
is  one  of  “cheap  talk”;  otherwise  it  may  be  a  game 
of  “costly  signaling.”  Depending  on  the  details  of 
the  conflict  between  the  players’  interests,  in  equi¬ 
librium  the  Sender’s  signal  may  be  (a)  ignored  by 
the  Receiver,  called  “pooling”  since  the  Sender’s 
behavior  is  the  same  regardless  of  the  Chance  move; 

(b)  perfectly  informative  about  the  Chance  move, 
called  “separating”  since  the  Sender’s  signal  per¬ 
fectly  distinguishes  the  possible  Chance  moves;  or 

(c)  partially  informative,  called  “semipooling”  since 
the  Sender  uses  a  mixed  strategy,  reporting  truth¬ 
fully  with  some  probability  less  than  1. 

Figure  2  shows  the  game  tree  for  a  signaling 
game  that  minimally  embodies  all  these  features. 
The  game  begins  (in  the  middle  of  the  diagram) 
with  Chance  determining  whether  Policy  A  or 
Policy  B  would  be  best  for  Receiver;  A  is  best  with 
probability  p  <  .5.  Sender  learns  the  truth  and 
signals  either  a  or  b.  Receiver  knows  p  but  does 
not  know  the  Chance  move;  thus,  Receiver  has 
two  nontrivial  information  sets,  one  reached  when 
Sender  signals  a  and  the  other  when  Sender  signals 
b.  These  are  indicated  in  the  game  tree  by  the  dot¬ 
ted  lines  connecting  histories  that  Receiver  cannot 
distinguish.  After  hearing  the  report  from  Sender, 
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Receiver  must  choose  either  a  (implement  Policy 
A)  or  p  (implement  Policy  B).  At  each  terminal  his¬ 
tory,  each  pair  of  payoffs  in  the  diagram  shows 
Sender’s  payoff  above  Receiver’s  payoff.  Thus, 
Receiver’s  preferences  simply  favor  Policy  A  when 
A  is  best  and  B  when  B  is  best,  with  equal  intensity. 
We  assume  that  Sender’s  payoff  parameters  obey 
x  >  w  >  y  >  0,  so  Sender  has  a  preference  for  A 
regardless  of  which  policy  is  best  for  Receiver. 
Since  w  >  y,  Sender’s  message  has  a  direct  impact 
on  Sender’s  utility:  if  B  is  best  and  B  is  imple¬ 
mented,  Sender’s  utility  will  be  reduced  (by  w  —  y) 
for  having  falsely  reported  that  A  is  best.  This  is, 
therefore,  a  game  of  costly  signaling  rather  than 
cheap  talk. 

This  game  has  a  semipooling  equilibrium 
defined  by  the  following  assessment,  which  incor¬ 
porates  a  behavioral  mixed  strategy: 

•  Sender  signals  a  when  A  is  best;  when  B  is  best, 
Sender  signals  a  with  probability  r  =  (1  —  2 p)/ 

(1  —  p)  and  b  with  probability  1  —  r. 

•  Following  a  signal  of  b  (the  left-hand 
information  set),  Receiver  believes  that  B  is  best 
with  probability  1.  Following  a  signal  of  a  (the 
right-hand  information  set),  Receiver  believes 


there  is  a  probability  m  =  p/[p  +  (1  —  p)r]  that 
A  is  best  and  a  probability  1  —  m  that  B  is  best. 

•  Following  a  signal  of  b,  Receiver  chooses  (3  with 
certainty.  Following  a  signal  of  a.  Receiver 
chooses  a  with  probability  s  =  (tv  —  y)/(x  —  y) 
and  chooses  (3  with  probability  1  —  s. 

To  see  how  the  belief  m  is  derived  from  Bayes’s 
rule,  notice  that  the  top  history  in  the  right-hand 
information  set  is  reached  with  probability  p 
(Chance  probability  that  A  is  best,  times  Sender’s 
probability  1  of  truthfully  reporting  that)  and  the 
bottom  history  is  reached  with  probability  (1  —  p)r 
(the  Chance  probability  B  is  best  times  Sender’s 
mixed-strategy  probability  of  falsely  reporting  A  is 
best).  Notice  that  this  formulation  of  m  is  equiva¬ 
lent  to  the  usual  form  of  Bayes’  rule:  m  is  the  con¬ 
ditional  probability  that  A  has  occurred,  given  that 
Sender  has  reported  a,  so  Bayes’  rule  says  Pr(Ah) 
=  Pr(A)Pr(alA)  /  [Pr(A)  Pr(dA)  +  Pr(B)  Pr(dB)J, 
which  is  just  p  /  [p  +  (1  -  p)r],  as  given  above. 

As  with  any  signaling  game,  there  are  also  pool¬ 
ing  equilibria  in  which  Sender  uses  the  same  signal¬ 
ing  probabilities  regardless  of  Chance’s  move;  and 
Receiver  chooses  p  in  all  cases,  believing  B  is  best 
with  probability  1  —  p  >  .5  regardless  of  the  signal. 


x  >  w  >  y  >  0;  0  <  p  <  1/2;  Sender’s  payoff  on  top,  Receiver’s  on  bottom; 
figures  in  brackets  indicate  mixed-strategy  probabilities  and  beliefs 


Figure  2  Semipooling  Equilibrium  in  a  Signaling  Game 
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These  are  often  referred  to  as  “babbling”  equilib¬ 
ria,  since  the  Sender’s  signaling  is  meaningless. 

Cooperative  Game  Theory 

The  term  cooperative  game  theory  should  be 
understood  to  cover  a  variety  of  fairly  distinct 
approaches,  including  axiomatic  bargaining  theory 
and  spatial  voting  theory;  but  the  central  and 
original  branch  of  the  theory  is  that  of  games  in 
characteristic  function  form  or  simply  coalition 
games.  A  brief  treatment  will  serve  to  illustrate  the 
approach  and  its  differences  from  and  similarities 
to  the  noncooperative  theory. 

As  the  emphasis  on  coalitions  might  lead  one  to 
expect,  some  of  the  original  applications  of  game 
theory  to  politics  used  cooperative  game  theory. 
The  cooperative  game  formulation  provided  the 
basis  for  the  calculation  of  power  indices  that,  in 
a  1950s  conceptual  breakthrough,  clarified  the 
difference  between  voting  weights  and  voting 
influence  for  small  weighted-voting  bodies  and  for 
voting  bodies  with  complex  veto  rules,  notably  the 
United  Nations  Security  Council.  The  earliest 
models  of  voting  competition  also  took  the  form 
of  cooperative  games,  being  based  on  the  specifi¬ 
cation  of  winning  and  blocking  coalitions;  this 
approach  is  still  used  productively,  both  in  theo¬ 
retical  analyses  and  in  the  empirical  study  of  mul¬ 
tiparty  election  systems.  A  related  early  political 
science  use  of  game  theory  was  in  the  study  of 
parliamentary  government  formation,  where  coali¬ 
tion  patterns  were  analyzed  using  both  party  size 
and  the  restriction  that  coalitions  include  ideo¬ 
logically  compatible  partners. 

The  theory  of  coalition  games  originated  with 
the  attempt  of  John  von  Neumann  and  Oskar 
Morgenstern  to  theorize  about  zero-sum  games 
with  more  than  two  players.  In  such  games,  a  coali¬ 
tion  of  players  could  act  in  concert  to  assure  them¬ 
selves  certain  outcomes  regardless  of  the  efforts  of 
the  remaining  players;  the  question  is,  “what  coali¬ 
tions  will  form  to  take  advantage  of  their  capabili¬ 
ties?”  The  original  formulation  assumed,  contrary 
to  our  previous  assumptions,  that  payoffs  have 
meaning  across  players  and  can  in  fact  be  com¬ 
bined  and  voluntarily  transferred  from  one  player 
to  another  in  a  simple  additive  fashion.  This  class 
of  coalition  games  is  today  called  transferable  util¬ 
ity  (TU)  games;  in  nontransferable  utility  (NTU) 


games,  by  contrast,  one  keeps  separate  account  of 
the  payoffs  chosen  by  the  coalition  for  each  player. 

Formally,  let  N  =  {1,  2, ...,«}  be  a  set  of  play¬ 
ers.  For  every  subset  of  players  C,  let  t'(C)  be  the 
total  payoff  that  those  players  can  ensure  receiving 
regardless  of  the  actions  of  the  remaining  players; 
each  nonempty  C  is  a  possible  coalition ,  and  v  is  the 
characteristic  function ;  strictly  speaking,  v  com¬ 
pletely  describes  the  game  (since  it  implicitly  includes 
a  specification  of  the  set  of  players).  A  valid  charac¬ 
teristic  function  should  be  superadditive-.  For  any 
two  disjoint  coalitions  C  and  D,  v(C)  +  v(D)  < 
i/(C  U  D)  since,  acting  together,  the  members  of  the 
two  coalitions  could  do  anything  they  could  have 
done  by  acting  separately,  and  perhaps  more.  Let 
x  =  (xp  x„  . .  . ,  xj  be  the  distribution  of  payoffs 
that  ultimately  results  from  the  play  of  the  game, 
after  any  transfers  of  utility  to  which  members  of 
coalitions  have  agreed.  The  aim  of  the  characteristic 
function  formulation  is  to  predict  the  distribution  of 
payoffs  among  the  players,  given  the  capabilities  of 
the  coalitions  that  they  might  enter  into.  Notice  that 
the  particular  actions  taken  by  individuals,  acting 
alone  or  in  concert,  are  completely  abstracted  away, 
and  that  it  is  implicitly  assumed  that  a  coalition  can 
perfectly  accomplish  any  desired  commitment, 
cooperation,  or  coordination  to  achieve  its  potential 
total  payoff.  The  definition  of  v(  C)  as  the  most  that 
C’s  members  can  ensure  themselves  embodies  a  pes¬ 
simistic  viewpoint  that  is  really  appropriate  only 
where  the  underlying  game  is  zero  sum  (so 
v(N)  =  0),  although  that  restriction  is  not  always 
imposed  when  the  coalition  theories  are  applied. 

The  first,  very  weak,  solution  concept  is  the  set 
of  imputations  of  a  game  v.  This  is  the  set  of  pay¬ 
off  distributions  satisfying  two  properties: 

1.  Individual  rationality.  For  every  player  i,  xj  £ 
v({i}),  since  the  player  has  no  reason  to  accept  a 
payoff  lower  than  what  he  or  she  could  ensure 
himself  or  herself  by  acting  alone. 

2.  Efficiency.  Xi  +  x2  +  •  •  •  +  x„  =  v{N),  since  by 
definition  the  grand  coalition  can  get  any 
feasible  total  payoff,  and  there  is  no  reason  to 
settle  for  less. 

The  addition  of  one  more  condition  defines  the 
fundamental  solution  concept  for  coalition  games, 
the  core.  Let  x  and  y  be  two  imputations;  x  is  said  to 
dominate  y  if  there  is  some  coalition  C  such  that 


Gender  961 


(a)  v(C)  ^  S,eCx;,  that  is,  members  of  C  have  the 
power  to  impose  x,  and  (b)  x(  >  yt  for  each  i  E  C,  and 
strictly  so  for  some  i  E  C,  that  is,  each  member  of  C 
is  at  least  as  well  off,  and  some  better  off,  under  x 
than  under  y.  The  core  of  v  then  consists  of  the  set 
of  all  undominated  imputations.  In  many  games  of 
interest,  the  core  is  empty;  consider,  for  example, 
three  players  who  divide  a  dollar  by  majority  rule. 
The  Bondareva-Shapley  theorem  gives  a  balance 
condition  ensuring  the  core  will  be  nonempty. 
Alternative  conditions  on  imputations  yield  other 
solution  concepts,  such  as  the  von  Neumann- 
Morgenstern  solution,  the  kernel,  and  the  nucleolus. 
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Gender 


Today,  gender — a  term  that  is  also  used  in  lan¬ 
guages  other  than  English — is  used  in  the  context 


of  gender  relations  or  as  a  social  construction  of 
sexuality.  It  is  comparatively  a  new  term,  with 
gender  originally  being  used  until  the  1960s  only 
as  a  grammatical  category  to  allocate  nouns  into 
different  categories — feminine,  masculine,  and 
neutral.  Since  then,  the  term  has  been  fully 
engrained  in  the  social  and  cultural  sciences 
discourse  and  has  also  established  itself  in  the 
political  discourse  of  different  societies,  as  the 
concepts  of  gender  mainstreaming  and  gender 
budgeting  show.  The  expansion  of  the  term  and 
its  widespread  use  can  be  seen  as  a  sign  that  gen¬ 
der  inequality  is  perceived  in  many  areas  of  soci¬ 
ety  and  is  examined  as  a  social-organizational 
form.  This  entry  discusses  the  origins  of  this  con¬ 
cept,  its  relevance  in  social  discourses,  and  its 
implications  for  contemporary  political  science. 

Background 

As  opposed  to  the  biological  term  sex,  the  term 
gender  refers  to  the  “social”  or  “psychological” 
gender  of  a  person,  where  “sex”  and  “gender” 
should  be  seen  as  analytically  independent  of  each 
other.  Gender  refers  to  a  historically  specific  prac¬ 
tice  of  social  classification.  It  focuses  on  the  special 
perceptions  and  patterns  of  interpretation  by 
which  the  binary  structure  of  “male”  and  “female” 
is  viewed,  dealt  with,  and  institutionalized.  In 
doing  so,  it  deals  with  the  symbolic  order  of  sexes 
in  a  society,  as  well  as  with  the  self-attributions 
and  identification  of  the  individual,  which  focus 
on  it  but  are  not  identical  to  this  symbolism.  In 
most  societies,  categorizing  into  male  and  female 
gender  categories  takes  place  on  the  basis  of  the 
shaping  of  primary  and  secondary  gender  charac¬ 
teristics.  Disambiguations  and  the  one-sidedness  of 
“male”  and  “female”  are  associated  with  the  two- 
gender  classification,  as  well  as  with  social  rela¬ 
tions  of  symmetry  and  asymmetry,  equality  and 
hierarchy,  of  inclusion  and  exclusion.  These  rela¬ 
tionships  are  embedded  in  concrete  social  relations 
and  structural  connections.  At  the  same  time,  the 
fact  that  classifications  do  not  simply  exist  but 
constantly  need  to  be  reacquired  points  to  their 
instability  and  dependency  on  having  to  be  updated 
at  a  practical  level.  Based  on  the  realization  that 
social  gender  differentiation  is  not  based  on 
anthropological,  biological,  or  physiological  con¬ 
ditions  but  on  the  result  of  social  classifications,  it 
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can  be  inferred  that  the  hierarchical  relationship  of 
the  sexes  to  one  another  is  not  an  expression  of  a 
“natural”  order  that  cannot  be  changed. 

Stages  in  the  Discourse 

The  sociological  debate  on  inequality  between 
women  and  men  and  the  category  of  “gender” 
began  around  the  middle  of  the  1960s  and  is 
closely  linked  to  the  so-called  second  wave  of  the 
women’s  movement  emerging  at  that  time  in  many 
Western  countries.  The  studies,  mainly  compiled 
by  female  academics,  first  concentrated  on  unmask¬ 
ing  science  as  “male  oriented”  and  criticizing  the 
lack  of  awareness  of  women’s  lives  and  active  deal¬ 
ings  in  research  and  teaching.  A  close  connection 
between  the  women’s  movement  and  feminist 
criticism  in  science  was  established,  and  it  resulted 
in  criticism  of  the  discrimination  against  women  in 
academia  and  society  and  of  the  one-sidedness  of 
research  that  was  mainly  conducted  by  men  and 
that  often  treated  women  as  objects. 

Feminist  criticism  of  science  highlighted  the 
facts  that  cultural,  political,  and  socially  dominant 
gender  images  influenced  the  scientific  theories  of 
the  time.  Based  on  this,  the  much  lauded  scientific 
objectivity  was  often  distorted  by  a  male  point  of 
view.  The  primary  aim  in  women’s  studies  was, 
therefore,  to  make  the  ignored  contributions  of 
women  in  academia  visible  by  coming  to  terms 
with  and  analyzing  the  social  setting  of  the  time 
and  women’s  thoughts  and  actions.  In  doing  so, 
women’s  studies  were  viewed  at  first  as  biased 
research  that  placed  the  focus  on  women  and  on 
the  changes  in  their  life  situations. 

This  view  was  gradually  broadened,  however, 
and  increasingly  women’s  thinking  and  actions 
were  analyzed  in  relation  to  men’s  with  the  inten¬ 
tion  of  identifying  differences  and  hierarchies.  It 
was  in  this  context  that  the  discussion  began  about 
what  sex  and  gender  really  mean.  In  gender  and 
women’s  studies,  gender  is  no  longer  treated  as 
something  natural  but  as  a  social  or  structural  cat¬ 
egory  with  subsequent  social  roles.  People  are 
assigned  or  denied  specific  characteristics,  attri¬ 
butes,  and  places  in  society  on  the  basis  of  their  sex. 
Sociological  gender  studies  since  the  1980s,  there¬ 
fore,  speak  of  gender  as  a  structural  category.  What 
is  meant  is  that  the  separation  and  hierarchiza- 
tion  of  social  spheres  such  as  reproduction  and 


production,  and,  consequently,  the  public  and  pri¬ 
vate  realms  are  significant  characteristics  of  mod¬ 
ern,  industrial  societies.  This  separation  and  hierar- 
chization  are  accompanied  by  an  engendering  of 
different  spheres  and  social  subdisciplines.  Gender 
is  regarded  as  a  binary  hierarchy  that  co-organizes 
the  different  levels  of  social  organizations.  Although 
study  on  the  discrimination  against  women  is  fre¬ 
quently  the  point  of  departure  for  research,  gender 
research  now  also  addresses  the  social  organization 
of  gender  relationships  more  broadly. 

Based  on  the  recognition  that  gender  is  a  part 
and  a  result  of  a  construction  process  in  which 
“he”  and  “she”  as  individuals  are  involved,  a 
sociological  shift  has  occurred:  Gender  is  no  longer 
viewed  as  a  structural  category  but  also  as  a  pro¬ 
cess  category.  Candace  West  and  Don  Zimmerman’s 
concept  of  “doing  gender,”  which  was  developed 
in  1987,  differentiates  itself  from  the  current  sex/ 
gender  distinction  and  the  nature/culture  concept 
transported  with  it.  Doing  gender  understands  gen¬ 
der  attribution  and  gender  identity  as  a  continuous 
production  process  that  is  taking  place  in  all  human 
activities.  This  ensures  that  the  gender  discrepancy 
is  generated  by  people  continuously  making  them¬ 
selves — and  allowing  themselves  to  be  turned 
into — women  and  men  in  their  everyday  lives.  It  is 
a  collective  and  individual  process  of  construction 
through  which,  depending  on  the  context  and  the 
situation,  dual-gender  classification  is  strategically 
implemented  in  practical  social  actions. 

Similarly,  a  critical  “men’s  studies”  category  is 
emerging  that  focuses  on  male  disciplinary  and 
normalization  processes  and  calls  into  question  the 
norms  and  realities  of  “hegemonic  maleness.”  A 
central  understanding  within  men’s  studies  is  the 
recognition  that  hegemonic  maleness  is  responsible 
not  only  for  the  dominance  of  men  over  women 
but  also  for  hierarchies  among  men.  Beyond  this, 
research  on  maleness  has  emphasized  the  fact  that 
men  are  also  subjected  to  one-dimensional  gender 
assignations  that  are  formative  and  do  not  at  all 
represent  the  “general  human  being.”  An  important 
difference  from  women’s  studies  continues  to  be 
that  the  experience  of  injustice  and  discrimination 
has  not  formed  the  basis  of  research  and  the  rela¬ 
tionship  between  research  and  practice  in  the  con¬ 
text  of  a  “political”  or  partisan  intervention — as 
was  central  in  the  early  days  of  women’s  studies — 
has  therefore  had  very  little  significance. 
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Gender  studies  further  broadened  in  scope  in 
the  1990s.  The  category  of  gender,  which  had  until 
then  been  viewed  as  a  “natural”  classification, 
even  if  culturally  and  socially  formed,  was  ques¬ 
tioned.  Gender  Trouble,  a  book  by  the  American 
philosopher  Judith  Butler  published  in  1990, 
evoked  a  comprehensive  discussion  on  the  iden¬ 
tity-giving  category  of  “woman.”  What  had  widely 
served  as  an  unquestionable  basic  assumption  in 
women’s  studies — the  natural  existence  of  two 
exclusive  genders — increasingly  became  an  object 
of  academic  examination.  The  binary  distinction 
between  two — and  only  two — genders  is  now  fre¬ 
quently  understood  as  an  ideological  construct  of 
a  discursive  power  system.  Gender  affiliation  of 
persons  and  the  two-sex  model  as  a  social  classifi¬ 
cation  and  functional  differentiation  principle  do 
not  refer  to  the  law  of  nature  but  are  the  result  of 
social  constructive  processes. 

This  construction  of  gender  cannot,  however, 
be  viewed  as  a  neutral  form  of  determination; 
rather,  it  is  already  part  of  the  social  determination 
of  gender  relations  and  therein  also  of  the  social 
inequality  between  men  and  women.  The  facti¬ 
tiousness  of  gender  not  only  affects  gender,  how¬ 
ever,  but  also  applies  to  the  biological  category  of 
sex.  As  the  latter  is  also  not  free  of  a  sociocultural 
construct,  the  “prediscursivity”  of  sex  and  the 
norms  and  regulations  tied  to  biological  sex  must 
be  debated. 

Gay  theory  also  focused  the  spotlight  on  the 
connection  between  gender  and  heteronormativity, 
or  the  social  norm  of  heterosexuality.  Academic 
studies  continue  to  take  up  heterosexuality  as  a 
power  configuration,  which  gender  studies  has 
barely  researched  and  has  not  acknowledged  as 
hetero-normatively  composed.  Normative  refer¬ 
ence  points  in  women’s  and  gender  studies  are  put 
up  for  discussion  with  the  transformation  and 
broadening  of  knowledge  on  the  category  of  gen¬ 
der  as  a  binary  construct. 

In  the  beginning,  gender  studies  were  oriented — 
as  was  the  women’s  movement — toward  the  issue 
of  how  equality  between  women  and  men  could  be 
attained  in  all  areas  and  which  barriers  are  an 
obstacle  to  this  goal  at  the  political,  legal,  sym¬ 
bolic,  and  other  levels.  At  a  practical  political  level, 
equality  was  seen  as  nondiscriminatory,  but  this 
view  raised  the  issue  that  equality  leads  to  a 
cementing  or  even  strengthening  of  inequality 


according  to  the  different  points  of  departure  and 
socially  created  differences.  In  addition,  there  was 
a  criticism  that  the  concept  of  equality  was  tar¬ 
geted  toward  aligning  the  position  of  women — 
their  social  and  individual  possibilities — with  those 
of  men.  The  measure  of  equality  is  thus  “the  man” 
and  the  options,  values,  and  concepts  of  work,  life, 
and  politics  ascribed  to  him. 

As  a  patriarchal  devaluation  of  the  female  is 
latently  connected  to  this  view  of  equality,  the  dif¬ 
ference  perspective  articulated  the  suggestion  that 
women  should  be  treated  differently  from  men. 
Connected  to  this  is  the  challenge  to  go  beyond 
stereotypical  attributions  to  reflect  on  the  ways  in 
which  women  are  different  or  unique.  Furthermore, 
the  characteristics,  lifestyles,  and  so  on  of  women 
should  not  be  viewed  as  deficient  but  as  parts  of 
one  and  the  same  concrete  whole.  Associated  with 
this,  however,  is  a  fear  of  a  codification  of  typical 
female  characteristics  and  gender  roles  that  tend  to 
become  gridlocked.  The  discussion  on  equality  or 
difference  has  become  a  never-ending  story;  none¬ 
theless,  it  is  clear  that  the  deconstruction  of  gender 
has  led  to  a  shift  from  the  selection  of  one  of  these 
two  positions  to  a  concern  with  analyzing  the  con¬ 
nections  between  equality  and  difference.  For 
some  years  now,  the  perspective  has  broadened, 
and  in  addition  to  the  gender  imbalance,  differ¬ 
ences  among  women  are  also  increasingly  being 
taken  into  consideration — especially  class-specific 
or  ethnic  differences  that  resulted  from  American 
theory-building  discourses  and  Black  feminism,  as 
well  as  the  debate  surrounding  women  (co)perpe- 
trators  during  National  Socialism. 

As  all  individuals  not  only  belong  to  a  group 
but  always  belong  to  multiple  groups  at  the  same 
time,  how  the  category  of  gender  interacts  with 
other  categories  of  inequality  is  analyzed  below. 
Both  legally  and  politically,  this  is  codified  in  the 
concept  of  multiple  discrimination.  At  a  theoreti¬ 
cal  level,  gender  studies  subsume  this  phenomenon 
as  a  crossing  over  between  different  categories  of 
inequality  under  the  term  of  inter sectionality,  a 
term  that  the  American  legal  specialist  Kimberle 
Crenshaw  coined  in  1987  in  the  context  of  Black 
feminism.  The  debate  surrounding  intersectional- 
ity  and  the  interdependencies  among  race,  class, 
and  gender  has  so  far  been  mainly  theory  driven 
and  is  only  gradually  being  enhanced  by  empirical 
studies.  In  general,  it  is  argued  that  the  question  of 
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whether  sex  or  another  category  has  a  larger  influ¬ 
ence  on  discrimination,  which  has  been  raised  in 
the  discussion  on  gender  as  a  “master  category” 
from  time  to  time,  should  remain  open  in  the 
analysis. 

Gender  in  Political  Science 

In  comparison  with  other  social  science  disciplines, 
political  science  was  relatively  late  in  focusing  on 
gender  as  a  category.  In  the  beginning,  solely  the 
field  of  “women  and  politics”  was  the  object  of 
study.  Within  this  context,  the  question  of  the 
political  representation  of  women  and  how  it  was 
reflected  in  political  issues  was  especially  interest¬ 
ing.  The  feminist  perspective  first  took  up  this 
analytical  perspective  and  questioned  the  reasons 
behind  the  formal  and  informal  exclusion  of 
women  from  politics. 

A  reconceptualization  and  expansion  of  the 
research  field  to  include  “gender  and  politics,” 
however,  was  initiated  relatively  quickly.  In  this 
context,  maleness/masculinity  as  a  measure  and 
norm  is  inscribed  in  all  political  processes  in  a 
more  or  less  subtle  manner.  In  most  countries,  a 
quantitative  overrepresentation  of  men  in  politics 
exists  that  is  oriented  on  the  biological  sex. 
Politics  is  not,  however,  male  because  almost  all 
political  decision-making  bodies  are  dominated  by 
men,  but  because  male  interests  are  systematically 
presented  as  hegemonic,  representing  and  giving 
preferential  treatment  to  interests  that  most  closely 
correspond  to  the  everyday  lives  of  this  social 
group.  In  addition,  there  is  a  substantive  mascu¬ 
linity:  as  Joan  Acker  highlighted  with  the  term 
gendered  organizations ,  institutionalized  male 
patterns  are  integral  components  of  political  orga¬ 
nizations  that  influence  political  rules,  values, 
norms,  and  structures. 

Thus,  gender-critical  discourse  within  political 
science  generally  takes  place  at  two  levels.  First,  it 
occurs  at  the  sociopolitical  level  and  expounds  on 
the  limited  number  of  women  in  the  discipline.  The 
underrepresentation  of  women  results  in  the  fact 
that  women  receive  less  attention  as  actors  and,  at 
the  same  time,  the  androcentric  scholarly  concep¬ 
tion  of  women,  sex,  and  sexuality  is  maintained. 
The  other  level  focuses  primarily  on  epistemologi¬ 
cal  and  conceptual  questions  and  delivers  a  critical 
inspection  of  political  science  paradigms  and  terms 


at  a  theoretical  analytical  level  with  the  goal  of 
making  the  gender  subtext  visible.  Based  on  this 
deconstruction  of  apparently  gender-neutral  nor¬ 
mative  premises,  terms,  and  research  interests  in 
mainstream  political  science,  the  political  was 
broadened  and  reformulated  to  include  a  gendered 
view.  Part  and  parcel  of  this  was  a  critical  revision 
of  its  core  areas  and  subdisciplines,  among  others 
the  fundamentals  of  political  scientific  thinking. 
Among  these,  especially,  are  the  fundamental  cat¬ 
egories  of  the  public  sphere,  privacy,  and  politics. 

The  Public  Versus  the  Private  Sphere 

The  central  point  of  departure  for  a  gender-critical 
political  science  is  the  challenge  of  the  separation 
and  polarization  between  the  public  political  and 
the  private  nonpolitical  spheres.  Every  debate  con¬ 
cerning  this  distinction  deals  with  the  problem  of 
participation  at  its  very  core  or  the  possibilities  and 
conditions  of  women’s  presence  in  politics.  In  the 
history  of  political  ideologies,  “women,  together 
with  other  groups  such  as  slaves,  servants  (to 
which  a  large  number  of  women  belonged)  and 
landless  men,  were  described  as  incapable  of  lead¬ 
ing  a  moral  life  or  able  to  participate  in  the  justice 
system”  (Seyla  Benhabib  &  Linda  Nicholson, 
1987,  p.  514).  The  differences  and  inequalities 
between  the  genders  were  substantiated  on  four 
assumptions:  First,  the  natural,  anatomical,  and 
physiological  differences  were  used  to  justify 
inequality  by  claiming  that  the  ability  to  give  birth 
as  well  as  other  apparent  characteristics  of  women 
were  decisive  criteria  for  participating  in  political 
life.  Second,  it  was  assumed  that  men  are  both 
physically  and  intellectually  superior  to  women, 
which  brought  with  it  a  hierarchy  of  naturalized 
differences  that  was  also  reflected  in  the  signifi¬ 
cance  of  the  spheres.  Third,  the  so-called  female 
sphere  is  irrelevant  for  political  life.  This  is  accom¬ 
panied  by  the  fourth  point — the  assumption  that 
the  domestic  sphere,  although  it  takes  place  outside 
the  political  arena,  is  nonetheless  controlled  by 
politics  and  is  inferior  to  it  (Benhabib  &  Nicholson, 
1987,  p.  515).  In  sum,  this  means  that  male  reason 
and  rational  conduct  are  attributed  to  the  public 
sphere,  while  the  private  sphere  is  seen  as  being 
characterized  by  irrational,  unpredictable  patterns. 

Criticism  is  targeted  at  the  double  separation  of 
the  public  and  the  private,  which  has  traditionally 
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influenced  theory  formation  in  political  science.  By 
distinguishing  state  and  (private)  economy  on  the 
one  hand,  and  economy  and  (private)  households 
on  the  other,  gender  inequalities  are  left  out  of  sight 
in  many  respects.  The  gender-specific  division  of 
labor  that  is  constitutive  for  the  (private)  economy 
(paid/unpaid)  eludes  state  and  political  regulation, 
such  as  gender-hierarchical  appraisal  of  labor  (well 
paid/badly  paid),  which  essentially  represents  a 
continuation  of  this  labor  division  in  a  semipublic 
space  in  its  differentiation  between  women’s  and 
men’s  activities.  The  entire  vocabulary  used  in 
political  science  is  built  on  this  dichotomy.  Feminist 
welfare  state  research  especially  points  to  the  fact 
that  gender-specific  labor  division  is  not  only  a 
constitutive  element  of  the  (welfare)  state — as 
Carole  Pateman  (1988)  showed  with  the  “sexual 
contract”  approach — but  that  politics  is  also  gen¬ 
dered  and  supports  gendered  processes:  “In  a  soci¬ 
ety  in  which  there  is  a  gendered  division  of  labour 
there  is  almost  no  area  of  policy  in  which  women 
and  men  are  not  differently  affected”  (Joni 
Lovenduski,  1992,  p.  610).  The  discourse  on  the 
separation  of  public  and  private  highlights  the  fact 
that  a  close  connection  exists  between  the  two 
spheres,  and  that,  at  the  same  time,  these  are  placed 
in  a  hierarchy  (e.g.,  important-unimportant,  inclu¬ 
sion-exclusion)  that  is  constructed  and  has  an  ideo¬ 
logical  basis. 

Broadening  of  the  Definition  of  Politics 

The  division  between  the  public  and  the  private 
sphere  has  also  become  engrained  in  the  field  of 
politics  and  the  political.  The  classic  definition  of 
politics  views  politics  ( politikos )  as  the  administra¬ 
tion  of  the  affairs  of  the  polls ,  the  community.  In 
contrast,  the  areas  of  housework,  household 
administration,  the  house  ( oikos ),  and  the  family, 
or  in  other  words  those  areas  in  which  women 
were  traditionally  active,  were  not  seen  as  politi¬ 
cal.  Even  if  this  classic  political  understanding  in 
political  science  does  not  dominate  (anymore),  and 
many  different  political  definitions  exist,  the  polit¬ 
ical  is  commonly  positioned  in  the  public  sphere. 
Feminist  political  science,  therefore,  calls  for  a 
widening  of  a  public  and  institutionalized  defini¬ 
tion  of  politics  that  only  sees  gender  relationships 
and  gender  relations  as  un-  or  prepolitical  but  not 
as  a  political  expression  or  object. 


The  American  political  scientist  and  philosopher 
Nancy  Fraser,  for  example,  distinguishes  between 
an  official  political  and  a  discursive  political  defini¬ 
tion.  The  former  views  a  matter  as  political  if  it  is 
represented  in  official  government  institutions  or, 
in  other  words,  in  parliaments,  administrations, 
and  other  bodies.  As  Fraser  (1989)  notes,  “the  dis¬ 
cursive  political  definition  comprises  everything 
that  is  disputed  beyond  a  range  of  different  discur¬ 
sive  arenas  and  among  a  range  of  different  dis¬ 
course  publics”  (p.  166).  Between  the  political 
definitions  is  a  close  connection,  as  an  issue  usually 
only  turns  into  an  object  of  legitimate  state  inter¬ 
vention  once  it  has  been  discussed  among  the 
broader  public.  The  border  between  both  attribu¬ 
tions  is  not  defined  but  is  an  object  of  conflict  and 
changes  within  social  discourses. 

The  expansion  of  political  contents  and  spaces 
is  accompanied  by  criticism  against  forms,  norms, 
processes,  and  practices  linked  to  a  governmental 
political  definition.  This  would  lead  to  the  fact  that 
women  not  only  as  a  group  of  people  but  also 
concerning  women’s  interests  (even  if  these  are  not 
necessarily  homogeneous  interests)  would  insuffi¬ 
ciently  be  represented  or  not  be  represented  at  all. 

A  further  important  aspect  in  the  discourse  sur¬ 
rounding  the  definition  of  politics  concerns  the 
question  of  how  gender  relations  and  gender  rela¬ 
tionships  can  be  viewed  as  political  objects. 

When  politics  govern  social  relationships,  they 
also  embody  identities  and  interests  and  the  rela¬ 
tionships  between  men  and  women,  as  well  as 
between  different  groups  of  women  or  men.  If  this 
understanding  is  pursued,  the  field  of  activity  of 
political  science  is  broadened  to  include  a  debate 
and  analysis  of  politics  as  the  regulation  of  bipolar 
hierarchical  gender  relationships.  Birgit  Sauer 
(2005)  outlines  the  meaning  of  gender  categories 
for  political  science  as  follows: 

As  a  structural  category  in  politics  and  political 
science,  gender  is  constituted  at  the  subjective 
individual  and  objective  levels — a  culturally 
formed,  socially  shaped  relationship  that  is  loaded 
with  purpose  and  meaning,  that  possesses  structural 
and  institution-building  potency,  (p.  381) 

Gender  is  to  be  understood  as  a  structural  and 
structuring  category.  This  means  that  gender,  on 
the  one  hand,  is  a  political  institution  and  that,  on 
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the  other  hand,  institutions  “have”  a  gender  and 
reproduce  genders.  The  following  five  levels  are 
relevant  to  the  analysis  of  gender  in  political  sci¬ 
ence:  first,  the  level  of  political  cultures  and  sym¬ 
bols;  second,  the  level  of  state  organizations  and 
rules  (polity);  third,  the  policy  level;  fourth,  the 
level  of  political  processes  and  their  actors  (poli¬ 
tics);  and  fifth,  the  level  of  political  subjects. 

Status  Quo 

Nowadays,  a  gendered  view  of  the  discipline  can 
be  found  in  nearly  all  areas  of  the  field,  such  as 
international  relations,  political  theory,  political 
systems,  or  policy  research.  Similar  to  this  profil¬ 
ing,  a  partial  institutionalization  of  feminist  or 
gendered  political  science  as  an  area  of  research 
and  as  a  teaching  subject  has  taken  place. 
Nevertheless,  a  problem  that  Virgina  Sapiro  (1995) 
identified  almost  1 5  years  ago  continues  to  be  true: 

Despite  the  size  of  the  gender  politics  community, 
and  despite  the  fact  that  most  political  science 
departments  now  offer  at  least  one  course  in  this 
area,  gender  politics  has  not  been  fully  integrated 
into  political  science;  its  theories,  questions,  and 
conclusions  have  not  been  “mainstreamed”  to 
any  significant  degree,  (p.  292) 

This  means  that  gender  relations  beyond  the  femi¬ 
nist  and  gender-oriented  branch  of  the  discipline 
are  not,  or  are  barely,  conceptualized  as  social 
relations  that  structure  politics. 
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Genocide 


While  the  term  genocide  is  new,  the  phenomenon  is 
old.  Genocide  has  a  long  history,  going  back  to  at 
least  the  12th  century  BCE  and  continuing  into  the 
present.  The  20th  century  can  be  considered  an 
“age  of  genocide,”  with  numerous  examples  of 
genocide  throughout  the  period  claiming  as  many 
as  70  million  lives  (some  scholars  would  say  many 
more  than  that).  After  the  Holocaust,  the  watch¬ 
word  was  “Never  again,”  yet  since  1945  the  lives 
claimed  by  genocide  exceed  those  killed  in  interna¬ 
tional  and  civil  wars  combined.  But  with  so  many 
genocides,  different  situations,  motives,  and  victim 
groups,  it  has  been  difficult  to  define  genocide  in 
ways  that  everyone  would  accept.  Moreover,  the 
term  is  widely  used  in  a  rhetorical  fashion  to  attract 
attention  and  to  further  political  and  social  demands. 

Nevertheless,  the  definition  of  genocide  that  is 
contained  in  the  United  Nations  (UN)  Convention 
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on  the  Prevention  and  Punishment  of  Genocide, 
though  widely  criticized,  is  the  one  that  prevails 
among  scholars.  To  complicate  matters  further, 
the  concept  of  genocide  could  be  described,  fol¬ 
lowing  Ludwig  Wittgenstein,  as  a  concept  with 
“blurred  edges.”  Here,  the  family  resemblances 
and  differences  between  genocide  and  other  con¬ 
cepts,  such  as  ethnic  cleansing,  must  be  taken  into 
account.  There  is  still  a  core  meaning:  Genocide 
consists  of  acts  that  intentionally  threaten  the  exis¬ 
tence  of  human  groups.  Yet  this  statement  itself  is 
incomplete:  What  acts?  What  level  of  threat?  What 
is  a  group?  Who  defines  it?  And  how  is  intention- 
ality  determined? 

Before  the  term  genocide  existed,  mass  killings 
were  referred  to  as  massacres ,  race  extermination, 
and  so  on.  Perpetrators  had  very  different  terms: 
victory,  self-defense,  purification  of  society.  It  was 
not  until  1944  that  the  term  genocide  was  created 
by  the  Polish  Jewish  lawyer  Raphael  Lemkin  in  his 
book  Axis  Rule  in  Occupied  Europe.  The  term 
was  based  on  the  Greek  genos  (race,  group)  with 
the  Latin  cide  (killing).  Lemkin  used  the  term  to 
describe  not  so  much  the  destruction  of  individuals 
but  “the  destruction  of  a  nation  or  ethnic  group.” 
Political  and  social  groups,  such  as  classes,  were 
not  included  in  his  definition  and  nor  was  the  par¬ 
tial  destruction  of  a  group.  He  stated,  “Genocide 
is  a  coordinated  plan  of  different  actions  aiming  at 
the  destruction  of  the  essential  foundations  of  the 
life  of  national  groups,  with  the  aim  of  annihilat¬ 
ing  the  groups  completely”  (p.  79).  For  him,  the 
core  of  the  crime  was  the  destruction  of  groups, 
not  individuals;  he  believed  that  groups  were  the 
bearers  of  civilization  and  culture,  of  identity,  of 
diversity,  and  thus  in  special  need  of  protection. 
Pieter  Drost  (1959)  countered  with  a  definition 
that  focused  not  on  the  destruction  of  groups  but 
of  individuals:  “Genocide  is  the  deliberate  destruc¬ 
tion  of  physical  life  of  individual  human  beings  by 
reason  of  their  membership  of  any  human  collec¬ 
tivity  as  such”  (p.  125). 

Lemkin  pressed  the  UN  to  declare  genocide  a 
crime  against  international  law.  From  1946  to 
1948,  the  UN  debated  the  wording  of  what  would 
become,  by  unanimous  vote  in  December  1948, 
the  UN  Convention  on  the  Prevention  and 
Punishment  of  Genocide.  With  sufficient  ratifica¬ 
tions  by  member  states,  the  Convention  entered 
into  force  in  early  1951.  But  despite  the  occurrence 


of  numerous  examples  of  genocide  in  the  years  fol¬ 
lowing,  not  a  single  case  of  genocide  was  tried 
under  international  law  until  the  opening  years  of 
the  21st  century.  The  International  Tribunals  for 
the  former  Yugoslavia  and  Rwanda  have  now 
both  charged  and  convicted  several  individuals  for 
genocide.  One  problem  with  this,  however,  is  that 
a  vast,  state-organized  process,  with  deep  societal 
involvement,  is  treated  as  if  it  were  simply  a  trial 
for  murder. 

The  Convention  makes  genocide  and  related 
crimes,  such  as  incitement  to  genocide,  crimes 
under  international  law,  “whether  committed  in 
time  of  peace  or  in  time  of  war.”  But  the  heart  of 
the  Convention  is  found  in  Article  II,  where  the 
crime  of  genocide  is  defined  and  where  many  of 
the  problems  with  the  concept  lie: 

In  the  present  Convention,  genocide  means  any 
of  the  following  acts  committed  with  intent  to 
destroy,  in  whole  or  in  part,  a  national,  ethnical, 
racial  or  religious  group,  as  such: 

(a)  Killing  members  of  the  group; 

(b)  Causing  serious  bodily  or  mental  harm  to 
members  of  the  group; 

(c)  Deliberately  inflicting  on  the  group 
conditions  of  life  calculated  to  bring  about  its 
physical  destruction  in  whole  or  in  part; 

(d)  Imposing  measures  intended  to  prevent 
births  within  the  group; 

(e)  Forcibly  transferring  children  of  the  group 
to  another  group. 

Many  important  issues  arise  from  this  concep¬ 
tion  of  genocide.  First,  not  all  groups  are  pro¬ 
tected;  political  and  social  groups,  such  as  classes, 
lie  outside  the  boundaries  of  the  definition.  The 
reasons  for  this  are  that  (a)  only  groups  would  be 
covered  that  were  “stable  and  permanent”  and 
(b)  many  countries  wanted  to  keep  open  their  right 
to  suppress  by  violence  any  political  or  social  chal¬ 
lenge  to  the  authority  of  the  ruling  power.  Second, 
not  only  must  there  be  intent  to  destroy  a  group, 
but  there  must  also  be  “specific  intent,”  that  is,  the 
intent  to  destroy  the  group  must  be  because  of  its 
particular  characteristics  (national,  ethnical,  racial, 
or  religious).  Destroying  such  a  group  for,  say, 
economic  reasons,  would  not  count  as  genocide. 
Third,  the  question  of  destroying  a  group  “in  part” 
has  raised  issues  about  how  large  a  part  and 
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whether  it  is  a  matter  of  numbers  or  a  percentage 
of  the  overall  group  that  counts.  In  general,  these 
issues  have  now  been  dealt  with  by  saying  a  “sub¬ 
stantial  part,”  leaving  it  to  courts  to  decide  just 
where  to  draw  the  boundaries.  Some  legal  schol¬ 
ars,  however,  would  add  that  “in  part”  could 
mean  a  “significant  part,”  in  some  functional 
sense — for  example,  culling  the  leadership  ele¬ 
ments  of  the  group. 

A  fourth  issue  concerns  the  problem  of  what 
constitutes  a  group.  For  example,  the  Tutsis  in 
Rwanda  were  neither  a  national  nor  an  ethnic 
group,  but  they  were  perceived  by  their  killers  to 
be  members  of  a  group.  In  the  UN  Rwandan  tribu¬ 
nal,  the  court  decided  that  groups  and  membership 
in  them  are  defined  by  the  perceptions  of  the 
perpetrator.  Finally,  the  acts  that  are  listed  as  con¬ 
stituting  genocide  are  physical,  biological,  and 
cultural  may  not  matter,  but  of  the  five  acts,  any 
one  of  which  constitutes  genocide,  only  one  of 
them  actually  involves  killing,  though  all  aim  at 
the  destruction  of  the  group.  The  UN  definition 
focuses  on  intent,  not  on  results.  That  genocide 
could  occur  with  few  or  even  no  deaths  is  a  sur¬ 
prising  conceptual  outcome  of  the  UN’s  approach 
and  contrary  to  both  scholarly  analysis  and  public 
understanding  of  what  is  meant  by  genocide. 

There  are  other  concepts  that  bear  a  family 
resemblance  to  genocide  but  also  can  be  distin¬ 
guished  from  it — for  example,  war.  Many  geno¬ 
cides  occur  within  the  context  of  war,  but  apart 
from  ancient  warfare,  which  was  synonymous 
with  genocide,  war  traditionally  was  a  battle 
between  armed  combatants  not  between  perpetra¬ 
tors  and  largely  unarmed  civilians.  In  the  20th 
century,  war  has  often  been  total  war,  and  waged 
against  civilians  as  well  as  soldiers,  but  not  with 
the  idea  of  eliminating  the  “enemy”  as  a  group. 
Genocide,  on  the  other  hand,  is  a  species  of  total 
war:  It  is  aimed  at  a  particular  group,  most  of 
whom  are  unarmed,  and  the  intent  is  not  the  sub¬ 
duing  of  the  group  but  its  elimination,  if  not  in 
whole,  in  substantial  part. 

Massacre  also  can  be  part  of  how  genocide  is 
carried  out,  but  more  often  it  is  geared  toward 
revenge  or  repression,  not  annihilation.  It  tends 
also  to  be  episodic  and  isolated  rather  than  sus¬ 
tained  and  pervasive  as  in  genocide. 

The  term  ethnic  cleansing,  on  the  other  hand, 
has  gained  so  much  traction  since  the  war  and 


genocide  within  Bosnia  that  it  has  almost  displaced 
the  idea  of  genocide  or,  alternatively,  is  treated  as 
another  term  for  genocide.  In  practice,  ethnic 
cleansing  is  often  part  of  the  genocidal  process;  the 
term  is  also  a  euphemism  used  by  the  perpetrator 
to  disguise  what  is  actually  taking  place.  But  in 
theory,  ethnic  cleansing  means  using  violence, 
rape,  terror,  and  forced  deportation  to  drive  out  or 
remove  a  particular  group  from  a  common  terri¬ 
tory.  The  emphasis  is  on  creating  a  homogeneous 
society  through  fear  and  expulsion,  not  as  in  geno¬ 
cide  on  annihilating  the  group. 

There  are  legal  and  institutional  developments 
that  also  relate  to  the  status  of  genocide  within 
international  law.  In  1998,  the  Statute  of  Rome 
created  an  International  Criminal  Court,  with  the 
following  crimes  within  its  jurisdiction:  genocide, 
crimes  against  humanity,  war  crimes,  and  the 
crime  of  aggression.  The  very  term  crimes  against 
humanity  arose  out  of  the  Young  Turk  genocide  of 
the  Armenians,  when  the  Allies  in  May  1915 
warned  the  Ottoman  rulers  that  they  would  be 
held  accountable  and  tried  for  “crimes  against 
humanity  and  civilization.”  In  the  statute,  “crime 
against  humanity”  is  defined  in  terms  of  a  number 
of  particular  acts,  including  murder,  extermina¬ 
tion,  and  deportation  or  forcible  transfer  of  popu¬ 
lation  but  more  widely,  torture,  apartheid,  and 
“other  inhumane  acts  of  a  similar  character  inten¬ 
tionally  causing  great  suffering,  or  serious  injury  to 
body  or  to  mental  or  physical  health.”  Any  of 
these  acts  “when  committed  as  part  of  a  wide¬ 
spread  or  systematic  attack  directed  against  any 
civilian  population”  constitutes  a  crime  against 
humanity.  The  key  category  here  is  “civilian  popu¬ 
lation,”  not  particular  groups  as  indicated  in  the 
Genocide  Convention;  and  the  question  of  intent  is 
different  from  that  required  under  the  Convention: 
Here,  no  specific  intent  is  required. 

Though  one  can  differentiate  “genocide”  and 
“crimes  against  humanity”  in  important  respects, 
the  two  concepts  overlap  in  many  ways.  But  it  is 
much  easier  to  prosecute  actions  as  crimes  against 
humanity  than  as  genocide;  the  former  also  have  a 
wider  reach,  not  being  restricted  to  crimes  against 
particular  groups  named  in  the  Convention.  Some 
international  legal  scholars  suggest  that  with  the 
increased  emphasis  on  crimes  against  humanity, 
the  international  community  could  dispense  with 
the  Genocide  Convention.  Prosecution  of  crimes 
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against  humanity  would  thus  provide  alternatives 
to  the  conceptual  confusions  and  limitations  of  the 
Convention. 

Roger  W.  Smith 
College  of  William  and  Mary 
Williamsburg,  Virginia,  United  States 
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Geopolitics  denotes,  generically,  strategic  power 
struggles  among  states  over  the  political  control  of 
territories  and  resources.  More  specifically,  it  sug¬ 
gests  that  politics  is  primarily  defined  by  geo¬ 
graphical  location,  territorial  expansion,  and 
interstate  competition  over  finite  spaces.  The 
late- 19th-century  interimperial  rivalries  provided 


the  historical  context  for  the  rise  of  geopolitical 
thought  across  the  core  countries  of  the  imperial 
zone,  rearticulating  the  relation  between  eco¬ 
nomic  expansion  and  territorial  control.  The  final 
division  of  the  last  extra-European  vacant  spaces 
generated  an  acute  awareness  of  the  closure  of 
absolute  space  in  a  new  “planetary”  age.  Inter¬ 
imperial  relations  came  to  be  regarded  as  intensi¬ 
fied  zero-sum  conflicts  over  the  redivision  of  an 
occupied  planet.  This  closed  spatial  horizon 
prompted  a  reconceptualization  of  the  state  as  a 
territorial  phenomenon  in  space,  locked  into  a 
permanent  struggle  for  survival.  The  revaloriza¬ 
tion  and  politicization  of  geography  and  its 
absorption  into  a  reconceptualized  science  of  poli¬ 
tics  forged  the  new  field  of  political  geography — 
geopolitics’  direct  precursor.  Today,  particularly 
with  increasing  globalization,  geopolitics  is  more 
and  more  challenged  and  questioned:  Territory 
has  lost  its  classical  relevance,  as  transnational 
relationships  imply  a  new  vision  of  the  global 
world,  which  is  less  compatible  with  geopolitics. 
Critical  geography  points  out  the  ideological  and 
political  background  of  geopolitics. 

The  concept  of  geopolitics  was  invented  by  the 
Swedish  political  scientist  Johan  Rudolf  Kjellen 
(1864-1922)  and  developed  in  The  State  as  an 
Organism  (1917) — a  critique  of  the  prevailing 
legal  positivism  in  constitutional  and  international 
law.  The  radicalization  of  political  geography  into 
Geopolitik  as  a  specifically  German  theory  of 
international  politics  was  prosecuted — most  nota¬ 
bly  by  Karl  Haushofer  (1869-1946)  and  his  school 
and,  from  a  different  angle,  by  Carl  Schmitt 
(1888-1985) — during  the  revisionist  conflicts  over 
the  Versailles  Settlement  in  Weimar  Germany. 
Geopolitics  had  a  lasting,  though  not  exclusive, 
impact  on  Nazi  foreign  policy.  Intellectually,  it 
positioned  itself  as  a  counternarrative  against 
Marxist  theories  of  imperialism  (Rosa  Luxemburg, 
Nikolai  Bukharin,  Vladimir  Lenin)  and  against  the 
Wilsonian  liberal  internationalism,  represented  by 
Isaiah  Bowman  (1878-1950).  Basic  elements  of 
geopolitical  thought  infused,  if  in  a  semantically 
altered  and  politically  sanitized  form,  the  American 
Cold  War  discourse  of  neorealism  in  a  new  binary 
world-political  geography.  The  proliferating  but 
often  indiscriminate  contemporary  invocations  of 
the  term  geopolitics  reflected  the  shifting  post- 
Cold  War  strategic  geography — globalization, 
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multipolarity,  hegemony,  unipolarity,  empire — 
and  a  renewed  urgency  of  resource  politics  in  times 
of  natural  resource  depletion  and  environmental 
degradation.  Simultaneously,  the  new  approaches 
of  critical  and  Marxist  geopolitics  sought  to 
recover,  respectively,  the  cartographic  discourses 
that  shape  the  construction  of  geopolitics  and  the 
efficacy  of  geopolitical  conflicts  that  mediate 
European  and  worldwide  socioeconomic  and 
political  developments. 

Friedrich  Ratzel  (1844-1904),  a  professor  of 
political  geography  and  the  cofounder  of  the 
Alldeutscher  Verband  (Pan-German  League),  was 
cross-nationally  the  most  influential  figure  in  the 
development  of  the  geopolitical  tradition.  Applying 
ideas  from  biology,  evolutionism,  and  human 
geography,  he  contended  that  state  behavior — the 
expansion  and  contraction  of  states  in  space — is 
primarily  determined  by  geographical  properties 
and  geostrategic  location.  Space,  as  a  political  cat¬ 
egory,  is  no  longer  conceived  as  a  geometric,  neu¬ 
tral,  and  empty  expanse  (the  absolute  space  of 
nonhuman  geography)  but  as  a  concretely  ordered 
and  constructed  territory  (the  historical-relational 
space  of  political  geography).  Fusing  an  organic- 
biological  conception  of  the  state  with  principles  of 
Darwinian  natural  selection  in  interstate  relations, 
Ratzel  claimed  that  states  come  to  life,  grow,  and 
die  in  the  struggle  for  space.  Borders  are  temporary 
phenomena  in  a  pulsating  geopolitical  environ¬ 
ment.  Preoccupied  by  the  quest  for  a  “scientific” 
legitimation  of  Wilhelmine  Germany’s  expansion¬ 
ist  policies,  his  Politiscbe  Geographic  (1897) 
suggested  that  states  enjoy  a  natural  right  to  an 
adequate  Lebensraum  (living  space)  grounded  in 
variations  between  soil  fertility  and  population 
growth.  Population  “density  pressures,”  estab¬ 
lished  by  a  calculus  between  state  territory  and 
demography,  are  formalized  in  the  “laws  of  the 
spatial  growth  of  states.”  Differences  arise  in  the 
prewar  era  between  ethnocentric  and  geopolitical 
conceptions  of  states  and  the  goals  and  limits  of 
their  foreign  policies.  While  the  ethnocentric 
notion  defines  nations  as  cultural-linguistic  units 
[Volksnation),  leading  to  a  self-limiting  territorial 
correspondence  between  a  nation’s  area  of  settle¬ 
ment  and  the  scale  of  state  territory,  the  latter 
conception  (Staatsnation)  prioritizes  territorial 
aggrandizement  over  ethnic-racial  homogeneity, 
although  it  may  also  involve  an  active  policy  of 


ethnic  settlement  (Germanicization).  Volksnation 
and  Staatsnation  are  not  synonymous  and  consti¬ 
tute  rival  points  of  reference.  This  tension  between 
“race”  and  “space”  resurfaced  later  in  differences 
between  the  original  program  of  German  geopoli¬ 
tics  and  Adolf  Hitler’s  racial  ideology. 

Contemporaneously,  Alfred  T.  Mahan’s  (1840— 
1914)  anti-isolationist  navalism  and  Halford 
Mackinder’s  (1861-1947)  heartland  theory  formed 
the  core  of  political  geography  in  the  United  States 
and  Britain.  Admiral  Mahan’s  The  Influence  of  Sea 
Power  on  History,  1660-1783  (1890)  identified 
the  fleet-based  domination  of  seaborne  commerce 
as  the  decisive  factor  in  world  politics,  exemplified 
by  Britain’s  early  modern  oceanic  hegemony.  In  an 
age  of  pre-aerial  warfare  and  limited  mechaniza¬ 
tion  of  terrestrial  military  technology,  a  major 
battle  fleet  secures  a  mutually  reinforcing  and  self- 
sustaining  relation  between  naval  domination  and 
commercial  supremacy — the  geostrategic  axis  of 
history.  Mackinder  drew  the  opposite  conclusion 
in  his  lecture  The  Geographical  Pivot  of  History 
(1904).  Naval  power,  after  unifying  the  world  into 
a  closed  political  system  during  the  Columbian  age 
(1500-1900),  was  superseded  during  the  post- 
Columbian  age  by  land  power.  According  to 
Mackinder’s  trizonal  global  geography,  a  geostra- 
tegically  immune  inner-Eurasian  heartland — the 
pivot  of  history — is  surrounded  by  an  inner  cres¬ 
cent  (the  European  periphery,  Near/Middle  East, 
India,  China),  enclosed,  in  turn,  by  an  outer  cres¬ 
cent  (England,  the  Americas,  Africa,  Australia, 
Oceania,  Japan).  The  contemporary  global  balance 
between  land  and  sea  power  was  decisively  altered 
by  the  infrastructural  penetration  of  the  pivot  area 
(Central  Asia,  Russia)  through  the  construction  of 
transcontinental  railway  networks.  The  future 
belonged  to  a  German-Russian  alliance,  strategi¬ 
cally  invulnerable  in  its  territorial  heartland  against 
sea  power.  Only  a  grand  coalition  of  sea  powers 
could  prevent  Eurasian  world  domination.  Both 
conceptions  had  to  be  situated  in  the  transition 
from  cosmopolitan  free  trade  imperialism  to  pro¬ 
tectionist  neomercantilism. 

During  the  passage  from  the  19th-century 
European  pentarchy  to  the  new  constellation  of 
world  politics,  flanked  by  the  rise  of  the  United 
States  and  the  former  USSR,  political  geography 
turned  into  German  geopolitics.  Kjellen’s  antile- 
galistic  The  State  as  an  Organism  contrasted  the 
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norm-oriented  concept  of  the  state  as  a  legal  sub¬ 
ject,  most  notably  represented  in  Weimar  Germany 
by  Hans  Kelsen,  with  an  “empirical  viewpoint”  of 
the  state  as  a  geographic  organism  commanding 
space.  Foreign  policy  was  elevated  to  an  existen¬ 
tial  condition.  Geopolitics  was  redefined  as  an 
auxiliary  science  of  statecraft  in  the  international 
struggle  for  survival.  Ratzel’s  and  Kjellen’s 
thoughts  received  a  wide  and  enthusiastic  recep¬ 
tion  in  Weimar  Germany  during  the  revanchist 
intellectual  struggles  against  the  Versailles  Diktat 
and  as  a  key  legitimation  for  national-socialist 
Groflraumpolitik.  The  rise  of  the  geopolitical  tra¬ 
dition  in  the  1920s  was  marked  by  the  foundation 
of  the  Institute  of  Geopolitics  in  Munich  (1922), 
the  Geopolitische  Seminar  at  the  Deutsche 
Hochschule  fiir  Politik  (1924)  in  Berlin,  and  the 
launch  of  the  high-circulation  Journal  of 
Geopolitics  (1924).  Haushofer,  who  was  profes¬ 
sor  of  geography  at  the  University  of  Munich 
(1919-1939),  director  of  the  Institute  of  Geo¬ 
politics,  president  of  the  German  Academy  (1934- 
1945),  and  a  close  confidant  of  Rudolf  Hess, 
emerged  as  the  most  influential  figure  in  the 
German  discourse.  Its  intellectual  core  was  pre¬ 
mised  on 

1.  an  organic-biological  conception  of  the  state 
and  a  Darwinian  view  of  interstate  rivalry; 

2.  a  critique  of  Western,  liberal,  and  “mechanistic” 
conceptions  of  state  and  society,  contrasted  with 
the  volkisch  “ideas  of  1914”  that  recast  states 
as  expressions  of  culturally  defined  and 
homogeneous  peoples,  organically  rooted  in 
specific  territories; 

3.  a  revaluation  of  agriculture,  mystified  as 
“chthonic”  and  “organic,”  coupled  with  a  neo- 
Malthusian  understanding  of  population 
growth,  leading  to  demands  of  an  autarchic 
Lebensraum-, 

4.  the  rejection  of  international  law  (League  of 
Nations)  and  its  replacement  by  a  new 
Grofiraumordnung  (Grand  Spatial  Order); 

5.  a  geostrategic  determinism  premised  on  the 
dualism  between  land  power  and  sea  power, 
informing  the  programmatic  policy  prescriptions 
for  a  Eurasian  power  bloc  under  German 
leadership; 


6.  the  promotion  of  geography  as  the  key  science 
of  statecraft  and  the  dissemination  of 
“suggestive  cartography”  for  the  collective 
preparation  of  the  nation  in  its  struggle  for 
survival;  and 

7.  the  combination  and  political  radicalization  of 
these  elements  in  Anglophobia  and  anti-Semitism. 

Geopolitics  matured  from  1933  onward  into  the 
official  German  science  of  the  state.  Scientific  pre¬ 
tense  and  normative  orientation  blended  into  a 
highly  politicized  literature.  Attacking  the  “liberal” 
system  of  European  Kleinstaaten  (ministates),  the 
juridicization  of  international  politics  and  the  fix¬ 
ing  of  borders,  the  geopoliticians  invoked  during 
the  1930s  the  “law  of  growing  spaces.”  Buoyed  by 
the  first  Nazi  foreign  policy  successes,  the  new  spa¬ 
tial  order  envisaged  no  longer  sovereign  nation¬ 
states  as  the  units  of  the  international  system — the 
defunct  Westphalian  System — but  a  plurality  of 
Grofiraume  (Pan-regions).  Schmitt,  although  insti¬ 
tutionally  unconnected  to  the  geopolitical  school, 
emerged  as  the  sharpest  critic  of  the  League  of 
Nations  project  and  advocate  of  a  new  Nomos  of 
the  Earth  (1950).  He  argued  that  all  conceptions 
of  international  law  are  ultimately  grounded  in 
metajuridical  foundations — the  power  projections 
and  land  appropriations  that  underlie  state  for¬ 
mations  and  expansion,  exemplified  in  relation  to 
the  Spanish  and  English  conquests  of  the  Americas. 
Anglo-American  international  law  was  based  on  a 
structural  correspondence  tying  the  postwar 
promotion  of  liberal-constitutional  Kleinstaaten 
on  the  European  continent  and  beyond  to  their 
capitalist  penetration  by  private  economic  forces 
and  integration  into  the  world  market.  Their 
incorporation  into  the  League  obliterated  their 
political  essence — the  right  to  define  an  enemy 
and  to  wage  war — due  to  the  turn  toward  a  dis¬ 
criminatory  concept  of  war.  The  apolitical  “space¬ 
less  universalism”  of  universal  law — the  flatten¬ 
ing  of  differentially  organized  polities  and  their 
submission  to  common  legal  principles — sanc¬ 
tioned  a  constitutive  dualism  between  the  prolif¬ 
eration  of  liberal-constitutional  states  and  the 
expansion  of  a  borderless  private  world  market. 
This  strategy  represented  a  concrete  political  proj¬ 
ect  of  global  domination — the  rationalization  of 
global  space  driven  by  a  nonterritorial  capitalist 
imperialism  for  American  Lebensraum. 
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The  architect  behind  this  project  was  Bowman, 
director  of  the  American  Geographical  Society 
(1915-1935),  key  adviser  to  Woodrow  Wilson  at 
Versailles  and,  later,  to  Franklin  D.  Roosevelt. 
Wilson’s  moralistic  liberal  internationalism  was 
commensurate  with  the  American  national  inter¬ 
est  in  an  attempt  to  move  international  order 
“beyond  geography.”  Simultaneously,  Schmitt 
contended  that  U.S.  hemispheric  policy  toward 
the  Pacific,  Middle,  and  South  America  (Monroe 
Doctrine,  1823)  and  the  British  Empire  consti¬ 
tuted  legal  precedents  for  his  multi -Grofiraume 
vision.  The  Monroe  Doctrine  manifested  Anglo- 
American  double  standards  in  international  law 
and  politics.  Against  this  background,  Schmitt  set 
out  the  historico-legal  argument  for  the  pluraliza- 
tion  and  regionalization  of  diverse,  coexisting, 
and  mutually  exclusive  legal  spheres.  Gro/?- 
raumwirtschaft  (Grand  Economic  Order)  and 
Groflraum — not  a  biological,  but  a  political- 
juridical  category — merged  into  a  supranational 
hegemonic  order  in  Central  and  Eastern  Europe 
under  German  leadership,  located  between  the 
Soviet  bloc  and  the  Western  powers’  spheres  of 
interest.  Schmitt  remained  vague  on  the  inner 
structure  of  Pan-regions  and  silent  on  inter- 
Groflraum  relations,  excepting  the  principle  of 
nonintervention  for  foreign  powers.  Similarly, 
Haushofer  pleaded  with  reference  to  Mackinder’s 
heartland  theory  for  the  construction  of  a  conti¬ 
nental  bloc,  comprising  Germany,  the  former 
E1SSR,  and  Japan.  This  conception  informed  the 
German-Japanese  Anticomintern  Pact  (1936),  the 
German-Soviet  Non-Aggression  Pact  (1939),  and 
the  division  of  Poland.  The  contradiction  between 
Haushofer’s  anti-Bolshevism  and  his  demands  for 
an  Eurasian  bloc  remains  irresolvable.  During  the 
late  1930s,  the  term  geopolitics  became  recharged 
with  volkisch  terms  and  was  finally  subordinated 
to  racial  policies. 
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Globalization 


Globalization  refers  to  the  integration  of  separate 
nations,  regions,  or  even  individuals  into  a  wider 
global  system.  It  is  characterized  by  increasing  the 
linkages  and  connections  between  peoples  and 
countries  and  by  the  growing  knowledge  of  these 
interactions.  This  integration  process  can  affect 
the  economy,  polity,  society,  or  culture;  that  is, 
the  definition  of  the  process  is  very  general  and 
can  refer  to  economic,  political,  social,  or  cultural 
integration.  The  question  of  who  is  being  inte¬ 
grated  is  also  left  open,  although  in  political  sci¬ 
ence  it  is  mostly  the  countries  themselves  that  are 
at  issue.  The  way  in  which  such  integration  oper¬ 
ates  is  also  variable.  The  most  common  form  of 
globalization  in  the  literature  involves  the  eco¬ 
nomic  integration  of  national  economies  into  a 
wider  global  one,  usually  through  international 
trade,  capital  flows,  labor  movements,  and  global 
production  networks.  For  some  authors,  this 
means  the  integration  of  countries  into  a  global 
capitalist  system — that  is,  one  based  on  private 
property,  limited  government  intervention,  and 
the  use  of  markets  to  allocate  economic  value. 
Some  term  this  neoliberal  globalization.  Cap¬ 
italism,  international  capitalist  groups,  and  neo¬ 
liberal  practices  all  spread  throughout  the  world 
as  globalization  proceeds.  Whether  this  is  good  or 
bad  for  countries  and  individuals  greatly  depends 
on  a  scholar’s  perspective.  Some  associate  global¬ 
ization  with  the  homogenization  of  distinct 
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groups;  one  example  is  the  loss  of  many  local 
languages  and  the  increasing  diffusion  of  the  use 
of  English.  Others  associate  it  with  the  domina¬ 
tion  of  American  (or  Anglo-American)  values, 
beliefs,  and  practices.  The  global  turn  to  liberal 
democracy  is  often  cited  as  an  example. 
Globalization  is  not  complete  at  this  point. 
Countries  have  different  levels  of  integration  into 
the  world  system.  The  extent  to  which  a  truly 
global  system — whether  economic,  political,  or 
social — exists  is  a  point  of  debate.  The  causes  and 
consequences  of  globalization  are  also  debated. 

Globalization  has  both  secular  and  cyclical 
aspects.  For  some,  the  roots  of  globalization  are  in 
technological  changes,  such  as  the  lowering  of  the 
costs  of  communications  and  transportation,  which 
make  the  movement  of  people,  ideas,  goods,  ser¬ 
vices,  and  capital  faster  and  less  costly.  These 
changes  are  irreversible  and  often  increase  over 
time.  While  countries  and  their  governments  may 
try  to  put  up  barriers  against  such  changes,  it  is  very 
difficult  for  them  to  successfully  fend  off  the  pres¬ 
sures  of  globalization.  In  this  view,  globalization  has 
been  increasing  steadily  over  time  as  technological 
change  makes  the  world  ever  smaller.  Sometimes 
technological  change  is  treated  as  exogenous — that 
is,  it  has  sources  other  than  countries’  policies  or 
global  integration.  Other  times,  globalization  itself, 
especially  via  trade  of  capital  flows,  is  seen  as  having 
an  impact  on  technology;  it  usually  is  conceived  of 
as  helping  induce  innovation  and  speeding  its  trans¬ 
fer  around  the  world.  Globalization  and  technologi¬ 
cal  change  may  then  go  hand  in  hand,  each  encour¬ 
aging  the  other. 

On  the  other  hand,  globalization  has  been 
viewed  as  cyclical;  high  levels  of  global  integration 
have  been  associated  with  two  distinct  periods  in 
recent  history.  In  the  mid-  to  late  1 9th  century,  a 
global  economy  emerged  as  the  European  powers, 
especially  the  British,  forged  an  open  economy 
using  both  military  power  and  economic  policy. 
The  development  of  colonies  around  the  globe  and 
the  extension  of  European  trade,  capital,  values, 
and  political  power  undergirded  this  period  of  glo¬ 
balization.  In  the  late  1 9th  century,  the  world 
economy  attained  levels  of  openness  that  had 
never  been  seen  before.  In  some  areas  such  as  the 
movement  of  labor,  such  levels  of  integration  have 
never  been  achieved  since.  This  open  economy  col¬ 
lapsed  in  the  early  20th  century  with  the  two 


World  Wars  and  the  Great  Depression.  From  1914 
to  1945,  globalization  was  in  retreat.  Protectionism 
became  the  dominant  trade  policy  for  many  coun¬ 
tries;  trade  and  currency  blocs  formed  regionally, 
often  around  one  powerful  country,  and  military 
conflict  was  prevalent.  Nationalism,  isolationism, 
and  regionalism  marked  this  30-year  period. 
Globalization  thus  declined  during  this  period. 

However,  after  1945  the  United  States  picked 
up  the  mantle  from  the  British  and  began  to  forge 
a  world  economy  through  the  creation  of  interna¬ 
tional  institutions,  an  open  economic  policy,  and 
an  internationalist  foreign  policy.  The  global  sys¬ 
tem,  however,  was  divided  into  at  least  two  blocs 
with  the  United  States  on  one  side  and  the  former 
Soviet  Union  on  the  other;  hence,  globalization 
would  always  be  limited  in  this  Cold  War  system. 
The  United  States  pressed  ahead  with  the  organiza¬ 
tion  of  an  open,  capitalist  economy  guided  by 
principles  embodied  in  various  international  insti¬ 
tutions.  It  worked  to  encourage  democratization  in 
certain  regions  and  then  to  include  democracies  in 
this  integration  process.  The  creation  of  the  General 
Agreement  on  Tariffs  and  Trade  (GATT)  and  the 
subsequent  development  of  the  European  Com¬ 
munity  started  a  process  of  trade  liberalization 
among  developed  countries.  The  Bretton  Woods 
monetary  system  supported  by  the  International 
Monetary  Fund  (IMF)  and  the  World  Bank  helped 
countries  establish  currency  convertibility  and  a 
fairly  stable  world  monetary  system  based  on  the 
dollar.  This  stability  provided  the  conditions  for 
the  development  of  private  international  capital 
markets,  and  over  time  these  markets  have  come  to 
dwarf  those  for  goods  and  services.  The  removal  of 
capital  controls  from  many  advanced  economies  in 
the  1970s  and  1980s,  after  the  termination  of  the 
Bretton  Woods  fixed-exchange  rate  system,  opened 
these  economies  even  more  to  global  markets. 
While  labor  flows  never  became  as  free  from  gov¬ 
ernment  intervention  as  those  of  capital  and 
goods,  largely  because  of  immigration  restrictions 
in  the  developed  world,  there  was  some  return  to 
the  greater  mobility  of  labor  that  had  prevailed  in 
the  late- 19th-century  period  of  high  globalization. 
The  extension  of  the  European  Union  (EU)  to  new 
sets  of  countries  and  new  issues  also  characterized 
this  integration  movement.  By  the  1980s,  most  of 
the  advanced  industrial  world,  except  the  former 
Soviet  Union  and  its  satellites,  had  joined  the 
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world  capitalist  economy.  Globalization  was  only 
partial  in  this  period. 

From  the  1980s  onward,  the  rest  of  the  world’s 
countries  began  opening  their  markets  and  borders 
even  more  and  joining  the  world  economy.  The 
end  of  the  Cold  War,  the  collapse  of  the  former 
Soviet  Union,  and  the  economic  reform  movement 
in  China  all  added  impetus  to  the  globalization 
process,  as  the  former  communist  countries  and 
China  all  began  to  integrate  their  economies  into 
the  global  capitalist  one.  In  the  1990s  and  early 
2000s,  the  world’s  developing  countries  began 
joining  this  world  economy  and  its  international 
institutions  in  large  numbers.  Most  countries  in  the 
world  now  belong  to  the  World  Trade  Organization 
(WTO),  the  IMF,  and  the  World  Bank.  Democracy 
was  also  spreading  throughout  much  of  the  world 
at  this  time.  Countries  also  increasingly  chose  to 
join  a  wide  variety  of  international  organizations 
and  treaty  systems.  Conventions  on  labor  stan¬ 
dards,  human  rights,  environmental  issues,  and 
other  areas  became  widely  adopted  by  countries 
around  the  world.  By  2009,  most  of  the  world’s 
countries  had  become  part  of  a  global  system  that 
had  become  increasingly  organized  around  a  series 
of  common  principles  and  values  governing  numer¬ 
ous  issues.  Economic  shocks,  such  as  the  1998 
Asian  financial  crisis  and  the  2008  financial  crisis 
begun  in  the  United  States,  spread  across  the  globe 
rapidly.  The  second  period  of  globalization  had 
arrived.  A  major  issue  is  to  what  extent  globaliza¬ 
tion  was  purposefully  constructed,  rather  than 
unintentionally  realized.  Whether  globalization  is 
reversible  or  not  is  also  at  issue. 

Globalization  has  been  studied  from  two  general 
perspectives.  On  the  one  hand,  the  causes  of  global¬ 
ization  have  attracted  much  attention.  Why  has 
there  been  a  movement  toward  increasing  contact 
and  interaction  between  states  and  peoples  all  over 
the  globe?  Why  has  this  occurred  more  at  certain 
times  than  others?  Why  have  some  regions  or  peo¬ 
ples  been  more  involved  in  this  process  than  oth¬ 
ers?  Why  have  certain  aspects  of  globalization  such 
as  international  trade  moved  faster  than  others? 
Many  questions  have  been  raised  about  globaliza¬ 
tion  as  an  outcome  that  in  itself  needs  explanation. 
On  the  other  hand,  the  impact  of  globalization  has 
also  received  great  attention.  What  effects  does  this 
process  of  increasing  contact  and  interaction  among 
states  and  peoples  have  on  the  economy,  politics, 


society,  and  culture?  Are  these  effects  large  or 
small?  Negative  or  positive?  Delineating  the  myriad 
effects  that  globalization  might  have  and  develop¬ 
ing  evidence  for  these  has  been  central  to  the  litera¬ 
ture.  Once  we  understand  more  about  globaliza¬ 
tion,  we  can  then  address  the  question  of  how  to 
manage  the  process  and  its  effects. 

The  Causes  of  Globalization 

A  question  of  much  import  has  been  how  globaliza¬ 
tion  has  developed,  and  why  it  has  developed  the 
way  it  has.  If  one  adopts  the  secular  view  of  global¬ 
ization,  then  its  advance  can  be  explained  by  how 
fast  technological  change  occurs  and  spreads.  In 
periods  where  such  change  is  very  rapid  and  when 
the  adoption  of  new  ideas  and  techniques  is  quick, 
globalization  will  progress  rapidly.  Many  seem  to 
feel  that  this  was  the  case  in  the  mid-  and  late  19th 
century,  when  critical  innovations  such  as  steam 
power,  railroads,  and  the  telegraph  changed  politi¬ 
cal,  social,  and  economic  life  around  the  globe. 
Spread  in  part  by  military  competition  and  imperial 
control,  these  technologies  helped  integrate  distant 
regions  of  the  world.  Technological  change  did  not 
stop  in  the  interwar  period  when  globalization  was 
in  retreat;  indeed,  war  tends  to  spur  such  innova¬ 
tion.  But  the  spread  of  such  technologies  in  this 
period  was  much  more  limited  since  many  borders 
were  closed  and  countries  guarded  their  new 
secrets,  which  were  often  tied  to  war  fighting.  The 
development  of  nuclear  weapons  is  a  case  in  point. 
The  period  after  World  War  II  has  again  been 
viewed  as  one  when  rapid  technological  change 
was  fostered  by  governments,  when  the  spread  of 
new  processes  and  products  was  unhindered,  and 
when  the  adoption  of  innovations  at  least  in  the 
noncommunist  countries  was  embraced  by  states. 
Globalization  thus  sped  up  after  1950  because 
technological  change  accelerated. 

It  is  important  to  note  that  technological  change 
is  not  just  about  the  economy;  such  changes  have 
major  ramifications  for  society,  culture,  and  the 
polity.  Electricity,  the  telephone  (not  to  mention  the 
mobile  phone),  radio,  television,  refrigerated  ship¬ 
ping  containers,  automobiles,  airplanes,  and  now 
the  Internet — to  name  just  a  few  of  the  major  inno¬ 
vations  of  the  last  century  and  a  half — have  affected 
social,  political,  and  cultural  life  in  enormous  ways. 
They  have  all  contributed  to  globalization  by 
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reducing  the  cost  of  transportation  and  communi¬ 
cation  among  individuals  across  the  globe.  In  this 
secular  story  of  global  integration,  countries  can 
try  to  impede  innovation  and  slow  its  spread,  but 
they  cannot  ultimately  halt  such  change.  To  the 
extent  that  technology  reduces  transportation  and 
communication  costs,  technological  innovation 
provides  an  irrepressible  force  for  globalization. 
The  pace  of  globalization  may  be  affected  but  not 
its  forward  movement. 

In  the  cyclical  view  of  globalization,  other  factors 
are  often  cited  as  being  of  great  importance  to  the 
changes  we  see.  One  theory  focuses  attention  on  the 
salience  of  having  one  country  lead  the  globaliza¬ 
tion  process.  Hegemonic  stability  theory  (HST), 
developed  in  the  1970s  and  1980s,  argued  that  a 
single  world  leader  was  necessary  for  an  open,  sta¬ 
ble  world  economy.  Without  such  a  leader,  even 
technological  change  would  not  bring  globaliza¬ 
tion.  Two  versions  of  this  theory  exist.  In  one,  the 
hegemon,  who  has  overwhelming  capabilities  in  all 
areas  relative  to  other  countries,  exercises  a  benign 
influence  over  the  rest  of  the  world  by  organizing 
and  paying  the  costs  for  such  openness  and  stabil¬ 
ity.  It  plays  the  role  of  leader  and  others  benefit 
from  its  efforts.  The  United  Kingdom  (UK)  and  the 
United  States  are  cited  as  two  examples  of  this  in 
the  mid-  to  late  19th  and  mid-  to  late  20th  centu¬ 
ries,  respectively.  The  loss  of  British  leadership 
capacity  and  the  failure  of  the  United  States  to  exert 
leadership  in  the  interwar  period  were  seen  as  the 
leading  causes  of  the  closure  and  turmoil  of  that 
period.  Without  such  benign  leadership,  the  world 
would  not  be  able  to  integrate  and  countries  would 
remain  in  an  isolationist  stance  trying  to  beggar 
their  neighbors.  The  hegemon  solves  the  problem  of 
trust  and  competition  for  the  smaller  states  and  thus 
allows  everyone  to  emerge  better  off  through  its 
provision  of  global  public  goods.  It  acts  as  lender  of 
last  resort  and  tries  to  provide  an  open  market  in 
difficult  economic  times.  By  creating  international 
institutions  to  help  countries  in  difficult  times,  the 
hegemon  can  also  promote  stability  and  bind  itself 
to  reassure  other  countries  of  its  benign  motives. 
Globalization  requires  leadership  from  the  world’s 
strongest  power.  Some  note  that  perhaps  a  small 
group  of  powerful  countries  could  provide  this 
same  leadership,  but  concerns  about  disagreement 
or  buck  passing  within  this  group  have  shed  doubt 
on  this  view. 


In  a  second  view,  the  role  of  the  hegemon  is  not 
seen  as  selfless.  Here  the  global  leader  seeks  to 
maximize  its  own  interests  and  uses  its  superior 
resources  to  bribe  and  coerce  others  into  following 
it.  British  and  American  hegemony  is  seen  as  a 
coercive  process  by  which  the  country  imposes  its 
will  on  others;  the  hegemon  gains,  while  everyone 
else  submits.  In  the  British  case,  imperialism  com¬ 
bined  with  open  markets  and  the  dominant  role  of 
the  pound  helped  it  induce  others  open  their  mar¬ 
kets  and  join  the  international  trading  and  finan¬ 
cial  system.  Even  international  law  as  developed  in 
this  period  served  to  enable  globalization  as  the 
European  powers  used  it  to  force  open  the  colo¬ 
nies.  British  domination  ran  into  increasing  diffi¬ 
culty,  however,  as  other  countries  grew  and 
became  competitors.  Once  Britain  could  no  longer 
dictate  the  terms  of  engagement  with  the  world 
economy,  regional  blocs  developed  around  the 
strongest  powers.  Britain  could  no  longer  force 
countries,  including  the  United  States  and  Germany, 
to  follow  its  preferred  practices,  such  as  joining  the 
sterling  system  or  opening  their  markets  to  inter¬ 
national  trade.  The  decline  of  British  hegemony 
then  ended  the  period  of  globalization  as  powerful 
rivals  developed  their  own  spheres  of  influence. 

In  the  case  of  the  United  States,  its  dominance  in 
all  domains  after  World  War  II  helped  it  establish 
a  system  of  international  governance  that  has 
maintained  American  hegemony  for  over  60  years. 
Unlike  the  British  earlier,  the  Americans  set  up  a 
series  of  international  institutions  to  embed  the 
U.S.  global  order.  The  GATT/WTO,  IMF,  World 
Bank,  and  regional  development  banks  have, 
among  others,  been  a  means  for  the  United  States 
to  exercise  its  domination  without  having  to  do  so 
directly.  These  institutions  have  pushed  countries 
to  adopt  capitalist  practices,  to  open  their  markets 
to  flows  of  goods,  services,  and  capital,  and  to  fol¬ 
low  the  so-called  Washington  Consensus  in  other 
economic  matters.  Allowing  or  encouraging  coun¬ 
tries  to  become  indebted  and  then  imposing  condi¬ 
tions  on  them  when  they  are  unable  to  service  their 
debts  has  allowed  international  institutions  to  open 
the  developing  world  to  the  global  economy.  More 
recently,  pressure  for  democratization  has  also 
come  from  these  institutions.  Thus,  the  American 
method  for  controlling  the  global  system  has  dif¬ 
fered  from  the  British  one,  but  it  has  also  involved 
coercing  others  to  join  the  global  economy.  The  use 
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of  soft  power  has  been  perhaps  more  evident  in  the 
case  of  the  United  States.  But  American  military 
intervention  around  the  world  has  been  plentiful, 
and  this  view  has  served  to  enforce  its  global 
vision.  In  this  view  of  coercive  hegemony,  first  the 
UK  and  then  the  United  States  gained  much  from 
their  exalted  position  at  the  center  of  the  global 
system;  ironically,  however,  in  both  cases  their 
decline  seems  to  be  connected  to  their  leadership 
role.  This  view  of  the  malign  hegemon  is  associ¬ 
ated  with  both  a  realist  and  a  Marxist  approach  to 
international  relations.  Substantial  debate  exists  as 
to  whether  the  coercive  or  benign  view  of  hege¬ 
mony  is  more  evident.  This  debate  includes  contro¬ 
versy  over  the  extent  to  which  the  countries  other 
than  the  hegemon  have  been  hurt  or  helped  by  this 
globalization  process. 

A  more  constructivist  account  of  the  globaliza¬ 
tion  process  would  focus  on  the  way  ideas  and 
norms  about  the  global  community  and  modernity 
shape  state  behavior.  It  would  identify  the  domi¬ 
nant  ideas  of  the  past  60  years  and  how  these  have 
been  legitimated  and  spread  throughout  the  sys¬ 
tem.  Beliefs  in  liberal  democracy,  human  rights, 
capitalism,  and  neoliberalism,  for  instance,  have 
diffused  across  the  globe;  these  institutions  and 
practices  have  become  legitimated  and  viewed  as 
the  “modern”  way  to  manage  a  country.  Countries 
thus  aspire  to  introduce  and  develop  them  in  order 
to  be  a  part  of  the  global  community.  International 
institutions  embody  and  diffuse  these  ideas,  as  do 
norm  entrepreneurs  from  global  civil  society.  The 
focal  effect  of  these  ideas,  however,  is  to  reduce 
diversity  and  contribute  to  homogenization. 
Scholars  are  often  divided  as  to  whether  this  nor¬ 
mative  globalization  is  beneficial  or  not.  Clearly, 
the  spread  of  some  ideas  (e.g.,  democracy  and 
human  rights)  is  seen  in  a  more  positive  light  than 
the  spread  of  others  by  some  scholars  (e.g.,  capital¬ 
ism  and  neoliberalism).  There  is  also  debate  over 
how  certain  ideas  come  to  be  dominant.  Is  it 
through  coercion  or  inducement  by  a  powerful 
country,  such  as  the  United  States,  that  others 
espouse  these  ideas?  Is  it  more  the  Zeitgeist  of  the 
time  that  makes  certain  ideas  fit  the  situation  bet¬ 
ter?  Or  is  it  the  impact  of  international  political 
entrepreneurs  who  give  legitimacy  and  moral  sua¬ 
sion  to  certain  views,  independent  of  countries? 

Another  reason  given  for  the  rise  of  globaliza¬ 
tion  is  much  less  intentional.  The  uncoordinated 


actions  of  countries  over  time  have  simply  resulted 
in  an  increasingly  open  and  interconnected  system. 
No  country  desired  such  a  result,  but  each  one  act¬ 
ing  on  its  own  best  interests  adopted  policies  that 
resulted  in  a  globalized  world.  This  view  has  sup¬ 
port  since  the  reasons  for  and  sequencing  of  many 
countries’  decisions  to  globalize  were  nationally 
distinctive.  The  advanced  industrial  countries  liber¬ 
alized  trade  and  then  their  capital  markets  back  in 
the  1960s  to  1980s.  The  EU  was  a  major  force  for 
movement  in  this  direction;  however,  European 
countries  joined  and  expanded  the  Union  for  rea¬ 
sons  other  than  globalization,  such  as  domestic 
politics  or  international  security.  China  then 
launched  its  economic  reform  program  in  the  late 
1970s  and  early  1980s  for  internal  reasons,  and 
political  reforms  have  not  been  forthcoming.  The 
former  communist  countries  only  moved  toward  a 
more  open  and  capitalist  economy  after  the  end  of 
the  Cold  War  and  the  demise  of  the  former  Soviet 
Union  in  1991,  and  for  many  of  them,  joining  the 
EU  had  important  domestic  political  and  interna¬ 
tional  security  motivations.  In  the  developing  world, 
democratization  often  came  before  economic  liber¬ 
alization.  Many  poor  countries  did  not  liberalize 
economically  until  the  late  1990s  and  later.  Indeed, 
the  most  striking  is  the  different  paths  that  countries 
have  taken  since  the  1970s.  The  manner  in  which 
and  the  degree  to  which  countries  are  integrated 
into  the  world  system  are  remarkably  varied.  While 
the  overall  trend  has  been  toward  greater  contacts 
and  openness,  there  has  been  substantial  diversity  in 
how  states  realized  this.  For  instance,  English  is  spo¬ 
ken  much  more  in  most  countries  today  than  30 
years  ago,  but  the  penetration  of  English  varies 
greatly  from  country  to  country.  In  this  view,  this 
uncoordinated  process  has  been  driven  by  different 
causes  in  each  country.  Globalization  has  developed 
differently  for  each;  and  the  fate  of  globalization  lies 
with  the  (somewhat  independent)  decisions  of  coun¬ 
tries.  The  reactions  to  the  global  financial  and  eco¬ 
nomic  crisis  of  2008  to  2009  underline  this  view 
since  governments  have  failed  to  coordinate  and 
have  instead  responded  distinctly. 

Consequences  of  Globalization 

In  addition  to  the  debates  over  the  causes  of  glo¬ 
balization,  there  is  much  debate  about  its  conse¬ 
quences.  There  are  at  least  three  distinct  views  in 
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the  literature:  (1)  Whereas  some  see  its  conse¬ 
quences  as  negative  in  all  aspects,  (2)  others  see  its 
effects  as  positive  on  the  whole,  and  (3)  still  others 
stand  firmly  in  the  middle  and  see  it  as  having  both 
costs  and  benefits.  Globalization  is  said  to  have 
many  different  effects.  The  approach  here  is  to 
look  at  a  number  of  those  effects  and  sketch  out 
the  different  opinions  scholars  hold  on  each.  At 
this  stage,  there  is  little  consensus  on  its  effects. 
Substantial  literatures  exist  on  whether  (and  how) 
globalization  has  affected  economic  growth  rates, 
poverty,  inequality,  democracy,  conflict,  state 
capacity,  policy  and  institutional  convergence,  cul¬ 
tural  diversity,  volatility  and  the  diffusion  of  crises, 
and  balance  of  power  between  capital  and  labor. 
Assessing  the  impact  of  globalization  implies  that 
one  must  hold  other  factors  constant,  which  is  a 
difficult  task.  Since  globalization  is  an  ongoing 
process,  definitive  answers  to  these  questions  can¬ 
not  be  given.  What  is  striking,  however,  is  the  wide 
range  of  views  and  evidence  that  exists  for  each  of 
these  outcomes. 

Economic  Growth 

Globalization  is  alleged  to  have  had  a  variety  of 
effects  on  economic  growth.  Growing  integration 
of  national  economics  into  a  wider  global  one  has 
usually  been  achieved  through  trade  liberalization 
and  the  pursuit  of  foreign  investment,  and  some¬ 
times  through  capital  market  liberalization.  The 
consequences  of  these  policies  for  economic 
growth  are  still  debated.  Some  research  supports 
the  idea  that  all  of  the  policies  increase  growth; 
there  is  probably  more  evidence  that  trade  and 
foreign  investment  support  growth  than  does 
capital  market  liberalization.  Research  also  shows 
evidence  that  countries  that  globalized  in  the  past 
30  years  grew  faster  than  those  that  did  not.  Many 
expected  that  the  integration  of  developing  coun¬ 
tries  into  the  world  economy  would  promote  eco¬ 
nomic  convergence — that  is,  it  would  make  the 
poorest  countries  grow  faster  and  hence  catch  up 
over  time  with  the  richest.  There  is  some  evidence 
that  this  has  occurred  in  parts  of  the  world — 
mainly  Asia — but  not  much  support  for  it  glob¬ 
ally.  The  North-South  divide  does  not  seem  to 
have  closed  much  over  the  past  few  decades.  In 
part,  this  is  because  the  North  has  had  faster 
growth  rates  than  some  developing  countries.  For 


a  wide  range  of  countries,  however,  globalization 
does  not  seem  to  have  fostered  faster  growth. 
Scholars  point  out  that  for  parts  of  Latin  America 
and  Africa  growth  rates  were  higher  in  the  decades 
before  the  1980s  than  after.  The  impact  of  trade 
on  growth  for  many  countries  has  also  been 
ambiguous;  some,  especially  in  Asia,  seem  to  have 
gained  from  it.  But  many  others,  in  Latin  America 
and  Africa,  for  instance,  do  not.  Foreign  direct 
investment  has  tended  to  flow  to  the  most  rapidly 
developing  economies  and  so  it  is  associated  with 
fast  growth,  but  this  may  not  be  causal.  Finally, 
there  is  much  skepticism  that  capital  market  liber¬ 
alization  for  developing  countries  is  or  has  been 
good  for  economic  growth.  Some  of  the  problems 
associated  with  globalization,  such  as  inequality 
and  volatility,  seem  to  have  affected  its  capacity  to 
deliver  growth. 

Poverty  and  Inequality 

A  large  debate  centers  on  the  impact  of  global¬ 
ization  on  poverty  and  inequality.  World  Bank 
data  suggest  that  from  the  early  1980s  to  roughly 
2005,  poverty  declined  globally.  The  number  of 
extreme  poor  (those  living  on  less  than  $1.25/day) 
decreased  from  1.9  billion  in  1981  to  1.4  billion  in 
2005,  which  is  equivalent  to  a  decrease  from  50% 
of  the  developing  countries’  population  to  25% 
thereof  (World  Bank  Annual  Report  2009 ,  p.  61). 
Also,  for  example,  in  the  East  Asia  and  Pacific 
regions,  the  percentage  living  on  less  than  $2  a  day 
fell  from  69%  in  1990  to  25%  in  2007. 
Nevertheless,  2.5  billion  people  in  the  world  still 
live  on  less  than  $2  a  day  (World  Bank  Annual 
Report  2008,  pp.  14,  34). 

Other  measures  of  the  quality  of  life  in  develop¬ 
ing  countries  have  also  improved:  Literacy  rates, 
life  expectancy,  and  infant  mortality,  for  example, 
have  ameliorated  in  most  regions  except  where 
HIV/AIDS  has  struck  hardest.  Was  this  decline  due 
to  globalization?  Some  countries  that  have  low¬ 
ered  trade  barriers  and  promoted  foreign  invest¬ 
ment,  such  as  China,  Taiwan,  and  Vietnam,  have 
experienced  significant  declines  in  poverty  in  the 
past  decades.  However,  even  for  these  countries,  it 
is  not  clear  that  within-country  inequality  has  been 
reduced;  the  growing  rural-urban  divide  in  many 
developing  countries  suggests  rising  inequality. 
Many  other  countries  have  seen  little  change  in 
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poverty  levels,  and  inequality  has  either  remained 
fairly  constant  or  risen. 

The  World  Bank  associates  greater  globaliza¬ 
tion  with  increased  growth,  declining  poverty,  and 
falling  inequality — especially  globally.  But  other 
data  call  this  optimistic  scenario  into  question.  A 
number  of  studies  of  trade  have  shown  that  its 
liberalization  in  developing  countries  does  not 
reduce  poverty  or  inequality.  The  so-called  skill 
bias  associated  with  trade  and  foreign  investment 
today  often  means  that  more  highly  skilled  work¬ 
ers  gain  more  from  international  integration  than 
do  low-skilled  ones,  which  tends  to  exacerbate  the 
degree  of  within-country  inequality.  Interestingly, 
inequality  seems  to  also  have  risen  recently  in  the 
developed  countries.  This  outcome  was  not  unex¬ 
pected.  Standard  models  of  trade  suggest  that  in 
rich  countries,  high-skilled  workers  should  gain 
the  most  from  globalization  and  low-skilled  ones 
should  be  the  losers;  this  is  the  opposite  of  what 
should  happen  in  the  poor  countries.  This  seems 
to  have  occurred  in  a  number  of  developed  coun¬ 
tries,  although  it  has  probably  been  tempered  by 
the  redistributive  effects  of  the  welfare  state. 
Inequality  has  thus  risen  in  a  number  of  developed 
countries. 

The  more  surprising  outcome  has  been  that  low- 
skilled  workers  in  the  developing  world  have  not 
done  better.  While  there  are  now  numerous  reasons 
articulated  for  this  result,  it  is  a  problem  for  pov¬ 
erty  reduction  and  equity  in  the  developing  world, 
especially  since  these  countries  by  and  large  do  not 
have  well-developed  welfare  states.  The  impact  of 
financial  crises  and  their  diffusion  in  a  global  sys¬ 
tem  are  also  concerns  with  regard  to  poverty  and 
inequality.  Some  argue  that  such  crises  are  more 
likely,  more  intense,  and  spread  more  broadly  in  a 
globalized  world.  Others  point  out  that  with  an 
open  economy  domestic  and  international  shocks 
may  balance  each  other  out  and  actually  make  the 
system  more  stable.  There  is  little  doubt,  however, 
that  these  financial  shocks  can  have  negative  conse¬ 
quences  for  countries.  The  Asian  financial  crisis  of 
the  late  1990s  and  the  recent  global  financial  crisis 
have  both  increased  poverty  and  inequality.  How 
governments  shape  the  globalization  process  in 
their  countries  seems  to  be  important  in  influencing 
the  way  globalization  affects  poverty  and  inequal¬ 
ity.  This  suggests  that  countries  will  have  quite  dif¬ 
ferent  experiences  with  globalization. 


Politics  and  Democratization 

Globalization  has  also  been  credited  with  hav¬ 
ing  effects  on  politics.  Some  argue  that  it  has 
helped  spread  democracy  and  put  pressure  on 
leaders  to  democratize.  Democracy  has  certainly 
increased  globally  since  the  1980s.  But  has  this 
been  due  to  globalization?  Globalization  through 
an  open  economy  or  through  international  pres¬ 
sure  generated  by  international  institutions  or 
norm  entrepreneurs  may  induce  leaders  to  adopt 
more  democratic  forms  of  governance  if  they  want 
to  be  members  of  the  international  system.  Some 
argue  the  reverse:  Democratization  was  necessary 
for  the  change  in  policies  in  many  countries  that 
led  to  greater  globalization.  The  causal  connection 
between  democratization  and  globalization  is 
much  debated.  There  is  also  the  difficult  question 
of  whether  democracy  and  globalization  are  com¬ 
patible.  Political  regimes  practice  democracy 
within  their  boundaries,  and  publics  see  their  gov¬ 
ernments  as  responsible  for  the  outcomes  they 
experience.  But  in  a  globalized  world  both  of  these 
may  be  compromised.  Democracy  may  have  little 
meaning  if  the  most  important  outcomes  result 
from  global  forces  and  if  governments  can  do  little 
to  affect  these  forces.  The  supposed  democratic 
deficit  in  the  EU  is  one  manifestation  of  this  prob¬ 
lem.  The  growing  number  of  international  institu¬ 
tions  may  also  be  a  related  concern.  Many  of  these 
institutions  are  established  to  facilitate  interna¬ 
tional  cooperation  and  prevent  countries  from 
pursuing  beggar-thy-neighbor  policies.  But  if  they 
take  decision-making  power  away  from  govern¬ 
ments,  they  may  undermine  democracy  at  home 
and  the  public’s  faith  in  it.  Others  have  argued  that 
democracy  may  be  enhanced  by  such  international 
institutions.  The  spread  of  democracy  and  this 
wave  of  globalization  seem  to  have  gone  hand  in 
hand;  how  countries  react  over  time  to  a  highly 
globalized  world  may  or  may  not  be  propitious  for 
democracy. 

Conflict 

Globalization  may  have  an  impact  on  conflict 
as  well.  The  frequency  of  international  wars  has 
declined  steadily  since  the  World  Wars,  but  civil 
wars  had  risen  in  frequency  until  recently.  Some 
research  suggests  that  aspects  of  globalization  can 
reduce  conflict,  especially  interstate  war.  Research 
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has  shown  that  increased  trade  among  countries, 
increased  foreign  investment,  and  trade  agree¬ 
ments  are  all  associated  with  less  international 
conflict.  The  so-called  international  capitalist 
peace  is  one  example  of  this  argument.  International 
institutions  also  seem  to  have  a  similar  relation¬ 
ship:  The  more  of  them  a  country  joins,  the  less 
likely  it  is  to  get  into  military  conflicts.  There  do 
not  seem  to  be  similar  connections  for  civil  war. 
On  the  other  side  of  the  ledger,  however,  as  coun¬ 
tries  trade  and  invest  more  with  each  other,  they 
become  more  likely  to  have  trade  and  investment 
disputes.  Over  time  these  can  exacerbate,  if  not 
create,  conflicts.  Furthermore,  as  distant  countries 
and  peoples  are  brought  into  ever  closer  contact, 
the  potential  for  misunderstandings  and  disputes 
rises  as  their  different  cultures,  values,  and  beliefs 
clash;  the  so-called  clash  of  civilizations  is  more 
likely  in  a  highly  interdependent  world.  As  their 
interdependence  increases,  countries  also  become 
more  vulnerable  to  the  actions  of  other  states. 
International  economic  ties  have  the  potential  to 
be  used  as  political  leverage.  An  interesting  exam¬ 
ple  is  the  case  of  China  and  the  United  States  in  the 
early  2000s.  The  growth  of  trade  and  investment 
ties  between  the  two  countries  has  resulted  in  very 
large  and  imbalanced  flows  of  capital  and  trade 
from  China  to  the  United  States;  these  economic 
linkages  can  serve  as  potential  political  levers  for 
both  sides.  The  relationship  has  become  much 
more  complex  and  fraught  with  political  stress. 
Some  think  that  these  ties  will  dampen  any  pro¬ 
pensity  to  engage  militarily;  others  see  them  as 
potential  sparks  for  a  future  conflict.  Globalization 
may  reduce  tendencies  toward  conflict  by  making 
it  more  costly  since  countries  will  have  to  break 
their  economic  ties  if  they  fight;  or  it  may  induce 
conflict  as  they  have  more  and  more  linkages  that 
can  lead  to  disputes. 

State  Capacity  to  Govern  and 
International  Cooperation 

Questions  have  arisen  about  the  impact  of  glo¬ 
balization  on  state  capacity.  That  is,  does  global¬ 
ization  weaken  countries  and  erode  their  capacity 
to  govern?  States  are  often  judged  by  their  capacity 
to  manage  their  economies,  to  respond  to  crises, 
and  to  provide  public  goods  such  as  health  care, 
national  defense,  and  education.  As  they  become 


increasingly  intertwined  in  a  global  economy, 
states  may  lose  the  ability  to  manage  their  econo¬ 
mies.  The  smaller  the  national  economy  relative  to 
the  world  economy,  the  more  likely  is  this  out¬ 
come.  Countries  will  experience  the  externalities 
of  other  countries’  policies  but  will  often  be  unable 
to  manage  these  negative  consequences.  Losing 
control  over  monetary  policy,  especially  if  a  coun¬ 
try  opens  its  capital  account  and  lets  its  currency 
float,  is  a  well-known  example  of  this.  Tax 
competition  among  states  in  a  globalized  world  is 
another  concern,  since  taxes  provide  the  resources 
for  governments  to  provide  public  goods.  The 
race-to-the-bottom  phenomenon  in  general  that  is 
associated  with  globalization  can  signal  a  coun¬ 
try’s  increasing  loss  of  control  over  its  economy 
and  perhaps  its  polity.  As  noted  above,  for  democ¬ 
racies  this  loss  of  control  can  be  especially  worri¬ 
some  as  it  may  induce  a  loss  of  public  faith  in  the 
government  and  in  democracy  overall. 

On  the  other  hand,  international  cooperation 
and  international  institutions  may  help  states  alle¬ 
viate  the  problems  associated  with  globalization. 
State  behavior  can  create  negative  externalities  for 
others  even  in  a  system  that  is  not  very  globalized. 
But  without  globalization  it  may  be  very  hard  for 
states  to  cooperate  to  address  these  externalities. 
Globalization  may  awaken  states  to  the  need  for 
organized  cooperation.  It  may  induce  the  creation 
of  international  institutions  that  help  states  deal 
with  these  externalities  and  thus  provide  greater 
political  capacity  than  otherwise.  One  can  think  of 
the  EU  in  this  light.  The  states  within  the  EU  may 
now  have  greater  capacity  as  a  group  to  affect  their 
economies  than  they  did  before  joining  the  EU. 
Individually  they  might  not  be  able  to  manage  the 
pressures  of  globalization  but  with  an  institution¬ 
alized  cooperative  regime  they  may  be  much  better 
able  to  do  so.  The  recent  interest  of  countries  such 
as  Iceland  in  joining  the  EU  after  the  2008  finan¬ 
cial  crisis  suggests  such  an  outcome.  Globalization 
no  doubt  creates  or  exacerbates  problems  that  do 
not  respect  national  borders,  but  it  may  also  con¬ 
tain  pressures  that  allow  countries  to  better  respond 
to  those  problems. 

Convergence  Among  States 

Globalization  has  also  been  associated  with 
policy  and  institutional  convergence  among  states. 
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Some  have  suggested  that  as  countries  open  their 
economies  they  become  more  likely  to  adopt  simi¬ 
lar  policies  and  institutions.  This  convergence  pro¬ 
cess  can  be  driven  by  different  pressures.  Some 
attribute  it  to  the  power  of  the  world’s  hegemon; 
some,  to  the  increased  vulnerability  of  states  to  the 
pressures  of  international  institutions;  some,  to 
increased  competition  among  states;  others,  to  a 
learning  process  undertaken  by  countries.  Whatever 
the  cause,  this  convergence  seems  to  have  occurred 
in  economic  policy  and  institutions.  The  wide¬ 
spread  adoption  of  neoliberal  policies,  or  the 
so-called  Washington  Consensus,  has  been  notable 
since  the  late  1980s.  Other  examples  are  the  deci¬ 
sions  by  many  countries  to  create  independent 
central  banks  and  to  allow  their  currencies  to  float. 
This  process  has  been  remarked  in  other  areas, 
from  the  creation  of  bureaucracies  to  deal  with  sci¬ 
ence  and  technology  issues  to  the  turn  to  democ¬ 
racy  itself.  Globalization  may  narrow  the  choices 
that  states  have  and  pressure  them  to  adopt  similar 
institutional  forms  and  practices.  To  the  extent 
that  these  forms  and  practices  result  in  better  out¬ 
comes,  this  process  may  be  welcomed.  However,  if 
they  are  not  productive  for  states,  then  this  conver¬ 
gence  process  may  be  a  negative  for  governments 
everywhere. 

When  this  convergence  pressure  extends  to  cul¬ 
tural  and  social  life,  it  is  often  seen  as  a  negative 
force.  Losing  the  cultural  and  social  diversity  that 
are  associated  with  distinct  regions,  ethnic  groups, 
languages,  and  nations  is  an  often  remarked  effect 
of  globalization.  The  disappearance  of  many  lan¬ 
guages  and  the  increasing  use  of  English  are  two 
such  examples.  For  some,  this  is  a  benefit  since 
transaction  costs  are  greatly  reduced  if  all  people 
speak  the  same  language,  and  the  dream  of  a 
common  global  language  has  a  long  history.  The 
pressure  on  countries  or  peoples  to  curtail  certain 
practices  and  rituals  that  are  not  considered  “mod¬ 
ern”  or  “civilized”  is  said  to  be  growing  as  well. 
Debate  rages  over  whether  countries  have  the  right 
to  condone  all  types  of  practices  within  their 
borders  or  whether  there  exist  certain  minimal 
standards  that  all  countries  must  follow.  And  the 
end  of  some  practices,  such  as  slavery,  has  been 
widely  applauded.  The  decrease  in  both  communi¬ 
cation  and  transaction  costs  in  a  globalized  world 
has  obviously  helped  intensify  such  convergence. 
External  pressures  to  conform  to  global  norms  may 


be  stronger  today  than  ever  but  such  pressures 
have  always  existed.  Today  they  are  less  likely  to 
be  imposed  through  the  use  of  force  or  conquest 
but  that  does  not  mean  these  pressures  are  less 
powerful. 

Scholars  also  worry  that  a  global  system  is  an 
increasingly  interconnected  one  where  problems  in 
one  country  or  region  can  spread  more  rapidly, 
forcefully,  and  widely  than  in  a  less  interconnected 
environment.  The  increasing  ease  of  transporta¬ 
tion,  for  example,  makes  the  spread  of  disease 
much  faster  and  perhaps  deadlier.  The  communi¬ 
cations  revolution  does  much  the  same  for  new 
ideas  and  fads.  Information  cascades  are  more 
likely  in  such  systems.  This  means  that  instability 
may  be  greater  as  well.  Small  changes  to  the  system 
radiate  out  in  all  directions  and  can  lead  to  large 
consequences.  There  is  a  concern  that  crises  are 
also  more  likely  and  more  contagious  in  such  a 
globalized  world.  The  Asian  financial  crisis  of  the 
late  1990s  and  the  recent  global  financial  crisis  of 
2008  to  2009  are  two  such  examples.  Some  have 
argued  that  the  world’s  largest  economies  have 
experienced  a  “great  moderation”  since  the  1970s 
with  a  dampening  of  business  cycles  and  decreas¬ 
ing  inflation  due  partially  to  greater  international 
exposure.  Others  see  an  international  economy 
beset  by  rising  volatility  over  time,  as  crises  multi¬ 
ply  and  spread.  Globalization  may  stabilize  a 
system  by  spreading  problems  out  and  bringing 
counterbalancing  forces  to  bear,  but  it  may  also 
destabilize  a  system  by  creating  a  network  of  link¬ 
ages  among  all  countries  and  across  all  sectors  that 
transmits  and  magnifies  problems. 

Domestic  Economies 

In  addition  to  the  question  about  which  coun¬ 
tries  gain  and  lose  from  globalization,  there  is  the 
issue  of  which  groups  gain  and  lose  domestically. 
Many  see  globalization  as  strengthening  the  hand 
of  capital  owners  and  right-wing  political  parties 
as  opposed  to  labor  owners  and  left-wing  parties. 
With  a  global  economy  where  capital  can  move 
relatively  freely,  it  becomes  advantaged  domesti¬ 
cally.  Globalization  changes  the  domestic  balance 
of  power  between  business  and  labor.  When  own¬ 
ers  of  capital  do  not  like  a  government  policy  or 
fear  labor  has  become  too  strong,  they  can  often 
move  to  a  new  political  environment  that  is  more 
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accommodating.  This  process  can  then  generate 
race-to-the-bottom  pressures  as  governments  try 
to  retain  capital  in  their  country  by  deregulating 
and  adopting  more  business-friendly  policies. 
Some  claim  that  globalization  has  helped  under¬ 
mine  labor  unions  and  other  forms  of  labor  market 
organization.  This  process  can  then  influence  the 
political  system.  If  right-wing  parties  represent 
business  interests  more,  they  may  be  strengthened 
by  globalization.  Left-wing  parties  and  their  pro¬ 
grams  may  seem  increasingly  unattractive  as  they 
“scare”  capital  away  and  perhaps  slow  down 
growth.  Evidence  for  this  effect  is  mixed.  Some 
claim  that  left-wing  parties  and  governments  actu¬ 
ally  do  a  better  job  wooing  capital  and  thus  are  not 
disadvantaged  by  globalization;  they  may  even 
make  better  partners  for  international  business 
than  right-wing  parties.  Some  scholars  see  no 
effect  of  globalization  on  domestic  politics;  they 
claim  that  domestic  political  institutions  filter  and 
shape  such  external  pressures  so  that  their  ultimate 
effects  are  a  function  of  domestic  politics. 
Preexisting  domestic  institutions  and  practices 
then  govern  whether  and  how  globalization  affects 
internal  politics.  The  impact  of  globalization  on 
political  power  within  and  between  countries  is  a 
topic  of  debate  and  great  importance  for  future 
research. 

Limits  of  Current  Research 

The  consequences  of  globalization  could  thus  be 
widespread.  They  might  affect  economics,  politics, 
society,  and  culture.  Existing  research  points  out 
that  globalization  has  affected  economic  growth 
rates,  poverty,  inequality,  democracy,  conflict, 
state  capacity,  policy  and  institutional  convergence, 
cultural  diversity,  volatility  and  the  diffusion  of 
crises,  and  balance  of  power  between  capital  and 
labor.  But  it  provides  little  consensus  on  the  direc¬ 
tion  or  magnitude  of  these  effects. 

Globalization  and  the  Future 

Given  the  many  consequences  that  globalization 
might  have,  one  question  that  is  often  raised  is 
whether  countries  can  manage  its  impact.  This  is 
especially  important  to  the  extent  that  globaliza¬ 
tion  has  negative  consequences.  Many  types  of 
problems  that  have  been  endemic  to  countries  for 


centuries  have  taken  on  a  new  perspective  in  a 
globalized  world.  Problems  that  were  national 
have  become  transnational.  The  ability  of  coun¬ 
tries  to  realize  many  of  their  basic  goals  has 
become  more  and  more  tied  to  the  actions  of  other 
countries.  Economic  prosperity  domestically  often 
depends  now  on  international  trade  and  capital 
movements  around  the  globe;  national  security 
from  all  sorts  of  threats  depends  on  the  behavior 
of  other  countries  and  nonstate  actors;  environ¬ 
mental  conditions  rely  on  the  behavior  of  other 
agents  globally;  and  public  health  relies  more  and 
more  on  transnational  factors  and  the  behavior  of 
international  agents.  Countries  have  always  faced 
such  problems;  it  is  just  that  their  impact  and  reso¬ 
lution  depend  more  and  more  on  agents  outside 
the  state  itself.  Globalization  is  intimately  con¬ 
nected  to  this  process.  By  increasing  contacts  and 
interactions  among  countries  and  peoples,  it  helps 
make  many  issues  transnational  in  character, 
rather  than  national. 

Given  that  governance  is  largely  the  domain  of 
nation-states,  is  there  anything  that  can  be  done  to 
address  these  transnational  problems?  One  coun¬ 
try  can  rarely  dictate  another’s  economic,  environ¬ 
mental,  security,  or  public  health  policy,  for 
example.  But  to  deal  with  transnational  problems, 
coordination  of  policies  may  be  the  best  way  for¬ 
ward.  In  particular,  cooperation  where  countries 
coordinate  their  policies  to  arrive  at  mutually  pref¬ 
erable  outcomes  may  be  essential.  Globalization 
may  facilitate  cooperation;  it  may  make  the  costs 
of  failing  to  cooperate  so  high  that  countries  are 
more  willing  to  try.  But  it  may  also  make  the 
stakes  of  cooperation  much  higher  and  thus  render 
it  less  likely. 

International  institutions  may  be  one  way  to 
address  these  concerns.  In  a  globalized  era,  some 
see  such  institutions  as  essential  for  dealing  with 
transnational  problems.  Countries  alone  cannot 
successfully  deal  with  them;  they  require  changes  in 
the  behavior  of  other  states.  International  institu¬ 
tions  that  states  voluntarily  join  and  comply  with 
may  help  them.  Such  institutions  may  help  states 
realize  cooperative  outcomes;  they  may  provide 
transparency  and  lower  transaction  costs  for  nego¬ 
tiating  solutions  to  transnational  issues;  they  may 
embody  global  norms  and  practices  that  allow 
states  to  identify  focal  points  for  cooperation  and / 
or  help  them  enforce  compliance  with  international 
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norms  and  cooperative  agreements.  These  are  the 
potential  benefits  of  global  institutions.  They  may 
have  costs  as  well.  Not  all  international  institu¬ 
tions  function  adequately;  some  have  serious 
internal  defects  that  cause  them  to  operate  poorly. 
Others  are  deadlocked  by  internal  divisions.  Some 
are  dominated  by  one  or  two  states  that  coerce 
others  to  adopt  their  preferred  norms  and  prac¬ 
tices  even  if  these  are  not  very  beneficial  for  the 
others.  International  institutions  cannot  be  seen  as 
a  costless  and  efficient  solution  to  all  transna¬ 
tional  problems.  But  many  of  them  have  shown 
some  ability  either  to  prevent  transnational  prob¬ 
lems  for  worsening  (e.g.,  the  WHO  and  the  WTO 
in  times  of  crisis)  or  to  allow  states  to  cooperate  in 
order  to  better  manage  problems  in  a  globalized 
world  (e.g.,  the  EU). 

A  related  issue  is  concern  over  globalization’s 
future.  If  one  adopts  the  secular  view  of  it,  then 
little  chance  exists  that  the  increasing  contact  and 
interactions  among  peoples  and  countries  can  be 
reversed.  Indeed  if  there  is  technological  progress, 
there  is  likely  to  be  increasing  globalization;  more 
and  more  contact  and  interaction  are  inevitable  as 
technology  brings  people  in  the  world  closer 
together.  On  the  other  hand,  the  more  cyclical 
view  suggests  that  globalization  comes  and  goes  in 
waves  depending  on  political,  economic,  and 
social  reactions.  We  may  now  be  experiencing  the 
height  of  this  period  of  globalization.  Forces  for  a 
backlash  against  the  pressure  exerted  by  increasing 
contacts  and  interactions  among  peoples  and 
states  may  be  gaining  strength.  The  economic  and 
financial  crisis  of  2008  to  2009  may  be  a  prelude 
to  this  type  of  backlash.  In  this  view,  governments 
can  respond  in  ways  that  curtail  or  reverse  global¬ 
ization;  they  can  protect  their  economies,  close 
their  borders,  end  participation  in  international 
institutions,  and  take  other  steps  that  seal  them  off 
from  the  rest  of  the  world.  The  period  between 
1914  and  1945  is  one  example  of  this.  The  ques¬ 
tion,  of  course,  is  what  would  be  the  costs  and 
benefits  of  such  a  course  of  action  and  would  gov¬ 
ernments  be  able  to  achieve  any  kind  of  meaning¬ 
ful  autonomy  or  autarchy,  given  the  technologies 
we  now  have  and  the  public  expectations  about 
their  connections  to  the  rest  of  the  world.  Whether 
globalization  is  reversible  or  not  is  a  question  of 
great  import;  the  costs  of  doing  so  would  be  a  key 
factor.  More  realistic,  perhaps,  are  questions  about 


whether  globalization  can  be  slowed  down  or  man¬ 
aged  in  ways  that  are  better  for  all  concerned. 

In  conclusion,  debate  rages  about  globaliza¬ 
tion.  A  number  of  theories  exist  about  its  sources. 
Technological  change,  international  political 
hegemony,  global  normative  convergence,  and/or 
independent  country  reactions  have  all  been  cited 
as  major  causes  of  globalization.  Globalization’s 
effects  have  been  debated  in  even  greater  intensity 
and  breadth.  Substantial  literatures  exist  on 
whether  globalization  has  affected  economic 
growth  rates,  poverty,  inequality,  democracy, 
conflict,  state  capacity,  policy  and  institutional 
convergence,  cultural  diversity,  volatility  and  the 
diffusion  of  crises,  and  balance  of  power  between 
capital  and  labor.  Agreement  on  the  nature  of  its 
effects  on  any  of  these  is  scarce.  For  some,  the  bal¬ 
ance  of  its  impact  has  been  negative  in  almost  all 
domains;  for  others,  it  is  seen  as  largely  positive, 
especially  in  the  economic  and  political  areas;  and 
for  yet  others,  it  has  costs  and  benefits  that  are 
hard  to  calculate  and  summarize  overall.  One  fact 
not  in  doubt  is  that  globalization  is  an  important 
feature  of  our  world  these  days;  it  seems  to  have 
consequences  for  all  domains  of  economic,  politi¬ 
cal,  social,  and  cultural  affairs.  How  to  manage  it 
in  order  to  make  states,  peoples,  or  the  world  bet¬ 
ter  off  is  thus  of  central  importance. 
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Amid  the  debate  on  globalization  and  its  influence 
on  domestic  politics  and  institutional  arrange¬ 
ments,  it  is  intriguing  to  note  the  resilience  of 


national  institutions,  policy  styles,  and  percep¬ 
tions  of  the  role  of  the  state  in  society.  True,  many 
countries  that  displayed  significantly  different 
governance  arrangements  a  few  decades  ago  today 
present  a  more  common  governance  model.  Even 
so,  however,  the  legacy  of — or  more  correctly, 
perhaps,  the  continued  support  for — traditions  of 
statehood,  governance,  and  collective  action  loom 
large.  These  traditions  can  be  seen  as  value  sys¬ 
tems  that  have  evolved  over  an  extensive  period  of 
time,  continuously  reproduced  in  public  policy  and 
public  discourse.  If  there  is  strong  resilience  in  insti¬ 
tutions  and  policy,  this  in  part  can  be  attributed  to 
strong  political  and  administrative  tradition.  This 
entry  examines  the  nature  of  the  contemporary 
state  and  then  examines  the  different  governance 
models  as  means  of  solidifying  policy  preferences. 

Although  it  makes  sense  to  think  of  these  value 
systems  as  traditions,  they  can  mostly  be  traced 
back  to  specific  political  contexts  and  policy  choice 
(see,  e.g.,  Francis  Castles,  1993).  For  instance,  the 
Asian  “developmental  state”  (Chalmers  Johnson, 
1982)  and  the  Scandinavian  and  continental 
European  welfare  state  models  (Gosta  Esping- 
Andersen,  1990)  evolved  through  a  series  of  politi¬ 
cal  choices  about  the  role  of  the  state,  the  market, 
and  civil  society.  Against  this  backdrop,  the  pur¬ 
pose  of  this  entry  is  to  explore  the  linkages  between 
policy  choice  and  models  of  governance.  Con¬ 
ventional  wisdom  holds  that  policy  choice  is  largely 
a  product  of  institutional  arrangements.  The  imme¬ 
diate  interest  in  this  entry  is  to  see  to  what  extent 
those  institutions,  in  turn,  can  be  related  to  a  more 
overarching  set  of  values  and  beliefs  about  the  role 
of  the  state.  ( Value  refers  to  everything  that  is  con¬ 
sidered  to  have  a  value,  be  it  material,  ideological, 
ethical,  or  other,  by  most  people/citizens.)  Should 
that  be  the  case,  we  will  be  well  under  way  to  con¬ 
nect  a  typology  of  governance  arrangements  to 
broad  categorizations  of  policy.  There  has  been 
some  work  done  on  the  relationship  between  differ¬ 
ent  types  of  policy,  drawing  on  the  familiar  typology 
developed  by  Ted  Lowi,  and  the  institutional 
arrangements  of  the  state  but  this  is  a  vast  research 
field  and  much  more  work  remains  before  we  have 
a  more  complete  account  of  the  linkages  between 
governance  arrangements  and  policy  styles. 

This  entry  then  looks  at  a  couple  of  different 
governance  models  as  a  form  of  institutionaliza¬ 
tion  of  overarching  policy  choices  in  terms  of 
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constitutional  and  institutional  arrangements.  The 
underlying  premise  of  this  entry  is  that  the  institu¬ 
tions  framing  policy  choice  are  themselves  “locked 
in”  by  fundamental  policy  choices  concerning  the 
role  of  the  state  in  society  and  its  centrality  in 
governance.  The  predominant  governance  model 
of  a  particular  country  is  not  a  given  but  reflects 
entrenched  notions  about  the  role  of  the  state,  the 
market,  and  civil  society. 

These  values  and  perceptions  become  deeply 
institutionalized  over  time  and  elevated  into  politi¬ 
cal  culture  above  and  beyond  day-to-day  policy 
making.  Even  so,  however,  when  observed  over  a 
longer  stretch  of  time  distinct  changes  in  these 
value  systems  can  be  observed.  The  role  of  govern¬ 
ment  in  the  United  States  changed  fundamentally 
from  Lyndon  Johnson’s  Great  Society  project  in 
the  1960s  to  Ronald  Reagan’s  neoliberal  policy 
style  in  the  1980s;  in  Britain,  Labour  under  Tony 
Blair  and  Gordon  Brown  looked  significantly  dif¬ 
ferent  from  Labour  under  Harold  Wilson;  in 
Japan,  the  relationship  between  the  Liberal  Demo¬ 
cratic  Party  (LDP),  the  bureaucracy,  and  the  cor¬ 
porate  sector  has  changed  profoundly  since  the 
1960s;  and  in  Scandinavia,  the  welfare  state  today 
has  very  different  organization  and  modus  ope- 
randi  from  those  of  the  1960s  or  1970s,  and  so  on. 
Thus,  however  entrenched  and  institutionalized 
these  values  may  be,  they  do  nonetheless  change, 
sometimes  abruptly,  sometimes  more  subtly. 

The  hypothesis  that  specific  cases  of  policy  mak¬ 
ing  are  embedded  in  more  overarching  values  and 
norms,  perhaps  even  a  social  theory,  concerning 
what  should  be  the  role  of  the  state  in  society  opens 
up  for  potential  theoretical  circularities.  It  could  be 
that  the  more  general  values  are  simply  defined  as 
the  sum  of  specific  policies,  in  which  case  the  argu¬ 
ment  about  embeddedness  and  institutionalization 
becomes  circular.  And,  even  if  different  specific 
types  of  policies  can  be  theoretically  linked  to  dif¬ 
ferent  governance  arrangements,  a  causal  mecha¬ 
nism  is  still  needed  to  explain  how  and  why  that 
would  be  the  case.  A  more  plausible  way  of  looking 
at  such  relationships  would  be  to  suggest  that  pol¬ 
icy  and  governance  tend  to  foster  and  reinforce 
each  other.  Lor  instance,  “big  government”  service 
models  tend  to  emphasize  redistributive  policies 
and  equality  and  equal  treatment  in  service  deliv¬ 
ery,  while  neoliberal  policies  emphasize  economic 
growth  and  a  customer-oriented,  market-like 


model  of  public  service.  Thus,  different  aspects  of 
public  policy  and  public  administration  practices 
are  closely  related. 

This  entry  first  looks  at  four  broad  dimensions  of 
governance  that  relate  to  different  aspects  of  the 
role  of  the  state  in  society.  It  then  discusses  three  dif¬ 
ferent  models  of  governance:  state-centric,  liberal- 
democratic,  and  network-based  governance.  The 
third  section  merges  these  two  analyses  to  elaborate 
the  features  and  logics  of  the  three  governance  mod¬ 
els.  Since  issues  related  to  institutions,  institutional¬ 
ization,  and  deinstitutionalization  are  central  to  the 
present  analysis,  these  issues  are  discussed  before  a 
concluding  section  sums  up  the  entry. 

Dimensions  of  Governance 

Nation-states  around  the  world  offer  a  wide  vari¬ 
ety  in  terms  of  how  they  organize  their  gover¬ 
nance.  Perhaps  the  most  obvious  and  striking  dif¬ 
ferences  relate  to  the  fundamental  aspects  of  state 
organization  in  terms  of  their  institutional  arrange¬ 
ment.  Thus,  there  are  federal  states  and  unitary 
states,  centralized  states  and  decentralized  states, 
and  states  with  a  long  tradition  in  regulating  mar¬ 
kets  and  social  behavior  while  other  states  see 
their  role  as  enabling  and  facilitating  markets  and 
free  individuals. 

These  institutional  arrangements  are  highly 
path  dependent.  They  evolved  as  constitutional 
systems  defining  the  model  of  governance,  which 
were  believed  to  reflect  popular  expectations  and 
the  requirements  of  a  political  system  to  govern  the 
country.  Thus,  for  instance,  American  or  Canadian 
federalism  was  the  model  that  was  believed  to 
provide  the  optimal  balance  between  national 
interests  and  regional  autonomy  in  those  vast  ter¬ 
ritories,  while  the  unitary  state  was  the  preferred 
arrangement  in  smaller  European  countries.  These 
are  complex  issues  to  capture  analytically  because 
they  are  shaped  by  the  history  of  the  country  and 
its  unification  process,  its  state  tradition,  the 
homogeneity  of  the  population,  and  its  geopoliti¬ 
cal  context. 

One  strategy  to  approach  these  issues  is  to 
relate  them  to  a  set  of  dimensions  of  governance. 
If  we  conceive  of  governance  as  the  pursuit  of  col¬ 
lective  interests,  with  the  state  as  a  coordinating 
and  enabling  actor,  we  can  clearly  see  that  gover¬ 
nance  is  both  a  process  that  is  endogenous  to  the 
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state — division  of  labor,  competencies,  jurisdic¬ 
tions,  and  resources — and  a  process  in  which  the 
state  interacts  with  other  actors  in  society. 
Therefore,  we  will  want  the  dimensions  of  gover¬ 
nance  to  capture  both  the  internal  and  external 
aspects  of  governance.  Below,  four  such  dimen¬ 
sions  of  governance  are  briefly  outlined.  These 
dimensions  describe  different  relationships  between 
the  state,  the  society,  and  the  market. 

The  governance  dimensions  speak  to  very  basic 
issues  about  the  organization  of  the  state  and  its 
role  in  society.  Therefore,  it  should  not  be  surpris¬ 
ing  to  find  that  they  are  normatively  charged  and 
are  frequently  topics  of  political  disunity  in  many 
countries. 

Popular  Representation  Versus 
Institutional  Autonomy 

The  first  governance  dimension  measures  the 
degree  to  which  the  state  is  responsive  to  popular 
demands  and  expectations,  and  it  places  that 
responsiveness  in  juxtaposition  to  a  high  degree  of 
autonomy  for  the  state’s  political  and  administrative 
institutions.  For  governance  to  qualify  as  demo¬ 
cratic,  it  is  essential  that  the  state  responds  to  popu¬ 
lar  demands  and  incumbent  political  parties  are 
allowed  to  shape  the  policy  agenda  and  to  engage  in 
public  discourse.  At  the  same  time,  however,  institu¬ 
tions  need  to  have  some  integrity  and  autonomy, 
both  in  relationship  to  partisan  ideologies  and  to 
direct  popular  pressures  in  order  to  be  able  to  fulfill 
their  role  in  the  policy  process.  Institutions  that 
develop  close  and  bilateral  relationships  with  paro¬ 
chial  societal  interests  can  develop  networks  as 
alternative  instruments  of  policy  implementation, 
but  they  will  have  problems  in  implementing  poli¬ 
cies  that  discriminate  among  policy  targets.  This  is, 
thus,  a  dimension  where  both  extremes  are  unsatis¬ 
factory  from  a  governance  perspective;  democratic 
governance  is  contingent  on  both  popular  input  on 
public  policy  and  some  degree  of  institutional 
autonomy  allowing  those  institutions  to  pursue 
policies  that  might  not  enjoy  popular  support  but 
are  necessary  to  the  state. 

Another  example  of  the  complexities  associated 
with  this  dimension  can  be  seen  in  the  relationship 
between  politicians  and  bureaucrats.  On  the  one 
hand,  we  expect  the  civil  service  to  be  responsive 
to  the  government  of  the  day,  implementing  its 


political  wishes  without  prejudice.  On  the  other 
hand,  we  expect  the  public  administration  to  be 
guided  by  values  such  as  impartiality  and  by  due 
process  and  public  law,  and  a  “politicization”  of 
the  administrative  system  is  often  seen  as  a  contro¬ 
versial  development  that  could  compromise  the 
integrity  of  the  bureaucracy.  Again,  both  extremes 
on  the  dimension  are  problematic  in  a  governance 
perspective  whereas  some  form  of  combination  of 
the  two  positions  fosters  democratic  governance  or 
is  indeed  integral  to  governance. 

Societal  Integrity  Versus  Institutional  Capacity 

A  second  dimension  of  governance  refers  to  the 
tension  between  fundamental  values  related  to 
individual  freedom,  on  the  one  hand,  and  the  need 
for  institutions  to  encroach  on  that  freedom  in 
order  to  ensure  collective  interests  and  values,  on 
the  other.  This  dilemma  and  the  question  it  raises 
on  forms  of  justification  for  political  control  over 
individuals  is  a  major  field  of  research  and  dis¬ 
course  in  political  philosophy,  and  the  finer  details 
of  that  debate  are  not  probed  too  deeply  here. 
There  are  obviously  values  related  to  the  individ¬ 
ual  qua  an  individual  and  other  values  derived 
from  the  individual  as  a  member  of  a  polity.  The 
tension  between  these  two  types  of  values  plays 
out  almost  daily  in  a  variety  of  policy  sectors  such 
as  environmental  policy  and  climate  change,  crime 
prevention,  social  welfare,  and  tax  policy.  The 
government’s  need  to  collect  information  about 
individual  behavior  has  become  a  major  issue,  par¬ 
ticularly  in  matters  related  to  national  security 
where  the  need  for  surveillance  stands  against  per¬ 
sonal  integrity.  A  similar  debate  has  surrounded 
means-tested  welfare  state  programs  where  gov¬ 
ernment  agencies  require  beneficiaries  to  submit 
detailed  information  about  their  standard  of  living 
and  income. 

The  degree  to  which  government  should  have 
access  to  private  information  or  regulate  personal 
behavior  is  ultimately  a  normative  issue  where  dif¬ 
ferent  interests  are  weighed  against  each  other;  it  is 
not  possible  to  objectively  delineate  the  pursuit  of 
individual  interests  aggregated  into  government 
action,  on  the  one  hand,  and  the  rights  of  individu¬ 
als  to  act  freely,  on  the  other.  The  point  here  is 
instead  that  some  encroachment  on  individual 
freedom  is  necessary  for  government  to  govern 
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society,  while  the  precise  extent  to  which  such 
encroachment  can  be  justified  remains  a  normative 
issue  that  reemerges  fairly  regularly  on  the  political 
agenda.  Meanwhile,  constitutional  reform  and 
institutional  design  have  to  wrestle  with  this  issue 
in  a  more  mundane  and  pragmatic  fashion. 

Institutional  capacity  is  a  key  variable  in  a 
broader  governance  perspective.  According  to  one 
perspective,  contemporary  society  has  become  too 
complex  for  any  single  actor  to  provide  effective 
steering  or  coordination.  Even  if  government  pre¬ 
viously  had  the  capacity  to  fulfill  those  roles,  the 
growing  complexity  has  undermined  the  governing 
capacity  of  the  state.  As  a  result,  governance  is 
conducted  through  networks,  partnerships,  and 
other  interactive  instruments  of  coordination  and 
collective  action.  In  terms  of  institutional  capacity, 
this  perspective  argues  that  in  contemporary  gov¬ 
ernance,  such  capacity  is  not  so  much  a  matter  of 
constitutional  mandates  as  a  continuous  process  of 
reproducing  institutional  leverage  through  coop¬ 
eration  with  societal  actors. 

National  Interests  Versus  Subnational  Autonomy 

All  systems  of  government  integrate  political 
and  administrative  action  on  different  institutional 
levels.  Exactly  how  that  integration  is  organized  is, 
however,  a  matter  of  substantive  cross-national 
variation.  Thus,  subnational  political  systems  are 
granted  some  degree  of  autonomy  in  relationship 
to  central  government  (for  an  overview,  see  Alan 
Norton,  1994). 

The  basic  logic  of  organizing  government  in  this 
way  is  one  of  efficiency  and  expediency,  but  there 
are  also  important  aspects  of  these  arrangements 
that  speak  to  the  development  of  democratic  gov¬ 
ernance.  The  efficiency  argument  is  fairly  obvious; 
it  simply  does  not  make  any  sense  to  have  the 
national  government  organize  water  and  sewage 
system  maintenance  or  garbage  disposal  service  in 
every  town  and  village  throughout  the  country. 
The  democracy  argument  holds  that  participation 
in  local  public  affairs  socializes  citizens  into  the 
democracy  form  of  governance.  For  that  socializa¬ 
tion  to  work,  local  government  must  have  some 
autonomy,  some  jurisdiction,  within  which  the 
local  demos  can  make  decisions  pertaining  to  pub¬ 
lic  matters  in  their  locale.  Also,  the  democratic 
theory  of  local  government  holds  that  the  role  of 


elected  officials  should  be  to  cater  to  the  interests 
of  the  populace  in  a  nonpartisan  way.  Ideology 
and  division  should  not  be  imported  from  national 
politics  and  brought  onto  the  local  political  scene. 

This  somewhat  cozy  account  of  local  govern¬ 
ment  is  troubled  by  the  existence  of  national  inter¬ 
ests,  which  play  out  in  the  local  territory.  In  issues 
such  as  infrastructural  development,  the  location 
of  windmill  parks,  or  the  deposit  of  nuclear  waste, 
national  interests  frequently  encounter  fierce  oppo¬ 
sition  from  local  governments.  The  question  then 
becomes  under  what  conditions  should  the  central 
government  be  allowed  to  override  the  will  of  a 
local  demos?  The  problem  is  larger  than  simple 
not-in-my-backyard  (NIMBY)  politics;  it  relates 
ultimately  to  the  problem  of  two  competing  demoi : 
the  national  and  the  local. 

Again,  the  two  extremes  of  this  dimension — 
complete  dominance  of  national  interests  and 
extensive,  unconditional  local  autonomy — raise 
serious  questions  in  terms  of  the  quality  and, 
indeed,  the  prospects,  of  governance.  Instead, 
some  middle  ground  position  characterized  by  dia¬ 
logue  between  central  and  local  government  with 
respect  for  both  the  national  and  local  demoi 
should  provide  for  better  governance. 

As  is  the  case  with  the  previous  dimension,  this 
dimension,  too,  speaks  directly  to  governance 
issues  in  several  different  ways.  If  central,  regional, 
and  local  governments  have  overlapping  compe¬ 
tencies  and  jurisdictions,  that  is,  if  policy  making 
involves  several  levels  of  government,  the  policy 
process  will  be  characterized  by  a  significant  verti¬ 
cal  dimension  of  resource  allocation,  control,  and 
bargaining.  If,  on  the  other  hand,  policy  sectors  are 
largely  divided  among  institutional  levels  so  that 
central  government  is  in  control  of  some  policy 
sectors,  regional  government  of  other  sectors,  and 
local  governments  of  yet  other  sectors,  the  process 
becomes  less  complex  and  will  not  involve  much 
interaction  among  institutional  levels.  It  is  thus 
essential  to  acknowledge  the  political  nature  of 
state-local  relationships;  while  a  formal  institu¬ 
tional  analysis  depicts  those  relationships  mainly 
as  one  of  division  of  labor,  a  closer  inspection  of 
central-local  interactions  will  uncover  struggles 
over  control,  financial  resources,  and  policy. 

Another  example  of  the  governance  ramifica¬ 
tions  of  central-local  relationships  is  that  they 
define  to  a  large  extent  the  relationship  between  a 
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city  and  key  players  in  the  local  environment. 
Urbanists  would  refer  to  this  as  the  interplay 
between  horizontal  and  vertical  dimensions  of 
autonomy  where  the  vertical  dimension  defines 
much  of  the  preconditions  for  horizontal,  urban 
governance. 

Autonomous  Markets  Versus  Regulation 

The  degree  to  which  government  regulates  mar¬ 
kets  is  often  seen  as  a  fundamental  aspect  of  the 
left-right  division  in  political  life.  Parties  on  the 
political  right  typically  insist  that  markets  work 
best  if  they  are  left  alone;  market  regulation 
chokes  the  market,  rendering  it  an  inefficient 
instrument  of  resource  allocation.  Government 
regulation  in  this  perspective  distorts  market 
mechanisms,  perverts  pricing,  and  removes  incen¬ 
tives  for  risk  taking  and  entrepreneurialism.  Leftist 
parties,  on  the  other  hand,  maintain  that  markets 
need  to  be  brought  under  some  public  control 
partly  in  order  to  ensure  that  they  do  not  create 
politically  undesirable  outcomes  such  as  vast  dif¬ 
ferences  in  wealth  and  life  chances.  Many  of  the 
areas  of  society  that  are  essential  to  the  type  of 
social  reform  that  parties  on  the  political  left  pur¬ 
sue  (e.g.,  housing,  education,  health  care,  and 
work),  operate  under  market  control.  Therefore, 
market  regulation  is  an  important  instrument  to 
insert  political  objectives  into  markets. 

Market  regulation,  as  mentioned  earlier,  is 
politically  controversial.  While  there  may  be  agree¬ 
ment  among  political  actors  on  long-term  policy 
goals  such  as  economic  growth,  issues  become 
charged  as  soon  as  the  role  of  the  market  toward 
those  political  goals  is  to  be  defined.  Parties  right 
of  center  will  argue  that  markets  left  alone  will 
deliver  the  highest  economic  growth,  whereas  par¬ 
ties  on  the  political  Left  will  insist  that  markets 
cannot  deliver  a  socially  acceptable  economic 
development  or  that  markets  are  simply  unable  to 
solve  collective  problems. 

That  having  been  said,  however,  there  is  also  a 
fair  amount  of  pragmatism  surrounding  the  issue 
of  market  regulation.  The  American  political  cul¬ 
ture  has  historically,  albeit  with  few  but  important 
exceptions,  heralded  the  notion  of  free  markets 
and  small  government.  Meanwhile,  the  economic 
history  of  the  United  States  is  replete  with  exam¬ 
ples  of  publicly  driven  large-scale  infrastructure 


development  projects  and  public  service  delivery, 
according  to  Jeff  Madrick  (2009).  More  broadly, 
there  is  an  understanding  from  Adam  Smith’s  clas¬ 
sic  book  in  the  field,  On  the  Wealth  of  Nations, 
onward  that  markets  need  a  regulatory  framework 
to  function  properly.  Furthermore,  the  evidence 
supporting  the  argument  that  nonregulated  mar¬ 
kets  perform  better  than  markets  under  some 
degree  of  government  regulation  is  agnostic.  The 
key  problems  seem  to  revolve  around  issues  such 
as  designing  the  governance  of  a  market  economy 
that  generates  growth  at  the  same  time  as  it  solves 
problems  of  coordination  and  provides  consumer 
protection  and  sustainable  development. 

This  leads  us  to  consider  the  critical  distinction 
between  regulation  of  and  within  markets.  The 
regulation  of  markets  means  providing  a  regulatory 
framework  defining  property,  contracts,  and  so  on 
and  a  mechanism  for  settling  disputes  among  mar¬ 
ket  actors.  Regulation  in  a  market  refers  to  political 
and  administrative  control  of  market  mechanisms 
such  as  price  or  supply  by  regulating  entrance  to 
the  market.  The  former  type  of  regulation  is  non- 
discriminatory,  while  the  latter  form  is  discrimina¬ 
tory  and  targeted  to  ensure  collective  interests  in 
the  markets  such  as  consumer  protection  and  price 
caps  on  specific  goods  provided  by  the  market. 
Such  public  presence  in  the  market  could  also 
manifest  itself  in  taxes  or  incentives  to  sanction  or 
encourage  different  forms  of  market  behavior. 

From  a  governance  perspective,  this  dimension 
provides  important  information  about  the  politi¬ 
cal  economy  of  governing.  Peter  Hall  and  David 
Soskice’s  (2001)  distinction  between  “liberal”  and 
“coordinated”  market  economies  suggests  that  the 
degree  to  which  the  capitalist  economy  is  embed¬ 
ded  in  domestic  governance  varies  considerably 
among  different  countries.  In  coordinated  market 
economies,  with  the  continental  European,  Scan¬ 
dinavian,  and  several  Asian  countries  as  leading 
examples,  interactions  between  the  state  and  pri¬ 
vate  capital  are  institutionalized,  making  con¬ 
certed  governance  less  complicated  and  politically 
sensitive  than  in  the  Anglo-American  liberal  mar¬ 
ket  economies.  Also,  market  regulation  is  a  policy 
instrument  defining  a  political  objective  in  the 
market.  That  objective  could,  however,  also  be 
implemented  in  less  obtrusive  ways,  thereby  facili¬ 
tating  a  higher  degree  of  concerted  action  among 
public  and  private  interests. 
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As  is  the  case  with  the  other  governance  dimen¬ 
sions,  both  end  points  of  the  dimension — com¬ 
pletely  free  or  completely  regulated  markets — are 
neither  efficient  from  a  market  point  of  view  nor 
perhaps  desirable  from  a  democratic  governance 
perspective. 

Governance  Models 

Let  us  now  turn  to  a  brief  presentation  of  a  few 
stylized  models  of  governance.  In  2005,  Jon  Pierre, 
together  with  Guy  Peters,  outlined  a  theory  of  gov¬ 
ernance  drawing  on  a  typology  of  governance 
models.  The  basic  assumption  sustaining  that 
analysis  was  that  all  governance  arrangements 
have  to  fulfill  a  number  of  roles  or  functions,  but 
different  governance  models  accord  the  state  and 
societal  actors  widely  different  significance  for 
those  roles.  The  authors  found  that  both  the  most 
state-centric  governance  model  and,  at  the  other 
end  of  the  spectrum,  network-based  governance 
had  problems  with  defining  a  state-society  interac¬ 
tion  that  blended  institutional  autonomy  with 
openness  to  the  interests  and  expertise  found 
among  societal  actors.  Instead,  governance  models 
between  the  two  extreme  positions  on  the  contin¬ 
uum,  such  as  the  corporatist  governance  model, 
fared  relatively  well;  they  were  capable  of  blending 
some  indigenous  capacity  to  steer  with  a  fair  open¬ 
ness  in  state-society  exchanges. 

The  defining  feature  of  governance  theory,  as 
opposed  to  more  traditional  government-centric 
accounts  of  governing,  is  the  centrality  of  exchanges 
between  the  state  and  actors  in  its  external  envi¬ 
ronment.  The  three  models  outlined  here  describe 
different  types  of  such  relationships  and  interac¬ 
tions.  However,  the  present  space  only  allows  for 
very  brief  accounts  of  complex  and  multidimen¬ 
sional  governance  arrangements. 

State-Centric  Governance 

The  first  governance  model  portrays  the  state 
exercising  firm  control  over  its  territory  and  popu¬ 
lation.  The  state  has  few  contingencies  to  other 
societal  actors  and  does  not  have  to  be  concerned 
about  any  lack  of  institutional  capacity  to  impose 
its  will  on  society.  The  typical  governing  style  of 
government  in  state-centric  governance  is  to  use 
policy  instruments  sufficiently  distinct  to  ensure 


the  desired  change  in  social  behavior.  That  includes 
securing  tax  revenues  of  sufficient  level  to  sustain 
those  projects  that  the  state  wishes  to  implement. 
The  state,  in  this  model  of  governance,  sees  little 
reason  to  engage  in  collaborative  forms  of  gover¬ 
nance  or  service  delivery. 

If  we  accept  Joel  Migdal’s  (2001)  argument  that 
state  and  society  mutually  constitute  each  other, 
democratic  forms  of  state-centric  governance  occur 
where  there  is  a  societal  expectation  for  such  a 
leading  role  of  the  state.  That  means  that  although 
government  will  not  refrain  to  use  obtrusive  policy 
instruments,  the  societal  submission  to  the  state 
suggests  that  the  state  does  rarely  have  to  throw 
much  weight  to  impose  its  will  on  society.  Whether 
it  is  because  the  state  sees  it  as  its  role  to  provide 
distinct  leadership  or  because  society  expects  it, 
state-centric  governance  is  built  on  the  centrality  of 
the  state.  That  centrality  is  obviously  a  significant 
asset  in  terms  of  being  able  to  steer  and  control 
societal  development.  However,  it  tends  to  entail  a 
fairly  unidirectional  process  of  state-society 
exchange  whereby  the  state  executes  its  preferences 
on  society.  States  in  this  model  of  governance  eas¬ 
ily  become  self-referential  and  arrogant  vis-a-vis 
societal  actors  and  may  thus  be  unaware  or  oblivi¬ 
ous  of  societal  change. 

France  is  historically  seen  as  a  prime  example  of 
state-centric  governance  although  that  state  pre¬ 
dominance  is  today  somewhat  less  articulated.  We 
also  find  examples  of  state-centric  governance  in 
many  Latin  American  countries  such  as  Argentina 
(not  least  under  Juan  Peron)  and  Uruguay.  Note 
that  state-centric  governance  does  not  correlate 
with  the  existence  of  leftist  political  regimes.  The 
legitimacy  of  state  encroachment  on  society  stems 
from  a  symbiotic  notion  of  the  state  embodying 
the  people  and  catering  to  its  interests  like  parents 
look  after  their  children.  In  this  model  of  gover¬ 
nance,  nationalism  plays  a  significant  role  and  it  is 
legitimate,  and  expected,  of  the  government  to 
intervene  in  markets  if  it  can  be  justified  with  refer¬ 
ence  to  interests  of  the  state. 

Liberal-Democratic  Governance 

The  previous  model  of  governance  is  a  relatively 
rare  phenomenon  in  time  and  space.  Liberal- 
democratic  governance  is  a  much  more  common 
creature.  The  Anglo-American  democracies  and 
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the  continental  European  countries  belong  to  this 
category  as  do  some  of  the  Asian  countries,  par¬ 
ticularly  with  regard  to  their  post-2000  model 
of  governance.  Thus,  although  the  fundamental 
features  of  the  governance  model  cover  a  fairly 
narrow  section  of  the  spectrum  of  different  gover¬ 
nance  models,  we  should  expect  to  find  a  fair 
degree  of  empirical  heterogeneity  as  we  inspect  this 
model  in  more  detail. 

In  terms  of  democratic  theory,  liberal-democratic 
governance  rests  on  a  notion  of  a  distinct  separa¬ 
tion  between  state  and  society.  The  state  is  not 
expected  to  interfere  in  markets  and  to  honor  val¬ 
ues  of  individual  freedom  and  personal  integrity. 
However,  the  state  is  expected  to  play  a  key  role  in 
the  regulation  of  markets  by  upholding  a  regulatory 
framework  for  markets  and  ensuring  free  competi¬ 
tion.  Thus,  in  the  spirit  of  Adam  Smith,  the  “regula¬ 
tory  state”  model  sees  the  government  enforcing 
antitrust  legislation  to  protect  markets  from  cartels 
and  other  forms  of  improper  market  behavior. 
Thus,  there  is  an  emphasis  of  regulation  o/The  mar¬ 
ket  and  an  equally  strong  normative  belief  that 
government  should  regulate  within  markets.  In  the 
parlance  of  economic  theory,  it  is  the  role  of  the 
state  in  this  governance  model  to  prevent  market 
failure.  It  should  not,  however,  pursue  political 
objectives  through  regulation  within  markets. 

The  state  in  liberal-democratic  governance  is 
more  deeply  embedded  in  society  than  state-centric 
governance,  yet  it  is  able  to  maintain  some  degree 
of  institutional  integrity  in  relationship  to  policy 
targets  and  other  social  interests.  The  state  is  not 
alien  to  collaborative  forms  of  governance  but 
insists  on  playing  the  role  of  a  primus  inter  pares  in 
those  arrangements.  That  means  that  government 
will  consult  with  social  partners  in  policy  making 
but  will  reserve  the  right  to  make  the  final  deci¬ 
sions  on  policy  design.  Thus,  in  some  ways  liberal- 
democratic  governance  could  be  just  as  intrusive 
on  society  as  state-centric  governance,  but  the 
mechanisms  and  processes  sustaining  that  capacity 
of  the  state  are  less  derived  from  institutional 
strength  and  more  a  result  of  social  bargaining. 

The  different  forms  of  corporatism  that  have 
been  a  trademark  of  state-society  exchanges  in 
several  European  countries  and  also  in  Japan  fit 
nicely  into  this  model  of  governance.  This  applies 
in  particular  to  the  tripartite  arrangements  between 
the  state,  private  capital,  and  labor  where  the  state 


can  be  both  an  arbitrator  as  well  as  a  negotiating 
party.  The  significance  of  these  corporatist  arrange¬ 
ments  has  declined  in  several  countries  remains 
strong  in,  for  instance,  Denmark. 

Network  Governance 

Some  time  ago  now,  Rod  Rhodes  gave  new 
momentum  to  the  emerging  debate  on  contempo¬ 
rary  governance  by  arguing  that  policy  networks 
were  the  key  instrument  of  governance,  hence 
governance  of  policy  sectors  saw  government  as  one 
of  a  large  number  of  actors.  Indeed,  cohesive  policy 
networks  had  the  capacity  to  resist  government 
policy  and  were  basically  capable  of  governing  the 
sector  themselves.  Coupled  with  the  growing  litera¬ 
ture  on  the  governance  problems  that  stemmed 
from  the  increasing  social  complexity  and  multilevel 
institutional  arrangements  of  the  political  and 
administrative  sphere,  networks  and  other  interac¬ 
tive  forms  of  governance  became  the  key  focus  of  a 
large  group  of  governance  scholars. 

Network  governance  scholars  deliver  a  powerful 
argument  about  the  lack  of  institutional  capacity 
and  integrity  of  the  state.  Short  of  those  features, 
segments  of  the  state  embark  in  a  range  of  different 
interactive  arenas  where  collective  solutions  are 
designed  and  implemented  in  concert  with  social 
partners.  Policy  becomes  reflective  of  the  contextu¬ 
alized  process  from  which  it  evolves;  interactive 
governance  forms  have  difficulties  with  imposing 
unpopular  policies  on  those  social  actors  and  col¬ 
lective  action  becomes  to  a  large  extent  defined  by 
the  least  common  denominator  among  the  network 
participants.  That  said,  network  governance  pro¬ 
vides  public  officials  with  a  multitude  of  social 
contacts  that  ensures  up-to-date  information  about 
social  change.  Network  governance  therefore  pro¬ 
vides  quicker  and  more  effective  linkages  between 
changing  societal  problems  and  a  political  response 
to  those  changes. 

It  is  difficult  to  think  of  any  national  context 
where  networks  do  not  exist.  The  intriguing 
question  is  how  such  forms  of  governance  are 
integrated  with  more  traditional,  hierarchical 
governance  and  administrative  arrangements  and 
what  the  consequences  are  of  blending  different 
governance  models.  Network  governance  is  pred¬ 
icated  on  substantive  autonomy  and  empower¬ 
ment  for  lower-level  civil  servants  so  that  they 
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can  engage  in  meaningful  dialogue  with  social 
partners.  Thus,  network  governance  challenges 
traditional  top-down  channels  of  communication 
in  public  sector  organizations,  both  in  terms  of 
policy  implementation  and  in  bringing  information 
from  networks  to  the  attention  of  the  senior  levels 
of  the  organization. 

Governance  as  Institutionalization 
of  Policy  Choices 

Following  these  discussions  about  dimensions  and 
models  of  governance,  this  entry  now  presents  a 
more  elaborated  account  of  different  stylized  mod¬ 
els  of  governance  with  regard  to  how  they  score  on 
the  overarching  dimensions  of  governance. 

It  is  particularly  challenging  to  look  at  the 
dynamic  nature  of  these  interactions.  The  dimen¬ 
sions  of  governance  described  earlier  could  easily 
produce  a  functionalist  analysis  of  the  roles  of  gov¬ 
ernment  in  governance  with  all  that  entails  in  terms 
of  static  modeling.  However,  thinking  about  how 
government  resolves  these  issues  and  tasks  in  a  gov¬ 
ernance  perspective  yields  a  highly  dynamic  account 
of  the  role  of  the  state  since  the  core  assumption  in 
governance  theory  is  that  collective  objectives  are 
pursued  in  a  variety  of  different  concerted  and  col¬ 
laborative  process  between  state  and  society.  The 
key  assumption  being  explored  in  this  analysis  is 
that  the  degree  to  which  political  actors  and  institu¬ 
tions  shape  the  governance  process  determines 
much  of  how  specific  governance  arrangements 
relate  to  the  three  dimensions  of  governance. 

State-Centric  Governance 

With  the  leading  role  of  the  state  in  this  model 
of  governance,  we  would  expect  that  its  institutions 
enjoy  a  high  degree  of  autonomy  in  relationship  to 
societal  actors  and  interests.  Since  those  institu¬ 
tions  have  very  little  collaborative  interaction  with 
societal  actors,  there  is  little  incentive  for  them  to 
make  concessions  on  policy.  Also,  given  the  cen¬ 
trality  of  the  state  and  its  insistence  on  “going  it 
alone”  in  terms  of  policy  implementation  and  ser¬ 
vice  delivery,  government  institutions  will  have  the 
administrative  and  financial  resources  required  to 
play  those  roles. 

State-centric  governance  is  also  likely  to  display  a 
priority  for  national  interests  over  local  autonomy. 


It  is  interesting  to  note  that  many  of  the  empirical 
contexts  that  we  associate  with  state-centric  gover¬ 
nance,  again  with  France  as  a  leading  example, 
have  weak  and  fragmented  local  government  sys¬ 
tems.  This  is  obviously  not  a  feature  only  of  coun¬ 
tries  with  state-centric  governance — consider  for 
the  sake  of  comparison  the  equally  fragmented 
local  government  system  in  the  United  States — but 
it  makes  sense  to  stipulate  that  state-centric  socie¬ 
ties  do  not  develop  strong  subnational  government 
as  that  could  create  or  exacerbate  central-local 
conflict. 

State-centric  governance  easily  translates  into 
strong  regulation  of  markets.  This  is  because  these 
states  have  a  path-dependent  political  and  admin¬ 
istrative  behavior  according  to  which  they  rarely 
yield  to  the  interests  of  societal  actors.  The  state 
defines  its  interests  and  objectives  in  the  market 
and  implements  whatever  regulation  is  required 
toward  those  ends. 

All  these  features  of  state-centric  governance — 
strong,  insulated  institutions,  central  government 
control,  tight  market  regulation,  and  so  on — are  in 
fact  reflections  of  society  as  much  as  of  the  state.  It 
is  very  difficult  to  think  of  a  democratic  state 
ascending  to  such  societal  predominance  without 
the  consent  and  indeed  expectation  from  society. 
While  this  is  a  governance  model  that  may  appear 
to  be  highly  obtrusive  and  insensitive  to  societal 
actors,  a  closer  inspection  of  specific  governance 
processes  indicates  that  coercive  policy  instru¬ 
ments  are  not  always  necessary  for  the  state  to 
control  society;  societal  actors  anticipate  the  pref¬ 
erences  of  government  and  act  accordingly.  A  case 
in  point  is  the  Japanese  developmental  state,  which 
throughout  much  of  the  postwar  period  was  a  case 
of  state-centric  governance.  Governance  was  coor¬ 
dinated  within  the  tripod  of  the  Liberal-Democratic 
Party,  the  bureaucracy,  and  leading  corporate 
players.  The  chief  ministry  driving  the  economic 
modernization  programs  could  ensure  compliance 
for  its  plans  by  formulating  “visions”;  it  rarely  had 
to  employ  its  institutional  capacity  to  ensure  cor¬ 
porate  compliance  with  its  policy,  according  to 
Johnson  (1982). 

However,  as  we  look  at  the  dimensions  of  gover¬ 
nance,  we  can  see  that  the  case  of  Japan  offers  some 
intriguing  patterns.  For  instance,  although  national 
interests  were  paramount  through  the  history  of 
Japan,  not  least  during  the  postwar  economic 
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recovery,  and  although  central-local  relationships 
have  been  predominantly  hierarchical,  the  relation¬ 
ship  between  central  and  subnational  government 
has  been  replete  with  ideological  conflict.  Central 
government  actively  promoted  the  ascension  of 
Tokyo  to  a  global  city  status,  triggering  rivalries 
with  several  other  metropolitan  areas.  National 
interests  have  translated  into  Tokyo  interests,  argu¬ 
ably  at  the  expense  of  other  cities. 

The  paradox  of  state-centric  governance  is  thus 
that  although  the  state  has  all  the  instruments  and 
resources  necessary  to  impose  a  strict  regime  on 
society,  it  rarely  needs  to  use  all  that  muscle  as 
societal  actors  readily  submit  to  more  subtle  forms 
of  governing.  Society  expects  the  state  to  play  a 
leading  role  in  governance.  When  we  bring  state- 
society  exchanges  that  are  integral  to  governance 
theory  into  the  analysis,  it  becomes  clear  that  the 
more  formal  power  bases  that  a  state  controls,  the 
less  it  will  have  to  use  those  forces. 

The  governance  approach  to  these  issues  also 
suggests  that  states  remain  strong  but  that  the 
sources  of  that  strength  have  changed.  State-centric 
governance,  embedded  in  norms  that  legitimized 
such  governance  and  prescribed  long-term  policy 
goals,  defined  overarching  policy  choices  that  con¬ 
tinued  to  shape  public  policy  for  decades.  Economic 
development  has  seen  a  distinct  role  for  the  state 
either  as  a  coordinator  of  private  business  or  as  a 
leading  player  in  research  and  development  or  as 
safeguard  against  overseas  competition  or  all  of  the 
above.  State-centrism  has  also  had  a  major  influ¬ 
ence  on  the  policy  instruments  typically  employed 
in  policy  implementation.  Again,  this  rather  self- 
referential  modus  operandi  proved  to  be  a  signifi¬ 
cant  problem  in  times  of  rapid  external  change 
such  as  when  globalization  caught  momentum, 
opening  up  society  to  new  political,  financial,  and 
cultural  influences. 

Liberal-Democratic  Governance 

If  state-centric  governance  presents  a  pattern 
of  a  state  that  enjoys  extensive  capabilities  to 
impose  its  will  on  society  but  rarely  has  to  use 
that  leverage,  an  almost  opposite  pattern  is 
found  in  liberal-democratic  governance.  In  this 
governance  model,  the  state  has  some  political 
and  institutional  capabilities  but  the  exercise  of 
those  power  bases  is  more  contested  than  is  the 


case  in  state-centric  governance.  Thus,  this  gov¬ 
ernance  model  displays  a  medium  level  of  institu¬ 
tional  autonomy  and  an  institutional  capacity 
that  is  more  contextualized  than  in  the  previous 
model. 

The  portrayal  of  the  state  in  liberal-democratic 
theory  is  an  idealized  account  of  the  state  as  dis¬ 
tinctly  separate  from  society.  It  could  be  seen  as  an 
extension  of  the  Wilsonian  model  of  public  admin¬ 
istration  as  firmly  separated  from  the  political 
sphere  of  the  state.  That  model  has  lost  much  of  its 
empirical  accuracy  in  contemporary  public  admin¬ 
istration  and  a  similar  development  can  be  seen 
with  regard  to  the  separation  of  state  and  society. 
Migdal  (2001)  describes  this  development  of  the 
state  nicely: 

Various  parts  or  fragments  of  the  state  have 
allied  with  one  another,  as  well  as  with  groups 
outside,  to  further  their  goals.  Those  practices 
and  alliances  have  acted  to  promote  a  variety  of 
sets  of  rules,  often  quite  distinct  from  those  set 
out  in  the  state’s  own  official  laws  and  regulations. 
These  alliances,  coalitions,  or  networks  have 
neutralized  the  sharp  territorial  and  social 
boundary  that  the  first  portrayal  of  the  state  has 
acted  to  establish,  as  well  as  the  demarcation 
between  the  state  as  a  pre-eminent  rule  maker 
and  society  as  the  recipient  of  those  rules,  (p.  20) 

With  the  border  between  state  and  society  thus 
compromised  by  parochial  interests  inside  and  out¬ 
side  the  state,  governance  theory  offers  a  new  and 
promising  approach  to  understanding  the  process 
of  governing.  Indeed,  Migdal’s  account  of  the  con¬ 
temporary  liberal-democratic  state  is  an  apt  illus¬ 
tration  of  the  contemporary  liberal-democratic 
state,  involved  in  collaborative  governance  and  a 
rule  taker  almost  as  much  as  a  rule  maker.  It  might 
be  argued  that  the  state-society  division  that  this 
model  of  governance  departs  from  has  become  a 
normative  myth.  This  myth  may  at  some  previous 
time  have  defined  what  constitutes  proper  exchanges 
between  the  state  and  its  external  environment,  but 
given  the  complexities  of  contemporary  governance 
the  state-society  separation  has  become  impossible 
to  sustain.  That  having  been  said,  normative  mod¬ 
els  still  matter.  As  the  rules  of  the  political  game, 
normative  models  provide  definitions  of  roles  for 
political  actors  and  institutions.  Those  norms  are 
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also  the  foundation  on  which  constitutions  are 
written  and  amended. 

In  the  liberal-democratic  governance  model,  the 
state  itself  has  moderate  institutional  capacity  but 
makes  up  for  that  through  collaboration  with 
social  partners.  This  can  take  place  through  corpo- 
ratist  models  of  interest  representation  and  media¬ 
tion  but  it  can  also  be  generated  through  more  ad 
hoc  models  of  joint  ventures  such  as  public-private 
partnerships.  Also,  states  in  liberal-democratic 
governance  tend  to  define  their  chief  role  as  a 
“regulatory  state,”  providing  market  stability 
without  necessarily  being  committed  to  extensive 
service  delivery. 

It  is  interesting  to  note  that  liberal-democratic 
governance  would  encompass  both  liberal  and 
coordinated  market  economies.  Government  plays 
a  distinctly  different  role  in  those  two  models  of  the 
capitalist  economy,  with  a  higher  profile  and  more 
accentuated  political  presence  in  the  market  in  the 
case  of  the  coordinated  market  economy.  According 
to  Hall  and  Soskice  (2001),  there  is  also  more 
legitimacy  for  articulating  collective  interests  in 
economic  development  in  the  coordinated  market 
economy  than  in  the  liberal  market  economy  where 
government  is  expected  to  provide  a  regulatory 
framework  for  the  economy  but  not  to  itself  be  a 
player  in  the  market.  Corporatism  is  clearly  related 
to  the  coordinated  market  economy.  This  stronger 
legitimacy  for  market  intervention  in  the  coordi¬ 
nated  market  economy  links  it  strongly  to  welfare 
state  models  and  to  overarching  policy  choices  that 
give  a  higher  priority  to  distributive  or  redistribu¬ 
tive  types  of  policy  than  to  promoting  economic 
growth. 

Network  Governance 

This  model  of  governance  could  be  summa¬ 
rized  as  characterized  by  low  institutional  auton¬ 
omy;  its  institutional  capacity  is  to  a  large  extent 
contingent  on  societal  consent  and  a  low  degree 
of  regulation  or  even  self-regulation.  In  a  model 
of  governance  where  concerted  action  is  the  typi¬ 
cal  strategy  to  govern,  the  issue  of  institutional 
autonomy  becomes  moot  as  institutions  do  not 
require  autonomy  to  engage  social  partners. 
Indeed,  such  autonomy  could  be  an  obstacle  to 
creative  problem  solving  since  it  suggests  that 
there  are  rules  defining  the  forms  and  conditions 


for  institutions’  exchange  with  their  external 
environment. 

The  institutional  capacity  in  network  gover¬ 
nance  is  limited;  it  is  part  of  the  logic  of  network 
governance  that  action  is  based  on  consent  and 
therefore  institutions  can  only  implement  those 
programs  that  other  actors  in  the  network  agree 
to,  which  frequently  comprises  important  policy 
targets.  That  having  been  said,  however,  when 
there  is  consensus  for  the  execution  of  a  policy, 
implementation  is  likely  to  be  swift  and  efficient 
since  interaction  between  institutions  and  their 
social  partners  continues  throughout  the  imple¬ 
mentation  process. 

The  regulatory  capacity  of  networks  is  high 
within  the  network  itself  but  less  so  outside  the 
network.  In  network  governance  theory,  primary 
attention  is  on  networks  as  instruments  of  self¬ 
regulation  within  a  policy  sector.  In  that  perspec¬ 
tive,  networks  offer  regulation  that,  by  definition, 
is  reflective  of  the  interests  of  the  participants  in 
the  network. 

This  is  not  a  model  of  governance  that  defines 
large-scale  political  projects  such  as  welfare  states 
or  developmental  states.  It  lacks  the  social  inclu¬ 
sion  and  capacity  to  resolve  conflicts  between  the 
network  and  other  social  constituencies.  Rather,  it 
operates  within  the  parameters  set  by  long-term 
policy  objectives. 

Change  and  Resilience  in 
Models  of  Governance 

The  three  governance  models  discussed  in  this 
entry  are  manifestations  of  social  norms  and  val¬ 
ues  related  to  individual  versus  collective  action, 
state  versus  market,  and  proactive  versus  reactive 
policies.  They  become  institutionalized  in  the  sys¬ 
tem  of  government  so  that  state-centric  gover¬ 
nance  draws  on  extensive  institutional  capabilities 
and  a  strong  legitimacy  of  political  encroachment 
of  society,  or  in  liberal-democratic  governance 
where  there  is  a  search  for  a  balance  between 
political  capabilities  and  individual  and  societal 
freedom. 

These  norms  and  values  are  reinterpreted  over 
time.  What  constituted  a  free  market  in  the  early 
20th  century  was  different  from  what  is  today  con¬ 
sidered  a  free  market.  The  developmental  state  was 
the  Japanese  state’s  response  to  a  situation  of 
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national  urgency.  It  became  deeply  institutional¬ 
ized  in  the  Japanese  society  for  decades  but  was 
weakened  during  the  1980s  when  the  downsides 
to  aggressive  progrowth  policies  surfaced  and  the 
most  acute  problems  had  been  resolved.  The 
Scandinavian  welfare  state  enjoyed  massive  politi¬ 
cal  support  for  several  decades  but  has  lost  some 
of  that  support  along  with  growing  social  afflu¬ 
ence  and  increasing  tax  levels.  The  Great  Society  in 
the  United  States  had  a  shorter  life  span  and  was 
put  to  rest  with  the  election  of  Ronald  Reagan  as 
president  in  1980. 

Thus,  although  governance  models  are,  by  defi¬ 
nition,  institutionalized,  they  change  over  time.  It 
is  a  major  task  in  itself  to  account  for  what  drives 
those  changes.  Given  their  institutionalization,  re¬ 
defining  the  role  of  the  state  or  the  market  is  a 
slow  and  incremental  process.  One  type  of  change 
is  where  values  and  norms  and  expectations  on  the 
state  are  not  fundamentally  reassessed  but  instead 
rearticulated  and  given  a  new  meaning  in  a  new 
political  and  economic  landscape.  In  other  cases, 
new  governance  models  emerge  to  solve  new  types 
of  societal  problems.  Network  governance  could 
be  seen  as  a  response  to  growing  social  complexity 
and  a  lacking  capacity  or  knowledge  within  the 
state  to  effectively  and  appropriately  address  those 
problems.  If  the  state  cannot  solve  such  problems, 
actors  closer  to  the  source  of  problems  may  spring 
into  action  themselves. 

Another  driver  of  change  is  international  influ¬ 
ences  and  the  diffusion  of  strategies  and  instru¬ 
ments  for  problem  solving.  Globalization  has 
brought  with  it  seamless  and  instant  communica¬ 
tion  spreading  ideas  and  expertise  across  national 
and  jurisdictional  boundaries.  This  means  that 
models  of  problem  solving,  including  the  best 
agents  of  those  measures,  are  imported  and  imple¬ 
mented  in  another  national  or  local  context,  some¬ 
thing  that  in  turn  triggers  a  reassessment  of  which 
actors  are  best  equipped  for  such  action. 

Conclusion 

We  should  not  expect  broad  definitions  of  models 
of  governance  to  provide  clues  about  policy  choices; 
as  any  textbook  in  public  policy  will  testify,  policy 
evolves  through  complex  and  contextualized  pro¬ 
cesses.  More  important  than  promoting  specific 
policy  choices,  it  appears  as  if  the  governance 


models  are  manifestations  of  previous  choices  of 
policy  styles  and  the  role  of  government  in  solving 
collective  problems.  State-centric  governance  does 
not,  in  and  of  itself,  define  any  long-term  policy 
objectives.  However,  it  does  define  a  particular 
policy  style.  Also,  the  governance  models  define 
different  institutional  arrangements  in  the  state, 
which  in  turn  define  the  preconditions  for  broader 
policy  choices.  It  is,  to  give  an  example,  difficult  to 
think  of  extensive  welfare  state  policies  managed 
overwhelmingly  through  network  governance 
arrangements. 

This  brings  us  back  to  a  comment  made  earlier 
in  this  entry  about  causal  mechanisms.  Do  over¬ 
arching  policy  choices  such  as  welfare  states  or 
developmental  states  lead  to  the  emergence  of  a 
particular  model  of  governance,  or  is  it  rather  the 
case  that  the  entrenched  governance  model  that  is 
typical  to  a  country  is  conducive  to  a  particular  set 
of  broad  policy  goals?  A  tentative  and  preliminary 
answer  to  these  questions  is  that  there  could  well 
be  causality  in  both  directions,  that  is,  processes  of 
mutual  reinforcement  of  values  and  norms  through 
which  one  fosters  the  other.  Megapolicy  choices 
such  as  between  a  developmental  state  and  a  wel¬ 
fare  state  must  by  necessity  include  the  creation  of 
an  institutional  system  equipped  to  coordinate  and 
implement  such  large  political  campaigns. 

Equally  important,  for  such  projects  to  be  suc¬ 
cessful,  they  must  draw  on  political  and  social 
sentiments  of  the  significance  of  such  projects  to 
the  country.  There  will  only  be  a  welfare  state  if 
dominant  social  constituencies  support  such  a 
massive  undertaking  in  terms  of  government 
effort  and  treasury,  just  as  a  developmental  state 
requires  all  political,  administrative,  and  corpo¬ 
rate  actors  to  commit  themselves  to  such  a 
national  priority.  Such  commitment  is  reflective 
of  fundamental  social  values  such  as  national 
security  or,  in  the  case  of  the  welfare  state,  social 
justice.  Thus,  social  norms  are  directly  or  indi¬ 
rectly  instrumental  in  fostering  both  overarching 
policy  choices  and  predominant  governance 
models. 

The  analysis  presented  here  represents  the  first 
few  steps  in  what  appears  to  be  a  rather  long  jour¬ 
ney  toward  a  better  understanding  of  why  and 
how  different  national  contexts  or  what  Castles 
called  “families  of  nations”  develop  different  mod¬ 
els  of  governance.  This  entry  explored  only  a  few 
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conceivable  explanatory  factors.  It  is  likely  that 
state  traditions,  the  trajectory  of  state  building, 
political  culture,  and  national  prosperity  provide 
important  additional  clues  to  answering  those 
questions. 

Jon  Pierre 
University  of  Goteborg 
Goteborg,  Sweden 
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Governance,  Administration 
Policies 


Some  decades  ago,  in  the  early  1950s,  Harold 
Lasswell  defined  politics  as  “who  gets  what.”  This 
simple  but  powerful  definition  emphasizes  the 
fundamental  point  that  virtually  all  political  action 
is  directed  toward  using  the  power  of  the  public 
sector  to  allocate  goods  and  services  among  the 
members  of  a  society.  Some  politics  is  symbolic 
and  emotive,  but  to  some  extent  even  those  com¬ 
ponents  of  political  action  may  help  legitimate  the 
political  system  and  the  decisions  it  must  make 
about  allocations  of  costs  and  benefits  among 
citizens.  Citizens  may  be  willing  to  accept  alloca¬ 
tions  with  which  they  do  not  agree  in  part  because 
of  their  emotional  attachment  to  their  government 
and  the  legitimacy  associated  with  that  attach¬ 
ment.  This  entry  examines  basic  questions  about 
making  and  implementing  public  policy  in  con¬ 
temporary  political  systems. 

Perhaps  the  broadest  possible  manner  in  which 
to  consider  this  process  of  allocating  goods  and 
services  through  the  public  sector  is  to  employ  the 
concept  of  governance.  As  explained  more  fully 
below,  the  meaning  of  governance  is  contested 
rather  vigorously  in  the  academic  community,  but 
the  fundamental  notion  of  governance  is  that 
there  is  a  need  for  some  form  of  collective  decision 
making  to  steer  the  society.  This  functionalist  con¬ 
ception  of  governance  emphasizes  that  although 
the  market  may  be  an  efficient  mechanism  for 
making  individual  choices,  there  is  a  need  to  make 
some  decisions  for  the  society  as  a  whole. 
Furthermore,  those  decisions  may  need  to  take 
into  account  a  range  of  human  values,  not  just 
economic  values,  and  also  attempt  to  accommo¬ 
date  influences  from  a  range  of  actors.  ( Value 
refers  to  everything  that  is  considered  to  have  a 
value,  be  it  material,  ideological,  ethical,  or  other, 
by  most  people/citizens.) 
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The  idea  of  governance  therefore  is  to  provide 
some  means  of  collective  steering  for  the  economy 
and  society,  and  the  word  is  derived  from  a  Greek 
word  for  steering  that  also  is  the  root  for  cybernet¬ 
ics.  This  steering  activity  is  usually  associated  with 
government  and  with  the  formal  institutions  of  the 
public  sector,  but  this  entry  presents  an  argument 
for  seeing  the  concept  as  neutral  about  the  manner 
in  which  steering  occurs,  as  suggested  by  Jon  Pierre 
and  Guy  Peters  in  2001.  Although  those  formal 
institutions  of  the  public  sector  continue  to  have  a 

role  in  governing,  social  actors - both  market  and 

societal — also  have  an  influence.  This  involvement 
of  social  actors  certainly  occurs  in  democratic  soci¬ 
eties  but  may  also  operate  in  less  democratic  sys¬ 
tems  that  have  sought  to  co-opt  those  social  actors 
as  part  of  their  legitimation  strategies. 

The  activity  that  we  generally  conceptualize  as 
public  policy  making  is  a  major  component  of 
governance.  Governments  exercise  that  steering 
function  mainly  by  making  policies  and  then  put¬ 
ting  them  into  action.  The  policy  choices  made 
through  the  state,  and  also  involving  actors  from 
the  civil  society,  are  directed  at  a  range  of  targets, 
involve  the  use  of  a  number  of  instruments,  and 
reflect  a  variety  of  political,  economic,  and  social 
values.  Furthermore,  the  process  through  which 
governments  make  policy  (and  again  this  typically 
is  done  with  the  involvement  of  other  actors)  is 
complex,  often  taking  years  to  complete,  and 
involves  a  large  number  of  actors,  many  of  whom 
may  be  able  to  exercise  a  virtual  veto  over  the 
actions  as  suggested  by  George  Tsebelis  in  2002. 

The  emphasis  on  who  gets  what  from  political 
action  is  sometimes  considered  to  be  technical  and 
apolitical,  focusing  on  economic  values  of  effi¬ 
ciency,  or  on  the  legalistic  application  of  the  law 
through  public  administration.  Public  policy  is, 
however,  anything  but  apolitical  and  reflects  the 
convergence  of  a  wide  array  of  political  forces,  all 
attempting  to  influence  the  policy  choices  in  ways 
that  will  benefit  them  and  their  constituents. 
According  to  Margaret  Hill  in  2005,  these  political 
forces  play  themselves  out  through  complex  pro¬ 
cesses  that  move  from  recognizing  policy  problems 
to  deciding  what  good  policy  would  be  and  then  to 
evaluating  the  consequences  of  public  action. 

Although  it  is  only  one  component  of  the  policy¬ 
making  process,  governance  depends  heavily  on  the 
implementation  of  policies  through  a  variety  of 


political  and  social  institutions,  notably  the  public 
bureaucracy.  Although  much  research  in  contem¬ 
porary  political  science  is  directed  at  organizations 
and  actors  involved  in  making  inputs  into  the 
political  system — through  voting,  political  parties, 
policy  networks,  legislatures,  and  so  on — the  public 
bureaucracy  remains  the  largest  component  of  the 
public  sector,  and  arguably  the  most  significant  for 
policy  choices.  Not  only  does  the  expertise  of  the 
public  bureaucracy  inform  policy  making,  but  the 
bureaucracy  is,  in  fact,  more  often  in  touch  with 
citizens  than  are  other  parts  of  the  public  sector. 

Following  the  implicit  outline  already  pre¬ 
sented,  this  entry  begins  with  the  most  general 
concept  of  governance,  moves  on  to  the  policy¬ 
making  process  as  a  major  component  of  public 
action  in  governing,  and  concludes  by  discussing 
public  administration  and  its  importance  in  imple¬ 
menting,  as  well  as  making,  public  policy.  All  these 
aspects  of  governing  are  necessarily  linked  in  the 
“real  world”  and  in  theory,  but  each  of  them  also 
must  be  considered  separately  to  understand  better 
how  the  three  interact.  The  three  perspectives  all 
combine  to  provide  a  picture  of  what  the  public 
sector  actually  does  and  how  it  affects  the  lives  of 
citizens. 

Studying  these  three  aspects  of  political  science 
involves  the  relationships  between  policy  and 
administrative  issues  that  occur  in  the  reality  of 
public  governance  and  in  the  analytic  and  theo¬ 
retical  approaches  that  social  scientists  employ  to 
interpret  and  understand  those  problems.  Although 
the  objective  problems  in  these  areas  have  consid¬ 
erable  practical  relevance,  the  role  of  theory  must 
also  be  emphasized.  This  emphasis  on  theory  is 
especially  important  given  that  some  of  the  same 
strands  of  theory,  for  example,  institutionalism 
and  bounded  rationality,  can  be  applied  across 
these  areas.  Therefore,  the  entry  is  a  tour  d’horizon 
designed  to  identify  the  principal  issues  and 
approaches  in  these  parts  of  the  discipline  of  polit¬ 
ical  science. 

Governance  and  Political  Science 

The  basic  idea  of  governance  has  been  part  of 
political  science  as  long  as  there  has  been  such  a 
discipline,  but  the  concept  has  become  more  cen¬ 
tral  to  studying  politics  over  the  past  several 
decades.  The  exact  date  and  source  of  the  revival 
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of  the  term  is  perhaps  not  important.  What  is 
more  important  is  that  the  discipline  has  been 
supplementing  its  general  concern  with  individual 
political  behavior  with  greater  concern  with  what 
the  political  processes  and  all  that  individual 
behavior  produce  by  working  through  institutions 
and  processes.  The  outcomes  of  governance  can 
be  conceptualized  to  include  not  only  policy 
choices  but  also  the  general  management  of  the 
political  and  social  systems.  These  outcomes 
would  therefore  include  the  capacity  to  involve 
members  of  the  public  in  the  governance  process, 
the  building  of  trust  and  social  capital,  and  what 
Canadians  might  recognize  as  the  maintenance  of 
“peace,  order  and  good  government,”  as  identi¬ 
fied  in  the  Constitution  Act  of  1867. 

At  its  most  basic  level,  governance  means  the 
capacity  to  steer  the  economy  and  society  toward 
collective  goals.  That  definition  does  not  privilege 
any  particular  actors  or  any  particular  institutions, 
but,  instead,  it  recognizes  the  functional  need  to 
provide  that  steering  and  then  attempts  to  under¬ 
stand  how  the  process  is  carried  out.  One  of  the 
most  important  distinguishing  features  between 
governance  and  other  forms  of  social  action,  for 
example,  markets,  is  the  notion  of  collective  goals 
and  their  centrality  in  steering.  Other  forms  of 
social  action,  for  example,  markets,  assume  that 
individual  goals  are  sufficient,  or  that  goals  emerge 
through  interaction,  but  a  governance  perspective 
assumes  that  goals  emerge  through  collective  choice 
of  some  sort. 

Governance  models  in  political  science  therefore 
are  at  their  heart  functionalist  explanations  for  the 
processes  of  governing  writ  large.  In  this  way,  the 
governance  approach  to  political  science  has  some 
affinity  with  the  structural-functionalist  approaches 
used  in  comparative  politics  during  the  1960s  and 
1970s.  In  those  approaches,  there  was  an  assump¬ 
tion  that  some  form  of  collective  choice  is  necessary, 
but  the  specific  actors  to  be  involved  in  making  that 
choice  were  not  identified,  and  the  variations  in  the 
way  in  which  decisions  were  made  were  the  loci  for 
comparison.  The  same  assumption  concerning  the 
need  for  collective  choice  is  made  about  gover¬ 
nance,  and  the  approach,  in  the  view  of  some 
political  scientists,  should  be  agnostic  about  how  it 
is  performed  and  about  what  activity  should  be 
subject  to  collective  control  and  what  is  better  left 
to  the  market  and  society. 


The  major  contrast  concerning  governance 
that  has  been  made  in  contemporary  political  sci¬ 
ence  is  between  government  and  governance.  The 
traditional  assumption  in  political  science  was 
that  governance  almost  inherently  was  the  appro¬ 
priate  province  of  the  institutions  within  the 
formal  public  sector.  To  the  extent  that  nongov¬ 
ernmental  actors  were  involved  in  the  process, 
they  were  to  different  degrees  suspect.  The  degree 
of  suspicion  concerning  nongovernmental  actors 
varied  with  political  cultures.  For  example,  the 
pluralist  traditions  in  Anglo-American  countries 
have  tended  to  assume  that  interest  groups  pursue 
“the  private  use  of  public  power.”  On  the  other 
hand,  however,  most  of  the  countries  of  Northern 
and  continental  Europe  have  some  tradition  of 
corporatist  involvement  of  private  sector  actors 
in  making  and  implementing  policy,  and  these 
countries  tend  not  to  denigrate  but  often  laud 
that  involvement. 

This  contrast  between  government  and  gover¬ 
nance  (see,  e.g.,  Seppo  Tiihonen,  2004)  has  emerged 
as  a  central  analytic  question  for  scholars  con¬ 
cerned  with  governance.  Many  scholars  advocat¬ 
ing  the  governance  perspective  have  argued  that 
the  formal  institutions  of  the  public  sector  have 
become  increasingly  incapable  of  providing  that 
scarce  commodity  called  governance.  The  formal 
organizations  of  the  public  sector  are  characterized 
as  bureaucratic,  clumsy,  and  incapable  of  coping 
with  the  rapidly  changing  societies  with  which  they 
are  confronted.  There  is  ample  evidence  of  policy 
failure  (for  a  variety  of  evidence,  see  Mark  Bovens, 
Paul  ’t  Hart,  &  Guy  Peters,  2001)  and  the  under¬ 
performance  of  the  public  sector  in  many  countries 
that  has  been  used  to  justify  the  position  that  gov¬ 
ernments  are  not  able  to  provide  governance. 

Critics  of  conventional  forms  of  government 
have  argued  that  as  well  as  being  weak  in  terms  of 
efficiency,  they  are  also  weak  in  democratic  terms. 
The  institutions  of  representative  democracy  appear 
unable  to  capture  the  enthusiasm  of  many  citizens, 
and  levels  of  voting  in  most  industrial  democracies 
have  been  decreasing.  Likewise,  membership  in 
political  parties  and  many  other  political  organiza¬ 
tions  appears  to  be  dropping  rather  inexorably. 
Much  of  the  successful  political  mobilization  that 
has  occurred  has  been  around  “flash  parties”  that 
are  developed  around  an  individual  or  around  a 
particular  political  issue.  The  advocates  of  the  “new 
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governance,”  therefore,  believe  that  providing  other 
loci  and  methods  of  political  participation  will 
make  public  action  more  democratic  as  well  as 
more  capable  of  solving  collective  problems. 

Even  if  citizens  were  engaged  in  the  mass  politics 
of  representative  democracy,  the  limited  capacity 
of  those  institutions  to  translate  their  preferences 
into  effective  policy  choices  is  clear.  Richard  Rose 
pointed  out  in  the  mid-1970s  that  even  in  majori- 
tarian  Westminster  systems,  voting  was  an 
extremely  blunt  instrument  to  use  to  control  policy 
choices.  In  addition,  the  divided-government  litera¬ 
ture  helps  explain  the  extent  to  which  coalition 
governments  further  complicate  democratic  con¬ 
trol  over  policy.  These  constraints  on  governance 
become  even  more  apparent  as  governments  move 
away  from  direct  ministerial  mechanisms  for  ser¬ 
vice  delivery  that  are  driven  more  by  market  and 
social  actors  than  by  political  controls. 

The  lack  of  agreement  about  what  the  term 
governance  actually  means  has  been  addressed  in 
part  by  adding  various  adjectives  to  the  basic  con¬ 
cept.  One  of  the  most  important  of  these  more 
circumscribed  terms  has  been  network  gover¬ 
nance  (Eva  Sorenson  &  Jacob  Torfing,  2007), 
and  indeed  a  good  deal  of  the  movement  toward 
governance  as  an  approach  to  political  life  has 
been  focused  on  the  capacity  of  networks  to  pro¬ 
vide  governance.  The  basic  conviction  of  network 
governance  is  that  self-organizing  networks  of 
social  actors  are  better  suited  to  coping  with  the 
complexity  of  contemporary  governing  demands 
than  are  hierarchical  mechanisms.  The  “new  gov¬ 
ernance”  of  networks  is  assumed  to  be  better  able 
to  cope  with  that  complexity  and  respond  to 
changing  social  conditions. 

Another  way  of  limiting  the  meaning  of  gover¬ 
nance  has  been  to  focus  on  “good  governance,” 
generally  meaning  the  reduction  of  corruption 
and  the  creation  of  greater  accountability,  espe¬ 
cially  in  less  developed  political  systems.  This  ver¬ 
sion  of  governance  has  been  dominant  in  much  of 
the  international  donor  community  that  is  seeking 
not  only  to  describe  governance  but  also  to 
improve  public  sector  performance.  Some  schol¬ 
ars,  however,  have  begun  to  focus  on  the  quality 
of  governance  in  a  more  general  vein,  attempting 
to  understand  what  features  of  political  systems 
may  be  associated  with  the  capacity  to  improve 
the  quality  of  life  of  citizens. 


The  linkages  between  institutional  and  behav¬ 
ioral  characteristics  of  political  systems  and  the 
quality  of  public  decisions  is  often  tenuous,  but 
establishing  that  linkage  can  both  help  develop 
understanding  governance  within  comparative 
politics  and  enhance  the  actual  performance  of  the 
public  sector.  There  is  as  yet  very  little  discussion 
of  the  design  of  governance  systems,  but  the  accu¬ 
mulation  of  evidence  may  provide  that  version  of 
“constitutional  engineering”  just  as  knowledge 
about  formal  public  sector  institutions  has  gener¬ 
ated  some  design  principles. 

Governance  has  been  discussed  primarily  at  the 
national  level,  but  the  concept  can  be  used  to  dis¬ 
cuss  steering  capacity  at  all  levels.  For  example, 
“global  governance”  has  been  used  to  describe 
steering  at  the  international  level.  At  this  level,  the 
state  often  ceases  to  be  the  major  actor  in  gover¬ 
nance,  much  as  national  governments  cease  to  be 
the  principal  actors  in  some  models  of  governance. 
Instead,  regimes  composed  of  nongovernmental 
actors  are  often  central  to  providing  some  control 
over  policy  areas  that  governments  may  not  be 
able  to  or  want  to.  Although  states  have  attempted 
to  exert  some  control  over  areas  such  as  climate 
control,  the  nongovernmental  actors  may  prove  to 
be  more  effective. 

Urban  areas,  as  well  as  other  subnational  lev¬ 
els,  also  require  governance,  and  these  regions 
experience  all  the  problems  of  mobilizing  consent 
and  exercising  control  over  policy  choices  that  the 
national  level  may  encounter.  Indeed,  networks 
and  other  forms  of  social  actors  may  be  more  sig¬ 
nificant  at  the  local  level  than  at  the  national  level, 
and  much  of  the  study  of  local  politics  has 
revolved  around  these  interactions  between  for¬ 
mal  and  informal  actors.  Perhaps  more  than  at 
other  levels  of  government,  urban  areas  have 
developed  complex  patterns  of  service  delivery 
that  correspond  to  the  governance  ideas  of  involv¬ 
ing  a  range  of  private  sector  actors  in  the  process 
of  steering. 

As  Ian  Bache  and  Matthew  Flinders  noted  in 
2004,  one  of  the  more  important  aspects  of  the 
development  of  governance  as  a  means  of  under¬ 
standing  comparative  politics  has  been  the  develop¬ 
ment  of  multilevel  governance.  While  students  of 
federalism,  such  as  Thomas  Heuglein  and  Alan 
Fenna  (2006),  and  of  intergovernmental  relations 
more  generally  might  consider  that  this  term  has  to 
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some  extent  reinvented  a  very  old  wheel,  the  prom¬ 
inence  of  the  concept  of  multilevel  governance  has 
emphasized  the  complexity  of  the  processes  through 
which  public  programs  are  made  and  implemented. 
The  basic  point  of  multilevel  governance  is  that 
most  attempts  to  govern  in  the  early  21st  century 
involve  interactions  among  multiple  levels  of  gov¬ 
ernment,  as  well  as  interactions  between  the  public 
and  private  sectors. 

The  multilevel  governance  approach  also  points 
to  the  extent  to  which  the  contemporary  state  has 
been  disaggregated  and  the  linkage  between  the 
structure  of  states  and  the  governance  being  pro¬ 
vided.  Christopher  Pollitt  and  Colin  Talbot  noted 
in  2004  that  as  the  public  sector  has  moved  a  large 
number  of  its  activities  to  agencies  and  quangos  or 
has  developed  mechanisms  such  as  public-private 
partnerships  to  deliver  services  more  efficiently, 
the  capacity  for  control  from  the  center  is  reduced. 
Liesbet  Hooghe  and  Gary  Marks  in  2003  pointed 
to  the  variable  consequences  of  the  different  mech¬ 
anisms  for  disaggregating  the  central  state,  but  the 
general  consequence  has  been  to  reduce  the  steer¬ 
ing  capacity  of  the  center  of  government. 

The  need  to  understand  reactions  to  disaggre¬ 
gation  and  the  empowering  of  nonstate  actors  in 
governance  has  led  to  consideration  of  the  capac¬ 
ity  to  restore  some  direction  from  the  center.  The 
concept  of  “metagovernance”  has  been  coined  to 
reflect  the  need  for  central  steering  the  context  of 
a  more  decentralized  and  devolved  world  of  polit¬ 
ical  action.  Metagovernance  is  the  governance  of 
governance,  or  the  attempt  to  provide  or  create 
some  coherence  out  of  very  decentered  policy  pro¬ 
cesses.  Metagovernance  does  not  imply  a  return  to 
the  command  and  control  style  of  governing  asso¬ 
ciated  with  the  hierarchical  state  but  rather 
reflects  an  attempt  to  steer  through  frames  and 
guidelines  instead  of  through  hierarchy  and  for¬ 
mal  rules. 

Metagovernance  also  represents  an  attempt  on 
the  part  of  many  political  leaders  to  reassert  the 
primacy  of  politics.  If  networks  and  other  forms  of 
devolved  political  action  are  increasingly  dominat¬ 
ing  political  life,  then  the  connection  between  vot¬ 
ing  and  policy  choices  is  becoming  more  attenu¬ 
ated.  Political  leaders  therefore  have  felt  the  need 
to  control  policy  without  necessarily  undermining 
the  efficiency  and  legitimation  gains  achieved  by 
using  the  devolved  and  marketized  methods  of 


governing.  In  some  cases,  these  attempts  at  control 
have  involved  overt  politicization  of  the  public 
service,  but  others  have  involved  softer  forms  of 
steering  and  control. 

Even  if  one  accepts  that  a  great  deal  of  the 
action  of  governing  has  been  shifted  away  from 
formal  institutions  in  the  public  sector  and  toward 
devolved  and  delegated  action,  it  is  important  to 
remember  that  this  activity  is  always  being  done  in 
the  “shadow  of  hierarchy,”  as  noted  by  Fritz 
Scharpf  (1997).  That  is,  if  the  market  or  social 
actors  being  given  grants  of  power  fail  to  perform 
adequately  in  making  and  implementing  policy, 
then  the  state  can  always  revoke  its  delegation  and 
resume  control  over  the  policy.  The  reassertion  of 
hierarchical  control  would  go  beyond  the  softer 
controls  associated  with  metagovernance  to  recap¬ 
ture  direct  control  of  policy  and  a  more  direct 
imposition  of  the  priorities  of  elected  politicians. 

At  the  same  time  that  governing  is  carried  on  in 
the  shadow  of  hierarchy,  to  some  extent,  more 
formal  patterns  of  governing  are  carried  on  in  the 
“shadow  of  governance.”  The  policy-making  pro¬ 
cesses  of  the  public  sector  do  fail  all  too  often; 
creating  policy  arrangements  that  rely  more  heav¬ 
ily  on  nonstate  actors  provide  political  leaders  an 
opportunity  to  correct  for,  or  perhaps  avoid,  those 
failures.  In  part,  the  capacity  to  move  activities 
that  may  fail  within  the  state  out  of  the  center  is  an 
important  means  of  blame  avoidance  for  politi¬ 
cians.  Using  market  and  nonstate  actors  can,  of 
course,  be  a  less  “political”  strategy  and  more  a 
reflection  of  the  capacity  of  those  institutions  to 
act  in  ways  that  the  public  sector  cannot — whether 
that  is  being  less  bureaucratic  in  the  pejorative 
sense  of  the  term,  being  able  to  respond  more  nim¬ 
bly  to  changing  conditions,  or  being  able  to  lever¬ 
age  human  and  financial  resources  that  are  out  of 
the  reach  of  government. 

In  summary,  the  governance  turn  in  political 
science  has  brought  the  discipline  back  to  some  of 
its  roots  in  thinking  about  the  capacity  of  the  pub¬ 
lic  sector  to  provide  coherent  direction  to  society. 
Clearly,  in  the  case  of  failed  states,  this  type  of 
governance  is  not  possible,  and  even  for  some 
more  successful  political  systems,  it  can  be  spo¬ 
radic  and  inconsistent.  Furthermore,  governance  is 
not  just  about  the  success  or  failure  of  the  formal 
public  sector  but  also  involves  the  capacity  of 
those  institutions  to  interact  successfully  with 
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social  actors  and  also  increasingly  the  capacity  to 
interact  with  the  global  environment. 

Public  Policy 

Public  policy  studies  are  the  second  major  body  of 
work  to  be  considered  when  thinking  about  the 
outputs  of  the  public  sector.  Most  political  scien¬ 
tists  do  not  think  of  themselves  as  “policy  scien¬ 
tists”  of  the  type  advocated  by  Harold  Lasswell  and 
Abraham  Kaplan  in  1950,  but  in  essence  they  are. 
The  individualistic  turn  in  political  science,  reflected 
in  both  behavioral  and  rational  choice  approaches 
to  the  discipline,  has  diverted  attention  from  the 
output  side  of  the  political  system  toward  aspects  of 
political  life  where  institutions  and  processes  are 
less  dominant  features.  For  many  political  scientists 
taking  those  approaches,  public  policy  is  better  left 
to  economists  or  to  professions  such  as  planning. 

Although  the  two  aspects  of  political  science — 
policy  and  individual  behavior — are  in  fact  more 
complementary  than  competitive,  they  tend  to 
ignore  each  other  and  to  press  the  primacy  of  one 
or  the  other.  This  is  most  unfortunate,  given,  that 
it  is  individuals,  working  singly  or  collectively, 
who  must  make  policy  decisions;  we  need  to  know 
more  about  how  they  make  those  decisions. 
Likewise,  public  policies  frame  a  good  deal  of  the 
political  action  by  individuals,  so  that  voting  may 
be  voting  for  or  against  specific  policy  choices. 

The  Policy  Process 

The  study  of  public  policy  in  political  science  has 
several  important  but  somewhat  disparate  dimen¬ 
sions.  The  largest  single  body  of  research  has  been 
on  the  policy  process,  notably  the  work  of  Charles 
O.  Jones.  This  approach  to  policy  is  inherently 
political,  arguing  that  the  policy  choices  made  by 
government  are  primarily  a  function  of  the  politi¬ 
cal  process  through  which  they  are  made.  The 
majority  of  the  process  models  attempt  to  explain 
policy  choices  by  understanding  the  actors  who  are 
involved  at  each  of  these  stages,  as  well  as  by 
understanding  linkages  among  the  stages.  These 
models  are,  however,  far  more  useful  as  heuristics 
for  describing  the  process  than  they  are  as  means 
of  explaining  outcomes. 

The  conventional  process  model  of  policy  mak¬ 
ing  begins  with  agenda  setting  or  problem  definition 


and  then  proceeds  through  a  series  of  steps  such  as 
program  design,  legitimation,  resource  attachment, 
and  evaluation;  typically,  it  did  not  have  any  strong 
assumptions  about  the  political  mechanisms  that 
were  used  to  manage  that  movement.  Although 
some  elements  within  the  general  model,  for  exam¬ 
ple,  agenda  setting,  relied  on  the  role  of  policy  entre¬ 
preneurs  to  provide  agency  within  the  process,  the 
general  models  tended  to  be  largely  devoid  of  that 
animation. 

Furthermore,  the  process  model  as  generally 
used  in  political  science  did  not  reflect  the  degree  of 
conflict  that  might  exist  in  these  crucial  political 
processes;  rather,  the  process  appeared  to  move 
along  rather  smoothly  to  its  conclusion.  One 
important  addition  to  process  models  that  did  con¬ 
tain  that  conflict  was  Paul  Sabatier  and  Hank 
Jenkins-Smith’s  (1993)  “advocacy-coalition  frame¬ 
work.”  This  model  was  concerned  explicitly  with 
policy  change  and  suggested  that  policy  change 
came  about  through  the  clash  of  ideas,  as  mani¬ 
fested  in  an  existing  policy  coalition  and  a  potential 
replacement.  These  conflicts  could  be  worked  out 
through  bargaining  processes,  often  generating  a 
new  synthesis  about  policy,  which  would  in  turn  be 
institutionalized,  setting  the  stage  for  the  next 
round  of  policy  change. 

In  addition  to  the  general  model  of  the  policy 
process,  there  has  been  a  substantial  degree  of 
theorizing  about  various  components  of  the  policy 
process.  In  particular,  political  scientists  have  been 
concerned  with  agenda  setting,  budgeting,  imple¬ 
mentation,  and,  at  one  point,  evaluation.  These 
process  considerations  have,  however,  often  been 
divorced  from  concerns  about  the  quality  of  poli¬ 
cies  being  adopted.  Each  of  these  process  areas  has 
produced  a  large  body  of  literature  that  has  influ¬ 
enced  more  general  studies  of  policy.  That  having 
been  said,  with  the  exception  of  the  study  of 
agenda  setting,  these  areas  of  research  have 
declined  in  political  science  to  some  extent,  reflect¬ 
ing  the  general  decline  in  the  importance  of  policy 
studies  in  the  discipline. 

Implementation 

The  first  of  these  components  of  the  policy¬ 
making  process  to  emerge  as  a  distinctive  field  for 
research  was  implementation.  Although  the  term  is 
hardly  new,  it  was  brought  to  center  stage  in  policy 
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studies  by  Jeffrey  Pressman  and  Aaron  Wildavsky 
in  1974.  They  focused  attention  on  the  difficulties 
of  taking  legislation  and  making  it  work  in  “the 
real  world”  and  especially  the  difficulties  in  mak¬ 
ing  it  work  in  the  manner  intended  by  the  framers 
of  the  legislation.  They  developed  this  analysis  by 
identifying  the  number  of  clearance  points  that 
had  to  be  passed  successfully  if  the  program  were 
to  go  into  effect.  While  their  analysis  of  the  diffi¬ 
culties  of  implementation  was  perhaps  excessively 
pessimistic,  it  did  help  frame  a  fundamental  policy 
problem  in  a  strong  analytic  model. 

The  Pressman  and  Wildavsky  framework  for 
understanding  implementation  is  a  top-down  per¬ 
spective  on  the  process,  while  the  process  may  also 
be  analyzed  from  a  bottom-up  perspective.  The 
bottom-up  perspective  assumes  that  programs 
might  be  better  designed  from  the  perspective  of 
the  prospective  of  implementation.  This  “back¬ 
ward-mapping”  perspective  assumes  that  programs 
might  ultimately  be  more  effective  if  their  final 
implementation  were  central  to  their  formation. 
Although  criticized  as  putting  implementation 
ahead  of  policy  goals,  this  approach  does  empha¬ 
size  the  key  role  of  implementation  in  making  pro¬ 
grams  actually  work. 

A  third  wave  of  implementation  research 
emphasized  quantitative  studies  of  the  success  and 
failure  of  implementation  efforts,  relying  heavily 
on  measures  that  were  in  many  ways  difficult  to 
sustain  in  the  face  of  the  complexity  of  the  imple¬ 
mentation  process.  If  anything,  the  complexity  of 
implementation  has  become  even  greater.  For 
example,  implementation  increasingly  involves 
multiple  actors  at  multiple  levels  of  government,  as 
described  by  Laurence  O’Toole  in  2004,  or  per¬ 
haps  also  actors  in  the  private  sector. 

Budgeting 

Wildavsky  was  also  central  to  the  emergence  of 
public  budgeting  as  an  area  of  concerted  research 
in  political  science  in  the  mid-1960s.  In  contrast  to 
economists  who  looked  at  the  public  budget  as  a 
search  for  a  Pigovian  optimal  allocation,  Wildavsky 
saw  budgeting  in  a  much  more  political  manner, 
emphasizing  the  importance  of  structure  and  pro¬ 
cess  in  producing  the  outcomes.  In  particular,  the 
repetitive  nature  of  budgeting  and  the  multiple 
organizations  involved  in  the  process  (at  least  in  the 


United  States)  tended  to  produce  incremental  out¬ 
comes.  There  was  also  some  individual  element 
involved  in  this  model  of  incrementalism,  given 
that  many  of  the  actors  involved  played  the  “game” 
year  after  year  and  their  memories  and  their  inter¬ 
personal  trust  also  influenced  the  outcomes. 

After  his  initial  insights  into  the  American  bud¬ 
getary  process,  Wildavsky  and  his  colleagues, 
Otto  Davis  and  Michael  Dempster,  made  a  num¬ 
ber  of  other  important  contributions  to  a  political 
understanding  of  budgeting.  First,  they  formalized 
and  tested  the  incremental  model  of  budgeting  in 
1966  and  demonstrated  that  there  was  a  great 
deal  of  predictability  in  the  decisions  made  for  the 
U.S.  federal  budget.  Furthermore,  Wildavsky  and 
colleagues  developed  alternative  models  that  could 
describe  budgetary  choices  in  other  types  of  polit¬ 
ical  systems.  Finally,  Hugh  Heclo  and  Wildavsky 
in  1974  provided  a  detailed  analysis  of  budgetary 
decision  making  in  the  British  Treasury,  again 
emphasizing  the  importance  of  trust  in  these 
deliberations. 

The  seminal  work  by  Wildavsky  sparked  a 
number  of  interesting  elaborations  of  budgeting 
and  critiques  of  the  incremental  model.  Unfor¬ 
tunately,  however,  that  interest  in  budgeting  has 
not  been  maintained,  even  given  its  centrality  in 
governing.  The  interest  in  budgeting  that  remains 
has  been  largely  on  the  part  of  rational  choice 
scholars  who  see  the  budget  as  a  logic  locus  for 
actors  to  seek  to  maximize  their  own  benefits  and 
to  pursue  interesting  strategies  in  order  to  do  so. 

As  Charles  Lindblom  (1965)  notes,  the  incre¬ 
mental  ideas  that  have  been  central  to  understand¬ 
ing  budgeting  in  the  United  States,  and  to  a  lesser 
extent  elsewhere,  are  related  to  more  general  ana¬ 
lytic  approaches  emphasizing  the  likelihood,  and 
the  desirability,  of  incremental  decision  making.  In 
this  view,  the  public  sector  will  produce  better 
governance  through  successive  limited  compari¬ 
sons  rather  than  through  attempting  to  make  a 
single,  correct,  and  comprehensive  decision.  While 
such  a  pattern  of  governing  can  be  condemned  as 
excessively  conservative,  the  logic  also  corre¬ 
sponded  well  to  the  American  political  style  from 
which  it  had  evolved. 

At  a  more  theoretical  level,  beginning  with 
Herbert  Simon’s  (1947)  work,  the  logic  of  bounded 
rationality  has  been  a  powerful  contender  in  the 
attempted  domination  of  more  rationalistic 
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approaches  to  policy,  and  to  politics  more  gener¬ 
ally.  In  this  view,  any  attempt  at  comprehensive 
rationality  in  public  decision  making  is  doomed  to 
failure,  so  the  more  appropriate  course  of  action  is 
to  make  a  series  of  smaller  decisions,  examining  the 
outcomes  of  those  decisions  in  turn  and  adjusting. 
Furthermore,  the  objective  of  the  policy-making 
exercise  is  better  understood  as  satisficing  rather 
than  maximizing  utilities,  so  that  all  that  decision 
makers  must  find  is  a  decision  that  is  good  enough 
rather  than  one  that  is  optimal. 

Agenda  Setting 

Agenda  setting  is  the  final  element  of  policy 
process  that  has  achieved  a  great  deal  of  promi¬ 
nence  in  the  discipline.  Roger  Cobb  and  Charles 
Elder  were  responsible  for  bringing  the  idea  of 
agenda  setting  to  a  central  place  in  the  study  of 
policy  in  the  early  1980s.  The  basic  argument  is,  of 
course,  that  nothing  can  be  done  about  a  problem 
in  society  unless  it  is  identified  and  transferred  to 
the  active  agenda  of  a  political  institution.  Thus, 
one  of  the  most  important  aspects  of  political 
mobilization  is  generating  awareness  of  the  issue 
and  commitment  on  the  part  of  politicians  to 
attempt  to  deal  with  it. 

Cobb  and  Elder’s  approach  to  agenda  setting 
was  largely  purposive,  but  later  approaches  to  this 
stage  of  policy  making  emphasized  the  almost  ran¬ 
dom  elements  of  the  process.  In  particular, 
Kingdon’s  (1995)  work  emphasized  the  need  for  a 
“policy  window”  to  open  before  an  issue  can  be 
put  on  the  agenda.  This  idea  is  not  dissimilar  to  the 
garbage  can  model  of  decision  making  proposed  in 
1972  by  Michael  Cohen,  James  March,  and  Johan 
Olsen  in  the  bounded  rationality  literature  men¬ 
tioned  earlier,  which  argues  that  decisions  are 
largely  the  product  of  the  fortunate  confluence  of 
opportunities  and  ideas  and  actors.  This  is,  of 
course,  in  marked  contrast  to  the  usual  rational 
assumptions  about  how  social  and  political  deci¬ 
sions  are  made. 

The  agenda-setting  literature  has  been  expanded 
significantly  by  Frank  Baumgartner  and  Bryan 
Jones’s  (1993)  discussion  of  patterns  of  movement 
of  issues  on  and  off  active  agendas.  They  used 
some  of  the  same  terminology  as  historical  institu¬ 
tionalism,  such  as  “punctuations”  to  emphasize 
the  rather  abrupt  changes  in  the  visibility  and  the 


salience  of  policy  problems.  Their  plotting  of  the 
dynamics  of  agenda  in  American  politics  has  gen¬ 
erated  a  large  new  area  of  inquiry  in  the  United 
States  and  in  other  countries  that  has  made  signifi¬ 
cant  progress  in  mapping  the  sources  of  policy 
change  and  the  political  foundations  of  policy 
choices. 

Policy  Causes  Politics 

A  second  strand  of  theorizing  about  policy  in 
political  science,  developed  or  initiated  by 
Theodore  J.  Lowi  (1972),  has  to  some  extent 
reversed  the  logic  of  the  process  model,  arguing 
that  policy  produces  politics.  This  model  involves 
the  assumption  that  four  fundamental  types  of 
policy  tend  to  be  associated  with  certain  types  of 
political  action.  They  are  distributive,  constituent, 
regulative,  and  redistributive.  For  example,  redis¬ 
tributive  policies  are  assumed  to  be  associated  with 
the  immediate  applicability  of  coercion  focused  on 
the  environment  of  conduct  rather  than  on  the 
conduct  itself.  In  1974  Robert  H.  Salisbury  and 
John  P.  Heinz  used  a  similar  set  of  policy  variables, 
although  relating  them  more  to  the  types  of 
demands  being  placed  on  the  policy-making  sys¬ 
tem  and  the  degree  of  fragmentation  of  that  policy¬ 
making  system. 

The  model  developed  by  Lowi  has  motivated  a 
great  deal  of  empirical  research  as  well  as  a  number 
of  theoretical  critiques.  Perhaps  the  most  funda¬ 
mental  contribution  has  been,  however,  to  make 
political  scientists  think  about  policies  in  more 
analytic  categories.  There  is  always  the  tendency  to 
think  about  policies  in  terms  of  the  names  one  finds 
on  government  buildings — agriculture,  defense, 
housing,  and  so  on.  That  manner  of  thinking  is  use¬ 
ful,  but  only  up  to  a  point,  and  it  may  mask  some 
of  the  internal  variations  in  individual  policy  areas. 

A  slightly  different  version  of  the  “policy  causes 
politics”  approach  is  the  argument  made  by  Gary 
Freeman  and  others  that  the  differences  across 
political  systems  are  not  as  great  as  the  differences 
across  policy  areas.  The  technical  and  political 
foundations  of  a  policy  area  such  as  health  may  be 
more  similar  across  countries  than  are  those  foun¬ 
dations  for  disparate  policy  areas  within  a  single 
system.  As  policy  ideas  spread  across  national 
boundaries,  these  common  aspects  of  policy  may 
become  more  similar.  Pressures  from  globalization 
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and  from  Europeanization  have  tended  to  homog¬ 
enize  policy  across  national  boundaries,  further 
reinforcing  Freeman’s  argument. 

Policy  Styles 

To  understand  public  policy,  it  is  important  to 
consider  the  policy  choices  made  by  any  one  coun¬ 
try  (or  subnational  unit).  It  is  also  important  to 
compare  those  choices  with  the  choices  made  by 
other  countries.  This  comparison  could  be  done  on 
a  policy-by-policy  basis,  as  described  for  the  wel¬ 
fare  state  below,  or  the  policy-making  style  of  the 
political  system  in  question  could  be  typified.  In 
this  case,  rather  than  policy  being  assumed  to 
cause  policy,  policy  is  used  to  characterize  the 
political  systems;  policy  becomes  the  lens  through 
which  to  compare  those  systems. 

Jeremy  Richardson  made  the  first  major  analysis 
of  policy  styles  in  Western  Europe,  arguing  that  the 
policy  styles  in  these  countries  could  be  character¬ 
ized  by  two  variables.  One  was  an  active-reactive 
dimension,  assessing  whether  a  county  attempted 
to  anticipate  policy  problems,  and,  if  so,  to  what 
extent,  or  simply  waited  until  the  problem  appeared. 
The  other  dimension  was  one  of  imposition 
versus  negotiation,  reflecting  in  part  patterns  of 
corporatist  interest  intermediation  in  some  political 
systems. 

Almost  all  the  research  on  policy  styles  has  been 
done  on  the  industrialized  democracies.  This  pat¬ 
tern  has  the  advantage  of  holding  some  important 
variables,  notably  wealth  and  democracy,  rela¬ 
tively  constant.  There  have  been  some  attempts  to 
assess  the  policy  styles  of  less  affluent  countries, 
generally  attempting  to  build  something  on  the 
order  of  a  model  of  policy  making  for  one  region 
or  another.  In  particular,  a  great  deal  of  work  has 
been  done  on  the  policy  styles  of  Asian  countries 
and  their  relationship  to  economic  growth. 

Policy  Instruments 

Implementation  studies  have  been  concerned 
with  the  ability  to  transform  stated  policies  into 
effective  action.  The  ability  to  make  the  difficult 
translation  from  law  to  action  involves  a  number 
of  components;  one  of  the  more  important 
elements  is  the  choice  of  a  policy  instrument,  or 


tool  (Christopher  Hood,  1976).  The  basic  idea  of 
policy  instruments  is  that  the  public  sector  can 
achieve  its  goals  using  a  variety  of  different  types 
of  programs — subsidies,  regulation,  vouchers,  and 
so  on — and  each  of  these  instruments  will  have 
distinct  substantive  and  political  characteristics 
that  will  influence  the  likely  success  of  the  program 
being  implemented. 

While  it  is  common  to  think  of  these  instru¬ 
ments  in  more  or  less  technical  terms,  it  is  crucial 
to  remember  that  they  do  have  a  strong  political 
dimension.  For  example,  as  the  politics  of  govern¬ 
ing  have  changed  to  cast  suspicion  on  direct  public 
sector  interventions,  many  of  the  conventional 
“command  and  control”  instruments  of  the  public 
sector,  such  as  regulation,  have  been  exchanged 
for  softer  instruments.  The  choice  of  policy  instru¬ 
ments  may  also  reflect  the  need  to  build  coalitions 
needed  to  get  legislation  adopted.  For  example, 
much  of  the  federal  support  for  university  students 
in  the  United  States  is  provided  through  private 
bank  loans  guaranteed  by  government.  While  it 
might  be  more  efficient  to  have  the  loans  managed 
through  the  universities,  it  was  important  to  create 
a  political  ally  in  the  banking  industry. 

The  study  of  policy  instruments  leads  to  a  num¬ 
ber  of  other,  even  more  difficult,  questions  about 
public  policy  and  the  possibilities  of  effective  public 
sector  intervention.  The  most  fundamental  issue 
that  the  choice  of  instruments  raises  is  the  possi¬ 
bilities  of  policy  design.  Do  policy  analysts,  whether 
in  political  science  or  not,  have  the  ability  to  design 
better  policies?  To  do  so  requires  some  understand¬ 
ing  of  the  causation  of  policy  problems,  the 
relationship  of  those  problems  to  the  instruments 
available  to  address  those  issues,  and  some  norma¬ 
tive  basis  to  evaluate  the  outcomes  of  the  policy 
intervention  (Pearl  Eliadas,  Margaret  Hill,  & 
Michael  Howlett,  2005).  The  design  issue  also 
raises  the  question  of  whether  the  simple  functional 
categories  we  generally  use  for  policy  are  sufficient 
to  capture  the  complexity  of  the  challenges  faced 
when  attempting  to  match  instruments  to  the 
underlying  dimensions  of  policy  problems. 

Discourse  and  Deliberation 

The  majority  of  thinking  about  public  policy 
has  used  a  strongly  objective  model,  attempting  to 
measure  policies  as  dependent  variables  and  then 
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finding  the  factors  that  explain  those  policy  choices. 
Likewise,  economic  and  utilitarian  values  tend  to 
undergird  much  of  this  analysis,  assuming  that 
policies  are  best  understood  in  terms  of  how  they 
improve  the  economic  well-being  of  the  objects  of 
the  policy.  One  strand  of  reasoning  in  political  sci¬ 
ence,  however,  argues  that  policy  is  best  under¬ 
stood  in  more  linguistic  terms.  The  assumption  of 
deliberative  models  as  presented  by  Frank  Fischer 
(2003)  is  that  the  discourse  surrounding  the  policy, 
and  the  conflicts  between  alternative  discourses, 
defines  the  ways  in  which  policy  choices  are  made. 
Of  course,  utilitarian  values  belong  to  a  discourse, 
but  they  are  only  one  among  many  possible  sets  of 
values  that  can  guide  policy  choices. 

Discourse  and  deliberative  models  are  often  best 
applied  in  areas  such  as  science  policy  and  some 
aspects  of  environmental  policy  where  there  are 
often  widely  diverging  understandings  of  the 
nature  of  the  policy.  Flerbert  Gottweiss,  for  exam¬ 
ple,  has  examined  the  deliberative  aspects  of 
research  on  human  biology,  and  there  have  been  a 
number  of  studies  of  environmental  policy  that 
apply  the  deliberative  approach  to  policy  analysis. 
These  studies  have  demonstrated  the  range  of  val¬ 
ues  that  may  be  involved  in  complex  policy  issues 
as  well  as  something  of  the  ways  in  which  those 
almost  inherent  conflicts  may  be  resolved. 

Welfare  State 

Although  there  are  many  policy  areas,  the  dis¬ 
cussion  will  focus  attention  on  the  development  of, 
and  then  the  challenges  to,  the  welfare  state.  This 
special  treatment  is  justified  because  not  only  have 
welfare  state  politics  and  policy  been  central  to  the 
legitimation  of  the  contemporary  political  systems 
in  Europe,  and  to  some  extent  the  rest  of  the 
world,  but  the  welfare  state  has  played  a  central 
role  in  theorizing  about  the  politics  of  public  pol¬ 
icy.  Furthermore,  the  study  of  the  welfare  state  has 
to  some  extent  integrated  the  study  of  policy,  com¬ 
parative  politics,  and  normative  political  theory. 

The  development  of  the  welfare  state  and  the 
various  forms  that  this  policy  regime  has  assumed 
has  been  a  rich  vein  of  research  and  theorizing. 
Gosta  Esping-Andersen’s  The  Three  Worlds  of 
Welfare  Capitalism  (1990)  is  perhaps  the  most 
famous  work  of  this  sort.  Esping-Andersen  identi¬ 
fied  three  types  of  welfare  states,  based  on  the 


extent  to  which  labor  was  commodified,  and  the 
welfare  system  was  seen  as  replacement  for  labor 
or  simply  as  more  of  a  right  of  citizenship. 
Although  the  Esping-Andersen  work  has  been  cen¬ 
tral  to  thinking  about  the  welfare  state,  a  number 
of  other  studies  have  attempted  to  explain  patterns 
of  development  and  differences  in  the  contempo¬ 
rary  systems.  For  example,  Peter  Flora  and  Arnold 
Heidenheimer  constructed  a  development  model  to 
explain  welfare  policy  choices,  taking  into  account 
explicitly  the  differences  between  Europe  and 
North  America. 

Although  the  welfare  state  project  has  been 
central  to  legitimating  the  states  of  Western 
Europe,  and  in  improving  the  quality  of  life  for 
most  citizens  of  those  countries,  the  model  for 
building  a  social  safety  net  for  all  members  of 
society  has  been  under  a  good  deal  of  pressure 
since  at  least  the  late  1970s.  The  oil  crisis  of  the 
1970s  brought  the  cost  of  these  programs  to 
greater  public  attention,  and  those  costs  have  only 
continued  to  increase.  Furthermore,  the  continu¬ 
ing  aging  of  the  societies  with  well-developed 
welfare  states  will  exacerbate  the  financial  prob¬ 
lems.  In  addition  to  the  manifest  fiscal  problems 
of  the  welfare  state,  this  model  of  governing  has 
been  the  source  of  large-scale  ideological  assaults 
driven  by  shifts  in  the  consensus  that  had  domi¬ 
nated  policy.  Those  assaults  came  from  both  the 
political  Right  and,  less  predictably,  from  the 
political  Left,  which  complained  about  the  deper¬ 
sonalization  and  bureaucracy  of  the  welfare  state. 

The  pressures  on  the  welfare  state  have  engen¬ 
dered  a  number  of  important  studies  that  have 
integrated  the  study  of  the  policies  themselves  with 
the  study  of  comparative  politics.  For  example, 
Paul  Pierson  and  his  collaborators  have  been  con¬ 
cerned  with  the  role  of  demographic  and  fiscal 
pressures  on  the  erosion,  or  at  least  recasting,  of 
many  welfare  state  programs,  even  in  the  countries 
that  have  been  among  the  most  committed  to  this 
style  of  governing.  Rather  than  being  a  simple  eco¬ 
nomic  response,  however,  these  transformations 
are  considered  in  a  very  political  light,  reflecting 
the  variety  of  pressures  being  exerted  on  govern¬ 
ment  and  the  internal  politics  of  organizations 
within  the  public  sector  itself. 

The  pressures  on  the  welfare  state  reflect  the 
confluence  of  political,  economic,  and  social 
change  and  the  impact  of  that  juncture  of  forces  on 
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a  set  of  very  successful  public  programs.  Somewhat 
ironically,  the  welfare  state  that  had  been  crucial 
for  legitimating  the  state  has  now  become  a  source 
of  delegitimation  in  many  political  systems.  In  an 
era  of  neoliberal  economics  and  politics,  these  pro¬ 
grams  no  longer  have  an  uncontested  place  in 
governing  but  yet  remain  crucial  for  the  well-being 
of  many  segments  of  society. 

Public  Administration 

The  third  body  of  literature  to  be  included  in  this 
entry,  public  administration,  has  been  a  compo¬ 
nent  of  public  sector  studies  virtually  since  that 
inquiry  began.  Governments  have  always  had  to 
have  some  means  of  implementing  their  programs 
(see  above),  and  the  public  bureaucracy  has  been 
central  to  that  role.  As  important  as  public  admin¬ 
istration  is  in  implementation,  however,  its  role  is 
not  limited  to  that  activity,  and  public  administra¬ 
tors  play  a  major  role  in  shaping  public  policy  and 
making  public  policy  directly  too.  Finally,  civil 
servants  also  play  a  role,  and  an  increasing  role,  in 
linking  the  public  sector  and  the  public,  so  that 
public  administration  has  become  a  central  locus 
for  emerging  forms  of  democratic  control  over 
governing  processes. 

Bureaucracy 

To  this  point,  this  entry  has  dealt  with  bureau¬ 
cracy  and  public  administration  as  virtually  identi¬ 
cal  concepts,  but  the  two  should  be  differentiated. 
Public  administration  is  the  more  general  term, 
while  bureaucracy  refers  to  a  particular  organiza¬ 
tional  form,  and  especially  to  the  ideal-type  model 
of  bureaucracy  developed  by  Max  Weber  in  1946. 
That  conceptual  distinction  does  not,  however, 
prevent  bureaucracy  being  used  as  a  word  of 
opprobrium  for  public  administration,  referring  to 
the  perceived  inefficiency  and  excessive  formality 
of  those  public  organizations. 

As  the  concept  was  developed  by  Weber, 
bureaucracy  was  meant  to  be  the  highest  level  of 
development  of  rational-legal  governance.  Rather 
than  oppressing  the  public,  as  is  now  often  argued, 
the  bureaucracy  should  provide  both  greater  equal¬ 
ity  and  greater  transparency  than  the  traditional 
forms  of  governance  that  it  would  replace.  The 
reliance  on  law,  the  separation  of  the  bureaucracy 


from  political  controls,  and  the  development  of 
files  were  assumed  to  allow  citizens  to  know  when 
decisions  were  being  made  in  their  case  and  to 
have  some  justification  for  those  decisions.  These 
patterns  would  be  markedly  different  from  the 
patrimonial  administrations  that  characterized 
many  governments  at  the  time  that  Weber  wrote 
and  that  also  characterize  many  governments  in 
the  early  21st  century. 

Although  the  Weberian  model  of  bureaucracy 
and  associated  model  of  the  state  were  dismissed 
almost  completely  by  reformers  stressing  markets 
(see  the  discussion  of  new  public  management 
[NPMJ  below)  and/or  social  networks  as  alterna¬ 
tives  to  the  hierarchy  of  bureaucracy,  the  bureau¬ 
cratic  model  has  made  something  of  a  comeback. 
Christopher  Pollitt  and  Geert  Bouckaert  (2004) 
have  argued  for  the  emergence  of  a  “neo-Weberian 
state”  that  would  reassert  some  of  the  values  of 
bureaucracy,  albeit  while  retaining  many  of  the 
reforms  that  have  produced  greater  efficiency  in 
the  public  sector.  In  particular,  the  neo-Weberian 
state  is  an  attempt  to  recapture  accountability  for 
service  delivery  instruments  that  have  become 
increasingly  divorced  from  controls  by  legislatures 
and  political  executives. 

The  New  Public  Management 

The  Weberian  style  of  bureaucracy  not  only 
produced  a  number  of  benefits  for  the  public  sec¬ 
tor  and  for  citizens,  but  it  was  also  the  focus  of 
intense  criticism.  Rigidity,  reliance  on  formal 
authority,  impersonality,  and  overspecialization 
were  but  a  few  of  the  numerous  criticisms  leveled 
at  formal  bureaucracy.  Beginning  from  the  end  of 
the  1970s,  NPM  (Hood,  1991)  posed  a  series  of 
challenges  to  the  bureaucratic  model  and,  to  some 
extent,  became  a  new  paradigm  for  public  admin¬ 
istration. 

NPM  has  been  defined  in  any  number  of  ways 
and  has  been  argued  to  be  both  the  salvation  and 
damnation  of  the  public  sector.  At  its  most  funda¬ 
mental,  NPM  implies  at  least  (a)  the  use  of  market- 
type  mechanisms  for  public  policy  and  for  public 
management  and  (b)  an  emphasis  on  the  role  of 
managers  in  the  public  sector,  often  using  the  man¬ 
tra  of  “let  the  managers  manage”  (see  Peters, 
2001).  To  this  core  set  of  ideas  in  NPM  might  be 
added  some  notion  of  the  citizen  as  an  important 
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actor  in  the  process,  although  citizens  are  more 
likely  to  be  conceptualized  as  customers  than  as 
full-fledged  citizens  participating  in  a  democratic, 
political  process. 

As  NPM  has  evolved,  the  style  of  governing  has 
become  one  of  making  the  managers  manage  rather 
than  merely  letting  them  manage.  Performance 
management  has  been  one  of  the  principal  instru¬ 
ments  for  that  change  in  emphasis.  The  measure¬ 
ment  of  the  outputs  of  individuals  and  programs, 
and  using  that  information  to  make  decisions 
about  budgets,  retention,  and  other  important  out¬ 
comes  for  the  organization  and  its  managers,  places 
pressure  on  those  managers  to  perform  as  well  as 
possible.  To  be  successful,  however,  that  perfor¬ 
mance  must  be  measured  well  and  there  must  be  a 
means  of  enforcing  the  results. 

Public  Personnel 

The  people  who  work  for  the  public  sector  are 
one  of  the  crucial  elements  explaining  the  relative 
success  or  failure  of  administration.  There  is  little 
disagreement  on  the  proposition  that  the  public 
should,  to  the  extent  possible,  employ  the  “best 
and  brightest.”  Indeed,  management  in  the  public 
sector  is  almost  certainly  more  difficult  than  in 
the  private  sector,  given  the  absence  of  clear  crite¬ 
ria  for  success  and  failure,  the  transparency  of 
most  of  the  operations,  and  the  absence  of  control 
over  many  aspects  of  the  personnel  and  budget 
systems. 

Public  personnel  management  raises  a  number 
of  highly  technical  questions  about  the  employ¬ 
ment,  training,  and  motivation  of  public  employ¬ 
ees,  but  there  are  several  crucial  political  questions 
as  well.  The  following  sections  focus  on  three 
political  concerns:  (1)  the  role  of  representative 
bureaucracy,  (2)  the  politicization  of  the  civil  ser¬ 
vice,  and  (3)  the  importance  of  understanding  the 
rewards  of  high  public  office,  whether  elected  or 
appointed. 

Representative  Bureaucracy 

Representative  bureaucracy  is  one  of  the  classic 
topics  in  public  administration.  The  term  was 
coined  toward  the  end  of  World  War  II  when  it 
became  clear  that  there  would  be  a  socialist  govern¬ 
ment  in  the  United  Kingdom  (UK)  at  the  end  of  the 


war  and  that  the  implementation  of  what  appeared 
to  be  a  rather  sweeping  set  of  policy  initiatives 
would  be  the  responsibility  of  a  civil  service  that 
was  in  cast  of  mind,  if  not  of  party  membership, 
conservative.  Could  the  new  government  expect 
support  from  this  civil  service,  or  would  its  social 
background  prevent  its  whole-hearted  support? 

The  question  of  to  what  extent  membership  of 
the  public  service  mirrors  the  nature  of  society  has 
both  empirical  and  normative  elements.  The  empir¬ 
ical  questions  require  identifying  the  extent  to 
which  the  members  of  the  civil  service  are  represen¬ 
tative  of  the  population,  and  especially  the  extent 
to  which  women  and  members  of  minority  groups 
are  included  in  government.  The  answers  to  these 
questions  vary  markedly  depending  on  what  level 
within  the  government  is  being  considered.  As  a 
whole,  the  public  sector  in  most  industrialized 
countries  does  represent  the  society  reasonably 
well,  and  often  is  more  than  half  female.  On  the 
other  hand,  at  the  higher  levels  of  the  public 
bureaucracy,  women,  and  especially  minority 
group  members,  tend  to  be  underrepresented. 

The  second  part  of  the  empirical  question  con¬ 
cerning  representative  bureaucracy  is  whether  it 
really  makes  any  difference.  One  version  of  the 
role  of  representation  is  a  passive  one,  in  which 
merely  having  women  and  minority  group  mem¬ 
bers  in  the  public  service  is  assumed  to  present  a 
positive  image  to  the  public  and  to  produce  better 
outcomes.  In  a  more  active  version  of  representa¬ 
tion,  those  minority  representatives  would  assume 
a  more  active  role  in  governing  and  in  attempting 
to  use  their  positions  to  redress  some  of  the 
inequalities  that  may  exist  in  the  social  and  eco¬ 
nomic  benefits  available  from  government. 

The  normative  questions  about  representative 
democracy  are  based  on  the  logic  that  the  public 
sector  should  be  a  “model  employer”  and  should 
demonstrate  that  more  equitable  hiring  practices 
can  be  effective.  They  are  further  based  on  the  logic 
of  attempting  to  use  the  public  sector  as  a  means  of 
addressing  long-standing  social  and  economic 
deprivations  experienced  by  social  groups,  with 
public  servants  adopting  a  more  active  representa¬ 
tional  role  in  that  process.  These  normative  stances 
may  not  be  as  effective  as  their  advocates  might 
like  them  to  be  in  improving  the  lot  of  members  of 
society,  but  they  do  express  some  commitment 
toward  social  justice  through  administration. 
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Politicization 

Another  important  political  question  about 
recruitment  of  public  officials  is  the  extent  to 
which  political  criteria  are,  and  should  be,  involved 
in  the  selection  of  public  officials.  The  logic  of 
bureaucracy  and  the  civil  service  is  that  public 
administrators  should  be  selected  on  the  basis  of 
merit  and  that  political  criteria  should  be  irrele¬ 
vant.  On  the  other  extreme,  however,  there  is  a 
perceived  need  by  politicians  to  have  advisors,  and 
to  some  extent  also  managers,  who  will  support 
their  programs. 

Although  the  norm  of  a  professional  and  depo- 
liticized  public  service  is  certainly  accepted  in  the 
industrialized  democracy,  it  is  also  under  attack. 
The  movements  toward  a  more  politicized  public 
service  are  generally  subtle  and  the  argument  in 
favor  of  the  movement  is  difficult  to  press  overtly, 
but  politicians  clearly  want  to  have  greater  control 
over  their  public  servants.  The  movement  toward 
greater  politicization  reflects  in  part  the  conse¬ 
quences  of  adopting  NPM,  with  many  programs 
being  moved  from  direct  ministerial  control. 
Furthermore,  contemporary  changes  in  personnel 
management,  reflecting  the  logic  of  NPM,  have 
made  it  easier  for  ministers  to  make  decisions 
about  hiring  that  are  not  done  strictly  on  merit 
criteria. 

For  many  of  the  less  developed  countries,  polit¬ 
icization  of  the  public  service  is  quite  common. 
The  failure  to  institutionalize  a  merit-based  civil 
service  reflects  the  importance  of  public  service 
employment  in  less  affluent  countries.  For  politi¬ 
cians  in  these  often  clientelistic  administrative 
systems,  the  ability  to  distribute  public  office  is  an 
important  means  of  building  a  political  power 
base.  In  addition,  political  leaders  may  perceive  the 
need  for  loyalty  from  their  civil  servants,  as  well  as 
commitment  to  particular  policy  agendas,  which  is 
crucial  for  their  vision  of  development  as  well  as 
their  political  future. 

Although  merit  recruitment  is  the  usual  stan¬ 
dard  for  a  “proper”  civil  service,  all  public  sectors 
have  some  place  for  political  recruitment.  At  one 
extreme  among  the  industrial  democracies,  the 
United  States  has  more  than  4,000  political  appoint¬ 
ments  in  positions  that  in  many  other  countries 
would  be  career  positions.  In  France  and  Belgium, 
there  is  also  large-scale  use  of  political  appoint¬ 
ments  in  ministerial  cabinets  that  advise  ministers 


and  help  enforce  their  decisions.  In  a  number  of 
other  cases,  a  parallel  bureaucracy  may  be  created 
with  political  appointees  to  some  extent  shadowing 
the  actions  of  the  career  public  servants. 

Rewards 

Decisions  about  the  level  of  rewards  for  public 
employees  are  a  final  issue  in  personnel  policy. 
Although  there  is  a  good  deal  of  evidence  that  pay 
is  not  the  prime  motivation  for  individuals  joining 
or  remaining  in  the  public  service,  it  must  be  con¬ 
sidered.  Financial  rewards  are  increasingly  impor¬ 
tant  because  of  the  adoption  of  ideas  from  NPM 
concerning  the  opening  of  the  public  service  to 
direct  entry  from  outside.  These  rewards  may  be 
especially  important  for  the  top  officials  of  the 
public  sector,  who  may  be  responsible  for  extremely 
large  organizations  but  receive  very  modest 
rewards.  At  the  extreme,  the  Secretary  of  Defense 
in  the  United  States,  who  is  responsible  for  the 
largest  organization  in  the  world,  receives  a 
“princely”  salary  of  $180,100,  while  his  subordi¬ 
nates  receive  even  lower  salaries. 

If  we  want  to  understand  the  importance  of 
public  rewards  for  the  public  sector,  however,  we 
need  to  move  beyond  formal  salary  levels.  The 
formal  salaries  and  visible  benefits  of  public  offi¬ 
cials  are  but  a  small  portion  of  the  rewards  for 
high  public  officials  in  many  countries.  At  the 
extreme,  the  nominal  salaries  of  the  leaders  of 
China  are  less  than  $100  per  month,  but  that  fig¬ 
ure  does  not  begin  to  take  into  account  the  huge 
range  of  housing,  transportation,  and  other  bene¬ 
fits  from  holding  office.  At  the  other  end  of  the 
dimension  of  visibility,  public  officials  in  Australia 
and  New  Zealand  receive  few  indirect  rewards  and 
are  held  to  close  account  for  the  benefits  they  do 
receive. 

The  degree  of  rewards  received  by  public  offi¬ 
cials,  and  especially  those  informal  rewards  received 
in  addition  to  their  formal  salaries  and  benefits,  is 
important  for  the  legitimacy  of  the  system  as  well 
as  for  recruitment.  The  relationship  between 
rewards  and  legitimacy  often  represents  a  negative 
spiral  in  which  low  legitimacy  makes  it  difficult  for 
some  governments  to  provide  adequate  resources 
to  their  employees,  and  therefore,  those  govern¬ 
ments  may  choose  to  provide  a  number  of  intrans¬ 
parent  rewards.  They  are  almost  inevitably  exposed 
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to  public  scrutiny,  increasing  public  cynicism  about 
government  and  reducing  legitimacy. 

Politicians  and  Bureaucrats 

Thus  far,  this  entry  has  considered  the  role  of  pub¬ 
lic  servants  primarily  as  the  providers  of  public 
services  to  the  public.  That  role  is  certainly  their 
dominant  image  for  most  citizens,  but  senior  pub¬ 
lic  servants  also  play  a  crucial  role  in  the  political 
processes  of  their  countries.  Although  both  schol¬ 
ars  and  practitioners  had  understood  for  years  the 
importance  of  civil  servants  in  making  policy  and 
in  advising  their  political  masters,  the  understand¬ 
ing  was  largely  anecdotal  until  Joel  Aberbach, 
Robert  Putnam,  and  Bert  Rockman  (1981)  began 
to  conceptualize  and  measure  the  relationships 
more  precisely.  Their  work  has  been  the  founda¬ 
tion  of  a  number  of  other  studies  on  the  roles  of 
civil  servants  in  a  range  of  countries.  Although 
there  are  a  number  of  differences  across  political 
systems  and  across  time,  the  research  does  find 
that  the  same  roles  appear  in  most  countries, 
although  they  may  be  interpreted  differently. 

The  Aberbach  et  al.  approach  to  the  relation¬ 
ships  between  politicians  and  bureaucrats  is  largely 
attitudinal,  judging  the  extent  to  which  both  sets 
of  actors  have  attitudes  that  are  conducive  to 
cooperation  and  to  effective  policy  making.  These 
relationships  can,  however,  be  considered  in  terms 
of  structural  relationships.  For  example,  Peters 
identified  five  ideal  types  of  relationships  between 
politicians  and  bureaucrats  and  then  used  those 
models  as  a  means  of  comparing  the  styles  of 
executive  politics  in  a  number  of  countries.  These 
models  ranged  from  the  formal  Weberian/ 
Wilsonian  domination  of  governing  by  elected 
politicians  to  an  administrative  state  in  which 
experts  in  the  bureaucracy  dominate  policy.  All 
the  models  were  conceived  of  as  ideals,  but  there 
were  certainly  empirical  referents  in  a  number  of 
countries. 

Finally,  the  relationship  between  civil  servants 
and  politicians  can  be  conceptualized  as  a  bargain 
between  the  two  sets  of  actors.  It  can  be  argued  that 
politicians  give  up  some  of  their  personal  powers  in 
this  bargain  so  that  they  could  create  a  professional 
and  career  public  service  and  thus  gain  the  loyalty 
and  commitment  of  an  expert  body  of  profession¬ 
als.  Likewise,  the  civil  servants  cede  their  capacity  to 


complain  publicly  about  policies  in  return  for  a 
more  or  less  guaranteed  position  in  the  structures  of 
governing  and  a  predictable  set  of  rewards. 

Accountability 

While  public  administrators  have  a  powerful 
role  in  governing  and  exercise  substantial  control 
over  the  public  as  they  implement  programs,  these 
officials  also  have  substantial  influence  over  public 
policy.  In  both  their  roles — implementing  policy 
and  making  policy — bureaucrats  have  been  able  to 
exercise  their  power  with  little  direct  control  from 
the  public  (but  see  below).  The  traditional  model 
of  accountability  for  public  servants  was  that  they 
were  accountable  to  their  ministers  (or  to  some 
other  political  official),  the  minister  was  account¬ 
able  to  the  legislature,  and  the  legislature  was 
accountable  to  the  people.  Even  in  nondemocratic 
regimes  there  has  been  some  sense  of  the  necessity 
of  holding  public  officials  accountable  for  their 
actions,  often  to  the  hegemonic  party  rather  than 
to  duly  elected  officials. 

To  make  the  hierarchical  model  of  accountabil¬ 
ity  function  effectively,  governments  have  devel¬ 
oped  a  number  of  instruments  that  enable  political 
officials,  and  the  public,  to  detect  malfeasance  and 
to  force  some  reckoning  for  failures  (whether  of 
commission  or  omission).  These  instruments  are 
too  numerous  and  beyond  the  scope  of  this  entry, 
but  some  generalizations  should  at  least  be  men¬ 
tioned  here.  One  is  that  many  of  the  instruments 
for  accountability  are  internal  to  the  executive 
branch  of  government,  and  they  require  ministers 
(and  other  public  servants)  to  exercise  control  over 
their  subordinates. 

A  second  generalization  about  hierarchical 
forms  of  accountability  is  that  legislatures  have 
played  a  central  role.  This  role  is  easier  for  well- 
staffed  legislatures  with  strong  committees,  such  as 
the  U.S.  Congress,  but  it  is  true  even  for  the  less 
well-resourced  legislatures.  In  addition,  as  cabinets 
and  prime  ministers  come  to  dominate  parliamen¬ 
tary  systems,  their  legislatures  appear  to  be  using 
the  enforcement  of  accountability  as  a  means  of 
recapturing  some  of  the  powers  they  may  have 
lost.  Legislatures  appear  to  be  adding  staff  and 
creating  structures  to  facilitate  this  central  func¬ 
tion,  but  they  still  have  a  great  deal  to  do  to  be 
effective  controllers  for  the  executive. 
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In  addition  to  the  revival  of  legislatures  as  a 
locus  of  accountability,  public  policy  and  adminis¬ 
tration  have  become  increasingly  influenced  by 
law  and  the  courts.  Public  administration  has 
always  been  the  implementation  of  law,  and  in 
some  administrative  systems  such  as  that  of  France 
and  those  derivative  of  that  tradition,  there  has 
been  substantial  ex  ante  judicial  control.  Legal 
means  of  control  over  administration  have, 
however,  become  more  central,  especially  in  West¬ 
minster  systems  in  which  the  courts  had  relatively 
little  capacity  to  control  the  other  branches  of 
government.  However,  with  the  adoption  of  some 
constitutional  instruments  such  as  the  Charter  of 
Rights  and  Freedoms  in  Canada,  and  the  UK’s 
membership  in  the  European  Union  (EU),  the 
courts  have  gained  a  greater  capacity  to  enforce 
accountability  even  in  those  systems. 

To  some  extent,  all  these  mechanisms  for  account¬ 
ability,  even  those  with  a  strong  institutional  foun¬ 
dation,  depend  on  transparency  and  openness.  If  an 
administrative  system  can  keep  its  practices  hidden 
from  the  public  and  from  political  actors,  then  the 
possibilities  for  exercising  control  over  that  adminis¬ 
tration  are  very  limited.  The  central  role  of  transpar¬ 
ency  is  at  the  heart  of  many  efforts  by  international 
organizations,  for  example,  the  World  Bank,  to 
promote  transparency  and  accountability  in  transi¬ 
tional  and  developing  countries.  Transparency  is 
important  for  maintaining  accountability  in  general, 
but  it  may  be  especially  important  in  the  ongoing 
war  against  corruption  in  these  systems. 

The  traditional,  hierarchical  model  of  account¬ 
ability  was  built  on  some  heroic  assumptions,  and 
to  some  extent  it  is  now  being  complemented  by  a 
range  of  other  approaches  to  accountability  that 
relies  on  a  variety  of  different  instruments.  The 
most  important  addition  to  the  repertoire  of 
accountability  instruments  has  been  the  use  of 
competition  both  to  identify  problems  in  service 
provision  and  to  find  the  means  of  enforcing  con¬ 
trol  over  less  successful  programs.  This  method  of 
control  is  to  some  extent  manifested  in  perfor¬ 
mance  management  (see  above),  and  it  further  rely 
on  the  “customers”  of  programs  paying  attention 
to  the  success  or  failure  of  the  programs  that  serve 
them  and  becoming  mobilized  politically  if  the 
programs  perform  poorly. 

A  final  point  concerning  accountability  is  that 
we  need  to  recognize  the  growing  importance 
of  output  legitimation  in  many  countries.  The 


traditional  democratic  model  of  legitimation  is 
that  the  political  process,  along  with  inputs  such  as 
voting,  is  central  to  legitimating  government  and 
its  policy  decisions.  To  an  increasing  extent,  how¬ 
ever,  the  public  sector  is  legitimated  by  its  ability 
to  provide  services.  This  change  is,  at  least  in  part, 
a  consequence  of  the  NPM  and  its  emphasis  on 
service  provision,  but  it  is  also  in  part  a  function  of 
the  declining  centrality  of  political  participation  in 
many  democratic  systems.  Furthermore,  many 
scholars  argue  that  the  alleged  democratic  deficit 
in  the  EU,  due  to  inadequate  mechanisms  of  repre¬ 
sentation,  has  required  output  legitimation. 

The  Centrality  of  Administration 

Although  most  citizens,  and  many  political  scien¬ 
tists,  do  not  consider  public  administration  as  a  cen¬ 
tral  part  of  contemporary  politics  and  government,  it 
is  difficult  to  understand  what  is  happening  in  gover¬ 
nance  without  understanding  the  central  role  of  the 
public  bureaucracy.  Furthermore,  the  centrality  of 
administration  may  be  increasing  as  many  political 
institutions  lose  some  of  their  capacity  for  gover¬ 
nance  and  the  actions  of  public  administrators 
become  more  important  both  for  making  substantive 
decisions  and  for  linking  state  and  social  actors. 

The  increasing  centrality  of  administrative 
actors  in  governing  is  to  some  extent  a  paradoxical 
result  of  some  reforms  of  the  public  sector  that 
were  designed  to  transform  administration  and  to 
make  service  delivery  more  market  and  customer 
driven.  Although  many  of  these  reforms  were 
directed  ostensibly  at  the  public  bureaucracy  as 
traditionally  organized  and  managed,  the  net  effect 
of  the  reforms  has  been  to  reduce  the  power  of  the 
political  officials  nominally  in  charge  of  govern¬ 
ment.  In  reality,  NPM  was  a  broader  theory  of 
governance  as  much  as  a  theory  of  public  adminis¬ 
tration  per  se.  Its  explicit  demand  was  to  let  the 
mangers  manage;  its  implicit  recommendation  was 
that  politicians  should  absent  themselves  from  the 
process  of  governing  as  much  as  possible. 

Linking  the  Fields  of  Inquiry 

The  preceding  discussion  has  considered  three 
major  fields  of  inquiry  in  political  science  some¬ 
what  in  isolation  from  one  another,  namely, 
(1)  the  role  of  representative  bureaucracy,  (2)  the 
politicization  of  the  civil  service,  and  (3)  the 


Governance,  Administration  Policies  1009 


importance  of  understanding  the  rewards  of  high 
public  office,  whether  elected  or  appointed.  While 
that  strategy  helps  clarify  each  of  the  three  fields 
individually,  it  does  not  capture  the  reality  of  gov¬ 
erning  and  the  close  interconnections  between 
policy  choices,  the  administration  of  policies,  and 
the  overall  governance  capacity  of  these  systems. 
The  purpose  of  political  science  can  be  understood 
as  finding  ways  to  explain  why  certain  policy 
choices  are  made,  why  they  are  effective,  and  what 
their  consequences  are  for  citizens.  All  these  three 
fields  focus  on  what  governments  actually  do  to 
improve  the  lives  of  their  citizens. 

If  we  adopt  the  basic  Lasswellian  principle  that 
politics  is  about  transforming  the  life  chances  for 
citizens,  then  we  should  begin  with  the  study  of 
public  policies  and  work  back  from  those  outputs 
of  the  political  process.  These  policy  choices  can  be 
conceptualized  both  as  a  dependent  variable  for 
the  political  process  and  as  an  independent  vari¬ 
able  for  explaining  the  steering  capacity  of  the 
political  system.  That  is,  the  nature  of  the  policy 
process,  as  well  as  the  demands  being  pressed  on 
the  system,  may  serve  some  purposes  well  but  may 
not  enable  the  governance  system  to  steer  effec¬ 
tively.  For  example,  Scharpf  has  pointed  out  that 
the  design  of  policy-making  systems  in  the  EU  and 
in  Germany  tends  to  produce  suboptimal  results, 
and  hence  the  steering  capacity  of  these  systems  is 
not  as  great  as  it  might  otherwise  be.  There  are 
mechanisms  available  to  overcome  those  prob¬ 
lems,  but  they  too  may  present  governance  chal¬ 
lenges.  Furthermore,  in  other  cases,  the  outputs  of 
the  political  system  may  reflect  the  interests  of 
dominant  groups  in  society,  but  by  ignoring  other 
interests,  they  may  complicate  its  steering  and 
diminish  its  overall  legitimacy. 

When  beginning  to  think  about  the  broader 
processes  of  governing  with  making  policy  choices, 
however,  it  is  important  to  remember  that  making 
and  implementing  policy  is  not  a  final  set  of 
choices.  Almost  all  policy  making  in  contemporary 
political  systems  consists  of  revisioning  and 
attempting  to  improve  older  policies,  sometimes 
incrementally  and  sometimes  radically,  but  there 
tends  to  be  a  framework  for  public  action  that 
must  be  addressed.  That  is,  remaking  policy,  or 
policy  succession  feedback  from  the  earlier  attempts 
to  make  policy,  is  a  crucial  part  of  assessing  the 
success  and  failure  of  those  interventions.  For 
comparative  political  analysis,  the  capacity  and 


willingness  of  political  systems  to  respond  to  feed¬ 
back  is  one  crucial  element  for  understanding  the 
differences  in  policy  making. 

The  feedback  process  places  public  administra¬ 
tion  in  a  central  position.  There  are  certainly  politi¬ 
cal  processes  for  providing  feedback,  but  the  con¬ 
nections  between  the  bureaucracy  and  the  environ¬ 
ment  are  crucial.  Hood  (1976)  noted  that  the  tools 
of  government  (see  above)  included  both  detectors 
and  “effectors”  and  that  the  public  bureaucracy 
was  central  to  both.  The  involvement  of  the 
bureaucracy  in  feedback  occurs  in  part  through  its 
connections  with  clients,  and  especially  because  of 
the  role  of  street-level  bureaucrats.  The  public 
bureaucracy  may  also  be  important  in  detecting 
more  “objective”  problems  in  policy,  given  the 
wide  range  of  data  collection  in  which  it  is  engaged. 

As  patterns  of  governing  continue  to  shift 
toward  the  greater  use  of  market  and  civil  society 
actors,  the  bureaucracy  may  become  even  more 
important  as  a  locus  for  steering.  The  public 
bureaucracy  is  crucial  for  managing  instruments 
such  as  contracts  and  partnerships  and  in  provid¬ 
ing  the  linkage  between  networks  of  social  actors 
and  the  public  sector.  Political  leaders  may  not 
have  the  technical  capacity  for  some  of  these 
aspects  of  contemporary  governing  and  may  not 
be  willing  to  engage  in  the  continuous  bargaining 
and  adjustment  needed  for  coping  with  networks. 
These  patterns  of  governing  may  therefore  be 
uncomfortable  for  many  public  servants  who 
believe  that  they  are  being  forced  to  make  too 
many  policy  decisions  and  for  politicians  who  may 
feel  they  have  lost  control  of  governing. 

In  addition  to  being  linked  through  some  aspects 
of  the  common  problems  with  which  they  are  con¬ 
cerned,  these  three  major  components  of  political 
science  are  also  linked  through  theory.  Each  area 
has  its  own  particular  theoretical  and  conceptual 
issues,  as  identified  above,  but  there  are  also  some 
common  theoretical  strands  that  unite  them. 
Perhaps  the  most  apparent  of  these  is  institutional¬ 
ism,  and  the  fundamental  assumption  that  struc¬ 
ture  matters.  These  three  bodies  of  research  all 
recognize  that  agency  is  also  a  part  of  any  explana¬ 
tion;  however,  at  the  heart  of  the  three,  and  per¬ 
haps  especially  public  administration,  there  is  also 
a  strong  sense  that  structures  to  a  great  extent 
constrain  that  individual  agency  (Peters,  2001). 

While  institutionalist  approaches  may  appear 
the  most  logical  means  to  link  these  three  sets  of 
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literature,  they  may  also  be  addressed  through 
other  approaches,  such  as  rational-choice  models. 
Certainly,  rational  choice  has  been  applied  to  both 
public  administration  (Murray  Horn,  1995)  and  to 
public  policy,  providing  a  good  deal  of  insight  into 
these  aspects  of  governing.  Rational  choice  has, 
however,  been  applied  to  the  more  comprehensive 
issues  associated  with  governance  only  to  a  limited 
extent.  Although  that  has  been  the  case,  one  can 
see  how  that  application  might  be  made.  The 
design  of  governance  structures  can  be  considered 
from  the  rational  perspective  almost  as  easily  as 
can  the  design  of  administrative  structures  or  the 
choice  of  policy. 

In  addition  to  the  central  role  of  institutions  in 
making  and  implementing  public  policy,  a  number 
of  the  central  concepts  in  political  science  are  rel¬ 
evant  for  understanding  governing.  Governing 
involves  the  use  of  power,  as  the  public  sector  must 
impose  its  decisions  on  society  and  is  often  con¬ 
fronted  by  countervailing  power  within  society. 
Power  is  often  subtle  and  is  seen  as  much  in  what 
does  not  happen  as  in  what  does.  Governing  also 
involves  the  use  of  authority  and  legitimacy,  allow¬ 
ing  governance  without  the  overt  application  of 
power.  The  list  of  fundamental  concepts  could  be 
extended,  but  the  fundamental  point  is  that  public 
administration  and  policy  are  not  distinct  from  the 
remainder  of  political  science  but  rather  may  be  at 
the  heart  of  the  discipline  and  its  concerns. 

B.  Guy  Peters 
University  of  Pittsburgh 
Pittsburgh,  Pennsylvania,  United  States 
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Few  terms  are  as  insecurely  defined  or  controver¬ 
sial  in  a  normative  sense  as  global  governance.  A 
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major  part  of  the  problem  is  the  word  governance. 
How  does  this  differ  from  traditional  notions  of 
government  exercised  by  sovereign  states  and  their 
legal  subdivisions?  Must  governance  be  institution¬ 
alized  and  rest  on  national,  legal,  and  moral 
authority?  To  the  extent  that  governance  exists 
beyond  the  state,  must  it  be  created  by  formal 
agreement  among  states?  Or  should  governance  be 
understood  to  include  all  forms  and  degrees  of  rule, 
exercised  by  states  and  nonstate  actors,  whether 
based  on  law  and  institutionalized  or  much  less 
formal  in  nature?  If  governance  is  other  than  tradi¬ 
tional  government  by  states,  from  where  does  such 
governance  derive  its  legitimacy?  What  is  the  rela¬ 
tionship  between  authority  and  legitimacy?  Indeed, 
does  legitimacy  actually  matter  if  there  exists  a 
capacity  to  govern  effectively?  Does  authority 
imply  legitimacy  or  only  effective  governance? 

Confusion  is  compounded  by  the  word  global. 
Is  the  analytical  focus  only  on  some  form  of  world 
government — that  is,  a  truly  global  manifestation 
of  some  form  of  rule  that  extends  over  all  the  ter¬ 
ritory  and  persons  in  the  world?  Or  should  global 
be  understood  as  including  all  forms  of  gover¬ 
nance,  however  narrow  in  function  and  limited  in 
territorial  reach? 

The  answers  to  each  of  these  questions  obvi¬ 
ously  affect  the  answers  to  still  others:  How  old  or 
new  is  global  governance?  How  do  present  pat¬ 
terns  of  global  governance  differ  from  those  of  the 
past?  This  entry  discusses  several  forms  of  global 
governance,  including  international  governmental 
organizations  (IGOs),  international  regimes,  and 
nongovernmental  organizations  (NGOs),  and  the 
legitimacy  of  global  governance. 

International  Governmental  Organizations 

Not  coincidentally,  even  as  globalization  in  the 
1990s  became  a  popular  way  to  describe  a  vast 
acceleration  in  the  volume  and  pace  of  transna¬ 
tional  activities,  so  did  global  governance  gain 
prominence  in  diplomatic  and  academic  circles  as 
a  concept  that  seemed  to  capture  the  increasing 
challenge  of  managing  issues  that  were  beyond  the 
geographical  reach  of  individual  states  to  address 
on  their  own. 

The  term  global  governance  was  associated  early 
on  in  the  decade  with  the  Commission  on  Global 
Governance.  In  1991,  a  group  of  distinguished 
public  figures  met  in  Stockholm  and  pledged  to 


form  an  international  commission  intended  to 
help  build  a  more  effective  system  of  world  secu¬ 
rity  and  governance.  The  United  Nations’  (UN) 
then-  Secretary-General  Boutros  Boutros  Ghali 
endorsed  the  initiative,  and  in  1992,  the  commis¬ 
sion  was  established  with  28  members  from 
around  the  world,  who  served  in  their  personal 
capacities.  The  commission  issued  its  report  Our 
Global  Neighbourhood  in  1994,  making  recom¬ 
mendations  for  the  reform  of  the  UN  Economic 
and  Social  Council  and  certain  other  UN  bodies. 
Few  of  the  recommendations  were  implemented, 
so  the  report  constituted  more  of  a  landmark  than 
a  watershed. 

Be  that  as  it  may,  the  commission  experience 
did  seem  to  link  global  governance  with  the  UN 
and,  more  broadly,  with  IGOs  whose  members  are 
states.  In  the  modern  state  system,  growth  in  IGOs 
went  hand  in  hand  with  the  expansion  of  interna¬ 
tional  law  in  the  19th  and  early  20th  centuries. 
More  than  20  times  as  many  treaties  from  1851  to 
1950  took  the  multilateral  form  as  in  the  period 
between  1751  and  1850  (Charlotte  Ku,  cited  in 
Jan  A.  Scholte,  2005). 

One  trend  was  multilateral  conference  diplo¬ 
macy,  beginning  after  the  defeat  of  Napoleon  in 
1815,  with  the  Concert  of  Europe,  an  informal 
consultation  mechanism  of  European  government 
officials.  The  concert  convened  periodically  to  deal 
with  perceived  general  European  problems,  includ¬ 
ing  arms  races  and  potential  clashes  over  imperial 
expansion,  which  was  the  main  focus  of  the  1 8  84- 
1885  Berlin  meeting.  Over  time,  the  number  of 
countries  attending  multilateral  conferences 
increased  and  included  some  beyond  Europe.  The 
Hague  conferences  of  1899  and  1907  banned 
aerial  bombing,  chemical  warfare,  and  the  use  of 
hollow-point  bullets;  attempted  to  control  the  pro¬ 
liferation  of  submarines  and  armed  merchant  ves¬ 
sels;  and  established  the  Permanent  Court  of 
Arbitration.  In  addition,  the  two  Hague  Conven¬ 
tions,  along  with  the  later  Geneva  Conventions, 
profoundly  shaped  the  laws  of  war  and  related 
matters,  such  as  the  treatment  of  prisoners. 

Meanwhile,  beginning  with  the  1826  Panama 
Congress,  there  was  also  a  series  of  inter- American 
and  Pan-Latin  American  conferences.  The  inter- 
American  meetings  eventually  led  to  the  creation  of 
the  Pan-American  Union  in  1910.  Later  inter- 
American  conferences  were  overshadowed  by  U.S. 
military  interventions  in  the  Caribbean  basin,  but 
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after  the  Roosevelt  Good  Neighbor  Policy  and  as 
World  War  II  loomed,  the  Americas  forged  an 
almost  united  hemispheric  alliance  against  the 
Axis. 

Another  parallel  trend  in  the  establishment  of 
IGOs  was  that  of  functional  and/or  regulatory 
organizations  that  from  the  outset  concerned  them¬ 
selves  about  global  matters  within  the  area  of  their 
expertise.  What  is  now  the  International  Tele¬ 
communications  Union  (ITU)  was  created  in  1865 
and  the  Universal  Postal  Union  in  1874.  An  agree¬ 
ment  on  universal  time  coordination  (UTC)  based 
on  the  Greenwich  prime  meridian  was  concluded  in 
1884.  Technical  standardization  started  with  the 
International  Electrotechnical  Commission  (IEC)  in 
1906.  The  International  Labor  Organization  (ILO) 
was  founded  in  1919.  Functional  IGOs  to  date 
include  a  total  of  19  UN-specialized  agencies — 
among  them,  well-known  organizations  such  as  the 
World  Health  Organization  (WHO),  the  Ford  and 
Agriculture  Organization  (FAO),  the  International 
Civil  Aviation  Organization  (ICAO),  and  the 
International  Maritime  Organization  (IMO) — as 
well  as  numerous  others  outside  the  UN  system, 
such  as  the  Bank  of  International  Settlements  (BIS) 
founded  in  1930,  the  International  Atomic  Energy 
Agency  (IAEA),  and  any  number  of  other  regula¬ 
tory  bodies  at  the  regional  level. 

The  aftermath  of  the  two  world  wars  saw  a 
momentous  advance  in  universal  IGOs:  the  League 
of  Nations  and  then  the  UN.  Both  organizations 
institutionalized  great  power  control  at  the  council 
level  that  seemed  to  recall  the  earlier  Concert  of 
Europe  and  were  primarily,  although  not  exclu¬ 
sively,  concerned  with  peace  and  security  issues. 
The  same,  by  definition,  was  true  of  post-World 
War  II  alliances  such  as  the  North  Atlantic  Treaty 
Organization  (NATO),  the  Warsaw  Pact,  and  the 
Organization  of  American  States,  which  alone 
included  a  process  for  resolving  disputes  among  its 
own  members.  On  the  other  hand,  economics  was 
the  focus  of  the  Bretton  Woods  institutions — the 
International  Bank  for  Reconstruction  and  Devel¬ 
opment  (IBRD — the  World  Bank),  the  International 
Monetary  Fund  (IMF),  the  General  Agreement  on 
Tariffs  and  Trade  (GATT),  and  its  eventual  succes¬ 
sor,  the  World  Trade  Organization  (WTO). 
Likewise,  economic  recovery  and  growth  were  the 
central  motivation  for  post-World  War  II  European 
institution  building  that  over  time  produced  the 


European  Common  Market  and  today’s  European 
Union  (EU).  Yet  another  area  of  concern  for  uni¬ 
versal  IGOs  has  been  law  and  justice,  including  the 
Permanent  Court  of  International  Justice  (PCIJ), 
the  International  Court  of  Justice  (ICJ),  and  most 
recently  the  International  Criminal  Court  (ICC). 

Overall,  the  growth  in  the  numbers  of  IGOs  has 
been  so  rapid  that  The  Yearbook  of  International 
Organizations,  2004-2005  (Union  of  International 
Organizations,  2004,  Appendix  3,  Table  1)  was 
able  to  list  some  245  as  “conventional”  IGOs  and 
another  1,743  under  the  classification  of  “other 
international  bodies.” 

International  Regimes 

Foreshadowing  later  discussions  about  global  gov¬ 
ernance,  regime  theory  appeared  in  the  1970s  and 
1980s  partly  to  explain  how  international  coop¬ 
eration  might  continue  even  as  United  States’  post- 
World  War  II  dominance  gradually  eroded.  This 
was  a  concept  of  what  has  been  described  as  “gov¬ 
ernance  without  government”  that  did  not  require 
a  “hegemon,”  or  at  least  one  dominating  the  entire 
international  system. 

Regime  theory  postulates  that  states  cooperate 
through  institutional  arrangements,  rules,  or 
shared  practices.  According  to  Oran  Young,  the 
capacity  of  these  regimes  depends  either  on  the 
ability  of  a  dominant  actor  or  on  jointly  maxi¬ 
mizing  shared  interests.  Some  authors  point  out 
the  role  of  influential  groups  of  experts,  consti¬ 
tuted  as  epistemic  communities,  who  define  the 
problem  at  stake  and  suggest  solutions;  others,  in 
a  more  functionalist  vision,  consider  the  regime 
as  spontaneously  emerging  when  it  is  needed. 
According  to  this  perspective,  states  succeed  in 
cooperating  when  (and  to  the  extent  that)  they 
are  able  to  form  institutional  arrangements  or 
sets  of  roles,  rules,  and  relationships  of  the  sort 
we  have  come  to  think  of  as  international 
regimes.  This  perspective  attempts  to  explain 
successes  and  failures  in  terms  of  whether  such 
regimes  are  created. 

Regime  theory  paradoxically,  for  all  its  empha¬ 
sis  on  states,  was  important  not  least  because  it 
pointed  away  from  an  exclusively  state-centric 
view  of  global  affairs.  Rather,  as  Hedley  Bull 
(1977)  suggested  about  the  same  time,  cooperation 
and  order  are  as  commonplace  in  international 
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society  as  competition  and  anarchy.  Cooperation 
might  result  in  the  creation  of  constitutionalized 
IGOs — which  themselves  often  proved  to  be  more 
than  the  simple  sum  of  their  state  parts — but 
equally  significant  were  international  treaties  and 
other  much  less  formal  arrangements  (“conven¬ 
tions”  in  a  different  sense)  that  shaped  outcomes  in 
particular  issue  areas.  There  were  regimes  to  pro¬ 
tect  endangered  species  and  transnational  ecosys¬ 
tems,  to  curb  nuclear  proliferation  and  chemical 
warfare,  and  so  on.  Moreover,  regime  theory  high¬ 
lighted  the  prominence  of  epistemic  communities 
linking  experts  inside  national  bureaucracies,  IGO 
bureaucracies,  and  NGOs. 

Postinternational  Perspectives 
on  Global  Governance 

Classical-realist  international  relations  (IR)  theory 
rested  so  firmly  on  the  presumed  egoistic  behavior 
of  states  and  assumptions  about  anarchy  in  world 
politics  that  it  put  little  stock  in  international 
organizations  and  international  law.  Therefore,  it 
is  not  surprising  that  IR  theory  was  late  in 
acknowledging  the  significant  presence  of  NGOs, 
let  alone  in  beginning  to  conceptualize  their  influ¬ 
ence  on  and  even  direct  involvement  in  gover¬ 
nance  both  within  and  across  state  boundaries.  In 
fact,  there  has  been  such  a  proliferation  of  NGOs 
in  the  past  few  decades  that  they  have  become 
increasingly  impossible  to  ignore.  James  Rosenau 
(2003)  describes  this  growth  as  a  veritable  “orga¬ 
nizational  explosion.”  Many  of  even  the  most 
“local”  among  the  millions  of  NGOs  are  con¬ 
cerned  with  issues  that  also  resonate  at  regional 
and  global  levels,  and  many  if  not  most  such 
NGOs  are  networked  with  others  that  advance 
their  joint  positions  transnationally. 

The  same  reference  work  on  international  orga¬ 
nizations  (Union  of  International  Organizations, 
2004,  Appendix  3,  Table  1)  cited  earlier  for  num¬ 
bers  of  IGOs  lists  7,261  “conventional”  NGOs 
and  an  additional  13,590  under  the  heading  of 
“other  international  bodies.”  Many  of  these 
NGOs  have  institutionalized  relationships  or  are 
at  least  selectively  engaged  with  IGOs.  Rosenau 
(2003),  for  example,  noted  that  NGOs  were 
involved  with  some  70%  of  World  Bank  projects 
in  2002.  The  literature  on  globalization  has 
tended  to  characterize  the  proliferation  of  NGOs 


and  their  transnational  activism  with  the  rise  of 
“global  civil  society.”  A  related  body  of  writings 
has  built  on  earlier  work  on  nonstate  actors  and 
transnational  corporations  (TNCs)  to  spotlight 
the  growing  importance  of  “private  authority”  in 
global  governance  and  international  affairs  gener¬ 
ally  (see,  especially,  A.  Clair  Cutler,  Virginia 
Haufler,  &  Tony  Porter,  1999;  Rodney  B.  Hall  & 
Thomas  J.  Biersteker,  2002). 

However,  the  NGO  and  TNC  dimension  of  the 
organizational  explosion  is  only  one  of  numerous 
sweeping  changes  in  the  global  system  identified 
by  Postinternationalists  like  Rosenau,  as  well  as 
Yale  Ferguson  and  Richard  Mansbach  (2004, 
2008).  They  argue  that  these  changes  are  the  fun¬ 
damental  trends  that  profoundly  shape  patterns  of 
governance  at  all  levels.  The  central  analytical 
challenge,  as  Rosenau  (1997)  expresses  it,  is  how 
to  assess 

a  world  in  which  the  [domestic-foreign]  Frontier 
is  continuously  shifting,  widening  and  narrowing, 
simultaneously  undergoing  erosion  with  respect 
to  many  issues  and  reinforcement  with  respect  to 
others?  .  .  .  Under  what  circumstances  does 
authority  along  the  Frontier  accrue  to  like- 
minded  states,  to  global  regimes,  to  transnational 
organizations,  to  subnational  entities,  or  to 
coalitions  of  diverse  types  of  actors?  (p.  5) 

Postinternational  change  is  the  product  of  simul¬ 
taneous  processes  of  fusion  and  fission  of  author¬ 
ity.  The  first  is  reflected  in  the  growth  of  networks 
that  connect  and  influence  the  behavior  of  persons 
geographically  far  from  one  another.  As  has  often 
been  observed,  some  associations  are  falling  apart 
even  as  others  come  together.  In  his  book  Distant 
Proximities:  Dynamics  Beyond  Globalization 
(2003),  Rosenau  describes  this  dual  process  as 
“fragmegration.”  The  main  argument  is  that  “the 
best  way  to  grasp  world  affairs  today  requires 
viewing  them  as  an  endless  series  of  distant  prox¬ 
imities  in  which  the  forces  pressing  for  greater 
globalization  and  those  inducing  greater  localiza¬ 
tion  interactively  play  themselves  out”  (p.  4).  The 
second  tendency  is  the  fracturing  of  existing 
political  units  into  islands  of  self-identification 
that  localize  and  often  specialize  authority  and 
that  may  encumber  efforts  to  deliver  collective 
goods. 
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Postinternational  theory  sees  the  world  today 
(also  historically)  as  inhabited  by  countless  actors 
of  many  different  types  that  reflect  different  identi¬ 
ties,  are  differentially  engaged  in  countless  issues, 
and  exercise  effective  authority  in  particular 
domains  and  contexts. 

Accelerating  change  is  producing  an  increasingly 
complex  universe  of  actors  in  global/local  politics. 
Ferguson  and  Mansbach  call  them  polities,  while 
Rosenau  prefers  the  term  spheres  of  authority 
(SOAs).  Polities  are  collectivities  with  a  significant 
measure  of  identity  and  institutionalization,  a 
degree  of  hierarchy  in  their  organization,  and  the 
capacity  to  mobilize  persons  and  groups  for  politi¬ 
cal  purposes  (value  satisfaction).  Some  entities 
more  clearly  meet  these  criteria  than  others.  For 
instance,  most  states,  international  institutions, 
TNCs,  major  NGOs,  and  criminal  and  terrorist 
organizations  are  polities.  By  contrast,  most  mar¬ 
kets  are  not  polities  because  they  lack  the  requisite 
identity,  institutionalization,  and  hierarchy. 

Polities  coexist,  cooperate,  compete,  and  clash. 
They  often  overlap,  layer,  and  “nest”  (Ferguson  & 
Mansbach,  1996)  and  hence  share  some  of  the 
same  “political  space” — territory,  institutions, 
issues,  identities,  markets,  and/or  cyberspace. 
Polities  are  all  “becoming”  in  the  sense  that  politi¬ 
cal  evolution  is  constant,  although  they  evolve  at 
different  rates  and  not  necessarily  in  a  unilinear 
fashion.  In  the  postinternational  framework,  each 
type  of  polity  is  only  an  ideal  type  and  assumes 
many  different  forms  in  practice.  There  are,  for 
example,  many  variations  in  the  structures  and 
processes  of  cities  and  empires,  and  the  same  is 
plainly  true  of  sovereign  states,  IGOs,  NGOs, 
TNCs,  and  so  on — not  to  mention  hybrid  polities 
such  as  public-private  partnerships  or  polities  aris¬ 
ing  out  of  networks  or  alliances.  The  “domain”  of 
any  polity — its  “reach”  in  political  space — consists 
of  the  persons  and  groups  who  identify  with  it  and 
comply  with  its  directives,  as  well  as  the  resources 
it  can  therefore  command. 

All  polities  are  authorities  and  govern  within 
their  respective  and  often  overlapping  domains. 
Thus,  governance  exists  within,  across,  and 
beyond  the  jurisdictions  of  sovereign  states.  Global 
governance,  in  turn,  refers  to  the  totality  of  dis¬ 
cernible  polity  authority  domains  in  the  world. 
It  is  important  to  understand  that  postinterna¬ 
tional  theory  defines  authority  and  governance  as 


effective  influence  or  control.  Authority  need  not 
be  legitimate  to  be  effective,  although  almost  every 
polity  offers  some  sort  of  ideological  justification 
for  its  existence  and  behavior,  and  those  that  are 
widely  regarded  as  legitimate  obviously  tend,  for 
that  very  reason,  to  be  all  the  more  secure. 

A  postinternational  approach  presumes  that 
another  related  assumption  of  traditional  IR  theory — 
that  the  world  is  fundamentally  anarchic — tells  us 
little  more  than  that  there  is  no  effective  world 
government.  Human  affairs  are  largely  governed — 
that  is,  ruled  on  a  day-to-day  basis — not  only  by 
states  but  also  by  a  multitude  of  individual  polities 
that  exist  within,  crisscross,  or  transcend  individ¬ 
ual  states.  Some  of  these  polities  are  internally 
dysfunctional  or  inclined  to  disruption  and  vio¬ 
lence,  but  many  if  not  most  act  individually  and 
collectively  in  a  peaceful,  fairly  effective,  patterned, 
and  often  predictable  manner.  This  is  the  “real 
world  order.” 

The  Legitimacy  of  Global  Governance 

Long  before  a  global  financial  crisis  triggered  a 
major  downturn  in  the  global  economy  in  mid- 
2008,  there  was  a  rising  worldwide  antiglobaliza¬ 
tion  movement  revolving  around  resistance  to  free 
trade  and  unregulated  markets,  to  the  Washington 
Consensus  model  for  development,  and  to  pre¬ 
sumed  threats  to  local  culture,  labor  standards, 
and  the  local  and  global  environment  from  more 
open  borders  and  rapid  economic  change.  The 
earliest  mass  demonstrations  occurred  in  1999  and 
ebbed  and  flowed  in  subsequent  years  at  meetings 
of  global  economic  institutions,  the  G7  (  +  1),  G20, 
and  the  World  Economic  Forum  at  Davos.  For 
most  of  the  antiglobalization  protesters,  global 
governance  was  and  continues  to  be  a  symbol  of 
unrestrained  economic  liberalism  and  thus  some¬ 
thing  that  is  inherently  illegitimate. 

Among  scholars,  policymakers,  and  attentive 
publics  more  favorably  inclined  to  a  globalizing 
world,  there  has  also  arisen  a  separate  debate 
about  the  legitimacy  of  international  institutions 
and  other  actors  engaged  in  global  governance. 
This  discussion  has  usually  been  framed  as  concern 
about  a  “democratic  deficit”  in  global  governance. 
In  sum,  the  ideal  notion  of  governance  in  liberal 
democratic  states  has  been  that  it  derives  from  the 
“consent  of  the  governed,”  expressed  in  elections 
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and  other  democratic  processes.  How  can  such  an 
ideal  be  translated  to  governance  by  international 
institutions  and  other  nonstate  actors?  Indeed, 
should  the  same  sort  of  ideal  apply? 

In  fact,  from  one  perspective,  the  entire  problem 
is  vastly  overstated.  First,  except  for  a  few  high- 
profile  institutions  such  as  the  UN,  the  World 
Bank,  the  IMF,  and  the  EU,  most  global  gover¬ 
nance  actors  operate  with  little  publicity  and  far  off 
the  radar  except  for  those  relatively  few  states  and 
citizens  most  closely  affected  by  their  activities. 

A  second  and  important  reason  is  that  most  of 
the  states  in  the  world  are  not  democracies  and 
many  of  those  who  claim  to  be  have  a  serious 
democratic  deficit  of  their  own.  It  is  not  an  exag¬ 
geration  to  say  that  most  international  institutions 
are  paragons  of  democratic  virtue  by  comparison 
with  most  states. 

From  a  third  perspective,  the  problem  is  mini¬ 
mal  because  states  are  the  main  actors  in  interna¬ 
tional  institutions  and  regimes.  In  a  legalistic  sense, 
states  thus  “consent”  at  least  to  this  governance  on 
behalf  of  their  citizens. 

This  third  perspective  is  what  Allen  Buchanan 
and  Robert  O.  Keohane  (2006)  term  “the  Pedigree 
View,”  that  “legally  constituted  institutions,  cre¬ 
ated  by  states  according  to  the  recognized  proce¬ 
dures  of  public  international  law  and  consistent 
with  it,  are  ipso  facto  legitimate  or  at  the  very  least 
enjoy  a  strong  presumption  of  legitimacy” 
(pp.  412-413).  As  they  observe, 

The  Pedigree  View  fails  because  it  is  hard  to  see 
how  state  consent  could  render  global  governance 
institutions  legitimate,  given  that  many  states  are 
non-democratic  and  systematically  violate  the 
human  rights  of  their  citizens  and  are  for  that 
reason  themselves  illegitimate.  State  consent  in 
these  cases  cannot  transfer  legitimacy  for  the 
simple  reason  that  there  is  no  legitimacy  to 
transfer,  (pp.  412-413) 

Moreover,  as  Michael  Barnett  and  Martha 
Finnemore  (2004)  emphasize,  many  international 
institutions  have  a  tendency  to  build  on  their  initial 
mandate,  which  may  itself  be  sufficiently  ambigu¬ 
ous  as  to  invite  improvisation. 

Scholarship  on  organizations  generally  (not  just 
IOs )  has  made  it  abundantly  clear  that  organizations 


routinely  behave  in  ways  unanticipated  by  their 
creators  and  not  formally  sanctioned  by  their 
members.  Organizations  that  start  with  one 
mission  routinely  acquire  others.  .  .  .  They  exhibit 
mission  creep.  They  wander  far  from  their  original 
mandate.  .  .  .  They  develop  new  rules  and 
procedures  in  response  to  new  problems  they 
identify.  They  formulate  rules  that  are  politically 
safe  and  comfortably  routine  rather  than  efficient 
or  effective.  . . .  [They  act]  like  the  bureaucracies 
they  are.  (pp.  2-3) 

How,  then,  may  we  best  understand  the  issue  of 
the  legitimacy  of  global  governance?  It  is  crucial  to 
recognize  that  legitimacy  is  different  from  author¬ 
ity.  Authority,  for  those  inclined  to  a  legalistic 
interpretation,  rests  on  a  legal  right  to  exercise  it. 
For  the  Postinternationalists  and  Constructivists 
Barnett  and  Finnemore  (2004),  authority  is  the 
capacity  to  “rule”  effectively,  which  may  be  bol¬ 
stered  by — but  does  not  depend  on — law  and 
general  approbation. 

Legitimacy,  at  root,  is  the  widespread  belief  that 
a  global  governance  actor  has  the  right  to  exercise 
whatever  authority  it  does.  That  belief  may  be 
influenced  by  an  actor’s  constitutionality,  state 
pedigree,  and  internal  democratic  character  and 
processes,  as  well  as  matters  such  as  the  actor’s 
reputation  for  moral  authority  or  possession  of 
expert  knowledge.  However,  legitimacy  derives 
fundamentally  from  a  global  governance  actor’s 
ability  to  deliver  the  goods  that  its  constituents 
(members  and  wider  publics)  need  and  want. 
Success  in  doing  so,  over  time,  establishes  a  track 
record  and  tends  to  attract  broader  support,  iden¬ 
tity,  and  even  loyalty.  Consistent  failure,  obvi¬ 
ously,  courts  the  reverse. 

Yale  H.  Ferguson 
Rutgers  University-Newark 
Newark,  New  Jersey,  United  States 
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The  concept  of  good  governance  has  multiple 
meanings,  but  it  most  generally  refers  to  a  stan¬ 
dard  or  model  for  how  states  or  other  political 
entities  should  govern  and  be  governed.  This  usu¬ 
ally  includes  a  long  list  of  normative  principles  to 
which  these  entities  should  adhere,  such  as  trans¬ 
parency,  accountability,  inclusiveness,  effective¬ 
ness,  and  impartiality.  Quite  commonly,  good 
governance  is  defined  in  terms  of  its  antonyms — 
that  is,  by  referring  to  phenomena  that  indicate  its 
absence,  such  as  corruption,  nepotism,  favoritism, 
particularism,  or  patrimonialism.  Within  policy 


circles  of  the  developmental  aid  community,  the 
concept  has  been  used  as  an  agenda  for  reform  of 
developing  countries,  such  as  civil  service  reforms, 
securing  property  rights,  or  installing  judicial  inde¬ 
pendence.  Within  the  academic  community,  where 
close  synonyms  such  as  institutional  quality  or 
quality  of  government  have  been  developed,  the 
concept  has  been  systematically  linked  to  several 
highly  desirable  outcomes,  most  notably  economic 
growth  and  long-term  development. 

The  concept  was  first  launched  by  the  World 
Bank  in  the  late  1980s  and  early  1990s  in  a  series  of 
reports  that  sought  to  develop  a  new  strategy  for  aid 
conditionality.  Previously,  the  Bank  had  refrained 
from  taking  political  criteria  into  account  in  its 
donor  policy.  By  demanding  that  recipient  countries 
adhere  to  the  agenda  of  “good  governance  reforms,” 
this  stance  was  changed.  By  the  late  1990s,  the  con¬ 
cept  started  to  surface  on  the  research  agenda  of 
several  academic  fields  in  the  social  sciences,  most 
notably  in  economics  and  political  science.  To  a 
considerable  extent,  this  development  was  pro¬ 
pelled  by  the  appearance  of  empirical  measures,  or 
worldwide  governance  indicators,  made  publicly 
available  by  the  World  Bank  Institute. 

To  apprehend  the  scope  of  meanings  attached  to 
the  concept,  a  series  of  distinctions  is  needed.  First, 
good  governance  may  be  defined  in  terms  of  what 
it  is  (as  a  good  in  itself)  or  in  terms  of  what  it  is 
good  for  (e.g.,  good  for  economic  development). 
The  former  may  be  called  an  internal  definition  and 
the  latter  an  external  (or  functionalist)  definition  of 
good  governance.  In  the  internal  case,  some  proper¬ 
ties  of  the  governance  system  of  a  country  must  be 
singled  out  in  advance,  together  with  a  notion  of 
why  these  properties  make  the  system  better  or  of  a 
higher  quality.  This  could,  for  example,  be  the  idea 
that  good  governance  is  to  be  equated  with  the  rule 
of  law.  Countries  abiding  by  this  principle  are  then 
considered  to  have  good  governance,  regardless  of 
whether  the  rule  of  law  leads  to  other  desirable 
consequences.  In  the  external  case,  empirical  scru¬ 
tiny  is  required  to  determine  what  properties  of  the 
system  are  considered  to  be  good,  namely,  those 
that  are  found  to  exert  an  influence  on  some  prefer¬ 
able,  external  outcome.  In  this  case  too,  rule  of  law 
might  be  considered  to  be  part  of  good  governance 
but  only  to  the  extent  that  rule  of  law  can  be  shown 
to  influence,  for  example,  economic  growth  or 
other  desirable  phenomena. 
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Second,  good  governance  may  be  defined  in 
terms  of  procedures  or  policy  content.  Procedural 
definitions  refer  to  regulatory  principles,  codes  of 
conduct,  norms,  or  other  value-laden  criteria  that 
constrain  the  forms  in  which  politics  may  be  con¬ 
ducted.  Definitions  demanding  certain  contents, 
by  contrast,  require  certain  policies  to  be  enacted 
by  the  political  system,  such  as  market  deregula¬ 
tion  or  state  privatization. 

Third,  there  is  a  fundamental  difference  between 
procedural  definitions  referring  to  the  input  and 
the  output  sides  of  the  political  system.  The  former, 
where  access  to  public  authority  is  regulated,  is  the 
realm  of  decision-making  bodies,  such  as  parlia¬ 
ments,  cabinets,  juntas,  party  executive  commit¬ 
tees,  or  royal  councils.  The  output  side  concerns, 
instead,  the  exercise  of  public  authority:  how 
reforms  are  implemented,  rules  enforced,  court 
cases  decided,  and  so  on.  This  is  the  realm  of  the 
public  administration,  the  courts,  and  the  law 
enforcement  bodies  of  the  state  apparatus.  Different 
regulatory  principles  guide  our  general  thinking  as 
to  what  makes  different  political  systems  better 
than  others  on  these  two  sides  of  the  political  sys¬ 
tem.  On  the  input  side,  democracy  and  the  principle 
of  political  equality  (one  person,  one  vote;  civil  and 
political  rights,  etc.)  is  the  most  well-established 
norm.  On  the  output  side,  key  guiding  principles 
are  the  rule  of  law,  administrative  effectiveness  (or 
efficiency),  and  impartial  policy  implementation. 
The  most  comprehensive  procedural  definitions  of 
good  governance  encompass  all  these  normative 
views  and  facets  of  the  political  system,  whereas 
other  more  specific  notions  concentrate  on  the  out¬ 
put  side,  or  even  on  certain  aspects  of  the  output 
side  (such  as  administrative  corruption). 

The  academic  discourse  has  concerned  itself 
with  both  the  causes  and  consequences  of  good 
governance  or  quality  of  government.  Starting 
with  consequences,  the  strongest  empirical  regular¬ 
ity,  as  already  mentioned,  concerns  the  positive 
link  to  long-run  economic  growth.  The  theory 
underlying  these  findings  posits  that  governance  is 
key  to  shaping  the  incentives  of  key  economic 
actors  in  society,  in  particular  their  propensity  to 
invest  in  physical  and  human  capital  and  technol¬ 
ogy,  the  key  generators  of  growth.  Most  critical  in 
this  regard  is  the  security  of  property  and  contract 
rights  for  a  broad  cross  section  of  the  population. 
Simply  put,  investments  require  transactions  where 


a  commodity  at  one  time  or  place  is  traded  for  a 
return  that  occurs  at  another  time  or  place.  Key 
examples  include  borrowing  and  lending,  a 
demander  and  supplier  some  distance  apart,  and 
the  parties  to  an  insurance  policy.  Secure  contract 
rights  (or  contract  enforcement)  underpin  beliefs 
among  economic  actors  that  their  agreement  will 
be  carried  out  in  these  situations.  Secure  property 
rights,  in  addition,  guarantees  that  the  fruits  of 
such  transactions  are  not  at  some  later  point  of 
time  expropriated  by  the  state  or  by  other  eco¬ 
nomic  actors.  Research  on  the  determinants  of 
economic  growth  has  highlighted  the  importance 
of  good  governance  or  quality  of  government  in 
securing  these  contract  and  property  rights. 

Other  positive  consequences  of  good  gover¬ 
nance  have  been  uncovered  in  the  literature.  Some 
of  these  concern  individual-level  phenomena. 
Empirical  studies  show  that  corruption  has  a  nega¬ 
tive  impact  on  ordinary  people’s  feelings  of  trust, 
not  only  in  political  authorities  but  also  in  one 
another  (the  latter  sometimes  being  referred  to  as 
interpersonal  trust  or  social  capital).  The  more 
general  governance  indicators  published  by  the 
World  Bank  Institute  have  even  been  found  to  be 
strongly  related  to  feelings  of  happiness  or  life  sat¬ 
isfaction.  Thus,  people  living  in  countries  with 
better  working  government  institutions  are  on 
average  happier  than  those  living  in  corrupt  and 
ill-governed  countries,  other  things  being  equal. 
Studies  of  peace  and  conflict  have  also  taken  inter¬ 
est  in  the  quality  of  government  institutions.  It  has 
been  argued  that  dysfunctional  government  insti¬ 
tutions  make  both  civil  war  and  interstate  belliger¬ 
ence  more  likely. 

The  literature  on  causes  of  good  governance  or 
quality  of  government  is  most  refined  in  the  area 
of  explaining  the  origins  of  corruption,  but  key 
insights  from  this  work  apply  to  other,  broader 
governance  phenomena  as  well.  There  are  two 
general  approaches  to  understanding  the  preva¬ 
lence  of  corruption  or  other  forms  of  illicit  behav¬ 
ior  on  behalf  of  government  officials.  The  first  is 
the  principal-agent  model.  According  to  this  view, 
a  collective  body  of  actors  (e.g.,  the  top  political 
leadership  in  a  country)  are  the  principals  who 
delegate  the  performance  of  some  government  task 
to  another  collective  body  of  actors  (e.g.,  the 
bureaucracy),  the  agents.  As  in  any  situation  where 
authority  is  being  delegated,  the  problem  from  the 
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perspective  of  the  principal  is  that  the  agents  may 
acquire  specific  information  about  the  task  at  hand 
that  they  do  not  disclose  to  the  principal,  or  that 
they  may  have  other  private  motivations  affecting 
their  behavior  than  the  goals  of  performing  the 
delegated  task.  In  the  case  of  corruption,  adminis¬ 
trators  delegated  the  task  of  collecting  a  tax,  for 
example,  may  collude  with  firms  and  collect  bribes 
rather  than  taxes.  The  principals  will  try  to  monitor 
the  behavior  of  their  agents  by  exacting  legal  rem¬ 
edies  for  those  being  caught  collecting  bribes.  Seen 
from  this  principal-agent  perspective,  the  level  of 
corruption — or,  more  generally,  the  extent  to  which 
good  governance  is  being  upheld — in  a  country  is 
primarily  a  function  of  the  individual  incentives  and 
risk  perceptions  of  the  agents.  The  principal,  in 
turn,  may  affect  these  individual  motivations  by  use 
of  control  instruments  such  as  increasing  the  effec¬ 
tiveness  of  the  monitoring  system  and  the  severity 
of  the  legal  punishment  exacted. 

The  principal-agent  model  of  corruption  (or 
other  forms  of  governance  failures)  has  been  a 
forceful  tool  for  understanding  this  phenomenon, 
but  it  suffers  from  several  limitations.  The  most  seri¬ 
ous  flaw  is  probably  the  implicit  assumption  that 
the  principals  are  benevolent — that  is,  that  their 
overall  goal  is  to  promote  good  governance  and 
quality  of  government  in  the  country.  In  a  more  pes¬ 
simistic  (some  would  say,  more  realistic)  scenario, 
however,  everyone  has  something  to  gain  personally 
from  a  dysfunctional  system  as  long  as  a  large 
enough  body  of  actors  continues  to  play  foul.  This 
may  be  called  the  collective  action  theory  of  good 
governance  or  quality  of  government.  According  to 
this  theory,  government  officials  at  any  hierarchical 
level  (and  even  ordinary  citizens)  may  reason  as  fol¬ 
lows:  True,  the  best  solution  for  all  would  be  to 
have  a  government  free  from  corruption  or  other 
forms  of  dysfunctional  behavior.  But  as  long  as 
most  other  actors  take  bribes,  why  should  I  play 
fair?  As  should  be  clear,  this  structure  of  incentives 
leads  to  a  collective  action  problem:  Despite  the  fact 
that  everyone  realizes  that  another  outcome  would 
be  preferable,  if  most  others  are  expected  to  play 
foul,  everyone  will  play  foul.  To  break  the  deadlock 
of  the  collective  action  dilemma,  actors  have  to  start 
believing  that  a  large  enough  number  of  other 
actors  will  from  now  on  play  fairly. 

The  study  of  good  governance  or  quality  of 
government  is  a  fairly  new  and  rapidly  growing 


field  of  research.  The  future  of  this  field  will  in 
large  part  hinge  on  how  well  the  literature  resolves 
a  number  of  tensions  inherent  in  the  concept  itself. 
These  include,  above  all,  the  tensions  between  nar¬ 
row  (e.g.,  corruption)  and  broader  (e.g.,  impartial 
government  institutions)  concepts  of  good  gover¬ 
nance,  between  functionalist  and  internal  under¬ 
standings,  between  definitions  referring  to  policy 
contents  and  those  referring  to  procedures,  and 
between  definitions  stressing  the  input  and  those 
stressing  the  output  side  of  the  political  system. 

Jan  Teorell 
Lund  University 
Lund,  Sweden 
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Many  academic  observers  would  argue  that  we 
have  witnessed  a  shift  from  government  to  gover¬ 
nance  over  the  past  decenniums,  reflected  in 
“softer”  and  more  indirect  steering  mechanisms. 
Governance  also  involves  more  interaction  between 
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public  authorities  and  other  social  actors.  Still, 
from  a  distance,  the  discussion  seems  rather  con¬ 
fusing,  in  that  different  meanings  of  the  concept  of 
governance  are  used  in  the  literature.  According  a 
few  European  scholars,  the  term  governance  basi¬ 
cally  refers  to  “new”  ideas  related  to  the  involve¬ 
ment  of  society  in  the  process  of  governing. 
According  to  some  American  specialists,  on  the 
other  hand,  the  term  retains  much  of  its  original 
steering  conception.  To  overcome  this  conceptual 
challenge,  one  may  understand  governance  as  con¬ 
sisting  of  four  activities: 

1.  the  process  of  articulating  common  priorities, 

2.  the  process  of  bringing  coherence  to  these 
priorities, 

3.  the  process  of  and  capacity  for  steering,  and 

4.  the  processes  of  establishing  some  form  of 
accountability. 

Based  on  this  perspective,  this  entry  explains  the 
characteristics  of  informal  governance  and  the  dif¬ 
ferent  forms  it  takes  and  discusses  how  informal 
governance  may  be  evaluated  in  terms  of  effective¬ 
ness  and  democracy. 

Governance  can  assume  many  different  forms 
and  can  be  classified  along  different  dimensions. 
Focusing  on  the  extent  of  formality,  one  can  argue 
that  governance  is  formal  when  the  production  of 
authoritative  decisions  and  actions  takes  place  in  a 
single  hierarchical  structure,  such  as  a  democrati¬ 
cally  elected  legislative  assembly  and  government. 

Moving  to  the  opposite  pole,  informal  gover¬ 
nance  means  that  the  production  of  public  values 
arises  from  the  interaction  of  a  plethora  of  public 
and  private  and  collective  and  individual  actors. 
The  mechanisms  in  use  depend,  moreover,  to  some 
extent  on  the  cooperation  of  the  involved  nongov¬ 
ernmental  actors.  Informal  governance  means  that 
participation  in  the  decision-making  process  is  not 
yet  or  cannot  in  fact  be  codified  and  publicly 
enforced.  But  it  may  also  mean  that  a  certain 
group  of  decision  makers  agree  informally  to 
advocate  or  enact  particular  policies,  while  still 
acting  in  formal  decision-making  contexts. 

Informal  governance  is  not  new,  nor  should  it 
be  associated  with  specific  institutions  or  levels  of 
government.  Governments  have  been  engaged  in 
informal  governance  since  time  immemorial,  and 


informal  governance  can  be  found  in  institutions 
such  as  the  European  Union  (EU),  as  in  national, 
regional,  and  local  policy  processes. 

Why  Informal  Governance? 

Although  not  new,  governments  have  come  to  rely 
more  heavily  on  informal  governance.  Many  rea¬ 
sons  exist  for  this  shift  toward  informality.  One  is 
the  declining  trust  in,  and  respect  for,  government 
in  many  countries.  As  most  formal  instruments  of 
governance  rely  on  a  government’s  legitimate 
authority  to  be  effective,  the  waning  of  legitimacy 
makes  traditional  instruments  less  certain.  Like¬ 
wise,  governments  have  experienced  direct  mecha¬ 
nisms  as  costly  and  inefficient,  compared  with  the 
use  of  the  private  sector  as  partners  in  implementa¬ 
tion.  Governments  are  seldom  the  sole  proprietors 
of  expert  knowledge,  often  needing  to  rely  on  other 
institutions’  expert  knowledge,  and  a  wide  range  of 
stakeholders  wish  to  participate  in  the  policy 
process.  Ideological  changes  toward  “new  public 
management”  and  a  “network  society”  have  also 
stressed  the  need  to  move  away  from  direct  govern¬ 
ment  involvement  to  more  indirect  forms  of 
governing  and  informal  governance. 

Some  argue  that  informal  governance  is  pro¬ 
pelled  by  the  EU,  and  to  a  significant  extent,  infor¬ 
mal  governance  can  be  claimed  to  be  a  necessary 
part  of  the  EU  system,  because  the  EU  can  often 
actually  make  policy  by  way  of  informal  routes  and 
networking.  One  example  of  this  is  the  Eurogroup: 
the  circle  of  finance  ministers  arguably  shaping 
European  economic  governance  by  pre-agreeing  on 
all  critical  decisions  made  by  the  council,  deciding 
on  overall  orientation  of  economic  governance  in 
the  euro  area,  and  establishing  common  interpreta¬ 
tions  of  the  European  Monetary  Union’s  (EMU’s) 
core  policy  instruments.  In  spite  of  its  important 
role,  the  Eurogroup  is  seen  as  administratively  pre¬ 
carious  and  only  semirecognized  by  law.  It  is  for¬ 
mally  recognized  that  it  should  meet  informally, 
and  consequently  neither  a  secretariat  nor  a  Euro¬ 
group  mailbox  exists  in  Brussels. 

Forms  of  Informal  Governance 

Even  if  informal  governance  appears  as  a  concept 
in  the  literature,  one  should  not  treat  all  forms  as 
virtually  identical.  These  interventions  in  the  name 
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of,  if  not  by,  the  public  itself  need  to  be  invested 
more  carefully,  and  one  should  attempt  to  put  into 
perspective  the  various  political,  economic,  and 
administrative  logics  of  choice  involved. 

One  way  to  categorize  different  forms  of  infor¬ 
mal  governance,  proposed  by  B.  Guy  Peters,  is  to 
divide  informal  governance  into  soft  law,  networks, 
partnerships,  coproduction,  multilevel  governance, 
and  the  so-called  open  method  of  coordination. 

The  basic  idea  of  soft  law  is  that  the  public  sector 
does  not  establish  all  the  rules  for  action  but  permits 
social  actors  to  bargain  within  a  broader  established 
policy  statement.  One  example  is  negotiated  rule 
making  in  the  United  States,  where  the  affected  par¬ 
ties  and  relevant  government  agencies  are  permitted 
to  bargain  among  themselves  with  regard  to  the 
final  and  operating  rules.  This  commonly  discussed 
form  of  governance  stands  in  contrast  to  the  hard 
law  of  legal  command  and  formal  regulations. 

Network  is  a  second  type  of  informal  gover¬ 
nance,  covering  mechanisms  through  which  state 
and  society  have  become  more  closely  linked. 
Networks  represent  a  broad  and  general  category 
of  interactions  that  is  grounded  in  interdependency 
among  actors  simultaneously  keeping  their  opera¬ 
tional  autonomy  intact.  These  actors  interact  and 
negotiate  over  a  period  of  time,  creating  an  institu¬ 
tional  framework  of  rules,  norms,  shared  knowl¬ 
edge,  and  social  imagination. 

Public-private  partnerships  represent  a  distinct 
subcategory  of  networks  and  a  common  mecha¬ 
nism  for  implementing  public  programs.  The  basic 
idea  of  these  arrangements  is  that  a  relationship  is 
created  between  the  public  sector  and  one  or  more 
private  actors,  intending  to  deliver  a  specific  ser¬ 
vice  jointly.  Most  commonly,  these  arrangements 
materialize  as  written  contracts  between  the 
involved  partners.  Building  or  using  existing  trust 
is  arguably  a  necessary  condition  to  make  these 
partnerships  work. 

Coproduction  is  closely  related  to  partnerships 
and  departs  from  the  idea  that  the  public  sector 
will  cooperate  with  private  actors  in  the  delivery  of 
a  service,  therefore  running  the  risk  of  devolving 
the  necessary  authority  to  these  actors.  Public  pro¬ 
grams  directed  at  “neighborhood  watch”  and  dif¬ 
ferent  kinds  of  user  boards  consisting  of,  for 
example,  school  teachers  and  parents  have  served 
as  examples  of  informal  governance — in  the  form 
of  coproduction. 


Multilevel  governance  is  usually  understood  in 
terms  of  public  actors  who,  through  bargaining, 
make  decisions  and  coordinate  their  actions.  But 
even  if  a  clear  hierarchy  does  not  exist,  multilevel 
governance  still  does  not  by  definition  include 
social  actors.  On  the  other  hand,  the  use  of  these 
interactions,  and  the  way  they  steer  clear  of  the 
traditional  hierarchy,  may  provide  social  actors 
with  greater  opportunities  for  influence. 

The  so-called  open  method  of  coordination,  a 
distinct  form  developed  in  the  EU,  resembles  to 
some  extent  the  form  of  soft  law  discussed  above. 
This  cannot  be  defined  as  a  single  method,  rather 
a  variety  of  mechanisms,  such  as  benchmarking 
and  sharing  best  practices  to  change  the  behavior 
of  the  relevant  actors  (in  the  case  of  the  EU,  the 
governments  within  the  union).  But  the  basic  ideas 
of  benchmarking  apply  much  more  widely  and 
indicate  that  this  model  should  be  understood  as  a 
more  general  form  of  informal  governance. 

Evaluating  Informal  Governance 

Informal  governance  can  be  evaluated  on  different 
terms,  and  the  specific  forms  mentioned  above  will 
to  some  extent  score  differently  depending  on  which 
criterion  is  used.  However,  relevant  literature  on  the 
subject  tends  to  agree  that  informal  governance  has 
consequences  for  its  steering  capacity.  Permitting  a 
wider  range  of  actors  to  become  involved  in,  for 
example,  policy  processes  means  that  their  views 
will  be  known  to  actors  with  a  formal  capacity  to 
make  decisions.  This  may  improve  the  quality  of 
decisions  and,  thereby,  prevent  later  problems  when 
policies  are  actually  implemented.  However,  this 
may  also  slow  down  the  process  and  make  bargain¬ 
ing  among  the  different  actors  more  complicated. 

The  most  problematic  aspects  of  informal  gov¬ 
ernance  are  still  related  to  democracy  and  account¬ 
ability.  Informal  governance,  whatever  specific 
form  it  takes,  means  that  some  social  actors 
become  part  of  the  public  policy  process,  while 
others  do  not.  To  evaluate  informal  governance  in 
democratic  terms,  questions  need  to  be  raised 
about  authority,  participation,  and  assurance: 
Who  has  the  authority  to  invite  participants?  Who 
participates  and  on  what  terms?  And  finally,  how 
are  justice  and  reliability  assured? 

Just  asking  these  questions  leads  one  to  draw 
the  inevitable  conclusion  that  the  existence  of 
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informal  governance  definitely  challenges  core 
ideas  about  democracy.  These  challenges  appear  at 
two  levels.  From  the  viewpoint  of  representative 
democracy,  we  are  entitled  to  ask  whether  infor¬ 
mal  governance  can  be  developed  in  such  a  way 
that  its  foundation  in  representative  government 
remains  clear.  But  informal  governance  may  also 
lead  to  a  broader  discussion  about  democracy  and 
to  whether  the  whole  idea  of  democracy  should  be 
rethought  in  such  a  way  that  informal  governance 
becomes  less  challenging. 

Asbjorn  Roiseland 
Bodo  University  College 
Bodo,  Norway 
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Multilevel  governance  gained  prominence  in  the 
field  of  European  Union  (EU)  studies,  where  it  was 
applied  to  explain  patterns  of  European  integration 


and  the  way  in  which  policy  was  made  and  imple¬ 
mented  within  an  increasingly  populated  European 
political  arena.  Feeding  into  the  broader  gover¬ 
nance  debate,  which  seeks  to  understand  the  chal¬ 
lenges  to  governmental  capacity  presented  by 
processes  such  as  hollowing  out  and  fragmenta¬ 
tion,  multilevel  governance  highlights  the  vertical 
relations  between  actors  and  institutions  across 
various  territorial  levels,  alongside  the  transform¬ 
ing  horizontal  relationships  between  state  and 
nonstate  actors.  Proponents  of  multilevel  gover¬ 
nance  reject  the  state-centricity  of  many  alternative 
accounts  to  highlight  the  blurring  of  boundaries 
between  domestic  and  international  politics, 
wherein  authority  is  increasingly  shared  across  lev¬ 
els.  In  turn,  the  growing  multiplicity  of  both  state 
and  nonstate  participants  and  of  policy  implemen¬ 
tation  is  highlighted.  A  key  empirical  question 
underpinning  theories  of  multilevel  governance  is 
therefore  the  extent  to  which  central  governments 
are  subsequently  losing  their  authority  and  capac¬ 
ity  to  achieve  their  policy  preferences.  This  entry 
identifies  the  commonly  understood  strands  that 
underpin  accounts  of  multilevel  governance  before 
considering  the  various  distinct  forms  that  multi¬ 
level  governance  can  assume.  This  entry  then  pro¬ 
vides  an  account  of  multilevel  governance’s  major 
strengths,  while  highlighting  the  weaknesses  identi¬ 
fied,  to  illustrate  its  utility  as  a  theory  of  political 
science. 

Historical  and  Theoretical  Context 

The  intellectual  origins  of  multilevel  governance 
can  be  found  in  the  study  of  the  EU.  Influenced  by 
the  emerging  perspective  of  the  EU  as  a  distinct 
political  system,  the  term  was  first  used  by  Gary 
Marks  in  1992  to  capture  developments  in 
European  policy  making.  Prior  to  the  1980s,  the 
study  of  the  EU  and  the  process  of  European  inte¬ 
gration  were  dominated  by  approaches  that  were 
influenced  by  the  field  of  international  relations 
(IR).  Drawing  on  the  tradition  of  pluralism  within 
IR  studies,  the  neo-functionalist  approach,  which 
prevailed  in  the  late  1950s  and  early  1960s,  sug¬ 
gested  that  despite  their  initiation  of  the  integration 
process,  national  governments  were  increasingly 
losing  control  in  a  complex  network  of  state  and 
nonstate  actors.  The  neo-functionalist  perspective 
was  not  unanimously  accepted  and  was  countered 
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by  the  intergovernmentalist  approach,  which  drew 
on  the  realist  approaches  in  IR  to  stress  the  ongo¬ 
ing  dominance  of  national  governments  within 
European  structures.  Its  assumptions  were  strength¬ 
ened  by  the  reassertion  of  national  authority  and 
the  continuance  of  national  veto  across  the  major¬ 
ity  of  policy  areas,  and  the  intergovernmentalist 
approach  emerged  as  the  dominant  approach  to 
EU  studies  from  the  1960s  onward.  Elowever,  by 
the  mid-1980s,  the  structures  of  the  EU  had  begun 
to  undergo  a  series  of  significant  reforms,  includ¬ 
ing  the  creation  of  a  single  market  (the  common 
market)  following  the  1986  Single  European  Act 
and  the  continued  extension  of  qualified  majority 
voting  across  a  range  of  policy  areas,  which  in  turn 
reduced  the  scope  of  national  veto.  In  particular, 
the  ongoing  shift  toward  qualified  majority  voting 
influenced  new  perspectives  regarding  the  EU, 
wherein  the  EU  was  perceived  as  a  distinct  political 
system,  with  parallels  to  domestic  political  institu¬ 
tions,  rather  than  simply  as  a  vehicle,  or  process, 
of  integration.  In  turn,  greater  attention  was  given 
to  the  role  of  subnational  and  supranational  actors 
within  EU  policy  making,  which  led  to  a  reap¬ 
praisal  of  traditional  conceptions  of  center-periph¬ 
ery  relationships.  It  was  against  this  theoretical 
and  empirical  backdrop  that  Marks  sought  to 
move  away  from  a  two-dimensional  analysis,  cen¬ 
tered  on  the  upward  flow  of  power  from  national 
to  supranational  actors,  in  order  to  highlight  the 
effects  of  the  downward  seepage  of  power  to  sub¬ 
national  actors  and  their  resultant  impact  on  the 
policy  process.  He  also  drew  on  ideas  regarding 
fragmentation,  policy  networks,  and  the  role  of 
nonstate  actors  to  indicate  the  horizontal,  as  well 
as  the  vertical,  flows  of  power. 

Essence  of  Multilevel  Governance 

The  burgeoning  role  of  subnational  governments 
as  key  actors  in  the  European  political  arena  has 
attracted  increasing  academic  attention,  within 
which  multilevel  governance  has  emerged  as  a  key 
theoretical  response.  A  central  assumption  of  mul¬ 
tilevel  governance  is  that  the  process  of  European 
integration  has  meant  that  authority  and  policy¬ 
making  influence  are  shared  across  multiple  levels 
of  government — subnational,  national,  and  supra¬ 
national.  In  contrast  to  the  earlier  state-centric 
views,  multilevel  governance  has  stressed  the 


apparently  independent  role  played  by  subnational 
actors  in  the  European  policy-making  process,  sug¬ 
gesting  that  regional  governments  have  sufficient 
independence  from  national  governments  to  shape 
their  own  preferences  and  directly  influence  the  EU 
agenda.  In  turn,  as  Marks  has  noted,  national  gov¬ 
ernments  are  perceived  as  no  longer  representing 
the  sole  interface  between  the  domestic  and  supra¬ 
national  arenas.  The  theoretical  focus  of  multilevel 
governance  can  be  located  within  the  governance 
narrative,  which  has  gained  increasing  currency 
since  the  mid-1990s  as  a  way  of  understanding  the 
challenges  of  steering  and  coordinating  policy 
making  and  policy  implementation  within  an 
increasingly  complex  social  system.  Central  to  the 
governance  debate  is  the  ability  of  national  gov¬ 
ernments  to  assert  their  agendas  on  increasingly 
segmented  polities,  wherein  concepts  such  as  hol¬ 
lowing  out  and  fragmentation  have  been  applied 
to  suggest  that  national  governments  have  become 
simply  one  among  a  range  of  competing  authori¬ 
ties.  Multilevel  governance  thus  refers  to  the  nego¬ 
tiated,  nonhierarchical  exchanges  between  institu¬ 
tions  at  the  supranational,  national,  regional,  and 
local  levels,  highlighting  the  vertical  layering  of 
governance  processes  at  these  different  levels. 

As  with  so  many  concepts  and  theories  in  polit¬ 
ical  science,  multilevel  governance  is  without  an 
all-encompassing  definition,  and  disagreement 
exists  regarding  its  scope  and  utility.  Nonetheless, 
there  are  several  commonly  agreed  strands.  A 
shared  starting  point  is  the  acceptance  that  deci¬ 
sion  making  occurs  at  various  territorial  levels  and 
is  characterized  by  the  participation  of  state  and 
societal  actors.  In  turn,  this  has  meant  that  the 
identification  of  the  discrete  and/or  nested  levels  of 
decision  making  has  become  increasingly  difficult 
in  the  context  of  complex,  overlapping  networks. 
Within  this  context,  according  to  Ian  Bache  and 
Matthew  Flinders  (2004),  the  role  of  the  state  is 
under  transformation,  as  emergent  nonstate  actors 
seek  to  develop  new  strategies  of  coordination, 
steering,  and  networking  to  protect,  or  even 
enhance,  their  autonomy.  It  is  important  to  note 
that  multilevel  governance  does  not  necessarily 
confront  the  sovereignty  of  states  directly,  and  it 
casts  national  governments  as  the  most  important 
pieces  of  the  European  puzzle.  Nevertheless,  by 
highlighting  the  inability  of  national  governments 
to  monopolize  the  policy  process,  theories  of 
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multilevel  governance  make  important  judgments 
regarding  state  capacity,  suggesting  that  the  sharing 
of  decision-making  competencies  among  actors  at 
various  geopolitical  levels  has  involved  a  significant 
loss  of  control  for  individual  national  governments. 
This  has  been  compounded  by  the  shift  away  from 
nested  political  arenas  to  increasingly  intercon¬ 
nected  policy  networks,  wherein  subnational  actors 
operate  in  both  the  national  and  international  are¬ 
nas,  creating  a  range  of  transnational  associations. 
Instead  of  being  explicitly  challenged,  EU  states  are 
therefore 

being  melded  into  a  multi-level  polity  by  their 
leaders  and  the  actions  of  numerous  sub-national 
and  supranational  actors,  [and]  one  does  not 
have  to  argue  that  states  are  on  the  verge  of 
political  extinction  to  believe  that  their 
control .  .  .  has  significantly  weakened.  (Liesbet 
Hooghe  &  Gary  Marks,  2001,  p.  27) 

Reflecting  on  their  changing  capacity,  many 
accounts  of  multilevel  governance  have  identified 
the  development  of  a  gatekeeping  role  for  state 
actors,  wherein  national  governments  act  as  a 
mediator  between  subnational  and  supranational 
actors  to  control  access  to  the  European  policy 
arena  in  order  to  achieve  or  defend  its  preferred 
policy  outcomes.  Adopting  the  role  of  gatekeeper  is 
perceived  as  enabling  state  actors  to  address  strate¬ 
gic  alternatives  to  zero-sum  power  struggles,  as  they 
seek  to  move  away  from  “authoritative  allocation 
and  regulation  ‘from  above’  to  the  role  of  partner 
and  mediator”  (Beate  Kohler-Koch,  1996,  p.  371). 

Competing  Models  of  Multilevel  Governance 

In  many  respects,  multilevel  governance  had 
assumed  the  guise  of  a  conceptual  umbrella,  and  it 
had  been  suggested  that  the  breadth  of  its  focus 


was  sometimes  at  the  expense  of  its  explanatory 
power  and  analytical  vigor.  In  particular,  multi¬ 
level  governance  had  been  criticized  for  its  focus  on 
relationships  between  state  actors  across  different 
levels  rather  than  across  different  sectors,  for  being 
overly  descriptive,  and  for  overemphasizing  the 
role  of  subnational  actors  in  the  policy  process. 
Andrew  Jordan  (2001),  for  example,  suggested  that 
while  multilevel  governance  (MLG)  provides  an 
“appealing  picture  of  what  the  EU  looks  like,”  it 
was  “weak  at  explaining  which  levels  are  the  most 
important  and  why,  and  what  actually  motivated 
the  experiment  in  governance  in  the  first  place” 
(p.  194).  This  led  him  to  conclude  that  “the  most 
we  can  currently  say  about  MLG  is  that  it  needs  to 
be  subject  to  a  great  deal  more  case-study  testing 
before  it  can  be  adopted  as  a  general  account  of 
how  (parts  of)  the  EU  operate(s)”  (p.  204). 

In  response  to  such  criticisms,  Hooghe  and 
Marks  sought  to  delineate  two  contrasting  views 
of  the  organization  of  multilevel  governance, 
which  they  labeled  simply  Type  I  and  Type  II,  sum¬ 
marized  in  Table  1.  Type  I  multilevel  governance 
focuses  on  formal  intergovernmental  relationships 
at  a  limited  number  of  levels.  These  jurisdictions 
are  general  purpose,  bundling  together  a  range  of 
policy  responsibilities,  and  the  membership  bound¬ 
aries  of  each  jurisdiction  do  not  intersect. 
Alternatively,  Type  II  is  composed  of  specialized 
jurisdictions  wherein  governance  arrangements  are 
organized  around  functionally  specific  governance 
networks  (Hooghe  &  Marks,  2003). 

As  this  brief  overview  suggests,  Type  I  and 
Type  II  multilevel  governance  have  different  intel¬ 
lectual  foundations,  and  therefore,  they  seek  to 
explain  different  manifestations  of  the  policy  pro¬ 
cess.  With  its  emphasis  on  power  sharing  among  a 
limited  number  of  formal  governmental  levels — 
international,  national,  regional,  meso,  and  local — 
Type  I  multilevel  governance  has  been  influenced 


Table  1  Types  of  Multilevel  Governance 

Type  I 

Type  II 

General-purpose  jurisdictions 

Nonintersecting  memberships 

Jurisdictions  at  a  limited  number  of  intervals 
Systemwide  architecture 

Task-specific  jurisdictions 

Intersecting  memberships 

No  limit  to  the  number  of  jurisdictional  levels 

Source:  Hooghe  and  Marks  (2003,  p.  235). 
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by  theories  of  federalism,  which  have  been  devel¬ 
oped  and  applied  beyond  the  parameters  of  indi¬ 
vidual  nation-states.  Type  I  multilevel  governance 
is  characterized  as  a  “nested”  or  “Russian  doll” 
model,  wherein  there  is  only  one  relevant  jurisdic¬ 
tion  at  any  particular  territorial  scale,  which  is 
intended  to  be  stable  for  several  decades  or  more. 
The  bundling  together  of  policy  functions  in  a  sys¬ 
tem  of  Type  I  multilevel  governance  is  perceived  to 
gain  the  benefits  of  varying  territorial  scale  while 
minimizing  the  number  of  jurisdictions  to  be  coor¬ 
dinated.  Within  this  typology,  nation-states  remain 
of  central  importance,  reflecting  the  emphasis  given 
to  national  boundaries  as  a  key  organizer  of  politi¬ 
cal  activity,  and  the  focus  of  analysis  is  therefore  on 
individual  governments  or  institutions  rather  than 
on  specific  policies  or  issues.  Alternatively,  Type  II 
multilevel  governance  reflects  too  many  of  the  con¬ 
cepts  associated  with  the  governance  narrative  to 
present  a  fluid  and  complex  picture  of  governance. 
Unlike  the  general  purpose  design  of  Type  I  multi¬ 
level  governance,  jurisdictions  in  Type  II  multilevel 
governance  are  task  specific  and  organized  around 
explicit  problems  or  policy  areas,  their  organiza¬ 
tion  being  flexible  as  governance  demands  change. 
These  jurisdictions  can  entail  intersecting  member¬ 
ships  and  can  be  organized  across  a  range  of  juris¬ 
dictional  levels,  without  being  constrained  by  the 
nested  hierarchical  structures  associated  with  Type 
I  multilevel  governance.  Type  II  governance 
structures  are  intended  to  secure  optimum  effi¬ 
ciency  in  policy  making  and  will  be  discontinued 
once  they  have  served  their  purpose,  in  accor¬ 
dance  with  the  emphasis  on  flexibility.  Each 
typology  is  therefore  based  on  a  different  concep¬ 
tion  of  power.  Type  I  is  suited  to  dealing  with 
zero-sum  conflict  about  basic  values,  because  it 
facilitates  logrolling  and  cross-issue  trading. 
However,  Type  II  is  effective  in  solving  ad  hoc 
coordination  problems  among  individuals  sharing 
the  same  geographical  or  functional  space  (Marks 
&  Hooghe,  2004).  Nonetheless,  the  two  typolo¬ 
gies  are  not  mutually  exclusive,  and  often,  Type  II 
governance  is  embedded  within  a  Type  I  struc¬ 
ture,  for  example,  general-purpose  jurisdictions 
can  coexist  alongside  special-purpose  jurisdic¬ 
tions;  and  formal  institutions  of  government  can 
actively  assign  functions  to  special-purpose  bod¬ 
ies  charged  with  addressing  particular  problems. 
The  result  of  this  has  been  described  as  a 


“baroque  patchwork  of  Type  II  jurisdictions 
overlaying  a  nested  pattern  of  Type  I  jurisdic¬ 
tions”  (Hooghe  &  Marks,  2003,  p.  238). 

Strengths  and  Weaknesses 

Multilevel  governance  has  made  an  important  con¬ 
tribution  to  political  theory  and  has  utility  across  a 
range  of  empirical  settings.  It  has  drawn  on  a  range 
of  ideas  and  concepts  in  political  science — such  as 
internationalization,  regionalization,  complexity, 
governance,  and  policy  networks — to  highlight  the 
transdisciplinary  approach  that  is  needed  in  analyz¬ 
ing  many  contemporary  issues  and  challenges.  In 
particular,  the  theory  bridges  the  traditional  dichot¬ 
omy  between  domestic  and  international  politics  to 
emphasize  the  “blurring  of  the  distinction  between 
the  two  through  the  process  of  European  integra¬ 
tion”  (Bache,  2008,  p.  28).  Although  developed  in 
the  context  of  EU  policy  making,  multilevel  gover¬ 
nance  has  also  gained  increasing  salience  across  a 
range  of  geopolitical  settings,  being  perceived  as  a 
far  more  relevant  way  of  understanding  intergov¬ 
ernmental  relationships  by  moving  beyond  the 
characterization  of  the  zero-sum  nature  of  intergov¬ 
ernmental  relations  implicit  in  previous  approaches. 
While  it  remains  contested,  the  broad  appeal  of 
multilevel  governance  reflects  a  shared  concern  with 
the  role  of  nonstate  actors,  the  growth  of  intersect¬ 
ing  jurisdictions,  increased  complexity  across  the 
policy  arena,  and  the  challenges  posed  to  state 
power.  In  this  respect,  multilevel  governance  has 
made  an  important  contribution  to  the  broader 
governance  debate,  providing  a  theoretical  model, 
the  robustness  of  which  has  been  reinforced  by  the 
later  distinction  between  Type  I  and  Type  II,  which 
highlights  the  relationship  between  vertically  orga¬ 
nized  territorial  units  of  governance  on  the  one 
hand  and  the  interplay  between  task-specific,  hori¬ 
zontally  organized  jurisdictions  on  the  other. 
Multilevel  governance  has  therefore  been  seen  to 
provide  a  way  of  understanding  the  interaction  of 
complexity  within  and  between  all  jurisdictional 
levels,  highlighting  the  role  of  state  and  nonstate 
actors.  In  this  sense,  multilevel  governance  can  be 
seen  as  having  been  successful  in  moving  beyond 
simplistic  conceptions  of  the  linear  exercise  of 
power  that  was  implicit  in  earlier  intergovernmen- 
talist  approaches  by  focusing  attention  on  the 
impact  of  processes  such  as  supranationalization, 
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decentralization,  devolution,  and  the  dispersal  of 
authority  to  semiautonomous  bodies.  In  turn,  mul¬ 
tilevel  governance  can  be  used  to  sharpen  questions 
regarding  the  mechanisms  and  strategies  used  to 
govern  within  a  system  of  multilevel  governance  not 
just  by  national  government  actors  seeking  to 
ensure  their  influence  in  an  increasingly  complex 
policy  arena  but  also  by  regional  and  subnational 
actors  who  similarly  seek  to  ensure  their  influence. 

Nonetheless,  there  have  been  several  criticisms 
of  multilevel  governance  as  a  way  of  understand¬ 
ing  contemporary  policy  making  and  implementa¬ 
tion.  While  posited  as  a  counter  to  state-centric 
approaches,  there  are  criticisms  that  multilevel 
governance  has  overstated  the  erosion  of  the  gov¬ 
erning  capacity  of  nation-state  actors,  and  it  has 
been  suggested  that  an  emphasis  on  the  plurality 
of  competing  interests  has  led  to  “fundamental 
questions”  about  the  structure  of  power  relations 
being  “downplayed”  (Paul  Stubbs,  2005).  It  has 
also  been  suggested  that  multilevel  governance 
has  been  overly  descriptive  at  the  expense  of  ana¬ 
lytical  vigor  (e.g.,  Jordan,  2001).  This  was 
acknowledged  by  several  exponents  of  multilevel 
governance,  contributing  to  the  delineation 
between  Type  I  and  Type  II  multilevel  governance 
discussed  above;  see,  for  example,  Hooghe  and 
Marks  (2003).  Nevertheless,  despite  attempts  at 
refinement,  there  is  uncertainty  regarding  the  abil¬ 
ity  of  multilevel  governance  to  “travel”  across 
alternative  geopolitical  settings,  with  a  distinction 
being  made  between  the  “theoretical  vagueness”  of 
multilevel  governance  as  a  concept,  which  can 
travel,  and  its  precise  and  “rigid”  modeling,  which 
is  unsuited  to  different  contexts  (Stubbs,  2005).  A 
pronounced  normative  dimension  has  also  been 
identified  within  several  later  accounts  of  multi¬ 
level  governance.  Despite  suggesting  that  “there  is 
no  agreement  about  how  multi-level  governance 
should  be  organised,”  Marks  and  Hooghe  (2004) 
have  argued  that 

the  dispersion  of  governance  across  multiple 
jurisdictions  is  both  more  effective  than,  and 
normatively  superior  to  [italics  added],  central 
state  monopoly  .  .  .  that  governance  must  operate 
at  multiple  scales  in  order  to  capture  variations 
intheterritorialreachofpolicyexternalities.  .  . .  To 
internalize  externalities,  governance  must  be 
multi-level,  (p.  16) 


The  normative  justification  for  the  application 
of  multilevel  governance  has  been  perceived  as  a 
“very  dubious  elision”  of  “what  governance 
should  be,  whilst  a  legitimate  question  in  norma¬ 
tive  political  science,  is  of  a  different  order  to 
questions  of  what  governance  is”  (Stubbs,  2005, 
p.  69).  Thus,  as  Bache  and  Flinders  have  argued, 
it  is  therefore  important  to  distinguish  between 
multilevel  governance  as  an  analytical  mode  and  a 
normative  concept. 

Role  in  Future  Political  Analysis 

While  some  aspects  remain  contested,  multilevel 
governance  has  emerged  as  an  important  way  of 
making  sense  of  policy  making  and  implementa¬ 
tion  in  an  increasingly  crowded  polity.  Although 
its  roots  are  in  the  study  of  the  EU,  the  way  in 
which  multilevel  governance  bridges  the  divide 
between  the  study  of  international  and  domestic 
politics  has  meant  that  its  utility  has  extended 
beyond  the  field  of  Europeanization  and  into  a 
range  of  alternative  analytical  areas.  To  promote 
the  ongoing  salience  of  multilevel  governance,  a 
key  challenge  will  be  in  ensuring  that  it  remains 
responsive  to  the  organic  evolution  of  the  political 
world  that  it  seeks  to  explain.  Returning  to  the 
study  of  the  EU,  the  2004  enlargement  saw  10 
further  countries  formally  join  the  EU,  which  were 
joined  by  Bulgaria  and  Romania  in  2007;  and 
accession  negotiations  are  going  on  with  several 
other  states.  As  the  EU  continues  to  expand  east,  it 
is  likely  that  questions  will  emerge  regarding  the 
possibility  of  stretching  a  Type  I  jurisdiction  of  an 
EU  that  comprises  upward  of  27  countries  or  the 
extent  to  which  Type  II  governance  arrangements 
will  be  used  in  policy  making  and  implementation. 
Another  emergent  issue  is  the  extent  to  which  the 
changing  context  of  policy  making,  as  perceived 
within  accounts  of  multilevel  governance,  has 
challenged  traditional  conceptions  of  democratic 
accountability.  In  turn,  this  necessitates  their  reap¬ 
praisal,  as  while  different  forms  of  multilevel  gov¬ 
ernance  may  add  to  the  legitimacy  of  policy  mak¬ 
ing  through  increased  efficiency,  this  may  reduce 
democratic  legitimacy  unless  new  means  of  con¬ 
necting  citizens  with  these  shifting  locations  of 
power  are  identified.  Finally,  as  noted  by  Bache 
and  Flinders,  there  is  some  evidence  that  sug¬ 
gests  the  emergence  of  unintended  consequences 
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arising  from  forms  of  multilevel  governance  that 
are  beyond  state  control,  and  a  key  future 
research  agenda  will  entail  making  sense  of  their 
occurrence. 
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The  term  governance  is  increasingly  used  in  urban 
politics,  policy,  and  administration  to  signify 
activities  related  to  public  policy  that  are  under¬ 
taken  jointly  by  multiple  actors,  including  some 
outside  government.  Gerry  Stoker  (2000)  defines 
governance  as  “working  across  boundaries  within 
the  public  sector  or  between  the  public  sector  and 
private  or  voluntary  sectors”  (p.  93).  Although 
the  literature  has  dealt  with  governance  at  differ¬ 
ent  levels,  the  idea  has  special  relevance  at  the 
local  level,  where  solving  problems  or  creating  the 
capacity  to  act  often  involves  mobilizing  support 
from  other  sectors  and  other  levels  of  government. 
This  entry  first  discusses  the  evolution  and  types 
of  governance  and  then  focuses  on  models  of 
urban  governance. 

Why  Governance? 

Social  and  economic  changes  have  increased  the 
permeability  of  policy  making.  The  literature 
points  to  forces  such  as  globalization  and  fiscal 
pressures  across  advanced  industrial  democracies, 
particularly  because  of  demographic  change  and 
economic  competition.  Jan  Kooiman  (2000) 
describes  this  complexity  as  the  growing  ungov¬ 
ernability  of  society.  At  the  same  time,  the  demands 
and  expectations  of  citizens  have  increased.  Rising 
education  levels,  more  critical  citizens,  and 
decreased  trust  in  government  have  increased  pres¬ 
sures  for  reform  and  alternative  service  delivery. 

These  reforms  have  opened  up  government  to 
the  market,  and  in  some  cases  it  has  led  to  more 
participation  at  the  grassroots  level.  As  Donald 
Kettl  in  2000  and  B.  Guy  Peters  in  2001  note,  the 
wave  of  privatization  in  the  United  States  in  the 
1980s  and  trends  such  as  new  public  manage¬ 
ment  abroad  or  government  “reinvention”  here 
have  produced  new  contractual  relationships 


Governance,  Urban  1027 


with  private  sector  and  nonprofit  organizations. 
These  changes  have  also  multiplied  quasi- 
governmental  agencies  and  special  districts.  Most 
governance  theory  and  research  is  concerned  with 
describing,  explaining,  and  managing  processes 
within  this  new  context,  particularly  processes  of 
bargaining  and  cooperation. 

While  cities  in  the  decentralized  American  sys¬ 
tem  are  particularly  sensitive  to  business  invest¬ 
ment  decisions,  Stoker  (2000)  contends  that  frag¬ 
mentation  and  interdependence  are  more  generally 
high  in  urban  policy  across  countries.  Problems 
such  as  poverty  and  the  redevelopment  of  dis¬ 
tressed  areas  are  not  easily  solved  by  government 
alone.  Theories  of  local  governance  have  evolved 
more  broadly  to  embrace  other  types  of  local  pol¬ 
icy  partnerships. 

Types  of  Governance 

Two  general  approaches  to  governance  are  identi¬ 
fied  by  Jon  Pierre  (2000),  and  while  these  apply  to 
governance  at  the  national  level,  they  are  useful  for 
thinking  about  models  of  local  governance  as  well. 
One  is  governance  as  goal  definition  and  policy 
making.  Pierre  and  Peters  (2005)  define  it  in  terms 
of  four  activities: 

1.  goal  definition  (articulating  a  common  set  of 
priorities  for  society), 

2.  coherence  (consistency  and  coordination), 

3.  steering  (finding  ways  of  achieving  goals),  and 

4.  accountability. 

In  the  literature  on  local  governance,  urban  regimes 
or  governing  coalitions  set  the  policy  agenda  and 
goals  more  generally  within  the  community,  in 
contrast  to  the  more  specialized  governance  rela¬ 
tions  in  policy  networks. 

On  the  other  hand,  research  on  governance  as 
networks  emphasizes  interactions  within  policy 
domains  such  as  education,  economic  develop¬ 
ment,  or  public  safety,  as  described,  for  example, 
by  Walter  J.  M.  Kickert,  Erik-Hans  Klijn,  and  Joop 
F.  M.  Kopenjan  (1997),  Laurence  J.  O’Toole 
(1997),  Rod  A.  W.  Rhodes  (1997),  R.  A.  W. 
Rhodes  and  David  Marsh  (1992),  and  Fritz  Scharpf 
(1993).  Interdependent  and  cooperative  relation¬ 
ships  in  networks  are  often  depicted  as  distinct 


from  those  in  hierarchies  or  markets,  as  noted  by 
Kooiman  (2000)  and  Rhodes  (1997).  According  to 
Pierre  and  Peters  (2005)  and  Stoker  (2000),  col¬ 
laboration,  of  course,  entails  costs  in  terms  of  time 
and  effort,  and  processes  of  negotiation  and  com¬ 
promise  may  also  produce  outcomes  that  are  not 
congruent  with  the  goals  of  all  participating  orga¬ 
nizations.  The  research  on  urban  governance  as 
policy  networks  is  often  more  prominent  in 
European  research,  as  local  governments  often 
have  less  autonomy  from  the  central  government 
and  play  an  important  role  in  delivering  the  services 
of  the  welfare  state.  But  work  on  civic  capacity  and 
urban  education  in  the  United  States  also  falls  into 
this  category. 

Urban  Regimes  and  Governing 

Models  of  local  governance  arose  in  response  to 
earlier  debates  on  community  power  and  contem¬ 
porary  debates  over  the  role  of  economic  develop¬ 
ment  in  American  local  government.  In  1987  John 
Logan  and  Harvey  Molotch,  for  example,  depicted 
urban  governance  as  occurring  through  local 
growth  coalitions,  composed  of  city  officials  and 
locally  dependent  businesses  such  as  newspapers, 
utilities,  banks,  and  developers.  The  most  influen¬ 
tial  framework  for  understanding  cross-sectoral 
governance  at  the  local  level  has  been  urban 
regime  analysis.  A  “regime”  connotes  a  set  of  gov¬ 
erning  arrangements,  and  in  its  usage  over  the  past 
few  decades,  it  indicates  cooperation  across  insti¬ 
tutional  boundaries,  beyond  the  formal  apparatus 
of  government.  According  to  Karen  Mossberger 
and  Gerry  Stoker  (2001),  collaboration  is  a 
response  to  the  fragmentation  of  authority  and 
interdependence  between  the  policy-making  capac¬ 
ity  of  democratic  institutions  and  the  wealth-gen¬ 
erating  resources  of  the  market  economy. 

Early  versions  of  regime  analysis  emphasized 
cross-sectoral  goal  setting  and  steering  rather  than 
network,  for  example,  historical  variation  in 
regimes,  and  suggest  change  over  time  in  goals, 
resources,  and  partners  in  American  cities.  Their 
typology  described  directive  regimes  during  the 
period  of  urban  renewal,  concessionary  regimes  in 
the  1960s,  and  entrepreneurial  regimes  that 
responded  to  later  cuts  in  federal  aid  and  capital 
mobility  with  economic  development  initiatives. 
For  Stephen  Elkin,  regimes  are  also  goal  setting  and 
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steering  in  the  most  fundamental  sense.  They  are 
“constitutive”  or  formative  institutions  that  shape 
the  interaction  between  the  democratic  state  and 
the  privately  controlled  market.  Elkin  also  depicts 
historical  and  geographic  variation  in  regime  types. 

According  to  Clarence  Stone  (1989),  urban 
regimes  are  governing  arrangements  also;  but 
more  concretely,  they  are  carried  out  by  a  govern¬ 
ing  coalition,  “an  informal  yet  relatively  stable 
group  with  access  to  institutional  resources  that 
enable  it  to  have  a  sustained  role  in  making  gov¬ 
erning  decisions”  (p.  4).  Stone’s  work  echoes  many 
of  the  same  themes  found  in  the  network  litera¬ 
ture — interdependence,  congruent  goals,  the  mobi¬ 
lization  of  resources,  negotiation,  and  mutual 
adjustment  of  goals  through  interaction  over  time. 
The  social  production  model  of  power  in  regime 
analysis  is  necessary  to  overcome  the  fragmenta¬ 
tion  of  authority  and  resources.  Regimes  embody 
the  “power  to  act”  or  the  “capacity  to  act”  rather 
than  simply  domination  over  other  actors. 
According  to  Stone,  the  goal-setting  and  steering 
aspects  of  regimes  are  significant — urban  regimes 
are  more  than  economic  development  coalitions: 
They  are  governing  coalitions  that  set  priorities  for 
the  city  as  a  whole.  Over  time,  Stone  (2005)  has 
broadened  the  concept,  arguing  that  regimes  may 
vary  in  their  stability  and  that  cross-sectoral  par¬ 
ticipation  does  not  require  the  presence  of  busi¬ 
ness.  The  participation  of  other  groups  varies  with 
the  policy  agenda  of  the  regime. 

Stone  and  his  collaborators  on  education  reform 
have  formulated  the  concept  of  “civic  capacity,” 
applied  to  cooperation  within  a  more  specific 
local  policy  network  (rather  than  a  governing 
coalition).  Civic  capacity  is  needed  “to  devise  and 
employ  formal  and  informal  mechanisms  to  col¬ 
lectively  solve  problems,”  but  it  also  engages  a 
broader  range  of  actors  than  governing  coalitions, 
including  neighborhood  residents  and  community 
groups  as  well  as  policy  professionals,  universities, 
nonprofit  organizations,  businesses,  and  other 
stakeholders  in  the  particular  issue.  Influencing 
policy  requires  changing  governing  arrangements 
within  policy  subsystems  rather  than  winning  spo¬ 
radic  victories  through  interest  group  pressure. 
Some  other  scholars  have  used  elements  of  urban 
regime  analysis  to  explain  cooperation  in  local 
policy  networks — for  example,  in  AIDS  policy. 


The  range  of  actors,  the  motivations,  and  the  dif¬ 
ficulty  of  mobilizing  resources  and  maintaining 
collaboration  would  be  expected  to  differ  across 
policy  areas. 

Other  Models  of  Urban  Governance 

At  the  same  time  that  urban  regime  analysis  devel¬ 
oped  in  the  United  States,  economic  restructuring 
and  changes  in  the  welfare  state  in  many  countries 
increased  pressures  at  the  local  level  and  the  possi¬ 
bility  that  urban  regime  analysis  could  explain  new 
forms  of  governance  appearing  in  many  places. 
One  study  of  11  cities  in  the  United  States, 
Germany,  and  France  found  stable  urban  regimes 
in  all  three  countries,  particularly  where  there  were 
locally  dependent  businesses.  There  are  also 
instances  where  no  stable  regime  has  formed  in  any 
of  the  countries  in  the  study. 

The  cross-national  use  of  the  urban  regime  con¬ 
cept  poses  some  challenges,  however  (see  Moss- 
berger  &  Stoker,  2001).  European  institutional 
contexts  offer  markedly  different  patterns  of 
resources,  authority,  and  incentives  for  cross- 
sectoral  collaboration  at  the  local  level.  In  Europe, 
greater  fiscal  support  from  central  governments, 
more  public  ownership  of  municipal  land,  and  other 
factors  lessen  the  need  for  business  involvement  as 
a  critical  factor  for  the  “power  to”  achieve  develop¬ 
ment  and  support  local  services.  In  some  countries, 
business  is  more  centralized  (as  in  the  United 
Kingdom),  or  corporatist  arrangements  are  largely 
worked  out  through  the  national  parties.  Stronger 
parties  more  generally  lend  a  national  character  to 
political  relationships  at  the  local  level,  in  contrast 
with  the  United  States.  Partnerships  in  small  U.K. 
cities,  for  example,  appear  to  be  the  result  of  top- 
down  initiatives  from  central  government  rather 
than  an  expression  of  the  social  production  model 
of  power  at  the  local  level.  Economic  development 
partnerships  are  apparent  in  several  Western 
European  countries,  but  despite  some  parallels  with 
urban  regimes,  the  central  government  often  played 
a  large  role  in  the  formation  and  priorities  of  the 
partnerships.  And  European  instances  of  cross-sec¬ 
toral  cooperation  are  not  always  setting  agendas  for 
the  city  as  a  whole.  Furthermore,  economic  devel¬ 
opment  policy  in  Britain  tends  to  resemble  a  discrete 
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policy  network  with  strong  vertical  links  to  the 
central  government  as  well  as  business  participa¬ 
tion.  As  such,  it  appears  to  be  closer  to  the  model  of 
networks  within  policy  domains  rather  than  a  gov¬ 
erning  coalition  that  influences  citywide  priorities. 

Pierre  (2005)  and  Alan  DiGaetano  and  Elizabeth 
Strom  (2003)  have  argued  for  a  broader  concept  of 
urban  governance  that  is  more  comparative.  There 
has  been  a  burgeoning  literature  on  local  gover¬ 
nance  across  many  countries,  conceived  of  as  a  local 
multi-actor  collaboration  that  fits  either  policy  net¬ 
works  or  governing  models.  Studies  show  evidence 
of  trends  toward  governance  at  the  local  level  in 
most  advanced  industrialized  countries  and  in  some 
other  countries  as  well.  One  edited  volume  contain¬ 
ing  studies  of  a  dozen  Organisation  for  Economic 
Co-operation  and  Development  (OECD)  countries 
reveals  the  presence  of  public-private  partnerships 
in  10  of  them.  The  authors  caution  that  these  “may 
not  look  like  urban  regimes”  but  that  there  is 
increased  local  economic  development  activity  and 
greater  dependence  on  other  organizations  outside 
of  government  at  the  local  and  regional  levels  (see 
Robin  Hambleton  &  Jill  Gross,  2007). 

Conclusion 

Concepts  of  urban  governance  have  been  useful  for 
describing  patterns  of  urban  policy  making  in  both 
citywide  agenda  setting  and  in  local  policy  domains. 
While  forms  of  governance  differ  across  countries, 
the  concept  has  described  trends  toward  policy 
making,  mobilizing  the  resources  of  multiple  actors 
in  governing  coalitions  or  networks.  There  are 
many  questions  about  governance  that  bear  further 
scrutiny — the  role  and  legitimacy  of  actors  outside 
of  government,  the  costs  and  difficulty  of  coordina¬ 
tion,  the  consequences  of  unequal  power  within 
coalitions  and  networks  for  representation  and  for 
problem  solving,  and  diffused  accountability  for 
achieving  public  purposes.  All  of  these  are  impor¬ 
tant  questions  for  empirical  investigation  and  for 
theory,  across  different  countries,  policy  areas,  and 
forms  of  urban  governance. 
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Governance  is  usually  defined  as  the  processes 
through  which  a  plurality  of  actors  aim  to  produce 
and  deliver  public  purpose  in  the  broad  sense  of 
political  visions,  operative  plans,  problem  defini¬ 
tions,  policy  solutions,  infrastructures,  regulations, 
resource  allocations,  and  public  services.  Governance 
can  be  provided  by  unicentric  forms  of  government 
based  on  hierarchical  command  structures.  It  may 


1030  Governance  Networks 


also  involve  forming  multicentric  forms  of  quasi 
markets  where  public  and/or  private  actors  compete 
to  deliver  goods  and  services  on  terms  defined  by 
elected  governments.  Finally,  governance  may  take 
the  form  of  pluricentric  partnerships  and  networks 
in  which  policy  decisions  are  reached  through  nego¬ 
tiated  interaction  among  interdependent  actors. 

The  study  of  governance  networks  constitutes  a 
novel  research  field  based  on  the  “discovery”  of 
nonhierarchical  forms  of  governance  involving  net¬ 
works  of  public  and/or  private  actors.  The  basic 
argument  prompting  the  recent  surge  in  governance 
network  research  is  that  public  policy,  defined  as 
common  attempts  to  achieve  a  desired  outcome,  is 
the  result  of  governing  processes  that  are  no  longer 
controlled  by  the  government,  instead  involving  a 
wide  range  of  relevant  and  affected  actors  engaged 
in  ongoing  negotiations  that  give  rise  to  a  stable 
pattern  of  horizontal  interaction  constituting  a  spe¬ 
cific  mode  of  coordination.  It  is  this  “heter archie” 
mode  of  coordination  that  is  dubbed  “governance 
networks”  in  the  academic  literature. 

The  focus  on  governance  networks  has  an 
empirical  background  in  the  widespread  recogni¬ 
tion  of  the  increasingly  fragmented,  complex,  and 
dynamic  character  of  contemporary  society. 
Fragmentation  increases  as  a  result  of  the  func¬ 
tional  differentiation  of  society  into  relatively 
autonomous  subsystems  and  organizational  enti¬ 
ties.  At  the  same  time,  the  breakup  of  the  sedi¬ 
mented  forms  of  political  alignment  and  identifica¬ 
tion  spurs  multiplication  of  political  actors  and 
identities.  Complexity  increases  as  a  result  of  the 
growth  and  interweaving  of  crosscutting  policy 
problems,  and  it  is  further  augmented  by  the 
advent  of  new  forms  of  risk  and  the  blurring  of  the 
boundaries  between  institutions,  sectors,  and 
scales.  Finally,  new  societal  dynamics  are  created 
by  the  proliferation  and  interconnection  of  new 
spatial  and  temporal  horizons  of  action.  The  con¬ 
tingent  articulation  and  interaction  of  the  different 
rationalities,  procedures,  and  strategies  for  solving 
policy  problems  and  exploiting  new  opportunities 
produces  new  and  unpredictable  developments. 
The  result  of  the  growing  fragmentation,  complex¬ 
ity,  and  dynamism  is  an  increasing  ungovernability 
of  society  that  makes  it  difficult  to  meet  the  grow¬ 
ing  steering  ambitions  demanding  that  public  gov¬ 
ernance  be  flexible,  knowledge  based,  targeted, 
responsive,  and  inexpensive.  The  mobilization  of 
the  knowledge,  resources,  and  energies  of  relevant 


and  affected  actors  through  governance  networks 
offers  a  promising  way  out  of  this  impasse. 

Leading  politicians  and  executive  administra¬ 
tors  seem  to  have  taken  governance  networks  to 
their  hearts.  Governments  around  the  world  and  at 
different  levels  increasingly  aim  to  govern  at  a  dis¬ 
tance  by  creating  arenas  for  self-governance  and 
self-regulation.  Public  agencies  and  managers  are 
becoming  linked  through  new  initiatives  aiming  to 
promote  joined-up  government.  Last  but  not  least, 
public  and  private  actors  are  brought  together 
through  the  formation  of  different  kinds  of  part¬ 
nerships  and  collaborative  governance. 

After  closer  inspection  of  the  definition  of  gover¬ 
nance  networks  and  a  brief  account  of  the  theoreti¬ 
cal  roots  of  the  concept  of  governance  networks, 
some  caveats  regarding  the  discussion  of  gover¬ 
nance  networks  are  presented  here.  This  is  followed 
by  an  assessment  of  the  merits  and  problems  of 
governance  networks  in  public  policy  making. 
Finally,  some  of  the  main  theories  of  governance 
networks  are  presented  before  a  concluding  discus¬ 
sion  of  the  major  themes  in  the  research  on  gover¬ 
nance  networks. 

Defining  Governance  Networks 

Governance  networks  can  be  defined  as  (a)  a  rela¬ 
tively  stable  horizontal  articulation  of  interdepen¬ 
dent  but  operationally  autonomous  actors,  (b)  that 
interact  with  one  another  through  ongoing  negotia¬ 
tions  (c)  that  take  place  within  a  regulative,  norma¬ 
tive,  cognitive,  and  imaginary  framework; 
(d)  facilitate  self-regulation  within  the  limits  set  by 
external  forces;  and  (e)  contribute  to  the  produc¬ 
tion  of  public  purpose.  Other  defining  features 
might  be  added,  but  this  definition  captures  the 
essence  of  what  are  commonly  referred  to  as  gover¬ 
nance  networks. 

This  entry  attempts  to  unpack  this  dense  defini¬ 
tion  of  governance  networks  by  considering  each 
defining  feature  in  turn.  First,  governance  networks 
articulate  a  number  of  public,  semipublic,  and  pri¬ 
vate  actors  who,  on  the  one  hand,  are  dependent 
on  one  another’s  resources  and  capacities  to  get 
things  done  and,  on  the  other  hand,  are  operation¬ 
ally  autonomous  in  that  they  are  not  commanded 
by  superiors  to  act  in  a  certain  way.  The  interde¬ 
pendent  relations  between  the  network  actors 
mean  that  they  are  horizontally,  rather  than  verti¬ 
cally,  related.  However,  the  horizontal  relations 
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between  the  network  actors  do  not  imply  equality 
in  terms  of  authority  and  resources.  There  might 
be  an  asymmetric  allocation  of  material  and  imma¬ 
terial  resources  among  the  network  actors,  but 
since  participation  is  voluntary  and  the  actors  are 
free  to  leave  the  network — and  since  the  actors  are 
mutually  dependent — no  single  actor  can  fully 
exercise  its  power  to  exert  hierarchical  control 
over  anyone  else. 

Second,  members  of  governance  networks  inter¬ 
act  through  negotiations  that  combine  elements  of 
bargaining  and  deliberation.  The  network  actors 
may  bargain  over  the  distribution  of  resources  to 
maximize  outcomes;  however,  to  facilitate  some 
degree  of  cooperation,  coordination,  and  collabo¬ 
ration,  the  bargaining  process  must  be  embedded 
in  a  deliberative  process  that  facilitates  trust  build¬ 
ing,  reciprocal  recognition,  learning,  and  common 
understanding.  Nevertheless,  deliberation  within 
governance  networks  will  seldom  lead  to  consen¬ 
sus,  since  it  transpires  within  a  context  of  power 
struggles  that  breed  conflict  and  social  antago¬ 
nism.  Hence,  outcomes  involve  a  collective  agree¬ 
ment-based  compromise  and  tacit  acceptance  of 
decisions  despite  grievances. 

Third,  the  negotiated  interaction  between  the 
network  actors  does  not  take  place  in  an  institu¬ 
tional  vacuum.  Rather,  it  proceeds  within  a  rela¬ 
tively  institutionalized  framework  that  is  shaped 
and  reshaped  in  the  course  of  action.  In  the  begin¬ 
ning,  when  a  governance  network  is  formed,  there 
is  no  agreed-on  institutional  framework  that  deter¬ 
mines  where  and  how  a  legitimate  decision  is  to  be 
taken.  However,  the  ongoing  interaction  of  net¬ 
work  actors  will  lead  to  the  formulation  of  an 
incomplete  and  precarious  framework  of  rules, 
norms,  values,  and  ideas  that  condition  the  future 
interaction  in  the  network  by  defining  appropriate 
action.  ( Value  refers  to  everything  that  is  consid¬ 
ered  to  have  a  value,  be  it  material,  ideological, 
ethical,  or  other,  by  most  people/citizens.) 

Fourth,  governance  networks  are  relatively  self¬ 
regulating,  as  they  are  not  part  of  a  hierarchical 
chain  of  command  or  components  of  markets. 
Rather,  they  aim  at  regulating  a  particular  policy 
field  or  problem  area  based  on  their  own  ideas, 
resources,  and  dynamic  interactions,  and  they  do 
so  within  a  regulative,  normative,  cognitive,  and 
imaginary  framework  that  is  adjusted  through 
negotiations  between  the  participating  actors. 
Nevertheless,  governance  networks  always  operate 


in  a  shadow  of  hierarchy  as  public  authorities  and 
other  legitimate  and  resourceful  actors  aim  to 
facilitate,  shape,  and  constrain  the  network’s  pref¬ 
erence  and  capacity  for  self-regulation.  Governance 
networks  that  are  metagoverned  in  this  way  only 
enjoy  a  bounded  autonomy. 

Fifth,  governance  networks  contribute  to  the 
production  of  public  purpose  within  a  certain  policy 
area.  Public  purpose  is  an  ensemble  of  visions,  val¬ 
ues,  plans,  policies,  and  regulations  that  are  claimed 
to  be  valid  for,  and  directed  toward,  the  general 
public.  As  such,  the  network  actors  are  engaged  in 
political  negotiations  about  how  to  identify  and 
solve  emerging  policy  problems  and  provide  new 
opportunities.  Networks  that  do  not  contribute  to 
the  production  of  public  purpose  in  this  broad  sense 
cannot  be  counted  as  governance  networks. 

Governance  networks,  as  defined  above,  may 
have  different  functions.  Some  governance  net¬ 
works  merely  contribute  to  the  exchange  of  knowl¬ 
edge,  ideas,  and  experiences,  while  other  networks 
aim  to  coordinate  action  in  order  to  avoid  conflicts 
and  duplication  of  efforts.  Some  governance  net¬ 
works  might  even  attempt  to  arrive  at  a  common 
understanding  of  emerging  policy  problems  and 
formulate  joint  solutions.  Governance  networks 
may  also  take  different  forms  as  they  grow  autono¬ 
mously  from  below  or  are  initiated  from  above,  are 
open  or  closed,  are  loosely  connected  or  highly 
integrated,  are  short-lived  or  relatively  permanent, 
and  have  a  sector-specific  or  society-wide  scope. 
Finally,  governance  networks  have  many  different 
labels  as  they  are  frequently  referred  to  as  planning 
cells,  think  tanks,  public  boards  and  committees, 
commissions,  partnerships,  and  so  on.  The  different 
labels,  forms,  and  functions  of  governance  net¬ 
works  attest  to  the  broad  relevance  of  the  concept 
for  describing  contemporary  interactive  governance. 

Theoretical  Roots 

The  increasing  prominence  of  governance  network 
research  is  rooted  in  theoretical  developments  in 
organization  theory,  political  theory,  policy  analy¬ 
sis,  Europeanization  studies,  and  international 
relations.  The  conception  of  organizations  as  open 
systems  adapting  to  environmental  changes — and 
the  subsequent  recognition  that  this  environment 
consists  of  other  organizations — paved  the  way  for 
a  new  focus  on  the  interorganizational  exchange  of 
information  and  resources  taking  place  through 
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relatively  stable  forms  of  network-based  negotia¬ 
tion.  Likewise,  the  recognition  of  the  limits  of  the 
corporatist  and  neo-corporatist  images  of  Iron 
Triangles,  linking  state  agencies  and  social  part¬ 
ners  in  tightly  organized  decision-making  arenas, 
spurred  the  attempts  of  political  theorists  to  distin¬ 
guish  between  different  kinds  of  more  or  less  inte¬ 
grated  policy  networks  that  may  include  a  broader 
range  of  public  and  private  actors. 

If  both  organization  theory  and  political  theory 
gradually  arrived  at  some  notion  of  governance 
networks,  the  real  breakthrough  for  the  new  focus 
on  governance  networks  came  in  the  field  of  policy 
analysis.  Decision-making  theorists  revealed  the 
limits  of  the  rational  decision-making  model  in  the 
face  of  the  complexity  of  increasingly  fragmented 
policy  processes  in  which  social  and  political 
actors  struggle  to  link  problems  and  solutions  in 
highly  contingent  policy  decisions,  and  implemen¬ 
tation  theorists  demonstrated  the  failure  of  com¬ 
prehensive  planning  and  top-down  programming 
in  the  face  of  the  fierce  resistance  of  street-level 
bureaucrats,  user  groups,  and  interest  organiza¬ 
tions.  The  inescapable  conclusion  was  that  public 
policy  making  becomes  more  efficient  if  the  key 
actors  are  somehow  included  in  the  policy  process 
and  develop  an  ownership  to  joint  decisions. 

The  empirical  discovery  and  analytical  descrip¬ 
tion  of  the  European  Union  as  a  networked  polity 
in  which  public  and  private  actors  negotiate  within 
and  across  multiple  levels  and  the  current  transfor¬ 
mation  of  international  regimes  into  transnational 
networks  where  private  actors  play  an  increasingly 
important  role  in  policy  making  have  further  con¬ 
tributed  to  the  development  of  theories  of  network 
governance. 

Some  Caveats 

Policy  making  and  public  governance  are  not  con¬ 
gruent  with  the  formal  political  institutions  in 
terms  of  parliament  and  public  administration  but 
take  place  in  and  through  interactive  forms  of  net¬ 
work  governance.  Some  scholars  tend  to  describe 
the  result  of  this  apparent  shift  from  government 
to  governance  as  a  hollowing  out  of  the  state. 
However,  others  note  that  the  growth  of  interac¬ 
tive  forms  of  governance  in  the  shape  of  networks 
and  partnerships  has  not  mitigated  the  role  and 
impact  of  the  state.  The  state  may  have  lost  its 
privileged  position  in  public  policy  making,  but 


many  of  the  former  state  powers  remain  in  place 
and  new  capacities  are  developed  as  central  and 
local  state  agencies  take  on  the  task  of  metagov- 
erning  governance  networks  at  different  levels.  As 
such,  state  power  is  not  reduced;  rather,  it  is  exer¬ 
cised  in  new  and  subtle  ways. 

A  common  misunderstanding  is  that  gover¬ 
nance  networks  are  an  entirely  new  phenomenon. 
In  many  countries  and  policy  areas,  there  are  long 
traditions  for  the  corporatist  involvement  of  the 
social  partners  in  the  formulation  and  implementa¬ 
tion  of  policy,  and  interaction  between  public  and 
private  actors  is  a  key  feature  of  modern  govern¬ 
ment  and  a  constitutive  trait  of  pluralist  theory. 
What  is  new,  however,  is  that  political  theorists 
and  central  decision  makers  increasingly  view  gov¬ 
ernance  networks  as  both  an  efficient  and  legiti¬ 
mate  mechanism  of  public  governance.  This  is 
evidenced  by  the  increasing  reliance  on  governance 
networks  at  all  levels  of  government. 

When  discussing  governance  networks,  it  is 
important  to  counteract  the  tendency  to  perceive 
governance  networks  as  the  perfect  solution  to  all 
kinds  of  public  governance  problems.  Governance 
networks  are  no  panacea.  When  it  comes  to  the 
exercise  of  public  authority,  there  are  good  reasons 
for  placing  this  task  in  the  hands  of  public  bureau¬ 
cracies  that  can  be  held  accountable  for  their 
action  and  inaction.  Likewise,  when  it  comes  to 
the  production  of  fairly  standardized  public  goods 
and  services,  private  markets  might  provide  good 
value  for  public  money.  Governance  networks 
have  little  to  offer  in  relation  to  these  public  tasks, 
but  they  have  their  strength  in  relation  to  the  large 
amount  of  “wicked  problems”  in  public  gover¬ 
nance  where  the  nature  of  the  policy  problem  is 
uncertain,  specialized  knowledge  is  needed,  and 
there  are  many  relevant  stakeholders  from  differ¬ 
ent  sectors  and  levels  and  a  high  risk  of  conflict. 

Merits  and  Problems  of 
Governance  Networks 

Today,  governance  networks  are  increasingly  per¬ 
ceived  as  a  suitable  response  to  the  question  of  how 
to  tackle  complex  and  crosscutting  policy  problems 
and  governance  tasks.  Hence,  both  political  scien¬ 
tists  and  practitioners  praise  governance  networks 
for  their  potential  contribution  to  efficient  gover¬ 
nance.  Efficiency  gains  derive  from  the  distinctive 
features  of  governance  networks.  First  of  all,  it  is 
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often  claimed  that  ongoing  negotiations  among 
relevant  and  affected  actors  give  governance  net¬ 
works  a  large  potential  for  proactive  governance. 
The  network  actors  can  identify  policy  problems 
and  future  opportunities  at  a  relatively  early  stage 
and  produce  adequate  responses  that  allow  for 
adjustments  in  the  face  of  new  developments  and 
changing  conditions.  This  helps  explain  why  small 
states  with  strong  traditions  for  negotiated  gover¬ 
nance  have  done  relatively  well  in  the  face  of  fluc¬ 
tuations  in  world  markets. 

Governance  networks  are  also  regarded  as 
important  instruments  for  the  aggregation  of 
information,  knowledge,  and  assessments  that  can 
help  in  qualifying  policy  decisions.  The  network 
actors  often  possess  specific  knowledge  that  is  rel¬ 
evant  for  political  decision  making,  and  when  the 
knowledge  of  all  of  the  network  actors  is  pooled 
together,  it  represents  an  important  basis  for  for¬ 
mulating  and  selecting  innovative  and  yet  feasible 
solutions  or  responses  to  the  problems  and  chal¬ 
lenges  at  hand. 

In  addition,  governance  networks  are  said  to 
establish  a  framework  for  consensus  building  or,  at 
least,  for  the  civilizing  of  conflicts  among  the  stake¬ 
holders.  Governance  networks  tend  to  develop 
their  own  logic  of  appropriate  action,  which  regu¬ 
lates  the  process  of  negotiation,  the  formulation  of 
shared  objectives,  and  the  resolution  of  conflicts. 
Trust  building  through  social  interaction,  through 
joint  fact-finding,  and  through  mutual  learning 
further  contributes  to  mediation  of  conflicts. 

Finally,  governance  networks  tend  to  reduce  the 
risk  of  implementation  resistance.  If  the  affected 
actors  are  actively  involved  in  the  decision-making 
process,  they  will  develop  a  sense  of  joint  respon¬ 
sibility  and  ownership  for  the  political  decisions, 
even  when  they  disagree  with  crucial  aspects  of 
these  decisions.  The  shared  ownership  will  oblige 
the  stakeholders  to  support,  rather  than  hamper, 
the  implementation  of  public  policy  based  on  col¬ 
laborative  interaction. 

The  problem  is  that  the  potential  efficiency  and 
legitimacy  gains  of  governance  networks  are  only 
fully  realized  in  well-functioning  governance  net¬ 
works.  Changes  in  the  composition  of  a  network 
and  the  presence  of  unresolved  tensions  and  con¬ 
flicts,  weak  and  ineffective  leadership,  frustrations 
over  the  lack  of  clear  and  visible  results,  and  exter¬ 
nal  events  that  jeopardize  the  policy  process  tend 
to  destabilize  governance  networks  and  turn  them 


into  inefficient  talking  shops.  Careful  institutional 
design  and  network  management  might  prevent 
major  dislocations  and  mitigate  the  impact  of  vari¬ 
ous  disturbances,  but  optimizing  the  functioning  of 
governance  networks  on  all  dimensions  poses  a 
daunting  task.  Even  apparently  well-functioning 
networks  might  cause  problems  for  public  policy 
making,  either  by  enabling  particular  veto  players 
to  block  new  and  innovative  policy  initiatives  or  by 
attempting  to  shift  the  costs  of  expensive  policy 
solutions  to  third  parties.  To  avoid  such  problems, 
elected  governments  and  other  forms  of  political 
agency  must  exercise  their  powers  to  overrule  veto 
players  and  provide  incentives  for  the  networks  to 
find  responsible  solutions. 

Theories  of  Governance  Networks 

The  concept  of  governance  networks  is  descriptive, 
but  the  study  of  the  emergence,  functioning,  and 
results  of  interactive  forms  of  network  governance 
certainly  has  explanatory  ambitions,  though  not  in 
the  classical  sense  of  aiming  to  establish  determin¬ 
istic  causalities  with  a  lawlike  character.  The  ambi¬ 
tion  is  to  produce  open-ended,  context-sensitive 
knowledge  that  is  relevant  for  the  actors  engaged 
in  network  governance.  Explanation  is  established 
through  theoretically  informed  empirical  analysis 
of  network-based  policy  processes  and  policy  out¬ 
comes.  The  attempts  of  politicians  and  administra¬ 
tors  to  metagovern  the  relatively  self-regulatory 
governance  networks,  the  sociopolitical  conditions 
and  institutional  design  of  governance  networks, 
and  the  power  struggles  among  the  key  stakehold¬ 
ers  are  central  factors  in  explaining  the  outputs 
and  outcomes  of  networked  policy  making. 

Explanation  of  the  formation,  development, 
and  impact  of  governance  networks  may  draw  on 
central  insights  from  two  of  the  main  theories  of 
governance  networks:  interdependence  theory  and 
governability  theory.  Both  theories  tend  to  view 
social  action  as  driven  by  interest-based  calcula¬ 
tions,  but  they  differ  in  their  view  on  the  prospect 
of  overcoming  conflicts  and  facilitating  collabora¬ 
tion  among  multiple  stakeholders. 

Interdependence  theory  is  inspired  by  historical 
institutionalism.  It  defines  governance  networks  as 
an  interorganizational  medium  for  interest  media¬ 
tion  between  interdependent — but  conflicting — 
actors,  each  with  a  rule  and  resource  base. 
Governance  networks  are  formed  as  a  result  of 


1034  Governance  Networks 


strategic  calculations  of  independent  actors  who 
choose  to  interact  because  of  their  mutual  resource 
dependencies.  The  formation  of  governance  net¬ 
works  counteracts  the  institutional  fragmentation 
caused  by  new  public  management  reforms. 
Governance  networks  are  formed  through  incre¬ 
mental  bottom-up  processes  but  are  recruited  as 
vehicles  of  public  policy  making  by  public  author¬ 
ities.  The  network  actors  pursue  different  interests 
through  internal  power  struggles,  but  they  are  held 
together  by  their  mutual  interdependence,  which 
facilitates  negotiation  and  compromise. 

Governability  theory  combines  rational  choice 
institutionalism  with  a  systems  theoretical  view 
of  societal  development.  It  defines  a  governance 
networks  as  a  horizontal  coordination  between 
autonomous  actors  who  interact  through  differ¬ 
ent  negotiation  games.  The  formation  of  gover¬ 
nance  networks  is  seen  as  a  functional  response  to 
the  increasing  societal  complexity,  differentia¬ 
tion,  and  dynamism  that  undermine  the  ability  to 
govern  society  efficiently  through  hierarchy  and 
market.  Governance  networks  are  viewed  as 
gamelike  structures  that  facilitate  horizontal 
coordination  among  systems  and  organizations 
and  are  held  together  partly  by  the  anticipated 
gains  from  resource  pooling  and  joint  action  and 
partly  by  development  of  mutual  trust  that  helps 
overcome  collective  action  problems  and  mitigate 
conflicts. 

Institutional  theories  of  normative  integration 
and  theories  of  governmentality  advanced  by 
Michel  Foucault  and  his  followers  also  provide 
insights  into  the  intricacies  of  interactive  gover¬ 
nance.  These  theories  do  not  focus  explicitly  on 
governance  networks,  but  they  tend  to  view  gov¬ 
ernance  as  a  decentered  process  involving  a 
plethora  of  public  and  private  actors.  While  dif¬ 
fering  in  the  emphasis  on  the  role  of  power  and 
conflict  in  societal  governance,  both  theories  take 
an  interpretative  approach  to  social  action, 
emphasizing  the  role  of  institutions  and  dis¬ 
courses  in  shaping  the  identity,  perceptions,  and 
actions  of  social  actors. 

Institutional  theories  of  normative  integration 
define  governance  networks  as  an  institutionalized 
field  of  interaction  between  relevant  policy  actors 
who  are  integrated  in  a  community  defined  by 
common  rules,  norms,  and  perceptions.  Governance 
networks  are  regarded  as  a  normative  response  to 


the  twin  problems  of  totalitarian  overintegration 
and  individualistic  underintegration  of  social 
agency.  They  are  formed  through  a  bottom-up 
process  whereby  contacts  established  due  to  the 
recognition  of  interdependence  are  evaluated  and 
extended  on  the  basis  of  institutionalized  logics  of 
appropriateness.  Over  time,  governance  networks 
develop  their  own  logic  of  appropriateness,  and 
the  network  actors  become  normatively  integrated 
through  the  construction  of  solidarity  and  com¬ 
mon  identities. 

Governmentality  theory  implicitly  defines  gov¬ 
ernance  networks  as  an  attempt  by  an  increasingly 
reflexive  and  facilitating  state  to  mobilize  and 
shape  the  free  actions  of  self-governing  actors. 
Governance  networks  are  interpreted  as  a  political 
response  to  the  failure  of  neoliberalism  to  realize 
its  key  goal  of  less  state  and  more  market.  The 
problematization  of  neoliberalism  has  led  to  the 
formulation  of  a  new  governmentality  program, 
associated  with  advanced  liberal  government  that 
aims  to  shift  the  burden  of  government  to  local 
networks  in  which  the  energies  of  social  and  polit¬ 
ical  actors  are  mobilized  and  given  a  particular 
direction  in  order  to  ensure  conformity.  Governance 
networks  are  constructed  and  framed  by  particular 
governmental  technologies  and  narratives  that  aim 
to  recruit  social  and  political  actors  as  vehicles  of 
the  exercise  of  power. 

Major  Themes  in  First-  and 

Second-Generation  Research 

Although  the  field  of  research  is  relatively  new,  it  is 
possible  to  talk  about  first  and  second  generations 
of  governance  network  research.  The  first  genera¬ 
tion  attempted  to  convince  us  that  something  new 
was  occurring.  As  such,  it  was  primarily  preoccu¬ 
pied  with  explaining  why  and  how  governance 
networks  are  formed,  how  they  differ  from  tradi¬ 
tional  forms  of  governance  in  terms  of  hierarchy 
and  markets,  and  how  they  contribute  to  efficient 
governance  within  different  policy  fields  and  at  dif¬ 
ferent  regulatory  scales.  The  first  generation  suc¬ 
ceeded  in  linking  the  rise  of  network  governance  to 
new  societal  trends,  in  fleshing  out  the  distinctive 
features  of  governance  networks  vis-a-vis  state 
and  market,  and  in  analyzing  the  formation  and 
functioning  of  governance  networks  in  different 
countries,  in  policy  areas,  and  at  different  scales. 
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Slowly  but  surely,  the  research  agenda  has  pro¬ 
ceeded  beyond  the  preoccupations  of  the  first  gen¬ 
eration  of  governance  network  research.  Govern¬ 
ance  networks  no  longer  represent  something  new 
and  exotic;  rather,  they  are  an  intrinsic  part  of 
modern  governance.  At  this  stage,  new  and  as  yet 
unanswered  questions  have  come  to  the  fore  and 
constitute  the  research  agenda  of  a  second  genera¬ 
tion  of  governance  network  research.  The  new 
generation  does  not  constitute  a  clear  break  with 
the  past  as  there  is  a  considerable  overlap  in  terms 
of  scholars  and  research  themes.  However,  the 
research  agenda  has  been  expanded  to  include  a 
number  of  pressing  questions.  First,  there  is  a 
growing  number  of  researchers  who  are  analyzing 
the  sources  of  governance  network  failure  and  the 
conditions  of  success.  Second,  a  large  number  of 
scholars  are  studying  how  public  authorities  and 
other  legitimate  and  resourceful  agencies  can  regu¬ 
late  self-regulating  governance  networks  through 
different  kinds  of  metagovernance.  Third,  the  per¬ 
sistent  critique  of  the  tendency  to  view  networked 
policy  processes  as  depoliticized,  managerial  prob¬ 
lem  solving  has  prompted  studies  of  the  role  of 
political  conflicts  and  power  struggles.  Finally, 
political  theorists  and  public  administration 
researchers  are  paying  increasing  attention  to  the 
democratic  problems  and  merits  of  governance 
networks. 

Eva  Sorensen  and  Jacob  Torfing 
Roskilde  University 
Roskilde,  Denmark 
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Government  is  a  key  concept  in  comparative 
political  science  that  has  undergone  frequent 
changes  in  terminology  and  has  different  impor¬ 
tance  in  the  intellectual  traditions  of  democratic 
countries.  Its  core,  more  current  definition  refers 
to  all  activities  of  steering  within  human  groups, 
from  tribes  to  the  state  and  to  supranational  and 
international  organizations.  This  entry  first  dis¬ 
cusses  the  meanings  of  the  term  government  and 
then  reviews  the  main  functions  of  government.  In 
the  third  section,  the  different  contemporary  mod¬ 
els  of  government  are  illustrated.  The  subsequent 
two  sections  examine  the  key  institutional  aspects 
of  government  and  the  new  approaches  to  the 
study  of  government,  such  as  the  so-called  core 
executive  models,  the  rational  choice  institutional¬ 
ism,  and  governance  studies. 

Terminology 

In  the  Middle  Ages,  the  terms  regere  and  guber- 
nare  were  used  as  synonyms.  The  tradition  of 
English-  and  French-speaking  countries  derived  to 
govern  and  government  or  gouverner  and  gouver- 
nement  from  the  Latin  term  gubernare ,  whereas 
the  German-speaking  countries  derived  their  main 
term  Regierung  from  the  Latin  word  regere.  The 
French  terminology  preserved  the  notion  of  regime , 
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which  later  was  frequently  used  in  a  pejorative 
sense.  The  term  ancien  regime  originally  referred 
to  the  French  system  of  government  from  about 
the  15th  century  until  the  French  Revolution. 
Following  Alexis  de  Tocqueville’s  1856  work  on 
the  concept  of  ancien  regime,  the  term  was  usually 
not  used  in  a  value-free  and  neutral  way  to  refer  to 
the  prerevolutionary  system  under  the  Bourbon 
dynasty.  Still,  Max  Weber,  in  his  political  writings, 
denounced  the  “regime”  of  his  country  as  “half- 
patriarchical,  half  caesaristic.  ” 

In  early-modern  times,  government  and  admin¬ 
istration  were  rarely  differentiated.  Later,  under 
the  impact  of  the  division  of  power  theories  of  the 
18th  and  19th  centuries,  they  were  combined 
under  the  heading  of  the  executive — as  contrasted 
with  the  legislature  or  parliament.  In  the  era  of 
absolutist  monarchy,  government — defined  as  the 
auxiliary  institution  of  a  monarch — was  the  most 
important  term.  With  the  increasing  constitution¬ 
alization  of  monarchy,  government  was  no  longer 
the  leading  institution  in  a  political  system  but 
became  a  subsystem  under  the  control  of  parlia¬ 
ment  and  jurisdiction.  Even  in  constitutional 
regimes  with  a  dualistic  structure  but  without  a 
dominant  parliament,  as  in  Britain  and  the  United 
States,  the  legal  state  or  Recbtsstaat  promoted  the 
idea  that  government  could  not  act  without  being 
empowered  by  a  legislative  decision.  In  Germany, 
positivist  legal  thinkers  restricted  the  possibility  of 
acting  without  mandate  from  parliament  in  the  so- 
called  justizfreien  Hoheitsakte  (governmental  acts 
as  an  emanation  of  a  sovereign  will  in  the  state, 
not  underlying  judicial  review)  in  foreign  policy 
and  military  affairs. 

After  the  parliamentarization  of  monarchical 
regimes,  the  political  power  of  government  was 
increasingly  differentiated  from  the  mere  technical 
execution  of  decisions  taken  by  a  common  enterprise 
of  legislature  and  government.  New  terminology 
arose  due  to  the  rise  of  the  parties  in  parliaments. 
With  the  tenure  of  Benjamin  Disraeli  as  prime  min¬ 
ister  in  Britain,  the  term  party  government  came 
to  be  used  to  demonstrate  the  integration  of 
those  politicians  who  are  currently  holding  office. 
Parliamentary  majorities  and  the  prime  minister  and 
his  or  her  ministerial  crew  were  united  by  a  common 
organization  originally  though  not  yet  provided  for 
in  the  statutes  of  the  legislature — that  is,  the  parties 
or  parliamentary  party  groups. 


Outside  Britain  or  the  United  States,  there  is 
rarely  one  homogeneous  majority  party  governing 
the  country.  Governmental  coalitions  or  Regierungs- 
koalitionen  are  directed  by  various  office  holders 
with  quite  different  names,  such  as  the  British 
prime  minister,  the  French  president  du  conseil,  or 
the  imperial  or  federal  chancellor  in  the  German 
and  Austrian  traditions.  In  the  hierarchy  of  gov¬ 
ernment,  there  are  normally  two  or  more  levels. 
In  Britain,  the  cabinet  constitutes  only  the  inner 
core  with  ministers  of  state  and  parliamentary 
secretaries.  In  the  United  States,  government  is 
normally  described  as  the  administration  or  the 
executive. 

Key  Functions  of  Government 

In  parliamentary  systems,  the  government  has  sev¬ 
eral  functions.  Most  important  is  the  function  of 
initiating  policies.  Quite  frequently,  parliamentary 
bills  are  prepared  by  the  executive  offices. 
Moreover,  government  has  the  increasingly  impor¬ 
tant  function  of  implementation.  Research  on 
implementation  has  shown  that  laws  and  decrees 
are  changed  substantially  by  modes  of  implementa¬ 
tion  in  the  hand  of  bureaucracies  under  the  guid¬ 
ance  of  governmental  offices.  A  third  function  of 
government  is  coordination.  The  office  of  the  gov¬ 
ernment  leader  has  to  ensure  that  policies  and  bills 
and  their  implementation  are  complementary  and 
do  not  contradict  each  other.  With  the  growing 
power  and  fragmentation  of  parties  in  parliament 
during  the  20th  century,  conservative  political 
theorists  frequently  complained  that  countries 
were  becoming  increasingly  difficult  to  govern. 
Ungovernability  was  a  polemical  term  that  became 
a  popular  exaggeration  of  political  reality  in  the 
1970s.  The  highly  normative  debate  on  govern- 
ability  was  a  reaction  to  the  growth  of  the  new 
social  movements — the  ecological  and  right-wing 
populists.  The  more  progressive  analysts  who 
fought  against  the  conservative  doomsday  scenar¬ 
ios  accepted  the  new  society  of  movements.  They 
accepted  the  ecological  movement  as  a  promising 
source  of  change  in  society.  These  theorists  came 
to  less  negative  conclusions  about  the  populist 
movements,  which  criticized  “the  system”  but  did 
not  challenge  the  system  as  a  whole,  instead  direct¬ 
ing  their  criticism  primarily  at  individual  institu¬ 
tions  and  at  the  behavior  of  the  “political  class.” 
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The  postmodern  cultural  turn  created  other 
derivations.  In  France,  Michel  Foucault  developed 
the  expression  gouvernementalite ,  which  tried  to 
link  the  mode  of  governing  and  regulations  with 
the  specific  rationality  of  a  decision.  The  mode  of 
governing  in  this  context  aims  less  at  direct  influ¬ 
ence  of  the  individual  and  collective  actors  and 
focuses  more  on  the  indirect  impact  of  action. 
Gouvernementalite  tried  to  overcome  the  restric¬ 
tions  of  an  outlook  that  puts  the  sovereignty  of 
state  at  the  center  of  analysis.  The  dichotomy  of 
notions,  such  as  governing  oneself  and  governing 
over  others,  is  meant  to  strengthen  the  self-respon¬ 
sibility  of  modern  citizens.  Criticism  in  this  theory 
is  directed  not  so  much  against  the  existing  govern¬ 
ments  and  their  institutions  but  against  the  under¬ 
lying  rationality  of  governmental  aims  and  actions. 

In  the  history  of  comparative  research,  the  origi¬ 
nal  term  comparative  government  is  rarely  used 
anymore,  and  the  dilemma  of  how  to  define  gov¬ 
ernment  is  avoided  by  adopting  the  new  term  gov¬ 
ernance,  which  came  into  use  in  the  1990s.  In 
recent  encyclopedias,  “government”  is  no  longer 
listed  as  an  entry  but  is  generally  replaced  by 
entries  on  “governance”  and  “governability” 
instead.  In  other  professional  dictionaries,  only  spe¬ 
cialized  articles  such  as  “government  regulation,” 
“government  budget  constraint,”  or  “government 
lawyers”  and  “government  statistics”  are  included. 

Models  of  Government 

Government  as  a  subject  of  comparative  political 
science  in  recent  years  is  no  longer  at  the  center  of 
scholarly  attention  as  it  used  to  be  until  the  1970s. 
The  literature  on  executive  government  in  parlia¬ 
mentary  systems  is  often  not  written  by  political 
scientists.  Biographies  and  journalistic  approaches 
dominate  (Rod  Rhodes,  2006).  Government  was 
defined  as  an  institution  that  has  the  authority  to 
allocate  values  within  a  legitimately  recognized  set 
of  institutions.  Experience  increasingly  showed, 
however,  that  government  was  no  longer  capable  of 
making  the  desired  allocations.  Complex  arrange¬ 
ments  rest  on  horizontal  forms  of  interaction 
between  actors,  independent  of  each  other  and  no 
longer  embedded  in  one  hierarchical  structure.  Such 
horizontal  relations  have  always  been  studied  by 
political  science,  as  in  the  “cozy  triangles”  of  inter¬ 
est  groups,  administrators,  and  parliamentarians. 


In  some  European  systems,  the  existence  of  federal 
actors  and  constitutional  courts  with  certain  veto 
powers  complicated  decision  making.  Postmodern 
polities  developed  this  kind  of  cooperation  with  a 
certain  regularity.  Governance  includes  decision 
making  not  only  via  deliberation  and  negotiation 
but  also  through  implementing  bodies.  Governance 
is  no  longer  centered  on  the  coercive  power  of  the 
state — the  incarnation  of  national  sovereignty — 
although  governance  is  not  just  another  term  for 
the  market. 

In  a  family  tree  of  comparative  politics,  Philippe 
Schmitter  in  2009  differentiated  between  three 
main  branches  of  comparative  politics:  (1)  histori¬ 
cal-sociological  institutionalism  from  Alexis  de 
Tocqueville  and  Max  Weber  to  Raymond  Aron, 
Gabriel  A.  Almond,  and  David  Apter;  (2)  legal 
constitutionalism  from  Maurice  Duverger  and 
Carl  J.  Friedrich  to  Klaus  von  Beyme;  (3)  rational 
institutionalism  in  the  public  choice  tradition  from 
Mancur  Olson  and  Albert  Flirschman  to  Anthony 
Downs,  James  Buchanan,  and  William  Riker. 
Public  policy  and  administration — no  longer 
lumped  together  under  the  heading  of  “govern¬ 
ment” — could  be  studied  under  the  auspices  of 
both  neo-institutionalism  and  rational  institution¬ 
alism.  Schmitter’s  plea  was  in  favor  of  “complexi- 
fication” — that  is,  of  using  different  approaches 
depending  on  the  problem  to  be  addressed. 

In  spite  of  the  dominance  of  the  notion  “gov¬ 
ernance”  during  past  decades,  the  term  govern¬ 
ment  is  still  used  in  various  ways.  In  a  recent 
essay,  the  political  scientist  Jean  Blondel  con¬ 
cluded  that  the  governments  are  a  recurrent  fea¬ 
ture  of  all  social  organization.  In  a  broad  sense, 
government  comprises — especially  in  the  British 
tradition — the  central  institutions  of  decision 
making,  such  as  parliament  and  government.  In 
1930s  and  later,  government  was  used  in  the 
broader  sense  of  a  whole  political  system.  The 
classifications  of  systems  since  Aristotle,  such  as 
“monarchy,”  “aristocracy,”  and  “democracy,” 
have  been  treated  under  the  notion  of  “classifica¬ 
tion  of  governments.”  Later,  government  appeared 
in  subtypes  of  representative  democracies,  such  as 
“parliamentary  government”  and  “presidential 
government.  ”  These  prototypes  were  developed  by 
Britain  and  the  United  States.  In  a  parliamentary 
government,  the  cabinet  is  responsible  to  parlia¬ 
ment.  There  is  mutual  dependence,  contrary  to  the 
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dualism  in  a  presidential  government;  parliamen¬ 
tary  majorities  can  topple  the  government  by  votes 
of  nonconfidence,  and  governments  can  dissolve 
parliament  and  call  for  new  elections.  In  general, 
there  are  parliamentary  governments  throughout 
the  European  continent  and  in  the  British  domin¬ 
ions.  Presidential  government  in  many  variations 
was  developed  in  Latin  America  and  Eastern 
Europe  when  those  countries  were  making  the 
transition  to  democracy.  Presidential  government 
tends  to  be  less  dependent  on  parties,  especially  the 
parliamentary  party  groups.  Under  Charles  de 
Gaulle,  a  semipresidential  government  with  a  popu¬ 
larly  elected  president  was  reinvented  in  France,  a 
system  that  had  existed  in  the  Second  French 
Republic  (1848-1851),  in  the  Weimar  Republic 
(1919-1933),  and  in  Finland  (since  1919).  With  the 
third  wave  of  democratization  since  the  early  1970s, 
new  variations  were  created  in  Eastern  Europe,  such 
as  in  Poland  and  Russia.  As  suggested  by  Thomas 
Poguntke  and  Paul  Webb  (2005,  pp.  8ff),  in  a  post¬ 
modern  media  society  the  various  types  of  execu¬ 
tives  are  becoming  more  similar — that  is,  there  is  a 
“presidentialization  of  parliamentary  government” 
that  exists  even  in  the  United  Kingdom. 

Institutional  Settings  of  Government 

In  a  narrower  sense,  government  “is”  governmen¬ 
tal  organization,  including  the  chief  executive  and 
his  or  her  office,  the  cabinet,  and  the  ministries. 
The  varieties  of  governmental  organizations 
depend  on  constitutional  provisions,  conventions, 
and  standing  orders  of  parliament  and  routine 
orders  in  the  bureaucracy.  In  the  governments  of 
Great  Britain  and  some  Commonwealth  countries, 
the  government  is  dominated  by  the  prime  minis¬ 
ter.  In  the  French  Third  and  Fourth  Republics,  the 
president  du  conseil  was  weak  and  ministers  had 
more  room  for  maneuvering.  The  British  principle 
of  “hire  and  fire”  is  rarely  valid  in  continental 
government  organizations  because  coalition  par¬ 
ties  and  important  politicians  representing  wings 
of  the  majority  party  limit  the  power  of  the  head 
of  government.  The  German  chancellor  system  is 
somewhat  closer  to  the  British  prime  ministerial 
government,  though  it  has  always  needed  a  coali¬ 
tion  government — with  one  exception,  1957-1961 
under  Konrad  Adenauer,  and  even  in  this  case,  the 
federal  chancellor  preferred  a  coalition  to  provide 


majorities  for  a  case  of  less  comfortable  majorities 
in  forthcoming  elections.  The  cabinet  or  council  of 
ministers  was  legally  recognized  for  the  first  time, 
though  rather  late,  in  the  constitutional  laws  of  the 
Third  French  Republic  (1875). 

Beyond  these  legal  principles,  governments  in 
most  countries  have  developed  informal  traditions. 
In  European  parliamentary  systems,  governmental 
organization  is  hierarchical  and  bureaucratic.  The 
collaborators  of  the  chief  of  government  are 
mostly  experienced  professional  civil  servants.  The 
recruitment  of  politicians  for  important  offices  is 
determined  by  the  strength  of  their  parties.  When 
the  influence  of  parties  is  strong,  the  cabinet 
considers  its  leader  as  primus  inter  pares.  This  is, 
however,  incompatible  with  the  legal  framework 
of  the  German  government.  Nevertheless,  German 
history  shows  strong  chancellors,  such  as  Konrad 
Adenauer  or  Helmut  Schmidt,  and  weak  chancel¬ 
lors,  such  as  Kurt  Georg  Kiesinger  and  Angela 
Merkel.  These  latter  leaders  were  restricted  in  their 
power  by  a  grand  coalition  of  the  two  major  par¬ 
ties.  Such  a  chancellor  has  to  set  guidelines  for  all 
ministers,  as  mandated  by  Article  65  of  the  Basic 
Law.  Some  offices,  such  as  agriculture  or  health, 
also  attract  exponents  of  important  interest  groups. 
Moreover,  in  the  United  States,  the  Senate  has  an 
important  influence  on  the  composition  of  govern¬ 
ment  because  it  confirms  the  president’s  appoin¬ 
tees  for  certain  government  offices. 

Several  years  ago,  it  was  common  to  identify 
the  United  States  with  having  an  “amateur  type” 
of  government  and  France  with  a  “technocratic 
type”  of  leader  who  usually  came  from  a  few 
grandes  ecoles  (elite  schools).  Most  European  par¬ 
liamentary  systems  are  somewhere  in  between.  As 
noted  above,  Germany  is  relatively  close  to  the 
British  type  of  selection  of  government  officers, 
which  could  be  dubbed  the  “parliamentary  party 
type.”  In  systems  with  a  high  number  of  parties 
and  the  resultant  fragmentation,  coordination  is 
more  difficult,  and  governmental  stability  may  be 
underdeveloped.  The  most  important  example 
was  Italy  before  Silvio  Berlusconi’s  period.  In  fact, 
the  highly  fragmented  party  government  increased 
the  number  of  offices.  With  a  strong  factionalism 
within  parties,  as  in  the  case  of  the  Italian  cor- 
renti,  the  government  apparatus  expanded.  In 
general,  it  was  largely  accepted  that  coalition  gov¬ 
ernments  increased  the  number  of  offices.  But 


Government  1039 


prime  ministerial,  single-party  governments  of  the 
Westminster  type  contributed  still  more  to  the  pro¬ 
liferation  of  offices,  up  to  the  point  where  one  third 
of  the  governing  parliamentary  group  in  Britain 
held  executive  office.  The  “executivization”  of  par¬ 
liamentary  personnel  had  repercussions  on  parlia¬ 
ment.  When  all  the  capable  deputies  held  office,  the 
remaining  parliamentarians  were  weakened,  and  it 
was  not  easy  to  organize  parliamentary  activities 
independent  of  the  government.  Some  systems, 
therefore,  limited  the  number  of  ministers,  as  in 
Norway.  The  cabinet  in  a  narrow  sense  and  the 
government  in  a  broader  sense  in  parliamentary 
systems  comprise  a  number  of  offices  from  a  dozen 
to  several  dozens  of  governmental  posts.  In  the 
presidential  system  of  the  United  States,  the  patron¬ 
age  power  of  the  president  can  result  in  about 
4,000  nominations  for  government  positions.  In 
Germany,  there  are  about  20,000  employees  in  the 
government  in  a  broader  sense,  but  only  about  150 
positions  are  held  by  secretaries  of  state  and  lead¬ 
ing  ministerial  directors  who  can  be  dismissed  at 
any  time.  German  government  in  a  broad  sense 
includes  also  trustees  for  special  tasks  (about  27) 
and  parliamentary  secretaries  of  state  who  are  par¬ 
liamentarians,  and  as  exponents  of  their  respective 
parties,  they  connect  government  and  parliament. 
Their  main  task  is  to  support  the  views  of  govern¬ 
ment  and  to  defend  governmental  propositions  in 
parliament  to  find  a  majority  of  favorable  votes. 

Recent  research  on  government  in  Europe 
increasingly  turns  to  the  study  of  multilevel  gov¬ 
ernment  within  the  European  Union  (EU),  charac¬ 
terized  by  the  fact  that  most  parliamentary  legisla¬ 
tion  in  member  countries  is  the  implementation  of 
EU  decisions  and  directives.  The  European  influ¬ 
ence  on  national  government  depends  on  the  pol¬ 
icy  arena,  most  frequently  in  environment,  agricul¬ 
ture,  traffic,  and  communication.  The  least  affected 
so  far  is  the  most  important  area  for  citizens — that 
is,  social  security  (about  5%).  According  to  Klaus 
von  Beyme  (1998),  not  more  than  one  quarter  of 
the  laws  in  European  countries  are  the  absorption 
inside  each  country  of  decisions  made  by  the  EU. 

New  Approaches  to  the 
Study  of  Government 

Three  approaches  have  recently  been  used  to  study 
governments  in  order  to  overcome  the  conventional 


classifications  of  types  of  government:  core  execu¬ 
tive  models ,  rational  choice  institutionalism,  and 
governance  studies. 

Core  Executive  Models 

The  six  core  executive  models  that  have  been 
differentiated  by  Robert  Elgie  (1997)  more  exactly 
indicate  the  true  center  of  power  than  traditional 
notions  such  as  parliamentary  government: 

1 .  Monocratic  government:  The  key  feature  is 
personal  leadership  by  a  prime  minister  or 
president. 

2.  Collective  government:  Small,  face-to-face 
groups  decide  policy,  with  no  single  member 
controlling. 

3.  Ministerial  government:  The  political  heads  of 
major  departments  decide  policy. 

4.  Bureaucratic  government:  Nonelected  officials  in 
government  departments  and  agencies  decide 
policy. 

5.  Shared  government:  Two  or  three  individuals 
have  joint  and  equal  responsibility  for  policy 
making. 

6.  Segmented  government:  A  sectoral  division  of 
labor  among  executive  actors  with  little  or  no 
cross-sectoral  coordination. 

These  “ideal  types”  are  rarely  found  in  modern 
democracies,  but  elements  of  them  are  combined 
in  fluid  patterns  and  sometimes  one  model  suc¬ 
ceeds  another. 

Rational-Choice  Institutionalism 

George  Tsebelis  (2002),  in  his  approach  to 
rational  choice,  posits  that  governments  in  order 
to  change  policies  must  get  individual  actors  or 
veto  players  to  agree.  Institutional  veto  players 
are  specified  by  the  constitution  and  partisan  veto 
players  are  specified  by  the  party  system.  Tsebelis 
(2002)  reverses  the  usual  meaning  of  notions  such 
as  parliamentary  and  presidential  government: 
“Agenda  control  most  frequently  belongs  to  gov¬ 
ernments  in  parliamentary  systems  and  parlia¬ 
ment  in  presidential  ones.”  If  the  veto  players  are 
differentiated  according  to  policy  areas,  the  old 
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taxonomy  of  representative  systems  is  more  or 
less  irrelevant. 

Governance  Studies 

The  concept  of  governance  (derived  from  the 
Latin  term  gubernantia)  has  a  wider  range  than 
that  of  government  because  it  includes  all  individ¬ 
ual  and  collective  interactions  of  governments  with 
nongovernmental  organizations,  interest  groups, 
social  movements,  and  citizen  groups.  Even  before 
the  invention  of  the  notion  of  governance,  the  gov¬ 
ernmental  organization  was  integral  to  the  com¬ 
munications  network  involving  interest  groups 
and  expert  bodies  of  political  advice.  Also  included 
in  the  notion  of  governance  are  long-term  policy 
networks  and  short-lived  networks  established  for 
individual  decisions.  The  increasing  complexity  of 
the  decision-making  process  has  contributed  to  a 
wide  debate  on  “government  overload,”  namely,  a 
higher  number  of  demands  that  are  difficult  or 
impossible  for  government  to  meet.  This  overload 
contributed  to  the  discussion  on  ungovernability 
that  occurred  in  the  1980s.  Governance  studies 
have  shown  that  the  generic  analyses  of  ungovern¬ 
ability  were  misleading,  given  the  new  forms  of 
overcoming  crises  and  deadlocks  that  have  been 
developed  in  postmodern  societies. 

The  term  governance  was  first  widely  used  in 
the  subdiscipline  of  international  relations  where 
“governing  without  government”  (James  N. 
Rosenau  &  Ernst-Otto  Czempiel,  1995)  played  a 
major  role.  Moreover,  in  the  subfield  of  interna¬ 
tional  relations  dealing  with  developmental  poli¬ 
tics  in  the  Third  World,  the  term  good  governance 
has  been  used.  It  aims  at  finding  normative  criteria 
for  efficient  aid  to  developing  countries.  This 
approach  was  directed  against  an  ideological  moti¬ 
vation  for  aid  to  developing  countries  in  the  times 
of  the  Cold  War.  Later,  the  instrumental  combina¬ 
tion  of  multilateral  developmental  strategies  and 
the  ideological  preoccupation  of  developed  democ¬ 
racies  for  the  promotion  of  democracy  in  the  Third 
World  were  criticized  by  the  elites  in  the  Third 
World.  Building  on  the  resolutions  adopted  by  the 
United  Nations  since  1996,  scientific  criteria  for 
good  governance  were  developed. 

Governance  gained  a  central  importance  for  the 
discipline  when  network  became  the  central  expres¬ 
sion  for  a  great  variety  of  interactions  beginning  in 


the  early  1980s.  Governance  is  a  new  term  for 
old  approaches  that  started  with  theories  of  neo¬ 
corporatism  on  guided  cooperation  of  governments 
and  interest  groups  and  theories  on  governmental 
steering  when  the  term  planning,  which  dominated 
in  the  1970s,  became  obsolete.  Theories  of  the 
cooperative  government  and  interdependence  man¬ 
agement  in  the  science  of  administration  also  pre¬ 
pared  the  road  for  studies  of  governance.  The  older 
approaches  were  clearly  centered  on  “actors.” 
Governance  was  not  only  a  new  game  of  words  but 
part  of  a  neo-institutionalist  turn  in  the  social  sci¬ 
ences.  Governance  approaches  were  built  on  the 
experience  that  policy  formulation  became  more 
difficult  in  postmodern  societies.  There  is  more  dif¬ 
ferentiation  in  the  new  century  in  the  realm  of 
interest  groups  and  social  movements  than  in  neo¬ 
corporatism — a  dominant  paradigm  in  the  1980s — 
aimed  at  cooperation  between  state  agencies  and 
main  actors,  such  as  organizations  of  business  and 
trade  unions.  Self-organization  in  a  civil  society 
made  “steering”  less  easy.  The  core  executive  was 
no  longer  dominant  even  in  the  area  of  traditional 
government  because  many  governmental  and 
para-statal  agencies  are  involved  in  policy  formu¬ 
lation  and  policy  implementation  and  a  new 
term — agencification — has  been  invented.  In  the 
name  of  new  public  management  reform  for  greater 
efficiency,  a  separation  of  policies  and  operations 
was  recommended  by  Christopher  Pollitt  and  Colin 
Talbot  (2003).  Lobbying  also  changed  in  postmod¬ 
ern  governments.  Clientele  relations  grew  up 
between  ministerial  and  para-statal  agencies  and 
organized  interests,  as  Marian  Dohler  (2001) 
pointed  out.  Think  tanks  and  councils  of  experts 
gained  an  important  role  in  many  policy  fields. 

Moreover,  the  role  of  the  media  has  increased. 
Public  affairs  agencies  and  international  organiza¬ 
tions  of  attorneys  that  do  not  permanently  work 
for  the  same  interests  grew  in  importance  against 
the  traditional  organized  lobbies.  Privatization 
and  liberalization  of  former  state  monopolies  in 
infrastructure  (e.g.,  mail  services,  telephone,  rail¬ 
roads,  and  energy  supply)  have  created  new  pro¬ 
files  of  governmental  tasks.  The  old  instruments 
of  government  such  as  laws,  persuasion  and  pro¬ 
paganda,  coercion,  and  regulation  developed  a 
different  relative  weight.  As  already  suggested  by 
von  Beyme  (1998),  today,  there  is  virtually  no 
institution  that  is  “sovereign,”  for  example,  “the 
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legislator” — as  an  actor  that  includes  both  parlia¬ 
ment  and  government. 

These  changes  left  political  science  with  consid¬ 
erable  pessimism  as  to  whether  institutional  poli¬ 
cies  can  create  more  efficient  policies.  There  are 
two  approaches  to  answering  this  question.  One 
approach  prefers  the  transformative  perspective 
and  accepts  that  institutional  arrangements  can  be 
changed  only  by  long-run  changes  in  culture  and 
outlooks.  “Path  dependency”  of  governmental 
institutions  has  been  discovered  and  tries  to 
explain  why  French  centralization  or  German  fed¬ 
eralism  is  so  difficult  to  be  transformed  because 
historical  traditions  and  vested  interests  prevent 
radical  change. 

Another  approach  believes  in  the  possibility  of 
creating  a  deliberate  design  of  a  regulating  system 
and  that  a  kind  of  “metagovernor”  (as  described  by 
Bob  Jessop)  or  “third-order  governing”  (according 
to  Jan  Kooiman)  survived  in  postmodern  society. 

In  both  approaches,  government  ceased  to  be 
“government”  in  the  traditional  sense  in  which  it 
had  been  under  the  impact  of  theories  on  states 
and  sovereignty  for  almost  500  years  after  Niccolo 
Machiavelli,  Jean  Bodin,  or  Thomas  Hobbes. 

Klaus  von  Beyme 
University  of  Heidelberg 
Heidelberg,  Germany 
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Granger  Causality 


Granger  causality  is  the  concept  that  if  a  variable 
causes  another  variable,  it  should  help  with  the 
prediction  of  the  latter  variable.  First,  Granger 
causality  is  defined  informally  and  formally.  Next, 
the  standard  hypothesis  tests  used  to  evaluate  the 
concept  are  presented.  Third,  the  interpretations 
of  these  tests  for  Granger  (non)causality  are  dis¬ 
cussed.  Then,  some  caveats  and  limitations  are 
presented  before  turning  to  some  common  exten¬ 
sions  and  applications. 

The  statistical  concept  of  Granger  causality  is 
related  directly  to  the  idea  of  causality  and  causal 
inference  in  the  social  sciences.  In  the  (social) 
sciences,  it  is  the  case  where  two  variables  are 
causally  related  if 

•  the  change  in  one  variable  temporally  precedes 
that  of  the  variable  it  causes, 

•  there  is  a  nonzero  correlation  among  the  two  (or 
more)  variables,  and 

•  the  relationship  among  the  variables  is  logically 
nonspurious. 

Granger  causality  assesses  the  first  two  of  these 
using  a  statistical  test,  leaving  the  third  to  the  jus¬ 
tification  of  the  analyst.  Granger  causality  analy¬ 
sis  asks  whether  a  variable  helps  in  predicting 
another,  which  requires  both  temporal  precedence 
and  correlation.  One  of  Clive  Granger’s  seminal 
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time-series  papers  on  this  topic  developed  the 
statement  of  the  concept  and  presented  the  most 
common  statistical  tests  used  to  assess  causality. 

If  a  variable  Z  at  Time  t  Granger  causes  a  vari¬ 
able  X  at  Time  t  or  Zt  Granger  causes  Xt  then  the 
following  three  statements  are  logically  equivalent: 

1.  Zt  helps  predict  Xt  (i.e.,  it  lowers  the  error  of 
the  prediction), 

2.  Zt  is  not  exogenous  of  Xt, 

3.  Zt  is  linearly  informative  about  future  Xr 

The  assumption  that  makes  these  three  state¬ 
ments  equivalent  is  that  there  is  a  linear  relation¬ 
ship  between  Zt  and  Xt.  Granger  causality  can  be 
defined  more  generally  and  can  also  encompass 
nonlinear  relationships.  In  this  case,  only  the  first 
two  statements  are  relevant. 

The  formal  definition  of  Granger  causality  is 
actually  a  statement  of  noncausality.  A  variable 
Zt  is  Granger  noncausal  for  Xt  if  the  past  values 
of  Zt  do  not  help  predict  the  current  (or  future) 
values  of  Xt  over  a  prediction  based  on  the  past 
history  of  Xt  alone.  Under  the  assumption  of  lin¬ 
earity,  then,  this  is  the  same  as  analyzing  a 
dynamic  regression  of  Xt  on  past  values  of  Xt  and 
Z„  or 

Xt  =  d  +  £  a jXt-i  +  £  (3 jZt-j  +  et, 

i= 1  i=  1 

where  d  is  a  constant,  as  are  the  coefficients  for 
the  p  lagged  values  of  Xt,  and  (3,s  are  the  coeffi¬ 
cients  for  the  p  lagged  values  of  Zt.  If  Zt  Granger 
causes  Xt,  then  some  of  the  (3,  coefficients  are  dif¬ 
ferent  from  zero.  Note  that  by  including  the  lagged 
values  of  Xt,  the  variable  Zt  must  add  more  to  the 
prediction  of  X  than  its  own  past  (and  thus  help 
minimize  the  prediction  error  et)  to  deem  it  causal. 
Testing  whether  these  are  equal  to  zero  is  thus  a 
test  of  Granger  noncausality. 

Statistical  tests  of  Granger  (non)  causality 
depend  on  comparing  the  residuals  of  the  model 
in  the  above  equation  with  those  of  a  model 
where  Xt  only  depends  on  its  past  values,  or 
where  one  assumes  that  (3;  =  0  for  i  =  1,  .  .  . ,  p. 
Statistical  tests  for  Granger  causality  are  gener¬ 
ally  implemented  as  a  comparison  of  the  mean 


squared  prediction  error  of  the  previous  equation 
with 

p 

Xt  —  d  +  Yj  aiXt-i  +  ut. 

i=i 

This  amounts  to  comparing  the  variance  of  et  with 
ut.  If  the  variance  of  the  former  is  smaller  than  Zf, 
Granger  causes  Xt  under  the  earlier  assumptions. 
A  Granger  causality  test  is  done  most  commonly 
using  either  an  T,  Wald,  or  likelihood  ratio  test  for 
whether  (3X  =  |32  =  .  .  .  =  (3p  =  0.  These  test  statis¬ 
tics  degrees  of  freedom  are  typically  corrected  by  a 
factor  of  T  2p  1,  where  T  is  the  sample  size 
and  p  is  the  number  of  lags.  This  helps  account  for 
the  estimation  of  the  potentially  large  number  of 
parameters  in  these  equations. 

While  this  test  statistic  is  easily  and  commonly 
implemented,  it  depends  on  several  assumptions. 
These  include  (but  are  not  limited  to)  the  following: 

1.  The  relevant  information  and  variables  have 
been  included  in  the  model  used  to  conduct  the 
test.  Any  relevant  omitted  variables  can  lead  to  a 
biased  test. 

2.  The  relationship  between  the  variables  is  linear 
and  the  errors  in  the  equations  are  normally 
distributed.  (The  result  can  hold  for  nonlinear 
and  nonnormal  cases  but  requires  other 
assumptions.)  In  this  most  common  case,  testing 
for  Granger  causality  is  reduced  to  the 
comparison  of  the  minimum  mean  squared 
regression  errors  defined  above. 

3.  There  is  no  residual  correlation  in  et  or  ut, 
meaning  that  the  lag  length  p  is  sufficient  to 
remove  any  residual  serial  correlation  from  the 
regressions  used  to  construct  the  test. 

4.  The  variable  Zt  is  stationary.  Nonstationarity 
could  induce  spurious  correlation,  since  then 
there  could  be  an  error  correction  mechanism 
between  the  variables  that  naturally  implies  a 
Granger  causality  relationship  among  the 
variables.  Extensions  of  the  standard  Granger 
causality  tests  exist  for  this  case. 

Finding  that  one  Granger  variable  causes 
another  depends  on  the  information  set  or  the  vari¬ 
ables  that  are  included  in  a  model.  So  if  one  were 
to  introduce  a  variable  Yt  that  is  causal  for  Xt  and 
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correlated  with  Zt,  one  could  find  that  the  effects 
of  Zt  on  Xt  work  through  this  new  variable.  The 
same  is  not  necessarily  true  for  findings  of  Granger 
noncausality,  since  the  existence  of  a  variable  that 
would  turn  a  noncausal  relationship  into  a  causal 
one  is  nearly  impossible. 

It  is  important  to  note  that  tests  for  Granger 
(non)causality  do  not  assess  the  direction  of  the 
causal  relationship.  The  null  hypothesis  is  that  the 
past  values  of  the  variable  Zt  do  not  help  with 
predicting  Xt.  Rejecting  this  hypothesis  could 
occur  if  there  were  a  positive  or  negative  correla¬ 
tion  among  the  past  values  of  Zf  and  the  current 
Xt.  To  interpret  the  causal  direction  of  the  effects 
of  Zt  on  Xt,  one  typically  employs  a  vector  autore¬ 
gression  model  or  computes  the  dynamic  effects  of 
changes  in  past  Zt  on  contemporaneous  Xt.  This  is 
where  theory  helps  identify  that  the  relationship  is 
nonspurious  and  provides  some  ideas  about  the 
expected  direction  of  the  effects  of  one  variable  for 
forecasting  the  other.  Formally,  impulse  response 
functions  or  the  vector  moving  average  interpreta¬ 
tion  of  a  vector  autoregression  model  are  used  to 
determine  the  direction  of  these  effects.  The  vector 
autoregression  model  and  its  estimated  effects  will 
demonstrate  the  same  causal  relationships  identi¬ 
fied  with  a  Granger  causality  test  and  provide  the 
magnitude  and  direction  of  any  causal  effects. 

The  above  definition  of  Granger  causality  is 
what  is  known  as  bivariate  Granger  causality. 
The  concept  can  also  be  extended  in  several  ways. 
The  first  involves  the  multivariate  case.  Suppose 
now  that  Zt  and  Xt  are  vectors  of  observations  at 
time  t.  Then,  we  say  that  Zt  is  non-Granger  causal 
or  block  exogenous  of  Xt  if  the  past  values  of  the 
matrix  Zt  do  not  help  in  predicting  the  current 
values  of  Xt.  The  earlier  test  statistics  can  then  be 
extended  to  their  matrix  equivalents  and  used  to 
assess  whether  the  past  Zt  values  help  predict  Xt. 

A  second  extension  is  what  is  known  as  contem¬ 
poraneous  Granger  causality.  A  variable  Zt  instan¬ 
taneously  causes  Granger  Xt  if  the  contemporane¬ 
ous  and  leading  or  future  values  of  Zt  help  in 
predicting  the  current  value  of  Xt.  To  test  this 
hypothesis,  one  looks  at  the  contemporaneous  rela¬ 
tionships  (the  et  and  ut  residuals  for  the  vector 
autoregression)  to  see  if  there  is  a  correlation 
among  the  current  values  of  the  series. 

The  third  possible  extension  is  to  move  beyond 
the  assumption  of  minimizing  one-step  prediction 


errors.  Multistep  Granger  causality  can  be  defined 
by  looking  at  the  h-step  residuals  for  a  regression 
of  Xt  on  past  Xt  and  Zt.  If  past  Zt  minimize  the  h 
periods  of  forecast  errors  in  Xt  better  than  the  past 
Xt  alone,  then  Zt  is  said  to  be  multistep  Granger 
causal  for  Xt.  A  final  extension  is  to  panel  data. 
One  can  extend  the  same  ideas  used  in  assessing 
Granger  causality  in  time-series  data  to  collections 
of  time  series  that  are  part  of  a  panel  of  time  series. 

Granger  causality  is  a  useful  test  and  tool  for 
determining  which  variables  help  predict  each 
other  in  multivariate  time-series  models.  It  is  used 
commonly  to  justify  which  variables  are  included 
in  a  vector  autoregression  or  other  multivariate 
time-series  models  since  it  allows  the  analyst  to 
justify  different  (weak)  exogeneity  assumptions. 
The  point  of  a  Granger  causality  test  analysis  is 
that  one  uses  the  results  to  support  an  argument 
about  the  predictive  power  of  a  variable  on 
another  in  a  time-series  model. 

Patrick  T.  Brandt 
University  of  Texas  at  Dallas 
Richardson,  Texas,  United  States 
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Statistical  graphics  provide  visual  displays  of  quan¬ 
titative  information.  While  graphs  have  been  an 
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important  tool  in  the  statistical  sciences,  their  role 
and  applications  have  evolved  over  time.  In  the 
past,  graphical  displays  were  generally  employed 
for  pedagogical  purposes  and  to  present  the  final 
results  of  an  analysis.  Currently,  graphs  are  being 
incorporated  more  directly  into  data  analysis  itself, 
where  they  are  used  to  provide  insights  and  direc¬ 
tions  for  further  investigation.  In  this  entry,  the 
basic  elements,  major  advantages,  and  recent 
developments  of  statistical  graphics  are  presented. 

Statistical  graphs  are  used  for  several  purposes. 
First,  they  provide  a  tool  for  exploring  the  contents 
of  a  data  set;  in  effect,  graphical  displays  enable 
researchers  to  interact  directly  with  the  “raw  mate¬ 
rial”  for  their  analyses.  Second,  graphs  enable 
researchers  to  look  for  structure  in  their  data;  they 
provide  a  ready  means  to  identify  interesting  fea¬ 
tures  such  as  concentrations  of  observations  at 
particular  data  values,  covariance  across  variables, 
outliers  in  distributions,  and  so  on.  Third,  graphi¬ 
cal  displays  are  useful  for  checking  the  assump¬ 
tions  of  statistical  models;  for  example,  they  can  be 
used  to  examine  distributional  shapes  of  constitu¬ 
ent  variables  or  to  search  for  systematic  patterns 
among  model  residuals.  Fourth,  graphs  remain  an 
excellent  vehicle  for  communicating  the  results  of 
statistical  analyses;  they  provide  fairly  intuitive 
representations  of  complex  models  that  can  be 
understood  by  broad  audiences. 

Of  course,  all  of  the  preceding  objectives  could 
also  be  achieved  through  numerical  methods,  such 
as  equations  and  tabular  representations.  But 
graphs  have  several  advantages.  For  one  thing,  an 
enormous  amount  of  information  can  be  repre¬ 
sented  directly  or  summarized  in  a  single  well- 
constructed  graph;  therefore,  graphical  displays 
are  useful  for  depicting  large,  complex  data  sets. 
At  the  same  time,  graphs  are  not  as  reliant  on 
underlying  assumptions  as  are  numerical  summa¬ 
ries  of  data  such  as  descriptive  statistics  or  model 
coefficients.  And  graphs  almost  inherently  encour¬ 
age  greater  interaction  between  the  researcher  and 
the  data;  interesting  features  stand  out  immedi¬ 
ately  and  graphs  provide  the  means  to  pursue  them 
to  gain  further  insights. 

Historical  Evolution 

The  earliest  “modern”  graphical  displays,  gener¬ 
ated  during  the  Enlightenment  period  of  the  17th 


and  18th  centuries,  downplayed  depictions  of 
empirical  data  in  favor  of  graphing  the  abstract 
functions  that  seemed  to  describe  the  dynamics  of 
the  observable  universe.  But  a  minor  scientific 
revolution  occurred  at  the  end  of  the  18th  century, 
when  an  iconoclastic  Scottish  political  commenta¬ 
tor,  William  Playfair,  published  The  Commercial 
and  Political  Atlas  (various  editions  were  pro¬ 
duced  between  1786  and  1801)  and  the  Statistical 
Breviary  (1801).  The  substantive  objective  for 
each  of  these  books  was  to  provide  information 
about  the  British  economy,  relative  to  other 
nations.  But  they  are  remembered  largely  because 
Playfair  developed  a  number  of  graphical  displays 
to  illustrate  his  data  and  support  his  arguments. 
Playfair’s  innovations  would  be  recognizable 
immediately  to  a  modern  reader;  they  include  bar 
charts,  pie  charts,  and  line  charts. 

During  the  19th  century,  graphs  were  used 
widely  in  scientific  communications.  This  period  is 
characterized  by  the  use  of  creative  pictorial 
devices  to  convey  information  as  efficiently  as  pos¬ 
sible.  For  example,  a  famous  map  by  French  engi¬ 
neer  Charles  Joseph  Minard  depicts  Napoleon’s 
march  during  the  Russian  campaign  of  1812.  It 
has  been  hailed  as  the  finest  statistical  graphic,  in 
large  part  because  it  encodes  information  about  six 
different  variables  into  a  single  display.  Graphical 
representations  of  empirical  information  waned 
once  again  during  the  first  half  of  the  20th  century. 
During  this  period,  statisticians  turned  their  atten¬ 
tion  to  confirmatory  methods  and  hypothesis  test¬ 
ing.  These  are  strategies  where  the  characteristics 
of  the  data  need  to  be  well  understood  before  the 
tests  are  applied  to  gain  insight  about  the  values  of 
specific  parameters.  And  since  assumptions  about 
the  processes  that  generated  the  data  precede  the 
tests,  graphical  displays  tend  to  be  less  useful. 

Beginning  in  the  1950s,  and  continuing  through 
the  first  decade  of  the  21st  century,  there  has  been 
a  strong  resurgence  of  interest  in  statistical  graph¬ 
ics.  There  are  several  reasons  for  this  recent  trend. 
First,  there  has  been  an  increasing  emphasis  on 
exploratory  methods,  through  the  influence  of  the 
late  John  Tukey  and  his  followers.  Second,  there 
has  been  productive  research  on  the  psychology  of 
graphical  perception,  so  there  is  greater  under¬ 
standing  of  the  ways  that  people  process  visual 
information.  Third,  there  have  been  major  advances 
in  graphical  methodologies,  providing  new  tools 
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for  pictorial  representations  of  numeric  informa¬ 
tion.  Fourth,  the  widespread  availability  of  power¬ 
ful  computing  equipment  and  high-resolution, 
fast-refreshing,  graphical  displays  means  that  sci¬ 
entists  now  have  the  means  to  render  graphs 
quickly  and  easily.  And  fifth,  the  “data  explosion” 
of  recent  years  has  encouraged  broader  reliance  on 
graphical  strategies  for  coping  with  the  sheer  vol¬ 
ume  of  information  confronting  many  researchers. 

Graphical  Information 

and  Human  Perception 

A  graph  can  be  effective  only  if  the  information 
contained  therein  can  be  extracted  easily  and  accu¬ 
rately  by  its  intended  audience.  Understanding 
how  to  accomplish  this  requires  some  understand¬ 
ing  of  human  graphical  perception.  An  influential 
theory  developed  by  William  S.  Cleveland  holds 
that  interpretation  of  graphs  comprises  two  funda¬ 
mental  tasks:  table  lookup  and  pattern  perception. 
Table  lookup  refers  to  the  process  of  moving  from 
the  displayed  geometric  rendering  of  the  data  back 
to  the  numerical  values  that  generated  the  display. 
This  process  is  facilitated  through  careful  attention 
to  the  axes  and  scales  that  define  the  plotting 
region(s)  of  a  graphical  display. 

Pattern  perception  is,  itself,  composed  of  three 
more  fundamental  perceptual  tasks:  First,  detection 
refers  to  the  ability  to  perceive  the  individual  ele¬ 
ments  of  the  data  in  a  graph;  it  usually  involves  the 
selection  of  suitable  plotting  symbols  to  represent 
data  points.  Second,  assembly  refers  to  the  ability 
to  combine  separate  elements  together  in  a  way 
that  facilitates  recognition  of  nonrandom  structure 
within  the  data — for  example,  the  shape  of  a  uni¬ 
variate  distribution  or  functional  dependence  of 
one  variable’s  values  on  another  variable’s  values. 
Third,  estimation  is  the  ability  to  make  quantitative 
comparisons  across  the  geometric  elements  in  a 
graphical  display.  For  example,  observers  can  typi¬ 
cally  make  quite  accurate  judgments  about  relative 
differences  when  data  points  are  plotted  along  a 
common  scale.  But  it  is  more  difficult  to  make 
similarly  accurate  comparisons  of  quantities  that 
are  represented  as  differences  in  angles,  areas  of 
geometric  shapes,  or  gradations  in  color. 

Theories  of  graphical  perception  are  important 
because  they  provide  standards  for  judging  the 
quality  of  graphical  displays.  Relying  on  Cleveland’s 


theory,  the  objective  is  to  encode  the  data  with  geo¬ 
metric  devices  that  optimize  the  three  components 
of  pattern  perception.  However,  compromises  are 
almost  always  necessary;  for  example,  emphasizing 
the  individual  data  points  facilitates  detection  but 
may  impede  assembly.  As  a  result,  there  is  seldom 
any  single  best  display  for  a  given  data  set.  Instead, 
different  graphs  may  provide  different  useful  per¬ 
spectives  on  the  same  basic  information. 

Types  of  Graphical  Displays 

There  are  many  situations  in  which  the  analyst  is 
confronted  with  labeled  quantitative  values,  such 
as  the  raw  data  values  associated  with  individual 
observations  or  summary  values  calculated  for 
subgroups  within  a  data  set  (e.g.,  the  conditional 
means  on  a  variable  or  the  percentage  of  observa¬ 
tions  within  each  subset).  For  such  data,  pie  charts, 
bar  charts,  and  dot  plots  can  be  used.  Pie  charts 
require  relatively  inaccurate  visual  judgments  of 
angles  and  areas,  while  accurate  interpretation  of 
bar  charts  can  be  impeded  by  the  arbitrary  loca¬ 
tion  of  the  origin  on  the  scale  that  defines  the 
lengths  of  the  bars.  Carefully  constructed  dot  plots 
can  overcome  these  problems,  and  they  can  be 
adapted  to  a  wide  variety  of  specific  situations. 
Therefore,  they  are  the  preferred  analytic  graphical 
display  for  labeled  data  values. 

Graphical  displays  for  univariate  data  enable 
visual  assessments  of  a  distribution’s  most  impor¬ 
tant  features — center,  spread,  shape,  and  outliers. 
Again,  there  are  several  possibilities,  including 
univariate  scatterplots,  histograms,  smoothed  his¬ 
tograms,  quantile  plots,  and  box  plots.  Univariate 
histograms  are  simple  to  construct  and  interpret; 
however,  they  are  effectively  limited  to  small  data 
sets.  Histograms  are  familiar  and  easy  to  interpret. 
However,  the  contents  of  a  histogram  can  be 
affected  by  the  definitions  of  the  categories  (or 
“bins”)  used  to  delineate  its  bars.  Smoothed  histo¬ 
grams  overcome  the  problems  caused  by  bin  defi¬ 
nitions,  but  they  are  similarly  affected  by  technical 
aspects  of  the  smoothing  process.  Quantile  plots 
overcome  all  of  the  preceding  problems,  but  they 
are  cognitively  demanding  and  probably  not  famil¬ 
iar  to  many  social  scientists.  Box  plots  focus  on  the 
most  important  quantiles  of  a  data  distribution; 
they  are  easily  interpreted  and  are  especially  useful 
for  comparing  univariate  distributions. 
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There  is  really  only  one  type  of  graphical  display 
for  bivariate  data:  the  scatterplot.  But  careful  atten¬ 
tion  to  the  details  is  critical  for  optimizing  the  infor¬ 
mation  conveyed  by  this  type  of  display.  For  exam¬ 
ple,  the  plotting  symbol  used  to  represent  individual 
observations  must  be  visually  prominent  and  resis¬ 
tant  to  overplotting  effects.  And  the  plotting  region 
should  be  fully  enclosed  by  the  scale  rectangle,  with 
none  of  the  data  points  colliding  with  any  of  the 
axes.  Grid  lines  should  not  be  included  within  the 
plotting  region.  And  if  the  objective  of  the  graphical 
display  is  to  assess  the  relationship  between  two 
variables,  then  there  is  usually  no  need  to  include 
point  labels.  Finally,  a  scatterplot  smoother  (i.e.,  a 
curve  that  traces  out  the  conditional  mean  of  the 
variable  plotted  on  the  vertical  axis  across  the  range 
of  data  values  on  the  horizontal  axis)  can  be  used  to 
summarize  the  relationship  between  the  variables 
plotted  in  the  display. 

Multivariate  data  pose  particular  challenges  for 
statistical  graphics.  While  it  is  possible  to  draw  a 
three-dimensional  scatterplot,  using  an  oblique  ori¬ 
entation  on  the  data  space  to  simulate  a  perspective 
view,  such  displays  are  generally  not  very  effective. 
Instead,  other  devices  have  been  employed  to  repre¬ 
sent  multiple  dimensions  of  variability  in  a 
two-dimensional  display  medium.  For  example,  a 
scatterplot  matrix  is  the  graphical  analog  of  a  cor¬ 
relation  matrix,  showing  scatterplots  between  all 
possible  pairs  of  variables  arranged  in  a  regular, 
square  layout.  Conditioning  plots  and  trellis 
displays  show  one-  or  two-dimensional  graphical 
displays  for  subsets  of  a  data  set,  where  the  subsets 
are  defined  by  intervals  of  values  on  other  variables. 
And  finally,  there  are  a  number  of  related  strategies 
for  “touring”  high-dimensional  data,  by  taking 
projections  of  the  data  points  into  a  smaller  dimen¬ 
sioned  subspace  that  can  be  visualized  directly. 

William  G.  Jacoby 
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Greek  philosophy  is  often  taken  as  synonymous 
with  ancient  Western  philosophy,  or  as  thought  in 
the  age  of  classical  Athens,  represented  by  Socrates, 
Plato,  and  Aristotle.  Strictly  speaking,  however,  it 
means  the  thoughts  expressed  and  transmitted  in 
Greek  before  the  Byzantine  period,  except  for 
Christian  writings.  It  covers  the  period  from  the 
early  6th  century  BCE  to  529  CE  (the  year  of  the 
closure  of  the  Academy  [a  school  founded  by  Plato 
around  385  BCEJ  by  Justinian  I).  In  a  long  tradi¬ 
tion,  it  has  formed  our  ways  of  thinking  about  the 
world  and  human  beings  and  is  therefore  regarded 
as  the  basis  of  Western  civilization.  This  entry  first 
examines  the  essence  of  Greek  philosophy  and 
then  reviews  the  development  of  Greek  philosophy 
from  its  origin  to  the  classical  period,  particularly 
the  political  philosophy  of  Plato  and  Aristotle. 

Greek  philosophy  has  four  stages:  (1)  early 
Greek  (or  so-called  Presocratic)  philosophy  (early 
6th  to  mid-5th  century  BCE),  (2)  classical  philoso¬ 
phy  (late  5th  to  4th  century  BCE),  (3)  Hellenistic 
philosophy  (3rd  to  1st  century  BCE),  and  (4)  the 
philosophy  of  late  antiquity  (1st  to  early  6th  cen¬ 
tury  CE).  In  the  first  two  stages,  philosophers  were 
active  mainly  in  the  Eastern  Mediterranean  region 
(Asia  Minor,  Greece,  and  southern  Italy),  and  in 
the  latter  two,  their  activities  were  extended  to  the 
Near  East  and  the  whole  Mediterranean  region.  It 
should  be  remembered  that  many  philosophers  liv¬ 
ing  in  the  Roman  Empire  wrote  their  works  in 
Greek  (e.g.,  Plutarch,  Epictetus,  Marcus  Aurelius, 
Plotinus,  Galen,  and  Sextus  Empiricus). 

What  we  possess  is  the  transmitted  body  of 
Greek  philosophical  texts,  mostly  of  the  classical 
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and  the  Roman  periods.  While  a  large  portion  of 
the  ancient  books  were  lost,  some  have  been  trans¬ 
mitted  to  us  first  in  manuscripts  (earlier  in  papyri 
and  later  in  codices)  and  then  in  printed  books. 
Quotations  or  references  included  in  those  books 
are  also  edited  as  “fragments”  or  “testimonies”  of 
Greek  philosophers,  for  example,  the  Presocratics 
(Die  Fragmente  der  Vorsokratiker  by  Hermann 
Diels)  and  the  Stoics  (Stoicorum  Veterum  Frag- 
menta,  by  Hans  von  Arnim).  Luckily,  a  few  impor¬ 
tant  texts  are  also  preserved  in  papyri,  inscriptions, 
and  so  on.  Therefore,  studies  of  Greek  philosophy 
consist  of  the  intellectual  excavation  of  original 
thoughts  and  the  reexamination  of  their  heritage. 

Greek  philosophy  is  by  no  means  monolithic, 
but  rather,  it  is  characterized  by  variety  and  com¬ 
petitiveness.  Greek  philosophers  provided  polar 
ideas  such  as  “monism  versus  pluralism,”  “materi¬ 
alism  versus  idealism,”  “absolutism,  relativism, 
and  skepticism  versus  agnosticism,”  “free  will 
versus  determinism,”  and  “eudaimonism  and 
hedonism  versus  asceticism.”  Nevertheless,  they 
shared  some  essential  features:  inquiry  into  nature 
and  being  and  open  debate  between  different  posi¬ 
tions.  Their  philosophical  inquiry  was  based  on 
experience  and  reason — that  is,  the  pursuit  of  truth 
through  argument  (logos).  By  pursuing  rationality 
and  universality,  Greek  philosophy  achieved  sys¬ 
tematization  of  science  and  knowledge  and  thus 
participated  in  shaping  political  science.  This  fea¬ 
ture  dissociates  it  from  the  preceding  cultures  of 
Egypt  and  Babylonia  and  from  the  other  cultures 
outside  Europe.  Therefore,  Greek  philosophy  is 
often  evoked  as  a  source  of  the  cultural  identity  of 
Europe  and  the  West. 

The  Beginning  of  Philosophy 

Philosophy  (Greek  pbilosopbia,  meaning  “aspiring 
for  wisdom”)  is  generally  thought  to  have  started 
in  Greece  in  the  beginning  of  the  6th  century  BCE, 
with  Thales  of  the  Ionian  city  Miletus,  whom 
Aristotle  calls  “the  founder  of  philosophy.” 
Although  Thales,  one  of  the  Seven  Wise  Men, 
probably  left  no  writings,  he  and  his  compatriots, 
Anaximander  and  Anaximenes,  started  a  philo¬ 
sophical  inquiry  ( historia )  into  nature.  The  early 
philosophers  were  interested  mainly,  but  not  exclu¬ 
sively,  in  natural  phenomena,  such  as  celestial 
motions  and  the  structure  of  the  universe.  They 


asked  what  the  “origin”  or  “principle”  (arche)  of 
the  world  and  all  things  is  and  presented  different 
candidates:  water  (Thales),  fire  (Heraclitus),  air 
(Anaximenes,  Diogenes  of  Apollonia),  the  unlim¬ 
ited  (Anaximander),  and  the  unchanging  One 
(Parmenides).  In  the  mid-4th  century  BCE, 
Empedocles  from  Sicily  established  the  four-element 
theory  (fire,  air,  water,  and  earth)  and  thereon 
explained  the  cycle  of  change  of  the  universe,  in 
terms  of  mixture  and  separation,  by  means  of  the 
two  powers,  love  and  strife.  As  Empedocles  com¬ 
posed  the  religious  poem  called  Purifications  and 
the  cosmological  one  called  On  Nature,  arguably  in 
a  single  work,  inquiry  into  nature  and  moral  think¬ 
ing  came  together  in  the  early  philosophers.  They 
often  used  metaphorical  expressions,  but  the  first 
crucial  step  can  be  clearly  observed:  They  sought 
one  or  the  smallest  number  of  principles  to  explain 
the  whole  world. 

The  Ionian  tradition  of  inquiry  developed  both 
natural  sciences  (astronomy,  mathematics,  and 
medicine)  and  social  sciences  (geography  and  his¬ 
tory).  Herodotus,  in  the  investigation  into  the 
causes  of  the  Greco-Persian  Wars,  collected  a  wide 
range  of  geographical  and  cultural  information 
about  the  regions  of  Egypt,  Asia,  and  Europe.  His 
Histories  (Historiai,  meaning  “inquiries”)  pro¬ 
vided  ample  examples  of  the  conflicting  ethical 
and  legal  values  adopted  by  different  peoples  and 
societies  and  thus  introduced  a  multicultural  and 
relativistic  way  of  looking  at  the  world,  which  was 
further  developed  by  the  Sophists. 

The  first  stage  of  Greek  philosophy  is  often 
characterized  as  a  progression  from  mythos  (myth) 
to  logos  (reason).  This  means  that  the  way  of 
understanding  the  world  changed  from  mythical 
stories  based  on  Greek  religion  to  more  rational  or 
scientific  explanation.  Yet  the  actual  development 
was  not  so  straightforward.  Philosophical  theories 
and  explanations  still  contain  many  mythical  or 
irrational  factors  but  philosophers  at  least  share 
interest  in  the  program  of  rational  inquiry,  and  on 
this  basis,  they  compete  with  each  other  to  attain 
the  truth. 

The  Sophists  and  Socrates 

In  the  mid-5th  century  BCE,  when  Athens  experi¬ 
enced  democracy  and  prosperity  under  Pericles, 
professional  thinkers  and  teachers  called  Sophists 
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appeared  and  became  influential  in  the  society. 
These  intellectuals  played  an  important  role  in  the 
democratic  society,  where  free  citizens  sought 
power  by  cultivating  their  own  ability  and  the 
Sophists  taught  young  people  how  to  succeed  in 
society. 

The  greatest  Sophist,  Protagoras  of  Abdera, 
propagated  the  famous  “Man  measure  thesis”: 
“Man  is  the  measure  of  all  things:  of  those  which 
are,  that  they  are,  and  of  things  which  are  not,  that 
they  are  not.”  This  thesis,  which  bases  moral  val¬ 
ues  not  on  gods  but  on  human  beings,  promoted  a 
radical  relativism  fitting  for  democracy.  Protagoras 
was  successfully  teaching  virtues — that  is,  political 
excellence. 

What  the  Sophists  actually  taught  was  rheto¬ 
ric — the  art  or  skills  of  speech — for  it  was  crucial 
in  the  court  or  the  assembly  to  speak  persuasively 
in  front  of  a  large  audience.  Gorgias  from  Sicily 
developed  the  technique  of  public  speech  and 
reflected  on  the  power  of  speech  (logos),  noting 
that  it  can  move  people  and  the  country  and  there¬ 
fore  becomes  a  source  of  political  power.  The 
Sophists’  appeal  to  the  power  of  speech  attracted 
people  but  was  regarded  as  radical  and  dangerous 
to  traditional  morality. 

The  Sophists  investigated  the  basis  of  society 
and  the  political  system.  One  of  their  inventions 
was  the  contrast  of  nomos  and  physis  (law  or  cus¬ 
tom  and  nature),  and  based  on  this  contrast,  they 
put  forward  a  social  contract  theory.  This  theory 
initially  tried  to  explain  how  and  why  social  and 
moral  systems  differ  in  societies,  but  it  was  later 
used  to  attack  the  current  customs  and  morality. 
Antiphon,  the  Athenian  politician,  teacher  of  rheto¬ 
ric  and  Sophist  (though  there  is  a  controversy  over 
his  identity),  wrote  a  treatise  called  On  Truth  (of 
which  three  large  fragments  have  been  preserved  in 
papyri),  where  he  insisted  that  while  it  is  human 
nature  to  do  harm  to  others  in  order  to  get  more, 
laws  and  customs  prevent  one  from  doing  so. 

The  work  of  Herodotus  and  Thucydides  in  their 
Histories  contains  the  origin  of  Greek  political 
thought.  Their  description  and  comparison  of 
Persia,  Sparta,  and  Athens  reflected  the  serious  dis¬ 
cussions  and  controversies  over  the  different  politi¬ 
cal  systems  in  the  late  5th  century  BCE:  monarchy, 
oligarchy  (also  called  aristocracy,  i.e.,  rule  of  the 
best),  democracy,  and  tyranny.  Several  comparative 
studies  on  the  Athenian  and  Spartan  constitutions 


( politeiai )  were  written  and  circulated  in  this 
period.  It  is  noteworthy  that  in  democratic  Athens, 
the  oligarchic  system  of  Sparta  was  popular  among 
the  intellectuals,  including  Socrates,  Critias,  Xeno¬ 
phon,  and  Plato. 

Critias  was  an  oligarchic  politician  and  political 
thinker  influenced  by  the  Sophistic  movement.  A 
fragment  of  the  tragedy  Sisyphus,  attributed  to 
him,  claims  that  gods  were  invented  by  a  clever 
man  to  deceive  people  into  obeying  laws.  The 
Sophist  Prodicus  expressed  a  similar  rationalistic 
view  on  the  origin  of  religion — that  gods  are  per¬ 
sonifications  of  the  useful  things  and  heroes.  By 
casting  radical  doubts  on  traditional  values,  the 
Sophistic  enlightenment  movement  contributed  to 
re-forming  the  ideas  of  politics  and  ethics  as  dis¬ 
tinct  from  those  of  traditional  religion  and  to  con¬ 
sidering  them  objectively  and  scientifically. 

Socrates  belongs  to  the  same  generation  as  the 
Sophists  and  was  regarded  as  a  major  Sophist  by 
his  contemporaries  (as  depicted  in  Aristophanes’s 
comedy  Clouds).  It  was  his  pupils,  above  all  Plato, 
who  distinguished  him  from  the  other  Sophists  and 
natural  philosophers.  Since  Socrates  did  not  write 
anything,  it  is  difficult  to  determine  his  original 
thought,  but  a  few  philosophical  contributions  are 
generally  ascribed  to  him.  First,  Socrates  raised 
questions  on  moral  issues,  such  as  courage,  justice, 
and  goodness.  He  often  began  his  inquiries  with  a 
request  for  a  definition,  and  in  doing  so,  he  often 
revealed  his  interlocutors’  ignorance.  Second,  he 
often  drew  a  conclusion  from  a  series  of  examples, 
thus  making  use  of  the  logic  of  induction.  Third,  by 
this  cross-examination,  he  urged  people  to  take 
care  of  the  soul  ( psyche ) — that  is,  one’s  own  self. 
However,  his  radical  way  of  questioning  was  seen 
as  dangerous  and  subversive  of  ordinary  beliefs  and 
traditional  morality.  Socrates  was  convicted  of 
impiety  and  for  corrupting  youth  and  was  executed 
in  399  BCE.  But  his  way  of  life  became  a  major 
subject  of  the  later  philosophers,  and  Socrates  was 
made  the  model  philosopher  for  Cynics,  Stoics, 
Cyrenaics,  and  skeptics  of  the  Academy. 

Plato 

Plato  was  born  in  a  famous  Athenian  family,  and 
in  his  youth,  he  wished  to  be  active  in  politics,  as 
was  usual  in  his  society.  His  life  was  changed  by 
his  encounter  with  Socrates,  and  particularly  after 
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seeing  the  trial  and  death  of  Socrates,  he  kept  his 
distance  from  Athenian  politics  and  concentrated 
on  philosophy  and  scientific  research  in  the 
Academy.  It  is  reported  that  Plato  visited  Syracuse 
a  few  times  to  convert  the  despot  Dionysius  to 
philosophy,  but  in  vain.  Plato  wrote  some 
30  works  in  the  form  of  dialogues,  most  of  which 
feature  Socrates  as  the  main  speaker. 

Plato  made  full  use  of  the  dialogue  form.  The 
author  himself  never  appears  or  speaks  in  the  dia¬ 
logues.  Readers  are  urged  to  think  for  themselves 
about  the  issues  discussed  in  the  dialogue.  While 
it  is  often  assumed  that  the  main  character, 
Socrates  (or  a  few  others),  speaks  for  Plato’s  posi¬ 
tion,  the  opposition  often  has  a  good  point  to 
make.  For  example,  when  in  the  Gorgias ,  Callicles, 
the  young  Athenian,  challenged  Socrates  and 
defended  the  life  of  politics  by  an  appeal  to  “natu¬ 
ral  justice,”  some  readers  strongly  sympathized 
with  Callicles’s  position,  despite  Socrates’s  even¬ 
tual  refutation.  Friedrich  Nietzsche  is  an  obvious 
example  of  a  later  philosopher  whose  work  reflects 
the  perspective  taken  by  Callicles.  Nietzsche  openly 
defended  the  “will  to  power”  and  tried  to  overturn 
Platonist  morality.  It  is  sometimes  suggested  that 
Plato  himself  experienced  such  a  tension  and  con¬ 
flict  between  the  life  of  philosophy  and  the  life  of 
politics  as  depicted  in  the  dialogue. 

Plato’s  dialogues  put  forward  several  paradoxi¬ 
cal  arguments  of  Socrates — for  example,  that  no 
one  willingly  does  wrong  and  that  it  is  better  to 
suffer  injustice  than  to  do  injustice.  His  emphasis 
on  the  need  for  a  thorough  examination  of  moral 
concepts  and  values  appeared  to  ordinary  men  to 
turn  their  lives  upside  down.  In  urging  people  to 
take  care  of  the  soul,  Plato  makes  a  sharp  distinc¬ 
tion  between  the  soul  and  the  body  and  its  desires. 
Then,  the  soul  in  itself  is  able  to  contemplate  the 
essence  of  each  thing — what  it  essentially  is,  or 
what  Plato  calls  its  “Form.”  The  Forms  are  distinct 
from  sensible  things,  and  cause  their  being,  while 
the  sensible  things  partake  of  the  transcendent 
Forms.  For  example,  “Beauty”  itself  is  different 
from  beautiful  things  but  makes  things  beautiful. 
This  is  Plato’s  theory  of  Forms,  presented  in  the 
dialogues,  including  the  Phaedo,  Symposium, 
Republic,  and  so  on. 

Socrates  himself  was  probably  not  interested  in 
politics  and  lived  a  nonpolitical  life  as  a  private 
citizen.  However,  Plato’s  dialogues  understand  his 


life  as  engaging  in  true  politics  and  present  him  as 
the  ideal  statesman.  The  trial  of  Socrates  is  rein¬ 
terpreted  as  fundamentally  political.  His  cross- 
examination  of  his  accusers  at  his  trial  and  his 
constant  advice  to  his  fellow  citizens  to  take  care 
of  the  soul  and  virtue,  instead  of  the  body,  money, 
and  reputation,  constituted  a  true  political  activity 
to  make  the  polis  better,  but  they  gave  rise  to  mis¬ 
understanding  and  hatred  toward  Socrates  and 
eventually  led  to  his  death. 

Through  the  Socratic  dialogues,  Plato  explores 
the  tension  between  politics  and  philosophy  and 
expresses  the  principle  that  justice  in  all  its  forms 
can  be  understood  only  through  philosophy.  In 
this  way,  the  famous  idea  of  philosopher-kings  or 
philosopher-rulers  is  conceived  and  proposed  in 
the  Republic  (the  Greek  title  Politeia  meaning  the 
being  of  a  state  or  a  constitution).  The  main  ques¬ 
tion  of  the  dialogue  is  what  justice  is  and  whether 
a  just  person  lives  a  happy  life.  The  Sophist 
Thrasymachus  first  raises  a  radical  claim  of  justice 
as  the  “advantage  for  the  stronger” — that  the 
stronger  people  exploit  the  weaker  in  the  name  of 
law  and  justice.  Following  him,  Glaucon  and 
Ademinatus  (Plato’s  older  brothers)  reformulate 
this  claim  into  a  formidable  challenge  to  morality: 
that  people  obey  the  law  and  do  justice  only 
because  they  think  it  necessary  (while  wanting  to 
avoid  it)  on  the  basis  of  the  social  contract.  Against 
this,  Socrates  has  to  demonstrate  that  justice  is 
good  both  in  itself  and  for  its  results.  To  consider 
justice  and  injustice  more  clearly,  Socrates  then 
suggests  that  they  construct  a  state  (polis)  in  argu¬ 
ment  and  compare  its  justice  with  that  of  the  soul. 
The  basic  principle  of  founding  the  state  is  that 
everyone  does  one’s  own  work  (labor  division 
according  to  one’s  nature).  If  each  class,  namely, 
rulers,  warriors,  and  producers,  does  its  own 
work,  the  state  is  just.  On  the  other  hand,  when 
someone  transgresses  this  boundary,  various  types 
of  injustice  occur.  The  just  state  is  discussed  to 
show  by  analogy  the  justice  of  the  soul.  When  the 
three  parts  of  the  soul,  namely,  reason,  spirit,  and 
appetite,  harmonize  to  make  a  single  whole,  the 
soul  is  just.  Thus,  the  Republic  deals  with  political 
philosophy  in  the  context  of  discussing  justice. 

The  conditions  of  the  ideal  state  are  a  focus  in  the 
central  books  of  the  Politeia  (Books  5-7),  where 
some  original  proposals  are  discussed,  such  as 
equality  of  men  and  women  in  ruling,  abolishment 
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of  the  family,  and  a  common  property  system. 
Socrates’  most  challenging  proposal  for  realizing 
the  ideal  state  in  this  world  is  to  unite  philosophy 
and  politics  by  either  making  the  philosopher  a 
ruler  of  the  state  or  making  the  present  ruler  a 
philosopher,  since  the  philosopher,  who  has 
knowledge  of  “Justice,”  “Goodness,”  and  the 
other  Forms,  alone  can  make  a  state  just  and  good 
in  the  proper  way.  To  become  a  philosopher, 
however,  one  needs  a  long  course  of  education 
and  hard  training,  first  in  general  education  in 
literature  and  physical  training  and  second  in  the 
mathematical  sciences.  The  final  stage  consists  of 
training  in  dialectic  (the  art  of  dialogue),  which  is 
said  to  deal  with  the  reality  or  the  Forms  by  logos 
(reason)  alone. 

Plato’s  idea  of  political  science  was  criticized  by 
his  contemporaries  and  by  later  philosophers. 
Isocrates,  Plato’s  rival  teacher  of  rhetoric,  insists 
that  sound  judgment  is  more  important  than  the 
theoretical  sciences,  such  as  mathematics,  in  mak¬ 
ing  one  a  good  citizen  and  politician.  Aristotle  also 
criticizes  Plato’s  idea  of  unity  in  Book  2  of  the 
Politics.  A  state  should  not  seek  a  strict  singleness 
as  proposed  in  the  Republic  but  should  be  realized 
in  a  harmony  of  diverse  elements.  This  echoes  the 
modern  criticisms  of  Plato  as  totalitarian.  According 
to  Karl  Popper  and  other  critics,  Plato’s  notions  of 
dictatorship,  the  “noble  lie,”  eugenics,  and  censor¬ 
ship  resemble  the  modern  totalitarian  ideologies  of 
Nazism  and  Stalinism.  Whereas  in  the  19th  century 
Plato’s  Republic  was  read  as  a  “utopian”  work,  in 
the  20th,  there  was  greater  emphasis  on  its  political 
implications.  However,  it  should  be  noted  that 
Plato’s  main  aim  lies  not  in  the  political  blueprint 
but  in  the  ethical  contrast  between  the  tyrant,  who 
rules  for  his  own  interest  and  greed,  and  the  true 
philosopher. 

Plato  places  the  Form  of  the  “Good”  in  the 
highest  position  in  the  realm  of  reality.  This 
implies  that  the  natural  world  and  human  activi¬ 
ties  have  a  common  basis.  This  idea  is  developed 
in  his  later  dialogue,  the  Timaeus ,  in  which  the 
origin  and  structure  of  the  universe  are  explained 
as  the  god’s  (Demiurge’s)  creation  by  means  of 
mathematical  principles.  Goodness  and  beauty  are 
regarded  there  as  the  foundation  principles  of  the 
universe.  Plato’s  cosmology  (which  he  called  “a 
likely  myth”)  was  influential  in  late  antiquity  and 
the  Middle  Ages  as  a  part  of  his  political  thought. 


Plato  wrote  two  other  dialogues  that  deal  with 
political  philosophy:  the  Statesman  (Politicus)  and 
the  Laws  (Nomoi).  The  Statesman  defines  the  art 
or  knowledge  of  statesmanship  as  the  interweaving 
of  different  elements  in  a  state,  particularly  cour¬ 
age  and  temperance.  Plato’s  last  work,  the  Laws, 
discusses  the  principles  and  laws  of  a  new  colonial 
state.  Although  the  differences  between  Plato’s 
ideas  in  the  Republic,  Statesman,  and  Laws  are 
matters  of  much  debate  among  modern  scholars,  it 
is  clear  that  Plato  examines  political  science  as  a 
major  subject  in  his  whole  system  of  philosophy. 

Aristotle 

Aristotle  was  born  in  Macedonia,  the  son  of  a 
medical  doctor.  Probably  because  of  this  back¬ 
ground,  he  was  more  interested  in  biology  and 
other  natural  sciences  than  in  mathematics  (which 
Plato  ranks  high  as  a  more  abstract  and  academic 
science).  He  joined  Plato’s  Academy  as  a  youth  to 
study  philosophy.  After  his  master’s  death,  Aristotle 
founded  his  own  school,  the  Lyceum,  and  led  sci¬ 
entific  projects  in  biology  and  political  science. 
Although  he  tutored  the  young  Alexander  the 
Great  for  some  years,  it  is  uncertain  what  influence 
he  had  on  the  prince. 

While  all  his  published  works  were  lost,  his  lec¬ 
tures  fortunately  survived  and  were  edited  by 
Andronicus  of  Rhodes,  the  Peripatetic  philosopher 
of  the  1st  century  BCE.  The  Corpus  Aristotelicum 
(Works  of  Aristotle)  is  the  transmitted  body  of  texts 
from  this  edition,  and  it  is  normally  assumed  that 
the  order  of  treatises  more  or  less  represents 
Aristotle’s  original  scheme  of  sciences:  namely,  logic, 
physics  (natural  philosophy),  metaphysics,  ethics, 
politics,  rhetoric,  and  poetics.  On  the  other  hand, 
one  must  be  cautious  in  using  the  lectures,  since  they 
are  often  sketchy  and  may  seem  to  contradict  each 
other.  Modern  scholars  have  tried  to  distinguish  the 
different  layers  of  composition  within  the  Corpus, 
but  no  agreement  has  been  reached  so  far. 

Aristotle  founds  the  logical  system  of  syllogisms 
and  discusses  epistemology  as  the  basis  of  the 
whole  of  science.  In  the  Posterior  Analytics,  he 
defines  the  scientific  demonstration  as  necessary 
reasoning  from  the  true  and  primary  premises. 
Whereas  it  remains  controversial  how  much  of  this 
principle  he  applies  to  particular  sciences,  such  as 
physics  and  ethics,  the  idea  of  a  demonstrative  and 
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deductive  system  lays  the  basis  for  all  scientific  dis¬ 
ciplines.  Another  important  method  in  use  is  dialec¬ 
tic.  In  the  academic  fields  where  we  cannot  secure 
true  premises,  we  should  collect  common  opinions 
and  reputable  views  of  experts  (called  endoxa)  first. 
Then,  by  examining  the  exdoxa,  we  can  hope  to 
find  truth  in  them.  The  method  of  dialectic  is 
applied  in  several  places,  mainly  to  provide  the 
starting  points  for  establishing  his  own  theories. 

While  physics  deals  with  the  world  of  changing 
things  and  their  unchanging  causes,  human  prac¬ 
tice  and  morality  should  be  treated  differently.  As 
distinct  from  the  causally  necessary  phenomena, 
human  issues  cannot  be  reasoned  about  or  proved 
with  the  same  strictness.  Therefore,  in  studies  of 
ethics  and  politics,  we  should  be  content  with 
something  true  “for  the  most  part.”  This  clear  dis¬ 
tinction  between  the  theoretical  and  practical  sci¬ 
ences  is  one  basic  divergence  from  Plato’s  thought. 

The  aim  of  lectures  in  ethics  and  politics  is  prac¬ 
tical,  namely,  to  make  the  listeners  good  human 
beings  and  citizens.  Every  person  seeks  the  good — 
that  is,  happiness  (eudaimonia,  not  in  the  sense  of 
modern  “happiness”  as  a  subjective  feeling) — but 
its  conceptions  differ  among  people.  Aristotle 
argues  that  since  the  eudaimonia  of  human  beings 
is  the  realization  of  their  natural  activity,  and  since 
their  essence  lies  in  rationality,  the  human  eudai¬ 
monia  is  sought  in  the  activity  of  contemplation 
(theoria).  Ethics  is  included  in  politics,  which  is  the 
highest  science  for  the  human  good.  Aristotle 
again  bases  his  inquiry  on  human  nature:  Since  a 
human  being  is  by  nature  a  political  animal  (i.e., 
animals  that  form  a  polis  to  live  together),  which 
is  equivalent  to  the  rational  animal  (the  animal 
that  has  logos,  i.e.,  reason  and  speech),  the  polis 
(political  community)  is  the  first  completion  of 
self-sufficiency  of  this  political  nature  of  human 
beings.  In  Politics,  Aristotle  then  examines  family, 
slavery  (which  he  notoriously  favors),  citizenship, 
various  constitutions,  and  education. 

Aristotle  is  also  said  to  have  engaged  in  joint 
research  on  some  158  constitutions  (politeiai)  of 
Greek  and  foreign  states.  Perhaps  the  most  impor¬ 
tant  one  was  the  Athenian  Constitution,  which 
was  discovered  in  1880  in  the  Egyptian  desert. 
This  work  provides  essential  reports  of  the  politi¬ 
cal  system  of  Athens,  collected  in  his  school. 

The  philosophies  of  Plato  and  Aristotle  spread 
throughout  the  world  and  formed  a  basic  way  of 


thinking  about  the  world  and  life.  The  most 
important  legacy  of  Greek  philosophy  is  perhaps 
the  whole  system  of  our  scientific  and  philosophi¬ 
cal  inquiry,  which  we  now  take  for  granted. 
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Green  Parties 


Green  and  alternative  movements  first  emerged  in 
several  Western  European  countries  in  the  1970s 
accompanied  by  a  more  general  generational  change 
toward  postmaterialist  values.  They  were  mostly 
concerned  with  environmental  issues,  such  as  the 
antinuclear  movement,  but  also  with  peace  and 
women’s  issues.  Since  then,  the  electoral  as  well 
as  the  parliamentary  performance  of  green  parties 
has  improved  remarkably  in  many  democratic 
societies  (mostly,  however,  in  European  countries). 
Thus,  green  parties  were  not  just  a  temporary 
phenomenon — as  many  political  observers  had  sug¬ 
gested  in  the  early  1980s — but  have  developed  into 
a  stable  element  of  many  party  systems  around  the 
world.  In  2001,  the  green  parties  created  a  “global 
network”  that  consists  of  61  member  parties: 
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15  from  Africa:  Benin,  Burkina  Faso,  Cameroon, 
Cote  d’Ivoire,  Guinea,  Guinea-Bissau,  Kenya,  Mali, 
Mauritius,  Morocco,  Niger,  Nigeria,  Senegal, 
Somalia,  and  South  Africa; 

12  from  the  Americas:  Brazil,  Canada,  Chile, 
Colombia  (2),  Dominican  Republic,  Mexico, 
Nicaragua,  Peru,  the  United  States,  Uruguay,  and 
Venezuela; 

14  from  the  Asia-Pacific  region:  Australia,  Japan, 
Mongolia,  Nepal,  New  Caledonia,  New  Zealand, 
Papua  New  Guinea,  Pakistan,  Philippines, 
Polynesia,  South  Korea,  Sri  Lanka,  Taiwan,  and 
Vanuatu;  and 

32  from  Europe:  Austria,  Belgium  (2),  Cyprus, 
Czech  Republic,  Denmark,  Estonia,  Finland, 

France,  Georgia,  Germany,  Greece,  Hungary, 
Ireland,  Italy,  Latvia,  Luxembourg,  Malta,  the 
Netherlands  (2),  Norway,  Poland,  Portugal, 
Romania,  Russia,  Slovakia,  Spain,  Sweden, 
Switzerland,  Ukraine,  and  the  United  Kingdom. 

Most  green  parties  outside  Europe  developed 
later,  in  the  mid-  or  late  1990s,  and  have  remained 
relatively  minor  players  in  terms  of  electoral  suc¬ 
cess.  Consequently,  comparative  research  and 
scholarly  debates  focus  mainly  on  Western 
European  green  parties.  The  central  scholarly  con¬ 
tributions  revolve  around  four  topics:  (1)  the  rise 
and  development  of  the  Greens  in  Europe  (Ferdinand 
Muller-Rommel,  1989;  Michael  O’Neill,  1997; 
Dick  Richardson  &  Chris  Rootes,  1995),  (2)  their 
political  profile  (Kitschelt,  1989;  Rihoux,  2001), 

(3)  their  electoral  and  parliamentary  performance 
(Pascal  Delwit  &  Jean  M.  de  Waele,  1999),  and 

(4)  their  participation  in  national  governments 
(Ferdinand  Muller-Rommel  &  Thomas  Poguntke, 
2002). 

This  entry  documents  the  rise  of  green  parties 
and  describes  their  political  profile,  performance, 
and  participation  in  national  governments.  Last, 
this  entry  discusses  the  impact  of  green  parties,  both 
as  a  voice  for  those  whose  views  are  not  represented 
by  established  parties  and  as  a  way  of  bringing 
attention  to  issues  these  parties  have  not  addressed. 

From  Social  Movements  to  Green  Parties 

The  early  1970s  saw  the  rise  of  new  social  move¬ 
ments  in  many  countries  of  the  Western  world. 


Most  of  these  movements  emerged  spontaneously 
from  politically  independent  citizen  initiative 
groups.  These  groups  were  generally  interested  in 
specific  issues  such  as  the  provision  of  parks,  and 
they  protested  against  urban  renewal,  new  high¬ 
ways,  or  the  construction  of  nuclear  power  plants. 
They  mobilized  public  opinion  via  unconventional 
political  behavior  characteristic  of  earlier  student 
movements,  such  as  demonstrations  and  occa¬ 
sional  sit-ins,  information  campaigns,  and  other 
similar  tactics. 

In  the  mid-1970s,  one  particular  issue  became 
dominant  in  several  European  countries:  nuclear 
energy.  At  this  time,  heavily  influenced  by  the  oil 
crisis,  most  European  governments  had  decided  to 
expand  their  nuclear  energy  programs.  It  was  pre¬ 
cisely  the  issue  of  nuclear  power  that  stimulated 
the  rise  of  environmental  movements  at  the 
national  level,  since  governmental  decisions  on 
nuclear  power  expansion  could  not  be  resolved  at 
the  local  level.  Thus,  more  and  more  local  action 
groups  formed  national  “anti-nuclear  power” 
umbrella  organizations  to  concentrate  their  impact 
on  policy  making.  In  addition  to  issues  of  nuclear 
energy  and  environmental  protection,  the  debate 
over  the  North  Atlantic  Treaty  Organization’s 
(NATO’s)  dual-track  decision  on  intermediate 
nuclear  forces  and  the  stationing  of  cruise  missiles 
and  Pershing  II  rockets  in  Western  Europe  in  the 
late  1970s  strengthened  these  movements.  This 
debate  created  considerable  solidarity  among  the 
new  social  movements  across  Europe. 

In  the  early  1980s,  most  national  social  move¬ 
ments  expected  the  established  left-wing  parties  to 
act  as  effective  safeguards  against  unlimited 
growth,  environmental  degradation,  and  the 
deployment  of  nuclear  weapons.  Yet,  for  different 
reasons,  these  parties  could  not  accommodate 
these  demands.  Hence,  one  of  the  main  reasons  for 
the  foundation  of  the  Greens  was  their  negative 
experience  with  existing  left-wing  parties,  as  well 
as  the  perceived  inability  of  established  political 
parties  and  institutions  to  come  up  with  a  funda¬ 
mentally  different  policy  approach. 

Political  Profile 

Green  parties  in  Western  Europe  can  be  character¬ 
ized  by  three  elements:  First,  most  of  these  parties 
follow  the  idea  of  the  concept  of  New  Politics 
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(emphasizing  quality  of  life,  citizen  participation, 
and  self-actualization  rather  than  material  afflu¬ 
ence  and  economic  growth).  They  generally 
embrace  a  left-wing  standpoint  consisting  of  fervent 
egalitarianism  (equal  rights  for  minorities),  strong 
ecological  and  antinuclear  power  orientation,  and 
solidarity  with  the  Third  World.  They  campaign  for 
a  genuine  sharing  of  wealth  between  rich  and  poor 
nations  and  for  assistance  to  underprivileged  coun¬ 
tries  in  creating  their  own  self-sufficient  economies, 
free  of  financial  domination  by  industrial  nations. 
In  a  nutshell,  green  parties  introduced  a  new  form 
of  programmatic  and  ideological  thinking  that  is 
less  consistent  with  the  traditional  ideological 
Left-Right  dimension.  Instead,  they  offer  a  set  of 
alternative  values  that  differ  significantly  from  the 
established  parties. 

Second,  green  parties  display  a  much  stronger 
preference  for  participatory  party  organization 
than  established  parties.  Their  organizational  struc¬ 
ture  gives  local  party  branches  more  autonomy  in 
decision  making  and  a  maximum  of  interest  articu¬ 
lation  from  grassroots  initiatives.  Thus,  the  process 
of  decentralization  is  considered  an  essential  pre¬ 
condition  for  meaningful  participation. 

The  third  and  last  characteristic  that  sets  green 
parties  apart  is  that  they  have  an  electorate  with 
attributes  that  differ  significantly  from  those  of 
established  parties.  Comparative  data  on  the  elec¬ 
torate  of  the  Greens  have,  for  instance,  shown  that 
followers  of  new  social  movements  are  among  the 
majority  of  green  party  voters.  They  are  for  the 
most  part  young,  highly  educated,  and  occupy 
white-collar  and  government  jobs  where  the  tradi¬ 
tional  class  conflict  is  virtually  nonexistent. 

Although  there  was  some  diversity  in  the  pro¬ 
grammatic  demands,  and  in  the  degree  of  partici¬ 
patory  party  organization,  green  parties  developed 
rapidly:  Between  1980  and  1984,  new  green  par¬ 
ties  were  founded  in  12  Western  European  coun¬ 
tries.  By  the  late  1980s,  these  parties  had  gained 
significant  electoral  and  (in  some  countries)  parlia¬ 
mentary  success. 

Electoral  and  Parliamentary  Performance 

The  electoral  performance  of  green  parties  in 
Western  Europe  has  varied  markedly  across  coun¬ 
tries  over  the  past  3  decades.  Most  of  these  parties 
have  participated  by  now  in  six  or  more  national 


elections  and  have  therefore  stabilized  their  elec¬ 
toral  performance  in  major  European  countries. 
Overall,  the  average  electoral  results  of  green  par¬ 
ties  range  between  1.5%  and  7.3%  of  the  total 
suffrage.  In  six  countries,  Austria,  Belgium,  Finland, 
Germany,  Luxembourg,  and  Switzerland,  electoral 
support  for  the  Greens  was  higher  than  5  %  of  the 
overall  vote  in  the  past  three  national  elections. 

This  comparably  low  electoral  support  for  the 
Greens  is  also  reflected  in  their  parliamentary  rep¬ 
resentation.  During  the  period  ranging  from  1978 
to  1984,  only  three  green  parties  (in  Germany, 
Luxembourg,  and  Switzerland)  have  succeeded  in 
passing  the  electoral  threshold  for  gaining  represen¬ 
tation  in  national  parliaments.  Between  1985  and 
1997,  the  number  of  green  parties  in  parliaments 
has  increased  from  three  to  nine:  Austria,  Belgium, 
Finland,  France,  Germany,  Luxembourg,  Nether¬ 
lands,  Sweden,  and  Switzerland.  Meanwhile,  green 
parties  were  also  represented  in  the  Italian,  Irish, 
Greek,  Czech,  and  Portuguese  parliaments.  How¬ 
ever,  compared  with  most  other  parties,  the  abso¬ 
lute  number  of  green  party  members  in  parliament 
is  rather  low.  The  average  parliamentary  strength 
ranges  from  0.3%  in  Greece  to  6.5%  in  Luxemburg. 
In  sum,  the  electoral  and  parliamentary  perfor¬ 
mance  of  green  parties  in  Europe  has  remained 
marginal  except  for  the  green  parties  in  Austria, 
Belgium,  Germany,  Luxembourg,  and  Switzerland. 

Participation  in  National  Governments 

The  majority  of  green  parties  in  national  parlia¬ 
ments  serve  as  opposition  parties  and  therefore 
have  no  impact  on  government  formation  or  pol¬ 
icy  making.  Outside  Europe,  the  Greens  in  Brazil 
are  the  only  green  party  to  have  participated  in 
government  after  the  2002  national  elections.  In 
Europe,  seven  green  parties  have  entered  govern¬ 
ments  as  coalition  partners:  The  Greens  in  Finland 
were  the  first  to  become  a  member  of  the  ruling 
government  coalition  (1995),  followed  by  the 
Greens  in  Italy  (1996),  France  (1997),  Germany 
(1998),  Belgium  (1999),  Ireland  (2007),  and,  last, 
the  Czech  Republic  (2007).  Interestingly,  the  par¬ 
liamentary  duration  of  these  parties  prior  to  gov¬ 
ernment  participation  varied  substantially.  The 
Greens  in  Belgium  and  Ireland  held  the  longest 
representation  in  national  governments  (18  years 
each),  followed  by  the  German  Greens  (15  years), 


1054  Grounded  Theory 


the  Finnish  Greens  (12  years),  and  the  Italian 
Greens  (9  years).  The  French  and  the  Czech 
Greens  were  immediately  invited  to  participate  in 
government  after  having  entered  parliament  in  a 
national  election.  Thus,  we  can  differentiate 
between  three  types  of  governing  green  parties: 
First,  the  professionals,  defined  as  green  parties 
with  a  long  duration  of  preparliamentary  and 
parliamentary  participation  before  entering  gov¬ 
ernment  (e.g.,  in  Germany  and  Finland);  second, 
the  parliamentary  experienced  Greens  who  have 
accumulated  long  experiences  of  parliamentary 
participation  but  low  duration  in  preparliamen- 
tary  activities  (e.g.,  Belgium,  Italy,  and  Ireland); 
third,  the  preparliamentary  experienced  Greens 
who  had  some  extraparliamentary  experience 
before  entering  parliament  and  government  (e.g., 
in  France  and  the  Czech  Republic). 

The  Impact  of  Green  Parties 

The  individual  and  collective  political  relevance  of 
the  green  parties  can  be  summarized  in  two  points: 
First,  green  parties  mobilize  many  citizens  with 
new  value  orientations  by  making  it  possible  for 
them  to  find  expressions  for  their  views  at  the  bal¬ 
lot  box.  The  Greens  therefore  serve  as  political 
vehicles  for  those  citizens  whose  grievances  have 
been  ignored  by  the  larger  established  parties.  By 
making  themselves  the  spokespeople  of  the  dissat¬ 
isfied,  green  parties  also  stimulate  a  process  of 
change  of  party  loyalties  and  are  paving  the  way 
for  increasing  volatility  within  Western  European 
party  systems.  Second,  green  parties  affect  political 
issues  and  the  tone  of  political  life  by  bringing 
controversial  matters  into  the  public  debate.  If  the 
issues  prove  popular,  they  are  adopted  by  one  or 
more  of  the  larger  established  parties.  This  led  to 
changes  in  the  programs  of  major  European  par¬ 
ties.  Finally,  and  by  means  of  conclusion,  the 
political  impact  of  green  parties  to  a  very  large 
extent  depends  on  the  emergence  of  New  Politics 
issues  influencing  mass  public  opinion  and  the 
inability  of  larger  left-wing  or  right-wing  estab¬ 
lished  parties  to  harness  these  issues  and  integrate 
them  into  their  political  agenda. 

Ferdinand  Miiller-Rommel 
Leupbana  Universitiit  Liineburg 
Liineburg,  Germany 
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Grounded  Theory 


Developed  by  the  sociologists  Anselm  L.  Strauss 
and  Barney  Glaser  at  the  University  of  California 
at  San  Francisco  in  the  late  1960s,  grounded  the¬ 
ory  (GT)  has  become  one  of  the  foremost  research 
styles  in  qualitative  and  interpretative  social, 
political,  and  educational  research.  Aiming  at 
empirically  GT  generation  rather  than  at  theory 
testing  or  mere  description,  GT  provides  the 
researcher  with  a  rich  set  of  research  heuristics, 
the  so-called  constant  comparative  method.  GT  is, 
unlike  ethnography  or  interviewing,  not  a  method 
of  data  generation.  It  is  also,  despite  its  somewhat 
confusing  name,  not  a  theory.  Instead,  it  is  a  gen¬ 
eral  model  for  certain  types  of  research  designs, 
the  results  of  which  are  substantive  theories  of  the 
field  under  scrutiny.  This  entry  discusses  the  ori¬ 
gins  of  GT,  basic  procedures,  its  role  in  political 
science,  and  recent  developments. 
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Origins 

Epistemologically,  as  well  as  in  terms  of  social 
theory,  the  origins  of  GT  are  primarily  in  early 
American  pragmatism  and  interactionism  as  repre¬ 
sented  by  Strauss.  At  the  sociology  department  of 
the  University  of  Chicago,  he  was  trained  in  the 
pragmatist  logic  of  inquiry  (of  Charles  Peirce,  John 
Dewey,  and  George  H.  Mead),  with  its  strong 
notion  of  reality  being  brought  forth  constantly  in 
interactively  dealing  with  the  physical  and  social 
restraints  of  the  “world  out  there.”  By  the  same 
token,  Strauss  was  influenced  by  the  empirical 
fieldwork  practice  of  E.  C.  Hughes,  a  descendent 
of  Robert  E.  Park’s  human  ecology  approach.  A 
second  strand  of  origin  lies  in  the  Columbia  School 
and  its  leanings  toward  the  development  of  mid¬ 
dle-range  theories — a  heritage  brought  into  the 
project  by  Glaser,  who  was  trained  under  Robert 
K.  Merton  before  he  worked  with  Strauss  in  San 
Francisco.  While  Strauss  advocated  the  need  of  a 
research  style  that  allows  for  creativity  to  be 
brought  into  the  scientific  process,  Glaser  repre¬ 
sented  the  call  for  rigor  in  qualitative  and  interpre¬ 
tative  research  and  theory  development. 

Developed  in  practice  during  an  organizational 
ethnography  on  death  and  dying  in  hospital  wards, 
the  first  monograph  on  GT,  instead  of  being 
a  method  textbook,  served  more  as  a  political 
statement  against  the  orthodox  consensus  of  struc¬ 
tural-functionalist  theories  and  logico-hypothetical 
methods.  Published  in  1967,  amid  the  political 
uprising  of  the  civil  rights  and  student  movements, 
The  Discovery  of  Grounded  Theory,  and  its  meth¬ 
odological  statement,  was  meant  to  support  these 
new  ways  of  thinking  among  young  researchers. 
Since  GT  was  among  the  first  qualitative  research 
styles  spelled  out  in  its  practical  details  and  at  the 
same  time  offered  a  way  to  systematically  develop 
empirically  grounded  scientific  theories,  it  was 
quickly  adopted  by  a  broader  group  of  social  science 
researchers.  While  originally  developed  in  the  field 
of  medical  sociology,  GT  has  gained  a  standing  in 
most  areas  of  sociology,  psychology,  education,  eth¬ 
nography,  and  social  anthropology. 

Basic  Procedures  and  Heuristics 

Though  advocating  rigor  in  theory  development, 
the  most  basic  rule  of  GT  is  that  it  is  not  to  be 


understood  as  a  rigid  set  of  rules.  Researchers  are 
meant  to  grasp  the  general  guidelines  of  GT  in  an 
interpretative  and  flexible  manner  in  order  to 
adapt  its  heuristics  to  the  circumstances  of  practi¬ 
cal  research.  Among  these  basic  guidelines  is  the 
call  for  understanding  the  entire  case — that  is, 
instead  of  either  comparing  single  items  on  various 
cases  (as  in  quantitative  research)  or  interpreting  in 
depth  single  cases  (as  in  hermeneutic  approaches), 
GT  developed  a  heuristic  that  integrates  key  fea¬ 
tures  of  both  strategies  into  its  constant  compara¬ 
tive  method.  A  second  basic  rule  underlines  not 
only  the  inevitability  but  also  the  need  for  and  the 
advantage  of  the  researchers’  subjectivity  and  cre¬ 
ativity.  Following  Dewey’s  notion  of  research  as 
iterations  of  problem-solving  cycles,  it  is  the  cre¬ 
ativity  of  abductive  processes  that  bridges  the  gap 
between  a  problem  systematically  laid  out  in  pro¬ 
cesses  of  logical  reasoning  and  results  obtained  in 
experimental  practice.  The  notion  of  creativity 
goes  hand  in  hand  with  the  pragmatist  claim  that 
there  is  no  universal  reality  to  discover.  Third, 
research  is  seen  as  a  joint  endeavor  of  a  community 
of  practice  comprising  not  only  the  research  team 
but  also  the  scientific  community.  In  terms  of  GT, 
this  endeavor  is  understood  as  a  work  in  progress, 
which  gives  scientific  activities  a  down-to-earth 
flavor  of  practical  day-to-day  activities  that  need 
to  be  organized  and  articulated  interactively  among 
research  participants.  Finally,  GT  assumes  social 
reality  to  consist  of  complex  phenomena  requiring 
a  similarly  rich  theoretical  understanding  that  pri¬ 
marily  accounts  for  variation.  This  includes  the 
notion  of  theory  as  a  never-ending  process. 

In  GT,  both  analysis  and  theory  building  start 
immediately  with  scrutinizing  the  material  pro¬ 
duced  in  investigating  a  first  case.  Thus,  analysis 
here  is  thought  of  as  a  continual  interplay  of  data 
generation,  data  inspection,  and  theoretical  rea¬ 
soning  in  an  iterative-cyclical  procedure.  Single 
incidents  of  phenomena  occurring  in  the  data  are 
compared  with  each  other  to  develop  a  concep¬ 
tual,  theoretical  understanding  of  what  these 
incidents  mean  with  respect  to  the  research  ques¬ 
tion  under  scrutiny.  No  a  priori  hypothesis 
drives  this  process.  The  researcher  starts  from  a 
research  question  that  in  the  beginning  is  more 
like  a  vague  research  curiosity:  What  is  going  on 
here?  It  is  the  interplay  of  data  analysis  and  con¬ 
ceptual  thinking  that  at  the  same  time  allows  for 
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a  step-by-step  stating  of  the  research  question 
more  precisely. 

The  comparative  analytical  mode  employed  in 
GT  is  called  coding — that  is,  occurrences  of  a  cer¬ 
tain  phenomenon  in  the  data  are  compared  with 
other  occurrences  of  the  same  or  a  similar  phenom¬ 
enon.  Here,  two  comparative  heuristics  alternate. 
Whereas  in  the  first  phase  the  analyst  compares 
incidents  where  the  phenomenon  occurs  that  are  as 
similar  as  possible  (homogeneous  cases),  in  the  sec¬ 
ond  step,  he  or  she  will  compare  these  with  inci¬ 
dents  as  different  from  the  first  ones  as  possible 
(heterogeneous  cases).  The  first  step  serves  to 
develop  the  theoretically  relevant  properties  of  the 
concept  to  be  constructed  on  the  basis  of  the  empir¬ 
ical  data.  While  still  adding  to  the  development  of 
properties,  the  second  step  is  more  dedicated  to 
determining  the  scope  of  the  concept.  The  aim  is  to 
find  out  under  which  conditions  and  in  which  con¬ 
texts  the  concept  still  holds  and  which  variations  of 
the  concept  are  necessary  in  order  to  account  for 
the  complexity  of  the  phenomenon  in  reality.  This 
scope  is  called  conceptual  representativeness. 

The  coding  process  consists  of  different  modes. 
As  the  GT  procedures  of  Strauss  and  Glaser  have 
been  developed  in  different  directions  over  time, 
there  is  a  Glaserian  two-mode  procedure  and  a 
Straussian  three-mode  procedure.  Both  Glaser  and 
Strauss  start  with  open  coding,  which  tries  to 
break  up  the  data  in  a  line-by-line  analysis  and 
develop  those  units  into  theoretical  concepts  that 
in  light  of  the  research  question  appear  to  be  rele¬ 
vant.  Glaser  on  his  part  continues  with  a  coding 
mode  called  theoretical  coding.  Here,  the  concepts 
derived  inductively  from  the  data  are  compared 
with  and  linked  with  theoretical  codes  chosen 
among  a  larger  number  of  code  families  (e.g., 
dimensions,  cultures,  and  types).  The  aim  of  this 
procedural  step  is  for  the  researcher  to  come  up 
with  a  central  core  category.  Viewed  from  the  per¬ 
spective  of  this  core  category,  the  analytical  struc¬ 
ture  developed  in  the  course  of  the  project  should 
answer  the  research  question.  Strauss  in  his  more 
widely  used  approach  instead  suggests  two  further 
coding  modes:  axial  and  selective  coding.  In  axial 
codings  researchers  seek  to  establish  empirically 
grounded  relations  between  different  concepts, 
often  by  using  a  so-called  coding  paradigm  that 
offers  a  set  of  basic  questions  to  answer  for  a  phe¬ 
nomenon  under  scrutiny.  These  questions  serve  to 


account  for  contexts,  intervening  conditions,  cau¬ 
salities,  strategies,  and  consequences.  Though  not 
operating  with  theoretical  code  families,  selective 
coding  likewise  aims  at  discovering  a  core  cate¬ 
gory,  thereby  integrating  the  variety  of  concepts 
into  a  coherent  theoretical  perspective. 

As  analysis  does  not  limit  itself  to  a  single  case, 
empirical  material  on  further  cases  is  integrated  in  a 
step-by-step  procedure  called  theoretical  sampling. 
Different  from  random  sampling,  this  selection 
strategy  draws  on  the  basic  findings  of  the  theory 
under  construction.  By  posing  generative  questions 
derived  from  open  ends  in  the  theory-building  pro¬ 
cess,  the  researcher  determines  which  material  on 
which  types  of  cases  is  needed  to  continue  the  rele¬ 
vant  comparative  strategies  at  a  certain  point  in  the 
project.  Thus,  not  only  homogeneous  and  heteroge¬ 
neous  sampling  strategies  on  cases  and  materials 
but  also  on  incidents  within  the  material  alternate 
flexibly.  The  criterion  for  switching  from  one  strat¬ 
egy  to  the  other  is  called  theoretical  saturation.  A 
certain  comparative  step  ends  when  the  researcher 
does  not  gain  any  more  conceptually  relevant 
insights  from  it. 

An  important  issue  in  most  qualitative  strategies 
is  the  question  of  how  to  deal  with  previous  practi¬ 
cal  and  theoretical  knowledge.  Right  from  the 
beginning,  GT  has  underscored  the  importance  of 
studying  social  phenomena  unbiased  from  pregiven 
theoretical  frameworks  and  has  encouraged  re¬ 
searchers  to  start  their  empirical  investigations  right 
away.  At  times,  this  has  been  critically  interpreted 
as  a  tabula  rasa — a  critique  that  was  encouraged  by 
the  late  Glaser’s  naive  inductivism.  However,  even 
in  the  early  writings,  Glaser  and  Strauss  denied 
starting  with  a  tabula  rasa  and  Strauss  for  his  part 
repeatedly  advocated  the  advantages  of  a  knowl¬ 
edge-based  theoretical  sensitivity  as  opposed  to  the 
definiteness  of  pregiven  theoretical  frameworks. 

Grounded  Theory  and 
Political  Science  Research 

Compared  with  other  social  sciences,  the  usage  of 
GT  in  political  sciences  is  less  widespread.  This  is 
true  not  only  for  GT  but  also  for  all  kinds  of  sound 
and  systematic  qualitative  research  methods. 
Overall,  qualitative  research  is  not  yet  well  estab¬ 
lished  in  political  science.  This  might  be  due  to  the 
fact  that  this  field  has  a  long  tradition  of  normative, 
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value-driven  theorizing,  which  fits  well  both  with 
theory-testing  procedures — as  a  test  whether  the 
reality  of  political  systems  indeed  behaves  as  main¬ 
tained  in  normative  theories  (e.g.,  theories  of 
democracy) — and  with  presenting  exemplary  evi¬ 
dence  in  an  illustrative  manner.  The  place  of  GT,  in 
contrast,  lies  in  the  domain  of  developing  explana¬ 
tory  theories  of  interactions  between  political 
actors  as  representatives  of,  for  example,  political 
parties,  nongovernmental  organizations  (NGOs), 
legal  institutions,  or  lay  groups.  It  can  be  observed, 
however,  that  young  scholars  from  the  field  of 
political  science  are  increasingly  attracted  by  GT, 
for  example,  for  studying  policy  making,  commu¬ 
nity  power  structures,  or  empowerment  processes. 

Variants  and  Add-Ons  of  Grounded  Theory 

Over  the  more  than  40  years  of  its  existence,  GT 
has  developed  into  a  number  of  variants:  some  of 
them  challenging  the  original  claim,  others  repre¬ 
senting  add-ons  to  the  basic  model.  The  most 
important  split  happened  between  the  two  origina¬ 
tors  of  GT  during  the  early  1990s,  when  Glaser 
accused  Strauss  and  his  coresearcher  Juliet  Corbin 
of  forcing  the  data  into  preconceived  concepts 
(e.g.,  by  using  the  coding  paradigm).  Glaser  him¬ 
self  proposed  a  purely  inductive  version  of  GT  and 
rejected  both  the  use  of  previous  knowledge  and 
the  verification  of  data.  Strauss  on  his  part  regards 
previous  knowledge  as  an  inevitable  and  useful 
precondition  of  empirical  research  but  only  when 
it  is  used  in  a  sensitizing  instead  of  a  definitive 
manner.  Also,  he  insisted  on  the  need  for  a  con¬ 
stant  process  of  experimental  verification  to  come 
up  with  scientific  theories  based  on  which  predic¬ 
tions  of  future  developments  can  be  made.  Recently, 
some  of  Strauss’s  students  came  up  with  alterna¬ 
tive  or  additional  concepts  of  how  to  do  GT.  Adele 
Clarke,  for  example,  suggests  further  developing 
GT  into  a  situational  analysis  by  using  a  set  of 
mapping  techniques  to  better  represent  the  various 
perspectives  occurring  in  the  data.  Another  strand 
of  discussion  introduced  by  Kathy  Charmaz  cen¬ 
ters  on  the  notion  of  a  constructivist  GT  as 
opposed  to  older,  objectivist  versions  of  GT. 

Jorg  Striibing 
Universitdt  Tubingen 
Tubingen,  Germany 
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Groupthink 


The  rather  Orwellian-sounding  word  groupthink 
is  one  of  those  pieces  of  social-scientific  terminol¬ 
ogy  that  has  entered  popular  usage,  where  it  is 
now  used  rather  loosely  and  colloquially  to  refer 
to  defective  or  dysfunctional  group  decision  mak¬ 
ing  in  general;  the  U.S.  Senate  Intelligence 
Committee  concluded  in  2004,  for  instance,  that 
there  was  evidence  of  groupthink  in  the  mistaken 
assessment  that  there  were  weapons  of  mass 
destruction  (WMD)  in  Iraq  prior  to  the  2003 
American  invasion.  The  chairs  of  that  committee 
were  using  the  term  to  refer  to  a  kind  of  general¬ 
ized  or  collective  misperception.  In  the  study  of 
public  policy,  foreign  policy  analysis,  psychology, 
and  management,  however,  groupthink  has  a 
rather  more  precise  usage.  It  was  originated  by  the 
late  social  psychologist  Irving  Janis  in  his  book 
Victims  of  Groupthink ,  first  published  in  1972 
and  later  reissued  in  a  revised  form  10  years  later 
as  simply  Groupthink.  Although  Janis  himself  did 
apply  his  theory  to  domestic  cases  such  as  the 
Watergate  scandal — and  the  theory  has  also  been 
used  to  explain  management  fiascoes  such  as  the 
1986  decision  to  launch  the  space  shuttle 
Challenger — within  political  science  it  has  exerted 
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its  greatest  impact  on  the  study  of  foreign  policy 
decision  making.  The  theory  has  been  heavily 
criticized  in  recent  years  for  reasons  that  are 
detailed  below,  but  Janis’s  basic  assumption  that 
individual  behavior  can  be  altered  by  group  pres¬ 
sure  is  supported  by  a  substantial  body  of  empiri¬ 
cal  research. 

Janis  defined  the  groupthink  phenomenon  as  a 
process  through  which  a  group  reaches  a  hasty  or 
premature  consensus  and  then  becomes  closed  to 
outside  ideas  or  alternative  thoughts  within.  The 
effort  to  achieve  unanimity  and  agreement  over¬ 
rides  the  desire  to  consider  a  full  range  of  policy 
alternatives.  High  group  cohesion  or  excessive 
camaraderie  lies  at  the  heart  of  the  groupthink 
phenomenon  and  may  develop  where  the  members 
think  too  much  alike  or  have  developed  powerful 
links  of  friendship  or  collegiality  that  inhibit  the 
challenging  of  assumptions.  While  Janis  did  not 
feel  that  groupthink  was  inevitable  in  such  a  situa¬ 
tion,  he  maintained  that  this  kind  of  group  can  fall 
victim  to  groupthink  where  members  of  the  group 
come  to  prize  “concurrence  seeking”  (unanimity 
or  agreement)  over  the  comprehensive,  rational 
consideration  of  all  available  courses  of  action. 
Janis  illustrates  his  theory  with  a  range  of  exam¬ 
ples  in  which  he  claims  that  groupthink  played  a 
central  role  in  the  decision  making,  including  the 
1941  decision  by  U.S.  admirals  at  Pearl  Harbor  to 
ignore  reports  that  the  naval  base  was  about  to  be 
attacked  by  Japan,  the  1961  Bay  of  Pigs  fiasco, 
and  the  1965  escalation  of  the  Vietnam  War.  He 
contrasts  this  with  what  he  calls  vigilant  appraisal — 
in  which  decision  makers  do  rigorously  and  thor¬ 
oughly  consider  all  possible  options — and  holds  up 
the  Cuban  missile  crisis  of  October  1962  as  a 
notable  instance  of  a  case  in  which  this  kind  of 
superior  process  occurred. 

According  to  Janis,  groupthink  has  a  number 
of  antecedent  conditions  in  addition  to  high  group 
cohesiveness.  For  instance,  it  is  encouraged  where 
the  group  is  insulated  from  outside  advice,  where 
an  aggressive  or  opinionated  leader  prevents 
meaningful  debate,  where  norms  requiring  the  use 
of  methodical  procedures  are  absent,  where  most 
members  of  the  group  come  from  a  similar  social 
and  educational  background  and/or  think  alike, 
where  the  group  is  confronting  high  levels  of 


stress,  and  where  its  members  are  experiencing 
temporarily  low  levels  of  self-esteem.  Janis  also 
identifies  a  number  of  symptoms  that  can  be  used 
to  diagnose  the  presence  of  groupthink.  These 
include  the  presence  of  an  illusion  of  invulnera¬ 
bility  among  the  group;  collective  rationalization 
about  the  risks  of  a  chosen  option;  a  belief  in  the 
inherent  morality  of  the  group,  in  which  mem¬ 
bers  come  to  believe  in  the  exclusive  moral  right¬ 
ness  of  their  cause;  the  appearance  of  stereotyped 
views  of  outgroups  or  “the  enemy”;  the  exertion 
of  direct  pressure  on  those  who  dissent  from  the 
majority  opinion;  self-censorship,  where  doubt¬ 
ers  fail  to  express  their  true  feelings;  the  false 
perception  of  unanimity,  where  in  fact  some 
members  may  harbor  personal  doubts  about  the 
policy  option  chosen;  and  finally,  the  emergence 
of  self-appointed  “mind  guards,”  where  mem¬ 
bers  of  the  group  take  it  on  themselves  to  protect 
the  group  and  its  leader  from  dissenting  views 
(guarding  minds  much  as  a  bodyguard  guards  us 
physically). 

It  is  important  to  realize  that  groupthink  is 
essentially  a  theory  about  the  policy  process;  it  does 
not  necessarily  tell  us  about  the  outcomes  of  deci¬ 
sion  making.  In  other  words,  a  process  free  of 
groupthink  will  not  guarantee  policy  success,  nor 
will  procedures  fraught  with  groupthink  necessar¬ 
ily  lead  to  policy  disaster  (although  Janis  argued 
that  the  likelihood  of  an  undesirable  outcome  was 
greatly  increased  by  the  presence  of  groupthink).  It 
is  possible  that  sheer  luck,  for  instance,  will  inter¬ 
vene  in  a  rather  fortuitous  fashion  to  break  the 
general  causal  chain  between  process  and  outcome. 

In  the  years  since  Janis’s  book  was  first  pub¬ 
lished,  the  theory  of  groupthink  has  been  sub¬ 
jected  to  a  variety  of  criticisms,  and  it  has  been 
adapted  and  reformulated  in  various  ways. 
Probably  the  most  common  criticism  of  Janis’s 
theory  is  that  it  is  too  vague  and  imprecise — 
allowing  almost  anything  that  might  be  labeled 
poor  collective  decision  making  to  be  categorized 
as  an  instance  of  groupthink — and  some  have  cri¬ 
tiqued  the  theoretical  coherence  of  the  model  Janis 
developed.  As  Philip  Tetlock  and  his  colleagues 
have  suggested,  four  broad  criticisms  can  be  raised 
against  Janis’s  work  in  this  vein:  First  of  all,  Janis 
relied  on  qualitative  case  studies,  a  method  that 
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frequently  tempts  the  researcher  to  emphasize  evi¬ 
dence  that  “fits”  a  theory;  second,  there  is  a  suspi¬ 
cious  correlation  between  the  presence  of  group¬ 
think  and  flawed  decision  making  in  Janis’s  book, 
even  though  Janis  concedes  that  process  is  not 
everything;  third,  critics  have  noted  an  equally 
suspicious  all-or-nothing  quality  to  the  way  in 
which  Janis’s  case  fits  so  conveniently  into  the 
categories  of  groupthink  or  vigilant  decision  mak¬ 
ing;  and  last,  there  are  various  conceptual  prob¬ 
lems  with  the  model  itself,  especially  those  having 
to  do  with  distinguishing  the  causes  from  the 
consequences  of  groupthink.  Certainly,  Janis 
lumps  together  a  bewildering  variety  of  factors; 
some  have  questioned  the  inclusion  of  a  belief  in 
the  inherent  morality  of  the  group  and  the  stereo¬ 
typed  views  of  outgroups  in  his  list  of  antecedent 
factors,  for  instance,  since  these  appear  to  have 
little  to  do  with  consensus  formation. 

Others  have  critiqued  the  model  on  empirical 
grounds,  and  later  analysts  have  challenged  Janis’s 
interpretation  of  the  Bay  of  Pigs  and  Vietnam  case 
studies,  the  two  centerpieces  of  the  book  Group¬ 
think.  The  contention  that  Lyndon  Johnson  did 
not  receive  competing  advice  on  Vietnam,  or  that 
he  and  others  felt  “an  illusion  of  invulnerability” 
on  this  issue,  has  been  strongly  questioned  in 
recent  years.  The  evidence  we  now  have  suggests 
that  Johnson  himself  agonized  quite  extensively 
over  the  decision  to  escalate  the  war.  The  recent 
declassification  of  secretly  taped  phone  calls  that 
Johnson  made  in  late  1964  and  throughout  1965 
reveals  a  president  who  was  deeply  troubled  by  the 
issue  of  Vietnam  and  quite  pessimistic  about  the 
chances  of  winning  the  war.  Similarly,  it  has  also 
been  argued  that  Janis’s  interpretation  of  the  Bay 
of  Pigs  episode  sits  poorly  with  what  we  now 
know.  In  giving  the  go-ahead  for  the  invasion  of 
Cuba,  John  F.  Kennedy  and  his  advisers  may  have 
been  influenced  not  so  much  by  a  feeling  that  they 
could  not  fail  as  they  were  by  faulty  (and  perhaps 
deliberately  misleading)  intelligence  assessments 
provided  by  the  CIA  (Central  Intelligence  Agency). 
Rather  than  being  influenced  by  group  processes, 
Kennedy  may  have  simply  believed  that  a  great 
World  War  II  general  like  Dwight  Eisenhower 
would  not  have  advised  him  to  go  ahead  with  a 
plan  that  had  little  chance  of  success. 


One  response  to  these  kinds  of  criticisms  has 
been  to  reformulate  Janis’s  original  argument  in 
the  effort  to  lend  it  more  theoretical  rigor.  Some 
have  attempted  to  better  specify  the  kind  of  leader¬ 
ship  behavior  that  gives  rise  to  groupthink.  Paul ’t 
Hart,  on  the  other  hand,  argues  in  his  study  of  the 
Iran-Contra  decision  making  under  the  Reagan 
administration  that  groupthink  is  not  one  phe¬ 
nomenon  but  two.  Noting  that  the  decision  mak¬ 
ers  in  some  of  Janis’s  cases  do  not  appear  to  be 
laboring  under  an  illusion  of  invulnerability,  ’t 
Hart  distinguishes  between  what  he  calls  Type  I 
Groupthink  (or  Collective  Avoidance)  and  Type  II 
Groupthink  (Collective  Overoptimism),  ’t  Hart 
argues  that  the  first  kind  of  groupthink  occurs 
when  members  perceive  that  the  likelihood  of  suc¬ 
cessfully  resolving  the  problem  confronting  the 
group  is  low,  while  the  latter  occurs  when  the 
likelihood  of  success  is  seen  as  high.  Prompted  by 
the  desire  to  “look  beyond  groupthink,”  analysts 
of  foreign  policy  decision  making  have  also  reex¬ 
amined  the  wider  literature  on  group  behaviors 
within  social  psychology  for  clues  as  to  how  other 
theoretical  frameworks  might  be  developed.  One 
such  approach  has  become  known  as  the  new- 
group  syndrome.  Paul  ’t  Hart,  Eric  Stern,  and 
Bengt  Sundelius,  the  theory’s  major  advocates, 
have  proposed  that  in  ad  hoc  or  newly  formed 
groups,  the  lack  of  well-agreed  norms  and  the 
absence  of  any  existing  organizational  culture  cre¬ 
ate  uncertainty  among  the  members.  Group  con¬ 
formity  is  especially  likely  under  such  circum¬ 
stances,  they  argue,  and  members  are  particularly 
likely  to  accept  strong  direction  from  a  leader  or 
other  dominant  individuals. 

Nevertheless,  Janis’s  theory  has  also  been  sup¬ 
ported  by  a  large  body  of  empirical  research  that 
has  essentially  confirmed  his  original  claims  about 
the  ways  in  which  group  cohesion  can  lead  to 
defective  decision  making.  Perhaps  the  greatest 
strength  of  this  approach  is  that  it  highlights  the 
fact  that  groups  are  more  than  the  sum  total  of  the 
individuals  who  compose  them;  once  policymakers 
form  a  group,  the  resultant  body  subsumes  (and 
may  overwhelm)  its  members.  Individuals  often 
behave  differently  in  groups  from  how  they  do  on 
their  own,  and  the  greatest  contribution  of  group¬ 
think  has  been  the  way  in  which  it  highlights  the 
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fact  that  even  the  “best  and  the  brightest”  can  fall 
prey  to  dysfunctional  group  dynamics. 

David  Patrick  Houghton 
University  of  Central  Florida 
Orlando,  Florida,  United  States 
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Health  policy  denotes  normative  and  procedural 
dimensions  of  assuring  physical,  mental,  and  social 
well-being,  including  but  transcending  medical 
care.  Spanning  normatively  and  empirically  con¬ 
tested  domains,  however,  health  policy  means  dif¬ 
ferent  things  to  different  people.  Sometimes  it 
concerns  content  such  as  financing  or  organizing 
health  services;  sometimes  it  emphasizes  the  pro¬ 
cess  by  which  valued  ends  are  sought  and  occa¬ 
sionally  attained;  and  sometimes  it  implies  power 
and  whatever  else  may  influence  the  formation  and 
implementation  of  health-related  activities.  This 
entry  describes  the  values  and  practices  intrinsic  to 
health  policy,  focusing  on  six  areas:  (1)  definitions 
of  health,  (2)  values  underlying  responsibility  for 
health,  (3)  the  context  of  health  care  practices, 

(4)  political  actors  in  the  health  policy  process, 

(5)  options  in  providing  health  care,  and 

(6)  reforms  in  health  care  systems. 

Definitions 

Definitions  of  health  range  from  perfect  well-being 
through  functional  capacity  to  psychological  per¬ 
ceptions.  According  to  the  World  Health 
Organization,  health  is  considered  a  state  of  phys¬ 
ical,  mental,  and  social  well-being  and  not  merely 
the  absence  of  disease  or  infirmity.  For  the  func¬ 
tionalist  Talcott  Parsons  (1964),  “Health  is  a  state 
of  optimum  capacity  of  an  individual  for  the  effec¬ 
tive  performance  of  the  roles  and  tasks  for  which 


he  has  been  socialized”  (p.  274).  For  the  postmod¬ 
ernist  Walsh  McDermott  (1977),  “Health  is  a  rela¬ 
tive  state  that  represents  the  degree  to  which  an 
individual  can  operate  effectively  within  the  cir¬ 
cumstances  of  his  hereditary  and  his  physical  and 
cultural  environment”  (p.  136). 

Health  was  originally  an  individual  concern, 
then  a  concern  of  the  family,  and  then  of  larger 
collectivities  up  to  and  including  society.  Given  the 
changes  in  technology,  health  moved  beyond 
physical  encounters  (the  germ  theory  of  disease)  to 
socioeconomic  concerns  (contributions  to  produc¬ 
tivity)  and  postmodern  reconstruction  of  well¬ 
being  (cultural  coping  with  chronic  disability). 

Different  societies  perceive  health  in  different 
ways  and  differ  about  the  meaning  of  disease  and 
ill  health.  Beliefs  about  health,  disease,  and  illness 
also  differ  according  to  gender,  socioeconomic 
status,  profession,  and  social  role.  As  definitions 
of  health  change  over  time,  perceived  health  needs 
of  people  change.  Paradigms  about  what  consti¬ 
tutes  health  have  been  broadened  by  scientific 
investigations  as  well  as  the  emergence  of  new 
ideas. 

Prevention  of  illness  is  the  first  line  of  defense; 
restoration  or  rehabilitation  of  health  through 
medical  intervention  is  the  remedial  approach;  and 
capacity  to  cope  with  chronic  conditions  is  a  third 
method.  Systems  of  health  services  (which  include 
a  gamut  of  opportunities  and  interventions  ranging 
from  psychosocial  supports  through  episodic  phar¬ 
maceuticals  to  invasive  therapies  such  as  surgery) 
are  valued  in  all  societies  and  provide  an  arena 
within  which  occur  struggles  over  the  allocation  of 
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values  as  well  as  the  pursuit  and  exercise  of  power 
that  represent  both  policy  and  politics. 

Values 

Anthropologists  note  that  the  most  important 
thing  to  know  about  people  is  what  they  take  for 
granted.  Unarticulated  values  guide  behavior  and 
generate  consensus  about  its  intrinsic  propriety. 
Only  when  challenged  by  other  deeply  held  views 
do  such  basic  orientations  become  manifest — and 
then  perhaps  only  dimly.  But  although  elusive  and 
pervasive,  basic  orientations  serve  as  selective  fil¬ 
ters  that  inhibit  or  distort  comprehension  of  alter¬ 
native  viewpoints  based  on  other  equally  basic  but 
unarticulated  values. 

Common  values  and  beliefs  determine  the 
demands  made  on  health  policymakers.  The  more 
widely  values  and  beliefs  are  shared,  the  easier  and 
greater  the  public’s  acceptance  of  proposed  policies. 
Public  opinion  sets  the  boundaries  and  direction  of 
health  policy,  while  the  social  system  sensitizes 
policymakers  about  health  demands  and  supports. 
Most  countries  share  basic  goals  in  health  policy: 
universal  (or  near-universal)  access  to  health 
services,  equity  in  sharing  the  financial  burden  of 
illness,  and  good-quality  health  care.  Given  the 
ever-larger  share  of  public  money  in  funding  health 
care,  governments  are  increasingly  concerned  about 
cost  control  and  efficiency.  Patient  satisfaction, 
patient  choice,  and  autonomy  of  professionals  are 
important  goals  too. 

Normative  preferences  about  responsibility  for 
health  care  range  from  a  commitment  to  individu¬ 
alism,  where  a  person  is  the  only  one  responsible 
for  his  or  her  well-being,  to  an  abiding  concern  for 
general  welfare,  where  the  state  (acting  on  behalf 
of  all  of  society)  ensures  an  all-inclusive  system  of 
health  care  for  all.  Empirical  practice  involves 
three  dimensions  of  health  care  services:  (1)  equi¬ 
table  access,  (2)  effective  quality,  and  (3)  efficient 
production. 

One  cleavage  that  informs  health  policy  divides 
those  who  stress  an  essential  solidarity  among  all 
humans  (at  least  those  within  a  delimited  territory) 
from  those  who  espouse  the  individual  responsibil¬ 
ity  of  each  human  being  (with  perhaps  concessions 
for  age-specific  categories  such  as  children  and  the 
elderly).  The  former  view  is  egalitarian  or  solidar- 
istic;  the  latter  is  characteristic  of  libertarianism  or 


radical  individualism.  As  modern  communications 
globalize  Western  orientations,  their  spread  elicits 
challenges  and  responses  from  non-Western  societ¬ 
ies  about  how  to  regard,  amend,  or  replace  these 
values.  “Great”  traditions  (to  use  Robert  Redfield’s 
term)  such  as  the  Confucian  and  Islamic  as  well  as 
“little”  traditions  such  as  regional  variants  and 
local  folkways  provide  options  as  they  interact 
with  the  globalized  values  of  a  secularized  Judeo- 
Christian  culture. 

Context 

Health  policy  cannot  be  divorced  from  the  situa¬ 
tional  and  historical  context  in  which  it  is  made. 
The  legacy  of  previous  policies,  including  the 
absence  of  such  policies  altogether,  provides  a  con¬ 
textual  limit  on  policy  options.  The  situational 
environment  provides  demands  for  policy  action 
plus  support  for  as  well  as  constraints  on  what 
policymakers  can  do.  The  environment  within 
which  policy  is  made  as  well  as  the  substantive  pro¬ 
cesses  that  underlie  the  formation  and  implementa¬ 
tion  of  specific  health  policies  must  be  understood. 

Health  policy  analysis  is  applicable  to  all  soci¬ 
eties.  Regularities  in  the  policy  process  and 
similarities  among  actors  who  try  to  influence  it 
transcend  differences  in  levels  or  degrees  of  devel¬ 
opment.  Within  government  are  policymakers 
including  politicians,  advisers,  judges,  and  admin¬ 
istrators.  Outside  governments  are  groups  with 
major  stakes  in  policies  that  affect  them;  these 
range  from  organized  institutions  to  social  forces 
and  latent  interests,  and  even  foreign  entities. 
How  far  such  groups  participate  in  policy  making 
depends  on  how  open  or  closed  the  political  sys¬ 
tem  is.  Even  if  the  array  of  problems  and  their 
solutions  are  different,  procedures  and  routines  in 
policy  making  are  comparable. 

While  empirical  experience  with  goals  and 
means  for  health  policy  indicates  potential  global 
convergence  on  patterns  of  performance,  countries 
implement  changes  within  their  own  historical 
legacies  and  within  the  constraints  of  existing 
national  institutions  and  political  boundaries.  The 
timing  and  speed  of  change  vary  as  well.  Some 
governments  implement  major  changes  rapidly 
whereas  others,  including  those  characterized  by 
well-organized  stakeholders,  adjust,  delay,  or  even 
abandon  health  policies. 
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The  health  sector  comprises  a  major  share  of 
the  economy  in  each  country.  It  employs  many 
workers,  accounts  for  a  considerable  portion  of 
gross  national  product  (GNP),  absorbs  large 
amounts  of  resources  (including  often  unrecorded 
payments  out  of  pocket),  and  generates  vested 
interests.  Since  any  expenditure  is  income  for 
someone  else,  a  “penny  saved”  deprives  someone 
somewhere  of  his  or  her  earnings. 

Health  is  also  influenced  by  public  policies  that 
initially  appear  to  be  unconnected  with  health  care 
or  services.  Civic  insecurity,  political  instability, 
environmental  pollution,  revenue  generation,  and 
economic  regulation — all  affect  changes  in  mor¬ 
bidity  and  mortality,  not  to  mention  the  provision 
of  health  services  per  se.  Health  policy  draws  on 
many  sectors  whose  relevance  is  not  at  first  glance 
self-evident. 

Health  policy  is  of  global  concern  beyond  the 
boundaries  of  the  nation-state.  Efforts  to  pro¬ 
mote  health  cannot  be  restricted  to  a  particular 
country  because  diseases  do  not  confine  them¬ 
selves  within  geographical  borders.  They  are 
transferred  from  one  place  to  another  through 
travel,  migration,  commerce,  and  social 
exchange.  Efforts  to  prevent  disease  and  to  pro¬ 
mote  health  require  cooperation  and  collabora¬ 
tion  among  agencies  at  local,  national,  and 
international  levels. 

Actors 

Structures  and  procedures  of  political  institutions 
have  important  consequences  for  the  adoption  and 
content  of  health  policy.  Enduring  institutions — 
both  governmental  and  nongovernmental — interact 
in  the  policy-making  process.  Policy  actors  are  indi¬ 
viduals  and  groups,  both  formal  and  informal,  that 
seek  to  influence  the  creation  and  implementation 
of  public  policies.  Potentially  such  actors  include 
everyone,  but  degrees  of  activity  over  time  vary. 
Some  potential  players  never  enter  the  game;  others 
dominate  almost  every  stage  of  play.  In  the  health 
sector,  actors  include  branches  of  government, 
governmental  agencies,  commercial  enterprises, 
nongovernmental  institutions,  organized  interest 
groups,  and  professional  associations.  The  role  each 
actor  plays,  in  combination  with  relationships 
among  actors  in  formal  and  informal  settings, 
determines  health  policy  outcomes. 


Interest  groups  are  fundamental  partners  in 
policy  making.  To  the  extent  that  health  policy 
interests  are  shared,  collective  pressure  allows 
greater  policy  influence.  But  the  dynamic  of  inter¬ 
est  groups  is  not  simple.  While  salient  interests  are 
represented  by  groups,  the  strength  of  representa¬ 
tion  is  not  tied  to  the  salience  of  an  issue.  Some 
groups  are  already  vested  due  to  past  history,  and 
salience  itself  may  be  a  consequence  of  interest 
group  action.  The  study  of  health  policy  must  iden¬ 
tify  the  policy  actors  and  their  political  resources. 
Common  resources  include  information  and 
bureaucratic  knowledge,  a  network  of  contacts, 
citizen  support  (including  size  as  well  as  strategic 
location  of  constituency),  an  ability  to  make  politi¬ 
cal  contributions,  and  an  ability  to  mount  a  public 
relations  campaign.  While  few  groups  ever  use  all 
such  resources  at  all  times,  the  profile  of  an  orga¬ 
nized  group’s  ability  to  make  use  of  some  combi¬ 
nation  of  them  is  critical  for  policy  influence. 

In  addition  to  organized  interests,  political  lega¬ 
cies  matter  as  do  key  political  processes.  For 
example,  the  United  States  developed  a  health  care 
system  that  relies  on  private  financing  and  volun¬ 
tary  hospitals  for  the  provision  of  health  services. 
Public  sector  responsibility  is  confined  to  targeted 
financing  for  the  poor,  the  elderly,  and  a  limited 
public  hospital  network.  Subsequent  efforts  to 
reform  the  American  health  system  face  increas¬ 
ingly  vested  interests  and  widely  accepted  routines 
of  private  health  care. 

Significantly  for  health  policy,  the  American 
system  of  government  is  built  on  an  enduring 
Madisonian  system  of  checks  and  balances  that 
gives  well-mobilized  interests  the  ability  to  impede 
or  block  policy  initiatives.  The  American  health 
system  has  numerous  groups  powerful  enough  to 
oppose  any  reform  that  might  harm  their  own  eco¬ 
nomic  interests. 

Robert  Alford’s  classic  conceptualization  of 
health  policy  posits  three  categories  of  actors 
engaging  in  “dynamics  without  change”:  (1)  pro¬ 
fessional  monopolists,  (2)  corporate  rationalizers, 
and  (3)  citizen-consumers — the  latter  getting  the 
proverbial  short  end  of  the  stick.  While  institutions 
of  representation,  government,  policy  making,  and 
intergovernmental  relations  may  be  stable,  pro¬ 
cesses  by  which  health  policy  are  shaped  and 
implemented  have  undergone  marked  changes 
during  recent  decades.  These  changes  challenge  the 
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notion  that  health  policy  is  made  by  a  unified  cen¬ 
ter  or  by  coordinated  pillars  at  the  helm  of  repre¬ 
sentative  political  regimes. 

Options 

Health  care  systems  are  characterized  by  country- 
specific  mixes  of  public  and  private  funding,  con¬ 
tracting,  and  modes  of  providing  services.  The  five 
main  sources  of  funding  are  (1)  taxation  (general 
revenues,  earmarked  taxes,  and  tax  expenditure), 
(2)  public  insurance,  (3)  private  insurance,  (4)  direct 
patient  payments  (copayments,  coinsurance,  deduct¬ 
ibles,  and  uninsured  services),  and  (5)  voluntary 
contributions.  For  some  low-income  countries, 
external  aid  is  a  major  source  as  well. 

Among  the  three  basic  contracting  models,  the 
integrated  version  places  funding  and  ownership  of 
services  under  the  same  (public  or  private)  responsi¬ 
bility.  The  best  known  example  is  the  British 
National  Health  Service  (NHS)  that  provides  tax- 
funded  health  care  for  all.  The  second  contracting 
model  allows  governments  or  third-party  payers 
(often  administrative  agencies  for  social  health 
insurance  but  sometimes  private  health  insurers)  to 
negotiate  long-term  contracts  with  health  care  pro¬ 
viders.  The  third  model,  common  in  private  insur¬ 
ance,  is  reimbursement  when  a  patient  pays  the 
provider  and  then  seeks  reimbursement  from  the 
insurance  agency.  Therefore,  the  ownership  and 
management  of  health  services  can  be  public,  pri¬ 
vate  (both  for-profit  and  not-for-profit),  or  a  mix  of 
those.  Moreover,  there  are  country-specific  mixes  of 
formal  and  informal  care,  traditional  and  modern 
medicine,  and  medical  and  related  social  services. 

Combinations  of  those  core  elements — funding, 
contracting  (including  payment  modes),  and 
ownership — determine  the  allocation  of  financial 
risks  and  decision-making  power  among  the  main 
players  in  health  care.  Government  ownership  and 
tax-funded  services  require  strong  government 
influence,  whereas  private  funding  (insurance  and 
direct  patient  payments)  combined  with  legally 
independent  providers  restricts  the  role  of  the  state 
even  though  governments  can — and  often  do — 
impose  rules  to  protect  patients  or  safeguard  the 
quality  of  and  access  to  health  care. 

National  arrangements  for  financing  health  care 
vary.  In  Scandinavia,  Italy,  and  the  United 
Kingdom,  the  major  share  of  health  care  funding  is 


financed  by  general  taxation,  whereas  systems  of 
health  insurance  are  the  major  source  of  funds  in 
Germany,  France,  and  other  continental  countries. 
In  all  countries,  patients  are  expected  to  pay  a  pro¬ 
portion  of  health  care  costs  out  of  their  own  pock¬ 
ets  through  copayments  or  deductibles.  In  most 
cases,  however,  governments  mitigate  the  effects  of 
user  fees  by  exempting  certain  groups  or  by  setting 
annual  limits  on  how  much  families  must  pay. 

Variations  in  funding  and  contracting  models  in 
health  care  can  be  traced  to  country-specific  his¬ 
torical  developments,  but  two  events  in  Europe 
played  a  crucial  role  as  models  for  policy.  The  first 
was  the  introduction  of  mandatory  social  health 
insurance  for  industrial  workers  and  their  families 
in  Germany  in  1883.  Several  other  countries  in 
Europe  followed  the  German  example  of  state- 
sponsored  (but  not  state-administered)  mandatory 
social  insurance  to  protect  the  family  income  of 
industrial  workers  against  the  risks  of  illness,  dis¬ 
ability,  unemployment,  and  old  age.  Mandatory 
membership  enforced  by  social  insurance  meant 
that  the  so-called  sickness  funds  had  stable  revenue 
streams  and  could  create  wider  pools  of  shared 
risk.  In  the  past  decades,  these  nongovernmental 
funds  became  core  actors  in  the  public  policy  arena 
by  sharing  the  responsibility  for  social  policy  mak¬ 
ing  but  under  ever-greater  government  regulation. 

The  second  major  innovation  in  the  funding  of 
health  care  was  the  establishment  of  Britain’s  NHS 
in  1948.  The  NHS  extended  the  German  insurance 
model  by  providing  coverage  to  the  entire  popula¬ 
tion  with  costs  paid  out  of  general  taxation. 
Although  hospitals  in  the  United  Kingdom  were 
nationalized,  family  physicians  remained  indepen¬ 
dent  as  practitioners.  Postwar  reconstruction  in 
Europe  was  characterized  by  popular  support  for 
the  expansion  of  state-sponsored  schemes.  Some 
countries  followed  the  German  example  of  employ¬ 
ment-based  schemes;  others  preferred  the  popula¬ 
tion-based  NHS  model. 

The  spread  of  the  two  models  was  not  restricted 
to  Europe.  Nations  throughout  the  world  sought 
to  implement  policies  to  protect  the  incomes  of 
their  populations  (or  population  groups)  against 
the  financial  risks  of  illness,  disability,  and  old  age. 
By  the  end  of  the  20th  century,  funding  for  health 
care  in  most  countries  had  become  hybridized  by 
adopting  elements  from  both  the  British  and  Ger¬ 
man  models.  Employment-based  arrangements  for 
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certain  categories  of  workers  were  combined  with 
population-wide  and  tax-based  universal  schemes. 

Reforms 

The  1970s  saw  a  shift  from  expansion  and  popular 
support  for  welfare  state  arrangements  to  reassess¬ 
ment  and  retrenchment.  Economic,  demographic, 
and  ideological  factors  contributed  to  the  reshap¬ 
ing  of  the  popular  notion  of  the  welfare  state  as  a 
solution  for  social  problems  to  that  of  an  economic 
burden  and  a  cause  of  economic  stagnation.  After 
the  oil  crisis,  economic  stagflation  with  persistently 
high  levels  of  unemployment  meant  that  state  rev¬ 
enues  stagnated  or  declined  while  public  expendi¬ 
tures  continued  to  grow.  Moreover,  as  the  end  of 
the  postwar  baby  boom  became  visible,  demogra¬ 
phers  revised  earlier  demographic  projections 
downward  and  future  pension  outlays  upward. 

Ideological  views  about  the  role  of  the  state  also 
changed.  On  both  the  left  and  right  of  the  political 
spectrum,  critics  agreed  that  state  powers  had 
become  too  intrusive  in  the  lives  of  individuals. 
Discontent  over  fiscal  burdens  and  disappointing 
results  of  public  programs,  rising  consumerism, 
and  patient  advocacy  groups  claiming  a  stronger 
say  in  the  allocation  and  organization  of  health 
care — all  challenged  existing  arrangements  for 
providing  welfare.  Governments  sought  alternative 
models  of  governance  to  reduce  the  dominant  role 
of  the  state  and  decentralize  decision  making,  with 
more  space  for  choice  and  entrepreneurial  ideas. 
Some  countries  introduced  market  competition  in 
health  care  by  reducing  state  control  over  the  fund¬ 
ing  and  planning  of  health  care  services.  They  also 
sought  to  broaden  patients’  choice  of  provider  and 
health  plan.  Other  countries  turned  to  traditional 
tools  of  controlling  public  expenditure  by  setting 
strict  budgets,  reducing  the  scope  of  public  insur¬ 
ance,  and  increasing  direct  patient  payments. 

During  the  past  few  decades,  attempts  to 
change  health  policy  have  been  stimulated  by 
economic  recession  and  by  severe  fiscal  problems 
in  the  state  treasury  rather  than  by  an  ideologi¬ 
cally  driven  taste  for  reform,  although  the  2010 
health  reform  bill  in  the  United  States  was 
intended  to  address  not  only  economic  concerns 
but  also  issues  of  social  justice.  Declining  govern¬ 
ment  budgets  have  adversely  affected  service 
delivery,  even  in  countries  that  previously  had 


reasonably  well-performing  systems  for  the  public 
delivery  of  health  services.  Pressures  for  changes  in 
health  policy  often  emanate  from  central  ministries 
such  as  finance  and  planning.  In  many  cases,  the 
ministries  of  health  struggle  to  reinterpret  and  to 
respond  to  policy  directives  outside  their  control. 

Economic  realities  of  recession  and  fiscal  crises 
affect  not  only  the  types  of  policies  that  are  imple¬ 
mented  but  also  reactions  to  them  by  the  users, 
beneficiaries,  and  citizens.  The  stage  of  raising 
revenue  through  the  introduction  of  user  fees  to 
supplement  government  budgetary  resources  was 
critical  for  many  governments  because  of  the 
endemic  economic  crisis.  But  the  success  of  the 
policy,  however  logical  in  theory,  was  constrained 
by  the  dwindling  capacity  of  citizens  to  pay  for 
health  care.  Furthermore,  the  administrative  cost 
of  collecting  user  fees  and  of  monitoring  exempted 
categories  of  users  often  exceeds  the  revenue  col¬ 
lected.  Although  well  intended,  the  initial  policy 
had  not  considered  inevitable  transaction  costs. 

For  several  decades,  health  policy  reforms  have 
been  premised  on  the  assumption  that  improving 
the  ability  of  government  to  manage  its  business 
will  lead  to  improved  social  and  economic  prog¬ 
ress.  The  first  generation  of  reforms  sought  to  cut 
public  expenditures  and  to  revive  the  private  sec¬ 
tor.  Measures  included  budget  cuts,  tax  reforms, 
limited  privatization,  liberalization  of  prices,  and, 
most  conspicuously,  efforts  to  downsize  the  public 
sector.  The  latter  was  invariably  described  as 
bloated  and  in  need  of  surgery  followed  by  a 
strictly  enforced  diet. 

When  it  became  evident  that  the  transformation 
of  government  would  require  a  long  time  and  that 
the  savings  from  reduced  bureaucratic  costs  would 
be  insufficient  to  provide  even  basic  levels  of  pub¬ 
lic  services,  a  second  generation  of  policy  reforms 
sought  to  improve  the  efficiency  and  effectiveness 
of  government.  While  the  first-generation  reforms 
stressed  downsizing,  contracting,  and  improved 
control  over  budgeting  and  public  expenditures, 
the  second  generation  advocated  decentralization 
to  subnational  levels,  creation  of  semiautonomous 
agencies  in  the  central  government,  and  reforms  of 
human  resource  management  (recruitment,  selec¬ 
tion,  and  training). 

Recently,  the  agenda  of  health  policy  refocused 
yet  again  as  a  third  generation  of  reforms  seeks  to 
improve  social  outcomes  through  better  service 


1066  Hegel,  Georg  Wilhelm  Friedrich 


delivery.  This  strategy  emphasizes  sectorwide 
approaches,  particularly  in  health  and  education,  to 
produce  a  coherent  program  for  delivery  of  services 
that  involves  both  governmental  and  nongovern¬ 
mental  organizations.  While  these  generations  of 
policy  reforms  are  overlapping  rather  than  strictly 
sequential,  all  have  been  driven  by  a  combination 
of  external  and  internal  agencies.  Multilateral  and 
bilateral  aid  entails  conditionalities  that  require  a 
(commitment  to)  change  in  governmental  behavior 
before  money  can  be  transferred.  In  turn,  national 
planning  commissions  and  ministries  of  finance 
require  line  agencies  to  adopt  reforms  that  may 
include  a  combination  of  these  generations. 

Policy  reforms  range  across  a  repertoire  of 
instruments:  streamlined  budgets,  staff  reductions, 
raised  tariffs,  contracting  out,  and  other  forms  of 
privatization.  Reform  of  the  health  sector  has 
focused  on  four  main  options,  none  of  which  is 
mutually  exclusive  and  all  of  which  may  occur  at 
the  same  time.  These  are  the  establishment  of 
autonomous  organizations,  introduction  of  user 
fees,  contracting  out  of  services,  and  the  enable¬ 
ment  and  regulation  of  the  private  sector. 

While  these  issues  characterize  features  of  health 
care  systems  and  health  policy,  they  do  not  explain 
the  causes  or  effects  of  policy  change.  To  under¬ 
stand  why  countries  embark  on  particular  reform 
paths,  not  only  external  and  internal  pressures  for 
change  must  be  investigated  but  also  structural 
features  of  social  policy  making  that  enable  politi¬ 
cians  and  policy  entrepreneurs  to  change  the  sys¬ 
tem  despite  the  fact  that  institutional  legacies  and 
popular  support  for  existing  policy  arrangements 
create  barriers  to  change. 

James  Warner  Bjorkman 
Institute  of  Social  Studies 
The  Hague,  Netherlands 
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Hegel,  Georg  Wilhelm 
Friedrich  (1770-1831) 


Georg  Wilhelm  Friedrich  Hegel  was  born  in 
Stuttgart,  Germany,  on  August  27,  1770.  His 
political  writings,  including  those  on  political  phi¬ 
losophy,  owed  much  to  his  reactions  to  the  frag¬ 
mentary  nature  of  the  German  territories  within 
the  Holy  Roman  Empire  and  Napoleon’s  attempted 
forced  unification  of  them  under  a  common  civil 
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code,  between  the  early  1790s  and  the  Battle  of 
Waterloo  in  1815.  Hegel’s  trenchant  critique  of 
the  abstract  universalism  of  the  French  revolution¬ 
aries  in  his  Phenomenology  of  Spirit  (1807)  did 
not  prevent  him  from  celebrating  every  Bastille 
Day  until  his  death  on  November  14,  1831.  His 
alternative  was  a  form  of  civic  nationalism  based 
on  shared  traditions  and  cultural  attachments,  an 
ideal  that  he  championed  against  the  ethnic  nation¬ 
alism  espoused  at  the  time  by  many  German 
romantics.  In  itself,  the  fact  that  Hegel  allied  him¬ 
self  so  clearly  with  the  civic  nationalists  in  this 
particular  dispute  should  do  much  to  unsettle  the 
received  picture  of  him  as  an  apologist  for  Prussian 
authoritarianism  and  a  forerunner  of  Nazism. 

Today,  Hegel  is  notorious  for  his  claim,  made 
most  famously  in  the  Philosophy  of  History 
(1831),  that  history  is  the  gradual  revelation  of 
Geist  (translated  as  “Spirit”  or  “Mind”  and  syn¬ 
onymous  with  God’s  temporal  self-manifestation) 
in  the  world,  a  dialectical  process  that  critics  allege 
he  believes  occurs  without  the  knowledge  of  indi¬ 
vidual  human  agents  and  largely  through  the  bru¬ 
tal  intervention  of  quasi-mystical  “world-historical 
individuals.”  The  meaning  and  significance  of  such 
claims  are  easily  misunderstood  and,  in  certain 
cases,  apparently  deliberately  distorted.  Hegel 
approaches  the  historical  record  as  a  philosopher, 
and  in  this  role,  he  seeks  to  discover  the  rationality 
underlying  the  shifting  events  of  the  past.  He 
explicitly  distinguishes  this  “philosophical  his¬ 
tory”  from  the  other  forms  of  history,  including 
those  undertaken  by  the  early  chroniclers  and 
those  who  would  now  be  seen  as  historians,  both 
of  whom  he  acknowledges  have  legitimate  func¬ 
tions  to  perform  in  providing  an  understanding  of 
the  past.  The  philosopher’s  primary  task,  however, 
is  to  try  to  arrive  at  knowledge  of  the  fundamental 
processes  at  work  in  the  world.  According  to 
Hegel,  knowledge  can  only  be  of  a  rational  system 
of  related  concepts  and  propositions;  hence, 
knowledge  of  the  fundamental  structural  processes 
of  human  action  must  presuppose  the  ultimate 
rationality  of  the  world.  It  is  partly  because  he 
holds  this  view  that  Hegel  claims,  most  notably  in 
the  Preface  to  Elements  of  the  Philosophy  of  Right 
(1821),  that  “what  is  rational  is  actual;  and  what 
is  actual  is  rational.”  Knowledge  is  of  reality  and, 
Hegel  argues,  everyone  presupposes  reality  to  be 
structured  by  a  coherent  system  of  fundamental 


principles  or  laws.  To  “comprehend”  something  is 
to  understand  that  particular  thing  as  being  under¬ 
pinned  in  particular  ways  by  such  a  coherent  sys¬ 
tem  of  laws.  A  philosophical  history  attempts  to 
discover  those  laws  as  they  have  been  revealed 
through  past  actions  and  events. 

This  presupposition  informs  every  branch  of 
Hegel’s  philosophical  system.  Hence,  in  the  pref¬ 
ace  to  Elements  of  the  Philosophy  of  Right,  he 
states  that  the  fundamental  task  of  his  political 
philosophy  is  “to  comprehend  and  portray  the 
state  [StaatJ  as  an  inherently  rational  entity ”  [ital¬ 
ics  added].  It  is  this  project  that  has  led  a  great 
many  critics  to  allege  that  Hegel  is  simply  an 
apologist  for  the  Prussian  state.  In  reality,  Hegel  is 
seeking  to  discover  the  ideal  type  of  the  Staat  that 
is  implicit  within  the  self-image  of  the  world’s 
most  developed  empirical  politically  organized 
communities.  This  project  necessitates  an  attempt 
to  rationalize  certain  negative  features  of  the 
world,  not  least  war.  Nevertheless,  the  project  has 
a  profoundly  critical  thrust,  in  that  some  elements 
of  empirically  observable  states  do  not  cohere  with 
the  majority  of  the  other  elements  of  those  states. 
In  this  sense,  these  wayward  features  of  empirical 
states  cannot  be  included  within  a  conception  of 
the  state  as  an  ideal  type.  The  irrational  elements 
must  be  excluded,  which  in  practice  means  that  the 
incongruent  features  of  the  world  should  be 
reformed,  a  process  that  may  require  radical 
changes  to  existent  states.  Hegel’s  most  significant 
analysis  of  an  incoherent  social  practice  concerns 
slavery,  in  Phenomenology  of  Spirit.  While  the 
master  treats  the  slave  as  a  thing  that  can  be 
owned  and  used,  in  giving  him  orders  he  treats  him 
also  as  a  being  with  capacities  for  cognition  and 
practical  reason — that  is,  as  a  person.  More  than 
this,  the  master  seeks  to  gain  a  sense  of  his  own 
power  and  status  by  dominating  the  slave.  Yet  he 
can  never  be  satisfied  with  recognition  from  a 
lesser  being.  Similarly,  the  slave  is  denied  the  rec¬ 
ognition  that  he  seeks  innately  from  his  master. 
Hence,  the  master-slave  relationship  is  self-contra¬ 
dictory  and  unsatisfying  for  both  parties;  the  situ¬ 
ation  tends  to  drive  each  to  destroy  the  other  or  to 
try  to  reconstruct  their  relationship  on  a  more 
egalitarian  footing.  This  struggle  for  recognition  is 
the  primary  motor  of  human  progress. 

Hegel’s  project  would  have  profound  implica¬ 
tions  even  if  he  had  restricted  his  conception  of  the 
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proper  subject  matter  of  political  philosophy  (the 
Staat)  to  the  state  as  it  is  understood  by  many 
contemporary  political  scientists  and  philoso¬ 
phers:  the  formal  legal-political  organs,  including 
the  legislature,  executive,  judiciary,  civil  service, 
police,  army,  and  so  on.  In  fact,  Hegel  adopts  a 
wider  conception  of  the  Staat  understood  as  an 
order  that  encompasses  conventional  sociopoliti¬ 
cal,  economic,  and  legal  practices  and  institutions 
in  such  a  manner  that,  in  essence,  it  represents  a 
thoroughly  modernized  ideal  of  the  ancient  Greek 
polis.  Elements  of  the  Philosophy  of  Right  analy¬ 
ses  the  relationships  within  the  ideal  type  of  the 
modern  state  through  the  interactions  of  its  three 
“moments”:  (1)  the  individual  will,  (2)  Moralitdt 
(translates  very  loosely  as  “morality”),  and 
(3)  Sittlichkeit  (usually  translated  as  the  “ethical 
life”).  In  Hegel’s  view,  the  individual  will  is  free 
when  the  agent  acknowledges  the  determinate 
action  that  results  from  that  will  as  an  expression 
of  a  law  that  he  (the  individual)  has  set  for  him¬ 
self.  (Hegel  is  concerned  with  men,  not  women, 
as  commented  on  below.)  Institutions  and  prac¬ 
tices  are  justified  to  the  extent  that  they  facilitate 
such  freedom.  Hegel  explores  this  claim  through 
his  discussion  of  the  individual  right  to  private 
property,  which  he  analyses  as  the  means  for  the 
facilitation  of  the  determinate  expression  of  the 
individual’s  will  in  the  world.  This  discussion 
leads  him  to  an  analysis  of  contract  as  a  moment 
of  the  interaction  of  the  wills  of  different  persons. 
This  leads  him  to  the  discussion  of  wrong  action 
and  the  first  stage  of  his  analysis  of  punishment 
as  the  annulment  of  crime.  Hegel  conceives  this 
“cancellation”  of  wrong  action  as  the  reharmoni¬ 
zation  of  the  individual’s  will  with  itself  and  with 
the  social  practices  and  institutions  that  provide 
the  will  with  its  initial  content  (its  socialization) 
and  background.  Underlying  Hegel’s  theory  is  a 
proposition  that  recurs  throughout  his  writings: 
The  reality  of  an  object — in  this  case,  the  deter¬ 
minate  individual  will — is  constituted  by  the 
coherent  principles  underlying  it.  If  existent 
states  were  to  embody  their  ideal  type,  then  the 
criminal  act  would  represent  of  necessity  an 
internal  contradiction  within  the  individual. 
Punishment  enables  the  individual’s  rational  will 
to  reassert  itself  in  practice,  by  communicating 
the  Staafs  disapproval  thereby  “reminding”  the 
individual  of  the  nature  and  legitimacy  of  the 


presupposed  meanings  and  values  that  form  the 
basis  of  his  rational  will. 

Hegel  titles  this  part  of  Elements  of  the  Phil¬ 
osophy  of  Right  as  “Abstract  Right.”  He  argues 
that  the  lack  of  determinacy  in  such  a  (logical 
rather  than  temporal)  “moment”  is  overcome 
through  Moralitdt.  Yet  even  this  second  “moment” 
is  limited  because  it  lacks  an  external  referent, 
being  concerned  solely  as  it  is  with  the  individual’s 
internal  moral  life,  the  qualities  of  his  particular 
character,  virtue,  and  conscience.  The  individual 
can  overcome  these  limitations  only  by  being  an 
active  participant  in  the  Sittlichkeit  (“ethical  life”) 
of  his  community.  The  first  facet  of  ethical  life  in 
which  the  individual  participates  is  the  family.  This 
is  the  realm  of  love  and  hence  the  unreflective  hon¬ 
oring  of  one’s  duties  to  the  members  of  one’s  fam¬ 
ily.  In  essence,  in  the  family  the  individual  acts 
without  asking  why  or  what  is  required  of  him;  in 
this  sense,  Hegel  calls  the  family  the  realm  of 
“immediacy.”  This  realm  is  limited,  however,  in 
that  the  person  is  treated  as  a  facet  of  a  group, 
rather  than  as  an  individual  with  his  or  her  own 
desires,  beliefs,  values,  plans,  and  will.  The  person 
moves  closer  to  expressing  his  or  her  individuality 
through  participation  in  civil  society.  In  Hegel’s 
view,  this  nonpolitical  public  realm  encompasses 
the  economic  system,  voluntary  associations,  and 
the  like.  The  fruits  of  private  property  are  realized 
here  in  the  “system  of  needs,”  partly  through 
employment  and  self-expression  through  work.  (Yet 
Hegel  appreciates  that  these  fruits  are  associated 
with  unemployment  and  poverty.)  In  civil  society  as 
a  whole,  individuals  face  each  other  as  self-inter¬ 
ested  agents,  bound  together  by  contracts  that  they 
have  entered  into  self-consciously  and  deliberately. 
In  this  sense,  it  is  a  realm  of  mediated  action. 

In  spite  of  any  appearance  of  pure  self- 
determination,  the  individual’s  actions  in  civil  soci¬ 
ety  are  necessarily  conceived  and  executed  within  a 
shared  framework  of  common  modes  of  action, 
norms,  conventions,  and,  ultimately,  civil  laws. 
These  underlying  structures  are  maintained  through 
the  action  of  the  state,  conceived  as  living  interac¬ 
tions  of  citizens  in  the  context  of  the  formalized 
legal  and  political  organs.  To  the  extent  that  the 
state  embodies  its  ideal,  it  enjoys  the  self-conscious 
loyalty  of  its  citizens.  This  is  true  patriotism:  the 
self-conscious  emotional  identification  of  individ¬ 
ual  citizens  with  the  community  as  the  latter  is 
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embodied  in  the  state,  with  this  identification  being 
based  on  an  uncoerced  recognition  that  one’s  own 
highest  good  is  realized  through  respecting  the 
good  of  one’s  Staat.  Elements  of  the  Philosophy  of 
Right  concludes  with  pregnant  discussions  of  the 
tragic  inevitability  of  war  and  various  forms  of 
imperialism,  with  the  latter  in  particular  anticipat¬ 
ing  many  Marxist  theories.  Finally,  Hegel  argues 
that  progress  is  inherently  tragic,  in  the  sense  that 
human  development  is  driven  by  world-historical 
individuals,  such  as  Napoleon,  and  world-historical 
states  (here,  one  might  think  of  the  contemporary 
United  States),  who  in  driving  the  world  forward, 
inevitably  bring  about  their  own  fall  from  grace. 

Hegel’s  claim  that  the  Staat  should  be  conceived 
organically  has  led  many  to  argue  that  he  sees  indi¬ 
vidual  human  beings  as  mere  adjuncts  of  the  state 
qua  political-legal  institutions  or  as  organs  and 
limbs  to  be  directed  not  by  their  own  individual 
wills  but  by  the  automatic  functioning  of  the  state 
as  some  form  of  collective  whole.  On  this  view, 
Hegel  held  the  state  to  be  the  determining  agent  and 
the  individual  to  be  valuable  only  to  the  extent  that 
he  or  she  served  the  state’s  interests  and  will.  This 
unappealing  picture  is  crowned  literally  by  Hegel’s 
retention  of  the  monarch  as  the  embodiment  of  the 
state,  exercising  the  sole  function  of  signing  bills 
into  law.  In  reality,  Hegel  holds  that  individuals  in 
a  well-ordered  state  would  be  active,  self-conscious, 
and  critical  patriots;  organized  through  various  civil 
society  associations;  and  represented  politically 
through  their  membership  of  corporations  that  are 
integrated  formally  into  the  state  apparatus.  To  the 
extent  that  they  fail  to  be  so  in  practice,  the  particu¬ 
lar  state  fails  to  live  up  to  its  ideal.  Throughout,  the 
guiding  principle  of  Hegel’s  political  philosophy  is 
“to  be  a  person,  and  respect  others  as  persons” 
(. Philosophy  of  Right,  Part  1,  sec.  36). 

Certainly,  problems  remain.  Even  though  Hegel 
describes  polygamy  as  slavery  for  women,  he 
excludes  women  from  full  citizenship,  claiming 
that  they  are  incapable  of  freedom  in  that  they  are 
guided  by  emotion  rather  than  reason.  Other  crit¬ 
ics  have  baulked  at  his  corporatism  and  retention 
of  the  monarch,  even  if  in  only  a  largely  ceremo¬ 
nial  role.  Hegel  himself  acknowledges  different 
contradictions  within  his  rationalization  of  the 
world:  Free  economic  activity  seems  to  result 
inevitably  in  poverty  for  some  respectable  work¬ 
ers,  and  self-interested  states  remain  the  primary 


actors  in  an  international  system  thereby  destined 
for  conflict  and  war.  Despite  the  claims  of  both 
friends,  such  as  Alexandre  Kojeve,  and  critics,  such 
as  Karl  Popper,  Hegel  never  claimed  that  the  world 
that  we  have  now  is  perfect.  Rather  than  believing 
that  we  have  reached  the  “End  of  History,”  Hegel 
insists  that  philosophical  insight  always  comes  too 
late:  The  philosopher  can  only  rationalize  a  stage 
of  the  world  when  it  is  in  its  death  throes. 
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Hegemony 


Hegemony  derives  from  the  Greek  term  hegemonia 
(fiyepoma),  referring  to  the  predominance  of  one  of 
the  city  states  (e.g.,  Sparta  or  Athens)  over  the  oth¬ 
ers  and  its  related  leadership  in  issues  of  common 
interest.  In  modern  times,  it  has  come  to  be  used 
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in  reference  more  generally  to  the  predominance 
of  one  actor  over  others.  In  contemporary  politi¬ 
cal  science,  we  can  distinguish  two  traditions  in 
the  use  of  the  term  hegemony.  In  international 
relations  theory,  hegemony  has  been  used  more  or 
less  directly  in  reference  to  the  original  Greek 
meaning,  where  it  refers  to  the  predominance  (pri¬ 
marily  in  terms  of  economic  and  military  power) 
of  one  state  over  others.  Alternatively,  in  Marxist- 
inspired  political  science,  hegemony  refers  to  the 
predominance  (especially  in  an  ideological  sense) 
of  one  social  group  over  others.  Finally,  Robert 
Cox  has  established  an  approach  in  critical  inter¬ 
national  relations  theory  in  which  the  differences 
between  these  two  traditions  are  transcended. 

Hegemony  in  International  Relations  Theory 

In  international  relations  theory,  the  concept  of 
hegemony  has  been  applied  to  the  phenomenon  of 
one  state’s  being  strong  enough  to  maintain  an 
international  order  that  is  beneficial  to  (most) 
other  states  as  well.  Two  main  traditions  can  be 
distinguished  here,  the  neorealist  theory  of  hege¬ 
monic  stability  and  the  theory  of  historical  cycles 
of  global  hegemony  and  leadership. 

Hegemonic  Stability 

The  theory  of  hegemonic  stability  (developed  in 
the  1970s  by  Charles  Kindleberger  and  subse¬ 
quently  subscribed  to  by  many  neorealist  authors) 
argues  that  the  interstate  system  will  be  relatively 
peaceful  and  stable  if  there  is  a  hegemonic  state 
that  provides  certain  public  goods  to  the  system 
(most  basically  prosperity  and  security).  A  state 
will  be  a  hegemon  if  it  has  both  the  capability  and 
the  will  to  perform  this  function.  The  capability  of 
a  state  depends  on  the  (relative)  size  and  level  of 
development  of  its  economy,  its  ability  to  domi¬ 
nate  certain  key  technological  sectors,  and  its 
political  and  military  power.  During  much  of  the 
19th  century,  Great  Britain  dominated  the  world 
economically,  politically,  and  militarily.  Through 
its  role  as  a  stabilizer  in  the  European  balance  of 
power  and  through  its  management  of  the  gold 
standard,  it  was  able  to  secure  a  measure  of  stabil¬ 
ity  in  interstate  relations,  which  was  beneficial  to 
the  growth  of  a  liberal  world  economy.  Similarly, 
after  1945  the  United  States  became  the  hegemon: 


It  emerged  out  of  the  war  as  by  far  the  strongest 
military  and  economic  power,  and  it  used  its 
power  to  create  the  institutional  framework  (the 
Bretton  Woods  system),  which  facilitated  the 
unprecedented  expansion  of  the  world  economy  in 
the  following  decades. 

When  the  Bretton  Woods  system  collapsed  in 
the  early  1970s  and  the  hegemony  of  the  United 
States  weakened,  hegemonic  stability  theory  pre¬ 
dicted  the  erosion  of  the  liberal  regime  established 
by  the  hegemon  as  well.  However,  no  such  devel¬ 
opment  took  place.  This  gave  rise  to  an  alternative 
interpretation.  In  his  After  Hegemony  (1984), 
Robert  Keohane  argued  that  collaboration  between 
states  needs  to  be  based  not  only  on  the  exercise  of 
hegemony  but  also  on  the  recognition  by  states 
that  voluntary  cooperation  can  be  beneficial  to 
them.  Crucial  to  this  approach  is  the  concept  of 
complex  interdependence,  which  explains  why 
states  engage  in  cooperation  through  the  creation 
and  reproduction  of  international  regimes. 

World  Systems  Theory 

Very  similar  conceptions  of  hegemony  have  been 
developed  by  scholars  who  started  theorizing  inter¬ 
state  hegemony  from  adjoining  disciplines  such  as 
historical  sociology  (e.g.,  Immanuel  Wallerstein)  or 
political  geography  (e.g.,  Giovanni  Arrighi  and 
Peter  Taylor).  Wallerstein  (1980),  the  intellectual 
founder  of  world  systems  theory,  defines  hegemony 
as  that  “short  moment  in  time  when  a  given  core 
power  can  manifest  simultaneously  productive, 
commercial  and  financial  superiority  over  all  other 
core  powers”  (pp.  38-39).  According  to  Wallerstein, 
there  have  been  three  instances  of  hegemony  in  the 
history  of  the  capitalist  world  economy:  (1)  the 
United  Provinces  (1620-1672),  (2)  England  (1815— 
1873),  and  (3)  the  United  States  of  America  (1945- 
1967).  All  these  powers  displayed  some  common 
characteristics.  First,  they  initially  achieved  a  rela¬ 
tive  advantage  in  agro-industrial  production,  then 
in  commerce,  and  finally  in  finance.  Second,  hege¬ 
monic  powers  tended  to  be  advocates  of  global 
“liberalism”  in  economic  and  political  terms. 
Finally,  in  all  three  cases  military  power  rested 
primarily  on  naval  capabilities.  The  conditions  that 
determine  the  rise  of  a  particular  country  to  hege¬ 
mony  in  the  system  are  manifold:  Geographic  loca¬ 
tion  and  size,  the  strength  of  the  state  apparatus, 
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the  availability  of  new  technologies  capable  of  giv¬ 
ing  the  country  in  question  a  competitive  advan¬ 
tage  over  other  core  powers,  the  availability  of 
sufficient  investment  capital  and  human  capital 
(skilled  labor),  and  a  productive  agricultural  sector 
can  all  contribute  to  the  attainment  of  hegemony. 

Some  authors  with  intellectual  roots  in  interna¬ 
tional  relations,  such  as  George  Modelski  and 
William  R.  Thompson,  have  taken  a  historical 
view  and  arrived  at  insights  very  similar  to  those  of 
world  systems  theory  (although  without  using  the 
word  hegemony).  Modelski’s  “long  cycle  of  world 
leadership”  consists  of  four  phases:  the  phase  of 
(1)  global  war  among  major  powers  in  the  system, 
resulting  in  the  emergence  of  a  (2)  world  power 
assuming  the  leadership  role;  after  a  period  of  2  to 
3  decades,  the  world  power’s  leadership  starts  to 
wane  in  the  phase  of  (3)  delegitimation  so  that 
eventually  a  strong  challenger  rises  in  the  phase  of 
(4)  deconcentration,  whose  challenge  results  in  a 
new  global  war.  According  to  Modelski,  there 
have  been  five  such  cycles  in  the  history  of  the 
world  system:  the  Portuguese  cycle  (1494—15  80), 
the  Dutch  cycle  (1581— 1688),  the  first  (1689- 
1791)  and  second  (1792-1913)  British  cycles,  and 
finally  the  American  cycle  (1914-present).  Decisive 
for  the  attainment  of  world  leadership  is  the 
“capacity  for  global  reach”:  The  aspiring  power 
must  be  securely  located,  preferably  on  an  ocean; 
it  must  possess  ocean-going  sea  power  (potentially 
more  than  half  of  all  available  sea  power);  it  must 
have  sufficient  financial  resources  to  pay  for  its 
navy;  and  it  must  have  a  political  structure  that 
can  lend  coherence  to  a  global  enterprise.  The 
principal  challengers  have  been  land  powers:  The 
French  challenged  the  Dutch  leadership  only  to 
find  the  English  taking  over  the  lead,  and  similarly 
the  French  and  German  challenges  eventuated  in 
leadership  by  (again)  Britain  and  the  United  States. 

Ultimately,  these  different  approaches  to  hege¬ 
mony  are  in  agreement  on  one  fundamental  point: 
They  all  conceptualize  hegemony  as  a  relationship 
between  states.  This  contrasts  sharply  with  the 
original  definition  of  hegemony  in  Marxist  thought. 

Hegemony  in  Marxist  Political  Theory 

Hegemony  in  the  Marxist  tradition  is  usually  asso¬ 
ciated  with  the  name  of  Antonio  Gramsci,  the 
leader  of  the  Italian  Communist  Party  between 


1924  and  1926,  when  he  was  arrested  by  the 
Fascist  regime.  In  fact,  the  term  was  already  used 
extensively  by  various  Russian  socialist  leaders  in 
the  decades  leading  up  to  the  Bolshevik  Revolution 
of  1917.  In  this  early  debate,  hegemony  referred  to 
the  leading  role  that  the  proletariat  might  play  in 
the  revolutionary  struggle  against  the  feudal  order, 
provided  it  could  transcend  its  internal  divisions 
and  develop  a  coherent  ideological  project.  Gramsci 
was  concerned  to  rethink  political  strategy  in  light 
of  the  very  different  experiences  of  the  Russian  and 
the  West  European  revolutions  of  1917-1919.  In 
Russia,  the  capture  of  state  power  gave  the  revolu¬ 
tionary  forces  a  sufficient  grip  on  society  at  large 
to  implement  their  political  program.  In  the  West 
(as  the  failed  revolutions  in  Germany  and  Italy 
showed),  the  political  power  of  the  ruling  class  did 
not  rest  primarily  on  the  control  of  the  coercive 
apparatus  of  the  state  but  was  diffused  and  situ¬ 
ated  in  the  myriad  institutions  and  relationships  in 
civil  society.  In  reflecting  on  these  historical  expe¬ 
riences,  Gramsci  developed  his  understanding  of 
hegemony  beyond  its  early  forms  by  emphasizing 
the  importance  of  intellectual  and  moral  leader¬ 
ship  and  of  the  hegemonic  group  being  able  to 
bind  other  social  groups  and  by  recognizing  the 
possibility  of  hegemony  being  exercised  by  any 
social  class  (in  particular  the  bourgeoisie). 

No  social  group  can  rule  by  force  alone.  In  any 
class  society,  the  ruling  class,  that  is,  the  social 
group  that  has  the  ultimate  control  over  the  pri¬ 
mary  means  of  production,  organizes,  reproduces, 
and  reinforces  its  position  of  economic  power 
through  a  complex  mix  of  noneconomic  mecha¬ 
nisms.  These  include  ideological  or  religious 
power,  institutional  forms,  and  ultimately  the 
(threat  of  the)  use  of  force.  Under  certain  histori¬ 
cal  conditions,  the  balance  between  force  and 
persuasion  can  move  from  one  end  of  the  contin¬ 
uum  to  the  other  (without  either  ever  completely 
disappearing).  Hegemony,  in  the  classic  Gramscian 
sense,  is  then  defined  as  a  form  of  class  rule  based 
primarily  on  consent  by  the  subordinate  groups 
(produced  by  the  political,  intellectual,  and  moral 
leadership  of  the  hegemonic  group)  rather  than  on 
coercion  (resting  on  the  application  of  physical 
force). 

Ernesto  Laclau  and  Chantal  Mouffe  have  cri¬ 
tiqued  Gramsci’s  thought,  and  in  particular  the 
adoption  of  Gramsci’s  ideas  by  more  recent 
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Marxist  authors,  as  being  marred  by  a  lingering 
materialist  determinism.  They  have  argued  for  a 
radical  reinterpretation  of  Gramsci’s  theory  of 
hegemony  on  the  basis  of  discourse  theory: 
Hegemony  then  becomes  a  strictly  subjective  phe¬ 
nomenon.  Their  intervention  has  led  to  continuing 
debate  in  Marxist  thought  between  those  commit¬ 
ted  to  further  developing  historical  materialism 
and  those  arguing  for  a  constructivist  (or  post¬ 
structuralist)  turn. 

Transnational  Hegemony 

Robert  Cox  became  the  founder  of  an  approach 
that  became  known  as  neo-Gramscianism  or  trans¬ 
national  historical  materialism  in  which  he  brought 
the  earlier  two  traditions  together  by  applying 
Gramsci’s  understanding  of  hegemony  to  the 
analysis  of  international  politics.  He  argued  that  to 
understand  global  politics  we  need  to  abandon  the 
idea  that  is  central  to  the  classic  paradigm  in  inter¬ 
national  relations  theory,  namely,  that  the  state  is 
the  key  unit  of  analysis.  Basing  himself  on  a  mate¬ 
rialist  position,  he  argued  that  it  is  social  relations 
of  production  that  are  key  and  that  therefore  the 
basic  unit  of  analysis  should  be  the  state-society 
complex  rather  than  the  state  conceived  as  a  black 
box.  Cox  enlarged  the  scope  of  his  project  by  mak¬ 
ing  the  crucial  claim  that  Gramsci’s  core  ideas  can 
fruitfully  be  applied  to  the  analysis  of  a  globalizing 
international  capitalism  even  though  Gramsci  him¬ 
self  was  primarily  concerned  with  analyzing  poli¬ 
tics  in  national  contexts. 

Hegemony,  in  Cox’s  understanding,  is  simulta¬ 
neously  a  relationship  between  social  forces  as 
well  as  a  relationship  between  states:  The  interna¬ 
tional  hegemony  of  states  such  as  Great  Britain 
and  the  United  States  is  in  fact  the  outward  pro¬ 
jection  of  the  domestically  grounded  hegemony  of 
a  particular  configuration  of  social  forces. 
American  hegemony  in  the  post-1945  era  was 
grounded  in  the  hegemony  of  the  class  coalition 
(or  in  Gramsci’s  terminology  historic  bloc )  of 
internationally  oriented  transnational  corpora¬ 
tions  and  the  organized  industrial  working  class. 
The  longer-term  historical  dynamics  of  the  global 
system  are  determined  by  the  patterns  of  interac¬ 
tion  of  hegemonic  and  nonhegemonic  state-society 
complexes:  societies  characterized  by  the  predom¬ 
inance  of  consensual  and  coercive  forms  of  class 


rule,  respectively.  The  international  order  can  be 
characterized  as  hegemonic  when  a  specific 
national  configuration  of  class  forces  is  able  to 
project  its  predominant  influence  externally,  thus 
supporting  a  leading  role  for  its  state;  or  it  can  be 
characterized  as  nonhegemonic  (Cox  speaks  of 
“rival  imperialisms”)  when  such  leading  class 
coalitions  remain  “domestic”  because  they  are  too 
weak  to  project  their  influence  outwardly. 

Henk  Overbeek 
Vrije  Universiteit  Amsterdam 
Amsterdam,  Netherlands 
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Hermeneutics 


As  one  might  expect  of  a  concept  and  tradition  of 
thought  that  traverses  vast  expanses  of  history  and 
a  variety  of  disciplines,  the  term  hermeneutics  is 
inconducive  to  short  summary  or  univocal  defini¬ 
tion.  In  the  broadest  of  terms,  hermeneutics  refers 
to  the  theory  and  methodology  of  interpretation, 
that  is,  the  theory  and  methods  of  understanding, 
or  making  sense  of,  an  object  of  investigation.  The 
proper  objects  of  interpretation  are  taken  to  be 
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“texts”  or  “text  analogues”  whose  meanings  are 
unclear,  strange,  puzzling,  confused,  fragmentary, 
incomplete,  or  seemingly  contradictory.  The  pri¬ 
mary  aim  of  hermeneutical  investigation  is  to 
provide  a  reading  or  interpretation  of  the  text  that 
removes  or  mitigates  its  obscurity  or  puzzling 
aspects  by  disclosing  its  underlying  meaning  and 
coherence.  Additionally,  hermeneutical  inquiry 
involves  systematic  reflection  on  questions  about 
the  very  nature  of  interpretation:  What  is  interpre¬ 
tation?  What  makes  one  interpretation  better  or 
more  correct  than  another?  What  is  the  proper 
scope  of  interpretive  inquiry?  What  are  its  social 
and  political  implications?  One’s  stance  on  these 
and  other  questions  will  decisively  shape  one’s 
conception  of  hermeneutics.  This  entry  discusses 
its  history,  various  meanings,  and  possible  appli¬ 
cations  in  political  science. 

History  and  Meanings 

Consistent  with  the  varied  contemporary  under¬ 
standings  of  the  term,  the  etymological  roots  of  the 
word  hermeneutics  are  equally  rich  and  complex. 
The  word  first  appears  in  ancient  Greek  as  the 
verb  hermeneuein  (“to  interpret”)  and  the  noun 
hermeneia  (“interpretation”).  Specifically,  its  ori¬ 
gins  point  to  Hermes,  the  wing-footed  herald  of 
the  gods  who  traveled  between  the  mortal  and 
immortal  worlds,  delivering  messages  from  the  lat¬ 
ter  to  the  former.  As  such,  Hermes  is  associated 
with  activities  that  are  later  regarded  as  central  to 
the  hermeneutical  enterprise,  namely,  the  activities 
of  interpreting  and  illuminating  alien  or  opaque 
systems  of  meaning  (in  Greek,  a  hermeneus  was  an 
interpreter),  of  translating  meanings  from  one 
world  or  language  into  another,  and  of  mediating 
between  different  people  or  worlds.  At  the  same 
time,  Hermes’s  reputation  as  the  discoverer  of 
writing,  music,  and  poetry  points  to  the  strong 
emphasis  that  advocates  of  hermeneutics  place  on 
the  role  of  language  in  human  life.  Coming  to  an 
understanding  of  some  phenomenon,  hermeneuti- 
cists  contend,  inevitably  requires  a  grasp  of  the 
linguistic  practices  and  symbolic  systems  through 
which  the  meaning  of  that  phenomenon  is 
expressed.  Finally,  the  connection  between  herme¬ 
neutics,  interpretation,  and  language  is  apparent  in 
the  title  of  Aristotle’s  work  on  the  relationship 
between  language  and  logic,  Peri  Hermeneias, 


which  was  later  translated  into  Latin  as  De 
Interpretation. 

Although  originating  in  the  classical  world,  the 
concept  of  hermeneutics  is  sharply  altered  during 
the  modern  period.  With  the  advent  of  the 
Protestant  Reformation  and  its  shift  of  responsibil¬ 
ity  for  interpreting  the  Bible  from  the  Church 
Fathers  to  individual  Christians,  hermeneutics, 
especially  in  Germany,  becomes  increasingly  iden¬ 
tified  with  efforts  to  expound  the  proper  principles 
and  methods  of  Biblical  interpretation.  This  notion 
is  broadened  over  time  to  include  methods 
of  philological  and  legal  interpretation  but  none¬ 
theless  remains  the  dominant  understanding 
of  hermeneutics  into  the  19th  century,  when 
the  philosopher  and  theologian  Friedrich 
Schleiermacher  advances  a  view  of  hermeneutics  as 
a  universal  discipline  that  encompasses  all  fields  of 
study  in  which  the  interpretation  of  texts  (in  the 
extended  sense  of  the  term)  is  a  central  concern. 
According  to  Schleiermacher,  universal  hermeneu¬ 
tics  aims  to  articulate  the  general  principles  of  all 
understanding  that  constitute  the  foundation  of 
the  specialized  fields  of  hermeneutical  inquiry. 

Perhaps  the  most  significant  shift  of  meaning 
occurs  in  the  late  19th  century,  when  the  philoso¬ 
pher  Wilhelm  Dilthey  conceives  of  hermeneutics 
as  providing  the  methodological  and  epistemo¬ 
logical  foundation  of  the  Geisteswissenscbaften 
(the  social  sciences  and  humanities).  Contending 
that  the  central  task  of  the  Geisteswissenscbaften 
is  to  understand  lived  experience  in  all  of  its  abun¬ 
dant  richness  and  diversity,  Dilthey  differentiates 
the  mechanistic  and  nomological  forms  of  expla¬ 
nation  characteristic  of  the  natural  sciences  from 
the  types  of  hermeneutical  understanding  that  in 
his  view  constitute  the  methodological  and  episte¬ 
mological  foundations  of  the  human  sciences. 
While  the  former  strive  to  explain  the  behavior  of 
physical  objects  through  the  discovery  of  causal 
laws,  the  proper  aim  of  the  latter,  Dilthey  argues, 
is  to  understand  the  historically  constituted  mental 
life  and  lived  experience  that  is  an  essential  aspect 
of  human  reality. 

Consequently,  the  human  sciences  are  both  irre¬ 
ducible  to  and  partly  independent  of  the  sciences  of 
nature.  Not  only  is  their  subject  matter  ontologi- 
cally  distinct  from  that  of  the  natural  sciences,  but 
this  ontological  difference  yields  a  methodological 
distinction.  One  of  Dilthey’s  central  projects  was  to 
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develop  a  view  of  hermeneutics  as  providing  the 
proper  methods  for  the  study  of  human  existence. 
This  project  is  taken  up  and  variously  reformulated 
by  several  leading  representatives  of  hermeneutical 
thought  in  the  late  19th  and  20th  centuries,  includ¬ 
ing  figures  as  diverse  as  Max  Weber,  Robin  George 
Collingwood,  Hans-Georg  Gadamer,  Martin  Heid¬ 
egger,  Peter  Winch,  Paul  Ricoeur,  Jurgen  Habermas, 
Charles  Taylor,  Clifford  Geertz,  and  Quentin 
Skinner. 

Hermeneutics  and  Political  Science 

Although  Dilthey  is  the  first  writer  to  argue  that 
hermeneutics,  or  interpretation,  provides  a  firm 
foundation  for  the  human  sciences  and  constitutes 
its  central  task,  it  is  only  during  the  later  decades  of 
the  20th  century  that  hermeneutical  perspectives 
become  a  significant  part  of  the  discourse  of 
Anglophone  political  science.  Largely  in  response 
to  the  growing  prominence  of  positivist  epistemolo¬ 
gies,  the  champions  of  hermeneutics  simultane¬ 
ously  criticized  many  of  the  central  claims  of  posi¬ 
tivism  and  outlined  their  own  post-positivist  con¬ 
ceptions  of  political  inquiry.  Importantly,  herme¬ 
neutical  critics  held  that  positivist  orthodoxies  not 
only  sustained  a  model  of  scientific  explanation 
that  was  inappropriate  for  the  social  sciences,  but 
they  also  obfuscated  important  issues  of  power  and 
politics  that  could  only  be  identified  through  care¬ 
ful  attention  to  the  interpretive  dimensions  of 
political  affairs. 

On  the  one  hand,  hermeneuticists  challenged  the 
central  commitments  of  positivism:  that  the  cover¬ 
ing-law  model  of  explanation  constitutes  the  only 
valid  model  of  scientific  explanation;  that  sense  data 
can  be  directly  perceived  and  represented  in  a  neu¬ 
tral,  noninterpretive  observation  language;  that 
empirical  data  can  be  neatly  divorced  from  theoreti¬ 
cal  frameworks;  that  scientific  explanations  are 
necessarily  value-free;  and  that  the  methods  of  the 
mature  natural  sciences  provide  the  proper  model 
for  the  study  of  political  affairs.  Although  articu¬ 
lated  in  various  ways,  the  hermeneutical  critique  of 
positivism  generally  turns  on  an  analysis  of  a  key 
task  of  social  and  political  inquiry  that  has  no  echo 
in  the  physical  sciences,  namely,  the  radically  reflex¬ 
ive  and  doubly  hermeneutical  task  of  interpreting 
interpretations.  Because  human  beings  are  self¬ 
interpreting  animals  whose  individual  and  social 


self-interpretations  constitute  an  essential  part  of 
social  reality,  no  investigation  of  social  or  political 
phenomena  can  neglect  this  task  on  pain  of  distorting 
the  phenomena  it  purports  to  explain  or  understand. 

At  a  minimum,  hermeneuticists  maintain,  social 
and  political  inquiry  must  begin  by  understanding 
agents’  self-understandings  and  self-interpretations. 
It  is  only  on  the  basis  of  such  understanding  that 
one  can,  first,  identify  properly  what  stands  in  need 
of  explanation  and,  second,  combat  the  often  fatal 
tendency  of  social  scientists  and  laypersons  to  proj¬ 
ect  their  own  languages  of  understanding  onto  the 
agents,  cultures,  traditions,  and  practices  they 
study.  More  generally,  because  self-interpretations 
partly  constitute  human  and  social  reality  yet  typi¬ 
cally  change  over  time  and  vary  across  cultures,  the 
conceptual  unity  that  is  a  central  feature  of  the 
covering  law  model  and  a  necessary  condition  of 
precise  prediction  rarely,  if  ever,  obtains  in  social 
science.  In  short,  the  inherent  mutability  and  insta¬ 
bility  of  social  reality  undermines  the  positivist 
imperative  to  develop  a  nomological  and  predictive 
science  of  society  and  politics. 

On  the  other  hand,  hermeneuticists  have  devel¬ 
oped  their  own  accounts  of  the  proper  practices  of 
political  inquiry.  The  central  premise  of  hermeneu¬ 
tical  approaches  is  that  interpretation  takes  place 
within  specific  backgrounds  or  contexts  that  are 
conceived,  for  example,  as  traditions,  webs  of 
meaning  and  belief,  constellations  of  social  rela¬ 
tions,  or  practices  embedded  in  what  Ludwig 
Wittgenstein  calls  a  “form  of  life.”  In  the  act  of 
interpretation,  both  the  interpreter  and  the  text  (or 
text-analog)  are  “situated”  within  one  or  more  of 
these  contexts  or  backgrounds.  The  relationship 
between  them  is  therefore  not  one  of  a  disinter¬ 
ested,  disembodied  observer  in  search  of  objective, 
context-free  knowledge.  Rather,  it  involves  a  dia¬ 
logical  and  fallible  process  of  navigating  within  and 
between  often  disparate  linguistic,  cultural,  histori¬ 
cal,  and  political  contexts  to  better  understand  or 
make  sense  of  the  text  or  texts  under  investigation. 

Most  significantly,  interpretation  involves 
movement  within  what  is  called  the  hermeneutic 
circle.  Hermeneuticists  contend  that  an  adequate 
interpretation  of  the  meaning  or  sense  of  a  text 
requires  an  analysis  that  is  circular  in  a  double 
sense:  first,  any  effort  to  establish  the  meaning  of  a 
text  must  proceed  by  moving  back  and  forth 
between  a  reading  of  particular  parts  of  the  text 
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and  the  text  as  a  whole,  with  the  latter  being  read 
in  light  of  the  former  and  vice  versa;  and,  second, 
both  the  individual  parts  of  the  text  and  the  text  as 
a  whole  must  be  located  within  a  larger  context, 
background,  or  field  of  conventions  and  assump¬ 
tions  from  which  they  derive  their  meaning  and  to 
which  they  contribute.  The  process  is  therefore 
unavoidably  holistic  and  circular — there  is  no 
privileged  standpoint  outside  the  circle  of  mean¬ 
ings  from  which  one  might  establish  the  meaning 
of  a  single  element  within  the  text  or  of  the  text  as 
a  whole.  This,  however,  does  not  imply  that  the 
standards  for  assessing  interpretations  are  subjec¬ 
tive  or  arbitrary.  Because  even  individual  beliefs 
and  attitudes  are  only  intelligible  against  a  back¬ 
ground  of  publicly  available  “intersubjective”  and 
“common”  meanings,  the  latter  provide  a  publicly 
accessible  evidential  basis  for  evaluating  the  ade¬ 
quacy  of  an  interpretation. 

Although  hermeneutical  writers  argue  that 
interpretation  is  an  indispensable  part  of  all  social 
inquiry,  the  deliberate  appropriation  of  hermeneu¬ 
tical  methods  is  more  common  in  some  areas  of 
political  inquiry  than  in  others.  Hermeneutical 
approaches  are  most  frequently  deployed  in  politi¬ 
cal  theory  and  legal  interpretation  and  in  scholar¬ 
ship  that  examines  the  historical  development  and 
transformation  of  ideas,  concepts,  and  institutions. 
They  have  also  gained  considerable  currency  in 
areas  of  comparative  politics  and  international 
relations  where  scholars  explore  issues  of  nation¬ 
hood  and  national  attachments,  the  politics  of 
recognition,  and  a  host  of  problems  involving  cul¬ 
tural,  ethnic,  religious,  and  class  differences  and 
conflicts.  Finally,  they  have  acquired  prominence 
among  a  wide  range  of  scholars — including  schol¬ 
ars  who  use  multiple  or  mixed  methods — who 
investigate  the  multiple  dimensions  of  empower¬ 
ment  and  disempowerment  that  reside  in  language, 
social  practice,  and  the  complex  legacies  of  cul¬ 
tural  and  political  traditions.  These  scholars  have 
found  hermeneutics  to  be  a  valuable  resource  for 
analyzing  how  language  can  conceal  and  distort 
relations  of  power  yet  also  open  new  possibilities 
for  enhanced  understanding,  resistance,  social 
transformation,  and  political  renewal. 

Keith  Topper 
University  of  California,  Irvine 
Irvine,  California,  United  States 
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Hinduism 


Hinduism  is  more  a  civilization  than  a  religion: 
not  only  because  it  is  associated  with  a  full-fledged 
social  system,  the  caste  system,  but  also  because  it 
is  closely  linked  with  an  ancient  culture  spread 
over  a  clear-cut  territory  (today’s  India)  and 
endowed  with  a  language  (Sanskrit).  As  a  civiliza¬ 
tion,  Hinduism  does  not  display  much  cohesion 
and  homogeneity.  At  first  sight,  it  seems  to  possess 
the  classic  duality  between  what  Robert  Redfield 
calls  a  “great  tradition”  and  a  “little  tradition,” 
which  one  may  call  “popular  Hinduism,”  com¬ 
posed  of  local  practices.  According  to  Shyama 
Charan  Dube,  however,  this  schema  is  compli¬ 
cated  in  the  Hindu  case  by  the  lack  of  unity  in  the 
great  tradition  itself. 

A  Religion  of  “Unity  in  Diversity” 

Most  Hindus  traditionally  share  a  common  belief 
in  reincarnation,  consider  the  seven  sacred  cities 
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(Ayodhya,  Mathura,  Haridwar,  Benares, 
Kanchipuram,  Ujjain,  and  Dwarka)  as  pilgrimage 
centers,  worship  god  according  to  similar  rituals  in 
their  respective  temples  ( pujas ),  and  celebrate  the 
same  festivals,  including  the  Kumbh  Mela,  for 
which  people  gather  together  in  millions  every  9  or 
12  years.  Yet  Hinduism  does  not  have  an  ortho¬ 
doxy  enshrined  in  one  book  and  guarded  by  an 
ecclesiastical  body  but  encompasses  several  reli¬ 
gious  streams  founded  by  gurus  who  pay  obei¬ 
sance  to  any  one  of  the  many  gods  of  the  Hindu 
pantheon  (Shiva,  Vishnu,  or  one  of  his  avatars,  or 
the  Sbaktas,  who  worship  the  Goddess  under  one 
form  or  another).  Thus,  the  Hinduism  of  the  great 
tradition  appears  as  a  “conglomeration  of  sects” 
(Romila  Thapar,  1989,  p.  207),  known  as  sampra- 
daya  (from  the  Sanskrit  samprada,  to  transmit); 
indeed,  the  essence  of  the  sect  lies  in  “the  uninter¬ 
rupted  transmission  from  one  master  to  another” 
(Catherine  Clementin-Ojha,  1990,  p.  19)  of  the 
message  of  the  founding  guru,  which  itself  derives 
from  a  revelation.  (However,  a  Hindu  sect  may 
also  take  the  form  of  a  panth  [way],  where  the 
founder  is  not  considered  as  having  been  the 
recipient  of  a  divine  revelation.)  Isolated  from  one 
another,  these  sects  are  also  rivals  to  the  extent 
that  they  compete  for  patronage  and  for  the  pre¬ 
eminence  of  their  particular  teachings. 

The  main,  if  not  the  only,  current  of  Hinduism — 
which  became  formalized  in  a  way  that  approxi¬ 
mates  to  an  “ecclesiastical  structure”  (Thapar, 
1985,  p.  17) — was  that  of  Shankara.  This  ascetic 
reformer  of  the  8th  century  responded  to  the 
spread  of  Buddhism  by  establishing  monasteries 
(■ math )  in  the  four  corners  of  India — Sringeri  in 
the  south,  Dwarka  in  the  west,  Badri  in  the  north, 
and  Puri  in  the  east — at  the  head  of  which  were 
placed  shankaracharyas,  ordained  to  exercise  a 
spiritual  authority  comparable  with  that  of  the 
Buddhist  clergy. 

The  Making  of  Hinduism 

In  spite  of  the  common  features  listed  above,  the 
remarkable  diversity  of  Hinduism  prevailed  for 
centuries,  to  such  an  extent  that  until  the  16th 
century  there  was  no  single  religious  tradition 
known  as  “Hinduism.”  The  development  of  a  col¬ 
lective  Hindu  consciousness  was  inhibited  not  only 
by  the  extreme  social  and  religious  differentiation 


within  Hinduism  but  also  by  a  tendency  to  dis¬ 
count  the  importance  of  the  Other  and  therefore  to 
ignore  the  need  for  solidarity  in  the  face  of  that 
Other.  In  the  process  of  sociocultural  integration, 
a  capacity  for  assimilation  in  the  caste  system  is 
revealed:  Insofar  as  the  hierarchy  represents  a  sys¬ 
tem  of  gradation  based  on  the  notion  of  ritual 
purity,  everyone  can  find  a  place  in  it,  below  the 
Brahmins,  according  to  the  degree  of  conformity 
with  the  exalted  values  personified  by  Brahmins. 
As  a  result,  invaders  such  as  the  Huns  found  them¬ 
selves  classed  as  Kshatriyas  (warriors),  observing 
the  ritual  practices  prescribed  for  these  castes  and 
recognizing  the  authority  of  the  Brahmins. 

Paradoxically,  the  ability  to  integrate  newcom¬ 
ers  also  reflected  an  inability  to  recognize  the 
Other  as  such.  Brahminical  texts  assume  that  the 
indigenous  social  order  has  a  homologous  rela¬ 
tionship  to  the  cosmic  order,  the  dharma-,  and  it  is 
the  preservation  of  this  relationship  in  equilibrium 
that  tends  to  demand  the  assimilation  of  foreign 
elements  into  the  social  system.  The  same  texts 
describe  this  society  as  endowed  with  a  language 
and  territory  with  unique  attributes  and  thus 
enable  the  rites  necessary  to  perpetuate  the  dhar- 
mic  order  to  be  performed.  Sanskrit  is  that  lan¬ 
guage  par  excellence,  and  other  languages  are  seen 
as,  at  best,  degraded  forms  of  it.  Parallel  to  this, 
North  India — Aryavarta — is  considered  to  be  the 
land  where  the  rites  necessary  for  the  maintenance 
of  dharma  are  fulfilled. 

This  belief  in  a  privileged  connection  between 
India  and  dharma  underlies  the  logic  of  the  social 
system  of  the  varnas,  which  consists  of  Brahmins, 
Kshatriyas,  Vaishyas  (merchants  and  artisans), 
and  Shudras  (the  servile  castes).  In  a  foundational 
myth  of  the  Vedas — the  basis  of  the  Brahmanical 
great  tradition — these  varnas  are  described  as 
owing  their  origin  to  the  sacrificial  dismember¬ 
ment  of  the  cosmic  Primeval  Man,  the  Virat 
Purushan,  when  the  Brahmin  proceeded  from  this 
mouth,  the  Kshatriya  from  his  arms,  the  Vaishya 
from  his  thighs,  and  the  Shudra  from  his  feet.  This 
division  of  the  social  structure  into  four  parts 
reflects  the  belief  in  a  correspondence  between 
Hindu  society  and  the  totality  of  the  cosmic  order, 
a  correspondence  that  hampers  the  development  of 
a  distinct  identity.  Nothing  outside  it  can  have  any 
meaning  or  be  worthy  of  interest.  In  the  absence  of 
“them,”  there  cannot  be  “we”  and  “us.” 
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The  term  Hindu  was  originally  derived  from  the 
name  Indus  and  was  used  successively  by  the 
Achaemenids,  the  Greeks,  and  the  Muslims  to 
denote  the  population  living  beyond  that  river. 
However,  it  was  not  appropriated  by  the  people 
who  themselves  bore  this  designation,  nor  did  they 
use  it  themselves  till  the  medieval  period.  This  devel¬ 
opment  has  to  be  seen  in  conjunction  with  the  pen¬ 
etration  of  the  subcontinent  by  Muslim  invaders, 
who  were  the  first  to  put  up  a  sustained  resistance 
to  the  capacity  of  the  indigenous  culture  for  assimi¬ 
lation.  For  example,  to  be  integrated  into  the 
Ksbatriya  varna,  Muslim  leaders  would  have  been 
obliged  to  recognize  the  natural  superiority  of  the 
Brahmins-,  in  fact,  they  refused  to  give  up  the  prac¬ 
tices  required  by  Islam  and  indeed  were  prepared  in 
certain  circumstances  to  clash  with  the  Hindus, 
such  as  over  cow  slaughter.  Such  antagonism  con¬ 
tributed  to  the  emergence  of  a  Hindu  consciousness. 

The  Hindu  consciousness  apparently  found  its 
principal  expression  in  the  17th  and  18th  centuries 
in  the  empire  of  Shivaji  and  then  in  the  Maratha 
confederation.  In  1720,  Brahmins  took  over  the 
latter  kingdom,  in  which  they  had  formerly  served 
as  chief  ministers  (Peshvas — a  name  they  retained 
when  they  became  the  ruling  dynasty)  and  military 
chiefs.  These  two  political  institutions  were  formed 
in  Maharashtra  in  opposition  to  the  Mughal 
Empire  and  in  the  name  of  dharma-,  the  slaughter 
of  cows,  which  the  Muslims  sometimes  offered  as 
a  sacrifice,  was  thus  forbidden  there.  However, 
according  to  Christopher  Bayly,  the  conquests  of 
the  Marathas  in  the  direction  of  the  Gangetic  plain 
were  not  examples  of  religious  war  based  on  ethnic 
or  communal  consciousness.  They  resulted  from  a 
motivation  that  was  ritual  in  character — to  restore 
to  the  Hindus  certain  holy  places,  such  as  Varanasi, 
which  were  revered  throughout  India. 

From  Hinduism  to  Neo-Hinduism 

Things  changed  after  the  establishment  of  the 
British  Raj  in  the  mid-  19th  century.  The  prosely¬ 
tizing  activities  of  the  Christian  missionaries  and 
the  (selective)  reformist  zeal  of  the  administration 
fostered  a  reaction  from  the  Hindu  upper  castes, 
mostly  from  the  Brahmins,  who  were  the  custodi¬ 
ans  of  the  Hindu  tradition. 

The  Brahmo  Samaj  was  founded  in  1828  not  as 
a  conservative  movement  but  as  the  instrument  of 


Hindu  revivalism.  Admitting  that  Hinduism  had 
lost  its  pristine  purity — as  the  European  Orientalists 
were  arguing  since  the  18th  century — its  leaders 
claimed  that  it  could  regain  it  through  a  reform 
process  drawing  its  inspiration  from  a  Golden  Age 
that  was  associated  with  the  Vedic  period.  This 
invention  of  tradition  was  the  touchstone  of  a  new 
Hindu  nationalism  that  acquired  its  definitive 
shape  in  the  late  19th  century  and  early  20th  cen¬ 
tury.  In  1875,  the  Arya  Samaj  claimed  that  the 
Hindus  descended  from  the  first  people  of  the 
earth — the  Aryans — whose  language,  Sanskrit, 
was  the  mother  of  all  languages.  In  1915,  this 
movement  was  the  crucible  of  the  first  Hindu 
nationalist  party,  the  Hindu  Mahasabha.  A  few 
years  later,  Vinayak  Damodar  Savarkar  gave  an 
ideological  charter  to  Hindu  nationalism  in  a  book 
called  Hindutva:  Who  Is  a  Hindu ?  (1923). 

The  Muslim  mobilization  in  defense  of  the 
Khilafat  (caliphate)  had  a  catalytic  effect,  and  from 
then  on,  the  Muslim  minority  was  the  real  bete 
noire  of  the  Hindu  nationalists  because  of  its  Pan- 
Islamic  solidarities  and  its  reluctance  to  assimilate 
into  the  Hindu  civilization.  The  Hindu  identity 
had  crystallized  primarily  in  reaction  to  this  so- 
called  threatening  Other.  Its  contours  were  further 
hardened  by  the  census  operations,  which  enumer¬ 
ated  religious  communities  from  1871  onward, 
and  by  the  politicization  of  religion  under  the 
impulse  of  populist  politicians. 

The  erosion  of  the  traditional  inner  diversity  of 
Hinduism  continued  during  the  20th  century  under 
the  influence  of  these  different  forces.  Among  them, 
the  most  powerful  was — and  still  is — the  Rashtriya 
Swayamasevak  Sangh  (The  National  Volunteers 
Corps),  which  was  founded  in  1925  and  has  devel¬ 
oped  several  fronts  after  independence,  including  a 
student  union,  a  labor  union,  and  a  political  party, 
the  Jana  Sangh,  whose  heir,  the  Bharatiya  Janata 
Party  (Indian  People’s  Party),  led  the  ruling  coali¬ 
tion  from  1998  to  2004.  While  Hindu  nationalism 
has  stagnated  in  the  early  21st  century,  its  social 
activities  (including  free  medicine  and  education) 
and  its  proselytizing  zeal  (aiming  to  reconvert 
Christian  communities  especially)  have  increased. 

Christophe  Jaffrelot 
CERI-Sciences  Po/CNRS 
Paris,  France 
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Historical  Memory 


The  concept  of  “historical  memory,”  often 
expressed  as  “collective  memory,”  “social  mem¬ 
ory,”  or  for  political  scientists,  “the  politics  of 
memory,”  refers  to  the  ways  in  which  groups,  col¬ 
lectivities,  and  nations  construct  and  identify  with 
particular  narratives  about  historical  periods  or 
events.  Historical  memories  are  foundational  to 
social  and  political  identities  and  are  also  often 
reshaped  in  relation  to  the  present  historical- 
political  moment.  In  this  entry,  the  origins  and  uses 
of  this  concept  in  contemporary  political  science 
are  discussed.  This  applies  particularly  to  recent 
transitions  from  authoritarian  rule  and  the  forma¬ 
tion  of  newly  democratizing  political  cultures. 

The  study  of  memory  has  a  long  tradition  in 
other  social  science  disciplines.  Yet  until  fairly 
recently,  political  scientists  tended  to  dismiss  his¬ 
torical  memory  as  a  conceptual  or  research  arena. 
There  seemed  to  be  three  general  reasons  for  this: 
First,  political  scientists  saw  memory  as  too  subjec¬ 
tive,  as  properly  the  realm  of  psychology,  difficult 
to  measure,  quantify,  and  operationalize  in  ways 


that  could  be  compared  and  generalized.  Second, 
political  scientists  tended  to  hold  that  “relevant” 
memory  belongs  to  the  discipline  of  history,  of 
sorting  facts,  validity,  and  particularities,  and  that 
particularities  were  simply  not  what  was  driving 
the  discipline  of  political  science.  It  was  for  the 
historians  to  decide  what  constituted  the  dividing 
line  between  memory  and  history.  Third,  political 
scientists  tended  to  view  collective  memories  as 
codified  in  social  and  political  institutions,  and 
therefore  it  was  more  useful  to  study  institutions 
than  memories.  Institutions  enshrine  memories. 

Despite  these  biases  against  the  study  of  mem¬ 
ory  as  a  lens  on  politics,  political  scientists  have 
come  to  appreciate  that  historical  memories  pow¬ 
erfully  influence  politics  in  observable  as  well  as 
subjective  ways;  that  they  are  constructions  of  fact, 
myth,  and  interpretation;  and  that  they  fall  outside 
the  rubric  of  institutions  as  usefully  understood. 
Historical  memories  are  the  less  conventionally 
institutionalized  dimensions  of  politics — symbols 
and  sites  for  contestation,  associations,  palpably 
expressed  through  representations,  testimonials, 
imagery,  the  media,  public  opinion,  and  political 
discourse. 

In  political  contexts  that  involve  transitions 
from  conflict,  war,  and  repression,  all  of  which 
involve  traumatic  individual  and  collective  experi¬ 
ences,  collective  memories  prove  difficult  to  ignore 
politically.  Memories  often  become  mobilized  to 
challenge  opponents,  including  the  state.  States 
themselves  can  be  aggressive  as  purveyors  of 
national  memory,  illustrated  by  the  recent  prolif¬ 
eration  of  officially  sponsored  truth  commissions 
and  “museums  of  memory”  around  the  globe. 
Implicitly,  the  nation-state  has  always  been  preoc¬ 
cupied  with  developing  a  national  memory  that 
exudes  unity,  continuity,  stability,  and  purpose. 
This  is  often  expressed  through  commemoration, 
educational  textbooks,  and  official  political  rheto¬ 
ric.  There  is  a  significant  and  growing  set  of 
debates  on  statecraft,  the  nation  and  memory,  or 
the  memory-nation. 

History  of  the  Term 

The  French  sociologist  Maurice  Halbwachs  is 
widely  regarded  as  the  founding  father  of  collec¬ 
tive  memory  studies.  His  work  On  Collective 
Memory ,  originally  published  in  1925,  broke  new 
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ground  in  the  sociology  of  knowledge.  Halbwachs 
emphasized  social  interaction  as  foundational  to 
individual  memory — as  formed  in  relation  to 
groups  and  collectivities — and  remembering  itself 
as  a  process  subject  to  the  needs  of  society  as  a 
whole.  Both  classic  and  more  contemporary  works 
in  sociology  and  history  argue  persuasively  that 
memory  is  reconstructed  over  generations  to  fit 
particular  social  and  political  contexts. 

Following  this  vein  in  the  1980s,  the  French 
historian  Pierre  Nora  made  a  major  contribution 
to  reawakening  interest  in  Europe  concerning  the 
relationships  between  memory  and  history. 
Together  with  a  team  of  well-known  researchers, 
Nora  produced  a  seven-volume  study  locating  and 
classifying  a  host  of  French  lieux  de  memoir e,  or 
“sites  of  memory,”  that  evoke  or  symbolize,  in  an 
emotive  sense,  the  modern  French  nation.  In  the 
past,  according  to  Nora,  history  and  memory 
blended  together  fairly  seamlessly.  Historic  lieux 
de  memoire ,  including  monuments,  the  national 
flag,  school  history  textbooks,  commemorative 
dates,  national  museums,  and  others,  served  to 
project  a  shared  sense  of  civic  values  and  an  alle¬ 
giance  to  the  French  Republic.  Nevertheless,  Nora 
claimed  that  such  cohesive  projections  had  given 
way  in  postmodernity  to  particularistic,  frag¬ 
mented,  subnational  identity  sites.  Nora  claims 
that  a  psychologization  of  memory,  an  individual 
psychology  of  remembering,  has  increasingly 
replaced  the  memory-nation  and  the  social  rituals 
and  emotive  symbols  that  accompany  a  shared, 
master  past. 

Memory  scholars  often  charge  that  Nora’s 
analysis  leans  toward  sentimentalizing  the  notion 
of  a  national  memory  or  lamenting  the  loss  of 
shared  national  narratives,  of  some  kind  of  master 
memory.  Postcolonial,  subaltern  studies  scholars 
note  with  irony  such  lament  or  nostalgia.  Scholarly 
work  on  voices  and  memories  of  the  subaltern, 
including  memories  of  the  French  colonial  past 
and  French  practices  in  Algeria,  Haiti,  and  else¬ 
where,  prove  powerful  correctives  to  any  tendency 
to  romanticize  the  French  nation.  In  addition, 
studies  of  Vichy  France,  including  most  promi¬ 
nently  that  of  the  French  historian  Henry  Rousso, 
emphasize  the  deliberate  efforts  of  the  post-World 
War  II  political  elite  to  recraft  a  national  memory 
of  French  collusion  with  the  Nazi  regime  as  a 
memory  of  resistance  to  Nazism.  In  his  history  of 


the  evolution  of  memories  regarding  France  and 
World  War  II,  Rousso  termed  the  French  preoc¬ 
cupation  with  this  period  the  Vichy  syndrome ,  a 
constantly  erupting  societal  neurosis  about  French 
failure  to  prevent  or,  for  the  most  part,  even  to 
challenge  the  fascist  takeover  of  government. 

The  British  international  relations  theorist  Jenny 
Edkins  emphasizes  the  violent  production  and 
reproduction  of  the  state  through  commemora¬ 
tion,  particularly  in  the  aftermath  of  wars,  geno¬ 
cides,  famines,  and  terrorism,  when  states  attempt 
to  erase  memories  of  atrocities  committed  and 
compelled  by  the  state  as  well  as  against  other 
states.  Yet  Edkins  also  shows  that  citizens  do  not 
accept  such  erasures  lightly.  Edkins’s  work  on 
World  War  I,  the  Vietnam  War,  the  Holocaust, 
and  September  If,  2001,  demonstrates  how  in 
spite  of  states’  intentions,  families  and  groups  in 
society  frequently  challenge  states  to  mourn  and  to 
be  held  accountable  for  atrocities.  Edkins  terms 
societal  challenges  to  state  erasures  an  encircling  of 
traumatic  memory.  Recent  historical  memory 
scholarship  warns  against  neat  divisions  between 
state  and  society  in  battles  over  memory.  Rather, 
scholars  emphasize  negotiation,  tense  collabora¬ 
tions,  complementarities,  as  well  as  struggles 
within  civil  society  over  memory  itself. 

Historical  Memory  and  Transitions 
From  Authoritarian  Rule 

Historical  memory  began  to  make  an  impact  in 
political  science  when  it  came  to  be  recognized  as 
an  important  dimension  in  regime  transition  dur¬ 
ing  the  latest  “wave”  of  democratization  since  the 
1970s.  The  first  political  science  transitions  theo¬ 
rist  to  study  historical  memory  in  an  intensive  way 
was  the  Spanish  scholar  Paloma  Aguilar,  who 
documented  how  political  discourse  was  gradually 
reinvented  to  allow  for  a  more  consensualist  elite 
interpretation  of  past  conflicts  across  the  political 
spectrum,  thereby  facilitating  the  mid-1970s 
Spanish  transition  from  authoritarianism  to 
democracy,  involving  a  consensus  between  the 
moderates  on  both  sides — the  “old”  elite  and 
the  opposition.  More  than  a  quarter  century  after 
the  Spanish  transition  from  Franco  (1939-197 5), 
however,  long-repressed  memories  of  the  Spanish 
Civil  War  (1936-1939)  have  resurfaced  and  pro¬ 
duced  a  range  of  collective  action,  debate,  and 
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legislation.  This  is  illustrative  of  the  nonlinear 
nature  of  historical  memory,  where  generations- 
old  events  and  phenomena  can  be  recalled  and 
deployed  as  if  they  occurred  far  more  recently. 

Scholarly  debates  on  the  politics  of  memory 
have  emerged  in  several  transitioning  regions  of 
the  world,  including  Latin  America,  Eastern 
Europe,  and  the  former  Soviet  republics.  In  the 
case  of  Chile,  for  example,  the  American  political 
scientist  Brian  Loveman  and  the  Chilean  social 
psychologist  Elizabeth  Lira  traced  a  powerful  pat¬ 
tern  of  conflict  and  subsequent  political  elite 
amnesia,  manifested  chiefly  as  postconflict  con¬ 
gressional  declarations  of  amnesty,  from  the  19th 
century  to  the  very  recent  past.  Drawing  a  good 
deal  from  Holocaust  studies,  the  Argentinian  soci¬ 
ologist  Elizabeth  Jelin  conceptualized  and  led  a 
major  collective  memory  social  science  research 
initiative.  The  project  produced  12  volumes  on 
what  Jelin  framed  as  “state  repression  and  the 
labors  of  memory”  in  the  aftermath  of  dictator¬ 
ships  in  Argentina,  Chile,  Uruguay,  and  Brazil,  as 
well  as  in  Peru  at  the  close  of  the  Alberto  Fujimori 
(1990-2000)  regime.  The  purpose  of  the  project 
was  both  to  define  a  memory  studies  field  in  Latin 
America’s  Southern  Cone,  Brazil,  and  Peru  and, 
implicitly,  to  contribute  to  rebuilding  the  social 
sciences  in  the  wake  of  their  devastation  by  dicta¬ 
torship.  Like  most  scholarship  on  historical  mem¬ 
ory,  the  agenda  was  thus  ethical  and  practical  as 
well  as  intellectual  and  interdisciplinary. 

Historical  memory  studies  emphasize  the  fitful 
process  of  what  is  (or  as  Nietzsche  once  said,  what 
must  be)  forgotten  to  forge  a  viable  politics  of  the 
present.  Framed  in  an  instrumentalist  way,  histori¬ 
cal  memory  is  at  the  service  of  political  actors  and 
institutional  processes.  Embedded  in  many  of  the 
historical  memory  studies,  however,  is  a  serious 
critique  of  officialist  approaches  and  policies  of 
“forget”  to  move  on.  Common  to  historical  mem¬ 
ory  works  are  at  least  three  arguments:  first,  that 
different  generations  are  entirely  capable  of  inter¬ 
preting  the  same  political  events  differently;  sec¬ 
ond,  that  political  ideology  or  partisanship  contin¬ 
ues  to  weigh  heavily  on  interpretations  of  past 
political  events;  and  third,  that  in  the  aftermath  of 
traumatic  conflict,  a  significant  number  of  both 
citizens  and  political  elites  profoundly  desire  con¬ 
sensual  collective  memory  images,  crafted  by  the 
political  class,  that  convey  national  unity  and 


peace.  Such  consensual  images  can  have  the  ability 
to  overcome  ideologically  driven  memory  divides, 
at  least  for  a  while.  The  studies  remind  us  that 
though  national  political  trauma  may  have 
occurred  decades  ago,  memories  of  such  trauma 
continue  to  influence  politics  at  several  levels. 

At  the  same  time,  memories  can  be  powerfully 
enduring  in  spite  of  social  change  or  dominant 
political  discursive  attenuation.  Official  memo¬ 
ries  can  be  perceived  as  being  imposed  far  more 
than  embraced.  And  on  the  ground,  in  countries 
such  as  Chile,  state  actors  and  institutions  are  in 
constant  reactive  mode  to  grassroots  memory 
mobilizing.  Social  historians  document  how  indi¬ 
vidual  and  collective  memories  part  dramatically 
and  are  not  obviously  reconcilable  and  that  dis¬ 
tinct  communities  temporalize  traumatic  memo¬ 
ries  quite  distinctly. 

While  calls  to  remember  atrocious  pasts  have 
been  consistently  embraced  by  grassroots  sectors, 
there  is  also  considerable  a  desire  to  forget.  This 
desire  is  not  limited  to  the  political  elite,  for  whom 
resurrecting  painful  pasts  can  prove  contentious 
and  costly.  The  wish  to  forget  can  also  be  heard 
from  communities  affected  most  directly  by  armed 
conflict,  including  formerly  displaced  families  that 
have  lost  loved  ones  and  are  uncertain  whether 
they  can  return  either  physically  to  their  former 
communities  or  mentally  and  spiritually  to  their 
memories  of  violence  and  loss. 

In  their  studies  of  communities  in  Ayacucho, 
Peru,  the  region  in  which  confrontations  between 
the  Peruvian  military  and  the  Shining  Path  most 
frequently  occurred  (1980-2000),  the  scholars 
Ponciano  del  Pino  and  Kimberly  Theidon  reveal  a 
pattern  of  narration  in  indigenous  accountings  of 
the  recent  past  that  they  have  termed  toxic  mem¬ 
ory.  Toxic  memory  emerges  from  experiences  of 
intense,  direct  violence  within  a  community  for 
which  there  is  no  recourse,  no  sense  of  the  possibil¬ 
ity  of  social  justice,  or  remorse  from  the  perpetra¬ 
tors.  Public  memory  debates  in  such  settings  are 
explicitly  constrained  by  the  knowledge  of  what 
violence  particular  agents  are  capable  of  exacting 
and  by  power  dynamics  that  make  no  guarantees 
that  such  violence  will  be  prevented  in  the  future. 

Today,  the  term  politics  of  memory  is  most 
often  associated  with  studies  that  assess  human 
rights  politics  and  policies — prosecutions,  truth 
telling,  commemoration,  reparations — as  well  as 
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with  what  is  commonly  referred  to  as  the  “transi¬ 
tional  justice”  literature.  Transitional  justice  stud¬ 
ies  analyze  how  particular  political  institutions, 
policies,  and  actors  hold  human  rights  violators  of 
previous  regimes  accountable  for  their  crimes. 
Such  studies  focus  on  the  judiciary,  the  military, 
and  civil  society  organizations,  as  well  as  the  exec¬ 
utive  and  legislature.  Government-sponsored  truth¬ 
telling  processes,  for  example,  are  largely  symbolic 
exercises  to  produce  official  and  societal  acknowl¬ 
edgment  of  past  atrocities  and  to  drive  home  the 
message  of  never  again.  Such  processes  are  fraught 
with  debate  over  political  intent:  What  truths 
should  be  privileged  and  what  downplayed?  Where 
should  remembering  begin?  Should  testimonies  be 
private  or  public?  Can  witnesses  be  subpoenaed? 
How  will  the  findings  be  deployed?  Truth-telling 
designs  have  significant  political  implications, 
some  anticipated,  but  some,  such  as  the  irruptions 
of  memories  themselves,  unanticipated. 

The  anthropologist  Richard  Wilson  argues  that 
government-sponsored  truth  commissions  attempt 
to  craft  narratives  of  the  past  that  render  the  pres¬ 
ent  more  governable  and  that  “manufacture 
bureaucratic  legitimacy”  for  the  state.  At  first 
glance,  such  would  appear  to  be  the  case  for  the 
truth  commissions  of  Chile  (1991)  and  Peru 
(2003),  both  of  which  include  the  term  reconcilia¬ 
tion  in  their  official  titles,  suggesting  an  intention 
to  build  consensus  and  foster  national  unity.  The 
United  Nations-sponsored  Guatemalan  truth  com¬ 
mission,  in  contrast,  defines  itself  as  a  body  for 
“historical  clarification,”  explicit  in  its  intent  to 
examine  long  histories  of  violence  and  injustice 
that  make  even  the  concept  of  reconciliation  more 
distant.  The  proliferation  of  truth  commissions, 
memory  debates,  and  the  like  can  be  interpreted  as 
a  sign  of  leaders  striving  for  a  new  kind  of  credibil¬ 
ity.  In  contrast  to  an  earlier  period  in  which  politi¬ 
cal  leaders  attempted  to  move  on,  to  “turn  the 
page”  on  painful  memories,  politicians  are  increas¬ 
ingly  resorting  to  political  manipulations  of  mem¬ 
ory,  championing  explorations  of  the  past,  and 
instrumentalizing  memory  to  serve  their  agendas 
or  to  enhance  their  status. 

The  transitional  justice  literature  brings  interna¬ 
tional  institutions  and  actors  deliberately  to  the 
fore,  marrying  international  influences  with  local 
power  dynamics.  Yet  a  politics  of  memory  literature 
is  a  more  apt  conceptual  frame  than  transitional 


justice  for  such  policies,  given  that  much  of  the 
movement  to  hold  human  rights  violators  account¬ 
able,  and  to  bring  an  important  measure  of  justice, 
is  taking  place  in  posttransitional  rather  than  tran¬ 
sitional  contexts.  In  addition,  a  politics  of  memory 
frame  allows  us  both  to  reach  back  in  historical 
terms  and  at  the  same  time,  therefore,  to  reach 
beyond  the  focus  on  human  rights  violations  of 
repressive  regimes,  to  violence  and/or  dramatic 
phenomena  that  mark  distinct  temporalities  for 
distinct  collectivities.  It  is  clear  in  the  Chilean  case, 
for  example,  that  the  indigenous  Mapuche  com¬ 
munities  possess  deep  historical  understandings  of 
what  constitutes  trauma,  as  well  as  what  consti¬ 
tutes  continuity  in  Chile.  In  contrast,  the  Chilean 
right  consistently  evokes  memories  of  the  1960s 
land  reform  and  takeovers  during  a  democratic 
regime  as  a  haunting  traumatic  memory.  Politics  of 
memory  studies  heighten  our  analytical  sensitivity 
to  distinct  temporalities  as  well  as  more  broadly  to 
the  role  of  timing  in  political  processes. 

Katherine  Hite 
Vassar  College 
Poughkeepsie,  New  York,  United  States 
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Historical  Sociology 


History  and  sociology  have  been,  since  the  19th 
century,  two  completely  separate  fields  based  on 
different  views  of  society.  History  was  supposed 
to  deal  only  with  the  main  political  or  military 
events,  with  important  people  shaping  the  world, 
its  battles,  its  diplomatic  quarrels,  while  sociology 
was  supposed  to  be  concerned  with  the  evolution 
of  the  society  as  a  collective  body  changing 
through  various  stages.  Sociology  was  frequently 
ahistorical,  neglecting  individuals  and  their  val¬ 
ues,  outlining,  on  the  contrary,  the  structures  and 
the  functions  of  crucial  institutions,  their  role 
within  the  process  of  modernization  leading  all 
societies  toward  some  kind  of  progress.  History 
and  sociology  were  competing,  excluding  each 
other  from  the  realm  of  social  science.  History 
was  the  noble  science,  often  building  a  kind  of 
philosophical  approach  accepted  by  everyone  on 
the  meaning  and  the  destiny  of  random  events. 
Increasingly,  though,  these  two  fields  have  inter¬ 
sected,  resulting  in  the  discipline  of  historical  soci¬ 
ology,  which  studies  sociology  within  a  historical 
framework.  This  entry  describes  the  historical 
foundations  and  the  contemporary  historical  soci¬ 
ology,  including  its  subfields,  is  discussed. 

Historical  Foundations 

From  Auguste  Comte  to  Emile  Durkheim,  sociol¬ 
ogy  was  mainly  based  on  positivism  and  an  objec¬ 
tive  knowledge  of  society  excluding  ideologies, 
values,  and  even  cultures  from  the  variables 
explaining  social  phenomena.  Even  though 
Durkheim,  the  father  of  contemporary  sociology, 
once  said  that  sociologists  need  historians  trained 
as  sociologists,  he  nevertheless  saw  history  as  a 


chronology  of  dates  and  singular  events  unrelated 
to  any  more  general  social  laws.  For  example,  in  a 
comparative  study  of  suicide,  Durkheim  acknowl¬ 
edged  the  role  played  by  religion;  however,  his 
explanation  of  the  rate  of  suicide  did  not  consider 
the  role  of  history  or  of  political  crises. 

The  Marxist  tradition  was  also  built  outside 
any  real  historical  explanation.  Historical  materi¬ 
alism,  a  theory  explaining  changes  based  on 
contradictions  occurring  between  class  interests 
structured  by  capitalism,  did  not  in  fact  focus  on 
the  impact  of  specific  historical  events  but  instead 
simply  described  these  contradictions  as  created  by 
the  nature  of  ownership.  History  was  therefore 
seen  as  an  objective  result  of  social  contradictions. 
Karl  Marx,  like  Durkheim,  was  also  a  positivist 
and  advocated  a  form  of  functionalism.  He  did  not 
deal  with  religion,  regarding  it  as  a  means  of  alien¬ 
ating  the  working  class;  he  did  not  compare  reli¬ 
gions  and  their  influence  on  the  emergence  of  a 
specific  kind  of  capitalism  or  political  system.  His 
work,  like  Durkheim’s  analysis,  did  not  contain 
any  serious  comparative  research  that  would  have 
shown  how,  in  the  long  run,  cultures,  religions, 
and  values  shape  almost  all  social  phenomena, 
such  as  the  state  and  the  public  realm,  the  occur¬ 
rence  of  conflicts  or  revolutions,  the  process  of 
secularization,  the  nature  of  education,  and  even 
different  kinds  of  capitalism.  Although  Marx  was 
aware  of  this  need  for  a  historical  and  comparative 
dimension,  and  although  he  looked  seriously  at  the 
values  of  various  individuals  in  The  Eighteenth 
Brumaire  of  Louis  Napoleon  (1852)  and  short 
articles  written  for  various  newspapers,  he  ignored 
history  and  culture  in  his  main  works,  such  as  Das 
Kapital  (1867).  Thus,  the  Marxist  movement’s 
neglect  of  crucial  variables  such  as  states  and  reli¬ 
gions  resulted  in  a  reluctance  to  look  seriously  at 
other  elements  of  culture,  such  as  loyalty  to  a 
nation,  which  is  shared  by  most  citizens;  deeply 
rooted  traditions  and  the  molding  of  cultures;  and 
the  values  and  behavior  of  all  individuals,  regard¬ 
less  of  their  position  in  the  economic  structure. 
This  weakness  had  important  consequences  for 
contemporary  sociology,  which  was  largely  shaped 
by  Marxism  or  by  a  Durkheimian  positivist  view 
looking  at  correlations  between  purely  morpho¬ 
logical  variables. 

Therefore,  to  find  traces  of  historical  sociology 
in  previous  centuries,  one  must  look  at  Alexis  de 
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Tocqueville’s  work.  During  the  first  part  of  the 
19th  century,  far  from  his  own  society,  Tocqueville 
discovered  the  culture,  history,  and  religion  of  the 
United  States.  He  also  traveled  to  Algeria  and 
Ireland.  He  was  keen  to  learn  as  much  as  he  could 
about  India’s  caste  system  and  values.  At  the  core 
of  his  work  is  a  historical  comparative  sociology 
dealing  with  different  kinds  of  states,  religions, 
and  cultures  and  numerous  interpretations  of  indi¬ 
vidualism  and  even  of  liberty  and  equality  within 
societies  based  on  almost  the  same  kind  of  capital¬ 
ism.  In  his  main  book  Democracy  in  America 
(1835),  Tocqueville  implicitly  compared  France 
with  the  United  States  and  showed  that  everything 
was  quite  different  within  the  same  socioeconomic 
system;  comparative  histories  (presence  or  com¬ 
plete  absence  of  feudalism  and  therefore  of  nobil¬ 
ity)  and  cultures  (Catholicism  vs.  Protestantism) 
therefore  matter  deeply.  Tocqueville  also  raised 
the  kind  of  issues  that  are  much  discussed  today:  Is 
it  possible  to  import  some  institutions,  such  as 
democracy,  into  another  culture  molded  by  a  dif¬ 
ferent  religion?  At  what  cost?  With  what  kind  of 
collective  movement  leading,  in  reaction,  toward 
nationalist  anger?  He  foresaw  this  crucial  contem¬ 
porary  discussion  related  to  the  importation  and 
exportation,  through  soft  or  military  means,  of 
various  regimes  and  institutions  in  societies  shaped 
by  contradictory  histories  and  values.  Tocqueville 
was  mainly  a  politician;  unfortunately,  he  never 
built  a  systematic  model  laying  the  foundation  of 
historical  sociology.  He  remained  alone  in  his 
comparative  historical  attempt,  and  not  until 
almost  the  end  of  the  19th  century  did  the  real 
birth  of  modern  historical  sociology  appear 
through  Max  Weber’s  work. 

In  Weber’s  main  books,  such  as  Economy  and 
Society  (1920)  or  The  Protestant  Ethic  and  the 
Spirit  of  Capitalism  (1905),  and  in  his  numerous 
studies  of  ancient  Judaism,  China,  Japan,  or  even 
the  Muslim  world  (looking  seriously  at  Judaism, 
Hinduism,  Buddhism,  and  Islam),  he  was  clearly 
comparing  sociologically  different  societies  shaped 
by  various  histories,  religions,  and  cultures.  Deeply 
concerned  by  the  values  of  each  actor,  by  their 
degree  of  rationality  or  irrationality,  and  looking  at 
the  interactions  among  all  of  them,  the  way  they  met 
and  organized  their  world  together,  Weber  can  be 
seen  as  the  founder  of  historical  sociology.  Among 
sociologists,  he  was  the  first  to  deal  extensively  with 


questions  of  power,  state,  and  authority  in  differ¬ 
ent  historical  frameworks,  from  sultanism  to  feu¬ 
dalism,  patrimonialism,  and  modern  rational 
bureaucracy  in  the  context  of  capitalism.  Far  from 
any  positivist  methodology  and  avoiding  an  evolu¬ 
tionist  interpretation  common  among  most  authors 
studying  the  process  of  modernization,  Weber 
permanently  underlined  the  intentions  of  every 
individual  and  tried  to  understand  his  or  her 
decision  and  his  or  her  reasons  for  acting  in  a 
particular  way.  Weber’s  interpretative  approach  to 
historical  sociology  finally  provided  a  valuable 
comparative  methodology.  For  example,  Weber 
was  the  first  to  call  attention  to  charismatic 
authority — the  fact  that  some  individuals  possess 
personal  and  almost  magical  qualities  that  allow 
them  to  lead  people,  crowds,  political  parties,  or 
even  a  nation.  Neither  Marxist  economic  analysis 
nor  Durkheim’s  positivist  methodology  was  able 
to  interpret  this  personal  quality  (charisma)  of 
some  leaders  in  ancient  as  well  as  modern  societies 
and  in  authoritarian  states  as  well  as  in  democratic 
ones.  Even  though  he  argued  that  a  process  of 
rationalization  or  bureaucratization  was  increas¬ 
ingly  shaping  all  modern  societies,  he  kept  a  non¬ 
evolutionist  view  by  outlining  the  unpredicted 
individual  power  of  some  individuals  within  differ¬ 
ent  cultures. 

Contemporary  Historical  Sociology 

Weber’s  interpretative  sociology  has  many  heirs 
and,  today,  one  can  argue  that  he  remains  at  the 
core  of  contemporary  historical  sociology,  which 
became,  in  the  1960s,  a  legitimate  subfield.  His 
influence  on  most  issues  is  obvious.  Four  different 
fields  within  this  subdiscipline  are  considered  below. 

State 

Alone  among  the  founding  fathers  of  sociology, 
Weber  gave  us  a  convincing  theory  of  the  state  as 
an  institutionalized  and  differentiated  institution 
serving  the  general  interest,  recruiting  its  own  rul¬ 
ing  elite  through  meritocratic  competition,  and 
concentrating  in  itself  all  means  of  violence  previ¬ 
ously  controlled  by  specific  social  groups  or  by 
forces  located  at  the  periphery.  Today,  to  bring 
back  the  state  as  a  crucial  variable,  one  can  look 
either  at  Tocqueville  or,  even  better,  at  Weber’s 
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analysis.  The  contemporary  rediscovery  of  the 
state  began,  in  fact,  mainly  thanks  to  a  kind  of 
revision  of  an  evolutionist  Marxism. 

Immanuel  Wallerstein  and  Perry  Anderson  then, 
while  remaining  Marxists,  nevertheless  were  the 
first  to  introduce  a  strong  comparative  perspective 
into  their  work  during  the  1970s.  Wallerstein,  an 
American  sociologist,  in  the  three-volume  The 
Modern  World  System  (1974,  1980,  1989), 
described,  on  the  one  hand,  the  birth  of  the  state 
located  in  the  core  of  Europe,  on  its  Atlantic 
facade,  where  the  bourgeoisie  needed  a  state  to 
protect  its  economic  interests  and  its  wide-ranging 
foreign  economic  relations,  and,  on  the  other 
hand,  Eastern  Europe,  unaware  of  either  capital¬ 
ism  or  the  birth  of  the  state.  In  his  Lineages  of  the 
Absolutist  State  (1974),  Perry  Anderson,  the  direc¬ 
tor  of  The  New  Left  Review,  contrasted  Western 
and  Eastern  Europe.  He  showed  that  the  state  was 
born  before  capitalism,  in  those  Western  regions 
where  an  intense  feudalism,  in  the  context  of 
Roman  law,  led  to  an  extreme  pluralism  favorable 
to  the  earlier  differentiation  of  a  state  that  made  it 
easier  to  protect  the  dominant  economic  interests. 
In  the  East,  on  the  other  hand,  a  mighty  nobility 
kept  its  power,  using  it  against  the  peasantry’s 
revolts.  These  analyses,  while  keeping  to  an  instru¬ 
mentalist  view  of  the  state,  clearly  belong  to  com¬ 
parative  historical  sociology;  nevertheless,  they 
remained  quite  schematic  in  their  use  of  history, 
and  they  completely  ignored  the  cultural  element. 

Stein  Rokkan,  a  Norwegian  sociologist,  drew 
many  conceptual  maps  of  Europe  based  on  two 
different  axes:  the  first  one,  East/West,  dealing  both 
with  the  economy  and,  in  a  new  approach,  with 
territorial  aspects  and  the  second,  North/South, 
based  on  a  broad  opposition  between  Protestantism, 
Catholicism,  and  regions  of  Counter-Reformation. 
It  was  immediately  clear  that  the  neo-Marxist  con¬ 
ception  of  the  state  had  been  supplemented  by  a 
more  rigorous,  comparative,  Weberian  interpreta¬ 
tion.  Rokkan’s  innovative  work  brought  us  to  the 
core  of  historical  sociology  because  he  was  able  to 
compare  different  paths  of  state  building  in  the 
context  of  meaningful  territorial  variations,  thus 
raising  a  definitive  argument  against  any  kind  of 
evolutionism.  Rokkan  was  clearly  following 
Weber’s  comparative  lesson  when  he  described  the 
absence  of  state  building  in  the  core  of  Europe, 
from  Hamburg  to  Barcelona  through  Geneva, 


Bern,  and  Milan,  where  the  market  was  predomi¬ 
nant  and  each  strong  city  was  unable  to  conquer  its 
surrounding  territory.  This  comparative  approach 
was  also  evident  when  he  showed  how  Protestant¬ 
ism,  in  the  North,  allowed  the  early  building  of  the 
nation  through  a  common  consensus  while  Cath¬ 
olicism  led  to  an  internal  split  of  the  nation  in  the 
South.  However,  Rokkan  remained  focused  only 
on  a  sociological  approach;  his  interest  in  history 
was  still  too  limited. 

Rokkan  did  not  base  his  work  on  a  deep  knowl¬ 
edge  of  English,  French,  German,  or  Russian  his¬ 
tory.  If  he  had,  he  likely  would  have  shown,  for 
instance,  the  incredible  strength  of  the  French  state 
as  a  reaction  against  the  formidable  power  of  the 
Catholic  Church;  he  would  have  acknowledged 
the  weakness  of  the  state  in  the  strong  and  consen¬ 
sual  English  society;  and  he  would  have  described 
more  precisely,  following  Weber,  different  types  of 
states.  This  task  had  been  initiated  by  Charles  Tilly 
in  his  comparative  work  on  the  European  state¬ 
building  process.  Tilly,  both  a  historian  and  a 
sociologist,  showed  the  crucial  influence  of  wars 
on  the  growth  of  the  state’s  bureaucracy.  This  per¬ 
spective  has  also  been  taken  by  Samuel  Finer, 
Shmuel  Eisenstadt,  Joseph  R.  Strayer,  Theda 
Skocpol,  and  other  specialists  on  the  state.  Thus, 
according  to  Bertrand  Badie  and  Pierre  Birnbaum, 
one  can  use  the  broad  opposition  between  strong 
and  weak  states  as  useful  ideal  types  based  on  a 
comparative  history  of  state-building  processes 
explaining  not  only  different  kinds  of  strikes,  col¬ 
lective  actions,  and  new  social  movements  but  also 
various  importation  and  exportation  of  types  of 
states  in  another  cultural  context.  At  last,  sociolo¬ 
gists  are  themselves  becoming  historians,  working 
like  them  on  archives  and  primary  resources  while 
raising  specific  sociological  questions. 

Citizenship 

The  Weberian  sociohistorical  approach  has  also 
been  applied  to  the  question  of  citizenship.  In  a 
lecture  in  London  given  just  after  World  War  II, 
Thomas  Humphrey  Marshall,  an  English  econo¬ 
mist,  presented  a  famous  evolutionist  interpreta¬ 
tion  of  citizenship  based  on  three  dimensions  of 
citizenship  (civil,  political,  and  social)  correspond¬ 
ing  more  or  less  to  three  different  centuries  (the 
18th,  19th,  and  20th)  and  illustrating  mainly  the 
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British  case.  However,  this  model  was  criticized 
for  not  applying,  for  instance,  to  the  German  or 
French  cases  and  for  being  even  less  applicable  to 
Eastern  European  societies.  In  some  societies,  such 
as  Germany,  citizenship  emerged  first  in  the  social 
dimension,  while  the  civil  and  political  dimensions 
emerged  only  recently,  with  the  creation  of  the 
Federal  Republic  after  World  War  II. 

Other  contemporary  societies  still  ignore  most 
dimensions  of  citizenship.  Marshall’s  notion  of 
citizenship  also  implied  a  strong  consensual  and 
democratic  nation  with  a  near  absence  of  strong 
political  and  religious  cleavages,  a  weak  state,  and 
an  efficient  representative  political  system  bringing 
a  deep  feeling  of  loyalty,  with  citizens  sharing  a 
common  interest. 

In  France,  on  the  contrary,  serious  internal 
cleavages  led  to  deep  ideological  conflicts:  among 
them,  the  permanent  antagonism  between  the  state 
and  the  Catholic  Church,  which  led  to  a  divided 
loyalty.  Citizens  were  supposed  to  be  loyal  to  the 
state,  while  confining  their  religious  identity  to  the 
private  sphere.  Until  the  middle  of  the  20th  cen¬ 
tury,  some  Catholics  did  not  really  feel  at  home  in 
the  strong  republican  universalist  state.  Moreover, 
a  strong  radical  right  arose,  wishing  to  destroy  the 
strong  state  that  it  viewed  as  an  artificial  source  of 
French  identity,  while  the  real  French  identity  was 
Catholic.  The  Jacobin  culture  battled  against  reli¬ 
gious  beliefs  that  challenged  the  nation’s  common 
culture  and  that  prompted  the  birth  of  a  unique 
public  space  open  to  meritocratic  procedure.  In  a 
context  such  as  this,  any  private  identity  of  the 
citizen  is  supposed  to  be  hidden  from  the  public 
realm.  In  such  a  historical  context,  one  cannot 
hold  two  identities,  such  as  French  and  Catholic, 
so  far  as  public  allegiance  is  concerned.  Any  mul- 
ticulturalist  identity  is  largely  ignored  and  fought 
within  such  a  strong  republican  state.  For  instance, 
in  Turkey  and  in  France,  this  militant  notion  of 
citizenship  may  explain  the  willingness  to  ban  the 
veil  from  the  public  sphere.  Thus,  a  comparative 
approach  to  the  concept  of  national  identity  based 
on  different  historical  paths  to  emancipation  can 
explain  why  citizens  are  expected  to  show  their 
religious,  cultural,  and/or  gender  identities  in  one 
case,  whereas  in  another,  they  are  barred  from 
doing  so.  The  French  principle  of  lattice  is  thus 
shaped  by  a  long  history  through  which  politics 
was  built  by  separating  it  from  private  life:  In  the 


public  sphere,  the  French  citizen  is  not  permitted 
to  show  up  his  or  her  religious,  cultural,  or  familial 
allegiances;  his  or  her  commitment  to  French  citi¬ 
zenship  is  prior  to  any  other  sources  of  identity 
and  may  even  exclude  them  from  public  life.  This 
is  the  main  basis  of  the  culture  of  the  Jacobin  state. 
The  multiculturalist  approach  of  North  American 
authors  such  as  Charles  Taylor,  Michael  Walzer, 
Michael  Sandel,  and  Will  Kymlicka  cannot  be  eas¬ 
ily  imported  into  France,  Turkey,  or  Tunisia  or 
into  societies  molded  by  a  thick  conception  of 
culture  such  as  Japan,  a  country  eager  to  keep  a 
homogeneous  culture  and  reluctant  to  open  itself 
to  diversity.  In  the  same  perspective,  one  can  also 
make  a  comparative  study  of  Jewish  emancipation. 
For  example,  Jews  in  France  are  openly  asked  to 
assimilate  completely  when  entering  the  public 
sphere,  thereby  quickly  submitting  their  collective 
identity  to  their  citizenship;  whereas  in  the  United 
Kingdom,  the  United  States,  or  in  various  empires 
such  as  the  Austro-Hungarian  Empire  where  col¬ 
lective  identity  is  seen  as  legitimate,  they  are  able 
to  keep  it. 

Elite 

The  comparative  study  of  the  recruitment  of 
elites  to  positions  of  power  is  also  more  and  more 
bound  to  a  historical  sociology  approach.  To 
understand  why  the  circulation  of  elites  is  shaped 
by  a  specific  network,  one  must  take  a  compara¬ 
tive  approach  in  studying  both  the  state  and  the 
ruling  class.  The  stronger  the  institutionalization 
of  the  state,  the  stronger  its  differentiation;  the 
weaker  the  penetration  of  business  or  private  inter¬ 
ests  within  the  state  structure  and  the  state  elites, 
the  weaker  the  penetration  of  the  political  parties, 
of  the  churches,  of  any  recruitment  of  the  elites 
based  on  some  kind  of  affirmative  action.  In  such 
a  secularized  public  realm,  civil  servants,  like  a 
strong  civil  army,  are  protected  by  their  role  and 
status  within  the  state  from  any  political,  religious, 
or  economic  pressures  coming  from  outside.  The 
reverse  is  also  true:  The  weaker  the  state,  the  stron¬ 
ger  are  those  kinds  of  penetration  within  the  elite. 
Thus,  the  horizontal  circulation  of  elites  is  shaped 
by  the  actual  presence  of  different  kinds  of  states 
resulting  from  a  long  history. 

To  summarize  this  point,  the  stronger  the  state, 
the  weaker  and  less  homogeneous  is  the  upper 
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(ruling)  class.  In  addition,  the  stronger  the  state, 
the  more  open  the  vertical  circulation.  For  exam¬ 
ple,  where  there  is  a  state-owned  educational 
system  that  is  not  subject  to  clientelist  or  other 
social  intervention,  there  is  greater  meritocractic 
recruitment  across  all  social  strata.  One  can  thus 
not  only  outline  the  remarkable  differences  in  the 
elite  recruitment  circulation  within  almost  the 
same  capitalist  societies  but  also  study  the  recruit¬ 
ment,  for  instance,  of  political  elites  in  African 
societies  where  even  now,  the  state  is  often  shaped 
by  different  long-term  histories  framed  by  thick 
cultures  that  have  not  experienced  the  process  of 
differentiation. 

Nationalism 

Nationalism  has  also  been  studied  through  a 
new  kind  of  historical  sociology  approach. 
According  to  some  main  theorists,  nationalism  is  a 
normal  and  functional  phenomenon  within  strong 
industrialized  societies  in  which  collective  identity 
has  been  threatened  by  individualism  and  a  pro¬ 
cess  of  atomization.  Nationalism — as  defined  by 
the  British  sociologist  Ernest  Gellner  and  as  viewed 
by  most  authors  whose  ideas  have  been  framed  by 
a  developmentalist  interpretation — is  supposed  to 
be  a  normal  process  of  socialization  controlled  by 
the  state  in  order  to  bring  citizens  together  in  a  sort 
of  nationalization  of  the  mind  owing  to  mass  com¬ 
munication  and  the  educational  system.  Gellner’s 
interpretations  (and  also  the  important  contribu¬ 
tions  of  Karl  Deutsch  and  Elie  Kedourie  on  nation 
building)  are  largely  evolutionist,  ignoring  nation¬ 
alism  as  a  historical  phenomenon  implying  the  use 
of  violence,  the  impact  of  ideologies,  the  cultural 
reaction  against  Enlightenment,  or  a  contempo¬ 
rary  colonial  domination.  Following  the  work  of 
the  English  philosopher  Isaiah  Berlin,  nationalism 
can,  in  fact,  be  described  as  a  “bent  twig”  reaction 
based  on  a  neglected  cultural  identity.  Thus,  to 
understand  its  appearance,  one  again  needs  a  his¬ 
torical  comparative  approach  explaining  why 
German  or  Russian  nationalism  in  the  19th  cen¬ 
tury  was  mainly  a  cultural  reaction  against  the 
French  state  imposing  its  domination  in  the  name 
of  Reason;  why  during  the  20th  century,  the  strug¬ 
gle  for  decolonization  was  also  a  kind  of  collective 
nationalist  protest;  and  why,  for  instance,  Algerian 
nationalism  took  such  a  bloody  form  in  reaction  to 


the  violent  form  of  colonization  imposed  by  the 
strong  French  state  while  countries  colonized  by 
England  or  the  Netherlands  were  more  frequently 
able  to  liberate  themselves  without  a  violent  reac¬ 
tion  to  violent  repression.  This  approach  can  tell 
us  the  following:  (a)  why  this  type  of  nationalism 
may  suddenly  appear  from  within  a  society,  as  a 
collective  reaction  of  a  repressed  culture  usually 
organized  by  the  radical  right — a  populist  uprising 
against  the  establishment  as  a  political  target,  seen 
as  an  almost  foreign  power  cut  off  from  native 
culture;  (b)  why  one  cannot  find  an  obvious 
nationalism  in  pluralist  societies,  such  as  the 
United  States,  wherein  populist  collective  actions 
are  quickly  exhausted;  (c)  why  civic  and  ethnic 
forms  of  nationalism  grow  in  different  historical 
contexts;  (d)  why  not  only  “imagined  communi¬ 
ties”  but  also  realms  of  memory  and  rediscovery  of 
the  past  feed  nationalist  movements  and  to  what 
extent  nationalism  and  patriotism  are  distinct  phe¬ 
nomena;  (e)  why  transnationalism  can  penetrate  a 
society  with  a  weak  state  that  is  more  open  to 
globalization  than  a  strong  nation-state;  and 
(f)  why  loyalty  to  a  diaspora  remains  more  alive 
when  immigrants  are  spread  within  weak  state 
societies  in  which  immigrants  are  able  to  keep  their 
identity  and  are  not  required  to  assimilate  into  a 
homogeneous  whole. 

Nevertheless,  historical  sociology  must  avoid 
the  pitfalls  of  relativism  and  culturalism.  By  outlin¬ 
ing  the  impact  of  different  cultures  or  types  of 
states  as  a  crucial  factor,  this  comparative  perspec¬ 
tive  must  be  careful  not  to  invent  a  new  kind  of 
determinism  conditioning  each  specific  history.  It 
must  only  try  to  discover  regularities,  collective 
behaviors,  and  processes  initiated  by  various  par¬ 
ticipants  but  occurring  similarly  even  within  differ¬ 
ent  historical  contexts. 

Tocqueville  and  Weber  used  essentially  histori¬ 
cal  secondary  works  to  ask  those  comparative 
sociological  questions.  Today,  as  “sociology  meets 
history,”  to  use  Tilly’s  famous  phrase,  social  sci¬ 
entists  must  become  increasingly  aware  of  the 
necessity  to  build  their  work  within  a  long-term 
historical  framework.  They  themselves  must  look 
at  various  kinds  of  historical  data,  through  which 
they  can  ask  sociological  questions.  If,  in  turn,  his¬ 
torians  are  to  be  trained  as  sociologists,  in  the  spirit 
of  the  French  Annales  School,  the  two  fields  will 
meet  more  and  more  frequently,  with  historians 
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recently  starting  to  study  the  transformation  of 
contemporary  societies. 

Pierre  Birnbaum 
Universite  Paris  1 
Paris,  France 
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History  of  Political  Science 


The  idea  of  politics  as  a  subject  of  science  is  as  old 
as  Aristotle,  but,  as  Bernard  Crick  stressed,  politi¬ 
cal  science  as  a  distinct  academic  discipline  and 
branch  of  social  science  originated  as  a  uniquely 
American  invention.  Although  there  were,  in 
many  respects,  functionally  equivalent  studies  of 
politics  in  other  countries,  the  history  of  political 
science,  from  the  mid- 19th  century  to  the  mid- 
20th  century,  was  primarily  a  story  of  the 


“American  science  of  politics.”  Notwithstanding 
its  universal  scientific  aspirations,  its  emigration, 
and  export  to  other  countries,  especially  subse¬ 
quent  to  World  War  II,  and  the  waves  of  foreign 
influence  that  have  at  times  significantly  contrib¬ 
uted  to  shaping  the  field,  political  science  has 
borne  a  unique  relationship  to  American  political 
life  and  to  American  democratic  ideology. 
Although  the  history  of  the  discipline  could  be 
written  from  many  perspectives,  an  important 
dimension  of  that  history  is  the  democratic  narra¬ 
tive.  The  field  has  always  been  committed  to  cre¬ 
ating  a  truly  scientific  study  of  politics,  but, 
despite  changing  images  of  science,  there  has  been 
a  consistent  search  for  a  discipline  that  would 
contribute  to  realizing  and  enhancing  democratic 
values  and  institutions.  In  this  respect,  as  well  as 
with  regard  to  matters  of  methodology,  the 
genetic  imprint  of  the  American  form  has  remained 
manifest  in  the  extended  speciation  that  now  char¬ 
acterizes  so  many  other  countries  within  which 
the  field  has  taken  root  and  evolved. 

There  has,  however,  been  a  constant  ambiva¬ 
lence  about  the  discipline’s  practical  relationship 
to  politics,  and  it  has  often  been  suggested  that  the 
simultaneous  commitments  to  science  and  democ¬ 
racy  have  not  always  been  in  harmony.  Although 
this  tension  has  in  part  involved  the  problem  of 
reconciling  scientific  and  political  criteria  of  judg¬ 
ment,  it  has  also  been  the  consequence  of  a  long¬ 
standing  assumption  that  only  by  remaining  aloof 
from  politics  and  establishing  its  claim  to  scientific 
objectivity  could  the  discipline  gain  the  cognitive 
authority  that  would  facilitate  practical  purchase. 
Such  distance  was  viewed  as  necessary  in  part 
because  many,  particularly  in  the  late  19th  cen¬ 
tury,  who  attempted  to  speak  politically  from  the 
podium  discovered  that  politics  was  often  a  dan¬ 
gerous  object  of  inquiry,  particularly  when  inquiry 
involved  advocacy.  Consequently,  it  is  not  surpris¬ 
ing  that  some  have  suggested  that  the  discipline 
has  at  times  become  alienated  from  the  realities  of 
political  life,  but  self-consciousness  about  its  rela¬ 
tionship  to  politics  has  significantly  informed 
political  science’s  successive  crises  of  intellectual 
identity.  Despite  sometimes  contradictory  claims 
about  the  extent  to  which  claims  about  politics 
and  government  produced  by  political  science  have 
influenced  political  ideas  and  behavior,  the  images 
produced  by  the  discipline  have,  in  various  ways, 
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such  as  through  diverse  levels  of  pedagogy  and 
through  their  influence  on  a  variety  of  media,  been 
reflected  in  the  practices  of  citizens  and  political 
actors. 

There  was,  from  the  point  of  the  formation  of 
the  American  Republic,  a  theoretical  paradox  that 
has  been  the  central  axis  in  discussions  of  popular 
government.  This  paradox,  which  was  bequeathed 
to  the  field  of  political  science,  emerged  with 
respect  to  validating  American  democracy — and 
validating  America  as  a  democracy.  While  it  was 
assumed  that  a  republican  or  democratic  regime 
was  predicated  on  the  existence  of  an  intelligible 
and  autonomous  people,  it  was,  at  the  same  time, 
difficult,  after  the  American  Revolution,  to  iden¬ 
tify  any  such  entity.  This  search  for  a  “people,” 
and  for  democracy,  was,  and  has  been,  through  the 
end  of  the  20th  century,  confronted,  and  con¬ 
ducted,  in  two  distinct  ways.  One  approach  has 
been  to  argue  that,  despite  great  social  diversity, 
there  is  an  American  people  that  has  been  the 
author  and  subject  of  democratic  government.  The 
other  approach  has  been  to  argue  that  the  exis¬ 
tence  of  such  a  national  community  is  not  neces¬ 
sary  to  achieve  the  ends  of  popular  sovereignty. 
One  persistent  aspect  of  the  democratic  vision  in 
America,  represented  in  both  of  these  approaches, 
has,  however,  been  its  accommodational  charac¬ 
ter.  The  tendency  has  been  to  adapt  the  concept  of 
democracy  to  changes  in  the  perceived  realities  of 
American  politics. 

This  paradox  of  democracy  was  first  exempli¬ 
fied  in  the  Federalist  Papers,  which  were  devoted 
to  a  defense  of  the  proposed  1787  Constitution. 
While  the  authors  maintained  that  the  Constitution, 
manifesting  the  accrued  wisdom  of  Western  politi¬ 
cal  thought  regarding  a  science  of  politics,  created 
a  popular  government  that  was  republican,  or 
representative,  rather  than  purely  democratic,  they 
had  difficulty  clarifying  and  defending  their  con¬ 
tinued  allegiance  to  the  basic  idea  of  popular  sov¬ 
ereignty.  The  concept  of  a  people  that  had  been  at 
the  core  of  revolutionary  ideology,  as  well  as 
essential  to  the  arguments  of  certain  of  the  anti- 
Federalist  criticisms  of  the  new  document,  seemed 
to  have  an  anomalous  ring  when  juxtaposed  to  the 
images  of  American  politics  advanced  by  James 
Madison  and  Alexander  Hamilton.  The  Federalists 
and  anti-Federalists  shared  the  worry  that  there 
was  not  an  identifiable  American  people,  which 


transcended  the  smaller  communities  of  the  vari¬ 
ous  states.  From  one  perspective,  the  genius  of  the 
authors  of  the  Federalist  was  to  invent  the  very 
idea  of  a  unified  people  that  encompassed  more 
local  constituencies  and  that  was  to  be  represented 
in  and  by  the  new  national  government.  To  the 
extent,  however,  that  the  oft-mentioned  “people” 
had  a  concrete  meaning  for  Madison,  as  voiced  in 
Federalist  No.  10,  it  seemed,  in  the  end,  to  refer 
either  to  the  sum  of  self-interested  individuals  or 
to  diverse  and  divisive  factions  that  were  charac¬ 
terized  by  their  attachment  to  their  own  rather 
than  a  public  good.  In  place  of  the  traditional 
republican  notion  of  an  organic  people,  Madison 
conceived  of  a  virtual  people  that  would  arise  out 
of  an  institutional  and  social  balance  of  conflict¬ 
ing  interests.  He  argued  that  the  disease  of  repub¬ 
lican,  and  now  American,  government  was 
factionalism  but  that  it  could  be  transformed  into 
its  own  cure  through  an  intricate  constitutional 
design  combined  with  fortuitous  demographic  and 
geographical  circumstances.  Political  discourse 
and  commentary,  however,  kept  alive  the  civic 
republican  image  of  a  people  capable  of,  and  the 
subject  of,  popular  government,  which  lay  beneath 
the  surface  of  American  diversity.  The  origins  of 
political  science  were  closely  involved  with  vouch¬ 
safing  that  image. 

The  Science  of  the  Democratic  State 

During  the  19th  century,  academic  publicists  pro¬ 
duced  their  own  version  of  the  people,  which  was 
represented  in  the  concept  of  the  “state.”  While 
today  many  tend  to  look  back  on  this  concept  as 
an  archaic  formalistic  and  legalistic  artifact  or  as 
an  intellectual  reflection  of  American  state  build¬ 
ing,  it  was  in  fact  the  nucleus  of  a  theory  of 
American  democracy.  Apart  from  a  reference  to 
the  American  states,  the  word  state  had,  by  the 
beginning  of  the  1 9th  century,  little  currency  in  the 
language  of  American  politics.  The  introduction  of 
the  concept  of  the  state  was  largely  through  the 
work  of  the  German  emigre  Francis  Lieber,  begin¬ 
ning  about  the  time  that  his  acquaintance,  Alex  de 
Tocqueville,  visited  America.  Tocqueville  had 
noted  that  the  new  world  of  American  democracy 
demanded  a  “new  science  of  politics,”  and  Lieber 
can  reasonably  be  designated  the  founder  of 
American  political  science.  There  was  already  a 
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nascent  program  of  civic  education  within  the  tra¬ 
ditional  American  college  and  university  curricu¬ 
lum  of  moral  philosophy,  which  was  devoted  to 
practical  ethics  and  to  educating  the  American  elite 
in  the  principles  and  duties  of  public  life.  Lieber 
focused  on  expanding  this  field  of  study  by  inte¬ 
grating  German,  largely  Kantian  and  Hegelian, 
philosophy  and  images  of  world  history  ( Manual 
of  Political  Ethics ,  1839).  He  applied  that  philoso¬ 
phy  to  the  circumstances  and  traditions  of  the 
United  States  and  to  devising  a  solution  to  the 
perennial  democratic  paradox.  His  adaptation  of 
the  German  philosophy  of  the  state  (Civil  Liberty 
and  Self-Government,  1853)  paralleled  the  work 
of  individuals  such  as  the  German  theorist  Johann 
K.  Bluntschli  ( The  Theory  of  the  State,  1895),  and, 
for  nearly  a  century,  it  provided  the  intellectual, 
institutional,  and  professional  foundation  of  aca¬ 
demic  political  inquiry  in  the  United  States. 

After  a  period  in  South  Carolina,  Lieber,  who 
was  a  strong  supporter  of  the  Union  cause,  was 
appointed,  in  1857,  as  the  first  professor  of  politi¬ 
cal  science  at  Columbia  College  in  New  York.  His 
work  was  perpetuated  and  refined  by  second- 
generation  theorists  such  as  Theodore  Woolsey  at 
Yale,  Herbert  B.  Adams  at  Johns  Hopkins,  and, 
above  all,  John  W.  Burgess,  who  was  Lieber’s  suc¬ 
cessor  at  Columbia.  What  Lieber,  and  the  later 
American  state  theorists,  who  were  educated 
abroad  and  imbibed  the  German  paradigm  of 
Staatswissenschaft  (in  German,  “political  sci¬ 
ence”),  created  was  the  image  of  a  democratic 
people  as  well  as  a  history  of  democratic  institu¬ 
tions  that  sprung  from  ancient  Teutonic  origins, 
passed  through  English  government,  and  culmi¬ 
nated  in  the  American  polity.  Although  Americans 
had  at  first  been  wary  of  the  word  democracy,  it 
had,  by  the  middle  of  the  19th  century,  been 
largely  divested  of  its  radical  overtones  and  become 
a  general  term  of  approbation  in  the  United  States 
as  well  as  in  many  places  abroad.  Unlike  some  of 
his  European  counterparts  and  correspondents, 
such  as  Edouard  Laboulaye  in  France,  as  well  as 
the  American  historian  George  Bancroft,  who  all 
commented  extensively  on  American  political  soci¬ 
ety  and  contributed  significantly  to  the  19th- 
century  democratic  narrative,  Lieber,  still  feared 
“democratic  absolutism”  and,  like  Tocqueville, 
majoritarian  rule.  He  inveighed  against  ideas  such 
as  women’s  suffrage  and  tended  to  eschew  the 


word  democracy  in  favor  of  phrases  such  as  self- 
government  and  hamarchy  by  which  he  basically 
meant  representative  institutions.  His  vision  of  the 
state,  however,  was  essentially  that  of  an  associa- 
tionally  and  institutionally  diverse  but  organic 
people  and  its  pedigree  that  gave  theoretical  sub¬ 
stance  to  the  idea  of  democracy. 

Although  the  “state  talk”  of  19th-century  polit¬ 
ical  inquiry,  as  well  as  that  of  public  intellectuals 
such  as  Orestes  Brownson  and  Elisha  Mulford, 
paralleled  the  discourse  of  democracy  in  political 
life,  it  remained,  like  many  later  constructions  of 
political  science,  far  removed  from  the  language  of 
politics  in  the  United  States.  The  most  essential 
feature  of  the  concept  of  the  state  during  this  long 
and  formative  period  in  the  evolution  of  American 
political  science  was  that  it  did  not  refer  either  to 
forms  of  government  or  to  the  institutions  of  gov¬ 
ernment  but  rather  to  a  community  whose  voice 
expressed  a  will  and  interest  that  was  expressed 
through  the  agency  of  government  but  preceded,  in 
both  time  and  authority,  both  the  constitution  and 
the  government.  This  vision  often  reflected  and 
abetted  the  conservative  ideology  of  theorists  such 
as  Burgess,  who  wished  to  propagate  and  justify 
limited  government  as  well  as  to  curtail  democratic 
populism  while  maintaining  the  ethic  of  popular 
sovereignty,  and  it  was  in  some  ways  both  inspired 
by,  and  functioned  to  legitimate,  the  cause  of  the 
Union  before  and  after  the  Civil  War.  But  it  was 
also  embraced  by  social  scientists  on  the  political 
left,  such  as  the  economist  Richard  Ely,  who  per¬ 
ceived  the  state  as  authorizing  government  inter¬ 
vention  in  social  life.  The  theory  of  the  state  pro¬ 
vided  a  scientific  identity  for  the  discipline  and 
sublimity  for  its  subject  matter,  but,  above  all,  it 
offered  a  distinct  answer  to  the  congenital  paradox 
of  American  democratic  theory,  and  it  was  an 
answer  that  extended  well  into  the  Progressive  Era 
after  the  turn  of  the  century.  The  19th-century 
American  state  theorists,  despite  their  attachment 
to  a  certain  conception  of  individual  natural  rights, 
all  rejected  contract  theory  and  its  implications. 
They  claimed  that  there  was,  from  the  outset,  a 
primordial  people  who  both  authorized  and  placed 
limitations  on  government.  Burgess  ( Political 
Science  and  Comparative  Constitutional  Law, 
1891)  validated  this  picture  with  a  singular  and 
ingenious  account  of  American  history.  He  argued 
that  modern  states,  and  particularly  the  United 
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States,  were  the  prime  examples  of  those  founded 
on  a  national  unity  and  that  they  represented  a  self- 
conscious  democracy  that  was  the  apex  of  political 
history.  Nations,  as  ethical  and  geographical  units, 
he  claimed,  tended,  at  least  in  the  West,  to  become 
states — that  is,  a  people  with  a  government — and 
the  highest  examples  of  the  latter  were  those  that 
had  achieved  the  popular  or  democratic  form. 

Burgess’s  interdisciplinary  School  of  Political 
Science  at  Columbia  (1880)  represented  an  attempt 
to  replicate  the  form  of  scientific  education  that 
characterized  German  universities,  and  his  aim 
was  both  to  educate  an  American  administrative 
and  political  elite  and  to  influence  government 
policy.  The  imprint  of  this  curriculum  is  still  evi¬ 
dent  in  contemporary  political  science  programs, 
and  Columbia  produced  the  first  professional  jour¬ 
nal  of  political  science,  Political  Science  Quarterly 
(1886),  which  was  devoted  to  the  assumption  that 
the  “domain  of  political  science”  was  the  study  of 
the  state  and  that  among  the  social  sciences  con¬ 
cerned  with  this  subject,  political  science  occupied 
the  “dominant  position.”  Similar  institutional 
developments  took  place  under  Adams  at  Hopkins, 
which  published  Studies  in  History  and  Political 
Science  and  which  was  the  site  of  the  first  profes¬ 
sional  Political  Science  Association,  and  in  Europe, 
there  was  the  beginning  of  institutions  such  as 
Ecole  Libre  des  Sciences  Politiques  in  France  and 
the  London  School  of  Economics  in  England.  By 
this  point,  political  scientists  were  still  not  always 
clearly  distinguished,  either  intellectually  or  pro¬ 
fessionally,  from  historians  and  economists,  but  it 
was  primarily  the  theory  of  the  state  that  bound 
them  together.  The  third  generation  of  political 
scientists,  which  included  Bernard  Moses  at 
Berkeley,  Woodrow  Wilson  at  Hopkins  and 
Princeton,  and  Westel  W.  Willoughby  at  Hopkins, 
did  much  to  institutionalize  further  the  field  of 
political  science  in  American  universities  during 
the  last  years  of  the  19th  century  and  early  years  of 
the  20th  century,  but  by  the  last  decade  of  the  19th 
century,  a  significant  theoretical  transformation 
had  begun  to  take  place.  Theorists  such  as  Wilson 
( The  State ,  1889),  who  established  the  department 
of  politics  at  Princeton,  continued  in  many  ways  to 
affirm  some  aspects  of  the  traditional  theory  of  the 
state,  but,  in  part  as  a  consequence  of  urging  a 
more  active  administration,  they  began  to  blur  the 
line  between  state  and  government.  The  problem, 


in  a  country  of  great  and  increasing  complexity 
and  multiplicity,  was  to  specify  the  locus  of  the 
invisible  community  that  putatively  constituted  the 
American  people,  and  eventually,  no  one  did  more 
than  Willoughby  ( An  Examination  of  the  Nature 
of  the  State ,  1896)  to  empty  the  word  state  of  its 
original  theoretical  meaning  and  transform  it  into 
an  analytical  or  juristic  category  and  synonym  for 
government.  This,  however,  precipitated  a  crisis  in 
democratic  theory. 

Although  it  has  often  been  assumed  that  there 
was  a  fundamental  break  between  the  state  theory 
of  the  19th  century  and  the  conceptions  of  both 
political  inquiry  and  politics  embraced  by  early-20th- 
century  political  scientists,  the  continuities  in  many 
respects  exceeded  the  innovations.  One  might  very 
well  ask  how  the  largely  conservative  academic  cul¬ 
ture  that  dominated  19th-century  universities,  such 
as  Columbia,  produced  progressive,  reform-minded 
scholars,  such  as  the  historian  and  political  scientist 
Charles  Beard  and,  particularly,  Charles  Merriam, 
who  might  well  be  considered  the  father  of  20th- 
century  political  science  and  who  contributed  so 
significantly  to  transforming  the  discipline.  In  addi¬ 
tion  to  retaining  commitments  both  to  the  idea  of 
scientific  inquiry  and  to  its  application  to  practical 
ends,  one  thread  of  continuity  was  a  persistent 
belief  in,  and  dedication  to,  the  national  state  as 
encompassing  both  government  and  community. 
During  the  early  part  of  the  20th  century, 
Progressive  politics  as  well  as  political  and  social 
thought  continued  to  be  predicated  on  the  belief, 
such  as  in  the  case  of  sociologist  Charles  Horton 
Cooley  (Social  Organization:  A  Study  of  the 
Larger  Mind,  1909),  that  there  was  at  least  an 
incipient  national  political  community  or,  like 
Herbert  Croly  (The  Promise  of  American  Life, 
1910),  that  such  a  community  could  be  created 
and  mobilized  and  in  whose  name  government 
could  legitimately  and  authoritatively  act.  It  was, 
however,  eventually  out  of  the  ruins  of  both  tradi¬ 
tional  state  theory  and  the  Progressive  vision  that 
a  new  account  of  democratic  government  in 
America  emerged.  The  decline  of  the  state  as  a 
theory  of  democracy  was  paralleled  by  the  begin¬ 
ning  and  evolution  of  the  theory  of  democratic 
pluralism  and  by  the  account  of  science  and  forms 
of  research  that  the  latter  entailed. 

During  the  last  years  of  the  19th  century,  pro¬ 
fessional  social  science  associations  began  to  break 
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away  from  the  umbrella  of  the  American  Social 
Science  Association,  which  had  its  roots  in  a  vari¬ 
ety  of  reformist  causes,  such  as  the  abolition  of 
slavery,  and  to  affiliate  more  directly  with  aca¬ 
demic  institutions,  under  the  assumption  that  this 
would  provide  scientific  authenticity  and  author¬ 
ity.  The  American  Political  Science  Association 
(APSA),  with  the  leadership  of  individuals  such  as 
Willoughby  and  its  first  president,  Frank  Goodnow, 
was  formed  in  1903,  when  it  broke  away  from  its 
affiliation  with  the  more  conservative  American 
Ffistorical  Association,  and  the  American  Political 
Science  Review  began  publication  in  1906.  The 
practical  concerns  of  the  previous  generation  were 
perpetuated  in  the  creation  of  this  organization, 
but  it  represented  an  emerging  progressive  ideol¬ 
ogy  as  well  as  a  commitment  to  endow  the  disci¬ 
pline  with  yet  greater  scientific  authority  by 
embracing  what  was  viewed  as  the  methods  of 
modern  empirical  science.  For  individuals  such  as 
Wilson  and  Goodnow,  who  were  dedicated  to 
more  efficient  and  effective  government,  the  goals 
were  still  ultimately  practical.  There  was,  however, 
something  of  a  theoretical  hiatus  regarding  both 
democracy  and  the  nature  of  politics  as  the  origi¬ 
nal  concept  of  the  state  continued  to  wane,  and,  at 
the  same  time,  departments  of  political  science  and 
government  continued  to  emerge  at  major  univer¬ 
sities  such  as  the  University  of  California,  Berkeley; 
University  of  Illinois;  University  of  Wisconsin; 
Harvard  University;  and  Stanford  University. 

Pluralism  and  the  Liberal  Science  of  Politics 

The  demise  of  the  theory  of  the  state  was  in  part  a 
reaction,  in  the  context  of  World  War  I,  to  its 
German,  and  now  allegedly  authoritarian,  origins, 
but  it  was  also  a  consequence  of  the  dimming 
Progressive  hope  of  awakening  or  creating  a 
democratic  public,  which  could  rise  up  and  take 
power  back  from  corrupt  politicians  and  a  capital¬ 
istic  economic  hegemony.  Social  scientists,  in  the 
wake  of  immigration  and  growing  cultural  and 
class  differences,  became  overwhelmed  with  evi¬ 
dence  of  social  and  economic  diversity  and  con¬ 
tentiousness.  There  was  an  increased  sense  that 
there  was  no  homogeneous  American  public  but 
rather  only  complex  congeries  of  interests  and 
groups  that  exceeded  even  Madison’s  account.  In 
1907,  Harvard  historian  Albert  Bushnell  Hart, 


after  presenting  an  exhaustive  account  of  dis¬ 
carded  theories  of  popular  government  in  the 
United  States,  noted  that  even  though  the  idea  of 
the  state  as  the  basis  of  a  theory  of  popular  sover¬ 
eignty  seemed  to  still  hold  sway,  it  really  did  not 
fit  the  present  circumstances  of  American  politics. 
Although  he  expressed  faith  that  America  was  a 
democracy,  he  could  no  longer,  any  more  than 
most  of  his  contemporaries,  account  for  it  theo¬ 
retically.  To  provide  such  an  account  was  the  task 
of  pluralist  theory  as  it  evolved  during  the  first 
third  of  the  20th  century.  Individuals  such  as 
Lawrence  Lowell  ( Public  Opinion  and  Popular 
Government,  1913),  and,  later,  Walter  Lippmann 
(The  Phantom  Public,  1925)  questioned  the  exis¬ 
tence  of  an  actual  public  or  even  the  reality  of  a 
public  opinion  that  commentators  such  as  James 
Bryce  (The  American  Commonwealth,  1890)  had 
emphasized  as  constituting  the  heart  of  demo¬ 
cratic  society  in  America. 

Despite  the  publication  of  William  James’s 
Pluralistic  Universe  (1904),  the  term  pluralism  had 
not,  at  this  point,  in  any  substantial  manner, 
entered  the  discourse  of  American  political  science. 
Although  Arthur  Bentley’s  pointed  critique  of  the 
concept  of  the  state  and  his  analysis  of  interest 
groups  as  the  essence  of  politics  (Process  of 
Government,  1908)  would  become  a  central  refer¬ 
ence  for  later  pluralist  theory,  it  had  very  little 
immediate  impact,  and  Bentley  never  employed  the 
term  pluralism.  It  was  during  Harold  Laski’s  brief 
sojourn  in  the  United  States,  subsequent  to  World 
War  I,  that  the  term  was  introduced  into  the  con¬ 
versation  of  political  science,  as  part  of  his  attack 
on  the  idea  of  state  sovereignty  and  centralized 
authority  and  his  propagation  of  the  notion  that 
the  state  was  merely  one  association  among  many 
in  society.  Laski’s  principal  concern,  as  in  the  case 
of  Tocqueville,  was  his  own  country,  but  he,  as 
well  as  other  English  theorists  such  as  Ernest 
Barker  and  A.  D.  Lindsay,  helped  instigate  a  debate 
about  pluralism  that  focused  on  whether  political 
reality  consisted  of  anything  more  than  an  endless 
process  of  group  interaction  with  the  government 
functioning  as  an  arbiter  and  whether  this  could 
add  up,  empirically  and  theoretically,  to  democ¬ 
racy.  It  was  difficult,  however,  for  American  politi¬ 
cal  scientists  to  give  up  the  idea  that  the  state  was 
nothing  more  than  government  and  that  govern¬ 
ment  was  not  the  agent  of  a  general  popular  will. 
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Merriam  embraced  certain  democratic  values 
associated  with  cultural  diversity  and  political 
pluralism,  but  he  was  equally  impressed  with  the 
divisiveness  inherent  in  such  difference  and  with 
the  antidemocratic  sentiments  and  practices  of 
certain  groups.  He  retained  the  assumption  that 
democracy  ultimately  required  unity,  even  if,  in 
his  view  and  that  of  his  student  Harold  Lasswell, 
it  was  necessary  to  introduce  it  from  the  top  down 
through  social  control,  civic  education,  and  even 
the  judicious  use  of  propaganda.  They  transferred 
their  hopes  for  a  democratic  society  to  the  actions 
of  governmental  elites  informed  by  social-scientific 
knowledge,  but  no  articulate  image  of  American 
democracy  and  the  American  political  system 
appeared,  for  example,  in  Merriam’s  principal 
work  of  this  period  (e.g.,  New  Aspects  of  Politics, 
1925),  even  though  he  sponsored  much  of  the 
research  and  modes  of  inquiry  that  seemed  appro¬ 
priate  for  a  changing  image  of  politics. 

The  strongest  riposte  to  the  normative  theory 
of  pluralism  associated  with  Laski  and  other  writ¬ 
ers  of  the  period,  as  well  as  to  empirical  political 
scientists  and  sociologists  whose  work  increas¬ 
ingly  lent  support  to  the  notion  that  politics  was 
irreducibly  pluralistic,  was  the  work  of  William 
Yandell  Elliott  (The  Pragmatic  Revolt  in  Politics, 
1928).  He  spoke  for  many  of  his  generation  when 
he  argued  that  to  give  up  the  concept  of  the  state 
as  an  organic  public  was,  in  effect,  to  give  up 
democracy  as  well  as  the  autonomy  of  political 
theory  and  political  science.  Elliott  did  not  reject 
the  reality  of  pluralism,  but  he  believed  that  it 
tended  to  undermine  the  communal  basis  of 
democracy.  He  argued  that  in  an  age  dominated 
by  empiricism  and  pragmatism,  as  well  as  by  the 
threat  of  foreign  doctrines  such  as  fascism  and 
communism,  it  was  still  possible  to  perceive,  and 
suggested  that  it  was  at  least  necessary  to  believe 
in,  what  he  called  a  “co-organic”  community  in 
American  political  life,  which  was  the  basis  of 
constitutional  government.  A  somewhat  similar, 
but  ideologically  different,  position  had  been 
advanced  by  Mary  Parker  Follett  (The  New  State, 
191 8).  It  was  difficult,  even  for  someone  such  as 
John  Dewey  (The  Public  and  Its  Problems,  1927), 
who  along  with  Laski  was  perceived  by  Elliott  as 
a  purveyor  of  relativism  and  its  destructive  impli¬ 
cations  for  democracy,  to  sever  the  idea  of  popu¬ 
lar  government  from  the  existence  of  a  national 


community  that  transcended  the  complexity  of 
modern  “great  society.” 

By  the  end  of  the  1920s,  however,  the  concept 
of  pluralism  had  become  Americanized  and  formed 
the  basis  of  an  empirical  account  of  American 
politics,  a  normative  image  of  democratic  practice, 
and  closely  connected  to  what  G.  E.  G.  Catlin,  a 
transplanted  British  scholar  who  championed  both 
pluralism  and  the  work  of  Merriam,  referred  to  as 
The  Science  and  Method  of  Politics  (1927).  For  the 
first  time  since  Madison,  a  description  of  social 
diversity  and  conflict  and  of  group  pressures  on 
government  was  transformed  into  a  theory  of 
popular  government  that  would  provide  much  of 
the  content  of  a  new  and  widely  embraced  image 
of  democratic  identity.  The  group  theory  of  politi¬ 
cal  reality  became  deeply  entrenched  in  political 
science  as  it  evolved  into  an  argument  about  how 
the  process  of  interest-group  politics  constituted  a 
form  of  both  democratic  interaction  and  represen¬ 
tation.  This  had  been  implied  by  the  early  research 
of  individuals  such  as  Pendelton  Herring  (Group 
Representation  Before  Congress,  1927),  but,  dur¬ 
ing  the  late  1920s  and  early  1930s,  a  number  of 
individuals  such  as  Peter  Odegard  (The  American 
Public  Mind,  1930)  and  John  Dickinson  (“Demo¬ 
cratic  Realities  and  Democratic  Dogma,”  American 
Political  Science  Review,  1930)  elaborated  a  plu¬ 
ralist  theory  of  democracy  that  contained  all  the 
essential  theoretical  elements  that  would  be  reart¬ 
iculated  and  reconstructed  a  generation  later  in  the 
work  of  individuals  such  as  David  Truman  and 
Robert  Dahl. 

At  the  core  of  this  theory  was  the  claim  that  all 
societies  consisted  of  groups  seeking  their  self- 
interest  and  that  this,  at  any  stage  of  social  evolu¬ 
tion,  required  mechanisms  for  compromise  and 
adjustment.  In  the  context  of  modern  society,  such 
adjustment  was  achieved  through  the  medium  of 
government  that  functioned  as  an  umpire  acting 
pragmatically  in  response  to  the  needs  of  the  situ¬ 
ation  and  with  respect  to  matters  of  intervention 
and  control.  It  was  through  participation  in  groups 
that  individuals  realized  their  goals  and  achieved 
identity,  and  it  was  through  groups  gaining  access 
to  influence,  more  than  through  formal  institu¬ 
tions,  that  democratic  representation  was  most 
essentially  effected.  Stability  in  society  was  achieved 
through  a  balance  of  conflicting  social  pressures 
constrained  by  appropriate  enabling  institutions 
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and  a  basic  consensus  on  the  rules  of  the  game. 
Majoritarian  democracy  was  viewed  as  a  myth 
that  belied  the  fact  that  majorities  were  little  more 
than  aggregations  of  individual  preference  that 
were  democratic  only  in  the  sense  that  they  had  the 
capacity,  through  elections,  to  effect  a  circulation 
of  elites. 

From  the  1920s  to  the  1940s,  political  science 
continued  to  be  institutionalized  and  expand  as  a 
part  of  higher  education  in  the  United  States,  and 
during  this  period,  the  membership  of  the  APSA 
tripled.  The  work  of  Merriam  and  Lasswell,  at  the 
University  of  Chicago,  represented  the  most  impor¬ 
tant  development,  but  early  forms  of  political  sci¬ 
ence  were  emerging  in  England,  France,  and 
Germany.  During  the  later  part  of  the  1930s,  there 
was  little  in  the  way  of  a  further  explicit  statement 
or  elaboration  of  pluralist  theory,  but  it  became,  in 
both  politics  and  the  academy,  the  basis  of  an 
account  of  the  United  States  as  a  democratic  soci¬ 
ety,  and  it  was  advanced  as  distinguishing  the 
American  polity  from  the  growing  number  of 
totalitarian  regimes  that  seemed  to  be  character¬ 
ized  by  excessive  unity.  The  name  for  this  new 
democratic  identity  was  liberalism,  and  the  man¬ 
ner  in  which  pluralism  was  transfigured  as  liberal¬ 
ism  is  a  crucial  chapter  in  the  story  of  the  evolution 
of  democratic  theory  in  American  political  science. 

Although  common  in  Europe,  the  term  liberal¬ 
ism  had  seldom  been  systematically  invoked  in 
either  American  politics  or  political  science  before 
the  1930s.  While  politicians  such  as  Woodrow 
Wilson,  and  later  Franklin  Roosevelt,  began  tenta¬ 
tively  to  court  this  label  for  a  variety  of  policy 
initiatives,  everyone  eventually  seemed  anxious  to 
adopt  this  synonym  for  democracy.  A  variety  of 
individuals,  including  Herbert  Hoover,  claimed  to 
be  the  “true”  liberal,  but  Roosevelt  won  the  title, 
and  his  opponents  eventually  accepted  the  name  he 
had  originally  pejoratively  bestowed  on  them — 
Conservatives.  The  term  liberalism  gravitated  into 
the  language  of  political  science,  often  via  those 
such  as  Dewey  who  were  sympathetic  to  the  New 
Deal,  but  eventually  political  theorists,  such  as 
George  Sabine  in  his  paradigmatic  A  History  of 
Political  Theory  (1937)  emptied  the  concept  of  its 
concrete  political  meaning  and  began  writing  the 
history  of  Western  political  thought  and  institutions 
as  a  story  of  the  progress  of  liberalism  that,  despite 
the  temporary  aberrations  of  totalitarianism,  found 


full  expression  in  American  society.  Although  in  the 
work  of  Dewey,  as  well  as  a  number  of  political 
scientists,  there  continued  to  be  a  certain  correspon¬ 
dence  between  the  academic  and  lay  visions  of  lib¬ 
eralism,  two  quite  distinct  traditions  of  discourse 
began  to  evolve  as  liberalism,  in  the  language  of 
political  science,  was  reified,  provided  with  a  phi¬ 
losophy  and  history,  and  reimposed  as  a  description 
of  American  politics.  To  the  extent  that  liberalism 
had  a  definite  conceptual  meaning  in  the  literature 
of  political  science  and  political  theory,  other  than 
a  name  for  American  government  and  society,  it 
tended  to  stand  for  pluralism  and  attendant  values 
such  as  individualism,  social  freedom  and  differ¬ 
ence,  bargaining,  and  compromise.  Philosophers, 
such  as  T.  V.  Smith  at  Chicago  (The  Promise  of 
America,  1936)  took  the  position  that  what  charac¬ 
terized  democracy  was  less  any  absolute  doctrine 
and  regime  than  a  commitment  to  toleration  and 
the  propagation  of  diversity  within  a  procedural 
framework  for  settling  conflicts. 

By  the  early  1940s,  on  the  eve  of  World  War  II, 
the  basic  elements  of  this  vision  were  extracted  from 
the  research  of  mainstream  political  science,  system¬ 
atized  by  individuals  such  as  Herring,  and  presented 
as  the  Politics  of  Democracy  (1940).  Herring  saw 
his  task  as  taking  all  that  was  often  considered  bad 
about  politics — from  pressure  groups  to  bosses  and 
soft  money — and  demonstrating  that  they  were  all, 
if  understood  scientifically,  part  of  a  democratic 
process.  One  reason  for  the  rearticulation  of  plural¬ 
ism  qua  liberalism  was  to  provide  a  response  and 
counterideal  to  the  doctrines  of  totalitarianism.  For 
Lasswell,  political  science  continued  to  be  part  of 
“The  Developing  Science  of  Democracy”  ( The 
Future  of  Government  in  the  United  States,  edited 
by  Leonard  D.  White,  1942). 

The  Behavioral  Era  and  the  Reconstitution 
of  Science  and  Democracy 

Just  as  the  discipline  was  expanding  in  the  United 
States  after  World  War  II,  it  was  proliferating 
abroad.  In  1949,  the  International  Political  Science 
Association  (IPSA)  was  formed  in  Paris,  which 
brought  together  national  associations  from 
numerous  countries  in  Europe  and  elsewhere.  The 
emerging  national  forms  of  the  discipline  were  in 
many  respects  increasingly  responsive  to  and 
reflective  of  the  social  and  cultural  milieu  in  which 
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they  were  situated,  even  though  the  American 
commitment  to  empirical  and  quantitative  studies 
was  widely  accepted  and  promulgated  along  with 
more  traditional  historical  and  institutional  forms 
of  research.  In  the  United  States,  David  Easton’s 
The  Political  System:  An  Inquiry  Into  the  State  of 
Political  Science  (1953)  set  the  agenda  for  the  next 
decade  by  defining  political  science  as  the  study  of 
the  “authoritative  allocation  of  values”  and  mak¬ 
ing  the  case  for  moving  beyond  mere  factual 
research  as  well  as  traditional  historical  and  institu¬ 
tional  forms  by  advancing  empirical  theory  and 
adopting  the  methods  of  natural  science.  The 
behavioral  “revolution,”  for  which  Easton  was 
often  the  principal  spokesperson,  significantly  trans¬ 
formed  the  practice  of  political  science  and  increased 
in  large  measure  the  substantive  and  methodologi¬ 
cal  contributions  on  a  variety  of  subjects,  including 
survey  research  and  voting  behavior.  It  was,  how¬ 
ever,  in  many  respects,  less  a  revolution  than  a 
recommitment  to  the  visions  of  both  the  scientific 
study  of  politics  and  liberal  democracy  that  had 
informed  the  discipline  for  nearly  half  a  century, 
and  it  was  also  in  part  a  response  to  the  first  sig¬ 
nificant  challenge  to  those  visions. 

By  the  1950s,  the  academic  image  of  liberalism 
had  become  increasingly  dominant  as  individuals 
such  as  Daniel  Boorstin  ( The  Genius  of  American 
Politics ,  1953)  and  Louis  Elartz  (The  Liberal  Tradi¬ 
tion  in  America ,  1955)  set  out  to  demonstrate  that 
although  there  might  not  be  an  American  public, 
there  was,  for  better  or  worse,  in  addition  to  the 
institutions  that  held  the  fragments  of  society 
together,  a  historically  rooted  liberal  value  consen¬ 
sus  and  tradition  that  gave  credence  to  the  concept 
of  e  pluribus  unum.  This  notion  of  a  liberal  consen¬ 
sus  that  transcended  and  reconciled  group  differ¬ 
ences  became  an  essential  element  of  the  revived 
group  theory  of  politics.  The  continuing  attempt  to 
give  meaning  to  the  idea  of  liberalism,  and  to 
equate  liberalism  with  democracy,  was,  however, 
catalyzed  and  galvanized  by  a  persistent  but  often 
still  somewhat  submerged  attack  on  liberalism  that 
had  begun  to  influence  what  had  been,  since  the 
1920s,  a  thoroughly  American  political  science.  By 
the  1950s,  liberalism,  because  of  both  doubts  about 
interventionist  government  and  events  such  as  the 
McCarthy  hearings,  had  become  a  highly  contested 
concept  in  American  politics.  Often,  for  quite 
different  reasons,  it  was  also  losing  its  positive 


valence  in  academic  discourse  as  a  somewhat  sub¬ 
terranean  critique  of  liberal  democracy,  and  politi¬ 
cal  science  began  to  infiltrate  the  discipline  and  form 
a  counterpoint  to  the  postwar  behavioral  movement 
in  political  science  and  its  rededication  to  a  scientific 
study  of  politics  based  on  emulating  what  was 
assumed  to  be  the  methods  of  natural  science. 

This  critique,  largely  conceived  and  mounted  by 
emigre  scholars,  was  gaining  a  place  in  the  litera¬ 
ture  of  political  theory,  and  it  was  manifest  in 
journals  such  as  the  Review  of  Politics  with  its 
theological  antiliberal  perspective  as  well  as  in  the 
perspective  of  the  president,  Robert  Hutchins,  and 
those  involved  in  the  Committee  on  Social  Thought 
at  the  University  of  Chicago  who  set  themselves 
directly  against  the  Chicago  image  of  social  sci¬ 
ence.  What  emerged  was  a  new  mode  of  political 
theory  that  would  eventually  lead  a  number  of 
scholars  to  make  an  identity  choice  they  had  never 
previously  been  confronted  with — that  is,  a  choice 
between  political  science  and  political  theory.  The 
confrontation  between  this  critique  and  the  recon¬ 
stituted  pluralist  account  of  liberal  democracy  in 
political  science  made  up  the  dialectic  of  democ¬ 
racy  in  the  postwar  generation.  At  this  point,  what 
really  separated  mainstream  political  scientists 
from  political  theorists  was  less  a  commitment  to 
science,  as  opposed  to  a  commitment  to  normative 
theory,  than  holding  two  quite  different  ethical 
positions  on  the  issue  of  democracy. 

The  predominantly  German  scholars  who  emi¬ 
grated  to  the  United  States,  beginning  in  the 
1930s,  were  in  many  respects  a  philosophically 
and  ideologically  diverse  group,  which  included 
Hans  Morgenthau  ( Scientific  Man  and  Power 
Politics ,  1946),  Eric  Voegelin  (The  New  Science 
of  Politics,  1951),  Leo  Strauss  (Natural  Right  and 
History,  1953),  Hannah  Arendt  (The  Human 
Condition,  1958),  and  members  of  the  neo-Marx¬ 
ist  Frankfurt  School  such  as  Herbert  Marcuse, 
Theodor  Adorno,  Otto  Kirchheimer,  and  Max 
Horkheimer.  What  they  had  in  common,  how¬ 
ever,  was  a  suspicion  of,  if  not  antipathy  toward, 
liberalism,  which  as  a  political  theory  they 
believed  was  philosophically  flawed  and  which  as 
a  political  form,  they  claimed  was  inherently 
pathological  and  represented  political  institutions 
that  historically,  as  in  the  case  of  Weimar 
Germany,  were  on  the  threshold  of  totalitarian¬ 
ism.  This  form  of  antimodernism,  rooted  in  the 
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politics  and  intellectual  context  of  early-20th-century 
Germany  and  in  various  transcendental  philoso¬ 
phies  such  as  that  of  Heidegger  as  well  as  forms  of 
socialist  thought,  was  a  strange  and  difficult  body 
of  ideas  for  Americans  to  absorb.  These  theorists, 
who  rejected  scientism  and  who  were  wedded  to 
images  of  the  decline  of  Western  civilization,  repre¬ 
sented  a  profound  challenge  to  a  conception  of 
democracy  based  on  commitments  to  empiricism, 
relativism,  liberalism,  and  historical  progress,  which 
had  characterized  American  political  science  for 
half  a  century.  By  the  end  of  the  1950s,  the  work  of 
these  foreign  scholars  and  indigenous  fellow  travel¬ 
ers  had  largely  begun  to  define  the  subfield  of 
political  theory,  which  heretofore  had  been  an  inte¬ 
gral  dimension  of  mainstream  American  political 
science  and  the  principal  vehicle  of  its  vision  of 
democracy.  This  challenge,  coupled  with  the  con¬ 
tinuing  concern  about  presenting  a  coherent  image 
of  democracy  as  a  counterpoint  to  totalitarianism, 
prompted  the  postwar  reconstitution  of  group  the¬ 
ory  and  the  pluralist  account  of  democracy. 

David  Truman’s  The  Governmental  Process 
(1951)  and  Earl  Latham’s  The  Group  Basis  of 
Politics  (1952)  revived  the  relevance  of  Bentley’s 
work,  and  Robert  Dahl’s  Preface  to  Democratic 
Theory  (1955)  established  the  genre  of  empirical 
democratic  theory  that  was  devoted  to  vouchsafing 
the  image  of  pluralist  democracy.  The  latter  work 
was  in  one  respect  less  a  “preface”  than  an  “epi¬ 
logue”  and  codification  of  ideas  that,  during  the 
1930s,  had  become  an  essential  part  of  the  identity 
of  political  science.  Dahl  returned  to  Laski’s  and 
Barker’s  term  polyarchy  as  a  synonym  for  a  form  of 
democracy  that  Dahl  contrasted  both  with  majori- 
tarian  or  “populist”  types  and  with  what  he 
claimed  was  Madison’s  excessive  emphasis  on  con¬ 
stitutional  checks  and  balances  at  the  expense  of 
adequate  attention  to  the  informal  and  social 
dimensions  of  group  interaction  where,  in  effect, 
minorities  ruled.  In  Who  Governs ?  (1961),  Dahl 
explicitly  embraced  the  term  pluralism ,  and  his 
theory  of  pluralist  democracy  was  offered  in  part  as 
a  counter  to  the  claims  about  elitism  and  the  struc¬ 
ture  of  community  power  advanced  by  individuals 
such  as  C.  Wright  Mills  (The  Power  Elite,  1956) 
and  various  sociologists  such  as  Floyd  Hunter.  But 
it  was  also  an  attempt,  during  the  Cold  War,  to 
systematize  and  accentuate  an  image  of  Western 
liberal  democracy  that  supported  the  faith  of  those 


who  opposed  the  political  ideas  and  institutions  of 
the  East.  Dahl,  like  those  after  him  such  as  Gabriel 
Almond  and  Sidney  Verba  (The  Civic  Culture, 
1963),  argued  that  we  should  begin  by  examining 
the  character  of  those  countries  that  we  know  to  be 
democracies  and  by  extracting  an  empirical  basis 
for  a  normative  theory  that,  Dahl  claimed,  was 
most  fully  represented  in  the  “American  hybrid.” 

The  critique  of  liberalism  and  pluralism  was, 
however,  perpetuated  by  a  wide  range  of  political 
theorists  in  the  1960s,  and  the  “end  of  ideology,” 
which  had  been  proclaimed  by  sociologists  such  as 
Daniel  Bell  and  Seymour  Martin  Lipset  as  the 
future  of  the  dominance  of  pluralist  liberal  democ¬ 
racy,  failed  to  materialize.  While  the  debate  that 
precipitated  between  political  theory  and  behav¬ 
ioral  political  science  during  this  period  has  often 
been  represented  as  a  conflict  between  “scientific” 
and  “traditional”  theory,  the  underlying  issue  was 
the  nature  of  democracy.  The  emigre-inspired  cri¬ 
tique  was  at  this  point  joined  by  theorists  such  as 
Sheldon  Wolin  (Politics  and  Vision,  1960)  as  well 
as  by  the  persistent  progressive  statist  countertra¬ 
dition  that  had  remained  alongside  pluralist  theory 
in  political  science  and  that  was  now  represented 
in  a  new  form  by  individuals  such  as  Elmer  E. 
Schattschn eider  (The  Semisovereign  People:  A 
Realist  View  of  Democracy  in  America,  1960). 
Schattschneider  argued  that  interest-group  politics 
had  an  upper-class  bias  as  well  as  a  corrosive  effect 
on  party  democracy.  Grant  McConnell  (Private 
Power  and  American  Democracy,  1966)  and 
Theodore  Lowi  (The  End  of  Liberalism,  1969) 
mounted  sustained  attacks  on  what  they  claimed 
democratic  and  institutional  pathologies  of  the 
theory  and  practice  of  “interest-group  liberalism.” 
Despite  their  similarities,  the  critiques  of  behavior- 
alism  and  liberalism  that  were  inspired  by  the 
emigre  theorists  and  those  that  were  rooted  in  the 
American  tradition  were  sometimes  uneasy  allies, 
such  as  in  the  Straussian-inspired  Essays  on  the 
Scientific  Study  of  Politics  (1961),  but  a  growing 
intellectual  split  between  political  theory  and 
mainstream  political  science  characterized  the 
1960s  and  evolved  through  the  1980s. 

From  Postbehavioralism  to  the  21st  Century 

Although  in  1961  Dahl  had  proclaimed  that  with 
respect  to  the  behavioral  movement  it  was  possible 
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to  write  “An  Epitaph  for  a  Monument  to  a 
Successful  Protest,”  both  the  commitment  to  pure 
science  and  the  pluralist  theory  of  democracy  con¬ 
tinued  to  be  the  target  of  widespread  criticism  from 
both  political  theorists  and  a  number  of  more  main¬ 
stream  political  scientists.  This  dissatisfaction 
sprung  in  part  from  what  seemed  to  many  to  be 
political  science’s  lack  of  relevance  for  and  atten¬ 
tion  to  political  events  such  as  the  Vietnam  War, 
the  crisis  of  American  cities  and  problems  of  civil 
rights,  and  Cold  War  politics.  The  Caucus  for  a 
New  Political  Science  challenged  the  authority  of 
the  APSA,  and  in  1969,  Wolin  pitted  the  “vocation 
of  political  theory”  against  what  he  claimed  was  the 
apolitical  “methodism”  of  behavioral  political  sci¬ 
ence.  At  the  same  time,  the  president-elect,  Easton, 
repudiated  the  tenets  of  much  of  what  had  repre¬ 
sented  the  behavioral  movement  when  he  announced 
a  “new  revolution  in  political  science,”  a  “postbe- 
havioral  revolution”  that  would  recognize  the  defi¬ 
ciencies  of  the  pluralist  theory  of  democracy  and 
embrace  a  new  “credo  of  relevance”  that  would 
give  precedence  to  research  on  pressing  contempo¬ 
rary  political  issues  rather  than  to  the  immediate 
advancement  of  scientific  theories  and  methods. 

By  the  early  1970s,  concerns  about  practical 
issues  led  the  mainstream  discipline  to  seek  an  iden¬ 
tity  for  the  postbehavioral  era  by  a  return  to  what 
Lasswell  had  championed  as  policy  science,  and  a 
more  ecumenical  spirit  was  apparent  as  the  debate 
about  behavioralism  wound  down  and  the  issue  of 
maintaining  professional  inclusiveness  became  more 
prominent.  Although  the  controversy  about  behav¬ 
ioralism  had  created  an  intellectual  breach  between 
mainstream  political  science  and  the  subfield  of 
political  theory,  it  also  had  the  effect  of  relocating, 
or  dislocating,  the  discussion  of  American  political 
identity  and  democratic  theory.  While  political  sci¬ 
ence  continued,  in  various  ways  and  degrees,  to 
validate  the  traditional  liberal  vision,  it  tended,  after 
the  1970s,  to  concede  to  political  theory  the  role  of 
normative  theorizing.  The  conversation  about 
democracy  and  liberalism  increasingly  became  the 
property  of  the  interdisciplinary  and  relatively 
autonomous  enterprise  of  political  theory  and 
absorbed  into  an  eclectic  conversation  that  was 
determined  more  by  the  reigning  academic  philo¬ 
sophical  authorities  than  by  any  direct  relevance  to 
the  particularities  of  American  politics.  Debates 
about  liberalism,  often  centering  on  philosophical 


arguments  such  as  those  of  John  Rawls,  Robert 
Nozick,  Jurgen  Habermas,  and  Richard  Rorty, 
became  a  large  part  of  the  focus  of  political  theory, 
while  both  political  science  as  a  whole  and  political 
theory  as  a  subfield  became  increasingly  pluralized. 

By  the  mid-1980s,  it  was  increasingly  difficult  to 
speak  in  general  of  political  science  as  a  discipline 
and  of  the  history  of  the  field  as  a  whole.  Although 
the  debate  about  behavioralism  had  fractured  the 
field,  it  had  also  constituted  the  terms  of  a  common 
conversation  that  in  some  respects  defined  the  disci¬ 
pline.  The  centrifugal  forces  of  specialization, 
increased  concerns  about  social  and  gender  diversity 
within  the  profession,  and  other  internal  tendencies 
toward  pluralization  in  the  field  were  accentuated 
by  the  need  to  recognize  the  growing  distinctions 
among  national  practices  of  political  science.  In 
1982,  the  IPSA  Study  Group  on  the  Comparative 
Sociology  of  Political  Science  was  formed,  and  in 
1986,  it  was  supplemented  by  the  International 
Committee  for  the  Study  of  the  Development  of 
Political  Science.  These  two  groups  merged  in  1988, 
forming  the  IPSA  Research  Committee  for  the 
Study  of  Political  Science  as  Discipline,  which  was 
formally  recognized  as  IPSA  Research  Committee 
33  (RC  33)  in  1989.  Since  its  formal  establishment, 
RC  33  has  been  active  in  all  of  the  IPSA  Congresses, 
and  it  has  also  undertaken  a  number  of  inter- 
Congress  workshops,  conferences,  research  proj¬ 
ects,  and  publications  devoted  both  to  the  study  of 
the  history  and  character  of  political  science  in 
various  countries  as  well  as  to  features  common  to 
the  field  as  a  whole.  The  research  sponsored  by  this 
endeavor  has  made  it  clear  that  despite  many  simi¬ 
lar  concerns  and  research  programs,  political  sci¬ 
ence  is  no  longer  consistently  the  science  of  politics 
as  it  evolved  in  the  United  States. 

The  beginning  of  the  1990s  was  a  watershed  for 
political  science.  The  public  policy  orientation 
inspired  by  the  events  of  the  1960s  began  to  fade, 
and  many  believed  the  growing  popularity  of  what 
Anthony  Downs  had  referred  to  as  An  Economic 
Theory  of  Democracy  (1957)  and  what  came  to  be 
referred  to  as  rational  choice  analysis  promised  a 
new  methodological  basis  for  disciplinary  identity 
and  a  reconstruction  of  democratic  theory.  This 
trend,  however,  was  paralleled,  and  challenged,  by 
those  who  had  begun  to  advocate  new  directions  in 
institutional  and  historical  research  (e.g.,  Bringing 
the  State  Back  In,  edited  by  Peter  B.  Evans,  Dietrich 
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Rueschemeyer,  &  Theda  Skocpol,  1985).  There 
were  calls  for  more  diverse  approaches  to  the  study 
of  politics  as  the  enthusiasm  for  rational  choice 
declined  (e.g.,  The  Economic  Approach  to  Politics: 
A  Critical  Assessment  of  the  Theory  of  Rational 
Choice,  edited  by  Kristen  Monroe,  1991).  The  col¬ 
lapse  of  communism  and  the  bipolar  world  of  the 
Cold  War  also  heightened  the  sense  of  plurality  as 
the  basic  condition  of  both  politics  and  political  sci¬ 
ence,  and  it  catalyzed  a  renewed  concern  with  the 
theory  of  democracy  as  a  variety  of  national  com¬ 
munities  sought  to  define,  and  redefine,  themselves 
as  democratic  societies.  In  this  context,  pluralism 
emerged  once  again  as  a  dominant  theme.  Not  only 
had  politics  become  internationally  more  diverse 
but  concerns  about  multiculturalism  and  philoso¬ 
phies  such  as  postmodernism  accentuated  the  value 
and  reality  of  diversity. 

After  its  intellectual  estrangement  from  main¬ 
stream  political  science,  a  unifying  and  driving  force 
in  the  increasingly  dispersed  conversation  of  politi¬ 
cal  theory  continued  to  be  a  critique  of  the  liberal/ 
pluralist  vision  of  democracy  and  an  attempt  to 
resurrect  some  version  of  participatory  democracy. 
By  the  early  1990s,  however,  there  was  a  subtle,  but 
in  some  ways  quite  fundamental,  shift  in  perspec¬ 
tive.  Although  the  idea  that  democracy  must  be 
rooted  in  unity  was  still  evident  in  the  work  of 
individuals  such  as  Robert  Putnam  [Bowling  Alone, 
2000),  who  stressed  the  importance  of  social  capital 
and  a  communal  basis  of  democracy,  the  concept  of 
pluralism  once  again  appeared  in  the  discourse  of 
political  theory  as  the  centerpiece  of  the  democratic 
imagination.  The  concept  was  seldom  any  longer 
that  of  the  interest-group  liberalism  of  the  1950s, 
but  theorists  such  as  Robert  Dahl  and  Charles 
Lindblom,  despite  some  critical  emendations,  main¬ 
tained  their  faith  in  social  diversity  as  the  ultimate 
value  of  democracy.  Both  John  Rawls  [Political 
Liberalism,  1993)  and  Jurgen  Habermas  [Facts  and 
Norms,  1992)  manifested  an  increased  acceptance 
of  the  social  realities  of  liberal  democracy,  and  the 
many  and  various  versions  of  both  deliberative  and 
radical  democracy  embraced  the  ethic  of  pluralism. 

The  manner  in  which  the  critique  of  pluralist 
liberalism  by  democratic  theorists  gave  way,  during 
the  closing  years  of  the  20th  century,  to  the  idea  of 
pluralism  as  the  core  of  democracy  uncannily  mir¬ 
rored  the  path  of  conceptual  transformation  that 
occurred  in  the  first  quarter  of  the  century.  And  the 


new  pluralism,  like  the  emergence  of  the  old  plu¬ 
ralism,  seems  in  large  measure  to  be  a  response  to 
the  ineluctable  realities  of  the  sociology  of  contem¬ 
porary  society.  Dahl  took  it  as  a  virtue  that  in  a 
polyarchial  society  one  might  say  that  no  one  gov¬ 
erns  or  that  minorities  govern,  but  the  problem 
always  was  that  if  this  is  the  case,  then  it  also 
means  that  democracy,  as  the  mediation  of  public 
decisions  through  the  general  citizenry,  is  difficult 
to  identify.  Plurality,  one  might  argue,  is  surely  a 
necessary  condition  of  any  realistic  concept  of 
democracy,  but  it  may  not  be  a  sufficient  condi¬ 
tion.  In  the  last  analysis,  the  philosophical  recon¬ 
ciliation  of  pluralism  with  democracy  has  been  no 
easier  than  it  was  at  the  time  of  Madison. 

By  the  beginning  of  the  21st  century,  it  was  dif¬ 
ficult  to  discern  any  clear  basis  of  unity  in  the  disci¬ 
pline  [Political  Science:  The  State  of  the  Discipline, 
edited  by  Ira  Katznelson  &  Helen  V.  Ingram,  2002), 
and  by  the  end  of  the  first  decade,  it  is  still  not  easy 
to  specify  what  trends  may  be  most  significant.  In 
the  United  States,  concerns  about  methodological 
diversity  surfaced  once  again  in  what  came  to  be 
known  as  the  Perestroika  debate,  but,  unlike  the 
conflicts  of  the  late  1960s  and  early  1970s,  the  issues 
appeared  to  be  more  professional  than  political.  The 
discipline’s  origins  were  closely  tied  to  a  definite 
practical  mission  of  political  reform  and  political 
education,  and,  like  the  meaning  of  democracy,  the 
relationship  between  political  science  and  politics 
remains  unresolved.  In  addition  to  noting  the  con¬ 
tinuing  tendencies  toward  specialization  within  and 
among  subfields,  it  is  necessary  to  recognize  that  in 
taking  account  of  the  history  and  current  practices 
of  the  field,  the  story  of  political  science,  despite 
some  intimations  to  the  contrary  [A  New  Handbook 
of  Political  Science,  edited  by  Robert  Goodin  & 
Hans-Dieter  Klingemann,  1996),  is  no  longer  simply 
the  story  of  the  American  science  of  politics. 
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Hobbes,  Thomas  (1588-1679) 


Thomas  Hobbes  is  regarded  as  one  of  the  greatest 
political  thinkers  in  the  Western  tradition  for  his 
contribution  to  the  development  of  the  concepts 
of  political  obligation  and  state  sovereignty.  His 
theory  is  often  associated  with  absolutism  in 
domestic  politics  and  with  realism  in  international 
relations.  A  long  line  of  interpreters,  however, 
have  questioned  such  associations. 

Hobbes  was  born  in  Malmesbury,  England,  on 
April  5,  1588.  For  much  of  our  knowledge  of 
Hobbes’s  early  life,  we  have  to  rely  on  the  notes  of 
his  contemporary  and  friend  John  Aubrey.  Hobbes 
was  of  humble  origin;  his  education  at  Magdalen 
Hall  in  Oxford  was  sponsored  by  an  uncle.  For 
most  of  his  life,  Hobbes  worked  for  the  earls  of 
Devonshire.  In  his  capacity  as  tutor,  he  travelled 
extensively  in  Europe  and  came  in  contact  with  the 
finest  minds  of  the  time,  including  Galileo, 
Descartes,  and  Leibniz.  Hobbes  fled  to  France  dur¬ 
ing  the  English  Civil  War  and  was  tutor  of  math¬ 
ematics  to  the  future  King  Charles  II,  who  was  also 
in  exile. 

The  intellectual  context  of  17th-century  Europe 
inspired  Hobbes  to  plan  a  comprehensive  system 
of  philosophy  that  explained  everything  that  could 
be  explained,  resorting  to  one  basic  principle, 
namely,  “motion.”  Hobbes’s  project  was  made  up 
of  three  parts,  the  first  on  metaphysics,  the  second 
on  man’s  physiology  and  psychology,  and  the  third 
on  politics.  The  political  crisis  in  his  country 
induced  Hobbes  not  to  follow  the  order  of  the 
original  project  but  to  begin  his  investigation  with 
the  study  of  the  rights  and  duties  of  citizens. 
Indeed,  the  political  climate  also  explains  why 
Hobbes  translated  Thucydides’  History  of  the 
Peloponnesian  War  in  1628.  According  to  Hobbes, 
Thucydides’  work  affords  us  important  insights 
into  the  weaknesses  of  democracies  and  was  rele¬ 
vant  at  a  time  when  there  were  growing  tensions 
between  Charles  I  and  Parliament. 

It  is  not  from  history,  however,  that  Hobbes 
thought  that  political  science  should  take  inspira¬ 
tion  for  its  methodology.  In  Brief  Lives,  Aubrey 
relates  an  anecdote  that  sheds  some  light  on  the 
method  that  Hobbes  used  in  his  political  writings. 
Apparently,  Hobbes  was  40  years  old  when  by 
chance  he  came  across  the  47th  proposition  in  the 
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first  book  of  Euclid’s  Elements.  Although  at  first 
the  proposition  appeared  counterintuitive  to 
Hobbes,  its  demonstration  convinced  him  of  its 
truth.  This,  Aubrey  tells  us,  was  the  beginning  of 
Hobbes’s  love  affair  with  geometry. 

Hobbes  believed  that  it  was  possible  to  apply 
the  formal  rigor  of  geometry  to  the  study  of  poli¬ 
tics  and  demonstrate  logically,  proceeding  from 
assumptions  to  conclusions,  the  causes  of  civil  war 
and  the  conditions  for  “immortal”  domestic  peace. 
Hobbes  firmly  believed  that  he  had  succeeded  in 
his  endeavor,  and  he  famously  claimed  that  politi¬ 
cal  science  was  not  older  than  his  book  De  Cive. 
The  concepts  and  ideas  put  across  in  De  Cive  are 
very  close  to  those  expanded  in  Leviathan,  which 
was  published  in  English  in  1651,  after  the  execu¬ 
tion  of  Charles  I  (in  1649).  Leviathan  is  regarded 
as  one  of  the  masterpieces  of  Western  political  sci¬ 
ence.  In  addition  to  being  brilliant  in  style, 
Leviathan  brings  to  light  in  an  unparalleled  way 
the  protection-obedience  principle  that  lies  at  the 
foundation  of  the  Western  concept  of  political 
obligation:  Citizens  obey  the  political  state  in 
exchange  for  protection. 

Hobbes  explains  the  sources,  nature,  and  limits 
of  political  obligation  in  three  steps.  In  the  first 
step,  Hobbes  describes  some  immutable  character¬ 
istics  of  human  nature.  To  begin  with,  he  empha¬ 
sizes  that  every  man  regards  self-preservation  as 
summum  bonum  and  fears  violent  death  at  the 
hands  of  others  “as  the  greatest  mischief  that  can 
arrive  to  nature”  (De  Cive,  Epistle  Dedicatory);  he 
stresses  that  all  men  are  capable  of  rationality  and 
that  “every  man  by  reasoning  seeks  out  the  most 
appropriate  means  to  achieve  his  ends”  (De  Cive, 
chap.  14,  sec.  16).  Furthermore,  he  stresses  the 
natural  equality  of  men,  in  the  sense  that  “the 
weakest  has  strength  enough  to  kill  the  strongest” 
(. Leviathan ,  chap.  13,  sec.  1)  and  remarks  that 
some  men  (who  we  cannot  easily  identify)  are  not 
satisfied  with  natural  equality  but  want  dominion 
over  others  and  are  in  perpetual  contention  for 
power,  honor,  and  riches. 

In  the  second  step,  Hobbes  imagines  a  state  of 
nature  where  there  is  no  authority,  no  private 
property,  no  industry,  and  no  common  definition 
of  just  and  unjust,  right  and  wrong.  Under  such 
conditions,  everyone’s  means  of  survival  are  very 
limited  and  unprotected.  Hobbes  puts  it  to  his 
reader  that  the  combined  effect  of  the  action  of 


men  living  in  such  conditions  is  a  war  of  all  against 
all:  Driven  by  a  fear  of  being  attacked,  all  indi¬ 
viduals,  who  are  rational  and  concerned  with  self- 
preservation,  and  who  are  equally  vulnerable  and 
equally  dangerous,  attack  each  other  in  anticipa¬ 
tion.  Hobbes  points  out  that  life  in  such  conditions 
is  “solitary,  poor,  nasty,  brutish  and  short” 
(. Leviathan ,  chap.  13,  sec.  9).  He  claims  that  rea¬ 
son  makes  man  understand  the  first  and  funda¬ 
mental  law  of  nature,  namely,  that  he  ought  “to 
seek  peace”  ( Leviathan ,  chap.  14,  sec.  4)  if  he 
thinks  he  can  obtain  it,  and  that  if  he  cannot 
obtain  it,  he  may  use  his  right  of  nature  and  defend 
his  life  by  all  means. 

Finally,  having  argued  that  the  result  of  the 
joint  action  of  natural  men  and  women  living  in 
the  state  of  nature  is  a  state  of  war  “of  everyone 
against  everyone,”  Hobbes  examines  the  condi¬ 
tions  that  would  bring  about  the  opposite  result, 
namely,  a  state  of  “immortal  peace.”  As  the  nature 
of  man  cannot  be  changed,  Hobbes  argues  that  the 
only  way  to  avoid  the  horror  of  anarchy  is  for 
individuals  to  “lay  down  their  right  to  all  things,” 
to  enter  a  social  contract  with  one  another,  and  to 
appoint  a  sovereign  with  the  primary  task  of  pro¬ 
tecting  their  lives  and  bodily  integrity.  The  sover¬ 
eign,  Hobbes  stresses,  is  not  part  of  the  social 
contract  but  simply  retains  the  original  natural 
right  to  all  things.  Hobbes  explains  that  sovereign 
power  can  reside  in  one  man  (or  woman)  or  an 
assembly,  but  its  characteristics  are  the  same:  It 
must  be  absolute,  unconditional,  unlimited,  irrevo¬ 
cable,  and  indivisible.  Hobbes  maintains  that  it 
would  be  irrational  to  impose  restrictions  on  the 
sovereign  power  as  it  would  restrict  its  ability  to 
protect  its  citizens.  He  also  claims  that  if  there 
were  to  be  someone  who  could  restrict  the  sover¬ 
eign  power,  then  this  person  would,  in  fact,  be  the 
sovereign.  Hobbes  leaves  virtually  no  scope  for 
resistance  to  the  state:  A  Hobbesian  citizen  can 
only  disobey  to  protect  his  own  life.  Because  of  his 
notion  of  sovereignty,  Hobbes  is  regarded  by  many 
as  the  champion  of  the  absolute  state. 

In  the  20th  century,  David  Gauthier  led  the 
camp  of  those  who  suggested  that  game  theory 
(prisoners’  dilemmas,  coordination  games,  super- 
games,  etc.)  sheds  light  on  Hobbes’s  political  argu¬ 
ment.  On  the  one  hand,  game  theory  has  been  used 
to  support  Hobbes’s  claim  that  from  a  set  of  unde¬ 
manding  assumptions  he  demonstrated  that  the 
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state  of  nature  turns  into  a  state  of  war;  on  the 
other  hand,  the  game-theoretical  armory  has  also 
been  deployed  to  claim  that  Hobbes’s  enterprise 
ultimately  fails  in  that  rational  agents  cannot 
escape  from  the  state  of  war  and  cannot  create  the 
Leviathan  in  the  way  suggested  by  Hobbes. 

Critics  of  game-theoretical  applications  to 
Leviathan  have  pointed  out  that  Hobbes  stresses 
that,  historically,  states  are  rarely  born  out  of  con¬ 
tracts.  In  this  view,  the  state  of  nature  is  not  meant 
to  describe  a  state  that  precedes  the  creation  of  the 
political  state  but  a  state  into  which  people  might 
fall  into  as  a  consequence  of  civil  disobedience.  In 
other  words,  the  state  of  nature  describes  a  state  of 
civil  war. 

There  is  textual  evidence  to  support  the  view 
that,  for  Hobbes,  fear  of  punishment  alone  cannot 
guarantee  long-term  order  and  stability  and  that 
citizens  need  to  be  educated.  In  Behemoth,  Hobbes 
singles  out  popular  ignorance  as  the  primary  cause 
of  the  English  Civil  War.  The  state  of  nature  is  a 
thought  experiment  used  by  Hobbes  to  teach  citi¬ 
zens  why  we  ought  to  remain  in  political  associa¬ 
tions  and  why  bad  governments  are  better  than  no 
governments  at  all. 

In  Chapter  13  of  Leviathan,  in  addition  to  stating 
that  the  state  of  nature  materializes  into  civil  wars, 
Hobbes  also  claims  that  it  occurs  in  some  primitive 
societies  and  that  it  describes  the  relations  between 
states.  He  famously  compares  states  to  “gladiators 
with  their  weapons  pointing  and  their  eyes  fixed  on 
one  another”  ( Leviathan ,  chap.  13,  sec.  12)  in  a 
constant  posture  of  hostility  and  war.  Such  remarks 
have  induced  some  interpreters  to  claim  that  Hobbes 
is  one  of  the  founding  fathers  of  realism.  It  has  been 
claimed  that  Hobbes’s  pessimistic  view  of  human 
nature,  his  idea  that  men  are  self-interested  and  seek 
power  after  power,  his  concept  of  anarchy,  and  his 
notion  of  the  sovereign  state  inspired  the  realist  tra¬ 
dition  of  international  relations. 

Whereas  there  is  universal  agreement  among 
interpreters  of  the  importance  of  Hobbes’s  contri¬ 
bution  to  the  development  of  the  concepts  of 
political  obligation  and  state  sovereignty,  no  such 
agreement  exists  as  to  the  association  of  Hobbes 
with  absolutism  and  with  realism.  Points  of  con¬ 
tention  among  the  interpreters  are  Hobbes’s  views 
on  natural  law,  morality,  God,  and  religion. 
The  dominant  view  maintains  all  or  most  of  the 
following  claims:  (a)  the  laws  of  nature  for  Hobbes 


are  prudential  rules  that  recommend  the  creation 
of  the  absolute  state;  (b)  the  state  is  the  source  of 
all  law,  all  justice,  and  all  morality;  (c)  the  detailed 
analysis  of  the  Bible  that  occupies  a  large  part  of 
Leviathan  is  meant  to  support  the  claim  that  the 
ecclesiastical  powers  are  totally  subordinated  to 
the  civil  power  and  that  a  citizen  must  obey  the 
Leviathan  in  everything  except  if  he  or  she  is 
asked  to  renounce  the  belief  that  Jesus  is  the 
Christ. 

In  contrast  to  the  above  views,  since  the  work 
of  A.  P.  Taylor  in  1938  and  Howard  Warrender  in 
1957,  a  long  line  of  interpreters  have  argued  that 
Hobbes’s  laws  of  nature  are  not  just  samples  of 
prudential  morality,  that  Hobbes  makes  the 
Leviathan  accountable  to  God,  that  the  sovereign 
power  is  absolute  but  not  arbitrary,  that  not  all 
justice  in  Hobbes’s  construct  is  legal  justice,  that 
Hobbes  is  ultimately  committed  to  the  individual 
and  not  to  the  state,  and  that  the  claim  that 
Hobbes  is  a  precursor  of  John  Austin  and  legal 
positivism,  or  indeed  a  founding  father  of  realism, 
is  ungrounded. 

The  most  renowned  works  on  Hobbes  in  the 
20th  century  are  those  by  Carl  Schmitt,  Leo 
Strauss,  Howard  Warrender,  Michael  Oakeshott, 
C.  B.  Macpherson,  David  Gauthier,  Jean  Hampton, 
Quentin  Skinner,  Sheldon  Wolin,  and  Noel 
Malcolm. 
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The  term  human  development  (HD)  that  came 
into  prominence  through  the  work  of  Amartya 
Sen  denotes  the  development  toward  a  more 
humane  society  in  which  a  maximum  number  of 
people  live  in  dignity.  From  a  humanist  position, 
a  dignified  life  is  a  life  that  people  have  reason  to 
value  because  they  are  free  to  shape  it  in  accor¬ 
dance  with  their  own  and  mutually  agreed  ideals. 
This  notion  of  HD  is  inspired  by  an  inherently 
emancipative  idea  of  the  “good  life”  that  unifies 
republican,  liberal,  contractual,  and  democratic 
thought,  as  noted  by  David  Held  (2006).  To  be 
human  in  this  perspective  means  to  have  the 
potential  to  reason,  to  judge,  to  choose,  and,  thus, 
to  be  an  agent  who  is  in  control  of  one’s  actions 
and  life.  The  most  humane  life  is  an  emancipative 
life  that  one  lives  in  self-determination. 

Human  Development  as  Emancipation 

Mastery,  control,  and  autonomy  are  features  of 
human  emancipation  emphasized  in  Edward  L. 
Deci  and  Richard  M.  Ryan’s  psychological  theory 
of  self-determination  (2002).  In  the  evolution  of 
our  species,  self-awareness  became  the  most  salient 
feature  of  the  human  intellect.  Self-awareness  cre¬ 
ates  a  need  to  realize  oneself  in  what  one  is  doing, 
and  this  implies  a  need  to  act  in  self-determined 
ways.  Self-determination  is  thought  to  become  a 
dominant  striving  as  soon  as  humans  are  existen¬ 
tially  secure,  so  that  the  trying  to  thrive  can  replace 
the  struggle  to  survive  as  the  main  director  of 
human  energies.  Self-determination  is  an  inher¬ 
ently  emancipative  striving  that,  if  satisfied,  creates 
feelings  of  being  in  unison  with  oneself.  Because 
self-awareness  is  an  evolution-shaped  feature  of 
every  human  being,  self-determination  is  the  most 
universally  and  most  specifically  human  striving. 


Sen  also  draws  on  the  emancipative  notion  of 
HD  in  psychology,  but  he  relates  it  to  societies  as 
the  unit  of  reference.  When  the  good  life  is  an 
emancipated  life,  the  good  society  is  a  society  that 
makes  a  maximum  number  of  people  capable  to 
live  emancipative  lives.  Because,  as  members  of  the 
human  race,  all  people  are  of  equal  existential 
value,  every  person  has  the  same  right  to  an  eman¬ 
cipated  life;  opportunities  to  live  in  emancipation 
must  be  equally  distributed  in  a  humane  society.  In 
that  sense,  HD  theory  construes  the  ideals  of  free¬ 
dom  and  equality  as  interdependent  rather  than 
contradictory. 

Due  to  Sen’s  capability  approach,  the  HD  of 
societies  can  be  measured  by  how  widely  emanci¬ 
pative  capabilities  are  distributed.  The  United 
Nations  Development  Programme  (UNDP)  fol¬ 
lows  the  capability  approach  in  its  annual  Human 
Development  Report,  which  publishes  a  Human 
Development  Index  (HDI).  The  HDI  summarizes 
on  a  per-country  basis  information  on  the  average 
person’s  life  expectancy,  educational  attainment, 
and  per  capita  income,  assuming  that  longevity, 
education,  and  income  increase  ordinary  people’s 
capabilities  to  live  an  emancipated  life. 

In  the  perspective  of  HD,  democracy  becomes 
an  integral  part  of  the  definition  of  development 
for  the  following  reasons:  Human  emancipation 
requires  freedom  of  choice  in  private  and  public 
affairs,  and  such  freedoms  are  granted  through  the 
civil  and  political  rights  that  define  democracy. 

The  emancipative  notion  of  HD  has  been  criti¬ 
cized  as  prescribing  a  Western  view  of  the  good 
life.  But  supporters  of  the  HD  approach  note  that 
this  criticism  is  defensible  only  on  two  question¬ 
able  premises:  The  potential  to  live  an  emancipated 
life  is  not  a  universal  human  potential  but  one  that 
only  Western  people  possess,  and  emancipation  is 
not  a  universal  human  value  because  non-Western 
people  do  not  desire  it.  Neither  of  these  positions 
is  tenable.  First,  the  potential  for  emancipation  is 
anchored  in  a  most  general  feature  of  the  human 
mind,  namely,  self-awareness,  which  is  not  the  sole 
property  of  Westerners.  Second,  the  claim  that 
emancipation  is  not  a  valued  feature  of  life  among 
non-Westerners  has  been  empirically  disproven. 
Whether  people  value  emancipation  can  be  seen  in 
whether  feelings  of  being  free  in  shaping  their  lives 
impact  satisfaction  with  their  lives.  Only  if  free¬ 
dom  over  their  lives  increases  people’s  satisfaction 


1102  Human  Development 


with  their  lives  is  emancipation  of  value  for  people. 
As  Ronald  Inglehart  and  Christian  Welzel  have 
shown,  feelings  of  freedom  over  their  lives  increase 
people’s  life  satisfaction  in  all  cultures.  Thus,  HD’s 
emphasis  on  emancipation  does  not  prescribe  a 
specifically  Western  view  of  the  good  life.  It  cham¬ 
pions  a  particularly  humane  view. 

The  People  Empowerment  Framework 

Christian  Welzel,  Ronald  Inglehart,  and  Hans- 
Dieter  Klingemann  widened  the  capability 
approach  into  a  people  empowerment  framework 
(PEF).  The  PEF  specifies  how  economic  condi¬ 
tions,  cultural  norms,  and  institutional  rules  inter¬ 
act  in  empowering  people,  relating  each  of  these 
three  areas  of  social  reality  to  human  emancipa¬ 
tion  as  a  common  underlying  theme.  Figure  1 
depicts  the  PEF. 

In  line  with  the  emancipative  notion  of  HD, 
people  empowerment  denotes  ordinary  people’s 
potential  to  govern  their  lives  themselves  through 


the  practice  of  personal  freedoms  in  private  mat¬ 
ters  as  well  as  political  freedoms  in  public  affairs. 
The  joint  and  equal  emphasis  on  personal  and 
political  freedoms  resonates  with  Isaiah  Berlin’s 
notion  of  “negative”  and  “positive”  freedom:  To 
be  truly  free,  people  must  have  the  right  to  follow 
their  personal  preferences  in  private  matters  and  to 
make  their  political  preferences  known,  bargained, 
and  counted  in  public  matters. 

Elements  of  People  Power 

To  make  freedoms  practicable  for  people,  social 
reality  must  meet  three  criteria.  First,  ordinary 
people  must  have  command  over  participatory 
resources  that  make  them  capable  to  exercise  free¬ 
doms.  Participatory  resources  include  material 
means,  cognitive  skills,  and  connective  opportuni¬ 
ties;  these  resources  empower  people  on  the  level 
of  capabilities,  widening  their  objective-action 
repertoire.  Second,  ordinary  people  must  have 
internalized  self-expression  values  that  make  them 
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Figure  I  Human  Development  as  People  Empowerment 

Source :  Adapted  from  Welzel,  C.,  Inglehart,  R.,  &  Klingemann,  H.-D.  (2003).  The  theory  of  human  development:  A  cross- 
cultural  analysis.  European  Journal  of  Political  Research,  43,  379-401. 
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strive  to  exercise  freedoms.  Self-expression  values 
emphasize  freedom  of  choice  and  equality  of 
opportunities;  they  empower  people  on  the  level  of 
ambitions,  widening  their  subjective-action  hori¬ 
zon.  Third,  ordinary  people  must  be  granted 
democratic  rights  that  license  them  to  practice 
freedoms.  Democratic  rights  establish  equal  per¬ 
sonal  and  political  liberties;  they  empower  people 
on  the  level  of  entitlements ,  widening  their  legal- 
action  radius. 

Participatory  resources,  self-expression  values, 
and  democratic  rights  constitute  three  distinct, 
though  interdependent,  elements  of  people  power. 
The  pattern  of  their  interdependence  is  governed 
by  the  “utility  logic  of  freedom.”  This  logic 
becomes  evident  when  rethinking  participatory 
resources,  self-expression  values,  and  democratic 
rights  as,  respectively,  capabilities  of,  ambitions 
for,  and  entitlements  to  exercising  freedoms.  Doing 
so  makes  a  number  of  logical  connections  obvious. 
For  one,  utility  suggests  that  people  become  ambi- 
tioned  to  exercise  freedoms  only  insofar  as  they 
find  themselves  able  to  exercise  them.  The  utility 
logic  also  suggests  that  entitlements  to  exercise 
freedoms  come  into  wide  and  frequent  use  only 
insofar  as  people  are  able  and  willing  to  use  them. 
Together,  these  propositions  suggest  that  self- 
expression  values  grow  in  response  to  grown  par¬ 
ticipatory  resources  and,  then,  that  popular  pres¬ 
sures  for  democratic  rights  grow  in  response  to 
grown  self-expression  values  and  participatory 
resources. 

The  three  elements  of  human  empowerment  are 
interdependent  not  on  conceptual  grounds  only. 
Empirically,  as  shown  by  Welzel,  Inglehart,  and 
Klingemann  (2003,  pp.  368-369),  the  interdepen¬ 
dence  is  reflected  in  very  strong  correlations 
between  societal-level  measures  of  participatory 
resources,  self-expression  values,  and  democratic 
rights.  The  three  elements  correlate  so  strongly 
that  statistically  speaking  they  reflect  one  and  only 
one  empirical  dimension.  Content-wise,  this 
dimension  represents  people  empowerment — 
which  is  a  unity  of  empowering  capabilities,  men¬ 
talities,  and  entitlements.  Because  of  their  interde¬ 
pendence,  the  three  elements  of  people  empower¬ 
ment  establish  equilibria  that  resemble  vicious  and 
virtuous  cycles  of  HD.  Most  societies  are  found  on 
either  consistently  low  or  consistently  high  levels 
of  HD  in  all  three  elements.  HD  is  a  syndrome  of 


developmental  conditions  that  is  not  easily  disen¬ 
tangled.  This  is  part  of  the  reason  why  it  is  so 
difficult  to  get  development  started:  It  needs  a 
synchronized  effort  to  trigger  economic,  cultural, 
and  institutional  changes,  as  they  mutually  condi¬ 
tion  each  other. 

Democracy  represents  the  institutional  element 
of  people  empowerment  and  as  such  is  highly  con¬ 
ditioned  by  the  economic  and  cultural  elements  of 
empowerment.  No  question,  democracy  can  be 
imposed  from  the  outside  by  foreign  powers,  or  it 
can  be  adopted  from  the  inside  by  unilateral  acts 
of  domestic  power  holders — irrespective  of  the 
social  conditions.  But  in  such  cases,  one  most 
likely  deals  with  a  socially  aloof  version  of  democ¬ 
racy,  in  which  case  the  freedoms  that  define 
democracy  are  not  coming  into  wide  usage  by 
most  of  the  people.  Only  when  ordinary  people  are 
capable  of  exercising  freedoms  and  only  when  they 
are  willing  to  do  so  will  the  freedoms  granted  by 
democracy  come  into  wide  use.  Only  then  do  we 
witness  socially  embedded  democracy. 

Sometimes  one  finds  democratic  freedoms  insti¬ 
tutionalized  at  a  level  that  goes  beyond  most  peo¬ 
ple’s  capabilities  and  ambitions  to  struggle  for 
these  freedoms.  When  this  is  the  case,  power  hold¬ 
ers  are  free  to  corrupt  formally  enacted  freedoms, 
and  in  general,  they  use  this  leeway  to  their  benefit. 
Thus,  when  democracy  is  conceded  by  power 
holders  in  the  absence  of  popular  pressures,  it  is 
usually  corrupted  democracy.  By  contrast,  when 
one  finds  democracy  to  be  effective  in  the  sense 
that  its  freedoms  are  really  respected  by  those  in 
power,  this  is  usually  a  tribute  to  ordinary  people’s 
capabilities  and  ambitions  to  practice  freedoms 
and  struggle  for  them. 

Strengths  of  Human  Development  Theory 

Compared  with  other  concepts  in  the  social  sci¬ 
ences,  HD  theory  offers  a  broad  perspective  by 
integrating  economic,  cultural,  and  institutional 
aspects  of  social  reality  into  a  common  frame¬ 
work.  This  breadth  of  perspective  does  not  come 
at  the  expense  of  the  analytical  focus.  Instead,  HD 
theory  sharpens  the  analytical  focus  by  relating  all 
three  aspects  of  social  reality  to  the  empowerment 
of  people  as  the  lead  theme. 

The  HD  framework  is  inspired  by  an  emancipa¬ 
tive  idea  of  the  good  society  and  the  good  life.  This 
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normative  inspiration  constitutes  the  framework’s 
integrative  core.  However,  the  normativity  of  the 
concept  is  limited  to  providing  a  standard  against 
which  to  evaluate  a  factual  state  of  affairs  as  desir¬ 
able  or  undesirable.  Having  a  standard  to  evaluate 
reality  does  by  no  means  mean  to  prescribe  reality. 
Rather,  it  provides  a  basis  for  interventions  to 
change  it. 

The  only  other  concept  in  the  social  sciences 
comparable  with  HD  in  its  integrative  potential  is 
social  capital  (SC).  With  its  emphasis  on  assets 
that  facilitate  collective  action,  SC  theory  sets  a 
focus  similar  to  HD  theory’s  emphasis  on  assets 
that  empower  ordinary  people.  Thus,  the  three 
assets  championed  in  SC  theory — networks,  trust, 
and  norms — resonate  with  the  three  assets  high¬ 
lighted  in  HD  theory — capabilities,  ambitions, 
and  entitlements.  However,  SC  theory  lacks  two 
properties  of  HD  theory.  First,  SC  theory  lacks  an 
evolutionary  perspective  that  theorizes  how  devel¬ 
opment  transforms  different  types  of  SC.  Second, 
SC  theory  has  little  grounding  in  psychology  and 
has  hence  difficulties  to  anchor  SC  in  the  human 
motivational  system.  By  contrast,  with  its  evolu¬ 
tionary  perspective,  grounding  in  psychology,  and 
humanistic  inspiration,  HD  theory  offers  an 
appealing  integrative  framework  for  the  social 
sciences. 

Christian  Welzel 
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Bremen,  Germany 
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Human  rights  are  inherently  comparative.  Empirical 
analysis  on  and  advocacy  for  human  rights  are 
grounded  in  significant  questions  relating  to  human 
well-being  and  how  such  well-being  varies  across 
units,  whether  these  units  are  individuals,  groups, 
countries,  or  regions.  Comparative  politics  is  con¬ 
cerned  with  explaining  variation  in  social  and 
political  phenomena  across  these  very  same  units. 
Legal  and  normative  frameworks  for  human  rights 
have  sought  to  establish  rules  and  norms  about 
how  states  and  individuals  ought  to  treat  one 
another,  and  the  human  rights  community  has 
sought  to  make  such  a  set  of  constraints  on  human 
behavior  universal  through  the  vehicle  and  mecha¬ 
nisms  of  public  international  law.  Through  the 
specification  and  testing  of  social  theory  on 
observed  similarities  and  differences  in  the  protec¬ 
tion  of  human  rights  across  individuals,  groups, 
countries,  and  regions,  comparative  politics  helps 
provide  solutions  to  enhancing  the  promotion  and 
protection  of  human  rights.  But  it  is  crucial  in  this 
effort  that  comparison  remains  committed  to 
methodological  considerations  of  case  selection 
and  bias,  measurement  error  and  data  availability, 
universality  and  particularity,  and  causal  heteroge¬ 
neity  (see,  e.g.,  Todd  Landman,  2002).  Absence  of 
attention  to  these  key  issues  can  lead  to  insecure 
inferences  and  the  possibility  of  perverse  outcomes 
that  can  endanger  human  rights.  This  entry  first 
examines  issues  related  to  methodology  and 
describes  some  commonly  used  measures  of  human 
rights.  It  then  discusses  research  on  human  rights 
and  the  contributions  of  comparative  politics  to 
such  research.  It  concludes  with  a  look  at  some  of 
the  challenges  for  future  research  in  the  field. 

Methodological  Considerations 

Broadly  speaking,  comparative  analysis  variously 
includes  large-N  statistical  analysis  of  many 
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countries  over  time  and  space,  small-N  qualita¬ 
tive  or  quantitative  analysis  of  most  similar  or 
most  different  countries,  or  single-country  studies 
of  human  rights  developments  at  the  national  or 
subnational  level.  Such  comparisons  have  strug¬ 
gled  to  overcome  significant  ontological,  episte¬ 
mological,  and  methodological  challenges  in 
order  to  provide  valid,  meaningful,  and  reliable 
inferences  in  this  burgeoning  subfield  of  research. 
But  there  is  now,  however,  a  distinct  subfield  and 
strong  community  of  researchers  specifically  ded¬ 
icated  to  the  application  of  the  theories  and  meth¬ 
ods  in  social  science  to  significant  human  rights 
problems  and  puzzles  that  includes  systematic 
comparative  analysis.  The  American  Political 
Science  Association  (APSA),  the  International 
Political  Science  Association  (IPSA),  and  the 
International  Studies  Association  (ISA)  have  all 
established  specialized  human  rights  sections.  The 
European  Consortium  for  Political  Research 
(ECPR)  hosted  a  joint  workshop,  The  Systematic 
Study  of  Human  Rights  Violations,  organized  by 
Steve  Poe  and  Sabine  Carey  in  2002;  the  Journal 
of  Peace  Research  published  a  special  issue  dedi¬ 
cated  to  political  science  analysis  of  protecting 
human  rights,  while  other  mainstream  journals  in 
comparative  politics  and  international  relations 
have  increasingly  published  empirical  studies  on 
human  rights  that  adopt  a  comparative  frame¬ 
work  of  some  kind. 

Comparative  studies  have  tended  to  sidestep 
normative  concerns  over  foundations  for  human 
rights  and  have  adopted  a  pragmatic  approach 
that  seeks  to  conduct  systematic  research  on  the 
conditions  under  which  such  protections  are  (or 
are  not)  made  possible.  Such  a  focus  can  help  us 
understand  how  best  to  prevent  human  beings 
from  doing  their  worst  to  one  another,  as  well  as 
how  to  overcome  some  of  the  structural  barriers  to 
achieving  greater  human  dignity  for  a  larger  cross 
section  of  humankind.  Comparative  analysis  of 
human  rights  is  thus  essential  for  explaining  and 
understanding  the  conditions  under  which  human 
beings  forge  their  existence,  assert  their  dignity, 
and  seek  protections  for  their  different  identities, 
for  the  pursuit  of  self-determination,  and  for  the 
exercise  of  agency  and  autonomy.  Like  the  study 
of  markets,  social  classes,  and  democracy,  the 
study  of  human  rights  reveals  much  about  human 
nature  and  the  ways  in  which  structure  and  agency 


interact  to  create  extraordinarily  different  life 
experiences  across  the  globe  and  provides  valuable 
insights  into  the  types  of  real  protections  that  need 
to  be  in  place. 

Measures  of  Human  Rights 

Beyond  its  normative  and  ontological  concerns, 
comparative  analysis  of  human  rights  problems  is 
predicated  on  the  genesis  and  use  of  comparable 
measures  of  human  rights  either  within  states  or 
across  states.  Such  measures  can  be  qualitative  or 
quantitative,  and  both  types  of  measures  provide 
some  gauge  of  the  degree  to  which  human  rights 
are  being  respected,  protected,  and  fulfilled  by 
states  (see  Todd  Landman  &  Edzia  Carvalho, 
2009).  To  date,  such  measures  have  included 
events-based  data  on  violations  of  human  rights, 
standards-based  data  on  country  protection  of 
human  rights  in  principle  and  in  practice,  survey- 
based  data  on  perceptions  and  experiences  of 
human  rights,  and  socioeconomic  and  administra¬ 
tive  statistics  on  government  inputs,  processes, 
outputs,  and  outcomes  that  have  a  bearing  on 
human  rights.  These  measures  still  tend  to  be 
biased  toward  the  protection  of  civil  and  political 
rights,  but  progress  has  been  made  recently  on 
providing  comparable  measures  of  some  socioeco¬ 
nomic  rights  (see  Landman  &  Carvalho,  2009). 

Causes,  Conditions,  and  Consequences 
of  Human  Rights 

In  using  these  various  measures,  much  comparative 
research  has  been  carried  out  in  an  effort  to  explain 
and  understand  cross-national  similarities  and  dif¬ 
ferences  in  the  protection  of  human  rights.  There  is 
now  a  large  body  of  large-N  quantitative  studies, 
small-N  qualitative  and  quantitative  studies,  as 
well  as  a  vast  array  of  single-country  studies  that 
examine  the  causes,  conditions,  and  consequences 
of  human  rights  protection  around  the  world. 
Extant  global  comparative  studies  on  human  rights 
protection  have  tended  to  focus  on  a  narrow  set  of 
civil  and  political  rights  and  carried  out  diachronic 
and  synchronic  analyses  to  estimate  the  effects  of  a 
series  of  important  explanatory  variables  that 
account  for  their  variation.  Such  explanatory  vari¬ 
ables  have  included  economic  development,  popu¬ 
lation  and  population  growth,  democracy  and 
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democratization,  multinational  corporations,  inter¬ 
nal  and  external  violent  conflict,  the  end  of  the 
Cold  War,  U.S.  and  European  foreign  aid,  domestic 
constitutional  provisions,  and  religious  differences 
and  ethnic  diversity.  In  addition  to  consideration  of 
civil  and  political  rights,  other  global  comparative 
projects  have  focused  on  discrimination,  minori¬ 
ties,  and  conflict;  U.S.  refugee  policy;  and  the  provi¬ 
sion  of  basic  human  needs. 

The  most  recent  area  of  interest  has  been  the 
examination  of  the  growth  and  effectiveness  of  the 
international  human  rights  regime.  Linda  Camp 
Keith  (1999)  and  Oona  Hathaway  (2002)  were 
the  first  scholars  to  analyze  the  relationship 
between  ratification  of  human  rights  treaties  and 
human  rights  protection,  while  controlling  for  a 
variety  of  other  variables.  Their  analyses  do  not 
find  robust  support  for  such  a  relationship.  Emilie 
M.  Hafner-Burton  and  Kiyoteru  Tsutsui  (2007) 
added  a  layer  of  complexity  to  the  relationship  by 
specifying  models  that  included  an  interaction 
effect  between  treaty  ratification  and  democracy, 
which  in  turn  has  an  effect  on  human  rights  pro¬ 
tection.  Landman’s  (2005)  analysis  included  the 
level  and  timing  of  democratization,  a  feedback 
process  between  treaty  ratification  and  human 
right  protection,  as  well  as  a  weighted  measure  of 
treaty  ratification  that  took  into  account  the  filing 
of  reservations  across  the  main  international 
human  rights  treaties. 

Common  Themes  of  Analysis 

Analyses  of  human  rights  problems  and  puzzles 
that  use  a  relatively  small  and  intentional  selection 
of  countries  address  a  number  of  common  themes 
and  adopt  a  wide  range  of  comparative  methods. 
The  dominant  themes  at  this  level  of  analysis 
include  the  struggle  for  rights  and  the  relationship 
between  social  mobilization,  political  liberaliza¬ 
tion,  and  repression  (Thomas  Risse,  Stephen  C. 
Ropp,  &  Kathryn  Sikkink,  1999);  the  similarities 
and  differences  in  the  formation,  function,  out¬ 
comes  and  impact  of  truth  commissions  (e.g., 
Priscilla  B.  Hayner,  2002);  the  legacies  of  authori¬ 
tarian  rule  (Luis  Roniger  &  Mario  Sznajder,  1999); 
nonstate  violence,  “uncivil”  movements,  and  death 
squads  (Leigh  Payne,  2000);  and  the  ways  in  which 
human  rights  norms  have  transcended  state  sover¬ 
eignty  through  the  use  of  transnational  advocacy 


networks  that  help  form  alliances  and  informa¬ 
tional  networks  between  domestic  and  interna¬ 
tional  human  rights  organizations  that  are  able  to 
put  pressure  on  rights  violating  states  (Risse, 
Ropp,  &  Sikkink,  1999).  These  various  small-N 
studies  have  adopted  a  range  of  comparative-case 
or  focused-comparison  strategies,  including  most 
similar  systems  designs  (MSSDs),  which  compare 
different  human  rights-related  outcomes  across 
similar  cases;  most  different  systems  designs 
(MDSDs),  which  compare  similar  outcomes  across 
different  cases;  and  the  mirror  image  of  MSSDs, 
which  compare  similar  variation  across  similar 
cases  and  even  different  outcomes  across  different 
cases. 

Comparative  Strategies 

Single-country  studies  have  long  been  an  impor¬ 
tant  part  of  comparative  politics,  and  social  sci¬ 
ence  more  generally,  for  their  use  in  description 
and  classification,  in  plausibility  probes,  in  gener¬ 
ating  and  testing  hypotheses,  and  as  crucial  cases 
(i.e.,  the  most  likely  and  least  likely  examples)  that 
can  confirm,  inform,  or  disconfirm  existing  theo¬ 
ries.  The  human  rights  themes  that  are  addressed 
in  single-country  studies  vary  little  from  those  that 
have  featured  in  the  large-N  and  small-N  com¬ 
parative  studies,  but  as  in  the  comparison  of  a  few 
countries,  single-country  analysis  has  allowed  for 
much  greater  attention  to  process  tracing  and 
dynamic  relationships  between  actors,  conditions, 
and  rights.  The  work  of  Darren  G.  Hawkins 
(2002)  provides  a  good  example  of  how  process 
tracing  provides  a  deeper  insight  into  the  causal 
mechanisms  within  a  single  county.  In  his  study  of 
authoritarianism  in  Chile  and  the  response  of  the 
Pinochet  regime  to  international  human  rights 
pressure,  Hawkins  shows  that  the  rule-oriented 
factions  within  the  regime  grew  increasingly  wary 
of  the  possible  delegitimizing  power  of  interna¬ 
tional  human  rights  pressure,  gained  ground  over 
time,  and  were  largely  responsible  for  the  various 
moves  the  regime  made  toward  democratization 
and  improvement  in  human  rights  protection; 
these  lessons,  as  Hawkins  shows,  apply  to  an 
analysis  of  South  Africa  and  Cuba. 

Increasingly,  the  inferential  logic  of  large-N  stud¬ 
ies  has  begun  to  be  applied  to  single-country  studies 
using  subnational  divisions  within  individual 
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countries,  such  as  democracy  and  human  rights  in 
the  federal  systems  of  Mexico  and  India  and 
democracy  and  political  violence  across  the  admin¬ 
istrative  districts  of  Nepal.  In  these  studies,  the  use 
of  states  in  a  federal  structure  as  the  basic  units  of 
analysis  allows  for  a  large  number  of  observations 
(or  increased  degrees  of  freedom)  for  the  analysis  of 
variation  in  human  rights  protection  while,  at  the 
same  time,  controlling  for  similarities,  since  these 
units  are  all  from  the  same  country.  In  this  way, 
comparative  research  in  federal  systems  is  a  form  of 
MSSD  and  offers  tremendous  promise  for  research 
in  the  field  of  human  rights. 

Taken  together,  there  is  a  natural  synergy 
between  human  rights  and  comparisons,  and  the 
field  of  comparative  politics  brings  much  to  bear 
on  our  understanding  of  the  conditions  under 
which  human  rights  can  or  cannot  be  protected  in 
the  ways  that  are  outlined  in  the  international  law 
of  human  rights.  Systematic  analysis  of  human 
rights  problems  is  a  burgeoning  subfield  in  com¬ 
parative  politics  and  in  many  ways  is  a  natural 
place  in  the  larger  discipline  of  political  science  in 
which  to  locate  such  analysis.  While  the  interna¬ 
tional  human  rights  regime  seeks  to  use  the  mecha¬ 
nism  of  international  law  to  govern  the  ways  in 
which  states  relate  to  their  citizens,  it  is  within 
states  and  it  is  across  states  that  the  promotion  and 
protection  of  human  rights  varies.  This  variation, 
which  makes  a  reference  to  an  ideal  standard  out¬ 
lined  within  the  many  international  human  rights 
norms  and  instruments,  is  best  analyzed  through 
the  various  methods  available  to  comparativists. 
These  methods,  which  have  developed  through  the 
analysis  of  many  areas  of  research  outside  any 
concern  for  human  rights,  have  served  and  will 
serve  human  rights  well  for  the  future. 

Challenges 

Despite  the  many  virtues  of  comparative  analysis 
for  human  rights  research,  there  are  many  chal¬ 
lenges  that  remain  in  the  field.  Comparative  politics 
does  not  have  its  own  distinct  theoretical  tradition 
but  engages  in  rationalist,  structuralist,  and  cultur- 
alist  theorizing  that  can  be  developed  more  fully  in 
ways  that  will  prove  fruitful  for  human  rights 
research.  The  attention  to  rationalism  in  Neil 
Mitchell  (2004)  and  Hawkins  (2002)  is  laudable, 
especially  since  both  afford  an  equally  large  role  for 


ideational  approaches.  Mitchell’s  “principals”  are 
motivated  by  material  self-interest,  as  well  as  ide¬ 
ology,  and  it  is  precisely  the  differences  in  these 
motivations  that  help  explain  the  differences  in 
levels  of  atrocity  that  he  observes  across  his  three 
cases.  Moreover,  the  model  that  he  develops  ought 
to  travel  to  other  cases  both  in  the  past  and  in  the 
contemporary  period.  In  a  similar  fashion,  the  dif¬ 
ferent  factions  in  Hawkins’s  analysis  of  the  mili¬ 
tary  regime  in  Chile  are  not  only  motivated  by  the 
quest  to  maintain  power  but  also  by  a  particular 
attachment  to  legal  rules  and  standards  both 
within  and  outside  the  Chilean  constitutional 
order.  This  kind  of  innovative  development  and 
testing  of  theories  is  needed  in  future  comparative 
research  on  human  rights. 

For  example,  the  field  has  not  yet  developed  a 
set  of  formal  models  of  the  kind  found  in  the 
political  economy  literature  that  use  the  relative 
costs  and  benefits  of  attention  to  human  rights  to 
explain  either  domestic  or  international  state 
behavior.  At  the  international  level,  there  are  sig¬ 
naling  and  reputational  factors  that  may  explain 
why  states  ratify  human  rights  treaties,  while  at  the 
domestic  level,  human  rights  can  be  theorized  as  a 
set  of  public  good  for  which  states  have  different 
welfare  functions.  Moreover,  spatial  econometrics 
and  simple  gravity  models  can  start  to  examine 
possible  spillover  and  bandwagon  effects,  as  well 
as  the  influence  of  large  and  powerful  states  on 
other  states.  In  all  of  these  examples,  systematic 
comparative  analysis  of  the  variation  in  state 
behavior  can  provide  the  evidence  base  to  test 
models  that  have  been  developed  at  a  particular 
level  of  conceptual  abstraction. 

Finally,  it  is  clear  that  more  attention  is  needed 
on  methodological  issues.  For  questions  of  mea¬ 
surement,  more  attention  is  needed  on  the  sources 
of  human  rights  information,  the  procedures  for 
coding  human  rights  information,  the  develop¬ 
ment  of  measures  of  economic  and  social  rights, 
and  the  provision  of  indicators  that  capture  the 
many  different  dimensions  of  human  rights 
(Landman  &  Carvalho,  2009).  These  measure¬ 
ment  issues  are  pertinent  to  all  forms  of  compara¬ 
tive  analysis  from  large-N  studies  to  single-country 
analysis.  For  studies  that  compare  a  few  countries 
or  a  single  country,  more  attention  to  the  process 
of  case  selection  is  needed  to  avoid  indeterminate 
research  designs  and  severe  forms  of  selection  bias. 
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MSSDs  and  MDSDs  offer  good  first  solutions  for 
case  selection,  while  greater  attention  to  negative 
cases  and  crucial  cases  is  required  to  make  the 
most  of  single-country  studies  of  particular  human 
rights  problems.  Human  rights  research  is  a  field 
worthy  of  study  within  comparative  politics, 
where  the  standards  for  high-quality  and  system¬ 
atic  rules  of  inquiry  apply  equally,  while  the  need 
for  a  strong  evidence  base  is  paramount  for  policy¬ 
makers  and  advocates  interested  in  the  promotion 
and  protection  of  human  rights  across  the  world. 
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Human  Rights  in 
International  Relations 


Human  rights  is  the  soul  of  politics.  The  essence 
of  human  rights  is  the  idea  that  all  persons  possess 
equal  moral  worth,  that  social  order  exists  to  pre¬ 
serve  the  essential  humanity  of  its  members,  and 
that  therefore  the  exercise  of  all  forms  of  political 
authority  is  properly  bounded  by  its  impact  on 
fundamental  human  dignity.  In  contemporary 
international  society,  this  norm  has  become  the 
theoretical  basis  for  the  legitimacy  of  all  states — 
albeit  poorly  realized  in  practice.  The  emergence 
of  the  principle  and  practice  of  human  rights  is  an 
essential  area  of  international  law,  a  hallmark  of 
global  civil  society,  and  a  response  to  the  multilay¬ 
ered  challenges  of  globalization,  along  with  the 
persistence  of  state  abuse.  The  politics  of  human 
rights  also  provides  a  fascinating  test  of  the  power 
of  transnational  citizen  action  and  international 
cooperation  to  sporadically  transform  state  sover¬ 
eignty.  This  entry  traces  the  evolution  of  this 
emerging  norm,  charts  some  systematic  patterns 
of  violation,  assesses  the  range  of  international 
remedies,  and  discusses  challenges  to  the  concept 
and  its  application. 

Historical  Development  and 
Emerging  Consensus 

While  most  cultures  have  had  some  historic  stan¬ 
dard  of  humane  treatment  for  their  members,  the 
notion  of  universal  and  natural  rights  rises  with 
modernity  and  increasing  interactions  across  bor¬ 
ders.  The  early  arguments  put  forth  by  the  Spanish 
theologian  Bartolome  de  las  Casas  to  recognize  the 
essential  humanity  of  the  Indians  of  the  Americas 
culminated  several  centuries  later  in  the  transatlan¬ 
tic  movement  against  the  African  slave  trade,  which 
secured  its  abolition.  During  the  19th  century,  the 
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increase  in  costly  collisions  among  the  rising 
European  powers  led  to  recognition  of  the  horrors 
of  war  as  a  violation  of  universal  standards,  the 
establishment  of  the  International  Committee  of 
the  Red  Cross,  and  the  drafting  of  a  series  of  codes 
of  conduct  for  the  treatment  of  civilians  and  pris¬ 
oners:  the  Geneva  Conventions  (four  treaties 
developed  between  1864  and  1949).  Meanwhile, 
new  types  of  democratic  regimes  based  on  citizen¬ 
ship  and  social  contract  inscribed  the  protection  of 
life  and  liberty  as  a  requisite  of  rule  within  the 
state,  a  practice  that  was  later  exported.  The 
British  Magna  Carta  laid  the  foundation  in  limit¬ 
ing  the  sovereign’s  powers  of  coercion,  France’s 
Universal  Declaration  of  the  Rights  of  Man  and 
the  Citizen  asserted  rights  that  transcend  social 
class  and  condition,  while  the  U.S.  Constitution’s 
Bill  of  Rights  established  a  normative  hierarchy  in 
which  fundamental  rights  supersede  even  the 
democratic  popular  will. 

The  contemporary  international  human  rights 
regime  was  established  in  reaction  to  the  horrors  of 
the  Holocaust — the  first  modern-era  genocide  com¬ 
mitted  by  modern  means  within  Europe,  coupled 
with  border-crossing  war  crimes  and  international 
aggression.  The  Universal  Declaration  of  Human 
Rights,  signed  on  December  10,  1948,  is  widely 
considered  to  mark  the  birth  of  the  modern  norm 
of  international  human  rights.  Over  the  following 
decades,  a  global  architecture  was  constructed, 
consisting  of  an  expanding  body  of  international 
law,  global  and  regional  monitoring  and  sanction¬ 
ing  institutions,  an  emerging  practice  of  humani¬ 
tarian  intervention,  humanitarian  and  governance 
foreign  assistance,  and  a  growing  global  network 
of  human  rights  movements.  Jack  Donnelly  has 
calculated  that  more  than  140  states  now  sub¬ 
scribe  to  the  foundational  documents  that  delimit 
human  rights  in  theory:  the  Universal  Declaration 
and  two  International  Covenants — the  International 
Covenant  on  Civil  and  Political  Rights  (ICCPR) 
and  the  International  Covenant  on  Economic, 
Social,  and  Cultural  Rights  (ICESCR). 

While  human  rights  began  as  a  legal  defense  of 
the  lives  and  physical  integrity  of  political  dissi¬ 
dents  and  religious  or  ethnic  minorities  from  the 
malfeasance  of  dictatorships,  its  mandate  has 
expanded  in  claims,  subjects,  and  mechanisms. 
During  the  Cold  War,  the  Western  bloc  sought  to 
globalize  civil  and  political  rights  via  the  ICCPR, 


while  the  socialist  states  emphasized  social  rights 
such  as  education  and  health  care  under  the 
ICESCR.  Meanwhile,  developing  nations  added  to 
the  canon  a  concern  with  collective  rights  of  devel¬ 
opment,  identity,  and,  eventually,  attempts  to 
secure  accountability  of  nonstate  actors  such  as 
multinational  corporations  for  abuses  connected 
with  globalization.  From  decolonization  onward, 
the  increasing  presence  of  non-Western  states  in 
international  society  brought  diverse  calls  for 
expansion  of  the  human  rights  agenda  to  socially 
marginalized  as  well  as  politically  oppressed 
groups,  attention  to  government  negligence  as  well 
as  repression,  and  culturally  based  critiques  of  the 
liberal  norms.  Since  the  1990s,  the  claims  and  sub¬ 
jects  of  human  rights  have  been  expanded  to  previ¬ 
ously  unrecognized  groups  such  as  children  and 
indigenous  peoples.  Islamic  and  Confucian  cul¬ 
tural  critiques  of  the  liberal  Western  basis  of 
human  rights  were  largely  transcended  at  the 
global  level  at  the  1993  United  Nations’  (UN) 
Vienna  Conference  on  Human  Rights,  which 
declared  a  consensus  that  human  rights  are  “uni¬ 
versal,  indivisible,  and  interdependent”  (although 
such  challenges  reappear  regularly  in  selected 
states’  defenses  against  international  criticism).  By 
the  21st  century,  international  human  rights 
enforcement  mechanisms  had  been  strengthened 
significantly — but  only  in  the  core  domain  of  mas¬ 
sive  and  systematic  violations  of  life  and  liberty  by 
government  action  of  pariah  states. 

Defining  Rights  and  Setting  Standards 

Like  other  forms  of  international  law,  human 
rights  standards  are  a  melange  of  explicit  treaties, 
customary  international  law,  pilot  domestic  stan¬ 
dards,  and  scholarly  interpretation.  For  example,  in 
assessing  Spain’s  1998  request  for  the  extradition 
of  former  Chilean  dictator  Augusto  Pinochet  from 
Britain  to  face  charges  of  crimes  against  humanity, 
a  British  court  considered  its  obligations  under  the 
Convention  Against  Torture,  the  European  Court 
of  Human  Rights,  Chile’s  own  amnesty  and  claims 
of  sovereignty,  British  legislation,  and  expert  testi¬ 
mony.  The  Universal  Declaration  and  the  twin 
International  Covenants  are  considered  to  encom¬ 
pass  a  comprehensive  foundation,  with  more 
specific  standards  and  jurisdiction  developed  by 
the  more  widely  subscribed  phenomenon-based 
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treaties:  the  Geneva  Conventions  on  war  crimes, 
the  Convention  Against  Torture,  the  Genocide 
Treaty,  and  the  International  Convention  on  the 
Elimination  of  All  Forms  of  Racial  Discrimination. 
Another  set  of  treaties  is  linked  to  vulnerable  popu¬ 
lations;  they  establish  valuable  benchmarks  and 
monitoring  obligations  but  are  less  widely  accepted 
and  less  effective  sources  of  pressure  on  states:  the 
Convention  on  the  Elimination  of  Discrimination 
Against  Women,  the  Convention  on  the  Rights  of 
the  Child,  and  Refugee  Convention.  The  most 
recent  21st-century  international  norms,  the 
Elniversal  Declaration  on  the  Rights  of  Indigenous 
Peoples  and  the  UN  Convention  on  the  Rights  of 
Persons  with  Disability,  establish  new  populations 
of  international  concern  but  still  struggle  for  broad 
recognition. 

The  conceptual  core  of  human  rights  standards 
is  to  provide  protection  and  empowerment  against 
evolving  threats  to  human  dignity.  To  meet  this 
mandate,  fundamental  human  rights  must  be  uni¬ 
versal  and  inalienable — immune  from  contingent 
variations  in  geography  and  political  conditions. 
Such  rights  must  be  enforced  by  the  international 
system  that  provides  a  safety  net  for  uneven  citizen¬ 
ship  regimes,  incorporates  increasing  numbers  of 
displaced  persons  and  noncitizens,  and  acts  to 
check  rogue  states.  But  international  rights  are  also 
meaningless  without  international  duties.  In  this 
sense,  the  key  emerging  debate  on  enforcement  is 
the  Responsibility  to  Protect  doctrine  (approved  by 
the  UN  in  2005),  which  proposes  a  broad  but 
important  obligation  on  member  states  to  act  col¬ 
lectively  where  the  state  cannot  or  will  not,  in  cases 
of  massive  and  gross  violations  of  fundamental 
rights. 

Patterns  of  Abuse 

If  human  rights  violations  are  a  form  of  political 
pathology,  the  search  for  a  remedy  begins  with  the 
study  of  the  symptoms  and  analysis  of  the  causes. 
The  study  of  patterns  of  abuse  has  become  a  sub¬ 
field  of  international  relations  and  several  social 
sciences.  Violations  can  be  measured  and  moni¬ 
tored  by  a  combination  of  testimonies  of  survivors 
and  witnesses,  government  records,  country  case 
studies,  statistical  patterns  of  demographic  records 
and  potential  correlations  with  determinants  of 
abuse,  and  forensic  anthropology  of  gravesites. 


War  crimes  such  as  massacres  of  civilians  and 
torture  of  prisoners  of  war  are  often  associated 
with  counterinsurgency  struggles  and  participation 
of  paramilitary  forces  on  both  sides,  which  consti¬ 
tute  an  increasing  proportion  of  conflicts.  While 
state-sponsored  slavery  was  almost  eliminated  by 
the  20th  century  through  international  coopera¬ 
tion,  state-tolerated  forced  labor  by  private  parties 
along  with  associated  trafficking  in  persons  has 
actually  increased  in  the  21st  century,  in  tandem 
with  the  pressures  of  globalization.  Genocide,  the 
attempt  to  systematically  eliminate  a  population 
defined  by  identity  characteristics,  has  also  experi¬ 
enced  a  lamentable  revival  in  the  aftermath  of  the 
Cold  War.  Genocide  is  associated  with  recent 
defeat  in  international  war,  the  collapse  of  hege¬ 
monic  influence,  and  competitive  political  mobili¬ 
zation  by  “ethnic  entrepreneurs” — and  can  rarely 
be  attributed  primarily  to  “ancient  hatreds.” 
Moreover,  mass  expulsions  of  ethnic  cleansing 
often  spill  across  borders  to  create  new  zones  of 
conflict,  as  Rwandan  refugees  contributed  to  the 
epochal  war  in  the  Democratic  Republic  of  Congo 
(DRC).  Massive  numbers  of  political  prisoners  and 
gulag-like  prison  complexes  are  characteristic  of 
dictatorships,  especially  totalitarian  regimes  such 
as  communist  North  Korea  or  theocratic  Iran,  and 
increasingly  concentrated  within  such  regimes. 
However,  Amnesty  International  estimates  that 
around  half  of  all  states  practice  torture  with  some 
regularity,  and  this  includes  some  democracies — 
especially  recent  democracies  in  the  developing 
world.  High  levels  of  social  inequality  and  low 
levels  of  judicial  accountability  are  the  leading  fac¬ 
tors  associated  with  the  systematic  occurrence  of 
torture,  especially  in  Latin  America.  Discrimination 
by  race,  religion,  caste,  gender,  and  national  origin 
is  still  endemic  and  often  state  supported  in  South 
Asia  and  the  Middle  East,  while  even  some  demo¬ 
cratic  developed  countries  generally  respectful  of 
their  own  citizens  have  responded  to  high  levels  of 
migration  with  discriminatory  policies  that  violate 
international  standards. 

Remedies:  The  International 
Human  Rights  Regime 

The  ensemble  of  local,  national,  and  global  human 
rights  organizations  has  been  labeled  an  “interna¬ 
tional  human  rights  regime” — an  issue-specific 
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pattern  of  global  governance.  There  is  no  definitive 
enforcement  of  human  rights,  but  critical  combi¬ 
nations  of  various  forms  of  transnational  pressure 
have  resulted  in  significant  reforms,  such  as  the 
demise  of  the  apartheid  system  in  South  Africa. 
Human  rights  campaigns  are  often  catalyzed  by 
transnational  issue  networks  that  include  social 
movements,  international  organizations,  and 
experts,  in  fluid  habitual  exchanges  that  pressure 
states  from  above  and  below.  Margaret  Keck  and 
Kathryn  Sikkink  depict  human  rights  appeals  as  a 
“boomerang”  launched  by  a  powerless  civil  soci¬ 
ety  that  reaches  around  the  state  to  secure  transna¬ 
tional  pressure,  while  Thomas  Risse,  Stephen 
Ropp,  and  Kathryn  Sikkink  propose  a  phased 
“spiral  model”  in  which  such  pressure  evolves 
through  phases  of  state  denial,  normative  lip  ser¬ 
vice  without  policy  change,  then  limited  local 
empowerment,  more  systematic  reform,  and  even¬ 
tual  state  socialization  and  internalization  of 
human  rights  norms. 

At  the  global  level,  standard  setting  and  moni¬ 
toring  by  international  organizations  depend 
mostly  on  “naming  and  shaming” — but  the  vigor 
of  China’s  efforts  to  avoid  annual  censure  by  the 
United  Nations  Human  Rights  Commission 
(UNHCR),  including  generous  aid  packages  to 
influence  the  votes  of  member  states,  suggests  that 
such  symbolic  diplomacy  does  have  a  diffuse 
impact  on  violators.  In  addition  to  the  Commission 
(now  Council),  which  considers  country  situa¬ 
tions,  each  of  the  major  treaties  generates  a  mem¬ 
ber  state  reporting  regime,  such  as  the  ICCPR’s 
Human  Rights  Committee.  Issue-specific  UN  rap¬ 
porteurs  conduct  field  visits  and  issue  reports  on 
abuses  such  as  forced  disappearance,  which  can  be 
influential  in  international  or  bilateral  sanctions, 
or  even  secure  the  release  of  prisoners  highlighted 
by  the  reports.  At  the  regional  level,  the  European 
Union  (EU),  the  Organization  of  American  States 
(OAS),  and  the  African  Union  also  host  Human 
Rights  Commissions,  which  vary  correspondingly 
from  the  highest  to  lowest  level  of  activity,  in 
accessibility  to  individual  versus  state  complaints, 
and  in  mandated  impact  on  member  states’  domes¬ 
tic  policies.  Bilateral  diplomacy  can  also  serve  as 
an  important  source  of  signaling,  especially  by 
superpowers  with  influence  over  abusive  regimes, 
and  seems  particularly  useful  in  occasionally  free¬ 
ing  prominent  political  prisoners  or  protecting 


officially  stigmatized  ethnic  or  religious  groups 
(such  as  Soviet  Jews).  However,  diplomatic  pres¬ 
sure  alone  has  proven  relatively  unavailing  in 
addressing  broader  patterns  of  repression  imbri¬ 
cated  in  the  dominance  of  the  target  regime. 

Transnational  legal  processes  run  the  gamut 
from  a  global,  to  a  regional,  to  a  bilateral  exercise 
of  universal  jurisdiction.  The  International 
Criminal  Court  (ICC),  following  the  UN’s  postwar 
tribunals  on  Rwanda  and  the  former  Yugoslavia, 
is  the  only  permanent  universal  body  with  jurisdic¬ 
tion  over  war  crimes,  genocide,  and  crimes  against 
humanity.  Its  combination  of  an  autonomous 
Prosecutor  and  Security  Council  referral,  with  the 
standard  safeguard  that  it  can  act  only  when 
domestic  remedies  are  exhausted,  has  secured  the 
participation  of  more  than  100  member  states — 
but  not  the  United  States.  The  ICC  has  been  used 
mainly  for  war  crimes  in  Africa  since  its  2002 
activation.  The  EU  and  OAS  also  have  human 
rights  courts  with  contentious  jurisdiction  and  the 
ability  to  levy  sanctions  against  member  states. 
Since  treaties  such  as  the  Convention  Against 
Torture  and  domestic  mechanisms  such  as  the  U.S. 
Torture  Victims’  Protection  Act  grant  willing 
states  universal  jurisdiction  to  prosecute  foreign 
nationals  in  their  own  domestic  courts  for  crimes 
against  humanity  committed  abroad,  several  states 
have  also  secured  national-level  criminal  or  civil 
judgments  against  fallen  dictators  and  war  crimi¬ 
nals  (such  as  Argentine  generals).  The  Spanish 
judge  Baltasar  Garzon,  who  brought  the  Pinochet 
case,  has  been  particularly  active  in  testing  the 
parameters  of  transnational  legal  accountability  in 
a  variety  of  cases. 

Global  economic  sanctions  have  been  issued  in 
a  few  of  the  most  egregious  cases  of  gross  viola¬ 
tions  by  pariah  states  but  have  sometimes  had  a 
contradictory  humanitarian  impact  even  where 
consensus  was  achieved.  Bilateral  diplomacy  and 
sanctions  have  been  more  widespread  and  influen¬ 
tial,  including  trade  and  investment  limitations, 
foreign  aid  conditionality,  and  limitations  on  mili¬ 
tary  assistance  to  repressive  regimes.  While  the 
U.S.  Office  of  the  Treasury  sanctions  dozens  of 
states  each  year,  only  a  proportion  of  sanctions  are 
based  on  human  rights  violations — but  they  include 
gross  violators  such  as  Burma,  North  Korea,  and 
Sudan.  Although  U.S.  foreign  assistance  legisla¬ 
tion,  such  as  the  Kennedy,  Harkin,  and  Leahy 
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amendments,  specifies  that  neither  development 
nor  military  assistance  should  be  given  to  viola¬ 
tors,  these  limits  have  been  very  unevenly  enforced 
and  often  overridden  on  national  security  grounds 
by  the  executive;  however,  the  threat  of  suspension 
of  aid  has  occasionally  secured  investigations, 
accountability,  and  even  a  reduction  in  violations 
by  client  states.  The  corresponding  carrot  for  the 
stick  of  economic  sanctions  is  human  rights- 
directed  foreign  assistance,  such  as  governance  aid 
and  support  for  global  or  recipient  country  human 
rights  organizations.  The  Nordic  countries,  the 
Netherlands,  and  Canada  have  vital  and  effective 
programs  of  human  rights  foreign  aid,  training, 
and  civil  society  promotion. 

The  ultimate  enforcement  of  human  rights  is 
armed  intervention  against  the  violating  authority. 
While  the  UN  Security  Council  can  authorize 
intervention  in  response  to  massive  crimes  against 
humanity,  in  practice  the  UN  has  acted  only  in  the 
aftermath  of  regime  collapse  or  conflict  resolu¬ 
tion — and  UN  peacekeeping  forces  on  the  ground 
as  violations  unfolded  notably  failed  to  protect 
victims  in  Bosnia,  Rwanda,  or  Timor.  More  mus¬ 
cular  interventions  by  the  North  Atlantic  Treaty 
Organization  (NATO)  against  abusive  regimes  did 
halt  violence  and  contribute  to  subsequent  return 
of  refugees  in  Kosovo  and  Afghanistan  but  with 
controversial  civilian  casualties.  Attempts  to 
regionalize  intervention  in  African  conflicts  have 
foundered  most  recently  in  Darfur.  Unilateral 
interventions  without  multilateral  consensus  are 
often  questioned  as  a  selective  rationalization  of 
national  interest  on  spurious  humanitarian 
grounds,  but  it  must  be  noted  that  the  countervail¬ 
ing  interests  of  neighboring  powers  afflicted  by 
spillover  violence  have  been  the  only  check  on 
some  of  the  worst  violations,  such  as  the  Cambodian 
genocide  halted  by  a  Vietnamese  invasion  or  the 
murderous  regime  of  Uganda’s  Idi  Amin  displaced 
by  Tanzania. 

International  human  rights  movements  moni¬ 
tor,  lobby,  protest,  and  reform  human  rights 
around  the  world — usually  in  conjunction  with 
local  organizations  and  victims.  The  flagship 
global  organization  Amnesty  International,  estab¬ 
lished  in  1961  to  advocate  for  “prisoners  of  con¬ 
science,”  now  comprises  millions  of  members  in 
more  than  150  countries — and  has  secured  the 
freedom  of  around  one  third  of  its  subjects  from 


diverse  regimes.  Along  with  peers  such  as  Human 
Rights  Watch,  Amnesty’s  nonpartisan  annual 
reports  on  country  conditions  and  monitoring 
of  conflict  situations  and  emerging  abuses  often 
influence  global  and  bilateral  condemnations  and 
sanctions.  Country-specific  campaigns,  increasingly 
mobilized  via  the  Internet,  are  credited  with  shield¬ 
ing  some  local  advocates  and  pressuring  vulnerable 
repressive  governments  for  limited  reform.  An  addi¬ 
tional  element  of  advocacy  mobilizes  transnational 
identities,  such  as  trade  unionists  defending  their 
threatened  colleagues  in  Colombia,  Christians  press¬ 
ing  for  religious  freedom  in  China,  and  women’s 
groups  campaigning  to  have  genocidal  rape  desig¬ 
nated  as  a  war  crime  by  the  International  Criminal 
Court.  Local  human  rights  movements  generally 
seek  to  transcend  political  and  ascriptive  identities 
and  derive  their  power  from  a  combination  of 
Gandhian  nonviolent  collective  action  and  global¬ 
ized  information  politics.  Such  protests  contrib¬ 
uted  strongly  to  transitions  to  democracy  in  many 
Latin  American  countries,  South  Africa,  and  the 
Philippines,  and  former  political  prisoners  became 
the  first  democratic  leaders  of  Poland,  the  Czech 
Republic,  South  Korea,  and  Timor.  Human  rights 
campaigns  dedicated  to  identifying  new  genres  of 
transnational  abuse  have  introduced  new  forms  of 
leverage  through  boycotts  and  new  mechanisms  of 
global  governance  such  as  codes  of  conduct  for 
multinationals  to  limit  labor  abuses  and  an  inter¬ 
national  certification  scheme  to  stem  the  trade  in 
“blood  diamonds,”  whose  illicit  export  contrib¬ 
uted  to  war  crimes  in  several  notorious  African 
conflicts. 

Human  Rights  Reform 

When  the  international  human  rights  regime  is  suc¬ 
cessful,  states  engage  in  human  rights  reform. 
Human  rights  reform  can  be  studied  from  a  policy 
design,  comparative,  or  transnational  perspective — 
since  such  reforms  have  often  been  diffused  or 
implemented  by  global  as  well  as  national  institu¬ 
tions.  Retrospective  accountability  for  abuses 
committed  in  a  prior  war  or  dictatorship  is  labeled 
“transitional  justice”  and  generally  consists  of 
some  combination  of  investigation  (“truth  commis¬ 
sions”),  adjudication,  and  reparations.  Human 
rights  literature  is  replete  with  debates  on  the  effec¬ 
tiveness  and  impact  on  regime  stability  of  different 
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forms  of  transitional  justice,  from  Argentina’s 
maximalist  trials  of  former  military  rulers  that  led 
to  rebellions  to  South  Africa’s  explicit  trade-off  of 
truth  for  justice.  Postconflict  accountability  is 
increasingly  implemented  by  a  hybrid  blend  of 
global  and  national  institutions,  as  in  Cambodia 
and  Sierra  Leone,  which  marks  an  evolution  of 
sovereignty  and  international  law. 

New  democracies  and  regimes  responding  to 
civil  rights  campaigns  against  chronic  discrimina¬ 
tion  also  establish  forward-looking  institutions 
and  policies  to  improve  citizen  rights.  National 
human  rights  institutions  for  domestic  monitor¬ 
ing  of  policies  as  well  as  practices  relevant  to 
human  rights,  mandated  by  the  UN  under  the 
Paris  Principles,  were  originally  established  by 
Western  European  democracies  but  have  diffused 
to  dozens  of  states  in  Latin  America  and  Asia. 
Such  human  rights  institutes  sometimes  overlap 
with  and  sometimes  replace  ombudsman  offices, 
which  are  empowered  to  independently  investi¬ 
gate  and  represent  citizens  (and  in  some  cases, 
migrants)  against  allegations  of  abuse  by  any  gov¬ 
ernment  agency.  Finally,  programs  of  human 
rights  education  for  agents  of  authority,  vulnera¬ 
ble  groups,  schoolchildren,  or  the  general  popula¬ 
tion  are  promoted  by  global  agencies  such  as  the 
UN  High  Commissioner  for  Human  Rights, 
regional  bodies  such  as  the  Organization  for 
Security  and  Co-operation  in  Europe’s  High 
Commissioner  on  National  Minorities,  and 
national  governments  seeking  to  implement  inter¬ 
national  human  rights  norms. 

Challenges 

While  the  struggle  for  fundamental  human  dignity 
has  advanced  tremendously  in  the  years  since  the 
Universal  Declaration,  challenges  to  the  theory 
and  practice  of  human  rights  continue.  Since 
human  rights  are  based  in  a  liberal  legal  model  of 
the  primacy  of  the  public  individual,  cultural, 
sociological,  and  even  feminist  critiques  of  univer- 
salism  question  the  rights’  relevance  for  the 
pursuit  of  disparate  collective  social  goals  and 
identities.  During  eras  of  renewed  conflict,  espe¬ 
cially  the  unconventional  transnational  threat  of 
terrorism,  states  revive  old  arguments  about 
national  security  trumping  inalienable  rights — and 
human  rights  scholarship  contests  the  rationale 


and  record  of  such  logics.  Critics  and  campaigners 
alike  note  the  increasing  salience  of  abuses  com¬ 
mitted  by  private  actors  who  are  not  accountable 
under  the  “government  and  opposition”  model 
the  international  human  rights  regime  was  created 
to  address:  from  multinational  sweatshops,  to  pri¬ 
vate  security  contractors,  to  patriarchal  religious 
authorities.  Moreover,  many  abusive  situations 
result  from  a  lack  rather  than  a  surfeit  of  state 
authority.  Human  rights  advocates  and  theorists 
respond  that  these  concerns  suggest  the  need  to 
expand  human  rights  norms  and  mechanisms,  not 
abandon  them.  While  human  rights  has  not  halted 
“death  by  government”  or  other  systematic  abuses 
of  power,  it  has  proven  to  be  a  resilient  tool  for  the 
voiceless  and  vulnerable  to  often  endure  and  occa¬ 
sionally  prevail. 
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Hybrid  Regimes 


The  notion  of  hybrid  regimes — that  is,  regimes 
that  were  characterized  by  the  presence  at  the 
same  time  of  key  aspects  of  different  models  or 
types  of  regimes — has  been  present  in  the  classic 
political  science  literature  for  years  (Samuel  E. 
Finer,  1970).  The  basic  change  in  recent  decades 
is  the  increase  in  the  number  and  variety  of  hybrid 
regimes  as  a  side  effect  of  the  widespread  phenom¬ 
enon  of  democratization  in  all  areas  of  the  world. 
This  entry  clarifies  the  notion,  points  out  how  to 
distinguish  hybrid  regimes  from  transitional 
phases,  and  builds  on  existing  empirical  research 
to  show  the  varieties  of  hybridism  in  contempo¬ 
rary  times. 

Meaning 

As  regards  the  term  regime,  consideration  is  given 
here  to  all  government  institutions  and  norms  that 
are  either  formalized  or  are  informally  recognized 
as  existing  in  a  given  territory  with  respect  to  a 
given  population.  While  formal  and  informal  insti¬ 
tutions  may  no  longer  configure  some  form  of 
nondemocracy  and  do  not  yet  configure  a  complete 
democracy,  they  still  bear  traces  of  the  previous 
political  reality.  In  addition,  to  have  something  that 
may  be  labeled  a  regime,  at  least  minimal  stabiliza¬ 
tion  is  needed.  This  is  the  key  aspect  differentiating 
a  regime  from  transition  phases,  which  are  charac¬ 
terized  by  short  duration.  The  precise  length  of 
time  that  is  considered  necessary  before  classifying 
something  as  a  regime  rather  than  a  transitional 
phase  is  arbitrarily — but  reasonably — stipulated. 

Such  a  regime  does  not  fulfill  the  minimalist 
requirements  of  a  democracy,  such  as  (a)  universal 
suffrage,  both  male  and  female;  (b)  free,  competi¬ 
tive,  recurrent,  and  fair  elections;  (c)  more  than 
one  party;  and  (d)  different  and  alternative  media 
sources.  One  important  aspect  of  this  definition  is 
that  in  the  absence  of  just  one  of  these  require¬ 
ments,  or  if  at  some  point  one  of  them  is  no  longer 
met,  there  is  no  longer  a  democratic  regime  but 
another  political  and  institutional  setup  marked  by 
varying  degrees  of  uncertainty  and  ambiguity. 
Moreover,  in  a  democracy,  be  it  only  minimal, 
institutions  and  rights  should  not  be  subject  to,  or 
conditioned  by,  nonelected  actors  or  exponents  of 


other  external  regimes.  The  former  refers  to  the 
armed  forces,  religious  hierarchies,  economic  oli¬ 
garchies,  a  hegemonic  party,  or  even  a  monarch 
with  pretensions  to  influencing  decision-making 
processes  or  the  overall  functioning  of  the  democ¬ 
racy;  in  the  latter  case,  a  regime  might  be  condi¬ 
tioned  by  an  external  power  that  deprives  the 
democracy  of  its  independence  and  sovereignty  by 
pursuing  nondemocratic  policies. 

In  addition,  such  a  regime  does  not  have  the  key 
aspects  of  nondemocratic  regimes,  and  a  basic  ref¬ 
erence  must  be  made  at  least  to  traditional  and 
authoritarian  regimes.  The  former  are  based  on  the 
personal  power  of  the  sovereign,  who  binds  his 
underlings  in  a  relationship  of  fear  and  reward; 
they  are  typically  legibus  soluti  regimes,  where  the 
sovereign’s  arbitrary  decisions  are  not  limited  by 
norms  and  do  not  need  to  be  justified  ideologically. 
Power  is  thus  used  in  particularistic  forms  and  for 
essentially  private  ends.  In  these  regimes,  the 
armed  forces  and  police  play  a  central  role,  while 
there  is  an  evident  lack  of  developed  ideology  and 
mass  mobilization.  Basically,  then,  the  political 
setup  is  dominated  by  traditional  elites  and  institu¬ 
tions.  As  for  the  authoritarian  regimes,  the  defini¬ 
tion  advanced  by  Juan  Linz  is  still  the  most  useful 
one:  a  political  system  with  limited,  nonresponsible 
political  pluralism;  without  an  elaborated  and 
guiding  ideology  but  with  distinctive  mentalities; 
without  either  extensive  or  intense  political  mobili¬ 
zation,  except  at  some  points  in  their  development; 
and  in  which  a  leader,  or,  occasionally,  a  small 
group,  exercises  power  from  within  formally  ill- 
defined,  but  actually  quite  predictable,  limits. 
However,  with  respect  to  such  a  definition,  which 
identifies  five  significant  dimensions — limited  plu¬ 
ralism,  distinctive  values,  low  political  mobilization, 
a  small  leading  group,  ill-defined  but  predictable 
limits  to  citizens’  rights — we  need  to  stress  the  con¬ 
straints  imposed  on  political  pluralism  within  a 
society  that  has  no  recognized  autonomy  or  inde¬ 
pendence  as  well  as  no  effective  political  participa¬ 
tion  of  the  people,  with  the  consequent  exercise  of 
various  forms  of  state  suppression. 

Having  considered  definitions  for  minimalist 
democracy,  traditional  regime,  and  authoritarian¬ 
ism,  it  is  now  possible  to  define  hybrid  regimes.  As 
suggested  by  Terry  L.  Karl  (1995)  in  relation  to 
some  Latin  American  countries,  they  may  be  char¬ 
acterized  by  “uneven  acquisition  of  procedural 
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requisites  of  democracy,”  without  “civilian  con¬ 
trol  over  the  military,”  with  sectors  of  the  popula¬ 
tion  that  “remain  politically  and  economically 
disenfranchised”  and  with  a  “weak  judiciary” 
(p.  80).  But  this  definition  refers  only  to  authori- 
tarianisms  that  partially  lose  some  of  their  key 
characteristics,  retain  some  authoritarian  or  tradi¬ 
tional  features,  and  at  the  same  time  acquire  some 
of  the  characteristic  institutions  and  procedures  of 
democracy.  A  hybrid  regime,  on  the  other  hand, 
may  also  have  a  set  of  institutions  where,  going 
down  the  inverse  path,  some  key  elements  of 
democracy  have  been  lost  and  authoritarian  char¬ 
acteristics  have  been  acquired.  Thus,  an  adequate 
definition  has  to  be  completed  by,  for  example, 
including  some  of  the  aspects  that  can  be  recurrent 
in  authoritarian  situation,  such  as  the  existence  of 
incumbents  who  routinely  abuse  state  resources, 
deny  the  opposition  fair  media  coverage,  harass 
opposition  candidates  and  their  supporters,  and  in 
some  cases  manipulate  electoral  results. 

This  discussion,  however,  prompts  reflection 
about  two  elements.  First,  a  hybrid  regime  is  always 
a  set  of  ambiguous  institutions  that  maintain  aspects 
of  the  past.  Second,  it  is,  consequently,  a  “corrup¬ 
tion”  of  the  preceding  regime,  lacking  as  it  does  one 
or  more  essential  characteristics  of  that  regime  but 
also  failing  to  acquire  other  characteristics  that 
would  make  it  fully  democratic  or  authoritarian. 
Consequently,  the  term  hybrid  can  be  applied  to  all 
regimes  preceded  by  a  period  of  authoritarian  or 
traditional  rule  followed  by  the  beginnings  of 
greater  tolerance,  liberalization,  and  a  partial  relax¬ 
ation  of  the  restrictions  on  pluralism  or  all  regimes 
that,  following  a  period  of  minimal  democracy  in 
the  sense  indicated  above,  are  subject  to  the  inter¬ 
vention  of  nonelected  bodies — the  military,  above 
all — that  place  restrictions  on  competitive  pluralism 
without,  however,  creating  a  more  or  less  stable 
authoritarian  regime.  Moreover,  in  disentangling 
empirical  realities  that  fit  the  previously  formulated 
definition  of  the  hybrid  regime  from  different  tran¬ 
sitional  situations,  it  should  be  stressed  that  there 
has  been  some  sort  of  stabilization  or  duration, 
perhaps  at  least  a  decade,  of  those  ambiguous, 
uncertain  institutional  setups.  Consequently,  to 
avoid  a  misleading  analysis  of  democratization  pro¬ 
cesses,  a  hybrid  regime  can  be  defined  as  a  set  of 
institutions  that  have  been  persistent,  whether  stable 
or  unstable,  for  about  a  decade;  have  been  preceded 


by  authoritarian  rule,  a  traditional  regime  (possibly 
with  colonial  characteristics),  or  even  a  minimal 
democracy;  and  are  characterized  by  the  breakup  of 
limited  pluralism  and  forms  of  independent,  auton¬ 
omous  participation,  but  with  the  absence  of  at  least 
one  of  the  four  aspects  of  a  minimal  democracy. 

A  Classification 

On  the  basis  of  the  previous  definition,  then,  a  cru¬ 
cial  aspect  of  hybrid  regimes  is  that  there  may  be 
(or  there  may  emerge)  veto  players — that  is,  indi¬ 
vidual  or  collective  actors  who  are  influential  or 
decisive  in  maintaining  the  regime  in  its  character¬ 
istic  state  of  ambiguity  and  uncertainty.  These 
actors  may  be  an  external,  foreign  power  that 
interferes  in  the  politics  of  the  nation;  a  monarch  or 
authoritarian  ruler  who  has  come  to  power  through 
more  or  less  violent  means;  the  armed  forces;  a 
hegemonic  party  run  by  a  small  group  or  a  single 
leader;  religious  hierarchies;  economic  oligarchies; 
and  other  powerful  groups  or  a  mixture  of  such 
actors,  who  are  either  unable  or  unwilling  to  elimi¬ 
nate  other  prodemocratic  actors,  assuming  that  in 
the  majority  of  current  hybrid  regimes  the  alterna¬ 
tive  is  between  democracy  and  nondemocracy. 

From  this  perspective,  an  effective  classification 
should  focus  on  the  legacy  of  the  previous  regime 
and  the  constraints  preventing  a  country  from 
becoming  a  minimal  democracy.  Thus,  the  types  of 
hybrid  regimes  that  develop  depend  directly  on  the 
classifications  of  authoritarian  regimes  and  democ¬ 
racies  that  have  already  been  established  and  the 
factors  that  prevent  democratic  change.  The  core 
assumption  of  this  typology  is  that  traditional  and 
democratic  regimes  can,  by  virtue  of  their  charac¬ 
teristics,  give  rise  to  different  results,  while  it  is 
more  likely  that  the  survival  of  authoritarian  veto 
players  points  toward  a  single  solution. 

However,  when  switching  from  theoretical 
hypotheses  to  empirical  analysis  (see  Leonardo 
Morlino,  2009),  three  analytic  axes  emerge  with 
clarity:  a  first  one  where  electoral  process,  political 
pluralism  and  participation,  freedom  of  expression 
and  beliefs,  and  freedom  of  association  and  orga¬ 
nization  are  grouped  together  closely;  a  second 
where  state  functioning  stands  alone;  and  a  third 
one,  conceptually  close  to  the  second  one,  where 
rule  of  law  and  personal  autonomy  are  also 
strongly  connected.  Additional  empirical  analysis 
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shows  how  three  models  can  result  by  merging  the 
second  and  third  components  also.  Thus,  the  first 
model  of  hybrid  regime,  termed  limited  democracy, 
derives  from  the  first  component  and  is  character¬ 
ized  by  universal  suffrage,  a  formally  correct  elec¬ 
toral  procedure,  and  elective  posts  occupied  on  the 
basis  of  elections  and  a  multiparty  system  but 
where  civil  rights  are  constrained  by  the  police  or 
other  effective  forms  of  suppression.  Consequently, 
there  is  no  actual  political  opposition  and,  above 
all,  the  media  are  compromised  by  a  situation  of 
monopoly  to  the  point  that  part  of  the  population 
is  effectively  prevented  from  exercising  their  rights. 
The  notion  of  limited  democracy  is  also  well  devel¬ 
oped  by  Wigell  (2008).  The  second  empirical  model 
is  that  of  democracies  without  law  or  democracy 
without  a  state,  as  the  state  can  be  conceived  of  as 
a  government  based  on  the  primacy  of  the  law, 
where  there  are  no  relevant  legacies  or  powerful 
veto  players  or  any  forms  of  state  suppression  or 
nonguarantee  of  rights  but  simply  a  situation  of 
widespread  illegality  in  which  the  state  is  incapable 
of  performing  properly  due  to  poorly  functioning 
institutions.  Limited  democracies  and  democracies 
without  state  are  confirmed  as  empirically  relevant 
classes  for  which  different,  contrasting  elements 
need  to  be  stressed:  on  the  one  hand,  the  lack  of  an 
effective  guarantee  of  rights  despite  the  presence  of 
state  institutions  and,  on  the  other  hand,  the  lack  of 
the  rule  of  law  and  of  a  functioning  state,  with  laws 
that  are  not  applied  because  the  judiciary  has  no 
effective  independence,  there  is  widespread  corrup¬ 
tion,  and  the  bureaucracy  is  flawed  and  inefficient. 
The  third  model  suggested  by  empirical  research  is 
that  of  quasi  democracies  characterized  by  regimes 
where  all  main  aspects  mentioned  above  are  deeply 
rooted  in  ambiguity:  There  is  illegality  and  at  the 
same  time  partial  constraints  in  the  effective  guar¬ 
antee  of  rights.  This  model  is  the  most  empirically 
relevant  one  (see  Morlino,  2009). 

Conclusion 

On  the  grounds  of  existing  empirical  analyses, 
hybrid  regimes  are  a  reality  and  empirically  sup¬ 
port  an  autonomous  model  of  regime  vis-a-vis 
democracy,  authoritarianism,  and  the  traditional 
regime.  The  three  models  of  hybridism  developed 
through  empirical  analysis  suggest  that  the  most 
significant  empirical  problem  is  to  ensure  the  exis¬ 
tence  of  institutions  that  are  capable  of  performing 


their  functions.  Hence,  there  are  countries  where 
strengthening  of  the  state  and/or  promoting  more 
effective  guarantee  of  rights  might  transform  the 
regimes  into  democracies. 

Leonardo  Morlino 
LUISS  Guido  Carli 
Rome,  Italy 

See  also  Authoritarian  Regimes;  Democracy,  Theories  of; 
Democratization;  International  Regimes;  Transition 
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Hypothesis  Testing 


The  primary  means  of  conveying  the  strength  of 
empirical  findings  in  political  science  is  the  null 
hypothesis  significance  test  (NHST).  This  para¬ 
digm,  along  with  its  strengths  and  weaknesses,  is 
therefore  important  for  nearly  every  quantitative 
study  in  political  science.  This  entry  reviews  the 
current  hypothesis  testing  paradigm  and  its  his¬ 
tory,  discusses  the  underused  idea  of  statistical 
power  from  tests,  and  points  out  some  common 
misinterpretations  of  hypothesis  testing. 

The  Current  Paradigm:  Null  Hypothesis 
Significance  Testing 

The  current  approach  to  hypothesis  testing  in  all  of 
the  social  sciences  is  a  synthesis  of  the  Fisher  test 
of  significance  and  the  Neyman-Pearson  hypothe¬ 
sis  test.  In  this  20th-century  procedure,  two 
hypotheses  are  set  forth:  a  null  or  restricted 
hypothesis,  H0,  which  is  set  against  an  alternative 
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or  research  hypothesis,  Hv  Thus,  they  are  sup¬ 
posed  to  describe  two  complementary  notions 
about  some  political  science  phenomenon  of  inter¬ 
est.  The  research  hypothesis  is  the  probability 
model  that  describes  the  author’s  belief  about  this 
phenomenon  and  is  typically  operationalized 
through  statements  about  an  unknown  parameter 
0  G  ©.  In  the  most  basic  and  common  setup,  the 
null  hypothesis  asserts  that  0  =  0,  and  the  research 
hypothesis  asserts  that  0  A  0.  Such  a  two-sided  test 
is  the  overwhelming  default  in  assessing  the  statis¬ 
tical  reliability  of  individual  regression  parameters. 

Once  the  hypotheses  are  established,  a  test  sta¬ 
tistic  T,  some  function  of  0  and  the  data,  is  calcu¬ 
lated  and  assessed  with  the  distribution  under  the 
assumption  that  H0  is  true.  Commonly  used  test 
statistics  are  sample  means,  X ;  chi-square  statistics 
from  tabular  analysis,  x2;  and  t  statistics  in  linear 
and  generalized  linear  models.  Note  that  the  sam¬ 
ple  space  of  the  test  statistic  must  correspond  to  the 
support  of  the  specified  null  and  alternative  distri¬ 
butions.  The  key  idea  is  that  test  statistics  that 
appear  to  be  “unusual”  for  the  null  distribution 
(e.g.,  those  in  the  tails)  cast  doubt  on  the  original 
assumption  that  this  is  the  true  distribution. 

The  test  procedure  <f>  assigns  one  of  two  deci¬ 
sions,  D0  and  D  p  to  all  possible  values  in  the  sample 
space  of  the  statistic  T,  corresponding  to  supporting 
either  H0  or  Hv  respectively.  The  p  value  (also 
called  the  associated  probability)  is  equal  to  the  area 
in  the  tail  (or  tails)  of  the  assumed  distribution 
under  H0,  which  starts  at  the  point  determined  by 
T  on  the  horizontal  axis  and  continues  to  positive  or 
negative  infinity.  If  a  predetermined  significance 
level  a  has  been  specified,  then  H0  is  rejected  for 
p  values  less  than  a;  otherwise,  the  p  value  itself  is 
reported.  More  formally,  the  sample  space  of  T  is 
split  into  two  complementary  regions,  S0  and  .S' , , 
such  that  the  probability  that  T  falls  in  Sv  causing 
decision  Dp  is  either  a  predetermined  null  hypoth¬ 
esis  cumulative  distribution  function  (CDF)  level 
(a  =  size  of  the  test,  Neyman-Pearson)  or  the  CDF 
level  corresponding  to  the  value  of  the  test  statistic 
under  H0  is  reported  as  follows: 

p  value  =  j  Ph0(T  —  t)dt  (Fisher). 

Sl 

Thus,  decision  D1  is  made  if  the  test  statistic  is 
sufficiently  atypical  given  the  distribution  under 


H0.  This  process  is  illustrated  for  a  one-tailed  test 
at  a  =  .05  in  Figure  1. 

Historical  Background 

The  current  NHST  is  a  blend  of  two  extremely 
important  but  incompatible  schools  of  thought  in 
20th-century  statistics.  R.  A.  Fisher  developed  a 
procedure  that  gives  significance  levels  from  the 
data  with  the  intention  of  possibly  “nullifying”  a 
straw  man  hypothesis,  but  Neyman  and  Pearson  set 
up  a  very  rigid  decision-making  process  to  arrive  at 
one  of  two  possible  competing  conclusions. 

Fisher’s  Test  of  Significance 

Originally  Fisher,  in  1935,  developed  his  test  of 
significance  in  the  context  of  randomized  experi¬ 
ments,  where  the  physical  act  of  randomization  of 
units  to  treatment  and  control  group  is  the  basis 
for  unbiased  inference.  Fisher  posited  a  single  and 
sharp  null  hypothesis,  H0.  The  simplest  and  most 
popular  H0  is  that  the  treatment  effect  is  zero  for 
all  units.  Note  that  this  H0  is  more  restrictive  than 
the  usually  tested  null  hypothesis  that  the  average 
effect  is  zero  across  the  sample.  Under  the  sharp 
H0,  all  (potential)  values  of  the  outcome  variable 
under  treatment  and  control  can  be  either  observed 
or  inferred  for  each  unit.  Hence,  the  distribution  of 


Distribution  of  the  test  statistic:  T 


Figure  I  Null  Hypothesis  Significance  Testing 
Illustrated 
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the  statistic  T  is  assumed  to  be  known.  Since  the 
distribution  of  T  is  solely  determined  by  random¬ 
ization,  it  is  called  the  randomization  distribution 
of  T.  Using  this  distribution,  one  can  compare  the 
observed  test  statistic  against  its  distribution  under 
the  null  hypothesis.  An  observed  value  that  is 
“very  unlikely,”  given  the  null  hypothesis  and  the 
implied  randomization  distribution,  is  taken  as 
evidence  against  the  null  hypothesis  in  what  is 
essentially  a  stochastic  version  of  the  “proof  by 
contradiction,”  according  to  Guido  Imbens  and 
Don  Rubin.  More  specifically,  as  the  statistic 
moves  away  from  its  conditional  expected  value, 
E(T\H0),  H0  becomes  progressively  less  plausible 
(less  likely  to  occur  by  chance).  However,  as  is 
shown  below,  the  stochastic  nature  of  this  proof 
by  contradiction  creates  some  logical  fallacies.  The 
relationship  between  T  and  the  level  of  significance 
produced  by  the  test  is  established  by  the  density 
outside  the  threshold  established  by  T  (one-  or 
two-tailed),  going  away  from  the  density  region 
containing  the  expected  value  of  T  given  H0.  The 
outside  density  is  the  p  value,  also  called  the 
achieved  significance  level.  Fisher  hypothesis  test¬ 
ing  is  therefore  summarized  by  the  following  steps: 

1.  Identify  the  null  hypothesis. 

2.  Determine  the  appropriate  test  statistic  and  its 
distribution  under  the  assumption  that  the  null 
hypothesis  is  true. 

3.  Calculate  the  test  statistic  from  the  data. 

4.  Determine  the  achieved  significance  level  that 
corresponds  to  the  test  statistic  using  the 
distribution  under  the  assumption  that  the  null 
is  true. 

5.  Reject  H0  if  the  achieved  significance  level  is 
sufficiently  small.  Otherwise,  reach  no  firm 
conclusion. 

The  Fisher  construct  naturally  leads  to  the  ques¬ 
tion  of  what  p  value  is  sufficiently  small  as  to  war¬ 
rant  rejection  of  the  null  hypothesis.  This  measure 
of  extremeness  or  what  is  considered  as  unlikely 
should  be  based  on  the  specific  application  but 
involves  necessarily  some  arbitrary  judgments. 
Fisher  established  the  CDF  thresholds  of  .10,  .05, 
and  .01  as  “convenient”  and  nothing  more. 
Therefore,  there  is  absolutely  no  scientific  basis  for 


making  hypothesis  test  decisions  at  these  levels 
whatsoever;  they  have  merely  become  convention. 
This  suggests  a  great  amount  of  caution  about 
making  decisions  about,  say,  .049  versus  .051,  a 
difference  that  is  typically  swamped  by  measure¬ 
ment  error  in  the  social  sciences  anyway. 

Neyman  and  Pearson  Hypothesis  Testing 

During  the  same  period  when  Fisher  was  devel¬ 
oping  his  exact  test  of  significance  based  on  the 
sharp  H0,  Neyman  and  Pearson  were  interested  in 
methods  to  estimate  average  treatment  effects, 
such  as  0  =  0  over  the  sample,  and  to  construct 
confidence  intervals  around  this  estimate.  They 
rejected  Fisher’s  idea  that  only  the  sharp  null 
hypothesis  needs  to  be  tested.  Since  only  the 
assumption  of  the  sharp  null  hypothesis  allowed 
Fisher  to  derive  the  exact  randomization  distribu¬ 
tion  of  T,  Neyman  and  Pearson  can  only  derive 
certain  properties  of  this  distribution.  This  is  the 
price  they  have  to  pay  to  allow  “nonsharp” 
hypotheses  about  average  differences  between, 
say,  treatment  and  control  groups.  Neyman  and 
Pearson  argue  that  a  more  useful  procedure  is  to 
propose  two  complementary  hypotheses:  ©A  and 
&B  (or  a  class  of  &B,  such  that  @A  U  ©A  —  0), 
which  need  not  be  labeled  “null”  or  “alternative” 
but  often  are  purely  for  convenience.  Furthermore, 
Neyman  and  Pearson  point  out  that  one  can  posit 
a  hypothesis  and  consecutively  test  multiple  admis¬ 
sible  alternatives  against  this  hypothesis.  Since 
there  are  now  two  competing  hypotheses  in  any 
one  test,  Neyman  and  Pearson  can  define  an  a 
priori-selected  a,  the  probability  of  falsely  reject¬ 
ing  @A  under  the  assumption  that  H0  is  true,  and 
3,  the  probability  of  failing  to  reject  @A  when  H0  is 
false.  By  convention,  the  first  mistake  is  called  a 
Type  I  error,  and  the  second  mistake  is  called  a 
Type  II  error.  Note  that  a  and  3  are  probabilities 
conditional  on  two  mutually  exclusive  events:  a  is 
conditional  on  the  null  hypothesis  being  true,  and 
3  is  conditional  on  the  null  hypothesis  being  false. 
A  more  useful  quantity  than  3  is  1  -  3>  which 
Neyman  and  Pearson  (1933,  1936)  call  the  power 
of  the  test:  the  long-run  probability  of  accurately 
rejecting  a  false  null  hypothesis  given  a  point  alter¬ 
native  hypothesis. 

In  this  setup,  we  want  to  develop  the  test  that 
has  the  highest  power  for  a  given  a  priori  a.  To 
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state  this  more  precisely,  let  the  test  function  4> 
take  on  the  values  1  if  D,  is  reached  under  ©B  and 
0  if  D0  is  reached  under  ©A.  Hence,  the  set  of  data 
points  or  values  of  T  for  which  <f>  =  1  is  equal  to 
the  region  of  rejection  (this  is  only  approximately 
true  for  randomization  tests).  For  a  given  signifi¬ 
cance  level  a,  the  problem  is  to  select  a  test  cj>  such 
that  1  —  (3  is  maximized  for  all  0  belonging  to  0B 
subject  to  the  condition  that  Ee[(j>J  ^  a  for  all  0 
belonging  to  ©A. 

To  accomplish  this  goal,  the  researcher  consid¬ 
ers  the  fixed  sample  size,  the  desired  significance 
level,  and  the  research  hypothesis,  then  employs 
the  test  with  the  greatest  power.  Neyman  and 
Pearson’s  famous  lemma  shows  that  under  certain 
conditions  (a  sufficient  condition  is  that  the  prob¬ 
ability  density  tested  has  a  monotone  likelihood 
ratio)  there  exists  a  “uniformly  most  powerful” 
test  that  has  the  greatest  possible  probability  of 
rejecting  a  false  null  hypothesis  in  favor  of  a  point 
alternative  hypothesis,  compared  with  other  tests. 
Formally,  this  comparison  employs  the  poiver 
function  that  is  based  on  Type  I  and  Type  II  error 
rates.  Again,  the  probability  of  a  Type  I  and  Type 
II  error  can  be  stated  as  follows: 

P(Type  I  error)  =  P(reject  0AI0A  is  true) 

P(Type  II  error)  =  P( accept  0AI0B  is  true) 

=  1  —  P(reject  ©AI0B  is  true). 

The  last  expression  gives  the  probability  to  reject 
©A  as  a  function  of  0  and  builds  the  basis  for  the 
power  function.  If  0  belongs  to  ©B  then  the  power 
function  gives  the  probability  of  making  the  right 
decision,  that  is,  reject  ©A.  If  0  belongs  to  ©B,  then 
the  power  function  gives  the  probability  of  a  Type 
I  error  rate,  which  is  smaller  than  or  equal  to  a. 
Ideally,  a  power  function  of  a  one-sided  test  would 
be  a  step  function,  equaling  0  as  long  as  0  belongs 
to  0A  and  equaling  1  as  soon  as  0  belongs  to  0B.  It 
is  already  clear  from  the  discussion  above  that  this 
sharp  and  error-free  distinction  between  0A  and 
0B  is  impossible.  However,  the  better  a  test,  the 
steeper  the  slope  of  the  power  function  between 
accepting  0A  and  accepting  ©B  .  In  general,  the 
power  of  a  test  increases  with  the  preset  level  a,  the 
sample  size,  and  greater  difference  between  values 
from  ©A  and  ©B.  Figure  2  illustrates  the  power 
functions  for  a  one-sided  test.  We  use  a  normally 


Figure  2  Power  Function  for  a  One-Sided  Test; 
a  =  .05,  .0 1 ,  and  .00 1 


distributed  sample  of  size  N  =  10.  We  estimate  the 
power  function  at  |x  =  105  for  0A:  |x  =  100 
against  ©B:  jjl  >  100  with  a  standard  deviation  of 
o'  —  10  and  varying  values  for  a  (.05,  .01,  .001). 

Figure  3  illustrates  the  power  functions  for  a 
normally  distributed  sample  of  varying  size  with 
N  =  10,  20,  and  100.  Again,  we  estimate  the 
power  function  at  p,  =  105  for  0A:  |x  =  100 
against  0B:  |x  >  100  with  a  standard  deviation  of 
ct  =  10  and  a  fixed  a  =  .05.  Again,  the  power  func¬ 
tions  are  based  on  a  one-sided  test.  The  influence 
of  the  sample  size  on  power  is  further  discussed 
below. 

In  comparing  the  Neyman-Pearson  approach 
with  Fisher’s  exact  test  of  significance,  note  how 
different  the  following  steps  are  from  Fisher’s  pre¬ 
viously  described  steps: 

1.  Identify  a  hypothesis  of  interest,  ©B,  and  a 
complementary  hypothesis, ©A. 

2.  Determine  the  appropriate  test  statistic  and  its 
distribution  under  the  assumption  that  ©A  is  true. 

3.  Specify  a  significance  level  (a)  and  determine 
the  corresponding  critical  value  of  the  test 
statistic  under  the  assumption  that  ©A  is  true. 


1120  Hypothesis  Testing 


Figure  3  Power  Function  for  a  Normally  Distributed  Sample  of  Varying  Size;  N  =  10,  20,  and  100 


4.  Calculate  the  test  statistic  from  the  data. 

5.  Reject  ©A  and  accept  ©Bif  the  test  statistic  is 
more  extreme  than  the  critical  value  from  the 
expected  value  of  the  test  statistic  (calculated 
under  the  assumption  that  ©^  is  true). 

Otherwise,  accept  ©A. 

The  Neyman-Pearson  approach  is  important  in 
the  context  of  decision  theory  where  the  decision 
in  the  final  step  above  is  assigned  a  risk  function 
computed  as  the  expected  loss  from  making  an 
error.  Most  political  scientists  are  surprised  to  see 
the  word  accept  in  this  context  since  they  are 
trained  not  to  accept  a  null  hypothesis,  but 
Neyman  and  Pearson  do  not  have  a  null;  they  have 
a  partition  of  the  decision  space. 

The  Synthesis  of  Fisher  and  Neyman-Pearson 

The  NHST  attempts  to  combine  the  two 
approaches  described  above  giving  a  synthesized 
approach.  There  is  no  attributed  authorship  for 
this  notion,  and  it  appears  to  have  come  from 
textbook  writers  in  the  mid-20th  century  who 
may  have  been  afraid  to  offend  either  Fisher  or 


Neyman-Pearson  (the  conversation  between  these 
powerful  voices  was  unusually  vitriolic). 

Neyman  and  Pearson’s  hypothesis  test  defines 
the  significance  level  a  priori  as  a  function  of  the 
testing  procedure  and  not  as  a  function  of  the  data. 
Conversely,  Fisher’s  test  of  significance  defines  the 
significance  level  afterward  as  function  of  the  data 
after  the  test.  The  current  paradigm  in  the  social 
sciences  mixes  these  two  approaches  by  pretending 
to  select  a  a  priori  but  actually  using  p  values  (or 
asterisks  next  to  test  statistics  indicating  ranges  of 
p  values)  to  evaluate  the  strength  of  the  evidence. 
Thus,  the  NHST  allows  inclusion  of  the  alternate 
hypothesis  but  precludes  a  search  for  a  more  pow¬ 
erful  test. 

The  NHST  also  tries  to  reconcile  the  two  differ¬ 
ing  perspectives  on  how  the  hypotheses  are  actually 
defined.  It  adopts  the  Neyman-Pearson  convention 
of  two  explicitly  stated  rival  hypotheses,  but  one  is 
always  labeled  as  the  null  hypothesis  as  in  the 
Fisher  test.  In  some  introductory  texts,  the  null 
hypothesis  is  presented  only  as  a  null  relationship: 
0  =  0  (no  effect),  whereas  Fisher  intended  the  null 
hypothesis  simply  as  a  straw  man  to  be  “nullified.” 
The  NHST  pretends  to  use  the  Neyman-Pearson 
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decision  process  except  that  failing  to  reject  the  null 
hypothesis  is  treated  as  a  quasi  decision:  “modest” 
support  for  the  null  hypothesis  assertion.  There  is 
also  confusion  in  the  synthesized  test  about  p  val¬ 
ues  as  long-run  probabilities.  Since  the  p  value  or 
range  of  p  values,  indicated  by  asterisks,  is  not  set 
a  priori,  it  is  not  the  long-run  probability  of  mak¬ 
ing  a  Type  I  error  but  is  typically  treated  as  such, 
even  with  a  single  sample  and  a  single  test.  Neither 
Fisher  nor  Neyman  and  Pearson  would  have  been 
satisfied  with  the  synthesis.  Fisher  objected  to 
Neyman  and  Pearson’s  preselection  of  the  signifi¬ 
cance  level  as  well  as  the  mandatory  two-outcome 
decision  process.  Neyman  and  Pearson  disagreed 
with  Fisher’s  interpretation  of  p  values. 

Underlying  Problems  With  the 
Null  Hypothesis  Significance  Test 

The  basis  of  the  NHST  rests  on  the  logical  argu¬ 
ment  of  modus  tollens  (denying  the  consequent) 
but  adds  a  necessary  probabilistic  aspect: 

If  X,  then  Y  is  highly  likely. 

Y  is  not  observed. 

Therefore,  X  is  highly  unlikely. 

This  logic  seems  at  first  to  be  plausible.  Yet  it  is  a 
fallacy  to  assert  that  obtaining  data  that  are 
atypical  under  a  given  assumption  implies  that  the 
assumption  is  likely  false:  Almost  a  contradiction 
of  the  null  hypothesis  does  not  imply  that  the  null 
hypothesis  is  almost  false.  This  means  that  the 
NHST  rests  on  a  fundamentally  flawed  logical 
basis.  The  remaining  issues  in  this  section  are 
merely  common  misinterpretations. 

The  Inverse  Probability  Problem 

A  common  interpretive  problem  with  null 
hypothesis  significance  testing  is  a  misunderstand¬ 
ing  of  the  order  of  the  conditional  probability. 
Many  people  have  a  belief,  which  stems  directly 
from  Fisher’s  perspective,  that  the  smaller  the 
p  value,  the  greater  the  probability  that  the  null 
hypothesis  is  false.  This  incorrect  interpretation  is 
that  the  NHST  produces  P(H0\D ):  the  probability 
of  H0  being  true  given  the  observed  data  D.  But  the 
NHST  first  posits  H0  as  true  and  then  asks  what  the 


probability  of  observing  these  or  more  extreme  data 
is.  This  is  clearly  P(D\H0).  In  fact,  a  more  desirable 
test  would  be  one  that  produces  P(H0\D),  because 
then  we  could  simply  search  for  the  hypothesis  with 
the  greatest  probability  of  being  true  given  some 
observed  data.  Bayes’s  law  clarifies  the  difference 
between  these  two  unequal  probabilities: 

P(H0\D)  =  ®  P(D|H0). 

The  two  quantities,  P(D\H0)  and  P(H0\D),  are 
equal  only  if  P(H0)  =  P(D),  and  there  is  absolutely 
no  theoretical  justification  supporting  such  an 
equality. 

Significance  Driven  by  Sample  Size 

There  are  two  misinterpretations  about  the  role 
of  sample  size  in  null  hypothesis  significance  test¬ 
ing.  First  is  the  belief  that  statistical  significance  in 
a  large  sample  study  implies  real-world  impor¬ 
tance.  Many  have  observed  that  an  NHST  based 
on  a  large  sample  size  almost  always  results  in 
statistical  significance  (Macdonald,  1997).  This  is 
a  concern  in  political  science  research  since  we  do 
not  want  to  infer  that  some  subfields  have  greater 
legitimacy  just  because  the  corresponding  data  sets 
tend  to  produce  smaller  p  values:  “a  prejudice 
against  the  null”  (Greenwald,  1975,  p.  1). 

The  second  misinterpretation  is  that  for  a  given 
p  value  in  a  study  that  rejects  the  null  hypothesis, 
larger  sample  sizes  imply  more  reliable  results. 
This  is  false:  Two  studies  that  reject  the  null  with 
the  same  p  value  are  equally  likely  to  make  a  Type 
I  error  even  if  they  have  dramatically  different 
sample  sizes.  This  mistake  results  from  a  poor 
understanding  of  Type  II  errors.  Unlike  the  case 
above,  two  studies  that  fail  to  reject  the  null 
hypothesis  are  identical  in  every  way  except  that 
the  sample  sizes  are  qualitatively  different.  The 
test  with  the  larger  sample  size  is  less  likely  to 
make  a  Type  II  error  since  the  sample  size  is 
always  in  the  denominator  of  the  equation  for 
statistical  significance.  To  show  why  this  is  true, 
Figure  4  displays  two  tests  for  H0:  |x  =  0  versus 
Hy  (x  >  0,  with  sample  sizes  of  9  and  12,  respec¬ 
tively,  and  the  true  distribution  normal  (3,  3).  The 
shaded  region  is  the  area  under  the  true  sampling 
distribution  of  X  to  the  left  of  the  area  determined 
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by  setting  a  =  .05  under  the  false  assumption  that 
H0  is  true. 

Thus,  the  shaded  region  is  the  probability  we 
would  fail  to  reject  the  false  H0  (i.e.,  (B  =  P(Type  II 
error)).  It  is  clear  from  Figure  4  that  the  probabil¬ 
ity  of  a  Type  II  error  is  much  lower  for  the  test 
with  a  larger  sample  size,  even  though  the  proba¬ 
bility  of  a  Type  I  error  is  identical.  The  area  under 
the  true  sampling  distribution  of  X  in  either  panel 
of  Figure  4  not  shaded  is  equal  to  1  —  [B  =  1  — 
P(Type  II  error),  which  is  the  power  of  the  test.  It 
follows  from  above  that  the  second  test  with  a 
larger  sample  size  has  greater  power. 

Replication  Fallacy 

Another  misinterpretation  of  null  hypothesis  sig¬ 
nificance  testing  is  the  belief  that  one  minus  the 
p  value  is  the  probability  of  replication  (producing 
significant  results  in  repeated  iterations  of  the  same 
study).  This  is  equivalent  to  the  two  misconceptions: 


P(H0)  —  a  and  P(H1)  =  1  —  a.  The  term  replica¬ 
tion  fallacy  describes  the  wrong  belief  that  a  low  p 
value  such  as  p  —  .05  implies  that  95  out  of  100 
replications  will  be  statistically  significant.  The 
error  is  obvious  when  one  recalls  that  p  —  P(D\H0 ) 
and  is  thus  a  function  of  a  single  data  set  produc¬ 
ing  a  single  test  statistic. 

Asymmetry  and  Accepting  the  Null  Hypothesis 

Failing  to  reject  the  null  hypothesis  does  not  rule 
out  an  infinite  number  of  other  competing  research 
hypotheses.  Null  hypothesis  significance  testing 
is  asymmetric:  If  the  test  statistic  is  sufficiently 
atypical  given  the  null  hypothesis  then  the  null 
hypothesis  is  rejected,  but  if  the  test  statistic  is 
insufficiently  atypical  given  the  null  hypothesis  then 
the  null  hypothesis  is  not  accepted.  This  is  a  double 
standard:  H1  is  held  innocent  until  proven  guilty, 
and  H0  is  held  guilty  until  proven  innocent.  Despite 
the  fact  that  it  is  very  risky  to  accept  implicitly  or 


Figure  4  Type  II  Errors  for  X,  Differing  Sample  Sizes 
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expressly  the  conclusion  from  a  nonrejected  null 
hypothesis,  such  instances  are  common. 

There  are  two  problems  that  develop  as  a  result 
of  asymmetry.  The  first  is  a  misinterpretation  of 
the  asymmetry  to  assert  that  finding  a  nonstatisti- 
cally  significant  difference  or  effect  is  evidence  that 
it  is  equal  to  zero  or  nearly  zero.  This  acceptance 
of  the  null  hypothesis  is  damaging  because  it 
inhibits  the  exploration  of  competing  research 
hypotheses.  The  second  problem  pertains  to  the 
correct  interpretation  of  failing  to  reject  the  null 
hypotheses.  Failing  to  reject  the  null  hypothesis 
essentially  provides  almost  no  information  about 
the  state  of  the  world.  It  simply  means  that  given 
the  evidence  at  hand  one  cannot  make  an  assertion 
about  a  relationship:  All  you  can  conclude  is  that 
you  can’t  conclude  that  the  null  was  false  (Cohen, 
1962). 
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Idealism 


As  a  category  in  metaphysics,  “idealism”  usually 
refers  at  the  most  general  level  to  the  proposition 
that  “reality”  is  in  some  sense  mental  rather  than 
material.  In  political  philosophy/theory,  idealism 
refers  to  theories  that  hold  concepts  and  proposi¬ 
tions  to  be  the  constitutive  and  determining  fac¬ 
tors  in  politics,  a  claim  that  includes  the  belief  that 
“material”  processes  are  actually  mental  at  root. 
Normatively,  idealists  tend  to  hold  that  the  values 
by  which  personal  and  political  conduct  should  be 
judged  are  in  some  sense  “spiritual,”  usually 
founded  on  some  notion  of  human  nature  that 
individuals  are  innately  driven  to  realize  in  their 
daily  lives.  They  also  tend  to  reject  utilitarianism, 
claiming  that  it  celebrates  man’s  animality  rather 
than  his  inchoate  humanity  or,  in  many  cases, 
immanent  “divinity.”  That  noted,  idealists  have 
tended  to  hold  theological  beliefs  that  were  het¬ 
erodox  at  best  or  even  atheistic.  Beyond  these  very 
general  claims,  little  is  shared  by  those  political 
theories  that  are  commonly  labeled  “idealist,” 
aside  from  certain  common  misinterpretations 
made  by  nonidealists.  For  example,  there  is  no 
necessary  requirement  to  understand  the  world 
“idealistically” — that  is,  as  ultimately  “good,” 
peaceful,  or  providentially  ordered.  Similarly, 
there  is  no  common  tendency  to  deny  the  exis¬ 
tence  of  the  external  world  outside  of  thought  (the 
denial  that  is  most  associated  with  Berkeleyian 
solipsism).  Instead,  what  is  characteristically  at 
issue  is  one’s  understanding  of  the  concept  of 


“reality,”  especially  social  reality.  Idealists  tend 
to  hold  that  the  closest  one  can  get  to  any  mind- 
independent  world  “out  there”  are  the  various 
(interpreted)  sensations  excited  by  (it  seems)  that 
world  in  our  consciousnesses.  Consequently,  one 
can  only  analyze  with  even  a  modicum  of  precision 
those  sensations  and  the  ideas  to  which  they  give 
rise.  Hence,  the  realms  of  human  conduct  such  as 
social  life,  politics,  aesthetics,  and  economics  are 
constructs  of  the  mind  but  are  no  less  real  for  that 
(in  fact,  they  are  real  because  they  are  such  construc¬ 
tions).  Counterfactually,  humans  have  no  direct 
access  to  an  underlying  material  world,  meaning 
that  the  only  “reality”  that  one  can  become  aware  of 
and  can  analyze  and  criticize  rationally  is  the  mental 
reality  of  interpreted  thought. 

Plato  (ca.  428-348  BCE)  was  the  first  recorded 
idealist  political  philosopher.  He  claimed  the 
world  that  we  perceive  is  made  up  of  imperfect 
ideas,  which  are  themselves  copies  of  a  world  of 
purely  mental  entities,  each  of  which  forms  part  of 
a  single  eternal  coherent  system  of  concepts, 
ordered  with  reference  to  the  “Form  of  the 
Good.”  Only  that  single  system  is  real,  and  our 
actual,  imperfect  ideas  at  best  imitate  what  is  real, 
and  do  so  only  to  the  extent  that  they  reproduce 
elements  of  that  system.  To  the  extent  that  our 
beliefs  do  not  match  the  eternal  system  of  con¬ 
cepts,  they  trap  us  in  mere  opinion  and  error.  This 
claim  has  highly  significant  ethical  and  political 
implications.  According  to  Plato,  the  good  life  is 
one  lived  in  accordance  with  the  virtues  of  wis¬ 
dom,  courage,  moderation,  and  justice,  with  the 
last  of  these  being  achieved  through  the  correct 
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ordering  of  the  first  three  virtues  within  the  indi¬ 
vidual’s  soul  and  within  the  life  of  the  ancient 
Greek  city-state  or  polis.  Possibly  on  grounds  of 
its  impracticality  rather  than  on  philosophical 
grounds,  Plato’s  later  political  works  (The  Laws 
and  The  Statesman)  replaced  the  Republic’s  sys¬ 
tem  of  rule  by  “guardians”  with  systems  relying 
on  greater  constitutional  checks.  Nevertheless, 
many  scholars  have  seen  even  these  later  books  as 
totalitarian  models. 

There  were  many  philosophically  significant 
idealists  in  the  centuries  after  Plato,  most  notably 
Neo-Platonists  such  as  Plotinus  (ca.  205-270  CE) 
and  Iamblichus  (ca.  250-325  CE),  Cambridge 
Platonists  including  Ralph  Cudworth  (1617-1688) 
and  Richard  Cumberland  (1631-1718),  and  the 
solipsistic  idealist  Bishop  George  Berkeley  (1685- 
1753).  Nevertheless,  after  Plato,  the  next  significant 
idealist  political  philosopher  was  Immanuel  Kant 
(1724-1804).  Johann  Gottlieb  Fichte  (1762-1814) 
sought  to  develop  what  he  saw  as  the  leading 
“spirit”  of  Kant’s  critical  philosophy,  placing  par¬ 
ticular  stress  on  the  significance  of  the  self-expressing 
will  as  the  basis  of  practical  reason.  His  early  indi¬ 
vidualism  gave  way  to  an  increasingly  collectivist 
political  position,  reinforcing  Kant’s  earlier  rejec¬ 
tion  of  Fichte’s  self-proclaimed  discipleship.  Fichte’s 
Closed  Commercial  State  (1800)  advocated  an 
autarkic,  corporatist,  quasi-military  state,  while  his 
Addresses  to  the  German  Nation  (1808)  stressed 
the  need  for  state  control  of  education  as  a  precon¬ 
dition  of  the  orderly  development  of  the  German 
people,  the  (allegedly  predetermined)  stages  of 
which  he  set  out  in  his  Characteristics  of  the 
Present  Age  (1806).  Idealism’s  next  great  philoso¬ 
pher  was  the  much  misunderstood  Georg  Wilhelm 
Friedrich  Hegel  ( 1 770-1 831). 

Idealism  was  imported  into  the  United  States  via 
the  writings  of  the  St  Fouis  Hegelians,  whose  most 
important  political  philosophers  were  William 
Torrey  Harris  (1835-1909),  Denton  J.  Snider 
(1841-1925),  and  Thomas  Davidson  (1840-1900). 
Their  greatly  modified  form  of  Hegelianism 
emphasized  the  role  of  education  in  creating  the 
conditions  for  the  individual  to  enjoy  a  stable  and 
worthwhile  social  existence.  Yet  members  dis¬ 
agreed  over  the  relative  weight  to  be  given  to  indi¬ 
vidual  will  and  conscience,  on  the  one  hand,  and 
social  conventions  and  norms,  on  the  other,  and 
the  movement  dissolved  into  its  current  obscurity. 


A  more  lasting  idealist  heritage  is  found  in  Britain. 
Even  though  there  were  some  idealist  philosophers 
with  an  interest  in  politics  in  England  in  the  19th 
century — most  notably,  John  Grote  (1813-1866) 
and  John  Hutchison  Stirling  (1820-1909) — idealism 
first  held  sway  in  Anglo-American  political  philoso¬ 
phy  (for  approximately  50  years  from  the  1870s  to 
the  1920s)  through  the  dominance  of  British  ideal¬ 
ism.  The  origins  of  this  tradition  as  an  identifiable 
intellectual  movement  can  be  traced  to  Oxford 
University,  especially  the  writings  of  Thomas  Hill 
Green  (1836-1882).  His  Prolegomena  to  Ethics 
and  “Fectures  on  the  Principles  of  Political 
Obligation”  were  published  posthumously  (in 
1883  and  1886,  respectively).  Green  defended  a 
perfectionist  ethics  developed  along  the  lines  of 
Fichte’s  early  philosophy.  This  ethics  was  founded 
on  the  claim  that  individuals  experienced  an  inher¬ 
ent  if  often  imperfectly  articulated  need  and  drive 
to  develop  their  higher  capacities  through  social 
citizenship  and  an  active  commitment  to  the 
Aristotelian  virtues  of  temperance,  wisdom,  cour¬ 
age,  and  justice,  with  the  latter  spelled  out  as  the 
honoring  of  Kant’s  categorical  imperative  and  the 
realization  of  a  ’’kingdom  of  ends”  in  a  society 
orientated  around  a  determinate  ethically  and  aes¬ 
thetically  enriching  common  good  (what  later  ide¬ 
alists  called  a  “concrete  universal”).  In  this  way, 
Green  claimed  that  individuals  would  manifest  the 
“eternal  consciousness”  (which  he  saw  as  an 
immanent  God)  within  their  individual  minds. 
This  ethical  perfectionism  did  not  issue  in  a  politi¬ 
cal  perfectionism,  however,  largely  because  Green 
held  that  no  one  can  be  forced  to  act  ethically.  The 
most  the  state  or  any  other  group  or  individual  can 
do  is  help  foster  the  conditions  under  which  indi¬ 
viduals  can  develop,  primarily  through  educational 
opportunities  for  all,  employment  contracts,  and 
temperance  reform.  He  invoked  the  medieval 
notion  of  a  legitimate  (according  to  Green, 
enabling)  political  order  as  a  “societies  of  societ¬ 
ies,”  much  in  the  way  that  John  Rawls  was  to 
invoke  it  100  years  later. 

Green’s  preference  for  local  democratic  politics 
and  his  endorsement  of  the  “rights  recognition 
thesis”  were  shared  by  followers  such  as  Bernard 
Bosanquet  (1848-1923)  and  David  George  Ritchie 
(1853-1903).  Bosanquet  was  especially  sensitive 
to  the  individual  character’s  vulnerability  to  state 
interference,  whereas  Ritchie  saw  more  danger 
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coming  from  socioeconomic  factors  and  thereby 
saw  more  opportunities  for  beneficial  state  action. 
Ritchie  sought  to  combine  some  form  of  utilitari¬ 
anism  with  evolutionary  theory,  as  a  socialist  alter¬ 
native  to  the  social  Darwinism  of  Herbert  Spencer, 
and  in  this  sense  his  thought  pointed  toward  the 
New  Liberalism  of  Leonard  Hobhouse  and  John 
Hobson.  It  is  notable  that  Bosanquet’s  analysis  of 
the  ideal  type  of  the  modern  state  anticipated 
much  of  the  contemporary  literature  on  the  state 
versus  failed  or  quasi  states. 

Outside  its  Oxford  heartland,  idealism  was  also 
influential  through  the  Coleridgeian  idealism  of 
Frederick  Denison  Maurice  (1805-1872),  the 
quasi  Hegelianism  of  John  McTaggart  (1866- 
1925),  and  the  personal  idealism  of  Andrew  Seth 
Pringle-Pattison  (1856-1931),  James  Seth  (1860- 
1924),  and  Hastings  Rashdall  (1858-1924), 
among  others.  Idealist  philosophy  was  also  influ¬ 
ential  in  the  subsequent  decades  in  India  (primarily 
through  Sir  Brajendra  Nath  Seal  [1864—1 938J, 
whose  humanism  emphasized  the  parity  of  Eastern 
and  Western  cultures,  and  the  philosopher  and 
statesman  Sarvepalli  Radhakrishnan  [1 888-19  75 J, 
who  emphasized  intuition  over  the  rational  con¬ 
sciousness  of  the  earlier  idealists),  Canada  (primar¬ 
ily  through  John  Watson  [1847— 1939J  and  Leslie 
Armour  [born  1931  J),  Australia  (primarily  through 
Francis  Anderson  [185  8—194 1 J ,  Mungo 
MacCallum  [1854-1942J,  Ernest  Burgmann 
[1885-1965J,  and  Garnet  Portus  [1883-1954J), 
and  South  Africa  (primarily  through  Arthur  Ritchie 
Lord  [1880— 1941J).  The  leading  Italian  idealists 
differed  significantly  in  their  political  persuasions: 
Benedetto  Croce  (1866-1952)  was  an  outspoken 
liberal  who  was  persecuted  by  Mussolini;  Giovanni 
Gentile  (1875-1944)  was  explicitly  the  philoso¬ 
pher  of  fascism,  with  his  Theory  of  Mind  as  Pure 
Act  (1916)  being  an  attempted  philosophical  justi¬ 
fication  of  the  authoritarian  nation-state  as  the 
protector  of  the  people  and  the  developer  of  their 
being  as  a  corporate  person. 

Even  though  John  Dewey  defended  Greenian 
ethics  when  young,  at  this  time  America’s  most 
significant  defender  of  idealist  political  philosophy 
was  Josiah  Royce  (1855-1916).  While  Royce 
denied  any  significant  debts  to  either  Hegel  or 
Green,  he  acknowledged  similarities  between  his 
own  position  and  those  of  Green  and  Kant.  Royce 
held  conflict  and  struggle  to  be  preconditions  not 


merely  of  an  ethical  life  for  the  individual  but  of  a 
meaningful  life.  Only  by  orientating  one’s  life 
around  some  worthwhile  end  could  life  be  given 
purpose  and  value  and  even  then  only  if  that  ori¬ 
entation  is  made  freely  by  the  individuals  them¬ 
selves.  The  end,  however,  must  be  one  valued 
within  one’s  community,  for  in  that  way  one’s 
orientation  toward  it  represents  an  act  of  loyalty, 
which  Royce  argued  was  a  necessary  feature  of  a 
worthwhile  life.  Ultimately,  Royce  held  that  every 
valuable  human  society  developed  through  the 
action  of  a  spirit  within  it  that  transcended  the 
minds  of  individual  citizens. 

Several  subsequent  political  philosophers  have 
exhibited  idealist  tendencies,  most  notably  Robin 
George  Collingwood  (1889-1943)  and  Michael 
Oakeshott  (1901-1990)  in  Britain,  both  of  whom, 
along  with  the  other  British  idealists,  attract  sig¬ 
nificant  scholarly  attention  today.  The  main  themes 
defended  by  these  idealist  philosophers  form  the 
heart  of  much  contemporary  political  philosophy 
and  social  theory,  including  the  social  construction 
of  reality;  the  interrelationship  of  personal  self- 
realization  and  cultural  forms;  the  politics  of  recog¬ 
nition  and  identity  politics;  the  state  as  an  ideal  type 
(in  opposition  to  what  is  now  called  the  “quasi 
state”  or  “failed  state”);  the  defense  of  the  partici¬ 
patory  enabling,  welfare  state  against  attacks  from 
advocates  of  the  competition  state;  and  the  critique 
of  utilitarianism  as  a  dehumanizing  theory  of  public 
policy.  What  is  most  remarkable  is  the  extent  to 
which  many  of  the  idealists  discussed  above  pro¬ 
vide  more  extensive  and  deeper  insights  into  these 
very  live  political  issues  than  one  finds  in  most 
mainstream  contemporary  political  philosophy. 

Colin  Tyler 
University  of  Hull 
Hull,  United  Kingdom 
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Idealism  in  International 
Relations 


Idealism  in  the  field  of  international  relations  has 
usually  been  identified  with  the  perspectives,  theo¬ 
ries,  and  methods  derived  from  the  Wilsonian  tradi¬ 
tion.  Referring  to  the  ideals  espoused  by  U.S. 
President  Woodrow  Wilson,  idealist  perspectives 
such  as  liberalism  and  neo-conservatism  assume  that 
there  are  universal  political  and  economic  ideals  that 
humans  should  aspire  for.  In  international  politics, 
these  ideals  are  said  to  be  liberty,  freedom,  demo¬ 
cratic  systems  of  representation,  free  markets,  and 
capitalist  trading  systems.  Certain  forms  of  idealism 
also  see  international  institutions  (economic  and 
political)  as  ensuring  transparency  and  reinforcing 
the  incentives  for  states  to  cooperate.  Post-World 
War  II  idealism  also  incorporated  some  of  the  tenets 
of  scientific  positivism  in  the  hope  of  first  studying 
and  then  isolating  variables  that  contaminated  the 
ability  of  nation-states  to  cooperate  so  that  such 
variables  could  be  eliminated  over  time.  For  much  of 
idealism’s  history,  it  has  been  contrasted  with  realist 
international  relations  theory,  which  sees  national 
ideals  as  relative  and  pluralist,  derived  from  an  anar¬ 
chical  order  of  international  politics  founded  on 
power  and  an  order  that  cannot  be  transcended. 
There  are  two  forms  of  idealist  approaches  usually 
considered  in  international  relations — liberalism 
and  neo-conservatism.  This  entry  discusses  these 
two  approaches  as  well  as  the  relationship  between 
idealism  and  realism  and  the  political  nature  of  the 
term  idealism. 


Idealist  Approaches 

Liberalism 

Liberalism  assumes  that  while  states  operate 
within  anarchy,  and  that  states  are  indeed  primarily 
self-interested,  this  self-interest  lends  itself  to  coop¬ 
eration  rather  than  conflict.  Since  the  end  of  World 
War  II,  liberal  idealism  has  had  three  branches. 
Institutional  liberalism  posits  that  international 
organizations  and  regimes  facilitate  cooperation  by 
reducing  uncertainty  among  states  and  increasing 
transparency.  Some  institutional  idealists,  some¬ 
times  termed  neoliberal  institutionalists  or  even 
institutionalists,  may  not  even  consider  themselves 
“liberal”  in  the  sense  that  they  would  argue  that 
international  institutions  can  facilitate  cooperation 
regardless  of  the  type  of  domestic  political  system  of 
a  nation-state.  Economic  or  commercial  liberalism 
asserts  that  open  trading  systems  make  cooperation 
more  likely  because  the  benefits  of  trade  outweigh 
the  costs  of  going  to  war.  Political  liberalism 
assesses  the  likelihood  of  cooperation  or  conflict  as 
based  on  the  nature  of  a  country’s  political  system. 
Political  liberalism  has  developed  into  a  separate 
research  program  of  democratic  peace  theory — 
which  posits  that  democratic  countries  are  less 
likely  to  go  to  war  with  one  another  because  of  the 
structural  and  cultural  nature  of  democratic  deci¬ 
sion  making.  While  many  liberals  would  disagree 
with  the  “idealist”  moniker,  they  still  see  certain 
ideals  as  universally  valid  and  posit  that  an  interna¬ 
tional  community  where  those  ideals  were  allowed 
to  flourish  would  be  a  much  more  peaceful  and 
stable  one  than  the  unstable  one  we  currently  have. 

Neo-Conservatism 

Neo-conservatism  began  as  a  domestic  political 
movement  in  U.S.  politics,  but  toward  the  conclu¬ 
sion  of  the  Vietnam  War  and  thereafter  it  became 
an  approach  to  U.S.  foreign  policy  and  has  now 
emerged  as  an  approach  to  international  politics. 
Neo-conservative  argues  that  the  United  States 
bears  a  “special  burden”  because  of  its  unique  sta¬ 
tus  as  a  nation-state  and  eventual  great  power. 
What  has  made  the  United  States  unique  is  the 
moral  vision  on  which  it  was  founded — liberty, 
human  rights,  democracy.  These  are  universal 
principles,  unlike  those  constituting  past  great 
power  enterprises.  Furthermore,  these  principles 
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have  made  the  United  States  powerful  by — as  the 
neo-conservative  sees  it — “right  making  might.” 

Neo-conservative  idealism  differs,  however,  in 
two  major  respects  from  liberal  idealism.  First,  the 
neo-conservative  emphasizes  action — in  U.S.  poli¬ 
tics  it  is  sometimes  termed  movement  conserva¬ 
tism.  The  United  States’  imperative  to  act  may 
sometimes  outweigh  its  imperative  to  cooperate 
with  multilateral  organizations,  which  constrain 
nation-states  in  their  abilities  to  act.  Against  threats 
not  only  to  U.S.  national  security  but  also  to  the 
ideals  of  Western  civilization,  neo-conservatives 
argue  that  certain  liberal  institutional  restraints  can 
be  dangerous  in  that  they  get  in  the  way  of  an 
effective,  expedient,  and  forceful  American  action. 
Second,  neo-conservatism  maintains  a  skeptical 
stance  toward  the  scientific  positivism  of  liberals. 
Neo-conservatives  argue  that  while  scientific  stud¬ 
ies  can  provide  us  with  generalizations,  and  even 
statistical  evaluations,  they  do  not  give  meaning  to 
these  evaluations,  nor  could  such  studies  serve  as  a 
guide  toward  moral  action.  In  using  “value-free” 
lenses  in  these  scientific  studies,  such  studies 
missed  certain  qualitative  features  of  emerging 
threats,  such  as  ideological  and  cultural  attributes 
that  made  certain  regimes  or  groups  more  of  a 
challenge  to  the  West  than  others.  Furthermore, 
such  scientific  evaluation,  and  judicious  reflection 
in  general,  could  get  in  the  way  of  action. 
Nevertheless,  some  of  the  most  prominent  neo¬ 
conservatives  in  both  policy  circles  (such  as  Paul 
Wolfowitz)  and  intellectual  circles  (such  as  Joshua 
Muravchik)  readily  admit  the  connections  between 
neo-conservatism  and  liberalism. 

Idealism  and  Realism 

The  term  idealism  in  international  relations  is  an 
old  one.  Idealism  emerged  as  a  dominant  force  in 
thinking  about  the  world  largely  as  a  reaction  to 
the  World  War  I.  In  that  conflict’s  wake,  it  was 
assumed  that  war  was  senseless  and  hardly  effec¬ 
tive  at  obtaining  national  objectives.  This  thinking 
became  a  centerpiece  during  the  so-called  First 
Debate  among  international  relations  theorists 
between  realists  and  idealists  during  the  interwar 
period  of  1919  to  1939.  Idealism  permeated  many 
of  the  peace  movements,  the  push  and  eventual 
construction  of  the  League  of  Nations,  as  well  as 
some  of  the  political  initiatives  (such  as  the 


Kellogg-Briand  pact)  to  use  legal  means  to  outlaw 
war.  This  First  Debate  has  been  depicted  as  one  in 
which  idealists,  putting  their  faith  in  Enlightenment 
principles  such  as  reason,  naively  assumed  that 
war  could  be  legislated  or  collectively  securitized 
away  through  international  organizations  such  as 
the  League  of  Nations.  Ultimately,  the  depiction 
goes,  interwar  realists  such  as  Edward  Carr  and 
Reinhold  Niebuhr,  and  post-World  War  II  ones 
such  as  Hans  Morgenthau,  John  Herz,  Arnold 
Wolfers,  and  Kenneth  Osgood,  “won”  this  debate 
when  World  War  II  disconfirmed  much  of  the 
basis  of  interwar  idealism. 

Morgenthau,  in  Scientific  Man  Versus  Power 
Politics,  positioned  scientific  approaches  to  interna¬ 
tional  relations  as  a  form  of  idealist  theory. 
Morgenthau  asserted  that  such  approaches  approx¬ 
imated  particularist  bourgeois  ideals  (liberty,  prop¬ 
erty,  wealth,  etc.)  as  universals,  using  those  as  a 
barometer  to  measure  the  “progress”  of  nations  in 
global  politics.  According  to  Morgenthau,  because 
these  were  abstract  ideals,  the  study  of  these  by  the 
social  scientist  ignored  the  realities  of  power  poli¬ 
tics.  Nevertheless,  these  scientific  approaches  toward 
the  study  of  international  relations  became  domi¬ 
nant  during  the  1950s  and  1960s  behavioral  revolu¬ 
tion  of  American  social  science,  and  thus,  positivist 
frameworks  because  of  their  “problem-solving” 
approach  to  social  action  are  considered  by  many 
classical  realists  to  be  another  form  of  idealism. 

Neo-conservative  Joshua  Muravchik  has  justi¬ 
fied  calling  both  liberalism  and  neo-conservatism 
forms  of  Wilsonianism  with  the  helpful  binary  of 
possession  versus  milieu  goals.  Realists  argue  that 
the  ordering  principle  of  international  politics — 
anarchy — by  definition  implies  that  there  is  no 
overarching  or  policing  supranational  authority. 
Thus,  nation-states  have  possession  goals — they 
seek  to  acquire  as  much  power  as  possible  to 
ensure  their  own  survival.  Both  neo-conservatives 
and  liberals,  on  the  contrary,  argue  that  the  envi¬ 
ronment  of  international  politics  is  subject  to 
change  if  the  political  and  economic  institutions  of 
members  change;  therefore,  democratic  states, 
such  as  the  United  States,  can  and  should  have 
milieu  goals  that  seek  change  in  the  international 
environment.  That  is,  a  world  constituted  by  eco¬ 
nomically  and  politically  liberal  democracies  is 
much  more  stable  and  inclined  to  cooperation  than 
a  world  constituted  by  totalitarian  or  authoritarian 
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regimes.  Or,  in  another  realm,  a  world  where 
international  trade  can  occur  openly  without  tar¬ 
iffs  or  other  forms  of  national  economic  protec¬ 
tionism  is  one  where  cooperation  is  more  likely 
than  in  a  world  where  trade  occurs  in  isolated 
pockets,  if  at  all. 

As  such,  both  liberalism  and  neo-conservatism 
have  at  their  core  a  faith  in  the  idealist  “demo¬ 
cratic  peace”  account,  which  posits  that  demo¬ 
cratic  countries  are  less  likely  to  go  to  war  with 
one  another  because  of  the  structural  and  cultural 
nature  of  democratic  decision  making.  They  have 
also  spawned  a  number  of  evaluative  institutions, 
such  as  Freedom  House  and  Transparency 
International,  which  assess  nation-states’  eco¬ 
nomic  and  political  development  trajectories  using 
operationalized  liberal  ideals  as  a  metric. 

Realist  scholars  and  analysts  have  issued  vibrant 
criticisms  of  both  liberal  and  neo-conservative 
assumptions  and  policies.  In  addition  to  the  real¬ 
ist-idealist  debate  in  the  interwar  period,  debates 
between  realists  and  liberals  in  the  1980s  and  1990s 
centered  on  the  primary  motives  of  nation-states 
and  the  implications  such  motives  had  for  coopera¬ 
tion.  Liberals  posited  that  nation-states  sought 
absolute  gains  and  thus  would  cooperate  with  other 
states  in  certain  interactions  (such  as  trade)  so  long 
as  both  would  gain,  even  if  certain  states  gained 
more  than  others.  Realists  argued  that  nation-states 
sought  gains  relative  to  other  states,  and  thus 
nation-states  were  more  cooperation-averse  because 
power  was  more  important  to  nation-states  in  rela¬ 
tive  terms  rather  than  absolute  terms. 

Since  the  beginning  of  the  21st  century,  realists 
and  neo-conservatives  have  disagreed  about  the 
conception  of  U.S.  foreign  policy  interests  and 
acutely  quarreled  over  the  strategy  of  U.S. -led 
democratization  of  nondemocratic  areas.  Realists 
have  argued  that  democratization  is  a  fool’s 
errand,  one  that  wastes  U.S.  resources  for  little,  if 
any,  strategic  gains.  Ideals  such  as  freedom,  lib¬ 
erty,  and  capitalism  are  complex,  arising  in  the 
West  from  a  particular  constellation  of  factors, 
and  thus,  such  ideals  are  hardly  transferable  to 
every  global  region.  Neo-conservatives  posit 
that  fostering  democratic  regimes,  which  some¬ 
times  necessitates  U.S.  assistance  through  force, 
not  only  promotes  universal  ideals  but  also 
serves  as  a  demonstration  to  the  international 
community  that  these  ideals  are  worth  fighting 


for.  This  realist/neo-conservative  debate  over  democ¬ 
ratization  as  a  U.S.  foreign  policy  strategy  can  be 
considered  part  of  the  larger  debate  realists  have 
had  with  all  forms  of  idealism  over  the  past  century. 

Political  Nature  of  the  Term 

It  should  be  noted  that  many  liberals  and  neo¬ 
conservatives  would  eschew  the  label  “idealist.” 
Often,  idealism  has  been  equated  to  a  naive  form  of 
belief  that  peace  is  just  around  the  corner  if  certain 
principles  were  implemented  to  amend  the  anarchi¬ 
cal  state  of  international  politics.  Both  liberals  and 
neo-conservatives  have  argued  against  their  realist 
critics  that  their  views,  far  from  being  naive,  are 
derived  from  pragmatic  social  and  historical  pat¬ 
terns  of  human  existence.  For  example,  liberal 
democratic  peace  proponents  would  point  to  the 
many  statistical  studies  proving  that  no  two  fully 
formed  democracies  have  ever  gone  to  war  with  one 
another.  Neo-conservatives  have  also  suggested 
that  there  is  very  little  about  realism  that  is  “realis¬ 
tic,”  since  it  fails  to  recognize  the  inherent  threats  to 
U.S.  interests  that  pure  power  calculations  cannot 
predict — for  instance,  the  hostility  that  originates 
within  and  from  volatile  rogue  regimes. 

As  evidence  for  the  political  purpose  behind  the 
label  idealism,  one  can  consider  that  the  term  was 
deployed  in  the  1990s  by  realist  scholars,  such  as 
John  Mearsheimer,  against  the  then  emerging 
perspective  of  constructivism  in  international 
relations.  Other  scholars  have  argued  that  construc¬ 
tivism,  which  sees  intersubjective  understandings, 
identities,  perceptions,  discourse,  and  social  struc¬ 
tures  as  central  to  international  relations,  reflects  a 
liberal  bias.  While  it  is  highly  debatable  that 
constructivism  is  a  form  of  idealism,  this  attempt  by 
realists  and  other  scholars  to  label  it  as  such  is 
reflective  of  the  more  generally  pejorative  aura  the 
term  idealism  still  conjures  in  some  circles  of  inter¬ 
national  relations.  This  is  problematic,  simply 
because  all  international  relations  perspectives,  even 
realism,  can  be  considered  idealist  in  the  sense  that 
they  approximate  a  world  of  particular  ideals  and 
seek  mechanisms  through  which  humans  can  realize 
those  ideals  for  a  more  stable  existence. 
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Identity,  Social  and  Political 


Identity  is  a  somewhat  ambiguous  term  and  a  con¬ 
tentious  concept.  With  regard  to  individuals, 
groups,  and  societies,  a  differentiation  between 
personal,  social,  and  collective  identity  is  common. 
In  popular  language  and  essentialist  concepts,  per¬ 
sonal  identity  is  used  to  denote  an  individual’s 
self-concept:  certain  typical  traits  and  meanings 
that  are  perceived  as  relatively  stable  over  differ¬ 
ent  situations  and  over  time  and  that  distinguish 
the  self  from  other  persons.  Although  some  have 
questioned  the  universality  of  this  concept,  the 
idea  of  an  essential  inner  self  of  the  individual 
dominates  at  least  contemporary  Western  con¬ 
cepts  of  the  person.  Social  identity  refers  to  soci¬ 
etal  or  cultural  influences  on  the  development  of 
the  self,  to  memberships  of  individuals  along  a 
continuum  of  interaction  groups  (e.g.,  families, 
sports  teams,  political  parties,  and  other  net¬ 
works),  and  to  social  categories  (e.g.,  according  to 
social  roles  such  as  mothers,  teachers,  sports  fans). 
As  such,  social  identity  is  also  part  of  the  personal 
identity.  The  terms  cultural  identity  and  collective 
identity  mostly  denote  social  identities  of  a  large 
scale  (e.g.,  gender,  age,  ethnicity,  class)  and  some¬ 
times  are  also  used  to  describe  a  relatively  long 
enduring  stability  of  values,  institutions,  symbols, 
and  aims  of  a  group,  a  society,  or  a  nation-state 


(e.g.,  represented  in  national  consciousness). 
National  identity  is  a  special  form  of  a  collective, 
political  identity,  which  consists  mostly  of  a  self- 
image  based  on  certain  assumptions  about  common 
features  as  descent,  history,  language,  culture,  sub¬ 
jective  feelings  of  belonging,  and/or  citizenship. 
Furthermore,  the  term  identity  is  used  not  only  with 
regard  to  the  microlevel  and  its  aggregates  but  also 
to  describe  certain  traits  at  the  meso-  or  macrolevel 
(e.g.,  the  identity  of  the  Americans,  the  British 
Empire,  or  the  Eastern  world). 

For  some,  identity  means  absolute  sameness  of 
human  beings;  thus,  if  used  in  a  social  or  political 
context,  it  carries  connotations  of  strong  homoge¬ 
neity  between  persons  and  a  collective.  From  a 
normative  democratic  perspective,  this  meaning  is 
rejected  because  it  is  linked  to  ideas  of  a  homoge¬ 
neous  people,  authoritative  leadership,  and  dicta¬ 
torship  as  well  as  a  sharp  definition  of  insiders 
and  outsiders  and  a  demarcation  of  friend  and 
foe. 

Others  use  identity  to  depict  scientific  concepts 
that  play  a  certain  role  in  the  humanities  and  social 
sciences.  Most  of  them  point  to  the  fact  that  iden¬ 
tity  usually  has  multiple  facets,  which  can  gain 
different  salience  in  different  situations  and  thus 
may  also  change  over  time  in  intensity  and  mean¬ 
ing.  In  addition,  the  term  identification  is  used  to 
denote  a  feeling  of  sameness  with  another  person 
or  group  and  loyalty  toward  its  motives,  ideals, 
aims,  leaders,  and  so  on. 

Since,  in  the  human  sciences,  there  are  many 
differing  concepts  of  social  and  political  identity, 
this  entry  concentrates  on  concepts  referring  to 
the  levels  of  individuals  and  groups  as  holders  of 
identity.  After  a  few  remarks  concerning  the  his¬ 
torical  roots  of  the  concept  of  identity  and  a  brief 
mention  of  some  concepts  referring  to  personal 
identity,  the  entry  discusses  selected  major 
approaches  to  social  identity.  The  final  section 
deals  with  questions  of  political  identity  with 
respect  to  societies,  especially  at  the  level  of 
national  identity,  and  considers  its  relationship  to 
subnational  and  supranational  units  of  identifica¬ 
tion.  Other  aspects  of  political  identity  such  as 
party  identification  or  identity  of  social  and  polit¬ 
ical  movements  are  not  dealt  with  here.  Similarly, 
this  entry  also  does  not  deal  with  identity  politics 
at  the  macrolevel  (e.g.,  affirmative  action  in  mul¬ 
ticultural  societies). 
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Historical  Roots  of  Identity  Research 

Early  concepts  of  identity  primarily  dealt  with  the 
self  as  personal  identity  from  a  psychological  per¬ 
spective  and  concentrated  on  the  development  and 
consistency  of  the  psyche.  By  contrast,  sociology  is 
concerned  with  social  identity  and  its  structural 
determinants,  and  social-psychological  research 
focuses  on  human  fundamentals  in  differing  social 
contexts  and  thus  on  the  interplay  between  cogni¬ 
tive,  affective,  and  evaluative  processes  and  social, 
cultural,  and  political  influences  and  on  their 
respective  consequences  for  subsequent  behavior. 

The  history  of  concepts  of  social  identity  in 
modern  social  sciences  begins  toward  the  first 
third  of  the  20th  century.  Early  protagonists  such 
as  William  James,  Charles  Horton  Cooley,  and 
George  Herbert  Mead  had  in  common  the  view  of 
identity  as  a  product  of  constructions  and  percep¬ 
tions  of  reality  in  social  interactions.  This  was  a 
main  basis  for  the  theoretical  link  between  the 
individual  and  society.  They  argued  that  con¬ 
sciousness  and  self-knowledge  are  not  natural,  a 
priori  characteristics  of  human  behavior  but  fea¬ 
tures  that  both  originate  from  and  influence  the 
processes  of  social  interaction.  Cooley’s  metaphor 
of  a  “looking  glass  self”  describes  how  persons 
take  their  ideas  about  themselves  from  their  per¬ 
ceptions  of  other  persons’  opinions  about  them. 
Somewhat  later,  James  and  Mead  differentiated 
the  self  into  the  components  “I”  (internal  perspec¬ 
tive)  and  “Me”  (external  perspective).  The  compo¬ 
nent  “Me”  is  determined  by  culture  and  society;  it 
reflects  social  roles  and  expectations  from  others 
toward  the  person.  These  are  learned  and  internal¬ 
ized  in  processes  of  socialization  and  identification 
with  other  persons  or  collectives.  The  component 
“I”  means  the  spontaneous,  active,  more  autono¬ 
mous  part  of  the  self,  which  is  not  only  a  presup¬ 
position  for  the  ability  to  identify  with  others  and 
to  learn  social  roles  but  also  a  necessary  condition 
for  the  ability  to  keep  a  critical  distance  to  the  social 
self.  This  approach  was  further  developed  in 
theories  of  symbolic  interactionism  and  phenome¬ 
nology.  Peter  L.  Berger  and  Thomas  Luckmann 
conceptualized  the  socialization  process  as  a  dialec¬ 
tical  one  between  identification  by  others  (objec¬ 
tive)  and  by  self  (subjective);  they  see  this  process  as 
a  mirror  of  a  broader  societal  dialectic  between 
subjective  and  objective  reality. 


In  subsequent  research,  some  authors  concen¬ 
trated  on  the  “I”  (personal  identity)  and  linked  this 
to  ethno-methodological  concepts,  such  as  that  of 
Alfred  Schiitz,  or  to  the  cognitive-structural 
approach  of  Jean  Piaget  and  Lawrence  Kohlberg. 
These  approaches  also  distinguish  an  “ideal  self,” 
which  means  culturally  framed  ideas  of  how  the 
self  ought  to  be.  The  fit  between  these  normative 
ideas  and  the  perception  of  how  the  self  actually  is 
strongly  affects  a  person’s  self-esteem. 

Another  approach,  schema  theory  (e.g.,  Susan 
Fiske  and  Shelley  Taylor),  in  part  also  deals  with 
personal  identity  and  its  cognitive  components. 
Schemata  are  highly  generalized  and  organized  cog¬ 
nitive  structures  in  memory  that  guide  perceptions 
and  interpretations  of  new  information  and  mem¬ 
ory  recall  and  thus  help  reduce  the  amount  of  nec¬ 
essary  cognitive  processes  of  orientation.  Identity  is 
defined  as  a  system  of  possibly  hierarchically  and 
context  specifically  ordered  self-concepts.  Situations 
that  fit  an  individual’s  self-schemata  are  more  likely 
than  contrary  information  to  be  processed  and 
stored  in  memory  and  thus  form  a  consistent  and 
relatively  stable  self-schema.  Research  shows  that 
individuals’  schematic  identities  are  different  in 
intensity  and  amount.  Highly  complex,  multifac¬ 
eted  self-schemata  may  contribute  to  psychological 
health  in  situations  that  threaten  part  of  the  identity 
(e.g.,  if  a  person’s  identity  as  a  worker  is  threatened 
because  of  unemployment,  the  psychological  acti¬ 
vation  of  other  identities  such  as  being  a  good 
father  and  a  gifted  singer  in  church  might  help 
uphold  his  self-esteem).  Critics  of  schema  theory 
argue  that — because  of  its  emphasis  on  learning 
effects  and  automatic  cognitive  processes — it  leads 
to  a  too  passive  and  static  perspective  of  identity. 

Cognitive  dissonance  theory  (Leon  Festinger) 
and  social  attribution  theory  (Philip  E.  Tetlock) 
additionally  integrate  the  affective  motivational 
basis  of  identity,  namely,  the  desire  for  a  positive 
self-esteem,  in  their  models  of  information  pro¬ 
cessing.  Thus,  if  cognitions  or  attitudes  become 
dissonant  with  each  other  or  with  behavior  because 
of  new  information,  individuals  try  to  reach  conso¬ 
nance  by  cognitive  strategies  such  as  downplaying 
the  inconsistent  new  information  or  changing  its 
meaning.  In  this  way,  an  individual’s  self-serving 
bias  is  activated  when  positive  events  or  outcomes 
of  actions  are  attributed  to  the  self,  whereas  for 
negative  ones  the  situation  or  others  are  blamed. 
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Social  Identity 
Approaches  of  Role  Identity 

Other  authors  concentrated  on  the  “Me”  com¬ 
ponent  of  the  self.  In  this  context,  Ervin  Goffman 
laid  the  foundations  of  role  theory.  He  distin¬ 
guished  between  a  personal  identity  (individual 
uniqueness  of  life  experiences)  and  a  social  identity 
(role  expectations  and  roles)  of  the  individual. 
Both  confront  the  individual  with  demands  in 
daily  interaction,  which  he  conceptualized  as  a 
kind  of  role  play  on  stage.  It  is  assumed  that  per¬ 
sons  do  not  identify  totally  with  their  different 
roles,  because  of  ever-changing  social  interactions. 
A  reflexive  and  somewhat  distant  relationship 
toward  one’s  own  role  identities  is  seen  as  espe¬ 
cially  characteristic  for  modernity.  Thus,  in  tradi¬ 
tional,  relatively  static  societies,  the  “I”  is  strongly 
determined  by  the  identification  with  (often  pri¬ 
mordial)  social  collectives  (e.g.,  family,  tribe,  and 
guild)  and,  therefore,  keeps  close  to  the  “Me,” 
possibly  throughout  one’s  entire  life.  In  modern 
societies,  as  Norbert  Elias  has  argued,  processes  of 
social  change,  societal  differentiation,  and  mobili¬ 
zation  as  well  as  globalization  lead  to  an  ongoing 
individualization.  On  the  one  hand,  this  is  linked 
to  rising  freedom  for  the  “I.”  There  are  more  role 
models  to  choose  from,  identity  facets  can  change 
often  from  situation  to  situation,  and  multiple 
aspects  of  identity  can  coexist.  On  the  other  hand, 
they  can  also  compete  or  even  come  into  conflict 
with  each  other  and  thus  result  in  identity  crises. 
Erik  Erikson  defines  a  (healthy)  identity  as  one 
that  guarantees  sameness  and  continuity  of  the  self 
throughout  life  in  one’s  own  perception  and  that 
includes  the  expectation  that  others  will  recognize 
this  self.  But  modern  societies  can  also  produce 
problems  such  as  a  negative  identity  (e.g.,  identifi¬ 
cation  with  roles  that  are  evaluated  negatively  by 
society  and  the  individual  him-  or  herself,  because 
a  positively  evaluated  role  takeover  was  too  diffi¬ 
cult  and  failed)  or  competing  identities  (diverging 
structures  of  personality  because  of  status  incon¬ 
sistencies  in  complex  societies).  Thus,  modern 
societies  produce  more  dangers  and  uncertainties 
concerning  an  individual’s  identity  and  thus  create 
the  need  for  life-long  reflection  about  the  relation¬ 
ship  between  the  “Me”  and  the  “I.” 

Some  criticisms  of  this  approach  were  directed 
toward  the  differentiation  between  personal  identity 


as  an  ostensibly  “true”  one  and  role/social  identity 
as  a  “false”  one  or  as  a  “calculating”  perspective  on 
the  self. 

Whereas  in  earlier  concepts  the  self  was  related 
to  interactions  with  other  persons,  in  role  theory 
the  self  is  related  to  features  of  the  social  structure. 
Role  identity  depicts  a  certain  position  or  status 
within  a  social  system.  All  role  theories  share  the 
assumption  that  society  is  differentiated  into  social 
roles,  which  carry  behavioral  norms,  and  that  iden¬ 
tity  entails  the  internalization  of  these  roles  by 
individuals  laying  the  ground  for  patterned,  struc¬ 
tured,  and  routinized  action.  In  their  concepts  of 
identity,  some  theorists  combine  the  symbolic  inter- 
actionism  approach  with  role  theory.  In  this  kind  of 
theory,  society  is  seen  as  a  rather  stable  structure 
that  results  from  repeated  patterns  of  behavior  of 
individuals  and  groups.  Macrolevel  structures  are 
seen  as  the  product  of  the  multiple  roles  that  indi¬ 
viduals  perform  in  day-to-day  interactions.  The  self 
and  its  identity  are  created  by  these  interactions  and 
by  the  meanings  that  the  multiple  roles  have  for  the 
actors,  and — in  a  reflexive  process — identities  pro¬ 
vide  the  meanings  of  the  roles. 

There  are  various  approaches  for  a  systematiza¬ 
tion  of  multiple  roles  and  identities.  For  instance, 
Peter  Burke’s  taxonomy  is  oriented  along  different 
social  situations:  Person  identities  denote  self¬ 
meanings,  the  meanings  that  characterize  the  indi¬ 
viduality  (as  being  active  or  passive,  ambitioned  or 
unpretentious).  Individuals  carry  around  these 
identities  across  different  situations  and  they  are 
activated  frequently.  Role  identities  are  activated 
in  specific  situations  where  individuals  perform  in 
certain  functions  (e.g.,  being  a  daughter,  a  mother, 
a  wife,  an  artist,  or  a  sports  fan).  The  meanings 
linked  to  such  roles  or  functions  are  determined  by 
culture  and  socialization  as  well  as  the  individual 
perceptions  and  assessments  of  these  roles.  Social 
identities  emerge  from  the  identification  of  indi¬ 
viduals  with  social  groups  or  categories.  They 
define  the  belonging  of  insiders  as  well  as  of  the 
outsiders  on  the  basis  of  similarity/difference  and 
they  may  lead  to  common  bonds  and  actions. 
Individuals  have  identities  of  all  three  types  and 
usually  more  than  one  of  these  types  is  invoked  in 
social  interactions. 

Peter  J.  Burke  and  others  also  examine  internal 
processes  of  an  individual  concerning  his  or  her 
identity  and  its  influence  on  behavior.  Identities 
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here  are  seen  as  meanings  attached  to  the  self.  The 
process  of  identity  invoking  is  conceptualized  as  a 
perpetual  control  system,  a  circular  process  of  self¬ 
definitions  of  identity  meanings  and  their  attach¬ 
ment  to  experiences  in  a  feedback  loop.  This  is 
assumed  to  lead  to  a  permanent  verification  of  the 
meanings  of  one’s  identity.  Behavior  is  thought  of 
as  a  function  of  the  relationship  between  situations 
and  identities. 

Sheldon  Stryker’s  analyses  of  the  psychological 
organization  of  identities  led  him  to  a  hierarchical 
concept  of  saliency  and  commitment.  The  main 
idea  is  that  an  individual’s  behavior  is  shaped  by 
how  often  and  how  intensively  his  or  her  identities 
are  activated.  Identity  salience  refers  to  the  proba¬ 
bility  that  a  specific  identity  is  activated  by  others 
or  by  oneself.  A  salient  identity  is  one  that  is 
invoked  frequently  and  across  different  situations 
and  contexts.  The  saliency  of  an  identity  deter¬ 
mines  the  probability  that  the  individual  will  enact 
roles  that  are  consistent  with  this  identity,  that  he 
or  she  will  interpret  situations  as  suitable  to  per¬ 
form  such  roles,  and  even  that  an  individual  will 
seek  opportunities  to  perform  these  roles.  Identity 
commitment  refers  to  the  dependency  of  an  indi¬ 
vidual’s  relationships  to  others  on  certain  roles. 
The  strength  of  the  commitment  defines  the  extent 
and  intensity  of  an  individual’s  attachment  to  other 
persons  to  whom  he  or  she  is  linked  by  an  identity. 
Strength  of  commitment  and  saliency  usually  go 
hand  in  hand,  and  the  higher  the  position  of  an 
identity  in  the  hierarchy,  the  greater  is  the  proba¬ 
bility  the  identity  will  be  activated  and  thus  shape 
behavior,  which  in  turn  reinforces  its  salience. 

Others  concentrate  more  on  social  positions  as 
basis  of  roles  and  identity.  George  McCall  and 
Jerry  Laird  Simmons  distinguish  between  a  conven¬ 
tional  and  an  idiosyncratic  component  of  role  iden¬ 
tity  (similar  to  Erving  Goffmann’s  distinction 
between  social  and  personal  aspects).  The  conven¬ 
tional  component  refers  to  expectations  and  inter¬ 
nalized  meanings  linked  to  social  positions;  the 
idiosyncratic  component  refers  to  the  individual 
meanings  the  respective  roles  have  for  the  person. 
In  this  approach,  a  prominence  hierarchy  orders  the 
identities  according  to  their  importance  for  an  ideal 
self,  which  is  seen  as  a  result  of  the  support  an  indi¬ 
vidual  gains  from  others  for  a  certain  identity,  an 
individual’s  commitment  to  an  identity  and  reward 
for  activating  an  identity.  A  salience  hierarchy  is 


seen  as  a  function  of  the  identity  prominence,  a 
psychological  need  for  support  and  rewards,  and 
the  perceived  opportunities  for  the  activation  of 
identities  in  specific  situations  and  thus  is  impor¬ 
tant  for  concrete  behavior. 

Social  Identity  Theory  and 
Social  Categorization  Theory 

Two  other  closely  related  theories  in  social  psy¬ 
chology  have  gained  much  importance  in  identity 
research,  namely,  the  so-called  social  identity  the¬ 
ory  by  Henry  Tajfel  and  the  social  categorization 
theory  by  John  Turner  and  colleagues,  both  together 
often  denoted  as  the  social  identity  approach.  This 
approach  was  developed  in  the  1960s  as  a  cognitive 
theory  of  group  identity  to  explain  prejudice,  dis¬ 
crimination,  and  conflicts  between  members  of 
social  categories  or  groups  and  has  since  generated 
a  huge  body  of  research  to  test  its  basic  assump¬ 
tions  and  to  generate  new  hypotheses.  In  part,  it 
rests  on  earlier  research  and  also  shares  some 
assumptions  with  other  approaches  to  explain 
prejudice  and  conflict,  but  in  other  aspects  it  clearly 
departs  from  them. 

Earlier  research  to  explain  prejudice  focused  on 
psychological  traits  and  defects  of  the  unconscious 
of  individuals  (e.g.,  the  concept  of  the  authoritar¬ 
ian  personality  in  the  work  by  Theodor  Adorno, 
Else  Frenkel-Brunswick,  Daniel  Levinson,  and 
Nevitt  Sandford  or  on  attitudinal  predispositions 
learned  in  early  childhood  (as  in  the  concept  of 
symbolic  racism  of  Donald  Kinder  and  David 
Sears).  But  both  approaches  are  limited  with 
respect  to  identity  insofar  as  they  remain  restricted 
to  the  personal  level  and  cannot  explain  changes  in 
discrimination  and  conflict.  Another  concept,  the 
theory  of  social  dominance  (e.g.,  James  Sidanius), 
not  only  goes  beyond  personal  identity  but  also 
has  difficulties  in  explaining  why  and  under  what 
conditions  discrimination  and  conflicts  arise  in 
certain  situations.  It  claims  that  all  societies  are 
hierarchical  and  have  at  least  one  hegemonic  and 
one  subordinate  group  that  usually  hold  prejudices 
against  each  other.  This  social  dominance  orienta¬ 
tion  reflects  a  normal  psychological  desire  to 
belong  to  a  high-status  group.  Thus,  individual 
and  institutionalized  discrimination  is  seen  as  a 
normal  feature  of  societies,  which  are  perpetuated 
by  processes  of  social  identification,  self-esteem 
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maintenance,  and  social  comparison.  Realistic  group 
conflict  theory  (e.g.,  Muzafer  Sherif)  assumes  that 
conflicts  between  groups  are  based  on  competition 
for  scarce  resources  and  thus  follow  the  principle  of 
individual  rationality.  The  conflicts  are  seen  as  the 
basis  for  ingroup  identification  and  for  ongoing 
psychological  processes  of  stereotyping  and 
discrimination  of  the  outgroup.  A  recommended 
solution  for  such  conflicts  is  to  alter  the  group  iden¬ 
tification  by  imposing  superordinate  common  goals. 
Critiques  of  this  approach  emphasize  that  the  fram¬ 
ing  of  the  group  conflict  and  the  proposed  solution 
undermine  its  rational  choice  basis.  Furthermore,  a 
change  of  identity  by  imposing  superordinate  goals 
may  be  possible  only  in  very  restricted  situations, 
not  in  real-life  conflicts  of  a  broad  range  such  as 
conflicts  between  ethnic  groups  or  even  nation¬ 
states,  which  may  lead  to  war. 

The  social  identity  approach  suggests  that  iden¬ 
tification  with  an  ingroup  and  distinction  from / 
discrimination  against  outgroups  are  two  sides  of 
one  coin  and  that  competition  for  scarce  resources 
is  not  a  necessary  condition  for  these  processes. 
The  empirical  roots  of  this  research  are  based  on 
the  minimal  group  paradigm.  Here,  in  laboratory 
experiments,  artificial  groups  were  formed  on  the 
basis  of  overtly  senseless  tests  (e.g.,  whether  some¬ 
body  preferred  Paul  Klee  or  Wassily  Kandinsky  as 
a  painter)  or  even  by  lot,  which  was  known  by  the 
participants.  The  participants  had  no  interaction, 
there  was  no  competition,  and  they  did  not  know 
anything  about  each  other.  When  asked  to  reward 
members  of  their  own  and  of  the  other  group,  they 
consistently  favored  their  own  group  and  thus 
showed  bias  against  the  other  group.  Moreover, 
this  ingroup  favoritism  did  not  occur  in  a  way  that 
maximized  the  benefit  for  one’s  own  group  (which 
would  have  supported  the  realistic  group  conflict 
model),  but  led  to  a  maximum  difference  between 
both  groups.  Thus,  a  distinction  of  groups  based 
on  purely  cognitive  belonging  produced  ingroup 
favoritism  and  outgroup  discrimination. 

Social  identity  theory  also  rests  on  claims  about 
psychological  needs:  Individuals  strive  for  positive 
self-esteem,  which  leads  them  to  identify  with 
groups  that  will  enhance  their  self-image.  This  is 
only  possible  if  the  ingroup  is  distinct  from  other 
relevant  groups  in  a  positive  way.  Moreover,  a 
need  to  reduce  uncertainty  compels  persons  to 
compare  and  categorize  groups  as  well  as  the  self 


in  relation  to  such  groups.  This  social-psychological 
approach  also  takes  a  perspective  on  identity  that 
differs  from  the  sociological  symbolic  interaction- 
ist  concept,  because  it  does  not  conceptualize  all 
identities  as  inherently  social.  Rather,  it  argues 
that  group  identity  is  formed  in  contrast  to  per¬ 
sonal  identity,  claiming  a  continuum  between 
both,  along  which  behavior  can  move.  Thus  it 
assumes  that  group  psychology  and  behavior  are 
qualitatively  distinct  from  the  intra-individual  and 
individual  ones.  The  central  aim  is  to  explain 
under  what  conditions  and  how  group  bias  occurs. 
For  this  process,  three  main  tenets  have  been 
defined — namely,  the  salience  of  group  member¬ 
ship,  the  situational  context,  and  the  individual’s 
beliefs.  Thus,  for  a  social  identity  to  affect  percep¬ 
tions,  cognitions,  attitudes,  and  behavior,  the  mere 
awareness  of  group  membership  is  not  enough.  As 
long  as  the  personal  identity  is  more  salient  than 
social  identity,  the  individual  will  behave  in  an 
interpersonal  manner  that  takes  account  of  per¬ 
sonal  traits  and  relationships  between  the  actors. 
But  if  social  identity  becomes  psychologically  more 
salient,  group  behavior  will  occur.  Since  everybody 
develops  multiple  group  memberships  and  corre¬ 
sponding  social  identities,  the  objective  context  is 
also  relevant  for  defining  which  special  social  iden¬ 
tity  fits  the  situation  best.  Subjective  interpreta¬ 
tions  of  the  situation  as  well  as  beliefs  about  the 
group  memberships  intervene  in  this  process.  The 
most  important  beliefs  refer  to  chances  for  indi¬ 
vidual  mobility  or  chances  for  social  change,  both 
of  which  represent  the  poles  of  a  continuum.  These 
imply  corresponding  strategies,  if  the  outcome  of  a 
comparison  between  one’s  own  membership  group 
and  another  one  is  negative.  For  example,  if  a  per¬ 
son  thinks  that  it  is  possible  to  leave  the  ingroup 
and  become  a  member  of  a  comparable  group 
with  a  higher  status,  he  or  she  will  act  in  this  direc¬ 
tion.  This  presupposes  a  belief  in  an  open  social 
structure  or  society.  By  contrast,  if  the  personal 
strategy  does  not  promise  an  enhancement  of  self¬ 
esteem,  an  action  as  a  group  member  will  be 
undertaken  to  change  collectively  the  status  of  the 
ingroup,  which  includes  a  high  saliency  of  social 
identity.  Such  actions  can  include  social  creativity 
(e.g.,  a  change  of  the  dimension  of  comparison,  a 
change  of  the  evaluation  of  the  compared  attri¬ 
butes,  or  a  change  of  the  group  of  comparison)  or 
social  competition/confrontation. 
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Whereas  the  ingroup  favoritism/outgroup  bias 
has  been  demonstrated  consistently,  empirical  sup¬ 
port  for  the  self-esteem  motive  has  been  less 
strong.  This  may  have  contributed  to  a  rise  in 
interest  in  the  cognitive  dimension  of  the  theory 
during  the  1980s  in  the  work  of  John  Turner  et  al. 
They  understand  their  self-categorization  theory  as 
an  extension  and  elaboration  of  the  social  identity 
theory.  Here,  social  identity  is  defined  as  a  set  of 
cognitive  representations  of  the  self.  Two  contri¬ 
butions  are  mentioned  here,  namely,  the  concept 
of  multiple  levels  of  inclusiveness  and  that  of  pro¬ 
totypicality.  The  idea  is  that  self  representations 
are  organized  along  a  hierarchy  of  categories, 
which  are  defined  by  similarity  within  classes  and 
dissimilarity  between  classes,  based  on  the  meta¬ 
contrast  principle  (this  means,  the  interesting  dimen¬ 
sions  of  categorization  are  classified  in  a  way  that 
minimizes  the  differences  within  a  category  and 
maximizes  them  with  regard  to  other  categories). 
Meaningful  comparisons  and  classifications  presup¬ 
pose  a  similarity  of  the  interesting  traits  and  respec¬ 
tive  categories  at  a  higher  level.  Thus — instead  of  a 
continuum  between  the  personal  and  the  social 
self — they  suggest  at  least  three  main  levels  of  iden¬ 
tity.  The  most  inclusive  and  therefore  superordinate 
level  is  that  of  human  beings  against  other  forms  of 
life  (animals,  plants);  the  intermediate  level  com¬ 
prises  membership  in  social  groups  as  against  other 
comparison  groups  (e.g.,  men  and  women, 
Europeans  and  Americans,  the  political  Left  and 
Right);  and  the  subordinate  level  focuses  on  the 
individual  and  the  differences  from  other  ingroup 
members.  Obviously,  on  the  level  of  social  groups, 
these  again  can  show  different  patterns  of  inclusive¬ 
ness,  which  can  be  seen  as  hierarchical  or  nested 
patterns  (e.g.,  women,  European  women,  left- 
oriented  European  women  as  against  their  respec¬ 
tive  counterparts).  A  group  prototype  is  someone 
who  shows  the  highest  value  of  metacontrast  on  a 
certain  dimension  of  comparison.  Thus,  prototypi¬ 
cality  can  change  depending  on  the  situation.  Proto¬ 
types  can  serve  as  a  model  for  other  group  members 
and  thus  can  foster  self-stereotyping  of  ingroup 
members  and  stereotyping  outgroups.  This  deper¬ 
sonalization  contributes  to  ingroup  homogeneity 
and  distinctiveness  as  well  as  to  outgroup  bias  and 
is  seen  as  central  to  all  group  processes. 

From  a  sociological  point  of  view,  a  general 
critique  is  that  the  ontological  assumption  of  an 


independent  person  fluctuating  between  personal 
and  social  identities  restricts  the  possibilities  to 
account  for  social  processes,  especially  with  respect 
to  the  influences  of  political  and  economic  macro¬ 
factors,  because  these  can  only  be  taken  into 
account  as  specific  traits  of  the  situation.  Another 
problem  is  that  most  research  in  this  field  relies  on 
laboratory  experiments  and  thus  cannot  simulate 
the  complexity  of  the  real-life  world,  which  ques¬ 
tions  the  external  validity  of  such  findings.  From 
the  perspective  of  political  science,  another  deficit  is 
the  relative  neglect  of  motivational  factors  and  emo¬ 
tions  linked  to  identities  and  their  consequences. 

Narrativity  of  Social  Identity 

Recent  research  in  understanding  social  identity 
is  also  linked  to  social  constructivism  and  the  dis¬ 
cursive  approach,  resting  on  poststructuralist  per¬ 
spectives  and  the  interpretive  paradigm.  It  is  not 
concerned  with  individual  or  social-psychological 
processes  of  identity  but  concentrates  on  narrative, 
linguistic  constructions  of  identity.  A  central 
assumption  is  that  a  person’s  identity  is  not  a  stable 
outcome  and  entity  but  has  to  be  conceptualized  as 
a  permanent  process  of  construction  in  language 
and  talk  in  changing  situations.  Within  this  process, 
prevailing  ideologies  are  said  to  be  powerful  mecha¬ 
nisms  to  determine  identity  so  that  an  individual 
only  has  the  illusion  of  an  actively  and  freely  chosen 
concept  of  the  self.  An  example  is  Michael  Billig’s 
work  on  the  link  between  thinking  and  arguing. 
Narratives  are  also  seen  as  a  person’s  self¬ 
representation  toward  others,  which  may  appear 
stable  and  continuous  but  in  fact  can  change  (e.g., 
Kenneth  Gergen  and  Mary  Gergen).  On  the  whole, 
these  different  approaches  toward  social  identity 
have  laid  the  ground  for  and  can  contribute  ever 
more  to  the  understanding  of  political  identity  as 
well. 

Political  Identity 

National  identity  is  one  among  other  collective 
identities,  and — in  contrast  to  many  other  political 
identities — it  is  linked  to  a  political  as  well  as  a  ter¬ 
ritorial  point  of  reference,  a  political  community  at 
the  level  of  the  nation-state.  This  community  usu¬ 
ally  is  thought  of  as  one  that  encompasses  people 
of  both  sexes,  from  all  age-groups,  and  from  all 
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social  strata.  Typically,  national  identity  also  rests 
on  beliefs  about  certain  commonalities  among  the 
members  of  the  group,  such  as  a  common  ancestry, 
ethnicity,  language,  religion,  culture,  and  values;  a 
sense  of  belonging;  political  rights;  participation, 
and  so  on.  These  are  often  thought  to  overcome 
contrasting  and  competing  utilitarian  interests  of 
the  members  (e.g.,  those  between  different  classes) 
and  thus  are  crucial  for  the  integration  of  society. 
Some  of  these  characteristics  also  apply  to  sub- 
(local,  smaller  regional)  and  supranational  identi¬ 
ties  (encompassing  more  than  one  nation  in  bigger 
regions  ranging  from  border  areas  over  continents 
to  the  whole  human  race  or  referring  to  suprana¬ 
tional  political  units  such  as  the  European  Union 
[EUJ).  Problems  of  national  identity  are  most 
prominent  in  research,  whereas  sub-  and  suprana¬ 
tional  identities  are  often  discussed  only  in  their 
relation  to  national  identities. 

In  political  science,  diverse  subdisciplines  deal 
with  national  identity,  national  movements,  nation¬ 
alism,  and  nation-building  processes.  With  respect 
to  national  identity  in  the  more  restricted  sense  of 
the  topic,  research  is  based  mainly  on  political  socio¬ 
logical  and  political  cultural  approaches  resting  on 
behaviorism  and  working  with  surveys,  although  it 
is  less  linked  than  the  topic  of  social  identity  to 
certain  authors  or  research  schools.  In  contrast  to 
sociology  and  social  psychology,  political  science  is 
also  less  concerned  with  the  cognitive  basis  of  iden¬ 
tity  and  more  with  affective  components  and  its 
determinants  and  consequences.  Moreover,  identity 
is  rarely  treated  as  a  cognitive  categorization  but 
rather  as  an  attitude,  as  identification.  Especially 
when  dealing  with  national  identity  on  a  microlevel, 
the  purely  cognitive  categorization  seems  self- 
evident,  because  there  is  almost  no  one  who  does  not 
know  to  which  nation  he  or  she  belongs.  But  catego¬ 
rization  processes  become  a  topic  when  dealing  with 
stereotyping  between  different  nationalities  within  a 
country  or  between  countries.  Here,  questions  con¬ 
cern  the  intensity  of  collective  identification,  the 
relationships  between  identifications  with  different 
collectives,  and  reference  objects  as  the  content  and 
meaning  of  identity. 

Intensity  of  Political  Identification 

People  have  been  and  still  are  ready  to  fight  for 
their  country.  Wars  in  the  name  of  country  and 


nation  again  and  again  have  shown  unbelievable 
cruelties  and  destruction.  Usually  the  losses  and 
pains  are  not  only  on  the  side  of  the  losers  but  also 
on  the  side  of  the  winners.  So  the  question  is  why 
people  fight  voluntarily  and  willingly  in  wars.  Yet 
all  nations  and  societies  can  function  only  for  a 
time  on  the  basis  of  common  instrumental  aims 
and  interests;  in  the  long  run,  they  probably  need 
a  basis  of  common  values  and  norms,  which 
underlie  a  common  identity.  Moreover,  most  inter¬ 
ests  and  identities  are  not  mutually  exclusive  but 
work  as  reinforcing  foundations  of  integration. 
Without  these  forces  of  integration  any  nation  will 
fall  apart,  inner  conflicts  may  arise,  and  civil  war 
can  be  a  consequence. 

The  different  scenarios  of  war  between  nations 
or  civil  war  within  a  nation  may  depict  the  poles 
of  a  spectrum  from  extreme  national  identification 
(highly  salient  cognitive  national  identity  and 
strong  affective  identification)  to  a  total  loss  of  a 
common  national  identification  (be  it  against  a 
background  of  other  interests  or  identities,  such  as 
in  class  struggles,  race  riots,  or  religious  confronta¬ 
tions,  or  against  a  background  of  other  national 
identities,  such  as  in  secessionism  and  separatist 
movements).  Diverse  national  as  well  as  cross¬ 
national  survey  research  findings  show  that  a 
purely  cognitive  identification  with  a  nation 
(knowledge  about  one’s  national  belonging,  also 
called  national  consciousness)  usually  does  not  go 
along  with  a  bias  for  one’s  own  nation  and  aggres¬ 
siveness  toward  others.  Yet  a  purely  cognitive 
identification  is  seldom  seen.  More  often  national 
consciousness  is  accompanied  by  positive  feelings 
toward  one’s  nation — for  example,  those  expressed 
in  national  pride.  In  most,  but  not  all  countries  and 
periods  of  research  (also  depending  on  the  kind  of 
measurement  and  the  indicators  used),  the  strength 
of  positive  feelings  toward  one’s  own  nation  shows 
links  to  ingroup  bias  and  outgroup  discrimination. 
Thus,  for  example,  with  the  increasing  intensity  of 
national  identification,  it  is  not  only  the  probabil¬ 
ity  of  constructive  patriotism  (seen  as  critical  loy¬ 
alty  to  one’s  own  nation)  that  increases  but  also 
the  probability  of  blind  nationalism  (designating 
an  attitude  that  puts  one’s  own  nation  on  top  of 
everything,  regardless  of  its  state  of  affairs:  “Right 
or  wrong,  my  country”).  Moreover,  this  often 
coincides  with  skeptical  or  hostile  attitudes  toward 
immigrants  and  immigration  per  se  and  mistrust  of 
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other  countries  and  peoples.  Research  has  also 
shown  that  an  extremely  strong  national  identifi¬ 
cation  often  is  accompanied  by  low  identification 
with  other  political  collectives  and  social  groups 
and  even  more  private  roles — a  loss  of  other  identi¬ 
ties  giving  place  to  the  national  one  or  the  national 
one  replacing  the  others.  With  regard  to  determin¬ 
ing  factors,  within  Western  democracies  a  high 
level  of  education  and  a  favorable  socioeconomic 
position  often  lead  to  only  moderate  levels  of 
national  identification,  at  least  in  times  of  peace. 
By  contrast,  fast  social  change,  feelings  of  anomie, 
and  sometimes  feelings  of  relative  deprivation  fos¬ 
ter  national  and  nationalistic  attitudes.  A  perceived 
threat  from  immigrants  and  from  other  nations 
often  contribute  to  a  rise  in  national  identification. 
This  also  points  to  the  fact  that  national  identifica¬ 
tion  is  also  dependent  on  the  political  context. 
Thus,  for  instance,  in  Eastern  European  countries, 
national  identification  today  is  much  stronger  than 
in  most  Western  countries  as  a  consequence  of 
their  newly  won  freedom  from  the  former  Soviet 
empire  Similarly,  intellectuals  and  higher  status 
groups  in  non-Western  societies,  as  in  the  Middle 
East,  often  exhibit  a  higher  level  of  nationalism  (or 
religious  fundamentalism)  because  of  feelings  of 
relative  deprivation  and  of  being  dominated  by  the 
West,  as  noted  by  Bettina  Westle  and  Paolo  Segatti 
(2011).  Historical  facts  and  contextual  events  may 
also  lead  to  periods  where  great  parts  of  the  popu¬ 
lation  show  only  low  levels  of  national  identifica¬ 
tion  or  even  negative  feelings  toward  their  own 
nation.  This  has  been  observed  in  countries  that 
have  been  occupied  or  were  defeated  in  war  (e.g., 
Belgium)  and  in  countries  where  many  in  the 
population  felt  guilty  or  ashamed  about  former 
wars  and  war  crimes  (e.g.,  West  Germany). 

Thus,  all  in  all,  it  is  known  that  some  collective 
national  identification  is  necessary  for  the  inner 
cohesion  of  nation-states,  but  there  is  no  straight¬ 
forward  answer  to  the  question  of  how  much  inten¬ 
sity  of  national  identification  is  needed  and  at 
which  point  it  might  become  harmful.  It  can  be  said 
that  the  possible  effect  of  national  identification  not 
only  depends  on  the  intensity  of  these  feelings  but 
also  on  the  question  of  how  far  the  national  iden¬ 
tity  is  accompanied  by  other  identities  or  is  exclu¬ 
sive.  Moreover,  it  varies  with  the  self-image  of  the 
nation,  the  content,  meaning,  or  objects  of  national 
pride  (which  reflect  the  dimensions  of  the  ingroup 


definition).  Finally,  strong  positive  feelings  toward 
one’s  own  nation  are  a  precondition  for  national¬ 
istic  attitudes,  but  such  feelings  do  not  automati¬ 
cally  result  in  nationalistic  attitudes.  And  those 
feelings  are  per  se  neither  helpful  nor  harmful, 
unless  they  become  politically  mobilized  and  lead 
to  action.  In  Western  democracies,  mostly  populist 
and  right-wing  parties  mobilize  traditional  nation¬ 
alistic  attitudes  on  the  basis  of  economic  problems, 
which  then  typically  are  not  directed  against  other 
countries  but  against  the  perceived  cause  of  the 
economic  problems — for  example,  immigrants  or, 
in  Europe,  the  European  Union. 

Relationships  Between  Nonpolitical  and 
Political/National  Identifications 

National  identity  usually  coexists  with  other 
diverse  social  and  political  identities.  As  political 
psychology  shows,  one  can  feel,  for  example,  as  a 
woman,  a  mother,  a  scientist,  a  dancer,  and  a  mem¬ 
ber  of  a  nation.  Such  identities  belong  to  different 
domains  of  life  and  therefore  usually  do  not  com¬ 
pete  or  conflict  with  each  other.  Rather,  they  can 
be  activated  differently  in  diverse  situations;  for 
example,  talking  with  children  will  activate  one’s 
identity  as  a  mother,  going  to  a  party  will  highlight 
the  salience  of  one’s  identification  as  a  dancer  and 
probably  as  a  woman,  teaching  students  a  lesson 
will  activate  one’s  identity  as  a  scientist,  and  getting 
confronted  with  the  problems  of  foreign  students 
or  going  to  an  international  meeting  may  activate 
not  only  one’s  identity  as  a  scientist  but  also  one’s 
own  national  identity.  In  the  same  way,  national 
identity  is  compatible  with  various  other  political 
identities  within  a  country,  such  as  party  identifica¬ 
tion  or  being  a  trade  unionist,  and  even  with  identi¬ 
ties  that  go  beyond  the  nation-state,  such  as  being 
a  member  of  Amnesty  International  or  identifying 
with  Greenpeace.  In  the  latter  cases,  the  activation 
of  the  international  identity  may  compete  with  the 
national  identity,  but  it  also  can  strengthen  it, 
depending  on  the  concrete  conditions  of  the  situa¬ 
tion.  Because  national  identity  is  rather  close  to  the 
collective  end  of  the  continuum  of  possible  identi¬ 
ties  between  personal  and  collective,  it  can  include 
many  other  identities.  But  situations  that  will  pri¬ 
marily  or  even  exclusively  activate  national  identity 
are  rather  rare,  as  in  the  case  of  a  national  celebra¬ 
tion  or  in  cases  of  international  conflicts. 
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Relationships  Between  Different  Political/ 

National  Identifications  and  Between 
Subnational  and  National  Identities 

More  difficult  are  the  relationships  between 
national  identity  and  other  political  identities, 
which  also  refer  to  nationality  and/or  to  potential 
or  real  territorial-political  collectives.  Thus,  there 
are  many  countries  that  can  be  called  multina¬ 
tional  states,  because  people  who  identify  with 
more  than  one  nation  live  within  such  countries. 
This  is  caused  by  different  historical  developments, 
which  often  underlie  different  situations  and  rela¬ 
tionships  between  the  respective  nationalities. 

A  typical  example  for  a  historically  multina¬ 
tional  country  is  Switzerland,  where  French, 
German,  and  Italian  populations  (and  even  a  small 
Ladino  one)  live  together,  in  different  parts  of  the 
country,  with  a  great  deal  of  autonomy  and  with¬ 
out  major  problems.  Most  of  the  Swiss  citizens  feel 
as  though  they  are  Swiss  citizens,  though  they  may 
have  French-,  German-,  or  Italian-Swiss  identities 
as  well.  Another  example  is  Belgium,  with  a 
Walloon  (French  speaking),  a  Flemish  (Dutch 
speaking),  and  a  small  German-speaking  popula¬ 
tion.  Though  these  populations  live  in  different 
parts  of  the  country  and  there  are  highly  complex 
political  institutions  for  representation,  conflicts 
between  Wallonia  and  Flanders  arise  again  and 
again.  One  example  of  aggressive  conflicts  and  the 
desire  for  separation  is  Northern  Ireland,  though 
Britain  has  decentralized  its  political  structures. 
Spain  is  another  example  where  decentralization 
of  political  structures  was  undertaken  to  give 
people  of  different  historically  rooted  nationalities, 
who  also  live  in  different  parts  of  the  country, 
more  autonomy.  This  was  partly  successful,  for 
example,  in  Catalonia.  But  other  parts  now  also 
strive  for  more  autonomy,  and  Euskadi  (the 
Basque  part)  especially  has  a  separatist  movement, 
with  the  aim  of  creating  a  separate  state  together 
with  the  Basques  living  in  France.  On  an  individual 
level,  these  examples  show  that  citizens  tend  to 
identify  more  intensively  or  even  exclusively  with 
their  (officially)  subnational  identity  than  with 
their  national  one  and  sometimes  strive  to  form  a 
sovereign  nation  of  their  own.  Historically  rooted 
feelings  of  national  identity  that  accompany  con¬ 
temporary  distinguishing  features  with  regard  to 
other  parts  of  the  population  or  that  even  lead 
to  political  cleavages  (e.g.,  a  different  religion  or 


language,  translated  into  politics)  and/or  that  go 
along  with  feelings  of  economic  deprivation  are 
factors  that  often  lead  to  inner  conflicts  and  seces¬ 
sionist  or  separatist  movements.  But  a  favorable 
economic  situation  can  also  contribute  to  the  wish 
for  more  national  autonomy  to  avoid  having  to 
share  welfare  benefits  with  the  poorer  parts  of  a 
country,  as  is  the  case  with  northern  Italy.  Yet  to 
satisfy  the  desires  associated  with  such  political 
identities  through  institutions  of  territorial  auton¬ 
omy  is  evidently  possible  only  if  the  respective 
populations  with  a  common  we-feeling  live  pre¬ 
dominantly  in  different  internally  homogeneous 
parts  of  the  country  or  belong  to  a  diaspora  in 
which  national  minorities  live  in  adjacent  territo¬ 
ries  of  another  country. 

In  countries  that  are  made  up  of  many  different 
ethnicities/nationalities  and/or  in  countries  where 
the  populations  of  different  nationalities  are  spread 
over  the  territory,  such  solutions  are  not  possible. 
Such  countries  also  show  quite  different  histories 
and  current  situations  with  respect  to  questions  of 
national  identity.  One  typical  situation  is  produced 
by  nation-building  processes  in  postcolonial  coun¬ 
tries.  Often  boundaries  were  drawn  that  did  not 
respect  traditional  (prenational)  we-feelings,  such 
as  in  Africa,  and  therefore  may  lead  to  inner  con¬ 
flicts  as  well  as  to  conflicts  between  countries.  In 
other  postcolonial  cases,  such  as  in  India,  huge 
populations  with  quite  different  ethnicities,  tradi¬ 
tions,  languages,  and  cultural  customs  have  suc¬ 
ceeded  in  forming  a  common  political  identity. 

Another  constellation  could  be  found  after  the 
dissolution  of  greater  political  entities,  as  in 
Eastern  Europe  in  the  1990s  with  the  breakdown 
of  the  Soviet  empire.  The  Soviets  have  tried  for 
almost  a  century  to  build  a  common  communist 
identity  that  should  overcome  earlier  ethnic  and 
national  identities.  Yet  this  was  only  partly  suc¬ 
cessful,  and  today  these  countries  attempt  to 
rebuild  their  earlier  national  identities  in  a  situa¬ 
tion  of  newly  gained  political  sovereignty.  This 
situation  also  produced  new  national,  territorially 
concentrated,  as  well  as  dispersed  minorities  and 
diasporas.  Thus,  the  split  of  former  Czechoslovakia 
into  two  states  is  an  example  of  a  peaceful  solution 
for  the  national  self-determination  of  two  popula¬ 
tions  with  different  identities  and  living  in  different 
parts  of  a  country.  The  Baltic  countries  are  an 
example  of  relatively  peaceful  ways  of  dealing  with 
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minorities,  especially  the  large  Russian  minorities 
(mainly  members  of  the  former  military  and  their 
families)  by  granting  them  certain  minority  rights 
and  possibilities  to  gain  full  citizenship.  By  con¬ 
trast,  in  the  former  Yugoslavia,  extremely  violent 
conflicts  especially  between  (mainly  Catholic) 
Croats,  (mainly  Orthodox)  Serbs,  and  (mainly 
Muslim)  Bosniaks  developed,  which  destroyed  the 
country  and  led  to  a  division  into  several  parts. 

Other  typical  contexts  in  which  different  national 
identities  play  a  role  are  caused  by  migration. 
Today  there  are  so-called  immigration  countries — 
those  where  immigrants  dominate  an  indigenous 
population,  such  as  Australia,  or  where  national 
subcommunities  are  formed  from  different  sub¬ 
populations,  including  indigenous  populations, 
descendants  from  early  settlers,  and  new  immi¬ 
grants,  as  in  the  French-  and  English-speaking 
regions  of  Canada.  Another  example  is  the  United 
States,  where  the  population  includes  not  only  the 
indigenous  American  Indian  population  but  also 
the  descendants  of  immigrants  from  Europe,  Latin 
America,  and  Asia,  as  well  as  the  descendants  of 
African  slaves.  These  different  populations  come 
together  with  quite  different  backgrounds  with 
respect  to  their  identities — in  some  cases  with  a 
strong  national  identity,  in  others  with  a  negative 
national  identity  with  regard  to  their  countries  of 
descent,  and  in  some  with  a  nonnational  but 
mainly  ethnic  identity  based  on  race,  language,  or 
religion.  Against  this  background  of  different  eth¬ 
nic  or  national  identities  and  the  feeling  that  a 
common  political  national  identity  is  necessary  to 
guarantee  societal  integration  and  stability  of  the 
political  system,  different  ideas  have  developed 
(which  in  more  or  less  moderated  forms  also  have 
influenced  other  immigration  countries):  the 
perspective  of  a  melting  pot;  the  concept  of  a  mul¬ 
ticultural,  ethnic  pluralist  society;  and  the  group 
dominance  and  assimilation  perspective.  The  melt¬ 
ing  pot  concept  suggests  that  all  citizens  regardless 
of  their  ethnic  or  ancestral  national  background 
form  a  common  political  identity  and  in  the  long 
run  mingle  with  each  other  in  such  a  way  that  the 
different  identity  backgrounds  lose  importance 
and  form  a  new  common  composite  identity.  The 
multicultural  concept,  often  associated  with  the 
metaphor  of  a  salad  bowl  or  mosaic,  implies  that 
the  different  ethnic  and  national  groups  keep  their 
distinctive  characteristics  and  identities  but  are 


respected  as  equal  and  as  complementary  rein¬ 
forcing  loyalties.  The  group  dominance  concept 
articulates  that  in  multiethnic  and  immigration 
countries,  there  is  one  dominant  group  that  claims 
to  have  preeminent  rights  and  can  legitimately 
expect  newcomers  to  adapt  to  their  rules  and  life¬ 
style  and  in  the  long  run  adopt  the  dominant 
group’s  national  identity.  These  ideas  are  also 
linked  to  different  identity  politics.  The  melting 
pot  concept  disregards  ethnic  and  former  national 
identities.  By  contrast,  the  multicultural  concept  is 
linked  to  policies  such  as  affirmative  action  in 
order  to  support  the  different  identities,  especially 
in  cases  where  some  groups  have  historically  been 
deprived  of  social  or  political  rights  and  chances  of 
economic  advancement.  The  assimilation  concept 
is  linked  to  politics  that  promote  but  also  demand 
that  immigrants  adapt  to  the  dominant  group.  As 
different  and  contrary  as  these  concepts  are,  they 
all  are  intended  to  ensure  peaceful  relations 
between  groups  with  different  identities.  However, 
there  is  no  uncontested  evidence  that  shows  which 
concept  works  best.  For  example,  in  some  cases 
the  respect  and  support  of  different  identities  have 
helped  pacify  conflicts  between  groups,  but  in  oth¬ 
ers  they  have  led  to  a  reinforcement  of  ethnic, 
exclusive  identities  with  aggressive  potential 
toward  groups  of  other  identities.  Also,  when  indi¬ 
viduals  and  groups  keep  their  former  national 
identification  in  addition  to  the  new  one,  their 
national  loyalty  may  be  questioned,  especially  if 
the  country  of  origin  and  the  country  of  immigra¬ 
tion  should  become  involved  in  a  conflict.  Thus, 
identity  politics  in  multiethnic  and  multinational 
contexts  (with  dispersed  settlement)  may  involve  a 
choice  between  two  hazardous  alternatives — like 
the  one  the  sailors  confronted  between  Scylla  and 
Charybdis. 

Finally,  a  new  form  of  multiple  national  identity 
seems  to  arise  with  processes  of  globalization  and 
migration,  namely,  so-called  transnational  identi¬ 
ties.  They  may  become  characteristic  for  persons 
who  live  in  more  than  one  country,  perhaps 
because  they  work  in  more  than  one  country  or 
because  they  have  their  family  in  one  country  and 
work  in  another,  who  migrate  and  re-migrate. 
With  regard  to  multicultural  societies,  evidence 
and  speculations  about  this  kind  of  multiple  iden¬ 
tity  are  rather  diverse,  reaching  from  the  expecta¬ 
tion  that  for  such  individuals  national  identity  will 
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lose  any  relevance  and  they  will  develop  into  cos¬ 
mopolitans  to  the  contrary  expectation  that  the 
different  identities  may  reinforce  each  other  per¬ 
manently  or  to  the  expectation  that  those  identities 
permanently  will  conflict  with  each  other  and 
cause  individual  identity  crises. 

Relationship  Between  National  and  Supranational 
Identity:  The  European  Union 

The  idea  of  a  cosmopolitan  identity  as  well  as 
concepts  of  a  supranational  identity  mostly  remained 
apolitical,  because  they  did  not  have  any  specific 
political  object  of  identification.  Yet  this  is  chang¬ 
ing  in  the  EU,  as  it  develops  from  a  mainly  eco¬ 
nomic  community  to  a  political  unit.  Protagonists 
of  European  integration  increasingly  see  a  European 
identity  as  necessary  for  the  chances  and  stability  of 
this  multilevel  political  community.  A  crucial  differ¬ 
ence  from  other  situations  such  as  nation-building 
or  immigration  countries  is  that  the  EE1  consists  of 
countries  that  have  developed  their  national  identi¬ 
ties  for  a  long  time.  Thus,  if  a  common  European 
identity  is  to  develop,  it  should  not  be  in  competi¬ 
tion  with  the  primarily  ethnic  or  ancestral  national 
identities  but  should  be  formed  against  the  back¬ 
ground  of  current  national  identities.  Therefore, 
and  before  the  foundation  of  the  European 
Community  (EC)  was  based  on  the  aim  of  over¬ 
coming  national  rivalries  and  securing  peace 
between  its  members,  many  supporters  of  the  EC/ 
EU  expected  that  the  national  identities  would  lose 
importance  for  the  sake  of  a  developing  European 
identity.  Yet  empirical  data  (mainly  collected  in  the 
Eurobarometer  surveys)  show  no  decrease  in 
national  loyalties  and  no  rise  of  European  identity. 
They  also  reveal,  from  country  to  country  and  time 
to  time,  differing  relationships  between  national 
and  European  identifications,  with  correlations 
ranging  from  negative  to  none  to  positive.  Thus, 
the  idea  of  a  compatibility  of  national  and  European 
identity  gained  ground,  which  some  researchers 
conceptualize  as  a  hierarchical  system  (similar  to 
identifications  with  one’s  town,  one’s  region,  and 
one’s  nation  in  federal  countries)  and  others  as  a 
nested  structure  (like  a  Russian  matryoshka  doll), 
which  is  more  apolitical  and  refers  to  territories,  or 
as  a  structure  like  a  marble  cake,  in  which  national 
and  European  components  might  mix  and  blend. 
Although  compatibility  of  both  identities  can  be 


shown  for  most  of  the  member  countries,  the 
national  identities  until  now  remain  by  far  stronger 
than  the  European  one.  Moreover,  it  seems  as  if 
changes  in  the  relationship  between  the  two  identi¬ 
ties  follow  a  pattern,  which  has  to  do  with  the  EU 
policy  itself.  Times  of  enforced  integration  and 
policies,  which  bring  to  mind  a  competition 
between  member  countries  or  between  the  national 
and  the  EU  level,  tend  to  reduce  the  European 
identification  and  to  strengthen  the  national  ones 
perhaps  because  of  fears  about  a  loss  of  national 
achievements.  Thus,  it  is  still  an  open  question  how 
far  the  existing  European  identity  may  reach  in 
times  of  stress,  and  there  are  doubts  about 
an  EU-wide  loyalty,  for  instance,  in  case  of  redistri¬ 
bution  policies.  Reasons  for  the  difficulties  in 
developing  a  strong  European  identification  are 
manifold — for  instance,  the  absence  of  a  common 
history,  a  common  language,  culture,  religion, 
media  system,  and  so  on,  as  well  as  the  absence  of 
clear  boundaries  and  a  clear  aim  with  regard  to 
common  political  structures.  Researchers  who  con¬ 
ceptualize  European  identity  in  strong  analogy  to 
national  identities  therefore  are  mostly  pessimistic 
about  its  chances,  whereas  others  hope  for  the  pos¬ 
sibility  that  a  common  democratic  culture  might 
arise  out  of  the  heterogeneous  general  cultures  and 
that  this  demos-based  identity  can  be  become  suf¬ 
ficient  for  future  political  cohesion 

Content  and  Meanings  of  Political  Identities 

The  intensities  of  national,  subnational,  and  supra¬ 
national  identifications  and  the  links  between 
them  tell  something  about  the  relevance  of  these 
identities  but  nothing  about  their  content  or  mean¬ 
ing.  For  a  long  time,  answers  to  the  question  about 
the  meaning  of  national  identity  were  primarily 
informed  by  historical  and  macrosociological 
research.  These  mostly  formulated  dichotomous 
typologies,  for  instance,  state-building  nations  ver¬ 
sus  nation-building  states,  cultural  versus  political 
nations,  or  ethnic  versus  civic  nations.  Sometimes, 
the  former  were  linked  to  an  Eastern  model  and 
the  latter  to  a  Western  model.  Additionally,  the 
former  ones  are  seen  as  less  compatible  than  the 
latter  ones  with  democracy,  with  openness  toward 
immigration  and  with  peaceful  relations  to  other 
countries.  Thus,  the  ethnic  nation  is  associated 
with  a  high  potential  for  aggressive  nationalism, 
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whereas  the  civic  nation  is  linked  to  a  loyal  and 
constructive,  but  not  fanatical,  patriotism. 
Although  these  dichotomies  have  become  some¬ 
what  contested  in  recent  years,  on  the  individual 
level  they  are  mirrored,  albeit  not  in  a  perfect  way, 
as  shown  by  diverse  national  survey  data  and  by 
two  studies  in  1995  and  2003  of  the  International 
Social  Science  Programme  (ISSP).  An  ethnic 
national  identity  mostly  refers  to  primordial, 
ascribed  features  such  as  ancestry  (“blood”),  place 
of  birth,  and  place  of  living  and  religion,  whereas 
a  civic  identity  builds  on  achievable  and  voluntary 
characteristics  such  as  subjective  feelings  of 
belonging  and  political  rights  and  institutions. 
Additionally,  both  kinds  of  identity  can  be  linked 
to  cultural  objects,  for  example,  language  and  way 
of  life.  Yet  these  types  of  identity  do  not  distin¬ 
guish  between  nations  but  rather  between  differ¬ 
ent  strata  of  the  population  within  countries. 
Furthermore,  in  some  countries  something  like  a 
postnational  identity,  denying  the  relevance  of 
nationality  and  instead  referring  to  the  humanity 
as  such,  can  be  found.  In  any  case,  the  ethnic  type 
of  national  identity  tends  to  go  along  with  a  more 
closed  concept  of  the  nation,  in  respect  to  immi¬ 
grants  as  well  as  to  other  nations  and  suprana¬ 
tional  communities. 

Finally,  recent  research  questions  also  deals  with 
the  meaning  of  European  identity.  Some  works  on 
public  narratives  and  discourse  analyses  find  that 
national  identity  and  European  identity  are  inter¬ 
twined  in  a  strong  way.  Either  they  assume  that  the 
image  of  Europe  has  already  influenced  nation 
building  in  the  early  historical  phases,  or  others 
find  that  different  narratives  about  Europe  are 
built  into  the  national  narratives.  But  again,  analy¬ 
ses  of  ordinary  citizens’  thoughts  about  their 
national  identity  and  their  European  identity  (on 
the  basis  of  the  Europe-wide  survey  data  of  the 
project  “Integrated  and  United”  in  2008  and 
2009)  do  not  support  these  assumptions.  Thus, 
both  the  national  and  the  European  identity  con¬ 
cepts,  and  the  links  between  them,  do  not  distin¬ 
guish  different  nations  but  rather  different  groups 
of  individuals  within  nations.  An  ascribed  national 
identity  mostly  goes  along  with  a  concept  of 
European  identity,  which  rests  on  the  same  charac¬ 
teristics,  as  is  the  case  with  religion  as  a  basis  of 
both  identities  and  political  rights,  whereas  cul¬ 
tural  features  are  seen  as  a  dimension  that  differs 


in  the  respective  nations  and  in  Europe  as  a  whole. 
With  the  ongoing  processes  of  modernization, 
internationalization,  and  globalization,  the  mean¬ 
ings  of  political  identities  and  their  consequences 
will  remain  a  major  field  of  research  in  the  future. 
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The  word  ideology  was  coined  by  the  French  phi¬ 
losopher  Destutt  de  Tracy,  who  defined  it  as  the 
“science”  of  the  “intellectual  faculties”  of  animals, 
therefore  as  a  component  of  zoology  but  a  compo¬ 
nent  crucial  for  man,  an  animal  for  whom  thought 
is  fundamental.  What  Destutt  de  Tracy  called 
ideologic,  he  might  have  named  psychology 
(psyche,  “soul”  in  Greek)  if,  as  an  empiricist  and  a 
utilitarian,  he  had  not  disapproved  of  this  term. 
Ideology  as  he  meant  it  was  not  just  a  general 
grammar  but  a  form  of  logic  that  should  enable 
men  to  think  better  and  therefore  to  live  better 
together  in  society.  This  entry  first  examines  the 
evolution  of  ideology  in  Marxist  thought.  It  then 
describes  the  development  of  the  concept  of  ideol¬ 
ogy  in  20th-century  thought — for  example,  in  the 
analysis  of  education,  the  elaboration  of  structur¬ 
alism  from  a  Marxism  perspective,  the  explanation 
of  totalitarianism,  and  the  analysis  of  the  relation¬ 
ships  between  ideology,  science,  and  religion. 

Ideology  in  Marxist  Thought 

This  meaning  was  radically  transformed  by  Karl 
Marx  (and  Friedrich  Engels).  In  their  theory — 
devised  for  the  political  purpose  of  bringing  about 
a  liberating  revolution — ideology  constitutes  a 
sphere  of  society  to  be  conceptually  distinguished 
from  the  other  two  spheres,  the  economy  and 


politics.  Marx  took  up  the  word  ideology  and  used 
it  negatively  to  criticize  the  understanding  that  the 
driving  force  of  history  is  ideas;  Marx  claimed  that 
the  driving  force  is  not  ideology  but  labor.  Dividing 
thinkers  into  idealists  and  materialists,  Marx 
declared  that  ideology  was  the  idealist’s  illusion. 
However,  despite  the  fact  that  society  depended  on 
the  economy,  ideological  illusion  was  an  essential 
part  of  that  society,  necessary  to  its  operation. 
According  to  Marx,  professional  thinkers — that  is, 
philosophers,  clerics,  jurists — believed  that  the 
world  is  governed  by  ideas.  This  belief  had  to  be 
overturned,  for  in  fact  effective  history  was  the  his¬ 
tory  of  production,  labor,  and  modes  of  produc¬ 
tion.  Men’s  consciousness,  which  was  formalized 
in  philosophy,  religion,  and  law,  amounted  to  a 
mystifying  derivative  of  their  material  practices.  In 
sum,  Marx  explained,  ideology  was  “false  con¬ 
sciousness,”  and  in  their  false  perception,  men 
turned  reality  upside  down:  “If  in  all  ideology  men 
and  their  circumstances  appear  upside-down  as  in 
a  camera  obscura,  this  phenomenon  arises  just  as 
much  from  their  historical  life-process  as  the  inver¬ 
sion  of  objects  on  the  retina  does  from  their  physi¬ 
cal  life-process”  (Marx,  The  German  Ideology,  Pt. 
1,  sec.  A).  The  error  could  be  corrected,  but  the 
victims  of  the  illusion  were  caught  in  a  formidable 
trap:  They  did  not  know  they  were  under  the  spell 
of  false  representations — hence  Marx’s  stigmatiz¬ 
ing  of  religion  as  “the  opium  of  the  people.”  The 
same  applied  to  belief  in  political  representation 
(universal  suffrage,  elections,  parliaments);  such 
belief  mystified  men  by  denying  the  reality  of  class 
differences.  Marx’s  theory  of  ideology  is  thus 
inseparable  from  a  theory  of  domination  (Herr- 
schaft ):  Ideology  works  to  stabilize  the  existing 
social  order,  wherein  one  group  necessarily  domi¬ 
nates  all  others.  The  dominant  class  dominates 
economically  (as  the  labor-exploiting  class),  politi¬ 
cally  (it  holds  state  power),  and  ideologically  (its 
ideas  impose  themselves  as  a  general  belief  system). 

The  Concept  of  Ideology 

in  20th-Century  Thought 

Marx’s  analysis  of  ideology  as  illusion  was  explic¬ 
itly  reused  by  Pierre  Bourdieu  and  Jean- Claude 
Passeron  in  their  theory  of  how  pedagogical  and 
political  authority  is  legitimated.  The  initial  target 
of  their  critique  was  the  ideology  of  giftedness,  a 
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model  for  their  critique  of  ideology  as  legitimation 
of  authority.  In  the  modern  French  educational 
system,  Bourdieu  and  Passeron  explained,  teach¬ 
ers,  pupils,  and  parents  all  believe  that  scholastic 
success  depends  on  having  a  gift,  when  in  it  fact  it 
depends  on  the  “cultural  capital”  a  person  has 
inherited.  False  belief  in  giftedness  was  what 
enabled  the  system  to  function,  objectifying  the 
failure  of  some  pupils  and  legitimating  the  success 
of  others  in  psychological  terms  when  in  fact  that 
success  was  the  product  of  the  schooling  system’s 
“symbolic  violence.”  Domination  is  all  the  more 
effective  because  the  social  mass  of  “the  domi¬ 
nated”  is  blind  to  the  principle  of  domination. 
Theories  of  ideology  thus  became  linked  to  theo¬ 
ries  working  to  criticize  and  combat  illusion  and 
alienation. 

In  Marx’s  thinking,  the  polemic  aspect  was  cru¬ 
cial  because  while  criticism  was  an  active  practice 
amounting  to  revolutionary  mobilization,  it  was 
also  an  intellectual  one  (the  subtitle  of  Capital  is 
“Critique  of  Political  Economy”).  Ideology  was  to 
be  criticized  in  the  name  of  science,  which  com¬ 
bated  ideas  that  were  derived  not  from  true  knowl¬ 
edge  but  from  mere  opinion  (or  the  doxa,  a  Greek 
term  Bourdieu  liked  to  use).  But  ideology  was  not 
just  false;  it  was  also  linked  to  the  interests  of  the 
respective  classes  battling  each  other.  The  ideology 
of  the  nobility  in  ancien  regime  France  (according 
to  which  the  social  order  was  founded  on  three 
hierarchically  ranked  groups:  the  nobility,  on  top, 
followed  by  the  clergy,  and  then  the  third  estate) 
ran  up  against  the  ideology  of  the  bourgeoisie,  a 
class  fighting  in  favor  of  its  own  understanding — 
corresponding  to  its  own  interests — that  all  men 
had  equal  rights.  This  egalitarian,  individualist 
ideology  triumphed  with  the  French  Revolution, 
and  it  allowed  the  market  system  to  develop  and 
spread.  But  the  revolutionary  proletariat,  declared 
Marx,  could  free  itself  from  all  such  illusions  and 
attain  true  knowledge.  The  various  Marxist  theo¬ 
ries  could  therefore  present  themselves  as  true 
knowledge  that  coincided  with  the  only  true  ideol¬ 
ogy:  the  ideology  of  the  proletariat. 

While  ideologies  divided  society,  they  also  played 
a  functional  role:  It  was  precisely  because  ideology 
was  an  illusion  that  it  enabled  the  given  social  order 
to  maintain  itself.  In  Louis  Althusser’s  structural 
Marxist  perspective,  developed  in  the  mid-1960s, 
ideology  is  the  sphere  of  representation,  and  it 


could  be  an  efficient  cause  of  some  forms  of  social 
organization,  though  in  the  last  analysis,  that  role 
was  determined  by  economic  organization.  An 
example  of  this  relation  between  the  sphere  of  rep¬ 
resentation  and  the  economic  sphere  is  Christianity 
in  the  feudal  mode  of  production.  The  feudal 
arrangement  known  as  the  corvee  amounted  to 
blatant  exploitation  in  that  the  serf  had  to  work 
for  the  feudal  lord  for  a  certain  amount  of  time 
every  year,  receiving  nothing  in  exchange.  By  sanc¬ 
tifying  this  social  order,  Christianity  enabled  the 
feudal  system  of  extorting  “surplus  value”  to 
remain  in  place,  whereas  in  the  capitalist  economy, 
that  same  extortion  was  masked  by  the  form 
known  as  the  wage.  Moreover,  in  the  capitalist 
economy,  the  ideology  of  equal  human  rights 
allowed  for  the  perpetuation  of  a  system  based  on 
radical  inequality — the  inequality  between  those 
who  had  to  sell  their  labor  power  and  those  who 
were  in  a  position  to  buy  it.  Revolutionary  strug¬ 
gle,  then,  in  Marx’s  understanding,  was  first  and 
foremost  an  ideological  battle  whose  ultimate  pur¬ 
pose  was  to  dispel  illusion. 

The  understanding  that  ideology  stabilizes  exist¬ 
ing  social  power  relations  was  used  in  various 
forms  by  all  communist  regimes.  In  these  regimes, 
ideology — in  the  form  of  propaganda  and  dogma 
that  can  be  changed  at  any  moment  by  the  political 
power — would  promote  “true”  doctrine.  This  in 
turn  explains  why  for  Hannah  Arendt  and  Carl  J. 
Friedrich,  the  existence  of  a  single  mass  party  with 
an  official  ideology  was  one  defining  attribute  of 
the  totalitarian  syndrome.  There  was  no  need  for 
people  to  believe  in  the  ideology;  in  a  totalitarian 
system,  ideology  of  itself  forbade  all  discourse 
except  the  one  emanating  from  the  party,  and  it 
did  not  matter  that  people  knew  that  discourse  to 
be  a  false  one.  Here,  ideology  has  been  virtually 
emptied  of  content,  becoming  little  more  than  an 
obligation  to  lie  and  to  acclaim  the  party  and  its 
leader. 

In  Maoism,  on  the  other  hand,  ideology  was 
conceived  of  as  the  ultimate  condition  for  the  tri¬ 
umph  of  the  revolution.  Ideological  combat  was 
the  driving  force  of  Mao  Zedong’s  Cultural 
Revolution,  launched  in  1966.  The  understanding 
here  was  that  the  political  and  economic  revolu¬ 
tion  had  not  been  enough  to  transform  China 
radically,  lastingly;  what  was  needed  was  a  revolu¬ 
tion  in  belief  that  would  replace  the  bourgeois 
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ideology  with  a  new,  people’s  ideology  (wherein  a 
sense  of  the  collective  would  put  down  individual¬ 
ism,  for  example).  As  Mao  saw  it,  having  pure, 
radical  revolutionary  ideas  was  more  important 
than  having  experience  or  expert  knowledge.  The 
ideological  superstructure  was  presented  as  a  force 
with  the  power  either  to  slow  down  the  revolution 
or  to  drive  it  forward.  This  explains  the  attacks  in 
Mao’s  China  against  such  doctrines  as  revisionism 
and  Confucianism  and  also  the  violence  against 
people  who  bore  these  “deviations”  or  traditions 
within  them:  They  were  stigmatized  as — and 
treated  like — vermin  that  had  to  be  eradicated.  In 
this  way,  Maoism  actually  reversed  Marxism:  The 
structure  of  social  power  relations  was  understood 
to  depend  on  representations — that  is,  doctrinal 
purity.  The  Chinese  Cultural  Revolution  can  be 
understood  as  a  war  waged  by  one  component  of 
the  political  elite  against  other  components,  but  it 
did  not  bring  ideological  struggle  to  an  end;  it  con¬ 
tinues  in  post-Maoist  China.  The  party’s  hold  on 
the  means  and  content  of  communication  has 
been  the  determinant  for  keeping  it  in  power,  and 
it  does  not  hesitate  to  use  strong  coercive  means. 
In  any  case,  the  collapse  of  Soviet  communism 
definitively  devalued  the  prestige  of  communism  in 
the  West.  The  “end  of  ideology”  announced  by 
Daniel  Bell  in  1960 — namely,  the  end  of  the  pres¬ 
tige  of  communist  millenarianism — affected  not 
only  communist  sympathizers  but  also  the  com¬ 
munist  parties  themselves.  Some  authors  say  that 
religious  belief  has  returned  to  fill  the  void  left  by 
communist  ideology,  as  if  a  society  needed  ideol¬ 
ogy.  The  claim  leads  to  establishing  an  equivalency 
between  a  political  program  such  as  communism 
and  systems  of  representation  that  are  not  neces¬ 
sarily  linked  to  programs  for  transforming  the 
world.  This  brings  up  the  question  of  the  political 
meaning  of  Islamism  and  its  relation  to  theocratic 
programs. 

The  role  played  by  Marxism  as  dogma  in 
the  Soviet  regime  led  analysts  to  describe  it  as  an 
“ideocracy”:  Because  the  communist  party  and  its 
leaders  had  to  justify  any  and  all  decisions  with  refer¬ 
ence  to  an  absolute  truth  whose  prescriptions  had  to 
be  followed  in  all  circumstances,  communist  ideol¬ 
ogy  has  been  described  as  a  millenarian  religious 
thought  system.  The  same  term  has  been  applied  to 
Nazism,  but  this  disregards  the  real  difference 
between  the  rudimentary  totalitarian  language  of 


Nazism — the  Nazi  Weltanschaung,  comprising 
Fuhrerprinzip,  hatred  of  democracy  and  Marxism, 
and  anti-Semitism — and  communist  ideology, 
structured  by  abundant  and  in  some  cases  sophis¬ 
ticated  doctrines  and  numerous  cultural  refer¬ 
ences.  Communist  ideology  has  been  compared 
with  church  ideology,  and  the  term  ideocracy  has 
been  extended  to  include  power  systems  such  as 
Islam  in  which  religion  structures  social  life.  The 
anthropologist  and  political  thinker  Ernest  Gellner 
grouped  together  communism  and  Islamism  as 
ideocracies  but  specified  that  Islamic  regimes  do 
not  impose  a  single  economic  form  on  their  societ¬ 
ies  (as  communist  regimes  did)  and,  thus,  they 
allow  the  development  of  the  market  and  indus¬ 
trial  society.  From  this  perspective,  ideology 
appears  a  universal  social  dimension  that  can  take 
a  variety  of  forms,  depending  on  the  content  of  its 
doctrine  and  institutions.  For  example,  scriptural 
religions  such  as  Protestantism,  where  all  individ¬ 
uals  are  called  on  to  read  the  Bible  in  their  ver¬ 
nacular  language,  have  an  effect  on  the  social 
system  as  a  whole:  They  foster  national  cultural 
homogeneity  in  countries  where  they  are  the 
dominant  religion.  The  social  sciences’  use  of  the 
notion  of  ideology  presents  insoluble  epistemo¬ 
logical  problems.  Distinguishing  ideology  from 
true  scientific  knowledge  presupposes  that  those 
who  set  out  to  do  so  (such  as  Althusser  and 
Bourdieu)  are  in  possession  of  a  differentiation 
criterion.  Marx  believed  he  had  one;  he  believed 
that  history  itself  was  the  movement  of  a  self- 
affirming,  self-deploying  truth.  As  Bourdieu  saw 
it,  sociology — which  implies  doing  sociology  of 
sociology — would  provide  such  a  criterion.  This 
claim  may  seem  similar  to  Georg  Wilhelm 
Friedrich  Hegel’s  that  he  possessed  absolute 
knowledge  in  the  form  of  “the  science  of  science.” 

The  belief  that  ideology  stands  opposed  to  true 
knowledge,  that  it  is  false  consciousness  as  opposed 
to  knowledge  of  objective  truth,  may  thus  appear 
the  result  of  a  paradox,  since  what  is  assumed  to 
be  science  (historical  materialism  for  Marx  and 
Marxists,  critical  sociology  for  Bourdieu  and 
Bourdieusians)  is  understood  to  be  beyond  critical 
suspicion. 

Dominique  Colas 
Institut  d’Etudes  Politiques  de  Paris  (Sciences  Po) 

Paris,  France 


1146  Immigration  Policy 


See  also  Liberalism;  Marxism;  Political  Philosophy; 
Theocracy;  Totalitarianism;  Utopianism 

Further  Readings 

Althusser,  L.,  &  Balibar,  E.  (1967).  Reading  capital 
(B.  Brewster,  Trans.).  London:  Verso. 

Bell,  D.  (2000).  The  end  of  ideology:  On  the  exhaustion 
of  political  ideas  in  the  fifties.  Cambridge,  MA: 
Harvard  University  Press. 

Bourdieu,  P.,  &  Passeron,  J.-C.  (1977).  Reproduction  in 
education,  society  and  culture  (R.  Nice,  Trans.). 
London:  Sage. 

Gellner,  E.  (1994).  Conditions  of  liberty:  Civil  society 
and  its  rivals.  London:  Penguin. 

Marx,  K.,  &  Engels,  F.  (1972).  Excerpts  from  German 
ideology.  In  R.  C.  Tucker  (Ed.),  The  Marx-Engels 
reader  (pp.  146-200).  New  York:  W.  W.  Norton. 
(Original  work  written  1845-1846) 


Immigration  Policy 


The  migration  of  people  from  one  territory  to 
another  is  one  of  the  oldest  practices  of  mankind. 
However,  immigration  policy  is  a  broad  label 
encompassing  a  range  of  different  and  formally 
unrelated  issues  as  diverse  as  the  Viking  settlement 
of  Anglo-Saxon  England  following  the  Roman 
withdrawal  in  410  CE;  the  triangular  trade  in  slaves 
up  to  the  19th  century;  European  immigration  to 
the  New  World  in  the  19th  and  20th  centuries,  and 
large-scale  resettlements  following  conflicts  such  as 
World  War  II,  the  1947  India-Pakistan  War,  and 
the  Rwandan  genocide  of  1994.  The  first  part 
of  this  contribution  sets  out  what  these  issues  are  in 
the  countries  listed  above  before  moving  on  to 
consider  how  they  have  been  analyzed  in  political 
science. 

Defining  Immigration  Policy 

Conventionally,  immigration  policy  is  considered 
to  refer  primarily  to  the  responses  of  governments 
in  developed  countries  to  migratory  pressures  from 
less  developed  countries  post-1945.  These  essen¬ 
tially  “rich”  developed  countries  can  be  further 
subdivided  into  three  broad  categories.  First,  pre¬ 
dominantly  Anglo-Saxon  immigration  countries 


such  as  the  United  States,  Canada,  Australia,  and 
New  Zealand  have  traditions  of  accepting  immi¬ 
gration  that  long  predate  1945.  Second,  Western 
European  countries  have  since  1945  been  trans¬ 
formed  from  countries  of  emigration  to  countries 
of  immigration.  This  transformation  has  not  been 
uniform:  Thus,  France,  the  United  Kingdom  (UK), 
West  Germany,  the  Netherlands,  Belgium,  and 
Austria  all  experienced  large-scale  immigration 
during  the  1950s  and  1960s,  while  countries  such 
as  Spain,  Italy,  and  Ireland  have  only  become  des¬ 
tinations  for  immigration  since  the  end  of  the  Cold 
War.  Lastly,  the  countries  of  Central  and  Eastern 
Europe  have  themselves  also  become  destinations 
for  migration  since  the  turn  of  the  millennium  and 
especially  since  becoming  member  states  of  the 
European  Union  (EU)  in  2004  and  2007.  Notably, 
the  developed  countries  of  Asia,  particularly  Japan, 
Taiwan,  and  South  Korea,  have  only  experienced 
comparatively  minimal  levels  of  immigration. 

Despite  constituting  just  a  relatively  small  ele¬ 
ment  of  human  migrations  both  historically  and 
contemporarily,  immigration  to  developed  coun¬ 
tries  has  become  one  of  the  most  sensitive, 
controversial,  and  therefore  important  domestic 
political  issues.  For  this  reason,  the  area  has 
attracted  increasing  attention  from  scholars 
across  a  range  of  cognate  disciplines,  including 
economics,  sociology,  development  studies, 
anthropology,  geography,  demography,  history, 
education,  psychology,  and  political  science. 
Within  political  science,  scholars  of  public  policy, 
political  theory,  comparative  politics,  political 
economy,  and  international  relations  have  all 
contributed  to  our  understanding  of  this  field. 

At  its  most  elementary  level,  immigration  policy 
addresses  the  control  of  entry  to  a  state’s  territory. 
The  physical  control  of  borders,  through  passport 
controls,  visa  requirements,  or  quite  simply  a  bar¬ 
rier,  remains  one  of  the  key  deliverables  for  any 
government  and  indeed  constitutes  one  of  the  fun¬ 
damental  elements  of  state  sovereignty.  In  this 
context,  the  EU’s  policy  in  recent  decades  of 
removing  border  checks  between  its  member  states 
(the  so-called  Schengen  Agreement)  and  replacing 
them  with  a  closely  guarded  external  border  vis-a- 
vis  third  countries  is  particularly  significant. 
However,  and  especially  in  the  modern  age  of 
global  travel,  a  high  degree  of  control  over  fron¬ 
tiers,  although  frequently  promised  in  election 
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campaigns,  is  seldom  easy  to  deliver,  and  borders, 
whether  they  are  the  U.S.  border  with  Mexico,  the 
Israeli  border  with  Gaza,  or  the  EU’s  external  bor¬ 
ders  in  Eastern  Europe  and  the  Mediterranean  Sea, 
are  notoriously  porous.  In  consequence,  illegal 
immigration  remains  a  major  challenge,  both  from 
a  humanitarian  perspective  of  human  trafficking 
and  in  the  context  of  enforcement.  Prevention  and 
return  have  become  a  top  priority  in  international 
diplomacy:  Thus,  the  EU  has  signed  readmission 
agreements  with  a  range  of  countries,  as  well  as 
seeking  the  cooperation  of  countries  such  as  Libya 
in  preventing  boats  from  leaving  for  southern  Italy 
in  the  first  place. 

Five  Dimensions  of  Immigration  Policy 

When  it  comes  to  managing  and  shaping  migration 
flows,  as  opposed  to  simply  controlling  their  entry, 
it  is  possible  to  identify  five  principal  constituent 
elements  of  this  field:  labor  migration,  the  increased 
importance  of  political  asylum,  the  growth  of 
immigration  of  dependents,  ethnic  immigration, 
and  issues  related  to  the  integration  of  immigrants 
into  the  host  country. 

Labor  Migration 

First,  labor  migration  has  perhaps  been  the  sin¬ 
gle  most  important  element  of  immigration  policy, 
and  it  underpins  much  of  the  migration  that  devel¬ 
oped  countries  have  received  over  the  past  60-odd 
years.  This  is  hardly  surprising  given  the  existing 
and  indeed  growing  differences  in  wealth  between 
developed  countries  and  their  neighbors.  The 
desire  to  work  is  also  a  major  driving  force  behind 
illegal  immigration  and  residence.  While  the 
authorities  in  developed  countries  do  formally  seek 
to  clamp  down  on  such  activities,  for  instance,  by 
raiding  building  sites  and  farms  where  illegal 
immigrants  are  known  to  work,  they  are  also  often 
tacitly  tolerated  as  a  useful  contribution  to  the 
economy.  Several  countries,  such  as  Spain,  Italy, 
and  the  United  States,  have  also  instituted  amnes¬ 
ties,  sometimes  repeatedly,  to  regularize  the  status 
of  illegal  immigrants. 

Labor  migration  can  be  both  high  skilled  and 
low  skilled.  For  instance,  much  of  the  labor  migra¬ 
tion  to  Western  Europe  up  to  the  early  1970s, 
which  numbered  millions  of  people,  was  low 


skilled,  largely  from  Mediterranean  countries  such 
as  Italy,  Spain,  Yugoslavia,  and  Turkey.  In  some 
countries,  notably  West  Germany,  such  migration 
was  explicitly  expected  to  be  temporary  in  nature, 
leading  to  the  coining  of  the  (pejorative)  term 
Gastarbeiter  (“guest  worker”).  Labor  migration 
was  radically  cut  back  during  the  1970s  and 
1980s,  but  high-skilled  migration  has  witnessed  a 
resurgence  since  the  late  1990s,  with  a  range  of 
countries  now  introducing  points-based  immigra¬ 
tion  systems  originally  pioneered  in  Australia  and 
Canada. 

This  resurgence  has  been  linked  to  two  key 
developments  in  Western  societies:  first,  the  twin 
phenomena  of  increasing  life  expectancy  and 
falling  birthrates  has  created  a  demographic 
imbalance  and  the  prospect  of  falling  populations 
in  many  countries,  especially  Germany,  Italy, 
Spain,  and  Eastern  European  countries,  and  labor 
migration  is  one  key  option  for  redressing  this. 
Thus,  the  UK’s  decision  not  to  close  its  labor  mar¬ 
ket  to  the  new  member  states  of  the  EU  in  2004 
led  to  large-scale  labor  migration  of  up  to  1  mil¬ 
lion  persons,  as  a  result  of  which  the  UK’s  popula¬ 
tion  is  projected  to  increase  rather  than  decrease 
over  the  coming  decades.  Similarly,  the  compara¬ 
tively  dynamic  state  of  the  U.S.  population  is  also 
largely  down  to  recent  immigration.  The  second 
key  development  is  skills  shortages,  which  have 
emerged  in  a  range  of  sectors,  most  notably  infor¬ 
mation  technology. 

Political  Asylum 

The  second  element  of  immigration  policy  has 
been  the  rise  in  importance  of  political  asylum. 
Asylum  is  a  humanitarian  policy  designed  to  afford 
protection  to  those  persecuted  for  their  political 
views  in  their  home  countries  and  its  legal  expres¬ 
sion,  the  1951  Geneva  Convention,  constitutes  one 
of  the  best  known  cornerstones  of  international 
law.  With  low  numbers  of  claimants  globally  until 
the  end  of  the  1970s,  the  ability  to  seek  political 
asylum  was  initially  largely  notional.  However, 
since  then,  numbers  have  risen  strongly,  generally 
affecting  those  countries  with  an  already  existing 
sizeable  immigrant  population.  Thus,  West  Germany 
was  the  principal  destination  for  asylum  seekers  in 
Europe  during  the  1980s;  since  then,  the  UK, 
France,  Netherlands,  Austria,  Switzerland,  and 
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Belgium  have  also  all  emerged  as  major  recipient 
countries.  With  rising  numbers  have  come  rising 
costs  and  not  infrequently  social  tensions,  which  in 
turn  have  propelled  asylum  up  the  political  agenda. 
In  response,  governments  have  introduced  a  range 
of  measures  to  restrict  the  ability  of  refugees  to 
lodge  asylum  claims,  such  as  visa  restrictions  for 
major  countries  of  origin,  the  principle  of  the  “safe 
third  country”  (i.e.,  transit  country),  and  the  “safe 
country  of  origin”  principle.  Refugee  organiza¬ 
tions  such  as  the  UNHCR  (the  Office  of  the  United 
Nations  High  Commissioner  for  Refugees)  criti¬ 
cize  that  such  measures  undermine  the  spirit,  if  not 
the  letter,  of  the  Geneva  Convention. 

The  annual  numbers  of  asylum  seekers  have 
fluctuated  widely  over  the  years.  The  early  and  late 
1990s  were  major  periods  of  asylum  migration, 
although,  since  the  early  2000s,  applications  have 
dropped  back.  In  the  case  of  Germany,  which  had 
hitherto  been  the  world’s  most  significant  destina¬ 
tions  for  asylum  seekers,  these  are  now  historically 
low  levels.  In  processing  asylum  cases,  government 
agencies  have  generally  tended  to  take  a  restrictive 
view  of  claims,  and  initial  recognition  rates  have 
generally  been  around  25%,  although  in  some 
countries  they  are  as  low  as  5%.  Nonetheless,  the 
tendency  of  rejected  asylum  seekers  to  appeal  their 
decision  through  the  courts,  plus  logistical  difficul¬ 
ties  in  physically  deporting  such  persons  once  legal 
avenues  have  been  exhausted,  has  meant  that 
many  asylum  claims  take  years  to  complete,  which 
has  in  turn  led  to  a  considerable  backlog  of  cases 
in  many  countries. 

Immigration  of  Dependents 

Third,  most  developed  countries  have  since  the 
1970s  experienced  a  rapid  growth  in  the  immigra¬ 
tion  of  dependents.  Such  migration  is  considered 
“secondary”  because  it  cannot  take  place  without 
the  immigration  of  a  “primary”  migrant — typically 
a  labor  migrant  but  also  potentially  a  recognized 
refugee.  The  quantitative  dimension  of  such  sec¬ 
ondary  migration  should  not  be  underestimated, 
and  dependent  migration  currently  constitutes  the 
largest  form  of  legal  migration  to  developed  coun¬ 
tries,  exceeding  both  asylum  and  labor  migration. 
Crucially,  dependent  migration  is  generally 
regarded  as  a  human  right  and  in  consequence, 
many  countries  have  not  found  it  easy  to  impose 


restrictions  on  this  form  of  migration.  Nonetheless, 
the  right  to  family  reunification  has  certainly  been 
circumscribed  in  recent  years,  with  countries 
imposing  conditions  such  as  minimum  living 
space,  maximum  ages  for  dependent  children,  and, 
most  recently,  the  introduction  by  several  European 
countries  of  pre-entry  requirements  such  as  the 
successful  completion  of  language  and  integration 
courses. 

Ethnic  Immigration 

The  fourth  aspect  is  ethnic  immigration,  which 
broadly  speaking  refers  to  privileged  entry  condi¬ 
tions  sometimes  granted  by  countries  to  persons  of 
specific  ethnic  and/or  cultural  origins.  For  instance, 
Israel  guarantees  residence  to  Jews  from  all  over 
the  world,  regardless  of  whether  they  have  any 
active  ties  to  Israel.  Likewise,  the  origins  of  the 
Green  Card  Lottery  in  the  United  States  (through 
which  50,000  permanent  green  cards  are  issued 
annually  on  the  basis  of  computer-generated  ran¬ 
dom  selection)  lie  in  a  decision  by  Congress  to 
privilege  immigration  from  Ireland.  But  the  single 
largest  instance  of  this  immigration  has  been  to 
Germany,  with  more  than  4  million  ethnic  Germans 
having  immigrated  since  1950.  Initially,  these 
came  from  Poland,  Czechoslovakia,  and  Romania, 
but  since  the  late  1980s  these  have  come  over¬ 
whelmingly  from  the  countries  of  the  former  Soviet 
Union.  Typically,  such  ethnic  Germans  had  been 
living  in  these  countries  for  centuries  but  had  then 
experienced  persecution  at  the  hands  of  the  com¬ 
munist  authorities.  Germany  initially  offered  gen¬ 
erous  immigration  conditions,  including  extensive 
financial  assistance,  but  the  very  large  numbers 
that  arrived  between  1989  and  1993  (more  than 
1.4  million)  led  to  this  being  sharply  curtailed.  The 
relative  weakness  of  the  German  economy  since 
2000  has  caused  ethnic  German  migration  to  fall 
back  to  much  lower  levels. 

Integration  of  Immigrants 

The  final  aspect  of  policy  to  be  considered  here 
is  in  fact  one  of  the  most  difficult:  the  successful 
integration  of  immigrants  into  their  new  host  soci¬ 
eties.  Strictly  speaking,  integration  is  not  related 
to  the  process  of  immigration  itself;  however, 
because  integration  arises  as  a  direct  consequence 
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of  immigration,  the  two  are  conventionally  viewed 
under  the  same  rubric.  The  principal  policy 
challenge  here  is  that  immigrants  across  the  devel¬ 
oped  world  broadly  speaking  show  similar  socio¬ 
economic  deficits  compared  with  the  indigenous 
population,  including  higher  unemployment,  lower 
educational  outcomes,  lower  wages,  and  substan¬ 
dard  housing.  That  said,  there  are  significant  differ¬ 
ences  in  the  socioeconomic  profile  of  immigrant 
groups:  For  instance,  in  the  UK,  people  of  Indian 
origin  are  generally  better  integrated  socio¬ 
economically  than  those  of  Bangladeshi  origin. 

The  challenge  of  integration  encompasses  a  par¬ 
ticularly  wide  range  of  policy  fields,  including 
education,  labor  market,  housing,  health,  and  law 
and  order.  It  includes  not  only  residence  policy  but 
also  citizenship  and  naturalization  policy.  What  is 
more,  integration  is  a  highly  symbolic  area  and 
touches  on  powerful  mobilizers  such  as  cultural 
identity  and  belonging.  For  instance,  the  question 
of  Muslim  head  scarves  in  France,  and  especially  in 
its  schools,  has  polarized  public  opinion  for  more 
than  20  years  and  is  considered  by  some  to  be 
challenging  the  Republican  emphasis  on  secular¬ 
ism.  Similarly,  populist  politicians  such  as  Pirn 
Fortuyn  and  Geert  Wilders  in  the  Netherlands,  and 
Jorg  Haider  in  Austria  and  the  French  Front 
National,  have  often  politicized  what  they  see  as 
incompatibility  between  Islam  and  the  respective 
indigenous  culture.  In  this  way,  integration  covers 
both  substantive  and  symbolic  issues,  and  in  polit¬ 
ical  debate,  perhaps  unsurprisingly,  the  two  often 
get  (deliberately)  blurred. 

Certainly,  since  2000,  governments  across 
Europe  have  imposed  new,  largely  symbolic  restric¬ 
tions  on  the  acquisition  of  citizenship  by  foreigners. 
Thus,  most  countries  now  require  applicants  for 
naturalization  to  pass  a  citizenship  test  and  to 
attend  a  citizenship  ceremony,  often  involving  an 
oath  of  allegiance  as  well.  Such  practices  are  of 
course  well  established  in  traditional  countries  of 
immigration  such  as  the  United  States,  Canada,  and 
Australia,  and  indeed  these  countries’  experiences 
have  explicitly  served  as  a  template  for  the  introduc¬ 
tion  of  these  citizenship  policies  in  Europe.  At  the 
same  time,  there  has  been  a  general  liberalizing 
trend  across  Europe  in  the  acquisition  of  citizenship 
at  birth,  with  countries  increasingly  using  ius  soli 
(the  territorial  principle)  as  well  as  ius  sanguinis  (the 
principle  of  descent).  In  parallel,  dual  citizenship, 


which  once  was  widely  rejected  in  European  citizen¬ 
ship  policies,  is  increasingly  accepted,  if  not  for¬ 
mally  then  through  far-reaching  exemptions. 

Political  Science  and  Immigration  Policy 

Together,  then,  these  five  key  dimensions  illustrate 
the  complexity  and  the  multifaceted  nature  of  this 
controversial  policy  field  of  immigration  policy. 
Arguably,  this  explains  why  scholars  have  strug¬ 
gled  so  far  to  develop  an  integrated,  multidisci¬ 
plinary  perspective  on  immigration  that  can 
address  the  holistic  aspect  of  the  field  while  still 
capturing  the  specificities  of  its  individual  constitu¬ 
ent  elements.  Indeed,  the  dominant  explanations 
for  migration  and  integration  have  come  from 
economics  and  sociology,  respectively.  For 
instance,  economists  argue  that  wage  differentials 
between  economies  create  incentives  for  labor 
migration  from  the  lower  wage  (or  poorer)  coun¬ 
try  to  the  higher  wage  (or  richer)  country.  Certainly, 
this  is  a  powerful  explanation  and  helps  account 
for  both  formal  and  informal  labor  migration  over 
the  decades,  including  the  guest  worker  migration 
to  West  Germany,  illegal  migration  from  Mexico 
to  the  United  States,  and  Polish  migration  to  the 
UK  after  its  2004  accession  to  the  EU.  But  equally, 
straight  economics  encounters  difficulties  in  fully 
accounting  for  why  some  countries  receive  more 
asylum  seekers  than  others. 

From  the  perspective  of  political  science  and  its 
several  subdisciplines,  including  public  policy  anal¬ 
ysis,  immigration  policy  is  a  relatively  new  object  of 
study,  and  scholars  have  yet  to  bring  its  full  ana¬ 
lytical  toolkit  to  bear  on  its  various  elements.  To 
date,  though,  a  number  of  key  approaches  can  be 
identified  in  the  literature.  The  first  approach  is 
grounded  in  political  economy,  regulation,  and  wel¬ 
fare,  with  Gary  Freeman  as  its  leading  exponent.  In 
his  work,  he  emphasizes  the  role  that  organized 
interests,  and  especially  business  interests,  play  in 
structuring  immigration  policy,  thereby  creating  a 
clear  pressure  toward  liberalization.  A  second 
approach  draws  on  the  role  universal  human  rights 
play  in  requiring  states  to  accept  “unwanted” 
immigration,  including  dependent  migration.  At  the 
same  time,  as  Christian  Joppke  (1999)  points  out, 
states  have  proved  to  be  quite  activist  in  responding 
to  such  challenges;  for  instance,  Germany’s  1993 
constitutional  amendment  to  limit  asylum  can  be 
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seen  as  a  prima  fade  example  of  a  state  wresting 
back  control  over  a  policy  are. 

Joppke’s  analysis  is  particularly  germane  to  the 
issue  of  state  sovereignty,  which  in  turn  has  under¬ 
pinned  the  contribution  of  international  relations 
to  our  understanding  of  immigration  policy.  Here, 
the  question  of  control  and  the  ability  of  states  to 
exercise  it  have  traditionally  been  dominant. 
However,  more  recently,  other  aspects  have  come 
to  the  fore,  in  particular  globalization,  as  Saskia 
Sassen  points  out,  and  increasingly  security,  as 
Barry  Buzan,  Ole  Waever,  and  Japp  de  Wilde  have 
described.  Indeed,  Samuel  Huntington  took  the 
latter  argument  several  steps  further  in  an  (in) 
famous  article  mapping  out  a  “clash  of  civiliza¬ 
tions”  between  Christianity  and  Islam — a  relation¬ 
ship  that  of  course  lies  at  the  heart  of  much  of  the 
debate,  at  least  in  Europe,  about  integration. 

Beyond  international  relations  and  political 
economy,  a  number  of  seminal  contributions  have 
been  grounded  in  political  theory,  reflecting  the 
fact  that  asylum  and  citizenship  are  two  issues  that 
directly  intersect  with  notions  of  equality,  justice, 
ethics,  and  pluralism.  As  well  as  discussions 
of  asylum  from  this  perspective  (e.g.,  Matthew 
Gibney,  2004),  multiculturalism  has  emerged  from 
the  pluralist  stable  as  an  influential  model  for 
immigrant  integration  (Bhikhu  Parekh,  2005). 
However,  although  multiculturalism  remains  pop¬ 
ular  in  the  New  World  and  especially  Canada  and 
Australia,  it  has  found  fewer  devotees  in  Europe: 
On  the  contrary,  recent  debates  in  the  Netherlands 
and  Germany  have  reemphasized  the  more  assimi- 
latory  nature  of  integration. 

Elsewhere  in  the  study  of  citizenship  and  inte¬ 
gration,  Adrian  Favell’s  exploration  of  the  philoso¬ 
phies  of  integration  in  the  UK  and  France  remains 
an  indispensable  point  of  reference,  as  does 
Yasemin  Soysal’s  provocative  discussion  of  “per- 
sonhood”  and  its  implications  for  the  “hollowing 
out”  of  national  citizenships  in  Western  Europe. 
Other  authors,  such  as  Randall  Hansen  (2000)  and 
Simon  Green  (2004),  emphasize  the  importance  of 
history  and  path  dependence  in  understanding  the 
evolution  and  operation  of  citizenship  policy  in 
France,  Germany,  and  the  UK. 

Last  but  not  the  least,  a  thriving  subfield  has 
sprung  up  in  recent  years  examining  the  develop¬ 
ment  and  role  of  the  EU  in  immigration  policy.  The 
creation  of  the  Schengen  area  plus  the  importance 


of  some  common  labor  market  policies  has  made 
immigration  within  the  EU  a  more  important  con¬ 
cern.  As  well  as  important  overviews  of  this  field, 
especially  that  of  Andrew  Geddes  (2008),  security 
has  been  the  focus  of  extensive  attention. 

Conclusion 

Although  a  brief  overview  such  as  this  cannot  hope 
to  do  full  justice  either  to  the  complexities  of  the 
field  of  immigration  policy  or  to  the  rich  variety  of 
the  scholarly  canon  dealing  with  this  policy,  this 
entry  has  sought  to  do  two  things.  First,  it  has 
sketched  out  in  broad  terms  the  nature  and  chal¬ 
lenges  of  immigration  policy  as  well  as  political 
actors’  and  institutions’  responses  to  it  in  com¬ 
parative  perspective.  Second,  it  has  introduced  the 
principal  scholarly  contributions  of  political  sci¬ 
ence  to  our  understanding  of  immigration  policy  in 
all  its  forms.  There  is  no  doubt  that  collectively 
these  contributions  have  served  to  emphasize  the 
centrality  of  the  discipline  to  the  purpose  of  under¬ 
standing  this  vital  policy  field  more  broadly.  At  the 
same  time,  much  remains  to  be  done.  Leaving 
aside  the  perhaps  rather  lofty  aspiration  of  devel¬ 
oping  an  integrated,  cross-disciplinary  model  of 
immigration,  the  challenge  for  political  scientists  in 
this  field  is  to  provide  further  insights  at  all  levels, 
to  include  the  roots  of  party  political  positions  on 
immigration,  psephological  aspects  of  immigrant 
voters,  the  formulation  processes  of  government 
policy,  the  development  of  cross-national  migra¬ 
tion  responses,  the  evolution  and  formalization  of 
citizenship  and  integration  policy,  and  the  process 
of  administration  of  these  policies. 
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The  concept  of  policy  impacts  refers  to  the  effects 
and  consequences  of  public  policy  on  individuals, 
groups,  the  broader  society,  and  the  natural  world 
in  which  we  live.  All  policies  have  intended  targets 
but  are  more  or  less  successful  in  attaining  their 
stated  goals.  This  entry  examines  the  nature  of 
public  policy  in  political  science,  discusses  several 
taxonomies  for  understanding  the  targets  for  pub¬ 
lic  policy,  and  also  discusses  the  linkages  between 
the  design  of  policies  and  their  final  outcomes. 

Public  Policy  and  Political  Science 

Public  policies  are  the  mechanisms  used  by  govern¬ 
ing  authorities  to  allocate  benefits,  burdens,  and 
regulations  for  the  society.  Public  policies  are  what 
governments  and  their  agents  produce,  including 
statutes,  budgets,  resolutions,  proclamations,  pro¬ 
grams,  and  direct  contacts  with  people.  Policies 
must  be  understood  as  nested  in  one  another.  Thus, 
the  U.S.  Constitution  is  a  fundamental  public  pol¬ 
icy.  Statutes  passed  by  Congress  or  state  legislatures 


also  are  policies,  as  are  the  guidelines  written  by 
agencies  as  policy  is  implemented.  Court  rulings 
may  change  a  statute,  and  thus  court  rulings  also 
are  public  policies.  A  program  established  by  a 
local  government  to  regulate  or  provide  services  is 
itself  a  part  of  the  public  policy  chain.  These  poli¬ 
cies  are  revealed  as  texts  and  also  in  the  actual 
practices  and  discourse  of  those  involved  in  crafting 
or  implementing  public  policy.  Policy  as  written 
may  not  be  the  way  it  actually  is  perceived  and  may 
or  may  not  be  put  in  place  exactly  as  written.  Policy 
may  have  intended  as  well  as  unintended  conse¬ 
quences:  direct  effects  and  indirect  ones. 

Although  the  effects  of  public  policy  have 
always  been  of  concern  to  political  science,  this 
field  was  slow  to  develop.  Part  of  the  reason  is  the 
sheer  complexity  of  policy  content  and  the  diffi¬ 
culty  in  finding  common  elements  across  different 
levels  and  types  of  policy.  Another  reason  for  the 
slow  development  of  theory  about  policy  impacts 
is  the  assumption  that  policy  consequences  are 
guided  mainly  by  policy- specific  theory,  which 
often  is  simply  not  an  accurate  understanding  of 
policy  effects.  So,  for  example,  it  is  sometimes 
assumed  that  the  impact  of  environmental  policy 
depends  on  what  theory  of  the  environment  is 
embedded  in  the  policy  rather  than  on  theories  of 
human  behavior  and  values. 

During  the  past  20  years,  however,  political  sci¬ 
ence  has  embraced  the  study  of  policy  conse¬ 
quences  to  the  point  that  some  are  advocating  a 
“policy-centric”  approach  to  the  study  of  politics 
in  which  two  major  questions  dominate  the  field. 
The  first  is  how  to  explain  and  understand  how 
and  why  governments  produce  the  kinds  of  policy 
designs  that  they  have,  and  the  second  is  how  to 
understand  the  impacts  of  those  designs. 

Theories  of  Policy  Consequences 

Theodore  Lowi  is  generally  credited  with  being  the 
first  to  offer  a  systematic  theory  of  policy  impacts. 
He  popularized  the  phrase  policy  creates  politics, 
thereby  reversing  the  common  causal  pattern  in 
political  science.  Lowi  contends  that  there  are 
characteristics  of  policy  that  encourage  affected 
people  to  mobilize  and  that  without  these  charac¬ 
teristics,  elites  will  dominate.  His  typology  is  based 
on  two  dimensions:  the  probability  of  coercion 
and  whether  the  policy  identifies  specific  targets  or 
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consists  of  general  rules  that  affect  the  environ¬ 
ment  of  groups.  From  these  dimensions,  he  posits 
four  types  of  policy:  distributive,  regulatory,  redis¬ 
tributive,  and  constituent.  Regulatory  policy, 
involving  the  imposition  of  costs  in  the  environ¬ 
ment,  will  produce  pluralist  competition  according 
to  Lowi.  Policy  becomes  a  temporary  compromise 
between  competing  groups,  but  if  policy  becomes 
too  harsh  on  some,  they  are  expected  to  mobilize 
and  change  the  governing  officials  or  change  the 
policy.  Redistributive  policy,  such  as  welfare  pol¬ 
icy,  Lowi  contends,  does  not  produce  pluralist 
forms  of  subsequent  participation  but  instead  is 
dominated  by  elites  who  lead  major  segments  of 
society  who  act  to  insure  social  stability. 
Distributive  policy  is  the  classic  pork  barrel  and  is 
inherently  elitist,  according  to  Lowi’s  framework. 
His  fourth  type,  constituent  policy,  produces  indi¬ 
vidualistic  claims  on  government. 

James  Wilson  proposed  an  alternative  to  Lowi’s 
typology.  Wilson’s  typology  begins  with  whether 
benefits  are  narrowly  focused  or  broadly  cast,  and 
whether  costs  are  narrowly  focused  or  dispersed 
broadly  among  the  public.  When  benefits  are 
broadly  distributed  (e.g.,  national  defense)  and 
costs  are  broadly  distributed  (e.g.,  taxes  support¬ 
ing  national  defense),  then  all  people  have  about 
equal  incentives  to  incorporate  this  policy  into 
their  behavior.  The  result  is  majoritarian  politics. 
In  majoritarian  politics,  elected  officials  pay  spe¬ 
cial  attention  to  what  the  majority  wants.  When 
benefits  are  concentrated  on  just  a  few  (such  as 
welfare  recipients)  but  costs  are  distributed  widely 
among  taxpayers,  there  is  little  incentive  for  those 
bearing  the  costs  to  pay  attention  to  the  policy 
itself  but  great  advantage  to  potential  beneficia¬ 
ries.  This  produces  what  Wilson  called  a  clientist 
style  of  politics  in  which  elected  leaders  distribute 
favors  to  their  clients  while  apathetic  taxpayers 
do  not  pay  attention.  A  third  type  of  politics, 
entrepreneurial  politics,  occurs  when  small  por¬ 
tions  of  society  pay  the  costs  to  benefit  large 
groups — for  example,  education.  This  type  of 
politics  results  in  virtually  uncontrolled  growth 
because  of  the  strong  political  payoffs  for  distrib¬ 
uting  numerous  benefits  at  the  expense  of  so  few. 
Since  most  policies  are  either  entrepreneurial  or 
clientist,  government  grows  until  it  becomes  inef¬ 
ficient  and  produces  more  goods  and  services 
than  people  actually  want. 


A  third  typology,  which  seems  to  explain  more 
about  attitudes  and  participation  patterns,  was 
developed  by  Anne  Schneider  and  Helen  Ingram. 
Their  theory  recognizes  that  public  policy  involves 
far  more  than  the  allocation  of  benefits  and  bur¬ 
dens,  and  that  all  aspects  of  policy  design  may  be 
important.  Policy  design  refers  to  the  empirical 
characteristics  of  a  statute,  agency  guidelines,  or 
practices  of  street-level  officials.  They  contend  that 
policies  contain  common  observable  elements  that 
influence  the  types  of  impacts  of  the  policy.  Policies 
typically  have  target  populations — the  people 
affected  positively  or  negatively  by  the  policy,  goals 
or  problems  to  be  solved,  benefits  or  costs  or  both 
that  are  being  allocated,  and  rules  that  guide  or 
constrain  action  or  specify  who  is  to  do  what  to 
whom,  when,  and  with  what  resources.  Policies  also 
contain  tools  that  motivate  agents  or  members  of 
the  general  public  to  take  the  action  recommended 
by  the  policy  and  rationales  embedded  in  the  policy 
text  or  used  to  legitimate  the  policy.  Policies  typi¬ 
cally  have  an  implementation  “chain”  in  which  one 
level  of  policy  directs  the  actions  of  another  at  a 
“lower”  level.  Hence,  policies  contain  people, 
whose  behavior  impinges  on  policy  consequences. 
Policies  also  contain  underlying  assumptions  about 
how  people  behave  and  what  motivates  them, 
about  cause  and  effect,  about  scientific  knowledge, 
and  about  the  values  that  should  be  pursued. 

Policies  are  not  simply  instrumental  means, 
however,  and  contain  symbolic  and  interpretive 
elements  that  are  important  in  understanding  their 
impacts.  For  example,  policies  that  impose  bur¬ 
dens  on  people  may  simultaneously  show  their 
respect  for  these  people.  Or  policies  that  impose 
burdens  may  denigrate  the  target  groups  and 
blame  them  for  their  own  plight.  The  consequences 
of  these  two  types  of  policy  are  quite  different. 

Schneider  and  Ingram’s  theory  contends  that 
policy  designs  are  sometimes  produced  in  institu¬ 
tions  dominated  by  divisive  social  constructions 
that  stigmatize  some  and  extol  the  virtues  of  oth¬ 
ers.  These  social  constructions  interact  with  vastly 
unequal  political  power  of  target  populations  pro¬ 
ducing  policy  designs  that,  in  turn,  produce  dis¬ 
tinctive  political  attitudes  and  participation. 
Schneider  and  Ingram  posit  four  types  of  target 
populations.  Advantaged  populations  have  consid¬ 
erable  political  power  and  positive  social  construc¬ 
tions,  such  as  being  “deserving,”  “intelligent,” 
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hardworking,”  and  so  on.  Advantaged  groups 
mainly  receive  benefits  from  government,  but  even 
if  regulations  are  imposed,  they  tend  to  be  those 
supported  by  the  advantaged  populations  them¬ 
selves  as  necessary  to  sustain  their  social  position. 
Politics  is  a  “game,”  but  it  is  winnable  if  played  by 
the  rules.  Advantaged  groups  have  high  levels  of 
traditional  political  participation. 

Contenders  also  have  a  great  deal  of  political 
power  but  carry  negative  social  constructions. 
Contenders  know  that  they  are  controversial  and 
that  their  interests  clash  with  others.  The  types  of 
policy  they  receive  tend  to  provide  significant  ben¬ 
efits  that  are  however  well  disguised  and  protected 
from  public  view.  The  consequences  of  such 
designs  are  that  contenders  become  suspicious  and 
vigilant.  They  believe  that  the  political  game 
involves  the  raw  use  of  power  and  is  often  crooked. 
Their  participation  patterns  tend  toward  low 
mobilization  but  high  levels  of  financial  assistance 
to  candidates  and  office  holders,  along  with 
intense  lobbying. 

A  third  type  of  target  group  Schneider  and 
Ingram  called  disadvantaged  in  that  they  have 
almost  no  political  power  and  are  socially  con¬ 
structed  as  helpless,  needy,  and  unable  to  function 
effectively  in  a  highly  competitive  society.  Public 
officials  need  to  be  careful  with  disadvantaged 
groups,  because  there  would  be  political  damage  if 
an  official  appears  to  be  mean  spirited;  on  the 
other  hand,  there  is  nothing  much  to  be  gained  by 
providing  benefits  to  these  people.  Hence,  policy 
tends  toward  paternalistic  assistance  and  consider¬ 
able  rhetoric  about  the  needs  of  these  people  but 
inadequate  resources  to  meet  their  needs.  The 
results  of  such  policy  designs  is  that  disadvantaged 
people  come  to  believe  that  they  are  helpless  and 
needy;  that  government  “wants”  to  assist  but  does 
not  have  the  resources  to  do  so;  and  that  their 
problems  are  best  resolved  through  religious  orga¬ 
nizations,  nonprofits,  and  local  government. 
Traditional  participation  tends  to  be  very  low — 
low  turnout,  lack  of  contacting  officials,  and  not 
much  participation  in  interest  groups. 

The  fourth  type  of  target  groups  are  deviants — 
people  with  almost  no  political  power  who  are 
viewed  negatively  as  dangerous,  violent,  a  threat  to 
“our  way  of  life.”  Policy  directed  at  these  groups 
tends  to  be  overly  punitive,  as  policymakers  realize 
that  considerable  political  capital  can  be  gained  by 


punishing  these  groups  and  holding  them  respon¬ 
sible  for  most  of  society’s  problems,  and  the  groups 
themselves  have  so  little  credibility  that  they  are 
unable  to  respond.  Traditional  participation  among 
deviants  is  very  low  but  other  forms — strikes,  dem¬ 
onstrations,  riots,  terrorist  attacks — will  occur. 

This  typology  also  is  useful  in  understanding 
policy  consequences  for  social  justice.  The  dynam¬ 
ics  of  degenerative  democracy  produces  too  many 
benefits  for  advantaged  groups.  The  groups  them¬ 
selves  support  public  officials  who  provide  them 
with  desired  policy,  and  others  will  not  complain 
because  the  positive  social  construction  of  advan¬ 
taged  groups  means  that  they  are  viewed  as 
“deserving.”  Often,  their  interests  are  character¬ 
ized  as  being  necessary  for  the  public  good.  And 
punishment  for  deviants  will  also  become  exces¬ 
sive  as  political  leaders  seek  to  outdo  one  another 
in  being  “tough.”  In  degenerative  politics,  where 
divisive  social  constructions  interact  with  power  to 
produce  distinctive  types  of  designs,  disadvantaged 
people  continue  to  suffer  from  limited  access  to 
opportunities — poor  schools  and  unemployment 
producing  a  continuing  underclass. 

One  primary  contribution  of  political  science  to 
the  study  of  policy  impacts  has  been  to  broaden  the 
discussion  beyond  the  narrow  emphasis  on  whether 
policy  has  been  successful  in  solving  problems  or 
achieving  its  instrumental  goals.  Deborah  Stone, 
for  example,  has  been  especially  influential  in 
addressing  the  many  complexities  of  policy  conse¬ 
quences.  Policies  typically  have  multiple  goals, 
some  of  which  may  be  in  conflict  with  others. 
Additionally,  a  policy  success  in  the  eyes  of  one 
person  may  well  be  a  policy  failure  in  the  eyes  of 
another. 

The  question  remains,  however:  Are  there  any 
systematic  characteristics  of  policy  designs  that 
increase  the  likelihood  of  policy  producing  desired 
consequences?  Implementation  scholars  disagree 
over  this  question,  as  some  such  as  Paul  Sabatier 
argue  that  strong  statutes  are  more  likely  to  pro¬ 
duce  desired  outcomes.  Strong  statutes  contain 
clear  statements  of  objectives,  strong  tools  such  as 
mandates,  and  little  or  no  discretion  for  lower  level 
agents.  Others  contend  that  policy  designs  must  be 
fit  into  context  and  that  designs  that  permit  greater 
discretion  at  lower  levels  enable  local  officials  to 
use  local  knowledge  and  are  more  likely  to  produce 
desired  results.  Still  others  focus  on  an  analysis  of 
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the  context  itself  and  argue  that  the  characteristics 
of  policy  at  a  higher  level  need  to  differ,  depending 
on  context.  For  example,  if  cause  and  effect  theory 
is  uncertain  or  weak,  or  if  the  context  varies  exten¬ 
sively  across  locales,  then  policy  should  encompass 
a  learning  mode  sometimes  called  adaptive  man¬ 
agement  so  that  local-level  agencies  can  adjust  the 
policy  as  they  learn  more  about  its  consequences. 

Some  political  scientists  question  what  constitutes 
“good”  public  policy.  Many  focus  on  economic  cri¬ 
teria  such  as  efficiency,  although  complexities  in 
measuring  societal  benefits  and  costs  typically  pre¬ 
vent  most  policies  being  assessed  in  efficiency  terms. 
Some  propose  that  effectiveness  is  a  guide — in  the 
sense  that  it  is  effective  in  addressing  problems.  As 
mentioned  above,  however,  there  may  be  many 
different  indicators  of  effectiveness  and  little 
agreement  on  which  are  more  important.  Others 
have  added  that  justice  (fairness)  must  be  consid¬ 
ered  when  assessing  the  desired  consequences  of 
policy  and  that  support  for  citizenship  and  institu¬ 
tions  also  must  be  ascertained.  Unintended  conse¬ 
quences  must  be  considered  and  weighed  against 
the  positive  effects. 

To  summarize,  political  science  has  made  major 
contributions  to  the  study  of  policy  impacts,  par¬ 
ticularly  in  proposing  a  more  complex  way  of 
conceptualizing  what  is  meant  by  public  policy 
and  by  broadening  the  discussion  beyond  the  nar¬ 
row  view  that  policy  serves  only  to  solve  a  specific 
problem.  Policy  has  many  consequences  for  democ¬ 
racy,  and  theories  of  policy  impact  need  to  embrace 
them. 

Anne  Schneider 
Arizona  State  University 
Tempe,  Arizona,  United  States 
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Imperialism 


The  term  imperialism  contains  the  prefix  imperial, 
at  once  an  adjective  and  a  noun.  As  an  adjective, 
imperial  means  pertaining  to  empire  or  the  ruler, 
an  emperor  or  empress,  and  this  is  used  to  refer  to 
characteristics  that  befit  a  ruler,  such  as  majesty, 
regality,  or  grandeur.  The  term  imperial  also 
conveys  the  character  of  rule  by  a  sovereign  state 
over  its  dependencies  or  a  commanding  quality  or 
manner  of  that  rule.  For  these  reasons,  the  term 
imperial  has  been  associated  with  despotic,  high¬ 
handed,  and/or  authoritarian  rule.  As  a  noun  and 
in  the  context  of  political  authority,  an  imperial  is 
a  member  of  imperial  party,  including  troops. 
Imperialism  in  the  senses  above  implies  the  pursuit 
of  grandeur  or  the  desire,  policy,  practice,  or 
advocacy  of  extending  the  power  and  dominion  of 
one  political  entity  over  another.  From  the  above 
perspectives,  imperialism  has  a  long  ancestry  in 
multiple  traditions  and  thus  may  be  found  histori¬ 
cally  to  exist  on  many  continents.  One  could  rea¬ 
sonably  speak  of  imperialism  in  the  contexts  of 
historic  China  and  France,  the  Moguls  in  India, 
the  Roman  and  Holy  Roman  empires,  the 
Ottoman,  Mali,  and  Mayan  empires,  and  the  like. 
In  most  of  these  contexts,  an  imperialist  would 
mean  an  adherent  of  an  emperor  or  empress  or  an 
advocate  of  such  an  adherence.  The  associated 
irredentist  processes  and  forms  constituted  impe¬ 
rialism  in  its  present  understanding.  For  instance, 
these  empires  necessarily  originated  from  a  local¬ 
ity  before  outward  expansions  that  required  the 
dismantling  of  competing  governments  and  the 
subordination  of  the  resident  authorities.  Still, 
their  ideologies,  policies,  systems  of  law,  and 
organizing  principles  and  norms  emerged  from 
contexts  that  are  different  from  those  of  modern 
imperialisms. 

The  shift  toward  modern  imperialism  began 
under  Pax  Britannica,  the  period  when  Great 
Britain  set  the  terms  of  the  European  order  (and  by 
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extension  world  order)  owing  to  its  defeat  of 
France  at  Waterloo  in  1815,  its  unchallenged  sea 
power,  and  control  of  the  key  naval  trade  routes. 
Pax  Britannica  coincided  with  scientific  and  indus¬ 
trial  revolutions,  the  rise  of  finance  capital  and 
consumer  society,  transformations  in  the  material 
conditions  of  states,  and  advances  in  warfare  and 
bureaucracy  that  set  the  context  for  unprecedented 
overseas  British  expansionism  in  Africa,  Asia,  and 
Latin  America.  Subsequently,  Britain  came  to 
dominate  overseas  markets  and  managed  to  proj¬ 
ect  its  “influence”  over  most  regions  of  the  world. 
The  terms  and  conditions  of  British  expansionism 
gave  modern  imperialism  its  distinctive  character¬ 
istics.  Above  all,  the  new  British  expansionism 
involved  a  worldwide  colonial  system.  From 
Waterloo  in  1815  to  the  1870  Franco-Prussian 
War  to  the  beginning  of  World  War  I  in  1914, 
Great  Britain  helped  set  in  motion  processes,  struc¬ 
tures,  and  relationships  under  a  colonial  order 
consisting  of  zones  of  influence,  colonies,  protec¬ 
torates,  trusteeships,  and  the  like.  Moreover, 
Britain  associated  its  own  dominions  (Australia, 
Canada,  New  Zealand,  and  South  Africa),  other 
European  powers,  most  notably  France,  and  Japan 
and  the  United  States  in  the  management  of  this 
order  under  distinct  colonial  systems.  For  the 
above  reasons,  the  expansionist  activities  of  the  era 
extending  from  the  19th  century  to  the  early  20th 
century  were  dubbed  the  New  Imperialism. 

There  is  a  long-standing  debate  in  Britain  and 
the  West,  particularly  among  politicians  and  aca¬ 
demics,  about  the  motivation,  wisdom,  and  utility 
of  imperialism  as  well  as  the  usefulness  of  the  term 
itself.  Most  authors  would  trace  the  debate  back  to 
John  Hobson’s  1902  critique  of  imperialism  and 
questions  raised  about  the  factors  contributing  to 
the  New  Imperialism.  Accordingly,  the  roots  of  the 
New  Imperialism  are  found  in  policies  linked  to  the 
resolutions  of  the  following  problems:  the  maldis¬ 
tribution  of  wealth  and  underconsumption,  the 
requirements  of  finance  capitalism;  commercial 
competition  among  rival  industrialized  countries; 
the  erosion  of  British  hegemony  and  the  subsequent 
breakdown  of  the  concert  of  Europe;  challenges  to 
Britain’s  dominance  in  world  trade;  quest  for  a  sup¬ 
ply  of  cheap  raw  materials  to  support  home  indus¬ 
tries  as  a  means  to  recovery  from  the  Long 
Depression  of  1873  to  1896  and  beyond;  the  aban¬ 
donment  of  free  trade  by  rival  European  powers; 


and  the  need  to  overcome  the  constraints  of  domes¬ 
tic  markets  through  export  opportunities.  Further¬ 
more,  it  has  been  argued  that  imperialism  was  an 
instrument  of  national  cohesion  in  the  face  of 
domestic  restlessness,  working-class  militancy,  and 
the  emergence  of  radical  left-leaning  parties  after 
the  Depression.  For  others  still,  the  New  Imperialism 
was  a  matter  of  national  pride  (France  after  defeat 
in  the  Franco-Prussian  war):  a  sign  of  national 
grandeur  and  manifest  destiny  (Germany  on  indus¬ 
trialization  and  the  United  States  under  the  Monroe 
Doctrine)  or  a  marker  of  racial  equality  and  non¬ 
discrimination  (Japan  after  Meiji). 

The  New  Imperialism  soon  turned  into  formal 
colonial  regimes  under  which  imperial  powers 
took  administrative  control  of  “foreign”  lands  and 
turned  them  into  colonies  and  dependencies. 
Initially,  imperial  powers  established  control 
through  chartered  companies  such  as  the  British 
East  India  Company.  These  corporations  were 
afforded  semisovereign  status  under  letter  patents 
that  afforded  them  rights  and  titles  allowing  them 
to  acquire  territories  outside  of  Europe  and  to 
exercise  within  them  the  total  monopoly  of  power. 
The  transition  to  formal  imperialism  in  India  and 
elsewhere  was  thus  effected  when  companies  trans¬ 
ferred  their  prior  administrative  functions  to 
European  governments.  The  new  administrative 
arrangements  also  associated  trained  natives  as 
civil  service  for  colonial  rule.  Thereafter,  Britain, 
France,  Portugal,  Spain,  the  United  States,  and 
others  proclaimed  sovereignty  or  overlordship  of 
foreign  populations  under  diminished  legal  and 
political  capacities.  Formal  imperialism  in  Africa 
began  dramatically  with  the  Berlin  Conference 
(1884-1885),  which  moderated  competition 
between  colonial  powers  by  defining  “effective 
occupation”  as  the  criterion  for  international 
recognition  of  territorial  claims.  The  Sykes-Picot 
(-Sazonov)  Agreement  of  1916  was  the  moral 
equivalent  of  the  Berlin  Agreement  for  the  Ottoman 
provinces  of  the  Arab  World.  A  secret  agreement, 
it  defined  the  respective  spheres  of  influence  of  the 
United  Kingdom  and  France  in  West  Asia  with 
the  assent  of  Imperial  Russia  and  in  anticipation  of 
the  downfall  of  the  Ottoman  Empire.  The  post- 
World  War  I  mandate  system  revived  the  mythical 
“White  man’s  burden”  under  a  so-called  sacred 
trust  as  the  means  to  incorporate  former  German 
colonies  into  existing  imperial  orders. 
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Among  observers  and  scholars,  these  activities 
led  to  multiple  explanations  about  the  advent  of 
imperialism.  Conservatives  favored  political  and 
psychological  factors  and  therefore  linked  imperi¬ 
alism  to  the  need  to  preserve  the  social  order. 
Liberals  who  had  been  reluctant  backers  of  impe¬ 
rialism  nonetheless  branded  its  economic  benefits 
despite  contravening  free  trade.  Marxists,  how¬ 
ever,  were  struck  by  the  rivalry  and  mutual  suspi¬ 
cion  that  characterized  international  diplomacy 
throughout  the  era.  Imperialist  ambitions  and 
rivalries  in  East  Asia  inevitably  came  to  focus  on 
the  vast  empire  of  China  whose  sovereignty  barely 
survived  the  unequal  treaties  imposed  on  it  by 
European  powers.  In  any  case,  some  Marxists 
attributed  World  War  I  to  imperial  ambitions  and 
rivalries.  Among  them,  Vladimir  Lenin  (1916) 
advanced  the  thesis  that  imperialism  was  the  high¬ 
est  stage  of  capitalism. 

Since  Lenin,  the  debate  on  imperialism  crystal¬ 
lized  around  two  central  questions.  First  is  the 
extent  to  which  imperialism  was  necessary  and 
inevitable  given  the  nature  of  the  international  order 
and  the  uneven  capacities  of  states  and  economic 
agents.  The  second,  favored  by  some  Marxists, 
gravitates  around  the  preposition  of  a  necessary 
relationship  between  capitalist  developments  and 
imperialism.  The  relevant  theses  have  been  taken  up 
in  different  contexts  by  world  system  theorists  who 
view  imperialism  as  part  of  a  general,  gradual 
extension  of  capital  investment  from  the  “core”  of 
the  industrial  countries  to  a  less  developed  “periph¬ 
ery.”  With  the  globalization  of  capital  and  the  ubiq¬ 
uitous  importance  of  civil  society,  other  Marxists 
puzzled  about  the  persistence  of  critical  dimensions 
of  imperialism  under  Pax  Americana  (the  relative 
peace  in  the  Western  Hemisphere  as  a  result  of  U.S. 
leadership  during  the  second  half  of  the  20th  cen¬ 
tury).  They  turned  to  Antonio  Gramsci’s  theses  on 
the  advent  of  hegemony,  particularly  on  the  role  of 
“spontaneous  consent”  in  perpetuating  what  might 
reasonably  be  called  imperialism.  They  also  drew 
attention  to  the  ideological  function  of  education  in 
order  to  gain  insight  into  the  expansion  of  capital¬ 
ism  and  the  associated  market  institutions  and 
modes  of  government. 

On  decolonization  and  the  formal  transfer  of 
power  to  the  newly  created  independent  states, 
increasing  numbers  of  scholars  and  politicians  have 
dismissed  the  utility  of  the  notion  of  imperialism  in 


today’s  world.  The  dismissal  is  founded  on  two 
orders  of  deficiencies.  The  first,  found  in  conserva¬ 
tive  and  liberal  theories,  is  the  focus  placed  on  the 
outward  bureaucratic  and  organizational  dimen¬ 
sions  of  imperialism  at  the  time  of  creation  and 
maintenance  of  domination  and  subordination. 
The  second,  found  in  Marxism  and  other  radical 
theories,  is  the  focus  on  dated  historical  events,  for 
instance,  the  modes  of  expansions  that  established 
unequal  economic,  cultural,  and  territorial  rela¬ 
tionships  between  imperial  states  and  their  formal 
dominions.  Indeed,  it  is  hard  today  to  entertain  the 
possibility  of  direct  or  outright  imperialism  in 
which  any  power  could  claim  to  be  justified  by  law 
or  providence  to  take  over  the  government  of  a  ter¬ 
ritory  other  than  its  own.  Nor  could  any  power 
insist  on  a  right  to  unequal  relations  with  other 
political  entities. 

Indeed,  the  focus  on  state  identity  and  ideology 
or  culture  and  discourses  may  be  misplaced  today. 
To  be  sure,  the  novelty  of  the  New  Imperialism  and 
the  formal  colonial  empires  that  followed  it  had  to 
do  with  the  scale  of  the  colonial  order.  The  con¬ 
struction  of  empire  required  a  domestic  ideology 
associated  with  state  identity  and  rationalizing  nar¬ 
ratives  (of  destiny  and  providence).  The  imperial 
project  also  required  self-justification — for  instance, 
to  right  the  wrongs  done  by  others  or  to  improve 
the  lot  through  trusteeships  or  mandates.  In  these 
regards,  imperialism  had  the  backing  of  elements  of 
civil  society.  These  elements  successfully  articulated 
their  own  private  desires,  passions,  and  motives  of 
acquisition,  enrichment,  and  self-aggrandizement 
as  the  national  interest  and  then  caused  legislators 
and  executives  to  spring  into  colonial  adventures  as 
means  of  defending  the  latter.  These  private  inter¬ 
ests  were  comforted  in  their  positions  by  ideo¬ 
logues  such  as  consumer  advocates  seeking  to 
advance  the  good  life,  liberal  progressives  (who 
once  engineered  mandate  systems  under  the  sacred 
trust),  and  so-called  cosmopolitans  and  interna¬ 
tionalists.  Science  helped  define  conceptions  of  the 
moral  qualities  of  subjects,  the  status  of  individual 
societies  within  the  moral  order,  and  the  entitle¬ 
ments  or  deserts  due  to  them.  International  law, 
legal  doctrines,  and  jurisprudence  completed  the 
discourses  of  the  emergent  international  order  by 
providing  the  norms  and  instruments  of  empire. 
Each  discourse  played  a  part  in  defining  the  moral¬ 
ity  and  ethics  of  war  and  peace. 
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Thus  conceived,  imperialism  is  not  an  arbitrary 
act.  Historically,  it  remains  a  reflex  or  realism  that 
has  been  guided,  according  to  material  conditions, 
by  the  available  logics  of  power,  the  permissible 
modes  of  interventions,  the  applicable  political 
and  ethical  traditions,  and  a  recurrent  theology 
and  ideology  of  a  preordained  but  singular  hierar¬ 
chical  order  lodged  in  the  West.  From  this  sover¬ 
eignty  flow  inherent  rights  and  privileges,  a  notion 
of  predestined  leadership,  an  assumed  legitimacy 
of  interests,  an  a  priori  and  necessary  absolution  of 
sins  against  others,  and  a  singular  expectation  of 
access  to  the  necessary  resources  of  its  chosen  life¬ 
style,  presently  the  good  life.  These  imperial 
claims,  which  were  once  contested  within  colonial 
empires,  are  yet  to  disappear  from  international 
relations  today.  Indeed,  real  and  aspiring  imperial 
states  continue  to  demand  deference  from  others 
even  in  the  postcolonial  world.  This  is  not  done 
without  contestations,  but  to  underscore  the  point, 
we  still  witness  occasional  unilateral  political  and 
military  interventions  to  remind  us  of  the  lingering 
presence  of  the  imperium. 

Siba  N.  Grovogui 
Johns  Hopkins  University 
Baltimore,  Maryland,  United  States 

See  also  Power;  Power  and  International  Politics; 
Superpower 
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Implementation 


Implementation  refers  to  the  carrying  out  of  pol¬ 
icy  or  authoritative  actions  in  support  of  policy 


goals.  Implementation  is  closely  affiliated  with  the 
classic  political  function  of  executing  laws,  but 
implementation  research  also  focuses  on  the  execu¬ 
tion  of  authoritative  policies  other  than  laws,  such 
as  administrative  rules  issued  by  government  and 
policies  made  by  local  governments.  Several  stages, 
factors,  and  actors  are  involved  in  the  implementa¬ 
tion  process  and  affect  its  results.  Implementation 
is  a  continuation  of  the  policy  formation  process, 
but  it  also  involves  new  actors.  The  implementation 
process  and  its  results  depend  on  whether  the  pol¬ 
icy  design  is  valid  and  on  what  policy  instruments 
have  been  selected.  Also,  the  number  and  interests 
of  the  actors  that  are  involved  in  carrying  out  the 
policy  and  the  relationships  between  them  affect 
the  implementation  process  and  results.  Managers 
have  an  important  but  difficult  task  in  transform¬ 
ing  and  communicating  the  policy  down  to  the 
frontline  workers.  Because  of  the  discretion  that 
these  workers  typically  have,  they  play  a  key  role  in 
delivering  the  policy  to  citizens  or  firms  and  enforc¬ 
ing  the  rules  associated  with  it. 

While  most  implementation  studies  analyze  the 
implementation  of  one  particular  policy  or  a  set  of 
related  policies,  some  bottom-up  implementation 
scholars  are  more  interested  in  local  problem  solv¬ 
ing.  These  problems  are  typically  defined  by  the 
researchers  themselves,  for  example,  Christopher 
Hull  and  Benny  Hjern.  They  would  often  focus  on 
a  set  of  policies  that  have  an  impact  on  a  given 
policy  problem,  for  example,  chronic  youth  unem¬ 
ployment  or  helping  small  firms  grow.  The  rele¬ 
vance  of  this  kind  of  research  depends  on  whether 
the  reader  accepts  the  definition  of  the  problem. 
This  entry  focuses  on  official  public  policy  goals 
and  mandates  as  a  standard  for  implementation  is 
warranted  from  a  democratic  effectiveness  per¬ 
spective.  Democracy  concerns  not  only  how  poli¬ 
cies  are  made  but  also  whether  or  not  they  are 
executed.  Policy  goals  formulated  in  legislatures, 
governments,  and  local  governments  have  a 
particular  legitimate  status  and  are  relevant  for 
holding  government  accountable. 

The  start  of  implementation  research  was  stim¬ 
ulated  by  evaluation  research.  President  Lyndon 
Johnson’s  Great  Society  social  policy  reforms  in 
United  States  in  the  1960s  and  1970s  fostered  a  lot 
of  evaluation  research.  The  effects  of  the  new  wel¬ 
fare  state  programs  were  assessed  and  improve¬ 
ments  in  social  engineering  suggested.  As  most 
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evaluation  studies  actually  showed  little  or  no 
effect  of  the  social  program  interventions,  the  first 
interpretation  suggested  that  the  program  was  a 
failure  and  based  on  wrong  program  theory. 
However,  gradually,  another  interpretation 
emerged  that  the  causal  theory  behind  the  planned 
policy  intervention  might  after  all  be  valid,  but  the 
intervention  had  not  taken  place  as  intended.  This 
led  to  an  interest  in  studying  the  relation  between 
planned  and  actual  interventions  and  the  adminis¬ 
trative  process  in  between  policy  making  and 
social  interventions. 

Most  people  consider  Jeffrey  Pressman  and 
Aaron  Wildavsky’s  1973  book  Implementation  to 
be  the  first  major  piece  of  implementation  research. 
It  was  a  case  study  of  a  federal  economic  develop¬ 
ment  program  in  Oakland,  California,  that  had 
been  created  to  enhance  minority  employment.  It 
failed  to  do  so,  however,  due  to  the  complexity  of 
many  actors  having  to  work  together  and  because 
it  was  based  on  an  invalid  causal  theory  of  how  to 
stimulate  minority  employment.  The  book  cer¬ 
tainly  opened  the  field  of  implementation  research, 
yet  some  pieces  of  earlier  research  by,  for  example, 
Herbert  Kaufman  and  Jerome  Murphy  had  actu¬ 
ally  focused  on  implementation  problems. 

The  organization  of  this  entry  is  based  not  on  a 
chronological  review  of  the  implementation  litera¬ 
ture  but  on  the  Integrated  Implementation  Model 
(Figure  1).  The  model  was  developed  by  Soren 
Winter  to  integrate  some  of  the  theoretically  most 
promising  implementation  factors  identified  by 
various  implementation  scholars  into  a  common 
analytical  framework. 

For  dependent  variables,  the  model  focuses  on 
both  the  actions  of  implementers  in  delivering  poli¬ 
cies  and  the  societal  outcomes  in  relation  to  the 
official  policy  objectives.  Implementation  research 
was  long  dominated  by  stories  of  implementation 
failures.  The  implementation  of  policies  typically 
failed  to  achieve  the  planned  outcomes,  and  imple¬ 
menters  failed  to  deliver  the  mandated  services  or  in 
enforcing  regulations.  Or  the  process  suffered  from 
serious  delays.  Implementation  research  focused 
mainly  on  barriers  to  successful  implementation. 
However,  gradually,  more  optimistic  studies  were 
made.  One  of  the  important  insights  was  that  time 
horizons  matter.  Often,  policy  programs  suffer 
from  start-up  problems  that  are  gradually  amelio¬ 
rated  over  the  years  on  the  basis  of  experience. 


Sometimes,  it  may  take  a  decade  or  more  to  carry 
out  a  successful  implementation.  Implementation 
results  also  tend  to  vary  among  localities  at  any 
given  point  in  time. 

The  first  set  of  factors,  which  affects  implemen¬ 
tation  results,  is  the  policy  design.  Even  perfect 
implementation  processes  and  outputs  are  not 
likely  to  be  successful  if  the  expected  causal  rela¬ 
tionship  between  policy  objectives  and  means  is 
invalid.  Second,  implementation  is  affected  by  the 
commitment  of  authorities  and  organizations  that 
are  involved  in  implementing  policies  and  by  the 
interorganizational  relations  between  them.  Third, 
managers’  transmission  of  organizational  policies 
to  the  staff  is  crucial.  While  management  matters, 
it  is  no  simple  matter!  Fourth,  this  complexity  is 
due  to  the  discretionary  behaviors  of  street-level 
bureaucrats  in  their  interactions  with  target  groups. 
These  bureaucrats  typically  enjoy  some  autonomy 
vis-a-vis  the  legislation  and  their  managers.  Fifth, 
target  groups  not  only  are  passive  recipients  of 
public  policies  but  are  often  active  in  affecting 
implementation  outputs  as  well  as  outcomes. 
Sixth,  implementation  results  are  conditioned  by 
socioeconomic  factors.  What  makes  organizations, 
street-level  bureaucrats,  and  target  groups  impor¬ 
tant  political  actors  in  the  implementation  process 
is  that  their  contributions  to  implementation  are 
affected  by  their  action  models,  which  for  all  three 
sets  of  actors  analytically  are  made  up  of  actor 
interests  and  the  resources  the  actors  master.  After 
presenting  the  theoretical  implementation  frame¬ 
work  in  the  following,  some  major  implementa¬ 
tion  research  strategies  and  methodologies  will  be 
discussed  before  the  entry  is  concluded. 

Policy  Design 

Too  many  implementation  researchers  have  erro¬ 
neously  put  the  whole  blame  for  any  lack  of  goal 
achievement  on  implementation.  This  is  in  sharp 
contrast  to  the  early  evaluation  scholars  who  had 
blamed  the  policy  design  for  any  lack  of  effect.  As 
noted  by  Peter  May,  well-designed  policies  are  nec¬ 
essary  but  not  sufficient  for  improving  implementa¬ 
tion  prospects.  A  policy  design  typically  contains  a 
set  of  goals,  a  mix  of  instruments  for  obtaining 
these  goals,  a  designation  of  governmental  or  non¬ 
governmental  entities  charged  with  carrying  out 
the  goals,  and  an  allocation  of  resources  for  the 
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Figure  I  The  Integrated  Implementation  Model 

Source:  Winter,  S.  C.,  &  Nielsen,  V.  L.  (2008).  Implementering  af  politik  [Policy  implementation!.  Copenhagen,  Denmark: 
Academica. 


requisite  tasks.  Policy  design,  and  particular  policy 
instruments,  has  received  substantial  research  inter¬ 
est  since  the  1980s,  for  example,  by  Stephen  Linder 
and  Guy  Peters,  Lester  Salamon,  and  May. 

According  to  this  literature,  any  policy  can  be 
disaggregated  into  one  or  a  mix  of  few  generic 
policy  instruments.  Unfortunately,  the  common 
research  interest  has  not  led  to  agreement  on  any 
typology  of  instruments.  One  simple  classification 
focuses  on  rules  (often  combined  with  sanctions 
for  noncompliance),  economic  incentives,  and 
information,  all  of  which  aim  at  affecting  the 
behavior  of  either  target  groups  or  intermediaries 
(implementers). 

Policy  designs  may  affect  the  implementation 
process  and  results  in  various  ways.  The  pioneers 
of  implementation  research  saw  goal  clarity  as  the 


key  to  implementation  success.  However,  according 
to  later  research,  conflicting,  unclear,  and  ambigu¬ 
ous  goals  are  the  rule  rather  than  the  exception  in 
public  policies,  and  decent  implementation  results 
can  be  achieved  even  in  theses  circumstances. 
Different  mixes  of  instruments  are  not  equally  effec¬ 
tive  in  obtaining  a  given  policy  objective.  In  sum¬ 
marizing  the  scant  empirical  evidence  in  the  field, 
May  suggests  that  policy  designs  are  important  in 
affecting  the  incentives  of  intermediaries  to  carry 
out  their  requisite  tasks,  particularly  through  affect¬ 
ing  their  commitment  and  capacity  and  by  signaling 
desired  actions. 

The  selected  instruments  may  also  affect  the 
overall  implementation  structure  and  process 
because  certain  instruments  tend  to  favor  the  for¬ 
mation  of  particular  implementation  structures. 
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Mandates  aimed  at  regulating  the  behavior  of  tar¬ 
get  groups  normally  require  a  staff  for  inspecting 
and  enforcing  the  mandate  and  a  set  of  sanctions. 
Information  strategies  and  use  of  economic  incen¬ 
tives  such  as  environmental  taxes  can  sometimes 
be  implemented  with  less  staff,  although  there  is 
no  one-to-one  relationship  between  instruments 
and  staff  requirements.  Some  taxes  are  relatively 
automatic  and  easy  to  collect,  such  as  an  environ¬ 
mental  tax  per  unit  of  gasoline  sold,  while  others 
require  a  substantial  staff  for  inspection  and 
enforcing,  for  example,  taxing  diffuse  pollution. 

However,  while,  as  already  indicated  by 
Pressman  and  Wildavsky,  the  validity  of  the  causal 
theory  linking  instruments  to  objectives  is  crucial, 
the  research  documentation  of  instrument  effects  is 
still  meager.  One  reason  is  that  effects  of  instru¬ 
ments  on  implementation  often  depend  on  the 
context,  including  the  political  context. 
Consequently,  designing  good  policies  is  not  a 
simple,  technocratic  process  such  as  selecting  the 
best  types  of  materials  for  building  a  bridge. 

Policy  design  also  involves  organizational 
design,  that  is,  what  types  of  organizations  are  to 
be  involved  in  implementing  the  policy,  and  what 
types  of  incentives  can  be  used  to  increase  their 
commitment  to  the  policy  and  its  objectives? 
Organizational  implementation  structures  are 
often  quite  messy  and  uncoordinated.  Some 
top-down  implementation  scholars  like  Daniel 
Mazmanian  and  Paul  Sabatier  hold  quite  optimis¬ 
tic  views  on  the  ability  of  policy  proponents  to 
structure  the  implementation  process  to  provide 
more  commitment  and  loyal  implementation. 
Critics  of  their  viewpoint,  however,  argue  that 
policies  are  often  political  compromises  that  also 
include  policy  opponents  with  little  interest  in 
making  an  effective  implementation  structure. 

The  roots  of  problematic  policy  designs  and 
implementation  problems  can  often  be  found  in 
the  prior  policy  formulation  process.  For  instance, 
conflicts  in  this  process  often  create  a  policy  with 
conflicting  and  ambiguous  goals  and  an  invalid 
causal  theory  embedded  in  a  policy  design  that 
does  not  adequately  connect  the  goals  to  the 
means.  Furthermore,  symbolic  policies  are  some¬ 
times  adopted  to  address  a  policy  problem  with¬ 
out  actually  offering  the  means  that  could  achieve 
the  stated  objectives.  And  as  first  claimed  by 
Eugene  Bardach,  conflicts  in  policy  formation 


often  continue  in  the  subsequent  implementation 
process.  It  is  important  to  understand  that  ineffec¬ 
tive  policy  designs  are  not  always  due  to  lack  of 
knowledge  on  the  part  of  the  policy  designers. 
According  to  Terry  Moe,  policy  design  of  instru¬ 
ments  and  organizational  structure  is  first  of  all  a 
political  process,  in  which  political  actors — 
including  both  policy  proponents  and  opponents — 
try  to  maximize  their  own  interests.  This  includes 
the  selection  of  an  organizational  structure  that 
will  maximize  their  long-term  control  of  the 
implementation  process. 

Unfortunately,  the  research  evidence  on  how 
different  organizational  arrangements  contribute 
to  implementation  success  or  failure  is  rather  mod¬ 
est.  For  example,  we  have  very  little  evidence  on 
the  effectiveness  or  efficiency  of  using  federal  or 
state  government  field  offices,  local  governments, 
nonprofit  organization,  or  private  firms  for  deliv¬ 
ering  public  services.  One  study  found  that  Danish 
central  government  field  offices  pay  more  atten¬ 
tion  to,  are  more  committed  to,  and  comply  more 
completely  with,  the  national  employment  policy 
rules  on  emphasizing  quick  job  placements  than  is 
the  case  for  local  government  agencies.  May  and 
collaborators  show  not  only  that  cooperative  regu¬ 
latory  strategies  adopted  by  central  or  state  gov¬ 
ernments  toward  local  governments  can  provide 
more  commitment  and  compliance  at  the  local 
level  by  preventing  flooding  but  also  that  these 
strategies  can  backfire  for  particular  “bad  apples.” 

According  to  many  studies,  outsourcing  rela¬ 
tively  technical  public  tasks  is  in  most  cases  more 
efficient  than  public  provision  of  services.  How¬ 
ever,  the  research  on  the  effects  of  outsourcing  of 
more  classic  social  services  to  for-profit  or  non¬ 
profit  providers  is  much  more  limited  and  has 
more  inconsistent  findings.  When  comparing  the 
effects  of  employment  service  provision  of  public, 
nonprofit,  and  for-profit  organizations  in  the 
United  States,  Carolyn  Heinrich  and  colleagues  did 
not  find  any  difference,  while  Danish  analyses  find 
that  for-profit  and  nonprofit  providers  deliver  both 
poorer  outputs  (e.g.,  in  terms  of  job  emphasis)  and 
poorer  employment  outcomes  with  much  higher 
unit  costs  than  public  providers.  However,  the 
overpricing  may  be  due  to  large  subsidies  of  out¬ 
sourcing  applied  by  the  Danish  government.  More 
research  is  needed  on  how  various  types  of  organi¬ 
zations  contribute  to  implementation  success. 
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Interorganizational  Relations 

Implementation  processes  are  characterized  by 
organizational  and  interorganizational  behaviors 
that  represent  different  degrees  of  commitment 
and  coordination.  In  their  seminal  implementation 
study,  Pressman  and  Wildavsky  (1973)  focused  on 
the  “complexity  of  joint  action,”  according  to 
which  successful  implementation  is  likely  to  be 
negatively  related  to  the  number  of  actors, 
the  diversity  of  their  interests  and  perspectives,  and 
the  number  of  decision  and  veto  points.  Given  the 
great  number  of  actors,  subsequent  decisions,  and 
veto  points  that  is  typical  in  policy  implementa¬ 
tion,  Pressman  and  Wildavsky  stated  in  their  book 
Implementation  that  the  fact  that  federal  policies 
work  at  all  is  amazing.  As  an  illustration,  they  sug¬ 
gested  a  series  of  subsequent  implementation  deci¬ 
sions  with  a  relative  high  probability  of  success  in 
each  of,  say,  90%.  Yet,  according  to  Pressman  and 
Wildavsky,  it  only  takes  seven  decisions  to  lower 
the  cumulated  chance  of  success  to  less  than  50%. 

One  of  their  key  insights  was  that  the  complex¬ 
ity  of  joint  actions  is  not  found  only  when  imple¬ 
menting  organizations  fundamentally  disagree 
with  the  policy  objectives  or  with  each  other;  it 
even  takes  place  when  implementing  organizations 
generally  agree  about  the  reform  goals  but  have 
slightly  different  perspectives  on  the  implementa¬ 
tion.  For  some  authorities  or  organizations  with 
implementation  tasks,  contributing  positively  to 
the  implementation  may  be  at  odds  with  other 
commitments,  or  they  may  simply  prefer  other 
tasks  that  are  more  crucial  to  the  core  tasks  of  the 
organization.  Or  it  may  just  be  that  organizations 
have  different  time  lines  for  contributing  to  the 
implementation  assembly  process.  However,  seri¬ 
ous  conflicts  of  interest  and  bureaucratic  politics 
tend  to  make  implementation  even  more  difficult. 

Thus,  how  organizations  contribute  to  the 
implementation  process  is  strongly  affected  by 
their  action  model,  which  consists  of  their  interests 
and  their  resources  for  pursuing  them.  Implementing 
organizations  may  have  interests  that  are  identical 
with  the  public  policy.  They  agree  both  with  its 
goals  and  means.  Their  interests  may  also  be  con¬ 
vergent.  Although  not  agreeing  with  the  goals, 
they  agree  with  the  means.  An  implementation 
success  is  more  likely  if  organizations  have  inter¬ 
ests  that  are  identical  or  at  least  converge  with  the 


policy.  However,  implementation  prospects  are 
based  not  only  on  interests  but  also  on  the 
resources  that  organizations  can  use  for  pursuing 
their  interests. 

Bardach  describes  the  implementation  process 
as  a  series  of  “implementation  games”  that  organi¬ 
zations  play  for  promoting  their  own  interests.  The 
complexity  of  joint  action  and  implementation 
games  may  lead  to  implementation  failures,  delays, 
or  biases  in  the  implementation  outputs  and 
outcomes  that  are  produced.  The  notion  of  “the 
complexity  of  joint  action”  seems  related  to  the 
proverb  of  “too  many  cooks  spoil  the  broth.” 
However,  according  to  Laurence  O’Toole,  this 
problem  applies  only  to  certain  kinds  of  inter¬ 
organizational  implementation  settings.  Decision 
points  are  not  independent  of  each  other.  An  early 
agreement  on  basic  understandings  can  enhance 
implementation  success  by  promoting  “bandwagon 
effects”  in  later  decisions,  and  decisions  can  be 
merged  by  crafting  “package  deals.” 

Implementation  prospects  also  depend  on  the 
type  of  resource  dependencies  among  participating 
organizations.  The  “complexity  of  joint  action”  is 
most  relevant  for  a  chain  of  sequential  relations. 
Here,  a  particular  organization,  A,  cannot  produce 
its  implementation  contribution  without  receiving 
outputs  from  another  organization,  B,  while  B 
does  not  need  any  outputs  from  A.  It  is  like 
Christmas  tree  lights  where  one  dead  bulb  can  put 
out  all  bulbs  if  they  are  connected  on  series  rather 
than  in  parallel.  The  parallel  connection  resembles 
pooled  relations  where  multiple  organizations  can 
produce  implementation  outputs  independently  of 
each  other.  This  can  produce  relatively  good 
implementation  results  although  the  coordination 
may  not  be  optimal.  Finally,  reciprocal  relations — 
where  two  or  more  organizations  are  dependent 
on  each  other — can  decrease  the  likelihood  of  veto 
points  because  both  organizations  have  incentives 
to  cooperate.  Reciprocal  relations  can  be  created 
based  on  exchange  of  services  or  money. 
Accordingly,  better  implementation  results  can 
sometimes  be  obtained  by  replacing  sequential 
relations  by  reciprocal  or  pooled  ones. 

Interorganizational  relations  and  the  role  of 
outsourcing  and  third  parties  in  implementation 
have  stimulated  a  substantial  research  interest 
under  the  labels  of  policy  networks,  governance 
networks,  or  implementation  regimes.  A  common 
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theme  is  the  dependency  of  public  authorities  on 
resources  from  the  private  sector — for  example, 
interest  groups,  associations,  and  firms — for  imple¬ 
menting  public  policies.  Perceived  benefits  for 
policymakers  and  implementers  may  be  expertise, 
information,  legitimization,  or  more  cost-effective 
ways  of  providing  services  or  enforcing  regula¬ 
tions.  In  some  countries — particularly  the  United 
States — private  for-profit  and  nonprofit  actors  are 
involved  so  much  in  policy  implementation  that 
Brinton  Mil  ward  talks  about  the  “hollowing  out 
of  the  state.”  Network  studies  focus  on  the  charac¬ 
ter  of  networks  and  interdependencies,  why  net¬ 
works  are  created,  if  and  how  they  are  sustained 
over  time,  and  the  democratic  and  participatory 
consequences  of  networks.  This  is  also  a  key  inter¬ 
est  in  corporatist  research  that  studies  the  formal 
integration  of  major  interest  organizations  into 
public  policy  making  and  implementation.  Given 
the  enormous  research  interest  in  networks,  sur¬ 
prisingly  few  studies  have  so  far  examined  the 
effects  of  different  types  of  networks  on  implemen¬ 
tation  outputs  or  outcomes. 

While  interorganizational  implementation 
research  focuses  on  commitment  and  collaboration 
at  the  organizational  level,  managers  have  impor¬ 
tant  roles  in  bridging  inter-  and  intra-organizational 
levels  when  supervising  street-level  bureaucrats. 
However,  much  of  the  normatively  based  manage¬ 
ment  literature  fails  to  acknowledge  the  difficulties 
involved  in  this  managerial  task.  The  role  of  man¬ 
agers  is  considered  below  after  discussion  of  the 
role  of  street-level  bureaucrats  in  the  implementa¬ 
tion  process. 

Street-Level  Bureaucracy 

After  the  first  group  of  implementation  studies  had 
focused  mainly  on  the  role  of  organizations  in 
policy  implementation,  in  1980,  Michael  Lipsky 
turned  the  policy  process  upside  down.  He  claimed 
that  street-level  bureaucrats  are  the  real  policy¬ 
makers.  His  theory  of  “street-level  bureaucracy” 
focuses  on  the  discretionary  decisions  that  each 
frontline  worker  is  making  when  delivering  poli¬ 
cies  to  individual  citizens.  Often  such  decisions  are 
made  in  face-to-face  interactions  with  clients 
behind  closed  office  doors  or  in  the  field  as  when 
an  environmental  inspector  is  inspecting  a  farm.  It 
is  not  easy  for  managers  to  monitor  bureaucratic 


behaviors  in  such  encounters.  Their  discretion 
makes  street-level  bureaucrats  essential  actors  in 
implementing  public  policies. 

Although  Lipsky  emphasizes  the  individual  role 
of  street-level  bureaucrats  in  implementing  public 
policies,  it  is  an  ironic  aspect  of  the  theory  that 
their  similar  working  conditions  make  all  street- 
level  bureaucrats  apply  similar  behaviors  whether 
they  are  teachers,  police  officers,  nurses,  doctors, 
or  social  workers.  It  also  means  that  organizational 
differences  and  bureaucrats’  individual  attitudes 
are  not  expected  to  have  important  implications 
for  their  behaviors. 

Although  trying  to  do  their  best,  street-level 
bureaucrats  are  experiencing  a  gap  between  the 
demands  they  meet  from  legislative  mandates, 
managers,  and  citizens  on  one  side  and  their  high 
workload  on  the  other.  In  this  situation,  they  all 
apply  a  number  of  coping  mechanisms  that  sys¬ 
tematically  distort  their  work  in  relation  to  the 
intentions  of  the  legislation.  They  ration  services; 
make  priorities  between  their  tasks;  process  their 
clients  through  simple,  standardized  procedures, 
tending  to  favor  the  easier  and  more  resourceful 
clients  (creaming);  and  seek  to  control  clients. 
Over  time,  street-level  bureaucrats  develop  more 
cynical  perceptions  of  the  clients  and  modify  the 
policy  objectives  that  are  the  basis  of  their  work. 
These  coping  strategies  imply  that  the  implementa¬ 
tion  of  public  policies  is  systematically  biased  com¬ 
pared  with  legislative  mandates,  for  example,  for 
individualized,  yet  equal  treatment.  According  to 
Lipsky,  even  increasing  staff  resources  is  no  solu¬ 
tion  to  coping  problems  because  more  resources 
may  lead  to  more  demand  for  services. 

A  group  of  implementation  scholars  has  a  strong 
focus  on  these  frontline  workers.  What  unites  them 
is  a  conviction  that  the  behaviors  of  these  workers 
have  important  implications  for  implementation 
that  are  inversely  related  to  these  workers’  rank  in 
the  hierarchy.  While  Lipsky  focuses  on  similarities 
in  street-level  bureaucratic  behaviors,  some  imple¬ 
mentation  scholars  seek  to  explain  variation  in  the 
behaviors  of  these  bureaucrats.  They  typically  dif¬ 
fer  with  respect  to  the  aspects  of  frontline  workers’ 
behaviors  they  consider  as  dependent  variables  and 
to  the  concepts  they  apply.  While  Lipsky’s  coping 
strategies  are  reporting  supposedly  dysfunctional 
behaviors,  John  Brehm  and  Scott  Gates  suggest 
that  street-level  bureaucrats  work,  shirk,  or 


Implementation  1163 


sabotage.  Bureaucrats  persistently  and  loyally 
implement  public  policies  when  working.  They  do 
too  little  when  shirking — either  for  leisure  or 
because  they  disagree  with  the  policy.  Sabotage  is 
a  conscious  attempt  to  actively  counteract  the  aims 
of  the  public  policy  because  these  bureaucrats  dis¬ 
agree  with  the  policy  mandate  and  follow  another 
course  of  line.  In  their  empirical  work,  Brehm  and 
Gates  find  that  street-level  bureaucrats  actually  do 
work — not  because  of  managerial  directives  and 
control — but  because  they  fundamentally  agree 
with  the  policy  (refer  to  text  below). 

Other  implementation  studies  examine  the 
actions  of  these  frontline  bureaucrats,  for  example, 
in  terms  of  the  effort  that  they  spend,  the  choices 
they  make  on  what  kind  of  clients  and  what 
aspects  of  their  behaviors  to  focus  on,  and  the 
tools  that  they  employ,  as  well  as  the  decisions 
they  make.  Particularly  in  regulation  research, 
some  research  has  focused  on  the  styles  that 
inspectors  employ  in  day-to-day  interactions  with 
target  groups  (regulatees).  Inspectors  may,  for 
example,  differ  in  how  legalistic  versus  flexible 
they  are  and  how  coercive  versus  trusting  they  are. 
However,  style  also  seems  to  be  a  useful  concept 
for  social  policy  implementation,  for  example, 
when  analyzing  caseworkers’  professional  distance 
to  the  client  versus  personal  involvement  in  addi¬ 
tion  to  the  above  style  dimensions.  Studies  of 
street-level  bureaucrats  typically  find  considerable 
variation  in  their  behaviors,  including  both  coping 
and  styles. 

This  raises  the  crucial  question  of  how  this 
variation  can  be  explained.  Again,  studies  vary  in 
the  explanatory  variables  they  consider.  Some 
focus  on  the  organizational  norms  by  which  front¬ 
line  workers  are  socialized,  which  form  their  basis 
for  treating  clients.  This  research  tradition  has 
participants  from  both  sociology  (e.g.,  Pierre 
Bourdieu,  Erving  Goffmann,  and  Margaretha 
Jarvinen)  and  political  science  (e.g.,  Jodi  Sandfort, 
Steven  Maynard-Moody,  and  Michael  Musheno). 
Some  studies  within  that  tradition  pay  particular 
attention  to  how  bureaucrats,  often  unconsciously, 
use  power  to  make  citizens  comply  with  their 
directives.  Through  identity  work,  bureaucrats  are 
trying  to  make  citizens  accept  their  identity  and 
diagnosis  as  these  are  specified  by  the  bureaucrat. 
Citizens  must  accept  the  frontline  worker’s  defini¬ 
tion  of  their  problematic  behavior  and  cooperate 


and  improve  to  be  perceived  as  a  deserving  client, 
which  is  often  a  precondition  for  receiving  ser¬ 
vices.  Thus,  a  client  is  “created.”  The  function  of 
identity  work  is  to  make  each  client  fit  the  reper¬ 
toire  of  measures  that  are  available  to  the  worker. 
These  “institutionalization  processes”  are  based 
on  the  norms  of  the  organization. 

While  this  perspective  emphasizes  the  role  of 
organizations  for  individual  behaviors,  other 
perspectives  focus  on  the  role  of  people.  With  inspi¬ 
ration  from  the  classic  theory  of  representative 
bureaucracy  by  Donald  Kingsley,  some  studies  sug¬ 
gest  that  street-level  bureaucrats’  behaviors  are 
affected  by  their  social  background,  for  example, 
socioeconomic,  ethnic,  and  gender  as  well  as  educa¬ 
tional  background.  According  to  the  classic  theory 
on  representative  bureaucracy,  the  actions  of  public 
administration  reflect  the  values  of  the  social  strata 
from  where  bureaucrats  come.  The  theory  was 
refuted  for  decades  by  studies  of  bureaucrats  par¬ 
ticularly  in  central  government.  However,  recently 
some  scholars — for  example,  Kenneth  Meier  and 
collaborators — have  claimed  that  the  theory  might 
have  more  relevance  at  the  front  lines.  Thus,  social 
background  and  norms  are  likely  to  be  more  rele¬ 
vant  when  services  are  delivered  and  regulations 
enforced  in  direct  interactions  with  citizens  and 
firms  and  where  bureaucrats  typically  have  substan¬ 
tial  discretion.  According  to  some  studies,  gender 
does  matter,  for  example,  for  prosecution  and 
conviction  in  rape  cases.  By  the  same  token,  some 
studies  indicate  that  minority  students  perform  bet¬ 
ter  in  schools  with  a  minority  representation  among 
teachers  above  a  certain  threshold. 

Whereas  the  previous  perspectives  assume  some 
kind  of  collective  impact  on  the  behaviors  of 
street-level  bureaucrats — that  is,  from  the  very  job 
functions  of  street-level  bureaucrats  in  general, 
organizational  norms,  or  earlier  socialization  into 
the  norms  of  a  certain  social  group — some  studies 
focus  on  the  effects  of  individual  differences 
among  street-level  bureaucrats  in  terms  of  their 
abilities  and  attitudes.  Peter  May  and  Soren  Winter 
have  found  some  empirical  support  for  the  impor¬ 
tance  of  individual  factors.  They  report  substan¬ 
tially  greater  variation  in  caseworker  behaviors 
within  local  employment  service  agencies  than 
between  these  agencies.  They  also  found  that 
street-level  bureaucrats’  knowledge  and  attitudes 
have  a  considerable  influence  on  the  extent  to 
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which  their  behaviors  are  reflecting  the  policy 
mandate. 

According  to  Winter,  several  kinds  of  attitudes 
can  affect  street-level  bureaucratic  behaviors,  that 
is,  their  attitudes  on  (a)  the  policy,  (b)  the  effective¬ 
ness  of  the  tools  that  are  available,  (c)  the  target 
groups,  and  (d)  their  perceived  workload. 
Bureaucrats’  attitudes  on  the  factors  that  are  most 
closely  related  to  their  daily  work — such  as  how 
they  perceive  and  evaluate  various  tools,  their  cli¬ 
ents,  and  their  workload — appear  to  have  stronger 
behavioral  effects  than  their  ideological — and 
more  abstract — view  on  the  overall  policy.  How 
street-level  bureaucrats  perceive  the  adequacy  of 
their  workload  seems  to  have  stronger  behavioral 
effects — for  coping  and  other  kinds  of  behaviors — 
than  have  differences  in  their  actual  workload. 

According  to  this  research  perspective,  street- 
level  bureaucrats  are  even  more  important  as  poli¬ 
cymakers  than  implied  by  Lipsky’s  and  the  other 
street-level  bureaucracy  perspectives  above  because 
individual  street-level  bureaucrats  are  making 
important  authoritative  decisions  on  the  content  of 
the  policies,  which  are  delivered  to  citizens  and 
firms,  based  on  the  bureaucrats’  own  individual 
preferences.  However,  although  there  is  a  substan¬ 
tial  variation  in  their  behaviors,  Winter  and  May 
also  found  that  Danish  street-level  bureaucrats 
show  a  considerable  loyalty  to  the  law. 

Management 

As  indicated  by  the  above  analysis,  managerial  con¬ 
trol  of  street-level  bureaucrats  is  no  easy  task.  The 
very  nature  of  street-level  bureaucrats’  practices 
implies  that  they  exercise  considerable  discretion  in 
encounters  with  target  groups  that  are  normally 
not  very  visible  to  managers.  While  bookstores 
abound  with  simple  recipes  for  excellent  manage¬ 
ment,  these  are  rarely  based  on  systematic  empirical 
research  on  public  management.  And  the  existing 
evidence  is  limited  and  suggests  that  managing 
street-level  bureaucrats  is  by  no  means  a  simple  but 
a  difficult  and  complex  task. 

In  a  meta-analysis,  George  Boyne  finds  surpris¬ 
ingly  few  studies  on  effects  of  management  on 
performance  or  outcomes.  Yet  he  finds  some  evi¬ 
dence  that  management  does  matter.  However, 
because  such  studies  measure  the  direct  link 
between  management  and  outcomes,  it  is  hard  to 


know  through  what  kinds  of  street-level  bureau¬ 
cratic  practices  managers  can  bring  about  better 
outcomes.  Some  causal  links  are  missing. 
Unfortunately,  so  far,  very  few  studies  have  exam¬ 
ined  the  effects  of  management  on  street-level 
bureaucratic  behaviors.  Most  studies  indicate  that 
these  effects  are  limited  and  context  contingent.  The 
research  challenge  is  to  specify  to  what  extent  and 
how  management  affects  street-level  bureaucratic 
behaviors  in  given  contexts.  Such  research  has  just 
begun,  however.  Some  preliminary  findings  from 
studies  of  the  implementation  of  employment  policy 
by  May,  Winter,  and  associates  can  be  mentioned. 

Principal-agent  theory  offers  a  promising  way 
for  studying  these  relationships.  According  to  its 
basic  assumptions,  a  principal  (manager)  and  his  or 
her  bureaucratic  agents  may  not  share  all  prefer¬ 
ences,  and  the  relationship  is  characterized  by  infor¬ 
mation  asymmetry  that  systematically  disadvantage 
principals.  When  applied  to  the  management  of 
street-level  bureaucrats,  managers’  influence  seems 
to  vary  with  the  visibility  of  street-level  bureau¬ 
cratic  practices.  Some  are  quite  transparent  and 
easy  to  measure  and  monitor,  for  example,  social 
caseworkers’  application  of  sanctions  when  clients 
are  absent  from  mandatory  employment  promoting 
measures  with  no  good  cause.  Other  kinds  of 
behaviors  are  more  invisible,  for  example,  coping 
behaviors  and  to  what  extent  employment  service 
bureaucrats  emphasize  immediate  jobs  in  conversa¬ 
tions  with  clients.  Managerial  objectives  are  likely 
to  be  better  reflected  in  frontline  practices  when 
these  are  relatively  visible. 

Having  an  effective  manager  does  not  always 
result  in  better  implementation.  Sometimes  street- 
level  bureaucrats  experience  a  multiple  principal 
problem.  Their  local  managers  are  not  always 
loyal  to  the  letter  and  spirit  of  the  law.  This  may 
be  reinforced  if  local  political  bodies  resist  national 
policies.  Managerial  tools  also  vary  in  the  extent  to 
which  they  are  goal  directed  or  capacity  building. 
While  capacity  building  through  increasing  staff  or 
general  information  provision  might  generally 
improve  the  implementation  of  national  policies, 
the  effect  of  using  the  more  goal-directed  manage¬ 
ment  tools  may  depend  on  whether  local  policies 
support  or  oppose  national  ones.  Examples  of 
highly  goal-directed  management  tools  are  manag¬ 
ers’  clear  signals  of  their  expectations  for  frontline 
behaviors  and  recruitment  of  workers  with  a  better 
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fit  with  the  goals  of  the  organization.  Greater 
application  of  such  managerial  instruments  tends 
to  induce  poorer  implementation  of  national  poli¬ 
cies  among  street-level  bureaucrats  when  local 
policies  resist  national  ones. 

Finally,  management  is  relational.  This  implies 
that  the  effect  of  management  practices  on  street- 
level  bureaucratic  behaviors  is  often  contingent  on 
the  characteristics  of  individual  street-level  bureau¬ 
crats,  including  their  expertise,  motivation,  and 
perception  of  the  applied  management  tools.  In  a 
study  of  the  implementation  of  Danish  employ¬ 
ment  policy,  May  and  Winter  found  that  all  mana¬ 
gerial  effects  on  the  extent  to  which  caseworkers 
emphasized  jobs  in  conversations  with  clients  were 
contingent  on  caseworkers’  perceived  professional 
knowledge — for  example,  delegation  improves 
implementation  when  workers  have  knowledge 
but  weakens  it  when  they  do  not.  Similarly,  in  line 
with  Bruno  Frey’s  theory  on  motivation  crowding 
in  and  out,  Danish  studies  by  Lotte  Andersen  and 
Thomas  Pallesen  show  that  performance  pay  and 
regulation  of  university  faculty  have  positive  effects 
on  research  publication  productivity  only  if  the 
faculty  perceives  these  instruments  as  supportive 
of  professional  norms  rather  than  as  monitoring  or 
control. 

Although  the  direct  and  contingent  effects  of 
management  on  the  behavior  of  street-level  bureau¬ 
crats  are  often  limited,  indirect  effects  must  also  be 
taken  into  account.  These  include  the  impact  that 
managers  have  on  the  attitudes  of  their  frontline 
workers  and  on  in-service  training  and  the  way  in 
which  managers  by  designing  or  changing  organi¬ 
zational  structures  can  shape  bureaucratic  atti¬ 
tudes  and  behaviors. 

Target  Groups 

The  target  groups  of  a  public  policy  have  impor¬ 
tant  roles  both  for  the  production  of  implementa¬ 
tion  outputs  and  outcomes.  Target  groups  affect 
the  implementation  behaviors  of  agencies  and 
street-level  bureaucrats,  and  outcomes  that  achieve 
policy  goals  often  require  that  target  groups 
change  their  behaviors. 

In  depictions  of  the  classic  ideal  type  of  the 
democratic  chain  of  control,  citizens  are  active 
when  articulating  political  demands  and  electing 
political  representatives  but  relatively  passive  and 


law-abiding  subjects  when  it  comes  to  the  execution 
of  the  law.  However,  as  much  service  production 
and  regulatory  enforcement  take  place  in  interac¬ 
tions  between  street-level  bureaucrats  and  target 
populations,  citizens  and  firms  do  often  play  an 
active  role  in  shaping  implementation  outputs  in  a 
kind  of  coproduction  of  services  and  enforcement. 

As  noted  above,  clients  can  be  relatively  power¬ 
less  in  processes  of  coping,  identity  work,  and 
creation  of  clients,  but  the  power  of  target  popula¬ 
tions  in  implementation  processes  varies  across 
and  within  policy  areas.  There  is  a  huge  difference 
between  the  position  of  nonunionized  welfare 
minority  clients  in  welfare  and  employment  policy 
implementation  on  one  hand  and  organized  farm¬ 
ers  in  the  implementation  of  agricultural  support 
and  environmental  programs  on  the  other  hand. 
As  is  the  case  for  street-level  bureaucrats,  the  role 
of  target  populations  in  policy  implementation 
depends  on  their  action  model  in  terms  of  their 
preferences  and  abilities.  Relevant  resources  are 
cognitive  skills,  expertise,  wealth,  affiliation  with 
other  actors  with  resources  including  powerful 
interest  groups  (whether  the  group  has  a  positive 
or  negative  social  construction),  and  institutional 
arrangements  and  norms. 

For  decades  it  has  been  debated  in  the  regulatory 
literature  whether  regulated  industries  tend  to  cap¬ 
ture  the  public  regulatory  enforcement  apparatus  to 
make  it  a  servant  rather  than  a  master.  While  cap¬ 
ture  was  earlier  the  dominating  research  position, 
for  the  last  couple  of  decades,  the  capture  theory 
has  found  less  theoretical  and  empirical  support. 
The  role  of  interest  groups  in  implementation  pro¬ 
cesses  varies  among  political  systems.  Often  strong 
interest  representation  is  found  in  implementing 
committees  in  some  European  countries  with  strong 
corporatist  traditions.  According  to  Scandinavian 
studies,  interest  organizations  still  have  a  strong 
role  in  policy  implementation,  even  though  their 
formal  inclusion  in  the  policy-making  process  has 
been  reduced  during  the  past  few  decades,  when 
their  position  apparently  has  weakened  when  com¬ 
pared  with  that  of  legislative  bodies.  In  some  more 
pluralistic  political  systems  such  as  the  United 
States,  interest  groups  tend  to  be  more  fragmented 
in  terms  of  representation,  are  generally  less  for¬ 
mally  involved  in  policy  making  and  implementa¬ 
tion,  and  more  often  act  as  external  pressure 
groups.  Yet  some  recent  research  indicates  that  the 
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role  of  U.S.  interest  groups  in  implementation — 
including  administrative  rule  making — may  be 
stronger  than  previously  expected. 

As  Vibeke  Lehmann  Nielsen  has  found,  the 
resources  that  individual  firms  and  organizations 
can  engage  in  conversations  with  regulatory  inspec¬ 
tors  can  put  pressure  on  these  inspectors.  This  may 
affect  their  enforcement  directly  but  also  indirectly 
through  the  formation  of  the  informal  rules  of  the 
game  for  such  interactions,  which  in  turn  affect 
how  regulations  are  enforced.  She  found  that  regu¬ 
lations  are  more  leniently  enforced  for  public  insti¬ 
tutions,  which  speak  the  same  language  as  the 
regulators,  than  for  private  firms.  Also,  big  firms 
with  substantial  expertise  in  the  regulatory  area  as 
well  as  local  economic  importance — tend  to  be 
more  successful  than  small  ones  in  negotiating 
enforcement  terms. 

For  social  policy  implementation,  some  theories 
claim  that  clients  with  few  personal  resources  are 
systematically  disadvantaged  in  their  interactions 
with  the  administrative  system.  According  to  the 
access  theory  originally  developed  by  Bernhard 
Schaffer  and  Huang  Wen-hsien,  clients  have  to 
cross  three  thresholds  to  get  access  to  the  services 
to  which  they  are  entitled.  The  first  is  the  registra¬ 
tion  threshold.  For  many  services,  eligible  clients 
must  apply  themselves,  which  tends  to  disadvan¬ 
tage  clients  with  poor  educational  background, 
poor  knowledge  about  their  rights,  and  low  social 
status.  This  can  be  reinforced  by  some  emotional 
factors,  including  stigmatizing.  Some  clients  do  not 
perceive  themselves  as  deserving  assistance,  or  they 
may  hold  personal  ideals  of  dignity  in  fending  for 
themselves. 

Second,  clients  must  cross  the  competence 
threshold,  which  requires  that  one  can  handle  the 
interaction  with  the  bureaucracy,  including  being 
acquainted  with  the  formal  and  informal  rules  of 
the  game  and  applied  decision  criteria  and  being 
able  to  formulate  one’s  thoughts.  This  again  disad¬ 
vantages  clients  with  a  poor  educational  and  social 
background.  Finally,  clients  must  be  able  to  cross 
the  efficiency  threshold,  and  some  do  not  make  it 
because  they  do  not  fit  with  staffs’  perceptions  on 
how  available  services  would  be  most  effectively 
distributed  among  clients.  Also,  the  efficiency 
threshold  tends  to  bias  the  implementation  system¬ 
atically  to  the  benefit  of  clients  with  more  resources 
but  disadvantages  clients  with  a  poor  educational 


and  social  background  (refer  also  to  Lipsky’s  con¬ 
cept  of  “creaming”  earlier). 

Target  groups  have  also  important  roles  in  rela¬ 
tion  to  implementation  outcomes.  Most  public 
policies  aim  at  changing  the  problematic  behaviors 
of — or  conditions  for — citizens  or  firms  by  either 
regulating  their  behaviors  or  providing  services  for 
them  that  will  help  them  do  better.  Most  policies 
build  on  an  explicit — or  often  implicit — causal 
theory  on  how  these  behaviors  and  conditions  can 
be  changed  by  public  intervention.  However,  as 
indicated  above  these  causal  theories  are  not 
always  valid.  Unfortunately,  political  science 
research  has  not  paid  much  attention  to  the  out¬ 
comes  of  public  policies  and  the  mechanisms  that 
affect  how  citizens  and  firms  respond  to  public 
policies  and  the  way  in  which  they  are  imple¬ 
mented.  Some  attention  to  these  issues  has  been 
paid  in  regulation  research  in  the  borderline 
between  sociology,  economics,  psychology,  and 
political  science.  While  developed  for  regulatory 
policies,  the  theorizing  seems  to  have  some  rele¬ 
vance  for  social  policies  as  well. 

Citizens’  and  firms’  responses  to  public  policy 
implementation  are  based  on  their  individual 
action  model  according  to  which  their  behaviors 
are  influenced  by  their  ability  and  motivations 
(refer  to  the  above  text).  Among  abilities,  cognitive 
skills  are  very  important.  This  first  includes  knowl¬ 
edge  of  the  rules,  which  may  be  one  of  the  most 
important  factors  in  explaining  how  citizens  and 
firms  respond.  When  people  do  not  comply  with 
regulations  or  get  their  rights,  it  is  often  not 
because  of  lack  of  will  but  lack  of  knowledge  of  the 
rules.  For  decades,  many  parents  of  handicapped 
children  in  Denmark  have  not  been  compensated 
for  taking  care  of  their  child  at  home  because  they 
are  not  aware  of  their  legal  rights.  Second,  favor¬ 
able  outcomes  also  depend  on  whether  people 
know  how  to  translate  rules  into  concrete  action. 
Farmers  may  know  the  agro-environmental  rules 
but  may  not  know  what  a  particular  rule  implies 
for  their  farming  practices.  Financial  capacity  and 
resilience  can  also  be  important — not  least  for 
firms  in  regulatory  policies. 

Three  main  types  of  motivations  can  be  identi¬ 
fied.  Economic  motivation  concerns  the  perceived 
net  utility  of  different  courses  of  action  including 
compliance/noncompliance.  For  regulatory  and 
criminal  policies,  deterrence  theory  involves  a 
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calculation  of  costs  and  benefits  of  not  comply¬ 
ing.  Into  that  calculus  enter  both  the  perceived 
risk  of  being  detected  and  the  risk  of  being  pun¬ 
ished  if  violating  the  law,  the  costs  of  required 
additional  investments  and  other  production  costs 
for  complying,  and  the  expected  profits  from  dif¬ 
ferent  courses  of  action.  Studies  seem  to  converge 
on  finding  substantial  effects  of  perceived  detec¬ 
tion  risk,  while  most  studies  find  no  or  weak 
effects  of  perceived  sanction  risk.  Economic  moti¬ 
vations  also  seem  relevant  for  target  groups  in 
several  social  policies. 

Social  motivation  builds  on  the  wish  of  citizens 
and  firms  to  get  respect  and  social  recognition  from 
peers  by  following  their  social  expectations.  Thus, 
people  comply  not  because  they  agree  with  the  pol¬ 
icy  but  because  “significant  others”  expect  them  to 
comply.  Those  may  be  peers,  friends  and  family,  or 
customers  and  other  firms,  for  example,  trade  orga¬ 
nizations.  For  example,  the  fact  that  drunk  driving 
has  become  socially  less  acceptable  has  helped  the 
implementation  of  anti-drunk-driving  programs.  By 
the  same  token,  some  anti-drunk-driving  informa¬ 
tion  campaigns  have  successfully  targeted  young 
women  rather  than  young  men  to  encourage  women 
to  tell  male  drivers  to  avoid  drunk  driving. 

Even  street-level  bureaucrats  may  affect  the 
behaviors  of  target  groups  by  setting  up  social 
expectations,  for  example,  based  on  their  styles  of 
interaction  with  clients  or  firms.  Legalistic  rather 
than  flexible  enforcement  styles  may  signal  to 
regulatees  that  rules  and  instructions  must  be 
taken  seriously,  and  legalistic  styles  may  provide 
more  certainty  for  firms  when  making  decisions  on 
investments  in  technology  that  would  allow  more 
compliance. 

Normative  motivation  is  based  on  moral  and 
conscience.  People  or  firms  follow  the  law  not 
because  significant  others  expect  them  to  do  so 
but  because  they  agree  with  the  law  or  because 
they  at  least  feel  a  general  moral  duty  to  obey  laws 
even  if  they  do  not  agree.  The  latter  civic  duty  var¬ 
ies  substantially  between  and  within  countries  and 
is  important  for  achieving  policy  goals.  Because  of 
the  role  of  different  motivations  and  resources  in 
shaping  citizens’  and  firms’  action  models,  target 
populations’  responses  to  policy  implementation 
can  involve  a  complex  weighing  of  economic, 
social,  and  normative  motivation  along  with 
abilities. 


One  implication  of  the  different  types  of  moti¬ 
vations  is  that  public  agencies  and  their  street-level 
bureaucrats  may  not  always  be  the  most  effective 
agents  for  implementing  public  policies.  Third  par¬ 
ties  may  sometimes  be  perceived  as  more  impor¬ 
tant,  competent,  or  trustworthy  by  target  groups, 
and  third  parties  may  trigger  more  social  and  nor¬ 
mative  motivation.  The  anti-drunk-driving  infor¬ 
mation  campaign  above  with  young  girls  as  a  third 
party  is  one  example.  Another  is  that  in  a  study  on 
agro-environmental  regulation  in  Denmark, 
Winter  and  May  found  that  advice  from  third  par¬ 
ties  (e.g.,  farmers’  associations  and  their  agricul¬ 
tural  consultants,  who  are  highly  respected  among 
farmers)  enhance  farmers’  normative  motivation 
to  comply  with  the  rules.  In  contrast,  similar 
advice  from  public  inspectors  did  not  have  such 
effect  even  when  farmers  listened  to  them. 

Socioeconomic  Environment 

The  task  environment  in  terms  of  socioeconomic 
conditions  is  important  for  implementation  pro¬ 
cesses  and  successes.  It  is  quite  a  different  chal¬ 
lenge  to  implement  an  employment  policy  program 
in  a  recession  than  in  a  boom,  and  often  substan¬ 
tial  regional  variations  can  be  found  as  well  at  any 
given  point  in  time.  It  is  also  a  different  challenge 
to  implement  an  agro-environmental  program  in 
areas  with  a  high  livestock  density  with  many  large 
pig  farms  and  a  much  greater  risk  of  nitrate  pollu¬ 
tion  than  in  areas  with  little  animal  husbandry. 
Consequently,  it  is  important  to  isolate  the  effects 
of  implementation  from  effects  of  task  environ¬ 
ments.  As  an  example,  no  less  than  60%  of  the 
variation  in  labor  market  integration  of  refugees 
and  immigrants  among  Danish  municipalities  in 
the  beginning  of  this  century  can  be  explained  by 
differences  in  task  environments.  This  is  because 
the  social  composition  of  the  target  groups  and  the 
local  labor  market  situation  varied  substantially 
among  local  governments. 

Research  Strategies  and  Methodologies 

In  the  early  1980s,  two  opposing  implementation 
research  strategies  were  developed.  Top-down 
implementation  scholars  with  Mazmanian  and 
Sabatier  as  the  leading  ones  saw  implementation  as 
a  control  problem.  They  typically  focused  on  the 
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implementation  of  one  particular  law  according  to 
its  official  goals.  These  scholars  tended  to  focus  on 
the  role  of  organizations  in  the  implementation 
process  and  on  how  policy  statutes  could  be  struc¬ 
tured  to  increase  the  clarity  of  goals  and  the  com¬ 
mitment  of  implementing  organizations. 

Bottom-up  researchers  on  the  other  hand  exam¬ 
ined  mainly  interactions  between  street-level 
bureaucrats  and  citizens  or  firms  in  a  given  prob¬ 
lem  area.  Some  of  the  leading  scholars  were  Benny 
Hj  ern  and  Richard  Elmore.  They  focused  on  the 
capacity  of  street-level  bureaucrats  to  change  the 
problematic  behaviors  of  or  conditions  for  citizens 
or  firms.  This  capacity  includes  an  experience- 
based  knowledge  of  the  incentives  and  conditions 
that  operate  on  citizens  or  firms.  While  such  infor¬ 
mation  is  crucial  for  problem  solving,  it  is  rarely 
systematically  provided  and  used  in  lawmaking 
and  implementation  processes  at  higher  levels. 
They  also  focused  on  the  discretion  and  autonomy 
that  these  street-level  bureaucrats  have  versus 
higher  level  officials.  They  emphasized  the  lack  of 
central  control  over  the  decisive  implementation 
processes  that  are  taking  place  at  the  bottom.  As 
mentioned  above,  some  bottom-up  scholars  even 
disregarded  official  policy  goals  as  an  evaluation 
standard  and  instead  used  problem  solving  as 
defined  by  themselves  or  local  actors  as  a  standard. 

A  problem  for  these  research  strategies  was  that 
top-down  scholars  tended  to  ignore  the  bottom — 
particularly  the  discretionary  role  of  street-level 
bureaucrats — and  that  bottom-up  scholars  tended 
to  remain  at  the  bottom  and  rarely  got  far  enough 
up  to  focus  on  implementation  mechanisms  at  an 
organizational  level  of  analysis.  Many  bottom-up 
studies  were  very  descriptive  and  added  little  to 
theory  building.  In  the  debate  between  the  two 
contending  perspectives,  theoretical,  methodologi¬ 
cal,  and  normative  arguments  were  mingled 
together.  Attempts  of  synthesizing  the  two  strate¬ 
gies  were  made  without  much  success,  and  the 
theoretical  conflict  among  implementation  schol¬ 
ars  was  not  very  fruitful  for  theory  development. 
Although  the  integrated  implementation  model 
that  has  organized  the  previous  part  of  this  entry 
focuses  on  both  top  and  bottom,  it  is  not  a  real 
synthesis  of  the  two  strategies.  It  rather  seeks  to 
integrate  some  of  the  most  promising  theoretical 
elements  from  the  implementation  literature  into  a 
common  framework. 


The  first  generation  of  implementation  studies 
relied  on  case  studies,  and  most  of  these  used 
mainly  qualitative  research  methods  such  as  docu¬ 
ment  analysis  and  loosely  structured  interviews. 
These  were  sometimes  combined  with  quantitative 
analyses  of,  for  example,  program  intake  and  the 
social  situation  of  target  groups  after  program  par¬ 
ticipation.  Exploratory  case  studies  were  natural  in 
an  early  stage  of  implementation  research  when 
knowledge  about  implementation  mechanisms 
was  very  limited  and  when  the  main  purpose  of  the 
research  was  to  generate  hypotheses  and  research 
ideas. 

However,  as  first  claimed  by  Malcolm  Goggin, 
the  case  study  tradition  gradually  became  a  barrier 
to  theory  testing,  generalization,  and  the  accumu¬ 
lation  of  theory  and  knowledge  on  implementa¬ 
tion.  Most  implementation  studies  focused  on 
many  explanatory  variables.  As  an  example,  the 
implementation  model  by  Mazmanian  and  Sabatier 
has  21  different  variables.  Often  key  variables  var¬ 
ied  from  one  study  to  another.  Implementation 
research  ran  into  the  “too  many  variables  but  too 
few  cases  problem.”  The  single-case-study  method¬ 
ology  did  not  allow  isolating  the  crucial  variables 
for  implementation  success.  Accordingly,  Goggin 
recommended  greater  use  of  comparative  case 
studies  and  big studies  to  allow  for  statistical 
control  for  third  variables. 

From  the  mid-1980s,  more  quantitative  research 
designs  with  many  cases  and  the  application  of 
statistical  designs  have  gradually  been  employed  in 
implementation  research  and  in  the  affiliated 
research  fields  of  public  management,  regulatory 
enforcement  and  compliance,  and  recently  also  in 
a  few  network  studies.  This  has  lead  to  more  sys¬ 
tematic  testing  of  theories  and  hypotheses  with 
control  for  third  variables  including  task  environ¬ 
ment.  Gradually,  statistical  methods  have  become 
more  advanced  with  the  introduction  of  methods 
that  take  the  hierarchical  or  multilevel  character  of 
data  into  consideration  (e.g.,  street-level  bureau¬ 
crats  nested  in  organizations),  use  of  instrumental 
variables,  and  time-series  data  for  examining 
causal  directions. 

However,  the  case  study  tradition  still  continues 
parallel  to  the  application  of  quantitative  research 
designs.  It  is  particularly  popular  in  network  stud¬ 
ies  and  in  studies  of  power  relations  between 
street-level  bureaucrats  and  clients. 
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Conclusion 

Policy  implementation  is  neither  an  automatic  nor 
dull  process.  It  is  a  genuine  political  process  for 
which  the  achievement  of  outputs  or  outcomes  that 
match  policy  objectives  cannot  be  taken  for  granted. 
Implementation  results  are  affected  by  the  chosen 
policy  design  and  the  validity  of  the  causal  theory 
behind  it.  The  results  are  also  affected  by  the  com¬ 
mitment  and  cooperation  of  organizations  that  are 
involved  in  the  process.  Managers  have  a  difficult 
role  in  transmitting  policies  to  street-level  bureau¬ 
crats  that  have  a  key  discretionary  role  when  deliv¬ 
ering  policies  to  target  populations.  The  latter  are 
not  passive  subjects  but  participate  in  shaping 
implementation  outputs  as  well  as  outcomes. 

Parallel  analytic  action  models  have  been 
constructed  for  explaining  the  behaviors  of  imple¬ 
mentation  actors  and  target  groups  based  on  the 
interests  and  resources  of  the  actors.  Particularly  in 
the  first  decades  of  implementation  research, 
scholars  sought  to  take  steps  to  create  a  general 
implementation  theory.  This  process  led  to 
very  diverse  analytical  frameworks  and  research 
strategies  but  to  little  theory  accumulation. 
Implementation  is  too  complex  to  be  accounted 
for  by  one  theory.  Different  theories  are  relevant 
for  explaining  different  stages  or  parts  of  the 
implementation  process,  outputs,  and  outcomes. 
Rather  than  a  general  theory  of  implementation, 
the  integrated  implementation  model  presented 
here  can  be  perceived  as  a  framework  for  analysis 
or  as  a  series  of  spotlights  that  shed  light  at  some 
key  aspects  of  the  implementation  process  that  are 
worth  examining  more  closely  if  one  wants  to 
understand  what  implementation  is  about. 

Some  observers  have  claimed  that  after  a  peak 
in  the  early  1980s,  implementation  research  has 
come  to  an  end.  However,  as  shown  by  Harald 
Saetren,  this  is  far  from  the  case.  The  number  of 
publications  on  implementation  using  that  term 
has  continued  to  grow  exponentially  since  the 
1970s.  However,  the  growth  has  taken  place  par¬ 
ticularly  in  journals  and  publications  outside  the 
core  of  political  science,  public  policy,  and  public 
administration.  Implementation  studies  have 
become  widespread  in  education,  health,  environ¬ 
ment,  law,  and  economics  and  much  more  so  than 
in  the  former  core  publications.  However,  even 
within  the  core  journals,  the  number  of  implemen¬ 
tation  articles  applying  the  implementation  term 


has  stabilized  at  a  level  not  far  below  the  peak  of 
the  1980s. 

However,  implementation  research  as  it  has 
been  presented  here  is  often  performed  under  labels 
other  than  “implementation,”  such  as  (new)  gover¬ 
nance,  policy  design  and  instruments,  network 
studies,  outsourcing,  public-private  partnerships, 
street-level  bureaucracy,  management,  “new  public 
management,”  principal-agent  studies,  perfor¬ 
mance,  regulatory  enforcement,  and  compliance. 
Accordingly,  the  total  amount  of  implementation 
relevant  research  has  increased  considerably  and 
continues  to  do  so  yet  under  various  labels.  At  least 
two  reasons  for  the  described  development  can  be 
identified.  First,  fashions  are  important  for  research 
interests  and  how  they  are  labeled.  Second,  when 
for  good  reasons  more  and  more  implementation 
scholars  gave  up  constructing  the  general  imple¬ 
mentation  theory,  several  scholars  got  interested  in 
partial  aspects  of  implementation  under  various 
labels. 
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Copenhagen,  Denmark 
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Indigenous  Peoples’  Rights 


The  concept  of  “indigenous”  as  used  in  United 
Nations  (UN)  documents  seeks  to  clarify  a  diffuse 
relationship  between  the  modern  sovereign  state 
and  a  special  type  of  traditional  community  that 
does  not  itself  constitute  a  political  entity.  A 
report  to  the  UN  by  Martinez  Cobo  in  1987 
describes  indigenous  peoples  as  people  who  have 
a  historical  continuity  with  preinvasion  or  preco¬ 
lonial  societies,  who  consider  themselves  distinct 
from  other  sectors  of  the  societies  now  prevailing 
in  those  territories,  who  form  at  present  nondom¬ 
inant  sectors  of  society,  and  who  are  determined 
to  preserve  and  transmit  to  future  generations 
their  ancestral  territories  and  ethnic  identity.  This 
early  description  is  rephrased  in  different  forms 
maintaining  the  core  criteria  of  priority  in  time, 
cultural  distinctiveness,  experience  of  marginal¬ 
ization  and  dispossession,  and  self-identification. 
The  terms  first  nations ,  first  people,  and  aborigi¬ 
nal  convey  a  similar  meaning.  This  entry  analyzes 
the  notion  of  indigenous  peoples’  rights,  its  global 
recognition,  why  they  contest  rights,  the  legal  use 
and  control  of  territories,  and  the  most  recent 
developments. 

The  Two  Rights 

The  notion  of  indigenous  peoples’  rights  may  be 
interpreted  in  two  ways.  Most  generally,  it  refers  to 
a  moral  entitlement,  perceived  as  restoration  and 
redress  of  previous  wrongs,  and  as  such  it  has 
developed  as  a  part  of  the  decolonialization  process 


and  the  global  agenda  for  human  rights.  These 
rights  are  expressed  most  clearly  in  the  UN 
Declaration  on  the  Rights  of  Indigenous  Peoples, 
passed  by  the  General  Assembly  in  September 
2007.  This  nonbinding  declaration  states  that 
indigenous  peoples  have  the  individual  and  collec¬ 
tive  right  to  the  full  enjoyment  of  all  human  rights 
and  fundamental  freedoms  as  recognized  in  the 
charter  of  the  UN,  and  other  human  rights  mecha¬ 
nisms,  and  that  indigenous  peoples  have  the  right 
to  self-determination. 

A  second  understanding  refers  to  the  legal  rights 
most  clearly  expressed  in  the  International  Labour 
Organization  (ILO)  Convention  No.  169  of  1989, 
Indigenous  and  Tribal  Peoples  Convention,  which 
is  binding  for  the  signatories  and  sets  out  proce¬ 
dures  for  how  rights  should  be  implemented  in 
agreement  between  the  state  and  its  citizens.  Rights 
granted  within  the  legal  structures  of  sovereign 
states  through  treaties  and  court  decisions  are  cre¬ 
ating  a  precedence  expressed  as  a  customary  inter¬ 
national  law. 

Global  Recognition 

The  two  kinds  of  rights  are  mutually  reinforcing. 
The  driving  force  behind  a  global  recognition  has 
been  an  indigenous  movement  initiated  from  the 
1970s  onwards  among  American  Indians,  Sami, 
Inuit,  Maori,  and  Australian  Aborigines,  who 
found  similarities  in  their  situation  across  widely 
different  continents  and  historical  contexts.  A 
common  theme  among  emerging  national  and 
transnational  organizations  was  the  experience 
that  the  expanding  social  and  economic  domain  of 
modern  states  ignored  the  life  ways  and  livelihood 
of  indigenous  minorities;  hence  the  first  claim  was 
for  recognition. 

A  seminal  event  in  the  debate  on  indigenous 
rights  was  the  establishment  of  the  UN  Working 
Group  for  Indigenous  Populations  (WGIP)  in 
Geneva  in  1982.  The  Working  Group  proper  con¬ 
sisted  of  five  experts  who  met  yearly  in  July  and 
allowed  representatives  from  indigenous  peoples 
worldwide  to  attend  and  give  brief  presentations. 
These  meetings  developed  as  significant  meeting 
places  for  hundreds  of  delegates  and  advocacy  orga¬ 
nizations,  which  spoke  at  the  plenary  session  and 
used  the  corridors  and  lounges  and  meeting  rooms 
to  forge  new  networks,  exchange  experiences,  and 
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develop  strategies.  By  1993,  the  WGIP  had  devel¬ 
oped  a  draft  for  the  Declaration  on  Indigenous 
Rights,  and  indigenous  issues  were  raised  in  docu¬ 
ments  from  conferences  such  as  the  Rio  Conference 
on  Sustainable  Development  (1992)  and  the  World 
Conference  Against  Racism  in  Durban  2001.  A 
Special  Rapporteur  position  on  the  Situation  of 
Human  Rights  and  Fundamental  Freedoms  of 
Indigenous  Peoples  was  created  in  2002.  The  same 
year  saw  the  establishment  of  the  UN  Permanent 
Forum  for  Indigenous  Issues  in  New  York  com¬ 
posed  of  eight  state  representatives  and  eight  del¬ 
egates  from  indigenous  organizations. 

Contested  Rights 

The  rights  of  indigenous  peoples  are  contested  in 
many  ways.  It  has  been  relatively  easy  to  identify 
who  are  indigenous  on  continents  where  coloniz¬ 
ers  arrived  from  across  the  water  as  in  the  Americas 
and  Australia.  But  Africa  and  much  of  Asia  repre¬ 
sent  special  conceptual  challenges.  The  dominant 
position  of  White  colonial  forces  left  all  of  Black 
Africa  in  a  subordinate  position  that  in  many 
respects  was  similar  to  the  position  of  indigenous 
peoples  elsewhere,  as  first  comers,  nondominant 
and  different  in  culture  from  the  White  intruders. 
The  important  difference  from  most  settler  states  is 
that  formally  White  colonial  forces  withdrew  from 
Africa.  Accordingly,  many  politicians  argue  that  all 
Africans  are  indigenous,  or  alternatively,  that  this 
distinction  does  not  apply  to  the  African  continent. 

In  addition  to  inevitable  problems,  then,  in 
defining  who  are  indigenous,  another  challenge  is 
that  any  procedure  for  singling  out  one  group  for 
special  treatment  or  affirmative  action  goes  against 
a  cherished  liberal  principle  of  equal  treatment  of 
all  citizens.  It  also  goes  against  administrative  pref¬ 
erences  for  clear-cut  and  unambiguous  target 
groups  and  routines  for  equal  treatment.  Some 
opponents  claim  that  indigenous  rights  represent  a 
new  form  for  racism,  distributing  favors  according 
to  descent,  while  the  main  argument  for  special 
rights  is  a  compensation  for  previous  injustice. 

Legal  Rights 

A  crucial  aspect  of  indigenous  rights  relates  to  the 
use  and  control  over  territories.  The  ILO  Con¬ 
vention  No.  169  goes  far  in  recognizing,  in  Article 


14,  “the  rights  of  ownership  and  possession.” 
However,  this  convention  has  so  far  been  ratified 
by  only  20  countries  and  has  occasioned  extensive 
legal  debates  on  implementation. 

In  a  few  cases,  indigenous  groups  have  been 
delegated  control  over  larger  territories  and  formed 
a  governing  body/parliament.  The  Inuit  of  Green¬ 
land  achieved  home  rule  as  early  as  1979.  Nunavut 
was  separated  from  the  Northern  Territory  in 
Canada  in  1999  with  jurisdiction  over  territorial 
matters.  Some  northern  indigenous  peoples  of 
Siberia  control  their  own  autonomous  republics 
within  the  Russian  state. 

Land  rights  issues  most  commonly  come  up  in 
negotiations  with  governments  or  are  taken  to  the 
courts.  An  important  judgment  in  Australia,  the 
Mabo  case  in  1992,  established  the  concept  of 
“native  title.”  Native  or  aboriginal  title  reflects  the 
entitlement  of  the  indigenous  inhabitants,  in  accor¬ 
dance  with  their  laws  and  customs,  to  their  tradi¬ 
tional  lands,  in  the  cases  where  it  has  not  been 
explicitly  extinguished.  The  judgment  thus 
denounced  the  doctrine  of  terra  nullius — that 
before  the  colonialization  the  land  belonged  to 
nobody.  The  subsequent  Native  Title  Act  1993 
attracted  considerable  international  interest  and 
established  a  precedent  that  has  been  argued  in 
court  cases  in  many  parts  of  the  world,  even 
though  later  amendments  reduced  its  scope. 

In  New  Zealand,  the  Court  of  Appeal,  in  a  case 
in  1987  that  revisited  the  interpretation  of  the 
rights  inherent  in  the  Treaty  of  Waitangi  from 
1840,  laid  down  the  notion  of  “fairness”  as  a  guid¬ 
ing  principle  for  relations  between  the  Maori  and 
the  Crown. 

In  Canada,  forms  of  self-determination  have 
been  recognized  in  a  series  of  treaties.  The  Nisga’a 
of  British  Columba  went  to  court  in  1969  for  a 
declaration  that  their  title  to  land  had  never  been 
surrendered  by  treaty  or  otherwise  extinguished. 
An  agreement  in  support  of  aboriginal  title  was 
reached  in  2000.  The  James  Bay  Cree  reached  an 
agreement  with  the  Government  of  Quebec  in 
2002  in  a  treaty  symbolically  perceived  as  a 
nation-to-nation  agreement. 

Another  case  with  wider  impact  was  the  decision 
by  the  Inter-American  Court  of  Human  Rights  in 
2001  in  the  case  of  Aivas  Tingni  Community  v. 
Nicaragua.  The  Court  found  that  the  Nicaraguan 
government  had  granted  concessions  to  a  foreign 
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transnational  company  for  logging  on  traditional 
lands  without  prior  consultation  and  consent  from 
the  community.  Thus  Nicaragua  had  failed  to 
implement  the  land  rights  provision  of  its  own  laws, 
and  in  violation  of  the  Inter-American  Convention 
on  Human  Rights.  The  decision  set  a  regional  prec¬ 
edent  that  is  potentially  applicable  to  similar  strug¬ 
gles  elsewhere  against  the  incursion  of  transnational 
corporate  interests. 

In  Botswana,  a  group  of  San  indigenous  peoples 
took  the  government  to  court  after  they  were  relo¬ 
cated  from  their  traditional  territory,  the  Central 
Kalahari  Game  Reserve.  In  2006,  the  court  ruled 
in  favor  of  the  applicants  that  they  were  lawfully 
in  possession  of  the  land  they  occupied  and  that 
they  were  deprived  of  it  forcefully  and  without 
their  consent.  This  was  seen  as  a  victory,  as  the 
case  was  argued  as  an  “indigenous  rights”  case. 
However,  while  the  verdict  recognized  rights  based 
on  continuous  occupation,  the  Botswana  govern¬ 
ment’s  violation  of  existing  statutory  laws  was  the 
main  justification.  In  effect,  a  ruling  that  prohib¬ 
ited  access  to  water  in  a  desert  environment  has 
prevented  the  enjoyment  of  “possession.” 

In  many  of  these  cases,  the  courts  have  stressed 
the  significant  connection  between  indigenous 
land  rights  and  indigenous  identity.  It  is  in  the 
nature  of  these  cases  that  the  court  has  ruled 
against  aspects  of  the  development  policies  of  their 
own  governments,  and  verdicts  may  be  counter¬ 
acted  by  state  policies.  But  such  cases  are  sup¬ 
ported  by — and  contribute  to — an  emerging  body 
of  international  law.  As  argued,  for  instance,  by 
James  Anaya,  the  essence  in  this  process  cannot  be 
reduced  to  a  set  of  rules  with  fixed  meanings  but  is 
a  normative  system  that  has  developed  “however 
grudgingly  and  imperfectly”  to  support  indigenous 
peoples’  demands. 

Further  Developments 

Indigenous  peoples  are  tied  to  their  land  by  way 
of  making  a  living.  They  also  have  spiritual  con¬ 
nections  to  territories  where  ancestors  are  buried, 
and  harvesting,  herding,  or  hunting  creates  ties 
between  people  and  nature  and  enhances  a  sense 
of  belonging.  An  enduring  and  dynamic  relation 
to  land  is  best  achieved  through  a  measure  of  self- 
determination.  The  struggle  for  recognition  of 
territorial  rights  (of  ownership  or  possession)  is  an 


ongoing  concern  that  has  brought  tangible  results 
in  some  cases,  while  in  other  parts  of  the  world  it 
has  only  just  begun. 

However,  this  option  is  not  always  possible  to 
implement.  In  many  cases,  the  mix  of  indigenous 
and  settler  populations  is  such  that  it  will  not  be 
possible  to  identify  an  indigenous  ownership  with¬ 
out  discriminating  against  nonindigenous  neigh¬ 
bors.  Moreover,  economic  factors  increasingly 
mean  that  the  exercise  of  traditional  adaptations 
often  on  a  reduced  territory  can  only  carry  a  small 
population.  Indigenous  peoples  find  themselves  as 
part  of  the  global  migration  from  countryside  to 
urban  areas  and  part  of  the  pool  of  landless  labor¬ 
ers  and  squatters. 

The  challenge  for  the  future,  then,  in  following 
up  the  intentions  of  self-determination  as  captured 
in  the  UN  Declaration  on  the  Rights  of  Indigenous 
Peoples  will  be  to  find  ways  to  develop  instruments 
for  nonterritorialized  self-determination.  This  will 
mean  devolution  of  power  within  defined  areas  of 
activities,  most  obviously  matters  of  language,  cul¬ 
ture,  and  education,  but  theoretically  a  broad 
range  of  areas  can  be  included.  Procedures  for  elec¬ 
tion  of  a  representative  body  would  have  to  be 
worked  out  based  on  self-identification  and  some 
criteria  of  descent.  Such  systems  are  in  operation, 
for  instance,  for  elections  to  the  Sami  assemblies  in 
Scandinavia. 
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Individualism 


The  concept  of  individualism  comprises  at  least 
three  basic  ideas  in  the  political  realm.  First  is  the 
dignity  of  man:  Following  the  Kantian  formula, 
each  individual  being  exists  as  an  end  in  himself  or 
herself  and  not  merely  as  a  means  for  arbitrary  use 
by  this  or  that  will.  Second,  the  idea  of  autonomy 
holds  that  an  individual’s  thought  and  action  are 
his  or  her  own  and  not  determined  by  agencies  or 
causes  outside  his  or  her  control.  Third  comes 
privacy,  according  to  which  each  human  being 
should  enjoy  a  private  life,  an  area  in  which  the 
individual  should  be  untroubled  by  others  and 
able  to  act  and  think  in  accordance  with  his  or  her 
own  free  will. 

This  concept  of  individualism  is  indissociably 
bound  up  with  the  tenets  of  modernity  and  with  the 
conviction  that  meaning,  truth,  and  value  originate 
in — and  exist  for  the  benefit  of — mankind.  What 
we  call  our  “modern  societies”  are  no  longer  holis¬ 
tic,  to  use  Louis  Dumont’s  term.  Holistic  societies 
were  understood  as  the  units  that  existed  before  the 
individual,  in  which  individuals  had  only  to  occupy 
their  designated  place.  In  individualistic  societies, 
this  relationship  is  turned  on  its  head:  Man’s  role  is 
to  grow  into  his  or  her  autonomy  and  to  learn  to 
judge  independently  by  detaching  himself  or  herself 
from  entrenched  customs  and  prejudices. 

However,  this  first  brief  overview  of  the  con¬ 
cept  should  not  hide  the  fact  that  individualism 
continues,  as  it  always  has,  to  lend  itself  to  widely 
divergent  interpretations.  In  a  seminal  book  on 
this  topic,  Steven  Lukes  identified  not  less  than 
11  forms  of  individualism  and  added  that  they 
were  not  mutually  exclusive  or  jointly  exhaustive. 
As  Pierre  Birnbaum  and  Jean  Leca  have  also 
shown,  the  notion  of  individualism  can  be  inter¬ 
preted  in  innumerable  different  ways  that  are  not 
always  in  harmony  with  one  another,  whether  we 


are  referring  to  utilitarian  individualism — a  vision 
of  independent  units  motivated  by  a  common 
quest  for  well-being,  romantic  individualism — 
that  of  individuals  inspired  by  their  search  for 
the  most  authentic  form  of  the  self,  juridical 
individualism — which  makes  the  individual  the 
source  and  end  of  all  law,  ethical  individualism — 
taking  individual  conscience  as  the  supreme 
judge  of  the  validity  of  moral  norms,  sociological 
individualism — emphasizing  the  distance  between 
the  subject  and  the  plurality  of  his  social  func¬ 
tions,  or,  finally,  epistemological  individualism — 
which  makes  the  individual  a  conscious  subject 
distinct  from  his  object. 

This  entry  proceeds  in  four  steps.  The  first  two 
parts  sketch  a  series  of  semantic  histories  that 
reveal  alternative  uses  of  the  concept  of  individual¬ 
ism,  ranging  from  a  pejorative  meaning  to  a  posi¬ 
tive  conception  associated  with  the  defense  of 
autonomy  and  the  spirit  of  individuality.  Both  of 
these  interpretations  continue  to  feed  into  contem¬ 
porary  discussion.  The  next  section  proposes  an 
explanation  of  methodological  individualism  since 
the  concept  has  a  distinct  meaning  when  it  is  used 
in  an  epistemological  sense.  Finally,  the  fourth  part 
examines  the  current  debate  on  the  relationship 
between  political  liberalism  and  individualism. 

Individualism  and  the  Dissolution  of  Society 

As  shown  by  Lukes,  individualism  has  long  carried 
and  still  carries  a  pejorative  connotation,  a  strong 
suggestion  that  to  elevate  the  individual  is  to  harm 
the  superior  interest  of  society.  The  first  uses  of  the 
term  in  the  French  form  individualisme  grew  out 
of  the  general  European  reaction  to  the  French 
Revolution  and  to  its  alleged  intellectual  source, 
Enlightenment  thought.  The  Catholic  counterrevo¬ 
lutionary  Joseph  de  Maistre  might  have  coined  the 
term  when,  in  1820,  he  spoke  of  this  “deep  and 
frightening  division  of  minds,  the  infinite  fragmen¬ 
tation  of  all  doctrines,  political  Protestantism  car¬ 
ried  to  the  most  absolute  individualism.”  It  was 
the  followers  of  Henri  de  Saint-Simon  who  were 
the  first  to  use  the  term  systematically  and  associ¬ 
ate  it  with  the  perceived  evils  of  the  contemporary 
age — namely,  disorder,  antagonism,  atheism,  and 
anarchy.  In  that  sense,  Saint-Simonians  shared  the 
counterrevolutionaries’  critique  of  the  celebration 
of  the  individual  and  their  revulsion  for  social 
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atomization.  However,  unlike  the  counterrevolu¬ 
tionaries,  they  applied  these  ideas  in  a  historically 
progressive  direction:  Social  order  was  to  be  not 
the  ecclesiastical  and  feudal  regime  of  the  past  but 
rather  the  industrial  order  of  the  future.  As  pointed 
out  by  Koenraad  Swart,  these  anti-individualists 
held  that  the  18th  century  had  been  successful  in 
breaking  down  the  traditional  values  but  had 
failed  in  developing  a  new  positive  philosophy.  In 
part  because  of  the  great  influence  of  Saint- 
Simonian  ideas,  individualism  came  to  be  a  widely 
used  term  over  the  course  of  the  19th  century  to 
describe  a  serious  evil  undermining  social  and 
political  order. 

However,  it  was  a  liberal  thinker,  Alexis  de 
Tocqueville  (2009),  who  was  to  give  the  most 
comprehensive  definition  of  individualism.  In 
Democracy  in  America,  he  opposed  the  egoism  of 
our  fathers — “a  passionate  and  exaggerated  love 
of  oneself” — with  individualism, 

which  is  a  considered  and  peaceful  sentiment  that 
disposes  each  citizen  to  isolate  himself  from  the 
mass  of  his  fellows  and  to  withdraw  to  the  side 
with  his  family  and  friends;  so  that  after  thus 
creating  a  small  society  for  his  own  use,  he 
willingly  abandons  the  large  society  to  itself. 
(Book  2,  sec.  2,  chap.  2) 

Above  all,  we  owe  to  Tocqueville  the  idea  that 
individualism  would  be  inextricably  bound  up 
with  democratic  revolution:  “Aristocracy  had 
made  all  citizens  into  a  long  chain  that  went  from 
the  peasant  up  to  the  king;  democracy  breaks  the 
chain  and  sets  each  link  apart”  (Book  2,  sec.  2, 
chap.  2).  Tocqueville  was  also  one  of  the  first  to 
call  attention  to  the  paradox  of  democratic  times: 
the  combination  of  individualization  (in  the  sense 
of  a  withdrawal  into  the  self)  and  conformism  (the 
subjection  of  all  individuals  to  identical  codes  of 
behavior).  While  pointing  out  the  danger  of  isola¬ 
tion  inherent  in  increasing  individualism, 
Tocqueville  therefore  also  took  care  to  counterbal¬ 
ance  this  term  with  a  celebration  of  what  he  saw 
as  its  opposite:  the  spirit  of  individuality. 

In  a  similarly  pejorative  vein,  socialists  such  as 
Pierre  Leroux  have  contrasted  individualism  with 
an  ideal  cooperative  social  order.  Identified  with  the 
twin  concepts  of  anomie  and  egoism,  individualism 
came,  for  these  authors,  to  encapsulate  the  social, 


moral,  and  political  isolation  of  individuals  and 
their  dissolution  from  social  purposes. 

This  usage  of  the  term — associating  individual¬ 
ism  with  a  form  of  conformist  withdrawal  into  the 
self — continues  to  hold  sway  in  contemporary  lit¬ 
erature.  In  The  Lonely  Crowd,  David  Riesman 
thus  described  mass  conformism  by  positing  three 
character  types — tradition-directed,  inner-directed, 
and  other-directed  individuals — linked  to  three 
models  of  society.  The  last  type  relies  on  the  judg¬ 
ment  of  other  individuals,  basing  their  conduct  on 
that  of  those  around  them  and  on  the  model  of 
mass  communication.  This  hypothesis  was  cri¬ 
tiqued  by  Richard  Sennett,  who  argues  for  a  rever¬ 
sal  of  Riesman’s  schema:  Rather  than  moving  from 
an  inner-directed  to  an  outer-directed  model,  the 
West — according  to  Sennett — is  in  transition 
between  a  society  directed  by  others  to  a  society 
ruled  by  the  desires  of  individuals’  inner  life. 
According  to  this  analysis,  the  risk  today  is  that  the 
predominance  of  individual  private  life  might 
eventually  destroy  the  public  sphere. 

This  diagnosis  is  echoed  by  Christopher  Lasch’s 
theory  of  the  narcissistic  individual,  which  notes 
that  society  is  being  taken  over  by  the  “self”  and 
particularly  by  a  new  form  of  therapeutic  sensitiv¬ 
ity  in  which  individuals  are  excessively  preoccu¬ 
pied  with  their  own  health,  safety,  and  well-being. 
According  to  these  thinkers,  the  danger  of  our 
times  is  that  collective  life  might  gradually  wither 
away  with  the  rise  of  the  individual.  In  addition, 
they  hold  that  the  modern  democratic  wave  has 
elevated  the  individual  to  ever  new  heights,  leading 
to  neglect  of  the  collective  effort  that  democracy 
should  properly  involve.  Individualism  is  amal¬ 
gamated  in  this  view  with  a  form  of  social  atom¬ 
ism  believed  to  destroy  all  possibility  of  social 
cohesion,  as  well  as  all  democratic  and  civic  par¬ 
ticipation  in  community  life.  In  a  similar  vein,  a 
number  of  theorists  have  recently  challenged  indi¬ 
vidualism  in  the  name  of  community,  with  its 
appeal  to  rootedness,  belonging,  fraternity,  and 
harmony.  It  is  interesting  to  note  the  high  degree 
of  convergence  between  different  critiques  of  indi¬ 
vidualism  coming  from  disconnected  historical 
periods  and  completely  opposing  political  stances — 
one  need  only  point  out  that  the  critiques  outlined 
above  come  from  thinkers  as  different  as  De 
Maistre,  Karl  Marx,  and  Carl  Schmitt  or  contem¬ 
porary  communitarian  thinkers. 


Individualism  1175 


Autonomy  and  the  Spirit  of  Individuality 

Nevertheless,  individualism  is  not  always  under¬ 
stood  in  the  sense  of  conformist  egocentrism 
described  above.  It  may  also  take  in  the  ideas  of 
a  quest  for  autonomy  and  defense  of  a  spirit  of 
individuality.  Defined  in  terms  of  autonomy ,  indi¬ 
vidualism  is  understood  as  the  conviction  that  no 
natural  subordination  of  the  human  being  is  pos¬ 
sible  and  that  every  individual  must  be  free  to 
decide  for  himself  or  herself  in  the  face  of  moral, 
religious,  and  political  authority.  This  unlimited 
sovereignty  of  the  individual  is  consequently  based 
on  a  conception  of  human  nature  as  being  “indi¬ 
viduated”  before  being  social.  In  other  words, 
every  human  being  is  a  unique  individuality  with 
the  capacity  for  self-definition,  reasoning,  and 
autonomous  decision.  The  thesis  at  the  heart  of 
this  individualism,  originating  in  the  English 
Protestant  tradition  of  the  17th  century — Milton, 
Hobbes,  Locke — and  continuing,  in  different 
incarnations,  in  the  work  of  Rousseau,  Paine, 
Kant,  Jefferson,  or  Madison — is  that  every  one  of 
us  is,  partially  at  least,  independent  of  our  posi¬ 
tioning  in  a  particular  social  context  and  capable 
of  freeing  ourselves  from  it.  Norms  thus  originate 
in  the  individual  and  not  the  group,  an  idea  that 
legitimates  the  right  to  resistance  against  political 
power.  The  conception  of  the  individual  as  a  pre¬ 
social  being  is  not  to  be  taken  in  a  descriptive, 
temporal,  or  historical  sense:  It  would  be  clearly 
absurd  to  deny  that  human  beings  need  a  social 
context  in  order  to  develop.  The  principle  has  a 
merely  normative  meaning,  as  a  thought  experi¬ 
ment  that  allows  (at  least  partial)  independence  of 
the  human  being  from  his  or  her  social  group  to  be 
asserted  as  a  legal  principle. 

The  defense  of  a  form  of  individualism  adapted 
to  democratic  times,  meanwhile,  can  be  found  in 
the  work  of  many  different  authors.  Tocqueville, 
for  instance,  distinguished  two  forms  of  individu¬ 
alism.  The  first  indicates  the  withdrawal  of  indi¬ 
viduals  into  their  own  private  sphere;  the  second, 
however,  introduces  the  idea  that  individuals  may 
be  led,  by  way  of  personal  interest,  to  devote  them¬ 
selves  to  the  public  good.  As  Tocqueville  demon¬ 
strated  in  his  analysis  of  the  rise  of  associations  in 
the  United  States,  “You  first  get  involved  in  the 
general  interest  by  necessity,  and  then  by  choice; 
what  was  calculation  becomes  instinct;  and  by 


working  for  the  good  of  your  fellow  citizens,  you 
finally  acquire  the  habit  and  taste  of  serving  them” 
(Vol.  3,  Pt.  1,  chap.  4a). 

It  is  John  Stuart  Mill  above  all  who  must  be  cred¬ 
ited  with  the  idea,  outlined  in  his  book-manifesto 
On  Liberty  (1859),  that  the  free  development  of 
individuality  is  one  of  the  first  requirements  for  the 
well-being  of  humanity  and  the  condition  for  civili¬ 
zation,  instruction,  education,  and  culture.  Mill 
transforms  the  atomist  notion  of  the  individual  by 
emphasizing  the  collective  advantages  inherent  in 
the  development  of  individuality  and  diverse  points 
of  view:  Pluralism  is  thus  seen  as  indissociable  from 
the  defense  of  the  individual,  since  it  is  the  only  state 
of  affairs  that  allows  every  individual  to  exercise 
truly  free  choice.  Mill  thus  opens  the  way  to  a  form 
of  individualism  that  is  not  only  social  but  also 
encourages  solidarity:  What  he  advocates  is  not  a 
denial  of  the  relationship  between  individual  and 
society  but  instead  an  optimization  of  this  rela¬ 
tionship  to  ensure  the  free  development  of  all 
individualities.  In  the  19th  century,  socialists  such 
as  Louis  Blanc  similarly  saw  individualism  as  a 
major  cultural  principle.  Like  many  other  con¬ 
cepts,  individualism  was  initially  introduced  by  its 
opponents  but  gradually  adopted  by  its  tenants. 
The  disciples  of  Charles  Lourier  denied  any  basic 
opposition  between  individualism  and  socialism 
and  Joseph  Proudhon  openly  claimed  himself  as 
an  individualist.  At  the  end  of  the  century,  Jean 
Jaures  posited  that  socialism  was  the  logical 
completion  of  individualism.  Less  than  a  century 
later,  Lukes  likewise  argued  that  a  humane  form 
of  socialism  would  be  the  only  viable  means  of 
realizing  the  values  of  individualism. 

The  output  of  John  Rawls  can  also  be  situated 
in  the  tradition  of  Mill.  The  principles  formulated 
in  A  Theory  of  Justice  (1971)  are  highly  individu¬ 
alist  in  that  they  are  based  on  an  absolute  priority 
accorded  to  individual  liberty,  which  Rawls  argues 
must  not  be  sacrificed  to  the  broader  well-being  of 
society.  However,  Rawlsian  principles  are  also 
egalitarian;  they  envisage  a  system  of  cooperation 
based  on  an  equitable  distribution  of  opportunities 
between  individuals  and  acknowledge  that  the  suc¬ 
cess  of  justice  in  a  given  society  must  be  judged  on 
the  situation  of  the  least  privileged.  In  a  similar 
vein,  other  authors  have  attempted  to  clarify  the 
conditions  for  individual  autonomy.  The  writings 
of  Zygmunt  Bauman,  for  instance,  emphasize  the 
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gulf  between  the  individual  de  jure  and  de  facto: 
For  individuals  to  be  capable  in  real  terms  of 
choosing  a  life  path  and  acting  autonomously,  the 
lack  of  provisions  for  collective  protection  of  those 
in  insecure  situations  must  be  remedied. 

In  more  recent  times,  other  more  optimistic 
approaches  than  those  outlined  in  the  first  section 
have  been  formulated,  emphasizing  the  link 
between  individualism  and  democracy.  Anthony 
Giddens,  for  instance,  argues  that  the  erosion  of 
traditional  modes  of  coherence  also  generates 
democratic  effects.  The  progress  of  gender  equal¬ 
ity  and  sexual  tolerance  should  encourage  us 
to  relativize  worries  about  individualism,  since 
individualization  should  not  be  confused  with  self¬ 
ishness.  The  crisis  of  the  political  domain  is  thus 
explained  less  by  the  apathy  of  individuals  than  by 
a  mismatch  between  traditional  politics  and  more 
societal,  postmaterialist  aspirations  (environment, 
sexuality,  etc.). 

Methodological  Individualism 

One  should  be  aware  that  individualism  has  a  very 
different  meaning  when  it  occurs  in  the  context  of 
ethics  or  sociology  from  when  it  is  used  in  an  epis¬ 
temological  sense.  In  ethical  terms,  individualism  is 
a  doctrine  that  holds  the  person  to  be  the  supreme 
point  of  reference.  Sociologically  speaking,  we 
define  a  society  as  individualist  if  its  law,  customs, 
and  social  constraints  grant  a  broad  autonomy  to 
individuals.  The  methodological  interpretation  of 
individualism,  meanwhile,  is  completely  distinct 
from  other  forms:  It  is  an  attribute  of  the  researcher 
and  his  or  her  methodology,  not  of  the  object  of 
study.  Elaborated  primarily  by  Karl  Popper  and 
subsequently  systematized  by  authors  such  as 
Raymond  Boudon,  the  theory  of  methodological 
individualism  posits  that  to  explain  a  given  social 
phenomenon — whether  in  the  domain  of  method¬ 
ology,  political  science,  sociology,  or  any  other 
branch  of  social  science — it  is  essential  to  recon¬ 
struct  the  motivations  of  the  individuals  concerned 
by  the  phenomenon  in  question  and  to  understand 
it  as  the  outcome  of  an  aggregation  of  individual 
actions  governed  by  these  motivations.  In  this 
sense,  as  Jon  Elster  has  written,  methodological 
individualism  is  a  form  of  reductionism  since  it 
holds  that  any  social  phenomenon  can  be  explained 
by  the  characteristics  of  the  individuals  who  make 


it  up.  For  Elster,  explanation  always  works  by 
opening  up  “black  boxes,”  and  this  can  only  be 
done  at  the  level  of  interacting  individuals. 

One  should  clear  up  two  common  misunder¬ 
standings.  First,  methodological  individualism  is 
not  the  same  thing  as  an  atomist  concept  of  soci¬ 
ety:  It  may  very  well  demonstrate  that  individual 
actions  have  meaning  only  in  a  specific  social 
context  and  that  individuals  judge,  decide,  and 
act  according  to  the  social  situation  in  which  they 
find  themselves.  As  pointed  out  by  Lukes,  explain¬ 
ing  phenomena  in  terms  of  the  microlevel  does 
not  preclude  reference  to  holistically  character¬ 
ized  causes.  In  another  sense,  however,  it  is  true 
that  methodological  individualism  is  clearly  dis¬ 
tinct  from  methodological  holism  in  that  it  rejects 
any  representation  of  social  structures  as  real  and 
autonomous  collective  beings.  Second,  method¬ 
ological  individualism  does  not  mean  adhering  to 
economic  liberalism  or  ethical  individualism. 
Boudon  and  Elster  demonstrated  that  Marxist 
theory,  for  instance,  often  draws  on  a  typically 
individualist  methodology:  Social  phenomena  are 
analyzed  as  the  consequences  of  aggregation  and 
interference  effects  between  the  actions  of  indi¬ 
viduals  each  pursuing  his  or  her  own  interest. 
One  should  therefore  underline,  as  Birnbaum  and 
Leca  do,  that  employing  an  individualist  method¬ 
ology  is  not  the  same  thing  as  suggesting  that  all 
societal  forms  are  governed  by  individualist  val¬ 
ues.  Conversely,  arguing  that  modern  society  is 
imbued  with  individualist  ideology  does  not  pre¬ 
clude  using  a  holistic  methodology.  There  is 
therefore  no  reason  why  one  might  not  reject 
methodological  individualism  while  remaining  a 
firm  believer  in  the  supreme  importance  of 
respecting  persons  and  valuing  their  autonomy, 
privacy,  and  self-development. 

Individualism  and  Political  Liberalism 

Despite  these  caveats,  methodological  individual¬ 
ism  has  often  been  reduced  to  an  economic  (mon¬ 
etary  or  rational  choice)  model,  elevated  to  the 
status  of  a  theory  that  is  not  merely  explanatory 
but  also  prescriptive  of  a  comprehensive  system  of 
social  mechanisms.  Over  the  past  half-century, 
individualism  has  thus  come  to  encapsulate  a  clus¬ 
ter  of  ideas  propagated  by  libertarians  or  neoliber- 
als.  For  Cold  War  liberal  thinkers  such  as  Friedrich 
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Hayek  or  Ludwig  von  Mises,  individualism  was 
considered  as  a  protective  bulwark  against  the 
dangers  of  ideological  thinking  from  both  right 
and  left. 

Consequently,  authors  such  as  Lukes  now  con¬ 
sider  that  defense  of  individualism’s  core  values 
inherent  in  the  ideals  of  liberty  and  equality  has 
been  hijacked  and  harnessed  to  various  doctrines 
that  embody  libertarian  thinking  and  promote 
market-favoring  policies.  The  argument  suggests 
that  libertarians  have  captured  the  high  moral 
ground  from  which  liberal  critiques  of  the  market 
and  arguments  for  redistribution  can  come  to 
look  concessionary  and  defensive.  Colin  Bird  goes 
a  step  further  in  claiming  that  it  makes  no  sense 
to  clarify  liberal  values  as  individualist  (rather 
than  collectivist)  in  the  first  instance  and  ends  up 
declaring  the  unhelpfulness  of  the  dominant 
idiom  of  individualism  and  even  the  obsolescence 
of  the  label  “liberal  individualism.”  Lukes  also 
views  individualism  as  an  ideological  construct  in 
the  double  sense  that  it  embodies  and  conveys 
illusions  and  that  these  illusions  serve  partisan 
interests. 

However,  it  is  one  thing  to  say  that  liberalism 
need  not  be  identified  with  an  atomistic  model  of 
human  conduct  and  that  there  is  no  single  or  eter¬ 
nal  view  of  the  relation  between  the  individual  and 
society — that  the  liberal  distinction  between  public 
and  private  spheres  is  neither  permanent  nor  unal¬ 
terable  since,  in  Judith  Shklar’s  words,  the  impor¬ 
tant  point  for  liberalism  is  not  so  much  where  the 
line  is  drawn  as  that  it  be  drawn  somewhere,  and 
under  no  circumstances  be  ignored  or  forgotten.  It 
is  quite  another  thing  to  claim  that  the  individual¬ 
ist/collectivist  dichotomy  should  be  discarded  alto¬ 
gether,  as  if  it  were  possible  to  conceive  of  political 
liberalism  without  a  firm  commitment  to  individu¬ 
alism  seen  in  its  fullest  sense. 

Consequently,  some  liberals  argue  that  the  neo¬ 
liberal  approach  is  an  incorrect  interpretation  of 
individualism  and  that  the  liberal  commitment  to 
individuality  has  resources  that  can  be  opposed  to 
the  libertarianism  of  economic  conservatives.  For 
instance,  this  commitment  might  be  reformulated — 
as  Jeremy  Waldron  did  with  the  concept  of 
liberty — as  a  commitment  to  equal  individualism. 
This  would  give  rise  to  a  principle  justifying  social 
policies  that  do  not  infringe  on  the  primacy  of  the 
individual  but  that  rather — quite  the  opposite — aim 


at  creating  the  necessary  conditions  for  the 
full  development  of  individuality  by  all  persons 
concerned. 

In  a  similar  vein,  George  Kateb  argues  that  a 
proper  perspective  should  include  two  types  of  indi¬ 
vidualism.  The  first  is  the  individualism  of  personal 
and  political  rights;  the  second  is  democratic  indi¬ 
viduality.  These  two  types  of  individualism  are 
closely  connected  since  democratic  individuality  is, 
according  to  Kateb,  the  best  achievement  that  can 
grow  out  of  a  culture  in  which  personal  and  political 
rights  are  recognized.  Put  differently,  democratic 
individuality  could  not  exist  without  rights-based 
individualism,  and  any  attack  on  the  centrality  of 
rights  should  be  considered  as  an  attack  on  demo¬ 
cratic  individuality. 
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Inequality  is  a  complex  multidimensional  phe¬ 
nomenon,  depending  on  many  factors  and  lending 
itself  to  a  great  variety  of  explanations  and  evalu¬ 
ations.  The  existence,  shape,  and  origins  of 
inequalities  are  significant  both  from  a  positive 
perspective,  owing  to  the  fact  that  the  economic, 
political,  and  psychological  attitudes  and  behavior 
of  individuals  or  groups  are  affected  by  the  rela¬ 
tive  position  they  occupy  in  the  distribution  of 
meaningful  resources,  and  from  a  normative  one, 
as  a  consequence  of  the  fact  that  in  general  a  situ¬ 
ation  characterized  by  inequalities  compares  unfa¬ 
vorably  with  one  of  equality  or  with  lesser 
inequalities.  In  this  entry,  a  brief  overall  survey  is 
provided  that  can  be  summarized  in  the  following 
questions:  Inequality  of  what?  How  large  are  the 
inequalities?  Inequality  among  whom?  Inequalities 
due  to  which  causes? 

Inequality  of  What  and  How  Large? 

The  qualification  of  inequalities  as  economic  lends 
itself  to  various  interpretations:  In  a  narrow  sense, 
it  regards  the  differences  in  the  levels  of  personal 
income  or  wealth;  in  a  broader  sense,  it  includes  all 
the  values  that  directly  or  indirectly  derive  from 
economic  activities,  which  can  be  used  in  obtain¬ 
ing  them  or  can  be  exchanged  with  them.  In  the 
first  case,  its  interpretation  is  straightforward,  in 
the  second,  it  requires  the  integration  of  different 
systems  of  inequality  that  include,  besides  income 
and  wealth,  elements  such  as  health,  knowledge, 
power,  or  availability  of  public  services,  and  so  on, 
whose  distribution  among  individuals  does  not 
coincide  with  that  of  income.  In  the  case  of  income 


and  wealth,  the  size  of  the  inequalities  can  be 
determined  in  terms  of  monetary  units  and  in  ways 
that  are  sufficiently  precise,  although  not  stable 
and  exposed  to  a  certain  margin  of  error.  For  other 
aspects  that  can  be  broadly  termed  economic,  this 
is  not  possible,  and  the  differences  between  the 
positions  can  be  described  only  in  ordinal  terms  of 
larger  and  smaller  (possibly  with  qualifications 
such  as  much  larger  or  much  smaller);  in  still  other 
cases,  even  this  is  impossible,  and  one  is  limited  to 
an  incomplete  or  partial  ordering.  The  measuring 
of  inequalities  raises  problems  even  in  the  case  of 
items  that  are  directly  expressed  in  monetary 
terms:  In  international  comparisons,  the  per  capita 
incomes  of  different  countries  are  usually  trans¬ 
lated  into  a  single  currency  through  purchasing 
power  exchange  rates,  whose  value  is  affected  by 
the  choice  of  national  baskets  of  representative 
consumption  goods.  Even  more  serious  are  the 
problems  arising  from  the  presence  of  externalities 
and  of  goods  whose  production  counts  as  income 
when  they  are  sold  in  the  market  but  not  when 
they  are  consumed  by  the  producer  or  within  his 
or  her  family,  an  occurrence  that  is  usually  more 
frequent  in  poorer  countries  or  among  lower 
income  groups.  To  face  these  problems,  a  number 
of  alternative  and  broader  notions  of  economic 
welfare  have  been  suggested,  among  them  that  of 
net  economic  welfare,  proposed  by  Joseph  Stiglitz, 
which  adjusts  gross  domestic  product  (GDP)  by 
subtracting  from  it  negative  factors  (“bads”)  such 
as  pollution  and  by  adding  the  contribution  of 
beneficial  nonmarket  activities  such  as  leisure, 
household  production,  child  care,  or  looking  after 
sick  people. 

There  are  good  reasons  both  for  treating  the 
inequalities  existing  in  the  various  spheres  sepa¬ 
rately  and  for  pooling  them  in  a  single  comprehen¬ 
sive  notion  of  economic  inequality.  The  reason  for 
treating  them  separately  is  that  the  pattern  of 
inequalities  of  one  sphere  does  not  coincide  with 
those  of  others;  that  their  economic  and  political 
consequences  as  well  as  the  moral  criteria  for 
evaluating  them  are  different,  according  to  Michael 
Walzer,  and  that  the  instruments  available  for 
redressing  them  are  usually  specific  to  each  sphere 
(redistributing  income  has  no  immediate  impact  on 
inequalities  in  health,  education,  or  power).  The 
reasons  for  pooling  them  is  that  the  shortcomings 
in  one  sphere  can,  at  least  partly,  be  compensated 
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for  by  advantages  in  another  and  that  the  overall 
condition  of  people  depends  on  the  positions  they 
have  in  each  of  the  relevant  ones. 

There  have  been  many  attempts  at  finding  crite¬ 
ria  for  combining  the  individual  positions  in  a 
number  of  separate  spheres  to  form  an  overall 
indicator  of  their  standing  both  in  absolute  terms 
and  relative  to  each  other.  One  of  the  most  rele¬ 
vant  of  these  is  the  one  proposed  by  Amartya  Sen 
(1993)  with  the  term  capabilities ,  defined  as  “com¬ 
binations  of  functionings  a  person  can  achieve, 
and  from  which  he  or  she  can  choose  one  collec¬ 
tion”  (p.  31),  which  represents  an  all-encompassing 
notion  of  what  has  value  for  an  individual  and 
covers  the  whole  range  of  possible  valuable  life 
experiences  that  are  within  reach  of  a  person 
(including  alternatives  that  are  mutually  exclu¬ 
sive).  The  solution  Sen  put  forward  can  be  applied 
both  to  individuals  and  to  groups  or  whole  popu¬ 
lations;  in  this  second  form  it  has  inspired  the 
adoption  by  the  United  Nations  of  the  Human 
Development  Index  (HDI),  as  a  benchmark  for  the 
evaluation  of  social  policies  and  policies  for  devel¬ 
opment,  an  index  that  is  calculated  for  each  coun¬ 
try  on  the  basis  of  data  concerning  income  per 
head,  the  expectation  of  life  at  birth,  and  the  level 
of  education.  For  each  of  these  variables,  each 
country  is  given  a  score  corresponding  to  the  point 
in  which  it  is  placed  in  the  interval  that  separates 
the  position  of  the  country  that  ranks  first  from 
that  of  the  country  that  ranks  last,  the  general 
index  being  formed  by  the  simple  average  of  the 
indexes  concerning  the  individual  variables. 

Inequality  Among  Whom? 

A  system  of  inequalities  is  defined 

1.  by  the  choice  of  the  particular  sphere,  or  groups 
of  spheres,  to  be  considered; 

2.  by  the  identification  of  the  population  about 
which  the  comparisons  are  made  and  of  the 
units  within  it  whose  positions  are  compared; 
and 

3.  by  the  specification  of  the  moment  or  span  of 
time  that  is  taken  into  consideration. 

For  the  sake  of  simplicity,  we  shall  limit  ourselves 
to  consider  a  single  sphere  of  value  (that  of  income) 


and  to  a  static  approach,  considering  a  single 
moment  in  time.  The  definition  of  the  population 
for  which  inequalities  are  assessed  requires  on  the 
one  hand  that  its  members  have  a  sufficient  num¬ 
ber  of  elements  in  common  to  make  it  relevant, 
and  on  the  other  that  they  present  sufficient  differ¬ 
ences  to  make  it  interesting;  both  conditions 
depend  on  the  nature  of  the  problems  examined. 

The  most  frequent  cases  are  those  in  which  the 
population  is  defined  on  a  national  level,  but 
Branko  Milanovic  notes  that  an  increasing  num¬ 
ber  of  studies  make  reference  to  the  broadest  pos¬ 
sible  definition  of  the  population,  which  of  course 
corresponds  to  the  whole  of  humanity.  At  the 
other  extreme  are  populations  that  are  formed  by 
specific  groups  (such  as  the  workforce  of  a  single 
factory  or  the  members  of  a  single  union),  local 
communities,  or  even  single  families.  As  a  general 
(but  by  no  means  universal)  rule,  we  can  say  that 
the  size  of  the  inequalities  increases  with  the  size  of 
the  population  but  that  its  significance  can  move  in 
the  opposite  direction,  because  individuals  or 
groups  living  close  together  or  having  much  in 
common  tend  to  have  a  sharper  perception  of 
what  makes  them  different. 

We  have  a  disaggregated  representation  of 
inequalities  when  the  population  is  seen  as  formed 
by  single  individuals  and  an  aggregate  one  when  it 
is  seen  as  formed  by  groups  of  individuals  having 
in  common  certain  characteristics  such  as  gender, 
age,  living  in  the  same  area,  working  in  the  same 
sector,  having  a  similar  professional  qualification, 
belonging  to  the  same  ethnic  group,  or  even 
belonging  to  the  same  bracket  or  fraction  (decile, 
quintile,  etc.)  in  the  distribution  of  income.  The 
case  of  families  is  in  some  ways  ambiguous  in  that 
they  can  be  considered  both  as  elementary  units 
and  as  groups.  Aggregation  is  of  course  a  matter  of 
degree  insofar  as  any  group  can  be  split  into  a 
number  of  subgroups  or  lumped  together  with  oth¬ 
ers  to  form  a  wider  group.  When  the  aggregate 
approach  is  adopted,  the  systems  of  inequalities 
present  both  an  external  and  an  internal  side:  In 
the  first  case,  the  position  of  a  group  is  considered 
in  terms  of  a  representative  figure  or  parameter 
(typically  the  average  or  the  median  value)  and 
compared  with  that  of  other  groups;  in  the  second, 
comparison  is  made  of  the  position  of  each  single 
individual  or  subgroup  with  those  of  other  indi¬ 
viduals  or  subgroups  that  make  up  the  larger  one. 
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The  aggregate  and  disaggregate  approaches  are 
more  often  complementary  than  alternative,  given 
that  the  position  of  single  individuals  in  the  larger 
populations  is  usually  the  joint  result  of  factors 
affecting  the  position  of  the  group  to  which  they 
belong  compared  with  that  of  other  groups  and  of 
factors  affecting  their  personal  position  in  the  for¬ 
mer.  The  choice  of  an  aggregate  or  disaggregate 
approach  is  far  from  neutral  and  can  have  relevant 
ethical  and  political  implications:  Presenting  world 
income  inequalities  in  aggregate  terms  through  a 
comparison  of  the  per  capita  incomes  of  different 
countries  leads  naturally  to  development  policies 
that  work  through  transfers  of  resources  to  govern¬ 
ments;  more  disaggregate  approaches  lead  to  poli¬ 
cies  that  are  directly  targeted  to  individuals  or  local 
communities.  Similarly,  policies  of  affirmative 
action  addressed  to  disadvantaged  groups  (like 
women  or  ethnic  minorities)  often  take  the  forms  of 
reserving  to  members  of  those  groups  coveted  posi¬ 
tions  or  privileged  conditions  for  obtaining  them, 
measures  that  do  in  fact  improve  the  average  posi¬ 
tion  of  the  targeted  group  but  tend  to  increase  the 
inequalities  within  it  and  often  do  little  to  improve 
the  conditions  of  its  most  disadvantaged  members. 

Another  meaningful  distinction,  according  to 
Larry  Temkin,  is  that  between  analytical  and  syn¬ 
thetic  representations  of  inequalities.  In  the  first 
case,  we  have  a  set  of  separate  data  regarding  the 
differences  between  the  position  of  every  single  ele¬ 
ment  (individual  or  group)  into  which  the  popula¬ 
tion  has  been  divided  and  that  of  each  of  the  others; 
for  a  population  of  N  elements,  this  representation 
takes  the  form  of  an  N  X  N  symmetrical  matrix. 
The  synthetic  versions  take  the  form  of  a  single 
numerical  indicator,  whose  value  reflects  the  disper¬ 
sion  of  the  positions  of  the  various  elements,  or 
more  exactly  the  extent  to  which  the  distribution  of 
values  within  the  population  diverges  from  equal¬ 
ity.  The  most  well-known  and  frequently  used  mea¬ 
sure  is  the  Gini  index,  which  is  especially  sensitive 
to  the  variations  of  the  extreme  positions  and  takes 
on  a  zero  value  in  the  case  of  full  equality  and  a 
value  of  1  in  the  case  of  maximum  inequality,  when 
all  resources  belong  to  just  one  individual  or  group. 

Synthetic  representations  may  be  thought  of  as 
the  expression  of  inequalities  in  the  singular, 
describing  properties  of  entire  populations  or  col¬ 
lective  entities:  analytical  representations,  on  the 
other  hand,  as  expressions  of  inequality  in  the 


plural,  describing  the  relative  positions  of  different 
individuals  or  groups.  Analytical  representations 
are  closer  to  what  might  be  called  the  direct  per¬ 
ception  of  inequalities,  that  is,  the  actual  condi¬ 
tions  of  individuals  and  the  way  they  compare 
themselves  with  other  individuals:  Richard 
Wilkinson  and  Kate  Pickett,  for  example,  have 
shown  that  individuals  are  often  more  concerned 
with  how  much  more  or  less  than  others  they  earn 
than  with  the  absolute  level  of  their  income  and 
that  less  wealth  can  be  as  painful  (or  even  more 
painful)  than  actual  poverty.  The  empirical  descrip¬ 
tions  of  systems  of  inequalities  combine  in  general 
synthetic  and  analytical  aspects,  as  well  as  aggre¬ 
gate  and  disaggregate  ones. 

The  systems  of  inequalities  characterizing  a  given 
population  can  show  at  the  same  time  stability  and 
variability:  stability,  as  the  pattern  of  relative  posi¬ 
tions  remains  basically  the  same,  and  variability,  as 
the  individuals  occupying  the  different  positions 
keep  changing  all  the  time.  A  high  degree  of  vari¬ 
ability  or  mobility  tends  to  dampen  significantly  the 
economic,  social,  political,  and  psychological 
impact  of  any  given  system  of  inequalities. 

The  Causes  of  Inequalities 

The  factors  that  give  shape  to  a  raw  system  of  eco¬ 
nomic  inequalities  among  individuals,  by  which  is 
meant  those  that  precede  any  corrective  interven¬ 
tion  by  the  state  or  other  authorities,  normally 
operate  within  a  general  competitive  framework 
and  can  be  grouped  into  three  broad  categories: 

1.  macrofactors,  which  determine  the  relative 
levels  of  average  incomes  of  whole  communities 
(typically  countries); 

2.  personal  factors,  which  determine  the  relative 
positions  that  individuals  living  in  the  same 
country  can  attain  through  their  choices  and 
actions;  and 

3.  microfactors,  which  correspond  to  gaps  in  the 
general  framework  and  provide  privileged 
conditions  for  particular  organizations  or 
individuals  within  them. 

The  individual  positions  in  a  given  system  of 
inequalities  are  to  a  varying  degree  affected  by 
each  of  the  aforementioned  factors. 
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The  general  framework  is  that  of  a  competitive 
capitalist  market  economy  in  which  the  income 
people  earn  broadly  corresponds  to  the  contribu¬ 
tion  they  have  given  to  marketable  production 
through  their  work  and  the  resources  they  have 
made  available.  The  rule  applies  fully  and  directly 
only  to  units  (persons  or  firms)  that  sell  their  prod¬ 
ucts  in  the  market  and  are  motivated  by  the  maxi¬ 
mization  of  income  (or  profit,  in  the  case  of  firms). 
When  production  is  organized  by  public  bodies,  or 
by  nonprofit  organizations,  the  rules  are  different, 
but  still  subject  to  the  indirect  influence  of  those  of 
the  capitalist  sector,  owing  to  the  fact  that  indi¬ 
viduals  and  resources  can  move  or  be  transferred 
from  one  to  the  other.  Before  reaching  individuals, 
the  flows  of  income  (mainly  capital  income  but  to 
a  certain  extent  even  incomes  from  labor)  deriving 
from  the  sale  of  production  can  be  partially  modi¬ 
fied  by  intermediate  actors  (corporations,  financial 
institutions,  insurance  companies,  families,  etc.), 
which  pool  different  flows  and  redistribute  them 
among  their  members  or  associates. 

The  macrofactors  are  those  that  affect  the  posi¬ 
tion  of  a  country  in  the  world  economy;  they 
include  the  nature  of  the  goods  it  produces,  their 
prices,  the  efficiency  of  its  firms  compared  with 
those  of  other  countries,  and  its  currency’s 
exchange  rate;  they  play  a  crucial  role  in  determin¬ 
ing  the  relative  level  of  its  per  capita  income  and, 
owing  to  their  frequent  fluctuations,  its  variation 
through  time.  Each  of  them  has  a  direct  positive 
effect  on  relative  per  capita  income  that  is  some¬ 
times  accompanied  by  indirect  negative  ones;  an 
appreciation  of  the  exchange  rate,  for  instance, 
reduces  the  competitiveness  of  national  firms, 
making  the  goods  they  produce  more  expensive  in 
terms  of  foreign  currencies. 

The  personal  factors  are  those  that  determine  the 
precise  position  of  individuals  and  the  roles  they 
perform  in  the  economic  system  of  the  country  they 
live  in;  these  include  their  natural  gifts  (in  terms  of 
intelligence,  health,  and  physical  dexterity),  the 
social  milieu  in  which  they  were  born  and  grew  up, 
the  choices  they  made  regarding  their  development 
and  during  their  working  life,  how  much  they 
worked  and  the  effort  and  commitment  they  put  in, 
how  much  they  saved,  the  risks  they  took,  and  the 
luck  they  had.  Mainstream  explanations  of  income 
inequalities  among  persons  living  in  the  same  coun¬ 
try  see  them  as  arising  from  competitive  market 


interactions  of  individuals  having  different  personal 
characteristics. 

The  microfactors  are  mainly  sociological  and 
usually  “local,”  in  the  sense  of  having  a  direct 
influence  on  specific  sectors  or  locations;  they 
reflect  the  structure  of  organizations  (firms,  unions, 
etc. )  or  networks  within  which  and  between  which 
elements  of  power,  monopoly,  and  discrimination 
play  a  significant  role:  They  typically  offer  to  insid¬ 
ers  privileged  channels  of  access  to  valuable 
resources  from  which  outsiders  are  excluded  or 
that  they  face  obstacles  in  accessing.  The  inequali¬ 
ties  they  produce  are  sheltered  from  the  competi¬ 
tive  forces  that  are  characteristic  of  the  basic 
framework  and  have  a  “durable”  nature  insofar  as 
they  affect  individuals’  conditions  throughout  their 
lives  and  are  often  transmitted  vertically  through 
generations  and  horizontally  among  persons  who 
share  some  easily  recognizable  and  permanent 
traits  (such  as  gender,  ethnic  origin,  or  faith). 
These  microfactors  can  be  grouped  into  the  three 
broad  categories  of  hierarchization,  exclusion,  and 
exploitation,  which  perform  different  but  often 
complementary  roles,  according  to  Charles  Tilly. 

Hierarchization  has  been  defined  by  Goran 
Therborn  (2006)  as  “institutionalized  ranking  of 
social  actors”:  The  subjects  occupying  the  higher 
ranking  positions,  usually  limited  in  number,  have 
at  their  disposal  a  variety  of  resources  (finances, 
prestige,  power,  etc.),  the  access  to  which  is  subject 
to  their  control  and  approval,  or  from  which  those 
of  lower  rank  are  excluded;  inequalities  of  this 
kind  are  generally  codified  by  rules  that  are  both 
formal  and  stable  in  time,  and  sometimes  transmit¬ 
ted  to  designated  heirs. 

Exclusion  occurs  when  access  to  positions  asso¬ 
ciated  with  wealth  and  prestige  is  barred  to  sub¬ 
jects  who  would  otherwise  qualify,  on  the  basis  of 
characteristics  such  as  gender,  race,  or  religion, 
which  are  easily  ascertained  by  the  insiders  and 
cannot  easily  be  modified  by  the  excluded. 

Exploitation  has  been  examined  from  various 
points  of  view:  the  Marxist  one,  for  which  every 
relationship  based  on  the  separation  between 
workers  and  the  means  of  production  involves 
exploitation,  is  both  too  broad  and  insufficiently 
comprehensive;  more  limited  versions  refer  to  the 
presence  of  elements  of  monopoly  and/or  of  coer¬ 
cion,  which  alter  the  distribution  of  the  fruits  of 
labor  to  the  advantage  of  those  who  control  them. 
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According  to  Tilly’s  version,  similar  to  Marx’s  but 
more  sociological  and  not  limited  to  relationships 
between  capital  and  labor,  exploitation  “operates 
when  powerful,  connected  people  command 
resources  from  which  they  draw  significantly 
increased  returns  by  coordinating  the  effort  of  out¬ 
siders  whom  they  exclude  from  the  full  value  added 
by  that  effort”  (Durable  Inequality,  p.  10).  Common 
to  all  these  versions  is  the  idea  of  a  systematic  and 
long-lasting  distortion  of  relationships  in  favor  of 
the  exploiter,  which  (a)  could  be  removed  without 
significant  loss  of  efficiency  and  (b)  is  accompanied 
by  forms  of  “categorical  inequality,”  that  is,  by  the 
attribution  to  the  exploited  and  exploiter  of  distinc¬ 
tive  characteristics  (in  terms  of  race,  gender,  cul¬ 
ture,  or  class)  that  tend  to  both  consolidate  and 
justify  the  exploitation.  Alongside  exploitation,  and 
sharing  some  of  its  traits,  Tilly  proposes  the  cate¬ 
gory  of  “opportunity  hoarding,”  which  arises  when 
groups  of  people  sharing  some  recognizable  charac¬ 
teristic  establish  preferential  relationships  that 
ensure  them  the  control  of  resources,  which  they 
take  advantage  of  to  the  exclusion  of  outsiders  but 
without  exploiting  the  latter. 
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Political  inequality  is  the  unequal  influence  of  the 
citizens  on  the  collective  decisions.  It  stems  from  a 
situation  in  which  one  person  or  one  group  within 
a  community  differs  in  its  impact  on  the  decisions 
that  affect  the  whole  community.  Defined  as  an 
antonym,  political  inequality  is  the  absence  of 
political  equality,  a  fundamental  attribute  of  democ¬ 
racy.  The  principle  of  political  equality  departs 
from  the  idea  that  decisions  can  only  be  adopted  by 
the  persons  who  are  subject  to  them,  and  that  all 
preferences  must  be  taken  into  account  before 
deciding.  From  the  point  of  view  of  the  quality  of 
democracy,  addressing  the  levels  of  political  inequal¬ 
ity  in  a  democracy  is  therefore  essential.  There  are, 
however,  two  contradictory  conceptions  of  political 
inequality,  which  frame  differently  the  study  of 
democratic  quality.  After  a  brief  contextualization, 
this  entry  reconsiders  these  two  main  conceptions 
of  political  inequality  and  underlines  the  principal 
implications  for  the  quality  of  democracy. 

Discussions  on  the  concept  of  political  equality 
are  intrinsically  linked  to  democratic  discourse.  As 
such,  the  scope  of  political  equality  has  gradually 
been  changed,  in  parallel  to  the  development  of 
Western  democracies.  In  the  early  19th  century,  few 
men  and  almost  no  women  were  usually  entitled  to 
participate  in  the  political  sphere.  Political  equality 
existed  but  only  for  a  reduced  group  of  men  from 
the  high  socioeconomic  strata.  It  was  not  put  into 
question,  however,  that  suffrage  should  be  extended 
to  women,  slaves,  or  the  lower  social  classes.  Only 
late  in  the  19th  century  was  universal  suffrage 
steadily  introduced  in  most  democracies  and  politi¬ 
cal  rights  expanded  to  the  whole  population.  The 
new  status  of  citizenship  enabled  all  citizens  to  be 
political  equal.  Although  by  the  end  of  the  20th 
century  most  societies  labeled  as  democracies  had 
incorporated  universal  suffrage  as  a  basic  norm,  the 
ideal  of  political  equality  is  far  from  being  realized 
in  many  countries,  and  there  are  significant  differ¬ 
ences  across  them.  Political  inequality  remains  for 
that  reason  a  major  field  of  research. 

Formal  Versus  Substantial  Inequality 

Further  development  of  the  liberal  and  republican 
theories  resulted  in  two  competing  understandings 
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of  political  inequality,  which  roughly  reflect  the  dis¬ 
tinction  between  formal  and  substantial  inequality. 

Formal  Inequality 

For  formalists,  universal  suffrage  and  universal 
entitlement  to  participate  in  public  matters  are  the 
basic  features  of  political  equality — a  set  of  rights 
that  allows  citizens  to  equally  influence  collective 
decisions.  Political  equality  is  legally  and  formally 
guaranteed  through  the  existence  of  these  rights.  In 
most  democracies,  indeed,  the  law  endorses  all  citi¬ 
zens  with  the  same  rights  and  the  same  opportuni¬ 
ties  to  participate  in  political  decisions,  and  public 
institutions  and  administrations  are  prevented  to 
avoid  any  type  of  discrimination  against  any  of  the 
citizens.  Following  this  conception,  political  equal¬ 
ity  is  equivalent  to  liberty,  as  the  freedom  to  choose 
whether  to  participate  or  not.  Once  political  equal¬ 
ity  is  formally  established,  citizens  might  freely 
decide  to  become  politically  unequal.  Freedom 
provides  in  fact  with  the  opportunity  to  become 
unequal.  As  a  logical  result,  freedom  is  only  possi¬ 
ble  if  there  is  political  inequality;  in  other  words, 
political  inequality  is  a  reflection  of  the  citizen’s 
decisions,  an  expression  of  their  liberty.  From  this 
theoretical  perspective,  the  principle  of  freedom  is 
clearly  superior  to  the  equality  principle. 

Substantial  Inequality 

From  a  drastically  different  perspective,  politi¬ 
cal  inequality  basically  refers  to  the  unequal  distri¬ 
bution  of  political  resources  among  the  citizens.  It 
comprehends  not  only  formal  equality  but  also 
substantive  equality.  Hence,  formal  equality  is  not 
a  sufficient  condition  in  a  democracy,  as  its  exis¬ 
tence  does  not  guarantee  effective  influence  in  the 
collective  decisions  of  all  the  citizens.  Unequal 
distribution  of  political  resources — based  on  dif¬ 
ferences  of  income,  education,  skills,  gender,  race, 
and  so  on — undermines  political  equality;  those 
who  collect  more  political  resources  have  a  greater 
capacity  to  affect  political  decisions.  Political 
inequality  in  this  second  meaning  is  inextricably 
linked  to  social  inequality  and  social  stratification. 
Because  there  normally  is  correspondence  between 
a  citizen’s  position  in  the  stratification  scale  and 
his  or  her  capacity  to  influence  politics,  there  is  an 
overlap  of  the  concept  of  political  inequality  with 


social  inequality.  As  an  attempt  to  delimitate  the 
former  from  the  later,  political  inequality  has  been 
defined  as  the  sum  of  three  components:  participa¬ 
tory  inequality,  unequal  responsiveness,  and  unequal 
policy  delivery.  Participatory  inequality  is  unequal 
political  participation  among  the  citizens,  which 
might  be  caused  by  unequal  rights  to  participate 
(formal  equality),  unequal  capacity  to  participate 
(unequal  distribution  of  resources),  or  unequal 
disposition  to  participate.  Unequal  responsiveness 
is  uneven  attention  by  the  government,  biased 
because  of  economic,  ideological,  or  political 
factors.  Unequal  policy  delivery  is  asymmetrical 
redistribution  of  favorable  policies  among  the  citi¬ 
zens,  benefiting  some  groups  more  than  others. 

Implications  for  the  Quality  of  Democracy 

Addressing  political  inequality  from  either  a  formal 
or  a  substantive  point  of  view  has  important  impli¬ 
cations  with  respect  to  the  quality  of  democracy, 
regarding  both  how  it  should  be  measured  and  what 
goals  should  be  attained  through  political  equality. 

Measures  of  formal  political  equality  are  based 
on  objective  indicators  of  the  available  rights  and 
freedoms  in  a  given  society.  Higher  democratic 
standards  can  be  achieved  if  there  exist  more  alter¬ 
natives  to  participate  in  collective  decisions,  if 
there  are  more  freedoms  and  rights  that  citizens 
can  enjoy,  and  if  these  rights  and  freedoms  are 
more  inclusive.  Most  democracies  qualify  high  in 
this  dimension,  which  is  an  indicator  that  formal 
equality  is  broadly  extended. 

However,  a  measure  of  the  quality  of  democ¬ 
racy  in  terms  of  formal  political  equality  has  an 
important  limitation  with  respect  to  political  par¬ 
ticipation.  There  may  be  citizens  who  would  like  to 
participate  in  political  decisions,  who  have  the  will 
to  participate  and  freely  decide  to  do  so,  but  who 
lack  the  necessary  resources  to  do  so — they  may 
not  know  how  to  participate,  they  may  not  have 
the  time,  and  so  on.  A  country  may  have  a  high 
degree  of  formal  political  equality,  but  there  may 
be  no  guarantee  that  all  citizens  who  wish  to  influ¬ 
ence  politics  can  effectively  participate.  There  must 
be  a  minimal  standard  of  life  to  ensure  that  all 
citizens  are  able  to  make  their  democratic  choices. 
Restricting  political  inequality  to  a  formal  concep¬ 
tion  might  therefore  provide  a  partial  picture  of 
democratic  quality. 
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When  the  substantive  dimension  is  added  to  the 
definition  of  political  inequality,  measures  of  the 
quality  of  democracy  are  confronted  by  further 
critical  dilemmas.  Assessing  levels  of  substantive 
political  inequality  is  an  extremely  complex  task,  as 
measures  of  political  inequality  rely  most  of  the  time 
on  subjective  interpretations,  which  make  them 
more  difficult  to  control.  Two  indicators  have  gen¬ 
erally  been  used  to  measure  substantive  inequality: 
levels  of  political  participation  (turnout,  number  of 
people  working  for  a  party,  number  of  people  who 
demonstrate,  etc.)  and  political  concentration  (the 
extent  to  which  office  holding  is  monopolized  by  a 
group  of  individuals).  Contrary  to  the  measure  of 
formal  political  inequality,  though,  many  countries 
with  relatively  high  ranks  of  formal  equality  rank 
relatively  low  on  substantive  political  equality,  with 
low  levels  of  participation  and  high  levels  of  politi¬ 
cal  concentration.  Formal  equality  does  not  in  fact 
guarantee  substantive  equality. 

Nevertheless,  increasing  substantive  equality 
might  also  have  a  negative  impact  on  the  quality  of 
democracy.  Two  main  arguments  have  been  devel¬ 
oped  against  an  increase  in  effective  participation. 
First,  the  quality  of  the  decisions  is  questioned.  If 
all  citizens  are  obliged  to  participate  in  the  deci¬ 
sion-making  process,  those  with  little  interest  in 
politics  and  low  levels  of  education  will  be  entitled 
to  a  voice.  It  is  not  certain,  then,  that  they  will  be 
able  to  choose  what  is  best  for  the  whole  commu¬ 
nity.  A  second  concern  is  that  increasing  equality 
might  endanger  a  basic  principle  of  democracy: 
freedom.  There  are  two  possible  ways  to  increase 
equality:  Either  citizens’  socioeconomic  status  is 
equalized  to  avoid  inequality  based  on  unfair  distri¬ 
bution  of  political  resources,  or  citizens  are  obliged 
to  participate.  Although  empirical  studies  are  not 
always  consistent  in  their  results,  the  first  option 
seems  unfeasible  in  a  reasonable  period  of  time. 
The  only  sound  option  for  increasing  participation, 
therefore,  is  to  make  participation  compulsory.  Yet 
compulsory  participation  directly  damages  free¬ 
dom,  because  it  constrains  individual  liberties.  In 
this  situation,  citizens  do  not  freely  decide  whether 
or  not  to  participate,  but  they  are  obliged  to  do  so 
by  the  state;  put  differently,  political  participation — 
an  act  that  is  meant  to  be  the  expression  of  maxi¬ 
mum  liberty — is  transformed  into  an  obligation. 
Compulsory  participation  is  hardly  reconcilable 
with  freedom,  as  it  is  extremely  difficult  to  find  the 


limits  that  do  not  harm  individuals’  liberties.  There 
is,  therefore,  an  unsolvable  contradiction  between 
increasing  equality  and  ensuring  freedom. 

Inequality,  Citizenship,  and  Immigration 

A  final  issue  concerning  political  inequality  deals 
with  the  concept  of  citizenship  and,  as  an  exten¬ 
sion,  with  immigration.  Whether  from  a  formal  or 
substantive  perspective,  political  inequality  is  at 
the  core  of  the  debate  on  citizenship.  First,  the 
status  of  citizenship  is  required  to  enjoy  political 
equality.  Political  rights  and  freedoms  are  guaran¬ 
teed  only  to  the  citizens,  those  who  belong  to  the 
community  and  are  legally  recognized  as  such. 
Although  substantive  political  equality  cannot  be 
ensured  by  the  state,  in  contemporary  democracies 
all  people  who  have  full  citizenship  formally 
should  enjoy  the  same  rights  and  freedoms.  Second, 
and  as  a  consequence,  because  immigrants  do  not 
have  full  citizenship,  they  constitute  a  specific, 
sometimes  very  large,  group  in  the  community  that 
lacks  basic  political  rights,  such  as  the  right  to 
vote.  While  much  discussion  has  been  done  on  the 
extent  to  which  political  rights  should  be  expanded 
to  immigrants — that  is,  on  the  inclusiveness  of  the 
political  rights — the  right  to  vote  is  normally 
restricted  to  citizens.  Formal  inequality  is  therefore 
a  feature  of  many  democracies.  Indeed,  even  if 
immigrants  belong  to  the  political  community, 
they  are  not  entitled  to  the  status  of  citizens. 
Accordingly,  and  from  the  point  of  view  of  the 
quality  of  democracy,  political  inequality  in  a 
democracy  may  be  addressed  from  three  alterna¬ 
tive  outlooks:  (1)  Is  there  formal  equality  for  citi¬ 
zens?  (2)  Is  there  formal  equality  for  citizens  and 
noncitizens?  (3)  Is  there  both  formal  and  substan¬ 
tive  equality? 

Monica  Perrin 
European  University  Institute 

Florence,  Italy 

See  also  Citizenship;  Democracy,  Quality;  Equality, 
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The  ecological  inference  problem  is  a  long-stand¬ 
ing  problem  that  encompasses  a  rich  set  of  intrigu¬ 
ing  puzzles.  Scholars  with  diverse  backgrounds 
and  interests  have  a  stake  in  approaches  to  eco¬ 
logical  inference  problems,  which  appear  as  fre¬ 
quently  in  political  science  as  they  do  in  medicine, 
geography,  economics,  or  sociology.  The  problem 
occurs,  for  instance,  when  one  is  interested  in  the 
behavior  of  individuals,  but  has  data  only  at  an 
aggregated  level  (e.g.,  precincts,  hospital  wards, 
counties).  In  other  words,  a  data  limitation  creates 
a  situation  where  the  behavior  of  individuals  must 
be  surmised  from  data  on  aggregated  sets  of  indi¬ 
viduals  rather  than  on  individuals  themselves. 
Since  the  goal  is  to  make  inferences  from  aggre¬ 
gate  units  that  are  often  derived  from  an  “envi¬ 
ronmental  level”  (i.e.,  geographical/ecological 
units  such  as  a  county  or  precinct),  the  term  eco¬ 
logical  inference  is  used  to  describe  this  type  of 
analysis.  More  generally,  the  problem  manifests 
itself  whenever  one  has  data  at  one  level  of  aggre¬ 
gation  (e.g.,  the  state  level)  but  is  interested  in 


inferences  at  another  level  of  data  aggregation 
(e.g.,  the  county  level).  Accordingly,  the  term 
cross-level  inference  is  often  used  as  a  synonym 
for  ecological  inference.  This  entry  discusses  the 
nature  and  implications  of  this  problem. 

A  classic  example  of  the  ecological  inference 
problem  in  political  science  occurs  when  one  tries 
to  determine  how  members  of  different  racial 
groups  cast  their  ballots.  Because  the  United  States 
employs  the  secret  ballot,  the  only  data  available 
for  solving  this  inquiry  are  at  the  precinct  level, 
where  vote  totals  and  racial  demographics  can  be 
obtained  but  not  vote  totals  broken  down  by  racial 
categories.  Epidemiologists  confront  identical 
methodological  issues  when  they  seek  to  explain 
which  environmental  factors  influence  disease  sus¬ 
ceptibility  using  only  data  from  counties  or  hospital 
wards,  rather  than  individual  patients.  Economists 
studying  consumer  demand  and  marketing  strate¬ 
gies  might  need  to  infer  individual  spending  habits 
with  an  analysis  of  sales  data  from  a  specific  region 
and  the  aggregate  characteristics  of  individuals  in 
that  region,  rather  than  from  data  on  individuals’ 
characteristics  and  purchases.  These  different  que¬ 
ries  are  substantively  varied,  and  it  would  be  simple 
to  identify  a  host  of  other  equally  unique  queries 
that  fit  into  the  ecological  inference  mold. 

In  addition  to  substantive  applications  that 
span  many  fields,  the  mathematics  of  the  eco¬ 
logical  inference  problem  are  also  related,  some¬ 
times  isomorphic,  to  inferential  problems  in 
other  disciplines,  even  when  the  subject  matter  is 
not  substantially  related.  For  instance,  geogra¬ 
phers  have  long  been  intrigued  with  the  modifi¬ 
able  areal  unit  problem  (MAUP).  MAUP  occurs 
when  the  estimates  at  one  level  of  aggregation  are 
different  from  the  estimates  obtained  at  a  differ¬ 
ent  level  of  aggregation.  Many  statisticians  and 
mathematicians  have  been  captivated  by 
Simpson’s  paradox,  which  is  the  reversal  in  direc¬ 
tion  of  association  between  two  variables  when  a 
third  (“lurking”)  variable  is  controlled.  Described 
in  this  way,  Simpson’s  paradox  (and  conse¬ 
quently  ecological  inference)  is  akin  to  the  omit¬ 
ted  variable  problem  discussed  in  virtually  all 
econometrics  and  regression  texts.  While  nota¬ 
tion  and  terminology  may  differ,  the  similarities 
of  the  underlying  problems  cannot  be  denied. 

The  main  difficulty  with  cross-level  inference  is 
that  not  only  might  the  estimates  at  different  levels 
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of  aggregation  be  different,  they  could  be  substan¬ 
tially  different  and  even  lead  to  different  conclu¬ 
sions.  William  Robinson  (1950)  was  among  the 
first  to  point  out  this  conundrum  with  correlation 
coefficients.  He  examined  the  relationship  between 
nativity  and  illiteracy.  At  the  individual  level,  the 
correlation  coefficient  was  .118.  If  the  data  were 
aggregated  at  the  state  level,  the  correlation  coeffi¬ 
cient  reverses  sign  and  becomes  —.526.  If  the  data 
were  aggregated  at  the  census  geographic  divisions, 
the  correlation  coefficient  remains  negative  at 
—.619.  The  same  phenomenon  occurs  when  race 
and  illiteracy  are  examined.  The  correlation  at  the 
individual  level  is  .203.  At  the  census  division  level, 
the  correlation  is  .946.  At  the  state  level,  the  cor¬ 
relation  is  .773.  While  these  correlation  coefficients 
do  not  change  sign,  they  vary  widely  and  imply 
different  substantive  conclusions.  In  fact,  such  wide 
discrepancies  are  common,  and  worse,  in  a  given 
application,  a  researcher  is  unable  to  determine  if 
his  or  her  aggregated  results  bear  similarity  to  the 
true  individual-level  relationship. 

Let  us  consider  the  challenge  of  inferring  how 
individuals  voted  from  election  returns  aggregated 
at  the  precinct  level.  Because  U.S.  elections  employ 
the  secret  ballot,  individual  vote  choices  are 
unknown.  Election  returns,  however,  are  reported 
at  the  precinct  level,  and  aggregate  individual 
racial  categorizations  can  usually  be  obtained  and 
merged  with  the  voting  data.  Hence,  for  any  given 
precinct,  the  available  data  include  how  many 
votes  each  candidate  received  as  well  as  the  pre¬ 
cinct’s  racial  composition.  The  simplified  problem 
can  be  summed  up  with  a  2  X  2  contingency  table. 


Candidate  1 

Candidate  2 

Group  1 

0nZi 

(1  ~  Pn)*i 

*1 

Group  2 

021Z2 

(1  _  02l)*2 

z2 

Vi 

72 

The  known  elements  of  this  problem  are  the 
proportion  of  voters  in  each  group,  zt  and  z2,  and 
the  proportion  of  votes  received  by  each  candidate, 
y1  and  y2.  The  unknown  elements  are  the  propor¬ 
tions  of  each  group  that  voted  for  each  candidate, 
Pu,  312,  (321,  and  p„.  If  we  assume  that  Group  1 
and  Group  2  are  mutually  exclusive  and  exhaus¬ 
tive,  then  z2  is  simply  (1  —  z,)-  Similarly,  if  we 


assume  that  all  votes  were  cast  for  either  Candidate 
1  or  Candidate  2  and  there  were  no  abstentions, 
then  y2  —  (1  —  jq).  Such  contingency  tables  are 
usually  available  to  describe  each  areal  unit  in  the 
data  set.  In  our  voting  application,  since  a  set  of 
precincts  comprises  the  voting  district  in  question, 
one  might  index  these  values  with  a  further  sub¬ 
script  to  identify  the  precinct.  For  each  precinct  i, 
then,  we  have  the  following  relationship: 

y U  —  3lli*li  021 <(1  _  G/)' 

This  relationship  holds  exactly  for  each  of  the 
I  precincts  in  the  district,  yielding  a  system  with 
I  equations  (one  for  each  precinct)  and  27  unknowns 
(two  parameters,  (3ni  and  p21(  for  each  precinct). 
From  basic  algebraic  properties,  we  know  that 
there  is  not  enough  information  to  solve  for  these 
parameters  given  these  equations.  The  fundamen¬ 
tal  indeterminacy  is  that  multiple  values  of  the 
parameters  satisfy  these  restrictions.  Note  as  well 
that  adding  observations  of  further  precincts  does 
not  change  the  indeterminacy.  Each  new  precinct 
adds  one  more  equation  to  the  system  and  two 
more  parameters  to  be  inferred. 

The  ecological  inference  problem  is  an  example 
of  an  ill-posed  inverse  problem,  or  an  inverse  prob¬ 
lem  with  many  “solutions,”  that  is,  a  problem  that 
encompasses  a  fundamental  indeterminacy.  Al¬ 
though  one  may  wish  to  obtain  a  point  estimate  or 
“solution”  to  the  problem,  there  is  not  enough 
information  to  narrow  down  the  feasible  set  of 
estimates  to  one  particular  point  estimate.  This 
type  of  problem  has  been  described  as  “partially 
identified”  because  while  the  available  information 
does  not  allow  one  to  identify  the  parameter  of 
interest  without  the  imposition  of  strong  assump¬ 
tions,  it  does  often  allow  one  to  identify  a  “region” 
within  which  the  parameters  must  lie,  thus  par¬ 
tially  identifying  the  problem.  How  to  proceed 
with  partially  identified  problems  such  as  the  eco¬ 
logical  inference  problem  is  not  obvious. 

There  are  various  strategies  for  pursuing  prob¬ 
lems  of  the  partially  identified  ilk.  However,  for  any 
particular  application,  how  to  proceed  in  identify¬ 
ing  an  estimation  strategy  is  not  straightforward 
and  rarely  obvious  on  the  basis  of  theory  alone.  The 
key  criterion  involves  the  costs  and  benefits  associ¬ 
ated  with  different  types  of  inferential  errors.  Every 
assumption  involves  an  acceptance  of  potential 
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error  in  estimation.  Some  of  these  assumptions 
may  be  more  palatable  than  others.  Importantly, 
one  must  never  allow  excitement  over  the  possibil¬ 
ity  of  a  point  estimate  to  override  careful  and 
explicit  consideration  of  attendant  assumptions. 
There  are  no  pat  answers,  no  generic  solutions. 

Many  methods  have  been  proposed  for  obtain¬ 
ing  estimates  for  ecological  inference  problems. 
Although  the  multifarious  approaches  to  this  prob¬ 
lem  have  been  distinct,  they  can  be  seen  as  lying 
along  a  continuum.  The  left  end  of  the  continuum 
is  characterized  by  a  lack  of  assumptions  and, 
concomitantly,  high  credibility.  Even  when  no 
assumptions  are  imposed,  the  data  can  allow  one 
to  narrow  the  range  in  which  the  true  values  of  the 
parameters  of  interest  lie.  The  best  known  proce¬ 
dure  for  obtaining  a  range  of  possible  estimates 
comes  from  Oris  Duncan  and  Beverly  Davis’s 
method  of  bounds.  The  ideal  situation  occurs  if  the 
bounds  are  sufficiently  informative.  If  they  are  not, 
one  might  choose  to  move  along  this  continuum, 
but  then  assumptions,  which  may  or  may  not  be 
true,  must  be  made.  At  the  right  end  of  the  con¬ 
tinuum  are  approaches  that  lead  to  a  precise  point 
estimate  for  each  parameter.  There  are,  to  be  sure, 
many  different  approaches  clustered  on  the  right, 
in  the  region  of  strong  and  numerous  assumptions. 
Indeed,  the  bulk  of  the  effort  has  been  directed 
toward  obtaining  point  estimates  or  to  somehow 
identify  a  problem  that  is  only  partially  identified. 
In  general,  one  cannot  zero  in  on  precise  estimates 
without  making  restrictive  assumptions  and  thus 
trading  reduced  credibility  for  increased  precision. 
Empirical  researchers  should  be  aware  that  no 
solution  comes  free.  Every  method  yielding  point 
estimates  necessarily  rests  on  assumptions  that  are 
strong  enough  to  remove  the  indeterminacy  of  eco¬ 
logical  inference.  Researchers  contemplating  appli¬ 
cation  of  any  method  should  carefully  consider 
whether  the  associated  assumptions  are  credible  in 
their  applications. 

In  practice,  it  is  rare  that  one  can  accept  point- 
identifying  assumptions  with  great  confidence.  The 
prudent  researcher  should  then  resist  the  tempta¬ 
tion  to  embrace  any  particular  estimation  method. 
Instead,  the  analysis  of  aggregate  data  should  be  a 
process.  First,  one  should  determine  what  one  can 
learn  from  the  data  alone  without  imposing 
any  assumptions.  Then,  one  should  consider  vari¬ 
ous  assumptions  that  have  identifying  power.  A 


productive  approach  is  to  “layer”  the  assumptions, 
imposing  them  sequentially  in  order  of  decreasing 
plausibility.  As  more  assumptions  are  imposed,  one 
will  be  able  to  draw  conclusions  that  are  increasingly 
sharp  but  decreasingly  believable.  This  process 
of  inference  illuminates  the  respective  roles  that  data 
and  assumptions  play  in  empirical  research. 
Moreover,  it  enables  both  researchers  and  their  con¬ 
sumers  to  adjudicate  how  best  to  reconcile  the  inevi¬ 
table  tension  between  the  strength  of  conclusions 
and  their  credibility  in  ecological  inference  problems. 
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Institutional  theory  has  been  grappling  with  one 
major  problem:  how  to  explain  the  dynamics  of 
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change  in  institutions.  Scholars  in  the  three  long¬ 
standing  “new  institutionalisms” — (1)  rational 
choice  institutionalism  (RI),  (2)  historical  institu¬ 
tionalism  (HI),  and  (3)  sociological  institutionalism 
(SI) — have  increasingly  sought  to  “endogenize” 
change,  that  is,  explain  it  “from  the  inside.”  In 
response  to  criticisms  that  their  analytic  frame¬ 
works  could  account  for  continuity  but  not  for 
change,  which  they  explained  mainly  “from  the 
outside,”  as  the  result  of  exogenous  shocks,  these 
neo-institutionalists  have  increasingly  sought  to 
explain  the  origins  of  and  shifts  in  interest-based 
preferences,  historical  paths,  or  cultural  frames. 
Whereas  many  of  these  scholars  have  sought 
explanations  using  one  or  other  of  the  two  existing 
alternative  approaches,  others  have  turned  instead 
to  ideas  and  discourse.  For  some  of  these  neo¬ 
institutionalists,  this  was  but  a  brief  encounter,  in 
particular  for  RI  scholars,  since  taking  ideas  and 
discourse  “seriously”  undermined  many  of  the 
very  premises  of  their  approaches.  Others,  how¬ 
ever,  broke  with  some  of  the  fundamental  prem¬ 
ises  of  their  own  institutionalist  tradition  with 
their  turn  to  ideas  and  discourse.  Because  these 
latter  scholars  have  come  to  have  as  much  if  not 
more  in  common  with  one  another  than  with  those 
in  the  institutionalist  tradition  from  which  they 
emerged,  they  have  come  to  be  seen  as  part  of  a 
fourth  new  institutionalism,  discursive  institution¬ 
alism  (DI) — sometimes  also  called  the  ideational 
turn  or  constructivist  institutionalism. 

The  three  long-standing  neo-institutionalisms, 
although  very  different,  tend  to  share  one  common 
assumption:  Institutions  serve  primarily  as  con¬ 
straints.  RI  focuses  on  rational  actors  who  pursue 
their  preferences  following  a  logic  of  calculation 
within  political  institutions,  defined  as  structures 
of  incentives;  HI  details  the  development  of  politi¬ 
cal  institutions,  described  as  regularized  patterns 
and  routinized  practices  subject  to  a  logic  of  path 
dependence ;  and  SI  concentrates  on  social  agents 
who  act  according  to  a  logic  of  appropriateness 
within  political  institutions,  defined  as  socially 
constituted  and  culturally  framed  rules  and  norms. 
By  contrast,  DI  focuses  on  sentient  agents  who 
convey  substantive  ideas  through  the  interactive 
processes  of  discourse  in  given  meaning  to  contexts 
following  (contextualized)  logics  of  communica¬ 
tion.  Institutions,  here,  rather  than  static  external 
rule-following  structures,  are  dynamic  structures 


and  constructs  of  meaning  internal  to  agents  who 
have  the  ability  not  only  to  create  (and  maintain) 
institutions  but  also  to  communicate  critically 
about  them  and  to  change  (or  maintain)  them 
through  collective  action.  But  while  institutional 
context  in  DI  therefore  refers  first  and  foremost  to 
the  structure,  construction,  and  communication  of 
meaning,  it  can  also  be  understood  in  terms  of  the 
background  information  provided  by  the  other 
three  neo-institutionalisms  in  political  science,  with 
which  discursive  institutionalists  may  engage  and 
from  which  they  often  emerge.  This  background 
information  is  all  about  the  structures,  understood 
in  terms  of  historical  institutional  rules  and  regu¬ 
larities,  sociological  institutionalist  cultural  rules 
and  frames,  and  rational  choice  institutionalist 
incentives  and  power  asymmetries,  that  constitute 
the  context  within  which  collective  action  occurs 
and  that  affect  the  ways  in  which  discursive  insti¬ 
tutionalist  meaning  structures  are  constructed. 

Thus,  the  four  institutionalisms  share  with  one 
another  a  core  focus  on  the  importance  of  institu¬ 
tions,  but  they  differ  in  their  definitions  of  institu¬ 
tions,  in  their  objects  and  logics  of  explanation, 
and  in  the  ways  in  which  they  deal  with  change  (see 
Table  1).  This  entry  turns  first  to  the  examination 
of  attempts  to  endogenize  change  and  the  move  to 
ideas  in  RI,  next  in  HI,  and  then  in  SI,  before 
exploring  the  same  in  DI,  in  particular  through  the 
interactive  dimension  of  discourse.  The  entry  con¬ 
cludes  with  a  consideration  of  the  interrelation¬ 
ships  among  the  four  new  institutionalisms. 

Endogenizing  Change  in  Rational 
Choice  Institutionalism 

RI  posits  rational  actors  with  fixed  preferences 
who  calculate  strategically  to  maximize  their  pref¬ 
erences  and  for  whom  institutions  represent  the 
incentive  structures  that  reduce  the  uncertainties 
resulting  from  the  multiplicity  of  individual  prefer¬ 
ences  and  issues.  The  very  premises  of  RI  about  the 
fixed  nature  of  preferences  and  the  stability  of 
institutions  ensured  that  preference  formation  was 
long  of  little  interest  to  RI  scholars,  while  institu¬ 
tional  change  was  ruled  out  analytically  a  priori, 
given  equilibrium-based  assumptions  that  entailed 
that  change  could  only  come  from  the  outside, 
through  exogenous  shocks.  The  inability  of  RI  to 
deal  with  institutional  change  from  the  inside  and 
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Table  1  The  Four  Institutionalisms 

Rational  Choice 
Institutionalism  (RI) 

Historical 

Institutionalism  (HI) 

Sociological 
Institutionalism  (SI) 

Discursive 

Institutionalism  (DI) 

Object  of 
explanation 

Logic  of 
explanation 

Behavior  of  rational 

actors 

Calculation 

Structures  and 
practices 

Path  dependency 

Norms  and  culture 
of  social  agents 
Appropriateness 

Ideas  and  discourse  of 
sentient  agents 

Communication 

Definition  of 
institutions 

Incentive  structures 

Macrohistorical 
structures  and 
regularities 

Cultural  norms  and 
frames 

Meaning  structures  and 
constructs 

Approach  to 
change 

Static — Continuity 
through  fixed 
preferences,  stable 
institutions 

Static — Continuity 
through  path 
dependency 
interrupted  by 
critical  junctures 

Static — Continuity 
through  cultural 
norms  and  rules 

Dynamic — Change  (and 
continuity)  through 
ideas  and  discursive 
interaction 

Explanation  of 
change 

Exogenous  shock 

Exogenous  shock 

Exogenous  shock 

Endogenous  process 
through  background 
ideational  and 
foreground  discursive 
abilities 

Recent 

innovations  to 
explain  change 

Endogenous 
ascription  of 
interest  shifts 
through  RI  political 
coalitions  or  HI 
self-reinforcing  or 
self-undermining 
processes 

Endogenous 
description  of 
incremental  change 
through  layering, 
drift,  conversion 

Endogenous 
construction  (merge 
with  DI) 

Endogenous 
construction  through 
reframing,  recasting 
collective  memories 
and  narratives  through 
epistemic  communities, 
advocacy  coalitions, 
communicative  action, 
deliberative  democracy 

Source:  Schmidt,  V.  A.  (2010).  Taking  ideas  and  discourse  seriously:  Explaining  change  through  discursive  institutionalism  as 
the  fourth  new  institutionalism.  European  Political  Science  Review,  2(1),  1-25. 


its  difficulties  in  accounting  for  preference  forma¬ 
tion  have  led  scholars  in  the  RI  tradition  in  recent 
years  to  seek  to  endogenize  change.  For  some,  this 
has  meant  remaining  within  RI  and  seeking  to 
explain  change  through  shifts  in  political  coali¬ 
tions.  For  others,  it  has  also  entailed  looking  to  HI 
approaches  for  the  origins  of  preference  forma¬ 
tion,  in  particular  for  the  origins  of  institutions, 
and  for  trajectories  of  change.  For  yet  other  schol¬ 
ars,  however,  endogenizing  change  has  involved 
the  turn  to  ideas — with  most  doing  so  in  ways  that 
did  not  undermine  RI  basic  premises  and  a  few 
moving  to  DI. 


In  their  attempts  to  find  new  ways  to  account 
for  preference  formation  and  institutional  shifts, 
RI  scholars  have  mostly  looked  to  political  coali¬ 
tion  formation,  although  they  have  also  sometimes 
sought  a  rapprochement  with  HI.  Those  RI  schol¬ 
ars  concerned  with  political  coalitions  often  focus 
on  formal  electoral  institutions,  arguing  that  these 
act  as  incentive  structures  to  produce  the  political 
coalitions  that  implement  certain  kinds  of  policies. 
But  whatever  the  value  of  this  insight  within  purely 
RI  analytics,  the  problem  is  that  it  remains  highly 
deterministic,  as  institutions  are  portrayed  as 
determining  politics,  which  in  turn  determine 
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political  outcomes,  and  it  still  cannot  explain  the 
origins  of  the  institutions  or  of  the  political  coali¬ 
tions  that  created  them,  let  alone  why  institutions 
or  policies  might  change  over  time.  When  RI  schol¬ 
ars  try  to  respond  to  the  question  of  origins  by 
turning  to  HI,  they  often  go  back  to  the  founding 
moments  of  the  electoral  institutions.  But  in  this 
case,  institutional  foundations  serve  to  explain 
contemporary  political  institutions  that  explain 
current  political  coalitions  that  explain  current 
policies.  And  where  is  endogenous  change? 

Instead  of  using  HI  to  enhance  substantive  theo¬ 
rizing,  RI  scholars  sometimes  seek  to  build  HI 
insights  into  game-theoretic  analysis.  Avner  Greif 
and  David  Laitin,  for  example,  redefine  the  goal  of 
institutions — from  self-enforcing  to  self-reinforcing 
or  self-undermining  institutions — and  their  effects 
as  parametric  in  the  short  term,  meaning  exoge¬ 
nous  and  fixed  for  agents  who  act  on  self-enforcing 
beliefs,  but  only  quasi-par ametric  in  the  long  run, 
meaning  endogenous  and  variable  as  individuals 
are  led  to  act  in  a  manner  that  does  not  reproduce 
the  associated  beliefs.  The  problem  here  is  that 
although  this  may  better  account  for  change  over 
time  in  game-theoretic  terms,  we  are  still  left  with 
the  irrationality  of  the  choice  of  institutions  to 
begin  with;  the  deterministic  trajectory  of  change 
over  time,  now  for  better  or  for  worse;  and  the 
limited  rationality  of  these  supposedly  rational 
actors  at  any  given  point  in  time. 

Relatively  few  RI  scholars  have  turned  to  ideas 
to  solve  the  problem  of  institutional  change. 
Among  those  who  have,  the  most  significant  RI 
engagement  with  ideas  began  in  the  1990s, 
although  it  has  remained  rather  circumscribed  ever 
since.  It  never  took  ideas  very  seriously,  since  ide¬ 
ational  explanation  was  deemed  useful  only  when 
and  if  explanation  in  terms  of  objective  or  material 
interests  was  insufficient.  This,  RI  scholars  argued, 
would  occur  if  ideas  come  before  interests,  as  road 
maps — in  which  case  ideas  are  little  more  than 
mechanisms  for  choosing  among  interests;  if  ideas 
come  after  interests,  acting  as  focal  points — in 
which  case  ideas  serve  at  best  as  switches  for  inter¬ 
ests;  if  ideas  are  embedded  in  institutions,  in  which 
case  it  is  the  institutions  rather  than  the  ideas  that 
really  matter;  if  ideas  are  after  the  fact  legitimation 
of  actors’  interest-based  action — in  which  case 
ideas  are  not  really  taken  seriously  at  all;  or  even 
if,  following  the  work  of  Douglas  North,  ideas  are 


“shared  mental  modes” — in  which  case  ideas 
would  necessarily  have  an  effect  on  the  content  of 
interests,  undermining  the  very  RI  definitions  of 
interests  as  separate  from  ideas. 

The  problem  for  RI  scholars,  then,  and  the  rea¬ 
son  most  quickly  abandoned  the  pursuit  of  ideas, 
is  that  they  could  not  continue  to  maintain  the 
artificial  separation  of  objective  interests  from 
subjective  ideas  about  interests,  that  is,  beliefs  and 
desires.  Such  subjective  interests  threatened  to 
overwhelm  the  objective  ones  that  are  at  the  basis 
of  the  rationalists’  thin  model  of  rationality,  by 
undermining  the  fixed  nature  of  preferences  and 
the  notion  of  outcomes  as  a  function  of  preexisting 
preferences.  And  without  fixed  preferences  as  well 
as  neutral  institutional  incentive  structures,  RI 
scholars  lose  the  parsimony  of  the  approach  and 
everything  that  follows  from  it,  including  the  abil¬ 
ity  to  mathematically  model  games  rational  actors 
play  as  opposed  to,  in  the  words  of  Fritz  Scharpf, 
the  “games  real  actors  play.”  This  helps  explain 
why  the  foray  into  ideas  for  most  dyed-in-the- 
wool  RI  scholars  was  short-lived.  For  those  who 
persisted,  however,  a  whole  new  approach  to  the 
explanation  of  interests  and  institutions  has 
opened  up. 

For  DI  scholars  engaged  with  the  RI  tradition, 
subjective  interests  replace  the  objective  ones  of  RI, 
as  ideas  about  interests  that  bring  in  a  much  wider 
range  of  strategic  ideas  and  social  norms.  And 
these  must  be  explained  in  terms  of  their  meaning 
to  the  actors  within  a  given  meaning  context  rather 
than  in  terms  of  some  set  of  universally  identifiable 
interests.  Material  interests,  economic  in  particular, 
which  are  at  the  basis  of  much  of  the  institutional 
incentives  in  the  rational  choice  institutionalist  lit¬ 
erature,  are  not  ignored.  But  DI  scholars  tend  to 
separate  material  interests  analytically  into  mate¬ 
rial  reality  and  interests  rather  than  to  conflate 
them,  such  that  material  reality  constitutes  the  set¬ 
ting  within  which  or  in  response  to  which  agents 
may  conceive  of  their  interests.  This  makes  for  a  DI 
ontology  that  is  largely  constructivist,  much  as  in  SI 
(see  below),  by  contrast  with  the  materialist  ontol¬ 
ogy  of  most  RI  scholars,  who  assume  that  they  can 
attribute  material  interests  to  actors,  regardless  of 
how  those  actors  themselves  construct  their  own 
ideas  about  their  interests.  And  it  makes  for  a  dif¬ 
ferent  approach  to  epistemological  questions  of 
knowledge  and  certainty. 
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In  DI,  the  kind  of  knowledge  and  degree  of  cer¬ 
tainty  agents  may  have  with  regard  to  their  ideas 
about  material  reality  may  differ,  depending  on 
the  aspect  of  material  reality  with  which  they  are 
concerned.  Illustrative  of  this  epistemological  obser¬ 
vation  is  Ludwig  Wittgenstein’s  little  noticed  dis¬ 
tinction  in  On  Certainty  between  language  games 
based  on  our  everyday  experiences  in  the  world, 
which  tend  to  admit  few  doubts  or  mistakes,  and 
language  games  based  on  our  (social)  scientific  pic¬ 
tures  of  the  world,  which  may  always  allow  for 
doubts,  mistakes,  and  even  gestalt  switches.  A 
problem  with  RI  is  that  it  tends  to  develop  (social) 
scientific  pictures  games  of  the  world  that  it  treats 
as  if  they  had  the  certainty  of  experience  games. 

Analyzed  using  the  terms  of  RI,  Mark  Blyth 
suggests  that  the  problem  is  that  RI  scholars 
assume  that  most  phenomena  are  explainable  in 
terms  of  “Knightian  risk,”  as  part  of  a  directly 
observable  world  in  which  agents  can  calculate  the 
subjective  probability  of  the  likely  outcomes  of 
their  preferences,  such  as  in  the  U.S.  Congress.  But 
much  of  the  phenomena  that  RI  scholars  seek  to 
explain  are  not  directly  observable  and  are  much 
closer  to  “Knightian  uncertainty”  because  of  the 
impossibility  of  knowing  let  alone  statistically  pre¬ 
dicting  the  effects  of  all  the  forces  that  may  have 
an  impact  on  economic  and  political  realities.  For 
these  phenomena,  agents  are  not  simply  unsure 
about  how  to  achieve  their  interests  but  also 
unsure  of  what  their  interests  are,  given  that  the 
uncertainties  are  too  great,  the  moment  unique, 
prediction  impossible,  and  agents’  interests  always 
structurally  underdetermined.  Thus,  for  example, 
the  Great  Depression  of  the  1930s  and  the  eco¬ 
nomic  crisis  following  the  oil  shocks  of  the  1970s 
were  both  cases  of  Knightian  uncertainty,  as  gov¬ 
ernments,  business,  and  labor  at  these  critical  junc¬ 
tures  all  sought  to  reconstitute  interests  through 
alternative  narratives  and  causal  stories  about  the 
reasons  for  the  crisis,  seeking  to  produce  new 
political  coalitions  for  collective  action,  with  new 
ideas  about  how  to  replace  existing  institutions. 

Needless  to  say,  the  economic  crisis  that  began 
in  2008  is  a  similar  moment  of  Knightian  uncer¬ 
tainty,  when  the  economists’  predictive  models  of 
rationalist  behavior  proved  unable  to  cope  not 
simply  because  they  wrongly  predicted  but  because 
their  use  actually  altered  the  market,  by  acting,  in 
the  words  of  Donald  Mackenzie,  as  an  “engine” 


transforming  the  environment,  not  a  “camera” 
recording  it.  It  is  important  to  note,  however,  that 
while  the  predictive  failure  of  economists’  picture 
games  contributed  to  the  radical  uncertainties  of 
the  economic  crisis,  where  bankers  went  very 
wrong  is  when  they  ignored  what  they  knew  with 
a  high  degree  of  certainty  from  their  cumulative 
experiences  of  lending  and  of  assessing  the  reliabil¬ 
ity  of  risk,  creditworthiness,  and  the  likelihood  of 
repayment  over  time  and  turned  instead  to  predic¬ 
tive  models  that  they  believed  were  more  certain 
because  they  were  based  on  “scientific”  methods. 

Knightian  uncertainty  need  not  always  be  con¬ 
nected  to  such  dramatic  moments,  however.  DI 
scholars  in  the  RI  tradition  have  identified  numer¬ 
ous  cases  in  which  ideas  about  interests  may  be  in 
question  under  more  everyday  circumstances.  For 
example,  actors  may  not  know  what  their  interests 
are  because  they  are  in  a  new  area  of  activity  and 
thus  have  to  determine  which  utility  to  maximize 
(interests),  how  to  maximize  it  (strategies),  and  to 
what  end  (goals).  Alternatively,  long-standing 
ideas  about  interests  may  come  into  question  at  a 
critical  juncture,  say,  when  a  central  figure  in  a 
system  signals  an  idea  that  leads  to  a  shift  in  belief 
system.  In  both  of  these  instances,  scholars  chal¬ 
lenge  RI  logics  only  in  special  circumstances — with 
new  ideas  in  new  areas  of  activity  or  when  old 
ideas  are  questioned.  In  both  cases,  however,  by 
limiting  the  importance  of  ideas  to  the  period  of 
uncertainty  between  the  end  of  the  old  institutional 
“game”  and  the  beginning  of  the  newly  agreed 
institutional  “game,”  DI  scholars  risk  making  it 
appear  that  DI  is  significant  only  at  the  critical 
moment  of  changing  DI  ideas,  which  is  preceded 
and  followed  by  Rl-crystallized  preferences  and 
frozen  institutions.  Institutions,  however,  also 
change  over  time  as  the  ideas  that  infuse  them 
change,  as  other  DI  scholars  in  the  RI  tradition 
suggest.  This  is  because  institutions  themselves  are 
not  neutral  structures  of  incentives  but,  rather,  as 
Bo  Rothstein  suggests,  may  be  the  carriers  of  ideas 
or  collective  memories  that  make  them  objects  of 
trust  or  mistrust  and  changeable  over  time  as 
actors’  ideas  and  discourse  about  them  change  in 
tandem  with  changes  in  their  performance. 

DI  scholars  who  engage  with  the  RI  tradition, 
then,  like  RI  scholars,  speak  the  language  of  inter¬ 
ests,  incentive  structures,  and  collective  action.  But 
they  see  these  as  infused  with  a  wide  range  of  ideas 
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and  norms  rather  than  narrowly  focused  on  an 
instrumental  rationality  of  utility  maximization, 
with  incentive  structures  normative  rather  than 
neutral,  interests  subjective  rather  than  objective, 
and  their  explanations,  where  these  are  RI  pictures 
of  the  world,  much  more  uncertain  than  RI  schol¬ 
ars  recognize. 

Endogenizing  Change  in 
Historical  Institutionalism 

HI  explores  how  institutions,  understood  as  sets  of 
regularized  practices  with  rulelike  qualities,  struc¬ 
ture  action  and  outcomes.  It  emphasizes  not  just 
the  operation  and  development  of  institutions  but 
also  the  path  dependencies  and  unintended  conse¬ 
quences  that  result  from  such  historical  develop¬ 
ment.  The  emphasis  of  HI  on  macrostructures  and 
path  dependence  means  that  it  has  no  agency  built 
into  it,  making  it  difficult  for  it  to  explain  why 
change  may  occur  other  than  exogenously.  And  by 
focusing  on  critical  junctures,  with  big  bangs  fol¬ 
lowed  by  path-dependent  continuities,  HI  can  be 
as  equilibrium  based  as  RI — which  is  even  more 
problematic  for  an  approach  that  purportedly 
explains  “history.”  Hi’s  inability  to  explain  change 
from  the  inside  along  with  its  lack  of  agents  has 
spurred  scholars  in  the  HI  tradition  to  seek  to 
endogenize  change.  And  here,  whereas  for  some 
this  meant  remaining  within  HI  to  theorize  incre¬ 
mental  change  and/or  to  turn  for  agency  to  RI,  for 
others  it  has  meant  a  turn  to  SI  and,  increasingly, 
to  DI. 

In  their  attempts  to  introduce  agency,  HI  schol¬ 
ars  have  most  often  looked  to  RI  for  the  micro¬ 
foundations  to  their  macropatterns.  But  in  so 
doing,  they  have  often  simply  reinforced  the 
immobility  of  institutions.  One  example  of  this  is 
the  varieties  of  capitalism  (VOC)  school  pioneered 
by  Peter  Hall  and  David  Soskice,  which  explicitly 
married  the  two  frameworks  in  its  analysis  of  the 
binary  division  of  capitalism  into  liberal  and  coor¬ 
dinated  market  economies.  VOC  is  HI  in  its  defini¬ 
tion  of  the  historically  grounded  macro-institutional 
rules  and  regularized  practices  of  the  two  different 
capitalist  varieties,  RI  in  its  focus  on  the  rationally 
based,  microfoundational  logic  of  coordination 
games  among  firms  and  other  relevant  actors.  But, 
however  valuable  the  insight  into  the  rationalist 
logics  of  complementarities  among  components 


and  coordinating  mechanisms  among  corporate 
actors  within  different  macro-institutional  con¬ 
texts,  the  result  is  a  very  static  depiction  of  capital¬ 
ism  that  critics  have  also  found  overly  functionalist 
and  unable  to  account  for  the  changing  conditions 
resulting  from  global  and  regional  economic  and 
institutional  forces. 

Recent  revisionist  approaches  to  HI  led  by 
Kathleen  Thelen  and  Wolfgang  Streeck,  which 
describe  incremental  change  in  institutions  through 
endogenous  processes  of  layering  of  new  elements 
onto  the  old,  through  conversion  by  means  of  rein¬ 
terpretation  or  redirection,  or  by  drifting  through 
deliberate  neglect  or  exhaustion,  do  offer  a  way 
out  of  the  HI  statics  of  VOC.  But  they  do  this  at 
the  expense  of  VOC’s  theoretical  RI  logic,  since 
allowing  for  incremental  change  by  definition 
makes  for  unfixed  preferences  and  unstable  insti¬ 
tutions.  Their  “soft”  RI  framework  of  strategic 
actors  trying  to  achieve  advantage  by  interpreting 
or  redirecting  institutions  enables  them  at  best  to 
provide  empirical  accounts  of  rationalist  behavior. 
And  they  still  cannot  explain  preference  formation 
or  why  any  given  institution  was  the  object  of  lay¬ 
ering  or  conversion  in  one  or  another  way, 
although  they  can  describe  it.  Peter  Hall  and 
Kathleen  Thelen  do  more  to  explain  preference 
formation,  with  a  clearer  theoretical  picture  of  the 
use  of  RI  for  HI  incremental  change,  when  they 
cast  institutions  as  the  target  of  rationalist  strategic 
action  by  economic  actors,  who  use  them  as 
resources  to  achieve  their  goals,  with  change 
depending  on  deliberative  interaction,  defection, 
and  reinterpretation.  But  if  actors  are  engaged  in 
constant  deliberation  and  reinterpretation  of  their 
perceived — read  subjective — interests,  then  here 
too,  the  use  of  RI  cannot  go  very  far  in  theoretical 
terms  because  preferences  are  not  fixed,  are  subjec¬ 
tive  rather  than  objective,  and  institutions  are  not 
stable  enough  to  theorize  as  incentives.  And  once 
preferences  are  seen  as  subjective,  with  agency 
consisting  of  deliberation  and  reinterpretation,  the 
way  is  open  to  taking  seriously  the  role  of  ideas  in 
altering  agents’  perceptions  of  their  preferences 
and  of  discourse  in  deliberating  and  reinterpreting 
agents’  institutions. 

The  HI  analytic  framework  is  generally  more 
open  to  the  turn  to  ideas  than  RI.  This  is  because 
whereas  RI  has  certain  ontological  and  epistemo¬ 
logical  presuppositions  about  agency  that  clash 
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with  those  of  DI,  HI  has  none,  since  it  lacks 
agency.  As  a  result,  whereas  RI  scholars  have 
largely  resisted  the  turn  to  ideas,  HI  scholars  have 
been  divided  over  how  far  to  go  with  regard  to 
ideas.  The  question  here  is  where  the  tipping  point 
is  between  HI  scholars  who  continue  to  see  institu¬ 
tions  as  constitutive  of  ideas  and  those  who  might 
better  be  called  DI  scholars  within  an  HI  tradition 
because  they  see  ideas  as  constitutive  of  institu¬ 
tions  even  if  shaped  by  them.  Interestingly  enough, 
even  foundational  books  that  are  purportedly 
exclusively  HI  often  contain  chapters  that  can  be 
seen  as  DI,  whether  because  they  focus  on  the  role 
of  ideas  and  knowledge  in  the  making  of  policy, 
add  to  RI  explanations  of  interest  calculations  a 
focus  on  processes  of  power  and  persuasion,  or 
mix  HI  and  DI  (in  the  HI  or  SI  tradition).  These 
may  involve  showing  how  policy  actors  may  con¬ 
sciously  layer  new  policy  ideas  onto  the  old  or 
illustrating  how  battles  over  ideas  generate  new 
institutional  paths  of  development. 

But  if  HI  and  DI  approaches  can  be  found  in  the 
same  articles  and  chapters,  then  the  question  needs 
to  be  raised  about  how  to  reconcile  HI  and  DI.  For 
some  DI  scholars  in  the  HI  tradition,  HI  and  DI 
can  fit  easily  together,  with  HI  providing  struc¬ 
tures,  DI  agency.  Often,  this  is  formulated  as 
actors’  ideas  and  discourse  set  the  goals  and  indi¬ 
cate  motivation  while  the  (historical)  institutional 
context  provide  the  constraints  for  actions  that 
also  structure  political  openings  for  mobilization 
and  articulation  of  interests.  The  problem  with  this 
mix  is  that  it  risks  papering  over  some  very  real 
differences  between  HI  and  DI,  in  particular  that 
HI  portrays  institutions  as  constraining  structures 
external  to  actors  whereas  DI  defines  institutions 
as  internal  ideational  constructs  and  structures 
(discussed  as  follows).  Moreover,  it  leaves  open 
the  question  of  whether  there  can  be  a  specifically 
HI  approach  to  agency. 

Recent  endogenous  approaches  to  agency  in  HI 
have  tended  to  borrow  from  evolutionary  biology 
to  explain  institutional  change  in  terms  of  how 
populations’  genetic  predispositions,  combined 
with  environmental  factors,  make  for  the  success, 
replication,  proliferation,  and  genetic  feedback  of 
certain  preferences.  The  problem  with  this 
approach,  from  a  DI  perspective,  is  that  it  appears 
mechanistic,  with  no  sense  of  the  critical  thinking 
of  sentient  agents  (read  real  people)  consciously 


changing  their  institutions — for  better  or  for 
worse — through  deliberation  and  contestation,  as 
well  as  consensus  building  around  ideas.  Sven 
Steinmo  is  arguably  the  only  HI  scholar  who  has 
managed  to  combine  DI  with  an  HI  approach  to 
agency  through  evolutionary  biology,  when  he 
provides  evolutionary  narratives  of  the  trajectories 
of  change  in  states’  economic  and  social  systems 
that  show  not  only  how  human  institutions  evolve 
but  also  how  human  agents  who  are  creative,  have 
ideas,  and  communicate  with  one  another  about 
what  they  are  doing  are  also  a  key  element  in  this 
approach,  along  with  the  unanticipated  conse¬ 
quences  to  purposive  behavior,  and  the  fact  that 
what  they  do  occurs  within  evolving  institutional 
context.  This,  then,  is  a  mix  of  HI  and  DI,  but  one 
in  which  the  HI  tends  to  predominate. 

But  how,  then,  to  give  each  analytic  framework 
its  due?  Some  scholars  separate  the  HI  examina¬ 
tion  of  the  institutional  context  of  historical  rules 
and  regularities,  critical  junctures,  and  incremental 
change  from  the  DI  analysis,  in  which  they  then 
use  the  results  of  the  HI  investigation  as  back¬ 
ground  information.  This  helps  show  how  sentient 
agents  may  infuse  HI  rules  with  contextualized 
meanings,  construct  understandings  and  responses 
to  critical  moments,  or  come  up  with  the  ideas  that 
lead  to  the  layering  of  one  institution  over  another, 
the  reinterpretation  of  an  institution,  or  the  con¬ 
version  of  agents  to  another  institution.  Other  DI 
scholars  interweave  HI  and  DI  together  in  discus¬ 
sions  of  evolutionary  changes  across  time,  while 
giving  primacy  to  the  ways  in  which  evolving  ideas 
affect  changes  in  institutions. 

This  said,  DI  does  not  purport  to  explain  all 
change — this  would  be  a  big  mistake  since  “stuff 
happens,”  events  outside  of  people’s  control  occur 
all  the  time,  material  conditions  do  change,  actions 
often  have  unintended  consequences,  and  actors 
often  act  without  prior  ideas  and  discourse  about 
what  it  is  they  will  do.  As  HI  scholars  remind  us, 
processes  of  change  are  often  unconscious — as 
people  may  act  without  any  clear  sense  of  what 
they  are  doing,  creating  new  practices,  renewing 
old  rules,  or  destroying  them  through  drifting  or 
exhaustion.  DI,  however,  shows  that  much  change 
can  and  should  be  explained  in  terms  of  sentient 
agents’  ideas  about  what  to  change  (or  continue) — 
if  nothing  else,  in  response  to  occurrences  on  the 
outside — that  is,  to  the  stuff  that  happens. 
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DI  scholars  who  engage  with  the  HI  tradition, 
then,  like  HI  scholars,  also  speak  the  language  of 
institutional  rules  and  regularities,  critical 
moments,  and  incremental  change.  It  is  just  that 
they  infuse  these  structures  with  agency  by  focus¬ 
ing  on  the  ideas  of  real  actors  that  help  explain 
changes  in  institutions  or  continuities,  at  critical 
moments  or  incrementally  over  time. 

Endogenizing  Change  in 
Sociological  Institutionalism 

SI  focuses  on  the  forms  and  procedures  of  organi¬ 
zational  life  stemming  from  culturally  specific 
practices,  with  institutions  cast  as  the  norms,  cog¬ 
nitive  frames,  scripts,  and  meaning  systems  that 
guide  human  action  according  to  a  logic  of  appro¬ 
priateness.  In  SI,  therefore,  one  cannot  talk  about 
a  turn  to  ideas  as  such,  since  ideas  have  always 
been  at  the  basis  of  the  approach — as  norms, 
frames,  and  meaning  systems — and  constructivism 
as  opposed  to  RI’s  positivism  as  the  generally 
accepted  view  of  the  nature  of  reality.  One  can, 
however,  nevertheless  talk  about  a  split  in  SI 
between  those  whom  we  could  continue  to  label  as 
SI  scholars  because  they  see  ideas  more  as  static 
ideational  structures,  as  norms  and  identities  con¬ 
stituted  by  culture  that  frame  agents’  actions  and 
identities,  and  those  whom  we  could  call  DI  schol¬ 
ars  in  the  SI  tradition,  because  they  see  agents’ 
ideas  as  constituting  culture  and  framing  action. 

DI  scholars  who  engage  with  the  SI  tradition, 
then,  like  SI  scholars,  also  speak  the  language  of 
cultural  framing,  ideas,  and  discourse.  It  is  just 
that  they  ensure  that  these  are  more  dynamic  and, 
thereby,  better  able  to  explain  institutional  change 
(and  continuity).  Importantly,  DI  scholars  go 
beyond  the  SI  scholars  who  put  ideas  into  cultural 
context  to  put  them  into  their  “meaning”  context 
as  well — that  is,  by  treating  ideas  as  empirical  sub¬ 
jects  to  be  studied  in  their  own  right.  And  such 
meaning  contexts  constitute  very  different  kinds  of 
institutions  from  those  of  RI,  HI,  and  SI. 

For  the  three  older  neo-institutionalisms,  insti¬ 
tutions  are  structures  external  to  agents  that  con¬ 
stitute  rules  about  acting  in  the  world  that  serve 
mainly  as  constraints — whether  by  way  of  ratio¬ 
nalist  incentives  that  structure  action,  historical 
paths  that  shape  action,  or  cultural  norms  that 
frame  action.  For  DI,  by  contrast,  institutions  are 


internal  to  sentient  agents,  serving  both  as  struc¬ 
tures  (of  thinking  and  acting)  that  constrain  action 
and  as  constructs  (of  thinking  and  acting)  created 
and  changed  by  those  actors.  This  internal  capac¬ 
ity  to  create  and  maintain  institutions  derives  from 
what  Vivien  Schmidt  defines  as  agents’  back¬ 
ground  ideational  abilities.  This  is  a  generic  term 
for  what  John  Searle  defines  as  the  background 
abilities  that  encompass  human  capacities,  disposi¬ 
tions,  and  know-how  related  to  how  the  world 
works  and  how  to  cope  with  it  or  for  what  Pierre 
Bourdieu  describes  in  the  Logic  of  Practice  as  the 
habitus  in  which  humans  beings  act  “following  the 
intuitions  of  a  ‘logic  of  practice.’”  But  the  psychol¬ 
ogy  of  cognitive  dissonance  is  also  relevant  here, 
which  shows  that  people  generally  act  without 
thinking  and  become  conscious  of  the  rules  that 
might  apply  only  if  they  are  in  contradiction. 
These  background  ideational  abilities  underpin 
agents’  ability  to  make  sense  in  a  given  meaning 
context,  that  is,  to  “get  it  right”  in  terms  of  the 
ideational  rules  or  rationality  of  a  given  discursive 
institutional  setting. 

But  how,  then,  do  we  theorize  about  the  process 
through  which  sentient  agents  get  it  right  or,  bet¬ 
ter,  manage  to  bring  about  change  in  the  ideational 
rules?  Theoretical  approaches  in  DI  on  how  to  plot 
change  in  ideas  remain  underdeveloped,  despite 
much  empirical  analysis  on  changes  in  ideas. 

Some  of  the  most  popular  theories  of  ideational 
change  focus  on  the  notion  of  paradigm  shift, 
building  on  Thomas  Kuhn’s  seminal  work  in  the 
philosophy  of  science.  Peter  Hall,  who  has  pro¬ 
duced  one  of  the  most  influential  such  analyses  in 
terms  of  paradigms,  formalized  the  process  of  ide¬ 
ational  change  by  identifying  three  orders  of  change 
in  paradigms:  minor  first-order  change  of  policy 
instruments,  moderate  second-order  change  of 
policy  instruments  and  objectives,  and  revolution¬ 
ary  third-order  change  of  instruments,  objectives, 
and  core  ideas.  The  first  two  orders  of  change  sug¬ 
gest  evolutionary  development  within  a  paradigm 
through  first-order  renewal  and  second-order 
recasting  of  ideas,  while  the  third  order  of  change  is 
revolutionary,  since  it  would  replace  one  paradigm 
with  another.  This  approach  takes  us  a  lot  farther 
than  Hi’s  critical  juncture  approach,  by  looking 
into  the  moment  of  crisis  to  explain  where  the  ideas 
came  from,  who  picked  them  up,  and  whether  or 
not  this  produced  a  major  change  in  ideas.  But 
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however  evocative  the  concept  of  paradigm  shift 
may  be  as  a  metaphor  for  change,  the  theory  itself 
still  has  problems  similar  to  the  HI  critical  juncture 
literature.  It  does  not  theorize  closely  enough  the 
conceptual  processes  of  ideational  change,  that  is, 
how  old  ideas  fail  and  new  ideas  come  to  the  fore; 
the  reasons  for  ideational  change,  that  is,  why  cer¬ 
tain  ideas  are  taken  up  rather  than  others;  the 
agency  in  ideational  change,  that  is,  who  is  project¬ 
ing  the  ideas  and  how;  and  the  defining  moment(s) 
of  transformation,  since  paradigm  theory’s  empha¬ 
sis  on  abrupt  shifts  in  ideas  rules  out  not  only  evo¬ 
lutionary  change  but  also  revolutionary  change  in 
ideas  that  is  not  abrupt. 

One  promising  way  forward  is  to  build  on  the 
work  of  discourse  analysts  who  can  be  loosely  seen 
as  DI  scholars  in  the  SI  tradition.  Discourse  ana¬ 
lysts  theorize  the  process  of  ideational  continuity 
and  change  by  showing  how  different  elements 
may  be  added  to  ideas  embedded  in  discourse, 
thereby  bringing  about  change  in  ideas  incremen¬ 
tally  even  in  times  of  stability,  and  not  just  at 
critical  junctures  during  paradigm  shifts.  The  theo¬ 
retical  concepts  of  the  various  discourse  analysis 
schools — once  translated  from  the  sometimes  dif¬ 
ficultly  accessible  and  internally  referential 
language — can  provide  added  value  to  the  analysis 
of  ideas.  For  example,  discourse  analyses  that 
build  on  Michel  Foucault  can  offer  insights  into 
how  to  investigate  the  “archeology”  of  what  was 
acceptable  in  a  given  discursive  formation  over 
time,  from  one  period’s  episteme  to  the  next, 
through  examination  of  networks  of  rules  estab¬ 
lishing  what  is  meaningful  at  any  given  time. 
Conversely,  discourse  theories  built  on  the  work  of 
Ernesto  Laclau  and  Chantal  Mouffe  can  point  to 
different  ways  in  which  concepts  may  be  employed, 
such  as  nodal  points  from  which  all  other  ideas 
take  their  meanings  in  an  ideological  system,  for 
example,  how  communism  in  Central  and  Eastern 
Europe  served  to  distinguish  between  “real”  (com¬ 
munist)  democracy  and  “bourgeois”  democracy. 

There  is  one  final  problem  with  this  focus  on 
ideas,  which  is  that  we  have  yet  to  fully  explain  the 
dynamics  of  institutional  change  through  agency. 
Although  concentrating  on  ideas  gets  us  closer  to 
why  institutional  changes  occur,  they  still  do  not 
explain  how  such  institutional  changes  occur,  that 
is,  how  the  ideas  themselves  promote  institutional 
change.  For  this,  however,  we  need  to  consider 


another  aspect  of  DI,  which  is  the  interactive  side 
of  discourse.  How  ideas  are  generated  among 
policy  actors  and  communicated  to  the  public  by 
political  actors  through  discourse  is  the  key  to 
explaining  institutional  change  (and  continuity). 

Endogenizing  Change  in 
Discursive  Institutionalism 

DI  is  an  umbrella  concept  for  the  vast  range  of 
works  in  political  science  that  take  account  of  the 
substantive  content  of  ideas  and  the  interactive 
processes  by  which  ideas  are  conveyed  and 
exchanged  through  discourse.  So  far,  we  have 
mainly  focused  on  the  substantive  dimension  of 
ideas  and  discourse.  But  even  here,  we  have  only 
touched  on  the  surface  of  the  range  of  discursive 
institutionalist  approaches.  DI  scholars  consider 
ideas  and  discourse  at  different  levels  of  generality, 
going  from  policy  ideas  to  programmatic  ideas  or 
paradigms  to  deeper  philosophical  ideas.  They  also 
consider  different  types  of  ideas,  including  cogni¬ 
tive  ideas  justified  in  terms  of  interest-based  logics 
and  necessity  and  normative  ideas  legitimated 
through  appeal  to  values  and  appropriateness. 
And  they  consider  different  forms  of  ideas  and 
discourse,  including  frames,  narratives,  myths, 
story  lines,  collective  memories,  scripts,  argumen¬ 
tative  practices,  discursive  struggles,  and  more. 
Many  of  these  approaches  tend  to  refer  to  agency, 
but  often  in  a  residual  manner,  through  reference 
to  the  carriers  of  ideas.  But  the  interactive  dimen¬ 
sion  of  discourse,  which  focuses  on  agents  who 
communicate  ideas,  is  equally  important  to  the 
endogenization  of  change  in  DI. 

Studies  of  the  interactive  dimension  of  discourse 
also  come  in  many  forms.  They  may  be  divided 
into  those  focused  on  what  Vivien  Schmidt  has 
called  the  discursive  construction  of  ideas  in  the 
“coordinative”  policy  sphere  or  the  discursive 
communication  of  ideas  in  a  “communicative” 
political  sphere.  The  coordinative  discourse  encom¬ 
passes  the  wide  range  of  policy  actors  engaged  in 
policy  construction  who  may  be  organized  in  epis- 
temic  communities  of  elites  with  shared  ideas,  in 
advocacy  coalitions  of  elites  with  shared  ideas  and 
policy  access,  and  advocacy  networks  of  activists 
contesting  ideas  in  international  politics  or  who 
may  act  as  entrepreneurs  who  serve  as  catalysts  for 
the  ideas  of  such  discursive  communities.  The 
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communicative  discourse  encompasses  the  wide 
range  of  political  actors  who  bring  the  ideas  devel¬ 
oped  in  the  context  of  the  coordinative  discourse 
to  the  public  for  deliberation  and  legitimation. 
These  actors  may  include  political  leaders  involved 
in  the  top-down  mass  electoral  process  of  public 
persuasion,  in  public  debates  or  in  policy  forums 
of  informed  publics.  Or  they  may  be  members  of 
civil  society  engaged  in  the  bottom-up  discursive 
interactions  of  grassroots  organizations,  of  social 
movements,  or  local  participatory  governance  and 
of  citizens  whose  voices  are  heard  not  only  in  opin¬ 
ion  polls  but  also  in  votes — where  actions  speak 
even  more  loudly  than  words. 

This  interactive  dimension  is  key  to  the  endoge- 
nization  of  change  in  DI.  Without  discourse  under¬ 
stood  as  the  exchange  of  ideas,  it  is  very  difficult  to 
explain  how  ideas  go  from  individual  thought  to 
collective  action.  We  do  not,  after  all,  know  what 
people  are  thinking  or  why  they  act  the  way  they 
do  until  they  say  it.  And  we  do  not  for  the  most 
part  engage  in  collective  action  or  in  collective  (re) 
thinking  of  our  actions  without  the  articulation, 
discussion,  deliberation,  and  legitimization  of  our 
ideas  about  our  actions.  This  is  why,  in  addition  to 
the  background  ideational  abilities  that  explain  the 
internal  processes  by  which  institutions  are  created 
and  maintained,  we  need  to  recognize  the  impor¬ 
tance  of  what  Schmidt  has  called  the  foreground 
discursive  abilities  through  which  sentient  agents 
may  change  (or  maintain)  their  institutions  follow¬ 
ing  a  logic  of  communication.  This  is  a  generic 
term  for  what  Jurgen  Habermas  calls  communica¬ 
tive  action ,  and  it  is  at  the  basis  of  theories  about 
deliberative  and  discursive  democracy,  about  pub¬ 
lic  debate,  as  well  about  coordinative  discourses  of 
policy  construction  and  communicative  discourses 
of  political  communication. 

These  foreground  discursive  abilities  are  essen¬ 
tial  to  explaining  institutional  change  because  they 
refer  to  people’s  ability  to  think  outside  the  institu¬ 
tions  in  which  they  continue  to  act,  to  talk  about 
such  institutions  in  a  critical  way,  to  communicate 
and  deliberate  about  them,  to  persuade  themselves 
as  well  as  others  to  change  their  minds  about  their 
institutions,  and  then  to  take  action  to  change 
them,  whether  by  building  discursive  coalitions  for 
reform  against  entrenched  interests  in  the  coordi¬ 
native  policy  sphere  or  informing  and  orienting  the 
public  in  the  communicative  political  sphere. 


Conveying  “good”  policy  ideas  through  a  persua¬ 
sive  discourse  helps  political  actors  win  elections 
and  gives  policy  actors  a  mandate  to  implement 
their  ideas. 

Scholars  in  the  SI  tradition  have  little  difficulty 
with  this  approach  to  discourse,  especially  since,  as 
we  have  already  seen,  discourse  analyses  of  all 
kinds  are  loosely  seen  as  part  of  SI.  Moreover,  even 
when  the  word  discourse  is  not  used,  DI  scholars  in 
the  SI  tradition  often  elucidate  its  interactive  effects, 
as  in  recent  work  on  contentious  politics,  which 
weds  organizational  and  social  movements  theory 
and  offers  evidence  of  how  leaders,  social  move¬ 
ment  activists,  and  the  everyday  public  spur  change 
through  ideas  that  persuade  through  discourse. 

Scholars  in  the  HI  tradition  also  have  little  dif¬ 
ficulty  with  discourse,  since  DI  can  add  dynamics 
to  the  historical  processes  of  ideational  change,  by 
focusing  on  who  talks  to  whom,  where,  and  when. 
Differentiating  between  a  coordinative  discourse  in 
the  policy  sphere  and  a  communicative  discourse 
in  the  political  sphere  implicitly  acknowledges  the 
importance  of  institutional  structures  as  well  as 
practices.  By  the  same  token,  however,  HI  can 
provide  additional  structure  to  DI  by  describing 
the  formal  institutional  contexts  that  shape  inter¬ 
active  patterns  of  discourse. 

Scholars  in  the  RI  tradition  have  the  greatest 
difficulty  dealing  with  the  interactive  process  of 
discourse,  or  taking  the  exchange  of  ideas  in  public 
debates  seriously,  because  talk  is  by  definition 
cheap  while  instrumental  actions  are  assumed  to 
speak  more  loudly  than  words.  But  can  the  sub¬ 
stance  of  ideas  not  matter,  as  part  of  the  persuasive 
power  of  discourse?  Margaret  Levi,  in  her  presi¬ 
dential  address  at  the  American  Political  Society 
Association  (APSA)  in  2006,  tacitly  acknowledged 
this  when  she  called  for  research  on  leaders’  com¬ 
munication  because  leaders  have  the  power  to 
inspire  change  as  well  as  the  capacity  to  change 
constituents’  beliefs — although  she  never  engaged 
with  DI  work  on  leadership,  discourse,  or  delibera¬ 
tive  democracy. 

RI’s  problems  with  ideas,  discourse,  and  delib¬ 
eration  follow  from  its  restricted  definition  of 
agency  and  rationality.  In  RI,  agents  are  rational  in 
an  unthinking  manner,  meaning  that  they  respond 
to  incentive  structures  in  ways  that  maximize  their 
interests  (expected  utility),  pursuing  their  goals  in 
accordance  with  their  beliefs  about  the  facts.  In  DI, 
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agents  are  rational  in  a  thinking  manner:  They  also 
pursue  their  goals  in  accordance  with  their  beliefs 
about  the  facts,  but — as  already  noted — they  are 
not  only  able  to  think,  say,  and  act  but  also  to 
think  about  their  thoughts,  reflect  on  their  actions, 
state  their  intentions,  alter  their  actions  as  a  result 
of  their  thoughts  about  their  actions,  say  what  they 
are  thinking  of  doing,  and  change  their  minds  in 
response  to  persuasion  by  others  regarding  what 
they  are  thinking,  saying,  and  doing.  Such  self- 
consciousness  and  self-knowledge  along  with  the 
ability  to  express  it  is  summed  up  in  the  term  sen¬ 
tient  when  used  to  define  agents  in  DI  (scholars 
also  use  the  term  reflexive),  who  may  change  their 
minds  based  on  persuasion — by  contrast  with  RI’s 
rational  agents,  for  whom  interaction  is  based  on 
manipulation,  as  agents  calculate  how  they  will 
influence  others  to  serve  their  own  interests.  A 
problem  for  RI  scholars  as  a  result  of  their  defini¬ 
tion  of  interaction  as  manipulation  is  that  they 
cannot  explain  how  one  overcomes  entrenched 
interests — that  is,  ones  that  cannot  be  coerced, 
tricked,  or  bribed  into  changing  their  actions. 

DI  approaches  focused  on  deliberative  democ¬ 
racy  in  particular  have  elaborated  on  such  insights 
related  to  communicative  action.  Deliberative 
democracy  is  seen  to  occur  when  parties  are  rea¬ 
sonable  and  use  evidence-based  arguments  to 
reach  agreement,  where  persuasion  is  the  key  to 
creating  shared  understandings  and  building  con¬ 
sensus,  and  in  which  the  process  itself  is  based  on 
inclusive,  open,  trusting,  and  consensual  interac¬ 
tion.  Deliberative  democracy  is  considered  to  be  a 
better  form  of  decision  making  because  open  dia¬ 
logue  may  unlock  untapped  knowledge,  generate 
new  skills  and  know-how,  produce  higher  quality 
reasoning  for  more  legitimate  policies,  and  create 
new,  more  collaborative  interrelationships  among 
the  parties  to  the  deliberation.  Deliberation  need 
not  be  limited  to  idealized  communication  situa¬ 
tions,  however,  especially  since  power  and  inter¬ 
ests  cannot  easily  be  eliminated  from  deliberations, 
although  being  aware  of  these  may  help  minimize 
the  potential  effects  of  domination.  But  awareness 
of  power  and  interests  or  even  manipulation  is  no 
guarantee  of  discursive  success. 

Deliberation  on  its  own,  in  other  words,  does 
not  necessarily  ensure  a  more  democratic  outcome. 
Power  and  position  do  matter.  The  question  is 
how  to  define  power  and  position  in  such  a  way  as 


to  also  take  account  of  the  power  of  ideas  and 
discourse.  The  problem  with  RI  and  HI  is  that  they 
tend  to  reify  questions  of  power  and  position  by 
assuming  that  power  is  a  function  of  position  and 
that  strategic  interests  derive  primarily  from  agents’ 
power  and  position.  DI  holds  instead  that  power 
cannot  be  defined  by  (objective)  position  alone, 
since  ideas  and  values  infuse  the  exercise  of  power 
and  (subjective)  perceptions  of  position.  Moreover, 
actors  can  gain  power  from  their  ideas  even  where 
they  may  lack  the  power  of  position — as  in  the 
case  of  social  movements  or  entrepreneurial  actors 
who  set  the  agenda  for  reform  in  policy  or  political 
spheres.  Power  itself,  moreover,  derives  not  only 
from  position,  meaning  actors’  ability  to  wield 
power,  but  also  purpose,  since  actors’  ideas  and 
discourse  about  how  they  can  and  should  wield 
that  power  (i.e.,  not  just  in  their  own  strategic 
interests  but  in  the  general  interest)  may  reinforce 
or  undermine  the  power  they  derive  from  their 
position,  depending  on  the  responses  of  their  audi¬ 
ence  to  their  stated  purposes.  This  is  the  essence  of 
political  leadership. 

Further  support  for  the  view  that  discourse  and 
deliberation  are  necessary  complements  to  investi¬ 
gations  of  power,  position,  and  interests  also  comes 
from  experimental  political  psychology,  which 
seeks  to  probe  the  nature  and  limits  of  (RI  defined) 
human  rationality,  in  particular  with  regard  to 
framing  effects.  Framing  effects  occur  when  differ¬ 
ent  but  logically  equivalent  phrases  cause  individu¬ 
als  to  alter  their  preferences  irrationally  (in  RI 
terms),  for  example,  when  people  reject  a  policy 
program  when  told  about  its  negative  effects  (e.g., 
it  leads  to  5%  unemployment)  and  accept  it  when 
told  of  its  positive  effects  (e.g.,  it  leads  to  95% 
employment).  This  represents  a  blow  to  RI,  which 
assumes  total  information  on  the  part  of  rational 
actors,  at  the  same  time  that  it  provides  an  opening 
for  DI.  This  is  because  framing  effects  are  shown  to 
be  moderated  by  contextual  forces  involving  elite 
competition  and  rhetoric  that  result,  as  James 
Druckman  has  shown,  in  a  process  of  framing  and 
counterframing  (DI’s  communicative  discourse), 
interpersonal  conversation  in  heterogeneous  groups 
among  citizens  (DI’s  deliberative  democracy),  or 
discussions  among  homogeneous  groups  of  experts 
(DI’s  coordinative  discourse).  This  said,  experimen¬ 
tal  political  psychology  also  demonstrates  the  limits 
of  deliberative  effectiveness  and  the  importance  of 
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not  idealizing  deliberation,  given  principles  of 
human  cognition  that  point  to  limited  attention 
spans,  cases  in  which  communication  may  reduce 
participants’  persuasiveness,  and  the  importance  of 
power  relationships. 

One  final  illustration  of  the  value  of  DI 
approaches  to  the  explanation  of  change  (and  con¬ 
tinuity)  comes  from  the  insights  it  can  lend  to  the 
use  of  the  different  forms  of  institutionalism.  RI 
accounts  tend  to  present  human  responses  to  given 
incentive  structures  as  inevitable.  This  can  be  seen 
as  part  of  a  normative  discursive  strategy  to  get  the 
reader  to  accept  this,  or  to  rebel  against  it,  as  in  the 
case  of  globalization  and  convergence  to  a  single 
neoliberal  model.  HI  accounts  of  path-dependent 
institutions,  in  particular  where  they  combine  with 
RI,  may  add  the  inexorable  to  the  inevitable.  With 
regard  to  the  globalization  of  capitalism,  it  can  be 
seen  as  part  of  a  normative  discursive  strategy  to 
suggest  that  there  is  no  convergence  in  capitalism 
since  there  are  two  different  varieties  that  will 
continue  into  perpetuity.  SI  accounts  of  cultural 
framing  may  dispute  both  RI  inevitability  and  HI 
inexorability  by  claiming  that  cultural  embedded¬ 
ness  makes  all  national  cultures  incomparable.  This 
can  in  itself  be  part  of  a  normative  discursive  strat¬ 
egy  to  resist  attempts  to  impose  any  kind  of  inter¬ 
nationalized  order  on  national  economies.  And 
what  kinds  of  ideas  and  normative  discourse  are 
embedded  in  DI,  then?  That  there  is  nothing  inevi¬ 
table,  inexorable,  or  incomparable  in  this  world, 
because  the  future  is  open  to  new  ideas  conveyed  by 
discourse.  With  regard  to  capitalism,  it  would  dis¬ 
pute  the  inevitability  of  neoliberal  convergence,  the 
inexorability  of  a  binary  split  in  capitalism,  and  the 
incomparability  of  culturally  embedded  forms  of 
capitalism.  Instead,  DI  shows  that  rationalist  logics, 
historical  path  dependencies,  and  cultural  frames 
are  conditional  on  public  choices  that  result  not  just 
from  the  power  clash  among  interests,  the  preroga¬ 
tives  of  position,  or  the  scripts  of  culture  but  from 
the  battle  of  ideas  through  discourse  and  delibera¬ 
tion.  The  global  economic  meltdown  clearly  dem¬ 
onstrates  the  ideational  underpinnings  of  many 
views  of  capitalism. 


institutionalisms,  and  yet  others  remain  squarely 
within  one  or  another  institutionalist  approach. 
To  get  a  sense  of  how  all  of  this  fits  together  in  a 
very  general  way,  Figure  1  situates  many  of  the 
scholars’  works  cited  above  within  each  of  the 
four  institutionalisms  while  arraying  these  along  a 
horizontal  continuum  from  interests  to  culture, 
with  history  in  between — and  along  a  vertical  con¬ 
tinuum  from  statics  to  dynamics,  with  interests, 
history,  and  culture  at  the  static  end,  ideas  and 
discourse  at  the  dynamic  end.  HI  sits  between  RI 
and  SI,  mainly  because  RI  and  SI  are  largely 
incompatible,  whereas  HI  can  go  to  either  side 
when  it  adds  agency.  DI  comes  underneath  all 
three  because,  although  it  is  distinctive,  it  can  rest 
on  the  insights  of  any  one  of  the  three  and  because 
scholars  often  see  themselves  as  continuing  to  fit 
into  one  or  another  of  the  traditions  even  as  they 
cross  the  line  into  DI. 

This  leaves  one  final  question:  what  is  the  added 
value  of  approaches  that  take  ideas  and  discourse 
seriously,  by  contrast  with  the  other  three  new  insti- 
tutionalisms?  DI  endogenizes  change,  explaining 
much  of  how  and  why  public  actors  bring  about 
institutional  change  through  public  action.  With 
regard  to  the  other  institutionalisms,  moreover,  the 
discursive  approach  helps  explain  the  actual  prefer¬ 
ences  and  strategies  of  actors  in  RI  and  HI,  and  it 
helps  to  explain  changes  in  the  normative  orienta¬ 
tions  emphasized  by  SI.  Where  DI  can  go  wrong 
is  when  it  considers  ideas  and  discourse  to  the 
exclusion  of  issues  of  power  (read  RI  instrumental 
rationality)  and  position  (read  HI  institutional 
structures),  when  it  assumes  that  DI  deliberation 
necessarily  trumps  RI  manipulation,  or  when  it 
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Conclusion 

While  some  scholars  have  moved  from  one  of  the 
older  neo-institutionalisms  to  DI,  others  straddle 


Figure  I  A  Visual  Map  of  the  Four  Institutionalisms 

Note:  DI  =  discursive  institutionalism;  HI  =  historical 
institutionalism;  RI  =  rational  choice  institutionalism;  SI  = 
sociological  institutionalism. 
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overdetermines  the  role  of  ideas  and  discourse  by 
forgetting  that  “stuff  happens”  or  that  historical 
institutions  and  cultural  frames  affect  the  ways  in 
which  ideas  are  expressed  and  discourse  conveyed. 
We  should  not  forget  that  ideas  and  discourse  that 
seek  to  promote  change  often  have  little  effect  on 
the  crystallized  ideas  about  rationalist  interests  and 
cultural  norms  or  on  the  frozen  landscapes  of  ratio¬ 
nalist  incentives,  historical  paths,  and  cultural 
frames.  The  research  agenda  for  political  scientists 
concerned  with  explaining  institutional  change 
endogenously,  therefore,  should  be  to  use  DI  to 
show  empirically  how,  when,  where,  and  why 
ideas  and  discourse  matter  for  institutional  change, 
and  when  they  do  not,  that  is,  when  RI,  HI,  and  SI 
may  be  sufficient  because  change  can  be  explained 
better  exogenously. 
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Institutionalization 


Institutionalization  is  the  process  by  which  organi¬ 
zations  acquire  identity  and  legitimacy.  Institutionali¬ 
zation  involves  more  than  building  formal  structures 
and  processes.  For  organizations  to  become  insti¬ 
tutions,  structures  need  to  be,  as  famously  noted 
by  Philip  Selznick  (1957),  “infused  with  value 
beyond  the  technical  requirements  of  the  task  at 
hand”  (p.  22).  This  is  true  for  corporations  and 
churches  as  much  as  for  political  parties  and  gov¬ 
ernment  agencies  or  for  entire  political  systems. 
Institutionalization  means  members  begin  to  value 
the  organization  for  itself;  it  elicits  a  normative, 
value-based  commitment  or  is  supported  by  deeply 
held  cultural  beliefs  in  its  mission.  The  result  is  a 
high  degree  of  legitimacy  with  both  members  and 
external  stakeholders  in  the  organization’s  environ¬ 
ment.  Internally,  institutionalization  involves  achiev¬ 
ing  a  consensus  on  the  organization’s  mission 
and  goals  (identity)  beyond  the  acquisition  of  neces¬ 
sary  resources  and  skills.  Externally,  a  highly  insti¬ 
tutionalized  organization  enjoys  a  high  degree  of 
social  acceptability  with  key  constituencies  or  the 
public  at  large  (legitimacy).  Institutionalization  is  a 
process  with  different  stages:  Once  an  organization 
has  developed  effective  working  practices,  these 
are  transformed  into  norms  that  get  accepted  and 
then  embedded  in  organizational  life.  However, 
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institutionalization  is  not  a  linear  process.  Being, 
or  rather  becoming,  an  institution  is  a  matter  of 
degree.  Organizations  or  political  systems  for  that 
matter  exhibit  different  degrees  of  institutionaliza¬ 
tion.  Importantly,  over  time,  institutionalization 
may  move  in  both  directions — that  is,  increase  or 
erode  (deinstitutionalization),  or  simply  stall. 

The  concept  of  institutionalization  has  been 
developed  mainly  by  organizational  sociologists 
who  emphasize  that  individual  action  is  shaped 
by,  and  embedded  in,  larger  social  structures. 
Political  scientists  have  tended  to  look  at  institu¬ 
tions  as  given,  fixed  frameworks  for  political 
behavior  but  have  shown  less  interest  in  their 
emergence,  with  the  exception  of  constitutional 
design.  The  origin  of  public  organizations  in  par¬ 
ticular  remains  a  neglected  area  of  research  for 
which  the  concept  of  institutionalization  can  serve 
as  a  useful  analytical  tool.  This  entry  first  reviews 
different  concepts  and  mechanisms  of  institution¬ 
alization  and  then  presents  typical  criteria  of  insti¬ 
tutionalization  before  discussing  some  constraints 
and  challenges  of  institutionalization  faced  by 
public  sector  organizations. 

Concepts  and  Mechanisms 
of  Institutionalization 

Any  discussion  of  institutionalization  needs  to  be 
based  on  an  understanding  of  the  term  institution. 
At  the  most  general  level,  we  can  think  of  institu¬ 
tions  as  “patterned  behavior,”  as  relatively  stable, 
valued  sets  of  formal  and  informal  rules,  norms,  and 
practices  that  constrain  but  also  enable  political 
behavior.  In  short,  institutions  give  structure  to 
political  life,  but  they  also  emerge  from  actor-based 
social  and  political  processes.  A  key  question  for 
political  scientists  then  is  not  only  to  what  extent 
institutional  structures  shape  or  even  determine  indi¬ 
vidual  political  behavior  and  processes  but  also  to 
what  degree  actors  can  purposefully  create,  design, 
or  mold  institutions  to  serve  their  needs  or  interests. 
This  duality  of  actor  (or  agency)  and  structure  lies  at 
the  heart  of  institutionalization  research. 

There  is  considerable  disagreement  among  dif¬ 
ferent  schools  of  new  institutionalism  about  the 
role  of  institutions  and  about  the  drivers  and 
mechanisms  of  institutionalization.  The  most 
important  divide  runs  between  rational  choice 
institutionalism  and  sociological  (or  organizational 


or  normative)  institutionalism.  In  the  rational 
choice  approach,  institutions  are  essentially  “rules 
of  the  game”  that  emerge  as  the  result  of  political 
choice  by  self-interested  actors,  be  it  to  solve  col¬ 
lective  action  problems  or  to  advance  specific 
interests.  This  suggests  that  institutional  arrange¬ 
ments  are  rather  malleable  and  open  to  design.  A 
typical  approach  in  this  vein  is  the  principal-agent 
framework:  Political  principals  create  and  delegate 
powers  to  administrative  or  international  agents  to 
perform  certain  functions  or  to  entrench  their 
policy  preferences.  Yet  organizational  arrange¬ 
ments  can  be  stabilized,  or  institutionalized, 
through  mechanisms  of  “lock-in,”  based  on  the 
logic  of  increasing  returns:  Actors  have  an  incen¬ 
tive  to  stay  on  a  path  of  institutional  development 
once  it  is  chosen  not  because  it  is  functionally 
superior  but  because  they  are  rewarded  by  positive 
feedbacks,  such  as  learning  and  coordination  ben¬ 
efits.  Historical  institutionalism  makes  a  similar 
argument  about  the  lock-in  effect  of  choices  (at 
critical  junctures)  in  the  formative  period  of  insti¬ 
tutions  (“path  dependency”).  However,  it  gives 
more  importance  to  the  role  of  ideas  in  sustaining 
institutional  trajectories.  This  constructivist  com¬ 
ponent  brings  historical  institutionalism  closer  to 
sociological  institutionalism. 

The  sociological  approach  views  individual 
behavior  as  governed  by  norms  and  rules  that 
define  appropriate  (as  opposed  to  instrumental) 
behavior  according  to  certain  values.  The  key 
mechanism  of  institutionalization  is  an  increase  of 
normative  commitment  to  the  organization.  In  this 
account,  formal  structures  and  processes,  which 
are  the  center  of  many  rational  choice  approaches, 
are  only  a  thin  layer;  “thick  institutionalization” 
(as  Selznick  says)  occurs  when  value-based  com¬ 
mitment  of  internal  and  external  stakeholders  is 
solidified,  as  expressed  in  unifying  ideologies  or 
rituals  or  in  the  administrative  entrenchment  of  the 
organization’s  mission.  A  more  strongly  construc¬ 
tivist  strand  of  the  sociological  approach  holds 
that  culturally  determined  cognitive  beliefs,  rather 
than  values,  are  the  main  factor  in  the  institution¬ 
alization  process.  Certain  organizational  practices 
are  “objectified” — that  is,  they  are  institutional¬ 
ized  in  the  sense  of  being  taken  for  granted  as 
social  facts.  This  is  the  most  powerful  form  of 
institutionalization  as  it  does  not  depend  on  incen¬ 
tives  or  shared  values  any  more. 
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The  different  institutionalisms  shed  light  on  dif¬ 
ferent  dimensions  and  two  stages  of  the  institu¬ 
tionalization  process.  Rational  choice  approaches 
take  an  actor  perspective  and  focus  on  the  initial 
creation  of  a  formal  organization  for  a  specific 
purpose.  This  perspective  runs  the  risk  of  explain¬ 
ing  institutional  creation  by  exclusive  reference  to 
functional  needs  and  rational  calculation.  It  may 
overestimate  the  room  for  rational  design  and 
neglect  obstacles  to  sustained  institutionalization. 
Constructivist  schools  insist  on  the  overriding 
importance  of  the  second,  more  complex  step  to 
imbue  formal  structures  with  values  and  beliefs. 
Conversely,  this  structural  approach  leaves  open 
questions  as  to  how  the  various  actors  and  inter¬ 
ests  populating  the  volatile  world  of  politics  relate 
to  abstract  processes  of  norm  routinization  or 
harmonization  (politics  of  institutionalization). 

Criteria  of  Institutionalization 

How  can  we  assess  the  degree  to  which  an  organi¬ 
zation  has  been  institutionalized?  At  a  general  level, 
the  degree  of  standardization  of  organizational 
procedures  and  practices  is  an  important  indicator. 
However,  routinization  of  behavior  may  in  practice 
be  disconnected  from  “value  infusion” — that  is, 
from  the  degree  of  legitimacy  that  the  organization 
can  command.  Samuel  Huntington  proposed  four 
more  specific  criteria  or  attributes:  (1)  adaptability 
refers  to  the  extent  to  which  organizations  have 
learned  to  adapt  to,  and  deal  with,  a  dynamic  envi¬ 
ronment,  which  is  largely  a  function  of  age  and 
experience;  (2)  complexity  (the  internal  equivalent 
of  adaptability)  refers  to  the  capacity  of  the  organi¬ 
zation  to  develop  differentiated  internal  structures 
(diversification)  that  create  stability  and  help  it 
thrive  in  a  changing  environment;  (3)  autonomy  is 
not  only  the  most  interesting  criterion  from  a  public 
organization’s  perspective  but  also  the  most  diffi¬ 
cult  to  measure;  it  seeks  to  capture  the  extent  to 
which  an  organization  is  independent  and  able  to 
make  its  own  decisions  based  on  protection  from 
external  influence  and  control  over  its  resources; 
(4)  coherence  refers  to  the  degree  of  internal  disci¬ 
pline  and  unity,  the  level  of  consensus  on  institu¬ 
tional  purposes  and  procedures;  more  coherence 
results  in  more  effective  ways  of  working. 

The  profile  of  a  highly  institutionalized  organi¬ 
zation  or  entity,  then,  would  include  the  following 


features:  advanced  internal  differentiation  of  struc¬ 
tures  as  well  as  solid  consensus  and  coherence, 
control  over  decisions,  including  resources,  and 
the  capacity  to  manage  the  boundaries  with  the 
organization’s  environment. 

In  the  case  of  public  sector  entities,  however, 
institutionalization,  defined  as  the  attempt  to 
maintain  some  autonomy  from  the  environment 
through  internal  differentiation  and  external 
boundary  management  seems  particularly  diffi¬ 
cult  to  achieve — and  not  always  desirable  from  a 
perspective  of  democratic  control.  Unless  the 
organization  in  question  occupies  a  monopoly  or 
a  similarly  privileged  position  (National  Aero¬ 
nautics  and  Space  Administration  [NASA]  is  a 
good  example),  there  are  many  constraints  hold¬ 
ing  back  public  sector  organizations  in  their  quest 
to  fully  institutionalize. 

Constraints  and  Challenges  of 
Institutionalization  in  the  Public  Sector 

First,  public  organizations  often  operate  under  a 
rigid  mandate  that  imposes  complex  and  mutually 
exclusive  goals.  The  use  of  public  authority  and 
associated  norms  of  accountability  and  predictabil¬ 
ity  demand  that  public  organizations  formalize  their 
practices  to  a  higher  degree  and  demonstrate  suffi¬ 
cient  congruence  with  the  social  environment  that 
they  are  supposed  to  serve  or  regulate.  Ambitious 
expectations,  set  by  a  political  process  that  has  to 
accommodate  various  interests,  are  often  not 
matched  by  sufficient  resources.  Politicians  may  be 
loath  to  delegate  sufficient  powers  and  discretion. 
The  political  process  is  highly  volatile,  and  especially 
a  start-up  organization  may  not  be  granted  sufficient 
time  by  impatient  constituencies  to  prove  itself  and 
consolidate.  Second,  newcomers  often  face  tough 
competition  for  turf  in  a  crowded  institutional 
space.  Successful  institutionalization  may  even 
require  the  dismantling  or  deinstitutionalization  of 
competing  structures  or  organizations.  Third,  there 
is  the  normative  concern  in  democratic  societies 
about  “runaway  bureaucracies” — public  organiza¬ 
tions  that  abuse  autonomy  by  placing  their  own 
interests  above  the  public  mission.  However,  keep¬ 
ing  public  entities  strictly  accountable  and  respon¬ 
sive  may  curtail  their  ability  to  develop  their  own 
identity  and  ambition;  there  might  be  a  trade-off 
between  internal  identity  and  external  legitimacy. 
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Given  all  these  constraints,  how  is  successful 
institutionalization  at  all  possible?  The  public 
administration  literature  is  indeed  replete  with 
accounts  of  failed  institutionalization  and  patholo¬ 
gies  of  misguided  institutionalization.  At  the  same 
time,  successfully  institutionalized  public  organiza¬ 
tions  do  exist.  A  functional  account  that  explains 
the  origin  of  public  organizations  by  the  needs  they 
purportedly  serve  (to  cope  with  new  tasks  or 
conditions)  is  clearly  inadequate  as  it  ignores  the 
complex  politics  of  institutionalization  that  is  con¬ 
ditioned  by  existing  institutional  orders  and  power 
distributions.  First,  contingent  events,  such  as  the 
occurrence  of  crises,  can  be  an  important  factor  to 
explain  the  success  of  an  institution:  It  manages  to 
position  itself  as  the  adequate,  even  necessary, 
response  to  an  urgent  situation.  Second,  some 
institutional  newcomers  benefit  from  fashionable 
trends  in  public  sector  organization,  such  as  the 
creation  of  agencies  at  arm’s  length  from  political 
executives.  Start-ups  that  conform  to  these  “iso¬ 
morphic  pressures”  are  more  likely  to  overcome 
obstacles  to  institutionalization.  Third,  and  finally, 
the  role  of  leadership  deserves  further  study,  as 
suggested  by  Arjen  Boin  and  Tom  Christensen: 
leaders  not  in  the  sense  of  charismatic  visionaries 
but  leaders  as  facilitators  of  process  who  guide  the 
emerging  entity  through  the  different  stages  of 
institutionalization,  striking  the  right  balance 
between  internal  development  and  external 
demands. 
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Institutions  and 
Institutionalism 


Most  fundamentally,  the  social  sciences  have 
defined  institutions  as  human  made.  As  they  were 
created  by  men  and  women,  institutions  order 
social,  political,  economic,  and  even  cultural  inter¬ 
course.  Indeed,  institutions  constitute  the  very 
basis  for  human  interaction.  Consequently,  insti¬ 
tutions  bear  within  them  equally  the  potential 
danger  of  the  most  deep-seated  social  control  as 
well  as  the  promise  of  human  liberation  from  both 
the  social  bond  and  the  constraints  of  nature. 
Institutionalism  is  the  study  of  the  origins,  effects, 
and  potential  for  reform  of  institutions. 

Definitions 

The  meaning  of  the  term  institution  is  wide  rang¬ 
ing,  from  more  restricted  to  more  elaborate  and 
also  along  several  dimensions.  One  dimension  is 
the  degree  of  formalization  of  an  institution.  At  the 
most  informal  pole,  it  is  common  to  speak  of  hab¬ 
its,  customs,  or  conventions.  A  habit  is  any  action 
whose  repetitive  nature  comes  to  be  recognized  by 
a  self-conscious  actor  and  thus  is  internally  repre¬ 
sented.  A  custom  is  a  habit  that  is  shared  by  mem¬ 
bers  of  a  collectivity  and  hence  social.  A  convention 
is  any  agreed-on  procedure.  Language  itself  is  a 
convention,  as  both  the  meanings  of  words  and 
syntactical  structures  require  social  agreement  for 
communication.  The  same  is  true  for  other  social 
“codes”  such  as  myths  and  rituals  that  both 
embody,  and  thus  communicate,  social  ideas  and 
ideals  but  yet  require  some  understanding  of  such 
collective  representations  for  their  decoding.  To  the 
extent  that  a  convention  is  adopted  by  ever  larger 
numbers  of  people  and  comes  to  be  collectively 
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binding,  it  is  eventually  described  as  an  institution. 
This  movement  from  an  informal  to  a  formal  setup 
is  termed  institutionalization.  As  it  entails  a  shift 
from  an  individual  to  a  society  and  from  freedom 
to  constraint,  it  can  be  viewed  as  a  transition  from 
nature  to  culture. 

The  degree  to  which  institutions  are  collectively 
binding,  however,  constitutes  a  second  dimension 
of  variation.  A  tradition  or  folkway  has  been  fol¬ 
lowed  over  time  and  by  a  particular  group,  such 
that  an  individual’s  cultural  identity  may  incline 
him  or  her  to  adhere  to  a  given  custom.  The 
French  term  moeurs  and  the  English  mores ,  which 
correspond  to  the  German  Sitte,  connote  slightly 
more  social  obligation.  A  norm,  which  may  be 
defined  as  an  internalized  belief,  is  more  strongly 
binding,  because  its  transgression  is  open  to  moral 
or  social  sanction.  The  Hegelian  distinction 
between  Sittlichkeit  and  Moralitat  is  based  on  the 
isolation  of  the  moral  rule  from  the  original  moral 
community  (Gemeinschaft)  and  its  transfer  to  the 
society  ( Gesellschaft ),  after  which,  ultimately,  it 
undergoes  a  synthesis  through  the  state  (Staat).  A 
law  is  a  collectively  binding  decision  whose  inter¬ 
pretation  is  subject  to  adjudication  only  in  special¬ 
ized  juridical  bodies  and  by  breaking  which  one  is 
subjected  to  punishment,  again  by  public  institu¬ 
tions  with  a  monopoly  on  the  exertion  of  legiti¬ 
mate  force,  as  pointed  out  by  Max  Weber. 
Consequently,  the  workings  of  some  institutions 
may  depend  on  other  institutions  or  on  institution¬ 
alized  settings.  As  discussed  in  classical  political 
theory,  the  establishment  of  the  state  or  Leviathan 
is  the  decisive  shift  accomplished  by  the  move  from 
the  state  of  nature  to  civil  society. 

In  a  similar  vein,  institutions  vary  with  regard 
to  whether  they  are  self-reinforcing  or  require 
intervention  by  some  sort  of  meta-institutions.  If  a 
certain  rule  is  adhered  to  by  all  relevant  actors 
because  it  is  in  their  self-interest  to  conform,  the 
rule  is  termed  to  be  self-enforcing.  However,  some 
perspectives  on  institutions  claim  that  rules  need 
to  be  elaborated  for  actors  to  be  able  to  under¬ 
stand  how  to  follow  them — or  indeed  to  calculate 
their  self-interest — such  that  cultural  and  social 
guidelines  as  to  what  is  appropriate  behavior  must 
flesh  out  the  skeletal  “rules  of  the  game.” 
Furthermore,  some  sets  of  preferences  and  rules 
(such  as  majority  rule)  may  not  necessarily  pro¬ 
duce  stable  outcomes.  Instead,  more  elaborate 


procedures  may  set  limits  to  the  possibility  of 
choosing,  such  that  one  may  speak  of  institutional 
or  structure-induced  equilibrium,  as  opposed  to  a 
natural  or  preference-induced  equilibrium.  This  is 
the  subject  both  of  public  choice  or  rational  choice 
theory  and  of  institutional  economics.  Here,  we 
see  that  the  study  of  institutions  may  extend  from 
a  narrowly  defined  set  of  rules  to,  first,  a  more 
elaborated  context  necessary  for  understanding 
the  workings  of  these  institutions  and,  second,  as 
T.  R.  Voss  emphasizes,  to  an  analysis  of  the  equi¬ 
librium  outcomes  produced  by  these  institutions 
and  institutional  contexts. 

The  term  institution  refers  to  the  action  of 
introducing  an  institution,  to  the  identity  of  the 
actor  (Instituteur)  that  introduced  the  institution, 
and  to  the  thing  that  has  been  introduced. 
Institutions  can  be  introduced  by  religious  and 
secular  authorities  as  well  as  by  groups  and  com¬ 
munities,  as  in  canonical  institutions,  institutions 
of  inheritance,  and  social  institutions.  Institutions 
are  created  by  divine  or  mortal  beings  and,  hence, 
set  apart  from  nature.  Institutions  may  produce  a 
person  of  high  morality,  a  group,  or  a  regime. 
Even  Charles  de  Montesquieu  defined  laws  very 
broadly  to  include  both  the  institutions  of  the  leg¬ 
islature  and  the  mores  and  manners  of  the  nation, 
in  general.  Thus,  institutions  include  the  totality  of 
social  forms  and  structures  and  may  be  established 
by  law  or  custom.  Consequently,  three  types  of 
institutions  are  especially  important:  (1)  social 
institutions,  such  as  kinship,  marriage,  family,  and 
inheritance;  (2)  constitutions,  which  are  the  writ¬ 
ten  or  unwritten  law  governing  the  exercise  of 
public  power  and  the  procedures  for  making  laws 
themselves;  and  (3)  regimes,  which  refers  to  the 
process  of  giving  something  the  character  of  an 
institution.  Social  institutions  govern  relationships 
among  individuals  within  societies  and  hence  estab¬ 
lish  a  social  order.  Constitutions  regulate  the  rela¬ 
tionships  among  citizens,  political  representatives, 
and  the  state  and  hence  create  a  political  order. 
Regimes  are  often  found  in  the  international  arena, 
where,  being  beyond  the  reach  of  the  sovereignty  of 
nation-states,  international  agreements  are  used  to 
create  international  regimes  or  international  orders, 
and  international  organizations  with  the  legitimacy 
and  normative  weight  of  an  institution  are  active  in 
the  implementation  of  the  agreed-on  normative 
order  or  international  norms. 
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To  institutionalize  something  or  the  institutional¬ 
ization  of  something  may  refer  to  the  frequency, 
permanence,  or  widespread  nature  of  a  habit,  a 
virtue,  or  even  a  vice — the  granting  of  an  official 
status  as  institution  to  a  custom  or  procedure. 
Institution  also  refers  to  the  inculcation,  indoctrina¬ 
tion,  or  introduction  of  norms,  habits,  and  knowl¬ 
edge  in  various  forms  of  instruction,  education,  or 
upbringing.  In  addition,  institution  or  institute  may 
refer  to  the  corporate  body  or  building  from  which 
such  instruction  is  imparted,  such  as  a  private  insti¬ 
tution  or  institute  of  scientific  study.  Institutions 
meant  for  instruction  or  induction  (e.g.,  the  mili¬ 
tary,  clerical  institutions)  form  and  regulate  indi¬ 
viduals  and  hence  should  (in  theory)  be  reliable. 
Closed  or  total  institutions  are  responsible  for 
psychic  and  physical  healing,  such  as  hospitals  or 
psychiatric  institutions.  Financial  institutions  and 
institutional  investors  hold  money  in  trust  for  large 
numbers  of  individuals  and  should  (again,  in  theory) 
be  held  accountable  to  their  investors.  Individuals 
who  have  fully  imbibed  and  internalized  such 
instruction  and  are  employees  of  such  institutions 
are  known  as  professionals.  In  each  of  these  trust 
relationships,  there  is  a  potential  for  betrayal  of 
trust,  which  has  been  the  focus  of  much  social  sci¬ 
entific  analysis  of  institutions  and  professions. 

Financial  institutions,  institutional  investors, 
and  the  governmental  framework  for  monetary 
transactions  all  form  markets.  Political  institutions 
and  constitutions  frame  public  policies.  Social  insti¬ 
tutions  help  socialize  individuals;  different  psycho¬ 
logical  stages,  pedagogy,  and  social  relations  help 
form  the  psyche.  The  study  of  these  institutional 
effects  is  known  as  institutionalism.  Unlike  institu¬ 
tions  that  are  viewed  as  arbitrary,  institutionalism 
is  by  nature  a  relativistic  approach:  Institutions  that 
may  have  been  introduced  as  arbitrary  results  of 
contingent  events  may  have  unintended  conse¬ 
quences  for  human  nature,  societies,  politics,  and 
markets.  Consequently,  institutionalists  view  devel¬ 
opments  in  these  spheres  as  artifacts  of  institutions, 
and  hence,  they  are  neither  natural  nor  necessarily 
desirable. 

Institutions  and  Institutionalism 
Spanning  the  Social  Sciences 

The  study  of  institutions  and  institutional  effects 
spans  the  social  sciences.  In  the  areas  of  philosophy 


of  knowledge,  philosophy  of  right,  and  political 
philosophy,  institutions  have  been  viewed  both  as 
mental  representations  and  concrete  political 
structures.  As  cognitive  representations,  institu¬ 
tions  help  structure  thought,  thus  constituting 
perceptual  lenses  or  schema.  The  origins  and 
legitimacy  of  social  and  political  institutions  has 
been  a  perennial  problem,  being  divided  into 
historical  or  empirical  and  rational  views.  The 
historical  view  shared  by  classical  authors  such  as 
Georg  Friedrich  Flegel  sees  traditions,  customs, 
norms,  laws,  and  institutions  as  inheritances  that 
have  achieved  legitimacy  (if  they  have  indeed 
achieved  it)  by  standing  the  test  of  time.  Moreover, 
their  functioning  rests  on  this  historical  context. 
The  rational  view,  by  contrast,  aims  to  distill  the 
essence  of  an  institution  via  logical  analysis,  often 
through  recourse  to  a  hypothetical  account  of 
institutional  origins  or  the  use  of  a  particular  pro¬ 
cedure,  as  in  the  contract  theory  of  Thomas 
Hobbes  or  John  Locke.  The  work  of  Jean-Jacques 
Rousseau  bridges  these  extremes  by  taking  a  his¬ 
torical,  anthropological  view  of  the  development 
of  social  institutions  but  a  contract  view  of  institu¬ 
tional  legitimacy;  freedom  is  defined  as  living 
under  a  rule  one  makes  for  oneself.  Such  proce¬ 
dural  legitimacy  is  further  developed  by  Immanuel 
Kant,  John  Rawls,  and  Jurgen  Habermas  but  has 
been  called  into  question  in  the  so-called  commu- 
nitarianism  debate. 

One  strand  of  the  sociological  and  anthropo¬ 
logical  tradition  views  institutions  as  cultural 
products  that  substitute  for  instinct  and  thus  con¬ 
fer  an  evolutionary  advantage  for  human  societies 
with  particular  institutions.  More  generally,  insti¬ 
tutions  create  social  order  both  by  specific  sets  of 
positive  and  negative  sanctions  imposed  to  prevent 
social  deviance  and  by  giving  symbolic  expression 
to  the  social  order,  for  example,  through  myths 
and  rituals.  Indeed,  the  very  categories  of  thought, 
such  as  notions  of  time  and  space,  are  socially  con¬ 
tingent  and  linked  to  social  organization.  Thus,  the 
sociological  meaning  of  the  term  institution  ranges 
from  concrete  social  practices  or  structures  to 
ideas,  representations,  and  even  socially  contingent 
interpretations  of  ideas  and  categories,  as  we 
observe  in  the  social  thought  of  Emile  Durkheim. 

A  central  division  deals  with  whether  institutions 
are  functional  and  hence  have  evolved  from  univer¬ 
sal  human  needs,  even  if  the  exact  institutional 
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configurations  vary  from  society  to  society,  or 
whether  institutions  are  more  aptly  viewed  as  his¬ 
torical  residues  and  products  of  social  interpreta¬ 
tion.  A  second  problematic  concerns  the  issue  of 
free  will  versus  determinism  (also  referred  to  as 
agency  vs.  structure):  Do  institutions  determine 
behavior  or  simply  make  some  courses  of  action 
more  likely  because  they  appear  to  be  “normal”  or 
“appropriate”  and  may  be  sanctioned  by  punish¬ 
ment  or  social  disapproval  if  not  followed?  Talcott 
Parsons’s  theory  of  action  proposed  that  previous 
patterns  of  behavior  serve  as  points  of  orientation 
for  actors,  for  example,  in  establishing  a  repertoire 
of  social  roles,  but  that  individuals  are  free  to  con¬ 
form  or  deviate  from  these  expected  patterns. 

The  lines  between  sociological,  social- 
psychological,  and  psychological  views  on  institu¬ 
tions  are  blurred,  as  all  share  a  focus  on  mental 
representations,  of  which  language  and  cognitive 
operations  are  central.  Here,  the  idea  that  the 
meaning  of  sounds  is  arbitrary  and  that  meaning 
arises  from  the  juxtaposition  of  phonetic  opposi¬ 
tions  as  well  as  the  focus  on  classificatory 
systems,  binary  codes,  and  cognitive  routines 
from  the  fields  of  computer  science  and  artificial 
intelligence  have  been  quite  influential.  These 
representations  or  routines  not  only  mediate 
between  the  individual  subject  and  the  external 
world  but  are  also  formed  or  canalized  through 
these  interactions,  as  in  Jean  Piaget’s  theories  of 
developmental  stages  or  Sigmund  Freud’s  focus 
on  early  childhood  traumas.  Given  the  potentially 
arbitrary  nature  of  these  historic  residues,  an 
emancipatory  program  in  social  psychology  is 
possible — although  its  adherents,  such  as  Erich 
Fromm  or  Herbert  Marcuse,  vary  in  the  extent  to 
which  they  stress  social  and  economic  versus 
purely  psychological  causes  for  personality  (and 
societal)  deformation. 

The  institutional  tradition  in  economics  stresses 
the  “embeddedness”  of  economic  transactions  in 
social  structures  and  culture.  In  contrast  to  Adam 
Smith’s  claim  that  “man”  has  a  “natural”  ten¬ 
dency  to  “truck,  barter  and  exchange,”  economic 
institutionalists  emphasize  the  cultural,  social,  and 
even  normative  bases  for  exchange  as  well  as  the 
ways  in  which  social  and  cultural  motivations  and 
practices — such  as  striving  for  social  honor — shape 
and  even  distort  economic  behavior.  Some  exam¬ 
ples  include  Torsten  Veblen’s  “conspicuous 


consumption”  of  the  leisure  class,  John  Commons’s 
analysis  of  the  impact  of  historical  experience  and 
government  policies  on  the  organization  of  indus¬ 
trial  relations,  and  Max  Weber’s  classic  analysis  in 
The  Protestant  Ethic  and  the  Spirit  of  Capitalism. 
Even  Joseph  Schumpeter,  who  coined  the  phrase 
methodological  individualism  and  drafted  the  eco¬ 
nomic  model  of  democracy,  stressed  the  ways  in 
which  historic  residues  predating  the  capitalist  era 
buttressed  capitalist  economic  arrangements  and 
argued  that  as  these  precapitalist  foundations  are 
on  a  decline,  capitalism  itself  will  become  unstable. 
Similarly,  the  work  of  Karl  Polanyi  analyses  the 
economic  and  political  instability  resulting  from 
the  historically  unique  effort  of  capitalism  to  dis- 
embed  the  market  from  social  arrangements  and 
institutions. 

In  law  and  political  science,  political  institutions 
were  long  understood  in  terms  of  the  norms 
embodied  in  constitutions.  Nevertheless,  both  the 
social  context  for  political  institutions  and  the  rise 
of  the  modern  state,  as  well  as  the  impact  of  insti¬ 
tutional  arrangements  on  the  behavior  of  politi¬ 
cians  and  voting  patterns,  were  considered  by 
classic  political  institutionalists,  such  as  Alexis  de 
Tocqueville  and  Max  Weber. 

Behavioralism  Versus  Institutionalism 

In  the  aftermath  of  World  War  II,  however,  a 
behavioralist  revolution  swept  through  the  social 
sciences,  pushing  institutions  and  institutionalist 
analysis  into  the  background.  Though  more  popu¬ 
lar  in  the  United  States  than  in  Europe  and  other 
parts  of  the  world,  behavioralism  focused  on 
assumed  universal  regularities  of  behavior:  In  psy¬ 
chology,  Burrhus  Frederic  Skinner  studied  scien¬ 
tific  stimulus  and  response;  in  sociology,  Talcott 
Parsons  proposed  structural  functionalist  require¬ 
ments  and  a  universal  movement  from  traditional 
to  modern  society;  in  economics,  the  neoclassical 
model  with  its  universal  uniformities  of  market 
behavior  rose  to  undisputed  dominance;  in  politi¬ 
cal  science,  pluralists  including  Robert  Dahl  and 
David  Truman,  among  others,  focused  on  the 
observable  behavior  of  voters  and  interest  groups. 
By  the  end  of  the  1960s,  however,  limitations  of 
these  behavioralist  models  led  to  a  rediscovery  of 
institutions,  which  James  March  and  Johan  Olsen 
labeled  the  new  institutionalism. 
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Rational  Choice  and  the  New 
Institutional  Economics 

Rational  choice  theorists  and  new  institutional¬ 
ist  economists  understand  institutions  as  a  response 
to  dilemmas  of  collective  choice  and  collective 
action.  Assuming  human  actors  to  be  rational,  self- 
interested  utility  maximizers,  these  scholars  inves¬ 
tigate  the  irrational  or  suboptimal  consequences  of 
such  rational  action  when  outcomes  depend  on  the 
decisions  of  more  than  one  actor;  that  is,  they 
study  strategic  action. 

Rational  choice  studies  of  legislative  decision 
making  are  based  on  the  Condorcet  Paradox  or 
Arrow  impossibility  theorem,  which  demonstrates 
that  even  when  individuals  possess  transitive  pref¬ 
erence  orderings  (e.g.,  preferring  Restaurant  A  to 
Restaurant  B  to  Restaurant  C),  group  decision 
making  may  not  result  in  a  stable  choice.  As  a 
result,  majority  rule  is  inherently  unstable,  and 
these  institutionalists  study  both  the  conditions  for 
democratic  stability  and  the  ways  in  which  institu¬ 
tional  rules,  such  as  the  agenda-setting  privilege  of 
the  political  executive  or  the  veto  powers  of  parlia¬ 
mentary  chambers  or  committees,  allow  stable 
political  choices  to  be  made.  Spatial  models  of 
preferences  serve  as  an  important  tool  for  this  type 
of  analysis.  Game  theory  is  simply  a  method  of 
economic  analysis  based  on  the  payoff  functions  of 
the  actors  (or  players)  and  the  rules  for  their 
choices  (or  moves).  Such  games  often  result  in  sub- 
optimal  outcomes,  as  in  the  famous  prisoners’ 
dilemma,  in  which  both  prisoners  confess  to  a 
crime  out  of  fear  that  their  co-conspirator  will 
betray  them.  Institutional  rules  or  particular  strate¬ 
gies  may  be  of  some  help  in  improving  these  out¬ 
comes,  for  example,  by  turning  a  noncooperative 
game,  such  as  the  tragedy  of  the  commons,  into  a 
cooperative  one. 

The  new  institutionalist  economics  explores  the 
origins  of  economic  institutions  and  their  effects, 
especially  the  legal  framework  for  the  market, 
which  directly  affects  the  distribution  of  property 
rights  and  the  calculation  of  transaction  costs.  Some 
influential  models  for  understanding  the  develop¬ 
ment  of  institutions  are  transaction  cost  economics, 
which  postulates  that  institutions  develop  to  mini¬ 
mize  all  costs  related  to  economic  transactions, 
including  information  costs  and  the  dangers  of 
mutual  dependence;  principal-agent  theory,  which 
understands  contracts  and  organizational  structures 


as  responses  to  the  need  of  “principals”  to  control 
their  “agents,”  such  as  owners-managers,  govern- 
ments-bureaucracies,  and  voters-politicians;  and 
path  dependency,  which  uses  the  concepts  of  sunk 
costs  and  increasing  returns  to  explain  why  initial 
contingent  events  lead  to  inefficient  but  stable 
paths  of  economic  or  political  development. 
Central  issues  deal  with  whether  or  not  socialized 
or  disembedded  individuals  can  be  taken  as  the 
point  of  departure  for  these  models,  whether  insti¬ 
tutional  arrangements  can  be  modeled  as  if  emerg¬ 
ing  from  a  state  of  nature,  and  whether  historically 
given  institutional  starting  points  are  necessary. 

Sociological  Institutionalism 
and  Economic  Sociology 

As  sociology  deals  with  the  social  order,  the 
entire  field  can  be  said  to  be  institutional. 
Nevertheless,  a  distinct  group  of  sociologists  have 
built  on  the  behavioral  psychology  and  organization 
theory  to  elaborate  a  view  of  individual  cognition 
and  collective  decision  making  within  organizations 
that  they  have  termed  sociological  institutionalism. 
Here,  cognitive  limits  on  human  capacities  for  gath¬ 
ering  and  processing  information  (bounded  ratio¬ 
nality)  result  in  various  coping  mechanisms,  such  as 
accepting  the  first  more  or  less  acceptable  alterna¬ 
tive  (satisficing)  in  place  of  maximizing  utility  or 
reliance  on  standard  operating  procedures  that 
reduce  choice  and  thereby  structure  and  coordinate 
action.  Routines  and  scripts  may  produce  patterns 
of  organizational  behavior  that  can  be  termed 
quasi- chaotic,  as  in  the  garbage  can  model,  which 
posits  that  choices  within  organized  anarchies  are 
largely  random.  Newer  applications  of  this  perspec¬ 
tive  have  focused  on  the  reliance  of  myth  and  cere¬ 
mony  even  in  modern  business  organizations,  the 
socialization  in  time  of  various  managerial  cohorts, 
and  the  role  of  organizational  isomorphism  between 
political  systems  and  organizations  of  immigrants 
to  produce  particular  cultural  patterns  of  postmod¬ 
ern  citizenship.  Social  constructivist  approaches 
have  stressed  especially  the  socially  contingent 
development  of  norms,  ideas,  and  institutions,  even 
in  the  international  arena.  Economic  sociology  also 
focuses  on  the  social  and  cultural  bases  of  economic 
institutions  and  concepts,  such  as  inheritance  or  the 
household,  thus  adhering  to  the  institutionalist 
tradition  in  economics  more  faithfully  than  the 
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new  institutional  economics  or  new  economics  of 
organization. 

Historical  Institutionalism 

The  concepts  of  economic  sociology,  also  found 
in  the  corporatism  and  other  varieties  of  capitalism 
literature,  focus  on  the  historical,  social,  and  orga¬ 
nizational  conditions  for  alternate  modes  of  capital¬ 
ist  economic  formation.  Historical  institutionalism, 
more  generally,  follows  the  research  program  of 
Max  Weber  in  understanding  economy,  politics, 
and  society  in  terms  of  historically  contingent,  par¬ 
ticular  developmental  paths,  whose  meanings 
depend  on  the  subjective  interpretations  of  human 
actors.  Some  historical  institutionalists  emphasize 
path  dependency,  sequences,  and  temporal  order¬ 
ings.  Others  argue  that  the  historical  approach  may 
give  us  leverage  precisely  on  questions  of  human 
agency  and  configurational  causality.  The  main 
problem  with  this  approach  is  related  to  the  causes 
of  institutional  stability  and  change.  Its  main  meth¬ 
ods  are  process  tracing  and  thick  description  as  well 
as,  in  some  cases,  analytic  narratives. 

Current  Debates 

Institution  is  so  fundamental  to  the  social  sciences 
that  it  is  not  surprising  that  the  definition  of  and 
research  on  institutions  spans  an  enormously 
broad  range  of  concepts,  methods,  and  topics. 
Nevertheless,  the  study  of  institutions  deals  with 
the  formation  and  impact  of  stable  social  arrange¬ 
ments,  even  if  these  may  range  from  ideas  and 
normative  concepts  to  actual  local,  national,  and 
international  organizations,  associations,  and  states. 
The  central  cleavages  within  institutionalist  analysis 
help  study  whether  a  state  of  nature  can  be  posited 
or  whether  particular  culturally  and  historically 
grounded  starting  points  must  be  addressed, 
whether  institutions  evolve  or  are  products  of  con¬ 
scious  human  design,  and  whether  institutionalist 
outcomes  are  functional  or  historically  contingent, 
and  if  so,  what  normative  consequence  follows 
from  the  artifactuality  of  institutions  and  their 
impact  on  social  life. 
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Humboldt  University  Berlin 
Berlin,  Germany 
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Intellectuals 


The  concept  of  intellectual  as  a  category  emerged  in 
the  late  19th  century  and  bloomed  with  the  politi¬ 
cal  phenomena  of  the  20th  century.  Before  that,  as 
in  the  work  of  the  medieval  historian  Jacques  Le 
Goff,  for  example,  the  term  referred  to  clerics. 
Intellectuals  appeared  in  France  during  the  Dreyfus 
Affair  of  1894,  in  which  Captain  Alfred  Dreyfus,  a 
French  artillery  officer  of  Jewish  descent,  was  tried 
and  falsely  convicted  of  treason;  he  was  later  exon¬ 
erated  and,  in  1906,  received  a  full  pardon.  The 
term  Dreyfus  Affair  was  coined  by  the  writer 
Maurice  Barres  in  1898  to  designate  the  writers, 
scholars,  and  publicists  who  were  convinced  of 
Dreyfus’s  innocence.  It  is  thus  at  the  intersection  of 
science  and  public  debate  that  the  intellectual 
stands.  Throughout  the  20th  century,  intellectuals 
emerged  as  major  players  in  politics  and  public 
affairs.  If  one  speaks  in  the  plural,  intellectuals  des¬ 
ignate  a  group  of  scholars  and  artists  whose  inter¬ 
vention  in  public  space  finds  its  legitimacy  in  their 
expertise  and  knowledge.  Very  soon,  the  figure  of 
the  intellectual  also  appeared  in  the  singular  as  a 
symbol  of  political  idealism  or  commitment  in  the 
political  struggles  of  the  century.  Today,  they  are 
represented  by  writers  such  as  Salman  Rushdie, 
who  have  been  persecuted  for  their  beliefs  and 
writings  and  have  become  symbols  of  freedom  of 
expression.  During  the  last  quarter  of  the  20th  cen¬ 
tury,  the  growing  part  played  by  intellectuals  in 
economic  debates  on  globalization  as  well  as  in 
discussions  on  issues  of  redistribution  and  social 
justice  allowed  the  emergence  of  new  actors.  In  this 
entry,  the  changing  roles  of  intellectuals  and  the 
new  challenges  facing  them  are  discussed. 

Intellectuals  and  Engagement 

The  Dreyfus  Affair  established  the  legitimacy  of 
intellectuals  to  intervene  in  public  debate  in  the 
name  of  universal  values.  From  that  moment 
onward  they  have  never  left  the  political  scene, 


announcing  or  accompanying  its  principal  evolu¬ 
tions.  The  history  of  intellectuals  during  the  first 
half  of  the  20th  century  is  marked  by  both  fascism 
and  communism.  If  certain  intellectuals  such  as  the 
philosopher  Martin  Heidegger,  the  famous  histo¬ 
rian  of  the  Middle  Ages  Ernst  Kantorowicz,  and  the 
jurist  Carl  Schmitt  appeared  favorable  to  Nazism  in 
its  early  days,  most  intellectuals  rallied  to  the  camp 
opposing  Hitler.  Most,  like  Stefan  Zweig  or  Walter 
Benjamin,  chose  exile.  The  German  novelist  and 
short  story  writer  Thomas  Mann  and  the  philoso¬ 
phers  Hannah  Arendt,  Theodor  Adorno,  and  Ernst 
Cassirer  found  refuge  in  the  United  States.  Mean¬ 
while,  many  intellectuals  were  attracted  by  the 
Soviet  system,  in  which  they  saw  the  possibility  of 
the  realization  of  the  communist  ideal.  The  inter¬ 
war  period  marked  a  first  peak  of  the  membership 
of  intellectuals  in  communist  parties,  which  was 
renewed  after  World  War  II  and  the  prestige  gained 
by  the  victory  of  the  Allies.  This  was  then  followed 
by  the  Cold  War,  and  intellectuals  often  were 
regarded  as  compagnons  de  route  in  the  image  of 
Jean-Paul  Sartre  in  France. 

Intellectuals  not  only  participate  in  political 
debate,  but  they  also  judge  political  events.  They 
thus  played  a  major  role  in  the  interpretation  of 
the  Holocaust  and  in  the  debates  on  how  to  deal 
with  this  legacy  in  public  discourses,  as  in  Germany, 
and  also  in  the  interpretation  of  the  Nazi  phenom¬ 
enon  and  its  German  roots  (see  the  Historikerstreit 
in  1986  between  the  historian  Ernst  Nolte  and  the 
philosopher  Jurgen  Habermas).  Intellectuals  also 
have  attempted  to  provide  an  existential  reflection, 
as  in  Primo  Levi’s  works.  They  have  been  similarly 
involved  not  only  in  the  debate  against  racial  dis¬ 
crimination  in  America  in  the  1960s  (Hannah 
Arendt  and  Toni  Morrison)  and  in  Third  World 
struggles  for  independence  (Frantz  Fanon)  but  also 
in  the  assertion  of  former  colonized  cultures — for 
example,  Aime  Cesaire  and  Leopold  Sedar  Senghor, 
the  spokesmen  for  Negritude,  or  more  recently, 
Raphael  Confiant  and  Patrick  Chamoiseau  for 
Creole. 

One  of  the  most  important  influences  of  intel¬ 
lectuals  on  political  life  was  the  challenging  role 
played  by  dissidents  in  the  Soviet  Union  and  other 
communist  states.  Long  persecuted,  imprisoned  by 
the  Soviet  regime,  and  silenced,  they  asserted 
themselves  in  favor  of  change  in  the  climate  of 
international  relations.  The  Helsinki  Accords  of 
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1975  allowed  the  appearance  of  the  Charter  77 
movement  led  by  Jan  Patocka  and  Vaclav  Havel  in 
Czechoslovakia,  which  placed  the  defense  of 
human  rights  at  the  heart  of  its  approach  and 
appealed  to  the  West,  and  similar  movements  in 
Poland,  where  intellectuals  played  a  central  role  in 
the  Solidarnocz  movement  as  also  in  the  round 
table  discussions  that  allowed  the  nonviolent  tran¬ 
sition  toward  democracy  in  the  late  1980s. 

Intellectuals  and  the  End  of  History 

The  events  of  1989,  the  year  of  the  fall  of  the 
Berlin  Wall  and  the  beginning  of  the  collapse  of  the 
Soviet  empire,  would  seem  to  corroborate  theories 
that  shook  the  global  intellectual  debate  since  the 
political  scientist  Francis  Fukuyama  in  an  article, 
based  on  a  Hegelian  reading  of  history,  described 
the  final  victory  of  liberal  democracy  over  totali¬ 
tarian  regimes  as  “the  end  of  history.”  By  contrast, 
Samuel  Huntington,  who  propagated  the  idea  of  a 
“clash  of  civilizations,”  which  the  terrorist  attacks 
of  September  11,  2001,  seemed  to  confirm,  shaped 
the  global  debate  in  a  different  way. 

One  of  the  striking  influences  that  intellectuals 
have  had  over  the  past  20  years  concerns  the  role 
played  by  neo-conservatives  in  the  United  States. 
The  meeting  of  a  group  of  students  at  City  College 
of  New  York,  including  Irving  Kristol,  Daniel  Bell, 
and  Irving  Howe,  led  to  the  birth  of  neo¬ 
conservatism;  they  combined  the  anticommunism 
of  the  Cold  War  period  and  the  theories  of  the 
philosopher  Leo  Strauss,  who  had  emigrated  to 
the  United  States  to  escape  the  Nazi  regime. 
Neo-conservatism  affirms  the  equal  responsibility 
of  intellectuals  and  political  leaders.  Neo¬ 
conservatives  argue  that  to  combat  terrorism,  the 
United  States  should  assert  its  power  internation¬ 
ally  to  promote  the  development  of  democracies. 
This  theory  had  considerable  resonance  after  the 
September  11,  2001,  attacks.  It  was  on  this  basis 
that  American  forces  intervened  in  Iraq  in  2003  to 
create  a  new  political  order. 

New  Theoretical  Objects 

Recent  years  have  seen  the  intellectuals  expanding 
the  scope  of  their  study  to  new  objects  as  well  as  the 
emergence  of  new  voices.  Following  the  theories  of 
John  Rawls  (A  Theory  of  Justice,  1971),  the  Indian 


economist  Amartya  Sen,  recipient  of  the  Nobel 
Prize  in  Economics  in  1998,  and  the  Bangladeshi 
Muhammad  Yunus,  Nobel  Peace  Prize  laureate  in 
2006,  have  played  a  central  role  not  only  in  the 
renewal  of  economic  theory  but  also  in  stressing  the 
importance  of  equality  and  justice  in  international 
relations.  This  movement  signals  the  new  role  of 
today’s  intellectuals  who  raise  questions  of  public 
interest  concerning  the  future  of  the  planet  such  as 
the  principle  of  responsibility  expounded  by  phi¬ 
losopher  Hans  Jonas  in  the  early  1960s  about  the 
nuclear  threat  and  now  dealing  with  environmental 
policies,  besides  public  health  and  bioethical  issues. 

Intellectuals  are  also  faced  with  this  question: 
What  is  the  nature  of  social  change  to  come? 
Whether  postcolonial  thinkers,  theorists  of  the 
“multitude,”  representatives  of  gay  theory,  or  phi¬ 
losophers  of  recognition,  their  answers  are  often 
combined  with  a  strong  critique  of  democracy.  The 
debate  concerns  both  the  nature  of  power  in  con¬ 
temporary  societies  and  the  possibility  for  formerly 
oppressed  groups  to  express  themselves.  It  allows 
the  renewal  of  old  theories  (Marxism,  anarchism) 
at  the  same  time  as  the  emergence  of  new  objects. 
In  recent  years,  intellectuals  dealt  with  questions 
of  law  and  justice  (Jurgen  Habermas,  Giorgio 
Agamben,  and  Roberto  Unger),  media  and  com¬ 
munication  (Noam  Chomsky),  ecology  (Vandana 
Shiva  and  Peter  Singer),  or  HIV/AIDS.  They  have 
especially  sought  to  bring  in  new  players  in  the  line 
of  subaltern  studies  that  developed  predominantly 
in  India  or  Latin  America  or  postcolonial  studies 
(Edward  Said,  Homi  Bhabha,  and  Paul  Gilroy). 
These  theories,  postulated  by  women  scholars  such 
as  Judith  Butler  or  Gayatri  Spivak,  were  based  on 
the  result  of  gender  studies  and  the  formation  of 
identities  as  ideological  constructs.  Theorists  of 
recognition  (Axel  Honneth,  Seyla  Benhabib,  and 
Nancy  Frazer)  refer  to  Charles  Taylor’s  theories  of 
justice  as  a  basis  for  classifying  recognition  as  a 
political  category. 

Politics  at  the  Heart  of 

Intellectuals’  Reflection 

The  current  relationship  between  intellectuals  and 
politics  has  developed  into  a  more  complex  form 
than  it  had  throughout  the  20th  century.  The  links 
between  intellectuals  and  politics  assert  themselves 
as  more  deeply  embedded  than  ever.  One  could 
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indeed  imagine  in  the  early  1970s  that  the  com¬ 
plexity  of  our  societies  and  the  development  of 
globalization  heralded  the  disappearance  of  intel¬ 
lectuals  and  their  replacement  by  competent  experts 
and  scientists,  while  new  trends  such  as  globaliza¬ 
tion  asserted  the  primacy  of  the  economy  at  the 
expense  of  politics.  Nevertheless,  a  number  of  intel¬ 
lectuals  today  enjoy  an  international  reputation 
further  enhanced  by  the  circulation  of  ideas,  global¬ 
ization  of  information,  use  of  the  Internet,  and 
mobility  of  students. 

At  the  international  level,  the  most  burning  issues 
remain  political.  If  in  Western  democratic  societies 
intellectuals  agree  with  the  criticism  of  capitalism, 
they  must  nevertheless  remember  that  their  exis¬ 
tence  is  linked  to  the  practice  of  democracy.  It  is  this 
battle  that  is  waged  today  by  intellectuals  in  all 
countries  where  freedom  of  expression  is  restricted. 
Dissidents  under  repressive  regimes  have  struggled 
for  historical  truth  and  the  recognition  of  different 
communities  that  make  up  society.  The  other  big 
question  now  concerns  the  dynamics  of  political 
and  religious  radicalization.  A  number  of  intellec¬ 
tuals,  for  example,  call  for  a  modernized  reading 
of  Islam. 

The  past  decades  have  discredited  the  idea  of  a 
peaceful  unification  of  the  planet  and  confronted 
intellectuals  with  all  forms  of  conflict,  from  the 
most  global  to  the  most  local,  bringing  out  old 
nationalist  confrontations  but  also  religious 
clashes.  “What  is  most  peculiar  about  our  age  is 
the  conviction  that  evil  is  installed  at  the  core  of 
history  and  our  frenetic  rejection  of  that  convic¬ 
tion”  (Therese  Delpech,  2007,  p.  175).  If  they 
want  to  meet  the  challenges  of  democracy,  intel¬ 
lectuals  have  a  long  way  ahead 
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Intelligence 


Intelligence  refers  to  an  awareness  that  leaders 
hope  to  have  about  the  threats  or  opportunities 
that  face  their  nation,  either  internally  or  from 
abroad.  Armed  with  this  information,  they  may 
make  better  decisions  about  how  to  protect  and 
advance  the  national  interest.  An  understanding 
of  threats  and  opportunities  can  be  reliably 
acquired  only  through  the  careful  collection  and 
study  of  information  about  domestic  and  foreign 
situations,  such  as  plotting  by  internal  subversives 
or  the  machinations  of  external  terrorist  factions 
and  rival  nations.  This  gathering  of  information, 
along  with  the  interpretation  of  its  meaning,  lies 
at  the  heart  of  intelligence. 

Stated  more  formally,  intelligence  is  the 
knowledge  and  foreknowledge  of  dangers  and 
opportunities — both  domestic  and  foreign — as  a 
prelude  to  decision  and  action  by  a  nation’s  lead¬ 
ers.  Strategic  intelligence  refers  to  the  objective  of 
achieving  a  global  understanding  of  dangers  and 
opportunities;  tactical  intelligence  is  concerned 
more  with  threats  and  opportunities  on  specific 
battlefields  or  theaters  of  war. 

The  academic  study  of  intelligence  has  pro¬ 
gressed  by  leaps  and  bounds  in  recent  years.  The 
most  important  developments  in  the  field  are  efforts 
to  move  beyond  spy  memoirs  and  to  apply  rigorous 
research  standards  to  the  questions  of  how  nations 
gather  and  analyze  information  on  world  affairs 
and  engage  in  other  intelligence  activities.  More  and 
more  studies  are  presenting  empirical  data,  testable 
hypotheses,  and  theoretical  frameworks.  Scholars  in 
the  field  have  also  been  conducting  in-depth  inter¬ 
views  with  practitioners  and  have  benefitted  from 
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the  extensive  number  of  intelligence  documents  that 
have  been  released  in  recent  decades.  In  the  United 
States,  such  documents  include  the  Church 
Committee  reports  (1975-1976)  as  well  as  reports 
from  government  panels  examining  intelligence 
such  as  the  Aspin-Brown  Commission  (1995-1996), 
the  Kean  Commission  (2004),  and  the  Silberman- 
Robb  Commission  (2005). 

This  entry  considers  the  nature  of  intelligence, 
highlighting  some  key  examples  involving  covert 
action  and  counterintelligence  activities  on  the  part 
of  the  United  States.  It  then  examines  the  results  of 
intelligence  activities  and  notes  some  distinctive 
features  of  intelligence  in  democratic  regimes. 

The  Nature  of  Intelligence 

Intelligence  may  be  considered  a  process,  a  prod¬ 
uct,  a  set  of  organizations,  and  a  set  of  missions. 

Intelligence  as  a  Process 

As  a  process,  intelligence  is  a  series  of  interac¬ 
tive  phases  whereby  government  officials  plan 
what  information  to  collect  from  around  the 
globe,  use  machines  and  human  agents  to  gather 
the  information,  assign  experts  (“analysts”)  to 
make  sense  of  the  information,  and  finally,  distrib¬ 
ute  the  findings  to  decision  makers.  This  sequence 
of  activities  is  known  as  the  intelligence  cycle. 

Intelligence  as  a  Product 

As  product,  intelligence  consists  of  facts  and 
interpretive  reports  about  homeland,  world,  or 
battlefield  conditions.  In  the  United  States,  for 
example,  the  most  prestigious  intelligence  report  is 
the  President’s  Daily  Brief  (PDB),  a  succinct  sum¬ 
mary  of  global  events  over  the  past  24  hours,  deliv¬ 
ered  to  the  president  and  a  few  other  top  policy 
officials  each  morning.  Important,  as  well,  in  the 
United  States,  is  the  National  Intelligence  Estimate 
or  NIE,  which  attempts  to  make  more  lengthy 
long-range  forecasts  about  world  events  and  relies 
on  detailed  research. 

Intelligence  as  a  Set  of  Organizations 

As  a  set  of  organizations,  a  nation’s  intelligence 
“community”  focuses  on  a  number  of  responsibili¬ 
ties:  code-breaking  and  electronic-eavesdropping 


organization,  the  gathering  and  interpretation  of 
photography  from  satellites  and  reconnaissance 
aircraft,  the  interpretation  of  military  information 
gathered  by  intelligence  units  within  the  uniformed 
services,  and  their  gathering  of  tactical  information 
in  theaters  of  combat. 

Intelligence  as  a  Set  of  Missions 

Collection  and  Analysis 

As  a  set  of  missions,  intelligence  refers  first  of 
all  to  the  range  of  activities  encompassed  within 
the  intelligence  cycle,  known  in  short  as  “collec¬ 
tion  and  analysis.”  This  is  Mission  Number  One 
for  a  nation’s  intelligence  agencies,  that  is,  the 
duty  to  place  on  the  desktops  of  prominent  leaders 
the  best  information  possible  to  help  illuminate 
the  domestic  and  foreign  security  situations  they 
face.  In  the  United  States  and  most  other  affluent 
nations,  the  intelligence  agencies  have  two 
additional  responsibilities:  (1)  covert  action  and 
(2)  counterintelligence  (Cl). 

Covert  Action 

Covert  action  is  a  secret  activity  designed  to 
influence  events  in  other  nations  in  a  direction 
favorable  to  a  nation’s  best  interests,  not  simply 
gather  information  abroad.  This  mission  expanded 
rapidly  in  the  democracies  during  the  Cold  War  as 
a  means  for  combating  the  aggressive  use  of  intel¬ 
ligence  operations  by  the  Soviet  Union  to  spread 
communism  around  the  world. 

Covert  action  is  often  grouped  into  four  catego¬ 
ries:  (1)  propaganda,  (2)  political,  (3)  economic, 
and  (4)  paramilitary  activities.  Propaganda,  some¬ 
times  called  psychological  warfare  or  simply  “psy 
war,”  is  the  most  extensively  used  form  of  covert 
action.  During  the  Cold  War,  for  example,  the 
Central  Intelligence  Agency  (CIA)  secretly  pro¬ 
duced  a  flood  of  worldwide  propaganda  dissemi¬ 
nated  by  way  of  its  vast,  hidden  network  of  “media 
assets” — foreign  newspaper  editors  and  reporters, 
television  correspondents,  magazine  editors,  and 
book  editors.  This  source  of  “perception  manage¬ 
ment”  (in  the  intelligence  euphemism)  was  meant 
to  supplement  the  normal  overt  sources  of  infor¬ 
mation  about  the  United  States  distributed  through 
official  channels,  such  as  White  House  press 
releases  and  State  Department  public  bulletins. 


1212  Intelligence 


During  the  Cold  War  and  till  today,  political 
covert  action  includes  secret  financial  aid  to  friendly 
politicians  and  governmental  officials  abroad — 
what  British  intelligence  officers  refer  to  as  “King 
George’s  cavalry.”  During  the  1960s,  the  CIA  pro¬ 
vided  covert  funding  to  the  Christian  Democratic 
Party  in  Italy  for  its  behind-the-scenes  political 
battles  against  the  Soviet-sponsored  Italian 
Communist  Party.  Normally,  propaganda  and 
political  covert  action  work  together  hand  in  glove. 

Covert  action  in  the  form  of  secret  economic 
operations  can  also  be  used  to  undermine  adver¬ 
saries  abroad,  for  example,  by  encouraging  labor 
unrest  (as  in  Chile  during  the  1970s)  or  by  sabo¬ 
taging  electrical  power  lines  and  mining  harbors 
(approaches  used  by  the  United  States  against 
Nicaragua  during  the  1980s).  Paramilitary  (PM) 
activities  are  the  most  extreme,  and  controversial, 
form  of  covert  action.  Adopting  this  approach,  a 
country  will  enter  into  warlike  operations  against 
an  adversary — everything  from  weapons  supplies 
to  one  of  the  sides  that  is  engaged  in  a  foreign  civil 
war  to  large-scale  “secret”  (or,  at  any  rate,  never 
officially  acknowledged)  wars  or  assassination 
plots  aimed  against  foreign  leaders.  Classic  illus¬ 
trations  of  PM  wars  are  the  failed  Bay  of  Pigs 
operation  in  Cuba  in  1961  and  the  struggle  against 
communist  forces  in  Laos  from  1962  to  1968, 
with  the  CIA  advising  and  arming  the  indigenous 
anticommunist  Hmong  tribesmen. 

Before  assassination  as  an  instrument  of  U.S. 
intelligence  was  prohibited  by  an  executive  order 
signed  by  President  Gerald  Ford  in  1976,  and  reaf¬ 
firmed  by  every  American  president  since,  the 
White  House  instructed  the  CIA  to  carry  out  plots 
to  kill  Fidel  Castro  of  Cuba  and  Patrice  Lumumba 
of  Congo,  among  other  leaders  of  small  nations 
caught  in  the  tug-of-war  between  the  United  States 
and  the  Soviet  Union.  None  of  the  plots  was  suc¬ 
cessful.  Despite  the  executive  order  against  assas¬ 
sinations,  a  waiver  exists  in  times  of  authorized 
war,  and  as  a  consequence,  the  White  House 
ordered  CIA  plots  against  the  lives  of  Saddam 
Hussein  of  Iraq  and  Osama  bin  Laden  of  the  A1 
Qaeda  terrorist  organization  in  the  early  years  of 
the  21st  century.  None  succeeded.  Overt  U.S. 
forces  eventually  captured  Hussein  during  the 
Second  Persian  Gulf  War  that  began  in  2003;  he 
was  tried  and  executed  by  provisional  Iraqi  author¬ 
ities.  Bin  Laden  remains  at  large. 


Counterintelligence 

The  third  core  intelligence  mission  is  known  as 
counterintelligence  (Cl).  As  with  covert  action,  this 
mission  is  rarely  mentioned  in  a  nation’s  official 
documents.  Yet  Cl  grew  quickly  as  a  mission  dur¬ 
ing  the  Cold  War  in  the  face  of  the  rising  confron¬ 
tation  between  the  Western  democracies  and  the 
Soviet  Union.  The  purpose  of  Cl  (of  which  coun¬ 
terterrorism  is  an  important  subset)  is  to  thwart 
the  hostile  activities  of  foreign  intelligence  services 
and  terrorist  organizations  directed  against  the 
homeland. 

The  practice  of  Cl  entails  both  security  and 
counterespionage  (CE).  Security  is  the  passive  or 
defensive  side  of  Cl  and  relies  on  static  safeguards 
such  as  security  clearances,  polygraphs  (lie  detec¬ 
tion  machines),  codes,  fences,  sentries,  alarms, 
badges  and  passes,  curfews,  restricted  areas,  and 
checkpoints.  CE  is  the  aggressive  or  offensive  side 
of  the  mission.  The  ultimate  goal  is  to  infiltrate  the 
hostile  intelligence  service  with  an  agent  (or 
“mole”),  a  ploy  known  as  a  “penetration.”  With  a 
penetration  agent  inside  the  enemy’s  camp,  one 
can  possibly  learn  what  moles  the  enemy  has  man¬ 
aged  to  burrow  inside  one’s  own  camp.  The 
Soviets  successfully  used  this  approach  against  the 
United  States  during  the  1980s  and  1990s  by 
secretly  recruiting  Aldrich  Ames  of  the  CIA  and 
Robert  Hanssen  of  the  FBI,  who  revealed  to  their 
masters  in  Moscow  the  names  of  CIA  moles  inside 
Soviet  intelligence  and  other  information  that 
undermined  the  effectiveness  of  U.S.  intelligence 
operations  against  the  Soviet  Union.  Other  major 
democracies  also  experienced  Soviet  penetrations 
of  their  political  and  intelligence  agencies. 

Results  of  Intelligence  Actions 

In  each  of  these  missions — collection  and  analysis, 
covert  action,  and  Cl — nations  have  registered  suc¬ 
cesses  and  failures.  In  the  United  States,  for  exam¬ 
ple,  the  CIA  successfully  tracked  the  development 
of  Soviet  weapons  systems  by  way  of  surveillance 
machines  and  human  agents.  Yet  the  agency  failed 
to  warn  the  nation  of  the  A1  Qaeda  attack  of 
September  11,  2001,  against  the  United  States  and 
incorrectly  concluded  in  2002  that  Iraq  possessed 
weapons  of  mass  destruction,  thereby  helping  pre¬ 
cipitate  the  Second  Persian  Gulf  War. 
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With  respect  to  covert  action,  the  CIA  success¬ 
fully  overthrew  leaders  in  Iran  (1953)  and 
Guatemala  (1954)  who  were  deemed  unfriendly  to 
the  United  States.  Yet  presidents  would  soon  find 
out  that  this  secret  approach  to  foreign  policy 
often  failed,  as  with  the  Bay  of  Pigs  operation  dur¬ 
ing  the  Kennedy  administration.  Another  failure 
occurred  with  the  Iran-Contra  scandal  during  the 
Reagan  administration,  when — despite  an  express 
law  that  prohibited  such  activities — the  National 
Security  Council  staff  worked  illegally  with  ele¬ 
ments  of  the  CIA  to  conduct  paramilitary  opera¬ 
tions  in  Nicaragua  (where  the  anticommunist 
Contras  were  at  war  against  a  Marxist  regime). 
During  the  Reagan  years,  the  United  States  also 
experienced  covert  action  successes  in  Afghanistan, 
when  the  CIA  helped  drive  the  Soviets  out  of  that 
country.  Once  more  in  Afghanistan,  this  time  in 
the  wake  of  the  9/11  attacks,  the  second  Bush 
administration  enjoyed  covert  action  success  when 
the  CIA  helped  rout  the  Taliban  regime  that  had 
provided  a  haven  for  the  A1  Qaeda  terrorists  who 
had  struck  Washington,  D.C.,  and  New  York  City. 

Similarly,  on  the  Cl  front,  the  United  States  has 
caught  many  spies,  including  those  who  stole  the 
secret  of  the  atom  bomb  and  gave  it  to  Moscow; 
but  Washington  suffered  the  failures  of  Ames, 
Hanssen,  and  several  other  Soviet  and  Chinese 
penetrations  during  the  Cold  War.  In  the  United 
Kingdom,  Harold  “Kim”  Philby  spied  for  the 
Soviet  Union  at  the  highest  levels  during  the  Cold 
War.  With  respect  to  all  three  intelligence  mis¬ 
sions,  one  can  expect  ongoing  successes  and — 
conducted  as  they  are  by  mere  mortals — inevitable 
failures,  too. 

The  intelligence  agencies  of  the  United  States 
are  distinct  in  some  ways  from  those  of  other 
nations.  They  are  large  and  technically  advanced, 
and  they  are  dispersed  in  their  organization  and 
management  far  more  than  is  usually  the  case  even 
in  other  democracies.  The  U.S.  secret  agencies  also 
share  common  traits  with  their  counterparts 
abroad,  such  as  an  inability  to  foresee  and  collect 
all  the  information  that  national  leaders  may  need, 
a  periodic  misinterpretation  of  information  by 
analysts  and  (sometimes  intentionally  to  further 
their  own  political  agenda)  by  decision  makers, 
and  from  time  to  time  critical  failures  of  CI. 

In  democratic  regimes,  another  aspect  of  intel¬ 
ligence  has  garnered  much  attention:  maintaining 


accountability  over  secret  agencies  as  a  guard 
against  their  abuse  of  secret  power.  In  the  United 
States,  for  example,  journalists  and  lawmakers  dis¬ 
covered  during  the  1970s  that  the  CIA  and  its  com¬ 
panion  agencies  had  been  misused  by  politicians  to 
spy  against  American  citizens — chiefly  anti- 
Vietnam  War  dissenters  and  civil  rights  activists. 
This  led  to  a  reform  movement  and  the  establish¬ 
ment  in  Congress  of  specific  intelligence  oversight 
committees  with  the  responsibility  of  keeping  an 
eye  on  the  secret  agencies  and  rein  in  any  wrongdo¬ 
ing.  Compared  with  the  time  of  benign  neglect 
before  the  reform  movement,  the  new  intelligence 
accountability  has  provided  a  serious  check  against 
the  abuse  of  secret  power.  Critics  argue,  however, 
that  overseers  on  Capitol  Hill  have  failed  to  consis¬ 
tently  supervise  the  intelligence  agencies.  Moreover, 
disputes  have  arisen  over  whether  the  second  Bush 
administration  misused  the  agencies  once  again  for 
spying  against  American  citizens,  especially  by  tap¬ 
ping  their  telephone  conversations  without  proper 
warrants.  The  search  continues  in  the  United  States 
and  other  democracies  for  the  proper  balance 
between  efficient  intelligence  services,  on  the  one 
hand,  and  the  exercise  of  proper  accountability  to 
preserve  liberty,  on  the  other. 

Loch  K.  Johnson 
University  of  Georgia 
Athens,  Georgia,  United  States 
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Interaction  Effects 


Many  hypotheses  in  political  science  are  condi¬ 
tional  in  nature.  Institutional  theories  typically 
argue  that  actors  respond  differently  to  similar 
stimuli  when  they  inhabit  different  institutional 
environments.  Political  culture  arguments  typi¬ 
cally  come  down  to  the  claim  that  the  ways  citi¬ 
zens  behave  depends  on  the  environment  in  which 
they  were  socialized.  Approaches  that  emphasize 
strategic  interaction  identify  equilibrium  responses 
that  depend  on  key  parameters;  when  a  key 
parameter  is  above  some  critical  level,  one  strategy 
combination  is  an  equilibrium,  while  entirely  dif¬ 
ferent  behavior  (as  encapsulated  in  a  different  com¬ 
parative  static  relationship  between  key  variables) 
is  expected  when  that  parameter  falls  below  the 
critical  value.  However,  many  context-conditional 
claims  are  tested  as  if  they  were  unconditional 
claims.  Robert  Franzese  and  Cindy  Kam,  for 
example,  show  that  54%  of  the  articles  appearing 
over  a  period  of  5  years  in  journals  covering  three 
disciplines  used  statistical  methods  but  only  24% 
of  these  used  interaction  terms  in  their  empirical 
analysis.  The  review  of  the  literature  by  Thomas 
Brambor,  Robert  Clark,  and  Matt  Golder  is  even 
more  discouraging.  Examining  three  leading  polit¬ 
ical  science  journals  over  5  years,  they  found  that 
more  than  90%  of  the  articles  that  actually  speci¬ 
fied  conditional  tests  made  at  least  one  of  the 
common  errors  those  authors  warn  about.  Some 
ways  to  test  for  interaction  effects  are  presented 
below. 

To  get  a  sense  of  how  multiplicative  interaction 
models  allow  one  to  test  context-conditional 
claims,  consider  the  simplest  case,  where  X  is 
hypothesized  to  be  associated  with  a  continuous 
variable  Y  in  the  presence  of  condition  Z  but  not 
in  its  absence.  Assume  Y  and  X  to  be  continuous 
variables  and  Z  is  defined  such  that  Z  =  1  when 
the  factor  in  question  is  present  and  Z  =  0  other¬ 
wise.  The  multiplicative  interaction  effect  approach 
to  testing  such  a  claim  is  to  estimate  using  ordinary 
least  squares  (OLS)  regression  as  follows: 

Y  =  p0  +  p3X  +  P2Z  +  P3XZ  +  s.  (1) 

To  see  how  this  model  can  be  used  to  capture  a 
context-conditional  claim,  examine  the  case  where 


the  condition  is  absent — that  is,  when  Z  =  0. 
Equation  1  would,  therefore  become 

Y  =  P0  +  PjX  +  0,(0)  +  03X(O)  +  e, 
which  simplifies  to 

Y  =  0O  +  0jX  +  s. 

Thus,  p0  and  p  serve  as  estimates  for  the  intercept 
and  slope,  respectively,  in  a  linear  model  of  the 
relationship  between  X  and  Y  in  the  absence  of 
condition  Z.  Similarly,  Equation  1  allows  us  to 
estimate  the  relationship  between  these  variables 
when  Z  is  present  (i.e.,  when  Z  =  1).  Once  again, 
substituting  the  value  of  the  modifying  variable 
into  Equation  1,  we  get 

Y  =  P0  +  PjX  +  P2(l)  +  p3X(l)  +  £ , 
which  simplifies  to 

Y  —  (P0  +  P2)  +  (Pi  +  P3)X  +  E> 

which  shows  that  Equation  1  allows  us  to  estimate 
the  intercept  (P0  +  p,)  and  slope  (P3  +  P3)  of  the 
linear  relationship  between  X  and  Y  in  the  presence 
of  Z.  It  is  the  ability  to  yield  estimates  of  relation¬ 
ship  between  X  and  Y  for  the  case  where  the 
hypothesized  condition  is  met  as  well  as  when  the 
condition  is  not  met — and  the  ability  to  compare 
those  estimates  in  a  straightforward  manner — that 
makes  multiplicative  interaction  effects  models  a 
powerful  tool  for  analyzing  context-conditional 
claims. 

The  promise  offered  by  interactive  models  has, 
more  often  than  not,  gone  unfulfilled  as  a  result  of 
careless  application  despite  the  relative  simplicity 
of  the  technique.  Brambor,  Clark,  and  Golder 
argue  that  the  most  common  errors  found  in  the 
literature  related  to  multiplicative  interaction  mod¬ 
els  are  (a)  the  failure  to  include  them  when  appro¬ 
priate,  (b)  omission  of  the  individual  terms  that 
make  up  the  interaction,  (c)  failure  to  calculate  the 
theoretically  relevant  quantity  of  interest,  and 
(d)  the  standard  error  of  the  same. 

The  previous  discussion  assumed  that  the  modi¬ 
fying  variable,  Z,  was  dichotomous,  but  it  need 
not  be  so.  In  general,  we  can  examine  how  the 
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relationship  between  X  and  Y  depends  on  Z  by 
taking  the  partial  derivative  of  (1)  with  respect 
to  X: 


and  we  can  see  that  according  to  (1)  the  relation¬ 
ship  between  X  and  Y  is  a  linear  function  of  Z. 
This  derivative  is  often  referred  to  as  the  marginal 
effect  of  X  on  Y  at  some  level  of  Z.  When  Z  is 
equal  to  zero,  the  effect  of  X  on  Y  is  Pr  But  the 
magnitude  and  sign  of  the  marginal  effect  of  X  on 
Y  can  change  as  Z  changes  (to  the  extent  that  P3  is 
nonzero). 

This  simple  setup  allows  the  researcher  to  test 
many  types  of  context-conditional  claims.  For 
example,  the  modifying  variable,  Z,  can  either 
augment  or  inhibit  the  relationship  between  X  and 
Y.  Typically,  if  Z  is  an  augmenting  variable,  the 
sign  of  the  coefficient  on  interaction  term  (P3)  will 
have  the  same  sign  as  the  coefficient  on  the  vari¬ 
able  being  modified  (p3),  but  if  Z  is  an  inhibitor, 
the  sign  of  the  coefficient  on  the  interaction  term 
will  have  the  opposite  sign  of  the  coefficient  on  the 
variable  being  modified.  If  the  theory  being  tested 
claims  that  Z  either  augments  or  inhibits  the  effect 
of  X  on  Y,  then  it  is  also  useful  to  check  to  see  if  X 
actually  has  the  hypothesized  effect  on  Y  at  some 
levels  of  Z.  To  do  this,  one  can  calculate  the  mar¬ 
ginal  effect  of  X  on  Y  across  the  observed  range 
of  Z  and  compare  these  conditional  marginal 
effects  with  their  associated  standard  errors  to 
determine  if  the  marginal  effect  is  (a)  ever  statisti¬ 
cally  distinguishable  from  zero,  (b)  always  statis¬ 
tically  distinguishable  from  zero,  or  (c)  is  only 
statistically  distinguishable  from  zero  at  certain 
values  of  Z.  If  the  modifying  variable  is  either 
dichotomous  or  ordinal,  it  is  easy  to  report  the 
effect  of  X  on  Y  in  a  simple  table  of  marginal  effects 
and  their  associated  standard  errors — a  set  for  each 
of  the  values  that  Z  takes  on.  If  Z  is  a  continuous 
variable,  it  may  be  easier  to  present  this  information 
by  plotting  the  line  p,  +  p,Z  and  its  associated  con¬ 
fidence  interval  over  the  full  range  of  Z. 

The  practical  implications  are  likely  to  be  quite 
different  depending  on  which  of  these  three  situa¬ 
tions  is  the  case.  For  example,  if  Y  is  some  factor 
that  is  considered  desirable  and  one  believes  that  it 


is  brought  about  by  an  increase  in  X  but  only  in 
the  presence  of  Z,  it  should  be  the  case  that  P3  is 
greater  than  zero  and  p3  +  p3Z  is  greater  than  zero 
when  Z  is  sufficiently  high.  It  is  possible  that,  con¬ 
sistent  with  its  role  as  an  augmenting  modifying 
variable,  P3  is  significantly  greater  than  zero  but 
that  p3  +  P3Z  is  not  significantly  greater  than  zero 
over  any  of  the  observed  ranges  of  Z.  If  this  is  true, 
then  there  is  no  evidence  of  a  relationship  between 
X  and  Y  for  Z  to  modify,  and  there  would  be  no 
basis  to  argue  for  supplying  Z  because  it  encour¬ 
ages  the  beneficial  consequences  of  X. 

In  addition  to  inhibiting  or  augmenting  a  rela¬ 
tionship  between  X  and  Y,  the  simple  model  in 
Equation  1  can  be  used  to  capture  the  case  where 
the  modifying  variable  changes  the  sign  on  the 
marginal  effect  capturing  the  relationship  between 
X  and  Y.  It  is  possible  that  for  some  values  of  Z, 
X  is  positively  associated  with  Y,  while  for  other 
values  of  Z,  it  is  negatively  associated  with  Y. 
Evidence  of  this  would  be  found  if  p3  +  P3Z  is 
significantly  greater  than  zero  for  some  values  of 
Z  and  significantly  less  than  zero  for  other  values 
of  Z. 

It  should  be  noted  that  the  basic  logic  of  interac¬ 
tion  outlined  here  can  be  extended  to  other  types 
of  modifying  relationships  and  to  estimators  other 
than  OLS.  For  example,  the  effect  of  X  on  Y  may 
depend  on  the  value  of  X  itself  (as  in  quadratic, 
cubic,  and  higher  order  specifications).  The  effect 
of  X  may  depend  on  the  value  of  more  than  one 
modifying  variable  and  even  the  interaction 
between  some  or  all  of  those  modifying  variables. 
The  method  is  quite  flexible  and  one  should,  with 
some  creativity,  be  able  to  specify  a  model  that 
with  some  additional  calculation  will  produce  the 
quantity  of  interest  implied  by  the  theory  being 
evaluated. 

William  Roberts  Clark 
University  of  Michigan 
Ann  Arbor,  Michigan,  United  States 
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Interdependence 


In  political  science,  interdependence  refers  to  inde¬ 
pendent  social  actors  who  wish  to  preserve  their 
identity  but  who  are  structurally  affected  by  one 
another’s  behavior,  whether  they  like  it  or  not. 
The  actors  are  involved  in  each  other’s  affairs 
(functional  or  integrative  interdependence),  or 
they  are  part  of  the  same  system  (systemic  interde¬ 
pendence).  These  two  types  of  interdependence  do 
not  exclude  one  another:  If  actors  are  part  of  the 
same  system,  they  are  indirectly  involved  in  one 
another’s  affairs,  and  conversely,  social  systems 
are  based  on  the  involvement  of  actors  with  each 
other.  In  terms  of  the  structure-agency  debate, 
however,  it  makes  sense  to  distinguish  systemic 
interdependence  from  the  behavioral,  more 
policy-dependent,  types  (see  Figure  1). 


Functional  interdependence 


Integrative  interdependence 


Figure  I  Three  Types  of  Structural  Interdependence 

Source:  de  Wilde,  J.  H.  (1991).  Saved  from  oblivion: 
Interdependence  theory  in  the  first  half  of  the  20th  century 
(p.  20).  Aldershot,  UK:  Dartmouth. 


The  structural  effect  can  be  operationalized  into 
three  qualitative  dimensions  (see  Figure  2).  They 
express  how  given  actors  can  be  mutually  affected 
by  one  another’s  behavior.  The  first  dimension 
concerns  the  degree  of  mutual  involvement.  This 
relates  both  to  the  quantitative  characteristics  of 
the  intertwinement  and  to  the  existential  value  of 
the  relationship  for  the  actors. 

The  second  dimension  concerns  the  character 
of  mutual  involvement:  confrontational  versus 
constructional  interdependence.  The  difference 
is  related  to  two  connotations  of  the  term 
dependence:  subordination,  and  need  or  want. 
The  more  idealist  expressions  of  interdependence 
rhetoric  generally  refer  to  the  second  connotation: 
interdependence  as  the  consequence  of  a  mutually 
perceived  necessity  of  cooperation  or  interdepen¬ 
dence  as  a  symbiosis — two  can  do  more  than  one. 
Sometimes  this  is  referred  to  as  positive  interde¬ 
pendence.  Cooperation  creates  a  surplus  value. 
Especially,  economic  interdependence  is  often 
emphasized  in  this  context.  Flowever,  interdepen¬ 
dence  can  also  be  the  consequence  of  a  mutually 
perceived  necessity  of  obstruction  or  competition. 
There  is  a  stalemate:  Due  to  the  other(s),  neither 
side  can  reach  its  goals.  Confrontational  interde¬ 
pendence  is  typical  for  a  balance  of  power.  One 
classical  example  is  the  Cold  War  situation  of 
Mutual  Assured  Destruction  between  the  United 
States  and  the  former  USSR. 

The  third  dimension  concerns  the  ( a) symmetry 
of  mutual  involvement:  the  distribution  of  costs 
and  benefits.  Even  in  the  case  of  a  surplus  value, 
there  tends  to  be  conflict  about  the  price  to  be 
paid.  This  can  relate  to  the  contractual  conditions, 
as  is  the  case  in  conflicts  between  employers  and 
labor  unions.  It  can  also  relate  to  structural  asym¬ 
metries  in  the  world  economy,  as  is  the  case  in 
North-South  relations.  It  can  relate  to  the  uneven 
distribution  of  costs  of  preventive  measures  versus 
costs  of  failing  measures,  as  in  many  environmen¬ 
tal  issues  (e.g.,  the  price  for  preventing  upstream 
pollution  vs.  the  price  of  cleaning  costs  down¬ 
stream).  Additionally,  distributional  issues  are 
about  free-rider  dilemmas.  It  can  also  relate  to 
issue  linkages,  for  example,  the  impact  of  func¬ 
tional  and  integrative  interdependence  on  the 
identities  of  the  participating  actors.  Finally,  the 
existential  value  of  the  relationship  can  be  asym¬ 
metrical.  Robert  Keohane  and  Joseph  Nye  Jr. 
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Figure  2  Qualifications  of  Structural  Interdependence 

Source:  de  Wilde,  J.  H.  (1991).  Saved  from  oblivion:  Interdependence  theory  in  the  first  half  of  the  20th  century  (p.  27). 
Aldershot,  UK:  Dartmouth. 


(2001)  distinguish  in  this  context  between  sensitiv¬ 
ity  costs  and  vulnerability  costs.  Some  actors  may 
be  able  to  develop  alternatives  for  the  lost  profit, 
whereas  other  actors  may  lack  such  options. 

The  concept  of  interdependence  has  a  long  his¬ 
tory  in  political  science,  but  it  is  seldom  defined 
carefully.  Often,  it  serves  rhetorical  purposes, 
emphasizing  a  need  for  cooperation,  even  (or  espe¬ 
cially)  when  conflicts  are  paramount.  As  such,  it 
can  be  traced  back  in  the  academic  literature  to  the 
late  19th  century,  when  the  first  generation  of 
scholars  started  to  talk  about  globalization  and  its 
consequences. 

Generally,  the  rhetoric  of  interdependence  is 
associated  with  progressive  liberal  thought,  in 
which  trade  relations  and  social  contract  are  val¬ 
ued  as  mutually  profitable  for  those  involved. 
Particularly  in  the  work  of  Sir  Norman  Angell,  this 
is  developed  into  one  of  the  first  interdependence 
theories.  But  left-wing  syndicalists,  such  as  Francis 
Delaisi,  also  expected  that  economic  interdepen¬ 
dence  would  provide  a  structure  that  would  end 
political  myths  about  the  ideal  of  the  nation-state. 
In  Les  contradictions  du  monde  moderne,  pub¬ 
lished  in  1925,  he  emphasized  the  need  to  manage 
interdependence  by  means  of  permanent  interna¬ 
tional  institutions.  This  comes  close  to  the  theory 


of  functionalism  as  developed  by  David  Mitrany 
and  the  pluralist  theories  of  Charles  Merriam.  A 
more  neutral  early  attempt  to  develop  interdepen¬ 
dence  as  an  analytical  concept  can  be  found  in  The 
Interdependent  World  and  Its  Problems ,  published 
in  1933  by  the  historian  Ramsay  Muir. 

All  of  this  literature  shares  a  normative  concern 
about  the  need  to  manage  globalization:  Inter¬ 
dependence  means  trouble  but  only  when  it  is 
neglected.  At  the  same  time,  the  literature  has  a 
word  of  admiration  for  the  achieved  levels  of  glo¬ 
balization.  There  is  a  hope  for,  and  a  belief  in,  the 
possibility  of  social  learning.  Therefore,  interde¬ 
pendence  theories  are  generally  put  in  the  idealist 
tradition  in  international  relations,  even  though 
they  are  far  from  utopian. 

The  contemporary  interest  in  the  concept  origi¬ 
nates  in  the  book  Power  and  Interdependence,  first 
published  in  1977  by  Keohane  and  Nye.  They 
worked  on  the  same  themes  as  the  first  generation 
of  theorists:  the  functional  development  of  interna¬ 
tional  organization  in  response  to  (transnational) 
globalization  processes.  An  important  difference  is 
empirical:  When  Norman  Angell,  in  1918,  argued 
for  the  need  to  create  a  protective  union  of  democ¬ 
racies,  this  was  a  utopian  idea,  whereas  NATO 
had  existed  for  almost  3  decades  when  Keohane 
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and  Nye  reflected  on  existing  international  regimes 
in  1972  and  1977.  In  addition,  they  emphasized 
the  consequences  of  conscious  manipulation  of 
interdependencies  for  purposes  of  political  power. 
Especially  in  terms  of  soft  poiver,  a  term  later 
coined  by  Nye  in  1990,  asymmetries  in  the  inter- 
twinement  will  be  contested  and  can  be  exploited 
by  the  stronger  side.  The  limit  of  this  type  of  rela¬ 
tional  power,  however,  is  set  by  the  reciprocity 
inherent  in  interdependence:  The  continued  exis¬ 
tence  of  the  relationship  is  a  precondition  for 
exploitation  of  its  political  power. 

Also,  in  political  rhetoric,  both  the  1970s  and 
the  1990s  showed  waves  of  popular  use  of  the 
term.  Politicians,  especially  those  working  for 
international  organizations,  appeal  to  interdepen¬ 
dence  to  silence  their  opposition:  Global  interests 
should  take  precedence  over  local,  ethnocentric 
interests;  or,  one  step  further,  the  local,  ethnocen¬ 
tric  interests  in  the  long  term  require  that  immedi¬ 
ate  priority  is  given  to  the  global  interest.  In  this 
speech-act  capacity,  interdependence  asks  for  soli¬ 
darity  where  it  is  lacking  (North-South  context), 
for  cooperation  where  it  is  lacking  (East-West 
context  during  the  Cold  War),  and  for  common 
action  where  it  is  lacking  (global  challenges,  e.g., 
in  the  environmental  realm). 

A  rather  different  approach  to  interdependence 
has  been  developed  by  the  sociologist  Norbert  Elias. 
His  insights  are  especially  relevant  for  understand¬ 
ing  the  social  impact  of  the  complexity  that  is  typi¬ 
cal  for  situations  of  complex  interdependence,  as 
well  as  the  interdependences  created  by  a  division  of 
labor  (functional  differentiation).  In  line  with 
Mitrany’s  findings,  Elias  emphasizes  that  increasing 
interdependence  leads  simultaneously  to  increased 
collectivization  and  increased  individualization. 

Isolationism  is  the  first  policy  response  to  the 
discovery  of  complexity  and  dependence — an 
attempt  to  reduce  costs  and  complexity  by  neglect¬ 
ing  other  actors  and  by  pulling  out  (e.g.,  U.S.  poli¬ 
cies  toward  Europe  and  Japan  after  World  War  I). 
When  this  does  not  work,  the  second  response  is 
imperialism — an  attempt  to  reduce  costs  and  com¬ 
plexity  by  controlling  or  conquering  other  actors 
(e.g.,  German  and  Japanese  policies  after  World 
War  I).  Only  when  this  does  not  work  either  does 
a  third  response  emerge:  the  acceptance  of  plural¬ 
ism  and  a  willingness  to  compromise  in  face  of 
shared  overarching  interests  (e.g.,  the  creation  of 


the  United  Nations  and  the  Bretton  Woods  system 
after  World  War  II).  This  tension  between  the 
structural  dimension  of  interdependence  (i.e.,  a 
practical  impossibility  to  achieve  aims  unilaterally) 
and  the  cognitive  dimension  of  interdependence 
(i.e.,  the  interpretation  of  the  structure  by  the  actors 
involved)  is  at  the  heart  of  the  academic  and  social- 
political  discourses  on  interdependence.  Overall, 
the  concept  of  interdependence  has  discursive  func¬ 
tions  in  trying  to  show  that  actors  cannot  avoid 
dealing  with  one  another  and  analytical  functions 
in  trying  to  map  out  the  structure  of  world  politics. 

Jaap  de  Wilde 
University  of  Groningen 
Groningen,  Netherlands 
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Interest  Groups 


Interest  groups  are  formal  organizations,  usually 
based  on  individual  voluntary  membership,  that 
seek  to  influence  public  policies  without  assuming 
government  responsibility.  The  majority  of  interest 
groups  do  not  have  primarily  political  goals.  They 
are  normally  engaged  in  a  disparate  range  of 
activities — work,  recreational,  philanthropic, 
humanitarian,  and  cultural — that  develop  indepen¬ 
dently  in  society.  Such  groups  enter  the  political 
arena  when  they  require  some  form  of  public  inter¬ 
vention  (financial,  administrative,  etc.)  to  support 
their  interests  or  when  they  wish  to  influence  the 
adoption  or  implementation  of  government  deci¬ 
sions  so  as  to  secure  some  advantage  or  to  protect 
the  interests  they  represent.  The  goals  of  interest 
groups  may  be  very  specific  or  may  be  intended  to 
make  an  impact  on  the  entire  political  community. 
The  strategies  used  to  influence  decision  makers 
involve  the  combination  of  a  repertoire  of  tactics, 
the  breadth  of  which  varies  in  relation  to  each 
group’s  organizational  resources.  In  this  entry,  the 
origins  and  types  of  such  groups  and  their  respec¬ 
tive  functions  in  political  decision-making  pro¬ 
cesses  are  discussed.  Their  role  with  regard  to  the 
quality  of  democracy  is  also  critically  examined. 

Since  the  end  of  the  19th  century,  there  has  been 
an  exponential  growth  in  the  number  of  interest 
groups,  prompted  by  the  changes  that  have  contrib¬ 
uted  to  shaping  the  contemporary  world. 
Industrialization  has  played  a  decisive  role  in  this 
respect,  favoring  a  specialization  of  production  and 
services  that  has  in  turn  given  a  strong  impulse  to 
the  establishment  of  myriad  groups  with  differenti¬ 
ated  social  interests.  The  formation  and  consolida¬ 
tion  of  nation-states  has  been  equally  important. 
Industrialization  and  the  emergence  of  nation-states 
also  engendered  numerous  conflicts  of  a  social, 
political,  and  cultural  nature  that  contributed  to 
generating  collective  movements,  interest  groups, 
and  parties.  The  further  development  of  interest 
groups  was  encouraged  by  the  extraordinary  condi¬ 
tions  created  by  the  two  world  wars,  which  forced 
governments  to  involve  the  organizations  of  civil 
society  in  managing  the  emergency  created  by  the 
wars.  The  establishment  of  new  groups  was  also 
stimulated  by  the  increasingly  pervasive  role  of  the 
state  in  society  in  the  wake  of  the  development  of 


the  welfare  state,  which  created  new  needs  and 
new  interests. 

The  type  of  political  regime  that  offers  the  most 
opportunities  for  groups  to  organize  themselves 
independently  and  to  influence  decision  makers  is 
liberal  democracy.  By  contrast,  in  totalitarian  or 
authoritarian  regimes,  interest  groups  are  estab¬ 
lished,  controlled,  or  backed  by  a  single  party  or 
state  and  often  used  exclusively  as  instruments  for 
mobilizing  consensus.  This  entry  deals  only  with 
interest  groups  active  in  contemporary  democra¬ 
cies,  as,  indeed,  does  virtually  all  the  literature 
devoted  to  these  political  actors.  Particular  atten¬ 
tion  is  given  to  examining  the  general  functions  of 
interest  groups  in  the  political  system  and  the 
obstacles  and  opportunities  they  encounter  in  the 
various  phases  of  the  process  involved  in  influenc¬ 
ing  public  decisions. 

Definition 

The  above  definition  indicates  three  distinctive 
traits  of  interest  groups.  First,  they  are  described  as 
formal  organizations — in  other  words,  fairly  per¬ 
manent  structures  governed  by  clearly  defined 
rules  and  with  an  identifiable  membership  and 
leadership.  These  characteristics  are  absent  in 
informal  associations — for  example,  certain  groups 
of  influential  citizens,  anomic  groups,  professional 
elites — which  are  sometimes  included  in  the  family 
of  interest  groups.  This  should,  however,  be 
avoided,  because  the  intrinsic  difficulties  of  ascer¬ 
taining  the  presence  of  informal  groups  would 
make  any  reliable  census  of  groups  active  in  differ¬ 
ent  policy  arenas  impossible,  and  an  analysis  of  the 
inner  workings  of  such  nebulous  associations 
would  be  no  less  problematic. 

The  second  distinctive  feature  of  interest  groups 
is  that  they  are  made  up  of  individuals  who  join  on 
a  voluntary  basis.  Members  usually  enter  freely, 
participate  in  the  group’s  activities,  and  discuss  the 
objectives  proposed  by  the  leaders.  If  they  are  dis¬ 
satisfied,  they  leave  the  organization.  Analysis  of 
the  internal  functioning  of  groups  clarifies  the 
conditions  that  favor  or  discourage  participation 
and  collective  action.  It  should  also  be  added  that 
there  is  a  type  of  interest  group  where  membership 
is  not  entirely  voluntary.  These  are  institutional 
interest  groups ,  which,  as  discussed  herewith,  are 
not  vehicles  for  democratic  participation. 
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The  third  qualifying  element  of  interest  groups  is 
the  particular  way  in  which  they  try  to  influence 
policy  making.  They  do  so  without  seeking  govern¬ 
ment  office;  indeed,  they  do  not  present  electoral 
lists.  In  this  respect,  interest  groups  differ  from 
political  parties,  for  whom  it  is  important  to  take 
part  in  elections  in  order  to  be  able  to  exercise 
political  power.  As  interest  groups  usually  represent 
fairly  circumscribed  interests,  they  cannot  hope  to 
win  elections  and  so  prefer  to  convince  the  elected 
representatives  rather  than  the  voters  of  the  validity 
of  their  case.  This  difference  between  political  par¬ 
ties  and  interest  groups  seems  to  be  at  least  partly 
contradicted  in  the  real  world  by  the  frequent  pres¬ 
ence  of  small,  single-issue  political  parties  and  large 
interest  groups — for  instance,  trade  unions — that 
have  a  wide  range  of  concerns.  It  is  also  true  that 
political  parties  appear  to  be  increasingly  incapable 
of  representing  general  concerns,  tending  instead  to 
turn  into  vehicles  of  economic  interests.  Nevertheless, 
it  is  precisely  to  offset  their  diminished  ability  to 
aggregate  political  demands  that  parties  tend  to 
increase  their  delegative  powers  by  occupying  pub¬ 
lic  offices,  while  interest  groups  limit  themselves  to 
supporting,  during  electoral  campaigns,  those  par¬ 
ties  that  declare  a  willingness  to  respond  to  their 
requests;  alternatively,  they  might  endorse  individ¬ 
ual  politicians  or  place  some  of  their  representatives 
in  a  party’s  list  of  candidates. 

Types  of  Interest  Groups 

The  most  common  typology  of  interest  groups 
employed  in  the  literature  distinguishes  between 
sectional  or  economic  groups  and  promotional  or 
cause  groups,  which  are  organized  to  express  par¬ 
ticular  policy  objectives.  Economic  groups  reflect 
the  major  “interests”  of  society  and  represent 
people  who  join  the  organization  mainly  because 
of  their  occupation.  The  three  main  types  of  eco¬ 
nomic  interest  groups  are  trade  unions,  business 
(employers),  and  professional  organizations  repre¬ 
senting  lawyers,  doctors,  engineers,  and  so  on. 

Cause  groups  consist  of  organizations  that  pur¬ 
sue  goals  not  directly  related  to  the  material  inter¬ 
ests  of  their  members.  These  groups  are  voluntary 
organizations  formed  by  people  who  share  some 
common  aspiration  unrelated  to  their  jobs.  The 
cause  group  category  can  potentially  be  divided  into 
a  multitude  of  subgroups,  because  it  encompasses 


all  sorts  of  voluntary  organizations.  Cause  groups 
include  cultural,  religious,  and  recreational  associa¬ 
tions;  organizations  for  the  promotion  of  human 
rights  and  the  welfare  state;  charities;  and  environ¬ 
mental  and  peace  groups.  Many  of  them  are  also 
public  interest  groups — namely,  organizations  that 
seek  to  promote  collective  goods  by  securing  the 
approval  of  policies  whose  benefits  may  be  shared 
equally  by  all  people,  irrespective  of  their  member¬ 
ship  or  support  of  a  given  group.  The  development 
of  public  interest  groups  from  the  1960s  onward 
was  a  consequence  of  more  widespread  education, 
the  adoption  of  more  sophisticated  means  of  mass 
communication,  social  protest,  and  the  introduction 
of  extensive  social  welfare  programs. 

Among  cause  groups,  another  category  that 
deserves  mention  are  the  “not  in  my  backyard” 
groups,  otherwise  known  as  NIMB  Ys,  due  to  their 
widespread  growth  in  recent  years  in  many  coun¬ 
tries.  NIMBYs  are  hybrid  groups  because  the  moti¬ 
vation  of  members  is  basically  material  and  brings 
immediate  benefits  for  members  of  that  group  only. 
The  main  goal  of  NIMBYs  is  almost  always  to 
defend  the  area  in  which  their  membership  lives. 
Typical  demands,  often  accompanied  by  highly  vis¬ 
ible  forms  of  protest,  are  that  a  nuclear  power  sta¬ 
tion  or  a  radioactive  waste  dump,  prison,  airport, 
or  main  road  be  built  anywhere  else  rather  than  “in 
my  backyard.”  The  objectives  of  NIMBYs  may 
also  find  support  among  people  outside  the  “threat¬ 
ened”  area,  in  the  name  of  a  wider,  shared  cause 
such  as  environmental  conservation  or  the  battle  to 
create  alternative  energy  sources.  In  this  respect, 
NIMBYs  can  be  related  to  cause  groups  and  among 
these,  in  some  cases,  to  public  interest  groups. 

The  distinction  between  economic  and  cause 
groups  has  led  to  the  formulation  of  various  hypoth¬ 
eses  about  how  they  function  and  their  relationship 
with  decision  makers.  It  has  been  argued,  for  exam¬ 
ple,  that  economic  groups  are  more  long  lasting 
because  there  are  material  incentives  at  stake  for  the 
membership  that  cause  groups  are  less  likely  to  be 
able  to  offer  (see  below).  It  has  also  been  claimed 
that  cause  groups,  even  those  with  a  large  following, 
have  a  limited  impact  on  policy  making  because  they 
are  barred  from  the  corridors  of  power,  while  eco¬ 
nomic  groups,  in  particular  business  groups,  have 
easier  access.  This  claim  has  been  contested  by  Wyn 
Grant,  who  distinguishes  between  insider  groups, 
which  are  consulted  regularly  by  government 
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bodies,  and  outsider  groups,  which  do  not  want  or 
are  unable  to  establish  privileged  relationships  with 
policymakers.  According  to  Grant,  cause  groups 
may  also  become  insiders,  forging  special  relations 
with  public  officials.  A  further  source  of  debate  is 
the  notion  that  insiders  are  more  influential  than 
outsiders;  this  too  has  come  in  for  criticism  on  the 
grounds  that,  in  many  cases,  the  latter  are  also 
capable  of  conditioning  the  political  agenda. 

Mention  must  also  be  made  of  institutional 
interest  groups,  which  may  be  either  public  or  pri¬ 
vate.  Institutional  public  groups  are  local  or 
regional  governments  and  the  various  branches  of 
central  government,  which  compete  with  each 
other  to  influence  the  political  process  to  their  own 
advantage.  Institutional  private  groups  are  social 
organizations — industrial  and  financial  compa¬ 
nies,  universities,  churches,  and  hospitals — which, 
in  certain  circumstances,  may  commit  a  propor¬ 
tion  of  their  resources  to  influencing  the  decision¬ 
making  process. 

As  has  already  been  said,  the  issue  of  democratic 
participation  is  largely  extraneous  to  institutional 
groups.  Unlike  economic  and  cause  groups,  institu¬ 
tional  groups  are  hierarchical  organizations  run  in  a 
managerial  fashion  by  leaders  whose  main  concern 
is  to  ensure  the  continuity  of  their  institution  irre¬ 
spective  of  the  specific  interests  of  the  membership. 
The  decision  to  enter  the  political  arena  is  generally 
taken  by  leaders  without  prior  consultation  of  the 
membership,  whether  they  are  blue-  or  white-collar 
workers,  shareholders,  worshippers,  or  patients.  In 
other  words,  in  contrast  to  economic  and  cause 
groups,  institutional  groups  have  less  need  to  obtain 
the  approval  of  their  membership  to  justify  their 
political  engagement  or  to  address  their  needs  and 
concerns,  according  to  Robert  Salisbury  (1984). 
Institutional  groups  occupy  an  important  position 
in  the  branch  of  literature  devoted  to  the  analysis  of 
policy  making.  This  is  inevitable,  because  they  have 
always  had  a  dominant  role  in  U.S.  policy  making, 
at  both  state  and  federal  levels;  are  always  widely 
present  in  policy  networks  active  in  the  European 
Union;  and  manage,  obviously,  to  condition  the 
choices  of  the  various  national  governments. 

Levels  of  Organizational  Complexity 

Many  interest  groups  have  a  simple  organiza¬ 
tional  structure.  Their  political  action  begins  and 


ends  inside  the  boundaries  defined  by  the  local 
territory  in  which  they  operate.  For  instance,  many 
public  interest  groups  involved  in  environmental 
preservation  have  as  their  exclusive  political  tar¬ 
gets  the  city  council  and/or  the  regional  public 
authorities.  Other  groups  have  a  more  complex 
organizational  structure  with  basic  units  embed¬ 
ded  in  a  national  structure.  These  second-level 
organizations  help  shape  and  unify  the  disparate 
demands  of  first-level  organizations,  the  advantage 
being  that  it  is  possible  to  supply  decision  makers 
with  aggregate  information  and  to  formulate  more 
effective  requests. 

Third,  there  are  also  “umbrella”  organizations 
called  peak  associations,  which  coordinate  the 
activities  of  various  second-level  associations  from 
different  sectors.  For  example,  in  Germany,  the 
main  unions  of  the  various  categories  of  workers 
belong  to  the  Deutsche  Gewerkschaftsbund  (DGB), 
which  is  analogous  to  the  American  Federation  of 
Tabor  and  Congress  of  Industrial  Organizations 
(AFF-CIO)  in  the  United  States  or  the  Trade  Union 
Congress  in  the  United  Kingdom.  In  the  EU,  work¬ 
ers  and  employers  alike  are  usually  represented  by 
second-level  associations  and  by  intersectorial 
peak  associations.  But  many  big  corporations  that 
represent  themselves  directly  can  also  be  found. 
There  is  also  an  associative  level  that  includes, 
under  a  broad  umbrella,  the  peak  associations  of 
different  member  states,  giving  rise,  for  example, 
to  the  European  Trade  Union  Confederation  and 
the  Union  of  Industrial  and  Employers’  Confed¬ 
erations  of  Europe. 

Second-level  and  peak  organizations  are  wide¬ 
spread  among  economic  interest  groups  but  are 
rare  among  cause  groups,  perhaps,  as  Salisbury 
(1975,  p.  187)  suggests,  because  the  latter  have  a 
shorter  lifetime.  For  this  reason,  they  do  not 
reach  the  degree  of  organizational  complexity 
that  leads  to  the  formation  of  second-level  and 
peak  associations. 

Interest  Groups  and  Public  Policy  Making: 

Pluralist  and  Neo-Corporatist  Approaches 

The  two  most  important  approaches  that  have 
been  adopted  in  the  literature  on  interest  groups 
are  pluralism  and  neo-corporatism.  Pluralism  actu¬ 
ally  started  in  Europe  at  the  beginning  of  the  20th 
century  as  a  reaction  to  the  monistic  theory  of  the 


1222  Interest  Groups 


state.  However,  it  was  principally  developed  in  the 
United  States  as  an  empirical  approach  to  the 
analysis  of  political  phenomena,  through  a  series 
of  works  on  the  American  political  system  pub¬ 
lished  between  the  1920s  and  the  1960s.  Later 
adaptations  have  enabled  pluralism  to  survive  the 
numerous  criticisms  received  in  the  second  half  of 
the  century,  when  the  major  weaknesses  of  the 
approach  were  highlighted. 

According  to  pluralists,  the  political  process  is 
driven  by  competition  between  many  interacting 
groups  that  tend  to  self-limit  their  requests, 
thereby  making  the  political  system  more  stable. 
Competition  is  compatible  with  democracy 
because  negotiation  between  many  interests  favors 
the  dispersion  of  power,  and  so,  according  to 
David  Truman,  no  one  group  is  able  to  perma¬ 
nently  get  the  better  of  the  others.  Robert  Dahl 
concedes  that  some  groups  are  stronger  than  oth¬ 
ers  but  argues  that  the  dispersion  of  political 
power  between  the  host  of  government  institu¬ 
tions  offers  the  possibility  of  access  to  weaker 
groups,  which  then  have  a  chance  of  obtaining  a 
hearing  in  some  crucial  phase  of  the  decision¬ 
making  process.  According  to  pluralists,  the  com¬ 
petition  between  groups  offers  not  only  stability 
and  democratic  compatibility  but  also  the  possi¬ 
bility  of  making  a  decisive  contribution  to  shaping 
public  policies  that  have  a  broad  consensus  among 
citizens.  Pluralists  do  not  believe  in  the  abstract 
concept  of  the  general  interest  because  this  pre¬ 
supposes  that  its  constituent  parts  form  an  indivis¬ 
ible  whole,  while  empirical  evidence  suggests  that 
many  interacting  interests  influence  government 
decisions,  leading  the  way,  but  only  in  a  transient 
fashion,  to  what  is  significant  for  all. 

The  pluralist  approach  has  been  heavily  criti¬ 
cized  for  having  presented  an  erroneous  picture  of 
the  political  system  capable  of  regulating  itself 
exclusively  by  virtue  of  the  competition  and  com¬ 
promises  between  groups.  This  view  neglects  the 
fact  that  the  relations  between  groups  are  not  bal¬ 
anced  because  some  exercise  a  permanent  influence 
while  others  are  marginal  or  insignificant  in  the 
process  of  decision  making.  The  idea  of  leaving  the 
adoption  of  shared  policies  up  to  an  agreement 
between  partisan  interests  appears,  then,  to  be  both 
unrealistic  and  uneven.  Their  legitimacy  would  be 
based  on  existing  power  relations  between  the 
groups,  and  their  implementation  would  be  to  the 


detriment  of  the  weaker  ones.  The  pluralist  illusion 
of  being  able  to  explain  the  functioning  of  the 
political  system  in  terms  of  the  self-regulatory 
activities  of  interest  groups  involves  a  weakened 
state  authority.  In  the  real  world,  however,  state 
intervention  is  indispensable  for  redressing  imbal¬ 
ances  in  opportunities  for  access  to  decision-making 
arenas.  The  cogency  of  these  criticisms  forced  the 
pluralists  to  abandon  the  idea  of  attributing  to 
interest  groups  the  ability  to  ensure  the  governabil- 
ity  of  the  political  system.  This  idea  was,  however, 
resumed  in  Europe  by  the  neo-corporative  approach 
in  its  reassessment  of  the  role  of  the  state. 

Neo-corporatism  is  the  institutionalized  partici¬ 
pation  of  interest  groups  in  government  activities 
that  favors  the  elaboration  of  policies  agreed  on 
jointly  by  major  organized  groups  and  decision 
makers.  Some  features  of  neo-corporatist  agree¬ 
ments  can  also  be  found  in  the  state  corporatism 
established  in  authoritarian  regimes,  such  as  in 
Portugal  and  Spain  between  the  two  world  wars. 
However,  neo-corporatism  can  be  distinguished 
from  state  corporatism  because,  according  to 
Philippe  Schmitter,  the  former  is  a  societal  corpo¬ 
ratism  that  is  not  imposed  from  above  but  arises  in 
a  voluntary  fashion  from  within  society  and  is 
compatible  with  the  system  of  democratic  represen¬ 
tation.  Instances  of  a  neo-corporatist  system  were 
identified  in  various  continental  European  coun¬ 
tries  in  the  1970s  and  1980s,  for  example,  in 
Scandinavia  and  Austria.  Subsequently,  the  num¬ 
ber  of  studies  based  on  the  neo-corporatist  approach 
diminished  considerably,  because  the  changes  that 
have  occurred  in  the  international  economic  system 
have  forced  many  governments  to  prefer  public 
policy  strategies  centered  on  the  market  rather  than 
agreements  stipulated  between  groups,  which  are 
the  most  distinctive  trait  of  neo-corporatism. 

In  a  neo-corporatist  agreement,  the  protagonists 
of  tripartite  negotiations  with  a  government  are  the 
representatives  of  capital  and  labor;  the  policies 
adopted  are  economic  and  relate  in  particular 
to  the  control  of  prices  and  salaries.  For  the  neo- 
corporatists,  the  state,  in  contrast  to  the  way  in 
which  it  is  depicted  by  the  pluralists,  possesses 
extensive  powers  that  are  used  to  authorize,  regu¬ 
late,  and  sanction  the  activities  of  organizations  to 
ensure  that  they  respect  stipulated  agreements  and 
collaborate — sometimes  by  exercising  delegated 
public  functions — to  implement  agreed  policies. 
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Unions  and  employers’  associations  respect  com¬ 
mitments  undertaken  with  government  because 
they  have  a  monopoly  on  representation  in  their 
sector  and  exercise  effective  control  over  their 
members,  who  are  often  obliged  to  lend  their  sup¬ 
port,  because  respect  for  various  binding  norms  is 
a  condition  of  membership.  The  importance  of  the 
control  functions  assumed  by  groups  over  their 
membership  has  led  neo-corporatists  to  replace  the 
traditional  expression  representation  of  interests 
with  of  the  term  intermediation  of  interests. 

A  favorable  condition  for  trilateral  negotiations 
is  the  presence  of  a  prolabor  party  in  the  govern¬ 
ment.  Unions  take  part  because  they  feel  supported 
by  the  left-wing  party,  which  offers,  in  return  for 
moderate  wage  claims,  measures  to  boost  employ¬ 
ment  and  a  commitment  to  maintaining  the  wel¬ 
fare  state.  In  turn,  employers’  associations  accept 
negotiations  because  they  expect  the  government 
to  maintain  a  low  level  of  union  conflict,  thanks  to 
their  influence  over  the  unions,  and  to  support 
investment  in  industry.  The  factors  that  enable 
trilateral  negotiations,  according  to  neo-corporatist 
scholars,  make  it  possible  to  produce  better  and 
socially  more  equitable  public  policies  than  those 
implemented  by  executives  operating  without  the 
institutionalized  collaboration  of  large,  organized 
interest  groups. 

Neo-corporatism  has  greater  theoretical  ambi¬ 
tions  than  pluralism,  but  these  have  not  been 
achieved  because  no  generally  accepted  theory  has 
ever  been  provided  regarding  the  nature  of  neo- 
corporative  agreements.  Are  they  an  extension, 
within  the  state  machine,  of  the  pluralist  represen¬ 
tation  of  interests?  Or  have  they  been  “captured” 
by  the  state,  subordinating  them  to  state  control? 
Furthermore,  the  numerous  political  and  socioeco¬ 
nomic  prerequisites  necessary  for  the  establish¬ 
ment  of  a  neo-corporatist  system  have  inevitably 
meant  that  the  empirical  cases  are  limited  to  the 
Scandinavian  countries,  the  Netherlands,  Austria, 
Belgium,  and,  to  a  more  limited  degree,  Germany. 
Neo-corporatism  cannot,  therefore,  be  viewed  as  a 
general  theory  of  the  action  of  interest  groups  in 
contemporary  democracies.  Finally,  neo¬ 
corporatism  lacks  a  theory  of  the  representation  of 
groups  that  explains,  logically  and  normatively, 
the  privileged  access  to  decision-making  spheres  of 
certain  groups  (unions  and  employers’  associa¬ 
tions)  rather  than  others. 


Mobilization 

The  study  of  the  conditions  for  collective  action  is 
probably  the  field  of  interest  group  research  in 
which  there  has  been  the  greatest  progress.  Various 
contributions  have  examined  the  fundamental  rea¬ 
sons  that  lead  an  individual  to  join  a  given  organi¬ 
zation,  the  means  adopted  by  leaders  to  recruit 
members,  and  the  contextual  factors  that  influence 
mobilization.  The  preliminary  theoretical  basis  for 
this  strand  of  literature  was  provided  by  Mancur 
Olson  Jr.  in  his  book  The  Logic  of  Collective 
Action  (1965).  Olson  revolutionized  previous 
work  on  the  conditions  of  collective  action  and 
laid  the  foundations  for  the  development  of  a 
coherent  body  of  knowledge.  He  demonstrated, 
thereby  refuting  a  pluralist  claim,  that  there  is  no 
automatic  correspondence  between  the  need  of  a 
given  social  group  or  category  of  people  to  mani¬ 
fest  a  policy  preference  and  the  transformation  of 
that  need  into  organized  action.  The  latter  can 
only  occur  under  certain  conditions  and  with  cer¬ 
tain  resources.  Groups  that  operate  in  unfavorable 
conditions  and  without  adequate  resources  will  be 
penalized  in  their  efforts  to  influence  the  shape  and 
outcome  of  public  decisions. 

According  to  Olson,  in  the  most  frequently 
occurring  situations,  it  is  not  rational  for  an  indi¬ 
vidual  to  support  an  organization  that  pursues  a 
collective  good  even  if  he  or  she  wishes  to  obtain 
it.  The  collective  good,  for  example,  the  conserva¬ 
tion  of  nature  or  the  improvement  of  urban  trans¬ 
port,  is  an  indivisible  good  from  which  everyone 
can  benefit.  For  this  reason,  the  beneficiaries  of 
collective  goods  will  tend  to  avoid  paying  the  costs 
of  sustaining  the  organization  (membership  fees, 
participation  in  meetings  and  demonstrations,  etc.) 
and  act  as  free  riders  instead,  waiting  to  reap  the 
benefits  of  other  people’s  efforts.  Furthermore, 
rational  individuals  calculate  that  the  contribution 
of  their  own  effort  to  the  success  of  a  large  organi¬ 
zation’s  initiatives  will  be  substantially  irrelevant. 
Moreover,  they  take  advantage  of  the  scale  of  the 
organization,  concealing  their  opportunistic  behav¬ 
ior  behind  the  anonymity  of  large  numbers. 

To  overcome  the  difficulties  deriving  from  the 
obstructive  attitude  of  free  riders,  organizations 
are  forced,  according  to  Olson,  to  come  up  with 
selective  incentives  for  its  members  (insurance 
policies,  legal  assistance,  etc.)  or  alternatively  to 
threaten  imposing  sanctions,  a  strategy  resorted  to 
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by  many  unions  in  the  United  States  and  the  UK. 
Invoking  closed-shop  legislation,  they  demand  that 
firms  only  employ  workers  who  are  already  union 
members  or  willing  to  become  members.  Among 
economic  interest  groups,  Olson  adds,  it  should  be 
easier  for  smaller  organizations  to  obtain  support, 
because  they  can  offer  their  membership  particularly 
selective  incentives.  For  example,  groups  represent¬ 
ing  farmers  or  artisans  can  increase  their  member¬ 
ship  by  offering  advantageous  medical  insurance  or 
convenient  individual  retirement  schemes.  Olson 
clarifies  that  his  analysis  mainly  explains  the  behav¬ 
ior  of  economic  interest  groups,  while  it  applies 
poorly  to  associations  that  do  not  offer  incentives 
readily  translatable  into  material  benefits. 

Following  the  path  taken  by  Olson,  James 
Wilson  identified  solidarity  and  purposive  incentives 
in  addition  to  material  ones,  going  on  to  formulate 
a  theory  of  collective  action  applicable  to  all  kinds 
of  interest  group  organizations.  Solidarity  incentives 
offer  supporters  the  gratification  that  comes  from 
feeling  part  of  a  group,  while  purposive  incentives 
induce  the  members  of  an  organization  to  actively 
promote  some  good  that  might  also  benefit  a  wider 
spectrum  of  nonmembers.  Each  of  the  three  types  of 
incentives  is  of  varying  importance,  in  relation  to  the 
other  two,  in  fuelling  individual  support  for  an 
organization.  Individuals  join  and  support  organiza¬ 
tions  for  a  variety  of  reasons  that  are  often  hard  to 
distinguish  from  one  another.  It  should,  therefore, 
be  taken  into  account  that  support  for  an  organiza¬ 
tion  is  often  the  result  of  a  combination  of  all  three 
types  of  incentives. 

Wilson’s  theory,  with  the  importance  it  attri¬ 
butes  to  solidarity  and  purposive  incentives,  gave 
fresh  credit  to  the  widespread  conviction  that  col¬ 
lective  action  is  also  motivated  by  ideals,  without 
any  expectation  of  a  material  reward.  The  greatest, 
and  by  no  means  negligible,  weakness  of  this 
typology  is  that  solidarity  and  purposive  incen¬ 
tives,  unlike  material  ones,  are  hard  to  distinguish 
and  measure,  making  it  difficult  in  turn  to  assess 
their  specific  weight.  The  subsequent  development 
of  the  theory  of  incentives  has,  however,  continued 
to  focus  on  nonmaterial  incentives,  without,  as  a 
consequence,  dispelling  doubts  stemming  from 
theoretical  assumptions  that  have  not  been  ade¬ 
quately  measured  and  checked. 

Olson’s  theory  of  selective  incentives  paved  the 
way  for  a  fuller  understanding  of  the  conditions 


that  facilitate  or  penalize  the  mobilization  of  dif¬ 
ferent  types  of  interest  groups.  Public  interest 
groups  such  as  pacifist,  environmental,  and  child 
protection  organizations  are  among  those  that  are 
penalized,  making  it  harder  for  them  to  enter  the 
political  arena.  They  have  a  very  wide  potential 
membership,  but  the  incentives  are  almost  exclu¬ 
sively  purposive,  with  little  intrinsic  value.  As  the 
only  major  investment  required  is  essentially  emo¬ 
tional,  people  join  but  also  abandon  such  organi¬ 
zations  with  considerable  ease,  making  them 
highly  unstable.  In  this  respect,  they  are  similar  to 
collective  movements.  By  contrast,  economic 
groups,  which  have  a  more  limited  potential  mem¬ 
bership  and  can  offer  quite  considerable  material 
incentives  and  returns,  find  it  easier  to  enter  the 
political  arena  and  establish  themselves  as  key 
interlocutors  for  public  decision  makers. 

Access 

What  difficulties  and  advantages  must  be  taken 
into  account  by  interest  groups  when  seeking  to 
establish  useful  contacts  with  decision  makers? 
An  examination  of  the  issue  of  access  does  not 
clarify  how  groups  manage  to  be  influential  in  the 
decision-making  process  but  does  help  ascertain 
the  more  or  less  favorable  conditions  for  exercis¬ 
ing  influence.  In  liberal  democracies,  the  possibil¬ 
ity  of  access  is  conditioned  by  both  variable  and 
permanent  factors.  Variable  factors  include  the 
resources  available  to  a  group  (size  of  member¬ 
ship,  expertise,  money,  existing  contacts  with 
political  elites,  good  reputation,  etc.)  or  the  exis¬ 
tence  or  otherwise  of  a  public  opinion  favorable 
to  the  issues  interest  groups  wish  to  submit  to 
decision  makers  and  the  political  significance  of 
those  issues. 

The  permanent  factors  that  have  a  bearing  on 
access  include  the  nature  of  a  country’s  institu¬ 
tional  structure  and  its  policy  style.  A  democracy 
with  a  unitary  institutional  system  has  fewer  access 
points  than  a  federal  democracy,  because  in  the 
former,  power  is  fairly  concentrated,  while  in  the 
latter,  it  is  distributed  among  a  larger  number  of 
independent  decision-making  bodies.  The  number 
of  access  points  in  the  United  States,  for  instance,  is 
particularly  high,  due  to  the  nation’s  federal  struc¬ 
ture,  the  division  of  government  between  the  presi¬ 
dent  and  Congress,  the  powerful  congressional 
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committees,  and  a  judicial  system  with  constitu¬ 
tional  powers  to  intervene  in  legislation.  It  is  also 
high  in  the  EU,  which,  like  the  United  States,  has 
an  institutional  system  that  separates  powers  both 
vertically  and  horizontally. 

The  varying  degree  of  accessibility  is  influenced 
by  the  country’s  prevailing  policy  style.  Every  lib¬ 
eral  democracy  has  its  own  particular  policy  style, 
which  is  perhaps  not  uniformly  discernable  in  all 
policy  areas  but  is  prevalent  in  many  of  them. 
Policy  style  is  the  set  of  standard  procedures  gener¬ 
ally  adopted  by  institutions  to  devise  and  imple¬ 
ment  public  policies,  according  to  Jeremy 
Richardson,  Gunnel  Gustafsson,  and  Grant  Jordan 
(1982).  The  access  of  groups  to  many  areas  of 
policy  is  easier  when  governments  adopt  a  reactive 
policy  style,  because  decision-making  powers  are 
not  concentrated,  and  public  bureaucracies  depend 
in  large  measure  on  the  technical  information  sup¬ 
plied  by  the  groups.  In  these  cases,  the  state  usually 
avoids  coercive  measures  that  may  discourage  the 
setting  up  of  new  groups  and  reduce  the  fragmenta¬ 
tion  of  existing  ones.  Public  policies  take  the  form 
of  ad  hoc  measures  uncoordinated  with  previous 
decisions  and  are  mostly  designed  to  satisfy  the 
short-term  demands  of  groups  with  ease  of  access 
to  public  bureaucracies,  as  noted  by  Michael 
Atkinson  and  William  Coleman  (1989).  The  access 
of  groups  is  more  selectively  filtered  when  central¬ 
ized  governments  and  independent  bureaucracies 
adopt  an  anticipatory  policy  style,  using  intrusive 
policy  methods,  without  excluding  coercion,  the 
aim  being  to  effect  significant  change  in  society  and 
the  economy. 

Governments  that  adopt  an  anticipatory  policy 
style  may  disregard  interest  groups  while  pursuing 
their  public  policy  goals  or  may  involve  only 
broadly  representative  organizations  capable  of 
ensuring  that  their  membership  adheres  to  agree¬ 
ments  stipulated  by  the  state.  The  result,  in  this 
case,  is  greater  selectivity  in  access,  which  benefits 
more  well-organized  groups. 

Nowhere  in  the  real  world  do  governments 
adopt  a  purely  reactive  or  anticipatory  style. 
Both  should  therefore  be  seen  as  ideal  types  that 
represent  the  opposite  extremes  of  a  spectrum  of 
possible  combinations,  including  the  various 
policy  styles  of  the  liberal  democracies,  some 
closer  to  the  anticipatory  model  and  others  to  the 
reactive  one. 


Lobbying 

The  main  objective  of  interest  groups  is  to  influ¬ 
ence  governments  in  order  to  obtain  favorable 
policy  outcomes.  However,  influence  is  a  continu¬ 
ous  and  unresolved  political  process,  which  by  its 
very  nature  is  unmeasurable.  Furthermore,  other 
factors  may  influence  legislators’  policy  decisions 
more  than  interest  groups — for  instance,  other 
legislators,  party  leaders,  the  personal  convictions 
of  the  legislator,  the  media,  and  so  on.  Faced  with 
the  difficulty  of  measuring  influence,  scholars 
have  tended  to  focus  on  the  phenomenon  of  lob¬ 
bying,  extending  the  original  meaning  of  the  term 
to  encompass  the  whole  set  of  strategies  and  tac¬ 
tics  used  by  interest  groups  to  influence  policy 
making. 

Lobbying  was  initially  understood  in  quite  spe¬ 
cific  terms  as  one  or  more  face-to-face  meetings 
between  representatives  of  an  interest  group  and 
legislators,  sought  by  the  former  so  as  to  influence 
the  decisions  of  the  latter  in  a  way  that  benefits  the 
group’s  preferences.  To  be  effective,  a  lobbyist — 
who  may  either  be  a  member  of  the  group  or  an 
expert  hired  specifically  for  the  purpose — must 
establish  a  relationship  of  trust  with  the  policy¬ 
maker,  supplying  data  and  information  that  facili¬ 
tate  the  adoption  of  policies  favorable  also  to  the 
interest  group.  In  its  broader  reformulation,  lobby¬ 
ing  involves  a  much  wider  range  of  initiatives, 
including  contacts  with  bureaucratic  bodies,  the 
premier’s  office,  the  courts  and  parliament,  the  use 
of  the  mass  media,  the  preparation  of  memoranda, 
the  forging  of  links  with  individual  functionaries, 
and  so  forth. 

Lobbying  has  been  extensively  studied  in  the 
United  States,  where  it  is  widespread.  This  is  due 
to  the  multiple  points  of  access  offered  by  the  insti¬ 
tutional  system,  the  weakness  and  relative  lack  of 
representativity  of  the  political  parties,  and  a 
dominant  political  culture  that  favors  pragmatic, 
nonideological  dealings  between  citizens  and  legis¬ 
lators.  American  literature  in  this  field  can  boast 
hundreds  of  case  studies  devoted  to  the  lobbying 
activities  of  pressure  groups.  However,  they  have 
made  little  contribution  to  furthering  an  under¬ 
standing  of  the  phenomenon  because  they  lack  a 
shared  theoretical  basis.  According  to  Frank 
Baumgartner  and  Beth  Leech  (1998),  scholars  tend 
to  use  their  own  method  of  measuring  the  impact 
of  lobbying  and  a  generally  narrow  analytic 
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perspective  without  adopting  a  comparative 
approach  or  making  any  attempt  to  build  on  previ¬ 
ous  work. 

The  results  of  the  large-scale  surveys  of  lobbying 
in  Washington,  D.C.,  and  the  state  capitals  in  the 
second  half  of  the  1980s  have  been  more  useful. 
These  studies,  which  are  more  theoretical,  refer  to 
a  large  number  of  cases  and  consider  the  context; 
they  have  shown  that  the  tactic  used  most  widely 
by  lobbyists  is  to  cultivate  face-to-face  contacts 
with  public  bureaucrats,  seeking,  at  the  same  time, 
to  extend  the  range  of  their  contacts  among  public 
decision  makers  and  those  capable  of  influencing 
them  in  order  to  exchange  information,  establish 
cooperative  strategies,  and  anticipate  the  moves  of 
competitors.  Furthermore,  the  gamut  of  strategies 
used  varies  according  to  the  issue  in  question  and 
whether  it  has  a  moderate  or  high  political  saliency 
or  level  of  conflict.  The  position  occupied  by  the 
group  in  the  political  process  also  seems  to  have 
considerable  bearing  on  the  choice  of  strategy. 
Groups  operating  within  the  political  process 
adopt  inside  strategies  that  involve  lobbying  legis¬ 
lators  and  bureaucrats,  taking  legal  action,  and 
supporting  certain  candidates  in  elections.  Marginal 
groups  use  outside  strategies  such  as  public  forms 
of  protest  and  organizing  meetings  to  promote 
their  cause.  The  media  are  resorted  to  in  equal 
measure  in  both  inside  and  outside  strategies. 

Lobbying  is  less  developed  in  Europe  than  it  is 
in  the  United  States,  because  the  conditions  for 
lobbying  are  less  favorable.  In  particular,  the 
prevalence  of  unitary  state  systems  reduces  the 
number  of  access  points  and  limits  the  legislative 
power  of  parliaments;  these  are  mainly  used  by 
representatives  of  interest  groups  as  a  channel  for 
drawing  the  attention  of  the  media  and  the  govern¬ 
ment  to  issues  that  concern  them.  Moreover, 
European  scholars  have  displayed  little  interest  in 
the  theme  of  lobbying  due  to  the  importance 
attributed  in  interest  group  literature  to  the  neo- 
corporatist  approach,  which  takes  it  for  granted 
that  there  is  continuing  collusion  between  policy¬ 
makers  and  the  representatives  of  economic  inter¬ 
ests  while  largely  ignoring  the  lobbying  activities 
of  noneconomic  groups.  Despite  these  limits,  the 
number  of  studies  of  lobbying  in  different  European 
nations  has  increased  in  the  past  decades.  In  par¬ 
ticular,  there  has  been  a  steady  growth  in  the 
amount  of  research  devoted  to  lobbying  in  the  EU, 


which  has  a  multilevel  institutional  structure  offer¬ 
ing  many  direct  points  of  access  for  individual 
groups  and  national  associations.  Both  invariably 
maintain  close  contacts  with  national  ministers 
present  in  Brussels,  who  often  act  as  intermediaries 
between  interest  groups  and  European  institutions. 

The  majority  of  interest  groups  employ  one  of 
the  many  lobbying  agencies  based  in  Brussels.  At 
the  beginning  of  the  1990s,  it  was  estimated  that 
more  than  10,000  individuals  were  employed  in 
such  agencies,  for  the  most  part  run  by  British  or 
American  consultants,  who  generally  had  more 
expertise  than  colleagues  from  elsewhere  in  Europe. 
The  result  of  the  lobbying  of  European  institutions 
is  the  preponderance  of  large  European  industrial 
groups  and  their  associations  in  the  technical  com¬ 
mittees  set  up  by  the  European  Commission  and 
the  Council  of  Ministers.  Public  interest  groups  are 
underrepresented  or  excluded.  However,  they  have 
greater  success  with  the  European  Parliament, 
which  is  generally  more  receptive  to  broader  inter¬ 
ests.  European  lobbying  seems  to  be  less  adver¬ 
sarial  and  more  inclined  to  favor  compromise 
policy  solutions,  while  in  the  United  States  there 
tends  to  be  just  one  winner,  often  the  main  interest 
group  that  has  financed  the  election  campaign  of 
the  decision  makers.  Furthermore,  in  the  multilevel 
system  of  the  EU,  lobbying  on  a  certain  issue  may 
continue  at  national,  regional,  and  local  levels 
while  the  adopted  policy  is  being  implemented. 

Critical  Reappraisals  in  the 
Study  of  Interest  Groups 

Policy  Networks  Approach 

Since  the  second  half  of  the  1980s,  pluralism 
and  neo-corporatism  have  come  under  fire  on  the 
grounds  that  they  offer  an  inaccurate  image  of  the 
key  characteristics  of  policy  making.  Critics  have 
noted  that  in  the  United  States,  the  prototype  of  the 
pluralist  political  system,  a  small  number  of  groups 
adopt  collaborative  and  collusive  behaviors  in  cer¬ 
tain  areas  of  policy  that  are  very  similar  to  those 
described  by  neo-corporatists.  Conversely,  in  polit¬ 
ical  systems  traditionally  associated  with  neo- 
corporatist  agreements,  where  one  would  expect 
cooperation  between  a  small  number  of  groups  to 
be  the  dominant  feature,  scholars  have  identified 
policy  areas  corresponding  to  the  pluralist  model  of 
a  large  number  of  competing  groups.  To  overcome 
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the  incongruent  features  of  pluralism  and  neo¬ 
corporatism,  European  scholars  developed  the 
policy  network  approach.  This  focuses  on  the  way 
groups  are  conditioned  by  certain  characteristics 
of  the  state  and  of  policy  making.  The  state  has 
turned  into  a  network  of  independent  administra¬ 
tive  apparatuses  that  forge  permanent  ties  with 
groups  that  are  recipients  of  their  decisions,  while 
policy  making  is  sectorialized  into  various  differ¬ 
ent  specialized  policy  areas.  As  a  consequence  of 
this  conditioning,  the  activities  of  interest  groups 
are  channeled  into  different  policy  networks,  each 
of  which  is  made  up  of  the  public  and  private 
actors  interested  in  formulating  and  implementing 
given  policies. 

The  task  of  research  is  to  examine  and  classify 
the  different  types  of  policy  networks  that  plural¬ 
ism  and  neo-corporatism,  according  to  the  policy 
network  approach,  erroneously  reduce  to  a  single 
typology  of  policy  making.  On  the  contrary,  policy 
making  is  sectorialized  into  different  areas  of  pol¬ 
icy  within  which  various  policy  networks  may  be 
active.  These  are  differentiated  depending  on  the 
number  of  actors,  their  stability,  the  degree  of 
rigidity  of  their  boundaries,  and  various  other 
dimensions.  Apart  from  the  differences,  the  differ¬ 
ent  types  of  policy  networks  also  have  some 
features  in  common:  The  actors  cultivate  nonhier- 
archical  relations  and  promote  interests  that  may 
at  times  be  conflictual  but  are  interdependent, 
because  there  is  a  mutually  beneficial  exchange  of 
information,  expertise,  and,  with  time,  trust. 

The  two  most  common  types  of  policy  net¬ 
works  are  policy  communities  and  issue  networks. 
The  former  comprise  a  limited  number  of  groups, 
are  stable  over  time,  have  quite  rigid  boundaries, 
and  are  characterized  by  a  consensual  decision¬ 
making  style  and  a  fairly  balanced  internal  distri¬ 
bution  of  power.  By  contrast,  issue  networks  are 
fairly  unstable,  have  uncertain  boundaries,  are 
potentially  conflictual  in  terms  of  interaction,  and 
have  an  unequal  distribution  of  power. 

The  limitations  of  the  policy  network  approach 
include  the  absence  of  shared  definitions,  the  limited 
operational  value  of  various  dimensions  used  to 
distinguish  the  different  types  of  network,  the  lack 
of  attention  devoted  to  conditions  that  might  lead 
to  the  transformation  of  a  network,  and  the  failure 
to  integrate  it  with  any  theory  of  the  distribution  of 
power.  As  regards  this  final  point,  however,  it 


should  be  stressed  that  the  policy  network  approach 
seems  compatible  with  pluralism,  not  only  because 
it  admits  the  existence  of  an  oligopolistic  political 
market  dominated  by  policy  communities  impene¬ 
trable  to  “unrecognized”  groups  but  also  because  it 
emphasizes  that  policy  networks  are  never  entirely 
closed,  that  no  one  interest  dominates  broad  areas 
of  policy,  and  that  policy  making  takes  place  within 
a  variety  of  policy  networks  characterized  by  close 
relations  between  different  interests  and  different 
government  offices. 

The  theory  of  policy  networks  has  the  advantage 
of  being  a  more  precise  analytic  tool  than  the  one 
proposed  by  the  pluralists  and  neo-corporatists. 
What’s  more,  it  bridges  the  gap  between  studies  on 
interest  groups  and  those  on  public  policies,  shar¬ 
ing  with  the  latter  an  interest  in  the  characteristics 
of  the  social  and  temporal  context  in  which  groups 
operate.  The  attention  to  context  has  enabled  the 
policy  network  approach  to  distinguish  between 
issues  that  provide  incentives  for  participation  and 
those  that  mobilize  a  small  number  of  actors.  This 
in  turn  has  made  it  possible  to  formulate  some  use¬ 
ful  hypotheses  about  the  differing  propensities  of 
groups  to  participate  and  the  different  levels  of 
conflict  within  networks. 

Historical  Institutionalism 

Historical  institutionalism  overturns  the  vision 
of  the  relations  between  interest  groups  and  institu¬ 
tions  delineated  by  pluralism  and  neo-pluralists 
who  would  like  to  see  the  state  reinforced  so  as  to 
avoid  policy  making  being  dominated  by  interest 
groups.  Institutionalists  hold  that  the  margins  for 
independent  initiative  on  the  part  of  interest  groups 
are  actually  more  limited  than  neo-pluralists 
believe,  because  lobbying  must  necessarily  take 
place  within  existing  political  institutions.  The  pos¬ 
sibility  for  groups  to  succeed  in  influencing  public 
decision  makers  depends  on  the  structure  and  func¬ 
tional  logic  of  the  institutions  that  offer  highly 
conditioning  restrictions  and  opportunities.  If  sig¬ 
nificant  changes  take  place  in  institutions,  interest 
groups  are  obliged  to  adapt  to  the  new  margins  of 
initiative  offered  by  the  revamped  institutions. 

An  example  of  the  institutional  conditioning  of 
interest  group  activities  is  provided  by  Ellen 
Immergut  in  her  comparative  analysis  of  health 
reform  policies  adopted  in  three  European  nations. 
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Immergut  points  out  that  doctors’  associations  in 
France,  Switzerland,  and  Sweden  tried  to  prevent 
the  expansion  of  state  control  over  the  medical 
profession  and  the  establishment  of  a  public  health 
service.  The  numerous  veto  points  posed  by  institu¬ 
tions  in  Switzerland  and  in  France  during  the 
Fourth  Republic  enabled  minority,  but  well- 
organized,  groups  to  block  legislative  initiatives  in 
favor  of  a  public  service.  By  contrast,  the  institu¬ 
tional  situation  in  Sweden  and  during  the  Fifth 
Republic  in  France  favored  the  formation  of  gov¬ 
ernments  that  were  unconditioned  by  medical  asso¬ 
ciations  and  their  allies  and  that  were  thus  able  to 
introduce  health  reforms.  Despite  the  novelty  of  its 
proposed  analytic  criteria,  historic  institutionalism 
is  not  an  alternative  approach  to  pluralism  and  its 
subsequent  reworkings,  because  the  relatively  low 
number  of  research  studies  carried  out  do  not  pro¬ 
vide  adequate  grounds  for  empirically  justifying  its 
explicative  assumptions.  Moreover,  these  do  not 
appear  to  be  entirely  incompatible  with  the  basic 
ideas  of  pluralism.  Rather,  they  seem  to  be  useful 
tools  that  can  be  used  in  the  crowded  field  of  neo- 
pluralist  studies. 

Interest  Groups  and  Democracy 

Can  interest  groups  improve  the  quality  of  democ¬ 
racy  and  enhance  the  development  of  citizens’ 
representation  and  good  government?  Groups 
promote  democracy  if  they  help  citizens  win  better 
opportunities  for  representation.  And  they  con¬ 
tribute  to  good  government  if  they  give  citizens  the 
possibility  of  preventing  or  punishing  abuses  of 
power  by  making  governments  accountable  for 
their  actions  toward  the  political  community. 

Group  Representation  in  Liberal  Democracies 

The  interest  group  system  favors  political  equal¬ 
ity  if  it  promotes  wider  representation,  so  as  to 
prevent  governments  equipped  with  partial  infor¬ 
mation  about  citizens’  preferences  from  favoring 
the  expectations  of  a  minority.  However,  research 
conducted  in  the  United  States  during  the  20th  cen¬ 
tury  and  the  early  21st  century  demonstrates  the 
existence  of  a  permanent  imbalance  in  representa¬ 
tion,  because  business  groups  and  professional 
associations  find  it  easier  to  organize  themselves 
than  cause  groups.  They  can  act  more  rapidly  when 


they  feel  threatened,  and  they  represent  people 
with  a  high  social  status,  income,  and  education. 
They  are  also  dominant  in  areas  of  policy  that 
have  particular  strategic  significance  for  the  whole 
political  community. 

In  Europe,  a  permanent  representational  imbal¬ 
ance  has  been  detected  both  within  the  EU  and  in 
individual  member  states.  Large  economic  groups 
occupy  a  preponderant  position  within  European 
institutions,  even  though  the  number  of  organiza¬ 
tions  present  in  Brussels  grew  continually  in  the 
past  2  decades  of  the  20th  century.  In  member 
states,  permanent  representational  imbalances 
were  generated  by  neo-corporatist  experiments 
that  benefited  business  groups  and  unions  and 
marginalized  less  organized  social  categories  from 
policy  making.  Permanent  imbalances  have  also 
been  detected  in  policy  communities  that  are 
dominated  by  economic  interest  groups  and  are 
inaccessible  to  citizens. 

Therefore,  there  is  a  good  deal  of  converging 
empirical  evidence  suggesting  that  interest  groups 
have  a  limited  ability  to  favor  balanced  representa¬ 
tion  in  the  decision-making  process.  However, 
many  scholars  take  a  less  negative  view  of  the 
dynamics  of  representation,  stressing  that  the  char¬ 
acteristics  of  policy  making  vary  in  relation  to 
time,  area  of  policy,  and  the  issue  at  stake.  It  may 
be  that  the  relative  degree  of  representational 
imbalance  is  more  significant  in  a  certain  period  in 
a  specific  area  of  policy  and  less  so  in  the  same 
area  at  a  later  time.  Furthermore,  certain  decision¬ 
making  processes  prompt  the  mobilization  of  just 
one  of  the  potential  parties  involved,  while  others 
generate  disputes  in  which  hundreds  of  interest 
groups  take  part.  The  variations  in  the  number  of 
actors  participating  in  the  decision-making  process 
suggest  that  the  imbalance  is  probably  less  uniform 
and  rigid  than  it  has  frequently  been  made  out  to 
be  and  may  be  susceptible  to  attenuation. 

Pluralists  who  have  advanced  such  arguments 
support  their  case  by  pointing  to  the  surge  in 
political  participation  that  took  place  during  the 
collective  movements  that  sprang  up  in  the  1960s 
and  1970s.  The  increase  in  the  number  of  groups 
also  contributed  to  “opening  up”  policy  communi¬ 
ties,  which  is  why  the  notion  of  issue  networks  was 
developed  in  the  United  States  to  emphasize  that 
the  number  of  actors  participating  in  decision¬ 
making  processes  had  grown.  More  in  general,  the 
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crowding  of  the  policy-making  scene  has  also 
made  the  outcome  of  decisions  more  uncertain  for 
business  groups.  Their  omnipresence  in  the  various 
decision-making  arenas  is  not  necessarily  proof  of 
their  strength  but  may  be  the  result  of  a  defensive 
reaction  to  the  wider  range  of  public  regulative 
interventions  regarding  issues  directly  affecting 
their  interests.  Moreover,  the  business  community 
is  also  split  by  internal  divisions  that  make  prob¬ 
lematic  the  formation  of  unified  coalitions  for 
promoting  common  lobbying  strategies. 

None  of  the  criticisms  leveled  at  the  thesis  of 
representational  imbalance  denies  that  it  exists. 
Instead,  they  stress  that  this  imbalance  is  an 
unavoidable  result  of  group  competition,  and  it 
may  be  mitigated  when  competition  becomes 
greater.  The  debate  on  representational  imbalance 
suggests  that  the  system  of  interest  groups  is  rela¬ 
tively  open  because  it  enables  new  actors  to  enter 
policy  arenas,  especially  in  periods  of  increased 
competition  and  allows  many  disparate  interests  to 
get  a  hearing  from  governments.  Nevertheless,  the 
group  system  is  characterized  by  permanent  imbal¬ 
ances  that  penalize  citizens  and  the  less  organized 
associations.  In  this  respect,  they  mirror  existing 
inequalities  in  society. 

Groups  and  Democratic  Accountability 

Various  factors  contribute  to  defining  citizens’ 
political  judgment  of  governments.  One  of  these  is 
the  degree  of  accountability  of  government  action. 
In  the  liberal  democracies,  voters  punish  abuses  of 
power  by  voting  governments  out  of  office  or 
reelect  them  if  they  consider  that  the  government 
has  respected  the  law  and  acted  in  their  interests. 
To  monitor  the  actions  of  decision  makers  in  the 
interval  between  elections,  citizens  rely  on  organi¬ 
zations,  which  are  generally  better  equipped  to 
ascertain  when  the  abuse  of  power  gets  the  better 
of  responsibility. 

The  public  organizations  that  monitor  govern¬ 
ment  actions  are  parliament,  the  judiciary,  and 
various  bureaucratic  agencies.  The  most  important 
private  ones  are  the  political  parties,  the  media,  and 
interest  groups.  The  latter  have  an  interest  in  pre¬ 
venting  the  abuse  of  power  so  as  to  ensure  fair 
competition  between  all  the  players  in  the  field  and 
to  avoid  some  groups  obtaining  an  unfair  advantage 
in  the  decision-making  process.  Moreover,  groups 


are  forced  to  follow  the  governmental  process  very 
closely  because  the  final  decisions  on  the  different 
issues  are  highly  unpredictable.  This  explains  why 
researchers  have  discovered  that  a  large  part  of 
group  lobbying  is  devoted  to  the  gathering  of  infor¬ 
mation  in  the  face  of  uncertainty  about  the  inten¬ 
tions  of  political  elites  and  other  players  involved  in 
the  preparation  of  ongoing  legislation.  The  need  to 
achieve  fair  competition  and  to  gather  information 
for  reducing  their  uncertainty  about  legislators’ 
preferences  makes  interest  groups  a  sort  of  watch¬ 
dog  of  public  officials.  In  this  respect,  there  is  more 
than  a  point  of  convergence  among  the  groups’ 
special  interests  and  those  of  ordinary  citizens  for  an 
effective  control  of  decision  makers. 

But  conversely,  insofar  as  they  represent  par¬ 
ticular  interests,  groups  have  also  the  temptation 
to  cultivate  confidential  relations  with  govern¬ 
ments,  and  this  sometimes  leads  to  collusion. 
When  this  happens,  the  correct  functioning  of  the 
democratic  process  is  compromised,  because  it 
induces  decision  makers  to  identify  their  public 
mission  with  the  taking  of  decisions  favorable  to 
groups  with  which  they  are  in  collusion.  The  risk 
can  be  reduced  when  both  the  groups  and  govern¬ 
ments  operate  within  a  system  that  has  effective 
cross-checks.  This  works  well  in  liberal  democra¬ 
cies  if  there  are  strong  political  parties;  a  parlia¬ 
ment  that  exercises  its  powers  of  control  over  the 
executive;  an  independent  judicial  system;  an  effi¬ 
cient,  autonomous  bureaucracy;  and  civil  society 
and  media  that  are  autonomous  from  the  state. 
What  emerges  from  the  above  is  that  interest 
group  competition  enhances  the  development  of 
open  but  permanently  unbalanced  political  repre¬ 
sentation  and  that  the  contribution  of  interest 
groups  to  the  control  of  political  elites  can  be  effec¬ 
tive  if  carried  out  in  the  context  of  democratic 
regimes  that  exhibit  high  qualitative  standards. 

As  a  closing  remark,  it  should  be  remembered 
that  there  are  some  findings  that  represent  a  largely 
uncontroversial  view  of  interest  groups’  contribu¬ 
tion  to  democracy.  First  of  all,  these  judgments 
emphasize  that  groups  offer  significant  opportuni¬ 
ties  for  communicating  with  governments,  enabling 
citizens  to  influence  public  decisions  more  effec¬ 
tively  than  if  they  acted  individually  and  limited 
themselves  to  exercising  their  right  to  vote.  In  addi¬ 
tion,  the  variety,  number,  and  presence  of  groups 
at  all  levels  of  the  government  process  guarantees 
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the  vitality  and  capacity  for  renewal  of  liberal 
democracies.  Groups  contribute  by  helping  pro¬ 
duce  better  citizens  and  motivating  them  to  par¬ 
ticipate  in  elections  and  engage  with  issues  of 
public  interest.  Moreover,  the  great  diversity  of 
views  expressed  by  groups  give  governments  an 
accurate  picture  of  the  worries  and  concerns  of 
civil  society,  permitting  minorities  to  voice  certain 
of  their  key  needs  on  specific  issues  that  distracted 
majorities  resulting  from  elections  might  otherwise 
neglect  to  represent.  Finally,  it  should  not  be  for¬ 
gotten  that,  in  collaboration  with  other  political 
actors — public  and  private — groups  can  help  cre¬ 
ate  rights  and  obligations  protected  by  the  law  and 
often  provide  expertise  that  contributes  to  improv¬ 
ing  the  quality  of  policy  making. 

Liborio  Mattina 
University  of  Trieste 
Trieste,  Italy 
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Intergovernmentalism 


Intergovernmentalism  seeks  to  understand  the 
reasons  why  states  join  together  and  to  provide  a 
realistic  approach  to  the  analysis  of  regional  inte¬ 
gration  mechanisms.  It  tries  to  overcome  the 
contradiction  between  Hobbes’s  vision  of  a  com¬ 
petition  among  sovereign  states  and  the  modern 
process  of  regional  and  global  integration.  This 
entry  explores  how  this  concept  emerged  in 
connection  with  European  integration  and  has 
been  expanded  to  analyze  forms  of  political  and 
economic  cooperation  more  generally. 

Intergovernmentalism  first  found  expression  in 
the  1960s  in  the  work  of  Stanley  Hoffmann,  who 
took  a  realistic  view  of  the  construction  of  Europe. 
Hoffmann,  who  taught  at  Harvard  and  Sciences 
Po  in  Paris,  attempted  to  explain  why  states  as 
nationalistic  as  France  could  bring  themselves  to 
give  up  certain  sovereign  powers  in  order  to  pro¬ 
mote  the  common  market,  while  at  the  same  time 
opposing  the  strategies  of  the  European  Commission 
(EC).  European  integration  cannot  be  explained 
simply  in  terms  of  the  neo-functionalist  spillover 
effect,  according  to  which  successful  integration 
opens  the  way  for  new  cooperation.  Although  it  is 
true  that  Europe  developed  from  an  initially  com¬ 
mon,  then  enlarged  market,  the  single  market  was 
only  created  when  the  political  decision  to  set  up  a 
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single  currency  was  made.  Intergovernmentalism 
thus  represented  a  criticism  of  both  functionalist 
methods  and  institutionalist  approaches.  The  for¬ 
mer  give  priority  to  the  process  of  integration, 
while  the  latter  stress  the  significant  role  of  institu¬ 
tions  in  governing  interstate  relations. 

First,  as  regards  the  functionalist  theories,  it  is 
clear  that,  as  noted  by  Hoffmann,  the  interests  of 
political  elites  do  not  necessarily  coincide  with 
those  of  the  EC  bureaucracy  and  that  national 
elites  may  use  their  veto  to  block  any  transfer  of 
power  affecting  their  sovereignty;  integration  can¬ 
not  be  conceived  of  as  a  functional  process  that 
would  be  spontaneously  built  up,  as  postulated  by 
functionalism.  The  Empty  Chair  Crisis,  in  which 
President  Charles  de  Gaulle  boycotted  meetings  of 
the  Council  of  the  EC  in  opposition  to  the  EC’s 
attempt  to  reduce  the  scope  of  application  of  the 
veto  and  increase  the  EC’s  own  resources,  and 
subsequently  the  Luxembourg  Compromise  that 
ended  the  crisis  in  1966,  deciding  in  favor  of 
France  and  maintaining  unanimity  rule,  symbol¬ 
ized  this  resilience  of  the  state.  Moreover,  again  in 
opposition  to  functionalism,  not  all  sectors  are 
integrated  at  the  same  rate.  Some  key  matters 
remain  within  the  exclusive  domain  of  intergov¬ 
ernmental  cooperation  (the  second  pillar  of  the 
Public  Environmental  Center  for  Sustainable 
Development  [PECSDJ,  one  of  the  constitutive  pil¬ 
lars  of  the  Union — as  they  were  defined  by  the 
Treaty  of  Maastricht,  which  defied  the  European 
“architecture”  in  1992 — devoted  here  to  defense 
and  security),  while  others  are  governed  by  the 
simultaneous  imperatives  of  cooperation  and  inte¬ 
gration,  as  with  the  fourth  pillar  (economy  and 
finance)  in  which  the  European  Central  Bank  (in 
charge  of  integration)  coexists  with  the  Economic 
and  Financial  Affairs  Council  (Ecofin  Council;  in 
charge  of  cooperation). 

As  regards  institutionalist  approaches,  intergov¬ 
ernmentalism  suggests  that  international  institu¬ 
tions  favor  negotiation  by  reducing  transaction 
costs  but  that  it  is  more  effective  in  the  technical 
fields  of  “low  politics.”  In  “high-politics”  areas  of 
sovereignty,  including  the  most  prestigious  pre¬ 
rogatives  of  a  sovereign  state,  states  endeavor  to 
protect  their  powers  and  thus  delegate  them  with 
the  greatest  parsimony.  There  is,  therefore,  a  major 
contradiction  between  technical  matters  that  could 
be  fully  integrated  and  the  more  political  matters 


in  which  cooperation  requires  unanimous  agree¬ 
ment  and  recourse  to  the  veto. 

Intergovernmentalism  therefore  proposes  an 
approach  to  regional  integration  through  which 
states  accept  the  principle  of  cooperation  depend¬ 
ing  on  which  common  interests  are  at  stake.  As 
Morten  Kelstrup  has  shown,  the  realistic  approach 
becomes  compatible  with  regional  integration  that 
allows  the  sharing  of  common  gains,  and  thus  the 
external  relationships  of  the  states  involved  are 
strengthened.  Not  only  do  states  have  an  interest 
in  associating,  they  also  remain  the  chief  players  in 
the  cooperation  or  integration  policy  for  this  pur¬ 
pose.  The  successes  of  the  European  Union  (EU) 
could  thus  be  explained  by  the  proper  functioning 
of  states  capable  of  fostering  the  opening  of  civil 
society  and  waiving  certain  attributes  of  their  sov¬ 
ereignty.  Intergovernmentalism  is  therefore  more 
directly  concerned  with  the  decision  processes  used 
by  states  in  defining  and  then  facing  their  interests 
before  defining  the  area  of  agreement  opened  up 
by  the  relative  powers  of  the  parties. 

Hence,  the  integration  process  must  be  based 
first  on  the  awareness  that  neighboring  states  have 
converging  interests,  a  fact  that  can  be  attributed 
to  the  accelerated  maturity  of  an  international 
anarchy  in  which  sovereign  states  are  in  a  free 
competition.  In  such  conditions,  interstate  coop¬ 
eration  is  envisaged  as  a  rational  solution  to  opti¬ 
mize  the  use  of  means  pooled  by  the  member 
states.  This  is  why  the  veto  was  progressively 
dropped  in  favor  of  a  decision-making  mechanism 
via  a  qualified  majority.  Sovereignty  did  not  disap¬ 
pear  but  instead  became  pooled  sovereignty.  The 
European  Economic  Community  was  thus  envis¬ 
aged  as  a  multiplier  of  power  for  weakened  states. 
The  international  organizations  institutionalized 
the  technique  of  interstate  bargaining.  Their  deci¬ 
sions  became  the  lowest  common  denominator  on 
which  states  could  agree.  In  this  ongoing  negotia¬ 
tion,  the  great  powers  took  precedence,  but  all 
states  were  strengthened  by  their  participation  in 
the  organization.  Contrary  to  the  functionalist  or 
federalist  theories,  which  postulate  a  metasover¬ 
eign  integration,  intergovernmentalism  thus  sug¬ 
gested  that  the  transfer  of  power  strengthened 
states  internationally  and  bolstered  the  political 
elites  with  respect  to  their  domestic  public  opinion. 

This  initial  approach  was  completed  in  the 
1990s  by  Andrew  Moravcsik,  who  developed  the 
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idea  of  liberal  intergovernmentalism.  Like  Hoffmann, 
Moravcsik  considered  that  the  decisions  of  interna¬ 
tional  organizations  resulted  from  bargaining 
between  rational  states.  The  interest  of  this  approach 
is  its  liberal  aspect,  which  is  revealed  by  national 
choices.  States  always  seek  to  impose  their  national 
interests,  but  these  are  no  longer  envisaged  purely 
through  the  reductionist  view  of  general  interest  as 
embodied  in  a  neutral  power.  On  the  contrary,  the 
national  preferences  expressed  by  government  mem¬ 
bers  in  regional  organizations  for  cooperation  or 
integration  are  the  result  of  the  demands  of  social 
players  who  have  succeeded  in  imposing  their  opin¬ 
ions  in  national  administrative  and  political  circles. 
Liberal  intergovernmentalism  thus  appears  to  be  a 
method  for  combining  Robert  Putnam’s  internal 
and  external  levels  and  doubly  strengthening  gov¬ 
ernmental  powers:  The  external  constraints  give  the 
executive  a  means  whereby  to  influence  domestic 
public  opinion,  and  the  internal  constraints  are  used 
to  put  pressure  on  the  external  partners  that  prefer 
to  reach  an  agreement  rather  than  face  failure.  The 
negotiation  that  could  be  complicated  by  this  two- 
level  game  is  in  fact  simplified  by  transparent  insti¬ 
tutional  mechanisms  that  reduce  the  transaction 
costs,  fostering  trust  by  guaranteeing  mutual  respect, 
and  turn  out  to  be  effective  aids  to  decision  making. 
Based  on  Nash’s  equilibrium,  liberal  intergovern¬ 
mentalism  explains  why  and  how  states  manage  to 
come  to  an  agreement  by  reducing  their  demands 
out  of  fear  of  marginalization  or  fear  that  they  will 
be  held  liable  for  failure.  The  result  is  that  the  state, 
and  also  the  executive,  is  strengthened  by  participa¬ 
tion  in  regional  organizations  of  which  it  controls 
the  agenda,  the  procedures,  the  information,  and  the 
instruments  of  legitimation. 

Intergovernmentalism  is  not,  however,  the  only 
explanation  for  the  construction  of  Europe.  First,  it 
would  be  an  exaggeration  to  suppose  that  the  inte¬ 
gration  of  the  Old  World  was  the  sole  initiative  of 
the  hard  core  of  the  EU,  constituted  by  France  and 
Germany.  Although  these  countries  played  a  major 
role,  that  should  not  obscure  the  fact  that  horizon¬ 
tal  agreements  were  concluded  between  the  smaller 
states  to  oppose  this  hegemony.  Moreover,  the 
United  Kingdom  balanced  things  out  by  backing 
agreements  when  it  suited  its  interests.  Moreover, 
while  intergovernmentalism  provides  a  good  expla¬ 
nation  for  the  role  of  states  in  intergovernmental 
conferences  preparing  the  founding  treaties  of  the 


EU,  it  should  be  noted  that  the  Commission  has  the 
power  of  initiative  and  that,  since  the  Single  Act, 
which  was  concluded  in  1986  to  amend  the  Rome 
Treaty  and  deepen  European  integration,  it  is  the 
Commission  that  has  fixed  the  European  agenda. 
The  loss  of  influence  of  the  Council  of  General 
Affairs  and  External  Relations,  which  for  many 
years  was  the  political  hub  of  Europe,  and  the  rise 
in  importance  of  the  technical  councils  symbolize 
the  rise  of  the  experts  who  handle  EU  business  day 
to  day  and  apply  the  treaties.  Nor  is  it  enough  to 
reduce  Europe  to  the  two-level  game  since,  on  the 
contrary,  it  demonstrates  multilevel  governance,  by 
which  many  nonstate  players  are  freed  from  state 
interference  and  develop  multiple  transnational 
cooperations  that  weaken  the  states.  Finally,  the 
overly  positivist  definition  of  national  preferences 
taken  into  account  by  intergovernmentalism  is  also 
criticized  by  constructivist  approaches. 

The  theory  of  intergovernmentalism  also 
includes  the  realist  school,  which  has  markedly 
changed  under  the  influence  of  these  multiple 
instances  of  international  cooperation.  As  a  result, 
realism  has  been  modified  by  four  elements.  First, 
the  interstate  relationship  was  revisited  and 
appeared  as  more  complex  than  the  traditional 
jungle,  now  seen  as  a  mixture  of  competition, 
cooperation,  and  friendship.  Second,  international 
anarchy  has  become  the  raison  d’etre  of  coopera¬ 
tion,  since  states  seek  to  join  forces  in  a  threatening 
environment.  Third,  the  ongoing  bargaining 
between  states  provides  a  satisfactory  answer  to 
Joseph  Grieco’s  question  as  to  whether  states 
refuse  to  conclude  agreements  from  fear  that  their 
neighbor  might  obtain  higher  relative  gains.  Finally, 
intergovernmentalism  has  become  part  of  a  realism 
that  has  abandoned  the  former  balance  of  power  in 
favor  of  a  balance  of  threat  or  balance  of  interests. 
This  neoclassical  realism  is  thus  resolutely  optimis¬ 
tic  since,  according  to  Charles  Glazer,  cooperation 
has  become  the  instrument  of  self-help. 

Jean- Jacques  Roche 
Universite  Pantheon-Assas 
Paris,  France 
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International  law  is  a  system  of  law  that  is  pre¬ 
dominantly  created  by,  and  designed  to  govern  the 
actions  of,  states.  Public  international  law,  the  sub¬ 
ject  of  this  entry,  can  be  distinguished  from  private 
international  law,  which  addresses  aspects  of  pri¬ 
vate  legal  relationships  that  span  national  borders. 
International  law  is  a  horizontal  system  of  law  in 
which  states,  at  least  in  legal  theory,  enjoy  sover¬ 
eign  equality.  Lacking  a  legislative  body  or  a  law 
enforcement  agency,  international  law  functions 
very  differently  than  a  system  of  law  in  a  modern 
liberal  democracy.  International  law  has  grown 
enormously  in  scope  and  complexity  since  World 
War  II.  The  number  of  states  has  expanded  through 
decolonization  and  disintegration,  international 


law  has  come  to  address  a  far  wider  range  of  sub¬ 
jects  than  ever  before,  and  there  has  been  a  con¬ 
siderable  increase  in  the  number  of  international 
courts  and  tribunals.  This  entry  first  looks  at  how 
international  law  is  created  and  enforced,  and 
reviews  some  of  the  most  significant  recent  devel¬ 
opments  in  specific  fields  of  international  law, 
before  considering  competing  explanations  of  the 
impact  of  international  law  on  the  behavior  of 
states  and  the  attitude  toward  international  law  of 
developing  countries,  the  United  States,  Europe, 
and  China. 

Some  scholars  consider  that  the  predictable  pat¬ 
terns  discernible  in  the  interstate  relations  of  the 
ancient  world,  including  the  Greek  city-states  and 
Mesopotamia,  could  be  regarded  as  forerunners  to 
the  current  system  of  international  law,  and  sys¬ 
tems  of  international  law  are  also  said  to  have 
developed  in  precolonial  Africa  and  South  Asia. 
The  modern  system  of  international  law  is,  how¬ 
ever,  usually  dated  from  the  Peace  of  Westphalia  of 
1648,  and  the  term  international  law  did  not  come 
into  use  until  the  late  18th  century.  By  the  20th 
century,  international  law  had  evolved  into  a  global 
system  of  law.  There  are  contrasting  views  as  to  the 
political  significance  of  the  contemporary  system. 
Since  the  emergence  of  international  relations  as  a 
field  of  study  in  the  early  20th  century,  the  domi¬ 
nant  realist  paradigm  has  downplayed  the  impact 
of  international  law  on  the  course  of  world  events. 
Liberalism  assumes  an  important  role  for  interna¬ 
tional  law  as  an  agent  of  world  peace,  but  World 
War  II  stymied  hopes  that  international  law  could 
triumph  over  military  might,  especially  where 
accompanied  by  a  political  determination  to  use 
violence.  As  international  law  continues  to  grow  in 
sophistication,  so  we  are  seeing  more  nuanced  the¬ 
oretical  approaches  to  explicating  its  impact. 

Sources  of  International  Law 

There  are  at  least  three  ways  in  which  the  term 
source  is  used  in  relation  to  international  law. 

Legal  Positivism  and  the  Source 
of  Legal  Obligation 

States  have  a  legal  obligation  to  comply  with 
international  law.  The  theoretical  source  of  this 
obligation  is  state  consent.  Legal  positivism,  the 
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philosophical  foundation  of  international  law 
since  the  19th  century,  assumes  that  law  is  made 
through  human  agency  as  opposed  to  deriving 
from  the  metaphysical  or  religious  realms.  States 
are  bound  by  international  law  because  they  have 
agreed  to  be  bound  by  that  law.  Although  legal 
positivism  underpins  the  “real  world”  of  interna¬ 
tional  law,  it  has  plenty  of  critics  in  the  academy, 
who  regard  it  as  inadequate  for  the  contemporary 
world.  Particularly  in  the  1990s,  much  of  the 
scholarship  critical  of  legal  positivism  and  the 
system  of  international  law  founded  on  legal  posi¬ 
tivism  went  under  the  banners  of  “critical  interna¬ 
tional  legal  studies”  and  “new  approaches  to 
international  law.”  This  scholarship  emphasized 
the  indeterminacy  of  legal  argument  and  found 
fault  with  legal  positivism  for  not  taking  sufficient 
account  of  the  role  of  nonstate  actors,  the  concerns 
of  women,  those  in  postcolonial  societies,  and 
indigenous  peoples.  Heightened  awareness  of  the 
masculinist  bias  of  international  law  has  given  rise 
to  changes  in  specific  fields  of  international  law, 
including  refugee  law  and  international  humani¬ 
tarian  law.  Liberal  scholarship  in  international  law 
emphasizes  the  plurality  of  actors  in  the  interna¬ 
tional  sphere,  including  transnational  networks  of 
judges  and  government  officials. 

Formal  Sources 

The  formal  sources  specify  the  ways  in  which 
international  law  is  made.  Article  38  of  the  Statute 
of  the  International  Court  of  Justice,  which  sets 
out  the  sources  of  international  law  that  the  Court 
is  to  apply,  is  usually  understood  more  generally  as 
a  statement  of  the  formal  sources  of  international 
law.  Although  the  statute  specifies  no  hierarchy, 
the  order  in  which  the  three  sources  are  listed  pro¬ 
vides  a  guide  as  to  the  relative  practical  importance 
of  each  source.  Listed  first  are  conventions,  or 
treaties,  which  are  legally  binding  agreements 
between  states  and,  to  a  lesser  extent,  between 
states  and  intergovernmental  organizations. 
Second  is  customary  international  law.  States  cre¬ 
ate  custom  through  their  actions,  where  that  prac¬ 
tice  is  accompanied  by  opinio  juris ,  a  belief  that 
international  law  requires  the  state  to  behave  in 
that  way.  The  third  of  the  formal  sources  is  general 
principles  of  international  law.  The  formal  sources 
of  international  law  permit  us  to  distinguish 


between  rules,  principles,  and  concepts  that  are, 
strictly  speaking,  part  of  international  law  and 
those  that  are  not.  Article  38  also  specifies  two 
subsidiary  means  for  determining  what  interna¬ 
tional  law  permits  or  does  not  permit  on  a  certain 
subject.  These  are  the  writings  of  distinguished 
international  lawyers  and  judgments  of  interna¬ 
tional  courts  and  tribunals. 

Material  Sources 

If  the  formal  sources  elucidate  the  ways  in 
which  international  law  can  be  made,  the  material 
sources  are  the  specific  documents  in  which  a  par¬ 
ticular  point  of  law  is  set  out — the  Charter  of  the 
United  Nations  (UN)  or  a  resolution  of  the  General 
Assembly,  for  example.  To  state  that  a  text  consti¬ 
tutes  a  material  source  of  international  law  is  not 
to  assume  that  it  has  legal  status  as  a  formal  source 
of  international  law:  The  material  source  may  have 
set  out  a  rule  that  then  becomes  binding  through 
another  process. 

Change  Within  International  Law 

The  doctrine  of  sources  suggests  the  processes 
by  which  international  law  evolves.  There  is  no 
single  formal  source  of  international  law  that 
serves  as  the  motor  of  change.  International  cus¬ 
tomary  law  is  often  codified  into  treaties.  The 
Third  UN  Convention  on  the  Law  of  the  Sea,  for 
example,  put  into  written  form  international  law 
that  had  already  emerged  in  custom.  Usually  in 
such  cases,  however,  the  treaty  not  only  codifies 
custom  but  also  represents  the  “progressive  devel¬ 
opment”  of  international  law;  through  this  pro¬ 
cess,  international  law  is  updated.  Two  general 
principles  of  law  are  relevant  here:  Where  the 
terms  of  two  material  sources  of  international  law 
appear  contradictory,  the  more  recent  prevails, 
and  where  apparently  contradicting  law  can  be 
found  both  in  specific  and  broadly  applicable 
instruments,  the  more  specific  overrides  the  more 
general. 

Compliance  and  Enforcement 
of  International  Law 

Compliance  with  international  law  is  aided  by  the 
fact  that,  at  least  in  theory,  states  choose  the  law 
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to  which  they  wish  to  be  bound.  States  are  not 
bound  by  a  rule  of  customary  law  if  they  have  been 
persistently  objecting  to  the  emergent  rule, 
although,  in  practice,  the  crystallization  of  a  rule 
may  be  clear  only  in  retrospect.  States  are  bound 
only  by  those  treaties  to  which  they  agree  to  be 
bound.  Participants  in  treaty  negotiations  have  an 
opportunity  to  help  shape  the  terms  of  the  agree¬ 
ment,  although  in  large-scale  negotiations  the 
impact  of  any  one  delegation  may  not  be  as  sig¬ 
nificant  as  the  theory  of  state  consent  might  sug¬ 
gest.  Compliance  with  international  law  is  also 
enhanced  by  the  fact  that  multilateral  treaties  com¬ 
monly  permit  reservations,  which  allow  states  to 
exclude  from  their  undertakings  any  provisions 
with  which  they  disagree.  Even  those  states  that 
have  signed  a  treaty  can  choose  not  to  proceed  to 
ratify  and  hence  not  be  bound  by  the  final  text  of 
the  treaty.  The  fact  that  international  law  is  a 
horizontal  system  means  that  one  of  the  most 
important  factors  promoting  compliance  is  reci¬ 
procity:  If  one  party  to  a  conflict  treats  the  prison¬ 
ers  of  war  of  the  other  combatant  according  to 
international  humanitarian  law,  it  is  much  more 
likely  that  its  own  nationals  will  in  turn  be  treated 
appropriately. 

There  are  two  alternative  approaches  to  maxi¬ 
mizing  compliance  with  international  law,  and 
each  has  its  supporters  within  the  academy.  One 
approach  takes  as  its  starting  point  the  assumption 
that  states  have  a  propensity  to  comply  and  gener¬ 
ally  only  fail  to  do  so  when  there  are  financial  or 
bureaucratic  impediments.  This,  the  managerial 
approach,  is  associated  with  the  writing  of  Abram 
Chayes  and  Antonia  Handler  Chayes  (1995)  and 
emphasizes  the  continuity  of  policy  choices,  to 
which  changes  are  made  only  incrementally  when 
perceived  as  necessary.  The  second  approach  is 
that  of  enforcement,  by  which  it  is  assumed  that 
states  will  only  comply  where  they  see  a  direct 
benefit  from  doing  so.  If  a  decision  to  comply  is 
the  product  of  a  cost-benefit  analysis  on  the  part 
of  national  decision  makers,  rates  of  compliance 
should  be  significantly  affected  by  the  presence  or 
absence  of  enforcement  mechanisms.  In  practice, 
most  fields  of  international  law  contain  elements 
of  both  the  “carrot”  and  the  “stick”  approach. 

There  are  several  methodological  problems 
associated  with  attempting  to  determine  with  pre¬ 
cision  the  rate  of  compliance  with  international 


law — most  basically,  the  fact  that  there  may  not  be 
a  clear  divide  between  compliance  and  noncompli- 
ance.  Many  treaty  provisions  are  worded  in  such  a 
way  that  it  is  a  matter  of  interpretation  as  to 
whether  a  particular  party  has  complied  with  its 
obligations.  It  is  not  difficult  to  assess  whether  a 
state  has  or  has  not  complied  with  an  obligation  to 
submit  a  report  on  its  implementation  of  a  human 
rights  treaty,  for  example,  but  it  may  be  more  dif¬ 
ficult  to  assess  on  the  basis  of  that  report  whether 
the  state  has  implemented  the  treaty  fully,  in  both 
letter  and  spirit. 

A  notable  feature  of  the  system  of  international 
law  over  recent  decades  has  been  the  growth  in  the 
number  of  international  courts  and  courtlike  bod¬ 
ies.  Historically,  states — or  at  least  their  decision¬ 
making  elite — have  been  far  from  keen  on  the  idea 
of  being  required  to  abide  by  the  outcome  of  a 
third-party  decision  in  relation  to  issues  with  high 
political  stakes.  The  dispute  resolution  process 
within  the  World  Trade  Organization  stands  out 
because  it  is  compulsory.  Any  member  state  that 
believes  that  another  member  has  breached  an 
agreement  can  initiate  a  case.  The  dispute  is  heard 
by  a  panel  whose  report  is  then  adopted  by  the 
Dispute  Resolution  Body  unless  there  is  a  consen¬ 
sus  against  adoption.  Appeals  are  heard  by  an 
appellate  body.  Should  a  member  not  comply  with 
the  outcome  of  this  process,  retaliatory  trade  sanc¬ 
tions  may  be  imposed.  Despite  the  fact  that  a  con¬ 
siderable  proportion  of  trade  disputes  are  settled 
informally,  there  is  strong  scholarly  interest  in  the 
workings  of  the  dispute  resolution  process  because 
its  establishment  in  an  intergovernmental  organi¬ 
zation  with  such  large  membership  is  a  historically 
significant  development. 

The  two  international  courts  with  the  greatest 
potential  impact  on  world  politics  are  the 
International  Court  of  Justice  (ICJ)  and  the 
International  Criminal  Court  (ICC). 

The  International  Court  of  Justice 

The  ICJ  is  one  of  the  principal  organs  of  the 
UN.  The  ICJ  can  hear  cases  on  any  subject  matter 
brought  to  the  Court  by  states,  so  long  as  it  finds 
that  the  states  concerned  have  consented  to  the 
jurisdiction  of  the  Court.  Expressions  of  consent 
may  have  been  open-ended  or  relate  only  to  the 
case  in  question.  The  Court  can  issue  an  order  for 
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provisional  measures — the  equivalent  of  an  interim 
injunction — which  are  designed  to  protect  the 
rights  of  parties  during  the  period  before  the  Court 
decides  on  the  case  itself.  The  Court  has  averaged 
less  than  three  cases  per  year  but  has  become 
increasingly  busy  in  recent  years.  States  have  com¬ 
plied  with  the  judgments  of  the  Court  in  a  majority 
of,  though  not  all,  contentious  cases,  but  compli¬ 
ance  with  provisional  measures  has  not  been  good. 
In  its  2001  decision  in  the  LaGrand  Case,  the 
Court  confirmed  that  orders  of  the  Court  indicat¬ 
ing  provisional  measures  are  binding.  Should  a 
party  to  a  dispute  heard  by  the  Court  not  comply 
with  the  decision  of  the  Court,  the  other  party/ 
parties  may  request  the  assistance  of  the  Security 
Council  in  obtaining  compliance.  This  would 
clearly  politicize  the  question  and  would  not  be 
useful  if  the  grievance  were  against  one  of  the  per¬ 
manent  members  of  the  Security  Council.  The 
Council  has  never  formally  declared  that  it  is  act¬ 
ing  to  enforce  a  decision  of  the  ICJ. 

In  addition  to  hearing  contentious  cases  brought 
by  states,  the  ICJ  can  respond  to  requests  by  cer¬ 
tain  bodies  within  the  UN  for  advisory  opinions. 
These  are  nonbinding  responses  to  questions  of 
law.  Examples  include  the  1995  opinion  on  the 
legality  of  the  threat  of  use  of  nuclear  weapons  and 
the  2003  opinion  on  the  legal  consequences  of  the 
construction  of  a  wall  in  the  occupied  Palestinian 
territory. 

The  International  Criminal  Court 

The  establishment  of  the  ICC  in  1998  was  one 
of  the  most  momentous  international  law  develop¬ 
ments  of  the  late  20th  century.  The  ICC  exercises 
jurisdiction  over  individuals  who  commit  serious 
breaches  of  international  law  through  the  crimes 
of  genocide,  war  crimes,  and/or  crimes  against 
humanity,  and  it  will  have  jurisdiction  over  the 
crime  of  aggression  once  a  definition  of  the  crime 
has  been  agreed  on.  A  key  principle  in  the  statute 
is  that  of  complementarity;  the  ICC  was  not 
intended  to  replace  national  courts  but  to  operate 
in  situations  in  which  states  are  either  unable  or 
unwilling  to  act.  The  ICC  is  set  to  have  significant 
political  impact,  in  large  part  because,  despite  its 
having  jurisdiction  over  individuals  rather  than 
states,  this  necessarily  gives  rise  to  broader  politi¬ 
cal  implications.  This  is  particularly  evident  in  the 


case  of  aggression.  Aggression  was  included  in  the 
Statute  of  the  ICC  at  the  insistence  of  developing 
countries,  but  there  was  no  agreement  during  the 
negotiations  as  to  how  aggression  was  to  be  defined. 
Historically,  aggression  has  been  considered  a  crime 
committed  by  states,  but  the  Nuremburg  and  Tokyo 
tribunals  after  World  War  II  declared  that  individu¬ 
als  were  accountable  for  crimes  against  peace.  It 
would  be  very  difficult  for  the  Court  to  find  that  an 
individual  had  committed  the  crime  of  aggression 
without  passing  judgment  on  the  actions  of  the  state 
in  question. 

The  issue  is  similar  in  relation  to  the  trigger 
mechanisms  by  which  a  case  can  be  brought  before 
the  Court.  A  situation  may  be  referred  to  the  ICC 
Prosecutor  by  a  state  party  or  initiated  by  the  pros¬ 
ecutor  with  the  authorization  of  a  pretrial  cham¬ 
ber.  In  both  these  situations,  the  state  in  whose 
territory  the  conduct  in  question  occurred  or  of 
which  the  person  accused  of  the  crime  is  a  national 
must  have  given  its  consent.  A  situation  can  also  be 
referred  to  the  prosecutor  by  the  Security  Council, 
and  if  brought  by  this  means,  the  Court  will  have 
jurisdiction  even  if  the  crime  was  committed  in  a 
nonstate  party  by  a  national  of  a  nonstate  party 
and  in  the  absence  of  consent  by  the  territorial 
state  or  the  state  of  nationality  of  the  accused.  The 
United  States  has  not  become  a  party  to  the  statute 
of  the  Court,  one  of  its  principal  objections  being 
that,  contrary  to  the  principle  of  state  consent,  the 
Court  can  hear  a  case  against  an  individual  whose 
country  is  not  party  to  the  statute.  The  United 
States  did  not,  however,  veto  Security  Council 
Resolution  1593  (2005),  which  referred  the  situa¬ 
tion  in  Darfur  to  the  ICC  despite  Sudan  being  a 
nonparty  to  the  statute.  The  first  four  situations 
referred  to  the  Court  were  all  African. 

The  ICC  complements  the  work  of  national 
courts  as  well  as  other  types  of  bodies,  including 
criminal  courts  established  on  an  ad  hoc  basis  and 
truth  and  reconciliation  commissions.  Truth  com¬ 
missions  have  no  prosecutorial  powers  and  are  not 
courts.  They  attempt  to  acknowledge  and  docu¬ 
ment  a  history  of  human  rights  abuses  within  a 
particular  country  or  area — often  those  that  took 
place  under  a  regime,  or  during  a  period  of  turmoil 
or  civil  war — as  a  step  toward  healing  wounds 
within  that  society.  They  represent  a  midpoint 
between  a  blanket  amnesty  for  perpetrators  of 
human  rights  and  international  humanitarian  law 
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violations  and  formal  criminal  trial.  More  than 
20  truth  commissions  have  been  established  since 
1970,  including  in  Uganda,  Nepal,  and  South 
Africa.  Both  a  Truth  and  Reconciliation  Commis¬ 
sion  and  a  special  court  established  jointly  by  the 
Government  of  Sierra  Leone  and  the  UN  operated 
concurrently  in  Sierra  Leone  in  the  wake  of  its  civil 
war.  Other  ad  hoc  tribunals  include  the 
International  Criminal  Tribunal  for  the  former 
Yugoslavia  and  the  International  Criminal  Tribunal 
for  Rwanda.  Both  Timor  and  Cambodia  have 
hybrid  courts  in  which  criminal  justice  is  being 
pursued  through  what  can  be  understood  as  inter¬ 
nationalized  domestic  courts. 

The  Substantive  Content 
of  International  Law 

International  law  can  be  divided  into  a  number  of 
overlapping  subfields  or  regimes.  While  this  is  use¬ 
ful  as  a  way  of  classifying  the  vast  and  growing 
amount  of  international  law,  it  should  be  remem¬ 
bered  that  any  single  material  source  may  well 
have  relevance  to  more  than  one  of  these  fields. 
Enforcement  methods  vary  between  fields  and  in 
their  mix  of  carrot  and  stick  approaches. 

International  Law  and  the  Use  of  Force 

This  field  of  international  law  specifies  the  cir¬ 
cumstances  under  which  a  state  may  or  may  not 
use  force  against  another  state.  One  of  the  most 
striking  features  of  the  current  system  of  interna¬ 
tional  law  is  its  incorporation  of  a  general  prohibi¬ 
tion  on  the  threat  and  use  of  force  in  interstate 
relations.  The  Covenant  of  the  League  of  Nations 
attempted  to  delay  and,  if  possible,  avoid  a  state 
resorting  to  war,  but  the  Charter  of  the  UN  goes 
further  and  permits  states  to  use  force  only  in  self- 
defense  or  if  authorized  to  do  so  by  the  Security 
Council.  The  Charter  recognizes  the  inherent  right 
of  a  state  to  self-defense  if  it  is  a  victim  of  an  armed 
attack.  Having  responded  in  self-defense,  the  state 
concerned  is  expected  to  report  its  actions  to  the 
Security  Council,  and  it  is  assumed  that  the 
Security  Council  will  then  take  appropriate  action. 
Although  the  initial  plan  was  for  individual  states 
to  make  military  forces  available  to  the  UN,  this 
has  not  occurred,  and  in  practice,  the  Council 
authorizes  others  to  use  force  on  its  behalf.  What 


had  previously  been  regarded  as  the  sovereign 
right  of  a  state  to  choose  to  go  to  war  in  pursuit  of 
its  political  objectives  was  removed  by  the  Charter 
and  in  its  place  was  established  a  system  of  collec¬ 
tive  security  in  which  the  whole  international  com¬ 
munity  is  to  work  together  to  prevent,  and  respond 
to,  threats  to  international  peace  and  security. 

The  international  law  of  the  use  of  force  is  not 
so  clear  as  to  how  states  should  respond  to  acts  of 
violence  by  nonstate  actors,  including  terrorists. 
Nor  is  it  well  equipped  to  deal  with  internal  wars. 
The  Charter  of  the  UN  upheld  sovereignty  as  a 
fundamental  principle  of  the  international  system; 
according  to  Article  2(7),  the  UN  is  not  to  interfere 
in  the  domestic  affairs  of  states.  While  a  strong 
affirmation  of  sovereignty  is  welcome  from  the 
perspective  of  a  developing  country  being  able  to 
strengthen  its  position  against  external  meddling 
in  its  political  and  economic  affairs,  it  is  less  useful 
in  safeguarding  human  rights.  This  raises  the  con¬ 
troversial  question  of  the  legality  of  humanitarian 
intervention.  Although  the  vast  majority  of  inter¬ 
national  lawyers  would  agree  that  humanitarian 
intervention  authorized  by  the  Security  Council  in 
response  to  a  threat  to  international  security  is 
legal,  only  a  few  believe  that  contemporary  inter¬ 
national  law  permits  unilateral  acts  of  humanitar¬ 
ian  intervention.  Those  who  do  believe  that 
humanitarian  intervention  is  not  covered  by  the 
general  prohibition  on  the  use  of  force  in  Article 
2(4)  of  the  UN  Charter  generally  seek  support  for 
their  views  in  international  customary  law. 

International  legal  scholarship  in  this  field 
examines  particular  instances  of  the  use  of  force  to 
assess  their  legality  and  any  likely  possible  impact 
on  customary  international  law.  In  recent  years, 
for  example,  a  focus  on  the  legality  of  the  North 
Atlantic  Treaty  Organization’s  (NATO’s)  use  of 
force  during  the  Kosovo  crisis  was  followed  by 
debate  regarding  Operation  Enduring  Freedom  as 
a  response  to  the  terrorist  attacks  of  September  11, 
2001,  and  the  scope  for  a  state  to  justify  the  use  of 
military  force  in  response  to  a  terrorist  attack  as  a 
valid  act  of  self-defense.  Such  debate  was  then 
overshadowed  by  the  2003  Iraq  War,  which  most 
legal  scholars  deem  to  have  been  illegal.  While  the 
United  States,  the  United  Kingdom  (UK),  and 
Australia  claimed  to  derive  authority  for  their  use 
of  force  from  Security  Council  Resolution  678 
(1990),  the  vast  majority  of  international  lawyers 


1238  International  Law 


question  the  assertion  that  Iraqi  noncompliance 
with  Security  Council  Resolution  687  (1991) 
could  have  reactivated  the  Council’s  authorization 
for  the  First  Gulf  War.  Indeed,  both  the  United 
States  and  the  UK  had  written  letters  to  the 
Council  regarding  Security  Council  Resolution 
1441  (2002),  in  which  they  emphasized  that  any 
further  Iraqi  noncompliance  would  need  to  be  con¬ 
sidered  again  by  the  Council. 

One  of  the  criticisms  most  commonly  leveled  at 
international  law  within  political  science  is  its  appar¬ 
ent  inability  to  enforce  rules,  particularly  those  con¬ 
cerning  the  use  of  force,  against  the  most  powerful. 
This  was  a  recognized  weakness  of  the  UN  Charter 
from  its  earliest  days.  Security  Council  enforcement 
can  ensure  compliance  on  the  part  of  less  powerful 
states  and  those  without  an  ally  among  the  perma¬ 
nent  members  of  the  Council,  but  international  law 
arguably  relies  on  “carrots”  to  ensure  the  compli¬ 
ance  of  the  most  powerful.  The  illegality  of  the  2003 
invasion  of  Iraq  became  so  clear  that,  for  example, 
it  served  to  delegitimize  and  weaken  international 
support  for  U.S.  foreign  policy. 

International  Economic  Law 

International  economic  law  (IEL)  regulates  inter¬ 
national  economic  activity.  While  many  aspects  of 
international  law  have  economic  implications,  IEL 
is  generally  understood  as  incorporating  those  legal 
regimes  with  the  most  direct  economic  conse¬ 
quences,  such  as  the  international  law  of  trade, 
investment,  currency,  and  finance.  The  growth  of 
this  field  of  international  law  is  reflected  in  the 
establishment  in  2008  of  a  Society  of  International 
Economic  Law.  International  trade  law  is  the  most 
developed  branch  of  IEL.  The  breakdown  of  the 
Doha  Round  of  negotiations  in  the  World  Trade 
Organization  and  the  associated  turn  to  bilateral 
and  regional  trade  negotiating  processes  is  one  of 
the  most  significant  recent  developments  in  IEL. 

For  many  years,  few  other  than  the  specialists 
kept  track  of  what  was  happening  in  the  periodic 
negotiating  rounds  of  the  General  Agreement  on 
Tariffs  and  Trade  and  the  World  Trade  Organiza¬ 
tion.  The  protests  at  the  1999  ministerial  meeting 
in  Seattle  marked  a  new  era.  The  International 
Monetary  Fund,  World  Bank,  and  World  Trade 
Organization  have  moved  to  enhance  their  account¬ 
ability  through  increased  transparency,  and  some 


legal  scholars  are  promoting  the  need  to  develop  a 
body  of  international  administrative  law  to  address 
governance  questions  in  intergovernmental  institu¬ 
tions.  IEL  regulates  multinational  corporations 
only  indirectly,  and  developing  countries  have 
since  at  least  the  1970s  called  for  more  effective 
regulation  of  the  behavior  of  multinational  corpo¬ 
rations,  especially  in  relation  to  human  rights  and 
environmental  matters. 

International  Environmental  Law 

As  a  field  of  international  law,  international 
environmental  law  is  relatively  recent.  Key  global 
treaties  include  the  1985  Vienna  Convention  for 
the  Protection  of  the  Ozone  Layer  and  its  1987 
Montreal  Protocol,  the  1992  UN  Framework  Con¬ 
vention  on  Climate  Change  and  its  1997  Kyoto 
Protocol,  the  1992  Convention  on  Biological 
Diversity,  and  the  2001  Stockholm  Convention  on 
Persistent  Organic  Pollutants.  Climate  change  is 
arguably  the  most  significant  issue  in  international 
environmental  law  today,  with  considerable  schol¬ 
arly  focus  being  directed  toward  the  architecture 
of  the  post-Kyoto  regime  and  the  implications  of 
climate  change  for  other  fields  of  international 
law,  including  the  international  law  of  human 
rights  and  the  law  of  the  sea. 

There  are  clear  implications  of  measures  taken 
to  protect  the  environment  for  international  eco¬ 
nomic  relations,  and  from  the  perspective  of  a 
developing  country,  this  field  of  law  has  often  been 
criticized  for  perpetuating  Western  colonial  domi¬ 
nance.  Where  an  environmental  issue — such  as 
ozone  depletion — is  of  a  truly  global  nature,  devel¬ 
oping  countries  have,  however,  found  themselves 
in  a  stronger  negotiating  position  and  able  to 
secure  differentiated  responsibilities.  Types  of  dif¬ 
ferentiated  responsibilities  have  included,  inter 
alia,  those  that  distinguish  between  industrial  and 
developing  countries  with  respect  to  the  central 
obligations  contained  in  the  treaty  and  those 
addressing  aspects  of  implementation,  such  as 
delayed  compliance  schedules,  and  financial  and 
technological  assistance. 

The  Law  of  the  Sea 

This  is  a  far  older  branch  of  international  law 
and  addresses  all  aspects  of  ocean  use.  Historically, 
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customary  international  law  evolved  to  address 
navigation  and  fishing.  One  of  the  most  funda¬ 
mental  principles  in  the  law  of  the  sea  is  that  of  the 
“freedom  of  the  seas,”  by  which  the  oceans  are  a 
global  commons  available  for  use  by  all.  Naval 
powers — the  United  States  now  as  with  the  UK 
before  it — emphasize  the  importance  of  this  prin¬ 
ciple.  Over  time,  however,  coastal  states  have 
increased  the  extent  of  their  regulatory  reach  into 
the  oceans  adjacent  to  their  territory.  The  primary 
material  source  of  international  law  on  the  oceans 
is  the  1982  Third  UN  Convention  on  the  Law  of 
the  Sea,  which  entered  into  force  in  1994  and  sets 
out  just  who  has  what  sort  of  rights  in  which  parts 
of  the  ocean  and  seabed.  It  defines  a  number  of 
zones,  including  those  of  the  territorial  sea,  conti¬ 
nental  shelf,  and  exclusive  economic  zone.  The 
United  States  has  not  ratified  the  Convention  but 
respects  most  of  its  provisions,  which  are  also 
found  in  customary  international  law.  Topical 
within  the  study  of  the  law  of  the  sea  in  recent 
years  have  been  questions  of  maritime  boundary 
determination,  protection  of  fish  stocks  from  over¬ 
fishing,  delimiting  the  outer  edge  of  the  continental 
shelf,  and  maritime  security.  Climate  change  is 
raising  new  questions,  including  those  relating  to 
the  melting  of  polar  ice  and  the  future  of  the  mari¬ 
time  zones  associated  with  states  that  may  disap¬ 
pear  as  sea  levels  rise. 

International  Human  Rights  Law 

International  human  rights  law  (IHRL)  is  con¬ 
cerned  primarily  with  the  protection  of  individu¬ 
als  from  threats  to  their  human  rights  stemming 
from  the  state.  The  Universal  Declaration  of 
Human  Rights,  together  with  the  International 
Covenant  on  Civil  and  Political  Rights  and  the 
International  Covenant  on  Economic  and  Social 
Rights  together  make  up  what  is  informally 
referred  to  as  the  International  Bill  of  Rights. 
There  are  a  considerable  number  of  more  specific 
human  rights  treaties  at  a  global  level,  including 
the  1951  Convention  Relating  to  the  Status  of 
Refugees,  the  1965  International  Convention  on 
the  Elimination  of  all  Forms  of  Racial 
Discrimination,  the  1979  Convention  on  the 
Elimination  of  all  Forms  of  Discrimination  against 
Women,  the  1989  Convention  on  the  Rights  of 
the  Child,  and  the  2006  Convention  on  the  Rights 


of  Persons  with  Disabilities.  Global  IHRL  is 
complemented  by  many  regional  treaties. 

Human  rights  law  has  traditionally  assumed  a 
distinction  between  public  and  private  spheres  of 
human  activity  and  focused  on  the  first  of  these  to 
the  detriment  of  those  who  suffer  human  rights 
abuses  in  the  private  sphere.  Recent  years  have 
seen  increased  recognition  within  IHRL  of  the 
need  to  increase  human  rights  accountability  for 
corporations,  international  organizations,  multi¬ 
lateral  development  banks,  multinational  peace¬ 
keeping  operations,  and  individuals.  Compliance 
mechanisms  in  IHRL  often  consist  of  a  regularized 
collection  of  self-reported  data,  with  review  by  an 
international  committee.  The  number  of  human 
rights  treaties  continues  to  increase,  but  the  focus 
has  shifted  to  the  need  to  work  for  better  compli¬ 
ance  with  existing  human  rights  treaties  rather 
than  their  continued  proliferation. 

International  Humanitarian  Law 

International  humanitarian  law  (IHL),  also 
known  as  the  international  law  of  armed  conflict, 
addresses  human  rights  in  wartime  and  thus  over¬ 
laps  with  IHRL  as  well  as  with  international  crimi¬ 
nal  law.  Traditional  means  of  ensuring  compliance 
with  IHL  include  appeals  by  the  International 
Committee  of  the  Red  Cross,  prosecution  before 
international  tribunals,  training  programs  in  military 
academies,  and  condemnation  by  other  states.  The 
war  on  terror  launched  by  the  Bush  administration 
gave  extra  vigor  to  debate  regarding  the  relevance  of 
IHL  to  conflicts  involving  fighters  not  representing 
states  and  a  perceived  need  to  rethink  ways  of  ensur¬ 
ing  respect  for  IHL.  The  increasing  privatization  of 
war  has  also  raised  questions  regarding  the  IHL 
accountability  of  private  security  firms. 

International  Criminal  Law 

International  criminal  law  (ICL)  overlaps  both 
IHRL  and  IHL  as  well  as  other  fields  such  as  the 
law  of  the  sea — consider,  for  example,  the  crime  of 
piracy.  Transnational  offences  addressed  by  inter¬ 
national  criminal  law  include  drug  trafficking, 
money  laundering,  and  cybercrime.  Some  actions 
are  specified  as  international  crimes  in  the  relevant 
treaty.  Article  1  of  the  1948  Convention  on  the 
Prevention  and  Punishment  of  the  Crime  of 
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Genocide  states  in  that  the  Contracting  Parties 
“confirm  that  genocide,  whether  committed  in 
time  of  peace  or  in  time  of  war,  is  a  crime  under 
international  law  which  they  undertake  to  prevent 
and  to  punish.”  Other  treaties,  including  the  1984 
Convention  Against  Torture  and  Other  Cruel, 
Inhuman,  or  Degrading  Treatment  or  Punishment, 
do  not  explicitly  describe  the  conduct  in  question 
as  an  international  crime  but  require  all  parties  to 
make  the  offence  a  crime  in  national  law  and/or  to 
prosecute  or  extradite  the  alleged  offender. 

There  is  no  agreed  definition  of  terrorism  in 
international  law,  but  a  number  of  counterterror¬ 
ism  conventions  have  been  negotiated  in  response 
to  particular  forms  and  acts  of  terrorism.  These 
include  the  1979  International  Convention  Against 
the  Taking  of  Hostages,  the  1998  UN  Convention 
for  the  Suppression  of  Terrorist  Bombings,  and  the 
2000  UN  Convention  for  the  Suppression  of  the 
Financing  of  Terrorism.  Most  counterterrorism 
conventions  define  the  elements  of  the  offence  and 
incorporate  provisions  requiring  the  state  in  which 
an  alleged  offender  is  found  to  proceed  with  extra¬ 
dition  or  prosecution. 

The  Impact  of  International  Law 
on  the  Behavior  of  States 

Amazing  as  it  may  seem  to  most  international  law¬ 
yers,  political  scientists  are  divided  over  whether 
international  law  does  or  does  not  influence  the 
way  in  which  states  conduct  their  affairs,  particu¬ 
larly  as  regards  the  “high-politics”  issues  of  war 
versus  peace  and  even  in  relation  to  “middle-range” 
political  issues  in  which  some  degree  of  national 
interest  is  at  stake.  Fundamental  questions  concern¬ 
ing  state  behavior  in  relation  to  international  law 
necessarily  reflect  broader  theoretical  positions 
regarding  the  relationship  of  international  law  to 
world  politics.  At  the  time  when  the  discipline  of 
international  relations  emerged  after  World  War  I, 
liberal  internationalists  painted  a  picture  of  the  pos¬ 
sibility  of  a  world  in  which  war  was  not  inevitable 
and  promoted  international  law  and  institutions  as 
the  way  to  achieve  such  a  world.  The  idealists 
shared  a  moralistic,  optimistic  outlook  on  world 
affairs.  The  establishment  of  the  League  of  Nations 
and  the  Permanent  Court  of  International  Justice 
could  be  regarded  as  embodiments  of  this  idealist 
perspective.  But  as  the  inevitability  of  world  war 


became  apparent  in  the  late  1930s,  Edward  Hallett 
Carr  spoke  out  against  what  by  then  appeared  to 
be  the  utopian  dreams  of  the  liberal  international¬ 
ists.  In  The  Twenty  Years’  Crisis,  1919-1939,  Carr 
claimed  that  it  is  not  that  ethics  and  international 
law  have  no  place  in  world  politics  but  that  the 
bottom  line  is  always  power. 

World  War  II  ushered  in  an  era  in  which  realism 
dominated  the  discipline.  The  United  States  has 
functioned  as  the  hegemon  of  the  international 
relations  discipline  since  World  War  II,  and  in  the 
United  States,  the  discipline  of  international  rela¬ 
tions  is  firmly  embedded  in  political  science  (rather 
than,  e.g.,  history  or  philosophy).  The  leading 
postwar  U.S.  realist,  Hans  Morgenthau,  had  been 
disillusioned  by  the  failure  of  international  law  to 
prevent  World  War  II,  and  the  focus  of  classical 
realism  on  power  politics  and  national  interest 
typically  leaves  little  room  for  international  law. 
Where  realists  do  refer  to  international  law,  it  is 
generally  to  downplay  any  independent  influence 
of  international  law  on  the  course  of  world  poli¬ 
tics.  Realists  believe  that  they  deal  with  the  world 
as  it  really  is  rather  than  as  it  ought  to  be. 

The  seeming  mismatch  between  the  hard-line 
approach  of  Cold  War  realism  and  the  reality  as 
lived  by  government  international  lawyers  encour¬ 
aged  some  lawyers  to  seek  to  demonstrate  that 
international  law  did  have  a  real-world  impact. 
One  approach  was  to  recount  the  experiences  of 
foreign  policy  legal  advisers.  It  appears  that  there 
is  a  wide  range  of  experiences  here,  from  being 
intimately  involved  in  the  decision-making  pro¬ 
cess,  as  was  the  case  in  the  United  States  during  the 
Cuban  Missile  Crisis,  to  not  even  being  consulted 
before  a  major  policy  decision  with  legal  ramifica¬ 
tions  is  taken.  The  role  of  foreign  policy  legal 
advisers  can  be  characterized  as  generally  being 
that  of  a  neutral  professional  when  first  asked  for 
advice  on  the  legality  of  a  proposed  course  of 
action,  with  that  role  shifting  to  one  of  a  policy 
advocate  once  a  policy  decision  has  been  taken. 

Another  approach  toward  demonstrating  the 
relevance  of  international  law  was  to  speak  in 
terms  of  the  “functions”  or  “roles”  of  interna¬ 
tional  law.  U.S.  scholars  tended  to  focus  on  crisis 
situations,  at  which  times  international  law  was 
shown  to  be  valuable  in  choosing  between  and 
legitimizing  policy  options  and  in  providing  lan¬ 
guage  and  institutions  through  which  the  parties 
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could  communicate  with  each  other.  Hedley  Bull, 
associated  with  the  so-called  English  School  of 
International  Relations,  believed  that  despite  the 
anarchical  nature  of  the  international  system,  there 
is  such  a  thing  as  international  society.  According 
to  Bull  (1994),  international  law  serves  to  identify 
the  idea  of  a  society  of  sovereign  states,  state  the 
basic  rules  of  coexistence,  and  help  mobilize  com¬ 
pliance  with  the  rules  of  international  society. 
International  lawyers,  and  those  international  rela¬ 
tions  scholars  with  an  interest  in  international  law, 
have  generally  not  needed  to  justify  their  respect 
for  international  law  as  much  in  the  UK  and 
Europe  as  in  the  United  States. 

Regime  theory  flourished  in  the  United  States 
and  elsewhere  in  the  1980s  and  early  1990s. 
Regime  theory  began  as  an  approach  to  analyzing 
processes  of  international  cooperation  that  may  or 
may  not  incorporate  a  formal  organizational  struc¬ 
ture  of  which  the  particular  focus  of  much  research 
was  the  General  Agreement  on  Tariffs  and  Trade. 
Recognizing  that  intangibles  are  integral  to  pro¬ 
cesses  of  formalized  and  less  formal  processes  of 
international  cooperation,  regime  theorists  referred 
to  norms,  principles,  rules,  and  decision-making 
procedures.  The  fact  that  international  regimes  are 
generally  founded  by  multilateral  treaty  opened  up 
possibilities  for  interdisciplinary  cooperation, 
although  there  is  no  single  regime  theory  approach 
to  international  law  because  the  regime  vehicle  has 
been  used  by  theorists  across  the  spectrum,  includ¬ 
ing  those  of  an  essentially  realist  persuasion. 

The  primary  ongoing  use  of  regime  theory  is  in 
relation  to  questions  of  regime  design  and  effec¬ 
tiveness  in  the  study  of  environmental  coopera¬ 
tion.  Even  perfect  compliance  may  not  mean  that 
a  treaty  is  effective  if  the  provisions  it  contains  are 
inadequate  to  meet  the  objectives  for  which  it  was 
established.  It  is  generally  accepted,  for  example, 
that  even  if  all  parties  were  to  fully  comply  with 
their  Kyoto  commitments,  the  emissions  reduction 
targets  in  the  treaty  were  inadequate  to  make  a 
significant  impact  on  the  problem.  Names  associ¬ 
ated  with  regime  theory  include  Kenneth  Abbott, 
Stephen  Krasner,  Duncan  Snidal,  Volker  Rittberger, 
Jorgen  Wettestad,  and  Oran  Young. 

From  the  perspective  of  those  who  view  interna¬ 
tional  law  as  a  largely  coherent  system  of  law, 
much  of  the  international  relations  literature  that 
does  recognize  norms — including  within  a  regime 


framework — remains  inadequate  because  it  does 
not  attempt  to  distinguish  a  legal  norm  from  any 
other  norm.  In  2000,  a  special  issue  of  the  U.S. 
journal  International  Organization  presented  a 
framework  within  which  to  analyze  “legaliza¬ 
tion,”  the  increasing  amount  of  law  evident  in 
international  institutions  and  process.  Current 
research  questions  on  legalization  include  those  of 
its  causes  and  consequences  and  why  it  is  that  gov¬ 
ernments  favor  legalization  in  some  institutions, 
issue  areas,  and  regions  more  than  in  others. 
Europe,  for  example,  is  much  more  highly  legal¬ 
ized  than  some  other  regions  of  the  world. 

In  recent  years,  scholarship  on  the  political 
functioning  of  international  law  has  been  influ¬ 
enced  by  constructivism.  Although,  as  with  real¬ 
ism  and  liberalism,  there  are  many  variants  of 
constructivism,  they  share  a  perspective  on  norms 
and  identity  as  central  to  the  social  and  political 
sphere.  Constructivists  break  down  the  divide 
between  social  structures  and  agency,  regarding 
structure  as  being  continually  recreated  by  agency, 
just  as  norms  create  identity  and  expectations. 
Constructivists  acknowledge  the  interrelationship 
between  international  law  and  world  politics. 
They  do  not  deny  the  functions  of  international 
law  as  identified  by  liberals  but  more  consciously 
point  to  international  law  as  having  a  socially 
constitutive  function. 

The  Global  Distribution  of  Power  and 
Attitudes  Toward  International  Law 

Many  of  the  most  ardent  advocates  of  interna¬ 
tional  law  have  been  inspired  by  the  ideal  of  a  rule 
of  law  functioning  in  the  international  sphere  to 
the  benefit  of  all  states  and  people.  Given  the 
increasing  scope  and  complexity  of  international 
law  in  the  early  years  of  the  2 1st  century,  it  is  thus 
timely  to  consider  the  attitude  toward  interna¬ 
tional  law  of  those  most  and  least  powerful  in  the 
international  political  system. 

Developing  Countries  and  International  Law 

The  rapid  process  of  decolonization  resulted  in 
many  new  states  joining  the  system  of  interna¬ 
tional  law  as  political  and  legal  entities  in  their 
own  right.  Acutely  aware  that  their  formal  legal 
equality  as  sovereign  states  did  not  equate  with 
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economic  equality,  these  states  in  the  1970s 
launched  a  campaign  for  a  New  International 
Economic  Order  (NIEO).  The  NIEO  agenda 
included  recognition  of  the  right  of  every  country 
to  adopt  the  economic  and  social  system  that  it 
deems  most  appropriate  for  its  own  development, 
full  permanent  sovereignty  of  every  state  over  its 
natural  resources  and  all  economic  activities,  and 
the  regulation  and  supervision  of  the  activities  of 
transnational  corporations.  The  campaign  failed, 
although  a  number  of  its  elements  have  resurfaced 
in  other  contexts.  Developing  countries  are  find¬ 
ing  new  levels  of  negotiating  weight  in  areas  in 
which  their  consent  and  participation  are  essential 
to  the  industrialized  world,  including  in  global 
environmental  regimes.  Although  they  have  not 
yet  been  able  to  steer  the  international  economic 
agenda  in  the  direction  that  they  would  like  it  to 
go,  developing  countries  are  now  more  able  to 
recognize  the  implications  and  resist  some  of  the 
demands  of  the  developed  world.  It  is,  neverthe¬ 
less,  necessary  to  distinguish  among  developing 
countries.  While  the  leading  developing  countries 
are  increasingly  becoming  active  participants  in 
international  dispute  resolution  bodies  and  in  the 
development  of  new  legal  regimes,  the  least  devel¬ 
oped  simply  do  not  have  the  legal  expertise  and 
financial  resources  at  their  disposal  to  play  a  full 
role  in  international  law. 

There  remains  a  clear  North  American  and 
European  dominance  in  international  legal  scholar¬ 
ship,  but  there  have  been  a  number  of  influential 
international  lawyers  who  have  voiced  a  develop¬ 
ing  world  perspective  on  international  law.  Most  of 
these  scholars  prefer  to  retain  the  term  Third 
World.  Some  of  those  who  established  a  strong 
tradition  of  Third  World  scholarship  on  interna¬ 
tional  law  include  Georges  Abi-Saab,  Francisco 
Garcia-Amador,  R.  P.  Anand,  Mohammed 
Bedjaoui,  and  Taslim  Elias.  In  recent  years,  a 
number  of  scholars,  including  Anthony  Anghie, 
B.  S.  Chimni,  Makau  Mutua,  and  B.  Rajagopal, 
have  self-identified  their  work  as  TWAIL  or  Third 
World  Approaches  to  International  Law.  They 
have  been  influenced  by  postcolonial  theory  and 
critical  approaches  to  international  law  and  have 
contributed  to  reinvigorated  interest  in  the  history 
of  the  discipline  and  its  subject  matter.  TWAIL 
scholars  have  emphasized  that  colonialism  was  not 
simply  an  unfortunate  episode  in  the  history  of 


international  law  but  rather  constitutive  of  interna¬ 
tional  law  and  that  international  law  cannot  be 
separated  from  its  colonial  past. 

The  United  States  and  International  Law 

The  United  States  played  a  leading  role  in  estab¬ 
lishing  most  of  the  key  institutions  that  make  up 
the  contemporary  international  order,  including 
those  of  the  UN,  the  Nuclear  Non-Proliferation 
regime,  and  the  General  Agreement  on  Tariffs  and 
Trade.  From  2001  to  2009,  under  President  George 
W.  Bush,  the  United  States  came  under  consider¬ 
able  criticism  for  what  was  seen  as  an  increased 
tendency  toward  unilateralism  and  associated 
decline  in  support  for  international  law  and  institu¬ 
tions.  Evidence  for  this  charge  included  U.S.  failure 
to  support  the  International  Criminal  Court,  the 
Kyoto  Protocol,  and  the  Comprehensive  Nuclear 
Test  Ban  Treaty.  Some  practices  of  the  United 
States  for  which  it  came  under  criticism,  such  as 
that  of  ratifying  human  rights  treaties  only  to  the 
extent  that  their  provisions  corresponded  with 
rights  already  existing  within  the  U.S.  legal  system, 
were  not  new  to  the  Bush  administration.  President 
Barack  Obama,  elected  to  office  in  2008,  recog¬ 
nized  that  his  administration  would  need  to  work 
hard  to  restore  the  reputation  of  the  United  States 
so  far  as  its  commitment  to  international  law  and 
institutions  was  concerned.  To  further  this  goal,  at 
the  time  of  this  writing,  the  Obama  administration 
was  seeking  Senate  approval  for  ratification  of  the 
Comprehensive  Nuclear  Test  Ban  Treaty. 

Europe  and  International  Law 

Europe  was  the  birthplace  of  modern  interna¬ 
tional  law.  During  the  administration  of  George 
W.  Bush,  some  observers  contrasted  what  appeared 
to  be  strong  European  support  for  international 
law  with  the  more  instrumentalist  approach  of  the 
United  States.  The  United  States  was  accused  of 
using  international  law  as  a  tool  when  it  chose  to 
do  so,  without  lending  its  weight  to  strengthening 
universal  support  for  human  rights  and  respect  for 
the  international  rule  of  law  as  something  far 
greater  than  a  U.S.  policy  tool.  The  allegedly  unde¬ 
sirable  U.S.  attitude  toward  international  law  can 
be  seen  reflected  in  several  strands  of  U.S.  scholar¬ 
ship,  including  recent  applications  to  international 
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law  of  rational  choice  and  game  theory  that  pur¬ 
port  to  prove  that  international  law  serves  as  no 
more  than  a  weak  exogenous  constraint  on  state 
action.  And  yet,  if  the  U.S.  attitude  is  attributable 
to  its  position  of  hegemony,  so  could  European 
attitudes  be  interpreted  as  a  product  of  relative 
power.  From  this  perspective,  Europe  has  been 
determined  to  shore  up  its  moral  and  legal  weight 
because  it  is  unable  to  match  the  United  States  in 
terms  of  sheer  military  and  economic  muscle. 
Setting  aside  transatlantic  tensions,  it  cannot  be 
denied  that  the  European  Union  has  a  highly  effec¬ 
tive  legal  system  and  the  most  highly  developed 
system  of  human  rights  protection  in  the  world 
and  that  Europe  remains  a  bastion  of  intellectual 
support  for  an  international  law  that  purports  to 
be  of  universal  value. 

China  and  International  Law 

With  its  rising  power,  China  and  its  attitude 
toward  international  law  are  subjects  of  increasing 
interest  and  speculation.  In  common  with  most 
developing  countries,  China  places  greater  emphasis 
on  sovereignty  than  does  the  rhetoric  of  Western 
liberal  democracy,  but  China  has  nevertheless  been 
supportive  of  multilateralism  and  collective  security. 
Some  critical  commentators  see  China  as  drawing 
on  its  dual  identities  of  major  power  and  developing 
country  to  accept  many  of  the  benefits  of  its  increas¬ 
ing  economic  weight  without  an  associated  cost. 
Cynics  also  suggest  that  the  apparent  commitment 
of  China  to  multilateralism  may  be  a  function  of  its 
relative  power,  a  strategy  to  be  pursued  until  China’s 
position  in  the  international  order  is  such  that  it  can 
operate  unilaterally  if  it  so  chooses.  And  yet,  particu¬ 
larly  given  that  nondemocratic  countries  are  gener¬ 
ally  assumed  less  likely  than  democratic  countries  to 
uphold  international  law,  China’s  attitude  toward 
international  law  and  the  extent  of  its  integration 
into  the  international  system  have  been  noteworthy. 

Increasing  Complexity  and  Possible 

Fragmentation  of  International  Law 

International  law  is  transforming  from  a  minimal¬ 
ist  body  of  law  addressing  selected  aspects  of  inter¬ 
state  relations  toward  a  much  more  dense  web  of 
law  affecting  all  aspects  and  levels  of  governance 
and  under  increasing  pressure  to  tackle  issues  of 


normative  and  institutional  hierarchy.  Some  fields 
of  international  law,  notably  IHRL,  do  not  seek  to 
regulate  the  external  conduct  of  states  but  set  stan¬ 
dards  for  the  internal  functioning  of  states,  and 
international  law  is  thereby  permeating  domestic 
legal  systems  to  an  increasing  degree.  The  growth 
in  the  number  of  international  courts  and  courtlike 
bodies  is  creating  the  potential  for  states  to  “forum 
shop”  as  well  as  for  competition  between  legal 
regimes.  Tensions  between,  for  example,  interna¬ 
tional  trade  and  human  rights  or  the  environment 
may  be  exacerbated  by  the  degree  to  which  the 
state’s  actions  in  one  of  the  regimes  in  question — 
in  this  case,  trade — are  subjected  to  third-party 
dispute  resolution. 

One  contemporary  approach  to  international  law 
and  its  place  within  world  politics  that  is  arguably 
responding  to  the  growing  complexity  and  centrality 
to  world  politics  of  international  law  is  that  of  inter¬ 
national  constitutionalism.  Writers  on  international 
constitutionalism  have  in  common  the  use  of  consti¬ 
tutional  language  derived  from  the  domestic  context 
to  interpret  the  growing  complexity  of  the  contem¬ 
porary  system  of  international  law  and  its  constitu¬ 
ent  regimes.  While  the  language  of  constitution  has 
long  been  used  within  international  law — the  Third 
United  Nations  Law  of  the  Sea  Convention  has 
often  been  referred  to  as  a  “constitution  for  the 
oceans,”  for  example — the  constitutional  metaphor 
is  now  being  employed  much  more  tightly  and,  in 
many  cases,  with  a  more  clearly  normative  agenda. 

Theorists  differ  as  to  the  extent  to  which  inter¬ 
national  law  is  able  to  function  independently  of 
the  international  distribution  of  power,  but  few,  if 
any,  would  claim  that  it  could  disengage  com¬ 
pletely  from  fundamental  geopolitical  shifts  in  the 
balance  of  power.  This  suggests  that  with  the  rise 
of  Asia,  including  China  and  India,  international 
law  is  set  to  undergo  further  significant  change  in 
the  21st  century. 
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International  Monetary 

Fund  (IMF) 


The  International  Monetary  Fund  (IMF)  is  a  vol¬ 
untary  organization  of  187  member  countries  that 
operates  as  a  major  multilateral  institution.  Its 
goals  include  promoting  international  monetary 
cooperation  and  stability  of  the  exchange  rate, 
thus  aiding  the  expansion  of  international  trade 
and  economic  well-being  of  its  member  states. 

In  March  2010,  the  European  Union  decided 
that  there  might  be  a  need  to  call  the  IMF  in  case 
one  of  the  members  of  the  eurozone  needed  finan¬ 
cial  assistance.  In  July  2007,  less  than  3  years 
before  Dominique  Strauss-Kahn  was  appointed 
managing  director  of  the  IMF,  the  institution  had 
fallen  into  disrepute  and  was  widely  regarded  as 
almost  useless;  indeed,  the  primary  task  of  the  new 
managing  director  seemed  to  be  to  cut  costs  and 
reduce  personnel.  There  appeared  to  be  no  role,  in 
the  new  world  of  “great  moderation”  of  the  early 
21st  century,  for  an  institution  that  had  been  cre¬ 
ated  in  1 944  to  police  exchange  rates  and  that  had 
progressively  become  the  firefighter  and  police 
officer  of  international  financial  crises.  The  global 
financial  crisis  that  had  started  as  a  U.S.  banking 
crisis  only  weeks  before  Strauss-Kahn  took  office 
radically  changed  the  outlook:  Not  only  was  the 
IMF  needed  again  for  emergency  lending  to  coun¬ 
tries  faced  with  sudden  liquidity  shortages,  this 
time  mostly  in  Europe,  but  its  expertise  was  widely 
sought  in  determining  the  extent  of  damage  in  the 
global  banking  sector,  helping  with  fiscal  consoli¬ 
dation  plans  in  the  worst  hit  economies,  and  elabo¬ 
rating  reform  proposals  under  the  mandate  of  the 
newly  created  G20.  Just  as  in  the  aftermath  of  each 
international  financial  crisis,  the  IMF  is  alive  and 
back  in  business — but  in  a  different  business.  This 
entry  describes  the  complex  evolution  of  the  IMF, 
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first  examining  the  role  of  the  Washington 
Consensus  and  taking  into  account  the  2008-2009 
crisis  and  then  investigating  the  problems  that  are 
at  stake  for  the  institution. 

Structure  and  Governance 

The  IMF  was  created,  along  with  the  World  Bank, 
at  the  Bretton  Woods  conference  in  July  1944, 
only  weeks  after  the  landing  of  allied  troops  in 
Normandy.  It  was  initially  conceived  as  the  cor¬ 
nerstone  of  the  international  monetary  system  of 
fixed  exchange  rates  that  was  to  be  instituted  in 
the  aftermath  of  World  War  II.  The  consensus 
among  the  Allied  forces  at  that  time  was  that  a 
fixed  exchange  rate  system  was  needed  to  prevent 
the  reappearance  of  the  kind  of  “competitive 
devaluation”  and  other  beggar- thy-neighbor  poli¬ 
cies  that  had  prevailed  as  reactions  to  the  financial 
crisis  of  1929  and  were  then  widely  regarded  as 
having  played  a  major  part  in  the  depth  and  length 
of  the  Great  Depression  of  the  1930s.  The  initial 
British  plan,  put  forth  by  John  Maynard  Keynes, 
was  to  institute  a  world  exchange  system  based  on 
a  new  world  liquidity  instrument,  the  bancor, 
which  would  be  issued  and  managed  by  a  genuine 
world  central  bank.  This  plan  had  been  rejected  by 
the  U.S.  delegation  which  favored  the  so-called 
gold  exchange  standard:  All  currencies  have  to  be 
convertible  and  have  a  fixed  exchange  rate  vis-a- 
vis  the  U.S.  dollar,  which  by  itself  is  convertible 
into  gold  at  a  fixed  parity  of  $35  (U.S.)  per  ounce. 

In  December  1945,  when  the  first  Articles  of 
Agreements  were  signed,  the  IMF  counted  29 
member  countries.  Membership  progressively 
expanded  to  187  countries  in  2010.  Its  main  gov¬ 
erning  body  is  the  Board  of  Governors,  which 
meets  once  a  year,  where  all  member  countries  are 
represented  by  a  governor,  who  is  usually  the  head 
of  the  central  bank.  Most  executive  tasks  are,  how¬ 
ever,  delegated  to  the  executive  board,  which  com¬ 
prises  24  members.  Only  large  economies  have 
their  own  representative,  the  others  being  distrib¬ 
uted  in  regional  groups  and  electing  representa¬ 
tives.  In  both  boards,  voting  rights  are  distributed 
unevenly  in  broad  accordance  with  relative  weights 
in  the  world  economy  (see  the  last  section  of  this 
entry).  As  of  August  2010,  eight  countries  had 
their  own  representatives  in  the  Executive  Board: 
The  percentage  of  voting  rights  for  the  United 


States  of  America  is  16.74;  Japan,  6.01;  Germany, 
5.87;  France,  4.85;  the  United  Kingdom  (UK), 
4.85;  China,  3.65;  Saudi  Arabia,  3.16;  and  the 
Russian  Federation,  2.69.  The  least  represented 
region  is  sub-Saharan  Africa,  with  only  two  repre¬ 
sentatives  for  43  countries  and  a  total  of  4.35%  of 
voting  rights. 

From  Surveillance  to  Structural  Adjustment 

In  the  original  Bretton  Woods  system,  the  role 
of  the  IMF  was  essentially  one  of  macroeconomic 
surveillance:  making  sure  that  countries  with — 
partially — convertible  currencies  (i.e.,  essentially 
developed  countries  of  the  West  and  Japan)  were 
managing  their  economies  in  a  manner  broadly  in 
line  with  the  requirements  of  a  fixed  exchange  rate 
system — that  is,  maintaining  balance-of-payments 
disequilibria  within  reasonable  limits  in  a  world  of 
strictly  limited  and  state-controlled  international 
capital  movements.  In  principle,  the  IMF  also  had 
to  ensure  that  enough  international  liquidity  is  pro¬ 
vided,  but  in  practice,  this  role  was  taken  over  by 
the  United  States,  which  issues  the  international 
reserve  currency  and  holds  a  blocking  minority 
share  in  the  qualified-majority  voting  mechanism  of 
the  Fund.  In  1969,  the  creation  of  a  new  reserve 
instrument,  the  Special  Drawing  Rights  (SDR),  was 
supposed  to  give  the  Fund  more  leeway.  This  was 
initially  limited  by  the  U.S.  opposition  and  later 
became  defunct  because  of  the  relative  abundance 
and  autonomy  of  international  liquidity  in  the  world 
of  global  finance  in  the  1990s.  The  SDRs  were  in 
fact  never  really  used  on  a  significant  scale.  There 
were  two  major  allocations  of  SDRs:  In  August 
2009,  a  general  allocation  of  161.2  billion  was  made 
to  all  member  countries,  and  in  September,  a  special 
allocation  was  made  to  members  who  had  joined 
after  1981  and  who  had  not  benefited  from  the  pre¬ 
vious  allocations.  As  of  August  2010,  the  total 
amount  of  SDRs  allocated  amounted  to  204.1  bil¬ 
lion,  equivalent  to  approximately  $317  billion  (U.S.). 

Until  the  end  of  the  Bretton  Woods  era  (sym¬ 
bolically  proclaimed  in  August  1971  by  U.S. 
President  Richard  Nixon,  who  officially  suspended 
the  dollar  convertibility  into  gold),  the  IMF  was 
thus  largely  confined  to  making  sure  that  exchange 
rate  adjustments  by  members  would  remain  infre¬ 
quent  and  not  too  large  and  that  balance-of- 
payments  disequilibria  were  not  excessive,  at  least 
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outside  the  United  States.  It  did  intervene  in  a  few 
European  countries — most  visibly  Italy  and  the  UK 
in  1977 — to  provide  liquidity  and  assistance  in 
cases  of  serious  difficulties  and  to  gain  access  to 
international,  private  financing.  The  Jamaica 
Agreements,  in  1976,  sealed  the  end  of  the  Bretton 
Woods  era,  by  officially  admitting  the  plurality  of 
exchange  rate  regimes  and  eliminating  gold  as  a 
reference  in  the  international  monetary  system.  By 
the  same  token,  these  agreements  signaled  the 
weakening  of  the  IMF. 

The  irresistible  rise  in  international  short-term 
capital  flows  and  international  lending,  conse¬ 
quent  to  the  generalized  floating  exchange  rate 
system  and  to  the  external-accounts  surpluses  gen¬ 
erated  by  the  first  oil  shock  in  Organization  of  the 
Petroleum  Exporting  Countries  (OPEC)  countries, 
led  to  a  first  wave  of  sovereign  borrowing  during 
the  second  half  of  the  1970s,  mostly  in  Latin 
America,  which  was  then  widely  regarded  as  the 
promised  land  of  economic  growth.  After  the 
U-turn  in  U.S.  monetary  policy  in  1979-1981, 
under  Federal  Reserve  chairman  Paul  Volcker,  to 
fight  two-digit  domestic  inflation  and  the  second 
oil  shock,  Mexico  defaulted  on  its  foreign  loans  in 
August  1982,  thereby  signaling  the  start  of  a  long 
series  of  external  debt  crises.  The  IMF  did  not  play 
a  major  role  in  the  resolution  of  these  crises — the 
U.S.  Treasury  acted  almost  unilaterally  to  solve  a 
problem  that  primarily  concerned  U.S.  commercial 
banks — but  emergency  lending  to  distressed  sover¬ 
eign  borrowers  and  technical  assistance  for  adjust¬ 
ment  plans  became  a  part  and  parcel  of  the  IMF’s 
new  role  in  international  finance. 

The  Washington  Consensus  in  Question 

One  of  the  major  tasks  of  the  IMF  was  to  counsel 
governments  in  countries  needing  structural  eco¬ 
nomic  reforms.  After  the  introduction  of  reforms 
and  the  collapse  of  the  Soviet  economy,  the  conse¬ 
quent  transition  of  former  centrally  planned  econo¬ 
mies  of  Central  and  Eastern  Europe  provided  the 
Fund’s  teams  with  a  king-size  experiment  to  elabo¬ 
rate  and  test  ambitious  and  wide-ranging  transfor¬ 
mation  and  adjustment  plans  that  would  soon 
become  known  as  the  Washington  Consensus:  a 
cocktail  of  structural  reforms  usually  involving  tax 
reforms,  public  employment  reduction,  cuts  in 
social  expenditures,  privatizations  and  liberalization 


of  external  financial  transactions,  and  standard  pre¬ 
dicaments  for  macroeconomic  adjustment — fiscal 
consolidation  and  tight  monetary  policy. 

The  massive  privatization  movement  that  ensued 
in  the  first  half  of  the  1990s  fueled  the  expansion 
and  globalization  of  financial  markets  along  with 
the  rise  of  private  borrowing.  In  parallel,  many 
countries  had  been  moving  away  from  floating 
exchange  rates,  which  had  proved  to  be  a  source  of 
instability  rather  than  being  the  means  of  national 
monetary  sovereignty.  Pegging  the  exchange  rate 
to  a  foreign  currency  became  the  norm  again,  not 
only  in  Europe  but  also  in  Asia  and  Latin  America. 
This  environment  led  to  overconfidence  and  indeed 
to  excessive  reliance  on  external  financing  in  many 
countries.  The  second  Mexican  crisis  in  1994  and 
the  Asian  crisis  of  1997,  soon  followed  by  the 
Russian  and  Latin  American  crises,  led  the  IMF  to 
further  increase  lending  facilities  and  provided  a 
large-scale  experiment  for  its  structural  adjustment 
plans.  The  social  and  political  costs  incurred  by 
national  governments,  together  with  the  severity  of 
the  subsequent  recessions,  aroused  strong  criticism 
of  the  doctrine  on  which  IMF  interventions  were 
based  and  forced  most  emerging  economies  to  turn 
away  from  external  financing.  Instead,  with  poli¬ 
cies  of  pegging  their  currencies  with  an  underval¬ 
ued  exchange  rate,  they  aimed  at  external  surpluses 
and  foreign  exchange  reserve  accumulation  and 
tried  to  build  regional  banks  to  free  themselves 
from  the  tutorship  of  the  Fund  and  rely  on  a  form 
of  “self-insurance.” 

From  Lending  to  the  Poor 
to  Lending  to  the  Rich? 

The  global  crisis  of  2008  to  2009  radically  changed 
the  outlook.  The  G20  took  over  global  leadership 
as  of  November  2008  and  had  to  rely  on  existing 
institutions  for  expertise  and  for  conducting  rescue 
operations  for  the  worst  hit  economies.  Hence,  the 
Organisation  for  Economic  Co-operation  and 
Development  (OECD)  and  the  IMF  regained  influ¬ 
ence,  even  though  their  legitimacy  could  be  ques¬ 
tioned,  not  least  by  the  large  emerging  economies. 
The  IMF  became  the  promoter  of  coordinated  fis¬ 
cal  stimulus  plans  to  prevent  generalized  deflation 
and  depression;  it  moved  further  in  the  direction  of 
nonorthodox  predicaments  by  advocating  a  light- 
handed  approach  to  fiscal  consolidation  in  the 
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aftermath  of  the  crisis,  both  by  pressing  govern¬ 
ments  not  to  start  consolidating  too  early,  which 
might  destroy  the  fragile  recovery,  and  by  suggest¬ 
ing  that  central  banks  accept  slightly  higher  infla¬ 
tion  targets  to  ease  fiscal  adjustment. 

In  April  2009,  at  the  London  Summit  of  the 
G20,  reforms  were  announced  in  diverse  areas 
such  as  surveillance  and  monitoring  of  global 
financial  risks,  transparency  and  the  fight  against 
tax  havens,  and  emergency  interventions  for  finan¬ 
cially  distressed  economies,  most  particularly  in 
Europe.  For  this  purpose,  the  decision  was  made  to 
increase  the  Fund’s  lending  capacity  to  U.S.$750 
billion.  Half  of  this  additional  funding  came  from 
EU  countries,  and  it  was  soon  to  be  drawn  on  by 
other  European  countries — both  from  the  EU 
(Hungary,  Estonia,  and  Romania)  and  from  out¬ 
side  the  EU  (Iceland).  Though  a  few  poor  countries 
did  get  aid  from  the  IMF,  it  clearly  appeared  that 
the  larger  lending  facilities  had  been  designed  to 
help  relatively  rich  countries  and  incidentally  to 
help  the  EU  Commission  and  national  govern¬ 
ments  to  restore  macroeconomic  stability  and  fis¬ 
cal  discipline  in  economies  that  had  very  close 
links  with  the  core  of  the  EU — that  is,  the  euro¬ 
zone.  In  doing  so,  the  IMF  also  reasserted  the 
structural  adjustment  plans  that  had  characterized 
the  Washington  Consensus,  though  with  appar¬ 
ently  greater  care  for  social  expenditures.  And  in 
March  2010,  when  the  Greek  public  debt  crisis 
reached  a  climax,  the  EU  leaders  could  not  agree 
on  a  bail-out  plan  unless  the  IMF  were  part  of  the 
lending-of-last-resort  mechanism,  the  German 
government  counting  on  its  expertise  and  the 
severity  of  its  predicaments  to  impose  discipline  on 
the  Greek  government  or  indeed  any  eurozone 
member  that  might,  later  on,  fall  victim  to  specula¬ 
tive  attacks  on  purportedly  unsustainable  debt. 

A  New  Role  in  Preventing 
Financial  Instability 

Solving  the  current  crisis,  which  had  started  in 
2008,  is  only  a  part  of  the  renewed  mandate  of  the 
IMF;  preventing  future  ones  by  reforming  interna¬ 
tional  financial  governance  has  also  been  set  high 
on  the  G20  agenda,  and  the  IMF  is,  along  with  the 
Basel  Committee  and  the  Bank  for  International 
Settlements  (BIS) — the  coordination  institutions 
for  major  central  banks  and  the  major  source  of 


the  prudential  regulation  doctrine  for  commercial 
banking — a  major  actor  in  the  new  global  effort  to 
reduce  systemic  risk.  Thus,  the  Forum  for  Financial 
Stability,  a  coordination  committee  within  the 
Fund,  has  been  turned  into  a  full-fledged  and 
permanent  Council  of  Financial  Stability,  with  a 
mandate  to  monitor  global  financial  developments 
and  issue  early  warning  signals  in  case  of  perceived 
mounting  imbalances  and  excessive  risk  accumula¬ 
tion.  And  the  IMF  has  been  entrusted  with  the 
mission  to  study  new  instruments  of  banking  regu¬ 
lation,  including  taxation  on  risk  taking,  and  pos¬ 
sible  limitations  on  the  free  movement  of  capital,  at 
least  for  short-term,  speculative  capital  flows.  This 
could  clearly  yield  a  radical  change  in  financial 
globalization.  But  how  far  will  national  govern¬ 
ments  be  willing  to  go? 

Which  Legitimacy? 

One  of  the  major  weaknesses  of  the  IMF  lies  in  its 
own  governance  structure  and  is  apparent  in  its 
highest  ranking  officials:  Traditionally,  while  the 
Chairman  of  the  World  Bank  is  a  U.S.  citizen,  the 
General  Director  of  the  IMF  is  a  European.  This 
dominance  of  Europe  is  even  reinforced  in  the  cur¬ 
rent  top-ranking  management  by  the  fact  that, 
alongside  Strauss-Kahn,  the  chief  economist  is 
Olivier  Blanchard,  a  French-born  professor  of  eco¬ 
nomics  at  the  Massachusetts  Institute  of  Technology. 
This  governance  structure  itself  reflects  the  distri¬ 
bution  of  capital  and  voting  rights  among  member 
countries.  Initially  justified  by  the  overwhelming 
domination  of  the  United  States  and  the  European 
economies  in  the  immediate  aftermath  of  World 
War  II,  the  country-wise  distribution  is  increasingly 
being  contested,  especially  by  large  emerging  econ¬ 
omies.  The  latest  reform,  implemented  in  2006, 
redistributed  about  5.4%  of  voting  rights  to  the 
benefit  of  54  countries;  among  those  who  have 
gained  the  most  are  China,  Korea,  Brazil,  and 
Turkey.  However,  the  United  States  still  possesses 
nearly  17%  of  voting  rights,  which  amounts  to  a 
veto  power,  and  the  EU  countries,  taken  together, 
have  about  32%  of  voting  rights,  whereas  China 
has  only  3.65%,  India  1.89%,  and  Brazil  1.38%. 

At  the  G20  summit  of  April  2009,  it  was 
decided  to  reallocate  5%  of  voting  rights.  Yet  this 
would  not  suffice  to  yield  a  system  of  representa¬ 
tion  more  in  line  with  economic  and  financial 
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weights  of  countries  in  the  global  economy.  This 
very  slow  change  in  the  IMF  governance  and  prac¬ 
tices  may  well  be  a  major  factor  in  the  current 
acceleration  of  regional  plans,  especially  in  Asia,  to 
organize  monetary  and  financial  stability  indepen¬ 
dently  of  worldwide  institutions  that  appear  to  be 
too  closely  controlled  by  Western  countries.  The 
pending  and  highly  sensitive  issue  of  exchange 
rates  among  major  international  currencies  and 
the  evolutions  required  to  facilitate  the  return  to  a 
more  balanced  international  economy  still  makes 
the  task  of  the  IMF  more  complex  and  further 
weakens  the  legitimacy  of  its  current  governance. 
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International  organizations  (IOs)  have  three  defin¬ 
ing  features:  (1)  they  are  constituted  by  members 
from  more  than  one  state,  usually  more  than  two; 
(2)  they  are  oriented  toward  the  pursuit  of  com¬ 
mon  goals,  at  least  initially;  and  (3)  they  have  a 
certain  level  of  explicit  formal  structure,  usually 
established  by  a  treaty  or  constituent  document. 
IOs  are  material  entities,  usually  with  offices  and 
personnel.  In  most  cases,  they  have  a  legal  person¬ 
ality,  and  thus,  they  can  function  as  actors  in 


national  and  international  politics.  In  general,  orga¬ 
nizations  exhibit  higher  levels  of  goal  specificity 
and  formalization  than  other  types  of  collectivities, 
such  as  social  movements,  informal  networks,  or 
more  or  less  diffuse  communities. 

There  are  many  different  types  of  IOs.  The  main 
distinction  is  between  the  IOs  constituted  by  states 
and  those  that  are  constituted  by  other  actors.  The 
former  are  known  as  intergovernmental  organiza¬ 
tions  (IGOs)  and  the  latter  as  international  non¬ 
governmental  organizations  (INGOs  or  sometimes 
simply  NGOs).  Some  INGOs  have  individual 
members,  some  have  a  combination  of  individuals 
and  locally  or  nationally  organized  groups  as 
members,  and  some  are  umbrella  organizations  for 
national  nongovernmental  organizations. 

IGOs  have  several  formal  characteristics  in 
common.  Since  they  are  established  by  govern¬ 
ments,  they  have  a  formal  document  serving  as 
their  “constitution” — usually  a  formal  treaty  in 
line  with  requisitions  in  international  law. 
Moreover,  they  usually  have  headquarters  with 
permanent  staff,  regular  meetings  of  representa¬ 
tives  of  member  states,  and  rules  for  decision  mak¬ 
ing  and  other  organizational  processes.  IGOs  also 
differ  in  important  respects,  such  as  competence 
and  membership.  Membership  is  open  to  all  coun¬ 
tries  or  restricted  according  to  geographical,  func¬ 
tional,  or  other  criteria.  Competence  can  be  wide, 
covering  a  broad  range  of  issues,  or  restricted  to  a 
narrow  domain. 

The  category  of  INGOs  is  often  subdivided  into 
profit-  and  nonprofit-oriented  organizations.  Profit- 
oriented  organizations  are  mostly  multinational 
companies,  such  as  Nike  or  Shell  Oil  Company,  or 
various  kinds  of  business  organizations  represent¬ 
ing  such  companies  directly  or  indirectly,  such  as 
the  International  Chamber  of  Commerce  (ICC). 
Nonprofit  organizations  cover  wide  areas  of  spe¬ 
cific  group  interests,  such  as  Disabled  Peoples’ 
International  (DPI),  or  promote  core  values,  such  as 
peace,  human  rights  (e.g.,  Amnesty  International), 
and  environmental  protection  (e.g.,  World  Wildlife 
Federation  [WWF]). 

This  entry,  unless  otherwise  stated,  focuses 
exclusively  on  IGOs.  A  final  section,  though,  is 
devoted  to  INGOs.  In  the  following  two  sections, 
brief  overviews  of  the  historical  development  of 
IGOs  and  the  study  of  IGOs  are  provided. 
Subsequent  sections  examine  in  some  depth  three 
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aspects  of  IOs  and  institutions  that  have  been  par¬ 
ticularly  prominent  in  international  relations  (IR) 
research:  Why  are  IOs  established?  What  do  they 
do?  What  impact  do  they  have? 

The  IGO  System:  Patterns  of  Development 

International  cooperation  has  taken  place  as  long 
as  states  have  existed,  but  the  first  institution  that 
met  the  conventional  defining  characteristics  of  an 
IGO  was  the  Central  Commission  for  Navigation 
on  the  Rhine  (CCNR),  which  was  established  in 
1815.  Another  4  decades  would  pass  before  the 
next  IGO  was  established,  for  the  river  Donau. 
The  Industrial  Revolution  and  the  increase  in  vari¬ 
ous  forms  of  communication  and  transaction 
across  borders  led  to  a  more  rapid  increase  from 
the  1860s.  Most  of  these  organizations  were  estab¬ 
lished  to  solve  well-defined  coordination  chal¬ 
lenges.  Examples  include  the  International 
Telegraphic  Bureau  (later  International  Telegraphic 
Union),  established  in  1868;  the  General  Postal 
Union  (later  Universal  Postal  Union),  established 
in  1874;  and  the  International  Bureau  of  Weight 
and  Measures,  established  in  1875.  More  than  two 
thirds  of  the  IGOs  established  before  World  War  I 
dealt  with  technical  coordination  and/or  economic 
activities.  Since  the  end  of  World  War  II,  growth 
has  been  higher  for  IGOs  operating  within  social 
and  cultural  domains.  The  two  world  wars  reduced 
the  population  of  the  IGOs  to  some  extent,  but  the 
first  decade  or  two  following  the  peace  settlements 
saw  particularly  high  growth  rates.  As  the  total 
number  of  IGOs  has  increased,  the  average  organi¬ 
zation  has  become  more  specialized  in  functional 
terms  and  more  narrow  in  terms  of  its  membership 
base.  According  to  Cheryl  Shanks,  Harold 
Jacobson,  and  Jeffery  Kaplan  (1996)  the  involve¬ 
ment  in  the  activities  of  IGOs  differs  substantially: 
“Older”  states  tend  to  participate  more  than 


“young”  ones,  rich  states  more  than  poor,  democ¬ 
racies  more  than  authoritarian  and  totalitarian 
regimes,  and  large  countries  more  than  small. 

Describing  the  development  of  the  IGO  “popu¬ 
lation”  in  numbers  is  a  less  straightforward  task 
than  one  might  think.  The  reason  is  simply  that 
numbers  depend  on  the  precise  definition  applied. 
For  example,  some  overviews  include  only  organi¬ 
zations  established  directly  by  governments  while 
others  also  include  organizations  created  by  one  or 
more  IGOs.  The  authoritative  source  is  the 
Yearbook  of  International  Organizations,  which  is 
published  by  the  Union  of  International 
Associations.  Table  1  is  based  on  this  database  and 
applies  this  definition. 

The  total  number  of  IGOs  also  includes  organi¬ 
zations  that  are  created  by  other  IGOs.  Shanks 
et  al.  (1996)  found  that  these  emanations  consti¬ 
tuted  70%  of  the  total  IGO  population  in  the  early 
1990s,  thus  bringing  the  total  number  up  to  more 
than  1,100.  The  legal  status  of  IGOs  that  are  cre¬ 
ated  by  other  IGOs  is  not  very  clear.  New  rules 
were  defined  in  the  Vienna  Convention  on  the  Law 
of  Treaties  Between  States  and  International 
Organizations  or  Between  International  Organi¬ 
zations  (VCLTIO),  which  opened  for  signature  in 
1986.  At  the  time  of  writing,  this  convention  is  not 
yet  in  force.  Questions  about  their  legal  status  not¬ 
withstanding,  there  can  be  no  doubt  that  this  type 
of  IGO  also  plays  an  important  role  in  interna¬ 
tional  relations. 

The  development  of  IGOs  has  been  driven 
partly  by  changes  in  the  fabric  of  the  international 
system  itself.  Two  aspects  seem  particularly  impor¬ 
tant:  (1)  changes  in  networks  of  interdependence 
and  (2)  increase  in  the  number  of  states. 

In  essence,  cooperation  is  an  effort  to  manage 
interdependence.  The  more  sensitive  or  vulnerable 
one  actor  is  to  the  activities  of  others,  the  stronger 
its  incentives  to  try  to  influence  their  behavior.  For 


Table  I  Number  of  International  Organizations 


Year 

1909 

1956 

1985 

1995 

2005 

IGO 

37 

132 

378 

266 

246 

INGO 

176 

973 

4,676 

5,121 

7,306 

Source:  Yearbook  of  International  Organizations,  http://www.uia.org.  Numbers  for  1909  to  1995  are  taken  from  http://www 
.uia.org/statistics/organizations/ytb299.php  (accessed  December  1,  2010).  Numbers  for  2005  are  taken  from  the  2005/2006 
yearbook  edition,  Appendix  3,  Table  1,  p.  2966. 
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cooperation  to  emerge,  this  interest  must  be 
mutual  so  that  both  (all)  parties  see  some  possibil¬ 
ity  of  improving  outcomes  by  exchanging  commit¬ 
ments  with  prospective  partners.  Taken  together, 
these  observations  suggest  that  the  scope  and  level 
of  cooperation  will  reflect  the  strength  and  con¬ 
figuration  of  interdependence  relationships. 

This  is  a  complex  hypothesis,  and  previous  research 
provides  only  partial  tests.  In  bivariate  analysis, 
conventional  measures  of  interdependence — such  as 
volumes  of  international  trade,  investment,  and 
several  forms  of  communication — correlate  posi¬ 
tively  with  the  number  of  IGOs.  Results  are, 
however,  sensitive  to  the  choice  of  operational 
definitions  of  the  key  variables.  The  best  “fit”  is 
achieved  when  we  link  interdependence  measured 
in  terms  of  absolute  volumes  of  transactions  to  the 
total  IGO  population,  including  also  emanations. 
However,  the  basic  argument  refers  to  sensitivity 
and  vulnerability,  and  these  are  concepts  pointing 
toward  relative  measures  of  interdependence. 
Thus,  a  country’s  sensitivity  to  external  economic 
events  is  primarily  a  function  of  the  importance  of 
foreign  trade  to  its  national  income  (measured  as 
foreign  trade  in  proportion  to  gross  domestic 
product  [GDPJ).  When  we  replace  absolute  with 
relative  measures,  we  find  that  the  population  of 
IGOs  has  grown  at  a  much  higher  rate  than  the 
level  of  interdependence.  Moreover,  the  argument 
linking  interdependence  to  organizations  puts  as 
much  weight  on  configurations  of  interdependence 
as  on  aggregate  volumes  of  transactions.  A  survey 
of  the  case  study  literature  would  bring  out  many 
observations  that  support — at  least  indirectly — the 
latter  proposition.  As  far  as  we  know,  however,  no 
existing  database  provides  the  kind  of  data  required 
for  more  extensive  and  refined  testing.  The  upshot 
of  all  this  is  that  while  we  may  safely  conclude  that 
interdependence  is  a  major  factor  generating  incen¬ 
tives  ( demand )  for  cooperation,  we  cannot  claim 
to  understand  equally  well  supply-side  mecha¬ 
nisms  and  the  interactive  dynamics  of  the  demand- 
supply  relationship. 

As  the  first  shots  were  fired  in  World  War  I,  the 
number  of  sovereign  states  had  not  yet  reached  50. 
Today,  there  are  about  200  such  states.  This 
increase  can  be  expected  to  lead  to  more  IGOs  for 
at  least  two  reasons.  First,  as  the  number  of  states 
increases,  the  average  state  is  left  with  jurisdiction 
over  a  declining  share  of  the  world  population  and 


world  resources.  This  reduces  its  capacity  to  serve 
its  people  through  purely  unilateral  measures. 
Second,  as  the  number  of  actors  increases,  so  does 
the  number  of  potential  partnerships.  This  effect  is 
amplified  by  the  fact  that  states  that  have  recently 
escaped  the  rule  of  their  former  masters  often  are 
particularly  eager  to  engage  in  cooperative  ven¬ 
tures  with  new  partners. 

The  development  of  the  IGO  system  also  seems 
to  reflect  changes  in  the  fabric  of  national  political 
systems  as  well.  Two  features  seem  particularly 
important.  One  is  the  expansion  and  upgrading  of 
public  policy  in  relatively  “new”  domains,  such  as 
social  security  and  environmental  protection.  The 
other  is  the  tendency  for  national  administrative 
systems  to  become  increasingly  specialized  and  dif¬ 
ferentiated.  The  development  of  the  IGO  popula¬ 
tion,  particularly  over  the  second  half  of  the  20th 
century,  reflects  both  of  these  changes  to  a  signifi¬ 
cant  degree. 

The  Study  of  IGOs 

In  the  first  decade  or  two  after  World  War  II,  the 
United  Nations  (UN)  became  a  special  case  of  IOs 
as  its  universal  membership  and  increasing  scope  of 
competences  made  it  appear  more  important  than 
any  other  IO.  Its  uniqueness  and  importance 
attracted  scholars  to  conduct  a  number  of  studies 
of  the  internal  functions  of  the  UN,  such  as  the  vot¬ 
ing  patterns  in  the  General  Assembly  and  the 
recruitment  of  staff  to  the  Secretariat.  Pioneers  in 
this  field  such  as  Robert  Cox  and  Harold  Jacobson 
studied  IOs  as  bureaucracies  with  the  aim  of  under¬ 
standing  the  decision-making  and  other  political 
processes  taking  place  within  the  IOs.  This  approach 
provided  quite  sophisticated  classifications  of  struc¬ 
tural  and  functional  aspects  and  much  empirical 
insight  into  the  work  of  IOs.  It  did  not  to  the  same 
degree  generate  testable  propositions  linking  these 
taxonomies  to  variance  in  performance  or  impact. 

Disillusioned  by  external  developments — the  fail¬ 
ure  of  the  UN  Security  Council  to  deal  with  pressing 
issues  such  as  the  nuclear  arms  race  and  the  Vietnam 
War  and  occasional  setbacks  in  major  regional  inte¬ 
gration  projects — and  left  with  a  research  program 
that  did  not  effectively  get  at  the  underlying  causes 
of  these  difficulties,  many  scholars  reoriented  their 
agenda  to  move  away  from  IGO  structures  and 
procedures  toward  institutions  that  were  defined  as 


International  Organizations  1251 


sets  of  rules  and  norms  governing  certain  activities. 
The  existence  of  formal  organizations  was,  of 
course,  recognized,  but  international  regimes — 
defined  by  Stephen  Krasner  as  the  rules,  norms, 
principles,  and  procedures  around  which  actor 
expectations  converge  in  a  given  issue  area — 
emerged  as  a  new  and  more  exciting  topic.  The 
interest  in  international  institutions  was  also  driven 
by  a  growing  critique  of  the  dominant  grand  theory 
in  the  study  of  international  relations:  realism.  The 
new  emphasis  on  institutions  challenged  a  core 
assumption  of  realism.  It  did  so  in  part  by  claiming 
that  states  may  establish  international  institutions 
even  if  short-term  self-interest  would  be  better 
served  by  free  riding  and  that  these  institutions  tend 
to  acquire  a  life  of  their  own  by  influencing  state 
behavior  in  their  own  right.  It  also  did  so  by  sug¬ 
gesting  that  nonstate  actors  could  significantly 
influence  government  policies  and  outcomes  of 
international  processes. 

Over  the  past  10  years  or  so,  there  has  been  a 
renewed  interest  in  the  study  of  IOs.  The  construc¬ 
tivist  turn  of  much  IR  research  has  inspired  some 
reformulation  of  the  research  agenda,  highlighting, 
inter  alia,  the  role  of  IOs  as  providers  and  custodi¬ 
ans  of  norms  and  rules  serving  as  frames  of  legiti¬ 
macy  for  the  behavior  of  states  as  well  as  groups 
and  individuals.  It  has  also  led  some  to  revisit 
“old”  questions.  For  example,  Michael  Barnett 
and  Martha  Finnemore  have  studied  the  bureau¬ 
cratic  features  of  IOs  in  a  somewhat  different 
framework  and  succeeded  in  providing  new  insight 
into  the  importance  of  these  features  in  shaping 
international  politics  more  broadly. 

The  split  between  the  study  of  organization  and 
the  study  of  regimes,  though,  is  by  no  means  total. 
Many  scholars  see  regimes  and  organizations  as 
two  subgroups  of  international  institutions. 
Moreover,  in  real  life,  organizations  and  regimes 
are  often  intertwined.  Starting  from  the  regime  tra¬ 
dition,  one  could  say  that  regimes  are  often  served 
by  organizations;  starting  from  the  10  tradition, 
one  could  say  that  organizations  often  play  impor¬ 
tant  roles  in  developing  and  managing  regimes. 

Why  and  Flow  Do  States  Establish  IOs? 

The  institutional  turn  in  studies  of  IOs  made  the 
study  of  international  institutions  an  integral  part 
of  the  study  of  international  cooperation. 


International  institutions  can  help  states  solve 
common  problems  through  coordinated  action. 
The  realist  tradition  has  always  pointed  to  the 
anarchical  structure  of  the  international  system  as 
compelling  states  to  be  deeply  concerned  about 
security  and  (relative)  power.  The  international 
system  was  described  as  a  self-help  system  where 
states  have  to  be  very  cautious  about  getting 
involved  in  cooperative  ventures  that  could  benefit 
others  more  than  themselves.  Moreover,  they  will 
expect  others  to  free  ride  as  long  as  they  can  get 
away  with  it.  Institutional  theory  challenges  this 
understanding  by,  inter  alia,  pointing  to  institu¬ 
tional  “services”  that  can  reduce  uncertainty  and 
help  parties  negotiate  as  well  as  enforce  agree¬ 
ments.  An  10  can,  for  example,  help  establish  a 
base  of  consensual  knowledge,  facilitate  negotia¬ 
tions,  and  develop  operational  standards  against 
which  performance  can  be  measured  and  compli¬ 
ance  monitored. 

Research  following  this  functional  approach, 
first  developed  by  Robert  Keohane,  has  explored 
how  different  configurations  of  preferences  can 
shape  outcomes.  Much  interest  has  been  devoted 
to  the  malignant  configuration  known  as  the  pris¬ 
oners’  dilemma  game.  If  played  only  once,  or  a 
finite  number  of  rounds  known  in  advance,  the 
dominant  strategy  for  each  party  will  be  to  defect. 
Evidence  from  experimental  research  indicates  that 
individuals  playing  this  game  tend  to  cooperate 
more  than  static  game  theory  predicts.  Moreover, 
Robert  Axelrod  and  others  have  used  computa¬ 
tional  modeling  to  study  the  evolution  of  coopera¬ 
tion  in  reiterated  games  and  found  that  certain 
strategies  of  conditional  cooperation  do  better 
than  strategies  relying  more  primarily  on  defection. 
By  acknowledging  the  political  malignancy  of  cer¬ 
tain  configurations  of  interests  and  at  the  same 
time  demonstrating  how  regimes  and  organiza¬ 
tions  can  inspire  and  facilitate  cooperation,  research 
pursuing  the  “functional”  approach  helped  bridge 
the  gap  between  the  realist  school  and  the  institu¬ 
tionalists.  The  role  of  IOs  tends,  though,  to  vary 
with  the  problem  structure.  Fortunately,  many  of 
the  common  challenges  facing  governments  gener¬ 
ate  less  malignant  configurations  than  that  consti¬ 
tuting  the  prisoners’  dilemma  game. 

The  realist  tradition  claims,  in  essence,  that 
powerful  states  will  create  institutions  that  serve 
their  interests.  IOs  that  contradict  the  interests  of 
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powerful  states  will  either  not  be  established  or,  if 
they  are,  have  little  or  no  impact.  The  hegemonic 
stability  theory  suggests  that  a  “hegemon”  is 
essential  for  the  establishment  of  strong  interna¬ 
tional  institutions  and  IOs.  A  dominant  actor  will 
have  a  higher  interest  in  the  collective  good  in 
question  and  at  the  same  time  possess  the  capa¬ 
bilities  to  provide  (much  of)  that  good  by  unilat¬ 
eral  action.  Now,  power  defined  as  the  ability  to 
get  others  to  do  what  they  would  otherwise  not 
have  done  can  be  exercised  by  various  means.  In 
the  literature  on  international  institutions,  various 
forms  of  leadership  have  been  discussed,  such  as 
structural  leadership,  where  more  general  power 
resources  are  activated;  intellectual  leadership, 
where  knowledge  is  activated  in  order  to  facilitate 
solutions  to  common  problems;  or  entrepreneurial 
leadership,  where  the  ability  to  identify  solutions 
and  find  compromises  is  activated.  All  these  vari¬ 
ous  resources  or  abilities  on  behalf  of  individual 
actors  can  be  important  in  establishing  and  main¬ 
taining  international  institutions. 

Decision  makers  as  well  as  students  of  interna¬ 
tional  cooperation  have  long  recognized  that  a 
base  of  consensual  knowledge  is  a  necessary 
condition  for  negotiations  to  succeed.  The 
Intergovernmental  Panel  for  Climate  Change 
(IPCC) — a  formal  ICO — was  established  to  trans¬ 
late  findings  from  scientific  research  into  usable 
knowledge  for  policymakers.  The  work  of  the 
IPCC  also  illustrates  how  political  conflict  can 
threaten  to  contaminate  such  processes  and  how  a 
combination  of  organizational  design  and  leader¬ 
ship  may  provide  fairly  good  protection. 

More  generally,  cooperation  often  builds  on  a 
broadly  accepted  ideational  foundation.  IGOs  and 
INGOs  often  play  a  role  in  disseminating  causal 
and  normative  beliefs  that  shape  the  perceptions  of 
international  problems.  In  some  cases,  they  may 
even  contribute  to  the  development  of  collective 
identities.  Various  mechanisms  may  be  at  work, 
but  two  examples  are  often  in  focus.  First,  an  image 
of  a  common  enemy  might  increase  the  actors’  per¬ 
ception  of  common  interests  and  thus  strengthen 
incentives  for  cooperation.  Second,  states  engaged 
in  cooperation  might  find  themselves  to  be  per¬ 
ceived  as  a  model  for  others,  an  effect  that  might 
increase  the  self-confidence  of  the  cooperating  par¬ 
ties  and  lead  to  further  cooperation.  Even  in  the 
absence  of  such  profound  change,  ideas  can 


make  a  significant  difference.  Particular  attention 
has  been  given  to  the  role  of  transnational  net¬ 
works  of  experts — often  referred  to  as  “epistemic 
communities” — in  developing  common  problem 
descriptions  and  diagnoses,  combined  with  ideas 
about  effective  “cures.” 

What  Do  IOs  Do  and  How  Do  They  Do  It? 

As  pointed  out  at  the  beginning,  IOs  are  created 
for  some  purpose — usually  to  achieve  goals  that 
their  members  find  it  hard  to  achieve  (as  effec¬ 
tively)  on  their  own.  More  specifically,  four  roles 
have  attracted  particular  attention. 

One  is  the  role  of  providing  an  arena  for 
exchange  of  information,  negotiation,  and  related 
processes.  As  an  arena,  an  organization  establishes 
a  meeting  place,  specifies  the  activities  to  take 
place  under  its  auspices,  and  provides  rules  regu¬ 
lating  these  activities.  As  arenas  for  exchange  of 
information  and  ideas,  IGOs,  INGOs,  and  some¬ 
times  even  informal  networks  often  facilitate 
learning  and  other  types  of  policy  diffusion. 
Although  IOs  are  not  the  only  possible  way  to 
establish  communication  among  parties,  they  have 
some  characteristics  that  make  them  efficient  in 
this  respect.  IOs  have  some  degree  of  autonomy 
from  the  parties  involved  and  can  thus  be  seen  as 
impartial.  An  10  serves  as  a  place  where  the  parties 
can  meet  on  neutral  ground  for  negotiations.  A 
prominent  example  of  this  is  how  the  secretary- 
general  of  the  UN  is  assigned  a  role  where  he  can 
offer  conflicting  parties  the  use  of  his  “good 
offices.”  In  general,  leaders  of  secretariats  of  IOs 
are  often  assigned  the  role  of  chairing  international 
negotiations.  This  can  imply  a  more  active  role 
than  simply  providing  an  arena  for  negotiations. 
The  assumed  impartiality  and  the  expectation  that 
secretariat  leaders  represent  common  interests  of 
members  make  them  suited  for  such  assignments. 
The  arena  role  of  IOs  makes  it  possible,  at  least  to 
some  extent,  to  regulate  the  access  of  actors  to 
problems  and  the  access  of  problems  to  decision 
games.  And,  finally,  IOs  serve  as  independent 
actors  in  specifying  the  official  purpose  as  well  as 
the  rules,  location,  and  timing  of  negotiations 
among  the  parties. 

IOs  not  only  serve  as  an  arena  for  members  to 
exchange  information  but  also  play  an  active  role 
in  providing  information  themselves.  We  can 
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distinguish  between  factual  information  about  the 
world  in  general  and  about  characteristics  of  the 
member  countries,  information  about  actors’ 
behavior,  and  information  that  elucidate  causal 
connections — that  is,  the  potential  effects  of  the 
IO’s  decisions.  Many  IOs  serve  as  providers  of  all 
these  kinds  of  information  in  their  respective  field. 
The  importance  of  information  is  connected  with 
the  consequences  of  uncertainty  in  the  relationship 
between  states.  Rational  actors  are  assumed  to 
behave  purposively  on  the  basis  of  the  information 
available  to  them.  Thus,  the  information  actors 
possess  about  other  actors’  interests  can  be  a  criti¬ 
cal  determinant  of  their  own  behavior.  The  more 
information  the  IOs  provide,  the  less  likely  it  is 
that  there  will  be  misperception  and  misunder¬ 
standing  among  the  member  states. 

The  second  aspect  is  the  role  of  the  IOs  in  creat¬ 
ing  decision-making  rules  that  affect  the  subse¬ 
quent  outcome  of  the  interaction  of  members  or 
other  actors.  The  formation  of  actors’  interests  is 
influenced  by  various  constraints,  which  will  be 
dealt  with  in  the  subsection  on  framing  of  decisions 
below.  Here,  the  focus  is  on  how  actors’  behaviors 
that  enable  them  to  realize  given  interests  are  influ¬ 
enced  by  constraints  caused  by  decision  rules.  The 
decision  rule  is  an  important  determinant  of  the 
capacity  of  the  10  to  aggregate  divergent  prefer¬ 
ences.  Other  things  being  equal,  aggregation  capac¬ 
ity  reaches  its  maximum  in  strictly  hierarchical 
structures  and  is  at  its  minimum  in  systems  requir¬ 
ing  agreement  (unanimity).  Decision  rules  in  IGOs 
are  affected  by  the  anarchic  structure  of  the  inter¬ 
national  system  and  accompanying  principle  of  the 
sovereignty  of  states.  Since  there  is  no  formal 
authority  that  can  enforce  decisions,  states  are  only 
committed  to  decisions  they  agree  to.  Thus,  com¬ 
pared  with  lawmaking  in  national  political  sys¬ 
tems,  the  role  of  international  law  in  general  is 
weak.  Consensus  is  the  most  common  decision¬ 
making  rule  in  actual  use  in  IGOs.  It  is  also  a  very 
demanding  decision  rule,  leaving  the  burden  of 
proof  with  those  who  want  new  measures  taken 
and  giving  a  veto  to  any  significant  party  that  is 
opposed.  The  majority- voting  rule  produces  more 
ambitious  decisions,  but  it  does  so  at  the  risk  of 
more  defection  in  the  implementation  stage. 

In  most  IGOs,  formal  decision  rules  tend  to 
favor  small  member  states.  This  is  most  evident 
where  unanimity  or  consensus  is  required  and  also 


where  decisions  are  made  by  voting  based  on  the 
principle  of  one  state,  one  vote  (e.g.,  the  UN 
General  Assembly).  Even  where  votes  are  weighted, 
the  small  will  most  often  be  overrepresented.  In  the 
EU  Council,  for  example,  votes  are  distributed  so 
that  1  citizen  of  Luxembourg  balances  about 
24  citizens  of  Germany.  Now,  Germany’s  voting 
power  still  trumps  that  of  Luxembourg  by  a  wide 
margin  (29  to  4),  and  Germany  leads  by  an  even 
larger  margin  when  it  comes  to  power  in  basic 
games  such  as  trade  and  investment.  Moreover,  if 
we  look  at  the  total  population  of  IGOs,  we  will 
find  that  rich  countries  participate  more  than 
poor,  large  more  than  small,  democratic  states 
more  than  authoritarian,  and  old  states  more  than 
young.  In  this  respect,  the  ICO  system  reflects  the 
social  stratification  of  world  society.  Although 
originally  made  in  a  different  context,  Stein 
Rokkan’s  well-known  observation  that  “votes 
count  but  resources  decide”  carries  an  important 
lesson  about  ICO  politics  as  well. 

The  decision  rule  also  influences  the  relation¬ 
ship  between  the  10  and  major  powers.  Unanimity 
is  likely  to  produce  compliance  at  a  low  cost,  since 
the  decisions  will  be  self-enforcing  compared  with 
a  situation  where  major  powers  would  have  to 
enforce  the  decision  made  in  the  10.  Although  the 
formal  decision  rule  is  unanimity,  the  actual  deci¬ 
sion  making  in  several  IOs  is  of  course  influenced 
by  the  distribution  of  power  among  the  members 
outside  the  IO.  For  instance,  in  the  case  of  the 
Organization  of  the  Petroleum  Exporting  Countries 
(OPEC),  large  producers  such  as  Saudi  Arabia  can 
easily  undermine  most  production  decisions  of  the 
organization  by  unilateral  market  behavior. 
Subsequently,  Saudi  Arabia  enjoys  a  stronger 
influence  on  OPEC  decision  than  what  follows 
from  the  one  country  one  vote  rule. 

IOs  are  essential  for  creating  rules  and  regula¬ 
tions  for  states’  behavior  in  the  international  sys¬ 
tem.  Sovereign  states  would  be  very  reluctant  to 
submit  themselves  to  behavior  rules  if  nobody  else 
made  similar  commitments.  When  treaties  or  con¬ 
ventions  are  negotiated,  they  are  normally  subject 
to  subsequent  ratification  by  member  states. 
When  a  certain  number  of  states  have  ratified  the 
treaty,  it  enters  into  force.  This  system  ensures 
that  the  individual  state  that  signs  the  treaty  is 
only  committed  when  a  number  of  others  join. 
IOs  provide  an  efficient  way  to  promote  such 
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conditional  cooperation  among  states  in  an  anar¬ 
chical  system. 

The  third  aspect  is  the  role  of  IOs  in  establishing 
a  common  perception  of  problems  and  solutions. 
Decision  making  through  negotiations  among  par¬ 
ties  is  heavily  influenced  by  the  way  in  which  the 
actors  think  about  their  problem.  The  variation 
among  parties  in  their  perception  of  the  situation, 
beliefs  about  their  own  and  others’  interests,  and 
cognition  of  the  world  in  general  makes  it  harder 
to  reach  common  solutions.  In  the  short  run,  self- 
interested  actors  are  influenced  by  constraints 
imposed  on  policy  choices  by  agreed-on  rules.  In 
the  long  run,  conceptions  of  self-interest  may  be 
reshaped  as  a  result,  in  part,  of  practices  engaged 
in  over  a  period  of  time.  An  10  can  influence  the 
actors’  perceptions  of  the  situation  surrounding 
their  decisions;  in  other  words,  the  organization 
can  frame  member  states’  decisions.  International 
institutions  can  also  play  a  role  in  the  formation  of 
interests  and  collective  identity  among  members. 
An  important  premise  for  this  aspect  is  the  assump¬ 
tion  that  states’  interests  are  not  exogenously  given 
but  are  endogenous  to  interaction  among  states. 

Increased  cooperative  behavior  might  come 
about  as  a  result  of  several  different  processes  of 
identity  formation  performed  by  the  IOs.  First,  there 
can  be  various  intersubjective  systemic  structures 
consisting  of  shared  understandings,  expectations, 
and  social  knowledge  embedded  in  international 
institutions,  in  terms  of  which  states  define  their 
identities  and  interests.  Second,  there  are  systemic 
processes  in  the  external  context  of  state  action  that 
lead  to  increased  interdependence  and  also  imply 
both  an  increase  in  the  “dynamic  density”  of  inter¬ 
actions  and  the  emergence  of  a  “common  Other.” 
This  reduces  states’  ability  to  act  unilaterally  and 
increases  the  extent  to  which  actors  share  a  common 
fate  and,  thus,  their  willingness  to  form  interna¬ 
tional  institutions.  Third,  strategic  behavioral  prac¬ 
tice  has  two  effects  on  identities  and  interests: 
Through  interaction,  actors  form  identities  by  learn¬ 
ing,  which  makes  them  see  themselves  as  others  do, 
and  by  engaging  in  cooperative  behavior,  an  actor 
gradually  changes  its  own  beliefs  about  what  it  is, 
thus  helping  internalize  the  new  identity. 

The  fourth  aspect  is  the  role  of  IOs  in  perform¬ 
ing  politics  on  behalf  of  its  members.  Volker 
Rittberger  and  Berhard  Zangl  identify  five  types  of 
10  operations:  First,  norms  and  rules  usually 


require  further  specification  and  codification  to  be 
implemented  in  practice.  This  is  often  done  by  the 
member  states  that  have  the  potential  for  various 
interpretations  of  the  practical  consequences  of  the 
rules  and  regulations.  In  several  cases,  the  IO  takes 
responsibility  for  specifying  the  rules  agreed  to  by 
the  members.  In  the  EU,  this  is  a  task  often  handled 
by  the  EU  Commission,  and  in  the  World  Bank, 
detailed  plans  for  conditionality  of  various  funds 
are  left  in  the  hands  of  the  World  Bank  bureau¬ 
cracy.  Second,  in  a  number  of  areas,  the  implemen¬ 
tation  of  10  decisions  is  simply  carried  out  by  the 
10  itself.  Several  special  agencies  of  the  UN  are 
entirely  devoted  to  this  kind  of  operative  tasks,  par¬ 
ticularly  in  the  field  of  economics  and  sociology, 
such  as  humanitarian  relief  and  development  aid, 
but  also  in  security  affairs,  when  the  UN  deploys 
peacekeeping  forces.  Third,  when  members  them¬ 
selves  implement  decisions,  the  need  for  independent 
monitoring  arises,  particularly  to  verify  accusations 
of  members’  defection.  Only  with  a  reliable  system 
of  monitoring  the  other  parties  will  the  individual 
members  implement  their  own  commitments.  This 
also  has  a  positive  feedback  on  their  willingness  to 
make  credible  commitments  in  the  first  place.  In  case 
of  disagreement  among  members  regarding  their 
compliance,  the  10  can  also  perform  a  role  in  adju¬ 
dication.  Fourth,  if  monitoring  and  adjudication  do 
not  make  members  abide  by  the  rules,  sanctions  will 
have  to  be  imposed.  These  can  take  the  form  of  ver¬ 
bal  or  written  condemnation,  economic  sanctions, 
or  the  use  of  force.  In  many  cases,  it  is  the  member 
states  that  have  to  exercise  the  sanctions,  but  in 
some  cases,  the  IOs  themselves  can  perform  some  of 
the  sanctioning  acts. 

Measuring  and  Explaining  Effects 

The  importance  of  IGOs  has  long  been  a  matter  of 
some  controversy  in  the  study  of  international 
relations.  Realists  have  consistently  argued  that 
IGOs  reflect  rather  than  shape  configurations  of 
state  interests  and  power  and  therefore  merit 
attention  primarily  as  tools  rather  than  actors.  The 
fortune  of  an  organization — and  ultimately  its  sur¬ 
vival — depends  on  the  extent  to  which  powerful 
members  see  it  as  serving  their  interests.  A  some¬ 
what  similar  argument  has  been  made  by  scholars 
inspired  by  Marxist  notions  of  class  conflict. 
Others,  adopting  what  is  often  referred  to  as  a 
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sociological  approach,  have  argued  that  IOs  and 
regimes  can  best  be  understood  as  embedded  in,  and 
in  important  respects  also  constituted  by,  normative 
“deep  structures”  of  international  society.  A  com¬ 
mon  denominator  for  all  these  lines  of  reasoning  is 
the  proposition  that  IOs  grow  out  of,  and  therefore 
also  reflect,  some  more  fundamental  and  important 
structure  of  the  international  political  system  or 
international  society.  In  the  aggregate,  and  over  the 
long  haul,  they  may  modify  or  reinforce  the  struc¬ 
tures  in  which  they  are  embedded,  but  overall,  IGOs 
can  best  be  understood  as  “intervening”  variables, 
reflecting  at  least  as  much  as  shaping  the  interna¬ 
tional  system.  Some  organizations  will  be  actively 
involved  in  international  political  processes,  but  few 
will,  by  their  own  weight,  leave  a  distinct  and  sig¬ 
nificant  imprint  on  important  outcomes. 

This  pessimistic  conclusion  has  been  challenged 
by  scholars  working  within  an  institutionalist 
framework.  Grossly  simplified,  the  essence  of  their 
arguments  may  be  summarized  as  follows.  First, 
for  an  IGO  to  serve  as  a  useful  tool  (“agent”),  it 
will  need  some  amount  of  independence  from  its 
“principals.”  Rational  principals  understand  this 
and  will  therefore  grant  that  much  autonomy. 
Second,  even  without  the  approval  of  its  princi¬ 
pals,  an  agent — for  example,  an  international 
secretariat — will  often  succeed  in  obtaining  some 
“slack.”  This  slack  is  likely  to  be  largest  for  secre¬ 
tariats  that  work  for  a  politically  homogeneous 
group  of  members  and  that  have  been  able  to  build 
up  substantial  expertise  or  possess  other  relevant 
resources.  Moreover,  an  agent  is  likely  to  develop 
its  own  interpretation  of  its  mission  and  base  its 
behavior  on  that  interpretation  as  long  as  it  is  not 
explicitly  overruled.  Third,  over  time,  most  organi¬ 
zations  and  regimes  establish  rules  and  develop 
practices  into  which  participants — also  representa¬ 
tives  of  member  countries — will  to  some  degree  be 
enmeshed  and  socialized.  As  these  structures  and 
practices  mature,  they  often  acquire  the  status  of 
default  options  or  standard  operating  procedures. 

Despite  occasional  skirmishes,  a  fairly  broad 
consensus  has  emerged  that  there  is  at  least  some 
merit  in  all  these  propositions.  A  closer  scrutiny 
will  show  that  they,  to  some  extent,  refer  to  differ¬ 
ent  dependent  variables,  different  policy  domains, 
or  different  causal  mechanisms.  The  realist  argu¬ 
ment  focuses  on  individual  IGOs  as  actors  in  “high 
politics”  and  measures  importance  mainly  in  terms 


of  the  extent  to  which  they  influence  state  inter¬ 
ests,  the  distribution  of  power,  or  the  basic  order¬ 
ing  principle  of  the  international  political  system 
(anarchy).  By  this  standard,  the  vast  majority  of 
IGOs  do  indeed  obtain  a  low  score,  particularly 
when  compared  with  the  great  powers  that  occupy 
center  stage  in  (neo)realist  theory.  The  so-called 
functionalist  theory  focuses  on  the  role  of  IGOs  in 
managing  networks  of  international  interdepen¬ 
dence.  Growing  interdependence  is  seen  as  involv¬ 
ing  increasing  sensitivity  and  vulnerability,  which 
generates  new  demands  for  coordination  and 
collaboration — first  within  “low-politics”  domains 
such  as  trade  and  investment,  transport  and  com¬ 
munication,  labor  market  regulations,  and  health 
and  environment.  The  IGOs  are  established  and 
designed  to  supply  coordination  services  and 
facilitate  collaboration.  To  the  extent  that  they 
perform  these  functions  effectively,  organizations 
contribute  to  further  development  of  interdepen¬ 
dence  networks,  which  in  turn  may  generate  new 
demands  for  IGO  services — in  part,  by  changing 
domestic  configurations  of  interests  and  influence. 

Early  functionalist  theory  sent  an  upbeat  mes¬ 
sage  that  this  dynamic  interaction  of  demand  and 
supply  would  eventually  lead  to  the  build-up  of 
supranational  systems  of  governance.  Empirical 
evidence  and  refinement  of  causal  models  soon 
showed  that  there  is  nothing  compelling  about  that 
trajectory;  in  fact,  interactive  feedback  loops  simi¬ 
lar  to  those  that  generate  momentum  can  also  lead 
to  cascading  collapse.  This  does  not  imply  that  the 
basic  mechanisms  highlighted  by  (neo-)functional- 
ist  theory  are  not  at  work;  rather,  the  main  lesson 
seems  to  be  that  we  are  dealing  with  more  complex 
systems,  involving  nonlinear  relationships,  condi¬ 
tional  effects,  and  interplay.  Finally,  the  socio¬ 
logical  approach  calls  attention  to  IGOs  and 
INGOs  as  arenas  for,  and  facilitators  of,  socializa¬ 
tion  and  transnational  diffusion  of  ideas  and 
norms.  Most  studies  examine  the  role  of  particu¬ 
lar  organizations  in  legitimizing  or  spreading  a 
more  or  less  well-defined  set  of  ideas,  but  some 
take  on  the  demanding  challenge  of  determining 
the  aggregate  effect  of  the  universe  of  organiza¬ 
tions  on  complex  macrolevel  variables,  such  as 
world  culture. 

The  best  way  to  make  sense  of  such  diversity  is 
to  sort  answers  and  propositions  by  the  questions 
to  which  they  refer.  With  a  wide  range  of  research 
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interests,  this  exercise  will  generate  a  fairly  com¬ 
plex  matrix.  All  we  can  offer  here  is  a  selective  and 
brief  overview,  focusing  on  some  of  the  effects  that 
have  been  subject  to  extensive  study.  Although  not 
explicitly  used  in  much  of  the  empirical  literature, 
the  distinction  introduced  earlier  between  organi¬ 
zations  as  arenas  and  organizations  as  actors  may 
serve  as  a  crude  organizing  device. 

As  pointed  out  in  the  previous  section,  IOs  can 
serve  as  arenas  for  exchange  of  information  and 
ideas  between  member  states.  Several  studies  indi¬ 
cate  that  organizations  often  enhance  policy  con¬ 
vergence  even  in  the  absence  of  formal  agreement 
by  exposing  members  to  new  information  and  ideas 
that  lead  to  unilateral  adjustments.  Others  find  that 
individuals  who  participate  actively  in  the  work  of 
an  organization  over  an  extended  period  of  time 
tend  to  develop  a  stronger  identification  with  the 
organization  itself  and  its  mission,  norms,  and  val¬ 
ues.  The  strength  of  this  socialization  function 
seems,  though,  to  be  moderate.  A  recent  study 
found  that  even  in  a  relatively  powerful  and  presti¬ 
gious  bureaucracy  (the  European  Commission), 
39%  of  the  staff  members  interviewed  saw  them¬ 
selves  as  “very  attached”  to  their  own  nation,  while 
only  25%  said  the  same  about  Europe.  Moreover, 
socialization  seems  to  leave  the  greatest  impact  on 
staff  members  recruited  to  the  organization  at  an 
early  and  formative  stage  of  their  career  before 
some  other  identity  has  been  firmly  established. 

Research  aimed  at  determining  the  importance 
of  IGOs  as  actors  has  pursued  two  main  strategies. 
One  has  focused  on  the  capacity  to  act.  Capacity 
has  been  defined  in  somewhat  different  terms,  but 
most  definitions  point  to  at  least  two  compo¬ 
nents — autonomy  and  resources.  The  interest  in 
autonomy  is  premised  on  the  assumption  that 
independence  from  member  states  and  other  actors 
is  a  necessary,  though  not  sufficient,  condition  for 
influence.  Autonomy  is  sometimes  measured  in 
terms  of  organizational  properties  such  as  deci¬ 
sion-making  procedures,  the  existence  of  a  supra¬ 
national  bureaucracy,  and  the  existence  of  some 
binding  dispute  settlement  mechanism.  One  study 
combines  these  indicators  in  a  7-point  ordinal 
scale  and  applies  this  index  to  a  sample  of 
30  regional  integration  “arrangements.”  Of  their 
cases,  67%  fall  within  the  lower  range  (0-2)  and 
only  17%  within  the  upper  range  (4-6).  Consistent 
with  functionalist  logic,  economic  interdependence 


has  a  positive  and  significant  effect  on  the  auton¬ 
omy  of  IGOs.  Consistent  with  institutionalist 
arguments,  IGOs  tend  to  become  more  indepen¬ 
dent  as  they  mature  over  time.  Other  studies  mea¬ 
sure  autonomy  in  behavioral  terms,  in  most  cases 
by  studying  the  role(s)  played  by  specific  bodies,  in 
particular  the  secretariat.  The  most  comprehensive 
study  of  international  environmental  regimes 
found  that  secretariats  have  little  or  no  indepen¬ 
dence  in  41  %  of  the  cases  included.  Another  study 
reported  that  secretariats  are  confined  to  functions 
of  office  keeping,  and  information  gathering  and 
dissemination  in  nearly  60%  of  the  cases  included 
in  their  database.  The  main  conclusion  seems  to  be 
that  a  large  majority  of  IGOs  obtain  a  low  or,  at 
best,  moderate  score  on  structural  as  well  as 
behavioral  indicators  of  autonomy. 

A  survey  of  the  resources  of  IGOs  would  show 
a  very  wide  range  of  variance;  the  European  Union 
(EU)  and  the  International  Whaling  Commission 
clearly  play  in  very  different  “leagues.”  And  they 
do  so  by  resource  endowments  as  well  as  constitu¬ 
tional  design. 

The  other  approach  is  more  direct,  trying  to 
determine  the  actual  influence  of  one  or  more 
IGOs  on  particular  processes  or  outcomes.  Much 
of  this  literature  focuses  on  one  or  a  few  cases, 
most  often  one  of  the  large  and  presumably  impor¬ 
tant  organizations  (such  as  the  UN,  the  EU,  North 
Atlantic  Treaty  Organization  [NATO],  or  World 
Trade  Organization  [WTOJ).  A  survey  of  this  lit¬ 
erature  will  first  of  all  confirm  that  IGOs  vary 
substantially  in  terms  of  what  they  achieve  as  well 
as  how  they  operate.  It  will,  however,  also  indicate 
certain  patterns;  for  example,  influence  seems  to 
vary  with  constitutional  provisions,  resource 
endowments,  and  “positive”  interdependence 
among  member  states.  It  also  provides  interesting 
insights  into  the  roles  played  by  specific  bodies  or 
individual  leaders.  For  example,  secretariats  can 
exert  influence  by  performing  quite  different 
functions — ranging  from  contributions  to  the 
development  of  a  base  of  consensual  knowledge  to 
high-profile  political  initiatives.  The  former  do  not 
make  headlines  but  may  over  the  long  haul  be  as 
important.  Even  a  “technical  bureaucracy”  can 
make  a  nontrivial  difference  by  facilitating  negoti¬ 
ations  or  monitoring  performance. 

The  case  study  literature  is  rich  in  interesting 
observations  and  important  insights,  but  most 
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refer  to  one  or  a  few  specific  organizations  or 
issues.  Over  the  past  2  decades,  an  increasing  num¬ 
ber  of  studies  have  used  large  databases  that  pro¬ 
vide  a  more  solid  basis  for  general  conclusions. 
Many  of  these  studies  have  examined  the  role  of 
IGOs  in  managing  conflict  and  fostering  or  main¬ 
taining  peace.  Findings  sometimes  diverge,  but  one 
may  at  least  conclude  that  IGOs  sometimes  con¬ 
tribute  significantly.  Thus,  one  study,  relying  on 
data  covering  the  period  1946  to  2000,  found 
IGOs  to  be  more  successful  than  any  other  type  of 
third  party  in  mediating  negotiated  settlement  of 
military  disputes.  While  this  study  and  several  oth¬ 
ers  concentrate  on  specific  functions  (such  as 
mediation),  others  take  a  broader  view  and  try  to 
determine  the  overall  contribution  of  IGOs  to 
building  or  maintaining  peace.  Much  of  this 
research  is  inspired  by  Immanuel  Kant’s  vision  of 
“eternal  peace,”  which  identified  three  pillars, 
today  often  referred  to  as  democracy,  interdepen¬ 
dence,  and  international  law  and  organization. 
Overall,  the  findings  from  previous  research  sup¬ 
port  the  propositions  that  democracy  and  interde¬ 
pendence  are  positively  associated  with  peace,  but 
results  for  IGOs  vary — some  are,  in  fact,  negative. 
This  has  led  some  scholars  to  suggest  that  only 
certain  types  of  IGOs  enhance  peace — notably, 
those  that  are  composed  largely  of  democratic 
states.  A  study  published  in  2006  finds  robust  sup¬ 
port  for  this  revised  hypothesis.  This  finding  points 
to  a  more  conditional  and  less  independent  role  for 
IGOs  in  fostering  and  maintaining  peace;  the  main 
mechanism  at  work  seems  to  be  one  of  interplay  in 
which  “densely  democratic”  IGOs  serve  to  amplify 
the  positive  impact  of  member  state  democracy. 

Other  studies  indicate  that  there  are  more 
mechanisms  at  work  linking  IGOs  to  democratic 
political  systems.  States  in  transition  to  democracy 
tend  to  join  IGOs  at  higher  rates  than  any  other 
group  of  states.  Moreover,  democratizing  states 
show  a  strong  preference  for  IGOs  in  which  most 
other  members  are  established  democracies.  Third, 
it  seems  that  membership  and  active  participation 
in  such  organizations  tend  to  reduce  the  risk  of 
backsliding.  All  this  suggests  that  IGOs  composed 
largely  of  established  democracies  can  play  a  sig¬ 
nificant  role  in  helping  consolidate  new  and  fragile 
democratic  systems.  And  since  relations  among 
democracies  are  the  most  peaceful,  contributions 
to  consolidating  new  democratic  systems  can  have 


a  positive,  albeit  marginal,  and  indirect  effect  on 
world  peace.  The  EU  is  probably  the  most  well- 
known  example,  but  the  basic  proposition  is  sup¬ 
ported  also  by  large-N  studies,  indicating  that 
there  is  a  more  general  effect. 

The  research  literature  is  less  sanguine  about  the 
effect  on  established  democracies  of  active  involve¬ 
ment  in  the  activities  of  IGOs.  Several  studies  indi¬ 
cate  that  IGOs  sometimes  operate  in  ways  that 
tend  to  enhance  the  influence  of  the  executive 
branch,  privilege  technical  and,  in  some  cases, 
“scientific”  expertise,  and  in  other  ways  remove 
decision-making  processes  from  arenas  where  the 
ordinary  citizen  can  play  an  active  role.  Other 
studies  have  found  that  these  effects  are  at  least  to 
some  extent  offset  by  other  democracy-enhancing 
features,  such  as  constraints  on  the  power  of 
domestic  special  interest  groups  and  protection  of 
individual  rights.  The  latter  effects  seem  to  be  most 
evident  for  countries  that  do  not  (yet)  achieve  top 
scores  on  democracy  scales. 

International  Nongovernmental 
Organizations 

This  entry  has  so  far  focused  on  IGOs.  Much  of 
the  cooperation  that  takes  place  across  borders  is, 
however,  initiated  and  conducted  by  individuals, 
voluntary  associations,  or  other  nongovernmental 
actors.  Civil  society  seems  as  involved  as  govern¬ 
ments  in  cooperative  projects  with  partners  in 
other  countries.  To  develop  and  manage  such 
areas  of  cooperation,  they  often  establish  interna¬ 
tional  nongovernmental  organizations  (INGOs), 
or  transnational  organizations  (TNOs).  The  UN 
Economic  and  Social  Council  (UNECOSOC)  con¬ 
siders  all  organizations  that  are  not  established 
through  formal  agreement  between  or  among  gov¬ 
ernments  as  nongovernmental.  Most  of  the  research 
literature  has  adopted  a  more  narrow  definition 
requiring  that  a  majority  of  the  members  be  pri¬ 
vate  parties.  Today,  INGOs  outnumber  IGOs  by  a 
wide  margin,  and  the  growth  rate  is  higher  in  the 
nongovernmental  sphere.  To  be  sure,  most  INGOs 
have  small  budgets,  limited  staff,  and  little  or  no 
political  influence.  But  others  are  impressive  orga¬ 
nizational  complexes.  The  Catholic  Church  counts 
more  than  a  billion  “members.”  Several  humani¬ 
tarian  organizations  have  billion-euro  budgets  and 
thousands  of  employees.  Taken  together,  the  200 
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largest  transnational  companies  account  for  about 
one  fourth  of  the  world  production  of  commercial 
goods  and  services.  In  multiple  ways,  TNOs  can 
indeed  be  important  actors. 

In  international  relations  research,  INGOs 
have  been  studied  particularly  in  their  perfor¬ 
mance  of  three  different  functions.  One  is  that  of 
managing  activities  that  most  governments  do  not 
consider  public  policy  domains.  For  example, 
while  governments  accept  overall  responsibility 
for  law  and  order,  few,  if  any,  see  this  responsibil¬ 
ity  as  involving  a  duty  to  establish  and  enforce 
internal  rules  for  particular  sports  or  proclaim 
and  interpret  principles  of  religious  faith.  Within 
the  arena  of  the  Organisation  for  Economic 
Co-operation  and  Development  (OECD),  most 
governments  would  see  the  production  and  distri¬ 
bution  of  private  goods  as  activities  to  be  under¬ 
taken  mainly  by  the  private  sector  (albeit  subject 
to  public  regulation).  Another  function  performed 
by  many  INGOs  is  that  of  advocacy,  including 
activities  ranging  from  high-publicity  campaigns 
to  confidential  lobbying.  Third,  some  INGOs 
engage  in  activities  for  which  most  governments 
accept  at  least  some  responsibility.  Most  impor¬ 
tant  is  the  area  of  humanitarian  assistance,  but 
INGOs  are  involved  also  in  what  might  be  called 
private  regulation — most  often  in  the  form  of 
standard  setting,  “enforced”  by  mobilizing  the 
market  power  of  concerned  customers  or  through 
public  naming-and-shaming  campaigns. 

Although  often  studied  as  different  from  IGOs, 
important  propositions  from  research  focusing 
on  IGOs  apply  also  to  INGOs.  Both  merit  atten¬ 
tion  as  arenas  as  well  as  actors.  Patterns  of 
growth  are  strongly  correlated  (about  .80, 
depending  on  the  exact  specification  of  the 
model).  So  are  patterns  of  participation;  in  fact, 
INGOs  tend  to  be  even  more  dominated  by 
wealthy  and  liberal  democracies.  For  both  types 
of  organizations,  autonomy,  resources,  and  lead¬ 
ership  are  important  determinants  of  influence. 
And  although  INGOs  often  make  headlines  by 
criticizing  or  opposing  governments,  relation¬ 
ships  are  often  friendly  and  supportive.  Thus,  for 
many  humanitarian  organizations,  grants  from 
governments  and  IGOs  constitute  important 
sources  of  income.  And  in  many  campaigns, 
INGOs,  governments,  and  IGOs  work  side  by 
side — for  as  well  as  against  a  particular  cause. 


Social  science  research  often  portrays  (I)NGOs 
rooted  in  civil  society  as,  overall,  a  positive  trans¬ 
formative  force.  At  least  three  more  specific 
propositions  to  that  effect  can  be  found  in  the 
research  literature.  One  suggests  that  TNOs  by 
and  large  tend  to  have  more  democratic  internal 
structures  and  procedures  than  their  IGO  counter¬ 
parts,  providing  individual  members  better  oppor¬ 
tunities  for  active  participation  and  real  influence. 
This  observation  is  no  doubt  true  for  a  number  of 
INGOs,  though  not  for  all.  The  aggregate  effect 
will,  however,  depend  also  on  who  exactly  are 
empowered  through  these  organizations.  Even  an 
organization  with  a  perfectly  democratic  internal 
structure  may  speak  for  only  a  small  fragment  of 
civil  society.  And  as  far  as  it  can  be  determined, 
the  social  stratification  of  world  society  is  reflected 
as  clearly  in  the  nongovernmental  sphere  as  within 
the  IGO  system.  A  second  proposition  sees  trans¬ 
national  cooperation  as  a  worldwide  mobilization 
of  concerned  citizens  in  defense  of  universal  values 
(such  as  basic  human  rights)  or  profoundly  impor¬ 
tant  collective  goods  (such  as  nature’s  life  support 
systems).  Again,  it  is  easy  to  identify  a  number  of 
INGOs  that  fit  this  description  well.  Yet  if  one 
looks  at  aggregate  figures,  one  will  also  find  that 
the  largest  group  of  TNOs  consists  of  those  that 
are  established  to  serve  interests  related  to  eco¬ 
nomic  activities.  Finally,  some  scholars,  notably 
John  Boli  and  George  Thomas,  have  suggested 
that  organized  transnational  cooperation  serves, 
in  the  aggregate,  as  a  transformative  force  shap¬ 
ing  a  new  world  culture  characterized  by  more 
inclusive  identities  (“world  citizen”)  and  wide¬ 
spread  acceptance  of  reason  and  voluntarism  as 
fundamental  pillars  of  authority.  This  is  a  bold 
and  interesting  hypothesis,  but  at  this  point,  it 
seems  prudent  to  postpone  judgment  until  more 
comprehensive  and  solid  evidence  becomes 
available. 

Dag  Harald  Claes  and  Arild  Underdal 
University  of  Oslo 
Oslo,  Norway 
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International  Political 
Economy 


The  field  of  international  political  economy  (IPE) 
requires  two  definitions  since  it  is  both  a  subject,  a 
field  of  study,  and  an  object,  a  thing  to  study. 
Defining  IPE  as  a  thing  to  study,  as  an  object,  is 
quite  straightforward.  It  refers  to  the  global  or 
international  distribution  of  power,  wealth,  and 
prestige  among  state  and  nonstate  actors.  IPE  as  a 
field  of  study,  as  a  subject,  in  contrast,  has  at  least 
two  different  histories  and  as  such  requires  a  more 
complex  explication. 


This  entry  traces  the  evolution  of  these  twin 
histories  of  IPE.  The  first  history  pertains  to  the 
so-called  American  school  of  IPE  that  emerged 
during  the  early  to  mid-1970s  in  the  context  of  the 
oil  crises  and  breakdown  of  the  Bretton  Woods 
system.  The  second  history  pertains  to  the  con¬ 
trasting  British  school  of  IPE  that  has  altogether 
different  origins.  Rather  than  forming  one  body  of 
knowledge,  these  two  schools  have  developed  dis¬ 
tinct  research  trajectories,  agendas,  and  method¬ 
ological  orientations.  IPE  as  a  field  of  study  is 
therefore  plural. 

In  brief,  the  American  school  has,  over  time, 
become  an  increasingly  analytically  coherent  and 
methodologically  homogeneous  body  of  scholar¬ 
ship.  It  emerged  during  the  mid-1970s  to  analyze  a 
purported  crisis  of  U.S.  hegemony  and  then  evolved 
into  a  particular  set  of  arguments  concerning  the 
importance  of  institutions,  credibility,  contracts, 
and  commitment  in  IPE.  The  British  school  has,  in 
contrast,  blossomed  into  a  multidisciplinary  exer¬ 
cise  that  is  both  methodologically  plural  and 
decidedly  lacking  in  defined  boundaries.  The 
British  school  began  long  before  the  1970s  and  has 
drawn  on  the  contributions  of  historians,  econo¬ 
mists,  journalists,  and  geographers.  Recently,  both 
schools  have  had  to  accommodate  still  other 
bodies  of  scholarship  with  a  claim  to  be  IPE. 
Contemporary  IPE  is  at  the  very  least,  plural. 

The  Development  of  American  IPE 

The  Cold  War  and  Political  Science 
Foundations  of  American  IPE 

What  makes  the  contrast  between  the  American 
and  British  schools  of  IPE  salient  is  the  relationship 
of  American  IPE  to  its  parent  discipline  of  political 
science  and  the  politics  of  the  Cold  War,  a  rela¬ 
tionship  that  does  not  exist  in  the  UK  or  elsewhere. 
American  political  science  has  only  relatively 
recently  come  to  embrace  political  economy  as  a 
distinct  approach  concerned  with  the  core  ques¬ 
tion  of  distribution.  After  World  War  II,  American 
political  science  was  behavioralist  and  functional¬ 
ist  in  its  orientation.  Its  behavioralism  developed 
as  a  result  of  the  focus  of  political  science  after 
pre-World  War  II  on  the  formal  institutions  of 
society — constitutions  and  the  like.  Since  this  focus 
failed  to  anticipate  the  collapse  of  interwar  democ¬ 
racy  in  Germany  and  the  emergence  of  fascism,  the 
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lesson  learned  was  to  focus  on  actor  behavior,  not 
constitutional  form.  Second,  the  United  States 
emerged  from  World  War  II  as  a  superpower  inti¬ 
mately  concerned  with  the  relationship  between 
knowledge  and  national  security — hence  the  focus 
of  early  postwar  American  political  science  on 
policy-relevant  scientific  knowledge. 

While  behavioralism  dominated  the  study  of 
American  politics,  the  study  of  the  societies  outside 
the  Western  core  demanded  a  different  analysis. 
The  thrust  of  these  “modernization”  arguments 
was  to  extend  the  teleological  logic  of  Talcott 
Parsons’s  functionalism  to  the  developmental  tra¬ 
jectories  of  the  non-Western  world,  which  was 
expected  to  develop  in  a  sequence  like  that  already 
undergone  by  the  West,  especially  the  United 
States. 

American  political  science  in  its  moment  of 
refounding  the  Cold  War  developed  a  fourfold 
division  of  the  field  where  American  politics  ana¬ 
lyzed  the  putative  end  of  history  for  all  developing 
states — the  American  present;  comparative  politics 
analyzed  current  real-world  deviations  from  that 
trend  and  thus  how  far  other  countries  had  to 
travel  to  get  to  that  end;  international  relations 
situated  these  processes  in  the  bipolar  structure  of 
the  Cold  War;  and  political  theory,  the  fourth  sub¬ 
field,  kept  the  Marxists  out  and  the  liberals  in. 

As  a  consequence,  in  the  early  years  of  the  Cold 
War,  the  structure  of  political  science  in  America 
left  very  little  room  for  political  economy,  interna¬ 
tional  or  not.  In  part,  because  it  smacked  of 
Marxism,  but  also  because  the  move  toward  ever 
greater  formalism  in  its  sister  discipline  of  econom¬ 
ics  in  this  period  that  depoliticized  the  subject  in 
another  way,  IPE  as  “a  way  of  thinking”  had  to 
wait  for  events  to  make  room  for  (international) 
political  economy  to  come  back  to  political  science. 

That  moment  arrived  in  the  1970s  when  the 
developed  world  underwent  its  first  major  reces¬ 
sion  while  the  experience  of  the  United  States  in 
Vietnam  tempered  belief  in  its  ability  to  command 
the  world.  Stagflation,  oil  shocks,  unemployment, 
and  low  growth,  coming  on  the  heels  of  civil  right 
rebellions  at  home  and  revolutions  abroad,  shook 
the  self-confidence  of  American  political  science.  In 
this  moment  of  crisis,  a  few  American  political 
scientists  began  to  argue  that  the  separation  of 
politics  and  markets,  especially  the  sidelining  of 
questions  of  distribution  in  international  relations 


and  economics,  was  perhaps  part  of  the  problem. 
It  was  in  this  context  that  the  American  school  of 
IPE  emerged.  Where  economics  retreated,  political 
scientists  began  to  ask  a  series  of  questions  that 
would  define  what  IPE  (as  a  subject)  was  and  what 
it  should  investigate  (as  an  object)  for  the  next 
decade  and  a  half;  a  distinct  IPE  then  began  to  take 
shape. 

Orienting  Issues  and  Questions  of  American  IPE 

The  first  concern  of  these  theorists  was  the 
notion  that  the  United  States  was  in  decline.  With 
the  U.S.  growth  rate  slowing,  its  industrial  base 
shrinking,  and  its  deficits  rising,  Europe  and  Japan 
seemed  to  be  in  the  ascendance,  while  the  United 
States  was  falling  by  the  wayside.  Yet  if  the  United 
States  was  in  decline,  how  could  global  order  be 
provided  in  an  environment  of  multiple  players,  all 
with  different  interests?  Moreover,  if  the  United 
States  no  longer  set  the  terms  for  the  other  players 
in  the  system,  which  states  would  prosper  and 
which  would  fail  in  this  new  world? 

Taking  up  these  concerns  were  scholars  such  as 
Robert  Keohane  and  Joseph  Nye,  whose  idea  of 
“complex  interdependence”  challenged  traditional 
realist  notions  of  state  interests  and  state  action. 
Keohane  and  Nye  argued  that  as  the  problems 
faced  by  states  became  more  complex  and  less  ame¬ 
nable  to  unilateral  solutions,  so  the  focus  of  schol¬ 
ars  had  to  be  shifted  outward  from  the  traditional 
“high  politics”  of  international  relations  (qua  secu¬ 
rity  politics)  to  the  political  economy  politics  of,  for 
example,  trade  protectionism  and  economic  devel¬ 
opment.  Such  a  stance  argued  for  attention  to  the 
complexity  of  ties  that  bind  states  and  a  focus  on 
how  markets  are  important  predicates  of,  as  well  as 
arenas  for,  state  action.  Such  a  stance  was  not 
without  its  realist  critics  to  be  sure,  but  this  inter¬ 
vention  made  possible  a  broader  opening  of  the 
agenda  of  American  political  science  to  questions 
of  economic  governance  and  distribution;  it  made 
room  for  IPE  in  the  United  States. 

When  complex  interdependence  was  juxtaposed 
with  the  question  of  the  decline  of  the  United 
States,  the  result  was  a  set  of  theories  called  hege¬ 
monic  stability  theory  (HST).  What  animated  this 
scholarship  was,  in  large  part,  a  particular  reading 
of  the  monetary  history  of  the  20th  century 
inspired  by  the  work  of  Charles  Kindleberger. 
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Kindleberger’s  key  idea  was  that  a  single  stabilizer 
was  needed  to  stabilize  the  world  economy. 
Drawing  this  conclusion  from  a  study  of  the  mon¬ 
etary  relations  of  the  interwar  period,  he  devel¬ 
oped  a  theory  of  hegemonic  transition  that  spoke 
directly  to  U.S.  declinist  fears. 

From  Hegemonic  Stability  to  Regimes 

The  basic  story  of  HST  was  that  following 
World  War  I,  the  United  Kingdom  (UK) — once  the 
world’s  biggest  market,  banker,  and  power — 
emerged  severely  weakened.  While  willing  to 
assume  the  burdens  of  leadership — that  is,  act  as  a 
lender  of  last  resort,  provide  a  market  for  dis¬ 
tressed  goods,  and  lend  countercyclically  to  pro¬ 
mote  stability — the  UK  was  simply  unable  to  do  so 
given  its  diminished  capabilities.  The  United  States 
on  the  other  hand  was  able  to  do  the  same  by  the 
1920s,  but  it  was,  because  of  its  domestic  politics, 
unwilling  to  do  so.  Lacking  such  leadership,  trade 
collapsed,  nationalism  followed,  and  the  turmoil 
of  the  1930s  and  1940s  ensued.  Could  the  same 
thing  happen  again  now  that  the  United  States  was 
in  decline — especially  when,  unlike  the  last  time, 
there  was  no  obvious  candidate  to  replace  it? 

The  theoretical  lesson  drawn  from  Kindleberger’s 
analysis  rested  on  a  classic  public  goods  undersup¬ 
ply  argument:  a  single,  large  hegemonic  state  being 
the  only  type  of  state  able  and  willing  to  take  on 
the  cost  of  providing  the  public  good  of  financial 
order  because  the  majority  of  benefits  flowed  to 
itself.  Smaller  states  may  benefit  from  such  order, 
but  they  have  an  incentive  to  free  ride  in  terms  of 
its  provision.  As  such,  “no  hegemon,  no  order,  big 
problem”  became  the  lesson  American  IPE  drew 
from  Kindleberger,  but  it  could  have  been  worse. 
Taking  their  cue  from  the  same  scholarship,  but 
framing  it  with  a  realist  analysis,  scholars  such  as 
Stephen  Krasner  and  Robert  Gilpin  paid  attention 
to  the  costs  and  benefits  of  being  the  hegemon  and 
suggested  that  when  costs  rise  and  benefits  fall  in 
the  hegemonic  state,  with  more  nimble  “challenger 
states”  rising  more  quickly  and  seeking  to  change 
the  distribution  of  benefits  in  the  system,  the  result 
was  usually  a  hegemonic  war.  The  future  then, 
according  to  this  emergent  school  of  American  IPE, 
was  bleak  if  the  United  States  really  was  in  decline. 

So  how  does  one  save  the  system  from  itself  in 
the  absence  of  a  hegemon?  American  IPE  scholars, 


seeking  this  answer,  began  to  investigate  how  sys¬ 
tem  governance  could  be  provided  in  the  absence 
of  a  hegemon.  One  solution  quickly  became  domi¬ 
nant:  the  provision  of  global  public  goods  either  by 
a  limited  number  of  key  actors  or  through  the 
“demand”  for  “regimes”  that  could  act  as  func¬ 
tional  fillers  for  systemic  governance.  This  body  of 
work,  which  became  known  as  regime  theory,  and 
later  as  the  turn  to  the  study  of  international  insti¬ 
tutions,  was  a  direct  consequence  of  the  attempt  to 
puzzle  a  way  through  the  predicaments  of  HST. 

From  Regimes  to  Institutions  and  Compliance 

For  much  of  IPE,  given  the  purported  centrality 
of  a  hegemon  for  stability,  cooperation  in  the  face 
of  free  riding  became  a  core  problem  to  be 
explained.  Regime  theory  seemingly  offered  a  way 
out.  The  basic  idea  behind  regime  theory  was  that 
where  unilateral  state  action  fails,  coordinated 
action  through  issue-specific  regimes — principles, 
norms,  rules,  and  decision-making  procedures  to 
promote  the  convergence  of  expectations — might 
work.  By  giving  a  focal  point  to  such  expectations 
and  by  reducing  uncertainty  and  transaction  costs, 
cooperation  between  states  (in  terms  of  mutual 
policy  adjustment)  may  become  easier  by  acting 
through  regimes.  As  such,  regimes  were  seen  as 
issue-specific  informational  networks  that  inter¬ 
vened  between  interests  and  outcomes,  providing 
stability  in  the  face  of  hegemonic  decline. 

Where  regimes  came  from  was,  however, 
another  issue.  The  very  public  goods  undersupply 
argument  that  drove  HST  also  haunted  regime 
theory.  One  could  appeal  to  state  interests  to  sup¬ 
ply  regimes  to  be  sure,  but  by  the  logic  of  HST, 
such  supply  should  decline  over  time  as  hegemonic 
state  capabilities  waned.  Moreover,  why  should 
states  comply  with  regime  dictates? 

One  set  of  theories  that  “solved”  this  puzzle  by 
virtue  of  the  rationalist  logic  it  employed  was  the 
literature  that  developed  out  of  regime  theory  in  the 
early  1990s  and  that  went  on  to  become  the  main¬ 
stay  of  contemporary  American  IPE  research:  the 
literature  on  international  institutions.  Following  a 
game-theoretic  logic,  these  theorists  argued  that 
self-interested  states  would  comply  with  regimes  to 
the  extent  that  doing  so  lengthened  the  “shadow  of 
the  future”  for  cooperation,  thus  lessening  the  pres¬ 
ent  discounted  pay-off  to  free  riding  in  the  present. 
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As  such,  cooperation  via  institutions  becomes  a 
Nash  equilibrium  (self-reinforcing)  strategy. 

Building  on  this  insight,  American  scholars 
working  in  this  tradition  have  highlighted  how 
uncertainty  about  the  future  forms  a  barrier  to 
cooperation.  Consequently,  what  international 
institutions  (G20  meetings,  the  Basel  Agreements, 
etc.)  do  is  reduce  the  opacity  of  the  system  through 
the  provision  of  information,  which  lowers  uncer¬ 
tainty  and  increases  the  possibility  of  cooperation 
and  the  density  of  ties,  thus  making  side  payments 
and  monitoring  possible,  thus  furthering  coopera¬ 
tion  still  more.  Rules  thereby  act  not  as  limits  to 
state  action  that  states  would  resist  but  as  resources 
that  allow  states  to  credibly  commit  to  each  other, 
thus  lengthening  the  shadow  of  the  future  further. 

This  move  beyond  the  impasse  of  regime  theory 
and  HST  defines  the  main  thrust  of  American  IPE 
scholarship  today.  A  focus  on  rules  (legalization) 
and  how  they  are  related  to  bargaining  positions, 
enforcement  costs,  and  credibility  have  become  the 
touchstones  of  contemporary  analysis  across  a 
huge  number  of  issue  areas.  From  questions  of 
institutional  design  to  a  generalized  use  of  open- 
economy  macroeconomics  as  the  basic  framework 
for  contemporary  work  harnessed  to  microeco¬ 
nomic  logics  and  statistical  techniques,  American 
IPE  has  narrowed  in  its  methodological  orienta¬ 
tion,  developing  a  preference  for  analytic  rigor  and 
theoretical  coherence  over  traditional  “big  ques¬ 
tions”  such  as  America’s  decline  or,  contempora¬ 
neously,  the  rise  of  China.  It  remains,  however,  at 
least  in  the  opinion  of  its  protagonists,  the  very 
definition  of  what  IPE  is.  Regardless  of  what  other 
practitioners  in  other  places  happen  to  think  they 
are  doing,  the  rationalist  study  of  institutions,  bar¬ 
gaining,  and  commitment  is  IPE,  “American  style.” 

The  Other  American  School(s) 

Yet  not  all  American  IPE  scholars  fit  so  easily 
into  this  particular  representation  of  IPE.  Realist 
analyses  have  hardly  given  up  the  field.  Work  by 
David  Andrews  and  Jonathan  Kirshner  on  the  rela¬ 
tionship  between  power  and  money  in  IPE  stresses 
realism’s  continuing  relevance.  Similarly,  as  Rawi 
Abdelal,  Mark  Blyth,  and  Craig  Parsons  contend, 
there  is  also  a  rich  vein  of  constructivist  scholar¬ 
ship  within  American  IPE  that  deals  with  issues 
that  are  seemingly  “off  the  radar”  of  mainstream 


rationalist  work.  For  example,  how  nationalist 
and  neoliberal  ideologies  affect  state  behavior, 
choice  of  currency  regimes,  and  even  the  behavior 
of  international  organizations  has  become  grist  to 
this  constructivist  mill. 

Most  important  in  generating  this  other  strand 
of  American  IPE  has  been  the  existence  of  the 
Cornell  Series  in  Political  Economy  as  a  publica¬ 
tion  outlet  and  the  pioneering  work  of  Peter 
Katzenstein,  the  editor  of  this  series.  If  scholars 
such  as  Keohane  and  Krasner  were  instrumental  in 
giving  voice  to  the  mainstream  of  American  IPE, 
Katzenstein’s  contribution  was  critical  in  opening 
up  space  for  this  “other”  American  scholarship. 

Katzenstein’s  work  has  always  evidenced  a 
strong  concern  with  the  interaction  of  domestic 
structures  and  IPE.  However,  rather  than  reducing 
these  factors  to  a  study  of  legislatures,  coalitions, 
and  bargaining,  as  do  most  mainstream  American 
IPE  scholars,  Katzenstein  developed  an  under¬ 
standing  of  the  role  of  states  in  IPE  that  drew  on 
more  historical  institutionalist  understandings  of 
politics.  This  different  approach  to  what  IPE  is,  in 
combination  with  the  focus  of  the  Cornell  Series 
on  cognitive  and  ideational  factors  as  explanatory 
variables,  opened  the  door  for  American  scholars 
who  do  not  sit  so  easily  in  the  mainstream  of  U.S. 
IPE  research.  In  sum,  while  it  is  often  lambasted  as 
narrow  and  insular,  American  IPE  contains  within 
it  more  diversity  than  its  protagonists  often  admit. 

The  Development  of  British  IPE 
Origins  of  British  IPE 

Britain  may  have  also  fought  the  Cold  War,  but 
the  Cold  War  did  not  define  its  scholarship  as  it  did 
in  the  United  States.  Moreover,  skepticism  regard¬ 
ing  the  scientific  pretentions  of  the  study  of  politics 
runs  deep  in  the  UK — hence,  in  part,  its  innate 
methodological  pluralism  and  analytic  eclecticism. 
But  what  is  of  critical  significance  here  in  under¬ 
standing  British  IPE’s  different  orientations  are  two 
factors:  (1)  the  failure  of  British  political  science  to 
replicate  the  “four-field”  structure  of  American 
political  science  and  (2)  the  very  different  history  of 
economic  and  political  decline  in  the  UK. 

First  of  all,  according  to  Ben  Clift  and  Ben 
Rosamond,  while  the  events  of  the  1970s  may  have 
provided  the  intellectual  opening  for  American  IPE, 
this  moment  does  not  mark  the  critical  opening  for 
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IPE  scholars  in  the  UK,  since  British  IPE  has  a  far 
longer  lineage  than  can  be  ascribed  to  the  economic 
dislocations  of  the  early  1970s.  Second,  there  never 
has  been  a  “parent  discipline”  for  British  IPE;  it  has 
always  been  a  multidisciplinary  exercise.  British 
political  science  neither  saw  itself  as  the  end  of  the 
developmental  telos  nor  sought  to  “keep  the 
Marxists  out.”  Nor  was  there  any  necessary  con¬ 
nection  to  international  relations  theory  in  political 
science. 

These  differences  are  themselves  explained  by 
the  four  “drivers”  of  the  British  school  of  IPE. 
First,  there  is  a  long  tradition  of  scholarship  that 
deals  with  the  political  economy  of  British  imperi¬ 
alism,  which  has  given  much  of  British  social  sci¬ 
ence  a  particular  historicist  cast.  Second,  there  is 
the  postwar  literature  on  decolonization  and  the 
global  role  of  sterling  in  that  period,  which  is  IPE 
in  all  but  overt  self-identification.  Third,  there  is 
the  enormous  literature  on  the  UK’s  own  economic 
decline,  especially  the  international  determinants 
thereof,  which  makes  the  boundary  cast  in  the 
United  States  between  IPE  and  comparative  politi¬ 
cal  economy  much  more  permeable  in  the  UK. 
Fourth,  there  has  been  a  continuing  concern  with 
the  centrality  of  the  state  in  both  domestic  and 
international  economic  relations. 

As  a  result  of  these  very  different  lineages  of 
scholarship,  for  Rosamond  and  Clift,  the  main 
question  has  less  to  do  with  problems  such  as  the 
domestic  preference  formation  of  states  rather 
than  the  degree  to  which  the  historical  resolution 
of  social  and  economic  struggles  have  become 
inscribed  on  the  state  and  predispose  it  to  certain 
types  of  action.  This  view  of  British  IPE  gives  us  an 
understanding  of  not  just  how  but  why  British  IPE 
differs  from  its  American  namesake. 

In  short,  lacking  the  concerns  that  animated 
American  scholarship  and  grappling  with  its  own 
particulars,  British  IPE  has  become  more  histori¬ 
cally  focused  and  more  open  to  a  variety  of  per¬ 
spectives  than  its  American  counterpart.  It  has  as 
its  point  of  departure  a  basic  skepticism  of  the 
worldviews  that  underlie  American  IPE  and  the 
models  of  cognition  and  causation  that  they 
enshrine.  It  has  never  been  a  subfield  of  political 
science,  nor  is  it  particularly  beholden  to  political 
science  as  a  field.  IPE  in  the  UK  exists  in  depart¬ 
ments  of  politics,  sociology,  geography,  and 
anthropology.  If  American  IPE  is  a  subfield  of 


political  science,  British  IPE  can  (almost)  lay  claim 
to  being  a  field  of  the  social  sciences  in  its  own 
right,  or  at  least  a  reinvention  thereof  focused  on 
questions  of  global  distribution  and  power. 

The  Other  Transatlantic  Bridge 

As  well  as  different  material  drivers,  British  IPE 
has  a  different  set  of  intellectual  progenitors  that 
go  beyond  the  confines  of  Britain  itself  but  do  not 
include  the  United  States.  As  Randell  Germain  has 
argued,  the  legacy  of  the  Canadian  scholar  Robert 
Cox  looms  large  in  this  regard.  The  difference  that 
Cox  made  was  that  due  to  his  influence,  while  his¬ 
torical  materialist  approaches  were  marginalized  in 
the  United  States,  in  Canada,  such  “critical”  per¬ 
spectives  were  constitutive  of  a  very  different  type 
of  mainstream  of  research.  As  such,  there  was  a 
transatlantic  bridge  for  IPE,  but  it  ran  from 
Canada  to  the  UK  and  bypassed  the  United  States. 

In  the  UK  itself,  Benjamin  Cohen  highlights  the 
influence  of  Susan  Strange  as  the  main  protagonist 
for  the  development  of  a  British  version  of  IPE. 
Rather  than  simply  focus  on  the  power  structure, 
as  American  realists  are  wont  to  do,  or  focus  on 
the  informational  environment,  as  American  neo¬ 
liberals  are  fond  of  doing,  Strange  developed  a 
more  nuanced  conception  of  power  that  operated 
through  global  knowledge  and  production  as  well 
as  military  and  financial  structures.  This  view  of 
power  was  relational  as  well  as  material,  and  it 
paid  close  attention  to  the  ideological.  Strange  also 
stressed  the  importance  of  finance  for  understand¬ 
ing  IPE  and  competition  between  different  global 
authorities,  including  private  authorities,  as  key 
aspects  of  global  governance.  She  took  seriously 
the  notion  of  IPE  as  an  “open  range”  where  a 
plethora  of  approaches  should  coexist,  and  in 
doing  so,  she  anticipated  much  of  what  British  IPE 
scholars  were  to  produce  over  the  next  2  decades. 

Taken  together  then,  Cox,  Strange,  and  the  very 
different  political  and  institutional  legacies  of  British 
development  and  decline  gave  rise  to  a  distinct  and 
different  British  school  of  IPE.  This  school  was  not 
simply  a  reaction  to,  or  a  rejection  of,  American 
IPE.  Rather,  it  was  a  school  of  thought  that  had  set 
up  its  own  stall  and  carried  on  as  if  the  Americans 
didn’t  really  exist.  The  result  has  been  a  research 
community  animated  by  an  entirely  different  set  of 
concerns.  This  is  demonstrated  by  comparing  the 
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publication  themes  of  the  two  schools  in  their 
respective  American  and  British  journals. 

Comparing  American  and  British  IPE 

Taking  the  U.S. -based  journal  International 
Studies  Quarterly  (ISQ)  as  representative  of  Amer¬ 
ican  IPE  beyond  the  rationalist  mainstream  and  the 
British  journal  New  Political  Economy  (NPE)  as 
representative  of  British  IPE  yields  the  following 
comparisons.  Of  the  372  articles  published  in  ISQ 
over  the  period  1997  to  2008,  31%  were  identifi¬ 
able  as  “liberal  approaches,”  17%  were  “realist,” 
9%  were  “mixed,”  and  the  rest  were  scattered 
across  the  categories  of  constructivism,  feminism, 
poststructuralism,  and  neo-Marxism,  which  hardly 
suggests  a  disciplinary  monotheism.  Since  American 
IPE  exists  as  a  part  of  political  science,  it  is  not  really 
surprising  when  the  biggest  single  category  that 
emerges  from  the  ISQ  sample  is  “questions  directly 
related  to  existing  debates  in  IR  theory,”  while  the 
second  is  “security  studies,”  Cohen’s  (2008)  twin 
cores  of  U.S.  international  relations  (IR)  theory. 

Repeating  this  exercise  for  NPE  gives  quite  dif¬ 
ferent  results.  Few,  if  any,  of  the  contributions  to 
NPE  speak  of  the  core  issues  of  IR  theory.  Instead, 
they  cluster  around  three  topic  areas:  ( 1 )  the  evolu¬ 
tion  of  advanced  capitalist  states  (17%),  (2)  devel¬ 
opment  (17%),  and  (3)  globalization  (14%).  The 
remainder  are  similarly  scattered  among  various 
topic  areas.  Many  of  NPE’ s  articles  would  sit  well 
in  journals  such  as  Comparative  Studies  in 
International  Development.  Given  this,  what  do 
British  scholars  publish  and  research  under  these 
rubrics  that  is  so  different  from  the  work  of 
American  scholars?  Several  areas  stand  out  in  con¬ 
temporary  British  IPE  that  are  largely  absent  in 
American  IPE:  a  concern  with  ontology  and  philo¬ 
sophical  foundations,  a  focus  on  normative  con¬ 
cerns  and  ethics,  critiques  of  finance  capitalism  and 
globalization,  and  historical  materialist,  Gramscian 
poststructuralist,  feminist,  and  “everyday”  IPE. 

The  “Critical”  Concerns  of  British  IPE 

Angus  Cameron  and  Ronen  Palan  put  it  well 
when  they  argue  that  the  differences  between  Amer¬ 
ican  and  British  IPE  can  be  summed  up  in  the  quip 
that  in  the  American  school  “data  and  observation 
are  so  unproblematic  we  can  accept  them  as  real,” 


while  in  the  British  school  “data  and  observation 
are  so  problematic  that  we  can  dispense  with  them 
altogether”  (2009,  p.  123).  However,  for  the  British 
school,  data  derive  from  methods,  and  prior  to 
methodological  choices  lie  ontological,  epistemo¬ 
logical,  and  general  philosophical  concerns. 

Lacking  the  scientism  of  their  American  peers  and 
embracing  Cox’s  injunction  that  the  point  of  theory 
is  critique,  British  IPE  views  the  role  of  IPE  as  taking 
the  measure  of  the  world  rather  than  simply  measur¬ 
ing  it:  to  judge  and  to  evaluate,  to  be  a  “critical” 
body  of  theory.  Doing  so  necessarily  breaks  the  dis¬ 
tinction  between  positive  and  normative  theory  that 
American  IPE  relies  on  and  instead  opens  up  British 
IPE  to  questions  of  legitimacy  and  ethics. 

For  example,  although  American  IPE  scholars 
have  certainly  been  active  in  the  area  of  under¬ 
standing  international  finance,  British  scholars 
have  been  far  more  critical  in  their  examination  of 
international  finance.  From  Strange’s  Mad  Money 
(1998)  to  Paul  Langley’s  concern  with  the  construc¬ 
tion  of  the  “ethical  investor,”  British  writing  on 
finance  has  developed  a  far  more  critical  view  of 
finance  in  IPE. 

Pushing  this  critical  edge  still  further  are  the 
various  Marxist  scholars  who  form  an  important 
part  of  British  IPE — a  species  almost  extinct  in  the 
United  States.  Building  on  the  legacy  of  Robert 
Cox,  a  new  generation  of  British  historical  materi¬ 
alists  has  sought  to  engage  IPE  from  their  own 
particular  intellectual  angle.  In  the  hands  of  these 
scholars,  the  notion  of  hegemony  radically  departs 
from  its  American  meaning  of  simple  “dominance” 
by  a  state  toward  its  original  Gramscian  meaning 
of  “intellectual  leadership  and  subordination”  by 
transnational  class  forces  and  historic  blocs. 

Finally,  there  are  scholars  within  British  IPE 
who  try  to  transcend  the  very  notion  of  IPE  itself. 
Feminist  scholars  form  one  such  community  within 
British  IPE.  Their  focus  on  social  reproduction — 
that  is,  how  the  division  of  labor  in  capitalism  rests 
on  a  prior  division  of  labor  of  the  household  and 
how  such  gendered  structures  are  replicated 
through  the  practices  of  IPE — forms  the  core  of 
this  enterprise. 

In  a  similar  manner,  the  so-called  everyday  IPE 
scholars  contest  the  notion  of  a  state-centric  IPE. 
For  most  IPE  scholars,  the  answer  to  the  question  of 
“who  governs?”  in  IPE  is  usually  “some  sort  of  set 
of  elite  actors.”  In  contrast,  for  “everyday”  scholars 
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such  as  Leonard  Seabrooke,  non-elite  actors,  the 
masses,  not  the  elites,  their  patterns  of  consumption 
and  production,  and  crucially  their  ability  to  confer 
legitimacy  on  or  withhold  it  from  elite  actions,  all 
come  to  the  fore  as  explanatory  factors  in  the  world 
economy.  Scholarship  in  this  vein  has  engaged 
issues  as  diverse  as  pension  reform,  the  relationship 
between  housing  finance  and  global  financial  power, 
and  the  ability  of  microstate  tax  havens  to  persist 
despite  OECD  efforts  to  shut  them  down  from  this 
interdisciplinary  “bottom-up”  rather  than  “top- 
down”  perspective. 

In  sum,  British  IPE  is  quite  a  different  species 
from  American  IPE.  It  has  different  origins,  orien¬ 
tations,  logics,  philosophical  concerns,  targets, 
agendas,  and  even  views  of  causation  and  matter. 
It  is  not  simply  a  variation  on  a  theme,  it  is  quite  a 
different  beast;  hence  the  claim  that  IPE  is  plural. 
But  it  may  be  worse  than  that,  for  beyond  the 
United  States  and  the  UK,  there  lie  many  claims  to 
IPE  that  can  just  as  easily  be  recognized  as  such, 
even  if  the  authors  use  different  labels. 

Asian  IPE 

The  United  States  and  Britain  (or  more  correctly 
Britain  and  Canada)  are  not  the  only  places  that 
have  made  a  contribution  to  IPE  scholarship.  As 
Walden  Bello  has  argued,  the  distinctive  contribu¬ 
tion  that  scholarship  “in  and  about”  East  Asia  has 
added  to  IPE  is  the  literature  on  the  developmental 
state.  That  is,  a  literature  developed  in  and  about 
East  Asia  that  focuses  on  the  state  as  an  active  agent 
in  economic  development.  Moreover,  there  was, 
argues  Bello,  a  strong  normative  edge  to  such  schol¬ 
arship  that  argued  “for”  states  in  development 
against  the  neoliberal  inclination  of  American  schol¬ 
arship.  From  its  inception,  these  models  were  set  up 
as  a  challenge  to  scholarship  that  sees  universalist 
processes  as  stemming  from  the  Western  experience. 

The  distinctive  focus  of  this  literature  on  the 
state-as-actor  enables  it  to  open  up  issues  that 
Western,  especially  American,  IPE  approaches 
miss  due  to  their  relegation  of  such  processes  to 
“domestic  politics.”  As  such,  the  economically 
positive  “legacy-effects”  of  colonialism,  the  role  of 
labor  repression  (and  the  lack  of  democracy)  in 
state-led  industrialization  efforts,  the  importance 
of  equalitarian  land  reform,  and  how  the  politics  of 
firm-state  relations  changed  over  time,  all  come 


into  focus.  By  tying  the  actual  developmental  his¬ 
tory  of  the  East  Asian  states  into  the  story  of  the 
evolution  of  East  Asian/developmental  state  schol¬ 
arship,  Bello  argues  convincingly  that  this  literature 
is  itself  an  important  contribution  to  contemporary 
IPE — a  contribution  that  belongs  to  neither  the 
United  States  nor  the  UK. 

Underdevelopment  and  IPE: 

France  and  Latin  America 

IPE  is  then  not  only  constituted  by  what  its  U.S.  and 
British  partisans  think  it  is.  There  are  Asian  contri¬ 
butions,  and  there  are  also  telling  absences.  Turning 
first  to  the  lack  of  IPE  in  continental  Europe,  par¬ 
ticularly  France,  Nicolas  Jabko  highlights  three 
main  reasons  why  IPE  remains  “underdeveloped” 
in  France.  First,  in  France,  heterodox  economists, 
rather  than  political  scientists,  tend  to  be  the  politi¬ 
cal  economists  in  France.  Particularly  important 
here  is  the  work  of  the  economists  of  the  French 
“regulation  school,”  who  tend  to  concentrate  on 
the  internal  dynamics  of  capitalist  economies.  As 
such,  both  politics  and  the  notion  of  the  “interna¬ 
tional”  as  a  distinct  sphere  of  action  are  under¬ 
played.  Second,  there  is  a  strong  bias  against 
importing  U.S.  scholarship.  Third  is  the  relationship 
to  Marxism.  While  both  the  UK  and  France  have 
long-standing  legacies  of  Marxist  scholarship,  there 
was,  in  the  postwar  period  in  France,  a  general 
flight  from  Marxism  in  economics  and  political  sci¬ 
ence  rather  than  the  embrace  of  it  as  in  the  British 
case.  As  such,  IPE  in  France,  and  in  Europe,  in  gen¬ 
eral,  remains,  as  Jabko  puts  it,  “underdeveloped.” 

A  similar  story  exists  in  Latin  America,  where  a 
strong  indigenous  theory  of  IPE — dependency 
theory — offered  a  powerful  alternative  theory  of 
international  political  and  economic  processes. 
Dependency  theory  argued  that  capitalism  is  a  sys¬ 
tem  of  unequal  exchange  that  locked  peripheral 
economies  into  exploitative  relationships  with  the 
core  states  of  the  rich  North.  Rather  than  stimulate 
the  development  of  the  whole  economy  and  gener¬ 
ate  broad-based  growth,  international  economic 
relations  retarded  the  growth  of  peripheral  econo¬ 
mies  while  promoting  export  enclaves.  This  in  turn 
produced  a  politics  of  inequality  domestically  that 
could  only  be  held  in  check  by  repressive  forces. 
Jose  Gabriel  Palma  argues  that  these  once  popular 
IPE  literatures  turned  pathological  when,  over 
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time,  the  fit  between  the  facts  generated  by  the 
world  economy  and  the  theory  diverged  so  much 
that  the  goal  of  dependency  theorists  became  sav¬ 
ing  the  theory  at  the  expense  of  reality.  As  such, 
the  theory  lost  its  relevance,  and  a  genuine  Latin 
American  contribution  to  IPE  disappeared  and  left 
the  field  to  other  approaches. 

IPE  in  All  but  Name?  Sociologists 
and  Economic  Historians 

One  consequence  of  not  being  able  to  “bind”  IPE 
is  that  one  is  bound  to  run  into  other  academics 
who  are  doing  essentially  the  same  thing,  but  do 
not  call  what  they  do  IPE.  So  the  question  becomes 
to  what  extent  there  is  any  IPE  beyond  self- 
described  IPE  scholars.  John  Campbell,  for  exam¬ 
ple,  argues  that  international  political  sociologists 
have  for  many  years  been  IPE  scholars.  Such  soci¬ 
ologists  have  studied  topics  such  as  the  diffusion  of 
norms  on  an  international  level,  how  neoliberalism 
has  been  transmitted  around  the  world,  how  the 
international  division  of  labor  is  structured,  and 
how  the  comparative  economic  performance  of 
different  states  is  explainable  by  regime  type:  liter¬ 
atures  that  are  “IPE  in  all  but  name.”  Michael 
Oliver  provides  a  second  example  of  “IPE  in  all  but 
name”  in  the  work  of  economic  historians.  Oliver 
notes  that  economic  historians  tend  to  differ  from 
IPE  scholars  in  two  ways.  First  of  all,  most  eco¬ 
nomic  historians  seek  neither  empirical  laws  nor 
narrow  methodological  perspectives.  Yet,  despite 
this  difference  in  approach,  there  exists  a  strong 
similarity  between  economic  history  and  IPE.  In 
short,  some  of  the  biggest  names  in  IPE  are  eco¬ 
nomic  historians  in  the  broadest  sense  of  the  term, 
in  that  they  use  historical  sources  and  construct 
historical  narratives,  so  there  is  a  natural  overlap. 

Is  the  Future  of  IPE  Fractured? 

IPE  as  an  object  can  be  reasonably  well  defined;  IPE 
as  a  subject,  as  a  way  of  knowing  practiced  by  a 
community  of  scholars  sharing  a  common  branch 
of  scholarship,  is  far  more  fractured.  There  is  a  dis¬ 
tinctive  American  school,  based  in  political  science, 
that  is  becoming  more  distinctive  over  time  with  its 
common  focus  on  “open-economy  politics,”  allied 
to  statistical  and  formal  methods  centering  on 
questions  of  institutional  design,  compliance,  and 


credibility.  Yet  within  this  supposed  monolith  is  a 
thriving  IPE  scholarship  of  a  different  stripe. 

There  is  also  a  distinctive  British  school.  Its  ori¬ 
gins  and  concerns  are  quite  different  from  those  of 
the  American  school,  and  rather  than  the  method¬ 
ological  narrowing  we  see  in  the  American  school, 
British  IPE  threatens  to  become  not  just  an  “inter¬ 
discipline”  but  the  veritable  reinvention  of  the 
social  sciences.  Beyond  America  and  Britain  (plus 
Canada),  there  are  claims  to  be  made  for  an  Asian 
IPE  based  on  the  literature  on  the  developmental 
state  and  for  the  importation  of  work  by  sociolo¬ 
gists  and  economic  historians.  In  one  sense  then, 
the  future  of  IPE  is  bright.  Monocultures  get  wiped 
out  by  rare  events,  but  hybridized  systems  do  not, 
and  IPE  seen  globally  is  certainly  hybrid.  But  in 
another  sense,  there  are  dangers  on  the  horizon. 

Although  diversified  on  a  global  level,  local  pock¬ 
ets  of  IPE,  especially  the  American  school,  are 
extremely  homologous  and  badly  hedged.  They 
constitute  a  local  monoculture  that  seemed  to  be  on 
a  high  for  several  years,  but  the  recent  financial 
crisis  has  thrown  their  methods  (the  utility  of  time- 
series  analysis)  and  insights  (the  ubiquity  of  trans¬ 
parency  and  information  as  stabilizing  technologies) 
into  question  in  some  quarters.  Moreover,  if  the 
orientation  of  the  project  overall  is  to  “become 
more  like  economics,”  then  the  school  risks  redun¬ 
dancy,  since  many  scholars  do  this  already  and  do 
it  better.  They  are  called  economists. 

The  British  school,  while  well  diversified,  faces 
a  different  type  of  redundancy.  If  the  many  differ¬ 
ent  research  communities  of  British  IPE  “let  a 
thousand  flowers  bloom,”  in  doing  such  important 
work,  techniques  and  technologies  established  in 
other  fields  become  rather  pointlessly  rehashed 
and  reinvented.  There  is  a  very  real  risk  that  British 
IPE  ethicists  skim  the  philosophy  enough  to  be 
thinly  informed,  that  everyday  British  IPE  scholars 
reinvent  a  weak-kneed  sociology  of  practice,  and 
that  British  Marxists  haul  out  Louis  Althusser  and 
Antonio  Gramsci  once  again  without  making  any 
actual  progress. 

In  sum,  if  the  boundaries  of  IPE  are  not  drawn 
narrowly,  it  becomes  increasingly  difficult  to  say 
what  IPE  is,  and  what  it  is  not.  If  other  fields  such 
as  sociology  and  economic  history  (and  geography) 
are  included,  then  it  has  to  be  defined  as  an  inter¬ 
disciplinary  field  in  its  own  right.  But  doing  so  has 
its  own  costs  in  terms  of  coherence  and  stability,  as 
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in  any  field  of  knowledge.  On  the  other  hand, 
drawing  boundaries  too  tightly  puts  at  risk  its  very 
existence  as  a  field  of  knowledge. 

In  many  ways,  the  British  school  has  less  at  stake. 
If  the  orientation  of  the  field  is  toward  “a  thousand 
flowers,”  then  cross-pollination  and  what  Strange 
calls  “open  range”  are  virtues.  British  IPE  will  per¬ 
sist  no  matter  what  form  it  takes  because  its  precise 
form  is  less  important  than  its  (hybridized)  content. 
In  contrast,  the  American  school  has  much  more  at 
stake,  but  this  stake  rests  on  a  series  of  rather  nar¬ 
row  technologies  and  theories  that  may  not  be  as 
transcendentally  appropriate  as  many  of  its  protag¬ 
onists  think.  By  either  measure,  IPE  is  (at  least)  a 
plural  field  of  study  with  an  interesting  future. 

Mark  Blyth 
Brown  University 
Providence,  Rhode  Island,  United  States 
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International  Public  Opinion 


Three  points  structure  this  consideration  of  inter¬ 
national  public  opinion.  First,  international  public 
opinion  research  relies  on  statistical  methods  used 
in  domestic  marketing  and  polling  research. 
Second,  public  diplomacy  is  closely  related  to  con¬ 
sideration  of  public  opinion  dynamics  in  interna¬ 
tional  affairs.  Finally,  international  public  opinion 
research  has  rested  on  a  particular  understanding 
of  the  organization  of  the  international  system.  As 
information  technologies  change  the  nature  of 
community  and  individual  identity  formation,  we 
should  expect  concomitant  changes  in  the  organi¬ 
zation  and  analysis  of  public  opinion. 

The  utility  of  opinion  metrics,  both  in  business 
and  politics,  is  found  in  the  ability  to  reveal  pro¬ 
jected  variation  in  cognitive  and  affective  responses 
to  variation  in  product  design,  price  points  (when 
relevant),  or  other  changes  in  key  stimulus  fea¬ 
tures,  regardless  of  whether  they  are  commercial 
or  political  in  nature.  Do  changes  in  some  sort  of 
stimuli  produce  changes  in  attitudes  about  the 
object  of  matter  at  hand?  Whether  we  are  consid¬ 
ering  a  “new  and  improved”  product  or  a  new 
campaign  slogan,  precise  public  opinion  metrics 
offer  an  important  means  of  evaluating  variation 
in  design,  availability,  and  product  presentation. 

With  respect  to  international  public  opinion, 
the  analytical  specificity  that  undergirds  market 
research  (target  marketing  of  micropopulations 
facilitated  by  data  mining  and  computer  analysis) 
and  public  opinion  polling  in  political  and  policy 


domains  (often  with  the  same  level  of  specificity 
found  in  consumer  marketing)  is  often  lost.  In 
domestic  spheres  in  advanced  market  economies  in 
Europe,  North  America,  and  elsewhere  in  the 
world,  consumer  data  and  survey  data  are  used  to 
populate  statistical  programs  that  create  groupings 
or  clusters  of  consumers  (or  voters  or  any  other 
germane  activity).  Niche  media  markets  are  then 
used  (and  often  created)  to  reach  precisely  con¬ 
structed  market  niches.  The  reader  can  see  evi¬ 
dence  of  this  phenomenon  by  visiting  a  local 
magazine  distributor  and  noting  the  numerous  and 
often  esoteric  variety  of  magazines  catering  to 
every  taste,  hobby,  or  habit.  Magazines  devoted  to 
sailing,  for  example,  aggregate  consumers  who 
share  this  interest,  and  in  doing  so,  they  provide  a 
platform  for  advertisers  to  efficiently  reach  that 
precise  audience.  Similarly,  users  of  online  book¬ 
sellers  will  be  met  by  an  opening  web  page  that  is 
populated  by  book  suggestions  that  are  uniquely 
tailored  to  that  person  according  to  his  or  her  past 
purchases. 

As  we  move  away  from  the  specification  of 
niche  populations  according  to  complex  mathe¬ 
matical  analyses  of  demographic  data  concerning 
the  minutiae  of  economic  behavior,  lifestyles,  and 
political  life,  we  usually  lose  this  degree  of  specific¬ 
ity.  Instead,  we  tend  to  see  aggregations  involving 
more  general  features  of  entire  nation-states  and 
regions.  Analyses  are  less  nuanced.  The  degree  of 
specification  changes,  in  part,  owing  to  the  enor¬ 
mous  costs  associated  with  multinational  data  col¬ 
lection  and  analysis.  It  is  also  limited  by  challenges 
associated  with  variations  in  language,  culture, 
and  economic  systems,  all  of  which  put  strains  on 
both  the  internal  and  the  external  reliability  of 
statistical  analyses.  External  validity,  for  example, 
concerns  the  extent  to  which  one  might  safely  gen¬ 
eralize  the  causal  inference — first,  from  the  sample 
studied  to  the  defined  target  population  and,  sec¬ 
ond,  to  other  populations  that  exist  across  time 
and  space.  Put  in  less  technical  terms,  serious  ques¬ 
tions  arise  as  to  whether  we  can,  with  any  confi¬ 
dence,  speak  of  public  attitudes  about  culturally 
specified  concepts  such  as  freedom,  democracy, 
and  rights  in  large-scale  multinational  studies. 

There  are,  however,  several  noteworthy  studies 
that  have  attempted  to  do  serious  transnational  com¬ 
parative  survey  work.  One  of  the  more  important 
efforts  at  overcoming  these  limitations  is  the  Pew 
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Research  Center’s  Global  Attitudes  Project.  As  of 
2009,  Pew  has  conducted  more  than  175,000  inter¬ 
views  in  55  countries.  Another  example  is  found  in 
WorldPublicOpinion.org,  a  consortium  of  domestic 
polling  organizations  from  around  the  world  that 
was  organized  and  managed  by  the  Program  on 
International  Policy  Attitudes  at  the  University  of 
Maryland.  At  the  time  of  this  writing,  the  consor¬ 
tium  includes  research  centers  in  more  than  25  coun¬ 
tries  across  all  the  major  continents.  A  number  of 
associations  and  academic  organizations  are  also 
involved  in  the  analysis  of  international  public  opin¬ 
ion.  One  of  the  oldest  is  the  World  Association  for 
Public  Opinion  Research,  founded  in  1947. 

Noteworthy  efforts  such  as  these  aside,  the  scale 
of  international  public  opinion  measurement  tends 
to  limit  the  practical  application  of  metrics.  Publics 
in  international  public  opinion  are  most  often 
defined  by  the  commonality  of  manageable  geo¬ 
graphical  space,  usually  defined  in  terms  of  the 
nation-state  and  its  subdivisions  in  provinces, 
states,  or  districts.  That  is,  statistically  and  demo- 
graphically  specified  subgroups,  as  found  in  com¬ 
mercial  or  domestic  political  marketing,  are  not 
found  in  most  publicly  available  analyses  of  inter¬ 
national  public  opinion.  Instead,  pollsters,  policy¬ 
makers,  scholars,  and  politicians  speak  of  larger 
aggregations,  such  as  American,  Japanese,  Russian, 
or  other  nationally  defined  collective  of  common 
identity. 

Besides  its  broader  nature,  international  public 
opinion  is  also  more  likely  to  be  concerned  with  the 
dynamics  of  international  public  standing  of  nation¬ 
states  and  closely  related  questions  about  the  effec¬ 
tiveness  of  various  public  diplomacy  programs 
intended  to  affect  those  standings.  Clarifying  this 
idea  requires  a  brief  detour  into  public  diplomacy. 

Whereas  traditional  diplomacy  involves  negotia¬ 
tions,  demarches,  and  other  forms  of  official  com¬ 
munication  between  representatives  of  nation-states, 
public  diplomacy  by  contrast  focuses  on  the  ways  in 
which  a  government  of  one  country  communicates 
with  and  therefore  influences  (or  so  it  is  assumed) 
publics  in  another  country  or  region.  The  term  pub¬ 
lic  diplomacy  was  first  used  in  1965  by  Edmund 
Gullion,  then  dean  of  Fletcher  School  of  Law  and 
Diplomacy  at  Tufts  University.  He  defined  it  as 

the  influence  of  public  attitudes  on  the  formation 

and  execution  of  foreign  policies.  It  encompasses 


dimensions  of  international  relations  beyond 
traditional  diplomacy  .  .  .  [including]  the 
cultivation  by  governments  of  public  opinion  in 
other  countries;  the  interaction  of  private  groups 
and  interests  in  one  country  with  those  of 
another  .  .  .  (and)  the  transnational  flow  of 
information  and  ideas.  (From  an  early  brochure 
of  the  Edward  R.  Murrow  Center  of  Public 
Diplomacy,  Fletcher  School,  Tufts  University) 

There  is,  however,  no  generally  accepted  or  broadly 
shared  definition  of  public  diplomacy.  For  exam¬ 
ple,  it  isn’t  clear  whether  it  involves  only  interna¬ 
tional  broadcasting  (and  today  Internet  outreach) 
or  if  it  also  properly  includes  educational  and  cul¬ 
tural  exchange  programs,  disaster  relief  and  aid 
programs,  and  other  forms  of  international  com¬ 
munication  of  cultural,  political,  and  policy  intent. 
Nor  is  there  a  well-specified  social  science  theory 
or  set  of  theories  that  serve  to  ground  the  expecta¬ 
tions  some  observers  have  concerning  what  public 
diplomacy  can  realistically  achieve  by  affecting 
international  public  opinion.  Therefore,  those 
responsible  for  public  diplomacy  may  lay  out 
ambitious  goals  such  as  improving  international 
opinion  toward  the  United  States,  when  in  fact 
there  are  major  methodological  challenges  associ¬ 
ated  with  measuring  the  extent  of  such  change.  It 
is  not  clear  how  one  might  go  about  finding  com¬ 
munication  stimuli  capable  of  producing  a  measur¬ 
able  cognitive  effect  in  a  generally  specified  foreign 
audience  over  the  short  term.  As  Robert  Entman 
has  made  clear,  political  communication  and 
social  psychological  research  findings  do  not  sup¬ 
port  such  an  optimistic  expectation. 

In  sum,  international  public  opinion  is  often 
contextualized  by  an  undertheorized  discussion 
about  international  public  opinion  concerning 
broad  aggregations,  such  as  the  standing  of  the 
United  States  in  world  opinion.  Unlike  the  analysis 
of  domestic  public  opinion  in  politics  and  econom¬ 
ics,  methodological,  logistical,  and  theoretical 
encumbrances  limit  the  sorts  of  data  collected  and 
questions  asked.  Generally,  international  public 
opinion  analysis  tends  to  follow  the  contours  of 
geopolitically  drawn  maps:  We  are  presented  with 
aggregate  statistics  about  opinion  profiles  accord¬ 
ing  to  national  identity. 

Target  marketing  strategies  that  rely  on  rich 
streams  of  demographic  and  lifestyles  data  point  to 
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forms  of  aggregating  “publics”  that  do  not  neces¬ 
sarily  rest  in  geopolitically  determined  identities. 
Put  differently,  an  underlying  premise  in  interna¬ 
tional  public  opinion  research  is  that  publics,  by 
nature,  can  be  understood  or  conceived  of  only  in 
terms  of  the  coincidental  sharing  of  geographical 
space.  Where  one  is  born  is,  along  with  socializa¬ 
tion,  a  fundamental  element  to  public  opinion. 
International  public  opinion  is  therefore  analyzed 
according  to  a  particular  understanding  of  the 
nature  of  the  international  system  itself.  Comparative 
opinion  profiles  according  to  national  boundaries 
are  most  often  understood  as  defining  the  parame¬ 
ters  of  the  concept  itself.  We  can  speak  of  how  a 
given  nation-state  is  held  in  good  (or  low)  regard  by 
the  citizens  of  an  array  of  other  nation-states. 

But  as  Benedict  Anderson  has  taught  us,  aggre¬ 
gations  of  this  sort,  as  natural  as  they  may  seem, 
are  synthetic  and,  relatively  speaking,  quite  new  to 
human  history.  Anderson  (1991)  defines  a  nation 
as  “an  imagined  political  community  [that  is] 
imagined  as  both  inherently  limited  and  sover¬ 
eign”  (p.  7).  An  imagined  community  is  different 
from  an  actual  community  because  it  is  not  (and 
cannot  be)  based  on  everyday  face-to-face  interac¬ 
tion  between  its  members.  Such  a  thing  is  possible 
only  in  communities  rooted  in  a  shared  and  small 
geographical  space.  Instead,  in  a  nation-state, 
members  hold  in  their  minds  a  mental  image  of 
their  affinity.  We  are  “Americans,”  “Germans,”  or 
“Russians”  not  because  we  know  all  of  our  kins¬ 
men  but  rather  because  of  a  mental  state  that 
defines  the  contours  of  a  mapped  affinity  group — 
the  nation. 

The  formation  of  such  an  imagined  affinity 
group  became  feasible  only  with  the  rise  of  print 
capitalism.  Anderson  (1991)  notes  that  the  basic 
structure  of  two  forms  of  imagining  that  first  flow¬ 
ered  in  Europe  in  the  18th  century  were  the  novel 
and  the  newspaper.  “For  these  forms  provided  the 
technical  means  for  ‘re-presenting’  the  kind  of  imag¬ 
ined  community  that  is  the  nation.”  (p.  25).  The 
nature  of  possible  imagined  communities  is  associ¬ 
ated  with  the  nature  of  communication  systems. 

The  information  environment  of  the  first  decade 
of  the  21st  century,  where  communities  and  indi¬ 
viduals  across  the  globe  are  knitted  together  by 
high-speed  and  low-cost  computer  and  telecom¬ 
munication  links,  opens  the  door  to  new  forms  of 
imagined  communities  constituted  by  aggregations 


rooted  in  cyber  rather  than  physical  space.  As  with 
imagined  communities  qua  nation-states,  the  basis 
of  collective  affinity  is  a  sense  of  communicated 
and  shared  values  and  norms.  But  whereas  news¬ 
papers  and,  to  a  lesser  degree,  books  were  confined 
to  a  fixed  geographical  space,  communication 
today  is  uprooted  from  both  space  and  time. 
Communication  appears  everywhere  in  real  time. 

James  Rosenau’s  work,  especially  Along  the 
Domestic-Foreign  Frontier  and  Distant  Proximities, 
has  led  the  way  in  helping  us  think  about  the  inter¬ 
national  system  in  this  new  way.  An  alternative 
model  to  the  geocentric  understandings  of  the  pub¬ 
lic  and  public  opinion  is  therefore  found  in  the 
consideration  of  affinity  groups  consisting  of  indi¬ 
viduals  working  collaboratively  according  to 
shared  norms  and  values,  though  not  necessarily  in 
shared  physical  space.  In  this  way,  one  might  even 
say  that  we  move  away  from  international — where 
the  nation  is  still  the  central  organizing  principle — 
public  opinion  to  global  public  opinion. 

One  expression  of  global  public  opinion,  under¬ 
stood  in  this  way,  can  be  found  in  nongovernmen¬ 
tal  organizations  (NGOs),  or  what  Ann  Florini 
calls  a  third  force  in  the  international  system. 
Between  1973  and  1983,  the  number  of  transna¬ 
tional  human  rights  organizations  doubled  from 
41  to  89.  In  the  next  decade,  the  number  doubled 
again.  Between  1973  and  2000,  the  number  of 
NGOs  working  for  women’s  rights  increased  by 
300%,  a  figure  that  probably  underestimates  the 
larger  number  of  organizations  that  work  for 
women’s  rights  but  not  exclusively.  There  is  little 
doubt  that  NGOs  of  all  types — working  for  human 
rights,  environmental  issues,  nuclear  nonprolifera¬ 
tion,  economic  development,  and  many  other 
causes  and  concerns — are  a  growing  factor  in  inter¬ 
national  affairs.  The  point  here  is  that  transnational 
advocacy  organizations  and  social  movements  con¬ 
stitute  an  expression  of  a  nongeographically  rooted 
public  created  by  a  set  of  values  and  norms  that  are 
shared  across  the  globe  through  new  media. 
Supporters  and  constituent  elements  of  transna¬ 
tional  NGOs  are  members  of  a  community  of 
shared  beliefs  and  values — publics — without  neces¬ 
sarily  being  members  of  a  shared  geographically 
defined  community.  They  constitute  a  new  type  of 
international  public  opinion. 

But  their  importance  is  perhaps  found  not  in  the 
growth  of  their  individual  numbers  but  rather  in 
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the  growth  of  their  coordination  and  cooperation. 
In  domestic  politics,  it  is  often  said  that  public  opin¬ 
ion  exerts  pressure  to  create  particular  outcomes. 
In  international  affairs,  communities  of  like-minded 
individuals  and  organizations  may  achieve  some¬ 
thing  similar.  Margaret  E.  Keck  and  Kathryn 
Sikkink  (1998)  define  transnational  advocacy  net¬ 
works  (TANs)  as  “characterized  by  voluntary, 
reciprocal,  and  horizontal  patterns  of  communica¬ 
tion  and  exchange”  (p.  8).  Networks  are  “commu¬ 
nicative  structures.”  Communication  structures 
create  new  forms  of  imagined  communities,  as 
Anderson  argued.  This  opens  up  fascinating  new 
ways  to  conceptualize  and  study  international  pub¬ 
lic  opinion. 

This  entry  has  reviewed  international  public 
opinion  in  the  context  of  the  more  precise  domes¬ 
tic  analysis  of  opinion  research.  It  has  also  noted 
the  close  relationship  between  international  public 
opinion  and  public  diplomacy.  Finally,  this  entry 
has  reviewed  recent  thinking  among  international 
affairs  scholars  with  regard  to  the  changing  nature 
of  the  organization  of  the  international  system. 
New  information  technologies  open  up  the  possi¬ 
bility  of  imagined  communities  that  are  not  rooted 
in  geographical  space  but  rather  in  cyberspace.  In 
short,  we  may  now  consider  the  public  opinion 
measurement  of  communities  consisting  of  those 
who  share  similar  values  and  norms  and  organize 
themselves  into  communities  accordingly  but  who 
do  not  share  common  physical  space. 

Steven  Livingston 
The  George  Washington  University 
Washington,  D.C.,  United  States 
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Communication;  Public  Opinion 
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International  Regimes 


Unlike  many  concepts  in  international  relations, 
there  is  broad  agreement  on  the  definition  of  inter¬ 
national  regimes.  Stephen  Krasner  (1983)  pro¬ 
vided  the  standard  definition,  according  to  which 
international  regimes  are  “principles,  norms,  rules, 
and  decision-making  procedures  around  which 
actor  expectations  converge  in  a  given  issue-area” 
(p.  1).  Regime  theorists  generally  try  to  explain  the 
emergence,  persistence,  change,  and  decline  of 
these  relatively  stable  patterns  of  international  col¬ 
laboration  and  coordination.  International  regimes 
are  to  be  distinguished  from  national  regimes, 
which  describe  the  formal  and  informal  structure 
of  political  power  in  a  country. 

All  major  theoretical  schools  of  international 
relations  are  represented  in  the  study  of  regimes. 
The  first  articulations  of  the  realist  theory  of  hege¬ 
monic  stability  and  of  (neo)liberal  institutionalism 
appeared  in  the  early  1970s.  Arguably,  the  first 
coherent  expressions  of  constructivism  in  interna¬ 
tional  relations  also  appeared  in  this  period  and 
treated  the  diffusion  and  use  of  knowledge  in  rela¬ 
tion  to  technology  transfer.  The  study  of  interna¬ 
tional  regimes  experienced  its  golden  age  from  the 
early  1980s  to  the  late  1990s,  but  the  remarkable 
productivity  of  the  research  program  has  since 
shifted  the  focus  of  inquiry  to  the  related  norma¬ 
tive  questions  of  global  governance  and  toward 
disaggregating  regime  theory  to  include  the  influ¬ 
ence  of  domestic  interests  on  the  international 
delegation  of  authority  and  sovereignty. 
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Realists’  Regimes 

Many  realists  see  in  international  regimes  an 
extension  of  the  power  politics  that  suffuse  all 
international  political  life  and  apply  economic 
rationality  to  explain  the  creation  and  demise  of 
regimes.  According  to  the  hegemonic  stability 
theory  of  international  regimes,  only  powerful 
states  can  supply  regimes  as  public  goods,  and 
they  will  do  so  only  when  they  expect  a  net  gain 
in  their  relative  power  positions  as  a  result.  Other 
states  are  then  able  to  act  as  free  riders  by  bene¬ 
fiting  from  the  regime  without  contributing  to 
its  provision,  or  they  can  be  caught  as  forced 
consumers  if  they  do  not  desire  the  regime  but 
cannot  escape  its  scope.  Though  the  hegemonic 
stability  theory  of  international  regimes  can 
explain  the  demand  for  and  supply  of  regimes,  it 
has  been  criticized  with  the  observation  that 
many  regimes  are  not  strictly  public  goods 
because  they  are  excludable  and  potentially 
rivals.  Moreover,  a  group  of  states  might  be  able 
to  supply  the  regime  jointly  in  the  absence  of  a 
hegemon,  as  is  the  case  with  the  small  minority  of 
states  that  provides  the  vast  majority  of  the 
United  Nations’  (UN’s)  budget. 

Some  scholars  have  modified  this  realist  account 
by  positing  that  states  are  not  uniformly  sensitive 
to  gains  reaped  by  others.  A  given  state  is  less 
likely  to  measure  its  own  gains  relative  to  those  of 
an  ally  or  a  small  and  distant  state  than  to  those  of 
a  rival  or  powerful  neighbor.  Weighing  relative 
gains  implies  that  all  states,  rather  than  a  lone 
hegemon,  will  seek  to  create  and  maintain  regimes 
as  long  as  they  assess  their  own  benefit  to  be 
greater  than  the  benefit  accrued  by  other  relevant 
states.  The  resurgence  of  classical  realism,  which  is 
less  reliant  on  an  economic  concept  of  rationality, 
also  provides  a  modified  institutionalist  explana¬ 
tion  according  to  which  dominant  states  will 
establish  regimes  with  rules  congenial  to  them¬ 
selves  in  order  to  create  an  institution  whose  nor¬ 
mative  and  regulatory  inertia  will  benefit  them 
even  after  their  power  begins  to  wane.  That  the 
five  great  power  victors  of  World  War  II  have  the 
only  seats  on  the  UN  Security  Council  endowed 
with  permanence,  veto  power,  and  a  UN-sanctioned 
right  to  possess  nuclear  weapons  serves  as  an 
example.  Other  variables,  such  as  whether  a  state 
is  revisionist  or  satisfied  with  the  status  quo  as  well 


as  the  polarity  of  the  international  system,  can  also 
be  integrated  into  such  analyses. 

Neoliberals’  Regimes 

Neoliberal  theories  also  heavily  incorporate  micro- 
economic  analogies.  Liberal  theories  of  interna¬ 
tional  regimes  that  focus  on  domestic  preference 
aggregation  and  upward  causality  also  exist,  but 
they  tend  to  be  less  well  developed  than  the  neolib¬ 
eral  theories.  The  core  theses  of  the  neoliberal 
account  are  that  regimes  would  be  unnecessary 
were  it  not  for  the  “market  failures”  of  an  anarchi¬ 
cal  system,  that  international  regimes  are  not  mere 
epiphenomena  of  power,  and  that  many  states  are 
interdependent,  such  that  the  advantage  to  the  one 
depends  on  the  success  of  the  others.  Specifically, 
market  failures  include  the  lack  of  an  authoritative 
international  legal  framework,  imperfect  informa¬ 
tion,  and  transaction  costs,  all  of  which  inhibit 
states  from  having  confidence  in  contracts.  Regimes 
alleviate  this  problem  by  providing  states  with 
more  and  better  information  about  others’  activi¬ 
ties,  reducing  transaction  costs  by  providing  a 
single  standard  for  all  rather  than  many  bilateral 
standards,  and  extending  the  shadow  of  the  future 
so  that  defectors  can  be  punished. 

The  neoliberal  literature  relevant  to  interna¬ 
tional  regimes  is  vast,  but  there  are  several  common 
elements.  Among  these  are  the  analytical  assump¬ 
tion  of  states  as  unitary,  utility-maximizing  actors 
and  the  use  of  game-theoretic  analogies  to  formal¬ 
ize  the  collective  action  problems  that  give  rise 
to  regimes.  Briefly,  there  are  assurance  games  in 
which  all  parties  prefer  the  same  outcome  and 
for  which  no  regime  is  required,  coordination 
games  in  which  there  are  multiple  outcomes  that 
the  parties  find  acceptable  and  that  will  often 
require  a  mere  convention,  and  collaboration  games 
in  which  the  parties  must  forgo  their  maximum 
strategies  to  achieve  the  optimal  outcome.  Examples 
of  these  would  include  food  safety  (where  there  is 
little  or  no  incentive  to  cheat),  a  common  language 
for  international  flight  personnel  (where  several 
coordination  points  are  effectively  equivalent),  and 
the  nuclear  nonproliferation  regime  (where  there 
are  strong  incentives  to  cheat),  respectively. 

The  microeconomic  logic  used  in  neoliberal 
regime  analysis  provides  a  parsimonious  framework, 
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but  it  naturally  assumes  knowledge  of  actors’ 
interests,  which  are  in  turn  a  function  of  their  pref¬ 
erences.  However,  this  framework  provides  no 
insight  about  the  sources  or  form  of  these  prefer¬ 
ences,  and  the  neoliberal  approach  has  been  criti¬ 
cized  for  imposing  definitions  of  utility  on  actors 
rather  than  discovering  them.  The  assumption  of  a 
single,  definitive  preference  ranking  for  each  actor 
and  a  neglect  of  the  normative  consequences  of 
regimes  are  further  sources  of  criticism. 

Constructivists’  Regimes 

Whereas  the  realist  and  neoliberal  approaches 
assume  unitary  actors  with  clear  and  singular  pref¬ 
erence  structures,  the  constructivist  approach 
problematizes  these  and  seeks  to  ascertain  actor 
preferences  rather  than  assign  them.  Instead  of 
assuming  that  actors  follow  a  logic  of  conse¬ 
quences,  which  is  based  on  their  assessments  of  the 
costs  and  benefits  of  different  courses  of  action, 
constructivists  would  argue  that  decision  makers 
follow  a  logic  of  appropriateness,  in  which  they 
choose  a  course  of  action  based  on  their  perception 
of  the  situation  and  of  their  roles  in  it.  Accordingly, 
the  way  in  which  an  issue  is  framed  and  the  inputs 
of  the  domestic  and  international  political  pro¬ 
cesses  are  important  determinants  of  the  final 
regime  outcome. 

One  source  of  input  that  can  operate  at  domes¬ 
tic  and  international  levels  is  epistemic  communi¬ 
ties.  These  are  networks  of  professional  experts 
with  technical  knowledge  about  the  problem  a 
regime  could  solve.  For  example,  the  Bretton 
Woods  institutions  would  have  been  anathema  to 
both  the  protectionist  and  liberal  Ricardian  ideas 
that  had  dominated  before  World  War  II,  but 
institutional  innovation  was  clearly  required  to 
rebuild  the  world  economy  in  its  wake.  As 
Keynesian  liberal  interventionism  gained  currency 
among  economic  experts,  however,  the  victors 
were  able  to  agree  on  institutions  that  would 
allow  both  free  trade  and  an  active  industrial 
policy,  though  such  institutions  had  been  previ¬ 
ously  unthinkable.  Without  changing  the  physical 
nature  of  the  problem  or  the  constitution  of  the 
actors,  new  knowledge  can  substantially  recast 
actors’  understanding  of  a  problem  and  their 
likely  courses  of  action. 


Many  constructivists  would  argue  that  coop¬ 
eration  in  the  context  of  regimes  presumes  a 
preexisting  repertoire  of  shared  normative  con¬ 
cepts  and  communicative  deeds.  Taking  this  line 
of  argument  further,  some  constructivists  prob- 
lematize  the  normative  structures  and  social 
institutions  that,  they  would  argue,  provide  the 
conditions  of  possibility  for  concepts  such  as 
rationality,  legitimacy,  and  cooperation  in  the 
first  place.  Hypothesizing  such  malleability  in 
international  institutions  has  led  many  critical 
theorists  to  question  whether  the  current  config¬ 
uration  of  international  regimes  could  not  be 
recast  to  become  more  just  by  making  them  less 
reflective  of  the  prevailing  power  and  distribu¬ 
tional  structures.  The  constructivist  approach 
has  been  criticized  for  lacking  some  elements  that 
would  support  a  cumulative  research  agenda, 
such  as  a  falsifiable  corpus  of  theory  and  hypoth¬ 
esis-generating  research  questions. 

Empirical  Analysis  and  Open  Questions 

In  the  empirical  study  of  regimes,  qualitative  meth¬ 
ods  dominate  because  though  there  are  some 
regime  characteristics  that  most  studies  seek  to 
explain,  such  as  effectiveness,  robustness,  and 
change  over  time,  the  issue  areas  and  characteris¬ 
tics  of  particular  regimes  differ  too  much  for  the 
construction  of  data  sets  and  quantitative  methods 
to  be  practical.  Where  theoretically  applicable, 
researchers  often  devise  a  formal  model,  such  as  a 
game-theoretic  matrix;  derive  hypotheses  from  the 
model;  and  try  to  show  the  similarities  between  the 
model  and  the  case  in  question.  Where  comparison 
between  regimes  is  sensible,  comparative  case 
studies  are  also  widely  used.  One  recent  project 
has  produced  the  International  Regimes  Database, 
which  includes  data  pertaining  to  23  regimes  and 
172  regime  elements.  The  researchers  involved 
hope  that  this  will  allow  others  to  analyze  certain 
dimensions  of  regimes  across  cases  and  to  develop 
broadly  valid  hypotheses  and  generalizations. 
Because  comparison  between  regimes  is  often  not 
sensible,  it  has  also  been  suggested  that  counterfac- 
tuals  could  be  an  appropriate  tool  to  isolate  the 
independent  effects  of  regimes.  In  fact,  a  proposal 
has  even  been  made  to  study  international  nonre¬ 
gimes — that  is,  issue  areas  in  which  the  conditions 
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indicate  that  a  regime  would  be  likely  to  occur  but 
none  is  present. 

Despite  these  innovative  proposals,  scholarly 
interest  has  somewhat  migrated  away  from  the 
study  of  international  regimes  as  such.  This 
change  shows  that  regime  theory  is  partially  a 
victim  of  its  own  success.  The  research  program 
has  served  to  move  the  debate  beyond  the  crude 
conceptions  of  international  anarchy  that  had  pre¬ 
viously  dominated  it,  but  attention  has  now 
broadly  shifted  to  the  normative  consequences  of 
different  forms  of  international  and  global  gover¬ 
nance  and  the  closer  examination  of  mechanisms 
within  and  below  regimes.  Specifically,  much 
research  in  the  rationalist  framework  has  begun  to 
examine  how  authority  is  delegated  from  various 
domestic  structures  and  processes  to  international 
and  supranational  decision-making  fora.  Con¬ 
structivist  research  is  now  primarily  concerned 
with  the  legitimacy  of  the  authority  thus  delegated 
and  how  to  make  the  resulting  decisions  and  poli¬ 
cies  more  representative  of  interest  groups  at  all 
levels.  Recent  studies  have  also  begun  to  consider 
the  transnational  aspects  of  international  regimes 
more  closely  by  investigating  when  and  how  pri¬ 
vate  entities,  such  as  firms  and  cartels,  have  devel¬ 
oped  regulatory  regimes  among  themselves.  This 
trend  is  also  visible  in  the  research  treating  the 
governance  implications  of  public-private  part¬ 
nerships,  in  which  governments  and  private  firms 
contract  to  perform  some  function. 
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The  academic  field  of  international  relations  (IR)  is 
barely  a  century  old.  Its  roots,  however,  go  much 
further  back.  This  entry  exposes  some  of  these  his¬ 
torical  roots  before  it  describes  the  emergence  of 
academic  IR — with  its  university  departments, 
specialized  courses,  and  plethora  of  journals  and 
books  devoted  to  the  causes  of  war  and  the  pre¬ 
conditions  for  a  lasting  peace. 

The  entry  pays  much  attention  to  the  evolution 
of  IR  in  the  aftermath  of  World  War  I  and  the 
emergence  of  its  distinctive  issues  and  approaches. 
Then,  it  follows  the  growth  of  IR  as  a  more  mature, 
multiparadigmatic  social  science  in  the  wake  of 
World  War  II.  The  final  sections  of  the  entry 
emphasize  the  tempestuous  discussions  that  have 
marked  scholarly  IR  after  the  end  of  the  Cold  War. 

Philosophical  Forebears 

When  did  the  science  of  IR  emerge?  According  to 
the  popular  foundation  myth  formulated  by 
Edward  Hallett  Carr  in  the  late  1930s,  it  emerged 
from  World  War  I,  driven  by  a  wish  to  create  a 
peaceful  world.  Myths  are  not  wrong,  but  they  are 
not  historical  descriptions  either.  The  science  of  IR 
has  a  more  complicated  birth  than  this.  Its  subject 
matter  has  been  adumbrated  by  scholars,  states¬ 
men,  and  soldiers  for  centuries. 

What  is  the  subject  matter  of  IR?  At  its  core  lie 
discussions  of  war  and  peace — not  only  from  the 
standpoint  of  single  states  but  also  as  properties  of  a 
larger  society  or  system  of  states.  Thus  defined,  fore¬ 
shadowings  of  IR  are  found  already  in  antiquity — in 
the  West,  most  famously  in  Thucydides’  The 
Peloponnesian  Wars  (ca.  550  BCE)  and  in  the 
East,  in  Kautilya’s  Arthasastra  (ca.  300  BCE). 
However,  as  antique  state  structures  unraveled  or 
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were  swallowed  up  by  empires,  discussions  of 
interstate  relations  disappeared;  early-medieval 
authors  like  Cassiodorus,  Procopius,  and  al- 
Shaybanl  discussed  both  war  and  diplomacy,  but 
relations  among  states  were  not  in  their  purview. 

Discussions  of  interstate  relations  reappeared, 
however,  in  the  postimperial  age.  The  rise  of  city- 
states  in  Italy  and  the  rediscovery  of  classical 
authors  like  Thucydides  were  attended  by  probing 
discussions  of  war  and  peace  and  relations  among 
states.  In  Florence,  for  example,  authors  like 
Niccolo  Machiavelli  and  Francisco  Guicciardini 
reintroduced  antique  themes  of  power  and  princely 
leadership  and  applied  them  to  the  affairs  of  their 
own  times. 

Renaissance  discussions  tended  to  concern  phil¬ 
osophical  issues  such  as  the  nature  of  the  good 
society  and  the  qualities  of  political  leadership.  In 
the  wake  of  the  Reformation,  the  scholarly  discus¬ 
sions  grew  more  descriptive  and  practical,  partly 
because  of  the  advent  of  more  effective  state  struc¬ 
tures  ruled  by  powerful  dynasties  who  concen¬ 
trated  political  power,  religious  authority,  and 
military  command  in  royal  hands  and  partly 
because  of  the  emergence  of  new  concepts  and 
secular  theories. 

Sovereignty,  Contract,  and  Perpetual  Peace 

The  important  concept  of  sovereignty  emerges 
in  the  late  Renaissance.  It  received  its  classic  formu¬ 
lation  by  Jean  Bodin.  The  social  and  political  impli¬ 
cations  of  the  concept  were  explored  by  the  Dutch 
jurist  Hugo  Grotius,  who  wrote  a  learned  treatise 
titled  Law  of  War  and  Peace  (1625),  which  dis¬ 
cussed  IR  in  light  of  natural  law  and  human  gre¬ 
gariousness.  The  English  social  philosopher  Thomas 
Hobbes  presented  a  different  view.  He  argued  in 
his  Leviathan  (1651)  that  rational  individuals  safe¬ 
guard  their  interests  by  agreeing  to  establish  a 
powerful  state  that  will  establish  laws  and  protect 
them  all.  No  such  arrangement  exists  among  states, 
however;  states  exist  in  a  lawless  “state  of  nature.” 

Such  discussions  evolved  hand  in  hand  with  the 
16th-  and  17th-century  evolution  of  the  modern 
state — by  the  development  of  fiscal  and  military 
structures,  by  the  concentration  of  power  in  the 
hands  of  powerful  monarchs,  and  by  growth  in 
international  trade.  New  political  concepts  and 
theories  emerged,  many  of  them  grafted  on  to  the 


theories  of  social  contract  expressed  by  Hobbes 
during  the  English  civil  wars  and,  later,  by  John 
Locke  during  the  Glorious  Revolution  of  1688. 

Hobbes  and  Locke  based  their  political  philoso¬ 
phies  on  the  same  thought  experiment:  They  imag¬ 
ined  what  relations  among  sovereign  human  actors 
might  have  looked  like  in  the  absence  of  any  over¬ 
arching  state  authority.  The  resulting  vision — the 
imaginary  “state  of  nature” — opened  up  a  new  way 
of  conceptualizing  relations  among  sovereign  politi¬ 
cal  units,  on  the  individual  as  well  as  the  collective 
level.  Subsequent  authors  applied  this  vision  to  rela¬ 
tions  among  sovereign  states.  The  Dutch  philoso¬ 
pher  Benedict  de  Spinoza  evolved  from  Hobbes’s 
argument  a  bleak  description  of  interstate  relations 
in  terms  of  jealous  sovereign  actors  that  pursued 
their  interests  in  ruthlessly  egotistical  ways.  Christian 
Wolff  and  Emeric  Vatell  pursued  the  vision  of 
Locke  and  developed  a  more  harmonious  view  of 
rational  actors  that  pursued  their  individual  goals 
yet  also  understood  the  role  played  by  common 
interests  and  cooperation. 

These  two  attitudes  to  the  world — the  Hobbesian 
and  the  Lockean — evolved  and  interwove  in  subse¬ 
quent  years.  A  new  step  in  the  history  of  IR  began 
during  the  Enlightenment,  when  the  early  notions 
of  conflict  and  cooperation  gave  way  to  a  more 
widely  shared  systemic  perspective  on  the  interre¬ 
lations  among  sovereign  actors. 

This  perspective  was  first  expressed  in  two  dis¬ 
courses  that  emerged  in  the  18th  century:  one  orbit¬ 
ing  the  concept  of  balance  of  power  and  the  other 
concerning  plans  for  a  perpetual  peace.  The  notion 
of  political  balance  is  old — it  was  noted  by 
Thucydides  and  invoked  by  Renaissance  scholars 
like  Francesco  Guicciardini.  These  early  authors, 
however,  tended  to  apply  the  notion  to  the  policy  of 
single  states  (later  called  “containment”)  or  to  rela¬ 
tions  between  pairs  of  states  (later  called  “counter¬ 
poise”).  But  a  new  notion  appeared  during  the  wars 
of  Louis  XIV,  when  authors  like  Francois  Fenelon 
noted  that  the  order  of  all  of  Europe  was  main¬ 
tained  by  a  principle  of  balance.  In  1713,  the  diplo¬ 
mats  who  wrote  the  Treaty  of  Utrecht  noted  that 
their  aim  was  to  establish  “a  durable  balance”  in 
Europe.  These  examples  suggest  a  new  understand¬ 
ing  of  the  balance  of  power  as  a  mechanism  that 
would  bring  order  to  Europe’s  state  relations.  They 
indicate  that  Europe’s  macropolitical  scene  is  seen 
as  a  unity  composed  of  several  sovereign  states. 
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This  portrayal  of  Europe  as  a  system  of  states 
evolved  in  subsequent  decades.  It  reached  its 
mature  form  in  the  work  of  William  Robertson, 
who  set  out  to  explain  both  the  origins  of  balance- 
of-power  policies  and  the  mechanisms  of  its 
operation.  Robertson’s  explanation  was  very 
influential.  On  the  basis  of  it,  Edward  Gibbon 
argued  that  balance-of-power  dynamics  have  pre¬ 
vented  the  multistate  system  of  Europe  from  slid¬ 
ing  back  into  empire.  Gibbon,  thus,  drew  an 
analytical  distinction  between  a  balance-of-power 
system  of  sovereign  states  and  an  empire. 

The  notion  of  a  perpetual  peace,  too,  has  long 
forbears — its  roots  can  be  traced  back  to  religious 
thinkers  like  Augustine,  Aquinas,  Dante,  and 
Erasmus.  During  the  Enlightenment,  however, 
secular  scenarios  for  a  new  and  peaceful  world 
order  were  expressed  by  thinkers  who  relied  on 
appeals  to  human  reason,  natural  rights  of  free¬ 
dom,  the  reality  of  historical  progress,  and  an 
implied  notion  of  Europe  as  a  system  of  sovereign 
states.  Due  de  Sully  was  among  the  first  thinkers  to 
propose  a  way  of  establishing  peaceful  relations 
among  the  sovereign  states  of  Europe.  Abbe  de  St. 
Pierre  was  another  important  contributor. 

When  major  Enlightenment  philosophers  began 
to  wrestle  with  the  question  of  peace  and  culti¬ 
vated  new  cosmopolitan  ideals,  the  theoretical 
trickle  became  a  flood:  The  cosmopolitan  approach 
held  that  human  beings  are  rational  and  peaceful 
and  that  conflict  and  war  are  the  result  of  dysfunc¬ 
tional  regime  types.  This  attitude  was  expressed 
most  famously  by  Prussian  philosopher  Immanuel 
Kant  in  his  essay  Perpetual  Peace:  A  Philosophical 
Sketch  (1795). 

The  Turn-of-the-Century  Breakthrough 

Balance-of-power  theories  and  perpetual-peace 
plans  were  products  of  the  early  Enlightenment  and 
were  infused  with  the  optimism  of  the  age.  Both 
theories  invoked  the  use  of  reason  in  the  service  of  a 
better  world.  By  the  19th  century,  their  themes  and 
concepts  were  applied  by  soldiers,  statesmen,  and 
scholars  in  various  fields — in  history,  in  law,  and  in 
the  new  social  sciences  (geography,  economics,  soci¬ 
ology,  and  political  science).  Two  developments 
helped  pave  the  way  for  a  more  mature  notion  of  IR. 

First,  there  emerged  a  clearer  notion  of  Europe 
as  a  system  of  states.  The  German  historian 


Leopold  von  Ranke  contributed  to  this  by  intro¬ 
ducing  the  concept  of  “the  Great  Powers.”  It 
simplified  the  discussion  because  it  reduced  the 
Western  system  of  states  to  a  few  formative  states — 
die  grossen  Machte — that  drove  the  process 
and  imposed  their  rules  of  conduct  on  the  system  at 
large. 

Second,  historians  began  to  explore  the  inter¬ 
state  affairs  of  the  very  recent  past  and,  even,  cur¬ 
rent  events.  In  England,  John  Seeley  and  Edward 
Freeman  made  the  case  for  “contemporary  his¬ 
tory”  as  a  worthy  object  of  study.  In  Germany, 
Heinrich  von  Treitschke  examined  the  nature  and 
evolution  of  states.  In  his  extremely  popular  lec¬ 
ture  series  on  politics,  Treitschke  included  some 
thoughts  on  the  contemporary  relations  among 
states,  during  which  he  noted  that  there  existed 
two  basic  approaches  to  the  subject:  The  first  was 
the  “moralizing  doctrine”  of  the  British  advocates 
of  free  trade;  the  other  was  the  “naturalistic” 
approach,  anchored  in  social  philosophers  like 
Machiavelli  and  Hobbes.  Treitschke  ridiculed  the 
first  and  condoned  the  second. 

Geographers  began,  in  the  wake  of  the  Franco- 
Prussian  War,  to  view  the  state  in  light  of  biological 
theories — Friedrich  Ratzel  and  Johan  Rudolf  Kjellen 
saw  states  as  organisms  and  interstate  relations  as 
competition  for  scarce  resources  and  living  space. 
Economists  observed  novel  relations  in  interna¬ 
tional  trade.  French  writers  were  quick  to  observe 
these  new  relations.  The  economist  Paul  Leroy- 
Beaulieu  wrote  an  influential  book  on  colonialism, 
De  la  colonialisation  chez  des  peoples  modernes 
(Of  Colonization  Among  Modern  Nations,  1874); 
the  sociologist  Jacques  Novicow  sought  to  capture 
interstate  relations  in  social  science  terms  in  his 
Politique  international e  (International  Politics, 
1886).  International  questions  were  discussed  in 
magazines — first  in  the  many  monthlies  and  quar¬ 
terlies  that  emerged  in  the  final  decades  of  the 
century  (The  North  American  Review,  The  Review 
of  Reviews,  Journal  des  deux  mondes  [Review  of 
the  Two  Worlds],  and  many  others)  and  then  in 
professional  social  science  journals  ( Political 
Science  Quarterly,  American  Political  Science 
Review,  and  Journal  of  Race  Development). 

A  most  significant  contributor  to  the  evolution 
of  IR  theory  was  a  new  movement  in  international 
law.  It  was  explicitly  cosmopolitan  and  sponsored 
by  an  international  network  of  legal  activists.  They 
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founded  a  journal,  Revue  de  droit  international  et 
legislation  compared  (RDI-,  Journal  of  International 
and  Comparative  Law),  as  well  as  the  Institute  of 
International  Law  in  Ghent.  They  were  wedded  to 
the  idea  that  international  conflict  could  be  avoided 
by  the  elaboration  of  international  law — indeed, 
they  believed  that  conflicts  could  be  solved  by  the 
establishment  of  international  institutions  devoted 
to  adjudication  and  arbitration. 

One  of  them,  the  Scottish  scholar  James 
Lorimer,  wrote  an  essay  that  reduced  centuries  of 
scholarly  speculation  to  a  few  basic  points.  In  an 
essay  for  the  RDI  in  1871,  titled  “Final  Problem 
in  International  Law,”  Lorimer  explained  that 
international  politics  is  played  out  in  a  community 
of  states  whose  members  are  sovereign  and  that  it, 
as  a  result,  lacks  a  central  body  of  legislation.  It  is, 
thus,  a  lawless  society — an  “anarchy,”  as  Lorimer 
dubbed  it.  Lawlessness  is,  however,  not  the  same 
as  orderlessness,  continued  Lorimer.  Order  is 
imposed  on  this  anarchical  society  by  two  mecha¬ 
nisms:  (1)  the  political  principle  of  balance  of 
power  and  (2)  the  economic  principle  of  interde¬ 
pendence.  Since  neither  result  in  a  stable  order,  it 
will  be  necessary  to  add  a  third  ordering  princi¬ 
ple — namely,  a  centralized  legislature  and  mecha¬ 
nisms  that  can  enforce  its  laws. 

In  the  liberal  states  of  the  West,  other  activists 
entertained  similar  views.  A  growing  number  of 
people  joined  organizations  that  opposed  war  and 
promoted  peace.  Members  of  national  assemblies 
established  the  Inter-Parliamentary  Union  (1889) 
designed  to  solve  conflicts  and  ease  international 
tensions.  Such  efforts  contributed  greatly  to  the 
Peace  Conference  in  Hague  in  1899,  which,  in 
turn,  established  the  Permanent  Court  of  Arb¬ 
itration.  This  Hague  conference  was  attended  by 
nearly  all  the  sovereign  states  in  the  world.  And  it 
was  eagerly  supported  by  more  than  400  peace 
organizations. 

This  optimistic  atmosphere  waned  during  the 
final  years  of  the  century  by  the  new  and  darker 
mood  of  Realpolitik.  The  German  historian  Hein¬ 
rich  von  Treitschke  expressed  this  new  sentiment 
with  his  portrayal  of  politics  as  a  competition 
among  territorial  states  for  power  and  influence. 
This  change  was  noted  by  the  American  historian 
and  diplomat  Paul  Reinsch  in  World  Politics 
(1900).  The  “age  of  reason”  was  in  his  opinion 
replaced  by  an  “age  of  force.”  The  change  was, 


according  to  Reinsch,  hastened  by  the  Franco- 
Prussian  War  (1871),  which  led  to  the  unification 
of  Germany  and  to  the  alteration  of  Europe’s 
established  political  relations.  It  was  also  hastened 
by  the  growth  of  industrialism  and  nationalism. 

The  new  Zeitgeist  was  well  represented  by  the 
advent  of  social  Darwinism  and  by  the  new  impe¬ 
rialism,  which  Reinsch  held  to  be  one  of  the  most 
influential  ideas  of  the  age.  Other  ideas  also 
emerged  in  the  wake  of  the  Franco-Prussian  War 
and  affected  scholarly  discussions  of  war  and 
peace.  Among  them  were  liberal  and  radical  ideas 
that  protested  the  new  militarism  and  warned 
against  new  wars  on  the  continent.  Critical  voices 
were  carried  by  members  of  the  evolving  social 
sciences:  geography,  economics,  sociology,  and 
political  science  all  made  their  imprint  on  the  sys¬ 
tematic  study  of  war,  peace,  and  IR.  Norman 
Angell’s  Great  Illusion  (1910),  for  example,  drew 
on  liberal  theories  of  finance  and  trade  to  warn 
against  war  in  the  industrial  age  and  to  condemn 
statesmen  who  foolishly  still  imagined  that  wars 
could  be  won.  Weapons  systems  are  now  so 
destructive  and  industrial  nations  are  so  tightly 
tied  together  in  webs  of  interdependence  that  war 
among  them  would  bring  disaster  on  them  all. 

World  War  I  and  After 

The  debate  about  IR  has  a  long  prehistory.  World 
War  I  converted  this  debate  into  a  field  of  schol¬ 
arly  study.  It  happened  quickly,  as  a  sudden  burst 
when  the  war  deteriorated,  counter  to  all  military 
opinions,  into  a  stalemate  of  deadly  trench  war¬ 
fare.  Between  1914  and  1917,  the  vast  armies  of 
Europe’s  industrial  states  shelled  each  other  sense¬ 
less  at  the  average  cost  of  100,000  young  soldiers’ 
lives  per  week.  How  could  this  meaningless 
slaughter  happen?  How  could  it  be  brought  to  an 
end?  What  could  be  done  to  prevent  a  similar 
catastrophe  in  the  future?  Statesmen  and  scholars 
struggled  to  answer  these  questions.  And  the 
scholarly  study  of  IR  emerged  as  one  of  the  results. 

Similar  questions  had  long  been  discussed  by 
historians,  lawyers,  and  antiwar  activists.  Two 
additional  groups  added  their  voices  and  helped 
boost  the  scholarly  study  of  IR.  The  first  group 
consisted  of  wealthy  industrialists  who  had  spon¬ 
sored  research  on  war  and  education  for  peace 
before  World  War  I — men  like  Alfred  Nobel, 
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Andrew  Carnegie,  and  Richard  Garton.  The  sec¬ 
ond  group  consisted  of  political  operatives — men 
like  James  Bryce  and  Philip  Noel  Baker  in  England 
and  Woodrow  Wilson  in  the  United  States.  For 
with  the  senseless  destruction  of  World  War  I,  civil 
servants,  diplomats,  statesmen,  and  their  advisers 
also  grew  increasingly  concerned  with  issues  of 
war  and  peace. 

War  as  a  Catalyst 

The  outbreak  of  war  in  1914  spurred  a  flurry  of 
attempts  to  explain  its  causes  and  identify  the  con¬ 
ditions  for  a  lasting  postwar  peace.  G.  Lowes 
Dickinson,  a  classics  scholar  at  Cambridge  Uni¬ 
versity,  argued  that  although  Prussian  militarism 
was  an  immediate  cause  of  the  war,  the  deeper 
cause  was  the  lawless  or  “anarchic”  state  of  affairs 
among  sovereign  states.  He  justified  his  claim  in  a 
slim  1916  book,  The  European  Anarchy. 

Other  authors  and  activists,  too,  designed  pro¬ 
posals  for  such  ordering  organizations.  In  the 
United  States,  Yale  mathematician  Irving  Fisher 
presented  an  idea  for  a  “league  of  peace.”  A  most 
influential  scheme  was  presented  by  Leonard 
Woolf,  a  British  civil  servant  turned  author  and 
publisher.  His  International  Government  (1916) 
deeply  influenced  diplomats  and  statesmen  like 
Robert  Cecil,  Jan  Smuts,  and  Woodrow  Wilson. 
At  the  Peace  Conference  in  Paris  (1919),  President 
Wilson  relied  on  Woolf’s  argument  to  push  for  a 
League  of  Nations. 

Wilson’s  initiative  transformed  international 
politics.  His  arguments  at  the  Paris  Peace  Con¬ 
ference  challenged  the  views  of  old  statesmen.  His 
vision  of  a  League  of  Nations  appealed  greatly  to 
the  broad  public.  It  formulated,  among  other 
things,  a  need  for  research  and  education  in  inter¬ 
national  affairs. 

While  Wilson  met  with  other  statesmen  in  Paris 
to  convince  them  of  his  vision  for  a  postwar 
League,  legal  experts  and  scholarly  advisers  met  to 
discuss  postwar  collaboration  in  international 
research  and  education.  The  world’s  first 
Department  of  International  Politics  was  estab¬ 
lished  at  the  University  of  Wales,  Aberystwyth,  in 
1919 — financed  by  a  generous  endowment  from 
the  wealthy  industrialist  David  Davies.  Institutes  of 
International  Affairs  were  founded  in  Britain  and 
the  United  States  in  1920.  The  London  School  of 
Economics  and  Political  Science  (LSE)  established 


a  department  of  IR  (1924).  In  the  United  States, 
comparable  departments  were  established  at 
Georgetown  University  (Edmund  A.  Walsh  School 
of  Foreign  Service,  1919),  University  of  Southern 
California  (School  of  International  Relations, 
1924),  and  University  of  Chicago  (The  Committee 
on  International  Relations,  1928),  to  mention  but 
a  few.  The  impression  left  by  this  development  is 
that  the  scholarly  study  of  IR  emerged  during  the 
1920s  as  an  English-language  discipline — in  spite 
of  the  fact  that  Switzerland  established  the  first 
school  entirely  devoted  to  IR  (Geneva’s  Graduate 
Institute  of  International  Studies,  1927). 

Early  IR  continued  the  perpetual  peace  tradi¬ 
tion.  Kant’s  famous  essay  Perpetual  Peace  was  a 
central  text  in  many  early  reading  lists  that  were 
steeped  in  a  liberal  faith  in  free  trade,  popular 
democracy,  and  collective  security.  Contemporary 
popular  textbooks  were  written  by  Ernest  Satow, 
David  P.  Heatley,  S.  H.  Allen,  and  P.  Potter.  They 
tended  to  discuss  the  causes  of  war  and  the  precon¬ 
ditions  for  peace — and  commonly  saw  the  causes 
of  war  as  irrationality  or  ill  will  and  the  precondi¬ 
tions  for  peace  in  rational  diplomacy  with  an  eye 
toward  establishing  norms,  rules,  and  interna¬ 
tional  institutions. 

Idealism  and  the  First  Great  Debate 

These  first  years  of  scholarly  IR  were  con¬ 
ducted  in  President  Wilson’s  shadow.  This  is  easy 
to  understand:  In  the  wake  of  a  shockingly 
destructive  war,  the  tired  populations  of  the  West 
cleaved  to  Wilson  and  his  League  as  their  one 
great  hope  (he  was  awarded  the  Nobel  Peace  Prize 
in  1919  and  was  generally  hailed  as  a  worthy 
recipient).  Also,  the  League  was  in  its  earliest 
years  an  active  and  apparently  effective  organiza¬ 
tion.  The  foreign  ministers  of  the  Great  Powers 
attended  League  meetings  and  used  the  League 
machinery  to  improve  relations  and  settle  their 
differences. 

Not  all  observers,  however,  shared  this  general 
confidence  in  human  reason,  trade,  and  interna¬ 
tional  law.  Some  still  saw  world  events  through 
prisms  of  competition  and  power.  Their  case  was 
strengthened  as  world  events  were  marked  by 
deepening  economic  crises  and  interstate  competi¬ 
tion,  and  the  League  met  with  declining  success. 
The  League  could  not  prevent  the  Chaco  War, 
Japan’s  occupation  of  Manchuria,  Italy’s  invasion 
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of  Abyssinia,  or  Germany’s  rearmament.  The 
changing  relations  among  the  Great  Powers,  com¬ 
bined  with  the  advent  of  insistent  rhetoric  drawn 
from  strong  political  ideologies,  altered  the  politi¬ 
cal  climate.  And  the  study  of  IR  altered  with  it. 

There  emerged  authors  who,  invoking  the 
apparent  ineffectiveness  of  the  League,  criticized 
its  cosmopolitan  advocates  and  called  them  “ideal¬ 
ist”  or  “utopian.”  In  the  name  of  “realism,”  they 
rejected  the  idea  that  international  organizations 
such  as  the  League  could  prevent  war.  Authors  like 
Frederick  L.  Schuman,  Frank  H.  Simonds,  and 
James  T.  Shotwell  rejected  the  assumption  that 
undergirded  liberal  internationalism — that  human 
beings  are  inherently  pacific  and,  if  left  to  their 
own  devices,  would  establish  peaceful  relations 
with  one  another.  Many  authors  held  that  humans 
are  basically  self-interested  and  that,  although  they 
pursue  their  egotistical  goals  in  rational  ways,  they 
will  inevitably  enter  into  conflicts  with  one  another. 
Others  argued  that  humans  are  not  fully  rational. 
Still  others  claimed  that  evil  and  sinful  impulses 
have  corrupted  human  reason  and  behavior. 

The  American  theologian  Reinhold  Niebuhr 
drew  on  all  these  arguments.  In  Moral  Man  and 
Immoral  Society  (1932),  Niebuhr  argued  that  indi¬ 
vidual  humans  may  be  both  well-meaning  and 
rational  but  that  these  qualities  do  not  apply  to 
human  collectivities — such  as  states — whose 
behaviors  tend  to  be  both  amoral  and  irrational. 
He  also  argued  that  the  Idealist  and  the  Romantic 
fail  to  understand  the  brutal  character  of  the 
behavior  of  all  human  collectives  and  the  power  of 
self-interest  and  collective  egoism  in  all  intergroup 
relations. 

Such  bleak  assumptions  were  commonly  held 
by  refugees  from  Europe’s  illiberal  states.  Among 
them  were  Jewish  scholars  who  fled  Nazi  Germany 
in  the  1930s  and  settled  in  Britain  or  the  United 
States.  Some  of  them — for  example,  John  Herz  and 
Hans  Morgenthau — found  teaching  jobs  at  U.S. 
universities.  Arnold  Wolfers,  who  taught  at  Yale  in 
the  late  1930s,  argued  that  the  discipline  of  IR  was 
suspended  between  two  poles  or  traditions — a 
liberal  tradition  rooted  in  Locke  and  Vatell  and  a 
realist  tradition  informed  by  Machiavelli  and 
Hobbes. 

This  idea  of  two  traditions  received  a  particu¬ 
larly  influential  formulation  by  Carr.  He  taught  IR 
at  the  University  of  Wales  in  the  late  1930s  and 
divided  the  evolution  of  the  scientific  study  of  IR 


into  two  phases:  first  a  utopian,  League-focused 
phase  that  emerged  in  the  wake  of  World  War  I 
and  then  a  second,  realist  phase  that  emerged  in  the 
1930s  after  a  critical  Great  Debate.  Carr  expressed 
his  two-phased  account  in  The  Twenty  Years’ 
Crisis  (1939).  The  book  was  immensely  influential 
and  his  thesis  of  considerable  didactical  value. 

The  French  philosopher  Alexandre  Kojeve  pre¬ 
sented  an  alternative  account.  And  if  Carr  influ¬ 
enced  a  generation  of  English  IR  scholars,  then 
Kojeve  influenced  a  generation  of  French  social 
philosophers.  Kojeve  was,  like  Carr,  a  Hegelian 
and  argued  that  ideological  strife  ended  with  the 
French  Revolution  and  that  after  Napoleon,  the 
Enlightenment  ideals  of  reason,  rights,  and  equal 
recognition  were  universally  accepted  in  the  West. 
History  was,  in  his  view,  at  an  end.  His  opponents 
begged  to  differ.  They  argued  that  world  events 
were,  in  fact,  deeply  affected  by  ideological  strife 
between  liberalism,  communism,  and  fascism.  This 
clash  was  all  the  more  serious,  they  argued,  as  each 
ideology  had  a  Great  Power  behind  it  to  back  it  up. 

This  late-interwar  discussion  included  a  broad 
range  of  perspectives.  Liberal  internationalism  was 
beleaguered.  On  its  left  were  socialist  and  commu¬ 
nist  parties  and  labor  unions  whose  discussions 
about  war  and  peace  were  informed  by  an  alterna¬ 
tive  internationalism.  Socialists  and  communists 
drew  on  Marxist  political  economy,  elaborated  in 
the  shadow  of  Nikolai  Bukharin’s  and  Vladimir 
Lenin’s  theories  of  imperialism.  On  the  far  right 
were  nationalist  and  fascist  arguments,  often 
informed  by  a  social  Darwinism  that  saw  IR  as  a 
perpetual  struggle  among  countries  and  races  for 
power  and  living  space.  The  fascist  movement  was 
largely  anti-intellectual  and  poor  on  theory;  how¬ 
ever,  there  were  exceptions — the  antiliberal  argu¬ 
ments  of  the  German  lawyer  Carl  Schmitt  being  a 
notable  case. 

World  War  II  and  After 

By  the  end  of  the  1930s,  liberal  internationalism 
had  lost  its  luster,  squeezed  between  realism  and 
radical  approaches — both  of  which  saw  interna¬ 
tional  politics  as  a  struggle  for  power. 

The  war  also  added  geopolitical  dimensions  to 
IR  theorizing.  Whereas  World  War  I  had  largely 
been  a  European  civil  war,  World  War  II  was  a 
global  struggle.  Also,  World  War  II  was  fast  paced: 
If  the  fighting  paused  in  one  theatre,  it  flared  up  in 
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another  region.  Adolf  Hitler  and  Joseph  Stalin 
fought  the  war  with  maps  of  Europe  in  hand; 
Winston  Churchill  and  Franklin  Roosevelt  con¬ 
sulted  globes  in  their  offices. 

The  United  States  emerged  as  the  preeminent 
Great  Power  during  the  course  of  the  war.  President 
Roosevelt  had  two  wartime  aims:  (1)  to  bring 
Germany  and  Japan  to  unconditional  surrender 
and  (2)  to  build  a  strong  antifascist  alliance  of 
states — the  United  Nations  (UN) — which  would 
last  beyond  the  war  and  provide  the  stable  basis 
for  a  postwar  world  order.  Roosevelt  worked  hard 
to  achieve  both  goals.  Like  Wilson  before  him, 
Roosevelt  established  a  worldwide  political  orga¬ 
nization  based  on  the  principle  of  collective  secu¬ 
rity,  the  United  Nations  Organization  (UNO).  In 
addition,  he  worked  to  establish  economic  organi¬ 
zations  to  coordinate  the  world’s  commercial  rela¬ 
tions.  In  1944,  delegates  from  the  UN  alliance  met 
in  the  New  Hampshire  town  of  Bretton  Woods 
and  established  the  International  Monetary  Fund 
(IMF),  the  World  Bank,  and  a  liberal  set  of  trade 
rules  (GATT).  This  Bretton  Woods  system  was 
meant  to  secure  a  postwar  world  of  open,  demo¬ 
cratic  states  in  free  pursuit  of  commercial  ventures. 
Roosevelt’s  scheme  was,  in  other  words,  informed 
by  a  liberal  vision  of  order  and  peace. 

Just  as  American  politicians  emerged  as  leaders 
of  the  liberal  world  in  the  wake  of  World  War  II,  so 
also  did  American  scholars  emerge  as  leading  prac¬ 
titioners  of  scholarly  IR.  Quincy  Wright  was  one  of 
them.  In  1941,  he  published  The  Study  of  War — an 
encyclopedic  tome  of  more  than  1,000  pages. 
Bernhard  Brodie  of  Yale  University  was  another  of 
these  leading  scholars.  Soon  after  the  war,  Brodie 
published  The  Absolute  Weapon  (1946),  a  book 
that  discussed  the  destruction  of  Hiroshima  and 
Nagasaki  and  assessed  the  impact  of  atomic  weap¬ 
ons  on  IR.  Its  message  was  that  atomic  power  will 
change  the  way  we  think  about  strategy  and  diplo¬ 
macy,  and  war  and  peace.  Would  atomic  weapons 
destroy  the  world,  or  would  they  inaugurate  an  era 
of  stable  world  peace,  wondered  Brodie,  echoing 
Kant’s  question  of  150  years  before. 

Realism  and  the  Second  Debate 

The  realist  approach  was  entrenched  when  rela¬ 
tions  between  the  United  States  and  the  former 
USSR  descended  into  enmity  and  competition  in 


the  wake  of  World  War  II.  The  U.S.  diplomat 
George  Kennan  relied  on  traditional  Realpolitik 
when  he  reassessed  the  U.S.-Soviet  relationship  in 
1946  and  recommended  that  President  Harry 
Truman  adopt  a  foreign  policy  stance  based  on  a 
doctrine  of  unwavering  containment. 

Morgenthau,  too,  advocated  a  realist  approach. 
His  Politics  Among  Nations  (1947)  quickly  became 
a  basic  textbook  in  IR.  It  exercised  an  immense 
influence  in  the  United  States  and  elsewhere. 
Realism  was  also  embraced  by  other  influential 
scholars,  like  Georg  Schwartzenberger  in  Great 
Britain  and  Raymond  Aron  in  France.  All  saw  IR 
as  a  high-stakes  game  played  out  in  a  system  of 
sovereign,  self-interested  states  who  all  seek  to 
maximize  power,  security,  and  chances  of  survival. 

Although  realism  dominated  postwar  scholar¬ 
ship,  it  was  not  the  only  approach.  Realism  was 
well  suited  to  capture  the  deteriorating  postwar 
relations  between  the  United  States  and  the  former 
USSR,  but  it  was  hard  to  view  other  state  relations 
through  those  very  same  realist  lenses.  It  was  dif¬ 
ficult  to  grasp  the  development  issues  of  non- 
Western  states — their  efforts  to  achieve  political 
independence,  economic  growth,  and  peaceful 
interrelations  did  not  suit  the  realist  vocabulary 
well.  It  was  also  hard  to  capture  some  Western 
issues.  Relations  between  North  and  South  America 
were  a  case  in  point;  communication  or  discussion 
in  this  context  was  often  phrased  in  radical — 
sometimes  even  Marxist — terms  of  exploitation 
and  repression.  Relations  among  the  United  States 
and  Western  Europe  were  also  hard  to  grasp  in 
realist  terms;  they  were  commonly  discussed  in 
terms  of  liberal  internationalism  instead.  Postwar 
relations  among  the  states  of  Western  Europe  were 
also  marked  by  a  degree  of  cooperation  and  coor¬ 
dination  insufficiently  captured  by  the  old  realist 
concepts  of  “alliance”  and  “league.”  Politicians 
like  Jean  Monnet  and  academics  like  Ernest  Haas 
used  terms  such  as  integration  and  federation  to 
describe  new,  tighter,  and  more  peaceful  forms  of 
interstate  relations. 

Also,  realism  showed  signs  of  deteriorating.  It 
provided  a  convenient  discourse  for  warning 
against  Soviet  expansionism,  but  leading  realist 
scholars  (like  Niebuhr,  Morgenthau,  and  Kennan) 
cautioned  against  the  uncritical  use  of  terms  such 
as  balance  and  containment  to  justify  ideological 
crusades  against  communism.  Another  movement 
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emerged  in  the  United  States  whose  members  criti¬ 
cized  the  realists  for  their  excessive  reliance  on 
“wisdom”  and  on  the  anecdotal  use  of  historical 
“interpretation.”  They  argued  that  IR  should  be 
studied  through  more  scientific  approaches  with 
an  eye  to  establishing  patterns  or  regularities  in 
interstate  behavior.  They  compiled  data  sets  on 
conflicts  and  wars  and  used  statistics  to  identify 
regularities  and  patterns  in  them.  This,  it  was 
argued,  would  enable  IR  to  establish  general  prop¬ 
ositions  about  interstate  behavior  and  escape  the 
old  “interpretive”  approaches. 

Wright  had  paved  the  way:  His  painstakingly 
compiled  data  of  centuries  of  past  wars  and  his 
statistical  analyses  provided  the  basis  for  his  ency¬ 
clopedic  Study  of  War  (1941) — this  substantial 
tome  was  equipped  with  figures,  charts,  and  fold- 
out  tables.  The  advocacy  of  quantitative  methods 
triggered  a  new  “great  debate”  in  the  IR  commu¬ 
nity.  This  debate  was  part  of  a  larger  “behaviorist” 
orientation  in  the  social  sciences  and  related  to  the 
development  of  computer  technologies  and  the 
hope  of  subjecting  vast  data  sets  to  statistical 
analyses. 

New  Schools  and  Approaches 

This  Second  Great  Debate  found  its  classic  rep¬ 
resentation  in  an  exchange  between  J.  David 
Singer  and  Hedley  Bull.  Singer  was  a  behavioralist 
from  the  University  of  Michigan  and  an  early  com¬ 
piler  of  data  on  conflicts  and  war — such  as  the 
important  Correlates  of  War  Project.  Bull  was  an 
influential  traditionalist  scholar  from  the  LSE  and 
Oxford  and  a  member  of  the  British  Committee  on 
International  Relations,  the  members  of  which  had 
developed  an  alternative  approach  that  stressed 
cooperation  over  confrontation — an  English 
School  that  had  Grotius  and  his  notions  of  law  and 
society  as  major  sources  of  inspiration. 

The  debate  between  the  two  was  largely  a  dis¬ 
cussion  of  methodology.  Other  issues  were  soon 
pulled  into  its  wake,  however.  One  such  issue  con¬ 
cerned  the  value  of  IR  as  a  field  of  knowledge;  the 
other  its  proper  object  of  study. 

Kenneth  Waltz  touched  on  all  these  issues  in  his 
book  Man,  the  State,  and  War  (1959).  Waltz 
sought  to  identify  and  map  the  main  arguments 
that  philosophers  and  statesmen  had  applied  to  the 
study  of  IR  through  the  ages.  He  isolated  their 


theoretical  cores  and  categorized  them  according 
to  three  “images” — that  is,  whether  they  were 
located  at  the  individual,  the  social,  or  the  systemic 
level  of  analysis. 

Waltz’  book  anchored  IR  theory  in  history  and 
defined  the  scholarly  field  as  part  of  a  rich  tradition 
of  political  ideas.  LSE  professor  Martin  Wight  did 
much  of  the  same  thing.  He  too  identified  three 
schools  of  thought  in  IR  and  called  them  realism, 
rationalism,  and  revolutionism.  Each  school  repre¬ 
sented  a  distinct  ontology,  argued  Wight — a  dis¬ 
tinct  view  of  the  world.  Realism  sees  the  world  in 
terms  of  states,  state  interests,  and  power;  rational¬ 
ism  perceives  the  world  as  inhabited  by  rational 
humans  driven  toward  cooperation;  and  revolu¬ 
tionism  observes  the  world  through  the  lenses  of 
right  and  wrong  and  dreams  of  a  peaceful  and 
unified  world.  Each  approach  constituted  a  distinct 
school  of  thought,  sustained  by  a  distinct  tradition, 
argued  Wight.  He  demonstrated  how  philosophers 
like  Hobbes,  Grotius,  Kant,  and  others  had  con¬ 
tributed  to  these  schools  in  various  ways. 

As  the  1950s  evolved  into  the  1960s,  new  con¬ 
troversies  were  added  to  the  scholarly  debate.  For 
a  time,  the  influential  schemes  of  Waltz  and  Wight 
helped  keep  a  certain  order  on  the  debate.  But 
controversies  soon  emerged  that  defied  easy  cate¬ 
gorization.  Within  the  scientist  camp,  for  example, 
there  emerged  a  political  dividing  line.  On  one 
side  were  scientists  who  were  concerned  with 
issues  of  security.  These  scientists  collaborated 
with  the  U.S.  government  and  established  a  new 
field  of  security  studies;  among  them  were  schol¬ 
ars  like  Albert  Wohlstetter  and  Thomas  Schelling, 
who  were  associated  with  the  RAND  Corporation, 
a  private  research  center  that  was  largely  funded 
by  the  U.S.  Air  Force.  On  the  other  side  were 
social  scientists  who  were  concerned  with  justice 
and  peace;  among  them  were  the  economist 
Kenneth  Boulding  and  the  sociologist  Johan 
Galtung.  These  scholars  updated  the  old  banner  of 
the  peace  movement  to  include  opposition  against 
atomic  weapons  and  claimed  a  new  field  of  peace 
research. 

These  two  scientific  communities  contributed 
importantly  to  IR  over  subsequent  decades.  They 
were  very  similar  in  methodological  orientation, 
but  they  disagreed  strongly  on  epistemological  and 
ontological  issues.  They  were  matching  bookends, 
enclosing  the  social-scientific  approaches  to  issues 
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of  war  and  peace  but  from  opposite  sides,  dis¬ 
agreeing  strongly  on  the  ultimate  nature  of  politics 
and  the  reasons  for  studying  it. 

Systems  and  Structures 

Political  discussions  intensified  as  the  Cold  War 
entered  a  new  phase.  In  the  wake  of  World  War  II, 
colonies  in  Africa  and  Asia  had  demanded  self- 
government.  As  the  old  imperial  powers  were 
reluctant  to  grant  such  independence,  the  locals 
organized  liberation  movements  to  fight  for  sover¬ 
eignty  and  self-rule.  The  1960s  saw  a  growing 
number  of  armed  struggles  directed  against  French 
and  British  rule. 

Soviet  Premier  Nikita  Khrushchev  observed  that 
armed  rebels  challenged  the  imperialist  powers  of 
the  capitalist  West  and  began  to  support  them.  The 
Americans  responded,  predictably,  by  trying  to 
contain  what  they  saw  as  Soviet  expansionism. 
This  quickly  involved  U.S.  forces  in  conflicts 
around  the  world,  and  American  statesmen  and 
scholars  were  confronted  with  a  horrible  dilemma. 

The  U.S.  containment  of  anticolonial  move¬ 
ments  in  the  Third  World  led  the  United  States  to 
side  with  the  old  colonial  powers  of  Europe  and 
seriously  undermined  America’s  liberal  advocacy 
of  free  trade  and  anticolonialism.  This  became 
painfully  obvious  as  the  United  States  increased  its 
support  for  the  French  war  effort  against  anticolo¬ 
nial  rebels  in  Indochina.  In  1954,  the  French  pulled 
out,  and  the  United  States  continued  the  war  in 
Vietnam  on  its  own. 

The  escalation  of  the  Vietnam  War  changed  the 
world’s  perception  of  the  United  States  and  fueled 
criticism  of  U.S.  policies.  Members  of  the  profes¬ 
sion  engaged  in  the  general  debate  on  colonial 
independence,  sovereignty,  development,  and  the 
reasons  behind  U.S.  engagements  in  foreign  wars. 
Professors  and  students  drew  on  increasingly  radi¬ 
cal  perspectives  that  portrayed  U.S.  behavior  as 
driven  by  the  internal  needs  of  the  capitalist  system 
and  portrayed  the  United  States  as  a  prowling, 
neo-imperialist  Great  Power.  Some  added,  invok¬ 
ing  the  Marxist  notion  of  false  consciousness,  that 
the  capitalists  legitimized  their  expansion  through 
the  production  of  false  ideology  and  presented  it  as 
universal  truth.  Such  radical  arguments  were  also 
voiced  by  non-Western  political  leaders — Mao 
Zedong,  Fidel  Castro,  “Che”  Guevara,  and  Ho 


Chi  Minh.  They  gained  adherents  among  radical 
IR  scholars. 

One  effect  of  this  development  was  to  boost 
what  Wight  had  called  the  revolutionary  approach 
to  IR  and  elevate  it  on  par  with  the  two  established 
traditions  of  realism  and  rationalism.  By  1970,  IR 
was  commonly  presented  as  having  three  rivaling 
approaches  or  schools  or — invoking  Thomas 
Kuhn’s  fashionable  term — three  “paradigms,” 
each  with  its  own  distinct  ontology. 

Another  effect  was  to  challenge  the  traditional 
concept  of  power — one  of  the  key  concepts  of  IR. 
The  traditional  definition  viewed  power  in  terms 
of  specific  resources — as  the  outcome  of  a  system¬ 
atic  application  of  force,  wealth,  or  propaganda. 
Discussions  about  colonialism  and  conflict,  and 
wars  and  wealth  in  the  Third  World  pulled  alter¬ 
native  definitions  of  power  in  its  wake.  Power 
could,  for  example,  be  seen  in  terms  of  social 
relationships — a  state  possesses  power  by  virtue  of 
its  position  in  a  social  structure  or  its  ability  to 
define  the  main  rules  of  interaction.  By  this  view, 
the  Great  Powers  of  the  West  derived  their  supe¬ 
rior  resources  and  privileges  from  their  ability  to 
maintain  an  international  world  economy  that 
enriched  them  while  they  exploited,  impoverished, 
and  repressed  everyone  else. 

A  third  effect  of  this  was  to  direct  attention 
toward  what  Waltz  had  called  the  systemic  level  of 
analysis.  Revolutionist  theorists  included  sweeping 
discussions  of  the  inherent  needs  of  the  capitalist 
system.  They  developed  new  theories  of  imperial¬ 
ism.  Andre  Gunder  Frank  portrayed  the  United 
States  as  a  metropole  and  other  countries  as  satel¬ 
lites  and  identified  the  mechanisms  that  enriched 
the  first  but  impoverished  the  second.  Immanuel 
Wallerstein’s  Modern  World  System  (1974)  pre¬ 
sented  500  years  of  world  history  in  terms  of  inter¬ 
actions  between  a  powerful  and  wealthy  world 
core  and  a  poor  periphery.  This  analysis  of  the 
modern  world  economy  had  a  deep  and  long- 
lasting  influence  on  IR  scholarship. 

Realist  authors,  too,  developed  systemic 
approaches.  Waltz  defined  the  interstate  system  in 
terms  of  the  distribution  of  capabilities  among 
major  power  centers  or  “poles.”  His  Theory  of 
International  Politics  (1979)  presented  a  reformula¬ 
tion  of  the  old  balance-of-power  theory,  anchoring 
it  in  sociological  theory  and  in  microeconomic  rea¬ 
soning.  Robert  Gilpin  presented  a  similar  argument 
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in  his  War  and  Change  (1981).  He  argued  that 
Great  Powers  always  sought  to  impose  their  own 
rules  on  international  games  in  an  effort  to  bring 
about  a  world  order  that  suited  their  own  interests. 
He  then  portrayed  the  history  of  the  modern  world 
as  a  system  in  which  Great  Powers  had  regularly 
risen  to  and  fallen  from  hegemonic  status,  igniting 
major  “hegemonic  wars”  in  the  process,  with  each 
new  hegemon  imposing  its  own  order  on  the  post¬ 
war  world.  George  Modelski  and  William  Thompson 
followed  up  this  idea  and  developed  a  theory  of  long 
cycles  or  waves  of  hegemonic  leadership. 

Authors  from  the  rationalist  tradition  followed 
suit.  Robert  Keohane  presented  a  systemic  alterna¬ 
tive  to  Waltz  and  Gilpin’s  neorealism.  The  fall  of  a 
Great  Power  is  not  necessarily  attended  by  war 
and  followed  by  a  new  world  order,  he  argued.  If 
U.S.  hegemony  were  to  unravel,  the  system  that  the 
United  States  established  after  World  War  II  is  so 
rational  and  serves  everyone  so  well  that  it  will  not 
be  essentially  changed  by  America’s  demise,  argued 
Keohane  in  After  Hegemony  (1984).  The  demise 
of  the  United  States  will,  in  other  words,  not  neces¬ 
sarily  cause  strife  and  conflict. 

By  the  mid-1980s,  then,  the  scholarly  field  of  IR 
was  dominated  by  three  main  approaches,  each 
with  grand,  systemic  theories  at  its  core.  These 
theories  were  all  affected  by  economic  reasoning: 
the  radical  approach  most  directly,  as  it  relied  on 
concepts  and  arguments  from  classical  political 
economy,  and  the  realist  and  rationalist  approaches 
more  subtly.  Waltz  drew  on  theories  of  neoclassi¬ 
cal  oligopolistic  competition,  while  Keohane  drew 
on  neo-institutionalist  economics.  Both  incorpo¬ 
rated  game  theory.  Scholars  who  explored  these 
two  traditions  further  found  so  much  similar 
ground  that  they  began  to  transcend  the  ancient 
rivalry  of  realism  and  rationalism  and  merged  into 
a  so-called  neo-neosynthesis. 

The  End  of  the  Cold  War 
and  the  Ideational  Turn 

Systemic  theories  met  with  much  skepticism. 
Realists  claimed  that  such  theories  were  static  and 
could  not  easily  account  for  change.  Rationalists 
argued  that  they  were  abstract  and  excluded 
human  elements.  International  politics  involve 
people  who  make  decisions  and  shape  events,  they 
argued.  States  do  not  balance;  statesmen  do.  And 


when  statesmen  decide  to  balance,  they  do  not 
heedlessly  react  to  abstract  concepts  such  as 
“capabilities”;  they  balance  against  what  they  per¬ 
ceive  as  concrete  and  tangible  threats. 

Some  scholars  retorted  that  statesmen  may  well 
act  on  perceptions,  but  such  perceptions  are  in 
turn  part  of  larger  systems — of  “epistemological 
communities” — that  socialize  statesmen  into  the 
collective  norms  of  distinct  societies  and  thus  con¬ 
dition  their  perceptions  of  the  world. 

This  was  the  vantage  point  of  Richard  Ashley 
whose  “Poverty  of  Neorealism”  (1984)  amounted 
to  a  sweeping  condemnation  of  Waltz.  Neorealist 
theory  is  a  distinct  epistemic  system,  argued 
Ashley;  it  is  totalizing  and  repressive  and  invokes 
the  authority  of  “science”  to  justify  its  claim  for 
objective  knowledge  where  no  such  claim  could 
exist,  he  argued.  Other  writers  added  their  voices 
to  this  critical  charge.  Neither  neorealism  nor  any 
other  IR  theory  offer  objective  knowledge  of  the 
world.  They  are  all  mere  reflections  of  distinct 
epistemic  communities  and  contingent  on  particu¬ 
lar  conditions  in  time  and  space.  This  was  the  mes¬ 
sage  of  the  scholars  who  contributed  to  James  Der 
Derian’s  book  International/Intertextual  Relations 
(1989).  They  portrayed  IR  as  resting  on  a  flimsy 
foundation  of  subjective  assumptions  that  belonged 
to  a  narrow  group  of  privileged,  largely  White, 
middle-class  males  who  forced  their  articulations 
on  more  marginalized  others,  whose  voices  were 
consequently  silenced  by  the  imposition. 

This  criticism  was  not  new.  But  whereas  earlier 
criticism  had  flowed  from  materialist  social  science 
theories  (e.g.,  from  the  Marxist  false-ideology  the¬ 
sis),  the  critical  claims  of  the  1980s  hinged  on 
ideational  philosophies — on,  for  example,  semiot¬ 
ics,  linguistics,  or  symbolic  interactionism  (often 
inspired  by  Parisian  postmodernists).  These  phi¬ 
losophies  became  a  springboard  of  the  poststruc¬ 
turalist  turn  that  affected  the  social  sciences  in  the 
1990s. 

Globalization  and  Change 

Several  long-term  trends  conspired  to  enhance 
the  force  of  this  poststructural  turn  in  IR.  Two  of 
them  are  indicated  by  the  terms  communication 
revolution  and  globalization.  The  means  and 
modes  of  international  transport  were  changed  by 
new  technologies.  Goods,  money,  text,  and  images 
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were  brought  faster  and  cheaper  around  the  world, 
fueling  a  steady  expansion  of  commerce  and  credit. 
Easier  travel  encouraged  increased  migration,  and 
whereas  earlier  waves  of  migration  had  brought 
Europeans  out  into  the  world,  a  new  wave  brought 
non-Europeans  in.  Such  trends  were  long  studied 
in  terms  of  the  old  IR  concept  of  interdependence. 
During  the  1980s,  however,  when  the  introduc¬ 
tion  of  relatively  inexpensive  personal  computers 
represented  a  quantum  leap  in  worldwide  inter¬ 
connectedness,  new  concepts  and  theories  were 
applied — such  as  textual  interrelationships,  sym¬ 
bolic  interaction,  and  multiple  identities. 

The  liberal  democracies  of  the  West  adapted  to 
globalization.  They  did  not  always  do  it  peace¬ 
fully — national  groups  with  traditional  truth 
claims  and  monocultural  assumptions  clashed 
with  immigrant  demands  for  tolerance  and  under¬ 
standing.  Communist  dictatorships,  however,  did 
not  adapt  well.  Around  1990,  the  Soviet  Union 
collapsed — for  a  variety  of  reasons;  the  refusal  to 
adapt  to  the  communication  revolution  for  fear  of 
losing  political  control  was,  perhaps,  the  most 
important  one. 

The  Soviet  collapse  altered  world  politics.  When 
Russia,  the  former  USSR’s  crisis-ridden  and  non¬ 
communist  successor,  was  in  no  shape  to  assume 
the  role  of  America’s  superpower  rival,  the  Cold 
War  ended.  The  collapse  also  altered  the  percep¬ 
tion  of  world  politics.  On  the  one  hand,  it  reduced 
the  relevance  of  neorealism  and  revolutionism — the 
fall  of  the  former  USSR  weakened  neorealism  by 
striking  at  its  assumption  of  a  bipolar  balance;  rev¬ 
elations  of  the  cruelties  and  the  deep  illegitimacy  of 
the  Soviet  system  weakened  the  ideological  appeal 
of  revolutionism.  On  the  other  hand,  the  Soviet 
collapse  enhanced  the  relevance  of  ideational 
approaches.  For  it  was  not  only  the  material  capa¬ 
bilities  of  the  old  USSR  that  collapsed  around 
1991,  it  was  also  the  Stalinist  system,  and  it  was 
the  exhaustion  of  the  old  communist  outlook  that 
altered  the  international  climate  and  ended  the 
Cold  War.  Many  observers  concluded  that  ideas 
shape  policy  and  that  power  capabilities  may  be 
the  foreign  policy  tools  of  states  but  that  ideas 
define  the  uses  to  which  the  tools  are  put.  Alexander 
Wendt  made  this  point  in  a  much  discussed  1992 
article,  “Anarchy  Is  What  States  Make  of  It.” 

Finally,  the  Soviet  collapse  embarrassed  the 
more  formal  or  scientific  approaches  to  IR.  It  came 


as  a  total  surprise  to  scholars  who  had  long  argued 
that  the  scientific  method  could  produce  general 
explanations  of  the  past  as  well  as  predictions  for 
the  future.  Their  failure  to  predict  the  Soviet  col¬ 
lapse  embarrassed  them  and  made  their  scientific 
approach  vulnerable  to  criticism.  A  new  Great 
Debate  ensued.  It  was  all  the  more  heated  as 
IR’s  entire  philosophy  of  science  was  suddenly 
questioned. 

Fragmentation,  Realignment, 
and  Ideational  Approaches 

The  conceptual  toolbox  of  IR  became  richer 
and  more  diverse  during  the  1990s.  It  was  filled 
with  new  approaches  imported  from  other  disci¬ 
plines — neo-institutionalism,  gender  theory,  cul¬ 
ture  studies,  postcolonial  analysis,  and  literary 
criticism  among  them.  The  substantial  foci  of  IR, 
however,  changed  less.  One  of  the  established 
issues,  the  relationship  of  the  Great  Powers, 
reemerged  in  the  wake  of  the  Cold  War:  With  the 
dissolution  of  the  USSR,  the  world  was  no  longer 
a  bipolar  system,  but  what  kind  of  system  was  it? 
The  United  States  emerged  from  the  Cold  War  as 
the  only  remaining  superpower,  but  what  kind  of 
power  did  it  have? 

Joseph  Nye  adumbrated  an  answer  in  Bound  to 
Lead  (1990):  namely,  that  the  United  States  led  a 
unipolar  system  by  virtue  of  its  ideas,  ideals,  and 
example — that  is,  by  its  “soft  power.”  Other 
authors  made  similar  points.  Samuel  Huntington 
argued  in  The  Clash  of  Civilizations  (1993)  that 
the  Soviet  collapse  would  be  followed  by  an  age 
deeply  affected  by  conflicts  of  faith  and  civiliza- 
tional  ideas  and  ideals.  Robert  Kaplan  wrote  influ¬ 
ential  reports  from  regions  of  conflict — Soldiers  of 
God  (1990),  Balkans  Ghosts  (1993),  and  The 
Ends  of  the  Earth  (1996) — and  described  these 
conflicts  as  driven  by  religious  faith  and  ethnic 
identities. 

Such  claims  stirred  much  debate.  Francis 
Fukuyama  protested  the  notion  that  future  wars 
would  be  civilizational  in  nature.  He  argued 
instead  that  war  would  steadily  diminish  in  the 
post-Cold  War  world  and  that  there  was  now  only 
one  sustainable  route  to  modernity:  the  liberal- 
democratic  path.  With  communism  on  the  ropes, 
liberal  democracy  had  emerged  dominant.  Fiberal 
definitions  of  reason,  rights,  and  equal  recognition 
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were  striking  roots  everywhere,  pushing  religion, 
nationalism,  and  other  philosophical  relics  into  the 
dustbin  of  history.  The  age  of  the  great  ideational 
struggles  was  over,  argued  Fukuyama.  Not  only  is 
the  Cold  War  at  an  end,  history  itself  is  at  an  end, 
he  concluded  in  The  End  of  History  and  the  Last 
Man  (1992) — with  a  nod  to  Friedrich  Flegel  and 
Alexandre  Kojeve.  Bruce  Russett  formulated  a  simi¬ 
lar  argument  but  from  a  scientist’s  perspective — and 
with  a  nod  to  Kant.  In  Grasping  the  Democratic 
Peace  (1993),  he  argued  that  democratic  states 
trade  with  one  another,  evolve  interdependencies, 
and  common  interests.  Democracies  do  not  go  to 
war  with  each  other.  By  this  logic,  the  spread  of 
democracy  was  practical  peace  work. 

Fukuyama  and  Russett  were  right  about  one 
thing:  The  number  of  interstate  wars  dropped  after 
the  Cold  War.  But  Huntington  and  Kaplan,  too, 
had  a  valid  point:  There  were  regions  in  the  world 
that  did  not  conform  to  the  worldwide  trend  of 
growing  order  and  peace.  Conflict  and  war  still 
marked  an  area  between  the  eastern  Mediterranean 
and  the  Indian  Ocean — a  vast  region  from  the 
Balkans  via  the  Middle  East,  along  the  Caucasus 
and  the  Hindu  Kush,  to  the  Bay  of  Bengal.  Some 
of  its  conflicts  were  related  to  the  retreat  of  the 
former  USSR  and  some  to  the  expansion  of  the 
United  States.  Most  of  them  involved  national  and 
religious  identities. 

This  was  worrisome.  For  this  vast  and  varied 
region  of  sustained  conflict  also  contained  the 
world’s  richest  stores  of  oil;  Western  statesmen 
feared  that  conflict  would  disrupt  its  extraction 
and  smooth  transport.  This  region  was  also  pre¬ 
dominantly  Muslim;  Western  analysts  feared  that 
conflict  might  produce  new  and  radical  mutations 
of  political  Islam  as  well  as  proliferation  of  nuclear 
arms  and  other  weapons  of  mass  destruction 
(WMDs) — especially  if  crisis-ridden  Russia  lost 
control  of  its  old  Soviet  WMD  stockpiles. 

Such  fears  soared  to  new  heights  on  September 
11,  2001,  when  Islamist  terrorists  crashed  hijacked 
passenger  jets  into  the  Pentagon  near  Washington, 
D.C.,  and  into  Manhattan’s  World  Trade  Center. 
The  newly  elected  president  George  W.  Bush 
responded  by  striking  out  against  Afghanistan, 
ousting  its  Islamist  government,  and  installing  a 
friendly  regime  in  its  stead.  In  the  fall  of  2003, 
President  Bush  ordered  an  invasion  of  Iraq.  Amid 
loud  protests  and  shrill  warnings,  he  declared  a 


“war  on  terror,”  arguing  that  Iraq’s  dictatorial 
regime  had  stockpiles  of  WMDs  and  nursed  terror¬ 
ist  connections. 

No  WMDs  were  found  in  Iraq.  No  terrorist  con¬ 
nections  were  convincingly  established.  Many  in  the 
world,  including  IR  scholars,  argued  that  the  Bush 
administration  had  invaded  Iraq  either  as  the  result 
of  panic  or  confusion,  or  as  a  calculated  effort  to 
secure  access  to  the  region’s  oil  reserves.  Regardless 
of  reason,  the  faraway  and  costly  wars  in  Iraq  and 
Afghanistan  wasted  America’s  soft  power  and 
weakened  its  standing  in  the  world — partly  by  its 
ongoing  costly  military  exploits  and  by  its  policy  of 
cutting  taxes  while  increasing  spending.  In  the  first 
years  of  the  new  millennium,  America’s  role  in  the 
post-Cold  War  world  was  increasingly  described  in 
terms  of  empire  rather  than  hegemon:  The  United 
States  was  more  commonly  seen  as  an  imperialist 
than  as  a  democratizing  force. 

Turmoil  and  wars  in  the  Middle  East  and 
Central  Asia  sparked  new  energy  debates.  The 
Great  Powers  of  the  world — with  the  significant 
exception  of  Russia — were  deeply  reliant  on  for¬ 
eign  oil,  and  their  dependence  presented  a  serious 
security  risk  for  each  of  them.  Also,  the  continued 
consumption  of  fossil  fuels  represented  a  security 
risk  for  them  all:  Many  scientists  claim  that  the 
burning  of  oil  released  such  large  amounts  of  car¬ 
bon  gases  that  it  altered  the  composition  of  the 
earth’s  atmosphere  and  changed  the  global  cli¬ 
mate — and  thus  created  a  security  risk  of  another 
kind.  By  the  end  of  Bush’s  second  term,  the  United 
States  was  still  the  world’s  most  powerful  country, 
but  it  was  deeply  distrusted.  When  the  American 
economy  entered  a  deep  economic  crisis,  its  image 
as  a  leading  world  power  was  on  the  rocks.  Fareed 
Zakaria  spoke  on  behalf  of  many  observers  when, 
in  The  Post-American  World  (2007),  he  portrayed 
the  international  system  as  increasingly  multipolar, 
where  countries  such  as  Brazil,  Russia,  India,  and 
China  might  represent  new  centers  of  power  along¬ 
side  an  enfeebled  and  distrusted  United  States. 

Conclusion 

The  contours  of  scholarly  IR  that  emerged  in  the 
wake  of  the  Franco-Prussian  War  (1871)  grew 
more  defined  during  the  subsequent  years  of  impe¬ 
rialist  expansion.  It  came  into  its  own  as  a  scholarly 
field  during  World  War  I,  emerging  as  a  branch  of 
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the  social  sciences  in  the  wake  of  that  Great  War. 
It  was  then  preoccupied  with  the  causes  of  war  and 
the  preconditions  for  a  stable  peace. 

IR  has  since  added  other  concerns  to  its  schol¬ 
arly  ambit  and  evolved  as  a  more  varied  scholarly 
field.  It  has  become  too  varied,  according  to  some 
observers — especially  as  IR  scholars  have  engaged  in 
abstract  philosophy-of-science  theorizing.  True,  IR 
may  no  longer  have  a  sharp  and  single  focus. 
However,  its  core  concerns  are  still  apparent  as  a  set 
of  overlapping  issues,  such  as  war  and  peace,  wealth, 
and  power.  And  these  are  maintained  by  the  disci¬ 
plinary  history  that  marks  IR  as  a  scholarly  field. 

Also,  the  theoretical  foundation  of  IR  has 
remained  fairly  constant  over  time.  IR  scholars  still 
hang  on  to  their  old  notion  of  a  pluralist  field  com¬ 
posed  of  opposing  approaches.  The  two  classic 
approaches — the  Hobbes-based  realism  and  the 
Locke-derived  rationalism — are  still  dominating 
the  scholarly  debate  (although  both  have  adapted 
to  the  changing  world).  A  third  approach  has 
drifted  in  and  out  of  the  field,  advocating  change 
and  revolution — now  in  world  affairs,  now  in 
scholarly  approaches,  but  it  has  always  sought  to 
introduce  critical,  collectivist,  and,  one  is  tempted 
to  add,  romantic  perspectives  to  the  field.  At  the 
beginning  of  the  21st  century,  the  basic  approaches 
of  IR,  then,  still  conform  to  the  three  perspectives 
described  by  James  Lorimer  at  the  end  of  the  19th 
century. 

Torbjorn  L.  Knutsen 
Norwegian  University  of  Science  and 
Technology  (NTNU) 
Trondheim,  Norway 

See  also  Balance  of  Power;  International  Law;  Liberalism; 
Realism  in  International  Relations;  War  and  Peace; 
Westphalian  Ideal  State 

Further  Readings 

Armstrong,  D.  (1993).  Revolution  and  world  order. 

Oxford,  UK:  Clarendon  Press. 

Bauer,  H.,  &  Brighi,  E.  (Eds.).  (2003).  International 
relations  at  the  LSE:  A  history  of  75  years.  London: 
Millennium. 

Baylis,  J.,  Smith,  S.,  &  Owens,  P.  (Eds.).  (2008).  The 
globalization  of  world  politics:  An  introduction  to 
international  relations.  Oxford,  UK:  Oxford 
University  Press. 


Burleigh,  M.  (2009).  Blood  and  rage:  A  cultural  history 
of  terrorism.  London:  HarperCollins. 

Calleo,  D.  (2001).  Rethinking  Europe’s  future.  Princeton, 
NJ:  Princeton  University  Press. 

Frieden,  J.  A.  (2006).  Global  capitalism:  Its  fall  and  rise 
in  the  tiventieth  century.  New  York:  W.  W.  Norton. 

Gat,  A.  (2001).  A  history  of  military  thought.  Oxford, 
UK:  Oxford  University  Press. 

Goldstein,  S.  J.  (2007).  International  relations.  London: 
Longman. 

Knutsen,  T.  L.  (1997).  A  history  of  international  relations 
theory.  Manchester,  UK:  Manchester  University  Press. 

Long,  D.,  &  Wilson,  P.  (Eds.).  (1995).  Thinkers  of  the 
twenty  years’  crisis:  Inter-war  idealism  reassessed. 
Oxford,  UK:  Clarendon  Press. 

Luard,  E.  (Ed.).  (1992).  Basic  texts  in  international 
relations:  The  evolution  of  ideas  about  international 
society.  New  York:  St.  Martin’s. 

Olson,  W.  C.,  &  Groom,  A.  J.  R.  (1991).  International 
relations  then  and  now:  Origins  and  trends  in 
interpretation.  London:  Routledge. 


International  Relations, 
Theory 


International  relations  (IR)  theory  is  an  area  of 
social  theory  that  seeks  to  account  for  international 
or  world  political  events,  patterns,  relations,  and 
interactions.  Traditionally,  as  the  concept  implies, 
the  focus  of  IR  theory  has  been  on  the  study  of 
interstate  relations  and  interactions:  mainly  on  why 
states  go  to  war  with  each  other  and  why,  under 
certain  conditions,  they  may  refrain  from  doing  so. 
Yet  it  is  important  to  note  that  in  recent  decades, 
the  meaning,  scope,  and  the  interests  driving  IR 
theorists  have  expanded  significantly  beyond  tradi¬ 
tional  interstate  concerns.  This  expansion  has  been 
driven  by  the  increased  awareness  of  challenging 
new  issues  in  international  politics,  such  as  global¬ 
ization,  human  rights,  global  poverty,  environmen¬ 
tal  threats,  and  global  gender  hierarchies.  Interest  in 
these  issue  areas  has  not  only  stimulated  develop¬ 
ments  within  traditional  IR-theoretical  approaches, 
such  as  realism  and  liberalism,  but  it  has  also  seen 
a  significant  pluralization  within  the  field  to  include 
a  host  of  new  perspectives. 

When  considering  the  history  of  IR  theory,  a 
subject  that  first  emerged  in  the  aftermath  of 
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World  War  I  and  became  consolidated  in  the  post- 
World  War  II  environment,  it  is  notable  that  the 
meaning  of  the  notion  “IR  theory”  has  developed 
in  two  distinct  senses  over  the  years:  in  terms  of  the 
perceptions  scholars  have  of  the  scope  of  “IR” 
and,  simultaneously,  in  terms  of  the  meanings 
attached  to  the  notion  “theory.” 

Having  been  a  part  and  parcel  of  historical,  eco¬ 
nomic,  philosophical,  or  political  inquiry  during 
earlier  centuries,  in  the  20th  century,  IR  theory  came 
gradually  to  form  a  disciplinary  field  distinct  from 
other  subfields  of  political  science,  such  as  political 
theory,  or  a  base  for  theorizing  in  other  social  sci¬ 
ence  disciplines,  such  as  sociology,  history,  or  eco¬ 
nomics.  IR  theory,  as  conceived  by  many  commen¬ 
tators  in  the  mid-20th  century,  focused  primarily  on 
state  interactions  on  the  level  of  the  international 
system  (not  within  the  state)  and  in  reference,  by  and 
large,  to  political  relations  (not  economic  or  social 
relations).  However,  driven  by  the  increasingly 
transnational  and  crosscutting  nature  of  global 
problems,  the  boundaries  between  IR  theory  and 
other  social  sciences  have  been  progressively  recon¬ 
ceived  by  IR  theorists  during  the  past  few  decades. 
Many  IR  theorists  now  analyze  phenomena  beyond 
interstate  relations  and  often  use  concepts  and  theo¬ 
retical  premises  usually  characteristic  of  other  fields, 
whether  it  be  economic  analysis,  the  study  of  domes¬ 
tic  social  relations,  or  ethical  or  philosophical 
inquiry.  Hence,  it  is  not  surprising  that  the  nature  of 
IR  theory  as  a  discrete  academic  subfield  has  been 
under  some  debate  recently. 

It  should  also  be  noted  that  while  the  scope  of 
IR  has  expanded,  the  meaning  of  what  counts  as 
theory  has  at  the  same  time  shifted  within  the  field. 
A  number  of  so-called  postpositivist  scholars  have 
in  recent  decades  turned  away  from  explanatory 
science,  traditionally  IR  theorists’  main  focus,  in 
favor  of  understanding  meanings  and  ideas  in  inter¬ 
national  politics.  Thus,  different  views  have  arisen 
of  the  very  meaning  of  theory.  While  for  some  it 
entails  explanatory  theory  aimed  at  trying  to  estab¬ 
lish  the  causes  of  world  political  developments,  for 
others  it  is  concerned  with  interpretive  theory, 
seeking  an  understanding  of  why  specific  ideas  or 
discursive  systems  of  meaning  have  come  to  prolif¬ 
erate.  In  addition,  some  scholars  make  the  argu¬ 
ment  that  IR  theory  should  primarily  be  viewed  as 
normative  theory  (i.e.,  dealing  with  questions  of 
“ought”  and  not  just  “is”),  while  others  argue  that 


it  should  function  as  critical  theory  (i.e.,  challeng¬ 
ing  existing  systems  of  power  and  oppression  in 
world  politics). 

Given  the  contestation  over  the  scope  of  the 
subject  matter  of  IR  theory  and  the  meaning  and 
purpose  of  theorizing  itself,  there  is  thus  no  singu¬ 
lar  conception  of  IR  theory  that  we  can  turn  to  but 
rather  a  number  of  contested  visions  of  it.  In  view 
of  this,  it  is  important  to  understand  the  plurality 
of  ways  of  framing  IR  theorizing  today.  Whereas 
40  years  ago,  three  theories  of  IR  were  commonly 
referred  to  as  exhausting  the  field — realism,  liber¬ 
alism,  and  either  the  English  School  or  Marxism — 
today,  a  number  of  critical  and  postpositivist  tradi¬ 
tions  of  thought  have  entered  this  field,  such  as 
constructivism,  critical  theory,  normative  theory, 
poststructuralism,  postcolonialism,  feminism,  and 
green  theory. 

This  entry  first  discusses,  in  historical  perspec¬ 
tive,  how  theoretical  arguments  have  developed 
around  the  idea  of  IR.  Second,  it  discusses  the 
distinct  theoretical  schools  of  thought,  or  isms, 
commonly  conceived  to  be  characteristic  of  IR 
theory  today,  highlighting  key  issues  and  themes 
on  which  theorists  agree  and  disagree.  This  discus¬ 
sion  is  followed,  finally,  by  a  brief  assessment  of 
key  developments  in  IR  theory,  with  a  view  to 
deciphering  where  future  interests  and  fault  lines 
of  IR  theorizing  might  lie. 

IR  Theory  in  Historical  Perspective 

The  idea  that  there  should  be  such  a  thing  as  IR 
theory  at  all  is  of  a  relatively  recent  origin,  being  in 
many  ways  a  20th-century  invention.  IR  theory  is 
a  term  that  arose  in  the  early  and  mid-20th  century 
with  the  attempt  by  a  specific  set  of,  mostly  Anglo- 
American,  academics  to  develop  systematic  theo¬ 
retical  thought  to  serve  as  the  basis  for  a  newly 
emerging  (sub)field  of  the  social  sciences — IR  or 
international  politics.  While  initially  conceived  as 
a  subject  matter  for  historians,  lawyers,  or  political 
theorists,  the  professionalization  and  institutional¬ 
ization  of  academic  disciplines  during  this  period 
also  led  scholars  interested  in  the  empirical  domain 
of  IR  to  establish  a  discipline  of  their  own,  includ¬ 
ing  a  set  of  specific  theoretical  assumptions  and 
concepts  appropriate  to  its  study. 

These  concepts  did  not  arise  from  nowhere, 
however.  In  fact,  many  20th-century  IR  theorists 


1288  International  Relations,  Theory 


have  taken  their  cue  from  much  earlier  thinkers.  It 
could  be  argued  that  IR  theory — that  is,  the 
attempt  to  systematically  understand  why  states  or 
political  units  go  to  war  with  each  other  or  engage 
in  other  international  forms  of  social  interaction — 
was  first  developed  in  ancient  Greece  by  Thucydides 
in  his  account  of  the  Peloponnesian  wars.  IR  was 
also  clearly  theorized  in  the  medieval  and  early- 
modern  periods  in  various  scholastic  and  republi¬ 
can  tracts,  perhaps  most  notably  in  Niccolo 
Machiavelli’s  writings  and  in  the  treatises  of  those 
interested  in  the  ethics  and  legality  of  war,  such  as 
Francisco  de  Vitoria  and  Hugo  Grotius.  From 
these  early  writings  on  IR  arise  not  only  many  of 
the  patterns  of  thought  that  continue  to  character¬ 
ize  IR  today,  such  as  the  view  of  IR  as  power  poli¬ 
tics,  but  also  many  concepts  used  in  current  theory, 
such  as  the  notions  of  sovereignty  or  just  war.  Yet 
these  texts  were  not  explicitly  IR-theoretical:  They 
were  seen  as  part  and  parcel  of  wider  historical, 
legal,  political,  or  ethical  reasoning. 

The  theorization  of  IR  as  we  tend  to  understand 
it  today  arose  in  the  20th  century.  It  is  closely  tied 
to  the  rise  of  the  discipline  of  IR,  which  arose  also 
in  a  very  specific  context,  since  it  was  “invented”  to 
address  the  problem  of  war  in  the  aftermath  of 
World  Wars  I  and  II.  After  World  War  I,  an  impe¬ 
tus  arose  to  address,  urgently  and  systematically, 
the  question  of  why  states  ended  up  engaging  in 
destructive  wars.  To  study  this  subject,  the  first 
Chair  of  International  Politics  was  established  at 
the  University  of  Wales,  Aberystwyth,  in  1919,  and 
in  the  1920s  and  1930s  at  a  few  other  universities. 
The  focus  of  postwar  IR  specialists  was  thus  on 
developing  a  systematic  understanding  of  why  wars 
occurred,  specifically  with  the  aim  of  allowing  the 
international  society  to  avoid  them  in  the  future.  A 
number  of  commentators,  such  as  Alfred  Zimmern 
and  Leonard  Woolf,  developed  and  argued  for 
notions  of  collective  security  capable  of  averting  the 
kind  of  disaster  that  World  War  I  had  been. 

However,  the  most  famous  IR  theory  text  that 
was  to  arise  in  the  first  part  of  the  20th  century 
came  in  reply  to  the  interwar  commentators.  It 
was  written  by  Edward  Carr,  the  Chair  in 
International  Politics  at  Aberystwyth  at  this  time, 
and  was  called  the  Twenty  Years’  Crisis.  Carr  was 
one  of  the  early  key  scholars  seeking  specifically  to 
establish  a  theoretical  understanding  of  IR.  In  so 
doing,  he  drew  attention  to  the  failings  of  the 


interwar  commentators.  He  argued  that  they  had 
been  far  too  “idealistic,”  prioritizing  normative 
and  utopian  ideals  instead  of  recognizing  the 
“realities”  of  power  politics.  Carr  dismissed  these 
idealists  in  favor  of  the  development  of  “realist” 
theorizing,  which  took  seriously  the  sources  of 
conflict  between  states,  the  role  of  national  inter¬ 
est  in  shaping  states’  actions,  and  the  role  of  power 
politics  in  IR  more  widely. 

Carr  played  a  key  role  in  setting  out  the  theo¬ 
retical  parameters  of  what  became  known  as  real¬ 
ist  IR  theory,  and  it  is  also  fair  to  say  that  he  was 
instrumental  in  establishing  realism  in  the  post- 
World  War  II  context.  Realist  thinking  was  also 
strengthened  by  the  onset  of  the  Cold  War,  which 
seemed  to  prove  the  importance  of  power  politics, 
as  well  as  by  the  influence  that  Hans  Morgenthau 
came  to  have  during  this  period.  In  Politics 
Amongst  Nations ,  he  argued  that  international 
politics  is  shaped  by  objective  laws  based  on 
human  nature  and,  crucially,  that  the  key  to  under¬ 
standing  IR  lies  in  the  idea  of  interest  defined  in 
terms  of  power.  Unlike  liberal  idealists,  he  claimed 
that  the  actions  of  states  should  not  be  judged  in 
terms  of  universal  moral  standards  but  with  refer¬ 
ence  to  the  specific  interests  that  states  hold  and  by 
the  need  for  prudential  action  on  the  part  of  their 
leaders.  Power  politics  may  be  necessary  in  inter¬ 
national  politics  in  defense  of  the  national  interest. 

It  is  the  centrality  of  power  in  these  early  realist 
analyses  that  initiated  a  whole  set  of  fascinating 
work  on  the  core  dilemmas  in  international  poli¬ 
tics,  notably  on  the  idea  of  security  dilemma, 
explored  initially,  for  example,  by  John  Herz: 
How  can  states  cooperate  when  the  attempts  of 
some  to  gain  security  inevitably  create  insecurity  in 
other  states? 

Although  realism  became  the  core  strand  of 
thought  in  Anglo-American  IR  theory  during  these 
years,  it  was  substantially  modified  as  a  theoretical 
system  by  Kenneth  Waltz,  who  in  1979  made  a 
radical  break  with  the  classical  realist  tradition  by 
modeling  the  international  system  in  accordance 
with  the  methods  of  positivist  science.  He  charac¬ 
terized  this  system  as  one  of  anarchy,  within  which 
all  states  are  conceived  as  “like  units,”  interested 
ultimately  in  nothing  but  their  own  survival.  This 
structural,  or  neorealist,  argument  set  a  new  stan¬ 
dard  in  IR.  Not  only  did  it  reshape  realist  thought, 
but  it  also  influenced  the  work  of  the  new  brand  of 


International  Relations,  Theory  1289 


liberal  internationalists  in  the  1970s,  such  as 
Robert  Keohane — even  though  Keohane  believed 
that  the  self-help  logic  could  be  mitigated  in  inter¬ 
national  politics  through  the  development  of  inter¬ 
national  institutions.  In  its  more  classical  forms, 
realism  also  strongly  influenced  the  so-called 
English  School  developed  in  the  United  Kingdom 
(UK);  for  while  the  focus  here  was  on  the  idea  of 
an  international  society,  the  realist  prioritization  of 
state  relations  still  formed  the  core  of  this  school. 

It  is  important  to  note  that  realists,  many  liber¬ 
als,  and  many  English  School  thinkers  share  certain 
important  assumptions  about  what  IR  theory  is  and 
should  consist  of.  Paradoxically,  the  best  statement 
of  this  notion  of  IR  theory  is  provided  in  Martin 
Wight’s  essay  “Why  is  there  no  international  the¬ 
ory?”  Wight  was  skeptical  of  the  possibility  of 
international  theory  because  of  the  perception  of 
war  as  recurrent  and  inevitable  by  many  and 
because  of  the  attractions  among  political  scientists 
of  political  theory  conceived  in  terms  of  the  “good 
life” — patterns  of  thought  that,  in  his  view,  seemed 
far  removed  from  the  realities  of  international  poli¬ 
tics.  Yet  Wight  gave  a  very  clear  account  of  what 
IR  theory  should  be:  It  should  be  concerned  exclu¬ 
sively  with  international  and  not  domestic  politics, 
and  it  should  be  able  to  deal  with,  although  pos¬ 
sibly  also  look  beyond,  the  endless  recurrence  of 
war.  This  was  precisely  the  sort  of  theory  that 
Edward  Carr,  Hans  Morgenthau,  Hedley  Bull, 
Kenneth  Waltz,  and  others  developed  for  the  dis¬ 
cipline.  Despite  some  methodological  disagree¬ 
ments  between  scholars  (the  so-called  second 
debate)  throughout  the  1950s  and  1960s,  there 
was  substantial  agreement  on  the  meaning  of  the 
focal  point  of  IR  theory,  at  least  in  terms  of  its 
object  of  study — state  behavior  and  the  explana¬ 
tion  of  the  causes  of  war  and  conditions  for  coop¬ 
eration.  This  consensus  on  the  theory  of  IR  broke 
down,  however,  in  the  1970s  and  1980s  with  the 
rise  of  the  interparadigm  debate  and,  subse¬ 
quently,  the  fourth  debate. 

The  interparadigm  debate  in  the  1970s  chal¬ 
lenged  the  classical  understandings  of  IR  theory  in 
terms  of  what  the  field  should  focus  on.  In  light  of 
both  the  oil  and  the  financial  crises  in  the  1970s, 
two  new  perspectives  gained  currency:  complex 
interdependence  theory  and  Marxism.  These  para¬ 
digms  were  conceived  to  challenge  the  realist  focus 
in  two  respects:  First,  while  accepting  the  state  as 


a  key  actor,  both  refused  to  reduce  all  interna¬ 
tional  politics  to  the  pursuit  of  national  interests, 
instead  looking  beyond  the  state  to  the  role  of 
economic  factors  and  nonstate  actors;  second,  they 
challenged  the  negative  conclusions  of  the  realists, 
arguing  instead  that  forms  of  cooperation  beyond 
mere  constant  struggle  for  survival  was  possible. 
This  entailed  a  challenge  to  both  the  dominance  of 
realism  and  the  realist  theoretical  obsession  with 
survival  and  brute  (military)  power;  there  was  now 
a  need  also  to  explain  why  states  cooperated  and, 
for  Marxists,  why  certain  patterns  of  inequality 
outside  mere  military  relations  continued  to  persist 
and  structure  IR. 

These  echoes  critical  of  realism  were  picked  up 
and  developed  even  further  in  the  1980s  when  a 
variety  of  so-called  postpositivist  theorists  started 
to  criticize  realism,  not  just  on  the  basis  of  its  theo¬ 
retical  assumptions  about  the  state  but  also  because 
of  its  philosophical,  normative,  and  political  lean¬ 
ings.  Philosophically,  realist  theory  was  accused  of 
positivism  in  its  attempt  to  establish  laws  and 
regularities  in  international  politics  as  well  as 
objective  scientific  knowledge  of  human  affairs — 
goals  that  were  viewed  as  unfeasible  and  inappro¬ 
priate  within  the  social  sciences.  In  addition, 
realism  was  condemned  normatively  and  politi¬ 
cally  for  constituting  a  conservative  bulwark  for 
statists  and  Great  Power  apologists;  it  was  seen  to 
be  reproducing  a  negative  and  cynical  vision  of  the 
world  and  one  that  unfairly  prioritized  the  inter¬ 
ests  of  great  powers  over  other  parts  of  the  world. 
Critical  theorists  extended  this  criticism  to  include 
many  liberals  as  well — those  who  had  accepted 
some  central  assumptions  of  realism,  notably  with 
a  focus  on  the  self-interested  state. 

Critical  theorists  emphasized  the  need  to  go 
beyond  problem-solving  theory  (perceived  as  cen¬ 
tral  for  realists,  liberals,  and  even  some  Marxists), 
by  unraveling  the  ways  in  which  theories  of  IR 
reproduced  particular  ideological  interests  and 
power  relations  in  the  world. 

The  postpositivist  criticism  led  to  the  prolifera¬ 
tion  of  a  whole  new  set  of  IR  theories — feminism, 
constructivism,  poststructuralism,  and  critical  the¬ 
ory — and  caused  the  onset  of  a  more  theoretically 
pluralist  way  of  approaching  IR  theory.  Indeed, 
whereas  in  the  1970s  and  mid-1980s  many  IR 
theory  textbooks  highlighted  just  three  traditions 
of  IR  theory  (realism,  liberalism,  and  either  the 
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English  School  or  Marxism  as  a  third  option), 
today  IR  theory  textbooks  cover  anywhere  between 
8  and  14  different  theoretical  approaches  (see 
Scott  Burchill  et  ah,  2005;  Timothy  Dunne,  Milja 
Kurki,  &  Steve  Smith,  2007). 

IR  Theories 

Given  the  plurality  of  approaches  that  exists  today 
in  IR  theory,  we  must,  to  understand  contempo¬ 
rary  IR  theory,  engage  with  the  theoretical  plural¬ 
ism  in  the  field.  With  this  in  mind,  this  section 
examines  some  of  the  core  isms  in  IR  theory  today 
and  the  issues  and  debates  that  divide  them. 

The  Isms 

To  characterize  any  field  in  terms  of  a  set  of 
“isms”  is  hugely  problematic,  although  often  also 
attractive  for  cognitive  reasons.  While  this 
approach  is  used  here,  because  such  a  terminology 
is  seen  to  have  distinct  advantages  in  clarifying 
some  of  the  key  divisions  of  thought  on  IR  theory, 
this  usage  also  has  important  problems  that  need 
to  be  kept  in  mind. 

First,  it  must  be  remembered  that  it  can  be  his¬ 
torically  misleading  to  characterize  the  thought  of 
specific  individuals  as  belonging  to  unified  isms. 
Thus,  for  example,  as  historians  of  the  discipline 
have  noted,  idealists  in  the  interwar  period  were 
not  necessarily  unified  in  their  frameworks  of 
thought  constituting  an  ism,  and  nor  were  they 
necessarily  idealists.  Indeed,  the  use  of  the  ism 
label  can  be  a  useful  way  of  rhetorically  dismissing 
or  sidelining  thinkers,  and  Carr’s  attack  on  ideal¬ 
ism  can  be  seen  as  one  such  rhetorical  attack. 
Second,  it  is  important  to  note  that  isms  may  be 
identified  on  very  different  grounds  in  analyses. 
Thus,  in  IR  theory,  some  isms  may  be  strictly 
IR-theoretical  isms,  such  as  realism,  but  others 
may  have  much  wider  meanings  alongside  their 
meanings  in  IR  (e.g.,  constructivism  in  philosophy 
and  social  theory,  liberalism  in  political  theory  and 
economic  science)  or  meanings  entirely  beyond  IR 
(e.g.,  critical  realism  in  philosophy).  Third,  and 
perhaps  most  crucially,  the  isms  approach  to  any 
subject  creates  the  problem  of  specifying  which 
isms  “count.”  Talking  in  terms  of  isms  is  one  way 
of  justifying  and  legitimizing  some  approaches 
over  others  and  simultaneously  of  excluding  some 


of  them.  Now,  the  approach  adopted  here  to  IR 
theories  is  expansive,  yet  it  by  no  means  includes 
within  it  all  the  approaches  that  could  be  consid¬ 
ered  IR-theoretical.  For  example,  anarchism  is  not 
treated  here  as  a  distinct  IR-theoretical  school, 
although  grounds  exist  for  saying  that  it  might 
constitute  one.  It  should  also  be  noted  that  within 
the  approaches  adopted  here,  there  are  various 
subdivisions  and  also  that  not  all  authors  have 
consistently  throughout  their  career  sought  to 
serve  a  single  and  unified  idea  of  an  ism. 

With  all  the  above  caveats  in  mind,  the  isms 
that  are  discussed  here  are  realism,  liberalism,  the 
English  School,  Marxism,  critical  theory,  femi¬ 
nism,  constructivism,  poststructuralism,  postcolo- 
nialism,  green  theory,  and  normative  theory.  These 
constitute  the  core  theoretical  strands  of  thought 
currently  commonly  referred  to  and  developed  in 
IR,  although  they  are  not  isms  in  the  same  sense 
(normative  theory,  e.g.,  may  be  conceived  as  a 
subfield  of  study  and  poststructuralism  as  an  ethos 
rather  than  a  theory). 

Realism 

As  pointed  out  earlier,  realism  provides  the  core 
reference  point  for  much  of  IR  theory.  This  is  not 
because  realism  is  necessarily  correct  but  because  it 
has  played  a  significant  role  in  the  rise  of  the  idea 
of  IR  theory  and  because  the  very  idea  of  IR  con¬ 
ceptually  remains  tied  to  key  realist  assumptions. 
The  core  realist  assumptions  can  be  defined  as 
threefold.  Realist  IR  theory  argues  (a)  that  the  state 
is  the  core  unit  of  international  politics,  (b)  that  the 
national  interests  of  the  state  play  a  crucial  role  in 
defining  the  shape  of  international  politics,  and 
(c)  that  because  of  the  emphasis  of  states  on  sur¬ 
vival  and  self-interest,  cooperation  between  states 
is  difficult  to  maintain,  and  war  between  states  is 
always  a  danger  and  likely  to  be  recurrent.  While 
these  constitute  core  assumptions  that  most  realists 
would  agree  on,  there  are  important  subdivisions 
within  realist  thought  that  need  to  be  considered. 

Neorealism  is  a  term  coined  to  refer  to  those  real¬ 
ists  who  follow  Waltz’s  reformulation  of  realism — 
that  is,  a  theory  that  bases  its  analysis  on  the 
postulation  of  a  structural  impediment  to  state 
cooperation,  the  structure  of  anarchy  in  the  inter¬ 
national  system.  Given  the  structural  constraint  of 
anarchy,  states  are  predisposed  toward  self-help 
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behavior.  Classical  realists,  whom  Waltz  criticized, 
explain  the  causes  of  war  in  terms  of  individualis¬ 
tic  or  state-based  levels  or,  like  Ned  Lebow,  have 
turned  to  the  language  and  theoretical  explana¬ 
tions  of  classical  Greeks  rather  than  modern  social 
science  to  generate  explanations  of  current  affairs. 
Offensive  and  defensive  realisms  are  strands  of 
thought  that  both  often  follow  a  neorealist  or 
structural-realist  theoretical  framework,  yet  they 
disagree  on  how  states  are  likely  to  act  in  the  pur¬ 
suit  of  their  interests.  Defensive  realists  emphasize 
that  self-help  states  can  satisfy  their  security  by 
pursuing  defensive  postures  by  seeking  to  balance 
against  rising  powers.  Offensive  realists,  drawing 
on  John  Mearsheimer’s  work,  argue  that  states  in 
a  self-help  system  always  strive  to  maximize  their 
power  and  to  dominate  the  system  and,  thus,  are 
inherently  offensive  in  their  posture. 

Liberalism 

Liberalism  is  another  core  IR  theory  school.  Its 
basic  principles  arise  from  liberal  political  theory 
but  are  extended  to  the  international  realm,  albeit 
as  with  realism,  in  a  variety  of  different  forms. 
Classical  liberals  tend  to  advocate  a  form  of  liberal 
internationalism,  believing  that  the  expansion  of 
liberal  principles,  when  more  universally  embraced, 
result  in  more  peaceful  international  interactions. 
One  core  doctrine  that  has  arisen  from  classical 
liberalism  is  democratic  peace  theory,  which — 
drawing  on  Immanuel  Kant’s  thought — argues 
that  democratic  states  are  unlikely  to  wage  war 
with  other  democratic  states. 

There  are  also,  however,  other  strands  of  liberal 
IR  theorizing.  Complex  interdependence  is  a  con¬ 
cept  launched  by  Robert  Keohane  and  Joseph  Nye 
in  the  1970s.  They  argued  that  states  by  this  time 
had  become  so  interdependent  with  one  another  in 
nonmilitary  issue  areas,  such  as  the  economy  and 
the  environment,  that  this  entailed  that  the  typical 
hierarchization  of  national  security  and  military 
capabilities  was  not  necessarily  any  longer  regarded 
as  a  top  priority  on  the  part  of  states  or  that  coop¬ 
eration  between  states  in  the  other  areas  was 
impossible.  Neoliberal  institutionalism ,  developed 
by  Keohane  and  others  in  the  mid-1980s,  was,  on 
the  other  hand,  based  on  the  claim  that  even  if  one 
accepts  the  neorealists’  prioritization  of  self-interest 
and  anarchy  as  a  given  structural  condition,  it  does 


not  necessarily  lead  to  the  conclusion  that  states 
will  not  cooperate  with  one  another.  This  is  in 
large  part  because  states  often  prioritize  absolute 
gains  over  relative  gains  and  hence  can  be  per¬ 
suaded  to  pursue  cooperative  endeavors  in  ways 
that  contravene  the  realist  logic.  In  the  case  of  the 
European  Union,  for  example,  the  neoliberals 
argue  that  a  preference  for  absolute  gains  and 
mutual  cooperation  has  overridden  the  relative 
gains  logic  of  the  realists. 

The  English  School 

The  English  School  is  a  tradition  of  thought 
associated  with  a  number  of  English  or  U.K. -based 
scholars  who  have  shared  a  certain  set  of  method¬ 
ological  and  conceptual  starting  points  in  the 
analysis  of  IR.  Key  figures  here  include  Martin 
Wight,  Hedley  Bull,  and  John  Vincent,  and  more 
recently  Barry  Buzan,  Robert  Jackson,  Nicholas 
Wheeler,  and  Tim  Dunne.  The  English  School 
thinkers  share  a  historical  and  interpretivist 
approach  to  IR;  that  is,  they  do  not  tend  to  apply 
quantitative  methods  to  the  study  of  IR  as  many 
realists  and  liberals  do,  but  rather,  they  emphasize 
the  interpretation  of  the  motives,  norms,  and 
understandings  of  actors. 

Crucially,  they  focus  conceptually  on  the  theme  of 
international  society:  The  English  School  conceives 
of  international  politics  in  terms  of  a  rule-bound 
society  of  states,  which  forms  an  international  soci¬ 
ety  in  which  a  set  of  rules,  norms,  and  institutions 
are  mutually  agreed  on.  English  School  thinkers  are 
not  necessarily  cosmopolitans  or  postrealists,  how¬ 
ever.  In  fact,  the  English  School  is  subdivided  into 
solidarist  and  pluralist  strands.  While  the  solidarists 
accept  the  idea  of  human  rights  and  humanitarian 
intervention  as  increasingly  important,  with  mutu¬ 
ally  accepted  rules  defining  the  nature  of  interna¬ 
tional  society,  pluralists  continue  to  maintain  a  belief 
in  nonintervention  and  sovereignty  as  the  key  prin¬ 
ciples  of  international  society. 

Marxism  and  Historical  Materialism 

Although  Marxism  has  its  roots  in  the  19th- 
century  writings  of  Karl  Marx,  within  IR  theory,  it 
made  a  somewhat  belated  appearance  compared 
with  the  other  social  sciences.  Although  some  real¬ 
ists  had  an  interest  in  Marxism  early  on  (notably 
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Carr),  Marxism  entered  IR  theory  debates  only  in 
the  late  1970s.  At  that  time,  it  came  to  be  associated 
in  particular  with  the  so-called  world  systems  theory 
developed  by  Immanuel  Wallerstein.  Contra  the 
realists,  he  argued  that  the  driving  dynamics  of 
world  politics  was  economic  in  nature:  The  struc¬ 
ture  of  the  capitalist  mode  of  production  globally  is 
what  positioned  states  within  international  politics 
and  influenced  their  interactions.  Wallerstein  divided 
the  world  into  the  core  (West),  the  semiperiphery 
(East  Asia),  and  the  periphery  (the  developing 
world),  each  zone  possessing  distinct  functions  in 
the  global  capitalist  system  of  production  and  hence 
also  in  the  state  system. 

World  systems  theory  was  heavily  criticized  for 
what  many  perceived  to  be  its  overly  structural 
and  economistic  focus.  Yet  Marxist  ideas  have  also 
been  developed  by  others,  notably  the  so-called 
neo-Gramscian  school  drawing  on  Robert  Cox 
and  other  historical  materialists,  such  as  Justin 
Rosenberg  and  Mark  Rupert.  The  neo-Gramscian 
strand  of  thought  is  most  famous  in  extending  the 
politico-economic  understanding  of  IR  to  the 
analysis  of  hegemony — that  is,  the  way  in  which 
ideological  consensus  is  created  and  reproduced  in 
IR  and  how  this  is  tied  to  specific  class  or  eco¬ 
nomic  interests. 

Critical  Theory 

Critical  theory  is  a  multifaceted  set  of  ideas 
associated  with  the  idea  of  emancipation  and  one 
with  origins  in  the  so-called  Frankfurt  School  the¬ 
ory  of  Max  Horkheimer,  Theodor  Adorno,  and 
Jurgen  Habermas.  Critical  theorists  in  IR  have 
tended  to  emphasize  the  idea  of  critical  theory 
as  distinct  from  problem-solving  theory.  Cox 
famously  argued  that  problem-solving  theory  fails 
in  being  unable  to  transcend  existing  social  reality 
by  narrowing  its  focus  merely  to  solving  specific 
technical  problems  within  the  parameters  of  other¬ 
wise  unquestioned  social  conditions,  whether  it  be 
the  capitalist  system  of  production,  patriarchal 
gender  relations,  or  narrowly  electoral  systems  of 
democracy.  Contrary  to  such  traditional  theoriz¬ 
ing,  critical  theorizing  has  the  aim  of  challenging 
and  rethinking  the  ideological  premises  that  exist 
for  the  dominant  perceptions  of  reality  that  make 
societies  function  as  they  do  and  that  blind  us  to 
many  forms  of  oppression  that  we  live  with  on  an 


everyday  basis.  Critical  theory  thus  aims  to  go 
deeper  in  its  critique  of  societal  underpinnings:  It 
seeks  the  emancipation  of  people  from  societal  and 
structural  systems  of  oppression  that  often  go 
unnoticed  in  realist  and  liberal  problem-solving 
theorization.  In  IR  theory,  these  themes  have  been 
developed  most  notably  by  Andrew  Linklater. 

Constructivism 

Constructivism  starts  from  the  premise  that 
social  reality,  rather  than  being  fixed  and  natural, 
is  premised  on  and  depends  on  the  thinking  and 
inter  subjective  agreement  of  human  actors.  This 
means  that  social  reality,  including  international 
political  social  reality,  is  socially  constructed. 

In  IR  theory,  there  have  been  numerous  exten¬ 
sions  and  applications  of  constructivist  theoretical 
ideas.  Wendt’s  argument  that  international  anar¬ 
chy  is  socially  constructed  has  been  one  important 
development  of  constructivism.  Wendt  argued  that 
we  cannot  assume,  as  the  realists  do,  that  the  lack 
of  international  government  necessitates  a  self-help 
system.  Instead,  anarchy  is  what  “states  make  of 
it”:  That  is,  those  states  that  have  developed  rela¬ 
tions  of  trust  and  shared  norms  between  them  are 
likely  to  interact  in  “friendly”  ways  with  each 
other,  even  if  other  states  have  been  socialized  into 
distrust  and  hence  to  realist  politics  of  self-help. 
Others,  such  as  Jeffrey  Checkel  and  Friedrich 
Kratochwil,  have  applied  constructivism  to  explain 
developments  such  as  the  end  of  the  Cold  War, 
whereas  Martha  Finnemore  has  used  it  in  under¬ 
standing  the  expansion  of  human  rights  norms.  In 
the  post-Cold  War  setting,  constructivism  has 
provided  a  fruitful  way  of  developing  dynamic 
interpretations  of  change  in  world  politics. 

Yet  it  is  important  to  note  that  constructivist 
ideas  can  be  developed  and  applied  in  very  different 
ways.  While  Wendt’s  constructivism  is  explicitly 
moderate  and  seeks  to  build  bridges  with  the  positiv¬ 
ist  realist  and  liberal  IR  scholars,  other  constructiv¬ 
ists,  such  as  Friedrich  Kratochwil  and  Karin  Fierke, 
argue  for  more  extensive  interpretive  and  linguistic 
forms  of  constructivism,  where  the  idea  of  social 
construction  is  recognized  as  a  deep-running  aspect 
not  just  of  social  forms  “out  there”  but  also  of  our 
explanatory  categories  themselves.  Constructivism, 
we  must  recognize  then,  is  no  united  theoretical 
camp,  and  indeed,  various  subdivisions  have  been 
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recognized  to  exist  within  this  broad  school  (mod¬ 
erate  or  “thin”  constructivism  vs.  radical  or 
“thick”  constructivism;  American  vs.  continental 
constructivism). 

Feminism 

Feminism  emerged  in  IR  in  the  late  1980s  but 
has  been  a  rather  contested  addition  to  the  field. 
This  is  because  feminism  tends  not  to  study  IR  as 
classically  conceived.  Cynthia  Enloe  famously 
argued  that  feminists  are  concerned  not  only  with 
the  international  but  also  with  the  way  in  which 
the  international  is  penetrated  and  constituted  by 
the  personal.  International  power  relations  for 
feminists  are  built  on  a  whole  set  of  global  and 
domestic  interpersonal  power  relations.  Thus,  rela¬ 
tions  between  husband  and  wife  in  a  diplomatic 
family  or  between  Western  consumer  women  and 
women  working  in  the  “feminized”  garment  indus¬ 
try  in  Indonesia  are  global  relations  of  power  too 
and  important  for  making  the  world  of  interna¬ 
tional  politics  go  around  as  it  does. 

Feminists  argue  that  if  social  actors  are  really 
socially  constructed,  as  many  constructivists  and 
others  have  argued,  then  IR  theorists  must  also 
recognize  that  social  actors  and  social  relations  are 
also  constructed  in  gendered  ways.  Indeed,  IR, 
feminists  argue,  has  been  constructed  as  a  field  of 
masculine  actors  pursuing  masculine  aims.  Not 
only  have  women  been  silent  in  the  discipline  and 
practice  of  diplomacy,  but  they  have  also  been  an 
uninteresting  area  of  study  and  simply  do  not  seem 
to  matter  for  IR  theory  as  traditionally  conceived. 
However,  if  theory  and  IR  are  conceptualized 
differently — as  analysis  interpersonally  constituted 
in  the  form  of  global  (rather  than  merely  interna¬ 
tional)  relations  of  power — then  perhaps  there  is 
much  more  to  IR  and  many  more  layers  of  power 
than  traditional  IR  theory  recognizes.  This  is  what 
the  feminists,  through  various  interesting  studies 
of  war,  tourism,  the  sex  industry,  and  peace  build¬ 
ing,  have  tried  to  demonstrate. 

Poststructuralism 

Poststructuralism  has  been  an  influential  force 
in  the  field  since  the  1980s,  especially  in  Europe. 
Yet  this  is  a  difficult  approach  to  label,  in  that 
many  of  its  advocates  reject  any  ism  label  as  a  valid 


description  of  their  approach.  Nevertheless,  schol¬ 
ars  such  as  James  Der  Derian,  Jenny  Edkins,  David 
Campbell,  and  R.  B.  J.  Walker  do  seem  to  share 
certain  assumptions  that  distinguish  them  from 
other  IR  theorists  and  that  could  be  termed  post¬ 
structuralist  in  nature.  Their  key  contribution  is 
that  they  take  the  epistemological  critique  of  posi¬ 
tivist  knowledge  to  its  fullest  conclusion:  They 
highlight  that  instead  of  an  analysis  of  international 
politics  “out  there”  (the  ontology  of  international 
politics),  we  should  study  the  discourses  that  make 
particular  realities  (or  ontologies)  seem  natural.  In 
short,  for  them,  international  politics  is  what  it  is 
because  people  accept  particular  discursive  ways  of 
understanding  what  IR  consists  of.  This  is  why  IR 
theorists  should  be  aware  of  how  international 
political  actors  and  IR  theorists  themselves,  by 
working  with  particular  theoretical  notions,  come 
to  reproduce  or  bring  into  existence  particular 
“realities.”  This  insight  has  been  applied  in  fruitful 
ways:  Poststructuralists  have  studied,  for  example, 
how  discourses  of  “othering”  create  enemies  in  IR 
or  how  discourses  of  nationalism  and  ethnicity  con¬ 
struct  wars  and  civil  strife. 

Another  key  aspect  of  this  theoretical  stance  is 
its  refusal  to  advocate  a  specific  set  of  political  or 
policy  convictions;  positive,  “truth-bound”  ethical 
and  political  positions  characteristic  of  realist,  lib¬ 
eral,  and  Marxist  and  critical  theory  schools  of 
thought  are  seen  as  dangerous  in  world  politics. 
Instead,  they  advocate  an  attitude  of  critique  and 
inquiry  over  assertive  statements  of  truth,  knowl¬ 
edge,  and  action.  Thus,  as  Campbell  has  argued, 
poststructuralism  is  not  a  specific  theoretical  or 
political  account  of  what  world  politics  should 
look  like  but  rather  a  critical  ethos  of  approaching 
the  analysis  of  IR,  which  places  emphasis  on 
deconstructing  existing  schools  of  thought  (see 
Campbell  in  Dunne  et  ah,  2007). 

Postcolonialism 

Postcolonialism  is  a  line  of  thought  that  in 
many  ways  shares  in  the  poststructuralist  skepti¬ 
cism  of  “true”  and  “objective”  knowledge  claims. 
Knowledge  claims,  they  argue,  have  tended  to 
exclude  and  marginalize  many  parts  of  the  world 
as  “uncivilized,”  “barbaric,”  and  hence  legiti¬ 
mately  “governed,”  with  significant  consequences 
for  the  colonized  or  the  developing  states  and 
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populations.  Postcolonialism  has  as  its  building 
block  the  realization  that  knowledge  is  always  pro¬ 
duced  from  somewhere,  from  particular  social  and 
political  conditions,  and  that  knowledge  makers 
can  often  be  unaware  of  the  way  in  which  their 
knowledge  creates  the  “self”  and  the  “other” — that 
is,  how  it  constructs  visions  of  ourselves  and  how 
it  might  exclude  or  silence  “the  others.”  Edward 
Said  famously  argued  that  in  Western  knowledge 
of  the  Middle  East,  for  example,  an  Oriental  sub¬ 
ject  was  considered  as  a  particular  kind  of  inferior, 
barbaric  individual,  while  the  West  was  idealized 
as  civilized,  peaceful,  and  educated.  The  key  aim  of 
postcolonialism  is  to  challenge  such  biased  read¬ 
ings,  whether  it  be  in  international  law,  democrati¬ 
zation,  or  gender  relations/analysis. 

Green  Theory 

Green  theory  provides  another  important,  and 
for  many  a  timely,  corrective  to  the  way  in  which 
IR  theorists  have  framed  and  thought  about  the 
nature  of  world  politics.  Green  theorists  challenge 
the  assumption  that  the  “high  politics”  of  interna¬ 
tional  politics  concerns  matters  of  interstate  war, 
highlighting  instead  the  inherent  and  deeply  chal¬ 
lenging  nature  of  environmental  risks  as  a  crucial 
aspect  of  contemporary  world  politics.  Green  the¬ 
orists  also  challenge  the  ecology-blind  or  ecology- 
unfriendly  frameworks  of  theoretical  thought  that 
have  dominated  the  discipline.  They  seek  to  direct 
us  to  consider  the  ways  in  which  anthropocentric 
assumptions  about  humans  and  nature  have 
informed  IR  theorizing  to  the  exclusion  of  values 
of  interconnectivity  with  nature.  There  are  various 
strands  of  green  thought  and  theorizing,  extending 
from  more  limited  engagements  with  environmen¬ 
tal  thinking  to  “deep  ecological”  frameworks. 

Normative  Theory 

Normative  theory  is  a  subfield  of  IR  theorizing 
(rather  than  a  specific  school  of  thought  as  such) 
that  specializes  in  understanding  the  different  nor¬ 
mative  and  ethical  underpinnings  that  are  given  for 
actions  or  interpretations  in  IR.  Normative  theory 
was  ignored  in  the  discipline  by  the  realists  and 
neoliberals,  who  conceived  of  the  study  of  interna¬ 
tional  politics  in  purely  explanatory  and  value- 
neutral  terms.  Normative  theorists  have  been  at 


pains  to  point  out  that  normative  assumptions 
about  what  is  desirable  or  good  are  constantly 
made  by  IR  scholars — even  when  theorists  are 
incognizant  of  such  normative  assumptions.  Thus, 
normative  theorists  have  pointed  out,  for  example, 
that  realists  argue  not  just  for  a  particular  explana¬ 
tory  account  of  the  causes  of  war  but  also  implic¬ 
itly  in  so  doing  for  a  communitarian  ethic,  which 
sees  the  state  as  a  key  moral  community  in  interna¬ 
tional  politics.  Normative  theory  scholarship 
sometimes  also  goes  under  the  title  international 
political  theory  or  international  ethics ,  terms  that, 
while  not  interchangeable  with  the  idea  of  norma¬ 
tive  theory,  also  indicate  the  considerations  and 
intersections  of  normative  thought  with  political 
theory  and  international  law  as  well  as  ethics.  One 
of  the  key  issues  in  international  political  theory 
and  in  international  ethics  has  been  the  question  of 
just  war:  that  is,  what  constitutes  just  behavior  in 
conditions  of  war  and  in  justifications  for  war. 
Other  key  questions  concern  global  poverty  and 
the  duties  of  redistribution  of  wealth  globally,  as 
well  as  an  engagement  with  the  normative  implica¬ 
tions  of  dealing  with  climate  change. 

Themes  That  Divide  IR  Theorists 

As  seen  above,  there  are  a  number  of  and  diverg¬ 
ing  approaches  that  one  can  take  in  theorizing 
about  IR.  This  section  briefly  highlights  the  core 
differences  between  these  on  certain  key  issues. 

The  Scope  of  the  Field 

As  previously  mentioned,  a  key  line  of  division 
in  IR  theory  during  the  past  few  decades  has  con¬ 
cerned  the  scope  of  the  study  of  IR.  The  classical 
thinkers,  realists,  liberals,  and  the  English  School 
thinkers  had  a  clear  sense  of  the  scope  of  the  disci¬ 
pline:  It  focused  exclusively  on  the  question — the 
core  focus  of  IR — of  why  states  go  to  war  or  why 
they  decide  to  cooperate.  Yet  in  the  1980s,  a 
plurality  of  challenges  to  the  state  centrism  of  IR 
theorizing  emerged.  Critics,  from  feminists,  to 
poststructuralists,  to  postcolonialists,  have  argued 
that  IR  encompasses  not  just  interstate  relations 
but  also  the  transnational  politics  of  trade,  the 
ideological  construction  of  states,  the  nature  of 
policies  and  polities,  and  interpersonal  global  rela¬ 
tions  of  power. 
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With  this  development,  there  has  also  been  a 
strong  tendency  in  recent  years  to  import  into  IR 
theory  assumptions,  knowledge,  data,  and  models 
from  other  social  sciences.  This  is  evident  both 
from  the  increased  popularity  of  positivist 
approaches,  within  which  rational  choice  theoriz¬ 
ing,  an  interdisciplinary  movement  in  the  social 
sciences,  has  played  an  important  role,  and  from 
the  emergence  of  feminist,  Marxist,  and  poststruc¬ 
turalist  forms  of  thought.  There  has  been  an 
expansion  within  all  these  theoretical  schools  in 
cross-  and  interdisciplinary  research,  although 
some  doubts  still  remain  as  to  just  how  much  IR 
theorists  are  listened  to  in  other  fields. 

These  interdisciplinary  moves  have  generally 
been  welcomed  within  IR,  but  they  have  also  led  to 
discussions  about  the  boundaries  of  IR.  Indeed, 
what  is  IR  theory  today  as  opposed  to  economic, 
sociological,  or  political  theory?  There  is  no  fixed 
answer  among  IR  theorists  at  present,  which  has 
given  IR  theory  as  a  discipline  something  of  a 
brittle  basis.  Indeed,  some  have  argued  that  IR 
theory  as  a  project  has  failed  and  that  we  would  be 
better  off  doing  away  with  the  idea  of  IR  theory  as 
a  separate  body  of  thought  and  to  engage  in  gen¬ 
eral  social  theorizing  instead.  Alternatively,  many 
argue,  we  should  rename  the  field  and  its  theories 
by  using  the  terms  world  politics  or  global  sociol¬ 
ogy.  So  displeased  have  some  scholars  become  with 
the  parochial  insistence  of  some  IR  theorists  on  the 
discreteness  of  the  field  that  they  have  abandoned 
the  discipline  to  engage  theoretically  with  sociolo¬ 
gists,  economists,  lawyers,  or  historians  instead. 

Yet  the  social  structure  of  IR  persists.  As  Ole 
Waever  has  suggested,  IR  theory  is  not  just  an 
abstract  academic  exercise  but  also  a  social  structure 
within  academia.  IR  labels,  approaches,  and  posi¬ 
tions  continue  to  structure  the  research  of  academic 
scholarship,  arguments,  and  careers,  as  well  as  teach¬ 
ing.  Indeed,  given  the  deep  entrenchment  of  IR  and 
IR-theoretical  “identities”  in  academia  as  well  the 
intellectual  benefits  of  teaching  IR  in  terms  of  a  set  of 
discreet  theoretical  perspectives,  IR  theory  still  exists 
and  continues  to  be  framed  as  by  and  large  distinct  if 
linked  to  other  social-scientific  perspectives. 

Philosophy  of  Science,  Methods,  and  Epistemology 

Another  clear  line  of  division  in  IR  theory  has 
revolved  around  the  idea  of  positivism — that  is, 


around  the  claim  that  IR  can  and  should  be  stud¬ 
ied  by  means  of  scientific  methods  aimed  at  finding 
regular  patterns  of  behavior  and  interaction  and 
that  IR  scholarship  can  and  ought  to  be  objective 
and  value-neutral.  The  methods  of  knowledge  con¬ 
struction  in  IR  have  always  been  a  subject  of 
debate  in  IR,  and  positivism  has  not  always  been 
influential.  Carr  and  Morgenthau,  for  example, 
were  strong  advocates  of  a  historical  and  interpre- 
tivist  approach  and  adverse  to  positivist  attempts 
to  systematize  IR  into  a  study  of  regular  patterns 
and  variables.  It  was  only  by  the  1960s  and  1970s 
that  a  positivist  (or  behavioralist)  worldview 
obtained  a  strong  foothold  in  American  IR,  and  as 
a  result,  there  has  been  a  distinct  turn  within  it 
toward  systematization  and  the  observational  test¬ 
ing  of  theories. 

Despite  the  central  position  of  positivism  within 
IR,  the  postpositivist  movement  of  recent  decades 
has  posed  a  strong  challenge  to  this  philosophy  of 
science.  Indeed,  the  third  and  fourth  debates  in  IR 
were  brought  on  by  the  head-on  collision  between 
mainstream  American  IR  scholars’  advocacy  of 
positivist  language  and  methods,  on  the  one  hand, 
and  the  claim  by  many  critical  theorists  (many  of 
them  outside  the  United  States),  on  the  other,  that 
this  view  of  science  was  blind  to  many  important 
theoretical  developments  and  unfairly  delegiti- 
mized  approaches  that  did  not  conform  to  its 
research  agenda. 

The  sharp  dichotomization  on  how  to  generate 
knowledge  in  IR  (a  question  that  involves  differ¬ 
ences  in  philosophy  of  science,  epistemology,  and 
methodology)  was  given  both  an  explanation  and  in 
some  senses  a  justification  by  Martin  Hollis  and 
Steve  Smith’s  famous  Explaining  and  Understanding 
International  Relations ,  which  argued  that  there  is  a 
fundamental  philosophical  dualism  at  work  in  the 
social  sciences,  including  IR,  which  makes  it  impos¬ 
sible  to  reach  an  agreement  on  whether  the  aim 
of  IR  should  be  generalization  (positivism)  or 
interpretive  understanding  (postpositivism).  Both 
approaches,  Hollis  and  Smith  argued,  have  their 
strengths  and  their  weaknesses,  but  they  remain 
mutually  exclusive.  Hollis  and  Smith’s  “two-stories 
account”  has  had  a  strong  impact  during  the  past 
few  decades,  especially  in  Europe.  Nevertheless,  it 
has  been  challenged,  notably  in  the  form  of  the  so- 
called  critical-realist  movement,  which  seeks  to  tran¬ 
scend  the  polarization  of  positivist  and  postpositivist 
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scholarship  by  advancing  a  new  philosophy  of 
science  that  is  critical  of  both  the  objectivism  of 
positivism  and  the  relativist  excesses  of  some 
forms  of  postpositivism. 

Theory  Versus  Practice 

There  has  also  been  a  deep  disagreement  among 
IR  theorists  about  the  aims  of  IR  theorizing. 
Notably,  there  is  a  division  in  the  discipline 
between  those  that  see  a  strong  policy-making  role 
for  IR  theory  and  those  who  would  like  to  refrain 
from  excessive  interaction  with  policymakers  and 
their  particular  interests. 

A  famous  exchange  between  William  Wallace, 
Ken  Booth,  and  Steve  Smith  during  1996  to  1997 
brought  out  the  dynamics  of  the  theory-versus- 
practice  debate  very  well.  While  for  Wallace  IR  as 
a  discipline  had  become  too  abstractly  theoretical, 
with  many  (especially  critical)  theorists  retreating 
to  their  ivory  towers  and  disengaging  from  policy 
making,  for  Booth  and  Smith  the  problem  was  that 
(a)  not  all  academics,  from  all  theoretical  schools 
of  thought,  would  be  freely  invited  to  engage  with 
policymakers  because  of  the  inherent  biases  and 
interests  that  drive  decision  making  and  (b)  the 
practice  of  teaching  and  creating  alternative  views 
of  knowledge  could  itself  be  seen  as  a  form  of 
political  practice,  potentially  being  more  influen¬ 
tial  than  entering  into  the  halls  of  power. 

There  seems  to  be  no  solution  to  the  theory- 
practice  dilemma  in  contemporary  IR  theory,  but 
the  accusation  that  IR  theory  is  too  abstract  to  be 
useful  continues  to  be  made  by  some  scholars  and 
policymakers.  At  the  same  time,  some  theorists 
complain  of  policymakers’  lack  of  interest  in  alter¬ 
native  ways  of  framing  and  understanding  prob¬ 
lems.  The  picture  is  in  other  words  mixed:  While 
some  IR  theorists  are  closely  engaged  in  policy 
making,  others  have  decided  not  to  engage  with,  or 
are  pushed  out  from,  policy-making  circles. 

Trends  in  IR  Theory 

As  emphasized  and  exemplified  above,  IR  theory  is 
strongly  characterized  by  theoretical  pluralism. 
But  what  are  the  key  debates  in  IR  theory  at  pres¬ 
ent  and  what  are  they  likely  to  be  in  the  future? 
This  is  a  difficult  question,  for  it  depends  on  whom 
one  asks  and  where  one  looks. 


Interestingly,  it  seems  that  there  are  distinct 
geographical  differences  in  what  kinds  of  IR  the¬ 
ory  are  seen  as  preponderant  or  promising.  In  the 
United  States,  it  is  fair  to  say  that  realism  and  lib¬ 
eralism  in  their  neo-forms,  as  well  as  increasingly 
in  their  more  classical  forms  (neoclassical  realism 
and  Kantian  liberalism),  play  a  key  role.  Realism 
and  liberalism  are  arguably  dominant  also  in 
debates  on  IR  in  China  and  Russia.  In  addition, 
constructivist  analysis  is  increasingly  being  taken 
seriously  in  the  mainstream  American  context. 
Indeed,  it  seems  that  currently  these  three  theoretical 
schools — realism,  liberalism,  and  constructivism — 
usefully  cross-fertilize  each  other. 

Yet,  curiously,  while  this  triumvirate  of  theo¬ 
retical  approaches  dominates  and  frames  debates 
in  key  journals  such  as  International  Organization 
and  International  Studies  Quarterly ,  it  could  also 
be  argued  that  increasingly,  theoretical  debates  are 
currently  being  conducted  largely  within  and  not 
just  between  these  theoretical  schools.  Indeed, 
much  of  realist  debate  at  present  concentrates  on 
arguments  between  offensive  and  defensive  real¬ 
ists,  and  neoliberals  have  also  turned  inward  to 
discuss  different  ways  of  understanding  institu¬ 
tional  logics.  Thus,  there  is  a  curious  fragmenta¬ 
tion  of  debates  in  IR,  as  Waever  has  pointed  out 
(see  his  chapter  in  Dunne  et  ah,  2007). 

Different  perspectives  tend  to  dominate  in 
Europe.  First,  it  is  notable  that  constructivism  is 
very  popular  in  Europe,  especially  in  Germany  but 
also  within  the  UK.  Yet  in  both  the  UK  and  on  the 
Continent  the  scene  is  also  more  varied.  Realists 
tend  to  be  few,  while  critical  theorists  of  various 
persuasions  have  a  strong  position  in  the  field.  In 
the  UK,  for  example,  Marxism  and  Gramscian 
thought  still  play  key  roles  in  debates,  as  does 
critical  theory,  both  in  its  classical  and  poststruc¬ 
turalist  formulations. 

IR  theory,  albeit  sometimes  in  fragmented 
forms,  continues  to  be  developed  and  reconfig¬ 
ured.  However,  it  seems  to  develop  in  different 
ways  in  different  geographical  contexts.  This  is 
partly  because  IR  theory  does  not  develop  in  a 
vacuum:  Like  any  social  theory,  it  is  tied  to  specific 
social  and  political  contexts  and  interests.  With 
this  in  mind,  it  is  important  to  note  that  IR  theory 
continues  to  be  mainly  an  Anglo-American  disci¬ 
pline  and  that  theoretical  voices  from  Latin 
America,  Africa,  or  Asia  are  rarely  listened  to,  and 
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they  certainly  do  not  structure  the  field  (but  often 
seek  to  fit  in  it). 

In  any  case,  and  despite  its  many  failings  and 
silences,  IR  theory  is  by  no  means  on  its  deathbed. 
As  Alexander  Wendt  and  Duncan  Snidal,  the  edi¬ 
tors  of  the  new  International  Theory  journal,  have 
argued,  in  many  ways  international  theorizing  is  at 
its  liveliest  at  the  present  juncture:  There  is  a  great 
deal  of  work  being  done  in  different  traditions, 
and  different  theoretical  tools  are  used  by  the 
scholars  involved.  Moreover,  IR  theory  continues 
to  be  firmly  embedded  in  the  social  and  institu¬ 
tional  structures  of  academia.  In  other  words,  it 
continues  in  being  a  placeholder — even  if  a  con¬ 
tested  one — for  many  scholars  trying  to  grapple 
with  international  and  world  political  trends  as 
well  as  with  the  plurality  of  approaches  being  used 
by  their  colleagues  to  explain  these. 

Milja  Kurki 
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Aberystwyth ,  United  Kingdom 
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International  Relations 
as  a  Field  of  Study 


The  field  of  international  relations  (commonly 
abbreviated  IR)  focuses  on  a  variety  of  subjects. 
The  many  connotations  that  are  usually  associated 
with  the  term  relations  (one  of  the  most  under- 
specified  terms  in  the  field  itself)  and  the  aesthetic 
quality  that  accompanies  relating  the  name  of  the 
field  (IR)  to  a  broad  set  of  subjects  subsumed  under 
the  same  term  in  lowercase  letters,  “international 


Note:  Due  to  limitations  of  space,  the  number  of  references 
had  to  be  restricted.  A  fully  referenced  version  of  this  article 
is  available  at  http://www.soz.uni-frankfurt.de/hellmann/ 
mat/IPSA.pdf. 
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relations,”  help  explain  why  both  IR  and  interna¬ 
tional  relations  are  still  widely  accepted.  Of  course, 
this  is  not  to  say  that  there  is  consensus.  Both  the 
name  of  the  field  and  the  description  of  its  subject 
matter(s)  have  always  been  contested.  Different 
observers  have  argued  that  international  ought  to 
be  replaced  by  interstate,  transnational,  or  global — 
to  name  just  a  few.  Others  would  like  to  see  rela¬ 
tions  replaced  by  studies  or  politics.  A  brief  look  at 
some  of  these  alternative  combinations — for 
example,  “interstate  relations,”  “transnational 
politics,”  or  “global  studies” — would  give  any 
reader  a  quick  idea  as  far  as  different  emphases 
are  concerned,  even  if  he  or  she  is  not  familiar 
with  the  normative  and  theoretical  underpinnings 
that  inform  these  alternative  descriptions  of  the 
field  of  study  and  its  subject  matter(s).  For  this 
reason,  conceptual  contestation  is  unsurprising;  it 
is  already  an  expression  of  the  inevitable  and 
recurring  ascertainment  of  the  borders  of  a  field  of 
study  by  the  community  of  scholars  belonging  to 
it  and  claiming  it  as  their  own. 

In  the  case  of  IR,  contestation  extends  well 
beyond  the  question  of  how  a  rather  loosely  defined 
field  of  study — or  “fragmented  ‘nonfield’”  (James 
Rosenau,  1993) — is  to  be  properly  named. 
Especially  in  the  English-speaking  world,  IR  is 
sometimes  defined  in  terms  of  an  academic  disci¬ 
pline  of  its  own,  separate  from  political  science,  or 
as  a  multidisciplinary  field  of  study.  On  a  global 
scale,  however,  this  is  not  normally  the  case.  Here, 
it  is  usually  considered  to  be  one  of  the  major  sub¬ 
fields  (or  subdisciplines)  of  political  science.  Even  if 
the  term  discipline  may  sometimes  be  used  inter¬ 
changeably  with  field  of  study,  it  is  meant  in  the 
sense  of  a  more  loosely  defined  field  that  keeps  the 
outer  borderlines  both  fairly  fluid  and  permeable 
while,  at  the  same  time,  emphasizing  that  its  core  is 
more  clearly  demarcated  and  in  some  ways  also 
more  stable.  Of  course,  any  such  characterization 
of  the  field  is  in  itself  contestable.  More  specifically, 
two  caveats  need  to  be  kept  in  mind  with  regard  to 
any  exercise  in  “mapping”  a  field  of  study.  First,  as 
Hilary  Putnam  put  it,  any  such  overview  necessar¬ 
ily  involves  a  particular  “view  from  somewhere” 
that  is  shaped  by  specific  individual  and  cultural 
formative  experiences.  Even  if  this  view  of  IR  is 
shaped  by  an  environment  that  allows  for  pluralism 
with  regard  to  normative  commitments,  theoretical 
orientations,  and  methodological  preferences,  it  is 


inevitably  limited  by  what  IR  scholarship  is  actu¬ 
ally  accessible  via  particular  languages  and  aca¬ 
demic  infrastructures.  The  academic  infrastructure 
of  IR  is  lacking  in  crucial  respects  when  measured 
against  the  ideal  of  a  global  discipline  that  is  living 
up  to  the  spread,  reach,  and  interconnections  of  its 
subject  matter.  Second,  there  is  also  an  inevitable 
temporal  dimension  of  contestation.  Any  overview 
of  IR  as  a  field  of  study  necessarily  resembles  a 
snapshot  of  the  field  at  a  particular  point  in  time 
and  will,  almost  inevitably,  be  the  view  of  how  the 
discipline  used  to  operate  in  the  past.  This  notion 
of  IR  as  an  evolving  and  historically  situated  field 
becomes  strikingly  clear  when  one  compares  simi¬ 
lar  overviews  of  the  field  in  approximately  10-year 
intervals  from  the  early  1920s  onward.  Not  only  do 
descriptions  of  the  subject  matter  change.  Rather, 
change  is  ubiquitous  with  regard  to  the  borderlines 
drawn  to  other  (sub)fields  and  the  names  used  to 
denote  and  demarcate  the  field’s  most  prominent 
theories.  Therefore,  this  overview  ought  to  be  seen 
as  a  “disciplining”  exercise  in  the  dual  sense  of  the 
word.  It  is  supposed  to  provide  a  perspective  on  the 
structure  of  the  discipline  and  familiarize  the  reader 
with  some  of  the  prominent  conventions,  theories, 
and  practices  of  the  field  of  IR  as  they  are  currently 
viewed  in  the  field  in  terms  of  a  history  of  the  pre¬ 
sent.  At  the  same  time,  it  ought  to  be  kept  in  mind 
that  the  very  concept  of  scholarship  points  at  mov¬ 
ing  beyond  these  conventions,  theories,  and  prac¬ 
tices.  By  de-emphasizing  disciplinary  stability  in 
favor  of  an  evolutionary  perspective,  this  way  of 
proceeding  does  not  deny  that  a  structural  view  of 
the  discipline  may  be  useful.  As  a  matter  of  fact,  it 
is — and  such  a  structural  perspective  will  be  applied 
in  the  first  section  of  this  overview.  However,  look¬ 
ing  at  the  discipline  with  a  bird’s-eye  perspective 
necessarily  emphasizes  the  big  picture  and  will  thus 
almost  inevitably  appear  fairly  static.  The  second, 
shorter  section  of  this  entry  therefore  applies  a 
more  historical  and  dynamic  perspective  depicting 
the  field  as  an  expanding  one  along  many  frontiers 
simultaneously. 

The  Structure  of  Global 

International  Relations 

What  a  field  is  made  up  of  in  terms  of  intellectual 
substance — that  is,  its  conceptualization  of  the 
subject  matter,  its  theories,  and  its  understandings 
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of  appropriate  procedures  in  producing  knowl¬ 
edge — is  not  dictated  by  the  subject  matter  itself. 
Rather,  it  results  from  the  interplay  of  specific 
social  structures  (such  as  institutional  arrange¬ 
ments  along  disciplinary  lines  within  universities  or 
structures  of  communication  in  the  form  of  jour¬ 
nals,  etc.)  and  intellectual  structures  (i.e.,  what 
counts  as  knowledge  and  how  different  bodies  of 
knowledge  connect  to  make  up  a  discipline).  Both 
are  closely  interconnected.  As  far  as  the  social 
structure  of  IR  is  concerned,  Ole  W sever  (2007)  has 
argued  that  it  is  best  viewed  as  “a  mix  of  a  U.S./ 
global  system  and  national/regional  ones  with  var¬ 
ying  degrees  of  independence”  (p.  296).  Thus,  two 
elements  are  characteristic  of  IR  in  terms  of  its 
global  structure.  First,  IR  in  the  United  States  is 
predominantly  North  American  IR  and  global  IR. 
Second,  other  IR  communities  show  a  great  variety 
in  terms  of  size  and  intellectual  traditions.  However, 
what  is  most  noticeable  from  a  global  perspective 
is  the  extent  to  which  they  relate  to  IR  in  the  United 
States.  This  is  another  way  of  saying  that  Stanley 
Hoffmann’s  (1977)  famous  line  about  IR  being  “an 
American  social  science”  reverberates  until  today. 
Yet  although  the  dominance  of  American  IR 
remains  clearly  visible,  a  broad-brush  global  per¬ 
spective  on  disciplinary  developments  would  stress 
the  distinction  between  the  West  and  the  non-West 
rather  than  the  United  States  versus  the  rest.  One  of 
the  distinguishing  marks  of  the  current  develop¬ 
mental  stage  of  IR  from  a  global  and  evolutionary 
perspective  may  well  be  its  post-Western  and  its 
post-Westphalian  character.  However,  from  a 
bird’s-eye  view  of  the  structures  of  the  discipline, 
the  West  seems  very  well  entrenched. 

Size  and  Power 

Admittedly,  the  IR  community  in  the  United 
States  still  plays  in  a  league  of  its  own.  Just  in  terms 
of  sheer  size,  it  easily  outdistances  that  of  any  other 
country  by  multiples.  A  2009  survey  by  Richard 
Jordan,  Daniel  Maliniak,  Amy  Oakes,  Susan 
Peterson,  and  Michael  Tierney  counted  more  than 
4,100  IR  scholars  in  the  United  States  with  an 
active  affiliation  with  a  university,  college,  or  pro¬ 
fessional  school.  (As  of  early  2009,  the  American 
Political  Science  Association  listed  close  to  4,700 
members  who  had  identified  IR  as  their  “general 
field.”)  Although  similarly  detailed  figures  are  not 


available  for  many  countries,  a  very  rough  esti¬ 
mate  based  on  a  variety  of  sources  would  probably 
put  the  U.S.  share  of  IR  scholars  around  the  world 
at  approximately  25%  to  30%.  Another  15%  to 
25  %  could  probably  be  added  for  Canada,  Europe, 
Israel,  and  Australasia,  putting  the  Western  share 
of  global  IR  production  capacity  at  40%  to  55% 
overall. 

Although  the  size  of  academic  communities  is 
an  important  structural  feature,  in  and  of  itself  it 
says  little  about  the  global  structure  of  IR  in  terms 
of  institutional,  structural,  and  productive  power. 
Yet  even  if  one  adds  such  a  perspective,  it  is  fairly 
clear  that  the  United  States  occupies  the  most 
influential  position  at  the  center,  with  European  IR 
communities  plus  Israel  and  Australasia  forming 
an  appendix  to  the  core  that  tries  to  establish  a 
somewhat  more  independent  profile  (and  has 
partly  succeeded  in  doing  so).  Three  observations 
are  noteworthy  in  this  context.  First,  the  institu¬ 
tional  and  structural  power  of  U.S.  IR  is  reflected 
in  how  IR  scholars  in  other  parts  relate  to  it.  For 
IR  scholars  in  Western  Europe,  Israel,  and  South 
Asia,  and  to  a  lesser  degree  in  East-Central  Europe 
and  some  parts  of  Latin  America,  gaining  recogni¬ 
tion  in  the  United  States  (i.e.,  in  U.S.-based  IR 
journals  in  particular)  continues  to  be  a  crucial 
element  for  professional  advancement  compared 
with  IR  scholars  elsewhere.  Since  the  editors  of  key 
journals  are  drawn  largely  from  the  IR  community 
in  the  United  States,  scholars  aiming  at  being  pub¬ 
lished  in  these  journals  have  to  address  the  con¬ 
cerns  of  this  community.  Yet  as  many  studies  have 
shown,  the  theoretical  debates  in  the  United  States 
are  largely  driven  by  American  foreign  policy  con¬ 
cerns,  not  broader  global  concerns.  A  2009  survey 
by  Tom  Biersteker  of  the  assigned  or  required 
readings  for  PhD  candidates  specializing  in  IR  in 
the  10  leading  U.S.  departments  of  political  science 
showed  that,  on  average,  94%  of  the  assigned 
readings  were  written  by  scholars  who  have  spent 
most  or  all  of  their  careers  in  the  United  States.  As 
a  result,  the  segment  of  global  IR  scholarship  that 
aims  at  the  most  prestigious  journals  in  the  field 
will  inevitably  face  editors  whose  academic  careers 
have  been  largely  shaped  by  U.S.  concerns.  Non- 
American  Western  IR  scholars  are  more  likely  to 
be  able  to  meet  the  expectations  of  these  editors, 
and  to  the  extent  that  their  work  is  actually  being 
published,  it  is  therefore  also  more  likely  to  speak 
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to  an  agenda  shared  by  Western  societies  and 
states.  The  same  applies  vice  versa  for  the  increas¬ 
ing  number  of  U.S.  scholars  being  published  in 
European  peer-reviewed  journals  such  as  European 
Journal  of  International  Relations  and  Journal 
of  International  Relations  and  Development. 
Although  the  differences  between  the  United  States, 
on  the  one  hand,  and  Europe  and  Australasia,  on 
the  other,  are  noteworthy  and  have  often  been 
described,  these  differences  fade  away  against  the 
many  similarities  if  one  contrasts  IR  research  prac¬ 
tices  and  priorities  in  the  West  as  a  whole  with 
those  in  the  non-West. 

The  second  observation  extends  the  emphasis 
on  disciplinary  autonomy  within  largely  national 
borders  from  the  United  States  to  the  global  level. 
Although  the  orientation  toward  the  United  States 
and  its  standards  of  IR  scholarship  in  a  fairly 
small  (though  influential)  set  of  countries  rein¬ 
forces  American  dominance,  it  is  by  far  not  a 
universal  phenomenon.  A  global  structure  domi¬ 
nated  by  Western  standards  of  science  and  the¬ 
matic  agendas  coexists  with  significant  degrees  of 
local  autonomy  in  IR  communities  around  the 
world,  according  to  Arlene  Tickner  and  Ole 
Wtever.  For  a  field  focusing  on  phenomena  that 
by  their  very  nature  transcend  national  boundar¬ 
ies,  it  is  noteworthy  how  parochial  (or  detached 
from  a  truly  global  discourse)  all  IR  communities 
are  around  the  world.  This  is  amply  visible  in  an 
almost  universal  preoccupation  with  the  foreign 
policy  agendas  of  the  respective  countries.  In  the 
context  of  the  United  States  and  intellectually 
linked  IR  communities  such  as  Europe,  this  con¬ 
cern  is  embedded  in  or  dominated  by  an  explicit 
theoretical  framing  of  specific  problems.  In  many 
other  countries  where  the  intellectual  structure  of 
the  discipline  is  less  dominated  by  the  imperatives 
of  theory  production,  these  foreign  policy  agendas 
often  translate  quite  directly  into  research  agen¬ 
das.  Therefore,  a  certain  parochialism  seems  to  be 
an  almost  inevitable  and  universal  characteristic 
of  IR  globally.  In  part,  this  is  also  due  to  the  fact 
that  the  social  structures  of  the  academy  have 
their  own  life  and  in  many  ways  follow  national 
patterns.  Sometimes  national  IR  communities  may 
be  clustered  into  regional  groups  with  distinct 
characteristics,  such  as  an  “Anglo-American”  way 
of  doing  IR  or  a  “Continental”  one.  However, 
national  profiles  often  remain  clearly  visible.  For 


instance,  even  a  quick  look  at  the  social  and 
intellectual  structure  of  IR  in  Italy,  France, 
Germany,  and  the  Netherlands  immediately 
reveals  remarkable  differences  as  well  as  similari¬ 
ties  in  the  relatively  narrow  space  of  the  European 
continent.  In  a  longer  historical  perspective,  how¬ 
ever,  the  patterns  of  academic  institutionalization 
and  professionalization  of  IR  during  the  20th 
century  have  also  converged,  often  following  the 
American  model  of  treating  IR  as  a  more  or  less 
integral  part  or  subdiscipline  of  political  science. 

Third,  it  certainly  matters  whether  one  pursues 
(more  or  less  parochial)  IR  concerns  in  the  United 
States,  Britain,  Denmark,  China,  Thailand,  Nigeria, 
or  Brazil,  to  name  just  a  few  sites.  Although  this 
entry  is  not  the  place  for  a  detailed  analysis  of  the 
global  social  structure  of  IR  (on  this,  see  Tickner 
&  Waever,  2009),  three  layers  of  IR  communities 
can  be  distinguished  in  terms  of  power,  interna¬ 
tional  connectedness,  and  international  visibility. 

International  Relations  as 
a  Three-Tiered  Discipline 

The  Dominance  of  U.S.-American  IR 

As  mentioned  before,  first,  the  IR  community  in 
the  United  States  stands  out  in  terms  of  size  and 
power.  In  the  United  States,  whether  or  not  an  IR 
scholar  engages  with  or  is  knowledgeable  about  IR 
elsewhere  has  little  influence  with  regard  to  profes¬ 
sional  advancement,  and  thus,  U.S.  scholars  may 
neglect  the  rest  of  the  IR  world.  Second,  to  the 
extent  that  other  IR  communities  relate  to  U.S.  dis¬ 
courses,  many  largely  emulate  or  implicitly  follow 
the  U.S.  model  and  the  theories  propagated  there. 
This  can  be  taken  by  some  IR  scholars  as  evidence 
of  the  institutional  and  productive  power  exercised 
by  American  IR  or  even  as  evidence  that  the 
American  way  of  doing  IR  is  the  “right”  way. 
Third,  to  the  extent  that  IR  communities  in  other 
countries  are  essentially  decoupled  from  U.S.  (and 
thus  globally  dominant)  discourses,  this  is  mostly 
due  to  the  fact  that  local  political  concerns  domi¬ 
nate  research  agendas  and  that  theory  (as  defined  in 
the  U.S. -dominated  discourse)  is  largely  irrelevant. 
Therefore,  IR  scholars  from  the  United  States  may 
find  little  incentive  to  investigate  IR  in  such  states. 
Against  this  background,  it  is  worth  emphasizing 
that  the  International  Studies  Association  (ISA) — 
the  most  important  professional  organization  of 
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IR/international  studies  around  the  globe  that 
originated  in  the  United  States  and  still  largely 
comprises  U.S.  scholars  as  members — has  been 
quite  supportive  of  efforts  to  build  up  professional 
structures  of  academic  communication  beyond  it. 
Yet  despite  these  efforts,  ISA  conventions — one  of 
the  premier  sites  of  intellectual  exchange  about  IR 
on  the  global  stage — have  largely  remained  intra- 
Western  scholarly  exchanges.  Whereas  a  system¬ 
atic  analysis  of  attendance  patterns  at  the  ISA 
conventions  in  Toronto  (1997),  Portland  (2003), 
and  New  York  (2009),  based  on  the  institutional 
origin  of  the  respective  scholars  attending,  shows  a 
noteworthy  increase  of  European  Union-European 
scholars  (16%  in  1997,  15%  in  2003,  and  25%  in 
2009)  relative  to  North  American  scholars  (United 
States  plus  Canada:  76%  in  1997,  74%  in  2003, 
and  62%  in  2009),  there  was  essentially  no  change 
if  one  compared  attendance  along  a  Western  ver¬ 
sus  non-Western  distinction  (see  Figure  1). 

The  Second  Tier  of  Semivisible  IR  Communities 

Compared  with  U.S.  insularity,  the  situation  is 
quite  different  if  one  looks  at  IR  from  a  global 
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Figure  I  Origin  of  Scholars  Attending  International 
Studies  Association  (ISA)  Conferences 

Source:  Author’s  analysis  based  on  the  convention  programs 
as  they  were  made  available  via  the  ISA  website:  http://www 
.isanet.org/conventions/2007/04/previous_isa_an.  html. 
Scholars  were  grouped  according  to  the  state  of  their 
institutional  affiliation,  not  their  nationality. 

Note:  “Western  world”  comprises  OECD  (Organisation  for 
Economic  Co-operation  and  Development)  member 
countries  (including  Israel  but  excluding  Japan  and  Turkey). 


perspective  with  regard  to  a  second  category  of  IR 
communities,  such  as  Britain,  China,  Denmark,  or 
Canada.  As  different  as  these  IR  communities  are 
in  terms  of  university  systems,  professional  incen¬ 
tive  structures,  and  national  traditions  and  as  much 
as  the  American  dominance  may  be  resented  in 
some  quarters  there,  many  of  the  influential  IR 
scholars  in  these  countries  do  (have  to)  relate  to 
American  IR  in  one  way  or  another,  if  only  by  dis¬ 
tancing  themselves  as  to  the  way  IR  is  done  in  the 
United  States.  Therefore,  what  distinguishes  these 
communities  from  the  U.S.  IR  community,  on  the 
one  hand,  and  a  third  category  of  peripheral  IR 
communities,  on  the  other,  is  a  medium  level  of 
international  visibility.  International  visibility 
(defined  in  terms  of  a  certain  amount  of  recogni¬ 
tion  by  other  scholars  around  the  world)  may 
result  from  very  different  sources,  such  as  the  natu¬ 
ral  advantage  of  communicating  in  the  lingua 
franca  (as  in  the  case  of  British  IR)  and/or  the  rec¬ 
ognition  by  significant  others  that  the  work  pub¬ 
lished  in  these  communities  matters,  be  it  for  purely 
academic  reasons  or  due  to  other  considerations. 

a.  Among  the  latter,  one  can  single  out  the  phe¬ 
nomenon  of  national  “schools”  of  IR,  which  pro¬ 
vide  for  a  specific  variant  of  this  second  type  of  IR 
communities  with  a  medium  level  of  international 
visibility.  The  so-called  English  School,  for  instance, 
represents  a  conscious  effort  on  the  part  of  schol¬ 
ars  in  the  United  Kingdom  (UK)  to  establish  the 
idea  of  an  international  “society  of  states”  as  a 
distinct  theoretical  concept  synthesizing  elements 
that  have  been  assigned  in  American  IR  to  compet¬ 
ing  (realist  and  liberal)  schools  of  thought.  In  con¬ 
trast  to  American  IR,  the  English  School  scholarship 
exhibits  a  deep-seated  skepticism  vis-a-vis  the 
“scientific”  study  of  international  relations  and 
accordingly  pays  much  more  attention  to  historical 
processes.  These  substantive  differences  notwith¬ 
standing,  insiders  have  argued  that  the  English 
School  essentially  marks  “a  delayed  response  to 
Britain’s  loss  of  Empire  and  world  status”  (Richard 
Little,  2008,  pp.  685-686).  In  forming  such  a 
school,  however,  Britain’s  IR  community  has  not 
only  tried  to  come  to  terms  with  the  changed  inter¬ 
national  role  of  the  UK.  It  has  also  left  a  mark  of 
distinction  vis-a-vis  the  quasi-hegemonic  U.S.  dis¬ 
course,  thereby  establishing  a  widely  recognized 
corporate  IR  identity  globally.  IR  communities  in 
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other  countries  that  are  habitually  ranked  among 
the  “great  powers”  (e.g.,  Russia,  Japan,  and 
China)  are  increasingly  engaged  in  debates  whether 
or  not  to  establish  national  schools  similar  to  the 
English  School  (even  South  Korea  is  contemplating 
such  a  strategic  move).  China  is  the  most  obvious 
and  most  noteworthy  case,  both  because  of  the  size 
of  the  country  and  due  to  the  Eastern  tradition. 
Influential  Chinese  scholars  nowadays  openly 
advocate  the  establishment  of  a  “Chinese  School 
of  IR  Theory”  as  an  “inevitable”  step  in  the  matu¬ 
ration  of  Chinese  IR  (Yaquing  Qin,  2007).  Yet  as 
in  the  case  of  the  English  School,  the  emphasis  on 
building  such  a  distinct  Chinese  school  on  a  set  of 
core  assumptions  about  the  “material  world”  and 
the  “speculative  world,”  distinct  from  dominant 
IR  theories  in  the  West  in  general  and  the  United 
States  in  particular,  only  reinforces  the  picture  of 
the  global  preeminence  of  a  particular  U.S.-led 
type  of  theorizing  centered  on  realist  theory  and  its 
competitors  as  well  as  on  rationalist  and  construc¬ 
tivist  approaches  to  doing  IR  from  an  epistemo¬ 
logical  and/or  methodological  point  of  view.  It  is 
against  this  background  that  scholars  from  non- 
Western  IR  communities  feel  at  least  uncomfort¬ 
able  if  not  offended  if  they  are  asked  why  they 
have  not  yet  come  up  with  some  IR  theory  of  their 
own.  A  special  issue  of  the  journal  International 
Relations  of  the  Asia-Pacific  carried  the  question 
“Why  is  there  no  non-Western  IR  theory?”  quite 
prominently  in  the  title  (Amitav  Acharya  &  Barren 
Buzan,  2007).  To  be  sure,  the  whole  thrust  of  the 
project  was  based  on  the  intention  to  stimulate  a 
debate  about  and  the  development  of  non-Western 
IR  theory  in  Asia.  Yet  the  way  the  question  was 
framed  already  carried  a  specific  understanding  as 
to  what  good  IR  is  all  about — that  is,  that  it  ought 
to  put  a  premium  on  theory,  that  it  is  fairly  obvi¬ 
ous  what  distinguishes  theory  from  nontheory, 
and  that  IR  communities  in  states  such  as  China 
are  not  (yet)  doing  enough  good  theory.  Judging 
from  recent  trends,  some  segments  in  countries 
such  as  China  appear  willing  to  take  up  this  chal¬ 
lenge.  In  part,  this  is  due  to  a  spreading  realization 
among  IR  scholars  that  IR  theories  (broadly 
defined)  do  not  only  structure  our  view  of  the 
world  in  a  very  basic  sense  but  that  they  are  also 
tools  for  governing  the  world.  In  this  analysis, 
both  the  world  and  IR  often  appear  as  if  governed 
by  the  United  States.  Empirical  analyses  by  Chinese 


scholars  show  that  this  power  of  disciplinary  social¬ 
ization  via  Western,  especially  U.S.,  theory  dis¬ 
courses  works  even  in  a  country  such  as  China  with 
a  large  IR  community  and  a  very  distinct  and  old 
tradition  of  its  own.  For  instance,  with  the  excep¬ 
tion  of  one  book,  all  of  the  86  IR  books  translated 
into  Chinese  by  five  leading  publishers  since  1990 
were  originally  written  in  English,  and  the  over¬ 
whelming  majority  of  these  books  had  American  IR 
scholars  as  authors.  Thus,  even  if  a  Chinese  school 
of  IR  drawing  heavily  on  distinctly  Chinese  tradi¬ 
tions  emerges  eventually,  it  will  have  been  mediated 
via  theory  as  practiced  in  the  English-speaking, 
mostly  U.S.-dominated,  Western  world  of  IR. 

b.  A  similar  mixture  of  orientation  toward 
American  IR  while  tying  IR  scholarship  back  to 
local  concerns  and  intellectual  traditions  is  observ¬ 
able  in  a  second  variant  of  internationally  more 
visible  IR  communities.  Some  IR  communities  in 
Western  and  East-Central  Europe  and  in  Israel 
belong  to  this  group.  What  unites  them  in  terms  of 
institutional,  structural,  and  productive  power  is 
that  they  can  only  draw  in  rather  limited  ways  on 
the  advantages  that  the  British  or  the  Chinese  IR 
communities  enjoy.  While  IR  scholars  in  signifi¬ 
cant  numbers  in  these  communities  publish  in 
English,  this  is  certainly  not  the  case  for  all  of  them 
(as  is  the  case  in  the  UK).  Also,  in  contrast  to 
China,  none  of  these  states  are  expected  to  play  a 
crucial  role  politically  in  the  years  to  come  in  order 
to  pay  special  attention  to  their  possibly  unique 
ways  of  doing  IR  research.  Moreover,  in  contrast 
to  the  UK  or  China,  most  of  these  IR  communities 
neither  have  the  size  nor  the  ambition  to  establish 
distinct  national  schools  of  IR.  Still,  in  some  coun¬ 
tries,  substantive  research  programs,  such  as  the 
Copenhagen  School  of  security  studies,  have  gained 
international  recognition  as  distinct  approaches 
well  beyond  the  regional  context.  In  the  case  of  the 
Copenhagen  School’s  “securitization”  research 
program,  distinction  was  achieved  with  a  more 
focused  approach  (in  terms  of  both  the  theoretical 
scope  and  the  substantive  ground  covered  as  far  as 
the  subject  matter  was  concerned)  than  what  is 
usually  subsumed  under  the  much  broader  label  of 
a  “paradigm”  in  the  U.S.  context.  Debates  in  such 
communities  are  similar  to  those  in  the  United 
States  in  putting  a  premium  on  theoretical  work. 
However,  even  though  theory  debates  in  the 
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United  States  are  at  least  taken  note  of  in  most  of 
these,  they  are  not  (or  no  longer)  simply  replicated. 
Rather,  an  increasing  amount  of  IR  in  Europe, 
Australia,  New  Zealand,  or  Canada  is  by  now 
often  inspired  by  philosophical  and/or  social  sci¬ 
ence  traditions  and  research  practices  distinct  from 
those  present  in  IR  debates  in  the  United  States. 
Here,  they  are  increasingly  dominated  by  rational 
choice  and  formal  modeling.  Since  these  are  not 
easily  exportable,  dominant  U.S.  theory  prefer¬ 
ences  entail  a  “de- Americanization  of  IR  else¬ 
where”  (Wasver,  1998).  Another  way  to  put  this  is 
to  point  to  an  ever-present,  often  largely  ignored 
and  now  rediscovered,  cultural  specificity  (e.g.,  in 
French  IR);  an  intra-Anglo-Saxon  divide  between 
“post-imperial  ‘crimson’  locales,”  on  the  one  hand 
(made  up  essentially  of  Britain,  Canada,  Ireland, 
Australia,  and  New  Zealand),  and  the  United 
States,  on  the  other;  and,  more  generally,  the  new¬ 
found  “autonomy”  in  Europe  of  a  specific 
“Continental”  brand  of  Western  IR  or  at  least  a 
claim  of  an  “intrafamilial  emancipation”  (e.g.,  in 
Germany).  To  the  extent  that  this  type  of  work 
gains  international  recognition,  this  is  seldom  due 
to  the  size  of  the  respective  IR  community,  the 
respective  country’s  global  political  significance,  or 
other  such  factors  pointed  to  by  science  studies. 
Rather,  genuine  intellectual  appeal  and/or  reso¬ 
nance  appears  to  be  more  closely  associated  with 
quantitative  as  well  as  qualitative  output  resulting 
from  comparative  advantages  in  terms  of  sheer 
resources  available  and  devoted  to  the  task  (as  in 
the  field  of  European  studies)  and  the  fact  that 
some  of  the  original  theory  products  travel  fairly 
easily  to  other  regions  (as  in  the  case  of  securitiza¬ 
tion  theory).  Moreover,  in  the  European  context, 
in  particular,  the  international  visibility  of 
European  studies  and  distinct  approaches  such  as 
the  Copenhagen  School  has  been  enhanced  by  the 
establishment  of  several  new  journals  (many  of 
which  are  published  in  English).  Some  of  these, 
such  as  the  European  Journal  of  International 
Relations ,  have  quickly  established  themselves 
among  the  leading  journals  in  the  field  globally. 
These  successes  and  differences  between  the  United 
States  and  other  Western  IR  communities  notwith¬ 
standing,  it  needs  to  be  emphasized  that  the  non- 
U.S.  West  offers  a  broad  variety  of  intellectual 
profiles,  not  all  of  which  are  as  much  interested 
in  connecting  via  English-language  publications 


internationally  as  are,  for  instance,  IR  scholars  in 
the  English-speaking  world,  Scandinavia,  the 
Netherlands,  or  Germany.  As  a  matter  of  fact,  siz¬ 
able  communities  with  original  theoretical  work, 
such  as  the  French,  are  quite  detached  from  the 
rest  of  the  West  and  have  fairly  little  impact  glob¬ 
ally.  Thus,  geographical  location  is  not  necessarily 
a  good  indicator  of  whether  an  IR  community  may 
achieve  international  visibility. 

In  any  case,  from  a  power  perspective,  few  non- 
Western  IR  communities  can  be  counted  to  the 
semiperiphery  of  the  second  tier.  Even  Japan,  with 
its  sizable  IR  community  and  conscious  institu¬ 
tional  efforts  at  increasing  its  visibility  globally — 
recently  by  establishing  the  peer-reviewed  English- 
language  journal  International  Relations  of  the 
Asia-Pacific — neither  has  much  of  an  impact  glob¬ 
ally  nor  has  succeeded  in  establishing  a  distinct 
Japanese  IR  profile,  according  to  Takashi  Inoguchi. 
China  currently  seems  to  be  the  only  serious  can¬ 
didate  for  such  a  rise  to  the  semiperiphery,  or  to 
put  it  less  pejoratively,  only  China  appears  able  in 
the  medium  to  long  term  to  engage  in  competition 
with  the  West  for  productive  power  in  actually 
“constructing  the  world”  in  a  language  and  in 
theories  that  are  distinctly  Chinese.  Yet  even  sym¬ 
pathetic  observers  remain  somewhat  skeptical 
about  whether  a  distinct  and  globally  visible 
Chinese  school  of  IR  might  indeed  be  established. 

The  Third  Tier:  Sizable  and  (Self-)Marginalized 

The  third  group  of  IR  communities  is  both  the 
largest  (in  terms  of  the  number  of  national  IR  com¬ 
munities  belonging  to  it)  and  the  most  isolated  and 
marginalized  (in  terms  of  international  visibility 
and  power).  As  a  matter  of  fact,  one  of  the  expres¬ 
sions  of  its  marginalization  is  that  much  less  “is 
known”  about  these  IR  communities  compared 
with  the  two  other  groups.  In  part,  this  is  because 
scholars  from  the  other  two  groups,  who  are  usu¬ 
ally  instrumental  in  producing  such  international 
visibility  due  to  disciplinary  power  structures,  nor¬ 
mally  pay  little  attention  to  the  research  conducted 
there.  Indeed,  one  could  easily  turn  the  complaint 
(sometimes  heard  in  the  non-U.S.  West)  that 
American  IR  treats  research  originating  in  Europe, 
Canada,  or  Australasia  with  indifference  against 
the  plaintiffs  themselves  since  the  same  pattern  of 
ignorance  may  be  observed  in  their  relationship  to 
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most  IR  communities  in  Asia,  Africa,  and  Latin 
America  (many  of  which  have,  of  course,  suffered 
through  European  colonization).  In  part,  however, 
marginalization  also  results  from  the  self-conscious 
separation  or  self-reliance  of  IR  communities  in 
Asia,  Africa,  and  Latin  America  or  from  these 
countries  simply  refraining  from  engaging  in  IR 
research.  Some  of  the  IR  communities  in  this  third 
group  (such  as  the  Indian  or  Brazilian)  are  fairly 
sizable,  though  not  necessarily  tightly  organized 
professionally,  as  their  counterparts  are  in  the 
United  States  or  Europe.  Many  others  are  tiny, 
with  no  professional  organization  whatsoever.  In 
quite  a  few  third-tier  countries,  IR  as  an  academic 
(sub)discipline  is  not  even  present  at  universities 
or — to  the  extent  that  it  actually  is  present — is 
largely  characterized  by  the  dominance  of  research 
questions  that  reflect  the  respective  countries’  for¬ 
eign  policy  agenda.  As  a  matter  of  fact,  there  is 
often  a  very  close  connection  between  the  foreign 
policy  establishments  and  official  state  institutions 
in  these  countries,  on  the  one  hand,  and  IR  depart¬ 
ments,  on  the  other,  since  the  latter  often  exist 
primarily  because  they  are  expected  to  supply 
future  diplomats  for  the  respective  foreign  services. 
Theory-oriented  research  as  practiced  and  cher¬ 
ished  in  the  West  does  not  play  much  of  a  role 
here.  Theory — conceived  of  in  very  general  terms 
as  a  necessary  precondition  for  reality-constituting 
observation — is  ever  present.  Yet  it  transpires 
largely  implicitly  in  empirical  analyses.  The  prime 
research  objective  is  seldom  theorizing  as  such. 
Moreover,  against  the  foil  of  Western  theory  dis¬ 
courses,  this  implicit  form  of  theory  application 
carries  more  realist  than  nonrealist  themes.  A 
similar  observation  can  be  made  with  regard  to  the 
role  of  methods.  Where  the  role  models  of  Western 
IR  scholarship  call  for  methodological  reflection  at 
a  minimum  (and  often  excel  in  offering  highly 
sophisticated  methods  that  are  understood  and 
applied  by  very  small  communities  of  scholars), 
the  requirements  for  methodological  rigor  and 
metareflection  in  third-tier  countries  are  much  less 
stringent.  Typically,  academic  publications  display 
a  combination  of  some  form  of  institutional  and/or 
historical  analysis  without  engaging  in  a  justifica¬ 
tory  argument  as  to  why  a  particular  method  or 
form  of  presentation  has  been  chosen  or  rejected. 

From  a  structural  perspective,  there  is  fairly  lit¬ 
tle  intellectual  exchange  both  among  third-tier 


communities  and  between  them  and  IR  communi¬ 
ties  from  the  first  and  second  tiers.  Moreover, 
some  of  the  interaction  that  can  be  observed  is 
stimulated  by  foundations  located  in  the  West  that 
(often  unintentionally)  tend  to  reproduce  existing 
uneven  global  structures  of  knowledge  production. 
For  instance,  as  the  volume  by  Tickner  and  Wasver 
(2009)  has  shown,  the  funding  provided  by 
Western  foundations  (such  as  Ford)  in  countries 
such  as  India,  East-Central  Europe,  Latin  America, 
or  South  Africa  has  often  been  very  influential  in 
developing  local  IR  communities.  One  key  reason 
was  simply  that  state  funding  was  limited.  Yet  since 
the  type  of  research  that  was  funded  primarily 
addressed  questions  of  more  immediate  local  policy 
relevance,  the  overall  effect  was  to  reinforce  the 
global  intellectual  division  of  labor  whereby  theory 
is  “produced  in  the  center  and  consumed  and 
applied  in  and  by  the  periphery”  (p.  332).  These 
effects  are  particularly  surprising  for  countries 
(e.g.,  India)  that  have  both  a  comparatively  old  and 
large  IR  community  and  a  philosophical  tradition 
of  their  own.  As  a  matter  of  fact,  the  Indian  politi¬ 
cal  philosopher  Chanakya  (ca.  350-283  BCE), 
who  is  usually  know  in  the  West  by  the  name 
Kautilya  and  who  is  sometimes  claimed  among 
Western  scholars  as  one  of  the  founders  of  realism, 
is  apparently  not  even  taught  in  any  principal  IR 
theory  course  in  India  itself,  according  to  Navnita 
Behera  (2007).  Similar  patterns  of  forgetfulness 
and/or  neglect  of  homegrown  traditions  can  be 
observed  in  Japan  and  recently  also  in  China.  From 
a  postcolonial  point  of  view,  this  devaluation  of 
homegrown  traditions  is  just  one  (often  unre¬ 
flected)  expression  of  a  “colonized”  mind-set  that 
stems  from  a  discipline  of  IR  that  is  at  its  very  core 
an  “expression  of  the  Western  theory  of  progress” 
(David  Blaney  &  Naeem  Inayatullah,  2008,  p. 
672).  This  postcolonial  message  starts  to  resonate 
even  among  scholars  from  third-tier  countries  who 
have  been  socialized  into  Western  IR  thinking. 

The  Social  Structure  of  the  Discipline 

The  overall  picture  that  emerges  by  looking  at 
the  social  structure  of  IR  from  a  global  perspective 
is  much  more  one  of  intellectual  segmentation  and 
stratification  than  one  of  intellectual  integration, 
which  one  might  think  the  subject  matter  itself  to 
be  suggesting.  This  impression  of  a  three-tiered 
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system  is  reinforced  if  one  examines  another 
dimension  of  the  social  structure  of  the  disci¬ 
pline — that  is,  its  publication  system  in  general  and 
its  hierarchy  of  journals  in  particular.  If  access 
limitations  (as  measured  in  terms  of  journal  accep¬ 
tance  rates)  are  accepted  as  a  measure  of  reputa¬ 
tion,  clearly  the  most  competitive  journals  of  the 
discipline  are  published  in  the  United  States  and 
Europe.  In  the  United  States  and  several  European 
countries,  getting  published  in  these  top  journals  is 
of  central  importance  to  climb  the  academic  career 
ladder.  Most  of  the  highest  ranking  journals  are 
still  published  in  the  United  States  and/or  con¬ 
trolled  by  American  IR  scholars.  As  a  matter  of 
fact,  a  recent  survey  in  the  IR  communities  of  10 
English-speaking  countries  found  that  at  least  four 
tiers  can  be  distinguished  when  IR  scholars  from 
these  countries  are  called  on  to  list  those  journals 
“that  publish  articles  with  the  greatest  influence 
on  the  way  IR  scholars  think  about  international 
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relations”  (Jordan  etal.,  2009,  p.  49):  (1 )  International 
Organization  is  in  a  league  of  its  own  and  is  men¬ 
tioned  by  73%  of  the  scholars  questioned;  (2)  the 
second  group  is  made  up  of  International  Security 
(45%)  and  International  Studies  Quarterly  (44%); 
(3)  American  Political  Science  Review ,  World 
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International  Relations  and  Review  of  International 
Studies  are  the  only  non-American  journals  men¬ 
tioned  alongside  Journal  of  Conflict  Resolution 
with  a  mere  14%  (for  European  Journal  of  Inter¬ 
national  Relations  in  particular,  the  ranking  is 
much  better  if  one  follows  the  Social  Science 
Citation  Index).  Thus,  interdependence  structures 
are  still  quite  asymmetrical  within  the  West,  with 
U.S.  journals  clearly  outdistancing  the  top 
European  journals. 
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journals  is  not  as  central  for  recruitment  practices 
and  academic  success.  At  the  same  time,  the  “inter¬ 
national”  profile  of  non-Western  IR  journals  is  much 
more  pronounced  than  in  the  West,  as  a  survey  of 
selected  non-Western  journals  shows,  in  contrast  to 
a  similar  survey  conducted  for  Western  journals.  For 
instance,  more  than  80%  of  the  articles  published  in 
International  Organization  or  International  Studies 
Quarterly  in  2000  and  2005  have  been  published  by 
authors  located  in  the  United  States  or  Canada  (see 
Figure  2). 

For  the  Turkish  IR  journal  Alternatives ,  the 
South  African  Journal  of  International  Relations, 
and  the  journal  International  Relations  of  the  Asia- 
Pacific,  the  number  of  national  authors  has  often 
been  lower  than  the  number  of  international 
authors,  even  decreasing  between  2000  and  2005 
(see  Figure  3).  Also,  the  number  of  non-Western 
authors  (including  national  authors)  being  pub¬ 
lished  in  these  journals  has  been  increasing  in  most 
of  these  journals  recently.  This  is  in  stark  contrast  to 
Western  IR  journals,  which  largely  remain  outlets 
for  scholarship  from  Western  IR  scholars  in  general 
and  scholars  from  the  United  States  in  particular.  In 
other  words,  whereas  the  West  remains  fairly  closed 


off  from  the  rest  of  the  world,  there  is  much  more 
of  a  balance  between  national  and  international 
authorship  in  non-Western  IR  journals. 

The  Intellectual  Structure  of  the  Discipline 

Another  characteristic  of  the  publication  system 
of  Western  IR  is  the  distinct  profile  of  mixing 
theory  and  empirical  analysis.  All  of  the  IR  jour¬ 
nals  ranked  among  the  top  20  of  all  political  sci¬ 
ence  journals  in  the  Social  Science  Citation  Index 
distinguish  themselves  as  theory-oriented  journals 
in  this  sense.  As  Kj ell  Goldmann  (1996)  pointed 
out  in  a  comparison  of  Western  IR  journals  from 
the  early  1970s  and  early  1990s,  as  far  as  “meth¬ 
odological  approaches”  are  concerned,  ever  more 
articles  published  in  the  1990s  combined  some 
form  of  theorizing  with  empirical  observation.  If 
anything,  this  trend  has  been  reinforced  during  the 
past  decade.  Again,  the  contrast  with  non-Western 
IR  journals  is  noticeable:  With  the  possible  excep¬ 
tion  of  the  Japanese  journal  International  Relations 
of  the  Asia-Pacific,  all  of  these  journals  primarily 
publish  articles  that  eschew  explicit  theoretical 
discussion  (see  Figure  4). 
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Paradigmatism  and  Great  Debates 

One  of  the  oldest  features  of  the  intellectual 
structure  of  the  discipline  is  paradigmatism.  In  IR, 
it  has  come  to  be  understood  as  a  disciplinary  pre¬ 
occupation  with  and  segregation  into  separate 
“metascientific  constructs”  with  distinct  ontolo¬ 
gies,  epistemologies,  and  methodologies  (Yosef 
Lapid,  1989).  Although  the  allusion  to  Thomas 
Kuhn’s  paradigmatic  view  of  the  evolutionary 
development  of  scientific  disciplines  only  spread  in 
IR  in  the  1970s,  the  phenomenon  as  such  has  been 
a  core  feature  of  the  discipline’s  intellectual  struc¬ 
ture  (at  least  in  its  Western  segment)  since  the  mid- 
20th  century.  Realism,  idealism,  rationalism,  and 
constructivism  are  usually  mentioned  as  examples 
of  such  paradigms.  Surpassed  only  by  questions  of 
epistemology  and  methodology,  such  “paradig¬ 
matic”  differentiation  continues  to  generate  the 
most  division  among  Western  IR  scholars  (Jordan 
et  ah,  2009).  What  is  more,  IR  scholars  in  10 
English-speaking  countries,  surveyed  by  Jordan 
and  colleagues  (2009),  estimate  that  almost  90% 
of  all  IR  literature  is  devoted  to  some  form  of 
paradigmatist  analysis.  Yet  although  one  out  of 
four  considers  his  or  her  own  work  to  be  falling 


outside  any  paradigmatist  frame,  more  than  three 
quarters  of  the  time  is  devoted  to  IR  paradigms  in 
IR  introductory  courses  taught  by  these  scholars. 
In  other  words,  the  intellectual  structure  of  disci¬ 
pline  continues  to  be  reproduced  along  paradigma¬ 
tist  lines  even  though  many  IR  scholars  do  not 
believe  that  such  a  focus  is  particularly  useful  in 
their  own  research. 

This  gap  in  (Western)  IR  between  the  promi¬ 
nence  of  paradigmatism  in  teaching  and  that  in 
individual  research  reflects  on  what  sociologists 
call  “task  uncertainty” — that  is,  the  extent  to 
which  scholars  in  a  discipline  agree  on  what  rules 
are  to  be  followed  and  what  work  techniques  are 
acceptable  in  producing  knowledge  (Waever,  2007). 
In  IR,  the  level  of  task  uncertainty  is  quite  high 
since  there  is  fairly  little  agreement  as  to  what  the 
overarching  disciplinary  questions  are  or  how  one 
should  go  about  tackling  them.  The  TRIP  survey 
found  that  two  thirds  of  IR  scholars  questioned 
believe  that  methods  and  epistemology  generate  the 
most  division  in  the  discipline.  Yet  despite  this 
diversity,  there  has  at  least  been  a  widespread  (if 
sometimes  only  implicit)  understanding  that  para¬ 
digms  as  such  are  a  key  instrument  for  organizing 
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the  discipline,  especially  as  far  as  IR’s  recurring 
great  debates  are  concerned.  This  not  only  shows 
in  how  the  discipline’s  history  is  usually  told  or 
how  IR  is  taught  but  is  also  evident  in  textbooks 
and  handbooks — that  is,  in  works  that  are  sup¬ 
posed  to  introduce  novices  to  the  field  or  that 
provide  summaries  and  syntheses  of  what  is  nor¬ 
mally  claimed  to  be  disciplinary  knowledge.  Two 
recent  examples  are  illustrative.  As  the  subtitle  of 
International  Relations  Theory:  Discipline  and 
Diversity  (Tim  Dunne,  Milja  Kurki,  &  Steve 
Smith,  2007)  indicates,  the  editors  are  keen  to 
emphasize  both  the  necessity  of  a  certain  disciplin¬ 
ary  coherence  and  its  diversity.  Yet  the  organiza¬ 
tion  of  the  book  around  nine  “distinct  theories  of 
International  relations — realism/structural  realism, 
liberalism/neoliberalism,  the  English  School, 
constructivism,  Marxism  and  critical  theory,  femi¬ 
nism,  poststructuralism,  green  theory,  postcolo¬ 
nialism”  (p.  3)  shows  that  diversity  is  expressed 
not  in  the  form  of  theoretical  controversies  over 
core  substantive  questions  of  international  rela¬ 
tions  (e.g.,  what  causes  war,  what  causes  poverty) 
or  methods  but  through  paradigms.  The  same 
understanding  is  reflected  in  a  2008  handbook  by 
Christian  Reus-Smit  and  Duncan  Snidal,  where  the 
presentation  of  a  somewhat  different  set  of  nine 
“major  theoretical  perspectives”  takes  up  almost 
half  the  space  of  the  volume  (the  theories  discussed 
here  include  (1)  realism,  (2)  Marxism,  (3)  neolib¬ 
eral  institutionalism,  (4)  the  new  liberalism,  (5)  the 
English  School,  (6)  constructivism,  (7)  critical  the¬ 
ory,  (8)  postmodernism,  and  (9)  feminism).  In 
other  words,  whereas  IR  scholars  may  disagree 
sharply  as  to  the  particular  value  of  different  para¬ 
digms,  there  at  least  seems  to  be  widespread  agree¬ 
ment  that  it  is  useful  to  conceive  of  the  discipline 
in  terms  of  paradigmatic  differentiation  and  great 
debates. 

However,  from  a  longer  historical  perspective, 
the  period  during  which  paradigmatism  and  great 
debates  were  widely  acknowledged  as  dominant 
features  of  the  discipline’s  intellectual  structure 
may  come  to  an  end.  Recent  self-reflective  observa¬ 
tions  of  the  historiography  of  the  discipline  have 
convincingly  shown  that  even  in  Western  IR,  the 
focus  on  great  debates  may  have  been  as  much  a 
reflection  of  the  perceived  need  of  a  novel  academic 
field  to  identify  a  disciplinary  core  as  a  reflection  of 
a  common  tendency  in  the  social  sciences  to  delimit 


the  number  of  basic  rival  positions  to  a  low  number 
of  macrolevel  theories.  As  a  matter  of  fact,  the 
invocation  of  great  debates  as  a  typical  character¬ 
ization  of  overarching  disciplinary  divides  did  not 
begin  until  Hans  Morgenthau  introduced  the  term 
in  the  early  1950s  to  depict  what  soon  came  to  be 
known  as  the  debate  between  realism  and  idealism. 
Moreover,  it  reached  its  pinnacle  with  the  announce¬ 
ment  of  a  second  great  debate  in  the  1960s  pitting 
traditionalists  against  behavioralists  (or  scientists) 
in  a  clash  over  what  methods  IR  scholars  ought  to 
use  in  studying  international  phenomena.  It  was  in 
this  context  that  Kuhn’s  concept  of  a  paradigm  was 
first  combined  with  the  focus  on  great  debates, 
most  explicitly  (if  somewhat  misleadingly)  in  an 
article  by  Arendt  Lijphardt  (1974)  in  which  he 
identified  the  second  great  debate  as  “a  dichoto¬ 
mous  one  between  two  opposing  paradigms” 
(p.  18).  Yet  the  third  debate  marked  the  end  of 
agreement  over  how  to  describe  what  it  was  sup¬ 
posedly  all  about.  Alternatively,  it  was  framed  as  a 
debate  between  realism  and  globalism;  between 
realism,  pluralism,  and  structuralism;  between 
positivists  and  post-positivists;  or  between  “a 
broad  body  of  interdisciplinary  literature  com¬ 
monly  (and  often  indiscriminately)  labeled  ‘critical 
theory,’  ‘post  positivism,’  ‘discourse  analysis,’ 
or  ‘post-structuralism,’”  on  the  one  hand,  and 
“the  intellectual  imperialism  of  the  modern,  post- 
Cartesian  ‘scientific’  approach  to  knowledge  and 
society,”  on  the  other  (Jim  George,  1989,  p.  270). 
Others  suggested  that  the  latter  two  descriptions 
marked  the  fourth  debate,  which  in  itself  could  be 
subdivided  into  two  subdebates  between  reflectiv- 
ists  and  rationalists,  on  the  one  hand,  and  between 
neorealists  and  neoliberal  institutionalists,  on  the 
other.  Thus,  not  only  the  intervals  between  debates 
have  become  longer  since  the  first  debate  was 
invented,  but  after  the  second  debate,  there  has  also 
been  ever  more  contention  as  to  whether  a  third 
and/or  fourth  great  debate  actually  took  place  and, 
if  so,  what  it  was  all  about.  In  any  case,  no  fifth 
debate  currently  appears  to  be  forthcoming. 

Even  if  great  debates  were  indeed  a  thing  of  the 
past,  paradigmatism  appears  to  be  more  resilient. 
In  Andrew  Abbott’s  reading  of  “generational  para¬ 
digms,”  the  emphasis  on  the  economizing  strate¬ 
gies  of  informational  overload  suggests  a  certain 
disciplinary  immaturity  in  coming  up  with  more 
sophisticated  and  adequate  coping  strategies.  More 
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critical  readings — such  as  Niklas  Luhmann’s 
related  lamentation  about  “multiple  paradigmata- 
sis”  in  sociology — in  contrast  highlight  the 
unforced  ignorance  vis-a-vis  large  segments  of 
knowledge  that  necessarily  accompanies  paradig¬ 
matic  self-restriction.  This  latter  perspective  has 
recently  won  more  recognition.  Two  prominent 
representatives  of  rationalism  and  constructivism, 
for  instance,  have  joined  voices  in  rejecting  the 
implicit  offer  to  conduct  another  “battle  of  ana¬ 
lytical  paradigms,”  since  any  such  battle  would  “at 
the  very  least .  .  .  encourage  scholars  to  be  method- 
driven  rather  than  problem-driven  in  their 
research”  (James  Fearon  &  Alexander  Wendt, 
2002,  p.  52).  Rather  than  looking  at  the  relation¬ 
ship  between  rationalism  and  constructivism  in 
terms  of  a  “debate,”  they  pleaded  for  looking  at  it 
in  terms  of  a  “conversation”  between  two 
approaches  that,  “when  understood  pragmati¬ 
cally,  [arej  largely  either  complementary  or  over¬ 
lapping”  (Fearon  &  Wendt,  2002,  p.  68).  Similarly, 
the  recent  Oxford  Handbook  of  International 
Relations  prominently  positioned  the  chapter 
“Eclectic  Theorizing  in  the  Study  and  Practice  of 
International  Relations”  at  the  very  beginning  of 
the  section  that  presented  the  nine  “major  theo¬ 
retical  perspectives”  referred  to  above  (Peter 
Katzenstein  &  Rudra  Sil,  2008).  In  it,  the  authors 
advocate  analytic  eclecticism,  an  approach  to 
research  in  IR  that  replaces  paradigm-driven 
research  with  a  strategy  drawing  widely  on  seem¬ 
ingly  divergent  research  traditions  built  on  distinct 
concepts,  methods,  analytics,  and  empirics. 

Thus,  although  the  intellectual  structure  of  the 
discipline  continues  to  be  shaped  by  distinctions 
drawn  in  terms  of  paradigms  (or  the  equivalent 
major  theoretical  perspectives,  etc.),  paradigma- 
tism  as  such  has  been  losing  some  of  its  grip.  To  the 
extent  that  there  is  an  inherent  tension  between 
method-driven  approaches  or  paradigmatism,  on 
the  one  hand,  and  a  focus  on  a  problem-driven 
approach,  on  the  other,  the  shift  toward  the  latter 
can  be  seen  as  a  sign  of  maturation.  To  be  sure,  a 
loss  of  disciplinary  coherence  may  loom  as  a  down¬ 
side  if  IR  loses  its  traditional  paradigmatic  sign¬ 
posts,  according  to  Wasver  (2007).  Yet  the  same 
development  can  also  be  interpreted  as  an  advance 
toward  a  more  self-confident  academic  profession 
that  need  no  longer  engage  in  stylized  battles  to 
distinguish  itself  from  adjacent  (and  presumably 


more  reputable)  disciplines  such  as  history,  law,  eco¬ 
nomics,  or  sociology.  Moreover,  such  a  move  can 
also  be  justified  epistemologically  since  the  earlier 
rationalization  for  paradigmatic  separation — the 
usual  reference  to  the  idea  of  “incommensurability” 
in  Kuhn’s  theoretical  vocabulary — has  hence  been 
problematized  in  the  philosophy  of  science.  For  a 
long  time,  Kuhn’s  term  of  incommensurability 
was  understood  to  mean  that  the  theoretical 
vocabularies  of  separate  paradigms  were  not 
intertranslatable.  Yet  as  Richard  Rorty  (1991), 
among  others,  has  argued,  “untranslatability 
does  not  entail  unlearnability”  and  “learnability 
is  all  that  is  required  to  make  discussability 
possible”  (p.  48).  Paradigmatic  separation,  there¬ 
fore,  is  a  disciplinary  convention,  not  an  episte¬ 
mological  necessity.  Recently,  this  approach  to 
“interparadigm”  debates  has  been  spreading  in 
IR  at  least  subliminally,  if  not  explicitly,  accord¬ 
ing  to  Colin  Wight.  Irrespective  of  whether  one 
welcomes  or  criticizes  such  a  development,  it  is 
yet  another  sign  of  a  much  broader  development 
in  the  discipline:  its  expansion  along  many 
dimensions. 

The  Expansion  of  International  Relations 

Thus  far,  the  discipline  has  been  examined  largely 
in  terms  of  its  social  and  intellectual  structures. 
This  inevitably  entails  a  rather  static  view  that 
does  not  sufficiently  acknowledge  the  tremendous 
dynamism  of  IR.  Yet  the  dynamic  development  of 
the  field  is  perhaps  the  most  striking  feature  of  the 
discipline.  Ever  since  the  early  days  of  modern  IR 
in  the  early  20th  century,  the  discipline  has  been 
expanding.  Although  this  intellectual  expansion 
may  resemble  earlier  colonial  practices  of  the  West 
in  some  respects  (and  may  therefore  also  be 
described  in  diverse  vocabularies),  the  phenome¬ 
non  as  such  appears  to  be  largely  uncontroversial. 
Four  dimensions  of  intellectual  expansion  can  be 
distinguished: 

1.  territorial  expansion  (or  spread)  from  a  largely 
Western  core  to  other  countries; 

2.  disciplinary  expansion  within  political  science  as 
measured  in  terms  of  chairs  designated  with 
denominations  that  are  normally  considered  to 
be  IR; 
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3.  substantive  expansion  as  measured  in  research 
problems  being  taken  up  by  scholars  identifying 
themselves  as  doing  IR  and  in  interaction  with 
the  latter;  and 

4.  theoretical  and  methodological  expansion. 

Of  course,  none  of  these  expanding  moves  ought 
to  be  imagined  as  linear  or  unidirectional.  When 
knowledge  travels,  it  always  intermingles  and, 
thereby,  transforms.  John  Agnew  notes  refers  to 
the  image  of  a  marketplace  where  Western  IR  is 
“exported”  to  non-Western  regions  and  countries 
or  where  IR  “imports”  from  other  disciplines  such 
as  philosophy  or  where  economics  misconstrues 
knowledge  as  a  commodity  changing  hands  with¬ 
out  being  affected  by  the  very  transaction.  If  we 
take  this  transformational  dimension  into  account, 
however,  the  metaphor  of  expansion  quite  cogently 
describes  a  phenomenon  that  is  as  familiar  as  a 
subject  matter  to  the  IR  scholar  as  it  is  ubiquitous 
if  one  looks  at  the  dynamic  development  of  the 
discipline. 

Given  the  limits  of  space  and  the  earlier  discus¬ 
sion  of  the  global  structure  of  IR,  the  territorial 
and  disciplinary  expansion  can  be  kept  short.  As 
mentioned  earlier,  territorial  expansion  was  for  a 
long  time  characterized  (and  in  many  ways  still  is 
characterized)  by  the  discipline’s  failure  to  engage 
with  the  non-Western  world.  To  the  extent  that 
such  engagement  did  take  place,  it  often  followed 
general  patterns  of  colonial  interaction.  Western 
IR  presented  its  way  of  practicing  the  craft  as 
exemplary,  while  scholars  in  non-Western  regions 
would  either  emulate  Western  IR  practices 
(thereby,  perhaps,  gaining  some  recognition  from 
abroad)  or  keep  to  whatever  local  forms  of  schol¬ 
arship  were  deemed  suitable  to  study  things 
“international”  (and  remain  largely  marginal¬ 
ized).  In  the  late  20th  century  and  early  21st  cen¬ 
tury,  there  has  been  quite  a  bit  of  change  with 
respect  to  the  quantitative  enlargement  of  IR  com¬ 
munities  outside  the  West  and  a  much  more  self- 
conscious  redefinition  of  what  it  may  mean  to 
conduct  IR  in  places  such  as  China,  India,  Kenya, 
or  Mexico.  This  expansion  has  been  aided 
by  global  shifts  in  power  as  well  as  theoretical 
innovation  (e.g.,  postcolonialism).  The  World 
International  Studies  Committee  (WISC),  an  orga¬ 
nization  of  national  International  Studies  associa¬ 
tions  that  has  been  active  since  the  turn  of  the 


century,  has  certainly  helped  as  well.  Measured  in 
terms  of  chairs  in  IR,  the  discipline  even  seems  to 
be  expanding  much  more  rapidly  in  non-Western 
regions,  Latin  America,  and  Asia  in  particular  (see 
the  2009  special  issue  of  International  Relations  of 
the  Asia-Pacific).  Nevertheless,  even  though  almost 
no  comparative  data  are  available  on  the  number 
of  chairs  in  IR  around  the  globe  or  relative  to  other 
disciplines  in  political  science,  Hans-Dieter 
Klingemann  notes  that  IR  appears  to  be  growing 
numerically  even  in  Europe  and  North  America,  if 
only  at  the  expense  of  other  subdisciplines. 

The  Expansion  of  the  IR  Research  Agenda 

By  most  accounts  (Western)  IR  is  essentially  an 
invention  of  the  20th  century  (for  a  contrasting 
perspective,  see  William  Olson  &  John  Groom, 
1991).  Even  if  some  of  its  origins  may  have  been 
romanticized  in  one  way  or  another,  it  is  widely 
accepted  as  common  knowledge  among  IR  schol¬ 
ars  that  the  primary  subjects  of  study  in  the  first 
half  of  the  20th  century  were  almost  exclusively 
related  to  governmental  activities  crossing  national 
boundaries.  Against  the  background  of  the  two 
World  Wars,  the  question  of  war  and  peace 
loomed  large.  Accordingly,  early  (Western)  IR 
framed  its  research  agenda  mainly  in  the  vocabu¬ 
lary  of  international  law  and  diplomatic  history. 
The  methods  and  approaches  associated  with  these 
disciplines  were  thought  to  offer  the  best  hope  for 
“the  problem  of  international  governmental  reor¬ 
ganization  and  practice”  (Pitman  Potter,  1923, 
p.  391),  without  necessarily  succumbing  to  the  sort 
of  “idealist  internationalism”  for  which  the  disci¬ 
pline  was  later  chastised  by  realists  in  particular 
(Olson  &  Groom,  1991).  In  the  middle  of  the  20th 
century,  the  agenda  was  broadened  to  include 
international  economics  and  all  those  aspects  of 
international  relations  that  could  be  “described  in 
terms  of  decision-making  by  identifiable  individu¬ 
als  or  groups  of  individuals”  (Frederick  Dunn, 
1948,  p.  145).  More  important,  ever  more  scholars 
seemed  ready  to  subscribe  to  the  view  that  interna¬ 
tional  politics  rather  than  international  organiza¬ 
tion  constituted  the  core  of  a  slowly  maturing 
discipline.  While  the  latter  was  said  to  have  been 
approached  predominantly  with  a  constitutional 
frame,  the  image  of  an  international  system  made 
up  of  states  that  were  interacting  “almost  like 
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Leibnizian  monads”  (Morton  Kaplan,  1961)  and 
the  accompanying  clash  of  national  interests 
and  power  were  considered  to  be  more  properly 
dealt  with  in  a  political  framework  (William  Fox, 
1949).  The  first  great  debate  was  an  expression  of 
this  shift. 

The  war  experience  (which  had  involved  all 
those  countries  crucial  for  the  discipline’s  develop¬ 
ment  in  the  1950s  and  1960s)  and  the  spreading 
realization  that  the  advent  of  the  nuclear  age  would 
not  only  revolutionize  warfare  but  also  affect  the 
very  survival  of  humankind  pushed  the  expanding 
research  agenda  of  IR  scholarship  into  the  field  of 
strategic  studies  with  its  focus  on  state  practices 
such  as  deterrence  thinking  and  arms  control  nego¬ 
tiations.  While  essentially  remaining  in  the  classical 
IR  domain  of  state-based  international  politics,  the 
novel  process  of  European  integration  at  least 
offered  a  paradigmatic  alternative  to  the  traditional 
focus  on  great-power  competition.  It  not  only 
inspired  a  series  of  similar  political  projects  in  other 
parts  of  the  world  but  also  helped  stimulate  a  new 
and  vibrant  field  of  study  focused  on  comparative 
regional  integration.  In  addition,  the  process  of 
decolonization  laid  the  foundation  for  expanding 
both  the  territorial  reach  of  IR  research  beyond  the 
confines  of  the  Western  world  as  well  as  the  disci¬ 
plinary  focus  on  politics  by  intensifying  the  link 
with  economics.  Paradigmatically,  though,  the  two 
major  alternatives  of  the  emerging  development 
studies,  modernization  theory  and  dependency 
theory,  continued  to  draw  almost  exclusively  on 
the  Western  tradition.  (Neo-)Marxist-inspired 
analyses  of  capitalism’s  contribution  to  the  “under¬ 
development”  of  non-Western  regions  by  scholars 
such  as  Andre  Gunder  Frank,  a  University  of 
Chicago-trained  economist,  helped  pave  the  way 
for  international  political  economy  to  fully  estab¬ 
lish  itself  as  one  of  the  major  subfields  of  IR, 
starting  in  the  1970s.  On  a  parallel  track,  the 
study  of  foreign  policy,  which  had  been  one  of 
the  major  sections  in  any  North  American  IR/ 
political  science  curriculum,  continued  to  thrive 
as  an  ever  more  “scientific,”  increasingly  separate 
and  differentiating  field.  Foreign  policy  analysis 
(or  FPA)  was  the  prime  subfield  of  IR,  expanding 
into  those  neighboring  disciplines  (such  as  psy¬ 
chology  and  sociology)  that  were  deemed  useful 
in  coming  up  with  theoretical  and  methodological 
tools  for  making  sense  of  group  decision-making 


processes  under  routine  bureaucratic  or  crisis 
situations. 

With  the  advent  of  the  East-West  detente  and 
the  1973  oil  crisis,  the  disciplinary  horizon  broad¬ 
ened  further.  Although  the  classical  focus  on  “high- 
politics”  security  issues  kept  its  prominent  place, 
other  issues  gained  in  importance  or  were  added 
anew  to  the  IR  research  agenda.  First,  the  introduc¬ 
tion  of  the  concept  of  “transnational  politics”  at 
the  end  of  the  1960s  contained  an  unveiled  critique 
of  the  state  centrism  of  classical  IR.  It  also  fore¬ 
shadowed  the  broadening  of  the  more  narrowly 
circumscribed  foreign  policy  perspective  during  the 
1970s  and  1980s  by  also  looking  at  nonstate  actors 
and  their  activities  and  interactions  at  the  systemic 
level.  Second,  the  Club  of  Rome’s  “The  Limits  to 
Growth”  report  of  1972,  the  first  United  Nations 
Conference  on  the  Human  Environment  in  the 
same  year,  and  the  oil  crisis  in  the  following  year 
set  the  stage  for  environmental  issues  to  be  added 
to  the  IR  research  agenda.  Third,  transnational 
relations  and  environmental  politics  both  fitted  in 
neatly  with  perhaps  the  most  prominent  new  theme 
in  IR  since  the  1970s:  the  spreading  interest  in  the 
phenomenon  of  interdependence  and  globalization. 
This  image  of  an  increasingly  globalizing  world 
that  affected  every  corner  of  the  globe  and  reached 
across  all  issue  areas  in  turn  helped  push  a  final 
expansion  of  the  research  agenda:  the  inclusion  of 
an  ever-larger  group  of  nonstate  actors,  to  use  the 
mainstream  IR  vocabulary.  Alternatively,  critical, 
postmodern,  feminist,  and/or  postcolonial  theories 
identified  a  huge,  highly  diverse,  and  often  indis¬ 
cernible  group  that  they  called  the  marginalized: 
victims  of  war,  poverty,  or  colonialism;  women; 
or,  more  generally,  all  human  beings  who  had 
become  mere  objects  of  structures  and  practices  of 
power  rather  than  being  political  subjects.  At  the 
beginning  of  the  21st  century,  there  are,  thus,  few 
phenomena  that  cannot  be  framed  in  one  way  or 
another  as  legitimate  objects  of  study  under  the 
heading  of  IR.  Indeed,  some  even  argue  that  the 
discipline  has  to  rename  itself  in  order  to  do  justice 
to  the  causal  and  constitutive  connections  that 
link  so  many  levels  of  political  action  in  global 
society. 

It  is  hardly  surprising,  therefore,  that  both  the 
structure  of  the  most  influential  professional  orga¬ 
nization  in  the  field  of  international  studies,  the 
International  Studies  Association,  and  the  most 
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recent  comprehensive  survey  of  the  major  topics 
of  the  field  appear  to  cover  essentially  any  phe¬ 
nomenon  of  politically  relevant  social  action 
transcending  state  borders.  The  book  version  of 
the  ISA  “compendium”  encompasses  12  volumes 
and  more  than  8,000  pages  of  text  (Robert 
Denemark,  2010).  The  open-ended  online  com¬ 
pendium  is  even  more  voluminous.  The  same 
applies  as  far  as  the  structure  of  the  ISA  is  con¬ 
cerned.  The  24  sections  are  impressive  not  only 
for  their  breadth  but  also  for  the  fact  that  paradig- 
matism  is  much  less  visible  at  this  level  of  organi¬ 
zation.  Section  themes  include  professional  and 
pedagogical  concerns  (as  in  the  sections  “Women’s 
Caucus”  and  “Active  Learning  in  IS”)  and  episte¬ 
mological  and  methodological  concerns  (as  in  the 
sections  “Comparative  Interdisciplinary  Studies” 
and  “Scientific  Study  of  International  Processes”), 
but  most  often,  they  cover  a  broad  range  of  sub¬ 
stantive  issues  (e.g.,  “Diplomatic  Studies”; 
“Environmental  Studies”;  “Ethnicity,  Nationalism, 
and  Migration  Studies”;  “Feminist  Theory  and 
Gender  Studies”;  “Foreign  Policy  Analysis”; 
“Global  Development  Studies”;  “Human  Rights; 
“Intelligence  Studies”;  “International  Commun¬ 
ication”;  “InternationalEducation”;  “International 
Ethics”;  “International  Law”;  “International 
Organization ” ;  “International  Political  Economy  ” ; 
“International  Political  Sociology”;  “International 
Security  Studies”;  “Peace  Studies”;  “Political 
Demography”;  and  “Post  Communist  States  in 
International  Relations”).  The  only  exception  as 
far  as  a  paradigmatic  orientation  is  concerned 
relates  to  the  section  “English  School.” 

Theoretical  and  Methodological  Expansion 

Theoretical  and  methodological  sophistication 
is  almost  universally  accepted  as  a  key  criterion 
for  judging  the  quality  and  status  of  a  scholarly 
discipline.  In  both  respects,  IR  has  seen  tremen¬ 
dous,  sometimes  even  exponential,  growth.  If  we 
concentrate  on  the  20th  century,  the  formative 
period  of  IR  up  to  the  1950s  was  largely  marked 
by  an  understanding  of  theory  and  method  com¬ 
mon  among  the  (usually  much  older)  disciplines 
from  which  IR  was  drawing  its  new  talents:  (dip¬ 
lomatic)  history,  (international)  law,  economics, 
what  is  nowadays  called  area  studies,  and  the 
study  of  (domestic)  politics.  Many  of  these  were 


considered  to  be  part  of  the  humanities  rather 
than  the  (social)  sciences.  Accordingly,  the  distinc¬ 
tion  between  the  empirical  and  the  normative, 
drawn  in  a  particularly  strong  fashion  in  the 
course  of  the  “behavioral  revolution,”  was  mostly 
not  deemed  appropriate  then.  To  be  sure,  “sci¬ 
ence”  was  already  cherished  among  IR  novices. 
Yet  it  was  not  yet  as  strongly  associated  with  a 
notion  of  the  natural  sciences  as  was  the  case  in 
the  1960s  and  1970s. 

The  revolutionary  shift  to  the  new  mantra  of 
“applying  scientific  methods”  was  perhaps  best 
captured  in  the  transition  from  the  1st  edition  (pub¬ 
lished  in  1961)  to  the  2nd  edition  of  International 
Politics  and  Foreign  Policy ,  a  textbook  edited  by 
Rosenau  (1969),  who,  perhaps  along  with  Hayward 
Alker,  is  himself  an  exemplar  of  the  shifting  episte¬ 
mological,  theoretical,  and  methodological  cur¬ 
rents  of  the  discipline  over  the  past  50  years.  In 
introducing  the  57  chapters  of  the  2nd  edition, 
Rosenau  apologized  for  including  “only  9  percent 
of  the  original  selections”  even  though  he  himself 
had  stated  in  1961  that  “articles  were  included 
‘only  if  they  seemed  likely  to  be  useful  in  twenty 
years.’”  Yet  rather  than  taking  this  to  be  a  “cause 
for  embarrassment,”  he  saw  “cause  for  satisfac¬ 
tion”  due  to  the  “remarkable  growth  in  the  scope 
and  pace  of  the  theoretical  enterprise”  and  the 
“increasingly  sophisticated  penetration  of  the  mys¬ 
teries  of  international  life”  (Rosenau,  1969, 
p.  xvii).  The  book  contained  25  articles  on  14  dif¬ 
ferent  types  of  “theories  and  approaches”  as  well 
as  17  articles  on  different  “research  techniques  and 
orientations.”  For  many  older  IR  scholars,  this  was 
a  misguided  fixation  on  an  ideal  of  science  that  was 
wholly  inadequate  for  the  subject  matter  of  interna¬ 
tional  politics.  Yet  Rosenau’s  candid  assessment 
and  selections  illustrate  the  predominant  mood 
and  trends  in  the  1960s  and  1970s  quite  well. 
Although  “classical”  approaches  and  methods 
continued  to  have  their  followers — and  actually 
benefitted  themselves  from  the  behavioral  revolu¬ 
tion — the  wave  of  the  future  seemed  to  be  an 
understanding  of  “science”  that  required  “an 
articulated  secondary  language  that  permits  rea¬ 
sonable  precision  and  replicability”  (Kaplan,  1966, 
p.  4)  as  well  as  sophisticated  techniques  for  gather¬ 
ing  and  processing  data. 

Mere  listing  of  all  the  new  methods  and  tech¬ 
niques  that  were  invented  and/or  imported  in  IR 
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during  that  phase  could  spread  out  over  a  couple 
of  pages.  Suffice  it  to  say  that  such  a  list  would 
include  almost  the  whole  range  of  tools  applied  in 
other  academic  disciplines — ranging  all  the  way 
from  some  of  the  humanities  to  other  social  sci¬ 
ences  and  the  natural  sciences.  Nevertheless,  even 
some  of  its  initial  adherents  later  granted  that  “the 
promises  of  behavioralism  were  not  fulfilled” 
(Rosenau,  1993,  p.  459).  Against  this  back¬ 
ground,  among  others,  the  third  (or  fourth)  debate 
in  particular  represented  a  move  beyond  method¬ 
ology  by  digging  deeper  to  address  the  underlying 
epistemological  and  ontological  questions.  Yet 
rather  than  shrinking  the  space,  this  debate  enlarged 
it  further  through  reinvigorating  reflection  on  the 
reach  and  uses  of  qualitative  methods  and  by  prob- 
lematizing  the  very  basis  of  theory  formation.  In 
many  ways,  the  qualitative  label  reinforced  an 
encroaching  dualistic  conception  of  methodol¬ 
ogy — with  a  quantitative  pole  on  the  other  side.  Of 
course,  some  influential  theories  and/or  methods 
(e.g.,  rational  choice  and  other  formal  methods) 
that  actually  thrived  during  the  1980s  and  1990s  in 
American  IR  could  not  easily  be  subsumed  under 
such  a  dualistic  conception.  However,  influential 
publications  that  reached  far  beyond  IR  (e.g.,  Gary 
King,  Robert  Keohane,  &  Sydney  Verba,  1994) 
actually  tried  to  ease  the  tension  by  arguing  that 
there  is  a  unified  logic  of  scientific  inference  across 
a  large  spectrum  of  different  methodologies.  While 
this  proposition  was  hardly  acceptable  to  everyone, 
it  did  mirror  a  widespread  understanding  of  scien¬ 
tific  analysis  among  IR  scholars  according  to  which 
the  essence  of  scholarship  lay  in  “linking  theory  to 
evidence.”  In  the  2002  Handbook  of  International 
Relations,  this  is  the  title  of  the  single  explicitly 
methodological  chapter  covering  the  whole  spec¬ 
trum  from  rationalist  to  constructivist  theories,  the 
latter  even  including  critical  theory.  One  set  of 
methods  that  thrived  in  IR  since  the  1980s,  the  so- 
called  case  study  methods — which  were  actually 
put  at  the  center  of  qualitative  methods  by  some 
more  “scientifically”  inclined  scholars — actually 
expressed  this  understanding  most  clearly  and,  for 
many  IR  scholars,  convincingly. 

While  it  certainly  mirrored  “mainstream”  under¬ 
standings,  the  fixation  on  somehow  linking  theory 
and  evidence  with  the  help  of  certain  methods  had 
its  critics — and  increasingly  so.  As  a  matter  of  fact, 
since  the  early  1980s,  an  ever-larger  number  of 


scholars  subscribed  to  a  variety  of  postpositivist 
approaches  that  all  posited  the  mutual  impregna¬ 
tion  of  theory,  reality,  and  descriptions  thereof 
(evidence).  Most  important,  from  a  methodological 
point  of  view,  theories  such  as  feminism,  postmod¬ 
ernism,  poststructuralism,  critical  theory,  pragma¬ 
tism,  and  postcolonialism  all  questioned  whether 
“social  facts”  could  indeed  be  treated  like  “natural 
facts,”  as  implicitly  assumed  by  the  mainstream 
approaches  (Friedrich  Kratochwil,  2008).  The 
answer  was  an  outright  “No.”  Since  nature  did  not 
“speak,”  concepts  and  even  whole  vocabularies 
had  to  be  invented  in  order  to  relate  to  “the  world 
out  there”  when  one  wanted  to  describe  and 
explain  how  even  small  parts  of  it  (not  to  mention 
everything)  actually  hang  together.  In  this  view,  the 
issue  was  not  whether  (and  if  so,  how)  one  would 
come  up  with  the  “correct”  description  to  work 
within  the  actual  business  of  explanation.  Rather, 
one  of  the  key  questions  was  how  we  came  to 
describe  the  world  in  a  particular  vocabulary  in  the 
first  place  and  what  this  description  did  to  our 
being  in  and  relating  to  the  world. 

This  amounted  to  a  radical  critique  of  the 
whole  enterprise  of  how  the  “science”  of  interna¬ 
tional  relations  was  practiced  by  positivists.  In  a 
sense,  the  weight  of  the  charge  was  equivalent  to 
the  one  that  scientists  had  leveled  against  tradi¬ 
tionalists  during  the  second  debate:  the  charge  of 
actually  misconstruing  what  scholarship  about  the 
international  was  all  about.  Methods,  however, 
were  not  relegated  to  the  dustbin  as  charges  about 
“anything  goes”  seemed  to  indicate.  Indeed,  if 
anything,  the  third  (or  fourth)  debate  helped  fur¬ 
ther  broaden  the  theoretical  and  methodological 
horizon  of  the  discipline  by  opening  it  for  a  redis¬ 
covery  of  earlier  roots  in  international  law  and 
normative  theory  and  by  more  explicitly  incorpo¬ 
rating  sociological  perspectives  (as  illustrated  by 
the  founding  of  the  journal  International  Political 
Sociology  in  2007).  To  be  sure,  few  of  its  adher¬ 
ents  would  claim  that  a  switch  to  a  postpositivist 
stance  would  be  rewarded  with  any  of  the  earlier 
promises  of  “cumulation  of  knowledge”  or  “prog¬ 
ress”  (the  latter  is  at  least  kept  as  an  option  in  a 
Lakatosian  assessment  of  different  IR  research 
agendas).  Yet  leaving  behind  the  strait  jacket  of 
“method-drivenness” — which  has  even  become  a 
dirty  word  for  the  self-proclaimed  positivists 
(Fearon  &  Wendt,  2002) — seems  to  be  enough  in 


1314  International  Relations  as  a  Field  of  Study 


terms  of  gratification  for  them.  In  sum,  even  if  the 
discipline  may  not  have  advanced  much  on  the 
path  of  cumulation  and  progress,  it  seems  to  have 
progressed  steadily  on  the  path  of  theoretical  and 
methodological  sophistication. 

Conclusion 

The  story  of  the  field  of  international  relations 
could  be  told  in  an  analogous  fashion  to  the  story 
of  “the  expansion  of  international  society.”  It  may 
well  be  that  a  casual  reader  of  this  entry  20  or 
50  years  from  now  might  actually  have  precisely 
that  impression — with  all  the  critical  undertones 
that  a  postcolonial  perspective  would  want  to  add. 
Yet  this  probably  is  how  “the  state  of  the  art” 
appears  to  a  big  group  of  practitioners  of  IR  schol¬ 
arship  today.  As  has  been  discussed  above,  the  dis¬ 
cipline  is  in  many  ways  not  up  to  the  task  (yet?)  of 
tackling,  not  to  mention  overcoming,  its  many 
awkward  parochialisms.  These  are  all  the  more 
glaring  given  the  almost  universal  expression  of  an 
ever  more  globalizing  world — irrespective  of  how 
one  may  define  the  phenomenon  of  globalization — 
and  the  fact  that  the  discipline  itself  lays  claim  to 
actually  analyzing  these  processes  and  features 
within  its  purview. 
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International  Society 


The  concept  of  “international  society”  was  first 
elaborated  as  a  key  concept  for  international 


relations  analysis  in  Charles  Manning’s  book  The 
Nature  of  International  Society  (1962).  Manning’s 
younger  colleague  at  the  London  School  of 
Economics  and  Political  Science,  Martin  Wight, 
produced  a  series  of  works  that  largely  remained 
unpublished  during  his  lifetime;  in  these,  he  elabo¬ 
rated  on  what  he  called  a  society  of  states,  which  he 
juxtaposed  with  a  system  of  states.  Besides  Manning, 
with  whom  he  remained  at  intellectual  loggerheads, 
Wight  was  indebted  to  the  deliberations  of  the 
British  Committee  on  the  Theory  of  International 
Politics,  of  which  he  was  a  founder  member  since 
1958,  for  his  ideas.  These  ideas  found  further  fertile 
ground  in  the  mind  of  Hedley  Bull  (1977),  who  jux¬ 
taposed  Wight’s  two  key  concepts  in  a  double  defi¬ 
nition,  whereby  “a  system  of  states  is  formed  when 
two  or  more  states  have  sufficient  contact  between 
them  and  have  sufficient  impact  on  one  another’s 
decisions  to  behave  at  least  in  some  measure — as 
parts  of  a  whole”  (pp.  9-10).  “A  society  of  states  (or 
international  society),”  on  the  other  hand, 

exists  when  a  group  of  states,  conscious  of 
certain  common  interests  and  common  values, 
form  a  society  in  the  sense  that  they  conceive 
themselves  to  be  bound  by  a  common  set  of  rules 
in  their  relations  with  one  another,  and  share  in 
the  working  of  common  institutions,  (p.  13) 

These  foundational  definitions,  along  with  a  num¬ 
ber  of  others,  are  to  be  found  in  what  remains  the 
major  text  on  international  society — namely,  The 
Anarchical  Society. 

Indeed,  Bull’s  concept  of  international  society 
has  since  been  given  so  much  attention,  particu¬ 
larly  in  Britain,  Australia,  and  South  Africa,  but 
also  in  Scandinavia  and  Italy,  that  there  has 
evolved  a  self-conscious  tradition  or  school  of 
thought  that  refers  to  itself  as  the  English  School  of 
international  relations  or  indeed  the  international 
society  approach.  The  English  School  forms  a 
separate  section  within  the  International  Studies 
Association  and  has  its  own  website.  The  question 
of  who  is  in  and  who  is  out  of  this  school  has  been 
discussed  at  narcissistic  length.  For  an  overview  of 
this  matter,  the  reader  is  referred  to  Chapter  2  of 
Andrew  Linklater  and  Hidemi  Suganami’s  book 
the  English  School  of  International  Relations 
(2006),  a  book  that  is  also  an  excellent  introduc¬ 
tion.  Timothy  Dunne  and  Brunello  Vigezzi  are  the 
school’s  own  principal  historians. 
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There  has  been  extensive  debate  within  the 
English  School  about  whether  the  concept  of  inter¬ 
national  society  may  best  be  understood  as  a  styl¬ 
ization  of  historical  sequences  or  as  a  functional 
construct.  James  Mayall  (1990)  sees  it  as  basically 
historical,  Alan  James  would  see  it  as  basically 
theoretical,  whereas  Barry  Buzan  once  saw  it  as  a 
hybridization,  arguing  that  the  emergence  of  inter¬ 
national  society  may  be  understood  as  an  ideational 
process  turning  on  a  meaning  that  is  inextricably 
tied  up  with  the  functional  emergence  of  an  inter¬ 
national  system.  In  a  later  work,  however,  Buzan 
(2004)  did  away  with  the  distinction  altogether. 
There  is  a  strong  case  for  doing  so.  As  Max  Weber 
had  already  pointed  out,  the  most  instrumental 
action  is  still  imbued  with  meaning,  and  the  most 
symbolic  action  also  has  some  instrumentality. 
Furthermore,  it  is  difficult  to  imagine  that  two 
polities  would  be  aware  of  one  another,  let  alone 
communicate,  without  a  modicum  of  shared  mean¬ 
ing  and  some  regularity  in  the  quality  of  interac¬ 
tion.  Ontically,  the  difference  between  international 
system  and  international  society  seems  unsustain¬ 
able.  As  demonstrated  by  the  output  of  the  school, 
however,  it  has  proven  itself  to  be  a  nice  analytical 
tool,  which  means  that  we  may  recognize  it  as  an 
analytical  concept  on  pragmatic  grounds. 

Bull  was  very  interested  in  discussing  different 
existing  and  possible  world  orders.  Building  on 
Wight’s  (1977)  historical  work,  the  school  has 
evolved  a  body  of  work  on  the  comparative  sociol¬ 
ogy  of  status  systems  that  includes  a  hypothesis 
that  world  history  oscillates  between  periods  of 
empire  and  periods  of  state  systems,  a  study  of 
interaction  between  different  polities  over  a  period 
of  60,000  years,  and  a  study  of  how  European 
international  society  expanded  to  the  world  at 
large  (Bull  &  Watson,  1984). 

If  an  international  society  exists  when  states  are 
conscious  of  common  interests  and  values  and 
share  in  the  working  of  common  institutions,  then 
the  places  to  look  for  it  are  surely  in  people’s 
thinking  about  international  relations  as  well  as  in 
the  patterned  practices  of  states  as  they  exist  in 
specific  fields.  Note  that  international  society 
should,  therefore,  seemingly  be  understood  as  a 
kind  of  structure  that  may  only  be  studied  through 
its  effects  and  that  is,  by  the  same  token,  consti¬ 
tuted  by  those  effects.  Dunne  called  his  history  of 
the  school’s  origins  Inventing  International  Society 


to  underline  precisely  this  imagined  nature  of 
international  society  as  well  as  the  constitutive, 
and  hence  political,  role  played  in  its  invention  of 
imagination  by  those  working  within  the  confines 
of  the  school.  The  two  following  sections  of  this 
entry  look  at  the  institutions  and  the  thinking  that 
are  said  to  be  constitutive  of  international  society. 
A  final  section  discusses  the  significance  of  the 
school. 

The  Institutions 

To  some  scholars,  it  is  clear  that  international  soci¬ 
ety  has  the  philosophical  status  of  a  structure.  One 
of  the  characteristic  traits  of  the  English  School, 
however,  is  its  unwillingness  to  deal  with  the  phi¬ 
losophy  of  science  (see  section  on  significance 
below).  It  is,  therefore,  far  from  clear  what  kind  of 
status  the  several  members  of  the  school  seem  to 
give  to  the  various  phenomena  that  they  discuss. 
To  Wight,  who  offered  a  number  of  different  lists 
detailing  what  kinds  of  institutions  international 
relations  were  made  of,  institutions  seem  to  be 
social  practices,  and  so  constitutive.  Wight  also 
insisted  that  diplomacy,  understood  as  the  system 
and  the  art  of  communication  between  powers, 
was  the  master  institution  of  international  rela¬ 
tions.  To  Bull,  who  refined  Wight’s  taxonomy  and 
concentrated  on  five  institutions — the  balance  of 
power,  international  law,  diplomacy,  war,  and 
great  power  (concert) — these  institutions  do  not 
seem  to  be  constitutive  of  international  society  but 
reflective  of  it.  In  his  view,  what  constitutes  order 
are  the  primary  goals  of  states — security,  the  sanc¬ 
tity  of  agreements  ( pacta  sunt  servanda ),  and  terri¬ 
torial  property  rights — and  international  society  is 
no  more  than  the  form  a  particular  order  happens 
to  take.  In  this  way,  diplomacy  and  international 
society  are  framed  simply  as  ideational  and  reflec¬ 
tive  of  international  order  and  begin  to  take  on  an 
epiphenomenal  hue. 

There  exist  book-length  studies  of  four  of  the 
five  institutions  postulated  by  Bull  that  are  vari¬ 
ously  informed  by  the  international  society 
perspective.  Richard  Little  traced  the  emergence  of 
the  concept  of  the  balance  of  power  and  discussed 
its  centrality  for  international  society.  His  is  the 
treatment  that  sticks  closest  to  the  concept  of  inter¬ 
national  society.  Buzan’s  (2004)  treatment  of  great 
powers  focuses  on  the  present  time,  giving  pride  of 
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place  to  the  United  States.  James  Der  Derian’s 
study  of  diplomacy  breaks  with  previous  work  of 
the  school  by  drawing  extensively  on  continental 
social  theory,  arguing  that  the  emergence  of  the 
institution  of  diplomacy  may  be  alternatively 
understood  as  humankind’s  negotiation  of  alien¬ 
ation.  Evan  Luard’s  discussion  of  war  in  interna¬ 
tional  society  is  more  of  a  preliminary  study  that 
may  serve  as  a  platform  for  an  international  soci¬ 
ety-inspired  analysis.  Adam  Roberts  and  others 
have  conducted  a  number  of  international  society- 
based  shorter  studies  of  how  law  constitutes  inter¬ 
national  society. 

The  Thinking  and  the  Thinkers 

It  is  another  characteristic  of  the  school  that  it  is 
more  concerned  with  the  thinkers  of  international 
society  than  with  the  thinking  of  international 
society.  Whether  we  should  think  of  this  as  a  con¬ 
sequence  of  methodological  individualism  is  a 
moot  question  in  the  school’s  own  terms,  for  on 
this  point,  as  elsewhere,  the  school  does  not  dwell 
on  the  status  of  the  phenomena  it  discusses.  In  Bull 
and  Watson’s  coedited  Expansion  of  International 
Society  (1984),  Bull  fruitfully  uses  the  concept  of 
an  international  political  culture  to  discuss  the  dif¬ 
fusion  of  ideas  and  practices  and  in  his  evaluation 
of  the  extent  to  which  “The  Revolt  Against  the 
West”  challenged  world  order  at  large.  There  exist 
important  studies  of  how  specific  themes  have 
been  thought  up  against  international  society:  for 
example,  Nationalism  and  International  Society 
(Mayall,  1990),  which  argues  that  mainstream 
liberal  thinking  over  the  past  200  years  or  so  was 
both  statist  and  nationalist,  and  Revolution  and 
World  Order:  The  Revolutionary  State  in 
International  Society  (David  Armstrong,  1993), 
which  generalizes  that  revolutionary  regimes  will 
initially  break  with  the  idea  of  an  international 
society,  only  to  come  back  at  some  later  point. 
John  Vincent  mapped  the  historical  case  for  nonin¬ 
tervention,  leaving  Nick  Wheeler  to  set  out  the 
case  for  humanitarian  intervention. 

These  remain  exceptions,  however.  The  school 
has  mainly  preoccupied  itself  with  how  specific  phi¬ 
losophers  and  statesmen  have  conceptualized  global 
politics,  particularly  the  degree  to  which  and  the  way 
in  which  they  have  dwelled  on  the  existence  of  a  pos¬ 
sible  international  society.  Rather  than  following  the 


lead  of  Manning,  who  looked  at  patterns  of 
thought,  the  school  has  followed  Wight,  whose 
interest  was  what  he  called  traditions  of  thought. 
Wight  (1966)  contrasted  political  theory  with  what 
he  refers  to  as  international  theory.  Whereas  the 
former  consists  of  a  succession  of  classics  from  Jean 
Bodin  to  John  Mill,  there  is  no  succession  of  first- 
rate  books  about  the  states  system  and  diplomacy. 
As  pointed  out  by  Suganami,  inasmuch  as  “poli¬ 
tics”  concerns  the  working  of  the  polls — that  is,  the 
working  of  a  social  collective  that  strives  to  main¬ 
tain  its  boundaries  toward  the  outside,  there  can  be 
no  such  thing  as  international  politics.  As  a  response 
to  this  situation,  Wight  fastened  on  the  concept  of 
“diplomatics”  as  a  more  accurate  and  more  techni¬ 
cally  correct  way  of  referring  to  his  preferred  object 
of  study  than  “the  international”  of  everyday 
speech.  Everything  that  is  not  politics,  then,  is  dip¬ 
lomatics,  but  by  the  same  token,  everything  that  has 
nothing  to  do  with  the  internal  life  of  the  state  and 
with  the  specific  relations  between  states  is  ruled 
out  as  an  object  of  study. 

In  a  standard  lecture  course  that  has  served  as  a 
template  for  many  a  later  school  lecturer,  Wight 
(1991)  postulated  three  traditions  of  diplomatics 
for  all  international  intercourse,  its  purposes  and 
objects,  in  times  of  peace.  These  were  the 
Machiavellian  (or,  to  Bull,  Hobbesian),  the  Grotian, 
and  the  Kantian.  As  is  evident  from  the  use  of  the 
names  of  specific  philosophers,  these  traditions 
were  conceptualized  as  chains  of  specific  philoso¬ 
phers  and  statesmen  who  shared  an  affinity  for 
conceptualizing  international  in  one  of  three  ways. 
Sometimes  a  fourth  is  added  to  make  room  for  the 
principle  that  he  traced  back  to  Guiseppe  Mazzini — 
namely,  that  the  world  may  be  divided  into 
national  states.  Since  this  is  definitely  a  prescrip¬ 
tive  idea  of  how  global  politics  should  be  ordered, 
nationalism  does  in  this  sense  give  rise  to  an  inter¬ 
national  doctrine — namely,  that  the  subjects  of 
global  politics  should  be  nation-states. 

The  school’s  three  traditions — with  Hobbesians 
arguing  that  covenants  without  the  sword  are  but 
words,  Grotians  arguing  that  pacta  sunt  servanda 
(treaties  should  be  observed),  and  Kantians  argu¬ 
ing  in  favor  of  cosmopolitanism — clearly  have 
heuristic  value.  Given  the  way  discourse  about  the 
international  realm  is  still  set  up,  they  also  have  a 
certain  predictive  value,  since  one  may  readily 
expect  that  debates  about  any  problem  that  arises 
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(say,  a  humanitarian  intervention  or  the  setting  up 
of  a  new  international  organization  [IOJ )  will 
include  voices  growing  out  of  these  three  broad 
representations  of  which  duties,  if  any,  transcend 
borders.  As  Wight  himself  acknowledged,  how¬ 
ever,  it  is  also  the  great  weakness  of  the  scheme 
that  it  cannot  be  a  proper  categorization  in  terms 
of  covering  all  possibilities.  Specific  dates  will,  by 
necessity,  come  with  their  own  specific  color¬ 
ations.  One  may  add  that  Wight’s  historiosophy, 
which  saw  what  happened  in  the  international 
realm  as  endless  repetition,  precluded  him  from 
thinking  about  more  basic  change. 

International  society  is  conceived  as  an  anarchi¬ 
cal  society.  As  Emile  Durkheim  pointed  out,  how¬ 
ever,  societies  are  despotic,  in  the  sense  that  they 
lay  down  what  should  count  as  normal  behavior 
and  so  may  be  seen  as  a  structure  of  power  (sys¬ 
tems  are  arrangements  through  which  values  are 
authoritatively  allocated  for  a  society,  as  David 
Easton  would  have  it).  The  tension  between  anar¬ 
chy,  on  the  one  hand,  and  society,  on  the  other,  is 
the  constitutive  tension  of  the  school.  By  dint  of  an 
internal  logic,  then,  the  Grotian  tradition  seems  to 
be  the  broad  home  for  the  school  itself.  According 
to  the  school,  it  comes  in  two  variants:  pluralist 
and  solidarist.  Pluralists  celebrate  the  cultural  plu¬ 
ralism  displayed  throughout  international  society, 
giving  pride  of  place  to  the  importance  of  systems 
maintenance.  One  recognizes  an  affinity  to  the  pru¬ 
dence  of  the  Hobbesian  tradition.  Solidarists,  on 
the  other  hand,  concentrate  on  how  international 
society  should  be  strengthened  by  forging  ever  new 
ties  between  its  members,  giving  rise  to  new  institu¬ 
tions  and  conventions  that  warrant  the  forging  of 
minimum  standards  binding  on  all.  Here,  one  rec¬ 
ognizes  an  affinity  to  Kantian  cosmopolitanism. 

Note  that  these  are  self-reflective  normative 
concerns;  the  school  is  tied  together  by  the  celebra¬ 
tion  of  a  real-existing  international  society  and 
divided  by  debates  about  how  far  this  celebration 
should  be  taken.  Solidarist  tendencies  usually 
come  with  an  interest  in  how  international  society, 
conceptualized  as  a  society  of  states,  is  related  to 
world  society,  conceptualized  as  a  society  of  sun¬ 
dry  agents.  While  maintaining  that  states  make  up 
the  dominant  system  of  units,  the  school  has 
sought  to  incorporate  the  further  complication  of 
nonstate  actors  more  fully  within  its  theoretical 
schema — an  aim  it  shares  with  the  existing  global 


polity  theory.  Just  as  states  make  up  a  system,  non¬ 
state  actors  are  theorized  increasingly  to  make  up 
a  nonstate  system  of  their  own  (although  this,  it  is 
maintained,  depends  ultimately  on  the  state  sys¬ 
tem).  This  system  is  embedded  in  a  nonstate  set  of 
norms,  rules,  values,  and  conventions — that  is,  a 
“world  system”  of  nonstate  actors  is  embedded  in 
a  “world  society”  of  nonstate  values,  rules,  institu¬ 
tions,  and  so  on  (Buzan,  2004). 

Significance 

Much  like  the  body  of  literature  on  geopolitics,  the 
major  significance  of  the  literature  on  interna¬ 
tional  society  is  to  allow  a  wide-ranging  investiga¬ 
tion  of  some  deeply  pertinent  questions  that  are 
key  to  the  discipline  of  international  relations  at 
large.  Again,  like  geopolitics,  the  claims  that 
emerge  from  these  investigations  tend  to  be  rather 
sweeping.  Unlike  geopolitics,  however,  the  style  is 
historical  and  empirical,  and  the  methodology  is 
implicit  and  commonsensical.  The  English  School 
is  English  not  least  in  its  empiricism.  Some  schol¬ 
ars  believe  that  Dunne  is  right  in  stressing  that  the 
invention  of  an  interpretative  approach  to  the  his¬ 
tory  of  ideas  about  international  relations  is  a 
constitutive  as  well  as  a  crowning  achievement  of 
the  school.  However,  when  compared  with  other 
and  parallel  attempts  to  invent  such  approaches  in 
the  social  sciences,  some  think  that  the  school 
comes  up  a  bit  short.  In  Germany,  there  emerged 
concurrently  with  the  first  English  School  writings, 
in  the  early  1960s,  a  tradition  of  conceptual  his¬ 
tory  led  by  Reinhart  Koselleck.  This  work  has 
resulted  in  multivolume  treatments  of  the  German 
and  French  political  languages.  Meanwhile,  in 
France,  Michel  Foucault  initiated  a  number  of 
studies  where  the  history  of  ideas  is  paired  to  social 
history  in  a  different  but  comparable  manner. 
Finally,  in  the  very  same  university  where  the 
British  Committee  met,  at  the  same  time,  and  out 
of  the  same  broadly  empirical  Anglo-American 
tradition  to  which  the  English  School  itself  belongs, 
there  emerged  an  undertaking  spearheaded  by 
Cambridge  historians  such  as  Quentin  Skinner  to 
write  the  history  of  the  past  in  terms  of  the  ideas 
that  animated  it.  This  undertaking  has  resulted  in 
a  number  of  meticulously  researched  monographs 
as  well  as  methodological  essays.  Alexander 
Wendt’s  work,  where  the  school’s  concept  of  the 
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three  traditions  of  thinking  about  international 
relations  is  modified  and  applied  in  his  theorizing 
of  how  international  society  may  mature,  demon¬ 
strates  how  this  body  of  work  may  be  paired  up 
with  wider  theorizing  to  produce  new  insights. 

Nowhere  is  the  historical  bent  of  the  interna¬ 
tional  society  tradition  more  on  display  than  in  a 
book  with  the  telling  title  The  Expansion  of 
International  Society  (Bull  &  Watson,  1984). 
Suganami  noted  in  his  contribution  on  Japan’s 
entry  into  international  society  how  Admiral  Perry 
used  deliberate  mistranslation  of  a  key  treaty  to 
secure  the  United  States  permanent  representation 
in  Japan.  Expansion  is  a  power-laden  process.  This 
fact  is  not  adequately  on  display  in  works  on  inter¬ 
national  society. 

Expansion  is  also  a  process  that  is  imagined  as 
spreading  outward  from  a  center.  Although  there 
may  be  setbacks  and  even  reversions,  the  concep¬ 
tion  is  of  a  process  where  one  party  imposes  its 
order  on  the  other,  with  little  or  no  residue  and 
without  being  itself  changed  by  the  experience.  It 
is  of  course  true  that  European  international  soci¬ 
ety  has  expanded  to  cover  the  entire  globe,  so  in 
terms  of  teleology,  this  perspective  on  history  is 
not  unwarranted.  The  problem  is,  however,  that  it 
is  not  relational.  It  does  not  invite  scrutiny  of  the 
relations  that  went  into  producing  the  result  of 
European  expansion.  This  is  scientifically  prob¬ 
lematic,  for  we  want  to  know  what  kinds  of  ten¬ 
sions  and  conflicts  gave  rise  to  the  consequences 
of  European  expansion.  It  is  also  politically  prob¬ 
lematic,  for  it  invites  a  reading  where  an  “us” 
expands  at  “their”  expense  rather  than  demon¬ 
strating  the  power  asymmetries  and  hybridiza¬ 
tions  that  marked  the  process. 

Manning  criticized  Wight  for  not  empathizing 
with  his  aim  of  establishing  international  relations 
as  a  social  science.  Similar  critiques  have  accompa¬ 
nied  the  school  at  every  turn  since  then.  The  school 
would  have  come  off  on  an  even  better  footing  if  it 
had  followed  Manning’s  aspiration  to  write  works 
that  were  both  studies  of  international  relations 
and  contributions  that  were  explicitly  dialogical  in 
their  relation  to  the  (rest  of  the)  literature  on  social 
phenomena.  The  school  has  followed  Wight’s  lead 
and  styled  its  work  as  a  series  of  comments  on  the 
general  flow  of  history,  instead  of  Manning’s  com¬ 
peting  vision  of  doing  something  more  in  the  vein 
of  the  sociology  of  knowledge  of  international 


relations.  What  the  English  School  seems  to  need 
at  the  present  juncture  is  to  complement  in  many 
ways  the  unique  historical  and  taxonomic  work 
already  carried  out  by  sustained,  theory-led  studies 
that  may  bring  the  school  closer  to  the  general 
style  of  work  in  the  social  sciences. 

Iver  B.  Neumann 
Norwegian  Institute  of  International 
Affairs  (NUPI) 
Oslo,  Norway 
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Collective  action 
complex  system  modeling,  2:365 
epistemic  communities,  3:790 
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identity  and,  8:2435 
by  interest  groups,  4:1223-1224 
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public  goods,  2:307-308,  7:2160,  7:2387 
repertoire,  5:1593,  8:2433,  8:2441 
social  capital  and,  4:1104,  6:1974 
theories,  4:1018,  4:1223-1224 

See  also  Common  goods;  Cooperation;  Mobilization, 
political;  Parties;  Social  movements 
Collective  action  problems,  7:2205 
Collective  goods.  See  Public  goods 
Collective  identity,  1:277,  1:278,  8:2434-2435 
Collective  memory.  See  Historical  memory 
Collective  security,  2:299-302 
definition,  2:299 
enforcement,  2:301 
harmony  of  interests,  2:392-393 
history  of  concept,  2:299 
nature  of,  2:299-301,  7:2386-2387 
neutrality  and,  5:1697-1698 
participants,  2:300 

United  Nations  and,  8:2677,  8:2679-2680 
See  also  League  of  Nations;  Security  cooperation 
Collider  variables,  8:2700 
Collier,  David,  2:370,  2:372 
Colonialism,  2:302-307 
administration,  2:303-304,  6:1754 
in  Africa,  1:41,  2:303-304,  2:305,  2:306,  4:1155, 

5:1648,  8:2500 
area  studies  and,  1:87 
civilizing  mission,  2:303 
commercial  involvement,  5:1641-1642 
conflicts,  1:264,  1:267 
contested  paradigm,  2:304-306 
decolonization,  3:644 
definition,  2:302 

European,  2:302,  2:303,  5:1641-1642,  5:1648, 
7:2086-2087 

international  law  and,  4:1242 

justifications,  2:303 

language  policies,  7:2070-2071 

legacies,  2:306,  7:2085 

mercantilism  and,  8:2656 

modernity  and,  7:2088 

neo-,  1:40,  1:41,  2:304,  3:641,  3:642,  6:1731 

political  doctrine,  2:302-304 

resistance,  1:267,  2:304,  7:2086-2087,  8:2394,  8:2396 
state  formation,  8:2500 
Zionism  as,  8:2767 

See  also  Decolonization;  Empires;  Imperialism;  Indigenous 
peoples’  rights;  Orientalism;  Postcolonialism 
Columbia  University,  1:139,  5:1534,  5:1535, 

6:1778-1779,  6:2014 
Comecon,  8:2458 

Comintern.  See  Communist  International 
Commission  on  Global  Governance,  4:1011 
Common  goods,  2:307-310 
common  pool  resources,  2:307,  2:308,  2:309,  5:1666, 
7:2159,  7:2204,  8:2643-2646 


definition,  2:307 

empirical  research,  8:2645-2646 

global,  7:2160-2161 
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managing,  2:308-309,  7:2205,  8:2644-2646 

prisoners’  dilemma,  7:2132 
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See  also  Natural  resources;  Public  goods;  Tragedy  of  the 
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Common  pool  resources  (CPRs),  2:307,  2:308,  2:309, 
5:1666,  7:2159,  7:2204,  8:2643-2646 
See  also  Tragedy  of  the  commons 
Communal  rolls,  3:751 
Communication 
during  crises,  2:496 
logics,  4:1188,  4:1196 
See  also  Political  communication 
Communication  media,  5:1530 
See  also  Media,  electronic;  Media,  print 
Communication  studies,  6:1961-1962 
Communication  technology,  5:1684,  6:1939,  6:2060 
Communicative  action,  2:499,  3:685,  4:1196,  7:2327-2328 
Communicative  discourse.  See  Discourse  analysis 
Communism,  2:310-314 
anarcho-,  1:76 

distinction  from  socialism,  2:310,  8:2450-2451 
equality,  8:2450 

evolution  of  theory,  2:310-311,  2:314-315 
as  ideal,  2:313-314 

ideology,  2:311,  2:314-315,  2:320,  4:1145 
intellectuals  and,  4:1208 
legacy,  2:313 

Marx  on,  2:310,  2:311-312,  2:314-315,  5:1499 
nationalism  and,  2:313 
pacifism  and,  6:1759 
as  political  organization,  2:311-312 
redistribution,  7:2225 
utopianism  and,  8:2687 
See  also  Maoism;  Marxism;  Socialism 
Communist  International  (Comintern),  2:310,  2:312, 
2:317-319,  6:1796,  8:2452 
Communist  Manifesto  (Marx  and  Engels),  2:310, 
2:314-315,  5:1493-1494,  5:1497-1498,  8:2451, 
8:2657,  8:2687 

Communist  parties,  2:314-320 

after  fall  of  communist  regimes,  7:2092 
Chinese,  2:312,  2:319,  2:320,  3:595,  3:598-599,  5:1422, 
5:1482,  8:2631,  8:2660 
contemporary,  2:319-320 
Cuban,  2:319 
in  democracies,  2:313 
democratic  centralism,  2:312,  2:316 
development  stages,  2:319 
East  European,  2:312,  2:318,  2:320,  2:325 
French,  2:312,  2:317,  2:318,  6:1797 
goals,  2:314 

hegemony,  2:311,  2:316,  8:2457 
history,  6:1796-1797,  8:2452 
ideology,  2:314 
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6:1795,  6:1797 
Marx  on,  2:311-312,  5:1494 
media  control,  5:1543 
organization,  2:316-317 
origins,  2:315-316 
Stalinist,  8:2483 
structures,  8:2458 
vanguard,  2:312 

West  European,  2:312,  2:313,  2:317,  2:318,  6:1796-1797 
See  also  Socialist  parties 
Communist  Party  of  the  Soviet  Union  (CPSU) 
democratic  centralism,  2:312,  2:316,  3:611-612,  8:2483 
dictatorship,  3:654 
dissolution,  2:316,  2:320 
founding,  2:310,  2:316 
Lenin  and,  2:312 
membership,  2:317,  2:320 

nomenklatura ,  2:316-317,  2:322,  8:2458,  8:2459,  8:2484 
Politburo,  2:317 
popular  sovereignty  and,  3:612 
relations  with  other  communist  parties,  2:318-319 
See  also  Marxism-Leninism;  Stalinism 
Communist  systems,  2:320-326 
collapses,  2:313,  2:319-320,  2:325,  7:2091,  8:2396, 
8:2460-2461,  8:2635,  8:2660 
common  features,  2:311 
contradictions,  2:321-322,  2:323-324 
definition,  2:320 

dictatorships,  3:654,  3:655,  3:656,  3:657 
dissidents,  4:1208-1209 
economic  declines,  8:2660 

economic  organization,  2:311,  2:322-323,  7:2090-2091 

elites,  2:316-317,  2:322,  3:762,  3:763,  8:2458,  8:2459 

history,  2:310-311 

ideologies,  7:2090-2091 

informal  economy,  2:323 

legacies,  7:2092 

legitimacy,  5:1421,  5:1422 
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parliamentary  governments,  2:321 

politics,  2:321-322 

print  media,  5:1543-1544 

revisionism,  7:2311-2312 

social  organization,  2:323-324 

as  totalitarian,  2:311,  2:321 

variations,  2:324-325 

violent  repression,  2:311,  2:320,  2:324-325,  8:2460 
See  also  China;  Cold  War;  Eastern  Europe;  Postcommunist 
regimes;  Socialist  systems;  Soviet  Union 
Communitarianism,  2:326-331 
academic,  2:326-327 
authoritarian,  2:327 
branches,  2:326-328 
citizenship,  1:239,  1:240-241 
civil  society,  1:262 

contrast  to  liberalism,  1:240-241,  2:326,  2:330 

definition,  2:326 

East  Asian,  2:327,  2:328 


experiments,  8:2449,  8:2688 
homeland  security  and,  2:328-329 
influence,  2:326 

international  relations  theory  and,  2:328 
moral  dialogues,  2:329-331 
responsive,  2:327-331 
social  capital,  8:2410-2411 
Tocqueville’s  influence,  8:2624 
view  of  state,  1:262 
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expatriate,  3:651,  3:652-653 

imagined,  4:1270,  5:1645,  5:1654,  5:1659,  7:2088-2089, 
8:2394 

network  analysis,  5:1690 
security,  7:2388-2389 
transnational,  5:1569 
utopian,  2:310,  8:2449,  8:2688 
See  also  Epistemic  communities;  Policy  communities 
Comparative  area  studies,  1:89 

Comparative  foreign  policy  (CFP),  3:917,  3:926,  3:927 
Comparative  methods,  2:331-341 
assumptions,  3:795 

case  studies,  2:332,  2:334-335,  2:339,  2:343-344,  5:1560 

causality,  2:332,  2:339,  3:796 

configurational,  2:388-391 

current  developments,  2:340-341 

deductive  approach,  2:333-334 

definition,  2:331 

formal,  2:335-338,  2:340-341 

goals,  2:333-334 

human  rights  research,  4:1104-1105 
inductive  approach,  2:333-334 
informal,  2:335,  2:340 
logic,  2:334-335 

small-N  researchers,  2:331,  2:332,  3:794 
statistical,  2:342-343 
survey  research,  6:1970-1974,  6:1977 
time-series  cross-section,  8:2531,  8:2615-2621 
use  of,  3:794,  6:1970-1974 
variable-oriented  methods  and,  2:338-340 
See  also  Case  studies;  Cross-national  surveys 
Comparative  policy  analysis,  2:354-355 
Comparative  politics,  2:342-359 
definition,  2:342 

distinction  from  international  relations,  2:342,  2:357 
future  of,  2:355-357 

historical  institutionalism,  2:346-347,  2:349,  4:1037 
human  rights,  4:1104-8 
issues,  2:350-355 

legal  constitutionalism,  2:416,  2:418-420,  4:1037 

methods,  2:342-344 

political  development  field,  3:644-648 

public  office  rewards,  7:2163,  7:2166 

rational  choice  institutionalism,  2:345-346,  4:1037 

social  movements,  8:2435-2436 

sociological  institutionalism,  2:347-349 

structural  functionalism,  4:996 

success,  2:355 

theories,  2:344-349 
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typologies,  6:2040-2041,  6:2042-2050 
See  also  Regimes 

Competition,  political,  2:359-364 
alternation  in  government,  6:1828-1829 
among  parties,  2:360,  2:361,  2:363,  3:728,  6:1792, 
6:1828-1829 
definition,  2:359 

in  democracies,  2:359-361,  2:363,  2:589 
democratic  quality,  2:568 
electoral,  2:359-361,  2:363 
governmental,  2:359 
measures,  2:361 

median  voter  theorem,  2:360,  6:1981-1982,  6:2032, 
8:2417,  8:2716 

political  economy  models,  6:1981-1983 
subversion,  2:570 
theory,  2:361-363 
veto  players,  8:2706 
Competitive  democracy,  6:1956 
Complex  system  modeling,  2:364-367 
compared  to  game  theory,  2:366 
foundations,  2:364-365 
political  science  applications,  2:365-367 
Complexity,  2:364-367,  8:2581 
Compliance,  2:367-370 
definition,  2:367 
enforcement,  2:368-369 
management  approach,  2:368,  2:369 
by  states,  2:367-369 

Comprehensive  Nuclear  Test  Ban  Treaty,  3:671,  4:1242 
Compromises,  in  consociational  democracies,  2:556,  2:557 
Computational  models.  See  Models,  computational/ 
agent-based 

Computer  simulations.  See  Information  technology;  Monte 
Carlo  methods 

Comte,  Auguste,  6:1951-1952,  7:2080-2081 
Concept  formation,  2:370-382 
abstraction  levels,  2:375 
analytic  frames,  2:376-377 
definition,  2:375 
disorders,  2:372-375 
empirical  boundaries,  2:376 
innovation,  2:380-381 
language  acts,  2:370-371 
Ogden-Richards  triangle,  2:372 
in  political  science,  2:370,  2:380-381 
reconstruction,  2:375 
semantic  fields,  2:376 
structures,  2:377-378 
Concepts 

abstraction  levels,  2:375 
ambiguity,  2:373 
contested,  2:379-380 
cores,  2:377 

definitional  defects,  2:374 
diminished  subtypes,  2:378-379 
family  resemblances,  2:377-378 
instability,  2:374 
meanings,  2:371,  2:375,  2:380 


opacity,  2:372-373 
reification,  2:371-372 
semantic  constraints,  2:381 
semantic  fields,  2:381 
structures,  2:377-378 
vagueness,  2:373-374 
Conceptual  abuse,  2:374-375 
Conceptual  confusion,  2:373-374 
Conceptual  disorders,  2:372-375 
Conceptual  lumping,  2:375 

Concert  of  Europe,  4:1011,  5:1632,  7:2386,  8:2677 
Conditionality,  2:382-384 
aims,  2:382-383 
definition,  2:382 
effectiveness,  2:383-384 
evolution,  2:383 
foreign  aid,  2:383,  4:1040 
legitimacy,  2:384 

structural  adjustment  packages  and  loans,  2:382-383, 
2:384,  4:1016,  8:2756-2757 
utility  calculations  and,  2:382-383 
welfare  policies,  8:2745,  8:2751 
Conditions,  necessary  and  sufficient,  2:384-388,  7:2175 
Condorcet,  Jean-Antoine-Nicolas  de  Caritat,  Marquis  de, 
3:809,  7:2203,  8:2417 
Condorcet  cycle,  8:2416 
Condorcet  efficiency,  3:754 
Condorcet’s  paradox,  3:709 
Conference  on  Disarmament  (CD),  3:671 
Conference  on  Security  and  Cooperation  in  Europe  (CSCE), 
8:2461 

Configurational  comparative  methods  (CCMs),  2:388-391 
applications,  2:389,  2:390 
assumptions,  2:388-389 
causality,  2:388-389 
debates  on,  2:390 
definition,  2:388 

further  developments,  2:390-391 
goals,  2:388-389 
techniques,  2:389-390 
See  also  Qualitative  comparative  analysis 
Confirmatory  factor  analysis  (CFA),  3:881,  3:882-884, 
5:1524,  8:2554 

Conflict  resolution,  2:391-394 

civil  wars,  1:268-269 
definition,  2:391 
between  democracies,  3:621 
mediation,  5:1545-1548 
participants,  2:393 
peaceful  means,  5:1545,  6:1759 
superpower  interventions,  2:396 
theoretical  approaches,  2:392-393 
third-party  roles,  6:1841 
Conflict  theory,  2:396,  6:2027-2028,  6:2029 
Conflicts,  2:394-399 
armed,  8:2723-2725,  8:2726-2729 
classifications,  2:394-395 
definition,  2:394,  8:2727 
dimensions,  2:3  95-396 
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nonstate,  8:2724,  8:2727 
origins,  2:394 

possible  outcomes,  2:391-392 
religious,  5:1351,  6:1720,  7:2262 
research,  2:397-398 
in  state  of  nature,  2:447 
strategies,  2:396,  2:397 
territorial,  8:2589,  8:2727,  8:2728 
theoretical  approaches,  2:396-397 
violent,  2:397-398 

See  also  Civil  wars;  Ethnic  conflicts;  Revolutions; 

Violence;  Wars 

Confounding  variables,  5:1581-1582,  5:1597, 
7:2181-2182,  8:2699,  8:2701 
Confucianism,  2:399-402 
capitalism  and,  1:192 
in  China,  1:163 

compatibility  with  democracy,  2:401 
good  government  principles,  2:399^401 
moral  education,  2:400 
political  statements,  6:2052 
view  of  politics,  ldiii 
Confucius,  2:399,  2:400,  2:401,  6:2051 
Congo  crisis,  8:2681 
Congress,  U.S. 

agenda-setting,  1:50 
apportionment,  3:751,  3:754 

delegation  to  bureaucracy,  2:548-550,  7:2128,  7:2203 
impeachment  process,  8:2405 
powers,  1:18-19,  8:2405 
Senate  Intelligence  Committee,  4:1057 
war  powers,  8:2406,  8:2407 
Congress  of  Vienna,  8:2450,  8:2677 
Conner,  Mark,  1:95 

Conscientious  objectors,  6:1721,  6:1758 
Consensus,  2:402-404 
definition,  2:402 

deliberative  policy  making,  2:551 

domain,  2:402,  2:404 

on  foreign  policy,  3:695 

in  international  organizations,  4:1253 

overlapping,  2:402-403 

political  applications,  2:404 

public  opinion  and,  7:2168 

See  also  Deliberative  democracy;  Election  by  lot; 

Groupthink 

Consensus  democracy,  2:402,  2:403,  2:555-556, 

2:564,  2:587 

See  also  Democracy,  consociational 
Consequentialism,  3:824 

Consequentialist  libertarianism,  5:1442,  5:1444-1445 
Consequentiality,  logic  of,  5:1475 
Conservatism,  2:405-410 
British  sources,  2:405-407 
classical,  2:407-408 
critique  of  liberalism,  2:405,  2:410 
definition,  2:405 
as  ideology,  2:405 
individualism,  4:1176-1177 


main  currents  of  thought,  2:407-410 
as  philosophy,  2:405 

See  also  Christian  Right;  Fundamentalisms; 
Neo-conservative  idealism 
Conservative  dictatorships,  3:656 
Conservative  parties,  2:410-413 
British,  2:406,  2:411-412,  6:1801,  6:1802 
definition,  2:410 
electoral  performance,  2:412 
in  Europe,  2:412-413 
evolution,  2:411 

organization,  2:411-412,  6:1801 
political  principles,  2:410-411 
privatization  policies,  7:2133 
role  in  party  systems,  2:412 — 413,  5:1427 
social  bases,  2:412,  6:1794 
See  also  Christian  Democratic  parties 
Considerant,  Victor,  8:2450 
Consistency.  See  Attitude  consistency 
Consociational  democracy.  See  Democracy,  consociational 
Consolidation,  democratic.  See  Democratic  consolidation 
Constitution,  U.S. 

Bill  of  Rights,  2:419,  5:1430,  6:1720,  6:1721,  7:2319, 
7:2337 

Establishment  Clause,  1:236 

executive  branch,  3:864 

federalism,  3:896 

Fifth  Amendment,  7:2138 

First  Amendment,  1:236,  3:825,  6:1720,  6:1721 

goals,  6:1723 

judicial  review,  5:1373 

liberalism,  5:1440 

presidential  system,  7:2123 
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republic,  7:2289-2290 

separation  of  powers,  1:4,  1:6,  1:10,  1:17-19,  8:2403, 
8:2404,  8:2405-2407 
Twenty-Second  Amendment,  3:864 
Constitutional  engineering,  2:413-416 
amendments,  7:2231 
constraints,  2:413,  2:414,  2:415 
definition,  2:413 
duration,  2:415 
participants,  2:414-415 
stages,  2:413 

See  also  Separation  of  powers 
Constitutional  monarchy,  5:1613,  5:1614,  7:2290 
Constitutional  patriotism,  5:1649,  5:1657,  6:1833-1834 
Constitutionalism,  2:416-421 
constraints  on  government,  6:1722-1723 
definition,  2:416 
democratic,  2:353 
goals,  2:416,  6:1722-1723 
identity  building,  5:1649 
international,  4:1243 
judiciary,  5:1383-1384 
legal,  2:416,  2:418-420,  4:1037 
legislative  voting  rules,  8:2718,  8:2719-2720 
liberalism  and,  5:1429,  5:1430,  5:1431,  5:1442 
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party  systems,  2:418 
political,  2:416-418,  2:419-420 
processes,  5:1649,  7:2340 
republican,  2:417 
rights,  2:419,  7:2337 

rule  of  law  and,  7:2336-2337,  7:2338,  7:2340 
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See  also  Separation  of  powers 
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in  consociational  democracies,  2:555 
English,  1:10,  1:18,  5:1614,  8:2404 
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as  institutions,  4:1203 
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Constructionism,  2:421 
Constructivism,  2:421-425 
contextual,  2:422 
definition,  2:421 

discursive  institutionalism  and,  3:684 
on  ethics,  3:826-827 
frames,  2:422^123,  6:1924 
globalization,  4:976 
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influence,  2:425 
international  law,  4:1241 
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methodologies,  3:798 
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sovereignty,  8:2471 
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view  of  reality,  3:796-798,  6:1879 
See  also  Discourse  analysis 
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anarchic  world,  1:79,  4:1292 
building  blocks,  2:428-430 
contrast  to  realism,  2:424 
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International  Solidarity 


Solidarity  is  a  type  of  social  relation  as  old  as  the 
first  societies.  Whether  based  on  family,  friendship, 
religion,  corporate  bodies,  or  trade,  solidarity  takes 
multiple  forms,  but  all  forms  have  in  common  a 
relation  in  which  groups  and/or  people  bring,  not 
necessarily  reciprocally,  a  particular  assistance. 
Every  social  order  supposes  a  minimal  level  of 
solidarity  between  its  elements  and,  conversely, 
every  social  relation  acts  as  an  indicator  of  a  cer¬ 
tain  social  order,  according  to  Emile  Durkheim. 
Therefore,  there  is  no  study  of  societies  without 
considering  the  forms  of  solidarity  they  enclose.  At 
an  international  level,  the  question  is  more  delicate: 
The  existence  of  solidarity  is  disputed  given  the  fact 
that  the  very  existence  of  a  society  is  much  debated. 

In  the  tradition  of  international  realism,  the 
world  is  perceived  as  a  general  competition  of 
powers,  and  solidarity  is  expressed  only  through 
military  alliances.  Actually,  one  should  speak  of 
functional  policies  oriented  toward  maintaining 
the  balance  of  power — the  only  principle  that  is 
able  to  regulate  the  international  anarchy — rather 
than  of  solidarity.  Theoretical  approaches  compet¬ 
ing  with  realism  in  international  studies  (liberalism, 
functionalism,  transnationalism,  and  constructivism) 
are  more  attentive  to  demonstrations  of  solidarity  in 
international  relations.  In  highlighting  what  binds 
actors  as  much  as  what  divides  them,  realists 
admit  the  existence  of  an  international  order  that 
entails  relations  of  cooperation  and  assistance. 
Among  these  approaches,  the  British  school  of 
international  relations  has  contributed  more  than 
any  other  to  the  development  of  the  concept  of 
“international  society.”  The  latter,  mainly  com¬ 
posed  of  states,  is  organized  around  norms  and 
institutions  that  secure  a  minimal  order  for  the 
common  interest  of  its  members  and  enable  the 
pursuit  of  shared  objectives.  According  to  Hedley 
Bull,  such  a  conceptualization  of  the  interna¬ 
tional  system  supposes  necessarily  the  existence 
of  relations  of  solidarity  and  fully  supports  their 
analysis. 

From  a  historic  and  empirical  point  of  view, 
demonstrations  of  international  solidarity  are  quite 
undisputable.  Under  various  names — support, 
assistance,  mutual  aid,  and  so  on — actors  have 
always  joined  together  to  defend  their  interests  and 


promote  their  causes.  These  practices  have  involved 
states  as  well  as  nonstate  actors.  But  the  densifica- 
tion,  that  is,  the  increased  number  and  complexity, 
of  relationships  between  societies  (what  is  called 
“interdependence”)  has  led  to  a  new  dynamic. 

Three  dimensions  of  international  solidarity  are 
considered  here.  The  most  traditional  one  refers  to 
a  political  will  to  form  alliances  for  mutual 
empowerment.  Second,  since  the  beginning  of  the 
20th  century,  international  solidarity  has  also  been 
used  as  an  instrument  for  building  up  an  interna¬ 
tional  order.  International  solidarity  is  in  fact  a 
doctrine  of  international  peace.  But,  parallel  with 
the  development  of  the  means  of  communication 
and  the  greater  sensitivity  of  societies,  international 
solidarity  has  also  turned  into  a  strong  means  of 
international  contest  of  the  global  inequalities. 
Thus,  there  is  a  third  dimension  in  which  actors  of 
international  solidarity  are  nonstate  actors  that 
contest  state  policies.  Each  of  these  dimensions  has 
its  own  characteristics  and  aims,  but  all  speak  for 
the  fact  that  international  solidarities  are  at  the 
heart  of  the  political,  economic,  and  social  history 
of  international  relations. 

A  Rationale  of  Power 

Let  us  consider  the  first  dimension:  mutual  empow¬ 
erment.  In  a  context  of  partially  competitive  rela¬ 
tions,  the  actors  of  the  international  system  seek 
naturally  to  decrease  their  vulnerability  to  pressures 
from  other  nations  and  to  enhance  their  strength 
and  build  on  their  assets.  Solidarity  therefore  con¬ 
sists  in  appealing  to  presumed  common  ties  and 
shared  objectives  to  build  a  union  that  will  reinforce 
the  capacities  of  each  of  the  parties  and  the  strength 
of  all.  This  classical  function  of  international  soli¬ 
darity  as  a  power  multiplier  may  assume  several 
shapes.  The  first  one  is  that  of  military  alliances  that 
tend  to  maximize  the  security  of  their  members 
through  reciprocal  assistance  (e.g.,  Article  5  of  the 
North  Atlantic  Treaty  Organization  [NATO]  treaty) 
or  through  various  means  of  bilateral  cooperation. 
The  creation  of  regional  unions  with  multiple  voca¬ 
tions  illustrates  a  second  form  of  mobilization  of 
solidarity  aimed  at  benefiting  from  the  (economic 
and  diplomatic)  advantages  of  an  actor  of  interna¬ 
tional  scale  (e.g.,  the  European  Union).  Third,  the 
organization  of  some  economic  lobbies  also 
expresses  a  form  of  solidarity  in  the  defense  and  the 
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promotion  of  sector-based  interests  at  the  service 
of  the  power  of  concerned  states  (e.g.,  Organization 
of  the  Petroleum  Exporting  Countries  [OPECJ). 
Eventually,  a  fourth  form  of  solidarity  resides  in 
the  elaboration  of  political  alliances,  the  aim  of 
which  is  to  promote  a  cause  or  a  position  on 
the  international  scene  (e.g.,  the  Non-Aligned 
Movement). 

In  all  cases  of  solidarity  in  this  first  form,  soli¬ 
darity  may  be  value  grounded,  but  it  is  mainly  an 
instrument  of  power  and  influence.  If  states  usu¬ 
ally  are  the  main  actors  and  benefactors,  nonstate 
actors  also  know  how  to  benefit  from  this  type  of 
mobilization  to  strengthen  their  capacity  for  action. 
The  International  Workingmen’s  Association, 
known  as  the  First  International,  which  was  cre¬ 
ated  to  defend  the  workers  and  to  reinforce  their 
organizations,  is  a  famous  historical  example  of 
the  way  nonstate  solidarity  was  invited  to  demon¬ 
strate  its  strength.  Many  other  international  mobi¬ 
lizations  were  to  follow  this  path. 

All  these  practices  of  solidarity  result  in  a  para¬ 
doxical  situation:  All  solidarities  are  favorable  to 
unity,  but  all  define  themselves  in  rivalry  and  con¬ 
frontation.  In  this  power  rationale,  solidarity  is  as 
much  a  factor  of  cooperation  as  one  of  conflict. 

A  Doctrine  of  International  Peace 

Let  us  now  turn  to  the  second  dimension  of  inter¬ 
national  solidarity:  international  order.  A  very  dif¬ 
ferent  conception  of  solidarity  emerges  as  early  as 
the  end  of  the  19th  century.  The  “first  globaliza¬ 
tion”  (1860-1914),  characterized  by  a  spectacular 
development  of  trade  relations  and  communica¬ 
tion  means,  substantiates  the  idea  that  the  world  is 
becoming  more  and  more  interdependent.  World 
War  I  and  the  1929  economic  crisis  reinforce  the 
same  image  of  a  shrinking  world.  No  state,  and 
specifically  not  in  its  economic  and  social  develop¬ 
ment,  appears  to  be  able  to  act  on  its  own;  all  seem 
more  or  less  dependent  on  the  others.  Their  interde¬ 
pendence  is  perceived  as  a  reality  that  the  first  “tech¬ 
nical”  international  organizations  (telegraphs,  post, 
transports,  etc.)  account  for,  and  it  is  in  part  inde¬ 
pendent  of  the  actors’  own  will.  Thus,  solidarity  is 
no  longer  the  result  of  an  act  willfully  undertaken  by 
states  but  has  become  a  social  fact  perceived  by 
actors  as  such.  The  interdependence  hereby  created 
reaches  beyond  the  acts  of  solidarity  undertaken. 


Leon  Bourgeois  (1851-1925)  was  one  of  the  first 
to  transform  this  empirical  recording  into  a  political 
doctrine  (solidarism).  Conceived  at  the  level  of  soci¬ 
ety  as  a  third  path  between  liberalism  and  collectiv¬ 
ist  socialism,  solidarism  calls  for  a  certain  degree  of 
social  intervention  to  guarantee  a  liberal  and  just 
order.  At  an  international  level,  according  to 
Bourgeois,  this  doctrine  supports  the  resort  to  inter¬ 
national  organizations  to  ensure  peace  and  justice. 
David  Mitrany  notes  that  many  authors  were  to 
encourage  this  perspective,  which  consists  in 
increasing  and  regulating  interdependences  between 
states  to  better  pacify  their  relations,  through  the 
mediation  of  international  organizations. 

Parallel  to  the  development  of  sector-based 
international  organizations,  the  creation  of  the 
League  of  Nations  and  then  the  formation  of  the 
United  Nations  (UN)  carries  further  the  efforts  and 
tends  to  ground  international  order  in  a  global 
political  solidarity.  The  tragic  experience  of  the 
two  World  Wars  acted  as  a  determining  factor. 
With  various  successes,  multilateralism  imposed 
itself  as  a  method  of  negotiation  between  several 
parties  and  aimed  at  gathering  states  around  shared 
values.  After  1945,  multilateralism  spread  to  almost 
all  fields  (monetary,  economic,  trade,  social,  etc.)  at 
the  world  as  well  as  the  regional  (the  European 
construction)  levels.  The  mobilization  of  concrete 
solidarities  in  the  service  of  a  global  solidarity  from 
which  all  may  benefit  became  the  official  doctrine 
of  multilateralism.  It  was  strengthened  after  decol¬ 
onization  and  under  the  growing  influence  of  coun¬ 
tries  from  the  Global  South.  However,  just  as  at  the 
time  of  the  League  of  Nations  and  despite  the 
reinforced  authority  of  the  UN  Security  Council, 
practices  of  solidarity  remain  limited.  The  assess¬ 
ment  is  incomplete  regarding  functional  coopera¬ 
tion  (technical  cooperation,  development  aid,  etc.). 
Such  solidarity  is  limited  with  respect  to  political 
and  military  cooperation.  The  Cold  War  cannot  be 
held  as  the  only  element  responsible  for  these  limi¬ 
tations.  Collective  security,  as  envisioned  by  the 
UN  Charter,  proved  impossible  to  accomplish  and 
therefore  is  considered  a  major  failure  with  regard 
to  international  solidarity,  which  should  have  pre¬ 
sided  over  the  implementation  of  Security  Council 
decisions.  Another  failure  derives  from  an  inter¬ 
state  conception  of  international  solidarity  within 
the  UN.  Solidarity  is  reduced  to  the  interests  of  the 
most  powerful  states  although,  as  phrased  at  the 
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beginning  of  the  Charter  (“We  the  People  of  the 
United  Nations”),  it  was  officially  addressed  to  the 
societies  of  the  member  states.  The  UN,  as  it  func¬ 
tions,  consecrates  an  interstate  order  that  favors 
powerful  states  instead  of  being  an  instrument  of 
solidarity  between  societies. 

A  Social  Dynamic  of  Contest 

Let  us  finally  turn  to  the  third  dimension  of  interna¬ 
tional  solidarity:  the  emergence  of  international 
social  movements  that  contest  the  established  inter¬ 
state  order.  As  early  as  the  19th  century,  interna¬ 
tional  solidarity  has  been  extended  to  the  relations 
between  societies  in  reaction  to  state  policies:  The 
actors  of  this  form  of  international  solidarity  are 
nonstate  actors.  Taking  advantage  of  the  develop¬ 
ment  of  means  of  communication,  many  social 
groups  grew  conscious  of  the  resources  they  could 
draw  from  their  solidarity  in  the  defense  and  the 
promotion  of  common  causes.  Mobilizations  of 
solidarity  multiplied  (abolitionists,  suffragettes, 
pacifists),  for  instance,  the  international  movements 
of  workers  or  the  action  of  International  Committee 
of  the  Red  Cross  (ICRC),  in  fields  neglected  by 
states.  In  parallel  to  rising  multilateralism,  a  new 
form  of  solidarity  was  born  on  the  international 
scene:  both  social  and  transnational.  After  a  period, 
when  it  fell  back  at  the  eve  of  World  War  II  (despite 
the  singular  episode  of  the  International  Brigades 
during  the  Spanish  Civil  War),  the  dynamic  strength¬ 
ened  itself  globally  through  the  creation  of  a  grow¬ 
ing  number  of  nongovernmental  organizations 
(NGOs)  having  an  international  vocation  in 
domains  as  various  as  humanitarian  emergency, 
human  rights,  development  aid,  environment,  and 
so  on.  This  activism  of  solidarity  now  adds  up  to 
that  of  transnational  social  movements,  which 
inscribes  itself  even  more  clearly  in  a  perspective 
of  contesting  state  policies  (as  the  action  of 
groups  claiming  themselves  as  partisans  of  an  alter- 
globalization  and  the  brutal  response  of  authorities 
in  Seattle  in  1999  or  Genoa  in  2001  have  testified). 
Using  various  means  (global  campaigns,  lobby, 
boycott,  countersummits,  etc. ),  the  mobilized  social 
bases  have  become  more  diversified  while  the  orga¬ 
nizations  have  transformed,  thanks  to  the  Internet, 
into  broad  networks  of  correspondents  (even  if  they 
remain  located  mainly  in  the  North),  as  noted  by 
Marco  Giugni  and  Florence  Passy. 


The  acceleration  of  communications  and  the 
media  exposure  of  international  events  have  led  to 
an  intensification  of  relations  between  societies. 
However,  transnational  solidarities  remain  fragile. 
They  are  primarily  anchored  in  local  and  national 
levels  of  action  and  therefore  strive  to  embody 
global  objectives.  They  often  remain  fragmented 
and  virtual.  They  interfere  in  interstate  relations 
but  are  no  substitute  for  them. 

Trapped  between  diverse  conceptions  and  prac¬ 
tices,  international  solidarity  remains  an  ambiva¬ 
lent  notion.  It  reveals  a  certain  state  of  international 
order  and  also  designates  one  of  the  elements  of  its 
transformation. 

Guillaume  Devin 
Institut  d’Etudes  Politiques  de  Paris  (Sciences  Po) 

Paris,  France 
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International  System 


The  international  system  is  made  up  of  individual, 
constituent  units  and  an  ordering  principle  that 
arranges  the  structure  of  those  units,  together 
forming  a  whole  toward  an  outside  environment. 
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A  third  defining  characteristic  of  the  international 
system  is  the  interaction  that  continuously  occurs 
between  the  level  of  the  individual  units  and  the 
overall  structural  level.  This  notion,  comprising 
three  basic  elements  (units,  structure,  and  unit- 
structure  interaction),  suffices  to  describe  a 
limited,  structural  conceptualization  of  the  inter¬ 
national  system.  An  alternative  definition  that 
also  encompasses  the  connections  between  the 
units  (units,  structure,  unit-structure  interaction, 
and  unit-unit  relations)  may  be  referred  to  as  a 
broader,  relational  conceptualization  of  the  inter¬ 
national  system.  The  use  of  the  concept,  in  either 
version,  is  justified  as  soon  as  the  system  exhibits 
properties  that  individual  units  do  not. 

After  a  brief  note  on  the  intellectual  heritage  of 
the  concept  of  the  international  system,  this  entry 
offers  a  discussion  of  four  sets  of  issues  found  per¬ 
tinent  to  the  subject  in  contemporary  political  sci¬ 
ence  and  international  relations  scholarship.  The 
four  sets  of  issues  that  will  help  elucidate  key 
aspects  of  the  concept  of  the  international  system 
are  (1)  the  properties  of  units  and  structure;  (2)  its 
origins  and  historical  evolution;  (3)  understand¬ 
ings  of  change,  transformation,  and  breakdown; 
and  (4)  the  inherent  potential  and  limitations  of 
the  concept. 

When  it  comes  to  the  basic  figure  of  thought,  it 
can  be  argued  that  some  idea  of  an  international 
system  was  envisaged  in  the  Amarna  letters  of 
ancient  Egypt  during  the  Late  Bronze  Age  and  is 
implicit  in  the  classical  writings  of  Thucydides, 
Kautilya,  Sun  Tzu,  and  others  who  commented  on 
the  subjects  of  diplomacy,  trade,  and  war.  Several 
of  these  authors  described  periods  of  rather  intense 
interaction  between  political  units  and  a  wider 
political  or  economic  structure,  outside  of  which 
there  was  significantly  less  such  interaction.  The 
international  system,  however,  is  a  concept  that 
emerged  in  the  early-modern  era  in  the  treatises  of 
prominent  lawyers  and  philosophers.  Following  in 
the  footsteps  of  Jean  Bodin,  Hugo  Grotius,  and 
Thomas  Hobbes  and  their  respective  works  on 
sovereignty,  state  equality,  and  political  order, 
Samuel  Pufendorf  explicitly  referred  to  the  states 
system  as  several  states  that  are  connected  as  to 
seem  to  constitute  one  body  but  whose  members 
retain  sovereignty.  During  the  20th  century,  the 
rise  of  new  academic  disciplines  made  its  mark  on 
all  concepts  containing  the  component  “system,” 


and  structuralist  authors  of  organicist,  process- 
oriented,  or  functionalist  persuasion  subsequently 
applied  systems  theories  to  virtually  all  branches  of 
the  nascent  social  sciences.  Arguably,  this  intellec¬ 
tual  trend  had  a  profound  effect  on  the  academic 
field  of  international  relations. 

Properties  of  Units  and  Structure 

Relational  concepts  of  the  international  system 
ascribe  most  importance  and  explanatory  power 
to  the  units  that  together  form  a  whole,  as  well  as 
to  the  mutual  bonds  that  are  forged  among  them. 
Realist  balance-of-power  theories  provide  a  good 
illustration  of  this,  as  do  liberal  theories  that  attri¬ 
bute  significance  to  the  internal  organization  of 
political  units  such  as  city-states,  empires,  or 
(modern)  states.  The  idea  inherent  to  the  latter  is 
that  the  internal  organization  of  individual  politi¬ 
cal  entities  affects  the  way  in  which  they  interact 
and  conduct  business  with  their  peer  entities. 

Structural  concepts  of  the  international  system 
assume  that  unit-level  action  is  essentially  shaped 
and  conditioned  by  the  overarching  incentive 
structure,  which  is  why  causes  are  difficult  to 
locate  at  lower  levels  of  analysis  (and  abstraction). 
At  the  unit  level,  we  merely  find  the  attributes  and 
interactions  of  the  constituent  parts.  At  the  system 
level,  a  powerful  structuring  principle  arranges  the 
positions  of  the  units  and  thereby  predisposes  the 
mode  of  interaction  among  them. 

Depending  on  the  characteristics  of  the  particu¬ 
lar  theory,  the  international  system  is  a  concept 
infused  with  some  ontological  premise  as  to  its 
purpose,  function,  or  design.  This  aspect  was 
underdeveloped  in  much  realist  theorizing  during 
the  post- 1945  period  but  has  received  more  atten¬ 
tion  in  recent  years.  The  same  theoretical  neglect 
long  applied  to  the  system/environment  distinction 
and  to  the  means  of  upholding  that  difference. 

Origins  and  Historical  Evolution 

The  differentiation  between  the  international 
system  and  its  environment,  the  latter  typically 
involving  peripheral/nonrecognized  units  and 
asymmetric/low-density  interaction,  is  critical  to 
any  account  of  the  origins  of  that  system.  Some 
have  argued  that  a  trade  relationship  constitutes  a 
prerequisite  for  other  types  of  ties  between  political 
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entities  (units).  Others  believe  that  political  or  stra¬ 
tegic  ambitions  may  be  just  as  common  as  the  first 
step  toward  creating  a  significant  level  of  interac¬ 
tion  and  therefore  toward  a  new  unit  joining  the 
international  system.  Once  established,  many 
agree,  the  system  is  self-reproducing  in  that  units 
sustain  (boundaries  between)  themselves  and  fend 
off  systemic  intrusion. 

Several  important  contributions  regarding  the 
origins  and  historical  evolution  of  the  interna¬ 
tional  system  were  provided  by  scholars  associated 
with  the  so-called  English  School  of  international 
relations.  Informed  by  diplomatic  history  accounts 
of  political  relations,  these  scholars  described  how 
the  international  system  arose  in  Renaissance  Italy 
and  then  progressively  expanded  via  Western 
Europe  to  North  America,  Turkey,  and  other 
Middle  East  countries  and  later  to  Asia,  Australia, 
Africa,  and  South  America. 

Key  protagonists  of  the  English  School  never¬ 
theless  felt  that  the  structural  conception  of  the 
international  system  strongly  downplays  the  social 
dimension  of  interaction  and  launched  a  relational 
notion  called  international  society.  They  insisted 
that  the  latter  reflects  a  more  demanding  level  of 
political  and  diplomatic  exchange  than  a  “mere 
system,  ”  in  that  states  perceive  that  they  are  bound 
by  a  common  set  of  rules  and  institutions. 

Most  of  today’s  scholars  do  not  heed  this  dis¬ 
tinction  but  appear  to  have  subsumed  part  of  the 
arguments  forwarded  by  the  English  School  into 
more  nuanced  understandings  of  the  international 
system.  Indeed,  even  structural  realists  who  stress 
material  factors  and  interests  of  powerful  actors  as 
explanatory  variables  operate  with  some  notion  of 
socialization  and  cognitive  adaptation  to  the 
actions  and  preferences  of  other  actors. 

Another  increasingly  influential  body  of  work 
highlights  the  constitutive  role  of  legal  precepts, 
interstate  accords,  international  organizations,  and 
jurisprudence  in  fashioning  the  institutional  frame¬ 
work  within  which  governments  and  other  actors 
pursue  interests  and  promote  values.  An  older 
view  of  international  law  as  little  more  than  posi¬ 
tive  morality,  akin  to  the  dress  code  of  a  private 
club  (as  one  philosopher  put  it),  is  thus  giving  way 
for  a  more  comprehensive  understanding  that 
acknowledges  the  expanding  body  of  private  and 
public  law  in  the  international  realm.  It  is  further 
accepted  that  the  United  Nations,  for  all  its  flaws 


and  limitations,  has  played  a  constructive  and  at 
times  crucial  role  in  de-escalating  conflicts  between 
great  powers. 

Theoretical  Understandings  of  Change, 
Transformation,  and  Breakdown 

One  of  the  chief  criticisms  of  structural  conceptions 
of  the  international  system  concerns  its  weak  expla¬ 
nation  of  change  or  major  transformation. 
According  to  the  most  influential  account,  the  inter¬ 
national  system  is  anarchic,  lacking  an  institution  of 
central  authority,  and  can  only  shift  to  a  hierarchi¬ 
cal  system  if  functional  differentiation  among 
political  entities  is  greatly  enhanced  and  the  distri¬ 
bution  of  capabilities  (powers)  becomes  more  even. 

That  prospect,  though,  is  theoretically  implausi¬ 
ble.  Instead,  the  law  of  inertia  perpetuates  the  exist¬ 
ing  international  system,  with  minor  fluctuations  on 
the  margins.  New  members  are  socialized  into  the 
system  through  learning  and  mimicking  the  prac¬ 
tices  of  more  successful  peer  entities.  Even  though 
the  present  international  system  is  characterized  by 
an  unprecedented  degree  of  interaction,  trade,  and 
communication,  proponents  of  a  structural  concep¬ 
tion  of  the  former  maintain  that  most  signs  are  of 
continuity  and  incremental  change,  the  latter 
induced  by  states  or  possibly  by  transnational  advo¬ 
cacy  networks  and  international  organizations. 

Advocates  of  relational  concepts  of  international 
system,  in  contrast,  are  more  open  to  the  possibility 
of  large-scale  change  induced  by  interacting  units. 
Transformative  processes,  some  say,  have  occurred 
in  connection  with  the  two  World  Wars  of  the  20th 
century  and  can  ensue  whenever  major  powers 
fundamentally  alter  the  way  in  which  they  deal 
with  each  other.  More  radical  perspectives  project 
the  eclipse  of  U.S.  preeminence  within  the  next 
20  to  25  years  and  believe  that  a  new  constellation 
of  powers  led  by  China  will  become  predominant 
by  that  time.  In  fact,  some  doubt  that  21st-century 
citizens  will  remain  loyal  to  a  single  state  or  nation 
and  predict  the  onset  of  an  era  of  cosmopolitanism 
or  “new  medievalism.” 

A  new  medievalism  with  overlapping  authorities 
and  multiple  loyalties  would  not  necessarily  be  tur¬ 
bulent  or  violent,  but  the  latter  notion  gently 
approaches  the  most  dramatic  type  of  scenario, 
that  of  a  breakdown  of  the  international  system. 
Two  alternative  tragic  visions  of  the  future  seem  to 
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have  gained  currency  in  recent  years,  and  inform 
the  security  and  defense  policies  of  many  individual 
states  (especially  in  the  West).  One  is  that  of  an  epic 
confrontation  along  religious,  cultural,  and/or  civi- 
lizational  lines,  pitting  the  Euro-American  transat¬ 
lantic  community  of  countries  against  challengers 
predominantly  in  Asia  and  the  Middle  East. 

The  second  tragic  vision  is  that  of  ecological 
disaster  on  a  scale  not  previously  experienced. 
While  this  is  a  devastating  prospect,  the  difference 
lies  in  the  nonantagonistic  character  of  the  latter 
threat.  A  breakdown  of  the  international  system 
prompted  by  environmental  damage  could  con¬ 
ceivably  therefore  help  usher  in  a  new  era  of  global 
cooperation  over  and  beyond  previous  political 
boundaries,  jurisdictions,  and  mandates. 

Inherent  Potential  and 
Limitations  of  the  Concept 

The  international  system  is  one  of  several  theoreti¬ 
cal  notions  that  paved  the  way  for  studies  with  a 
global  reach  by  playing  down  the  importance  of 
political  borders  and  helping  wind  down  the  meth¬ 
odological  nationalism  characteristic  of  older 
social  science.  The  structural  conception  of  the 
international  system,  though  subject  to  critique  in 
this  entry,  was  clearly  instrumental  in  widening 
this  research  agenda  to  what  used  to  be  viewed  as 
peripheral  regions  and  issues.  Relational  conceptu¬ 
alizations,  meanwhile,  are  less  likely  to  reify  struc¬ 
ture  or  overemphasize  the  system  level  at  the 
expense  of  explanations  that  account  for  unit-level 
actors  and  institutions. 

The  “international”  element  in  the  concept  of 
international  system,  however,  cannot  reasonably 
transcend  the  state-oriented  focus  associated  with 
either  version  of  the  argument.  And  just  as  states 
and  governments  are  privileged  as  unit-level  actors, 
both  the  structural  and  the  relational  conceptions 
typically  rely  on  an  implicit  notion  of  the  primacy 
of  politics.  In  that  sense,  the  international  system  is 
a  concept  that  may  not  remain  analytically  rele¬ 
vant  in  the  2 1st  century  to  the  same  degree  as  in 
the  20th,  as  the  realities  it  was  created  to  depict 
have  altered  substantively. 

Kjell  Engelbrekt 
Swedish  National  Defense  College 
Stockholm,  Sweden 
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International  Trade 


International  trade  is  the  exchange  of  goods  and 
services  across  borders.  However,  the  boundaries 
between  trade  and  investment  have  become 
increasingly  blurred.  Many  services  can  only  be 
delivered  across  borders  through  a  foreign  com¬ 
mercial  presence,  implying  foreign  direct  invest¬ 
ment.  An  increasing  share  of  world  trade  also 
takes  place  within  multinational  firms  or  through 
international  production  networks  in  which  mul¬ 
tinationals  play  a  central  role. 

The  political  analysis  of  international  trade  has 
centered  on  two  closely  related  issues:  (1)  the 
determinants  of  free  trade  and  protection  at  the 
national  level  and  (2)  the  determinants  of  interna¬ 
tional  cooperation  on  trade  issues,  including 
through  multilateral  organizations.  The  first  sec¬ 
tion  of  this  entry  provides  an  overview  of  models 
of  free  trade  and  protection  at  the  domestic  level, 
focusing  first  on  interest-based  approaches  and 
then  introducing  collective  action  problems  and 
the  role  of  institutions.  The  second  section  exam¬ 
ines  some  of  the  core  issues  in  the  international 
politics  of  trade,  including  the  central  features  of 
the  General  Agreement  on  Tariffs  and  Trade 
(GATT)/World  Trade  Organization  (WTO)  regime 
and  the  growth  of  regional  arrangements.  Brief 
mention  is  also  made  of  the  long-standing  litera¬ 
ture  linking  trade  to  security  considerations  and 
conflict.  The  conclusion  considers  future  research 
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directions  that  are  emerging  from  new  develop¬ 
ments  in  trade  theory,  including  the  role  of  multi¬ 
national  firms,  the  enduring  role  of  geography, 
and  the  political  economy  of  agglomeration. 

Models  of  Free  Trade  and  Protection 

In  their  basic  assumptions,  political  economy  mod¬ 
els  of  free  trade  and  protection  resemble  other 
models  of  public  policy  and  regulation.  Voters  and 
interest  groups  constitute  the  “demand  side”  of  the 
political  market  with  their  preferences  driven  by 
the  effects  of  trade  and  trade  policy  on  income  and 
assets.  The  extent  of  interest  group  organization 
and  the  willingness  to  expend  resources  to  secure 
protection  (so-called  rent-seeking  behavior)  are 
key  variables  in  many  political  economy  models. 

Politicians,  in  turn,  supply  trade  policy  in  return 
for  votes  and  campaign  contributions  (both  legal 
and  illegal).  Bureaucratic  agents  can  also  play  an 
independent  role  in  trade  policy  where  they  exer¬ 
cise  discretion;  an  example  would  be  the  bribery  of 
a  customs  agent.  The  behavior  of  both  politicians 
and  bureaucrats  will  also  be  affected  by  the  design 
of  electoral  and  other  political  institutions. 

The  Role  of  Interests 

The  early  literature  on  the  political  economy  of 
trade  focused  on  the  structure  of  protection  at  the 
national  level,  that  is,  the  levels  of  tariff  and  non¬ 
tariff  barriers  across  industries.  Virtually  all  of  this 
work  was  on  advanced  industrial  states,  and  the 
United  States  in  particular.  Early  empirical  models 
explored  a  variety  of  factors  that  might  be  conse¬ 
quential,  including  the  growth  of  output  and 
employment  and  exposure  to  import  competition 
at  the  industry  level.  However,  the  logical  under¬ 
pinning  for  this  analysis  was  to  use  trade  theory  to 
identify  the  characteristics  of  industries  (and  ulti¬ 
mately  investors  and  workers)  that  would  make 
them  more  or  less  likely  to  support  free  trade. 

Classic  trade  theory  offered  up  two,  quite  differ¬ 
ent  approaches  in  this  regard.  In  the  Heckscher- 
Ohlin  setup,  factors  of  production — capital  and 
labor  in  the  simplest  two-factor  variant — move 
costlessly  across  industries  and  returns  to  them 
therefore  tend  to  equalize  across  the  economy. 
Going  back  to  David  Ricardo’s  classic  formulation 
of  comparative  advantage,  countries  (and  regions 


within  them)  will  export  goods  that  intensively  use 
abundant  factors  and  will  import  goods  intensive 
in  the  scarce  factor.  Trade  thus  benefits  owners  of 
abundant  factors  and  harms  the  scarce  factor.  In 
capital-abundant  economies,  capital  should  there¬ 
fore  favor  free  trade  while  labor  should  oppose  it; 
in  labor-abundant  economies,  the  opposite  would 
be  the  case. 

In  a  sweeping  application  of  the  Hecksher- 
Ohlin  approach,  Ronald  Rogowski’s  Commerce 
and  Coalitions  (1990)  attempted  to  show  how 
changes  in  the  costs  of  trade  in  the  late  19th  cen¬ 
tury  influenced  not  only  the  politics  of  trade  but 
wider  social  cleavages  in  a  number  of  countries.  In 
recent  years,  studies  of  protection  at  the  industry 
level  using  a  Heckscher-Ohlin  motivation  have 
been  supplemented  by  survey  research  that  trace 
preferences  down  to  the  individual  level.  For 
example,  Kenneth  Scheve  and  Matthew  Slaughter 
found  that  the  level  of  education,  taken  as  a  proxy 
for  skills,  was  positively  correlated  with  individual 
preferences  for  free  trade. 

The  Hecksher-Ohlin  model  predicts  that  the 
interests  of  factors  will  align;  in  a  two-factor  setup, 
capital  and  labor  should  have  opposing  trade  pol¬ 
icy  interests.  This  model  could  not  explain  the 
anomaly  that  capital  and  labor  frequently  seemed 
allied  with  one  another  in  opposing  free  trade  in 
the  advanced  industrial  states,  for  example,  in 
declining  sectors  such  as  steel  and  textiles.  In  the 
specific  factors  or  Ricardo-Viner  model,  factors 
are  not  mobile  across  alternative  uses.  Once  invest¬ 
ments  are  made  in  industry-specific  physical  and 
human  capital,  those  investments  are  not  easily 
redeployed.  As  a  result,  trade  policy  coalitions 
form  not  along  factoral  lines  but  by  sector  depend¬ 
ing  on  the  level  of  exposure  to  international  trade. 
Export-oriented  and  import-competing  industries 
constitute  the  dominant  cleavage. 

The  interest  group  foundations  underlying  both 
the  mobile  and  fixed  factor  models  had  an  impor¬ 
tant  limitation  in  not  capturing  the  constraints  on 
politicians  in  acceding  to  rent-seeking  demands. 
Trade  theory  is  useful  in  outlining  the  distributive 
consequences  of  policy,  but  it  also  underlines  the 
distortions  and  welfare  losses  associated  with  pro¬ 
tectionism.  Indeed,  if  free  trade  is  welfare  enhanc¬ 
ing,  politicians  responding  to  the  median  voter 
should  consistently  support  free  trade.  In  an  influ¬ 
ential  article,  Helen  Milner  and  Keiko  Kubota 
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show  that  democratic  developing  countries  were 
more  likely  to  liberalize  trade  beginning  in  the 
1980s  than  their  authoritarian  counterparts. 

In  a  highly  influential  synthesis,  Gene  Grossman 
and  Elhanan  Helpman  model  politicians  as  seeking 
to  maximize  their  chances  of  election  by  balancing 
two  considerations:  (1)  the  political  contributions 
coming  from  factor-specific  lobbies  and  (2)  the 
aggregate  social  welfare  gains  and  losses  associated 
with  any  trade  policy  choices,  which  should  affect 
the  support  of  voters  at  the  margin.  A  core  element 
of  the  Grossman-Helpman  model  is  the  extent  of 
industry  organization;  protection  increases  with 
the  extent  to  which  sectors  are  organized. 

Collective-Action  Problems 

A  central  problem  with  both  the  Heckscher- 
Ohlin  and  Ricardo-Viner  approaches  is  that  they 
are  ultimately  theories  of  trade  policy  preferences, 
not  effective  influence  or  outcomes.  For  prefer¬ 
ences  to  be  translated  into  policy,  groups  must  be 
organized  and  politicians  must  respond  to  them. 

Several  early  empirical  studies  found  that  orga¬ 
nizational  factors,  proxied  by  measures  of  industry 
concentration,  had  a  more  powerful  effect  on  trade 
policy  than  measures  that  capture  preferences,  such 
as  the  factor  intensity  of  output.  High  levels  of 
industry  or  geographic  concentration  may  have 
influence  by  lowering  the  transaction  costs  of  form¬ 
ing  effective  organizations  and  mobilizing  support. 

But  high  levels  of  industry  and  spatial  concen¬ 
tration  may  reflect  other  factors,  such  as  the  speci¬ 
ficity  of  assets,  and  can  even  dilute  the  electoral 
impact  of  an  industry.  An  industry  that  is  concen¬ 
trated  in  a  particular  district  commands  the  full 
attention  of  its  political  representatives.  However, 
if  the  industry  is  spatially  concentrated  in  a  small 
number  of  districts,  it  is  unlikely  to  command 
wider  legislative  support.  Electorally  dispersed 
industries  are  typically  assumed  to  have  more 
political  clout  since  more  legislators  will  care 
about  the  fate  of  such  industries. 

A  more  direct  way  to  get  at  industry  influence  is 
to  move  beyond  economic  proxies  such  as  concen¬ 
tration  to  more  direct  measures  of  organization 
and  influence.  Beginning  in  the  2000s,  several 
studies  exploited  data  on  campaign  contributions 
and  found  them  to  be  positively  correlated  with 
protectionist  outcomes  at  the  sectoral  level. 


These  studies  still  begged  the  question  of  why 
some  groups  are  better  organized  than  others. 
Moreover,  there  is  a  strong  case  that  interest  groups 
are  endogenous.  Anne  Krueger’s  famous  model  of 
rent  seeking  argued  not  that  rents  were  the  result  of 
lobbying  but  rather  that  rent  seeking  arose  as  a 
result  of  various  forms  of  government  intervention 
(and  could  presumably  be  reduced  if  governments 
credibly  swore  off  such  interventions). 

To  date,  there  has  been  surprisingly  little  work 
on  the  determinants  of  interest  group  organization 
around  trade.  It  is  commonly  believed  that  protec¬ 
tionist  interests  enjoy  advantages  in  solving  collec¬ 
tive-action  problems  because  of  the  concentrated 
nature  of  the  benefits  they  can  realize  when  com¬ 
pared  with  the  diffuse  gains  to  highly  dispersed 
consumers.  However,  importers  of  intermediate 
goods,  firms  engaged  in  intrafirm  trade,  and  con¬ 
centrated  retailers,  such  as  the  American  retail 
giant  Walmart,  do  not  have  these  organizational 
disabilities.  The  political  strength  of  these  “con¬ 
centrated  consumers”  is  undoubtedly  a  factor  in 
the  continuing  liberalization  of  world  trade. 

Introducing  Political  Institutions 

The  work  by  economists  on  the  determinants  of 
trade  policy  tended  to  be  institution  free,  in  part 
perhaps  because  these  studies  were  often  under¬ 
taken  within  the  context  of  a  given  country  in 
which  institutions  were  invariant.  However,  the 
Milner  and  Kubota  findings  on  regime  type  suggest 
that  institutions  matter;  more  competitive  political 
systems  are  more  responsive  to  the  median  voter 
than  to  special  interests  and  thus  have  freer  trade. 
This  finding  is  consonant  with  the  Grossman- 
Helpman  model  but  underlines  that  institutions 
may  affect  the  trade-off  between  securing  campaign 
contributions  and  appealing  to  the  median  voter. 

One  line  of  research  on  institutions  and  trade 
policy  in  democracies  looks  at  the  effects  of  elec¬ 
toral  rules.  For  example,  proportional  representa¬ 
tion  (PR)  electoral  rules  are  accompanied  by  larger 
electoral  districts,  which  are  more  heterogeneous 
and  thus  less  prone  to  capture  by  special  interests. 
On  the  other  hand,  majoritarian  systems  have  fea¬ 
tures  that  might  induce  free  trade  as  well,  includ¬ 
ing  the  tendency  for  parties  to  converge  on  median 
voter  interests.  In  a  useful  synthesis,  Fiona 
McGillivray  notes  that  the  empirical  work  on  the 
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effects  of  electoral  rules  on  protection  has  proven 
ambiguous  and  that  their  effects  may  be  contin¬ 
gent  on  other  factors  such  as  the  geographic  con¬ 
centration  of  industries.  For  example,  a  highly 
concentrated  industry  in  a  swing  district  that  is 
crucial  for  winning  the  presidency  or  securing  a 
parliamentary  majority  will  have  a  clout  that  far 
exceeds  what  is  predicted  by  its  level  of  organiza¬ 
tion  alone. 

The  effect  of  institutions  on  trade  policy  is  by 
no  means  limited  to  their  effect  on  legislative 
votes.  Delegation  also  plays  a  crucial  role  in  trade 
policy.  In  both  the  United  States  and  the  European 
Union  (EU),  legislatures  have  delegated  substan¬ 
tial  authority  to  executives  (the  president  and  the 
European  Commission,  respectively)  to  negotiate 
trade  agreements.  The  means  through  which  this 
delegation  is  structured  can  strongly  influence 
policy  outcomes.  The  literature  on  trade  policy  in 
the  United  States  places  particular  weight  on  the 
liberalizing  effect  of  delegation  to  the  president 
under  the  Reciprocal  Trade  Agreements  Act  of 
1934  and  subsequent  innovations,  such  as  fast- 
track  authority,  that  limit  the  ability  of  Congress 
to  amend  trade  agreements  once  negotiated  (leg¬ 
islators  are  constrained  to  vote  them  up  or 
down). 

Delegation  also  plays  a  role  in  the  growth  of 
administrative  channels  for  securing  protection. 
Prevailing  international  trade  law  permits  coun¬ 
tries  to  protect  industries  in  response  to  unfair 
trade  practices  (subsidies  and  dumping)  and  in  the 
face  of  fair  trade  practices  that  threaten  significant 
disruption  to  the  industry  (the  so-called  escape 
clause).  Relief  under  these  provisions  is  typically 
not  legislated  on  a  case-by-case  basis  but  through 
quasi-judicial  processes  that  can  be  either  more  or 
less  favorable  to  industries  seeking  protection.  For 
example,  these  processes  might  allow  standing  to 
adversely  affected  interests,  thus  balancing  protec¬ 
tionist  claims.  The  ability  to  make  filings  may  be 
more  or  less  difficult,  both  substantively  and 
administratively.  A  growing  body  of  empirical  lit¬ 
erature,  particularly  on  antidumping/countervailing 
duty  cases,  shows  that  administrative  bodies  are 
responsive  to  political  pressures  emanating  from 
the  legislature  and  interest  groups.  The  details  of 
administrative  procedure  have  become  a  central 
issue  in  disciplining  the  growth  of  administrative 
protection. 


The  International  Politics  of  Trade 

The  Bretton  Woods  Conference  (1944),  which  led 
to  the  creation  of  the  World  Bank  and  the 
International  Monetary  Fund,  also  introduced  the 
idea  of  creating  the  International  Trade  Organiza¬ 
tion  (ITO).  As  governments  engaged  in  the  complex 
negotiations  over  the  ITO,  15  states  negotiated  the 
General  Agreement  on  Tariffs  and  Trade  (GATT) 
as  a  forum  to  reduce  tariffs  in  the  short  run.  Once 
the  ITO  failed  to  secure  ratification  from  the  United 
States,  only  the  GATT  agreement — a  very  much 
thinner  organization — was  left  in  place.  Not  until 
1995  was  the  GATT  replaced  by  the  World  Trade 
Organization  (WTO).  In  the  interim,  however, 
regional  trade  agreements — most  notably  the  EU — 
as  well  as  bilateral  free  trade  agreements  (FTAs) 
have  proliferated  rapidly,  making  them  a  central 
feature  of  the  international  politics  of  world  trade. 

The  GATT/WTO  and  the 
Logic  of  Multilateralism 

The  role  of  the  GATT  and  WTO  in  liberalizing 
trade  is  frequently  misunderstood.  As  an  organiza¬ 
tion,  the  GATT/WTO  has  very  little  authority  in 
structuring  the  negotiating  agenda  or  influencing 
outcomes.  Nor  does  the  organization  seek  to  nego¬ 
tiate  a  common  tariff  structure  across  countries 
(harmonization)  or  vote  on  liberalization  measures. 

Rather,  the  G ATT/WT O  is  primarily  a  forum  in 
which  countries  trade  “concessions”:  Countries 
offer  reductions  in  their  own  trade  barriers  in 
exchange  for  reductions  in  the  barriers  of  others. 
The  process  of  trading  concessions  has  an  impor¬ 
tant  political  effect  at  the  domestic  level.  For 
export-oriented  industries  to  secure  advantages 
abroad,  the  country  must  be  willing  to  make  cor¬ 
responding  or  reciprocal  concessions.  The  very 
fact  of  negotiating  a  trade  agreement  thus  changes 
the  politics  of  trade;  as  export  interests  are  politi¬ 
cally  mobilized  to  influence  the  course  of  trade 
talks,  they  constitute  a  counterweight  to  protec¬ 
tionist  forces. 

Although  nominally  a  multilateral  institution 
with  a  consensus  decision-making  structure,  the 
G ATT/WT ()  has  in  fact  operated  through  bilateral 
deals  among  the  major  trading  countries  or  among 
small  groups  of  countries.  This  procedure  domi¬ 
nates  in  part  not  only  because  of  the  difficulty  of 
reaching  a  consensus  among  a  growing  number  of 
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parties  but  also  because  power  in  the  organization 
is  effectively  exercised  through  the  ability  to  make 
concessions.  As  a  result,  the  demands  and  conces¬ 
sions  of  the  major  trading  powers  typically  drive  the 
agenda  and  ultimate  outcome  of  the  negotiations. 

A  distinctive  feature  of  the  negotiations,  how¬ 
ever,  is  the  concept  of  nondiscrimination,  under 
which  any  concessions  that  are  extended  to  any 
one  party  are  multilateralized  to  all  members  of 
the  organization  through  the  most  favored  nation 
(MFN)  principle.  The  nondiscrimination  norm  has 
been  a  significant  institutional  feature  of  the 
GATT/WTO  system  and  has  almost  certainly  con¬ 
tributed  to  the  liberalization  of  world  trade  in  the 
postwar  period. 

Nonetheless,  as  with  the  controversy  over  the 
governance  of  the  international  financial  institu¬ 
tions,  there  has  been  significant  debate  about  the 
extent  to  which  the  GATT/WTO  also  embodies 
biases  of  various  sorts.  On  the  one  hand,  the  major 
trading  partners  effectively  control  the  agenda  by 
structuring  the  issues  and  products  over  which 
negotiations  occur.  In  its  early  days,  the  organiza¬ 
tion  focused  overwhelmingly  on  trade  in  manufac¬ 
tures  that  were  of  interest  at  the  time  mainly  to  the 
developed  economies.  Products  of  primary  interest 
to  developing  countries,  including  agriculture, 
received  little  attention. 

On  the  other  hand,  the  power  of  the  advanced 
industrial  states  was  also  due  to  the  fact  that  many 
developing  countries  opted  out  of  the  organization 
altogether  or  argued  successfully  that  they  should 
enjoy  special  and  differential  status  within  it,  effec¬ 
tively  exempted  from  making  concessions.  As  the 
developing  countries  began  to  undertake  reforms 
and  as  their  industrial  capability  increased,  they 
entered  the  organization  in  large  numbers  and 
began  to  change  the  nature  of  the  negotiations. 

During  the  Uruguay  Round  (1986-1994)  and 
current  Doha  Round  (launched  in  2001),  develop¬ 
ing  countries  brought  new  issues  onto  the  agenda. 
The  advanced  industrial  states  sought  to  push 
negotiations  on  issues  in  which  they  enjoyed  com¬ 
parative  advantage,  including  trade  in  services, 
investment,  and  the  protection  of  intellectual  prop¬ 
erty.  Developing  countries,  by  contrast,  sought 
concessions  on  trade  in  labor-intensive  products 
(most  notably  textiles  and  apparel),  agriculture, 
and  a  strengthening  of  the  dispute  settlement 
mechanism  (DSM). 


The  reform  of  the  DSM  was  a  major  develop¬ 
ment  of  the  Uruguay  Round  negotiations  that  led 
to  the  creation  of  the  WTO.  Under  the  prior  DSM 
structure,  plaintiffs  faced  an  uphill  battle  in  secur¬ 
ing  relief  because  prevailing  rules  effectively  gave 
the  defendant  the  ability  to  veto  investigations. 
The  new  DSM  is  much  more  legalized,  with  a 
highly  structured  process  for  the  empowerment  of 
review  panels.  Despite  the  fact  that  the  ultimate 
enforcement  mechanism  is  retaliation,  and  large 
countries  should  therefore  continue  to  enjoy 
advantages  over  small  countries,  there  is  evidence 
that  larger  parties  tend  to  comply  with  rulings  that 
go  against  them.  Current  legal  research  on  the 
DSM  is  considering  the  extent  to  which  it  may 
become  the  foundation  for  judicial  lawmaking 
through  the  establishment  of  precedent,  rather 
than  a  process  that  is  limited  to  the  interpretation 
of  statute. 

The  Rise  of  Regionalism 

Regional  trade  arrangements  got  a  bad  name 
during  the  interwar  period,  as  Japan  and  Germany 
sought  to  carve  out  exclusive  trade  zones.  Britain 
and  France  also  strengthened  preferential  trading 
zones  with  their  colonies,  contributing  to  the  frag¬ 
mentation  of  the  international  trading  system.  The 
strong  commitment  to  the  nondiscrimination  norm 
in  the  GATT/WTO  system  was  motivated  in  large 
part  by  the  desire  to  avoid  such  outcomes  in  the 
future. 

However,  the  dramatic  economic  success  of  the 
European  Community  raised  the  question  of 
whether  regional  agreements  were  necessarily 
incompatible  with  liberalization  at  the  interna¬ 
tional  level.  The  negotiation  of  the  Single  European 
Act  in  1985  and  subsequent  developments  follow¬ 
ing  the  signing  of  the  Maastricht  Treaty  in  1992 
set  Europe  on  the  course  of  deeper  integration 
through  completion  of  the  common  market  and 
ultimately  monetary  union.  The  United  States  also 
pursued  a  regional  option  through  the  U.S.- 
Canada  Free  Trade  Agreement  (1988)  and  its 
extension  to  Mexico  in  the  North  American  Free 
Trade  Agreement  (NAFTA,  1994).  Beginning  in 
the  1990s,  regional  arrangements  and  bilateral 
FTAs  began  to  proliferate  rapidly  and  became  a 
major  research  topic  in  the  political  economy  of 
trade. 
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On  the  one  hand,  a  number  of  arguments  sug¬ 
gested  that  regional  agreements  could  be  the  build¬ 
ing  blocks  of  multilateral  liberalization.  Strong 
regional  agreements  could  provide  the  template  for 
deepening  cooperation  through  the  GATT/ WTO 
and  could  even  be  used  to  leverage  multilateral 
agreements.  U.S.  commitment  to  the  NAFTA  had 
an  important  effect  on  European  participation  in 
the  Uruguay  Round.  A  distinctive  feature  of  U.S.- 
led  FTAs  has  been  the  inclusion  of  environmental 
and  labor  standards,  provisions  that  have  also 
been  mirrored  in  the  web  of  agreements  surround¬ 
ing  the  EU. 

On  the  other  hand,  there  is  increasing  concern 
that  the  multiplication  of  regional  agreements — 
now  numbering  more  than  400 — will  increasingly 
complicate  the  politics  of  international  trade. 
Almost  by  definition,  free  trade  agreements  are 
discriminatory;  even  if  they  do  not  raise  trade  bar¬ 
riers  to  outsiders,  they  nonetheless  extend  prefer¬ 
ences  that  give  insiders  advantage.  Once  created, 
regional  agreements  have  the  result  of  creating 
vested  interests  that  will  resist  expansion.  In  both 
the  Western  Hemisphere  and  particularly  in  East 
Asia,  there  have  been  growing  debates  about  how 
existing  arrangements — such  as  the  NAFTA  in 
North  America  and  the  Mercosur  (centered  on 
Brazil,  Argentina,  Uruguay,  and  Paraguay) — might 
be  reconciled. 

“Low”  and  “High”  Politics:  The  Strategic 
Dimensions  of  International  Trade 

The  politics  of  international  trade  is  by  no 
means  limited  to  economic  issues  narrowly  con¬ 
ceived.  Beginning  with  Albert  Hirschmann’s 
National  Power  and  the  Structure  of  Foreign 
Trade,  an  important  strand  of  research  underlined 
how  large  powers  could  use  asymmetric  trade  rela¬ 
tions  as  an  instrument  of  political  influence.  Such 
considerations  played  a  role  in  the  emergence  of 
dependency  theory  in  the  1960s,  which  not  only 
questioned  the  economic  benefits  of  international 
trade  but  also  noted  how  trade  and  investment  ties 
might  lead  to  political  influence  in  developing 
countries  as  well. 

Strategic  considerations  were  by  no  means  lim¬ 
ited  to  North-South  trade  relations.  Another  impor¬ 
tant  strand  of  research  has  examined  the  extent  to 
which  foreign  trade  was  related  to  broader  political 


relations,  such  as  alliances.  Alliance  commitments 
were  of  obvious  importance  in  structuring  the  post¬ 
war  trading  system,  with  its  strong  initial  focus  on 
the  deepening  of  North  Atlantic  trade  in  conjunc¬ 
tion  with  American  commitments  to  Western 
Europe  and  the  self-imposed  exile  of  the  Soviet  bloc 
from  the  world  trading  system. 

A  final  strand  of  research  in  this  vein  centers  on 
the  long-standing  question  of  whether  trade  acts  as 
a  deterrent  to  military  conflict;  Etel  Solingen  has 
provided  the  most  compelling  recent  synthesis  of 
these  arguments.  Countries  dominated  by  export- 
oriented  coalitions  are  likely  to  favor  conciliation 
and  multilateral  cooperation.  Those  with  more 
closed  economies  are  likely  to  be  more  hostile  to 
international  cooperation,  in  part  because  of  the 
direct  threat  posed  to  vested  interests  in  the  closed 
economy.  Isolating  the  effects  of  trade  in  such 
models  is  difficult,  since  more  open  economies  also 
tend  to  be  more  democratic  and  democratic  poli¬ 
ties  appear  more  pacific,  at  least  toward  one 
another.  Nonetheless,  these  debates  have  strong 
policy  implications  as  analysts  consider  the  possi¬ 
ble  consequences  of  the  rise  of  new  powers  such  as 
China,  which  combine  increasing  economic  open¬ 
ness  with  persistent  elements  of  statism  and 
authoritarian  rule. 

New  Directions 

The  political  economy  of  trade  constitutes  a  very 
dynamic  area  of  research,  yet  significant  gaps  are 
emerging  between  developments  in  the  economics 
of  trade  and  work  in  political  economy.  The 
Heckscher-Ohlin  and  Ricardo-Viner  models  of 
trade  had  appeal  because  of  their  apparent  ability 
to  explain  the  protectionism  associated  with  the 
rapid  growth  of  North-South  trade  in  manufac¬ 
tures  beginning  in  the  1960s.  Yet  even  with  the  rise 
of  the  China  and  other  middle-income  countries, 
the  majority  of  world  trade  continues  to  take  place 
among  the  advanced  industrial  states.  As  early  as 
the  1970s,  it  was  recognized  that  this  trade  could 
not  be  explained  by  standard  models  and  that 
consideration  had  to  be  given  to  the  effects  of 
increasing  returns,  product  differentiation,  and  the 
corresponding  emergence  of  intra-industry  trade 
patterns.  This  trade  appeared  to  be  less  vulnerable 
to  protection  than  the  interindustry  trade  charac¬ 
teristic  of  North-South  relations. 
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The  remarkable  growth  of  foreign  direct  invest¬ 
ment  since  1980  has  also  exercised  a  strong  influ¬ 
ence  on  the  political  economy  of  trade.  In  contrast 
to  earlier  expectations  that  North-South  trade 
would  necessarily  generate  protectionist  pressures, 
the  organization  of  international  production  net¬ 
works  controlled  by  lead  firms  from  the  advanced 
industrial  states  has  probably  had  important  coun¬ 
tervailing  effects. 

More  recently,  the  so-called  new  trade  theory 
has  focused  attention  on  the  geography  of  trade. 
Borders  still  create  much  larger  barriers  to  trade 
than  was  previously  appreciated;  intracountry  trade 
still  dominates  international  trade  by  a  large 
margin.  Despite  the  revolutions  in  transport  and 
communications,  distance  remains  a  serious  deter¬ 
rent  to  trade  as  well.  These  facts  suggest  that  there 
are  substantial  fixed  as  well  as  marginal  costs  to 
entering  international  trade  with  important  impli¬ 
cations  for  the  ability  of  developing  countries  to 
benefit  from  specialization.  The  new  trade  theory 
has  also  emphasized  the  way  the  benefits  of  trade, 
and  production  more  generally,  tend  to  cluster  or 
agglomerate  in  particular  locations  raising  impor¬ 
tant  questions  of  how  such  clustering  arises  in 
the  first  place.  These  new  issues — the  role  of  multi¬ 
national  corporations,  the  enduring  influence  of 
borders  and  distance,  and  the  role  of  agglomeration 
economies — constitute  important  frontiers  for  the 
political  economy  of  trade. 
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Intervention 


An  intervention  is  a  coercive  action  taken  by  one 
international  actor  to  affect  the  political  authority 
of  another.  This  entry  examines  the  concept  of 
intervention  and  then  discusses  the  balance  between 
the  protection  of  human  rights  and  respect  for 
state  sovereignty,  the  legal  basis  for  humanitarian 
intervention,  and  the  issues  related  to  the  strategies 
employed  in  humanitarian  intervention. 

The  notion  of  intervention  is  notoriously  elusive 
and  controversial.  In  what  remains  the  most  rele¬ 
vant  conceptual  work  on  the  issue,  James  Rosenau 
proposed  to  delimit  the  notion  in  two  ways.  First, 
an  intervention  is  a  coercive  action,  implying  a 
sharp  break  from  previous,  conventional,  behav¬ 
ior.  This  sudden  interruption  in  the  course  of 
established  relations  between  political  units  is 
finite  and  transitory.  It  has  a  beginning  (the  usual 
modes  of  behavior  are  abandoned)  and  an  end 
(these  modes  of  behavior  are  reestablished  or, 
because  they  persist,  they  become  conventional  in 
turn).  Military  intervention — either  directly  or 
indirectly,  such  as  the  support  of  subversive  or 
terrorist-armed  activities — is  an  especially  con¬ 
spicuous  and  significant  form  of  coercive  action, 
but  it  is  not  the  only  one.  There  are  other  types  of 
measures,  either  diplomatic  or  economic,  that  can 
constitute  a  forcible  interference  and  provoke  a 
sharp  break  in  preestablished  relations.  Second,  an 
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intervention  is  not  just  any  type  of  break:  It  is 
directed  at  affecting  (changing  or  preserving)  the 
political  authority  of  another  actor.  Political 
authority  notably  refers  to  the  actors  and  the  pro¬ 
cesses  that  enable  a  political  community  to  choose 
its  own  political,  economic,  social,  and  cultural 
system  as  well  as  its  foreign  policy.  The  target  is 
thus  deprived,  at  least  momentarily,  of  its  capacity 
for  self-determination. 

Interventions  are  therefore  different  from  the 
foreign  policies  that  imply  a  continuous  presence 
in  the  target  society.  For  example,  the  presence 
since  1945  of  U.S.  troops  in  Europe  and  in 
Northeast  Asia  is  not  considered  today  as  an  inter¬ 
vention,  a  sharp  break  with  long-standing  rela¬ 
tions.  On  the  contrary,  this  military  presence 
is  integrated  and  largely  taken  for  granted.  The 
continuous  exercise  of  regular  diplomatic  and  eco¬ 
nomic  influence  is  not  considered  as  an  interven¬ 
tion  either.  Similarly,  colonialism  and  imperialism 
imply  a  long-standing  presence  of  the  colonizer 
and  expand  well  beyond  the  notion  of  interven¬ 
tion.  Interventions  are  also  distinct  from  the 
numerous  external  actions  that  have  as  their  main 
goal  an  influence  that  is  not  focused  on  structures 
of  political  authority.  For  example,  measures 
favoring  investment  abroad,  an  international  pub¬ 
lic  relation  campaign  to  promote  the  candidacy  of 
a  city  to  host  the  Olympic  Games,  the  expulsion  of 
diplomats,  or  the  signing  of  an  alliance  all  alter 
previous  behavior.  But  the  main  goal  of  these 
actions  is  to  affect  the  capacity  of  other  interna¬ 
tional  actors,  not  directly  their  authority  structure 
(even  if,  in  the  long  run,  authority  structures  might 
be  somewhat  affected). 

Both  practically  and  theoretically,  the  notion  of 
intervention  is  one  of  the  most  disputed  in  interna¬ 
tional  relations  for  two  main  reasons.  First,  the 
phenomenon  touches  on  some  of  the  core  issues, 
notably  the  imperatives  of  order  and  justice  (sover¬ 
eignty  and  individual  rights),  of  a  variety  of  legal, 
normative,  political,  and  strategic/military  per¬ 
spectives.  As  a  consequence,  different  practitioners 
of  international  politics  (diplomats,  soldiers, 
lawyers,  human  rights  advocates,  etc.)  as  well  as 
several  academic  disciplines  are  simultaneously 
interested,  which  leads  not  only  to  a  proliferation 
of  studies  and  approaches  but  also  to  a  disconnect 
between  these  relatively  autonomous  debates. 
Second,  for  international  relations  practitioners 


and  analysts,  the  notion  of  intervention  is  at  the 
root  of  a  key  issue,  the  evolving  divide  between  the 
domestic,  especially  the  domestic  politics  that 
enable  each  person  to  work  out  his  or  her  own 
destiny,  and  the  international.  The  changing  reach 
of  international  relations  and  the  concerns  that 
these  changes  generate  are  at  the  core  of  the 
debates  on  intervention,  linked  to  states’  sover¬ 
eignty,  territorial  integrity,  and  political  indepen¬ 
dence.  Article  2(7)  of  the  Charter  of  the  United 
Nations  (UN)  provides,  “Nothing  contained  in  the 
present  Charter  shall  authorize  the  UN  to  inter¬ 
vene  in  matters  which  are  essentially  within  the 
domestic  jurisdiction  of  any  state.”  The  UN 
Charter  affirms  the  inviolability  of  state  boundar¬ 
ies,  but  the  content  of  the  “matters  which  are 
essentially  within  the  domestic  jurisdiction  of  any 
state”  has  been  continually  disputed.  What  are  the 
matters  and  concerns  that  are  within  the  reach  of 
the  society  of  states  and  what  are  the  matters  and 
concerns  (if  any)  that  are  off-limits,  that  is,  for  the 
political  communities  to  decide? 

Since  the  end  of  the  Cold  War,  with  the  rise  of 
human  rights  norms  in  international  affairs, 
numerous  crises  brought  back  to  the  fore  the  issue 
of  intervention,  notably  humanitarian  interven¬ 
tion.  The  general  context  of  a  globalizing  interna¬ 
tional  system  has  favored  porous  borders  and 
greater  intrusions  into  what  was  considered  as  the 
domestic  jurisdiction  of  states.  On  the  one  hand, 
these  trends  have  been  perceived  as  a  unique 
opportunity  to  prevent,  and  to  respond  to,  massive 
violations  of  human  rights.  On  the  other,  however, 
they  have  posed  a  growing  threat  to  the  right  of 
distinct  political  communities  to  work  out  their 
own  political  arrangements.  Humanitarian  mili¬ 
tary  interventions  have  been  at  the  forefront  of 
debates  about  when  military  force  should  be  used, 
and  they  have  influenced  local  conflict,  aid  organi¬ 
zations,  and  military  strategy.  The  tension  between 
the  emerging  norm  of  humanitarian  intervention 
to  protect  civilians  from  harm  and  the  continued 
respect  for  state  sovereignty  as  well  as  the  recogni¬ 
tion  of  the  danger  and  cost  of  the  enterprise  have 
been  at  the  center  of  controversial  decisions  and 
nondecisions  about  Somalia,  Bosnia  and  Herze¬ 
govina,  Kosovo,  Rwanda,  East  Timor,  Sudan 
(Darfur),  and  many  other  places.  In  earlier  cases, 
which  might  have  been  humanitarian  interven¬ 
tions,  the  states  justified  their  action  primarily  as 
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self-defense,  notably  the  intervention  of  India  in 
East  Pakistan  (1971),  of  Vietnam  in  Cambodia  to 
overthrow  Pol  Pot  (1978),  and  of  Tanzania  in 
Uganda  to  overthrow  Idi  Amin  (1979).  Since  the 
establishment  of  safe  havens  to  protect  the  Kurds 
in  Northern  Iraq  in  1991,  the  No-Fly  Zones  in 
Northern  and  Southern  Iraq,  and  North  Atlantic 
Treaty  Organization’s  (NATO’s)  Kosovo  opera¬ 
tion  in  1999,  the  use  of  force  to  prevent  humani¬ 
tarian  atrocities  has  been  more  frequent. 

While,  from  an  international  law  perspective, 
the  notion  that  there  is  a  right  to  intervene  to  avert 
an  overwhelming  humanitarian  catastrophe  re¬ 
mains  disputable,  there  is  a  stronger  legal  and 
political  basis  for  forcible  humanitarian  interven¬ 
tion  with  the  authorization  of  the  Security  Council 
under  Chapter  VII  or  VIII  of  the  UN  Charter. 
Formulated  in  2001,  by  the  International  Commis¬ 
sion  on  Intervention  and  State  Sovereignty  (ICISS) 
set  up  by  the  Canadian  government,  the  notion  of 
responsibility  to  protect  showed  that  international 
society  has  evolved  to  recognize  the  norm  of 
humanitarian  intervention  as  a  legitimate  exception 
to  the  principle  of  nonintervention.  The  ICISS  also 
listed  six  criteria,  inspired  by  just  war  principles, 
that  would  constitute  a  legitimate  (but  not  neces¬ 
sarily  legal)  humanitarian  intervention:  (1)  right 
cause,  (2)  right  intention,  (3)  right  authority,  (4)  last 
resort,  (5)  proportionate  means,  and  (6)  reasonable 
prospects.  The  responsibility  to  protect  has  been 
endorsed  by  the  2004  UN  Secretary-General’s 
High-Fevel  Panel  on  Threats,  Challenges,  and 
Changes.  Its  report,  Our  Common  Future ,  noted 
that  there  is  an  emerging  norm  of  collective  interna¬ 
tional  responsibility  to  protect, 

exercisable  by  the  Security  Council  authorizing 
military  intervention  as  a  last  resort,  in  the  event 
of  genocide  and  other  large-scale  killing,  ethnic 
cleansing  or  serious  violations  of  international 
humanitarian  law  which  sovereign  Governments 
have  proved  powerless  or  unwilling  to  prevent. 
(UN  Department  of  Public  Information,  2004) 

In  the  2005  World  Summit  Outcome,  the  Heads 
of  State  and  Government  made  a  significant  politi¬ 
cal  commitment  to  take  action,  through  the 
Security  Council,  in  case  of  genocide,  war  crimes, 
ethnic  cleansing,  and  crimes  against  humanity. 
However,  the  opposition  of  Russia  and  China  to 


action  against  Myanmar  and  Zimbabwe  indicate 
some  of  the  potential  limits  of  the  responsibility  to 
protect. 

Fegal  and  normative  justifications  are,  however, 
not  the  only  significant  aspects  of  humanitarian 
interventions.  Humanitarian  interventions  are  an 
act  of  strategy,  and  it  is  important  to  examine  their 
ends,  means,  operational  practice,  and  effective¬ 
ness.  As  humanitarian  claims  become  more  wide¬ 
spread  and,  to  an  extent,  more  legitimate,  they 
became  part  of  the  art  of  coercion.  The  emerging 
norm  of  humanitarian  intervention  affects  the  ways 
in  which  states  commit  themselves  to  intervention 
and  persuade  their  adversaries  and  allies;  it  affects 
the  manipulation  of  risk  by  local  and  international 
actors  as  well  as  the  idiom  of  military  action,  that 
is,  the  specific  ways  in  which  force  is  used  to  fulfill 
the  main  goal  of  these  interventions — saving  the 
lives  of  civilians  at  risk.  Humanitarian  motives  and 
the  responsibility  to  protect  can  be,  and  actually 
are,  strategically  manipulated.  Many  governments, 
particularly  in  Africa,  Asia,  and  Fatin  America,  see 
the  responsibility  to  protect  as  a  legitimation  of 
military  intervention  by  strong  states,  using  their 
technologically  advanced  weapons,  against  weak 
ones.  While  some  of  these  concerns  might  be  exag¬ 
gerated,  humanitarian  and  political  objectives  have 
been  often  blurred,  for  example,  when  the  goal  of 
UN-led  operations  in  Somalia  and  Bosnia  and 
Herzegovina  was  to  alleviate  human  suffering  as 
well  as  to  promote  a  political  resolution  to  the 
crisis.  There  is  a  thin  line  between  the  protection 
of  fundamental  human  rights  in  extreme  circum¬ 
stances  and  the  protection  or  promotion  of  civil 
and  political  rights.  Moreover,  by  raising  expec¬ 
tations  of  diplomatic  and  military  intervention 
to  protect,  the  emerging  norm  has  been  criticized 
for  unintentionally  fostering  rebellion  by  lower¬ 
ing  its  expected  cost  and  increasing  its  likelihood 
of  success.  Convinced,  rightly  or  wrongly,  that 
they  will  get  international  support  on  humanitar¬ 
ian  ground,  vulnerable  groups  rebelled,  trigger¬ 
ing  major  retaliations  and  reinforcing  the  human 
suffering.  The  prospect  of  humanitarian  inter¬ 
vention  and  the  intervention  itself  can  feed  wars. 

Moreover,  as  Taylor  Seybolt  pointed  out,  the 
strategy  employed  by  the  intervener  under  the 
appropriate  circumstances  is  a  central  determi¬ 
nant  of  success,  defined  as  saving  lives  by  prevent¬ 
ing  or  ending  violent  attacks  on  unarmed  civilians 
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and/or  assisting  the  delivery  of  aid.  His  analysis 
of  17  military  operations  in  six  conflict  areas 
that  were  the  defining  cases  of  the  1990s — 
Northern  Iraq  after  the  Gulf  War,  Somalia, 
Bosnia  and  Herzegovina,  Rwanda,  Kosovo,  and 
East  Timor — shows  that  the  majority  were  suc¬ 
cessful.  The  intervening  states  employed  four 
types  of  strategy:  (1)  helping  deliver  emergency 
aid,  (2)  protecting  aid  operations,  (3)  saving  the 
victims  of  violence,  and  (4)  defeating  the  perpe¬ 
trators  of  violence.  In  sum,  in  the  long  history  of 
legal  and  normative  debates  about  interventions, 
in  general,  and  humanitarian  interventions,  in 
particular,  principles  and  strategies  should  be 
connected. 
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Humanitarian  intervention  refers  to  coercive 
action,  involving  the  use  of  military  force,  in  the 
affairs  of  one  state  by  another  state,  group  of 
states,  or  international  organization  with  the 
express  purpose  of  addressing  massive  human 
rights  violations  or  widespread  human  suffering. 
Two  aspects  of  this  definition  are  worth  noting. 
First,  humanitarian  intervention  is  distinct  from 
the  broader  category  of  “humanitarian  action,” 
which  might  include  a  range  of  activities  (both 
military  and  nonmilitary)  designed  to  address 
humanitarian  emergencies.  Second,  the  explicit 
and  most  prominent  purpose  of  the  military 
action  must  be  humanitarian — even  if  additional 
objectives  are  achieved  through  the  use  of  force. 

There  are  alternative  definitions  of  humanitar¬ 
ian  intervention  that  attempt  to  narrow  the  con¬ 
cept  still  further.  Most  legal  conceptions,  for 
example,  insist  that  the  term  humanitarian  inter¬ 
vention  encompasses  only  those  coercive  actions 
that  do  not  involve  the  consent  or  invitation  of  the 
host  country.  However,  in  practice,  this  definition 
is  difficult  to  maintain.  In  many  cases  where  mili¬ 
tary  force  has  been  used  for  humanitarian  pur¬ 
poses,  consent  from  the  government  of  the  target 
state  has  been  either  coerced  (as  it  was  in  the  case 
of  Indonesian  consent  for  intervention  in  East 
Timor  in  1999)  or  ambiguous  (as  it  was  in  the  case 
of  Bosnia  during  the  Balkan  wars  of  the  1990s).  In 
other  instances,  such  as  Somalia  in  1992,  the 
breakdown  of  order  within  a  country  makes  it  dif¬ 
ficult  to  identify  authorities  who  would  provide 
consent  for  outside  action. 

Humanitarian  intervention  has  been  the  subject 
of  controversy,  for  both  scholars  and  policymakers, 
because  of  two  tensions  that  it  invokes.  The  first  is 
between  international  society’s  commitment  to  sov¬ 
ereignty,  territorial  integrity,  and  nonintervention 
and  its  commitment  to  upholding  and  promoting 
individual  human  rights.  Both  of  these  goals  are 
enshrined  in  the  United  Nations  (UN)  Charter.  The 
second  tension  is  between  the  objective  of  protect¬ 
ing  individuals  from  extreme  persecution  and  vio¬ 
lence  and  the  use  of  coercive  means  (which  can 
involve  the  death  of  soldiers  and  civilians)  to 
achieve  that  goal.  For  many  nongovernmental 
organizations  involved  in  the  delivery  of  aid  and 
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humanitarian  assistance,  the  very  term  humanitar¬ 
ian  intervention  is  a  contradiction  in  terms. 

Origins  and  Evolution  of  the  Practice 
of  Humanitarian  Intervention 

The  origins  of  humanitarian  intervention,  and  the 
tensions  that  it  raises,  date  back  to  at  least  the  17th 
century  and  to  the  writings  of  the  international 
jurist  Hugo  Grotius  on  the  legality  and  morality  of 
war.  Grotius  was  attempting  to  reconcile  his  belief 
that  war  could  be  justly  waged  by  one  sovereign  on 
behalf  of  peoples  facing  oppression  by  another 
with  his  wider  concern  to  curtail  the  right  of  sover¬ 
eigns  to  engage  in  conflict  over  religious  differ¬ 
ences.  The  latter  notion  became  more  deeply 
embedded  in  international  society  during  the  18th 
century,  largely  through  the  writings  of  positivist 
international  lawyers  such  as  Emmerich  de  Vattel 
and  Christian  Wolff,  who  championed  the  princi¬ 
ple  of  nonintervention  as  an  essential  component  of 
state  sovereignty.  The  objective  of  these  jurists  was 
to  limit  the  causes  of  war  that  could  be  considered 
“just”  and  to  sanction  outside  intervention  only  in 
situations  of  civil  war  where  clear  lines  could  be 
drawn  between  a  ruler  and  his  or  her  people. 

In  the  19th  century,  humanitarian  intervention 
became  more  prominent  as  a  political  practice,  as 
European  states  intervened  in  the  internal  affairs 
of  other  countries  either  to  rescue  their  own  citi¬ 
zens  from  harm  or  to  protect  religious  and  national 
minorities  who  were  subject  to  persecution. 
Enshrining  this  practice  as  a  right  in  international 
law,  however,  continued  to  be  strongly  opposed. 
By  the  time  of  the  UN  Charter,  signed  in  the  after- 
math  of  World  War  II,  a  destructive  global  con¬ 
flict,  the  overwhelming  objective  of  state  leaders 
was  to  outlaw  aggression  and  to  circumscribe  the 
grounds  on  which  armed  force  could  be  used.  As  a 
result,  the  Charter  itself  remains  silent  on  the  ques¬ 
tion  of  whether  humanitarian  intervention  is  a 
legitimate  practice.  Moreover,  for  most  of  the 
Cold  War  period,  the  UN  Security  Council  proved 
unwilling  to  consider  or  discuss  humanitarian  cri¬ 
ses  as  part  of  its  remit  for  managing  peace  and 
security,  and  individual  states  were  reluctant  to 
accept  humanitarian  rationale  as  legitimate  justifi¬ 
cations  for  the  use  of  force. 

A  series  of  developments  during  the  latter  part 
of  the  20th  century  created  a  more  permissive 


context  for  intervention  by  outside  actors  in  the 
face  of  grave  humanitarian  crises.  These  include, 
inter  alia,  the  rise  of  international  human  rights 
instruments,  the  increased  vulnerability  of  civilians 
in  the  context  of  civil  conflict,  the  global  and 
instantaneous  access  to  information  that  height¬ 
ened  popular  awareness  of  human  suffering,  and 
the  greater  willingness  of  the  Security  Council  to 
define  instances  where  mass  atrocities  are  occur¬ 
ring  as  threats  to  international  peace  and  security 
(as  it  did  in  Northern  Iraq,  Haiti,  Somalia,  and  East 
Timor  during  the  1990s).  In  addition,  the  delayed 
and  inadequate  response  of  the  international  com¬ 
munity  to  the  genocide  in  Rwanda  in  1994  led 
many  to  argue  that  intervention  for  humanitarian 
purposes  could  be  justified  as  part  of  the  require¬ 
ments  of  a  more  just  international  order. 

Opposition  to  Humanitarian  Intervention 

Despite  this  more  permissive  context,  the  assertion 
that  there  is  a  new  right  of  humanitarian  interven¬ 
tion  has  been  opposed  on  both  legal  and  moral 
grounds.  The  legal  skeptics  start  with  the  basic 
presumption  that  the  use  of  force  is  illegal  as  indi¬ 
cated  in  Article  2(4)  of  the  UN  Charter.  The  only 
legitimate  exceptions  to  this  rule  are  military 
actions  taken  in  the  name  of  either  self-defense 
(Article  51)  or  collective  security  (in  which  case  the 
Security  Council  authorizes  the  use  of  force  explic¬ 
itly  through  a  resolution  adopted  under  Chapter 
VII).  Those  who  favor  a  new  right  of  humanitarian 
intervention  make  two  kinds  of  claims.  First,  they 
suggest  that  exceptions  to  the  general  prohibition 
on  the  use  of  force  in  the  Charter  should  be  wid¬ 
ened  to  accommodate  other  important  principles 
of  the  UN,  such  as  the  promotion  and  protection 
of  human  rights.  Second,  they  assert  that  there  is 
emerging  customary  law  on  humanitarian  inter¬ 
vention  that  runs  parallel  to  the  Charter,  based  on 
a  series  of  interventions  carried  out  by  Western 
governments  during  the  1990s  and  early  years  of 
the  21st  century. 

In  reply,  the  skeptics  have  argued  that  neither 
treaty  nor  customary  law  supports  a  change  in  pri¬ 
orities  between  the  different  goals  of  the  UN;  while 
there  have  been  significant  developments  in  the 
measures  designed  to  protect  human  rights  over  the 
past  50  years,  these  have  not  influenced  the  inter¬ 
pretation  of  Article  2(4)  or  the  broader  commitment 
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in  international  society  to  limiting  the  legitimate 
justifications  for  war.  They  also  question  whether 
the  new  customary  law  is  developing,  since  much  of 
non-Western  legal  opinion  does  not  support  the 
view  that  humanitarian  intervention  is  becoming  a 
more  acceptable  practice  in  international  society. 
This  opposition  derives  from  a  suspicion  that 
humanitarian  intervention  would  not  operate  in  the 
way  prescribed  by  its  advocates  but  instead  would 
legitimate  self-interested  intervention  by  the  power¬ 
ful  under  the  guise  of  humanitarianism. 

The  ethical  objections  to  humanitarian  interven¬ 
tion  fall  broadly  into  two  types.  The  first  argument, 
best  articulated  by  the  political  philosopher  Michael 
Walzer,  claims  that  sovereign  states  provide  the 
protective  shell  for  the  process  of  self-determination 
and  the  exercise  of  human  freedom.  In  fulfilling 
these  functions,  they  become  moral  entities  that 
should  enjoy  the  presumptive  right  of  noninterven¬ 
tion.  While  this  norm  of  nonintervention  is  intended 
to  apply  to  all  communities  equally,  in  practice  it 
has  had  particular  purchase  for  developing  coun¬ 
tries  and  former  colonies.  By  emphasizing  the  con¬ 
nection  of  nonintervention  with  self-determination, 
such  countries  have  sought  to  protect  themselves 
from  stronger  powers  seeking  to  further  their  own 
interests  through  intervention. 

The  second  set  of  ethical  objections  to  interven¬ 
tionism  is  consequentialist  and  emphasizes  the 
negative  outcomes  (both  anticipated  and  unantici¬ 
pated)  that  can  result  from  the  use  of  military 
force.  There  are  two  main  consequentialist  argu¬ 
ments  at  work.  First,  those  from  the  so-called  real¬ 
ist  school  of  international  relations  contend  that 
intervention  in  the  name  of  humanitarianism  is 
likely  to  create  more  problems  than  it  solves  and 
could  therefore  compromise  the  more  important 
goal  of  furthering  the  national  interest.  Opposition 
may  be  created  on  the  ground  in  the  course  of 
engaging  in  military  action,  or  expectations  may 
be  inflated  among  those  suffering  from  oppression 
elsewhere,  who  will  quickly  accuse  the  interna¬ 
tional  community  of  bias  if  there  is  no  intervention 
to  support  their  cause.  Furthermore,  it  is  impossi¬ 
ble  to  know  beforehand  if  intervention  will 
succeed  in  meeting  its  humanitarian  objectives  or 
whether  it  will  lead  to  an  acceptable  level  of  casu¬ 
alties.  This  has  raised  the  larger  question  of  how 
the  success  of  humanitarian  intervention  should  be 
determined:  by  the  achievement  of  short-term 


goals  (such  as  the  alleviation  of  hunger  or  suffering 
and  the  return  of  refugees  to  their  homes)  or  by 
addressing  the  underlying  causes  of  suffering 
through  the  reconstruction  of  stable  and  well¬ 
functioning  states. 

A  second  version  of  consequentialism  alleges  that 
any  use  of  force,  no  matter  how  well  intentioned, 
can  undermine  the  broader  goal  of  preserving  inter¬ 
national  peace  and  stability.  The  international  rela¬ 
tions  theorist  Robert  Jackson  has  argued  that  while 
states  have  a  responsibility  to  pursue  international 
justice  where  they  can,  they  should  not  jeopardize 
other  fundamental  values  in  the  process.  In  balanc¬ 
ing  these  considerations,  international  peace  should 
have  particular  weight,  since  it  is  in  situations  of 
war — particularly  war  between  great  powers — that 
humanitarian  values  are  most  likely  to  come  under 
attack.  The  second  position  is  a  belief  that  the  con¬ 
sensus  that  underpins  international  society  is  much 
more  procedural  than  substantive.  It  is  confined  to 
agreements  on  how  states  should  behave  outside 
their  borders  and  does  not  extend  to  a  common 
view  on  deeper  political  or  economic  values. 
According  to  this  view,  sovereign  states  are  unlikely 
to  converge  on  what  counts  as  injustice  or  oppres¬ 
sion  inside  a  state  and  hence  unlikely  to  agree  when 
interventions  to  address  humanitarian  crises  would 
be  justified. 

Humanitarian  Intervention  and 
the  Responsibility  to  Protect 

While  a  diversity  of  political  and  economic  values 
continues  to  characterize  international  society  in 
the  21st  century,  a  consensus  has  emerged  that  cer¬ 
tain  actions  against  individuals  and  groups — such 
as  the  commission  of  genocide — are  sufficiently 
grave  to  constitute  “crimes  against  humanity”  as  a 
whole.  In  these  extreme  cases,  those  raising  ethical 
objections  to  humanitarian  intervention  often  allow 
for  some  suspension  of  the  nonintervention  rule. 
So,  for  example,  Michael  Walzer  acknowledges 
that  when  the  rights  of  people  within  a  community 
are  seriously  threatened  such  that  they  can  no  lon¬ 
ger  be  said  to  be  truly  self-determining,  outside 
intervention  to  protect  basic  individual  rights  is 
morally  defensible. 

This  kind  of  reasoning  has  inspired  diplomatic 
efforts  to  enshrine  a  new  principle  in  international 
society,  known  as  the  responsibility  to  protect.  In 
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2005,  at  the  World  Summit  commemorating  the 
60th  anniversary  of  the  founding  of  the  UN,  heads 
of  state  unanimously  agreed  that  all  states  have  a 
responsibility  to  protect  their  populations  from 
genocide,  crimes  against  humanity,  war  crimes, 
and  ethnic  cleansing.  They  went  on  to  agree  that 
when  national  authorities  are  “manifestly  failing” 
to  protect  their  populations  from  such  atrocities, 
the  international  community — acting  through  the 
UN — can  take  action  (including,  if  necessary,  the 
use  of  force)  to  fulfill  the  responsibilities  of  protec¬ 
tion.  Although  the  concept  of  the  responsibility  to 
protect  is  closely  related  to  humanitarian  interven¬ 
tion,  it  differs  in  two  core  ways:  First,  the  activities 
that  protect  populations  are  not  restricted  to  the 
use  of  force  and,  second,  the  responsibility  encom¬ 
passes  the  need  not  just  to  respond  to  mass  atroci¬ 
ties  but  also  to  prevent  their  occurrence. 
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Interviewing 


Interviewing  is  a  commonly  used  research 
method  throughout  the  social  sciences. 
Approaches  vary,  with  some  types  of  interviews 
conducted  in  an  informal  and  open-ended  man¬ 
ner  and  other  types  of  interviews  designed  to  be 
much  more  strictly  structured  and  controlled. 
Regardless  of  the  approach,  interviews  by  defini¬ 
tion  always  involve  one  person  asking  questions 
of  another.  The  researchers  (or  staff  members 
working  for  the  researcher)  seek  information 
through  conversation  with  their  research  partici¬ 
pants.  Although  political  scientists  often  use 
interviews  to  collect  information  about  their  top¬ 
ics,  interviewing  methods  have  been  developed 
primarily  within  other  disciplines — including 
anthropology,  sociology,  and  journalism. 
Political  scientists  have  taken  these  methods  and 
adapted  them  to  their  own  purposes.  This  entry 
provides  an  overview  of  the  types  of  interviews 
most  commonly  used  in  political  science  and 
reviews  some  of  the  most  important  method¬ 
ological  issues  associated  with  interviews  as  data 
sources  for  political  scientists. 

In  the  early  20th  century,  the  study  of  political 
science  was  restricted  mainly  to  the  library,  with 
analysis  of  legal  rulings  and  of  the  structures  of 
government  that  the  primary  methods  used.  The 
shift  to  a  behavioral  approach  to  politics — looking 
at  what  actually  happens  in  government  rather 
than  simply  what  the  laws  say  should  happen — 
meant  that  political  scientists  would  need  to 
observe  and  collect  information  about  the  activi¬ 
ties  of  government  officials  and  the  interactions  of 
government  officials  and  the  public.  Interviewing 
was  one  of  the  first  ways  in  which  this  new  infor¬ 
mation  was  collected,  and  in  the  1920s,  political 
scientists  like  Pendleton  Herring  and  Elmer  Eric 
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Schattschneider  began  conducting  fieldwork  in 
Washington,  D.C.,  interviewing  government  offi¬ 
cials  and  representatives  from  groups  and  busi¬ 
nesses  to  be  able  to  better  describe  the  workings  of 
government  and  how  it  interacted  with  organized 
interests.  Today,  interviewing  remains  a  common 
data  collection  technique,  especially  for  scholars 
who  study  political  elites. 

Types  of  Interviews  There  are  several  styles  of 
interviewing,  each  appropriate  for  different 
informational  needs.  These  styles  exist  along  a 
continuum:  At  one  end  are  interviewing  tech¬ 
niques  that  use  extremely  directed,  closed-ended 
questions,  such  as  those  in  a  survey,  and  at  the 
other  are  interpretive  techniques  in  which  ques¬ 
tions  are  extremely  open-ended  and  subject  to 
change  depending  on  the  guidance  of  those  being 
interviewed. 

When  researchers  do  not  know  very  much 
about  the  topic  at  hand,  or  when  they  want  to 
capture  an  insider  perspective  on  the  topic,  they 
often  turn  to  an  ethnographic  style  of  interview¬ 
ing.  This  approach,  borrowed  from  anthropology, 
involves  long  interviews  in  which  the  interview 
subject  (sometimes  referred  to  as  the  interview 
consultant)  helps  direct  the  interviewer.  The  ques¬ 
tions  are  open-ended.  Rather  than  requiring  the 
respondent  to  select  an  answer  from  a  list  written 
ahead  of  time  and  requiring  the  respondent  to 
limit  the  answer  to  those  options,  an  open-ended 
question  may  be  answered  in  whatever  way  the 
respondent  chooses.  This  often  provides  the 
researcher  with  an  opportunity  to  learn  about 
previously  unknown  aspects  of  a  topic.  The  inter¬ 
view  consultant  is  treated  as  the  expert,  as  the 
guide,  in  this  process.  The  goal  of  the  interviewer 
is  to  guide  the  interview  subject  as  little  as  possi¬ 
ble,  so  as  to  elicit  the  unbiased  views  of  the  inter¬ 
view  subject.  Such  unstructured  interviewing 
techniques  are  often  combined  with  participant 
observation,  in  which  interviewers  spend  time  in, 
and  among  those  they  wish  to,  study.  Ethnographic 
interviews  such  as  these  often  provide  new  and 
unexpected  insights,  but  as  a  result  of  how  open 
they  are  to  new  directions,  they  have  a  tendency 
to  drift  away  from  the  researcher’s  original  topic. 
Thus,  they  are  not  the  best  way  to  measure  con¬ 
cepts  reliably  across  multiple  respondents  or 
across  multiple  field  settings,  as  each  interview 
will  be  unique.  This  unique  insight  is  both  the 


advantage  and  the  disadvantage  of  this  type  of 
interviewing. 

At  the  opposite  extreme  are  closed-ended  inter¬ 
viewing  techniques  in  which  the  questions  and  the 
possible  responses  have  been  predetermined  by  the 
interviewer.  Outside  of  political  science,  both  jour¬ 
nalists  and  courtroom  litigators  often  use  closed- 
ended  questions.  In  the  courtroom,  respondents 
are  often  limited  to  responding  “yes”  or  “no”  to 
the  questions  posed  to  them.  The  trial  lawyer  seeks 
to  control  what  information  is  put  before  the  jury 
by  keeping  a  tight  rein  on  the  questions  posed  to 
witnesses.  Trial  lawyers  do  this  because  they  are 
not  interviewing  to  collect  new  information — they 
are  supposed  to  have  learned  the  answers  already 
during  their  depositions  of  the  witnesses.  Likewise, 
political  scientists  use  closed-ended  questions  when 
they  already  know  a  great  deal  about  the  topic  at 
hand.  The  most  common  application  of  this  type 
of  interview  in  political  science  is  the  mass  opinion 
survey,  such  as  those  conducted  biennially  by  the 
American  National  Election  Studies.  While  some 
surveys  are  self-administered — that  is,  survey 
respondents  fill  out  the  questionnaires  themselves, 
either  on  paper  or  online — survey  interviews  are 
conducted  by  an  interviewer,  either  in  person  or  on 
the  phone.  To  write  closed-ended  questions,  the 
political  scientist  must  already  know  what  the 
important  issues  are  and  what  the  possible 
responses  could  be.  The  only  thing  that  remains  to 
be  learned  from  the  interview  is  how  many  people 
will  end  up  in  each  response  category.  This,  of 
course,  is  an  ideal  type,  and  in  practice,  surveys 
that  are  conducted  in  person  by  an  interviewer  also 
include  a  number  of  open-ended  questions.  In  gen¬ 
eral,  however,  control  over  possible  responses  is 
fairly  tight,  and  the  goal  is  to  make  the  responses 
as  easily  comparable  across  respondents  as  possi¬ 
ble.  Data  that  result  from  interviewing  using 
closed-ended  questions  are  much  more  likely  to  be 
reliably  compared  across  participants  and  across 
field  settings.  These  data  are  often  used  for  hypoth¬ 
esis  testing,  but  the  closed-ended  nature  of  the 
questions  means  that  answers  will  be  limited  to 
the  answers  the  interviewer  could  imagine  before 
the  study  began  and  is  not  likely  to  be  the  source 
of  new  ways  of  looking  at  political  phenomena. 
Whatever  errors  exist  in  the  researcher’s  theoreti¬ 
cal  approach  at  the  beginning  are  likely  to  remain 
after  the  study  is  completed. 
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When  political  scientists  today  think  about 
interviewing  as  used  in  political  science,  they  most 
likely  are  talking  about  a  form  of  interviewing  that 
strikes  a  balance  between  these  two  extremes. 
Semistructured  interviews  with  open-ended  ques¬ 
tions  are  designed  to  focus  the  interview  on  a  par¬ 
ticular  research  topic  and  to  ensure  that  particular 
questions  are  answered,  but  the  interviewee  is 
allowed  a  great  deal  of  latitude  in  how  he  or  she 
responds.  Interviews  with  members  of  a  parlia¬ 
ment  or  Congress,  or  staff  members  of  those  legis¬ 
lators,  usually  are  conducted  as  semistructured 
interviews.  Questions  are  aimed  at  getting  the 
respondent  to  talk  about  a  topic  fairly  naturally  in 
order  to  help  capture  the  specialized  knowledge 
that  the  respondent  has.  “Describe  to  me  what 
happened  during  the  debate  about  proposal  X”  is 
an  example  of  an  open-ended  question  that  might 
be  used.  What  makes  the  semistructured  interview 
different  from  the  ethnographic  interview,  how¬ 
ever,  is  how  the  interviewer  works  to  control  the 
interview  once  it  has  begun.  While  the  ethno¬ 
graphic  interviewer  might  allow  the  respondent  to 
decide  what  is  most  important  to  relate  about 
proposal  X,  the  researcher  conducting  the  semis¬ 
tructured  interview  will  have  a  predetermined  list 
(written  or  mental)  of  questions  called  “prompts.” 
This  type  of  semistructured  interviewing  is  also 
sometimes  referred  to  as  “elite”  interviewing 
because  it  is  most  often  used  to  collect  information 
from  elected  and  appointed  government  officials 
and  other  highly  educated,  highly  knowledgeable 
respondents  (who  might  chafe  at  being  forced  to 
pigeonhole  their  responses  into  the  preordained 
categories  of  a  mass  survey  instrument). 

Methodological  Concerns 

Like  all  data  collection  methods,  interviewing 
poses  potential  methodological  pitfalls  that  must 
be  avoided  if  the  data  are  to  be  valid  and  reliable. 
Interviewers  should  remember  that  the  order  in 
which  questions  are  asked,  as  well  as  the  way  in 
which  they  are  asked,  may  affect  the  content  of  the 
answers.  An  early  question  about  gun  control,  for 
example,  may  increase  respondents’  tendency  to 
name  “gun  control”  when  asked  later  in  an  inter¬ 
view  what  they  view  as  the  most  important  issues 
facing  the  nation.  Survey  research  has  shown  that 
small  changes  in  question  wording  sometimes 


create  large  changes  in  responses,  and  similar 
effects  may  result  from  changes  in  interview  ques¬ 
tions.  The  effects  of  question  order  and  question 
wording  are  arguably  not  as  severe  in  elite  inter¬ 
views  as  they  are  in  mass  surveys,  however,  since 
interview  respondents  answer  in  their  own  words 
and  can  be  prompted  for  more  information. 
Sticking  to  exact  wording  and  question  ordering 
does  not  always  work  in  an  elite  interview.  For 
instance,  the  interviewer  needs  to  be  willing  to 
drop  a  question  if  the  respondent  has  already  vol¬ 
unteered  that  information  in  response  to  an  earlier 
question.  The  interviewer  can  adapt  the  question, 
asking  whether  there  is  anything  else  besides  what 
the  respondent  has  already  mentioned,  but  the 
interviewer  needs  to  show  that  he  or  she  is  paying 
attention  and  heard  the  earlier  response.  Otherwise 
the  interviewer  risks  losing  rapport  with  the  inter¬ 
viewee,  and  answers  may  become  less  candid  and 
less  complete.  For  the  most  part,  exact  wording 
and  exact  question  order  are  less  important  than 
rapport  and  getting  the  respondent  talking  in  an 
elite,  semistructured  interview.  The  idea  is  that  the 
open-ended,  semistructured  nature  of  the  questions 
will  allow  a  more  “true”  and  complete  answer. 

As  is  the  case  with  surveys,  the  data  one  collects 
from  interviews  are  only  as  good  as  the  people 
being  interviewed.  People  are  fallible  and  may  give 
incorrect  or  incomplete  answers.  People  may  seek 
to  please  the  interviewer  rather  than  answer  hon¬ 
estly.  Finally,  even  in  the  case  of  interview  subjects 
who  seem  very  reliable,  the  relationship  between 
what  they  say  and  what  they  do  may  be  tenuous. 
People  are  not  always  self-aware  enough  to  be  able 
to  describe  their  behaviors  with  complete  accuracy 
or  to  give  true  insight  into  the  reasons  why  they  do 
what  they  do.  Nevertheless,  it  is  often  difficult  or 
impossible  to  learn  about  political  beliefs  and  atti¬ 
tudes  in  any  way  other  than  asking  about  them, 
and  political  elites  often  have  firsthand  knowledge 
of  events  that  cannot  be  discovered  in  any  other 
way.  Some  of  the  methodological  shortcomings  of 
interviews  can  be  lessened  by  making  sure  to  inter¬ 
view  multiple  sources  and  use  multiple  data  collec¬ 
tion  methods  (e.g.,  consulting  historical  documents 
or  using  experimental  methods)  whenever  possible. 

Despite  the  potential  drawbacks,  interviews 
provide  an  invaluable  method  of  collecting  infor¬ 
mation  about  a  wide  range  of  political  phenom¬ 
ena,  from  political  attitudes  to  political  history. 
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What  political  scientists  have  learned  through 
open-ended  and  semistructured  interviews  in  par¬ 
ticular  has  often  led  to  new  research  hypotheses 
that  then  are  tested  using  other  research  methods. 
Because  interview-based  research  is  labor-intensive 
and  time-consuming,  it  is  unlikely  to  become  the 
dominant  data  collection  method  in  political  sci¬ 
ence,  but  it  will  remain  an  important  one. 

Beth  L.  Leech 
Rutgers  University 
New  Brunswick,  New  Jersey,  United  States 
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Elite  interviews  are  a  significant  source  of  data  in 
political  science  and  sociology.  The  concept  of  the 
elite  and,  consequently,  its  empirical  identifica¬ 
tion,  however,  lacks  a  clear  consensus.  Elites  are 
typically  thought  to  have  more  resources  and 
influence  than  mass  publics  in  whatever  sphere 
they  happen  to  be  located.  Sociologists  sometimes 
define  elites  as  a  set  of  people  with  special  creden¬ 
tials  in  society  such  as  Nobel  Prize  winners. 
Political  scientists  are  especially  likely  to  define 


elites  in  the  context  of  political  power  and  often  as 
formal  holders  of  institutional  authority — typically 
public  officeholders.  Political  sociologists  may  be 
more  inclined  to  look  for  the  powerful,  whether  or 
not  they  are  holders  of  formal  authority.  Of 
course,  the  definition  of  who  is  powerful  is  highly 
contentious  and,  therefore,  far  from  self-evident. 
In  this  entry,  some  specific  features  of  interviewing 
elites  are  discussed. 

There  are  many  ways  of  procuring  information 
from  those  plausibly  described  as  elites.  Background 
data  are  often  available  through  documentation  but 
are  not  always  comparable  across  individuals. 
Sometimes  written  surveys  are  sent  to  elite  popula¬ 
tions.  The  rate  of  return  on  these  may  be  spotty  and, 
by  definition,  nonrandom.  Thus,  the  response  rate 
often  leads  to  unrepresentative  samples.  It  is  also 
unclear  who  fills  out  the  requested  information 
when  the  respondent  has  staff  personnel  to  deal 
with  such  matters.  Consequently,  elite  interviews 
are  taken  here  to  mean  some  form  of  direct  contact 
between  an  investigator  (or  a  member  of  an  investi¬ 
gator’s  staff)  and  targeted  individuals  drawn  from 
an  elite  population. 

Objectives  of  Elite  Interviews 

Elite  interviews  may  be  conducted  to  satisfy  differ¬ 
ent  objectives  of  inquiry.  Broadly,  one  purpose  of 
elite  interviewing  is  to  tell  a  story  about  a  process, 
to  relate  how  events  or  a  set  of  events  unfolded  so 
as  to  ascertain  proximate  causality,  or  to  ascertain 
influence  relations  among  actors.  These  often  pro¬ 
duce  case  studies  more  likely  to  be  predominantly 
qualitative  in  method.  A  second  purpose  of  elite 
interviewing  is  to  discover  characteristics,  percep¬ 
tions,  and  opinions  of  the  actors  themselves.  In  this 
use,  the  members  of  the  particular  elite  stratum 
become  the  units  of  analysis.  Hence,  the  focus  is  on 
the  actors  themselves,  their  attitudes,  perspectives, 
career  paths,  and  backgrounds.  Therefore,  it 
becomes  especially  vital  to  have  samples  that  are 
representative  of  the  populations  to  which  infer¬ 
ences  are  being  made. 

Case  Studies 

Typically,  when  the  objectives  of  elite  interview¬ 
ing  are  to  understand  how  a  process  or  set  of 
events  has  unfolded,  the  number  of  individuals  to 
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be  interviewed  may  be  relatively  small  and  con¬ 
ducted  with  the  narrative  as  the  foremost  objec¬ 
tive.  One  purpose  is  to  evaluate  different  versions 
of  an  event  from  the  various  perches  that  elite 
actors  occupy  and  to  assess  the  extent  to  which  a 
common  or  differentiated  picture  is  being  held  by 
the  various  actors.  In  some  instances,  interviews 
may  start  out  with  the  purpose  of  snowballing  to 
find  other  actors  whose  perspectives  or  roles  in 
events  may  be  particularly  important  and  who  are 
thereby  recommended  by  those  whom  the  investi¬ 
gator  starts  off  interviewing.  Typically,  each  inter¬ 
view  is  somewhat  unique  as  the  investigator  seeks 
to  fill  in  elements  of  a  process  or  event.  Thus,  these 
interviews  are  usually  relatively  unstructured  as 
the  interviewer  adjusts  to  new  information  and 
cues  as  to  possible  new  informants.  It  is  not 
unusual  in  complex  systems  that  the  best  infor¬ 
mants  about  process  may  be  the  facilitators  rather 
than  the  principals.  Staff  personnel  may  often  be 
better  informants  than  the  principals  themselves  in 
the  sense  of  knowing  more  of  the  actual  details  of 
what  occurred.  And  they  are  typically  more  acces¬ 
sible  as  well. 

Elites  as  Units  of  Analysis 

An  entirely  different  kind  of  study  involves 
using  individual  members  of  an  elite  to  assess  its 
aggregate  characteristics,  show  its  coherence  or 
divisions,  and  examine  differences  relevant  to  roles 
or  to  settings.  In  this  case,  sampling  reliability, 
uniformity  of  stimuli  (or  at  least  cross-validation 
of  the  stimuli),  and  missing  data  issues  are  all  per¬ 
tinent.  It  is  essential  that  samples  be  representative. 
In  some  instances,  for  example,  a  legislative  cham¬ 
ber,  one  can  know  in  advance  many  of  the  basic 
parameters  of  the  population  such  as  distributions 
by  party,  region,  age,  gender,  and  metropolitan- 
rural  distinctions.  Other  populations  may  be  less 
well  defined,  for  example,  business  executives  or 
even  civil  servants  or  political  officials  of  the 
executive  branch  of  government.  Willingness  on 
the  part  of  the  individuals  to  participate  tends  to 
be  inversely  related  to  their  organizational  status. 
The  higher  up  they  are,  the  harder  they  are  to  get. 

Whereas  interviews  undertaken  for  the  pur¬ 
poses  of  analyzing  a  case  may  be  less  systematic  in 
design,  in  part  because  of  the  need  to  adjust  in  the 
field  and  follow  leads  supplied  by  elite  informants, 


interviews  undertaken  for  the  purposes  of  charac¬ 
terizing  an  elite  population  must  be  systematically 
designed  at  the  outset  with  correspondingly  less 
adjustment  in  the  field.  Adjustments  that  may 
seem  sensible  as  the  result  of  learning  from  the 
field  experience,  nevertheless,  may  introduce  con¬ 
taminating  nonuniform  stimuli  that  could  induce 
unknown  bias  or  error  in  responses. 

The  Hybrid 

A  few  studies  have  made  use  of  elite  interview¬ 
ing  for  the  purposes  of  collecting  information 
about  a  population’s  behavior  as  well  as  to  flesh 
out  information  about  particular  cases  so  that  each 
form  of  information  gathering  sheds  light  on  the 
other.  By  providing  independent  sources  of  infor¬ 
mation,  the  particulars  of  a  case  study  may  be 
confirmed  by  the  more  general  attitudes  of  the 
relevant  elites.  Equally,  the  expressed  attitudes  of 
elites  may  be  assessed  through  the  behavioral  pat¬ 
terns  manifested  in  the  cases. 

Definitions  of  the  Elite  Universe 

Two  definitions  of  an  elite  population  exist — one 
defined  by  the  power  or  influence  of  a  set  of  indi¬ 
viduals  and  the  other  by  definitional  or  institu¬ 
tional  roles.  The  problem  with  the  first  definition  is 
that  the  influence  any  set  of  individuals  or  a  group 
exercises  is  based  either  on  reputation  or  on  behav¬ 
ior.  It  is  hard  to  disprove  reputation  since  that  is 
based  on  an  assumption  that  repute  is  either  equiv¬ 
alent  to  behavior  or  that  it  reduces  the  propensity 
for  opposition  to  arise.  By  contrast,  behavior  may 
be  shown  only  by  the  influence  that  some  set  of 
individuals’  exercises  over  decisions  that  may  vary 
depending  on  which  individuals  are  involved 
regarding  a  specific  set  of  decisions.  Because  such 
behavior  can  be  assessed  only  after  the  fact,  politi¬ 
cal  scientists  are  especially  likely  to  use  formal 
institutional  roles  as  a  basis  for  defining  a  political 
elite  and  for  analyzing  them  as  a  population. 
However,  individual  influence  is  likely  to  vary 
within  a  given  institution.  Party  leaders,  more 
senior  members,  and  heads  of  committees  are  likely 
to  play  a  more  substantial  role  than  backbenchers 
in  a  legislature,  for  example.  Influence  also  may 
vary  across  institutions.  Legislative  bodies,  for 
example,  vary  greatly  across  political  systems  in  the 
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authority  they  are  able  to  wield.  They  are  not 
equally  influential  in  decision  making.  Where  the 
executive  and  legislative  bodies  are  separated,  as  in 
the  United  States,  the  legislature  typically  is  more 
directly  influential  in  policy  decisions.  In  parliamen¬ 
tary  systems,  the  pool  of  people  likely  to  become 
more  influential,  for  example,  cabinet  ministers, 
will  be  drawn  from  the  members  of  parliament  even 
where  the  parliament  itself  may  exert  relatively  little 
independent  policy-making  influence. 

Access  to  the  Elites 

Typically,  it  is  helpful  for  the  interviewer  to  have 
some  institutional  or  personal  credentials  in  order 
to  gain  access  to  elite  populations.  Sometimes, 
having  an  interlocutor  to  speak  for  the  investiga¬ 
tor’s  credentials  can  be  an  advantage,  at  least  with 
some  elements  of  the  elite.  Some  elite  populations 
are  more  willing  to  be  interviewed  than  others  and 
often  look  forward  to  the  opportunity  to  express 
their  views.  Typically,  these  are  underaccessed 
populations.  Politicians,  especially  legislators,  tend 
to  be  overaccessed,  whereas,  for  example,  civil 
servants  traditionally  have  been  underaccessed. 
For  plenty  of  reasons,  including  busy  schedules 
and  security  measures,  top-level  political  elites  are 
typically  very  hard  to  interview,  at  least  while  they 
are  in  office.  The  passage  of  time,  however,  fre¬ 
quently  eases  accessibility  even  when  talking  about 
controversial  history  and  the  subject’s  participa¬ 
tion  in  it.  Interviews  with  small  numbers  of  pri¬ 
mary  actors  in  events  long  past,  and  with  actors 
long  past  their  prime  in  the  public  spotlight, 
become  possible,  impeded  only  by  the  mortality  or 
decrepitude  of  the  participants  in  those  events. 

Accessibility  to  governmental  elites  has  become 
more  complicated  as  a  result  of  the  security  precau¬ 
tions  that  have  become  increasingly  commonplace 
in  public  buildings.  Therefore,  proper  introductory 
credentials  are  increasingly  essential,  and  onsite 
interviews  require  that  the  interviewer’s  identifica¬ 
tion  papers  be  in  order. 

Structured  and  Nonstructured  Interviews 

Whether  an  investigator  should  have  a  high  level  of 
structure  over  the  interview  protocol  or  a  looser 
structure  depends  greatly  on  the  objectives  of  the 
inquiry.  To  the  extent  that  the  investigator  is  looking 


to  discover  a  process,  it  may  be  wise  to  begin  with 
a  relatively  less  structured  interview  protocol  that 
allows  different  perspectives  to  be  revealed.  Those 
perspectives  can  then  be  used  to  narrow  questions 
as  more  elements  of  the  process  become  identified 
and  the  pathways  of  inquiry  can  be  more  clearly 
developed.  This  also,  of  course,  assumes  that  the 
investigator  begins  the  inquiry  along  lines  that  are 
more  inductive  than  deductive. 

Alternatively,  if  the  investigator  is  seeking  atti- 
tudinal  responses,  it  is  important  that  the  inter¬ 
views  proceed  along  a  relatively  fixed  course. 
There  should  be  as  well  opportunities  to  assess 
convergent  validation  of  attitudes  by  asking  the 
same  question  in  somewhat  different  forms  across 
different  places  in  the  interview.  Elites  like  to  be 
treated  as  the  sophisticated  individuals  they  often 
are.  A  menu  of  fully  closed-ended  questions  is 
likely  to  bore  them  and  will  possibly  lead  to  a 
desire  to  end  the  interview.  The  interviewer  must 
be  alert  to  changes  in  mood,  to  signs  of  restlessness 
on  the  part  of  the  respondent,  to  hyperinterest  that 
may  lead  to  an  exhaustingly  lengthy  interview,  and 
to  a  respondent’s  tendency  to  careen  away  from 
the  subject  matter  the  interviewer  is  focused  on  to 
a  special  topic  that  the  respondent  wants  to  dis¬ 
cuss.  Above  all,  the  interviewer  needs  to  be  aware 
of  time  management  during  the  course  of  the  inter¬ 
view.  Managing  time  is  usually  less  of  a  problem 
in  the  unstructured  interview  than  in  the  struc¬ 
tured  interview  protocol.  In  adapting  to  problems 
of  time  management,  the  investigator  may  need  to 
indicate  to  the  interviewer,  in  instances  where  they 
are  not  one  and  the  same,  which  sets  of  questions 
are  fundamental  and  which  are  optional.  In  some 
instances,  it  may  be  necessary  to  tie  questions 
together  to  present  a  logical  follow-up  to  a  respon¬ 
dent’s  prior  answer.  That  is  partly  to  maximize  the 
efficient  use  of  time  and  partly  to  treat  the  respon¬ 
dents  as  the  elites  they  are,  even  though  this  may 
sporadically  violate  the  canons  of  maintaining 
question  placement. 

In  either  case,  unstructured  or  structured  inter¬ 
views,  interviewer  experience  and  adaptiveness 
(within  limits)  are  key  elements  in  creating  a  con¬ 
text  where  the  respondent  is  comfortable  with  the 
interview  situation  and  where  the  interviewer  can 
be  prepared  for  the  multitude  of  possible  interrup¬ 
tions  that  can  eat  into  the  time  available  for  the 
interview,  including  the  occasional  need  to  bring  a 
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respondent  back  on  track  when  he  or  she  goes  off 
on  a  time-consuming  tangent. 

Conclusion 

It  is  likely  becoming  more  difficult  to  obtain  inter¬ 
views  with  current  officeholding  elites  than  in 
decades  past.  Security  measures  have  made  it  more 
difficult  to  roam  through  official  buildings  after 
one  interview  in  order  to  set  up  another.  Also,  the 
use  of  electronic  or  digital  recording  equipment 
may  be  impermissible  in  some  installations.  Yet 
elite  interviewing  is  an  essential  part  of  under¬ 
standing  events  from  the  varied  perspectives  of  the 
actors  and  intimate  observers  and  of  understand¬ 
ing  the  views  of  the  elites  themselves.  Nevertheless, 
to  a  surprising  degree,  elites  are  often  amenable  to 
the  entreaties  of  well-prepared  researchers.  And 
interviewing  them  is  typically  an  illuminating  exer¬ 
cise.  But  for  these  opportunities  to  be  optimally 
rewarding,  investigators  must  be  well  prepared. 

Bert  A.  Rockman 
Purdue  University 
West  Lafayette,  Indiana,  United  States 

See  also  Elites;  Interviewing;  Interviews,  Expert 

Further  Readings 

Aberbach,  J.  D.,  Putnam,  R.  D.,  &  Rockman,  B.  A. 
(1981).  Bureaucrats  and  politicians  in  Western 
democracies.  Cambridge,  MA:  Harvard  University 
Press. 

Aberbach,  J.  D.,  &  Rockman,  B.  A.  (2002).  Conducting 
and  coding  elite  interviews.  PS:  Political  Science  & 
Politics,  35,  673-676. 

Albright,  J.  J.  (2008).  Partisans  or  independents? 

Evidence  for  campaign  targets  from  elite  interviews  in 
Spain.  Electoral  Studies,  27,  711-722. 

George,  A.,  &  Bennett,  A.  (2005).  Case  studies  and 
theory  development  in  the  social  sciences.  Cambridge: 
MIT  Press. 

Heinz,  J.  P.,  Laumann,  E.  O.,  Salisbury,  R.  H.,  &  Nelson, 
R.  L.  (1990).  Inner  circles  or  hollow  cores?  Elite 
networks  in  national  policy  systems.  Journal  of 
Politics,  52,  356-390. 

Odendahl,  T.,  &  Shaw,  A.  M.  (2002).  Interviewing  elites. 
In  J.  F.  Gubrium  &  J.  A.  Holstein  (Eds.),  Handbook 
of  interview  research:  Context  and  method 
(pp.  133-150).  Thousand  Oaks,  CA:  Sage. 


Suleiman,  E.  N.  (1974).  Politics,  poiver,  and  bureaucracy 
in  France:  The  administrative  elite.  Princeton,  NJ: 
Princeton  University  Press. 

Zuckerman,  H.  (1972).  Interviewing  an  ultra-elite.  Public 
Opinion  Quarterly,  36,  159-175. 


Interviews,  Expert 


An  expert  interview  is  a  semistandardized  inter¬ 
view  by  one  or  more  interviewers  with  a  person 
identified  as  a  so-called  expert  and  serves  to  gen¬ 
erate  data  in  a  research  context.  This  raises  the 
following  questions:  What  constitutes  an  expert? 
Why  are  experts  of  such  particular  interest  in  a 
social  and/or  political  scientific  research  setting? 
The  answer  lies  primarily  in  the  knowledge  that 
they  have  acquired  in  the  course  of  their  (profes¬ 
sional)  activities.  Experts  not  only  have  (a)  spe¬ 
cialist  professional  or  technical  knowledge  and 
(b)  knowledge  of  organizational  procedures  and 
processes,  but  they  also  have  (c)  interpretative 
and  background  knowledge  (“know-how”  and 
“know- why”)  in  their  particular  field.  Such  pro¬ 
fessional/technical  knowledge  and  process  knowl¬ 
edge  are  generally  explicit  knowledge,  which 
experts  can  usually  communicate  in  a  systematic 
and  deliberate  manner.  Interpretive  and  back¬ 
ground  knowledge  is  predominantly  tacit  and 
draws  on  the  relevant  individual  experiences, 
organizational  social  practices,  or  collective  inter¬ 
pretive  patterns  encountered  by  an  expert  in  pro¬ 
fessional  practice.  People  with  this  kind  of  broad 
knowledge  are  usually  to  be  found  on  the  middle 
and  upper — occasionally  also  on  the  lower — 
levels  of  the  organizational  hierarchy.  Given  their 
position  in  the  organizational  and  functional 
context,  experts  often  have  at  least  a  partial 
chance  of  putting  their  knowledge  and  action  to 
practical  use,  that  is,  accomplishing  their  own 
interests  and  ideas  and,  thus,  making  a  decisive 
contribution  to  what  goes  on  both  inside  and 
outside  the  organization.  This  ability  to  assert 
themselves  and  shape  events  is  generally  linked  to 
their  position  and  permits  experts  to  speak  as  a 
representative  of  an  organization  and  be  recog¬ 
nized  as  such.  In  this  entry,  the  history,  use,  and 
specific  problems  of  expert  interviews  are  dis¬ 
cussed  in  greater  detail. 
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The  History  of  Expert  Interviews 

Expert  interviews  were  first  encountered  primarily 
in  German-speaking  countries,  where  they  have 
been  used  increasingly  in  social,  economic,  and 
political  science  research  since  the  1980s.  However, 
it  was  not  until  the  early  1990s  that  they  slowly 
began  to  establish  and  distinguish  themselves  as  a 
specific  qualitative  social  research  method.  In  the 
meantime,  a  number  of  books  have  been  published 
on  the  methodology  and  methods  of  interviewing 
experts,  and  most  pertinent,  newer  German- 
language  books  on  qualitative  research  methods 
now  also  include  expert  interviews.  The  situation 
is  quite  different  in  their  English-language  counter¬ 
parts,  where — unlike  interviews  with  the  elite — 
expert  interviews  are  rarely  mentioned.  In  fact, 
there  are  many  similarities  between  these  two 
interview  forms,  from  the  sampling  and  difficulties 
of  gaining  access  to  the  field  through  to  the  actual 
specifics  of  the  interview  process.  However,  the 
key  difference  lies  in  their  target  groups:  the  elite 
are  the  powerful,  top  echelon  of  a  society.  Indeed, 
the  label  “elite”  is  ascribed  to  a  person  or  group/ 
class  of  persons  with  high  social,  educational,  and 
economic  status  and,  thus,  the  power  to  make  (or 
at  least  the  possibility  of  making)  a  significant 
impact  on  society.  From  a  sociology  of  knowledge 
perspective,  the  elite  can  also  be  seen  as  experts 
with  expert  knowledge,  but  more  precisely  as 
experts  who  have  particular  power.  In  essence, 
they  are  top  company  executives  and  members  of 
corporate  supervisory  bodies,  senior  civil  servants, 
or  high-ranking  government  officials.  The  actual 
person  who  will  be  able  to  provide  the  best  infor¬ 
mation  for  a  particular  research  topic,  the  type  of 
knowledge  sought,  and  the  position  of  the  inter¬ 
viewees  in  their  hierarchy  will  ultimately  always 
depend  on  the  specific  research  context. 

Using  Expert  Interviews 

Depending  on  the  research  design  and  topic(s) 
being  researched,  expert  interviews  can  be  used  for 
the  following  different  purposes: 

f .  Exploratory  expert  interviews  are  used  to  obtain 
an  overview  of  and  access  to  a  less  familiar  field. 

2.  Systematizing  expert  interviews  serve  to 
systematically  reconstruct  “objective,” 


specialized  technical  and  procedural  knowledge 
in  a  particular  field. 

3.  Theory-generating  expert  interviews  focus  not 
just  on  specialized  expert  knowledge  but  also  on 
the  tacit  and  subjective  interpretative  and 
background  knowledge  gained  through 
(professional)  experience. 

Many  research  projects  combine  the  use  of 
expert  interviews  with  other  methods  (mixed 
methods  or  triangulation  of  methods).  In  political 
science  research,  for  example,  expert  interviews 
are  often  used  in  combination  with  document 
analysis  to  ascertain  the  history  behind  certain 
documents.  This  is  usually  done  less  with  a  view  to 
validating  the  data  and  results  but  more  often  to 
establish  a  broad  picture  of  social  practices  in  a 
specific  field. 

Sampling  and  Access  Problems 

Selecting  the  appropriate  interviewees  for  a  par¬ 
ticular  research  project  depends  greatly  on  the 
actual  research  topic(s)  and  specific  field  of 
research.  There  are  often  only  a  limited  number  of 
experts  in  a  given  field.  In  such  cases,  it  is  best 
(where  possible)  to  interview  all  relevant  experts, 
particularly  since  they  will  often  have  their  own 
networks.  Such  networks  can  help  encourage 
experts  to  participate  in  an  interview  to  ensure 
their  views  are  included.  Altruism,  or  even  a  desire 
to  enhance  their  own  status,  can  also  be  a  possible 
motive  for  agreeing  to  an  interview.  The  first  step 
in  the  sampling  process  involves  analyzing  litera¬ 
ture  or  media  reports  and  talking  to  relevant 
sources — people  who  are  familiar  with  the  field — 
to  identify  the  key  experts.  The  original  sample 
should  then  be  extended  in  the  interview  phase  by 
asking  the  interviewees  themselves  to  recommend 
further  experts. 

Time  restrictions  and  a  general  lack  of  willing¬ 
ness  to  provide  information  can  frequently  be  a 
problem  when  seeking  access  to  experts,  so  it  is 
particularly  important  that  the  first  contact — 
either  in  writing  or  by  phone — is  carefully  pre¬ 
pared.  To  encourage  experts  to  participate  in  an 
interview,  the  first  contact  with  them  should 
briefly  outline  the  goals  and  relevance  (e.g.,  inno¬ 
vativeness)  of  the  research  and  explain  how  impor¬ 
tant  it  is  that  they  participate.  Interviews  should  be 
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calculated  to  last  no  longer  than  45  minutes  (even 
if  they  ultimately  take  longer  in  reality).  If  a  face- 
to-face  meeting  cannot  be  arranged,  an  interview 
can  also  be  conducted  by  phone.  One  problem 
with  this  option  is  the  lack  of  control  over  the 
interview  situation:  The  interviewee  might,  for 
example,  be  distracted  by  other  tasks.  Similarly,  no 
account  can  be  taken  of  body  language  or  gestures. 

Interview  Guidelines/Topic  Guide 

Drawing  up  a  set  of  interview  guidelines  is  an 
essential  part  of  the  preparation  for  an  expert 
interview  and  familiarization  with  the  relevant 
vocabulary  and  field  of  research.  Such  guidelines 
should  structure  the  central  topics  in  line  with  the 
expected  course  of  the  interview  and  can  be  sent  to 
any  experts  who  request  information  about  the 
interview  prior  to  the  event.  Nonetheless,  to  ensure 
that  the  interviews  flow  as  authentically  as  possi¬ 
ble,  the  guidelines  should  not  be  overly  detailed.  In 
the  actual  interview,  they  serve  as  an  aide-memoire 
and  prevent  the  interviewer  from  overlooking  any 
important  questions.  However,  they  should  only 
be  used  as  guidelines  and  not  worked  through  rig¬ 
idly  point  for  point:  Interviewees  should  be  given 
the  maximum  opportunity  to  express  their  own 
opinions  and  ideas.  The  more  an  interviewer  suc¬ 
ceeds  in  getting  an  expert  to  talk,  for  example, 
through  a  narrative  generating  introductory  ques¬ 
tions  and  request  for  concrete  examples  the  greater 
the  chance  that  tacit  knowledge  (in  the  form  of 
interpretive  and  background  knowledge)  will 
emerge  in  the  interview.  This  knowledge  is  particu¬ 
larly  relevant  for  reconstructing  social  practices  in 
a  particular  field. 

Interaction  Situations  and  Effects 

Expert  interviews  are — like  all  interviews — a  form 
of  social  interaction.  The  basic  intent  is  to  get  the 
interviewees  to  say  what  they  think  needs  to  be 
said  in  a  given  situation.  “What”  they  say  and 
“how”  they  say  it  depends  on  many  different  ele¬ 
ments  of  the  interaction — from  expectations  and 
motives  for  participation  to  situational  aspects 
such  as  time  pressure,  sympathy  or  antipathy,  and 
trust  or  mistrust.  The  interviewee’s  capacity  to 
remember  and  the  interviewer’s  interviewing  skills, 
way  of  asking  questions,  openness,  inquisitiveness, 


self-assurance,  and  so  on  will  also  influence  the 
conversation.  Gender  relations  can  also  play  a 
twofold  role  in  the  interview.  First,  most  of  the 
experts  are  men,  as  there  are  relatively  few  women 
in  management  positions.  Second,  the  probability 
of  the  participants  “doing  gender”  (i.e.,  assuming 
gender-specific  roles,  particularly  in  a  mixed- 
gender  setting)  becomes  highly  likely.  Of  particular 
relevance  for  interaction  in  an  expert  interview  set¬ 
ting  are  the  status  and  relationships  the  interviewee 
accords  the  interviewer.  The  latter  is  typically  per¬ 
ceived  as  a  co-expert,  a  potential  critic,  an  accom¬ 
plice,  a  controlling  authority,  or  a  layperson.  These 
associations  affect  the  interviewee’s  behavior,  will¬ 
ingness  to  communicate  and  manner  of  communica¬ 
tion,  attitude,  and  so  on.  Interview  effects  resulting 
from  an  individual  interview  constellation  cannot  be 
avoided  and  are  not  necessarily  disadvantageous. 
They  may  even  be  of  strategic  benefit.  For  example, 
if  they  consider  the  interviewer  a  layperson,  inter¬ 
viewees  may  well  be  more  willing  to  provide  infor¬ 
mation  about  their  field  of  expertise  and  experience. 
Interviewers  should  also  be  aware  that  even  research- 
related  interviews  are  a  form  of  social  interaction 
and  do  not  permit  a  “pure”  or  even  objective  gather¬ 
ing  of  data. 

Analysis 

There  is  no  standard  procedure  for  analyzing  expert 
interviews.  In  principle,  all  qualitative  social 
research  analysis  methods  can  be  used,  for  example, 
the  code-based  procedures  common  in  grounded 
theory  or  qualitative  content  analysis,  or  the  sequen¬ 
tial  analyses  applied  in  hermeneutic  sociology  of 
knowledge  or  objective  hermeneutics.  A  combina¬ 
tion  of  different  methods  is  also  admissible.  The 
most  suitable  form  of  analysis  ultimately  depends 
on  the  actual  research  project.  Computer-assisted 
analysis  using  special  qualitative  data  management 
programs  can  help  but  cannot  automatically  com¬ 
plete  the  data  analysis.  Even  if  computers  are  used 
to  assist  with  data  analysis,  it  remains  the  task  of 
the  researchers  to  interpret  the  data  in  a  transpar¬ 
ent,  plausible,  and  comprehensible  manner. 

Ethics 

Given  their  prominent  position  in  what  is  often  a 
small  and  clear  field,  it  can  be  difficult  to  keep  the 
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identity  of  experts  hidden.  Consequently,  the 
researcher  should  always  clarify  how  much  of  the 
information  obtained  from  the  interviewee  can  be 
published  in  a  nonanonymous  form  (e.g.,  as  quota¬ 
tions).  In  some  cases,  it  may  be  necessary  to  have 
the  interviewee  expressly  authorize  the  use  of  the 
minutes  or  interview  transcript  for  analysis  or  pub¬ 
lication  purposes.  The  expert  should,  in  all  cases, 
be  given  the  assurance  that  all  data  will  be  treated 
in  confidence. 

Be  ate  Littig 
Institute  for  Advanced  Studies 
Vienna,  Austria 
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Irredentism 


The  term  irredentism  is  derived  from  the  Italian 
word  irredenta  (unredeemed).  It  originally  referred 
to  an  Italian  political  movement  during  the  late 
1800s  and  early  1900s  that  sought  to  detach  pre¬ 
dominantly  Italian-speaking  areas  from 
Switzerland  and  the  Austro-Hungarian  Empire 
and  incorporate  these  territories  into  the  new 
Italian  state,  thus  “redeeming”  these  territories. 
Modern  usage  denotes  territorial  claims  based  on 
a  national,  ethnic,  or  historical  basis.  In  many 


cases,  an  irredentist  movement  is  referred  to  as 
creating  a  “Greater  X,”  with  the  “X”  being  the 
name  of  a  nation  or  state — for  example,  the  quest 
for  a  Greater  Serbia  was  in  part  responsible  for 
the  Wars  of  Yugoslav  Succession  during  the 
1990s,  and  the  Greater  Romania  project  ( Romania 
Mare)  was  partially  responsible  for  Romania  sid¬ 
ing  with  the  Axis  powers  during  World  War  II.  It 
is  related  to,  but  distinct  from,  secession: 
Irredentism  is  the  process  by  which  a  part  of  an 
existing  state  breaks  away  and  merges  with 
another,  whereas  in  secession  merging  does  not 
take  place.  The  importance  of  irredentism  in  inter¬ 
national  relations  is  based  on  the  intersection 
between  nationalism  and  the  causes  of  war; 
because  such  a  movement  invariably  means  taking 
land  from  another  state,  irredentist  claims  have 
been  known  to  provoke  ethnic  conflicts  and  ter¬ 
ritorial  aggression.  The  continued  discord  between 
nations  and  states  means  that  the  potential  for 
irredentist  wars  remains  serious.  This  entry  identi¬ 
fies  some  notable  cases,  provides  examples  of  how 
irredentist  claims  are  sometimes  enshrined  in  state 
constitutions,  and  reviews  the  literature  on  this 
phenomenon. 

Notable  examples  of  irredentism  include  Nazi 
Germany’s  claims  on  the  Sudetenland  of 
Czechoslovakia;  the  “Megali  Idea”  that  sought  to 
create  a  Greater  Greece;  China’s  desire  to  reincor¬ 
porate  the  territories  lost  during  periods  of  histori¬ 
cal  weakness;  Somalia’s  invasion  of  Ethiopia  during 
the  Ogaden  War  of  1977-1978;  the  attempt  by 
Hungary  to  reverse  the  1920  Treaty  of  Trianon  and 
reclaim  territories  in  Slovakia,  Romania,  and 
Yugoslavia  during  World  War  II;  and  the  continu¬ 
ing  conflict  between  Armenia  and  Azerbaijan  over 
the  Armenian-populated  Nagorno-Karabakh 
region.  More  questionable  cases  include  those  with¬ 
out  a  direct  ethnic  component  but  rather  based 
solely  on  historical  claims,  such  as  Argentina’s  inva¬ 
sion  of  the  British-populated  Falkland  Islands  (Islas 
Malvinas)  in  1982  or  Serbian  claims  to  Kosovo, 
which  was  the  birthplace  of  Serbian  nationalism 
but  is  now  populated  overwhelmingly  by  ethnic 
Albanians.  However,  these  territories  could  also  be 
considered  “unredeemed,”  and  therefore,  these 
cases  can  fall  under  the  category  of  irredentism. 

In  some  instances,  states  have  gone  so  far  as  to 
incorporate  irredentist  claims  directly  into  their 
constitution.  For  example,  Argentina  claims  that 
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recovering  the  Falkland  Islands  is  “a  permanent 
and  unrelinquished  goal  of  the  Argentine  people,” 
and  China  considers  Taiwan  to  be  part  of  the 
country’s  “sacred  territory.”  The  1937  Irish  con¬ 
stitution  originally  laid  claim  to  the  entire  island  of 
Ireland,  but  this  was  changed  following  the  1998 
Good  Friday  Agreement  resolving  the  conflict  in 
Northern  Ireland. 

Despite  the  numerous  potential  cases  of  irreden¬ 
tism,  and  the  long  history  of  these  claims  sparking 
armed  conflict,  there  has  been  surprisingly  little 
scholarly  research  done  on  this  topic,  and  no 
consensus  position  on  its  causes,  dynamics,  or 
resolution  has  emerged.  However,  five  explana¬ 
tory  categories  can  be  identified:  (1)  structural, 
(2)  realist,  (3)  rational  choice,  (4)  domestic,  and 
(5)  constructivist.  Given  the  complexity  of  irreden¬ 
tism,  there  are  likely  multiple  and  overlapping 
explanations  for  any  particular  case. 

Structural  explanations  argue  that  the  interna¬ 
tional  or  regional  context  within  which  the  poten¬ 
tially  irredentist  state  operates  will  play  a  large  role 
in  determining  whether  such  a  project  is  initiated 
and  the  conditions  under  which  it  ends.  In  most 
cases,  this  focuses  on  the  relative  support  in  the 
international  system  for  state  sovereignty  versus 
national  self-determination.  If  the  latter  is  empha¬ 
sized,  then  nationalist  claims  will  be  allowed  to 
override  the  inviolability  of  sovereign  borders;  if 
the  former  is  emphasized,  then  the  legitimacy  of 
irredentist  claims  will  be  widely  rejected.  The 
dramatic  increase  in  irredentist  conflicts  during 
periods  of  major  international  upheaval  and  nor¬ 
mative  reordering  is  seen  as  important  evidence  in 
favor  of  this  position. 

Realist  arguments  emphasize  the  relative  bal¬ 
ance  of  power  between  the  irredentist  state  in  ques¬ 
tion  and  either  the  target  state  or  the  international 
community.  In  the  first  case,  military  weakness 
vis-a-vis  the  target  state  will  force  the  irredentist 
country  to  relinquish  its  claims;  in  contrast,  relative 
military  strength  will  precipitate  such  a  project.  A 
weak  Albania  had  little  chance  of  forcibly  altering 
its  border  with  Yugoslavia,  but  a  strong  Nazi 
Germany  could  pressure  France  and  Great  Britain 
to  force  territorial  revisions  on  Czechoslovakia. 
The  latter  argument  takes  the  balance  of  power  to 
a  higher  level  by  examining  the  degree  to  which 
relevant  international  actors  (states  or  international 
organizations)  acquiesce  in  or  tolerate  the  policies 


of  specific  irredentist  states.  The  more  tolerant 
they  are,  the  more  an  irredentist  state  can  get  away 
with;  if  they  are  less  tolerant,  then  an  irredentist 
state  is  forced  to  temper  or  restrict  its  policies. 
During  the  early  years  of  the  Wars  of  Yugoslav 
Succession,  the  international  community  did  little 
other  than  rhetorical  condemnation  to  stop 
Croatia’s  and  Serbia’s  irredentist  projects. 
However,  once  key  states  (particularly  the  United 
States)  adopted  a  more  robust  and  muscular 
stance,  then  both  countries  recognized  the  prewar 
borders  of  the  Yugoslav  republics. 

Rational  choice  accounts  examine  the  decision¬ 
making  processes  of  elites  that  initiate  an  irredentist 
conflict  for  instrumental  purposes.  Two  arguments 
fall  under  this  rubric:  elite  conflict  and  diversionary 
theory.  Under  the  first  explanation,  leaders  will 
attempt  to  use  irredentism  as  a  tool  to  counter  chal¬ 
lenges  from  other  elites  by  appealing  to  the  nation¬ 
alist  sentiments  of  the  populace.  For  example, 
Slobodan  Milosevic  used  irredentist  claims  in 
Kosovo,  Croatia,  and  Bosnia-Herzegovina  to  rise 
to  political  preeminence  in  Serbia.  According  to  the 
latter  argument,  decision  makers  will  initiate  an 
irredentist  conflict  to  divert  the  attention  of  their 
population  from  domestic  problems.  Argentina’s 
invasion  of  the  Falkland  Islands  to  prop  up  the 
military  junta  is  often  considered  a  classic  case  of 
diversionary  war  theory. 

Domestic-level  explanations  also  come  in  two 
forms:  demographics  and  regime  type.  The  first 
argument  holds  that  ethnically  homogeneous  states 
are  more  likely  to  be  irredentist  because  the  popula¬ 
tions  of  ethnically  divided  states  will  be  unwilling 
to  support  a  project  that  benefits  only  one  group 
and  might  have  the  effect  of  altering  the  ethnic  bal¬ 
ance  of  the  state  if  successful.  For  example,  given 
the  serious  discordance  between  ethnicity  and 
statehood  in  nearly  all  African  states,  it  should  not 
be  surprising  that  the  only  substantial  irredentist 
conflict  was  the  one  launched  by  ethnically  homo¬ 
geneous  Somalia.  The  second  explanation  argues 
that  democratic  states  are  less  likely  to  have  irre- 
dentist-type  conflicts  between  them  because  of 
increased  institutional  restraints  on  launching  an 
irredentist  project;  enhanced  protection  for  minor¬ 
ity  groups,  which  undermines  the  reasons  for 
moving  to  protect  one’s  diaspora;  membership  or 
possible  membership  in  international  organizations 
that  prohibit  such  claims  (e.g.,  North  Atlantic 
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Treaty  Organization  [NATO]  or  the  European 
Union);  and  the  dynamics  of  democratic  peace 
theory,  in  which  democratic  states  tend  to  resolve 
their  disputes  peacefully.  The  reduction  in  tensions 
between  Hungary,  on  the  one  hand,  and  Romania 
and  Slovakia,  on  the  other,  was  associated  with  the 
consolidation  of  democracy  in  these  countries, 
whereas  nearly  all  irredentist  conflicts  were  initi¬ 
ated  by  authoritarian  regimes. 

The  final  explanation  for  irredentism  is  derived 
from  the  insights  of  constructivism,  which  holds 
generally  that  the  social  construction  of  ideas  and 
concepts  is  instrumental  in  determining  political 
outcomes.  When  applied  to  the  issue  of  irreden¬ 
tism,  two  arguments  emerge.  The  first  asserts  that 
the  type  of  national  identity  that  is  dominant  in  a 
particular  state  or  nation  will  influence  the  degree 
to  which  it  will  become  irredentist:  An  ethnic  iden¬ 
tity  emphasizes  the  unity  of  the  culturally  based 
nation  across  political  boundaries  and  lends  itself 
to  the  promotion  of  altering  borders  to  realize  a 
physical  union  of  the  nation;  a  civic  national  iden¬ 
tity  is  political  in  nature,  usually  tied  to  a  preexist¬ 
ing  state,  and  is  therefore  more  easily  limited  to 
status  quo  boundaries.  Some  25  million  ethnic 
Russians  found  themselves  as  minorities  scattered 
throughout  the  former  Soviet  Union  on  its  collapse. 
However,  despite  the  fact  that  many  of  these 
Russians  were  in  territories  contiguous  to  Russia 
itself,  the  association  of  the  Russian  Federation 
(Rossiyskaya  Federatsiya)  with  the  concept  of  ros- 
siiski  (civic)  rather  than  russkii  (ethnic)  dampened 
the  legitimacy  of  an  irredentist  project  aimed  at 
creating  a  Greater  Russia  in  the  1990s.  By  contrast, 
ethnically  based  national  identities,  such  as  that 
found  in  Nazi  Germany,  more  easily  lend  them¬ 
selves  toward  a  desire  for  unification.  The  second 
constructivist  argument  focuses  on  the  ways  in 
which  the  irredentist  claims  are  justified  in  a  soci¬ 
ety.  This  justification  includes  the  arguments  that 
assert  the  group’s  right  and  obligation  to  unify  the 
nation  and  the  proper  means  by  which  to  achieve 
this  goal.  The  potential  for  irredentism  can  be 
reduced  if  this  justification  is  somehow  under¬ 
mined,  perhaps  by  reconceptualizing  the  nation’s 
history,  the  physical  boundaries  of  the  nation,  or 
the  place  of  the  nation  in  the  international  commu¬ 
nity.  The  belief  that  Serbia  was  obligated  to  protect 
its  fellow  Serbs  in  Croatia  and  Bosnia-Herzegovina, 
when  mixed  with  the  conception  of  the  Serb  nation 


as  the  defenders  of  Europe  against  Islam,  helped 
fuel  the  quest  for  a  Greater  Serbia.  By  contrast, 
Germany’s  renunciation  of  aggression  after  World 
War  II  and  the  self-described  role  of  facilitator  of 
Pan-European  cooperation  precluded  adjustments 
in  the  country’s  post-World  War  II  borders. 

Thomas  Ambrosio 
North  Dakota  State  University 
Fargo,  North  Dakota,  United  States 
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Islam 


Why  is  Islam,  which  is  a  religious  faith  and  a  local 
culture,  placed  as  an  entry  in  an  encyclopedia  for 
political  science?  This  discipline  is  not  supposed 
to  deal  with  religion  and  culture  but  instead  with 
the  polity  and  government.  The  subject  matter  is  a 
religious  faith  that  assembles  nearly  1.6  billion 
people  of  humankind  as  an  umma  (community) 
that  consists  of  a  great  variety  of  local  Islamic 
cultures.  These  are,  however,  related  to  one 
another  in  terms  of  shared  values  and  worldview. 
On  these  grounds,  there  is  a  resemblance  that 
unites  all  Muslims  to  one  civilization.  Cultural 
diversity  exists  in  Islam  within  the  unity  of  one 
Islamic  civilization.  In  view  of  this  cultural  diver¬ 
sity,  there  is  no  monolith  named  Islam. 

The  answer  to  the  question  concerning  the  legiti¬ 
macy  of  this  entry  is  that  Islamic  civilization  matters 
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to  politics  since  the  great  changes  in  the  world 
around  the  end  of  the  previous  century,  particularly 
with  respect  to  decolonization  and  globalization. 
Islam  at  the  beginning  of  the  21st  century  is  a  major 
factor  in  politics  on  two  political  levels.  First,  on  the 
state  level,  the  Organization  of  the  Islamic 
Conference  (OIC)  unites  57  states  with  an  Islamic 
majority  population.  Second,  there  are  Islamic  non¬ 
state  actors  (e.g.,  A1  Qaeda  and  the  Muslim 
Brotherhood)  that  engage  on  all  levels  of  politics. 

The  politicization  of  the  religion  of  Islam  is  con¬ 
nected  with  the  claim  that  Islam  is  not  only  a  faith 
but  also  a  constructed  system  of  government.  The 
idea  of  an  Islamic  state  is  based  on  sharia  law, 
believed  to  be  the  constitution  legislated  by  Allah 
for  a  divine  order.  This  political  interpretation  of 
Islam  makes  from  this  religion  an  ideology  of 
Islamism  that  leads  to  inner  conflicts  within  Islamic 
civilization.  The  political  theorist  John  Brenkman 
characterizes  this  issue  as  a  “civil  war”  within 
Islam  (e.g.,  the  fight  between  the  secular  Kemalists 
and  Islamists  in  Turkey).  The  spillover  of  this  pro¬ 
cess  to  world  politics  assumes  a  geopolitical  dimen¬ 
sion  with  the  related  effects  on  world  politics. 

The  politicization  of  Islam  suggests  a  more  com¬ 
plex  relation  between  religion  and  politics  on  all 
three  levels:  local,  regional,  and  international/ 
global.  The  major  issue  is  a  return  of  Islam  to  poli¬ 
tics  that  indicates  the  failure  of  the  secularization 
process  in  contemporary  Islamic  civilization.  This 
failure  is,  among  other  things,  related  to  the  crisis  of 
the  secular  nation-state  as  a  crisis  of  development. 

The  Roots  of  Politics  in  Islam 

At  issue  is  political  Islam,  not  the  Islamic  faith 
based  on  religious-cultural  meaning  inherent  in  a 
cultural  system.  In  short,  Islamic  politics  matter  to 
political  science.  This  politicized  Islam  serves  as  a 
political  ideology  often  used  as  a  device  for  politi¬ 
cal  legitimacy.  This  is  not  a  novelty  to  Islam,  since 
it  is  a  peculiarity  of  this  religion  to  have  had  a  close 
relation  to  politics  and  war  from  its  birth  onward. 
The  novelty  is,  however,  the  recent  phenomenon 
of  Islamism  and  its  idea  of  an  Islamic  state.  To 
understand  the  distinction,  one  has  first  to  look  at 
the  past.  Islamic  revelation  started  spiritually  in 
Mecca  in  the  year  610.  It  was  not  until  the  creation 
of  the  polity  of  the  umma  (i.e.,  community,  not  the 
Islamic  state,  as  Islamists  today  wrongly  contend) 


in  Medina  in  622  that  Islam  was  intermingled  with 
politics.  In  that  year  Islamic  history  commences, 
and  therefore,  in  their  calendar,  Muslims  number 
622  as  the  year  1.  This  is  the  Islamic  hijra  calen¬ 
dar.  Therefore,  hijra  (migration)  has  a  specific 
meaning  in  Islam:  A  Muslim  is  supposed  to 
migrate  in  pursuit  of  the  spread  of  Islam.  In  the 
year  622,  Prophet  Mohammed  migrated  from 
Mecca  to  Medina  and  engaged  in  political  activity 
to  spread  Islam.  For  Muslims,  this  is  a  binding 
precedent.  In  his  new  location,  the  Prophet  made 
political  decisions  and  also  fought  wars.  Therefore, 
the  late  French  scholar  of  Islam,  Maxime  Rodinson, 
characterized  Prophet  Mohammed  as  a  combina¬ 
tion  of  Jesus  and  Charlemagne.  However,  neither 
the  holy  book  of  the  Muslims,  the  Koran,  nor  the 
tradition  of  Prophet  Mohammed,  that  is,  the  scrip¬ 
ture  of  the  Hadith,  ever  made  provisions  for  how 
to  rule  the  polity.  It  follows  that  there  is  no  provi¬ 
sioned  system  of  government  in  the  authoritative 
scripture  of  Islam.  In  short,  the  idea  of  an  Islamic 
rule  is  a  construction  made  after  the  death  of  the 
Prophet.  One  has  also  to  add  that  the  system  of  the 
caliphate,  established  after  the  death  of  the  Prophet 
(in  632),  should  not  to  be  conflated  with  the 
Islamic  state  for  which  Islamists  at  present  fight. 
These  are  distinctly  different  issues. 

In  632,  the  aristocracy  of  Mecca,  which  was 
composed  of  elders  of  the  Islamized  tribe  of 
Quraysh,  established  three  tenets:  (1)  the  Islamic 
ruler  should  be  a  successor,  that  is,  a  caliph,  of  the 
Prophet;  (2)  the  caliph  should  be  descended  from 
the  tribe  of  Quraysh — a  requirement  that  contra¬ 
dicts  the  Islamic  provision  that  all  Muslims  form 
one  community  that  transcends  tribes  and  creates 
a  transtribal  umma  in  which  tribal  affiliations  are 
abolished;  and  (3)  the  caliph  should  be  close  to  the 
Prophet  and  to  the  tradition  he  established.  In  this 
khilafa  (meaning  “caliphate”)  system,  selection  of 
the  caliph  was  based  on  merit  and  descent  in  the 
formative  years  of  Islamic  rule  from  632  through 
661.  There  were  four  Qurayshi  leaders — Abu 
Bakr,  Omar,  Othman,  and  Ali — who  were  selected 
to  rule  as  caliphs.  Three  of  these  righteous  (rash- 
idun)  caliphs,  including  the  last  one,  Ali,  were 
brutally  assassinated.  Despite  this  bloody  feature, 
the  rashidun  age  is  considered  to  date  to  be  the 
model  for  emulation  with  regard  to  political  rule. 

Following  the  assassination  of  Caliph  Ali  in 
661,  the  Islamic  umma  experienced  a  violent 
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schism  between  the  Sunnis  and  Shiites.  The 
Omaiyyads  usurped  the  caliphate  and  established 
dynastical  rule.  In  so  doing,  they  transformed  the 
caliphate  from  selective  to  hereditary  rule.  They 
also  shifted  the  capital  of  Islam  from  Medina/ 
Mecca  to  Damascus.  This  first  imperial  rule  of 
Islam,  named  the  Omayyad  caliphate,  successfully 
led  global  jihad  wars.  This  Islamic  expansion 
resulted  in  an  Islamic  empire  that  stretched  from 
China  to  Spain.  In  Islamic  history,  two  other  such 
imperial  caliphates  followed  successively,  namely, 
the  Abasside  of  Baghdad  and  the  Ottoman  of 
Istanbul.  In  this  imperial  history,  which  lasted 
until  1924,  Islam  was  not  only  a  faith  practiced  in 
a  great  variety  of  local  cultures  but  also  the  basis 
for  a  cross-cultural  civilization  and  for  the  legiti¬ 
macy  of  imperial  rule  (or  caliphate).  This  rule  is 
considered  to  be  the  Islamic  system  of  government. 
Muslims  fought  jihad  wars  from  the  7th  through 
the  17th  centuries  in  pursuit  of  an  Islamic  expan¬ 
sion  envisioned  to  map  the  entire  globe  into  Dar 
al-Islam  (realm,  or  house,  of  Islam). 

History  did  not  follow  this  course  prescribed  by 
Islam.  The  world  was  mapped  into  another  sys¬ 
tem,  namely,  the  Westphalian  one  of  sovereign 
states  that  was  established  in  1648.  This  system 
continues  to  be  the  basis  of  the  modern  interna¬ 
tional  system. 

The  End  of  Islamic  Expansion 
and  the  International  System 

For  Islam,  the  Peace  of  Westphalia  matters  with 
respect  to  both  the  past  and  the  present.  The  roots 
for  the  launching  of  a  competitive  globalization 
project  are  related  to  the  rise  of  the  West  with  its 
“military  revolution”  (1500-1800),  based  on  the 
industrialization  of  warfare.  This  rise  interrupted 
the  Islamic  expansion  and  replaced  it  with  its  own 
European  expansion,  which  is  the  crux  of  the  rela¬ 
tion  between  Islam  and  politics.  In  the  past,  the 
rise  of  the  West  halted  Islamic  expansion.  At  pres¬ 
ent,  the  Westphalian  system  of  states  maps  the 
entire  world  of  Islam.  The  Islamic  revolt  against 
the  West  is  not  only  a  revolt  against  this  system 
but  an  attempt  to  reverse  history. 

After  their  failure  to  capture  Vienna  in  1683, 
the  Muslim  armies  retreated  and  lost  all  the  ensu¬ 
ing  wars.  The  importation  of  European  military 
practices  into  Muslim  armies  did  not  remedy  the 


existing  deficiencies.  European  powers  not  only 
stopped  Islam  from  further  invading  Europe  and 
pushed  it  back  from  Europe,  but  they  also  invaded 
the  world  of  Islam  itself.  The  third  and  last  global 
Islamic  caliphate  of  the  Ottoman  Empire  declined. 
After  a  successful  revolution,  Kemal  Atatiirk  abol¬ 
ished  the  caliphate  in  1924,  following  the  declara¬ 
tion  of  the  secular  republic  of  Turkey.  Turkey  is 
the  only  secular  republic  in  the  world  of  Islam  that 
enshrines  secularism  in  its  constitution.  One  year 
after  the  abolition  of  the  caliphate,  the  Al-Azhar 
scholar  Ali  Abdel-Raziq  (1888-1966)  published 
his  book  al-Islam  wa  usul  al-hukm  (Islam  and  the 
Origins  of  Government),  in  which  he  argues  that 
Islam  was  just  a  religious  faith  and  was  wrongly 
used  to  legitimate  political  rule.  This  depoliticiza- 
tion  of  Islam  was  strongly  contested  by  Muslims. 
As  punishment,  Abdel-Raziq  was  fired  from  his 
position  at  Al-Azhar  University  in  Cairo,  and  he 
died  in  poverty  in  1966.  Like  Turkey,  most  Islamic 
states  underwent  various  forms  of  secularization, 
without,  however,  a  formal  adoption  of  secular¬ 
ism.  The  transitory  period  of  colonial  rule  in  most 
of  the  former  provinces  of  the  Ottoman  Empire 
ended  in  the  building  of  independent,  secular 
nation-states  throughout  the  world  of  Islam.  These 
states  were  mapped  into  the  Westphalian  system  of 
sovereign  states  and  became  a  part  of  the  modern 
international  system.  This  was  the  age  of  secular 
politics  in  the  world  of  Islam,  which  lasted  from 
1924  to  1967.  The  Six-Day  War  of  1967  launched 
in  its  effects  the  return  of  Islam  to  politics.  This 
process  was  facilitated  by  the  fact  that  the  new 
secular  nation-states  had  a  weak  legitimacy  and 
lacked  political  capacities.  This  weakness  was 
emphasized  by  the  1967  defeat,  and  Arab  states 
appeared  as  “nominal  nation-states,”  even  as 
“tribes  with  national  flags.”  The  lack  of  institu¬ 
tions  that  would  have  been  shaped  within  the 
national  cultures  of  these  countries  resulted  in  a 
dangerous  lack  of  legitimacy. 

Islamic  Politics  and  Islamic  States 

In  the  historical  period  between  the  abolition  of 
the  caliphate  in  1924  and  the  Six-Day  War  of 
1967,  Islam  receded  by  and  large  from  the  political 
arena.  The  Muslim  Brotherhood  movement  was 
established  in  1928  in  Cairo,  but  despite  their 
activity,  such  movements  were  by  then  mostly  at 
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the  fringe  of  politics.  Secularism  prevailed  in  a  vari¬ 
ety  of  forms  in  most  Muslim  countries — of  course, 
not  in  Wahhabi  Saudi  Arabia  but  especially  in  the 
staunchly  secular  republic  of  Turkey.  Also,  the 
Pan-Arab  Egypt  under  Gamal  Abdel  Nasser  had  a 
secular  legitimacy,  as  did  the  largest  nation  of  Islam 
in  Southeast  Asia — Indonesia — under  the  rule  of 
the  secular  Ahmed  Sukarno.  The  Arab  defeat  in  the 
Six-Day  War  of  1967  had  repercussions  that  led  to 
a  delegitimation  of  secular  regimes,  first  in  the 
Middle  East  and  then  in  the  rest  of  the  Islamic 
world.  Among  the  consequences  was  the  ascen¬ 
dancy  of  political  Islam  (Islamism),  which  moved 
from  the  fringe  to  the  core  of  politics.  For  a  number 
of  reasons,  the  effects  of  this  process  spilled  over 
from  the  Arab  Middle  East  to  the  rest  of  the  world 
of  Islam.  One  of  the  factors  that  promoted  the 
spillover  effects  is  related  to  the  Arab  part  of 
Islamic  civilization  being  recognized  as  its  core.  It 
generates  cultural-religious  influence  over  the  rest 
of  the  world  of  Islam.  For  instance,  the  Middle  East 
influences  Southeast  Asia  and  not  the  other  way 
around,  even  though  Indonesia’s  population  is 
about  three  times  greater  than  that  of  Egypt. 

The  connection  between  Islam  and  politics  is 
established  today  by  the  Islamist  movements  that 
reclaim  an  Islamic  order  (nizam  Islami)  following 
a  global  trend  that  gives  a  more  important  role  to 
religion  in  politics.  Today,  it  is  Sunni  Arab  Islam 
and  not  the  Shiite  revolution  in  Iran  that  generates 
(a)  jihad  in  the  new  understanding  of  an  Islamic 
world  revolution;  (b)  shariatization  of  politics, 
based  on  a  new  understanding  of  Islamic  law;  and 
(c)  a  call  for  the  return  to  the  golden  age  of  Islam 
in  a  messianic  perspective  that  appeals  to  people 
who  feel  frustrated  and  deprived  after  the  break¬ 
down  of  communism  and  the  end  of  the  East-West 
conflict.  The  uprising  of  2011  against  the  existing 
authoritarian  order  of  the  state,  which  began  in 
January  in  the  Arab  world,  first  in  Tunisia  and  fol¬ 
lowed  by  Egypt  and  Libya,  is  to  be  placed  in  this 
context. 

Islam’s  return  to  the  forefront,  in  the  shape  of 
Islamism,  creates  a  bipolarity  between  the  Islamic 
and  non-Islamic  worlds  and  is  accompanied  by 
Islam’s  conflicts  with  modernity.  Islam  is  a  civiliza¬ 
tion  subjected  to  a  politicization  that  is  also — in 
terms  of  world  politics — a  politicization  of  what 
Raymond  Aron  called  the  “heterogeneity  of  civili¬ 
zations.”  This  process  leads  to  an  intercivilizational 


conflict.  The  reader  is  reminded  of  the  distinction 
that  a  conflict  is  not  a  clash:  A  conflict  can  be 
peacefully  resolved  in  a  negotiation  process  that 
could  assume  the  shape  of  a  dialogue  between 
civilizations,  whereas  a  clash  is  based  on  an  essen- 
tializing  view  and  on  the  fault  lines  of  polarization. 
The  pertinence  of  Islam  to  political  science  and 
also  to  international  relations  is  related  to  the  fact 
that  religion  and  politics  have  become  intertwined 
in  the  countries  of  Islamic  civilization.  Religion  is 
among  the  new  sources  of  conflict.  The  articula¬ 
tion  of  needs  in  religious  language  first  generates 
real  cultural  tensions,  which  later  become  political. 
There  are  many  levels  in  this  conflict.  The  first 
level  is  an  inner-Islamic  conflict  over  the  secular 
nation-state.  Under  conditions  of  globalization, 
this  conflict  within  Islam  is  then  extended  to  an 
international  one.  The  conflict  escalates  with  the 
rise  of  powerful  Islamist  movements  that  challenge 
the  existing  nation-states  in  the  world  of  Islam, 
with  the  aim  of  replacing  them  with  an  Islamic 
state.  This  claim  leads  to  tensions  and  conflicts, 
first  within  Islam  and  then  through  spillover 
effects  to  the  international  and  intercivilizational 
level.  The  inner-Islamic  and  international  levels 
intermingle  with  one  another. 

The  call  for  an  Islamic  state  is  not  to  be  con¬ 
flated  with  the  aim  of  restoring  the  caliphate,  as 
some  pundits  contend.  There  is  a  novelty,  next  to 
the  fact  that  politics  is  becoming  religionized  and 
religion  is  subjected  to  politicization.  This  novelty 
is  the  idea  of  an  Islamic  order  that  incorporates  the 
Islamic  state.  This  particularism  promotes  tensions 
that  ignite  conflict.  These  are  also  related  to  images 
of  the  self  and  of  the  other.  Of  course,  there  are 
socioeconomic  and  political  constraints  underlying 
the  political  conflict  articulated  in  religious  terms. 
This  articulation  in  religious  terms  matters  to  the 
analysis  in  the  sense  that  political,  social,  and  eco¬ 
nomic  concerns  become  religionized.  The  reference 
to  Islam  is  not  merely  a  cover,  nor  does  it  happen 
instrumentally,  as  some  Western  pundits  contend. 
The  religionization  of  conflict  gives  politics  a  new 
shape.  Conflict  becomes  intractable  because  beliefs 
are  not  negotiable.  Some  students  of  Islamic  poli¬ 
tics  argue  that  these  Islamist  Muslims  are  virtually 
secular,  because  they  use,  or  abuse,  religion  for 
nonreligious  ends.  However,  fieldwork  shows  that 
Islamists  act  in  good  faith  as  believers.  Their  good 
faith  is  not  to  be  confused  with  cynicism.  Many 
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Westerners  fail  to  understand  the  place  of  culture 
and  religion  in  post-bipolar  politics. 

Areas  of  Concern  in  Islam  and  Politics 

The  contemporary  view  of  Islamism  as  an  embodi¬ 
ment  of  religionized  politics  relates  Islam  to  the 
areas  of  knowledge  of  politics  and  law  and  also 
human  rights  in  a  contestation  of  the  seculariza¬ 
tion  of  the  world  of  Islam.  Political  scientists  are 
expected  in  their  study  of  Islam  to  be  wary  of  and 
to  avoid  the  biased  debate  in  which  accusations  of 
Orientalism  replace  scholarly  analysis.  Empirical 
evidence,  however,  has  shown  that  cultural  ten¬ 
sions  related  to  Islam  lead  to  political  conflict. 

The  study  of  religion  and  politics  covers  all  reli¬ 
gions,  but  the  pivotal  place  given  to  Islam  in  this 
analysis  is  related  to  the  following  assertions: 

1 .  Islam  as  a  religion  invades  the  public  sphere, 
teaching  that  the  primary  unit  of  society  is  the 
umma,  the  international  brotherhood  of  believers. 

2.  Islam  forces  people  to  follow  sharia  laws.  If  this 
law  is  elevated  to  state  law  in  a  process  of 
shariatization  of  politics,  then  conflicts  based  on 
a  religious  component  are  likely.  These  are 
much  harder  to  solve  than  purely  political 
conflicts.  The  involvement  of  religion  in  politics 
makes  conflict  intractable. 

3.  Political  Islam  is  central  to  post-bipolar  world 
politics  through  the  transformation  of  Islamic 
universalism — in  the  course  of  a  process  of 
politicization — into  a  politics  that  leads  to  a 
religion-based  internationalism.  This 
contemporary  Islamist  internationalism  should 
not  be  conflated  with  the  Pan-Islamism  of  the 
19th  century’s  Islamic  revival  since  from  it 
emerges  the  claim  of  a  new  world  order  based 
on  Islamic  tenets,  which  may  provide  an 
explanation  for  “Islam’s  geopolitical  war,” 
which  some  political  theorists  posit  in  their 
linking  of  the  terrorist  attacks  of  September  11, 
2001,  to  post-bipolar  politics.  Surely,  this 
assertion  is  also  a  contested  translation  of  the 
role  of  the  jihad,  which  must  be  more  exactly 
defined  as  the  effort  by  believers  to  get  closer  to 
the  will  of  God.  Therefore,  one  needs  to 
establish  a  distinction  between  traditional  jihad 
and  Islamist  jihadism. 


The  contention  that  there  is  a  model  of  an 
Islamist  state  matters  to  political  science.  In  inter¬ 
national  relations  of  the  post-bipolar  politics,  the 
competition  among  different  cultural  models  gen¬ 
erates  political  tensions  that  develop  domestically, 
regionally,  and  internationally.  Islam’s  problems 
with  cultural  modernity  are  central  in  this  context. 
The  ignited  tensions  occur  not  only  among  Muslims 
but  also  in  their  relations  to  the  “non-Muslim 
other”  in  an  international  environment  of  crisis. 

Conclusion 

Political  scientists  involved  in  peace  studies  and  in 
mediation  are  challenged  to  relate  civilizational 
encounters  to  the  study  of  conflict  from  the  point 
of  view  of  conflict  resolution.  However,  prevent¬ 
ing  the  clash  of  civilizations  cannot  dispense  with 
the  study  of  intra-  and  intercivilizational  conflict. 
Some  political  scientists  have  gone  to  the  other 
extreme  of  a  constructed  convergence  of  civiliza¬ 
tions.  Western  political  science  needs  a  better 
understanding  of  Islam  and  politics  and  the  related 
conflicts.  Again,  one  should  remember  that  a  con¬ 
flict  is  not  a  clash. 

The  study  of  religion  and  violence  in  Islam  also 
matters  to  political  science  in  its  reference  to  the 
place  of  jihad  and  jihadism  in  politics.  Many 
believe  that  there  can  be  a  democratic  response  to 
terrorism  to  replace  the  war  on  terror  pursued  in 
the  name  of  security.  The  political  interpretation 
of  jihad  as  a  “world  revolution,”  as  advocated  by 
Sayyid  Qutb,  allows — as  jihadism — the  resort  to 
violence  in  an  irregular  war.  This  is  also  among  the 
areas  of  concern  in  the  study  of  Islam  and  politics. 

The  political  theorist  John  Brenkman  states  that 
Islam  is  in  the  midst  of  a  civil  war.  Then,  he  boldly 
ascertains  that  most  of  the  dangers  are  emerging 
from  the  Muslim  world.  Today,  such  views  are 
often  qualified  as  an  expression  of  Orientalism, 
but  the  concern  articulated  in  the  above  statement 
matters  to  political  scientists.  The  study  of  con¬ 
flicts  generated  between  radical  Muslims  them¬ 
selves,  on  the  one  hand,  and  with  the  West,  on  the 
other,  is  a  concern  of  political  science.  The  cultural 
analysis  of  political  conflict  is  not  a  culturalism. 
The  conflict  within  Islam  is  ignited  by  political 
Islam  that  uses  Islam  not  only  to  gain  legitimacy  in 
its  Islamist  movements  but  also  as  a  concept  of 
order.  The  study  of  issues  such  as  the  resolution  of 
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intra-  and  intercivilizational  conflicts  and  the  com¬ 
bination  of  Islam  and  democracy  is  the  business  of 
political  science.  Another  area  of  study  is  the  his¬ 
tory  of  political  ideas.  In  medieval  Hellenized 
Islam,  there  existed  a  rational  political  philosophy 
established  by  Farabi  (870-950)  that  lasted  until 
Ibn  Khaldun  (1332-1406).  This  Islamic  humanism 
helps  establish  a  bridge  between  the  civilizations. 
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Islamist  Movements 


In  recent  decades,  Islamist  movements  have 
become  an  important  social  and  (sometimes) 
political  force  in  many  Arab  countries.  They  share 
three  major  characteristics: 

1.  Islamists  are  critical  of  prevailing  societal 
conditions  in  Arab  countries,  which  they 
describe  interchangeably  as  decadent, 
underdeveloped,  or  unjust. 

2.  They  blame  authoritarian  ruling  establishments 
for  these  societal  conditions  and  therefore 
consider  political  change  as  the  first  crucial  step 
toward  altering  Arab  reality.  Islamist 
movements  operating  in  the  Arab  world  differ 
from  missionary  groups  and  Islamic  charitable 
organizations,  which  see  change  as  a  long-term 
process  of  Islamizing  societies  that  takes  place 
outside  of  the  realm  of  politics. 

3.  They  legitimize  their  practices  and  create 
popular  appeal  for  their  movements  by  basing 
them  on  religious  norms  and  values,  which 
serve  as  the  ultimate  ideological  frame  of 
reference  for  society  and  politics. 

Within  the  broad  category  of  Islamist  move¬ 
ments,  groups  can  be  differentiated  by  their  atti¬ 
tudes  toward  violence  and  their  perceptions  of 
politics.  Militant  groups  such  as  the  Egyptian  Jihad 
and  the  Algerian  Islamic  Group  use  violence  and 
seek  to  establish  theocratic  states  as  the  sole  means 
of  changing  conditions  in  Islamic  societies.  Their 
sources  of  inspiration  are  either  idealized  interpre¬ 
tations  of  past  moments  in  Islamic  history  or  con¬ 
temporary  models  of  Islamic  republics,  be  they  in 
Iran,  Sudan,  or  even  Afghanistan.  Mainstream 
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Islamists  reject  violence  and  endorse  competition 
through  pluralistic  politics.  They  exclude  radical 
strategies  as  options  for  political  transformation 
and  see  gradual  democratic  openings  as  the  only 
viable  way  to  challenge  repressive  authoritarian 
regimes  in  the  Islamic  world. 

This  mainstream  position  is  relatively  new.  In 
the  1980s  and  1990s,  mainstream  movements  still 
had  not  accepted  the  value  of  democratic  gover¬ 
nance.  Caught  in  the  iron  grip  of  regime  oppres¬ 
sion  and  continuous  radicalization  at  the  outer 
edges  of  the  Islamist  spectrum,  these  movements 
were  sometimes  forced  out  of  the  official  political 
sphere  or  violently  banned  and  denied  any  public 
role.  In  a  few  countries  such  as  Morocco,  where 
Islamists  were  partly  integrated  in  the  political 
process,  Islamists’  preoccupation  with  rhetorically 
sound  though  politically  unattainable  goals — 
Islamization  of  legislative  environments  and  edu¬ 
cational  systems — did  not  help  them  overcome 
general  doubts  about  their  real  objectives.  By  the 
end  of  the  1990s,  Arab  Islamists  had  failed  to 
change  political  realities  in  their  homelands  despite 
considerable  popular  support.  This  failure 
prompted  various  revisionist  trends  that  gathered 
momentum  in  the  past  few  years.  As  a  result, 
mainstream  Islamists  have  become  increasingly 
receptive  to  democratic  procedures. 

Three  Paradigms  of  Islamist  Participation 

In  this  context,  it  is  possible  to  identify  three  major 
modes  of  mainstream  Islamist  participation  in 
politics.  The  first  comprises  the  Iraqi,  Lebanese, 
and  Palestinian  cases.  While  the  Islamist  move¬ 
ments  in  these  instances  operate  with  relative  orga¬ 
nizational  freedom  in  the  context  of  political  party 
plurality,  these  experiences  also  take  place  in  a 
climate  of  relative  chaos,  whether  because  foreign 
occupation  has  wrought  the  collapse  of  the  institu¬ 
tions  of  government  and  public  security  or  because 
an  ongoing  intractable  crisis  of  internal  discord  so 
hampers  the  efficacy  of  government  as  to  con¬ 
stantly  threaten  the  stability  of  the  political  system 
and  encourage  the  prevalence  of  monopolistic/ 
exclusivist  tendencies  that  conflict  with  the  spirit 
and  substance  of  peaceful  participation. 

The  Islamist  movements  in  Iraq,  Lebanon,  and 
Palestine  are  characterized  by  hierarchical  internal 
structures,  possession  of  the  means  to  exercise 


violence,  and  a  tendency  to  resort  to,  or  threaten 
to  resort  to,  violence  to  resolve  their  political  con¬ 
flicts.  The  inclusion  of  such  Islamist  movements  in 
plural  politics,  at  a  time  when  they  have  yet  to 
develop  a  full  commitment  to  peaceful  participa¬ 
tion  and  when  such  participation  is  perhaps  only  a 
tactic  within  the  framework  of  a  greater  strategy, 
reduces  or  even  eliminates  the  chances  of  propel¬ 
ling  political  plurality  forward  through  a  process 
of  democratization. 

In  sharp  contrast  to  these  instances,  the  second 
mode  of  Islamist  participation  in  politics  adopts 
participation  as  its  one  and  only  strategic  option. 
The  campaign  motto  “participation  comes  first” 
typifies  the  attitude  of  these  Islamists  who  are 
prominent  in  some  Arab  countries  such  as  Morocco, 
Algeria,  Kuwait,  and  Bahrain.  Whereas  Islamists 
form  small  parts  of  the  governments  of  Algeria  and 
Kuwait,  their  counterparts  in  Morocco  and  Bahrain 
lead  the  opposition  camps.  More  significantly, 
some  of  these  movements  have  succeeded  in 
formulating  a  functional  separation  between  reli¬ 
gious  proselytizing  activities  and  politics,  thereby 
transforming  themselves  into  solely  political  orga¬ 
nizations  guided  by  an  Islamist  code  but  run  by 
professional  politicians  and  the  activities  of  which 
steer  clear  of  the  rhetoric  and  activities  of  a  prose¬ 
lytizing  movement. 

In  spite  of  qualitative  differences  between  their 
movements,  the  Islamists  who  follow  the  motto 
“participation  comes  first”  share  several  funda¬ 
mental  characteristics.  Above  all,  they  honor  the 
legitimacy  of  the  nation-state  to  which  they  belong, 
and  they  respect  that  state’s  governing  institutions, 
the  principle  of  equality  among  all  citizens,  and  the 
pluralistic,  competitive  nature  of  political  life.  This 
attitude,  which  they  have  generally  adopted  as 
much  in  spirit  as  in  form,  has  led  to  the  decline  in 
exclusionist  rhetoric,  whether  directed  toward  rul¬ 
ing  establishments  or  to  the  liberal  and  leftist 
opposition.  It  has  also  resulted  in  a  gradual  shift 
away  from  ideological  diatribes  and  categorical 
judgments  and  toward  the  formulation  of  practical 
political  platforms  and  constructive  attempts  to 
influence  public  policy,  whether  as  minor  partners 
in  government  or  as  members  of  the  opposition. 

The  third  mode  of  Islamist  participation  is 
epitomized  by  the  cases  of  Egypt,  Sudan,  Jordan, 
and  Yemen.  In  spite  of  the  considerable  differ¬ 
ences  between  them,  Islamist  movements  in  these 
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countries  have  persisted  in  the  face  of  a  volatile 
political  space  and  the  fragility  of  their  relationship 
with  the  ruling  establishments.  If  in  Egypt  and 
Jordan,  Islamists  have  been  given  some  room  to 
participate  in  pluralistic  mechanisms,  in  legislative 
elections,  in  professional  syndicates,  and  in  other 
areas  of  civil  society,  the  sword  of  the  security  forces 
is  constantly  hanging  over  their  heads.  By  contrast, 
the  Islamist  movements  in  Sudan  and  Yemen  throw 
into  relief  the  danger  of  the  nondemocratic  accom¬ 
modations  Islamists  have  struck  with  paramilitary 
ruling  establishments  and  the  negative  impact  of 
such  alliances  on  political  life  and  on  the  internal 
dynamics  of  the  Islamists  themselves. 

The  motto  of  these  Islamists  may  be  described 
as  “participation  until  further  notice.”  They  may 
have  adopted  a  strategy  of  peaceful  participation, 
but  it  is  no  more  than  a  strategy.  In  view  of  the 
perpetual  fluctuation  of  their  role  in  the  political 
life  of  Egypt  and  Jordan  or  the  swings  in  their  posi¬ 
tions  from  partners  in  authoritarian  governments 
to  antagonists,  in  the  case  of  Sudan  and  Yemen, 
their  leaders  and  followers  continue  to  hover  in  the 
abstract  heights  of  ideology,  social  narratives,  and 
megapolicy  (the  role  of  religion  in  public  life,  the 
implementation  of  the  Islamic  Law,  and  the  re¬ 
creation  of  the  Muslim  nation)  while  ignoring  the 
need  to  evolve  a  culture  that  values  consensus 
making  and  constructive  mechanisms  for  influenc¬ 
ing  public  policy. 

Challenges  of  Islamist  Political  Participation 

When  examining  the  three  modes  of  Islamist  par¬ 
ticipation  in  politics,  special  attention  has  to  be 
given  to  the  impact  of  participation  on  the  organi¬ 
zational  complexity  of  Islamist  movements,  the 
separation  between  religious  proselytizing  and 
civil-political  components,  and  finally  the  balance 
between  Islamist  interest  in  public  policy  matters 
and  in  ideological  and  moral  issues. 

Essentially,  the  relatively  stable  experiences  of 
Islamists  in  the  political  process,  as  in  Morocco, 
Algeria,  Kuwait,  and  Bahrain,  contributed  to  the 
rapid  development  of  the  Islamists’  organizational 
units.  They  now  have  increasing  functional  spe¬ 
cialization  and  variation,  distinct  institutional 
separation  between  religious  and  civil  components, 
and  a  core  emphasis  on  public  policies.  However, 
the  volatile  experiences  of  Islamists  in  Egypt, 


Jordan,  and  Yemen  have  prevented  the  material¬ 
ization  of  equally  successful  trajectories,  despite 
the  fact  that  some  elements  of  progress  can  be 
noted. 

The  Islamists’  experience  in  Morocco,  Algeria, 
Kuwait,  and  Bahrain  testifies  to  the  existence  of  a 
direct  relationship  between  the  stability  of  the 
available  realm  for  political  participation,  as  a 
result  of  the  decline  in  the  government’s  recourse 
to  the  security  pretext  to  exclude  or  repress  them, 
and  a  relatively  rapid  rise  in  their  resolve  to  respect 
and  play  by  the  rules  of  the  game  and  to  reach 
consensual  agreements  over  the  conduct  of  public 
affairs  in  a  nonconfrontational  manner.  Never¬ 
theless,  these  Islamists  still  face  a  number  of  tests 
of  their  intent.  On  the  one  hand,  they  have  yet  to 
demonstrate  their  unconditional  commitment  to 
the  mechanisms  of  a  pluralistic  form  of  govern¬ 
ment,  even  if  those  mechanisms  produce  policies 
that  do  not  conform  to  their  religious  beliefs.  On 
the  other,  they  must  continue  to  convince  their 
constituencies  of  the  efficacy  of  peaceful  participa¬ 
tion  at  a  time  when  authoritarian  ruling  establish¬ 
ments  have  yet  to  shed  their  suspicions  of  the 
Islamists  and  to  accustom  themselves  fully  to  their 
participation  in  politics. 

Notwithstanding  the  fact  that  mainstream 
Islamists  have  developed  in  general  a  strategic 
commitment  to  the  peaceful  participation  option, 
the  constant  flux  that  defined  their  experiences  in 
Egypt  and  Jordan  or  their  shift  from  active  partici¬ 
pants  in  authoritarian  governments  to  semi¬ 
opposition  entities — as  is  the  case  in  Sudan  and 
Yemen — contributed  to  the  erosion  of  their  popu¬ 
lar  support  or  the  emergence  of  a  confused  mental¬ 
ity  among  their  leaders.  Naturally,  both  of  these 
consequences  negatively  affected  the  capacity  of 
the  Islamist  movements  to  take  meaningful  advan¬ 
tage  of  the  participation  option. 

Because  their  participation  lacks  the  relative 
stability  of  Islamists’  experiences  in  Morocco, 
Algeria,  Kuwait,  and  Bahrain,  Egyptian  and 
Jordanian  Islamists  have  remained  preoccupied 
with  the  traditional  mix  of  normative,  social,  and 
political  issues  with  no  discernable  evolutionary 
trajectory  toward  more  pragmatism.  Both  move¬ 
ments  have  constantly  demonstrated  various  signs 
of  internal  dissent  and  infighting  between  partici¬ 
pation-friendly  and  participation-critical  leaders 
and  members. 
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And  although  participation-critical  voices  inside 
Egyptian  and  Jordanian  Islamist  movements  have 
not  been  pushing  for  giving  up  their  commitment 
to  nonviolence,  they  have  been  putting  forward 
two  alternatives  that  are  worth  noting  and  bear 
potential  risks:  (1)  retreat  from  politics  to  the 
social  sphere  and  (2)  increased  confrontational 
attitudes  toward  the  ruling  establishments.  The 
notion  of  retreating  to  the  social  sphere  has  never 
lost  its  appeal  to  Islamists,  especially  in  moments 
of  political  exclusion  and  prosecution,  as  a  tempo¬ 
rary  strategy  to  regroup  and  revitalize  their  move¬ 
ments.  However,  in  Egypt  and  Jordan  today,  calls 
to  leave  politics  and  retreat  to  society  are  increas¬ 
ingly  articulated  in  a  definite  way  that  attests  to 
the  internal  fatigue  and  frustration  of  Islamists.  In 
contrast,  other  voices  have  been  pushing  for 
assuming  more  hard-line  stances  toward  official 
policies  and  for  using  whatever  political  space 
Islamists  can  sustain  to  confront  the  ruling  estab¬ 
lishments.  Looking  at  the  ongoing  polarization  of 
Egyptian  and  Jordanian  politics,  primarily  driven 
by  ruling  establishment/Islamist  conflicts,  this  is 
clearly  a  recipe  for  instability  and  reform  freeze. 

The  current  challenges  facing  Islamist  move¬ 
ments  are  inextricably  intertwined  with  the  overall 
distortions  that  define  the  contemporary  Arab 
political  reality.  The  participation  of  Islamists  in 
and  of  itself  does  not  prolong  the  realization  of  a 
healthy  democratic  order,  nor  for  that  matter  does 
it  bring  their  polities  closer  to  that  order. 
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Item-response  models  define  a  set  of  data  reduc¬ 
tion  techniques  that  relate  observed  discrete  vari¬ 
ables  to  a  much  smaller  set  of  unobserved  latent 
variables.  These  models  are  primarily  used  in 
political  science  to  relate  observed  votes  on  legis¬ 
lative  roll  calls  to  legislators’  unobserved  ideology. 
Originally  developed  in  the  context  of  educational 
testing,  these  models  were  designed  to  recover 
characteristics  of  test  questions  (items)  and  the 
“abilities”  of  student  test  takers.  Consequently, 
much  of  the  model  description  and  interpretation 
retains  education  terminology.  The  data  involves 
N  test  takers  (indexed  by  i)  answering  M  test 
questions  (indexed  by  /),  with  an  answer  (response) 
by  student  i  on  question  j  (ytf  equal  to  one  if  cor¬ 
rect,  zero  otherwise.  In  legislative  voting,  the  “test 
takers”  are  the  legislators,  the  “items”  are  the  roll 
calls,  and  the  “ability”  is  the  legislator’s  ideology. 
The  canonical  model  for  the  answers  posits  the 
following  probability  model: 


Pr(Ty 


1)  = 


exp(x;p/  -  a,) 

I  +  exp(x;(3/  —  a,-)  ’ 


where  x,  is  the  student’s  ability  and  parameters  (3, 
and  Uj  characterize  the  question.  This  is  simply  a 
logit  model  of  a  correct  answer  as  a  function  of 
one  independent  variable,  the  ability  measured  by 
the  test,  with  the  additional  complication  that  the 
independent  variable  is  unobserved.  Of  course, 
other  cumulative  density  functions  can  be  used 
instead  (e.g.,  the  normal  for  a  probit  model).  This 
is  a  latent  variable  model,  similar  to  factor  analy¬ 
sis.  The  test  items  are  all  observable  indicators 
(manifest  variables)  of  the  unobservable  common 
factor  x,  with  (3;  analogous  to  a  factor  loading.  In 
the  item-response  case,  all  the  indicators  are 
binary,  whereas  they  are  continuous  in  factor 
analysis. 
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The  probability  model  is  frequently  reformu¬ 
lated  as 


Pr  (yv  =  1)  = 


exp(ft;(x;  -  Ky)) 

1  +  exp(p,(x,  -  k,))  ’ 


because  of  a  higher  level  of  ability  but  because  he 
or  she  happened  to  mark  the  answer  correctly.  The 
“pseudo”  aspect  of  guessing  is  that  students  do  not 
necessarily  guess  but  use  reasoning  processes  unre¬ 
lated  to  their  ability  (sometimes  actually  worse 
than  pure  guessing). 


with  specific  interpretations  to  each  item  parame¬ 
ter.  The  slope  coefficients  (3,  are  the  discrimination 
parameters  and  the  values  of  k;  are  difficulty 
parameters.  Discrimination  is  the  extent  to  which 
a  test  question  separates  students  with  higher  and 
lower  levels  of  abilities.  If  (3,  =  0,  then  the  test 
question  does  not  measure  the  ability  gauged  by 
the  test.  Higher  values  of  k;  indicate  a  more  diffi¬ 
cult  question,  k,  represents  the  ability  level  of  a 
student  who  has  a  50%  probability  of  answering 
the  question  correctly  and  50%  answering  incor¬ 
rectly.  Note,  k j  and  x,  are  on  the  same  scale;  if 
Xi  =  Kh  Pr(y„  =  1)  =  .5. 

Other  Item-Response  Models 

A  special  case  is  the  one-parameter  model,  known 
as  the  Rasch  model  (after  Georg  Rasch): 


Pr  {y,i  =  1)  = 


exp(X;  —  Ky) 

1  +  exp(x,  —  Ky) 


This  model  assumes  that  every  item  discriminates 
abilities  equally  well  ((3;  =  1  for  all  questions  j).  In 
this  model,  all  the  observed  information  is  con¬ 
tained  in  the  total  score,  the  number  of  correct 
responses:  an  extremely  desirable  property  for 
tests.  This  property  is  only  true  for  the  logistic  one- 
parameter  model  though. 

A  generalization  of  the  two-parameter  model  is 
the  three-parameter  model: 


Pr(y,7  =  1)  =  7/  +  (1 


,  ,  exP(ft/(X;  ~  K,)) 

'  1  +  exp((3;  (x,  -  Ky) )  ’ 


where  7 y  is  a  “pseudoguessing”  parameter.  It 
allows  for  the  absolute  worst  student  (with  ability 
approaching  negative  infinity)  to  have  a  positive 
probability  of  answering  a  question  correctly;  that 
is,  a  student  answers  a  question  correctly,  not 


Political  Science  Applications 

The  original  application  of  the  model  is  educational 
testing,  but  the  item-response  model  represents  a 
general  latent  variable  model  for  any  application 
that  posits  an  abstract,  unobserved  concept  recov¬ 
erable  from  observed  manifest  binary  variables. 
One  prominent  application  in  political  science  is 
the  estimation  of  legislator  ideal  points.  In  the  spa¬ 
tial  model  of  voting,  legislators  choose  between 
“yea”  and  “nay”  alternatives,  with  the  “nay”  alter¬ 
native  typically  representing  the  status  quo  and 
“yea”  a  change  to  the  status  quo.  The  utility  of 
each  choice  depends  on  how  far  each  alternative  is 
from  the  legislator’s  “ideal”  policy.  The  legislator 
votes  for  whichever  is  closer  to  the  legislator’s  ideal 
point.  This  model  also  applies  to  other  actors  who 
take  announced  positions,  such  as  Supreme 
Court  justices  or  members  of  the  boards  of  inde¬ 
pendent  agencies.  Joshua  Clinton,  Simon 
Jackman,  and  Douglas  Rivers  demonstrate  that 
when  assuming  a  particular  functional  form  for 
this  utility,  the  model  for  this  decision  reduces  to 
the  standard  item-response  model  from  educa¬ 
tional  tests.  The  discrimination  parameter  indi¬ 
cates  the  extent  to  which  a  particular  vote  relates 
to  an  underlying  liberal-conservative  dimension. 
In  educational  testing,  (3;  >  0,  since  test  questions 
that  students  with  lower  ability  have  a  higher 
probability  of  answering  correctly  than  those 
with  higher  abilities  are  counterproductive  to  the 
purpose  of  the  test.  In  the  legislator  context 
though,  the  sign  of  (3;  indicates  whether  a  “yea” 
vote  is  liberal  (negative)  or  conservative  (posi¬ 
tive).  The  difficulty  parameter  is  the  “cut  point” 
between  the  two  choices,  that  is,  the  ideal  point 
for  a  legislator  who  is  completely  indifferent 
between  the  alternatives. 

While  item-response  models  are  frequently 
described  as  unidimensional,  the  model  can  also 
be  multidimensional.  Instead  of  a  general  liberal- 
conservative  dimension,  one  could  specify  policy 
preferences  over  multiple  areas  (e.g.,  economic 
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policy,  social  policy,  and  foreign  affairs).  The 
rotational  indeterminacy  that  affects  factor  analy¬ 
sis  is  also  present  in  this  model.  To  recover  the 
parameters,  some  restrictions  must  be  imple¬ 
mented.  Specific  legislators  can  be  fixed  to  specific 
positions  in  the  ideological  space,  with  estimates 
of  other  parameters  relative  to  these  positions. 
More  useful  in  these  applications  are  restrictions 
on  P;,  similar  to  those  restrictions  found  in  confir¬ 
matory  factor  analysis,  such  as  restricting  some 
votes  to  loading  on  only  one  of  the  policy  areas 
(with  the  discrimination  parameters  of  other 
issues  restricted  to  zero). 

Estimation 

Since  (3;,  k;,  and  x,  are  all  unobserved,  estimation 
can  be  tricky.  An  additional  complication  is  the 
large  number  of  parameters  to  estimate.  For  a 
one-dimensional  model,  there  are  2 M  +  N  param¬ 
eters.  In  the  legislative  case,  that  can  easily  exceed 
1,000  parameters.  Maximizing  the  likelihood  over 
all  parameters  is  actually  not  recommended,  since 
it  does  not  provide  consistent  estimates  of  the 
parameters.  In  educational  testing,  the  dominant 
approach  estimates  only  the  item  parameters, 
which  are  used  to  calibrate  standardized  tests;  esti¬ 
mates  of  the  abilities  are  less  consequential. 
Marginal  maximum  likelihood  assumes  that  the 
latent  variable  parameter  x  is  normally  distributed 
and  then  “averages  out”  (integrates  over)  to  pro¬ 
duce  a  model  with  only  the  item  parameters. 
Estimates  of  the  ability  parameters  can  be  obtained 
a  posteriori  by  calculating  the  expected  value  given 
the  answers  and  item  parameters. 

In  political  science,  recovering  estimates  of  the 
latent  variables  is  actually  more  important.  To 
simultaneously  estimate  the  item  parameters  and 
the  latent  variables,  Bayesian  approaches  are  used. 
Assuming  a  priori  the  parameters  follow  certain 
distributions  (some  of  the  same  assumptions  used 
in  the  marginal  maximum  likelihood  approach) 
and  the  model  of  the  data  produces  the  joint  distri¬ 
bution  of  item  and  ability  parameters.  These 
parameters  can  be  summarized  by  simulating  from 
this  distribution  using  Markov  Chain  Monte  Carlo 
(MCMC)  methods.  The  Rasch  model  is  much 
easier  to  estimate  through  maximum  likelihood 
conditional  on  the  total  score.  The  three-parameter 


model  is  estimated  using  the  same  methods  as  the 
two-parameter  model,  but  the  pseudoguessing 
parameter  can  be  difficult  to  estimate.  As  a  result, 
restrictions  on  7,  in  the  three-parameter  model  are 
often  implemented. 

Other  Types  of  Data 

While  the  standard  item-response  model  is  fitted 
to  binary  (correct/incorrect,  yea/nay)  responses, 
the  model  is  extendable  to  polytomous  data.  One 
application  involves  nominal  responses  from 
multiple-choice  tests.  Instead  of  coding  answers  as 
merely  correct  or  incorrect,  responses  to  the  sepa¬ 
rate  correct  and  incorrect  responses  are  modeled. 
The  model  is  the  same  as  the  binary  case,  except 
that  the  probability  function  is  analogous  to  a 
multinomial  logit  model  for  each  multiple-choice 
question.  While  there  have  been  political  science 
applications  of  this  model,  political  science 
researchers  are  more  often  confronted  with 
ordered  responses. 

In  educational  testing,  ordered  models  are  used 
for  the  grading  of  short  answers  and  essays  or  any 
question  for  which  partial  credit  is  granted.  In 
most  instances  where  political  scientists  use  factor 
analysis,  the  ordered-item/response  model  will  be  a 
more  appropriate  choice,  especially  for  the  Likert- 
type  scales  found  in  most  surveys.  Applications 
include  the  estimation  of  survey  respondent  ideol¬ 
ogy,  policy  preferences,  or  political  knowledge. 
The  model  is  applicable  to  any  circumstance  where 
a  latent  variable  of  interest  is  measured  using  ordi¬ 
nal  variables.  For  instance,  when  the  unit  of  analy¬ 
sis  is  countries,  one  can  recover  from  ordinal  items 
the  level  of  democracy  in  these  countries,  as  well  as 
numerous  other  characteristics  of  government  and 
society,  such  as  the  economic  development,  cor¬ 
ruption,  or  adherence  to  human  rights. 

There  are  also  item-response  models  using  non¬ 
multinomial  probability  models.  For  instance,  the 
responses  can  be  frequencies  or  relative  frequen¬ 
cies.  In  political  science,  such  models  have  been 
used  to  estimate  the  ideological  positions  of  parties 
or  legislators.  For  parties,  the  responses  are  rela¬ 
tive  word  frequencies  in  party  platforms;  for  legis¬ 
lators,  relative  word  frequencies  in  speeches.  The 
number  of  times  observation  i  uses  word  /  (y,;)) 
follows  a  Poisson  distribution,  with  mean  number 
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of  counts  Xjj  —  exp(x,P;  —  a,),  which  specifies  a 
count  regression  model. 

The  item-response  model  also  allows  for  some 
more  unique  data  structures,  such  as  multiple-rater 
data.  J  raters  evaluate  each  of  N  observations 
based  on  M  characteristics  (items).  The  raters 
place  the  observations  on  each  of  the  (typical 
ordered)  items.  The  models  produce  estimates  of 
the  observations’  latent  quantity,  as  well  as  dem¬ 
and  rater-specific  parameters.  This  approach 
allows  researchers  to  incorporate  evaluations  from 
multiple  experts  to  create  more  accurate  measures. 
For  example,  researchers  can  collect  through 
expert  surveys  the  evaluations  of  scholars  in 
bureaucracy  to  measure  the  ideology  of  agencies  or 
ratings  from  comparative  politics  experts  to  esti¬ 
mate  the  ideology  of  political  parties  or  the  level  of 
democracy  of  a  country. 

Conclusion 

The  item-response  model,  while  developed  for 
educational  testing,  is  a  general  measurement 
model  for  many  different  types  of  discrete  data, 
with  many  potential  extensions.  It  is  particularly 
well  suited  for  political  science  applications,  which 


use  many  abstract  theoretical  concepts  measured 
with  discrete-level  data. 
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Joint-Decision  Trap 


The  joint-decision  trap  is  the  name  for  a  particular 
situation  that  arises  in  the  making  of  public  pol¬ 
icy.  It  is  relevant  for  systems  that  are  divided  into 
layers  of  government  where  decisions  are  made 
jointly  by  the  central  and  lower  levels.  These  sys¬ 
tems  may  turn  into  traps,  in  the  sense  that  changes 
from  the  status  quo  are  difficult.  Actors  are  caught 
in  a  deadlock.  The  joint-decision  trap  model  has 
had  the  most  impact  on  the  understanding  of 
politics  in  the  European  Union  (EU)  and  the 
German  federal  system  but  is  relevant  for  a  wide 
range  of  multilevel  systems. 

The  joint-decision  trap  was  identified  by  the 
German  scholar  Fritz  W.  Scharpf,  first  in  German 
federalism  and  later  in  policy  making  in  the  EU. 
Scharpf  developed  the  thesis  in  the  early  1980s  on 
the  basis  of  many  years  of  studies  of  institutional 
rigidities  in  German  federal  politics.  The  German 
version  of  the  thesis  was  published  in  the  mid- 
1980s,  and  the  English  version  followed  a  few  years 
later.  The  thesis  was  notable  for  two  reasons.  First, 
it  was  one  of  the  first  neo-institutional  analyses  of 
policy  making  in  the  EU.  As  such,  it  helped  pave  the 
way  for,  or  even  spearheaded  the  breakthrough  of 
the  neo-institutional  paradigm  into,  studies  of  the 
EU.  Second,  the  thesis  avoided  the  sui  generis 
approach  to  the  EU  that  until  then  had  tended  to 
isolate  studies  of  European  integration  from  main¬ 
stream  social  science.  Scharpf’s  approach  was  com¬ 
parative.  He  sought  to  understand  policy  dynamics 
in  the  EU  by  comparing  it  with  German  federalism. 


The  joint-decision  trap  is  a  designation  for  a  spe¬ 
cific,  but  far  from  uncommon,  institutional  setting 
for  policy  making.  Two  institutional  conditions  are 
specified:  (1)  central  government  decisions  must  be 
directly  dependent  on  the  agreement  of  constituent 
governments  and  (2)  the  agreement  of  constituent 
governments  must  be  unanimous  or  nearly  unani¬ 
mous.  These  conditions  fit  the  cases  of  German 
federal  politics  and  the  EU  reasonably  well,  at  least 
at  the  time  when  Scharpf  was  writing.  A  logic  of 
action  follows  from  these  conditions  that  leads  to  a 
stalemate  and  a  severe  bias  in  favor  of  the  status 
quo.  Whenever  a  proposed  change  of  policy  has  a 
negative  impact  on  just  one  actor,  this  actor  will 
want  to  oppose  the  change  unless  it  is  compensated 
for  the  loss.  Since  all  actors  possess  (near-)veto 
rights  and  because  adequate  compensation  is  often 
difficult,  policy  changes  will  often  be  blocked.  In 
most  cases,  existing  policy  cannot  be  changed  as 
long  as  it  is  preferred  even  by  one  or  a  few  partici¬ 
pants.  In  a  dynamic  environment,  this  is  critical. 
The  quality  of  public  policy  will  gradually  worsen. 
The  result  will  be  the  continuation  of  past  policies 
in  the  face  of  a  changing  policy  environment  and 
policies  that  are  suboptimal  even  by  their  own 
original  criteria. 

To  make  matters  worse,  the  joint-decision  trap  is 
also  a  trap  in  another  sense.  It  not  only  leads  to 
pathological  policy  choices,  but  it  also  blocks  its 
own  further  institutional  evolution.  The  present 
institutional  setup  represents  the  local  optimum  in 
the  actors’  cost-benefit  calculation.  It  is  a  Nash 
equilibrium.  There  is  thus  no  way  that  joint-decision 
systems  may  transform  themselves  into  institutional 
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arrangements  of  greater  policy  potential.  Insti¬ 
tutional  change  would  require  external  interven¬ 
tion.  All  in  all,  the  joint-decision  trap  paints  a 
pessimistic  picture  of  modern  political  systems  and 
is  close  to  structural  determinism. 

The  joint-decision  trap  is  an  abstract  analytical 
model  that  may  be  used  in  a  variety  of  settings. 
Scharpf  originally  suggested  that  the  logic  might 
apply  to  settings  as  diverse  as  political  coalitions, 
military  alliances,  and  neo-corporatist  arrange¬ 
ments.  In  the  original  version,  however,  the  trap 
has  had  most  resonance  in  studies  within  its  own 
empirical  field — namely,  intergovernmental  rela¬ 
tions  in  modern  political  systems.  It  has  been  used 
to  understand  systems  in  several  countries — for 
instance,  Canada  and  Denmark.  But  its  most  pro¬ 
found  influence  has  probably  been  within  studies 
of  German  federalism,  where  it  is  used  almost  as  a 
matter  of  routine  today,  and  in  studies  of  policy 
making  in  the  EU,  where  it  is  standard  material  for 
textbooks. 

The  joint-decision  trap’s  influence  on  EU  studies 
may  strike  one  as  paradoxical.  The  trap  was  identi¬ 
fied  and  the  thesis  developed  after  a  long  period  of 
“eurosclerosis.”  According  to  many  observers,  the 
1970s  and  early  1980s  were  a  period  of  stagnation 
for  the  EU  where  the  hopes  from  the  1950s  and 
1960s  for  further  integration  were  strangled  in  a 
quagmire  of  stagnation  and  policy  deadlocks. 
Scharpf ’s  joint-decision  trap  was  widely  acknowl¬ 
edged  as  an  explanation  of  this  development. 
However,  the  20  years  that  have  passed  since  the 
publication  of  Scharpf ’s  article  in  1988  have  been  a 
period  of  unparalleled  dynamism  and  creative  pol¬ 
icy  making  in  the  EU.  Through  a  series  of  treaty 
changes,  the  EU  has  experienced  several  rounds  of 
enlargement,  the  number  of  policy  areas  covered 
by  the  EU  has  increased,  the  Council  of  Ministers 
has  changed  its  decision  rule  from  one  of  unanim¬ 
ity  to  qualified  majority  voting  in  most  areas,  and 
the  European  Parliament  has  become  a  full  colegis¬ 
lator  in  most  areas. 

At  first  sight,  these  changes  might  be  considered 
a  refutation  of  the  logic  of  the  joint-decision  trap. 
However,  the  development  has  spurred  investiga¬ 
tions  into  ways  in  which  decision  makers  may 
avoid  joint-decision  traps.  B.  Guy  Peters  argues 
that  segmentation  of  policy  making  at  levels  below 
the  Council  of  Ministers  and  the  influence  of  pol¬ 
icy  entrepreneurs  may  help  actors  avoid  the  trap. 


Adrienne  Heritier  finds  that  actors  in  the  EU  system 
engage  in  politics  of  subterfuge  and  creatively  use 
strategies  of  compensation,  compromise,  and  pack¬ 
age  deals  to  avoid  stalemates.  Jens  Blom-Hansen 
argues  that  traps  need  not  be  inherent  in  joint- 
decision  systems.  If  the  central  government  has  an 
exit  option  in  its  relations  with  subcentral  govern¬ 
ments,  it  may  switch  between  arenas  and  use  both 
“exit”  and  “voice”  options  to  pursue  its  interests. 
In  this  way,  joint-decision  systems  may  turn  into 
an  asset. 

Scharpf,  in  a  reevaluation  of  the  joint-decision 
trap  20  years  after  it  was  first  proposed,  argues 
that  such  an  exit  option  is  in  fact  also  available  to 
the  central  decision  maker  in  the  EU.  Rule  making 
may  follow  the  well-known  community  method 
according  to  which  the  Commission  presents  pro¬ 
posals,  and  the  Council  of  Ministers  and,  in 
selected  areas,  the  European  Parliament  enact 
them.  This  is  the  area  where  the  joint-decision  trap 
is  operative.  However,  rule  making  may  also  fol¬ 
low  the  “supranational-hierarchical  mode.”  This 
takes  place  in  the  judicial  arena.  To  use  it,  the 
Commission  exploits  the  doctrines  of  supremacy 
and  direct  effect  of  European  law.  To  make  rules, 
the  Commission  only  needs  to  assert  that  a 
particular  rule  should  have  direct  effect  and  that 
certain  laws  or  practices  in  member  states  are  in 
violation  of  it.  If  these  arguments  are  then  upheld 
by  the  European  Court  of  Justice,  the  interpreta¬ 
tion  will  be  the  law  of  the  land  in  all  member  states 
without  any  further  action  by  governments  or 
parliaments.  In  short,  the  Commission  may  strate¬ 
gically  use  both  the  traditional  community  method 
and  the  “supranational-hierarchical”  way  to 
change  the  status  quo. 

However,  although  the  agenda-setting  powers 
of  the  Commission  and  rule  making  by  use  of  the 
European  Court  of  Justice  may  reduce  the  impact 
of  the  joint-decision  trap,  it  is  still  likely  to  be 
operative  in  EU  politics.  The  number  of  member 
states  has  now  increased  to  27,  and  although  una¬ 
nimity  is  no  longer  necessary  in  the  Council, 
qualified  majority  voting  is  still  quite  a  demanding 
decision  rule.  As  a  result,  decision  making  in  the 
EU  is  still  cumbersome,  difficult  to  manage,  and 
easily  blocked.  Scharpf’s  thesis  on  joint-decision 
traps  helps  us  understand  why. 

More  broadly,  Scharpf’s  contribution  highlights 
how  easily  two-level  decision  systems  where  the 
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central  level  is  dependent  on  the  (near-)unanimous 
agreement  of  the  constituent  levels  fall  prey  to 
stalemate  and  institutional  deadlock.  Since  almost 
all  countries  are  divided  into  several  layers  of  gov¬ 
ernment,  there  is  wide  scope  for  empirical  applica¬ 
tion  of  the  thesis.  Not  only  has  the  joint-decision 
trap  increased  our  understanding  of  the  empirical 
world,  but  it  has  also  stimulated  further  theoretical 
development  in  the  social  sciences.  This  holds  for 
both  theorizing  on  institutional  deadlock,  such  as 
veto  player  theory,  and  theorizing  on  circumvent¬ 
ing  deadlocks. 
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Judaism 


Judaism  is  the  religion,  philosophy,  and  culture  of 
the  Jewish  people.  This  entry  presents  the  basic 
beliefs  and  practices  of  the  Jewish  religion,  its 
adaptation  to  exile,  its  relationships  with  other 


religions,  its  Ashkenaz  and  Sepharad  components, 
and  the  role  of  rabbis;  then,  following  emancipa¬ 
tion,  new  religious  and  secular  developments;  and, 
in  conclusion,  contemporary  challenges. 

Judaism  is  the  first  monotheistic  religion.  It 
appeared  with  the  Hebrew  Patriarchs — Abraham, 
Isaac,  and  Jacob — and  the  Matriarchs — Sarah, 
Rebecca,  Rachel,  and  Leah — some  3,500  years  ago. 
Judaism  crystallized  with  Moses,  with  emphasis  on 
the  exodus  from  Egypt,  freedom  from  bondage, 
and  the  revelation  of  the  Ten  Commandments.  In 
Judaism,  God  is  transcendent — beyond  representa¬ 
tion — omnipotent,  benevolent.  He  created  the 
world,  revealed  himself,  and  will  lead  to  redemp¬ 
tion.  The  covenant  between  God  and  Abraham, 
renewed  at  Mount  Sinai  between  God  and  the 
Children  of  Israel,  is  an  eternal  one:  Jews  exist  as 
a  representative  of  the  deity  (kingdom  of  priests, 
holy  people:  Exodus  19:5-6),  God  exists  because 
Jews  proclaim  Him.  Basic  monotheistic  beliefs  are 
embodied  in  the  commandments:  248  positive  and 
365  prohibitive  or  negative,  for  a  total  of  613. 
Gentiles,  to  share  the  world  to  come,  must  keep 
the  Seven  Noahide  Laws  (prohibition  of  idolatry, 
blasphemy,  bloodshed,  sexual  sins,  theft,  and  eat¬ 
ing  of  flesh  from  a  living  animal  and  establishment 
of  a  fair  legal  system).  From  the  conquest  of 
Canaan,  the  establishment  of  a  Jewish  sovereignty, 
until  the  destruction  of  the  Second  Temple  by 
Titus  in  70  CE,  Judaism  is  the  religion  of  a  specific 
people  with  a  specific  belief,  monotheism. 

This  explains  the  savagery  of  the  war  between 
Rome  and  Jerusalem.  The  Roman  Empire,  to 
impose  its  rule  on  a  defeated  people,  mandated 
that  Caesar  be  worshipped  as  divine.  Judeans 
could  not  accept  a  god  other  than  the  Only  One. 
Wars  continued  for  close  to  a  century;  Jerusalem 
was  captured;  the  Temple  destroyed;  and  the  very 
name  of  the  country,  Judea,  replaced  by  Palestine 
to  erase  any  memory  of  a  people  and  a  religion  so 
different  from  that  of  Rome. 

Faced  with  a  new  situation,  disappearance  of  its 
political  and  religious  institutions,  exile,  and  minor¬ 
ity  status,  Judaism  had  no  other  choice  but  to 
adapt.  The  first  innovation  was  to  write  down  what 
had  been  until  then  the  Oral  Law,  explanations  and 
comments  on  the  Written  Law,  Deuteronomy  (the 
fifth  book  of  the  Torah,  the  Hebrew  Bible).  This 
massive  effort  gave  birth  to  the  Talmud,  which 
would  become  the  authoritative  interpretation  of 
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Judaism,  its  source  of  jurisprudence  and  education. 
The  only  group  of  Jews  who  had  no  knowledge  of 
the  Talmud  was  the  group  in  Ethiopia.  The  second 
innovation  was  an  increased  value  accorded  to  reli¬ 
gion  as  compensation  for  lost  sovereignty.  Such 
reliance  on  religion  has  been  the  mark  of  other 
oppressed  peoples  (Poles,  Irish,  Thai,  etc.). 

Exilic  Judaism,  with  its  synagogues  as  substi¬ 
tutes  for  the  Temple  of  Jerusalem  and  the  Talmud 
as  its  basic  source  of  knowledge,  dominated  the 
following  2,000  years.  Halacha,  Jewish  Law,  was 
the  core  of  Judaism,  a  code  of  laws,  practices,  and 
observances.  Based  on  the  Pentateuch,  Talmud, 
and  the  decisions  of  sages  and  rabbis  until  contem¬ 
porary  times,  Halacha  is  binding  for  the  Orthodox, 
though  not  for  other  groups  within  Judaism. 

Relationship  with  the  two  other  monotheistic 
religions,  Christianity  and  Islam,  became  crucial. 
Both  have  their  origin  in  Judaism  and  believe  that 
they  have  superseded  Judaism.  Most  Jews  were  liv¬ 
ing  either  in  Christian  Europe  or  in  Muslim  North 
Africa  and  the  Middle  East.  Both  Christianity  and 
Islam  tried  to  impose  conversion  but  eventually 
accepted  the  presence  of  Jews,  if  only  as  a  despised 
minority,  their  inferior  status  presented  as  a  proof 
that  they  were  no  longer  the  Chosen  People  of  the 
Living  God.  With  great  differences  according  to 
varying  times  and  places,  Jews  played  a  significant, 
even  if  usually  not  recognized,  role  in  the  develop¬ 
ment  of  Christian  and  Muslim  cultures.  Compara¬ 
tively  few  Jews  settled  in  nonmonotheistic  countries 
such  as  India  and  China,  where,  although  their 
religion  seemed  strange,  they  were  not  subjected  to 
persecution  or  accorded  any  special  status. 

In  exile,  a  distinction  appeared  between  Ashken¬ 
azic  and  Sephardic  Jews.  Ashkenaz,  who  first  set¬ 
tled  along  the  Rhine  River,  circa  1000  CE, 
expanded  toward  Central  and  Eastern  Europe, 
then  to  the  New  World.  Sepharad,  originally  from 
Muslim  Spain,  formed  new  communities  around 
the  Mediterranean  Sea  after  the  1492  expulsion. 
Today,  Sephardim  are  usually  linked  with  Edot 
Hamizrah,  Oriental  Jews  (Iran,  Yemen,  Kurdistan, 
etc.).  The  major  differences  are  historical — with 
most  Ashkenaz  living  in  Christian  countries,  most 
Sepharad  in  Muslim  ones — and  liturgical.  In  most 
places,  for  generations,  down  to  the  present  day, 
as  soon  as  Jews  from  a  certain  geographic  area 
have  constituted  a  critical  mass,  they  have  opened 
their  own  synagogues.  Due  to  migrations,  most 


major  communities  have  both  Ashkenazi  and 
Sephardi  synagogues,  but  they  are  often  united  at 
the  religious  services  level  (Kosher  food,  rabbini¬ 
cal  courts,  etc.) 

In  Israel,  the  Chief  Rabbinate  is  legally  headed 
by  two  chief  rabbis,  one  Ashkenazi  and  one 
Sephardi.  The  liturgical  distinction  has  become 
politicized.  From  a  strictly  religious  one,  it  has 
become  a  social  one  (Ashkenazi  dominancy  in  the 
old  elite,  challenged  by  Sephardim)  and  eventually 
a  political  distinction.  Of  the  three  major  religious 
parties,  one  is  Ashkenazi,  one  is  Sephardi,  and  one 
is  mixed. 

Around  the  11th  century,  a  new  actor  emerged 
in  Judaism:  the  rabbi.  The  general  custom  had  been 
to  have  a  learned  man,  versed  in  Jewish  law  and 
engaged  in  worldly  endeavors,  to  serve  as  a  reli¬ 
gious  interpreter  of  the  law.  The  new  rabbis  became 
experts  in  that  field,  engaged  and  paid  by  Jewish 
communities  and  recognized  as  the  official  religious 
leaders  ( rabbi  means  “my  master”).  They  had  to 
compete  with  dayanim,  religious  judges  involved  in 
judicial  matters  and  not  in  community  ones,  and, 
later,  with  the  directors  of  yeskivot,  institutes  of 
higher  Talmudic  studies.  The  modern  trend,  either 
on  a  voluntary  basis  or  imposed  by  institutions,  is 
to  belong  to  a  rabbinical  association.  They  offer  a 
common  philosophical  ground,  following  the  vari¬ 
ous  streams  of  Judaism,  a  consultation  place,  and 
services  such  as  health  insurance,  pensions,  and  so 
on.  In  contemporary  times,  the  rabbinate  as  an 
institution  is  more  a  professional  association  than  a 
political  force  within  the  Jewish  community. 

Emancipation  and  democratization  were  the 
hallmarks  of  modernity  for  Jews.  With  emancipa¬ 
tion,  European  Jewry  underwent  a  transformation: 
Whereas  previously  there  were  structured,  autono¬ 
mous  Jewish  “nations,”  recognized  as  such  by  the 
sovereign,  with  their  social  institutions  and  tribu¬ 
nals  imposing  norms  on  Jewish  communities,  now 
affiliation  to  any  kind  of  Jewish  institution,  includ¬ 
ing  religious  ones,  became  a  voluntary  association. 
New  forms  of  Judaism  appeared.  As  for  other 
people,  nothing  impeded  Jews  who  did  not  want  to 
follow  the  beliefs  and  practices  of  their  religion. 
Some  Jews  wanted  a  Judaism  with  no  national 
component,  no  dream  of  returning  to  Jerusalem  to 
rebuild  the  Temple,  and  no  customs  too  different 
from  those  of  their  Catholic  and  Protestant  neigh¬ 
bors.  Reform  Judaism  appeared  in  the  19th  century 
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in  Germany  and  flourished  in  die  United  States.  To 
answer  this  new  trend,  neo-orthodoxy,  also  in 
Germany,  offered  a  mixture  of  strict  adherence  to 
Jewish  religious  laws — hence  “orthodoxy” — but 
within  modern  society,  science,  and  democracy — 
hence  “neo.”  (In  the  United  States,  these  congrega¬ 
tions  refer  to  themselves  as  Modern  Orthodox.) 
Other  Orthodox  groups  did  not  want  to  change 
anything  in  their  approach  to  religion  and  the 
world  and  stressed  strict  adherence  to  Judaism 
alongside  or  outside  modern  society  but  not  within. 
Both  Hasidism,  a  new  orthodox  trend  that  emerged 
in  the  19th  century,  with  a  rebbe,  a  charismatic 
leader,  at  the  center  of  the  group,  and  their  oppo¬ 
nents  wanted  a  return  to  premodern  times.  Between 
Reform  and  Orthodox,  a  third  group,  Conservative, 
tried  to  amalgamate  a  respect  for  tradition  and  a 
will  to  adapt  it  to  modern  conditions.  As  there  is  no 
central  authority  in  Judaism,  anyone  may  offer  a 
new  vision  and  try  to  create  a  new  form  of  Judaism. 

Two  ideologies  threatened  the  centrality  of  reli¬ 
gion  in  Jewish  life,  socialism  and  Zionism.  Social¬ 
ism,  under  the  banner  of  Bund,  was  a  Jewish  com¬ 
ponent  of  the  struggle  for  power  by  the  proletariat. 
Based  on  Jewish  ethics,  favorable  to  the  poor  and 
the  destitute,  and  on  Jewish  customs  such  as 
Yiddish,  Bund  became  a  force  mostly  in  Central 
and  Eastern  Europe.  Decimated  by  the  Shoah 
(Holocaust),  attacked  by  Stalin  and  his  successors, 
Bund  has,  for  all  intents  and  purposes,  disap¬ 
peared.  Zionism  transformed  the  Return  to  Zion 
from  a  eschatological  perspective  to  a  political 
program.  It  relied  on  centuries  of  longing  for 
Jerusalem  omnipresent  in  liturgy,  from  everyday 
prayers  “Bless  be  You  who  rebuilds  Jerusalem”  to 
the  annual  cycle  of  festivals,  all  of  them  centered 
on  Jerusalem  and  culminating  at  the  end  of  Yom 
Kippur,  and  during  the  Passover  Seder  with  “Next 
year,  in  Jerusalem.”  Throughout  the  life  cycle, 
from  birth  to  death,  Jerusalem  was  evoked.  What 
followed  was  a  deeply  ambivalent  relationship 
between  Judaism  and  political  Zionism,  inspired 
by  the  same  set  of  beliefs  and  customs  but  differing 
in  their  modern  implications.  Some  chose  to  com¬ 
bine  both  in  Religious  Zionism.  Many  religious 
leaders  were  reluctant  to  support,  if  not  outright 
hostile  to,  the  new  nonreligious  ideology  that  was 
making  rapid  headway  among  the  Jewish  masses. 
The  non-Zionists  and  anti-Zionists  were  from 
both  Reform  Judaism,  which  did  not  recognize 


Jews  as  a  people  but  only  as  a  religion,  and 
Orthodox  Judaism,  which  was  opposed,  like 
Catholicism,  to  any  new  secular  philosophy. 
Eventually,  following  the  Shoah  and  the  creation 
of  Israel,  most  religious  opposition  to  Zionism 
disappeared.  Approximately  half  of  the  surviving 
Hasidic  leaders  live  in  the  United  States,  half  in 
Israel;  with  a  few  exceptions,  they  participate  in 
Israeli  elections. 

Judaism  in  the  21st  century  faces  a  number  of 
challenges: 

•  to  strengthen  its  leadership,  which  is  scattered, 
its  streams  divided  on  both  theological  issues 
(Deuteronomy  as  divine  revelation  or  human 
text)  and  practical  ones  (continuity  through 
matrilineal  descent  for  Orthodox  and 
Conservative,  through  patrilineal  descent  for 
Reform),  often  marginalized  by  secular  Jewish 
leaders; 

•  to  create  new  scholarship  and  provide  new 
answers  in  a  domain  not  probed  for  centuries, 
regarding  religious  decisions  linked  to  Jewish 
sovereignty  and  not  limited  to  the  Jewish 
community  as  a  minority; 

•  to  navigate  between  a  Judaism  that  follows 
societal  trends  and  a  Judaism  that  rejects  the 
modern  world; 

•  to  be  sensitive  to  gender  issues;  Reform  and 
Conservative  streams  have  female  rabbis  and  the 
Orthodox  has  none,  but  there  is  already  a  shift 
within  Orthodox  Judaism,  such  as  tohaniot  bet 
din,  female  specialists  in  divorce  religious  laws, 
being  accepted  as  quasi  advocates  by  Israeli 
rabbinical  courts; 

•  to  respond  to  an  unprecedented  situation  in 
which,  regardless  of  migrations,  the  Jewish 
population  outside  Israel  is  declining  while 
within  Israel  it  is  growing;  the  absolute  number 
and,  even  more,  the  proportion  of  Orthodox  and 
ultra-Orthodox  is  on  the  increase  everywhere, 
and  it  becomes  necessary  for  the  diminishing 
majority  to  accommodate  Judaism  in  its 
Orthodox  expressions  and  for  religious 
authorities  to  offer  leadership  not  only  to  their 
own  limited  communities  but  also  to  all  those 
searching  for  Jewish  values;  and 

•  to  offer,  as  a  monotheistic  religion,  answers  to 
scientific  and  biogenetic  advances  with  ethical 
consequences. 
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In  contemporary  times,  Judaism  is  an  expression  of 
Jewish  identity,  a  privileged  means  to  transmit  it. 

Julien  Bauer 
Universite  du  Quebec  a  Montreal 
Montreal,  Quebec,  Canada 
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Judicial  Decision  Making 


While  the  executive  and  legislative  branches  are 
the  most  visible  aspects  of  government,  the  third 
branch — the  judicial  branch — is  in  many  ways 
more  important.  In  many  countries,  it  is  the  judi¬ 
ciary  that  rules  on  whether  the  actions  of  the  other 
branches  are  constitutional  and,  thus,  permissible. 
Thus,  while  judicial  decision  making  (JDM)  tends 
to  occur  in  private  settings,  out  of  the  spotlight,  it 
would  be  a  mistake  to  conclude  that  the  decisions 
made  by  judges  are  not  consequential. 

This  entry  discusses  two  fundamental  questions: 
(1)  How  do  judges  make  decisions?  (2)  What 
impact  do  those  decisions  have  on  governments 
and  societies?  The  latter  question  is  especially 
important  for  individual  rights  as  well  as  broader 
political  issues.  Most  empirical  research  on  courts 
and  JDM  has  been  done  in  the  United  States 
because  of  the  unique  role  of  the  U.S.  Supreme 
Court  (USSC)  in  the  American  system  of  govern¬ 
ment;  however,  many  recent  studies  examine 
courts  comparatively.  The  theoretical  perspectives 
and  approaches  used  to  study  courts  and  judges 


are  normative,  legal,  behavioral,  neo-institutional, 
based  on  rational  choice,  and  sociological.  This 
entry  provides  a  broad  introduction  to  these 
approaches,  with  a  special  emphasis  on  the  USSC. 

Normative  Legal  Judicial  Decision 
Making  (The  Legal  Model) 

Normative  approaches  to  JDM  focus  on  how  law 
ought  to  be  interpreted.  Often  emphasis  is  placed 
on  purely  legal  factors,  such  as  the  language  of  the 
law  passed  by  the  legislature,  the  wording  of  con¬ 
stitutions  or  founding  documents,  and,  in  common 
law  systems,  precedent  (previous  cases  decided  by 
courts).  The  focus  on  normative  jurisprudence  is 
not  what  judges  personally  decide  but  what  the 
law  demands.  Normative  jurisprudence  tends  to 
think  of  the  law  as  a  puzzle  to  be  unlocked  and  the 
correct  answer  revealed,  not  a  malleable  concept 
with  many  possible  arguments  and  answers.  A 
normative  legal  approach  can  be  seen  in  the  classic 
writings  of  William  Blackstone,  Learned  Hand, 
and  the  founding  fathers  of  the  United  States  (espe¬ 
cially  James  Madison  and  Alexander  Hamilton), 
who  were  heavily  influenced  by  enlightenment 
thinkers  such  as  Thomas  Hobbes,  John  Locke,  and 
Baron  de  Montesquieu. 

Although  the  modern  study  of  JDM  has  moved 
from  a  normative  to  a  positivist  approach,  many 
scholars  continue  to  recognize  the  importance  of 
legal  factors  in  explaining  judicial  decisions.  These 
studies  tend  to  focus  on  the  importance  of  case 
facts  on  the  outcomes  of  various  types  of  cases. 
However,  while  it  is  clear  that  “the  law”  matters 
for  some  case  outcomes,  it  is  also  clear  that  many 
judicial  decisions  are  affected  by  nonnormative 
considerations.  Indeed,  it  may  be  most  useful  to 
consider  the  normative  as  more  prescriptive  than 
descriptive.  That  is,  it  may  explain  what  we  think 
judges  ought  to  do  (more  factors  they  should  take 
into  account  when  they  decide  cases),  but  it  is  less 
useful  is  describing  what  actually  occurs  (what  fac¬ 
tors  actually  matter  to  judges). 

The  level  of  court  being  studied  also  matters.  For 
example,  in  trial  courts,  where  judges  have  to  rule 
on  evidentiary  matters,  charge  the  jurors,  and  so  on, 
it  is  likely  that  the  normative/legal  model  is  generally 
applicable.  These  judges  are  constrained  by  appel¬ 
late  judges  above  them,  and  if  they  were  to  make 
rulings  inconsistent  with  the  facts  and  the  law,  they 
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would  likely  be  overruled.  However,  for  judges  in 
courts  of  last  resort,  the  situation  is  quite  different. 
First,  these  judges  have  no  higher  court  that  can 
overrule  their  decision.  Second,  these  courts  tend  to 
hear  cases  that  are  much  more  difficult.  By  difficult, 
we  mean  that  reasonable  people  can  disagree  as  to 
the  proper  outcome.  If  the  facts  and  the  law  could 
support  both  opposing  outcomes,  then  it  is  not  hard 
to  see  how  judges  can  be  constrained  in  this  way. 

Behavioralism 

Eventually,  scholars  began  to  notice  that  norma¬ 
tive  legal  factors  did  not  explain  all  judicial  deci¬ 
sions.  Not  all  legal  decisions  are  clear-cut,  and 
judicial  preferences  and  ideologies  help  explain  the 
formation  of  voting  coalitions  on  courts,  especially 
high-level  appellate  courts  engaged  in  lower  court 
review.  The  work  of  early  behaviorists  such  as 
C.  Herman  Pritchett  and  Glendon  Schubert  was 
expanded  on  by  others,  most  notably  Jeffrey  Segal 
and  Harold  Spaeth  in  The  Supreme  Court  and  the 
Attitudinal  Model.  Behavioralism  takes  into 
account  the  role  of  ideological  preferences  on  the 
decision  making  of  judges  and  has  found  consider¬ 
able  evidence  that  judges  are  influenced  by  their 
personal  ideology,  especially  in  the  USSC  context. 
Of  course,  to  be  able  to  explore  the  impact  of  pref¬ 
erences,  reliable  and  accurate  measures  of  judges’ 
preferences  must  be  available. 

Measures  of  judicial  ideology  have  been  created 
for  most  judges  in  the  United  States.  Measures  of  the 
ideology  of  U.S.  Supreme  Court  justices  include 
those  developed  by  Jeffrey  Segal  and  Albert  Cover 
(1989)  and  Andrew  Martin  and  Kevin  Quinn 
(2002).  Segal  and  Cover  used  newspaper  articles 
written  at  the  time  of  a  justice’s  confirmation  to 
have  an  independent  indicator  of  voting.  This 
allowed  them  to  avoid  the  endogeneity  problems 
that  plagued  other  measures  that  used  past  votes  to 
predict  future  votes.  Martin  and  Quinn  used 
Bayesian  analysis  to  develop  a  dynamic  model  of  the 
ideal  point  of  judges  using  their  votes  on  past  cases. 
Other  measures  have  been  developed  for  federal 
appellate  and  state  court  judges  in  the  United  States. 

Neo-Institutionalism  and  Rational  Choice 

Judicial  scholars  also  explore  the  impact  that  insti¬ 
tutional  configurations  and  other  actors  have  on 


JDM,  in  addition  to  the  traditional  legal  and 
behavioral  factors.  Institutional  influences  can  be 
either  internal  or  external  in  nature.  Internal  struc¬ 
tural  factors,  such  as  term  length,  docket  control, 
and  the  like,  affect  judges’  behavior,  often  allowing 
them  (or  requiring  them)  to  be  strategic  when  mak¬ 
ing  decisions.  Neo-institutional  interpretations  are 
also  prominent  in  separation-of-powers  studies, 
which  focus  on  the  external  relationship  between 
courts  and  other  political  actors.  The  neo-institu- 
tional  orientation  is  often  used  when  studying  state 
courts  in  the  United  States  because  the  diverse 
nature  of  institutional  configurations  lends  itself 
well  to  such  explanations,  but  it  is  also  promising 
as  an  orientation  for  comparative  research. 
Although  institutionalism  is  a  traditional  theory  of 
government,  it  is  only  since  the  1990s  or  so  that 
neo-institutionalism  has  been  regularly  applied  to 
integrated  research  designs  of  contemporary  legal 
studies.  Current  institutional  and  rational  choice 
studies  trace  their  theoretical  roots  to  the  early 
strategic  work  of  Walter  Murphy,  who  argued  that 
judicial  decisions  were  not  always  based  purely  on 
political  preferences. 

Comparative  Judicial  Behavior 

There  are  many  possible  ways  to  approach  com¬ 
parative  judicial  politics.  Typically,  studies  focus 
on  one  of  three  aspects:  (1)  structural  differences 
and  sociological  impacts,  (2)  judicial  indepen¬ 
dence,  and  (3)  judicial  legitimacy.  The  study  by 
Herbert  Jacob,  Erhard  Blankenburg,  Herbert 
Kritzer,  Doris  Marie  Provine,  and  Joseph  Sanders 
in  1996  is  particularly  comprehensive  and  exam¬ 
ines  the  structures  and  impacts  of  eight  different 
judicial  systems.  Such  global  studies  are  relatively 
rare  as  most  studies  focus  on  one  country  or  area. 

Charles  Epp’s  The  Rights  Revolution  is  a  par¬ 
ticularly  noteworthy  work  focusing  on  the  socio¬ 
logical  and  practical  political  impact  court  decisions 
have  on  the  rights  of  citizens.  Epp’s  study  examines 
the  impact  courts  can  have  on  individual  rights  in 
four  different  common  law  countries,  including  the 
United  States,  Britain,  Canada,  and  India.  It  is  an 
important  addition  to  the  literature  on  the  socio¬ 
logical  impact  of  court  decisions  as  he  makes  a  solid 
argument  demonstrating  the  need  for  a  group- 
driven  support  structure,  in  addition  to  basic  consti¬ 
tutional,  judicial,  and  structural  factors. 
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Judicial  independence  is  an  important  topic, 
particularly  in  countries  with  a  long  history  of 
political  instability  or  authoritarian  regimes.  Inde¬ 
pendence  studies  often  focus  on  Latin  America, 
Russian,  Japan,  or  Italy.  There  is  extensive  evi¬ 
dence  of  the  challenges  facing  courts  in  countries 
without  a  history  of  a  strong  rule  of  law  and  clear 
indications  of  fundamental  differences  in  how  citi¬ 
zens  use  courts  in  such  countries. 

Legitimacy  is  an  important  issue  for  courts  the 
world  over.  All  courts  face  obstacles  to  the  imple¬ 
mentation  of  their  decisions,  especially  when  the 
legitimacy  of  the  court  or  of  the  decisions  them¬ 
selves  is  questioned.  While  scholars  of  American 
politics  do  not  typically  focus  on  legitimacy,  given 
the  unusually  strong  position  enjoyed  by  the 
USSC,  legitimacy  is  an  important  research  topic 
for  other  courts.  Prominent  scholars  include 
Vanessa  Barid,  Gregory  Caldeira,  and  James 
Gibson.  Much  of  the  legitimacy  research  finds  that 
courts  outside  the  United  States  typically  have  little 
legitimacy,  in  large  part  due  to  a  lack  of  informa¬ 
tion  about  courts.  This  raises  serious  concerns 
about  compliance  with  judicial  decisions.  Inter¬ 
estingly,  studies  also  find  that  courts  have  the 
potential  to  build  institutional  legitimacy  through 
court  rulings,  even  in  countries  (e.g.,  the  former 
German  Democratic  Republic  [East  Germany]) 
that  have  had  little  experience  with  a  professional 
judiciary. 

Supranational  Judicial  Behavior: 

The  European  Court  of  Justice 

As  the  European  Union  has  evolved  over  time,  so 
has  the  role  of  the  union’s  highest  court,  the 
European  Court  of  Justice  (ECJ).  Literature  on  the 
ECJ  generally  examines  the  role  of  the  ECJ  in 
promoting  European  integration  as  a  unique 
supranational  institution.  Paralleling  theoretical 
orientations  in  comparative  politics  generally,  ECJ 
studies  tend  to  be  either  intergovernmentalist  or 
neo-functionalist  in  their  orientation.  Intergovern- 
mentalists  argue  that  the  court  is  actually  a  weak 
actor  in  promoting  the  integration  of  European 
countries  into  a  strong  centralized  supranational 
government.  Neo-functionalists  instead  pointed  to 
the  court’s  role  in  enhancing  loyalties  to  its  deci¬ 
sions,  thereby  cementing  its  role  in  the  integration 
process. 


Conclusion 

There  is  much  more  to  say  about  courts  and  JDM 
than  can  be  said  in  one  brief  essay,  so  this  entry  has 
limited  its  survey  of  the  nature  of  JDM,  both  in  the 
United  States  and  in  other  countries.  Most  basi¬ 
cally,  the  studies  discussed  here  commonly  try  to 
determine  what  factors  influence  judges’  decisions 
and  how  those  decisions  affect  society.  Most  of  this 
research  has  focused  on  courts  in  the  United  States 
because  of  their  unprecedented  influence  as  coequal 
third  branches  with  expansive  powers  of  judicial 
review;  however,  many  recent  studies  have  begun 
to  examine  courts  comparatively.  Indeed,  the  com¬ 
parative  study  of  courts  and  judges  has  been  an 
area  of  rapid  growth  since  the  1990s. 

Moreover,  while  the  studies  cited  above  answer 
important  questions,  there  are  a  host  of  other 
questions  that  have  not  been  mentioned  owing  to 
space  constraints,  such  as  judicial  selection,  retire¬ 
ment,  compliance  with  judicial  decisions,  and 
resource  inequalities  between  litigants.  In  addition 
to  the  topics  discussed  above,  scholars  have  been 
studying  questions  such  as  these,  which  are  funda¬ 
mental  to  an  understanding  of  the  judicial  system. 

Finally,  it  should  be  clear  that  the  study  of 
courts  and  judges  is  characterized  by  diversity.  The 
theoretical  approaches  used  by  scholars  were  nor¬ 
mative,  legal,  behavioral,  neo-institutional,  based 
on  rational  choice,  and  sociological.  Scholars  study 
courts  of  last  resort  as  well  as  intermediate  appel¬ 
late  courts,  trial  courts,  and  even  transnational 
courts.  Topics  include  the  law,  the  preferences  of 
judges,  and  the  institutional  arrangements  under 
which  decisions  take  place.  While  some  may  say 
that  this  diversity  has  hindered  the  development  of 
general  knowledge  about  the  judiciary,  the  authors 
agree  with  scholars  such  as  Lawrence  Baum,  who 
argue  that  such  diversity  in  approaches  is  welcome 
because  it  is  most  likely  to  lead  to  a  more  complete 
understanding  of  judicial  behavior. 
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Judicial  Independence 


Judicial  independence  is  a  notoriously  difficult  con¬ 
cept  to  define,  to  the  point  that  some  scholars  have 
questioned  whether  the  concept  serves  any  useful 
analytical  purpose.  In  a  literal  sense,  judicial  inde¬ 
pendence  refers  to  the  ability  of  courts  and  judges 
to  perform  their  duties  free  of  influence  or  control 
by  other  actors.  However,  the  term  is  more  often 
used  in  a  normative  sense  to  refer  to  the  kind  of 
independence  that  is  considered  desirable  for  courts 
and  judges  to  possess.  Consequently,  there  are  two 
sources  of  confusion  over  its  meaning.  The  first  is 
conceptual,  in  the  form  of  a  lack  of  clarity  regarding 
the  kinds  of  independence  that  courts  and  judges 
are  capable  of  possessing.  The  second  is  normative, 
in  the  form  of  disagreement  over  what  kind  of  inde¬ 
pendence  courts  and  judges  ought  to  possess. 

As  a  practical  matter,  the  type  of  judicial  inde¬ 
pendence  that  is  widely  considered  both  the  most 
important  and  the  most  difficult  to  achieve  is  inde¬ 
pendence  from  other  government  actors.  On  the  one 
hand,  this  type  of  judicial  independence  is  highly 
valued  among  those  who  impute  to  courts  a  special 
responsibility  for  ensuring  that  individuals  and 
minorities  do  not  suffer  illegal  treatment  at  the 
hands  of  the  government  or  a  tyrannous  majority. 


On  the  other  hand,  this  type  is  also  considered  espe¬ 
cially  difficult  to  achieve  because  the  other  branches 
of  government  ordinarily  possess  the  power  to  dis¬ 
obey  or  thwart  the  enforcement  of  judicial  deci¬ 
sions,  if  not  also  to  retaliate  against  the  courts  for 
decisions  that  they  oppose.  In  Alexander  Hamilton’s 
famous  formulation,  the  judiciary  is  the  “least  dan¬ 
gerous”  branch,  having  “no  influence  over  either 
the  sword  or  the  purse”  and  is  therefore  least  capa¬ 
ble  of  defending  itself  against  the  other  branches. 

Formal  guarantees  of  judicial  independence 
from  government  control  date  back  to  at  least  as 
far  as  England’s  Act  of  Settlement  of  1701,  which 
gave  judges  explicit  protection  from  unilateral 
removal  by  the  Crown  in  the  context  of  a  larger 
shift  of  power  toward  the  parliament  and  the 
courts.  Today,  the  idea  of  judicial  independence 
has  such  broad  and  powerful  normative  appeal 
that  even  states  that  do  not  honor  it  in  practice  are 
wont  to  profess  a  commitment  to  it.  Two  thirds  of 
the  world’s  current  written  constitutions  contain 
some  form  of  explicit  protection  for  the  indepen¬ 
dence  of  the  judiciary,  and  the  proportion  of  con¬ 
stitutional  documents  that  contain  such  protections 
has  been  rising  over  time.  Judicial  independence 
has  been  formally  endorsed  at  the  international 
level  as  well,  for  example,  in  the  form  of  the  Basic 
Principles  on  the  Independence  of  the  Judiciary , 
adopted  by  the  United  Nations  in  1985.  Empirical 
research  suggests,  however,  that  the  existence  of 
formal  constitutional  guarantees  of  judicial  inde¬ 
pendence  is  poorly  correlated  with  actual  respect 
for  judicial  independence  in  practice. 

Any  comprehensive  and  coherent  definition  of 
judicial  independence  must  address  several  ques¬ 
tions.  The  first  is  the  question  of  independence  for 
whom-,  the  second  is  the  question  of  independence 
from  tuhom-,  and  the  third  is  the  question  of  inde¬ 
pendence  from  what.  To  supply  satisfying  answers 
to  these  questions,  however,  demands  resorting  to 
some  normative  theory,  explicit  or  otherwise,  of 
why  judicial  independence  is  valuable  and  what  it 
is  supposed  to  accomplish.  In  other  words,  it  is 
also  necessary  to  address  the  question  of  indepen¬ 
dence  for  what  purpose. 

Independence  for  Whom? 

Judicial  independence  can  be  defined  as  a  charac¬ 
teristic  of  individual  judges  or  as  a  characteristic  of 
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the  judiciary  as  a  whole.  Neither  conception  is 
indisputably  preferable  to  the  other  as  a  practical 
matter.  On  the  one  hand,  if  judicial  independence 
is  guaranteed  at  the  institutional  level  but  not  the 
individual  level,  individual  judges  can  be  forced  to 
obey  the  wishes  of  the  leadership  of  the  judiciary, 
which  may  result  in  a  less-than-wholehearted 
enforcement  of  the  rule  of  law.  In  Chile  and  Japan, 
for  example,  the  extent  to  which  the  judiciary  as  an 
institution  commands  obedience  and  conformity 
from  its  members  has  been  blamed  for  producing 
timid  judges  who  are  unwilling  or  unable  to  rule 
against  the  government.  On  the  other  hand,  if  judi¬ 
cial  independence  is  ensured  at  the  individual  level, 
individual  judges  will  find  themselves  at  liberty  to 
pursue  their  individual  preferences.  Not  only  does 
unchecked  discretion  of  this  kind  invite  abuse,  but 
it  also  raises  the  likelihood  that  judges  will  decide 
cases  in  inconsistent  ways,  with  the  potential  effect 
of  undermining  the  predictability  and  stability  of 
the  law. 

Independence  From  Whom? 

The  existence  and  adequacy  of  judicial  indepen¬ 
dence  become  matters  of  practical  concern  only 
when  a  court  decides  a  dispute  involving  the  inter¬ 
ests  of  some  actor  or  institution  with  potential  or 
actual  power  over  the  court.  Generally  speaking, 
the  more  powerful  the  actor  whose  interests  are  at 
stake,  the  greater  is  the  need  to  protect  the  inde¬ 
pendence  of  the  court  from  that  actor.  If  both  sides 
to  the  dispute  are  powerful,  however,  that  sym¬ 
metry  of  power  may  provide  part  or  all  of  the 
necessary  protection. 

The  three  scenarios  that  a  court  may  encounter 
are 

1.  disputes  between  private  actors, 

2.  disputes  between  government  actors,  and 

3.  disputes  between  private  actors  and 
government  actors. 

In  the  first  scenario,  the  court  must  strive  to 
remain  independent  from  the  parties,  who  may 
attempt  to  undermine  its  independence  by  a  vari¬ 
ety  of  means,  such  as  bribery  or  intimidation.  In 
this  situation,  the  government  is  a  friend  of  judicial 
independence:  It  can  be  expected  to  defend  the 


independence  of  the  court  from  the  efforts  of  the 
parties. 

In  the  second  scenario,  the  prospects  for  judicial 
independence  are  again  relatively  favorable.  The 
court  is  asked  not  to  face  down  a  powerful  actor 
on  behalf  of  a  weak  one  but,  rather,  to  choose 
sides  between  two  powerful  actors  in  an  impartial 
way.  Whichever  side  the  court  chooses,  the  result 
will  be  a  two-against-one  dynamic  that  ought  to 
provide  the  court  a  degree  of  protection  from 
retaliation.  The  government  does  not  pose  a  mean¬ 
ingful  threat  to  judicial  independence  in  such  cases 
because  it  is  at  war  with  itself. 

In  the  third  scenario,  the  government  does  pose 
a  potent  threat  to  judicial  independence,  but  this 
threat  may  be  either  counteracted  or  compounded 
by  the  public.  Suppose,  for  instance,  that  a  ruler 
seeks  to  extend  his  or  her  own  term  of  office  in  an 
illegal  fashion.  In  this  case,  the  court  faces  a  threat 
to  its  independence  from  the  government,  but  its 
ability  to  withstand  this  threat  is  greatly  improved 
to  the  extent  that  it  can  count  on  public  support  if 
it  rules  against  the  government.  As  long  as  the 
court  is  in  the  position  of  siding  with  either  the 
government  or  the  public,  its  independence  enjoys 
protection:  Either  should  be  capable  of  providing 
the  court  with  the  support  that  it  needs  to  with¬ 
stand  attacks  from  the  other.  In  other  situations, 
however,  the  court  may  be  asked  to  take  a  position 
that  is  antagonistic  to  that  of  both  the  government 
and  the  public,  as  in  the  case  of  illegal  government 
discrimination  against  an  unpopular  minority. 
Here  the  prospects  for  judicial  independence  are  at 
their  nadir:  The  judiciary  is  called  on  to  demon¬ 
strate  independence  from  both  the  government 
and  the  public,  yet  it  lacks  the  help  of  a  powerful 
ally  to  withstand  the  pressures  that  it  faces. 

There  are  various  ways  to  protect  judicial  inde¬ 
pendence  in  the  face  of  such  threats.  Common 
strategies  include  limiting  government  discretion 
over  judicial  salaries,  placing  heavy  restrictions  on 
the  removal  of  judges  from  office,  fixing  the  mini¬ 
mum  jurisdiction  that  courts  are  to  possess,  and 
relieving  judges  of  personal  liability  for  acts  per¬ 
formed  in  the  course  of  their  duties.  Less  obvi¬ 
ously,  the  internal  organization  of  a  judiciary  can 
also  have  a  profound  effect  on  its  susceptibility  to 
external  influence.  The  organization  of  the  Japanese 
judiciary,  for  example,  renders  lower  court  judges 
highly  obedient  to  an  administrative  bureaucracy 
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controlled  by  the  chief  justice,  who  in  turn  is 
invariably  appointed  close  to  mandatory  retire¬ 
ment  age.  As  a  result,  the  government  can  influence 
the  ideological  direction  of  the  entire  judiciary 
simply  by  availing  itself  of  the  relatively  frequent 
opportunity  to  replace  a  specific  judge.  A  decen¬ 
tralized  organizational  structure  that  grants  greater 
autonomy  to  individual  judges,  by  contrast,  may 
make  it  harder  for  the  government  to  capture  or 
co-opt  the  judiciary  as  a  whole. 

In  the  long  run,  however,  it  is  difficult,  if  not 
impossible,  to  create  a  perfectly  independent  judi¬ 
ciary  that  is  completely  insulated  from  all  forms  of 
political  and  popular  influence.  The  relatively 
lengthy  experience  of  the  U.S.  Supreme  Court,  for 
instance,  suggests  that  even  a  highly  independent 
court  is  likely  over  time  both  to  be  reshaped  by 
political  forces  and  to  accommodate  the  wishes  of 
a  durable  political  majority.  It  is  optimistic  to 
think  that  a  handful  of  judges,  lacking  the  power 
of  either  the  purse  or  the  sword,  can  consistently 
defy  more  powerful  actors  and  institutions  with¬ 
out  ever  suffering  any  consequences,  no  matter 
what  formal  protections  they  may  enjoy.  There  are 
limits  to  what  can  be  accomplished  simply  by 
adjusting  the  institutional  characteristics  of  the 
judiciary  or  enacting  solemn  declarations  about 
the  inviolability  of  judicial  independence. 
Ultimately,  the  prospects  for  attaining  even  moder¬ 
ate  levels  of  judicial  independence  are  likely  to 
depend  on  political  and  historical  conditions  that 
are  exogenous  to  the  judiciary  and  may  lie  beyond 
reach,  such  as  the  existence  of  a  stable,  competi¬ 
tive,  multiparty  democracy. 

Independence  From  What? 

Not  all  forms  of  influence  over  judicial  decision 
making  constitute  threats  to  judicial  independence. 
While  some  activities  aimed  at  influencing  courts, 
such  as  bribery  and  physical  intimidation,  may  be 
inappropriate  under  any  plausible  conception  of 
judicial  independence,  others  can  only  be  evalu¬ 
ated  on  the  basis  of  a  contestable  normative  judg¬ 
ment.  Consider,  for  example,  public  protests  in 
front  of  courthouses.  One  view  might  be  that  such 
protests  should  be  privileged  as  a  form  of  political 
expression  and  that  judges  in  a  democracy  are  per¬ 
mitted  or  even  obligated  to  take  public  opinion 
into  account.  Alternatively,  one  might  take  the 


view  that  judges  should  be  shielded  from  such 
expressions  of  public  opinion,  much  as  jurors  are 
sequestered,  to  ensure  that  their  deliberations  are 
not  tainted  by  considerations  that  ought  to  be 
irrelevant.  Likewise,  a  public  campaign  to  deny  a 
judge  reelection  because  he  or  she  has  ruled  in 
unpopular  ways  on  controversial  issues  can  be 
characterized  as  either  a  healthy  manifestation  of 
democracy  or  a  threat  to  judicial  independence. 

Whether  such  efforts  to  influence  judicial  deci¬ 
sion  making  are  consistent  with  judicial  indepen¬ 
dence  cannot  be  answered  by  fiat.  To  define  the 
requirements  of  judicial  independence  in  such  cases 
demands  instead  a  normative  theory  of  what  courts 
are  supposed  to  take  into  account  when  deciding 
cases,  what  judicial  independence  is  supposed  to 
achieve,  and  to  what  extent  judicial  independence 
can  and  should  be  balanced  against  other  objec¬ 
tives  and  considerations. 

Independence  for  What  Purpose? 

Judicial  independence  is  generally  considered  a 
means  to  an  end,  rather  than  an  end  in  and  of  itself. 
Most  would  probably  agree  that  the  ultimate  goal 
can  be  described  as  the  fair  and  impartial  adjudica¬ 
tion  of  disputes  in  accordance  with  law.  If  that  is 
indeed  the  goal,  however,  then  the  pursuit  of  judi¬ 
cial  independence  is  open  to  several  objections. 

One  objection  is  that  the  goal  itself  is  unattain¬ 
able  because  it  rests  on  a  misconception  of  the 
nature  of  both  law  and  adjudication.  It  is  a  com¬ 
monly  held  view  among  legal  theorists  that  the  law 
is  frequently  indeterminate  and  that  it  is  therefore 
impossible  for  judges  to  decide  disputes  simply  by 
applying  preexisting  law.  Rather,  it  is  said,  the 
act  of  adjudication  requires  judges  to  make  the 
very  law  that  they  purport  merely  to  apply.  Yet  if 
adjudication  necessarily  entails  lawmaking,  then, 
judicial  independence  does  not  simply  protect  the 
ability  of  judges  to  decide  disputes  in  accordance 
with  law  but  instead  licenses  them  to  make  and 
impose  whatever  laws  they  see  fit,  which  is  a  pros¬ 
pect  that  many  consider  incompatible  with  either 
the  appropriate  role  of  judges  in  a  democracy  or 
the  idea  of  separation  of  powers. 

Another  objection  is  that  judicial  independence 
is  neither  necessary  nor  sufficient  to  ensure  impar¬ 
tial  adjudication  in  accordance  with  law  and  may 
even  undermine  that  goal  if  left  unchecked.  On  the 
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one  hand,  it  is  possible  for  a  judge  who  faces 
potential  retaliation  to  nevertheless  decide  cases  in 
an  impartial  manner.  On  the  other  hand,  there  is 
no  guarantee  that  giving  judges  the  freedom  to 
decide  cases  as  they  wish  means  that  they  will 
choose  to  do  so  fairly  and  in  accordance  with  law. 
Even  if  it  were  possible  to  create  a  judiciary  that  is 
completely  free  from  both  popular  and  political 
control,  what  would  then  prevent  the  judges  from 
deciding  cases  on  the  basis  of  personal  prejudice  or 
self-interest?  It  is  on  the  basis  of  such  concerns  that 
many  consider  it  essential  to  balance  judicial  inde¬ 
pendence  against  judicial  accountability  and  to 
distinguish  appropriate  forms  of  influence  over  the 
judiciary  from  inappropriate  forms.  However,  any 
mechanism  that  might  be  devised  for  preventing  or 
punishing  judicial  abuse  of  power  is  itself  likely  to 
prove  susceptible  to  abuse.  The  resulting  question 
of  how  to  oversee  the  judges  who  are  responsible 
for  overseeing  the  government — quis  custodiet 
ipsos  custodes — has  long  vexed  constitutional  and 
political  theorists  and  admits  of  no  simple  solution. 
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The  term  judicial  review  refers  to  the  practice  of 
judges  deciding  whether  or  not  to  declare  void  a 
statute  or  an  executive  order  on  the  grounds  of 
conflict  with  a  higher  law.  This  entry  first  describes 
two  sorts  of  judicial  review  and  also  the  political 
logic  by  which  the  institution  of  judicial  review  is 
criticized  and  that  by  which  it  is  defended.  It  then 
discusses  the  history  of  the  adoption  and  spread  of 
the  practice  of  judicial  review.  Finally,  it  describes 
the  kinds  of  judicial  review  that  are  now  prevalent 
in  the  world. 

When  the  judge  declares  void  an  order  from  an 
executive  branch  official  on  the  grounds  that  the 
official  has  transgressed  the  bounds  of  his  or  her 
statutory  authority,  the  practice  of  judicial  review 
is  not  particularly  controversial.  It  is  necessitated 
by  the  idea  that  the  rule  of  law  is  preferable  to  the 
unchecked  rule  of  individual  will.  (This  version  is 
sometimes  called  administrative  review  to  distin¬ 
guish  it  from  the  review  of  the  statutes  themselves.) 
A  judicial  power  to  declare  void  the  acts  of  elected 
legislatures,  however,  is  more  controversial,  espe¬ 
cially  in  systems  where  judges  are  not  elected.  The 
criticism  of  this  sort  of  higher  law  judicial  review 
is  that  it  substitutes  the  will  of  non-electorally 
accountable  persons  for  the  will  of  the  people’s 
elected  representatives.  The  defense  of  it  relies  on 
the  premise  that  the  legislators  will  not  be  inclined 
to  police  themselves  to  stay  within  the  constitu¬ 
tional  bounds  of  their  authority  and  on  the  prem¬ 
ise  that  persons  independent  of  political  pressures 
and  incentives  and  trained  in  the  law — judges — 
will  be  able  to  uphold  better  the  legal  limits  on 
governmental  power. 

Judicial  review  began  in  the  courts  of  American 
states  under  the  Articles  of  Confederation,  perhaps 
influenced  by  the  fact  that  the  British  Privy  Council 
had  wielded  a  veto  power  over  colonial  legislation; 
at  first,  the  practice  aroused  considerable  opposi¬ 
tion.  At  that  time,  most  of  the  states  had  constitu¬ 
tions  written  and  adopted  by  the  legislature,  so 
judges  looked  to  natural  law  or  the  law  of  nations 
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or  Anglo-American  common  law  for  the  higher 
law  to  which  they  claimed  statutes  must  conform. 
Some  opposing  attorneys  argued  that  an  appeal 
beyond  statutes  was  illegitimate,  and  some  judges 
on  the  bench  made  similar  claims.  It  also  provoked 
popular  mass  protest  meetings,  legislative  cen¬ 
sures,  attempts  to  outlaw  it,  and  serious  efforts  to 
impeach  judges  for  engaging  in  it. 

A  man  named  Thomas  Tudor  Tucker,  writing 
in  1784  under  the  pen  name  of  Philodemus  (friend 
of  the  people,  in  Greek),  in  a  pamphlet  titled 
“Conciliatory  Hints,”  developed  the  idea  that  the 
legislatively  adopted  constitution  of  his  state, 
South  Carolina,  should  be  replaced  by  one  written 
by  a  constitutional  convention  specially  elected  by 
the  people  and  that  the  people  should  ratify  it  in  an 
election,  thereby  giving  their  express  consent. 
Moreover,  the  constitution  should  state  that  it  is 
paramount  to  ordinary  legislation  and  not  change¬ 
able  merely  by  the  legislature.  Finally,  and  cru¬ 
cially,  Tucker  suggested  that  a  peaceful  rather  than 
violent  solution  to  abuses  of  power  by  legislatures 
would  be  to  have  the  constitution  state  that  no  act 
of  the  legislature  contravening  it  can  be  valid,  and 
this  declaration  could  be  enforced  in  the  courts  of 
law.  Thus  was  born  the  system  of  judicial  review 
for  safeguarding  a  popularly  ratified  constitution 
as  higher  law. 

The  idea  spread  quickly  to  other  states,  and 
members  of  the  Constitutional  Convention  treated 
this  power  as  something  the  federal  courts  would 
have.  They  did  not  explicitly  characterize  it  as  a 
power  over  law  adopted  by  Congress  but  merely 
implied  that  federal  courts  would  have  such  power 
(by  including  constitutional  prohibitions  on  certain 
kinds  of  laws  and  allowing  the  courts  to  take  cases 
“arising  under  the  Constitution”).  As  to  state  laws, 
the  U.S.  Constitution  was  explicit.  Article  VI  says 
plainly  that  the  national  constitution,  national 
laws,  and  national  treaties  “shall  be  the  supreme 
law  of  the  land;  and  judges  in  every  state  shall  be 
bound  thereby,  anything  in  the  constitution  or  laws 
of  any  state  to  the  contrary  notwithstanding.” 
Therefore,  judges  in  the  United  States  were  obliged 
to  throw  out  any  state-level  law  that  conflicted  with 
a  federal-level  higher  law.  The  fourth  U.S.  Supreme 
Court  chief  justice,  John  Marshall,  explicitly  devel¬ 
oped  and  applied  the  implied  power  to  declare  void 
a  federal-level  statute  for  the  first  time  in  1803,  in 
the  case  of  Marbury  v.  Madison,  5  U.S.  137. 


Switzerland’s  1848  constitution  also  spelled  out 
a  power  in  its  Supreme  Tribunal  to  declare  void 
cantonal  law  on  grounds  of  conflict  with  the 
national  constitution.  On  the  other  hand,  it 
refrained  from  allowing  judicial  review  over  laws 
produced  by  the  national  parliament.  Instead,  such 
laws  could  be  challenged  by  a  process  of  national 
referendum. 

The  Argentina  Constitution  of  1853,  closely 
modeled  on  the  U.S.  Constitution,  and  then  also 
its  amended  version  in  1860,  allowed  for  judicial 
review  of  national-level  law.  The  Argentine 
Supreme  Court  plainly  so  interpreted  it  in  1864, 
in  the  case  Fiscal  v.  Calvete,  1  Fallos  340,  and  did 
declare  void  a  congressional  statute  in  1888  (in 
Municipalidad  de  la  Capital  v.  Elortondo,  33 
Fallos  162).  This  court  then  implemented  its 
power  on  a  regular  basis  until  1930,  but  from 
1930  to  1983,  the  country  was  frequently  ruled  by 
a  military  dictatorship  that  in  some  periods  domi¬ 
nated  the  court. 

Then  in  1866,  via  judicial  precedent,  the  judi¬ 
ciary  of  Norway  acquired  the  power  of  judicial 
review  over  national  legislation.  As  of  World  War 

I,  then,  the  courts  of  the  United  States,  Norway, 
and  Argentina  had  this  power.  (Icelandic  courts 
did  strike  down  a  royal  decree  in  1877  but  did  not 
strike  down  a  national  statute  until  1943.)  In  the 
interwar  period,  the  Weimar  Republic,  Austria, 
Spain,  and  some  of  the  states  of  Central/Eastern 
Europe  had  specialized  constitutional  courts  to 
exercise  judicial  review,  but  all  these  were  ended 
by  the  wartime  constitutions.  The  paucity  of  coun¬ 
tries  with  judicial  review  changed  after  World  War 

II,  at  first  gradually  and  then  quite  rapidly. 

During  the  1940s  and  1950s,  there  was  a 

postwar  wave  of  reconstruction  constitutions  that 
instituted  judicial  review;  these  included  the  consti¬ 
tutions  of  Austria,  Italy,  Germany,  France,  and 
Japan.  The  decolonization  of  Africa  and  Asia  in  the 
1950s  and  1960s  brought  judicial  review  in  several 
of  the  newly  independent  constitutions  in  Africa 
and  Asia.  Democratization  in  Southern  Europe 
brought  judicial  review  to  Spain,  Portugal,  and 
Greece  in  the  1970s  and,  then,  in  the  late  1980s  and 
early  1990s,  to  new  democracies  in  the  Republic  of 
South  Africa  and  in  several  Latin  American  coun¬ 
tries.  Yet  another  wave  came  along  in  the  1990s, 
as  the  Soviet,  Soviet  bloc,  and  Yugoslavian  repub¬ 
lics  adopted  liberal  democratic  constitutions  that 
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included  judicial  review.  As  part  of  no  specific 
trend,  several  additional  countries  in  the  period 
between  1979  and  1994  adopted  new  constitutions 
or  new  constitutional  guarantees  of  fundamental 
rights  to  be  enforced  via  judicial  review.  These 
included  Sweden  (1979),  Egypt  (1980),  Canada 
(1982),  Belgium  (1985),  New  Zealand  (1990), 
Mexico  (1994),  and  Israel  (1992-1995).  By  2004, 
80  countries  in  the  world  had  adopted  a  system  of 
judicial  review. 

There  are  two  basic  types  of  intracountry  judi¬ 
cial  review.  The  first  follows  the  U.S.  model,  allow¬ 
ing  all  judges  to  declare  void  a  law  that  conflicts 
with  the  constitution.  The  second  was  designed  by 
Hans  Kelsen  for  Austria  in  the  interwar  years.  This 
model  sets  aside  a  particular  specialized  court  or 
council,  the  Constitutional  Court,  which  is  given  a 
monopoly  on  the  power  of  judicial  review  and 
which  exists  only  to  exercise  that  power.  Until  the 
late  1970s,  the  U.S.  type  of  judicial  review  was  the 
more  prevalent,  being  used  in  about  three  fourths 
of  the  judicial  review  countries.  Since  that  time,  the 
more  recent  democracies  have  tended  toward  the 
Kelsenian  approach,  adopting  a  single  constitu¬ 
tional  court. 

In  the  1960s,  yet  another  form  of  judicial  review 
burst  forth  on  the  world — judicial  review  by  a 
transnational  court.  The  European  Court  of  Justice 
(ECJ)  had  been  established  to  enforce  the  three 
1950s  treaties  that  bound  together  the  European 
Communities  (since  1992,  called  the  European 
Union  [EUJ):  The  European  Coal  and  Steel  Treaty 
of  1952  formed  the  European  Coal  and  Steel 
Community;  the  Treaty  of  Rome  of  1957  formed 
the  European  Economic  Community;  and  the 
Euratom  Treaty  of  1957  formed  the  European 
Atomic  Energy  Community.  This  court,  the  ECJ, 
declared  in  the  case  Costa  v.  ENEL  (1964)  not 
only  that  the  treaties  had  wiped  out  any  prior  con¬ 
flicting  legislation  of  the  member  countries  but 
also  that  these  treaties  now  would  function  as 
higher  law,  making  void  also  any  subsequent 
member  country  legislation  contrary  to  them.  In 
1970,  the  case  of  International  Handelsgesellscbaft 
v.  EVGF  extended  this  principle  even  to  member- 
country  constitutional  provisions  that  might  con¬ 
flict  with  any  of  the  European  Community  treaties. 
According  to  these  rulings  of  the  ECJ,  ordinary 
judges  in  the  member  countries,  even  in  countries 
that  did  not  previously  allow  judicial  review  of 


statutes,  were  now  obliged  to  give  European-level 
law  “precedence”  over  conflicting  member-state 
law.  And  they  did.  As  of  2008,  there  are  27  mem¬ 
bers  in  the  EU. 

Then,  in  1998,  the  European  Court  of  Human 
Rights,  which  is  the  court  for  upholding  the  1950 
European  Convention  for  the  Protection  of  Human 
Rights  and  Fundamental  Freedoms  of  the  Council 
of  Europe  (a  47-member  body  that  stretches  from 
the  United  Kingdom  and  Ireland  to  Azerbaijan  and 
Russia,  and  from  Norway  to  Greece  and  Turkey), 
was  made  a  permanent  court  with  compulsory 
jurisdiction  over  each  member  country  and  to 
whom  individuals  with  human  rights  complaints 
were  given  direct  access.  Since  that  time,  this  court 
too  has  been  exercising  genuinely  effective  judicial 
review  of  a  transnational  variety  with  power  to 
demand  legislative  remedies  to  Convention  viola¬ 
tions  in  the  member  countries  as  well  as  the  power 
to  insist  that  member  governments  issue  financial 
compensation  to  litigants  whose  human  rights 
have  been  violated. 

These  two  European  courts  exercise  most 
plainly  the  power  of  transnational  judicial  review, 
but  there  are  some  two  dozen  other  international 
courts,  many  of  whom,  to  one  degree  or  another, 
exercise  softer  versions  of  judicial  review.  Judicial 
review  on  the  world  stage  is  a  rapidly  evolving 
phenomenon  as  of  the  2 1st  century  and  will  war¬ 
rant  the  attention  of  political  scientists  well  into 
the  future. 
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Judicial  Systems 


Although  the  core  of  judicial  systems  is  the  trial, 
pretrial  practices  dispose  of  most  potential  litiga¬ 
tion.  Social,  economic,  or  political  pressures  or 
lack  of  knowledge  or  resources  will  inhibit  most 
aggrieved  parties  from  bringing  suits.  Even  among 
those  suits  seriously  contemplated  or  filed,  most 
will  be  resolved  by  voluntary  settlement,  plea  bar¬ 
gaining,  mediation,  arbitration,  consent  decrees, 
administrative  hearings,  or  decision  by  clan,  guild, 
village,  market,  business,  or  religious  “courts” 
that  either  operate  parallel  to  or  are  embedded  in 
the  official  judicial  system. 

Access  to  trial  is  also  typically  narrowed  by 
“standing”  (only  those  who  can  show  that  they 
have  been  directly  and  seriously  injured  may  sue 
the  alleged  wrongdoer)  and  other  rules,  such  as  a 
requirement  that  those  in  dispute  with  the  govern¬ 
ment  exhaust  their  administrative  remedies  before 
resorting  to  the  judiciary  or  that  suits  be  filed 
within  a  certain  time  after  the  injury  or  offense 
(statutes  of  limitations).  In  some  legal  systems, 
under  certain  circumstances,  however,  standing 
rules  may  be  relaxed  or  not  applied  at  all,  for 
instance,  when  a  party  alleges  that  a  particular 
statute  violates  constitutionally  protected  individ¬ 
ual  rights  or  some  action  harms  the  environment. 

The  economic,  social,  and  psychological  costs  to 
persons  brought  to  criminal  trials  is  so  severe,  even 
if  they  are  not  convicted,  that  many  criminal  justice 
systems  provide  for  some  kind  of  “trial  before  the 
trial”  to  determine  whether  actual  prosecution  is 


justified,  such  as  grand  jury  indictment,  arraign¬ 
ment,  or  preliminary  investigation  by  a  judge. 

Trial  courts  may  consist  of  a  moot  or  whole 
community  sitting  as  judge,  a  single  professional 
or  lay  judge,  a  panel  of  professional  or  lay  judges, 
a  mixture  of  professionals  and  lay  persons,  or  a 
judge  and  jury.  Practices  may  be  designed  to  bring 
all  factual  and  legal  issues  together  at  a  single  time 
and  place  to  be  resolved  in  an  oral  proceeding,  or 
a  trial  may  be  a  file  of  papers  developed  over  an 
extended  time  period  as  written  records  of  witness 
statements,  investigative  reports,  and  relevant 
documents  such  as  deeds  and  contracts  until  final 
disposition  occurs  through  the  delivery  of  a  judi¬ 
cial  verdict  in  court. 

Proceedings  are  typically  governed  by  rules 
specifying  who  may  participate  in  what  order,  rules 
of  decorum,  rules  of  evidence  defining  what 
testimony  and  documents  will  be  admitted  as  suf¬ 
ficiently  reliable,  and  relevant  standards  of  proof 
such  as  the  “beyond  reasonable  doubt”  rule  or  the 
Islamic  rule  that  conviction  requires  testimony  of 
guilt  by  two  witnesses.  Confessions,  either  volun¬ 
tary  or  induced  by  coercive  practices,  are  a  major 
feature  of  most  criminal  justice  systems.  Such 
systems  are  often  classified  as  adversarial  or  inquis¬ 
itorial  depending  on  the  degree  to  which  the  case  is 
developed  by  the  prosecutor  and  defense  counsel  or 
by  the  judges  themselves.  Both  civil  and  criminal 
justice  systems  have  historically  varied  enormously 
in  the  degree  to  which  judges  actively  intervene  in 
case  development.  In  Imperial  China,  lawyers  did 
not  exist,  and  the  magistrate  was  solely  responsible 
for  case  development  with  even  the  parties  playing 
little  role  except  as  severely  coerced  witnesses.  In 
the  United  States,  judges  are  expected  to  remain 
largely  passive  recipients  of  the  lawyers’  presenta¬ 
tions.  In  other  systems  such  as  the  English,  lawyers 
present  the  cases  for  each  side,  but  judges  intervene 
fairly  actively. 

Judicial  systems  are  often  organized  in  a  single 
pyramid  or  hierarchy  with  many  local  trial  courts, 
a  certain  number  of  which  are  clustered  under  an 
appeals  court;  these  intermediate  appeals  courts 
are  clustered  under  the  highest  appellate  court. 
Sometimes  there  are  separate  hierarchical  systems 
of  civil,  criminal,  administrative,  commercial, 
labor,  and/or  religious  courts  or  separate  trial 
courts,  such  as  small-claims  or  juvenile  courts, 
under  the  supervision  of  the  general  courts.  In 
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some  instances,  there  is  a  separate  court  with 
exclusive  jurisdiction  over  constitutional  issues, 
and  some  higher  courts  also  serve  as  trial  courts 
for  particularly  important  matters  such  as  impeach¬ 
ment  of  government  officials  or  suits  between  two 
member  states  of  a  federal  system.  Appellate 
courts  almost  always  consist  of  a  panel  of  three  or 
more  judges.  One  function  of  appeal  is  to  protect 
parties  against  trial  court  errors.  Another  is  to 
serve  central  political  authorities  by  ensuring  that 
laws  made  by  them  are  effectively  and  uniformly 
enforced  by  the  web  of  local  trial  courts. 

The  principal  function  of  courts  is  often  said  to 
be  the  resolution  of  disputes  between  two  parties, 
deciding  that  one  party  was  right  and  the  other 
wrong,  according  to  preexisting  legal  rules.  In 
reality,  many  cases  really  involve  not  simply  two 
individuals  but  large  clashing  social,  economic, 
or  political  interests;  many  are  resolved  by  inter¬ 
mediate  decisions  under  which  both  parties  gain 
something  and  lose  something,  and  some  are 
resolved  by  rules  that  the  judges  have  just  con¬ 
structed.  Judicial  discretion  in  the  making  of  new 
legal  rules  is  greatly  constrained  by  many  factors, 
including  the  need  to  explain  their  decisions  in 
published  “opinions,”  the  existence  of  other  more 
politically  powerful  lawmakers  such  as  legislatures 
and  executives,  public  opinion,  courts’  extremely 
limited  means  of  enforcing  their  judgments,  the 
degree  of  completeness  and  specificity  of  existing 
legal  rules,  and  professional  or  craft  standards  of 
permissible  legal  reasoning  and  judicial  creativity. 
Nevertheless,  in  many  polities,  judicial  systems 
have  been  a  significant  source  of  new  public  poli¬ 
cies  and  thus  sometimes  of  political  controversy. 

Some  degree  of  judicial  law  making  character¬ 
izes  all  judicial  systems,  not  necessarily  because 
judges  consciously  seek  law-making  power  but 
because  acceptance  of  judicial  decisions  is  most 
likely  when  they  are  purportedly  the  judicial  appli¬ 
cation  of  preexisting  legal  rules  rather  than  judicial 
fiats.  Thus,  in  cases  where  there  is  no  relevant  pre¬ 
existing  law  or  that  law  is  too  general  or  unclear  to 
prescribe  the  outcome,  judges  themselves  will  be 
moved  to  generate  a  decisive  legal  rule  attributing 
its  origin  not  to  themselves  but  to  judicial  “inter¬ 
pretation”  of  the  body  of  preexisting  law. 

Because  courts  are  necessarily  simultaneously 
dispute  resolvers  and  lawmakers,  judicial  selection 
is  difficult.  Judicial  neutrality  and  independence 


would  seem  essential  if  courts  are  to  successfully 
resolve  disputes.  Yet,  as  lawmakers,  judges  ought 
not  to  be  independent  but  rather  politically  respon¬ 
sible  to  the  people  or  their  elected  representatives  in 
a  democracy  and  to  whoever  holds  political  author¬ 
ity  in  nondemocracies.  This  insoluble  paradox 
results  in  judicial  selection  processes  that  range 
from  direct  election  for  short  terms  of  judicial  can¬ 
didates  nominated  by  political  parties  to  appoint¬ 
ment  and  dismissal  of  judges  at  the  pleasure  of  an 
authoritarian  ruler  or  selection  for  life  terms  by  an 
independent  council  consisting  of  incumbent  judges 
and  distinguished  legal  scholars,  with  every  imagin¬ 
able  combination  of  independence  leaning  and 
political  accountability  devices  in  between. 

It  is  often  claimed  that  judicial  systems  are  sepa¬ 
rate  from  political  systems,  but  more  realistically 
speaking,  they  are  treated  as  subsystems  of  the 
political.  While  courts  are  frequently  associated 
with  the  protection  of  individual  rights,  most  polit¬ 
ical  regimes,  even  those  with  little  regard  for  such 
rights,  have  invested  in  judicial  services.  They  do  so 
not  only  because  conflict  resolution  fosters  eco¬ 
nomic  activity  and  social  peace  but  because  courts 
serve  as  instruments  of  social  control  enforcing 
whatever  policies  the  regime  enacts  into  law.  The 
appellate  hierarchies  described  earlier  are  channels 
for  transmitting  to  the  localities  and  enforcing  laws 
made  by  central  political  authorities.  To  the  extent 
that  judges  themselves  make  law,  they  themselves 
are  active  participants  in  the  public  policy¬ 
making — that  is,  the  political — process.  There  is  a 
perennial  argument  over  how  active  judges  ought 
to  be  in  policy  making,  but  some  level  of  judicial 
lawmaking,  and  thus  judicial  politics,  is  inevitable. 

It  is  argued  by  some  that  judges  engage  in  a 
unique  and  superior  method  of  decision  making. 
Because  they  proceed  case  by  case  with  regard  for 
stability,  continuity,  and  consistency  in  the  case 
law,  their  decisions  are  claimed  to  be  pragmati¬ 
cally  rooted  in  the  particular,  concrete  realities 
presented  in  individual  cases  rather  than  generated 
by  abstract  and  general  prognostication  not  rooted 
in  reality.  Because  they  know  that  the  rule  they 
announce  in  one  case  will  probably  be  used  in  sub¬ 
sequent  cases  and  because  most  serve  for  relatively 
long  terms,  they  are  said,  unlike  politicians  desper¬ 
ately  concerned  to  survive  the  next  election  or  cri¬ 
sis,  to  pursue  long-term  public  values  rather  than 
their  personal,  immediate  survival  in  office.  Others 
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argue  that  case-by-case  decision  making  is  simply 
the  judicial  version  of  the  incremental  decision 
strategies  used  by  all  public  policymakers,  that 
precisely  because  judges  know  that  the  rule  they 
make  in  one  case  today  may  be  used  in  future, 
somewhat  different,  and  somewhat  unpredictable 
cases,  they  mix  concrete,  immediate  considerations 
with  more  general,  probabilistic  considerations  of 
how  a  current  rule  might  work  in  the  future  just  as 
legislators,  administrators,  and  political  executives 
do,  and  that  even  those  judges  who  anticipate  long 
terms  are  subject  to  the  pressures  of  immediate 
events  and  current  crises  that  play  on  other  law¬ 
makers.  There  is  certainly  much  empirical  data 
indicating  that  judges’  decision  making  is  signifi¬ 
cantly  influenced  by  their  political  ideologies  and 
affiliations  and  their  personal  policy  preferences, 
but  such  studies  do  not  resolve  the  question  of 
whether  judges  enjoy  a  decision  method  different 
from  or  superior  to  those  of  other  lawmakers. 

Effective  judicial  systems  are  often  touted  as 
routes  to  economic  and  political  development 
because  they  provide  security  of  expectations  for 
investors  and  protect  individual  rights.  Yet,  under 
some  circumstances,  investment  can  be  attracted 
by  the  expectation  of  high  returns  even  in  the 
absence  of  court-provided  security,  and  authori¬ 
tarian  regimes  can  use  judicial  processes  for  repres¬ 
sion  rather  than  protection  of  individuals. 

It  is  widely  believed  that,  beginning  in  the  latter 
half  of  the  20th  century,  the  courts  have  played  an 
increasingly  active  role  in  public  policy  making  (the 
so-called  judicialization  of  politics)  through  their 
powers  of  constitutional  judicial  review,  review  of 
the  lawfulness  of  administrative  actions,  and  gen¬ 
eral  statutory  interpretation,  although  it  is  not  clear 
that  courts  were  less  active  in  earlier  periods. 

Martin  Shapiro 
University  of  California,  Berkeley 
Berkeley,  California,  United  States 

See  also  International  Law;  Judicial  Decision  Making; 
Natural  Law;  Rule  of  Law;  Soft  Law;  Voting  Rules, 
Legislative 


Further  Readings 

Martin,  S.  (1981).  Courts.  Chicago:  University  of 
Chicago  Press. 


Martin,  S.,  &  Sweet,  A.  S.  (2002).  Law,  politics  and 
judicialization.  Oxford,  UK:  Oxford  University  Press. 
Mirjan,  R.  D.  (1986).  The  faces  of  justice  and  state 
authority.  New  Haven,  CT:  Yale  University  Press. 


Judicialization  of 
International  Relations 


The  judicialization  of  international  relations  refers 
to  the  political  reality  that  judicial  actors  are 
increasingly  involved  in  defining  what  interna¬ 
tional  agreements  mean.  Judicialization  is  an 
aspect  of  legalization  of  policy  and  politics  in 
which  politicians  conceive  of  their  policy  and  leg¬ 
islative  options  as  bounded  by  what  is  legally 
allowed  and  where  citizens,  organizations,  and 
firms  see  law  as  conferring  on  them  rights  that 
others  may  not  abrogate.  Judicialization  occurs 
when  courts  gain  authority  to  define  what  the  law 
means.  Where  judges  have  the  authority  to  say 
what  the  law  means  and  where  litigation  becomes 
a  useful  way  to  reopen  political  agreements, 
judges  become  de  facto  lawmakers.  Negotiations 
among  actors  become  debates  about  what  is 
legally  permissible,  and  politics  takes  place  in  the 
shadow  of  courts  with  the  lurking  possibility  of 
litigation  shaping  actor  demands  and  political 
outcomes. 

Most  people  used  to  think  that  law  only  existed 
where  there  was  coercive  power  to  enforce  it,  and 
thus,  judicialization  of  politics  was  only  possible 
within  states.  But  increasingly,  international  rela¬ 
tions  have  become  both  legalized  and  judicialized. 
This  entry  reviews  how  international  relations 
came  to  be  part  of  the  domain  of  courts,  discusses 
how  the  judicialization  of  international  relations 
has  changed  international  politics,  and  identifies 
where  judicialized  international  relations  are  more 
and  less  prevalent. 

How  Did  International  Relations 
Become  Judicialized? 

In  the  age  of  monarchies,  what  we  today  call  inter¬ 
national  law  consisted  primarily  of  agreements 
among  kings  and  queens,  enforceable  through 
reciprocity  by  state  leaders.  This  old  conception  of 
international  law  still  exists  in  public  law  theory 
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and  in  international  relations  approaches  that  see 
international  law  as  mere  contracts  between  states. 
This  view  of  international  law  is  still  voiced  by 
governments  when  they  assert  that  they  have  abso¬ 
lute  sovereignty.  But  these  are  old-fashioned  con¬ 
ceptions  of  international  law,  relics  of  another 
time  and  place.  This  “old  terrain”  of  international 
law  started  to  recede  around  the  turn  of  the  20th 
century  when  domestic  courts  began  hearing  cases 
of  alleged  violations  of  what  were  largely  unwrit¬ 
ten  international  rules  and  when  countries  started 
negotiating  and  ratifying  more  detailed  interna¬ 
tional  treaties. 

Since  the  end  of  the  Cold  War,  international  law 
and  judicialized  politics  have  entered  a  fundamen¬ 
tally  new  terrain  where  international  treaties  create 
binding  laws,  creating  duties,  expectations,  and 
rights  that  penetrate  the  surface  of  the  state.  The 
new  terrain  of  international  law  is  the  result  of  a 
double  shift  that  has  accelerated  since  1945.  One 
force  leading  to  this  new  terrain  is  the  substantive 
expansion  of  international  law  to  cover  issues  that 
were  traditionally  governed  exclusively  by  states. 
This  expansion  has  been  fueled  by  the  rights  revo¬ 
lution  and  the  rising  trend  of  governments  exerting 
an  extraterritorial  reach  to  domestic  laws.  The 
rights  revolution,  which  took  off  in  the  post-World 
War  II  era,  is  based  on  the  premise  that  individuals 
have  certain  rights  even  if  domestic  constitutions 
or  statutes  do  not  explicitly  recognize  these  rights. 
The  United  Nations  General  Declaration  of  Human 
Rights  in  1948  created  an  aspirational  goal,  as  the 
declaration  itself  was  nonbinding.  Civil  rights 
movements  of  the  1960s  pushed  forward  the  devel¬ 
opment  of  binding  international  laws  and  treaties 
explicitly  addressed  to  individuals.  These  rights  are 
universal;  thus,  they  do  not  depend  on  national 
ratification  to  exist.  Meanwhile,  governments  in  a 
number  of  capitalist  countries  constructed  a  law- 
based  global  international  trade  regime,  and  they 
expanded  the  extraterritorial  reach  of  domestic 
property  and  antitrust  rules  so  as  to  protect  domes¬ 
tic  economic  interests  and  shape  the  way  global 
forces  affect  domestic  markets.  With  powerful 
countries  already  asserting  an  international  reach 
to  their  global  and  domestic  economic  rules,  the 
idea  of  increasingly  coordinated  international  rules 
became  a  least  bad  alternative,  even  if  these  rules 
touch  on  issues  that  in  theory  remained  the  exclu¬ 
sive  prerogative  of  states. 


The  result  of  these  trends  is  that  international 
law  increasingly  speaks  about  how  governments 
should  treat  their  citizens,  which  goods  are  allowed 
into  national  markets  and  on  what  terms,  how 
many  environmentally  destructive  products  states 
should  allow  their  firms  and  people  to  create,  and 
so  on.  While  we  count  on  governments  to  imple¬ 
ment  these  rules,  increasingly  substate  actors  are 
going  to  the  source  of  the  rules,  invoking  interna¬ 
tional  laws  and  regulations  directly  in  national 
contexts.  If  individuals,  nongovernmental  organi¬ 
zations,  and  firms  see  governments  as  having 
duties  and  themselves  as  having  rights  based  on 
international  law,  and  if  governments  start  to 
interact  with  substate  actors  as  if  they  have  rights 
under  international  law,  then  both  domestic  and 
international  politics  move  into  the  new  terrain  of 
legalized  international  politics. 

The  second  shift  is  a  result  of  the  growth  of 
multilateral  institutions.  In  very  real  and  concrete 
ways,  multilateral  institutions  have  gained  gover¬ 
nance  roles.  As  executive  bodies,  multilateral 
institutions  propose  legislation  and  convene  meet¬ 
ings  where  representatives  of  states  agree  to 
collective  rules  that,  once  ratified,  can  become 
binding  both  on  signatories  and  nonsignatories 
alike.  Some  multilateral  institutions  at  the  regional 
level  (e.g.,  the  European  Union  [EU])  have  legisla¬ 
tive  abilities  wherein  councils  of  state  representa¬ 
tives  can  pass  legislation  that  does  not  require  the 
traditional  step  of  national  ratification  for  the 
legal  rule  to  be  binding.  Multilateral  institutions 
also  have  been  granted  administrative  roles,  with 
an  international  secretariat  issuing  binding  inter¬ 
pretations  of  rules  and  decisions  related  to  imple¬ 
menting  international  rules.  Combined  with  the 
first  trend — the  substantive  expansion  of  national 
and  international  law — the  trend  of  delegating  a 
variety  of  powers  to  multilateral  institutions 
means  that  increasingly  there  is  governance  that 
does  not  depend  on  national  legislative  consent 
for  its  authority  and  that  exists  largely  outside  of 
domestic  judicial  oversight. 

These  two  factors  have  contributed  to  the 
growth  of  creating  and  using  international  courts 
(ICs).  It  used  to  be  that  domestic  courts  were  the 
primary  enforcers  of  international  rules.  Even  with 
the  rise  of  powerful  and  active  constitutional 
courts  in  many  countries,  domestic  enforcement 
remains  uneven  because  national  judges  are 


Judicialization  of  International  Relations  1379 


inclined  to  see  international  law  as  external  to  the 
national  legal  order  and  international  relations  as 
falling  into  the  realm  of  executive  prerogative.  As 
multilateral  institutions  have  gained  more  powers, 
states  have  turned  to  ICs  to  oversee  the  exercise  of 
this  power. 

The  end  of  the  Cold  War  ushered  in  a  revolu¬ 
tion  in  the  creation  and  use  of  ICs.  It  disrupted 
traditional  alliances.  Former  Soviet  satellites 
rushed  to  join  the  international  institutions  of 
Western  states,  and  states  that  had  relied  on 
Soviet  support  had  to  seek  new  patrons.  Around 
the  world,  states  embraced  the  trappings  of  liberal 
democracy — free  markets,  human  rights,  and 
open  trading  systems.  These  changes  led  to  the 
creation  of  a  slew  of  new  ICs,  reforms  to  existing 
ICs,  and  an  expansion  of  membership  in  a  number 
of  international  legal  institutions. 

We  can  literally  see  the  judicialization  of  inter¬ 
national  politics.  In  1985,  there  were  eight  ICs  that 
met  the  Project  on  International  Courts  and 
Tribunals’  definition  of  an  IC  as 

1.  a  permanent  legal  body, 

2.  composed  of  independent  judges, 

3.  hearing  legal  cases  in  which  one  of  the  parties  is 
a  state  actor  or  an  international  organization 

(IO), 

4.  deciding  on  the  basis  of  predetermined  rules, 
and 

5.  issuing  binding  legal  rulings. 

By  2009,  there  were  20  active  ICs  meeting  this 
definition  and  8  more  in  various  stages  of  develop¬ 
ment.  These  “new”  ICs  are  not  only  recent  cre¬ 
ations,  but  they  are  also  qualitatively  different 
entities.  Newer  ICs  are  more  likely  to  have  com¬ 
pulsory  jurisdiction,  meaning  states  do  not  need  to 
consent  to  litigation  for  the  case  to  proceed.  They 
also  tend  to  allow  private  access  or  access  for  inter¬ 
national  nonstate  actors  to  initiate  litigation,  even 
though  most  observers  agree  that  these  design  fea¬ 
tures  make  ICs  more  independent  and  more  likely 
to  make  a  ruling  on  cases  in  which  a  government 
is  an  unwilling  participant.  The  new  ICs  are  far 
more  likely  to  be  activated,  which  explains  in  part 
why  IC  usage  has  also  increased.  By  2007,  ICs  had 
issued  more  than  30,000  binding  legal  rulings 


where  an  IO  or  state  actor  was  the  defendant. 
These  rulings  are  not  only  mostly  routine,  but  they 
also  fill  gaps  in  the  law  and,  at  times,  radically 
reinterpret  law  on  the  books  so  that  ICs  end  up  as 
lawmakers  with  significant  power  to  influence 
domestic  and  international  outcomes. 

The  ingredients  for  judicialized  international 
relations  are  now  in  place.  We  have  an  extensive 
body  of  binding  and  fairly  precise  international 
legal  rules.  We  have  state  and  nonstate  actors  who 
see  these  rules  as  conferring  rights  and  duties  on 
state  actors.  We  have  courts — both  domestic  and 
international — with  the  capacity  to  interpret  and 
enforce  these  rules  in  cases  raised  by  states,  private 
firms,  individuals,  and  international  prosecutor- 
type  actors. 

How  Does  the  Judicialization  of 
International  Relations  Affect 
International  Politics? 

It  is  important  to  recognize  that  judicialized  poli¬ 
tics  is  not  the  same  thing  as  courtroom  politics. 
Legalized  politics  involves  the  extension  of  legal 
rhetoric  and  legal  calculations  into  the  larger  realm 
of  politics.  When  courts  start  to  issue  decisions 
that  interpret  rules,  judicialized  politics  extends  to 
politicking  over  new  rules,  actor  decisions  regard¬ 
ing  different  courses  of  law-compliant  action,  and 
negotiations  to  resolve  disagreements  about  exist¬ 
ing  rules. 

During  treaty  negotiations,  judicialization  of 
international  relations  mainly  affects  politics  by 
creating  a  web  of  commitments  that  constrain  new 
policy  making.  Certain  international  treaties  exert 
a  constitutional  influence  over  bilateral  and  multi¬ 
lateral  treaty  negotiations  and  domestic  policy 
making.  For  example,  the  World  Trade  Organi¬ 
zation’s  framework  for  free  trade  casts  a  shadow 
over  domestic  and  international  negotiations 
regarding  trade  issues  because  any  new  agreement 
that  conflicts  with  the  rules  of  the  World  Trade 
Organization  can  give  rise  to  a  legal  challenge. 
International  human  rights  law  also  exerts  a  con¬ 
stitutional  influence  in  that  new  international  and 
domestic  policies  must  respect  what  are  seen  as 
elemental  individual  rights. 

Judicialized  politics  play  an  even  larger  role  dur¬ 
ing  the  implementation  stage,  when  actors  are 
applying  existing  rules  on  a  case-by-case  basis. 
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Legalization’s  largest  although  least  perceptible 
influence  is  in  shaping  how  actors  make  choices. 
For  most  actors,  the  first  step  involves  figuring  out 
what  the  law  allows  or  requires.  People  usually 
choose  legal  action  because  law  provides  a  starting 
place  to  make  decisions  and  because  law-compliant 
behavior  is  safe.  Following  the  law  becomes  the 
default  condition;  one  usually  needs  a  reason  to 
choose  a  riskier  illegal  action  over  what  the  law 
prescribes.  Even  when  actors  choose  to  violate  the 
law,  they  will  attempt  to  bring  legal  understand¬ 
ings  with  them.  For  example,  when  the  U.S.  gov¬ 
ernment  under  the  leadership  of  President  George 
W.  Bush  sought  to  escape  well-understood  legal 
obligations  under  the  Geneva  Conventions  (1949) 
and  the  UN  Convention  Against  Torture,  it  asked 
for  legal  briefs  that  suggested  that  its  policy  choices 
were  actually  consistent  with  what  the  law  required. 

Law  also  becomes  a  tool  used  by  parties  when 
disagreements  arise.  At  this  point,  state  discretion 
can  be  constrained  by  judges  (domestic  and  inter¬ 
national)  interpreting  and  enforcing  the  law  as 
they  see  it.  Law  becomes  a  framework  that  the 
parties  use  to  figure  out  what  arrangements  are 
reasonable.  Law  prescribes  the  duties  owed  and 
the  rights  that  can  be  claimed.  Again,  we  see  that 
judicialized  politics  are  not  per  se  courtroom  poli¬ 
tics.  Litigation  is  meant  to  be  a  last  resort  and  a 
fairly  rare  phenomenon.  Instead,  the  prospect  of 
litigating  a  disagreement  casts  a  shadow  over  out- 
of-court  negotiations  regarding  legal  compliance. 
Actors  are  supposed  to  adjust  their  behaviors  both 
because  they  prefer  to  be  law-abiding  (being  law- 
abiding  is  seen  as  more  legitimate)  and  to  avoid  a 
legal  suit.  Where  legal  violations  exist,  threatening 
litigation  becomes  a  useful  bargaining  tactic,  as 
most  actors  will  prefer  settlement  rather  than  pub¬ 
lic  legal  sanction  and  with  it  a  binding  legal  prec¬ 
edent.  Judicialized  politics  is  therefore  present  in 
the  many  debates  and  arguments  over  what  the 
existing  set  of  rules  do  and  do  not  allow. 

A  proportionally  small  number  of  disputes 
will  end  up  in  front  of  legal  enforcement  bodies — 
international  or  domestic  courts,  monitoring  and 
enforcement  bodies,  or  arbitral  bodies.  Rising  liti¬ 
gation  rates  involving  international  agreements 
suggest  that  litigation  is  a  useful  political  tool,  but 
it  is  hard  to  assess  how  effective  litigation  is.  The 
effects  of  litigation  cannot  really  be  measured 
because  the  many  suits  that  settle  out  of  court  are 


systematically  excluded  in  any  examination  of 
compliance  with  the  law.  The  cases  we  can 
observe — those  that  reach  the  legal  ruling  stage  in 
a  court — tend  to  be  the  most  difficult,  those  where 
the  parties  refuse  a  settlement,  and  thus  where 
compliance  with  the  law  is  the  least  likely.  Studies 
on  compliance  with  IC  rulings  find  that  a  vast 
majority  of  IC  rulings  elicit  full  or  partial  compli¬ 
ance.  Even  if  rulings  do  not  elicit  immediate  or  full 
compliance,  IC  actions  cast  a  shadow  over  subse¬ 
quent  politics.  The  fact  that  an  IC  has  declared  one 
interpretation  of  the  rules  “legal”  and  another 
“illegal”  affects  perceptions  regarding  the  rectitude 
of  certain  behaviors.  State  and  nonstate  actors  may 
become  less  willing  to  abide  by  bargains  and  poli¬ 
cies  that  have  been  named  illegal.  Over  time,  law¬ 
breakers  may  find  that  it  is  simply  easier  to  create 
a  legal  solution  to  end  a  conflict.  The  legal  solution 
may  involve  compliance  with  the  law  as  inter¬ 
preted  by  the  IC,  writing  new  legislation  to  create 
a  new  definition  of  what  is  “legal,”  or  creating  a 
side  bargain  in  which  the  victims  of  the  legal 
breach  are  compensated  in  exchange  for  dropping 
their  legal  claims.  All  of  these  shifts  from  straight- 
out  noncompliance — settlement,  changing  the  law 
to  unseat  a  court  ruling,  compensation,  and  com¬ 
pliance — reflect  judicialized  international  relations. 

Where  Are  Judicialized  International 
Relations  More  and  Less  Likely? 

A  well-known  saying  is  that  “all’s  fair  in  love  and 
war.”  But  with  judicialized  politics,  actors  who 
break  a  love  contract  or  who  engage  in  war  can 
face  costly  litigation.  This  aphorism  does,  how¬ 
ever,  capture  the  reality  that  certain  domains  are 
more  law  constrained  than  others. 

Judicialized  politics  are  less  prevalent  with 
respect  to  new-issue  areas  where  there  are  few 
existing  legal  rules.  The  Internet,  for  example, 
experienced  a  period  of  time  without  legal  regula¬ 
tion  where  there  were  only  norms  of  conduct  but 
not  legal  rights  and  duties.  While  there  is  much 
discussion  about  international  environmental 
accords,  the  environment  remains  another  domain 
in  which  international  legal  obligations  barely 
exist. 

In  tightly  knit  social  settings  where  social  mores 
matter  more  than  law,  and  around  the  fringes  of 
political  life  where  social  opprobrium  counts  for 
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little,  social  interactions  tend  to  be  less  constrained 
by  law.  Thus,  we  find  that  in  many  areas  of  the 
world,  local  custom  matters  more  than  interna¬ 
tional  human  rights  law.  We  also  find  that  illegal 
realms,  such  as  the  black  market  or  drug  and 
human  trafficking,  operate  largely  outside  the  law 
and  legalized  discourse. 

The  realm  of  security  has  fewer  rules  limiting 
state  discretion,  although  over  time  this  policy 
domain  has  also  become  judicialized.  State  actors 
who  indulge  in  violence  across  borders,  who 
violate  promises  that  have  been  codified  via  inter¬ 
national  agreements,  or  who  commit  atrocities 
against  noncombatants  may  be  recipients  of  legally 
authorized  sanctions.  Other  states  may  refuse 
alleged  war  criminals,  the  citizens  of  states  engaged 
in  war,  or  the  regular  courtesies  of  international 
life,  such  as  international  travel,  the  landing  of 
airplanes,  and  trade  across  borders.  With  the  rise 
of  universal  jurisdiction  and  the  creation  of  the 
International  Criminal  Court,  perpetrators  of  war 
crimes  can  even  find  themselves  being  brought  to 
court. 

In  international  economic  relations,  interstate 
interaction  is  highly  legalized  and  judicialized.  The 
rules  for  creating  barriers  to  the  movement  of 
goods  and  services  are  detailed  and  known.  (World 
Trade  Organization  rules  constituted  a  starting 
template  for  most  international  trade  regimes.)  A 
state  that  violates  these  rules  will  usually  be  con¬ 
fronted  with  legal  claims,  legal  threats,  and  even 
litigation. 

The  final  area  of  international  politics  that  is 
increasingly  judicialized  is  human  rights.  Nongov¬ 
ernmental  actors  increasingly  monitor  state  actions, 
using  legal  claims  and  legalized  strategies  as  they 
expose  violations  of  individual  and  group  rights. 
While  activists  may  fail  to  sway  a  determined  gov¬ 
ernment,  outside  groups  would  have  no  right  or 
leverage  were  it  not  for  the  existence  of  binding 
international  duties  and  obligations.  Meanwhile, 
domestic  and  ICs  around  the  world  are  increasingly 
willing  to  help  protect  human  rights. 

This  discussion  reveals  that  judicialized  politics 
is  not  defined  by  law  obedience.  Instead,  judicial¬ 
ized  politics  is  defined  by  legal  politics,  where  the 
fights  are  about  what  constitutes  obedience  to  the 
law.  Where  politics  is  judicialized,  judges  have 
political  power  because  they  have  been  given  the 
final  authority  to  say  what  the  law  means.  Judges 


do  not,  however,  get  to  determine  what  the  law 
ultimately  becomes.  If  enough  actors  are  unhappy 
with  the  state  of  the  law,  they  will  change  the  law. 
Judges  will  then  get  another  chance  to  interpret 
what  the  new  law  means.  Through  a  recursive 
interaction  among  judges,  lawyers,  policymakers, 
and  the  people  ultimately  called  on  to  respect  the 
law,  the  meaning  of  the  law  gets  contested, 
defined,  and  reshaped.  Judicialized  politics  means 
that  judges  have  inserted  themselves  into  this 
recursive  political  dynamic  and  that  the  political 
status  quo  can  be  destabilized  by  litigants  making 
legal  appeals  to  judges. 

Conclusion 

Today  international  agreements  are  negotiated  by 
executive  branches,  written  down  and  registered, 
and  then  implemented  by  state  bureaucracies. 
Bureaucracies  usually  apply  existing  rules  in  cases 
involving  individuals.  Following  the  law  is  nearly 
always  the  simplest  and  safest  course  of  action, 
which  is  why  bureaucracies  will  follow  the  law, 
unless  there  is  a  compelling  reason  not  to. 

Before  courts  were  involved  in  interpreting 
international  rules,  bureaucracies  had  significant 
discretion  to  define  what  the  law  meant.  By  shap¬ 
ing  bureaucratic  behavior,  governments  controlled 
how  international  law  affected  domestic  policy. 
The  growing  trend  of  having  legal  bodies — domes¬ 
tic  and  international — interpret  and  apply  interna¬ 
tional  rules  has  contributed  to  the  judicialization 
of  international  politics.  Government  bureaucra¬ 
cies  are  increasingly  swayed  by  the  possibility  of 
international  review  of  their  decisions;  thus  they 
are  increasingly  willing  to  follow  IC  interpreta¬ 
tions  of  the  law  even  if  their  government  has  not 
explicitly  directed  them  to  do  so.  Now  that  courts 
can  pronounce  on  the  law  and  even  review  the 
actions  of  state  bureaucracies,  state  discretion  has 
diminished. 

Legal  obligations  exist  where  legal  rules  exist. 
Legislative  bodies  make  legal  rules,  and  where 
there  are  no  rules,  actors  retain  discretion.  This 
means  that  through  their  legislation,  legislative 
bodies  drive  where  and  when  international  rela¬ 
tions  become  legalized.  But  when  courts  became 
involved  in  interpreting  international  law,  judges 
start  to  create  law  to  fill  in  lacunae  that  legal  suits 
expose,  and  states  begin  to  lose  discretion  and 
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control  over  how  legal  rules  are  understood. 
Political  actors  will  need  either  to  make  codified 
law  that  is  more  to  their  liking  or  to  let  judges 
define  the  law.  The  ultimate  import  of  the  judicial- 
ization  of  international  relations  will  depend  on 
the  interaction  between  existing  laws,  the  willing¬ 
ness  of  litigants  to  invoke  the  law  and  pursue 
litigation,  and  the  behavior  of  judges. 

This  realm  of  judicialized  politics  has  been 
spreading  in  an  accelerated  fashion.  Many  factors 
account  for  its  spread.  Democracy  and  the  growth 
of  professional  bureaucracy  increase  the  number 
of  actors  who  participate  in  policy  making  and 
who  have  a  stake  in  the  rule  of  law,  both  domestic 
and  international.  Technology  aids  in  the  dis¬ 
semination  of  information  and  the  coordination 
of  actors  across  borders.  A  rising  trend  of  delegat¬ 
ing  authority  to  international  legal  bodies  increases 
the  venues  available  for  litigating.  The  increasing 
willingness  of  national  courts  to  weigh  in  regard¬ 
ing  international  legal  issues  has  also  increased 
the  prevalence  of  bargaining  in  the  shadow  of  a 
court. 

At  this  point,  even  if  governments  decided  to 
stop  using  law  and  litigation  as  a  tool  of  dispute 
resolution,  there  would  still  be  judicialized  interna¬ 
tional  relations.  Legal  rights  are  hard  to  take  back. 
International  law  provides  a  right  for  outsiders  to 
peek  over  borders  and  to  criticize  what  govern¬ 
ments  do  in  their  own  backyard,  and  judges  have 
gained  jurisdiction  regarding  international  legal 
issues.  At  the  same  time,  the  rule  of  law — both 
domestic  and  international — is  far  from  mono¬ 
lithic.  There  are  always  territories  and  issues  where 
one  finds  social  order  that  is  not  defined  by  law. 
The  next  challenge  for  students  of  judicialized 
international  relations  is  to  determine  where  and 
when  law  plays  a  greater  role  in  everyday  interna¬ 
tional  relations  and  to  better  understand  the 
factors  that  shape  how  judicialized  international 
politics  plays  out. 

Karen  J.  Alter 
Northwestern  University 
Evanston,  Illinois,  United  States 
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Judiciary 


All  modern  societies  tend  to  entrust  the  adjudica¬ 
tion  of  disputes  arising  from  the  application  of 
recognized  norms  to  a  specialized  actor,  the  judge. 
Collectively,  the  judges  are  designed  as  the  judi¬ 
ciary.  In  some  countries — such  as  France  and 
Italy — the  judiciary  also  includes  public  prosecu¬ 
tors  since  they  form  a  unitary  organization 
together  with  judges. 

Due  to  the  significance  of  the  adjudication,  the 
judiciary  tends  to  enjoy  a  special  position  in  most 
political  systems  and  especially  so  in  constitutional 
democracies.  In  most  political  systems,  since  the 
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middle  of  the  20th  century,  the  significance  of  the 
judiciary  has  increased,  leading  to  the  phenome¬ 
non  defined  as  “the  judicialization  of  politics.” 

The  Judiciary  in  Constitutional  States 

The  role  of  the  judiciary  in  a  political  system  can¬ 
not  be  analyzed  without  taking  into  consideration 
the  institutional  function  of  the  judge:  adjudica¬ 
tion.  Adjudication  is  a  type  of  dispute  resolution 
that  relies  on  an  externally  appointed  judge  acting 
as  the  third  party  and  where  the  parties  to  the 
dispute  must  comply  with  the  judge’s  decision, 
even  though  they  have  no  control  over  the  choice 
of  judge,  who  is  appointed  by  the  state.  Thus, 
judges  are  inherently  placed  in  a  difficult  position. 
They  must  resolve  cases  without  the  main  element 
that  makes  the  triad  an  effective  means  of  resolv¬ 
ing  disputes:  the  willingness  of  the  participants  to 
submit  to  both  the  proceedings  and  the  involve¬ 
ment  of  the  third  party.  To  address  this  weakness, 
the  judicial  process  tends  to  include  a  number  of 
principles  creating  the  appearance  of  and  reinforc¬ 
ing  judicial  impartiality.  More  specifically,  the 
need  to  guarantee  judicial  impartiality  implies 
that  judges  must  be  independent  from  the  parties 
in  dispute  and  protected  from  interference  by 
them.  Such  independence  is  a  necessary  condition 
as  any  judge  who  is  dependent  in  some  way  on 
one  of  the  parties  cannot  be,  and  cannot  appear  to 
be,  impartial. 

In  the  political  development  of  Europe,  the 
incorporation  of  judges  into  the  machinery  of  the 
state  and  the  superiority  of  government-appointed 
judges  over  other  types  of  judges — for  example, 
feudal  or  city  judges — have  largely  guaranteed 
judicial  independence  from  the  parties  in  dispute. 
However,  the  incorporation  of  the  judge  into  the 
state  organization  creates  the  need  to  redefine 
judicial  impartiality  when  one  of  the  parties  is  the 
state  itself  or  one  of  its  representatives.  Only  by 
defining  judicial  independence  in  relation  to  the 
state  can  the  judge  act  as  an  impartial  third  party 
in  disputes  between  the  state  and  citizens  (e.g.,  in 
criminal  trials).  Judges  can  then  become  an  effec¬ 
tive  check  on  the  way  in  which  public  functions 
are  performed  since  guarantees  of  independence 
allow  them  to  resolve  such  disputes  and  interpret 
the  relevant  laws  without  coming  under  pressure 
from  the  state.  Thus,  the  protection  of  judicial 


impartiality  through  strong  guarantees  of  judicial 
independence  has  become  one  of  the  most  impor¬ 
tant  traits  of  constitutionalism  since  one  of  its 
main  objectives  is  to  limit  the  arbitrary  exercise  of 
political  power  and  make  it  legally  accountable. 

The  strength  of  the  judiciary  is  influenced  by 
whether  it  is  in  a  civil  law  or  common  law  country. 
In  a  civil  law  country,  legislative  law  takes  prece¬ 
dence  over  case  law,  and  judges  are  bound  by  civil 
codes  without  regard  to  previous  judicial  deci¬ 
sions;  in  contrast,  judicial  rulings  take  precedence 
over  civil  law  (legislation)  in  common  law  coun¬ 
tries,  and  judges  are  bound  by  previous  rulings 
issued  by  higher-level  courts  (stare  decisis). 
Historically,  judges  in  civil  law  countries  have 
enjoyed  less  independence,  and  their  role  has 
tended  to  be  less  politically  significant.  Mistrust  of 
the  judiciary  has  always  been  high  and  judicial 
power  considered  an  important  power  to  be 
checked.  Traditionally,  continental  judges  tend  to 
act  in  a  subordinate  way  to  the  political  branches 
and  to  the  norms  they  enact.  This  reflects  a 
historical  interpretation  of  the  “separation  of 
powers”  principle  that  assigns  a  privileged  role  to 
the  legislature  since  it  represents  the  popular  will. 
Therefore,  for  a  long  period,  any  form  of  judicial 
review  of  legislation  was  ruled  out. 

The  situation  in  England  and  many  of  the  for¬ 
mer  British  colonies,  including  the  United  States, 
is  different.  In  England,  the  centralization  of 
political  authority  resulted  in  the  hegemony  of 
one  institution — Parliament — but  in  a  more  poly¬ 
centric  setting:  The  political  branches  do  not 
monopolize  the  creation  of  legal  norms.  As  a 
result,  English  judges  have  been  able  to  maintain 
some  autonomy  in  relation  to  parliamentary  stat¬ 
utes,  and  common  law  principles  developed  by 
judges  still  remain  one  of  the  basic  elements 
of  English  law.  In  the  United  States,  a  written 
constitution  combined  with  judicial  review  of 
legislation  has  ensured  from  the  outset  that  the 
judiciary  would  not  be  subordinate  to  the  politi¬ 
cal  branches. 

Therefore,  in  any  constitutional  state  whose 
main  objective  is  to  safeguard  the  rights  of  citizens, 
judicial  independence  is  primarily  aimed  at  guaran¬ 
teeing  and  supporting  judicial  impartiality  in  the 
adjudication  process.  As  a  consequence,  its  main 
point  of  reference  must  be  the  state  and  its  institu¬ 
tions,  particularly  the  executive,  which  directly  or 
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indirectly  is  most  often  a  party  to  such  adjudica¬ 
tion,  as  in  criminal  trials  where  one  of  the  parties 
is  usually  the  public  prosecutor.  With  the  introduc¬ 
tion  of  judicial  review  of  legislation,  the  legislature 
also  becomes  a  point  of  reference  for  judicial  inde¬ 
pendence.  The  judiciary  is  considered  a  power  on 
the  same  level  as  the  legislative  and  the  executive: 
It  becomes  a  veritable  third  branch  of  government, 
as  it  is  often  defined  in  the  United  States. 

This  picture  emerges  even  more  clearly  from  a 
comparison  of  the  status  of  judges  in  authoritarian 
states  with  that  of  judges  in  totalitarian  states.  As 
a  rule,  the  judiciary  in  authoritarian  regimes  plays 
a  minor  role  in  the  political  system.  Political 
repression  is  usually  entrusted  to  special  politically 
appointed  courts  or  dealt  with  directly  by  the 
police  or  other  security  forces.  The  ordinary  judi¬ 
ciary  is  only  marginally  involved  in  the  policies  of 
the  regime  and  usually  retains  a  modest  degree  of 
independence  (as  in  Spain  under  Franco  or  in 
Portugal  under  Salazar).  On  the  other  hand,  total¬ 
itarian  regimes  invariably  try  to  enlist  the  judiciary 
in  their  attempts  to  implement  deep  social  and 
political  changes.  In  these  regimes,  judicial  inde¬ 
pendence  is  extremely  low  if  not  nonexistent,  with 
judges  appointed  and  dismissed  at  the  pleasure  of 
the  regime.  These  judges  are  part  of  the  state 
apparatus,  and  the  judiciary  is  usually  a  strong 
hierarchical  organization  made  up  of  members  of 
the  totalitarian  party.  Judicial  elections,  when 
present,  only  ratify  the  choice  of  the  political  lead¬ 
ership.  Hence,  the  judges  are  politicized,  and  their 
values  necessarily  mirror  those  of  the  regime. 

Bureaucratic  and  Professional  Judiciaries 

All  democratic  constitutional  systems  protect  judi¬ 
cial  independence,  but  differences  emerge  when 
considering  the  status  that  judges  enjoy  in  practice. 
The  most  significant  elements  concern  appoint¬ 
ments,  salary,  transfers,  disciplinary  proceedings, 
and  career  patterns,  with  the  last  factor  being  the 
most  important  variable  characterizing  the  organi¬ 
zational  structure  of  the  judiciary.  All  of  them 
determine  the  position  of  individual  judges  in  rela¬ 
tion  to  their  colleagues  and  those  responsible  for 
decisions  affecting  their  professional  lives.  Taken 
as  a  whole,  these  elements  can  be  used  to  assess  the 
extent  of  both  internal  and  external  judicial  inde¬ 
pendence.  While  external  independence  refers  to 


the  relations  between  the  judiciary  and  other 
branches  of  government,  internal  independence 
focuses  on  guarantees  aimed  at  protecting  indi¬ 
vidual  judges  from  undue  pressures  from  within 
the  judiciary:  fellow  judges  and,  above  all,  superi¬ 
ors.  The  role  played  by  organizational  hierarchies 
is  crucial  for  understanding  the  internal  dynamics 
of  the  judiciary,  which  in  turn  affect  the  actual 
degree  of  judicial  autonomy. 

Broadly  speaking,  two  types  of  judiciary  can 
be  distinguished:  (1)  bureaucratic,  to  which  civil 
law  judiciaries  tend  to  belong,  and  (2)  profes¬ 
sional,  characteristic  of  common  law  judiciaries. 
This  is  obviously  an  “ideal-typical”  distinction 
since  actual  cases  are  more  complex.  But  judicia¬ 
ries  in  democratic  countries  can  be  placed  on  a 
continuum  defined  at  either  end  by  these  two 
types.  The  French  judiciary  has  traditionally  rep¬ 
resented  the  bureaucratic  model,  while  the  English 
judiciary  has  most  closely  reflected  the  profes¬ 
sional  model. 

The  defining  elements  of  a  bureaucratic  judi¬ 
ciary  include  the  following: 

1.  Selection  of  judges  is  made  on  a  technical  basis 
through  examinations  at  a  young  age,  usually 
immediately  after  university,  with  little  or  no 
emphasis  placed  on  candidates’  previous 
professional  experience. 

2.  Training  takes  place  primarily  within  the 
judiciary. 

3.  A  hierarchy  of  ranks  determines  organizational 
roles.  Advancement  up  the  career  ladder  is 
competitive,  and  promotions  are  granted 
according  to  formal  criteria  combining  seniority 
and  merit.  Flierarchical  superiors  have  wide 
discretion  in  determining  merit. 

4.  Judges  are  supposed  to  be  capable  of 
performing  all  organizational  roles  associated 
with  their  rank  (e.g.,  to  be  able  to  adjudicate 
criminal,  bankruptcy,  family  law,  and  fiscal 
cases,  or  to  act  as  a  public  prosecutor).  They  are 
therefore  recruited  not  for  a  specific  position 
but  for  a  wide  set  of  roles  and,  in  the  course  of 
their  careers,  will  tend  to  change  jobs  often. 

This  makes  guarantees  of  independence  more 
problematic  because  of  the  influence 
hierarchical  superiors  (or,  in  some  cases,  the 
government  itself)  have  over  these  moves. 
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5.  Judicial  guarantees  of  independence  tend  to  be 
weaker,  especially  because  judges  tend  to  enjoy 
a  lower  degree  of  internal  independence. 

On  the  other  hand,  the  following  are  the  defining 
elements  of  a  professional  judiciary: 

1.  Judges  are  appointed  only  after  having  acquired 
professional  legal  experience.  In  some  instances, 
this  experience  is  taken  into  account  in 
recruitment  for  specific  judicial  positions. 

2.  This  experience  usually,  but  not  always, 
involves  being  a  legal  advocate.  In  the  United 
States,  for  example,  legal  academics  are  often 
appointed  to  the  federal  bench. 

3.  There  are  no  formal  provisions  for 
advancement,  although  higher  ranking  judges 
(especially  in  England)  often  exert  some 
influence  in  both  the  initial  appointment  process 
and  the  promotion  of  judges  from  the  lower 
ranks. 

4.  Judges  are  recruited  for  specific  positions. 
Promotions  are  not  widespread,  and  on  the 
whole,  there  are  much  weaker  internal  controls 
over  judges  by  their  higher  ranking  colleagues. 

5.  Stronger  guarantees  of  both  internal  and 
external  judicial  independence  exist. 

Both  civil  and  common  law  judiciaries  have 
checks  to  ensure  that  their  members  pursue  institu¬ 
tional  goals,  but  the  approaches  are  different.  Since 
Anglo-American  judiciaries  tend  to  employ  indi¬ 
viduals  with  lengthy  legal  experience  outside  the 
judiciary,  there  is  less  emphasis  on  internal  con¬ 
trols.  In  contrast,  because  continental  judges  are 
recruited  without  significant  professional  experi¬ 
ence,  young  judges  are  placed  at  the  bottom  of  the 
judicial  pyramid,  and  their  organizational  social¬ 
ization  is  constantly  monitored  through  a  career 
based  on  moving  up  a  hierarchical  ladder.  The 
organizational  setup  also  affects  the  “reference 
group”  of  judges — those  individuals  and  groups 
judges  take  into  account  when  reaching  a  decision. 
In  bureaucratic  judiciaries,  the  reference  group  lies 
mainly  inside  the  judiciary,  where  judges  tend  to 
be  professionally  socialized.  The  hierarchical  struc¬ 
ture  enables  senior  judges  to  influence  the  behav¬ 
ior  of  lower  ranking  judges  since  they  control 


promotions,  transfers,  and  discipline.  In  profes¬ 
sional  or  common  law  judiciaries,  a  similar  type  of 
hierarchy  does  not  exist,  and  reference  groups  tend 
to  lie  outside  the  judiciary.  However,  there  is  a 
difference  between  the  English  judiciary,  which 
traditionally  had  a  small  professional  reference 
group  (the  Bar),  and  the  American  judiciary,  which 
has  a  much  more  diverse  composition  and  a 
recruitment  process  that  incorporates  different 
types  of  professional  and  political  influences. 

Changes  in  the  Judicial  Organization 

Since  the  mid-20th  century,  significant  changes 
have  characterized  the  judiciaries  in  several  coun¬ 
tries.  In  England,  in  2006,  the  institution  of  a 
Judicial  Appointing  Commission — an  independent 
body  in  charge  of  proposing  appointments  to  the 
Lord  Chancellor — has  circumscribed  the  tradi¬ 
tional  powers  of  the  executive.  However,  the  most 
important  change  has  been  the  creation  of  judicial 
councils  in  several  civil  law  countries  since  it  has 
considerably  increased  the  political  significance  of 
the  judiciary.  Judicial  councils  are  collegiate  bod¬ 
ies — composed  of  judges  and  lay  members — in 
charge  of  administering  the  status  of  judges.  Their 
impact  has  been  a  more  or  less  radical  alteration  of 
bureaucratic  judiciaries  by  strengthening  judicial 
independence  and,  at  the  same  time,  fostering  new 
connections  with  the  political  system.  In  this  pro¬ 
cess,  the  powers  and  composition  of  these  bodies 
are  critical  factors.  The  more  extensive  their  func¬ 
tions,  the  stronger  their  role  will  be,  and  judicial 
independence  tends  to  be  stronger  when  there  is  a 
higher  ratio  of  members  chosen  directly  by  and 
from  the  judiciary. 

One  of  the  main  consequences  of  creating  judi¬ 
cial  councils  is  to  increase  the  external  indepen¬ 
dence  of  the  judiciary  by  decreasing  the  traditional 
power  of  the  executive.  But  since  no  judicial  coun¬ 
cil  is  composed  solely  of  judges,  an  important  role 
remains  for  the  institution  in  charge  of  appointing 
the  nonjudicial  members.  This  is  usually  assigned 
to  parliament,  which  allows  political  parties  to 
bypass  the  minister  of  justice  and  influence  the 
judiciary  directly.  The  creation  of  a  judicial  council 
also  has  consequences  for  the  internal  indepen¬ 
dence  of  the  judiciary.  Entrusting  the  promotion 
and  appointment  of  judges  to  a  collegial  body 
where  normally  all  judicial  ranks  are  represented 
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contradicts  the  traditional  hierarchical  principle, 
whereby  only  higher  ranking  judges  are  entitled  to 
evaluate  lower  ranking  colleagues.  In  this  way,  the 
lower  ranks  acquire  a  new  power  since  they  can 
participate  in  the  process  of  choosing  higher  rank¬ 
ing  judges.  As  a  result,  challenges  to  the  very  idea 
of  a  judicial  career  by  the  lower  ranks  have  often 
been  successful.  It  is  not  coincidental  that  in  coun¬ 
tries  with  judicial  councils,  the  number  of  judicial 
ranks  has  been  reduced,  and  the  influence  of  senior 
judges’  assessments  of  lower  ranking  judges  seems 
to  be  declining. 

The  erosion  of  hierarchical  links  is  particularly 
relevant  to  the  general  expansion  of  judicial  power. 
With  the  creation  of  judicial  councils,  the  reference 
group  of  judges  has  become  more  varied.  Traditional 
members  of  the  reference  group,  such  as  legal 
academics  and  senior  judges,  have  decreased  in 
importance  since  they  no  longer  enjoy  a  monopoly 
over  evaluations  for  judicial  promotion.  Thus,  the 
professional  criteria  of  the  judiciary  have  also 
begun  to  shift:  Technical  legal  knowledge  (and 
ideological  conformity)  is  no  longer  the  determina¬ 
tive  element  in  promotions.  Views  of  others  outside 
the  judicial  system  (e.g.,  political  parties  in  parlia¬ 
ment  and  also  unions  and  interest  groups)  have 
gained  in  importance.  Similarly,  the  interests  of  the 
media  and  the  judiciary  increasingly  overlap,  as 
judicial  actions  provide  the  media  with  news.  In 
return,  the  media  are  able  to  support  and  publicize 
the  actions  of  judges  (and  prosecutors).  Inside  the 
judiciary  itself,  judicial  councils  have  increased  the 
role  of  judicial  associations  since  they  organize  the 
electoral  participation  of  judges.  On  the  other 
hand,  since  the  council  is  also  composed  of  political 
appointees,  their  point  of  view  also  has  to  be  taken 
into  account.  In  fact,  as  judicial  actions  gain  politi¬ 
cal  significance,  the  council  may  become  the  main 
institution  where  the  judiciary’s  elected  representa¬ 
tives  can  meet  political  representatives.  As  the  expe¬ 
rience  of  Latin  European  countries  suggests,  the 
creation  of  judicial  councils  is  capable  of  producing 
a  radical  change  in  the  judiciary’s  traditional 
setting;  this  in  turn  can  diversify  the  judiciary’s  ref¬ 
erence  group,  which  is  becoming  more  horizontal 
and,  at  least  in  part,  placed  outside  the  judiciary.  As 
a  result,  more  activist  conceptions  of  the  judicial 
role  tend  to  prevail. 

The  expansion  of  judicial  guarantees  of  inde¬ 
pendence  has  also  involved  public  prosecutors. 


This  process  has  been  stronger  in  those  countries 
in  which  judges  and  prosecutors  form  a  single  pro¬ 
fessional  group.  In  Italy,  prosecutors  enjoy  the 
same  guarantees  as  judges  and,  together  with 
judges,  elect  the  majority  of  the  members  of  the 
judicial  council.  Their  autonomy  is  extremely  high: 
The  executive  cannot  in  any  way  issue  instructions 
to  them.  Also,  in  France,  where  the  ministry  of 
justice  has  been  able,  at  least  so  far,  to  keep  most 
of  its  traditional  powers,  the  autonomy  of  public 
prosecutors  is  growing  and  the  executive  has  often 
been  unable  to  exert  all  its  institutional  powers  on 
public  prosecution. 

While  these  changes  have  affected  the  position 
of  the  judiciary  in  several  states  of  Latin  and 
Eastern  Europe,  other  countries  have  seen  little 
alteration  of  their  judicial  setting.  In  Germany, 
judges  and  prosecutors  remain  organized  along 
hierarchical  lines  that  allow  the  minister  of  justice, 
higher  ranking  judges,  and  senior  prosecutors  to 
influence  their  careers.  Although  the  1949  Basic 
Law  introduced  judicial  review — entrusted  to  a 
politically  appointed  Constitutional  Court — judges 
still  regard  their  relatively  passive  role  as  a  profes¬ 
sional  barrier  against  possible  political  interference 
and  carefully  protect  their  reputation  as  guardians 
of  the  law.  The  role  of  judicial  councils  and  judi¬ 
cial  associations  remains  limited,  at  least  in  com¬ 
parison  with  other  countries. 

The  Judiciary  in  the  Political  System 

In  the  second  part  of  the  20th  century,  a  trend 
toward  a  general  expansion  of  judicial  power  can 
be  singled  out  in  most  democratic  regimes,  a  devel¬ 
opment  often  described  as  the  judicialization  of 
politics.  The  rise  of  judicial  power  has  involved 
both  the  civil  and  the  common  law  worlds,  although 
it  has  been  stronger  in  some  countries  than  in  oth¬ 
ers.  The  reasons  behind  these  developments  are 
several.  First  of  all,  the  antiauthoritarian  backlash 
has  supported  a  new  constitutionalism  with  the 
institution  of  forms  of  judicial  review  of  legislation 
as  well  as  the  strengthening  of  guarantees  of  judi¬ 
cial — and  prosecutorial — independence.  In  this 
process,  the  supranational  dimension  of  constitu¬ 
tionalism  has  to  be  taken  into  account:  For  instance, 
the  increased  activism  of  national  judiciaries  has 
often  found  support  in  the  European  Court  of 
Justice  and  the  European  Court  of  Human  Rights. 
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An  additional  factor  of  judicialization  can  be 
found  in  the  growth  of  welfare  policies.  The  con¬ 
sequent  proliferation  of  legislation  increases  judi¬ 
cial  significance:  Where  a  legal  rule  exists,  sooner 
or  later  a  judge  will  be  asked  to  interpret  and  apply 
it.  Gaps  remain  despite,  and  perhaps  because  of, 
the  growing  number  of  laws,  and  judges  are  called 
to  fill  them.  Another  supporting  element  has  been 
changes  in  the  legal  culture.  A  new  emphasis  on 
citizens’  rights  and  entitlements  and  the  emergence 
of  an  instrumental  approach  to  law — understood 
as  a  tool  for  advancing  individual  or  collective 
goals — have  become  common  traits  of  contempo¬ 
rary  societies.  Today,  there  is  virtually  no  area  of 
social  life  immune  from  public  regulation,  and 
thus,  no  area  can  be  excluded  from  judicial  inter¬ 
vention.  Thus,  the  demand  for  individual  and  col¬ 
lective  rights,  fueled  by  the  development  of  both 
constitutionalism  and  welfare  policies,  has  meant 
that  individuals  and  groups  increasingly  seek  out 
the  judicial  system  with  the  aim  of  obtaining  an 
authoritative  decision  in  their  favor. 

However,  there  are  significant  cross-national 
differences  in  the  political  prominence  taken  by  the 
judiciary.  Bureaucratic  judiciaries,  when  freed 
from  hierarchical  and  executive  influence,  have 
seen  a  significant  increase  of  their  role  in  the 
political  process.  As  for  the  political  context, 
fragmentation  tends  to  support  judicialization.  In 
this  case,  the  lower  decisional  effectiveness  of 
political  structures  leads  to  interests  having  an 
incentive  in  putting  pressure  elsewhere,  for 
instance,  on  courts.  On  the  other  hand,  for  frag¬ 
mented  political  institutions,  it  is  more  difficult  to 
confront  judicialization — that  is,  to  assemble  the 
political  majorities  in  order  to  curb  judicial 
power,  for  example,  by  overruling  unwelcome 
judicial  decisions,  reducing  judicial  independence, 
or  appointing  accommodating  judges. 

Political  fragmentation  is  obviously  related  to 
a  corresponding  institutional  setting.  Strong  sepa¬ 
ration  of  powers,  making  easier  the  advent  of 
divided  government,  is  a  case  in  point,  as  is  true 
bicameralism — leading  to  disalignments  between 
the  two  chambers — or  a  proportional  electoral 
law,  making  a  multiparty  governing  coalition  more 
likely.  Also  federalism,  by  pitting  the  central  gov¬ 
ernment  against  the  states,  supports  fragmenta¬ 
tion.  In  turn,  a  fragmented  setting  is  likely  to  be  the 
by-product  of  a  political  transition  in  which  no 


actor  is  able  to  impose  its  preferences,  trust  tends 
to  be  low,  and  the  uncertainty  about  the  future  is 
high.  In  this  case,  judicial  power  offers  an  insur¬ 
ance  policy  for  prospective  losers  in  the  electoral 
arena. 

The  judicialization  of  politics  has  given  new  life 
to  the  traditional  debate  on  the  democratic  legiti¬ 
macy  of  an  independent  judiciary.  The  increasing 
role  of  independent  judges  in  the  policy  process 
has  been  criticized.  Some  critics  argue  that  to  the 
extent  to  which  judges  are  no  longer  constrained 
by  law,  an  irresponsible  policymaker  seems  to 
emerge.  To  this  view,  a  different  interpretation  of 
the  judicial  role  has  been  opposed:  In  a  constitu¬ 
tional  state,  judges  are  bound  to  follow  the  consti¬ 
tution  rather  than  statutes.  Sometimes,  it  has  also 
been  argued  that  judges  have  no  discretion  since 
there  is  always  one  right  answer  to  the  case  they 
have  to  decide.  In  fact,  discretion  seems  to  be  an 
unavoidable  trait  of  judicial  decision  making.  So 
political  power  results  inevitably  from  increased 
judicial  independence.  However,  the  expansion  of 
judicial  power  is  also  the  by-product  of  specific 
decisions  taken  by  democratically  responsible 
bodies.  The  fact  that  judicial  decisions  can  go 
against  the  will  of  political  majorities  is  inherent 
in  the  institutional  template  of  constitutionalism: 
It  is  an  inevitable  price  to  pay.  However,  it  seems 
unlikely  that  the  judiciary  will  remain  out  of  step 
for  long  with  the  political  branches.  As  long  as 
courts  become  politically  significant,  politicians 
become  interested  in  exerting  influence  on  them 
through  judicial  appointments  (especially  in  the 
highest  courts),  through  jurisdictional  reforms 
(e.g.,  removing  politically  significant  cases  from 
courts),  or  by  influencing  judicial  careers — 
through  the  powers  of  the  ministry  of  justice  or  of 
a  politically  influenced  judicial  council.  However, 
although  it  is  not  certain  whether  long-term  polit¬ 
ical  influence  on  judges  is  likely  to  succeed,  in  the 
short  run,  the  political  role  of  the  judiciary  has 
become — and  is  likely  to  remain — significant. 
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Justice 


Justice  constitutes  a  normative  cornerstone  of 
political  society.  Classical  and  contemporary  phi¬ 
losophers,  from  Plato  and  Aristotle  to  John 
Rawls,  have  seen  it  is  as  the  main  guiding  princi¬ 
ple  for  regulating  political  power.  They  disagree 
about  what  counts  as  justice,  but  they  agree  that 
justice  counts  for  much  in  shaping,  maintaining, 
and  improving  political  order.  The  asymmetry  of 
wide  agreement  on  its  importance  and  deep 
disagreement  on  its  content,  however,  creates 
difficulties  with  the  idea  of  justice.  To  restore  the 
credibility  of  this  idea,  its  complicated  structure 
must  be  elucidated.  This  entry  attempts  to  provide 
such  an  elucidation  by  exploring  the  competing 
ideas  of  justice,  comparing  the  various  philosoph¬ 
ical  grounds  that  have  been  advanced  for  the 
universalization  of  justice,  and  analyzing  the 
legitimation  of  political  decisions  in  society. 


Is  Justice  an  Empty  Idea? 

The  deep  and  persistent  disputes  over  what  counts 
as  justice  have  generated  skepticism  about  the 
philosophical  significance  of  the  agreement  that 
justice  counts.  Value  relativism,  which  applies  to 
values  in  general,  including  justice,  is  a  part  of  this 
skepticism  but  not  the  whole  of  it.  The  distinctive 
skepticism  that  is  evoked  toward  justice  by  the 
aforementioned  asymmetry  of  its  widely  acknowl¬ 
edged  importance  and  deeply  disputed  content  is 
the  suspicion  that  justice  is  an  empty  concept. 
Those  who  harbor  this  suspicion  hold  that  the  end¬ 
less  controversy  about  the  nature  of  justice  exists 
not  because  it  is  an  unfathomably  profound  idea 
but  because  it  is  an  empty  magical  formula  that  we 
can  fill  with  whatever  content  we  may  think  useful 
for  rationalizing  our  claims  to  what  we  want  and 
dismissing  the  competing  claims  made  by  others. 
Hans  Kelsen,  a  representative  positivist  legal  phi¬ 
losopher  in  the  20th  century,  is  a  notable  example 
of  such  critics  of  justice. 

To  dispel  this  suspicion,  attempts  have  been 
made  to  identify  the  common  conceptual  core  of 
justice.  The  starting  point  is  the  distinction  between 
the  concept  of  justice  and  conceptions  of  justice. 
The  concept  of  justice  represents  the  meaning  of 
justice ,  or  what  we  mean  by  saying  about  some¬ 
thing  that  “it  is  just.”  Conceptions  of  justice  spec¬ 
ify  the  criteria  by  which  we  can  judge  whether  and 
when  the  adjective  just  is  applicable  to  something. 
Each  conception  constructs  a  theory  to  defend  its 
own  distinctive  criterion  (or  set  of  criteria)  against 
other  competing  conceptions,  which  expound  dif¬ 
ferent  criteria  (or  different  sets  of  criteria).  For 
example,  utilitarianism  and  individual-rights  theo¬ 
ries  are  two  competing  groups  of  conceptions  of 
justice,  each  with  its  own  competing  varieties,  such 
as  direct  and  indirect  utilitarianism  and  libertar¬ 
ian-  and  egalitarian-rights  theories,  respectively. 
The  chief  purpose  of  this  distinction  is  to  stress  the 
point  that  there  can  be  genuine  conflicts  between 
competing  conceptions  of  justice  because  and  only 
if  they  present  different  (sets  of)  criteria  for  the 
same  concept  of  justice.  If  they  were  expounding 
different  (sets  of)  criteria  for  different  concepts, 
they  could  not  be  conflicting  but  only  at  cross¬ 
purposes  with  each  other. 

This  point  can  be  supported  by  the  general 
semantic  thesis  that  criterial  conflicts  about  a 
word  are  possible  only  if  the  same  meaning  is 
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attributed  to  it  as  well  as  by  the  hermeneutical 
thesis  that  interpretive  conflicts  are  possible  only 
if  the  same  concept  is  interpreted  in  different 
ways.  Whether  the  semantic  and  hermeneutic  the¬ 
ses  are  generally  applicable  or  not,  however,  the 
aforementioned  point  is  hard  to  deny  as  far  as 
justice  is  concerned.  For  example,  there  is  a  genu¬ 
ine  conflict  between  talio,  the  ancient  conception 
of  retributive  justice  that  stipulates  “an  eye  for  an 
eye,”  and  the  modern  utilitarian  conception  of 
corrective  justice,  which  approves  of  less  (or 
more)  harmful  punishments  than  the  harms 
inflicted  on  victims  by  wrongdoers  if  such  punish¬ 
ments  are  sufficient  (or  necessary)  to  bring  about 
the  optimum  deterrence  effect,  which  is  calculated 
taking  punishment  costs  into  consideration.  But 
suppose  that  while  an  advocate  of  lex  talionis  and 
a  utilitarian  were  having  a  debate  with  each  other, 
a  devout  Christian  moralist  said  to  both  of  them, 
“If  someone  slaps  you  on  the  cheek,  turn  your 
other  cheek  to  him.”  Has  the  Christian  joined  in 
the  debate,  adding  a  new  view  on  the  issue  in 
question?  Certainly  she  has  not.  Rather,  she  has 
changed  the  subject  matter.  Her  standpoint  is  at 
cross-purposes  with  those  of  the  to/zo-retributivist 
and  the  utilitarian.  This  is  because  her  criterion 
for  the  appropriate  response  to  the  other  person’s 
wrongdoing  is  one  not  for  the  concept  of  justice 
but  for  a  different  concept,  love.  What  she  has 
propounded  is  a  Christian  conception  of  love,  not 
a  conception  of  justice  that  competes  with  the 
other  two.  By  changing  the  subject  matter  to  love, 
she  has  gone  beyond  the  conceptual  space  of  jus¬ 
tice.  There  is  a  genuine  conflict  between  talio- 
retributivism  and  utilitarianism  just  because  they 
belong  to  the  same  conceptual  space  of  justice. 
This  shows  that  justice  has  a  conceptual  core  that 
delineates  the  range  of  competing  conceptions  of 
justice  and  makes  their  rivalry  possible. 

It  is  one  thing  to  show  that  there  must  be  some 
conceptual  content  that  the  conflicting  concep¬ 
tions  of  justice  have  in  common.  It  is  quite  another 
to  be  able  to  tell  what  it  is.  How  is  it  possible  to 
identify  the  normative  content  of  the  concept  of 
justice  that  underlies  the  conflicting  conceptions 
of  justice  as  their  common  matrix?  There  are  two 
methodological  approaches  to  this  problem,  each 
resulting  in  a  distinctive  view  on  the  common  nor¬ 
mative  content  of  the  concept  of  justice.  They  may 
be  called  the  inductive  and  negative  approaches. 


The  following  two  sections  provide  an  overview 
and  comparison  of  the  two  approaches. 

The  Inductive  Approach:  Justice  as 
the  Demand  for  Regularization 

A  paradigmatic  form  of  the  inductive  approach 
can  be  found  in  the  work  of  Chaim  Perelman,  a 
Belgian  legal  philosopher  noted  for  his  theory  of 
rhetoric,  who  pioneered  the  development  of  philo¬ 
sophical  awareness  of  the  distinction  between  the 
concept  and  conceptions  of  justice  in  his  article 
“De  la  justice ,”  published  in  1945. 

According  to  this  approach,  the  first  step  is  to 
collect,  as  data  for  theory  building,  a  variety  of 
specific  formulas  of  justice  that  were  set  forth  by 
different  political  ideologies.  Among  them  are  “To 
each  according  to  his  merits,”  “To  each  according 
to  his  works,”  “To  each  according  to  his  needs,” 
“To  each  according  to  his  rank,”  “To  each  accord¬ 
ing  to  his  entitlements,”  and  so  on.  The  second 
step  is  to  abstract  (or  “induce”)  the  common  for¬ 
mal  structure  from  them.  All  the  specific  justice 
formulas  share  the  form:  To  (or  from)  each 
according  to  x.  Different  conceptions  of  justice 
expressed  in  different  justice  formulas  substitute 
different  values  for  the  variable  x,  which  specify 
what  type  of  attributes  or  situations  of  agents  are 
relevant  to  the  distribution  of  benefits  and  assign¬ 
ment  of  burdens  and  responsibilities  to  them.  The 
constant  part  of  the  common  form,  especially 
“according  to,”  symbolizes  the  common  concept 
of  justice,  which  Perelman  calls  the  formal  idea  of 
justice.  It  requires  us  to  treat  all  agents  not  in  an  ad 
hoc  way  but  in  such  a  rule-governed  way  that  the 
same  treatment  should  be  given  to  all  agents  if 
their  attributes  or  situations  belong  to  the  same 
relevant  type.  The  demand  for  regularization 
(treatment  according  to  type  features)  is  the  core  of 
the  common  concept  of  justice  that  the  inductive 
approach  identifies  with  the  formal  idea  of  justice, 
presented  as  its  classic  formulation,  “Treat  like 
cases  alike,”  called  the  Justinian  principle  because 
of  its  roots  in  the  Code  ( Corpus  Juris  Civilis ) 
issued  by  the  Roman  emperor  Justinian. 

The  formal  idea  of  justice  in  the  above  sense  is 
not  Perelman’s  invention  but  a  logical  reconstruc¬ 
tion  of  a  widely  shared  understanding  of  the 
Justinian  principle  as  a  formulation  of  the  common 
concept  of  justice.  It  seems  to  be  free  from  the  con¬ 
troversy  that  pervades  competing  conceptions  of 
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justice,  but  this  is  because  it  does  not  give  us  any 
substantive  guide  for  judging  what  norms  are  rele¬ 
vant  to  questions  of  justice.  Consequently,  positivist 
philosophers  such  as  Hans  Kelsen  and  Alf  Ross 
criticized  it  as  an  empty  formula  that  hides  its  emp¬ 
tiness.  This  criticism  is  not  wholly  fair.  The  formal 
idea  of  justice  implies  an  important  constraint  on 
politics:  Political  power  must  be  exercised  in  a 
rule-governed  way  that  assures  its  subjects  of  its 
predictability.  This  constraint  can  hardly  be 
rejected  as  trivial  because  without  it,  people  would 
be  unable  to  seek  security  by  adjusting  their  con¬ 
duct  on  the  basis  of  prudential  calculation  of  the 
interference  of  political  power  in  their  lives.  The 
importance  of  the  rule-governed  exercise  of  power 
is  fully  appreciated  and  emphasized  by  Lon  Fuller 
in  his  conception  of  the  rule  of  law  as  the  internal 
morality  of  law,  which  he  derived  from  his  concept 
of  law  as  the  enterprise  of  subjecting  human  con¬ 
duct  to  the  governance  of  rules. 

Although  there  is  some  basis  for  regarding  the 
formal  idea  of  justice  as  empty,  it  nevertheless  has 
some  element  of  truth.  The  formal  idea  of  justice  as 
a  demand  for  regularization  is  too  thin  to  capture 
the  normative  aspirations  underlying  the  concept 
of  justice.  It  does  not  provide  a  guide  for  distin¬ 
guishing  between  the  permissible  and  nonpermis- 
sible  kinds  of  discrimination  to  which  justice 
cannot  be  indifferent. 

The  Negative  Approach:  Justice 
as  the  Demand  for  Universalization 

The  negative  approach  looks  to  our  intuition 
about  what  is  unjust  for  a  key  to  answering  the 
question  “What  is  justice?”  In  his  Nicomachean 
Ethics ,  for  example,  Aristotle  recommended  that  if 
we  want  to  understand  what  justice  is,  we  should 
begin  with  the  analysis  of  mzjust  people.  This 
approach  has  had  a  persistent  hold  over  many  later 
thinkers  who  do  not  necessarily  share  Aristotle’s 
answer  to  the  question  of  what  is  justice.  At  least 
three  reasons  underlie  the  appeal  of  Aristotle’s 
recommendation. 

First,  our  sense  of  justice  is  more  strongly 
aroused  and  more  sharply  focused  when  we  react 
against  the  actual  injustices  that  we  experience 
than  when  we  think  abstractly  about  what  justice 
is.  We  express  our  deep-felt  sense  of  (in)justice 
when  we  exclaim,  “That’s  not  fair!”  Even  a  rela¬ 
tivist  philosopher  who  cynically  asserts  that  there 


is  no  right  answer  to  the  question  of  what  justice  is 
would  vehemently  object  to  an  injustice  done  to 
him  (e.g.,  another  author’s  plagiarism  of  his  work), 
thereby  betraying  the  presence  of  a  sense  of  justice. 
Our  sense  of  justice  is  basically  revealed  and  exer¬ 
cised  as  our  sense  of  injustice.  Second,  injustices 
invite  our  indignation  not  only  when  they  are  done 
to  us  or  our  family  and  friends  but  also  when  we 
see  them  inflicted  on  perfect  strangers.  Although 
the  discourse  on  justice  can  be  manipulated  to 
rationalize  our  self-interest,  our  sense  of  injustice 
seems  to  be  propelled  more  by  our  sense  of  fairness 
to  others  than  by  self-interested  motivation.  Third, 
and  most  important  in  the  present  context,  there 
are  examples  of  injustices  such  as  killing  and  tor¬ 
turing  innocent  people  that  are  commonly  con¬ 
demned  as  unjust  by  those  who  hold  competing 
conceptions  of  justice.  Our  judgments  seem  to  be 
more  convergent  when  we  react  against  injustices 
than  when  we  try  to  define  what  is  just. 

If  the  negative  approach  is  to  be  taken  to  the 
task  of  identifying  the  common  concept  of  justice, 
the  question  to  be  asked  is  this:  What  is  the  com¬ 
mon  feature  of  the  conduct  that  competing  con¬ 
ceptions  of  justice  concur  in  rejecting  as  unjust? 
The  natural  answer  would  be  that  it  is  egoism.  No 
conception  of  justice  can  defend  egoism  as  such. 
To  be  sure,  there  may  be  a  conception  of  justice 
based  on  Bernard  Mandeville’s  doctrine  of  “pri¬ 
vate  vices”  as  “public  virtues,”  which  approves  of 
egoism  on  the  ground  that  it  brings  about  the  con¬ 
sequence  that  serves  public  interest.  Likewise, 
utilitarians  take  the  self-interested  motives  of  indi¬ 
viduals  into  account  when  they  judge  which  insti¬ 
tutional  design  would  bring  about  the  maximum 
utility.  But  even  these  conceptions  of  justice 
exclude  egoism  from  their  evaluative  grounds  and 
rely  on  some  conception  of  public  interest.  Actually, 
utilitarianism  does  not  hesitate  to  require  severe 
self-sacrifice  from  individuals  if  such  a  measure  is 
necessary  to  maximize  the  aggregate  utility  of  the 
whole  society. 

It  must  be  noted  with  haste  here,  however,  that 
egoism  in  the  sense  relevant  to  this  context  is  not 
simply  the  pursuit  of  self-interest;  justice  may  well 
be  said  to  constitute  fair  rules  of  the  game  among 
agents  seeking  their  own  interests.  The  egoism  that 
justice  excludes  is  rather  the  lack  of  disinterested 
impartial  treatment  of  the  other  agents — that  is,  my 
self-centered  discrimination  against  the  other  per¬ 
son,  which  is  ultimately  reduced  to  my  claim  that 
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my  interests  outweigh  the  other’s  interests  because 
my  interests  are  mine  while  the  other  person’s  inter¬ 
ests  are  not.  This  means  that  the  core  of  the  concept 
of  justice  revealed  by  our  shared  sense  of  injustice 
is  the  prohibition  of  nonuniversalizable  discrimina¬ 
tion  against  others,  which  ultimately  depends  only 
on  the  difference  in  individual  identity  between  me 
and  the  other  person.  In  other  words,  our  sense  of 
injustice  is  rooted  in  the  notion  that  if  one  is  treated 
differently  from  someone  else,  there  must  be  a  mor¬ 
ally  relevant  reason  for  the  difference. 

The  Justinian  principle  that  like  cases  should  be 
treated  alike,  presented  above  as  a  classic  formula¬ 
tion  of  justice  as  the  demand  for  regularization,  can 
also  be  interpreted  as  the  demand  for  universaliza¬ 
tion.  But  it  is  necessary  to  emphasize  here  that 
demands  for  regularization  and  universalization  are 
not  equivalent.  The  demand  for  universalization  is 
normatively  stronger  than  the  demand  for  regular¬ 
ization  because  it  rejects  as  unjust  those  rules  of 
type-differentiated  treatment  that  are  not  justifiable 
in  universalizable  terms. 

For  example,  the  practices  that  involve  free  rid¬ 
ing,  vested  interests ,  and  collective  egoism  are 
rejected  as  unjust  even  if  they  can  be  set  up  to  be 
following  some  rules  for  type  differentiation.  This 
is  because  they  privilege  some  particular  agent  and 
discriminate  against  the  others  in  a  nonuniversaliz¬ 
able  way.  Free  riding  is  unjust  in  that  a  free  rider 
refuses  to  share  with  others  the  cost  of  maintain¬ 
ing  a  public  resource  benefiting  all  of  them,  simply 
because  it  is  he,  not  the  others,  who  is  thereby 
advantaged.  Vested  interests  are  seen  as  unjust 
because  they,  too,  privilege  a  particular  agent  and 
discriminate  against  others  simply  on  the  basis  of 
one  agent’s  personal  stake  in  a  given  case.  Collective 
egoism  gives  precedence  to  the  collective  interest 
over  individual  interests  and  thus  seeks  pseudo¬ 
public  interests,  but  it  is  unjust  because  collective 
egoists  seek  their  own  special  group  interests  at  the 
expense  of  the  wider  society,  even  when  there  is  no 
difference  between  their  situations  and  those  of 
the  outsiders  except  for  the  fact  that  the  advan¬ 
taged  group  is  theirs  while  the  disadvantaged  ones 
are  not. 

The  universalization  criterion  allows  us  to  judge 
social  discrimination  based  on  general  classifica¬ 
tions  such  as  race  and  gender  to  be  unjust  even 
when  they  are  just  according  to  the  regularization 
criterion.  This  is  so  because  what  universalization 
requires  is  not  only  that  the  discriminative  practices 


should  follow  some  general  patterns  but  also  that 
they  should  be  justified  on  universalizable  grounds. 
Claims  that  are  unjust  according  to  the  universaliz¬ 
able  criterion  are  often  based  on  incorrect  empiri¬ 
cal  assumptions.  For  example,  a  manager  may 
discriminate  against  female  employees  in  promo¬ 
tions,  believing  that  women  are  too  easily  swayed 
by  emotion  to  be  effective  managers.  Justifications 
for  discrimination  may  also  be  based  on  circular 
assumptions,  for  example,  about  the  kind  of 
behavior  to  be  preferred. 

Since  the  demand  for  universalization  offers  a 
fundamental  normative  test  for  eliminating  the 
privileges  and  discriminations  that  any  conception 
of  justice  is  bound  to  reject  as  unjust,  it  is  a  more 
appropriate  representation  of  the  common  concept 
of  justice  than  the  demand  for  regularization,  which 
is  too  formal  to  provide  such  a  test.  This  point  can 
be  supported  by  the  fact  that  many  philosophers 
and  thinkers  have  shown  their  appreciation  of  the 
basic  importance  of  the  demand  for  universaliza¬ 
tion,  though  they  have  given  different  theoretical 
expressions  and  scopes  to  this  demand.  There  are 
important  conceptions  of  justice  that  do  not  focus 
on  universalization;  for  example,  utilitarians  sug¬ 
gest  that  justice  ultimately  consists  in  maximizing 
aggregate  welfare,  while  libertarians  focus  on  the 
importance  of  limiting  state  authority  and  maxi¬ 
mizing  the  individual’s  freedom  from  coercion  and 
governmental  intervention.  However,  universaliza¬ 
tion  seems  to  have  a  central  role  in  our  common 
understanding  of  what  is  fair  and  unfair,  just  and 
unjust.  The  remainder  of  this  entry  examines  some 
notable  examples  of  the  analysis  of  the  concept  of 
justice  in  terms  of  universalization  and  then  explores 
the  implication  of  these  theories  for  public  morality 
and  political  legitimacy. 

Philosophical  Expressions  of 
the  Demand  for  Universalization 

The  General  Will  in  Rousseau 

In  the  history  of  political  thought,  Jean-Jacques 
Rousseau’s  doctrine  of  the  general  will,  advocated 
in  his  treatise  on  the  social  contract,  is  a  precursor 
of  the  development  of  the  concept  of  justice  as  the 
demand  for  universalization.  He  distinguishes  the 
general  will  not  only  from  the  particular  will  but 
also  from  the  will  of  all.  This  is  intelligible  only  if 
attention  is  focused  on  his  view  that  the  general 
will  is  general  with  reference  to  its  object.  By  this, 
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he  means  that  the  general  will  can  pursue  not  par¬ 
ticular  interests  but  only  common  interests.  Even 
the  will  of  all  is  a  particular  will  if  it  is  a  unani¬ 
mous  decision  that  all  parties  accept  not  as  a  result 
of  deliberation  about  the  common  interest  of  the 
whole  society  but  simply  out  of  consideration  for 
their  own  particular  interests.  But  how  can  we 
distinguish  the  pursuit  of  the  common  interest 
from  that  of  particular  interests?  Rousseau  gave  a 
revealing  answer  to  this  question  by  identifying  a 
test  to  make  this  distinction.  According  to  him,  our 
will  can  be  said  to  be  the  general  will  in  the  pursuit 
of  the  common  interest  only  if  it  meets  the  follow¬ 
ing  requirement:  “Each  necessarily  subjects  himself 
to  the  conditions  that  he  imposes  on  the  others.” 
He  holds  that  this  requirement  brings  about  what 
he  calls  “an  admirable  accord  of  justice  and  inter¬ 
est  that  gives  a  character  of  equity  to  common 
deliberations”  ( The  Social  Contract,  Book  2,  chap. 
4).  This  means  that  we  can  pursue  our  interests  in 
accordance  with  justice  only  if  we  do  not  exempt 
ourselves  from  the  burdens  and  costs  that  we 
require  the  others  to  bear  in  the  same  situations 
that  we  are  placed  in.  Here,  Rousseau  showed  an 
insight  into  the  conceptual  core  of  justice  as  the 
prohibition  of  nonuniversalizable  discrimination 
against  others. 

The  Categorical  Imperative  in  Kant 

Immanuel  Kant  restated  Rousseau’s  test  of  jus¬ 
tice  for  the  general  will  with  some  of  his  well- 
known  multiple  formulations  of  the  categorical 
imperative,  which  he  set  as  the  second-order 
requirement  that  our  first-order  practical  judg¬ 
ments  must  meet  if  they  are  to  be  moral  judgments. 
This  can  be  clearly  seen  in  his  first  formulation  of 
the  categorical  imperative:  “Act  only  according  to 
that  maxim  by  which  you  can  at  the  same  time  will 
that  it  should  become  a  universal  law.” 

What  Kant  meant  by  this  is  the  following:  You 
cannot  exempt  yourself  from  the  universal  laws 
that  are  applicable  to  everyone.  Likewise,  you  can¬ 
not  exempt  yourself  from  the  same  duties  that 
your  maxims  impose  on  others  if  they  are  to  have 
the  status  of  moral  judgments  that  transcend  pru¬ 
dential  and  technical  norms  (hypothetical  impera¬ 
tives)  that  merely  show  the  means  to  advance  your 
self-interest.  Here,  Kant  gave  an  emphatic  and 
solemn  (though  somewhat  abstruse)  expression  to 


the  demand  for  universalization  that  Rousseau 
presented  in  plain  and  casual  language  in  his  doc¬ 
trine  of  the  general  will. 

It  may  be  said,  however,  that  Kant  differs  from 
Rousseau  not  only  stylistically  but  also  substan¬ 
tially.  While  Rousseau  presented  the  demand  for 
universalization  as  the  demand  of  justice,  Kant 
made  it  the  fundamental  criterion  for  moral  action 
more  generally,  arguing  that  only  when  people  act 
on  the  basis  of  laws  that  they  can  rationally  will  as 
universal  laws  are  their  actions  morally  right.  If 
Kant’s  concept  of  morality  is  a  broader  and  more 
generic  one  than  the  concept  of  justice  as  a  distinc¬ 
tive  value  of  political  morality,  this  difference  is 
certainly  nonnegligible.  But  in  fact,  Kant  elabo¬ 
rates  the  categorical  imperative  in  a  way  that 
enables  it  to  be  a  foundation  of  political  morality, 
through  his  concept  of  the  kingdom  of  ends.  Kant 
holds  that  human  beings  belong  to  what  he  calls  a 
kingdom  of  ends — that  is,  an  idealized  association 
of  autonomous  and  rational  agents  who  treat  each 
other  not  merely  as  the  means  to  their  specific  ends 
but  as  “the  ends  themselves”  under  the  common 
law,  which  makes  their  exercise  of  rational  auton¬ 
omy  compatible.  So  it  is  not  impossible  to  interpret 
the  Kantian  concept  of  morality  to  be  coextensive 
with  justice  as  the  underlying  regulative  principle 
for  the  public  morality  of  political  society. 

It  is  useful  here  to  distinguish  contexts  where 
justice  is  central  from  those  in  which  it  is  out  of 
place  to  demand  universalization  in  nonpolitical, 
purely  personal  relationships  such  as  love.  In  such 
relationships,  we  are  not  only  allowed  but  are  even 
required  to  make  nonuniversalizable  discrimina¬ 
tions.  To  say,  “I  love  you  because  you  happen  to 
meet  better  than  anyone  else  the  right  kind  of 
qualification  that  anyone  must  meet  to  deserve  my 
love”  is  tantamount  to  saying,  “I  would  love 
another  person  if  he  met  that  qualification  better 
than  you.”  This  attitude  may  be  “fair,”  but  it 
would  surely  destroy  love.  What  a  genuine  lover  is 
supposed  to  say  is  “I  love  you  because  you  are  you, 
not  the  others.”  Love  requires  existential  commit¬ 
ment  to  the  loved  individual,  which  involves  non¬ 
universalizable  discrimination  against  the  others. 
The  demand  for  universalization  is  appropriate 
only  for  the  aspect  of  our  life  that  justice  is 
expected  to  govern,  namely,  our  political  relation¬ 
ships,  in  which  we  exercise  power  (governmental 
power  and  politically  authorized  private  power, 
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such  as  rights,  sheer  violence,  etc.)  over  others  and 
are  required  to  justify  our  exercise  of  power  to  the 
others  who  are  affected  by  it.  Rousseau  is  right  in 
applying  the  demand  for  universalization  to  the 
political  order  that  justice  inherent  in  the  general 
will  is  to  govern. 

The  Veil  of  Ignorance  in  Rawls 

In  his  theory  of  justice  as  fairness,  Rawls  (2001, 
pp.  42-43)  attempts  to  define  a  just  distribution  of 
benefits  and  burdens  within  a  state  by  identifying 
the  principles  that  would  be  chosen  by  a  rational, 
self-interested  person  who  has  no  knowledge  of 
her  personal  situation.  He  argues  that  a  such  a 
person  in  this  hypothetical  decision-making  situa¬ 
tion  would  choose  a  society  in  which  social  and 
economic  inequalities  satisfy  two  conditions:  First, 
they  are  to  be  attached  to  offices  and  positions 
open  to  all  under  conditions  of  fair  equality  of 
opportunity  and,  second,  they  are  to  be  to  the 
greatest  benefit  of  the  least  advantaged  members 
of  society  (the  difference  principle). 

To  understand  the  role  of  the  universalization 
demand  in  Rawls’s  theory  correctly,  care  must  be 
taken  not  to  be  misled  by  his  terminology.  In  his 
theory,  the  demand  for  universalization  as  distin¬ 
guished  from  regularization  is  implanted  in  the 
“veil  of  ignorance,”  which  Rawls  stipulated  as  the 
most  crucial  one  of  the  conditions  constitutive  of 
“the  original  position,”  which  is  his  contractarian 
model  of  the  counterfactual  setting  for  choosing 
the  principles  of  justice  to  be  applied  to  the  basic 
structure  of  political  society. 

In  the  original  position,  all  parties  to  the  con¬ 
tractarian  choice  are  indifferent  to  each  other’s 
interests  and  seek  to  advance  their  own  interests. 
But  their  choice  is  constrained  by  the  condition  of 
the  veil  of  ignorance,  under  which  they  are 
deprived  of  all  information  about  themselves  that 
would  enable  them  to  know  their  special  situa¬ 
tions.  They  do  not  know  whether  they  are  talented 
or  disabled,  whether  they  are  rich  or  poor,  to  what 
race  or  ethnicity  they  belong,  what  their  religious 
backgrounds  and  attitudes  to  religions  are,  what 
gender  and  what  sexual  orientation  they  have,  and 
so  on.  This  constraint  has  two  implications.  First, 
all  parties  can  seek  to  optimize  only  their  share  of 
primary  goods  that  are  absolutely  necessary  for 
pursuing  whatever  conception  of  the  good  life  they 


have.  They  cannot  seek  the  special  goods  that  they 
would  have  wanted  if  they  had  known  the  special 
facts  about  themselves  that  the  veil  of  ignorance 
hides  from  them.  Second,  since  they  do  not  know 
what  circumstances  they  will  find  themselves  in 
once  the  veil  of  ignorance  is  lifted,  they  cannot 
help  choosing  the  principles  of  justice  that  they 
could  accept  irrespective  of  whatever  lot  might  fall 
on  them — that  is,  the  principles  that  all  can  accept 
as  fair  irrespective  of  their  different  payoffs. 

The  veil  of  ignorance  is  a  device  for  inducing 
self-interested  individuals  to  choose  a  principle  (or 
a  set  of  principles)  of  justice  that  treats  others  as 
fairly  as  themselves.  Although  Rawls  characterized 
his  theory  as  an  attempt  to  apply  Kantian  con¬ 
structivism  to  political  morality,  his  use  of  the  veil 
of  ignorance  reflects  a  strong  influence  of  Rousseau, 
who,  as  noted  earlier,  tried  to  bring  about  “an 
admirable  accord  of  justice  and  interest”  by 
imposing  on  the  citizenry  as  the  sovereign  political 
ruler  the  requirement  of  the  general  will:  “Each 
necessarily  subjects  himself  to  the  conditions  that 
he  imposes  on  the  others.”  The  veil  of  ignorance 
can  generate  a  similar  accord  between  justice  and 
the  self-interested  choice  of  the  parties  in  the  origi¬ 
nal  position  by  virtue  of  the  demand  for  universal¬ 
ization  that  it  represents  just  as  Rousseau’s  test  for 
the  general  will  does. 

The  aim  of  the  veil  of  ignorance  is  to  disable 
every  party  in  the  original  position  from  choosing  a 
principle  in  consideration  of  the  fact  that  the  cho¬ 
sen  principle  is  in  her  favor  because  of  her  special 
situation.  To  achieve  this  aim,  it  is  not  necessary  to 
deprive  the  parties  of  special  information  as  such — 
namely,  the  special  facts  about  the  individual  mem¬ 
bers  of  the  political  society  to  which  the  chosen 
principle  is  to  be  applied.  All  that  is  necessary  is  to 
deprive  every  party  of  the  knowledge  of  the  identity 
of  the  individuals  in  question — namely,  the  knowl¬ 
edge  of  which  individual  she  is.  Separated  from  the 
information  about  the  individual  identity,  special 
information  does  not  spoil  the  aforementioned  aim 
of  the  veil  of  ignorance.  Moreover,  it  may  be  the 
case  that  full  special  information  about  the  distinc¬ 
tive  urgent  needs  and  particular  vulnerabilities  of  the 
individual  members  of  a  political  society  is  needed  to 
elaborate  the  adequate-priority  rule  for  the  rights 
assigned  to  them  by  the  principles  of  justice  applied 
to  that  society.  Whether  the  latter  point  can  hold 
true  or  not,  it  can  be  said  that  the  normative  essence 
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of  the  veil  of  ignorance  can  be  revealed  most  cor¬ 
rectly  if  it  is  reformulated  as  the  demand  for  uni¬ 
versalization,  which  prohibits  us  from  favoring 
ourselves  and  discriminating  against  others  on  the 
basis  of  differences  between  us  and  the  others  in 
terms  of  the  individual  identity. 

Further  Demands  of  Justice: 

Reversibility  and  Public  Reasons 

The  normative  force  of  the  requirement  that  the 
concept  of  justice  be  rooted  in  universalization  can 
be  more  fully  appreciated  if  light  is  thrown  on  its 
deeper  implication:  the  demand  for  reversibility. 
Prohibition  of  nonuniversalizable  discrimination 
implies  that  if  my  claim  against  another  agent  is  to 
be  just,  I  must  recognize  that  the  agent  could  justly 
make  the  same  claim  against  me  if  he  ivere  I  and  if 
I  were  he.  Otherwise  I  would  discriminate  against 
him  because  he  is  not  I,  that  is,  because  he  and 
I  are  different  simply  in  individual  identity.  The 
claims  about  justice  must  be  such  that  they  would 
be  acceptable  even  if  the  claimer  and  the  claim 
addressee  were  reversed.  This  is  the  demand  for 
reversibility.  It  applies  not  only  to  the  claim  itself 
but  also  to  the  reasons  for  the  claim  that  the 
claimer  advances  to  the  claim  addressee.  The 
claims  about  justice  must  be  justifiable  for  reasons 
that  would  be  acceptable  even  if  the  roles  of 
claimer  and  claim  addressee  were  reversed.  The 
aforementioned  typical  examples  of  injustice  (free 
riding,  vested  interests,  and  collective  egoism)  are 
unjust  because  those  who  enjoy  advantages  in 
these  examples  all  fail  to  pass  the  test  of  revers¬ 
ibility  in  some  way  or  other. 

The  demand  for  reversibility  is  not  a  novel  idea 
at  all.  It  finds  its  classical  expression  in  the  Christian 
Golden  Rule:  “Do  to  others  as  you  would  be  done 
by.”  The  same  idea  is  expressed  by  the  Confucian 
maxim  “Do  not  do  to  others  what  you  would  not 
like  to  be  done  to  yourself.”  But  the  reversibility 
demand  requires  further  refinement  to  forestall  its 
distortion.  Here,  we  can  use  it  to  examine  a  reduc- 
tio  ad  absurdum  argument  that  may  be  made 
against  it.  The  argument  goes  as  follows.  Suppose, 
for  example,  that  you  are  a  masochist.  The  revers¬ 
ibility  demand  would  require  you  to  abuse  others 
because  that  is  what  you  would  like  to  be  done  to 
yourself.  This  conclusion  is  absurd.  Therefore,  the 
reversibility  demand  must  be  rejected. 


The  fundamental  problem  with  this  argument  is 
that  it  misinterprets  the  reversibility  demand  as  the 
requirement  that  we  should  reverse  only  our  cir¬ 
cumstantial  positions  while  keeping  our  perspec¬ 
tives  unreversed.  In  the  above  example,  you  are 
supposed  to  reverse  positions  with  others  while 
carrying  your  masochist  perspective  with  you  into 
their  positions.  But  the  counterfactual  conditional 
“if  you  ivere  they”  involved  in  the  reversibility 
demand  presupposes  that  they  are  not  you.  And 
the  fact  that  they  are  not  you  implies  that  their 
perspectives  are  not  necessarily  the  same  as  yours. 
To  replace  their  distinct  perspectives  with  your 
own  is  to  give  a  privileged  moral  status  to  your 
perspective  simply  because  it  is  yours.  It  runs  coun¬ 
ter  to  the  prohibition  of  nonuniversalizable 
discrimination  that  underlies  the  demand  for 
reversibility.  In  the  moral  thought-experiment  of 
reversibility,  therefore,  you  have  to  test  not  only 
the  positional  reversibility  but  also  the  perspectival 
reversibility  of  your  claims  against  others.  You 
have  to  ask  whether  your  claim  against  others 
would  be  acceptable  even  if  your  perspective  as 
well  as  your  position  were  reversed  with  theirs.  If 
you  as  a  masochist  conducted  this  moral  experi¬ 
ment  on  yourself,  you  would  not  accept  as  just  the 
abuse  of  other,  nonmasochist  persons.  The  reduc- 
tio  ad  absurdum  argument  refutes  not  the  revers¬ 
ibility  demand  itself  but  the  misinterpretation  of  it 
as  a  mere  positional  reversibility  demand. 

The  reversibility  demand  has  a  further  impor¬ 
tant  implication:  the  demand  for  public  reasons. 
As  discussed  above,  the  reversibility  test  applies 
not  only  to  our  claims  against  others  but  also  to 
the  reasons  that  we  give  to  justify  our  claims  to 
them.  Since  the  test  requires  perspectival  revers¬ 
ibility,  we  have  to  justify  our  claims  to  our  claim 
addressees  with  perspectivally  reversible  reasons — 
reasons  that  we  could  accept  even  if  our  perspec¬ 
tives  were  reversed  with  theirs.  Such  reasons  can 
be  called  public  reasons  because  they  are  reasons 
that  would  be  acceptable  not  only  from  our  own 
perspective  but  also  from  the  different  perspectives 
of  others,  thereby  being  distinguishable  from 
private  reasons,  which  are  unacceptable  or  even 
unintelligible  to  those  who  do  not  share  our  idio¬ 
syncratic  perspective.  It  may  be  objected  that  the 
perspectival  reversibility  test  cannot  screen  public 
reasons  because  it  requires  that  we  should  aban¬ 
don  our  private  reasons  only  to  yield  to  the  private 
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reasons  of  other  agents  to  whom  we  are  required 
to  justify  our  claims.  This  objection  fails  to  hold 
because  the  other  agents  are  also  required  to  con¬ 
duct  the  test  of  perspectival  reversibility  on  their 
counterclaims.  The  public  reasons  required  by  the 
perspectival  reversibility  test,  to  put  it  more  pre¬ 
cisely,  are  the  reasons  for  our  claims  against  others 
that  would  be  acceptable  even  from  their  perspec¬ 
tives  provided  they  seek  and  subject  themselves  to 
the  reasons  that  would  be  acceptable  even  if  their 
perspectives  were  reversed  with  ours. 

A  philosophically  thornier  objection  against  the 
demand  for  perspectival  reversibility  and  public 
reasons  is  this.  It  is  impossible  for  you  to  know  the 
perspectives  of  others.  When  you  imagine  that  you 
are  adopting  their  perspectives,  you  are  simply 
projecting  your  perspectives  onto  theirs.  This 
objection  can  be  a  sobering  warning  against  our 
tendency  to  make  hasty,  prejudiced,  and  self- 
deceptive  presumptions  about  what  others  think 
and  how  they  feel,  but  it  is  a  half-truth  whose 
negative  implication  should  not  be  exaggerated.  It 
is  true  that  “other  minds”  are  not  transparent  to 
me  and  that  my  perceptions  about  the  thoughts 
and  feelings  of  others  are  mine,  not  theirs.  But  it 
does  not  follow  that  it  makes  no  moral  difference 
for  me  to  make  an  honest  endeavor  toward  under¬ 
standing  and  paying  due  attention  to  their  perspec¬ 
tives.  Certainly,  the  self-regarding  perspective, 
from  which  I  care  only  about  myself,  and  the 
other-regarding  perspective,  from  which  I  care 
about  the  perspectives  of  others  as  well  as  mine, 
are  both  my  perspectives,  but  there  is  a  big  differ¬ 
ence  between  these  two  perspectives  in  terms  of 
their  moral  quality.  Even  if  I  cannot  go  beyond  my 
perspective,  I  can  bring  about  moral  transforma¬ 
tion  of  my  perspective  from  within  through  the 
process  of  self-critical  reflection  to  which  I  subject 
myself  in  the  dialogue  with  others  who  challenge 
me  about  the  justice  of  my  claims. 

This  point  was  intimated  by  Rousseau  in  his 
distinction  between  the  particular  and  the  general 
will.  If  this  distinction  is  to  be  intelligible,  it  must 
be  interpreted  to  be  concerned  more  with  the 
moral  status  of  the  object  of  the  will  (to  what  pur¬ 
pose?)  than  with  the  subject  of  the  will  (whose 
will?)  as  was  seen  above.  The  particular  and  the 
general  will  are,  in  this  interpretation,  not  the  wills 
of  ontologically  differentiated  agents  (e.g.,  the 
individual  and  the  collective  agent,  the  parts  and 


the  whole  of  society)  but  the  two  morally  differen¬ 
tiated  perspectives  immanent  and  latent  in  each 
individual’s  will.  What  Rousseau  called  “common 
deliberation”  is  expected  to  bring  about  the  moral 
transformation  of  the  perspective  of  each  individ¬ 
ual  from  the  particular  to  the  general  will. 

The  problem  of  the  cognitive  opacity  of  others 
in  morality  was  more  consciously  and  explicitly 
addressed  and  solved  by  Adam  Smith  in  his  classic 
work  The  Theory  of  Moral  Sentiments.  Although 
the  principle  of  sympathy  constitutes  a  foundation 
for  his  moral  theory,  from  the  beginning  of  this 
work,  Smith  conceded  with  a  surprising  candid¬ 
ness  that  since  “we  have  no  immediate  experience 
of  what  other  men  feel,”  the  sympathy  we  expect 
from  others  can  be  nothing  more  than  the  product 
of  our  imagination  about  “what  we  ourselves 
should  feel  in  the  like  situation”  (Pt.  1,  sec.  1, 
chap.  1).  Paradoxically  enough,  this  concession 
did  not  prevent  him  from  developing  a  moral 
theory  in  which  the  crucial  test  of  the  justice  of 
our  conduct  is  whether  our  conduct  would  be  able 
to  gain  the  sympathy  of  spectators.  The  paradox 
is  dissolved  when  it  is  seen  that  he  idealized  the 
spectator  as  impartial  and  internalized  him  in  our 
perspectives.  While  Smith  regarded  the  sympathy 
of  others  as  the  product  of  our  imagination,  he 
distinguished  this  imagination  of  ours  from  our 
raw  feelings  and  instinctive  desires.  Our  imagina¬ 
tion  detaches  us  from  such  feelings  and  desires 
and  generates  the  normative  perspective  of  impar¬ 
tiality  within  us  by  subjecting  us  to  critical 
self-examination.  This  is  why  he  referred  to  the 
impartial  spectator  as  “the  imagined  man  within.” 
Although  Smith  was  keenly  aware  that  we  cannot 
transcend  ourselves  to  have  an  immediate  experi¬ 
ence  of  the  perspectives  of  others,  he  realized  that 
by  exercising  our  imagination  to  care  about  their 
perspective,  we  can  transform  our  perspective 
from  within  so  that  it  will  grow  from  the  sheer 
self-centered  one  in  the  pre-imaginative  stage  into 
something  morally  higher  that  is  open  to  the  sense 
of  impartiality  and  fairness  to  other  persons. 

Justice  and  Legitimacy 

Our  moral  imagination  can  transform  our  motiva¬ 
tion  so  as  to  seek  public  reasons  acceptable  to  oth¬ 
ers  whose  perspectives  are  different  from  ours.  But 
it  does  not  follow  that  we  can  reach  a  consensus 
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about  which  public  reason  should  prevail.  Even  if 
our  moral  imagination  is  fostered  through  our 
common  deliberation,  there  is  no  guarantee  that 
our  judgments  about  public  reasons  will  converge. 
Actually,  part  of  the  condition  of  human  life  is  the 
existence  of  persistent  disagreements  among  those 
who  sincerely  seek  the  public  reasons  required  by 
justice.  This  is  not  surprising.  Judgments  about 
public  reasons  are  guided  by  different  competing 
conceptions  of  justice,  and  the  universalizability 
demand,  which  entails  the  demand  for  public 
reasons,  imposes  only  negative  constraints  on 
admissible  conceptions  of  justice  without  uniquely 
specifying  which  is  the  best  conception.  Further¬ 
more,  the  more  sincere  our  commitment  to  public 
reasons  is,  the  more  difficult  it  is  for  us  to  make 
nonprincipled  strategic  compromises. 

Rousseau  did  not  have  to  worry  very  much 
about  the  disagreement  that  remains  even  after  all 
individuals  have  adopted  the  perspective  of  the 
general  will.  This  is  because  the  polity  he  had  in 
mind  was  a  small-scale,  face-to-face  homogeneous 
community  where  there  is  a  solid  consensus  among 
the  citizens,  even  though  his  favorite  Greek  polis, 
Lacedaemon  (Sparta),  may  not  be  appropriate  as  a 
historical  model  of  such  a  polity  given  the  fact  that 
it  was  not  immune  from  internal  strife.  Whether 
Rousseau’s  ideal  polity  is  sustainable  or  not,  how¬ 
ever,  the  divergence  of  public  reasons  raises  a  seri¬ 
ous  problem  in  contemporary  large-scale  pluralist 
societies.  Contemporary  advocates  of  deliberative 
democracy  tend  to  make  light  of  this  problem 
because  they  are  generally  optimistic  about  the 
possibility  of  reaching  a  rational  consensus  through 
common  deliberation.  There  is  no  ground  for  this 
optimism,  however.  There  is  every  reason  to  think 
that  free,  rational,  and  mutually  critical  discus¬ 
sions  will  bring  about  a  proliferation  of  divergent 
views  rather  than  their  convergence.  Rawls  located 
the  common  basis  of  public  reason  in  the  idea  of 
an  “overlapping  consensus”  on  constitutional 
essentials  and  basic  matters  of  justice  among  com¬ 
peting  reasonable,  “comprehensive  doctrines” 
after  he  converted  to  political  liberalism  as  distin¬ 
guished  from  philosophical  (or  comprehensive) 
liberalism,  but  his  attempt  failed  to  attain  his  aim 
because  he  simply  rejected  the  dissenting  compre¬ 
hensive  doctrines  as  unreasonable  by  resorting  to 
his  own  very  controversial  philosophical  stand¬ 
point  about  what  counts  as  reasonable. 


The  nature  of  this  problem  is  well  captured  by 
Jeremy  Waldron  in  his  explication  of  what  he  calls 
the  circumstances  of  politics,  which  have  the  fol¬ 
lowing  two  features.  First,  the  circumstances  of 
politics  are  contrasted  to  what  Rawls  calls  the 
subjective  circumstances  of  justice,  where  our  con¬ 
ceptions  of  the  good  life  are  so  divergent  that  they 
must  meet  the  condition  of  what  he  calls  the  pri¬ 
macy  of  justice  over  the  good,  which  is  the  require¬ 
ment  that  conceptions  of  justice,  if  they  are  to  be 
the  common  principle  of  the  basic  structure  of  our 
political  society,  should  be  justifiable  indepen¬ 
dently  of  any  specific  conception  of  the  good  life 
and  should  constrain  the  latter.  In  the  circum¬ 
stances  of  politics,  there  is  a  deeper  conflict:  Not 
only  our  conceptions  of  the  good  life  but  also  our 
conceptions  of  justice  are  divided  so  that  the  pri¬ 
macy  of  justice  over  the  good  is  not  enough  to  sift 
out  the  common  political  principle  that  can  govern 
the  basic  structure  of  our  society. 

Second,  whereas  it  is  possible  to  give  a  “let’s 
agree  to  disagree”  solution  to  the  problem  of  the 
conflict  in  the  conceptions  of  the  good  life  by 
endowing  individuals  with  the  right  to  choose  their 
conception  of  the  good  life  autonomously,  it  is 
impossible  to  apply  this  solution  to  the  conflict  in 
the  circumstances  of  politics.  Since  the  conflicting 
conceptions  of  justice  are  competing  to  get  control 
of  the  political  order  of  the  same  society,  conflict 
resolution  in  this  case  can  be  brought  about  only 
by  a  collective  decision  of  the  political  society 
(hereafter  referred  to  as  political  decision  for 
short)  that  binds  not  only  the  supporters  of  its 
content  but  also  the  dissenters  who  object  to  it.  As 
an  example,  although  we  can  let  each  individual 
choose  freely  her  religion  according  to  her  own 
conception  of  the  good  life,  we  cannot  let  each 
individual  choose  the  tax  system  according  to  her 
own  conception  of  distributive  justice.  We  have  to 
make  a  political  decision  about  whether  our  tax 
system  should  be  shaped  along  the  utilitarian 
aggregate-welfare-maximization  line,  the  libertar¬ 
ian  primacy-of-property  line,  the  Rawlsian  differ¬ 
ence-principle  line,  or  the  lines  of  some  other  con¬ 
ceptions  of  distributive  justice,  not  despite  but 
because  of  the  absence  of  agreement  on  the  matter. 

The  circumstances  of  politics  as  presented 
above  raise  the  problem  of  legitimacy:  How  is  it 
possible  for  us  to  respect  as  legitimately  binding 
on  us  the  political  decision  that  we  hold  to  be 
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wrong  according  to  our  conceptions  of  justice? 
The  problem  of  legitimacy  so  stated  may  appear 
to  imply  that  the  idea  of  justice  is  totally  irrelevant 
to  it.  But  this  is  not  the  case.  Although  we  cannot 
resort  to  any  specific  conception  of  justice  as  a 
criterion  of  legitimacy,  the  common  concept  of 
justice  as  the  universalization  demand  can  offer  a 
guide  for  solving  the  problem  of  legitimacy. 
Because  legitimacy  is  concerned  with  the  fact  that 
we  have  to  respect  whichever  conception  of  justice 
we  may  consider  the  best,  it  can  be  illuminated  by 
the  concept  of  justice  as  the  common  normative 
constraint  on  competing  conceptions  of  justice. 
The  following  two  points  are  especially  important 
in  this  connection. 

First,  the  way  democracy  can  give  legitimacy  to 
a  political  decision  can  be  reconceived  in  light  of 
the  concept  of  justice.  In  his  answer  to  the  ques¬ 
tion  of  legitimacy,  Waldron  rejected  constitutional 
constraints  on  democracy,  such  as  judicial  review, 
in  favor  of  simple  majoritarian  democracy.  This 
position  is  difficult  to  sustain.  Democratic  deci¬ 
sions  can  have  legitimacy  only  when  the  losers 
(especially  marginal  minorities  who  cannot  hope 
to  be  winners)  in  the  political  strife  of  the  demo¬ 
cratic  process  can  respect  the  resultant  decisions  as 
the  public  decisions  of  their  own  political  society 
instead  of  regarding  them  as  mere  products  of  the 
majority’s  private  will,  to  which  the  minority 
acquiesces  simply  out  of  prudential  consider¬ 
ations.  This  would  be  impossible  unless  losers  are 
given  some  institutional  guarantee  against  the 
danger  of  the  winning  majority’s  refusal  to  subject 
themselves  to  the  test  of  reversibility,  which 
requires  them  to  examine  sincerely  whether  the 
decisions  that  they  succeeded  in  imposing  on  the 
losers  could  be  justifiable  had  they  themselves 
been  the  losers.  Such  an  institutional  guarantee 
can  and  should  be  embodied  in  constitutional 
arrangements  for  facilitating  power  change  so  as 
to  secure  actual  reversibility  of  the  positions  of 
political  winners  and  losers,  compensating  for  the 
handicap  of  marginal  minorities  in  their  political 
resources  for  effective  democratic  participation, 
preventing  the  government  from  denying  equal 
rights  to  politically  powerless  victims  of  social  dis¬ 
crimination,  and  so  on.  This  is  the  real  significance 
of  the  distinction  between  the  role  and  the  holder 
of  the  role  and  the  normative  background  of  the 
political  institutions. 


Second,  a  further  ground  for  legitimacy  can  be 
explained  by  reconstructing  the  fair-play  argument 
for  political  obligation  in  the  light  of  the  concept 
of  justice.  By  claiming  that  your  conception  of  jus¬ 
tice  should  be  incorporated  in  the  political  decision 
binding  on  others  who  have  competing  concep¬ 
tions  of  justice,  you  also  claim  that  they  should 
bear  the  moral  cost  of  compromising  their  own 
conceptions  of  justice  to  sustain  the  political  sys¬ 
tem  of  collective  decision  making.  Therefore,  you 
should  bear  the  same  moral  cost  when  their  con¬ 
ceptions,  not  yours,  succeed  in  being  incorporated 
in  the  political  decision.  You  should  acknowledge 
the  legitimacy  of  the  political  decision  that  you 
hold  wrong  in  the  light  of  your  conception  of  jus¬ 
tice,  thereby  accepting  the  political  obligation  to 
defer  to  it.  Otherwise,  you  would  be  a  moral  free 
rider  who  violates  the  demand  for  reversibility 
inherent  in  the  concept  of  justice. 
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Kant,  Immanuel  (1724-1804) 


Immanuel  Kant  (born  in  Eastern  Prussia,  now 
Kaliningrad)  is  one  of  the  leading  philosophers  of 
the  Enlightenment  and  has  had  a  lasting  impact  on 
modern  philosophy.  His  major  works  include 
Kritik  der  reinen  Vernunft  ( Critique  of  Pure 
Reason),  first  published  in  1781,  an  attack  on  tra¬ 
ditional  metaphysics  and  epistemology;  Kritik  der 
praktischen  Vernunft  ( Critique  of  Practical 
Reason),  1788,  concerned  with  the  philosophical 
foundations  of  ethics;  and  Kritik  der  Urteilskraft 
( Critique  of  Judgment),  1790,  dealing  with  aes¬ 
thetics  and  teleology.  This  entry  discusses  his 
“political  philosophy,”  even  though  none  of  his 
works  has  appeared  under  this  title.  His  major 
work  on  political  ethics  and  the  international  order 
is  Zum  Ewigen  Frieden  ( Perpetual  Peace),  1795. 

Kant  breaks  with  the  Aristotelian  tradition  of 
political  philosophy  (Politikd),  including  his  “polit¬ 
ical  economy”  (“Aristotle,”  I  8-13)  and  deals  with 
these  subjects  only  under  the  perspective  of  Roman 
law  and  natural  law,  which  he  forms  into  a  law  of 
reason.  Whereas  for  Aristotle  politics  means  the 
realization  of  human  beings  as  political  beings 
(“zoon  politikon”)  and  is  oriented  toward  the 
nature  (“physis”)  of  men,  for  Kant  politics  is 
always  related  to  a  system  of  law  endowed  with 
the  power  of  physical  force.  Politicians  and  politi¬ 
cal  acts  are  judged  according  to  norms  in  the  legal 
system.  Politics  is  judged  positively  when  it  con¬ 
forms  as  an  executive  power  (“ausuebende 
Rechtslehre,”  VIII  370)  to  the  standards  of  law. 


The  goal  of  political  acts  in  “status  civilis”  is  the 
realization  of  the  ultimate  political  good,  stable 
peace  (VI  355,  VIII  386).  Peace  only  can  be 
attained  in  a  cosmopolitan  order.  With  this  idea, 
which  is  still  very  relevant  today,  Kant  follows,  on 
the  one  hand,  stoicism,  a  Hellenistic  school  of  phi¬ 
losophy,  and,  on  the  other,  reflections  on  the 
French  Revolution,  leading  to  his  conclusion  that 
only  republics  with  separation  of  powers  are 
peaceful  (VIII  341-386). 

Political  acts  can  be  judged  according  to  the 
criterion  of  “publicness”  (VIII  381-386).  The 
duty  that  a  politician  performs  can,  but  need  not, 
be  of  an  ethical  nature;  it  is  founded  on  the  special 
legal  duty  to  lead  an  honest  life  (boneste  vine)  that 
all  human  beings  have  (VI  236).  There  is  no  exter¬ 
nal  sanction,  as  with  other  legal  duties,  nor  is 
there  necessarily  an  internal  motivation  to  act 
according  to  this  duty.  Even  those  who  become 
politicians  out  of  sheer  vanity  can  (and  should) 
fulfill  these  duties.  Similarly,  peace  is  a  result  of 
the  ethical  virtues  of  citizens.  For  this  reason,  the 
promotion  of  an  appropriate  infrastructure  and 
education  on  which  men  are  dependent  to  become 
ethical  human  beings  is  a  part  of  politics.  Therefore, 
politics  must  ensure  that  citizens  receive  the  edu¬ 
cation  they  need  to  become  ethical  human  beings 
and  must  create  the  infrastructure  needed  to 
support  it. 

The  actions  of  politicians  are  judged  negatively 
if  they  are  not  oriented  toward  the  law  but  are 
based  on  interests  of  power  and  an  alleged  knowl¬ 
edge  of  human  nature  (VIII  374).  Implicitly,  Kant 
thus  turns  against  Aristotle’s  concept  of  politics, 
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which  presupposes  knowledge  of  human  nature. 
Kant,  like  John  Locke  before  him,  denies  the  pos¬ 
sibility  of  knowing  human  nature  empirically.  In 
his  critical  philosophy,  neither  an  empirical  nor  a 
“rational”  knowledge  of  human  nature  derived 
from  “pure”  reason  can  serve  as  a  foundation  of 
norms  for  political  action  and  ethics,  and  the  rule 
of  law  cannot  be  justified  anthropologically.  Only 
when  both  are  perceived  from  pure  reason  can  this 
knowledge  be  applied  to  human  beings. 

Kant’s  image  of  politicians  in  a  pejorative  sense 
has  been  influenced  by  Niccolo  Machiavelli’s 
Principe  (1513): 

Our  politicians  act,  as  far  as  they  can,  like  [the 
Jewish  prophets]  and  are  as  skilful  in  their 
prophecies.  We  must,  they  say,  take  human 
beings  as  they  are,  and  not  as  some  ignorant 
pedants  or  well-meaning  phantasts  dream  what 
they  should  be.  As  they  are  then  means:  to  what 
we  have  made  them  through  unjust  force  and 
treacherous  acts  of  the  government,  namely 
stubborn  and  rebellious.  Then,  of  course,  when 
the  tight  reins  are  loosened,  sad  things  happen 
which  fulfil  the  prophecies  of  these  “wise 
statesmen.”  (VII  80) 

A  politician  who  is  not  guided  by  law  but  by  his 
alleged  knowledge  of  human  nature  is  incapable 
of  realizing  in  certain  phenomena,  for  example, 
the  French  Revolution,  the  signs  of  reason  and  a 
better  future  and  to  act  accordingly.  In  Perpetual 
Peace,  Kant  calls  the  Machiavellist  a  “political 
moralist”  and  a  person  conscious  of  the  law  a 
“moral  politician”  (VIII  377). 

So  far,  the  definition  of  politics  and  law  seems 
to  be  possible  without  any  problem.  This  becomes 
more  difficult  when  two  specificities  of  the  Kantian 
theory  of  law  and  philosophy  of  history  are  taken 
into  account:  In  his  theory  of  law  (first  part  of  Die 
Metapbysik  der  Sitten  [The  Metaphysics  of  Morals], 
1797),  the  state  does  not  implement  the  law  of 
reason  in  its  pure  form,  as  demonstrated  in  the  two 
sections  on  private  law,  but  the  state  practices  a 
version  modified  for  its  purposes.  In  the  third  sec¬ 
tion  (VI  296ff.),  it  is  shown  that  the  perceptions  of 
“law  in  itself”  have  to  be  restricted  to  be  at  all  real¬ 
ized  by  a  state.  This  is  demonstrated  by  the  exam¬ 
ple  of  some  court  decisions  that  deviate  from  the 
“law  in  itself”  so  that  the  court  and,  therefore,  the 


state  can  come  to  quick  and  solid  decisions.  This 
applies,  for  example,  to  the  respect  of  property 
rights  when  products  have  been  purchased  in  the 
market  according  to  legal  norms  (VI  300-303), 
even  if  they  are  stolen  items  or  (not  mentioned  by 
Kant)  products  from  a  slave  plantation.  The  state 
also  can  apply  “tortura  spirituals,  ”  confessions 
under  oath,  even  if  this  contradicts  the  “law  in 
itself”  (VI  303-305).  Strictly  speaking,  politics  here 
commits  an  injustice,  because  of  the  constraints  of 
the  state  that  lead  to  a  restriction  of  the  “law  in 
itself”  in  its  implementation.  In  the  interpretation 
of  Kantian  legal  and  political  philosophy,  this 
antinomy  has  not  sufficiently  been  taken  into 
account  so  far. 

In  the  philosophy  of  history,  another  peculiar¬ 
ity  can  be  noted.  Whereas  political  economy  in  the 
sense  in  which  Aristotle  used  it  takes  the  same 
nature,  the  physis,  of  men  as  the  guiding  principle 
( Politika  I),  for  Kant  a  negative  anthropology  is 
the  real  source  of  political  and  economic  prosper¬ 
ity.  Ethics  and  politics  do  not  coincide,  as  already 
diagnosed  by  Bernard  de  Mandeville  in  his  The 
Fable  of  the  Bees:  Or,  Private  Vices,  Public 
Benefits.  There,  the  result  was  that  if  politicians 
really  take  moral  laws  seriously,  they  would 
regress  economically  and  would  have  to  retreat 
from  modernity  into  the  woods! 

Kant  follows  Mandeville  and  describes  both  in 
his  philosophy  of  history  and  in  his  anthropology 
the  moving  forces  of  the  common  good  as  the 
unfettered  instincts  of  greed,  vanity,  and  power 
(VIII  21,  VII  271).  This  triad  is  already  a  firm 
constellation  for  Thomas  Hobbes  ( Leviathan  I,  II: 
“Riches,  Honour,  Command” ),  and  not  by  chance, 
because  these  are  the  moral  vices  that  are  opposed 
to  the  first  three  cardinal  virtues  in  Plato’s  Politeia. 
The  “modestia”  of  peasants  is  opposed  by  their 
greed,  the  “fortitudo”  of  soldiers  by  their  ambi¬ 
tion  for  honor,  and  the  “prudentia”  of  kings  by 
their  lust  for  power.  Whereas  in  antiquity  and  the 
Middle  Ages  the  naturally  good  polis  or  city  was 
founded  on  the  virtues  (including  the  fourth  one, 
justice)  and  the  evil  city  (“citta  cattiva”)  was  sub¬ 
jected  to  the  opposite  vices,  in  modern  times  this 
judgment  is  turned  around:  The  vices  are  now  the 
necessary  forces  that  maintain  and  foster  the  state 
economically  and  politically.  This  corresponds  also 
to  the  view  of  Giambattista  Vico,  who  considers  it 
inevitable  that  a  well-functioning  society  is  based 
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on  “ferocia,  avarizia,  ambizione,”  the  “tre  vizi” 
that  have  accompanied  mankind  since  its  eviction 
from  paradise  (La  scienza  nuova,  second  edition, 
II  7).  Kant’s  diagnosis  of  historical  and  political 
reality  is  the  same.  What  about  this  dilemma  in 
Kant’s  view  of  current  politics?  Must  a  politician 
for  moral  reasons  introduce  and  maintain  laws 
that  will  ruin  the  state,  or  does  he  or  she  have  to 
tolerate  vices  or  even  promote  them  because  of  the 
“raison  d’etat”?  This  is,  of  course,  a  very  topical 
question  even  in  the  present  day.  Kant  himself  has 
not  addressed  this  dilemma.  His  solution  may  be 
as  follows:  A  moral  politician  in  a  liberal  society 
with  an  efficient  economy  in  the  sense  of  Adam 
Smith  (The  Wealth  of  Nations)  would  not  inter¬ 
fere  with  the  internal  motives  of  actors  and  ban 
vices  as  long  as  the  external  frame  of  law  is  main¬ 
tained.  The  rules  of  the  law  of  reason  restrict 
greedy,  vain,  and  power-seeking  actors  but  enable 
them  to  secure  their  profits.  That  private  profits 
will  lead  to  welfare  for  all  lies  not  only  for  Adam 
Smith  but  also  for  Kant  in  the  nature  of  things 
(“the  invisible  hand”).  “As  an  old  merchant  said: 
create  good  roads,  mint  good  money,  exchange 
currencies  quickly,  etc.  but  apart  from  that  ‘let  us 
do!”’  (VII  19-20).  This  “laissez-faire,  laissez- 
aller”  policy  is  the  solution  for  Kantian  economic 
policy.  Politicians  should  not  intervene  in  self¬ 
regulating  trade.  Johann  Gottlieb  Fichte  as  well  as 
Karl  Marx  would  have  stood  against  this  liberal 
economic  policy;  Kant  closed  his  eyes  to  the  dev¬ 
astating  consequences  of  a  liberated  economy  in 
Europe,  internal  exploitation,  and  exploitation  of 
the  economically  weaker  countries  by  the  stronger 
ones  in  the  colonial  territories.  Kant  believed  that 
these  problems  would  resolve  themselves  in  the 
long  run. 

Kant  does  not  comment  on  the  day-to-day  poli¬ 
tics  of  his  time,  neither  in  his  publications  nor  in 
his  notes  or  letters.  The  central  event  of  his  time  is 
the  French  Revolution  after  1789,  which  marked 
the  turn  from  despotism  to  a  republic  (not  the 
“Declaration  of  Independence”  in  Virginia  in 
1776)  and  which  was  seen  by  other  authors  as  a 
parallel  event  to  Kant’s  own  “Copernican 
Revolution”  of  breaking  with  ancient  and  medi¬ 
eval  metaphysics  (III  12,  14).  For  Kant,  it  is  evident 


that  men  legally  can  only  pursue  reforms,  not  a 
forceful  revolution,  but  the  French  Revolution  had 
demonstrated  the  feasibility  of  a  republic — this 
event  “will  not  be  forgotten”  (VII  88). 

Author’s  note:  References  to  Kant’s  works  refer  to  the 
volumes  and  pagination  of  the  Akademie  edition,  Berlin, 
Germany,  1990ff. 

Reinhard  Brandt 
Philipps-  Universitat  Marburg 
Marburg,  Germany 

Translated  by  Dirk  Berg-Schlosser 

See  also  Hobbes,  Thomas;  Justice;  Machiavelli,  Niccolo; 
Marx,  Karl;  Peace 

Further  Readings 

Aristotelis.  (1831).  Opera  omnia  (J.  Bekker,  Ed.).  Berlin, 
Germany. 

Beiner  R.,  &  Booth,  J.  (Eds.).  (1993).  Kant  and  political 
philosophy.  The  contemporary  legacy.  New  Haven, 
CT:  Yale  University  Press. 

Hoffe,  O.  (Ed.).  (1995).  Klassiker  Auslegen:  Immanuel 
Kant,  Zum  ewigen  Frieden  [Classic  design:  Immanuel 
Kant:  Perpetual  peace].  Berlin,  Germany:  Akademie. 
Hiining,  D.,  &  Tuschling,  B.  (Eds.).  (1998).  Recht,  Staat 
und  Volkerrecht  bei  Immanuel  Kant  [Law,  state  and 
international  law  by  Immanuel  Kant].  Berlin, 

Germany:  Duncker  &  Humblot. 

Kersting,  W.  (1993).  Wohlgeordnete  Freiheit.  Immanuel 
Kants  Rechts-  und  Staatsphilosophie  [. . .]  Kant  und  die 
politische  Philosophie  der  Gegenwart  [Well-ordered 
liberty:  Immanuel  Kant’s  legal  and  political  philosophy 
[.  .  .]  Kant  and  the  political  philosophy  of  the  present]. 
Frankfurt,  Germany:  Suhrkamp. 

Mandeville,  B.  de.  (1924).  The  fable  of  the  bees:  Or, 
private  vices,  public  benefits  (F.  B.  Kaye,  Ed.).  Oxford, 
UK:  Oxford  University  Press.  (Original  work 
published  1714) 

Plato.  (1901).  Opera  omnia  (J.  Burnet,  Ed.).  Oxford,  UK. 
Smith,  A.  (1910).  The  wealth  of  nations  (E.  R.  A. 
Seligman,  Ed.).  New  York:  Dutton.  (Original  work 
published  1776) 

Vico,  G.  (1963).  La  scienza  nuova  [The  new  science] 

(2nd  ed.;  P.  Rossi,  Ed.).  Milan,  Italy:  Rizzoli.  (Original 
work  published  1744) 


L 


Labor  Movement 


The  concept  of  labor  movement  refers  to  the 
development  of  workers’  organizations  driven  by 
collective  mobilization.  It  consequently  implies 
some  degree  of  coexistence  between  two  distinct 
processes  subject  to  close  analysis  in  political 
science:  (1)  interest  representation  and  (2)  social 
mobilization.  Descriptions  of  labor  movements 
that  emphasize  the  former  process  usually  focus 
on  trade  unions,  these  being  associations  that 
represent  the  labor  force  as  well  as  organizations 
that  play  wider  economic  and  political  roles.  By 
contrast,  descriptions  that  emphasize  the  latter  pro¬ 
cess  concentrate  on  the  dynamics  of  social  move¬ 
ments  and  therefore  center  on  the  construction  of  a 
social  identity,  the  forms  assumed  by  collective 
mobilization,  and  its  eventual  institutionalization. 

This  entry  first  outlines  the  main  “classical”  theo¬ 
ries  concerning  the  labor  movement.  It  then  shows 
how  the  labor  movement  has  evolved  amid  constant 
tension  between  collective  mobilization  and  institu¬ 
tional  forms  of  labor  representation.  Next,  it  dis¬ 
cusses  how  trade  unions  work  to  represent  workers’ 
interests,  as  well  as  acting  as  partners  to  employers 
and  the  state  in  collective  bargaining  and  tripartite 
concertation.  A  brief  conclusion  considers  the  role 
the  labor  movement  may  play  in  the  near  future. 

In  Search  of  a  Theory  of 
the  Labor  Movement 

Labor  movements  are  phenomena  both  complex 
and,  in  a  sense,  contradictory.  They  first  arose  as 


groups  organized  to  improve  conditions  in  capital¬ 
ist  industry.  But  they  have  often  pursued  the  much 
more  ambitious  goal  of  changing  society  as  a 
whole.  Everywhere,  they  have  bred  organizations 
that  gradually  came  to  accept  the  rules  applied  by 
the  economic  and  political  systems  in  which  they 
had  to  operate  but  still  pursued  long-term  goals 
that  frequently  conflicted  with  those  rules.  Although 
these  organizations  manage  industrial  and  social 
conflict,  the  integration  of  large  masses  of  workers 
into  the  economic,  social,  and  political  systems 
of  industrial  democracies  would  have  been  very 
different  in  nature — and  perhaps  impossible — 
without  them. 

While  contemporary  social  sciences  have  con¬ 
ducted  detailed  analyses  of  labor  movements, 
more  general  discussions  of  them  are  still  largely 
couched  in  conceptual  terms  drawn  from  theories 
developed  several  decades  ago  that  may  be  termed 
classic.  The  explanatory  capacity  of  these  theories 
is  dubious  and  erratic,  but  their  continuing  interest 
for  social  scientists  stems  from  rather  different  fac¬ 
tors.  Each  of  them  has  been  put  forward  as  a  gen¬ 
eral  theory  but  has  centered  on  only  one  of  the 
different  alternatives  faced  by  labor  movements 
across  countries  and  historical  periods.  Yet  it  is 
precisely  for  this  reason  that  these  theories  have 
become  important  as  self-interpretations  by  differ¬ 
ent  components  of  the  international  labor  move¬ 
ment,  thereby  influencing  the  latter’s  perception  of 
itself  and  its  goals.  Hence,  these  theories  are  simul¬ 
taneously  analytical  instruments  and  ideologies. 

From  this  point  of  view,  Marxism — which  devel¬ 
oped  the  most  powerful  theoretical  apparatus — 
also  can  be  seen  as  the  dominant  ideology  shaping 
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socialist  positions  within  labor  movements.  Its 
contribution  to  the  analysis  of  such  movements, 
however,  is  still  relevant  in  many  respects.  First,  by 
framing  labor  movements  in  the  context  of  a  struc¬ 
turally  divided  class  society,  it  helps  remedy  the 
shortcomings  of  merely  psychological  or  behavioral 
explanations  of  their  origins  and  goals.  Second,  by 
explaining  industrial  conflict  in  light  of  a  theory  of 
social  change,  Marxism  suggests  that  analysis 
should  extend  beyond  the  short-term  objectives  of 
such  conflict  and  consider  the  aspirations  latent 
within  it  to  achieve  a  different  social  order,  which 
may  cyclically  orient  collective  action.  Finally, 
by  viewing  unionism  as  one — albeit  the  most 
important — of  the  many  different  forms  assumed 
historically  by  labor  organization,  it  allows  the  lat¬ 
ter  to  be  seen  in  relation  to  other  types  of  action 
available  to  workers  rather  than  as  the  sole  natural 
expression  of  their  needs  and  culture. 

While  the  Marxist  socialist  tradition — in  its 
social-democratic  and  communist  versions  alike — 
has  by  far  outweighed  any  others  in  the  history  of 
labor  movements,  a  significant  role  has  neverthe¬ 
less  been  also  played  in  several  countries  by  the 
“revolutionary  syndicalist”  tradition.  This  tradi¬ 
tion’s  departure  from  Marxist  theories  has  been 
evident  ever  since  the  early  work  of  its  major  pro¬ 
ponent,  Georges  Sorel,  who  envisaged  workers’ 
autonomy  embodied  in  trade  unions  as  a  “global 
society”  with  its  own  economy,  organizational 
forms,  and  culture,  which  contrasted  with  the 
decaying  bourgeois  society.  Sorel’s  theory  was 
based  on  a  view  of  the  social  structure  as  sharply 
divided  into  two  classes.  It  was  the  purpose  of  the 
general  strike — the  key  form  of  action  in  the  revo¬ 
lutionary  syndicalist  tradition — to  expose  this 
dichotomy.  Sorel’s  theory  was  also  based  on  a 
simplistic  view  of  political  structures  and  institu¬ 
tions.  Nevertheless,  like  Marxism,  in  late-19th- 
and  early-20th-century  Europe,  it  expressed  the 
ambition  of  a  rising  working  class  to  become  the 
hegemonic  class  and  therefore  to  go  beyond  its 
activity,  confined  to  the  labor  market,  to  address 
more  general  societal  problems. 

From  this  point  of  view,  theories  of  the  labor 
movement  developed  in  the  Anglo-American  coun¬ 
tries  stand  in  sharp  contrast  with  those  described 
above.  Their  differences  notwithstanding,  they 
have  never  proposed  far-reaching  projects  for  the 
revolutionary  transformation  of  capitalist  society. 


Their  objectives  have  been  more  modest,  in  that 
they  have  concerned  themselves  with  the  analysis 
of  the  actual  role  performed  by  the  labor  move¬ 
ment  in  the  existing  society  rather  than  with  sce¬ 
narios  for  social  change.  The  major  theoreticians  of 
British  trade  unionism,  Sidney  and  Beatrice  Webb, 
developed  what  can  be  regarded  as  a  functionalist 
approach.  Trade  unions  were  merely  means  to 
achieve  specific  goals.  And  they  were  so  in  a  two¬ 
fold  sense:  (1)  because  their  function  was  defined 
within  the  strict  limits  of  workers’  defense  against 
the  workings  of  the  market  and  (2)  because  such 
activity  had  to  be  compatible  with  the  systemic 
imperatives  of  firm  productivity  and  economic 
development.  A  more  general  implication  was  that 
the  labor  movement’s  legitimacy  should  be  assessed 
not  only  in  terms  of  its  utility  to  workers  but  also 
in  terms  of  its  contribution  to  creating  a  more 
democratic  though  efficient  society. 

Selig  Perlman’s  view — which  can  be  considered 
the  theoretical  basis  for  North  American  “business 
unionism” — differed  sharply  from  that  of  the 
Webbs.  Perlman  maintained  that  the  mentality  of 
workers  and  militants  was  dominated  by  their 
awareness  of  the  scarcity  of  jobs.  Far  from  being 
based  on  class  consciousness,  therefore,  actual 
labor  movements  were  based  on  the  consciousness 
of  the  competitive  nature  of  the  job  market.  While 
intellectuals  in  trade  unions  sought  to  give  them 
anticapitalist  objectives,  the  main  purpose  of 
workers’  self-organization  was  to  claim  the  collec¬ 
tive  ownership  of  available  jobs  so  that  they  could 
be  shared  among  the  members  of  the  union. 
Workers  were  uninterested  in  the  endeavor  to 
reform  the  capitalist  system  and  in  mass  political 
action.  Nevertheless,  the  long  history  of  the  labor 
movement  demonstrates  that  Perlman’s  theory, 
too,  unduly  generalized  values  dominant  in  spe¬ 
cific  periods  and  countries,  besides  assuming  the 
existence  of  a  universal  “workers’  psychology,” 
which,  as  shown  by  a  large  body  of  research,  is 
instead  strictly  determined  by  the  specific  culture 
and  historical  context. 

None  of  the  classic  theories,  therefore,  addressed — 
let  alone  answered — the  crucial  question  for  a  theory 
of  the  labor  movement:  Under  what  conditions  are 
the  different  components  of  the  working  class  and 
their  organizations  induced  to  behave  as  fragmented 
interest  groups  or,  conversely,  as  unified  collective 
actors  pursuing  economic  and  social  goals?  Despite 
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decades  of  comparative  work  in  the  social  sciences, 
this  question  is  still  largely  unanswered.  A  large 
amount  of  evidence  has  been  collected  to  account 
for  developments  in  individual  countries,  but  a 
convincing  theory  with  general  explanatory  power 
is  still  far  from  being  developed. 

Collective  Mobilization  and  Its  Fate 

In  several  countries,  the  origin  of  labor  movements 
was  twofold  in  nature:  (1)  as  a  form  of  solidarity 
and  defense  of  workers  and  (2)  as  a  revolt  against 
the  capitalist  mode  of  production  and  society.  The 
former  tendency  led  to  the  creation  of  various  types 
of  friendly  societies,  cooperatives,  and,  later,  trade 
unions.  The  outcomes  of  the  latter  ranged  from 
Luddism  to  workers’  councils,  to  different  forms  of 
political  mobilization  and  organization  for  the 
achievement  of  citizenship  rights.  Both  these  ten¬ 
dencies  sprang  essentially  from  the  exclusion  of  a 
young  working  class  from  the  management  of  the 
economy  and  society  in  countries  undergoing  rapid 
industrialization  and  development.  Both  tendencies 
embodied  contradictory  aims:  workers’  rejection  of 
the  existing  society  and  their  endeavor  to  build  an 
autonomous  culture,  on  the  one  hand,  and  their 
aspirations  to  full  industrial  and  political  citizen¬ 
ship,  on  the  other. 

In  the  early  stages  of  the  Industrial  Revolution, 
the  more  educated  and  resourceful  skilled  workers 
also  acted  as  a  politicized  vanguard.  They  saw 
socialism  as  an  opportunity  to  regain  control  over 
the  means  of  production,  which  had  been  expro¬ 
priated  from  craftsmen.  However,  technological 
innovations  and  the  rationalization  of  production 
soon  rendered  obsolete  the  skills  that  gave  those 
workers  their  market  power.  At  the  same  time, 
because  they  increased  the  rigidity  of  production 
and  interdependency  among  jobs,  they  endowed 
unskilled  workers  with  unprecedented  disruptive 
power.  Moreover,  just  as  technology  and  rational¬ 
ization  broke  the  monopolistic  position  of  skilled 
workers  and  enabled  the  mass  of  workers  to  orga¬ 
nize,  so  too  did  the  peasant  origin  of  the  latter 
rupture  the  traditional  political  subcultures  with 
their  strong  identities  and  revolutionary  projects. 

Generalizing  and  simplifying  somewhat,  it  can 
be  argued  that  the  more  a  labor  movement  grows 
and  expands  its  role  in  society,  the  more  its  ideo¬ 
logical  tension  comes  under  severe  strain.  Its  main 


activity  becomes  collective  bargaining,  rather  than 
political  mobilization  or  expressive  action.  Besides 
enlarging  their  rank  and  file,  this  helps  trade 
unions  develop  their  organizational  strength. 
Problems  to  do  with  bureaucratization  arise,  and 
more  generally,  organizational  concerns  tend  to 
prevail  over  those  typical  of  a  social  movement. 
Partly  for  this  reason  and  partly  because  the  polit¬ 
ical  recognition  of  labor  movements  (first  in  the 
West  and  then  in  other,  developing  areas  of  the 
world)  gives  workers  full  political  and  social  citi¬ 
zenship,  the  revolutionary  impetus  has  dwindled. 
The  economic  action  of  labor  movements  is 
increasingly  restricted  to  negotiation  within  a 
framework  of  shared  rules,  while  political  action 
takes  the  form  of  pressure  politics  or  support  for 
prolabor  reformist  parties. 

Of  course,  what  has  just  been  outlined  is  only  a 
very  general  pattern  that  applies  to  a  greater  or 
lesser  extent  to  labor  movements  in  different  coun¬ 
tries.  Today,  the  main  differences  among  their 
behaviors  are  apparent  in  the  diverse  structures 
assumed  by  collective  bargaining  or  in  the  relative 
importance  of  this  form  of  action  compared  with 
political  action.  To  some  extent,  all  labor  move¬ 
ments  use  both  instruments.  But  some  of  them 
(e.g.,  in  the  United  States)  by  tradition  place  over¬ 
whelming  emphasis  on  the  former.  Through  nego¬ 
tiation  with  companies,  they  seek  to  achieve  the 
joint  regulation  of  labor.  Bargaining  focuses  princi¬ 
pally  on  wages  and  working  conditions,  but  it  may 
extend  to  many  other  aspects  of  workers’  lives — 
pensions,  health  insurance,  and  so  on.  By  contrast, 
many  European  and  Latin  American  labor  move¬ 
ments  have  given  greater  priority  to  political 
action.  Trade  unions  have  long  worked  in  close 
alliance  with  “brother”  (socialist  or  labor)  parties 
to  support  reformist  programs,  and  they  have  a 
direct  interest  in  labor  and  social  legislation.  In  the 
post-World  War  II  period,  moreover,  most  of  them 
also  engaged,  for  shorter  or  longer  periods,  in  con- 
certation  with  governments  to  design  economic 
and  social  policies. 

Hence,  the  evolution  of  labor  movements  shows 
that  a  revolutionary  and  globalistic  approach  may 
prevail,  especially  when  workers  uprooted  from 
their  communities  and  not  yet  integrated  in  indus¬ 
trial  society  are  rapidly  introduced  into  modern 
production.  When,  conversely,  workers  are  rather 
well  integrated  into  society,  labor  movements  are 
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less  able  to  provide  their  members  with  identity 
incentives.  They  are  more  concerned  to  seek  par¬ 
ticularistic  advantages  than  to  strive  after  the  ideal 
of  an  alternative  society.  This  observation,  how¬ 
ever,  should  not  be  mistaken  for  an  evolutionary 
theory.  Even  less  should  it  be  taken  as  the  obituary 
for  the  labor  movement  that  diverse  theorists  in 
different  periods  have  been  wont  to  write.  Industrial 
societies  may  cyclically  reproduce  the  phenomena 
described.  They  continue  to  generate  new  groups 
in  the  labor  market  and  in  society  at  large  that  lack 
adequate  representation  of  their  interests  and 
therefore  occupy  marginal  positions  in  the  indus¬ 
trial  relations  and  political  systems.  Moreover,  as 
new  countries  set  off  along  the  road  of  rapid  eco¬ 
nomic  development,  their  labor  movements  are 
likely  to  exhibit  similar  patterns. 

Trade  Unions  as  Representative 
Organizations 

In  several  countries,  trade  unions  became  powerful 
organizations  that  entirely  assimilated  the  require¬ 
ments  of  capitalist  systems  and  their  workings  but 
still  appealed  to  a  collective  identity  that  often 
embodied  long-term  goals  in  conflict  with  those 
systems.  Today,  trade  unions  are  widely  regarded 
by  companies  as  rent-seeking  bodies.  Nevertheless, 
they  often  perform  key  productive  functions  by 
providing  workforce  cooperation,  participating  in 
income  policies,  and  more  generally  contributing 
to  the  regulation  of  work. 

Research  in  the  social  sciences  has  identified 
three  main  dimensions  of  unionism.  The  first 
dimension  consists  in  the  role  of  trade  unions  as 
associations  representing  collective  interests.  The 
second  comprises  the  problems  and  opportunities 
facing  trade  unions  as  organizations.  The  third 
dimension  concerns  their  role  as  economic  and 
political  actors. 

A  trade  union  is  an  association  of  individual 
members  whose  functional  interests  it  seeks  to 
represent  collectively.  The  drawback  of  the  collec¬ 
tive  representation  of  functional  interests  is  that 
rank-and-file  demands  are  inevitably  altered  by 
the  association  that  expresses  them.  In  fact,  any 
trade  union  association  must  first  collect  and 
interpret  workforce  demands,  giving  voice  to 
what  are  often  vaguely  expressed  needs  or  griev¬ 
ances.  Second,  the  requirements  of  consistency, 


organizational  stability,  and  recognition  prevent 
even  highly  representative  associations  from 
expressing  demands  that  contrast  with  their  long¬ 
term  strategy.  Third,  if  a  trade  union  is  to  gain 
concrete  benefits  for  its  members,  it  must  make 
such  demands  negotiable,  and  for  this  purpose  it 
must  transform  expressive  behavior  into  instru¬ 
mental  claims.  Finally,  to  represent  interests 
collectively,  it  must  aggregate  individual  demands 
that  are  normally  dispersed  and  potentially  con¬ 
tradictory.  It  is  these  factors  that  explain  why, 
regardless  of  the  quality  of  a  trade  union’s  inter¬ 
nal  democracy  and  the  responsiveness  of  its  lead¬ 
ership,  it  inevitably  filters,  and  even  profoundly 
changes,  the  demands  put  forward  by  the  employ¬ 
ees  that  it  seeks  to  represent.  Furthermore,  for  a 
trade  union  to  be  considered  a  reliable  partner  by 
the  companies  or  public  institutions  with  which  it 
reaches  agreements,  it  must  control  and  to  some 
extent  police  its  members  to  ensure  that  they 
comply  with  the  rules  agreed  on. 

This  may  give  rise  to  “crises  of  representation,” 
which  unions  must  be  able  to  control  if  they  are  to 
survive  and  maintain  their  role.  The  rate  of  union¬ 
ization,  or  “union  density” — that  is,  the  percent¬ 
age  of  union  members  in  the  workforce — is  usually 
considered  to  be  the  best  indicator  of  a  trade 
union’s  ability  to  represent  employees’  interests, 
and  indeed  of  its  strength.  However,  a  large  body 
of  research  has  shown  that  the  variation  in  union 
density  across  countries  is  closely  conditioned  by 
several  other  factors,  among  them  the  role  of  trade 
unions  as  providers  of  services.  Moreover,  union 
density  is  strongly  affected  by  the  varying  “cover¬ 
age”  of  collective  bargaining:  In  several  countries, 
this  is  established  by  law,  so  that  collective  agree¬ 
ments  apply  to  all  employees  irrespective  of 
whether  they  are  members  of  the  unions  signatory 
to  those  agreements. 

Yet  trade  unions  are  not  simply  associations 
formed  if  and  when  employees  feel  a  need  for  col¬ 
lective  representation  of  their  interests.  After  an 
initial  formative  period,  they  have  everywhere 
grown  into  stable  organizations,  which  pursue  as 
much  the  goal  of  maintaining  and  enlarging  their 
power  vis-a-vis  other  organizations  as  the  original 
one  of  representing  their  members’  interests.  This 
logic  of  action  induces  trade  unions  to  try  to  opti¬ 
mize  their  resources  whenever  they  must  find  solu¬ 
tions  for  the  dilemmas  they  encounter. 
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The  first  of  these  dilemmas  for  a  trade  union  is 
how  to  define  its  sphere  of  representation  or,  in 
other  words,  the  boundaries  of  the  interest  group 
whose  demands  it  intends  to  advance.  Historically, 
the  main  options  have  been  to  act  as  an  “associa- 
tional”  or  as  a  “class”  union.  A  union  of  the 
former  kind  chooses  to  restrict  its  sphere  of  rep¬ 
resentation  to  the  members  that  it  actually 
recruits.  The  latter  kind  of  union  claims  to  speak 
on  behalf  of  all  workers,  whether  or  not  they  are 
its  members.  The  second  dilemma  for  a  trade 
union  is  how  to  define  its  sphere  of  action:  Should 
it  give  priority  to  action  in  the  market,  addressing 
demands  to  companies  and  negotiating  with 
them,  or  should  it  give  priority  to  action  in  the 
state,  addressing  public  institutions  as  the  main 
targets  of  its  claims  and  its  main  partners? 
However  a  trade  union  may  define  whom  it 
wants  to  represent  and  with  whom  it  wants  to 
interact,  it  still  has  several  further  choices  to 
make.  One  such  choice  is  among  the  instruments 
for  action,  which  can  range  from  conflict,  to 
collective  bargaining,  to  various  forms  of  joint 
management,  to  tripartite  concertation.  Another 
concerns  the  level  of  action,  which  may  be  cen¬ 
tralized  at  the  industry  or  cross-industry  level  or 
decentralized  to  the  company  or  territorial  level. 

In  addition  to  acting  as  associations  that  repre¬ 
sent  collective  interests  and  large-scale  organiza¬ 
tions,  trade  unions  have  grown  into  important 
actors  in  several  political  economies.  Where  they 
have  moved  beyond  an  exclusively  distributive 
function,  they  have  come  to  perform  a  wider  role 
in  economic  development.  On  the  one  hand,  in 
fact,  they  may  be  decisive  factors  in  labor  market 
rigidity  and  an  excessively  generous  social  expen¬ 
diture  that  generates  huge  public  deficits.  On  the 
other,  they  may  perform  positive  functions  for 
companies  by  organizing  the  workforce’s  coopera¬ 
tion  with  the  new  modes  of  production  and  con¬ 
tributing  to  its  skill  formation.  They  can  also  help 
governments  improve  economic  performance  by 
coordinating  wage  dynamics  in  accordance  with 
an  income  policy  and  by  contributing  to  labor 
market  and  welfare  reforms. 

Open  Questions  and  Scenarios 

Today,  the  issue  that  concerns  all  countries  is  the 
new  meaning  and  scope  of  solidarity  among  workers 


or,  to  put  it  bluntly,  the  capacity  of  the  labor 
movement  itself  to  construct  social  and  political 
solidarity.  If  a  labor  movement  decides  to  do  no 
more  than  represent  only  traditional  workers,  it 
will  continue  to  enjoy  the  advantages  of  participat¬ 
ing  in  the  economic  and  political  systems.  But  it 
will  do  so  at  the  cost  of  seeing  its  status  reduced  to 
that  of  just  one  interest  group  among  others,  and  it 
will  probably  have  to  deal  with  the  rise  of  new 
representative  organizations  in  competition  with  it. 
It  may  alternatively  decide  to  offer  interest  repre¬ 
sentation  to  newly  formed  social  groups  as  well, 
although  these  often  mobilize  around  nonnegotia- 
ble,  or  at  least  radical,  demands.  If  it  does  so,  it  will 
probably  extend  its  political  influence  and  assume 
the  features  of  a  social  movement  but  at  the  cost  of 
losing  some  or  most  of  the  benefits  stemming  from 
its  economic  and  political  integration. 

Whether  we  regard  trade  unions  as  primarily 
associations  for  interest  representation,  as  organi¬ 
zations,  or  as  economic  and  political  actors,  the 
importance  that  they  acquired  in  most  advanced 
economies  in  the  latter  part  of  the  20th  century  is 
increasingly  challenged  by  various  developments. 
The  most  important  of  these  is  perhaps  the  con¬ 
tinuing  fragmentation  of  employees’  interests  and 
demands  due  to  processes  such  as  the  reorganiza¬ 
tion  of  production,  the  search  for  flexibility,  and 
the  growth  of  the  service  sector  and  of  nonmanual 
and  atypical  work.  Trade  unions  may  find  it  in¬ 
creasingly  difficult  to  build  their  demands  around 
key  professional  figures  such  as  the  assembly-line 
worker  in  the  Fordist  factory.  Not  only  has  it 
become  more  difficult  to  aggregate  demands  and 
identify  “general  class  interests,”  but  as  individual 
workers  identify  increasingly  less  with  the  “work¬ 
ing  class”  as  a  whole — and  with  the  projects  for 
economic  and  political  reform  historically  pursued 
by  the  labor  movement — they  also  tend  to  identify 
more  with  the  company  or  production  unit  to 
which  they  belong.  In  other  words,  the  traditional 
endeavor  by  trade  union  organizations  to  impose 
uniform  protection  standards  on  employers  is  now 
regarded  by  many  employees  as  an  oversimplifica¬ 
tion  of  their  needs  and  capabilities. 

In  conclusion,  the  labor  movement  in  the  new 
millennium  seems  to  be  faced  with  various  difficul¬ 
ties  but  also  with  new  opportunities.  The  former 
indubitably  include  the  reduction  of  trade  unions’ 
scope  for  action  within  the  economic  system  due 
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to  industrial  restructuring  and  the  greater  precari¬ 
ousness  of  labor  markets.  Among  the  more  general 
difficulties  is  the  disappearance  of  a  certain  type  of 
solidarity  based  on  homogeneous  working  and  liv¬ 
ing  conditions,  ideology,  and  the  ability  to  stan¬ 
dardize  demands.  On  the  other  hand,  the  new 
opportunities  open  to  labor  movements  seemingly 
depend  above  all  on  their  capacity  to  interpret  and 
mediate  the  need  of  management  for  closer  work¬ 
force  involvement  in  companies  by  qualifying  the 
cooperation/involvement  requested  and  subordi¬ 
nating  it  to  the  fulfillment  of  workers’  general 
interests  and  objectives.  From  this  point  of  view,  it 
may  become  of  great  importance  for  labor  move¬ 
ments  to  develop  the  ability  to  induce  companies 
and  institutions  to  relinquish  market  strategies 
based  exclusively  on  costs  and  to  adopt,  instead,  a 
competitiveness  model  based  on  product  quality 
and  a  highly  skilled  workforce — that  is,  a  model 
centered  on  the  full  development  of  human 
resources. 
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Language  and  Politics 


See  Political  Communication;  Politics  of 
Language 


Leadership 


Given  the  essential  and  inescapable  role  of  leaders 
in  political  life,  the  notion  of  political  leadership 
has  been  a  constant  presence  in  the  development 
of  political  thought.  Historically,  however,  there 
has  never  been  an  agreed-on  definition  of  political 
leadership,  and  even  the  political  science  literature 
developed  during  the  second  half  of  the  20th  cen¬ 
tury  has  not  modified  the  somewhat  fuzzy  nature 
of  this  concept. 

It  was  the  contribution  of  the  modern  elite 
theory — and  particularly  the  work  of  Gaetano 
Mosca,  Robert  Michels,  and  Vilfredo  Pareto — 
which  overwhelmed  the  traditional  Machiavellian 
associations  between  “leader”  and  “power,”  as 
well  as  the  vision  of  the  “great  man”  theory  of  his¬ 
tory.  In  the  context  of  elite  theory,  leadership  was 
conceived  of  as  a  behavioral  concept:  It  is  rela¬ 
tively  easy  to  individuate  leaders  as  persons  who 
exercise  control  over  the  behavior  of  others,  but 
their  capabilities  and  their  natures  vary  from  one 
case  to  another.  The  new  empiricism  developed 
within  the  social  sciences  at  the  dawn  of  the  20th 
century  brought  about  new  efforts  to  study  politi¬ 
cal  leadership  as  a  result  of  human  interactions. 
Since  then,  political  leadership  has  remained  a  core 
concept  in  political  science,  although  not  many 
works  have  explicitly  addressed  the  notion. 

Taking  into  consideration  some  selected  works 
about  the  nature  of  political  leadership  developed 
in  the  20th  century,  we  can  roughly  describe  the 
evolution  of  scientific  reflection  on  this  notion, 
identifying  three  historical  phases:  The  first  phase, 
developed  during  the  intrawar  period,  was 
particularly  fruitful  in  producing  theoretical  con¬ 
templations  about  the  nature  of  leadership  during 
the  critical  historical  scenario  of  the  so-called  first 
democratization — that  is,  the  process  of  changing 
to  a  democratic  regime  for  the  first  time.  The 
second  phase — in  the  second  half  of  the  20th 
century — was  characterized  by  a  significant 
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growth  of  behavioral  studies,  which  put  the  trans¬ 
formation  of  political  leadership  at  the  core  of  the 
empirical  analysis  of  politics.  The  current  phase  is 
dominated  by  attention  being  newly  paid  to  the 
problem  of  the  complexity  of  political  leadership 
in  a  different  geographical  area. 

From  Weber  to  Lasswell:  Classic  Reflections 
on  the  Nature  of  Political  Leadership 

All  the  theoretical  assessments  of  political  lead¬ 
ership  refer  to  the  well-known  tripartite  typol¬ 
ogy  developed  by  Max  Weber,  which  is  very 
often  labeled  as  an  ideal-typical  classification  of 
political  leadership — traditional,  legal-rational, 
and  charismatic  rule.  Actually,  this  celebrated 
piece  of  work  is  an  analysis  of  legitimated  rule — 
not  of  leadership.  Indeed,  Weber’s  influence  is 
due  to  his  intuition  about  the  growing  presence 
within  modern  leadership  of  peculiar  elements, 
which  determine  the  leader’s  capability  to 
“change  history.”  According  to  Weber,  charis¬ 
matic  authority  is  that  particular  feature  of  lead¬ 
ership  which  explains  the  development  of  single 
figures  and  new  styles  of  command,  justifying 
exceptional  changes  in  political  life.  Legal- 
rational  rule,  on  the  other  hand,  is  the  ideal  type 
of  continuity,  constraining  the  behaviors  of  the 
innovators.  Moving  from  this  argument,  the 
study  of  leadership  within  the  democratic  envi¬ 
ronment  has  been  expanded  all  along  the  course 
of  the  20th  century,  focusing  on  the  causal  links 
between  democratization,  crises,  transitions,  and 
types  of  legitimate  rule. 

If  Weber’s  work  is  a  crucial  step  in  the  analysis 
of  the  link  between  leaders  and  power,  other  classic 
social  scientists  of  the  20th  century  have  renovated 
the  study  of  the  link  between  political  leadership 
and  political  personality.  Probably  the  first  modern 
attempt  to  define  the  qualities  pertaining  to  politi¬ 
cal  leadership  can  be  found  in  the  work  of  Robert 
Michels,  who  critiques  and  illuminates  the  charac¬ 
ters  of  leaders  from  the  Platonic  tradition  to  the 
19th  century — initially  focusing  on  the  art  of  rheto¬ 
ric.  However,  it  was  Harold  Lasswell  who 
provided  an  important  basis  for  the  empirical  study 
of  leadership  behaviors.  According  to  Lasswell, 
whose  theories  had  been  initially  influenced  by 
classic  elite  theorists  and  by  Freud’s  psychoanaly¬ 
sis,  personality  is  a  crucial  variable  shaped  by  a 


system  of  interactions  between  the  values  of  the 
social  and  the  political  system  at  the  macrolevel 
and  the  cultural  and  social/familial  environment  at 
the  individual  level.  Although  Lasswell  was  never 
concerned  with  the  notion  of  leadership,  his  work 
had  much  influence  on  future  reflections  on  such  a 
notion,  and  his  call  for  a  more  attentive  study  of 
biographies  and  social  connections  of  political 
leaders  was  decisive  in  the  development  of  behav- 
ioralism. 

Empirical  Studies  in  Political  Leadership 

The  impetus  provided  by  Lasswell  for  an  analytical 
study  of  political  leadership  has  resulted  in  a  sig¬ 
nificant  growth  of  knowledge.  Especially  during 
the  1960s  and  the  1970s,  the  refinement  of  the 
comparative  method  and  awareness  that  a  nonpa- 
rochial  analysis  of  politics  was  necessary  paved  the 
way  for  accurate  explorations  of  the  manifold 
empirical  dimensions  of  political  leadership.  The 
use  of  behavioral  theories  and  support  from 
research  in  various  disciplines  of  the  social  sci¬ 
ences,  including  political  psychology,  sociology, 
and  history,  have  enabled  us  to  fathom  the  hidden 
dispositions  of  our  rulers.  The  works  of  James 
MacGregor  Burns  (1978),  grounded  in  the  notion 
of  transformational  leadership,  for  instance,  were 
fundamental  in  understanding  the  usefulness  of 
intensive  biographical  studies  aimed  at  creating 
taxonomies  of  leaders  based  on  their  different  capa¬ 
bilities  of  interaction.  In  the  same  period,  Glenn 
Paige  (1977)  put  the  notion  of  political  leadership 
at  the  center  of  a  complex  multidisciplinary  frame¬ 
work  that  explains  the  variance  in  the  behavior  of 
leaders  by  using  a  number  of  macro  variables, 
including  personality,  role,  organization,  values, 
and  capability  of  interaction. 

A  large  number  of  studies  were  then  under¬ 
taken  in  different  disciplinary  environments, 
focusing  on  different  definitions  of  political  lead¬ 
ership.  In  particular,  the  analytical  dimensions  of 
leader’s  ambition  and  leader’s  style  gained  imme¬ 
diate  relevance  for  the  new  generation  of  political 
scientists.  These  dimensions  of  variation  were  at 
first  studied  above  all  in  the  United  States. 
However,  the  seminal  works  of  scholars  like 
Arthur  Schlesinger  Jr.,  James  Barber,  and  Lewis 
Edinger  became  a  fundamental  starting  point  for 
a  discussion  that  soon  spread  to  Europe,  leading 
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to  new  important  studies  in  a  few  years.  More  or 
less  at  the  same  time,  institutional  analysis  consid¬ 
ered  the  relevance  of  leadership  positions  in 
assessing  the  different  nature  and  performance  of 
political  systems.  One  should  mention  here  that 
many  important  theories  were  developed  between 
the  1970s  and  the  1980s  on  the  nature  of  chief 
executives  and  the  institutional  environments  in 
which  they  act. 

Later,  works  on  political  leadership  were  focus¬ 
ing  on  different  aspects  of  the  phenomenon,  also 
including  the  relationships  between  leaders  and 
public  opinion.  Jean  Blondel  provides  a  general 
assessment  of  the  role  and  the  impact  of  political 
leaders  in  the  contemporary  world.  In  terms  of 
empirical  analysis,  this  work  suggests  a  new  typol¬ 
ogy  of  political  leaders  based  on  two  dimensions: 
the  extent  of  their  concern  with  maintenance  or 
change  in  the  society  and  the  scope  of  the  political 
message  they  want  to  introduce  in  the  political 
system.  Working  within  such  a  conceptual  grid,  it 
is  possible  to  assess  the  role  of  a  wide  set  of  leaders, 
from  the  anonymous  managers  (specialized  reform¬ 
ers  aiming  at  maintenance,  such  as  ministers  who 
deal  with  day-to-day  administration)  to  the  famous 
ideologues  of  the  20th  century — for  instance  Mao 
or  Hitler — who  carried  out  important  societal 
changes  with  a  strong  political  message. 

Leader  Democracy?  Transforming  the  Link 
Between  Representation  and  Leaders 

Decades  of  empirical  studies  have  provided  a  sig¬ 
nificant  understanding  of  the  dynamics  of  contem¬ 
porary  political  leadership.  However,  we  still  lack 
a  comprehensive  and  agreed-on  definition  of  what 
leadership  is,  and  scholars  are  still  divided  about 
the  correlation  between  (strong)  leadership  and 
(good)  democracy.  Moreover,  the  rapid  transfor¬ 
mation  of  democratic  politics  raises  a  number  of 
new  questions  about  the  meaning  of  political  lead¬ 
ership.  The  focus  seems  to  have  shifted  away  from 
the  interpretation  of  classic  puzzles,  like  the  dura¬ 
tion  of  leaders  and  measurement  of  their  charisma, 
to  the  extent  of  a  leader’s  role  in  policy  making, 
communication,  and  electoral  performance.  Pol¬ 
itical  theorists,  in  particular,  have  stressed  the 
trade-off  between  the  decline  (transformation)  of 
representative  democracy  and  the  emergence  of 
a  strong  leadership  influence  in  specific  policy 


sectors  and  processes.  Bernard  Manin  (1997) 
argues  that  the  partial  independence  of  representa¬ 
tives  has  been  reinforced — in  what  he  calls  audi¬ 
ence  democracy — by  an  electoral  link  built  on 
images  rather  than  programs.  This  would  increase 
the  capability  of  single  leaders  to  conduct  policy 
processes  and  shape  political  discourse. 

The  changing  link  between  representatives  and 
the  represented  is  also  seen  as  an  element  of  a 
broader  change,  connected  to  an  evolutionary  phe¬ 
nomenon  that  involves  political  processes  and  the 
functions  of  democracy.  According  to  Andras 
Koroseny,  the  meaning  of  political  representation 
tends  to  be  more  and  more  differentiated  from  the 
two  traditional  images  of  deliberation  and  mirror¬ 
ing  since  what  is  at  the  core  of  political  representa¬ 
tion  is  “acting  and  supplying  new  policies,  creating 
a  new  quality” — that  is  to  say,  leadership.  The 
selection  of  leaders  who  would  be  really  free  to 
make  changes  is  therefore  the  very  function  of 
democratic  rule.  Leaders  are  expected  to  represent 
the  people  in  a  qualitative  rather  than  in  a  mechan¬ 
ical  way.  In  empirical  terms,  a  lot  of  implications 
can  be  derived  from  such  a  statement:  They  refer, 
respectively,  to  the  spheres  of  democratic  institu¬ 
tions  and  political  discourse. 

Concerning  the  effects  of  leadership  in  terms  of 
institutional  change,  many  recent  studies  have  con¬ 
verged  to  support  the  argument  that  core  execu¬ 
tives  or  chief  executives  have  been  incrementally 
more  motivated  to  turn  specific  policies  in  new 
directions.  The  thesis  of  presidentialization  of 
politics  is  grounded  in  the  development  of  the  role 
of  guidance  and  coordination  among  a  number  of 
policymakers  who  are  much  more  evidently  subor¬ 
dinated  to  the  leader.  In  a  similar  way,  the  pro¬ 
cesses  of  vertical  governance  shift,  which  would 
determine  phenomena  such  as  globalization  or  the 
Europeanization  of  domestic  politics,  envisage  the 
emergence  of  more  room  for  maneuver  for  national 
leaders,  who  are  therefore  not  only  free  to  shape 
policy  processes  but  also  to  have  an  impact  on 
public  opinion. 

Finally,  an  open  question  that  lies  at  the  center 
of  a  vivid  discussion  concerns  the  influence  of  cur¬ 
rent  political  leaders  on  political  discourse  and 
electoral  outcomes.  Conventional  wisdom  suggests 
that  such  a  role  is  growing  due  to  the  effects  of  less 
well-identified  phenomena  such  as  “leaderization” 
and/or  personalization  of  politics.  However,  not 


Legalization  of  Policy  1411 


all  the  implications  of  this  conventional  view  are 
true.  According  to  Anthony  King  (2002),  demo¬ 
cratic  polities  are  not  necessarily  personality  ori¬ 
ented;  partisan  preferences  and  party  politics  still 
count — more  than  the  personal  features  of  a  party’s 
own  leaders — although  a  great  deal  of  variation 
can  be  seen  if  one  observes  the  electoral  outcomes 
of  a  number  of  political  leaders  in  advanced 
democracies.  Similarly,  a  high  degree  of  variation 
has  been  noted  by  Jean  Blondel  and  Jean-Louis 
Thiebault  (2009),  who  conducted  research  on 
11  recent  political  leaders.  According  to  them,  the 
relevance  of  personality  in  political  leadership  has 
increased  during  the  last  50  years,  balancing  the 
decline  in  importance  of  relevant  variables  such  as 
social  stratification,  class,  and  religious  affiliation. 
Personalization  of  leadership  is  therefore  a  real 
phenomenon,  although  its  impact  seems  to  be 
strongly  influenced  by  historical  and  country- 
specific  variables.  Therefore,  “personalized  leader¬ 
ship”  seems  to  differ  from  one  case  to  another. 

Luca  Verzichelli 
University  of  Siena 
Siena,  Italy 
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Legalization  of  Policy 


One  of  the  significant  elements  of  the  transforma¬ 
tion  of  governance  and  public  policy  across  the 


world  is  its  increasingly  juridical  character.  This 
process  of  legalization  refers  to  the  transfer  of  pub¬ 
lic  policy  making  to  judicial  institutions,  the  increas¬ 
ingly  juridical  and  rule-governed  processes  of  pol¬ 
icy  making,  the  central  role  of  administrative  law  in 
the  governance  of  public  policy,  and  the  growing 
importance  of  independent  regulatory  bodies  in 
areas  such  as  finance  and  health.  The  legalization 
of  public  policy  is  driven  by  an  increasing  fragmen¬ 
tation  of  interests  and  institutions  within  the  state; 
a  specific  strategy  of  core  executives  and  dominant 
social  and  political  coalitions  is  to  protect  their 
interests,  control  subordinate  authorities,  and  move 
contentious  issues  into  nonpolitical  arenas,  which 
is  often  a  strategy  promoted  by  international  orga¬ 
nizations  and  agents.  Legalization  brings  with  it  an 
array  of  new  actors — courts,  public  interests  law¬ 
yers,  and  regulatory  institutions — using  a  wide 
array  of  new  instruments  of  state  authority  such  as 
contracts  and  audits. 

What  is  intriguing  is  that  this  legalization  has 
gone  in  tandem  with  programs  of  market  reform 
across  the  developed  and  developing  world.  As 
noted  by  Kanishka  Jayasuriya,  market  reforms  have 
led  to  new  forms  of  regulatory  governance.  It  is 
clear  that  the  credit  crisis  of  2008  will  further  accel¬ 
erate  the  development  of  regulatory  governance. 
One  of  the  significant  features  of  this  regulatory 
governance  is  its  increasingly  juridical  character,  in 
that  it  is  rule  governed — relies  on  judicial,  quasi¬ 
judicial,  or  independent  agencies,  for  its  implemen¬ 
tation — and  uses  a  range  of  regulatory  techniques, 
which  include  the  use  of  standard-establishing 
procedures  for  the  operation  of  policy  and  the  use 
of  contractualism  and  audit. 

Mechanisms  of  Legalization 

Five  main  mechanisms  of  legalization  of  public 
policy  can  be  identified: 

1.  Legalization  has  been  driven  by  the  increasing 
fragmentation  of  institutions  as  well  as  actors 
providing  incentives  for  political  elites  to  legalize 
various  areas  of  public  policy.  It  does  this,  in  part, 
because  by  making  public  policy  rule  governed,  it 
places  the  core  executive  at  a  distance  from  the 
competing  interests  of  subordinate  institutions 
and  actors  in  areas  as  diverse  as  communications 
and  agriculture.  Even  in  authoritarian  systems, 
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legalization  has  become  a  desired  option:  For 
example,  the  Chinese  Communist  Party  now  faces 
pressures  from  competing  clients  on  issues  such  as 
environmental  and  economic  regulation.  In  these 
circumstances,  legalization  becomes  an  attractive 
option  because  it  moves  the  adjudication  of 
contentious  issues  to  a  seemingly  neutral  arena. 

2.  Legalization,  especially  through  the 
judicialization  of  politics  and  policy  making, 
provides  the  basis  on  which  the  threatened  elites 
and  vested  interests  can  protect  their  assets  from 
encroachment  by  assertive  political  forces  within 
the  democratic  political  process.  For  example,  the 
Thai  Constitution  of  2007,  which  further 
accelerated  judicial  policy  making,  reflected  the 
desire  of  sections  of  the  middle-class  nongovern¬ 
mental  organizations,  some  professional  groups, 
and  the  Thai  monarchy  to  protect  their  interests 
from  populist  forces  with  majority  electoral 
support. 

3.  Legalization  facilitates  a  process  of  meta¬ 
governance  that  allows  the  core  executive  to 
control  the  structural  process  of  governance  rather 
than  the  actual  conduct  of  policy.  Hence,  for 
example,  the  rules,  actors,  and  issues  of  network 
governance — an  important  tool  of  public  policy — 
are  controlled  by  higher  level  public  or  even 
private  agencies.  As  another  example  of  meta¬ 
governance,  take  the  case  of  the  Indonesian 
Constitutional  Court,  which  has  been  active  in 
asserting  control  over  various  aspects  of  local 
political  processes,  such  as  electoral  laws. 

4.  Legalization  has  been  driven  by  the  fact  that 
symbolic  and  cultural  political  disputes  cannot 
easily  be  resolved  by  the  cut  and  thrust  of  electoral 
politics,  which  are  amenable  to  distributive  politics. 
For  example,  the  protection  and  recognition  of 
multiculturalism  in  the  constitutionally  enshrined 
Canadian  Charter  of  Rights  and  Freedoms  has 
been  an  important  influence  in  both  policy 
formulation  and  implementation. 

5.  Legalization  has  been  driven  by  international 
forces,  which  increasingly  work  not  through  treaty 
of  hard  law  but  through  soft  forms  of  internal 
governmental  regulation.  Hence,  for  example, 
some  bilateral  trade  agreements  establish  investor 
mechanisms  that  legalize  what  otherwise  would  be 
in  the  domestic  arena  of  governance.  Similarly,  the 


advent  of  global  administrative  law — that  is,  the 
rules  and  regulations  of  global  governance  located 
within  national  policy-making  institutions — has 
served  to  legalize  areas  of  domestic  economic  and 
social  governance. 

Actors  in  the  Legalization  of  Policy 

Judicial  organizations  have  taken  on  a  key  role  in 
the  legalization  of  public  policy.  These  include  con¬ 
stitutional  or  appellate  courts  that  increasingly  set 
the  parameters  of  public  policy,  which  in  turn 
include  regional  courts  such  as  the  European  Court 
of  Justice  but  also  courts  such  as  the  Supreme 
Court  of  India.  All  these  have  become  assertive 
through  the  category  of  public  interest  litigation 
(PIL).  PIL  allows  courts  to  dilute  the  requirements 
of  locus  standi  and  allow  social  activists,  lawyers, 
and  even  letters  of  complaint  as  the  basis  for  the 
investigation  and  resolution  of  policy  and  political 
disputes  over  environmental  and  other  issues. 

New  institutions  have  been  set  up  to  oversee  the 
legalization  of  public  policy.  For  example,  the 
regulation  of  sport  has  become  increasingly  juridi¬ 
cal  through  the  complex  dispute  settlement  pro¬ 
cesses  in  the  International  Council  of  Arbitration 
for  Sport  (ICAS).  This  body  adjudicates  disputes 
over  issues  such  as  performance-enhancing  drugs 
or  the  observance  of  criteria  for  team  selections.  In 
addition  to  international  policies,  legalization  of 
policy  has  also  occurred  in  many  areas  at  the 
national  level.  For  instance,  in  1995,  with  the 
agreement  of  all  Australian  governments — federal, 
state,  and  territories — the  National  Competition 
Policy  in  Australia  has  articulated  competitive 
principles  to  be  applied  to  all  legislation.  These 
new  rules  were  to  be  enforced  by  a  new  indepen¬ 
dent  agency  called  the  National  Competition 
Council.  In  the  case  of  the  competition  policy, 
legalization  is  designed  to  ensure  economic  order 
and  competitiveness.  In  effect,  this  amounts  to  the 
subordination  of  regulatory  rule  making  to  eco¬ 
nomic  imperatives,  and  what  matters  are  not  the 
specific  rules  as  such  but  the  justification  of  policy 
decisions  framed  in  terms  of  economic  criteria. 

Legalization  of  public  policy  and  the  other  actions 
of  courts  require  the  existence,  mobilization,  and 
support  of  a  wide  range  of  lawyers,  activists,  and 
other  state  actors.  In  China,  the  functioning  of  the 
administrative  law  regime  requires  the  participation 
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of  lawyers  and  other  state  officials  to  give  the 
administrative  court  system  some  legitimacy. 
Legalization  of  public  policy  requires  the  embed¬ 
ding  of  judicial  institutions  within  a  complex  set  of 
relationships  between  those  actors  and  the  legal 
and  nongovernmental  organizations. 

There  has  been  a  shift  toward  autonomous 
dependent  state  agencies  that  are  independent  of 
the  political  executive.  This  is  best  illustrated  by 
the  increased  independence  of  central  banks  from 
political  institutions.  For  example,  the  Bank  of 
Korea  (BOK)  Act  effectively  entrenches  the  auton¬ 
omy  of  the  BOK  from  the  Ministry  of  Finance. 
Although  mechanisms  of  consultation  between  the 
ministry  and  the  BOK  have  been  established,  the 
executive  is  unable  to  impose  a  particular  course  of 
monetary  policy  on  the  monetary  board,  thereby 
diminishing  the  capacity  of  central  economic  agen¬ 
cies  to  direct  the  kind  of  industry  policies  followed, 
which  has  been  a  marked  feature  of  the  develop¬ 
mental  state. 

Instruments 

Rules  Versus  Discretionary  Policy 

Increased  independence  for  state  agencies  has 
been  accompanied  by  a  rule-based  rather  than  a 
discretionary  policy-making  process — often  in  a 
juridical  form.  Economic  governance  has  become 
increasingly  focused  on  the  implementation  of  rules 
(often  with  a  legal  basis)  that  govern  economic 
policy.  These  rules  are  legalized  to  the  extent  that 
they  stipulate  more  specifically  the  principles  or 
standards  to  be  applied,  making  the  observance  of 
these  standards  or  rules  mandatory,  and  delegate 
their  enforcement  to  an  independent  agency  or  dis¬ 
pute  resolution  mechanism. 

Even  in  what  once  was  the  paradigm  case  of 
discretionary  policy  making  in  East  Asia,  a  more 
assertive  legalism  has  emerged  in  public  policy. 
The  panoply  of  administrative  guidance  and 
executive  discretionary  power  of  the  develop¬ 
mental  state  has  given  way  to  the  flowering  of  a 
growing  and  assertive  legalism  and  constitution¬ 
alism  in  economic  and  political  decision  making. 
One  such  important  regulatory  institution  is  the 
Financial  Supervisory  Commission,  which  oper¬ 
ates  autonomously  and  has  been  given  consider¬ 
able  power  to  regulate  and  monitor  the  Korean 
financial  system. 


Proceduralism 

Legalization  is  often  directed  not  at  specifically 
influencing  the  behavior  or  conduct  of  individuals  or 
policy  but  rather  at  attempting  to  provide  the  frame¬ 
work  or  principles  through  which  policy  making  is 
undertaken.  In  this  respect,  governance  consists  of  a 
procedural  regulation  for  setting  the  broad  proce¬ 
dures  and  mandates  for  policy  making.  Through 
proceduralism,  autonomous  sites  of  governance 
come  to  be  constituted  not  by  direct  application  of 
state  law,  regulation,  or  authority  but  through  more 
indirect  means  that  enhance  or  generate  the  capaci¬ 
ties  of  these  sites  of  governance  for  self-regulation. 

Consequently,  these  processes  specify  proce¬ 
dures  to  be  followed  by  individuals  or  corporate 
bodies  complying  with  regulatory  rules.  For  exam¬ 
ple,  a  corporate  regulator,  such  as  the  Australian 
Competition  Council,  will  institute  compliance 
processes  within  the  corporate  entities  rather  than 
directly  intervene  to  enforce  rules.  The  objective 
of  the  agency  is  to  create  a  system  of  metagover¬ 
nance  to  monitor  the  extent  to  which  regulated 
entities  have  put  in  place  a  process  of  regulatory 
compliance. 

Contractualism  and  Audit 

A  key  instrument  of  legalization  is  through  a 
contract-like  relationship  between  public  agents, 
the  deliverers  of  policy  (i.e.,  those  who  implement 
policies),  and  clients.  The  agents  operating  these 
frameworks — be  they  individuals  or  local/national 
government,  civil  society  actors,  or  even  transna¬ 
tional  agencies — are  required  to  meet  certain  stan¬ 
dards  of  performance  and  conduct  as  a  condition 
for  the  receipt  of  a  grant  or  a  benefit.  What  is  more, 
the  performances  of  these  standards  are  monitored, 
and  there  are  specified  mechanisms  of  enforcement. 
For  instance,  employment  and  welfare  programs 
in  the  UK  provide  an  excellent  illustration  of  con¬ 
tractualism.  They  exemplify  governing  through 
contracts  either  directly  with  individual  welfare 
recipients  or  with  private  providers  of  public  ser¬ 
vices.  In  fact,  this  enrollment  of  private  individuals 
or  agencies  in  a  pervasive  system  of  regulatory  gov¬ 
ernance  is  one  of  the  distinguishing  elements  of 
contractual  governance  of  public  policy. 

One  of  the  consequences  of  this  contractualist 
governance  is  the  proliferation  of  audits,  such  as 
quality  assurance  audits  in  public  education  and 
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health.  Many  of  these  audits  and  auditing  agencies 
have  statutory  powers.  As  private  agencies  or  pub¬ 
lic-private  partnerships  can  take  on  enhanced 
governance  functions,  they  are  subject  to  increas¬ 
ingly  stringent  audit  and  inspections  mechanisms. 
In  other  words,  the  firm  or  the  university  itself 
becomes  the  site  of  self-regulation.  Far  from  being 
in  retreat,  this  contractual  governance  reaches 
deeper  into  civil  society  than  was  the  case  during 
the  peak  of  the  postwar  welfare  state. 

Conclusion 

The  distinguishing  feature  of  the  emerging  forms 
of  regulatory  governance  is  that  that  challenges 
our  conventional  understanding  of  public  gover¬ 
nance  and  accountability.  This  new  governance 
turns  away  from  the  traditional  command-and- 
control  system  of  regulation  by  public  administra¬ 
tion  agencies  in  favor  of  independent  agencies  and 
judicial  authorities  using  new  forms  of  regulatory 
techniques  that  are  increasingly  juridical  in  charac¬ 
ter.  Political  analysts  are  now  obliged  to  pay 
greater  attention  to  the  ways  in  which  politics  are 
played  out  in  these  various  regulatory  arenas — be 
they  in  monetary  policy  or  corporate  governance. 
This  highlights,  above  all,  the  need  for  a  more 
rigorous  analysis  of  the  new  kinds  of  actors,  insti¬ 
tutions,  and  arenas  that  have  been  unleashed  by 
the  legalization  of  public  policy. 
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Legitimacy 


The  legitimacy  of  any  system  of  power,  including 
that  of  its  individual  power  holders,  lies  in  the 
degree  to  which  it  is  acknowledged  as  rightful, 
both  by  those  involved  with  and  subject  to  it  and 
by  third  parties  whose  support  and  recognition  it 
may  depend  on.  Although  the  definition  of  legiti¬ 
macy  as  rightful  power  or  authority  is  a  widely 
agreed-on  one,  much  else  about  the  subject  is 
strongly  contested: 

How  significant  is  legitimacy  to  the  maintenance 
and  effectiveness  of  power  relations? 

Who  are  the  key  audiences  for  legitimacy  claims? 

What  exactly  is  it  that  makes  power  rightful? 

How  is  legitimacy  created  and  maintained? 

What  are  the  key  features  differentiating  the 
legitimacy  of  different  political  systems,  and 
what  are  their  respective  strengths  and 
vulnerabilities? 

This  entry  treats  these  main  points  of  disagreement 
in  turn. 

A  useful  starting  point  is  to  distinguish  between 
a  philosophical  and  a  social-scientific  approach  to 
the  study  of  legitimacy.  Throughout  human  history, 
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those  occupying  positions  of  power,  and  especially 
political  power,  have  sought  to  ground  their  author¬ 
ity  in  a  principle  of  legitimacy,  which  shows  why 
their  access  to,  and  exercise  of,  power  is  rightful 
and  why  those  subject  to  it  have  a  corresponding 
duty  to  obey.  These  claims  have  been  elaborated  by 
apologists  in  narrative  form  and  by  iconographers 
in  pictorial  and  artistic  representations.  However, 
where  the  possession  or  exercise  of  power  has  been 
challenged  or  contested,  critical  reflection  and 
argument  about  what  makes  power  rightful 
has  taken  place.  It  has  usually  been  the  task  of 
philosophers  to  elaborate  such  reflection  into  a 
considered  theory  or  theories  and  to  test  legitimacy 
claims  against  accepted  standards  of  normative 
validity  and  discursive  argument.  From  at  least  the 
time  of  the  ancient  Greeks  onward,  the  study  of 
legitimacy  has  been  central  to  the  practice  of 
political  philosophy,  through  its  analysis  of  nor¬ 
mative  principles  of  the  right  and  the  good. 

The  study  of  legitimacy  as  a  subject  for  political 
science,  by  contrast,  is  comparatively  recent,  dat¬ 
ing  from  the  early  20th  century.  In  contrast  to 
political  philosophy,  its  focus  is  more  empirical 
than  normative,  more  on  processes  and  outcomes 
than  on  abstract  principles.  Its  concern  is  less  with 
the  reasoned  validity  of  legitimacy  claims  than 
with  the  degree  of  their  recognition  by  the  relevant 
social  agents  and  with  the  consequences  that  fol¬ 
low  from  that  recognition  for  the  stability  of  a 
system  of  power  or  rule  and  for  the  manner  in 
which  it  is  organized.  It  was  the  German  sociolo¬ 
gist  Max  Weber  who  was  first  responsible  for 
elaborating  a  social-scientific  account  of  legitimacy 
and  for  exploring  its  significance  for  power  rela¬ 
tions  in  his  work  Economy  and  Society.  Most  of 
the  key  debates  in  political  science  since  then  have 
stemmed  from  Weber’s  work,  and  this  entry  will 
return  to  it  at  a  number  of  points. 

The  Significance  of  Legitimacy 
for  the  Exercise  of  Power 

According  to  Weber,  wherever  power  holders  are 
acknowledged  as  legitimate  or  rightful,  they  can 
count  on  those  subordinate  to  them  obeying  their 
commands  and  following  their  instructions  with¬ 
out  the  widespread  use  of  coercion  or  the  constant 
fear  of  disobedience  or  subversion.  This  is  because 
people  will  recognize  a  duty  to  obey  and  not 


merely  an  interest  or  advantage  in  doing  so.  As 
Weber  argued,  motives  of  personal  advantage, 
solidarity,  custom,  or  whatever  are  not  a  suffi¬ 
ciently  reliable  basis  for  power  relations.  In  addi¬ 
tion,  there  is  normally  a  further  element — the  belief 
in  legitimacy.  Every  system  of  power,  therefore, 
attempts  to  establish  and  cultivate  the  belief  in  its 
legitimacy  and  to  demonstrate  a  moral  authority 
going  beyond  any  purely  coercive  capacity. 

While  most  later  authors  agree  on  the  signifi¬ 
cance  of  legitimacy  for  the  reliability  and  durabil¬ 
ity  of  a  system  of  power,  they  are  divided  on  the 
question  of  who  constitutes  the  key  audience  for 
legitimacy  claims.  If  we  distinguish  between  the 
key  power  holders,  their  enforcement  and  admin¬ 
istrative  agents,  and  the  wider  population  that  is 
subordinate  to  them,  then,  one  school  of  thought 
holds  that  a  belief  in  legitimacy  is  necessary  only 
for  the  first  two  of  these.  It  is  the  powerful  who 
most  need  to  believe  in  their  own  rightfulness  if 
they  are  to  have  the  self-confidence  to  rule  effec¬ 
tively,  and  their  administrative  staff  must  believe 
likewise  if  they  are  to  be  effective  and  reliable 
enforcers  of  their  superiors’  policies.  In  this  view, 
it  matters  little  what  wider  strata  of  the  subordi¬ 
nate  believe  since  it  is  primarily  a  recognition  of 
their  own  impotence  in  the  face  of  the  powerful — a 
sense  of  “dull  compulsion,”  not  any  sense  of 
duty — that  typically  keeps  them  in  line.  By  the 
same  token,  it  is  only  when  those  in  power,  or 
their  administrative  staff,  come  to  lose  confidence 
in  their  own  legitimacy  that  the  power  structure 
comes  to  be  critically  weakened.  And  it  is  divisions 
within  a  ruling  elite,  not  the  loss  of  legitimacy 
among  a  wider  public,  that  holds  the  key  to  the 
decline  and  fall  of  regimes. 

Now,  it  is  certainly  true,  and  Weber  acknowl¬ 
edged  as  much,  that  both  historically  and  in  the 
contemporary  world,  we  can  point  to  examples  of 
power  structures  where  the  obedience  of  those 
subject  to  them  has  been  maintained  simply 
through  coercion  and  a  sense  of  their  own  impo¬ 
tence.  However,  to  limit  the  significance  of  legiti¬ 
macy  merely  to  those  who  exercise  power  and 
administer  it  can  be  challenged  on  two  grounds. 
First,  where  power  over  a  population  depends  on 
coercion  alone,  it  requires  a  costly  apparatus  of 
enforcement  and  surveillance  to  maintain  it, 
extending  to  the  agents  of  enforcement  themselves, 
who  may,  as  in  the  late  Roman  Empire,  simply  sell 
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themselves  to  the  highest  bidder.  Moreover,  the 
system  of  power  now  has  only  one  line  of  defense — 
that  of  force — and  it  can  therefore  collapse  very 
rapidly  if  the  coercion  is  insufficient  or  if  people 
believe  that  those  in  power  have  lost  the  capacity 
or  the  will  to  use  it.  Once  Premier  Mikhail 
Gorbachev  made  clear  that  the  former  USSR 
would  no  longer  intervene  militarily  in  Eastern 
Europe,  the  writing  was  on  the  wall  for  the 
Communist  regimes  there  since  their  loss  of  inter¬ 
nal  legitimacy  meant  that  they  were  only  kept  in 
place  by  the  threat  of  Soviet  invasion. 

So  first,  then,  a  wider  acknowledgment  of  legit¬ 
imacy  enhances  the  stability  of  a  system  of  power, 
as  well  as  its  economy  of  operation,  through  the 
recognition  of  the  subordinates  that  they  have  a 
corresponding  duty  to  obey  and  cooperate  with  it 
and  that  they  do  not  merely  have  a  contingent 
interest  in  doing  so.  People  expect  power  holders 
to  meet  certain  moral  or  normative  conditions  in 
their  access  to  and  exercise  of  power,  and  the  loss 
of  moral  authority  consequent  on  their  failure  to 
meet  these  can  lead  to  widespread  disobedience  or 
resistance,  reaching  into  the  coercive  agencies 
themselves.  It  would  be  a  mistake  therefore  to  see 
a  regime  crisis  as  originating  in  divisions  within  the 
ruling  apparatus  alone  and  not  in  the  attitudes  and 
behavior  of  a  wider  population.  The  collapse  of  the 
Shah  of  Iran’s  regime  in  1979  is  a  particularly  spec¬ 
tacular  example  of  the  erosion  of  an  army’s  loyalty 
in  the  face  of  repeated  mass  protests,  resulting 
from  the  regime’s  loss  of  moral  authority  in  the 
eyes  of  the  population  at  large. 

A  second  reason  why  acknowledgment  of  a 
power  regime’s  legitimacy  by  those  subordinate  to 
it  is  important  is  that  those  in  power  are  much  more 
able  to  achieve  their  purposes  when  these  depend 
on  the  willing  cooperation  of  those  subject  to  them. 
Once  again,  the  exceptions  or  limiting  cases  prove 
the  rule.  For  example,  there  have  been  purely  coer¬ 
cive  labor  regimes,  such  as  slave  societies  or  early 
industrial  capitalism,  where  the  subordinates  were 
regarded  as  objects  (e.g.,  chattels,  hands),  readily 
dispensable  for  new  recruits,  and  the  quality  of  their 
cooperation  did  not  greatly  matter.  In  these  cases, 
the  elaborate  justifications  constructed  to  legitimate 
these  arrangements  were  not  intended  for  those 
subjected  to  them  but  for  the  wealthy  and  powerful 
themselves  and  for  the  other  elite  audiences  whose 
cooperation  they  might  need. 


However,  the  more  a  power  structure  is  depen¬ 
dent  on  a  settled  subordinate  population  for  the 
achievement  of  its  purposes,  and  especially  where 
the  quality  of  the  performance  matters,  the  more 
essential  it  is  that  the  relationship  is  constructed 
according  to  an  acknowledgment  of  reciprocal 
rights  and  duties,  such  as  that  that  only  a  principle 
of  legitimacy  can  provide.  This  entails  that  the 
power  relation  is  defined  as  a  normative  and  not 
just  a  de  facto  one  and  that  this  definition  sets  lim¬ 
its  to  the  scope  of  the  powerful  as  well  as  to  the 
obligations  of  the  subordinate.  Where  the  legiti¬ 
macy  of  the  powerful  is  widely  acknowledged,  to 
that  extent  will  the  powerful  be  able  to  rely  on  the 
cooperation  of  the  subordinate  necessary  to  achieve 
their  purposes.  This  issue  is  particularly  pertinent 
to  the  modern  state,  which  requires  those  subject  to 
its  authority  not  only  to  obey  its  laws  but  also  to 
pay  their  taxes,  cooperate  with  its  policies,  and 
even  to  fight  in  its  defense. 

Some  simple  examples  will  serve  to  demonstrate 
the  point.  No  one  likes  paying  taxes.  But  it  makes 
an  enormous  difference  to  a  system  of  tax  collec¬ 
tion  if  people  acknowledge  the  right  of  the  state  to 
tax  them  and  accept  the  system  as  broadly  fair. 
Then,  the  vast  majority  will  pay  up  without  resis¬ 
tance.  Naturally,  the  administrative  arrangements 
will  have  to  be  efficient,  and  there  will  have  to  be 
compulsion  at  the  margin  to  deal  with  backsliders 
and  to  convince  the  rest  that  there  are  no  free  rid¬ 
ers.  But  a  state  where  people  recognize  no  duty  to 
pay  taxes  will  have  to  engage  in  enormously 
expensive  systems  of  enforcement,  which  will  sub¬ 
stantially  reduce  the  overall  take  and  may  even 
compromise  the  capacity  to  raise  taxes  altogether. 
It  follows  that  the  effectiveness  and  the  legitimacy 
of  a  system  of  power  are  not  unconnected  features, 
as  many  writers  have  supposed.  This  is  because  the 
capacity  of  political  authorities  is  also  dependent 
on  their  moral  authority  or  standing  among  those 
whose  cooperation  is  required  for  them  to  achieve 
their  purposes. 

Another  example  we  could  take  is  that  of  polic¬ 
ing.  Where  policing  is  based  on  consent,  that  is, 
where  the  police  are  recognized  as  broadly  working 
for  the  benefit  of  the  population  as  a  whole,  treat¬ 
ing  people  fairly  and  with  restraint,  and  where  there 
is  confidence  in  the  system  of  complaint  and  redress 
in  the  event  of  abuse,  then,  people  will  cooperate 
with  the  police  in  the  prevention,  reporting,  and 
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investigation  of  crime.  To  that  extent,  the  police  will 
be  enabled  to  carry  out  their  function  effectively.  Or 
take  the  mundane  example  of  a  classroom  teacher. 
If  pupils  do  not  share  a  belief  in  the  purpose  or 
value  of  education,  on  which  the  justification  of  the 
teacher’s  authority  is  based,  he  or  she  will  have  to 
devote  correspondingly  greater  energies  to  main¬ 
taining  order  than  to  teaching.  To  that  extent,  the 
purposes  for  which  power  is  held  will  not  be 
achieved,  and  this  may  lead  in  turn  to  a  further  ero¬ 
sion  of  legitimacy. 

Enhanced  order,  stability,  and  effectiveness — 
these  are  the  typical  strengths  of  a  legitimate  system 
of  power  as  a  result  of  the  obligations  on  subordi¬ 
nates  that  follow  from  their  recognition  of  its 
legitimacy.  As  already  mentioned,  there  may  indeed 
be  examples  of  power  systems  where  the  audience 
for  legitimation  claims  is  confined  to  the  powerful 
themselves  and  their  immediate  associates,  and 
legitimacy  is  needed  only  for  their  internal  self¬ 
belief  and  cohesion  in  the  face  of  a  population 
compelled  to  brute  obedience.  Yet  these  are  only 
limiting  cases.  More  typically,  legitimacy  matters 
for  both  the  rulers  and  the  ruled,  for  power  holders 
and  those  subordinate  to  them,  and  their  relation¬ 
ship  is  constructed  as  a  normative  and  not  merely 
a  de  facto  one,  with  legitimating  norms  establishing 
mutual  ties  of  obligation  and  performance  between 
them. 

A  final  potential  audience  for  legitimacy  claims 
should  be  mentioned  here — that  is,  third  parties 
who  stand  outside  a  given  system  of  power  but  on 
whose  recognition  and  cooperation  it  may  depend. 
It  is  difficult  to  generalize  about  these  since  con¬ 
texts  vary  so  widely.  Yet  it  can  happen  that  a 
refusal  by  an  outside  body  to  recognize  a  given 
power  as  legitimate  can  have  significant  repercus¬ 
sions  for  it  internally,  especially  when  withdrawal 
of  necessary  cooperation  or  some  other  sanction 
follows.  For  example,  the  international  sanctions 
imposed  on  the  apartheid  regime  in  South  Africa 
because  of  its  violation  of  internationally  estab¬ 
lished  norms  against  race  discrimination  made 
their  own  contribution  to  the  loss  of  self-confidence 
on  the  part  of  the  regime  and  to  the  success  of  the 
internal  liberation  struggle.  The  role  of  interna¬ 
tionally  accepted  norms  in  sustaining  or  eroding 
the  legitimacy  of  states  or  their  governments  in  the 
contemporary  world  will  be  discussed  more  fully 
later  on. 


What  Exactly  Makes  Power  Legitimate? 

According  to  Max  Weber,  what  makes  power 
legitimate  in  a  social-scientific  analysis  is  not  what 
meets  the  political  philosopher’s  independently 
validated  principles  of  rightful  authority  but  what 
is  accepted  by  the  relevant  agents  as  legitimate  in 
their  context.  Legitimacy  for  the  social  scientist  is 
simply  the  belief  in  legitimacy  since  it  is  this  that 
determines  people’s  behavior.  Moreover,  legiti¬ 
mating  beliefs  will  vary  enormously  from  one  his¬ 
torical  period  or  societal  type  to  another.  We  may 
regard  the  divine  right  of  kings  as  philosophically 
groundless,  but  the  task  of  the  social  scientist  is  to 
understand  how  people  came  to  believe  it,  how  the 
belief  was  reproduced  and  disseminated,  and  by 
whom  and  what  followed  from  it  for  how  a  given 
system  of  rule  was  organized  and  for  how  people 
related  to  it.  In  other  words,  the  critique  of  ideas 
has  to  give  way  to  a  sociology  of  knowledge  if  we 
are  to  explain  the  key  features  of  any  given  system 
of  power  that  we  wish  to  study. 

Some  philosophers  have  objected  to  what  they 
see  as  a  completely  reductionist  definition  of  legit¬ 
imacy,  equating  it  with  the  purely  subjective  belief 
in  legitimacy;  to  the  dismissal  of  ideas  as  simply 
ideology,  myth,  ruling  formula,  and  so  on;  and  to 
the  substitution  of  the  study  of  the  origin,  repro¬ 
duction,  and  dissemination  of  beliefs  for  a  critical 
analysis  of  their  content.  The  implication  seems  to 
be  that  people  can  be  brought  to  believe  almost 
anything,  however  preposterous,  by  those  who 
control  the  production  and  dissemination  of  ideas. 
Should  we  then  treat  power  as  legitimate  simply 
because  enough  people  have  by  one  means  or 
another  come  to  believe  that  it  is? 

Now,  it  is  true  that  a  social-scientific  analysis  of 
ideas  has  to  start  from  a  relativizing  standpoint 
and  that  a  historian  or  political  scientist  can  only 
understand  the  ideas  of  a  different  age  or  society 
to  the  extent  that  he  or  she  stands  back  from  his 
or  her  own  beliefs  and  the  assumption  of  their 
self-evident  truth.  Yet  such  an  understanding  of  a 
different  set  of  beliefs  is  only  possible  through  a 
careful  analysis  of  their  content  and  internal  logic 
as  well  as  of  the  circumstances  of  their  production 
and  dissemination.  In  the  case  of  legitimacy, 
despite  all  the  historical  variability  of  legitimating 
ideas,  it  can  be  shown  that  they  all  share  a  com¬ 
mon  and  distinctive  threefold  structure  to  their 
content.  Power,  and  especially  political  power,  is 
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recognized  as  legitimate  to  the  extent  that  the  fol¬ 
lowing  conditions  are  satisfied: 

1.  It  is  acquired  and  exercised  according  to 
established  rules.  This  is  the  level  of  legality, 
whether  or  not  the  rules  take  an  informal, 
conventional,  or  explicitly  legal  form.  On  its 
own,  however,  legality  is  insufficient  since  the 
rules  have  to  be  underpinned  by  normative 
principles  that  justify  them.  Hence,  we  also  have 
the  following. 

2.  The  rules  of  power  are  justifiable  according  to 
accepted  beliefs  about  (a)  the  rightful  source  of 
authority,  which  determines  who  is  qualified  to 
exercise  power  and  how  they  are  appointed  and 
(b)  the  proper  ends  or  purposes  of  power  and 
standards  in  its  exercise.  This  is  the  level  of 
normative  justifiability.  There  is  a  further 
dimension  still,  however,  and  that  is  the 
following. 

3.  Positions  of  authority  are  publicly 
acknowledged  by  relevant  subordinates  through 
actions  that  confirm  their  acceptance  and 
recognition  of  it.  This  is  the  level  of 
legitimation. 

These  three  levels  are  not  alternatives  since  all 
contribute  to  legitimacy;  together  they  provide  the 
subordinate  with  moral  grounds  for  compliance 
or  cooperation  with  authority.  The  fact  that  all 
are  required  can  be  shown  by  the  different  nega¬ 
tive  words  used  to  express  the  different  ways  in 
which  power  may  lack  legitimacy.  If  there  is  a 
manifest  breach  of  the  rules,  we  use  the  term  ille¬ 
gitimacy.  If  the  rules  are  only  weakly  supported  by 
societal  beliefs  or  are  deeply  contested,  we  can 
talk  of  a  legitimacy  deficit.  If  consent  or  recogni¬ 
tion  is  publicly  withdrawn  or  withheld,  we  speak 
of  delegitimation. 

If  we  take  the  subject  of  political  power,  the 
most  extreme  example  of  illegitimacy  is  usurpation 
or  coup  d’etat — power  attained  in  violation  of  the 
rules.  Examples  of  legitimacy  deficit  are  enor¬ 
mously  varied,  from  situations  where  changing 
societal  beliefs  leave  existing  institutional  arrange¬ 
ments  unsupported  or  where  people  have  widely 
diverging  beliefs — say,  about  which  state  they 
should  belong  to — to  situations  where  government 
is  chronically  unable  to  meet  the  basic  purposes, 


such  as  welfare  or  security,  which  people  expect  it 
to.  Legitimacy  deficits  usually  only  become  critical 
when  some  performance  failure  of  government 
exposes  a  fundamental  doubt  about  its  rightful 
source  of  authority.  Examples  of  delegitimation 
include  acts  of  widespread  public  opposition  to  a 
regime,  of  which  revolutionary  mobilization  is  the 
most  extreme  example.  Revolutions  follow  a  typi¬ 
cal  course  from  chronic  legitimacy  deficit  of  the 
regime  (doubtful  or  disputed  source  of  authority 
compounded  by  performance  failure),  through  its 
delegitimation  by  mass  oppositional  mobilization, 
which  splits  the  governing  apparatus,  to  an  ille¬ 
gitimate  seizure  of  power,  which  heralds  its  recon¬ 
struction  under  a  new  set  of  legitimating  rules  and 
principles. 

The  different  levels  or  dimensions  of  legitimacy 
outlined  above  constitute  only  the  most  general  or 
abstract  framework,  the  specific  content  of  which 
has  to  be  filled  in,  so  to  speak,  for  each  historical 
society  or  example  of  rule.  They  provide  a  heuris¬ 
tic  tool  to  guide  analysis.  Is  political  authority 
valid  according  to  the  rules?  The  relevant  rules 
have  to  be  identified,  their  conventional  or  legal 
form  established,  the  mode  of  adjudication  appro¬ 
priate  to  them  determined  for  the  given  context, 
and  so  on.  Are  the  rules  justifiable  in  terms  of  the 
beliefs  and  norms  of  the  particular  society,  and  are 
these  norms  relatively  uncontested?  We  need  to 
examine  the  specific  beliefs  current  in  the  society 
about  the  rightful  source  of  authority,  on  the  one 
hand  (divine  approval,  dynastic  inheritance,  privi¬ 
leged  knowledge,  ethnic  membership,  the  people, 
etc.),  and  the  proper  ends  of  government,  on  the 
other  (conquest,  security,  welfare,  salvation,  or 
whatever),  all  of  which  assume  the  fulfillment  of  a 
general  interest  beyond  the  narrow  interests  of 
those  exercising  power.  Are  there,  finally,  actions 
expressive  of  consent  or  public  affirmation  on  the 
part  of  those  qualified  to  give  it?  Who  counts  as 
qualified  and  what  actions  count  as  appropriate 
(e.g.,  oaths  of  allegiance,  acclamation,  and  elec¬ 
tion)  will  be  determined  by  the  conventions  of  the 
given  society  or  system  of  power.  Underlying  all 
these  historical  differences,  there  is  a  common 
structure  to  legitimacy  to  guide  analysis. 

This  common  structure  of  legitimacy  is  not  arbi¬ 
trary  or  accidental  since  each  component  relates  to 
a  different  aspect  of  power  that  is  socially  prob¬ 
lematic.  All  power  over  others,  and  especially 
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political  power  with  its  access  to  the  means  of  vio¬ 
lence,  is  potentially  disruptive  of  and  intrusive  into 
people’s  lives  and  well-being.  Therefore,  its  acquisi¬ 
tion  and  exercise  cries  out  to  be  rule  governed  and 
located  in  the  most  imposing  source  of  authority 
acknowledged  by  the  society.  Since  those  who  exer¬ 
cise  power  enjoy  great  status  and  privilege,  they 
have  to  be  shown  to  merit  it  and  to  use  it  to  serve  a 
more  general  interest  than  merely  their  own  advan¬ 
tage.  Finally,  since  power  entails  the  capacity  to  get 
others  to  do  what  otherwise  they  would  not  choose 
to  do,  the  loss  of  freedom  involved  is  validated  by 
public  acts  of  acknowledgment,  which  serve  to  bind 
in  subjects  and  subordinates  to  obedience.  These 
are  the  elements,  therefore,  which  combine  to  make 
power  rightful  or  legitimate. 

The  recognition  that  legitimacy  is  multidimen¬ 
sional  and  a  knowledge  of  what  these  dimensions 
are  moves  social  science  away  from  treating  the 
subjective  belief  in  legitimacy  as  if  it  were  totally 
content  free — that  just  any  belief  will  do — and 
closer  to  the  philosopher’s  discursive  analysis  of 
the  content  of  ideas.  In  particular,  it  shows  how 
social  science  and  political  philosophy  share  a 
common  subject  of  enquiry,  with  recognizably 
common  features.  Yet  a  crucial  difference  remains 
between  them,  as  already  indicated.  For  the  social 
and  political  scientist,  legitimating  ideas  and  prac¬ 
tices  derive  their  force  from  the  beliefs,  conven¬ 
tions,  and  procedures  current  in  and  specific  to  a 
given  historical  society  or  political  system.  And  the 
focus  of  enquiry  is  always  an  empirical  one:  How 
are  legitimating  ideas  reproduced  and  dissemi¬ 
nated?  Through  what  agency?  What  effects  do 
they  have  and  on  whom?  Above  all,  what  makes 
them  credible  in  the  given  social  context,  and  why, 
if  they  do  so,  do  they  come  to  lose  their  credibility? 

In  answering  these  questions,  we  need  to  get 
away  from  the  view  that  the  only  processes  that 
require  examination  are  those  involving  the  ideol¬ 
ogists,  who  develop  and  disseminate  ideas,  and  the 
institutions  that  reproduce  them.  Of  course  these 
are  important.  But  we  also  need  to  examine  the 
way  in  which  the  very  structuring  of  power  makes 
certain  legitimating  ideas  credible  or  believable.  It 
is  a  notable  feature  of  power  relations  that  they  are 
themselves  capable  of  generating  the  evidence 
required  for  their  own  legitimation. 

Two  recurrent  ways  in  which  this  occurs  can  be 
identified.  One  is  the  way  in  which  the  evidence  of 


superiority  and  inferiority,  which  justifies  the  dif¬ 
ference  of  fates  and  inequalities  of  condition 
between  dominant  and  subordinate,  is  itself  largely 
the  product  of  this  condition.  Those  who  are 
excluded  from  key  positions,  activities,  or  resources 
are  thereby  denied  the  opportunity  to  acquire  or 
demonstrate  the  capacities  and  characteristics 
appropriate  to  their  occupation  and  exercise,  so 
justifying  their  subordinate  position.  Throughout 
history,  the  exclusion  of  women  from  privileged 
positions  or  activities  monopolized  by  men  has 
prevented  them  from  acquiring  the  capacities  nec¬ 
essary  for  their  occupancy  or  exercise.  Their  infe¬ 
riority  thus  appears  naturally  rather  than  socially 
constructed,  serving  to  justify  their  exclusion  in  a 
self-fulfilling  manner. 

Second,  there  is  the  way  in  which  the  idea  that 
the  powerful  serve  a  general  interest  gains  credibil¬ 
ity  from  the  structuring  of  power  itself.  Once  some 
necessary  social  resource  or  activity  comes  to  be 
controlled  by  a  particular  group,  it  follows  that  the 
interests  of  society  at  large  can  only  be  met  through 
satisfying  the  interests  of  that  group  and  on  terms 
acceptable  to  them.  Those  who  have  historically 
controlled  the  means  of  production  or  subsistence, 
by  violence  or  administration,  have  been  in  a  posi¬ 
tion  to  ensure  that  general  needs  for  welfare, 
employment,  or  security  could  only  be  met  through 
the  power  relations  that  simultaneously  secured 
their  own  privileges.  To  this  extent,  their  claims  to 
serve  the  general  interest  have  had  an  evidential 
basis  deriving  from  the  structure  of  power  itself. 

We  could  take  today’s  international  bankers  as 
an  example.  To  be  sure,  their  claims  to  legitimacy 
have  had  an  ideological  basis,  in  the  self-regulating 
market  doctrine  that  has  justified  their  activities 
becoming  less  and  less  rule  governed — a  doctrine 
with  its  exponents  in  the  economics  departments 
of  universities  and  its  disseminators  among  politi¬ 
cians  as  well  as  among  the  bankers  themselves.  Yet 
the  very  fact  that  the  economy  of  every  society 
depends  on  the  credit  that  only  banks  supply  has 
given  credibility  to  the  claim  that  their  activities 
together  serve  an  essential  public  interest.  And  the 
very  complexity  and  global  reach  of  their  opera¬ 
tions  have  reinforced  the  idea  that  they  are 
uniquely  qualified  to  understand  and  manage  the 
processes  involved  and  that  their  indispensability 
justifies  the  enormous  salaries  and  bonuses  to 
which  they  lay  claim. 
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Of  course,  the  near  collapse  of  the  banking  sys¬ 
tem  in  the  credit  crunch  of  2008  demonstrated  that 
much  of  this  legitimating  fabric  was  illusory.  The 
free  market  was  exposed  as  far  from  self-regulating, 
many  of  the  bankers’  activities  were  shown  to  be 
not  merely  socially  useless  but  also  socially  and 
economically  damaging,  and  their  claim  to  possess 
an  esoteric  knowledge  available  only  to  the  initi¬ 
ated  was  punctured  by  the  revelation  that  even 
they  did  not  understand  the  complex  instruments 
in  which  they  were  trading.  They  suffered  a  legiti¬ 
macy  crisis  of  epic  proportions.  What  remains  to 
be  explained  is  why  the  active  process  of  popular 
delegitimation  that  typically  follows  such  a  crisis 
did  not  fully  materialize.  Apart  from  a  few  broken 
windows,  no  bankers’  houses  were  burnt  down; 
nor  were  any  bankers  strung  up  from  the  lamp- 
posts.  One  reason  may  be  that  the  institutions  in 
which  they  work  were  difficult  to  penetrate. 
Another  may  be  that  the  governments  which,  as 
third  parties,  endorsed  the  legitimacy  of  the  bank¬ 
ers’  activities  and  rewards,  proved  to  be  more 
accessible  targets  of  popular  anger,  whether 
through  mass  mobilization,  as  in  Iceland;  through 
the  sanction  of  electoral  process,  as  in  the  United 
States;  or  through  the  public  humiliation  of  parlia¬ 
ment  in  an  expenses  scandal,  as  in  the  United 
Kingdom  (UK).  Yet  the  cost  of  the  bankers’  insula¬ 
tion  from  the  full  impact  of  popular  anger  and  of 
their  avoidance  of  a  thorough-going  process  of 
delegitimation  was  shown  in  the  all  too  rapid 
return  to  business  as  usual  and  in  the  displacement 
of  the  costs  of  the  crisis  from  the  financial  to  the 
political  domain. 

A  social-scientific  analysis  of  legitimacy,  then, 
involves  understanding  the  ideas  and  processes 
that  legitimate  a  given  system  of  power  in  its  con¬ 
text;  exploring  the  different  dimensions  of  legality, 
normative  j  usitifiability,  and  legitimation  that  con¬ 
tribute  to  the  acknowledgment  of  power  as  rightful 
by  the  relevant  actors;  identifying  both  the  agents 
of  ideological  production  and  dissemination  and 
the  aspects  of  a  power  system  that  give  legitimating 
ideas  their  credibility;  and  finally,  exploring  the 
processes  that  may  lead  to  their  erosion.  The  next 
section  will  use  the  framework  developed  here  to 
delineate  the  legitimating  principles  and  proce¬ 
dures  of  the  main  types  of  political  systems  of  the 
20th  century  and  identify  their  typical  points  of 
vulnerability. 


The  Legitimacy  of  Different  Regime  Types 

A  third  issue  that  Weber’s  analysis  of  legitimacy 
initiated  ongoing  debate  about  was  the  relation 
between  legitimating  principles  and  the  way  sys¬ 
tems  of  power  are  organized.  He  argued  that  the 
kind  of  legitimacy  being  claimed  had  a  determin¬ 
ing  effect  on  the  mode  of  obedience,  the  kind  of 
administrative  staff  developed  to  guarantee  it,  and 
the  manner  of  exercising  authority,  all  of  which 
would  differ  according  to  their  different  legitimat¬ 
ing  principles.  It  followed  that  types  of  power 
systems  could  be  classified  according  to  the  kind  of 
claim  to  legitimacy  typically  made  by  each.  He 
then  proceeded  to  differentiate  between  three  types 
of  legitimating  claims  and  their  respective  grounds, 
which  he  termed  traditional  (belief  in  the  sanctity 
of  immemorial  traditions),  rational-legal  (belief  in 
the  legality  of  enacted  and  rationally  justifiable 
law),  and  charismatic  (belief  in  the  exceptional 
qualities  of  an  individual  and  the  normative  prin¬ 
ciples  ordained  by  him).  He  used  this  typology  to 
structure  his  political  sociology  and  as  the  basis  for 
detailed  studies  of  patrimonial  authority,  bureau¬ 
cracy,  and  religious  movements,  respectively. 

There  is  no  doubt  that  this  threefold  typology, 
and  Weber’s  development  of  it,  has  proved  histori¬ 
cally  illuminating.  For  example,  in  any  study  of  the 
process  of  modernization,  Weber’s  characteriza¬ 
tion  of  the  shift  from  traditional  to  rational-legal 
modes  of  legitimation — from  personal  to  imper¬ 
sonal  relationships  of  rule,  from  status  distinctions 
to  rights-based  authority,  from  dependency  to  citi¬ 
zenship,  together  with  the  separation  of  public 
finances  from  the  private  household  of  the  ruler — 
has  been  indispensable.  Weber’s  analysis  of  the 
appearance  of  charismatic  figures  in  times  of  dis¬ 
tress,  social  dislocation,  or  revolution  and  the  rou- 
tinization  of  their  charisma  in  a  subsequent  order  is 
also  valuable.  However,  as  a  basis  for  constructing 
a  typology  of  political  regimes  in  the  20th  century, 
Weber’s  types  have  proved  something  of  a  distrac¬ 
tion  for  political  scientists.  It  is  not  particularly 
useful  to  be  told  that  both  liberal  democracy  and 
fascism  are  different  variants  of  charismatic  author¬ 
ity,  one  more  rule  governed  than  the  other,  or  that 
communist  systems  comprised  a  combination  of 
the  traditional,  rational-legal,  and  charismatic 
modes  of  legitimacy. 

At  this  point,  the  structure  of  legitimacy  out¬ 
lined  in  the  previous  section — comprising  legality, 
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normative  justifiability,  and  public  affirmation — 
can  be  used  to  provide  a  more  coherent  basis  for 
distinguishing  between  the  different  regime  types 
of  the  20th  century:  traditional,  fascist,  commu¬ 
nist,  liberal-democratic,  and  theocratic.  For  each, 
their  distinctive  form  of  law,  source  of  authority 
and  ends  of  government,  and  mode  of  public  affir¬ 
mation  can  be  identified.  These  are  shown  in  sum¬ 
mary  form  in  Table  1. 

Military  and  other  forms  of  dictatorship  have 
been  included  as  a  limiting  case  of  a  nonlegitimate 
political  order,  born  of  illegitimacy  and  lacking 
both  a  rightful  source  of  authority  and  any  mode 
of  expressed  consent  or  public  affirmation.  Such 
legitimacy  as  military  regimes  have  is  based  entirely 
on  their  purpose  or  mission — to  save  society  from 
chaos  or  corruption — and  is  typically  defined  as 
transitional,  to  promote  the  restoration  of  a  normal 
political  order.  Like  all  regimes  whose  legitimacy  is 
limited  to  the  dimension  of  performance,  they  are 
vulnerable  once  performance  falters,  and  their  fail¬ 
ure  exposes  their  lack  of  any  valid  source  of 
authority.  Legitimate  political  orders,  in  contrast, 
which  are  secure  in  their  source  of  authority,  are 
able  to  withstand  shocks  and  performance  failures 
and  to  effect  routine  changes  of  administration  that 
do  not  threaten  the  legitimacy  of  the  system  itself. 


The  different  political  regimes  in  Table  1  are, 
like  Weber’s,  constructed  as  pure  or  ideal  types, 
and  mixed  forms  combining  more  than  one  mode 
of  legitimation  can  readily  occur.  For  example,  a 
number  of  recently  established  democracies,  par¬ 
ticularly  in  the  South,  have  persisting  elements  of 
tradition-based  legitimacy,  whether  in  customary 
forms  of  law,  chieftain-  or  clan-based  authorities, 
or  the  persistence  of  personal  favoritisms  and 
dependencies  within  a  formally  rule-based  demo¬ 
cratic  regime.  Depending  on  the  form  these  take,  a 
traditional  legitimacy  may  serve  to  strengthen  the 
legitimacy  of  more  recently  established  democratic 
institutions  or  come  to  weaken  or  undermine 
them.  Of  the  theocratic  systems,  to  take  a  different 
example,  Iran  has  developed  an  electoral  mode  of 
legitimation,  albeit  flawed,  while  Saudi  Arabia’s 
system  is  based  on  the  allegiance  of  a  traditional 
and  hereditary  social  elite.  Such  combinations  and 
transitions  are  not  uncommon  but  are  best  ana¬ 
lyzed  by  starting  with  clearly  defined  types. 

Table  1  provokes  an  obvious  question:  Why  is 
it  that  the  liberal  democratic  mode  of  legitimacy, 
and  form  of  political  system,  has  become  globally 
prevalent  by  the  start  of  the  21st  century?  This  is 
partly  for  negative  reasons — namely,  that  other 
forms  of  legitimate  political  order  have  proved  to 


Table  I  Legitimating  Elements  of  Different  20th-Century  Regime  Types 


Regime  Type 

Form  of  Law 

Source  of  Authority 

Ends  of  Government 

Mode  of  Public 
Affirmation 

Traditional 

Custom/precedent 

Heredity/the  past 

Well-being  within 
traditional  order 

Assembly  of  social 
elite 

Fascist 

Sovereign 

decisionism 

Leadership  principle 

National  purity/ 
expansion 

Mass  mobilization 

Communist 

Codification  of  the 
collective  will 

Party  monopoly  of 
truth  and 
representation 

Building  communist 
future 

Mass  mobilization 

Liberal-democratic 

Constitutional  rule 
of  law 

The  people  through 
competitive 
election 

Individual  rights 
protection  and 
advancement 

Electoral 

endorsement 

Theocratic 

Sacred  texts  and 

canons 

Divine  will 
interpreted  by 
hierarchy 

Purifying  society’s 
moral  order 

Various  of  the  above 

Dictatorial 

Decree 

None 

Restore  order  and 
national  unity 

None 
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be  ill  adapted  to  some  key  aspect  of  contemporary 
economic  and  social  conditions  and  have  found 
their  legitimacy  to  have  been  eroded.  The  heredi¬ 
tary  monopoly  of  political  authority  characteristic 
of  traditional  systems  has  proved  to  be  vulnerable 
to  the  modern  requirement  of  a  career  open  to  tal¬ 
ent  and  to  popular  demands  for  inclusion  in  the 
political  process.  The  Communist  Party’s  claim  to 
represent  workers’  interests  proved  to  be  increas¬ 
ingly  out  of  step  with  its  members’  own  perceptions 
of  them;  its  claim  to  exclusive  truth  lost  credibility, 
and  its  goal  of  a  realizable  communist  society  came 
up  against  the  limits  of  its  system  of  economic  plan¬ 
ning.  The  fascist  pursuit  of  expansionary  national 
goals  typically  led  to  self-destructive  wars,  or, 
where  these  were  avoided,  an  authority  vested  in 
the  person  of  an  individual  leader  proved  unable  to 
survive  his  death.  Theocracies  have  proved  vulner¬ 
able  to  fundamentalisms  that  have  quickly  forfeited 
popularity,  or  else  they  have  provoked  adherents  of 
other  faiths  to  open  disaffection.  Each  system  has 
had  its  own  internal  crisis  tendencies,  inherent  in  its 
legitimating  principles  and  procedures,  which  have 
in  most  cases  proved  to  be  terminal. 

The  prevalence  of  liberal  democracy,  in  con¬ 
trast,  is  due  to  the  fact  that  it  has  proved  to  be  the 
only  sustainable  legitimate  order  compatible  with 
the  conditions  of  market  capitalism,  on  the  one 
hand,  and  with  popular  demands  for  freedom  and 
full  inclusion  in  the  political  process,  on  the  other. 
Market  capitalism’s  antipaternalist  principles — 
individuals  are  the  best  judge  of  their  own  inter¬ 
ests,  are  responsible  for  their  own  fate,  and  are 
sovereign  in  the  consumer  market — have  over  time 
led  to  the  demand  for  people  to  be  sovereign  in  the 
political  sphere  also  and  have  undermined  pater¬ 
nalist  forms  of  legitimacy,  especially  as  education 
has  become  more  widespread.  At  the  same  time, 
the  increasingly  global  dimensions  of  communica¬ 
tion  have  made  closed  political  systems,  claiming  a 
monopoly  of  information  and  ideology,  unsustain¬ 
able  without  levels  of  repression  that  provoke 
popular  resistance.  The  People’s  Republic  of  China 
stands  out,  however,  as  a  potential  test  of  these 
assumptions.  It  remains  an  open  question  whether 
the  post-Tiananmen  combination  of  performance 
legitimacy  through  rapid  economic  growth  and 
strenuous  reinvention  of  the  party’s  justifying  ide¬ 
ology  will  carry  conviction  over  time  beyond  the 
ranks  of  its  own  cadres. 


This  is  not  to  say  that  liberal  democracy  does 
not  have  its  own  characteristic  crisis  tendencies, 
deriving  from  some  of  its  legitimating  principles 
and  procedures.  The  electoral  competition  for 
power,  with  its  winner-take-all  outcome,  gives  rul¬ 
ing  leaders  and  parties  a  powerful  incentive  to 
curtail  basic  rights  and  freedoms  for  opponents  or 
to  skew  the  registration  and  electoral  process  in 
their  favor.  Democracy’s  majoritarian  principle 
can  lead  to  the  permanent  exclusion  of  significant 
minorities  from  any  share  in  power,  provoking 
secessionist  tendencies  if  not  outright  civil  war. 
Also,  the  inescapable  tension  between  the  eco¬ 
nomic  and  social  inequalities  intrinsic  to  market 
capitalism  and  the  equality  of  citizenship  and 
political  voice,  which  democracy  promises,  con¬ 
tains  the  potential  for  disruptive  social  conflict. 
Any  of  these  challenges  can  lead  to  authoritarian 
and  exclusionary  deformations  of  democracy,  in 
which  democratic  norms  and  procedures  become 
more  of  a  legitimating  faqade  than  a  genuinely 
operative  and  regulating  set  of  ideas. 

However,  the  fact  that  even  authoritarian 
regimes  now  claim  the  mantle  of  democratic  legiti¬ 
macy  points  to  a  striking  feature  of  the  interna¬ 
tional  political  landscape  since  the  collapse  of 
communist  regimes  in  1989.  Democracy  is  now 
the  only  form  of  legitimate  political  order  that  has 
wide  attractiveness  across  borders  and  whose 
norms  have  become  endorsed  at  an  international 
level.  This  endorsement  provides  a  strong  external 
legitimation  to  domestic  political  forces  engaged  in 
a  struggle  for  democracy  within  a  country.  It  is 
also  given  practical  effect  through  positive  mea¬ 
sures  of  democracy  support  between  countries  and 
negative  pressure  where  aid,  trade,  and  debt  inter¬ 
dependences  are  involved.  Democracy  promotion 
in  one  form  or  another  on  the  part  of  established 
democracies  in  their  dealings  with  recent  or  less 
secure  ones  has  become  a  major  enterprise  since  its 
inception  in  the  early  1990s. 

Yet  it  is  also  necessary  to  recognize  the  limits  to 
democracy  promotion  and  the  point  where  it 
comes  up  against  other  legitimating  ideas  and 
norms.  Most  obvious  of  these  is  the  force  of 
nationalism  and  the  principle  of  national  sover¬ 
eignty  or  autonomy.  Since  the  late  18th  century, 
the  idea  of  nationhood,  as  solidarity  between  a 
people  sharing  key  markers  of  identity,  has  inspired 
political  movements  for  national  unification  across 
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borders,  secession  from  within  them,  and  indepen¬ 
dence  from  imperial  powers  alike.  The  principle  of 
nationalism  does  not  provide  the  specification  for 
any  one  kind  of  political  system  but  only  its  appro¬ 
priate  spatial  reach.  It  can  best  be  regarded  as  a 
background  legitimating  condition  for  the  rulers  of 
any  and  every  regime  in  the  modern  world — that 
they  should  share  the  national  identity  of  the  ruled 
and  should  not  act  as  agents  of,  or  be  subordinate 
to,  an  external  power.  Nationalism  can  also,  of 
course,  become  foregrounded  as  a  means  to  com¬ 
pensate  for  some  significant  legitimacy  deficit  in 
other  aspects  of  a  given  regime. 

Democracy  promotion,  then,  comes  up  against 
one  limit  when  it  is  perceived  as  an  instrument  for 
the  national  interests  of  the  promoting  power  or 
powers  and,  above  all,  when  its  promotion  infringes 
on  national  self-determination,  as  in  attempts  to 
impose  democracy  by  force.  Another  limit,  already 
mentioned,  is  set  by  the  persisting  legitimacy  of 
traditional  norms  and  practices,  especially  where 
democratic  institutions  are  seen  to  displace  or 
undermine  these,  rather  than  achieve  an  acceptable 
mode  of  coexistence  with  them.  The  transition 
from  one  type  to  the  other  is  necessarily  a  gradual 
process,  requiring  a  generational  shift  in  the  under¬ 
lying  beliefs  supportive  of  a  new  order.  In  this 
context,  it  is  worth  noting  that  in  the  UK,  it  has 
taken  until  the  21st  century  for  the  hereditary 
component  in  the  upper  chamber  of  parliament, 
based  on  a  traditional  principle  of  legitimacy,  to  be 
finally  abolished. 

A  further  limit  to  the  project  of  democracy  pro¬ 
motion,  specific  to  Islamic  countries,  lies  in  the 
attraction  of  a  competing  principle  of  legitimacy — 
that  of  a  political  order  based  on  sharia  law  and 
interpreted  by  a  religious  hierarchy.  Although  the 
installation  of  such  an  order  at  state  level  has  over 
time  proved  divisive  and  unpopular,  the  idea  has 
been  a  potent  source  of  support  for  opposition 
movements  mobilizing  against  only  weakly  legiti¬ 
mated  regimes  in  the  Arab  world  and  beyond. 

All  three  of  these  alternative  sources  of  legiti¬ 
macy  limiting  the  effectiveness  of  democracy  pro¬ 
motion  were  combined  in  Afghanistan.  The  lengthy 
military  occupation  weakened  the  legitimacy  of 
democratic  institutions,  already  compromised  by 
electoral  fraud,  with  the  taint  of  foreign  imposi¬ 
tion.  These  institutions  were  both  distorted  and 
challenged  by  clan-based  forms  of  traditional 


authority  to  which  people  owe  their  primary  alle¬ 
giance.  And  both  these  elements  were  used  by  the 
insurgent  Taliban  to  gain  support  for  its  alterna¬ 
tive  theocratic  agenda.  Given  such  a  combination, 
it  is  hard  to  imagine  a  more  unfavorable  terrain  for 
a  democratizing  project. 

Legitimacy  Beyond  the  State 

Any  analysis  of  the  legitimating  principles  of  con¬ 
temporary  regimes  has  to  take  into  account  the  way 
in  which  forces  beyond  the  nation-state  act  to  con¬ 
strain  its  decisional  autonomy  and  compromise  its 
claim  to  sovereignty.  These  forces  may  include  the 
actions  of  other  states,  pressures  from  the  financial 
and  economic  markets,  or  the  rulings  of  interna¬ 
tional  treaty  bodies  and  other  international  organi¬ 
zations.  Since  the  actions  of  the  latter  are  becoming 
both  more  pervasive  and  more  intrusive,  for  good 
and  ill  alike,  the  basis  for  their  legitimacy  is  becom¬ 
ing  an  important  subject  of  analysis  and  debate. 

The  legitimacy  of  the  international  organiza¬ 
tions  set  up  in  the  aftermath  of  World  War  II,  and 
the  decisions  and  resolutions  taken  by  these  bod¬ 
ies,  has  historically  derived  from  the  participation 
of  member  state  governments  and  the  process  of 
agreement  between  them  required  for  a  binding 
decision.  However,  in  more  recent  times,  that 
legitimacy  has  increasingly  been  called  into  ques¬ 
tion  on  the  grounds  that  voting  rights  have  been 
skewed  toward  the  major  powers  in  the  developed 
world  and  have  not  reflected  either  emerging  pow¬ 
ers  or  the  weight  of  their  populations.  This  is  most 
obvious  with  the  United  Nations  (UN)  itself, 
whose  permanent  members  of  the  Security  Council 
with  veto  powers  constitute  a  historical  anomaly 
dating  from  the  1940s  and  1950s.  With  similar 
effect,  voting  rights  in  the  International  Monetary 
Fund  and  World  Bank  depend  on  the  amount  of 
capital  a  country  contributes  not  on  equality  of  state 
membership.  And  even  with  a  body  like  the  World 
Trade  Organization,  whose  members  states  are  for¬ 
mally  equal,  decisions  in  practice  were  for  a  long 
time  predetermined  in  so-called  green  room  meet¬ 
ings  of  the  larger  trading  nations  and  influenced  by 
the  much  greater  research  capacity  available  to  their 
delegations.  The  cumulative  outcome  of  these  pro¬ 
cedural  inequalities  has  been  policies  and  agree¬ 
ments  that  have  benefited  the  developed  world  more 
than  the  developing  one. 
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In  the  changed  conditions  of  the  21st  century,  it 
is  now  widely  recognized  that  full  legitimacy  and 
the  respect  necessary  for  effective  global  coopera¬ 
tion  can  only  be  restored  to  these  institutions  by 
reform  of  their  membership  rules  and  procedures. 
However,  there  is  considerable  disagreement  about 
what  this  should  entail.  If  we  take  the  UN  itself, 
there  are  at  least  three  contending  views.  The  first 
holds  that  the  basic  idea  of  the  Security  Council, 
consisting  of  permanent  members  plus  rotating 
temporary  ones,  is  still  necessary  to  ensure  that  the 
major  powers  have  an  incentive  to  take  part  in  the 
organization  rather  than  bypass  it.  From  this  view¬ 
point,  what  is  needed  is  to  extend  the  permanent 
membership  to  include  states  that  better  reflect 
today’s  configuration  of  power,  such  as  Brazil, 
Germany,  India,  and  South  Africa. 

A  second  view  holds  that  the  whole  idea  of 
privileged  membership  of  the  Security  Council  is 
anachronistic  and  can  no  longer  be  justified.  Any 
criterion  of  fairness  would  require  equal  opportu¬ 
nity  for  states  to  participate  in  the  Council,  albeit 
with  some  adjustment  mechanism  to  reflect  differ¬ 
ences  in  size.  The  latter  principle  could  also  be 
extended  to  the  General  Assembly  by  assigning 
differential  voting  power  according  to  the  size  of  a 
member  state’s  population. 

A  third  more  radical  view,  associated  with  pro¬ 
ponents  of  cosmopolitan  democracy,  holds  that 
the  legitimacy  deficit  of  international  organiza¬ 
tions  consists  in  the  fact  that  they  represent  only 
states  and  their  interests,  not  peoples.  In  this  view, 
reformed  intergovernmental  organs  of  the  UN 
should  be  complemented  by  a  people’s  assembly, 
directly  elected  from  all  the  peoples  of  the  world. 
They  point  out  that  the  UN  has  already  begun 
moving  in  this  direction  by  recognizing  nongov¬ 
ernmental  organizations  and  advocacy  groups  at 
its  world  conferences  and  as  participants  in  inter¬ 
national  treaty  negotiations.  A  move  to  a  more 
formalized  system  of  popular  representation  would 
constitute  a  more  equitable  and  transparent  exten¬ 
sion  of  this  practice. 

These  differences  of  view  show  that  there  are  no 
agreed  and  settled  criteria  for  establishing  what 
counts  as  legitimacy  in  respect  of  international 
bodies.  Nowhere  has  the  issue  been  more  con¬ 
tested  than  in  debates  about  the  legitimacy  deficit 
or  democratic  deficit  of  the  European  Union  (EU). 
All  depends  on  what  kind  of  political  animal  one 


considers  the  EU  to  be.  If  one  considers  it  as  a  quasi 
state,  then  its  lack  of  any  single  decision-making 
institution  as  the  focus  for  accountability  and  the 
lack  of  any  developed  “demos”  with  powers  to 
hold  it  accountable  through  the  electoral  process 
are  major  legitimacy  deficits.  If  one  considers  it 
primarily  as  an  intergovernmental  organization, 
then  the  lack  of  transparency  in  the  decision  mak¬ 
ing  of  the  council  of  ministers  and  of  adequate 
democratic  oversight  of  these  ministers  through 
national  parliaments  are  major  deficits.  If,  finally, 
one  argues  that  the  EU  is  a  sui  generis  construction, 
with  careful  balancing  of  powers  and  accountabili¬ 
ties  between  its  different  institutions,  then,  there  is 
less  evidence  of  a  serious  deficit  than  under  the  first 
two  assumptions.  Indeed,  it  can  be  pointed  out  that 
the  EU  already  possesses  all  the  features  that  the 
most  radical  reformers  of  the  UN  wish  to  see  for 
that  organization,  including  council  voting  weighted 
by  population,  a  directly  elected  parliamentary 
assembly,  and  a  powerful  court  to  enforce  legisla¬ 
tive  decisions. 

One  further  position  in  the  debate  on  the  EU 
deserves  mention  here  because  it  rests  on  a  form 
of  legitimacy  that  is  quite  generalized  in  modern 
societies,  both  within  and  beyond  the  political 
domain.  This  is  the  view  that  since  the  prime  func¬ 
tion  of  the  EU  is  that  of  economic,  social,  and 
legal  regulation,  it  is  entirely  appropriate  that  its 
decision  making  should  be  entrusted  to  a  techno¬ 
cratic  body  such  as  the  European  Commission 
and  the  technical  committees  representing  national 
bureaucracies  that  work  for  it.  After  all,  in  most 
political  systems,  regulatory  activity  is  undertaken 
by  independent  bodies — courts  of  law,  adminis¬ 
trative  tribunals,  regulatory  bodies  of  all  kinds — 
whose  judgments  are  more  effective  precisely 
because  they  are  not  subject  to  political  or  elec¬ 
toral  pressure.  Their  source  of  legitimacy  stems 
from  the  authority  of  professional  expertise  in  a 
knowledge-based  society  and  their  ability  to  give 
reasoned  justifications  for  their  decisions,  which 
can  withstand  the  scrutiny  of  their  professional 
peers. 

Whatever  the  merits  of  this  view  in  respect  of 
the  EU — and  it  is  difficult  to  maintain  that  its 
functions  are  merely  regulatory  and  not  also  redis¬ 
tributive  and  therefore  highly  political — the  wider 
point  about  the  legitimacy  of  professional  exper¬ 
tise  is  an  important  one  for  understanding  the 
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legitimating  basis  of  many  familiar  types  of 
authority  in  contemporary  society — doctors,  law¬ 
yers,  accountants,  academics,  and  so  on — and  of 
the  wider  social  authority  of  the  institutions  in 
which  they  work.  However,  in  a  democratic  soci¬ 
ety,  these  professional  institutions  and  activities 
have  to  meet  certain  criteria  if  they  are  to  be  fully 
legitimate,  such  as  transparency,  accountability, 
and  accordance  with  fair  procedures.  Moreover, 
those  independent  bodies  that  have  a  high  politi¬ 
cal  salience — central  banks,  regulatory  bodies  of 
public  utilities  and  the  media,  ombudsmen,  and  so 
on — need  their  terms  of  reference  and  membership 
to  be  endorsed  by  parliament,  and  to  be  account¬ 
able  to  it,  if  their  decisions  are  to  have  full  legiti¬ 
macy,  combining  professional  or  technocratic  and 
democratic  authority. 

Conclusion 

To  speak  of  full  legitimacy  implies  that  legitimacy 
is  not  an  all-or-nothing  affair,  such  that  a  body 
exercising  power  either  has  it  or  does  not,  but  is  a 
matter  of  degree.  The  greater  the  acknowledged 
legitimacy  of  a  system  of  power — deriving  moral 
authority  from  a  socially  recognized  source  and 
serving  the  societal  ends  or  purposes  expected  of  it, 
according  to  accepted  conventions  or  standards, 
whatever  these  happen  to  be — to  that  extent  will 
those  subordinate  to  it  not  have  to  be  bribed  or 
coerced  into  obedience  and  the  more  likely  that 
they  will  cooperate  in  helping  it  achieve  its  pur¬ 
poses,  where  their  cooperation  is  necessary  to 
achieve  these.  In  sum,  legitimacy  can  be  seen  as  a 
form  of  public  good,  which  carries  distinctive  ben¬ 
efits  for  rulers  and  the  ruled  alike. 

Research  on  legitimacy  is  now  moving  beyond 
the  governmental  sphere  to  areas  of  authority  out¬ 
side  it — to  economic  institutions,  nongovernmental 
organizations,  criminal  justice  systems,  international 
bodies  of  all  kinds,  and  to  other  authority  domains. 
Attempts  are  being  made  to  quantify  the  degree  of 
legitimacy  of  different  regimes  and  to  explore  more 
deeply  the  psychological  processes  involved  in 
legitimation.  Moreover,  with  the  check  being  expe¬ 
rienced  to  the  so-called  third  wave  of  democracy, 
interest  is  being  revived  in  exploring  the  legitimating 
bases  of  authoritarian  regimes,  on  the  assumption 
that  they  cannot  be  understood  as  either  premature 
or  failed  democracies  but  need  to  be  analyzed  on 


their  own  terms.  All  these  developments  promise 

an  even  richer  research  agenda  for  the  study  of 

legitimacy  in  the  future. 
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Liberal  Parties 


Liberal  parties  (LPs)  represent  one  of  the  major 
party  “families,”  such  as  the  conservative,  social¬ 
ist,  and  nationalist  parties  originating  from  the 
19th-  and  early-20th-century  cleavages  and  party 
systems,  in  the  context  of  which  they  have  to  be 
placed.  The  major  characteristics  of  LPs  include  a 
social  background  in  bourgeois  groups,  which 
advocate  individual  and  property  rights  and  the 
rule  of  law,  in  opposition  to  feudal  and  autocratic 
forms  of  the  state.  This  entry  discusses  the  his¬ 
torical  roots,  social  bases,  ideological  orienta¬ 
tions,  and  contemporary  developments  of  LPs. 

Terminology 

There  are  two  kinds  of  terminological  variety.  In 
the  first  case,  liberal  can  have  rather  different 
meanings  concerning  ideological  orientations  or  a 
party’s  position  within  the  party  system.  Whereas 
in  the  European  tradition  liberal  usually  refers  to 
parties  in  the  middle  of  the  political  spectrum,  in 
the  North  American  context,  liberal  has  a  more 
left-wing,  or  social-democratic,  meaning.  In  the 
second  case,  LPs  come  with  many  different  names. 
Whereas  in  the  19th-century  LP  labels  were  more 
consistent  with  basic  ideological  features,  after 
1945,  this  clarity  began  to  diminish.  Basically,  the 
term  originated  after  the  French  Revolution,  when 
the  term  liberal  had  first  been  used  by  Spanish 
constitutionalists  around  1812.  Only  in  the  second 
half  of  the  19th  century  did  larger  parties  in  Britain 
(in  the  Whig  tradition)  and  Canada  call  themselves 
the  Liberal  Party,  whereas  in  France  or  Italy  terms 
such  as  republicans  or  radicals  were  more  promi¬ 
nent.  In  some  countries  freedom  labels  prevail 
(e.g.,  Free  Democrats  in  Switzerland,  and  later  in 
Germany,  and  Party  for  Freedom  and  Democracy 
in  the  Netherlands).  Examples  of  democratic  par¬ 
ties  exist,  for  example,  in  Luxemburg,  whereas  the 
U.S.  Democrats  show  a  mix  of  liberal  but  also 
social-democratic  ideology.  In  Denmark,  the  term 
venstre  (Left)  is  still  in  current  use.  Some  (agrarian) 
center  parties,  such  as  those  in  the  Scandinavian 
countries,  were  also  close  to  liberalism.  By  con¬ 
trast,  parties  with  a  definitely  conservative  charac¬ 
ter  labeling  themselves  liberal  party  must  not  be 
included  in  the  LP  family,  such  as  the  Liberal 


Democratic  Party  in  Japan  or  the  Liberal  Party  in 
Australia,  which  for  decades  has  been  the  conser¬ 
vative  counterpart  of  Labour.  After  1945,  and 
particularly  during  the  past  decades,  a  stronger 
tendency  can  be  observed  to  use  liberal  as  a  party 
label  rather  at  random  (e.g.,  the  right-wing  popu¬ 
list  Liberal  Democrats  in  Russia  after  1991). 

History 

Early  liberal  movements  developed,  between  the 
mid- 18th  and  mid-  19th  centuries,  from  the  cleav¬ 
age  between  feudal  society  and  an  autocratic  state, 
on  the  one  hand,  and  commercial  economy  and 
bourgeois  property  and  liberty  interests,  on  the 
other.  Economic  and  personal  liberties,  the  rule  of 
law,  and  parliamentary  representation  were  core 
goals  in  the  unfolding  party  competition  with  con¬ 
servative  ideologies  and  organizations.  A  second 
“classical”  period  has  been  characterized  by  suc¬ 
cess  in  parliamentary  elections,  development  of 
stronger  party  organization,  and  even  governmen¬ 
tal  majorities  in  several  countries  of  Europe  (e.g., 
in  Great  Britain)  and  elsewhere  in  the  last  third  of 
the  19th  century  up  to  World  War  I  and  until  some 
time  after.  A  third  period  of  decline  of  LPs,  after 
World  War  I  and  during  the  1930s,  saw  the  rise  of 
socialist/labor  parties  on  one  side  and  populist/ 
fascist  forces  and  regimes  on  the  other.  In  a  fourth 
period,  after  1945,  political  system  values  in  West¬ 
ern  liberal  democracies  converged  with  party  ide¬ 
ology,  which  weakened  the  specific  profile  and 
function  of  LPs.  In  many  countries,  they  could 
only  secure  a  minor  position  in  the  party  systems. 

Liberalism  and  LPs  had  their  primary  origin  and 
relevance  in  Europe  and  some  other  regions  such  as 
North  America.  While  Russia  and  China  were 
dominated  by  communism,  large  parts  of  other  con¬ 
tinents,  particularly  Africa  and  Asia,  have  mainly 
been  concerned  with  liberation  from  colonial  rule. 
Only  some  Latin  American  countries  developed 
conditions  for  liberal  or  “radical”  parties;  since  the 
1930s,  military  and  populist  authoritarian  regimes 
created  cycles  of  suppression  and  revolt  unfriendly 
to  LPs. 

Ideology 

Early  liberal  ideas  focused  on  free,  economically 
independent,  enlightened  individuals  against  an 
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autocratic  state,  a  feudal  social  order,  and  the 
moral  and  intellectual  authoritarianism  of  the 
Catholic  Church.  Political  values  evolved  around 
republican  or  at  least  constitutional  principles,  the 
rule  of  law,  and  limited  state  functions  and  politi¬ 
cal  representation,  quite  often  with  concepts  of 
limited  suffrage.  Economic  principles  highlight 
market  economy,  property  and  commercial  free¬ 
doms  of  independent  producers,  industrial  prog¬ 
ress,  economic  growth,  and  free  trade.  Capitalist 
development  produced  tensions  between  securing 
competition  by  antitrust  laws  and  other  means  and 
accepting  capital  concentration  and  corporate 
firms.  Trade  unions  as  a  counterforce  have  been 
accepted  only  reluctantly. 

In  social  terms,  formal  equality  often  gave  way 
to  equal  opportunity  and  accepting  factual  inequal¬ 
ity,  class  polarization,  and  economic  domination 
by  corporations  and  the  wealthy.  Supporting 
equality  by  education,  however,  has  become  a  con¬ 
tinuous  program  feature  of  LPs.  Welfare  state 
institutions  have  long  been  neglected  but  finally 
seen  as  inevitable  to  a  limited  degree. 

In  the  course  of  history,  a  growing  “ambivalence 
of  liberalism”  and  tensions  inside  LPs  or  between 
different  LPs  developed  between  the  liberal- 
conservative  and  liberal-radical  or  social-liberal 
currents,  with  radicals  or  liberal-radicals  advocat¬ 
ing  more  republican,  democratic,  and  secular  val¬ 
ues.  These  tensions  intensified  under  the  pressure  of 
rising  socialist  or  social-democratic  parties  before 
and  after  World  War  I,  in  the  face  of  corporate 
capitalism,  and  within  less  polarized  party  systems 
after  World  War  II,  when  LPs  in  many  countries 
leaned  toward  liberal-conservative  positions  rather 
than  liberal-radical  or  social-liberal  ones. 

Party  Organization 

Traditionally,  party  organization  in  terms  of  mem¬ 
bership  and  professional  organization  of  LPs  has 
been  rather  weak.  While  representing  independent 
middle-class  groups,  a  cadre,  or  elite,  structure 
prevailed.  Strong  links  with  large  special-interest 
organizations  have  been  more  scarce.  In  addition, 
a  culture  of  individualistic  attitudes  has  been 
prevalent  in  liberal  circles,  even  more  so  in  recent 
times  of  growing  individualism.  LPs  rely  heavily 
on  members  who  are  lawyers  or  doctors,  indepen¬ 
dent  businesspeople,  and  in  the  education  services 


and  tend  to  be  less  member  than  voter  parties.  Lor 
party  finances,  they  often  depend  on  state  sources 
or  strong  economic  interests. 

Electoral  Base,  Government  Participation 

In  the  period  after  1945,  LPs  had,  at  least  in 
Europe,  on  the  average,  a  minor  electoral  position. 
Their  record  of  government  participation,  how¬ 
ever,  was  considerably  stronger  in  the  form  of 
coalition  parties.  Only  in  a  few  European  countries 
could  the  LPs  establish  a  strong  government  posi¬ 
tion  such  as  in  Luxemburg  (Democratic  Party), 
Denmark  (Venstre),  Switzerland  (Lree  Democratic 
Party  [Lreie  Demokratische  ParteiJ),  and  for  some 
time  in  Prance  (Union  for  Prench  Democracy 
[Union  pour  la  Democratie  Prancaise,  UDPJ,  a 
merger  party  in  the  1970s  and  1980s).  Medium¬ 
sized  parties  such  as  the  Dutch  People’s  Party  for 
Preedom  and  Democracy  (Volkspartij  voor  Vrijheid 
en  Democratie,  WD),  the  Belgian  Party  for 
Preedom  and  Progress  (Partij  voor  Vrijheid  en 
Vooruitgang,  PW/Parti  de  la  Liberte  et  du  Progres, 
PLP),  or  small  parties  such  as  the  Lree  Democratic 
Party  (Lreie  Demokratische  Partei,  LDP)  in 
Germany,  the  Liberal  Party  in  Norway,  the  Radicale 
Venstre  in  Denmark,  and  the  Dutch  D’66  could 
also  participate  in  coalition  governments  for  longer 
periods  of  time.  Outside  Europe,  only  the  Liberal 
Party  in  Canada  maintained  itself  as  the  most 
important  national  government  party  for  decades, 
from  the  late  19th  century  and  also  after  1945. 

The  main  factors  for  the  structural  weakness  of 
most  LPs  are  party  systems  dominated  by  conser¬ 
vative  or  Christian-democratic  parties  and  social- 
democratic  ones,  with  only  moderate  left/right 
polarization,  in  a  context  of  declining  ideologies. 
In  addition,  Western  liberal  democracies  strongly 
integrated  liberal  values  into  their  constitutional 
structure  and  weakened  the  ideological  profile  of 
LPs.  In  terms  of  the  electoral  base,  problems  arose 
from  social  structural  changes:  The  decline  of  the 
old  middle  class  of  independent  businesses  neces¬ 
sitated  moves  toward  the  employed  service  class 
of  middle-level  employees  against  much  external 
competition  and  internal  tensions.  Since  the 
1980s,  Green  parties  have  emerged  as  a  new  com¬ 
petitor  for  young,  urban,  and  well-educated  social 
groups.  Thus,  unless  LPs  could  operate  from  a 
well-established  governmental  position  (such  as  in 
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Canada  and  some  European  countries),  structural 
conditions  changed  rather  unfavorably  in  the  1960s 
and  1970s.  Since  the  1970s,  ideological  develop¬ 
ments  toward  neoliberalism  have  moved  LPs  into  a 
closer  relationship  and,  at  the  same  time,  stronger 
competition  with  modernizing  conservative  parties. 

Contemporary  Developments 

In  the  context  of  the  recent  waves  of  democratiza¬ 
tion  (1970s  to  1990s),  one  would  have  expected 
that  the  LPs  would  play  stronger  roles  owing  to 
their  affinity  with  liberal  democracy.  In  fact,  dur¬ 
ing  these  transformation  periods,  some  parties 
with  liberal  ideology  or  name  emerged,  but  they 
turned  out  to  remain  rather  weak  and  often  could 
not  survive  under  a  polarization  between  left-wing 
liberation  movements  and  conservative  or  even 
reactionary  forces. 

The  fall  of  military  or  other  dictatorships  in 
Southern  Europe  and  Latin  America  in  the  1970s 
and  1980s  only  restored  small  currents  of  liberal 
organizations,  somewhat  stronger  in  Argentina’s 
radical  tradition  (Radical  Civic  Union).  A  special 
case  developed  in  South  Africa  in  the  concluding 
phase  of  the  apartheid  regime  (Democratic  Alliance). 

In  Russia  and  Eastern  Europe  in  the  early 
1990s,  the  fall  of  communist  regimes  finally 
opened  up  new  opportunities  for  LPs,  which,  how¬ 
ever,  only  materialized  to  a  small  degree.  The 
double  transformation  toward  liberal  economic 
and  political  systems  with  their  high  economic  and 
social  costs  for  the  population,  in  combination 
with  increasing  electoral  barriers  and  missing 
structural  traditions  of  an  independent  middle 
class,  has  been  unfavorable  for  sustainable  LPs. 
Thus,  early  offshoots  soon  reduced  in  size,  such  as 
Yabloko  in  Russia,  or  transformed  ideologically 
(in  a  conservative  direction,  such  as  the  Civic 
Union  in  the  Czech  Republic,  or  even  to  right-wing 
populism,  such  as  FIDESZ  in  Hungary).  Some 
potential  developed  in  Slovenia  and,  after  several 
electoral  turnabouts,  in  Poland,  where  the  Civic 
Union  as  a  liberal-conservative  governing  party 
could  emerge  from  the  2007  election. 

Since  the  1980s,  globalized  capitalism  has  con¬ 
verged  with  ideological  currents  of  neoliberalism, 
which  has  led  LPs  in  many  countries  to  advocate 
deregulation,  low  taxes,  and  downgrading  welfare 
state  and  public  sector  functions,  with  a  stronger 


affinity  to  conservative  (neoconservative)  parties 
and  coalitions.  The  consequences  of  the  dramatic 
worldwide  financial  and  economic  crisis  of  2008 
may,  however,  leave  them  entrapped  and  in  need 
of  strategic  reorientation. 

Apart  from  economic  liberalism,  currently,  LPs 
are  once  again  stressing  the  values  of  religious  tol¬ 
erance  and  secularism  as  well  as  protection  of 
lifestyle  minorities.  In  addition,  the  new  techno¬ 
logical  capacities  for  surveillance  put  the  limitation 
of  state  powers  back  on  the  agenda  of  LPs. 

Transnational  Networks 

In  1947,  LPs  formed  the  Liberal  International 
(http://www.liberal-international.org)  for  fostering 
some  ideological  coherence,  cooperation,  and  sup¬ 
port  for  emerging  parties.  Their  membership  list, 
sometimes  with  new  entries  and  expulsions,  pro¬ 
vides  at  least  an  indicator  for  identifying  LPs  in 
times  of  decreasing  clarity  of  labels.  On  a  regional 
basis,  the  Alliance  of  Liberals  and  Democrats  in 
Europe  and  its  caucus  in  the  European  Parliament 
and  in  the  Council  of  Europe  represent  a  further 
step  in  transnational  organization. 

Tbeo  Schiller 
Philipps-  Universitat  Marburg 
Marburg,  Germany 
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Liberalism 


Liberalism  offers  a  prescription  of  how  the  state  is 
to  deal  with  citizens:  Loosely  speaking,  the  state  is 
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to  address  citizens  as  equal  individuals.  The  rise  of 
liberalism  therefore  requires  the  prior  or  more  or 
less  simultaneous  development  of  a  strong  princi¬ 
ple  and  practice  of  individualism.  There  is  a  con¬ 
siderable  literature  on  the  roles  of  individualism 
and  of  individualist  Protestantism  in  the  develop¬ 
ment  of  capitalism  but  a  far  less  rich  discussion  of 
its  role  in  the  development  of  political  liberalism. 
This  is  not  a  little  odd,  because  political  liberalism 
is  defined  specifically  for  a  society  of  individuals, 
and  it  requires  constitutional  protections  of  indi¬ 
vidual  citizens  against  intrusions  by  the  state. 
These  three  concepts — individualism,  constitu¬ 
tionalism,  and  liberalism — are  closely  related 
historically,  causally,  and  conceptually.  Before 
turning  to  the  structure  or  content  of  liberalism, 
there  are  two  major  preliminary  issues  to  discuss 
here:  an  explanation  of  why  liberalism  came  to  its 
central  place  in  political  theory  and  practice  when 
and  where  it  did  and  some  account  of  how  it  can 
be  protected  or  enforced. 

Individualism 

Political  liberalism  is  inherently  a  philosophy  and 
practice  of  protecting  individuals  to  live  and  act  as 
they  please,  so  long  as  they  do  not  harm  others; 
without  individualism,  therefore,  it  has  no  point. 
The  central  figure  in  the  history  of  a  vision  of  the 
place  of  individualism  in  political  theory  is  Thomas 
Hobbes,  who  assumes  individualism  in  his  account 
of  social  order  and  the  state.  One  might  suppose 
that  his  assumption  of  individualism  is  normative 
or  libertarian.  But  for  him,  in  fact,  it  is  much  more 
explicitly  a  descriptive  and  causal  issue  just  as  Karl 
Marx’s  or  Max  Weber’s  account  of  economic 
motivation  is  causal.  Descriptively,  individualism 
is  based  on  an  assumption  about  human  nature. 
Causally,  therefore,  it  is  a  necessary  part  of  the 
explanation  of  human  behavior  and,  by  implica¬ 
tion,  of  political  institutions  that  are  designed  to 
deal  with  individuals.  We  are  self-interested;  there¬ 
fore,  to  explain  our  behavior,  one  must  start  from 
the  assumption  of  self-interest. 

Historians  continue  to  debate  when,  where,  and 
why  individualism  first  arose.  The  most  common 
view  is  that  peasants  in  England  were  communally 
organized  and  held  together  by  the  fetters  of  the 
kinship  group,  and  their  land  was  collectively,  not 
individually,  owned.  Economic  progress  required 


what  Weber  calls  “defamilization.”  Richard  H. 
Tawney  observes  that  most  people  in  England  in 
the  16th  century  “have  never  seen  more  than  a 
hundred  separate  individuals  in  the  course  of  their 
whole  lives,  where  most  households  live  by  tilling 
their  great-grandfather’s  fields  with  their  great¬ 
grandfather’s  plough”  (quoted  in  Alan  Macfarlane, 
1978/1979,  pp.  53-54).  In  its  Greek  origin,  econ¬ 
omy  means  household  management,  and  until 
recent  centuries,  that  would  still  have  been  its  apt 
meaning  in  most  of  Europe.  For  the  overwhelming 
majority  of  people,  there  was  little  exchange  and 
virtually  no  money  or  commerce;  there  was  at  best 
merely  self-sufficiency  in  a  subsistence  agrarian 
society.  In  a  society  under  these  conditions,  liberal¬ 
ism  is  irrelevant. 

Major  historians  of  the  relevant  periods,  such  as 
Thomas  B.  Macaulay,  among  the  greatest  of  liber¬ 
als,  commonly  do  not  include  individualism  in 
their  indexes,  whereas  the  idea  runs  through  the 
work  of  the  great  liberal  theorist  Leonard  T. 
Hobhouse.  In  reading  the  historians  who  fre¬ 
quently  delve  into  political  theory,  one  often  won¬ 
ders  where  Hobbes  has  gone.  Not  surprisingly, 
Friedrich  A.  Hayek  makes  a  major  issue  of  indi¬ 
vidualism  and,  implicitly,  of  Hobbesism.  For  many 
liberal  theorists,  the  world  of  Hayek  and  Hobbes 
is  in  principle  our  world.  In  fairness,  many  other 
scholars  address  individualism,  although  some¬ 
what  obliquely,  through  discussions  of  Puritanism 
and  Calvinism  and  the  role  of  individualist 
Protestant  religions  more  generally  in  the  develop¬ 
ment  of  capitalism.  While  these  individualist  reli¬ 
gions  are  surely  causally  important,  secular  aspects 
of  social  life  in  these  centuries  and  even  the  secu¬ 
larizing  tendencies  of  the  individualist  religious 
beliefs  provide  the  final  force  for  remaking  English 
social  and  economic  relations  well  ahead  of  conti¬ 
nental  Europe. 

Liberalism  is  widely  recognized  as  a  magnificent 
social  invention,  perhaps  the  greatest  political 
invention  of  modern  history,  not  least  because  it 
effectively  created  political  modernity.  Its  develop¬ 
ment  required  one  of  the  greatest  changes  in  social 
structure  ever  experienced.  Elements  of  it  or  local 
instances  of  its  elements  precede  modern  times,  but 
the  real  transformation  begins  in  a  big  way  only  in 
the  17th-century  era  of  the  spectacular  efflores¬ 
cence  of — especially — Anglo-Saxon  political  the¬ 
ory.  Hobbes  is  arguably  the  most  important  figure 
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in  this  movement  in  part  because  he  is  the  most 
insistent  on  the  individualist  focus.  He  is  much  less 
concerned  with  property  than  with  social  order. 
Indeed,  he  treats  property  and  its  protection  as 
required  for  social  order  as  well  as  for  prosperity. 
One  could  rightly  say  that  order  must  precede 
finer  issues  of  wealth  and  prosperity,  so  that  liber¬ 
alism  depends  on  order.  And  one  should  note  that 
Hobbes  wrote  against  the  background  of  brutal 
wars  that  likely  informed  his  vision.  Later  writers, 
such  as  John  Locke  and  the  grand  economists  in 
the  line  of  Adam  Smith,  could  focus  on  economic 
growth  and  “the  wealth  of  nations.”  Hobbes’s 
case  shows  that  the  two  strains  of  liberalism — 
individualism  and  constitutional  protections — can 
be  separated  in  theory,  and  the  English  case  shows 
that  they  can  be  at  least  piecemeal  separated  in  the 
institutions  that  make  them  work  in  practice. 

Why  did  liberalism  come  so  late  in  history?  Its 
appeal  seems  almost  obvious.  But  social  structures 
virtually  blocked  it  through  most  of  history. 
Pervasive,  brutal  poverty  got  in  the  way  of  concern 
for  liberty,  so  much  so  that  much  of  the  vocabu¬ 
lary  of  liberalism  is  a  late  invention.  Rather  than  a 
concern  with  liberty,  which  must  have  been  hol¬ 
low,  with  its  implication  of  a  right  to  starve  a  few 
centuries  ago,  rural  families  must  sooner  have 
focused  on  collective  family  welfare  and  fears  of 
famine  and  sickness.  The  development  of  the 
transformative  concern  with  individualism  came 
first  in  England,  and  therefore,  the  idea  of  liberal¬ 
ism  seems  English,  although  it  is  soon  followed  by 
a  somewhat  different  French  conception. 

Anarchists  such  as  William  Godwin  might  hold 
the  optimistic  view  that  individualism  could  drive  a 
liberal  society  through  the  loosely  coordinated 
individual  efforts  of  large  numbers  of  people  with¬ 
out  an  overbearing  government.  But  autocracy  is  a 
more  likely  form  of  government — the  form  that  has 
controlled  most  lives  historically.  While  continen¬ 
tal  monarchies,  especially  in  France  and  Spain, 
increasingly  passed  into  greater  despotism,  the 
English  became  more  liberal  over  many  centuries. 
There  were  retrograde  moves,  such  as  the  procla¬ 
mation  of  the  doctrine  of  the  divine  right  of  kings 
and  the  introduction  of  the  Star  Chamber.  The 
final  reign  of  autocracy  in  England  was  the  monar¬ 
chy  of  the  Stuart,  Charles  I,  who,  oddly,  signed  the 
strikingly  liberal  “Petition  of  Rights”  but  then  went 
on  to  ignore  its  agreed  constraints  on  his  actions. 


Constitutionalism 

The  1787  U.S.  Constitution,  one  of  the  greatest 
liberal  documents,  written  in  light  of  English  expe¬ 
rience,  ironically,  did  not  include  explicit  state¬ 
ments  of  many  of  the  protections  that  liberals  and 
what  would  now  be  called  constitutionalists 
wanted.  Many  of  these,  including  freedoms  of  the 
press  and  of  religious  conscience,  were  added  to 
the  constitution  in  its  first  10  amendments,  the 
“Bill  of  Rights.”  It  may  seem  peculiar  to  create 
government  in  order,  at  least  in  part,  to  protect 
individuals  against  government  when  that  govern¬ 
ment  is  implicitly  enabled  to  act  against  individuals 
as  readily  as  to  act  for  them.  There  is  no  theoretical 
guarantee  that  a  supposedly  liberal  state  will  or 
must  rein  itself  in.  Constitutionalism  is  therefore  at 
best  a  pragmatic  move  that  might  work  or  that 
might  fail.  In  the  vocabulary  of  The  Federalist 
Papers,  a  constitution  is  only  a  parchment  barrier 
that  is  implicitly  no  barrier  at  all.  In  the  history  of 
this  and  many  constitutions,  such  barriers  have 
often  been  overridden,  even  trampled  by  powerful 
office  holders.  When  constitutions  succeed  in  regu¬ 
lating  conduct,  they  do  so  for  social  psychological 
reasons  and  through  James  Madison’s  device  of 
countering  ambition  with  ambition. 

The  rise  of  constitutionalism  has  been  a  long- 
drawn-out  process.  The  set  of  liberties  to  be  pro¬ 
tected  has  grown,  and  support  for  protection  of 
many  of  these  is  virtually  universal  in  liberal  states 
today.  Historical  struggles  over  protection  of  free¬ 
doms  of  the  press  and  speech  more  generally  are 
typical  of  several  other  protections  of  civil  liberties 
and  civil  rights.  Historically,  the  most  important 
issue  for  initial  constitutionalism  is  the  protection  of 
freedom  of  religious  conscience  and  practice.  This  is 
the  issue  that  tore  societies  apart  and  that  still  plays 
a  fundamental  role  in  the  development  of  liberalism. 
Diverse  Christian  sects  were  the  chief  problem  in  the 
United  States  in  1787.  Islam,  especially  militant 
Islam,  is  regarded  as  a  problem  in  much  of  Europe 
and  North  America  today.  Hobbes  noted  the  near 
impossibility  of  assessing  or  controlling  variant  reli¬ 
gious  beliefs  in  an  era  in  which,  for  example, 
England  and  Holland  were  increasingly  diverse  in 
their  religious  commitments.  Still,  he  allowed 
enforcement  of  religious  practices  as  supposedly 
necessary  for  maintaining  order.  The  wreckage 
caused  by  religious  conflicts  in  England  in  his  time 
arguably  licenses  his  illiberal  views  on  this  issue. 


Liberalism  1431 


The  Harm  Principle 

The  harm  principle  says  that  I  may  do  anything 
I  please  so  long  as  I  do  not  harm  others.  This  prin¬ 
ciple,  which  has  been  stated  over  many  centuries 
by  many  people  in  varied  ways,  has  been  elevated 
with  its  seductive  and  almost  self-defining  label  by 
John  Stuart  Mill,  who  has  perhaps,  therefore, 
come  to  own  the  idea  despite  its  common  cur¬ 
rency.  One  could  say  that  this  principle  is  the  cen¬ 
tral  commitment  of  liberalism:  Do  not  interfere  in 
the  deliberate  actions  of  others  whose  actions  do 
no  harm  to  us.  Mill  calls  it  a  very  simple  principle, 
but  the  remarkably  extensive  commentary  on  it 
suggests,  rightly,  that  it  has  subtle  nuances  and 
complex  potential  meanings.  Mill  argues  against 
many  state  regulations,  some  of  them  offensively 
intrusive  and  some  ostensibly  beneficial.  For  exam¬ 
ple,  to  go  back  to  the  early  foundation  of  liberal¬ 
ism,  the  state  should  not  impose  religious  beliefs, 
qualifications,  or  practices  on  its  citizens. 

Strong  defenders  of  the  principle  mean  it  to 
imply  that,  if  you  clearly  know  that  what  you  are 
doing  is  harmful  mainly  to  yourself  alone,  neither 
the  state  nor  I  should  interfere;  we  should  let  you 
harm  yourself.  When  this  view  is  stated  in  the 
abstract,  it  offends  many  people  because  it  seems 
to  be  cruel.  But  in  actual  fact,  Mill  is  descriptively 
right:  We  commonly  act  on  the  strong  form  of  the 
principle.  For  example,  we  do  not  prevent  you 
from  killing  yourself  through  harmful  or  risky 
actions  such  as  smoking,  skydiving,  and  experi¬ 
menting  with  powerful  drugs.  It  is  your  life  even  if 
you  wish  to  destroy  it  or  to  risk  doing  so.  We 
might  even  be  barred  from  interfering  in  your 
choice  to  commit  suicide.  Here,  one  of  the  difficult 
nuances  pops  up.  We  might  suppose  that  your  sui¬ 
cidal  urge  is  somehow  a  mistake  or  a  brief  and 
aberrant  psychological  urge  and  that  you  would  be 
grateful  tomorrow  for  our  interference  today.  If  we 
confidently  hold  these  views,  what  should  we  do? 

Rule  of  Law 

It  is  hard  to  read  the  early  history  of  England  with¬ 
out  stumbling  into  surprising,  brutal  issues  of 
unequal  treatment  under  the  law  and  of  high-handed 
actions  by  the  powerful  that  are  not  grounded  in 
law.  What  today  would  be  legal  matters  were  for¬ 
merly  matters  to  be  resolved  by  politics,  power,  and 
even  murder.  If  the  monarch  or  a  baron  wanted 


you  out  of  the  way,  you  had  little  recourse  other 
than  flight.  Women,  slaves,  and  serfs  were  often 
treated  with  brutality.  In  early  medieval  times,  dif¬ 
ferent  status  groups  were  subject  to  different  laws 
and,  especially,  different  punishments  for  the  same 
offense.  The  rule  of  law  includes  a  crude  principle 
of  fairness  according  to  which  all  are  subject  to  the 
same  law  and  under  which  there  can  be  no  sepa¬ 
rate  statuses  for  citizens. 

In  its  first  article,  the  1789  version  of  the 
French  revolutionary  Declaration  of  the  Rights  of 
Man  and  Citizen  stipulates,  “Men  are  born  and 
remain  free  and  equal  in  rights;  social  distinctions 
may  be  based  only  upon  general  usefulness.”  This 
vision  may  be  taken  as  the  ultimate  guiding  prin¬ 
ciple  of  the  rule  of  law.  The  main  target  of  such  a 
principle  is  arbitrary  government,  and  the  main 
resolution  of  this  principle  is  to  guarantee  that 
everyone  will  be  treated  in  accordance  with  the 
law.  To  make  this  work,  there  must  be  general 
social  order  and  institutions  to  control  officers  of 
the  government. 

Commitment  to  the  rule  of  law  is  of  a  piece  with 
concern  to  limit  government,  which  is  the  main 
point  of  constitutionalism.  Liberalism  and  consti¬ 
tutionalism  are  interdependent  defining  features  of 
political  modernity.  If  a  constitution  does  not  limit 
government,  it  is  a  failed  constitution.  In  this 
respect,  yet  again,  constitutionalism,  liberalism, 
and  individualism  are  joined  together.  Already  at 
the  height  of  concern  with  the  rule  of  law  and  lim¬ 
ited  government,  however,  Jeremy  Bentham  con¬ 
cludes  that  government  is  a  close  corporation  with 
a  vested  interest  that  is  potentially  hostile  to  the 
collective  welfare  of  its  society,  a  view  echoed  later 
by  John  C.  Calhoun.  Their  view  is  that  democracy 
in  England  and  America  is  increasingly  turning 
corporatist.  As  is  often  more  generally  true, 
Bentham  is  prescient  on  this  development,  which 
must  have  distressed  him  deeply  because  it  must 
undermine  his  belief  in  democracy  as  essentially 
utilitarian.  This  conclusion  casts  a  pall  over  modern 
democratic  society.  If  democracy  is  not  utilitarian, 
there  is  little  hope  for  a  generally  good  form  of  gov¬ 
ernment.  Bentham’s  great  utilitarian  book,  An 
Introduction  to  the  Principles  of  Morals  and 
Legislation,  virtually  trumpets  the  association  of 
democracy  with  utilitarianism.  The  book  is  unusual 
in  the  identification  of  its  personal  moral  theory  with 
its  political  theory:  These  do  not  live  in  separate 
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worlds,  as  they  otherwise  typically  do  in  most  of 
normative  theory.  Their  separation  is  deplored  by 
David  Hume  and  Jeremy  Bentham. 

Religion 

Protection  of  religious  freedom  is  the  fundamental 
concern  that  has  driven  political  theory  since  well 
before  Hobbes.  In  the  United  States,  the  principal 
obstacle  to  government  enforcement  of  particular 
beliefs  has  been  the  radical  fractioning  of  beliefs, 
which  nicely  sets  up  a  resolution  of  conflicts  over 
religion  in  the  United  States,  where  the  sheer  num¬ 
bers  of  sects  fuels  widespread  opposition  to  select¬ 
ing  any  sect  as  the  national  religion.  The  only 
workable  compromise  in  1787  was  to  rule  out  any 
official  recognition  of  religion  and  especially  to 
rule  against  establishing  a  state  church.  The  prob¬ 
lem  of  religion,  which  led  to  heated  debates  during 
the  constitutional  era  in  the  United  States,  was 
virtually  settled  by  this  provision  of  the  constitu¬ 
tion.  Indeed,  the  first  Supreme  Court  case  on  reli¬ 
gion,  Reynolds  v.  United  States  (1878),  came 
before  the  Court  90  years  after  the  adoption  of  the 
constitution.  This  decision  ruled  against  the 
Mormon  practice  of  polygamy  on  pragmatic 
rather  than  on  religious  grounds.  The  judgment  in 
Reynolds  is  that  under  religious  freedom,  one  may 
believe  whatever  one  may,  but  one  cannot  act  in 
any  way  one  chooses.  It  might  seem  to  be  hard  to 
formulate  an  objection  to  polygamy  from  the 
harm  principle,  but  many  people  do  object  that  the 
practice  of  polygamy  harms  the  status  of  women 
generally. 

Multiculturalism  and  Group  Rights 

Over  the  past  few  decades,  a  multiculturalist 
movement  has  grown.  This  is  both  a  matter  of 
claims  of  particular  groups  and  claims  of  academic 
theorists  who  take  themselves  to  be  defending  the 
cultural  rights  of  various  minority  cultural  groups. 
Liberalism  has  no  resources  for  dealing  with 
groups,  especially  large  groups  well  beyond  the 
scale  of  the  family,  except  to  treat  them  as  indi¬ 
viduals  aggregated  into  groups.  But  the  features  of 
the  aggregations  or  of  the  groups  play  no  role  in 
standard  liberal  treatment  of  them.  Indeed,  stan¬ 
dard  liberal  theory  often  conflicts  with  claims  by 
the  advocates  or  actual  members  of  the  groups 


themselves.  Such  theory  has  invariably  been  framed 
for  individuals,  and  on  the  evidence  of  the  weak, 
supposedly  liberal  arguments  of  academic  defend¬ 
ers  of  the  liberal  nature  of  such  groups,  the  liberal 
theory  cannot  easily  be  recast  for  groups.  The 
greatest  conflict  commonly  arises  from  certain 
groups’  extremely  illiberal  treatment  of  their  own 
children  and  of  women  in  the  groups. 

Among  the  demands  that  such  groups  make  is 
to  limit  their  children’s  education,  often  so  severely 
as  to  cripple  any  chances  those  children  might  have 
to  survive  outside  the  groups.  In  some  cultures, 
girls  are  not  educated  at  all,  and  boys  are  educated 
primarily  in  religious  texts,  such  as  the  Talmud  or 
the  Koran.  Another  common  demand  is  to  have 
public  agencies  and  schools  speak  in  the  native 
languages  of  the  groups.  Apart  from  Spanish,  per¬ 
haps  no  other  language  is  common  enough  across 
the  United  States  for  dual  language  policies  to 
work  at  reasonable  cost. 

A  liberal  perspective  on  subgroups  with  which 
people  are  free  to  identify  within  a  society  is  not 
strategically  analogous  to  standard  individualist 
political  and  economic  liberalism.  Such  a  “group 
liberalism”  violates  the  strategic  logic  of  liberal¬ 
ism.  A  policy  to  maintain  a  cultural  group’s 
autonomy  and  distinctive  norms  is  not  likely  to 
motivate  the  most  politically  important  groups  in 
liberal  societies.  In  particular,  if  a  state  attempts 
to  maintain  a  cultural  subgroup’s  autonomy,  the 
state  takes  the  risk  of  creating  a  politically  influ¬ 
ential  class.  Nor  are  resolutions  of  group  problems 
likely  to  be  self-enforcing,  for  example,  in  the 
ways  in  which  the  old  liberalisms,  once  in  place, 
are  self-enforcing  or  a  workable  constitution  is 
self-enforcing. 

The  greatest  threat  to  the  survival  of  an  immi¬ 
grant  cultural  subgroup’s  ways  and  norms  is  the 
next  generation  of  the  group  itself.  Their  interests 
are  often  not  served  by  the  group’s  static  values 
and  norms.  In  this,  group  liberalism  has  failed.  To 
preserve  the  group’s  character,  the  state  would 
most  likely  have  to  intervene  to  coerce  that  gen¬ 
eration  into  line.  This  coercion  need  not  be  so 
draconian  as  that  of  the  Saudi  Arabian  morality 
enforcement  squads  (the  mutawa),  but  it  would  be 
dispiriting  for  many  people  in  the  group.  It  would 
violate  most  aspects  of  liberalism,  including  the 
harm  principle,  and  very  likely,  if  the  culture  dis¬ 
criminates  by  gender,  it  would  violate  the  rule  of 


Liberalism  1433 


law  that  requires  equal  treatment  under  law.  In 
many  of  the  cases  argued  in  the  recent  explosion 
of  multicultural  arguments,  it  seems  also  likely  to 
violate  individualism  and  religious  freedom. 

At  its  worst,  the  violation  of  religious  freedom 
is  astonishing.  For  example,  in  the  view  of  many 
Islamic  clerics,  apostasy  from  Islam  is  a  capital 
offense.  Therefore,  Salman  Rushdie  was  threat¬ 
ened  with  murder  under  a  fatwa.  It  is  a  remarkable 
and  distressing  feature  of  claims  for  protecting 
culture  or  so-called  group  rights  that  they  com¬ 
monly  do  not  address  the  reality  of  those  claims 
and,  in  particular,  their  violation  of  the  fundamen¬ 
tal  principles  of  liberalism.  One  can  reject  liberal¬ 
ism  in  defense  of  these  claims,  but  it  is  perverse  and 
wrong  to  suppose  that  these  claims  are  or  can  be 
reformulated  in  liberal  terms.  They  cannot  be. 
There  might  be  good  moral  defenses  of  various 
subcultural  group  practices,  but  liberalism  cannot 
be  distorted  enough  to  provide  a  defense  that  mer¬ 
its  the  hallowed  label  liberal. 

Incidentally,  “group  rights”  is  at  least  half  an 
oxymoron.  The  individualism  of  rights  collapses 
under  the  weight  of  a  group.  Moreover,  groups  of 
any  numerical  significance  are  sure  to  be  very 
indistinct  and  poorly  definable.  Does  Rushdie  con¬ 
tinue  to  be  a  member  of  the  community  of  Muslims 
when  he  ceases  to  believe  or  when  Islamic  leaders 
target  him  for  murder? 

Liberalism  in  Moral  Theories 

With  its  individualist  focus,  liberalism  seems 
consonant  with  the  two  leading  moral  theories 
of  our  time:  utilitarianism  and  Kantianism. 
Kantianism  might  require  a  lot  of  groundwork 
to  fit  it  to  what  is  rightly  framed  as  a  consequen- 
tialist  theory.  Immanuel  Kant  famously  asserted 
that  justification  of  actions  from  their  conse¬ 
quences  is  immoral,  but  he  was  not  consistent  in 
this  odd  view  that  would  virtually  rule  out  any 
serious  understanding  or  justification  of  political 
institutions,  whose  usual  purpose  is  to  effect 
good  consequences. 

Writers  from  Bentham  to  Hobhouse  have 
argued  for  a  strong  connection  between  liberal  and 
utilitarian  principles.  Indeed,  Bentham  essentially 
derives  liberalism  from  utilitarianism.  The  harm 
principle  is  readily  seen  as  utilitarian,  as  are  the 
rule  of  law  and  freedom  of  religious  conscience, 


which  are  conceived  of  as  individualist  principles. 
Insofar  as  Kantian  theories  focus  on  the  individual, 
they  too  must  honor  these  principles. 

Individualism  and  liberalism  have  often  been 
associated  with  utilitarianism,  perhaps  merely 
because  Mill  and  others  took  up  all  of  these.  But 
Mill  would  likely  insist  that  they  are  logically  cou¬ 
pled.  Insofar  as  Kantian  theory  is  individualist,  it  too 
tends  to  fit  with  liberalism  through  its  implications 
for  individual  autonomy,  which,  with  liberty,  is  an 
individual  concern.  Other  moral  theories  generally 
do  not  seem  to  generalize  straightforwardly  to  a 
political  theory. 
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Liberalism  in  International 
Relations 


This  entry  presents  an  overview  of  recent  trends 
and  developments  in  liberal  international  relations 
theory — both  empirical  and  normative.  An  effort 
is  made  to  highlight  the  link  between  contempo¬ 
rary  liberal  scholarship  on  international  relations 
and  the  thought  of  classical  liberal  figures  such  as 
John  Locke,  Adam  Smith,  Immanuel  Kant, 
Giuseppe  Mazzini,  and  John  Stuart  Mill.  The  first 
part  of  the  essay  introduces  key  liberal  principles 
and  ideas  and  identifies  three  different  traditions 
of  liberal  thought  on  international  relations. 
Thereafter,  we  discuss  classical  and  contemporary 
theories  on  the  relationship  between  liberal 
democracy  and  international  peace,  followed  by 
an  overview  of  related,  recent  scholarship  on 
global  governance  and  international  cooperation 
among  democracies.  The  final  part  of  the  essay 
briefly  discusses  two  alternative  liberal  approaches 
to  the  ethics  of  military  intervention  and  shows,  in 
particular,  how  liberal  theorists,  while  they  all 
share  a  fundamental  attachment  to  representative 
governance  and  human  rights,  can  fundamentally 
differ  in  their  support  for  coercive  regime  change. 

Basic  Liberal  Principles  and  Institutions 

Liberalism  resembles  a  family  portrait  of  principles 
and  institutions,  recognizable  by  certain  character¬ 
istics — such  as  individual  freedom,  political 
participation,  private  property,  and  equality 
of  opportunity — that  all  liberal  democratic  societies, 
by  definition,  share  to  some  degree.  Political  theo¬ 
rists  identify  liberalism  with  an  essential  principle: 
the  importance  of  the  freedom  of  the  individual. 
Above  all,  this  is  a  belief  in  the  importance  of  moral 
freedom,  of  the  right  to  be  treated  and  a  duty  to 
treat  others  as  ethical  subjects  and  not  as  objects  or 
means  only. 

The  ideal  version  of  liberalism  is  marked  by  a 
shared  commitment  to  four  essential  institutions. 
First,  citizens  possess  juridical  equality  and  other 
fundamental  civic  rights  such  as  freedom  of  reli¬ 
gion  and  the  press.  Second,  the  effective  sovereigns 
of  the  state  are  representative  legislatures  deriving 
their  authority  from  the  consent  of  the  electorate 
and  exercising  their  representative  authority  free 


from  all  restraint  apart  from  the  requirement  that 
basic  civic  rights  be  preserved.  Most  pertinent,  for 
the  impact  of  liberalism  on  foreign  affairs,  the  state 
is  subject  to  neither  the  external  authority  of  other 
states  nor  the  internal  authority  of  special  preroga¬ 
tives  held,  for  example,  by  monarchs  or  military 
bureaucracies  over  foreign  policy.  Third,  the  econ¬ 
omy  rests  on  a  recognition  of  the  rights  of  private 
property,  including  the  ownership  of  means  of 
production.  Property  is  justified  by  individual 
acquisition  (e.g.,  by  labor)  or  by  social  agreement 
or  social  utility.  This  excludes  state  socialism  or 
state  capitalism,  but  it  need  not  exclude  market 
socialism  or  various  forms  of  the  mixed  economy. 
Fourth,  economic  decisions  are  predominantly 
shaped  by  the  forces  of  supply  and  demand, 
domestically  and  internationally,  and  are  free  from 
strict  control  by  bureaucracies. 

Locke,  Smith,  and  Kant:  Three  Pillars 
of  Liberal  Internationalism 

Liberal  internationalism  consists,  at  its  most  funda¬ 
mental  level,  in  the  attempt  to  promote  the  afore¬ 
mentioned  principles  and  institutions  across 
national  borders  and  apply  variations  thereof  to 
international  relations.  The  classical  realists  from 
Thucydides  onward  described  an  international  state 
of  war  that  could  be  mitigated,  but  not  overcome, 
short  of  a  world  Leviathan.  The  classical  liberals, 
with  important  variations,  broke  with  this  skeptical 
tradition  and  announced  the  possibility  of  a  state  of 
peace  among  independent,  sovereign  states. 

Contemporary  scholarship  on  liberalism  and 
international  relations  looks  back  at  three  distinct 
traditions  of  liberalism,  attributable  to  three  groups 
of  theorists:  John  Locke — the  great  founder  of 
modern  liberal  individualism,  who  claimed  that 
states  have  themselves  rights  derived  from  indi¬ 
vidual  rights  to  life  and  liberty  (political  indepen¬ 
dence)  and  property  (territorial  integrity),  thereby 
providing  the  liberal  foundations  of  international 
law;  Adam  Smith,  Baron  de  Montesquieu,  and 
Joseph  Schumpeter — brilliant  explicators  of  com¬ 
mercial  liberalism  and  what  they  saw  as  its  natural 
result,  liberal  pacifism;  and  finally,  Immanuel  Kant 
and  Giuseppe  Mazzini — liberal  republicans  who 
theorized  an  internationalism  that  institutes  peace 
among  fellow  liberal  republics.  The  liberal  repub¬ 
lican  tradition,  while  incorporating  to  some  degree 
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both  liberal  individualism  and  commercial  liberal¬ 
ism,  has  exerted  the  greatest  influence  on  contem¬ 
porary  liberal  international  relations  theory.  It 
argues  that  liberal  democracy  leaves  a  coherent 
international  legacy  on  foreign  affairs:  a  separate 
peace.  Liberal  states  are  peaceful  with  each  other, 
but  they  are  also  prone  to  make  war  on  nonliberal 
states. 

A  Separate  Peace  Among 
Liberal  Democracies 

The  claim  that  liberal  constitutional  states  behave 
differently  in  their  foreign  relations  goes  back  at 
least  as  far  as  Immanuel  Kant  and  Thomas  Paine, 
but  attempts  to  demonstrate  it  empirically  are 
more  recent.  In  the  20th  century,  Clarence  Streit 
(1938)  first  pointed  out  the  tendency  of  modern 
liberal  democracies  to  maintain  peace  among 
themselves,  and  Dean  V.  Babst  (1972)  was  the  first 
to  find  statistical  support  for  the  hypothesis.  Over 
the  past  3  decades,  scholars  have  found  strong 
empirical  evidence  for  the  existence  of  a  separate 
peace  among  liberal  democracies  but  not  between 
democracies  and  nondemocracies.  Critiques  of  the 
separate-peace  proposition  have  focused  largely  on 
the  underlying  causal  argument,  suggesting  that 
the  interdemocratic  peace  might  be  simply  a  by¬ 
product  of  bipolarity  and  related  strategic  alliance 
patterns  during  the  Cold  War  (see,  e.g.,  Henry 
Farber  &  Joanne  Gowa,  1997). 

Michael  Doyle,  in  his  1997  book  Ways  of  War 
and  Peace,  argues  that  two  centuries  of  separate 
peace  among  liberal  democracies  cannot  be  dis¬ 
missed  as  an  epiphenomenon,  or  by-product,  of 
strategic  alliances;  in  fact,  stable  international  alli¬ 
ance  patterns  among  liberal  democracies  appear  to 
be  largely  a  consequence  of  shared  liberal  values 
and  domestic  institutions.  Doyle  develops  an  origi¬ 
nal  explanation  of  the  separate  peace  among  lib¬ 
eral  democracies  based  on  Kant’s  essay  “Perpetual 
Peace.”  In  Doyle’s  interpretation,  Kant’s  hypo¬ 
thetical  peace  treaty  shows  how  liberal  republics 
lead  to  a  dichotomous  international  politics:  peace¬ 
ful  relations — a  pacific  union — among  similarly 
liberal  states  and  a  state  of  war  between  liberals 
and  nonliberals. 

First,  Kant  viewed  the  republic,  based  on  consti¬ 
tutionalism  and  popular  representation,  as  the 
ideal  form  of  government;  he  understood  that 


republican  governments  would  introduce  various 
institutional  restraints  on  foreign  policy  and 
ingrain  the  habit  of  respect  for  individual  rights. 
Of  course,  we  know  today  that  domestic  republi¬ 
can  restraints  do  not  automatically  end  war.  (If 
they  did,  liberal  states  would  not  be  warlike,  which 
is  far  from  the  case.)  Kant  seems  to  have  been  well 
aware  of  this:  He  pointed  out  that  institutional 
restraints  merely  introduce  republican  caution,  or 
hesitation,  in  place  of  monarchical  caprice.  In  line 
with  this  intuition,  modern  democratic  liberalism 
does  not  need  to  assume  either  that  public  opinion 
directly  rules  foreign  policy  or  that  the  entire  gov¬ 
ernmental  elite  is  liberal.  It  can  instead  assume  that 
the  elite  typically  manages  public  affairs  but  that 
potentially  nonliberal  members  of  the  elite  have 
reason  to  doubt  that  illiberal  policies  would  be 
elect  orally  sustained  and  endorsed  by  the  majority 
of  the  democratic  public.  In  other  words,  liberal 
states  fight  only  for  popular,  ostensibly  liberal  pur¬ 
poses  since  elites  need  to  be  constantly  concerned 
about  domestic  support  for  the  war  effort. 

Second,  Kant  foresaw  that  liberal  republics 
would  progressively  establish  peace  among  them¬ 
selves  by  means  of  the  pacific  union  described  in 
his  Second  Definitive  Article  of  Perpetual  Peace. 
Kant  probably  had  in  mind  a  mutual  nonaggres¬ 
sion  pact  or  perhaps  a  collective  security  agree¬ 
ment  with  a  rudimentary  court  of  arbitration. 
Complementing  the  constitutional  guarantee  of 
caution,  international  law  adds  a  second  source — a 
pledge  of  peaceful  respect.  As  republics  emerge 
(the  first  source)  and  as  culture  progresses,  an 
understanding  of  the  legitimate  rights  of  all  citi¬ 
zens  and  of  all  republics  comes  into  play;  and  this, 
now  that  caution  characterizes  policy,  sets  up  the 
moral  foundations  for  the  liberal  peace.  Corres¬ 
pondingly,  international  law  highlights  the  impor¬ 
tance  of  Kantian  publicity.  Domestically,  publicity 
helps  ensure  that  the  officials  of  republics  act 
according  to  the  principles  they  profess  to  hold 
just  and  the  interests  of  the  citizens  they  claim  to 
represent.  Internationally,  free  speech  and  the 
effective  communication  of  accurate  conceptions 
of  the  political  life  of  foreign  peoples  are  essential 
to  establish  and  preserve  the  understanding  on 
which  the  guarantee  of  respect  depends. 

Kant’s  categorical  imperative  of  course  requires 
that  all  statesmen  and  liberal  republics  reject  impe¬ 
rialism  and  international  aggression  on  moral 
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grounds.  But  liberal  republics  cannot  simply  assume 
reciprocal  peace  with  all  other  states;  instead,  they 
understand  that  states  subject  to  international 
anarchy  are  potentially  aggressive.  Only  republics 
tend  to  be  consensual  and  constrained,  and  they 
are  therefore  presumed  capable  by  other  republics 
of  reliable  mutual  accommodation.  The  experience 
of  cooperation  among  republics  helps  engender 
further  cooperative  behavior  when  the  conse¬ 
quences  of  state  policy  are  unclear  but  (potentially) 
mutually  beneficial.  In  short,  fellow  liberals  benefit 
from  a  presumption  of  amity;  nonliberals  suffer 
from  a  presumption  of  enmity.  Both  presumptions 
may  be  accurate.  Each,  however,  may  in  particular 
cases  also  be  self-fulfilling. 

Finally,  Kant’s  cosmopolitan  law,  discussed  in 
his  “Third  Definitive  Article  of  Perpetual  Peace,” 
adds  material  incentives  to  moral  commitments. 
The  cosmopolitan  law  and  the  related  right  to  hos¬ 
pitality  permit  the  spirit  of  commerce  to  take  hold 
of  every  nation  sooner  or  later,  thus  creating  incen¬ 
tives  for  states  to  promote  peace  and  try  to  avert 
war.  Building  on  this  classical  liberal  intuition, 
modern  economic  theory  holds  that  under  a  coop¬ 
erative  international  division  of  labor  and  free 
trade  according  to  comparative  advantage,  each 
national  economy  is  better  off  than  it  would  have 
been  under  autarchy — hence,  each  participant 
acquires  an  incentive  to  solve  disputes  peacefully 
and  avoid  policies  that  would  lead  others  to  break 
mutually  advantageous  economic  ties.  Further¬ 
more,  the  international  market  removes  difficult 
decisions  of  production  and  distribution  from  the 
direct  sphere  of  state  policy.  As  a  result,  a  foreign 
state  does  not  appear  to  be  directly  responsible  for 
unfavorable  economic  outcomes — states  can  stand 
aside  from,  and  to  some  degree  above,  contentious 
market  rivalries  and  be  ready  to  step  in  to  resolve 
crises.  Finally,  the  interdependence  of  commerce 
and  the  related  international  contacts  of  state  offi¬ 
cials  help  create  crosscutting  transnational  ties  that 
serve  as  lobbies  for  mutual  accommodation.  The 
variety  of  ties  among  liberal  states  across  numer¬ 
ous  issue  areas  also  ensures  that  no  single  conflict 
sours  an  entire  relationship  by  setting  off  a  spiral 
of  reciprocated  retaliation. 

In  recent  years,  some  scholars,  such  as  Georg 
Cavallar  and  John  MacMillan,  have  taken  issue 
with  Doyle’s  interpretation  of  Kant  as  the  father  of 
modern  democratic  peace  theory.  According  to 


these  critics,  Kant’s  pacific  union,  the  foedus  paci- 
ficum  outlined  in  his  second  definitive  article,  was 
probably  intended  to  include  all  states  and  not  just 
liberal  republics.  Stefano  Recchia  and  Nadia 
Urbinati  (2009)  go  one  step  further  and  suggest 
that  the  first  to  explicitly  anticipate  the  emergence 
of  a  separate  peace  among  constitutional  democra¬ 
cies,  based  on  a  defensive  pact  of  alliance  against 
despotic  states,  was  not  Kant  but  Giuseppe  Maz- 
zini,  the  19th-century  revolutionary  thinker  and 
democratic  political  activist. 

Against  these  views,  Doyle  holds  that  there  are 
good  reasons  to  view  Kant  as  the  founding  figure 
of  modern  democratic  peace  theory,  and  he  inter¬ 
prets  Kant  as  requiring  that  peace  must  be  estab¬ 
lished  by  a  rightful  constitution  involving  all  three 
definitive  articles.  Most  current  scholarship  on  the 
democratic  peace  focuses  either  exclusively  on  the 
role  of  liberal-democratic  institutions,  liberal 
norms,  or  economic  interdependence.  But  Kantian 
liberal  peace  theory,  as  developed  by  Doyle,  is 
neither  solely  institutional,  nor  solely  ideological, 
nor  solely  economic:  It  is  only  together  that  the 
three  specific  strands  of  liberal  institutions,  liberal 
ideas,  and  the  transnational  ties  that  follow  from 
them  plausibly  connect  the  characteristics  of  lib¬ 
eral  polities  and  economies  with  sustained  liberal 
peace  among  states  that  meet  the  three  criteria 
embedded  in  the  three  definitive  articles.  Statistical 
data  sets  on  the  liberal  peace  do  not  adequately 
code  for  these  three  factors  together.  As  noted  by 
Bruce  Russett  and  John  Oneal,  the  most  thorough 
recent  empirical  test  of  the  liberal  peace  hypothe¬ 
sis  confirms  the  separate  positive  effects  of  demo¬ 
cratic  institutions  and  international  trade  (as  well 
as  membership  in  international  organizations), 
but  it  does  not  separately  code  for  liberal  norms 
and  related  interdemocratic  trust,  which  may 
indeed  be  difficult  to  measure  through  quantita¬ 
tive  analysis. 

Global  Governance  and  Cooperation 
Among  Democracies 

Classical  liberals  such  as  Bentham,  Kant,  and 
Mazzini  anticipated  that  international  institutions 
(especially  arbitration  courts  but  also  more 
advanced  international  federations  with  their  own 
parliamentary  assemblies)  would  reduce  uncer¬ 
tainty  and  improve  mutual  trust  among  states, 
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thereby  attenuating  the  security  dilemma  and 
actively  promoting  international  cooperation  and 
world  peace.  In  recent  decades,  international  rela¬ 
tions  theorists  have  systematically  developed  and 
corroborated  this  intuition. 

Relying  on  new  insights  from  game  theory, 
scholars  during  the  1980s  and  1990s  emphasized 
that  so-called  international  regimes,  consisting  of 
agreed-on  international  norms,  rules,  and  deci¬ 
sion-making  procedures,  can  help  states  effec¬ 
tively  coordinate  their  policies  and  collaborate  in 
the  production  of  international  public  goods,  such 
as  free  trade,  arms  control,  and  environmental 
protection.  Especially,  if  embedded  in  formal  mul¬ 
tilateral  institutions,  such  as  the  World  Trade 
Organization  (WTO)  or  North  American  Free 
Trade  Agreement  (NAFTA),  regimes  crucially 
improve  the  availability  of  information  among 
states  in  a  given  issue  area,  thereby  promoting 
reciprocity  and  enhancing  the  reputational  costs 
of  noncompliance.  As  noted  by  Robert  Keohane, 
institutionalized  multilateralism  also  reduces  stra¬ 
tegic  competition  over  relative  gains  and  thus 
further  advances  international  cooperation. 

Most  international  regime  theorists  accepted 
Kenneth  Waltz’s  (1979)  neorealist  assumption  of 
states  as  black  boxes — that  is,  unitary  and  rational 
actors  with  given  interests.  Little  or  no  attention 
was  paid  to  the  impact  on  international  coopera¬ 
tion  of  domestic  political  processes  and  dynamics. 
Likewise,  regime  scholarship  largely  disregarded 
the  arguably  crucial  question  of  whether  pro¬ 
longed  interaction  in  an  institutionalized  interna¬ 
tional  setting  can  fundamentally  change  states’ 
interests  or  preferences  over  outcomes  (as  opposed 
to  preferences  over  strategies),  thus  engendering 
positive  feedback  loops  of  increased  overall  coop¬ 
eration.  For  these  reasons,  international  regime 
theory  is  not,  properly  speaking,  liberal,  and  the 
term  neoliberal  institutionalism  frequently  used  to 
identify  it  is  somewhat  misleading. 

It  is  only  over  the  past  decade  or  so  that  liberal 
international  relations  theorists  have  begun  to  sys¬ 
tematically  study  the  relationship  between  domes¬ 
tic  politics  and  institutionalized  international  coop¬ 
eration  or  global  governance.  This  new  scholarship 
seeks  to  explain  in  particular  the  close  international 
cooperation  among  liberal  democracies  as  well  as 
higher-than-average  levels  of  delegation  by  democ¬ 
racies  to  complex  multilateral  bodies,  such  as  the 


European  Union  (EU),  North  Atlantic  Treaty 
Organization  (NATO),  NAFTA,  and  the  WTO 
(see,  e.g.,  John  Ikenberry,  2001;  Helen  Milner  & 
Andrew  Moravcsik,  2009).  The  reasons  that  make 
liberal  democracies  particularly  enthusiastic  about 
international  cooperation  are  manifold:  First, 
transnational  actors  such  as  nongovernmental 
organizations  and  private  corporations  thrive  in 
liberal  democracies,  and  they  frequently  advocate 
increased  international  cooperation;  second, 
elected  democratic  officials  rely  on  delegation  to 
multilateral  bodies  such  as  the  WTO  or  the  EU  to 
commit  to  a  stable  policy  line  and  to  internation¬ 
ally  lock  in  fragile  domestic  policies  and  constitu¬ 
tional  arrangements;  and  finally,  powerful  liberal 
democracies,  such  as  the  United  States  and  its 
allies,  voluntarily  bind  themselves  into  complex 
global  governance  arrangements  to  demonstrate 
strategic  restraint  and  create  incentives  for  other 
states  to  cooperate,  thereby  reducing  the  costs  for 
maintaining  international  order. 

Recent  scholarship,  such  as  that  of  Charles 
Boehmer  and  colleagues,  has  also  confirmed  the 
classical  liberal  intuition  that  formal  international 
institutions,  such  as  the  United  Nations  (UN)  or 
NATO,  independently  contribute  to  peace,  espe¬ 
cially  when  they  are  endowed  with  sophisticated 
administrative  structures  and  information-gathering 
capacities.  In  short,  research  on  global  governance 
and  especially  on  the  relationship  between  democ¬ 
racy  and  international  cooperation  is  thriving,  and 
it  usefully  complements  liberal  scholarship  on  the 
democratic  peace. 

The  Ethics  of  Military  Intervention: 

Should  Liberal  Democracy  Be  Imposed? 

Liberal  thinkers  on  international  relations  have 
always  displayed  a  keen  interest  in  the  ethical 
dimension  of  foreign  policy,  based  on  the  assump¬ 
tion  that  ideas,  as  well  as  material  interests,  ulti¬ 
mately  determine  state  behavior.  Thus,  questions 
about  the  admissibility  and  desirability  of  military 
intervention  to  spread  or  uphold  liberal  values 
abroad  were  central  to  the  political  thought  of 
seminal  figures,  such  as  Kant,  Mazzini,  and  Mill. 
The  classical  realists,  for  their  part,  did  not  neces¬ 
sarily  dismiss  normative  concerns  entirely  (unlike 
their  contemporary  followers);  yet  they  were  skep¬ 
tical  about  the  possibility  for  moral  behavior  in  an 


1438  Liberalism  in  International  Relations 


anarchical  environment  where  state  survival  was 
assumed  to  be  constantly  at  stake. 

Contemporary  liberal  theory  on  military  inter¬ 
vention  consciously  builds  on  the  classics.  At  the 
risk  of  oversimplification,  one  can  identify  two 
groups  of  liberal  scholars  in  the  ongoing  normative 
debate  on  military  intervention  and  regime  change: 
cosmopolitan  interventionists,  on  the  one  hand, 
and  liberal  internationalists,  on  the  other. 

Cosmopolitan  interventionists  typically  build 
on  Kant’s  moral  theory,  but  they  only  loosely  fol¬ 
low  his  political  thought.  They  assert  that  every¬ 
one  who  has  the  ability  to  intervene  militarily  in 
the  face  of  systematic  human  rights  violations  also 
has  a  moral  duty  to  do  so,  subject  to  criteria  of 
effectiveness  and/or  proportionality.  For  cosmo¬ 
politans,  if  a  state  is  tyrannical  and  systematically 
oppresses  its  own  population,  it  “forfeits  any 
respect  for  its  independence.”  As  noted  by  Brian 
Barry  (1998),  by  implication,  “international  [mili¬ 
tary]  intervention  to  displace  the  government  and, 
if  necessary,  place  the  country  under  international 
trusteeship”  (p.  160)  is  always  prima  facie  morally 
justified  and  indeed  required,  although  prudential 
considerations  might  ultimately  counsel  against 
the  use  of  force.  (See  also  David  Luban,  1980.) 

Liberal  internationalists,  on  the  other  hand, 
have  tended  to  place  greater  value  on  state  sover¬ 
eignty  and  the  attendant  international  duty  of 
nonintervention.  Kant  favored  absolute  noninter¬ 
vention  as  a  matter  of  principle:  He  thought  it 
necessary  to  stabilize  international  relations  and  to 
ensure  that  each  political  community  could  freely 
determine  its  own  way  of  life.  Mazzini  and  Mill 
were  not  categorically  opposed  to  military  inter¬ 
vention  (e.g.,  they  justified  it  to  end  protracted 
civil  wars  and  to  save  helpless  populations  from 
outright  slaughter);  yet  they  vigorously  opposed 
the  use  of  force  for  the  purpose  of  promoting  lib¬ 
erty  and  democracy  more  generally.  They  sensed 
that  unless  tyranny  was  defeated  domestically, 
with  economic  and  diplomatic  assistance  from  the 
outside  but  crucially  without  foreign  military 
intervention,  any  liberty  achieved  would  remain 
exceedingly  fragile  and  could  be  hardly  sustained 
in  the  long  run. 

Contemporary  liberal  internationalists  such  as 
Michael  Walzer  (1977)  and  John  Rawls  (1999) 
typically  justify  (but  contrary  to  the  cosmopolitan 
interventionists  do  not  require)  humanitarian 


military  intervention  as  a  last  resort  in  the  face  of 
the  worst  human  rights  violations,  such  as  state- 
sponsored  slaughter  or  genocide,  suggesting  that 
sovereignty  can  be  disregarded  under  similar  cir¬ 
cumstances.  But  they  crucially  insist  that  military 
intervention  ought  to  be  multilaterally  authorized 
and  overseen,  ideally  by  the  UN  Security  Council, 
if  it  is  to  be  legitimate.  The  underlying  assumption 
is  that  collective  authorization  and  oversight 
reduce  the  risk  of  usurpation  by  powerful  states 
(Doyle,  2006).  Most  contemporary  liberal  interna¬ 
tionalists  follow  their  classical  forebears  and  reject 
policies  of  forcible  democratization  on  both  prin¬ 
cipled  and  consequentialist  grounds.  Democratic 
transformation  is  best  fostered  peacefully  and  indi¬ 
rectly  through  trade,  investment,  and  foreign  aid. 
These  can  help  diversify  societies,  and  diversified, 
growing  societies  tend  to  demand  responsive  gov¬ 
ernance  in  the  long  run. 

Finally,  most  contemporary  liberals  agree  that 
becoming  a  democracy  is  hardly  a  cure-all. 
Research  suggests  that  overall  and  on  average,  the 
diffusion  and  consolidation  of  liberal  democracy 
within  countries  reduces  the  chances  of  both  inter¬ 
national  and  civil  war.  However,  there  is  also  evi¬ 
dence  that  transitions  to  democracy  often  produce 
political  turmoil  at  the  domestic  level,  unless  they 
are  carefully  managed.  Where  the  rule  of  law  and 
public  institutions  are  weak,  political  elites  will  be 
tempted  to  use  nationalist  rhetoric  and  violence  to 
achieve  and  hold  office,  which  may  result  in  inter¬ 
national  or  civil  war.  Furthermore,  as  Doyle 
(1983)  pointed  out,  the  very  respect  for  individual 
rights  and  shared  commercial  interests  that  estab¬ 
lish  grounds  for  peace  among  liberal  democracies 
may  establish  grounds  for  additional  conflict  in 
relations  between  liberal  and  nonliberal  societies. 
Evidence  of  this  can  be  found  today  in  relations 
between  the  United  States  and  its  liberal  allies,  on 
the  one  hand,  and  a  resurgent  Russia,  emerging 
China,  or  defiant  Iran,  on  the  other.  In  short,  lib¬ 
eral  internationalism  is  no  recipe:  It  merely  offers 
a  set  of  normative  guidelines  and  empirical  hypoth¬ 
eses — some  of  which  are  indeed  supported  by  solid 
evidence — and  it  needs  constant,  prudent  vigilance 
to  avoid  crusades  and  misguided  interventions. 

Michael  Doyle  and  Stefano  Recchia 
Columbia  University 
New  York  City,  New  York,  United  States 
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Liberalization 


Liberalization  is  a  process  that  reduces  state  control 
over  the  lives  of  persons  subject  to  the  authority  of 
a  state.  It  may  have  both  economic  and  political 
dimensions.  Economic  liberalization  reduces  state 
intervention  in  the  marketplace.  Political  liberaliza¬ 
tion  expands  individual  liberty  and  rights,  includ¬ 
ing  the  right  to  speak  freely  against  state  authorities 
and  to  organize  with  others  to  oppose  those 
authorities.  Economic  and  political  liberalization 
may  or  may  not  go  together.  Political  liberalization 
may  or  may  not  lead  to  democratization,  which 
also  enables  a  broadly  inclusive  electorate  to  unseat 
an  incumbent  government. 

The  concept  of  liberalization  must  be  under¬ 
stood  in  the  context  of  liberalism,  the  dominant 
modern  political  philosophy.  Liberalism  first 
emerged  in  the  17th  century  as  a  challenge  to  the 
notion  that  monarchs  had  God-given,  absolute 
authority.  Thomas  Hobbes  defended  absolute 
authority  but  grounded  it  not  in  divine  will  but 
rather  in  the  hypothetical  agreement  of  the  sub¬ 
jects  to  yield  entirely  to  a  sovereign  their  natural 
rights  to  defend  life  and  property. 

John  Locke  rejected  Hobbes’s  argument.  While 
agreeing  that  governmental  authority  is  indeed 
grounded  in  the  consent  of  the  governed,  Locke 
held  that  people  would  leave  the  state  of  nature 
and  set  up  a  commonwealth  only  if  they  could 
thereby  protect  their  natural  rights  to  life,  liberty, 
and  property.  Rather  than  cede  their  natural  rights 
to  a  sovereign,  the  people  became  the  sovereign  by 
virtue  of  the  social  contract  through  which  they 
established  the  commonwealth.  Monarchs  were  no 
more  than  magistrates  who  could  be  removed  by 
the  sovereign  people  if  they  failed  to  protect  natu¬ 
ral  rights. 

Jean-Jacques  Rousseau,  in  turn,  rejected  Locke’s 
emphasis  on  individual  rights,  returning  instead  to 
the  Hobbesian  concept  of  ceding  natural  rights  to 
an  absolute  sovereign.  But  Rousseau  also  rejected 
Hobbes’s  idea  of  a  sovereign  separate  from  the 
people.  He  envisioned  the  whole  people,  acting 
together,  a  radically  democratic  polity  in  which 
individual  rights  had  no  place.  Locke’s  liberalism 
was  thus  bracketed  by  two  absolutisms. 

Liberalism  after  Locke  remained  committed  to 
protecting  individual  liberty,  but  it  left  behind  the 
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conventional  device  of  the  social  contract.  From 
the  late  18th  through  the  19th  centuries,  liberals 
developed  the  idea  of  utility,  or  usefulness,  as  the 
central  tool  for  discerning  the  good.  Jeremy 
Bentham  produced  the  most  systematic  formula¬ 
tion  of  utilitarianism  as  a  means  of  judging  what  is 
good  and  bad.  Adam  Smith  developed  the  quintes¬ 
sential  defense  of  the  free  market  as  the  best  way 
to  maximize  productivity  in  the  economic  sphere. 
John  Stuart  Mill,  the  most  influential  liberal  of  the 
19th  century,  elaborated  utilitarianism  as  the 
foundation  for  a  classic  defense  of  individual  lib¬ 
erty  as  well  as  for  the  enfranchisement  of  workers 
and  women  in  a  representative  government. 

Liberalism  began  with  the  project  of  defending 
individual  liberties  against  encroachment  by  the 
state.  The  American  Declaration  of  Independence 
is  a  classic  statement  of  that  sort  of  liberalism. 
Increasingly,  though,  liberalism  was  also  con¬ 
cerned  with  the  growing  problem  of  democracy. 
The  American  Constitution,  taking  the  form  of  an 
explicit  contract  among  the  people  to  set  up  a  gov¬ 
ernment,  is  just  as  concerned  with  protection  from 
majority  tyranny  as  it  is  with  guarding  against 
individual  or  oligarchic  despotism.  And  John 
Stuart  Mill,  for  all  his  advocacy  of  expanded  suf¬ 
frage,  feared  the  tyranny  of  an  unenlightened  mass. 

Liberalism  thus  betrays  a  fundamental  tension. 
It  denies  God-given  or  traditional  rank  and  privi¬ 
lege  and  posits  the  equality  of  all  persons,  yet 
enshrines  a  free  marketplace  that  leads  inexorably 
to  capitalism  and  growing  inequality  of  power  and 
of  wealth.  Political  liberalism  presumes  equality 
and  opens  the  way  to  democracy  but  fears  the 
threat  to  liberty  posed  by  majority  rule. 

Liberalization,  then,  should  be  seen  as  move¬ 
ment  toward  liberalism,  complete  with  the  contra¬ 
dictions  and  tensions  we  have  just  explored.  Its 
contemporary  usage  in  political  science  may  pertain 
either  to  the  adoption  of  economic  policies  that 
reduce  state  intervention  or  to  the  opening  of 
political  processes  to  broader  exercise  of  individual 
rights  and  liberties. 

One  major  controversy  concerns  the  extent  of 
the  association  between  economic  and  political 
liberalization.  During  the  1980s  and  1990s,  advo¬ 
cates  of  the  Washington  Consensus  (the  set  of 
economic  policies  to  be  implemented  by  govern¬ 
ment  in  countries  that  are  in  a  situation  of  eco¬ 
nomic  crisis  and  recommended  by  international 


institutions  such  as  the  International  Monetary 
Fund  and  the  World  Bank,  both  of  them  based  in 
Washington,  D.C.)  expected  that  adoption  of 
stringent  economic  liberalization  (i.e.,  neoliberal- 
ism)  would  lead  inevitably  to  political  liberaliza¬ 
tion  and  democratization.  This  expectation  was 
supported  by  the  strong  historical  correlation 
between  capitalist  economic  development  and 
political  democracy. 

Nevertheless,  recent  evidence  on  this  issue  is  far 
from  conclusive.  The  heyday  of  the  Washington 
Consensus  did  coincide  with  a  substantial  wave  of 
democratization  during  the  same  period.  On  the 
other  hand,  a  good  case  can  be  made  that  stringent 
economic  liberalization  imposes  severe  stresses  on 
the  society  and  thereby  challenges  the  incumbent 
government.  Many  of  those  incumbent  govern¬ 
ments  in  the  1980s  and  1990s  were  authoritarian; 
implementing  economic  liberalization  tended  to 
increase  opposition  and  bring  pressure  for  liberal¬ 
ization  and  eventual  democratization.  However,  in 
other  cases,  such  as  the  newly  democratic  Russia  in 
the  1990s,  economic  liberalization  promoted 
renewed  movement  in  an  authoritarian  direction. 

Two  outstanding  examples  of  rapid  economic 
development  strongly  suggest  that  there  is  no  asso¬ 
ciation  between  economic  and  political  liberaliza¬ 
tion.  Singapore  has  had  a  vigorous  capitalist 
economy  since  independence  while  maintaining  an 
authoritarian  regime  with  relatively  low  levels  of 
repression  but  no  serious  challenge  to  the  regime. 
And  China,  since  the  death  of  Mao  Zedong  in 
1978,  has  seen  a  hugely  successful  economic  liber¬ 
alization  coupled  with  maintenance  of  authoritar¬ 
ian  control  by  the  Chinese  Communist  Party,  with 
no  significant  political  liberalization. 

A  second  issue  concerns  the  relationship 
between  liberalization  and  democratization.  The 
earliest  democracies  (the  United  States,  the 
United  Kingdom,  and  France)  all  went  from 
political  liberalization  to  democratization  over 
decades.  The  same  may  be  said  about  some  later- 
emerging  democracies,  such  as  Costa  Rica,  South 
Korea,  or  Taiwan.  Even  in  these  cases  of  success¬ 
ful  democratization,  liberalization  was  some¬ 
times  used  by  incumbent  rulers  in  attempts  to 
reduce  pressure  for  democratization,  as  in  Costa 
Rica  in  the  1940s.  Where  democratization  was 
successful,  it  was  often  in  spite  of  such  prophy¬ 
lactic  liberalization. 
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There  have  also  been  successful  cases  of  pro¬ 
phylactic  liberalization — notably  Mexico  and 
Botswana.  Mexico  arguably  had  the  most  success¬ 
ful  authoritarian  regime  of  the  20th  century,  with 
one  ruling  party  (the  Institutional  Revolutionary 
Party,  PRI)  in  power  for  more  than  70  years.  The 
regime  used  repression  when  it  had  to,  but  funda¬ 
mentally,  the  party’s  survival  depended  on  a 
facade  of  regular  elections,  which  were  invariably 
won  by  the  PRI  without  egregiously  obvious  fraud, 
and  substantial  freedom  of  speech  and  press.  Only 
toward  the  end  of  its  rule,  from  1988  to  2000,  did 
it  become  obvious  that  the  party  had  to  use  fraud 
and  coercion  to  win.  It  was  at  that  point  that  the 
PRI  finally  lost  the  presidency  to  an  opposition 
candidate. 

A  similar  pattern  may  be  seen  in  Botswana, 
which  was  long  cited  as  Africa’s  most  successful 
democracy  (until  the  South  African  transition  to 
majority  rule  in  1996).  As  in  Mexico,  the  same 
party  has  ruled  the  country  for  decades  (since  inde¬ 
pendence  in  1966),  but  it  operates  through  a 
regime  with  regular  elections  and  substantial 
political  liberties.  In  Botswana,  the  ruling  party 
keeps  winning  because  there  is  simply  no  viable 
opposition  alternative.  Fraud  and  repression  play  a 
smaller  role  here  than  in  Mexico,  but  Botswana  is 
still  a  one-party-dominant,  liberalized  regime  and 
not  a  full-fledged  democracy. 

A  final  issue  concerns  the  variables  that  pro¬ 
mote  or  retard  political  liberalization.  A  substan¬ 
tial  literature  on  transitions  from  authoritarian  to 
democratic  regimes  focuses  on  the  conditions  of 
liberalization.  A  key  question  is  the  balance 
between  internal  and  external  pressures.  An 
authoritarian  regime  that  faces  no  organized  inter¬ 
nal  opposition  is  extremely  unlikely  to  liberalize, 
much  less  democratize.  On  the  other  hand,  an 
opposition  that  is  strong  and  unwilling  to  compro¬ 
mise  with  the  regime  may  lead  the  latter  to  dig  in 
and  refuse  to  liberalize.  As  Adam  Przeworski  for¬ 
mulated  the  problem,  liberalization  is  most  likely 
when  there  are  powerful  regime  elements  that 
incline  toward  liberalizing  reforms  as  a  means  of 
keeping  power,  while  powerful  opposition  ele¬ 
ments  favor  negotiating  with  the  regime  for  liber¬ 
alization,  hoping  that  greater  political  openness 
will  lead  to  democratization.  An  inclination  toward 
reform  may  result  from  deteriorating  economic 
conditions  or  other  chronic  problems  that  render 


the  authoritarian  regime  less  secure.  In  such  a  situ¬ 
ation,  regime  reformers  will  favor  liberalization  as 
a  prophylactic  against  democratization,  while 
opposition  moderates  will  see  it  as  a  step  toward 
democratization.  Each  side  may  well  think  it  has 
deceived  and  used  the  other,  but  they  can  nonethe¬ 
less  agree  on  moves  toward  liberalization. 

Liberalization  may  also  be  a  response  to  exter¬ 
nal  pressures.  Samuel  Huntington  showed  that 
there  have  been  three  major  global  waves  of 
democratization  since  the  early  1 9th  century,  sepa¬ 
rated  by  periods  in  which  democracy  was  in 
retreat.  The  waves  were  powered  by  great  power 
policies  in  favor  of  democracy  as  well  as  political 
contagion  from  neighboring  countries.  For  exam¬ 
ple,  the  Third  Wave,  running  from  the  1970s  to 
the  1990s,  saw  the  United  States  and  its  European 
allies  strongly  pushing  for  democratization  of 
authoritarian  regimes  in  Latin  America  in  the 
1980s  and  in  Eastern  Europe  in  the  1990s.  Facing 
such  pressure,  even  authoritarian  regimes  that  did 
not  face  powerful  internal  opposition  were  none¬ 
theless  constrained,  at  a  minimum,  to  liberalize. 

In  conclusion,  liberalization  is  a  movement 
toward  either  economic  or  political  liberalism, 
enacted  by  an  incumbent  government.  Economic 
liberalization  is  typically  enacted  under  outside 
pressure,  with  the  hope  that  a  relatively  unfettered 
market  will  benefit  all,  even  the  least  advantaged. 
Political  liberalization  may  respond  to  internal 
pressure  or  external  pressure  and  is  typically 
enacted  in  hopes  of  heading  off  full-fledged  democ¬ 
ratization.  More  often  than  not,  however,  liberal¬ 
ization  opens  the  way  to  democratization. 

John  Peeler 
Bucknell  University 
Lewisburg,  Pennsylvania,  United  States 
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Libertarianism  refers  to  a  normative  political 
theory  that  gives  top  priority  to  the  value  of  free¬ 
dom  of  choice  over  other  competing  political 
values;  moreover,  libertarianism  understands  a 
person  to  possess  freedom  of  choice  so  long  as  no 
other  agent  coercively  interferes  with  his  or  her 
choices.  Since  the  state  characteristically  acts  by 
defining  laws  and  coercively  enforcing  them,  lib¬ 
ertarians’  hostility  to  coercion  typically  leads 
them  to  conclude  that  only  a  very  minimal  state  is 
legitimate — namely,  a  state  whose  only  purposes 
are  to  protect  citizens  against  acts  of  coercion 
(murder,  assault,  theft,  and  so  on)  and  acts  of 
fraud  in  a  system  of  free  enterprise.  As  a  result, 
libertarians  regard  the  modern  welfare  state  to  be 
illegitimate. 

Defined  in  this  way,  libertarianism  names  a 
family  of  political  theories  rather  than  a  single 
theory;  diversity  among  libertarian  political  theo¬ 
ries  arises  depending  on  just  how  strong  a  prior¬ 
ity  is  given  to  the  value  of  freedom  of  choice  (a 
stronger  priority  tending  to  push  libertarianism 
in  the  direction  of  anarchist  political  theories, 
such  as  anarcho-capitalism).  Diversity  among 
libertarian  political  theories  also  arises  depend¬ 
ing  on  the  type  of  argument  a  libertarian  uses  to 
justify  assigning  this  priority  to  freedom  of 
choice.  Some  libertarians  justify  this  priority  in  a 
consequentialist  fashion — that  is,  by  appealing  to 


the  beneficial  consequences  (understood  in  terms  of 
happiness  or  efficiency)  of  allowing  individuals  the 
freedom  to  act  on  their  choices.  By  contrast,  non- 
consequentialist  libertarians  adopt  a  rights-based 
approach  to  justifying  libertarianism:  Some  regard 
a  natural  right  to  freedom  of  choice  to  be  an  intui¬ 
tively  obvious  moral  truth,  whereas  others  purport 
to  derive  this  right  from  an  even  more  basic  moral 
principle,  such  as  a  principle  of  self-ownership  (i.e., 
full  ownership  of  one’s  body  and  labor).  These  dif¬ 
ferent  foundational  choices  will  in  some  cases  lead 
to  differences  in  practical  recommendations  (e.g., 
regarding  how  to  treat  economic  monopolies,  how 
to  raise  funds  for  legitimate  government  activities, 
and  how  to  deal  with  pollution  and  other  environ¬ 
mental  issues). 

Historically,  many  libertarians  trace  their  roots 
back  to  the  seminal  writings  of  John  Locke,  in 
particular  his  Second  Treatise  on  Government 
(1689),  a  founding  text  of  liberalism,  which  from 
a  principle  of  self-ownership  argues  in  favor  of 
strong  rights  to  property  and  in  favor  of  limited 
government  (though  it  is  a  matter  of  scholarly 
dispute  exactly  how  limited  a  Lockean  state  would 
be).  In  light  of  this  claimed  historical  pedigree  and 
further  historical  connections  with  past  influential 
thinkers,  such  as  Adam  Smith,  some  libertarians 
prefer  to  call  themselves  classical  liberals,  as  dis¬ 
tinct  from  egalitarian  liberals,  who  depart  from 
classical  liberalism’s  strong  defense  of  free  mar¬ 
kets  and  property  rights.  Indeed,  some  libertarians 
refuse  to  apply  the  term  liberal  to  anyone  but  clas¬ 
sical  liberals,  regarding  egalitarian  liberal  as  a  cor¬ 
ruption  of  the  term  insofar  as  it  envisions  an 
expansive  role  for  the  state;  meanwhile,  some 
egalitarian  liberals  return  the  favor  by  refusing  to 
consider  libertarians  as  liberals,  on  the  grounds 
that  libertarianism  in  principle  permits  economic 
inequalities  of  such  a  size  as  to  make  a  mockery 
of  liberalism’s  core  commitment  to  equal  citizen¬ 
ship.  What  can  be  safely  said  is  that  libertarians 
and  egalitarian  liberals  share  both  the  core  lib¬ 
eral  commitment  to  constitutionalism,  under¬ 
stood  as  government  under  the  rule  of  law,  and 
the  core  liberal  commitment  to  the  robust  protec¬ 
tion  of  civil  liberties  such  as  freedom  of  expres¬ 
sion,  religion,  and  association.  In  this  sense,  both 
libertarians  and  egalitarian  liberals  are  heirs  to 
the  intellectual  tradition  that  has  its  roots  in 
thinkers  such  as  Locke. 
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Natural  Rights  Libertarianism 

Perhaps  reflecting  libertarianism’s  Lockean  heri¬ 
tage,  the  most  familiar  form  of  libertarianism  is  the 
form  (mentioned  above)  that  appeals  to  a  natural 
right  to  freedom  of  choice  grounded  in  a  moral 
principle  of  self-ownership,  according  to  which  we 
have  a  property  right  to  our  body  and  labor.  Those 
who  accept  such  a  principle  of  self-ownership  find 
it  appealing  for  its  ability  to  explain,  in  one  fell 
swoop,  the  wrongness  of  phenomena  such  as  slav¬ 
ery,  murder,  rape,  and  other  forms  of  bodily 
assault  as  well  as  the  wrongness  of  horrifying 
hypothetical  cases  such  as  the  forced  harvesting  of, 
say,  kidneys  and  eyes  in  order  to  meet  the  need  for 
donor  organs.  Indeed,  this  principle  establishes  an 
extremely  strong  moral  presumption  against  any 
use  of  nonconsensual  physical  force,  the  one  per¬ 
missible  exception  being  said  to  be  the  use  of  non¬ 
consensual  physical  force  to  prevent  or  punish 
violations  of  the  self-ownership  principle  itself.  On 
this  basis,  libertarians  conclude  that  governments 
must  limit  their  functions  exclusively  to  the  pre¬ 
vention  of  force  and  fraud.  (Fraud  is  understood  as 
the  breaking  of  a  contract.  Libertarians  do  not 
judge  the  coercion  inherent  in  enforcing  contracts 
to  be  objectionable  since  in  making  a  contract, 
signatories  have  consented  to  be  liable  to  coercion 
in  case  of  noncompliance.)  A  government  that 
adopts  functions  beyond  the  prevention  of  force 
and  fraud  violates  the  moral  presumption  against 
coercion.  Thus,  for  instance,  a  government  that 
bans  the  consumption  of  recreational  drugs  has 
violated  the  principle  of  self-ownership,  which 
permits  individuals  to  do  to  their  bodies  whatever 
they  please,  so  long  as  they  do  not  violate  others’ 
self-ownership  rights. 

Less  obvious,  but  no  less  true,  according  to  lib¬ 
ertarians,  is  the  claim  that  individuals’  self¬ 
ownership  rights  are  violated  when  a  government 
builds  a  park  or  library  or  provides  some  other 
public  good.  For  provision  of  these  goods  requires 
resources,  and  since  a  state  acquires  its  resources 
via  coercive  taxation,  libertarians  argue  that  such 
state-supplied  goods  in  effect  conscript  citizens  into 
working  to  supply  such  goods,  whether  they  desire 
them  or  not.  Hence,  the  famous  claim  of  Robert 
Locke,  one  of  the  20th  century’s  best  known  liber¬ 
tarians,  that  “taxation  of  earnings  from  labor  is  on 
a  par  with  forced  labor”  (Robert  Nozick,  1974, 
p.  169).  Along  similar  lines,  libertarians  frequently 


denounce  taxation  as  a  form  of  state-sponsored 
theft,  at  least  when  the  taxes  are  used  for  ends 
other  than  the  prevention  of  force  and  fraud. 

This  condemnation  of  taxation  as  a  form  of 
theft,  however,  presumes  that  a  person  has  a  natu¬ 
ral  property  right  to  retain  in  full  whatever  he  or 
she  comes  to  possess  as  a  result  of  economic  trans¬ 
actions  or  gift  giving — a  right  that  is  violated  when 
a  state  appropriates  a  portion  of  these  possessions 
for  its  own  purposes.  In  response,  critics  of  liber¬ 
tarianism  argue  that  a  natural  property  right  to 
external  possessions  does  not  straightaway  follow 
from  rights  of  ownership  to  one’s  body  and  labor. 
And  if  in  counterreply  libertarians  assert  that  own¬ 
ing  one’s  labor  entitles  one  to  sell  it  in  exchange  for 
some  external  good  (such  as  money),  critics  will 
insist  that  this  merely  pushes  the  question  back  a 
level,  for  it  must  be  asked  how  the  person  from 
whom  one  acquired  the  external  good  came  him¬ 
self  or  herself  to  possess  a  property  right  to  that 
good.  Perhaps  the  answer  is  that  this  person 
acquired  it  in  exchange  with  an  even  earlier  pos¬ 
sessor.  But  then  the  same  question  can  be  asked  of 
the  earlier  possessor.  And  so  on.  It  follows  that  a 
chain  of  economic  and  gift  transactions  must 
stretch  back  into  time  and  at  some  point  terminate 
in  an  act  whereby  a  person  comes  to  acquire  some 
previously  unowned  resource  (say,  a  hitherto  unin¬ 
habited  piece  of  land).  According  to  critics  of  lib¬ 
ertarianism,  this  act  of  original  acquisition  is  left 
unexplained  by  a  principle  of  self-ownership,  for  it 
is  unclear  how  ownership  of  one’s  body  and  labor 
could  create  a  title  to  some  previously  unowned 
external  good  distinct  from  one’s  body. 

Undoubtedly,  the  most  famous  attempt  to  use 
the  principle  of  self-ownership  to  justify  the  origi¬ 
nal  acquisition  of  unowned  resources  is  Locke’s 
own  attempt.  In  what  has  come  to  be  known  as  the 
labor-mixing  argument,  Locke  in  essence  argued 
that  since  you  own  your  labor,  it  follows  that  when 
you  mix  your  labor  with  some  previously  unowned 
resource  (tilling  the  soil,  say,  or  gathering  apples 
from  a  tree),  you  come  to  own  the  resulting  mix¬ 
ture  (cultivated  land,  a  bushel  of  apples,  etc.). 
Well-known  objections  to  this  argument  exist, 
however;  Locke  himself  worried  about  excessive 
acquisition  and  insisted  in  response  to  this  worry 
that  an  act  of  acquisition  must  leave  “enough  and 
as  good”  of  unowned  resources  for  others  to 
acquire.  Assessing  the  labor-mixing  argument  as 
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well  as  the  more  general  challenge  of  justifying  the 
original  acquisition  of  unowned  goods  remains  an 
active  area  of  debate.  Indeed,  within  this  debate,  a 
group  of  political  theorists  who  call  themselves 
left -libertarians  has  arisen.  Such  theorists  accept 
the  principle  of  self-ownership  of  one’s  body  and 
labor  but  argue  that  external  goods  initially  belong 
to  everyone  in  an  egalitarian  manner.  On  this 
view,  private  appropriation  is  permissible,  but 
individuals  who  appropriate  more  than  their  equal 
share  of  external  goods  owe  others  compensation. 
On  these  grounds,  some  left-libertarians  have  even 
endorsed  a  measure  of  income  redistribution  as  a 
form  of  compensation  for  hitherto  uncompensated 
past  acts  of  appropriation. 

Consequentialist  Libertarianism 

As  earlier  indicated,  not  all  libertarians  base  their 
theory  on  a  principle  of  self-ownership  or  on  any 
other  principle  from  which  a  natural  right  to  free¬ 
dom  of  choice  is  said  to  flow.  Instead  of  looking  to 
an  abstract  principle  of  rights,  these  libertarians 
argue  that  a  minimal  government  restricted  to  pre¬ 
venting  force  and  fraud  leads  to  better  overall 
future  consequences  than  do  more  expansive  gov¬ 
ernments:  Resources  will  be  used  more  efficiently, 
markets  will  respond  rationally  to  people’s  needs, 
people  will  do  a  better  job  of  looking  after  them¬ 
selves,  and  hence,  the  society  will  in  general  be 
more  happy  and  prosperous.  This  style  of  reason¬ 
ing  is  consequentialist  in  nature,  consequentialism 
being  the  name  for  the  moral  doctrine  according  to 
which  the  right  action  to  perform  is  the  one  with 
the  best  overall  consequences.  (Utilitarianism  is  the 
best  known,  but  not  the  only,  consequentialist 
moral  theory.) 

The  main  consequentialist  arguments  for  liber¬ 
tarianism  comprise  arguments  based  on  incentives 
and  an  argument  based  on  the  practical  constraints 
faced  by  governments.  Those  who  appeal  to  incen¬ 
tives  argue  that  the  public  provision  of  goods 
found  in  welfare  states  dampens  private  incentives 
to  work  and  to  invest  in  one’s  skills;  they  argue 
that  holding  property  in  common  reduces  individ¬ 
ual  incentives  to  care  for  it  (the  so-called  tragedy 
of  the  commons)  and  that,  by  contrast,  a  competi¬ 
tive  free  market  (which  entails  both  the  promise  of 
profit  and  the  peril  of  being  outdone  by  one’s 
rivals)  creates  incentives  for  firms  to  use  the  most 


efficient  means  to  produce  goods  that  consumers 
desire  and  creates  incentives  for  individuals  to 
acquire  skills  that  are  of  use  in  this  production. 

A  second,  related  consequentialist  argument  for 
libertarianism  stresses  the  practical  problems  that 
governments  face  in  gathering  the  information 
they  need  to  make  good  decisions;  this  argument  is 
most  famously  associated  with  Friedrich  August 
Hayek.  Hayek  noted  that  prices  in  a  competitive 
market  function  as  signals,  widely  dispersing  use¬ 
ful  information  to  potential  producers  (e.g.,  a 
sharp  rise  in  price  means  a  valued  product  is 
undersupplied).  Prices  also  give  people  an  incen¬ 
tive  to  respond  to  this  information  by  changing 
production  patterns.  For  instance,  a  good  that  is 
undersupplied  relative  to  demand  can  fetch  a  high 
price,  thereby  giving  producers  incentive  to  supply 
more  of  the  good,  whereas  a  good  that  is  oversup¬ 
plied  will  experience  a  drop  in  price,  thereby  giving 
producers  of  that  good  incentive  to  switch  produc¬ 
tion  to  more  desired  goods.  Hayek  argued  that  no 
individual  or  group  of  individuals  (such  as  the 
planning  board  of  a  socialist  economy)  could  pos¬ 
sibly  replicate  the  informational  and  incentive 
effects  of  a  free  market  system  of  competitively 
determined  prices;  he  concluded  from  this  that  the 
free  market  was  the  most  rational  system  of  allo¬ 
cating  goods. 

These  consequentialist  arguments  have  been  tre¬ 
mendously  influential  among  mainstream  econo¬ 
mists,  though  most  such  economists  do  not  end  up 
endorsing  as  pure  a  form  of  laissez-faire  capitalism 
as  libertarians  desire.  For  a  large  body  of  economic 
doctrine  is  devoted  to  the  topic  of  market  failures — 
situations  in  which  the  market  fails  to  respond  to 
need  or  fails  to  allocate  resources  efficiently — and 
many  economists  conclude  on  consequentialist 
grounds  that  such  failures  call  for  government 
intervention.  Examples  of  market  failure  discussed 
in  the  literature  include  natural  monopolies,  exter¬ 
nalities  (a  term  that  refers  to  costs  of  production 
that  producers  externalize,  i.e.,  push  off  onto  oth¬ 
ers — pollution  being  a  key  example),  and  public 
goods  that  are  undersupplied  by  the  free  market 
owing  to  the  free-rider  problem  (when  the  costs 
and  benefits  of  common  resources  are  not  shared 
fairly).  Consequentialist  libertarians  respond  to 
these  worries  either  by  proposing  market  solutions 
to  these  issues  (thereby  denying  that  they  are  genu¬ 
ine  cases  of  market  failure)  or  by  agreeing  that 
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markets  are  less  than  wholly  efficient  in  such  cases 
but  then  arguing  that  government  interventions 
would  be  even  more  wasteful  (a  line  of  argument 
most  closely  associated  with  the  public  choice 
school  of  economics). 

Criticisms  of  Libertarianism 

Libertarians  face  criticism  from  both  the  Right  and 
the  Left.  Right-leaning  critics  who  are  anarcho- 
capitalists  object  to  libertarianism’s  willingness  to 
endorse  any  state  at  all,  whereas  Left-leaning  crit¬ 
ics  argue  that  libertarianism’s  case  for  the  minimal 
state  assumes  a  flawed  account  of  freedom,  fails  to 
recognize  competing  values  such  as  fairness  and 
the  meeting  of  basic  needs,  and  ignores  the  social 
inputs  that  are  a  necessary  part  of  any  system  of 
production,  so  that  one’s  possessions  are  never 
purely  the  fruits  of  one’s  own  labor.  While  of 
course  these  are  not  the  only  criticisms  made  of 
libertarianism,  they  are  among  the  most  promi¬ 
nent,  and  each  will  be  discussed  in  turn. 

Anarcho-Capitalism 

Anarcho-capitalists  such  as  Murray  Rothbard 
argue  that  the  functions  performed  by  a  libertarian 
minimal  state  can  and  ought  to  be  performed 
instead  by  private  firms  operating  in  a  capitalist  free 
market  system.  Hence,  instead  of  a  public  system  of 
police,  law,  and  courts,  anarcho-capitalists  envision 
a  competitive  market  of  private  defense  agencies, 
each  offering  potential  customers  its  own  code  of 
protection,  complete  with  its  own  security  agents  to 
enforce  this  code  and  its  own  courtlike  procedures 
for  interpreting  and  applying  it.  Nor,  according  to 
anarcho-capitalists,  would  there  be  need  for  a 
government  to  print  and  regulate  currency;  there 
would  likewise  be  a  competitive  market  of  private 
currencies.  And  so  on  for  the  remaining  functions 
typically  associated  with  government.  In  short, 
anarcho-capitalists  argue  that  insofar  as  libertarian¬ 
ism  approves  of  a  minimal  state  rather  than  trusting 
individuals  to  meet  all  their  needs  through  a  system 
of  free  exchange,  it  shows  insufficient  regard  for 
freedom  of  choice. 

Consequentialist  libertarians  have  an  easier  time 
replying  to  the  criticisms  of  anarcho-capitalists 
than  do  natural  rights  libertarians,  for  consequen- 
tialists  can  simply  argue  that  a  minimal  state  would 


in  fact  do  a  better  job  of  preventing  violence  than 
would  private  defensive  agencies  (who  may  end  up 
fighting  each  other) — or  at  least,  that  it  is  not  suf¬ 
ficiently  clear  that  anarcho-capitalism  would 
deliver  better  results  to  make  it  worth  the  risks  of 
dismantling  the  state  entirely.  By  contrast,  natural 
rights  libertarians  must  argue  that  the  minimal 
state  does  not  violate  the  economic  rights  of  entre¬ 
preneurs  who  would  like  to  set  up  their  own  pri¬ 
vate  defense  agency  to  compete  with  the  public 
system  of  police  and  courts,  but  who  are  denied 
this  opportunity  by  the  minimal  state’s  coercively 
enforced  monopoly  on  the  supply  of  protection. 
One  celebrated  libertarian  argument  in  this  regard 
was  made  by  Nozick,  who  in  his  book  Anarchy, 
State,  and  Utopia  argued  that  in  a  competitive 
market  of  private  defense  agencies,  a  dominant 
defense  agency  would  eventually  arise  and  estab¬ 
lish  a  monopoly;  hence,  a  minimal  state  would  in 
essence  naturally  arise  from  within  an  anarcho- 
capitalist  system. 

Criticisms  of  Libertarian  Liberty 

Some  critics  from  the  Left  argue  against  liber¬ 
tarians’  account  of  freedom  of  choice.  One  form  of 
this  criticism  argues  that  libertarianism’s  exclusive 
focus  on  physical  coercion  is  too  narrow,  for  con¬ 
centrated  economic  power  can  itself  be  a  form  of 
coercive  power:  Monopolists  can  charge  exorbi¬ 
tant  prices  for  their  goods,  and  business  owners 
can  wield  their  threat  of  firing  to  extort  actions 
from  desperate  employees  that  the  employees 
would  never  otherwise  consent  to  perform.  In  this 
view,  freedom  does  not  necessarily  increase  as  gov¬ 
ernment  shrinks,  for  without  antitrust  laws, 
employee  protections,  and  other  legal  instruments, 
many  individuals  may  find  themselves  the  unfree 
victims  of  economic  coercion. 

Another  criticism  of  the  libertarian  account  of 
freedom  of  choice  argues  against  libertarianism’s 
definition  of  freedom  exclusively  in  terms  of  the 
absence  of  coercive  interference.  Freedom  so 
defined  is  referred  to  as  negative  liberty,  and  the 
critics  in  question  argue  that  freedom  is  instead 
best  understood  as  autonomy  or  positive  liberty, 
where  the  autonomy  or  positive  liberty  to  perform 
some  action  requires  not  just  the  absence  of  coer¬ 
cive  interference  blocking  that  action  but  also  the 
presence  of  a  genuine  ability  to  perform  that 
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action.  On  this  definition,  for  instance,  a  person 
who  is  too  poor  to  afford  a  university  education  is 
not  free  to  attend  university  and  a  wheelchair- 
bound  person  is  not  free  to  work  at  a  second-story 
business  if  there  are  no  elevators.  Accordingly, 
these  nonlibertarians  conclude  that  overall  liberty 
is  enhanced  by  a  state  that  relieves  poverty  through 
redistributive  taxation  or  that  requires  accommo¬ 
dations  for  the  disabled. 

Libertarians  typically  respond  to  these  criti¬ 
cisms  by  denying  that  economic  power  amounts 
to  a  form  of  coercive  power  and  by  arguing  that  it 
is  a  mistake  to  define  liberty  in  terms  of  ability. 
For  instance,  in  support  of  this  latter  claim,  a  lib¬ 
ertarian  might  argue  that  although  most  people 
lack  the  mental  ability  to  become  physics  profes¬ 
sors,  it  would  be  misleading  to  conclude  (as  pro¬ 
ponents  of  positive  liberty  apparently  must)  that 
most  people  lack  the  freedom  to  become  physics 
professors. 

Competing  Values 

Libertarianism  is  also  criticized  for  assigning 
freedom  of  choice  priority  over  all  other  values,  it 
being  argued  instead  that  in  some  contexts  values 
such  as  fairness  or  the  meeting  of  basic  needs 
ought  to  take  priority  over  freedom.  Regarding 
the  value  of  fairness,  for  instance,  critics  point  out 
that  libertarians  reject  equal  opportunity  provi¬ 
sions,  such  as  antidiscrimination  employment 
laws  and  publicly  financed  primary  and  secondary 
schools;  these  critics  then  argue  that  without  such 
provisions,  one’s  life  prospects  could  be  unfairly 
diminished  by  accidents  of  birth,  such  as  one’s 
race  or  sex  or  one’s  family’s  socioeconomic  class. 
Regarding  the  value  of  meeting  basic  needs,  crit¬ 
ics,  for  example,  argue  that  the  health  needs  of  a 
large  number  of  individuals  would  go  unmet  in  a 
libertarian  society  since  no  health  insurance  firm 
can  make  a  profit  by  insuring  individuals  with 
known  chronic  and  serious  health  problems.  A 
common  libertarian  reply  to  these  charges  is  to 
argue  that  private  charity  is  the  best  response  to 
these  problems  of  poverty  and  health  care  need. 

Society  as  a  Partner  in  Production 

Some  critics  of  libertarianism  defend  the  legiti¬ 
macy  of  taxation  by  challenging  the  libertarian 


claim  that  individuals  have  a  right  to  retain  in  full 
whatever  they  come  to  possess  by  exchange  in  a 
market  system.  This  challenge  proceeds  by  arguing 
that  a  person’s  possessions  are  not  the  result  of 
exclusively  individual  efforts  but  result  instead 
from  individual  efforts  conjoined  with  a  set  of 
social  conditions  that  make  the  individual’s  pros¬ 
perity  possible.  According  to  this  view,  one  per¬ 
son’s  success  is  never  wholly  self-made  but  depends 
on  factors  such  as  the  prevailing  level  of  education 
in  one’s  society,  the  prevailing  level  of  technology, 
and  the  prevailing  level  of  wealth;  also  crucial  are 
the  levels  of  government  investment  in  infrastruc¬ 
ture  (e.g.,  highways  and  utilities),  government  sta¬ 
bilization  of  the  economy  (e.g.,  controlling  the 
money  supply),  and  more  generally,  the  cultural 
capital  of  one’s  society  (by  which  is  meant  the 
benign  cultural  practices  and  social  institutions 
that  ensure  that  the  daily  interactions  of  thou¬ 
sands,  or  millions,  of  one’s  fellow  citizens  are  by 
and  large  peaceful).  For  these  reasons,  according 
to  these  critics,  society  should  be  viewed  as  a  silent 
partner  that  is  ever  present  alongside  individual 
efforts  of  production;  taxation  thus  represents 
society’s  due  return  on  its  contribution  to  produc¬ 
tion.  If  society  (through  its  duly  elected  representa¬ 
tives  in  government)  decides  to  spend  this  social 
wage  on  additional  purposes  over  and  above  those 
of  preventing  force  and  fraud,  such  as  the  con¬ 
struction  of  parks  and  libraries  and  support  for  the 
arts,  say,  then  according  to  these  critics,  that  is  its 
prerogative.  Libertarians  commonly  respond  that 
the  notion  of  society  employed  in  this  argument  is 
at  best  a  vague  and  unhelpful  generalization  and  at 
worst  a  collectivist  illusion  denying  the  moral  pri¬ 
macy  of  the  individual. 

Conclusion 

Debates  between  libertarians  and  their  critics 
were  a  vibrant  part  of  20th-century  political  the¬ 
ory,  and  the  debate  looks  set  to  last  throughout 
the  21st  century  and  beyond,  in  part  owing  to 
developments  such  as  an  increasingly  global 
economy.  Many  libertarians  welcome  this  devel¬ 
opment,  viewing  a  world  of  free  economic  agents 
as  an  appealing  prospect.  By  contrast,  many  non¬ 
libertarians  view  economic  globalization  with 
alarm,  seeing  it  as  a  threat  to  values  such  as  fair¬ 
ness  and  autonomy.  As  such,  the  stage  is  set  for 
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the  development  of  new  arguments  and  ideas  in 
this  long-running  debate. 
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Two  postulates  encapsulate  the  necessary  and 
sufficient  conditions  for  the  existence  of  partic¬ 
ular  instances  of  freedom  and  unfreedom, 
respectively: 

F  Postulate:  A  person  is  free  to  9  if  and  only  if  he 
is  able  to  9.  (Alternative  formulations  of  this 
postulate  are  “A  person  is  free  to  9  if  and  only  if  it 


is  possible  for  him  to  9”  and  “A  person  is  free  to  9 
if  and  only  if  he  is  unprevented  from  9-ing.) 

U  Postulate:  A  person  is  unfree  to  9  if  and  only  if 
both  of  the  following  conditions  obtain:  (1)  he 
would  be  able  to  9  in  the  absence  of  the  second  of 
these  conditions  and  (2)  irrespective  of  whether  he 
actually  endeavors  to  9,  he  is  directly  or  indirectly 
prevented  from  9-ing  by  some  action(s)  or  some 
disposition(s)  to  perform  some  action(s)  on  the  part 
of  some  other  person(s). 

In  each  of  these  formulations,  the  Greek  letter  9 
(which  stands  for  any  germane  verb  or  set  of  verbs 
plus  any  accompanying  words)  can  denote  one’s 
performance  of  some  action,  one’s  existence  in 
some  condition,  or  one’s  undergoing  of  some  pro¬ 
cess.  Throughout  this  entry,  incidentally,  the  terms 
freedom  and  liberty  are  used  interchangeably. 

These  two  postulates,  which  will  be  explicated 
further  below,  are  associated  with  the  idea  of 
negative  liberty.  The  principal  concern  of  this 
entry  is  to  distinguish  negative  liberty  from  certain 
other  types  of  freedom.  After  an  initial  elaboration 
of  the  nature  of  negative  liberty  itself  through 
some  amplification  of  the  F  and  U  Postulates,  this 
entry  will  draw  four  principal  contrasts:  between 
negative  liberty  and  positive  liberty,  between  nega¬ 
tive  liberty  and  moralized  liberty,  between  nega¬ 
tive  liberty  and  republican  liberty,  and  between 
physical  freedom  and  deontic  freedom. 

Negative  Liberty:  Two  Postulates 

The  F  Postulate  distills  the  nature  of  negative  lib¬ 
erty  as  it  exists  in  particular  instantiations. 
However,  the  U  Postulate  does  not  comprehend  all 
the  situations  in  which  particular  instances  of 
negative  liberty  are  absent.  That  is,  the  two  postu¬ 
lates  are  not  jointly  exhaustive  in  their  coverage. 
Apart  from  being  free  to  9  or  being  unfree  to  9, 
somebody  can  be  simply  not  free  to  9.  In  other 
words,  is  not  free  and  is  unfree  are  not  equivalent; 
the  latter  predicate  entails  the  former,  but  not  vice 
versa.  (What  are  equivalent  are  the  predicates  is 
not  free  and  is  unable.)  Likewise,  the  predicates  is 
free  and  is  not  unfree  are  not  equivalent.  The  for¬ 
mer  entails  the  latter  but  not  vice  versa. 

For  example,  although  Joe  is  not  able  to  run  a 
mile  under  3  minutes  and  is  therefore  not  free  to 
run  a  mile  under  3  minutes,  it  is  not  the  case  that 
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he  is  unfree  to  run  a  mile  in  such  a  short  span  of 
time.  His  lack  of  freedom  to  run  a  mile  so  rapidly 
is  a  mere  inability  rather  than  an  instance  of 
unfreedom.  It  is  a  mere  inability  because  it  is  not 
due  to  any  action(s)  or  disposition(s)  to  perform 
some  action(s)  on  the  part  of  anyone  else.  Instead, 
it  is  a  purely  natural  limitation. 

Hence,  the  concept  of  freedom  as  explicated 
here  is  trivalent  rather  than  bivalent.  Instead  of 
separating  people’s  abilities  and  inabilities  dichot- 
omously  into  freedoms  and  unfreedoms,  it  sepa¬ 
rates  them  trichotomously  into  freedoms,  unfree¬ 
doms,  and  mere  inabilities.  The  mere  inabilities  are 
infinitely  expansive  in  their  scope,  for  most  of  the 
countless  ways  in  which  any  person  falls  short  of 
omnipotence  are  due  to  natural  limitations  rather 
than  to  the  conduct  of  other  people. 

Why  should  a  trivalent  conception  of  freedom 
be  favored  over  a  bivalent  conception?  To  glimpse 
the  answer  to  this  question,  we  have  to  take 
account  of  another  feature  of  the  F  and  U  Postulates. 
Those  postulates  deal  with  particular  instances  of 
freedom  and  unfreedom,  rather  than  with  any¬ 
one’s  overall  quantity  of  liberty.  A  particular  free¬ 
dom  is  an  ability  to  engage  in  a  certain  mode  of 
conduct,  to  be  in  a  certain  condition,  or  to  alter 
one’s  situation  in  a  certain  way.  Any  particular 
freedom  has  a  content  that  differentiates  it  from 
other  particular  freedoms.  A  person’s  overall  level 
of  liberty  is  a  complicated  aggregate  of  his  or  her 
myriad  particular  freedoms  and  also  of  his  or  her 
myriad  particular  unfreedoms.  This  entry  will  not 
concern  itself  with  the  details  of  the  complex  cal¬ 
culations  by  which  each  person’s  freedoms  and 
unfreedoms  (or,  rather,  each  person’s  combina¬ 
tions  of  freedoms  and  combinations  of  unfree¬ 
doms)  are  aggregated.  Rather,  the  key  point  for 
present  purposes  is  that,  although  the  F  and  U 
Postulates  are  concerned  only  with  particular  lib¬ 
erties  and  unfreedoms,  they  have  been  formulated 
with  an  eye  toward  the  ultimate  aggregation  of 
those  liberties  and  unfreedoms.  That  is,  if  an 
understanding  of  particular  freedoms  and  unfree¬ 
doms  is  to  be  compatible  with  the  aim  of  establish¬ 
ing  that  the  overall  liberty  of  each  person  is  a 
measurable  property,  a  trivalent  conception  of 
particular  freedoms  and  unfreedoms  is  essential. 

Given  that  this  entry  will  not  recount  the  intri¬ 
cacies  of  measuring  people’s  levels  of  overall  lib¬ 
erty,  the  unique  suitability  of  a  trivalent  conception 


of  freedoms  and  unfreedoms  cannot  be  fully  sub¬ 
stantiated  here.  Nonetheless,  the  gist  of  the  matter 
resides  in  the  fact  that  the  calculation  of  the  level 
of  anyone’s  overall  liberty  proceeds  through  a 
complicated  fraction.  If  mere  inabilities  were  not 
distinguished  from  freedoms  and  unfreedoms,  then 
the  numerator  or  the  denominator  of  the  afore¬ 
mentioned  fraction  would  be  infinitely  large, 
respectively.  After  all,  as  has  been  stated,  anyone’s 
natural  inabilities  are  infinitely  expansive  in  their 
scope.  For  example,  each  person  is  not  only  unable 
to  fly  around  the  Milky  Way  Galaxy  once,  but  is 
also  unable  to  fly  around  it  twice  or  thrice  or  any 
other  number  of  times.  Mere  inabilities  are  limit¬ 
less.  Thus,  if  those  inabilities  were  to  be  classified 
as  liberties,  both  the  numerator  and  the  denomina¬ 
tor  of  the  fraction  for  measuring  each  person’s 
overall  liberty  would  be  infinitely  large.  If  mere 
inabilities  were  instead  to  be  classified  as  unfree¬ 
doms,  the  denominator  of  that  fraction  would  be 
infinitely  large.  In  either  case,  then,  the  project  of 
measuring  anyone’s  overall  freedom  would  be 
fatally  undermined  in  principle  as  well  as  in  prac¬ 
tice.  To  avoid  such  an  upshot,  a  theory  of  liberty 
needs  to  distinguish  mere  inabilities  both  from 
freedoms  and  from  unfreedoms.  The  requisite  dis¬ 
tinctions  are  in  effect  drawn  by  the  F  Postulate  and 
the  U  Postulate  together. 

One  other  aspect  of  the  U  Postulate  is  in  need  of 
elucidation.  What  is  meant  by  the  notion  that 
somebody’s  dispositions  to  perform  actions  can 
prevent  other  people  from  doing  various  things? 
Plainly,  a  sheer  disposition,  which  will  remain 
unmanifested  if  the  circumstances  that  would  acti¬ 
vate  it  never  arise,  does  not  in  itself  prevent  any¬ 
thing  through  the  actual  application  of  force  or  the 
actual  introduction  of  material  obstacles.  Never¬ 
theless,  dispositions  to  perform  actions  can  be 
preventive  factors  because  they  will  lead  to  the 
actual  application  of  force  or  the  actual  introduc¬ 
tion  of  material  obstacles  in  the  event  that  certain 
triggering  circumstances  do  materialize. 

In  other  words,  each  person’s  freedom  or 
unfreedom  is  affected  not  only  by  what  other 
people  in  fact  do  but  also  by  what  those  other 
people  would  have  done  if  events  had  unfolded 
differently.  Dispositions  bear  crucially  on  the  free¬ 
dom  and  unfreedom  of  each  person  since  those 
dispositions  play  a  key  role  in  determining  whether 
the  abilities  and  inabilities  of  each  person  would 
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continue  as  such  if  the  person’s  conduct  or  situa¬ 
tion  were  altered  in  various  respects.  Only  by  ask¬ 
ing  how  far  those  abilities  and  inabilities  reach — 
past  actual  events  into  counterfactual  events — can 
a  theorist  ascertain  the  extent  of  each  person’s 
overall  liberty  and  the  existence  of  many  of  his  or 
her  particular  liberties.  Until  we  know  whether 
people  would  or  would  not  have  acted  in  certain 
ways  if  a  given  person  had  sought  to  do  X,  we  can¬ 
not  know  whether  that  person  was  free  to  do  X. 
Nor  can  we  know  whether  the  person  was  free  to 
perform  X  in  combination  with  manifold  subse¬ 
quent  actions. 

A  simple  example  can  serve  to  highlight  the 
effects  of  people’s  dispositions  on  other  people’s 
freedoms  and  unfreedoms.  Suppose  that  Jim  is  sit¬ 
ting  in  a  room.  Just  outside  the  door  of  the  room 
are  four  people  with  submachine  guns  who  will 
shoot  and  kill  him  if  he  seeks  to  exit  through  the 
door.  If  Jim  does  not  seek  to  leave  the  room,  then 
the  homicidal  dispositions  of  the  four  people  out¬ 
side  the  door  will  not  be  activated.  Let  us  suppose 
that  he  in  fact  does  not  leave  the  room  and  that 
there  is  consequently  no  occasion  for  the  other 
four  people  to  harm  him.  All  the  same,  he  is  unfree 
to  exit  the  room.  If  any  theory  of  freedom  were  to 
lead  to  a  contrary  conclusion — that  is,  if  any  the¬ 
ory  of  freedom  were  to  pretermit  the  four  people’s 
unmanifested  dispositions  and  conclude  that  Jim 
has  not  been  prevented  from  exiting — it  would  pro 
tanto  be  defective.  Opportunities  get  closed  off  to 
individuals  not  only  because  of  the  actual  applica¬ 
tion  of  force  or  the  actual  erection  of  material  bar¬ 
riers  but  also  because  of  the  readiness  of  other 
people  to  exert  such  force  or  to  erect  such  barriers 
(even  if  no  occasion  arises  for  the  activation  of 
their  readiness). 

Of  course,  the  explicit  mention  of  dispositions 
in  the  U  Postulate  should  not  be  taken  to  indicate 
that  they  are  productive  only  of  unfreedoms.  Other 
people’s  dispositions  are  also  centrally  involved  in 
the  establishment  and  sustainment  of  countless 
freedoms.  Suppose,  for  example,  that  Julia  lives  in 
a  well-fortified  apartment  building  in  the  middle  of 
New  York  City.  Whenever  a  resident  of  the  build¬ 
ing  wishes  to  leave  through  the  front  doors — the 
only  set  of  doors — a  security  guard  has  to  release 
the  computerized  locks.  Because  no  resident  knows 
how  to  operate  the  locks,  the  guard’s  cooperative¬ 
ness  is  essential  for  each  resident’s  ability  to  leave 


the  building.  Now,  if  Julia  relaxes  lazily  in  her 
apartment  one  morning  and  thus  makes  no  attempt 
to  go  out  before  noon,  the  security  guard  during 
the  morning  will  have  no  occasion  to  act  on  his 
disposition  to  release  the  locks  for  her.  Nonetheless, 
if  the  guard  would  have  released  the  locks  in  the 
event  that  Julia  had  indeed  sought  to  exit  from  the 
building,  then  Julia  has  been  free  throughout  the 
morning  to  leave  via  the  front  doors.  Because  of 
the  guard’s  preparedness  to  open  those  doors  if 
Julia  should  endeavor  to  depart — and  only  because 
of  his  preparedness — she  is  unprevented  from 
departing.  She  is  free  in  that  respect.  In  this  con¬ 
text,  as  in  myriad  other  contexts,  her  particular 
freedoms  are  dependent  on  the  inclinations  of 
other  people. 

Negative  Versus  Positive  Liberty 

Especially  since  the  writings  of  Isaiah  Berlin  in  the 
1950s  and  1960s,  the  most  famous  contrast  per¬ 
taining  to  freedom  lies  between  negative  liberty  and 
positive  liberty.  As  is  evident  from  what  has  been 
said  so  far,  negative  liberty  consists  in  opportuni¬ 
ties.  To  be  negatively  free  to  <p  is  to  be  able  to  9  and 
is  thus  to  be  unprevented  from  tp-ing.  If  somebody 
is  negatively  free  to  9,  then  neither  internal  inca¬ 
pacities  nor  external  impediments  have  made  his 
cp-ing  impossible.  Hence,  to  be  negatively  free  to  9 
is  to  have  opportunities  to  9,  whether  or  not  one 
avails  oneself  of  those  opportunities. 

Positive  liberty  is  very  different.  Instead  of  con¬ 
sisting  in  opportunities,  it  consists  in  the  following 
of  certain  codes  of  conduct,  the  attainment  of  cer¬ 
tain  objectives,  or  the  purification  of  one’s  motiva¬ 
tions  and  outlook.  Being  presented  with  various 
opportunities  is  not  sufficient  for  positive  freedom; 
in  addition,  a  person  must  take  advantage  of  some 
of  those  opportunities  in  certain  ways.  Whereas 
negative  liberty  is  a  matter  of  unpreventedness, 
positive  liberty  is  a  matter  of  accomplishments. 

As  is  suggested  by  the  vagueness  of  these  descrip¬ 
tions,  numerous  divergent  accounts  of  positive  lib¬ 
erty  have  been  propounded  from  the  time  of  Plato 
onward.  Some  theorists  maintain  that  a  person 
becomes  truly  free  only  when  she  or  he  has  persis¬ 
tently  exercised  certain  faculties,  such  as  her  capac¬ 
ity  to  reason  and  deliberate.  Others  contend  that 
people  are  truly  free  only  when  they  interact  regu¬ 
larly  in  democratic  institutions  with  their  fellow 
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citizens.  Still  other  proponents  of  positive  liberty 
submit  that  people  attain  freedom  only  if  they  rid 
themselves  of  certain  ignoble  desires  or  only  if  they 
subject  their  sundry  desires  and  inclinations  to 
rational  scrutiny  and  refinement.  Many  other  vari¬ 
eties  of  positive-liberty  theories  have  likewise 
emerged  over  the  centuries. 

Although  negative-liberty  theorists  have  sus- 
tainedly  criticized  various  doctrines  of  positive 
freedom,  nobody  should  think  that  the  very  use  of 
the  terms  freedom  or  liberty  by  the  advocates  of 
those  doctrines  is  itself  mistaken.  Their  errors  are 
errors  of  substantive  political  philosophy  rather 
than  linguistic  lapses.  For  example,  it  is  not  an 
abuse  of  language  to  declare  that  a  person  who 
achieves  a  high  degree  of  autonomy  has  thereby 
become  free  in  the  sense  of  having  liberated  him¬ 
self  from  the  sway  of  the  influences  that  might 
have  kept  him  in  a  heteronomous  condition.  Such 
a  characterization  is  not  optimally  clear  and  pre¬ 
cise  but  is  far  from  ridiculous  or  unintelligible. 
Similarly,  it  is  hardly  ridiculous  or  solecistic  to 
assert  that  people  who  together  shape  their  destiny 
through  institutions  of  democratic  decision  mak¬ 
ing  are  thereby  keeping  themselves  free  by  avoid¬ 
ing  subjection  to  mandates  that  they  have  not 
themselves  collectively  fashioned.  Though  the 
conception  of  freedom  that  is  operative  in  such  a 
claim  is  plainly  not  equivalent  to  the  negative- 
liberty  conception,  the  classification  of  the  demo¬ 
cratic  state  of  affairs  as  freedom  is  by  no  means  an 
outlandish  linguistic  slip. 

Still,  although  any  accusations  of  linguistic 
errors  would  be  misguided,  doctrines  of  positive 
liberty  are  themselves  misconceived  in  a  number  of 
respects.  While  a  full  exploration  of  the  shortcom¬ 
ings  of  such  doctrines  as  theories  of  freedom  is  not 
possible  within  the  confines  of  this  entry,  two  of 
those  shortcomings  should  be  noted  here. 

First,  a  key  problem  for  any  doctrine  of  positive 
liberty  is  that  it  generates  untenable  ascriptions  of 
freedoms  and  unfreedoms.  For  example,  suppose 
that  one  such  doctrine  (which  can  be  labeled  here  as 
the  Aesthetic  Thesis)  proclaims  that  each  person 
becomes  truly  free  only  by  developing  his  or  her 
aesthetic  abilities  to  the  maximal  degree.  Suppose 
further  that  Kevin  is  a  gifted  pianist  whose  tendency 
to  become  distracted  by  nonaesthetic  pursuits  will 
thwart  his  development  of  his  musical  talents  unless 
he  is  chained  to  his  immobile  instrument  for  several 


hours  every  day.  Now,  according  to  the  Aesthetic 
Thesis,  any  opportunities  that  do  not  facilitate  the 
maximal  development  of  a  person’s  aesthetic  abili¬ 
ties  are  not  freedoms  at  all.  Hence,  the  countless 
opportunities  closed  off  to  Kevin  by  his  being 
shackled  to  his  piano  are  not  freedoms  of  which  he 
has  been  deprived;  they  are  not  freedoms,  period. 
Instead,  according  to  the  Aesthetic  Thesis,  they  are 
obstacles  to  the  realization  of  his  true  freedom — 
obstacles  that  his  chains  have  enabled  him  to  over¬ 
come.  His  shackles  will  have  helped  bring  about 
his  freedom  without  causing  him  to  lose  even  the 
slightest  instance  of  liberty.  Because  the  severe 
curbs  on  his  mobility  involve  no  sacrifices  of  any 
particular  freedoms,  he  has  not  been  rendered 
unfree  by  those  curbs  in  any  respect.  So  the  sup¬ 
porters  of  the  Aesthetic  Thesis  must  contend. 

While  the  scenario  of  Kevin  and  his  piano  is  con- 
trivedly  vivid,  it  well  illustrates  the  far-fetched  con¬ 
clusions  that  are  generated  by  any  positive-liberty 
credo.  Every  such  credo,  which  affirms  that  true 
freedom  resides  in  a  person’s  exertion  of  certain 
faculties,  or  his  or  her  performance  of  certain 
actions  or  following  of  certain  procedures,  will 
commit  its  advocates  to  the  view  that  any  opportu¬ 
nities  inconsistent  with  the  relevant  exertions  or 
performances  or  procedures  can  be  removed  whole¬ 
sale  with  no  loss  of  any  particular  freedoms.  Instead 
of  characterizing  the  removal  of  those  opportunities 
as  the  elimination  of  some  of  a  person’s  liberties  for 
the  sake  of  increasing  her  overall  liberty,  the  posi¬ 
tive-freedom  theorists  are  obliged  to  maintain  that 
no  liberties  have  been  removed  at  all.  Their  position 
in  that  respect  is  not  self-contradictory  or  unintel¬ 
ligible,  but  it  is  indefensibly  sinister.  It  should  be 
rejected,  as  a  substantive  matter  of  political  philoso¬ 
phy.  Although  some  of  the  objectives  favored  by 
positive-freedom  theorists  are  undoubtedly  worthy 
of  pursuit,  no  one  should  pretend  that  sacrifices  of 
liberties  are  not  involved  when  various  opportuni¬ 
ties  are  closed  off  in  furtherance  of  those  objectives. 

A  second  main  shortcoming  of  positive-liberty 
theories  pertains  to  the  very  objectives  that  they 
aim  to  promote.  Whereas  the  phenomenon  that  the 
negative-liberty  theorists  designate  as  freedom  or 
liberty  cannot  be  recharacterized  in  an  illuminating 
fashion,  there  are  numerous  more  precise  designa¬ 
tions  for  the  phenomenon  that  positive-liberty 
theorists  label  as  true  freedom.  To  denote  the  con¬ 
dition  of  negative  liberty,  some  rather  awkward 
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and  uncommon  terms  such  as  unpreventedness 
and  unprecludedness  would  be  necessary  as 
replacements  for  freedom  and  liberty.  The  only 
adequate  common  substitute  would  be  ability, 
which  is  less  effective  than  liberty  or  freedom  in 
highlighting  the  relational  character  of  the  desig¬ 
nated  phenomenon  (i.e.,  the  fact  that  the  desig¬ 
nated  phenomenon  consists  not  only  in  a  person’s 
possession  of  capacities  but  also  in  his  or  her  being 
unprevented  from  exercising  those  capacities).  For 
discussions  focused  on  the  conceptual  space  carved 
out  by  the  negative-liberty  theorists,  the  language 
of  liberty  and  freedom  is  singularly  apposite. 

A  very  different  situation  obtains  when  we  turn 
our  attention  to  positive  liberty.  While  the  sundry 
versions  of  the  positive-liberty  ideal  will  warrant 
the  application  of  a  variety  of  terms — most  of 
which  will  be  appropriate  for  only  some  of  those 
versions — there  are  indeed  many  pertinent  desig¬ 
nations  available.  Terms  and  phrases  such  as  self- 
fulfillment,  self-realization,  self-expression,  self- 
mastery,  autonomy,  self-reliance,  self-control, 
self-determination,  flourishing,  self-development, 
self-direction,  popular  political  participation,  and 
active  citizenship  can  each  figure  saliently  in  one  or 
more  of  the  major  positive-liberty  credos.  More¬ 
over,  when  wielded  aptly,  each  of  those  terms  and 
phrases  will  typically  be  much  more  exact — and 
probably  more  vivid — than  freedom  or  liberty  as  a 
means  of  pinpointing  the  state  or  process  that  is 
posited  as  a  fundamental  human  desideratum. 
Hence,  not  only  will  little  or  nothing  be  lost  if  the 
positive  uses  of  freedom  and  liberty  are  eschewed, 
but  in  addition,  the  avoidance  of  those  terms  as 
designations  for  the  positive-liberty  writers’  objec¬ 
tives  will  promote  theoretical  precision.  Freedom 
and  liberty  in  their  positive  senses  are  superfluous 
since  the  potential  replacements  for  them  are 
myriad,  and  those  replacements  are  generally  supe¬ 
rior  in  denoting  the  specificities  of  the  ideals  that 
the  positive-liberty  theorists  extol. 

Of  course,  as  has  been  remarked  above,  nobody 
should  doubt  that  many  of  the  aforementioned 
ideals  are  worthy  of  pursuit.  What  is  objection¬ 
able  about  positive-liberty  doctrines  is  not  (in 
many  cases)  the  objectives  that  they  uphold  but 
instead  their  characterizing  of  those  objectives  as 
true  freedom.  For  example,  when  theories  of 
autonomy,  democracy,  or  active  citizenship  pres¬ 
ent  themselves  as  accounts  of  those  desiderata 


rather  than  of  liberty,  they  may  well  be  admirable. 
Whether  they  are  indeed  admirable  is  of  course 
dependent  on  the  specifics  of  their  arguments  and 
analyses;  however,  when  they  are  not  misleadingly 
packaged  as  theories  of  freedom,  they  at  any  rate 
stand  a  chance  of  being  admirable. 

Note  that  the  comments  in  this  section  about 
the  appropriate  senses  of  the  terms  freedom  and 
liberty  are  concerned  with  the  employment  of 
those  designations  in  the  rigorous  theorizing  that 
constitutes  political  philosophy.  It  would  be  quite 
foolish  to  aim  to  regiment  the  multifarious  pat¬ 
terns  of  usage  that  occur  in  the  much  less  rigorous 
discourses  of  everyday  life.  In  many  of  those  dis¬ 
courses,  where  consistency,  subtlety,  clarity,  and 
precision  are  of  far  less  importance  than  in  philo¬ 
sophical  writing,  one’s  labeling  of  self-fulfillment 
as  the  attainment  of  true  freedom  might  be  apt. 
Plainly,  as  has  already  been  remarked,  there  is 
nothing  semantically  illicit  about  the  use  of  such 
terminology.  Given  that  the  language  of  freedom 
or  liberty  is  intelligible  and  coherent  and  given  that 
the  inexactitude  of  its  application  does  not  greatly 
matter  in  the  to-and-fro  of  quotidian  deliberations 
and  exhortations,  there  are  no  grounds  for  reining 
in  the  multiple  senses  that  might  attach  to  freedom 
and  liberty  in  a  host  of  ordinary  contexts.  Such  an 
endeavor  would  be  patently  futile  in  any  event. 

Not  at  all  futile  or  ridiculous,  however,  is  an 
effort  to  enhance  the  rigor  and  precision  of  philo¬ 
sophical  thinking  about  freedom.  If  the  aim  is  to 
come  up  with  a  theory  that  carefully  distinguishes 
the  concept  of  freedom  from  other  major  political 
and  moral  concepts  while  capturing  its  myriad 
complexities,  then  some  degree  of  terminological 
regimentation  is  inevitable.  Some  uses  of  freedom 
or  liberty  countenanced  in  day-to-day  parlance 
have  to  fall  by  the  wayside  when  a  painstaking 
philosophical  investigation  of  freedom  seeks  to 
elaborate  the  necessary  and  sufficient  conditions 
for  the  truth  of  any  proposition  “P  is  free  to  9.” 
For  the  purposes  of  such  an  investigation,  the  inex¬ 
actitude  and  misleadingness  and  inconsistency  of 
many  of  those  ordinary  uses  are  to  be  shunned. 
Philosophical  analysis  proceeds  by  clarifying  and 
refining  the  concepts  that  are  invoked  in  relatively 
unreflective  modes  of  thought  and  discourse,  as  it 
aspires  to  transcend  the  murkiness  and  looseness 
of  those  familiar  modes  of  speaking.  The  tasks  of 
conceptual  clarification  and  purification  can 
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scarcely  go  ahead  without  some  notable  tightening 
of  everyday  terminology.  To  insist  as  much,  how¬ 
ever,  is  not  at  all  to  insist  that  a  similar  tightening 
is  advisable  or  feasible  in  everyday  discourses 
themselves. 

Negative  Versus  Moralized  Liberty 

In  some  prominent  respects — though  by  no  means 
all  respects — moralized  conceptions  of  freedom 
are  similar  to  positive-liberty  theories.  Although 
moralized  conceptions  of  freedom  are  usually 
focused  on  opportunities  rather  than  on  achieve¬ 
ments,  every  such  conception  affirms  one  or  both 
of  the  following  theses: 

1.  the  preclusion  of  some  action  or  some  state  of 
affairs  does  not  eliminate  any  particular  liberties 
unless  the  preclusion  is  illegitimate  or 

2.  the  preclusion  of  some  action  or  some  state  of 
affairs  does  not  eliminate  any  particular  liberties 
unless  the  stymied  action  or  state  of  affairs 
would  have  been  legitimate. 

Under  the  first  of  these  two  theses,  the  prevention 
of  a  person  from  (p-ing  will  count  as  a  pro  tanto 
curb  on  his  freedom  only  if  he  had  a  moral  right 
against  the  sort  of  interference  that  thwarted  him 
from  (p-ing.  Under  the  second  thesis,  the  prevention 
of  a  person  from  cp-ing  will  count  as  a  pro  tanto 
curb  on  his  freedom  only  if  he  had  no  moral  obliga¬ 
tion  to  refrain  from  cp-ing.  In  either  case,  a  moral¬ 
ized  conception  of  freedom  will  decline  to  classify 
some  eliminated  liberties  as  eliminated  liberties. 

Thus,  for  example,  if  Alec’s  prevention  of  Susan 
from  wantonly  setting  fire  to  a  neighbor’s  house  is 
morally  legitimate,  then  Alec  does  not  deprive 
Susan  of  any  freedoms  at  all  when  he  manages  to 
avert  the  arson  by  grabbing  and  restraining  her.  If 
she  struggles  to  reach  the  neighbor’s  premises  so 
fiercely  that  he  has  to  pin  her  to  the  ground  and 
even  bind  her  hands  and  feet,  she  will  still  not  have 
been  deprived  of  any  freedoms.  Such  is  the  view 
taken  by  the  proponents  of  moralized  conceptions 
of  freedom.  Whereas  the  F  Postulate  and  U 
Postulate  explicate  the  concepts  of  freedom  and 
unfreedom  by  reference  to  one’s  abilities  and  to  the 
causes  of  one’s  inabilities,  a  moralized  account 
explicates  those  concepts  by  reference  to  the  moral 
status  of  the  causes  of  one’s  inabilities  or  to  the 


moral  status  of  one’s  exertions  of  one’s  abilities — 
that  is,  the  moral  legitimacy  or  illegitimacy  of  the 
causes  of  one’s  inabilities  or  the  moral  legitimacy 
or  illegitimacy  of  one’s  exertions  of  one’s  abilities. 

In  other  words,  every  moralized  account  of  free¬ 
dom  insists  that  even  the  severest  constraints  on  a 
person’s  latitude  might  not  remove  any  of  a  per¬ 
son’s  freedoms.  No  such  removal  will  have  taken 
place  unless  the  constraints  are  illegitimate  or 
unless  the  prevented  conduct  would  have  been 
legitimate.  Consequently,  if  the  placement  of  a 
highly  dangerous  man  in  chains  or  a  straitjacket  is 
legitimate  because  of  his  uncontrollably  violent 
behavior,  neither  of  those  means  of  immobiliza¬ 
tion  will  deprive  him  of  any  liberties.  A  theory  that 
generates  such  a  conclusion  can  hardly  claim  to  be 
cogently  illuminating. 

Moreover,  with  reference  to  any  situation  in 
which  some  person  P  has  been  deprived  of  the 
freedom  to  (p,  a  moralized  conception  of  freedom 
must  submit  either  that  any  questions  about  the 
illegitimacy  of  the  deprivation  are  pointlessly  pleo¬ 
nastic  or  that  any  questions  about  the  legitimacy  of 
P’s  (p-ing  are  pointlessly  pleonastic.  If  the  restric¬ 
tions  on  P’s  (p-ing  are  indeed  restrictions  on  his 
liberty,  then  either  ipso  facto  they  are  illegitimate 
or  ipso  facto  his  (p-ing  would  have  been  legitimate. 
So  the  proponents  of  moralized  conceptions  of 
freedom  are  logically  committed  to  maintaining. 
Hence,  any  moralized  conception  whisks  out  of 
existence  an  array  of  perfectly  ordinary  inquiries. 

Perhaps  even  more  unsettling  is  that,  when  a 
moralized  account  is  focused  on  the  illegitimacy  of 
preventive  constraints,  it  strips  freedom  of  any 
independence  as  a  factor  that  can  militate  either  in 
favor  of  various  sociopolitical  arrangements  or 
against  them.  Because  the  only  constraints  on 
human  conduct  that  will  count  as  limitations  on 
liberty  are  unjust  constraints,  a  denunciation  of 
certain  institutions  as  restrictive  of  liberty  will  add 
nothing  to  a  denunciation  of  them  as  unjust. 
Likewise,  because  such  a  moralized  account  con¬ 
strues  freedom  as  nothing  more  than  the  absence 
of  illegitimate  constraints,  a  commendation  of 
certain  institutions  as  promotive  of  freedom  will 
add  nothing  to  a  commendation  of  them  as  legiti¬ 
mate  and  fair.  In  short,  the  fostering  or  impairing 
of  freedom  (as  understood  by  a  moralized  theory 
of  this  type)  will  have  ceased  to  be  a  consideration 
that  might  carry  some  independent  justificatory  or 
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condemnatory  weight.  The  redundancy  of  that 
fostering  or  impairing  as  a  justificatory  or  condem¬ 
natory  factor  stems  from  the  status  of  freedom  as 
a  mere  facet  of  some  substantive  moral  ideal — an 
ideal  on  which  the  whole  burden  of  justification  or 
condemnation  rests.  Though  moralized  approaches 
to  liberty  that  are  focused  on  the  illegitimacy  of 
obstacles  might  seem  to  elevate  the  status  of  liberty 
by  imbuing  it  with  a  morally  favorable  tenor,  they 
in  fact  eliminate  liberty  as  an  independent  phe¬ 
nomenon  by  reducing  all  questions  of  greater  or 
lesser  freedom  to  questions  of  greater  or  lesser 
rectitude.  For  anyone  who  wishes  to  appraise 
sociopolitical  arrangements  not  only  on  the  basis 
of  their  justice  or  injustice  but  also  (separately)  on 
the  basis  of  their  conduciveness  or  inconduciveness 
to  high  levels  of  overall  liberty,  a  moralized 
conception  of  freedom  should  be  forsworn.  For 
anyone  who  believes  that  liberty  and  justice  can 
sometimes  conflict — in  a  clash  between  liberty  and 
equality,  for  example — a  moralized  conception  of 
freedom  should  be  forsworn. 

Negative  Versus  Republican  Liberty 

During  the  past  couple  of  decades,  the  longstand¬ 
ing  controversies  between  negative  and  positive 
conceptions  of  liberty  have  become  somewhat 
overshadowed  by  controversies  between  negative- 
liberty  theorists  and  civic-republican  theorists.  The 
latter  theorists  generally  subscribe  to  the  negative 
conception  of  freedom  in  opposition  to  positive- 
liberty  doctrines,  but  they  hold  that  the  negative 
conception  has  been  construed  too  narrowly  by 
most  of  its  exponents.  In  two  chief  respects,  they 
take  themselves  to  have  gone  salutarily  beyond 
those  exponents. 

First,  civic  republicans  are  keenly  alert  to  the 
role  of  public  virtue  and  public  service  in  bolstering 
institutions  that  provide  high  levels  of  freedom  for 
individuals.  They  maintain  that,  in  the  absence  of 
active  civic  participation  on  the  part  of  all  or  most 
of  the  adult  citizens  in  a  country,  the  reigning  gov¬ 
ernment  and  its  elite  supporters  will  very  likely 
amass  autocratic  powers  that  will  extinguish  many 
of  the  precious  liberties  that  the  citizens  theretofore 
enjoyed.  Individuals  who  wish  to  retain  their  free¬ 
doms  must  frequently  put  aside  their  private  affairs 
to  participate  collaboratively  in  holding  govern¬ 
mental  leaders  to  account.  Republican  theorists 


believe  that  their  attentiveness  to  the  crucial  role  of 
civic  virtue  in  securing  the  enjoyment  of  freedoms 
is  at  variance  with  the  perceived  emphasis  of  mod¬ 
ern  negative-liberty  theorists  on  the  sanctity  of  the 
private  spheres  of  individuals.  Whereas  the  latter 
theorists  are  said  to  be  primarily  concerned  with 
drawing  clear  limits  past  which  any  government 
cannot  legitimately  intrude  into  people’s  lives, 
republicans  are  principally  concerned  with  stimu¬ 
lating  people  to  engage  robustly  with  the  institu¬ 
tions  that  govern  them. 

The  contrast  just  outlined  between  civic  repub¬ 
licanism  and  modern  negative-liberty  theories  has 
been  advanced  with  considerable  erudition  by 
Quentin  Skinner  in  his  essays  on  liberty  during  the 
1980s  and  early  1990s.  Nevertheless,  the  contrast 
is  largely  misconceived.  After  all,  the  positing  of  an 
instrumental  connection  between  extensive  popu¬ 
lar  political  participation  and  the  safeguarding  of 
individuals’  liberties  is  perfectly  consistent  with 
negative-liberty  theories.  Of  course,  what  would 
be  inconsistent  with  those  theories  is  any  claim 
that  the  extensive  popular  political  participation  is 
itself  true  freedom.  Such  a  claim,  envisaging  a  rela¬ 
tionship  of  equivalence  between  civic  involvement 
and  liberty,  would  be  expressive  of  one  prominent 
positive-liberty  doctrine.  It  would  therefore  clash 
in  most  respects  with  negative-liberty  theories. 
However,  as  Skinner  himself  emphasizes,  the  civic- 
republican  writers  have  not  in  fact  embraced  any 
doctrines  of  positive  liberty.  When  they  have  high¬ 
lighted  the  instrumental  links  between  the  political 
engagedness  of  citizens  and  the  security  of  indi¬ 
viduals’  freedoms,  they  have  been  propounding  a 
thesis  about  negative  liberty  rather  than  about 
positive  liberty.  Accordingly,  they  have  not  been 
affirming  any  propositions  that  are  inconsistent 
with  those  affirmed  by  negative-liberty  theorists. 
Indeed,  their  thesis  about  the  aforementioned 
instrumental  links  is  a  commonplace  among  most 
modern  political  thinkers,  including  the  exponents 
of  negative  liberty.  Civic-republican  theorists 
undoubtedly  articulate  that  thesis  adeptly,  but  they 
do  not  thereby  establish  any  substantive  difference 
between  themselves  and  the  negative-liberty  phi¬ 
losophers. 

A  second  respect  in  which  civic  republicanism 
supposedly  goes  beyond  negative-liberty  doctrines 
has  been  elaborated  since  the  mid-1990s  by 
Quentin  Skinner  and  Philip  Pettit  (as  well  as  by 
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others  whom  they  have  influenced).  According  to 
these  modern  civic-republican  writers,  their  very 
conception  of  freedom  is  more  capacious  than  the 
standard  conception  within  the  negative-liberty 
tradition.  Instead  of  concentrating  on  freedom  as 
the  absence  of  the  actual  application  of  force, 
republicans  concentrate  on  freedom  as  the  absence 
of  domination.  Domination  occurs  through  the 
actual  application  of  force — by  a  government  or 
by  some  other  powerful  party — but  it  also  occurs 
through  the  maintenance  of  background  condi¬ 
tions  of  intimidatory  control  that  render  any 
actual  application  of  force  unnecessary.  Skinner 
and  Pettit  assert  that  unless  a  conception  of  free¬ 
dom  takes  account  of  the  full  range  of  ways  in 
which  people  can  be  hemmed  in  by  domination,  it 
will  obscure  more  than  it  illuminates.  It  will  fail  to 
reveal  all  the  debilitatingly  confining  effects  of 
social  and  economic  arrangements  that  subordi¬ 
nate  some  people  to  others. 

As  has  been  argued  in  response  by  contempo¬ 
rary  negative-liberty  theorists,  all  the  insights  of 
civic  republicans  are  easily  accommodated  by  a 
proper  exposition  of  the  notion  of  negative  liberty. 
Perhaps  some  negative-liberty  theorists  in  the  past 
have  believed  that  the  only  type  of  constraint  on 
anyone’s  freedom  is  the  actual  application  of  force 
by  others,  but  such  a  view  does  not  pass  muster 
among  contemporary  negative-liberty  philoso¬ 
phers.  Such  philosophers  recognize  and  indeed 
insist  that  the  constraints  on  people’s  liberty 
include  all  the  background  conditions  of  domina¬ 
tion  identified  by  civic-republican  writers.  As  is 
apparent  from  the  U  Postulate’s  inclusion  of  dispo¬ 
sitions  among  the  potential  constraints  on  human 
freedom,  the  actual  application  of  force  is  not  a 
necessary  condition  for  the  curtailment  of  any¬ 
body’s  liberty.  The  freedom-limiting  effects  of  the 
dispositions  and  actions  that  constitute  situations 
of  domination  are  all  captured  by  the  U  Postulate. 

A  key  point  in  support  of  what  has  just  been 
said  is  that  the  units  over  which  the  modern  nega¬ 
tive-liberty  theorists  aggregate  when  they  measure 
anyone’s  freedom  are  combinations  of  conjunc¬ 
tively  exercisable  options.  (The  aggregation  also 
covers  anyone’s  combinations  of  consistent  unfree¬ 
doms.  That  element  can  be  omitted  in  the  present 
discussion.)  A  combination  of  conjunctively  exer¬ 
cisable  options  is  a  set  of  liberties  that  can  all  be 
exercised  together  simultaneously  or  sequentially. 


When  a  person  is  subject  to  domination  by  some 
other  party,  many  of  the  combinations  of  conjunc¬ 
tively  exercisable  freedoms  that  would  have  been 
available  to  him  or  her  are  not  available.  For 
example,  her  liberty  to  act  at  odds  with  the  direc¬ 
tives  of  the  dominant  party  will  not  be  conjunc¬ 
tively  exercisable  with  her  liberty  to  do  anything 
that  the  dominant  party’s  punitive  response  to  her 
disobedience  would  prevent  her  from  doing. 
Because  a  relationship  of  domination  removes 
many  combinations  of  conjunctively  exercisable 
freedoms  that  would  otherwise  have  been  available 
to  the  person(s)  subordinated  in  that  relationship, 
it  pro  tanto  reduces  the  overall  liberty  of  the 
person(s)  in  question.  This  insight  into  the  free¬ 
dom-constricting  effects  of  domination  has  been 
expounded  rigorously  by  contemporary  negative- 
liberty  theorists;  an  awareness  of  those  effects  is 
hardly  unique  to  the  civic-republican  tradition. 

Still,  although  the  virtues  of  civic  republicanism 
are  also  characteristic  of  modern  negative-liberty 
theories,  the  republican  conception  of  liberty  and 
the  negative  conception  of  it  are  not  identical.  As 
negative-liberty  philosophers  have  contended,  the 
republican  conception  of  freedom  championed  by 
Skinner  and  Pettit  is  plagued  by  a  number  of  short¬ 
comings  that  do  not  similarly  afflict  the  negative 
conception.  For  one  thing,  Skinner  appears  to  take 
the  view  that  a  person  is  unfree  to  9  only  if  he  or 
she  knows  that  he  or  she  has  been  prevented  from 
9-ing.  No  such  untenable  restriction  figures  in  the 
negative  account  of  liberty.  Pettit  imposes  another 
such  restriction  when  he  declares  that  unfreedom 
is  caused  only  by  conduct  that  is  intended  to  pro¬ 
duce  such  an  effect.  No  similar  insistence  on  inten- 
tionality  or  deliberateness  is  included  in  the  U 
Postulate  since  the  reasons  invoked  by  Pettit  in 
support  of  such  an  insistence  are  in  fact  supportive 
of  a  focus  on  any  human  conduct  that  gives  rise  to 
constraints  (whether  the  constraints  are  imposed 
deliberately  or  unwittingly  and  whether  they  are 
imposed  wrongly  or  innocently). 

Even  more  important,  the  civic-republican 
approach  mishandles  any  situation — however 
rare — in  which  someone  strong  enough  to  mistreat 
and  exploit  others  is  resolutely  disinclined  to  do 
so.  In  any  such  set  of  circumstances,  where  the 
probability  of  serious  encroachments  by  the  domi¬ 
nant  person  on  the  overall  liberty  of  his  or  her 
contemporaries  is  practically  nil,  the  redoubtable 
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might  of  that  person  (whether  bodily  strength  or 
some  other  form  of  power)  does  not  lessen  anyone 
else’s  overall  liberty  significantly.  For  example, 
their  lineage  or  talents  can  make  them  someone  to 
whom  many  other  people  would  eagerly  attach 
themselves  as  loyal  subordinates  if  the  former  were 
to  allow  them  to  do  so. 

Because  of  his  or  her  reclusive  diffidence  and, 
consequently,  firm  unwillingness  to  take  advan¬ 
tage  of  his  or  her  superiority,  the  dominant  person 
does  not  oblige  other  people  to  adjust  their  behav¬ 
ior  to  his  or  her  desires.  Uncommon  though  such  a 
situation  may  be,  it  is  plainly  possible.  Civic  repub¬ 
licans,  who  assert  that  anyone’s  sheer  possession 
of  the  capacity  to  dominate  is  sufficient  to  deprive 
his  or  her  contemporaries  of  their  freedom,  are 
committed  to  the  view  that  the  diffident  recluse  in 
the  envisaged  situation  has  severely  curtailed  the 
freedom  enjoyed  by  the  people  in  his  or  her  vicin¬ 
ity.  Such  an  analysis  of  the  situation  is  distortive 
rather  than  illuminating. 

Still  more  distortive  is  the  republican  approach 
in  application  to  a  scenario  of  a  dominator  who  is 
subordinated  by  someone  whom  he  himself  could 
oppress.  Suppose  that  Lennie  is  mightily  capable  of 
prodigious  feats  of  strength,  while  George — who  is 
much  smaller  and  weaker  than  Lennie — is  pos¬ 
sessed  of  domineering  tough  mindedness  that  off¬ 
sets  his  physical  deficiencies.  (Lennie  and  George 
bear  a  considerable  resemblance,  though  not  a 
perfect  resemblance,  to  their  namesakes  in  John 
Steinbeck’s  Of  Mice  and  Men.)  Although  Lennie  is 
physically  much  more  powerful  than  George  and 
although  the  intellectual  disparity  between  the  two 
men  is  not  vast,  the  differences  between  their  tem¬ 
peraments  eventuate  in  the  general  domination  of 
the  more  brawny  man  by  the  more  diminutive. 
George  continually  browbeats  Lennie  into  per¬ 
forming  countless  menial  tasks  that  serve  George’s 
needs  and  comfort,  and  he  insists  on  getting  his 
way  whenever  Lennie  forms  intentions  that  are  at 
odds  with  his  own.  He  terrorizes  Lennie  with  his 
fits  of  temper  and  his  piercing  insults;  Lennie,  thor¬ 
oughly  overmastered  and  intimidated  by  George, 
is  his  dutiful  servant.  Now,  even  if  this  scenario 
were  to  be  moderated  by  the  addition  of  some  ties 
of  friendship  and  protection  between  the  two  men, 
the  basic  point  illustrated  by  it  would  remain 
prominent.  Somebody  fully  capable  of  dominating 
another  person — somebody,  therefore,  whom  the 


civic  republicans  will  classify  as  a  dominator — 
can  turn  out  to  be  exploited  and  bullied  by  that 
other  person.  If  we  were  to  describe  George  as 
having  been  made  unfree  by  Lennie’s  dominating 
strength,  we  would  be  distorting  his  situation 
markedly. 

Admittedly,  it  might  be  that  George’s  irascible 
and  imperious  behavior  is  itself  necessary  to  ward 
off  domination  by  Lennie.  If  George’s  aggressively 
tyrannical  conduct  is  indeed  a  means  of  defending 
his  own  overall  liberty,  then  that  liberty  has  been 
diminished  by  Lennie’s  daunting  presence.  After 
all,  were  George  to  abstain  from  his  domineering 
surliness  under  such  circumstances,  he  would  very 
quickly  render  himself  unfree  to  undertake  any 
projects  or  activities  that  will  have  been  precluded 
by  Lennie’s  assumption  of  the  ascendant  posture 
in  their  relationship.  Under  such  circumstances, 
that  is,  Lennie’s  latent  disposition  to  exert  his 
might  has  extinguished  the  conjunctive  exercis¬ 
ability  of  many  of  George’s  liberties.  Nonetheless, 
this  feature  of  the  situation  is  purely  contingent.  A 
softening  of  George’s  demeanor  might  instead 
produce  less  dramatic  effects.  Perhaps,  it  would 
not  induce  any  substantial  changes  at  all  in  the 
subordinate  posture  of  Lennie,  who  might  be 
unshakably  habituated  to  George’s  dominance. 
Or,  what  is  slightly  more  plausible,  the  growing 
emollience  of  George  might  simply  induce  Lennie 
to  become  less  dutiful  and  subservient  without 
actually  prompting  him  to  act  despotically  toward 
George.  Instead  of  inverting  the  previous  relation¬ 
ship  of  mastery  and  submission,  Lennie  might 
simply  opt  to  live  alongside  George  as  an  equal  or 
perhaps  he  would  separate  from  George  and  go 
his  own  way.  Whatever  might  be  the  precise  out¬ 
come  of  a  marked  alteration  in  George’s  authori¬ 
tarian  mien,  it  would  not  necessarily  involve  any 
significant  loss  of  liberty  for  George  himself  (espe¬ 
cially  if  the  baseline  for  measuring  the  loss  is  a 
situation  in  which  neither  of  the  two  men  domi¬ 
nates  the  other).  In  that  case,  his  currently  over¬ 
bearing  behavior  is  not  a  means  of  safeguarding 
his  own  freedom  against  the  potential  dominance 
of  Lennie  but  is  straightforwardly  a  means  of 
coercing  and  manipulating  the  bulkier  man.  Such 
a  state  of  affairs  can  obtain  even  though  the  civic 
republicans’  analyses  of  unfreedom  clearly  gener¬ 
ate  the  conclusion  that  Lennie  is  a  dominator. 
Someone  can  qualify  as  a  dominator — given  how 
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that  status  is  defined  by  the  civic  republicans — 
without  significantly  impairing  the  overall  free¬ 
dom  of  anyone  else. 

In  short,  in  the  rare  circumstances  where  the 
capacity  to  dominate  genuinely  involves  extremely 
low  probabilities  of  nontrivial  encroachments  on 
the  freedom  of  the  people  over  whom  that  capacity 
could  be  exerted,  the  sheer  susceptibility  of  those 
people  to  the  exercise  of  that  capacity  is  not  to  be 
classified  as  a  state  of  wide-ranging  unfreedom. 
Neither  a  diffidently  reclusive  mighty  person  nor 
Lennie  in  his  relationship  with  George  is  signifi¬ 
cantly  abridging  the  overall  freedom  of  anyone  to 
whom  he  is  hugely  superior  in  strength.  Civic 
republicans,  with  their  insistence  that  the  capacity 
to  dominate  is  itself  sufficient  to  produce  depen¬ 
dence  and  consequent  unfreedom,  do  not  provide 
satisfactory  accounts  of  the  situations  discussed  in 
the  previous  few  paragraphs. 

Most  of  these  remarks  about  the  curtailment  or 
noncurtailment  of  overall  freedom  are  applicable 
mutatis  mutandis  to  the  inexistence  or  existence  of 
particular  freedoms.  If  somebody  with  the  capacity 
to  dominate  is  firmly  disposed  to  use  his  or  her 
ascendancy  to  prevent  a  subordinate  person  S  from 
9-ing  in  the  event  that  S  endeavors  to  9,  then 
observers  are  warranted  in  saying  that  S  is  unfree 
to  9.  For  most  purposes,  any  probabilistic  qualifi¬ 
cation  attached  to  such  an  ascription  of  unfreedom 
can  be  left  implicit.  By  contrast,  if  the  likelihood  of 
the  powerful  person’s  prevention  of  S  from  9-ing  in 
the  event  of  S’ s  endeavoring  to  9  is  lower  but  still 
significant,  then  statements  about  S’ s  unfreedom  to 
9  should  be  overtly  probabilistic.  Finally,  if  the 
likelihood  of  the  powerful  person’s  prevention  of  S 
from  9-ing  in  the  event  of  S’ s  endeavoring  to  9  is 
exceedingly  small,  then  observers  are  warranted  in 
affirming  that  S  is  free  to  9  (vis-a-vis  the  powerful 
person).  Once  again,  any  probabilistic  qualification 
can  for  most  purposes  be  left  implicit.  In  other 
words,  with  regard  to  the  reduction  or  nonreduc¬ 
tion  of  a  person’s  overall  freedom  and  with  regard 
to  the  inexistence  or  existence  of  any  of  a  person’s 
particular  freedoms,  the  basic  focus  of  observers  in 
any  context  marked  by  someone’s  dominant 
strength  should  be  the  same.  The  crucial  consider¬ 
ation  in  such  a  context  is  not  the  sheer  presence  of 
the  dominant  strength  but  the  probability  that  that 
strength  will  result  in  the  prevention  of  sundry 
actions  or  sundry  combinations  of  actions. 


Physical  Versus  Deontic  Liberty 

Both  in  the  F  Postulate  and  in  the  U  Postulate,  the 
chief  concepts  are  modal  rather  than  deontic.  That 
is,  they  concern  what  can  or  cannot  occur  rather 
than  what  should  or  should  not  occur.  They  con¬ 
cern  what  each  person  is  able  or  unable  to  do 
rather  than  what  each  person  is  permitted  or  for¬ 
bidden  to  do.  They,  thus,  pertain  to  physical  free¬ 
doms  and  unfreedoms  rather  than  to  deontic  free¬ 
doms  and  unfreedoms.  Someone  is  physically  free 
to  9  if  and  only  if  he  or  she  is  physically  unpre¬ 
vented  from  9-ing,  and  he  or  she  is  physically 
unfree  to  9  if  and  only  if  he  or  she  is  physically 
prevented  from  9-ing  as  a  result  of  some  actions  or 
dispositions  to  perform  actions  on  the  part  of 
some  other  person(s).  Here,  physically  is  not  to  be 
understood  in  contrast  with  mentally  or  psycho¬ 
logically,  rather,  the  relevant  contrast  is  between 
physically  and  normatively. 

Deontic  freedom,  contrariwise,  consists  not  in 
physical  unpreventedness  but  instead  in  permitted- 
ness  or  unforbiddenness.  If  somebody  is  deonti- 
cally  free  to  9,  then  he  or  she  is  allowed  to  9  by 
any  applicable  authoritative  norms,  such  as  legal 
mandates,  moral  principles,  or  institutional  rules. 
Conversely,  if  somebody  is  deontically  unfree  to  9, 
then  he  or  she  is  prohibited  from  9-ing  by  one  or 
more  of  those  authoritative  norms.  When  we  ask 
whether  somebody  is  deontically  free  to  9,  we  are 
not  asking  whether  he  or  she  is  capable  of  9-ing; 
we  are  asking  whether  he  is  entitled  to  9. 

Physical  liberty  and  deontic  liberty  differ  in  a 
number  of  respects  that  derive  in  various  ways 
from  the  basic  modal/deontic  difference  just 
recounted.  For  one  thing,  the  predicates  “is  deonti¬ 
cally  free”  and  “is  deontically  unfree”  are  contra¬ 
dictories  rather  than  merely  contraries,  and  thus 
the  predicates  “is  deontically  unfree”  and  “is  not 
free  deontically”  are  equivalent.  In  other  words, 
the  concept  of  deontic  freedom  is  bivalent  rather 
than  trivalent.  Someone  is  deontically  free  to  9  if 
and  only  if  he  or  she  is  not  deontically  unfree  to  9. 
In  regard  to  such  freedom,  there  is  no  category  that 
corresponds  to  the  category  of  mere  inabilities. 

Perhaps  the  most  obvious  dissimilarity  between 
the  concept  of  physical  freedom  and  the  concept  of 
deontic  freedom  is  their  extensional  nonequiva¬ 
lence.  That  is,  a  person  will  often  be  deontically 
free  to  9  without  being  physically  free  to  9,  and 
vice  versa.  For  example,  although  Norma  is  both 


Liberty  1457 


legally  and  morally  permitted  to  run  a  mile  under 
4  minutes,  she  is  not  physically  able  to  do  so;  her 
deontic  liberty  to  run  at  that  speed  is  not  accompa¬ 
nied  by  a  corresponding  physical  liberty.  Con¬ 
versely,  although  she  is  physically  able  to  assault 
unprovokedly  the  person  standing  directly  ahead  of 
her  in  a  queue,  she  is  neither  legally  nor  morally 
permitted  to  do  so.  Her  physical  freedom  to  commit 
the  assault  is  not  accompanied  by  a  corresponding 
deontic  freedom.  Permissibility  and  ability  can  coin¬ 
cide  and  very  frequently  do  coincide,  but  they  like¬ 
wise  frequently  diverge. 

A  more  subtle  dissimilarity  between  physical 
liberty  and  deontic  liberty  pertains  to  the  isolability 
of  actions.  The  removal  of  someone’s  physical  free¬ 
dom  to  9  will  often  require  the  removal  of  his  or  her 
physical  freedoms  to  do  things  that  are  crucially 
prerequisite  to  9-ing,  whereas  the  removal  of  his  or 
her  deontic  freedom  to  9  (e.g.,  through  the  enact¬ 
ment  of  a  legal  ban  on  9-ing)  never  requires  the 
removal  of  any  of  his  or  her  deontic  freedoms  to  do 
things  that  are  crucially  prerequisite  to  9-ing.  If 
people  remain  physically  free  to  take  steps  that 
would  immediately  antecede  their  9-ing,  then  the 
prevention  of  their  9-ing  will  typically  depend  on 
monitoring  and  rapid  interventions  by  other  people. 
In  connection  with  some  activities,  such  monitoring 
and  interventions  will  be  feasible;  in  connection 
with  many  other  activities,  however,  those  last- 
minute  preventive  intrusions  will  not  be  realistically 
possible.  An  intervention  at  an  earlier  stage  is  some¬ 
times  essential  if  a  person’s  physical  freedom  to  9  is 
genuinely  to  be  eliminated.  Nothing  similar  is  ever 
essential  for  the  removal  of  someone’s  deontic  lib¬ 
erty  to  9.  Precisely  because  the  elimination  of  any 
person’s  deontic  freedom  to  9  concerns  what  is 
impermissible  rather  than  what  is  impossible,  that 
elimination  is  perfectly  consistent  with  a  situation 
in  which  the  person  is  deontically  free  to  do  virtu¬ 
ally  everything  that  is  physically  indispensable  for 
his  or  her  9-ing. 

A  further  difference  between  physical  liberty 
and  deontic  liberty  is  centered  on  the  avoidability 
of  actions.  A  person  whose  mind  has  not  been 
completely  taken  over  by  someone  else  in  a  science 
fiction  scenario  or  by  certain  severe  mental  ill¬ 
nesses  will  retain  the  physical  freedom  to  eschew 
any  particular  action.  At  the  very  least,  such  a 
person  will  always  have  the  option  of  surrendering 
in  a  wholly  passive  manner  to  the  operations  of 


external  forces.  Accordingly,  there  is  no  such  thing 
as  a  physically  unavoidable  action.  When  we  cross 
from  the  realm  of  the  physical  to  the  realm  of  the 
deontic,  however,  we  encounter  a  very  different 
situation.  Anybody  can  be  deontically  unfree  to 
forgo  certain  types  or  instances  of  conduct.  For 
example,  a  person  whose  income  is  subject  to 
taxation  will  typically  be  legally  unfree — and 
probably  also  morally  unfree — to  abstain  from 
writing  his  or  her  name  on  any  income  tax  forms 
that  require  his  or  her  signature.  Though  the  per¬ 
son  is  physically  free  to  refrain  from  signing  those 
forms  in  a  timely  fashion,  he  or  she  is  not  deonti¬ 
cally  free  to  refrain  and  is  legally  obligated,  and 
probably  also  morally  obligated,  to  sign  the  rele¬ 
vant  documents.  In  this  case,  as  in  multitudinous 
other  cases,  legal  or  moral  requirements  can  make 
the  performance  of  certain  actions  mandatory.  In 
that  respect,  legal  or  moral  mandatoriness  differs 
from  the  material  impediments  that  limit  some¬ 
body’s  physical  freedom. 

Of  course,  these  several  divergences  between 
physical  liberty  and  deontic  liberty  should  not 
induce  anyone  to  overlook  the  many  affinities 
between  them.  Liberty  of  each  type  consists  in  an 
absence  of  constraints.  Though  physical  constraints 
differ  from  deontic  constraints  in  the  sundry  ways 
that  have  just  been  recounted,  unconstrainedness  is 
the  essence  of  deontic  freedom  just  as  it  is  of  phys¬ 
ical  freedom.  Freedom  of  either  type  is  negative 
rather  than  positive. 
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Lobbying 


Lobbying  is  the  interaction  of  an  individual, 
group,  interest,  or  organization  with  government 
to  influence  current  policy  or  create  a  situation 
conducive  to  shaping  future  policy.  It  is  a  funda¬ 
mental  and  omnipresent  aspect  of  all  political 
systems,  from  liberal  to  elitist  democracies,  to 
authoritarian  and  even  totalitarian  regimes.  Yet 
although  there  are  common  elements  to  lobbying 
across  all  political  systems,  the  particular  form 
that  the  lobbying  process  takes  in  a  country,  state, 
province,  city,  or  even  internationally,  is  shaped 
by  several  governmental,  political,  and  cultural 
elements.  Furthermore,  lobbying  is  inextricably 
bound  with  the  activities  of  lobbyists — that  is, 
those  representing  individuals,  collective  interests, 
and  interest  groups  to  government. 

This  entry  draws  on  research  on  lobbying 
across  all  political  systems  but  focuses  mainly  on 
established  liberal  democracies  and  developing 
democracies.  First,  we  look  at  the  relationship 
between  practical  politics  and  political  science 
work  on  lobbying,  followed  by  the  common 
denominators  of  lobbying  and  its  ubiquitous 
nature.  Next,  the  major  theoretical  explanations 
are  discussed,  followed  by  how  institutional  struc¬ 
ture  affects  lobbying.  Then,  we  consider  the  types 
and  role  of  lobbyists  in  the  United  States  and  other 


Western  democracies,  followed  by  the  issue  of 
lobby  regulation.  Finally,  we  look  at  some  trends 
in  lobbying  across  the  world. 

Development  of  Lobbying 
Research  and  Theory 

Even  though  lobbying  has  always  been  a  fixture  of 
virtually  all  groups  and  governments,  only  in  the 
past  100  years  has  it  been  recognized  by  scholars  as 
a  fundamental,  prominent,  and  crucial  part  of  policy 
making.  Studies  of  lobbying  did  not  occupy  a  major 
place  in  American  political  science  until  after  the 
work  of  David  Truman  (The  Governmental  Process, 
1951).  The  increased  academic  interest  in  lobbying 
in  the  United  States  from  the  1960s  onward  was  also 
due  to  the  advocacy  explosion,  first  in  Washington, 
D.C.,  and  later  in  the  American  states,  as  more  and 
more  interests  and  interest  groups  began  to  lobby 
and  expanded  their  range  of  lobbying  techniques. 

Yet  as  late  as  the  mid-1950s,  many  scholars  saw 
lobbying  and  lobbyists  as  a  purely  American  phe¬ 
nomenon — a  product  of  the  separation-of-powers 
system  and  not  an  aspect  of  parliamentary  sys¬ 
tems.  In  Anonymous  Empire,  Samuel  Finer  showed 
this  to  be  an  erroneous  view  of  British  politics,  and 
work  on  lobbying  in  other  parliamentary  systems 
followed.  As  late  as  the  1970s,  however,  many 
texts  on  the  politics  of  countries  such  as  Britain, 
France,  Germany,  and  Australia  did  not  include 
the  terms  lobbying,  lobbyist,  or  even  interest 
groups  (or  pressure  groups  as  they  were  often 
called)  in  their  indexes.  Even  today,  these  terms  do 
not  appear  in  many  treatments  of  the  politics  of 
developing  and  reemerging  democracies,  such  as 
those  in  Latin  America,  Africa,  and  Asia. 

Despite  the  late  development  of  academic  inter¬ 
est  in  lobbying  and  lobbyists,  extensive  research 
and  some  theories  now  exist.  The  lessons  and  char¬ 
acteristics  of  this  research  can  be  summed  up  in  the 
following  six  points: 

1.  While  there  are  thousands  of  studies,  mainly 
case  studies,  from  around  the  world,  there  is  no 
general  theory  of  lobbying  and  lobbyists.  Even 
so,  there  is  general  agreement  among  scholars 
on  the  other  five  points. 

2.  There  are  three  interrelated  factors,  in 
particular,  that  shape  the  lobbying  environment 
in  a  place:  the  degree  of  political  pluralism  or 
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authoritarianism,  the  structure  of  governmental 
institutions  regarding  centralization  or 
fragmentation  of  the  policy  process,  and  the 
political  culture. 

3.  Lobbyists  exist  in  all  political  systems  but  take 
on  different  guises  in  different  systems. 

4.  The  last  40  years  or  so  have  seen  the  use  of  an 
increasing  range  of  lobbying  strategies  and 
tactics,  not  only  in  developed  pluralist 
democracies  but  also  in  developing  democracies 
and  even  authoritarian  systems. 

5.  Largely  because  of  the  popular  belief  that 
lobbying  and  lobbyists  benefit  existing  powerful 
interests  at  the  expense  of  the  rest  of  society  and 
because  of  corruption  scandals  associated  with 
lobbying  activity,  lobbying  and  lobbyists  are  held 
in  low  regard  around  the  world.  Ironically, 
however,  and  especially  in  democracies,  hundreds 
of  millions  of  citizens  belong  to  interest  groups. 

6.  Because  of  the  potential  for  abuse  and  for 
undermining  the  goals  of  authoritarian  regimes, 
the  regulation  of  lobbying  and  lobbyists  has  been 
an  aspect  of  public  policy  around  the  world. 

The  Nature  of  Lobbying 

When  two  or  more  people  get  together,  particu¬ 
larly  in  large  groups  and  in  countries,  they  want  to 
influence  others  largely  to  secure  economic,  politi¬ 
cal,  or  other  benefits  for  themselves,  their  group, 
or  society  as  a  whole.  This  is  the  motive  for  lobby¬ 
ing  and  why  it  has  always  been  a  central  part  of 
human  society. 

Common  Denominators  of  Lobbying 

The  most  fundamental  processes  involved  in 
lobbying  to  achieve  a  benefit  at  any  group,  state, 
national,  or  international  level  consist  of  three 
interrelated  activities.  First,  it  requires  getting 
access  to  the  person  or  persons  who  can  make  the 
decision;  second,  it  requires  building  a  relationship 
with  these  people  and,  third,  influencing  them — 
exerting  power. 

Whether  it  is  the  Roman  Empire,  the  court  of 
Louis  XIV  of  France,  Stalin’s  Soviet  Union,  a  mod¬ 
ern  liberal  democracy  such  as  the  United  States  or 
New  Zealand,  an  emerging  democracy  such  as 


Kenya  or  Vietnam,  or  a  closed  authoritarian 
regime  such  as  North  Korea  or  Myanmar,  these 
three  stages  of  lobbying  occur  in  some  form  at 
various  times.  In  nonpluralist  systems,  however, 
they  may  not  occur  in  all  policy  situations  and  may 
not  be  very  obvious. 

The  Ubiquitousness  of  Lobbying 

If  lobbying  takes  place  in  all  groups  and  political 
systems,  why  is  it  that  until  recently,  it  was  seen  to 
occur  in  the  United  States  only  and  was  seen  as  vir¬ 
tually  nonexistent  in  authoritarian  political  systems? 
The  answer  to  this  question  lies  in  the  difference 
between  officially  recognized  and  visible  lobbying 
and  unofficial  and  behind-the-scenes  lobbying. 

In  the  United  States,  much  of  the  lobbying  is 
open  to  view  and  officially  sanctioned  by  the 
Constitution  and  various  laws.  Until  recently,  in 
most  other  democracies  and  in  organizations  such 
as  the  European  Union  (EU)  and  the  United 
Nations,  it  was  less  public  (which  should  not  nec¬ 
essarily  be  equated  with  illegal  or  underhanded) 
and  thus  not  seen  to  be  performed  by  people  called 
lobbyists.  Nevertheless,  lobbying  and  lobbyists 
have  always  been  very  much  a  part  of  these  politi¬ 
cal  systems.  And  even  though  in  authoritarian 
regimes  lobbying  and  related  organizations  were 
either  banned  (as  under  many  military  regimes  in 
Latin  America)  or  not  seen  to  be  necessary  (as  in 
communist  countries),  individuals  close  to  the 
leaders,  such  as  military  personnel  or  their  close 
friends  and  allies,  engaged  in  lobbying  to  get  their 
policies  enacted  or  to  block  the  political  goals  of 
their  opponents. 

Explaining  Variations  in  Lobbying  Processes 

Thus,  just  because  it  appears  that  lobbying  is 
not  taking  place  in  a  particular  government  does 
not  mean  that  lobbying  is  absent.  The  actual  pro¬ 
cess  of  lobbying,  however — namely,  the  avenues  of 
access,  forging  relationships  with  policymakers, 
and  influencing  them — varies  across  political  juris¬ 
dictions  according  to  the  three  factors  identified 
earlier. 

First,  the  more  pluralist  the  political  system,  and 
particularly  the  institutionalization  and  acceptance 
of  political  opposition,  the  more  visible  and  for¬ 
malized  lobbying  and  the  activities  of  lobbyists  are 
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likely  to  be  and,  in  most  circumstances,  the  wider 
the  range  of  strategies  and  tactics  employed.  In 
contrast,  in  a  more  authoritarian  system,  lobbying 
will  include  little  more  than  personal  contacts  with 
the  monarch  or  dictator. 

Second,  institutional  structure  determines  the 
power  or  decision  points  that  lobbyists  need  to 
focus  on  in  their  lobbying  effort.  This  structure  is 
usually  very  clear  in  authoritarian  systems,  but  it 
can  be  more  complex  in  pluralist  democracies.  In 
the  U.S.  separation-of-powers  system,  where  there 
is  often  bipartisanship  on  some  issues,  power  is 
fragmented  between  the  three  branches  of  govern¬ 
ment  and  also  between  the  federal  and  state  levels. 
In  parliamentary  systems  with  strong  political  par¬ 
ties,  however,  power  is  much  more  concentrated  in 
the  executive,  even  in  federal  systems  such  as 
Germany.  Another  factor  is  the  extent  of  neo¬ 
corporatism  or  pluralism  in  liberal  democracies, 
particularly  those  in  Western  Europe. 

Third,  while  research  on  how  political  culture — 
the  political  values  of  the  governed  and  those  in 
government — affects  lobbying  is  not  extensive  and 
rather  inconclusive,  a  good  case  can  still  be  made 
that  this  factor  may  be  the  most  important  deter¬ 
minant  of  the  activity  of  lobbying  and  lobbyists. 
Broad  acceptance  of  the  existence  of  interest 
groups  and  the  legitimacy  of  their  activities,  as  in 
most  liberal  democracies,  leads  to  mass  group 
membership  and  the  necessity,  openness,  and  for¬ 
malization  of  lobbying,  albeit  with  some  skepti¬ 
cism,  in  a  pluralist  democracy.  In  contrast,  no  or 
minimal  public  and  policymaker  legitimization  of 
lobbying  and  lobbyists,  as  in  authoritarian  and 
many  developing  democracies,  leads  to  low  group 
membership  and  continued  and  less  formalized 
and  open  lobbying  processes. 

Lobbying  in  the  United  States: 

An  Aberrant  Case 

An  irony  of  the  lobbying  system  in  the  United 
States  is  that,  although  it  was  once  seen  as  the  only 
place  where  lobbying  took  place  and  is  often  held 
as  a  benchmark  for  being  the  most  advanced  of 
lobbying  systems,  in  many  ways,  it  is  an  aberration 
compared  with  the  lobbying  systems  in  other  lib¬ 
eral  democracies.  This  is  because  its  institutional 
structure  and  political  culture  have  influenced  lob¬ 
bying  in  some  unique  ways. 


The  U.S.  political  system  was  designed  to  prevent 
precipitous  actions  such  as  the  American  colonies 
had  experienced  under  George  III  of  England.  So 
the  separation-of-powers  system,  in  which  the 
Congress  and  the  President  share  legislative  power, 
was  born.  Later,  the  courts,  particularly  the  U.S. 
Supreme  Court,  also  acquired  a  policy  role  by  being 
able  to  interpret  and  strike  down  the  acts  of  the 
other  two  branches.  Add  to  this  the  federal  system 
(i.e.,  the  federal  agencies  that  make  up  the  “admin¬ 
istrative  state” )  and  the  fact  that  ideology  has  never 
been  as  strong  in  the  United  States  as  in  Europe, 
resulting  in  less  disciplined  political  parties,  and  the 
product  is  a  fragmented — very  pluralistic — policy 
process  with  many  power  points.  As  more  and  more 
groups  and  interests  got  involved  in  lobbying  from 
the  1960s  onward,  it  led  to  hyperpluralism — that  is, 
more  and  more  lobbying  forces  competing  for  the 
attention  of  the  same  number  of  policymakers.  In 
turn,  this  was  an  impetus  to  the  development  of 
new  and  indirect  lobbying  techniques,  such  as  grass¬ 
roots  lobbying,  media  campaigns,  and  political 
action  committees  (organizations  that  contribute 
money  to  candidates’  election  campaigns),  in  an 
effort  to  gain  access  to  and  influence  policymakers. 

While  not  all  the  processes  of  lobbying  in  other 
democracies  are  in  contrast  to  those  in  the  United 
States,  there  are  some  notable  differences.  With 
centralized  policy-making  processes  shaped  by  the 
executive  branch  in  most  of  these  countries,  there  is 
much  less  political  benefit  from  lobbying  the  parlia¬ 
ment  or  using  the  courts.  Plus,  much  lobbying  is 
done  through  political  parties  and  can  be  especially 
productive  for  those  associated  with  the  party  or 
coalition  in  power.  With  various  degrees  of  corpo- 
ratist  arrangements,  from  the  very  liberal  democra¬ 
cies  of  Scandinavia  to  the  reemerging  democracies 
of  Latin  America,  there  is  not  the  same  degree  of 
hyperpluralism  in  these  countries  and  thus  not  the 
same  need  for  the  broad  range  of  lobbying  tech¬ 
niques  used  in  the  United  States.  For  a  combination 
of  all  these  reasons  and  because  lobbying  was  not 
seen  as  part  of  these  systems  until  very  recently,  the 
U.S.  lobbying  profession  has  not  developed  in  the 
same  way  as  in  other  democracies. 

Theories  of  Lobbying  in  Liberal  Democracies 

Like  the  rest  of  political  science,  interest  group 
studies  from  the  1960s  onward  were  influenced  by 
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the  behavioralist  and  empirical  approaches  to 
research.  Since  then,  even  though  no  general  theory 
of  lobbying  or  one  that  integrates  existing  theoreti¬ 
cal  approaches  has  been  developed,  quantitative 
and  qualitative  techniques  and  a  combination  of 
the  two  have  been  used  to  produce  theoretical 
explanations  of  lobbying.  Virtually  every  major 
research  approach  in  the  social  sciences  has  been 
used,  including  decision  theory,  rational  choice 
theory,  cost-benefit  analysis,  game  theory, 
exchange  theory,  and  new  institutionalism. 

One  particularly  enlightening  theory  is  that 
developed  by  John  R.  Wright  using  communications 
theory.  Although  Wright’s  research  focused  on  the 
U.S.  Congress,  it  likely  has  general  application  for 
lobbying  around  the  world.  The  assumption  that 
lobbyists  are  primarily  information  providers  is  at 
the  root  of  his  theory.  After  Wright  establishes  that 
legislators  generally  have  three  overarching  goals — 
reelection,  the  desire  to  make  good  public  policy, 
and  to  exercise  some  power — he  argues  that  legisla¬ 
tors  are  forced  to  pursue  these  goals  in  a  very 
uncertain  and  ever-changing  environment.  This  is 
where  lobbyists  and  lobbying  come  in  with  a 
mutual  legislator-lobbyist  benefit  to  the  communi¬ 
cation  of  information.  On  the  one  hand,  this  infor¬ 
mation  will  influence  government  officials  to  the 
benefit  of  the  group  or  organization  the  lobbyist 
represents.  On  the  other,  this  information  reduces 
uncertainty  and  helps  legislators  learn  how  best  to 
achieve  their  goals.  This,  to  Wright,  is  not  only  the 
core  of  the  lobbying  process  but  the  major  form  of 
influence. 

This  theory  certainly  hits  on  a  key  element  in 
lobbying.  Yet  other  research,  especially  on  the 
50  states  in  the  United  States  by  Clive  Thomas  and 
Ronald  Hrebenar,  argues  that  there  are  other  key 
elements  in  the  lobbying  process  and  lobbying 
influence.  The  2  major  elements  of  the  12  that 
Thomas  and  Hrebenar  identify  are  the  extent  to 
which  policymakers  need  the  group  or  organization 
lobbying  them  and  lobbyist-policymaker  relations. 
The  more  a  policymaker  needs  a  group — the  more 
a  group  can  build  up  a  sense  of  need  or  obliga¬ 
tion — because  of  votes,  money,  information,  or 
whatever,  the  more  likely  the  policymaker  is  to  sup¬ 
port  the  group  and  its  cause.  Good  lobbyist-policy- 
maker  relations  built  on  credibility  and  trust  also 
appear  to  be  key,  and  the  longer  the  relationship, 
the  more  likely  the  lobbyist  is  to  influence  the 


policymaker.  Furthermore,  this  lobbyist-policy¬ 
maker  contact  is  the  essence  of  lobbying  and  deter¬ 
mines  the  ultimate  success  or  failure  of  an  issue. 
This  has  always  been  the  case  and  likely  always 
will  be. 

What  is  needed  is  an  integration  of  various  lob¬ 
bying  theories  to  provide  a  comprehensive  under¬ 
standing.  This  is  inhibited,  however,  by  the  diverse 
nature  of  interest  group  studies  and  the  fact  that 
several  scholars  are  wedded  to  one  research  method 
to  the  exclusion  of  others. 

Lobbyists  in  the  United  States 
and  Around  the  World 

Earlier,  we  defined  a  lobbyist  as  someone  who 
represents  individuals,  collective  interests,  and 
interest  groups  to  government.  A  more  compre¬ 
hensive  definition,  developed  by  Thomas,  which 
embraces  lobbyists  around  the  world  is  as  follows: 
A  lobbyist  is  a  person  designated  by  an  interest 
group  to  facilitate  influencing  public  policy  in  that 
group’s  favor  by  performing  one  or  more  of  the 
following  for  the  group:  (1)  directly  contacting 
public  officials,  (2)  monitoring  political  and 
governmental  activity,  (3)  advising  on  political 
strategies  and  tactics,  and  (4)  developing  and 
orchestrating  the  group’s  lobbying  effort. 

This  definition  succinctly  embraces  the  four 
major  roles  that  lobbyists  perform.  Not  all  lobby¬ 
ists  perform  all  four  tasks.  This  depends  on  the 
organization  and  on  the  issue  at  hand. 

Types  of  Lobbyists  and  Alternative  Designations 

The  word  lobbyist  is  often  used  as  if  all  lobby¬ 
ists  were  the  same.  In  fact,  there  are  different  types 
of  lobbyists  from  different  backgrounds,  which 
often  affects  their  power  base — their  ability  to  be 
effective.  Until  recently,  varying  designations  were 
used  to  distinguish  between  types  of  lobbyists, 
including  independent  lobbyists  and  professional 
lobbyist,  both  of  which  were  sometimes  also 
referred  to  as  hired  guns  because  they  are  paid  spe¬ 
cifically  to  lobby.  These  designations  were  some¬ 
times  contrasted  with  amateur  lobbyist,  though  it 
was  often  unclear  whether  the  term  amateur  meant 
unpaid  or  not  well  versed  in  lobbying  techniques. 
In  the  United  States,  by  the  late  1980s,  a  standard 
terminology  of  types  of  lobbyists  emerged  among 
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scholars  and  political  practitioners.  There  are  now 
seen  to  be  five  types  of  lobbyists: 

1.  Contract  lobbyists  are  hired  on  contract  for  a 
fee  specifically  to  lobby.  They  often  represent 
more  than  one  client  and  are  likely  to  be  former 
elected  officials  or  staffers  who  are  purveying  to 
their  clients  their  contacts  with  decision  makers 
built  up  over  many  years. 

2.  In-house  lobbyists  are  employees  of  an 
association,  organization,  or  business,  who  as 
part  or  all  of  their  job  act  as  lobbyists.  They 
represent  only  one  client — their  employer — and 
are  likely  to  have  come  from  the  business  or 
profession  they  represent.  Their  major  asset  is 
unequaled  knowledge  in  their  field,  such  as 
education,  the  environment,  or  issues  and  laws 
affecting  the  disabled. 

3.  Legislative  liaisons  are  employees  of  government 
agencies  who  represent  their  agency  to  the 
legislative  and  executive  branches  of 
government.  They  also  represent  only  one 
client — their  agency.  They  tend  to  be  career 
bureaucrats  or  former  legislative  or  executive 
staff  and  have  an  intimate  knowledge  of  their 
policy  area. 

4.  Volunteer  or  cause  lobbyists  are  those  who 
represent  citizen  and  community  organizations 
or  informal  groups.  They  rarely  represent  more 
than  one  interest  and  are  usually  not  paid.  Their 
power  base,  as  much  as  they  have  one,  is 
founded  on  their  commitment  to  the  cause  and 
their  tenacity. 

5.  Private  individual  lobbyists  are  those  acting  on 
their  own  behalf  and  not  designated  by  any 
organization  as  an  official  representative.  Unless 
these  are  prominent  citizens,  such  as  those  who 
own  sports  teams,  these  will  not  have  much  of  a 
power  base. 

Partly  because  of  the  negative  attitude  toward 
lobbyists  and  lobbying,  lobbyists  are  often  referred 
to  by  other  euphemistic  designations.  These 
include  political  consultant,  government  affairs 
representative,  public  affairs  representative,  and  so 
on.  And  because  of  the  negative  perception  of  lob¬ 
bying  in  other  Western  democracies  and  because 
they  have  only  recently  recognized  the  role  of 


interest  groups  and  lobbyists,  there  is  no  agreed- 
on  nomenclature  for  lobbyists  outside  the  United 
States.  Euphemistic  designations  similar  to  those 
used  in  the  United  States  are  used  for  lobbyists  in 
these  countries. 

One  major  difference  between  the  United  States 
and  other  democracies  is  the  role  of  the  hired  or 
contract  lobbyist  (the  term  contract  lobbyist  is 
rarely  used  outside  the  United  States,  and  they  are 
sometimes  referred  to  as  commercial  lobbyists).  In 
most  countries,  including  Australia,  Canada,  and 
those  in  the  EU,  these  individuals  are  less  likely  to 
contact  government  officials  directly  but  act  as 
group  advisors,  monitor  activity  affecting  the 
group,  and  act  as  facilitators  arranging  meetings 
of  group  officials  with  policymakers  (hence  the 
broader  definition  of  a  lobbyist  set  out  above).  In 
fact,  in  some  cases,  it  is  somewhat  of  a  misnomer 
to  call  these  individuals  group  “representatives”  as 
some  of  them  rarely  advocate  the  group’s  position 
directly  to  government.  Nevertheless,  these  people 
are  intimately  involved  in  the  process  of  lobbying 
as  defined  above  and  should  be  considered  as  part 
of  the  lobbyist  community  in  their  respective 
country. 

In  considering  the  types  and  role  of  lobbyists, 
the  aberrant  nature  of  the  U.S.  political  system  is 
once  again  evident.  While  lobbyists  in  the  United 
States,  particularly  Washington,  D.C.  (where  they 
are  referred  to  as  Washington  representatives), 
have  long  had  the  most  visibility  and  their  role  is 
often  viewed  as  a  benchmark,  in  several  ways,  they 
are  not  typical  of  lobbyists  around  the  world. 

Regulation  of  Lobbying 

Regarding  liberal  democracies,  while  the  most 
extensive  regulation  has  been  developed  in  the 
United  States,  most  liberal  democracies  have  enacted 
some  form  of  lobbying  and  lobbyist  regulation. 
Such  regulations  have  resulted  from  a  combination 
of  negative  public  attitudes  toward  lobbying  and 
those  who  perform  it,  a  populist  desire  to  even  up 
the  political  playing  field  against  powerful  special 
interests,  and  as  the  result  of  lobbying  scandals 
involving  corruption  and  illegality.  It  is  the  latter 
that  is  most  often  responsible  for  the  enactment  of 
regulation,  such  as  what  occurred  in  the  1990s  in 
the  United  States,  in  several  countries  in  Western 
Europe,  and  in  the  EU. 
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There  are,  however,  several  problems  with 
lobby  regulation,  and  its  results  have  been  mixed 
across  liberal  democracies.  One  is  that  regulation 
often  runs  up  against  the  right  of  free  speech  and 
the  right  to  petition  government,  as  in  the  First 
Amendment  to  the  U.S.  Constitution,  and  this 
restricts  the  extent  of  regulation.  A  second  prob¬ 
lem  is  that  the  public  and  policymakers,  often,  are 
not  clear  on  what  the  goals  of  regulation  are:  Some 
hope  to  even  up  the  political  playing  field,  while 
others  want  to  publicize  the  actions  of  lobbying 
among  other  goals.  A  third  point  is  that  regulation 
cannot  make  hitherto  weak  groups  powerful — 
only  increased  resources  and  political  acumen  can 
do  that. 

The  most  that  regulation  can  do  is  to  monitor 
the  activities  of  lobbying  organizations  and  lobby¬ 
ists  and  put  pressure  on  them  and  the  policymakers 
they  deal  with  not  to  engage  in  corrupt  activities. 
However  extensive  regulation  is  or  might  become, 
it  will  never  entirely  root  out  corruption  and  nefar¬ 
ious  activities  in  lobbying  (and  politics  in  general) 
as  long  as  the  stakes  are  so  high,  as  they  often  are 
in  many  lobbying  campaigns,  with  many  peoples’ 
livelihoods  and  futures  at  stake. 

Conclusion 

While  lobbying  has  always  existed  and  probably 
always  will,  several  recent  trends  can  be  detected 
regarding  this  most  basic  of  political  activities  and 
the  lobbyists  who  perform  it.  Three  are  particu¬ 
larly  noteworthy  and  interrelated. 

First,  since  the  rise  of  the  third  wave  of  democ¬ 
racy  in  the  mid-1980s,  an  internationalization  or 
globalization  of  lobbying  techniques  has  taken 
place.  This  is  not  to  say  that  all  the  new  tech¬ 
niques  in  the  United  States  are  appropriate  to  all 
other  political  systems,  but  these  techniques  are 
often  used  when  expedient  around  the  world. 
Second,  a  reduction  of  the  role  of  the  state  in 
many  countries,  especially  in  Western  Europe,  has 
undermined  the  neo-corporatist  approach  to  lob¬ 
bying  and  increased  the  level  of  pluralism.  The 
first  two  trends  have  produced  what  might  be 
considered  an  “Americanization”  of  lobbying 
across  advanced  liberal  democracies  and  increas¬ 
ingly  in  developing  democracies.  However,  this  is 
not  because  these  systems  are  trying  to  emulate  or 
mimic  the  U.S.  model.  It  is  due  more  to  the  U.S. 


system  exhibiting  many  characteristics  of  a  highly 
pluralistic  lobbying  system. 
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While  intending  to  give  a  comprehensive  introduc¬ 
tion  to  local  government,  the  entry  will  largely  focus 
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on  local  government  systems  in  Europe  in  its  empir¬ 
ical  references.  Such  focus  seems  justified  and 
promising  for  two  reasons.  For  one,  it  would  be 
feasible,  given  the  limited  space,  to  give  an  account 
on  local  government  in  a  global  perspective  and 
coverage.  Second,  because  of  the  structure  and  vari¬ 
ance  which  local  governments  in  European  coun¬ 
tries  exhibit  they  might  provide  insights  into  general 
issues  and  trends  which  may  well  relate  and  be 
“extrapolated”  to  local  government  developments 
in  other  regions  of  the  world.  Because  a  truly  com¬ 
prehensive  account  of  the  variations  and  trends  in 
local  governments  throughout  the  world  would 
require  far  more  space  than  is  available  here,  this 
entry  provides  an  introduction  to  local  government 
by  focusing  on  the  varieties  and  differences  among 
local  government  systems  in  Europe.  Such  a  review 
of  variations  in  local  governmental  structure  in 
European  countries  can  offer  insights  into  general 
issues  and  trends,  which  may  then  be  extrapolated 
to  developments  in  local  government  in  other 
regions  of  the  world. 

Here,  the  term  local  (self-) government,  which 
originated  and  is  used  in  the  English-speaking 
world,  will  also  be  applied  to  the  other  countries 
under  consideration.  The  reader  should  keep 
in  mind,  however,  that  the  variance  in  country- 
specific  terminology,  such  as  kommunale  Selbstver- 
waltung  (local  self-administration)  in  Germany, 
libre  administration  (free/autonomous  administra¬ 
tion)  in  France,  or  sdlvstyrelse  (self-steering)  in 
Sweden,  conveys  not  only  linguistic  but  also  the 
underlying  country-specific  conceptual  and  insti¬ 
tutional  differences. 

Intergovernmental  Setting 

A  distinction  between  decentralization  and  decon¬ 
centration  should  be  made  with  regard  to  the 
arrangement  and  distribution  of  powers,  func¬ 
tions,  and  responsibilities  in  the  intergovernmental 
setting.  Decentralization  is  an  intrinsically  political 
concept  as  it  revolves  around  the  devolution  of 
powers  and  responsibilities  from  the  upper 
government  level  to  a  subnational  level  with  dem¬ 
ocratically  elected  and  politically  accountable 
decision-making  and  administrative  bodies  of  its 
own.  By  contrast,  deconcentration  is  an  essentially 
administrative  concept  as  it  aims  at  transferring 
administrative  functions  from  an  upper  to  a  lower 
administrative  level. 


Regionalization 

Historically,  in  (unitary)  countries,  the  intergov¬ 
ernmental  architecture  comprised  two  levels — the 
central  government  and  the  local  government — 
with  the  exception  of  federal  states  where,  histori¬ 
cally,  an  intermediate/regional  governmental  level 
has  been  in  place — in  the  German  case,  in  fact, 
preceding  the  creation  of  the  national  state. 

In  recent  years,  in  some  hitherto  unitary 
European  countries,  particularly  the  larger  ones, 
the  intergovernmental  setting  has  been  shaped  by 
the  formation  of  regions  on  the  intermediary 
level — placed  between  the  central  government  and 
the  existing  local  government  levels.  Among  conti¬ 
nental  European  countries,  the  hitherto  unitary 
(Napoleonic)  states  have  shown  remarkable  vari¬ 
ance  in  the  degree  of  regionalization.  The  factually 
most  advanced  case  is  Spain  where,  when  demo¬ 
cratic  government  was  reestablished  after  1978,  the 
regions  (comunidades  autonomas)  were  created 
with  significant  legislative  powers  and  fiscal 
resources  of  their  own.  In  Italy,  the  regions  were 
given  legislative  and  operative  competences  to  a 
degree  that  has  been  termed  quasi- federal.  By  con¬ 
trast,  in  France,  where  as  an  element  of  decentral¬ 
ization  in  1982  regions  were  introduced,  it  was 
decided  to  keep  them  at  a  simple,  local  government 
status  (as  a  third  level  of  collectivites  locales). 

The  United  Kingdom  (UK),  too,  abandoned  its 
path-dependent  unitary  trajectory  in  that,  in  1998 
and  1999,  Scotland  and  Wales  gained  regional  sta¬ 
tus  (with  elected  regional  assemblies  of  their  own). 
Currently,  however,  the  UK  has  also  entered  the 
“road  towards  quasi-federalism”  (David  Wilson  & 
Chris  Game,  2006),  which  has  remained  “asym¬ 
metrical,”  though,  as  with  England  (which  has  85% 
of  the  UK  population),  remaining  (highly)  central¬ 
ized. 

Whereas  regionalization,  particularly  in  its 
quasi-federal  nuances,  has  strengthened  the  demo¬ 
cratic  as  well  as  the  operative  potential  of  the  inter¬ 
mediate/regional  level,  its  implications  for  the  local 
government  levels  have  been  somewhat  problem¬ 
atic.  The  politically  and  functionally  empowered 
regional  level,  while  proclaiming  decentralism  vis- 
a-vis  the  central  government  level,  may  be  disposed 
to  take  a  centralist  posture  in  relation  to  the  local 
government  levels.  The  somewhat  hierarchical 
influence,  which  in  Germany’s  federal  system  the 
regional  states  (Lander)  tend  to  exercise  over  the 
local  level,  hints  at  a  paradox  of  decentralization. 
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Similarly  Spain’s  regions  (comunidates  autono¬ 
mas)  have  exhibited  some  dominant  stand  vis-a-vis 
the  country’s  local  level. 

Local  Government  Level 

In  most  countries,  the  local  government  levels 
are  historically  made  up  of  two  tiers,  called,  for 
instance,  counties  and  boroughs  or  districts  in  the 
UK,  Kreise  and  Gemeinden/Stadte  in  Germany, 
departements  and  communes  in  France,  and  land- 
sting  kommuner  and  kommuner  in  Sweden.  In  the 
following,  the  terms  counties  and  municipalities 
will  be  generally  applied. 

In  some  countries  (single  tier)  local  authorities 
have  been  formed,  which  combine  municipal  and 
county  responsibilities.  The  German-Austrian  local 
government  tradition  has  long  since  known  such 
single-tier  local  authorities  (called  county  free  cit¬ 
ies,  kreisfreie  Stadte)  as  the  organizational  base 
particularly  of  larger  cities.  Similarly,  in  the  English 
local  government  tradition,  the  scheme  of  single¬ 
tier  county  boroughs  was  in  place  until  1972  and 
was  resumed,  under  the  new  label  unitary  authori¬ 
ties,  particularly  since  the  1990s;  by  now,  in  most 
urban  areas,  including  the  major  cities,  single-tier 
unitaries  have  been  formed.  In  central  Eastern 
European  countries  also,  such  as  in  Poland  and 
Hungary,  the  concept  of  single-tier  local  authorities 
has  been  put  to  work. 

Intercommunal  Bodies 

In  countries  in  which,  in  the  absence  of  territo¬ 
rial  reforms,  the  territorial  structure  is  marked  by 
a  multitude  of  small-scale  municipalities,  an  addi¬ 
tional  layer  of  intercommunal  bodies  has  been  cre¬ 
ated  or  has  come  into  existence,  which  are  meant 
to  provide  the  institutional  frame  and  encourage¬ 
ment  for  intercommunal  cooperation. 

Territorial  Organization 

The  European  countries  show  a  conspicuous  vari¬ 
ance  in  the  average  size  of  their  municipalities.  On 
one  end  of  the  continuum,  there  is  a  group  of  coun¬ 
tries  with  municipalities  with  populations  averag¬ 
ing  more  than  30,000,  such  as  the  UK  (with  an 
average  size  of  139,000  inhabitants),  Denmark 
(with  55,000),  and  Sweden  (with  31,000).  At  the 
other  end,  there  are  countries  with  municipalities 


having  average  populations  of  less  than  10,000, 
particularly  France  (1,720),  Hungary  (3,170),  Spain 
(5,400),  and  Italy  (7,200)  (see  Dexia,  2008,  p.  41). 

Territorial  Reforms 

The  current  territorial  structure  of  municipali¬ 
ties  largely  depends  on  whether  the  countries  have, 
in  the  past,  carried  out  territorial  reform  and  on 
the  underlying  political  and  cultural  factors  that 
shaped  the  decision  to  carry  out  or  not  to  carry  out 
territorial  reforms. 

In  the  first  group  of  countries,  particularly  during 
the  1960s  and  1970s,  territorial  reforms  were 
guided  and  driven  by  the  goal,  typical  of  the  (ratio¬ 
nalist)  Zeitgeist  of  that  period,  of  massively  redraw¬ 
ing  the  historically  small-size  boundaries  of  the 
municipalities  in  order  to  modernize  them  and 
make  them  administratively  more  amenable  and 
operationally  more  effective  in  conducting  the  mul¬ 
tiple  tasks  conveyed  on  the  local  government  level 
by  the  (then  expansive  Welfare)  State.  Labeled  by 
Alan  Norton  (1994)  as  the  “North  European  pat¬ 
tern,”  in  view  of  the  countries  in  question,  this 
reform  strategy  was  marked  by  the  political  deter¬ 
mination  of  the  governments  concerned  to  carry  out 
the  reforms,  possibly  against  the  will  of  and  in  the 
face  of  protests  by  the  local  population,  using  coer¬ 
cive  parliamentary  legislation  as  a  last  resort. 
According  to  John  Stewart  (2000),  particularly  in 
England,  the  scale  of  amalgamation  has  been  criti¬ 
cized  as  being  oversized  (“sizeism”),  fostering  polit¬ 
ical  alienation  of  local  citizens  (as  shown  by  the  low 
voter  turnout). 

In  other  countries  (with  small  municipalities), 
no  territorial  reforms  have  occurred.  In  France  and 
Italy,  in  the  early  1970s,  the  national  governments 
attempted,  in  line  with  the  Zeitgeist  of  the  era,  to 
territorially  reform  the  small  municipalities  (in  the 
French  case  averaging  1,700  inhabitants).  Yet  these 
reform  moves  almost  entirely  failed  because  the 
governments,  adhering  to  the  country’s  political 
culture  value  of  “voluntariness,”  made  amalgama¬ 
tion  contingent  on  the  approval  of  the  local  popu¬ 
lation,  and  such  a  local  consensus  was  not  reached. 

In  Central  East  European  countries  (and  simi¬ 
larly  in  East  German  Lander),  after  the  downfall 
of  the  Communist  regimes,  most  postsocialist 
governments  decided  to  do  without  amalgamation 
of  the  small  municipalities  in  order  not  to  impair 
the  newly  created  (small)  local  democracies  (in 
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Hungary,  the  number  of  municipalities  even 
jumped  after  1990,  from  1,600  to  3,170). 

Intercommunal  Bodies  (Inter-Communalite) 

In  countries  in  which  territorial  reforms  of 
the  municipal  level  did  not  come  to  pass,  different 
strategies  and  approaches  were  followed  to  institu¬ 
tionally  encourage  and  enable  the  multitude  of 
small-scale  municipalities  to  engage  in  intercommu¬ 
nal  cooperation,  for  instance,  in  the  provision  of 
services  for  the  local  population.  Against  the  back¬ 
ground  of  the  very  small  size  and  very  large  number 
of  municipalities  (communes),  France,  not  surpris¬ 
ingly,  was  the  first  and  exemplary  country  to  create 
the  legal  framework — the  first  as  early  as  1890 — 
for  a  great  number  of  such  intercommunal  bodies, 
called  inter- communalite,  at  first  in  the  form  of 
syndicats ,  then,  since  the  1960s,  in  the  form  of  com¬ 
munal  unions  ( communautes ),  with  the  most 
important  ones  being  the  communautes  urbaines 
(in  the  meantime)  in  16  metropolitan  areas.  As  a 
crucial  institutional  innovation,  the  communal 
unions  have  been  provided  with  a  taxing  power  of 
their  own  (fiscalite  propre ).  In  line  with  the  tradi¬ 
tional  principle  of  voluntariness  (voluntariat),  most 
of  these  intercommunal  bodies  have  been  formed 
on  a  voluntary  basis.  In  other  countries  (without 
territorial  reforms),  similar  institutional  develop¬ 
ments  have  got  under  way,  for  instance,  in  Italy 
(with  the  formation  of  consorzi,  in  part  by  binding 
legislation)  and  in  some  German  Lander  (with  the 
establishment  of  Verwaltungsgemeinschaften 
[administrative  unions],  formed  also,  last  resort,  by 
binding  legislation).  Most  recently,  a  new  round  of 
territorial  consolidation  has  gained  momentum. 
Further,  on  the  one  hand,  quite  massive  territorial 
amalgamation  strategies  have  been  inaugurated, 
such  as  in  Denmark  (2007)  and  Lithuania,  both 
arriving  at  municipalities  averaging  55,000  inhabit¬ 
ants.  On  the  other  hand,  political  initiatives  have 
been  undertaken  to  further  consolidate  the  inter¬ 
communal  networks. 

Political  Institutions 

Local  Democracy 

Local  self-government  hinges  on  the  idea  and 
imperative  that  the  local  citizens  govern  themselves 
in  all  matters  relevant  to  their  local  community.  In 


its  purest  form,  local  self-government  is  realized 
through  institutions  of  direct  democracy  when  the 
local  citizens  meet  directly  to  make  the  relevant 
decisions.  In  Europe,  the  mother  country  of  direct 
local  democracy  still  is  Switzerland,  where  in  some 
cantons,  and  even  in  major  cities,  citizens  meet 
periodically  to  make  relevant  decisions,  including 
those  on  local  taxation.  For  the  rest,  in  all  other 
European  countries,  the  institutions  and  proce¬ 
dures  of  representative  democracy  prevail,  accord¬ 
ing  to  which  the  key  political  right  of  citizens  is  to 
elect  the  local  councilors,  while  the  elected  local 
council  is  the  supreme  and  sole  local  decision¬ 
making  body. 

Political  parties  made  their  entry  into  local 
politics  quite  late,  when  national  parties  discov¬ 
ered  the  local  level  as  a  relevant  political  arena  to 
mobilize  political  support  and  to  recruit  political 
leaders.  Recently,  however,  as  noted  by  Marion 
Reiser  and  Everhard  Holtmann  (2008),  there  are 
indications  of  a  “farewell  to  the  party  model”  in 
local  politics. 

In  recent  years,  the  dominance  of  representative 
local  democracy  and  preponderance  of  the  elected 
local  council  as  the  prime  local  decision  maker  has 
been  challenged  as,  since  the  1990s,  in  a  number  of 
countries  (such  as  Germany  and  Italy),  binding 
local  referendums  have  been  introduced  as  a  com¬ 
plement  and  corrective  to  the  elected  local  coun¬ 
cils.  Significant  impulses  came  from  Central 
Eastern  European  countries  (including  East 
German  Lander)  when,  after  the  collapse  of  the 
communist  systems,  the  introduction  of  direct 
democratic  procedures  were  seen  as  a  crucial  step 
to  move  toward  reestablishing  and  reinforcing 
democratic  systems. 

Local  Political  Systems 

Among  the  local  institutional  arrangements  in 
European  countries,  two  systems  can  historically 
be  distinguished.  For  one,  essentially  in  the  UK  and 
in  Scandinavian  countries,  monistic  local  govern¬ 
ment  systems  have  developed  in  which  decision 
making  (as  well  the  direction  of  local  administra¬ 
tion)  is  ideally  the  collective  responsibility  of  the 
elected  council,  which,  in  turn,  has  delegated  this 
monistic  responsibility  to  sector  committees  (gov¬ 
ernment  by  committees).  From  this  followed  the 
notion  that  the  executive  function  (which  combines 
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decision  making  and  executive  functions)  should  be 
entrusted  to  (sectoral)  committees  formed  by  the 
(plenary)  council  (government  by  committee). 

This  monistic  government  by  committee  system 
is  contrasted  with  the  dualistic  local  government 
system,  which  has  emerged  in  Continental  European 
countries.  It  is  premised  on  the  (dualistic)  distinc¬ 
tion  made  between  the  local  council  as  the  prime 
local  (local  parliament  type)  decision-making  body, 
on  the  one  hand,  and  as  an  executive  body  with,  as 
a  rule,  a  mayor  elected  by  the  council,  on  the  other 
hand,  in  a  division-of-function  scheme  reminiscent 
of  a  (local)  parliamentary  system.  In  most  coun¬ 
tries,  the  mayor  was  elected  by  the  council.  Both 
local  government  “families”  have  seen  significant 
institutional  changes  in  recent  years,  which  were 
triggered  by  mounting  criticism. 

In  the  UK,  the  traditional  government-by- 
committee  system  was  attacked  for  lacking  clearly 
identifiable  accountability  and  for  fostering  policy 
fragmentation.  The  Local  Government  Act  of  2000 
provided  for  a  reform  that  was  undertaken  in 
England  and  is  the  option  chosen  by  most  councils. 
It  amounts  to  a  kind  of  “parliamentarization”  of 
the  local  government  system,  in  that  most  of  the 
decision-making  and  executive  powers  are  trans¬ 
ferred  to  one  of  the  committees  (the  “executive 
committee”  with  “executive  councilors”  as  the 
local  “parliamentary  cabinet”);  there  is  a  (council 
elected)  leader  of  the  committee  who  serves  as  a 
kind  of  local  “prime  minister,”  while  the  plenary  of 
the  council  and  its  councilors  are  assigned  a  scruti¬ 
nizing  function.  Sweden,  too,  has  moved  toward  a 
quasi  parliamentarization  of  the  local  government 
system,  stopping  short,  however,  of  abandoning 
the  collective  responsibility  of  sector  committees. 

In  continental  European  countries,  the  existing 
dualistic  system,  with  a  council-elected  executive 
mayor,  was  chiefly  criticized  for  constraining  the 
emergence  of  a  local  leadership  and  also  because 
the  mayor  lacked  democratic  accountability.  Since 
the  1990s,  many  European  countries  (German 
Lander,  Italy,  central  Eastern  European  countries) 
have  moved  toward  the  direct  election  of  the 
mayor,  which  is  reminiscent  of  a  local  presidential 
system.  To  put  a  political  check  on  the  (possibly 
domineering)  mayor,  in  most  German  Lander,  a 
procedure  to  recall  the  sitting,  directly  elected 
mayor  by  way  of  local  referendum  has  been 
installed. 


Functions 

The  local  government  levels  have  historically  taken 
on  an  ever-broader  multifunctional  profile  as  local 
authorities,  responding  to  mounting  social  and 
infrastructural  needs,  assumed  responsibility  for 
social  services  and  public  utilities  (water,  sewage, 
energy,  etc.)  in  what  conservatives  sneered  at  as 
municipal  socialism  and  which  in  fact  amounted  to 
a  local  embryonic  version  of  the  emergent  welfare 
state.  With  the  advances  of  national  welfare  states, 
which  climaxed  after  1945  well  into  the  1970s, 
local  government  levels  were  increasingly  put  in 
charge  by  central  governments  to  implement 
national  welfare  state  and  interventionist  policies. 

In  all  countries,  the  local  government  levels 
have  been  responsible  for  the  provision  of  social 
services,  urban  planning,  and  for  the  provision  of 
utilities.  Moreover,  the  concern  for  cultural  and 
recreational  matters  ranked  high  on  the  local  gov¬ 
ernment  agenda. 

The  most  important  intercountry  functional 
variations  are  related  to  education  and  health  ser¬ 
vices.  While,  for  instance,  in  Sweden  and  England, 
the  running  of  (primary  and  secondary)  schools 
falls  under  the  operational  and  financial  responsi¬ 
bility  of  the  local  government  levels,  in  continental 
European  countries,  education,  by  tradition,  is 
firmly  a  state  matter.  In  some  countries,  the  local 
government  levels  (in  Scandinavian  countries)  or 
the  regions  (in  Italy)  are  operationally  and  finan¬ 
cially  involved  in  the  public  health  system.  Recently, 
in  reaction  to  neoliberal  (lean  state)  and  marketi- 
zation  demands  as  well  under  budgetary  pressure, 
the  traditional  public  sector  model  and  with  it  the 
multifunctional  municipal  sector  profile  have  expe¬ 
rienced  significant  retrenchment  and  cutbacks 
both  in  functions  and  in  personnel,  thus  putting 
the  traditional  local  government  model  at  stake 
(see  below). 

The  significant,  in  part  preponderant,  func¬ 
tional  weight  that  the  subnational  levels,  particu¬ 
larly  the  local  government  levels,  have  so  far 
acquired  in  the  respective  countries  is  indicated  by 
the  high  proportion  of  personnel  at  these  levels 
compared  with  the  total  number  of  public  sector 
personnel. 

Among  the  unitary  countries,  the  list  is  topped 
by  Scandinavian  countries,  with  local  government 
personnel  constituting  up  to  83%  (in  the  case  of 
Sweden)  of  the  total  number  of  public  sector 
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employees  (see  Dexia,  2008,  p.  64),  and  also  by 
some  Central  East  European  countries  (such  as 
Elungary  with  65%)  and  by  the  UK  (with  56%). 
The  percentage  of  state  personnel  is  correspond¬ 
ingly  small  (e.g.,  17%  in  Sweden).  While  in  France 
the  percentage  of  local  government  personnel  has 
expanded  (to  30%)  since  the  beginning  of  decen¬ 
tralization  in  1982,  the  share  of  state  personnel 
continues  to  be  surprisingly  strong  (some  50% 
with  another  20%  in  public  hospitals).  In  Italy,  the 
central  state  continues,  despite  the  decentralization 
since  the  1990s,  to  employ  58%  of  the  total  public 
sector.  Thus,  notwithstanding  decentralization  in 
these  two  countries,  the  central  state,  hinting  at 
some  path-dependent  continuity  of  the  Napoleonic 
state  tradition,  continues  to  be  organizationally 
present  at  the  subnational  levels 

The  picture  in  federal  or  quasi-federal  countries 
is  somewhat  more  complex.  While  in  Germany  the 
portion  of  federal  personnel  is  just  12%  and  in 
Spain  23%  and  the  rest  are  employed  by  the  subna¬ 
tional  levels,  the  lion’s  share  of  public  sector  per¬ 
sonnel  is  employed  at  the  federal  (53%  in  Germany) 
or  the  quasi-federal/regional  levels  (50%  in  Spain), 
with  the  local  levels  also  showing  considerable  per¬ 
sonnel  strength  (e.g.,  with  30%  in  Germany). 

Local  Finances 

The  status  and  standing  of  the  local  government  in 
the  intergovernmental  setting  essentially  depends, 
of  course,  on  the  degree  of  its  financial  and  budget¬ 
ary  autonomy.  A  valid  indicator  of  this  could  be  the 
degree  to  which  local  authorities,  in  order  to  cover 
their  expenditures,  may  draw  on  local  taxes  of  their 
own  as  opposed  to  relying  on  grants  assigned  to 
them  at  the  state  level.  Historically,  the  local  gov¬ 
ernments  financed  their  spending  almost  entirely 
from  local  taxes,  the  “rate”  levied  by  English  local 
authorities  being  a  classical  example.  Signaling  the 
current  fiscal  dependence  of  local  authorities  is  the 
fall  in  percentage  of  “own”  tax  revenues  as  com¬ 
pared  with  the  entire  local  revenues  in  most  coun¬ 
tries.  Sweden  is  a  lone  exception,  in  that  67%  of 
their  local  revenues  still  comes  from  the  local 
(income)  tax;  France  and  Denmark  are  also  remark¬ 
ably  close,  with  49%.  By  contrast,  in  most  other 
countries,  the  self-financing  local  tax  margin  is  less 
than  20%  (see  the  table  in  Dexia,  2008,  p.  97). 
Correspondingly,  the  share  of  government  grants 


(which  can  quite  easily  be  changed  and  manipu¬ 
lated  and  could  also  come  with  strings  attached) 
from  the  central  government  has  conspicuously 
risen,  standing,  for  instance,  at  49%  in  the  UK  and 
47%  in  Italy  and  Poland. 

Local  Organizational  Structures 

Local  Administration:  Organization 
and  Personnel 

Historically,  in  preindustrial  times  and  in  rural 
contexts,  local  matters  were,  as  a  rule,  attended  out 
by  “laymen,”  that  is,  by  the  local  citizens  at  large 
in  what  literally  was  local  self-administration.  The 
layman  practice  in  local  administration  was  pur¬ 
sued,  for  instance,  in  Sweden  well  into  the  early 
20th  century  and  still  exists  in  Switzerland  in  cer¬ 
tain  forms. 

However,  in  countries  that  underwent  early 
industrialization  and  urbanization,  such  as  in  the 
UK,  an  industrial  front-runner,  and  somewhat 
later  in  Germany,  the  local  authorities  built  up 
regular  administrative  structures  with  profession¬ 
alized  staff.  In  continental  European  countries, 
within  a  state  tradition  geared  to  legal,  rule-bound 
hierarchical  administration  (often  identified  as  the 
Max  Weber  bureaucracy  model),  local  administra¬ 
tion  also  showed  a  Weberian  stance.  Reflecting  the 
advanced  welfare  state  and  its  public  sector- 
centered  implementation  model  in  some  countries, 
such  as  in  the  UK  and  in  Sweden,  social  services 
came  to  be  almost  entirely  rendered  (in-house)  by 
public — that  is,  local  government — personnel.  In 
some  countries,  for  instance,  in  Germany  and 
Italy,  traditionally  following  a  subsidiarity  princi¬ 
ple,  social  services  continued  to  be  provided  largely 
by  nonpublic,  nonprofit  organizations. 

Spearheaded  by  new  public  management  (NPM) 
concepts,  the  Weberian  model  of  legal,  rule-bound 
hierarchical  public  administration  was  criticized 
for  its  inherent  inflexibility  and  its  neglect  of  eco¬ 
nomic  efficiency  and  was  sought  to  be  replaced 
with  managerialist  concepts  and  instruments  that, 
borrowed  from  the  private  business  sector,  aimed 
at  making  municipal  administration  and  its  person¬ 
nel  more  flexible  and  more  cost  conscious.  The 
impact  that  the  NPM  message  had  on  the  adminis¬ 
trative  world  varied  from  country  to  country, 
depending  on  country-specific  cultural  and  institu¬ 
tional  conditions  and  traditions.  It  was  most 


Local  Government  1469 


noticeable  in  English-speaking  countries,  which,  as 
in  their  Common  Law  tradition  a  legally  defined 
distinction  between  the  public  and  the  private 
sphere  is  not  made,  appeared  more  receptive  to  the 
private  sector-derived  principles.  By  contrast,  NPM 
had  a  more  difficult  access  in  most  continental 
European  countries,  in  which,  against  the  back¬ 
drop  of  their  Roman  Law  and  Rule  of  Law 
(Rechtsstaat)  traditions,  the  traditional  administra¬ 
tive  model  was  culturally  more  firmly  entrenched. 
In  retaining  elements  of  the  traditional  model  and 
in — at  the  same  time — adopting  and  “translating” 
NPM  concepts,  these  countries  have,  in  their 
administrative  model,  not  least  in  local  administra¬ 
tion,  moved  toward  what  has  been  called  a  neo- 
Weberian  model. 

Under  the  combined  onslaught  of  (neoliberal) 
welfare  state  critique  and  budgetary  squeeze,  local 
governments  in  most  countries  have  resorted  to 
making  deep  cuts  in  their  personnel  over  the  past 
15  years.  Perhaps  the  most  conspicuous  case  is 
Germany  where,  between  1991  and  2004,  the  total 
local  government  staff  was  cut  by  30%  (in  East 
German  Lander,  it  was  even  higher  at  53%)  and  in 
the  UK  by  5  %.  By  contrast,  in  Lrance,  the  local 
government  staff  increased  by  24%,  obviously  in 
the  wake  of  decentralization  since  1982. 

Mounting  Interorganizational  Pluralization 
of  Single-Purpose  Actors 

In  the  (horizontal)  interorganizational  setting, 
the  traditional  multifunctional  leading  position  of 
local  government  in  the  local  arena  has  been  chal¬ 
lenged  through  a  number  of  powerful  currents, 
particularly  through  the  neoliberal  policy  message, 
through  the  NPM  message  (both  becoming  ram¬ 
pant  during  the  1980s),  and  increasingly  (since  the 
1990s)  through  the  market  liberalization  drive  of 
the  European  Union. 

Lirst,  inasmuch  as  the  previously  dominant  con¬ 
ception  of  local  government  as  the  public  sector/ 
municipal  sector-centered  providers  of  public  ser¬ 
vices  was  challenged  and  shattered,  the  local  author¬ 
ities  proceeded  to  “outsource”  the  conduct  of  local 
government  activities  and  the  provision  of  public 
services  to  outside  providers.  While  outsourcing 
was  not  an  entirely  new  concept  in  local  govern¬ 
ment  practice,  it  gained  momentum  when,  in  the 
1980s,  compulsory  competitive  tendering  became 


the  battle  cry  of  Britain’s  Conservative  Government 
under  Margaret  Thatcher  and  almost  irresistibly 
spilled  over  into  the  modernization  agendas  in  other 
countries.  The  provision  of  social  services  has  sub¬ 
sequently  seen  a  pluralization  of  providers — public/ 
municipal  or  semipublic,  nonpublic,  nonprofit,  or, 
increasingly,  private-commercial. 

The  other  important  field  was  the  provision  of 
public  utilities,  which,  in  some  countries,  has  tra¬ 
ditionally  been  the  responsibility  of  local  govern¬ 
ment  and  has  often  been  carried  out  by  them  in  an 
organizational  in-house  form.  Under  the  market 
liberalization  pressure,  the  local  authorities  have 
followed  two  options,  which  are  described  below. 

Lirst,  they  turned  these  activities,  in  what  has 
been  called  formal  (or  organizational)  privatiza¬ 
tion,  over  to  newly  created,  still  municipally  owned, 
but  organizationally  and  financially  self-standing 
corporations.  In  some  cases,  such  corporatization 
has  been  extended  to  a  broad  scope  of  local  func¬ 
tions,  sometimes  with  the  intention  of  tailoring  the 
entire  administration  to  a  private  sector-derived 
holding  (Konzern)  scheme.  Second,  often  beset  by 
budgetary  needs,  the  municipalities  embarked  on 
substantive  (or  asset)  privatization  by  selling  their 
local  facilities  (such  as  local  energy  or  water  com¬ 
panies)  to  outside  providers,  mostly  of  the  large 
national  or  international  corporation  kind. 

In  sum,  in  the  (horizontal)  interorganizational 
setting  of  the  local  arena,  public  tasks,  which,  in 
the  past,  were  attended  by  local  government  in- 
house  or  at  arm’s  length,  have  increasingly  been 
taken  over  by,  or  outsourced  to,  local-level  actors 
that  operate  outside  the  immediate  realm  and 
direct  influence  of  local  government  in  the  local 
arena.  They  constitute  the  kind  of  actor  networks 
that,  in  the  currently  dominant  social  science  ter¬ 
minology,  have  been  identified  as  governance.  On 
the  one  hand,  these  local  governance  actors  can  be 
expected  to  bring  their  specific — financial,  innova¬ 
tive,  entrepreneurial,  and  so  on — resources  and 
skills  to  bear  in  the  local  arena.  As  they  are  typi¬ 
cally  single-purpose  and  specific-interest  actors — 
that  characteristically,  first  of  all,  seek  to  fulfill 
their  own  organizational  goals  and  benefits  possi¬ 
bly  to  the  detriment  and  at  the  expense  of  other 
actors  and  their  rivaling  interests — their  basically 
“private-regarding”  action  orientation  is  bound  to 
pose  a  challenge  to  the  role  and  mandate  of  the 
elected  councils  to  be  (ideally)  the  advocates  and 
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guardians  called  on  to  ensure  the  “public-regarding” 
view  and  the  best  interests  of  the  local  community. 
To  exercise  its  advocacy  role  and  to  coordinate  the 
complicated  interactions,  the  local  authorities  are 
called  on  to  become  active  players  in  the  local  gov¬ 
ernance  networks.  It  may  well  be  that  the  reforms  in 
local  leadership  could  put  them  in  the  position  of 
acting  as  key  networker  (reticulist)  in  the  field.  In 
conclusion,  local  government  and  local  governance 
should  not  be  seen  as  mutually  exclusive  but  as 
complementary. 

Hellmut  Wollmann 
Humboldt-Universitat  Berlin 
Berlin,  Germany 

See  also  Decentralization;  Governance,  Multilevel; 
Government;  Organization  Theory;  Privatization; 
Regionalization 
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Locke,  John  (1632-1704) 


As  the  father  of  modern  empiricist  epistemology, 
critic  of  innate  ideas,  and  theorist  of  a  conception 
of  personal  identity  still  very  much  in  vogue  in 
contemporary  philosophy,  Locke  brings  key  con¬ 
cepts  of  constitutionalism  and  toleration  to  the 
modern  political  lexicon:  individual  rights  and 
freedoms,  the  rule  of  law,  the  separation  of  pow¬ 
ers  and  the  division  between  private  and  public. 
Political  power  must  justify  its  acts  and  choices 
and  must  practice  the  virtue  of  nonintervention  in 
those  areas  of  social  life  that  possess  some  sort  of 
internal  and  autonomous  normativity,  areas  where 
the  law  of  nature  does  not  require  the  support  of 
positive  law.  At  the  heart  of  Locke’s  political 
theory  lies  a  contractualist  political  model  that 
opposes  both  the  divine  right  of  kings  and  the 
absolutist  ramifications  of  Thomas  Hobbes’s  con- 
tractualism — a  model  that  reconciles  tradition  and 
modernity  and  addresses  both  secular  and  reli¬ 
gious  concerns. 

Between  1689  and  1690,  as  a  troubled  period  in 
English  history  came  to  a  close  with  the  Glorious 
Revolution  and  the  ascent  of  William  of  Orange  to 
the  throne,  Locke  published  An  Essay  Concerning 
Human  Understanding ,  the  Epistola  de  tolerantia, 
and  the  Two  Treatises  of  Government.  In  many 
respects,  these  three  works  represent  an  intellec¬ 
tual  program  begun  nearly  30  years  earlier,  in 
his  juvenile  manuscripts  on  the  law  of  nature — a 
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program  that  focused  on  the  limits  of  human 
understanding,  the  foundations  of  political  obliga¬ 
tion,  and  the  use  of  reason  in  discovering  the  prin¬ 
ciples  of  natural  law.  Belief  in  past  habits  and  tra¬ 
ditions  or  in  opinions  supported  by  custom  and 
convention  may  lead  a  person  into  grave  error. 
Both  ancient  historians  and  the  authors  of  modern 
travel  diaries  have  attested  to  the  great  variety  of 
human  moral  convictions  and  the  impossibility  of 
reducing  them  to  a  single,  coherent  system.  Neither 
history,  nor  innate  principles,  nor  a  general  human 
consensus  can  help  people  determine  universal 
rules  and  principles.  The  only  trustworthy  tool  in 
this  endeavor  is  one’s  own  reason,  by  which  one 
can  discover  the  order  that  derives  from  God’s 
supreme  will  and  transcend  the  multiplicity  of  tra¬ 
ditions  and  opinions. 

Locke  places  himself  within  the  modern  indi¬ 
vidualist  tradition.  He  needs  individualism  and 
equality  to  challenge  the  hierarchical  and  patriar¬ 
chal  structure  of  feudal  society  but,  unlike  Hobbes, 
he  also  relates  to  them  as  fundamental  premises  for 
the  creation  of  a  lawful  political  power,  rooted  in 
consensus.  Hobbes’s  atomistic  and  competitive 
individualism  make  it  impossible  to  imagine  a 
social  and  political  order  that  is  not  the  product  of 
artifice,  in  the  absence  of  which  there  is  only 
chaos,  anarchy,  and  helium  omnium  contra  omnes 
(war  of  all  against  all),  a  precarious  and  dreary 
condition  in  which  there  is  neither  mine  nor  thine, 
justice  nor  injustice.  According  to  Hobbes,  only 
the  absolute,  unlimited,  and  illimitable  positive 
law  of  a  sovereign  can  end  the  state  of  war  between 
men  and  guarantee  order,  peace,  and  property. 
Conversely,  the  author  of  the  Second  Treatise  of 
Government  distinguishes,  at  the  very  outset,  the 
state  of  perfect  freedom  of  man  in  the  state  of 
nature  from  license  and  the  state  of  nature  from 
the  state  of  war.  In  the  state  of  nature,  people  are 
not  unrelated  atoms  moved  by  instrumental  and 
calculating  reason  but  moral  persons  who  seek 
relationships  of  mutual  recognition  and  harbor 
feelings  of  benevolence  and  mutual  cooperation. 
Moral  norms  exist  even  in  the  absence  of  positive 
laws,  as  a  result  of  natural  human  sociability  and 
a  sense  of  kinship — as  members  of  the  same 
species — and  finally,  as  a  result  of  the  theological 
premise  that  all  men  are  equal  because  they  are  all 
God’s  workmanship,  all  His  creatures,  and  all 
equally  dependent  on  their  common  Creator,  and 


called  on  to  participate  in  his  divine  plan.  This 
moral  equality  forbids  their  using  one  another  in 
an  instrumental  fashion.  Moreover,  inasmuch  as 
he  is  God’s  workmanship,  man  cannot  take  his 
own  life  or  that  of  another  but  must  strive  for  his 
survival  and  the  survival  of  the  entire  species. 

In  the  First  Treatise  of  Government ,  in  response 
to  Filmer’s  Patriarcha  (published  posthumously  in 
1680,  during  the  exclusion  crisis),  Locke  denies 
that  God’s  will,  as  expressed  in  the  book  of 
Genesis,  mandates  the  reduction  of  men  to  a  con¬ 
dition  of  natural  slavery  and  subjection.  God  did 
not  make  Adam  master  of  all  men  and  sole  owner 
of  the  Earth.  He  gave  the  Earth  to  Adam  and  Eve 
and,  by  extension,  to  all  men,  that  they  might  mul¬ 
tiply  and  improve  the  conditions  of  their  lives.  The 
necessity  to  improve  the  conditions  of  one’s  life 
and  ensure  one’s  survival  justifies  appropriation. 
God  gave  the  world  to  all  men  in  common,  but 
they  would  die  of  starvation  were  each  to  await 
the  consensus  of  the  rest  of  humanity  before  con¬ 
suming  the  fruits  of  nature.  Property  does  not 
derive  from  consensus  but  from  labor.  Through 
their  labor,  men  transform  and  improve  nature, 
creating  wealth.  The  law  of  nature  does  not  allow 
natural  resources  to  be  used  in  a  profligate  man¬ 
ner.  The  inequalities  that  derive  from  the  existence 
of  private  property  and  the  accumulation  of 
wealth,  made  possible  by  the  introduction  of 
money,  are  justified  inasmuch  as  they  constitute  a 
general  improvement  in  the  human  condition.  If 
justice  gives  every  man  title  to  the  product  of  his 
labor,  charity  gives  one  who  is  unable  to  support 
himself  the  right  to  assistance:  The  poor  man  is 
entitled  to  the  surplusage  of  the  rich. 

The  statement  that  every  man  is  born  free, 
equal,  and  independent — that  is,  capable  of  choos¬ 
ing  his  own  destiny — entails  many  important  con¬ 
sequences  from  a  political  perspective.  Every  man 
is  responsible  for  his  own  actions:  The  sins  of  the 
fathers  cannot  be  visited  on  the  sons,  no  paternal 
act  can  deprive  the  son  of  life  and  liberty,  and  no 
pact  undertaken  by  the  father  can  be  considered 
binding  for  future  generations.  Natural  law  and 
the  moral  equality  of  men  make  consent  the  only 
possible  basis  for  lawful  interaction  between 
equals.  The  relationship  between  husband  and 
wife  is  determined  by  a  pact,  which  recognizes  the 
legitimacy  of  divorce  and  even  a  woman’s  right  to 
negotiate  the  terms  of  custody  of  her  children  and 
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other,  similar  issues.  Voluntary  pacts  govern  inter¬ 
action  between  religious  associations  and  their 
believers,  between  master  and  servant,  and  between 
magistrate  and  citizens.  The  political  compact, 
however,  unlike  all  other  pacts,  is  irrevocable: 
When  a  citizen  has  given  his  express  consent  to  be 
part  of  the  political  community,  he  can  no  longer 
abandon  it;  he  has  no  right  to  emigrate  and  join 
another  state. 

In  the  state  of  nature,  every  man  possesses  the 
executive  power  of  the  law  of  nature,  which 
affords  him  the  power  to  judge  and  punish,  in  a 
commensurate  fashion,  those  who  break  the  law  of 
nature.  One  who  infringes  on  the  life,  liberty,  or 
property  of  another  thereby  places  himself  in  a 
state  of  war  with  the  rest  of  humanity.  Justice  in 
the  state  of  nature  fails  only  as  a  result  of  men’s 
passions,  which  may  lead  to  excessive  partiality  in 
the  application  of  sanctions.  That  is  why  men 
decide  to  abandon  the  state  of  nature  and  form  a 
political  community.  In  adhering  to  such  a  com¬ 
pact,  they  surrender  to  the  state  that  executive 
power  granted  to  them  by  the  law  of  nature.  The 
state’s  monopoly  over  lawful  violence  is  thus  con¬ 
strained  to  respect  the  limits  within  which  it  was 
legitimately  exercised  by  every  individual  in  the 
state  of  nature.  The  ramifications  of  this  are  far- 
reaching  and  concern  even  the  relationship  between 
states:  No  state  has  the  right  to  initiate  a  war  as  an 
aggressor  because  the  executive  power  of  the  law 
of  nature  that  has  been  delegated  to  the  state  can 
only  be  exercised  for  the  purposes  of  reparation 
and  restraint.  Locke,  unlike  Hobbes,  may  thus 
distinguish  between  just  and  unjust  wars,  conceiv¬ 
ing  limits  to  the  contenders’  lawful  conduct  during 
a  war,  implicitly  following  Hugo  Grotius’  distinc¬ 
tion  between  ius  ad  bellum  (the  right  to  go  to  war) 
and  ius  in  bello  (right  conduct  in  war). 

The  creation  of  a  single  judge  would  not  be  an 
improvement  on  the  state  of  nature,  were  it  not 
accompanied  by  the  separation  of  powers  and 
were  the  principles  of  the  law  of  nature  not  fixed 
in  established  and  impartial  laws  decided  by  the 
community  itself.  The  compact  by  which  every 
man  authorizes  the  political  community  to  make 
laws  may  not,  therefore,  institute  absolute  sover¬ 
eignty.  An  absolute  sovereign,  legibus  solutus  and 
sole  master  of  all  powers,  is  still  in  a  state  of  nature 
with  his  subjects  or  rather  in  that  degenerate  form 
of  the  state  of  nature  that  is  the  state  of  war.  No 


man,  Locke  maintains,  can  be  so  foolish  as  to 
abandon  the  condition  of  perfect  freedom  of  the 
state  of  nature  in  order  to  put  himself  under  the 
absolute  power  of  another  man.  In  a  civil  society, 
legislative  power  and  executive  power  derive  from 
a  fiduciary  trust  granted  by  the  people.  Since  such 
power  is  founded  on  trust,  it  can  be  revoked  by  the 
very  same  people  when  its  rulers  systematically 
break  the  law.  In  response  to  repeated  and  wide¬ 
spread  violations  of  individual  rights  by  political 
power,  when  judiciary  intervention  is  not  feasible, 
there  is  always  the  so-called  appeal  to  heaven.  It  is 
only  in  extreme  cases,  according  to  Locke,  that 
people  will  have  recourse  to  violence.  The  right  of 
resistance  may  thus  be  seen  more  as  a  warning  to 
those  who  govern,  that  they  must  obey  the  law, 
than  as  a  real  threat  by  the  people. 

In  A  Letter  Concerning  Toleration,  Locke 
resorts  once  again  to  the  strategy  of  separation 
that  would  later  become  the  legacy  of  the  entire 
liberal  tradition,  separating  the  legitimate  field  of 
action  of  the  state  from  that  of  the  churches,  con¬ 
ceived  as  voluntary  associations.  Care  for  the  soul 
is  a  private  affair,  pertaining  only  to  the  individual. 
Men  will  never  reach  an  agreement  on  religious 
issues  but  will  continue  to  divide  and  adhere  to 
different  churches.  Locke  is  one  of  the  first  phi¬ 
losophers  to  seriously  address  the  persistence  and 
inevitability  of  disagreement  on  religious  issues. 
Each  and  every  believer  will  continue  to  consider 
his  church  the  sole  representative  of  religious 
orthodoxy.  As  the  magistrate  has  no  greater 
knowledge  of  the  path  to  salvation  than  any  indi¬ 
vidual  citizen,  he  cannot  impose  his  will  in  reli¬ 
gious  matters.  The  state,  however,  has  no  reason 
to  fear  religious  pluralism.  Religious  citizens  in 
fact  possess  social  and  moral  virtues  that  can  be 
useful  to  the  state.  Furthermore,  as  it  welcomes  the 
existence,  in  the  social  space,  of  a  plurality  of  civil, 
commercial,  and  scientific  associations,  so  the 
state  should  admit  the  presence  of  a  plurality  of 
religious  associations.  They  are  no  more  danger¬ 
ous  than  other  associations,  as  long  as  they  are 
allowed  to  compete  freely.  Competition  might  also 
improve  the  moral  conduct  of  each  church,  both 
internally  and  in  its  relations  with  other  churches. 
It  is  atheists  who  represent  a  threat  to  the  social 
order:  Their  word  cannot  be  trusted  because  they 
do  not  believe  in  the  existence  of  a  God  who  can 
see  and  judge  us  even  in  the  dark.  A  church,  such 
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as  the  Catholic  Church,  that  strives  to  unite  in 
itself  both  spiritual  and  temporal  power,  remains  a 
grave  threat  to  the  state  and  to  the  public  order,  as 
does  a  church  that  promulgates  beliefs  detrimental 
to  human  society,  such  as  the  belief  that  it  is  justi¬ 
fied  not  to  keep  a  promise  to  one  who  espouses 
unorthodox  religious  doctrines. 
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University  of  Florence 
Florence,  Italy 

See  also  Constitutionalism;  Contract  Theory;  Hobbes, 
Thomas;  Liberalism;  Natural  Law;  Tolerance 

Further  Readings 

Grant,  R.  W.  (1987).  John  Locke’s  liberalism.  Chicago: 

University  of  Chicago  Press. 

Simmons,  J.  (1994).  The  Lockean  theory  of  rights. 

Princeton,  NJ:  Princeton  University  Press. 

Spitz,  J.-F.  (2001).  John  Locke  et  les  fondements  de  la 
liberte  modeme  [John  Locke  and  the  basis  of  modern 
liberty].  Paris:  Presses  Universitaires  de  France. 
Tuckness,  A.  (2002).  Locke  and  the  legislative  point  of 
view:  Toleration,  contested  principles  and  the  law. 
Princeton,  NJ:  Princeton  University  Press. 

Waldron,  J.  (2002).  God,  Locke  and  equality:  Christian 
foundations  in  Locke’s  political  thought.  Cambridge, 
UK:  Cambridge  University  Press. 


Logic  of  Appropriateness 


James  G.  March  defines  the  logic  of  appropriate¬ 
ness  thus:  Individuals  and  organizations  fulfill  or 
enact  identities  by  following  rules  and  procedures 
that  they  imagine  to  be  appropriate  to  the  situa¬ 
tions  they  are  facing.  Following  standard  operat¬ 
ing  procedures,  they  act  according  to  what  is 
expected  of  them,  whether  they  are  politicians, 
civil  servants,  or  citizens.  Appropriateness  is  a 
concept  taken  from  culturally  oriented  theory  and 
is  concerned  with  informal  norms  and  values,  but 
it  is  also  related  to  structural  theory  and  formal 
norms  because  formal  structure  influences  the 
development  of  informal  norms.  A  distinction  is 
often  made  between  the  processes  whereby  appro¬ 
priate  modes  of  thought  and  action  evolve  and  the 


situations  that  call  for  appropriate  action — generally 
termed  matching  situations. 

In  this  entry,  the  main  features  of  the  logic  of 
appropriateness  are  outlined  and  discussed,  includ¬ 
ing  the  key  concepts  underlying  it — identities  and 
rules,  and  the  relationship  between  them.  Next, 
the  entry  will  show  that  acting  in  an  appropriate 
way  involves  matching  situations  that  are  poten¬ 
tially  complex  and  ambiguous  and  that  formal 
rules  often  heavily  influence  informal  ones,  which 
affects  the  logic  of  appropriateness.  Finally,  we 
will  discuss  how  the  logic  of  appropriateness  dif¬ 
fers  from  the  logic  of  consequentiality,  the  logic 
most  often  connected  with  more  rationally  ori¬ 
ented  theories.  The  examples  used  are  mostly 
taken  from  studies  of  public  organizations. 

Main  Features  and  Key  Concepts 

The  three  main  questions  constituting  the  logic  of 
appropriateness  are  the  following: 

1.  First,  what  kind  of  situation  am  I  actually  facing 
as  an  actor  (individually  or  institutionally)?  This 
is  a  question  of  recognition. 

2.  Second,  what  is  the  main  identity  of  my 
institution  and  what  is  my  own  main  identity? 

3.  Third,  what  am  I  and  my  organization  expected 
to  do  in  a  situation  like  this?  That  is,  what  are 
the  rules  connecting  the  situation  and  our 
identities? 

A  crucial  question  is  how  actors  set  about 
answering  such  questions  in  matching  situations. 
One  way  is  to  learn  from  experience — experiential 
learning — either  by  looking  at  how  the  matching 
has  been  done  recently  or  by  finding  out  how  the 
institution  traditionally  does  this  matching.  A  sec¬ 
ond  way  is  to  react  cognitively  or  through  catego¬ 
rization  based  on  mental  maps  or  else  through 
what  Karl  Weick  labels  “sense-making.”  A  third 
way  is  to  use  the  experience  of  others  to  match, 
either  by  stressing  that  the  context  is  exactly  the 
same  as  the  one  others  have  experienced  (contex- 
tualization)  or  by  arguing  that  the  context  belongs 
to  a  broad  category  that  is  typical  for  many  orga¬ 
nizations  (decontextualization). 

According  to  James  G.  March  and  Johan  P. 
Olsen,  the  logic  of  appropriateness  deals  with 
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what  is  essential  in  a  role,  not  with  what  is  instru- 
mentally  or  arbitrarily  defined.  Faced  with  differ¬ 
ent  kinds  of  stimuli,  actors  respond  in  complex, 
standardized,  and  almost  intuitive  ways,  without 
any  immediate  comprehensive  analysis,  problem 
solving,  or  discretion — the  latter  being  features 
that  become  part  of  the  logic  over  time.  Approp¬ 
riateness  presupposes  that  actors  have  multiple 
identities  or  a  repertoire  of  identities — and  hence 
multiple  rule  options — and  these  become  relevant 
according  to  the  different  situations  actors  face. 
One  implicit  assumption  is  that  these  identities  and 
rules  exist  in  relatively  consistent  sets  to  further 
systematic  behavior. 

Identities,  Rules,  and  Situations 

Identities  are  connected  both  to  the  individual 
actor  and  to  institutions.  For  the  individual,  iden¬ 
tity  is  a  conceptualization  of  the  self  organized  into 
rules  in  order  to  match  action  to  situations. 
Identities  are  constructed  by  the  individual  actor, 
imposed  by  the  environment,  and  influenced  by  a 
broader  cultural  context.  Individual  identities  can 
be  formed  by  two  interwoven  processes:  a  process 
of  individualization,  where  the  actor  voluntarily 
chooses  from  self-imposed  and  self-selected  roles 
and  rules,  and  a  process  of  socialization,  where 
obligations,  responsibilities,  and  commitment  are 
learned  and  followed,  not  chosen. 

Institutional  identities  are  based  on  the  evolu¬ 
tion  of  common  individual  or  group  identities, 
resulting  from  the  same  type  of  experiences  or 
from  internal  and  external  constraints,  thereby 
creating  similar  attitudes,  norms,  and  values. 
These  are  organized  into  broader  social  roles,  and 
individual  identities  can  be  derived  from  them. 
Stephen  Krasner  describes  the  same  phenomenon 
with  his  concept  of  “vertical  depth”  in  the  process 
of  institutionalizing  norms  and  values,  referring  to 
factors  that  are  especially  important  for  defining 
attitudes  and  activities. 

The  logic  of  appropriateness  alludes  more  to 
informal  than  formal  rules,  but  formal  rules  defi¬ 
nitely  influence  informal  ones,  for  example,  con¬ 
cerning  appropriateness  in  the  civil  service.  Rules 
are  defined  independently  of  the  individual  actors 
who  enact  them,  meaning  that  personal  characteris¬ 
tics  are  of  less  importance  and  that  different  people 
in  the  same  time  period,  position,  and  institution 


will  match  situations,  identities,  and  rules  in  some 
of  the  same  ways.  Rule  following  can  be  seen  as  a 
kind  of  implicit  contract  or  pact;  if  an  actor 
behaves  appropriately,  he  or  she  will  also  be 
treated  appropriately.  Rules  are  codified  and  stan¬ 
dardized  to  a  certain  extent,  as  a  precondition  for 
channeling  ideas  and  behavior,  and  may  sometimes 
be  inconsistent.  They  are  meant  to  organize  and 
regularize  behavior  and  to  further  standardization 
instead  of  individual  discretionary  behavior.  But 
because  rules  can  be  violated,  are  sometimes  too 
inflexible,  may  disregard  consequences,  or  may 
function  as  socially  constructed  myths,  they  may 
potentially  have  problems  fulfilling  these  functions. 

The  theory  of  the  logic  of  appropriateness  is 
little  concerned  with  what  types  of  situations 
actors  face  when  trying  to  connect  or  match  rules, 
identities,  and  situations.  Appropriateness  is  often 
connected  with  familiar  and  routine  situations,  but 
there  are,  of  course,  also  appropriate  ways  to 
behave  in  poorly  defined  situations,  such  as  turbu¬ 
lent  or  crisis  periods.  Matching  is  not  only  about 
stability  but  also  takes  place  when  both  rules  and 
situations  are  changing.  Philip  Selznick  emphasizes 
in  his  cultural  theory  that  so-called  critical  deci¬ 
sions  are  what  change  public  institutions  the  most 
and  are  different  from  routine  decisions. 

One  starting  point  for  elaborating  the  logic  of 
appropriateness  is  to  underscore  that  actors,  for 
example,  in  public  organizations,  are  probably  sel¬ 
dom  in  unambiguous  situations  where  identities  are 
clear-cut  and  where  one  identity  is  dominant.  They 
have  multiple  individual  and  institutional  identities. 
Their  environment  is  often  turbulent,  with  complex 
dependences,  creating  a  complex  attention  struc¬ 
ture,  especially  when  a  number  of  identities  are  sig¬ 
nificant  and  possibly  in  conflict.  A  civil  servant  can, 
for  example,  have  identities  relating  to  the  special¬ 
ization  of  the  formal  structure  and  the  complexity  of 
its  cultural  history,  to  professional  competence,  and 
to  gender.  Identities  are  further  connected  with 
employment  in  a  certain  ministry,  in  a  specialized 
division  or  in  a  smaller  unit,  or  else  with  certain 
specialized  tasks  and  functions.  Another  set  of  iden¬ 
tities  is  related  to  the  interpretation  of  the  cultural 
history  and  traditions  of  a  public  institution.  Some 
civil  servants  may  have  a  long  tenure  related  to  a 
subculture  in  the  organization,  while  others  may 
have  a  short  tenure  during  which  they  have  become 
socialized  into  general  cultural  norms  and  values. 
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And  actors  differ  according  to  their  professional 
backgrounds  and  the  professional  contexts  they 
work  in.  All  this  illustrates  that  identities  are 
numerous  and  characterized  by  complexity  and 
varying  degrees  of  consistency. 

Variety  can  lead  to  ambiguity  concerning  iden¬ 
tities  or  to  situations  with  possibly  competing 
logics  of  appropriateness;  what  is  appropriate  for 
one  actor,  individually  or  institutionally,  may  be 
inappropriate  for  another.  Selznick’s  analysis 
of  cooptation  in  the  Tennessee  Valley  Authority 
provides  an  example  of  this.  When  the  leadership 
choose  cooptation  as  a  strategy  to  address  busi¬ 
ness  interests  in  the  “task  environment,”  this 
could  be  seen  as  appropriate  from  a  developmen¬ 
tal  perspective.  But  had  they  favored  another  type 
of  appropriateness,  attending  to  a  more  conserva¬ 
tionist  point  of  view,  they  would  have  probably 
chosen  another,  less  inclusive  strategy. 

A  general  question  is,  therefore,  what  type  of 
conditions  are  important  for  bringing  some  identi¬ 
ties  or  some  aspects  of  identities  to  the  forefront 
and  keeping  others  in  the  background  in  an  actor’s 
allocation  of  attention.  We  would  claim  that  iden¬ 
tity  related  to  the  formal  structure,  role,  and  func¬ 
tions  in  public  organizations  is  more  dominant  than 
culturally  evolved  identities  inside  organizations 
or  external  social  status,  which  are  subordinate  to 
formal  conditions  and  less  relevant  for  decision 
making.  Another  answer  points  to  the  fact  that 
organizations  differ  with  respect  to  developing  col¬ 
lective,  institutional  identities  and,  therefore,  also 
with  respect  to  the  ease  with  which  members  can 
base  their  thoughts  and  actions  on  such  identities. 
March  and  Olsen  emphasize  that  some  organiza¬ 
tions  can  have  strong  integrative  features,  creating 
a  feeling  of  common  goals  and  destiny  among  citi¬ 
zens,  politicians,  or  civil  servants,  while  others  are 
more  aggregative  and  atomized. 

One  would  expect  the  identities  of  most  actors  to 
be  dominated  by  clear-cut  formal  rules  but  also  to 
have  informal  rules  or  complex  patterns  of  formal 
and  informal  rules,  which  are  either  consistent  or 
inconsistent,  as  their  main  guidelines  for  action.  But 
rules  can  also  be  said  to  evolve  during  the  process  of 
matching;  the  matching  process  also  involves  creat¬ 
ing  new  rules  and  identities,  not  only  selecting  from 
preexisting  ones.  Sometimes  rules  and  identities 
undergo  a  transformation  or  editing  process  when 
they  are  confronted  with  new  or  complex  situations. 


Consequentiality  and  Appropriateness 

The  logic  of  consequentiality  is  based  on  a  rational 
instrumental  tradition,  where  utilitarian  reasoning 
about  cost  and  effects  is  an  important  part  of  the 
equation.  The  logic  of  consequentiality  may  be  seen 
as  competing  with  the  logic  of  appropriateness,  but 
it  is  more  often  seen  as  only  one  version  of  rule  fol¬ 
lowing;  in  other  words,  rationality  is  one  rule  that 
is  connected  to  reasoning  about  consequences.  In 
the  logic  of  appropriateness,  contrary  to  the  logic 
of  consequentiality,  preferences  and  expectations 
about  the  consequences  of  action  are  not  the  most 
important  consideration;  the  main  thing  is  to  iden¬ 
tify  what  is  appropriate.  Rules  can  be  followed 
even  if  the  consequences  are  negative — with  regard 
to  self-interest,  for  example.  But  what  is  appropri¬ 
ate  can  also  reflect  a  history  of  negotiations,  con¬ 
sensus  building,  or  winning  coalitions. 

Although  analytically  distinguishable,  it  is 
often  difficult  to  separate  arguments  of  appropri¬ 
ateness  from  arguments  of  consequentiality  in 
concrete  decision-making  processes.  The  main 
reason  for  this  is  that  the  extensive  application  of 
rationality  and  the  logic  of  consequentiality  is  a 
very  strong  ideology  permeating  modern  organi¬ 
zations,  to  such  an  extent  that  behaving  appropri¬ 
ately  often  means  demonstrating  clearly  that  one 
is  acting  in  accordance  with  this  logic.  Some 
aspects  of  the  puzzling  mixture  of  arguments  of 
appropriateness  and  consequentiality  find  expres¬ 
sion  in  the  creation  of  rationalized  myths  (i.e., 
popular  ideas  about  how  to  build  and  change 
modern  organizations)  and  in  the  interpretation 
of  their  effects. 
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Logit  and  Probit  Analyses 


Logit  and  probit  models  are  designed  to  analyze 
research  questions  where  the  dependent  variable  is 
binary.  Examples  of  these  kinds  of  problems  are 
studies  of  voter  turnout  (where  each  individual 
might  vote  or  abstain),  legislative  voting  on  a  bill 
(where  each  legislator  might  vote  yes  or  no),  and 
war  (where  each  country  might  choose  to  go  to 
war  or  stay  at  peace).  In  practice,  there  is  little  dif¬ 
ference  between  the  logit  and  probit  models,  and 
these  models  are  interchangeable  in  almost  all 
research  settings.  Logit  and  probit  models  fall  into 
the  category  of  discrete  choice  models,  so  named 
because  the  political  agents  we  are  studying  are 
making  choices  from  among  a  finite  set  of  distinct 
options.  In  this  entry,  the  derivation  and  interpre¬ 
tation  of  such  models  are  discussed. 

Deriving  the  Logit  and  Probit  Models 

Perhaps  the  most  straightforward  way  to  derive 
the  logit  and  probit  models  is  to  assume  that  a 
political  agent’s  choice  is  determined  by  some  con¬ 
tinuous  underlying  rating  of  a  political  item  or  a 
propensity  to  engage  in  some  behavior  known  as  a 
latent  variable.  Each  political  agent’s  position  on 
this  latent  variable  (y*)  is  assumed  to  be  a  linear 
function  of  a  set  of  some  observed  variables  and 
coefficients  (X.|3)  and  some  unobserved  stochastic 
influences  (e,).  Thus,  a  political  agent’s  latent 
choice  preference  is  given  by 

y*  =  +  £,• 

If  y*  was  observed,  we  could  simply  estimate 
this  equation  via  linear  regression.  However,  in  a 
binary  choice  setting,  the  agent’s  position  on  this 
latent  variable  is  unobserved.  Instead,  a  choice 
threshold  t  divides  the  latent  variable  into  two 
categories,  and  we  only  observe  y,  which  indicates 
which  choice  category  the  agent  selected.  We 


observe  y,  =  0  for  agents  who  are  at  or  below  t 
and  yt  —  1  for  agents  above  the  threshold.  For 
instance,  a  country  might  have  an  underlying  pro¬ 
pensity  to  fight  with  its  neighbors,  but  we  can  only 
observe  this  country  in  one  of  two  discrete  choice 
categories:  war  or  peace. 

Since  y*  is  latent,  we  must  make  several  iden¬ 
tifying  assumptions  to  estimate  the  probability 
that  a  political  agent  makes  a  particular  choice. 
Although  these  identifying  assumptions  are  arbi¬ 
trary,  they  do  not  influence  the  estimate  of  the 
relationship  between  the  X(s  and  choice  behav¬ 
ior.  The  first  of  these  assumptions  is  to  set  t  =  0. 
We  make  this  assumption  because  we  cannot 
obtain  a  unique  estimate  for  both  t  and  the  con¬ 
stant  term  in  the  model:  Since  y*  is  unobserved, 
we  could  add  an  arbitrary  constant  to  both  t  and 
the  constant  term  and  still  generate  the  same 
probability  of  an  observed  outcome.  With  this 
assumption,  and  noting  that  y*  —  X;p  +  e„  the 
probability  that  an  agent  selects  the  choice  cate¬ 
gory  represented  by  y  =  1  given  the  observed 
variables  X;  is 

Pr(y,  =  1  I  X.)  =  Pr(X,p  +  e.  >  0  I  X,). 

This  equation  reveals  that  the  probability  that 
political  agents  select  the  choice  category  denoted 
by  y  —  1  will  depend  in  part  on  the  unobserved 
error  term  e,.  By  subtracting  X.(3  from  both  sides 
of  the  inequality  and  assuming  that  the  distribu¬ 
tion  of  e,  is  symmetric,  we  can  rearrange  this  equa¬ 
tion  to  read  as  follows: 

Pr(y,  -  1  I  X,.)  -  Pr(e,  <  X,(3  I  X;). 

This  equation  is  simply  the  evaluation  of  the 
cumulative  density  function  (CDF)  of  &i  at  X^, 
which  we  can  express  as  Pr(yJ  =  1  I  X,)  =  F(X.(3). 

The  probability  that  we  will  observe  yt  —  1 
given  X;  can  now  be  estimated  once  we  make  some 
identifying  assumptions  about  the  distribution  and 
the  variance  of  the  error  term  e,.  Almost  any  dis¬ 
tribution  can  be  assumed — for  instance,  we  could 
assume  that  this  error  term  follows  a  uniform  dis¬ 
tribution  and  derive  a  linear  probability  model. 
However,  in  practice,  we  typically  assume  one  of 
two  distributions. 

One  common  assumption  is  that  the  error  term 
is  drawn  from  an  independently  and  identically 
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distributed  (HD)  normal  distribution.  This  results 
in  the  probit  model: 

Pr(y,  =  1  I  X,.)  -  <J>(X,.0), 

where  is  the  CDF  of  the  normal  distribution.  As 
y  *  is  unobserved,  we  cannot  estimate  the  variance 
of  the  error  term  as  we  can  with  a  linear  regres¬ 
sion.  Thus,  a  standard  assumption  for  probit  mod¬ 
els  is  that  Var(e;)  =  1. 

Another  common  assumption  is  that  the  error 
term  follows  an  IID  logistic  distribution.  The  logis¬ 
tic  distribution  is  very  similar  to  a  normal  distribu¬ 
tion  but  with  slightly  more  probability  in  the  tails. 
This  results  in  the  logit  model: 

Pr(y,  =  1  I  X,)  =  exp((xX,P)/[l  +  exp(p.X,p)], 

where  exp  indicates  that  we  are  taking  the  expo¬ 
nential  of  the  term  in  parentheses,  and  p.  is  a  posi¬ 
tive  scale  parameter.  As  with  the  probit  model,  we 
must  make  an  assumption  about  the  variance  of 
this  CDF:  In  this  case,  we  assume  p  =  1,  which  is 
equivalent  to  assuming  that  Var(s,)  =  tt2/3.  Thus, 
the  estimation  of  a  logit  model  will  produce  esti¬ 
mates  of  p  that  are  approximately  \J tt2/3  «  1.81 
times  larger  than  the  corresponding  estimates  from 
a  probit  model.  This  difference  is  due  solely  to  the 
different  assumptions  made  about  the  variance  of 
the  error  term  in  the  two  models.  In  practice,  there 
is  almost  never  a  substantive  difference  between 
the  logit  and  probit  models,  and  either  choice  is 
valid  in  virtually  all  cases. 

Both  the  logit  and  probit  models  produce  a 
nonlinear  relationship  between  the  X,s  and  the 
choice  probabilities.  If  the  probability  of  observing 
y  —  1  increases  as  some  variable  X  increases,  then 
graphing  Pr(y  =  1  I  X)  against  X  will  produce  the 
well-known  S-shaped  probability  curve  character¬ 
istic  of  these  models  (the  mirror  image  of  this  curve 
is  produced  if  the  probability  of  observing  y  —  1 
decreases  as  the  variable  X  increases). 

Estimation 

Logit  and  probit  models  are  not  estimated  via  ordi¬ 
nary  least  squares  (OLS)  due  to  the  nonlinearity 
discussed  above.  Instead,  these  models  are  esti¬ 
mated  through  maximum  likelihood  estimation 
(MLE).  This  is  done  by  first  creating  a  likelihood 


function  and  then  taking  the  natural  log  of  this 
function  to  create  a  log-likelihood  function  (we 
take  the  natural  log  of  the  likelihood  function 
because  it  is  easier  to  work  with). 

For  instance,  the  log  likelihood  of  the  probit 
model  is 

InL  =  £y,lnT>(X,P)  +  (1  -  y,)ln[l  -  3>(X,-0)]. 

i 

Maximizing  this  log-likelihood  means  maximizing 
the  sum  of  the  predicted  probabilities  assigned  to 
the  choices  that  political  agents  in  fact  made.  This 
is  done  by  estimating  p  to  have  values  such  that 
X;P  tends  to  be  large  when  yt  —  1  and  small  when 

y,  =  o. 

One  difficulty  that  can  arise  in  the  estimation  of 
logit  and  probit  models  is  known  as  separation, 
which  occurs  when  an  independent  variable  per¬ 
fectly  predicts  the  binary  dependent  variable.  In 
these  cases,  the  coefficient  and  standard  error  on 
the  independent  variable  will  be  infinite.  For 
example,  by  many  commonly  used  definitions  of 
war,  there  are  no  instances  where  two  nuclear 
states  have  gone  to  war  with  each  other.  Including 
a  variable  indicating  that  a  state  is  a  nuclear  power 
in  a  logit  or  probit  model  with  war  as  the  depen¬ 
dent  variable  would  create  a  separation  problem  as 
the  nuclear  power  variable  perfectly  predicts  peace. 
Researchers  sometimes  solve  these  kinds  of  separa¬ 
tion  problems  by  omitting  the  problematic  inde¬ 
pendent  variable,  adding  random  noise  to  their 
data,  or  adopting  a  penalized  likelihood  estimation 
approach. 

In  general,  however,  the  log-likelihood  functions 
for  both  the  logit  and  probit  models  are  relatively 
easy  to  maximize,  with  most  statistical  software 
packages  reaching  the  solution  in  seconds.  However, 
note  that  the  consistency,  normality,  and  efficiency 
of  MLE  depends  on  asymptotic  arguments — the 
small-sample  properties  of  MLE  are  largely 
unknown.  Thus,  researchers  should  be  cautious 
when  estimating  logit  or  probit  models  with  small 
samples. 

Interpretation 

A  simple  examination  of  the  coefficients  in  logit 
and  probit  models  reveals  whether  an  independent 
variable  had  a  positive  or  negative  effect  on  the 
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probability  of  observing  y  —  1  and  whether  that 
variable  was  statistically  significant.  However, 
interpretation  of  the  substantive  effect  of  an  inde¬ 
pendent  variable  in  these  models  is  more  difficult 
than  the  interpretation  of  linear  regression  models 
as  the  relationship  between  X;(3  and  the  choice 
probabilities  depends  on  a  nonlinear  function.  The 
effect  of  a  one-unit  change  in  one  independent 
variable  on  the  choice  probabilities  will  depend  on 
the  values  of  the  other  independent  variables  in  the 
model.  This  means  that  we  cannot  interpret  the  (3s 
in  logit  and  probit  models  as  we  would  interpret 
them  in  an  OLS  setting. 

The  most  common  method  for  interpreting  the 
substantive  effects  of  coefficients  in  logit  and  pro¬ 
bit  models  in  political  science  is  through  the  use  of 
a  hypothetical  observation.  The  researcher  begins 
by  selecting  values  for  the  independent  variables  in 
the  model  that  are  representative  of  some  case  of 
interest  in  the  study.  Setting  variables  to  their 
mean  or  modal  values  is  common,  although  this  is 
not  the  only  possibility.  Often,  the  researcher  will 
select  values  for  the  independent  variables  that 
reflect  some  case  of  substantive  interest.  Then, 
using  the  estimated  values  of  p  and  the  selected 
values  of  X,  a  baseline  probability  for  y  =  1  is 
calculated.  For  instance,  we  would  interpret  the 
coefficients  in  a  probit  model  by  calculating 
Pr(y  =  1|X)  =  c£>(Xp),  where  X  represents  the 
selected  values  for  the  independent  variables.  The 
influence  of  a  particular  independent  variable  on 
the  probability  that  y  —  1  can  then  be  examined  by 
changing  the  value  of  this  variable  while  holding 
all  other  independent  variables  constant  at  X  and 
recalculating  Pr(y  =  1|X). 

As  these  predicted  probabilities  are  based  on 
our  model  estimate  and  are  thus  themselves  esti¬ 
mates,  it  is  usually  desirable  to  calculate  standard 
errors  for  these  probabilities.  In  political  science, 
this  is  most  commonly  done  through  simulation. 
Hundreds  or  thousands  of  draws  are  taken  from 
the  multivariate  normal  distribution  defined  by 
the  coefficients  and  covariance  matrix  estimated 
by  the  logit  or  probit  model,  a  predicted  probabil¬ 
ity  is  calculated  for  each  draw,  and  the  mean  and 


standard  deviation  of  this  set  of  predicted  proba¬ 
bilities  is  reported. 

Related  Discrete  Choice  Models 

Ordered  logit  and  ordered  probit  models  are 
derived  from  the  logit  and  probit  models  by  adding 
additional  choice  thresholds  to  the  latent  variable 
and  were  developed  for  research  questions  where 
the  dependent  variable  consists  of  more  than  two 
ordered  categories.  Multinomial  logit  and  multino¬ 
mial  probit  models  have  been  developed  to  study 
research  questions  where  the  dependent  variable 
consists  of  more  than  two  unordered  categories. 
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Machiavellism  commonly  refers  to  a  cynical, 
amoral  view  of  power  and  human  relations  and 
also  to  a  politics  of  force  and  of  calculated  arbi¬ 
trariness.  In  his  major  works,  The  Prince  and 
Discourses  on  Livy,  the  political  philosopher  and 
diplomat  Niccolo  Machiavelli  took  the  scandal¬ 
ous  liberty  of  speaking  well  of  evil:  namely,  of 
cruelty,  obscenity,  treachery,  lies,  avarice,  disbe¬ 
lief,  irreligion,  craftiness,  and  boldness.  His  name 
has  been  consequently  charged  with  the  abstract 
and  moral  notion  of  evil,  particularly  in  politics. 
This  was  certainly  a  way  to  render  his  theories 
illicit  and  detestable;  nonetheless,  a  strong  interest 
in  them  has  never  diminished.  Machiavelli  remains 
one  of  the  most  widely  published  authors  of  the 
past  5  centuries  and  one  of  the  rare  ones,  along 
with  the  Marquis  de  Sade  and  Leopold  von 
Sacher-Masoch,  to  have  his  name  substantiated 
for  describing  scary  and  excessive  behaviors.  This 
entry  presents,  first,  some  of  Machiavelli’s  most 
controversial  concepts  and,  second,  major  events 
concerning  his  life. 

Machiavelli  as  Political  Thinker 

For  political  science,  Machiavelli  has  symbolized 
the  advent  of  a  realist  approach  to  politics  and 
society.  “Realist”  means  both  the  criticism  of  any 
utopian  and  transcendental  conception  of  politics 
and  a  demystifying  analysis  of  human  history.  One 


could  insist  that  medieval  jurists,  such  as  Bartolus 
of  Sassoferrato  who,  in  commenting  on  Aristotle’s 
Politics,  opposed  the  moral  philosophers  of  his 
times,  anticipated  such  an  approach.  But 
Machiavelli’s  oft-quoted  formula,  according  to 
which  it  is  more  appropriate  to  go  behind  appear¬ 
ances  to  the  effectual  truth  of  the  thing  than  to  the 
imagination  thereof  (The  Prince,  15),  brought 
something  different  to  the  debate.  Machiavelli  pro¬ 
posed  analytic  tools  for  an  evaluation  of  the 
political  world  as  it  is  essentially  and  not  as  it 
appears  to  be  superficially  by  immediate  experi¬ 
ence  or  by  moral  assumption.  His  realism,  how¬ 
ever,  is  not  a  skeptical  submission  to  the  existing 
state  of  things  and  to  the  authority  of  tradition  and 
the  powerful.  It  is  based  on  a  constant  dialogue 
between  his  own  experience  of  the  events  of  his 
times  and  his  continual  study  of  antiquity.  This  is 
the  condition  for  a  theoretical  understanding  of 
what  can  and  cannot  be  possible.  The  notion  that 
humankind  is  the  same  throughout  history  does 
not  imply  that  the  order  of  things  cannot  be 
changed,  nor  does  it  imply  that  an  invisible  hand 
rules  it.  It  makes  possible,  to  the  contrary,  the 
intelligibility  of  political  experience. 

Philosophically  trained  by  his  early  work  on  the 
ancient  poet  Lucretius’s  antireligious  On  the 
Nature  of  Things,  Machiavelli  aimed  at  rendering 
men  the  artisans  of  their  own  fortune  ( The  Prince, 
25).  He  was  the  first  to  propose  to  everyone,  in  a 
bold,  clear  style,  a  kind  of  knowledge  about  power 
that  was  previously  reserved  for  an  elite  educated 
in  thorny  juridical  casuistry  and  in  the  practice  of 
political  affairs.  He  has  been  both  condemned  and 
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praised  for  having  unveiled  the  “mysteries  of  the 
state”  and  the  tools  of  its  authority,  potentially 
acquainting  the  masses  with  the  inner  workings  of 
politics.  Until  the  advent  of  the  age  of  revolutions, 
from  Jean  Bodin  to  John  Adams,  Machiavelli  was 
reasonably  considered  as  the  founder  of  a  plebeian 
political  philosophy.  He  introduced  the  topic  of 
political  equality  and  the  equal  political  ability  of 
all  men  without  consideration  of  rank,  riches,  and 
lineage  at  the  same  time  as  he  introduced  the  con¬ 
cept  of  a  class  division  of  “desire”  between  the 
aristocracy  ( grandi )  and  the  people.  The  aristoc¬ 
racy  is  moved  by  its  desire  to  oppress  the  people 
and  the  people  by  their  desire  not  to  be  oppressed 
by  the  aristocracy  (The  Prince,  9).  Tyranny,  in 
Machiavelli’s  thinking,  is  set  in  motion  by  the 
ambition  of  the  aristocracy,  which  considers  itself 
above  the  laws  and  tends  to  weaken  society  in 
order  to  make  it  submit  it  to  its  own  purposes 
(Discourses,  I  40,  III  28).  According  to  his  agonis¬ 
tic  and  dynamic  vision  of  social  institutions,  which 
suggests  that  popular  discord  makes  society  free 
and  powerful,  the  people  appear  to  be  the  best 
guardians  of  liberty  (Discourses,  1 4-5,  58).  By  giv¬ 
ing  voice  to  even  the  most  extremist  discourse  of 
the  rebellious  working  classes  (the  ciompi)  Machi¬ 
avelli  granted  it  a  literary  and  political  dignity 
(Florentine  Histories,  III  13).  Added  to  his  consid¬ 
eration  that  it  may  sometimes  be  necessary  to 
extinguish  the  aristocracy  in  order  to  establish  a 
republic  (Discourses,  I  55),  the  scandalous  rupture 
this  anti-elitist  political  stance  introduced  into 
political  philosophy  forced  most  successive  politi¬ 
cal  theorists  to  measure  themselves,  explicitly  or 
not,  against  his  thought.  In  spite  of  the  censure 
exerted  by  the  Catholic  Church  on  his  works  and 
personal  history,  most  of  the  great  philosophical 
and  political  ideas  of  the  past  5  centuries  had  their 
beginnings  in  Machiavelli.  His  works  had  a  posi¬ 
tive  influence  on  Bodin,  Montaigne,  Bacon,  Vanini, 
Spinoza,  Sidney,  Bayle,  Montesquieu,  the  Encyclo¬ 
pedists,  the  Federalists,  Hegel,  Marx,  Weber,  and 
Gramsci,  among  many  others. 

The  Florentine  World 

The  third  son  of  an  attorney  and  his  wife,  Niccolo 
Machiavelli  was  born  in  Florence,  Tuscany,  into  a 
territorial  state  shaped  by  heavy  political,  social, 
and  economic  contradictions.  The  banking  and 


mercantile  aristocracy  of  Florence,  who  controlled 
the  state’s  economic  and  financial  organization, 
had  built  a  fiscal  system  in  which  its  contributions 
took  the  form  of  loans  bearing  interest.  By  con¬ 
trast,  the  inhabitants  of  Florentine’s  subject  territo¬ 
ries  and  the  masses  of  the  Florentine  people  carried 
the  weight  of  taxes,  from  which  a  big  part  was 
absorbed  by  the  service  of  the  state  debt.  In  1494, 
with  the  beginning  of  the  Italian  Wars,  the  Medici 
regime,  which  had  consolidated  the  financial  sys¬ 
tem  in  favor  of  the  aristocracy,  collapsed.  A  popu¬ 
lar  government  was  established  with  one  third  of 
the  Florentine  householders  qualifying  to  partici¬ 
pate  in  the  assembly  of  the  Great  Council.  It  had 
to  face  the  disaggregation  of  the  territories,  which 
had  rebelled  against  the  previous  forms  of  subjec¬ 
tion.  As  Machiavelli  asserted,  the  Florentine  ruling 
class  decided  to  disarm  the  people  in  order  to  enjoy 
the  immediate  usefulness  of  being  able  to  plunder 
them  (Discourses,  II  30).  The  intimate  link  between 
public  debt,  the  employment  of  mercenary  troops, 
and  aristocratic  interests  (whose  tremendous  nega¬ 
tive  social  consequences  appeared  clearly  when  an 
external  threat  arose,  as  after  1494)  is  constitutive 
of  Machiavelli’s  political  thought  and  activity.  He 
directed  most  of  his  energy,  both  in  practice  and  in 
theory,  to  “arm  the  people”  for  the  independence 
of  the  republic. 

Machiavelli’s  Life  and  Works 

In  June  1498,  during  a  period  of  heavy  political 
conflicts  caused  by  a  major  financial  and  institu¬ 
tional  crisis,  Machiavelli,  then  29  years  old,  was 
elected  head  of  the  Second  Chancery  of  the  Floren¬ 
tine  government,  in  charge  of  administering  the 
city’s  territories;  he  was  soon  after  nominated  to 
serve  the  office  for  military  affairs.  Involved  for  the 
next  14  years  in  numerous  diplomatic  missions  in 
Italy  and  abroad,  he  nonetheless  dedicated  most  of 
his  time  to  territorial  affairs  and  to  the  organiza¬ 
tion  of  the  militia.  The  institution  of  the  militia, 
enacted  in  December  1506,  drew  strong  opposi¬ 
tion  from  the  Florentine  aristocracy.  The  militia’s 
major  achievement  was  the  defeat  of  the  pro¬ 
tracted  Pisan  rebellion  (June  1509).  Its  major  fail¬ 
ure  was  its  inefficiency  in  the  face  of  the  Spanish 
armies  at  Prato  (August  1512).  But  internal  con¬ 
flicts  within  the  Florentine  republic  and  its  inca¬ 
pacity  to  grant  forms  of  citizenship  to  its  territorial 
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subjects  had  already  prevented  the  consolidation 
and  development  of  the  militia  in  Tuscany. 
Machiavelli  emphasizes  in  the  Art  of  War  (pub¬ 
lished  in  1521  and  the  most  neutral  among  his 
major  works)  that  his  concept  of  the  people  in 
arms  harbored  broader  ambitions  than  he  could 
actualize  or  even  express.  The  fall  of  Prato  led  to 
the  overthrow  of  the  republican  regime  and  to  the 
return  of  the  Medici  to  Florence.  In  October  1512, 
Machiavelli  attempted  in  vain  to  convince  the 
Medici  to  build  an  alliance  with  the  people  against 
the  aristocracy  (A  Caution  to  the  Medici ).  In 
November  1512,  he  was  one  of  the  few  members 
of  the  Chancery  dismissed  from  his  functions.  A 
few  months  later,  imprisoned  under  the  accusation 
of  conspiracy,  he  was  tortured,  but  he  confessed 
nothing  and  ultimately  benefited  from  the  amnesty 
that  followed  the  election  of  Giovanni  de  Medici 
as  Pope  (Leo  X). 

Over  the  next  years,  Machiavelli  seemed  to 
play  a  kind  of  double  game.  In  attempting  to  get 
closer  to  the  Medici,  he  dedicated  to  one  of  them 
the  manuscript  of  his  Prince,  apparently  a  satire  of 
the  literature  directed  at  the  moral  instruction  of 
princes,  in  which  he  maintained  the  central  idea 
expressed  in  his  Caution  to  the  Medici.  But  he  also 
frequented  those  who  developed,  under  his  teach¬ 
ing,  a  radical  resistance  to  the  new  tyrants  of 
Florence.  His  Discourses  on  Livy,  composed  of 
materials  assembled  over  20  years,  conserves  part 
of  this  teaching.  Published  posthumously  in  1531, 
it  appeared  to  be  dedicated  to  one  of  these  radical 
opponents.  The  attitudes  of  those  opponents  and 
of  the  Medici  themselves  toward  Machiavelli, 
even  after  the  failed  anti-Medicean  conspiracy  of 
1522  in  which  he  was  not  directly  involved,  indi¬ 
cated  that  neither  ever  doubted  Machiavelli’s  con¬ 
tinuous  attachment  to  the  former  republic.  In 
1521,  in  his  Discourse  on  Reforming  the  State  of 
Florence,  Machiavelli  had  proposed  to  Leo  X  a 
constitutional  reform,  at  that  time  considered 
eccentric,  by  which  the  dissolution  of  the  Medici 
regime  would  be  followed  by  the  reconstruction  of 
a  popular  state.  Machiavelli  was  then  commis¬ 
sioned  by  Cardinal  Giulio  de’  Medici — later  Pope 
Clement  VII — to  write  the  Florentine  Histories, 
presented  in  1525  and  published  posthumously 
(1532).  This  book  draws  a  ferocious  portrait  of 
the  rulers  of  late  medieval  Florence,  the  Medici 
included,  in  accord  with  Machiavelli’s  political 


theory.  In  May  1527,  with  the  Sack  of  Rome,  the 
Medici  lost  Florence  and  the  republic  was  restored. 
Machiavelli  was  not  recalled  to  his  old  functions, 
as  he  and  his  republican  friends  had  wanted.  He 
died  soon  after.  Nonetheless,  his  idea  that  the 
militia  could  strengthen  solidarity  among  the 
people  and  also  serve  as  a  bit  in  the  mouth  of 
the  aristocracy  quickly  became  the  official  decla¬ 
ration  of  the  last  Florentine  republic.  After  the 
republic’s  fall  in  1530,  Clement  VII  authorized  the 
publication  of  Machiavelli’s  unpublished  works. 
It  is  said  that,  while  reading  The  Prince  (published 
in  1532),  he  laughed  about  ideas  of  Machiavelli 
that  were  so  contrary  to  the  building  of  the 
Medicean  hegemony. 

Machiavelli’s  ensuing  reputation  followed  along 
lines  already  established  during  his  lifetime,  even 
before  he  had  written  The  Prince  (1513-1514). 
While  organizing  the  militia  in  1505  to  1506, 
Machiavelli  was  accused  by  some  members  of  the 
Florentine  aristocracy  of  being  a  lower-class  scoun¬ 
drel  and  someone  who  wished  to  establish  a  tyr¬ 
anny  in  Florence.  But  he  was  also  seen  as  a  new 
Moses,  an  emancipator  of  the  people,  and  a  phi¬ 
losopher  who  was  not  for  fools  to  understand. 
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Maoism 


Maoism  is  a  body  of  doctrine  developed  by  Mao 
Zedong  (formerly  transliterated  as  “Mao  Tse-Tung,” 
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1893-1976)  and  his  associates  in  the  Chinese 
Communist  Party  (CCP)  for  Chinese  revolution 
and  socialist  construction  from  the  1920s  until 
Mao’s  death  in  1976.  It  is  composed  of  many  dif¬ 
ferent  kinds  of  ideas  and  ideology,  and  strategy 
and  tactics  and  is  believed  to  be  the  creative  result 
of  applying  Marxism-Leninism  to  China,  a  semi- 
feudal  and  semicolonial  country  without  modern 
industrial  developments.  Mao  Zedong  is  the  prin¬ 
cipal  Chinese  Marxist  theorist,  a  Communist 
statesman  who  contributed  to  the  founding  of  the 
CCP  in  1921,  the  Communist  Army  in  1927,  and 
the  People’s  Republic  of  China  (PRC)  in  1949 
after  a  long  period  of  military  struggle.  In  contrast 
with  Karl  Marx  spending  his  most  of  life  in  read¬ 
ing  and  writing  as  an  editor,  reporter,  and  scholar, 
Mao  had  taken  part  in  almost  all  military  strug¬ 
gles,  political  conflicts,  and  social  movements  dur¬ 
ing  his  time.  He  emerged  as  a  supreme  leader  in 
1935  in  the  CCP  because  his  smart  ideas  and  suc¬ 
cessful  tactics  were  adopted  by  most  of  the  CCP 
leaders.  He  was  chairman  of  the  PRC  from  1949 
to  1959  and  chairman  of  the  CCP  until  his  death 
in  1976. 

After  a  bloody  split  in  April  1927,  Chiang 
Kai-shek  of  the  Kuomintang,  or  Nationalist  Party, 
dismantled  the  united  front  with  the  Communist 
Party  against  the  warlord  government  in  Beijing 
and  broke  with  his  allies  in  the  Communist  Party. 
In  the  following  campaigns,  many  communist 
organizations  were  destroyed,  and  a  large  number 
of  communist  leaders  and  members  killed.  Mao 
led  several  peasant  uprisings  in  Hunan  and  Jiangxi 
provinces  and  established  the  communist  military 
bases  during  the  early  1930s.  Based  on  these  expe¬ 
riences,  Mao  realized  the  importance  of  Chinese 
peasantry  in  the  Chinese  communist  revolution.  In 
October  1934,  Mao  and  the  communists  retreated 
from  Jiangxi  under  the  strong  military  attack  by 
Chiang’s  Nationalist  Army  and  started  their  epic 
Long  March  to  the  new  base  in  Shanxi  province. 
By  making  use  of  the  second  united  front  with  the 
Nationalist  Party  against  Japan,  Mao  and  his 
communist  comrades  had  not  only  consolidated 
their  base  and  expanded  their  sphere  of  influence 
but  also  formulated  a  set  of  ideology  and  method¬ 
ology  for  Chinese  revolution,  which  was  officially 
described  as  Maoism  in  1945.  In  a  new  civil  war 
between  the  Communists  and  Nationalists  from 
1946  to  1949,  the  Nationalist  government  ruled 


by  Chiang  was  defeated  and  retreated  to  Taiwan. 
As  a  result,  the  Communist  Party  ascended  to 
power  and  Mao  Zedong  declared  the  founding  of 
the  PRC  in  Beijing  on  October  1,  1949. 

According  to  the  logic  of  Marxist  socialism, 
socialism  based  on  it  would  succeed  only  after  the 
capitalist  commodity  economy  developed  fully  and 
highly.  After  the  communist  revolution  succeeded 
and  the  Communist  Party  came  to  power  in  1949 
in  China,  Mao  Zedong  and  the  communist  leaders 
decided  to  implement  and  promote  the  socialist 
transformation  of  Chinese  economy  and  society, 
ignoring  the  basic  facts  about  underdevelopment 
and  dreaming  that  their  country  would  immedi¬ 
ately  leap  forward  into  communism.  Therefore,  to 
build  socialism  and  realize  communism  as  soon  as 
possible,  they  conducted  radical  and  drastic 
reforms  in  socialist  practice  in  the  light  of  Marxist 
theory,  especially  Stalinist  theory.  They  tried  to 
eliminate  private  ownership  and  establish  public 
ownership,  reduced  free  competition,  developed 
the  planned  economy,  limited  the  role  of  capital  in 
distribution  of  resources,  and  promoted  egalitari¬ 
anism  to  eradicate  economic  exploitation  and  sup¬ 
pression.  To  promote  the  socialist  transformation 
and  build  a  new  socialist  country,  the  Communist 
Party  must  always  keep  the  power  in  the  hand  of 
the  proletarian  class,  take  the  class  struggle  as 
basic  line,  and  consolidate  the  central  leadership. 
After  8  years  of  socialist  transformation,  Mao 
Zedong  declared  confidently  in  1957  that  China 
was  building  socialism,  that  the  petty  bourgeoisie 
in  agriculture  and  handicrafts  and  the  bourgeoisie 
in  industry  and  commerce  had  both  experienced 
changes,  that  the  individual  economy  had  been 
transformed  into  a  collective  economy,  and  that 
capitalist  private  ownership  had  been  transformed 
into  socialist  public  ownership. 

In  search  of  its  own  socialist  model  after  Stalin 
was  criticized  on  the  20th  Soviet  Communist  Party 
Congress  in  February  1956,  China  under  Mao 
Zedong  adopted  many  radical  policies  and 
launched  mass  movements  one  after  another.  The 
most  important  were  the  Great  Leap  Forward 
(GLF,  1958-1960)  and  the  Cultural  Revolution 
(1966-1976).  During  the  period  of  the  GLF,  in 
rural  areas  of  China,  750,000  higher-stage  coop¬ 
eratives  were  merged  into  25,000  people’s  com¬ 
munes  with  multifunction  to  transform  the  private 
farmers  into  collective  peasants  and  to  pursue 
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egalitarian  ideals.  A  huge  amount  of  labor  power 
had  been  mobilized  to  build  the  famous  backyard 
steel  furnaces  and  irrigation  works.  China’s  lead¬ 
ers  were  eager  to  catch  up  with  Britain,  and  even¬ 
tually  with  the  United  States,  and,  at  least  for  a 
brief  moment,  were  willing  to  believe  that  utopian 
methods  worked  and  would  produce  more  steel 
and  food.  The  GLF  caused  a  great  deal  of  waste  in 
the  development  of  socialist  economy.  Also, 
because  this  policy  emphasized  accumulation 
rather  than  consumption,  the  meager  development 
of  heavy  industry  came  at  the  huge  cost  of  little  or 
no  improvement  in  people’s  quality  of  life.  The 
GLF  caused  an  immense  famine,  and  the  death  rate 
rose  from  18.12  per  thousand  in  1957  to  44.60  per 
thousand  in  1960 — the  consequence  not  only  of 
declining  harvests  but  also  of  excessive  requisition¬ 
ing  of  grain,  based  on  false  reports  that  far  more 
grain  had  been  produced  than  was  actually  the 
case.  Many  years  later,  official  sources  admitted 
that  8  million  people  had  died  of  causes  related  to 
the  GLF;  unofficial  sources  estimated  the  figure  at 
between  12  million  and  20  million. 

During  the  period  of  the  Cultural  Revolution, 
under  the  banner  of  a  proletariat  revolution,  Mao 
took  the  class  struggle  as  fundamental.  He  believed 
that  the  main  goal  of  the  Cultural  Revolution  was 
to  save  Chinese  socialism  from  the  threat  of  “revi¬ 
sionism”  by  purging  his  lieutenants  who,  as  in  the 
case  of  Liu  Shaoqi  and  Deng  Xiaoping,  attached 
greater  importance  to  economic  efficiency  than  to 
ideological  purity.  The  Cultural  Revolution  placed 
emphasis  on  purifying  the  superstructure  rather 
than  changing  the  economic  conditions,  because 
Maoists  believed  that  purification  would  give  a 
push  to  economic  development  in  which  the 
bureaucrats  would  engage  directly  and  to  which 
the  masses,  imbued  with  revolutionary  ideals, 
would  exert  themselves  on  behalf  of  collective 
undertakings.  Meanwhile,  with  the  ideal  of  egali¬ 
tarianism,  a  leftist  “wind”  swept  through  the 
country,  resulting  in  the  curtailing  of  private  land- 
holding,  a  free  market,  and  other  personal  rights. 
As  a  result,  however,  Chinese  society  in  1976  was 
neither  more  efficient  nor  more  equitable. 

Although  Maoism  contains  many  different  and 
sometime  contradictory  elements  in  the  different 
stages  of  Chinese  revolution  and  socialist  construc¬ 
tion,  several  salient  features  can  be  found  in  Mao’s 
works  and  experiences.  Primary  is  his  emphasis  on 


the  importance  of  peasant  issues  in  Chinese  revolu¬ 
tion  and  socialist  construction.  The  Marxist- 
Leninist  tradition  treated  peasants  as  incapable  of 
revolutionary  initiative  and  only  marginally  useful 
in  backing  urban  proletarian  revolution.  Based  on 
his  life  experiences  and  his  analysis  of  the  rural 
situation  in  China,  Mao  came  to  recognize  the 
potential  power  of  China’s  hundreds  of  millions  of 
peasants  and  decided  to  establish  his  base  in  rural 
areas  instead  of  in  big  cities.  The  peasants  consti¬ 
tuted  the  vast  majority  of  China’s  population,  and 
most  of  them  were  the  most  hard-pressed  of  all 
citizens  and  lived  in  extreme  poverty.  According  to 
Mao,  they  were  very  receptive  to  revolutionary 
agitation  and  could  become  a  revolutionary  force 
if  fully  mobilized  and  properly  guided.  Proceeding 
from  this  belief,  Mao  proposed  to  instill  in  them  a 
revolutionary  consciousness  and  make  their  force 
alone  suffice  for  revolution.  By  so  doing,  Mao  led 
the  Chinese  revolution  to  success  and  gradually 
formed  a  special  attachment  to  peasants.  During 
the  Cultural  Revolution,  Mao  sent  many  city 
workers,  intellectuals,  and  bureaucrats  to  rural 
areas  and  forced  them  to  receive  reeducation 
through  agricultural  labor  together  with  peasants 
because  he  believed  that  big  cities  were  a  corrupt¬ 
ing  influence  for  many. 

In  Mao  Zedong’s  thinking,  there  were  long¬ 
standing  populist  elements  and  continuous  empha¬ 
sis  on  the  “mass  line.”  The  mass  line  means  the 
emphasis  on  the  interests  and  preferences  of  the 
common  people  and  demands  that  the  government 
be  responsive  to  them.  It  was  first  created  in  the 
revolutionary  period,  based  on  the  idea  of  class 
struggle.  The  idea  of  the  disadvantaged  classes 
overthrowing  the  privileged  classes  through  inter¬ 
class  struggle  naturally  entailed  the  idea  of  mobi¬ 
lizing  the  oppressed  masses  to  fight  for  their  own 
interests  against  those  of  the  oppressing  classes. 
Thus,  the  notion  of  class  struggle  led  to  Mao 
Zedong’s  stress  on  the  importance  of  the  masses 
and  mass  movements  and  to  what  Mao  explicitly 
labeled  as  the  “mass  line”  in  the  Yan’an  period 
(late  1930s  and  early  1940s)  and  extolled  as  one  of 
the  vital  traditions  in  Communist  history.  There¬ 
fore,  it  is  common  to  assert  that  the  mass  line  is  a 
style  of  leadership,  which  is,  at  its  best,  a  demo¬ 
cratic  style  of  leadership.  Mao  believed  that  even 
the  vanguard  party  needed  to  be  rectified  and 
reformed  through  criticism  from  the  people  it  led 
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and  that  the  masses  of  China  should  be  encour¬ 
aged  to  become  involved  in  even  the  highest  affairs 
of  state.  In  the  Cultural  Revolution  of  1966  to 
1976,  the  masses  had  been  mobilized  so  broadly 
and  deeply  that  the  country  verged  on  anarchy. 

In  spite  of  political  disorder  and  economic 
depression  in  China,  Maoism  developed  into  a 
worldwide  movement  in  the  1960s  and  thereafter. 
All  Maoists  expressed  fidelity  to  the  thought  of 
Mao  Zedong.  But  at  a  practical  level,  self-identified 
Maoist  political  formations  differed  considerably. 
In  parts  of  Asia  where  conditions  were  similar  to 
those  that  prevailed  in  China  before  1949,  Maoism 
was  largely  a  peasant  movement,  engaging  in  guer¬ 
rilla  warfare  and  establishing  bases  in  rural  areas, 
and,  if  successful,  surrounding  the  cities  and  seizing 
state  power.  Elsewhere  in  the  Third  World,  espe¬ 
cially  in  Latin  America,  facing  very  different  condi¬ 
tions,  Maoists  had  to  modify  classical  Maoist 
forms  of  revolutionary  struggle.  In  the  developed 
capitalist  countries,  Maoism  meant  something  very 
different.  Western  Maoism  was  particularly  attrac¬ 
tive  to  young  people  during  the  1960s  only  for  its 
ostensible  purity  and  populist  nature.  Although 
Maoism  has  been  upheld  as  one  of  the  major  guid¬ 
ing  principles  in  the  CCP  and  other  left-wing  par¬ 
ties,  it  has  gradually  lost  its  appeal  in  China  and 
other  parts  of  the  world  since  China  adopted  its 
new  “Opening  and  Reform”  policy  in  1978. 

Dingping  Guo 
Fudan  University 
Shanghai,  China 
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Market  Economy 


In  the  1940s  and  1950s,  two  types  of  economy 
were  frequently  contrasted:  the  market  economy 
(sometimes  also  called  the  “free  market”  econ¬ 
omy)  and  the  centrally  planned  economy  such  as 
that  of  the  former  Soviet  Union.  Whereas  in  the 
latter,  prices  and  production  decisions  were  set  by 
administrative  decision,  in  the  former,  they 
emerged  as  a  result  of  the  interactions  of  myriad 
firms  and  individuals  who  negotiate,  buy,  sell,  and 
set  prices  for  goods  and  services. 

However,  since  the  1960s,  political  science  has 
developed  a  more  sophisticated  analysis  than  this 
simple  dichotomy.  It  has  perhaps  returned  to 
political  economy  traditions  of  Adam  Smith  and 
David  Ricardo,  with  concerns  about  how  institu¬ 
tions  and  norms  create  markets  and  indeed  lead  to 
different  types  of  market  economy.  The  starting 
point  of  analysis  is  the  argument  that  markets  are 
socially  created,  through  formal  institutions  such 
as  law,  and  informal  institutions  that  govern 
exchanges,  such  as  norms  and  trust,  so  that  mar¬ 
kets  are  in  fact  embedded  in  societies  and  marked 
by  historical  developments.  Given  this,  the  inter¬ 
esting  questions  become:  Which  kind  of  market 
economies  exist?  Why?  How  do  they  change  and 
develop?  What  are  the  effects  of  different  kinds  of 
market  economy? 

Debates  about  these  questions  are  frequently 
linked  to  broader  normative  questions  about  the 
distribution  of  power  and  wealth  and  indeed  issues 
about  democracy.  At  the  same  time,  in  practice, 
they  have  (with  some  delay)  been  influenced  by 
events  in  markets  and  governance.  Hence  “market 
economy”  is  not  only  a  complex  concept  but  one 
intimately  linked  with  both  broader  questions  in 
political  science  and  the  evolution  of  markets  in 
practice. 

Debates  in  the  1960s  and  1970s 

Although  many  debates  in  the  1960s  and  1970s 
were  dominated  by  Marxism-pluralism  conflicts, 
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at  least  two  sets  of  analyses  were  undertaken  that 
remain  relevant  today.  The  first  concerned  prob¬ 
lems  with  “free  markets”  and  was  chiefly  based  in 
the  United  States.  Political  scientists  began  to  study 
the  effects  of  free  markets  on  the  distribution  of 
power,  both  among  social  groups  and  classes  and 
in  the  political  arena.  They  worried  that  the  power 
of  business,  especially  large  firms,  was  excessive. 
Business  enjoyed  not  only  great  lobbying  power 
but  also  systemic  power:  Politicians  needed  a 
well-functioning  economy  and,  even  without  orga¬ 
nized  pressure,  they  would  seek  to  protect  business 
interests.  The  dangers  of  “capture”  of  regulators, 
especially  by  large  concentrated  suppliers,  was 
underlined,  as  these  firms  could  lobby  and  through 
votes  and/or  money  could  “buy”  regulation  that 
would  distort  the  free  market  through  entry 
restrictions  and  other  protective  measures  that 
would  allow  them  to  earn  rents. 

Meanwhile,  in  Europe,  the  stark  division 
between  planned  and  market  economies  became 
blurred  as  political  scientists  mapped  out  both  dif¬ 
ferent  forms  of  capitalism  and  also  the  ways  in 
which  governments  structured  markets.  Thus,  for 
instance,  British  and  French  state  roles  in  markets 
differed — the  latter’s  ability  to  plan  and  intervene 
on  matters  from  investment  to  prices  was  con¬ 
trasted  with  the  former’s  more  limited,  arm’s- 
length  role.  Thereafter,  work  on  corporatism 
examined  the  formal  and  informal  linkages 
between  governments,  firms,  and  organized  labor 
that  were  particularly  strong  in  northern  European 
countries  and  used  to  decide  matters  such  as  wage 
increases,  the  division  of  growth  between  capital 
and  labor,  or  public  spending.  Corporatism  was 
linked  to  the  capacity  of  certain  states  to  pursue 
high  economic  growth  and  a  large  welfare  state.  It 
was  often  heralded  as  a  means  of  cooperation  and 
negotiation  between  different  interests  that  allowed 
them  to  shape  and  control  markets. 

Regulatory  Reform  and  the 

Continuing  Role  of  the  State 

Just  as  political  scientists  were  discovering  non- 
market  forms  of  control  and  the  power  of  large 
firms,  a  major  turn  toward  “the  market”  took 
place  both  in  academic  study  and  in  policy  making. 
Political  science  “rediscovered”  the  market,  as  it 
analyzed  worldwide  moves  toward  “regulatory 


reform.”  Many  state-owned  strategic  industries 
were  sold,  such  as  network  industries,  banking, 
extraction  industries,  and  suppliers  in  public  own¬ 
ership  from  the  19th  century.  Privatization  was 
frequently  accompanied  by  liberalization  as,  con¬ 
trary  to  the  predictions  of  the  capture  theorists, 
incumbent  suppliers  failed  to  prevent  the  ending  of 
their  legal  monopolies.  Instead,  competition  was 
permitted  and  indeed  encouraged  even  in  indus¬ 
tries  previously  seen  as  “natural  monopolies,”  such 
as  energy,  telecommunications,  and  postal  services. 

The  changes  gave  rise  to  vigorous  debates.  One 
concerned  explanations  for  these  “market 
reforms.”  One  argument  was  that  they  were  driven 
by  technological  and  economic  developments, 
which  made  state  ownership  and  monopolies 
untenable.  However,  such  an  explanation  lacked 
political  mechanisms  and  suffered  from  problems 
similar  to  those  associated  with  the  technological 
determinism  of  the  1950s  and  1960s.  Another 
framework  of  analysis  underlined  the  interests  of 
key  actors — elected  politicians  in  the  proceeds  of 
privatization,  managers  able  to  enjoy  greater  free¬ 
dom  and  rewards,  consultants,  and  lawyers — in 
promoting  change.  However,  the  spread  of  liberal 
ideas  has  been  another  argument,  as  changes 
needed  to  be  accepted  and  legitimated.  Finally, 
however,  the  reforms  can  be  seen  as  part  of  a  long 
cycle  of  reforms  as  the  pendulum  of  policy  swings 
between  “state”  and  “market,”  due  to  endogenous 
developments  that  undermine  each.  Hence,  the 
post-1945  move  to  “collectivism”  had  engendered 
inefficiencies  and  dissatisfactions  and  thus  pres¬ 
sures  for  more  individualism. 

However,  the  extent  and  nature  of  the  changes 
were  also  contested.  “Market-oriented”  reforms 
were  accompanied  by  the  growth  of  organizations 
and  rules  governing  competition.  Governments  cre¬ 
ated  a  series  of  economic  regulators,  often  in  the 
form  of  IRAs  (independent  regulatory  authorities) — 
both  sectoral  regulators  and  general  competition 
authorities — and  delegated  powers  to  them.  In 
addition,  “social  regulation”  (and  regulators)  con¬ 
cerning  matters  such  as  health  and  safety  grew. 
Moreover,  supranational  regulation  expanded, 
notably  by  the  European  Union  and  World  Trade 
Organization.  As  a  result,  although  U.S.  scholars 
dubbed  it  “deregulation,”  in  fact,  reform  involved 
the  creation  of  a  “regulatory  state,”  as  increased 
competition  went  hand  in  hand  with  reregulation. 
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Globalization  and  Varieties  of  Capitalism 

Liberalization  was  accompanied  by  startling 
increases  in  cross-border  capital  flows.  Political 
scientists  responded  by  looking  at  their  effects  on 
national  economic  policies  and  institutions.  Some 
argued  that  these  flows  of  capital  reduced  the 
power  and  autonomy  of  nation-states.  Strong  glo- 
balizationists  indeed  claimed  that  to  attract  “foot¬ 
loose  capital,”  and  driven  by  economic  efficiency, 
all  countries  would  have  to  follow  a  neoliberal 
path  toward  a  market  economy  with  a  smaller 
state,  less  powerful  trade  unions,  and  greater 
attention  to  the  needs  of  international  firms. 

However,  these  claims  were  vigorously  attacked 
by  the  “varieties  of  capitalism”  literature,  which 
pointed  out  that  there  were  several  different  possi¬ 
ble  forms  of  markets  and  indeed  of  capitalism.  It 
argues  that  market  players,  such  as  managers,  inves¬ 
tors,  and  workers,  face  coordination  problems, 
which  can  be  resolved  through  either  competition 
or  cooperation.  Using  the  notion  of  institutional 
complementarities,  whereby  institutional  arrange¬ 
ments  in  different  spheres  are  linked,  it  claims  that 
there  are  two  models  of  a  market  economy — liberal 
market  economies  and  coordinated  market  econo¬ 
mies.  Other  work  on  models  of  capitalism 
has  added  further  types,  such  as  statist  market 
economies,  based  on  a  leading  role  for  the  public 
sector.  The  literature  on  varieties  of  capitalism 
claims  that  a  diversity  of  market  economies  can 
exist  and  indeed  flourish  in  the  face  of  globaliza¬ 
tion,  as  each  has  its  distinct  strengths  and  weak¬ 
nesses  and  there  is  no  single,  most  efficient  market 
economy. 

Current  Debates  and  Trends 

Much  political  science  work  on  “the  market”  has 
assumed  that  market  institutions  are  largely  static. 
Current  debates  largely  center  on  changes  in  mar¬ 
ket  institutions.  One  debate  concerns  the  speed 
and  extent  of  change.  Whereas  the  varieties-of- 
capitalism  literature,  which  was  closely  linked  to 
historical  institutionalism,  largely  claimed  or 
assumed  that  national  market  economies  remained 
stable,  more  recent  work  has  suggested  that  major 
change  does  occur  but  incrementally  and  by  build¬ 
ing  on  existing  institutions  so  that  countries  move 
toward  liberalism  while  maintaining  their  distinc¬ 
tive  heritages. 


A  further  contribution  concerns  the  role  of 
international  factors  within  domestic  settings  and 
their  role  in  the  spread  of  “liberal  market”  institu¬ 
tions  such  as  privatization,  liberalization,  and 
independent  regulatory  agencies.  One  argument  is 
that  change  has  taken  place  through  diffusion 
rather  than  rational  choice  and  calculation.  Owing 
to  mechanisms  of  coercion  and  emulation,  liberal 
market  institutions  have  spread  regardless  of  their 
efficacy.  Another  is  that  certain  forms  of  interna¬ 
tionalization  lead  to  change  in  market  institutions 
whereas  others  do  not.  Thus,  for  instance,  transna¬ 
tional  technological  and  economic  developments 
may  create  pressures  for  change,  but  these  can  be 
resisted,  whereas  other  international  forces  such  as 
overseas  reforms  or  supranational  regulation  may 
lead  to  the  spread  of  “liberal”  institutions  because 
of  their  impact  on  policy  making,  notably  legiti¬ 
mating  change.  A  further  line  of  development  is  to 
look  at  relationships  as  much  as  at  formal  institu¬ 
tions  and  hence  to  see  whether,  even  if  the  latter 
change,  enduring  patterns  of  behavior  remain  that 
differ  sharply  across  types  of  economy.  Hence,  the 
claim  is  that  even  if  nations  adopt  similar  “liberal” 
market  institutions,  these  operate  in  very  diverse 
ways  due  to  informal  institutions. 

To  many,  the  recent  financial  and  economic 
crisis  offers  challenges  to  the  supposed  superiority 
of  competition  and  hence  to  many  normative 
underpinnings  of  the  market  economy.  However, 
at  least  most  political  science  work  does  not 
assume  that  only  one  type  of  market  economy 
exists  and  does  not  neglect  the  crucial  roles  of  the 
institutions,  relationships,  and  the  state  in  shaping 
market  economies. 
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Market  Failure 


Market  failure  can  be  seen  along  a  continuum. 
There  is  a  wider  and  a  narrower  definition:  the 
wider  and  imprecise  one  implying  lack  of  output  or 
even  utter  skepticism  concerning  the  performance 
and  functioning  of  markets  and  the  narrower  one 
limiting  the  term  to  deficiencies  of  the  market.  In 
this  entry,  the  description  of  market  failure  and  its 
basic  model  is  followed  by  a  discussion  of  market 
failures  resulting  from  a  wide  variety  of  factors 
including  asymmetrical  information,  monopolies, 
and  external  effects.  Next,  the  entry  examines  pub¬ 
lic  goods  deficiencies  as  the  narrower  category  of 
market  failure.  The  last  section  discusses  the  distinc¬ 
tion  between  market  failure  and  economic  crises. 

With  the  worldwide  financial  and  economic 
crisis  that  began  in  2008,  market  failure  has 
become  a  renewed  focus  of  public  discussion.  Yet 
there  is  no  need  to  let  the  state  do  what  the  market 
can  do  better — for  example,  organize  free  exchange 
relationships  and  produce  consumer  goods.  Time- 
dependent  public  opinion  creates  shifts  to  and 
from  more  advocacy  of  state  economic  activities  in 
creating  an  extended  infrastructure  for  transport 
and  goods  such  as  public  health  and  education 
and,  nowadays,  a  heavy  control  and  regulation  of 
the  financial  sector.  Irrespective  of  such  tendencies, 
the  “golden  path” — apart  from  producing  the 
“social  goods  proper”  (see  below) — is  a  combina¬ 
tion  of  an  effective  judicial  infrastructure  respect¬ 
ing  individual  rights  and  property  rights,  market 
competition,  the  supervision  of  free  access  to  mar¬ 
kets,  and  avoidance  of  misuses  by  monopolies. 

Market  failure  denotes  situations  where  the 
result  of  market  transactions  is  not  equivalent  to  a 
Pareto  optimal  allocation.  With  such  an  allocation, 
no  individual  position  could  be  improved  while 


diminishing  that  of  another  person.  Market  failure 
thus  implies  a  nonoptimal  use  of  scarce  resources. 
In  neoclassical  theorizing,  which  focuses  on  alloca¬ 
tion  issues  rather  than  issues  of  distribution,  mar¬ 
ket  failure  as  a  pure  deficit  in  allocation  stems  from 
factors  such  as  lack  of  information,  external  effects, 
market  power,  or  the  character  of  public  goods, 
also  called  collective  goods. 

A  basic  causal  model  of  market  failure  is  pre¬ 
sented  in  Figure  1.  Market  power  is  a  background 
factor  to  informational  asymmetry  (as  are  other  fac¬ 
tors  omitted  here).  Market  power  has  a  direct  effect 
on  market  failure  and  an  indirect  one  via  external 
effects.  As  argued  by  Richard  Musgrave  (1959),  the 
failure  to  produce  public  goods  is  the  key  compo¬ 
nent  of  market  failure.  The  first  three  factors  are 
instances  of  market  deficiencies  that  can  be  healed  to 
some  extent  by  appropriate  institutional  devices. 
Public  goods,  by  definition,  cannot  be  created  in  a 
market  since  citizens  cannot  be  excluded  from  con¬ 
sumption  and  are  thus  not  willing  to  pay,  therefore 
private  production  fails  to  emerge.  One  could  also 
argue  that  the  lack  of  public  goods  production  is  a 
cause  for  market  failure  just  as  much  as  a  conse¬ 
quence.  We  leave  this  debate  unresolved  here,  with 
just  a  line  between  the  two  variables  to  indicate  the 
different  nature  of  the  variable  “public  goods”  com¬ 
pared  with  the  other  three  causal  variables,  which,  in 
turn,  could  also  be  consequences  of  market  failure. 

Public  goods,  just  as  much  as  “public  bads,”  are 
characterized  by  nonrivalry  in  consumption  and 
by  the  lack  of  excludability — that  is,  the  ability  to 
exclude  others  from  consuming  a  good  (e.g.,  secu¬ 
rity  of  a  country).  Partially,  this  is  a  case  of  positive 
externality.  Free  riding  is  tempting  when  potential 
consumers  cannot  be  excluded  from  consuming 
the  good  without  paying.  A  lack  of  effective 
demand  is  the  result.  Here  the  state  can  organize 
the  provision  of  such  public  goods,  yet  often  with¬ 
out  sufficient  information  as  to  the  adequate 
amount  of  such  a  good.  Gathering  such  informa¬ 
tion  is  costly,  however.  Also  the  limitation  of  pub¬ 
lic  goods  would  in  itself  be  Pareto  inefficient. 
Neoclassical  economists  argue  that  the  state  should 
interfere  only  in  cases  of  market  failure  but  not 
beyond  and  not  for  political  goals. 

Following  the  specifications  of  Musgrave  and 
thus  opting  for  a  more  limited  definition  of  market 
failure,  allocation  problems  in  a  market  economy 
can  be  either  market  imperfections  (market  power 
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Figure  I  A  Basic  Causal  Model  of  Market  Failure 

and  external  costs/effects)  or  market  failure  (insuf¬ 
ficient  supply  of  public  goods). 

Usually,  market  failure  is  limited  to  a  specific 
market.  Market  failure  calls  for  state  intervention. 
The  counterposition  is  that  state  interventionism 
could  lead  to  state  failure.  With  the  state  increas¬ 
ingly  being  engaged  in  the  economy,  information 
scarcity  increases  due  to  the  absence  of  fungible 
markets.  Individuals  might  be  treated  as  if  they  all 
had  the  same  preferences.  The  greater  the  differ¬ 
ences  in  their  preferences  are,  the  greater  the  welfare 
losses  become.  Citizens  might  be  confronted  with 
package  solutions  they  would  not  agree  to  in  their 
entirety,  or  coalitions  may  go  for  solutions  other 
than  those  initially  proclaimed  by  individual  parties. 

Consequently,  the  Austrian  school  of  economics 
(e.g.,  Friedrich  Hayek,  1944;  Murray  Rothbard, 
1962)  even  denies  such  a  thing  as  market  failure, 
since  markets  are  instruments  of  individual  discov¬ 
ery  with  no  preassigned  collective  goals.  According 
to  Hayek,  the  market  is  a  context  of  discovery, 
whereas  planned  state  economies  never  make  dis¬ 
coveries  stemming  from  the  spontaneous  coordi¬ 
nation  of  individual  actors  in  a  market.  Yet  on 
many  occasions,  Hayek  does  not  question  the 
standard  theory  of  public  goods. 

Instances  of  Market  Failure 

Among  the  factors  that  may  bring  about  market 
failure  are  the  ones  related  to  asymmetrical  infor¬ 
mation,  monopolies,  and  external  effects. 


Asymmetrical  Information 

Asymmetrical  information  occurs  when  the 
potential  contract  partners  do  not  have  equal 
information  about  the  supplied  goods,  services,  or 
risks  to  be  insured.  George  Akerlof  (1970)  argues 
that  uncertainty  about  the  quality  of  goods  on  the 
part  of  consumers  drives  out  good  products  (e.g., 
cars  just  as  much  as  hard  currencies  in  the  case  of 
Gresham’s  law)  and  leaves  only  the  bad  ones  on 
the  market.  This  “lemon”  principle  leaves  only  the 
poorest  suppliers  able  to  sell  their  cars.  There  is  no 
market  clearing,  and  a  negative  selection  takes 
place. 

Guarantees  on  the  part  of  the  suppliers  or  inde¬ 
pendent  institutions  monitoring  the  quality  of 
goods  as  well  as  legal  guarantees  could  help  in 
mitigating  this  information  asymmetry.  The  disci¬ 
pline  of  new  institutional  economics  emphasizes 
the  importance  of  appropriate  incentives  from  and 
controls  by  institutions.  The  protection  of  patents 
is  another  instance  of  information  asymmetry. 
Here  the  trade-off  is  between  quickly  disseminat¬ 
ing  new  knowledge  versus  protecting  research  and 
the  economic  incentives  that  are  derived  from  par¬ 
ticular  knowledge,  at  least  for  some  time. 

In  addition  to  adverse  selection,  as  in  the 
“lemon”  cases,  asymmetrical  information  may 
also  arise  when  a  contract  between  partners  pro¬ 
vides  that  both  should  have  symmetrical  informa¬ 
tion,  but,  subsequently,  one  party  is  unable  to 
observe  the  behavior  of  the  other.  When  such  a 
situation  arises  after  contract  formation,  it  creates 
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a  moral  hazard  in  which  one  partner  may  be 
tempted  to  cheat. 

Monopolies 

Market  power  stems  from  three  factors:  unique 
technological  inventions,  the  limitation  of  compe¬ 
tition,  and  the  indivisibility  of  the  production 
apparatus.  Under  a  monopoly,  production  is  inef¬ 
ficient  (a  lower  amount  of  goods  is  turned  out  at 
higher  prices).  A  monopolist  maximizing  his  or  her 
market  power  is  selling  products  at  a  price  above 
the  marginal  costs  (Cournot  point)  and  not  where 
the  marginal  returns  equal  those  of  the  marginal 
production  costs,  as  in  a  competitive  market  (a 
higher  turnout  of  goods  at  lower  prices). 

Remedies  against  monopolistic  markets  are 
antitrust  policies  and,  in  cases  of  indivisibilities 
and  natural  monopolies,  tax  and  subvention  poli¬ 
cies.  The  indivisibility  of  the  production  apparatus 
(e.g.,  networks  for  electric  power  and  gas,  the 
canal  system  railways)  leads  to  scale  effects  and 
thus  to  lower  average  costs  but  also  to  larger  sunk 
costs,  that  is,  past  costs  that  cannot  be  recovered 
when  being  confronted  with  new  competitors.  If 
only  one  supplier  can  do  it  efficiently,  this  is  called 
a  natural  monopoly. 

If  there  is  a  Pareto  efficient  situation,  state  inter¬ 
vention  by  necessity  will  violate  that  equilibrium, 
often  for  the  sake  of  redistribution  issues  that  are 
decided  politically  through  majority  voting.  The 
median  position  of  the  voter  in  democracies  with 
an  interest  in  redistribution  from  the  rich  to  the 
poor  and  the  activities  of  lobby  groups  are  key  fac¬ 
tors  in  such  a  view.  The  underlying  factor  is  rent 
seeking — securing  incomes  above  those  in  a  free 
and  competitive  market — by  private  groups,  gov¬ 
ernment  bureaucracies,  and  politicians.  (The  latter 
two  entities  point  to  the  principal-agent  problem. ) 
Here,  “market  failure”  is  often  defined  in  the 
reverse  sense  as  an  effect  of  state  regulation  and 
interference  with  otherwise  free  markets. 

Adherents  of  public  choice  theorizing  point  to 
lack  of  causal  evidence  when  bringing  in  the  state 
to  remedy  situations  of  market  failure.  They  argue 
that  the  costs  of  state  failure  could  be  even  higher 
than  those  of  market  failure.  Market  failure  can  be 
closely  linked  to  state  failure,  not  just  in  case  of  the 
communist  collapse  but  also  in  the  market  rule 
distortions  created  through  the  networks  and 


influence  of  the  financial  sector  on  politicians  to 
loosen  rules  on  the  financial  markets.  Oliver 
Blanchard  (2009)  succinctly  lists  these  failures  of 
the  past  decade: 

The  underestimation  of  risk  contained  in  newly 
issued  assets;  the  opacity  of  the  derived  securities 
on  the  balance  sheets  of  financial  institutions;  the 
interconnection  of  financial  institutions,  both 
within  and  across  countries;  and  the  high  degree 
of  leverage  of  the  financial  system  as  a 
whole ...  all  combined  to  create  the  perfect 
(financial)  storm,  (pp.  38-39) 

External  Effects 

External  effects  occur  in  two  variants.  Negative 
external  effects  occur  where  the  economic  activi¬ 
ties  of  two  partners  cause  damage  to  a  third  one 
(e.g.,  toxic  emissions  into  the  air).  Sometimes 
positive  external  effects — benefits  (e.g.,  beautifying 
the  environment) — are  created  by  economic  con¬ 
tractors.  The  internalization  of  external  costs  is 
theoretically  possible,  for  example,  via  Coase 
negotiations.  The  Coase  theorem  states  that,  under 
clearly  defined  property  rights,  perfect  rationality 
and  no-transaction-costs  bargaining  may  occur  in 
such  a  way  that  external  effects  will  be  internalized 
by  market  participants.  For  future  generations, 
such  a  condition  is  absent,  however.  By  means  of 
the  Pigou  tax,  the  state  places  a  tax  on  the  pro¬ 
ducer  equal  to  the  external  costs.  The  latter  must 
be  known,  however,  and  again  transaction  costs 
are  not  allowed. 

Negative  external  effects  (e.g.,  environmental 
degradation)  are  the  most  often  quoted  instances 
of  market  failure.  Firms  externalize  their  costs  of 
production  onto  third  parties  (so  that  customers  of 
respective  firms  do  not  have  to  pay  directly)  or,  in 
the  widest  instance,  onto  the  global  environment, 
turning  everyone  into  a  victim. 

Positive  external  effects  occur  in  cases  where  a 
good  is  underproduced,  because  the  supplier  does 
not  receive  adequate  compensation  for  the  exter¬ 
nal  rewards  that  would  be  provided  if  full  produc¬ 
tion  capacity  were  used.  Other  examples  come 
from  the  health  sector  where  a  general  vaccination 
could  have  external  benefits  for  society  at  large, 
but  costs  for  poor  people  might  be  too  high,  thus 
creating  negative  external  effects  of  undersupply. 
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The  educational  sector  offers  another  instance,  in 
which  the  provision  of  a  good  education  nation¬ 
wide  provides  many  additional  advantages  for 
society  at  large.  The  provision  of  subventions  for 
an  increasing  demand  for  these  goods  may,  how¬ 
ever,  be  inefficient  and  create  issues  of  moral  haz¬ 
ard  (e.g.,  downgrading  one’s  own  private  learning 
efforts  since  the  education  does  not  cost  anything). 


Public  Goods  Provision 


Deficiencies  in  producing  public  goods  are  often 
listed  as  further  instances  of  market  failure.  Public 
goods,  such  as  clean  air,  are  ones  from  which  indi¬ 
viduals  cannot  be  excluded.  Therefore,  unlike 
private  goods,  public  goods  cannot  be  produced 
effectively  through  the  private  market.  Following 
the  logic  of  Musgrave  and  thus  requiring  more 
precise  argumentation,  they  comprise  the  second 
and  narrower  category  of  market  failure,  in  which 
the  market  fails  entirely.  In  the  cases  dealt  with 
thus  far,  gradual  deficiencies  of  markets  occurred. 
While  the  market  is  functioning  in  principle,  there 
is,  however,  too  much  or  too  little  of  goods  pro¬ 
duced.  But  in  the  more  basic  case  of  complete 
market  failure,  some  goods  are  not  produced  at 
all,  even  though  the  optimal  allocation  of  goods 
requires  them.  Goods  with  only  external  returns, 
those  with  returns  only  to  consumers  and  not  to 
producers,  are  heavily  undersupplied.  The  light¬ 
house  is  a  classic  example.  Although  the  light¬ 
house  provides  strong  external  benefits  for  seafar¬ 
ing,  there  is  no  individual  incentive  to  build  it.  The 
lighthouse  is  a  public  or  collective  good  that  is 
characterized  by  two  features: 

There  is  no  rivalry  in  consump¬ 
tion  and  no  excludability  (that  is, 
one  person’s  use  of  a  good  such 
as  a  lighthouse  does  not  reduce 
the  availability  of  the  good  to 
others).  Free  riding  is  a  conse-  ye~ 

quence  of  such  a  constellation.  As 
long  as  individuals  do  not  reveal  Exclusion 
how  strong  their  preferences  for 
public  goods  are,  there  will  be  no  nc 

production  of  such  goods  unless 
the  state  steps  in.  The  state 

monopoly  of  the  legitimate  use  of  _ 

force,  external  defense,  levees,  Figure  2  Exc 
and  the  judicial  institutional  Priv 


Figure  2 


framework  are  also  examples  of  public  goods 
(what  Musgrave  describes  as  “social  goods 
proper”).  According  to  James  Buchanan  and 
Gordon  Tullock  (1962),  the  real  task  of  the  state  is 
to  protect  property  rights. 

Crossing  the  two  criteria  of  rivalry  and  exclu¬ 
sion  in  Figure  2,  four  types  emerge. 

The  two  pure  types  of  goods — private  goods 
(Case  1  in  Figure  2)  and  public  goods  (Case  4  in 
Figure  2) — have  already  been  discussed.  With 
respect  to  the  mixed  goods  in  Case  2,  consumers 
can  be  excluded  by  imposing  fees  on  the  consump¬ 
tion  of  these  goods.  Even  though  there  is  little  or  no 
rivalry  in  consumption  (e.g.,  of  streets,  bridges, 
sports  events,  pipelines,  or  even  cable  TV — goods 
resembling  the  so-called  toll  goods),  there  is  a  ten¬ 
dency  for  underproduction  of  those  goods  (resulting 
in,  e.g.,  traffic  congestion).  With  decreasing  average 
production  costs,  there  is  also  an  inherent  tendency 
toward  monopoly  in  such  situations.  Supervision  of 
market  tendencies  by  independent  authorities  and/ 
or  subsidies  to  consumers  could  counteract  such 
tendencies  toward  underproduction. 

With  mixed  goods  in  Case  3,  where  rivalry  but 
no  exclusion  exists,  the  reverse  instance  of  overus¬ 
age  emerges,  as,  for  example,  in  the  tragedy  of  the 
commons,  described  by  Elinor  Ostrom.  By  defin¬ 
ing  rights  of  usage  for  natural  resources  such  free 
riding  might  be  controlled.  Other  means  of  regula¬ 
tion  could  be  mutual  controls  or  by  an  agent,  sanc¬ 
tions  when  violating  rules,  reciprocity,  and  mutual 
trust. 

The  mixed  goods  in  Cases  2  and  3  show  some 
resemblance  to  the  cases  of  external  effects 


Rivalry 

yes  no 


1 

Private  goods. 

e.g.,  bread 

2 

Mixed  goods  with  tendency 

of  underproduction, 

e.g.,  highways 

3 

Mixed  goods  with 
tendency  of  overusing, 
e.g.,  the  commons 

4 

Public  goods 

e.g.,  external  security 

3n  and  Rivalry  With  Public  Goods,  Mixed  Goods,  and 
Goods 
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discussed  earlier.  While  external  effects  are  market 
imperfections,  (pure)  public  goods  as  such  are  not 
produced  by  the  market  at  all. 

With  the  state  producing  such  goods  on  its  own, 
at  least  two  challenges  arise.  First,  the  state  pro¬ 
duces  at  costs  that  are  too  high  (which  can  be 
counteracted  by  employing  market  means  in  the 
whole  production  process  wherever  possible). 
Second,  consumers  may  waste  resources  because 
the  state  produces  them  at  no  cost  to  the  consum¬ 
ers,  creating  an  ethical  issue.  Also,  consumer  sov¬ 
ereignty  can  be  limited  through  state  production  of 
the  wrong  goods  or  in  wrong  quantities  ( crowding 
out).  The  production  of  so-called  merit  goods, 
such  as  health,  education,  and  high  culture,  is  a  fiat 
usually  set  by  political/economic  elites  and  rarely 
by  majority  opinion.  In  other  markets,  allocation 
of  goods  may  suffer. 

In  the  opposite  cases  of  demerit  goods,  the  state 
again  could  work  with  legal  provisions  and  prohi¬ 
bitions,  taxes,  and  subventions  and  with  tradable 
certificates  to  introduce  market  elements — for 
example,  in  controlling  environmental  deteriora¬ 
tion.  Here  economically  inefficient  and  ecologically 
aberrant  rules  from  the  past  create  no  incentives  for 
technological  progress.  They  can  be  substituted  by 
technological  progress  the  external  costs  of  which 
are  addressed  better  by  auctions  and  by  the  selling 
of  certificates.  In  the  long  run,  overregulation  of 
and  by  the  state  may  be  a  crucial  factor  contribut¬ 
ing  to  market  failure,  just  as  much  as  the  lack  of 
monopoly  of  violence  and  failure  to  guard  property 
rights  to  begin  with  are. 

Economic  Crises 

Analytically  and  empirically,  one  has  to  distinguish 
an  economic  crisis  from  situations  of  market  fail¬ 
ure.  Situations  of  market  failure  (lack  of  public 
goods  production)  and  market  imperfection  can  be 
the  causes  as  well  as  the  correlates  and  conse¬ 
quences  of  economic  crises.  Yet  economic  crises  in 
terms  of  supply  shocks  (e.g.,  the  oil  shocks  of  1973 
and  1980),  or  simply  the  overproduction  of  goods 
(as,  e.g.,  in  the  automotive  industry  today),  as  well 
as  demand  shocks  (e.g.,  in  cases  of  natural  catastro¬ 
phes  and  poor  harvests)  must  be  distinguished  from 
the  cases  and  analyses  of  market  failures.  In  a 
sloppy  colloquial  sense,  the  term  market  failure  is 
often  applied  here  but  for  incorrect  and  inconsistent 


theoretical  reasons.  An  economic  crisis  basically 
means  that  producers  have  miscalculated  the 
amount  and  type  of  goods  consumers  are  willing 
to  buy.  The  consequence  is  an  uncleared  market 
and,  in  case  of  a  deepening  crisis,  a  further  exten¬ 
sion  of  unsold  products.  This  cumulative 
downward  process  (see  Knut  Wicksell,  1898)  of 
overproduction  and  falling  prices  stops  when 
consumers  anticipate  higher  prices  in  the  future 
and  start  buying  again.  Thus,  a  normal  economic 
crisis  has  the  character  of  a  cleansing  crisis,  wip¬ 
ing  out  unproductive  suppliers. 

Such  economic  crises  can  be  caused  and  intensi¬ 
fied  by  financial  crises  (just  as  much  as  they  can  be 
alleviated  through  built-in  stabilizers  as  in  the 
European  social  market  economies  from  2008 
onward;  see  Joseph  Stiglitz,  2010).  The  Great 
Depression  with  the  financial  crises  emerging  in 
the  early  1930s  is  a  case  in  point.  Institutional 
adjustments  in  the  financial  sector  then  included 
repealing  the  liberalization  measures  of  the  Glass- 
Steagall  Act  and  separating  commercial  activities 
of  banks  from  their  investment  activities,  giving 
the  Federal  Bank  a  stronger  leeway  in  raising  lia¬ 
bilities  for  banks  yet  with  very  little  Keynesian 
reflation  of  the  economy.  In  the  crisis  after  2008, 
institutional  adjustments  included  increased  liabil¬ 
ity  for  banks  and  other  financial  agents,  more 
transparency  with  hedge  funds,  better  control  of 
the  stock  exchanges  and  of  rating  agencies,  elimi¬ 
nation  of  speculative  deals  such  as  short  sales,  tax 
on  financial  transactions,  higher  taxing  of  banks, 
separation  of  investment  banks  from  retail  and 
commercial  banks,  breaking  down  banks  that  are 
“too  big  to  fail,”  and  the  socialization  of  private 
losses  via  a  bailout  through  the  taxpayer;  these 
were  intended  to  prevent  financial  bubbles  (and 
the  oversupply  of  goods  such  as  housing)  in  the 
future.  At  the  same  time,  from  2008  on  there  was 
a  heavy  reflation  of  the  major  economies  to  avoid 
the  procyclical  monetary  and  fiscal  policies  of  the 
1930s.  Yet  as  noted  by  Carmen  Reinhart  and 
Kenneth  Rogoff  (2009),  underlying  factors  such  as 
human  speculation  and  miscalculation  as  well  as 
greed  will  always  contribute  to  economic  crises  so 
that  the  very  basic  features  of  the  next  bubble  will 
not  be  so  different  from  those  of  previous  ones. 
The  economic  fallout  of  both  the  Great  Depression 
and  the  current  crisis  coming  close  to  a  breakdown 
of  the  financial  engine  of  the  free  market  system  is 
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grave  enough.  It  is,  however,  even  in  its  magnified 
effects,  an  instance  of  absent  or  failing  institutional 
regulations.  Given  human  creativity  and  greed, 
one  will  have  to  reckon  with  further  such  imbal¬ 
ances.  They  are  instances  of  far-reaching  contrac¬ 
tions  of  markets  due  to  a  lack  of  institutional 
safeguards.  Market  failure  in  a  more  theoretically 
strict  sense  has,  however,  to  be  distinguished  from 
these  grave  challenges. 

Nevertheless,  the  failure  of  the  institutional 
framework  (creation  of  a  public  bad  in  not  con¬ 
taining  the  financial  speculation  and  letting  it 
transgress  into  the  goods  and  services  markets)  is  a 
clear  case  of  de  facto  market  failure  in  the  present 
financial  and  economic  crisis.  At  the  global  level, 
there  are  no  adequate  rules  to  allow  for  the  separa¬ 
tion  of  real  growth  effects  of  financial  transactions 
from  cases  of  gross  overspeculation.  (Some  econo¬ 
mists  argue  that  there  could  never  be  such  a 
separation  between  sound  and  unsound  financial 
transactions  since  every  speculator  is  always  met 
by  a  countertrader.)  Legitimacy  for  a  market  soci¬ 
ety  has  been  strongly  undermined,  fortunately  in 
the  absence  of  other  more  repressive  system  alter¬ 
natives  as  present  and  tried  in  the  1930s. 

Given  the  high  debt  burdens,  the  global  market 
power,  informational  asymmetries,  the  external- 
ization  of  costs,  environmental  damages,  moral 
risks,  and  rent  seeking  in  all  its  varieties,  market 
failures  may  become  more  likely.  The  market  does 
better  than  the  state  in  providing  incentives  and 
competition,  in  allocating  scarce  resources,  and  in 
controlling  economic  and,  thus  often,  political 
power.  For  these  functions,  state  activities  cannot 
be  a  substitute  for  the  market.  With  historically 
unseen  market  extensions  in  a  fully  globalized 
economy,  however,  it  may  become  more  difficult 
to  establish  rules  of  consent  shared  worldwide. 
There  are  always  some  externalities  involved,  be 
they  only  in  the  form  of  misperceptions  of  one’s 
own  long-term  advantages  with  respect  to,  for 
example,  issues  of  global  warming  and  other  cli¬ 
mate  changes.  The  failed  Copenhagen  Climate 
Change  Conference  of  2009,  just  as  much  as  the 
current  financial  crisis  with  dramatic  public  debt, 
not  only  figures  in  European,  American,  and  the 
majority  of  Asian  states  but  also  speaks  just  as 
dramatically  to  new  (potential)  mixtures  of  market 
failures  cum  failures  of  political  systems,  whether 
they  are  organized  around  autocratic  principles  or 


around  the  preferences  of  the  median  voter,  or 
occur  in  emergent  systems  such  as  the  European 
Union  and  the  eurozone.  The  temptations  for  free 
riding  may  have  become  higher  under  global  eco¬ 
nomic  markets,  but  so  have  external  costs  and 
burdens  on  future  generations.  There  is  a  lag  in 
ordered  structures  for  global  markets  to  function 
without  creating  excessive  external  costs. 
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Karl  Marx  is  among  the  most  famous  and  influen¬ 
tial  theorists  of  the  modern  age,  from  whom  the 
socialist  or  communist  movements  derived  their 
ideas.  He  is  not  only  a  political  thinker  but  also  a 
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social  philosopher  and  economist,  whose  research 
ranged  widely  over  many  fields.  Marx  has  had  a 
profound  impact  on  the  thoughts  and  actions  of 
people  in  many  countries  since  the  mid-  19th  cen¬ 
tury,  and  in  the  21st  century,  he  is  still  regarded  as 
the  greatest  instructor  by  the  political  left,  includ¬ 
ing  the  adherents  of  communist  parties,  and 
derided  as  a  source  of  political  and  social  chaos  by 
the  political  right.  The  ideas  and  programs  devel¬ 
oped  by  Karl  Marx  in  cooperation  with  Friedrich 
Engels  have  been  generally  called  Marxism. 

Born  in  Trier,  Germany,  into  a  Jewish  family  on 
May  5,  1818,  Marx  received  a  good  education  and 
displayed  great  potential  as  an  outstanding  stu¬ 
dent.  At  the  universities  of  Bonn  and  Berlin,  Marx 
studied  history  and  philosophy,  took  a  strong 
interest  in  the  works  of  the  philosopher  Georg 
Wilhelm  Friedrich  Hegel,  and  joined  a  student- 
professor  group  called  the  Young  Hegelians.  Marx 
submitted  his  doctoral  thesis  at  the  University  of 
Jena  in  1840  and  received  a  doctoral  degree  the 
following  year.  After  an  initial  and  unsuccessful 
effort  to  establish  an  academic  career,  his  liberal 
political  views  led  him  to  find  employment  as  an 
editor  of  a  radical  magazine  in  Cologne,  Rheinische 
Zeitung.  Because  of  his  journalistic  abilities  and 
radical  beliefs,  Marx  was  well  received  in  liberal 
circles  in  Germany  and  quickly  promoted  to  editor 
of  the  magazine.  This  radical  publication,  under 
Marx’s  guidance,  had  to  face  the  problem  of  cen¬ 
sorship  by  the  authoritative  Prussian  government 
and  was  finally  suppressed  after  the  printing  of 
Marx’s  article  on  the  poverty  of  farmers  in  the 
Mosel  Valley. 

In  1843,  Marx  married  his  girlfriend  Jenny  von 
Westphalen  and  emigrated  to  Paris  with  her  to 
escape  political  persecution.  There  he  made  the 
acquaintance  of  French  socialist  thinkers  and  began 
to  witness  firsthand  the  living  conditions  of  people 
in  poverty  by  socializing  with  working-class  people. 
More  important,  he  came  into  contact  for  the  first 
time  and  subsequently  established  his  lifelong 
friendship  with  Engels,  the  author  of  the  classical 
work  The  Conditions  of  the  Working  Class  in 
England  published  in  1844.  As  a  result  of  his  eco¬ 
nomic  and  philosophical  research,  Marx  wrote 
Economic  and  Philosophical  Manuscripts  in  1844, 
in  which  he  showed  great  concern  for  the  dignity 
and  freedom  of  the  individual.  In  February  1844, 
Marx  started  a  new  journal,  the  Franco-German 


Annals ,  together  with  the  philosopher  and  political 
writer  Arnold  Ruge,  in  which  he  published  articles 
on  a  broad  range  of  matters,  such  as  philosophy, 
politics,  and  society. 

Based  on  his  experiences  living  among  working- 
class  people  and  his  comprehensive  researches  on 
history,  economics,  politics,  and  philosophy,  Marx 
became  an  ardent  communist.  He  proposed  his 
ideas  about  communism  by  criticizing  the  alien¬ 
ation  of  labor  under  capitalism.  According  to 
Marx,  under  capitalism,  the  working  class  invests 
its  creative  labor  while  the  capitalist  class  appro¬ 
priates  the  results  of  this  labor  in  exchange  for 
wages.  This  means  that  the  human  world  created 
by  the  proletariat  does  not  belong  to  them  but  is 
instead  owned  by  a  class  of  nonlaboring  owners. 

His  radical  ideas  were  not  tolerated  in  France. 
When  Marx  published  an  article  on  capitalism  in 
the  Franco-German  Annals ,  he  upset  his  partner 
Ruge,  and  the  journal  was  banned  in  France  and 
Germany.  In  January  1845,  Marx  was  expelled 
from  Paris  by  Premier  Franqois  Guizot  at  the 
instance  of  the  Prussian  government  and  moved  to 
Brussels,  Belgium.  During  his  stay  in  Brussels, 
Marx  exchanged  polemics  with  the  Hegelians, 
Feuerbach,  Stirner,  and  the  “True  Socialists”  and 
finished  two  important  works,  The  Floly  Family 
and  The  German  Ideology,  in  collaboration  with 
Engels.  In  The  German  Ideology,  Marx  provided  a 
historical  and  material  basis  for  his  radical  views 
and  insisted  that  the  nature  of  individuals  depended 
on  the  material  conditions  determining  their  own 
productions.  According  to  his  interpretations  of 
history,  the  sum  total  of  the  relations  of  produc¬ 
tion  forms  the  real  basis  of  society,  on  which  a 
legal  and  political  superstructure  is  established.  In 
1847,  Marx  started  another  polemical  exchange 
with  Pierre-Joseph  Proudhon  and  wrote  The 
Poverty  of  Philosophy,  in  which  he  developed  the 
fundamental  propositions  of  his  economic  inter¬ 
pretation  of  history. 

By  early  1846,  Marx  had  established  the 
Communist  Correspondence  Committee  to  con¬ 
nect  all  of  Europe’s  socialist  leaders.  The  following 
year,  the  socialists  held  a  conference  in  Fondon  and 
established  the  Communist  Feague.  Marx  went  to 
attend  the  Communist  Feague  meeting  and  wrote 
The  Communist  Manifesto  together  with  Engels, 
inspired  by  Engels’s  The  Principles  of  Communism. 
The  Communist  Manifesto,  originally  written  as 
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the  platform  of  the  Communist  League,  has  become 
one  of  the  most  radical  and  influential  books  since 
it  was  first  published  on  February  1848.  It  begins 
with  the  famous  proposition  “The  history  of  all 
hitherto  existing  society  is  the  history  of  class 
struggle”  and  contains  a  summary  of  Marxist  the¬ 
ory.  For  example,  one  of  the  major  points  is  the 
need  to  abolish  private  property  and  implement 
public  ownership  of  the  economy.  The  theory  of 
the  Communists  may  perhaps  be  summed  up  in  the 
single  phrase:  abolition  of  private  property.  Another 
aim  is  to  bring  the  proletariat  to  power  and  annihi¬ 
late  the  exploiting  class,  especially  the  bourgeoisie, 
in  politics.  According  to  Marx  and  Engels,  the  first 
step  toward  the  revolution  by  the  working  class  is 
to  raise  the  proletariat  to  the  position  of  the  ruling 
class  in  order  to  win  the  battle  of  democracy.  The 
Communist  Party  as  the  avant-garde  of  the  prole¬ 
tariat  then  comes  to  power  after  winning  the 
struggle  against  the  old  classes,  such  as  landowners 
and  the  bourgeoisie.  The  third  is  to  envision  a  class¬ 
less  society  in  which  the  free  development  of  each  is 
the  condition  for  the  free  development  of  all.  On  its 
publication  in  1848,  The  Communist  Manifesto 
quickly  became  the  credo  of  the  poor  and  oppressed 
all  over  the  world,  which  led  to  the  greatest  politi¬ 
cal  upheavals  of  the  19th  and  20th  centuries  and  to 
the  establishment  of  the  communist  governments 
that  ruled  half  the  globe  for  several  decades. 

After  the  Manifesto  came  to  light,  even  the  rela¬ 
tively  tolerant  Belgian  government  served  Marx 
with  an  expulsion  order.  The  revolutionary  atmo¬ 
sphere  in  Germany  in  1848  enabled  him  to  return 
to  Cologne  where  he  persuaded  some  liberal  indus¬ 
trialists  to  back  a  new  version  of  his  old  journal, 
the  Neue  Rheinische  Zeitung.  Under  Marx’s 
editorship,  it  became  extremely  radical  and  anti¬ 
government  and  was  suppressed  again  by  the 
authorities.  Marx  printed  the  last  issue  of  the 
Zeitung  in  red  ink.  For  this  reason,  he  was  arrested 
for  press  offenses  and  incitement  to  armed  insur¬ 
rection.  But  after  a  long  and  powerful  speech  at  his 
trial,  Marx  was  acquitted  by  a  jury  in  Cologne. 
Faced  with  expulsion  from  Cologne  and  suppres¬ 
sion  of  his  journal,  Marx  visited  Paris  again  as  a 
representative  of  German  democracy  before  the 
Paris  National  Assembly  but  was  similarly  served 
with  an  expulsion  order  from  Paris. 

In  1849,  Marx  moved  to  London,  where  he 
lived  with  his  large  and  devoted  family  until  his 


death.  Although  Marx  was  a  correspondent  to  the 
New  York  Tribune  from  1852  to  1861,  for  the 
most  part  he  was  financially  dependent  on  gener¬ 
ous  support  from  Engels.  His  typical  workday, 
from  10  a.m.  to  7  p.m.,  was  spent  in  the  reading 
room  of  the  British  Museum,  where  wrote  many 
volumes  on  different  subjects.  Sometimes  he 
lacked  money  for  postage  to  send  his  manuscripts 
to  the  publishers.  Afflicted  with  boils,  eye  and 
liver  trouble,  and  a  contentious  and  uncompro¬ 
mising  temper,  Marx  was  not  a  prepossessing 
sight  during  his  last  years.  Although  virtually 
unknown  in  England,  he  enjoyed  great  popularity 
on  the  Continent,  especially  in  liberal  circles  and 
among  the  working  people.  In  1864,  the  Inter¬ 
national  Workingmen’s  Association  was  founded 
at  a  meeting  in  St.  Martin’s  Hall,  London,  and 
Marx  was  invited  to  draw  up  the  inaugural 
address.  In  1867,  the  first  volume  of  Marx’s  great¬ 
est  work,  Capital  (Das  Kapital ),  was  published. 
The  second,  third,  and  incomplete  fourth  volumes 
did  not  come  out  until  after  Marx’s  death  in  1883. 

In  Capital,  Marx  analyzed  the  secret  of  capi¬ 
talist  production  by  focusing  on  the  concept  of 
surplus  value  and  formulated  his  revolutionary 
theory  by  revealing  the  injustice  of  the  capitalist 
system.  According  to  Marx,  labor  is  a  commod¬ 
ity  like  any  other;  therefore,  following  the  labor 
theory  of  value,  it  must  be  valued  by  the  man¬ 
hours  devoted  to  its  “production” — that  is,  to 
feeding,  clothing,  and  sheltering  the  worker  in 
order  to  maintain  life  at  subsistence  level.  In  the 
capitalist  system,  labor  is  bought  just  like  any 
other  commodity.  But,  unlike  any  other  com¬ 
modity,  labor  is  not  consumed  in  a  clearly  deter¬ 
mined  period  of  time.  A  laborer  is  bought  for  the 
price  of  sustaining  him  physically,  prorated  in 
hours  or  days  or  weeks.  But  he  may  produce  the 
equivalent  of  the  price  in  economic  value  in  6  or 
8  hours  of  work,  whereas  the  factories  of  Marx’s 
day  kept  men  going  for  10,  12,  or  14  hours  a  day. 
The  difference  between  what  the  worker  does 
and  what  he  is  paid  is  surplus  value,  the  source  of 
all  capitalist  profits.  In  the  capitalist  society  that 
is  divided  into  the  capitalist  class  with  the  means 
of  production  and  the  proletariat  without  the 
means  of  production,  the  injustice  heaped  on  the 
workers  is  not  the  result  of  bad  men  but  of  a 
particular  system.  Reform  within  the  system, 
however  well  intentioned,  is  doomed  to  failure. 
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Only  revolutionary  overthrow  of  the  whole  capi¬ 
talist  system  can  succeed. 

Marx  died  on  March  14,  1883,  and  was  buried 
in  Highgate  Cemetery,  London,  with  a  tombstone 
epitaph  reading  “Workers  of  all  lands  unite,”  the 
last  slogan  in  The  Communist  Manifesto.  In  the 
years  following  Marx’s  death,  Engels  edited  and 
translated  his  works  and  in  many  ways  continued 
their  friendship  until  his  own  death  in  1895. 
Although  Marx  spent  most  of  his  life  in  reading 
and  writing  as  a  student  and  scholar,  he  has  had  a 
strong  influence  not  only  on  modern  ideas  but  also 
on  political  practices  and  social  movements  in 
many  countries  all  over  the  world.  Even  today,  his 
works  are  reprinted  and  read  widely,  and  his  ideas 
are  discussed  and  debated  in  philosophy,  sociol¬ 
ogy,  and  political  science. 

Dingping  Guo 
Fudan  University 
Shanghai,  China 
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Marxism 


Marxism  is  the  system  of  social  and  political  the¬ 
ory  about  human  life,  historical  development,  the 
capitalist  crisis,  and  the  communist  revolution 
developed  by  the  German  philosopher  and  econo¬ 
mist  Karl  Marx  along  with  his  close  friend  and 
supporter  Friedrich  Engels  during  the  middle  and 


late  19th  century  and  subsequently  elaborated  by 
their  disciples  from  various  backgrounds  all  over 
the  world.  Although  there  are  inconsistencies  and 
contradictions  in  Marx’s  theory  during  the  differ¬ 
ent  periods  of  its  development,  and  there  are 
considerable  debates  and  disputes  over  its  nature 
and  structure,  some  basic  consensus  can  be  reached 
based  on  the  analysis  of  Marx’s  works  and  studies 
on  the  subsequent  evolution  of  the  theory.  A  his¬ 
toric  landmark  in  social  and  political  thought, 
Marxism  provided  the  foundation  for  the  com¬ 
munist  revolution  and  socialist  reform  that  marked 
the  20th  century,  the  influence  of  which  continues 
to  be  felt  worldwide. 

Theoretical  Sources 

Marx  developed  his  eponymous  theoretical  system 
from  three  major  sources:  German  philosophy, 
French  politics,  and  English  economics.  Marx 
showed  great  interest  in  law  and  philosophy  in  his 
early  works;  his  later  works  were  more  concerned 
with  political  economy  and  political  strategy. 

The  German  philosophy  on  which  Marx  drew 
was  primarily  that  of  Georg  Wilhelm  Friedrich 
Hegel  but  also  included  the  Young  Hegelians  and 
Ludwig  Feuerbach’s  materialism.  During  his  stu¬ 
dent  days  at  the  universities  of  Bonn  and  Berlin, 
Marx  studied  history  and  philosophy,  took  a 
strong  interest  in  the  works  of  the  philosopher 
Hegel,  and  joined  a  student-professor  group  called 
the  Young  Hegelians.  Many  of  Marx’s  basic  ideas, 
such  as  his  critique  of  civil  society  and  private 
property,  emerged  when  he  was  writing  Critique 
of  Hegel’s  Philosophy  of  Right.  He  asserts  that 
religion  is  the  “opium  of  the  people”  and  calls  for 
an  “uprising  of  the  proletariat”  to  realize  the  con¬ 
ceptions  of  philosophy,  a  point  also  made  in 
Theses  on  Feuerbach  (1845).  Marx  had  been 
strongly  influenced  by  Hegel’s  Logic  and  dialecti¬ 
cal  method,  and  his  great  work  Capital  is  imbued 
with  intellectual  categories  derived  from  Hegel. 

French  politics  and  socialist  movements  played 
an  important  role  in  shaping  Marx’s  thoughts. 
Marx’s  father-in-law,  Baron  von  Westphalen,  and 
his  teachers  were  all  strongly  influenced  by  the 
French  Enlightenment.  Marx  was  also  strongly 
influenced  by  the  French  Revolution  and  by  French 
thinkers  such  as  Jean-Jacques  Rousseau.  After  he 
emigrated  to  Paris  together  with  his  wife  to  escape 
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political  persecution  by  German  authorities,  Marx 
became  acquainted  with  French  socialist  thinkers 
and  began  to  witness  the  living  conditions  of  peo¬ 
ple  in  poverty  by  socializing  with  working-class 
people.  French  socialism  enabled  Marx  to  break 
with  Flegel’s  teleological  approach  to  history,  to 
develop  a  broad-ranging  social  economy,  to  under¬ 
stand  the  social  and  personal  impact  of  modern 
industry,  and  to  grasp  the  significance  of  socialism. 
After  studying  the  development  of  Bonapartism 
and  commenting  on  the  nature  and  significance  of 
the  Paris  Commune,  Marx  completed  several 
political  works  (Class  Struggles  in  France  and  The 
Eighteenth  Brumaire)  and  expounded  his  major 
political  ideas  about  the  state  and  revolution. 

The  third  major  source  of  Marx’s  theory  was 
English  economics,  exemplified  by  writers  such  as 
Adam  Smith,  David  Ricardo,  and  Thomas  Malthus. 
It  was  during  his  years  in  Paris  that  Marx  began 
his  study  of  English  economics;  from  the  early 
1840s,  he  made  an  increasingly  detailed  study  of 
the  works  of  English  economists.  After  moving  to 
London,  Marx  undertook  deep  and  systematic 
research  on  the  development  of  the  capitalist  mode 
of  production  in  England.  In  writing  the  1844 
Manuscripts ,  Marx  relied  extensively  on  the  work 
of  Adam  Smith,  especially  his  views  on  the  division 
of  labor,  rent,  subsistence  wages,  and  the  three 
stages  of  society.  Once  he  became  acquainted  with 
Ricardo’s  work,  Principles  of  Political  Economy , 
Marx  abandoned  the  economic  theory  developed 
in  the  1844  Manuscripts.  His  critique  of  The 
Poverty  of  Philosophy  was  Ricardian  in  character. 
By  absorbing  the  ideas  in  the  works  of  classic 
political  economists  and  analyzing  the  capitalist 
development  in  England,  Marx  established  his 
own  status  as  a  political  economist. 

Although  Marx  drew  on  various  sources,  he 
did  not  merely  combine  them  mechanically.  A  dis¬ 
tinctive  feature  of  Marx’s  theory  is  his  creative 
synthesizing  ability.  A  thorough  study  of  German 
philosophy,  French  politics,  and  English  econom¬ 
ics  allowed  Marx  to  develop  his  own  philosophi¬ 
cal,  economic,  social,  and  political  theory. 

Historical  Materialism 
and  Social  Development 

Marx’s  unique  contribution  to  historical  philoso¬ 
phy  is  his  historical  materialism  and  theory  of 


social  development.  According  to  his  explanations 
in  The  German  Ideology  (1846)  and  The  Critique 
of  Political  Economy  (1859),  the  nature  of  indi¬ 
viduals  depends  on  the  material  conditions  deter¬ 
mining  their  productions.  In  the  social  production 
of  their  existence,  people  enter  into  definite  rela¬ 
tions  that  are  indispensable  and  independent  of 
their  will.  According  to  Marx, 

These  relations  of  production  correspond  to  a 
definite  stage  of  development  of  their  material 
forces  of  production.  The  sum  total  of  these 
relations  of  production  constitutes  the  economic 
foundation  of  society  on  which  there  arise  legal 
and  political  superstructures  and  to  which 
correspond  definite  forms  of  social  consciousness. 

( Contribution  to  the  Critique  of  Political 
Economy,  Preface) 

The  mode  of  production  in  material  life  deter¬ 
mines  the  general  character  of  the  social,  political, 
and  intellectual  processes  of  life: 

It  is  not  the  consciousness  of  men  that  determines 
their  existence,  but,  on  the  contrary,  their  social 
existence  determines  their  consciousness.  At  a 
certain  stage  of  their  development,  the  material 
forces  of  production  come  eventually  into  conflict 
with  the  existing  relations  of  production.  [.  .  .] 
From  forms  of  development  of  the  forces  of 
production  these  relations  turn  into  their  fetters. 
Then  comes  the  period  of  social  revolution.  With 
the  change  of  economic  foundation  the  entire 
immense  superstructure  [.  .  .]  is  more  or  less 
rapidly  transformed.  In  considering  such 
transformations  the  distinction  should  always  be 
made  between  the  material  transformation  of  the 
economic  condition  of  production  [.  .  .]  and  the 
legal,  political,  religious,  aesthetic,  philosophical, 
in  short,  ideological  transformation.  ( Critique  of 
Political  Economy,  Preface) 

All  ideological  transformations  “must  be  explained 
from  the  contradictions  of  material  life,  from  the 
existing  conflict  between  the  social  forces  of  pro¬ 
duction  and  the  relations  of  productions”  ( Critique 
of  Political  Economy,  Preface).  Therefore,  the 
legal  relations  as  well  as  the  forms  of  state  could 
neither  be  understood  by  themselves  nor  explained 
by  the  so-called  general  progress  of  the  human 
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mind;  they  are  rooted  in  the  material  conditions  of 
life. 

Since  every  society  is  divided  into  various 
groups,  a  strong  minority  tends  to  use  their 
economic  power  in  order  to  exploit  the  mass 
population  by  appropriating  the  economic  surplus 
for  their  own  benefit.  This  inherently  conflicting 
situation  gives  rise  to  a  class  struggle  that  centers 
on  the  ownership  and  control  of  the  means  of  pro¬ 
duction.  The  social  group  that  controls  the  means 
of  production  forms  the  ruling  class,  and  the  group 
without  the  means  of  production  constitutes  the 
ruled  class.  All  political  institutions  and  cultural 
beliefs  are  shaped  by  the  ruling  class  so  as  to  bol¬ 
ster  the  unequal  distribution  of  resources.  The  his¬ 
tory  of  all  hitherto  existing  society  is  the  history  of 
class  struggles.  According  to  Marx, 

Freeman  and  slave,  patrician  and  plebeian,  lord 
and  serf,  guild-master  and  journeyman,  in  a 
word,  oppressor  and  oppressed,  stood  in  constant 
opposition  to  one  another,  carried  on  an 
uninterrupted  [.  .  .  J  fight  that  each  time  ended, 
either  in  a  revolutionary  re-constitution  of  society 
or  in  the  common  ruin  of  the  contending  classes. 
(Section  I,  Communist  Manifesto) 

Based  on  the  conflict  between  the  forces  of  produc¬ 
tion  and  the  relations  of  production,  the  history  of 
mankind  progresses  through  revolutions  to  the 
next  higher  stage.  In  the  Marxist  view  of  history, 
the  primitive  agrarian  society  was  followed  by  the 
slave  society  of  the  ancient  world,  the  feudal  soci¬ 
ety,  the  capitalist  society,  and  finally  the  commu¬ 
nist  society.  The  progress  is  made  by  inevitable  and 
ultimately  uncontrollable  material  forces,  rather 
than  human  thought  and  initiative.  This  is  some¬ 
times  summarized  as  so-called  economic  determin¬ 
ism.  In  fact,  while  Marx  emphasized  the  crucial 
role  of  material  forces  in  social  development,  he 
also  analyzed  the  important  and  strong  influences 
of  political  superstructure  and  human  initiative  on 
history. 

Capitalist  Crisis  and  Surplus  Value 

In  February  1848,  Marx  and  Engels  published  the 
well-known  pamphlet,  Communist  Manifesto.  It  is 
usually  regarded  as  a  public  statement  of  general 
theory  and  political  principle  of  Marxism  and  a 


call  for  general  cooperation  among  different  work¬ 
ers’  organizations.  According  to  their  analysis, 
capitalism  as  a  revolutionary  mode  of  production 
was  fundamentally  changing  the  course  of  civiliza¬ 
tion.  It  introduced  market  relations  and  cash  nexus 
into  all  spheres  of  society  and  throughout  the 
world.  The  market  kept  ever  growing,  the  demand 
ever  rising.  This  market  has  given  an  immense 
development  to  commerce,  to  navigation,  to  com¬ 
munication.  The  bourgeoisie  cannot  exist  without 
constantly  revolutionizing  the  instruments  of  pro¬ 
duction,  and  thereby  the  relations  of  production, 
and  with  them  the  whole  relations  of  society.  By 
continually  modernizing  the  forces  of  production 
and  promoting  the  division  of  labor,  capitalism 
prepared  the  material  conditions  necessary  for 
social  cooperation  and  planned  management  in 
economic  life.  Despite  the  ever-increasing  social 
character  of  capitalist  production  or  socialization 
of  the  forces  of  production,  the  capitalist  system 
was  operated  for  private  profit  under  private  own¬ 
ership.  The  search  for  private  profit  imposed  fet¬ 
ters  on  the  further  development  of  production.  The 
capitalist  relations  of  production  came  finally  into 
conflict  with  its  forces  of  production.  While  a  huge 
sum  of  wealth  was  accumulated  in  the  hands  of 
capitalists,  its  direct  producers  were  impoverished. 
Lack  of  demand  coexisting  with  unsold  goods  pro¬ 
duced  ever-worsening  economic  crises.  This 
dynamic  of  capitalism  created  conditions  of  its 
own  overthrown. 

Moreover,  capitalism  was  creating  the  indus¬ 
trial  proletariat  as  its  own  grave  diggers.  As  capi¬ 
talism  destroyed  precapitalist  modes  of  production 
at  home  and  abroad,  other  classes  were  eliminated 
and  the  proletariat  expanded.  According  to  Marx, 

With  the  development  of  industry  the  proletariat 
not  only  increased  in  number;  in  became 
concentrated  in  greater  masses,  its  strength  grew, 
and  it  felt  that  strength  more.  The  various 
interests  and  conditions  of  life  within  the  ranks 
of  the  proletariat  were  more  and  more  equalized, 
in  proportion  as  machinery  obliterated  all 
distinctions  of  labor,  and  nearly  everywhere 
reduced  wages  to  the  same  low  level.  (Section  I, 
Communist  Manifesto) 

As  individual  workers,  then  groups  of  workers  in 
a  factory  or  trade,  and  eventually  all  workers  in  a 


1498  Marxism 


nation-state  or  even  the  world  economy  mobilized 
to  resist  capitalist  exploitation,  the  proletariat 
would  grow  more  conscious  of  their  shared  class 
position  and  their  common  interest  in  the  over¬ 
throw  of  capitalism.  When  their  economic  strug¬ 
gles  encountered  resistance  of  the  state  as  well  as 
individual  capitalists  and  groups  of  employers,  the 
working  class  would  develop  a  revolutionary  con¬ 
sciousness  and  move  from  trade  unionism  to 
political  party.  With  economic  crisis  deepening 
and  the  proletariat  gaining  in  strength,  revolution 
would  be  inevitable. 

In  1867,  the  first  volume  of  Marx’s  greatest 
work,  Capital  (Das  Kapital ),  was  published.  The 
second,  third,  and  incomplete  fourth  volumes  did 
not  come  out  until  after  Marx’s  death  in  1883.  In 
Capital,  Marx  analyzed  the  secret  of  capitalist  pro¬ 
duction  by  focusing  on  the  concept  of  surplus  value 
and  formulated  his  revolutionary  theory  by  reveal¬ 
ing  the  injustice  of  capitalist  system.  According  to 
Marx,  labor  is  a  commodity  like  any  other  com¬ 
modity;  therefore,  following  the  labor  theory  of 
value,  it  must  be  valued  by  the  man-hours  devoted 
to  its  “production” — that  is,  to  feeding,  clothing, 
and  sheltering  the  worker  in  order  to  maintain  life 
at  subsistence  level.  In  the  capitalist  system,  labor  is 
bought  just  like  any  other  commodity.  But,  unlike 
any  other  commodity,  labor  is  not  consumed  in  a 
clearly  determined  period  of  time.  Labor  is  bought 
for  the  price  of  sustaining  the  laborer  physically, 
prorated  in  hours  or  days  or  weeks.  But  he  may 
produce  the  equivalent  of  the  price  in  economic 
value  in  6  or  8  hours  of  work,  whereas  the  factories 
of  Marx’s  day  kept  men  going  for  10,  12,  or 
14  hours  a  day.  The  difference  between  what  the 
worker  does  and  what  he  is  paid  is  surplus  value, 
the  source  of  all  capitalist  profits.  In  the  capitalist 
society  that  is  divided  into  the  capitalist  class  with 
the  means  of  production  and  the  proletariat  with¬ 
out  the  means  of  production,  the  injustice  heaped 
on  the  workers  is  not  the  result  of  bad  men,  but  of 
a  particular  system.  Reform  within  the  system, 
however  well  intentioned,  is  doomed  to  failure. 
Only  revolutionary  overthrow  of  the  whole  capital¬ 
ist  system  can  succeed. 

Communist  Revolution  and  Classless  Society 

Since  the  revolution  is  an  inevitable  historical  prod¬ 
uct  as  the  result  of  the  conflict  between  the  forces  of 


production  and  the  relations  of  production,  and 
especially  class  struggle  between  the  bourgeoisie  and 
the  proletariat  in  capitalist  society,  neither  Marx  nor 
Engels  paid  attention  to  the  means  of  revolution, 
especially  political  leadership  and  political  strategy, 
which  would  be  explained  and  expounded  by  their 
followers  such  as  Lenin  and  Mao  Zedong.  They  also 
said  little  about  what  would  happen  after  the  revo¬ 
lution.  It  was  believed  that  it  would  be  absurd  to 
predict  the  future  society  in  detail.  Nonetheless, 
some  major  ideas  about  communism  can  be  found 
in  the  classic  works  of  Marx  and  Engels. 

When  revolution  breaks  out,  the  proletariat 
seizes  the  power  of  the  state  and  transforms  the 
means  of  production  in  the  first  instance  into  state 
property.  As  Marx  and  Engels  suggested,  the  revo¬ 
lutionary  measures  in  the  most  advanced  countries 
would  include  abolition  of  private  property,  a 
heavy  progressive  or  graduated  income  tax,  aboli¬ 
tion  of  all  right  of  inheritance,  confiscation  of  the 
property  of  all  emigrants  and  rebels,  centralization 
of  credit  in  the  hands  of  the  state,  centralization  of 
the  means  of  communication,  and  transport  in  the 
hands  of  the  state.  By  doing  so,  it  puts  an  end  to 
itself  as  the  proletariat,  it  puts  an  end  to  all  class 
differences  and  class  antagonisms,  and  it  puts  an 
end  to  the  state  as  state.  The  government  of  per¬ 
sons  is  replaced  by  the  administration  of  things 
and  the  direction  of  the  process  of  production.  The 
state  is  not  “abolished”;  it  withers  away.  But  in  a 
few  places,  Marx  and  Engels  referred  to  the  transi¬ 
tional  stage  as  “the  dictatorship  of  proletariat.” 
The  existence  of  classes  is  bound  up  with  particu¬ 
lar  historic  phases  in  the  development  of  produc¬ 
tion;  the  class  struggle  necessarily  leads  to  the 
dictatorship  of  the  proletariat;  this  dictatorship 
itself  only  constitutes  the  transition  to  the  aboli¬ 
tion  of  all  classes  and  to  a  classless  society. 

With  regard  to  postrevolutionary  politics,  Marx 
cited  the  experience  of  the  Paris  Commune  and 
talked  about  the  possibility  of  bridging  the  gap 
between  the  state  and  civil  society  that  had  been 
opened  up  by  capitalist  democracy.  As  an  instance 
of  the  abolition  of  the  division  of  labor  in  politics, 
Marx  welcomed  the  Commune’s  proposal  to  have 
all  officials,  including  judges,  elected  by  universal 
suffrage  and  revocable  at  any  time;  to  pay  officials 
the  same  wages  as  manual  laborers;  to  replace  the 
standing  army  by  the  armed  people;  and  to  divest 
the  police  and  clergy  of  their  political  influence. 
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The  initiative  of  the  Commune  could  yield  a  decen¬ 
tralized,  federal  political  structure  and  an  economy 
based  on  cooperatives  united  by  a  common  plan. 

According  to  Marx’s  explanation  and  predic¬ 
tion,  the  fundamental  features  of  communism 
include  at  least  the  following  elements: 

•  The  first  is  to  eliminate  the  private  property  and 
implement  the  public  ownership  in  economy.  The 
theory  of  the  Communists  may  be  summed  up  in 
a  single  phrase:  abolition  of  private  property. 

•  The  second  is  to  limit  free  competition  and  carry 
out  economic  planning.  The  classic  socialist 
believes  that  capitalist-free  competition  may  lead 
to  economic  disorder  and  increasing  inequality. 
Only  after  all  economic  activities  are  placed 
under  the  comprehensive  economic  plan  can 
economic  development  be  promoted  and 
economic  crisis  avoided. 

•  The  third  is  to  distribute  the  economic  surplus 
based  on  labor  and  need.  In  contrast  to  the 
capitalism  in  which  capital  plays  the  most 
important  role  in  the  process  of  distribution,  the 
Communists  insist  that  labor  and  need  are  the 
most  important  factors  in  distributing  social 
wealth. 

•  Finally,  the  state  as  a  tool  of  rule  by  the  ruling 
class  would  wither  away  and  would  gradually  be 
replaced  by  the  administration  of  public  affairs. 
In  place  of  the  old  bourgeois  society,  with  its 
classes  and  class  antagonisms,  there  shall  be  an 
association,  in  which  the  conditions  for  the  free 
development  of  one  are  the  conditions  for  the 
free  development  of  all. 

Marxism  After  Marx 

The  classic  Marxist  theory  was  expounded  and 
elaborated  based  on  the  historical  developments  in 
the  Western  industrialized  countries  such  as 
England,  France,  and  Germany.  After  economic, 
social,  and  political  changes  took  place,  many  social 
theorists  and  political  leaders  tried  to  redefine  and 
develop  Marxism  based  on  the  new  situations. 

Revisionist  Marxism 

During  the  period  of  economic  depression  and 
political  repression  in  the  1880s,  Marxism  became 
dominant  in  the  German  Social  Democratic  Party. 


Karl  Kautsky  explained  and  defended  the  theories 
of  surplus  value,  class  struggle,  and  capitalist  cri¬ 
sis.  His  works  defined  Marxism  for  the  generation 
after  Marx  and  constituted  the  fundament  of 
“orthodox  Marxism.”  Another  theorist,  Eduard 
Bernstein,  launched  the  revisionist  attack  on 
“orthodox  Marxism”  and  directly  refuted  the 
theories  of  surplus  value,  impoverishment,  capital 
concentration,  and  crisis.  According  to  Bernstein, 
workers  were  not  becoming  poorer;  the  numbers 
of  peasants  was  not  declining;  a  new  middle  class 
was  growing  in  size  and  importance;  share  owner¬ 
ship  refuted  the  claim  of  capital  concentration;  and 
capitalism  was  developing  mechanisms  to  reduce 
competition  and  remove  recurrent  economic  crisis. 

Western  Marxism 

The  term  Western  Marxism  normally  excludes 
orthodox  communists  of  strict  Marxist  obedience 
and  is  confined  to  the  collection  of  thinkers  that 
centered  on  the  work  of  Georg  Lukacs  and  Karl 
Korsch  in  Central  Europe,  Antonio  Gramsci  in 
Italy,  and  the  Frankfurt  school  in  Germany.  Unlike 
the  previous  generation  of  Marxist  theorists,  most 
of  the  Western  Marxist  thinkers  were  not  impor¬ 
tant  figures  in  political  parties.  They  tended  to  be 
academics  rather  than  activists,  writing  in  a  period 
of  declining  working-class  movements  due  to  capi¬ 
talist  democratic  and  economic  developments. 
Western  Marxism  is  a  philosophical  meditation  on 
the  defeat  of  Marxism  in  the  West.  Although  some 
people  might  question  whether  Western  Marxism  is 
real  Marxism,  the  modes  of  thought  in  Western 
Marxism  undoubtedly  extended  the  horizons  of 
Marxist  discussion  beyond  the  rather  limited  per¬ 
spective  of  the  Second  International  and  Leninist 
orthodox.  Antonio  Gramsci’s  concept  of  hegemony 
and  its  consequences  for  political  culture,  the  treat¬ 
ment  of  Sigmund  Freud  by  Herbert  Marcuse,  the 
drastic  critique  of  the  Enlightenment  by  Max  Hork- 
heimer  and  Theodor  Adorno,  all  these  attempts  to 
remedy  weakness  or  gaps  in  the  classical  Marxist 
tradition  have  produced  meaningful  and  insightful 
works  on  philosophy,  politics,  and  society. 

Leninism 

Contrary  to  Marx’s  expectations,  the  socialist 
countries  were  not  founded  in  the  Western 
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advanced  countries  but  in  some  underdeveloped 
countries  such  as  Russia  and  China.  After  the 
Bolshevik  Revolution  of  October  1917,  the  first 
socialist  country  was  established  in  Russia  under 
the  strong  leadership  of  Vladimir  Lenin.  To  lead 
the  proletarian  revolution,  Lenin  contributed  a  lot 
to  Marxist  theory  in  his  theory  of  the  party  and  his 
concept  of  capitalistic  imperialism.  In  the  process 
of  building  the  first  socialist  country,  Lenin  had 
been  searching  for  several  models  of  socialism, 
such  as  War  Communism  and  the  New  Economic 
Policy,  in  the  face  of  foreign  invasion  and  domestic 
hardship.  In  comparison  with  the  comprehensive 
state  direction  and  management  of  the  economy  in 
War  Communism,  the  New  Economic  Policy  was 
the  strategic  retreat  in  which  the  state  withdrew 
from  the  ownership  and  management  of  small  and 
medium  enterprises,  retaining  only  the  very  large- 
scale,  strategically  important  parts  of  industry  and 
communications.  Freedom  for  peasants  and  trad¬ 
ers  to  market  their  goods  was  extended  as  the  state 
withdrew.  However,  after  Joseph  Stalin  ascended 
to  the  top  of  the  party-state  in  Soviet  Union,  he 
proceeded  to  announce  radical  plans  for  the  rapid 
industrialization  of  the  country  and  the  collectiv¬ 
ization  of  agriculture. 

Maoism  and  the  Theory  of  Deng  Xiaoping 

Mao  Zedong,  the  principal  Chinese  Marxist 
theorist,  was  the  Communist  statesman  who  con¬ 
tributed  to  the  founding  of  the  Chinese  Communist 
Party  in  1921,  the  Communist  Army  in  1927,  and 
the  People’s  Republic  of  China  (PRC)  in  1949  after 
a  long  period  of  military  struggle.  Maoism  is  com¬ 
posed  of  many  different  kinds  of  ideas  and  ideol¬ 
ogy  and  strategy  and  tactics  and  believed  to  be  the 
creative  results  of  applying  Marxism-Leninism  to 
China,  a  semifeudal  and  semicolonial  country 
without  modern  industrial  developments.  The 
most  salient  feature  of  Maoism  is  his  emphasis  on 
the  importance  of  the  peasant  issue  in  Chinese 
revolution  and  socialist  construction.  The  Marxist- 
Leninist  tradition  treated  peasants  as  incapable  of 
revolutionary  initiative  and  only  marginally  useful 
in  backing  urban  proletarian  revolution.  Based 
on  his  living  experiences  and  his  analysis  of  the 
rural  situation  in  China,  Mao  came  to  recognize 
the  potential  power  of  China’s  hundreds  of  mil¬ 
lions  of  peasants  and  decided  to  establish  his  base 


in  rural  areas  instead  of  big  cities.  The  peasants 
constituted  the  vast  majority  of  China’s  popula¬ 
tion,  but  most  of  them  were  hard-pressed  and 
lived  in  extreme  poverty.  According  to  Mao,  they 
were  very  receptive  to  revolutionary  agitation  and 
could  become  a  revolutionary  force  if  fully  mobi¬ 
lized  and  properly  guided.  Proceeding  from  this 
belief,  Mao  proposed  to  instill  in  them  a  revolu¬ 
tionary  consciousness  and  make  their  force  alone 
suffice  for  revolution.  By  so  doing,  Mao  led 
Chinese  revolution  to  succeed  and  gradually 
formed  a  special  sentiment  for  peasants.  During 
the  Cultural  Revolution,  Mao  sent  many  city 
workers,  intellectuals,  and  bureaucrats  to  rural 
areas  and  forced  them  to  receive  reeducation 
through  agricultural  labor  together  with  peasants 
because  Mao  believed  that  many  of  them  began  to 
become  corrupt  in  the  big  cities. 

After  the  death  of  Mao  and  the  end  of  the 
Cultural  Revolution,  Deng  Xiaoping  emerged  as 
the  new  supreme  leader  and  began  to  review  and 
revise  the  basic  line  adopted  by  Mao.  Deng  Xiaoping 
thought  it  was  imperative  to  give  a  new  perspective 
on  socialism  and  make  a  breakthrough  on  socialist 
theory  under  the  banner  of  “emancipating  the 
mind”  and  “seeking  the  truth  from  the  facts.”  The 
new  theory  of  socialism  with  Chinese  characteris¬ 
tics  has  been  expounded  and  enriched  by  Deng 
Xiaoping  and  other  Communist  leaders  during  the 
past  3  decades.  At  the  beginning  of  1992,  Deng 
Xiaoping  made  a  famous  southern  tour  in  which  he 
talked  a  lot  about  the  nature  of  socialism  and  rede¬ 
fined  it  as  “liberating  productivity,  developing  pro¬ 
ductivity,  eradicating  exploitation,  getting  rid  of 
the  polarization  between  rich  and  poor,  and  finally 
getting  rich  together.”  According  to  Deng,  the 
planned  economy  is  not  equal  to  socialism,  and 
there  are  plans  also  in  capitalist  countries.  The  mar¬ 
ket  economy  is  not  equal  to  capitalism,  and  there 
are  markets  also  in  socialist  countries.  The  criteria 
for  socialism  or  capitalism  are  not  that  there 
are  more  markets  or  more  plans  but  whether  it  is 
helpful  to  develop  productivity,  enhance  the  com¬ 
prehensive  national  power,  and  improve  living 
standards  for  the  common  people.  In  contrast  with 
the  traditional  socialist  theory  that  rejected  com¬ 
modity  economy  and  market  mechanism,  the 
new  theory  of  socialist  market  economy  insisted 
on  the  coexistence  of  socialism  and  market 
economy  because  socialism  has  been  redefined  as 
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the  grandiose  schemes  for  developing  productivity 
and  getting  rich,  and  market  has  been  regarded  as 
the  mere  means  to  organize  and  regulate  economic 
relations.  Deng  Xiaoping’s  theory  has  been  called 
Marxism  of  contemporary  China,  the  latest  product 
of  applying  Marxism  to  China’s  socialist  practice. 

Dingping  Guo 
Fudan  University 
Shanghai,  China 
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The  concept  of  masses  is  intricately  interwoven 
with  the  concept  of  elites.  Both  concepts  were 
introduced  into  the  social  sciences  during  the 
period  of  industrialization  in  the  late  19th  and 
early  20th  centuries.  With  the  demise  of  feudal 
society,  new  social  categories  emerged  that  defied 
the  traditional  classifications  of  the  old  corpora¬ 
tive  state.  Just  as  modern  elites  do  not  coincide 
with  traditional  aristocracies,  masses  do  not  coin¬ 
cide  with  the  third  estate.  Neither  was  the  Marxist 
concept  of  class  appropriate  for  grasping  the 
socially  heterogeneous  composition  of  mass  elec¬ 
torates  and  social  movements.  In  this  entry,  the 
origins  of  this  concept,  its  major  proponents  and 
critics,  the  respective  empirical  evidence,  and  its 
contemporary  relevance  are  discussed. 


Social  philosophers  and  social  scientists  writing 
about  masses  have  mostly  referred  to  their  large 
numbers  and  their  lack  of  structure.  The  rise  of 
masses  was  seen  as  a  consequence  of  industrializa¬ 
tion  and  urbanization.  The  German  sociologist 
Ferdinand  Tonnies  argued  that  the  increasing  divi¬ 
sion  of  labor  would  replace  primary  interpersonal 
relations  with  indirect,  secondary  relations,  thus 
leading  to  an  erosion  of  the  traditional  social 
bonds  of  family,  kinship,  and  neighborhood.  This 
development  is  aptly  captured  by  his  dichotomy  of 
Gemeinschaft  and  Gesellschaft.  Since  that  period 
was  also  one  of  mass  political  mobilization  and 
the  political  enfranchisement  of  ever  larger  parts 
of  the  population,  the  concept  of  masses  also 
played  a  role  in  the  political  debate  about  the 
effects  of  universal  suffrage.  Earlier  critics  such  as 
Edmund  Burke  and  Alexis  de  Tocqueville  had 
primarily  raised  concerns  that  this  might  impair 
the  quality  of  political  leadership  and  governance. 
The  classic  elite  theorists  Vilfredo  Pareto,  Gaetano 
Mosca,  and  Robert  Michels  instead  argued  that 
democratization,  rather  than  increasing  the  politi¬ 
cal  influence  of  ordinary  citizens,  would  lead  to  a 
centralization  of  political  power  in  the  hands  of  a 
small  elite.  Mosca  and  Michels  explained  this  with 
the  inability  of  large  groups  to  organize  for  effec¬ 
tive  political  action  and  the  personal  and  organi¬ 
zational  advantages  enjoyed  by  the  elites.  During 
the  first  half  of  the  20th  century,  the  rise  of  com¬ 
munist  and  fascist  totalitarian  mass  movements 
showed  that  some  of  the  concerns  raised  by  the 
older  authors  were  not  unfounded.  However, 
when  the  success  of  representative  democracy 
proved  that  most  of  their  dire  predictions  did  not 
materialize,  the  preoccupation  with  the  presumed 
dangers  of  mass  politics  gradually  subsided. 
Today,  the  term  masses  is  mostly  used  inter¬ 
changeably  with  ordinary  citizens,  mass  publics, 
or  nonelites. 

In  his  2002  book  Against  the  Masses,  Joseph 
Femia  analyzed  three  major  lines  of  reasoning 
against  mass  democracy: 

1.  The  perversity  thesis  holds  that  democratization 
is  not  capable  of  achieving  its  main  objective  of 
transferring  political  power  from  a  small 
aristocracy  to  the  majority  of  the  people  but 
will  instead  achieve  the  opposite — namely,  a 
centralization  of  power  that  makes  popularly 
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elected  elites  even  more  dominant  than  the  old 
aristocracy  ever  was. 

2.  The  futility  thesis  emphasizes  the  inevitability  of 
elite  rule  and  argues  that  the  natural  inequality 
of  human  beings  and  the  functional  necessities 
of  organizations  will  thwart  any  attempt  to 
empower  the  masses. 

3.  The  jeopardy  thesis  finally  points  to  the  conflict 
between  democracy  on  one  side  and  other 
important  social  values,  in  particular  liberty, 
social  cohesion,  and  economic  development,  on 
the  other. 

While  the  proponents  of  the  futility  thesis 
shared  a  pessimistic  view  of  human  nature  and 
expected  that  both  democratic  elites  and  masses 
pursued  particularistic  interests,  proponents  of  the 
perversity  and  jeopardy  theses  were  critical  of  the 
political  changes  brought  about  by  democratization. 
They  believed  in  the  superiority  of  the  traditional 
republican  model  of  government  by  a  deliberative 
representative  assembly  elected  by  limited  suf¬ 
frage,  whose  members  are  primarily  motivated  by 
the  intention  to  serve  their  country.  They  were 
therefore  afraid  that  universal  suffrage  would 
require  the  formation  of  organized  political  parties 
and  interest  groups  that  would  primarily  pursue 
their  particularistic  interests.  The  observed  short¬ 
comings  of  representative  democracy  were  thus 
compared  with  an  idealized  model  of  a  predemo- 
cratic  political  order  that  had  rarely  existed  in 
reality. 

The  critics  of  mass  democracy  also  emphasized 
the  inability  of  masses  to  understand  the  com¬ 
plexities  of  political  life  as  well  as  their  lack  of 
motivation  to  devote  much  time  to  public  affairs. 
They  argued  that  the  classic  conception  of  democ¬ 
racy  as  self-government  by  the  people  made  unre¬ 
alistic  assumptions  about  the  masses.  Joseph 
Schumpeter  and  others  took  up  these  arguments 
and  developed  a  more  realistic  theory  of  represen¬ 
tative  democracy  that  acknowledges  the  limited 
political  role  of  ordinary  citizens  and  instead  relies 
on  competitive  elections  and  institutional  con¬ 
straints  for  enforcing  the  political  accountability  of 
elected  leaders.  Under  these  relaxed  assumptions, 
most  of  the  problems  predicted  by  the  critics  of 
mass  democracy  turned  out  to  be  much  less  severe 
in  practice  than  in  theory. 


While  the  simple  dichotomy  of  elites  and  masses 
made  by  the  classics  of  elite  theory  has  given  way 
to  more  differentiated  analyses  that  treat  neither 
elites  nor  masses  as  unitary  actors,  three  basic 
theoretical  questions  raised  by  these  writers  are  of 
continuing  relevance.  These  are  the  motivation 
and  qualification  of  mass  publics  to  participate  in 
public  affairs,  their  susceptibility  to  manipulation 
by  elites,  and  finally  the  ways  in  which  citizen 
demands  are  transmitted  into  the  political  decision¬ 
making  process. 

Political  Involvement  and  Political  Beliefs 

Michels  and  Schumpeter  were  the  first  to  provide 
an  in-depth  analysis  of  the  division  of  labor 
between  politicians  and  ordinary  citizens  in  repre¬ 
sentative  democracies.  They  argued  that  the  over¬ 
whelming  majority  of  citizens  are  only  marginally 
involved  in  political  affairs.  While  Michels  tried  to 
demonstrate  the  existence  of  an  iron  law  of  oligar¬ 
chy  governing  all  organizational  life,  Schumpeter 
developed  a  new  model  of  democracy  as  electoral 
competition  among  political  parties,  thereby 
attempting  to  reconcile  democracy  and  elite  rule. 
Both  assumed  a  division  of  labor  among  voters 
and  elected  representatives,  thereby  claiming  an 
active  role  for  politicians  in  determining  the  politi¬ 
cal  agenda  and  a  largely  passive  role  for  citizens. 

Public  opinion  research  has  confirmed  that  the 
number  of  citizens  who  are  continually  involved  in 
public  affairs  is  relatively  small.  Philip  Converse 
and  others  have  repeatedly  claimed  that  only  about 
10%  to  20%  of  citizens  possess  a  high  level  of 
political  sophistication.  However,  empirical 
research  has  also  demonstrated  the  existence  of  a 
large  degree  of  variation  in  political  interest  and 
involvement  among  citizens,  ranging  from  the 
politically  apathetic  to  well-informed  political  activ¬ 
ists.  It  is,  therefore,  more  appropriate  to  assume  the 
existence  of  a  hierarchy  of  political  involvement 
rather  than  a  dichotomy  of  political  elites  and  mass 
publics.  Finally,  public  opinion  surveys  in  consoli¬ 
dated  democracies  have  also  provided  evidence  on 
differences  in  political  values  and  beliefs  between 
political  elites  and  mass  publics.  While  both  elites 
and  citizens  overwhelmingly  support  fundamental 
principles  of  democracy — such  as  general  elections 
and  free  speech — support  for  minority  rights,  civil 
liberties,  limits  on  governmental  power,  or  the 
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right  of  due  process  are  much  lower  among  mass 
publics  than  among  elites.  These  differences  are 
especially  large  where  respondents  are  confronted 
with  a  choice  between  those  rights  and  other 
highly  valued  political  goals  such  as  public  safety 
or  economic  well-being.  As  noted  by  Ian  McAllister 
(1991),  the  political  issue  attitudes  of  elites  are 
finally  more  polarized,  especially  when  they  are 
related  to  the  traditional  cleavage  lines  of  a  party 
system. 

Based  on  the  limited  support  of  mass  publics  for 
democratic  values,  some  political  scientists,  most 
notably  Herbert  McClosky,  concluded  that  mass 
political  culture  is  of  only  minor  relevance  for  the 
consolidation  and  sustenance  of  democracy. 
However,  libertarian  value  orientations  are  con¬ 
siderably  higher  among  the  highly  educated  and 
politically  involved.  It  would  therefore  be  wrong 
to  assume  that  only  elites  can  be  considered  as  the 
carriers  of  the  democratic  creed,  as  McClosky  con¬ 
cluded.  Rather  than  relying  on  the  existence  of 
elite  competition  as  a  sufficient  barrier  against 
nondemocratic  tendencies,  the  role  of  active  citi¬ 
zens  as  political  watchdogs  against  elite  transgres¬ 
sions  should  not  be  underestimated.  They  provide 
the  crucial  link  between  political  leaders  and  pas¬ 
sive  citizens  and  are  therefore  indispensable  for 
enforcing  political  accountability. 

Psychological  Characteristics  of  Masses 

Cultural  critics  have  frequently  emphasized  that 
modernization  inevitably  leads  to  an  erosion  of 
traditional  social  bonds  of  family,  kinship,  ethnic¬ 
ity,  castes,  and  so  on  and  will  ultimately  result  in 
an  atomization  of  society.  Gustave  Le  Bon’s  still 
popular  work  on  the  psychology  of  crowds  is  an 
example  of  the  dangers  such  critics  have  associated 
with  the  twin  developments  of  modernization  and 
democratization.  Le  Bon  claimed  that  crowds — 
understood  as  large  gatherings  of  individuals — are 
susceptible  to  persuasion  by  political  agitators  and 
therefore  prone  to  participating  in  political  actions 
that  their  individual  members  would  never  con¬ 
sider  for  themselves.  He  emphasized  their  lack  of 
critical  judgment  and  their  potential  for  irrespon¬ 
sible  and  destructive  behavior. 

Some  of  Le  Bon’s  observations  have  been  partly 
confirmed  by  social  psychological  experiments 
dealing  with  the  phenomenon  of  risky  shift  in 


group  decision  making;  in  other  words,  a  tendency 
for  groups  to  engage  in  riskier  decisions  than  indi¬ 
viduals.  While  there  is  solid  empirical  evidence 
that  risky  shifts  do  indeed  occur,  this  is  not  neces¬ 
sarily  the  case.  Based  on  a  review  of  several 
decades  of  social  psychological  research,  Daniel 
Isenberg  concluded  that  group  decisions  are  influ¬ 
enced  by  a  variety  of  factors,  of  which  the  most 
important  were  the  initial  preferences  of  the  group 
members,  the  plausibility  of  the  arguments  brought 
forward  for  different  courses  of  action,  and  the 
social  status  of  the  individuals  advocating  different 
solutions.  Risky  shifts  can,  therefore,  only  be  con¬ 
sidered  as  a  special  case  of  choice  shifts  and  are  by 
no  means  the  inevitable  result  of  collective  decision 
making. 

Moreover,  although  collective  violence  by 
crowds  (e.g.,  lynchings)  and  political  riots  are  not 
uncommon,  such  incidents  are  nothing  peculiar  to 
modern  societies.  Le  Bon’s  assumption  that  even 
mass  electorates  and  parliaments  are  susceptible  to 
crowdlike  behavior  is  especially  vastly  overdrawn 
and  cannot  be  sustained  empirically.  Electoral 
research  has  instead  shown  that  voters  are  exposed 
to  a  variety  of  contradictory  influences  by  primary 
and  secondary  groups  as  well  as  the  mass  media. 
Moreover,  although  the  relevance  of  short-term 
factors  has  increased,  social  structural  (social  class, 
religion,  ethnicity)  and  psychological  (party  identi¬ 
fication)  commitments  continue  to  be  relevant 
determinants  of  voting  behavior.  Le  Bon  did  not 
adequately  take  into  account  the  persistence  of 
such  particularistic  loyalties  and  even  less  so 
the  capacity  of  individuals  to  form  independent 
opinions. 

Theories  of  Mass  Society 

In  his  1959  book  The  Politics  of  Mass  Society, 
William  Kornhauser  attempted  to  integrate  the 
assumptions  of  aristocratic  and  democratic  critics 
of  modern  society  into  a  comprehensive  theory  of 
society.  Based  on  their  patterns  of  elite-mass  rela¬ 
tions,  Kornhauser  distinguished  four  ideal  types  of 
society:  communal  (traditional),  pluralist,  mass, 
and  totalitarian.  The  bottom-up  perspective 
involves  the  accessibility  of  elites  for  mass  demands, 
the  top-down  perspective  the  availability  of  non- 
elites  for  elite  domination  and  manipulation.  This 
yields  a  fourfold  table  (Table  1). 
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Table  I  Four  Ideal  Types  of  Society 


Accessibility 
of  Elites 

Availability  of  Nonelites 

Low 

High 

Low 

Communal 

Totalitarian 

society 

society 

High 

Pluralist  society 

Mass  society 

Source:  Adapted  from  Kornhauser,  W.  (1959).  The  politics  of 
mass  society  (p.  40).  Glencoe,  IL:  Free  Press. 


Kornhauser  argued  that  pluralist  society  is  char¬ 
acterized  by  a  high  degree  of  responsiveness  of 
elites  to  citizen  demands  as  well  as  a  high  degree  of 
elite  autonomy  from  public  pressures.  Citizen 
demands  are  transmitted  into  the  political  deci¬ 
sion-making  arena  through  a  dense  web  of  inter¬ 
mediary  associations  (political  parties,  labor 
unions,  etc.)  via  institutionalized  channels  of  inter¬ 
est  articulation  rather  than  through  direct  mass 
action.  Mass  society  is  instead  characterized  by  a 
lack  of  voluntary  associations,  which  implies  that 
elite  and  nonelite  interactions  become  precarious. 
Elites  are  put  under  undue  pressure  to  accede  to 
mass  demands  articulated  by  direct  political  action, 
while  citizens  become  objects  of  elite  manipula¬ 
tion.  Although  Kornhauser  emphasized  the  dis¬ 
tinction  between  mass  society  and  totalitarian 
society,  he  argued  that  mass  society  is  vulnerable 
to  totalitarianism  because  atomized  individuals 
have  no  means  of  influencing  politics  other  than 
through  participation  in  direct  political  action. 

Kornhauser’s  theory  is  but  one  example  of  a 
long-standing  tradition  of  theories  emphasizing 
the  crucial  function  of  intermediary  associations 
for  societal  integration,  ranging  from  theories  of 
pluralism  to  more  recent  approaches  such  as  com- 
munitarianism  or  theories  of  civil  society  or  social 
capital.  Many  of  these  writers  have  also  warned  of 
the  dangers  associated  with  the  demise  of  interme¬ 
diary  organizations.  However,  such  diagnoses 
have  frequently  been  preoccupied  with  the  decline 
of  specific  associational  types,  without  considering 
that  new  types  of  organizations  may  already  be  on 
the  rise  and  that,  rather  than  leading  to  social 
atomization,  the  eclipse  of  traditional  organiza¬ 
tions  may  simply  indicate  their  historical  obsoles¬ 
cence.  In  this  vein,  the  abolition  of  compulsory 
guild  membership  in  West  European  countries 


during  the  period  of  industrialization  paved  the 
way  for  the  formation  of  a  wealth  of  business  and 
professional  associations  as  well  as  labor  unions. 
Moreover,  the  expansion  of  the  suffrage  in  the 
19th  century  required  the  formation  of  political 
parties  as  instruments  for  mobilizing  political  sup¬ 
port  among  newly  enfranchised  voters.  As  pluralist 
theorists  later  showed,  rather  than  becoming 
atomized,  industrial  societies  were  characterized 
by  the  existence  of  an  intense  network  of  interme¬ 
diary  associations.  Likewise,  preoccupation  with 
the  current  decline  in  membership  among  tradi¬ 
tional  mass  membership  organizations  such  as 
political  parties  and  labor  unions  tends  to  neglect 
or  underestimate  the  rise  of  new  types  of  associa¬ 
tions  (e.g.,  nongovernmental  organizations,  third- 
sector  organizations)  that  may  serve  as  functional 
equivalents.  The  fact  that  they  are  different  does 
not  necessarily  imply  that  they  are  incapable  of 
fulfilling  the  functions  of  their  older  counterparts. 

Continued  Relevance  or 
Obsolescence  of  the  Concept? 

In  emphasizing  the  inevitability  of  power  and 
elites,  the  classic  elite  theorists  replaced  the  dichot¬ 
omous  class  model  of  Marxism  by  an  equally 
crude  dichotomy  between  elites  and  masses. 
Nevertheless,  some  of  their  insights  are  of  continu¬ 
ing  relevance  and  have  contributed  to  the  develop¬ 
ment  of  a  revised  model  of  democratic  politics. 
Representative  democracy  has  institutionalized  a 
pluralist  elite  structure  with  competing  political 
parties,  thus  enforcing  political  accountability  of 
elected  representatives.  Likewise,  nonelites  are  not 
necessarily  tantamount  to  atomized  masses. 
Modern  democracies  are  instead  characterized  by 
the  existence  of  a  multitude  of  voluntary  associa¬ 
tions  and  private  interest  groups  performing  a 
crucial  linkage  function  between  elites  and  non- 
elites.  While  their  leaders  belong  to  the  elites,  they 
are  internally  stratified  according  to  the  degree  of 
involvement  of  their  members. 

Moreover,  empirical  research  has  demonstrated 
that  the  assumption  that  mass  electorates  are  char¬ 
acterized  by  low  levels  of  political  sophistication 
and  involvement  in  public  affairs  and  susceptibility 
to  manipulation  by  populist  elites  grossly  misrep¬ 
resents  the  reality  of  modern  democracies,  even 
though  most  voters  may  not  live  up  to  the  classic 
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ideal  of  citizenship.  Simply  comparing  elites  and 
masses  therefore  provides  a  simplified  portrait  of 
modern  democracies  that  disregards  the  complex 
structure  of  both  elites  and  mass  publics. 

Ursula  Hoffmann-Lange 
University  of  Bamberg 
Bamberg,  Germany 

See  also  Civil  Society;  Communitarianism;  Elites; 
Pluralism;  Political  Culture;  Representation;  Social 
Capital 
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Matching  is  a  statistical  method  that  can  be  used 
to  estimate  quantities  of  interest  that  depend  on 
missing,  that  is,  unobserved,  values  of  some  vari¬ 
able  Y.  (As  is  made  clear  later  in  this  entry,  the 
variables  with  missing  values  in  causal  inference 
applications  are  subtly  different  from  the  observed 
outcome  variable,  which  is  commonly  referred  to 
as  Y.)  Schematically,  matching  works  as  follows. 
For  each  observation  with  a  missing  value  of  Y, 
find  another  observation  that  does  not  have  a 
missing  Y  value  but  that  is  otherwise  maximally 
similar  to  the  initial  observation  in  question.  This 


similar  observation  is  said  to  match  the  observa¬ 
tion  with  the  missing  Y  value.  Now  use  the 
observed  Y  value  from  the  matched  observation  to 
fill  in  the  missing  Y  value.  Matching  can  be  done 
by  selecting  matching  observations  with  or  with¬ 
out  replacement  from  the  original  dataset.  It  is 
also  possible  to  match  many  observations  to  a 
single  missing  observation,  in  which  case  the  mean 
of  Y  from  the  matching  observations  is  typically 
used  to  fill  in  the  missing  Y  value. 

Matching  can  be  applied  to  a  variety  of  missing 
data  problems — from  estimating  the  population 
mean  of  Y  to  estimating  causal  effects.  Examples 
below  make  this  clearer.  Matching  is  not  a  pana¬ 
cea.  Matching  methods  rely  on  assumptions  that 
can  only  be  tested  given  auxiliary  data  and/or 
assumptions.  The  key  assumptions  of  conditional 
ignorability  and  overlap  are  discussed  later  in  this 
entry.  There  are  a  wide  variety  of  ways  that  match¬ 
ing  can  be  implemented  by.  A  discussion  of  par¬ 
ticular  matching  methods  and  their  statistical 
properties  is  beyond  the  scope  of  this  entry. 

Examples 

The  easiest  way  to  begin  to  understand  how 
matching  works  is  to  walk  through  some  relatively 
simple  examples. 

Estimating  a  Population  Mean 

Consider  a  situation  where  we  are  interested  in 
estimating  the  fraction  of  Republicans  in  a  popula¬ 
tion.  We  sample  20  individuals  from  this  popula¬ 
tion  and  administer  a  face-to-face  survey.  The 
pollster  records  the  respondent’s  gender  (0  =  male, 
1  =  female),  race  (0  =  nomvhite,  1  =  white),  and 
partisanship  (0  =  non-Republican,  1  —Republican). 
All  respondents  report  their  gender  and  race  accu¬ 
rately;  however,  some  respondents  do  not  report 
their  partisanship.  Respondents  who  report  their 
partisanship  will  be  called  reporters  and  those 
who  do  not  report  their  partisanship  will  be 
called  nonreporters.  These  data  are  summarized 
in  Table  f. 

We  would  like  to  use  our  sample  data  to  esti¬ 
mate  the  fraction  of  individuals  in  the  population 
who  self-identify  as  Republicans.  The  simplest  way 
to  do  this  is  to  take  the  sample  average  of  the  par¬ 
tisanship  variable  among  the  reporters.  Doing  so, 
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Table  1 

Hypothetical  Missing  Data  Example  in  Which  One-to-One  Matching  With  Replacement  Is  Used  to  Estimate 
Average  Partisanship 

True 

Observation  Partisanship 

Observed 

Partisanship 

Imputed 

Partisanship 

Race 

Gender 

1 

0 

0 

0 

0 

2 

0 

0 

0 

0 

3 

0 

0 

0 

0 

4 

0 

0 

0 

0 

0 

5 

0 

0 

0 

1 

0 

6 

0 

1 

1 

0 

7 

1 

1 

1 

1 

0 

8 

1 

1 

1 

1 

0 

9 

1 

0 

1 

0 

10 

1 

1 

1 

0 

11 

0 

0 

0 

1 

12 

0 

0 

0 

1 

13 

0 

0 

0 

0 

1 

14 

0 

0 

0 

1 

15 

0 

0 

0 

1 

1 

16 

0 

0 

0 

1 

1 

17 

0 

1 

1 

1 

18 

1 

1 

1 

1 

1 

19 

1 

1 

1 

1 

1 

20 

1 

0 

1 

1 

Average 

0.35 

0.44 

0.35 

Note:  Sample  average  of  observed  partisanship  is  not  equal  to  the  sample  average  of  true  partisanship  because  of  missing  data. 
Using  matching  to  impute  partisanship  produces  a  sample  average  that  is  correct. 


we  would  estimate  that  44%  of  the  population  are 
Republican  identifiers.  Note  that  if  individuals 
who  are  more  likely  to  be  Republicans  answer  the 
partisanship  question  more  often,  this  simple 
approach  will  generally  yield  estimates  of 
Republican  partisanship  that  are  falsely  high. 
Looking  at  the  true  (but  partially  unobserved)  par¬ 
tisanship  of  each  individual  in  Table  1  and  the 
associated  sample  average,  we  see  that  the  simple 
approach  of  ignoring  the  missing  data  yields  an 
estimate  that  is  9  percentage  points  too  high. 

While  it  is  clearly  not  the  case  that  the  average 
partisanship  of  reporters  and  nonreporters  is 
equal — hence  the  9-percentage-point  difference — if 
it  is  the  case  that  (a)  the  distribution  of  reporters’ 


partisanship  is  equal  to  that  of  nonreporters  within 
each  race-gender  combination  and  (b)  each  race- 
gender  combination  that  has  nonreporters  also  has 
at  least  one  reporter,  then  one  can  use  matching  to 
produce  better  estimates  of  partisanship.  Condition 
(a)  is  sometimes  called  the  conditional  ignorability 
assumption,  and  condition  (b)  is  sometimes  called 
the  overlap  assumption. 

Looking  at  Table  1,  we  see  that  we  do  not 
observe  the  partisanship  variable  for  Observation  f . 
The  only  observation  that  has  the  same  race  and 
gender  values  as  Observation  f  and  that  has  an 
observed  partisanship  value  is  Observation  4.  We 
can  thus  match  Observation  4  to  Observation  f  to 
impute  Observation  f ’s  partisanship.  Observation  4 
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is  also  matched  to  Observations  2  and  3.  A  more 
interesting  situation  lies  in  how  to  deal  with  the 
missing  partisanship  of  Observation  6.  Here,  three 
other  observations — 5,  7,  and  8 — have  the  same 
race  and  gender  values  as  Observation  6.  Here,  we 
would  randomly  pick  one  of  these  three  observa¬ 
tions  to  be  the  match  for  Observation  6.  Doing 
this,  we  pick  Observation  8,  which  implies  that  the 
imputed  partisanship  for  Observation  6  is  1.  Note 
that  while  this  is  not  equal  to  Observation  6’s  true 
partisanship,  randomly  choosing  matches  from 
among  the  observations  with  1  =  race  and  0  = 
gender  and  observed  partisanship  will  be  correct  in 
expectation  because  of  the  conditional  ignorability 
assumption.  Thus,  while  individual-level  values 
may  be  incorrect,  sample  averages  will  be  correct 
in  expectation.  Imputing  data  in  this  way  is  some¬ 
times  referred  to  as  hot-deck  imputation.  We  con¬ 
tinue  to  impute  the  partisanship  variable  in  this 
way  until  it  is  completely  filled  in.  After  doing  this, 
we  see  that  the  sample  average  of  the  imputed  par¬ 
tisanship  variable  is  equal  to  the  true  sample  aver¬ 
age  of  0.35.  In  actual  applications,  the  average 
based  on  the  imputed  partisanship  variable  will 
only  equal  the  true  sample  average  in  expectation. 

Estimating  Average  Treatment  Effects 

Most  recent  applications  of  matching  in  the 
social  sciences  have  been  within  the  context  of 
estimating  average  treatment — that  is,  causal — 
effects.  To  see  how  matching  works  in  such  situa¬ 
tions,  we  consider  a  simple  example.  Let  X  =  0,  1 
denote  the  treatment  variable,  Z1  and  Z2  denote 
the  measured  confounders,  and  Y  denotes  the  out¬ 
come  variable.  We  are  interested  in  the  average 
treatment  effect  (ATE),  that  is,  the  difference 
between  the  average  value  of  Y  in  a  world  where 
all  units  get  treatment  (X  —  1)  and  the  average 
value  of  Y  in  a  world  where  all  units  do  not  get 
treatment  (control)  (X  =  0).  To  formalize  this,  let 
Y(l)  denote  the  Y  value  of  a  randomly  chosen  unit 
if  it  were  assigned  treatment  (X  =  1)  and  let  Y(0) 
denote  the  Y  value  of  a  randomly  chosen  unit  if  it 
were  assigned  to  the  control  condition  (X  —  0). 
The  ATE  is  E[Y(1)  -  Y(0)J  =  E[Y(1)J  -  E[Y(0)J. 
In  an  abuse  of  notation,  we  will,  at  some  point 
below,  refer  to  the  Y(0)  and/or  Y(l)  values  of  a 
particular  unit  without  subscripting  the  potential 
outcomes. 


If  one  is  willing  to  make  assumptions  that 
ensure  that  the  potential  outcomes  Y(l)  and  Y(0) 
are  well-defined,  then  the  estimation  of  the  ATE 
requires  solving  two  missing  data  problems  that 
are  analogous  to  the  simple  example  of  estimating 
a  population  mean  in  the  presence  of  missing  val¬ 
ues.  Note  that  we  need  to  estimate  E[Y(1)J  and 
E[Y(0)J  where  the  expectations  are  taken  over  all 
units  in  the  population.  We  get  to  observe  Y(l)  for 
units  that  received  treatment  but  we  do  not  get  to 
observe  Y(0)  for  these  units.  Conversely,  we  get  to 
observe  Y(0)  for  units  that  were  in  the  control 
group,  but  we  do  not  get  to  observe  Y(l)  for  these 
units.  Table  2  presents  a  simple  data  set  that  makes 
this  clearer.  Column  (a)  displays  the  true,  but 
unobserved,  potential  outcomes  for  all  units  in  the 
study.  Here,  we  see  that  the  average  value  of  Y 
under  treatment  is  0.85  and  the  average  value  of  Y 
under  control  conditions  is  0.70.  Thus,  the  true 
ATE  in  this  sample  is  0.15.  Life  is  complicated 
because  we  do  not  get  to  observe  all  of  the  infor¬ 
mation  in  Column  (a).  Instead,  we  only  get  to 
observe  the  information  in  Column  (b).  Again,  we 
can  match  units  that  have  maximally  similar  values 
of  the  measured  confounders — Z1  and  Z2 — to 
impute  the  missing  values  of  Y(0)  and  Y(l). 

To  see  how  this  works,  look  at  Unit  11,  which, 
having  received  treatment,  is  missing  its  Y(0) 
value.  Units  7,  8,  9,  and  10  received  control  and 
have  the  same  Z1  and  Z2  values  as  Unit  11.  They 
are  thus  potential  matches.  Half  of  these  units  have 
Y(0)  =  0  and  half  have  Y(0)  =  1.  There  should 
thus  be  a  50%  chance  that  Unit  ll’s  value  of  Y(0) 
is  0  and  a  50%  chance  it  is  1.  Suppose  we  match 
Unit  11  to  Unit  9.  Then  Y(0)  for  Unit  11  is  0.  This 
is  reported  in  Column  (c).  Again,  note  that  this  is 
not  equal  to  Unit  ll’s  true  value  of  Y(0)  that  we 
see  in  Column  (a).  However,  the  50%  chance  of 
being  matched  to  a  Y(0)  =  0  unit  is  correct  for  all 
units  with  X  =  1,  Z1  =  0,  and  Z2  =  1  (Units  11 
and  12),  which  keeps  the  sample  average  of  Y(0) 
among  the  Z 1  =  0,  Z2  =  1  units  correct  in  expec¬ 
tation.  Using  this  same  sort  of  matching  procedure 
to  fill  in  the  remaining  missing  Y(0)  values  and  the 
missing  Y(l)  values,  we  arrive  at  the  imputed 
potential  outcomes  in  Column  (c).  Taking  the 
averages  of  these  variables  and  then  taking  the  dif¬ 
ference  of  the  averages  gives  us  an  estimate  of  ATE 
that  is  in  line  with  the  true  value  reported  in 
Column  (a). 
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Table  2 

Hypothetical  Causal  Inference  Example  Where  One-to-One  Matching  Is  Used  to  Estimate  the  Average 
Treatment  Effect  (ATE) 

Unit 

(a) 

(b) 

(c) 

X 

Z1 

Z2 

True 

Y(0) 

True 

Y(l) 

Observed 

Y(0) 

Observed 

Y(l) 

Imputed 

Y(0) 

Imputed 

Y(l) 

1 

0 

1 

0 

0 

0 

0 

0 

0 

2 

1 

0 

1 

1 

1 

0 

0 

0 

3 

1 

0 

1 

1 

0 

0 

0 

0 

4 

0 

1 

0 

0 

1 

0 

0 

0 

5 

1 

0 

0 

1 

0 

1 

0 

0 

6 

0 

1 

1 

0 

1 

1 

0 

0 

7 

0 

1 

0 

0 

1 

0 

0 

1 

8 

1 

1 

1 

1 

1 

0 

0 

1 

9 

0 

1 

0 

0 

1 

0 

0 

1 

10 

1 

1 

1 

1 

1 

0 

0 

1 

11 

1 

1 

1 

0 

1 

1 

0 

1 

12 

0 

1 

1 

1 

1 

1 

0 

1 

13 

1 

1 

1 

1 

1 

0 

1 

1 

14 

1 

1 

1 

1 

1 

0 

1 

1 

15 

1 

1 

1 

1 

1 

1 

1 

1 

16 

1 

1 

1 

1 

1 

1 

1 

1 

17 

1 

1 

1 

1 

1 

1 

1 

1 

18 

1 

1 

1 

1 

1 

1 

1 

1 

19 

1 

1 

1 

1 

1 

1 

1 

1 

20 

1 

1 

1 

1 

1 

1 

1 

1 

Average 

0.70 

0.85 

0.60 

0.90 

0.70 

0.85 

Estimated 

ATE 

0.15 

0.30 

0.15 

Note:  Because  units  with  Y(  0)  =  0  are  more  likely  to  be  assigned  to  the  control  group  than  the  treatment  group  and  units  with 
Y(l)  =  0  are  also  more  likely  to  be  in  the  control  rather  than  the  treatment  group,  the  estimate  of  ATE  in  Column  (b)  is 
incorrect.  Imputing  data  via  matching  on  Z1  and  Z2  solves  this  problem  in  Column  (c). 


Why  Does  Matching  Work? 

As  noted  above,  standard  matching  methods  rely 
on  two  important  assumptions — conditional  ignor- 
ability  and  overlap.  Each  is  briefly  discussed  as 
follows. 

Understanding  the  Conditional 
Ignorability  Assumption 

There  are  multiple  versions  of  the  conditional 
ignorability  assumption.  Here  we  will  focus  on  a 


fairly  weak  version — conditional  mean  ignorability — 
that  is  relatively  easy  to  understand.  Conditional 
mean  ignorability  states  that,  among  observations 
with  the  same  values  of  measured  covariates,  the 
mean  of  the  missing  Y  values  is  the  same  as  the 
mean  of  the  observed  Y  values.  In  causal  inference 
applications  similar  to  that  represented  in  Table  2, 
missingness  is  completely  determined  by  the  treat¬ 
ment  assignment  X.  Thus,  conditional  mean  ignor¬ 
ability  also  implies  that,  among  units  with  the 
same  values  of  measured  covariates,  the  mean  of 
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the  potential  outcome  Y(l) — the  value  of  Y  under 
a  hypothetical  assignment  of  a  unit  to  the  treat¬ 
ment  (X  =  1)  condition — does  not  depend  on 
whether  a  unit  actually  received  treatment  or  con¬ 
trol.  The  same  is  true  for  the  mean  of  the  potential 
outcome  Y(0).  Assuming  conditional  mean  ignor- 
ability  allows  one  to  estimate  the  mean  of  Y  when 
some  Y  values  are  missing  because  it  states  exactly 
how  the  expected  value  of  the  missing  data  relates 
to  observed  values  of  Y. 

Understanding  the  Overlap  Assumption 

Of  course,  for  a  conditional  ignorability 
assumption  to  be  useful,  it  must  be  the  case  that 
observations  exist  with  observed  Y  values  that  are 
appropriate  matches  for  observations  with  missing 
Y  values.  The  overlap  assumption  is  a  formal 
statement  of  this  need  for  good  matches.  While  the 
examples  above  make  use  of  exact  matches,  it  is 
not  necessary  to  obtain  exact  matches  for  the 
matching  to  be  effective.  For  instance,  it  is  possible 
to  eliminate  bias  by  matching  on  a  unidimensional 
summary  of  the  relationship  between  measured 
covariates  and  the  missingness  pattern.  In  situa¬ 
tions  where  overlap  does  not  hold,  it  is  common — 
especially  in  causal  inference  applications — to 
change  the  quantity  of  interest  to  one  for  which 
there  is  good  overlap. 

Kevin  M.  Quinn 
University  of  California,  Berkeley 
Berkeley,  California,  United  States 

See  also  Causality;  Data,  Missing;  Selection  Bias 
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Maximum  likelihood  is  a  general  method  for  esti¬ 
mating  parameters  in  a  statistical  model.  Given  a 
known  probability  distribution  Y  with  known 
probability  density  function  (pdf),  assume  that  we 
have  a  random  sample,  yx, .  .  .  ,  yn  from  Y,  where 
0  is  an  unknown  population  parameter  associated 
with  Y.  The  likelihood  function  L(0)  is  the  prod¬ 
uct  of  the  pdf  for  each  value  evaluated  on  the  n 
sample  points: 

k(0)  =  /'(yi,....y„|0)  =  f{y\)f(yi)--f{yn)\§ 

=  n  /'(X/l©)- 

i'=l 

Maximum  likelihood  chooses  the  estimate  of 
the  parameter  0  that  maximizes  the  likelihood  of 
the  observed  data.  Joint  pdfs  and  likelihoods 
appear  to  be  quite  similar,  but  the  two  differ  in 
an  important  respect.  A  joint  pdf  is  a  function  of 
the  data  where  the  parameter  is  assumed  to  be 
known,  while  the  likelihood  is  assumed  to  be  a 
function  of  the  unknown  parameter  0  and  not 
the  data.  The  value  of  0  that  maximizes  the  like¬ 
lihood  function  is  the  maximum  likelihood  esti¬ 
mate  for  0.  Common  estimators  such  as  ordi¬ 
nary  least  squares  and  the  sample  mean  and 
proportion  are  in  fact  maximum  likelihood 
estimators.  Maximum  likelihood  possesses  a 
number  of  desirable  properties  that  account  for 
its  widespread  use  in  statistical  estimation. 
Maximum  likelihood  is  widely  used  in  political 
science  to  estimate  logit  and  probit  models, 
count  models,  and  event  history  or  survival 
models  among  others.  In  this  entry,  the  origins, 
properties,  and  possible  applications  of  this 
method  are  discussed. 

Origins 

R.  A.  Fisher  invented  the  method  of  maximum 
likelihood  in  a  series  of  papers  published  early 
in  the  20th  century.  Fisher’s  work  on  maximum 
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likelihood  began  with  his  derivation  of  the  princi¬ 
ple  of  “absolute  criterion”  in  a  paper  he  pub¬ 
lished  as  a  third-year  undergraduate.  While  this 
paper  contains  the  origins  of  maximum  likeli¬ 
hood  estimation  (MLE),  there  is  little  in  the 
paper  that  many  readers  would  recognize  as 
MLE.  In  later  papers,  he  developed  the  concept 
of  likelihood  as  distinct  from  probability.  Then, 
in  1922,  Fisher  united  several  earlier  streams  of 
his  research  and  was  the  first  to  use  the  term 
maximum  likelihood  for  a  class  of  estimators  as 
an  alternative  to  Bayesian  or  method  of  moments 
estimators.  In  the  same  paper,  Fisher  proposes 
that  maximum  likelihood  estimators  have  prop¬ 
erties  of  efficiency,  sufficiency,  and  consistency. 
Later  work  by  other  statisticians  would  establish 
the  properties  of  MLE  more  rigorously. 

A  simple  example  is  helpful  for  understanding 
the  principles  of  MLE.  Let  us  say  we  wish  to  esti¬ 
mate  the  sample  proportion  for  a  set  of  data. 
Assume  we  have  a  random  sample  of  data  yv  , 
yn  with  n  observations  randomly  drawn  from  a 
binomial  distribution  with  common  parameter  p, 
where  0  <P<  1  and  each  y  is  either  1  for  success 
or  0  for  failure.  For  these  n  independent  and  iden¬ 
tically  distributed  variables  yv  ... ,  yn ,  the  density 
of  each  observation  is 

We  next  write  the  likelihood  function,  which  is 
the  density  evaluated  at  the  data  as  a  function  of 
the  parameter  p.  However,  because  the  binomial 
coefficient  does  not  depend  on  the  parameter  of 
interest,  p,  we  can  omit  it  from  our  derivation  of 
the  maximum  likelihood  estimator.  The  likelihood 
function  is  the  product  of  the  individual  densities 
for  each  observed  data  point: 


P(Y i  =  yu  Y2  =  y2,-..,  Yn  =  yn) 

(i) 

X  •  •  •  Xpy"(  1  -  p)l~y" 

(2) 

n  p”<  i  -  ?)"■”. 

(3) 

i=i 


Due  to  the  independence  of  observations,  we 
can  write  the  likelihood  as  the  product  of  the 
n  binomial  densities.  The  maximum  likelihood 


estimate  for  p  is  the  value  of  p  that  maximizes 
this  likelihood  function.  To  find  the  maximum, 
we  take  the  first  derivative  of  the  likelihood 
function,  set  the  derivative  equal  to  zero,  and 
solve  for  p.  It  is  often  easier  to  work  with  the 
logarithm  of  the  likelihood  function.  Since  the 
likelihood  function  is  a  monotonic  function, 
taking  the  log  of  it  will  not  affect  the  estimate 
of  p.  The  log-likelihood  function  takes  the 
following  form: 

n  n 

In  Up)  =  X>ln p  +  n  -  £yln(l  -  p). 

i=  1  /  =  1 

Differentiating  the  log-likelihood  with  respect 
to  p  returns, 


d 


t*  v — '  i-  v — '  / 

-iz  =  lj-  - » -  l  y  7 

dp  i= i  p  i=  i  \1 


We  set  the  derivative  of  the  log-likelihood  equal 
to  zero  and  solve: 


Zn 

_ i=i 

1  ~P 


-0, 


The  maximum  likelihood  estimate  for  p  then  is 
simply  the  proportion  of  successes  we  observe  in 
the  sample. 


Properties  of  Maximum 
Likelihood  Estimators 

Maximum  likelihood  estimators  have  a  number  of 
desirable  properties.  The  properties  were  outlined 
by  Fisher  in  an  informal  way  but  were  not  proven 
rigorously  until  later.  The  properties  of  maximum 
likelihood  estimators  require  a  set  of  regularity 
conditions. 

These  regularity  conditions  are  as  follows: 

•  The  values  of  y  for  which  f(y  10)  >  0  (i.e.,  the 
sample  space)  do  not  depend  on  0. 

•  fly  10)  is  twice  differentiable  with  respect  to 
0,V0  €  ©  C  Rk. 

•  The  information  matrix  is  positive  definite  and 
bounded. 
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•  The  first  three  partial  derivatives  of  the 
likelihood  function  with  respect  to  0  are 
bounded,  and  the  bounds  on  the  third  such 
derivative  do  not  depend  on  0. 

Subject  to  these  regularity  conditions,  the  prop¬ 
erties  of  maximum  likelihood  estimators  come  in 
two  forms:  finite  sample  and  asymptotic  proper¬ 
ties.  The  finite  sample  properties  hold  regardless  of 
the  sample  size  used  for  estimation.  Asymptotic 
properties  hold  as  the  sample  size  increases.  The 
finite  sample  properties  of  maximum  likelihood 
estimators  are  the  following: 


estimator  for  the  sample  proportion.  One  might 
ask  whether  it  is  a  minimum  variance  unbiased 
estimator.  We  need  to  compare  the  relevant  quan¬ 
tity  from  the  MLE  for  the  proportion  with  the 
Cramer-Rao  lower  bound.  The  second  derivative 
of  the  log-likelihood  for  the  sample  proportion  is 

d2\nf(y,p)  =  _y_  _  1  -  y 

d2p  p2  (1  ~pf 

The  expected  value  of  this  second  derivative  is 
the  following  quantity: 


•  Maximum  likelihood  estimators  are  invariant  to 
reparameterization.  The  invariance  property 
ensures  that  the  MLEs  for  a2  are  equal  to  the 
square  root  of  the  MLE  for  c r. 

•  In  a  finite  sample,  if  a  minimum  variance 
unbiased  estimator  (MVUE)  exists,  then  the 
method  of  MLE  chooses  it.  For  an  MVUE,  its 
variance  is  said  to  equal  the  Cramer-Rao  lower 
bound.  For  example  let  0  be  an  unbiased 
estimator.  If  it  is  an  MVUE,  the  following  will 
be  true  for  its  variance: 


Var(0) 


d1  In  f  (y,  0) 
dzQz 


The  asymptotic  properties  of  the  MLE  are  as 
follows: 


•  It  is  consistent;  that  is, 

lim  P( |0„  —  0|  <  e)  =  I,  Vs;  >  0.  In  other 

words,  the  sampling  distribution  for  the  MLE  of  0 
collapses  to  a  spike  at  0  as  n  — »  °°. 

•  It  is  asymptotically  normal.  The  sampling 
distribution  of  0  converges  to  the  normal 
distribution  as  n  — >  °°. 

•  It  is  asymptotically  efficient.  For  large  n,  the 
standardized  distribution  of  0„  has  variance 
equal  to  the  Cramer-Rao  lower  bound. 

Therefore,  the  maximum  likelihood  estimator, 
compared  with  any  other  consistent  and 
asymptotically  normal  estimator,  has  a  smaller 
asymptotic  variance. 

As  an  example,  let  us  return  to  the  maximum 
likelihood  estimator  for  the  sample  proportion. 
One  can  easily  prove  that  it  is  an  unbiased 


r<92  In  L(p)~\  =  _p^  _  1  ~  p  =  1 

L  dzp  \  P2  (1  -p)2  p(i-py 

We  next  take  the  inverse  and  multiplying  by  —n: 


1  _P(1~P) 


This  quantity  is  equal  to  the  variance  for  maxi¬ 
mum  likelihood  estimator  for  the  sample  propor¬ 
tion.  Therefore  the  maximum  likelihood  estimator 
for  the  sample  proportion  is  not  only  unbiased  but 
no  other  estimator  for  the  proportion  has  a  smaller 
variance. 

Other  than  ordinary  least  squares,  which  is  a 
special  case  of  maximum  likelihood,  no  other  esti¬ 
mator  is  more  widely  used  in  political  science. 
MLE  is  especially  important  for  the  analysis  of 
categorical  variables.  For  example,  vote  outcomes 
are  quite  often  measured  with  nominal  variables 
that  record  which  candidate  or  political  party 
receives  an  individual’s  vote.  Such  variables  are 
analyzed  with  models  where  maximum  likelihood 
is  the  usual  estimation  method.  Other  types  of 
political  variables  typically  analyzed  with  models 
that  use  maximum  likelihood  for  estimation  are 
counts  of  political  conflict,  length  of  political  con¬ 
flict,  duration  of  governments,  and  voter  turnout 
among  others.  In  almost  every  area  of  political 
science,  analysts  will  encounter  categorical  vari¬ 
ables  that  are  often  analyzed  with  models  esti¬ 
mated  by  maximum  likelihood.  Even  for  analysts 
that  embrace  the  Bayesian  tradition,  the  likeli¬ 
hood  function  remains  a  critical  part  of  their 
analyses.  Maximum  likelihood  is  also  used  for  the 
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estimation  of  propensity  scores,  which  are  impor¬ 
tant  for  matching  estimators. 

Luke  Keele 
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Columbus,  Ohio,  United  States 
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Measurement  may  be  defined  as  the  process  that 
connects  theoretical  concepts  with  empirical 
indicator(s)  designed  to  represent  those  concepts. 
As  such,  it  is  vitally  important  to  social  science 
research.  This  entry  presents  the  most  important 
properties  of  measurement,  validity,  and  reliabil¬ 
ity,  in  their  various  forms.  In  addition,  measure¬ 
ment  levels,  scales,  indices,  and  related  statistical 
techniques  are  briefly  discussed. 

It  has  been  argued  that  inadequate  measure¬ 
ment,  more  than  mistaken  concepts  or  faulty 
hypotheses,  has  hindered  progress  in  the  social  sci¬ 
ences.  The  sources  of  inadequate  measurement  are 
complex  but  may  be  rooted  in  an  oft-cited  defini¬ 
tion  of  measurement.  In  1951,  Stanley  Smith 
Stevens  characterized  measurement  as  assigning 


numbers  to  objects  or  events  according  to  particu¬ 
lar  rules.  This  is  in  itself  an  inadequate  conceptual¬ 
ization  of  the  process.  It  presents  measurement  as 
an  empirical,  almost  mechanistic  process  that 
overlooks  the  important  role  of  theory  in  social 
science  research.  Until  researchers  have  worked 
through  their  measurement  problems,  they  may 
not  really  have  a  theory  capable  of  generating  test¬ 
able  hypotheses.  The  difficult  process  of  measuring 
theoretical  concepts  can  make  theories  clearer, 
richer,  and  more  subject  to  empirical  investigation. 
A  fuller  and  more  appropriate  definition  of  mea¬ 
surement,  then,  is  the  representation  of  abstract, 
theoretical  concepts  with  concrete,  empirical  indi¬ 
cators.  This  is  accomplished  through  the  process  of 
construct  building  by  linking  abstract,  theoretical, 
and  unobservable  concepts  with  empirical  indica¬ 
tors  for  which  researchers  have  direct  observa¬ 
tions.  This  definition  involves  both  theoretical  and 
empirical  considerations.  Empirically,  the  focus  is 
on  the  observable  response — answers  on  a  ques¬ 
tionnaire,  observed  behavior  in  an  experiment, 
material  from  an  archive.  Theoretically,  the  inter¬ 
est  is  in  the  underlying  unobservable  (and  not 
directly  measurable)  concept  that  is  used  in  the 
explanation  of  some  social  phenomenon  and  is 
represented  by  the  response. 

When  the  relationship  between  the  theoretical 
concept  and  the  observable  response  is  strong, 
analysis  of  empirical  indicators  can  lead  to  useful 
inferences  about  the  relationships  among  the 
underlying  concepts  and  a  greater  understanding 
of  the  phenomenon  under  investigation.  When  the 
relationship  between  concept  and  indicator  is 
weak  or  faulty,  analysis  of  the  indicators  leads  to 
incorrect  inferences  and  misleading  conclusions 
concerning  the  underlying  concepts.  From  this  per¬ 
spective,  the  auxiliary  theory  specifying  the  rela¬ 
tionship  between  concepts  and  indicators  is  as 
important  to  social  research  as  the  substantive 
theory  linking  concepts  to  one  another. 

Measurement  issues  arise  in  many  contexts 
within  political  science.  For  example,  when  respon¬ 
dents  are  asked  in  public  opinion  surveys  which 
party  or  candidate  they  voted  for  in  the  previous 
election,  questions  of  measurement  are  immediately 
apparent.  Do  some  of  the  respondents  who  voted 
for  the  losing  party  or  candidate  systematically  mis- 
remember  and  claim  that  they  voted  for  the  win¬ 
ner?  If  so,  then,  as  we  will  see  below,  this  evidence 
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would  raise  serious  questions  about  the  validity  of 
recall  questions  that  purport  to  measure  previous 
voting  behavior.  Other  measurement  issues  arise 
with  instruments  such  as  that  used  in  the  Polity 
Project,  which  codes  the  authority  characteristics 
of  states  in  the  world  system  for  purposes  of  com¬ 
parative,  quantitative  analysis.  Coders  are  instructed 
to  assign  yearly  scores  to  all  major,  independent 
states  in  the  global  system,  measuring  features  of 
these  regimes  such  as  constraints  on  executive 
authority,  openness  of  political  competition,  and 
peaceful  changes  in  government.  Obviously,  a  cru¬ 
cial  concern  in  this  project  is  the  degree  to  which 
different  coders  assign  identical  or  widely  different 
scores  when  evaluating  these  characteristics  of  these 
countries,  an  issue  of  reliability  as  we  will  also  see 
below.  These  examples  illustrate  the  ubiquitous 
nature  of  measurement  in  the  social  sciences  gener¬ 
ally  and  political  science  in  particular. 

Properties  of  Measurement 

There  are  two  key  properties  of  measurement: 
validity  and  reliability.  Validity  is  the  most  impor¬ 
tant  property  of  measurement.  Validity  focuses  on 
whether  a  measure  represents  the  phenomenon  it  is 
claimed  to  measure.  This  is  fundamental  to  any 
inferences  that  can  be  drawn  about  the  relation¬ 
ships  between  the  theoretical  concepts.  If  empirical 
indicators  do  not  measure  the  theoretical  concept 
at  issue,  it  is  not  a  valid  measure  of  that  concept 
and  any  inferences  concerning  the  relationships 
between  concepts  will  be  problematic  if  not  down¬ 
right  wrong. 

Reliability  is  the  extent  to  which  an  experiment, 
test,  or  any  measuring  procedure  yields  the  same 
results  on  repeated  trials.  It  is  concerned  with  the 
consistency  of  a  measure  over  repeated  observa¬ 
tions.  Reliability  focuses  on  random  error — all  of 
the  chance  factors  that  confound  the  measurement 
of  any  phenomenon.  If,  on  the  one  hand,  an  indi¬ 
cator  is  a  reliable  indicator  of  a  theoretical  con¬ 
cept,  that  indicator  will  produce  consistent  results 
on  repeated  observations  because  the  random 
error  is  not  great  enough  to  cause  notable  fluctua¬ 
tion  from  one  observation  to  the  next.  On  the 
other  hand,  an  unreliable  indicator  will  produce 
inconsistent  readings  from  one  measurement  to 
another.  Thus,  the  greater  the  random  error,  the 
less  reliable  is  the  measure.  For  example,  if  an 


automobile  speedometer  registers  the  speed  of  the 
car  at  3  kilometers  more  than  the  true  speed  on 
Monday,  6  kilometers  less  than  the  true  speed  on 
Tuesday,  and  10  kilometers  more  than  the  true 
speed  on  Wednesday,  the  readings  of  the  car’s 
speed  are  being  affected  by  random  error  and  the 
reliability  of  the  speedometer  is  low. 

There  is  a  second  basic  type  of  error  that  affects 
empirical  measurements:  nonrandom  error.  Non- 
random  error,  or  systematic  error,  has  no  effect  on 
a  measure’s  reliability.  For  example,  if  an  automo¬ 
bile  speedometer  always  registers  the  speed  of  the 
auto  as  5  kilometers  per  hour  more  than  the  true 
speed,  it  is  a  reliable  or  consistent  indicator.  The 
error  in  this  case  is  not  random,  indeed,  it  is  quite 
nonrandom — it  is  consistently  5  kilometers  per 
hour  too  high  every  time.  In  this  case,  the  measure 
is  perfectly  reliable,  but  it  is  not  a  valid  measure.  It 
does  not  measure  the  true  speed  of  the  automobile. 
The  scale  does  not  measure  what  it  is  intended  to 
measure. 

Validity 

Validity  is  the  extent  to  which  any  measuring 
instrument  measures  what  it  purports  to  measure 
rather  than  reflecting  some  other  phenomenon, 
some  source  of  nonrandom  measurement  error. 
The  use  to  which  one  puts  the  test  must  always  be 
considered  when  assessing  validity.  That  is,  an  eye 
examination  may  be  valid  for  determining  the 
quality  of  one’s  vision,  but  it  will  not  be  valid  at  all 
for  determining  one’s  body  temperature.  Validity 
is  always  an  argument  between  competing  theo¬ 
retical  claims.  Because  of  this,  what  is  validated  is 
not  the  instrument  itself  but  the  instrument  in  rela¬ 
tion  to  the  purpose  for  which  it  is  being  used. 

There  are  several  types  of  validity  that  are 
appropriate  in  social  science  research.  Each  takes  a 
slightly  different  approach  in  assessing  the  degree 
to  which  a  measure  is  valid.  One  may  find  refer¬ 
ences  in  the  literature  to  internal  validity,  statistical 
validity,  construct  validity,  convergent  validity, 
discriminant  validity,  cross-validation,  face  valid¬ 
ity,  concurrent  validity,  external  validity,  content 
validity,  sampling  validity,  criterion  validity,  pre¬ 
dictive  validity,  and  empirical  validity.  Some  of 
these  types  of  validity  overlap.  Face  validity  is 
sometimes  discussed  as  a  separate  type  of  validity 
and  sometimes  as  a  subtype  of  content  validity. 
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Some  are  the  same  only  with  different  names,  for 
example,  criterion-related  and  empirical  validity 
are  used  to  mean  the  same  thing.  Some  are  used  to 
denote  subtypes  of  a  main  type  of  validity.  For 
example,  both  concurrent  validity  and  predictive 
validity  are  subtypes  of  criterion-related  validity. 
Convergent  validity,  discriminant  validity,  and 
cross-validation  are  used  to  denote  types  of  con¬ 
struct  validity.  This  discussion  presents  the  three 
most  basic  types  of  validity — content  validity,  cri¬ 
terion-related  validity,  and  construct  validity — and 
their  relevance  to  social  science. 

But  before  discussing  these  three  types  of  valid¬ 
ity,  we  first  need  to  consider  the  difference  between 
internal  and  external  validity.  Internal  validity 
concerns  the  extent  to  which  causal  inferences 
from  a  given  set  of  cases  are  correct  for  those  cases. 
In  experimental  research,  the  question  of  internal 
validity  relates  to  whether  the  experimental  treat¬ 
ment  made  a  difference  in  this  specific  experimen¬ 
tal  instance.  In  nonexperimental  research,  the 
question  focuses  on  whether  the  independent  vari¬ 
able  causes  a  given  change  in  the  dependent  vari¬ 
able.  In  both  types  of  research,  the  fundamental 
issue  related  to  internal  validity  is  whether  there  is 
a  strong  logical  and  empirical  basis  for  establishing 
causal  inferences.  External  validity ,  by  contrast, 
focuses  on  generalizability — that  is,  the  extent  to 
which  causal  inferences  about  a  given  set  of  cases 
can  be  applied  to  other  cases.  External  validity  con¬ 
cerns  what  populations,  settings,  treatment  vari¬ 
ables,  and  measurement  variables  the  established 
causal  inference  apply  to.  From  the  perspective  of 
this  distinction,  all  the  three  types  of  validity  dis¬ 
cussed  below  relate  to  internal  validity. 

Content  Validity 

Content  validity  focuses  on  the  extent  to  which 
a  particular  empirical  measure  reflects  a  specific 
domain  of  content.  That  is,  does  the  set  of  items 
adequately  and  comprehensively  represent  what  it 
is  supposed  to  measure?  The  items  are  said  to  be 
content  valid  if  they  reflect  this  full  domain  of  con¬ 
tent.  For  example,  a  driver’s  test  that  consisted 
only  of  right  turns  and  excluded  left  turns,  park¬ 
ing,  stopping,  and  an  understanding  of  traffic  sig¬ 
nals  would  not  be  content  valid. 

Obtaining  content  validity  involves  two  inter¬ 
related  steps.  First,  the  researcher  must  be  able  to 


specify  the  entire  domain  of  content  that  is  relevant 
to  a  particular  measurement  situation.  In  the 
example  of  the  driver’s  test,  everything  that  one 
needs  to  know  to  operate  an  automobile  safely  and 
legally  is  contained  in  the  state’s  driver’s  manual. 
This  is  the  domain.  It  is  much  more  difficult  to 
demarcate  the  full  domain  of  social  science  con¬ 
cepts.  Take,  for  example,  the  concept  of  democ¬ 
racy.  Minimally,  the  researcher  would  need  to 
include  an  indicator  that  represents  free,  fair,  and 
competitive  elections.  But  it  might  also  need  to 
include  an  indicator  of  a  free  press,  which  is  often 
considered  fundamental  to  democratic  rule. 

The  second  step  involves  selecting,  or  construct¬ 
ing,  the  specific  items  that  are  used  in  the  measure. 
For  example,  a  written  driver’s  test  contains  a 
sample  of  items  from  the  driver’s  manual.  In  this 
example,  specification  and  selection  procedures  are 
relatively  straightforward.  This  is  more  complex  in 
the  social  sciences.  Specification  of  the  domain  of 
content  for  abstract  concepts  such  as  ideology  or 
alienation  is  a  formidable  task.  One  would  begin  by 
consulting  the  literature  on  the  subject  to  come  to 
an  understanding  of  the  concept.  Once  the  researcher 
has  a  general  understanding,  the  researcher  would 
then  construct  items  that  reflect  the  meaning  of  par¬ 
ticular  aspects  of  the  phenomenon  under  study.  It  is 
impossible  to  state  a  general  rule  for  the  number  of 
items  that  should  be  included  to  represent  any  par¬ 
ticular  domain  of  content.  It  is  always  preferable  to 
include  too  many  indicators  because  deficient  items 
can  be  dropped,  while  it  is  much  harder  (and  some¬ 
times  impossible)  to  add  new  or  better  items  at  a 
later  stage  in  the  research. 

Establishing  a  content-valid  measure  of  a  con¬ 
cept  used  in  social  science  such  as  ideology  or 
alienation  is  a  very  difficult  task,  much  more  com¬ 
plex  than  developing  a  content-valid  measure  of 
driving  proficiency.  When  dealing  with  abstract 
concepts,  it  is  difficult  to  establish  the  domain  of 
content  relevant  to  the  phenomenon,  as  most  theo¬ 
retical  concepts  in  the  social  sciences  have  not  been 
described  with  the  required  exactness.  Further, 
when  measuring  most  concepts  in  the  social  sci¬ 
ences,  it  is  impossible  to  sample  content.  A 
researcher  chooses  one  or  a  set  of  items  that  are 
intended  to  reflect  the  content  of  a  given  theoreti¬ 
cal  concept.  Without  a  random  sampling  of  content, 
however,  it  is  impossible  to  ensure  the  representa¬ 
tiveness  of  the  particular  chosen  items. 


Measurement  1515 


Thus,  there  are  two  fundamental  limitations  of 
content  validity  as  applied  to  the  social  sciences. 
First,  the  domain  of  content  must  define  the  vari¬ 
able  of  interest.  But  as  easy  as  this  may  be  to 
achieve  with  regard  to  some  tests,  such  as  profi¬ 
ciency  tests,  it  is  extremely  difficult  to  accomplish 
for  more  abstract  phenomena  that  tend  to  charac¬ 
terize  the  social  sciences.  The  second  limitation  of 
content  validity  is  the  lack  of  agreed-on  criteria  for 
determining  the  extent  to  which  a  measure  has 
attained  content  validity.  This  leaves  the  researcher 
with  the  task  of  having  to  provide  a  plausible 
rationale  for  accepting  his  or  her  version  of  what 
constitutes  the  domain  of  content  and  for  believing 
that  the  items  included  in  the  measure  have  been 
satisfactorily  sampled.  Because  of  these  limita¬ 
tions,  content  validity  is  not  a  fully  satisfactory 
means  of  assessing  the  validity  of  social  science 
measures. 

Criterion-Related  Validity 

Criterion-related  validity  is  a  second  type  of 
validity,  and  it  is  more  closely  related  to  what  is 
usually  meant  in  everyday  usage  of  the  term.  This 
type  of  validity  concerns  the  correlation  between 
a  measure  and  some  criterion  variable  of  interest. 
Using  criterion-related  validity,  one  can  validate  a 
driver’s  test  by  demonstrating  that  the  test  is  a 
good  predictor  of  the  ability  of  a  well-defined 
group  of  subjects  to  drive  a  car.  Criterion-related 
validity  is  fully  determined  by  the  degree  of  cor¬ 
respondence  between  the  measure,  or  test,  and  its 
criterion.  If  the  correlation  is  high,  the  measure  is 
valid  for  that  criterion.  If  the  test  does  not  corre¬ 
late  significantly  with  the  criterion,  it  is  not  valid 
for  that  criterion  and  thus  useless  for  that  particu¬ 
lar  purpose.  The  higher  the  correlation,  the  more 
valid  is  a  measure  for  a  specific  criterion.  For 
criterion-related  validity,  this  is  all  that  matters.  It 
is  the  only  evidence  that  is  relevant.  It  does  not 
matter  if  the  test  makes  no  theoretical  sense  as  a 
predictor  of  the  criterion.  If  the  accuracy  of  one’s 
horseshoe  pitching  is  found  to  be  highly  corre¬ 
lated  with  college  success,  then  horseshoe  pitch¬ 
ing  would  be  a  valid  measure  for  predicting 
success  using  criterion-related  validity.  There  is 
also  no  single  validity  coefficient.  There  are  as 
many  coefficients  as  there  are  criteria  for  a  par¬ 
ticular  measure. 


Criterion-related  validity  can  be  differentiated 
into  two  types.  If  the  criterion  exists  in  the  present, 
then  one  can  assess  concurrent  validity  by  correlat¬ 
ing  the  measure  and  the  criterion  at  the  same  point 
in  time.  For  example,  a  verbal  report  of  voting 
behavior  could  be  correlated  with  participation  in 
an  election,  as  revealed  by  official  voting  records. 
Predictive  validity,  by  contrast,  concerns  a  future 
criterion  that  is  correlated  with  the  relevant  mea¬ 
sure.  Using  the  (formerly  known  as)  Scholastic 
Aptitude  Test  (SAT)  as  a  predictor  of  success  in 
college  is  an  example.  Scores  on  the  SAT  could  be 
correlated  with  a  student’s  subsequent  perfor¬ 
mance  in  college  to  demonstrate  the  predictive 
validity  of  the  SAT.  The  logic  of  concurrent  and 
predictive  validity  is  the  same.  The  only  difference 
between  them  concerns  the  current  or  future  exis¬ 
tence  of  the  criterion  variable. 

What  is  sometimes  overlooked  in  assessing  crite¬ 
rion-related  validation  procedures  is  that  the  scien¬ 
tific  and  practical  utility  of  criterion  validity  depends 
as  much  on  the  measurement  of  the  criterion  as  it 
does  on  the  quality  of  the  measuring  instrument 
itself.  For  example,  in  many  different  types  of  train¬ 
ing  programs,  much  effort  and  expense  goes  into 
the  development  of  a  test  for  predicting  who  will 
benefit  from  the  program  in  terms  of  subsequent  job 
performance.  However,  the  measurement  of  subse¬ 
quent  performance,  the  criterion,  is  often  given  very 
little  attention.  Job  performance  is  very  difficult  to 
assess.  Thus,  those  using  criterion-related  validation 
procedures  should  provide  independent  evidence  of 
the  extent  to  which  the  measurement  of  the  crite¬ 
rion  is  valid.  While  criterion  validation  is  intuitively 
appealing,  it  has  a  major  limitation  with  regard  to 
the  social  sciences.  For  many  if  not  most  measures 
in  the  social  sciences,  no  relevant  criterion  variables 
exist.  It  is  not  clear,  for  example,  what  an  appropri¬ 
ate  criterion  variable  would  be  for  political  ideol¬ 
ogy.  Thus,  criterion-related  validation  has  limited 
usefulness  in  the  social  sciences.  Further,  the  more 
abstract  the  concept,  the  more  difficult  it  is  to  find 
an  appropriate  criterion. 

Construct  Validity 

The  third  basic  type  of  validity  is  construct  valid¬ 
ity.  Construct  validity  is  concerned  with  the  rela¬ 
tionship  between  the  measure  under  consideration 
and  theoretical  expectations  about  that  measure. 
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Construct  validity  is  important  when  there  is  no 
universally  agreed-on  domain  of  content  for  the 
phenomenon  under  investigation  and  no  relevant 
criteria  for  its  assessment.  This  is  the  typical  situa¬ 
tion  in  the  social  sciences.  Thus,  construct  validity 
is  theory  driven.  Using  theory,  the  researcher  for¬ 
mulates  theoretical  predictions,  types,  directions, 
and  degrees  of  relationship  that  are  used  to  vali¬ 
date  the  construct  empirically.  If  the  empirically 
observed  outcomes  are  predicted  by  the  theory, 
then  the  measure  is  said  to  be  construct  valid.  This 
type  of  validity  is  more  pertinent  in  the  social  sci¬ 
ences  than  either  criterion-related  validity  or  con¬ 
tent  validity. 

There  are  three  distinct  steps  involved  in  con¬ 
struct  validation.  First,  the  theoretical  relationship 
between  the  concepts  themselves  must  be  specified. 
Second,  the  empirical  relationship  between  the 
measures  of  the  concepts  must  be  examined. 
Finally,  the  empirical  evidence  must  be  interpreted 
in  terms  of  how  it  clarifies  the  construct  validity  of 
the  particular  measure.  A  researcher  might  specify 
that  political  ideology  is  related  to  social  class,  such 
that  those  respondents  with  lower  social  status 
would  more  likely  be  leftist  in  political  orientation, 
whereas  those  with  higher  status  would  more  likely 
be  on  the  right.  If  income  was  employed  as  the 
measure  of  social  status,  and  attitudes  toward  the 
government  provision  of  health  care  were  used  as 
the  measure  of  political  ideology,  then  the  researcher 
could  calculate  the  relationship  between  these  vari¬ 
ables.  If  the  correlation  between  these  variables  was 
statistically  significant  in  the  expected  direction, 
then  this  correlation  would  constitute  one  piece  of 
evidence  supporting  the  construct  validity  of  the 
measure  of  political  ideology. 

Thus,  the  fundamental  feature  of  construct  vali¬ 
dation  is  theory.  There  must  be  a  theoretical 
framework  that  is  relevant  to  the  concept  or  it  will 
be  impossible  to  validate  the  measure.  Without 
this  theoretical  framework,  it  is  impossible  to  gen¬ 
erate  theoretical  predictions  that,  in  turn,  lead 
directly  to  empirical  tests  involving  a  measure  of 
the  concept.  What  is  required,  then,  is  that  the 
researcher  must  be  able  to  investigate  several  theo¬ 
retically  derived  hypotheses  involving  the  particu¬ 
lar  concept.  Construct  validity  is  not  established 
by  confirming  a  single  prediction  on  different 
occasions  or  confirming  many  predictions  in  a 
single  study.  Instead,  construct  validation  ideally 


requires  a  pattern  of  consistent  findings  involving 
different  researchers  across  different  studies  involv¬ 
ing  a  variety  of  diverse  but  theoretically  relevant 
variables.  Only  if  and  when  these  conditions  are 
met  can  one  speak  with  confidence  about  the  con¬ 
struct  validity  of  a  particular  measure. 

A  problem  exists  if  the  theoretically  derived  pre¬ 
dictions  and  the  empirical  relationships  are  incon¬ 
sistent  with  each  other — that  is,  the  evidence  rele¬ 
vant  to  construct  validity  is  negative.  This  negative 
evidence  typically  indicates  that  the  measure  lacks 
construct  validity.  That  is,  the  indicator  does  not 
measure  what  it  purports  to  measure — the  construct 
of  interest.  The  accumulation  of  negative  evidence 
leads  to  the  conclusion  that  the  measure  is  not  con¬ 
struct  valid  and  should  not  be  used  as  an  empirical 
representation  of  that  concept  in  future  research. 
Previous  research  using  that  particular  measure  of 
the  concept  is  also  called  into  doubt.  There  are, 
however,  other  conclusions  that  are  consistent  with 
this  sort  of  negative  evidence.  One  could  interpret  it 
as  meaning  that  the  theoretical  framework  used  to 
generate  the  empirical  predictions  is  incorrect,  that 
is,  the  theory  is  wrong.  Another  interpretation  is 
that  the  statistical  method  or  technique  used  to  test 
the  theoretically  derived  hypotheses  is  faulty  or 
inappropriate  or  that  the  researcher  could  be  using 
it  incorrectly.  Finally,  negative  evidence  could  be 
interpreted  as  being  due  to  the  lack  of  construct 
validity  or  the  unreliability  of  some  other  variable(s) 
in  the  analysis.  It  is  a  very  subtle  point,  but  when 
one  assesses  the  construct  validity  of  the  measure  of 
interest,  one  is  also  evaluating  simultaneously  the 
construct  validity  of  measures  of  other  theoretical 
concepts.  Thus,  it  could  be  the  case  that  the  con¬ 
struct  validity  of  the  measure  is  quite  high,  but  the 
measure  hypothesized  to  correlate  with  that  mea¬ 
sure  is  invalid. 

There  is  no  foolproof  procedure  for  determining 
which  one  (or  more)  of  these  interpretations  of 
negative  evidence  is  correct  in  any  given  instance. 
The  first  interpretation,  that  the  measure  lacks 
construct  validity,  becomes  increasingly  compel¬ 
ling  as  grounds  for  accepting  the  other  interpreta¬ 
tions  become  untenable.  To  the  extent  possible, 
one  should  assess  the  construct  validity  of  a  par¬ 
ticular  measure  in  situations  characterized  by  the 
use  of  strong  theory,  appropriate  methodological 
procedures,  and  other  well-measured  variables. 
Only  in  these  situations  can  one  confidently 
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conclude  that  negative  evidence  is  probably  due  to 
the  absence  of  construct  validity  of  a  particular 
measure  of  a  given  theoretical  concept.  One  can 
see  from  this  discussion  that  construct  validity  is 
the  most  appropriate  and  generally  applicable  type 
of  validity  used  to  assess  measures  in  the  social  sci¬ 
ences.  The  researcher  can  assess  the  construct 
validity  of  an  empirical  measurement  if  the  mea¬ 
sure  can  be  placed  in  theoretical  context.  That  is, 
this  type  of  validity,  unlike  other  types,  focuses  on 
the  extent  to  which  a  measure  performs  in  accor¬ 
dance  with  theoretical  expectations. 

Reliability 

Classical  Test  Theory 

Classical  test  theory  is  used  to  assess  random 
measurement  error.  By  determining  the  amount  of 
random  error,  one  can  estimate  reliability.  Random 
error  is  present  in  any  measure.  Indeed,  estimating 
random  error  and  eliminating  it  to  the  extent  pos¬ 
sible  is  fundamental  to  measurement.  In  classical 
test  theory,  the  observed  score  is  equal  to  the  true 
score  that  would  be  obtained  if  there  were  no  mea¬ 
surement  error,  and  a  random  error  component,  or 

X  =  t  +  e, 

where  X  is  the  observed  score,  t  is  the  true  score, 
and  e  is  the  random  error  component,  or  random 
disturbance. 

The  true  score  is  the  unobservable  quantity  that 
cannot  be  directly  measured.  Theoretically,  it  is  the 
average  that  would  be  obtained  if  a  phenomenon 
were  measured  repeatedly  over  an  infinite  number 
of  times.  The  random  error  component,  or  random 
disturbance,  indicates  the  differences  between 
observed  score  and  the  true  score.  For  example,  an 
individual’s  observed  score  may  be  a  little  above 
the  true  score  on  one  observation  and  a  little  below 
on  another. 

We  make  the  following  assumptions  about 
measurement  error  (Frederic  Lord  &  Melvin 
Novick,  1968,  p.  36): 

Assumption  1:  The  expected  error  score  is  zero, 

E{e)  =  0. 

Assumption  2:  The  correlation  between  the  true 

score  and  errors  is  zero,  p(fe)  =  0. 


Assumption  3:  The  correlation  between  the  errors 
on  one  measurement  and  the  true  score  on  a  second 
measurement  is  zero,  p,  f }  =  0. 

Assumption  4:  The  correlation  between  the  errors 
on  distinct  measurements  is  zero,  =  0. 

Next,  using  these  assumptions,  we  see  that  the 
expected  value  of  the  observed  score  is  equal 
to  the  expected  value  of  the  true  score: 
E(X)  =  E(t)  +  E(e).  And  following  Assumption 
1,  the  expected  value  of  e  is  zero  or  E(e)  =  0, 
then,  E(X)  -  E(f). 

This  formula  applies  to  repeated  measurements 
of  a  single  variable  for  a  single  person.  But  reli¬ 
ability  refers  to  the  consistency  of  repeated  mea¬ 
surements  across  persons  and  not  within  a  single 
person,  and,  therefore,  we  make  a  simple  transfor¬ 
mation.  We  rewrite  the  equation  for  the  observed 
score  so  that  it  applies  to  the  variance  of  the  single 
observed  score,  true  score,  and  random  error: 

Var(X)  =  Var(f  +  e)  —  Var(?) 

+  2Co  v(t,e)  +  Var(e). 

Since  Assumption  2  above  states  that  the  cor¬ 
relation  between  the  true  score  and  error  is  zero, 
then  2Cov(t,  e)  —  0,  and  Var(x)  =  Var  (t)  +  Var(e). 

So  the  observed  score  variance  equals  the  sum 
of  the  true  score  variance  and  the  random  error 
score  variance.  To  calculate  reliability,  we  com¬ 
pute  the  ratio  of  the  true  score  variance  to  the 
observed  score  variance: 

Px  -  Var(f)/Var(X). 

Thus,  px,  is  the  reliability  of  X  as  a  measure  of  T. 
Alternatively,  reliability  can  be  expressed  in  terms 
of  the  error  variance: 

px  =  1  —  [Var(c)/Var(X)]  =  reliability. 
Parallel  Measurements 

One  estimate  of  a  measure’s  reliability  can  be 
obtained  by  correlating  parallel  measurements. 
Two  measurements  are  defined  as  parallel  if  they 
have  identical  true  scores  and  equal  variances. 
Thus,  X  and  X'  are  parallel  if  X  =  t  +  e  and  X'  = 
t  +  e'  where  Var(e)  =  Var(e')  and  t  =  t.  Parallel 
measures  are  functions  of  the  same  true  score 
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where  the  differences  between  these  measures 
result  from  purely  random  error. 

Assessing  Reliability 

There  are  four  basic  methods  for  estimating  the 
reliability  of  empirical  measurements.  These  are 
the  retest  method,  the  alternative-form  method, 
the  split-halves  method,  and  the  internal  consis¬ 
tency  method. 

Retest  Method 

Perhaps  the  most  common  method  of  assessing 
reliability  is  to  correlate  the  same  measures  admin¬ 
istered  at  different  points  in  time.  Figure  1  shows  a 
representation  of  the  retest  method. 

The  equations  for  the  tests  at  Times  1  and  2  can 
be  written  thus: 

Xi  =  Xt  +  e  i, 

Xi  =  Xt  +  <?2- 

Since  in  parallel  measures  t  —  t  and 
Var(ei)  =  Var(c2),  and  by  Assumption  3, 
P (ei,t2)  =  0  and  by  Assumption  4,  p(eiei)  =  0,  it  fol¬ 
lows  that  px  =  pXt  Xi. 

So  reliability  in  this  case  is  the  correlation 
between  the  scores  on  the  same  test  obtained  at 
two  points  in  time.  If  the  retest  reliability  coeffi¬ 
cient  is  exactly  1.0,  the  results  on  the  two  admin¬ 
istrations  of  the  test  are  the  same.  However, 
because  there  is  almost  always  random  measure¬ 
ment  error,  the  correlations  across  time  will  not  be 
perfect. 


P(e,,e2) 


Xt  X2 


e1  e2 
Time  1  Time  2 

Figure  I  A  Representation  of  the  Retest  Method  for 
Estimating  Reliability 


Although  the  test-retest  method  is  a  simple  and 
intuitively  appealing  way  to  assess  reliability,  it 
has  some  serious  problems  and  limitations.  First, 
often  researchers  can  obtain  a  measure  of  a  given 
theoretical  concept  at  only  a  single  point  in  time. 
It  may  be  too  expensive  or  impractical  to  measure 
some  phenomenon  at  multiple  points.  And  if  the 
test-retest  correlations  are  low,  it  may  not  indi¬ 
cate  unreliability  but  the  possibility  that,  in  the 
interim  between  tests,  the  theoretical  concept  of 
interest  has  undergone  a  fundamental  change. 
Another  problem  that  affects  test-retest  correla¬ 
tions  and  lowers  reliability  estimates  is  reactivity. 
Sometimes  the  very  act  of  measuring  a  phenome¬ 
non  can  induce  change  in  the  phenomenon  itself. 
A  person  may  become  sensitized  to  the  concept 
being  measured  and  may  answer  the  question  on 
the  second  administration  of  the  test  based  only 
on  the  earlier  measurement.  Lowered  reliability 
estimates  are  not  the  only  effects.  If  the  period 
between  Time  1  and  Time  2  is  short  enough,  the 
participant  may  remember  his  or  her  answer  at 
Time  1  and  appear  more  consistent  than  the  he  or 
she  actually  is.  These  memory  effects  can  inflate 
reliability  estimates. 

Alternative-Form  Method 

Another  method  for  assessing  reliability  is  used 
widely  in  education  to  estimate  the  reliability  of 
all  types  of  tests.  It  is  called  the  alternative-form 
method  and  is  similar  to  the  retest  method  in  that 
it  requires  administering  a  test  to  the  same  people 
at  different  points  in  time.  The  difference  between 
this  method  and  the  retest  method  is  that  an  alter¬ 
native  form  of  the  same  test  is  given  on  the  second 
testing.  The  two  forms  of  the  same  test  are 
intended  to  measure  the  same  theoretical  concept. 
The  two  forms  should  not  differ  systematically 
from  each  other.  Using  random  procedures  to 
select  items  for  the  different  forms  of  the  test  can 
ensure  that  this  does  not  happen.  In  this  case,  the 
reliability  is  the  correlation  between  the  alterna¬ 
tive  forms  of  the  test.  The  two  forms  should  be 
administered  about  2  weeks  apart  to  allow  for 
day-to-day  fluctuations  in  the  individual  to  occur. 

The  alternative-form  method  for  assessing  reli¬ 
ability  is  superior  to  the  retest  method  because  it 
reduces  the  extent  to  which  an  individual’s  mem¬ 
ory  can  inflate  the  reliability  estimate.  Like  the 
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retest  method,  the  alternative-form  method,  when 
used  at  only  two  points  in  time,  does  not  allow  the 
researcher  to  distinguish  true  change  from  the 
unreliability  of  the  measure.  Therefore,  the  results 
of  alternative-form  reliability  studies  are  easier  to 
interpret  if  the  phenomenon  being  measured  is 
relatively  enduring  and  not  subject  to  rapid  and 
radical  alteration.  A  limitation  of  this  method  is 
the  impracticality  of  designing  alternative  forms  of 
measures  that  are  truly  parallel.  This  is  easier  to 
do  in  education  where  achievement  tests  are 
widely  employed  but  much  more  difficult  to 
accomplish  in  other  social  sciences  where  research¬ 
ers  typically  focus  on  measuring  abstract  theoreti¬ 
cal  concepts. 

Split-Halves  Method 

Reliability  estimated  by  using  the  split-halves 
method,  unlike  the  retest  or  alternative-form 
methods,  can  be  conducted  on  only  one  occa¬ 
sion.  The  total  set  of  items  is  divided  in  half, 
and  the  scores  on  the  halves  are  correlated  to 
yield  an  estimate  of  reliability.  The  halves  can 
be  considered  approximations  to  alternative 
forms.  The  correlations  between  the  two  halves 
would  be  the  reliability  for  each  half  of  the  test 
and  not  for  the  total  test.  To  estimate  the  reli¬ 
ability  of  the  entire  test,  we  must  estimate  a 
statistical  correction,  called  the  Spearman- 
Brown  prophecy  formula.  Since  the  total  test  is 
twice  as  long  as  each  half,  the  Spearman-Brown 
prophecy  formula  is 

=  2Pxx' 

PxX"  1  +  Pxx’ 

where  p^is  the  reliability  coefficient  for  the  whole 
test  and  pxx,  is  the  split-half  correlation. 

The  more  general  version  of  the  Spearman- 
Brown  prophecy  formula  is 


and  test  length.  To  estimate  the  number  of  items 
required  to  obtain  a  particular  reliability,  one  uses 
the  following  formula: 

N  =  PxA1  ~  p**')/p**'(i  ~  p«-)> 

where  pxx„  =  the  desired  reliability,  pxx,  —  the  reli¬ 
ability  of  the  existing  test,  and  N  —  the  number  of 
times  the  test  would  be  lengthened  to  obtain  reli¬ 
ability  of  PXX". 

There  is  some  degree  of  indeterminacy  in  using 
the  split-halves  technique  to  estimate  reliability 
due  to  the  different  ways  that  the  items  can  be 
grouped  into  halves.  The  most  typical  way  to 
divide  the  items  is  to  place  the  even-numbered 
items  in  one  group  and  the  odd-numbered  items  in 
the  other  group.  But  other  ways  of  partitioning 
the  total  item  set  are  also  used  including  sepa¬ 
rately  scoring  the  first  and  second  halves  of  the 
items  and  randomly  dividing  the  items  into  two 
groups.  For  a  10-item  scale,  there  are  126  differ¬ 
ent  splits,  and  each  will  probably  result  in  a 
slightly  different  correlation  between  the  two 
halves,  which  will  lead  to  a  different  reliability 
estimate.  It  is  therefore  possible  to  obtain  different 
reliability  estimates  even  if  the  same  items  are 
administered  to  the  same  individuals. 

Internal  Consistency  Methods 

The  aforementioned  methods  of  estimating 
reliability  require  either  the  splitting  or  repeat¬ 
ing  of  measures  and  thus  have  distinct  theoreti¬ 
cal  and  practical  limitations.  An  alternative 
approach  to  estimating  reliability  focuses  on  the 
internal  consistency  of  measurements.  The  most 
popular  of  these  coefficients  for  assessing  inter¬ 
nal  consistency  is  Cronbach’s  alpha.  Cronbach’s 
alpha  is  equal  to  the  average  of  all  possible  split- 
half  correlations  for  a  composite  scale  2 N  items 
long  and  is  calculated  from  the  variance-covariance 
matrix  as  follows: 


Px„x;;  =  NPxx./[l  +  (N  -  l)pxx,}. 

This  general  formula  gives  the  reliability  of  a  scale 
that  is  N  times  longer  than  the  original  scale. 

A  researcher  can  also  use  this  Spearman-Brown 
prophecy  formula  to  determine  the  number  of  items 
that  would  be  needed  to  attain  a  given  reliability 


a 


V  [  _  £Var(Y,) 
N  -  1  [  Varx 


where  N  —  the  number  of  items,  £Var(Y,-)  =  the 
sum  of  the  item  variances,  and  VARX  =  the  vari¬ 
ance  of  the  total  composite. 
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If  one  is  using  the  correlation  matrix  rather  than 
the  variance-covariance  matrix,  the  formula 
becomes 


a 


u  r  ci 

-  l  - - 

a  —  1  L  a  +  2b  ’J  ’ 


where  a  —  the  number  of  items  in  the  composite  and 
b  =  the  sum  of  the  correlations  among  the  items. 

Coefficient  alpha  thus  depends  on  the  number 
of  items  in  the  scale  and  the  intercorrelations 
among  those  items.  Generally  speaking,  longer 
scales  with  higher  intercorrelations  are  more 
reliable  than  comparable  scales  composed  of 
fewer  items  that  are  less  highly  intercorrelated. 
Alpha  is  a  lower  bound  to  the  reliability  of  a 
scale  of  N  items  where  each  item  contributes 
equally  to  the  scale.  Representing  a  lower  bound, 
the  reliability  of  the  scale  is  always  equal  to  or 
greater  than  alpha.  It  is  equal  to  the  reliability  if 
the  items  are  parallel.  The  reliability  of  a  scale 
can  never  be  lower  than  alpha  even  if  the  items 
depart  substantially  from  being  parallel  mea¬ 
surements.  Thus,  alpha  is  a  conservative  esti¬ 
mate  of  reliability. 

Cronbach’s  alpha  is  a  generalization  of  Kuder 
and  Richardson’s  procedure  designed  to  estimate 
the  reliability  of  scales  composed  of  dichotomously 
scored  items.  These  items  are  scored  either  zero  or 
one  depending  on  whether  the  individual  possesses 
the  particular  characteristic  of  interest.  Because 
Cronbach’s  alpha  can  handle  multiply  scored  or 
dichotomously  scored  items,  because  it  encom¬ 
passes  the  Spearman-Brown  prophecy  formula, 
because  it  makes  use  of  all  of  the  information  con¬ 
tained  in  the  items,  and  because  it  is  easy  to  com¬ 
pute,  it  has  become  widely  used  as  a  measure  of 
reliability. 


Correction  for  Attenuation 

No  matter  which  specific  method  a  researcher 
uses  to  compute  an  estimate  of  reliability,  one  of 
the  estimate’s  most  important  uses  is  to  “correct” 
correlations  for  unreliability  due  to  random  mea¬ 
surement  error.  If  we  can  estimate  the  reliability  of 
each  variable,  we  can  use  these  estimates  to  deter¬ 
mine  what  the  correlation  between  the  two  vari¬ 
ables  would  be  if  they  were  perfectly  reliable.  This 


process  is  called  correction  for  attenuation.  The 
formula  for  the  correction  for  attenuation  is 


Pxtyt 


P  xffj 


where  pv  ,,  =  the  correlation  corrected  for  attenu- 

r  xtyt 

ation,  pvv  =  the  observed  correlation,  p  ,  =  the 
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reliability  of  X,  p  #  =  the  reliability  of  Y. 

The  resulting  correlation — the  observed  correla¬ 
tion  purged  of  random  measurement  error — might 
be  thought  of  as  the  correlation  between  theoreti¬ 
cal  concepts  that  have  been  measured  with  perfect 
reliability. 


Levels  of  Measurement 

Because  of  the  fundamental  importance  of  measure¬ 
ment  in  making  inferences,  drawing  conclusions,  and 
formulating  hypotheses  about  theoretical  concepts,  it 
is  essential  to  be  as  precise  in  our  measurement  of 
these  concepts  as  possible.  The  statistical  tech¬ 
niques  that  a  researcher  can  use  in  a  particular 
analysis  depend  on  the  level  of  precision  at  which 
variables  are  measured.  More  precise  measures 
allow  the  researcher  to  use  more  sophisticated 
and  rigorous  statistical  techniques  and  to  have 
more  confidence  in  the  results.  The  precision  of 
an  indicator  depends  on  the  level  at  which  it  is 
measured.  Stevens  identified  four  levels  into 
which  we  can  classify  measures.  In  order  from 
least  precise  to  most  precise,  they  are  nominal, 
ordinal,  interval,  and  ratio.  Each  level  has  par¬ 
ticular  characteristics  that  set  it  apart  from  the 
others.  By  determining  the  level  of  the  measures 
available,  a  researcher  can  determine  which  sta¬ 
tistical  techniques  are  appropriate  for  the  analysis 
and  which  are  not. 

A  variable  measured  at  the  nominal  level  repre¬ 
sents  a  typology  in  which  the  attributes  of  a  vari¬ 
able  are  classified  into  various  categories.  The 
categories  of  a  nominal  variable  are  mutually  exclu¬ 
sive  and  together  they  exhaust  the  possibilities — 
that  is,  there  is  one  and  only  one  category  for 
every  possible  attribute.  Thus,  a  grandparent  can 
be  classified  as  either  a  grandmother  or  grandfa¬ 
ther.  This  is  the  least  precise  of  the  levels  of  mea¬ 
surement,  and  for  that  reason,  the  researcher  is 
limited  in  the  quantitative  techniques  applicable 
to  those  appropriate  to  variables  measured  by 
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unordered  categories,  for  example,  frequency  dis¬ 
tributions  and  determining  the  mode. 

The  next  most  precise  level  of  measurement  is 
at  the  ordinal  level.  Ordinal-level  variables  possess 
the  characteristics  of  the  nominal-level  variables 
and  add  the  property  that  the  attributes  can  be 
ordered  from  less  of  an  attribute  to  more  of  that 
attribute.  More  sophisticated  statistical  proce¬ 
dures  can  be  used  with  variables  measured  at  the 
ordinal  level.  Determining  the  median,  percentiles, 
Spearman’s  rho,  and  gamma  are  examples  of  sta¬ 
tistical  techniques  that  can  be  applied  to  variables 
measured  at  the  ordinal  level. 

In  our  hierarchy  of  measurement,  the  next  step 
toward  more  precision  is  the  interval  level.  In 
addition  to  having  all  of  the  characteristics  of 
nominal-  and  ordinal-level  variables,  the  intervals 
between  each  of  the  categories  are  equal.  This  was 
not  a  requirement  of  the  ordinal  level.  To  move  up 
from  nominal  to  ordinal,  a  variable’s  categories 
had  to  have  a  logical  order,  but  the  distances 
between  those  categories  did  not  have  to  be  equal. 
Now,  to  move  from  ordinal  to  interval,  those 
intervals  must  be  equal.  Statistical  techniques  that 
are  appropriate  for  variables  measured  at  the 
interval  level  are  very  sophisticated  and  include 
adding,  subtracting,  and  computing  the  mean, 
standard  deviation,  and  Pearson’s  product  moment 
correlation.  Finally,  the  most  precise  level  of  mea¬ 
surement  is  the  ratio  level.  It  differs  from  the  inter¬ 
val  level  in  that  a  ratio  level  variable  contains  a 
natural  zero  point.  Because  of  this  zero  point,  we 
can  express  categories  as  ratios,  for  example, 
someone  who  is  in  a  particular  category  has  twice 
as  much  of  the  attribute  as  someone  who  falls  into 
another  category. 

Multiple-Indicator  Measures 

As  we  have  said,  measurement  involves  a  process 
whereby  abstract,  theoretical  concepts  are  repre¬ 
sented  by  concrete,  empirical  indicators.  These 
complex  concepts  are  more  appropriately  mea¬ 
sured  by  using  a  composite  of  multiple  indicators 
rather  than  a  single  indicator.  Estimating  reliability 
using  a  single  indicator  is  almost  always  impossible 
without  having  some  a  priori  information  that  is 
usually  unavailable.  And  even  if  the  reliability  of  a 
single  indicator  can  be  estimated,  it  will  usually  be 
affected  by  random  error  more  than  a  composite 


and  thus  have  a  lower  estimated  reliability. 
Multiple-indicator  measures,  or  multiple-item  mea¬ 
sures,  are  indices  made  up  of  more  than  one  item. 
They  are  ubiquitous  in  the  social  sciences.  They  are 
used  to  measure  attitudes,  opinions,  emotions,  per¬ 
sonality,  and  many  other  social  science  concepts. 

There  are  many  reasons  why  a  composite  mea¬ 
sure  is  preferable  to  a  single  item.  First,  many  of 
the  theoretical  concepts  being  measured  are  com¬ 
plex,  broad,  and  abstract  and,  as  a  result,  cannot 
be  adequately  represented  by  a  single  item.  This  is 
a  question  of  validity  as  discussed  earlier.  Does  the 
empirical  measure  actually  provide  an  adequate 
representation  of  the  theoretical  concept?  As  a 
simple  example,  no  one  would  argue  that  an  indi¬ 
vidual  true/false  question  on  an  American  govern¬ 
ment  examination  is  a  valid  measure  of  the  degree 
of  knowledge  of  American  government  possessed 
by  a  student.  If  several  questions  concerning  the 
subject  are  summed,  we  get  a  better  estimate  of  the 
student’s  understanding  of  American  government. 
To  take  another  example,  a  person’s  level  of  anxi¬ 
ety  cannot  be  measured  adequately  by  a  single  item 
in  a  clinical  evaluation  but  is  routinely  done  so 
with  a  battery  of  questions. 

Another  reason  to  use  multiple-item  measures  is 
accuracy.  Single  items  lack  precision  because  they 
may  not  distinguish  subtle  distinctions  of  an  attri¬ 
bute.  If  the  item  is  dichotomous,  it  will  only  recog¬ 
nize  two  levels  of  the  attribute.  The  final  reason  for 
using  multiple-item  measures  is  reliability.  Single 
items  include  more  random  error  than  multiple- 
item  measures  because  the  latter  allow  random 
error  to  cancel  out  because  of  the  multiple  mea¬ 
surements.  Because  they  are  constructed  of  more 
than  one  indicator  of  a  particular  phenomenon, 
multiple-item  measures  increase  the  reliability  of  an 
empirical  representation  of  a  concept  that  would  be 
attainable  if  single  indicators  were  used  in  analysis. 
In  other  words,  combining  several  individual  items 
into  a  single  multiple-item  composite  indicator  will 
generally  decrease  random  error. 

Scale 

A  scale  is  a  composite  measure  used  to  measure 
some  underlying  concept.  Many  phenomena  in 
social  science  are  theoretical  constructs  that  can¬ 
not  be  directly  measured  by  a  single  variable.  To 
understand  the  causes,  effects,  and  implications  of 
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these  phenomena,  the  researcher  must  develop  a 
valid  and  reliable  empirical  indicator  of  these  con¬ 
cepts.  This  empirical  indicator  is  called  a  scale.  In 
this  sense,  the  scale  is  composed  of  a  set  of  mea¬ 
surable  items  that  capture  empirically  the  essen¬ 
tial  meaning  of  the  theoretical  construct.  Good 
scales  are  data  reduction  devices  that  simplify  the 
information  and,  in  one  composite  measure, 
articulate  the  direction  and  intensity  of  the  con¬ 
cept.  They  are  usually  constructed  at  the  ordinal 
level  of  measurement,  although  some  can  be  at 
the  interval  level.  Because  they  are  constructed  of 
more  than  one  indicator  of  a  particular  phenom¬ 
enon,  they  increase  both  the  reliability  and  valid¬ 
ity  that  would  be  attainable  if  the  individual 
indicators  were  used  in  analysis. 

There  are  three  related  but  distinct  purposes  for 
scaling.  In  the  first  case,  scaling  may  be  intended  to 
test  a  specific  hypothesis,  for  example,  that  a  single 
dimension,  party  identification,  structures  voters’ 
choice  of  presidential  candidates.  In  this  case,  the 
scaling  model  is  used  as  a  criterion  to  evaluate  the 
relative  fit  of  a  given  set  of  observed  data  to  a  spe¬ 
cific  model.  Another  purpose  for  scaling  is  to 
describe  a  data  structure.  For  example,  a  political 
scientist  might  use  a  scale  to  discover  the  underlying 
social,  economic,  and  cultural  dimensions  of  the 
U.S.  electoral  system.  This  would  be  an  exploratory 
analysis  rather  than  a  hypothesis-testing  approach. 
Finally,  scaling  can  be  used  to  construct  a  measure 
on  which  individuals  can  be  placed  and  their  scores 
related  on  that  scale  to  other  measures  of  interest. 

There  are  several  possible  models  that  can  be 
used  to  combine  items  into  a  scale.  These  include 
Likert  scaling,  Thurstone  scaling,  and  Guttman 
scaling.  The  method  chosen  depends  on  the  pur¬ 
pose  that  the  scale  is  intended  to  serve.  Scaling 
models  may  be  used  to  scale  persons,  stimuli,  or 
both  persons  and  stimuli. 

The  Likert  scale  is  a  scaling  model  that  scales 
only  subjects.  Broadly,  any  scale  obtained  by  sum¬ 
ming  the  response  scores  of  its  constituent  items  is 
called  a  Likert  or  summative  scale  or  a  linear  com¬ 
posite.  An  examination  in  a  mathematics  class  is 
an  example  of  a  linear  composite.  The  scale  (test) 
score  is  found  by  adding  the  number  of  correct 
answers  to  the  individual  items  (questions).  The 
composite  score  on  the  scale  (test)  is  a  better  indi¬ 
cator  of  the  student’s  knowledge  of  the  material 
than  is  any  single  item  (question). 


Louis  Thurstone’s  interest  lay  in  measuring  and 
comparing  stimuli  when  there  is  no  evident  logical 
structure.  The  Thurstone  scaling  model  is  an 
attempt  to  identify  an  empirical  structure  among 
the  stimuli.  To  do  this  Thurstone  scaling  uses 
human  judgments.  Individuals  are  given  multiple 
statements,  maybe  as  many  as  100.  These  “judges” 
order  the  statements  along  an  underlying  dimen¬ 
sion.  Those  statements  that  produce  the  most 
agreement  among  the  judges  are  selected  as  the 
items  to  be  included  in  the  scale.  These  remaining 
items  (approximately  20)  should  cover  the  entire 
latent  continuum.  These  items  are  then  presented 
to  subjects  who  are  asked  to  identify  those  that 
they  accept  or  agree  with.  The  average  scale  value 
of  those  items  chosen  represents  the  person’s  atti¬ 
tude  toward  the  object  in  question. 

A  unidimensional  structure  exists  in  those  situ¬ 
ations  where  a  single,  fundamental  dimension 
underlies  a  set  of  observations.  Examples  of  this 
condition  are  measures  used  to  evaluate  the 
qualifications  of  political  candidates,  a  scale  of 
political  activism,  or  a  measure  designed  to 
describe  a  single  personality  trait.  The  presump¬ 
tion  is  that  there  is  a  single  common  dimension 
on  which  the  candidates,  or  citizens,  or  personal¬ 
ity  traits  are  arrayed.  Many  theoretical  constructs 
used  in  the  social  sciences  are  quite  complex.  A 
single,  homogeneous,  underlying  dimension  may 
not  be  adequate  to  express  these  complex  con¬ 
cepts.  This  complexity  may  require  us  to  turn  to 
more  elaborate  explanations  of  the  variables’ 
behavior  in  which  more  than  one  latent  dimen¬ 
sion  underlies  a  set  of  empirical  observations. 
This  is  multidimensionality.  The  task  of  the 
researcher  is  to  uncover  these  multiple  dimen¬ 
sions.  We  hypothesize  their  existence  because 
theory  suggests  that  they  should  exist,  or  we  can 
predict  these  latent  dimensions  because  of  the 
variation  in  the  empirical  observations  at  hand. 
Almost  all  hypotheses  in  the  social  sciences  con¬ 
cern  relationships  between  concepts.  The  more 
abstract  these  concepts,  the  more  likely  they  are 
to  be  multidimensional. 

Louis  Guttman  developed  a  method  of  scaling  as 
an  alternative  to  other  scaling  methods  that  did  not 
account  for  the  dimensionality  of  the  phenomena 
being  measured.  Using  this  method,  one  can  estab¬ 
lish  that  a  series  of  items  belong  on  a  specifically 
unidimensional  continuum.  Guttman’s  method 
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orders  both  items  and  subjects  with  regard  to  some 
underlying  cumulative  dimension  according  to 
intensity.  Guttman  scaling,  also  known  as  scalo- 
gram  analysis  and  cumulative  scaling,  is  cumulative 
in  the  sense  that  if  the  researcher  knows  that  a 
respondent’s  score  is,  for  example,  3,  the  researcher 
knows  not  only  that  the  respondent  answered  three 
items  “correctly”  but  also  knows  that  the  items 
answered  “correctly”  were  Items  1,  2,  and  3.  A 
perfect  Guttman  scale  is  rarely  achieved  in  practice; 
indeed,  scalogram  analysis  anticipates  that  this 
ideal  model  will  be  violated.  It  then  becomes  a 
question  of  the  degree  of  deviation  that  one  is  will¬ 
ing  to  tolerate  before  conceding  that  the  empirical 
data  cannot  be  adequately  represented  by  a  single 
quantitative  variable  on  an  underlying  unidimen¬ 
sional  continuum. 

There  are  two  principal  methods  used  to  deter¬ 
mine  this  degree  of  deviation  from  the  ideal: 
minimization  of  error  proposed  by  Guttman  and 
deviation  from  perfect  reproducibility  based  on 
work  by  Allen  Edwards.  According  to  the  minimi- 
zation-of-error  technique,  the  number  of  errors  is 
the  least  number  of  positive  responses  that  must  be 
changed  to  negative  or  negative  responses  that 
must  be  changed  to  positive  for  the  observed 
response  to  be  transformed  into  an  ideal  response 
pattern.  The  method  of  deviation  from  perfect 
reproducibility  begins  with  a  perfect  model  and 
counts  the  number  of  responses  that  are  inconsis¬ 
tent  with  that  pattern.  Error  counting  based  on 
deviations  from  perfect  reproducibility  will  result 
in  more  errors  than  the  minimization-of-error  tech¬ 
nique.  It  provides  an  accurate  description  of  the 
data  based  on  scalogram  theory  and  for  this  reason 
is  superior  to  the  minimization-of-error  method. 

While  agreeing  that  the  Guttman  scale  is  intui¬ 
tively  appealing,  Jum  Nunnally  argues  that  it  is 
impractical  for  several  reasons.  First,  because  it  is 
a  deterministic  model,  having  no  measurement 
error,  there  are  very  few,  if  any,  sets  of  items  that 
fit  the  model’s  requirements.  Second,  the  perfect 
pattern  of  responses  is  a  necessary  but  not  suffi¬ 
cient  condition  for  establishing  a  scale’s  unidimen- 
sionality  and  therefore  the  items  may  not  all 
belong  to  the  same  factor.  Moreover,  the  fewer 
the  number  of  items  in  the  scale,  the  easier  it  is  to 
manipulate  the  model  to  get  the  proper  pattern  of 
responses.  Finally,  the  goal  of  the  Guttman  model 
is  to  develop  only  ordinal,  not  interval,  scales. 


Index 

An  index  is  a  measure  in  which  the  researcher  com¬ 
bines  more  than  one  indicator  of  some  abstract 
theoretical  concept  into  a  single  summary  score.  In 
this  regard,  a  scale  is  a  type  of  index.  Often, 
researchers  use  the  terms  index  and  scale  inter¬ 
changeably.  But  whereas  scales  arrange  individuals 
on  the  basis  of  patterns  in  the  data,  an  index  is  typi¬ 
cally  simply  an  additive  composite  of  several  indica¬ 
tors,  called  items.  Unlike  a  scale,  there  is  no  theo¬ 
retical  specification  of  a  measurement  model  in  the 
construction  of  an  index.  For  this  reason,  scales  are 
more  difficult  to  construct.  While  indexes,  or  indi¬ 
ces,  may  generally  seem  valid  and  reliable,  because 
of  the  lack  of  an  underlying  measurement  model, 
these  properties  are  not  usually  examined  in  depth. 

Further,  an  index  may  not  be  unidimensional. 
For  example,  political  scientists  use  a  7-point 
index  to  measure  party  identification  in  the  United 
States.  This  index  is  arranged  along  a  continuum 
as  follows:  strong  Democrat,  weak  Democrat, 
Independent  leaning  toward  Democrat,  Indepen¬ 
dent,  Independent  leaning  toward  Republican, 
weak  Republican,  strong  Republican.  One  can  eas¬ 
ily  see  that  this  index  consists  of  two  separate 
dimensions.  The  first  is  a  directional  dimension, 
from  Democrat  through  Independent  to  Repub¬ 
lican.  However,  there  also  exists  an  intensity 
dimension,  going  from  strong  through  weak,  inde¬ 
pendent,  weak,  and  back  to  strong. 

Frequently,  governments  use  indexes  of  official 
statistics  to  measure  aspects  of  the  country.  For 
example,  the  consumer  price  index  is  used  as  a 
measure  of  the  level  of  prices  that  consumers  pay 
for  goods  and  services  and  the  Federal  Bureau  of 
Investigation’s  crime  index  is  the  sum  of  the  seven 
so-called  index  crime  rates  and  is  used  as  an  indi¬ 
cator  of  overall  crime  in  the  United  States.  Another 
example  is  the  Human  Development  Index  com¬ 
piled  annually  by  the  United  Nations  Development 
Programme,  which  combines  rates  of  average  life 
expectancy,  infant  mortality,  and  literacy  for  all 
United  Nations  member  states. 


Multidimensional  Scaling 

Scaling  models  may  differ  with  regard  to  several 
characteristics — the  purpose  for  which  the  scale  is 
being  used,  the  phenomena  being  scaled,  whether 
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the  model  is  deterministic  or  probabilistic,  the  type 
of  data  used,  and  whether  the  model  is  unidimen¬ 
sional  or  multidimensional.  Dimensionality  is  a 
complex  issue  and  can  take  on  different  meanings 
for  different  scaling  models.  A  unidimensional 
scale  is  appropriate  in  those  situations  where  there 
exists  a  single,  fundamental  dimension  that  under¬ 
lies  a  set  of  observations.  Examples  could  include 
a  scale  to  evaluate  the  qualifications  of  political 
candidates  or  a  scale  designed  to  measure  a  single 
personality  trait.  The  presumption  is  that  there  is  a 
single  common  dimension  on  which  the  candidates 
or  personality  traits  are  arrayed. 

Many  theoretical  concepts  used  in  the  social  sci¬ 
ences,  however,  are  quite  complex  and  may  not  be 
amenable  to  being  expressed  on  a  simple,  homoge¬ 
neous,  unidimensional  scale.  Multidimensional 
scaling  models  are  used  when  more  than  one 
dimension  underlies  a  set  of  observations.  Multi¬ 
dimensional  scaling  techniques  are  mathematical 
methods  used  to  model  the  similarity  (or  dissimi¬ 
larity)  of  a  set  of  data  points  to  uncover  the  under¬ 
lying  structure  of  those  data.  The  objects  under 
study  are  placed  on  a  “map,”  a  representation  of 
the  points  in  a  spatial,  or  geometric,  form  simulta¬ 
neously  along  more  than  one  dimension,  which 
can  then  be  interpreted  by  the  researcher. 

In  multidimensional  scaling,  respondents  judge 
the  similarity  or  difference  among  a  set  of  stimuli. 
In  that  regard,  this  is  an  extension  of  the  unidimen¬ 
sional  scaling  of  stimuli  represented  by  the 
Thurstone  scale.  A  subject  must  compare  stimuli 
and  determine  which  are  more  similar  to  each 
other.  If  a  researcher  wanted  to  measure  the  weight 
of  different  objects  but  did  not  possess  a  scale,  he 
or  she  could  present  pairs  of  the  objects  to  respon¬ 
dents  and  ask  which  of  the  two  is  heavier,  then 
compare  the  heavier  with  another  object  in  the  set 
and  so  on  until  all  of  the  objects  are  arrayed  from 
lightest  to  heaviest.  Construction  of  a  multidimen¬ 
sional  scale  may  proceed  in  the  same  manner 
except  that  the  subject  would  be  asked  to  make  the 
judgment  of  similarity  but  would  not  be  presented 
with  a  dimension  along  which  to  order  the  objects. 
Thus,  some  subjects  may  array  the  objects  along  a 
dimension  of  weight,  while  others  may  group  them 
on  a  dimension  of  size.  Multidimensional  scaling 
can  be  used  when  the  researcher  does  not  know  the 
structure  of  the  observations — that  is,  the  underly¬ 
ing  dimensions  may  be  unknown.  These  methods 


can  also  be  used  when  the  researcher  has  developed 
and  then  tests  specific  hypotheses  about  what 
dimensions  underlie  the  data. 

A  related  method  used  to  uncover  the  underly¬ 
ing  dimensions  of  data  is  factor  analysis.  Factor 
analysis  focuses  on  the  measurement  of  latent  vari¬ 
ables.  The  technique  attempts  to  statistically  reduce 
the  number  of  variables  to  a  minimum  number  of 
underlying  dimensions  (latent  variables  or  factors) 
that  underlie  the  covariation  among  the  observed 
variables.  Each  observed  variable  is  linked  to  one 
or  more  unobserved  factor.  These  links  are  called 
factor  loadings  and  can  be  thought  of  as  a  correla¬ 
tion  between  the  observed  variable  and  the  unob¬ 
served  factor.  There  are  two  types  of  factor  analysis: 
exploratory  factor  analysis  (EFA)  and  confirma¬ 
tory  factor  analysis  (CFA). 

When  a  researcher  has  no  (or  not  much)  a  priori 
knowledge  of  the  latent  factors  underlying  a  set  of 
variables,  EFA  can  be  used  to  uncover  that  structure. 
The  correlation  between  each  observed  variable  and 
each  unobserved  factor  is  estimated  and  the  observed 
variables  that  have  high  loadings  are  said  to  define 
the  latent  variable.  Using  the  factor  loadings  as 
weights,  the  researcher  can  construct  measures  of 
the  latent  variables  and  use  them  in  subsequent 
analyses.  CFA  is  an  inferential  process  that  can  be 
used  to  test  specific  hypotheses  about  the  latent  fac¬ 
tors  underlying  a  set  of  observed  variables.  That  is, 
based  on  theory  or  prior  empirical  study,  the 
hypothesis  may  be  that  Variables  1,  2,  and  3  define 
a  factor,  Factor  1,  and  Variables  4,  5,  and  6  define  a 
second  factor,  Factor  2.  The  adequacy  of  this  mod¬ 
el’s  representation  of  the  observed  data  can  be 
assessed  by  a  variety  of  goodness-of-fit  indexes. 
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In  its  basic  form,  research  is  about  the  relation¬ 
ships  between  concepts  measured  as  variables. 
Researchers  make  inferences,  draw  conclusions, 
and  form  new  hypotheses  based  on  the  patterns 
they  detect  in  these  relationships.  It  is  important, 
therefore,  for  researchers  to  be  as  precise  in  their 
measurement  of  concepts  as  possible.  The  statis¬ 
tical  techniques  that  can  be  applied  to  a  set  of 
variables  are  contingent  on  the  level  of  precision 
at  which  the  variables  are  measured.  The  more 
precise  a  measure,  the  more  sophisticated  statisti¬ 
cal  techniques  can  be  employed  in  the  analysis, 
and  the  stronger  the  conclusions  that  can  be 
drawn.  In  this  entry,  the  major  levels  of  measure¬ 
ment,  the  appropriate  statistical  procedures 


based  on  them,  and  a  few  concrete  examples  are 
discussed. 

Measurement  is  usually  discussed  in  terms  of 
four  levels,  as  stated  by  S.  S.  Stevens  (1946). 
Moving  from  least  precise  to  most  precise,  they  are 
nominal,  ordinal,  interval,  and  ratio.  Nominal 
measures  simply  classify  members  based  on  par¬ 
ticular  attributes.  These  must  be  mutually  exclu¬ 
sive  and  exhaustive.  For  example,  gender  is  a 
nominal  variable.  It  is  made  up  of  the  mutually 
exclusive  and  exhaustive  categories  of  (a)  male  and 
(b)  female  or  (a)  female  and  (b)  male  (neglecting 
other  possible  distinctions).  It  makes  no  difference 
how  numbers  are  assigned  to  the  categories  because 
there  is  no  judgment  made  as  to  how  much  of  a 
variable  is  possessed  by  an  attribute.  That  is,  male 
is  not  higher  than  female,  or  vice  versa.  It  is  simply 
a  way  to  categorize  a  sample  or  population,  and 
neither  male  nor  female  possess  more  of  the  qual¬ 
ity  of  gender  than  the  other. 

For  instance,  suppose  we  have  a  population  in 
which  everyone  belongs  to  one  of  four  religions: 
Catholicism,  Judaism,  Islam,  or  Protestantism. 
Then,  religion  is  a  variable — that  is,  it  is  mutually 
exclusive  and  exhaustive  (at  least  with  reference  to 
our  example  population).  Everyone  is  classified  into 
one  of  the  four  categories.  However,  none  is  higher 
on  the  scale.  Protestant  is  not  more  or  less  religious 
than  any  of  the  others,  nor  is  Catholic,  Jewish,  or 
Muslim.  The  only  quantitative  techniques  we  can 
use  with  nominal  variables  are  those  based  on  the 
categories — that  is,  frequency  distributions  and  the 
mode.  We  cannot  perform  statistical  procedures — 
for  instance,  take  a  mean  or  find  a  standard 
deviation — using  this  variable.  We  cannot  add  or 
subtract  them.  Statistical  procedures  must  turn 
these  variables  into  binary  (“dummy”)  variables  to 
be  able  to  use  them;  this  is  discussed  below.  Many 
examples  of  nominal  variables  exist  in  social  sci¬ 
ence,  such  as  race,  ethnicity,  region,  marital  status, 
and  occupation. 

The  next  level  of  measurement  is  the  ordinal 
level.  Ordinal  variables,  like  nominal  variables, 
allow  the  researcher  to  classify  attributes  into 
mutually  exclusive  and  exhaustive  categories,  but, 
unlike  nominal  variables,  the  attributes  catego¬ 
rized  in  ordinal  variables  can  be  ordered  so  that 
they  run  from  less  of  an  attribute  to  more  of  that 
attribute.  Big,  bigger,  biggest  is  an  example  of  a 
logical  order.  Ordinal  variables  are  quite  common 
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in  social  sciences.  Agree/disagree  questions  mea¬ 
sure  on  an  ordinal  scale.  For  example,  “The  world 
is  flat.  Do  you  (a)  strongly  disagree,  (b)  disagree, 
(c)  neutral,  (d)  agree,  or  (e)  strongly  agree?”  This 
answering  scheme  produces  an  ordinal  measure 
because  there  is  a  logical  ordering  from  strongly 
agree  (with  the  statement)  to  strongly  disagree.  A 
“feeling  thermometer”  is  an  ordinal  variable.  For 
example,  on  a  scale  of  0  to  100,  with  0  being  very 
cold  and  100  being  very  warm,  one  can  ask, 
“How  do  you  feel  about  Barack  Obama?  John 
McCain?”  Statistical  measures  that  one  can  use 
when  the  variables  are  measured  at  the  ordinal 
level  are  the  median,  percentiles,  Spearman’s  rho, 
and  gamma.  Other  examples  of  ordinal  variables 
used  in  social  science  are  variables  with  answer 
patterns  running  from  like  to  dislike  and  variables 
measuring  attitudes. 

Moving  up  the  continuum  of  levels  of  measure¬ 
ment,  the  third  level  of  measurement  is  the  interval 
level.  Interval-level  variables  allow  one  to  classify 
attributes  into  mutually  exclusive  and  exhaustive 
categories,  like  nominal  and  ordinal  levels.  Like 
ordinal  variables,  they  possess  a  natural  order.  But, 
now,  the  intervals  between  the  ordered  categories 
are  equal.  In  the  earlier  example  of  the  variables 
with  the  categories,  strongly  agree  through  strongly 
disagree,  the  distances  between  each  of  these  cate¬ 
gories  are  not  equal;  indeed,  they  are  not  even 
known.  One  respondent  may  interpret  them  as 
equal,  while  another  may  interpret  the  distance 
between  strongly  agree  and  agree  to  be  shorter  than 
the  distance  between  agree  and  neutral,  or  vice 
versa.  There  is  no  way  to  know.  All  that  is  known 
is  that  this  is  the  order  of  the  responses.  If  a  variable 
is  measured  at  the  interval  level,  the  distances  are 
all  equal.  The  usual  example  given  of  an  interval 
variable  is  temperature.  Every  degree  is  equal  to 
every  other  degree.  For  example,  30  degrees  is 
10  degrees  colder  than  40  degrees,  and  80  degrees 
is  10  degrees  colder  than  90  degrees.  Because  of 
this  property,  the  statistical  techniques  that  can  be 
used  on  variables  measured  at  the  interval  level  are 
some  of  the  most  sophisticated  known.  Analysts 
can  add  and  subtract,  determine  the  mean  and  stan¬ 
dard  deviation,  and  compute  a  Pearson’s  product- 
moment  correlation  or  regression  coefficient. 

The  property  that  interval-level  measures  lack  is 
a  natural  zero  point,  or  origin.  Temperature, 
which  we  have  already  classified  as  an  interval 


scale,  has  a  zero  point — on  the  Celsius  scale,  it  is 
the  freezing  point  of  water;  on  the  Fahrenheit 
scale,  it  is  32  degrees  below  the  freezing  point  of 
water.  The  fact  that  there  are  two  scales  with  dif¬ 
ferent  points  labeled  0  is  illustrative  of  why  tem¬ 
perature  is  not  a  ratio  scale.  Neither  0  degrees 
Celsius  nor  0  degrees  Fahrenheit  means  the  absence 
of  temperature.  Zero  in  this  case  is  simply  an  arbi¬ 
trary  point  on  the  scale.  We  can  measure  tempera¬ 
ture  below  zero.  What  matters  in  this  case  is  the 
interval.  Each  interval  is  the  same  size.  That  is,  any 
interval  (degree)  on  the  Fahrenheit  scale  is  equal  to 
every  other  interval  (degree)  on  the  Fahrenheit 
scale,  and  any  interval  on  the  Celsius  scale  is  equal 
to  every  other  interval  on  the  Celsius  scale.  Of 
course,  an  interval  on  the  Fahrenheit  scale  is  not 
equal  to  an  interval  on  the  Celsius  scale.  But,  a 
reading  of  10  degrees  on  a  Fahrenheit  or  Celsius 
scale  is  not  twice  as  much  as  5  degrees  or  half  as 
much  as  20  degrees.  One  cannot  say  that  when  the 
temperature  is  100  degrees,  the  temperature  is 
twice  the  temperature  as  when  the  temperature  is 
50  degrees.  In  the  big,  bigger,  biggest  example,  one 
cannot  say  that  big  is  half  as  big  as  bigger.  These 
comparisons  make  no  sense  because  there  is  no 
zero  and  no  starting  point.  Examples  of  interval- 
level  variables  in  social  science  are  standardized 
tests  such  as  IQ  (intelligence  quotient)  tests  or 
SATs  (formerly  known  as  Scholastic  Assessment 
Tests),  crime  rates,  and  voter  turnout. 

A  true  zero  point  allows  researchers  to  form 
ratios,  and  this  leads  to  the  most  sophisticated 
level  of  measurement  of  all,  the  ratio  scale.  The 
ratio  scale  is  classificatory,  the  categories  can  be 
ordered,  the  intervals  between  categories  are  equal, 
and  the  scale  has  a  natural,  or  true,  zero.  Using 
ratio-level  variables  one  can  form  ratios — for 
example,  the  amount  of  money  in  my  bank 
account  is  half  as  much  as  it  was  yesterday.  It  can 
be  described  this  way  only  because  there  is  a  zero, 
a  starting  point.  One  can  say  that  family  income  is 
4  times  what  it  was  in  1970  or  that  A  is  twice  as 
tall  as  B.  Money,  income,  and  height  are  examples 
of  ratio  variables.  There  are  many  examples  of 
ratio-level  variables  in  social  science,  such  as  the 
number  of  children  one  has,  the  number  of  times 
one  has  been  married,  the  number  of  years  of  edu¬ 
cation  that  one  has  had,  the  number  of  times  one 
attends  church  in  a  week,  and  the  number  of  mur¬ 
ders  that  have  occurred  in  a  particular  jurisdiction 
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in  a  year.  With  ratio-level  variables,  analysts  can 
use  statistics  such  as  the  geometric  mean  and  the 
coefficient  of  variation,  a  ratio  itself. 

When  designing  research,  it  is  desirable  for 
researchers  to  give  much  thought  to  the  kinds  of 
conclusions  they  would  like  to  draw  and  how  they 
want  to  state  them.  The  way  findings  are  presented 
is  a  result  of  the  level  of  measurement  at  which 
researchers  collect  their  data.  If  they  intend  to  give 
broad,  general  conclusions,  nominal  variables  may 
be  acceptable.  Usually,  however,  researchers  want 
to  give  specific,  precise  statements  of  their  find¬ 
ings.  This  will  probably  entail  collecting  their  data 
at  the  highest  possible  level. 

It  is  possible  to  aggregate  a  higher-level  variable 
into  a  lower-level  variable.  For  example,  we  could 
measure  age  by  asking  how  old  the  respondent  is 
in  calendar  years.  This  would  be  a  ratio  scale 
because  it  classifies  orders  with  equal  intervals  and 
has  at  least  a  theoretical  zero.  In  our  analysis,  we 
may  group  the  years  into  several  groups,  for  exam¬ 
ple,  less  than  18  years  old,  from  18  to  21  years  old, 
from  22  to  25  years  old,  and  so  on  up  to  more  than 
65  years  old.  This  would  be  an  ordinal  variable — 
the  first  and  last  groups  are  not  equal  to  the  others. 
The  ordinal  level  may  be  exactly  what  we  want  in 
this  case.  However,  we  may  want  to  address 
another  hypothesis  using  the  original  variable  in 
years.  This  is  the  sort  of  question  to  be  addressed 
before  data  collection  begins.  It  is  a  simple  matter 
to  group  calendar  years  into  groups;  however,  it  is 
impossible  to  disaggregate  data  into  finer  grained 
categories  if  the  data  have  been  collected  in  groups 
in  the  first  place.  A  rule  of  thumb  is  to  collect  data 
in  the  most  disaggregated  form  that  time  and  bud¬ 
get  will  permit. 

It  is  also  possible  to  transform  nominal  vari¬ 
ables  into  variables  that  can  be  used  as  interval 
variables.  These  are  called  dummy  variables. 
Dummy  variables  are  binary  variables  in  that  they 
have  two  categories — presence  of  the  attribute  and 
absence  of  the  attribute.  In  the  nominal  variable 
discussed  earlier,  religion,  we  classified  our  popu¬ 
lation  into  four  groups — Catholic,  Jew,  Muslim, 
and  Protestant.  As  a  nominal  variable,  we  can 
show  a  frequency  distribution  or  report  a  mode, 
but  that  is  about  all.  If  we  transform  the  variable, 
we  may  be  able  to  use  the  resulting  dummies  in 
higher  level  statistical  techniques.  In  this  case, 
we  would  create  four  dummies — one  for  each 


category.  Our  original  variable  was  mutually 
exclusive  and  exhaustive,  so  all  respondents  are  in 
one  and  only  one  category.  We  compute  a  dummy 
variable  for  the  Catholic  group — 1  if  Catholic  and 
0  if  not.  We  do  the  same  for  the  other  three  cate¬ 
gories.  When  the  process  is  completed,  we  have 
created  four  new  variables  that  tell  us  whether 
someone  is  Catholic  (yes/no),  Jewish  (yes/no), 
Muslim  (yes/no),  or  Protestant  (yes/no).  We  can 
then  use  these  dummy  variables  in  statistical  pro¬ 
cedures  such  as  multiple  regression  analysis. 
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In  the  natural  sciences,  scaling  is  usually  very  pre¬ 
cise,  and  a  single  measure  is  often  sufficient  to 
describe  a  dimension — for  example,  when  mea¬ 
suring  temperature  in  Celsius,  weight  in  grams,  or 
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distances  in  meters.  In  these  cases,  the  assignment 
is  quite  simple:  Distances  can  be  measured  by  a 
ruler,  temperature  by  a  thermometer,  and  weight 
by  a  scale.  The  situation  in  economics  is  somewhat 
different.  One  of  the  most  important  indicators  in 
international  finance  is  the  purchasing-power  par¬ 
ity.  The  validity  of  this  measure  depends  on  the 
choice  of  an  appropriate  “basket”  of  consumer 
goods,  which  is  not  the  same  in  all  countries.  In 
1986,  The  Economist  magazine  introduced 
“Burgernomics”  and  has  been  publishing  since 
then  annually  the  prices  of  McDonald’s  Big  Mac 
hamburgers  in  different  countries.  The  advantage 
of  the  Big  Mac  is  that  it  is  produced  and  consumed 
around  the  world;  the  unique  basket  consists  of 
prices  for  beef,  cheese,  lettuce,  onions,  and  bread — 
and  since  the  recipe  is  the  same  worldwide,  there 
are  (almost)  no  differences  in  the  share  of  these 
five  ingredients  among  countries.  Note  that  the 
one-dimensional  Big  Mac  Index  consists  of  several 
indicators  that  have  different  weights — there  is, 
for  example,  a  higher  share  of  beef  than  onions  in 
a  hamburger. 

In  the  social  sciences,  scaling  is  the  process  of 
measuring  or  ordering  entities  with  respect  to 
quantitative  attributes.  While  natural  scientists  use 
instruments  that  are  as  precise  as  possible,  social 
and  political  scientists  use  survey  research  methods 
such  as  face-to-face  or  telephone  interviews  to 
evaluate  the  attitudes  and  beliefs  of  the  popula¬ 
tion.  In  the  following,  some  examples  from  large 
data  sets  such  as  the  International  Social  Survey 
Programme  (ISSP),  the  World  Value  Surveys 
(WVS),  and  the  American  Nationwide  Election 
Studies  (ANES)  are  shown  to  illustrate  problems  of 
frequently  used  measurement  scales  such  as  Likert 
or  Guttman  scales. 

Typical  research  questions  include  the  discus¬ 
sion  of  latent  variables  (dimensions,  factors)  such 
as  the  level  of  national  identity  or  the  level  of 
political  efficacy  and  trust.  The  validity  of  the  mea¬ 
sures  depends  on  the  quality  of  the  instruments:  It 
is  not  possible  to  measure  the  correct  length  with  a 
broken  ruler,  and  it  is  not  possible  to  measure  true 
attitudes  and  beliefs  with  poorly  worded  ques¬ 
tions.  Starting  with  a  single  question,  closest  to  a 
metric  variable  might  be  a  feeling  barometer  that 
runs  from  0  to  100.  The  ANES  2004  provides  the 
following  question:  “Where  on  that  thermometer 
would  you  rate  George  W.  Bush?”  The  problem 


with  this  kind  of  question  is  that  respondents  are 
not  able  to  differentiate  between  values  such  as  43 
and  44;  most  are  not  even  able  to  differentiate 
between  scores  such  as  62  and  76.  An  alternative 
is  to  reduce  the  scale  to  values  between  0  and  10 
or,  for  getting  an  easy-to-catch  midpoint,  to  values 
between  —5  and  5,  as  is  done,  for  example,  in  the 
German  Politbarometer.  Although  it  is  probably 
easier  to  respond  to  an  11 -point  scale  than  to  a 
101-point  scale,  it  is  still  difficult  to  differentiate 
between  categories  such  as  2  and  3. 

While  it  is  possible  to  evaluate  political  parties 
and  politicians  by  a  single  question,  for  estimating 
the  level  of  a  complex  phenomenon  such  as  political 
trust  and  efficacy,  one  has  to  use  a  larger  set  of 
items.  In  the  ISSP  2004,  participants  were  asked  to 
grade  the  following  six  statements  on  a  5-point  scale 
running  from  strongly  agree  to  strongly  disagree: 

1.  People  like  me  don’t  have  any  say  about  what 
the  government  does. 

2.  I  don’t  think  the  government  cares  much  what 
people  like  me  think. 

3.  I  feel  I  have  a  pretty  good  understanding  of  the 
important  political  issues  facing  [country]. 

4.  I  think  most  people  in  [country]  are  better 
informed  about  politics  and  government  than 
I  am. 

5.  Most  of  the  time,  we  can  trust  people  in 
government  to  do  what  is  right. 

6.  Most  politicians  are  in  politics  only  for  what 
they  can  get  out  of  it  personally. 

A  first  approach  to  analyze  this  kind  of  items 
has  been  formulated  by  Rensis  Likert  (1932);  his 
ideas  are  known  under  the  terms  snmmative  scal¬ 
ing,  Likert  scaling,  and  technique  of  summated 
ratings.  They  can  be  summarized  in  three  steps: 
First,  collecting  a  large  set  (approximately  100)  of 
positively  and  negatively  formulated  items,  which 
are  possible  indicators  of  the  dimension  of  interest, 
say  political  efficacy  and  trust;  all  of  them  running 
from  strongly  agree  to  strongly  disagree.  Second, 
rating  the  items:  Respondents  in  a  sample  that  is 
representative  of  the  target  population  have  to  rate 
how  strong  they  are  in  favor  (disfavor)  of  each 
statement.  Then,  items  are  selected  by  excluding 
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those  to  which  very  different  respondents  gave  the 
same  answers  and  those  to  which  very  similar 
respondents  gave  very  different  responses,  only 
keeping  the  items  that  differentiate  most  clearly 
between  the  top  and  bottom  quarter  of  the  respon¬ 
dents.  The  set  of  remaining  items,  say  10,  should 
have  high  item-total  correlations,  that  is,  high  cor¬ 
relations  with  the  sum  score  of  all  items,  and  high 
discrimination  values.  In  the  third  step,  these  finally 
selected  items  are  used  in  the  main  survey.  The 
mean  values  of  the  item  scores  reflect  the  level  of 
political  efficacy  and  trust  of  each  respondent.  Note 
that  values  of  negatively/positively  formulated  items 
have  to  be  reversed  so  that  high  values  always  refer 
to  a  high/low  level  of  political  efficacy  and  trust. 

There  are  several  drawbacks  to  just  adding  the 
scores  of  such  items.  Let  us  take  a  5-point  scale 
running  from  1  to  5,  from  strongly  agree  to 
strongly  disagree.  First,  the  extreme  points,  say 
means  that  are  greater  than  4.5  and  smaller  than 
1.5,  are  relatively  easy  to  reconstruct,  while  other 
values  can  be  the  result  of  very  different  responses; 
for  example  (considering  two  variables  only),  giv¬ 
ing  a  3  two  times  produces  the  same  mean  as  2  and 
4,  4  and  2,  1  and  5,  and  5  and  1 — that  is,  very 
different  answers  result  in  the  same  scale  values. 
Second,  in  case  of  5-point  items,  the  mean  catego¬ 
ries,  often  verbalized  as  “neither-nor,”  have  two 
meanings:  On  the  one  hand,  they  are  used  as  real 
“neither-nor”  and,  on  the  other  hand,  as  a  substi¬ 
tute  for  “I  don’t  know”  (Jorg  Blasius  &  Victor 
Thiessen,  2001).  Third,  there  may  be  respondent- 
specific  answer  formats:  While  some  people  use 
the  extreme  categories  quite  often,  others  are  more 
careful,  for  example,  using  agree  more  often  than 
strongly  agree.  Fourth,  all  items  have  the  same 
weight,  for  example,  all  items  determine  the  latent 
variable  to  the  same  degree.  All  these  problems  are 
well  known;  applying  principal  component  analy¬ 
sis  (PCA)  provides  different  values  for  the  items 
and  solves  the  fourth  problem.  Even  better  is 
applying  categorical  PCA,  which  provides  differ¬ 
ent  weights  for  the  single-item  categories,  solving 
the  first  problem  at  least  partly.  For  solving  the 
second  and  third  problem,  dichotomized  items  are 
often  used — with  the  drawback  of  receiving  less 
information  from  the  single  indicators. 

Before  discussing  dichotomous  items,  we  intro¬ 
duce  ordinal  variables  with  categories  that  are  not 
in  symmetric  order:  for  example, 


There  has  been  some  discussion  about  abortion 
during  recent  years.  I  am  going  to  read  you  a 
short  list  of  opinions.  Please  tell  me  which  one  of 
the  opinions  best  agrees  with  your  view?  .  .  .  : 

1.  By  law,  abortion  should  never  be  permitted. 

2.  The  law  should  permit  abortion  only  in  case  of 
rape,  incest,  or  when  the  woman’s  life  is  in 
danger.  3.  The  law  should  permit  abortion  for 
reasons  other  than  rape,  incest,  or  danger  to  the 
woman’s  life,  but  only  after  the  need  for  the 
abortion  has  been  clearly  established.  4.  By  law, 
a  woman  should  always  be  able  to  obtain  an 
abortion  as  a  matter  of  personal  choice.  (ANES, 
2004,  http://www.electionstudies.org) 

In  this  example,  there  is  an  implicit  order  of 
responses,  but  it  includes  four  categories  only. 
Considering  an  additional  response  such  as  “the 
law  should  permit  abortion  in  cases  when  the 
child’s  life  is  in  danger”  would  pose  the  problem 
where  to  place  it — between  Categories  2  and  3  or 
between  Categories  3  and  4?  Another  frequently 
discussed  example  is  religious  beliefs;  here,  as  an 
example,  are  seven  items  from  the  WVS  1996  to 
which  respondents  could  either  agree  or  disagree: 
(1)  Do  you  believe  in  God?  (2)  Do  you  believe 
people  have  a  soul?  (3)  Do  you  believe  in  sin? 
(4)  Do  you  believe  in  life  after  death?  (5)  Do  you 
believe  in  heaven?  (6)  Do  you  believe  the  Devil 
exists?  (7)  Do  you  believe  in  hell? 

In  the  1940s,  Louis  Guttman  (1944,  1950) 
developed  a  method  based  on  a  set  of  dichotomous 
variables  as  the  one  above,  which  is  known  as  sca- 
logram  analysis,  cumulative  scaling,  or  Guttman 
scaling.  His  idea  was  to  construct  a  set  of  items  that 
has  an  inherent  order,  say,  in  religious  beliefs.  If  the 
order  of  the  items  is  perfect,  every  respondent 
should  agree  with  the  items  as  long  as  they  are 
below  their  own  level  of  difficulty.  Thereby,  diffi¬ 
culty  is  reflected  by  the  share  of  agreements:  The 
fewer  agreements  an  item  receives,  the  more  diffi¬ 
cult  it  is.  With  respect  to  the  WVS  1996,  Wijbrandt 
van  Schuur  and  Jorg  Blasius  (2006)  found  for  the 
Netherlands  an  empirical  order  of  soul,  God,  sin, 
life  after  death,  heaven,  Devil,  and  hell;  that  is,  it 
was  most  difficult  for  the  respondents  to  believe  in 
hell  and  easiest  to  believe  in  soul.  In  a  perfect 
Guttman  scale,  nobody  was  expected,  for  example, 
to  believe  in  God  without  believing  in  soul;  and 
somebody  who  believes  in  hell  should  believe  in  all 
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the  other  religious  items.  The  level  of  religious 
beliefs  for  all  respondents  is  equal  to  their  number 
of  agreements,  which  are  in  case  of  a  perfect 
Guttman  scale  the  ranking  places  of  their  most  dif¬ 
ficult  items.  The  measurement  error  of  the  scale  is 
the  share  of  misplaced  zeros,  for  example,  the  share 
of  neglected  items  between  two  accepted  items. 

The  main  problem  of  this  scale  is  that  all  items 
receive  the  same  weight;  the  respondent  scores  are 
equal  to  the  number  of  agreements.  An  extension 
of  this  method  is  Rasch  scaling  where  it  is  assumed 
that  the  items  follow  a  probabilistic  distribution 
instead  of  a  deterministic  one.  In  Rasch  scaling,  the 
distances  between  the  items  and  their  weights  are 
not  equal;  they  are  estimated  by  using  the  logistic 
distribution. 

Jorg  Blasius 
University  of  Bonn,  Institute  for 
Political  Science  and  Sociology 
Bonn,  Germany 
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The  word  medium  denotes  an  intermediary  agency 
that  enables  communication,  by  which  is  meant 
the  production  and  transmission  to  other  parties 
of  messages,  information,  knowledge,  discourses, 
and  culture  in  the  broad  sense.  Historically,  the 
first  electronic  media — the  telegraph  and  the 
telephone — arose  at  the  height  of  the  Industrial 
Revolution,  together  with  the  electrification  that 
gradually  came  to  replace  steam  power.  These 
media  had  the  capacity  to  restructure  people’s 
perceptions  of  time  and  space.  Electronic  media 
today,  in  the  fullest  sense,  are  radio,  television, 
and  the  so-called  new  media  (in  this  entry  desig¬ 
nated  as  “online”  media).  These  media  have  vari¬ 
ous  features  in  common: 

1 .  they  transmit  knowledge  to  a  heterogeneous  and 
potentially  limitless  audience, 

2.  they  are  typical  products  of  late  modernity, 

3.  they  are  important  agents  of  socialization,  and 

4.  they  perform  an  essential  role  in  democratic 
processes. 

Only  with  the  development  of  electronic  media 
has  it  been  possible  to  speak  properly  of  the  “rise 
of  mass  communication.”  Communication  media 
in  early  modernity  disseminated  knowledge,  cul¬ 
ture,  and  ideas — such  as  books,  newspapers,  the 
theatre,  architecture,  painting,  and  sculpture — but 
widespread  illiteracy  and  generalized  poverty  pre¬ 
vented  access  to  them  for  the  bulk  of  the  popula¬ 
tion.  Although  printing  had  been  invented  in  the 
1 5th  century,  books  and  newspapers  could  only 
find  a  “mass  market”  after  political,  social,  and 
economic  conditions  had  radically  changed. 
Moreover,  in  the  20th  century,  cinema,  radio,  and 
then  television  also  became  more  easily  available 
and  were  associated  with  the  leisure  and  entertain¬ 
ment  that  Europeans  had  attained  through  social 
struggles  in  the  late  19th  and  early  20th  centuries. 
These  characteristics  ensured  that  mass  media 
indeed  became  ubiquitous. 

This  entry  first  outlines  the  history  of  the  elec¬ 
tronic  media,  paying  particular  attention  to  the 
early  development  of  radio,  television,  and  the 
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Internet.  This  is  followed  by  a  survey  of  some  of 
the  main  theories  espoused  in  research  on  mass 
communication — in  particular,  the  mass  commu¬ 
nication  research  derived  from  studies  on  political 
communication  conducted  in  the  United  States 
during  the  1940s  and  1950s  and  media  studies  in 
Great  Britain  during  the  1970s  and  1980s.  Next, 
the  role  of  the  electronic  media  in  the  political 
dynamics  of  contemporary  democracies  is  ana¬ 
lyzed.  This  topic  will  be  addressed  from  two  points 
of  view:  The  first  concerns  the  influence  and 
impact  of  electronic  media  on  political  attitudes 
and  behavior — an  issue  explored  by  the  first  analy¬ 
ses  in  communication  research  and  even  today  one 
of  its  main  fields  of  inquiry.  The  second  aspect 
deals  with  the  effects  of  the  electronic  media  on  the 
political  system,  which  are  commonly  identified 
with  the  “mediatization  of  politics.” 

The  Rise  of  Electronic  Media 

Radio 

From  a  technological  point  of  view,  the  radio 
evolved  out  of  the  telegraph  and  the  telephone.  A 
decisive  advance  in  the  development  of  the  radio  as 
we  know  it  today  occurred  in  June  1896,  when 
Guglielmo  Marconi  presented  in  Great  Britain  the 
progress  achieved  in  the  study  of  electromagnetic 
waves  for  the  production,  transmission,  and  recep¬ 
tion  of  acoustic  signals.  The  subsequent  evolution 
of  applied  studies  on  radio  broadcasting  led  to  the 
medium’s  first  application  in  business  and  public 
administration  and  subsequently  to  its  use  for 
military  communications  during  World  War  I.  The 
radio  gradually  entered  private  homes  from  the 
1920s  onward,  first  in  the  United  States  and  then 
in  Great  Britain  and  the  Netherlands.  Beginning 
with  the  British  Broadcasting  Company  (BBC)  in 
1922,  followed  by  the  National  Broadcasting 
Company  (NBC)  in  1925  and  the  Columbia 
Broadcasting  System  (CBS)  in  1927,  the  institu¬ 
tionalization  of  companies  able  to  organize  and 
produce  programs  for  a  broad  public  transformed 
radio  broadcasting  into  a  full-fledged  form  of  mass 
communication. 

Political  powers  understood  the  potential  of 
radio  as  early  as  the  1930s;  from  that  period  until 
the  outbreak  of  World  War  II,  the  radio  became  a 
powerful  instrument  with  which  to  manipulate  the 
mass  public.  In  that  decade,  in  fact,  authoritarian 


regimes  in  Europe  controlled  and  used  radio 
broadcasting  for  the  purposes  of  propaganda.  The 
Soviet  Union  was  the  first  regime  to  exploit  the 
radio  systematically  so  that  its  voice  could  pene¬ 
trate  homes  in  even  the  remotest  regions.  The  same 
propaganda  system  was  adopted  in  Nazi  Germany, 
where  Joseph  Goebbels  played  a  decisive  role  in  its 
development,  first  as  chief  of  Nazi  propaganda 
and  then  as  Minister  of  Public  Enlightenment  and 
Propaganda  (Volksaufklarung).  In  fascist  Italy, 
too,  the  radio  became  the  voice  of  the  regime,  first 
with  the  creation  of  a  central  body  for  the  control 
of  information  and  propaganda  and  then  with 
government  measures  to  increase  collective  radio 
listening  in  public  places.  The  radio  was  given  the 
delegated  power  of  amplification  and  persuasion. 
In  democratic  countries,  American  President 
Franklin  Delano  Roosevelt  used  the  radio  to 
deliver  his  “fireside  chats”:  talks  that  were  broad¬ 
cast  every  week  throughout  the  country  and  in 
which  Roosevelt  directly  addressed  American  citi¬ 
zens,  his  purpose  also  being  to  create  and  strengthen 
a  favorable  climate  of  public  opinion  during  the 
Depression  and  subsequently  to  boost  the  nation’s 
morale  during  World  War  II.  From  the  end  of  the 
1960s  onward,  radio  lost  its  position  as  the  princi¬ 
pal  mass  medium  as  a  result  of  the  diffusion  of 
television. 

The  massive  use  of  radio  in  homes  and  public 
places,  the  growth  of  the  organizations  that  pro¬ 
duced  its  programs,  and  its  use  for  political  propa¬ 
ganda  purposes  also  aroused  strong  interest  among 
academic  researchers.  It  was  mainly  in  U.S.  com¬ 
munication  research  that  the  first  pioneering  stud¬ 
ies  on  radio  were  conducted,  with  the  focus,  in 
particular,  on  the  effects  produced  on  the  audi¬ 
ence.  The  political  scientist  Harold  D.  Lasswell 
also  concentrated  on  the  content  of  messages  by 
analyzing  key  symbols  in  Radio  Moscow’s  propa¬ 
ganda  broadcasts.  From  a  more  sociological  per¬ 
spective,  the  social  impact  of  radio  broadcasting 
was  studied  as  early  as  1940.  The  Princeton  Radio 
Project,  for  example,  challenged  beliefs  concerning 
the  medium’s  purportedly  excessive  power.  These 
were  market  analyses  of  a  quantitative  type  based 
mainly  on  opinion  polls.  One  of  the  first  and  best 
known  of  these  studies  examined  the  public’s  reac¬ 
tions  to  Orson  Welles’s  dramatization  of  The  War 
of  the  Worlds  broadcast  as  part  of  the  CBS 
Mercury  Theatre  radio  series.  This  program  was 
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reported  to  have  caused  panic  among  its  listeners, 
many  of  whom  believed  that  the  simulation  of  an 
invasion  from  Mars  was  real.  However,  research 
showed  that  these  effects  had  been  mediated  by 
numerous  other  factors  not  dependent  on  the 
medium. 

Television 

Like  the  radio,  television  as  a  means  of  commu¬ 
nication  first  came  into  use  in  Great  Britain  and  the 
United  States  and  then  spread  through  other 
European  countries.  The  technological  develop¬ 
ment  of  this  medium  is  more  difficult  to  recon¬ 
struct,  because  diverse  research  groups,  countries, 
and  companies  already  consolidated  in  other  sec¬ 
tors  (from  photography  to  the  cinema)  contributed 
to  its  success.  A  decisive  step  in  the  diffusion  of  this 
new  medium,  with  its  combination  of  sounds  and 
images,  came  during  the  mid-1930s  in  Great 
Britain,  where  this  service  began  to  be  offered  to  a 
limited  public  and  was  resumed  in  1946  after  the 
interruption  of  World  War  II.  In  the  meantime, 
television  had  also  developed  in  the  United  States, 
where  already  in  the  early  1950s  there  were  10  mil¬ 
lion  television  sets  in  American  homes.  The  linkage 
between  radio  and  television  services  has  been  one 
of  the  distinctive  features  of  the  development  of 
television.  In  fact,  organizations  such  as  the  BBC  in 
Great  Britain,  and  CBS  and  NBC  in  the  United 
States,  have  been  the  vehicles  through  which  televi¬ 
sion  has  been  strengthened,  spread,  and  entrenched 
in  the  everyday  habits  of  millions  of  consumers.  In 
television’s  first  decades  of  existence,  while  in  the 
United  States  the  development  and  production  of 
television  programs  was  immediately  appropriated 
by  the  market,  in  Europe  it  was  allocated  to  a  pub¬ 
lic  service:  The  organizations  that  produced  and 
broadcasted  programs  were  official  institutions 
financed  mainly  by  license  fees.  Only  after  the  mid- 
1970s  did  private  organizations  in  Europe,  too,  go 
into  the  business  of  television  information  and 
entertainment  and  obtained  large  market  shares.  In 
this  way,  the  commercialization  of  television 
occurred,  and  advertising  became  the  main  source 
of  financing.  Some  public  service  broadcasters  also 
began  to  supplement  their  revenues  by  broadcast¬ 
ing  commercial  messages. 

The  advent  of  television  brought  about  one  of 
the  greatest  social,  cultural,  and  political  changes 


of  the  20th  century.  Academic  studies  on  com¬ 
munication,  which  had  already  acquired  strong 
legitimacy  in  pretelevision  social  research,  were 
mobilized  to  examine  the  characteristics  and 
effects  of  this  change.  The  result  was  a  large  and 
diversified  scientific  output,  with  an  abundance  of 
theoretical  perspectives  and  methodologies,  as 
well  as  empirical  data  and  interpretative  para¬ 
digms.  Now  that  television  had  grown  into  a 
global  phenomenon,  the  study  of  mass  communi¬ 
cation  became  internationalized,  and  American 
communication  research  was  supplemented  by 
British  media  studies,  French  poststructuralist  theo¬ 
ries,  and  other,  minor,  schools  of  thought.  In  this 
period,  moreover,  media  analysts  began  to  examine 
the  other  actors  involved  in  the  communication  pro¬ 
cess:  not  only  the  receiver  but  also  the  transmitter 
and  the  content  that  was  transmitted.  It  was  during 
the  late  1970s  and  early  1980s  that  a  large  body  of 
studies  concerned  with  the  routines  and  logics  deter¬ 
mining  the  production  of  information  were  con¬ 
ducted,  which  provided  a  detailed  analysis  of  the 
content  of  television  programs.  As  research  extended 
to  include  the  other  components  of  the  communica¬ 
tion  process,  questions  were  raised  as  to  the  effect  of 
television  on  its  audience.  The  impact  and  ubiquity 
of  the  medium  aroused  fears  that  it  influenced  the 
public  excessively  in  terms  of  voting  behavior,  con¬ 
sumption,  lifestyle  choices,  and  the  construction  of 
frames  of  meaning  to  interpret  reality. 

Not  coincidentally,  it  was  precisely  in  this 
period  that  “political  communication” — a  specific 
area  of  communication  studies — acquired  greater 
autonomy  and  further  academic  legitimacy,  both 
in  political  science  and  sociology.  The  link  between 
the  development  of  electronic  media  and  the  prac¬ 
tices  of  political  communication  has  been  well 
evidenced  in  the  work  of  Jay  Blunder  and  Dennis 
Kavanagh,  who  have  identified  three  phases  in 
political  communication  since  World  War  II.  The 
first  phase,  which  lasted  until  the  1950s,  saw  the 
dominance  of  the  political  scene  by  parties  that  fed 
on  the  social  cleavages  and  dynamics  typical  of  the 
reconstruction  years.  In  that  phase,  political  com¬ 
munication  was  subordinate  to  a  system  of  institu¬ 
tions  and  solid  political  loyalties.  Citizens 
responded  to  party-political  communications  by 
strengthening  the  opinions  and  attitudes  already 
manifest  in  their  strong  political  allegiances.  The 
second  phase,  from  the  1960s  through  the  1980s, 
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was  marked  by  the  spread  of  television.  It  was 
during  this  period  that,  albeit  to  a  different  extent 
in  the  existing  democracies,  traditional  political 
loyalties  increasingly  faded.  A  weakening  of  the 
public’s  selection  mechanisms  exposed  it  to  the 
messages  of  all  political  leaders,  major  and  minor, 
whether  during  electoral  campaigns  or  otherwise. 
Television,  moreover,  enabled  politicians  to  reach 
sections  of  the  electorate  that  had  made  scant  use 
of  the  old  media  and  had  been  consequently  unaf¬ 
fected  by  party-political  communication.  Television 
formats  and  languages  had  an  impact  not  only  on 
the  public  but  also  on  the  time  frames,  languages, 
and  forms  of  presentation  of  politics.  The  third 
phase  began  in  the  1990s  and,  since  then,  has 
consisted  of  five  main  processes:  (1)  professional¬ 
ization  of  the  relationship  with  public  opinion, 
(2)  increased  competition  between  media  contents 
and  political  communication,  (3)  populism,  (4)  cen¬ 
trifugal  communication,  and  (5)  sporadic  consump¬ 
tion  of  political  communication.  The  growth  and 
spread  of  online  media,  especially  the  Internet,  have 
prompted  academic  studies  to  envisage  the  onset  of 
a  fourth  phase  of  political  communication,  which  is 
discussed  in  the  following  section. 

In  the  second  half  of  the  1990s,  technological 
development  also  decisively  altered  the  way  in 
which  television  was  conceived.  The  media  scenario 
radically  changed  because  traditional  analogue  ter¬ 
restrial  television  was  now  supplemented  by  satellite 
and  digital  broadcasting.  Today,  digitization  is  hav¬ 
ing  three  major  consequences:  (1)  growth  in  multi- 
media  and  convergent  practices  in  production  and 
consumption,  (2)  proliferation  of  mono/multime¬ 
dia  channels,  and  (3)  the  internationalization  and 
globalization  of  content  (with  a  small  and  well- 
identifiable  group  of  media  companies  acting  as 
global  leaders). 

The  Internet 

The  gestation  and  early  development  of  the 
Internet  were  much  slower  than  those  of  radio  and 
television,  but  thereafter,  technological  progress 
meant  that  this  new  medium  underwent  rapid  and 
tumultuous  growth.  It  still  harbors  numerous  sur¬ 
prises,  especially  in  the  applicative  domain.  The 
Internet  came  into  being  at  the  end  of  the  1950s, 
but  for  20  years  its  use  was  restricted  to  military 
communications  in  the  United  States.  During  the 


Cold  War,  the  creation  of  the  Advanced  Research 
Project  Agency  (ARPA)  in  1958  to  research  and 
develop  new  technologies  led  to  the  setting  up 
of  Arpanet,  a  system  that  linked  the  agency’s  com¬ 
puters.  With  time,  this  network — which  closely 
resembled  telephony  because  it  was  designed  for 
one-to-one  communication,  as  opposed  to  the  one- 
to-many  system  of  radio  and  television — also  cre¬ 
ated  connections  with  civil  and  nonmilitary  users 
(universities  and  research  centers),  giving  rise  to 
the  network  of  networks  today  known  as  the 
Internet.  The  closure  of  Arpanet  in  1989  enabled 
the  expansion  and  commercialization  of  the 
Internet,  so  that  users  belonging  to  neither  the 
armed  forces  nor  research  centers  could  access  it. 
Thereafter,  with  the  advent  of  the  World  Wide 
Web,  the  Internet  underwent  extremely  rapid 
development,  which  definitively  changed  it  from 
military  to  civil  use  and  increasingly  enlarged  its 
boundaries.  Since  the  1990s,  access  to  and  use  of 
the  Internet  has  also  been  favored  by  technological 
upgrading.  Greater  speed  of  transmission  and 
increased  content  have  been  followed  by  technolo¬ 
gies  that  facilitate  Internet  access  and  connection 
(broadband,  Wi-Fi,  and  Bluetooth).  These  succes¬ 
sive  technological  advances  have  given  rise  to  phe¬ 
nomena  that  have  radically  changed  not  only 
Internet  use  but  the  entire  media  system  as  well. 
The  process  of  multimedia  convergence,  which 
had  already  come  about  in  television,  was  driven 
by  the  growth  of  quality  and  the  contents  trans¬ 
missible  via  the  Internet.  Moreover,  because  of 
these  technological  changes,  the  use  of  the  Internet 
is  now  becoming  increasingly  interactive. 

Academic  research  in  the  field  immediately 
grasped  the  importance  of  this  new  communication 
medium.  But  the  interest  of  social  research  increased 
especially  during  the  transition  from  the  so-called 
Web  1.0  to  the  Web  2.0  and  subsequently  intensi¬ 
fied  with  the  further  transition  to  Web  3.0.  The 
term  Web  1.0  indicates  merely  passive  use  of 
the  tool:  access  to  use  the  contents  offered  by  the 
Internet.  Web  2.0  denotes  the  evolution  that  led  to 
greater  interaction  in  the  use  of  Internet  content, 
with  users  being  able  to  leave  comments  and  pro¬ 
vide  feedback.  The  next  development  has  been 
termed  the  convergence  culture  driven  by  the 
spread  of  Web  logs  (blogs)  and  the  creation  of 
social  networks  (Facebook,  YouTube,  and  Twitter, 
among  others).  Studies  on  the  political  implications 
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of  these  technological  changes  have  suggested  that 
a  fourth  phase  of  political  communication  can  be 
added  to  the  three  already  identified.  This  interpre¬ 
tation,  however,  is  anything  but  unanimously  held, 
and  opinions  have  polarized  between  two  extremes. 
On  the  one  hand,  there  are  those  who  maintain 
that  this  technological  revolution  is  producing  an 
electronic  democracy  in  which  grassroots  practices 
create  a  public  sphere  in  which  citizens  can  advo¬ 
cate  their  own  issues,  build  alternative  frames  for 
institutional  political  communication,  mobilize 
public  opinion  on  certain  themes,  and  act  as 
“watchdogs”  on  the  public  powers.  On  the  other 
hand,  there  are  those  who  argue  that  the  develop¬ 
ment  of  the  Internet  and  the  opening  of  spaces  in 
which  users  can  express  their  opinions  strengthens 
the  power  of  those  who  already  possess  it,  primar¬ 
ily  because  in  this  way  disintermediation  channels 
can  be  activated:  Politics  has  preferential  forms  of 
communication  with  the  electorate  and  can  thus 
bypass  mediation  by  the  more  traditional  media 
(television  and  newspapers).  The  mass  of  content 
and  opinions  on  the  Internet  makes  it  possible  to 
monitor  the  mood  of  the  electorate  just  as — and  in 
certain  cases  better  than — any  type  of  quantitative 
survey  can. 

Applied  Research  Versus  Critical  Theory 

The  principal  theories  employed  in  the  study  of 
media  in  general  and  the  electronic  media  in 
particular — and  especially  their  effects  on  the 
public — can  be  related  to  two  particular  periods: 
the  1940s  onward  with  the  development  of  the 
sociology  of  communication  in  the  United  States 
and  the  mid-1960s  with  the  birth  of  “media  stud¬ 
ies”  in  Great  Britain. 

The  first  of  these  two  periods  was  characterized 
by  the  distinction  between  “applied  research”  and 
“critical  theory.”  Applied  research  originated  in 
U.S.  communication  research — which  was  the  first 
of  all  contexts  in  which  study  began  on  the  elec¬ 
tronic  media  (which  did  not  yet  include  television). 
As  its  founder  Paul  Lazarsfeld  defined  it,  applied 
research  was  academic  work  in  the  service  of 
external  public  or  private  customers,  and  it  devel¬ 
oped  techniques  to  gather  and  analyze  information 
on  attitudes  toward  the  mass  media  (especially 
radio,  print  media,  and  film).  It  is  significant  that 
the  first  research  undertaken  in  this  area  started 


from  opposite  questions  and  suggestions  concern¬ 
ing  the  results  obtained.  During  the  1940s,  the 
electronic  media  were  regarded  as  exercising  an 
almost  unlimited  power  over  the  public.  The 
period  between  the  two  world  wars,  in  fact,  was 
dominated  by  an  almost  universal  agreement  as  to 
the  capacity  of  the  media  to  exert  influence.  The 
literature  that  reconstructs  the  successive  theories 
on  media  effects  identifies  this  period  as  that  of  the 
“hypodermic  needle”  model:  The  media  were 
viewed  as  needles  able  to  penetrate  directly  the 
“skin”  of  their  audiences.  This  theory — which  had 
no  specific  founder  but  was  more  akin  to  a 
“mood”  concerning  how  the  mass  public  received 
and  internalized  media  contents — was  also  partly 
motivated  by  the  above-described  intensive  use 
made  of  the  radio  by  the  authoritarian  regimes  of 
Europe. 

The  applied  strand  of  U.S.  communication 
research  has  been  distinguished  by  the  work  of  the 
Bureau  of  Applied  Social  Research  of  Columbia 
University  in  New  York — whence  its  name  as  the 
“Columbia  school” — directed  by  Lazarsfeld.  Here, 
it  is  necessary  to  distinguish  between  two  different 
lines  of  inquiry,  which  collected  complementary 
results  but  used  different  methodologies:  the  exper¬ 
imental  psychology-based  approach  and  the 
empirical  sociology-based  one.  There  were  two 
main  similarities  between  these  two  lines  of  inquiry: 
First,  they  were  driven  by  a  particular  interest  in 
how  the  media  in  general,  and  the  electronic  media 
in  particular,  operate  during  election  campaigns 
(i.e.,  how  they  influence  voters);  second,  both  of 
them  led  to  the  definitive  superseding  of  the  behav- 
iorist  stimulus-response  model  typical  of  the  hypo¬ 
dermic  needle  theory. 

The  experimental  psychological  approach  must 
be  considered  as  an  autonomous  sector  of  com¬ 
munication  research.  In  this  case,  too,  it  was 
applied  mainly  in  the  area  of  political  communica¬ 
tion,  where  research  showed  that  the  audiences  of 
the  electronic  and  print  media  filter  their  contents 
selectively.  This  selectivity  depends  on  several  fac¬ 
tors:  the  interest  in  acquiring  information  (indiffer¬ 
ence  to  certain  themes  creates  impermeability  to 
the  messages  received),  selective  exposure  (the 
audience  tends  to  expose  itself  to  information  con¬ 
genial  with  its  attitudes  and  to  avoid  messages  that 
are  not),  selective  perception  (existing  predisposi¬ 
tions  influence  the  way  in  which  messages  are 
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received),  and  selective  retention  (contents  that 
match  the  audience’s  predispositions  are  remem¬ 
bered  more  easily).  This  type  of  inquiry  did  not 
focus  solely  on  intermediary  factors  relative  to  the 
public;  it  also  examined  those  relative  to  the  mes¬ 
sage,  seeking  to  understand  the  characteristics  that 
a  message  must  have  to  be  effectively  influential  on 
its  audience.  The  factors  identified  were,  among 
others,  the  credibility  of  the  communicator,  the 
order  of  arguments  (on  some  occasions  the  initial 
arguments  are  more  effective  and  on  others  those 
expounded  at  the  end  are),  the  completeness  of  the 
overall  argument,  and  the  clarity  of  the  conclusions. 
With  the  introduction  of  intermediary  factors, 
therefore,  it  was  established  that  the  media  consti¬ 
tute  just  one  component  of  political  persuasion  and 
work  in  ways  very  different  from  that  feared  in 
keeping  with  the  hypodermic  needle  theory. 

The  sociological  approach  definitively  struc¬ 
tured  the  history  of  communication  research.  The 
body  of  studies  conducted  in  this  area  was,  for  the 
U.S.  social  and  political  context  of  the  time,  so 
exhaustive  and  convincing  that  in  1959  Bernard 
Berelson — one  of  the  most  prestigious  members  of 
the  Columbia  school,  on  par  with  Robert  K. 
Merton  and  Elihu  Katz — declared  that  analysis  of 
media  effects  could  be  considered  complete.  This 
research  had  the  merit  of  linking  the  processes  of 
mass  communication  with  the  characteristics  of 
the  social  context  in  which  they  occurred.  For  this 
reason,  too,  a  further  step  forward  was  taken  from 
the  notion  arising  from  psychological  inquiry  that 
the  media  are  a  source  of  persuasion.  They,  indeed, 
have  an  influence;  but  it  is  a  limited  influence 
always  mediated  by  the  context  and  the  social  rela¬ 
tionships  in  which  media  users  are  embedded.  The 
best  known  studies  conducted  by  the  Columbia 
school  were  The  People’s  Choice  (1944),  Voting 
(1954),  and  Personal  Influence  (1955)  (the  first 
two  dealt  with  communicative  dynamics  during 
the  presidential  election  campaigns  of  1940  and 
1948,  respectively).  In  the  results  of  these  studies, 
three  theories  embracing  the  sociological  dimen¬ 
sions  of  the  effects  of  media  communication  can  be 
identified,  which  add  to  the  selectivity  of  the  psy¬ 
chological  approach: 

1.  Social  determinism:  Social  characteristics 
determine  political  preferences;  voting  is  an 
individual  behavior  regulated  by  collective  norms. 


2.  The  limited  effect  of  election  campaigns: 

Change  in  voting  intentions  in  response  to 
messages  received  during  the  election  campaign 
is  restricted  to  a  small  group. 

3.  Social  influence:  Messages  and  ideas  imparted 
by  the  mass  media  reach  the  “opinion  leaders,” 
who  act  as  intermediaries  for  the  less  active 
members  of  the  electorate. 

In  short,  the  media  are  dependent  variables  in  the 
process  of  influence. 

The  first  studies  on  television  then,  however, 
reversed  this  vision  of  the  limited  effects  of  the 
media.  Using  the  theoretical  framework  developed 
within  communication  research,  Jay  G.  Blunder 
and  Denis  McQuail  studied  the  1964  British  elec¬ 
tion  campaign,  showing  that  voters/televiewers  did 
indeed  expose  themselves  to  messages  to  have  their 
opinions  confirmed.  But,  to  a  statistically  very  sig¬ 
nificant  extent,  they  did  so  to  know  whom  to  vote 
for,  to  keep  up  with  the  campaign,  and  to  gain  an 
idea  of  how  one  party  or  the  other  was  behaving. 

The  counterpoint  to  the  empirical  approach 
adopted  by  much  U.S.  research — and  also  typical  of 
other  schools,  such  as  the  “voting  studies”  carried 
out  at  the  Survey  Research  Center  of  the  University 
of  Michigan — was  critical  theory.  This  theory, 
which  sought  to  uncover  power  relations  in  cultural 
phenomena,  is  historically  identified  with  the  group 
of  scholars  at  the  Institut  fur  Sozialforschung  of 
Frankfurt — better  known  as  the  Frankfurt  School — 
who  greatly  influenced  culture  and  research  in 
Europe  and  the  United  States.  The  advent  of 
Nazism  forced  some  members  of  the  school  to 
move  to  the  United  States,  where  they  made  contact 
with  the  Columbia  school.  A  close  intellectual  rela¬ 
tionship  developed  between  Theodor  Adorno  and 
Paul  Fazarsfeld,  and  they  frequently  engaged  in 
joint  research.  The  attention  of  the  Frankfurt 
School  in  the  United  States  at  the  New  School  of 
Social  Research  in  New  York  shifted  to  the  phe¬ 
nomenon  of  the  “massification”  of  culture.  The 
intention  of  Max  Horkheimer,  Theodor  Adorno, 
and  Herbert  Marcuse  was  to  bring  social  research 
back  to  a  theory  of  society  understood  as  a  whole. 
They  consequently  criticized  communication 
research  for  conducting  what  they  regarded  as  spec¬ 
tral  studies.  They  focused  on  the  new  dynamics 
characterizing  society  at  that  time:  in  particular  the 
birth  and  growth  of  the  culture  industry,  in  which 
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the  electronic  media  predominated.  For  the 
Frankfurt  School,  the  mass  market  compelled  stan¬ 
dardization  and  uniformity:  The  tastes  and  needs  of 
the  public  imposed  stereotypes  and  poor  quality. 
The  logic  of  the  production  system  that  created  and 
disseminated  “mass  culture”  was  that  of  ideological 
hegemony  and  manipulation  of  the  public,  which 
lost  all  its  autonomy.  The  individual  envisaged  by 
critical  theory  was  no  longer  able  to  select  among 
media  messages;  rather,  she  or  he  was  compelled  to 
adjust  uncritically  to  imposed  values  and  conse¬ 
quently  forced  into  a  subordinate  role. 

Despite  the  numerous  criticisms  brought  against 
critical  theory — indeed,  many  of  its  contentions 
became  obsolete  owing  to  social  changes  and  to 
other  lines  of  inquiry — it  had  the  merit  of  shifting 
the  focus  of  attention  among  media  analysts. 
Research  no  longer  concentrated  solely  on  media 
effects  but  extended  its  scope  to  include  the  role  of 
the  media  in  society.  This  approach  was  not  new, 
however,  for  it  had  precedents  in  the  U.S.  function¬ 
alist  branch  of  communication  research  developed 
by  Merton  in  the  1960s,  which  originally  sprang 
from  the  studies  on  the  functions  of  the  media  con¬ 
ducted  in  the  1930s  by  Lasswell.  The  functionalist 
theory  of  the  media  had  numerous  proponents,  and 
it  had  important  consequences  for  communication 
studies.  It  showed  that,  although  the  media  were 
not  a  sufficient  cause  for  changes  in  people’s  opin¬ 
ions  and  behavior,  they  nevertheless  performed  an 
important  social  role.  One  theoretical  development 
that  arose  from  this  framework  was  the  analysis 
conducted  in  the  mid-1980s  by  Daniel  Dayan  and 
Elihu  Katz  of  “media  events.”  These  are  ceremoni¬ 
als  (such  as  the  Olympic  Games  or  the  Football 
World  Cup)  given  solemn  coverage  by  television 
and  that  interrupt  normal  schedules.  The  function 
of  these  important  media  events  is  ritual  unification 
of  the  viewers’  experiences.  They  unite  the  indi¬ 
viduals  watching  the  event  at  home,  so  that  the 
entire  society  is  involved  in  a  collective  celebration. 
These  events,  therefore,  affirm  a  shared  identity. 

The  earlier  hypotheses  and  suggestions  concern¬ 
ing  the  strong  influence  of  the  media  on  their  audi¬ 
ences  had  been  abandoned  in  light  of  the  findings 
of  U.S.  communication  research.  But  they  returned 
during  the  1970s — by  which  time  both  the  media 
and  society  had  substantially  changed.  There  were 
now  two  major  differences:  first,  the  apotheosis  of 
television  and  second,  the  decline  of  the  political 


parties.  Of  particular  interest  are  the  two  strands 
of  analysis  that  addressed,  with  different  out¬ 
comes,  this  changed  situation.  Both  arose  within 
British  media  studies  during  the  1970s:  “cultural 
studies” — particularly  the  section  devoted  to  audi¬ 
ence  studies — and  the  “political  economy”  of  the 
media.  Although  these  two  strands  sprang  from  a 
Marxist  tradition,  their  respective  research  trajec¬ 
tories  yielded  very  different  results. 

Starting  from  a  more  complex  vision  of  the 
dynamics  of  mass  communication,  cultural  studies 
analyzed  a  specific  form  of  the  social  process:  the 
one  by  which  the  public  makes  sense  of  reality 
through  its  use  of  media  contents.  The  focus  was, 
therefore,  on  shared  social  practices  and  on  a  com¬ 
mon  stock  of  meanings.  It  was  assumed  that  the  use 
of  contents  transmitted  by  the  electronic  media 
involved  values,  meanings,  and  practices  and  that 
the  media  were  active  elements  in  those  constructs. 
Investigation  was  therefore  made  of  the  structures 
whereby  the  institutions  of  mass  communication 
sustain  and  reproduce  social  and  cultural  stability. 
But  for  cultural  studies,  this  process  did  not  come 
about  smoothly;  rather,  it  was  negotiated  and  con¬ 
tradictory  because  social  and  cultural  stability  had 
necessarily  to  adjust  to  constant  conflicting  pres¬ 
sures.  Cultural  studies  developed  in  two  different 
directions,  analyzing,  on  the  one  hand,  the  produc¬ 
tion  of  media  contents  (paying  special  attention  to 
news  making  and  thus  combining  analysis  of  the 
electronic  and  the  print  media)  and,  on  the  other, 
media  consumption  as  the  locus  of  negotiation 
among  diverse  communicative  practices.  It  was  in 
the  evolution  of  the  latter  research  trajectory  that, 
despite  numerous  differences  in  methodology  and 
theory,  the  results  matched  those  already  obtained 
by  U.S.  communication  research.  Studies  con¬ 
ducted  in  this  area  highlighted  (a)  the  ambiguous 
and  contradictory  nature  of  media  texts  (especially 
televisual  ones,  although  some  studies  also  ana¬ 
lyzed  the  cinema),  which,  although  they  are  never 
completely  overt,  at  least  have  a  “structured  poly¬ 
semy,”  and  (b)  the  public  reconceptualized  as  an 
active  audience  able  to  produce  meanings  through 
its  use  of  media  contents.  These  findings  thus 
entailed  a  more  cautious  evaluation  of  the  allegedly 
strong  influence  of  the  media. 

The  political  economy  theory  was  part  of  the 
“radical  approach.”  While  cultural  studies  reevalu¬ 
ated  the  practices  of  the  audience,  the  political 
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economy  of  the  media  scaled  down  the  specificity 
of  the  cultural-ideological  dimension.  It  regarded 
economic  dynamics  as  furnishing  the  necessary 
explanation — in  some  cases  sufficient — to  under¬ 
stand  the  process  of  media  effects.  Here,  the  focus 
was  on  the  power  exerted  by  the  owners  of  organi¬ 
zations  on  both  the  production  process  and 
consumers.  This  approach  concentrated  on  two 
aspects  in  particular:  ownership  and  the  size  of  the 
market.  The  idea  with  regard  to  ownership  was 
that  the  owners  of  communications  media  were 
able  to  control  them,  so  that  they  also  exerted 
strong  influence  on  the  public’s  habits  and  tastes. 
As  regards  the  size  of  markets,  the  political  econ¬ 
omy  approach  highlighted  the  progressive  concen¬ 
tration  of  the  electronic  media  market,  arguing  that 
this  diminished  the  plurality  of  supply.  These  eco¬ 
nomic  variables,  therefore,  reduced  the  users  of 
media  content  to  subservience  while  strengthening 
the  positions  of  those  who  already  wielded  power 
in  society. 

Media  and  Political  Behavior 

As  mentioned  above,  academic  research  on  the 
media  on  the  two  sides  of  the  Atlantic  has  often 
addressed  themes  and  issues  that  concern  the 
sphere  of  politics.  Interest  in  the  impact  of  the 
media  with  regard  to  the  exercise  of  political 
power  first  arose  when  presidents  and  govern¬ 
ments  sought  to  know  public  opinion  so  that  they 
could  influence  it  or  when  political  parties  and 
candidates  wanted  to  use  the  media  effectively  for 
propaganda  purposes  during  election  campaigns. 
Both  the  Columbia  school  and  the  field  of  voting 
studies  investigated  the  role  of  not  only  the  mass 
media  but  also  social  networks  in  the  dynamics  of 
influence  on  political  attitudes,  opinions,  and 
behavior.  This  was  the  area  of  inquiry  that  ana¬ 
lyzed  the  psychosocial  effects  of  the  media,  as 
opposed  to  the  systemic  effects  discussed  below. 
Turning  to  the  most  recent  period,  that  of  the  so- 
called  third  phase  of  political  communication,  the 
current  state  of  scientific  knowledge  suggests  that 
the  least  amount  of  influence  attributable  to  the 
media  is  due  to  their  being  the  principal  sources  of 
political  knowledge  and  political  information. 
However  desirable  it  may  be  to  reduce  the  influ¬ 
ence  of  the  media,  it  is  indisputable  that  for  citi¬ 
zens  they  are  essential  vehicles  of  knowledge  about 


political  affairs.  The  media,  in  their  twofold  form 
as  electronic  and  print  media,  often  initiate  pro¬ 
cesses  of  individual  or  interpersonal  knowledge 
processing.  Despite  the  massive  growth  of  infor¬ 
mation  and  its  delivery  through  new  channels,  to 
state  that  the  media  are  the  main  sources  of  politi¬ 
cal  knowledge  is  not  to  imply  that  the  current 
phase  of  political  communication  has  seen  the 
advent  of  a  rational  citizen  informed  on  all  politi¬ 
cal  facts.  Numerous  studies  have  shown  that  civic 
commitment  in  the  mature  democracies  coincides 
with  low  knowledge  gathering.  The  U.S.  scholar 
Michael  Schudson  has  introduced  the  notion  of  the 
“monitorial  citizen.”  Such  citizens  are  substan¬ 
tially  uninterested  in  political  information  and 
action,  they  monitor  events,  and  they  mobilize 
when  a  problem  of  particular  interest  to  them 
arises.  The  information  used  by  monitorial  citizens 
comes  in  small  doses;  it  is  the  “soft  news,”  which 
may  be  considered  insufficient  but  is  all  that  they 
feel  they  need. 

With  regard  to  political  knowledge,  a  second 
level  in  the  degree  of  influence  exerted  by  the 
media  is  that  at  which  they  supply  clues  with 
which  to  interpret  political  reality.  The  media 
should  not  be  considered  merely  as  sources  of 
knowledge  and  information  but,  on  the  contrary, 
as  vehicles  of  meanings,  emotions,  and  visions  of 
the  world.  The  frames  through  which  the  media 
treat  events  significantly  influence  people’s  atti¬ 
tudes,  beliefs,  and  preferences.  A  third  level  is  the 
one  emphasized  by  the  agenda-setting  theory:  The 
media  are  able  to  determine  what  people  should 
know  simply  by  giving  coverage  to  some  facts  and 
ignoring  others.  The  greater  the  exposure  of  the 
public  to  information,  the  more  intense  the  agenda¬ 
setting  effect  becomes. 

Studies  more  specifically  concerned  with  the 
influence  of  the  media  on  political  behavior — be  it 
participation  or  voting — can  be  divided  between 
two  opposing  groups:  those  that  are  pessimistic 
and  those  that  are  optimistic.  Lazarsfeld  and 
Merton  had  already  spoken  in  communication 
research  of  the  “narcotizing  effect”  of  the  media. 
As  already  emphasized,  their  attitude  toward  the 
media  did  not  consist  of  a  preconceived  and  ideo¬ 
logical  aversion.  Instead,  their  judgment  arose  from 
the  observation  that  increasing  doses  of  mass  media 
content  sap  the  energy  that  individuals  reserve  for 
active  participation.  These  considerations  still  did 
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not  take  account  of  television,  which  would  become 
a  decisive  factor — both  in  the  United  States  and 
Europe — in  reinvigorating  the  notion  that  the  diffu¬ 
sion  of  the  electronic  media  deflected  political  par¬ 
ticipation.  It  was  Robert  Putnam  who  argued  that 
television  privatizes  civic  activity  and  discourages 
social  interaction  and  individual  civic  commitment. 

Contrary  to  these  pessimistic  views  are  those 
that  instead  see  the  increased  circulation  of  politi¬ 
cal  information  as  an  important  stimulus  to 
participation.  This  position  has  been  bolstered  by 
the  spread  of  the  online  media  and  the  consequent 
proliferation  and  fragmentation  of  information 
channels.  The  Internet,  in  fact,  has  on  several  occa¬ 
sions  proved  to  be  an  important  resource  for  mobi¬ 
lization,  even  on  a  global  scale. 

Research  has  not  yet  reached  clear  and  firm 
conclusions  with  regard  to  media  influence  on  vot¬ 
ing  choices.  Although  such  influence  cannot  be 
ruled  out,  it  is  still  difficult,  if  not  impossible 
according  to  some  authors,  to  produce  clear-cut 
empirical  evidence  for  it.  The  media  share  their 
influence  with  numerous  other  determinants  of 
voting  behavior,  which  international  research  has 
listed  as  follows:  party  identification,  the  positions 
of  parties  and  candidates  on  issues,  negative  polit¬ 
ical  preference,  and  the  image  of  the  party  leader. 
In  addition,  this  is  connected  with  informal  net¬ 
works  of  social  interaction. 

A  great  deal  of  research  has  focused  on  televi¬ 
sion’s  influence  on  the  outcomes  of  elections,  but 
once  again,  the  results  have  been  contradictory 
because  the  forms  of  influence  observed  in  one 
country  are  not  directly  applicable  to  others,  given 
the  differences  in  political  culture,  traditions  of 
electoral  behavior,  media  systems,  and  styles  and 
habits  of  television  consumption.  Television  cer¬ 
tainly  has  a  great  capacity  to  shape  visions  of  the 
world  and  therefore  political  opinions,  but  it  is  not 
clear  whether  this  capacity  operates  in  the  brief 
period  of  an  election  campaign  or  in  the  long 
period  of  formation  of  political  beliefs  and  atti¬ 
tudes.  Sociological  prudence  a  la  Lazarsfeld 
requires  distinctions  to  be  drawn  among  these 
diverse  effects  according  to  the  degrees  of  political 
conviction  among  citizen  electors.  The  sections  of 
the  electorate  that  are  either  undecided  or  do  not 
regularly  consume  television  information  seem 
more  susceptible  to  the  influence  of  television 
messages  during  election  campaigns,  while  voters 


with  strong  political  allegiances  are  more  resistant 
to  the  action  of  television.  In  short,  television — like 
other  media,  electronic  or  otherwise — does  not  have 
magical  powers  that  give  omnipotence  to  those  who 
use  or  control  it — at  least,  not  in  a  context  of  a 
healthy  democracy  and  sufficient  pluralism  in  infor¬ 
mation  sources  and  media.  In  a  situation  where 
channels  for  the  circulation  of  information  and 
ideas  proliferate  by  virtue  of  new  technologies — 
digital  television,  satellite  broadcasting,  mobile  tele¬ 
phony  with  its  numerous  and  versatile  uses,  Internet 
applications  (blogs,  forums,  social  networks,  and 
the  like) — it  is  difficult  to  attribute  influence  pre¬ 
cisely  to  any  particular  electronic  medium.  But  what 
is  certain  is  that  all  these  media  are  today  important 
tools  of  political  communication. 

Media  and  the  Political  System 

Although  the  media,  and  the  electronic  media  in 
particular,  have  uncertain  effects  on  the  behavior 
of  individuals,  they  have  nonetheless  produced 
significant  changes  in  the  political  process.  They 
have  influenced  political  systems,  the  formation  of 
leadership,  and  the  ways  in  which  politicians  com¬ 
municate  to  such  an  extent  that  it  is  difficult  to 
describe  the  workings  of  a  political  system  or  the 
dynamics  of  a  polity  without  considering  their 
media-driven  dimensions.  Scientific  analysis  has 
highlighted  what  is  termed  the  mediatization  of 
politics,  the  contemporary  process  whereby  public 
political  action  takes  place  within  the  media  arena 
and  that  depends  to  a  significant  extent  on  the 
action  of  the  media.  The  systemic  perspective  on 
the  influence  of  the  electronic  media  has  generated 
a  large  body  of  studies  mainly  concerned  with  the 
effects  of  television.  During  the  1990s,  fears  were 
voiced  that  a  “videocracy”  was  imminent — 
especially  when  the  political  success  of  the  televi¬ 
sion  mogul  Silvio  Berlusconi  was  analyzed.  Among 
the  theorists  of  this  drift  toward  a  videocracy  is  the 
leading  Italian  political  scientist  Giovanni  Sartori, 
who  has  argued  that  television  has  turned  politics 
into  a  “videopolitics”  that  tends  to  replace,  if  not 
to  destroy,  the  political  parties,  at  least  in  their 
mass  organized  form.  Videopolitics  is,  however, 
also  an  attitude:  the  “videodependency”  of  the 
political  parties,  which  concern  themselves  increas¬ 
ingly  less  with  genuine  events  and  increasingly 
more  with  media  events. 
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The  effects  exerted  by  the  media  on  politics  are 
generally  grouped  into  two  categories:  media 
driven  and  political.  The  former  concern  the  media 
aspects  of  political  communication,  the  latter  the 
nature  and  functions  of  the  political  system. 

Media-Driven  Effects 

For  some  years,  research  on  the  role  of  the  media 
in  civil  society  and  politics  has  emphasized  the 
changes  that  have  taken  place  in  political  discourse 
and  in  the  ways  political  messages  are  produced, 
expressed,  and  transmitted.  Television  especially, 
despite  its  control  by  certain  governments  (in 
Europe  and  elsewhere),  has  managed  to  exert  influ¬ 
ence  on  politics  directly  with  more  aggressive  forms 
of  investigative  journalism  and  indirectly  with  the 
force  of  popular  culture  and  the  language  of  images. 

Political  actors  have  had  to  adjust  to  the  inva¬ 
sive  and  uncontrollable  presence  of  television.  The 
predominant  logic  of  this  electronic  medium  is 
commercial:  Television  companies — including  the 
public  ones  in  Europe — respond  to  industrial 
imperatives  in  order  to  remain  in  the  market.  This 
means  that  they  must  cater  to  popular  tastes  and 
demands,  producing  political  information  intended 
to  satisfy  the  needs  of  the  audience  rather  than 
those  of  political  leaders,  or  of  a  government.  The 
scientific  literature  has  often  pointed  to  infotain¬ 
ment,  “soft  news,”  “media  populism,”  and  the 
popularization  of  politics  (“pop  politics”)  as  the 
responses  by  the  televised  representation  of  politics 
to  the  commercial  logic  of  the  electronic  media. 

There  are  three  media-driven  effects  on  political 
discourse.  The  first  is  the  spectacularization  of  poli¬ 
tics.  Television  has  heightened  the  traditional 
tendency  of  politics  to  dramatization.  Under  its  spot¬ 
lights,  political  activity  has  come  to  lose  its  sacred 
aura  and  has  been  forced  to  adapt  its  traditional 
forms  to  the  new  languages  and  the  new  tastes  of 
television.  The  rhetoric  of  mass  mobilization  has 
given  way  to  the  rhetoric  of  seduction.  Today,  no 
politician  can  communicate  effectively  without 
couching  his  or  her  message  in  the  grammar  of  tele¬ 
vision.  The  spectacularization  of  political  discourse 
is  a  circular  process  whereby  the  media  dramatize 
politics,  emphasizing  its  more  marketable  aspects, 
and  the  politicians — in  search  of  consensus — obey 
the  rules  of  the  game  by  adopting  communicative 
strategies  that  secure  the  attention  of  the  media. 


A  second  effect  is  that  of  “agenda  shaping,”  the 
ability  of  the  media  to  select,  and  also  to  impose, 
the  important  issues  in  the  public  debate — those 
that  politicians  cannot  ignore  and  on  which  they 
must  take  a  stance.  It  is  true,  however,  that  not 
always  and  not  everywhere  are  the  media  able  to 
set  the  political  agenda,  for  this  depends  on  the  type 
of  power  relations  that  exist  between  the  political 
system  and  the  information  system.  Where  the  rela¬ 
tionship  is  that  of  subordination  of  the  latter  to  the 
former,  the  agenda  is  more  likely  still  firmly  in  the 
hands  of  the  government  and  the  parties. 

The  third  media-driven  effect  in  the  relationship 
between  media  and  politics  is  the  fragmentation  of 
political  discourse.  Television  has  reduced  the  pub¬ 
lic  and  political  debate  to  its  minimal  terms,  pre¬ 
cisely  because  of  the  production  constraints  imposed 
by  the  media  industry.  American  democracy  of  the 
mid- 19th  century  was  epitomized  by  the  debates 
between  Abraham  Lincoln  and  Stephen  A.  Douglas. 
For  the  debate  of  August  21,  1858,  the  two  con¬ 
tenders  agreed  that  Douglas  should  first  speak  for 
an  hour;  Lincoln  would  then  have  an  hour  and  a 
half  to  reply;  and  Douglas  would  finally  have  a 
further  half  hour  to  respond.  This  agreement  was 
reached  to  drastically  reduce  the  usual  length  of  the 
debate!  In  the  simultaneity  offered  by  the  electronic 
media,  political  discourse  instead  consists  of  sound 
bites  and  ready-made  quotes  perfectly  suited  to  the 
urgent  pace  of  television  news  or  talk  shows. 

Political  Effects 

As  regards  political  effects,  the  most  significant 
and  evident  of  them  in  modern,  and  all  the  more 
so  in  postmodern,  politics  is  the  “personalization” 
or  “individualization”  of  politics.  As  a  medium 
that  prefers  to  give  faces  to  ideas  and  issues,  televi¬ 
sion  has  changed  the  ways  in  which  politics  is 
conducted  by  politicians  and  perceived  by  citizens. 
The  mediatization  process  has  led  to  representa¬ 
tion  of  the  politician  as  no  longer  an  exponent  of 
a  party  or  an  ideology  but  as  a  person  with  his  or 
her  own  personal,  professional,  and  familial  pecu¬ 
liarities.  The  private  dimension  is  no  longer  disre¬ 
garded  by  the  media;  rather,  it  forms  a  part,  often 
a  large  one,  of  the  narration  of  politics. 

This  effect  cannot  entirely  be  attributed  to  the 
electronic  media  alone.  Personalization  is  also  the 
result  of  the  loss  of  centrality  of  party  organizations. 


1540  Media,  Electronic 


The  importance  of  personae — be  they  presidents, 
heads  of  government,  leaders  of  parties,  or  election 
candidates — has  increased  in  concomitance  with  the 
decline  of  the  great  ideologies  and  with  the  transfor¬ 
mation  of  parties  into  electoral  machines  at  the 
service  of  actions  and  programs  linked  with  specific 
political  personalities.  The  second  political  effect 
frequently  reported  by  research  on  political  com¬ 
munication  is  the  personalization  of  leadership, 
which  is  often  closely  connected  with  the  just  dis¬ 
cussed  individualization  of  political  representation. 
Democracy  headed  by  a  leader  is  the  form  best 
suited  to  modernization  in  the  Western  democra¬ 
cies,  and  especially  to  mass  communication.  The 
media,  with  their  peculiar  coverage  of  the  political 
events  and  their  celebrity-building  processes,  repre¬ 
sent  a  crucial  variable  in  the  personalization  of 
leadership  that  adds  to  the  institutional  variables 
(e.g.,  the  constitutional  order). 

The  third  and  final  political  effect  of  mediatiza- 
tion  concerns  the  selection  of  political  elites.  As  a 
result  of  the  loss  of  importance  by  party  organiza¬ 
tions,  the  mechanisms  for  recruitment  of  the  politi¬ 
cal  class  have  been  transferred  from  the  party 
machines  to  agents  external  to  the  party-political 
system,  which  operate  outside  the  control  of  the 
traditional  party  selectors.  Often  predominant 
among  these  external  agents  are  the  media,  whose 
preference  for  telegenic  personalities  skilled  in 
debate  and  with  a  ready  wit  has  given  rise  to  the 
so-called  winnowing  effect:  the  choice  of  candi¬ 
dates  who  perform  best  in  the  arenas  constituted  by 
the  media.  The  news  media  seek  to  dramatize  elec¬ 
tion  campaigns,  and  they  give  intense  coverage  to 
the  most  mediagenic  personalities,  creating  candi¬ 
date-celebrities  to  the  detriment  of  those  left  out  of 
the  spotlight.  In  this  regard,  the  United  States  pro¬ 
vides  a  good  example  of  the  power  to  select  politi¬ 
cal  elites  exercised  by  the  media  in  the  political 
arena.  This  effect  is  becoming  increasingly  evident 
in  other  countries  and  political  contexts  as  well. 

Conclusion 

The  electronic  media,  in  their  diverse  forms  as 
radio;  television;  terrestrial  and  satellite  broadcast¬ 
ing  networks,  both  analogue  and  digital;  fiction; 
entertainment;  journalism;  online  services;  and  sys¬ 
tems  of  public  and  commercial  media,  are  crucial 
determinants  of  modern  and  postmodern  cultures 


and  societies,  as  well  as  important  actors  in  political 
contexts  and  systems.  The  media-politics  interac¬ 
tion  characterizes  political  life,  shapes  the  public 
debate,  and  conditions  the  actions  of  governments, 
leaders,  parties,  and  movements.  It  is  today  impos¬ 
sible  to  imagine  and  study  politics  without  consid¬ 
ering  the  influence  of  the  media  and  their  contents 
on  national  political  cultures  and,  in  an  increas¬ 
ingly  globalized  world,  also  on  the  policies  and  the 
communication  strategies  of  the  great  international 
political  actors.  Since  television’s  advent  on  the 
domestic  and  international  political  stage,  it  has 
often  been  (a)  the  co-protagonist,  together  with 
political  actors  of  great  historical  events,  from  the 
Kennedy-Nixon  debates  to  the  fall  of  the  commu¬ 
nist  regimes  in  Eastern  Europe;  (b)  the  sometimes 
unwilling  witness  to  dreadful  events,  from  natural 
catastrophes  to  the  terrorist  attacks  on  September 
11,  2001,  on  the  World  Trade  Center  and  the 
Pentagon;  and  (c)  itself  an  agent  of  great  and  small 
changes,  simply  by  being  present  in  places  and  situ¬ 
ations  where  it  has  conditioned  the  behavior  of 
those  using  it  as  actors  or  spectators. 

The  current  development  of  television,  and  the 
great  popularity  of  the  new  interactive  media  with 
their  constantly  new  technologies  and  modes  of 
consumption,  suggests  that  the  electronic  media 
will  continue  to  be  decisive  actors  in  politics.  In 
particular,  the  diffusive  many-to-many  nature  of 
social  networks  resists  attempts  to  control  them  by 
governments,  especially  those  of  an  authoritarian 
and  repressive  type,  and  they  can  be  viewed  as 
vehicles  for  the  democratic  empowerment  of  citi¬ 
zens  unimaginable  in  the  age  of  one-to-many  mass 
communication. 
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The  print  media  have  long  been  key  players  in 
politics.  Newspapers,  periodicals,  and  individu¬ 
ally  produced  publications  engage  the  public  and 
elites  by  providing  information  and  analyses.  In 
the  process,  they  influence  the  decisions  and  per¬ 
ceptions  of  the  entire  range  of  political  actors.  The 
print  media  and  their  journalists  and  editors  are 
engaged  in  all  aspects  of  the  political  process,  and 
the  political  institutions  and  players  are  occupied, 


in  different  ways,  in  determining  what  appears  in 
the  media.  The  print  media  are  also  an  economic 
force  in  societies,  although  they  account  for  only 
a  small  percentage  of  media  usage  even  in  highly 
literate  and  wealthy  countries.  In  this  entry,  their 
more  general  political  impact  and  their  role  in  dif¬ 
ferent  types  of  political  systems  are  discussed. 

The  role  of  the  print  media  in  politics  is  deter¬ 
mined  by  the  political  system  in  which  they  work, 
the  initial  ways  in  which  the  press  worked  in  that 
system,  the  available  technology,  and  the  demo¬ 
graphic  and  economic  character  of  the  state’s  popu¬ 
lation.  That  role  has  changed  as  new  broadcasting 
technologies  have  developed.  But  the  print  media 
remain  paramount  in  local  news  as  well  as  in  sig¬ 
nificant  national  discussions.  They  are  also  more 
likely  than  broadcast  media  to  be  focused  on  specific 
groups  rather  than  aimed  at  a  general  audience. 
Within  the  print  media,  there  are  clear  differences 
between  media  focused  on  drawing  a  mass  reader- 
ship  and  media  aimed  at  the  educated  and  the  afflu¬ 
ent.  It  is  in  the  print  media  that  the  focus  on  the 
latter  is  most  feasible  and  common,  while  the  mass 
audience  has  shifted  predominantly  to  television  as 
its  source  of  information  and  entertainment. 

The  print  media,  as  they  are  traditionally  known, 
normally  consist  of  newspapers  and  periodicals, 
although  other  print  media  such  as  books  and  flyers 
also  have  played  a  role  in  politics.  The  traditional 
print  media  are  produced  by  a  staff  of  journalists 
and  editors,  who  usually  develop,  edit,  and  place 
the  articles  on  various  issues  in  the  publication  and 
put  it  together.  In  addition,  to  reach  a  large  audi¬ 
ence,  printing  equipment  and  distribution  channels 
are  needed.  To  provide  ongoing  national  and  inter¬ 
national  information,  most  print  media  rely  not 
only  on  their  own  staff  but  also  on  outside  con¬ 
tributors  and  wire  services,  for  instance,  the  British 
Broadcasting  Corporation  (BBC)  or  Agence  France 
Press,  which  are  public,  and  private  agencies  such 
as  Associated  Press.  The  funding  this  requires  ties 
the  media  to  a  sponsoring  organization,  to  the  state, 
or  to  private  owners,  to  whom  the  print  media  are 
to  give  their  profits.  The  simplicity  of  the  produc¬ 
tion  of  the  most  basic  print  media  also  has  allowed 
its  use  by  opponents  of  a  regime  more  readily  than 
radio  or  television.  Finally,  in  areas  where  the  lit¬ 
eracy  level  is  low,  print  media  are  used  to  present 
images  to  the  population  and  also  as  ways  to 
increase  literacy  and,  in  turn,  activate  citizens. 


1542  Media,  Print 


The  development  of  technology  has  been  signifi¬ 
cant  in  the  transformation  of  the  print  media  in  the 
political  and  social  arena.  Initially,  getting  informa¬ 
tion  and  then  producing  and  distributing  print 
media  were  very  slow  processes.  As  a  result,  they 
were  used  only  for  commercial  purposes,  to  list  the 
dates  ships  were  due  and  their  cargo,  or  to  present 
ideas.  After  the  telegraph  was  invented,  the  speed 
with  which  information  could  be  transferred  began 
to  increase  exponentially.  Developments  in  photog¬ 
raphy  and  in  printing  technology  also  brought 
events  and  images  to  audiences  increasingly  rapidly 
and  with  greater  emotional  impact.  Consequently, 
the  role  of  the  print  media  shifted  from  announcing 
what  was  planned  to  reporting  on  events  that  had 
happened  and  engaging  people’s  interest. 

Beginning  after  the  1920s  (the  “golden  age”  of 
the  press  in  the  United  States),  the  development 
and  increasing  range  of  the  broadcast  media  (radio 
and  then  television)  resulted,  over  time,  in  a  shift 
in  the  role  of  the  print  media  away  from  being  the 
prime  source  of  information  on  events  outside  an 
individual’s  personal  experiences.  The  pattern 
came  to  be  that  television  news  introduced  and 
highlighted  events  and  issues  and  the  daily  press 
and  periodicals  provided  in-depth  information. 
The  print  media  became  an  agenda  setter  and  a 
source  of  analysis.  This  function  has  been  fur¬ 
thered  by  the  ever-increasing  speed  and  immediacy 
with  which  news  can  be  reported  on  the  broadcast 
media  and  by  the  development  of  the  Internet  and 
other  technologies.  In  response,  the  print  media 
first  moved  to  provide  both  paper  copies  and 
Internet  versions  and  then,  to  maintain  readership 
levels  that  would  bring  in  advertising  revenues, 
shifted  to  continual  updating  of  reports  and  analy¬ 
ses.  Gradually,  though,  the  print  media,  even  in 
their  online  editions,  have  lost  audiences  and  rev¬ 
enue  as  individuals  and  groups  have  been  able  to 
produce  the  news  through  blogs  and  special  sites. 
This  means  that  more  information  and  opinion  are 
not  being  filtered  through  professionals  who  are 
committed  and  have  the  resources  to  investigate 
and  present  a  balanced  analysis.  Without  the  print 
media’s  editorial  leadership,  issues  are  increasingly 
presented  on  the  Internet  by  unnamed  or  biased 
sources  without  market  or  professional  controls 
on  the  accuracy  of  what  is  said  or  how  it  is  pre¬ 
sented.  Unlike  the  print  media,  these  broadcast  and 
Internet  sources  reach  across  national  boundaries 


and  can  circulate  far  more  rapidly  while  avoiding 
government  control. 

Theories  of  the  Relationship  Between 
the  Print  Media  and  Government 

The  initial  set  of  models  of  the  relationship 
between  the  print  media  and  government  focused 
on  the  initial  uses  of  the  press  by  those  in  power. 
The  models  in  Fred  Seaton  Siebert,  Theodore 
Peterson,  and  Wilbur  Schramm’s  seminal  1963 
work,  Four  Theories  of  the  Press ,  were  as  follows: 

1.  the  “authoritarian  model”  of  the  early 
European  monarchies,  in  which  the  rulers 
controlled  the  print  media  to  support  and 
advance  their  policies; 

2.  the  “libertarian  model”  of  the  American  press 
and  European  democracies,  in  which  the  print 
media  were  a  separate  institution  free  from  state 
influence  and  were  engaged  in  informing  and 
entertaining  the  public,  with  the  profit-making 
incentive  ensuring  that  the  print  media  worked 
to  provide  “the  truth”; 

3.  the  “social  responsibility  model”  of  the  United 
States  and  European  democracies  in  the  20th 
century,  which  includes  all  the  roles  of  the 
“libertarian  model”  and  in  which  the  media 
deliberately  serve  as  a  forum  for  open 
discussions;  and 

4.  the  “Soviet-totalitarian  model,”  in  which  the 
media  were  used  by  the  rulers  to  mobilize  and 
control  their  societies. 

In  the  authoritarian  model  and  the  Soviet- 
totalitarian  model,  the  print  media  are  closely 
controlled  and  directed.  In  the  libertarian  and 
social  responsibility  models,  the  print  media  are 
self-directing,  and  “control”  emerges  through 
competition.  Later  theories  of  government-media 
relations  have  focused  on  the  impact  of  the  politi¬ 
cal  processes  themselves,  how  the  print  media 
influence  politics,  and  the  nature  of  the  print 
media  that  emerges. 

Democratic  Media  Systems 

Democratic  media  systems  in  the  developed  world 
differ  in  the  nature  of  pluralism  in  the  print  media, 
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the  role  of  the  market  versus  the  state  in  funding 
and  directing  the  print  media,  and  the  nature  of 
the  journalism  profession  itself.  The  model  devel¬ 
oped  by  Daniel  C.  Hallin  and  Paolo  Mancini 
essentially  divides  the  print  media  into  three  differ¬ 
ent  categories: 

1.  In  the  Anglo-American  “liberal  model,” 
where  there  was  early  democratization  and  a  more 
individualized  system  of  representation,  there  is  a 
long-established  commercial  press  that  seeks  to 
appeal  to  the  broadest  audience  by  presenting 
neutral  information  oriented  to  the  public  interest. 
In  this  model,  market  mechanisms  and  commercial 
media  dominate.  Although  party  papers  (those  tied 
to  a  specific  social  or  political  group)  are  significant 
in  Britain,  most  publications  are  deliberately  less 
ideological  in  their  coverage,  often  providing 
commentary  across  the  political  spectrum.  In  most 
cases,  editorial  opinion  is  distinct  from  news 
coverage.  Journalism  is  highly  professionalized 
and  self-regulating. 

2.  In  Central  Europe  and  Scandinavia,  where 
there  was  also  early  democratization,  with  a 
historic  emphasis  on  consensus,  pluralism,  and 
social  welfare,  the  “democratic  corporatist  model” 
holds.  In  this  model,  the  print  media  have  high 
circulations.  Historically,  particularly  in  the 
national  press,  party  papers  were  strong;  but  there 
has  also  long  been  a  more  neutral  commercial 
press.  Journalists  belong  to  powerful  professional 
organizations.  The  state  provides  subsidies  and 
regulates  the  work  conditions  for  journalists.  But 
freedom  of  the  press  is  also  well  protected. 

3.  In  France,  Italy,  Spain,  Portugal,  and  Greece, 
the  “politically  pluralized  model”  holds.  These 
states  were  marked  by  later  democratization  and 
strong  roles  for  political  parties,  alternating  with 
periods  of  authoritarianism.  In  this  model,  the 
print  media  have  focused  on  the  elites,  not  on  mass 
circulation,  and  they  are  characterized  by  a  high 
degree  of  political  partisanship.  Government 
intervention  is  greater,  with  press  subsidies  and 
periods  of  government  censorship. 

Communist  Media  Systems 

Communist  media  systems  were  based  on  the 
Leninist  adage  that  the  press  should  be  a  collective 


organizer,  agitator,  and  propagandist.  Providing 
information  and  news  was  incidental  to  that  role. 
These  media  systems  were  orchestrated,  to  a 
greater  or  lesser  degree  depending  on  the  interests 
of  the  regime,  by  the  Communist  Party.  In  the 
Soviet  and  East  European  media  systems,  individ¬ 
ual  print  media  were  aimed  at  specific  audiences 
and  interests.  The  themes  to  be  covered  were  set 
out  in  advance  by  Communist  Party  authorities  in 
both  general  and  individual  plans.  They  also  con¬ 
trolled  the  resources,  circulation,  and  coverage 
options  of  each  publication.  Reporting  was  to 
reflect  positively  on  the  system  as  a  whole  and  also 
reveal  individual  problems.  Control  was  exercised 
by  the  official  censors’  review  of  what  was  to  be 
printed;  by  the  editors  based  on  the  directions  they 
were  given;  or,  after  publication,  by  the  Party’s 
press  departments.  Journalists  sometimes  used 
coded  language  to  give  more  information  or  pre¬ 
sent  a  veiled  critique.  Readers  also  interpreted 
what  was  said  in  the  print  media  by  “reading 
between  the  lines”  and  looking  for  telling  differ¬ 
ences  in  what  was  said,  as  well  as  comparing  what 
they  read  with  what  they  experienced  directly. 

In  times  of  weakness  and  division  among  the 
leadership,  the  print  media  have  been  the  bell¬ 
wether  in  these  countries.  Increasingly  critical 
reporting  and  analyses,  as  well  as  different  posi¬ 
tions  appearing  in  the  press,  signaled  the  possibility 
of  more  openness  and  triggered  further  public  dis¬ 
cussion.  When  the  print  media  grew  increasingly 
open  and  critical,  there  was  increased  dissension. 
By  the  end  of  the  1980s,  the  openness  of  the  media 
and,  in  Hungary,  Poland,  and  Czechoslovakia, 
public  action  contributed  to  the  dissension  prior  to 
the  collapse  of  the  communist  systems.  In  the  post- 
Stalin  period,  in  some  countries  including  the 
Soviet  Union,  Poland,  Hungary,  and  Czechoslo¬ 
vakia,  an  independent  samizdat  (self-published) 
print  media  developed.  While  such  publication  and 
distribution  of  print  materials  without  state 
approval  was  illegal  and  those  involved  were  sanc¬ 
tioned,  these  materials  provided  an  important  out¬ 
let  for  the  exchange  of  alternative  views,  informa¬ 
tion,  and  literature  that  could  not  be  published  in 
the  legal  media.  They  served  as  the  central  work  of 
those  opposed  to  the  system. 

The  Chinese  model  under  Mao  Zedong  was  dif¬ 
ferent  from  the  Soviet  model.  It  put  even  less 
emphasis  on  information  and  much  more  on  the 
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press  as  a  mobilizer.  The  traditional  print  media, 
in  this  period,  had  a  far  smaller  role  to  play. 
Instead,  banners,  flyers,  and  publications  of  slo¬ 
gans  and  Maoist  statements  were  considered 
important  in  stimulating  people  to  get  engaged  in 
various  campaigns  and  to  attack  the  evils  in  the 
society  as  perceived  by  the  regime. 

In  the  transition  from  communism,  the  print 
media  shifted  to  reporting  the  news  and  scandals 
not  publishable  earlier.  There  also  remained  a  ten¬ 
dency  on  the  part  of  journalists  to  try  to  direct 
people’s  thinking  on  political  issues.  Where  sepa¬ 
rate  parties  emerged,  they  established  their  own 
print  media  organs.  Where  they  did  not,  as  in 
China,  the  media  have  diversified  while  remaining 
within  the  permissible  boundaries.  The  journalism 
profession  was  also  affected  by  the  increasing 
financial  pressures  that  came  with  privatization. 

Print  Media  in  the  Developing  World 

The  print  media  play  different  roles  in  the  develop¬ 
ing  nations,  where  wealth  and  literacy  are  more 
limited.  They  tend  to  be  less  significant  for  the  vast 
majority  of  the  population  in  these  countries 
because  of  low  literacy  levels  and  the  inability  of 
people  with  marginal  incomes  to  buy  the  pub¬ 
lished  material.  They  also  have  tended  to  be 
weaker  in  their  reporting  and  coverage  simply 
because  of  the  high  cost  of  international  or  even 
national  reporting.  As  a  result,  their  political  sig¬ 
nificance  is  limited  largely  to  the  educated  classes 
and  to  government  programs  to  educate  the  larger 
population. 

The  Uses  of  the  Print  Media: 

Individuals,  Elites,  and  Scholars 

Research  on  the  specific  impact  of  the  print  media 
on  political  knowledge  and  behavior  has  been 
indicative  rather  than  definitive.  Those  who  use 
the  print  media  tend  to  be  more  educated  and 
affluent  than  television  viewers.  The  influence  of 
the  print  media  on  their  attitudes  and  knowledge 
tends  to  be  gradual  and  depends  on  their  cognitive 
skills.  In  addition,  research  shows  that  individuals 
tend  to  expose  themselves  to  media  and  subjects 
that  reflect  their  preexisting  beliefs.  They  interpret 
messages  in  a  way  that  makes  them  consistent  with 
those  beliefs.  And  they  are  most  likely  to  retain 


information  that  fits  with  that  belief  set,  often  for¬ 
getting  specific  facts  and  remembering  the  broad 
principles.  There  is  also  some  evidence  that  news¬ 
paper  readers  have  higher  levels  of  social  capital 
than  those  not  exposed  to  media  reports.  It  also 
appears  that  regular  exposure  to  negative  reports 
about  politicians  and  politics  tend  to  result  in  more 
negative  perceptions  of  politics  and  politicians 
than  individuals  would  have  otherwise. 

The  print  media  do  play  a  major  role  in  agenda 
setting  by  both  politicians  and  the  public.  In  evalu¬ 
ating  what  issues  are  politically  salient,  politicians 
often  use  highly  respected  print  media  not  only  to 
get  their  messages  out  but  also  to  identify  the 
people’s  concerns  and  make  policy  decisions  based 
on  them.  Research  also  shows  that  the  media’s 
presentation  of  issues  as  social  rather  than  per¬ 
sonal,  for  example,  unemployment,  causes  people 
to  see  their  personal  problems  as  larger  societal 
problems.  Finally,  the  context  in  which  print 
media  place  an  issue  affects  how  seriously  indi¬ 
viduals  consider  the  issue  and  how  it  is  incorpo¬ 
rated  into  their  broader  priorities  and  beliefs. 

Exposure  to  any  media  gives  individuals  a 
greater  sense  of  the  world  and  their  position  in  it. 
Although  the  print  media  were  less  easily  accessi¬ 
ble  until  they  began  to  appear  on  the  Internet,  for 
the  educated  populace,  the  print  media  provide  the 
most  information  and  analysis.  Exposure  to  inter¬ 
national  media,  particularly  broadcast  media  but 
also  print,  also  allows  people  who  do  not  have 
direct  contact  with  the  rest  of  the  world  to  see  how 
others  live  and  make  comparisons  with  their  own 
lives.  This  can  create  a  sense  of  relative  depriva¬ 
tion,  which  stimulates  popular  demand  for 
improvements  in  the  system. 

For  scholars  of  politics,  the  print  media  are 
often  the  most  accessible  resource  for  looking  at 
what  has  been  said  and  how.  Therefore,  they  are 
the  most  used  source  for  research  on  what  hap¬ 
pened  in  a  given  case  or  what  information  people 
had  access  to  at  the  time  of  a  given  political  inci¬ 
dent  or  on  a  given  political  topic. 

Jane  Curry 
Santa  Clara  University 
Santa  Clara,  California,  United  States 
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Mediation  in  International 
Relations 


The  international  environment  of  the  21st 
century,  much  more  so  than  its  predecessors,  is 
experiencing  conflict  and  violence  both  within 
states  and  between  states.  Such  conflict  is  costly, 
destructive,  and  highly  dangerous  for  individuals, 
communities,  and  regions  of  the  world.  In  addi¬ 
tion  to  conflicts  between  well-defined  actors,  other 
forms  of  violence  (such  as  terrorism)  and  serious 
threats  (such  as  environmental  degradation)  pose 
a  major  risk  to  the  very  viability  of  the  interna¬ 
tional  system.  Clearly,  these  conflicts  and  issues 
have  to  be  resolved.  Mediation  is  fast  becoming 
one  of  the  most  effective  ways  of  responding  to 
proliferating  conflicts  and  the  numerous  threats 
that  we  face  in  international  relations  today. 

The  Charter  of  the  United  Nations  (UN)  is  quite 
explicit  on  the  diplomatic  structures  that  can,  and 
should,  be  used  to  deal  with  these  problems. 
Article  2(3)  of  the  UN  Charter  proscribes  any  form 
of  violence  and  exhorts,  “All  member  states  shall 
settle  their  international  disputes  in  such  a  manner 
that  international  peace  and  security,  and  justice, 
are  not  endangered.”  The  available  methods  of 
peaceful  settlement  of  international  conflicts  are 
numerous  and  varied.  They  are  listed  in  Article  33 
of  the  UN  Charter,  which  requests  that 

the  parties  to  any  dispute,  the  continuance  of 

which  is  likely  to  endanger  the  maintenance  of 

international  peace  and  security,  shall,  first  of  all, 


seek  a  solution  by  negotiation,  inquiry,  mediation, 
conciliation,  arbitration,  judicial  settlement, 
resort  to  regional  agencies  or  arrangements,  or 
other  peaceful  means  of  their  choice. 

The  UN  Charter  highlights  three  basic  diplo¬ 
matic  structures  to  deal  peacefully  with  interna¬ 
tional  and  other  conflicts:  (1)  direct  negotiation 
between  parties  to  a  conflict,  (2)  various  forms  of 
mediation,  and  (3)  binding  methods  of  third-party 
intervention  (e.g.,  arbitration  and  adjudication). 
Each  of  these  methods  has  its  own  characteristics, 
strengths,  and  disadvantages,  and  each  has  an 
important  role  in  the  practice  of  modern  diplo¬ 
macy.  Whatever  differences  there  may  be  between 
these  methods,  they  all  exemplify  the  peaceful 
nature  of  diplomacy  and  the  desire  to  avoid  the  use 
of  force.  This  entry  discusses  mediation,  which  is 
growing  in  popularity  as  a  method  for  resolving 
such  issues.  Indeed,  there  is  considerable  empirical 
evidence  to  suggest  that  mediation  has  been  the 
most  popular  way  of  dealing  with  international 
conflicts  since  1945.  The  following  sections 
describe  its  unique  features,  explain  what  media¬ 
tors  actually  do,  and  posit  some  conditions  that 
may  help  or  hinder  the  process  of  international 
mediation. 

Mediation:  Definition  and  Features 

For  many  years,  the  study  of  mediation  has  suf¬ 
fered  from  conceptual  imprecision  and  a  startling 
lack  of  information.  Practitioners  of  mediation, 
formal  or  informal,  in  the  domestic  or  interna¬ 
tional  arena  were  keen  to  sustain  its  image  as  a 
mysterious  practice,  akin  to  some  art  form,  taking 
place  behind  closed  doors.  Scholars  of  mediation, 
on  the  other  hand,  did  not  think  their  field  of  study 
was  susceptible  to  a  systematic  analysis.  In  short, 
neither  group  believed  that  it  could  discern  any 
pattern  of  behavior  in  mediation’s  various  forms 
or  that  any  generalizations  could  be  made  about 
the  practice  in  general. 

The  prevalent  agnosticism  toward  analysis  and 
the  desire  to  maintain  the  mystery  and  uniqueness 
of  mediation  acted  like  something  of  a  ghost  that 
haunted  many  scholars  and  practitioners  for  too 
long.  There  may  be  little  consensus  on  how  best  to 
study  or  practice  mediation.  Mercifully,  there  is 
now  a  very  broad  agreement  that  this  particular 
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ghost  should  be  exorcised.  Mediation  can,  should, 
and  must  be  studied  properly,  and  the  lessons 
derived  from  such  a  study  should  serve  as  sign¬ 
posts  in  the  quest  for  a  better  understanding  of  the 
process  and  more  effective  conflict  management. 

Etymologically,  mediation  comes  from  the  Latin 
root  for  to  halve ,  but  different  definitions  of  media¬ 
tion  purport  to  (a)  capture  the  gist  of  what  media¬ 
tors  do  or  hope  to  achieve,  (b)  distinguish  between 
mediation  and  related  processes  of  third-party 
intervention  (i.e.,  arbitration),  and  (c)  describe 
mediators’  attributes.  Some  see  mediation  as 
purposeful  action  designed  to  remove  any  misun¬ 
derstanding,  others  may  see  it  as  an  extension  of 
negotiation,  and  yet  other  scholars  may  prefer  to 
emphasize  the  neutral  and  impartial  nature  of  all 
mediators. 

The  many  approaches  to  definition  are  simply 
indicative  of  the  enormous  scope  of  mediation. 
Mediation  may  take  place  in  conflicts  between 
states,  within  states,  between  groups  of  states  or 
organizations,  and  between  individuals.  Mediators 
enter  a  conflict  to  help  those  involved  achieve  a 
better  outcome  than  they  would  be  able  to  achieve 
by  themselves.  Once  involved  in  a  conflict,  media¬ 
tors  may  use  a  wide  variety  of  behaviors  to  achieve 
this  objective.  Some  mediators  make  suggestions 
for  a  settlement,  others  refrain  from  doing  so. 
Some  mediators  are  interested  in  achieving  a  com¬ 
promise,  others  are  decidedly  not.  This  is  why  a 
comprehensive  definition  of  mediation  is  a  prereq¬ 
uisite  for  understanding  this  complex  reality.  The 
following  broad  definition  provides  suitable  crite¬ 
ria  for  inclusion  (and  exclusion)  and  serves  as  a 
basis  for  identifying  differences  and  similarities. 
Mediation  is  here  defined  as  a  process  of  conflict 
management,  related  to  but  distinct  from  the  par¬ 
ties’  own  negotiations,  where  those  in  conflict  seek 
the  assistance  of,  or  accept  an  offer  of  help  from, 
an  outsider  (whether  an  individual,  an  organiza¬ 
tion,  a  group,  or  a  state)  to  change  their  percep¬ 
tions  or  behavior  and  to  do  so  without  resorting  to 
physical  force  or  invoking  the  authority  of  law. 

This  may  be  a  broad  definition,  but  it  is  one  that 
can  be  generally  and  widely  applied.  It  forces  us  to 
recognize,  as  surely  we  must,  that  any  mediation 
situation  comprises  (a)  parties  in  conflict,  (b)  a 
mediator,  (c)  a  process  of  mediation,  and  (d)  the 
context  of  mediation.  All  these  elements  are  impor¬ 
tant  in  mediation.  Together  they  determine  its 


nature,  quality,  and  effectiveness,  as  well  as  why 
some  mediation  efforts  succeed  while  others  fail. 

What,  then,  are  the  main  features  or  character¬ 
istics  of  mediation  across  levels?  A  number  of  these 
are  listed  as  follows: 

•  Mediation  is  an  extension  and  continuation  of 
peaceful  conflict  management. 

•  Mediation  involves  the  intervention  of  an 
outsider — an  individual,  a  group,  or  an 
organization — into  a  conflict  between  two  or 
more  states  or  other  actors. 

•  Mediation  is  a  noncoercive,  nonviolent,  and, 
ultimately,  nonbinding  form  of  intervention. 

•  Mediators  enter  a  conflict,  whether  internal  or 
international,  to  affect  it,  change  it,  resolve  it, 
modify  it,  or  influence  it  in  some  way. 

•  Mediators  bring  with  them,  consciously  or 
otherwise,  ideas,  knowledge,  resources,  and 
interests  of  their  own  or  of  the  group  or 
organization  they  represent.  Mediators  often 
have  their  own  assumptions  and  agendas  about 
the  conflict  in  question. 

•  Mediation  is  a  voluntary  form  of  conflict 
management.  The  actors  involved  retain  control 
over  the  outcome  (if  not  always  over  the  process) 
of  their  conflict,  as  well  as  the  freedom  to  accept 
or  reject  mediation  or  mediators’  proposals. 

•  Mediation  operates  on  an  ad  hoc  basis  only. 

•  Mediation  in  international  relations  is 
particularly  appropriate  when  a  conflict  is  long- 
drawn-out  and  complex,  when  the  parties’  own 
efforts  have  failed  or  reached  an  impasse,  and 
when  both  parties  are  prepared  to  cooperate  to 
avoid  further  loss  of  life. 

Mediation  Behavior 

What  is  it  that  mediators  do  to  achieve  a  cessation 
of  violence  and  a  political  agreement?  Considerable 
attention  has  been  given  to  this  subject,  and  it 
seems  apt  to  suggest  that  in  general  mediators  enter 
a  conflict  (whether  they  are  invited  or  not)  to 
change  its  structure,  dynamics,  and  termination. 
They  do  so  by  first  trying  to  understand  the  conflict 
(i.e.,  understand  the  issues  involved,  the  history 
and  causes  of  the  conflict,  and  the  parties  involved 
in  the  conflict  and  their  relationship);  then  arrang¬ 
ing  for  some  communication  between  the  parties 
(directly  or  indirectly),  establishing  some  protocol, 
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and  delineating  an  agenda  to  discuss;  and  finally 
helping  the  parties  save  face  by  rewarding  the  con¬ 
cessions  made  by  each  party  and  recommending 
possible  solutions.  These  types  of  behavior  corre¬ 
spond  roughly  to  three  broad  strategies  that  the 
literature  suggests  mediators  may  engage  in — that 
is,  facilitation-communication  behavior,  formula- 
tive  behavior,  and  directive  behavior. 

As  to  the  question  of  how  involved  a  mediator 
can  or  should  be,  we  can  say  that  mediator  behav¬ 
ior  may  range  along  a  spectrum  of  increasing  levels 
of  intervention.  In  some  cases,  a  mediator  may 
engage  only  in  facilitative  behavior,  while  in  oth¬ 
ers,  a  more  radical  intervention  may  be  required  if 
the  parties  are  to  be  nudged  from  their  inflexible 
positions.  Thus,  we  may  speak  of  a  facilitative 
mediator,  a  formulative  mediator,  and  a  directive 
(or  better  still,  a  power)  mediator,  each  of  whom 
creates  different  circumstances  and  produces  dif¬ 
ferent  outcomes. 

The  strategy  mediators  adopt,  and  the  way  they 
behave  in  each  international  conflict,  is  rarely  ran¬ 
dom.  Mediators  do  not  just  happen  to  choose  a 
strategy.  Mediator  behavior  is  influenced  by  fac¬ 
tors  such  as  conflict  intensity  and  fatalities  (e.g., 
high-intensity  conflicts  may  call  for  more  active 
strategies),  time  pressure  (when  time  is  pressing, 
there  is  a  higher  need  for  more  directive  strategies), 
mediator  rank  (high-ranking  mediators,  such  as 
heads  of  state,  can  use  more  resources  and  strate¬ 
gies),  and  previous  relations  between  the  parties 
(where  parties  have  a  history  of  friendly  relations, 
facilitative  strategies  may  be  sufficient).  The  actual 
process  of  international  mediation  is  one  of  recip¬ 
rocal  relationship  between  a  mediator,  the  conflict 
itself,  and  the  parties  involved.  At  times,  mediators 
can  be  quite  active  and  use  significant  resources,  at 
other  times,  all  they  can  do  is  act  as  the  go-between. 
Effective  mediators  know  how  best  to  identify 
actors  in  conflict,  the  nature  of  a  conflict,  which 
strategy  to  use,  what  resources  to  marshal,  and  just 
how  much  control  to  exercise  over  the  parties. 

Whichever  strategy  mediators  use,  their  under¬ 
lying  objectives  in  any  conflict  are  to  change  the 
following: 

a.  the  physical  environment  of  conflict  management 
(e.g.,  by  maintaining  secrecy  or  imposing  time 
limits,  as  U.S.  President  Carter  did  at  Camp 
David  during  Israeli-Palestinian  negotiations); 


b.  the  perception  of  what  is  at  stake  (e.g.,  by 
structuring  an  agenda  and/or  identifying  and 
packaging  new  issues);  and 

c.  the  parties’  motivation  to  reach  a  peaceful 
outcome  by,  for  example,  using  subtle  pressure. 

Any  international  conflict  presents  opportuni¬ 
ties  for  some  form  of  mediation.  To  be  effective, 
however,  mediation  strategies  must  reflect  the  real¬ 
ity  of  the  conflict  and  the  resources  of  the  media¬ 
tor.  To  that  extent,  international  mediation  is  truly 
a  contingent  and  reciprocal  political  activity.  It 
depends  on,  and  is  shaped  by,  the  parties,  their 
conflict,  and  the  conflict  environment. 

Success  and  Failure  in 
International  Mediation 

When  is  international  mediation  most  likely  to  be 
successful?  This  is  a  question  that  practitioners 
and  scholars  have  been  asking  for  many  years. 
Some  guidelines  may  now  be  offered. 

To  be  successful,  international  mediation  must 
take  place  at  the  right,  or  the  ripe,  moment.  By  this 
we  mean  that  a  mediator  must  ensure  that  parties 
to  a  conflict  are  genuinely  ready  to  tackle  their 
conflict.  This  usually  occurs  when  the  parties  find 
themselves  in  a  situation  that  they  both  dislike  and 
yet  cannot  escape  from  or  when  they  recognize 
some  opportunities  for  a  mutually  beneficial  settle¬ 
ment.  An  experienced  mediator  can  convince  the 
parties  that  continuing  to  engage  in  violence  will 
be  futile  and  that  the  moment  is  ripe  for  resolution. 
The  timing  of  mediation  and  a  perception  among 
the  parties  that  opportunities  may  be  lost  if  they 
fail  to  recognize  their  mutually  hurting  behavior  is 
a  crucial  factor  affecting  mediation  success  in 
international  relations. 

Closely  related  to  this  is  a  strong  and  genuine 
motivation  by  each  of  the  parties  to  engage  in  seri¬ 
ous  mediation.  If  the  belligerents  approach  media¬ 
tion  merely  in  an  attempt  to  buy  time  or  forestall 
a  possible  defeat,  then  any  mediation  effort, 
whether  undertaken  by  the  UN  or  a  state,  will  be 
doomed  to  failure.  Mediation  works  best  when 
both  parties  are  willing,  interested,  and  genuinely 
committed  to  the  process.  Here  again,  a  mediator 
can  create  the  conditions,  in  the  early  phases  of  a 
mediation,  where  both  parties  are  optimistic  about 
the  process  and  truly  committed  to  it. 
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Mediation  also  appears  to  work  best  in  interna¬ 
tional  relations  where  there  are  no  major  power 
disparities  between  the  parties,  when  both  states 
are  small  or  medium  powers  (rather  than  super¬ 
powers),  and  when  both  states  hold  similar  politi¬ 
cal  values.  Shared  norms,  political  similarity,  and 
relative  equality  in  military  and  economic  power 
enhance  the  likelihood  of  successful  mediation. 

In  addition  to  these  factors,  we  also  note  that 
certain  kinds  of  mediators  and  mediator  behavior 
are  more  likely  to  succeed.  Although  much  is  made 
of  the  potential  of  regional  and  international  organi¬ 
zations,  the  reality  is  that  the  more  successful  media¬ 
tors  tend  to  be  the  larger  states  with  more  leverage 
and  resources.  Mediation  is  an  attempt  to  change 
the  parties’  perception  and/or  behavior;  hence,  it 
stands  to  reason  that  more  powerful  countries,  with 
more  resources,  prestige,  and  leverage  possibilities, 
are  more  likely  to  succeed  in  this  attempt.  And,  of 
course,  related  to  that,  we  can  see  that  mediators 
who  can  influence  the  parties  through  a  wide  range 
of  interventions  from  the  least  directive  to  the  most 
directive  are  the  ones  most  likely  to  succeed. 

Other  factors  may  have  an  impact  on  the  success 
or  failure  of  international  mediation  (e.g.,  relation¬ 
ship  between  the  parties,  nature  of  conflict  between 
them)  and  must  be  taken  into  account.  What  is 
important  about  international  mediation  is  that  it  is 
part  of  the  political  process.  It  is  a  voluntary  pro¬ 
cess,  yet  one  in  which  states  may  use  mediation  as 
an  extension  of  their  foreign  policy,  or  they  may 
use  it  out  of  a  genuinely  altruistic  desire  to  resolve 
a  conflict.  Whichever  their  motives,  mediation  is  a 
powerful,  important,  and  often  successful  tool  in 
resolving  international  conflicts. 

Jacob  Bercovitch 
University  of  Canterbury 
Christchurch,  New  Zealand 
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Mercantilism 


The  rise  of  early-modern  states  in  the  form  of 
strong  monarchies  provided  the  historical  context 
for  economic  writers  who  have  been  named 
mercantilists — members  of  a  particular  “mercantil¬ 
ism  school”  of  political  economy.  Among  them  are 
writers  between  the  late  16th  and  mid-18th  centu¬ 
ries  who  published  pamphlets  and  tracts  on 
economic  issues,  especially  regarding  international 
trade,  money,  finance,  and  beneficial  governance. 
They  were  state  bureaucrats,  merchants,  and 
politicians  from  all  over  Europe,  from  Spain  to 
northern  Scandinavia,  during  the  so-called  early- 
modern  period.  They  generally  believed  that  eco¬ 
nomic  wealth  and  income  could  be  increased 
through  the  capture  of  foreign  trade  and  com¬ 
merce.  Perhaps  not  all  of  them  agreed  that  the 
creation  of  wealth  was  a  zero-sum  game,  but  many 
of  them  came  quite  close  to  believing  so. 

The  concept  of  systeme  mercantile  was  first 
used  in  print  by  the  Marquis  de  Mirabeau  in  1763. 
However,  it  was  Adam  Smith,  who  in  his  Wealth 
of  Nations  in  1776  called  it  the  “mercantile  sys¬ 
tem”  and  made  it  famous.  Yet,  to  what  extent 
mercantilism  is  really  a  “system”  or  “school”  of 
economic  thinking  has  been  a  hotly  debated  issue 
ever  since.  The  bulk  of  what  is  commonly  known 
as  “mercantilist  literature”  appeared  in  Great 
Britain  from  the  1620s  up  until  the  middle  of  the 
18th  century.  Perhaps  the  most  well-known  English 
mercantilists  were  Thomas  Mun  (1571-1641),  writ¬ 
ing  in  the  1620s,  and  James  Steuart’s  (1713-1780) 
whose  Principles  of  Political  Oeconomy  (1767)  is 
regarded  as  perhaps  the  last  major  mercantilist 
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work.  Other  English  economic  writers  regarded  as 
belonging  to  the  mercantilist  school  are  Josiah 
Child  (1630-1699),  Charles  Davenant  (1656- 
1714),  and  Nicholas  Barbon  (1640-1698). 
According  to  the  main  inventor  of  the  “mercantile 
system,”  Adam  Smith,  the  core  of  the  “commer¬ 
cial”  (mercantile)  system  consisted  of  the  popular 
folly  of  confusing  wealth  with  money.  Hence,  their 
suggestion  that  a  country  must  export  more  than  it 
imports  leading  to  a  net  inflow  of  bullion,  the 
“positive  balance  of  trade”  theory. 

According  to  Smith,  mercantilist  theory  and 
practice  served  as  a  cloak  for  a  special  interest  that 
used  the  idea  of  a  positive  balance  of  trade  to 
propagate  a  protective  trade  policy  in  general, 
including  duties  on  imports,  tariffs,  and  bounties. 
The  mercantile  system  implied  a  giant  conspiracy 
on  the  part  of  master  manufacturers  and  mer¬ 
chants  to  skin  the  public  and  the  consumers.  From 
Smith  onward,  the  view  of  the  mercantile  system 
as  state  dirigisme  (particularly  export  and  import 
protection)  to  support  a  special  interest  with  the 
aid  of  the  ideology  of  the  positive  balance  of  trade 
developed  into  its  present  status  as  the  canonical 
interpretation  of  17th-  and  18th-century  economic 
thinking  and  writing.  David  Hume  had  made  his 
specie-flow  theory  public  in  1750,  arguing  that  the 
favorable-balance  theory  was  an  intellectual  error 
(a  net  inflow  of  bullion  must  certainly  mean  a  rela¬ 
tive  rise  in  prices,  which,  through  the  export  and 
import  mechanism,  will  tend  to  correct  itself). 
Smith  then  drew  the  conclusion  that  the  argument 
for  protection  and  against  free  trade  in  general  was 
based  on  a  mere  intellectual  mistake. 

Undoubtedly,  the  fixation  of  a  mercantile 
system — or  mercantilism — became  an  important 
part  of  the  story  that  liberal  political  economy  dur¬ 
ing  the  19th  century  sought  to  explicate.  Up  until 
1776 — according  to  its  stylized  version — the  think¬ 
ing  as  well  as  the  practical  policy  making  of  the 
17th  and  18th  centuries  was  dominated  by  dirigiste 
and  protectionist  ideas.  After  this  intervention  by 
Smith,  a  new,  more  scientific  version  of  economics 
developed  that  emphasized  the  role  of  the  “invisi¬ 
ble  hand”  and  free  trade  as  a  motor  of  economic 
growth  and  prosperity.  The  overturn  of  the  mer¬ 
cantilist  school  became  part  of  a  wider  “Whiggish” 
interpretation  of  history,  which  used  Great  Britain 
as  an  ideal  type  for  depicting  the  development  of 
free  trade  and  minimal  government  as  a  natural 


force  in  history.  Without  doubt,  such  a  simplistic 
version  seldom  receives  enthusiastic  applause 
today.  On  the  contrary,  it  is  most  often  argued 
nowadays  that  the  step  from  “mercantilists”  such 
as  Steuart  to  the  “liberal”  Smith  was  not  as  big  as 
it  was  once  described.  Hence,  many  scholars — for 
example,  the  English  writers  before  1776 — were 
ready  to  admit  the  advantages  of  market  liberaliza¬ 
tion,  free  trade,  and  even  the  existence  of  an  invis¬ 
ible  hand.  Indeed,  very  few  of  them  made  the 
mistake  of  identifying  wealth  with  gold  and  silver. 
Likewise,  none  of  the  members  of  the  so-called 
classical  political  school  in  the  beginning  of  the 
19th  century  was  a  free  trader  or  a  libertarian  in 
any  modern  sense  of  the  world.  Many  of  them 
admitted  to  the  positive  role  of  state  intervention, 
and  some,  even,  were  reluctant  to  propagate  for 
free  trade  as  long  as  not  all  countries  were  ready  to 
abolish  their  tariffs  and  duties. 

Perhaps  the  most  important  concern  with  the 
Whig  interpretation,  however,  is  that  it  needs  to  be 
seen  in  its  historical  context.  The  world  that  eco¬ 
nomic  writers  during  the  17th  and  18th  century 
contemplated  was  characterized  by  an  often  vio¬ 
lent  and  competitive  struggle  for  power  and  influ¬ 
ence.  From  the  late-medieval  period,  it  had  been 
well  understood  that  the  economic  strength  of  a 
state  required  a  powerful  political  and  military 
position.  One  method  to  increase  the  wealth  of  a 
ruler  was  to  conquer  more  land  plowed  by  peas¬ 
ants  who  could  pay  taxes  and  land  rents  to  the 
Crown.  Exceedingly,  however,  it  was  perceived 
that  more  income  could  be  gained  by  taxing  profit¬ 
able  trading  operations,  to  introduce  indirect  taxes 
and  accises  (excise  duties).  Against  this  back¬ 
ground,  competition  over  trade  and  trade  routes 
became  an  ever  more  important  princely  occupa¬ 
tion.  It  was  generally  believed  that  a  country  that 
could  capture  important  trade  routes  as  well  as 
establish  colonies,  or  “plantations,”  would  have  an 
advantage  in  times  of  military  conflict  and  political 
power  struggles.  Moreover,  during  the  17th  cen¬ 
tury,  a  view  increasingly  emerged  among  rulers 
that  it  was  most  favorable  to  establish  domestic 
industry  in  order  to  use  up  raw  materials  instead  of 
exporting  them.  By  encouraging  manufactories, 
more  hands  could  be  employed  while  at  the  same 
time  industrious  manufacturers  and  merchants 
could  earn  more  profits,  which  would  lead  to  even 
more  production  and  employment — resulting  in 
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more  taxes,  of  course.  Hence,  wealth  creation 
could  be  cumulative  and  not  only  a  consequence  of 
zero-sum  activities.  However,  without  a  strong 
state  with  an  ability  to  keep  up  a  trade  monopoly, 
such  gainful  spirals  of  income  creation  were  always 
threatened  and  insecure. 

Certainly,  the  political  economy  of  the  relation¬ 
ship  between  economic  means  and  power  politics 
goes  back  at  least  to  the  Florentine  political  think¬ 
ers  of  the  Renaissance  period,  including  Niccolo 
Machiavelli.  As  is  well-known,  Florence’s  republi¬ 
canism  was  a  broad  tradition  that  recognized  the 
role  of  patriotism  and  other  manly  values  in  a 
well-governed  and  virtuous  state.  In  England,  dur¬ 
ing  the  17th  and  early  18th  century,  such  patriotic 
thinking  focused  strongly  first  on  the  Dutch  repub¬ 
lic  and  later,  after  1660,  on  France.  Holland,  espe¬ 
cially,  was  a  showcase  for  those  who  wanted 
to  emphasize  the  wealth-producing  effects  of  cap¬ 
turing  international  commerce  and  establishing 
manufactories.  It  was  on  that  basis  that  this  tiny 
country  was  able  to  house  such  a  plentiful  popula¬ 
tion  that  was  a  cornerstone  of  political  power  and 
military  strength.  In  the  early  years  of  the  17th 
century,  the  Dutch  had  snatched  the  Baltic  fishing 
industry  from  under  the  nose  of  the  English,  as  the 
mercantilist  writer  Thomas  Mun  wrote  in  the 
1620s.  It  was  time  to  take  it  back  as  well  as  to 
capture  more  of  the  international  trade  in  grain 
and  other  products  that  flowed  from  East  to  West 
and  from  the  Baltic  to  the  North  Sea. 

Clearly,  the  position  of  a  specific  state  in  the 
international  competitive  struggle  for  power  and 
influence  is  a  key  to  understanding  the  writings  of 
the  mercantilists  in  their  respective  countries. 
Antonio  Serra  in  Naples  would  ponder  over  how  a 
small  nation  without  domestic  resources  in  silver 
and  gold  could  be  able  to  survive  and  even  gain 
from  this  fact.  In  1613,  he  suggested  that  Naples 
should  export  to  cover  both  the  importation  of 
necessities  and  luxuries  as  well  as  the  importation 
of  money  (silver).  In  turn,  this  meant  that  Naples 
had  to  develop  a  “favorable  balance  of  trade.” 
Pamphleteers  in  states  that  felt  they  were  in  the 
same  position,  such  as  England  in  the  1620s,  lent 
a  ear  to  Serra  and  themselves  became  proponents 
of  a  favorable  balance  of  trade. 

In  Spain,  on  the  other  hand,  a  shortage  of  silver 
and  gold  was  not  a  major  disturbance  during  the 
16th  and  17th  centuries.  Such  wealth  in  awesome 


proportions  was  amply  provided  by  the  Spanish 
imperial  forces  and  shipped  over  from  the  Americas 
protected  by  a  potent  navy.  Already,  by  the  end  of 
the  16th  century,  it  was  well  known  that  this  bul¬ 
lion  had  not  brought  riches  to  Spain  but,  rather, 
poverty.  Already,  by  1556,  Martin  de  Azpilcueta 
had  formulated  the  famous  so-called  quantity  the¬ 
ory  of  money  (that  a  great  influx  of  silver  and  gold 
implies  that  the  value  of  money  will  fall).  In  fact, 
the  so-called  price  revolution  and  its  dire  effects 
was  a  well-known  phenomenon  among  contempo¬ 
rary  Europeans.  When  the  price  level  increased  in 
Spain,  domestic  industries  as  well  as  agriculture 
suffered  from  cheap  foreign  competition.  In  the 
1580s  in  Spain,  Luis  Ortiz,  controller  of  the  public 
finance,  argued  for  a  strict  ban  to  prevent  Spaniards 
from  exporting  their  money  and  buying  foreign 
goods — a  policy  that  had  been  used  by  many  states 
since  the  medieval  period  and  that  in  England  was 
called  the  Statute  of  Employment. 

In  France,  a  policy  of  economic  nationalism  was 
developed  from  the  beginning  of  the  17th  century 
onward.  Around  1600,  the  valet  du  chambre  to 
King  Henry  IV,  Berthelemy  Laffemas,  published 
tracts  in  which  he  advocated  the  establishment  of 
manufacturing  in  France  to  avoid  “unnecessary” 
imports.  In  a  highly  aggressive  tone  some  decades 
later,  the  message  that  foreign  imports  should  be 
banned  and  the  ugly  face  of  foreigners  forever 
deported  from  la  belle  France  was  reinforced  by 
the  writer  Antoine  de  Montchretien  (who  was  the 
first  to  use  the  concept  “political  economy”).  His 
argument  for  strong  protectionist  measures  taken 
by  a  dirigiste  state  became  after  1660  the  back¬ 
bone  of  the  so-called  Colbert  system,  after  Jean- 
Baptiste  Colbert,  Louis  XIV’s  minister  of  finance, 
who  greatly  provoked  the  English  at  the  end  of  the 
same  century. 

Hence,  the  historical  context  in  which  the  mer¬ 
cantilists  wrote  and  advised  was  perhaps  better 
understood  by  19th-century  German  scholars  such 
as  Gustav  Schmoller  (belonging  to  the  German 
historical  school),  who  identified  mercantilism 
as  an  expression  of  nation  building  during  the 
early-modern  period,  than  by  19th-century  British 
liberalism.  Nevertheless,  Smith’s  definition  of  mer¬ 
cantilism  as  a  specific  school  of  economic  theory 
building  on  flawed  views  of  the  true  relationship 
between  commerce  and  wealth  is  still  widely 
acknowledged.  To  what  extent  the  straw  man  of  a 
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mercantilist  school  he  produced  corresponds  to 
historical  reality  may  be  doubted,  although  it  did 
serve  to  convince  everybody  of  the  revolutionary 
character  of  Smith’s  own  contribution. 

Lars  Magnusson 
University  of  Uppsala 
Uppsala,  Sweden 
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Meta-Analysis 


The  meta-analysis  research  design  involves  ana¬ 
lyzing  many  individual  cases  or  research  reports 
and  looking  for  commonalities  and  discrepancies 
in  the  published  findings.  In  this  design,  a  group 
of  previously  researched  cases  is  selected  and  sub¬ 
jected  to  comparative  analysis.  Gene  V.  Glass 
(1976)  introduced  the  technique  as  a  “rigorous 
alternative  to  the  casual,  narrative  discussions  of 
research  studies  which  typify  our  attempts  to 
make  sense  of  the  rapidly  expanding  research  lit¬ 
erature”  (p.  3). 

Meta-analysis  is  an  observational  technique 
used  to  summarize  and  compare  the  results  of 
studies  produced  by  other  researchers.  Objectives 
for  meta  case  research  include  establishing  the 
state  of  research  findings  on  a  subject  and  provid¬ 
ing  an  overview  of  what  others  are  saying  about 
the  subject.  Referred  to  as  an  analysis  of  analyses, 
the  method  involves  statistical  analysis  of  a  large 


collection  of  individual  case  studies  for  the  pur¬ 
pose  of  integrating  the  findings  of  the  total  set. 
Below,  the  advantages  and  disadvantages  of  meta¬ 
analysis  and  the  requisite  steps  are  discussed  in 
greater  detail. 

Meta-analyses  should  only  be  applied  to  empir¬ 
ical  research  reports — that  is,  studies  that  have 
analyzed  primary  quantitative  data  collected  by 
researchers  who  prepared  the  original  reports.  The 
findings  of  previously  prepared  research  reports 
are  compared  or  evaluated  using  such  statistical 
processes  as  regression  and  correlation  analyses.  A 
recent  example  of  the  design  is  the  Chris 
Doucouliagos  and  Mehmet  Ali  Ulaba§oglu  (2008) 
meta-analysis  of  84  published  studies  of  the 
hypothesized  relationship  between  political  democ¬ 
racy  and  economic  growth.  The  individual  results 
were  spread  across  a  continuum  of  positive,  nega¬ 
tive,  and  insignificant  findings,  leading  them  to 
conclude  that  while  democracy  does  not  have  a 
direct  impact  on  economic  growth,  an  inconclusive 
relationship  is  common  across  the  studies.  Their 
meta-analysis  resulted  in  three  key  points: 

1.  It  provided  a  comprehensive  analysis  of  the 
democracy-growth  research  based  on  the  entire 
body  of  published  cases. 

2.  Their  quantitative  analysis  made  it  possible  to 
make  inferences  based  on  the  significance  of  the 
democracy-growth  relationship. 

3.  It  revealed  the  heterogeneity  of  the  results  in 
previous  individual  studies. 

Advantages  and  Disadvantages 
of  Meta-Analysis 

Measurement  and  statistics  researchers  at  the 
University  of  Maryland  identified  what  may  be  the 
single  most  important  advantage  of  meta-analysis: 
The  human  mind  may  be  unable  to  effectively  pro¬ 
cess  and  evaluate  a  large  number  of  alternatives;  a 
meta-analysis  resolves  this  issue.  Individuals  have 
a  difficult  time  when  asked  to  evaluate  the  results 
of,  say,  20  similar  studies.  When  the  number  of 
studies  is  increased  to  200 — a  not  untypical  num¬ 
ber  in  many  meta-analyses — the  mind  reels. 
Fortunately,  statistical  methods  and  software  are 
readily  available  for  coping  with  the  complexity 
inherent  in  large  numbers  of  cases. 
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Mark  W.  Lipsey  and  David  B.  Wilson  identified 
four  additional  advantages  of  a  meta-analysis. 
First,  the  process  of  coding  and  establishing  crite¬ 
ria  for  selecting  studies  (a  survey  protocol ),  read¬ 
ing  the  study  reports,  coding  the  material,  and 
subjecting  it  to  a  rigid  statistical  analysis  imposes  a 
discipline  on  the  researcher  that  is  sometimes  miss¬ 
ing  in  qualitative  summarizations  and  comparative 
analyses.  Second,  the  summaries  of  research  on 
similar  topics  may  produce  finer  measurement  gra¬ 
dations  of  themes  that  might  have  otherwise  been 
missed  with  a  design  using  only  one  or  a  few  cases. 
The  application  of  common  statistical  tests  across 
all  the  studies  can  correct  for  wide  differences  in 
sample  size.  Third,  because  the  meta-analysis 
examines  many  case  studies,  it  may  be  possible  to 
find  effects  or  associations  that  one  or  a  smaller 
number  of  other  comparative  case  analyses  may 
have  missed.  Fourth,  the  meta-analysis  process 
provides  a  way  to  organize  and  structure  diverse 
information  from  a  wide  variety  of  study  findings. 

Meta-analysis  is  not  without  its  disadvantages. 
A  few  of  the  criticisms  that  have  been  identified  for 
the  method  include  the  following: 

1.  The  large  amount  of  effort  and  expertise  it 
requires  is  an  often-cited  disadvantage  of  the 
method.  A  meta-analysis  takes  considerably 
more  time  than  a  conventional  research 
literature  review  and  may  require  specialized 
knowledge  of  case  information  that  the 
researcher  may  not  have. 

2.  The  original  may  have  missed  some  important 
issues,  including  but  not  limited  to  the  social 
context  of  the  study,  theoretical  influences  and 
implications,  methodological  quality,  design 
issues,  and  procedures. 

3.  The  mix  of  studies  combined  into  larger  groups 
may  hide  subtle  differences  seen  in  individual 
studies. 

4.  Inclusion  of  studies  that  are  methodologically 
weak  can  detract  from  the  findings  in  the  strong 
studies  included  in  the  analysis. 

The  Meta-Analysis  Method 

A  meta-analysis  follows  the  same  system  of  proce¬ 
dures  as  single-case  research.  Researchers  first 
identify  a  research  question  (the  purpose  for  the 


review)  and  establish  their  objectives  for  the  study. 
Second,  they  select  an  organizational  framework 
for  the  analysis — that  is,  they  determine  what  vari¬ 
ables  they  will  measure  across  the  entire  sample  of 
cases.  Third,  they  select  a  sample  of  case  studies 
and  specify  why  the  individual  cases  (sample  ele¬ 
ments)  will  be  chosen  for  the  meta-analysis. 
Fourth,  researchers  collect,  code,  and  tabulate  the 
data;  this  involves  investigating  and  reporting  on 
the  relationships  found  between  the  studies 
included  in  the  review.  Reporting  the  findings 
involves  connecting  the  findings  back  to  the  study 
question,  purpose,  and  objectives  of  the  analysis. 
The  following  describes  the  complete  process  in 
greater  detail. 

Formulating  the  Problem 

The  first  step  involves  identifying  the  reason  for 
doing  the  research:  defining  the  research  problem. 
In  a  meta-analytical  study,  the  core  task  is  deciding 
what  questions  or  hypotheses  should  be  examined 
and  what  evidence  should  be  included  in  the 
review  of  the  selected  cases.  Formulating  the  prob¬ 
lem  means  deciding  what  research  results  should 
be  examined. 

Criteria  for  Selecting  Cases 

Criteria  for  selecting  cases  to  be  included  in  the 
analysis  might  include  characteristics  of  the  client 
population,  geographic  location,  history  and/or 
experience  level  of  the  delivering  organization  and / 
or  its  sponsoring  government  agency,  or  research 
method  used  by  the  case  writer.  Each  selected  case 
should  add  to  the  research  question  knowledge 
base.  A  good  meta-analysis  is  not  concerned  with 
drawing  a  representative  sample  of  the  literature 
on  the  topic;  rather,  it  seeks  to  include  the  entire 
population  of  published  studies  on  the  topic  of 
interest. 

Standardizing  the  Points  for  Analysis 

Since  the  meta-analysis  design  involves  analysis 
of  analyses  and  all  analyses  are  different  in  some 
way,  this  is  one  of  the  most  problematic  steps  of 
meta-analysis.  Glass  referred  to  this  as  the  “apples 
and  oranges”  problem.  The  cases  selected  are  stud¬ 
ies  of  different  aspects  of  a  phenomenon:  They 
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involve  different  subjects,  are  done  at  different 
times  by  different  researchers,  and  often  involve 
different  research  methods.  Therefore,  they  are  not 
all  studies  of  “apples”  or  studies  of  “oranges.”  To 
get  around  the  difficulties  that  these  differences 
entail,  the  researcher  must  examine  the  same  ele¬ 
ment  in  every  case,  ignoring  findings  peculiar  to 
the  one  case  alone. 

Standardizing  the  Analysis  Procedure 

Here,  the  emphasis  is  on  method  rather  than 
focus.  The  analysis  procedure  usually  involves 
standard  statistical  techniques  such  as  regression 
and  correlation  analysis.  However,  the  techniques 
and  procedures  used  in  the  individual  cases  studied 
are  often  specific  to  that  research  question  or 
political  science  subdiscipline.  The  meta-analysis 
reviewer  must  differentiate  between  what  the 
original  report  writer  supposed  and  what  the 
results  of  the  study  indicate. 

Selecting  a  Coding  Scheme 

Coding  of  variables  for  comparison  or  tabula¬ 
tion  must  be  consistent  across  the  body  of  litera¬ 
ture  reviewed  in  the  meta-analysis.  There  are  few 
rules  for  coding,  other  than  to  retain  consistency. 
Numeric  coding  is  the  preferred  approach  since 
most  researchers  will  use  computers  and  statistical 
software  for  their  analysis. 

Describing  the  Statistical  Processes  Used 

A  guide  to  follow  in  describing  the  statistical 
processes  used  in  meta-analysis  is  to  refer  to  the 
available  statistical  software.  Much  of  this  soft¬ 
ware  has  been  available  free.  Meta-Stat,  from  the 
University  of  Maryland,  is  an  example  of  meta¬ 
analysis  software.  Statistical  analysis  described  in 
this  software  ranges  from  simple  descriptive  statis¬ 
tics  and  correlation  analysis  to  regression  analysis 
and  analysis  of  variance,  to  name  just  a  few  of  the 
popular  tests  included.  Meta-analysis  is  facilitated 
using  a  meta-analytic  schedule  or  cross-case  table. 
Rows  are  cases;  columns  are  the  results  of  the 
analysis  on  the  items  or  attributes  of  interest.  Cell 
entries  may  be  brief  summaries  of  the  findings  or 
as  simple  as  a  check  mark.  The  schedule  makes  it 
easy  to  record  and  communicate  analysis  results. 


Analyzing  the  Data  and  Writing  the  Report 

Data  analysis  is  a  product  of  the  statistical  soft¬ 
ware  used  by  the  researcher  and,  as  such,  is  rather 
straightforward.  Writing  the  report,  however, 
introduces  a  subjective  element  to  the  study.  The 
researcher  must  decide  how  much  information  to 
include  and  how  much  to  leave  out  of  the  report. 
Care  must  be  taken  to  control  for  issues  of  validity, 
although  statistical  inferences  are  generally  not 
made  with  meta-analysis  findings.  Because  meta¬ 
analysis  involves  the  interpretation  of  materials 
written  by  others,  great  care  is  needed  in  preparing 
the  final  report  of  the  analysis. 

Meta-analysis  analysis  has  been  found  to 
be  facilitated  by  following  the  principles  of  herme¬ 
neutic  analysis.  Hermeneutics  is  a  method  of 
analyzing  all  types  of  data,  but  it  is  particularly 
relevant  for  the  analysis  of  written  material  such  as 
that  found  in  case  studies.  Hermeneutics  follows  a 
set  of  principles  that  requires  the  analyst  to 
(a)  decipher  the  meaning  in  the  texts  through  the 
eyes  and  intent  of  the  writer  or  creator,  (b)  frame 
the  meaning  in  the  time  period  of  the  case  research 
and  writing  (often  referred  to  as  the  context  within 
which  the  document  was  prepared),  and  (c)  deter¬ 
mine  the  meaning  considering  the  political  and 
social  environments  at  the  time  of  the  creation  of 
the  text  or  artifact. 

A  Final  Caveat 

Although  use  of  the  meta-analysis  technique  has 
grown  dramatically  since  its  adoption  in  1976, 
Glass  suggested  in  2000  that,  despite  this  increas¬ 
ing  popularity,  it  might  be  time  to  replace  meta¬ 
analysis  with  an  approach  that  reflects  more 
closely  the  nature  of  modern  research  in  social 
science. 

David  E.  McNabb 
Pacific  Lutheran  University 
Tacoma,  Washington,  United  States 

Note:  This  entry  contains  some  material  from  D.  E.  McNabb’s 
Case  Research  in  Public  Management,  published  by  M.  E. 
Sharpe,  Inc.;  it  is  used  here  with  permission  of  the  publisher. 

See  also  Comparative  Methods;  Quantitative  Methods, 
Basic  Assumptions;  Quantitative  Versus  Qualitative 
Methods 


1554  Metagovernance 


Further  Readings 

Bengtsson,  L.,  Larsson,  R.  Griffiths,  A.,  &  Hine,  D. 
(2007).  Case  and  iCase:  Facilitating  case  survey 
methods  for  creating  research  and  teaching  synergies 
in  innovation  and  enterprise.  In  D.  Fline  &  D.  Carson 
(Eds.),  Innovative  methodologies  in  enterprise  research 
(pp.  101-123).  Cheltenham,  UK:  Edward  Elgar. 

Doucouliagos,  H.,  &  Ulaba§oglu,  M.  A.  (2008). 

Democracy  and  economic  growth:  A  meta-analysis. 
American  Journal  of  Political  Science,  52(1),  61-83. 

Glass,  G.  V.  (1976).  Primary,  secondary,  and  meta-analysis 
of  research.  Educational  Researcher,  11  ( 18),  3-8. 

Glass,  G.  V.  (2000).  Meta-analysis  at  25  (Conference 
Paper).  Retrieved  December  22,  2008,  from  http:// 
gvglass.info/papers/meta25.html 

Lipsey,  M.  W.  (2003).  Those  confounded  moderators  in 
meta-analysis:  Good,  bad,  and  ugly.  Annals  of  the 
American  Academy  of  Political  and  Social  Science, 
587,  69-81. 

Lipsey,  M.  W.,  &  Wilson,  D.  B.  (2001).  Practical  meta¬ 
analysis.  Thousand  Oaks,  CA:  Sage. 

McMillan,  J.  H.,  &  Schumacher,  S.  (1997).  Research  in 
education.  New  York:  Longman. 

McNabb,  D.  E.  (2010).  Case  research  in  public 
management.  Armonk,  NY:  M.  E.  Sharpe. 

Meta-Stat.  (n.d.).  The  meta-analysis  of  research  studies. 
College  Park:  University  of  Maryland,  Department  of 
Measurement,  Statistics  and  Evaluation.  Retrieved 
December  22,  2008,  from  http://echo.edres.org: 8080/ 
meta 


Metagovernance 


Metagovernance  refers  to  the  need  of  formal  public 
organizations  to  exercise  some  control  over  devolved 
and  decentralized  decision-making  organizations. 
In  line  with  the  common  use  of  the  prefix  meta-, 
which  means  over  and  beyond,  the  shortest  defini¬ 
tion  of  the  term  metagovernance  is  the  governance 
of  governance.  Just  as  the  meaning  of  metaphysics 
is  determined  by  the  definition  of  physics,  the  mean¬ 
ing  of  metagovernance  in  practice  depends  on  how 
the  term  governance  is  defined.  It  is  generally 
accepted  that  three  ideal-typical  styles  of  gover¬ 
nance  can  be  distinguished,  which  usually  form 
combinations:  hierarchical,  network,  and  market 
governance.  When  one  style  dominates,  the  other 
two  often  run  in  the  background.  Other  forms  of 
governance,  such  as  public-private  partnerships 


and  the  European  Union’s  open  method  of  coor¬ 
dination,  can  be  considered  as  hybrids  of  the  three 
basic  styles. 

Each  of  the  ideal  types  has  a  clear  and  distinct 
internal  logic.  The  central  value  of  hierarchical 
governance  is  authority;  therefore  an  authoritative 
and  legitimate  form  of  governance  is  sought.  The 
central  values  of  network  governance  are  empathy 
and  trust,  and  therefore,  the  results  are  preferably 
based  on  consensus.  Market  governance  is  based 
on  competition  and  price,  which  makes  it  logical 
that  the  best  results  are  the  most  competitive  and 
cheapest  products. 

This  most  used  threefold  concept  of  governance 
leads  to  defining  metagovernance  as  the  gover¬ 
nance  of  hierarchies,  networks,  and  markets.  More 
precisely,  metagovernance  is  that  which  produces 
some  degree  of  coordinated  governance,  by  design¬ 
ing  and  managing  sound  combinations  of  hierar¬ 
chical,  market,  and  network  governance,  to  achieve 
the  best  possible  outcomes  from  the  viewpoint  of 
those  responsible  for  public  sector  performance. 
Metagovernance  is  not  a  supergovernance  style 
but  an  attitude  and  an  approach  that  is  expected  to 
help  overcome  some  of  the  typical  failures  of  each 
of  the  governance  styles  and  of  their  combinations. 

Variations  of  Metagovernance 

The  term  metagovernance  was  coined  by  Bob 
Jessop  in  1997.  He  argued  that  the  three  ideal- 
typical  governance  styles  can  be  mutually  under¬ 
mining  and  that  they  each  have  their  typical 
failures.  He  proposed  metagovernance  as  a  required 
coordination  mechanism.  After  having  investigated 
reforms  of  the  Australian  public  sector,  Lynn  Davis 
and  Rod  Rhodes  (2000)  confirmed  this  argument: 
Creating  effective  mixtures  of  the  three  styles  if  they 
undermine  each  other  is  an  important  challenge  for 
the  public  sector.  Another  reason  why  the  concept 
of  metagovernance  may  be  useful  is  that  each  of  the 
three  styles  has  such  a  distinct  logic  that  it  is  quite 
tempting  to  consider  it  as  a  solution  for  every¬ 
thing — a  panacea — while  neglecting  the  inherent 
failures  of  the  style  and  the  mitigating  characteris¬ 
tics  of  the  other  styles. 

A  last  reason  is  that  devising  successful 
approaches  to  governance  has  become  more  diffi¬ 
cult  since  our  societies  have  become  more  com¬ 
plex.  With  the  emergence  of  Manuel  Castells’s 
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network  society  and  the  acceptance  of  network 
governance  as  the  third  main  style,  the  dilemma  of 
dealing  with  two  major  approaches,  hierarchy  and 
market,  evolved  into  a  triple  dilemma,  or  tri¬ 
lemma.  The  proponents  of  metagovernance  argue 
that  applying  metagovernance  as  judiciously  inter¬ 
vening  in  governance  style  mixtures,  taking  a 
bird’s-eye  perspective,  increases  the  capability  of 
public  sector  organizations  to  deal  with  complex 
societal  problems  or  opportunities. 

The  three  ideal  types  of  governance  are  value 
laden.  Using  metagovernance  implies  deliberately 
taking  a  situational  view:  What  is  best  is  deter¬ 
mined  by  the  type  of  problem,  the  organizational 
culture,  and  the  level  of  pressure  of  stakeholders, 
for  example.  The  concept  has  a  light  normative 
dimension,  because  the  underlying  concept  of  gov¬ 
ernance  has  inherent  assumptions,  for  example, 
that  the  intention  is  to  solve  collective  problems, 
not  for  individual  profit  but  for  the  common  good. 

Some  scholars  use  the  term  metagovernance  in 
the  meaning  of  the  governance  of  one  specific  gov¬ 
ernance  style.  This  form  of  metagovernance  can  be 
called  first-order  metagovernance,  compared  with 
the  then  second-order  metagovernance  of  hierar¬ 
chies,  networks,  and  markets.  Following  this  line  of 
thought,  Eva  Sorensen  and  Jacob  Torfing  (2007) 
define  metagovernance  as  a  way  to  enhance  net¬ 
work  governance,  by  use  of  hierarchical  mecha¬ 
nisms  or  instruments,  such  as  introducing  house 
rules  in  a  network  or  producing  a  legal  framework 
in  which  network  processes  should  occur.  This 
makes  sense  when  the  field  of  research  is  democratic 
network  governance,  where  the  main  tensions  are 
between  participative  and  bureaucratic  forms  of 
democracy.  Others,  such  as  Josie  Kelly  (2006), 
define  metagovernance  as  an  attempt  to  regain  state 
control  over  new  forms  of  governance.  They  observe 
that  hierarchical  governance  emerges  in  a  new  form 
to  coordinate  network  and  market  styles  of  gover¬ 
nance.  This  secures  governmental  influence  in  the 
form  of  command  and  control  within  network  and 
market  style  governance  regimes.  As  Mark 
Whitehead  (2002)  notes,  metagovernance  is  then  a 
counterprocess  to  governance  of  networks. 

The  Metagovernor’s  Rationale 

The  role  of  metagovernor  can  be  taken  up  by  any 
public  manager  or  politician  in  charge  of  finding 


solutions  for  societal  problems  or  creating  new 
societal  opportunities.  When  such  responsible 
agents  design  and  intervene  in  governance  style 
mixtures,  they  may  have  a  specific  logic  of  action 
or  rationale.  The  question  is  how  they  can  govern 
from  the  center  while  also  maintaining  enough 
autonomy  (market  governance)  and  involvement 
(network  governance)?  A  comparative  research 
showed  that  public  managers  used  three  metagov¬ 
ernance  strategies: 

1.  The  first  strategy  is  combining  styles. 
Combining  the  styles  not  only  prevents  conflicts 
but  also  creates  synergy.  Hierarchy  brings  struc¬ 
ture  and  market  governance  enriches  a  network 
with  efficiency  and  entrepreneurship.  Network 
governance  may  secure  just  enough  empathy  in 
a  new  public  management  approach  (which  is 
a  combination  of  the  two  rational  ideal  types — 
hierarchy  and  market). 

2.  The  second  strategy  is  switching  between 
styles  when  the  situation  requires  this.  A  policy 
project  may  start  with  a  network  approach  and 
then  introduce  hierarchy  by  establishing  rules, 
while  the  next  phase  may  be  dominated  by  market 
mechanisms  such  as  efficiency-driven  autonomous 
activities  of  the  involved  stakeholders.  A  hierarchi¬ 
cal  phase  may  be  necessary  to  secure  the  results, 
after  which  a  new  network  phase  may  start. 

3.  The  third  strategy  is  maintenance  of  situation- 
ally  successful  governance-style  mixtures.  This  is  a 
second-order  strategy  and  complements  the  com¬ 
bining  and  switching  of  strategies.  For  example, 
conflicts  are  prevented  by  isolation — separating  a 
team’s  approach  to  prevent  it  from  being  under¬ 
mined  by  the  characteristics  of  the  other  styles — or 
empowerment — giving  the  team  a  sufficient  high 
degree  of  discretion.  Maintenance  requires  aware¬ 
ness  of  the  weaknesses  of  the  three  ideal  types. 
During  a  crisis,  for  example,  the  almost  unlimited 
discretion  may  lead  to  abuse. 

Another  dimension  of  the  metagovernor’s 
rationale  is  the  understanding  of  five  frame¬ 
work  conditions  that  influence  the  feasibility  of 
metagovernance: 

1.  The  first  condition  concerns  the  culture,  tradi¬ 
tions,  and  history  of  the  administrative  and  societal 
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system.  National  underlying  preferences  for  a  gov¬ 
ernance  style  influence  the  composition  of  gover¬ 
nance  mixtures  to  a  certain  extent  but  do  not 
predict  a  specific  style  combination.  However,  they 
seem  to  predict  the  first  style  to  be  considered — the 
“default  style” — which  is,  for  example,  market 
governance  in  the  United  Kingdom,  network 
governance  in  the  Netherlands,  and  hierarchical 
governance  in  Germany.  The  other  styles  are 
only  applied  when  the  default  style  turns  out  to  be 
inappropriate. 

2.  The  second  condition  is  the  personal  affinity 
of  the  responsible  politician  with  one  of  the  ideal 
types. 

3.  The  third  condition  regards  societal  expecta¬ 
tions  of  the  role(s)  of  governmental  organizations. 
Civil  society  may  lobby  for  network  arrangements, 
whereas  enterprises  typically  strive  for  a  combina¬ 
tion  of  market  and  hierarchical  governance. 

4.  The  fourth  condition  concerns  organizational 
characteristics.  The  organizational  culture  may  be 
open  or  closed,  professional  or  task  oriented,  and 
the  dominant  style  of  leadership  may  be  command 
and  control,  coaching,  or  enabling. 

5.  The  fifth  framework  condition  is  the  type  of 
problem.  This  codetermines  which  style  would 
serve  best  as  the  dominant  one.  If  the  policy  prob¬ 
lem  is  defined  as  an  urgent  matter,  the  rationale  is 
to  choose  a  hierarchical  approach.  If  it  is  a  routine 
issue  that  should  be  dealt  with  as  efficiently  as  pos¬ 
sible,  market  governance  often  works  best,  and  for 
complex  problems,  network  governance  has  a  lot 
to  offer. 

Together  the  strategies  and  the  framework  con¬ 
ditions  constitute  the  metagovernor’s  rationale. 

Practical  Implications  and  Feasibility 

Metagovernance  provides  an  analytical  framework 
that  enables  discussions  between  professionals 
with  different  belief  systems  and  can,  to  a  certain 
extent,  be  used  for  the  design  and  management  of 
governance  approaches.  This  applies  to  both  policy 
making  and  organizational  design. 

In  addition,  the  broad  version  of  metagovernance 
may  provide  useful  insights  for  the  design  of  man¬ 
agement  development  programs.  Such  programs 


usually  mirror  the  dominant  style  of  an  organiza¬ 
tion.  The  problem  is  that  a  hierarchical  conception 
of  management  development  implies  training  sub¬ 
ordinates  only  to  listen  well  and  obey  authority,  a 
market-oriented  training  program  promotes  clients 
to  become  more  entrepreneurial,  and  network 
training  concentrates  on  helping  colleagues  create 
effective  dialogues.  Management  development 
programs  based  on  the  concept  of  metagovernance 
would  be  based  on  different  assumptions.  They 
would  teach  managers  to  apply  governance  styles 
for  which  they  feel  no  personal  affinity.  They 
would  stimulate  personal  development  toward 
being  able  to  reflect  on  all  governance  styles  and 
having  enough  self-insight  to  understand  their  own 
biases.  In  addition,  they  would  include  training  in 
management  techniques  that  are  typical  for  the 
three  styles:  (1)  line  and  project  management  for 
hierarchy,  (2)  business  management  for  market 
governance,  and  (3)  process  management  for 
network  governance.  They  would  also  involve 
training  in  investigating  and  assessing  the  gover¬ 
nance  environment,  to  be  able  to  deal  with  the 
framework  conditions  distinguished  earlier. 

Advocates  of  each  of  the  three  governance 
styles  have  different  assessments  of  the  feasibility 
of  metagovernance.  From  a  network  perspective, 
metagovernance  may  seem  a  rational  attempt  to 
steer  “unsteerable,”  chaotic  situations,  which 
implies  that  the  feasibility  is  low.  Taking  the  more 
rational  perspective  of  hierarchical  and  market 
governance,  there  is  no  fundamental  reason  to 
doubt  whether  metagovernance  can  be  done; 
however,  the  problem  may  have  a  normative  char¬ 
acter.  For  those  who  take  this  perspective,  intro¬ 
ducing  metagovernance  implies  that  also  the 
softer,  nonrational  network  approach  must  be 
taken  into  consideration,  which  might  make  the 
governance  system  messy  and  slow.  Metagov¬ 
ernance  does  not  suggest  a  rational  bias  but  a 
perspective  over  and  beyond  the  rationality  versus 
chaos  discussion. 

To  conclude,  metagovernance  as  mixing  hierar¬ 
chical,  network,  and  market  governance  provides  a 
bird’s-eye  view  of  public  issues.  The  concrete  feasi¬ 
bility  of  metagovernance  of  the  three  ideal-typical 
styles  depends  on  the  (metagovern)ability  of  key 
people  and  on  the  framework  conditions  distin¬ 
guished  above.  Scholars  concentrating  on  this  con¬ 
cept  hope  that  such  a  reflexive,  multiperspective 
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view  may  prevent  the  public  sector  from  running 
into  unnecessary  problems. 

Louis  Meuleman 
Nyenrode  Business  University 
Breukelen,  Netherlands 

See  also  Governance;  Hierarchical/Multilevel  Models; 
Networks;  New  Public  Management 
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Political  methodology  deals  with  all  issues  related 
to  empirical  political  research  (nonempirical  work, 
such  as  pure  formal  or  normative  theory,  is 
excluded  here).  Methodology,  as  it  is  understood 
here,  simply  refers  to  the  ways  in  which  we 
acquire  knowledge  and  comprises  a  multitude  of 
specific  methods  and  techniques.  As  such,  it  is 
embedded  in  an  epistemological  tradition  of  “crit¬ 
ical  rationalism”  (Karl  Popper)  and  “scientific 
realism.”  This  has  been  summarized  as  the  “two¬ 
fold  conviction  that  the  world  consists  of  causal 
mechanisms  that  exist  independently  of  our 
study — or  even  awareness — of  them,  and  that  the 
methods  of  science  hold  our  best  possibility  of  our 
grasping  their  true  character”  (Ian  Shapiro,  2005, 


pp.  8-9).  While  it  is  often  confused  with  narrower 
topics  such  as  statistics,  methodology  is  a  broad 
area  that  deals  with  every  aspect  of  political 
research,  both  quantitative  and  qualitative.  While 
some  methodological  issues  are  more  relevant  to 
certain  subfields  or  types  of  research,  all  political 
science  is  subject  to  similar  standards  and  logic. 
While  political  methodology  is  related  to  more 
general  social  science  methodology,  there  are  spe¬ 
cific  issues  that  distinguish  political  methodology, 
while  there  is,  of  course,  a  shared  logic  and  stan¬ 
dards  across  the  empirical  social  sciences.  Since 
political  science  is  itself  defined  by  substantive 
questions,  there  is  much  importing  of  methods 
from  other  disciplines  into  political  science. 
Questions  of  what  is  imported  and  the  relevance 
of  imported  methods  are  important  issues  in 
political  methodology.  This  entry  discusses  some 
of  the  major  advances  in  this  field. 

While  political  methodology  deals  with  empiri¬ 
cal  research,  there  cannot  be  any  purely  empirical 
research.  Every  empirical  study  involves  some  rela¬ 
tionship  between  a  theoretical  concept  and  its 
empirical  referent.  Even  very  narrow  empirical 
research,  such  as  the  measurement  of  electoral  turn¬ 
out  in  a  given  locality  in  a  given  period,  requires  a 
theoretical  assessment  of  what  is  electoral  turnout. 
For  example,  are  people  who  are  of  legal  age  to  vote 
but  excluded  from  the  process  because  of  a  prior 
felony  (as  some  are  in  the  United  States)  counted  in 
the  denominator?  While  empirical  studies  of  voting 
turnout  are  less  complex  theoretically  than  studies 
of,  say,  whether  being  a  democracy  causes  a  nation 
to  be  more  pacific,  both  studies  involve  a  mix  of 
theoretical  and  empirical  analysis  and  both  can  be 
assessed  using  the  same  logic.  Thus,  issues  of  mea¬ 
surement  are  always  critical;  such  issues  have 
become  even  more  critical  as  technology  makes  new 
forms  of  data  (video,  blogs)  available,  or  it  makes  it 
possible  to  easily  analyze  data  that  we  have  always 
used  but  found  hard  to  deal  with  (text).  The  ability 
to  deal  with  new  sources  of  very  complicated  data 
and,  with  modern  computers,  the  ability  to  code 
massive  amounts  of  nonquantitative  data,  as  well  as 
the  ability  to  collect  individual  data  via  the  Internet, 
are  among  the  most  exciting  developments  in  polit¬ 
ical  methodology.  Along  with  this,  there  has  been 
much  progress  on  issues  of  measurement. 

Similarly,  there  can  be  no  difference  in  the 
underlying  logic  of  qualitative  and  quantitative 
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research.  While  obviously  the  specific  tools  will  be 
different,  if  the  question  of  interest  is  why  coun¬ 
tries  have  differing  regulatory  systems,  we  may 
pursue  this  in  a  number  of  ways.  But,  in  the  end, 
all  such  studies  must  be  able  to  answer  whether  the 
evidence  used  leads  to  the  conclusion  asserted. 
While  process  tracing  through  official  papers  of 
decision  makers  is  different  from  regressing  regula¬ 
tory  rules  on  political  variables,  both  may  use  one 
type  of  quantitative  method,  while  students  of 
cross-national  comparative  politics  may  use 
another  type;  these  subfields  are  also  subject  to  the 
same  fundamental  logic.  This  point  has  been  force¬ 
fully  made  in  recent  years  in  important  books  by 
Gary  King,  Robert  Keohane,  and  Sidney  Verba 
and  by  Henry  Brady  and  David  Collier.  The  inter¬ 
relationship  (both  similarities  and  dissimilarities) 
between  quantitative  and  qualitative  analysis,  and 
how  to  combine  both  types  to  improve  research,  is 
another  research  area  in  political  methodology 
that  is  seeing  much  discussion. 

Some  political  science  research  is  purely  descrip¬ 
tive:  how  many  people  vote,  whether  more  people 
voted  the  previous  year  than  this  year,  and  the  like. 
As  noted,  even  for  this  simple  issue,  serious  mea¬ 
surement  issues  arise:  What  kind  of  people  vote, 
and  what  does  it  mean  to  vote?  Obviously,  such 
issues  are  simpler  than  asking  how  many  countries 
are  democracies,  but  the  methodological  logic  does 
not  change.  Most  political  science  research,  how¬ 
ever,  is  not  simply  descriptive.  While  sometimes 
we  care  only  about  associations,  much  research 
uses  causal  language,  metaphors,  and  ideas.  Thus, 
while  we  might  be  interested  only  in  whether  there 
is  a  correlation  between  being  a  democracy  and  the 
amount  of  public  goods  provided,  we  are  often 
more  interested  in  whether  there  is  a  causal  rela¬ 
tionship,  such  that  being  a  democracy  leads  to 
more  public  goods  being  provided.  In  purely  asso- 
ciational  studies,  the  variables  are  all  treated  sym¬ 
metrically;  when  using  the  language  of  causality, 
there  is  always  a  variable  that  is  being  caused  and 
one  or  more  variables  doing  the  causing.  Issues  of 
assessing  causality  are  in  the  forefront  of  current 
discussions  of  political  methodology,  whether 
quantitative  or  qualitative.  These  issues  can  all  be 
considered  under  the  heading  of  research  design. 

Finally,  when  the  above  issues  have  been  dealt 
with,  there  is  the  issue  of  how  to  get  the  data  to 
speak  clearly  and  how  to  assess  what  inferences 


can  be  drawn  from  the  data  (be  it  qualitative  or 
quantitative).  This  is  the  realm  of  statistics  (for 
quantitative  studies).  Obviously,  the  use  of  sophis¬ 
ticated  statistical  methods  has  mushroomed  in 
political  science,  at  least  partly  as  a  function  of  the 
increased  computer  power  that  is  now  commonly 
available.  But  while  this  part  of  methodology  is 
often  seen  as  highly  mathematical  and  compli¬ 
cated,  it  is  usually  the  simplest  part  of  political 
methodology.  It  is  also  the  case  that  the  most 
sophisticated  statistics  cannot  save  a  poor  research 
design  or  poor  measurement;  by  contrast,  good 
research  design  often  leads  to  simple  statistical 
analysis.  This  entry  begins  with  developments  in 
data  and  measurement,  proceeds  to  research  design 
and  measurement,  and  concludes  with  a  discussion 
of  statistical  methods. 

Data  and  Measurement 

Empirical  research,  of  course,  deals  with  data 
(observations),  and  earliest  empirical  work,  that  of 
Aristotle  on  constitutions,  took  as  data  the  various 
Greek  constitutions.  Data  are  gathered  in  a  multi¬ 
tude  of  ways,  ranging  from  reading  diaries  or 
internal  records  of  decision  making,  to  conducting 
field  observations  and  (unstructured  or  semistruc- 
tured)  interviews  with  local  leaders,  to  analyzing 
carefully  collected  economic  statistics  or  a  variety 
of  highly  structured  survey  data.  Every  type  of 
research  has  certain  types  of  data  that  are  more 
commonly  used,  but  all  types  of  data  are  subject  to 
the  same  standards  and  issues.  There  is  a  difference 
between  journalism  and  political  science,  and 
much  of  that  difference  has  to  do  with  standards 
for  data.  Both  journalists  and  political  scientists 
may  interview  political  leaders,  but  the  way  they 
collect  and  code  that  information  is  usually  quite 
different. 

Modern  technological  developments  have  had 
an  enormous  impact  on  political  science  data. 
Much  of  the  data  we  use  are  textual  (laws,  party 
manifestos,  records  of  debate  and  deliberation, 
court  decisions,  newspaper  accounts,  and  minutes 
and  records  of  administrative  procedures,  among 
many  others).  Our  discipline  has  always  used  such 
data,  but  coding  these  data  was  extremely  difficult 
and  time-consuming,  leading  to  such  data  being 
underused.  Now,  much  of  these  data  come  in 
machine-readable  form,  and  any  data  available  in 
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hard  copy  can  be  scanned  and  put  into  such  form. 
Thus,  it  is  now  relatively  easy  to  code  documents 
for  the  use  of  different  types  of  words  (or  themes 
or  tropes  or  whatever  one  likes).  Machines  can 
quickly  search  the  records  of  various  newspapers 
to  code  for  various  types  of  political  events,  a  task 
that  used  to  require  a  large  team  of  graduate  stu¬ 
dents  and  a  very  large  budget.  With  modern  com¬ 
putations,  it  is  relatively  easy  for  any  investigator 
to  code  these  textual  data  in  a  way  that  meets  the 
needs  of  an  individual  research  project.  This  is  one 
of  the  most  exciting  advances  in  data  collection  in 
our  discipline,  an  advance  that  is  well  under  way. 
The  possibilities  for  the  analysis  of  textual  data  are 
almost  limitless. 

The  analysis  of  surveys  has  been  a  mainstay  in 
our  discipline  for  the  past  half  century  or  more. 
Until  recently,  analysts  were  at  the  mercy  of  the 
survey  organization;  conducting  a  survey  was  a 
multimillion-dollar  task.  Thus,  electoral  analysts 
in  each  country  worked  with  the  same  standard 
survey  collected  by  some  national  research  organi¬ 
zation;  research  could  not  go  beyond  the  questions 
asked  by  that  organization.  With  modern  advances 
in  communications  and  computers,  it  is  now  rela¬ 
tively  easy  for  researchers  to  design  their  own 
surveys  to  suit  the  needs  of  a  specific  research  proj¬ 
ect.  We  are  also  seeing  more  standardized  surveys, 
so  students  of  elections  can  now  analyze  a  similar 
set  of  questions  in  almost  any  European  country 
(and  there  are  similar  efforts  in  other  parts  of  the 
world,  with  the  Afrobarometer  and  the  Latino- 
barometro  and  the  World  Values  Survey).  At  the 
same  time,  a  researcher  wanting  to  study  a  specific 
event  can  put  a  survey  in  the  field  in  a  short  space 
of  time  and  at  a  reasonable  cost.  Larger  survey 
houses  can  monitor  populations  over  the  course  of 
an  election  and  even  change  or  add  questions  as 
issues  arise  over  the  course  of  a  campaign.  Critical 
to  this  is  the  ability  to  monitor  a  survey  on  a  day- 
to-day  basis.  In  earlier  times  it  would  have  been 
months  or  years  until  survey  data  became  avail¬ 
able;  with  the  new  technology  such  data  are  avail¬ 
able  on  a  daily  basis.  Thus,  for  example,  it  was 
possible  to  track  changes  in  the  British  electorate 
in  2010  and  their  responses  to  the  introduction  of 
debates  into  the  British  campaign  soon  after  or 
almost  immediately. 

Modern  technology  also  makes  it  easy  to 
embed  experiments  into  a  survey.  Thus,  one  can 


give  different  respondents  different  scenarios  or 
different  question  wordings  or  whatever  else  one 
likes,  and  these  different  treatments  can  be  chosen 
randomly.  This  technology  has  greatly  increased 
the  use  of  field  experiments.  The  incredible  growth 
of  the  Internet  has  been  extremely  important  here. 
Various  survey  organizations  in  many  countries 
give  the  researcher  access  to  a  huge  pool  of 
respondents,  as  well  as  the  tools  to  quickly  design 
a  survey  instrument  and  to  allow  the  researcher  to 
take  advantage  of  experimental  manipulation  in 
the  questions  (subject  only  to  ethical  constraints 
on  such  manipulation).  Internet  surveys  can  be 
undertaken  at  very  low  cost  and  are  within  the 
budget  even  of  students.  While  there  are  still  many 
issues  on  the  use  of  the  Internet  in  this  manner,  we 
are  clearly  seeing  more  and  more  use  of  the 
Internet  (both  for  reasons  of  cost  and  sample  size 
and  because  other  modes  of  interviewing  are 
becoming  more  problematic). 

Once  data  are  collected,  numbers  must  be 
assigned.  This  is  the  process  of  measurement, 
which  relies  both  on  concept  formation  and  (for 
quantitative  studies)  various  methods  often  associ¬ 
ated  with  psychometrics.  Qualitative  scholars  have 
paid  much  attention  to  concept  formation,  and,  in 
conjunction  with  tools  such  as  Charles  Ragin’s 
“Qualitative  Comparative  Analysis,”  much  prog¬ 
ress  has  been  made.  Students  of  comparative  poli¬ 
tics  have  paid  much  attention  to  how  concepts 
generalize  across  geographic  locations  and  to  how 
we  can  generalize  across  locations  without  having 
too  much  “conceptual  stretching.” 

At  the  same  time,  statistical  and  computing 
advances  have  allowed  researchers  to  move  beyond 
the  psychometric  techniques  available  20  years 
ago.  Today,  there  is  vibrant  activity  in  multidimen¬ 
sional  analysis,  and  in  recent  years,  scaling  tech¬ 
niques,  both  uni-  and  multidimensional,  have  been 
put  on  a  much  firmer  theoretical  basis.  The  new 
textual  data  have  brought  to  the  forefront  issues 
such  as  how  to  locate  political  parties  in  a  multidi¬ 
mensional  issue  space,  and  new  statistical  methods 
have  allowed  for  great  advances  in  the  location  of 
individual  legislators  in  such  a  space. 

The  past  decade  has  seen  a  great  increase  in  the 
amount  of  data  sharing  (largely  through  the  impetus 
of  funding  agencies)  as  well  as  in  journals  requiring 
authors  to  make  replication  data  sets  available. 
While  this  works  very  well  for  quantitative  data,  it 
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is  more  problematic  for  qualitative  data  (inter¬ 
views,  field  observation  notebooks,  and  the  like). 
However,  as  it  becomes  easier  to  either  collect  these 
data  in  digitized  format,  or  to  convert  them  to  such 
format,  it  can  be  expected  that  it  will  be  as  easy  to 
make  qualitative  data  publicly  available  as  it  now 
is  for  quantitative  data  (though  obviously  there  are 
more  issues  of  confidentiality  and  the  like). 

Research  Design 

Political  scientists  have  relied  heavily  on  observa¬ 
tional  studies  (whether  quantitative  or  qualitative). 
However,  while  there  is  some  interest  in  pure 
description,  there  is  usually  more  interest  in  mak¬ 
ing  causal  interpretations  from  the  data.  Thus, 
while  we  begin  with  simply  observing  that  pairs  of 
democracies  usually  do  not  go  to  war,  we  are  more 
interested  in  the  question  of  whether,  as  countries 
democratize,  they  become  less  likely  to  go  to  war. 
Finally,  we  try  to  deepen  the  explanation,  asking 
what  facet  of  democracy  makes  democracies  less 
likely  to  fight  each  other. 

The  question  of  how  we  can  infer  causality 
from  observational  data  has  vexed  philosophers  as 
long  as  there  have  been  philosophers.  The  meaning 
of  causality  is  a  vibrant  topic  in  modern  philoso¬ 
phy.  Applied  researchers  have  attempted  to  find 
ways  to  assess  causality  and,  at  a  minimum,  to 
attempt  to  rule  out  other,  noncausal  explanations 
for  findings.  In  the  above  example,  being  a  democ¬ 
racy  may  not  really  be  the  causal  variable;  perhaps, 
instead,  the  real  causal  variable  is  economic  devel¬ 
opment  and  richer  countries  are  simply  more  likely 
to  be  democracies.  Thus,  the  observed  association 
(correlation)  between  democracy  and  peacefulness 
could  be  artifactual  or  spurious.  Both  quantitative 
and  qualitative  researchers  have  devoted  enor¬ 
mous  attention  to  this  issue. 

On  the  purely  qualitative  side,  researchers  have 
paid  great  attention  to  J.  S.  Mill’s  methods.  Thus, 
we  see  large  numbers  of  comparative  case  studies, 
with  researchers  choosing  the  cases  so  as  to  obtain 
variation  on  the  key  dependent  variable  and  the 
causal  variable  but  little  or  no  variation  on  other 
variables.  Researchers  also  choose  cases  for  theory 
testing  based  on  the  cases  that  are  likely  to  prove 
hardest  for  the  theory  to  explain.  Researchers  are 
also  taking  advantage  of  difference  in  designs 
where  two  cases  are  compared  at  two  different 


times:  where  the  cases  were  originally  similar  but  a 
key  variable  (and  preferably  only  that  key  vari¬ 
able)  has  changed  in  one  case  but  not  in  the  other. 

Moving  to  larger  numbers  of  cases,  researchers 
have  used  various  configurational  techniques  to 
see  how  variables  are  related  to  each  other  and  to 
study  complicated  causal  paths.  Great  attention 
has  been  paid  to  necessary  and  sufficient  condi¬ 
tions  and  to  designs  that  can  distinguish  whether  a 
condition  is  necessary,  sufficient,  or  both  or  in 
some  more  complicated  relationship  to  a  variable 
of  interest. 

Qualitative  researchers  have  often  chosen  their 
cases  based  on  issues  of  research  design  or  the 
importance  of  their  cases;  quantitative  researchers, 
conversely,  have  often  chosen  cases  to  maximize 
generalizability,  either  via  national  surveys  or  via 
large  cross-country  analyses.  But  trends  in  quanti¬ 
tative  research  are  tending  to  blur  the  difference 
between  qualitative  and  quantitative  designs  (with 
both  designs  subject  to  similar  standards  about 
inferring  causality).  All  of  these  are  attempts  to 
bring  experimental  insights  into  political  science. 

One  relatively  new  development  that  shows  the 
convergence  of  various  approaches  is  the  “analytic 
narrative.  ”  Here,  researchers  attempt  to  show  how 
some  important  development  in  history  can  be 
explained  by  modern  analytic  theory.  While  these 
analytic  narratives  cannot  test  analytic  theories, 
they  can  make  such  theories  more  or  less  plausible. 
There  is  much  controversy  about  whether  this  tool 
can  really  be  used  to  either  help  validate  a  theory 
or  to  explain  an  important  historical  event:  can  a 
clever  user  of  the  tool  explain  anything,  or,  as  Jon 
Elster  put  it,  are  they  simply  “just  so”  stories? 
However,  this  joining  of  modern  analytic  (usually 
formal)  theory  and  careful  historical  evidence 
shows  how  two  very  different  traditions  can  be 
combined  in  a  potentially  fruitful  manner. 

Political  science  studies  are  most  frequently 
observational.  Other  fields,  such  as  medicine,  rely 
more  heavily  (though  far  from  exclusively)  on 
experiments.  In  medicine,  the  gold  standard  is  the 
clinical  trial,  where  subjects  in  a  selected  pool  are 
randomly  assigned  to  either  a  “treatment”  or 
“control”  group  and  where,  under  appropriate 
conditions,  one  can  infer  whether  the  treatment 
causes  a  better  outcome. 

Experiments  are  much  harder  in  political  sci¬ 
ence.  We  cannot  assign  countries  randomly  to  be 
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either  democracies  or  not,  nor  can  we  assign  voters 
to  be  randomly  rich  or  poor.  We  can,  however,  set 
up  a  laboratory  and  then  randomly  assign  partici¬ 
pants  in  an  experiment  to  a  treatment  or  control 
condition.  Thus,  for  example,  we  can  study 
whether  people  prefer  “fair  outcomes”  by  having 
pairs  of  subjects  bargain,  where  each  pair  is  ran¬ 
domly  assigned  to  some  initial  endowment  or  price 
system.  While  this  is  an  exciting  new  area  of 
political  science,  issues  of  generalizing  from  exper¬ 
iments  to  the  real  world  (external  validity)  will 
always  limit  the  use  of  laboratory  experiments  in 
our  discipline.  However,  where  experiments  are 
possible,  we  can  be  much  more  certain  about 
assessing  whether  some  political  treatment  had  a 
causal  (and  not  spurious)  effect  on  an  outcome  of 
interest.  Experiments  may  be  particularly  useful 
for  testing  formal  theories  of  politics,  since  those 
theories  are  themselves  highly  abstract  representa¬ 
tions  of  the  political  universe. 

Experiments  need  not  be  limited  to  a  labora¬ 
tory;  it  is  perfectly  possible  (and  now  with  modern 
technology  even  easier)  to  conduct  field  experi¬ 
ments.  The  move  from  the  laboratory  to  the  field 
increases  external  validity  at  the  cost  of  our  being 
less  certain  about  our  causal  inferences  (internal 
validity).  Perhaps  the  first  examples  of  this  came  in 
conjunction  with  surveys,  where  different  people 
could  randomly  receive  different  question  word¬ 
ings  or  question  orders.  It  was  easy  to  move 
beyond  this  to  providing  different  people  different 
information  randomly  (subject  of  course  to  ethical 
guidelines  on  dealing  with  human  subjects,  which 
do  not  permit,  at  a  minimum,  misleading  them). 
Perhaps  the  most  common  field  experiments  have 
to  do  with  the  effect  of  various  attempts  to  moti¬ 
vate  people  to  vote  and  to  what  extent  voting 
turnout  can  be  influenced  by  various  communica¬ 
tions  strategies. 

Field  experiments  are  now  also  common  in  the 
assessment  of  various  interventions.  Thus,  if  we 
want  to  know  if  certain  types  of  political  interven¬ 
tions  (say  national-level  monitoring  of  local  cor¬ 
ruption)  have  an  effect,  and  if  the  area  of  interven¬ 
tion  is  chosen  randomly  (because  the  state  cannot 
monitor  all  localities),  it  is  then  relatively  easy  to 
study  the  impact  of  the  anticorruption  interven¬ 
tion.  Of  course,  this  depends  on  the  willingness  of 
the  state  to  intervene  randomly,  something  they 
are  not  always  (or  often)  willing  to  do.  On  a  simple 


level,  we  can  often  study  things  such  as  educa¬ 
tional  reforms  by  comparing  students  who  were 
randomly  selected  for  the  reform  with  those  who 
applied  but  were  not  selected  in  a  lottery.  New 
programs  that  are  oversubscribed  often  choose 
participants  in  this  way.  Of  course,  if  we  simply 
compare  those  in  the  program  with  those  not  in 
the  program,  we  do  not  know  if  the  program,  or 
factors  that  led  people  to  choose  to  be  in  the  pro¬ 
gram,  led  to  the  observed  outcome,  and  so  no 
causal  inference  is  possible.  But  there  is  more  and 
more  demand  for  careful  evaluation  of  programs 
(such  as  aid  programs  sponsored  by  various  large 
foundations),  and  so,  this  type  of  approach  will 
become  more  and  more  common.  This  is  a  major 
step  for  applied  researchers  who  want  to  see 
whether  innovations  actually  work. 

A  somewhat  different,  but  related,  strategy  is  to 
keep  laboratory  control  but  move  the  laboratory 
from  the  research  university  to  real-world  settings. 
In  a  university  laboratory,  we  can  study  how 
undergraduates  in  research  universities  bargain.  In 
field  laboratories  (again  made  possible  by  techno¬ 
logical  innovations)  we  can  study  how  a  wider 
group  of  people  bargains.  Researchers  can  also 
embed  more  “real-world”  features  in  these  experi¬ 
ments.  Thus,  in  some  particularly  exciting  experi¬ 
ments  on  the  role  of  ethnicity  and  trust,  some 
people  bargained  with  people  of  their  own  ethnic 
group,  while  others  simply  bargained  with  a  ran¬ 
domly  selected  person.  Thus,  we  can  now  make 
advances  in  studying  group  trust  with  studies  that 
are  both  at  least  somewhat  externally  valid  while 
still  allowing  for  reliable  causal  inference. 

Perhaps  the  strongest  convergence  of  qualita¬ 
tive  and  quantitative  thinking  comes  in  what 
Donald  T.  Campbell  and  Julian  Stanley  called 
“quasi  experiments”  in  their  pathbreaking  1963 
book  on  research  design.  Unlike  experiments, 
some  external  force  (political  or  natural)  has 
“assigned”  one  group  to  a  treatment  and  another 
to  a  control.  Since  this  is  not  a  true  randomized 
experiment,  researchers  must  show  that  the  treat¬ 
ment  was  assigned  in  such  a  way  that  the  assign¬ 
ment  process  was  independent  of  the  outcome. 
For  example,  the  British  drew  boundaries  in 
Africa  as  if  they  were  random  (i.e.,  following 
various  geographical  markers);  thus,  one  ethnic 
group  might  be  divided  between  two  countries, 
and  one  can  then  see  whether  there  are  differences 
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in  the  behavior  of  the  same  ethnic  group  in  the 
two  countries.  In  the  actual  study  of  Daniel 
Posner,  there  were  two  ethnic  groups  split  across 
two  countries,  with  the  division  into  countries 
being  “as  if”  random;  Posner  could  then  see 
whether  political  and  social  rivalries  between  the 
two  groups  differed  as  a  function  of  the  larger 
political  structures  in  each  country.  We  are  seeing 
more  and  more  such  designs.  These  research 
designs  obviously  have  high  external  validity, 
though  they  lack  the  clear  ability  to  show  causal 
effects,  since  the  assignment  to  groups  was  not 
totally  random.  But  researchers  are  taking  advan¬ 
tage  of  “almost  random”  natural  assignments  to 
study  the  effects  of  changing  laws  (with  laws  cut¬ 
ting  natural  labor  markets  artificially,  the  effect 
of,  say,  a  change  in  the  minimum  wage  law  in  half 
the  labor  market  can  be  compared  with  what  hap¬ 
pened  in  the  same  labor  market  not  subject  to  the 
change).  This  approach  is  very  exciting,  though  of 
course  one  must  work  hard  to  show  that  the 
assignment  process  was  effectively  random.  From 
a  methodological  standpoint,  it  does  not  matter 
whether  the  data  collected  from  the  two  groups 
are  quantitative  or  qualitative,  and,  in  general, 
both  types  are  collected.  But  even  if  the  data  from 
the  two  groups  come  from  large  surveys,  we  are 
still  comparing  only  two  groups. 

A  related  research  design  that  brings  together 
both  quantitative  and  qualitative  researchers  is  the 
difference-in-difference  design.  In  a  simple  before- 
and-after  comparison,  we  do  not  know  if  the  inter¬ 
vention  between  the  observations  caused  the 
observed  change.  If  we  simply  compare  two  units, 
one  with  an  intervention  and  one  without,  we  do 
not  know  if  the  intervention  or  something  else 
caused  the  observed  difference.  The  difference-in¬ 
difference  design  asks  the  researcher  to  find  two 
similar  units  where  one  unit  had  an  intervention 
(e.g.,  a  change  in  a  law)  and  the  other  did  not.  We 
need  to  be  able  to  observe  both  units  both  before 
and  after  the  intervention  in  one  unit.  If  there  is  a 
bigger  difference  in  the  unit  with  the  intervention, 
then  we  have  evidence  that  the  intervention  had  a 
causal  impact.  As  before,  this  design  has  good 
external  validity,  but  it  does  not  rule  out  all  other 
causal  explanations.  And,  as  before,  we  can  com¬ 
pare  many  units,  leading  to  a  quantitative  design 
(so  long  as  the  units  were  similar  beforehand),  or  we 
can  do  a  simpler  two-cases  difference-in-difference 


design,  allowing  for  more  in-depth  analysis  of  the 
two  before-and-after  cases. 

The  study  of  causality  has  also  been  a  big  issue 
in  statistical  analysis.  But  even  without  statistical 
innovations,  all  empirical  researchers  have  clearly 
been  affected  by  new  thinking  about  using  good 
research  design  to  infer  causality. 

Statistics 

Multiple  regression  is  clearly  the  workhorse  of  the 
quantitative  political  scientist.  But  political  scien¬ 
tists  have  been  quick  to  use  related  methods  that 
better  fit  the  data  analyzed.  A  quarter  of  a  century 
ago  researchers  still  found  methods  such  as  logit 
and  probit  for  dichotomous  dependent  variables  to 
be  either  just  at  or  just  beyond  their  grasp.  Today, 
these  methods  are  commonplace.  Similarly, 
researchers  with  ordered  dependent  variables, 
event  count-dependent  variables,  or  length-of- 
time-dependent  variables,  typically,  know  how  to 
find  the  correct  methods  (and  all  commonly  used 
software  makes  it  easy  to  use  these  methods  in 
practice).  While  the  gains  here  are  often,  but  not 
always,  small,  they  usually  come  at  no  price,  so 
there  is  no  question  that  researchers  should  match 
their  choice  of  method  to  the  type  of  data  being 
analyzed.  These  issues,  typically  important  in 
cross-sectional  research,  are,  by  and  large,  now 
solved  problems. 

Similar  strides  have  been  made  in  the  analysis  of 
time  series.  While  in  the  past  the  important  issues 
of  time  series  (which  often  have  enormous  conse¬ 
quences  for  results)  were  ignored,  over  the  past 
quarter  of  a  century  the  discipline  has  become 
much  more  sophisticated.  Thus,  most  researchers 
analyzing  time  series  get  the  technical  details  right. 
Econometricians,  at  the  same  time,  have  made 
great  strides  in  studying  data  that  are  trending  (or, 
more  technically,  nonstationary).  Political  scien¬ 
tists  have  been  quick  to  pick  up  on  this,  and  we  see 
much  fewer  spurious  regressions.  So  while  the 
issues  here  are  more  complicated,  and  there  are 
still  open  issues  about  trending  series,  as  with 
cross-sectional  analyses,  time-series  analyses  in 
political  science  are  now  done  reasonably  well. 

In  comparative  politics,  we  have  data  that 
consist  of  time  series  observed  over  a  number  of 
countries:  time-series  cross-sectional  data.  Many 
articles  now  analyze  such  data,  and  the  discipline 
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has  become  good  at  analyzing  such  data.  Similarly, 
we  can  have  cross-sectional  surveys  with  repeated 
observations  on  each  individual:  panel  data.  The 
analysis  of  such  data  has  also  become  common¬ 
place,  with  the  appropriate  methods  often  used. 
Finally,  there  have  been  great  advances  in  data 
where  individuals  are  observed  over  multiple  units 
(e.g.,  common  surveys  in  different  countries):  mul¬ 
tilevel  data.  Again,  there  have  been  great  strides 
recently  in  the  analysis  of  such  data,  and  the  cor¬ 
rect  analysis  of  multilevel  data  has  also  become 
more  common. 

This  is  not  to  say  that  all  statistical  issues  have 
been  solved.  Most  current  methods  assume  that 
observations  in  one  unit  are  independent  of  obser¬ 
vations  in  other  units.  But  this  assumption  is 
clearly  false  for  political  science.  What  goes  on  in 
one  country  affects  its  neighbors  and  trading  part¬ 
ners;  a  dyad  going  to  war  must  have  impacts  on  a 
large  number  of  other  dyads.  Recently,  political 
methodologists  have  been  investigating  methods 
for  modeling  spatially  dependent  data,  and  great 
strides  are  being  made  in  this  area. 

Another  active  area  of  research  is  on  ecological 
data,  that  is,  data  where  interest  is  on  individuals 
but  only  aggregate  data  are  observed.  Political  sci¬ 
ence  is  rich  with  aggregate  data,  particularly  vot¬ 
ing  data  collected  at  the  precinct  level.  But,  often, 
interest  is  at  the  individual  level.  For  example,  who 
voted  for  the  National  Socialist  Party  in  Germany 
in  the  1930s?  We  have  lots  of  data  on  aggregate 
vote  at  the  precinct  level  and  some  knowledge  of 
the  social  characteristics  of  such  precincts. 
Obviously,  we  would  like  to  do  surveys,  but  these 
are  impossible  for  events  in  the  past.  Since  William 
Robinson’s  classic  1950  work  on  the  “ecological 
fallacy,”  we  have  known  that  it  is  not  simple  to 
make  inferences  about  individuals  from  data  col¬ 
lected  at  a  higher  level  of  aggregation.  However, 
recent  advances  have  shown  that  we  can  use  such 
data  to  gain  insights  into  individual  data  (and  also 
to  show  when  the  data  cannot  support  such 
insights). 

An  important  issue  that  is  currently  the  subject 
of  much  discussion  is  how  to  interpret  statistical 
results.  Political  science  has  been  dominated  by  the 
null  hypothesis-testing  framework,  where  we  cal¬ 
culate  the  probability  of  obtaining  the  data 
observed  if  the  null  hypothesis  (almost  always  that 
two  or  more  variables  are  unrelated)  is  correct.  If 


this  probability  is  low  enough,  we  “reject”  the  null 
hypothesis,  otherwise  we  do  not  reject  it.  This 
approach  is  highly  problematic,  since  rejecting  the 
null  hypothesis  does  not  imply  that  there  is  a 
strong  relationship  between  variables,  and  failing 
to  reject  the  null  hypothesis  does  not  mean  that 
there  is  no  relationship  between  variables. 

In  the  past  few  years,  there  has  been  much  dis¬ 
cussion  of  moving  to  a  Bayesian  paradigm.  Much 
of  this  is  driven  by  the  computing  power,  rather 
than  the  interpretive  possibilities,  made  possible  by 
a  Bayesian  approach.  Bayesian  interpretation 
assumes  we  know  something  about  the  world, 
expressed  as  a  statistical  “prior  distribution”  on 
some  parameters  of  interest.  This  prior  distribu¬ 
tion  is  combined  with  the  information  in  the 
data — the  “likelihood” — to  produce  a  posterior 
distribution.  Statements  about  the  parameters  of 
interest  can  be  made  based  on  this  posterior  distri¬ 
bution.  There  is  much  controversy  on  how  to  use 
prior  information  and  on  the  issue  of  different 
scholars  having  different  priors.  But  this  is  a  very 
active  area  of  current  research,  both  in  political 
methodology  and  beyond,  and  Bayesians  ideas  (as 
well  as  computational  methods)  are  making  strong 
inroads  in  political  science. 

Statistics  and  Causality 

Regression  and  its  maximum  likelihood  cousins 
(limited  dependent  variables,  event  counts,  event 
history,  and  time  series)  estimate  a  model  of  a 
dependent  variable  conditional  on  an  observed  set 
of  independent  variables;  these  independent  vari¬ 
ables  are  assumed  to  be  exogenous — that  is,  they 
are  determined  outside  the  system  being  modeled 
and  hence  can  be  taken  as  given.  For  pure  descrip¬ 
tion,  this  is  fine.  But  we  generally  want  to  make 
causal  inferences.  To  take  the  simplest  regression 
case  (and  all  holds  in  the  more  complicated  cases 
mentioned  above),  we  believe  the  data  were  gener¬ 
ated  for  unit  i  by  the  process 

Vi  =  P-T  +  s„ 

where  x  refers  to  either  a  single  independent  vari¬ 
able  or  a  vector  of  such  variables,  and  e  is  a 
standard  unknown  error  term.  Clearly,  (3  can  be 
interpreted  descriptively — that  is,  as  the  slope  of  a 
line  (or  plane)  that  best  fits  the  points.  But  can  it 
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be  interpreted  causally — that  is,  do  we  believe  that 
if  for  a  given  unit  x  increases  by  one  point,  then  y 
will  increase  by  points?  (Obviously,  we  will  have 
to  use  an  estimate  of  p,  but  this  discussion  holds 
even  if  we  know  the  value  of  p  for  sure.  This  is  not 
an  issue  of  estimation.) 

There  are  a  number  of  reasons  why  the  relation¬ 
ship  between  x  and  y  could  be  noncausal.  The 
simplest  is  that  there  is  some  other  variable,  z,  that 
causes  both  x  and  y.  For  example,  there  may  be  a 
good-sized  P  in  a  regression  of  spending  on  public 
goods  on  democracy,  but  it  may  be  that  it  is  really 
how  rich  a  country  is  that  is  causing  both  spending 
on  public  goods  and  democracy;  there  may  be 
no  causal  relationship  between  democracy  and 
spending  on  public  goods  in  the  sense  that  simply 
making  a  country  more  democratic,  but  keeping 
everything  else  the  same,  may  lead  to  no  increase 
in  spending  on  public  goods. 

Traditionally,  this  was  dealt  with  by  including  z 
in  the  regression  and  seeing  if  the  coefficient  on  x 
has  changed.  This  is  not  an  unreasonable  way  to 
proceed.  However,  it  can  be  problematic.  First,  it 
assumes  that  the  effect  of  x  and  z  on  y  are  linear 
and  additive.  For  just  the  two  variables,  this  means 
that  we  are  assuming  that 

Yi  -  7 *,•  +  8z,-  +  e„ 

where  x  and  z  are  now  scalars.  While  the  details 
are  a  bit  more  complicated,  this  procedure  esti¬ 
mates  the  effect  of  x  on  y  by  subtracting  off  from 
each  observation  8 zr  But  if  the  effect  of  x  on  y  var¬ 
ies  with  z,  or  if  the  linear  additive  model  is  other¬ 
wise  incorrect,  this  correction  is,  alas,  not  correct. 

This  is  not  the  only  problem.  Let  us  say  x  is 
binary  (democracy/nondemocracy)  and  let  z  be 
national  income.  Can  we  make  a  poor  autocracy 
comparable  (in  terms  of  public  goods  spending) 
with  a  rich  democracy  by  simply  adding  8z  (z  is 
income)  to  its  y  (spending  on  public  goods)?  Given 
that  there  are  few  rich  autocracies  or  poor  democ¬ 
racies,  this  approach  depends  a  lot  on  extrapola¬ 
tion  well  outside  the  data  and  so  depends  on  a 
strong  belief  that  the  linear  additive  assumptions 
are  correct. 

Recently,  researchers  have  proceeded  in  a  differ¬ 
ent  way,  at  least  for  the  binary  x  case.  For  each 
democracy,  they  attempt  to  find  one  or  more  auto¬ 
cracies  that  are  very  close  on  various  exogenous 


variables  that  might  influence  x  and  z  (what  Judea 
Pearl  has  called  backdoor  paths  between  x  and  y 
and  what  Paul  Rubin  has  called  confounders).  If 
one  has  eliminated  all  potential  “confounders”  by 
matching  them,  then  the  difference  in  means 
between  the  democracies  and  autocracies  will  give 
us  the  effect  of  democracy  on  y.  Of  course,  this 
means  that  the  various  confounders  must  be 
observable  and  measured  in  the  data  set  (and,  of 
course,  there  are  many  technical  issues  that  must 
be  resolved  by  the  researcher). 

What  if  we  cannot  match  all  the  democracies 
with  autocracies?  These  unmatched  cases  are  sim¬ 
ply  dropped  from  the  analysis.  Thus,  we  do  not 
have  to  extrapolate  well  beyond  the  data,  but  this 
limits  us  to  studying  causal  impacts  in  comparable 
cases;  thus,  for  example,  we  cannot  say  what 
would  happen  if  Denmark  were  to  become  an 
autocracy.  This  is  almost  certainly  the  right  degree 
of  modesty. 

This  matching  literature  is  undergoing  rapid 
development  at  the  current  time.  Issues  that  must 
be  studied  include  how  to  handle  continuous  (or 
multivalued)  xs  and  how  to  deal  with  studies 
where  we  cannot  focus  solely  on  one  independent 
variable  of  interest.  There  are  also  many  technical 
issues  that  are  continually  being  dealt  with,  such  as 
what  does  it  mean  for  two  cases  to  match  and  how 
many  and  which  cases  should  be  dropped  from  an 
analysis  because  they  do  not  match.  But  clearly, 
this  approach  is  often  superior  to  multiple  regres¬ 
sion  (and  when  multiple  regression  is  correct,  it 
provides  roughly  the  same  answer). 

Perhaps  more  important,  even  if  one  decides  to 
continue  to  run  regressions,  the  insights  of  the 
matching  and  causality  literatures  are  of  great 
value.  There  are  two  difficult  issues  in  multiple 
regression  on  which  statistics  give  few  insights: 
which  independent  variables  to  include  in  the 
regression  and  which  cases  should  be  studied.  The 
matching  approach  suggests  that  only  variables 
that  are  on  backdoor  paths  between  the  key  inde¬ 
pendent  variable  and  the  dependent  variable 
should  be  included  in  the  regression.  Equally 
important,  variables  on  front  door  paths,  where  x 
causes  z  and  y,  should  not  be  included  in  the 
regression.  Thus,  if  some  variable  is  a  consequence 
of  x,  if  we  include  it  in  the  regression,  we  may 
incorrectly  conclude  that  x  has  no  causal  impact 
on  y. 


Methodology  1565 


In  terms  of  which  cases  to  include  in  a  regres¬ 
sion,  the  matching  literature  tells  us  that,  for  any 
given  potential  causal  variable,  some  cases  give  us 
no  leverage  because  it  is  impossible  to  match  cases 
where  the  causal  variable  is  present  to  those  where 
it  is  absent.  This  is  often  not  a  problem  in  survey 
analysis  but  can  be  a  major  problem  in  the  study 
of  comparative  and  international  politics.  We 
often  analyze  a  group  of  countries  because  they 
belong  to  a  data-reporting  organization;  the  match¬ 
ing  approach  gives  a  more  principled  way  of  start¬ 
ing  to  think  about  which  cases  should  be  included 
in  an  analysis.  And,  just  as  important,  the  cases  to 
include  vary  with  the  causal  variable  being  studied. 
But,  as  with  simple  matching,  we  then  must 
remember  that  the  causal  effect  that  is  estimated  is 
a  function  of  which  cases  are  studied. 

The  matching  approach  (and  multiple  regres¬ 
sion)  assumes  that  we  can  observe  the  various 
confounders  that  impede  causal  inference.  But 
what  if  they  cannot  be  observed?  There  are  sev¬ 
eral  approaches  that  are  promising,  though,  as 
with  any  method,  they  must  be  used  with  care. 
One  is  to  model  what  is  known  as  selection  and 
the  other  is  to  use  what  are  known  as  instrumen¬ 
tal  variables.  These  deal  with  issues  of  selection 
bias  and  endogeneity. 

Selection  bias  is  a  critical  empirical  issue.  In 
applied  work,  if  we  want  to  see  if,  say,  some  new 
type  of  school  provides  better  outcomes,  and  we 
compare  outcomes  of  those  who  attend  the  new 
school  against  a  sample  from  other  schools,  we 
may  find  that  the  new  type  of  school  seems  to 
work  either  because  better  students  choose  to  go 
there  or  because  students  who  have  knowledge  of 
themselves  and  who  have  good  reason  to  believe 
that  the  new  type  of  school  will  work  for  them 
choose  the  new  school.  The  former  problem  is 
always  critical,  while  the  latter  is  critical  if  we  wish 
to  encourage  everyone  to  use  the  new  type  of 
school. 

This  problem  was  formalized  by  the  Nobel 
Prize-winning  economist  James  Heckman  in  the 
1980s.  He  was  interested  in  the  returns  to  educa¬ 
tion  (in  terms  of  wages)  of  women;  we  only 
observe  the  wages  of  women  who  choose  to  enter 
the  labor  market.  This  may  lead  to  underestimat¬ 
ing  the  effect  of  education  on  women’s  wages, 
since  women  with  less  education  may  only  enter 
the  labor  market  if  they  have  some  reason  to  know 


that  they  will  do  well  in  that  market.  Alas,  such 
reasons  are  usually  not  observable  in  a  data  set. 

In  political  science,  we  may  be  interested  in  the 
effect  of  being  involved  in  a  scandal  on  the  elec¬ 
toral  success  of  incumbents  running  for  reelection. 
But  those  who  see  little  chance  of  reelection  may 
choose  not  to  run,  and  those  who  were  involved  in 
a  scandal  but  chose  to  run  anyway  may  have  pri¬ 
vate  reasons  to  know  that  they  are  likely  to  do 
well.  Are  campaign  ads  effective?  Perhaps  people 
who  already  like  a  candidate  are  more  likely  to  see 
that  candidate’s  ads. 

Similarly,  in  international  relations,  if  we  only 
study  the  outcomes  of  wars,  many  nations  that  are 
militarily  weak  may  simply  choose  not  to  fight; 
thus,  the  weak  will  fight  only  if  they  have  some 
private  information  that  they  have  a  chance  of 
winning,  and  so  we  may  underestimate  the  effect 
of  military  strength  on  winning  a  war.  Similarly, 
does  international  mediation  actually  help  solve 
conflicts?  Perhaps  mediators  only  take  on  their 
task  when  they  think  success  is  likely.  In  compara¬ 
tive  politics,  autocrats  who  believe  that  they  can 
remain  in  power  if  they  liberalize  are  perhaps  more 
likely  to  liberalize.  These  are  just  a  few  examples, 
but  selection  bias  is  pervasive  in  observational 
studies. 

One  solution  is  to  match  those  who  select  some 
treatment  (war,  watching  an  ad,  etc.)  with  those 
who  chose  not  to  do  so.  But  if  the  data  set  does  not 
contain  enough  information  to  match  on  critical 
variables  (nations  that  are  militarily  weak  but  have 
other,  unobserved,  private  reasons  to  believe  that 
war  is  in  their  interest),  this  approach  does  not 
work.  Heckman  suggested  a  two-equation  model, 
one  for  selection  and  one  for  the  outcome-given 
selection,  with  the  errors  in  the  two  equations  cor¬ 
related  (so  that  nations  that  should  not  have  gone 
to  war  but  did  for  unobservable  [error  term]  rea¬ 
sons  will  also  be  more  likely  to  do  better,  again  for 
reasons  that  are  in  the  error  term).  Note  also  that 
the  various  research  design  issues  (experiments, 
quasi  experiments,  and  such)  can  also  be  a  critical 
tool  for  dealing  with  selection  bias.  But  even  if 
there  is  no  statistical  solution  available,  and  we 
are  not  lucky  enough  to  observe  a  good  quasi- 
experimental  situation,  understanding  the  nature 
of  the  problem  is  critical  for  causal  inference. 

The  other  issue  is  endogeneity.  Does  high  income 
lead  to  good  institutions  or  do  good  institutions 
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lead  to  high  income?  Do  voters  who  like  some 
candidate  assume  his  or  her  position  to  be  close  to 
theirs  or  does  closeness  on  issues  lead  voters  to 
choose  that  candidate?  Disentangling  whether  x 
causes  y  or  y  causes  x  is  critical  to  political  science; 
and  observational  data  on  cross-sections  cannot 
help  answer  this  question,  since,  as  is  well  known, 
association  tells  us  nothing  about  causation.  For 
the  above  two  examples,  we  would  observe  the 
same  exact  data  regardless  of  the  causal  process. 
The  research  design  tools  discussed  previously  can 
sort  out  these  issues;  if  x  changed  for  some  reason 
external  to  the  system  (perhaps  because  of  some 
natural  event),  then  we  could  study  whether  x 
causes  y  and  is  not  simply  associated  with  it. 

In  the  1950s,  economists  associated  with  the 
Cowles  Foundation  at  Yale  thought  they  could 
solve  the  problem  by  estimating  a  series  of  equa¬ 
tions,  one  for  x  and  one  for  y.  Of  course,  these  had 
to  be  estimated  jointly,  and  this  technique  came  to 
be  known  as  the  estimation  of  simultaneous  equa¬ 
tions.  Interest  in  this  approach  waned  as  it  became 
obvious  that  we  simply  lacked  strong  enough  the¬ 
ory  to  allow  the  estimation  of  such  equations.  This 
theory,  at  a  minimum,  was  necessary  to  tell  us  that 
there  was  some  exogenous  z  that  affected  y  and 
not  x  and  some  other  exogenous  w  that  affected  x 
and  not  y.  At  least  in  political  science,  it  seemed 
hard  to  find  such  exogenous  variables  with  such 
asymmetrical  effects. 

Interest  in  part  of  this  approach — instrumental 
variables — started  to  reappear  in  the  1990s  and 
now  has  become  an  extremely  active  area  of 
research.  The  basic  idea  is  that  we  are  interested  in 
the  causal  impact  of,  say,  economic  growth  on 
having  a  civil  war.  However,  in  a  cross-sectional 
study,  we  would  worry  that  civil  wars  hurt  eco¬ 
nomic  growth.  The  instrumental-variable  approach 
is  to  find  some  exogenous  variable  that  affects 
growth  but  only  affects  civil  wars  through  its  link 
with  economic  growth.  In  an  ingenious  study,  Ted 
Miguel,  Shanker  Satyanath,  and  Ernest  Sergenti 
decided  that,  for  southern  Africa,  rainfall  would  be 
a  good  instrument.  They  had  to  convince  them¬ 
selves  that  rainfall  affected  the  outbreak  of  civil 
war  only  through  economic  growth  (since  it  is 
clear  that  rainfall  is  exogenous).  The  method  of 
instrumental  variables,  in  its  simplest  form,  then 
regressed  both  civil  wars  and  economic  growth  on 
rainfall  (both  regressions  are  fine  since  rainfall  is 


exogenous).  Having  the  effect  of  rainfall  on  both 
variables,  we  can  divide  these  effects  and  then 
obtain  a  good  estimate  of  the  impact  of  economic 
growth  on  civil  wars  without  worrying  about  the 
reverse  direction  of  causality. 

Research  on  instrumental  variables,  both  on  the 
theory  on  when  they  are  useful  and  also  in  various 
applications,  is  one  of  the  most  vibrant  current 
research  areas  in  political  science.  It  is  hard,  but 
not  impossible,  to  find  good  instruments.  There  is 
much  current  research  on  what  properties  a  good 
instrument  should  possess  and  if  and  how  empiri¬ 
cal  researchers  can  test  whether  a  given  instrument 
is  a  good  one. 

Conclusion 

Political  methodology  was  not  even  considered  a 
field  for  research  30  years  ago.  The  world  has 
changed  remarkably.  The  American  Political 
Science  Association  has  subfield  groups  for  both 
quantitative  and  qualitative  methodology;  these 
groups  are  among  the  largest  such  groups  in  the 
association.  The  past  few  years  have  seen  a  huge 
increase  in  specialized  short  courses  in  methodol¬ 
ogy,  both  quantitative  and  qualitative,  worldwide. 
Almost  all  departments  of  political  science  now 
require  methods  training  for  all  of  their  PhD 
students. 

Thirty  years  ago,  the  focus  was  on  statistical 
inference.  While  many  still  focus  on  that  issue,  the 
advances  over  the  past  30  years  have  made  the 
estimation  of  many  complicated  models  quite  easy. 
Today,  there  is  an  incredible  revolution  in  data 
collection  and  measurement,  and  renewed  interest 
in  research  design,  especially  as  it  relates  to  making 
causal  inferences.  There  has  been  huge  growth  in 
thinking  about  experimental  and  quasi-experimen- 
tal  approaches  to  studying  critical  questions. 

While  qualitative  and  quantitative  researchers 
often  go  their  separate  ways,  there  is  surely 
renewed  interest  in  what  these  two  approaches 
have  in  common  methodologically  (and  where 
they  appropriately  differ).  In  short,  political  meth¬ 
odology  has  been  one  of  the  great  success  stories  of 
our  discipline  over  the  past  30  years. 

Nathaniel  Beck 
New  York  University 
New  York  City,  New  York,  United  States 
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Migration  is  a  topic  that  is  characterized  by  an 
interdisciplinary  approach,  both  theoretically  and 
empirically.  It  crosses  several  disciplines,  includ¬ 
ing  demography,  geography,  sociology,  anthro¬ 
pology,  economy,  history,  and  political  science. 
Social  scientists  do  not  study  migration  from  a 
shared  paradigm  but  from  a  variety  of  competing 
theoretical  viewpoints  that  are  fragmented  across 
disciplines.  Migration  emerged  as  a  field  of 
research  in  political  science  in  the  mid-20th  cen¬ 
tury,  and  in  the  1980s  to  1990s,  it  had  already 
become  a  major  research  topic.  It  is  now  one  of 
the  major  political  issues  facing  the  21st  century. 
It  has  taken  time  for  migration  to  acquire  legiti¬ 
macy  as  a  research  topic  in  the  discipline.  Several 


factors  contributed  to  create  this  delay,  among 
which  are  the  absence  of  a  political  theory  of 
international  migration,  a  lack  of  cohesion 
between  macro-  and  microanalysis,  and  the  fact 
that  the  problematics  raised  by  migrations  are 
constantly  evolving. 

The  term  migration  refers  to  human  mobility: 
A  migrant  is  someone  who  has  left  his  or  her 
country  of  birth  and  who  is  living  in  another 
country.  A  migrant  differs  from  a  foreigner,  the 
latter  being  defined  juridically  as  a  “nonnational.” 
There  are  internal  and  external  migrations.  Most 
international  migrants  are  foreigners,  and  most 
foreigners  are  migrants,  but  not  automatically  so. 
Today,  we  are  facing  the  second  wave  of  world 
migration  since  the  mid-19th  century,  when  mil¬ 
lions  of  Europeans  left  Europe  for  the  New  World 
to  find  work  (United  States,  Canada,  Australia, 
New  Zealand,  Latin  America),  to  colonize  and 
conquer  (Asia  and  Africa),  for  trade,  or  as  mis¬ 
sionaries.  The  number  of  migrants  (200  million, 
3%  of  the  world  population)  has  tripled  since 
1970,  and  globalization,  the  transnationalization 
of  networks,  and  push-and-pull  factors  have  made 
individuals  mobile  internationally.  Together,  these 
factors  challenge  national  borders,  the  sovereignty 
of  states,  and  the  notion  of  citizenship,  which  in 
turn  affects  international  relations  (IR)  and  politi¬ 
cal  identities. 

Most  political  science  research  that  focuses  on 
migration  tends  to  deal  with  either  the  role  of  the 
nation-state  in  controlling  its  borders,  the  making 
of  immigration  policy,  the  impact  of  migration  on 
sovereignty  (including  foreign  and  national  secu¬ 
rity),  or  issues  of  citizenship  (dissociation  between 
nationality  and  citizenship,  identity,  allegiances, 
and  political  inclusion).  Such  research  draws  on 
several  disciplines,  including  IR,  public  policies, 
and  the  sociology  of  political  behavior.  Research 
conducted  within  the  field  of  migration  studies 
takes  two  main  directions:  on  the  one  hand,  the 
study  of  flows  (migration  policies  and  compara¬ 
tive  and  international  analysis),  which  focuses  on 
macrolevel  issues,  and  on  the  other  hand,  the 
study  of  stocks  (on  living  together  and  political 
incorporation),  which  focuses  mainly  on  micro¬ 
level  issues.  Placing  emphasis  on  the  existence/the 
study  of  social  ties,  some  political  scientists  have 
proposed  a  “mesolevel”  of  analysis.  Some  prob¬ 
lematics  remain  unexplored,  and  gaps  in  the 
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research  include  topics  such  as  emigration  policies 
(rules  of  exit),  the  “diplomacy  of  migration”  led 
by  emigration  countries,  or  the  root  causes  of  the 
establishment  of  regional  or  world  migration  sys¬ 
tems.  Many  political  scientists  have  studied  the 
respective  impact  of  political  decision  making  and 
of  other  competing  factors,  such  as  economic  lib¬ 
eralization  and  securitization,  on  the  opening  and 
closing  of  borders.  Others  have  focused  on  the 
relation  between  citizenship  and  the  political 
inclusion  of  migrants.  However,  only  a  few  have 
analyzed  the  link  between  immigration  flows, 
incorporation  (integration)  policies,  and  political 
behavior. 

If  migration  has  become  an  object  of  analysis 
encompassing  local,  regional,  national,  and  inter¬ 
national  levels,  it  has  also  weakened  the  two  cen¬ 
tral  pillars  of  the  international  political  system: 
(1)  sovereignty  and  (2)  citizenship.  Migration  brings 
anomie  to  the  national  and  international  realms, 
where  the  state  would  want  to  remain  the  exclusive 
actor.  Nation-states  are  currently  threatened  from 
above  by  globalization  and  from  below  by  multicul- 
turalism.  In  the  following  sections,  we  will  analyze 
to  what  extent  migration,  as  it  blurs  the  frontiers 
between  sovereignty  and  citizenship,  undermines 
the  external  and  the  internal  political  order. 

Migration  and  Sovereignty 
Difficulties  of  Border  Control 

Despite  the  development  of  transnationalism, 
mainstream  scholars  have  continued  to  view  the 
state  as  a  unitary  and  rational  actor  and  to  place  it 
at  the  center  of  their  analyses.  The  first  question 
that  is  raised  by  migration  in  relation  to  the  issue 
of  sovereignty  is  linked  to  border  control.  Migration 
challenges  the  Weberian  notion  of  sovereignty,  in 
that  it  questions  the  relation  between  population, 
territorial  space,  and  monopoly  of  power.  In  the 
past,  many  states  viewed  migration  as  a  break  of 
allegiances,  and  they  fought  against  nomadism  to 
control  their  territories.  Today,  several  factors 
undermine  state  sovereignty,  notably  the  inability 
of  states  to  control  the  flow  of  illegal  immigrants 
across  their  borders,  the  capacity  of  states  to  rule 
migrations  through  regional  agreements  and  inter¬ 
national  rules,  and  the  fact  that  heterogeneous 
populations  and  minorities  now  contribute  to 
define  national  identity.  In  a  context  where  the 


vast  majority  of  the  world  is  sedentary,  migration 
appears  as  an  exception  that  has  rapidly  grown 
due  to  fluid  exchanges  (transportation,  and  infor¬ 
mation  facilities  and  networks).  The  right  to 
mobility  is  itself  becoming  a  human  value.  The 
main  difference  between  the  first  world  migration 
wave  of  1880-1920  and  the  second  one  of  1980- 
2000s  lies  in  the  border  control  of  immigration 
and  emigration  countries.  A  century  ago,  it  was  as 
difficult  for  one  to  leave  one’s  own  country 
(because  borders  were  closed  from  inside)  as  it  was 
easy  for  one  to  enter  another  country  (because 
there  were  at  that  time  only  few  controls  and  pass¬ 
ports).  The  population  was  a  state’s  main  resource, 
and  its  movement  was  either  prohibited  or  strictly 
controlled.  On  the  other  hand,  in  countries  of 
arrival,  new  populations  were  welcome  to  work 
and  to  settle.  Now,  the  situation  has  reversed,  even 
in  countries  such  as  the  former  USSR,  China,  and 
most  Third  World  countries,  where  the  change 
took  longer  to  settle.  Opened  borders  are  now 
commonplace  in  countries  of  departure,  which  see 
emigration  as  a  good  thing  for  them.  On  the  other 
hand,  it  has  become  very  difficult  to  enter  coun¬ 
tries  that  are  attempting  to  fight  undesired  immi¬ 
gration  flows. 

Markets  and  Mobility  Rights 

Liberal  democracies  in  particular  have  difficul¬ 
ties  in  controlling  immigration.  The  notion  of 
sovereignty  and  its  emphasis  on  upholding  politi¬ 
cal  order  and  a  feeling  of  political  community 
within  borders  is  challenged  by  a  liberal  commit¬ 
ment  to  the  free  flow  of  goods,  money,  and  people 
across  borders.  Migrations  exacerbate  the  rela¬ 
tions  between  states  and  markets:  Sometimes  the 
state  is  brought  back  in,  and  sometimes  the  gap 
between  actual  migration  flows  and  migration 
policies  deepens.  James  Hollifield  has  called  this 
the  “liberal  paradox.”  Pressure  on  immigration 
policy  comes  from  numerous  organized  and  con¬ 
tradictory  interest  groups  such  as  political  parties, 
trade  unions,  employers,  associations  of  human 
rights,  and  countries  of  origin.  However,  it  is  dif¬ 
ficult  to  conciliate  control  in  democratic  societies. 
Why,  then,  do  states  “risk”  migration?  They  risk 
migration  for  economic,  demographic,  and  human 
rights  reasons  (asylum).  Why,  on  the  contrary,  do 
states  close  their  borders?  The  states  close  their 
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borders  to  maintain  internal  political  order  and  a 
welfare  system  as  well  as  to  accommodate  public 
opinion.  As  noted  by  Myron  Weiner,  in  a  global¬ 
ized  world  where  states  fail  to  fully  manage  migra¬ 
tion  and  in  which  individuals  are  the  main  actors, 
the  increase  of  international  migration  is  viewed  as 
a  threat  to  security  and  stability.  Although  interna¬ 
tional  and  national  rights  set  out  a  framework 
within  which  states  control  immigration,  most 
democratic  states  eventually  end  up  both  failing  to 
control  borders  and  violating  human  rights  prin¬ 
ciples  and  international  conventions.  At  the 
European  level,  and  through  the  use  of  buffer 
zones,  states  are  developing  regional  migratory 
systems  as  well  as  externalized  and  remote  con¬ 
trols  of  borders.  Neighboring  countries — which 
sometimes  tend  to  serve  as  departure  or  transit 
countries — are  becoming  border  guards.  However, 
repression  does  not  prove  dissuasive,  and  states 
fail  to  fully  control  population  movements.  In  fact, 
pull  factors  and  networks  are  revealed  to  be  stron¬ 
ger  than  the  public  policies  of  immigration  coun¬ 
tries.  Some  political  scientists  have  formulated 
hypotheses  about  the  political  dimension  of  inter¬ 
national  migration.  In  particular,  as  they  noticed 
that  the  “main  gates”  of  entrance  were  closing  and 
the  “back  doors”  were  opening  to  illegal  immi¬ 
grants,  some,  such  as  Aristide  Zolberg  and  col¬ 
leagues  (1989),  developed  the  idea  that  controlling 
the  territory  actually  blurs  the  boundaries  between 
the  internal  and  the  international  political  order. 
When  migration  for  economic  purposes  is  closed, 
illegal  immigrants  seek  asylum,  citing  environmen¬ 
tal  concerns  or  family  links  as  reasons  for  migra¬ 
tion.  As  a  result,  they  contribute  to  blur  traditional 
categories  of  migrants.  Such  a  mix-up  is  also 
caused  by  the  fact  that,  depending  on  circum¬ 
stances,  illegal  immigrants  fit  within  various 
migrant  categories.  Even  when  poverty  level  and 
demographic  pressure  are  high,  they  are  able  to 
build  networks,  without  which  there  is  no  migra¬ 
tion.  They  are  also  very  active  in  campaigning  for 
the  recognition  of  the  right  to  mobility  as  a  “right 
to  have  rights,”  as  defined  by  philosophers  such  as 
Immanuel  Kant,  Hannah  Arendt,  and  Zygmunt 
Bauman.  Thereby,  the  right  to  mobility  is  seen  as 
a  human  right  in  the  process  of  being  consecrated. 
An  idea  of  migrations  without  borders  is  emerging 
that  would  be  seen  as  a  world  public  good  and  that 
could  be  managed  through  global  governance  on 


the  legal  basis  set  out  in  international  rules,  such  as 
the  Universal  Declaration  of  Human  Rights  of 
1948  and  the  United  Nations  Convention  on  the 
Rights  of  All  Migrant  Workers  and  Their  Families 
of  1990.  In  such  a  system,  multilateral  decision 
making  would  guarantee  well-balanced  mobility. 
The  human  rights  regime  that  is  currently  taking 
shape  goes  beyond  states.  It  is  a  postnational 
regime  within  which  international  conventions 
will  grant  migrants  an  international  legal  status. 

Transnational  Networks  Another  factor  that  is 
currently  challenging  the  sovereignty  of  the  nation¬ 
state  is  the  growing  importance  of  social  networks 
that  link  sending  and  receiving  countries  together. 
States  are  no  longer  the  sole  actors  of  international 
migrations:  firms,  individuals,  and  transnational 
communities  bypass  the  regulatory  authority  of 
sovereign  states.  While  the  world  is  going  through 
a  process  of  individualization,  states  are  experienc¬ 
ing  deterritorialization.  Transnational  networks 
challenge  the  international  order  of  nation-states. 
Not  being  able  to  control  migrants,  nation-states 
are  not  the  most  legitimate  actors  of  IR  anymore. 
The  development  of  transnational  economies  led 
to  the  creation  of  transnational  social  links,  and 
increased  communications  rendered  national 
migration  policies  nearly  obsolete.  These  policies 
are  now  challenged  by  networks  and  by  migrants 
themselves.  Alejandro  Portes  developed  the  notion 
of  “transnational  communities”  to  explain  the 
logic  of  international  migration  flows  and  patterns 
of  immigrant  incorporation.  While  the  interna¬ 
tional  economy  pushes  for  borders  to  be  more 
open,  internal  policy  pushes  for  their  closure.  As 
Saskia  Sassen  has  observed,  globalization  has  led 
to  the  rise  of  a  structural  demand  for  foreign  labor 
and  to  a  loss  of  border  control.  Demand  for  for¬ 
eign  labor  is  “structurally  embedded”  (Wayne 
Cornelius,  1994)  in  the  more  advanced  industrial 
societies  that  cannot  function  without  a  cheap 
labor  force.  As  a  result,  little  space  is  left  for  states 
and  national  regulations,  and  the  state  is  marginal¬ 
ized  in  its  efforts  to  structure  international  migra¬ 
tion.  Transnational  networks  and  transnational 
communities  also  contribute  to  strengthening  eco¬ 
nomic  globalization:  Through  remittances  ($300 
billion  in  2007),  migrants  become  the  main  actors 
of  development  in  their  country  of  origin;  ethnic 
entrepreneurship  creates  gray  areas  where  goods, 
people,  and  money  can  circulate  freely;  family 


1570  Migration 


reunification  and  transnational  marriages  represent 
half  of  the  legal  entries  in  most  immigration  coun¬ 
tries,  and  ethnic  minorities  are  a  new  factor  of 
transnational  citizenship.  Diasporas  and  quasi  dia- 
sporas  collectively  contribute  to  instill  more  fluid¬ 
ity  in  the  world  of  nation-states  and  closed  borders. 

Blurred  Boundaries  Between  Internal 
and  External  Political  Order 

As  Zolberg  and  his  colleagues  note,  migration 
blurs  the  distinction  between  the  internal  and  the 
external  political  order.  The  theory  of  globaliza¬ 
tion,  which  takes  inspiration  from  the  sociology  of 
IR,  introduces  with  migration  a  bottom-up  analy¬ 
sis  of  the  social  texture  of  IR.  It  is  well  recognized 
that  external  factors  may  have  an  impact  on  inter¬ 
nal  political  orders.  For  example,  conflicts  abroad 
affect  refugees,  displaced  persons,  and  economic 
and  demographic  gaps;  and  environmental  issues 
affect  population  movements.  Receiving  and  send¬ 
ing  countries  often  settle  bilateral  or  multilateral 
agreements  that  can  involve  trading  raw  materials 
or  labor  force  against  legalization  procedures,  as 
was  the  case  between  Mexico  and  the  United 
States.  Along  the  same  lines,  France  has  been 
granting  residence  cards  to  elites  of  poor  countries 
and  promised  development  policies  in  exchange 
for  the  repatriation  of  illegal  migrants.  Some  emi¬ 
gration  countries  such  as  Morocco,  Turkey,  and 
Mexico  are  now  pursuing  a  “diplomacy  of  migra¬ 
tion”  with  rich  neighboring  countries:  Dual  nation¬ 
als,  who  became  voters  and  were  sometimes 
elected  in  immigration  countries,  are  being  used  to 
influence  and  sometimes  to  intrude  in  some  deli¬ 
cate  matters  such  as  the  practice  of  Islam  within 
host  societies,  plural  allegiances,  or  multiple 
belongings  of  new  citizens.  The  security  approach 
to  migration  has  contributed  to  an  increasing  mix 
of  internal  and  external  politics,  leaving  little  space 
for  state  and  national  regulations  while  favoring 
regional  agreements  such  as  those  addressing  the 
security  of  European  borders.  Migration  previously 
dealt  with  “low  politics,”  but  it  is  now  regulated 
through  “high”  IR  politics,  which  traditionally 
deals  with  war  and  peace,  national  security,  foreign 
policy,  and  regional  equilibriums.  However,  pres¬ 
sures  to  put  international  migration  higher  on  the 
IR  agenda  did  not  succeed  before  the  late  1990s. 
Some  IR  theorists  who  were  initially  interested  in 


international  security  questions  turned  to  issues 
related  to  population  control  and  terrorism  instead 
and  began  to  assert  that  international  population 
movements  and  transnational  networks  can  have  a 
dramatic  impact  on  the  security  and  sovereignty  of 
states. 

Migration  and  Citizenship 

Migrants  as  Political  Actors  in  a 
Multidimensional  Political  Space 

With  migration  in  the  foreground,  citizenship 
cannot  be  understood  in  its  traditional  context  any 
more.  Initial  research  introducing  migration  as  a 
research  object  of  political  science  focused  on 
migrants  as  future  citizens  (from  political  accul¬ 
turation  to  political  participation);  on  migrants 
who  were  mostly  concerned  with  politics  in  their 
country  of  origin,  either  because  they  were  refu¬ 
gees  or  because,  on  the  contrary,  they  hoped  to 
return  to  their  home  country;  or  on  migrants  as 
politically  alienated  individuals,  eventually 
involved  in  social  mobilizations  around  housing 
issues  or  at  work  (politicization  of  nonpolitical 
matters).  Such  research  was  rooted  altogether  in 
sociology  (theories  of  incorporation),  economics 
(labor  market  analysis  of  migrants  as  short-term 
guest  workers),  and  law  (prohibition  of  any  politi¬ 
cal  expression  for  migrants  coming  from  dictator¬ 
ships  or  from  former  colonized  territories).  In  the 
workplace,  most  migrants  were  integrated.  They 
progressively  gained  equal  representation  in  firms 
and  trade  unions,  putting  forward  claims  for  free¬ 
dom  of  association  and  for  the  right  to  political 
representation.  In  Northern  European  countries, 
where  they  were  granted  local  political  rights  while 
still  being  denied  full  citizenship,  migrants  became, 
in  Tomas  Hammar’s  term,  denizens.  Albert 
Hirschman’s  (1970)  treatise  Exit,  Voice  and 
Loyalty  illustrates  this  situation  well. 

Evolution  of  the  Concept  of  Citizenship 

The  notion  of  citizenship  became  part  of  the 
political  analysis  of  migration  in  the  1980s.  Since 
then,  migration  progressively  altered  the  very  con¬ 
tent  of  citizenship,  most  notably  through  dissociat¬ 
ing  citizenship  from  nationality.  Studies  of  the 
long-term  settlement  of  immigrants  in  Canada  and 
Australia,  and,  through  comparative  approaches, 
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in  European  countries,  examined  how  far  migra¬ 
tion  was  altering  the  content  of  citizenship  and  to 
what  extent  citizenship  was  raising  new  issues  in 
the  field  of  migration.  Field  research  showed 
that  the  emergence  of  migrants — mostly  second- 
generation  migrants — in  the  political  sphere  raised 
the  questions  of  “membership”  and  “belonging” 
in  immigration  countries  and  of  a  “double  pres¬ 
ence”  through  immigration.  Some  years  later,  the 
idea  of  a  “double  presence”  affecting  citizenship 
(through  double  citizenship,  multiple  allegiances, 
and  plural  citizenship)  could  indeed  be  substituted 
for  Abdelmalek  Sayad’s  concept  of  a  “double 
absence”  of  migrant  workers  from  both  their 
home  and  host  countries.  As  new  nationals  of 
immigrant  origin  take  part  in  the  local  civic  life, 
citizenship  is  seen  more  as  a  participative  matter 
rather  than  being  limited  to  the  act  of  voting.  This 
constitutes  a  return  of  the  concept  of  citizenship  as 
was  understood  during  the  French  Revolution  of 
1789  (e.g.,  in  the  Constitution  of  1793,  participa¬ 
tion  in,  and  adhesion  to,  the  new  ideals  were  con¬ 
sidered  more  important  than  the  question  of 
whether  one  was  or  was  not  a  national). 

Dissociation  Between  Nationality, 

Citizenship,  and  Dual  Citizenship 

The  dissociation  between  nationality  and  citi¬ 
zenship  is  the  most  important  issue  that  has  been 
raised  by  increasing  migration  and  integration 
among  European  states.  The  granting  of  local  vot¬ 
ing  rights  to  all  foreigners  in  some  Northern 
European  countries  followed  by  the  granting  of 
local  voting  rights  to  all  European  citizens  living  in 
a  different  European  country  (European  citizen¬ 
ship)  showed  that  one  can  be  a  citizen  without 
being  a  national  of  a  country,  as  a  result  of  involve¬ 
ment  in  local  political  affairs.  In  the  past,  some 
nationals  were  not  considered  full  citizens  (colonial 
indigenous  people,  women,  those  condemned  to 
prison  or  to  death,  the  disabled,  and  soldiers). 
Approaches  to  citizenship  have  evolved  through 
time,  and  citizenship  and  nationality  have  only 
recently  been  associated  with  the  rise  of  the  nation¬ 
state.  The  newly  found  importance  of  the  local 
manifests  itself  through  the  emergence  of  the 
notion  of  “residence  citizenship”  and  through  the 
use  of  jus  soli  by  immigration  countries  that  needed 
to  include  the  new  citizens.  In  France,  the  Ubeur 


movement”  added  meaning  to  the  classical  under¬ 
standing  of  citizenship  when  it  introduced  the  idea 
of  “new  citizenship.”  “New  citizenship”  is  a  mix  of 
localism,  grassroots  participation  in  inner-city  com¬ 
munities,  multiple  allegiances,  and  respect  for 
republican  values  while  expressing  collective  claims 
to,  and  identification  with,  ethnicity  and  Islam.  The 
idea  is  that  one  can  be  a  citizen  while  being  cultur¬ 
ally  different  from  other  citizens.  In  the  1990s, 
most  European  countries  faced  widespread  debates 
concerning  the  reform  of  their  nationality  codes. 
States  reacted  by  introducing  elements  of  jus  soli 
into  their  nationality  codes  that  traditionally  rested 
on  jus  sanguinis.  Rogers  Brubaker  theorized  the 
relations  between  immigration  and  citizenship  and 
showed  that  naturalization  law  and  policies  affect 
the  rate  at  which  newcomers  are  politically  incor¬ 
porated.  Jus  sanguinis  immigration  countries  that 
were  most  reluctant  to  open  their  nationality  codes 
to  newcomers  now  offer  some  opportunities  to 
include  them  as  future  citizens.  In  most  cases, 
degrees  of  political  rights  and  of  incorporation  can 
be  pictured  as  a  series  of  concentric  circles:  At  the 
center  are  national  citizens  who  are  granted  the 
whole  range  of  political  rights,  and  at  the  outskirts 
are  illegal  immigrants  and  asylum  seekers  who 
have  no  means  of  political  action  apart  from  hold¬ 
ing  hunger  strikes  or  other  protests  on  the  streets. 
Even  in  a  country  such  as  Japan,  which  refuses  to 
change  its  criteria  for  Japanese  identity  in  order  to 
preserve  “racial  purity,”  the  nationality  code  now 
includes  various  hierarchized  categories  of  migrants. 
The  transnational  nature  of  migration  has  also  led 
to  situations  of  “dual  citizenship,”  with  jus  sangui¬ 
nis  being  the  rule  in  the  country  of  origin  of  first- 
generation  migrants  and  jus  soli  being  applied  in 
the  country  of  birth  of  following  generations.  With 
such  massive  increases  in  migration,  this  concept  of 
dual  citizenship  will  develop  throughout  the  world. 
New  realities  further  stem  from  this  context: 
Individuals  now  have  plural  allegiances  to  several 
political  spheres;  countries  of  origin  attempt  to 
increase  their  political  power  by  using  their  own 
nationals  who  live  abroad  and  who  became  citizens 
in  the  host  country  as  a  potential  collective  force; 
diasporic  states  use  displaced  persons  and  refugees 
living  abroad  as  part  of  an  “influence  diplomacy.” 
New  citizens  (e.g.,  Latinos  in  the  United  States, 
Germans  of  Turkish  origin  in  Germany,  French 
from  the  so-called  diversity  in  France,  and 
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Commonwealth  citizens  in  the  United  Kingdom) 
are  cherished  by  most  political  parties,  which  are 
always  in  need  of  new  voters  to  extend  their  politi¬ 
cal  base.  Indeed,  these  new  citizens  do  not  necessar¬ 
ily  form  a  voting  bloc  based  on  ethnicity  or  other 
common  identity.  In  that  sense,  Islam  has  somehow 
introduced  new  types  of  relationships  between  citi¬ 
zenship  and  religion,  leading  to  a  plural  citizenship, 
related  to  two  different  nation-states,  that  created 
an  alternative  to  voting  on  an  ethnic  or  on  a  reli¬ 
gious  basis. 

Multicultural  and  Transnational  Citizenship 

Citizenship  can  no  longer  be  understood  in  the 
traditional  context  of  a  unified  culture.  Some 
countries  such  as  Canada  and  Australia,  where 
citizenship  used  to  be  based  on  the  myth  of 
national  homogeneity,  have  now  included  multi- 
culturalism  as  part  of  the  very  definition  of  citizen¬ 
ship.  As  a  result,  ethnicity  is  now  a  constitutive 
part  of  the  identity  of  host  societies.  In  the  same 
way  as  liberalism  and  multiculturalism  are  under¬ 
stood  to  be  working  hand  in  hand,  citizenship  is 
now  necessarily  linked  to  multiculturalism  in 
countries  with  large  immigrant  populations.  Over 
the  past  few  decades,  transnationalism  has  become 
an  important  conceptual  approach,  and,  as  noted 
by  Rainer  Baubock,  transnational  citizenship  is 
now  accepted  as  characterizing  most  or  all  post¬ 
modern  states.  The  nation-state  is  overwhelmed  by 
such  an  expression  of  citizenship  across  borders:  It 
is  unable  to  both  confront  the  political  conse¬ 
quences  of  the  world  economy  and  respond  to  the 
challenge  of  ethnicity.  As  mobility  becomes  a  way 
of  life  and  as  the  free  movement  of  people  and 
cross-national  relations  intensify,  new  forms  of 
transnational  citizenship  increasingly  contribute  to 
reshape  traditional  models  of  citizenship. 

The  Individual  Migrant  as  an  Actor 

At  the  international  level,  citizenship  has  also 
been  transformed  through  the  emergence  of  migrants 
as  individual  actors  on  the  international  scene. 
Migrants  are  at  the  cross-point  between  identities  of 
states  and  blurred  boundaries  of  territories.  As  they 
struggle  to  achieve  their  individual  goals  and  to 
reshape  their  own  life,  they  become  “actors  of  the 
future”  par  excellence:  They  no  longer  feel  doomed 


just  because  they  were  born  in  a  misgoverned  and 
poor  country.  Through  remittances,  migrants  also 
have  an  effect  on  the  lives  of  individuals  in  their 
country  of  origin.  Further,  various  transnational 
practices  of  migrants  challenge  the  international 
order  that  was  created  by,  and  served  to,  nation¬ 
states  with  a  large  immigrant  population. 

At  the  internal  level,  the  existence  of  discrimina¬ 
tion  and  diversity  complicate  the  issue  of  citizen¬ 
ship.  Also,  as  noted  above,  Islam,  which  is  a 
minority  religion  in  most  immigration  countries, 
introduces  a  system  of  plural  allegiances.  This  sys¬ 
tem  differs  from  the  former  model  of  citizenship, 
which  implied  citizens’  exclusive  allegiance  to  the 
nation-state.  Ethnic  diversity  breaks  the  myth  of  a 
homogeneous  nation,  in  which  citizenship  would 
rest  on  a  social  contract  and  where  cultural  prac¬ 
tices  and  values  of  citizens  would  only  be  expressed 
in  a  “private  sphere,”  while  the  public  sphere 
would  be  freed  from  such  practices  and  values. 
Minority  religions,  poverty,  and  ethnicity  often 
lead  to  discrimination.  Discrimination  in  turn  casts 
doubt  on  the  effectiveness  of  declarative  citizen¬ 
ship  rights  that  neither  question  citizens’  unequal 
access  to  rights  nor  their  unequal  treatment  by 
institutions.  Thanks  to  migration,  diversity  is  seen 
as  a  value  in  itself  that  should  be  considered  as  an 
integral  part  of  modern  citizenship;  and  modern 
citizenship  has  become  plural,  transnational,  multi¬ 
cultural,  and  antidiscriminative. 

Migration  and  the  Field  of  Political  Science 

Migration,  by  nature,  is  multidisciplinary;  thus,  it 
took  a  long  time  for  political  scientists  to  include  it 
as  a  legitimate  political  science  object  of  study. 
Migration’s  main  contribution  to  the  field  of 
political  science  lies  in  the  way  it  challenges  the 
state  as  the  main  actor  that  regulates  borders,  iden¬ 
tity,  and  citizenship.  Migration  places  individuals 
on  the  forefront:  Through  it,  they  become  actors  of 
IR.  Transnational  ties  have  become  the  most 
important  networks,  and  migrants,  who  play  an 
active  role  in  managing  mobility,  have  become 
multilateral  decision  makers.  By  creating  new  val¬ 
ues  for  living  together  in  a  political  community, 
migration  enriches  citizenship.  Illegal  immigrants 
are  now  campaigning  for  the  consecration  of  a 
right  to  mobility  as  a  human  right.  In  a  world  of 
free  movement  and  blurred  borders,  such  a  right 
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could  be  managed  through  global  governance.  Up 
until  now,  attempts  to  unify  migration  theories 
have  failed  because  these  theories  rest  on  numer¬ 
ous  and  very  dissimilar  macro-  and  micro¬ 
approaches.  When  states  failed  to  control  borders 
and  to  prevent  violence  and  religious  extremism, 
they  attempted  to  regulate  migration  through  secu¬ 
rity  approaches  that  came  from  the  field  of  strate¬ 
gic  studies.  Migration,  however,  is  an  ordinary 
human  phenomenon  that  has  always  existed  and 
that  should  be  studied  through  the  lens  of  com¬ 
parative  approaches  on  “living  together”  in  an 
interdependent  and  constantly  changing  world. 

Catherine  Wihtol  de  Wenden 
CERI-Sciences  Po/CNRS 
Paris,  France 

See  also  Advocacy  Networks,  Transnational;  Citizenship; 
Sovereignty 
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Military  Rule 


For  most  of  human  history,  attaching  “military” 
to  “rule”  would  have  been  redundant,  because 
almost  all  political  regimes  in  large-scale  societies 
of  the  premodern  period  fused  military,  religious, 
economic,  and  monarchical  power.  The  separa¬ 
tion  of  military  and  civilian  powers  and  the  devel¬ 
opment  of  professional,  bureaucratic  armed  forces 
in  European  states  in  the  18th  and  19th  centuries 
gave  birth  to  the  contemporary  understanding  of 
military  rule.  The  most  useful  definition  of  the 
term  is  a  political  regime  in  which  the  military  as 
an  organization  holds  a  preponderance  of  power. 
Military  rule  in  this  definition  is  synonymous  with 
military  regime  and  refers  to  a  subtype  of  author¬ 
itarian  regime.  Other  definitions  can  be  useful  for 
exploring  specific  issues.  For  example,  military 
rule  is  sometimes  defined  as  a  political  regime  in 
which  the  head  of  the  executive  is  an  active-duty 
member  of  the  armed  forces.  Another  definition 
restricts  military  rule  to  a  regime  in  which  the 
executive  is  ruled  by  a  Latin  American-style  junta 
consisting  of  commanders  of  each  of  the  branches 
of  the  armed  forces.  Nevertheless,  the  broad  orga¬ 
nizational  definition  given  above  is  probably  the 
most  useful.  It  allows  for  the  exploration  of  a 
form  of  rule  that  has  been  both  common  and — 
frequently — violently  repressive.  This  entry  dis¬ 
cusses  the  following  important  issues  related  to 
military  rule:  the  factors  that  lead  to  military  rule; 
how  militaries  mobilize  support  and  exercise 
power;  why  and  how  military  rule  comes  to  an 
end;  and  what  sorts  of  political  arrangements  are 
most  likely  to  diminish  military  rule,  now  and  in 
the  future. 

Not  all  authoritarian  regimes  involve  military 
rule.  In  the  20th  century,  the  most  repressive  non- 
democratic  regimes,  most  notably  the  Nazis  in 
Germany  and  the  Stalinist  regime  in  the  Soviet 
Union,  were  party  dictatorships  in  which  civilian 
control  of  the  military  was  well  established.  Other 
types  of  authoritarian  rule  distinct  from  military 
rule  include  traditional  (e.g.,  absolutist  monar¬ 
chies)  and  personalistic  or  “sultanistic”  regimes. 

Military  rule  since  the  end  of  World  War  II  has 
occurred  almost  exclusively  in  countries  of  the  so- 
called  developing  world.  Explanations  of  the  phe¬ 
nomenon  vary.  Some  scholars  explain  various 
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aspects  of  military  rule  largely  in  terms  of  political 
economy  and  specifically  to  the  position  of  a  given 
country  in  the  global  capitalist  system  as  well  as  its 
domestic  class  structure.  This  is  a  hallmark,  for 
example,  of  studies  of  “bureaucratic-authoritarian” 
military  regimes  or  military  rule  marked  by  a  high 
degree  of  state  capacity  both  in  managing  the 
economy  and  demobilizing  and  repressing  civil 
society.  Other  explanations  rely  more  on  geopo¬ 
litical  dynamics,  including  the  international  alli¬ 
ances  within  which  military  regimes  are  embed¬ 
ded.  Studies  of  Cold  War  “clients”  of  the  two 
superpowers,  many  of  which  were  military  regimes, 
are  an  example  of  this  approach.  A  third  perspec¬ 
tive  ascribes  military  rule  to  specific  national  and 
regional  cultures.  Finally,  a  fourth,  institutionalist 
perspective  sees  military  rule  as  an  outcome  of 
more  or  less  rational  and  more  or  less  historically 
conditioned  responses  to  specific  challenges  of  eco¬ 
nomic  development  and  political  legitimacy  in  the 
developing  world. 

Origins  of  Military  Rule 

Militaries  are  part  of  the  state  apparatus  but 
enjoy  a  high  degree  of  relative  autonomy  because 
of  their  control  over  the  means  of  coercion.  (This 
control,  while  still  significant  in  most  places, 
does  not  necessarily  represent  a  monopoly,  owing 
to  the  prevalence  of  irregular  armed  forces  in  the 
developing  world.)  However,  militaries  do  not 
constitute  a  monolithic,  single  actor.  They  are 
hierarchically  divided  between  a  high  command, 
junior  officers,  and  enlisted  personnel,  and  hori¬ 
zontal  competition  and  rivalry  between  the  dif¬ 
ferent  service  branches  (typically  the  Army, 
Navy,  and  Air  Force)  can  be  intense.  Further, 
they  are  often  divided  along  class,  regional,  and 
gender  lines  (although  militaries  in  most  develop¬ 
ing  countries  still  allow  very  limited  roles  for 
women).  In  ethnically  divided  societies,  variation 
in  rates  of  military  recruitment  across  the  major 
ethnic  groups  can  result  in  the  armed  forces 
being  seen  as  constituted  by,  or  representing,  one 
ethnic  group  against  others.  All  these  divisions 
tend  to  be  exacerbated  when  the  military  comes 
to  power,  and  many  military  regimes  have  foun¬ 
dered  due  to  their  inability  to  manage  them. 

Modernization  theorists,  influential  in  the 
1950s  and  1960s,  were  initially  confident  that 


the  newly  independent  nations  of  the  Middle 
East,  Africa,  and  Asia  (as  well  as  Latin  America) 
would  evolve  into  capitalist  democracies,  with 
civilian  control  over  the  military.  These  expecta¬ 
tions  were  dashed  by  a  wave  of  military  coups 
d’etat  that  reached  its  height  in  the  1960s  and 
1970s.  The  army  was  usually  the  most  important 
actor  in  these  actions.  Uncovering  the  mecha¬ 
nisms  of  coups  is  difficult,  since  of  course  the 
necessity  of  surprise  requires  coup  plotters  to 
work  in  secret.  Coups  have  been  common  in 
modern  politics.  One  study  claims  that  since 
1945  there  have  been  345  successful  military 
coups  around  the  world  (see  Paul  Collier,  2009, 
p.  8)  on  an  average  of  more  than  five  per  year. 
But  for  each  successful  coup,  there  were  several 
attempted  coups  and  coup  plots  that  were  foiled 
before  they  could  be  carried  out,  making  the  phe¬ 
nomenon  even  more  ubiquitous.  Factors  that 
have  been  associated  with  successful  coups 
include  acceptance  of  the  coup  by  a  hegemonic 
regional  power,  the  military’s  cultivation  of  key 
civilian  allies,  the  exploitation  of  the  weakness  or 
absence  of  the  deposed  leader,  the  speed  and 
decisiveness  of  the  coup,  the  seizure  and  use  of 
the  major  means  of  communication  to  justify  the 
coup  and  establish  the  new  regime,  and  purges  of 
opponents  in  the  armed  forces,  the  rest  of  the 
state,  and  civil  society. 

A  key  moment  after  a  military  coup  is  in  the 
intervening  weeks  and  months,  when  the  new  lead¬ 
ers  establish  their  political  regime.  Military  regimes 
are  usually  marked  by  one  or  more  of  the  follow¬ 
ing  characteristics: 

key  political  leadership  held  by  military  officers; 

the  lack  of  central,  civilian  political  control  over  the 
armed  forces; 

the  application  of  military  law  to  civilians;  and 

the  threat  or  use  of  extrajudicial  repression  (such  as 
torture,  disappearances,  and  killings)  by  the  state’s 
security  forces. 

Military  rule  can  be  either  temporary  or  long 
term.  It  can  also  revolve  around  a  single  strong 
man  or  woman  and  his  or  her  followers  or  develop 
a  more  corporate  and  institutionalized  ethos.  The 
rise  of  military  regimes  elicited  a  reaction  from 
modernization  theorists,  some  of  whom  decried 
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the  trend,  while  others  identified  the  military  as  a 
preeminently  “modernizing”  institution  and 
praised  the  armed  forces’  allegedly  unique  capacity 
to  achieve  economic  development  and  political 
stability.  However,  cross-national  statistical  work 
has  not  found  a  correlation  between  military  rule 
and  high  economic  growth  rates.  While  examples 
of  successful  economic  management,  such  as  that 
of  the  Park  Chung-hee  regime  in  South  Korea 
(1961-1979)  exist,  there  are  also  many  examples 
of  military  regimes  whose  economic  record  is  less 
than  stellar. 

Analyses  of  the  rise  of  military  rule  in  develop¬ 
ing  countries  abound.  Large-w  empirical  studies 
suggest  that  there  is  no  direct  correlation  between 
the  size  of  the  military  or  its  budget  and  its  pro¬ 
pensity  to  seize  power.  Further,  the  reasons  for 
hierarchical  coups  (led  by  the  high  command) 
tend  to  be  different  from  those  for  coups  led  by 
junior  officers  (those  with  the  rank  of,  or  equiva¬ 
lent  to,  Army  captain  or  below).  Rather  more  use¬ 
ful  is  the  distinction  between  factors  internal  to 
the  armed  forces,  domestic  political  variables,  and 
international  influences.  In  the  first  category,  vio¬ 
lations  of  military  hierarchy  by  civilian  politicians, 
an  expansion  of  the  military’s  capacity  and/or 
sense  of  mission,  and  a  heightened  sense  of  threat 
can  all  trigger  coups.  With  regard  to  domestic 
politics,  high  degrees  of  political  conflict  (espe¬ 
cially  ethnic  and  religious  conflict),  economic  cri¬ 
ses,  weak  political  parties  (especially  right-wing 
parties),  and  low-capacity  state  institutions  have 
been  observed  to  precede  military  takeovers. 
Significant  in  this  category  is  also  the  image  of  the 
military  in  national  politics  and  in  particular  the 
degree  of  popular  identification  of  the  military 
with  certain  positive  national  values.  Inter¬ 
nationally,  the  threat  of  or  defeat  in  war,  foreign 
political  and  military  assistance,  and  an  enabling 
international  environment,  including  military  rule 
in  neighboring  countries  and  international  recog¬ 
nition  of  military  regimes,  can  facilitate  coups.  A 
“cascade  effect”  has  been  observed  in  some 
regions,  whereby  military  rule,  first  established  in 
a  single  country,  occurs  elsewhere  in  subsequent 
years,  leading  to  cooperation  between  military 
regimes.  (For  example,  the  1964  coup  in  Brazil 
was  followed  by  a  coup  in  1966  in  Argentina, 
coups  in  1973  in  Chile  and  Uruguay,  and  again  in 
Argentina  in  1976.) 


In  general,  there  has  been  a  shift  in  recent 
decades  away  from  macrohistorical  explana¬ 
tions  of  military  rule  to  studies  that  emphasize 
the  importance  of  contingency,  strategic  inter¬ 
action,  and  short-term  factors.  Nevertheless, 
much  recent  work  suggests  that  an  important 
structural  variable  is  causally  connected  to  the 
rise  and  decline  of  military  rule:  superpower 
competition  during  the  Cold  War.  Large 
amounts  of  military  assistance  from  the  United 
States  and  the  Soviet  Union  strengthened  mili¬ 
tary  capacity  within  allied  or  “client”  states. 
Within  the  U.S.  sphere  of  influence,  the  increased 
emphasis  on  internal  security  threats  in  the 
wake  of  the  Cuban  Revolution  (1959)  contrib¬ 
uted  to  an  increase  in  direct  military  involve¬ 
ment  in  politics.  Since  the  end  of  the  Cold  War 
and  the  breakup  of  the  Soviet  Union  in  1991, 
there  has  been  a  marked  decline  in  the  number 
of  military  regimes  in  the  developing  world. 
This  decline  had  already  begun  during  the  so- 
called  third  wave  of  democratization  that  started 
in  the  mid-1970s,  continued  with  the  fading  of 
the  Cold  War  in  the  1980s,  and  extended  into 
the  post-Cold  War  period  of  the  1990s  and 
2000s. 

Mechanisms  and  Impacts  of  Military  Rule 

Militaries  are  hierarchical  organizations  that  spe¬ 
cialize  in  the  deployment  of  violence,  so  it  is  often 
assumed  that  militaries  rule  by  force  and  force 
alone.  However,  military  rule  often  involves  com¬ 
plicated  attempts  to  secure  some  measure  of  con¬ 
sent  from  the  governed.  Some  military  regimes,  for 
example,  have  permitted  elections  to  national  and 
subnational  representative  bodies.  Others  have 
used  judiciaries,  of  varying  degrees  of  indepen¬ 
dence,  to  approximate  or  simulate  the  rule  of  law. 
Still  others  have  promulgated,  and  sometimes 
actually  adhered  to,  constitutions. 

Unsurprisingly,  empirical  studies  suggest  that 
military  regimes  are  likely  to  increase  military 
budgets  more  than  their  civilian  counterparts. 
Similarly,  they  tend  to  engage  in  more  human 
rights  abuses  than  do  civilian  regimes.  Military 
rulers  usually  confront  a  “coercion  problem,”  in 
which  they  not  only  require  security  forces  to 
engage  in  repression  but  also  need  to  monitor 
those  security  forces,  something  they  can  do  in 
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different  ways,  each  of  which  has  costs.  When  it 
comes  to  regime  legitimation,  most  military  rulers 
take  pains  to  present  their  intervention  in  politics 
as  being  compelled  by  some  sort  of  crisis  (of  cor¬ 
ruption,  economic  instability,  succession,  or  the 
like)  and  carried  out  strictly  in  the  service  of  the 
nation.  “Nation  building”  has  often  been  a  key 
goal  of  military  rule  in  the  developing  world.  In 
several  instances,  military  regimes  lacked  a  legiti¬ 
mating  ideology  of  their  own,  describing  their 
rule  as  a  temporary  interlude  necessary  for  restor¬ 
ing  “order”  before  restoring  power  to  civilians. 
This  led  Juan  Linz  (1974)  to  call  such  instances  of 
military  rule  “authoritarian  situations”  rather 
than  “authoritarian  regimes.” 

Military  rule  increases  the  probability  of  subse¬ 
quent  military  coups  and  attempted  coups.  The 
rewards  of  direct  rule  often  increase  competition 
and  conflict  within  the  armed  forces.  Some  mili¬ 
tary  regimes  attempt  to  manage  this  competition 
by,  for  example,  allocating  the  spoils  of  office 
equitably  between  the  different  service  branches. 
(This  was  true  of  the  1976-1983  military  regime  in 
Argentina. )  Other  military  regimes  carefully  moni¬ 
tor  and  purge  personnel  within  the  armed  forces 
and/or  the  state  as  a  whole. 

Military  regimes  have  also  been  linked  to  mili¬ 
tarism  or  the  glorification  of  war  and  military 
prowess.  Many  military  leaders  see  politics  as  a 
continuation  of  war  by  other  means.  This  leads 
them  to  resort  to  force  in  the  resolution  of  con¬ 
flicts.  Military  rulers  may  also  demand  that  civil¬ 
ian  organizations  develop  hierarchical  and  disci¬ 
plined  configurations  along  military  lines. 

Such  demands  can  backfire.  Some  military 
regimes  have  inadvertently  stimulated  a  flowering 
of  oppositional  cultural  and  political  activity,  as 
artists,  students,  religious  leaders,  dissidents,  and 
others  express  themselves  in  new  ways  in  opposi¬ 
tion  to  the  authoritarianism  inherent  in  military 
rule.  The  attempted  imposition  of  martial  stan¬ 
dards  of  behavior  on  recalcitrant  populations  can 
produce  rare  moments  of  political  electricity  in 
which  large  numbers  of  people  are  united  in  defi¬ 
ance  of  the  generals.  The  popularity  of  Fela 
Anikulapo  Kuti  (1938-1997),  the  outspoken  musi¬ 
cian  and  critic  of  military  rule  in  Nigeria,  or  the 
participation  of  many  of  the  most  popular  artists 
of  the  day  in  the  “direct  elections  now”  campaign 
(diretas-ja)  in  Brazil  in  1984  are  cases  in  point. 


Such  groundswells  of  cohesive,  broad-based  oppo¬ 
sition  usually  dissolve  once  military  rule  has  ended, 
however. 

Transitions  From  Military  Rule 

Most  military  regimes  of  the  1960s  and  1970s 
became  civilian  in  subsequent  decades.  Analysts 
distinguish  between  regime  liberalization,  or  the 
lifting  of  repression  and  the  restoration  of  various 
civil  liberties,  and  democratization,  or  the  reestab¬ 
lishment  of  a  civilian  multiparty  regime  with 
accompanying  democratic  rights.  There  is  some 
debate  over  whether  the  first  process  leads  inevita¬ 
bly  to  the  second.  Regime  transitions  presided  over 
by  the  military,  in  which  democracy  is  the  osten¬ 
sible  end  goal,  have  been  especially  problematic 
because  militaries  have  tended  to  periodically 
interfere  in  the  process  in  order  to  produce  their 
desired  outcome.  An  example  of  this  is  Nigeria, 
where  the  military  regime  of  Ibrahim  Babangida 
(1985-1993)  initially  promised  to  return  the  coun¬ 
try  to  civilian  rule  by  1990,  pushed  that  deadline 
back  to  1992  after  a  coup  attempt,  and  then 
annulled  the  presidential  elections  of  1993.  The 
Babangida  “transition”  ended  in  a  coup  led  by 
General  Sani  Abacha  in  1993. 

Alfred  Stepan  (1988)  makes  the  important  dis¬ 
tinction  between  the  military  as  government  (usu¬ 
ally  a  president  and  his  advisors),  the  security 
forces,  and  the  military  government  (the  chain  of 
command  of  the  active-duty  military).  He  argues 
that  the  long,  complicated  transition  to  civilian 
rule  overseen  by  the  Brazilian  military  regime 
(1964-1985)  involved  a  tacit  pact  between  the  first 
and  third  of  those  organizations  with  moderate 
opponents  of  the  regime.  Hardliners  in  the  security 
forces  and  radical  opponents  of  military  rule  were 
both  marginalized  in  a  transition  marked  by  a  high 
degree  of  institutional  conservatism  and  continuity 
of  political  leadership. 

Military  regimes  have  ended  in  a  variety  of 
ways.  Some  have  collapsed  after  a  failed  military 
adventure  (e.g.,  Greece  in  1974  and  Argentina  in 
1983)  while  others  managed  to  negotiate  their  way 
out  of  power  through  the  use  of  formal  or  informal 
agreements  (an  example  of  the  former  is  the  1984 
Naval  Club  Pact  in  Uruguay).  In  an  unusual  exam¬ 
ple,  the  Chilean  military  regime  (1973-1990)  was 
voted  out  of  office  in  a  1988  plebiscite  scheduled 
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in  its  1980  Constitution;  this  was  followed  by  a 
1989  election  that  restored  civilian  rule.  The  usual 
combination  of  mechanisms  for  establishing  or 
reestablishing  a  civilian  rule  after  a  military  regime 
is  a  collapse  or  insurrection,  a  pact,  or  holding  of 
elections.  Examples  of  all  these  mechanisms, 
including  transitions  that  involved  all  three,  can  be 
found  in  the  historical  record.  Not  all  transitions 
actually  lead  to  civilian  rule,  of  course.  Military 
regimes  are  sometimes  replaced  by  a  new  version 
of  the  same  type  of  regime,  as  in  Nigeria  in  1993, 
when  General  Abacha  replaced  General  Babangida. 

Military  prerogatives  established  under  military 
rule  can  outlast  the  military  regime  itself.  These 
prerogatives  include  army  control  over  the  police 
and/or  a  role  for  the  military  in  internal  public 
security;  a  special  responsibility  for  “law  and 
order”  or  the  rule  of  law  being  bestowed  on  the 
armed  forces  in  the  constitution,  giving  it  constitu¬ 
tional  cover  for  political  intervention;  a  fixed  allot¬ 
ment  of  the  national  budget  for  the  military;  higher 
salaries  for  military  officers  than  other  state  offi¬ 
cials;  control  over  the  intelligence  apparatus;  con¬ 
trol  over  civilian  activities  (such  as  civil  aviation); 
economic  privileges  (such  as  special  export-import 
licenses,  direct  control  over  state-owned  firms,  and 
the  like);  and  military  veto  power  over  various 
decisions  beyond  national  defense.  An  important 
question  that  can  be  asked  of  any  civilian  regime 
established  after  military  rule  is  which  of  the  state 
activities  need  military  approval?  The  answer  in 
some  new  democracies  would  comprise  a  lengthy 
list.  Stable  civilian  rule  is  not  synonymous  with  the 
reduction  of  military  prerogatives,  and  indeed 
civil-military  peace  is  sometimes  purchased  at  the 
price  of  not  reviewing  or  reforming  any  of  these 
authoritarian  enclaves  or  legacies  of  military  rule. 

Politics  and  the  Future  of  Military  Rule 

In  recent  decades,  there  have  been  strong  interna¬ 
tional  pressures  to  civilianize  military  regimes  and 
to  avert  or  roll  back  military  coups.  The  end  of  the 
Cold  War  and  economic  globalization  have  cre¬ 
ated  an  environment  in  which  overt  military  rule  is 
less  accepted  than  it  was  in  the  past  and  in  which 
military  interventions  and  military  regimes  are 
more  likely  to  falter  or  fade  than  they  would  previ¬ 
ously.  “Failed”  coup  attempts  in  Guatemala  in 
1993,  Paraguay  in  1996,  and  Ecuador  in  2000  are 


cases  in  point  (the  degree  of  their  failure  is  open  to 
interpretation;  in  some  instances,  such  as  in 
Ecuador,  coup  participants  did  succeed  in  ousting 
an  elected  president).  Similarly,  the  desire  for 
admission  into  the  European  Union  (EU)  has  led 
the  Turkish  military,  which  has  engaged  in  fre¬ 
quent  military  interventions  in  the  last  few  decades, 
to  engage  in  a  new  kind  of  restraint  in  its  interac¬ 
tions  with  civilian  politicians.  Another  example  is 
that  of  Pakistan  where  the  regime  of  General 
Pervez  Musharraf  (1999-2008)  was  eventually 
replaced  by  an  elected  civilian  regime.  Despite 
these  trends,  the  increased  emphasis  on  security  in 
the  aftermath  of  the  2001  terrorist  attack  in  the 
United  States  has  enhanced  the  possibilities  for 
militaries  to  engage  in  authoritarian  “solutions”  to 
political  problems  around  the  world.  The  military 
may  therefore  remain  a  powerful  force  behind  the 
scenes  in  many  different  regimes. 

There  are  few  overt  military  regimes  today. 
Even  in  Burma  (Myanmar),  which  has  been  ruled 
by  a  military  junta  since  1962,  the  generals  spoke 
of  the  possibility  of  elections  in  2010.  Similarly, 
the  Thai  military  coup  of  2006  that  toppled  Prime 
Minister  Thaksin  Shinawatra  did  not  lead  to  a 
long-standing  military  regime.  Elections  were  held 
at  the  end  of  2007,  and  the  military  rule  lasted  for 
only  16  months.  Military  rule  established  after  a 
coup  in  the  West  African  country  of  Guinea 
(Conakry)  in  December  2008  also  appears  des¬ 
tined  to  be  replaced  by  a  civilian  regime. 

Debates  exist  as  to  how  best  to  institutionalize 
civilian  control  over  the  military,  once  military  rule 
is  over.  Some  scholars  assert  that  limiting  militaries 
to  a  classic  external  defense  role  is  the  best  recipe 
for  success.  By  forbidding  the  military  to  engage  in 
internal  roles  such  as  drug  interdiction,  social  wel¬ 
fare  programs,  the  building  of  infrastructure,  and 
the  like,  civilians  will  curb,  the  reasoning  goes,  the 
military’s  temptation  to  abuse  its  power  and  inter¬ 
vene  in  politics.  Others  take  the  opposite  view  and 
argue  that  expanded  internal  missions  for  the 
armed  forces  are  healthy  for  democracy,  in  that 
they  enhance  the  military’s  image,  focus  officers’ 
minds  on  the  country’s  problems,  and  diminish  the 
chances  of  coups.  Others  go  beyond  this  particular 
debate  to  point  to  the  importance  of  the  develop¬ 
ment  of  civilian  expertise  on  defense  matters — not 
only  in  the  Ministry  of  Defense  and  parliament  or 
Congress  but  also  in  civil  society. 
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The  recent  discussion  of  “hybrid”  regimes 
(regimes  with  mixed  authoritarian  and  democratic 
characteristics)  is  relevant  to  the  analysis  of  mili¬ 
tary  rule.  While  coups  and  overt  military  rule  are 
less  prevalent  now  than  in  the  past,  military  influ¬ 
ence  is  more  likely  to  be  exerted  in  subtle  and 
disguised  ways,  as  civilian  state  institutions  are 
“hollowed  out”  and  military  forces  take  their 
place.  Protected  democracies,  with  strong  military 
prerogatives  and  effective  military  veto  power  over 
issues  outside  national  defense,  are  more  common 
today  than  conventional  military  regimes. 

The  recent  diminution  of  military  rule  in  the 
developing  world  has  been  a  significant  and  wel¬ 
come  development.  Nevertheless,  the  international 
and  domestic  political  circumstances  that  helped 
foster  this  transformation  are  not  necessarily  per¬ 
manent.  If  military  rule  has  become  a  kind  of 
political  taboo  in  international  circles,  the  mili¬ 
tary’s  control  over  the  means  of  coercion  still  gives 
it  significant  power.  This  power  could  become 
more  overt  than  it  is  now  in  moments  of  crisis.  It 
is  therefore  too  early  to  write  the  obituary  for  this 
type  of  regime. 

Anthony  W.  Pereira 
King’s  College  London 
Strand,  United  Kingdom 
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Militias 


The  term  militia  can  have  multiple  meanings  rang¬ 
ing  from  the  legal  to  the  historical,  but  in  general, 
it  denotes  a  military  force  composed  of  ordinary 
citizens.  A  militia  may  be  used  to  provide  defense, 
law  and  order,  or  security  during  emergencies.  It 
may  be  a  way  for  the  state  to  fulfill  a  legal  obliga¬ 
tion  to  provide  defense  activities  to  protect  a  com¬ 
munity,  its  territory,  property,  or  laws.  In  some 
countries  or  cultures,  a  militia  may  consist  of 
volunteers  who  are  able-bodied  citizens,  including 
women,  whereas  in  other  countries  or  cultures,  it 
may  be  a  reserve  force  composed  of  citizens/sol¬ 
diers.  It  can  also  be  a  private,  nonstate  force  with 
or  without  government  sanction.  The  term  militia 
can  also  refer  to  a  national  police  force,  as  in  Nazi 
Germany  or  in  former  communist  states  such  as 
the  Soviet  Union  or  Yugoslavia.  This  entry  dis¬ 
cusses  the  empirical  meaning  of  the  concept  and 
the  phenomenon  and  the  role  of  public  and  pri¬ 
vate  militias  in  a  number  of  countries. 

Definition  and  Early  Examples 

Etymologically,  the  term  militia  is  derived  from  the 
Latin  miles,  implying  soldiers;  the  suffix  itia 
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denotes  a  state,  activity,  quality,  or  condition  of 
being.  Hence,  the  term  militia  ( miles  +  itia)  implies 
a  military  service.  Although  the  classical  Latin 
meaning  of  militia  implied  soldiers  in  service  of  a 
sovereign  or  a  state,  today  it  means  “a  military 
force  raised  from  the  civilian  population  of  a  coun¬ 
try  or  region,  especially  to  supplement  regular 
army  in  an  emergency,  distinguished  from  merce¬ 
naries  or  professional  soldiers”  ( Oxford  English 
Dictionary,  2002).  It  is  important  to  distinguish 
militia  composed  of  professional  soldiers  in  paid 
service  from  those  made  of  persons  giving  volun¬ 
tary  or  unpaid  service  on  an  ad  hoc  basis. 

Its  history  can  be  traced  back  at  least  as  far  as 
the  early  colonial  period  in  the  United  States,  when 
there  were  many  examples  of  groups  of  able- 
bodied  males  who  were  ready  to  fight  in  any  emer¬ 
gency.  In  fact,  the  militia  was  the  first  permanent 
armed  military  force  in  the  13  colonies.  George 
Washington,  then  adjutant  general  of  the  Virginia 
militia,  called  on  the  militia  to  respond  to  a  fron¬ 
tier  Indian  attack.  Article  VI  of  the  Articles  of 
Confederation  (1777)  stated, 

Every  State  shall  always  keep  up  a  well-regulated 
and  disciplined  militia,  sufficiently  armed  and 
accoutered  and  shall  always  provide  and 
constantly  have  ready  for  use,  in  public  stores,  a 
due  number  of  field  pieces  and  tents,  and  a 
proper  quantity  of  arms,  ammunition  and  camp 
equipage. 

Similarly,  militia  turned  out  with  great  alacrity 
in  Maryland  and  Pennsylvania,  where  they  per¬ 
formed  the  role  of  internal  police  and  looked  out 
for  people  with  suspect  allegiances.  Militias  con¬ 
tinued  to  be  raised  following  the  Revolutionary 
War  but  often  lacked  traditional  military  values. 
Officers  were  commonly  selected  on  the  basis  of 
popularity  rather  than  their  military  training, 
experiences,  or  ethos.  The  public  was  also  reluc¬ 
tant  to  support  the  militia  through  taxation.  The 
militia  was  seen  as  a  peacetime  army  outside  civil¬ 
ian  control.  For  instance,  in  1794,  a  militia  com¬ 
prising  about  13,000  people  was  raised  by  President 
George  Washington  to  quell  the  Whiskey  Rebellion 
in  the  state  of  Pennsylvania.  In  1802,  the  federal 
academy  at  West  Point  was  established  to  rectify 
some  of  the  flaws  in  the  training  of  the  militia  and 
to  inculcate  in  them  a  greater  sense  of  discipline. 


After  the  end  of  the  U.S.  Civil  War,  the  role  of 
“policing  the  southern  states”  fell  to  provisional 
militia.  A  significant  number  of  these  troops  were 
black,  and  constant  tension  prevailed  between  the 
official  and  unofficial,  permanent  and  ad  hoc, 
white  and  black,  and  male  and  female  sections  of 
the  militia.  In  1903,  a  Militia  Act  was  passed  that 
allowed  various  states  to  maintain  a  reserve  mili¬ 
tary  force  known  as  the  National  Guard.  Their 
primary  job  was  to  assist  the  soldiers  and  the  mili¬ 
tary  police.  It  was  composed  of  able-bodied  men  in 
the  age  group  of  17  to  45  years.  Today,  the  age 
limit  for  enlistment  with  no  prior  service  is 
35  years  for  the  Army  National  Guard  and 
40  years  for  the  Air  Force  National  Guard.  For  the 
Army  and  Air  Force  National  Guards,  the  maxi¬ 
mum  age  for  enlistment  for  those  with  prior  service 
is  59  years,  as  long  as  the  member  has  enough  years 
of  prior  service  to  be  able  to  complete  20  years  of 
creditable  service  for  retirement  by  60  years  of  age. 

Militias  in  the  Contemporary  World 

The  role  of  the  modern  militia  can  be  seen  both  as 
a  service  and  as  a  duty.  It  can  be  organized  or 
unorganized.  There  are  also  private  militias  formed 
of  nonorganized  individuals  based  on  their  own 
concept  of  militia.  In  Austria,  the  multiple  militias 
became  affiliated  with  certain  political  parties  after 
World  War  I.  For  instance,  the  Heimwehr  (Home 
Defense)  became  affiliated  with  the  Christian 
Socialist  Party.  In  Canada,  too,  the  term  militia 
was  associated  with  the  reserve  force  of  the 
Canadian  Army  in  earlier  times.  Today,  Canadian 
militias  have  no  official  standing  but  are  essen¬ 
tially  private  armies.  In  China,  there  were  militias 
of  varying  abilities.  Their  prime  motto  was  to  seek 
certain  concessions  from  the  British  rulers  during 
the  colonial  period,  such  as  inheritance,  property, 
or  marriage  rights  for  the  indigenous  peoples.  In 
Denmark,  the  Danish  Militia  played  a  crucial  role 
in  repelling  an  attack  from  Sweden  on  Copenhagen 
in  1659.  In  Iran,  the  Basij  Militia  founded  by 
Ayatollah  Ruhollah  Khomeini  in  November  1979, 
shortly  after  the  removal  of  the  Shah,  was  com¬ 
posed  of  90,000  regular  soldiers  and  300,000 
reservists.  Today,  it  has  more  than  11  million 
members. 

Similarly,  in  Iraq,  we  find  several  armed  militias 
engaged  in  defending  their  respective  cultures  and 
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territories  along  with  neighborhoods  from  any 
type  of  insurgency.  They  are  like  the  vigilante 
groups  functioning  in  underpoliced  areas  in  some 
parts  of  the  United  States  of  America.  In  Sri  Lanka, 
the  militias  were  formed  by  the  kings  for  military 
campaigns  both  within  and  outside  the  island.  In 
2004,  the  LTTE  (Liberation  Tigers  of  Tamil 
Eelam)  claimed  to  establish  an  auxiliary  force  to 
help  in  rehabilitation,  construction,  forest  conser¬ 
vation,  and  agriculture  besides  serving  as  a  reserve 
force.  However,  with  the  defeat  of  the  LTTE  at  the 
hands  of  the  Sri  Lankan  Army,  it  ceased  to  exist  in 
early  2009. 

Quite  surprisingly,  the  Swiss  militia,  at  one  time 
comprising  33%  of  the  total  population,  had  only 
2.7%  in  2004.  It  is  composed  of  voluntary  partici¬ 
pants,  including  women.  Article  59  of  the  Swiss 
constitution  has  a  provision  for  militia  known  as 
militardienst  in  German,  militaire  in  French,  ser- 
vizio  militare  in  Italian,  and  servetscb  militar  in 
Rumantsch. 

From  service-oriented  groups,  militias  have 
evolved  into  groups  with  the  right  to  defend  or 
protect.  People  engage  in  militias  in  response  to  a 
call-up  by  any  person  aware  of  the  threat  requiring 
the  response.  Even  a  single  person  or  a  group  of 
people  can  take  the  responsibility  of  defending  the 
community.  As  such,  there  cannot  be  any  mini¬ 
mum  size  of  the  militia  in  this  general  sense. 

Private  Militias 

Private  militias  are  armed  groups  of  civilians  not 
necessarily  recognized  by  the  federal  or  state  gov¬ 
ernments.  Today,  about  50%  of  the  states  in  the 
United  States  of  America  forbid  private  militias  to 
parade  and  exercise  in  public,  though  the  forma¬ 
tion  of  private  militias  is  not  forbidden  per  se. 

In  the  United  States,  some  private  militias  par¬ 
ticipated  in  the  tax  protestor  movement  during  the 
1970s  and  1980s.  Today,  the  militias  may  support 
presidential  candidates  and  oppose  certain  political 
parties  or  pressure  groups.  Other  private  militias  in 
the  United  States  have  expressed  sentiments  against 
globalization  and  the  “new  world  order”  and  are 
often  right-wing  extremist  groups  whose  leaders 
consider  the  U.S.  federal  government  illegitimate. 
It  is  difficult  to  estimate  the  number  of  U.S.  militia 
groups;  however,  the  Southern  Poverty  Law  Center 
(SPLC),  a  civil  rights  group  that  monitors  hate 


crimes,  estimated  that  militia  groups  existed  in  20 
states  in  1994,  42  states  by  late  1995,  and  all  50 
states  by  1996.  The  SPLC  has  also  reported  a 
resurgence  in  “patriot  groups”  or  militias  and 
other  extremist  groups  that  see  the  federal  govern¬ 
ment  as  their  enemy. 

Even  the  North  Atlantic  Treaty  Organization 
(NATO)  forces  in  Afghanistan  are  said  to  be  rely¬ 
ing  on  illegal  militia,  often  run  by  warlords  respon¬ 
sible  for  human  rights  abuses  and  drug  trafficking. 
NATO  is  said  to  be  using  private  militias  not  only 
to  guard  their  camps  and  convoys  but  also  for 
“black  ops.”  These  militias  function  outside 
Afghan  law,  which  bans  unlicensed  armed  groups. 
Many  of  them  compete  with  the  state  authority 
and  are  usually  run  by  former  military  command¬ 
ers  responsible  for  antisocial  activities.  Many  gov¬ 
ernments  are  now  financing  private  militias  as 
alternative  power  structures  to  fulfill  their  short¬ 
term  security  needs,  but  this  practice  can  have  very 
bad  repercussions  in  the  long  term.  Once  the  pri¬ 
vate  militias  are  financed  and  armed,  it  becomes 
difficult  to  disarm  them,  and  those  hired  to  pro¬ 
vide  security  can  become  the  very  source  of  “new 
threats”  to  the  state  by  indulging  in  antisocial 
activities. 
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Misspecification 


Misspecification  is  a  fundamental  problem  in 
empirical  modeling.  The  origins  of  this  problem 
can  be  found  in  the  theoretical  exercise  of  using  a 
statistical  model  from  a  sample  to  make  inferences 
about  an  unobservable  population  of  interest.  Any 
deviation  from  the  true  population  model  in  the 
sample  model  means  that  the  sample  model  is  mis- 
specified.  This,  in  turn,  means  that  the  inferences 
from  the  sample  model  about  the  population  are 
suspect. 

The  importance  of  problems  of  misspecification 
is  underscored  by  the  amount  of  attention  paid  to 
different  types  of  misspecification  in  introductory 
texts  on  ordinary  least  squares  (OLS)  regression 
models.  As  an  example,  consider  Damodar 
Gujarati’s  widely  used  textbook,  Basic  Econo¬ 
metrics.  Gujarati’s  treatment  of  OLS  is  centered 
around  10  assumptions  of  the  linear  regression 
model.  Six  of  these  10  assumptions  are  statements 
that  the  model  does  not  contain  one  or  more  types 
of  misspecification. 

Discussed  throughout  this  entry  is  misspecfica- 
tion  in  OLS.  The  logic  and  importance  of  mis¬ 
specification  in  OLS  extends  directly  to  other, 
more  complicated  types  of  models.  Almost  all 
such  models  contain  some  equation  that  indicates 
the  relationship  between  the  independent  vari¬ 
ables  and  the  dependent  variable  is  dictated. 
Getting  this  equation  wrong  means  that  the  model 
is  misspecified. 

As  the  Gujarati  example  shows,  misspecifica¬ 
tion  can  take  on  many  different  forms.  In  the  sec¬ 
tions  that  follow  are  discussions  of  several  of  the 
most  common  forms  of  misspecification,  the  detec¬ 
tion  of  statistical  problems  caused  by  misspecifica¬ 
tion,  and  strategies  for  avoiding  misspecification. 

Types  of  Misspecification 

The  purpose  of  empirical  model  specification  is  to 
try  to  develop  an  accurate  model  of  relationships 
in  an  unobservable  population  with  observed 
sample  data.  In  OLS,  we  can  represent  the  popula¬ 
tion  regression  model  as 

Yj  =  a  +  PjX,  +  P2  Z,  +  £, 


and  the  sample  regression  model  as 

Y,  =  a  +  (3jX,  +  P2Z,  +  e„ 

where  Y  is  the  dependent  variable;  X  and  Z  are  the 
independent  variables,  a,  (3r,  and  P2  are  sample 
estimates  of  the  population  parameters  a,  p, ,  and 
P2;  and  e,  is  the  sample  estimate  of  the  population 
stochastic  term  £,. 

If  any  element  of  the  population  model  is  not 
appropriately  represented  in  the  sample  regres¬ 
sion  model,  then  the  inferences  about  the  popu¬ 
lation  model  from  the  sample  model  may  be 
problematic.  The  entry  now  turns  to  more  in- 
depth  discussions  of  four  of  the  most  common 
forms  of  misspecification. 

Omitted  Variable  Bias 

One  of  the  most  common  critiques  of  empirical 
work  is  that  the  authors  have  left  a  relevant  inde¬ 
pendent  variable  out  of  their  model  specification. 
This  problem  associated  with  this  critique  is 
known  as  “omitted  variable  bias.”  For  example,  if 
the  population  regression  model  is 

Y,  =  a  +  (3 1 X,  +  [32Z,  +  £, 
but  the  sample  regression  model  is  specified  as 

Y,  =  a  +  (3|X,  +  £,, 

then  there  is  a  strong  possibility  that  the  sample 
model  is  prone  to  omitted  variable  bias.  To  illus¬ 
trate  the  nature  of  this  problem,  consider  what 
happens  to  the  parameter  estimate  for  the  effect  of 
X  on  Y,  (3 1 .  In  summation  notation,  the  OLS  for¬ 
mula  for  this  parameter  estimate  is 

B  =  El,(X.-X)(y.-Y) 

‘  El  ,(X,-X)2  ’ 

and  because  one  of  the  properties  of  OLS  is  that 
the  resulting  regression  line  (or  plane  in  the  case  of 
a  model  with  two  independent  variables)  goes 
through  the  mean  of  each  variable,  we  know  that 

(Y,  -  Y)  =  Pi (X;  -  X)  +  P2(Z,  -  Z) 

+  (e«  -  e); 
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we  can  see  that  the  expected  value  of  01?E (pj,  but  the  sample  regression  model  is  specified  as 
under  these  circumstances  is 


£!P,j  =  P,  +  P 


E  L,(x,-x)(z,-z) 
EE(x-x)2 


Any  time  the  expected  value  of  the  parameter  esti¬ 
mate  is  not  equal  to  the  population  value,  we  have 
a  biased  estimator.  In  this  circumstance,  £(p:)  =  0j 
only  if 


E'.,(x,  -  x)(z,  -  2) 

ZE(x,-x)2 


and  this  will  occur  only  if  either  one  of  two  things 
is  true: 


1.  The  covariance  between  X  and  Z  is  equal  to 
zero, 

cov>zIZ(*-*)(z,-2) 

n 

2.  The  true  impact  of  Z  on  Y  is  equal  to  zero, 
p2  =  0. 


Yj  —  a  +  fhX,  +  02Z,  +  f>3  W,  +  £j 

the  inclusion  of  the  variable  W,  in  the  model  is 
unnecessary. 

Wrong  Functional  Form 

Another  common  form  of  misspecification 
occurs  when  the  model  specification  fails  to  prop¬ 
erly  take  into  account  the  underlying  relationship 
between  the  independent  variable  and  the  depen¬ 
dent  variable. 

For  example,  if  the  population  regression  model  is 

Yj  =  a  +  (BjX,  +  02Z~  +  Ej 
but  the  sample  regression  model  is  specified  as 

Yj  =  a  +  p:X,  +  02Z,-  +  Ej, 

this  model  will  not  give  us  an  accurate  estimate 
of  02 . 


If  neither  of  these  conditions  holds,  then  the 
estimated  impact  of  X  on  Y  is  biased  due  to  the 
omission  of  Z  from  the  model. 

Unnecessary  Extra  Variables 

In  the  previous  section,  we  saw  the  problems 
that  can  occur  when  a  variable  that  should  be  in  a 
model  is  left  out.  The  problem  of  “unnecessary 
extra  variables”  is  the  reverse  of  the  problem  of 
omitted  variable  bias,  in  the  sense  that  variables 
that  are  not  in  the  unobservable  population  model 
have  been  included  in  the  sample  model.  Note, 
however,  that  we  do  not  call  this  problem  a  form 
of  “bias.”  This  is  because  the  presence  of  unneces¬ 
sary  extra  variables  does  not  bias  the  estimates  of 
the  parameters  for  variables  that  should  be  in  the 
model.  Instead,  the  impact  of  unnecessary  extra 
variables  is  that  they  artificially  drive  up  the  mod¬ 
el’s  R-squared  statistic  and  chew  up  extra  degrees 
of  freedom. 

For  example,  if  the  population  regression  model 
is 

Yj  =  a  +  0jX,  +  02Z,  +  Ej 


Failure  to  Include  Interactions 

Political  scientists  have  increasingly  become 
aware  of  interactive  relationships.  Although 
researchers  frequently  expressed  their  theoreti¬ 
cal  ideas  about  the  relationships  between  vari¬ 
ables  in  a  way  that  implies  interactive  relation¬ 
ships,  they  have  often  failed  to  specify  their 
models  as  such. 

For  example,  if  the  population  regression  model  is 

Yj  =  a  +  [i  j  X,  +  02Z,  +  f> ;  X,  Z,  +  Ej , 
the  sample  regression  model  is  specified  as 

Yj  =  a  +  PjX,  +  02Z,  +  Ej. 

Detecting  Misspecification 

Detecting  misspecification  is  not  an  easy  process. 
There  is  no  direct  way  of  testing  for  all  types  of 
model  misspecification.  Instead,  there  are  a  variety 
of  different  ways  to  test  for  specific  symptoms  of 
misspecification.  Most  of  the  procedures  for  doing 
this  involve  visual  and/or  statistical  examinations 
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of  the  estimated  stochastic  components  or  residu¬ 
als  after  the  original  model  has  been  estimated. 
These  tests  amount  to  using  residuals,  en  to  make 
inferences  about  the  unobserved  population  sto¬ 
chastic  components,  £,. 

Tests  for  Autocorrelation  and  Heteroskedasticity 

The  use  of  numerical  tests  for  autocorrelation 
and  heteroskedasticity  has  become  standard 
practice.  Autocorrelation  and  heteroskedasticity 
are  often  symptoms  of  model  misspecification 
(although  this  is  often  overlooked).  Consider,  for 
instance,  Gujarati’s  discussion  of  the  sources  of 
heteroskedasticity  (pp.  389-391)  and  autocorre¬ 
lation  (pp.  442-448).  Each  of  the  eight  sources 
of  heteroskedasticity  (failure  to  account  for 
learning  processes,  failure  to  account  for  income 
differences,  failure  to  account  for  improvements 
in  the  measurement  of  data  across  settings,  outli¬ 
ers,  incorrect  model  specification,  failure  to 
account  for  skewness,  incorrect  data  transforma¬ 
tion,  and  incorrect  functional  form)  and  each  of 
the  eight  sources  of  autocorrelation  (inertia, 
excluded  variables,  incorrect  functional  form, 
the  cobweb  problem,  failure  to  include  lags, 
improper  manipulation  of  data,  improper  trans¬ 
formation  of  data,  and  failure  to  account  for 
nonstationarity)  can  be  classified  as  a  type  of 
model  misspecification. 

The  general  logic  of  these  tests  is  best  seen 
through  a  discussion  of  the  omega  (Cl)  matrix.  The 
ft  matrix  contains  the  expected  covariation 
between  population  stochastic  components  across 
observations, 


e(uu')  =  ft  = 


/  cov(e1;8i) 
cov(e2,Si) 

\COv(e„,Si) 


cov(si,e2) 

C0V(82,S2) 

cov(e„,s2) 


cov(£i,e„)\ 

C0V(S2,£„) 

cov(e„,e„)/ 


which  can  be  rewritten  as 


E{uu')  =  ft  = 


/  varjEj) 
COv(s2,  £2) 

\cov(s„,£i) 


cov(s1;s2) 

var(s2) 

cov(s„,s2) 


cov(s,,s„)  \ 
cov(s2,s„) 

var(s„)  / 


Two  of  the  assumptions  that  are  implicit  in  the 
use  of  OLS  are  that  of  no  autocorrelation  and  that 
of  no  heteroskedasticity.  A  general  expression  of 
the  assumption  of  no  autocorrelation  is 

covEi,E.  =  0  V  i  ±  /, 

which  means  that  we  do  not  expect  to  see  evidence 
of  covariation  between  any  pair  of  population  sto¬ 
chastic  components.  A  general  expression  of  the 
assumption  of  no  heteroskedasticity  is 

varEi  =  a2  V  i, 

which  means  that  we  do  not  expect  to  see  evidence 
of  unequal  variance  across  population  stochastic 
components.  If  both  of  these  assumptions  hold, 
then 


/V 

o  • 

••  0  \ 

ft  = 

0 

2 

<r 

••  0 

u 

0  . 

..  a  / 

-) 

which  can  be  rewritten  as  ft  =  a1!. 

The  ft  matrix  and  the  assumptions  that 
we  make  about  it  are  important  because  they  are 
used  to  estimate  the  variance  of  OLS  parameter 
estimates, 

var(p)  =  (X'X)“1X'£(ft)X(X'X)“1. 

If  we  make  the  OLS  assumptions  about  the  sto¬ 
chastic  component,  then  this  means  substituting  in 
E(ft)  =  a2/.  We  can  then  substitute  in 


Because  we  use  var(|3)  to  test  our  theoretically 
derived  hypotheses  about  the  relationships 
between  variables,  if  our  assumptions  about  no 
autocorrelation  and  no  heteroskedasticity  are 
invalid,  we  run  the  risk  of  falsely  accepting  or 
falsely  rejecting  the  null  hypothesis  and  thereby 
arriving  at  faulty  conclusions  about  our  causal 
theories. 
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Because  of  the  importance  of  knowing  about 
the  presence  of  evidence  of  autocorrelation  and 
heteroskedasticity,  there  are  now  a  wide  range  of 
statistical  tests  that  can  be  used  to  diagnose  these 
problems.  All  of  these  tests,  either  implicitly 
or  explicitly,  use  information  from  sample  model 
residuals  contained  in  A  to  make  hypothesis¬ 
testing  inferences  about  fl.  Most  of  these  tests  do 
so  by  looking  for  specific  forms  of  autocorrelation 
or  heteroskedasticity. 

For  instance,  tests  for  first-order  serial  autocor¬ 
relation  in  time-series  data  look  for  evidence  that 
st  —  pet-  i  +  et,  which  would  show  up  in  an  A 
matrix  as 
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while  tests  for  spatial  autocorrelation  test  the 
hypothesis  that 
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Tests  for  heteroskedastity  test  the  hypothesis  that 
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Tests  for  “ Unnecessary  ”  Variables 

There  are  a  variety  of  different  statistical  tests 
that  are  designed  to  help  researchers  decide 
whether  or  not  they  should  add  one  or  more  vari¬ 
ables  to  their  model  specification.  These  tests  fol¬ 
low  the  logic  that  if  the  new  variable  or  group  of 
variables  does  not  significantly  increase  the 
explanatory  power,  then  they  are  unnecessary 


and  therefore  should  not  be  added.  While  these 
tests  can  be  helpful  guides  for  keeping  a  model 
specification  parsimonious,  the  discussion  in  the 
section  “Omitted  Variable  Bias”  should  not  be 
forgotten  when  considering  the  usefulness  of 
these  tests. 

One  of  the  most  common  tests  along  these  lines 
is  an  F  test,  for  which  the  null  hypothesis  is  that 
the  R 2  statistic  for  a  model  is  not  improved  when 
one  or  more  new  variables  are  added  to  the  speci¬ 
fication.  This  test  statistic  is  calculated  by  going 
through  the  following  steps: 

1.  Estimate  the  original  model,  which  we’ll  call 
Model  1,  and  obtain  the  R2  statistic  from  this 
model,  Rj. 

2.  Estimate  the  original  model  with  the  new 
variable(s)  included,  which  we’ll  call  Model  2, 
and  obtain  the  R 1  statistic  from  this  model,  RT 

3.  Using  the  R 2,  R~t  statistics,  the  number  of 
degrees  of  freedom  from  Model  1  (kf)  and  from 
Model  2  (k2)  to  calculate 

~  _  {R\-Rl)/(k2  -*,) 

k"kl  (1  -  Rl)/(n  -  k2) 

Correcting  for  Misspecification 

As  discussed  above,  heteroskedasticity  and  auto¬ 
correlation  get  a  lot  of  attention  in  books  and 
classes  that  introduce  OLS  regression  models.  In 
practice,  most  empirical  researchers  tend  to  view 
heteroskedasticity  and  autocorrelation  as  statistical 
diseases  that  can  be  cured  by  statistical  procedures 
such  as  feasible  generalized  least  squares  (FGLS; 
discussed  in  the  following  section).  This  type  of 
cookbook  approach  is  popular,  but  it  has  come 
under  attack  in  recent  years.  As  noted  above,  most 
of  the  sources  of  autocorrelation  and  heteroskedas¬ 
ticity  are  one  form  or  another  of  misspecification. 
When  these  symptoms  of  misspecification  are 
“cured”  by  FGLS,  opportunities  may  be  missed. 


FGLS  Procedures 

In  the  presence  of  evidence  that  £(fl)  /  cr2/ 
many  scholars  turn  to  a  set  of  procedures  known 
as  FGLS.  The  logic  of  FGLS  procedures  is  to  diag¬ 
nose  the  specific  type  of  violation  of  the  OLS 
assumptions  about  fl  by  estimating  A  and  then  to 
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correct  for  them  by  using  the  estimated  nature  of 
the  problem  contained  in  A. 

In  regular  OLS,  the  matrix  algebra  formula  for 
the  parameter  estimates  is  3OLS  =  (X'X)  X'Y. 
With  FGLS,  this  formula  becomes 

PFGLS  —  (x'  n_1  x)  1  x'  A-1  y  . 

The  formula  for  the  variance  of  the  parameter  esti¬ 
mates  is 

var(p)  =  (X'X)~1X'E(fL)X(X'X)~1 

in  both  models,  but  E(CL)  =  a  I  in  OLS. 

It  is  worth  noting  that  with  real-world  data,  we 
never  see  fl  but  we  make  inferences  from  our  OLS 
estimates  of  A.  When  we  find  evidence  of  viola¬ 
tions  in  the  OLS  assumptions  about  CL  in  our  esti¬ 
mated  matrix  A,  we  use  FGLS  to  correct  them. 
This  is  a  leap  of  faith  from  A  to  fl  and  is  worth 
keeping  in  mind  as  we  use  it  to  adjust  our  esti¬ 
mates. 

Guy  D.  Whitten 
Texas  A  &  M  University 
College  Station,  Texas,  United  States 

See  also  Interaction  Effects;  Logit  and  Probit  Analyses; 
Model  Specification;  Nonlinear  Models 
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Mixed  Methods 


Variants  of  mixed-methods  research  design,  that 
is,  the  combination  of  qualitative  and  quantitative 
components  within  one  single  research  design, 


have  been  practiced  since  the  beginnings  of  the 
social  and  related  sciences  and  before  the  complex 
construct  mixed-methods  research  was  conceived. 
It  could  be  argued  that  in  the  first  half  of  the  20th 
century,  the  combination  within  one  research 
project  of  what  was  subsequently  referred  to  as 
qualitative  and  quantitative  research  was  not 
unusual,  at  least  for  well-known  studies  such  as 
Samuel  A.  Stouffer  and  colleagues’  American 
Soldier  (1949),  Theodor  Adorno  and  colleagues’ 
Authoritarian  Personality  (1950),  and  most  types 
of  psychometric  measurement  and  scaling  proce¬ 
dures  during  this  time.  Only  in  the  1990s  did 
researchers  begin  to  systematize  this  research 
approach  and  design  to  the  extent  to  which  it 
became  explicitly  and  systematically  studied  and 
applied.  In  this  second  wave  of  conceptualization 
and  application,  Abbas  Tashakkori,  Charles 
Teddlie,  John  Creswell,  Julia  Brannen,  and  Alan 
Bryman  were  most  influential  in  establishing  this 
type  of  research  design.  It  is  often  proclaimed  that 
a  third  generation  of  mixed-methods  studies  is 
about  to  emerge.  Some  optimistic  predictions 
include  the  replacement  of  mono-method  studies 
by  mixed-methods  studies,  research  becoming 
more  ethical  or  democratic  due  to  the  integration 
of  different  methods  and  perspectives,  the  ability 
to  access  a  phenomenon  under  investigation  objec¬ 
tively  due  to  the  combination  of  qualitative  and 
quantitative  methods,  and  new  ways  of  conceiv¬ 
ing,  collecting,  and  analyzing  data  that  are  no 
longer  identifiable  as  qualitative  or  quantitative  in 
nature.  The  field  of  mixed-methods  research  is 
indeed  changing,  but  the  third  generation  is  most 
likely  to  rectify  some  of  the  simplifications  and 
misconceptions  that  have  established  themselves 
in  the  past  3  decades,  due  in  part  to  various 
attempts  to  establish  separate,  that  is,  qualitative 
and  quantitative  “parachgms”  and  the  resulting 
incompatibility  thesis.  This  entry  discusses  defini¬ 
tions,  forms,  applications,  and  further  develop¬ 
ments  of  mixed-methods  research. 

Nomenclature 

Although  now  well  established,  the  use  of  the  term 
mixed-methods  design  is  not  ubiquitous.  One  con¬ 
tender  is  methods  triangulation.  A  shortcoming  of 
this  term  is  that  it  implies  only  studies  in  which 
the  results  of  the  qualitative  and  quantitative  parts 
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converge  or  verify  each  other.  Under  certain  cir¬ 
cumstances,  this  may  indeed  be  of  interest,  but 
there  are  other  possibilities  of  mixing  methods 
unrelated  to  triangulation.  Another  alternative  is 
using  words  such  as  combining,  blending,  synthe¬ 
sizing,  or  merging  instead  of  mixing  because, 
strictly  speaking,  qualitative  and  quantitative 
methods  are  not  mixed  but  rather  combined  in 
different  ways  and  for  different  purposes.  Despite 
these  alternative  proposals,  mixed-methods 
research  design  is  now  the  accepted  nomenclature 
for  this  kind  of  design. 

Two  related  designs  should  be  mentioned 
briefly.  Multimethod  design  is  the  label  given  to 
research  designs  that  combine  at  least  two  differ¬ 
ent  quantitative  or  two  different  qualitative  meth¬ 
ods  within  one  research  project.  Accordingly,  a 
multimethod  quantitative  design  consists  of  at 
least  two  distinct  quantitative  components,  while 
multimethod  qualitative  design  consists  of  distinct 
qualitative  components  within  one  research  design. 
Finally,  mixed  model  design  describes  a  research 
design  that  combines  quantitative  and  qualitative 
components  across  all  phases  of  the  research  pro¬ 
cess,  where  it  ostensibly  is  no  longer  possible  to 
clearly  distinguish  between  the  qualitative  and 
quantitative  contributions  to  the  overall  research 
results.  However,  it  is  questionable  whether  the 
research  components  that  most  authors  refer  to  in 
this  context  can  indeed  be  clearly  attributed  to  the 
quantitative  or  qualitative  component.  Thus, 
mixed  model  design  is  best  used  for  research 
designs  where  the  quantitative  and  qualitative 
components  are  intertwined  such  that  they  are  no 
longer  clearly  separable  or  where  the  line  of 
demarcation  between  quantitative  and  qualitative 
research  components  becomes  indistinguishable. 

Justifications 

Among  the  many  interrelated  reasons  listed  in  the 
literature  on  the  improvement  of  mixed-methods 
research  on  mono-method  research  designs  are  that 
some  research  questions  are  better  answered  by 
using  the  strengths  of  qualitative  and  quantitative 
research  approaches;  mixed-methods  designs  may 
help  control  for  a  methods  bias  inherent  in  mono¬ 
method  approaches,  and  results  from  different 
methods  may  cross-validate  each  other;  a  combina¬ 
tion  of  methods  may  allow  for  a  greater  diversity 


of  theoretical  and  empirical  approaches  or  it  may 
contribute  to  a  more  holistic  perspective  on  a  given 
research  problem;  and  mixed-methods  designs 
may  improve  the  ecological  and  external  validity 
as  well  as  the  internal  consistency  of  a  study.  Most 
reasons,  however,  can  be  summarized  into  three 
families:  (1)  convergence,  (2)  complementarity, 
and  (3)  holism. 

Convergence 

The  first  set  of  justifications  for  a  mixed-meth¬ 
ods  research  design  relates  to  the  traditional  idea 
of  methods  triangulation,  where  researchers  use 
qualitative  and  quantitative  approaches  to  cross- 
validate  their  findings.  For  instance,  clinical  inter¬ 
views  often  aim  at  verifying  whether  the  score  of 
a  formal  psychometric  test  converges  with  clini¬ 
cians’  assessments.  In  voting  participation  studies, 
researchers  may  use  semistructured  interviews  to 
assess  the  convergence  of  the  interview  data  with 
the  responses  in  a  survey  on  various  reasons  for 
the  types  and  degrees  of  political  participation. 
Given  that  survey  or  test  items  are  structured  such 
that  respondents  merely  take  position  to  the 
researcher-generated  items,  and  given  that  in  a 
semistructured  or  clinical  interview,  interviewees 
are  able  to  present  their  own  perspective  on  a  par¬ 
ticular  issue,  a  convergence  of  these  two  sets  of 
data  may  indeed  contribute  to  the  triangulation 
and  thus  validation  of  research  findings. 

Complementarity 

The  second  set  of  justifications  for  employing 
mixed-methods  research  design  relates  to  the  idea 
that  different  methods  may  contribute  additional 
insights  into  a  phenomenon  under  investigation. 
In  this  sense,  the  qualitative  and  quantitative  parts 
complement  each  other  to  produce  a  particular 
answer  to  a  research  question.  In  other  words,  the 
results  would  not  have  emerged  from  employing 
either  a  qualitative  or  a  quantitative  approach. 
For  instance,  before  a  standardized  scale  can  be 
established,  the  phenomena  underlying  the  scale 
may  have  to  be  explored  in  a  nonstandardized, 
exploratory  manner.  This  procedure  is  well  estab¬ 
lished  in  measurement  theory  in  psychometrics, 
and  it  is  also  often  applied  in  question  and  ques¬ 
tionnaire  design.  In  this  sense,  political  scientists 
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may  want  to  first  explore  the  way  in  which  mem¬ 
bers  of  a  population  of  interest  think  and  behave  in 
relation  to  political  participation  before  selecting 
different  survey  items  for  their  study.  Alternatively, 
the  same  researchers  may  find  interesting  or  sur¬ 
prising  relations  between  variables  during  their 
statistical  analysis.  Rather  than  engaging  in  post 
hoc  hypothesizing,  it  may  be  interesting  to  also 
conduct  a  few  exploratory  interviews  to  help  gener¬ 
ate  explanations  or  hypotheses  for  future  studies. 

Holism 

A  number  of  important  texts  on  mixed-methods 
design  go  as  far  as  to  claim  that  the  ultimate  goal 
of  mixed-methods  research  should  be  the  examina¬ 
tion  of  a  particular  phenomenon  from  all  possible 
positions,  be  it  all  theories  potentially  suitable  for 
the  investigation  of  a  phenomenon,  all  data  avail¬ 
able,  or  all  possible  analyses  of  these  data.  Starting 
from  the  argument  that  qualitative  and  quantitative 
methods  have  different  strengths  and  weaknesses 
relating  to  ontology,  epistemology,  axiology,  the 
ability  to  identify  general  causal  laws,  and  so  on, 
their  combination  may  help  draw  from  the  strengths 
of  one  to  offset  the  weaknesses  of  the  other  types 
of  methods.  Detailed  analyses  of  all  relevant  data 
from  all  possible  theoretical  positions  and  with  all 
possible  analytic  methods  ostensibly  produce  many 
different  pieces  of  information.  Sometimes  the 
metaphor  of  a  puzzle  is  used  in  this  context.  If 
assembled  correctly,  the  different  results  produce 
objective  findings,  that  is,  findings  that  are  devoid 
of  any  form  of  bias.  In  this  context,  John  Brewer 
and  Albert  Hunter  refer  to  not  only  two  kinds  of 
principal  research  methods — qualitative  and  quan¬ 
titative — but  four:  (1)  fieldwork,  (2)  survey 
research,  (3)  experimentation,  and  (4)  nonreactive 
research.  Within  this  framework,  fieldwork  would 
provide  ecologically  valid  and  testable  hypotheses 
relating  to  a  phenomenon  under  investigation,  sur¬ 
vey  research  would  examine  the  extent  to  which 
the  phenomenon  is  generalizable,  experiments 
would  determine  the  causes  of  the  phenomenon, 
and  in  the  nonreactive  portion  of  the  research, 
investigators  would  examine  “naturally  occurring” 
data,  that  is,  data  unbiased  by  the  data  collection 
process.  In  the  end,  all  theories,  data,  results  from 
analyses,  and  different  interpretations  ought  to 
converge.  However,  it  is  unlikely  that  collecting 


many  data  sets,  employing  all  possible  methods, 
and  subjecting  data  collection  and  analyses  to 
all  possible  theoretical  frameworks  will  lead  to  a 
single  valid  answer  to  a  research  question.  Thus, 
this  third  family  of  reasons  for  employing  a  mixed- 
methods  design  is  reminiscent  of  pre-Popperian 
positivism.  Holism  is  unlikely  to  lead  to  successful 
research  outcomes,  because,  among  other  reasons, 
research  methods  do  not  combine  in  the  manner 
envisioned  by  their  proponents  and  because  such 
an  undertaking  would  be  impossibly  complex, 
expensive,  labor  intensive,  and  unfocused. 

Forms  of  Mixed-Methods  Research  Designs 

Creswell  and  his  colleagues  have  specialized  in  the 
identification  of  various  forms  of  mixed-methods 
design.  For  illustrative  purposes,  only  three  fami¬ 
lies  of  mixed-methods  design  are  covered  here. 

Sequential  Mixed-Methods  Design 

Sequential  mixed-methods  design  includes  at 
least  one  qualitative  and  one  quantitative  research 
component,  which  take  place  in  a  temporal  sequence. 
For  example,  the  exploratory  investigation  of  appro¬ 
priate  wording  or  phrasing  of  survey  questions  in 
the  form  of  semistructured  interviews  implies  a 
sequence  of  data  collection  and  analysis,  that  is, 
exploratory  interviews,  analysis  of  the  interviews, 
wording  and  phrasing  of  survey  questions  using 
some  of  the  findings  of  the  interviews,  and,  finally, 
the  survey  research.  It  is  also  possible  to  use  survey 
research  and  interviews  in  a  reverse  order,  that  is, 
analyze  survey  data  and,  based  on  some  of  the  find¬ 
ings,  conduct  exploratory  or  semistructured  inter¬ 
views,  which  may  help  in  interpreting  some  of  the 
findings  from  the  statistical  analysis  of  the  survey 
data.  The  findings  from  these  interviews  could  lead 
to  a  further  link  in  the  sequence  by,  for  instance, 
formally  testing  some  of  the  hypotheses  that  emerge 
from  the  post  hoc  interviews. 

Concurrent  or  Parallel  Mixed- 
Methods  Research  Design 

Concurrent  or  parallel  mixed-methods  research 
design  describes  a  family  of  approaches,  which 
include  relatively  separate  qualitative  and  quanti¬ 
tative  strands  within  one  research  project.  Focus 


1588  Mixed  Methods 


groups  or  nonparticipant  observations  may  produce 
data  for  qualitative  analysis  on  political  decision 
making  with  a  given  set  of  stakeholders.  The  link  of 
these  findings  to  their  voting  records  or  responses  on 
surveys  may  be  fruitfully  explored  in  a  convergent  or 
complementary  mixed-methods  framework. 

Nested  Mixed-Methods  Design 

Nested  mixed-methods  design  describes  a  set  of 
designs  in  which  either  the  qualitative  or  the  quan¬ 
titative  part  of  the  research  project  is  nested  in  the 
quantitative  or  qualitative  part,  respectively.  For 
instance,  the  main  part  of  a  study  may  focus  on  a 
specific  group  of  people  who  refuse  to  vote.  A 
dominant,  qualitative  research  design  may  focus 
on  identifying  the  dimensionality  of  their  justifica¬ 
tions  for  nonvoting  as  well  as  alternative  forms  of 
political  participation.  The  embedded  quantitative 
part  of  the  research  may  quantify  the  extent  of  the 
phenomenon  under  investigation  as  far  as  this  is 
possible  with  available  data,  for  example,  nonvot¬ 
ing  among  a  specific  subgroup  of  a  population  and 
its  relations  to  demographic  characteristics. 

Fields  of  Applications 

The  uptake  of  mixed-methods  research  design  var¬ 
ies  tremendously  between  the  social  and  related 
sciences  as  well  as  between  the  different  fields  of 
interest  within  the  disciplines.  Currently,  the  most 
frequent  applications  can  be  found  in  education, 
public  health,  nursing,  sociology,  media  studies, 
management  studies,  and  evaluation.  Interestingly, 
political  science  and  psychology  are  currently 
underrepresented  in  this  emergent  design,  even 
though,  in  principle,  both  have  well-developed 
qualitative  and  quantitative  traditions.  Other  fields 
not  represented  due  to  a  traditional  preference  for 
qualitative  or  quantitative  methods  are  social 
anthropology,  social  work,  gender  studies,  geogra¬ 
phy,  and  economics.  It  could  be  argued  that  these 
disciplines  are  still  engaged  in  the  oft-proclaimed 
paradigm  war,  where  proponents  on  both  sides  are 
still  attempting  to  argue  the  overall  superiority  of 
one  type  of  method  over  another.  With  the  rise  of 
interdisciplinary  and  transdisciplinary  research, 
however,  more  researchers  are  likely  to  embrace 
mixed-methods  research  designs  within  and  across 
disciplines.  The  few  published  examples  that  can 


be  found  on  topics  relating  to  political  science 
include  reception  studies  of  political  campaign 
messages,  political  alienation  of  voters,  gender  and 
political  power,  social  justice  in  relation  to  action 
research,  and  appropriateness  of  welfare  policy  in 
different  contexts. 

Limitations  and  Misuses 

The  mixed-methods  research  design  has  numerous 
limitations.  First,  it  is  a  far  more  complicated 
research  design  than  a  mono-methods  design.  Not 
only  do  researchers  have  to  engage  in  qualitative 
and  quantitative  research — and  most  empirical 
researchers  are  not  specialized  in  more  than  one 
family  of  methods — but  they  also  have  to  combine 
results  in  a  convincing  way.  Even  or  especially 
when  research  teams  work  in  groups  divided  by 
methods  competence,  the  combination  of  results 
may  turn  out  to  be  nearly  impossible  because  of 
the  lack  of  communication  skills  and  understand¬ 
ing  between  them.  Second,  unfocused  research 
often  leads  to  different  data  collection  and  analy¬ 
sis  possibilities.  Rather  than  finding  or  deciding  on 
a  well-focused  research  question,  many  researchers 
and  especially  research  teams  prefer  to  leave 
options  open  and,  thus,  believe  that  a  mixed- 
methods  design  gives  them  the  justification  and 
tools  to  pursue  unfocused  research.  What  often 
results  from  this  is  a  number  of  disparate  research 
results  around  a  research  theme.  This,  however,  is 
not  mixed-methods  research.  Third,  it  is  often 
believed  that  a  mixed-methods  research  design  can 
bridge  the  gap  between  different  epistemological 
and  ontological  positions.  This,  too,  is  not  the 
case  because  it  does  not  make  sense  to  frame  the 
qualitative  part  of  the  research  in  a  constructivist 
framework  and  the  quantitative  part  in  a  (post-) 
positivistic  framework.  At  fault  here  are  the  prob¬ 
lematic  assumptions  underlying  qualitative  and 
quantitative  research,  which  were,  in  part,  a  result 
of  the  influence  of  French  social  theorists  on 
qualitative  research  from  the  1970s.  Fourth,  and 
connected  to  this  point,  is  the  claim  that  the 
mixed-methods  research  design  is  a  third  para¬ 
digm  and  separable  from  qualitative  and  quantita¬ 
tive  paradigms.  It  is  questionable  whether  the  use 
of  the  term  paradigm  is  suitable  for  the  tremen¬ 
dous  variability  within  the  family  of  qualitative 
methods  or  quantitative  methods.  As,  on  closer 
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inspection,  it  is  extremely  difficult  to  describe  the 
core  characteristics  of  all  qualitative  methods,  on 
the  one  hand,  and  of  all  quantitative  methods,  on 
the  other,  stating  that  mixed-methods  research  is 
systematically  different  due  to  its  ability  to  deal 
with  heterogeneous  theoretical  and  empirical 
approaches  is  unconvincing  to  some  researchers. 
Fifth,  many  texts  on  mixed-methods  research 
inadvertently  and  often  unintentionally  defend  the 
use  of  this  design  to  obtain  more  valid  (in  the 
problematic  sense  of  objective  and  positivistic) 
research  results.  Some  of  the  better  known  forms 
of  biases  may  indeed  be  detected,  using  different 
analysis  methods.  However,  much  of  the  current 
literature  on  this  design  subordinates  qualitative 
exploratory  research  to  quantitative  research 
and  formal  hypothesis  testing.  It  thus  practically 
neglects  the  possibility  of  conducting  mixed- 
methods  research  in  a  constructivist  and  exploratory 
framework. 

Conclusion 

Mixed-methods  research  design  is  a  label  that 
applies  to  a  large  family  of  research  designs  that 
can  integrate  diverse  theoretical  and  empirical 
approaches  in  empirical  research  and  evaluation. 
It  is  suitable  for  all  social  and  related  sciences  but 
not  to  all  research  questions.  Some  research  ques¬ 
tions  will  remain  best  answered  with  a  well- 
focused  mono-method  design.  The  answers  to 
other  research  questions  will  be  enriched  by  the 
possibilities  of  linking  qualitative  and  quantitative 
methods.  But  the  mere  application  of  a  method 
that  is  both  qualitative  and  quantitative  to  explore 
a  research  topic  is  a  necessary  but  insufficient  con¬ 
dition  for  mixed-methods  research.  In  practice, 
the  flexibility  of  this  design  leads  many  inexperi¬ 
enced  researchers  to  conduct  unfocused  research. 
However,  due  to  its  flexibility  and  capacity  to 
enrich  both  mono-  and  interdisciplinary  research, 
it  is  not  surprising  that  this  type  of  research 
approach  and  design  has  enjoyed  so  much  popu¬ 
larity  throughout  the  history  of  the  social  and 
related  sciences.  Now,  at  the  end  of  the  second 
generation  of  mixed-methods  research,  which 
mainly  aimed  at  incorporating  assumptions  about 
the  possibilities  and  limits  of  mono-method 
approaches,  documenting  different  types  of  mixed- 
methods  design,  and  cataloguing  good-practice 


examples,  the  next  generation  of  mixed-methods 
literature  is  likely  to  turn  away  from  carrying  bag¬ 
gage  left  over  from  the  paradigm  wars  and 
attempting  to  address  some  of  the  inconsistencies 
by  referring  to  a  vague  form  of  pragmatism. 
Instead,  mixed-methods  research  is  likely  to 
become  even  more  dominant  in  applied  research 
within  and  between  fields,  playing  out  its  strength 
particularly  in  inter-  and  transdisciplinary 
research. 
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Political  mobilization  is  the  process  of  organizing 
groups,  social  networks,  crowds,  and  social  units 
for  political  goals.  Challengers  to  the  status  quo 
mobilize  to  redress  grievances  and  for  reforms 
when  conventional  political  institutions  fail  to 
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respond.  Regimes  mobilize  to  realize  a  political 
program,  subdue  adversaries,  and  counter  mobili¬ 
zation  by  challengers  when  they  need  popular  sup¬ 
port.  Resources  for  mobilization  are  manpower 
and  funds  (often  from  volunteers),  commitment 
and  solidarity  of  supporters,  and  shared  beliefs  and 
values.  By  mobilizing,  a  group  gains  power  against 
an  adversary.  Political  goals  tend  to  be  collective 
goods  whose  attainment  in  large  groups  entails 
organization  costs  and  free-rider  tendencies,  as 
Mancur  Olson  (1965)  theorized  for  interest  groups 
and  voluntary  associations.  When  political  mobili¬ 
zation  is  countered  by  the  authorities  with  repres¬ 
sion,  obstacles  are  even  higher.  Nevertheless,  in  a 
1990  empirical  study  of  political  mobilization  by 
53  challenging  groups  in  the  19th-  and  20th- 
century  United  States,  William  Gamson  found  that 
the  Olson  theory  is  a  useful  starting  point  for 
explaining  both  conventional  lobbying  and  uncon¬ 
ventional  modes  of  collective  action.  Charles  Tilly 
(1978),  Anthony  Oberschall  (1993),  Sidney  Tarrow 
(1994),  Bert  Klandermans  (1997),  and  Mayer  Zald 
and  John  McCarthy  (1987)  have  added  key 
insights,  terms,  and  processes  to  the  theory  that 
have  increased  its  explanatory  power. 

There  are  hundreds  of  empirical  studies  of 
political  mobilization  in  particular  instances,  many 
of  ethnic  and  religious  minorities  around  the  world 
and  many  others  of  advocacy  groups  (e.g.,  envi¬ 
ronmentalist,  feminist,  and  antiabortion).  Two 
core  dimensions  that  provide  theoretical  coherence 
to  these  studies  are  micromobilization  and  mobi¬ 
lizing  structures. 

Micromobilization 

Micromobilization  explains  participation  with  five 
pivotal  variables:  P,  V,  N,  S,  and  C: 

P  is  the  expected  probability  of  success  for  collective 
action; 

V  is  the  value  of  the  goal,  collective  good,  or  issue 
for  the  participant  (e.g.,  justice  for  a  minority); 

N  is  the  expected  number  of  participants  or 
contributors; 

S  is  the  selective  incentive  or  personal  benefit  from 
participation;  and 

C  is  the  expected  cost  of  participation. 


The  theory  hypothesizes  that  for  participants,  P 
times  V  (value  expectancy  or  benefit)  plus  S  is 
greater  than  C;  that  is,  PV  +  S  —  C  >  0. 

V  is  distributed  in  a  population  from  the  most 
positive  values  (true  believers)  to  most  negative  (oppo¬ 
nents).  V  is  not  necessarily  a  personal  experience — for 
example,  justice  for  a  minority  may  be  valued  by 
some  who  are  themselves  not  targets  of  injustice. 
V  is  fought  over  in  the  court  of  public  opinion  by 
adversaries  who  package  issues  in  media  frames 
for  persuasive  communication.  Framing  situates 
and  connects  events,  people,  and  groups  into  a 
meaningful  narrative  in  which  beliefs  and  explana¬ 
tions  are  communicated  persuasively  using  meta¬ 
phors,  catchphrases,  and  symbols.  It  colors  judg¬ 
ment  about  the  truth  of  beliefs  and  legitimacy  of 
causes. 

N,  the  expected  number  of  participants,  enters 
the  expression  because  both  P(N)  and  C(N)  are  a 
function  of  N:  Expectation  of  success  increases  and 
expected  cost  decreases  (“safety  in  numbers”)  with 
the  number  of  participants.  P  and  C  functions, 
called  production  functions,  are  not  linear  with  N. 
Their  shape  varies  with  the  type  of  collective  action 
and  other  circumstances,  and  they  influence  the 
strategies  and  tactics  of  confrontation  used  by  the 
adversaries.  Some  tactics  require  only  small  num¬ 
bers  of  participants  for  achieving  what  participants 
define  as  “success” — for  example,  a  vigil  by  dissi¬ 
dents.  Others  require  huge  numbers,  sometimes  in 
the  hundreds  of  thousands,  to  achieve  success,  as  in 
the  case  of  a  petition  drive  or  a  demonstration. 
Critical  mass  is  a  number  or  range  of  turnout  at 
which  the  participants  and  opponents  judge  the 
collective  action  to  be  successful.  Turnout  is  there¬ 
fore  contested  by  the  adversaries,  who  engage  in  a 
battle  over  numbers  in  the  court  of  public  opinion. 

S  stands  for  rewards  and  benefits  that  are 
obtained  by  participation  that  free  riders  don’t  get. 
For  challengers,  S  tends  to  be  social,  moral,  and 
ideological  rewards  of  standing  up  for  a  cause.  C 
is  the  cost  in  time  and  resources  of  contribution  to 
collective  action  and  expected  cost  due  to  social 
control  such  as  arrest  and  marginalization.  In  con¬ 
trast,  regime  mobilization  is  well  placed  to  lower  C 
and  increase  S  with  material  and  political  benefits 
to  supporters.  At  an  extreme,  repressive  regimes 
target  adversaries,  with  license  to  their  followers  to 
attack  and  loot  (S  is  positive)  adversaries  without 
police  interference  (C  is  zero). 
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The  five  variables  are  socially  constructed  by 
the  challengers  and  their  adversaries  during  con¬ 
tention  and  confrontation.  If  V  is  a  collective  good 
such  as  nondiscrimination  legislation,  the  motiva¬ 
tion  for  V  comes  for  varied  social  psychological 
reasons:  anger  at  injustice,  belief  that  justice  for  all 
is  a  moral  imperative,  solidarity  with  an  oppressed 
group,  personal  experience  of  discrimination,  and 
others.  Activists  argue  and  publicize  their  cause 
through  moral  and  ideological  appeals  to  increase 
V.  Because  social  and  political  movements  have 
limited  resources  to  provide  tangible,  selective 
incentives  for  participants,  they  build  a  sense  of 
community  among  kindred  souls  that  makes  par¬ 
ticipation  itself  a  source  of  satisfaction  (increase  S). 
Ritual,  drama,  and  entertainment  provided  by 
rock  bands  and  celebrities  sharing  the  stage  at 
mass  gatherings  build  a  sense  of  community  and 
reward  those  who  turned  out  with  a  memorable 
public  spectacle  that  free  riders  don’t  get.  Survey 
research  on  participants  in  diverse  settings  has 
found  that  such  moral  appeals  and  selective  incen¬ 
tives  overcome  free  riding  and  boost  participation. 

To  apply  micromobilization,  consider  dissidents 
in  a  repressive  regime.  There  are  few  members  (N 
is  low).  They  mount  a  vigil  or  some  other  symbolic 
protest  in  a  crowded  square  during  rush  hour. 
Only  a  few  participants  are  needed  to  unfurl  a 
banner  or  hold  up  signs  before  they  are  arrested  ( C 
is  high).  They  define  success  as  visibility  to  many 
people  and  the  media:  P  as  a  function  of  N 
increases  steeply  and  is  close  to  1  for  small  N.  A 
dozen  well-placed  protesters  can  do  it;  two  or 
three  may  not  be  enough;  more  than  a  dozen  does 
not  add  to  visibility  and  leads  only  to  unnecessary 
arrests.  V,  the  value  for  them  of  standing  up  for 
freedom  or  a  cause,  is  very  high.  S  consists  princi¬ 
pally  of  the  respect  and  social  standing  earned 
among  their  peers  for  courage.  Thus,  PV  plus  S 
minus  C  for  small  N  can  be  positive  for  the  vigil. 
Contrast  symbolic  protest  to  a  petition  campaign 
in  a  democratic  society.  Signing  the  petition  is 
close  to  cost  free  to  the  signer.  For  success,  N  has 
to  be  large  (possibly  tens  of  thousands)  for  a  posi¬ 
tive  effect  on  the  public,  the  media,  and  legislators. 
P  as  a  function  of  N  is  a  J  curve.  Because  S  for 
signing  is  low,  organizers  may  give  away  a  free  pen 
or  sticker  with  an  icon  as  a  token  to  boost  S. 
Because  many  signers  are  needed,  the  organizers 
can’t  simply  sign  up  the  high-V  public  but  must 


appeal  to  the  mid-  to  low- V  sympathizers  through 
advocacy  and  create  high  N  expectations. 
Opponents  will  attempt  to  confuse  the  public  by 
claiming  that  the  petition  has  all  sorts  of  hidden 
costs  and  undesired  consequences:  V  is  not  what 
the  advocates  claim,  and  there  are  negative  C. 
These  are  two  quite  different  modes  of  political 
mobilization,  in  two  very  different  political  set¬ 
tings,  for  which  micromobilization  applies.  Other 
types  of  collective  action  have  their  own  produc¬ 
tion  functions,  such  as  for  demonstrations,  inclined 
S  shape,  and  P  increasing  with  N,  then  decreasing 
beyond  a  critical  mass.  Whether  one  analyzes  peti¬ 
tion  drives,  strikes,  marches,  demonstrations, 
sit-ins,  building  occupations,  selective  buying 
campaigns,  contributions  for  a  cause,  or  vigils,  the 
same  five-variable  micromobilization  model  for 
participation  in  collective  action  is  salient. 

Mobilizing  Structure 

Olson  assumed  that  obtaining  a  collective  good 
voluntarily  in  a  large  group  requires  considerable 
costs  of  organizing  and  that  collective  good  incen¬ 
tives  are  not  sufficient  for  participation.  Coercion 
is  needed  for  supplying  collective  goods.  Coercive 
powers  such  as  a  state  enjoys  with  compulsory 
taxation  is,  however,  beyond  the  reach  of  volun¬ 
tary  groups. 

The  1960s  witnessed  an  outburst  of  social  and 
political  movements  in  the  United  States  and 
Western  Europe  that  succeeded  in  mobilizing  large 
numbers  of  participants  in  large  populations:  the 
civil  rights  movement;  the  anti-Vietnam  War  and 
other  peace  movements;  youth  and  counterculture 
movements;  environmentalist,  feminist,  and  ethnic 
minority  movements,  and  so  on.  Empirical  study 
of  these  movements  discovered  a  variety  of  mobi¬ 
lizing  structures  that  were  omitted  in  Olson’s  the¬ 
ory.  Large  populations  are  not  made  up  of  isolated 
persons  who  had  to  be  recruited  at  a  considerable 
cost  by  a  large  organization.  On  the  contrary, 
many  populations  are  thick  with  associations, 
communities,  groups,  and  networks.  There  exist 
structures  with  leaders,  members,  internal  links, 
communication  networks,  and  resources  for  col¬ 
lective  action,  such  as  African  American  churches 
and  the  student  governments  in  historically  black 
colleges  in  the  South.  The  leadership  and  members 
of  such  groups  committed  to  political  goals  in  the 
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1960s  (e.g.,  ending  segregation)  and  joined  local 
autonomous  units  such  as  churches  or  campus 
student  associations  into  a  federated  overarching 
structure  called  a  social  movement  organization 
(SMO),  as  the  Southern  Christian  Leadership 
Conference  and  the  Student  Nonviolent  Coord¬ 
inating  Committee  became.  Preexisting  structures 
make  federation  into  an  SMO  possible.  “Bloc” 
recruitment  takes  place;  that  is,  blocs  of  persons 
already  networked  and  sharing  an  identity  are 
recruited,  instead  of  individuals  one  at  a  time. 
Shared  identity  activates  a  “grievance  multiplier” 
(V increases)  when  a  perceived  injustice  to  a  mem¬ 
ber  of  a  group  or  category  is  experienced  as  a  col¬ 
lective  injustice  by  others  as  well.  Resources  such 
as  funds  for  action  campaigns,  leadership  training 
centers,  volunteer  programs,  and  the  like  are  pro¬ 
vided  by  nonbeneficiary  “outsiders”  who  are 
moved  by  their  values  and  conscience  to  support  a 
cause.  Federated  structures,  bloc  recruitment, 
grievance  multipliers,  and  nonbeneficiary  support 
lower  organization  costs  and  weaken  free-rider 
tendencies. 

Other  research  on  the  anti-Vietnam  war  and 
New  Left  student  and  counterculture  movements 
found  that  dynamic  mobilization  emerges  during 
confrontations  in  a  loose  structure  that  adapts  to 
opportunities  against  opposition.  Loosely  struc¬ 
tured  collective  action  characterizes  a  coalition  of 
activists,  part-timers,  and  sympathizers  whose 
boundaries  are  ill-defined  and  shifting;  who  lack  a 
shared,  central  leadership  and  organization;  and 
who  use  the  mass  media  as  a  tool  for  communicat¬ 
ing  to  potential  supporters  in  lieu  of  a  strong  grass¬ 
roots  base.  In  a  loose  structure,  mobilization  costs 
are  kept  low  through  a  division  of  labor  that  links  a 
small  leadership  cadre  of  full-time  activists  to  a 
much  larger  pool  of  part-time  teams  who  contribute 
their  presence  episodically  in  short-lived  campaigns 
for  a  cause — for  example,  a  campus  sit-in.  Supporting 
them  is  a  larger  “conscience  constituency”  that  con¬ 
tributes  funds  and  public  opinion  support.  Beyond 
them  is  a  much  larger  bystander  public  that  is  atten¬ 
tive  to  the  issue  but  initially  indifferent  or  nonparti¬ 
san.  Loose  structure  makes  for  low-cost  mobilization 
and  flexible  tactics  as  the  leadership  responds  to  its 
adversary’s  moves:  shifting  from  confrontations 
with  transitory  teams  to  legislative  lobbying,  to 
fund  raising  for  the  next  confrontation  campaign, 
and  peaking  with  a  massive  peaceful  gathering  for 


which  the  conscience  constituency  and  bystanders 
are  mobilized  in  addition  to  the  activists. 

The  effectiveness  of  political  mobilization  has 
been  empirically  studied,  most  completely  in  the 
Netherlands  with  respect  to  new  social  movements 
(NSM),  such  as  the  peace,  women’s,  and  environ¬ 
mental  movements.  Public  opinion  polls  measure 
the  sentiment  pool  for  an  issue  and  identify  the 
sympathizers.  When  activists  organize  a  campaign 
for  a  petition,  a  march,  or  other  collective  action, 
survey  research  measures  what  proportion  of  sym¬ 
pathizers  was  reached,  was  motivated  to  act,  and 
actually  participated.  The  overall  effectiveness  of 
the  campaign  can  thus  be  measured  as  well  as  that 
of  various  targeting  tactics  such  as  door-to-door 
canvassing,  telephoning,  media  appeals,  and  so  on. 
Further  statistical  analysis  clarifies  the  effects  of  V, 
S,  C,  and  P  and  N  on  participation  and  which 
components  of  mobilizing  structure  are  effective. 
Hanspeter  Kriesi  (1993)  compared  the  effective¬ 
ness  of  mobilization  across  five  NSMs  and  com¬ 
pared  NSM  mobilization  with  that  of  political 
parties  and  trade  unions.  Among  his  conclusions  is 
that  political  values,  V,  are  the  crucial  determi¬ 
nants  of  the  level  of  mobilization. 

Diffusion 

Large-scale  collective  action  has  taken  place  in 
autocratic  regimes  that  curb  opposition  and  has  on 
occasion  toppled  regimes  long  thought  to  be  pow¬ 
erful,  as  in  Eastern  Europe  in  1989.  How  can  tens 
of  thousands  of  people  coordinate  their  actions 
without  a  central  organizing  mechanism?  Protests 
spontaneously  spread  from  the  capital  to  other  cit¬ 
ies  and  from  one  country  to  another,  and  the 
authorities  were  helpless  to  stop  the  diffusion. 
Diffusion  was  also  typical  in  the  1960s  movements 
of  the  United  States,  when  students  seized  campus 
buildings  at  many  universities  after  the  initial 
Columbia  University  student  protests  in  the  spring 
of  1968,  the  Harvard  protest  a  year  later,  and  the 
diffusion  of  the  African  American  sit-in  movement 
in  1960  throughout  the  South. 

To  understand  diffusion,  further  specification 
of  underlying  mechanisms  for  micromobilization 
and  mobilizing  structures  are  needed:  focal  point, 
repertoire  of  collective  action,  signaling,  paceset¬ 
ter-follower  dynamics,  political  opportunity,  and 
assurance  or  uncertainty  reduction.  Focal  point  is 
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the  key  to  understanding  coordination  of  action 
with  only  minimal  communication  and  without 
leadership.  It  requires  common  knowledge  embed¬ 
ded  in  political  culture — for  example,  knowledge 
by  citizens  that  in  the  past,  for  momentous  politi¬ 
cal  events,  crowds  gathered  at  a  certain  date  in  a 
historic  square  demanding  freedom  or  reforms. 
They  also  know  what  flags,  songs,  and  other  sym¬ 
bols  to  carry  and  sing,  signaling  unity  of  commit¬ 
ment  and  purpose.  The  shared  symbols  and  behav¬ 
ior  codes  that  are  embedded  in  the  political  culture 
and  well  known  to  participants  is  called  the  reper¬ 
toire  of  collective  action.  The  important  matter  is 
that  not  only  do  most  people  know  the  focal  point 
and  repertoire,  but  they  also  know  that  others 
know  the  same  thing  and  will  respond  in  the  same 
way  to  “when  and  where  we  all  assemble,  and 
what  we  do  there.” 

The  contemporary  protest  repertoire  par  excel¬ 
lence  of  large  crowds  is  the  demonstration  and 
march  that  winds  its  way  through  the  major  ave¬ 
nues  and  landmarks  of  the  capital  city.  Repertoires 
create  expectations  for  the  adversaries  and 
bystander  public  and  provide  some  predictability 
for  action;  yet  some  spontaneous  actions  also  take 
place.  New  repertoires  are  invented  and  tailor- 
made  for  specific  purposes. 

Collective  action  depends  on  expectations  for  P, 
C,  and  N,  which  are  uncertain.  To  trigger  conver¬ 
gence  on  a  focal  point  and  activate  the  repertoire, 
what  is  needed  is  a  political  opportunity  signal  to 
many  people  simultaneously  that  N  is  expected  to 
be  large,  P  is  increasing,  and  C  is  decreasing.  The 
signal  might  be  a  lack  of  support  for  a  regime  by 
an  important  ally,  as  when  the  Soviet  premier 
Mikhail  Gorbachev  signaled  in  1989  to  East 
European  communist  regimes  that  they  were  on 
their  own,  or  a  successful  challenge  in  neighbor 
states,  as  when  Prague  crowds  witnessed  the  col¬ 
lapse  next  door  of  the  East  German  regime.  When 
political  opportunity  is  judged  to  be  favorable,  a 
potential  participant  who  estimates  P,  C,  and  N 
has  some  assurance  that  he  or  she  will  not  be  at  the 
demonstrations  with  only  a  few  others.  Research 
has  shown  that  an  assurance  process  takes  place  in 
neighborhood,  peer,  work,  and  civic  groups  whose 
members  communicate  with  one  another  and  build 
confidence  and  commitment  to  participation.  The 
assurance  process  has  also  been  conveyed  in  recent 
years  on  websites,  through  cell  phones  and  e-mail, 


text  messaging,  and  other  modes  of  electronic 
communication.  Assurance  reduces  uncertainty, 
which  inhibits  collective  action.  Thus,  the  turnout 
is  thousands,  not  just  a  few. 

When  protest  starts  at  a  focal  point,  it  diffuses  to 
other  places  and  social  units  with  pacesetter-fol¬ 
lower  dynamics.  By  virtue  of  history,  culture,  and 
geography,  a  mobilizing  structure  has  a  center- 
periphery  structure:  What  happens  in  the  capital  is 
a  wake-up  call  to  provincial  cities,  and  what  hap¬ 
pens  at  a  prominent  university  campus  triggers  fol¬ 
lowers  in  peripheral  colleges.  Diffusion  is  explained 
by  changing  expectations  about  P,  C,  and  N  by 
potential  followers  depending  on  the  outcome  of 
confrontations  at  the  pacesetter  and  at  other  sites 
that  have  already  acted.  The  diffusion  dynamic 
explains  the  spatial  and  temporal  sequence  of  col¬ 
lective  action  because  proximity  and  similarity  are 
salient  signals  for  expectations.  The  waxing  and 
waning  of  participation  at  a  particular  site,  be  it  the 
center  or  periphery,  can  also  be  thus  modeled. 

Violence  Specialists 

Much  political  mobilization  researched  in  the 
1960s  through  the  1980s  was  that  of  social  and 
political  movements  using  unconventional  modes 
of  collective  action  that  were  by  and  large  nonvio¬ 
lent.  Since  the  end  of  the  Cold  War,  there  have 
been  many  insurgencies  for  the  overthrow  of  a 
government  by  violent  means  and  terrorism.  In 
response,  regimes  mobilized  paramilitaries  and 
special  forces  that  not  only  fought  armed  enemies 
but  assaulted  the  civilian  population  suspected  of 
sympathy  with  the  opposition.  The  issues  at  con¬ 
tention  were  state  formation,  secession,  political 
autonomy,  and  power  sharing  in  ethnically  deeply 
divided  societies.  Research  has  found  that  micro¬ 
mobilization  and  mobilization  structure  models 
explain  insurgent  and  regime  mobilization  in  vio¬ 
lent  conflicts. 

Ethnic  and  religious  communities  provide  a 
mobilizing  infrastructure  of  ethnic  associations, 
kin  networks,  churches,  mosques  or  temples,  reli¬ 
gious  schools,  funding  and  welfare  organizations, 
an  ideology,  a  shared  culture,  and  leaders. 
Insurgents,  fighters,  and  terrorists  are  recruited, 
trained,  armed,  and  indoctrinated  within  this 
infrastructure,  get  social  support,  and  are  shielded 
from  the  authorities.  Resources  are  obtained  from 
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external  sponsors,  including  states  that  provide 
weapons,  funding,  safe  places,  and  bases  across  a 
border.  Just  as  for  peaceful  movements,  the 
Internet,  websites,  and  electronic  modes  of  com¬ 
munication  have  made  loose  mobilizing  structures 
possible. 

Studies  of  insurgents  and  terrorists  cannot  rely 
on  survey  research  with  participants  and  on  other 
routine  social  science  methods.  Nevertheless,  court 
records  from  trials,  interrogations  of  captured 
militants,  informers,  captured  data  from  computer 
hard  drives,  and  confessions  from  those  who  give 
up  insurgency  enable  one  to  piece  together  the 
contours  of  micromobilization  and  mobilizing 
structure.  Marc  Sageman  (2004)  reports  that  for 
more  than  400  A1  Qaeda  terrorists,  the  conversion 
to  jihad  was  not  a  solitary  decision  but  came  about 
through  interpersonal  relations  and  with  social 
support.  Jihadists  formed  links  with  others  like 
themselves  and  congregated  in  the  same  mosques, 
students  associations,  neighborhood  clubs,  and 
Islamic  bookshops  and  often  lived  together.  In 
these  local  communities,  they  became  radicalized 
militants  who  advocated  the  violent  overthrow  of 
corrupt  Arab  regimes  and  other  Islamist  causes. 
These  findings  are  in  line  with  existing  models  and 
theories. 

Mobilization  of  regime  by  violence  specialists 
for  ethnic  cleansing,  mass  purges,  and  killings 
rests  on  a  four-legged  structure:  (1)  a  radical  elite 
and  cadres  running  a  party-state,  (2)  bands  of 
armed  militants  and  paramilitaries,  (3)  core  con¬ 
stituencies  who  vote  for  the  regime  and  keep  it  in 
power,  and  (4)  a  bystander  public  swayed  or  con¬ 
fused  by  propaganda.  The  armed  bands,  who 
perpetrate  much  of  the  violence  against  civilians, 
are  recruited,  organized,  financed,  trained,  and 
indoctrinated  by  political  parties  and  state  security 
forces.  The  pattern  is  top-down  mobilization  by 
the  regime  elites  for  violence  specialists  and  bot¬ 
tom-up  for  the  core  constituency  and  bystander 
public.  The  armed  bands  become  distinct  and 
privileged  formations  that  serve  the  regime  more 
reliably  and  loyally  than  the  regular  police  and 
army.  Their  members  are  attracted  by  the  racist 
and  extreme  nationalist  ideologies  of  leaders  who 
manipulate  perceived  threats,  fears,  and  hate 
against  target  groups,  often  minorities.  Although 
opportunists  are  attracted  by  the  prospects  for 
loot  (S),  many  are  convinced  by  propaganda  that 


their  cause  is  patriotic  and  morally  justified  (V) 
and  that  a  preemptive  strike  against  unarmed 
civilians  of  the  target  groups  is  actually  saving 
their  society  from  subversion  and  destruction.  An 
example  is  the  Interahamwe  village  militias  orga¬ 
nized  by  “Hutu  Power”  leaders  in  the  Rwanda 
genocide.  Similarly,  paramilitary  bands  in  the 
Yugoslav  Wars  were  organized  by  political  par¬ 
ties,  criminal  groups,  local  crisis  committees,  and 
the  authorities  and  were  responsible  for  much 
ethnic  cleansing  and  massacres  of  civilians. 

According  to  Donald  Horowitz  (2001),  studies 
of  ethnic,  religious,  and  sectarian  riots  around  the 
world  (e.g.,  Hindu-Muslim  riots  in  Indian  cities) 
found  a  similar  pattern  of  top-down,  bottom-up 
mobilization  as  for  insurgents  and  for  regime 
mobilization.  Political  parties,  factions,  individual 
politicians,  extremist  organizations,  and  secret 
societies  organize  criminal  gangs  and  train  young 
men  and  women  for  fighting;  these  people  are  in 
the  forefront  of  rioting,  burning,  and  looting  shops 
and  attacking  innocent  people.  These  bands  are 
joined  by  ordinary  people  who  attack  a  target 
group,  burn  their  homes  and  shops,  and  commit 
atrocities  in  passionate  killings. 

Elaborations  of  micromobilization  and  mobiliz¬ 
ing  structure  models  have  to  be  made  in  these 
applications.  The  social  psychology  of  motivation 
for  a  suicide  attacker  is  going  to  be  more  complex 
than  for  a  petition  signer  or  a  protest  marcher. 
Clandestine  insurgent  organization  is  going  to  be 
different  and  more  complex  than  a  legal  associa¬ 
tion  that  advocates  a  clean  environment.  Never¬ 
theless,  the  five  pivotal  variables  (P,  V,  N,  S,  and 
C),  focal  points,  signaling,  uncertainty  reduction, 
diffusion  dynamic,  loose  structure,  external  allies, 
and  other  key  terms  and  processes  in  political 
mobilization  are  useful  for  theory  development  for 
all  manner  of  mobilization,  covering  both  demo¬ 
cratic  polities  and  autocratic  regimes,  by  regimes 
and  by  opponents  of  the  authorities,  using  uncon¬ 
ventional  modes  of  contention  and  confrontation, 
both  nonviolent  and  violent. 
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Model  Specification 


Model  specification  refers  to  the  process  of 
expressing  a  theory  in  mathematical  (functional) 
form.  The  choice  of  model  specification  affects  the 
validity  of  causal  inferences.  Arguably,  severe 
specification  errors  impede  the  validity  of  infer¬ 
ences  more  than  the  choice  of  a  suboptimal  esti¬ 
mator.  Yet  applied  researchers  rarely  follow  a 
strategy  when  trying  to  develop  and  potentially 
improve  the  specification  of  their  model.  Rather, 
the  choice  of  a  model  specification  usually  depends 
on  a  crude  mixture  of  what  is  common  in  the  field 
of  research,  methodological  fads,  and  individual 
intuition.  Misspecification  occurs  when  the 
assumptions  underlying  an  empirical  analysis 
deviate  from  the  true  data-generating  process. 
Nonrandom  sampling,  measurement  error,  model 
uncertainty,  and  a  lack  of  independence  of  obser¬ 
vations  rank  most  prominently  among  the  sources 


for  model  misspecification.  Since  researchers 
hardly,  if  ever,  know  the  true  model,  most  models 
analyzed  in  the  social  sciences  will,  necessarily,  be 
misspecified.  The  most  common  misspecifications 
result  from  insufficient  theoretical  guidance.  The 
resulting  model  uncertainty  usually  implies  that 
researchers  do  not  know  the  correct  set  of  regres¬ 
sors  (independent  variables),  their  optimal  opera¬ 
tionalization,  or  the  accurate  functional  form. 
This  entry  discusses  the  origins  and  types  of 
specification  problems  and  reviews  the  standard 
solutions  to  misspecification.  The  entry  concludes 
with  a  discussion  of  the  way  in  which  researchers 
deal  with  specification  issues. 

Model  Uncertainty 

Model  uncertainty  results  from  underspecified 
theories.  In  the  social  sciences,  theories  aim  at  sim¬ 
plifying  reality,  therefore  making  generous  use  of 
ceteris  paribus  clauses  and  of  highly  stylized 
assumptions.  In  other  words,  theories  neither  pro¬ 
vide  full  guidance  about  model  specification  to 
applied  researchers  nor  do  they  aim  at  providing 
such  information.  Usually,  they  do  not  say  much 
about  which  variables  to  include  in  the  list  of 
regressors,  which  functional  form  relates  the  right- 
hand-side  variables  to  the  dependent  variable,  the 
existence  of  conditional  effects,  and  the  existence 
and  correct  specification  of  temporal  and  spatial 
dependence.  This  section  discusses  the  resulting 
specification  problems  in  turn. 

Model  Uncertainty  and  Sensitivity  Analysis 

Model  uncertainty  has  multiple  origins;  it  occurs 
when  researchers  do  not  know  the  correct  set  of 
right-hand-side  variables,  the  correct  functional 
form  that  relates  the  dependent  to  the  independent 
variables,  the  correct  structure  of  conditionality 
between  exogenous  variables  (those  whose  values 
are  independent  of  the  states  of  other  variables  in 
the  system  under  study),  and/or  the  causal  depen¬ 
dences  between  exogenous  variables,  to  mention 
just  a  few  model  uncertainties.  Logically,  research¬ 
ers  need  to  start  with  identifying  all  exogenous 
factors  that  influence  the  variation  of  the  depen¬ 
dent  variable  across  observations.  Both  the  inclu¬ 
sion  of  too  few  or  too  many  regressors  may  cause 
bias  and  will  render  inferences  potentially  invalid. 
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Assume,  as  an  example  for  the  “too  many”  case, 
that  researchers  intend  to  estimate  the  effect  of 
income  on  vote  choice  but  include  the  vote  inten¬ 
tion  into  the  model.  Since  vote  intentions  depend 
on  structural  factors  such  as  income  and  educa¬ 
tion,  the  inclusion  of  the  vote  intention  variable 
causes  right-hand-side  endogeneity  (i.e.,  regressors 
may  not  be  independent  of  the  dependent  variable 
or  the  error  term;  see  below).  If  this  endogeneity  is 
not  correctly  modeled,  it  makes  the  correct  inter¬ 
pretation  of  results  impossible.  Indeed,  the  vote 
intention  variable  will  capture  much  of  the  influ¬ 
ence  of  income  and  education  on  vote  choice,  but 
researchers  cannot  separate  the  two  causal  mecha¬ 
nisms  unless  they  estimate  a  simultaneous  equa¬ 
tion  model  or  unless  they  drop  vote  intention  from 
the  list  of  regressors,  hoping  that  income  and  edu¬ 
cation  fully  determine  vote  intentions. 

In  the  absence  of  perfect  knowledge  of  the  true 
set  of  explanatory  variables,  researchers  may  con¬ 
duct  a  sensitivity  analysis  to  validate  inferences.  A 
sensitivity  analysis  distinguishes  three  types  of 
explanatory  variables:  (1)  the  variable(s)  of  inter¬ 
est,  (2)  consensual  determinants,  and  (3)  contested 
determinants.  Researchers  then  follow  a  quasi- 
Monte  Carlo  approach  in  which  they  estimate  a 
large  number  of  models,  which  always  include  the 
first  two  sets  of  variables  and  randomly  draw  a 
fixed  number  of  “contested  determinants”  (typi¬ 
cally  three)  into  each  single  model.  While  method¬ 
ologists  agree  on  the  structure  of  these  sensitivity 
tests,  they  disagree  on  the  criterion  under  which  an 
effect  may  be  considered  to  be  robust.  Suggestions 
include  the  condition  that  an  effect  is  robust  if  and 
only  if  the  estimated  coefficients  of  all  iterations 
have  the  same  sign  and  remain  significantly  differ¬ 
ent  from  zero.  A  more  generous  definition  ignores 
the  significance  criterion.  Finally,  an  even  more 
relaxed  definition  of  robustness  in  sensitivity  tests 
accepts  an  effect  as  robust  if  it  generates  the  same 
sign  in  95%  of  the  iterations.  Unfortunately,  none 
of  these  criteria  are  fully  convincing  since  the  esti¬ 
mation  will  almost  certainly  be  misspecified  in  a 
large  number  of  the  iterations.  For  example,  when 
(at  least)  two  regressors  are  highly  correlated,  the 
parameter  estimate  will  become  inefficient,  and  the 
point  estimate  can  produce  almost  any  result.  As  a 
consequence,  sensitivity  analyses  are  telling  when 
they  bring  about  robust  results,  but  they  are  not 
when  they  do  not.  If  a  hypothesis  does  not  find 


robust  support,  it  is  not  necessarily  wrong.  There 
may  just  have  been  too  many  misspecified  models 
among  the  iterations. 

Functional  Form  and  Conditionality 

Exceptions  may  exist,  but  the  huge  majority  of 
social  science  theories  do  not  predict  the  functional 
form  of  an  effect.  The  absence  of  theoretical  guid¬ 
ance  stimulates  empirical  researchers  to  simply 
assume  a  linear  function  or  to  take  the  logarithm 
of  either  the  independent  variable  alone  or  of  both 
the  independent  and  dependent  variables.  While 
this  arbitrariness  gives  researchers  some  influence 
over  estimation  results,  the  true  functional  form 
can  greatly  differ  from  all  the  above  standard 
assumptions.  Curve  fitting  offers  an  obvious,  but 
inherently  a  theoretical,  way  to  deal  with  this 
problem.  Adding  the  squared  term  and  possibly 
the  cubic  term  (and  so  on)  of  a  variable  to  the  list 
of  regressors  allows  for  a  sufficiently  uncon¬ 
strained  functional  form  that  allows  the  right- 
hand-side  variables  to  “fit”  the  variance  of  the 
dependent  variable.  Yet  curve  fitting  not  only  leads 
to  overconfidence  because  it  shrinks  standard 
errors,  but  it  can  also  result  in  vastly  wrong  esti¬ 
mates  if  the  functional  form  is  not  the  only  specifi¬ 
cation  problem.  In  the  presence  of  other  mis- 
specifications,  the  higher  power  terms  will 
undoubtedly  pick  up  some  variance  of  the  other 
specification  problem,  thereby  vastly  increasing 
the  bias  of  the  estimate. 

Whether  or  not  a  variable  influences  the  depen¬ 
dent  variable  often  depends  on  certain  conditions. 
For  example,  the  response  to  international  tax 
competition  depends  on  the  degree  to  which  polit¬ 
ical  institutions,  other  political  actors,  and  the  vot¬ 
ers  constrain  the  government.  Failure  to  model 
conditionality  has  the  same  consequences  as  omit¬ 
ted  variable  bias  and  biases  at  least  the  conditional 
variables.  Researchers  can  easily  mistake  a  mis¬ 
specified  functional  form  or  an  omitted  variable 
for  conditionality,  as  these  misspecifications  have 
very  similar  consequences  for  the  residuals.  If 
researchers  cannot  distinguish  between  a  misspeci¬ 
fied  functional  form  and  an  unmodeled  condition¬ 
ality,  a  semiparametric  analysis  of  the  potentially 
misspecified  variables  may  help.  In  such  an  analy¬ 
sis,  researchers  estimate  a  model  using  dummies  of 
different  combinations  of  variables  x  and  z  rather 
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than  including  the  two  variables  and  their  product 
separately.  For  example,  if  both  variables  x  and  z 
can  be  conveniently  split  into  three  categories 
(low,  middle,  and  high)  each,  researchers  either 
include  these  nine  categories  and  estimate  without 
intercept  or  include  eight  of  the  nine  categories  and 
estimate  with  intercept. 

Incomplete  Models  and  Unit  Heterogeneity 

Omitted  variables  usually  reveal  their  existence 
in  the  form  of  structure  in  the  residuals,  suggesting 
that  the  Gauss-Markov  conditions  have  been 
violated.  Panel  data  help  detect  structure  in  the 
residuals  because  errors  must  be  simultaneously 
independent  and  identically  distributed  across 
both  space  and  time,  which  is  more  demanding 
than  the  same  assumption  for  either  simple  cross- 
sectional  or  single  time-series  data.  While  structure 
in  the  residuals  indicates  model  misspecification,  it 
reveals  little  about  the  true  causes  of  model  mis¬ 
specification.  For  example,  in  panel  data,  the  sum 
of  residuals  for  a  single  unit  often  deviates  from 
zero.  Econometricians  interpret  this  structure  as 
unobserved,  time-invariant  heterogeneity,  which  is 
but  one  of  the  possible  reasons.  However,  this  vio¬ 
lation  of  Gauss-Markov  conditions  can  result  from 
most  types  of  model  misspecification,  including 
omitted  time-varying  variables,  a  wrongly  speci¬ 
fied  functional  form,  unmodeled  conditionality, 
and  so  on. 

Panel  data  textbooks  suggest  a  radical  “solu¬ 
tion”  to  the  problem  of  panel  heteroskedasticity 
(i.e.,  unequal  variance  in  the  regression  errors). 
When  the  estimated  model  does  not  fully  explain 
the  cross-sectional  variation,  econometricians  sug¬ 
gest  completely  ignoring  all  cross-sectional  varia¬ 
tion  (the  so-called  between  variation).  However, 
this  common  solution  may  lead  to  an  improvement 
as  well  as  a  deterioration  in  the  ex  ante  validity  of 
causal  inferences.  In  fact,  inferences  tend  to  become 
less  reliable  when  the  eliminated  variation  largely 
exceeds  the  remaining  variation.  In  this  case,  the 
decline  in  analytical  efficiency  reduces  the  ex  ante 
reliability  of  the  analysis  more  than  the  potential 
decline  in  bias  improves  the  validity  of  the  analysis. 
In  addition,  when  the  model  omits  both  time- 
varying  and  time-invariant  variables,  the  elimina¬ 
tion  of  all  between-variation  may  actually  increase 
the  bias  from  the  omitted  time-varying  variable. 


Dependence 

Regression  models  assume  independent  obser¬ 
vations.  Yet  strict  independence  appears  illusory  in 
the  social  sciences,  where  individual  action  usually 
depends  on  past  actions  (temporal  dependence) 
and  on  what  other  actors  do  or  have  done  in  the 
past  (spatial  dependence).  As  a  consequence,  most 
social  science  data  sets  violate  the  assumption  of 
independence.  Fortunately,  this  problem  can  be 
solved  or  at  least  moderated.  Solutions  to  the 
problem  of  dynamic  misspecification  require  the 
existence  of  time  series.  To  account  for  temporal 
dependence,  researchers  would  usually  need  to 
observe  at  least  30  sufficiently  independent  periods 
per  case.  The  more  within-variation  can  be  col¬ 
lected,  the  more  reliable  the  results  eventually  will 
become.  If  it  is  possible  to  gather  sufficient  variation 
over  time,  temporal  dependence  can  be  modeled  in 
numerous  ways  ranging  from  serially  correlated 
errors  through  the  inclusion  of  the  lagged  depen¬ 
dent  variable  or  a  Prais-Winsten  transformation, 
distributed  lags  models  to  more  demanding  models 
involving  panel  co-integration.  The  choice  of  the 
optimal  dynamic  specification  for  solving  this 
problem  depends  on  the  degree  of  serial  correla¬ 
tion,  whether  researchers  have  a  theoretical  inter¬ 
est  in  short-  or  long-term  adjustment  (or  both), 
and  whether  the  dynamics  of  different  included 
regressors  are  sufficiently  similar  to  justify  simple 
techniques. 

Spatial  misspecification  can  take  three  different 
forms.  First,  the  dependent  variable  in  the  unit  of 
analysis  depends  on  the  value  of  the  dependent 
variable  in  other  units  of  analysis  (not  necessarily  in 
all  other  units  of  analysis).  Second,  the  dependent 
variable  can  depend  on  the  value  of  explanatory 
variables  in  other  units.  Finally,  the  errors  may  be 
spatially  correlated  due  to  some  form  of  unob¬ 
served  spatial  heterogeneity  such  as  common  shocks 
or  common  trends.  The  standard  solution  to  spatial 
dependence  requires  the  inclusion  of  a  correctly 
specified  “spatial  lag”  in  the  empirical  model, 
which  often  needs  to  be  instrumented  (or  analyzed 
by  spatial-ML  methods  [a  markup  scheme  for  rep¬ 
resenting  places  mentioned  in  text  and  their  rela¬ 
tionships])  to  account  for  endogeneity  (see  below). 

Yet  the  existence  of  numerous  solutions  to  prob¬ 
lems  of  temporal  and  spatial  dependence  creates  a 
new,  higher-order  problem.  Since  applied  research¬ 
ers  have  very  different  solutions  available — model 
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specifications  that  solve  the  problem  of  temporal 
and  spatial  dependence — the  choice  of  the  solution 
becomes  a  specification  issue  in  itself.  Econometric 
theory  offers  little  guidance  here,  and  different 
solutions  may  lead  to  substantively  different  results. 
Specifically,  the  impossibility  of  testing  the  weighting 
matrix  independently  from  the  spatial  autoregres¬ 
sive  term  leaves  applied  researchers  with  important 
arbitrary  decisions.  Other  issues,  such  as  the  choice 
of  the  correct  functional  form  for  the  continuous 
weighting  matrices,  further  add  to  these  important 
specification  problems. 

Sampling  and  Data  Imperfections 

Sampling  usually  attracts  major  attention  in  empir¬ 
ical  research.  Indeed,  a  sample  is  optimal  if  it  rep¬ 
resents  the  population  and  if,  at  the  same  time,  it  is 
large  enough.  When  researchers  are  able  to  observe 
a  sufficiently  large  number  of  observations,  ran¬ 
dom  sampling  appears  to  be  the  ultimate  strategy. 
Any  possible  selection  bias  should  be  carefully 
avoided  or — if  this  proves  to  be  impossible — be 
accounted  for.  The  following  sections  discuss, 
first,  the  impossibility  of  perfect  random  sampling; 
second,  they  examine  the  specification  issues  and 
research  designs  in  the  presence  of  (systematic) 
data  imperfections;  and  third,  they  discuss  unit 
heterogeneity. 

Nonrandom  Sampling 

While  econometricians  often  assume  that 
researchers  analyze  a  random  sample  drawn  from 
the  population,  applied  researchers  seldom  analyze 
a  sample  that  represents  the  population.  This 
problem  also  occurs  when  researchers  analyze  a 
“random  draw”  from  a  preselected  sample,  which 
is  unlikely  to  represent  the  population.  However, 
the  population  consists  of  all  cases  about  which  a 
theory  makes  predictions.  It  is  this  population 
from  which  cases  must  be  drawn  to  constitute  a 
true  random  sample.  For  example,  a  random  draw 
from  the  record  of  British  telephone  landlines  in 
2009  is  not  a  random  sample  of  the  population  of 
cases  on  which  a  voting  theory  makes  predictions. 
First  of  all,  this  theory  claims  validity  not  just  for 
the  British  electorate  but  certainly  for  all  democra¬ 
cies.  Second,  the  theory’s  validity  does  not  begin  in 
2009  and  end  in  2010. 


Nonrandom  sampling  occurs  frequently  in  com¬ 
parative  politics  and  international  relations,  where 
scholars  have  to  deal  with  the  mere  existence  of  24 
typical  Organisation  for  Economic  Co-operation 
and  Development  (OECD)  countries,  roughly  60 
typical  democracies,  and  a  total  of  120  countries 
for  which  reliable  data  are  available.  Standard 
samples  cannot  be  drawn  from  a  larger  population 
of  countries  (which  does  not  exist),  but  the  existing 
country-years  do  not  constitute  the  population 
about  which  a  theory  makes  predictions.  Non- 
random  sampling  limits  the  generalizability  of 
findings  and  may  invalidate  inferences.  Yet  social 
scientists  have  to  live  with  these  consequences. 
Unless  social  scientists  formulate  theories  of  rather 
limited  scope,  samples  will  never  be  truly  random. 
The  consequence  of  the  absence  of  true  random 
samples  is  clear:  external  validity  remains  limited. 
Yet  the  positive  message  is  that  this  validity  varies 
with  the  quality  of  the  research  design  and  with  the 
extent  to  which  researchers  take  nonrandom  sam¬ 
pling  and  model  uncertainty  seriously. 

Data  Imperfections 

In  the  social  sciences,  the  limited  quality  of  mea¬ 
surement  may  lead  to  serious  problems.  Many 
phenomena  that  social  scientists  need  to  measure 
cannot  be  directly  observed.  Take,  for  example, 
the  “measurement”  of  per  capita  income.  First  of 
all,  per  capita  income  is  a  composite  measure  that 
divides  the  gross  domestic  product  by  population 
size.  Second,  to  compute  the  gross  domestic  prod¬ 
uct  of  a  country,  one  would  have  to  aggregate  all 
economic  transactions.  Yet  whether  a  transaction 
is  economic  or  a  gift  exchange  depends  on  social 
standards.  Likewise,  the  relative  share  of  black 
market  and  illegal  transactions  depends  on  the 
scope  of  market  distortions.  Third,  in  the  absence 
of  a  common  currency,  conversion  rates  between 
countries  can  be  arbitrary.  Fourth,  population 
measures  result  from  census  data.  Population 
counts  are  imprecise  in  censuses,  and  between  cen¬ 
sus  years,  they  are  simply  estimates.  Neither  the 
census  count  nor  the  estimates  in  between  can  be 
precise  because  illegal  immigrants  and  undisclosed 
emigrants  cause  measurement  error.  Neither  part 
of  the  measurement  error  appears  to  be  unsystem¬ 
atic.  Rather,  social,  economic,  political,  and  cul¬ 
tural  factors  influence  the  accuracy  of  measures  of 
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per  capita  income  heavily.  As  a  consequence,  a 
comparison  of  per  capita  income  between  countries 
gives  an  idea  of  per  capita  income  in  an  interna¬ 
tional  comparison,  but  analysts  using  this  informa¬ 
tion  need  to  understand  that  their  data  come  with 
potentially  significant  measurement  error. 

Applied  empirical  research  too  often  ignores 
measurement  error.  With  the  rise  of  Bayesian 
methods,  however,  attention  to  uncertainties 
became  more  common  in  the  social  sciences. 
Bayesian  econometrics  clearly  paved  the  way,  but 
frequentist  approaches  followed  suit.  Both  meth¬ 
odological  paradigms  allow  researchers  to  esti¬ 
mate  models  in  a  way  that  generates  measures  of 
the  uncertainty  of  an  estimate.  For  example,  using 
standard  econometric  measures,  scholars  can 
repeatedly  estimate  a  model  with  measurement 
error  and  randomly  add  this  to  the  variable  of 
interest  that  likely  suffers  from  measurement 
error.  This  procedure  gives  a  distribution  of  coef¬ 
ficients.  If  this  distribution  includes  the  zero,  the 
measurement  error  is  strong  enough  to  influence 
inferences. 

Yet  the  limits  to  observability  lead  not  just  to 
measurement  error  but  also  to  censoring,  trunca¬ 
tion,  and  selection.  Over  the  past  decades,  econo¬ 
metricians  have  developed  a  fine-grained  set  of 
estimation  tools  that  help  deal  with  very  different 
kinds  of  sample  selection  issues.  Censoring  arises 
when  the  dependent  variable  is  measured  exactly 
only  above  (or  below,  or  both)  a  certain  threshold. 
As  a  consequence,  researchers  know  only  parts  of 
the  distribution.  Income  data,  for  example,  may  be 
measured  exactly  above  the  poverty  line,  while 
lower  income  is  reported  at  the  poverty  line. 
Models  for  censored  data  deal  with  the  problem  of 
censoring  by  estimating  a  replacement  for  the  cen¬ 
sored  observations. 

Truncation  arises  when  the  dependent  variable 
is  observed  only  above  (or  below  or  both)  a  certain 
threshold.  The  information  on  both  the  dependent 
and  the  independent  variables  can  be  truncated.  In 
this  case,  researchers  may  fit  a  regression  model 
from  the  sample  drawn  from  the  restricted  part  of 
the  population.  Under  the  assumption  that  the 
error  terms  in  the  truncated  regression  have  a  trun¬ 
cated  normal  distribution,  a  correction  for  the 
truncation  becomes  possible. 

Other  conditions  may  limit  the  observability  of 
the  dependent  variable  in  a  subset  of  the  population. 


Not  all  countries  report  socioeconomic  information 
to  international  organizations.  Obviously,  the  sam¬ 
ple  of  nonreporting  countries  deviates  largely  from 
the  sample  of  reporting  countries.  Usually,  govern¬ 
ments  in  less  democratic  and  poorer  countries  are 
significantly  less  likely  to  provide  information. 
Nonresponse  in  survey  studies  creates  similar  prob¬ 
lems  for  researchers.  Heckman  selection  models 
correct  for  sample  selection  if  researchers  are  able  to 
specify  a  model  that  explains  selection.  The  esti¬ 
mated  nonselection  hazard  is  then  used  in  the  final 
stage  of  the  estimator  to  correct  for  the  selection 
process. 

More  recently,  econometricians  modeled  solu¬ 
tions  for  very  specific  selection  processes.  For 
example,  selection  may  occur  but  cannot  be 
directly  observed.  Fishing  at  a  pond  in  a  park  pro¬ 
vides  the  standard  example.  Since  the  number  of 
fish  caught  is  not  observed  reliably  at  the  pond  but 
only  at  the  gate  of  the  park,  zeros  can  be  caused  by 
either  not  fishing  or  by  failure  to  catch  fish.  The 
zero-inflated  Poisson  or  negative  binomial  model 
generates  most  promising  results  in  situations  such 
as  this.  However,  the  model  does  not  correct  for 
the  possibility  that  visitors  spend  a  very  different 
amount  of  time  fishing.  This  problem  cannot  be 
solved  by  a  zero-inflated  model  unless  somehow 
miraculously  the  time  visitors  spend  fishing  can  be 
observed  through  the  number  of  fish  caught.  This 
problem  should  similarly  apply  in  most  applica¬ 
tions  of  the  zero-inflated  Poisson  model. 

Endogeneity  and  Simultaneity 

In  the  social  sciences,  almost  nothing  is  truly  exog¬ 
enous.  Social  sciences  are  concerned  with  choices. 
Actors  choose  between  options  by  calculating  the 
consequences  of  their  options.  The  criteria  on 
which  actors  base  their  decisions  are  often  the  con¬ 
sequence  of  other  intentional  actions.  As  a  conse¬ 
quence,  social  interactions  shape  the  basis  on 
which  decisions  rest,  but  these  decisions  provide 
parts  of  the  basis  of  other  decisions.  For  example, 
participatory  political  institutions  have  a  higher 
probability  of  survival  in  relatively  affluent  societ¬ 
ies.  At  the  same  time,  political  institutions  and 
political  stability  influence  economic  growth. 
Therefore,  estimating  the  effects  of  political  insti¬ 
tutions  on  socioeconomic  factors  is  hampered  by 
endogeneity. 
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Classical  Endogeneity 

Endogeneity  is  often  defined  in  a  narrow  sense 
meaning  that  at  least  one  of  the  regressors  is  not 
independent  of  the  dependent  variable.  To  use  a 
famous  example,  simple  estimates  of  the  effect  of 
political  institutions  on  economic  growth  are  likely 
to  be  biased  by  the  fact  that  the  choice  of  institu¬ 
tions  depends  to  a  certain  degree  on  economic  per¬ 
formance.  At  the  very  least,  social  conflict  and 
cleavage  structures  codetermine  political  institu¬ 
tions  and  economic  performance.  In  a  broader 
sense,  endogeneity  is  defined  by  its  consequences: 
the  correlation  between  the  error  term  and  one 
regressor.  In  this  definition,  omitted  variables,  mea¬ 
surement  error,  or  a  misspecified  functional  form 
exclude  conditionality  and  so  cause  endogeneity. 

This  problem  can  be  solved  in  three  classical 
ways.  First,  researchers  may  be  able  to  find  an 
exogenous  proxy  for  the  endogenous  right-hand- 
side  variable.  Of  course,  a  good  proxy  often  does 
not  exist.  A  second,  more  popular  solution  requires 
the  use  of  instruments  for  the  endogenous  vari¬ 
ables.  Instruments  need  to  satisfy  two  criteria.  They 
have  to  explain  as  much  as  possible  of  the  endog¬ 
enous  right-hand-side  variable  (the  instrumented 
variable)  and  need  to  be  uncorrelated  with  the 
error  term.  Applied  researchers  often  generously 
ignore  the  first  condition  and  exclusively  pay  atten¬ 
tion  to  the  second.  For  example,  political  institu¬ 
tions  have  been  instrumented  by  settler  mortality 
rates,  because  settlers  only  invested  in  good  politi¬ 
cal  institutions  where  mortality  was  low.  While  the 
argument  seems  convincing,  settler  mortality  data 
do  not  provide  a  good  instrument  for  institutional 
quality.  Institutions  may  change  slowly,  but  they 
change.  Using  historic  settler  mortality  data  as  the 
sole  instrument,  however,  only  captures  the  time- 
invariant  exogenous  part  of  political  institutions. 
In  addition,  while  settler  mortality  may  influence 
the  choice  of  institutions,  economic  growth  and 
settler  mortality  are  hardly  the  only  influences  on 
political  institutions.  Yet  the  choice  of  poor  instru¬ 
ments  largely  reduces  the  efficiency  of  the  parame¬ 
ter  estimate,  and  it  may  easily  result  in  unreliable 
point  estimates.  The  use  of  poor  instruments  then 
leads  to  less  reliable  estimates  than  estimating  a 
model  that  ignores  existing  endogeneity  issues. 

Endogeneity,  thus,  is  a  serious  problem  in  the 
social  sciences.  “Solving”  endogeneity  problems 
with  weak  instruments  generates  an  even  more 


serious  problem.  A  weak  instrument  is  a  variable 
that  is  not  correlated  with  the  error  term  but  only 
weakly  correlated  with  the  endogenous  variable. 
Estimates  with  weak  instruments  are  always  less 
efficient  than  estimates  that  ignore  the  endogene¬ 
ity,  and  they  can,  at  the  same  time,  be  even  more 
biased.  In  these  cases,  the  cure  is  worse  than  the 
disease.  This  result,  though  known  since  the  early 
1990s,  contrasts  starkly  with  claims  that  the 
instrumental  variable  (IV)  model  allows  causal 
inferences  in  a  nonexperimental  setting. 

Right-Hand- Side  Endogeneity 

Misspecification  also  occurs  when  one  regressor 
(partly)  depends  on  at  least  one  other  regressor.  In 
this  case,  estimates,  first,  suffer  from  inefficiency 
caused  by  multicollinearity.  As  a  consequence, 
inferences  become  less  reliable.  Second,  researchers 
may  find  it  difficult  to  interpret  the  effects  of  the 
two  correlated  right-hand-side  variables.  When 
two  or  more  regressors  depend  on  each  other, 
parts  of  the  effect  of  one  variable  are  captured  by 
the  other  variable.  If  researchers  are  not  aware  of 
the  dependence,  serious  misinterpretations  become 
likely.  To  solve  the  problem  of  right-hand-side 
endogeneity,  solutions  that  also  deal  with  reversed 
causality  are  feasible.  As  before,  using  instruments 
for  one  of  the  right-hand-side  endogeneous  vari¬ 
ables  or  estimating  a  simultaneous  equation  model 
that  explicitly  models  the  endogeneity  between  the 
regressors  provides  a  solution. 

“Solving”  Specification  Issues: 

A  Cautionary  Note 

Though  model  misspecifications  bias  estimation 
results  and  may  lead  to  invalid  inferences,  method¬ 
ologists  and  applied  researchers  tend  to  take  some 
specification  issues  more  seriously  than  others. 
The  extreme  importance  of  unobserved  time- 
invariant  heterogeneity  relative  to  time-varying 
heterogeneity  seems  unreasonable  unless  one 
understands  that  methodologists  believe  that  they 
have  a  solution  to  the  former  but  not  to  the  latter 
problem.  Are  solvable  specification  problems  more 
common  than  unsolvable  ones?  Certainly  not,  but 
at  least  researchers  can  deal  with  them. 

Specification  problems  are  fads  and  deeply 
embedded  in  the  cultures  of  various  disciplines.  A 


Model  Specification  1601 


serious  problem  for  one  discipline  might  easily  be 
a  nuisance  for  another  discipline.  To  mention  just 
another  example,  over  the  last  decade,  economists 
appeared  to  be  obsessed  with  endogeneity  and  thus 
with  instrumental  variable  models.  At  the  same 
time,  researchers  working  in  other  disciplines  were 
much  more  likely  to  ignore  endogeneity.  Who  is 
right?  The  answer  is  that  it  depends  on  the  quality 
of  the  instruments,  which  is  more  an  assumption 
than  a  known  entity.  Too  many  IV  models  are 
certainly  not  any  better  than  too  few.  And  this 
result  is  generalizable:  Too  much  effort  to  solve 
even  the  most  obscure  potentially  existing  specifi¬ 
cation  problem  is  as  costly  as  the  ignorance  of 
specification  problems.  And  often,  this  holds  even 
if  the  solutions  work  in  the  desired  direction.  This 
is  because  these  solutions  come  at  a  price.  Currently, 
econometricians  suggest  two  standard  solutions, 
which  can  be  defined  as 

f.  removing  “bad  variation”  and 

2.  removing  “bad  observations.” 

Both  standard  solutions  (which  can  of  course  be 
combined)  and  their  costs  are  discussed  in  turn. 
Fixed  effects  models  and  instrumental  equations 
are  the  most  common  techniques  when  researchers 
seek  to  eliminate  parts  of  the  variation  to  improve 
proper  variable  “identification.”  Fixed  effects 
models  assume  that  a  part  of  the  variation  of  the 
dependent  variable  is  explained  by  an  unobserved 
factor,  which  is  correlated  with  one  of  the  regres¬ 
sors.  If  researchers  do  not  find  a  way  to  account 
for  the  unobserved  regressor,  the  existence  of  this 
unobserved  omitted  variable  will  bias  the  estimates 
of  all  correlated  regressors.  Similarly,  instrumental 
equation  models  assume  that  the  dependent  vari¬ 
able  explains  an  unknown  but  positive  quantity  of 
the  variation  of  at  least  one  of  the  regressors.  In 
both  cases,  researchers  would  like  to  perfectly 
account  for  the  problematic  parts  of  the  variation. 
Unfortunately,  this  is  not  what  fixed  effects  models 
do,  and  instrumental  variable  models  are,  at  best, 
unlikely  to  just  strip  the  problematic  variation  of 
the  instrumented  variable.  In  fact,  the  fixed  effects 
model  does  not  use  variation  between  units  at  all, 
and  instruments  typically  explain  less  than  the 
“unproblematic  part”  of  the  endogenous  variable. 

As  a  consequence,  researchers  live  with  what 
many  think  is  the  second  best  option:  a  model  that 


throws  away  too  much  variation.  In  the  standard 
econometric  textbook  world,  researchers  pay  an 
almost  irrelevant  price  because  econometricians 
are  mainly  interested  in  asymptotic  properties;  that 
is,  they  ask,  “How  do  we  get  the  optimal  estimate 
when  we  have  an  infinite  amount  of  information?” 
In  this  world,  throwing  away,  say,  half  of  the 
variation  does  not  matter,  because  half  of  infinity 
still  gives  infinity.  But  applied  researchers  do  not 
live  in  this  convenient  world.  They  have  limited 
amounts  of  information  at  their  disposal,  and 
when  they  want  to  make  the  best  of  it  (i.e.,  the 
most  reliable  causal  inferences),  throwing  away 
more  than  necessary  may  really  be  costly. 

More  recently,  methodologists  have  developed 
research  designs  that  “solve”  specification  issues 
not  by  eliminating  variation  but  by  “intelligent” 
sampling  techniques.  For  example,  rather  than 
estimating  a  fixed  model,  researchers  could  also 
sample  observations  that  do  not  seem  to  have 
significant  unobserved  unit  heterogeneity.  Of 
course,  both  techniques  should  give  relatively 
similar  results — but  this  hardly  ever  happens  with 
real  data.  In  principle,  however,  by  eliminating 
observations  from  the  sample,  researchers  can 
account  for  the  influence  of  some  problematic 
regressors.  In  this  case,  researchers  may  use 
matching  techniques  to  make  the  distribution  of 
the  problematic  variable  more  similar  across 
groups.  This  technique  thus  clearly  accounts  for 
misspecified  functional  forms,  but  it  cannot 
reduce  bias  from  a  wrongly  specified  model,  and 
it  comes  at  the  price  of  reduced  efficiency  and 
dubious  external  validity. 

Experiments  provide  a  more  radical  solution  to 
the  problem  of  model  uncertainty.  If  researchers 
are  able  to  design  controlled  situations  in  which  all 
but  one  (potential)  influence  on  a  variable  are  held 
constant,  they  can  get  close  to  identifying  the  true 
effect  of  the  variable.  Yet  in  the  social  sciences, 
one  usually  cannot  hold  variables  constant  with¬ 
out  losing  external  validity.  For  example,  holding 
gender  constant  will  reduce  the  extent  to  which 
the  experiment’s  results  can  be  generalized  to  the 
true  world  in  which  different  genders  exist. 
However,  experimental  design  can  hold  the  distri¬ 
bution  of  variables  stable  between  different  groups. 
In  experiments,  researchers  usually  randomize  the 
treatment  and  then  hope  that  the  distribution  of 
confounding  factors  is  identical  in  the  treatment 
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and  control  groups.  Whether  this  assumption 
holds  depends  on  the  number  of  independent 
experimental  observations  as  well  as  the  distribu¬ 
tion  of  the  potential  confounding  factors.  The 
rarer  a  confounding  factor  is,  the  stronger  its  influ¬ 
ence,  and  the  larger  the  number  of  groups,  the 
greater  the  number  of  experimental  observations 
needed  to  guarantee  a  sufficient  approximation  to 
equal  distribution. 

To  give  but  one  example,  assume  that  social 
scientists  conduct  a  decision  experiment  with 
1,000  participants — a  number  larger  than  what  we 
usually  see  in  the  social  science  experiments.  Five 
hundred  participants  receive  a  treatment,  and  the 
remaining  500  get  a  placebo  (or  some  other  non¬ 
treatment).  Now,  assume  the  existence  of  an 
important  genetic  property,  which  occurs  in  1  %  of 
the  population.  This  leads  to  two  questions:  First, 
what  is  the  probability  that  1  %  of  the  participants 
have  this  defect?  And  second,  what  is  the  probabil¬ 
ity  that  the  defect  occurs  equally  among  the  par¬ 
ticipants  in  both  groups?  The  answer  to  the  first 
question  is  that  if  the  sample  is  drawn  from  the 
population,  roughly  100  ±  10%  will  have  the 
defect.  Of  course,  there  is  a  roughly  5%  probabil¬ 
ity  that  the  sample  includes  fewer  than  80  or  more 
than  120  participants  of  this  type.  The  second 
question  proves  to  be  more  important.  The  prob¬ 
ability  that  both  the  treated  and  the  control  group 
include  the  same  number  of  participants  of  this 
type  is  only  marginally  above  zero.  On  average, 
the  difference  will  be  6.66.  Accordingly,  if  the 
experimental  design  controls  confounding  factors 
by  randomization,  bias  will  be  different  from  zero 
unless  the  sample  size  approaches  infinity.  Thus, 
experiments  identify  the  “true  effect”  only  asymp¬ 
totically  and  fail  to  do  so  in  all  real-world  experi¬ 
ments.  However,  experiments  still  have  a  clear 
advantage  over  estimation — namely,  that  the  true 
model  does  not  need  to  be  known.  The  price 
researchers  pay  for  this  advantage  is  a  sharp 
decline  in  external  validity. 

Robustness  tests  offer  a  radical  alternative  to 
the  logic  of  removing  variation  or  removing  obser¬ 
vations.  Rather  than  solving  specification  prob¬ 
lems,  robustness  tests  seek  to  provide  an  estimate 
of  the  importance  of  specification  issues  for  causal 
inference.  Robustness  tests  identify  areas  in  which 
the  true  specification  remains  unknown  and  then 
vary  the  model  specification  in  a  way  that  accounts 


for  the  model  uncertainty.  For  example,  when 
measurement  errors  reduce  the  validity  of  point 
estimates,  researchers  can  generate  additional  con¬ 
trolled  measurement  error  (using  Monte  Carlo 
techniques)  to  analyze  the  influence  of  measure¬ 
ment  error  on  parameter  estimates.  In  this  case,  the 
parameters  will  vary  across  the  iterations  of  this 
experiment,  but  this  variation  may  not  affect 
causal  inferences  and  thus  demonstrates  the  robust¬ 
ness  of  the  results.  Further  robustness  tests  include 
groupwise  jack-knives,  bootstrapping,  sensitivity 
analysis,  semiparametric  techniques,  the  choice  of 
alternative  estimators,  and  many  more.  Robustness 
tests  do  not  try  to  solve  specification  problems 
when  solutions  appear  to  be  too  costly.  And  they 
do  not  claim  to  produce  certainty  where  of  course 
there  can  be  none.  After  all,  even  the  best  solution 
to  the  most  pressing  specification  problem  can  give 
unbiased  results  if  and  only  if  the  model  is  other¬ 
wise  perfectly  specified. 

Thomas  Plumper 
University  of  Essex 
Colchester,  United  Kingdom 
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All  models  of  politics  are  models  in  the  colloquial 
sense  of  being  simplified  “miniature”  artifacts 
that  their  creators  believe  represent  some  impor¬ 
tant  aspect  of  the  real  world.  The  hope  is  that 
manipulating  moving  parts  of  the  model  gives  use¬ 
ful  intuitions  about  how  the  real  world  works. 
“Useful”  in  this  sense  means  something  that  is  a 
valid  inference  from  the  model,  nonobvious,  and, 
at  least  a  priori,  empirically  plausible.  Theoretical 
models  are  constructed  from  abstract  statements 
about  the  real  world.  Useful  general  propositions 
about  the  real  world  may  be  derived  analytically 
by  “solving”  the  model  using  some  form  of  logical 
manipulation.  For  example,  if  one  knows  the 
radius  r  of  a  circle  and  wants  to  know  its  area,  A, 
he  or  she  can  make  use  of  an  ancient  analytical 
result:  A  —  tt  r2.  This  result  derives  from  a  formal 
mathematical  proof  and  is  always  and  forever  true 
in  a  Euclidean  space. 

The  same  result  could  be  derived  from  a  well- 
designed  computational  experiment: 

f.  Scatter  p  dots  at  random  on  a  Euclidean  plane, 
in  a  uniform  distribution;  every  locus  on  the 
plane  is  equally  likely  to  have  a  dot  on  it. 

2.  Randomly  select  a  locus  (x,  y)  on  the  plane; 
draw  a  circle  with  radius  r  centered  on  (x,  y), 
picking  values  of  x,  y,  and  r  at  random;  count  n, 
the  number  of  scattered  dots  inside  the  circle;  n 
is  a  computed  estimate  of  A,  the  area  of  the 
circle. 

3.  Iterate  (1)  and  (2)  above  m  times. 

Careful  analysis  of  observations  from  this  experi¬ 
ment  would  reveal,  among  other  things,  that  n  is 
completely  uncorrelated  to  the  values  of  x  and  y,  the 
coordinates  of  the  center  of  the  circle;  the  best-fit 
prediction  of  n  is  n  =  it  r2.  The  precision  of  this  esti¬ 
mate  will  depend  on  the  number  of  dots  scattered,  d, 
and  the  number  of  times  the  scattering  is  done,  m.  In 
other  words,  it  depends  on  the  amount  of  computa¬ 
tional  resources  deployed  on  the  problem — the  more 
the  better.  All  this  computation  would  be  an  utterly 
pointless  waste  of  resources.  It  is  known  analytically 
that  n  ~  A  =  tt  r2. 


Very  often,  however,  empirically  plausible  mod¬ 
els  of  real-world  processes  that  actually  interest 
researchers  are  analytically  “difficult”  or  “intrac¬ 
table,”  leaving  them  unable  to  generate  the  formal 
proofs  that  they  would  very  much  like,  because 
they  cannot  solve  the  model  analytically.  An 
intractable  model  is  one  that  researchers  know 
cannot  be  solved  analytically.  A  difficult  model  is 
one  that  researchers  do  not  know  cannot  be  solved 
analytically,  but  they  do  know  that  it  has  not  yet 
been  solved,  and  they  do  not  yet  know  how  to 
solve.  There  are  two  basic  solutions  to  this  very 
common  intellectual  problem.  The  first  is  to  make 
the  model  tractable  by  simplifying  its  representa¬ 
tion  of  the  real  world.  The  resulting  model  looks 
less  like  the  real  world,  but  at  least  it  can  be  solved 
analytically.  The  second  solution  is  to  use  compu¬ 
tational  methods  to  investigate  and  in  some  sense 
“solve”  difficult  or  intractable  models.  In  this 
entry,  the  strengths  and  limitations  of  computa¬ 
tional  modeling  and  its  possible  applications  in 
political  science  are  discussed. 

Computation  and  Analysis 

Consider  a  two-dimensional  policy  space  in  which 
voters’  preferences  are  described  in  terms  of  “ideal 
points.”  Assume  that  these  ideal  points  have  a 
symmetric  bivariate  normal  distribution  with  a 
mean  at  the  origin  of  the  space.  We  want  to  esti¬ 
mate  the  number  of  ideal  points  (dots)  located 
within  a  circle  of  radius  r  from  an  arbitrary  point 
(x,  y)  in  the  space — for  example,  the  number  of 
voters  within  distance  r  of  some  party  policy  posi¬ 
tion.  This  problem,  while  precisely  specified,  is  at 
the  very  least  difficult  to  solve  analytically  since  we 
cannot  use  the  analytical  result  n  ~  A  —  tt  r1.  This 
is  because,  given  the  bivariate  normal  distribution 
of  ideal  points,  the  number  of  ideal  points  in  a 
circle  of  given  radius  now  depends  critically  on  the 
locus  of  its  center.  Generalize  the  problem  further 
to  any  arbitrary,  or  even  unknown,  distribution  of 
voter  ideal  points.  The  problem  becomes  increas¬ 
ingly  difficult,  if  not  intractable,  analytically.  One 
solution  is  to  simplify  the  model  by  assuming  the 
arbitrary  distribution  of  ideal  points  to  be  uni¬ 
form,  restoring  access  to  the  analytical  result  n  ~ 
77  r2.  The  gain  would  be  analytical  rigor,  in  some 
odd  sense,  but  the  cost  would  be  realism  and  thus 
empirically  valid  intuition.  The  “rigor”  of  the 
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result  would  depend  on  assuming  something  we 
knew  for  certain  to  be  false.  The  other  solution 
would  be  to  run  the  very  simple  computational 
experiment  outlined  in  Items  1  to  3  above,  ridicu¬ 
lous  with  a  uniform  distribution  of  ideal  points  but 
now  powerful  in  allowing  us  to  estimate,  in  a  trans¬ 
parent  way,  the  number  of  points  within  a  circle 
with  radius  r  and  center  (x,  y),  for  any  arbitrary 
distribution  of  ideal  points.  More  generally,  a  com¬ 
putational  experiment  can  be  designed  to  retrieve 
any  analytical  result,  but  computational  experi¬ 
ments  can  also  be  designed  to  retrieve  results  that 
can  never,  or  cannot  yet,  be  derived  analytically. 

Such  computational  work  does  not  at  all  depend 
on  using  electronic  computers;  all  computations 
could  in  theory  be  carried  out  by  hand.  In  practice, 
the  relatively  low  cost  of  electronic  computers  and 
the  increasing  sophistication  of  the  programs  that 
run  on  them  mean  that  nearly  all  computational 
work  is  now  done  on  computers.  In  the  same  way, 
analytical  results  are  in  practice  not  all  derived 
using  a  pencil  and  paper.  Computer  programs  may 
be  used  to  solve  difficult  systems  of  equations. 
Unseen  inside  the  program,  these  solutions  may 
involve  simulations,  while  program  output,  and 
thus  computer-assisted  formal  “proofs,”  may  or 
may  not  be  amenable  to  checking  by  the  human 
analyst. 

Theoretical  Intractability,  Decision 
Heuristics,  and  Agent-Based  Modeling 

A  famous  and  intensively  studied  intractable 
problem  that  is  simply  stated  is  the  Traveling 
Salesmen  Problem  (TSP).  Given  n  cities  arbitrarily 
located  on  a  map,  find  the  shortest  route  that  vis¬ 
its  all  of  them  and  returns  to  the  starting  point. 
This  cannot  be  solved  analytically  for  the  general 
case  with  an  arbitrary  number  of  cities  at  arbi¬ 
trary  locations.  Any  particular  example  of  the  TSP 
can  in  theory  be  “smashed”  computationally  by 
elaborating  every  possible  route,  computing  the 
length  of  each,  and  picking  the  shortest  route. 
However,  the  number  of  different  possible  routes 
between  n  cities  is  (n  —  l)!/2,  which  expands  at 
an  explosive  rate  with  the  number  of  cities.  The 
number  of  different  routes  between  “only”  100 
cities  is  about  4.666  X  1015J.  This  problem  cannot 
in  practice  be  smashed  with  feasible  finite  comput¬ 
ing  resources.  Since  the  TSP  is  a  real  problem  for 


real  humans,  for  example,  when  they  devise  travel 
itineraries  or  vehicle  delivery  routes,  its  analytical 
intractability  has  important  implications,  both  for 
actual  decision  making  by  real  humans  and  for  the 
representation  of  real  humans  as  agents  in  models. 
In  the  absence  of  a  formally  provable  best  solution 
to  the  problem,  any  analyst — including  the  decision¬ 
making  agents  whose  behavior  is  being  modeled — 
must  rely  on  informal  decision  heuristics  or  rules 
of  thumb.  These  are  decision  rules  that  have  been 
generally  found,  through  a  process  of  experimen¬ 
tation  and  learning,  to  produce  good  solutions  to 
the  problem  at  hand  but  that  cannot  be  proved  to 
be  optimal  and  may  indeed  be  highly  suboptimal 
in  certain  cases.  One  heuristic  for  attacking  the 
TSP,  for  example,  is  “start  with  a  random  city; 
visit  the  closest  unvisited  city;  iterate  until  all  cities 
have  been  visited.”  This  is  a  “greedy  algorithm,” 
doing  what  looks  best  at  any  given  stage  in  the 
search  and  performs  well  in  many  circumstances. 
It  often  finds  something  “close”  to  the  shortest 
route  in  easy  cases  where  the  shortest  route  is 
known  from  having  smashed  the  problem.  But 
particular  examples  can  easily  be  constructed  for 
which  this  algorithm  produces  disastrous  results. 
Thus,  the  search  for  efficient  heuristics  for  attack¬ 
ing  the  TSP  is  an  important  ongoing  project  in 
computer  science. 

Political  science  is,  of  its  essence,  concerned 
with  many  analytically  intractable  problems.  One 
mainstream  example  concerns  the  search  by  party 
leaders  for  vote-maximizing  party  policy  positions 
in  a  multidimensional  policy  space,  in  a  setting 
where  all  voters  support  their  closest  party.  This  is 
a  special  case,  in  a  political  context,  of  the  more 
general  mathematical  problem  of  competitive  spa¬ 
tial  location,  a  problem  that  has  also  been  shown 
to  be  formally  intractable  in  a  space  of  more  than 
one  dimension.  Despite  formal  intractability  of 
this  problem,  party  leaders  must  still  set  party 
policy  positions.  One  response  to  this,  very  com¬ 
mon  in  political  science,  is  to  simplify  the  model 
of  party  competition  in  ways  that  allow  the  deri¬ 
vation  of  an  analytical  result.  A  very  common 
simplification  is  to  assume,  not  n  parties  and  m 
dimensions,  but  one  dimension  and  a  small  num¬ 
ber  of  parties,  possibly  even  only  two.  This  gives 
access  to  a  portfolio  of  well-known  analytical 
results,  for  example,  about  the  role  of  the  “median” 
voter  on  the  solitary  dimension.  These  results  may 
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be  considered  intuitively  useful  if  the  empirical 
setting  under  investigation  can  plausibly  be 
described  as  having  only  one  policy  dimension 
and  a  small  number  of  parties.  If  researchers  are 
intrinsically  interested  in  competition  in  multi¬ 
party  systems  with  multidimensional  policy  spaces, 
however,  they  will  not  find  one-dimensional 
analytical  results  intuitively  useful.  Formal  intrac¬ 
tability  of  the  problem  of  competitive  spatial  loca¬ 
tion  in  multidimensional  spaces  tells  researchers, 
further,  that  these  one-dimensional  analytical 
results  do  not  generalize  and  suggests  the  use  of 
computational  methods  to  attack  the  problem. 

Moving  beyond  methods  that  might  be  used  to 
attack  interesting  but  intractable  problems,  a  far 
deeper  substantive  implication  of  intractability 
concerns  the  behavior  of  real  humans.  In  the 
absence  of  formally  provable  best-response  strate¬ 
gies  for  setting  party  positions  in  multidimensional 
policy  spaces,  for  example,  and  given  that  such 
positions  must  nonetheless  be  set,  party  leaders 
must  use  informal  decision  heuristics.  This  is  a 
very  good  example  of  the  type  of  problem  that  can 
usefully  be  analyzed  using  agent-based  modeling. 
Agent-based  models  (ABMs)  are  bottom-up  mod¬ 
els  that,  typically,  assume  settings  with  a  fairly 
large  set  of  autonomous  decision-making  agents, 
each  agent  using  some  well-specified  decision  rule. 
The  modeling  exercise  is  to  elaborate  the  patterns 
that  emerge  when  these  agents  interact  with  each 
other.  The  essence  of  agent-based  modeling  is  con¬ 
veyed  clearly  in  a  simple  but  powerful  model  of 
spatial  segregation  put  forward  by  Thomas 
Schelling  in  the  1970s.  The  original  Schelling 
model  was  computational  but  was  implemented 
with  coins  on  a  chessboard,  not  an  electronic  com¬ 
puter.  Scatter  some  pennies  and  nickels,  say  20  of 
each,  at  random  on  the  chessboard;  move  each 
coin  to  the  center  of  its  closest  square.  Each  coin 
has  eight  adjacent  (neighboring)  squares.  (Coins 
on  the  edge  squares  are  different  but  this  does  not 
affect  the  result.)  Coins  represent  people,  and 
people  have  two  different  colors.  Assume  that 
people  have  some  view  about  the  color  of  their 
neighbors.  This  is  expressed  by  a  preference 
parameter  p,  the  minimum  proportion  of  people 
on  neighboring  squares  I  would  like  to  be  the  same 
color  as  me.  This  preference  can  be  very  mild.  If 
p  —  .25,  for  example,  I  am  unhappy  only  if  less 
than  a  quarter  of  my  neighbors  are  the  same  color 


as  me;  if  p  —.5,  I  am  unhappy  if  less  than  half  of 
my  neighbors  are  the  same  color.  The  modeled 
behavior  is  simply  that  unhappy  agents  move  to  a 
randomly  chosen  nearby  empty  square.  A  model 
run  begins  with  agents  scattered  at  random;  agent 
behavior  is  then  implemented  iteratively  using  this 
rule  until  no  unhappy  agent  wants  to  move. 

Figure  1  shows  an  example  of  the  surprisingly 
strong  patterns  of  spatial  segregation  that  emerge 
from  this  simple  model,  implemented  on  a  com¬ 
puter  using  the  agent-based  modeling  environment 
NetLogo.  The  left  panel  shows  a  starting  configu¬ 
ration  with  1,000  gray  and  1,000  white  agents 
scattered  at  random  on  a  51  X  51  square  “chess¬ 
board.”  At  the  beginning  of  the  run,  given  this 
particular  scatter,  on  average,  agents  find  that 
50%  of  their  neighbors  are  the  same  color.  Agents 
in  this  model  run  were  given  a  p  of  .50;  those  find¬ 
ing  themselves  in  a  location  where  less  than  50% 
of  their  neighbors  were  the  same  color  moved  to  a 
new  location.  The  right  panel  of  Figure  1  shows 
the  steady-state  configuration  of  agent  locations 
that  was  the  outcome  of  this  particular  run.  The 
emergent  pattern  of  spatial  segregation  can  be  seen 
very  clearly.  The  surprising  result  is  that,  on  aver¬ 
age,  88%  of  each  agent’s  neighbors  are  now  the 
same  color.  This  social  interaction  generated  a 
much  more  intense  pattern  of  spatial  segregation 
than  any  individual  agent  prefers.  Even  very  mild 
preferences  about  neighbors  can  increase  spatial 
segregation.  From  the  same  random  start  shown  in 
Figure  1,  setting  p  —  .25  increases  the  mean  per¬ 
centage  of  similar  neighbors  from  50%  to  about 
60%.  In  other  words,  even  if  people  only  move 
when  less  than  a  quarter  of  their  neighbors  are  the 
same  as  them,  segregation  will  increase.  This  was 
an  unexpected  and  counterintuitive  result  that 
continues  to  be  debated. 

The  striking  Schelling  segregation  results  would 
be,  at  the  very  least,  difficult  to  derive  analytically; 
yet  a  simple  computer  program  gives  full  access  to 
them.  A  theorist  approaching  this  problem  with 
the  tools  of  classical  formal  analysis,  for  example, 
would  in  essence  have  to  model  strategic  location 
decisions  made  by  each  of  2,000  agents,  each  deci¬ 
sion  taking  into  account  what  each  of  the  other 
1,999  agents  is  likely  to  do.  This,  of  course,  would 
also  be  making  a  preposterous  assumption  about 
the  behavior  of  real  humans.  The  Schelling  ABM 
in  effect  makes  the  “satisficing”  behavioral 
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Figure  I  Random  Start  and  Steady-State  Outcome  of  the  Schelling  Model  of  Spatial  Segregation 


assumption  that  people  in  such  complex  settings 
stay  put  if  they  are  happy,  defining  happy  in  some 
precise  way,  and  move  if  they  are  not.  This  is  a 
classic  example  of  a  “win-stay  lose-shift”  decision 
rule  that  is  a  common  behavioral  assumption  for 
ABMs.  His  computational  model  then  systemati¬ 
cally  elaborates  interactions  between  large  num¬ 
bers  of  agents  using  this  rule.  Thus,  while  ABMs 
do  deploy  decision  heuristics  as  methodological 
devices  to  attack  analytically  intractable  prob¬ 
lems,  the  fundamental  motivation  for  using  ABMs 
is  a  behavioral  assumption  that,  given  the  com¬ 
plex  settings  in  which  they  find  themselves  forced 
to  make  decisions,  real  humans  use  rules  of  thumb 
when  they  must  pick  a  course  of  action.  ABMs  are 
much  better  justified  as  plausible  models  of  real 
behavior  than  as  methodological  responses  to 
intractability. 

Trade-Offs  Between  Rigor, 

Realism,  and  Parsimony 

There  are  thus  two  quite  distinct  epistemological 
dimensions  that  distinguish  agent-based  modeling 
from  classical  formal  analysis  in  political  science. 
The  first  is  methodological  and  contrasts  formal 
analysis  with  carefully  designed  computational 
experiments  as  distinctive  ways  of  deriving  general 
results.  The  second  is  behavioral  and  contrasts  the 
forward-looking  strategic  analysis  that  is  assumed 
by  formal  game  theorists  to  be  performed  by  real 
humans,  for  example,  with  the  deployment  of 
simple  decision  rules,  which  is  the  typical  behavioral 


assumption  of  agent-based  modelers.  The  com¬ 
plexity  of  the  problem  under  investigation  may 
well  dictate  the  trade-offs  that  are  made  within  this 
epistemological  space.  In  a  simple  setting  that  is 
analytically  tractable,  formal  analysis  can  yield 
easily  accessible  yet  rigorous  results.  Given  this, 
further,  it  may  seem  reasonable  to  assume,  behav- 
iorally,  that  real  humans  can  perform  the  same 
analyses  as  the  modeler.  This  convergence  of 
method  and  behavioral  assumption  is  pleasing, 
since  a  sophisticated  analyst  is  then  not  assuming 
that  real  humans  can  do  things  that  the  analyst 
cannot  do  himself  or  herself.  In  such  cases,  there  is 
little  point  in  using  computational  methods  or 
ABMs. 

Typically,  however,  the  real  political  problem 
under  investigation  is  complex,  and  analysts  face 
hard  trade-offs.  One  response,  as  noted  above,  is 
to  simplify  the  model  to  a  point  at  which  it  is  trac¬ 
table,  by  making  assumptions  that  are,  inevitably, 
less  realistic.  (If  they  were  more  realistic,  there 
would  be  no  point  in  the  more  complex  model.) 
Typically,  though  not  inevitably,  in  the  case  of 
game  theoretic  models  in  political  science,  such 
simplifying  assumptions  concern  the  setting  being 
modeled  not  the  rationality  of  the  agents.  An 
intrinsically  dynamic  setting  is  assumed  to  be 
static — for  example,  the  dimensionality  of  the 
preference  space  is  reduced;  only  a  very  small  sub¬ 
set  of  actors  is  modeled,  and  so  on.  Set  against  the 
benefits  arising  from  analytical  rigor,  given  the 
simplifying  assumptions,  the  costs  arise  because 
the  model  is  now  less  realistic.  A  contradiction 
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arises,  further,  if  real  agents  are  now  assumed  to 
apply  a  sophisticated  strategic  decision-making 
model  to  an  assumed  setting,  which,  if  they  were 
indeed  sophisticated,  they  would  know  is  far  more 
complex  than  the  one  being  modeled.  In  a  nutshell, 
agents  may  be  assumed  to  be  smart  enough  to  per¬ 
form  complex  strategic  calculations  in  a  one¬ 
dimensional  space,  for  example,  but  not  smart 
enough  to  see  that  the  real  space  is  not  one¬ 
dimensional,  and  their  calculations  are  thus  point¬ 
less.  The  pleasing  conjunction  of  method  and 
behavioral  assumption  has  been  lost  in  the  search 
for  rigorous  analytical  results  in  intractable  com¬ 
plex  settings. 

In  complex  settings  involving  many  agents, 
the  desire  to  attack  substantively  interesting 
problems  may  more  or  less  mandate  computa¬ 
tional  methods  if  even  a  semblance  of  empirical 
realism  is  to  be  retained.  Considering  the  spatial 
segregation  problem  illustrated  in  Figure  1,  for 
example,  most  people  would  feel  that  the  core 
intuition  would  be  lost  by  simplifying  this  down 
to  a  fabulously  rigorous  formal  model  of  strate¬ 
gic  location  decisions  made  by  five  agents  in  a 
one-dimensional  space.  The  fundamental  sub¬ 
stantive  interest  of  this  problem  is  that  it  is  at 
least  two-dimensional  and  involves  unintended 
consequences  of  interactions  between  large  num¬ 
bers  of  agents.  Riots,  revolutions,  and  indeed 
voting  in  large  electorates  are  other  examples  of 
this  type  of  problem.  While  we  can  easily  specify 
a  formal  game  played  between  “a  rioter”  and  “a 
government,”  most  people  would  probably  feel 
this  misses  the  entire  point  about  riots,  which 
concerns  how  interactions  between  people  in 
large  groups  are  in  some  important  sense  differ¬ 
ent.  The  inevitable  price  paid  for  all  computa¬ 
tional  work  is  that  computation,  of  its  essence, 
involves  observing  outcomes  of  particular  model 
runs,  conditional  on  particular  parameter  set¬ 
tings.  An  analytical  result,  if  general,  is  good  for 
all  conceivable  valid  parameter  settings.  Strictly 
speaking,  computational  results  are  only  valid 
for  those  points  in  a  model’s  parameter  space 
that  have  actually  been  investigated.  Inferences 
about  parameter  settings  that  have  not  been 
investigated  are,  in  effect,  interpolations. 

This  focuses  attention  on  two  matters  of  pri¬ 
mary  concern  to  those  engaged  in  computational 
work  that  is  carefully  designed  and  rigorously 


executed.  The  first  concerns  sampling  from  the 
model’s  parameter  space;  the  second  concerns  the 
techniques  used  to  draw  general  inferences  from 
particular  observations.  Computational  work  in 
essence  generates  simulated  data  about  model  out¬ 
comes,  conditional  on  model  parameter  settings. 
Viewed  in  this  way,  it  is  closely  analogous  to 
empirical  work,  which  collects  “real”  data  about 
outcomes  under  investigation,  together  with  asso¬ 
ciated  values  of  independent  variables  of  theoreti¬ 
cal  interest.  In  each  case,  collecting  more  (unbiased) 
data  is  always  better,  while  it  is  vital  to  match 
techniques  of  statistical  inference  to  the  underly¬ 
ing  data-generating  process.  In  the  case  of  compu¬ 
tational  analysis,  therefore,  analysts  become  more 
confident  in  their  inferences,  the  more  “data”  they 
have,  and  this  is  achieved  by  having  more  model 
runs  that  investigate  more  points  in  the  parameter 
space.  A  single  model  run  tells  researchers  rather 
little — very  much  like  an  election  study  that  inter¬ 
views  a  single  individual.  One  thousand  model 
runs,  sampling  1,000  points  at  random  from  the 
model’s  parameter  space  would,  in  contrast,  tell 
researchers  quite  a  lot.  Of  course,  there  is  always 
the  possibility  that  there  is  an  unsampled  param¬ 
eter  vector  that  would  generate  a  “peculiar” 
result,  but  two  distinct  things  happen  as  the  size 
of  the  sample  is  increased.  First,  the  probability  of 
failing  to  sample  some  peculiar  point  in  the 
parameter  space  is  driven  down.  Second,  even  if 
analysts  do  fail  to  sample  some  peculiar  point, 
they  become  more  confident  that  the  inference 
drawn  from  their  sample  is  in  some  sense  a  good 
representation  of  model  outcomes  and  that  the 
hypothetical  unsampled  peculiarity  is  an  anomaly. 
If  analysts  really  do  worry  about  unsampled 
parameter  points  that  might  have  blown  up  the 
inferences  had  the  analysts  only  sampled  them,  the 
solution  is  simple — increase  the  size  of  the  sample. 
Analysts  can,  if  they  wish  to  reduce  the  possibility 
of  undiscovered  anomalies,  conduct  a  million 
model  runs.  The  only  matter  at  issue  is  the  oppor¬ 
tunity  cost  of  doing  something  more  productive 
with  their  time  and  computing  resources.  It  is  thus 
perfectly  possible  to  design  “heavy-duty”  compu¬ 
tational  work,  which,  combined  with  rigorous 
statistical  analysis  of  model  outcomes,  enables 
confident  and  replicable  inferences  to  be  drawn 
about  the  effects  of  all  the  model’s  moving  parts. 
At  the  end  of  the  day,  these  are  precisely  the  same 
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types  of  inference  that  emerge  from  classical  ana¬ 
lytical  modeling. 

Finally,  it  is  important  to  note  that  the  ability  to 
derive  intuitions  from  any  model,  computational  or 
analytic,  depends  on  the  model  being  parsimonious, 
in  the  sense  of  having  a  transparent  logic  and  rela¬ 
tively  few  parameters.  Parsimonious  models  tend 
not  only  to  be  easier  to  analyze  than  complicated 
models,  but  also  their  substantive  implications  are 
typically  much  easier  to  interpret.  To  set  against 
this,  the  price  paid  for  a  more  parsimonious  model 
is  that  it  may  be  less  realistic  than  a  richer  (more 
complicated)  model  of  the  same  process.  In  the 
realm  of  classical  analysis,  parsimony  is  to  a  large 
extent  self-policing.  There  are  few  incentives  to 
complicate  a  parsimonious  model  that  has  been 
solved  with  some  difficulty  and  seems  to  be  doing  a 
good  job  at  explaining  something  of  interest.  It  is 
typically  very  easy,  by  contrast,  to  take  a  computa¬ 
tional  model  and  graft  on  layer  after  layer  of  com¬ 
plication,  each  layer  added  in  a  quest  for  enhanced 
realism.  Parsimony  is  not  self-policing,  and,  if  model 
outcomes  do  indeed  seem  more  realistic  as  a  result 
of  the  added  complication,  the  trade-off  between 
parsimonious  but  less  realistic  models,  as  opposed 
to  complicated  but  more  realistic  ones,  is  drawn  in 
much  sharper  relief.  The  matter  of  whether  more 
realistic  and  complicated  or  more  parsimonious  but 
less  realistic  computational  models  provide  better 
intuitions  about  real  politics  is  to  a  large  extent  a 
matter  of  taste,  however,  since  there  is  no  gold  stan¬ 
dard  for  good  intuition  in  political  science. 
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Modernization  Theory 


The  analysis  of  modernity — of  the  modern  social 
order  and  civilization — has  constituted  a  basic 
core  in  the  modern  intellectual  discussion  of  the 
development  of  modernity  in  sociological,  anthro¬ 
pological,  and  historical  scholarship.  In  sociology, 
the  analysis  of  modernity  constituted  the  focus  of 
the  major  early  evolutionists,  such  as  Auguste 
Comte  and  Herbert  Spencer.  In  different  ways, 
such  analysis  was  also  the  central  focus  of  the 
discussion  of  modernity  in  the  works  of  Karl 
Marx,  Alexis  de  Tocqueville,  Emile  Durkheim  and 
Max  Weber,  Ferdinand  Tonnies,  and  many  oth¬ 
ers.  This  entry  traces  the  evolution  of  modernity 
from  the  theories  of  the  1950s  through  the  present 
time,  focusing  especially  on  the  varied  forms  that 
modernity  has  taken  and  its  relationship  to  chang¬ 
ing  concepts  of  power  and  the  legitimacy  of  the 
nation-state.  The  entry  concludes  with  a  look  at 
the  impact  of  globalization  and  at  the  negative 
aspects  of  modernity. 

Classical  Theories 

The  “classical”  theories  of  modernization  of  the 
1950s  have  indeed  identified  the  core  characteris¬ 
tics  of  modernity  and  of  modern  society,  such  as 
the  decomposition  of  older  “closed”  institutional 
frameworks;  the  development  of  new  structural, 
institutional,  and  cultural  features  and  formations; 
and,  to  use  the  terminology  of  Karl  Deutsch,  the 
growing  potential  for  social  mobilization.  The 
most  important  structural  dimension  of  modernity 
attesting  to  the  decomposition  of  former  relatively 
narrow  formations  was  seen  in  the  growing  ten¬ 
dency  for  structural  differentiation — manifested, 
for  example,  in  growing  urbanization;  commodifi¬ 
cation  of  the  economy;  and  in  the  continual  devel¬ 
opment  of  distinctive  channels  of  communication 
and  agencies  of  education.  On  the  institutional 
level,  such  decomposition  gave  rise  to  the  develop¬ 
ment  of  new  institutional  formations — such  as  the 
modern  state,  modern  national  collectivities,  and 
new  market  (especially  capitalist)  economies — that 
were  perceived  or  defined  to  some  extent  at  least 
as  autonomous  and  that  were  indeed  regulated  by 
specific,  distinct  mechanisms — such  as  rules  of  the 
market,  of  bureaucratic  organization,  and  the  like. 
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In  some  later  formulations,  it  was  the  development 
of  such  distinct  autonomous  spheres,  each  regu¬ 
lated  by  its  own  logic,  that  was  very  often  defined 
in  the  essence  of  modern  institutional  formations. 
Concomitantly,  modernity  was  seen  as  bearing  a 
distinct  cultural  program  and  shaping  a  distinct 
type  of  personality  characteristic. 

These  theories,  as  well  as  classical  sociological 
analyses  of  Marx,  Durkheim,  and  to  a  large  extent 
even  of  Weber  (or  at  least  one  reading  of  him), 
have  implicitly  or  explicitly  conflated  the  struc¬ 
tural  and  cultural  major  dimensions  of  modernity, 
as  they  saw  it  developing  in  the  West.  A  very 
strong,  even  if  implicit,  assumption  of  the  studies 
of  modernization  was  that  the  cultural  dimensions 
or  aspects  of  modernization — the  basic  cultural 
premises  of  Western  modernity,  the  “secular” 
rational  worldview,  including  a  strong  individual¬ 
istic  orientation — are  inherently  and  necessarily 
interwoven  with  the  structural  ones.  Accordingly, 
most  of  the  classics  of  sociology  as  well  as  the  stud¬ 
ies  of  modernization  of  the  1940s  and  1950s  and 
the  closely  related  studies  of  convergence  of  indus¬ 
trial  societies  have  assumed,  even  if  only  implicitly, 
that  the  basic  institutional  formations,  the  defini¬ 
tions  of  the  institutional  arenas,  the  modes  of  their 
regulation  and  integration  that  developed  in 
European  modernity,  and  the  cultural  program  of 
modernity  as  it  developed  in  the  West  will  “natu¬ 
rally”  be  ultimately  taken  over,  with  possibly  local 
variations,  in  all — or  at  least  in  the  “successful” — 
modernizing  societies.  It  is  further  assumed  that 
this  project  of  modernity,  with  its  hegemonic  and 
homogenizing  tendencies,  will  continue  in  the 
West  and,  with  the  expansion  of  modernity,  will 
prevail  throughout  the  world. 

Multiple  Modernities 

The  reality  that  emerged  proved  to  be  radically  dif¬ 
ferent.  Modernity  has  indeed  spread  to  most  of  the 
world,  but  it  did  not  give  rise  to  a  single  institu¬ 
tional  pattern  or  to  a  single  modern  civilization 
but,  rather,  led  to  the  development  of  several  con¬ 
tinually  changing  modern  civilizations  or  at  least 
civilizational  patterns — that  is,  of  societies  or  civi¬ 
lizations  that  do  indeed  share  some  central  core 
characteristics  but  that  nevertheless  tend  to  develop 
different,  even  if  cognate,  ideological  and  institu¬ 
tional  dynamics.  Moreover,  far-reaching  changes 


that  go  beyond  the  original  premises  of  modernity 
have  been  taking  place  also  in  Western  societies. 

At  the  same  time,  the  institutional  formations 
that  developed  in  most  societies  of  the  world  have 
been  distinctively  modern  even  if  their  dynamics 
have  been  greatly  influenced  by  their  own  distinc¬ 
tive  cultural  premises,  traditions,  and  historical 
experiences.  This  of  course  runs  contrary  to  the 
view  that  the  best  way  to  understand  the  dynamics 
of  different  “modernizing”  societies  is  to  see  them 
as  continuations,  even  if  in  new  ways,  of  their  tradi¬ 
tions  and  of  their  traditional  institutional  patterns 
and  dynamics — a  view  that  was  to  some  extent 
revived  in  the  context  of  the  contemporary  scene  as 
the  theory  of  the  “clash  of  civilizations.”  Of  special 
importance  in  this  context  was  the  fact  that  the 
most  important  social  and  political  movements  that 
became  predominant  in  these  societies — such  as  the 
nationalistic  ones  and,  later  on,  even  the  fundamen- 
talistic  ones,  which  often  promulgated  strong  anti- 
Western  or  even  antimodern  ideas — were  basically 
distinctively  modern  ones,  promulgating  distinctive 
interpretations  of  modernity. 

Concomitantly  with  the  growing  recognition  of 
the  great  complexity  and  variability  of  modern  and 
contemporary  societies,  a  much  more  complex  pic¬ 
ture  of  modernity  emerged.  From  the  very  beginning 
of  the  discourse  about  modernity,  there  developed 
two  opposing  evaluations,  attesting  to  the  inherent 
contradictions  of  modernity:  One  such  evaluation, 
implicit  also  in  theories  of  modernization  and  those 
of  “convergence  and  of  industrial  societies”  of  the 
1950s  and  early  1960s,  saw  modernity  as  a  positive, 
emancipating,  progressive  force  epitomizing  prom¬ 
ises  of  a  better,  inclusive,  emancipating  world.  The 
other  such  evaluation  that  developed  first  from 
within  the  very  core  of  the  first  European  societies 
and  later  found  strong  resonance  in  non-Western 
European  societies  espoused  a  negative  or  at  least  a 
highly  ambivalent  approach  to  modernity — seeing  it 
as  a  morally  destructive  force — and  emphasized  the 
negative  effects  of  some  of  its  core  characteristics,  be 
it  technology  or  the  empowerment  of  egoistic  and 
hedonistic  attitudes  and  goals. 

All  these  developments  have  yielded  a  continual 
reexamination  of  theories  of  modernization,  giving 
rise  to  the  idea  or  concept  of  multiple  modernities 
as  the  best  way  to  understand  the  contemporary 
world.  Thus,  to  explain  the  history  of  modernity  is 
to  see  it  as  a  story  of  continual  constitution  and 
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reconstitution  of  a  multiplicity  of  cultural  pro¬ 
grams  and  cultural  patterns  of  modernity. 

One  of  the  most  important  implications  of  the 
term  multiple  modernities  is  that  modernity  and 
Westernization  are  not  identical;  Western  patterns 
of  modernity  are  not  the  only  “authentic”  moder¬ 
nities,  though  they  enjoy  historical  precedence  and 
continue  to  be  a  basic  reference  point  for  others. 

This  view  of  multiple  modernities  entails  certain 
assumptions  about  the  nature  of  modernity.  The 
first  such  assumption  is  that  modernity  is  to  be 
viewed  as  a  distinct  civilization,  with  distinct  insti¬ 
tutional  and  cultural  characteristics.  According  to 
this  view,  the  core  of  modernity  is  the  crystalliza¬ 
tion  and  development  of  a  mode  or  modes  of  inter¬ 
pretation  of  the  world  or,  to  follow  Cornelius 
Castoriadis’s  terminology,  of  a  distinct  social 
“imaginaire,”  indeed  of  the  ontological  vision,  of 
a  distinct  cultural  program,  combined  with  the 
development  of  a  set  or  sets  of  new  institutional 
formations — the  central  core  of  both  being  an 
unprecedented  “openness”  and  uncertainty. 

This  civilization,  the  distinct  cultural  program 
with  its  institutional  implications,  crystallized  first 
in  Western  Europe  and  then  expanded  to  other 
parts  of  Europe,  to  the  Americas,  and  later  on 
throughout  the  world,  giving  rise  to  continually 
changing  cultural  and  institutional  patterns,  which 
constituted  different  responses  to  the  challenges 
and  possibilities  inherent  in  the  core  characteris¬ 
tics  of  the  distinct  civilizational  premises  of 
modernity. 

The  core  of  this  program  has  been  that  the 
premises  and  legitimation  of  the  social,  ontologi¬ 
cal,  and  political  order  were  no  longer  taken  for 
granted.  Consequently,  there  developed  an  inten¬ 
sive  reflexivity  around  the  basic  ontological  prem¬ 
ises  as  well  as  around  the  bases  of  social  and 
political  order  of  authority  of  society — a  reflexiv¬ 
ity  that  was  shared  even  by  the  most  radical  critics 
of  this  program,  who  in  principle  denied  the  legiti¬ 
macy  of  such  reflexivity.  Politics  became  a  matter 
for  discussion,  even  if  it  was  then  reputed  to  be 
based  on  rationality. 

In  conjunction  with  these  conceptions,  there 
developed  the  belief  in  the  possibility  of  active 
formation  of  society  by  conscious  human  activity. 
Two  basic  complementary  but  also  potentially 
contradictory  tendencies  about  the  best  ways  in 


which  such  construction  could  take  place  devel¬ 
oped  within  this  program.  The  first  such  tendency 
was  that  the  program,  as  it  crystallized  above  all 
in  the  major  revolutions  in  which  people  claimed 
to  lead  the  political  change,  gave  rise,  perhaps  for 
the  first  time  in  the  history  of  humanity,  to  the 
belief  in  the  possibility  of  bridging  the  gap 
between  the  transcendental  and  mundane  orders, 
of  realizing  through  conscious  human  actions  in 
the  mundane  orders,  in  social  life,  some  of  the 
utopian,  eschatological  visions.  The  second  ten¬ 
dency  was  rooted  in  the  growing  recognition  of 
the  legitimacy  of  multiple  individual  and  group 
goals  and  interests  and  of  multiple  interpretations 
of  the  common  good. 

The  modern  program  entailed  also  a  radical 
transformation  of  the  conceptions  and  premises  of 
the  major  institutional  arenas — the  political  and 
economic  ones  as  well  as  those  of  collective  identi¬ 
ties,  social  hierarchy,  and  economy.  The  core  of 
this  transformation  was  the  breakdown  of  tradi¬ 
tional  legitimation  of  the  political  and  social 
order,  the  concomitant  opening  up  of  different 
possibilities  of  construction  of  such  order,  and  the 
consequent  contestation  about  the  ways  in  which 
political  order  was  to  be  constructed  by  human 
actors.  A  strong  emphasis  on  at  least  the  potential 
active  participation  of  the  periphery,  of  “society,” 
and  of  all  its  members  in  the  political  arena  com¬ 
bined  with  orientations  of  rebellion  and  intellectual 
antinomianism,  together  with  strong  orientations 
to  center  formation  and  institution  building,  giv¬ 
ing  rise  to  social  movements  and  movements  of 
protest  as  a  continual  component  of  the  political 
process. 

Moreover,  the  concrete  contours  of  the  different 
cultural  and  institutional  patterns  of  modernity  as 
they  crystallized  in  different  societies  have  indeed 
been  continually  changing  due  to  the  combination 
of  the  tensions  inherent  in  the  cultural  and  political 
program  of  modernity  and  the  continual  institu¬ 
tional,  social,  political,  and  economic  developments 
attendant  on  the  development  and  expansion  of 
modernity. 

The  institutional  and  cultural  contours  of  moder¬ 
nity  were  continually  changing  because  the  very 
expansion  of  modernity,  beginning  in  Europe, 
entailed  a  confrontation  between  the  concrete  prem¬ 
ises  and  institutional  formations  as  they  developed 
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in  Europe,  later  in  the  Americas,  and  then  in  Asia — 
in  the  Islamic,  Hindu,  Buddhist,  Confucian,  and 
Japanese  civilizations.  The  manifestations  of 
modernity  were  shaped,  above  all,  by  the  internal 
dynamics  of  the  technological,  economic,  political, 
and  cultural  arenas  as  they  developed  in  different 
societies  and  expanded  beyond  them.  Second,  they 
were  a  result  of  the  central  struggle  between  differ¬ 
ent  centers  of  political  and  economic  power  that 
constituted  a  continual  component,  first,  of  the 
formation  of  European  modernity  and,  later,  the 
continual  expansion  of  European,  American,  and 
Japanese  modernities.  This  struggle  took  place  in  a 
context  of  shifting  hegemonies  in  the  different 
international  systems  that  developed  in  the  wake  of 
the  continual  developments  in  the  economic,  politi¬ 
cal,  technological,  and  cultural  arenas,  and  in  cen¬ 
ters  thereof.  The  manifestation  of  modernity  was 
shaped  by  these  forces,  along  with  the  continual 
confrontations  between  interpretations  promul¬ 
gated  by  different  centers,  elites,  and  movements, 
and  the  concrete  developments,  conflicts,  and  dis¬ 
placements  attendant  on  the  institutionalization  of 
these  premises.  Such  confrontations  developed 
already  within  Europe  with  the  crystallization  of 
the  modern  European  state  system  and  became 
further  intensified  with  the  crystallization  of  “world 
systems”  from  the  16th  or  17th  centuries  onward. 

Modernity,  Globalization, 
and  the  Nation-State 

The  multiple  and  divergent  modernities  of  the 
“classical”  age  of  modernity  crystallized  during 
the  19th  century  and  above  all  during  the  first  6  or 
7  decades  of  the  20th  century  in  the  different  ter¬ 
ritorial  nation-states  and  revolutionary  states  and 
in  the  social  movements  that  developed  in  Europe, 
in  the  Americas,  and  in  Asian  and  African  societies 
after  World  War  II.  These  contours — institutional, 
symbolic,  and  ideological  contours  of  the  modern 
national  and  revolutionary  states  and  movements, 
which  were  seen  as  the  epitome  of  modernity — 
have  changed  drastically  on  the  contemporary 
scene  with  the  intensification  of  tendencies  to  glo¬ 
balization,  as  manifest  in  growing  movements  of 
autonomy  of  world  capitalist  forces,  intense  move¬ 
ments  of  international  migrations,  and  the  con¬ 
comitant  development  on  an  international  scale  of 


social  problems,  such  as  prostitution  and  delin¬ 
quency,  all  of  which  reduce  the  control  of  the 
nation-state  over  its  own  economic  and  political 
affairs,  despite  the  continual  strengthening  of  the 
“technocratic”  rational  secular  policies  in  various 
arenas — be  it  in  education  or  family  planning.  At 
the  same  time,  the  nation-states  lost  some  of 
their — always  only  partial — monopoly  of  internal 
and  international  violence  to  many  local  and  inter¬ 
national  groups  of  separatists  or  terrorists  without 
any  nation-state  or  the  concerted  activities  of 
nation-states  being  able  to  control  the  continually 
recurring  occurrences  of  such  violence.  Con¬ 
comitantly,  the  processes  of  globalization  were 
closely  connected  in  the  cultural  arena,  with  the 
expansion  especially  through  the  major  media  in 
many  countries  around  the  world,  including 
Western  ones  such  as  those  of  Europe  or  Canada, 
of  what  were  seemingly  uniform  hegemonic  Amer¬ 
ican  cultural  programs  or  visions. 

Above  all,  the  ideological  and  symbolic  central¬ 
ity  of  the  nation  and  revolutionary  state,  of  its 
being  perceived  as  the  charismatic  locus  of  the 
major  components  of  the  cultural  program  of 
modernity  and  of  collective  identity,  became  weak¬ 
ened,  and  new  political,  social,  and  civilizational 
visions  and  visions  of  collective  identity  developed. 
These  new  visions  and  identities  were  promulgated 
by  several  types  of  new  social  movements.  Such 
“new”  social  movements,  which  developed  in 
most  Western  countries  (e.g.,  the  women’s  and 
ecological  movements),  were  closely  related  to  or 
rooted  in  the  student  and  anti-Vietnam  War 
movements  of  the  late  1960s  and  1970s. 

These  movements  developed  in  tandem  with  the 
crystallization  of  new  social  settings  and  frame¬ 
works  that  also  went  beyond  the  “classical”  model 
of  the  nation-state.  Among  these  settings — new 
especially  to  the  Muslim,  Chinese,  and  Indian 
diasporas — were  new  types  of  ethnic  minorities, 
such  as  the  Russian  ones,  which  emerged  in  many 
of  the  successor  states  of  the  Soviet  Union.  In 
these,  and  in  many  other  settings,  new  types  of  col¬ 
lective  identities  emerged,  often  in  movements 
whose  focus  was  no  longer  the  nation-state.  Many 
of  these  hitherto  seemingly  “subdued”  identities — 
ethnic,  regional,  local,  and  transnational  alike — 
moved  naturally,  though  in  a  highly  reconstructed 
way,  into  the  centers  of  their  respective  societies 
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and  also  often  into  the  international  arena.  They 
contested  the  hegemony  of  the  older  homogenizing 
programs,  claiming  their  own  autonomous  places  in 
central  institutional  arenas — educational  programs, 
public  communications,  or  the  media — and  very 
often,  they  also  made  far-reaching  claims  with 
respect  to  the  redefinition  of  citizenship  and  the 
rights  and  entitlements  connected  with  it.  In  these 
settings,  local  dimensions  were  often  brought  together 
in  new  ways  beyond  the  model  of  the  classical 
nation-state,  with  transnational  ones,  such  as  the 
European  Union,  or  with  broad  religious  identities — 
many  of  them  rooted  in  the  great  religions,  such 
as  Islam,  Buddhism,  or  different  branches  of 
Christianity,  but  reformulated  in  new  modern  ways. 

In  parallel  fashion,  continuous  shifts  occurred 
in  the  relative  hegemony  of  different  centers  of 
modernity — first  European  and  U.S.  ones,  moving 
to  East  Asia — shifts  that  became  continually  con¬ 
nected  with  the  growing  contestations  between 
such  centers  around  their  presumed  hegemonic 
standing. 

Destructive  Dimensions  of  Modernity 

Contrary  to  the  optimistic  views  of  modernity  as 
progress,  the  development  and  expansion  of 
modernity  was  not  peaceful.  It  also  bore  within  it 
very  destructive  possibilities — which  were  indeed 
voiced,  and  also  often  promulgated,  by  some  of  its 
most  radical  critics,  who  saw  modernity  as  a  mor¬ 
ally  destructive  force  and  emphasized  the  negative 
effects  of  some  of  its  core  characteristics.  The 
crystallization  of  the  first  and  the  development  of 
later  modernities  were  continually  interwoven 
with  internal  conflicts  and  confrontations,  rooted 
in  the  contradictions  and  tensions  attendant  on 
the  development  of  capitalism  and,  in  the  political 
arena,  the  growing  demands  for  democratization 
and  with  international  conflicts  in  the  framework 
of  the  modern  state  and  imperialist  systems. 
Above  all,  the  evolution  of  modernity  was  closely 
interwoven  with  wars,  and  genocides,  repressions, 
and  exclusions  constituted  continual  components 
thereof.  Wars  and  genocide  were  not,  of  course, 
new  in  the  history  of  mankind.  But  they  became 
radically  transformed  and  intensified,  generating 
continuous  tendencies  to  specifically  modern  bar¬ 
barism,  the  most  important  manifestation  of 
which  was  the  ideologization  of  violence,  terror, 


and  war — manifest  most  vividly  first  in  the  French 
Revolution.  Such  ideologization  emerged  out  of  the 
interweaving  of  wars  with  the  basic  constitutions  of 
the  nation-states,  with  those  states  becoming  the 
most  important  agent — and  arena — of  constitution 
of  citizenship  and  symbols  of  collective  identity, 
with  the  crystallization  of  the  modern  European 
state  system  and  of  European  expansion  beyond 
Europe  and  with  the  intensification  of  the  technolo¬ 
gies  of  communication  and  of  war. 

S.  N.  Eisenstadt  (Deceased) 
The  Van  Leer  Jerusalem  Institute 
Jerusalem,  Israel 
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The  Concise  Oxford  English  Dictionary  defines 
monarchy  as  a  “state  under  monarchical  govern¬ 
ment”  where  a  monarch  is  a  “sovereign  with  title 
of  king,  queen,  emperor,  empress  or  equivalent.” 
Although  this  definition  has  the  merit  of  describ¬ 
ing  popular  usage  in  the  English  language,  it  is 
readily  apparent  that  it  is  indeterminate.  It  amounts 
to  saying  that  monarchy  is  what  monarchs  do,  and 
it  leaves  open  what  other  titles  might  be  equivalent 
(in  the  masculine  forms)  to  king  or  emperor. 

Etymology — and  the  classical  Greek  origin  of  the 
word — offers  an  approach  that  initially  seems  more 
determinate:  Monarchy  is  the  rule  of  a  single  per¬ 
son.  This  can  be  contrasted  with  aristocracy — the 
rule  of  the  “best”  or  the  “elite” — and  democracy — 
variously  the  rule  of  all  the  people  or  the  “ordi¬ 
nary”  people.  However,  common  usage  in  English 
and  several  other  languages  has  come  to  invest  the 
idea  of  monarchy  with  at  least  the  additional  crite¬ 
rion  of  hereditary  acquisition.  In  fact,  the  heredi¬ 
tary  principle  lies  at  the  heart  of  the  popular  image 
of  monarchy,  even  though  some  of  the  most 
notable  monarchies  in  history,  including  the 
Roman  Empire  and  its  supposed  successor  the 
Holy  Roman  Empire  were,  at  least  nominally, 
nonhereditary,  the  latter  being  elected  by  seven 
“Electors,”  most  of  whom  did  acquire  their  right 
of  franchise  through  the  hereditary  principle.  The 
contemporary  Malaysian  monarchy  is  rotated 
among  12  provincial  sultans,  normally  for  a  5 -year 
term,  and  each  new  incumbent  must  be  ratified  by 
parliament.  It  would  be  easy  to  argue  that  many 
modern  monarchies,  often  incorrectly — or  at  least 
confusingly — called  “constitutional”  monarchies 
where  the  monarch  is  the  head  of  state  but  does 
not  wield  executive  power,  are  much  less  like  mon¬ 
archies  in  the  Greek  sense  than  is  the  papacy  or  a 
regime  such  as  that  of  the  Haitian  dictator  Franqois 
“Papa  Doc”  Duvalier,  who  caused  himself  to  be 
declared  “president  for  life”  in  1964. 

Thus,  the  question  of  how  many  monarchs  have 
survived  into  the  21st  century  is  not  a  straightfor¬ 
ward,  factual  one  as  it  might  first  appear.  The 
answer  depends  partly  on  whether  separate  monar¬ 
chical  traditions  within  the  same  state  are  distin¬ 
guished,  the  obvious  example  being  England, 
Scotland,  Northern  Ireland,  the  Principality  of 


Wales,  the  Isle  of  Man,  the  Bailiwick  of  Guernsey, 
and  the  States  of  Jersey  within  the  British  Isles. 
Some  dependent  territories  of  the  United  Kingdom 
offer  similar  counting  problems,  and  there  is  the 
comparable  case  of  a  federated  multiple  monarchy, 
such  as  the  United  Arab  Emirates:  Is  it  one  monar¬ 
chy  or  seven?  The  answer  would  also  depend  on 
whether  you  include  nonstandard  and  nonheredi¬ 
tary  forms  of  monarchy  such  as  the  papacy,  lifetime 
dictators,  or  republics  with  a  local  monarchical 
element,  such  as  acknowledged  tribal  leaders  within 
some  African  states.  It  is  not  normally  assumed  that 
the  answer  depends  on  the  extent  to  which  a  mon¬ 
arch  has  any  executive  power;  the  United  Kingdom, 
where  the  monarch  has  possessed  powers  that  have 
been  small  and  diminishing  since  the  17th  century, 
and  Morocco,  where  King  Mohamed  VI  deter¬ 
mines  much  policy,  are  both  considered  to  be  mon¬ 
archies  to  the  same  degree.  Nor  is  it  considered 
relevant  that  in  many  cases  the  continued  existence 
of  the  monarchy  is  assumed  to  be  dependent  on  the 
popular  will.  In  most  countries  that  are  monarchies 
but  have  democratically  elected  governments,  it 
would  be  accepted  that  a  referendum  on  a  transi¬ 
tion  to  a  republic  might  be  held  and  that  in  the 
event  of  a  vote  for  such  a  transition,  it  would  occur 
peacefully.  The  Greek  monarchy  was  finally  abol¬ 
ished  as  the  result  of  a  plebiscite  in  1974,  while  in 
Australia  in  1999  the  change  was  rejected.  However, 
the  assumption  that  such  a  change  could  occur 
“merely  on  account  of  the  number  of  votes” — to 
borrow  a  phrase  from  a  monarch  (Queen  Victoria 
in  her  Diaries) — might  be  thought  to  erode  the 
nature  of  monarchy  in  its  original  sense. 

The  answer  to  the  question  of  how  many  monar¬ 
chies  remain  therefore  varies  from  around  25  to  50. 
Either  of  these  figures  might  come  as  a  surprise  to 
the  many  political  observers  who  have,  over  the 
past  2  centuries,  predicted  the  complete  demise  of 
monarchy.  They  might  be  less  surprised  to  discover 
that  about  25  states  had  the  same  monarch,  Queen 
Elizabeth  II  of  England.  This  entry  discusses  the 
debate  about  monarchy  and  its  aftermath  as  well  as 
the  political  science  research  on  monarchy. 

The  Early  Modern  Debate  About 
Monarchy  and  Its  Aftermath 

Monarchy  was  at  the  heart  of  early  modern 
debates  about  politics  and  government,  which 


1614  Monarchy 


were  primarily  normative  debates.  The  issue  was 
sometimes  whether  monarchy  was  a  legitimate  (or 
the  legitimate)  form  of  government,  but  more 
often,  the  debates  were  about  the  form  monarchy 
should  take,  the  rules  of  succession,  the  limits  on 
monarchical  authority,  and  the  subjects’  duties  of 
obedience  to  monarchs  and  rights  of  rebellion 
against  them.  All  this  can  be  portrayed  as  a  crisis 
of  legitimacy  in  which  states  and  their  apologists 
desperately  sought  to  establish  their  legitimacy 
and  thus  their  right  to  govern.  It  would  be  wrong 
to  portray  this  crisis  as  something  new  and  “mod¬ 
ern”  in  the  sense  of  post-Renaissance  or  post- 
Reformation:  One  would  only  have  to  read  the  10 
plays  of  Shakespeare  on  English  historical  topics 
to  realize  that  the  nature  of  kingship  and  its  suc¬ 
cession  were  always  contested.  But  there  was  a 
new  element,  insofar  as  there  was  no  longer  an 
accepted  religious  source  of  authority  providing  a 
natural  counterbalance  to  the  claims  of  monarchs. 

It  would  also  be  an  oversimplification  to  por¬ 
tray  the  debate  about  government  in  the  17th  and 
18th  centuries  as  a  contest  between  the  ideas  of 
sovereignty  as  vested  in  “the  Lord’s  Anointed” 
versus  its  being  vested  in  “the  people.”  The  main 
complication  is  the  appearance  of  religion  on  both 
sides.  In  the  English  Civil  War,  for  example,  both 
sides  referred  ultimately  to  religious  authority  to 
justify  their  actions.  The  men  who  signed  the  death 
warrant  of  Charles  I  in  1649  were,  for  the  most 
part,  no  more  believers  in  popular  sovereignty 
than  he  was,  and  they  did  go  on  to  suppress  those 
who  did  believe  in  it,  including  the  “Levellers.” 

In  this  context,  it  is  important  to  distinguish 
between  the  development  of  monarchy  in  France 
and  in  Great  Britain,  both  in  terms  of  the  working 
of  the  institution  and  the  justifications  offered  for 
it.  In  France,  kings  continued  to  claim  executive 
power  and  to  rule  by  “divine  right,”  justified  by 
apologists  such  as  the  prelate  Jacques-Benign  e 
Bossuet.  They  increasingly  ignored  ancient  repre¬ 
sentative  institutions  such  as  the  parlements  and 
the  etats.  But  in  Britain,  following  the  Civil  War 
and  the  restoration  of  the  monarchy  in  1660,  the 
monarchy  developed  as  a  more  limited  institution 
with  diminished  and  diminishing  executive  pow¬ 
ers.  This  has  often  been  referred  to  as  “constitu¬ 
tional  monarchy,”  but  there  is  at  least  an  element 
of  paradox  about  the  use  of  this  phrase  because  it 
is  also  widely  stated  that  Britain  has  not  had  a 


constitution.  This  is  true  if  what  is  meant  by  “con¬ 
stitution”  is  the  sort  of  coherent,  codified  docu¬ 
ment  that  the  United  States  of  America  and  France 
(in  several  different  versions)  developed  after  their 
revolutions. 

Proponents  of  the  British  monarchy  tended  to 
defend  a  more  limited  form  of  the  institution  and 
to  put  their  case  in  a  more  secular  mode.  (Even 
after  the  French  Revolution,  defenders  of  monar¬ 
chy  such  as  Joseph  de  Maistre  tended  to  remain 
religious  and  fundamentalist  rather  than  conse- 
quentialist  in  their  defense  of  the  institution.)  As 
an  extreme  case,  one  might  cite  Sir  Robert  Filmer’s 
Patriarcha  (1657).  At  first  sight,  it  seems  to  be  a 
fundamentalist  case  for  monarchy  written  under  a 
failing  republic.  Filmer  argues  that  legitimate  mon¬ 
archs  have  a  God-given  right  to  rule  comparable 
with — and  an  extension  of — the  God-given  author¬ 
ity  of  husband-fathers  as  heads  of  households. 
Filmer’s  account  of  the  proper  order  of  things  is 
overtly  religious,  but  it  has  often  been  pointed  out 
that  his  argument  can  easily  be  reproduced  in  a 
purely  secular  form  as  an  account  of  the  organic 
existence  of  society  whose  natural  bonds  give  it 
stability  and  determine  its  proper  leadership:  The 
King  is  the  father  of  the  nation.  This  can  also  be 
seen  as  the  forerunner  of  later  defenses  of  a  limited 
monarchy,  including  those  by  Edmund  Burke  and 
Walter  Bagehot,  in  which  the  monarch  is,  respec¬ 
tively,  the  core  of  a  system  of  institutions  that  can¬ 
not  be  radically  changed  without  incurring  disaster 
and  is  something  that  is  necessary  at  the  symbolic 
level  of  politics  rather  than  the  rational  level. 

In  the  1 8th  century,  the  commercial  success  and 
relative  political  stability  of  Great  Britain — a  par¬ 
liamentary  government  under  Hanoverian  kings — 
attracted  new  defenses  of  limited  monarchy.  To 
the  Baron  de  Montesquieu,  it  was  an  example  of 
“mixed  government,”  its  different  institutions 
being  based  on  separate  and  even  conflicting  prin¬ 
ciples,  relating  to  each  other  with  a  kind  of  creative 
tension.  But  his  analysis  of  the  working  of  British 
government  as  being  based  on  a  “separation  of 
powers”  was  even  more  inaccurate  than  John 
Locke’s  50  years  earlier:  The  powers — as  opposed 
to  the  forms  and  symbols — were  now  overwhelm¬ 
ingly  on  one  side.  But  the  most  intellectually  unen¬ 
cumbered  and  radical  defense  of  the  monarchical 
status  quo  came  from  the  Scottish  philosopher 
David  Hume.  In  his  essays,  especially  “Of  the 
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Origin  of  Government,”  “Of  the  Original 
Contract,”  and  “Of  Passive  Obedience,”  Hume 
scornfully  dismissed  the  rival  Tory  and  Whig 
claims  to  explain  political  obligation  and  legiti¬ 
macy.  It  was  nonsense  to  claim  that  government 
was  based  on  any  form  of  “contract,”  just  as  it 
was  nonsense  to  claim  that  we  had  an  absolute 
obligation  to  obey  royal  government.  Hanoverian 
stability  and  prosperity,  for  which  Hume  was  duly 
grateful,  came  about  partly  because  the  regime  was 
incapable  of  providing  a  theoretical  justification 
for  its  existence,  and  it  should  be  judged  on  its 
consequences,  not  on  its  origins.  In  the  final  chap¬ 
ter  of  his  History  of  England  (Vol.  6,  chap.  81), 
Hume  contrasts  William  of  Orange,  who,  in  the 
name  of  his  wife  Mary,  usurped  the  English  throne 
in  1689,  sending  James  II  into  exile.  James  was  the 
more  virtuous  man  of  the  two,  remarks  Hume, 
and  his  claim  to  the  throne  was  a  superior  one.  But 
the  consequences  of  the  usurpation  were  better 
than  the  alternative,  because  of  certain  negative 
virtues  that  William  possessed:  He  was  not  a 
Catholic,  and  the  very  weakness  of  his  claim  was  a 
strength  in  a  limited  monarchy.  Of  the  “Glorious 
Revolution,”  Hume  concludes  that  “it  gave  such 
an  ascendant  to  popular  principles”  and  “this 
island,  have  ever  since  enjoyed  ...  at  least  the 
most  entire  system  of  liberty.” 

It  should  not  be  inferred  that  Hume  favored  the 
Hanoverian  arrangements  because  they  were  pop¬ 
ular  government  in  disguise.  Like  many  of  those 
who  devised  the  constitution  of  the  United  States, 
Hume  was  clearly  of  the  view  that  popular  sover¬ 
eignty  as  an  overriding  principle  was  as  dangerous 
as  the  divine  right  of  kings.  His  necessary  condi¬ 
tion  of  liberty  was  that  no  principle  be  overriding 
and  that  monarchy  must  remain  to  ensure  this,  at 
least  in  Great  Britain. 

The  French  Revolution  and  the  excesses  it 
engendered  advanced  and  renewed  the  arguments 
in  favor  of  monarchy  in  Britain.  Some  Englishmen, 
including  Tom  Paine  and  Richard  Price,  were 
enthusiastic  in  wanting  Britain  to  follow  France  in 
a  republican  direction.  William  Wordsworth 
expressed  the  enthusiasm  of  a  generation:  “Bliss 
was  it  in  that  dawn  to  be  alive”  ( The  Prelude, 
Book  II).  But  this  initial  enthusiasm  soon  faded  as 
France  moved  toward  the  Reign  of  Terror,  and 
war  broke  out  between  the  two  countries.  Edmund 
Burke  argued  in  his  Reflections  on  the  Revolution 


in  France  in  1790  that  if  “revolution”  was  taken  to 
mean  what  the  French  seemed  to  mean  by  it — the 
radical  reconstruction  of  society  along  lines  sug¬ 
gested  by  abstract  reason — then  it  was  doomed  to 
degenerate  into  failure  and  tyranny.  This  was  not 
true  of  the  more  modest  form  of  revolution  that 
had  occurred  in  England  in  1789  and  in  America 
in  1776,  both  revolutions  that  Burke  favored.  He 
argued  that  the  “real  rights  of  men”  existed  only 
as  properly  recognized  and  developed  systems  of 
obligation  in  well-established  and  well-ordered 
societies.  The  liberties  of  the  English  were  part  of 
a  slowly  developing  system  of  government  and 
society  of  which  monarchy  was  an  integral  part. 
Thus,  liberty  and  monarchy  were  part  of  one 
another,  and  there  was  little  chance  of  sustained 
liberty  in  a  popular  republic  that  was  based  on 
abstract  principles.  The  utilitarian  philosopher 
Jeremy  Bentham  was  even  more  emphatic  in  his 
condemnation  of  the  revolutionaries.  In  Anarchical 
Fallacies,  he  attacked  the  very  nature  of  the  French 
revolutionary  doctrines  such  as  “the  natural  rights 
of  man,”  arguing  that  they  failed  both  ontological 
and  ethical  tests  and  were,  therefore,  “nonsense  on 
stilts.”  But  Bentham  was  a  very  strict  and  self- 
conscious  consequentialist,  so  his  support  for 
monarchy  was  necessarily  more  contingent  and 
mutable  than  was  Burke’s. 

In  short,  the  defenses  of  British  monarchy  by 
Montesquieu,  Hume,  Burke,  and  Bentham  were 
different,  but  had  in  common  the  view  that  by 
combining  parliamentary  power  with  royal  sover¬ 
eignty  British  government  was  “limited,”  “moder¬ 
ate,”  or  “mixed”  in  ways  that  were  conducive  to 
sustaining  liberty.  Such  government  was  compared 
favorably  with  republican  government  in  which 
there  was  no  natural  balance  to  the  popular  will  or 
limit  to  abstract  principles.  All  of  them  might 
argue  that  “natural  rights”  or  “popular  sover¬ 
eignty”  were,  if  anything,  slightly  less  nonsensical 
than  “the  divine  right  of  kings,”  but  in  the  circum¬ 
stances  they  found  themselves,  the  latter  was  the 
less  dangerous  doctrine. 

The  normative  argument  in  favor  of  limited 
monarchy  took  a  new  turn  with  the  publication  of 
Walter  Bagehot’s  book  The  English  Constitution 
in  1867.  Bagehot  attacked  the  idea  that  govern¬ 
ment  could  be  understood  only  in  terms  of  power 
and  the  making  of  decisions,  arguing,  in  effect, 
that  such  accounts  were  one-dimensional: 
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In  such  constitutions  there  are  two  parts  .  .  .  first, 
those  which  excite  and  preserve  the  reverence  of 
the  population — the  dignified  parts,  .  .  .  ;  and 
next,  the  efficient  parts — those  by  which  it,  in 
fact,  works  and  rules  .  .  .  every  constitution  must 
first  gain  authority,  and  then  use  authority;  it 
must  first  win  the  loyalty  and  homage  of  mankind 
and  then  employ  that  homage  in  the  work  of 
government.  .  .  .  The  dignified  parts  of  the 
government  are  those  which  bring  it  force — 
which  attract  its  native  power.  .  .  .  The  Queen  is 
only  at  the  head  of  the  dignified  part  of  the 
constitution.  The  prime  minister  is  at  the  head  of 
the  efficient  part.  (pp.  3ff) 

“Dignity,”  in  Bagehot’s  sense,  humanizes  and 
personalizes  our  relationship  with  the  state,  allow¬ 
ing  us  to  identify  it  with  a  family,  a  history,  and  a 
nation.  This  is  unselfconsciously,  but  entirely,  a 
functionalist  account  of  the  working  of  govern¬ 
ment,  and  its  predecessors  therefore  include  the 
accounts  of  “the  body  politic”  going  back  to  Plato 
and  Aristotle  while  its  successors  include  20th- 
century  systems  analysts  such  as  David  Easton  and 
Oran  Young.  It  remains  an  important  argument  on 
several  levels  including  the  defense  of  monarchy 
and  the  general  understanding  of  political  systems. 

Monarchy  and  Political  Science 
in  the  20th  Century 

Bagehot’s  theory  of  the  working  of  a  limited  mon¬ 
archy  has  been  widely  quoted,  accepted,  and 
taught.  It  has  acquired  some  of  the  properties  of  a 
self-fulfilling  prophecy,  influencing  both  the  way  in 
which  British  monarchs  have  conducted  themselves 
and  the  perception  of  monarchy  in  the  population 
as  a  whole.  For  example,  socialist  politicians  have 
been  surprisingly  easily  persuaded  that  the  exis¬ 
tence  of  monarchy  is  compatible  with  their  egali¬ 
tarian  principles — a  compatibility  that  would  seem 
unlikely  prima  facie.  Moreover,  they  have  often 
made  the  judgment  that  there  are  fewer  obstacles 
to  their  policy  proposals  under  a  monarchy  than 
there  would  be  with  a  formal,  written  constitution. 
In  any  case,  monarchy  as  a  political  issue  would 
likely  be  highly  emotive  and  distracting;  like  abor¬ 
tion,  it  would  be  an  unlikely  issue  for  a  rational 
career  politician  to  choose  to  be  involved  in.  As  a 
result,  the  existence  of  the  monarchy  has  been  kept 


off  the  main  political  agenda  in  Britain,  and  it  is 
only  slightly  less  rare  that  the  precise  role  of  the 
monarchy  has  been  debated. 

Of  course,  what  applies  to  Britain  also  applies 
to  different  degrees  in  a  number  of  Commonwealth 
countries.  It  is  interesting  to  note  that  monarchy 
has  been  more  of  an  issue  in  Australia  than  it  has 
in  Canada,  though  in  the  Canadian  version,  the 
role  of  the  monarchy  is  more  circumscribed,  and 
there  has  been  no  equivalent  of  the  events  in 
Australia  in  1975  when  the  Queen’s  representa¬ 
tive,  the  Governor-General  Sir  John  Kerr,  played 
an  active  part  in  dismissing  the  government  of  the 
day  during  what  has  been  described  as  the  greatest 
constitutional  crisis  in  Australian  history.  But  it  is 
also  the  case  that  Bagehot’s  theory  of  the  role  of 
monarchy  applies  at  least  in  part  to  other  monar¬ 
chies,  because  in  these  cases  as  well,  the  monarchy 
can  be  seen  as  located  in  a  different  dimension 
from  normal  politics.  The  Allied  (principally 
American)  decision  to  allow  Emperor  Hirohito  of 
Japan  to  continue  as  head  of  state  after  the 
Japanese  surrender  in  1945  falls  into  this  category 
and  almost  certainly  facilitated  the  legitimate  and 
peaceful  postwar  development  of  the  country.  In 
Spain,  the  restoration  of  the  monarchy  in  1975 
also  proved  similarly  to  facilitate  reconciliation 
and  legitimation.  In  one  sense,  Spain  had  been  a 
monarchy  since  the  end  of  the  Civil  War  in  1939, 
but  it  was  a  monarchy  without  a  monarch,  with 
Generalissimo  Francisco  Franco  as  “Caudillo”  and 
acting  head  of  state.  To  see  the  Spanish  monarchy 
in  this  Bagehotian  light  is  surprising,  given  that 
monarchy  had  remained  an  ideological  issue  in 
Spain,  but  King  Juan  Carlos  I  has  proved  adept  at 
working  with  elected  politicians  of  all  persuasions 
in  the  British  style.  He  enhanced  his  reputation 
considerably  by  acting  decisively  against  an 
attempted  military  coup  in  1981. 

The  case  of  Thailand  is  more  difficult  to  assess. 
The  country  has  had  a  constitutional  monarch 
since  1932,  and  King  Bhumibol  (as  it  happens,  a 
native  of  Massachusetts)  has  been  in  office  since 
1946.  The  monarchy  is  revered  in  Thailand,  and 
the  monarch  has  a  religious  status  as  the  spiritual 
leader  and  highest  level  of  being  within  the  national 
form  of  Buddhism.  But  Thailand  has  not  experi¬ 
enced  political  stability;  it  has  had  a  cycle  of  mili¬ 
tary  coups  and  democratic  restorations  with  the 
longest  period  of  sustained  democratic  government 
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being  1992  to  2006.  Spain  and  Thailand  can  be  seen 
as  the  inverse  of  each  other:  In  Spain,  the  monarchy 
has  helped  provide  the  conditions  for  reconciliation 
and  democratization  despite  the  institution  being 
initially  divisive,  whereas  in  Thailand,  the  monarchy 
is  revered  as  “above”  politics  but  has  been  unable  to 
prevent  political  instability.  But  one  could  argue 
that  without  the  monarchy  Thailand’s  divisions 
would  have  been  expressed  much  more  violently 
and  its  sense  of  national  solidarity  lost  completely. 
Thus,  it  is  not  out  of  the  question  to  argue  for  the 
Bagehotian  functionalist  role  of  monarchy  even  in 
the  Thai  case.  It  may  be  that  without  monarchy, 
Thailand  would  have  experienced  more  bloodshed 
and  less  economic  progress. 

In  all  these  cases,  it  is  at  least  arguable  that 
monarchy  has  been  very  important  politically, 
principally  by  providing  the  context  in  which  poli¬ 
tics  takes  place.  So  it  is  surprising  to  report  that 
there  has  been  very  little  attention  in  the  academic 
study  of  politics  to  the  institution  of  monarchy. 
Bagehot’s  account  of  the  institution,  though  well 
known,  has  barely  been  refined  or  developed.  The 
subject  has  been  left  to  journalists  and  historians, 
and  it  is  instructive  to  consider  why  this  should 
have  been  so.  It  is  even  the  case  that  most  univer¬ 
sity  courses  on  British  politics  do  not  discuss  the 
monarchy  as  such. 

First,  there  is  the  question  of  what  James  March 
called  “the  power  of  power.”  The  concept  of 
power  combines  the  appeal  of  “the  glories  of  clas¬ 
sical  mechanics”  with  that  of  “the  cynicism  of 
Realpolitik.  ”  And  concern  with  it  has  been  a  defin¬ 
ing  condition  of  the  modern  study  of  politics. 
“Politics  is  about  power”  is  the  mantra,  which 
appears  to  specifically  exclude  the  study  of  “con¬ 
stitutional”  monarchs.  But  the  distinction  between 
the  powerful  and  the  formal  can  never  be  com¬ 
pletely  impermeable  as  is  shown  by  the  incidents  in 
Spain  in  1981.  It  is  possible  to  imagine  a  constitu¬ 
tional  crisis  in  the  United  Kingdom  in  which  the 
monarch  would  assume  a  similarly  important  role. 
In  any  case,  in  Bagehot’s  account  of  monarchy,  the 
monarch  retains  certain  residual  political  rights — 
to  warn,  to  advise,  and  to  be  consulted — which 
may  generate  influence,  particularly  when  an  expe¬ 
rienced  monarch  is  dealing  with  a  relatively  inex¬ 
perienced  politician. 

Second,  in  the  case  of  those  monarchies  that 
retain  executive  power,  the  methodological 


constraints  on  research  are  considerable.  To  inves¬ 
tigate  the  Moroccan  monarchy  or  any  of  those  in 
the  Arabian  peninsula  would  present  a  Western 
political  researcher  with  obstacles  of  language, 
culture,  access,  censorship,  and  coercion.  In  any 
case,  since  there  is  only  a  limited  set  of  monar¬ 
chies,  even  effective  research  might  appear  contin¬ 
gent  and  anecdotal  with  little  or  nothing  to  say 
about  monarchy  per  se. 

Third,  there  is  also  the  case  of  progressivism, 
whether  in  its  Whig  or  Marxist  forms.  That  is, 
although  monarchy  may  exist  and  be  thriving 
now,  forms  of  developmental  model  relegate 
monarchy  to  the  past.  Research  in  the  social  sci¬ 
ences  is  naturally  more  oriented  to  the  present 
and  the  future  rather  than  the  past.  We  could 
argue  that  20th-century  political  science  devoted 
too  much  attention  to  unanswerable  and  superfi¬ 
cial  questions  about  public  opinion  and  too  little 
to  monarchy. 

Fourth — and  finally — there  is  the  undeniable 
effect  that  research  feeds  on  research  and  generates 
fashions.  Monarchy  is  not  studied  partly  because 
nobody  ever  makes  an  attempt  to  study  it,  and  it 
would  be  a  brave  young  researcher  who  announced 
that  he  or  she  was  going  to  devote  his  or  her  career 
to  studying  monarchy. 

The  absence  of  a  political  science  of  monarchy 
is  more  complete  at  the  microscopic  level  than  at 
the  macroscopic.  At  the  broad  level,  some  political 
scientists  have  accorded  at  least  a  curious  atten¬ 
tion  to  the  effects  of  monarchy  on  political  culture 
and  thus  on  the  working  of  political  systems.  In 
particular,  Seymour  Martin  Lipset  (1960)  has 
contrasted  the  cultural  condition  of  American 
republicanism  with  the  more  deferential  and 
“governable”  characteristics  of  Canada  and 
European  monarchies.  Contemporary  monar¬ 
chists  might  argue,  based  on  this  research,  that 
monarchies  as  a  whole  remain  generally  more 
stable  and  more  prosperous  than  republics  as  a 
whole,  but  the  obvious  response  would  be  to 
point  out  that  this  is  a  spurious  correlation  depen¬ 
dent  on  the  further  observation  that  most  monar¬ 
chies  are  either  in  Northern  Europe  or  the 
Commonwealth.  They  would  be  on  firmer  ground 
in  saying  that  the  principal  advantage  of  monar¬ 
chy,  especially  when  compared  with  executive 
presidencies,  is  that  it  furnishes  a  state  with  a  head 
who  did  not  seek  the  role  and  whose  identity  is 
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bound  up  with  that  of  the  society,  rather  than  an 
ambitious  politician  with  executive  responsibili¬ 
ties  whom  at  least  a  substantial  minority  of  the 
population  are  likely  to  dislike. 
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Monetary  Relations 


International  “monetary  relations”  refers  to  the 
efforts  of  sovereign  states  to  influence  the  condi¬ 
tions  of  cross-border  flows  of  money  and  other 
financial  assets,  especially  money  flows  that  are 
not  the  direct  counterpart  of  real  exchanges  of 
goods  and  services.  These  conditions  include  but 
are  not  limited  to  exchange  rate  regimes  and  lev¬ 
els,  capital  and  investment  controls,  foreign  debt 
contracts,  the  use  of  reserve  currencies,  regulation 
of  multinational  banks  and  nonbank  financial 
institutions,  and  balance-of-payments  crisis  man¬ 
agement.  After  situating  the  topic  in  its  theoretical 
and  historical  contexts,  this  entry  discusses  the 
relationship  of  monetary  legitimacy  to  state 
power,  the  ambiguous  nature  of  global  monetary 
governance,  and  the  contemporary  monetary 
issues  of  greatest  concern.  A  final  section  high¬ 
lights  the  range  of  theoretical  and  methodological 
perspectives  in  use. 

Theory  and  History 

Neoclassical  economists  typically  make  a  number 
of  assumptions  when  examining  the  international 
monetary  system,  among  the  most  significant 
being  that  financial  firms  and  investors  act  inde¬ 
pendently  of  one  another,  that  multinational 
banks  have  little  home  bias  in  their  lending  and 
investing  decisions,  and  that,  under  fully  liberal¬ 
ized  global  capital  markets,  firms  and  countries 
with  objectively  similar  economic  profiles  will  face 
homogeneous  credit,  insurance,  and  bankruptcy 
conditions.  These  analysts  model  the  global  mon¬ 
etary  system  as  a  decentralized,  self-equilibrating 
market.  Within  this  cognitive  framing,  national 
decisions  to  depart  from  fully  liberalized  capital 
accounts  appear  suboptimal.  Yet  the  neoclassical 
approach  ignores  the  role  played  by  states  in  con¬ 
stituting  the  rules  and  institutions  within  which 
market  transactions  occur. 

Political  scientists  in  contrast  assume  an  inter¬ 
national  political  economy.  World  markets 
are  embedded  in  and  permeated  by  social  institu¬ 
tions,  including  formal  international  governmental 
organizations  (IGOs;  with  membership  limited  to 
sovereign  states)  as  well  as  informal  clubs  and  pro¬ 
cesses,  each  associated  with  norms,  laws,  or  rules, 
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and  standardized  procedures.  Yet  there  is  neither  a 
world  government  nor  a  collective  and  authorita¬ 
tive  enforcement  mechanism  for  global  market 
transactions.  Those  who  defy  the  “rules”  of 
exchange  (honor  contracts,  represent  merchandise 
honestly,  don’t  bribe,  and  don’t  manipulate  prices) 
may  be  punished  by  the  market  in  the  form  of 
reputational  losses.  Powerful  states  also  possess  a 
host  of  additional  punishments  and  inducements, 
particularly  access  to  their  home  markets,  that 
core  country  governments  may  deploy  to  get  the 
rules,  compliance  from  others,  and  occasional 
exceptions  for  themselves  that  they  desire. 
Neoclassical  economics  does  not  model  these  non¬ 
trivial  special  privileges. 

International  monetary  transactions  over  the 
past  century  and  a  half  have  occurred  within  four 
broad  monetary  eras.  Three  were  characterized  by 
sets  of  rules  and  social  institutions  designed  and 
enforced  by  representatives  of  a  dominant  state  or 
states,  while  the  fourth  period  was  an  ultimately 
unsuccessful  attempt  to  establish  a  durable  regime. 

During  the  classical  gold  standard  era,  roughly 
1870  to  1914,  the  major  trading  states  pegged 
their  paper  currencies  (fiat  monies)  to  gold,  with 
incumbent  governments  promising  to  redeem  this 
paper  on  demand.  The  system’s  anchor  was  the 
credibility  of  the  promises  of  key  states,  particu¬ 
larly  Britain,  to  exchange  intrinsically  valueless 
paper  for  a  preset  quantity  of  precious  metal. 

The  second  period  was  the  two  interwar  decades 
of  unsuccessful  attempts  to  reconstruct  the  prewar 
gold  standard.  With  its  industrial  economy  deci¬ 
mated  by  World  War  I,  Britain  especially  tried  to 
reestablish  sterling  convertibility  at  the  prewar 
gold  parity  but  gave  up  in  1931.  Barry  Eichengreen 
observes  that  the  major  industrial  capitalist  states, 
responding  to  pressure  from  demobilized  soldiers, 
all  quickly  instituted  universal  male  suffrage  fol¬ 
lowing  World  War  I,  subsequently  making  it 
politically  very  difficult  to  reimpose  the  rigid  and 
harsh  automatic-adjustment  procedures  built  into 
the  gold  standard.  In  a  contrasting  explanation  for 
the  failure  to  establish  a  durable  monetary  regime, 
Charles  Kindleberger  famously  contended  that  the 
crucial  brake  on  interwar  monetary  stability  was 
that  the  only  state  with  sufficient  economic  and 
political  resources  to  lead,  the  United  States,  was 
insufficiently  committed  to  doing  so.  There  was  no 
solution  until  after  World  War  II. 


The  key  outlines  of  the  postwar  multilateral 
economic  institutions  negotiated  by  the  soon-to-be 
victors  reflected  the  generally  liberal  economic 
preferences  of  the  United  States,  which  emerged 
from  the  war  even  more  relatively  powerful  than 
previously.  The  Bretton  Woods  regime,  named 
after  the  New  England  resort  where  the  conferees 
met  in  late  1944,  had  the  goal  of  free  currency 
convertibility  on  the  current  account  of  the  bal¬ 
ance  of  payments,  so  that  the  inability  of  would-be 
importers  to  get  a  license  to  purchase  foreign 
exchange  would  not  serve  as  an  undeclared  trade 
barrier.  However,  the  Bretton  Woods  negotiators 
saw  liberal  convertibility  for  the  capital  account — 
corresponding  to  cross-border  investment  flows 
lacking  a  direct  trade  counterpart — as  dangerous 
and  destabilizing.  Another  key  provision  was  the 
historically  unprecedented  creation  of  two  multi¬ 
lateral  banks  offering  loans  to  sovereign  govern¬ 
ments:  the  International  Monetary  Fund  (IMF), 
providing  foreign  exchange  to  countries  whose 
currency  was  under  attack,  and  the  World  Bank, 
to  extend  longer-term  credits  for  war  reconstruc¬ 
tion  and  infrastructure  development.  The  final  pil¬ 
lar  of  the  postwar  monetary  regime  was  a  mostly 
fixed  exchange  rate  among  major  currencies.  This 
was  the  adjustable  peg,  to  be  moved  only  through 
formal  application  to  the  IMF,  which  was  hardly 
ever  done  in  practice.  Only  the  U.S.  dollar,  the 
system’s  linchpin,  was  convertible  into  gold.  The 
United  States’  strong  postwar  economy  meant  that 
its  business  community  initially  accepted  low 
import  tariffs,  even  when  trading  partners  did  not 
reciprocate.  But  America’s  postwar  trade  surplus 
eventually  disappeared.  In  1971,  and  without  con¬ 
sultation  with  the  United  States’  European  allies, 
U.S.  President  Richard  Nixon  ended  convertibility 
and  imposed  a  10%  across-the-board  import  sur¬ 
charge,  effectively  devaluing  the  dollar  by  an 
equivalent  amount.  By  the  mid-1970s,  all  the 
major  industrial  democracies  had  responded  by 
floating  their  currencies. 

The  post-Bretton  Woods  regime,  or  financial 
globalization  era,  extends  from  the  mid-1970s  to 
the  present.  Its  dominant  themes  have  been  removal 
of  capital  controls  and  dismantling  of  a  wide  vari¬ 
ety  of  financial  regulatory  barriers,  such  as  U.S. 
legislation  prohibiting  commercial  banks  from 
operating  in  securities  markets.  Deregulation  led 
to  heightened  integration  of  previously  segmented 
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national  financial  markets  and  a  dramatic  expan¬ 
sion  of  the  profits  and  size  of  the  financial  sector 
in  the  advanced  economies.  The  period  also  has 
coincided  with  an  increase  in  financial  crises,  often 
both  balance-of-payments  (exchange  rate)  crises 
and  domestic  banking  crashes.  Crises  occurred 
most  often  in  developing  and  transitional  coun¬ 
tries,  with  the  notable  exception  of  1991-1992’s 
large  crisis  in  Western  Europe’s  Exchange  Rate 
Mechanism.  Members  of  the  European  Union 
(EU)  then  sought  to  protect  themselves  from  finan¬ 
cial  uncertainty  by  the  dramatic  step  of  adopting  a 
common  currency.  Their  Economic  and  Monetary 
Union  (EMU)  entered  into  full  effect  in  January 
2002,  although  without  the  participation  of  the 
United  Kingdom  and  Sweden. 

Until  2008,  most  expected  the  United  States,  the 
center  of  world  financial  innovation  and  possessor 
of  the  dominant  global  reserve  currency,  to  remain 
immune  from  serious  financial  crisis.  Yet  the  U.S. 
and  worldwide  financial  crisis  of  2008-2009  was 
the  worst  since  the  1930s,  leading  the  global  econ¬ 
omy  to  shrink  by  almost  3%  in  2009.  The  neolib¬ 
eral  model  of  ever-freer  global  finance  has  been  at 
least  partially  discredited.  Robert  Wade  is  hardly 
the  only  observer  to  ask  whether  a  new  era  of 
international  monetary  relations  is  in  the  process 
of  being  born. 

Money  and  State  Power 

As  recently  as  the  late  19th  century  it  was  not 
uncommon  for  multiple  currencies,  including  both 
coins  (specie)  and  paper  obligations  of  public  and 
private  entities,  to  circulate  freely  in  a  national  ter¬ 
ritory.  Yet  from  the  early  20th  century  onward, 
possession  of  a  unique  national  currency  operating 
as  sole  legal  tender  within  the  geographical  extent 
of  the  territory  came  to  be  seen  as  an  indispensable 
component  of  sovereignty.  Because  fiat  money 
depends  on  the  public’s  trust  of  the  issuer,  cur¬ 
rency  strength  is  closely  linked  to  state  legitimacy 
and  strength.  Niall  Ferguson  claims  that  it  was 
early-modern  England’s  superior  ability  to  con¬ 
struct  effective  public  debt  markets,  encouraging 
voluntary  loans  to  the  state  from  wealthy  private 
citizens,  that  allowed  England  to  surpass  France  in 
military  and  political  power.  Rodney  Bruce  Hall 
investigates  how  national,  and  ultimately  global, 
monetary  credibility  is  socially  constructed  in  the 


contemporary  world,  locating  central  banks  at  the 
core  of  this  process.  Leonard  Seabrooke  empha¬ 
sizes  the  critical  role  of  mass  publics  in  the  wealthy 
democracies  in  sustaining  belief  in  national  money 
and  finance.  Key  to  this  has  been  what  John 
Gerard  Ruggie  baptized  the  compromise  of 
“embedded  liberalism”  or  support  by  the  advanced 
capitalist  democracies  for  comparatively  open, 
liberal  international  economic  relations  coupled 
with  buffering  of  their  domestic  populations  from 
economic  downturns  via  extensive  welfare  net¬ 
works  and  capital  controls. 

National  monetary  strength  in  turn  becomes  a 
source  of  international  influence,  allowing  states 
to  employ  financial  levers  to  achieve  both  pre¬ 
ferred  global  financial  governance  outcomes  and 
unrelated  foreign  policy  goals.  David  M.  Andrews 
points  to  the  1956  Suez  Canal  crisis,  in  which  the 
United  States  informed  Britain  that  it  would  not 
lend  its  support  to  the  sterling,  then  under  market 
pressure,  until  the  British  and  French  changed  their 
policy  and  agreed  to  withdraw  from  Egypt. 
Contributors  brought  together  by  Eric  Helleiner 
and  Jonathan  Kirshner  worry  about  the  problem 
of  global  imbalances — specifically  the  United 
States’  persistent  and  ever-growing  current  account 
deficit  mirrored  by  the  similarly  “structural”  sur¬ 
pluses  of  China  and  others — and  what  the  gradual 
redistribution  of  monetary  capabilities  toward 
large  emerging  powers  such  as  China  might  mean 
for  the  dollar-centric  global  economic  governance 
system  still  managed  by,  and  some  would  claim 
for,  the  major  advanced  industrial  economies.  In 
mid-2009,  all  the  so-called  BRIC  countries  (Brazil, 
Russia,  India,  and  China)  were  among  the  10  larg¬ 
est  holders  of  official  foreign  exchange  reserves. 
There  were  only  two  advanced  industrial  democra¬ 
cies  on  the  list:  Japan,  number  two  after  China, 
and  Germany,  number  nine. 

The  Amorphous  Global 
Financial  Architecture 

Throughout  the  Bretton  Woods  and  financial  glo¬ 
balization  eras  the  scope,  goals,  and  membership 
of  the  international  governance  regimes  for  inter¬ 
national  money  and  finance  have  been,  in  Jacque¬ 
line  Best’s  felicitous  phrase,  notably  “ambiguous.” 
One  possibility  raised  by  Benjamin  J.  Cohen  is  that 
today’s  sophisticated  monetary  relations  are 
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qualitatively  distinct  from  those  in  most  other 
international  arenas:  It  is  the  market  itself  that 
increasingly  “governs,”  limiting  the  actions  of 
states.  On  the  other  hand,  the  monetary  regime 
does  possess  formal  organizations.  The  IMF  and 
World  Bank  are  official  membership  institutions 
with  wide  authority  over  their  borrowers,  although 
since  the  early  1980s,  their  borrowers  have  mostly 
been  developing  countries  and  post-centrally 
planned  economies.  In  the  years  following  the 
mid-1970s  breakdown  of  fixed  exchange  rates 
among  the  major  economies  of  the  day,  it  has 
been  the  Group  of  Seven  (G7,  initially  the  United 
States,  the  United  Kingdom,  Germany,  France, 
and  Japan,  later  joined  by  Italy  and  Canada)  that 
has  exercised  real  international  monetary  gover¬ 
nance  authority,  managing  sometimes  acrimonious 
exchange  rate  negotiations  among  the  dominant 
capitalist  economies,  coping  with  international 
financial  crises,  and  promoting  (or  vetoing)  multi¬ 
lateral  and  transnational  innovations  in  collabora¬ 
tive  financial  and  monetary  regulation,  irrespec¬ 
tive  of  the  formal  venues  for  regulatory  negotia¬ 
tions.  As  Andrew  Baker  explains,  the  G7  is  a 
“process,”  at  present  consisting  of  quarterly  meet¬ 
ings  of  finance  ministers  and  central  bank  presi¬ 
dents  along  with  annual  heads-of-state  summits, 
rather  than  a  formal  organization  with  a  head¬ 
quarters  building  and  permanent  staff.  Membership 
in  this  exclusive  club  is  on  the  basis  of  power,  in 
the  dual  senses  of  latent  capabilities  and  realized 
influence,  although  whether  this  is  overall  global 
power  or  monetary  regime-specific  power  is  not 
always  clear.  Overall  power  is  a  necessary  but  not 
sufficient  membership  qualification,  assisted  by 
international  assertiveness,  yet  underlying  friendly 
relations,  with  the  leading  economies.  Thus,  Italy 
successfully  demanded  inclusion  for  itself  in  the 
mid-1980s  by  threatening  to  close  American  air¬ 
bases.  In  the  early  1990s,  the  newly  independent 
Russia  was  invited  to  the  heads-of-state  summit 
process  (the  G8)  but  not  for  the  technical  mone¬ 
tary  consultations,  a  move  widely  understood  as  a 
concession  to  Russia’s  global  importance  and 
nuclear  weapons  status. 

Even  the  G7  has  been  to  some  degree  a  fiction: 
In  both  the  Latin  American  peso/“ tequila”  crisis  of 
1994-1995  and  the  East  Asian  financial  crisis  of 
1997-1998,  U.S.  officials  in  the  Treasury,  White 
House,  and  Federal  Reserve  brushed  aside  the 


preferences  of  their  Western  European  and 
Japanese  G7  partners.  Moreover  monetary  gover¬ 
nance  in  the  sense  of  regulatory  innovation  and 
multilateral  or  collaborative  supervision  of  cross- 
border  financial  flows  takes  place  even  further 
below  the  radar  of  public  scrutiny,  in  technical 
committees  associated  with  the  Bank  of 
International  Settlements — an  invitation-only 
membership  association  of  the  central  banks 
of  systemically  important  countries  and  other 
countries  they  choose  to  include — and  in  a  host  of 
public-private  transnational  bodies  associated 
with  various  branches  of  the  financial  industry, 
such  as  the  International  Organization  of  Securities 
Commissions  and  the  International  Accounting 
Standards  Board.  By  comparison,  the  global  trade 
regime,  which  is  often  accused  of  being  biased 
against  developing  countries  in  its  operation, 
nonetheless  is  headed  by  a  formal  organization, 
the  World  Trade  Organization  (WTO),  whose 
members  include  a  majority  of  countries.  The 
WTO  has  democratic  voting  procedures,  formal 
rules  for  trade  agreements,  a  complaint  and  adju¬ 
dication  process,  and  ongoing  negotiations  over 
agreed-on  agendas.  The  de  facto  global  monetary 
governance  regime  lacks  all  these  qualities. 

Monetary  Relations  of 

the  Status  Quo  Powers 

It  may  be  that  the  combination  of  ambiguity  and 
centralization  in  monetary  relations  has  served  the 
world  well.  There  has  been  steady  expansion  in 
the  world  economy  since  the  Bretton  Woods  sys¬ 
tem  came  into  being,  although  the  rate  of  world 
growth  has  been  notably  slower  since  the  break¬ 
down  of  its  fixed  exchange  rate  component.  Prior 
to  2008-2009,  there  had  been  no  truly  systemic 
international  financial  crisis  since  the  1930s.  The 
consensus  of  policymakers  in  the  G7  countries  as 
of  late  2009  was  that  collaborative  crisis  manage¬ 
ment  was  working,  and  world  growth  would 
recover  in  2010.  A  repeat  of  the  Great  Depression 
had  been  averted,  at  least  in  the  core  capitalist 
economies.  As  international  monetary  conditions 
apparently  returned  to  normal,  the  attention  of 
both  policymakers  and  scholars  in  advanced 
industrial  countries  refocused  on  their  traditional 
concerns,  including  exchange  rate  negotiations, 
the  degree  of  autonomy  from  political  oversight  to 
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be  accorded  to  central  banks,  financial  regulation 
and  supervision,  and  the  desirability  and  feasibil¬ 
ity  of  multilateral  macroeconomic  coordination. 

Many  of  the  theoretical  contributions  of  politi¬ 
cal  scientists  are  related  to  these  topics.  Eric 
Helleiner  and  Benjamin  J.  Cohen,  respectively, 
have  clarified  the  political  economy  of  capital 
account  liberalization  (CAL)  and  that  of  the 
“unholy  trinity”  of  CAL:  exchange  rate  stability 
under  a  floating  rate  regime  and  autonomous 
domestic  monetary  policy.  Jeffry  A.  Lrieden  has 
been  at  the  center  of  a  group  in  pursuit  of  a  parsi¬ 
monious  theory  of  domestic  exchange  rate  prefer¬ 
ences,  employing,  on  the  one  hand,  interest-group 
categories  such  as  exporters  versus  importers, 
producers  of  tradables  in  contrast  to  producers  of 
nontradables,  liquid-  or  fixed-asset  holders,  and 
sectors  with  or  without  foreign  debt,  and  on  the 
other  hand,  political  institution  variables  such  as 
majoritarian  versus  plurality  systems  and  measures 
of  central  bank  independence.  Explaining  the  deci¬ 
sion  of  a  majority  of  Western  European  states  to 
yield  up  their  national  currencies  by  joining  the 
EMU  has  been  a  dominant  task  for  political  scien¬ 
tists  investigating  exchange  rate  politics. 

Other  scholars  analyze  the  negotiating  strate¬ 
gies  employed  by  major  states  and  their  use  of 
international  monetary  power  to  achieve  state 
goals.  In  general,  countries  prefer  to  have  their 
trading  partners  make  the  necessary  adjustments 
rather  than  having  to  intervene  to  move  the  level 
of  their  home  currency.  Thus,  during  the  G7  nego¬ 
tiations  in  the  mid-1980s  known  as  the  Plaza  and 
Louvre  Accords,  all  parties  agreed  that  the  U.S. 
dollar  was  objectively  overvalued.  But  the 
Europeans  wanted  the  United  States  to  find  a  way 
to  rein  in  government  budget  and  trade  deficits, 
while  the  Americans  argued  that,  since  their 
reserve  currency  position  made  it  impractical  for 
them  to  encourage  the  markets  to  let  the  dollar 
depreciate,  it  was  up  to  the  European  and  Japanese 
to  push  their  currencies  up.  The  United  States  and 
China  had  similar  disagreements  in  2008  to  2009. 

Additional  Concerns  of  Emerging 
Powers  and  Peripheral  States 

The  world  of  the  very  early  21st  century  remains 
unipolar,  with  only  a  single  superpower.  But  the 
trend  is  toward  the  emergence  of  new  powers, 


particularly  China.  Opinion  on  Europe  splits 
sharply  between  observers  who  identify  countries 
such  as  Britain,  France,  and  even  Germany  as 
states  of  declining  global  significance  and  those 
who  perceive  the  inevitability  of  the  United  States 
of  Europe  and  thus  anticipate  future  bipolarity  or 
tripolarity  with  the  United  States  and  China. 
Japan’s  future  status  is  similarly  debated.  What  is 
clear  is  that  since  sometime  in  the  1990s,  several 
emerging  powers  plausibly  are  approaching  mem¬ 
bers  of  the  G7  in  global  prominence,  particularly  if 
one  employs  economic  size  as  the  single  most  criti¬ 
cal  component  of  relative  capabilities.  New  countries 
push  to  be  included  in  global  monetary  delibera¬ 
tions.  The  U.S.  President  George  W.  Bush  in  the 
final  months  of  his  administration  in  late  2008 
convened  the  first  heads-of-state  summit  of  the 
countries  of  the  Group  of  Twenty  (financial  G20, 
not  to  be  confused  with  the  trade  G20  of  develop¬ 
ing  countries),  until  then  a  peripheral  albeit  multi¬ 
lateral  process  initially  organized  by  the  United 
States  in  the  wake  of  the  East  Asian  financial  crisis. 
Besides  the  G7,  G20  members  included  Australia 
and  a  geographically  representative  selection  of 
larger  emerging  economies  friendly  to  the  United 
States.  Meanwhile  the  BRICs  countries,  formally 
organized  by  Russia,  had  begun  meeting  at  the 
heads-of-state  level  in  early  2008,  cognizant  that 
their  main  common  interest  lay  in  joint  pressure 
for  inclusion  in  the  more  exclusive  clubs  for  global 
governance  but  pragmatically  seeking  to  develop 
other  collective  negotiating  aims.  From  the  initial 
summit  of  the  financial  G20  (or  “larger  economies 
group”)  in  November  2008  onward,  the  G20 
swiftly  moved  to  supplant  the  G7.  Meanwhile  the 
four  BRICs  countries  have  made  increasingly  bold, 
if  still  guarded,  statements  about  the  need  to 
replace  the  dollar  with  a  truly  global  currency, 
perhaps  based  on  the  IMF’s  Special  Drawing 
Rights. 

Global  monetary  and  financial  reform  propos¬ 
als  important  to  developing  countries,  yet  long 
brushed  aside  by  G7  policymakers  and  academics, 
finally  have  entered  the  elite  economic  discourse. 
These  include  topics  such  as  recognition  that  coun¬ 
tries  with  prudent  macroeconomic  and  financial 
regulatory  policies  may  be  victims  of  financial  con¬ 
tagion,  that  there  may  be  a  legitimate  role  for  the 
state  in  overcoming  both  national  and  global  mar¬ 
ket  failures  in  financial  markets,  that  “host”  rather 
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than  “home”  state  regulation  of  multinational 
banks  may  better  protect  investor  and  borrower 
rights,  and  that  multinational  insurance  and  stabi¬ 
lization  funds  for  commodity  producers  might  be 
economically  sound  concepts.  More  generally, 
entirely  free  markets,  particularly  in  money  and 
securitized  assets,  may  be  neither  self-equilibrating 
nor  socially  desirable.  In  the  terminology  of  inter¬ 
national  relations,  some  states  in  the  developing 
world,  particularly  those  possessed  of  objective 
power  resources  or  influence  relevant  to  the  global 
monetary  governance  regime,  have  begun  to 
demand  the  extension  of  the  protections  of  embed¬ 
ded  liberalism  to  their  citizens.  Many  of  these 
themes  were  echoed  in  a  2009  international  com¬ 
mission  of  experts  put  together  at  the  behest  of  the 
president  of  the  United  Nations  General  Assembly 
(UNGA)  and  headed  by  Joseph  Stiglitz,  former 
chief  economist  of  the  World  Bank  and  fierce  critic 
of  the  IMF.  Interestingly,  the  UNGA  documents 
criticize  the  financial  G20  as  insufficiently  demo¬ 
cratic  and  representative  in  its  composition. 

Contending  Political  Perspectives 
on  Global  Money 

The  conceptual  lenses  through  which  political  sci¬ 
entists  analyze  international  monetary  relations 
are  extraordinarily  heterogeneous.  In  one  intellec¬ 
tual  corner,  associated  with  several  of  the  most 
prestigious  U.S. -based  journals,  are  scholars  who 
define  their  task  as  the  application  of  the  dominant 
methodology  of  contemporary  neoclassical  eco¬ 
nomics — sophisticated  statistics — to  explore,  for 
example,  the  relationship  between  hourly  electoral 
results  in  a  wealthy  democracy  and  the  fluctuating 
value  of  benchmark  global  bonds.  Elsewhere,  stu¬ 
dents  of  philosophy,  social  trust,  or  of  the  psychol¬ 
ogy  of  panics  are  fascinated  with  the  mystique  of 
money,  in  its  modern  and  nearly  entirely  electronic 
form — the  epitome  of  accelerating  globalization. 
Practitioners  of  traditional  diplomatic  history  are 
finding  monetary  negotiations  to  be  increasingly 
central  to  interstate  security  relations.  Last  but 
hardly  least  are  Marx’s  contemporary  heirs:  for 
example,  authors  in  the  volume  edited  by  Leo 
Panitch  and  Martijn  Konings  make  a  compelling 
case  that  the  current  international  financial  archi¬ 
tecture  directly  reflects  the  interests  of  the  United 
States’  private  financial  sector.  One  almost  might 


conclude  that  money  is  as  ubiquitous  as  power  in 
international  relations — or  that  the  two  are  the  same. 
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Monitoring 


Monitoring  can  be  defined  as  a  systematic  and 
continuous  surveillance  of  a  series  of  events. 
Monitoring  is  practiced  to  secure  that  the  activities 
inside  an  organization,  or  the  outputs  of  an  orga¬ 
nization,  are  according  to  established  goals. 
Monitoring  thus  refers  to  the  control  of  organiza¬ 
tions.  In  politics,  monitoring  takes  place  both 
between  politicians  (versus  bureaucrats)  and  within 
the  public  administration;  in  the  latter  form,  it 
deals  with  organizational  control.  Monitoring  and 
evaluation  are  closely  linked;  both  focus  on  exam¬ 
ining  the  procedures  and  processes  involved  and 
gathering  information  about  the  level  of  perfor¬ 
mance.  In  government  practice,  the  two  concepts 
overlap,  as  when,  for  example,  the  follow-up  of  a 
government  program  can  be  called  either  monitor¬ 
ing  or  evaluation  without  a  major  difference.  It  is 
also  possible  to  see  monitoring  as  a  basic  form  of 
evaluation  when  we  add  some  “how  and  why” 
questions,  such  as  why  a  given  program  is  lagging 
behind  its  goals  or  how  certain  societal  results  can 
be  explained  by  certain  policy  measures. 

Some  evaluation  scholars  do  not  make  a  dis¬ 
tinction  between  monitoring  and  evaluation. 
Others  argue  that  monitoring  does  not  include 
judging  the  worth  and  value  of  performance,  and 
hence,  evaluation  consists  of  more  than  monitor¬ 
ing  or  accounting.  In  addition,  the  focus  on  theory 
separates  evaluation  research  from  other  forms  of 
evaluation,  such  as  monitoring.  A  central  concept 
related  to  monitoring  is  accountability.  Politico- 
administrative  systems  are  built  on  a  hierarchical 
structure,  which  gives  the  citizens  the  right  to 
control  the  parliament;  the  parliament  the  right  to 
control  the  government;  the  government,  the  pub¬ 
lic  administration;  and  the  upper  levels  of  bureau¬ 
cracy,  the  lower  levels.  Much  has  been  written  on 


the  possibilities  and  limitations  of  parliaments  and 
cabinet  members  in  controlling  bureaucracy. 
Although  bureaucrats  may  be  subject  to  political 
surveillance,  they  know  how  to  use  their  often 
long  (at  least  in  comparative  terms)  experience, 
expertise,  and  isolation  from  political  battles  to 
bypass  and  make  strategic  use  of  any  monitoring 
efforts  directed  toward  them.  Whether  monitoring 
reaches  the  ultimate  points  of  delivery,  where 
street-level  bureaucrats  deal  with  various  sorts  of 
clients,  is  an  important  issue  as  well,  not  least 
from  the  viewpoint  of  democracy.  Finally,  moni¬ 
toring  may  be  used  when  governments  finance 
various  external  projects,  as,  for  example,  with 
development  aid.  Monitoring  can  even  refer  to  the 
control  of  governmental  outlays  for  typically  out¬ 
sourced  projects. 

The  theory  and  methodology  of  monitoring  are 
closely  linked  to  organizational  science.  Organ¬ 
izations  are  collective  efforts,  and  large  organizations 
always  need  to  control  the  unity  of  the  organiza¬ 
tion.  The  theoretical  issues  around  the  concept 
thus  deal  with  questions  such  as  what  kind  of 
monitoring  tools  work  in  what  kind  of  organiza¬ 
tions,  what  motives  underlie  the  use  of  monitor¬ 
ing,  and  to  what  extent  governments  are  using 
similar  or  different  kinds  of  monitoring  tools  and 
why.  As  noted  earlier,  monitoring  is  tied  to  char¬ 
acteristics  of  the  organization  in  question.  A  small 
organization,  a  club,  or  an  association  can  usually 
maintain  order  or  homogeneity  simply  by  exchang¬ 
ing  communication  through  face-to-face  interac¬ 
tion.  Larger  and  more  complex  organizations, 
such  as  major  hospitals,  political  parties,  religious 
bodies  such  as  the  Catholic  Church,  or  national 
bureaucracies,  have  a  much  greater  need  for  ensur¬ 
ing  that  what  the  members  of  the  organization  do 
corresponds  with  the  overall  will  of  the  leaders  of 
the  organization,  be  they  outside  or  inside  the 
organization. 

Alternative  Organizational  Models 

The  way  the  researchers  conceptualize  monitoring 
depends  on  the  specific  organization  model,  or 
metaphor,  that  they  are  using.  Traditionally,  in 
what  we  could  call  the  old  institutionalism,  the 
politico-administrative  system  is  seen  as  constituted 
of  rules  and  organizational  arrangements  that 
aim  to  cement  the  tasks  and  responsibilities — the 
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division  of  labor — within  the  public  administra¬ 
tion.  The  pioneering  sociologist  Max  Weber 
depicted  bureaucracies  as  rational  systems  com¬ 
pared  with  the  earlier  historical  forms  of  state.  The 
prominent  features  of  this  system  were  specifica¬ 
tion  of  jobs  with  detailed  rights,  obligations, 
responsibilities,  scope  of  authority,  and  a  system  of 
supervision  and  subordination.  Monitoring  in  this 
context  served  to  ensure  that  the  rules  were  fol¬ 
lowed  and  malpractices  would  be  discovered  and 
stopped.  Hierarchic  and  bureaucratic  modes  imply 
the  use  of  coercion  or  threat  to  ensure  compliance 
with  authoritative  rules.  In  some  cases,  this  moni¬ 
toring  by  rules  and  regulations  may  lead  to  an 
overemphasis  on  caution  and  slow,  multifaceted 
processing  of  issues.  Today,  we  also  think  of 
bureaucracies  as  inefficient,  slow,  and  generally 
bad.  Thus,  the  concept  of  bureaucracy  has  through 
the  years  been  interpreted  in  a  variety  of  ways. 

Later  on,  however,  the  Weberian  model  of  pub¬ 
lic  administration  was  modified,  if  not  replaced,  by 
a  number  of  other  models. 

First,  a  theory  contrary  to  the  Weberian  ratio¬ 
nal  model  is  that  bureaucratic  officials,  like  all 
other  agents  in  society,  are  significantly,  though 
not  solely,  motivated  by  self-interest.  This  theory 
follows  the  tradition  of  economic  thought  from 
Adam  Smith  onward.  One  influential  model  in 
this  area  is  the  principal-agent  model.  The  model 
treats  the  difficulties  that  arise  under  conditions  of 
incomplete  and  asymmetric  information  when  a 
principal  (e.g.,  parliament)  hires  an  agent  (govern¬ 
ment  department).  In  this  situation,  the  two  may 
not  have  the  same  interests,  even  though  the  prin¬ 
cipal  is  presumably  hiring  the  agent  to  its  interests. 
While  perhaps  not  fitting  the  public  administra¬ 
tion  per  se,  the  model  seems  to  catch  the  prob¬ 
lems  embedded  in  contractual  relationships,  in 
which  the  public  authority  aims  at  defining  in  the 
form  of  a  contract  the  liabilities  and  sanctions 
dealing  with  a  private  agent.  The  question  is 
whether  the  agents,  say  individual  bureaucrats, 
strive  for  selfish  benefits  or  are  servants  of  the 
public.  Solving  this  question  can  be  difficult 
because  in  reality,  bureaucrats  whose  aim  is  to 
expand  their  organizations  may  serve  both  their 
own  and  the  public  interest.  In  any  case,  adding 
the  possibility  of  selfish,  individual  interests  to  the 
picture  increases  and  somewhat  complicates  the 
monitoring  task. 


Second,  a  (neo-)institutional  approach  would 
place  the  actors  in  context  and  predict,  for  exam¬ 
ple,  that  doctors  have  a  particular,  professional 
way  of  legitimizing  their  activities,  and  an  outsider 
(the  principal)  demanding  scrutiny  may  be  in  a 
frustrating  or  weak  position.  In  addition,  organi¬ 
zational  change,  such  as  adjusting  to  new  forms  of 
performance  monitoring,  may  not  depend  on  the 
deliberate  decisions  made  by  the  organization’s 
leaders  but  on  the  fit  or  misfit  between  the  reform 
and  the  norms  of  the  organization.  It  is  well 
known  that  the  various  reforms  originating  from 
the  new  public  management  paradigm  have  been 
implemented  in  ways  reflecting  the  specific  histori¬ 
cal,  societal,  and  political  culture  of  a  country. 

Performance  Management 

Monitoring  the  Weberian  rule  of  law  emphasizes 
the  avoidance  of  mistakes  but  does  not  emphasize 
or  require  results.  Poor  performance  was  not  really 
a  question  in  the  fiscal  accounting  system.  Now, 
however,  monitoring  has  widened  to  include  a  pro¬ 
gram  element,  and  likewise,  researchers  have 
shifted  their  focus  to  questions  of  performance 
management.  A  central  issue  in  research  concerns 
the  possibilities  and  limitations  of  measuring  per¬ 
formance  within  public  administration.  One  debate 
focuses  on  the  nature  of  public  policies.  It  has  been 
determined  that  performance  targets  are  often 
quantitative,  and  this  causes  bureaucrats  to  neglect 
the  qualitative,  nonmeasured  aspects  of  their  per¬ 
formance.  Public  policies,  further,  are  often  services 
or  societal  interventions,  which  make  it  difficult 
to  point  out  exactly  what  the  influence  of  the  out¬ 
put  is.  On  top  of  that,  so-called  performance  indi¬ 
cators  often  do  not  measure  outcomes  but  out¬ 
puts — as,  for  example,  where  the  police  aim  at 
making  as  many  arrests  as  possible,  while  the  fun¬ 
damental  societal  goal  is  to  diminish  criminality.  In 
addition,  performance  indicators  have  been  criti¬ 
cized  because  of  their  rigidity.  The  use  of  perfor¬ 
mance  indicators  can  have  dysfunctional  effects  on 
an  organization’s  performance  if  it  does  not  react 
to  changes  or  surprising  events.  Finally,  monitoring 
performance  can  be  a  complicated  task.  More  and 
more  public  policies  are  produced  not  by  single 
organizations  but  by  several  ones  in  concert.  These 
policy  networks  or  partnerships  often  follow  their 
own  situation-bound,  autonomous  norms  but  at 
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the  same  time  form  a  part  of  the  public  authority. 
The  question  of  accountability  becomes  more  dif¬ 
ficult  the  more  actors  are  involved  and  can  be 
more  accurately  called  multiple  accountability.  In 
recent  years,  there  has  been  a  lively  debate  on  gov¬ 
ernance — for  example,  concerning  a  new  form,  or 
account,  of  public  policy  making,  which  empha¬ 
sizes  the  interaction  between  private  actors  and 
public  authorities  on  various  levels.  The  implica¬ 
tions  for  monitoring  are  obvious:  Instead  of  coer¬ 
cion  and  authority,  concepts  such  as  trust,  custom, 
and  solidarity  play  an  important  role. 

At  a  deeper  level,  performance  monitoring 
reflects  questions  of  ethics  and  trust.  In  the  Weberian 
system,  it  was  widely  accepted  that  malpractices 
need  to  be  identified  and  omitted.  Performance 
monitoring  raises  the  question  of  why  organiza¬ 
tional  rules,  information  from  different  sources, 
and  professional  education  are  not  considered  suf¬ 
ficient  to  guarantee  a  well-functioning  public 
administration.  It  suggests  that  the  organizational 
model  in  use  assumes  that  bureaucrats  are  selfish 
and  lazy.  Devoted  professionals  prepared  to  further 
develop  their  policy  areas  may  become  disillusioned 
because  of  the  straightforward  simplification  of 
progress  into  numeric  indicators.  The  central  ques¬ 
tion  is  how  organizations  function  and  to  what 
extent  their  coherence  is  based  on  trust  versus  con¬ 
trol.  As  mentioned  earlier,  excessive  monitoring  in 
the  form  of  frequently  demanded  reports,  control 
of  communication,  time  limits  for  tasks,  and  so 
forth  can  also  demand  an  excessive  workload. 
Thus,  one  research  task  in  this  area  is  to  find  and 
argue  for  a  healthy  balance  between  trust  and  con¬ 
trol,  whether  based  on  the  organization’s  size  and 
structure  or  the  type  of  organization. 

Monitoring  and  Public  Policy 

Loosening  top-down  control,  a  strict  way  of  moni¬ 
toring,  may  raise  questions  of  democratic  account¬ 
ability.  Democracy  is  often  seen  from  a  narrow 
perspective  as  properly  expressed  only  through 
aggregative  representation.  Flexible,  interactive 
forms  of  public  policy  making  can  be  justified  with 
the  help  of  democracy  theory  too.  At  the  same  time, 
the  transformation  of  public  policy  making  may 
radically  change  the  overall  opportunities  of  moni¬ 
toring  by  upper  levels  of  government.  The  move 
away  from  this  traditional  linear  interpretation  of 


the  policy  cycle  and  toward  a  more  nuanced  con¬ 
ceptualization  of  the  policy  process  reflects  a  gen¬ 
eral  recent  trend  toward  “post-positivist”  modes 
of  analysis  in  policy  science  as  a  whole.  Research 
on  monitoring  cannot  avoid  the  changes  caused  by 
the  transformation  of  monitoring  itself. 
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Monte  Carlo  Methods 


Monte  Carlo  methods  describe  a  set  of  computer 
simulation  techniques  that  rely  on  random  num¬ 
ber  generation  to  solve  complex  optimization  and 
integration  problems  arising  in  statistics  and  its 
related  fields.  The  term  Monte  Carlo  is  a  reference 
to  the  probabilistic  rules  underlying  casino  games 
of  chance,  and  though  the  theoretical  underpin¬ 
nings  for  their  use  emerged  in  the  postwar  period, 
it  is  the  advent  of  inexpensive,  modern  computing 
that  enabled  their  widespread  adoption  for  practi¬ 
cal  problem  solving. 

Typically  presented  as  an  alternative  to  time- 
consuming  analytical  efforts,  Monte  Carlo  methods 
allow  researchers  more  freedom  to  posit  models 
and  make  more  subtle  inferences  from  them — that 
is,  they  no  longer  need  to  rely  on  standard,  but 
unrealistic,  assumptions  about  complex  (causal) 
processes  that  ensure  a  tractable  solution  exists. 
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This  has  proved  particularly  important  in  Bayesian 
analysis,  where  quantities  of  interest  are  available 
in  principal  by  complex  integration  operations,  yet 
can  be  (well)  approximated  via  simulation. 
Precisely  because  such  techniques  have  proved 
popular  in  political  science,  this  integration  aspect 
of  Monte  Carlo  methods  is  the  focus  here.  This 
entry  begins  with  a  discussion  of  the  basic  integra¬ 
tion  operation;  then,  it  moves  to  the  importance  of 
random  number  generation  and  methods  of  sam¬ 
pling  before  considering  Markov  chain  Monte 
Carlo  methods.  This  latter  set  of  techniques  has 
proved  extremely  popular  in  political  science 
over  the  past  10  years  or  so,  especially  for  apply¬ 
ing  item-response  models  to  obtain  “ideal  points” 
from  roll  call  data.  There  are  also  large  literatures 
dealing  with  optimization  via  Monte  Carlo  pro¬ 
cedures  and  integration  via  (nonsimulation) 
deterministic  numerical  approaches;  these  are  not 
discussed  here. 

Classical  Monte  Carlo  Integration 

To  keep  matters  simple,  consider  the  evaluation  of 
the  following  integral: 

I  =  Jxg(x]dx,  (1) 

where  g(x)  is  a  probability  density  function.  As 
presented,  the  integral  will  yield  the  expectation  of 
the  random  variable  X  denoted  E(X).  But  perform¬ 
ing  the  operation  may  be  difficult,  perhaps  because 
the  integral  has  no  closed  form.  A  helpful  alterna¬ 
tive  to  this  effort  is  to  generate  a  sample  of  size  m 
from  the  density  g(x)  and  then  to  compute  the 
empirical  average, 

|  m 

/=-!*„  (2) 

As  m  increases,  by  the  strong  law  of  large  num¬ 
bers,  I  converges  almost  surely  to  the  true  value  of 
I — and  thus,  the  true  value  of  E(X).  Denoted  the 
Monte  Carlo  method,  the  idea  is  more  general.  We 
can  replace  x  in  Equation  1  with  some  arbitrary 
function  of  x,  denoted  h(x).  This  might  be  the 
median  or  some  other  percentile.  And  we  can 
assess  definite  integrals  so  long  as  we  can  produce 
a  sample  of  x  values  between  the  specified  bounds. 


The  variance  of  7  is  decreasing  in  m.  When  var(m) 
is  finite,  the  central  limit  theorem  applies  as  the 
sample  size  increases,  and  we  can  place  bounds  on 
our  estimates.  Moreover,  asymptotically,  it  is  pos¬ 
sible  to  obtain  quantities  that  can  be  used  to  assess 
convergence  directly. 

This  simulated  integration  takes  on  an  impor¬ 
tant  role  in  Bayesian  statistics,  since  we  often  want 
to  perform  a  variant  on  Equation  1.  In  particular, 
we  are  concerned  with  characteristics — such  as  the 
mean  or  median — of  the  posterior  distribution 
7t(0ly),  where  0  is  an  unknown  parameter  and  y  the 
observations.  In  that  case,  we  need 

E(h(Q))  —  J  tt ( 0 1 x ) /? ( 0 ) , 

which  we  approximate  as  (l/tn)Y”‘=1h(Q).  For 
many  regular  cases,  this  simple  approach  works 
well,  though  more  efficient  alternatives  typically 
exist  that  do  not  require  being  able  to  directly 
simulate  samples  from  g(x). 

Random  Number  Generation  and  Sampling 

All  methods  of  simulation  necessitate  the  genera¬ 
tion  of  random  numbers;  in  the  case  of  simple 
Monte  Carlo  integration,  those  random  numbers 
were  from  some  probability  function  g(x).  For  a 
very  broad  range  of  functions  (the  g(x)  and  7t(0ly) 
above),  at  base,  the  task  requires  the  production  of 
uniform  random  variables  on  the  unit  interval.  We 
then  obtain  draws  from  g(x)  via  the  inverse  trans¬ 
form.  Though  there  are  some  cases  where  nonuni¬ 
form  random  number  generation  can  be  used,  the 
techniques  are  difficult  to  generalize  outside  of 
specific  distributions.  What  constitutes  an  optimal 
or  efficient  random  number  generator  is  debatable 
and  often  subjective. 

The  inverse  transform  may  not  be  available,  or 
may  not  work  well,  for  many  distributions.  An 
alternative  is  to  use  accept-reject  methods  (also 
known  as  rejection  methods  or  rejection  sampling) 
that  rely  on  a  fundamental  result  involving  the 
uniform  distribution.  Recall  that  we  wish  to  gener¬ 
ate  variates  from  g(x),  which  we  will  assume  has 
bounded  support.  Imagine  now  that  the  random 
variable  X  is  jointly  distributed  with  an  auxiliary 
variable  U  such  that  any  hypothetical  pair  of 
values  drawn  will  have  an  x  component  with  some 
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corresponding  g(x)  and  u  component  that  is 
strictly  less  than  the  g(x)  and  yet  nonzero.  To 
obtain  that  u  value,  we  simply  generate  from  the 
uniform  bounded  by  0  and  g(x).  Yet  in  doing  this, 
we  have,  in  fact,  generated  a  draw  from  g(x):  For 
a  given  value  of  x,  our  uniform  draw  n  must  be 
“under  the  curve”  described  by  g. 

Clearly,  we  need  an  x  value  to  evaluate  g(x),  but 
this  cannot  be  sampled  directly  from  g(x)  itself 
(since  this  is  assumed  unavailable  to  us).  In  prac¬ 
tice  then,  for  a  one-dimensional  density,  we  sample 
the  x  value  from  a  uniform  between  zero  and  the 
maximum  of  g(x)  and  the  u  value  from  a  uniform 
that  has  a  range  identical  to  (the  relevant  part  of) 
g(x).  Notice  that  these  two  intervals  draw  a  “box” 
around  the  density  g(x).  It  must  be  the  case  that 
some  of  these  pairs  are  points  in  two-dimensional 
space  that  fall  under  the  curve  g(x),  and  we  will 
“accept”  these.  By  contrast,  some  must  fall  else¬ 
where,  and  we  will  “reject”  those. 

The  accept-reject  approach  can  be  used  more 
generally  for  cases  where  neither  the  maximum 
nor  the  support  of  g  are  bounded,  so  long  as  simu¬ 
lating  uniform  variates  over  the  implied  larger 
space  is  possible.  The  key  is  to  find  another  density 
function  f(x)  such  that  for  every  possible  value  of 
x,  g(x)  £  Mf(x),  where  M  is  a  constant  greater 
than  or  equal  to  one.  Now,  we  begin  by  generating 
x-  values  from  our  “envelope”  density  f(x).  We 
then  take  a  single  draw  u  from  a  uniform  t/fO,  IJ. 
If  the  value  of  u  is  less  than  g(x;) / Mf(Xj) ,  the  x  -  is 
accepted  as  a  sample  from  g(x).  If  not,  we  reject 
the  candidate  x..  This  approach  has  an  important 
implication  for  Bayesian  analysis,  because  we  do 
not  need  to  know  the  normalizing  constant  for  g  to 
sample  from  it.  Yet  this  is  precisely  the  situation 
when  dealing  with  a  posterior  that  is  proportional 
to  the  prior  multiplied  by  the  likelihood: 
TT(0|x)|xp(0)g(x|0).  Thus,  we  can  draw  samples 
from  7t(0lx)  without  calculating  the  normalizing 
constant. 

For  some  problems,  accept-reject  sampling  can 
be  very  inefficient.  To  reduce  the  variance  in 
simulations,  an  alternative  is  to  choose  an  approx¬ 
imate  distribution  f  and  then  weight  the  resulting 
draws  according  to  the  probability  that  they  repre¬ 
sent  a  sampled  point  from  the  target  distribution 
g.  This  idea  is  known  as  importance  sampling. 
The  beauty  of  this  sampling  scheme  is  that  there 
is  very  little  restriction  on  the  choice  of  the 


instrumental  distribution  f,  and  so  it  can  be  cho¬ 
sen  such  that  it  is  easy  to  sample  from  (e.g.,  a 
multivariate  normal). 

Markov  Chain  Monte  Carlo 

As  described  above,  Monte  Carlo  methods  pro¬ 
duce  independent  draws  from  some  density  g(x). 
By  contrast,  Markov  chain  Monte  Carlo  methods 
produce  simulated  values  that  are  dependent  on 
previous  draws  from  the  distribution  in  question. 
Recall  that  a  Markov  chain  is  a  sequence  of  ran¬ 
dom  variables  with  the  Markov  property,  meaning 
that  the  present  state  01'1  depends  only  on  the 
immediately  preceding  one  0[i_11.  Note  that  the 
chain  is  defined  with  respect  to  a  state  space  ©  that 
constrains  the  possible  values  for  the  variables, 
and  its  moves  around  this  space  are  probabilisti¬ 
cally  controlled  by  the  chain’s  kernel.  So  long  as 
the  chain  possesses  certain  properties  (among  them 
ergodicity,  recurrence,  and  irreducibility),  it  will 
eventually  reach  a  stationary  distribution  tt  and 
stay  there.  That  is,  in  this  limiting  distribution, 
every  draw  is  from  it.  More  important,  analogous 
to  Equation  2,  the  average  converges  to  the  expec¬ 
tation  we  seek. 

Markov  chain  Monte  Carlo  methods  are  pre¬ 
ferred  to  simple  Monte  Carlo  methods,  including 
importance  sampling,  because  they  are  often  more 
efficient,  especially  for  complex  problems  that 
involve  very  high-dimensional  integrals.  With 
Markov  chain  Monte  Carlo  algorithms,  the 
researcher  can  spend  less  time  finding  the  “right” 
instrumental  distribution;  rather,  simulations  from 
g  can  be  generated  from  essentially  any  arbitrary 
distribution  f.  The  most  widespread  procedure  in 
use  is  the  Metropolis-Hastings  algorithm,  which 
relies  on  a  random  walk.  The  basics  are  straight¬ 
forward:  Suppose  we  are  interested  in  a  single 
parameter  0  for  which  the  posterior  is  thus  7t(0). 
After  some  starting  point,  at  each  step  t  in  the 
chain,  we  propose  a  candidate  value  of  0,  denoted 
0'.  In  practice,  0'  is  a  draw  from  a  proposal  (or 
jumping)  distribution  q  (0l0[f_1]),  where  0['“1]  is  the 
prevailing  value  of  0  from  the  previous  step  in  the 
chain. 

The  restrictions  on  the  proposal  distribution  are 
not  especially  onerous,  and  it  is  often  chosen  such 
that  it  is  easy  to  sample  from,  with  a  normal  distri¬ 
bution  centered  at  0['-11  being  a  common  choice. 
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We  accept  the  proposal  0'  as  the  value  for  0W  if  u, 
a  draw  from  U|0,  1J,  is  less  than  the  ratio  of  the 
posteriors  evaluated  at  the  different  proposals 
(assuming  q(-)  is  symmetric).  If  the  proposal  is  not 
accepted,  0W  is  simply  0['“1],  and  the  process  starts 
again. 

The  primary  concern  with  Markov  chain  Monte 
Carlo  techniques  is  that  it  is  not  obvious  how 
many  of  these  updating  steps  are  needed.  That  is, 
although  we  are  guaranteed  that  an  ergodic  chain 
will  eventually  reach  its  stationary  state — and  thus 
that  we  will  be  sampling  from  the  posterior  we 
seek — we  cannot  know  beforehand  how  long  this 
“mixing”  will  take.  A  large  literature  exists  on 
diagnosing  nonconvergence,  and  the  methods 
therein  generally  rely  on  monitoring  the  chain  over 
time,  or  monitoring  the  performance  of  chains 
with  different  starting  points.  Related  to  this 
endeavor,  researchers  typically  dispose  of  early, 
nonstationary  chain  values  as  “burn-in.”  A  special 
case  of  the  Metropolis-Hasting  approach,  popular 
in  political  science  and  elsewhere,  is  the  Gibbs 
sampler.  This  requires  that  researchers  have  the 
relevant  conditional  distributions  for  the  posterior 
7t(0)  available  to  sample  from.  When  feasible, 
Gibbs  sampling  is  more  efficient  than  a  more  gen¬ 
eral  Metropolis-Hasting  approach  since  the  candi¬ 
date  values  are  accepted  with  probability  1  every 
time.  Free  software  for  specifically  fitting  models 
using  Markov  chain  Monte  Carlo  methods  is 
widely  available. 

Arthur  Spirling 
Harvard  University 
Cambridge,  Massachusetts,  United  States 
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Multiculturalism  as  a  concept,  principle,  and  policy 
has  become,  since  the  1980s,  one  of  the  most  con¬ 
troversial  issues  in  social  sciences  and  humanities. 
The  term  refers  to  countries  with  territorial  and/or 
linguistic  minorities  and  those  formed  as  a  result  of 
the  migration  of  religious  or  racial  and/or  ethnic 
groups.  Minority  claims  for  equality  have  given  rise 
to  what  the  philosopher  Charles  Taylor  (1992)  has 
called  the  politics  of  recognition,  relating  it  to  the 
“democratic  defense  of  cultural  diversity  within  a 
universalistic  perspective.”  Recognition  policies 
along  with  differentiated  group  rights  are  at  the  core 
of  a  “multicultural  citizenship”  elaborated  by  Will 
Kymlicka.  For  defenders  of  the  principle,  multicul¬ 
turalism  matches  with  the  public  recognition  of 
cultural  identities,  with  equal  rights  for  ethnic, 
racial,  religious,  or  national  minorities.  It  therefore 
constitutes  the  foundation  of  democracy.  For  those 
who  oppose  the  principle,  it  leads,  on  the  contrary, 
to  the  “disunion”  of  the  nation  and  to  isolated  com¬ 
munities  in  the  political  arena,  and  it  is  therefore 
perceived  as  a  challenge  to  the  national  unity  guar¬ 
anteed  by  the  state.  For  some,  it  serves  to  thwart 
nationalism  and  for  others,  inversely,  it  serves  as  the 
basis  of  national  sentiments  and  expressions. 

The  debates  oppose  those  who  defend  a  liberal 
vision  against  a  republican  vision  of  pluralist  society 
that  recognizes  citizen  identity  only  on  the  grounds 
of  social  justice.  While  liberal  multiculturalism 
looks  for  a  response  to  the  management  of  cultural 
diversity  as  a  means  of  equal  inclusion  of  minorities 
in  the  political  community,  the  republican  view  rep¬ 
resents  multiculturalism  as  politics,  turning  the 
society  into  a  battleground  where  common  values 
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are  transformed  into  particular  interests  and  where 
identities  perceived  as  majority  or  minority  com¬ 
pete  with  each  other  in  search  of  public  resources 
and  representation.  As  a  whole,  multiculturalism  is 
fundamentally  concerned  with  both  universalistic 
ideology  and  the  idea  of  a  common  civic  space  of 
political  participation  for  all  groups;  it  questions 
how  to  reconcile  the  integration  of  minorities  into 
a  common  civic  culture  with  the  protection  of  the 
most  vulnerable  groups.  The  process  has  trans¬ 
formed  an  anthropological  perspective  of  cultural 
diversity  into  a  normative  vision  of  plural  societies. 
Multiculturalism  is  thus  systematically  associated 
with  the  question  of  national  unity  and  its  integra¬ 
tive  capacity. 

Each  state  has  its  own  understanding  of  minor¬ 
ity  and  elaborates  specific  relations  with  its  minor¬ 
ities.  Progress  in  the  judicial  sphere  now  involves 
questions  regarding  the  cultural  and  religious 
rights  of  minorities  in  their  fight  against  all  forms 
of  discrimination.  That  does  not  resolve  the  issue 
of  whether  to  define  a  minority  in  territorial  or 
nonterritorial  terms.  Definitions  continue  to  remain 
ambiguous  and  differ  according  to  national  experi¬ 
ences  that  define  the  relations  between  states  and 
minorities.  In  Canada,  the  confrontation  between 
the  French  and  English  languages,  because  of 
Quebec’s  status  and  the  debates  around  a  bilingual 
and  bicultural  society,  defined  as  such  by  the  Royal 
Commission  on  Multiculturalism,  gave  political 
legitimacy  to  the  concept,  thanks  to  the  constitu¬ 
tional  multiculturalism  used  in  the  Charter  of 
Rights  and  Freedoms,  enacted  by  the  British 
Parliament  and  signed  by  Queen  Elizabeth  II  as 
part  of  the  Canada  Act  in  1982,  which  was  thus 
officially  accepted  as  the  fundamental  characteris¬ 
tic  of  the  Canadian  state.  In  the  United  States,  the 
concept  has  been  grounded  on  the  civil  rights 
movement  of  the  1960s.  It  took  a  political  and 
legal  shape  with  the  establishment  of  affirmative 
action  starting  in  1965,  as  a  way  of  increasing  the 
access  of  members  of  historically  disadvantaged 
groups  to  benefits  such  as  employment,  college  or 
university  admission,  and  the  granting  of  govern¬ 
ment  contracts.  These  measures  sought  to  reduce 
racial  or  ethnic  inequalities  and  historical  injustices 
by  trying  to  repair  the  effect  of  past  policies,  nota¬ 
bly  slavery  and  racial  segregation. 

In  Europe,  the  term  multiculturalism  is  used  to 
refer  to  various  situations  according  to  the  structure 


of  the  state  and  the  recognition  of  regional  and  lin¬ 
guistic  particularities  and  of  the  minorities.  Some 
countries  in  continental  Europe  have  institutional¬ 
ized  pluralism  through  the  creation  of  regions 
granted  limited  power,  as  in  Italy  and  Spain;  others 
have  built  the  state  on  linguistic  pluralism,  as  in 
Belgium  and  Switzerland,  where  the  linguistic  and 
territorial  communities  each  have  their  own  institu¬ 
tions.  In  Eastern  Europe,  the  presence  of  some 
populations  on  a  border  of  the  neighbor  state  has 
led  to  the  elaboration  of  minority  rights  in  kin 
states,  such  as  for  Hungarians  in  Romania.  But  in 
France,  Germany,  Great  Britain,  and  the  Nether¬ 
lands,  the  term  multiculturalism  refers  to  immigrant 
populations  organized  around  a  common  national¬ 
ity  or  religion  (or  both)  and  their  demand  for  spe¬ 
cific  voices  in  the  public  sphere,  as  with  ethnic 
minorities  or  Native  Americans  and  African  Amer¬ 
icans  in  the  United  States. 

To  some  extent,  similar  situations  entail  recourse 
to  concepts  that,  used  in  different  national  con¬ 
texts,  require  new  definitions  and  different  policies. 
In  Western  Europe,  the  use  of  the  term  multicul¬ 
turalism  marks  the  shift  from  temporary  economic 
immigration  to  the  permanent  presence  of  immi¬ 
grant  populations  and  their  political  participa¬ 
tions.  From  the  state  perspective,  this  implies  the 
extension  of  the  welfare  state  to  a  new  realm — that 
of  immigration  and  identity — with  the  establish¬ 
ment  of  social  policies  to  guarantee  integration  of 
these  minorities  in  the  larger  society.  In  Germany, 
the  city  of  Frankfurt  created  a  bureau  of  multicul¬ 
tural  affairs,  whose  head  advocated  a  “multicultural 
democracy”  inspired  by  Jean-Jacques  Rousseau’s 
social  contract.  In  France,  too,  during  the  same 
period,  the  media  as  a  political  class  described 
French  society  as  “multiracial,”  “multicultural,” 
“plural,”  and  “pluri-cultural.”  This  terminology 
found  legitimacy  in  a  political  discourse  that  privi¬ 
leged  “the  right  to  difference,”  established  in 
1981.  In  Great  Britain,  the  Commission  for  Racial 
Equality  promulgated  in  1976  the  Race  Relations 
Act.  The  main  objectives  were  to  fight  racism,  to 
eliminate  discrimination,  and  to  ensure  equality  of 
opportunity  and  thus  establish  good  relations 
among  different  racial  groups.  The  targeted  dis¬ 
crimination  against  Muslims  in  Britain  today  has 
led  sociologists  to  argue  for  the  inclusion  of  reli¬ 
gion — mainly  Islam — in  ethnic  and  racial  policy 
and  its  consideration  by  antidiscrimination  laws. 
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In  the  Netherlands  in  the  1980s,  a  minority  policy 
took  the  objective  of  “promoting  multiculturalism 
and  the  emancipation  of  ethnic  communities.” 

Whatever  the  discourse  or  the  practice  and 
whatever  the  definition  of  a  minority,  gradually  all 
European  countries  have  converged  in  a  sort  of 
“applied  multiculturalism.”  The  question  is  then 
what  are  the  cultures  and  groups  entitled  to  recog¬ 
nition?  What  group  rights  would  be  legitimate  to 
recognize?  In  Europe,  multiculturalist  policies  refer 
mainly  to  Islam — the  religion  of  the  majority  of 
postcolonial  immigrants  as  new  minorities — and  to 
its  institutional  and  legal  recognition.  The  percep¬ 
tion  of  Islam  as  not  separating  politics  and  religion 
challenges  secularist  European  states,  while  not 
recognizing  Muslims  as  a  minority  is  an  obstacle  to 
equal  inclusion  of  differences.  Its  legitimacy  stems 
from  equal  representation  of  religions  in  liberal 
democratic  societies;  and,  as  Bhiku  Parekh  (2000) 
suggests,  it  comes  as  the  extension  of  existing  insti¬ 
tutions  in  a  way  to  include  the  newly  emerging 
Islam  and  to  promote  at  the  same  time  a  common 
membership  and  a  common  civic  culture,  allowing 
citizens  to  find  adequate  identification.  Here  is  one 
of  the  paradoxes  of  multiculturalism:  political  and 
institutional  integration  of  differences  into  demo¬ 
cratic  values  to  claim  the  recognition  of  some  cul¬ 
tural  particularities  of  groups  that  question  their 
compatibility. 

Critics  of  multiculturalism  have  emphasized  the 
radicalization  of  Muslim  minorities  in  Europe. 
Myths,  claims,  and  discourses  of  such  groups  per¬ 
ceived  as  a  threat  to  universal  democratic  values 
came  to  justify  the  retreat  from  multiculturalism  in 
countries  such  as  Great  Britain,  the  Netherlands, 
and  even  Canada,  where,  as  Charles  Joppke  notes, 
the  accommodation  of  minorities  has  been  the  core 
of  the  integration  policy.  Arguments  are  grounded 
on  the  effect  of  multiculturalism  on  the  economic, 
cultural,  and  political  isolation  of  communities; 
ethnic  violence,  perceived  as  a  result  of  identity 
politics  that  have  failed  to  ensure  civic  harmony; 
and  a  fear  of  the  global  transnational  forces  that 
penetrate  national  societies  and  create  a  competition 
between  cultural-religious  communities  expanded 
beyond  the  borders  and  the  territorially  bounded 
national-secular  community.  In  the  Netherlands, 
the  assassinations  by  extremist  groups  of  politi¬ 
cian  Pirn  Fortuyn  and  filmmaker  Theo  van  Gogh 
have  engendered  a  radical  change  in  politics  and 


opinions  toward  minorities,  with  an  emphasis  on 
the  failure  of  the  politics  of  difference  that  did  not 
lead  to  the  expected  economic  integration  of 
migrants  but,  on  the  contrary,  isolated  them  cultur¬ 
ally  and  politically.  Great  Britain  has  also  changed 
its  policy  after  the  terrorist  attacks  on  London’s 
public  transportation  system  in  London  on  July  7, 
2005,  realizing  that  the  society  is  getting  away  from 
Bhiku  Parekh’s  aspiration  for  a  community  of  com¬ 
munities  and  a  community  of  citizens  at  the  same 
time.  In  Canada,  the  attempt  to  establish  a  sharia 
court  in  Ontario  brought  the  debate  to  a  legal  and 
constitutional  level  and  has  created  a  dilemma 
on  the  search  for  equal  justice  and  the  limits  of 
toleration. 

Despite  the  retreat  of  liberal  democracies  from 
multiculturalism,  the  principle  and  the  discourse 
are  now  diffused  on  international  and  suprana¬ 
tional  levels  within  European  institutions  and 
beyond.  The  question  is  associated  with  minority 
rights,  more  specifically  with  national  minorities 
and  indigenous  populations;  it  does  not  concern 
immigrants  and  nonterritorial  ethnic  groups.  As 
noted  by  Kymlicka  (2007),  the  principle  is  inter¬ 
preted  as  an  extension  of  human  rights.  In  1966, 
Article  27  of  the  United  Nations  (UN)  International 
Covenant  on  Civil  and  Political  Rights  (Human 
Rights  Committee)  stipulated  that 

in  those  states  in  which  ethnic,  religious  and 
linguistic  minorities  exist,  persons  belonging  to 
such  minorities  shall  not  be  denied  the  right,  in 
community  with  other  members  of  their  group, 
to  enjoy  their  own  culture,  to  profess  and  practise 
their  own  religion,  or  to  use  their  own  language. 

In  1992,  the  UN  adopted  a  Declaration  on  the 
Rights  of  Persons  Belonging  to  National  or  Ethnic, 
Religious  and  Linguistic  Minorities  and  began  to 
debate  in  2007  the  rights  of  indigenous  people. 
After  the  Cold  War,  at  the  first  summit  of  the 
Council  of  Europe  in  Vienna  in  1993,  states  seemed 
to  rush  things  by  advocating  “to  draft  with  mini¬ 
mum  delay  a  framework  convention  specifying  the 
principles  which  contracting  States  commit  them¬ 
selves  to  respect,  in  order  to  assure  the  protection 
of  national  minorities.  This  instrument  would  also 
be  open  for  signature  by  non-member  States.” 

The  establishment  of  minority  rights  as  an  inter¬ 
national  standard  through  normative  institutions 


1632  Multilateralism 


as  well  as  “policy  networks” — the  nongovernmen¬ 
tal  organizations,  intellectuals,  media,  and  actors 
themselves — is  accepted  on  a  discursive  level  by 
many  states  (Kymlicka,  2007).  Their  objective  is  to 
adopt  effective  policies  in  a  timely  manner  regard¬ 
ing  cultural  diversity  and  ethnic  and/or  religious 
conflict  in  their  respective  societies.  It  becomes  a 
sign  of  democratization  for  non-Western  states. 
This  does  not  mean,  however,  that  states — Western 
or  non-Western — accept  multiculturalism  as  a  prin¬ 
ciple  or  as  a  norm.  Each  state  acts  with  regard  to 
the  complexity  of  situations,  to  the  variety  of 
minorities,  and  to  their  specific  historical  relations. 
At  stake  are  peaceful  coexistence  of  ethnically 
diverse  populations,  national  security,  and  civic 
harmony. 

Riva  Kastoryano 
CERl-Sciences  Po/CNRS 
Paris,  France 
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Multilateralism 


In  everyday  language,  the  term  multilateralism  is 
often  used  synonymously  with  international  coop¬ 
eration.  Among  diplomats,  it  refers  to  certain 


cooperative  diplomatic  practices,  and,  according 
to  a  common  scholarly  definition,  multilateralism 
is  simply  international  cooperation  that  includes 
three  or  more  states.  This  minimalist  conception  is 
sometimes  supplemented  by  a  number  of  condi¬ 
tions,  including  the  idea  that  cooperation  between 
states  should  be  based  on  generalized  principles  of 
behavior.  Given  this  plethora  of  different  mean¬ 
ings,  it  seems  not  unfair  to  say  that  the  term 
multilateralism  is  a  convenient  yet  potentially 
confusing  shorthand  “tip  of  the  iceberg”  concept, 
representing  and  connoting  a  multitude  of  phe¬ 
nomena.  This  entry  reviews  three  major  different 
ways  of  referring  to  multilateralism,  for  which 
reason  it  is  introduced  as  an  institutional  system, 
a  foreign  policy  strategy,  and  as  political  ideology. 

The  Multilateral  System 

In  a  longue  duree  perspective — that  is,  a  temporal 
perspective  spanning  centuries — we  have  seen  the 
emergence  of  an  increasingly  dense  network  of 
multilateral  institutions  and  agreements.  This  net¬ 
work  is  frequently  called  the  multilateral  system 
and  has  essentially  developed  through  four  main 
phases.  During  the  first  phase,  reaching  back  to  the 
early  19th  century,  the  system  was  characterized 
by  the  first  experiments  in  collective  diplomacy. 
This  form  of  diplomacy  was  a  feature  of  the  post- 
Napoleonic  era  and  was  cultivated  by  European 
diplomats  and  heads  of  state  meeting  in  1815  at 
the  Vienna  Congress  to  determine  the  future 
European  order.  Subsequently,  a  series  of  more 
than  30  conferences,  called  the  Concert  of  Europe, 
constituted  the  first  example  of  recurrent  collective 
diplomacy.  In  this  context,  “collective”  does  not 
refer  to  all  interested  states — the  Concert  was 
characterized  by  special  great-power  roles  and 
responsibilities.  The  first  phase  also  included  the 
introduction  of  international  conflict  resolution  by 
means  of  legal  instruments,  especially  arbitration 
and  negotiation,  essentially  the  precursor  for  the 
International  Court  of  Justice  in  The  Hague. 
Notably,  the  first  phase  also  saw  the  creation  of 
the  first  proper  international  organization,  specifi¬ 
cally  the  launch  of  the  International  Telegraphic 
Union  in  1865.  Subsequently,  other  specialized 
international  organizations  were  initiated,  and  by 
1900  about  a  dozen  had  been  created.  All  these 
origins  represent  the  beginning  of  a  novel  and 
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increasingly  institutionalized  feature  being  intro¬ 
duced  to  the  international  states  system. 

The  second  phase  starts  with  the  establishment 
of  the  multipurpose  and,  in  principle,  global 
League  of  Nations  after  World  War  I.  Membership 
in  the  league  consisted  mainly  of  the  European 
states,  thereby  reflecting  the  colonial  times  in 
which  it  was  established,  the  eurocentrism  of  the 
international  states  system,  and  the  reluctance  of 
the  United  States  to  engage  in  international  poli¬ 
tics.  The  creation  of  the  league  was  a  deliberate 
attempt  to  avoid  future  major  wars,  for  which 
reason  a  system  of  collective  security  was  intro¬ 
duced.  Apart  from  the  league,  some  further  spe¬ 
cific  organizations  were  created,  including  the 
International  Labour  Organization  (1919).  The 
League — and  perhaps  foremost  its  member  states — 
clearly  failed  in  the  key  objective  of  securing  the 
peace,  yet  as  failure  is  one  source  of  experience  and 
knowledge,  the  lessons  learned  were  used  when 
states  designed  the  successor  organization  during 
and  after  World  War  II. 

The  third  phase  begins  with  the  creation  of  the 
United  Nations  (UN)  and  an  increasingly  broad 
range  of  so-called  functional  agencies,  including 
the  World  Health  Organization  (WHO),  Food  and 
Agriculture  Organization  (FAO),  and  the  United 
Nations  Educational,  Scientific  and  Cultural 
Organization  (UNESCO).  During  this  phase,  not 
all  intended  organizational  projects  were  feasible, 
as  demonstrated  by  the  aborted  International 
Trade  Organization  (ITO;  replaced  by  the  less 
ambitious  trade  regime,  the  General  Agreement  on 
Tariffs  and  Trade  [GATTJ)  and  by  the  never  fully 
ratified  European  Defence  Community  (EDC), 
which  was  substituted  later  by  the  Western 
European  Union  (WEU).  In  turn,  the  WEU  never 
got  to  play  a  significant  defense  role,  as  this  func¬ 
tion  was  provided  by  the  North  Atlantic  Treaty 
Organization  (NATO).  Similarly,  the  Cold  War 
context  did  not  allow  the  UN  to  function  as 
intended,  and  the  full  potential  of  the  organization 
never  came  to  fruition.  The  third  phase  is  also 
characterized  by  experiments  in  terms  of  regional 
integration  and  organizations,  most  significantly 
the  European  Community.  Other  examples  include 
the  Association  of  Southeast  Asian  Nations 
(ASEAN),  the  Arab  League,  the  Nordic  Council, 
the  Gulf  Cooperation  Council  (GCC),  and  the 
(failed)  East  African  Common  Market. 


The  fourth  phase  starts  with  the  end  of  the  Cold 
War  and  is  characterized  by  a  relaunch  of  the  UN 
(now  freed  from  the  constraints  of  the  Cold  War), 
a  new  wave  of  international  organizations  (e.g., 
World  Trade  Organization  [WTOJ,  African  Union 
[AUJ,  Organization  for  Security  and  Cooperation 
in  Europe  [OSCE],  North  American  Free  Trade 
Agreement  [NAFTA],  Common  Market  of  the 
South  [Mercado  Comrin  del  Sur;  MERCOSUR], 
Asia  Pacific  Economic  Community  [APEC]),  and  a 
series  of  reforms  of  the  European  Community, 
notably  its  transformation  into  the  European 
Union  (EU).  Subsequently,  the  EU  aims  at  playing 
a  role  in  international  organizations,  not  any  lon¬ 
ger  wishing  to  be  just  an  international  organiza¬ 
tion  among  others.  In  summary,  it  is  striking  that 
each  new  phase  of  international  organization 
begins  as  a  more  or  less  collective  political  response 
to  fatal  crossroads  situations  at  the  end  of  major 
wars.  Combined,  the  many  international  organiza¬ 
tions  constitute  an  increasingly  dense  network  of 
organizations,  each  based  on  treaties  or  agree¬ 
ments  characterized  by  specified  sets  of  principles, 
norms,  and  rules.  Some  analysts  call  the  outcome 
global  governance ,  whereas  others  prefer  the  con¬ 
cept  of  a  fragmentary  world  state. 

Formal  organizations  do  not  constitute  the  only 
feature  of  the  multilateral  system.  To  realize  better 
the  diversity  of  institutions,  it  is  useful  to  make  a 
distinction  between  international  (governmental) 
organizations,  international  regimes,  and  interna¬ 
tional  conventions.  Whereas  organizations  are 
characterized  by  a  postal  address,  employed 
bureaucrats,  and  budgets,  international  regimes 
can  be  seen  as  agreements,  whether  formal  or 
informal,  about  very  specific  issues.  Examples  of 
such  specific  issues  comprise  the  EU’s  regulation  of 
the  import  of  bananas,  regimes  concerning  the 
export  of  missile  technology  control  (MTCR),  and 
the  former  trade  regime,  the  GATT  (being  institu¬ 
tionalized  in  the  mid-1990s  and,  thus,  becoming 
the  WTO).  Regimes  have  always  been  an  impor¬ 
tant  part  of  the  multilateral  system,  but  their  devel¬ 
opment  is  more  difficult  to  categorize  in  phases. 
Concerning  informal  nonbinding  agreements, 
states  have  a  clear  advantage  in  the  sense  of  not 
being  constrained  by  the  agreement.  The  problem 
is  that  other  states  enjoy  a  similar  advantage  for 
which  reason  their  future  behavior  becomes  less 
predictable. 
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Finally,  international  norms  or  conventions 
(notably  spelled  with  a  small  c)  refer  to  implicit 
rules  and  tacit  understandings.  Such  conventions 
are  customs  and  habits,  and  both  allow  us  to  have 
expectations  about  likely  behavior  or  interpreta¬ 
tions  of  agreements.  In  this  sense,  conventions  or 
norms  are  informal  institutions  that  can  exist  for 
centuries  without  being  codified.  Often,  international 
treaties  and  agreements  are  little  more  than  codifi¬ 
cations  of  previous  conventional  understandings. 
Hence,  there  is  a  complex  and  intimate  interplay 
between  conventions,  regimes,  and  organizations. 
Often,  these  institutional  forms  are  closely  inter¬ 
twined  in  reality,  and  their  separation  is  foremost 
analytically  convenient.  This  feature  can  be  illus¬ 
trated  by  the  example  of  the  field  of  nonprolifera¬ 
tion  of  weapons  of  mass  destruction.  It  is  a  field 
characterized  by  a  number  of  formal  agreements — 
that  is,  treaties  of  nonproliferation,  including  the 
Non-Proliferation  Treaty  (NPT)  of  1968.  In  addi¬ 
tion,  a  group  of  states  have  created  a  handful  of 
export  control  regimes,  including  the  MTCR, 
which  helps  these  states  specify  and  reach  the 
objective  of  nonproliferation.  The  number  of  con¬ 
tracting  states  is  rather  limited,  yet  many  states 
adhere  to  the  principles  of  the  regimes  as  if  they 
were  members.  Further,  some  international  organi¬ 
zations  have  been  assigned  specific  tasks  in  the 
field,  for  example,  the  International  Atomic  Energy 
Agency  (IAEA)  and  the  Organization  for  the 
Prohibition  of  Chemical  Weapons  (OPCW). 
Finally,  throughout  most  of  the  20th  century,  there 
has  been  an  international  convention,  a  custom, 
not  to  use  chemical  weapons,  and  in  1997,  a  formal 
agreement,  the  Chemical  Weapons  Convention, 
came  into  force. 

In  a  wider  international  system  perspective,  the 
multilateral  system  can  be  seen  as  a  historically 
new  feature  of  the  international  states  system  or, 
differently  conceptualized,  as  a  fundamental  insti¬ 
tution  of  international  society.  The  multilateral 
system  is  often  said  to  be  strongly  state-centric;  the 
reason  being  that  states  create,  reproduce,  and, 
sometimes,  terminate  the  life  of  multilateral  insti¬ 
tutions.  Further,  states  are  usually  exclusive  mem¬ 
bers  of  such  institutions;  they  negotiate  and  sign 
international  agreements  that  materialize  in  inter¬ 
national  regimes.  States  also  engage  in  reproducing 
the  principles,  norms,  and  rules  underpinning  both 
organizations  and  regimes,  thereby  securing  that 


they  are  alive  and  working  and  not  just  dead  letters 
in  a  written  agreement.  Paradoxically,  the  multi¬ 
lateral  system  is  so  state-centric  that  the  EU,  not 
being  a  state,  experiences  severe  difficulties  in  per¬ 
forming  as  an  international  actor  pursuing 
European  interests.  However,  while  states  might 
have  been  exclusive  players  in  the  past,  transna¬ 
tional  actors  (TNAs)  play  an  increasingly  signifi¬ 
cant  role  in  the  contemporary  multilateral  system. 
TNAs  frequently  contribute  to  agenda  setting, 
sometimes  provide  intellectual  leadership,  push  for 
some  international  agreements,  and  campaign 
against  other  agreements.  Prominent  examples 
include  the  creation  of  the  International  Criminal 
Court  and  the  international  treaty  banning  land¬ 
mines,  both  initiatives  being  sponsored  by  the  EU 
and  NGOs  working  in  the  field  of  human  rights. 
The  enduring  significance  of  TNAs  in  global  gov¬ 
ernance  remains  an  empirical  question,  yet  it  is 
beyond  discussion  that  they,  even  if  not  formal 
members,  increasingly  contribute  to  the  politics  of 
multilateralism. 

Multilateral  Foreign  Policy 

Multilateral  institutions  have  states  as  members 
and  therefore  need  these  states  to  engage  in  some 
political  gardening  to  keep  the  institutions  on 
track  and  on  mission.  Hence,  the  foreign  policy  of 
states  requires  a  more  or  less  prominent  multilat¬ 
eral  dimension.  Multilateralism  is  one  option  among 
a  range  of  foreign  policy  strategies.  Multilateral 
foreign  policy  strategies  prioritize  the  promotion 
of  and  commitment  to  international  institutions, 
including  the  resources  it  takes  to  cultivate  these 
multilateral  institutions.  The  United  States  opted 
for  this  strategy  after  the  end  of  World  War  II  and 
also  immediately  after  the  end  of  the  Cold  War.  At 
other  times,  the  commitment  to  multilateral  insti¬ 
tutions  has  been  more  ambivalent.  One  of  the 
stated  strategic  objectives  of  European  foreign 
policy  is  to  promote  effective  multilateralism. 
Other  great  powers  have  been  more  reluctant  or 
simply  unable  to  shape  the  multilateral  system  or 
its  individual  institutions.  Thus,  Russia,  China, 
India,  and  Japan  have  had  a  modest  impact  on  the 
development  of  the  system.  It  is  telling  that  the 
international  financial  institutions,  the  International 
Monetary  Fund,  and  the  World  Bank  thrived  dur¬ 
ing  the  Cold  War  partly  due  to  the  fact  that  the 
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Soviet  Union  decided  not  to  join.  By  contrast, 
middle  powers  such  as  Canada  and  the  Nordic 
states  have  traditionally  been  keen  supporters  of 
multilateralism:  In  some  cases,  they  almost  fully 
comply  with  their  international  obligations;  that 
is,  they  ratify  the  international  treaties  they  sign 
and  subsequently  display  few  infringements  against 
the  treaties  they  have  signed. 

To  further  specify  the  meaning  and  characteris¬ 
tics  of  multilateral  foreign  policy,  it  is  useful  to 
proceed  ex  negativo,  or  to  explain  what  it  is  not. 
The  antonym  to  multilateralism  is  unilateralism,  a 
“going-it-alone”  foreign  policy  strategy.  Application 
of  the  strategy  implies  that  a  given  state  essentially 
does  what  it  wants  to  do,  abandoning  consulta¬ 
tions,  coordination,  or  cooperation  with  allies  or 
third  parties  and  employing  an  approach  that  is 
characterized  by  the  absence  of  international  legal 
or  political  constraints.  However,  proponents  of  a 
“going-it-alone”  strategy  sometimes  advance  their 
arguments  on  the  basis  of  a  narrow  understanding 
of  international  customary  law.  Especially,  great 
powers  enjoy  the  option  of  unilateral  action. 
During  the  George  W.  Bush  administration,  the 
United  States  was  prone  to  unilateral  action,  diplo¬ 
matically  labeled  a  la  carte  multilateralism.  Though 
having  a  preference  for  multilateral  foreign  policy, 
the  EU  is  no  stranger  to  unilateral  action,  as  dem¬ 
onstrated  by  the  rather  frequent  use  of  sanctions — 
that  is,  in  cases  where  the  EU  acts  like  a  unitary 
actor.  However,  the  invasions  of  Kuwait  by  Iraq 
and  the  Falkland  Islands  by  Argentina  suggest  that 
it  is  not  only  the  great  powers  that  enjoy  the  privi¬ 
lege  or  temptations  of  unilateral  military  action. 

Like  unilateralism,  bilateralism  is  also  not  mul¬ 
tilateralism  as  it  only  takes  two  to  cultivate  a  bilat¬ 
eral  relationship.  An  illustrative  example  is  Japan 
and  the  United  States  cultivating  a  broad  range  of 
bilateral  agreements,  including  defense  and  secu¬ 
rity.  Sometimes  bilateralism  and  multilateral 
arrangements  are  intertwined,  for  example,  France 
and  Germany  cultivating  a  close  bilateral  relation¬ 
ship,  yet  within  the  context  of  the  EU.  Also  the 
United  States  and  the  EU  can  be  said  to  cultivate 
an  exclusive  bilateralism  within  the  context  of  a 
multilateral  WTO,  in  turn  triggering  opposition 
coalitions  such  as  the  Cairns  Group. 

The  three  strategies  are  not  necessarily  mutually 
exclusive.  Thus,  unilateral  diplomatic  action  can 
provide  the  necessary  conditions  for  a  major 


breakthrough  in  otherwise  stalled  multilateral 
negotiations,  and  bilateral  relations  sometimes 
function  as  the  innovative  undergrowth,  providing 
the  leadership,  coalition  building,  and  preparatory 
work  that  secure  an  agreement  within  multilateral 
diplomacy,  characterized  by  a  large  number  of 
participants.  Reaching  international  agreements  by 
a  select  subgrouping  of  a  larger  multilateral  orga¬ 
nization  or  regime,  which  are  subsequently  adopted 
by  other  states,  is  known  as  minilateralism.  Hence, 
minilateralism  is  characterized  by  an  exclusive 
group  of  key  states  within  a  specific  issue  area 
reaching  an  agreement  that  is  subsequently 
accepted  within  a  larger  multilateral  setting.  One 
example  is  the  policy  by  France  and  the  United 
Kingdom  vis-a-vis  the  breakup  of  the  former 
Yugoslavia,  a  policy  that  was  accepted  by  partners 
within  the  EU  and  subsequently  adopted  by  the 
UN  Security  Council.  Minilateralism  should  not  be 
conflated  with  plurilateral  agreements  being  con¬ 
cluded  by  a  small  select  grouping  of  states  and 
subsequently  adhered  to  by  a  larger  segment  of 
international  society.  A  reference  to  the  plurilateral 
instrument  was  made  after  the  2003  WTO  Doha 
Round  meltdown  in  Cancun,  when  the  EU  sug¬ 
gested  that  given  the  multilateral  deadlock,  per¬ 
haps  a  plurilateral  approach  would  be  a  desirable, 
attractive,  or,  at  least,  possible  alternative. 

Multilateralism  as  Political  Ideology 

In  certain  contexts,  multilateralism  should  be  seen 
as  an  ideology  that  exists  in  both  positive  and 
negative  versions.  These  versions  share  the  view 
that  the  qualities  of  multilateralism  are  a  matter  of 
belief  or  faith  and  therefore  beyond  debate  and 
examination.  In  the  first  place,  multilateralism  is 
seen  as  morally  superior  to  other  foreign  policy 
strategies.  Further,  multilateralism  tends  to  be  seen 
as  an  end  in  itself,  for  which  reason  the  instrumen¬ 
tal  dimension  is  systematically  downplayed.  Being 
multilateral  is  what  counts  and,  even  if  nothing 
can  be  achieved,  it  remains  imperative  to  keep  the 
multilateral  machinery  in  place.  During  the  late 
1970s  and  early  1980s,  this  attitude  characterized 
the  European  approach  to  the  CSCE  (Conference 
on  Security  and  Cooperation  in  Europe)  process. 
Finally,  in  some  understandings,  multilateralism 
and  the  UN  are  simply  synonymous.  Due  to  its 
global  membership,  only  the  UN  is  considered  a 


1636  Multilateralism 


genuine  example  of  multilateralism.  Interestingly, 
the  authors  of  the  European  Commission’s  annual 
report  seem  to  take  this  stance.  The  positive  ver¬ 
sion  has  also  been  called  globalism ,  and  adherents 
argue  that  global  agreements  should  be  prioritized 
even  if  the  number  of  veto  players  almost  guaran¬ 
tees  that  no  agreement  is  the  likely  outcome.  The 
alternative,  in  terms  of  an  agreement  concluded  by 
the,  say,  14  most  important  states  for  global  cli¬ 
mate,  is  dismissed  and  accompanied  by  references 
to  problems  of  legitimacy.  Similarly,  plurilateral 
agreements  are  also  dismissed  because  they  are  by 
their  nature  essentially  exclusive. 

According  to  the  dystopian  ideology  of  multilat¬ 
eralism,  multilateral  approaches  are  synonymous 
with  inaction,  unwarranted  constraints  on  state 
action,  or  both.  One  argument  is  that  it  would  be 
irresponsible  to  leave  decisions  concerning  crucial 
issues  to  multilateral  institutions  as  inaction  and 
nondecision  is  often  the  likely  outcome.  If  states 
nevertheless  opt  for  multilateralism,  it  is  because 
they  use  multilateral  institutions  instrumentally  to 
cover  for  inaction.  In  this  perspective,  multilateral 
agreements  function  as  window  dressing — for 
example,  the  commitment  to  0.7%  gross  domestic 
product  (GDP)  on  development,  a  commitment 
that  very  few  states  seem  to  adhere  to.  By  contrast, 
the  Millennium  Development  Goals  have  not  been 
met  due  to  a  wider  set  of  failings,  including  donor 
shortfalls,  recipient  shortcomings,  and  organiza¬ 
tional  malfunction.  A  second  argument  is  that 
powerful  states  should  not  constrain  themselves  by 
getting  embroiled  in  a  web  of  rules  and  obliga¬ 
tions.  A  third  argument,  for  instance,  used  by 
opponents  of  European  integration,  is  that  small 
states  should  not  give  up  the  formal  and  symbolic 
dimensions  of  independence  or  autonomous  deci¬ 
sion-making  power.  Finally,  proponents  of  an 
exclusive  league  of  democracies  to  replace  the  UN 
argue  that  universalism  triggers  a  fundamental 
problem  as  nondemocratic  states  are  given  too 
much  of  a  voice. 

Theoretical  Reflections  on  Multilateralism 

Major  theoretical  debates  in  the  discipline  of  inter¬ 
national  relations  concern  the  promise  of  interna¬ 
tional  institutions  and  the  multilateral  system. 
Four  issues  constitute  the  backbone  of  these 
debates  especially.  In  the  first  place,  some  theorists 


claim  that  multilateral  institutions  should  be  seen 
as  mere  arenas  on  which  (strong)  states  engage  in 
international  politics.  Such  arenas  have  no  inde¬ 
pendent  or  significant  impact  on  the  script  or  on 
what  is  being  played.  In  contrast,  other  analysts 
emphasize  that  the  level  of  institutionalization  in 
the  international  system  has  an  impact  on  state 
behavior,  that  international  organizations  some¬ 
times  teach  states  about  their  interests,  and  that 
organizations  can  be  seen  as  agents  that  are  not 
entirely  controlled  by  their  principals.  In  various 
ways,  all  these  approaches  emphasize  that  the 
arena  approach  is  most  misleading  and  largely 
unfounded  by  evidence. 

Second,  there  is  an  enduring  issue  concerning 
the  instrumentality  of  international  organizations. 
While  some  theorists  are  keen  to  theorize  what 
international  organizations  can  do  and  what  dif¬ 
ference  they  can  possibly  make  to  international 
politics,  other  theorists  focus  on  instrumental 
action.  As  Inis  Claude  has  famously  emphasized, 
we  should  ask  not  what  the  UN  can  do  but  what 
the  UN  can  be  used  for. 

Third,  some  theorists  claim  that,  at  best,  multi¬ 
lateral  institutions  reflect  the  changing  distribution 
of  power  and,  at  worst,  trigger  naive  and  danger¬ 
ous  illusions  about  their  potential,  making  us  for¬ 
get  those  few  but  important  things  that  really 
make  a  difference  in  international  politics.  If, 
indeed,  the  impact  of  multilateral  institutions  is 
epiphenomenal  to  power  politics,  then  why  waste 
time  analyzing  something  that  essentially  is  derived 
from  more  powerful  explanatory  variables?  By 
contrast,  other  theorists  claim  that  the  institution¬ 
alization  of  world  politics  mold  international  anar¬ 
chy,  thereby  reducing  the  power  of  power  politics 
and  making  a  more  peaceful  world  possible.  These 
scholars  emphasize  that  there  is  a  linkage  between 
the  growth  of  multilateral  institutions  and  the  pro¬ 
nounced  decline  in  interstate  war. 

Finally,  many  scholars  tend  to  take  the  positive 
qualities  of  international  institutions  for  granted,  a 
tendency  that  is  particularly  strong  among  those 
theorists  who  regard  multilateralism  a  means 
toward  cooperation  and  peace.  By  contrast,  other 
analysts  claim  that  multilateral  institutions  fre¬ 
quently  display  an  alarming  degree  of  dysfunc¬ 
tional  or,  worse,  pathological  features — that  is, 
characteristics  that  potentially  undermine  whatever 
legitimacy  these  institutions  might  have  and,  in 
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any  case,  provide  significant  obstacles  to  reaching 
global  solutions  to  urgent  global  problems. 

Crises  of  Multilateralism 

The  development  of  the  multilateral  system  and  its 
international  institutions  has  frequently  been  char¬ 
acterized  by  crises,  ranging  from  the  failure  of  the 
League  of  Nations  to  the  less  than  perfect  perfor¬ 
mance  of  the  UN  during  the  Cold  War  and  the 
doldrums  of  the  European  Community  during  the 
1970s.  However,  the  contemporary  crisis  of  multi¬ 
lateralism  appears  to  be  more  profound  than  previ¬ 
ous  crises,  and  it  seems  to  have  several  sources. 
Thus,  several  major  multilateral  institutions  were 
created  during  the  Cold  War,  and,  as  the  world  has 
subsequently  changed,  they  are  in  severe  need  of 
more  or  less  comprehensive  reform.  Further,  the 
international  institutions  are  characterized  by  an 
ever  more  present  trade-off  between  inclusion, 
legitimacy,  and  effectiveness.  As  the  number  of  par¬ 
ticipant  state  members  increases,  so  does  the  num¬ 
ber  of  veto  players.  At  the  same  time,  there  is  an 
increasing  imbalance  between  the  provision  of  lead¬ 
ership,  which  is  lacking,  and  an  ever-broader  port¬ 
folio  of  global  demands  and  tasks  to  handle.  It  does 
not  help  that  several  key  states  pay  lip  service  to 
their  obligations  toward  international  institutions 
or  act  as  custodians  insisting  on  yesterday’s  arrange¬ 
ments,  not  necessarily  because  they  are  perfect  or 
efficient  but  because  they  represent  a  certain  distri¬ 
bution  of  power,  pride,  and  prestige.  In  the  UN,  the 
Group  of  77  (G77),  controlling  the  budget  and 
personnel,  has  shown  little  interest  in  administrative 
reform.  Finally,  multilateral  institutions  are  charac¬ 
terized  by  an  ever-wider  expectation-investment 
gap,  triggering,  in  turn,  frustration,  apathy,  or 
cynicism  concerning  absent  expected  deliverables. 
Examples  of  the  crisis  abound,  ranging  from  the 
stalled  UN  reform  process,  via  the  dire  straits  of  the 
Non-Proliferation  Treaty  regime,  to  the  stalled 
WTO  Doha  Round.  It  is  clear  that  sustainable  solu¬ 
tions  to  the  crisis  require  some  exquisite  intellectual 
innovative  thinking  and  strong  political  leadership 
and,  it  is  hoped,  not  another  calamitous  external 
shock  to  the  system  to  provoke  change. 
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Multilevel  Analysis 


Multilevel  modeling  is  used  in  the  analysis  of  data 
that  have  a  clustered  structure.  Such  data  arise  in 
various  fields — for  instance,  in  educational 
research,  where  pupils  are  nested  in  classes;  in 
medical  research,  where  patients  are  nested  within 
hospitals;  and  also  in  political  research,  where 
individuals  are  nested  in  a  social  context.  A  cru¬ 
cial  problem  in  the  statistical  analysis  of  clustered 
data  is  the  dependencies  between  individual  obser¬ 
vations.  For  example,  voters  from  the  same  city 
are  not  independent  from  one  another,  because 
they  are  all  influenced  by  local  policy.  The  statisti¬ 
cal  analysis  performed  on  clustered  data  should 
account  for  this  dependency.  However,  statistical 
analysis  can  also  benefit  from  such  dependency. 
The  characteristics  of  different  levels  can  be  com¬ 
bined  into  one  explanatory  model  and  conclusions 
can  be  drawn  about  their  effects,  both  at  each 
level  and  in  interaction.  In  the  following  sections, 
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the  multilevel  model  is  explained  and  illustrated 
by  means  of  a  political  example. 

The  starting  point  for  explaining  the  multilevel 
regression  model  is  the  idea  that  the  dependent 
variable,  allocated  at  the  lowest  level,  is  thought  to 
be  influenced  by  all  distinguished  levels.  With 
respect  to  a  two-level  model,  this  principle  can  be 
seen  in  the  following  equation  (the  intercept-only 
model): 

Yij  =  Too  +  uoj  +  e>r  (1) 

The  dependent  variable  Y  of  an  individual  i  in 
group  /  is  decomposed  into  three  different  parts. 
Because  there  are  no  explanatory  variables  in  the 
model  yet,  the  explanatory  part  of  the  model  con¬ 
sists  only  of  the  grand  mean  y00.  The  unexplained 
part  of  the  model  consists  of  two  parts:  an  error  at 
the  highest  level  (u0j)  and  an  error  at  the  lowest 
level  (e  ).  This  means  that  the  total  variance  of  the 
dependent  variable  is  decomposed  into  two  parts: 
the  error  variance  at  the  lowest  level  (ae)  and  the 

error  variance  at  the  second  level  ^cr2  J.  A  good 
explanation  of  the  dependent  variable  is  therefore 
based  on  both  levels.  Of  course,  one  should  keep  in 
mind  that  the  division  over  both  levels  can  differ  per 
variable.  The  more  error  variance  there  is  at  the  low¬ 
est  level,  the  less  important  the  second  level  will  be. 

Adding  explanatory  variables  of  both  levels 
gives  the  following  equation: 

Yij  =  7oo  +  Z'Ypo  xph  +  E'Yo  >?z®  +  uo j  +  etj .  (2) 

P  q 

In  this  equation,  p  explanatory  variables  of  the 
lower  level  and  q  explanatory  variables  of  the  sec¬ 
ond  level  are  added  to  the  model.  Interactions 
between  variables  of  the  lowest  level  are  also  called 
p,  and  interactions  between  variables  of  the  second 
level  are  called  q.  Because  these  variables  will 
explain  a  part  of  the  variance  at  both  levels,  the 
errors  will  become  smaller  and  the  explained  vari¬ 
ance  in  the  dependent  variable  can  be  calculated. 

Interactions  between  variables  of  different  levels 
take  up  a  special  position  in  the  model.  The  influ¬ 
ence  of  a  lower-level  variable  on  the  dependent 
variable  may  depend  on  a  second-level  variable. 
This  is  called  a  cross-level  interaction  (or  modera¬ 
tor  effect).  Adding  cross-level  interactions  to  the 


model,  gives  the  final,  and  most  elaborated,  multi¬ 
level  model: 

Yij  =  Y)0  +  Z  yP0XPH  +  Z  7o qZqi  +  U0 i  +  eij 
P  q 

+  ZZ'Vp?z«xp'/  +  Y.uPixP.i  +  uo i  +  eij-  (3) 
q  p  p 

Looking  at  this  equation,  one  can  see  that  not  only 
are  the  cross-level  interactions  added  to  the  model 
but  there  also  are  p  extra  error  terms  in  the  model: 
uPiXpir  The  errors  upj,  the  unexplained  parts  of  the 
regression  coefficients  of  the  lowest  level,  are  mul¬ 
tiplied  with  the  lowest-level  variables  and  are 
therefore  different  for  different  values  of  the  X 
variables.  This  is  called  heteroskedasticity.  In  ordi¬ 
nary  regression  analysis,  homoskedasticity  is 
assumed,  which  means  that  the  variance  of  the 
errors  is  independent  of  the  values  of  the  explana¬ 
tory  variables.  Therefore,  analyzing  the  model 
presented  in  Equation  3  requires  a  multilevel 
analysis.  The  variance  of  the  errors  u.  .  is  cr2  . 

The  assumptions  of  the  most  commonly  used 
multilevel  regression  model  are  that  the  residuals 
at  the  lowest  level  et/  are  normally  distributed  with 
a  mean  of  zero  and  a  common  variance  cr2  in  all 
groups.  The  second-level  residuals  u0j  and  u  .  are 
assumed  to  be  independent  from  the  lowest-level 
errors  et-  and  to  have  a  multivariate  normal  distri¬ 
bution  with  means  of  zero.  Other  assumptions, 
identical  to  the  common  assumptions  of  ordinary 
multiple  regression  analysis,  are  fixed  predictors 
and  linear  relationships.  Most  multilevel  software 
assumes  by  default  that  the  variance  of  the  residual 
errors  et-  is  the  same  for  all  second-level  units. 
However,  certain  forms  of  heteroskedasticity  can 
be  explicitly  modeled. 

Estimation  of  the  parameters  (regression  coeffi¬ 
cients  and  variance  components)  is  generally  done 
using  maximum  likelihood.  The  restricted  maxi¬ 
mum  likelihood  method  maximizes  a  likelihood 
function  that  is  invariant  for  the  fixed  effects  and 
therefore  leads,  in  theory,  to  better  estimates  of  the 
variance  components  than  the  full  maximum  likeli¬ 
hood  method  in  which  both  the  regression  coeffi¬ 
cients  and  the  variance  components  are  included  in 
the  likelihood  function.  In  practice,  the  differences 
between  the  two  methods  usually  are  not  large. 

Assume  that  we  have  data  from  a  random  sam¬ 
ple  of  cities  and  also  data  from  a  random  sample  of 


Multilevel  Analysis  1639 


inhabitants  of  these  cities.  We  are  interested  in 
explaining  involvement  of  individuals  with  local 
policy,  measured  on  a  scale  ranging  from  1  ( not  at 
all)  to  10  ( extremely ).  We  use  the  variable  gender 
as  an  explanatory  variable  at  the  lowest  level, 
assuming  that  men  are  more  interested  in  politics 
than  women,  and  the  variable  local  political  activi¬ 
ties  (measured  on  a  scale  from  1  [almost  none J  to  7 
[very  often J)  as  an  explanatory  variable  at  the  sec¬ 
ond  level,  assuming  that,  in  cities  with  more  local 
political  activities,  individuals  are  more  interested 
and  involved  in  political  issues.  We  have  data  from 
2000  individuals  from  100  cities  (see  Table  1).  The 
first  model  that  is  specified  is  the  intercept-only 
model.  The  total  variance  of  the  dependent  variable 
is  decomposed  into  two  independent  parts:  one  at 
the  individual  level  and  one  at  the  city  level. 
Because  there  are  no  explanatory  variables  in  the 
model,  the  variance  found  at  each  level  is  error 
variance.  The  percentage  variance  at  the  second 


level,  calculated  by  dividing  the  error  variance  of 
the  second  level  by  the  total  error  variance,  is 
31.85%.  This  means  that  almost  one  third  of  the 
total  variance  in  the  political  involvement  of  city 
members  lies  at  the  second  level.  Multilevel  analysis 
is  therefore  used  not  only  to  account  for  depen¬ 
dency  but  also  for  building  an  explanatory  model 
containing  variables  from  both  levels  and  their 
interactions. 

In  the  second  model,  the  explanatory  variables 
gender  and  local  political  activities  are  added  to  the 
model.  Both  variables  are  significant.  The  regres¬ 
sion  coefficient  of  gender  is  .88.  This  means  that 
men  score  almost  1  point  higher  on  political 
involvement  than  women  (controlling  for  local 
political  activities).  This  is  not  only  a  significant  but 
also  a  relevant  result:  almost  1  point  on  the  10- 
point  scale  of  political  involvement.  The  regression 
coefficient  of  local  political  activities  is  .22.  The 
interquartile  range  of  the  local  political  activities 
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Model  3: 

Model  4: 

Model  1: 

Model  2: 

With  Random  Slope 

With  Cross-Level 

Intercept  Only 

With  Predictors 

Gender 

Interaction 

Fixed  part 

Predictor 

Coefficient 

Coefficient 

Coefficient 

Coefficient 

(Standard  Error) 

(Standard  Error) 

(Standard  Error) 

(Standard  Error) 

Intercept 

4.92  (0.07) 

4.48  (0.05) 

4.47  (0.04) 

4.47  (0.04) 

Gender 

0.88  (0.04) 

0.86  (0.08) 

0.85  (0.08) 

Local  political 

0.22  (0.02) 

0.20  (0.02) 

0.18  (0.02) 

activities 

Gender  X  local 

0.09  (0.04) 

political 

activities 

Random  part 

2 

<Je 

0.95  (0.03) 

0.79  (0.03) 

0.68  (0.02) 

0.68  (0.02) 

2 

CT 

"0 

0.44  (0.07) 

0.13  (0.02) 

0.06  (0.02) 

0.05  (0.02) 

2 

(7 

"1 

0.47  (0.09) 

0.43  (0.08) 

CT«01 

-0.05  (0.03) 

-0.04  (0.03) 

Deviance 

5802.67 

53338.78 

5145.00 

5138.76 
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Figure  I  Interaction  Effect  of  Gender  and  Local  Political  Activities 


variable  is  4,  meaning  that  for  this  range  the  maxi¬ 
mum  difference  on  political  involvement  is  4  times 
.22,  which  is  .88.  This  also  seems  quite  relevant. 
Another  measure  for  relevance  is  the  explained 
variance.  The  explained  variance  is  calculated  by 
subtracting  the  error  variance  of  the  model  with  the 
explanatory  variables  (Model  2)  from  the  error 
variance  of  the  model  without  explanatory  vari¬ 
ables  (Model  1:  the  intercept-only  model)  and 
dividing  by  the  total  error  variance  of  the  intercept- 
only  model.  On  the  individual  level,  the  explained 
variance  is  17%;  on  the  city  level,  70%  is  explained. 
Both  are  really  relevant. 

In  the  third  model,  the  question  to  be  answered 
is  whether  the  influence  of  gender  is  the  same  in  all 
cities.  The  answer  to  this  question  is  no;  there  is 
variation  between  cities  for  this  effect  (slope  vari¬ 
ance  is  0.47).  The  next  step  is  to  explain  this 
variation  (Model  4).  For  the  explanation,  the 
interaction  between  gender  and  local  political 
activities  is  included  in  the  model.  The  effect  is 
significant  and  positive.  This  means  that  men  ben¬ 
efit  more  from  local  political  activities  than  women 
(see  Figure  1;  for  statistical  reasons,  the  variable 
local  political  activities  is  centered  on  the  grand 
mean).  The  explained  slope  variance  is  10%.  The 


difference  between  political  involve¬ 
ment  of  men  and  women  is  almost  2 
times  as  large  when  there  are  a  lot  of 
local  political  activities — in  compari¬ 
son  with  when  there  are  almost  no 
local  political  activities. 

The  example  discussed  in  the  previ¬ 
ous  section  shows  that  multilevel  anal¬ 
ysis  is  the  appropriate  way  to  answer 
research  questions  concerning  data 
with  a  clustered  structure.  Multilevel 
analysis  not  only  accounts  for  depen¬ 
dency  in  data,  but  it  also  takes  advan¬ 
tage  of  the  clustered  structure.  The 
individual  political  involvement  is 
explained  with  an  individual  character¬ 
istic  (gender),  a  city  characteristic  (local 
political  activities),  and  the  interaction 
between  these  two  characteristics. 
Multilevel  analysis  provides  an  optimal 
use  of  all  available  information. 

In  this  short  introduction  to  multi¬ 
level  analysis,  many  other  possibilities 
have  not  been  discussed.  For  instance, 
the  model  discussed  assumes  a  con¬ 
tinuous  dependent  variable  and  normally  distrib¬ 
uted  residuals.  The  check  of  these  assumptions  is 
not  discussed.  Of  course,  this  check  should  always 
be  carried  out,  but  it  is  beyond  the  scope  of  this 
entry.  When  the  response  variable  is  a  dichoto¬ 
mous  variable  or  a  proportion,  both  the  aforemen¬ 
tioned  assumptions  of  continuous  scores  and  the 
normal  distributed  errors  are  violated.  Multilevel 
logistic  regression  gives  the  solution  to  this  prob¬ 
lem.  Other  possibilities  are,  for  instance,  longitudi¬ 
nal  multilevel  analysis,  the  multilevel  approach  to 
meta-analysis,  cross-classified  models,  and  multi¬ 
level  structural  equation  models. 

Cora  J.  M.  Maas  (Deceased) 
Utrecht  University 
Utrecht,  Netherlands 
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Multinational  Corporations 

(MNCs) 


Multinational  corporations  fascinate  many  inter¬ 
national  political  economy  scholars.  Scholars  dis¬ 
agree  about  their  significance  and  their  impact  but 
continue  to  analyze  their  dynamic  evolution  in  the 
global  economy.  Multinational  corporations  are 
private  firms  that  operate  in  multiple  countries. 
Their  primary  motivation  is  economic  profit. 
They  may  engage  in  investment,  sales,  production, 
services,  extractive  processes,  or  any  combination 
of  these.  Multinational  corporations’  activities 
propel  economic  globalization.  They  may  invest, 
set  up  shop,  or  even  buy  existing  domestic  firms  as 
the  multinational  expands.  This  entry  addresses 
first  the  broad  views  about  multinational  corpora¬ 
tions  and  then  the  corporations’  motivations  to  go 
abroad.  The  entry  continues  with  a  historical 
overview  and  concludes  with  controversies  sur¬ 
rounding  multinational  corporations. 

Multinational  corporations  are  incorporated  in 
a  home  country  and  are  chartered  according  to  the 
home  country’s  laws  and  regulations.  Analysts 
have  debated  the  extent  to  which  home  country 
regulations  shape  multinational  corporations’  out¬ 
ward  conduct.  Some  argue  that  such  regulations 
make  a  major  difference  in  corporations’  behavior 
and  standards  abroad.  Others  see  multinational 
corporations  as  footloose  harbingers  of  global 
capitalism  with  no  particular  state-based  alle¬ 
giances.  Realist  scholars,  who  emphasize  the 
supremacy  of  state  power,  emphasize  corporations’ 
subordination  to  state  laws,  jurisdiction,  and  poli¬ 
cies.  They  tend  to  cite  instances  in  which  the  state 
took  actions  against  the  wishes  of  particular  corpo¬ 
rate  interests — for  example,  favoring  national  secu¬ 
rity  over  economic  considerations.  Other  analysts 


suggest  that  corporations  can  be  more  powerful 
than  states  and  that  states  may  be  actually  in 
retreat  in  the  face  of  rapid  economic  integration 
and  globalization.  These  analysts  note  the  increase 
in  the  size  and  role  of  the  private  sector  at  the 
expense  of  the  public  sector.  Beginning  in  the 
1980s,  many  states  delegated  an  ever-greater  range 
of  once  publicly  supported  activities  to  private 
actors.  Private  companies  have  come  to  run 
schools,  jails,  hospitals,  municipal  water  systems, 
and  have  even  provided  security  in  conflict  zones. 

Why  Firms  Go  Abroad 

Given  the  risks  and  costs  associated  with  initiating 
multinational  operations,  one  must  understand 
firms’  motives  to  do  so.  In  the  case  of  extractive 
industries,  the  logic  is  quite  straightforward. 
Natural  resource  firms  need  to  establish  themselves 
at  the  site  of  the  resources — gold,  oil,  copper,  dia¬ 
monds,  cobalt,  and  so  on.  Service  firms  may  seek 
to  be  closer  to  customers  and  be  able  to  adapt  to 
and  serve  local  markets  better.  Manufacturing 
firms  may  seek  out  highly  skilled  or,  alternatively, 
low-cost  labor.  Manufacturing  firms  may  seek  to 
get  behind  tariff  walls,  producing  and  selling  their 
goods  within  regions  protected  by  high  import 
taxes.  These  examples  demonstrate  firms’  motiva¬ 
tions  to  leverage  location-specific  advantages. 
Another  class  of  motivations  is  asset  specific. 
Often,  asset-specific  advantages  are  reputation 
based  or  trademark  related.  Obvious  examples  of 
firms  seeking  to  leverage  asset-specific  advantages 
would  be  Coca-Cola  and  McDonald’s,  entertain¬ 
ment  companies  (e.g.,  Disney),  and  brand-name 
luxury  goods  producers  and  distributors. 

Historical  Perspectives 

While  some  analysts  write  about  multinational 
corporations  as  if  they  were  relatively  recent  phe¬ 
nomena,  in  fact,  private  companies  engaged  in 
economic  activity  abroad  date  back  at  least  to  the 
age  of  empires  in  the  early  17th  century.  European 
countries  competing  for  colonies  often  sent  com¬ 
panies,  which  the  rulers  chartered,  to  distant 
shores  to  establish  a  presence  and  engage  in  eco¬ 
nomic  activity.  Charter  companies  such  as  the 
British  East  India  Company  and  the  Dutch  East 
India  Company  facilitated  European  expansion 
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beyond  what  the  countries’  militaries  alone  could 
accomplish.  In  this  way,  these  companies  and  their 
governments  participated  in  a  symbiotic  relation¬ 
ship  that  helped  fuel  both  imperial  expansion  and 
economic  growth.  Historical  appreciation  of  the 
age  of  mercantilism  puts  claims  of  the  novelty  of 
private  sector  power  and  the  stark  separation 
between  states  and  markets  into  question.  In  the 
colonial  era,  power  and  wealth  were  inextricably 
bound  together. 

By  the  late  19th  century,  firms  such  as  General 
Electric  and  Singer  Sewing  Machines  were  operat¬ 
ing  abroad.  At  the  end  of  World  War  I,  many  more 
companies  began  to  invest  and  build  plants  in  war- 
ravaged  Europe.  Latin  American,  Middle  Eastern, 
and  African  countries  attracted  investors  in  various 
extractive  and  natural  resource  industries.  After 
World  War  II,  manufacturing  firms  invested  abroad 
and  often  set  up  turnkey  plants  (fully  built  and 
ready  to  operate  with  the  turn  of  a  key)  for  produc¬ 
tion,  and  service  industries  established  branches 
abroad.  The  extractive  natural  resources  industries 
would  integrate  their  business  operations  vertically, 
exercising  control  over  all  stages  of  production  and 
processing.  They  sought  to  reduce  risk  by  maximiz¬ 
ing  control.  By  contrast,  manufacturing  and  service 
firms  tend  to  be  horizontally  integrated  and  repli¬ 
cate  themselves  across  countries.  Their  subsidiaries 
produce  the  same  products  or  provide  the  same 
service  in  plants  and  branches  in  different  countries. 
They  are  motivated  to  expand  or  defend  market 
shares  by  getting  behind  tariff  walls  (producing  for 
or  servicing  the  host  country’s  domestic  market)  or 
following  their  competitors  abroad. 

In  the  wake  of  decolonization,  until  about 
I960,  foreign  corporations  were  often  the  primary 
source  of  capital  and  technology  in  newly  inde¬ 
pendent  countries.  Eager  to  promote  industrializa¬ 
tion  in  the  face  of,  or  to  counterbalance,  large 
numbers  of  imported  goods,  many  newly  indepen¬ 
dent  countries  welcomed  foreign  investment  into 
their  economies.  Over  time,  however,  foreign 
companies  became  a  critical  target;  Marxist- 
inflected  dependency  theory  took  root  in  Latin 
American  countries  and  highlighted  some  of  the 
more  exploitative  aspects  of  multinational  corpo¬ 
rate  operations.  Critics  of  multinational  corpora¬ 
tions  complained  that  they  did  very  little  to  help 
develop  the  host  economy.  In  the  1960s  and 
1970s,  many  countries  in  Central  and  Latin 


America  adopted  measures  designed  to  channel 
foreign  investment  into  domestic  capacity  build¬ 
ing.  For  example,  governments  could  limit  foreign 
ownership  to  less  than  50%  of  a  domestic  firm  or 
require  foreign  firms  to  hire  a  certain  percentage 
of  domestic  labor  or  reinvest  a  set  percentage  of 
profits  into  the  domestic  economy. 

The  spread  of  multinational  corporations  height¬ 
ened  scholarly  attention  to  their  activities  and  the 
implications  of  their  global  reach.  Political  scien¬ 
tists  began  to  examine  whether  the  power  and 
influence  of  multinational  corporations  was  sur¬ 
passing  that  of  states.  Economic  interdependence 
and  dependence  on  natural  resources  such  as  oil 
sharpened  the  focus  on  this  question.  In  1971, 
Raymond  Vernon  published  Sovereignty  at  Bay, 
the  title  captured  some  observers’  anxiety  about 
multinational  corporations.  Political  scientists  rec¬ 
ognized  such  corporations  as  important  actors  in 
international  politics.  Realists  maintained  that  sov¬ 
ereign  states  were  still  the  most  important  actors 
and  pointed  out  that  states  could  expropriate  pri¬ 
vate  property  and  eject  foreign  companies. 
Economic  liberals  celebrated  the  spread  of  multina¬ 
tional  corporations  as  vehicles  for  deepening  eco¬ 
nomic  integration  and  interdependence.  Liberals 
believed  that  interdependence  could  lead  to  peace. 
Assertive  state  efforts  in  developing  countries  to 
channel  or  control  their  activities  and  academic 
criticism  of  multinational  corporations  faded  out 
over  the  course  of  the  late  1970s  and  1980s.  High 
oil  prices  and  a  devastating  debt  crisis  meant  that 
bank  lending  was  scarce.  Multinational  corpora¬ 
tions  became  important  sources  of  capital  and 
investment,  so  formerly  resistant  countries  rapidly 
dismantled  restrictive  foreign  investment  regimes  in 
an  effort  to  attract  multinational  investment. 

Controversies 

Scholars  analyzed  the  role  of  multinational  corpo¬ 
rations  in  the  political  process.  A  number  of  schol¬ 
ars  highlighted  their  bargaining  role  as  one  leg  of  a 
tripod  between  home  government,  host  govern¬ 
ment,  and  corporation.  Other  scholars  adopted  a 
more  nuanced  view,  rejecting  binary  views  of  firms 
versus  states,  and  instead  focused  on  the  mutual 
interaction  between  states’  and  firms’  preferences. 
The  globalization  literature  of  the  1990s  and  2000s 
has  examined  the  exercise  of  private  authority  in 
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the  global  political  economy,  and  particularly  cor¬ 
porations,  in  standard  setting,  economic  integra¬ 
tion,  and  norm  creation.  One  set  of  debates  has 
addressed  the  so-called  race  to  the  bottom,  in 
which  multinational  corporations  seek  the  lowest 
cost  or  the  least  restrictive  venues.  Critics  of  multi¬ 
national  corporations  focus  on  races  to  the  bottom 
to  highlight  their  contribution  to  environmental 
degradation,  the  creation  of  sweatshops,  child 
labor,  and  gendered  exploitation  in  export¬ 
processing  zones.  Critics  of  “race-to-the-bottom” 
arguments  counter  by  demonstrating  “races  to  the 
top,”  in  which  firms  export  the  more  stringent  or 
environmentally  responsible  standards  when  they 
establish  operations  abroad. 

Many  nongovernmental  organizations  have 
highlighted  multinational  corporations’  exploit¬ 
ative  practices  and  have  launched  naming  and 
shaming  campaigns  against  sweatshops,  child 
labor,  and  environmentally  unsustainable  produc¬ 
tion.  These  campaigns  have  targeted  high-profile 
companies  with  easily  recognizable  brand  names 
such  as  Nike,  The  Home  Depot,  and  Starbucks.  In 
response  to  these  campaigns,  multinational  corpo¬ 
rations  have  increasingly  embraced  a  “corporate 
social  responsibility”  approach  to  dampen  criti¬ 
cism  and  to  improve  their  reputations.  The  United 
Nations  launched  the  Global  Compact — a  volun¬ 
tary  reporting  program  in  which  multinational 
corporations  pledge  to  uphold  certain  standards  of 
good  conduct.  Corporate  social  responsibility  ini¬ 
tiatives  are  all  voluntary.  Critics  have  contended 
that  they  are  ineffective  in  altering  corporate  con¬ 
duct  and  only  serve  as  public  relations  exercises 


designed  to  prevent  stricter  government  regulation. 

Supporters  of  corporate  social  responsibility  initia¬ 
tives  argue  that  they  are  having  a  positive,  albeit 

incremental,  impact. 
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Nation  Building 


Nation-states  are  mostly  multiethnic  and  com¬ 
posed  of  various  subnations.  Nation  building  is  a 
process  of  building  a  social  community  within  a 
nation-state.  It  is  closely  related  to  the  formation 
of  states  in  postconflict  situations  after  regime 
change,  after  decolonization,  or  after  wars.  The 
nation-building  process  is  particularly  important 
in  multination  states  in  Africa.  African  colonies 
were  formed  by  the  colonial  powers  without 
regarding  their  ethnic  and  linguistic  cleavages. 
With  independence,  the  process  of  state  building 
had  to  be  strengthened  by  social  and  cultural  cohe¬ 
sion.  National  symbols  such  as  national  anthems, 
national  flags,  national  holidays,  and  national 
myths  were  used  to  overcome  tribalism  and  ethnic 
and  social  rivalries.  Nation  building  of  this  kind  is 
a  cultural  foundation  for  state  building  and  is  nec¬ 
essary  for  economic  and  social  development. 
Processes  of  nation  building  also  became  impor¬ 
tant  with  the  breakdown  of  the  Socialist  bloc  and 
state  and  nation  failure  in  the  Soviet  Union, 
Czechoslovakia,  and  the  former  Yugoslavia. 
Nation  building  is  primarily  an  indigenous  pro¬ 
cess,  and  external  assistance  often  fails.  It  is  meant 
to  reinforce  states  and  prevent  secession.  Nation 


building  is  related  to  sentiments  of  national  identity, 
which  internally  have  positive  effects  of  social  cohe¬ 
sion  by  bridging  and  bonding  but  which  may  have 
negative  consequences  in  the  form  of  xenophobia 
against  noncitizens.  Externally,  the  emphasis  on 
national  identity  can  also  reinforce  aggression 
against  neighboring  countries.  This  entry  discusses 
the  various  forms  of  historical  and  contemporary 
nation  building  and  its  ongoing  problems. 

Nation  Building:  Etymology  and  Definition 

Etymologically,  the  word  nation  is  derived  from 
the  Latin  natio,  which  stands  for  “the  act  of  being 
born.”  So,  a  nation  is  usually  defined  as  a  set  of 
people  and  tribes.  Natio  originally  was  strongly 
connected  with  a  term  such  as  ethnos  (a  people 
based  on  the  idea  of  a  common  descent),  in  con¬ 
trast  to  demos  (a  population  defined  by  common 
citizenship).  Nations  are  socially  constructed,  so  it 
is  unclear  who  is  included  and  who  is  excluded. 
This  unclear  social  cultural  definition  is  expressed 
in  Benedict  Anderson’s  concept  of  a  nation  as 
socially  constructed  “imagined  communities.” 
The  broader,  “territorial”  concept  of  the  nation¬ 
state  produces  a  clearer  definition.  Nation  building 
is  a  process  of  collective  identity  formation  to 
assert  power  in  a  certain  territory.  It  refers  to  exist¬ 
ing  institutions,  customs,  and  traditions,  and  it 
redefines  national  characteristics.  The  process  of 
state  building  depends  on  this  uniqueness  and  sov¬ 
ereignty.  The  building  of  a  nation  is  based  mostly 
on  values  and  beliefs  that  enhance  support  for  and 
the  legitimacy  of  the  (new)  state. 
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Nation  building  as  a  process  of  developing  a 
national  identity  can  be  seen  as  the  cultural  foun¬ 
dation  for  the  nation-state  and  its  supralocal 
power  structure.  If  this  cultural  projection  of  a 
nation  is  no  longer  valid,  the  social  contract 
between  the  state  and  individual  citizens  or  groups 
of  citizens  may  be  destroyed  and  violence  may  be 
triggered.  This  can  be  seen  in  the  breakup  and  col¬ 
lapse  of  a  state  such  as  Yugoslavia,  leading  to  civil 
wars  in  Bosnia-Herzegovina  and  Kosovo  in  the 
1990s. 

The  existence  of  nation-states  can  be  seen  as  a 
safeguard  for  security  and  stability.  Nation-states 
act  as  the  basis  for  economic  and  social  develop¬ 
ment.  According  to  Eric  Hobsbawm  (1990), 
national  identity  and  national  pride  are  created  by 
a  feeling  of  one’s  supremacy,  and  they  provide  a 
basis  for  nation  building.  All  forms  of  national 
identity  (national  pride,  patriotism,  etc.)  can 
become  problematic  and  are  related  to  outgroup 
hostility. 

State  Building 

Interaction  between  different  ethnic  groups  is  a 
prerequisite  for  a  successful  and  sustainable  nation¬ 
building  process.  For  this  reason,  the  social,  eco¬ 
nomic,  and  cultural  aspects  of  nation  building  are 
important.  New  forms  of  cultural  discourses  repre¬ 
senting  the  nation  as  a  whole  become  significant. 
In  this  respect,  aspects  such  as  a  common  lingua 
franca,  a  common  literature,  sports  events,  and  so 
on  are  relevant.  Finally,  a  national  infrastructure 
and  national  space  and  communication  between 
different  relevant  groups  have  to  be  developed. 
Civic  education  programs  and  information 
strengthen  the  education  of  citizens  and  demo¬ 
cratic  nation  building.  Administratively,  nation 
building  also  needs  an  effective  state  apparatus. 

Nation  failure  can  be  seen  as  an  aggravated 
form  of  state  failure  in  multicommunity  states. 
Different  subnations  define  themselves  by  shared 
class,  religion,  language,  or  ethnicity.  Too  many 
cleavages  will  make  it  difficult  to  build  a  state  and 
may  even  destroy  the  social  contract  between  citi¬ 
zens  and  the  state.  This  situation  makes  it  unlikely 
that  government  decisions  will  be  accepted  or 
adhered  to. 

State  failure  is  the  breakdown  of  public  institu¬ 
tions  meant  to  deliver  political  goods  to  citizens. 


This  may  undermine  the  legitimacy  of  the  state  and 
lead  to  failing  states  characterized  by  disharmony 
between  communities,  criminal  violence,  corrupt 
institutions,  and  a  decaying  infrastructure,  as  well 
as  inability  to  control  the  borders. 

State  building  is  the  establishment  of  political 
structure  and  policies  in  a  territory.  It  is  organized 
by  sovereign  actors  as  an  expression  of  collective 
power  without  the  use  of  coercive  measures  and 
physical  force. 

Nationalism 

Nation  building  needs  an  integrated  ideology  that 
derives  from  a  national  identity.  National  identity 
is  one  pattern  of  orientation  within  a  set  of  multiple 
social  identities  that  become  relevant  in  different 
contexts.  It  concurs  with  other  identity  narratives. 
All  are  important  for  the  cohesion  of  social  groups 
and  for  the  fostering  of  individual  self-esteem. 
Identities  are  constructed,  and  they  may  be  decon¬ 
structed  as  well.  Identity  may  occur  in  the  form  of 
chauvinistic  nationalism  and  outgroup  hostility.  It 
may  also  be  defined  and  constructed  as  a  kind  of 
constitutional  patriotism,  as  pride  in  the  social  wel¬ 
fare  state  or  as  pride  in  social-inclusion  policies.  In 
democracies,  pride  in  democratic  performance,  in 
societal  values,  and  in  peaceful  policies  becomes 
important. 

In  his  seminal  work,  Karl  Deutsch  (1954)  high¬ 
lights  the  construction  of  identities  and  nationalism 
as  well  as  its  latent  problems:  “A  nation  is  a  group 
of  persons  united  by  a  common  error  about  their 
ancestry  and  a  common  dislike  of  the  neighbors” 
(p.  3).  He  defines  nationalism  as  a  doctrine  whereby 
people  believe  that  their  culture,  history,  institu¬ 
tions,  religion,  or  principles  are  distinct  and  aspire 
for  self-rule  under  a  political  system  that  expresses 
and  protects  those  distinct  characteristics. 

Banal  Nationalism 

According  to  Michael  Billig  (1995),  the  usage  of 
symbols  and  common  narratives  can  be  seen  as  a 
kind  of  banal  nationalism.  National  identity  can  be 
related  to  national  symbols  such  as  flags,  anthems, 
and  other  symbols.  In  the  cultural  field,  a  country’s 
pride  in  its  “achievement  in  the  arts  and  literature” 
is  significant,  but  more  often  banal  nationalism  is 
related  to  sports  events.  This  “sportive  nationalism” 
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is  well-known  in  mega  sport  events  such  as  soccer 
(football)  world  cups  and  the  Olympics.  In  these 
events,  national  symbols  such  as  anthems  and  flags 
are  used.  Pride  in  the  country’s  “achievements  in 
sports”  is  expressed  as  a  form  of  patriotism. 
Nevertheless,  as  Hannah  Arendt  pointed  out,  banal 
nationalism  is  not  necessarily  harmless  or  benign. 
It  is  always  only  a  small  step  from  benign  national 
identity  to  aggressive  xenophobic  nationalism. 

National  identity  is  also  related  to  national  per¬ 
formance  and  pride  in  different  domains,  such  as 
“political  nationalism.”  Pride  in  “the  way  democ¬ 
racy  works,”  pride  in  the  “social  security  system,” 
and  pride  in  a  country’s  “fair  and  equal  treatment 
of  all  groups  in  society”  are  other  domains  within 
democratic  “patriotism.”  Pride  in  a  country’s 
“political  influence  in  the  world”  and  pride  in  “the 
armed  forces”  are  more  nationalistic.  Pride  in  a 
country’s  “economic  achievement”  and  pride  in  its 
“scientific  and  technological  achievements”  are 
more  peaceful  expressions  of  national  pride. 

Nation  building  is  related  to  national  identity. 
Identity  is  a  complex  phenomenon;  in  the  setting 
of  multiple  identities,  different  forms  of  belongings 
compete.  National  identity  competes  with  different 
local  identities  and,  increasingly,  with  suprana¬ 
tional  identities  (e.g.,  in  the  European  Union). 
There  is  also  the  question  of  who  is  included  and 
who  is  excluded. 

Banal  nationalism  is  strongly  related  to  the 
state-  and  nation-building  process,  which  is  obvi¬ 
ous  in  multiethnic  states  such  as  South  Africa. 
South  Africa  has  one  of  the  most  complex  national 
identities.  Currently,  various  identity  narratives 
exist.  The  divisions  between  the  “self”  and  the 
“other”  are  regarded  as  phenomena  characterized 
by  cleavages  in  ethnicity  and  race,  as  well  as 
language  and  religion.  Banal  nationalism  as  a  gov¬ 
ernmental  strategy  is  obvious.  Although  group 
identities  are  strong,  South  Africa  is  a  country  with 
a  very  high  level  of  nationalism,  national  pride, 
and  sports  patriotism.  This  development  of  national 
identity  is  strongly  related  to  nation-building  ide¬ 
ologies  and  strategies.  In  1995,  1  year  after  the  first 
democratic  elections,  the  rugby  World  Cup  victory 
produced  slogans  such  as  the  Rainbow  Nation  and 
Simunye  (“We  are  one”).  The  common  support  for 
the  rugby  team  (in  the  past  mainly  a  “White” 
sport)  was  seen  as  a  triumph  of  national  reconcili¬ 
ation  between  Blacks  and  Whites.  During  the  2010 


football  World  Cup  in  South  Africa,  the  use  of 
national  symbols  was  openly  regarded  as  part  of  a 
broader  nation-building  strategy. 

History  of  Nation  Building 

Efforts  of  nation  building  are  strong  in  times  of 
regime  change.  In  the  different  waves  of  democrati¬ 
zation  as  well  as  with  the  winds  of  change  accompa¬ 
nying  the  independence  of  African  countries,  nation 
building  became  a  central  aspect.  But  historically, 
nation-building  processes  are  much  older. 

In  traditional  societies,  political  power  structures 
were  mostly  oriented  toward  regional  and  local 
demands  in  relatively  autonomous  rural  village 
structures.  The  process  of  nation  building  destroyed 
most  of  the  existing  local  traditional  systems  of  cli- 
entelism  and  personal  rule.  The  centrifugal  shift  of 
power  toward  the  regional  or  national  level  can  be 
seen  as  a  process  of  conflict.  Cultural,  social,  politi¬ 
cal,  and  economic  dominance  have  to  be  redefined. 

In  general,  the  birth  of  states  and  nations  took 
place  in  conflict  situations.  As  primordial  tradi¬ 
tional  communities  shared  similar  values  and  lan¬ 
guage,  states  existed  before  the  nations  themselves 
developed. 

In  the  foundation  phase  of  most  multiethnic 
states,  the  ruling  power  reinforces  the  development 
of  the  national  state  by  not  only  using  oppression 
but  also  providing  strong  economic  incentives  and 
implementing  cultural  policies  aimed  at  cultural 
homogeneity  (language  policy,  religion,  policies  in 
primary  and  secondary  education,  etc.) 

In  the  early  phases  of  nation  building,  political 
leaders  are  often  interested  in  financial  extraction 
from  their  constituencies.  They  often  impose  a 
state  apparatus.  They  also  have  to  act  against  new 
social  movements  and  groups  agitating  for  more 
democratic  rights  and  devolution  and/or  secession. 

Regional  Characteristics 

Stein  Rokkan  analyzed  the  territorial  centralization 
and  unification  that  took  place  during  the  period 
after  the  Thirty  Years  War  ending  in  1648  in 
Europe.  He  highlights  the  role  of  language  mobili¬ 
zation  in  unification  and  centralization.  In  Europe, 
the  large,  old  dynasties  and  monarchies  encom¬ 
passed  different  ethnic  and  regional  entities  and 
used  oppression  and  personal  clientelistic  networks 
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to  develop  nation-states,  often  based  on  a  common 
religion  (cuius  regio,  eius  religio  [whose  realm,  his 
religion]).  The  Peace  of  Westphalia  in  1648  can  be 
seen  as  the  beginning  of  the  modern  state.  It  rede¬ 
fined  new  regimes,  although  most  remained  mon¬ 
archies.  In  this  regard,  the  spatial  and  temporal 
contingency  of  the  nation-state  in  Europe  in  the 
18th  and  19th  centuries  was  still  significant.  The 
Industrial  Revolution  reinforced  centralization  and 
the  development  of  cultural  harmonization  and  a 
common  ideology.  This  was  inculcated  by  nation¬ 
wide  school  systems,  the  media,  and  so  on.  In  the 
late  19th  century  and  in  the  20th  century  during 
the  interwar  period,  many  new  states  were  formed 
after  the  breakdown  of  the  former  empires.  In  most 
countries,  multiethnic  states  developed,  which 
resulted  in  different  nation-building  processes,  such 
as  consensus-oriented  strategies  (e.g.,  in  Switzerland 
and  later  also  in  the  Netherlands).  Some  conflicts 
have,  however,  persisted  (e.g.,  in  Belgium  or  Spain). 
In  the  1990s,  after  the  breakdown  of  the  Eastern 
bloc,  new  nations  emerged.  The  breakup  of 
Yugoslavia  as  well  as  of  Czechoslovakia  showed 
that  nation  building  under  socialist  rule  had  not 
been  successful  either. 

Most  Asian  states  had  been  subjected  to  British, 
Dutch,  or  French  colonial  rule.  After  indepen¬ 
dence — mostly  after  World  War  II,  with  some 
exceptions  such  as  the  secession  of  Bangladesh 
from  Pakistan  as  well  as  the  independence  of  some 
small  island  states — nation  building  was  not  seri¬ 
ously  questioned.  This  is  due  to  the  fact  that  most 
of  these  countries  became  nondemocratic,  authori¬ 
tarian,  or  socialist  states,  suppressing  any  attempts 
at  secession. 

In  South  America,  with  the  exception  of  Brazil 
(1880),  countries  achieved  independence  early  in 
the  19th  century.  Local  elites  rebelled  against  the 
Spanish  or  Portuguese  colonial  powers.  State  for¬ 
mation  mostly  followed  the  previous  colonial 
administrative  divisions.  With  very  few  exceptions 
these  were  no  longer  questioned.  In  most  coun¬ 
tries  relatively  strong  “national”  identities  emerged 
with  some  remaining  ethnic,  racial,  and  regional 
divisions.  By  contrast,  African  countries  have 
experienced  many  problems  with  regard  to  state 
and  nation  building.  At  the  Berlin  Conference  of 
1884,  the  European  powers  agreed  on  a  division 
of  Africa.  The  African  colonies  were  formed  with¬ 
out  any  respect  for  existing  ethnic  or  linguistic 


cleavages.  With  independence,  most  African  states 
inherited  a  very  heterogeneous  sociocultural  and 
ethnic  structure.  The  colonial  state  had  been  held 
together  mostly  by  oppression  of  the  colonial 
power.  The  democratic  multiparty  systems 
installed  by  the  departing  colonial  powers  were 
often  immediately  restructured  after  indepen¬ 
dence.  Most  African  countries  introduced  strong 
presidential  one-party  systems  in  the  name  of 
national  unity.  Presidents  often  attempted  to 
become  lifetime  presidents,  and  many  considered 
themselves  as  father  figures  for  disunited  nations. 
This  trend  toward  one-party  systems  was  sup¬ 
ported  during  the  Cold  War  as  the  two  blocs  dis¬ 
couraged  secessions  and  favored  stability.  Some 
authoritarian  regimes,  supported  by  the  United 
States  and  its  allies,  as  well  as  African  socialist 
regimes,  often  supported  by  either  the  former 
USSR  or  China,  used  the  ideology  of  one  party/ 
one  nation-state  to  strengthen  their  power.  In  fact, 
the  constitutional  reforms  toward  centralization 
and  a  strong  executive  were  seen  as  instruments 
against  secession. 

With  the  end  of  the  Cold  War  and  the  wave  of 
democracy  that  swept  over  the  African  continent 
in  the  early  1990s,  some  of  these  authoritarian  and 
socialist  regimes  came  to  an  end.  Secession  of  some 
regions  (e.g.,  Eritrea  and  Somaliland)  and  new 
waves  of  civil  war  (as  in  Rwanda  and  Sudan)  made 
it  obvious  that  the  nation-building  process  was  still 
not  completed.  The  latest  new  state,  which  was 
formed  after  a  referendum  in  2011,  is  South 
Sudan.  New  forms  of  power  sharing  came  into 
place,  because  many  political  groups  would  not 
accept  defeat  during  elections,  in  which  the  win¬ 
ning  party  takes  it  all.  National  governments  are 
often  seen  as  a  temporary  solution  while  a  new 
constitution  is  drawn  up  and  elections  are  orga¬ 
nized.  But  these  are  not  quick-fix  solutions  for  the 
lack  of  a  national  identity. 

In  Africa,  in  spite  of  the  artificiality  of  state  for¬ 
mation  and  the  ethnic  heterogeneity  in  many  coun¬ 
tries,  the  number  of  separatist  movements  has 
remained  relatively  small.  Separatist  movements 
base  their  claims  on  existing  administrative  divi¬ 
sions:  in  the  former  Eastern  Nigeria  (Biafra), 
Casamance  in  Senegal,  Eritrea,  Somaliland,  the 
three  separatist  provinces  in  Sudan,  Katanga  in  the 
Congo,  and  the  Anglophone  region  of  Cameroon. 
Separatist  movements  such  as  in  the  Comoros  and 
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in  Zanzibar  have  been  reinvigorated  in  the  new 
millennium.  In  many  countries,  ethnic  groups  also 
have  become  co-opted,  incorporated,  and  included 
in  the  new  political  systems.  Under  certain  rules 
and  agreements,  de-ethnicized  territorial  national¬ 
ism  coexists  quite  harmoniously  with  ethnic  politi¬ 
cization  (see,  e.g.,  South  Africa  [Zulu],  Nigeria 
[Igbo  and  YorubaJ,  and  Ethiopia  [OromoJ). 

Instruments  of  Nation  Building  and 
Constitutional  Identity  Building 

In  democratic  systems,  constitutional  identity 
building  is  an  important  factor  for  nation  building. 
Constitutional  processes  are  often  seen  as  the 
enshrining  of  political  values  in  a  basic  document. 
Jurgen  Habermas  as  well  as  Francis  Fukuyama 
championed  the  idea  of  constitutional  patriotism, 
where  the  constitution  itself  becomes  the  center  of 
strong  collective  loyalties.  It  is  a  base  for  identifica¬ 
tion  and  identity;  older,  authoritarian,  traditional 
forms  of  identity  are  replaced  and  become  less 
relevant. 

Constitutional  processes  today  are  regarded  as 
important  steps  in  nation  building;  however,  they 
often  follow  an  unrealistic  procedure.  The  consti¬ 
tutional  process  usually  starts  with  the  election  of 
a  national  assembly  by  the  people  to  “write”  the 
constitution.  The  constitution  will  then  be  offi¬ 
cially  adopted  by  a  referendum.  This  ideal  type  of 
constitutional  process  is  often  based  on  wrong 
assumptions.  Nation  building  frequently  happens 
in  postconflict  situations,  where  elections  and  ref¬ 
erenda  are  neither  feasible  nor  desirable.  In  many 
postconflict  societies,  the  former  undemocratic 
bureaucracy  has  to  implement  a  referendum  in  a 
postauthoritarian  situation.  Early  general  elections 
may  culminate  in  the  marginalization  and  alien¬ 
ation  of  minority  groups  that  had  been  neglected  in 
the  former  regimes.  This  often  triggers  civil  unrest 
and  hinders  the  development  of  a  broad  acceptance 
of  the  new  constitution.  Postconflict  situations 
require  other  forms  of  early  nation-building  pro¬ 
cesses  with  deadlock-breaking  mechanisms  as  well 
as  other  instruments  of  reaffirmation  and  conflict 
resolution.  An  agreement  on  general  constitutional 
principles  or  a  referendum  at  the  beginning  of  the 
process  may  have  a  reassuring  effect. 

States  may  consist  of  different  nations,  and 
there  are  also  multination  states.  The  drafting  of 


the  constitution  must  reflect  the  diversity  of  exist¬ 
ing  traditional  values  and  norms.  Multinational 
fragmented  societies  should  aim  for  a  broad  repre¬ 
sentation  and  a  strong  devolution  of  power  to  the 
local  and  regional  levels. 

The  process  of  the  constitutional  discourse  as 
well  as  the  constitutional  text  itself  should  include 
traditional  institutions,  symbols,  and  values  that 
highlight  the  deep  collective  identity  to  enhance 
constitutional  patriotism  in  order  to  build  the 
united  nation. 

In  postwar  societies  with  regime  changes  (e.g., 
Germany  after  World  War  II,  Afghanistan,  and 
Iraq  in  the  2000s),  nation  building  often  is  regarded 
as  a  process  that  can  be  supported  by  external 
actors.  This  support  can  take  place  through  devel¬ 
opment  aid  and  investment  in  the  form  of  resources, 
personal  assistance,  and  economic  contributions 
by  the  external  power.  Military  support  may  also 
be  given  for  the  new  state  to  maintain  order  in  the 
transition  period.  Nation  building  is,  however, 
basically  an  indigenous  process  of  identity  forma¬ 
tion.  The  ability  of  outsiders  to  influence  nation 
building  is  often  overestimated.  The  idea  of  inject¬ 
ing  certain  values  and  concepts  often  fails  because 
the  development  of  a  national  identity  is  a  long¬ 
term  process.  Both  external  hegemons  and  supra¬ 
national  institutions  such  as  the  United  Nations 
often  fail  in  their  overconfident  search  for  a  quick 
solution.  The  new  states  tend  to  fail  before  they  are 
formed. 

Nation  building  is  more  successful  when  the 
external  power  realizes  that  short-term  strategies 
are  unlikely  to  succeed  in  a  country  with  strong 
religious,  ethnic,  and  economic  cleavages.  In  such 
cases,  the  intervening  external  power  has  to  iden¬ 
tify  allies  that  support  its  ideas  or  ideology.  This 
process  of  transformation  and  nation  building  has 
to  be  supported  by  institutional  development  and 
new  political  structures.  Although  it  is  often  prob¬ 
lematic,  it  is  crucial  to  remove  political  leaders 
from  key  positions.  Nevertheless,  most  postau¬ 
thoritarian  regimes  are  characterized  by  a  “pacted 
transition”  process,  where  the  former  regime’s 
state  apparatus  must  be  integrated  in  the  formation 
process  of  the  new  nation.  A  strategy  of  acceptance 
of  the  values  of  the  people  instead  of  insensitive 
intervention  seems  to  be  more  successful.  National 
building  is  based  on  national  identity  as  the  cohe¬ 
sive  force  that  holds  nation-states  together. 
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Internally,  it  can  be  used  benignly  and  harmlessly 
in  the  nation-building  process.  But  this  ingroup 
solidarity  encompasses  the  risk  of  outgroup  hostil¬ 
ity.  In  some  cases,  it  may  directly  lead  to  external 
aggression.  But  strong  nationalism  also  encom¬ 
passes  other  problems.  On  the  one  hand,  a  strong 
national  identity  may  be  directed  against  migrants 
and  noncitizens.  On  the  other  hand,  nation  build¬ 
ing  can  be  used  as  a  form  of  repression  by  the 
majority  ethnic  group  within  the  same  community. 
This  can  lead  to  conflicts  oriented  toward  seces¬ 
sion.  The  politicization  of  national  identities  and 
ethnic  conflicts  remains  a  risk. 

Norbert  Kersting 
Stellenbosch  University 
Stellenbosch,  South  Africa 
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National  Interest 


The  concept  of  the  national  interest  has  long  been 
central  to  the  conduct  and  the  analysis  of  state 
action,  and  particularly  of  foreign  policy.  The 
concept  is  so  central  because  it  is  assumed  that  the 
notion  of  interest  captures  the  motives  that  drive 


states,  or  their  decision  makers,  to  act.  Calling  it 
the  national  interest  is,  of  course,  something  of  a 
misnomer,  as  the  interest  in  question  belongs  to 
the  state — assumed  to  be  a  nation-state — in  its 
interactions  with  other  states.  Governments,  and 
specifically  their  foreign  policy  decision  makers, 
determine  the  substantive  content  of  the  national 
interest.  Although  this  content  changes  over  time 
and  varies  across  space,  the  term  generally  denotes 
the  state’s  most  important  foreign  policy  aims, 
ones  that  require  extensive  resources  and  even,  in 
the  extreme  case,  the  sacrifice  of  lives.  Since 
World  War  II,  national  security  has  frequently 
been  posited  as  the  most  salient  national  interest, 
which  captures  the  idea  that,  ultimately,  the  pri¬ 
mary  interest  of  the  state  has  to  be  the  protection 
of  its  existence  through  the  defense  of  its  sover¬ 
eignty  and  its  territory. 

How  the  Concept  Is  Deployed 

As  a  concept,  the  national  interest  is  both  ubiqui¬ 
tous  and  divisive.  It  is  ubiquitous  because  it  is 
deployed  in  at  least  three  important  and  overlap¬ 
ping  ways,  which  serve  distinct  purposes.  First,  for 
scholars  of  international  relations  and  foreign 
policy,  the  concept  is  understood  to  be  explana¬ 
tory.  Scholars  thus  invoke  the  national  interest  to 
explain,  for  instance,  why  the  United  States  entered 
World  War  II  (e.g.,  it  was  in  the  U.S.  national 
interest  to  prevent  German  expansion  in  Europe), 
why  the  United  States  fought  the  Vietnam  war 
(e.g.,  it  was  contrary  to  U.S.  national  interests  to 
allow  South  Vietnam  to  fall  to  communism),  and 
why  the  Taliban  had  to  be  removed  from  power  in 
Afghanistan  in  2001  (e.g.,  it  is  in  the  U.S.  national 
interest  both  to  promote  democracy  and  to  fight 
terrorism).  In  this  usage,  the  national  interest  is 
invoked  causally,  to  explain  policy  decisions  and 
the  attendant  state  actions. 

Second,  the  national  interest  can  also  be  thought 
of  as  the  language  of  foreign  policy  decision  mak¬ 
ing  or,  more  generally,  of  state  action.  It  is  through 
the  notion  of  the  national  interest  that  foreign 
policy  decision  makers  seek  to  determine  what  the 
aims  of  their  foreign  policies  should  be.  The 
national  interest  thus  forms  the  practical  basis  for 
state  action  in  international  politics.  This  adds  a 
normative  dimension  to  the  concept,  in  that  deci¬ 
sion  makers  generally  assume  that  they  ought  to 
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act  in  the  national  interest.  In  the  aftermath  of  the 
attacks  of  September  11,  2001,  for  example,  Tony 
Blair’s  government  in  the  United  Kingdom  (UK) 
determined  that  it  was  in  the  British  national 
interest — that  is,  that  Britain  ought — to  support  the 
Bush  administration’s  global  war  on  terror,  support 
the  U.S.  war  against  the  Taliban  in  Afghanistan, 
and  participate  in  producing  regime  change  in  Iraq 
in  2003.  In  this  second  conception  of  the  national 
interest,  as  defined  by  decision  makers,  it  provides 
the  practical  aims  or  goals  of  state  action. 

Third,  the  national  interest  serves  a  prominent 
legitimating  function.  It  is  routinely  deployed  by 
both  politicians  and  commentators  to  justify  for¬ 
eign  policy  decisions  and  state  actions.  The  national 
interest  is  a  powerful  rhetorical  tool,  which,  in 
addressing  its  audience  as  members  of  a  nation, 
asks  them  to  agree  to  policies  in  their  own — that 
is,  national — interest.  Most  starkly,  when  states 
require  soldiers  to  put  their  lives  in  jeopardy  in  the 
service  of  foreign  policy,  this  is  done  in  the  name 
of  the  national  interest.  The  national  interest,  on 
this  third,  rhetorical  usage,  thus  contributes  to  the 
creation  of  public  consent  to  and  support  for  for¬ 
eign  policy  and  state  action. 

As  several  of  the  examples  above  already  indi¬ 
cate,  in  addition  to  being  ubiquitous,  claims  to  the 
national  interest  are  often  divisive.  This  is  because 
the  actual  content  of  the  national  interest  is  inevita¬ 
bly  political:  It  depends  on  and  reproduces  differen¬ 
tial  power  relations  and  differentially  affects  diverse 
publics.  As  a  result,  the  content  of  the  national 
interest  is  both  inevitably  contestable  in  principle 
and  often  contested  in  practice.  For  instance,  the 
prosecution  of  the  U.S.  war  in  Vietnam  during  the 
Cold  War  precipitated  mass  protest  as  diverse  pub¬ 
lics  and  elites  in  the  United  States  challenged  the 
claim  that  fighting  communism  in  South  Vietnam 
was  actually  in  their  national  interest.  Similarly, 
protests  in  the  UK  since  the  invasion  of  Iraq  in  2003 
demonstrate  that  many  Britons,  both  among  the 
public  and  among  elites,  question  whether  either 
the  invasion  of  Iraq  or  the  close  alliance  with  the 
United  States  are  in  fact  in  their  national  interest. 

Although  a  variety  of  different  theoretical 
approaches  can  and  do  deploy  the  concept  of  the 
national  interest,  at  least  two  broad  and  distinctly 
different  approaches  can  be  identified.  The  first 
treats  national  interests  in  a  positivist  manner  as 
objective,  as  given  by  the  international  system  and 


by  power  relations  among  states.  The  second  treats 
national  interests  in  a  postpositivist  manner  as 
social  constructions,  as  socially  constituted  in  rela¬ 
tion  to  identity.  These  different  understandings  of 
the  character  of  the  national  interest  lead  to  diver¬ 
gent  analyses  of  foreign  policy  and  international 
politics. 

Comparison  of  Theoretical  Approaches 

Theoretical  approaches  to  the  study  of  foreign  pol¬ 
icy  and  international  relations  such  as  both  realism 
(and  its  variants)  and  liberalism  (and  its  variants) 
treat  national  interests  as  objective  and  as  given. 
Political  realism  provides  a  good  example  of  such  a 
conceptualization.  In  this  approach,  it  is  assumed 
that  an  international  system  made  up  of  sovereign 
states  is  anarchic,  without  overarching  or  legitimate 
authority.  States  must  therefore  provide  for  their 
own  security.  Uncertainty  about  the  behavior  of 
other  states  means  that  the  fundamental  interest  of 
any  state  must  be  to  amass  enough  power  to  protect 
itself  from  the  threat  potentially  posed  by  other 
states.  The  most  basic  national  interest — amassing 
power  to  ensure  the  state’s  survival — is  thus  objec¬ 
tively  given  by  the  nature  of  the  international  sys¬ 
tem.  Since  the  actual  survival  of  states  is  rarely  at 
risk,  more  specific  national  interests  are  determined 
through — read  off  of — the  realistic  assessment  of 
objective  power  relations  and  potential  external 
threats.  In  the  face  of  a  threatening  powerful  neigh¬ 
bor,  for  example,  a  small  state  has  a  national  inter¬ 
est  in  allying  with  other  states.  In  the  face  of  its 
objective  geographical  location,  Russia  has  a 
national  interest  in  access  to  warm- water  ports.  In 
the  face  of  a  relative  decline  in  economic  efficiency, 
the  national  interest  in  protectionist  measures 
increases.  And  in  the  face  of  international  terror¬ 
ism,  a  global  war  on  terror  becomes  the  national 
interest.  In  each  case,  the  concrete  national  interest 
is  given;  it  is  determined  by  objective  external 
threats  and  conditions.  Interests,  therefore,  are 
assumed  to  follow  from  external  and  objectively 
identifiable  threats  and  are  directly  accessible  to 
policy  makers  and  analysts  alike. 

Theoretical  approaches  such  as  constructivism 
and  poststructuralism,  in  contrast,  take  a  funda¬ 
mentally  different  approach.  Rather  than  treating 
national  interests  as  objective  and  given,  they  treat 
national  interests  as  socially  constructed  in  relation 
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to  identity.  One  constructivist  version  of  this  argu¬ 
ment  begins  with  the  understanding  that  the  social 
world  is  constituted  through  discursive  practices — 
that  is,  practices  that  produce  social  meaning, 
including  identities,  whether  individual  or  collec¬ 
tive.  Foreign  policy  discourse,  for  example,  consti¬ 
tutes  the  state  as  a  particular  state,  constitutes  the 
diverse  identities  of  other  states,  and  defines  inter¬ 
ests  in  relation  to  those  identities.  It  was  thus  a 
particular  U.S.  state — constituted  as  democratic 
and  freedom  loving  rather  than,  say,  as  aggressive 
and  imperialist — that  had  the  Cold  War  national 
interest  in  leading  its  allies  in  the  free  world  in  the 
grand  strategy  of  containing  the  totalitarian  Soviet 
threat.  From  this  viewpoint,  threats  to  states  and 
the  interests  of  states  in  the  face  of  those  threats  are 
fundamentally  matters  of  interpretation.  Nuclear 
weapons  provide  an  example.  It  is  not  the  physical 
fact  that  nuclear  weapons  can  vaporize  cities  that 
makes  them  threats.  British,  U.S.,  and  Israeli 
nuclear  capabilities  are  not  interpreted  in  that  way 
by  many  states.  Nuclear  weapons  only  become 
threats  to,  say,  U.S.  interests — necessitating  their 
removal  (e.g.,  from  Cuba)  or  the  prevention  of 
their  development  (e.g.,  by  Iran) — when  their  exis¬ 
tence,  or  the  possibility  of  their  existence,  is  inter¬ 
preted  as  dangerous  because  they  are  wielded  by 
communists,  by  rogue  states,  or  by  other  dangerous 
identities.  That  is,  threats  are  socially  constituted  as 
such  in  discourse — they  are  not  obvious  or  given. 
In  this  approach,  the  same  is  true  of  interests. 

These  two  distinct  approaches  to  the  national 
interest  issue  result  in  divergent  analyses  of  foreign 
policy  and  international  politics.  Contrasting  them 
also  highlights  the  inevitably  political  nature  of  the 
identification  and  deployment  of  the  national 
interest  within  either  perspective. 

Briefly  sketching  the  Cuban  missile  crisis  of 
1962  from  within  the  first  approach — of  given, 
objective  interests — highlights  the  deployment  in 
Cuba  of  Soviet  nuclear-capable  missiles  and  the 
need  for  a  U.S.  policy  response.  On  this  view,  the 
situation  faced  by  the  United  States  and  its  allies  in 
October  1962  was  one  in  which  power  relations 
had  changed  significantly.  The  initially  secret  Soviet 
deployment  meant  that  the  former  Soviet  Union 
now  had  at  least  40  more  nuclear  weapons  and  that 
these  weapons  could  strike  targets  in  North  and 
South  America.  These  offensive  missiles  posed  an 
objective  threat  to  the  United  States  for  several 


(albeit  debated)  reasons:  They  altered  the  strategic 
balance;  they  gave  the  Soviet  Union  a  military 
bridgehead  in  the  Caribbean;  and  they  provided  a 
bargaining  chip  to  force  a  settlement  in  West  Berlin 
and  in  Germany  as  a  whole.  The  U.S.  national 
interest  in  response  to  this  changed  situation  was 
indisputable:  The  missiles  had  to  be  removed. 

Briefly  sketching  the  missile  crisis  from  the 
second  perspective — that  of  interests  socially  con¬ 
structed  in  relation  to  identity — provides  a  differ¬ 
ent  analysis.  This  approach  asks:  How  was  this 
particular  national  interest  constructed  and  in  rela¬ 
tion  to  which  identities?  How  did  the  removal  of 
the  missiles  come  to  be  understood  as  the  U.S. 
national  interest?  It  examines  the  construction  in 
U.S.  foreign  policy  discourse,  by  both  foreign  pol¬ 
icy  decision  makers  and  analysts,  of  a  democratic 
and  freedom-loving  United  States  leading  the  free 
world  (its  allied  others)  in  the  fight  against  the 
Soviet  Union  (the  evil,  totalitarian  other).  In  Cold 
War  discourse,  U.S.  leadership  was  always  defen¬ 
sive:  Whether  in  Berlin,  Korea,  Vietnam,  or  Cuba, 
it  protected  freedom  from  an  aggressive  and  total¬ 
itarian  international  communism  led  by  the  Soviet 
Union.  In  this  context,  the  missiles  were  inter¬ 
preted  as  necessarily  offensive  and  the  interest  in 
their  removal  confirmed.  However,  missiles  are 
not  inherently  offensive  or  defensive;  rather,  they 
must  be  constructed  as  such.  Moreover,  interpret¬ 
ing  them  as  offensive  dismissed  from  the  outset 
Soviet  and  Cuban  claims  that  the  missiles  were 
deployed  legally  to  defend  Cuba  from  a  second 
Bay  of  Pigs  invasion.  But  other  constructions — 
that  the  missiles  were  annoying  but  legal  or  that 
they  were  irrelevant  as  the  United  States  did  not 
intend  to  invade  Cuba — were  in  fact  possible. 
Within  this  approach,  the  U.S.  national  interest  in 
response  to  the  Soviet  missile  deployment  was  not 
indisputable  and  might  have  allowed  the  United 
States  to  make  no  policy  response  at  all. 
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Nationalism 


Nationalism  is  one  of  the  most  difficult  and  con¬ 
tested  concepts  in  social  science.  There  are  funda¬ 
mental  disagreements  on  its  definition,  origins, 
historical  location,  and  normative  status.  This 
entry  examines  the  concept  of  nationalism  and 
explores  these  disagreements  on  how  it  should  be 
understood. 

Definition 

In  a  very  general  sense,  nationalism  implies  a 
strong  attachment  to  the  nation  as  a  human  collec¬ 
tive.  In  the  social  sciences,  however,  it  tends  to  be 
defined  more  strictly  as  the  belief  that  the  nation 
should  form  the  basis  for  political  order.  At  one 
time,  it  was  common  to  define  it  as  a  movement  in 
search  of  its  own  state  or  to  extend  the  reach  and 
power  of  an  existing  state.  In  recent  years,  it  has 
been  recognized  that  nationalists  may  pursue  other 
forms  of  polity,  including  that  of  their  own  decen¬ 
tralized  or  federal  region.  Attention  has  also 
extended  from  nationalisms  seeking  to  change  the 
political  order  to  nationalism  as  an  everyday 
mechanism  for  sustaining  the  existing  political 
order  of  the  nation-state,  as  in  Michael  Billig’s 
“banal  nationalism.” 

This  still  leaves  the  question  of  how  the  nation 
itself  is  defined.  For  some,  the  nation  is  a  politi¬ 
cized  ethnicity,  but  this  merely  begs  the  equally 


difficult  question  of  what  an  ethnic  group  is. 
Besides,  whatever  definition  of  ethnic  group  we 
use,  many  of  them  are  not  organized  as  nations 
nor  would  their  members  recognize  them  as  such. 
There  appears  to  be  no  one  single  characteristic 
that  would  qualify  a  group  as  a  nation,  be  it  cul¬ 
ture,  institutions,  territory,  or  history.  In  many 
cases,  indeed,  the  idea  that  a  particular  group  is  a 
nation  is  contested.  There  are  various  ways  out  of 
this  conundrum.  One  is  to  divide  nationalisms  into 
different  types  qualified  with  prefixes  or  adjectives, 
but  this  risks  losing  the  central  concept  altogether. 
A  second  is  to  treat  the  nation  as  bearing  resem¬ 
blance  to  a  family  rather  than  one  with  a  single 
core  meaning.  This  allows  us  to  cope  with  the 
complexity  and  variety  of  nationalisms  and  to 
compare  aspects  of  each,  rather  than  forcing  them 
into  the  same  mold.  A  third  way  is  to  see  the 
nation  as  a  subjective  category,  so  that  if  people 
feel  they  are  a  nation,  then,  they  are.  A  fourth  is  to 
treat  nationhood  as  a  claim  made  both  as  to  the 
existence  of  a  group  and  to  a  set  of  rights  that  per¬ 
tain  to  nationality,  rather  than  as  a  sociological 
category.  These  central  definitional  problems  have 
made  it  impossible  to  arrive  at  a  general  theory  of 
nationalism  applicable  in  all  contexts  and  at  all 
times.  Most  of  the  literature  therefore  looks  at 
particular  nationalisms  or  tries  to  generalize  from 
specific  cases,  with  decidedly  mixed  results. 

Origins 

The  origins  and  causes  of  nationalism  are  equally 
disputed.  It  is  common  to  place  primordialists  and 
perennialists  at  one  extreme  and  constructivists 
and  modernists  at  the  other.  A  primordialist  would 
believe  that  nations  have  their  origins  in  essential 
features  of  humanity  that  are  difficult  if  not  impos¬ 
sible  to  change.  Some  nationalists  themselves  have 
subscribed  to  this  view,  especially  in  the  late  19th 
century  when  biological  theories  of  race  were 
developing.  Nationality  is  presented  here  as  an 
essential  characteristic  bred  into  individuals.  Very 
few  nationalists  and  fewer  social  scientists  nowa¬ 
days  would  admit  to  being  primordialists,  although 
some  sociobiologists  seek  to  derive  the  nation  from 
the  human  need  for  collectivities  and  the  instinct 
for  territory  and  mutual  protection.  Whatever  the 
merits  of  this  as  a  theory  of  human  behavior,  how¬ 
ever,  it  can  scarcely  explain  why  communities 
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should  take  the  form  of  nations  in  general  or  of 
specific  nations  in  particular.  It  thus  fails  as  a  the¬ 
ory  of  nationalism.  Perennialists  are  not  primordi- 
alists,  in  that  they  do  not  believe  that  nations  are 
part  of  the  hardwiring  of  the  human  being  and 
rather  emphasize  culture  and  deeply  ingrained 
social  habits.  They  insist  that  nations  go  back  a 
long  way  in  time  and  are  not  as  easily  manipulated 
or  rebuilt  as  constructivists  and,  especially,  instru¬ 
mentalists  would  think.  They  will  cite  the  existence 
of  things  looking  very  like  nations  in  antiquity  and 
locate  the  origins  of  European  nations  in  premod¬ 
ern  times. 

Constructivists,  on  the  other  hand,  see  nations 
as  the  work  of  human  volition  and  interaction. 
Although  they  could  logically  be  constructivists 
and  perennialists,  locating  the  construction  of 
nations  in  the  past,  they  are  almost  invariably 
modernists,  insisting  that  nations  and  nationalism 
are  the  product  precisely  of  modernization  and 
that  it  makes  no  sense  to  talk  of  them  before,  at  the 
earliest,  the  late  18th  century.  Some  modernists 
such  as  Ernest  Gellner  are  sociological  and  func¬ 
tionalist  in  their  approach,  seeking  the  principle  of 
the  nation  in  the  needs  of  modern  industrial  soci¬ 
ety  for  literacy,  communication,  and  mobilization 
and  the  breaking  down  of  old  barriers  to  social 
and  geographical  mobility.  Benedict  Anderson  is 
famous  for  the  phrase  “imagined  communities,” 
by  which  he  meant  not  that  nations  are  imaginary 
and  somehow  false  but  that  since  members  of  a 
nation  cannot  know  each  other  personally,  they 
have  to  imagine  the  collectivity  (in  fact,  the  idea 
had  earlier  been  suggested  by  Hans  Kohn).  This 
was  not  possible  before  the  invention  of  print 
media.  Sociological  modernists  have  been  criti¬ 
cized  as  teleological  or  functionalist,  seeing  the 
function  that  nationalism  serves  as  somehow 
explaining  why  it  came  about  without  specifying 
the  mechanism  or  the  agent,  although  Anderson 
makes  more  of  an  effort.  The  timing  and  geogra¬ 
phy  are  also  problematic,  given  that  some  nations 
and  nationalisms  appear  to  have  emerged  before 
industrialization  and  in  nonindustrial  societies. 

More  political  approaches  agree  with  John 
Breuilly  that  nationalism  is  essentially  a  form  of 
politics,  especially  modern  politics.  Often  it  is 
linked  to  mass  politics  and  democratization.  A  key 
legacy  is  that  of  the  French  Revolution,  which  pro¬ 
claimed  the  sovereignty  of  the  people.  This  made  it 


necessary  to  define  the  people  in  some  way  short 
of  the  entire  human  species,  and  the  nation  was  the 
result.  In  exporting  their  revolution,  the  French 
invited  other  peoples  to  throw  off  their  rulers  and 
proclaim  their  own  sovereignty,  with  the  ironic 
result  that  they  turned  against  their  French  libera¬ 
tors  and  forged  national  movements  of  their  own. 

Instrumentalists  stress  the  role  of  elites,  whether 
state  rulers  or  their  challengers,  in  creating  nations 
to  mobilize  mass  populations  in  the  modern  era. 
They  insist  that  there  is  nothing  natural  about 
nations  and  that  they  are  manufactured  for  specific 
purposes,  although  sometimes  they  take  on  a  life 
of  their  own.  While  perennialists  believe  that  it  is 
nations  that  create  states,  instrumentalists  will 
often  stress  the  role  of  states  in  creating  nations 
using  the  socialization  tools  that  they  possess, 
including  control  over  education,  the  media,  civic 
ceremonial  military  service,  and  the  welfare  state, 
to  create  shared  identities  and  solidarities.  Other 
scholars  believe  that  either  route  is  possible,  so 
that  we  must  distinguish  between  those  cases 
where  the  nation  created  the  state  and  those  where 
the  reverse  occurred.  Instrumentalists,  in  their 
turn,  have  been  criticized  on  the  grounds  that 
nation  builders  need  something  preexisting  to 
work  with  and  cannot  create  communities  of  such 
power  and  durability  ex  nihilo. 

In  favor  of  the  modernist  argument  is  the  fact 
that  the  19th  century  was  an  era  of  marked  nation¬ 
alism.  The  French  Revolution,  which  took  a  cen¬ 
tury  to  work  itself  out,  was  a  profoundly  national 
affair,  and  it  is  argued  that  it  created  Frenchmen 
out  of  the  population  of  a  dynastic  state  with  no 
previous  sense  of  popular  unity.  The  message  was 
taken  up  in  Germany  and  Italy,  where  nationalism 
was  harnessed  by  leaders  seeking  to  enlarge  their 
states  and,  for  the  first  time,  inviting  the  people  into 
the  enterprise.  Across  the  great  empires  (German, 
Habsburg,  Ottoman,  Russian,  and  British),  there 
was  an  “awakening  of  the  nationalities”  as  move¬ 
ments  emerged  demanding  self-government.  Most 
of  these  sought  a  measure  of  autonomy  within  their 
imperial  systems,  but  some  wanted  full  indepen¬ 
dence;  with  the  collapse  of  the  empires  in  World 
War  I,  independent  statehood  seemed  the  only 
option.  The  problem,  however,  was  that  there  were 
not,  and  could  not  be,  enough  states  to  satisfy  all 
the  demands  being  made,  which  were  often 
focused  on  the  same  territories.  So  the  nation-state 
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principle  could  triumph  only  in  restricted  parts  of 
the  European  continent. 

Against  the  modernist  argument  is  the  fact  that 
many  European  nations  can  trace  their  origins 
back  to  an  earlier  period.  Adrian  blastings  and 
Liah  Greenfeld  cite  England  as  an  example  of  early 
nationhood,  certainly  by  the  16th  century. 
Scotland,  having  resisted  incorporation  into  the 
English  Crown,  was  another  example,  complete 
with  an  early  doctrinal  justification  for  its  indepen¬ 
dence.  We  may  rightly  be  skeptical  of  the  claims  of 
nationalist  historians  about  the  antiquity  of  France 
or  its  continuity  with  earlier  polities  in  the  same 
region,  but  the  French  state  can  trace  a  long  lin¬ 
eage  all  the  same.  Probably  the  best  way  of  resolv¬ 
ing  the  question  “When  was  the  nation?”  is  to  tie 
it  to  the  question  “What  is  the  nation?”  Then,  we 
can  accept  that  there  were  indeed  proto-nations  in 
premodern  times  while  confining  the  meaning  of 
the  nation  and  of  nationalism  in  its  modern  sense 
to  the  modern  era. 

An  effort  to  bridge  the  primordialist/constructivist 
and  perennialist/modernist  divides  is  the  ethno- 
symbolism  of  Anthony  Smith.  Nations,  in  this  vision, 
are  built  on  ethnic  cores  or  etbnies,  which  them¬ 
selves  are  rather  ancient  and  hard  but  which 
expand  to  incorporate  peripheral  territories,  usu¬ 
ally  through  the  agency  of  the  state,  so  creating 
modern  civic  nations.  The  notion  of  an  ethnic  core, 
however,  is  problematic  and  difficult  to  operation¬ 
alize.  The  center-periphery  model  of  nation  build¬ 
ing  is  a  common  device  in  macro  history,  but 
endowing  the  center  with  an  ethnic  element  is 
another  matter.  In  the  case  of  England  and  then 
Britain  (one  of  Smith’s  examples),  the  core,  in  the 
southeast  of  England,  was  an  ethnic  melting  pot  of 
Anglo-Saxons,  Norman-French,  and  Danes. 
Softening  the  concept  of  ethnic  group  by  labeling 
it  as  an  etbnie  does  not  resolve  the  essential  diffi¬ 
culties  of  this  concept,  which  seems  to  have  no 
core  meaning  but  refers  to  a  range  of  traits,  includ¬ 
ing  language,  culture,  kinship,  and  proximity, 
none  of  which  is  either  necessary  or  sufficient. 
Showing  that  modern  nations  had  earlier  predeces¬ 
sors  does  not  prove  the  case  either  since  there  were 
myriad  collective  identities  in  premodern  times, 
the  great  majority  of  which  did  not  evolve  into 
nations.  So,  like  the  argument  about  the  need  for 
community  and  belonging,  this  does  not  explain 
the  modern  nation  in  general  or  in  particular. 


Nationalist  Doctrine 

Nationalism  has  often  been  criticized  for  lacking  a 
core  doctrine  or  great  thinkers.  It  is  true  that  there 
have  been  few  grand  normative  theories  and  that 
scholars  writing  about  nationalism  have  often 
tended  to  be  skeptical  if  not  outright  condemna¬ 
tory,  as  is  Elie  Kedourie.  It  is  also  argued  that  an 
overall  doctrine  of  nationalism  is  impossible  since 
it  is  based  essentially  on  a  fiction  (the  existence  of 
nations)  and  that  it  is  not  universalizable  since 
nationalisms  are  by  definition  opposed  to  each 
other.  There  is  an  element  of  truth  in  this,  but 
nationalism  is  no  more  or  less  constructed  than 
other  sociological  concepts  like  class.  Nineteenth- 
century  nationalists  such  as  Giuseppe  Mazzini  and 
Johann  Gottfried  von  Herder  did  try  to  universal¬ 
ize  their  ideas,  albeit  with  very  limited  success.  The 
principle  of  national  self-determination  has  enjoyed 
a  measure  of  international  support  since  the  mid- 
20th  century,  recognized  by  the  United  Nations 
but  tightly  circumscribed  to  apply  only  to  existing 
states  or  to  colonial  territories  (by  the  “salt  water 
doctrine”  whereby  colonies  are  by  definition  sepa¬ 
rated  from  the  colonizer  by  a  sea)  and  not  to 
nationalist  movements  within  consolidated  states, 
especially  when  these  are  democracies.  Beyond 
that,  the  view  has  prevailed  that  a  universal  doc¬ 
trine  right  of  self-determination  would  be  unwork¬ 
able,  given  the  overlapping  and  competing  claims 
and  an  invitation  for  opportunists  to  create  their 
own  national  movements. 

The  doctrines  employed  by  nationalist  move¬ 
ments  also  adapt  themselves  to  the  times  and  to 
prevailing  ideas  of  legitimacy.  So  in  the  19th  cen¬ 
tury,  the  concept  of  race  was  used  rather  freely, 
together  with  primordialist  ideas  since  these  were 
also  being  deployed  by  states.  In  recent  years,  many 
nationalist  movements  have  sought  to  stress  their 
liberal  democratic  credentials  and  the  voluntarist 
conception  of  the  nation  as  a  union  of  citizens,  in 
line  with  prevailing  norms.  With  the  collapse  of  the 
central  empires  in  Europe,  nationalism  became  a 
doctrine  of  separatism,  inspired  partly  by  Woodrow 
Wilson’s  apparent  endorsement  of  the  doctrine  of 
self-determination.  In  the  late  20th  century,  how¬ 
ever,  other  movements  adopted  postsovereigntist 
ideas  about  self-determination  within  larger  entities 
like  federations,  confederations,  or  supranational 
orders  such  as  the  European  Union. 
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Classifying  Nationalisms 

Given  the  difficulties  in  identifying  a  core  meaning 
or  doctrine  for  nationalism,  it  is  not  surprising  that 
scholars  have  sought  to  define  nationalism  accord¬ 
ing  to  different  types.  This  enterprise  has  also 
proved  frustrating  for  a  number  of  reasons,  not 
least  that  the  concepts  employed  often  have  a 
strong  normative  loading.  There  is  a  practical  dis¬ 
tinction  between  the  nationalism  of  established 
states  and  that  of  movements  seeking  to  create  their 
own  states  or  otherwise  advance  self-determination 
claims.  As  noted,  this  underpins  the  international 
norm  upholding  the  former  but  not  the  latter.  It  is 
less  clear  that  this  represents  a  difference  in  princi¬ 
ple.  Now  that  the  nation-state  has  been  demystified 
and  its  sovereignty  curtailed  by  transnational  inte¬ 
gration,  its  moral  and  practical  supremacy  can  no 
longer  be  taken  for  granted,  and  it  must  justify 
itself  on  the  same  principles  as  its  challengers.  A 
distinction  is  also  drawn  between  aggressive  nation¬ 
alisms,  seeking  domination  over  other  nations,  and 
defensive  nationalisms,  seeking  only  to  run  their 
own  affairs.  This  looks  like  both  a  practical  and  a 
normative  distinction,  but  where  nationality  claims 
are  entangled  and  overlapping,  as  in  the  Balkans,  it 
does  not  always  yield  a  definitive  solution. 

Perhaps  the  most  enduring  dichotomy  is  that 
between  civic  and  ethnic  nationalism.  This  origi¬ 
nated  in  19th-century  comparisons  of  France  and 
Spain  and  was  articulated  in  Hans  Kohn’s  distinc¬ 
tion  between  Eastern  and  Western  nationalisms. 
The  former,  which  included  Germany,  were  por¬ 
trayed  as  inward  looking  and  backward  looking, 
defining  the  nation  on  narrowly  ascriptive  grounds 
and  prone  to  xenophobia  and  aggression.  Western 
nationalisms,  by  contrast,  were  built  on  the  civic, 
territorial  state,  open  to  all  within  their  borders 
and  founded  on  democratic  principles.  John 
Plamenatz  made  a  similar  distinction  between  lib¬ 
eral  nationalism  developing  in  England  and  the 
nationalisms  of  continental  Europe,  although  he 
mostly  placed  Germany  on  the  civic  side  of  the 
line.  The  geographical  distinction  has  been  criti¬ 
cized  as  questionable  and  as  arising  from  stereo¬ 
typing  and  is  based  on  a  border  that  changes  with 
the  various  authors.  It  is  easily  used  as  an  ideo¬ 
logical  justification  for  the  Western  powers  to 
assert  their  superiority.  The  idea,  however,  keeps 
coming  back.  Recent  authors  have  retained  the 


ethnic-civic  distinction  while  seeing  examples  of 
both  in  all  parts  of  Europe.  Others,  such  as 
Maurizio  Viroli,  have  reframed  the  distinction  as 
that  between  nationalism  and  patriotism.  The  same 
distinction  has  been  tapped  by  differentiating  inclu¬ 
sive  from  exclusive  nationalism  and  cultural  from 
political  nationalism  or  from  territorial  nationalism. 
While  subtly  different,  these  distinctions  are  similar 
in  tendency,  and  all  encompass  both  analytical  and 
normative  elements.  They  have  been  criticized  in 
turn  on  the  ground  that  there  are  no  pure  examples 
of  either  the  civic  or  the  ethnic  (or  their  various 
approximations).  This  objection  is  beside  the  point 
if  the  categories  are  taken  not  as  a  taxonomy  of  real 
cases  but  as  ideal  types,  or  end  points  on  a  spec¬ 
trum,  to  which  real  cases  approximate  to  a  greater 
or  lesser  degree.  It  is,  however,  necessary  to  specify 
the  characteristics  to  which  these  labels  refer,  and 
here  there  is  considerable  confusion. 

The  central  distinction  in  all  of  this  is  between  a 
nationalism  that  defines  the  nation  by  strictly 
ascriptive  criteria  (which  is  what  ethnicity  means  in 
this  context)  and  one  that  is  more  open  and  in 
which  nationality  can  be  acquired.  The  most  com¬ 
mon  ascriptive  criterion  is  ancestry,  so  that  it  is  not 
possible  for  somebody  to  join  the  nation.  Citizenship 
laws,  for  example,  are  often  seen  as  indicators  for 
ethnic  or  civic  conceptions  of  nationhood,  with  ius 
sanguinis  (citizenship  by  descent)  seen  as  ethnic 
and  ius  soli  (by  birth  in  the  territory)  as  civic. 
Certainly  ius  soli  is  more  inclusive,  but  one  can 
argue  that  both  are  ascriptive  criteria  for  member¬ 
ship.  The  real  question  is  whether  people  satisfying 
neither  of  these  criteria  can  join  the  nation  legally 
through  naturalization  and,  even  if  they  can, 
whether  they  are  accepted  socially  as  conationals. 
Another  mechanism  is  culture  and  language,  with 
nationalisms  insisting  on  a  particular  culture  said 
to  be  exclusive  and  ethnic.  This  is  also  not  clear  in 
practice  since  languages  can  be  acquired,  and  in 
some  cases,  such  as  the  United  States,  insistence  on 
a  single  language  is  defended  as  a  mechanism  for 
inclusion  of  immigrants  rather  than  exclusion;  the 
same  argument  is  made  in  Quebec  and  Catalonia. 
Rather  than  fix  on  particular  mechanisms,  then,  it 
might  be  helpful  to  focus  on  the  intent  of  specific 
practices  and  whether,  how,  and  whom  they  tend 
to  include  or  exclude,  accepting  that  any  national¬ 
ism  by  definition  includes  some  people  and  excludes 
others.  For  all  these  difficulties,  however,  there 
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does  seem  to  be  a  difference  in  nationalist  practices 
in  specific  contexts  between  those  that  seek  to 
unite  the  whole  community  within  a  given  territo¬ 
rial  boundary  and  subject  to  common  institutions 
and  those  that  seek  to  redraw  boundaries  and  insti¬ 
tutions  to  fit  preexisting  conceptions  of  commu¬ 
nity.  Some  national  communities  are  open  to 
incomers,  at  least  in  the  second  generation,  while 
others  are  more  restrictive. 

Sometimes  this  distinction  is  mapped  onto  that 
between  the  nationalism  of  existing  states,  often 
assumed  to  be  civic  and  inclusive,  and  the  nation¬ 
alism  of  component  territories  and  groups,  assumed 
to  be  ethnic  and  exclusive.  There  seems  to  be  little 
justification  for  this  other  than  a  predisposition  to 
existing  order  since  nationalisms  at  both  levels  can 
be  more  or  less  exclusive. 

The  distinction  is  often  given  a  strong  normative 
charge,  with  civic  nationalism  seen  as  liberal  and 
democratic  and  ethnic  nationalism  as  antidemo¬ 
cratic.  Civic  nationalism  is  individualist  and  based 
on  voluntary  adhesion;  ethnic  nationalism  is  col¬ 
lectivist  and  forced.  In  practice,  however,  member¬ 
ship  and  exit  from  the  nation  is  rarely  a  matter  of 
individual  choice,  and  Liah  Greenfeld  argues  that 
civic  nationalism  itself  may  take  individualist  or 
collectivist  forms.  The  concept  of  civic  nationalism 
is  sometimes  further  stretched  to  cover  republican¬ 
ism  (an  order  based  on  political  commitment  of 
citizens  toward  the  polity),  Jurgen  Habermas’s 
constitutional  patriotism  (the  idea  that  order  can 
be  based  on  support  for  constitutional  values  with¬ 
out  an  ethnic  component),  or  even  multiculturalism 
(the  idea  that  the  state  should  not  impose  a  single 
culture).  This  is  to  miss  the  point  that  even  civic 
nationalism  is  a  form  of  nationalism,  albeit  shorn 
of  ascriptive  requirements.  It  may  point  to  a  single 
language,  shared  values,  and  socialization  through 
state  and  society  into  a  single  identity,  as  long  as 
these  are  available  to  all  within  the  territory. 

The  Ethics  of  Nationalism 

Nationalism  has  historically  been  associated  with 
liberalism  and  the  advance  of  democracy  through 
popular  sovereignty  and  with  totalitarianism.  It  has 
been  a  force  for  liberation  from  foreign  domination 
and  for  conquest.  In  the  19th  century,  John  Stuart 
Mill  advocated  nationally  integrated  societies 
(through  socialization  rather  than  ethnic  exclusion) 


as  necessary  for  the  trust  that  underpins  liberal 
democracy.  Lord  Acton,  in  contrast,  saw  in  the  con¬ 
gruence  between  nation  and  state  the  potential  for 
tyranny  and  preferred  the  multinational  society 
with  its  balance  of  interests.  The  issue  was  reopened 
in  the  late  20th  century  with  the  debate  on  liberal 
nationalism,  a  concept  first  introduced  by  Yael 
Tamir.  It  was  argued,  in  Millian  fashion,  that 
shared  nationality  could  underpin  democracy  as 
long  as  it  was  not  construed  in  a  narrow  ethnic 
fashion.  Others  noted  that  stateless  nations  could 
equally  be  the  sites  for  liberal  and  civic  nation  build¬ 
ing.  David  Miller  and  others  have  more  recently 
rehabilitated  nationality  as  a  principle  for  social 
solidarity,  so  underpinning  the  social-democratic 
project  and  previous  conceptions  of  socialism, 
which,  in  theory  if  not  in  practice,  had  been  cos¬ 
mopolitan.  It  was  not  clear  in  these  writings  what 
the  exact  link  between  nationality  and  nationalism 
is,  but  it  was  agreed  that  the  state  had  a  legitimate 
interest  in  promoting  common  identities.  This  idea 
has  been  widely  taken  up  in  Europe  in  the  face  of 
immigration  and  a  questioning  of  the  ideal  of  mul¬ 
ticulturalism.  As  the  state  itself  is  challenged  and 
demystified  in  the  face  of  global  change  and  trans¬ 
national  integration  as  well  as  challenges  from 
internal  communities  (whether  these  are  labeled  as 
regions  or  stateless  nations),  it  can  no  longer  rely 
on  the  implicit  normative  supremacy  it  had  in  the 
past  and,  having  to  legitimate  itself,  has  fallen  back 
on  the  nationality  principle. 

Nationalist  movements,  whether  at  the  state, 
the  substate,  or  the  transnational  level,  are  still 
making  incompatible  claims  about  the  proper 
boundaries  of  political  community.  In  the  present 
day,  however,  they  tend  to  subscribe  to  the  same 
principles,  eschewing  purely  ethnic  arguments  and 
insisting  on  more  political  ones  based  on  the  right 
of  democratic  self-determination.  They  also  tend 
to  stress  their  democratic  credentials,  if  only  on  the 
basis  that  they  represent  the  “people.”  This  is  per¬ 
haps  a  form  of  ideological  isomorphism  as  nation¬ 
ality  claims  reflect  the  normative  principles 
regarded  as  legitimate  at  the  time.  So  “race”  is 
abandoned  and  “self-determination”  comes  in. 
The  result,  however,  is  once  again  a  set  of  princi¬ 
ples  for  defining  political  community  that  produce 
no  determinate  outcome  since  the  claims  are  often 
competing  and  almost  always  overlapping.  Some 
theorists  seek  to  adjudicate  among  these  claims 
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based  on  which  of  them  have  the  best  liberal 
democratic  credentials.  Others  regard  this  as  pre¬ 
sumptuous  and  unrealistic  since  there  is  no  supra¬ 
national  authority  to  enforce  such  adjudications. 
Most  observers  have  concluded  that  nationality 
issues  can  never  be  resolved  by  redrawing  political 
boundaries.  At  best,  they  can  be  managed  by  rec¬ 
ognizing  different  conceptions  of  the  nation  and  of 
the  division  of  political  authority. 
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Nationalist  Movements 


Nationalist  movements  are  (often  utopian)  politi¬ 
cal  campaigns  to  “build”  nations  that  correspond 
to  state  boundaries.  “Nation”  is  notoriously  diffi¬ 
cult  to  define,  and  “nationalists”  often  attempt  to 


convince  or  force  people  to  assume  identities 
about  which  they  were  previously  unaware  of  or 
with  which  they  may  be  uncomfortable.  Attempts 
to  make  nations  congruent  with  state  boundaries 
often  fail;  in  many  cases,  such  attempts  result  in 
efforts  to  claim  national  homogeneity  by  ignoring 
or  oppressing  other  ethnic  or  national  groups 
within  the  territory  of  the  “nation-state.”  This 
entry  discusses  problems  of  definition,  varying 
historical  conditions,  and  the  continuing  political 
relevance  of  nationalist  movements. 

Definition  of  Nationalism 

The  failure  of  most  nation-building  projects  in 
modern  history  raises  the  question  of  why  nation¬ 
alist  movements  have  long  been  so  influential  and, 
in  many  parts  of  the  world,  remain  so  today.  The 
belief  that  every  nation  should  have  a  state — and 
that  every  state  should  be  a  nation — has  been  the 
most  widely  accepted  form  of  legitimation  for 
modern  states  since  the  French  Revolution.  French 
nationalists  themselves  devoted  much  effort,  par¬ 
ticularly  in  the  19th  century,  to  creating  a  common 
cultural  and  political  identity,  a  process  that  Eugen 
Weber  (1976)  famously  termed  “turning  peasants 
into  Frenchmen.  ”  Weber  argued  that  popular  con¬ 
sciousness  of  being  part  of  the  French  nation  was 
weakly  developed  in  the  rural  areas,  with  elites 
using  the  school  curriculum  and  also  universal 
conscription  to  instill  a  strong  sense  of  nationality 
in  the  country’s  provinces. 

Only  a  few  other  countries  can  be  seen  to  have 
successfully  imitated  this  French  example  of  “suc¬ 
cessful”  assimilation,  among  them  Sweden  and 
Japan.  In  the  20th  century,  this  assimilationist 
project  grew  more  difficult,  even  in  venerable 
states  like  Spain.  Here,  substate  nationalisms 
flourished,  particularly  among  the  Basques  and  the 
Catalans.  Arguably,  few  nation-states  were  suc¬ 
cessfully  created  in  the  last  century,  with  almost  all 
developing  countries  that  emerged  from  colonial¬ 
ism  being  multinational  or  at  least  multiethnic  and 
multicultural  states.  Juan  Linz  and  Alfred  Stepan 
(1996)  have  proposed  that  it  would  be  more 
empirically  accurate  to  speak  of  “state-nations” 
(multicultural  and  often  multinational)  that  still 
manage  to  command  the  loyalty  of  their  citizens 
despite  the  lack  of  a  homogeneous  nation  under¬ 
pinning  the  state.  Among  state-nations  there  are 
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those  in  which  robustly  politicized  nationalist 
movements  are  active,  such  as  in  Belgium,  Spain, 
or  Sudan.  Other  state-nations,  however,  have 
managed  to  create  a  strong  sense  of  common  iden¬ 
tity  despite  multiculturalism  (such  as  Singapore, 
Switzerland,  and  the  United  States).  Yet  the  belief 
in  the  necessity  of  nation-state  building  by  nation¬ 
alist  movements  persists,  making  the  task  of 
achieving  legitimacy  through  the  establishment  of 
tolerant  state-nations  more  difficult. 

Nationalists  mostly  adhere  to  the  “ideal”  of  state 
boundaries  being  or  becoming  congruent  with  the 
nation  (for  Ernest  Gellner,  1983,  this  is  the  defining 
feature  of  nationalism).  This  means  that  from  a 
nationalist  perspective,  many  states  are  “too  large,” 
that  is  multinational,  or  “too  small,”  so-called  rump 
states  that  were  formerly  a  part  of  larger  states.  In 
interwar  Eastern  Europe,  examples  of  the  former 
were  the  newly  created  states  of  Yugoslavia  and 
Czechoslovakia.  Cases  of  the  latter  were  Austria 
(many  Austrians  then  considered  themselves 
German,  developing  a  separate  identity  only  after 
World  War  II)  and  Hungary  (where  many  ethnic 
Hungarians  were  living  outside  of  the  state  bound¬ 
aries).  Nation  building,  the  attempt  to  make  national 
loyalties  and  state  boundaries  correspond,  thus  ran 
into  the  awkward  problem  of  ethnic  minorities. 
Either  minorities  within  the  state  stood  in  the  way 
of  national  homogeneity,  or  they  lived  outside  the 
boundaries  of  the  mother  state  with  which  they 
shared  a  common  nationality.  These  two  types  of 
minorities  often  overlapped  with  an  ethnic  minority 
in  a  multinational  state  and  often  were  of  the  same 
nationality  that  was  the  dominant  group  in  an 
adjoining  country.  Attempts  by  oversized  states  in 
interwar  Eastern  Europe  to  strengthen  national 
identity  were  often  perceived  as  discriminatory  by 
minorities;  nationalism  in  undersized  states  led  to 
irredentist  claims  against  (or,  in  the  case  of  Austria, 
hopes  for  merger  with)  a  neighboring  state  or  states. 

Nationalism  can,  thus,  play  a  major  role  in 
undermining  and  discrediting  democracy  when  the 
logics  of  nation  building  and  crafting  democracies 
clash.  Excessive  emphasis  by  nationalist  move¬ 
ments  on  nation  building  encourages  the  rise  of 
extreme  nationalism  that  can  doom  simultaneous 
efforts  to  craft  democracies.  Political  democracy 
requires  the  acceptance  of  societal  pluralism. 
Extreme  nationalism,  on  the  other  hand,  claims  a 
common  interest  based  on  shared  nationality  that 


overrides  all  other  social  divisions.  When  a  nation¬ 
building  project  leads  to  a  conflict  with  national 
minorities  or  with  neighboring  countries,  it  tends 
to  create  an  environment  in  which  all  internal 
opposition  is  considered  antinational  and  thus 
nothing  less  than  treason.  These  developments 
break  the  original  French  connection  between 
national  identity  and  democratic  government. 
When  national  unity  is  stressed  at  the  expense  of 
political  pluralism,  a  democratic  polity  can  be 
severely  weakened  or  even  undermined. 

Stalin’s  notorious  effort  to  find  a  definition  of  a 
nation  relied  on  supposedly  objective  criteria:  a 
people’s  common  history,  territory,  language,  eco¬ 
nomics,  and  culture.  While  there  are  nations  that 
lack  a  common  language  (multilingual  nationalities 
such  as  the  Indians  and  the  Swiss),  a  common  reli¬ 
gious  or  ethnic  culture  (such  as  the  multiethnic  and 
religiously  diverse  Singaporeans),  or  a  common 
territory  (such  as  those  with  a  Kurdish  identity 
who  are  spread  across  several  states),  this  primor- 
dialist  view  remains  influential  among  nationalists 
to  the  present  day.  Subjective  interpretations  of  the 
nation  are  more  plausible,  at  least  from  a  scholarly 
point  of  view.  Benedict  Anderson  famously  claimed 
that  nations  are  but  “imagined  communities” 
(1983/2006) — social  constructions  in  which  people 
imagine  themselves  to  be  part  of  a  group.  This 
view,  in  turn,  has  been  criticized  for  not  adequately 
specifying  under  what  conditions  people  are  likely 
to  come  to  have  this  feeling  of  national  commonal¬ 
ity,  which  has  led  some  scholars  to  combine  the 
two  perspectives. 

In  English  and  all  languages  derived  from  Latin 
the  word  nation  is  related  to  “being  born”  and 
“native.”  But  the  word  natio  in  Latin  referred  to 
units  like  tribes,  clans,  and  families,  not  the  territo¬ 
rial  nations  we  think  of  today.  The  idea  that 
“nation”  should  apply  only  to  a  specific  territory 
and  a  particular  people  is  a  modern  idea.  Thus, 
although  much  recent  research  has  pointed  to  the 
often  ancient  roots  of  recurring  national  traditions 
(influential  perennialists  are  Armstrong,  1982; 
Smith,  1995),  nationalist  movements  are  a  modern 
phenomenon. 

Nationalism  and  Modernity 

The  relationship  between  nationalist  movements 
and  modernity  has  been  a  source  of  controversy 
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among  scholars.  Ernest  Gellner  influentially  argued 
that  nationalism  played  a  critical  role  in  modern¬ 
ization.  In  the  “agro-literate”  stage  of  history,  rul¬ 
ers  had  little  reason  to  impose  uniformity  on  their 
subjects.  But  with  the  division  of  labor  in  modern 
times,  with  work  becoming  increasingly  technical, 
a  need  for  cultural  standardization  and  context- 
free  communication  arose.  Nationalist  movements 
helped  provide  the  conditions  necessary  to  create 
workforces  suited  to  such  industrial  modernity, 
making  peasants  into  moderns.  In  an  age  of  indus¬ 
trialization  and  bureaucratization,  territories 
became  more  precisely  defined,  with  national  iden¬ 
tities  used  to  justify  state  boundaries.  The  long 
conflict  between  Germany  and  France  over  Alsace- 
Lorraine  saw  the  Germans  and  French  each  making 
claims  on  the  territory  based  on  different  national 
narratives  (the  fact  that  the  natives  spoke  a  dialect 
of  German  was  posed  against  the  counterargument 
that  the  inhabitants  identified  themselves  as  French 
citizens).  This  national  competition  was  also  a 
battle  over  resources  among  rapidly  industrializing 
neighbors. 

Gellner’s  approach  has  been  criticized  for  being 
overly  functionalist  (modernity  cannot  work  prop¬ 
erly  without  the  nation),  failing  to  account  for  the 
rise  of  nationalism  in  preindustrial  societies,  and 
being  unable  to  account  for  the  passions  that 
nationalism  arouses  (soldiers  dying  willingly  for 
their  nation).  But  the  link  between  nationalism  and 
modernity  is  undeniable,  even  if  it  is  less  direct 
than  Gellner  suggests.  To  criticize  nationalist 
movements  thus  seems  to  involve  criticizing  moder¬ 
nity  itself.  Yet  this  should  not  lead  the  hardships 
caused  by  nationalist  movements  to  minority 
groups  to  be  relativized.  Regular  media  reports 
about  state-perpetuated  violations  of  minority 
groups’  human  rights  (be  it  of  the  Uyghurs  and 
Tibetans  in  China  or  Black,  non-Muslims  in  south¬ 
ern  Sudan,  for  example)  can  be  linked  to  national 
movements  using  state  power  to  enforce  their 
vision  of  a  nation-state. 

By  praising  the  nation  as  the  most  valuable 
identity  (raising  it  above  class,  region,  gender,  or 
other  identities),  many  nationalist  movements  in 
the  19th  and  20th  centuries  became  “secular  reli¬ 
gions.”  They  developed  complex  “theologies”  that 
stressed  supposed  commonalities  of  the  “nation” 
while  overlooking  or  even  ignoring  the  territorial, 
ethnic,  religious,  and  other  forms  of  diversity  their 


movement  faced  in  realizing  its  goal  of  a  homoge¬ 
neous  nation.  By  stressing  the  interest  of  their 
nation,  they  sacrificed  the  interests  of  other  com¬ 
peting  nationalities,  leading  to  chauvinism,  war, 
and  often  ethnic  cleansing,  which  was  seen  in 
Europe  as  recently  as  1990s  in  the  Yugoslav  wars. 

Nationalist  movements  have  not  been  ideologi¬ 
cally  exclusive,  however.  They  have  combined  or 
formed  hybrids  with  liberal,  socialist,  and  conser¬ 
vative  ideas  and  causes.  Liberals  were  among  the 
earliest  advocates  of  nationalism  in  the  19th  cen¬ 
tury.  During  the  revolution  of  1848,  some  dele¬ 
gates  to  St.  Paul’s  Church  in  Frankfurt  argued  that 
everyone  who  lived  on  German  territory  was  a 
German  and  that  national  identity  was  not  deter¬ 
mined  by  one’s  ethnic  background  or  language  but 
was  a  political  organism  defined  through  the  state, 
making  nationalism  a  political  concept  not  an  eth¬ 
nic  one.  This  soon  became  a  minority  position  in 
Germany,  in  much  of  the  rest  of  Europe,  and  in 
many  other  places  in  the  world  where  ethnic,  par¬ 
ticularistic,  and  collective  conceptions  of  the  nation 
replaced  humanistic,  universalist,  and  individualis¬ 
tic  ones. 

Early  socialist  movements  adopted  the  national¬ 
ist  agenda  only  implicitly  (but  later  ones,  such  as 
Mao  in  China  and  Ho  Chi  Minh  in  Vietnam,  inter¬ 
twined  nationalism  and  socialism  so  closely  that 
they  cannot  be  separated  in  their  thought  and 
actions).  Karl  Marx  and  Friedrich  Engels  argued 
that  class  conflict  was  more  important  than  the 
differences  between  nations.  Yet  they  still  sup¬ 
ported  several  nationalist  movements  against 
oppressive  empires  (particularly  struggles  against 
the  Russian  and  the  Habsburg  empires  but  also  the 
Irish  independence  fight  against  the  British).  But 
Engels,  in  particular,  suggested  that  only  “great, 
historic  nations”  would  survive  into  the  modern 
world,  advancing  the  cause  of  socialism,  while 
smaller  nations  would  be  counterrevolutionary, 
doomed  to  wither  away.  An  important  exception 
to  the  chauvinist  orientation  of  many  socialist 
thinkers  behind  a  facade  of  internationalism  was 
the  Austro-Marxists.  In  1907,  Otto  Bauer  advo¬ 
cated  a  program  of  “cultural-national  autonomy” 
to  lessen  tensions  between  nationalities  in  the 
Austro-Hungarian  Empire,  a  multinational  terri¬ 
tory  under  Habsburg  rule  while  maintaining  the 
solidarity  of  the  working  class.  Rather  than  repu¬ 
diating  nationalism,  he  advocated  that  socialists 
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should  embrace  national  differences  and  work  to 
allow  the  flowering  of  each  national  culture  in  a 
socialist  state.  Bauer’s  tolerant,  state-nation 
approach  drew  contempt  from  Lenin  and  other 
orthodox  Marxists,  foreshadowing  the  dogmatism 
and  oppressiveness  of  the  Soviet  Union  in  regard  to 
national  minorities.  Benito  Mussolini  was  a  social¬ 
ist  who  became  a  fanatical  nationalist  during 
World  War  I.  While  he  was  disavowed  by  the 
socialist  movement,  he  had  exploited  the  chauvin¬ 
ist  side  of  many  socialist  thinkers  to  create  what 
became  known  as  fascism.  The  fanaticism  of  the 
Khmer  Rouge  in  Cambodia,  who  perpetrated  one 
of  the  worst  mass  murders  in  the  post-World  War 
II  period,  was  fueled  by  a  deadly  mixture  of  social¬ 
ism  and  nativist  nationalism. 

Initially,  conservatives  were  the  slowest  to  rec¬ 
ognize  the  potential  power  of  nationalism  for  their 
political  purposes,  instinctively  fearing  the  mobili¬ 
zation  of  the  masses.  But  when  they  did  make  this 
discovery,  the  consequences  were  often  terrifying. 
Militarist  Japan  in  the  1930s  and  early  1940s  is 
one  well-known  example.  While  Japanese  propa¬ 
gandists  justified  their  country’s  conquest  of  much 
of  Asia  through  a  vague  notion  of  Pan-Asianism,  a 
sense  of  national  superiority  and  a  corresponding 
contempt  for  other  nations  underlay  this  aggres¬ 
sion,  creating  lasting  bitterness  toward  Japan  in 
the  region,  up  to  the  present  (not  helped  by  the 
unwillingness  of  some  Japanese  conservatives  to 
acknowledge  the  country’s  war  crimes).  National 
intolerance  reached  its  pinnacle  under  Nazi  rule  in 
Germany  and  their  conquest  of  much  of  the  rest  of 
Europe  during  World  War  II.  Although  the 
Holocaust  has  been  attributed  to  the  Nazis’  racist 
ideology,  nationalist  appeals  were  key  to  mobiliz¬ 
ing  initial  support  for  National  Socialism  in 
Germany. 

Following  the  famous  historian  Friedrich 
Meinecke  who  distinguished  between  state  and 
cultural  nationalism,  the  great  scholar  of  compara¬ 
tive  nationalism  Hans  Kohn  (1944)  made  a  dis¬ 
tinction  between  the  earlier  universalist  French 
nationalism  that  was  a  product  of  the  Enlightenment 
and  the  later  particularistic  German,  Russian,  and 
Eastern  European  nationalism  that  stressed  cul¬ 
tural  particularism.  In  a  more  recent  effort  that 
draws  on  Kohn’s  insights,  Liah  Greenfeld  (1993) 
has  argued  that  nationalist  movements  have  fol¬ 
lowed  different  “routes  to  modernity.  ”  Individualist 


and  civic  nationalism  arose  first  in  Britain  and  the 
United  States;  then,  collective  but  civic  nationalism 
took  hold  in  France;  and  finally  collective  and  eth¬ 
nic  nationalism  shaped  the  politics  of  Russia  and 
Germany.  Ethnocollectivist  types  of  nationalism 
tend  to  be  authoritarian,  with  a  high  degree  of 
inequality  between  nationalist  elites  and  the  disem- 
powered  masses.  In  short,  efforts  to  create  a 
nationally  defined  people  have  been  compatible 
with  democracy  in  the  more  individualistic,  civic 
versions  of  nationalist  movements.  In  other  cases, 
however,  nationalist  movements  that  have  appealed 
to  collective,  ethnic  identities  leave  little  room  for 
individual  rights,  including  those  of  ethnonational 
minorities. 

Contemporary  Nationalist  Movements 

One  disturbing  recent  development  among  nation¬ 
alist  movements  has  been  the  rise  of  citizen  groups 
accusing  the  state  of  insufficiently  upholding  the 
national  interest.  Whether  it  be  in  China  where 
nationalist  “netizens”  have  used  the  Internet  to 
attack  the  Communist  Party’s  supposed  weak  pur¬ 
suit  of  the  national  interest  (particularly  vs.  Japan), 
in  Thailand  where  for  domestic  political  reasons 
“yellow  shirt”  protests  reopened  a  border  dispute 
with  Cambodia  that  was  long  thought  to  be 
resolved,  or  among  Buddhist  nationalists  in  Sri 
Lanka  demanding  stronger  action  against  Tamil 
rebels,  such  societal  nationalism  has  increased 
internal  and  external  conflicts  and  made  national¬ 
ism  more  extremist.  As  discussed  above,  this  is 
highly  unfavorable  for  an  existing  democracy  or 
for  democratic  transitions  in  the  future.  Such  civil 
societal  nationalism  has  also  weakened  govern¬ 
ment  efforts  to  harness  nationalism  to  issues  such 
as  national  development  as  this  risks  allowing 
societal  groups  to  seize  on  it  to  accuse  the  govern¬ 
ment  of  being  insincere  in  its  commitment  to 
“genuine”  nationalism. 

The  countries  of  the  European  Union  (EU)  have 
prided  themselves  on  defusing  the  impact  of  rival 
nationalisms  within  their  community.  Post-World 
War  II  reconciliation  between  Germany  and 
France,  in  which  competing  nationalisms  led  to  a 
bloody  rivalry  between  these  two  European  pow¬ 
ers,  was  a  crucial  milestone.  But  the  postnational¬ 
ist  age  that  many  European  politicians  have  hoped 
for  has  yet  to  dawn.  National  interests  still  strongly 
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drive  the  behavior  of  European  community  coun¬ 
tries;  on  the  periphery  of  the  EU,  Greek  national¬ 
ism  continues  to  clash  with  Turkish  national 
claims.  Yet  Western  Europe  has  arguably  come 
furthest  in  healing  the  old  wounds  inflicted  by 
nationalist  movements.  By  contrast,  Eastern 
Europe  (where  the  short  war  between  Georgia  and 
Russia  in  2009  showed  how  such  tensions  can 
quickly  erupt  into  war)  remains  a  region  full  of 
nationalist  conflicts  despite  the  end  of  the  Yugoslav 
war.  In  Black  Africa  and  Latin  America,  by  con¬ 
trast,  nationalist-based  conflicts  have  subsided  on 
the  whole  (with  the  exception  of  a  continuing 
ethnic-based  civil  war  in  Sudan  and  recent  nation¬ 
alist  tensions  between  Venezuela  and  Colombia). 
In  the  Middle  East,  the  age  of  Pan-Arabism  has 
passed  (which  reached  its  peak  during  the  1960s), 
with  Arab  nationalism  now  focused  on  the  Palestine 
conflict  with  Israel.  A  series  of  Arab  regimes  origi¬ 
nally  legitimized  by  national  movements  that 
seized  power,  such  as  Nasserism  in  Egypt,  have 
lost  legitimacy  as  economic  performance  declined, 
with  Islamist  groups  such  as  the  Egyptian  Muslim 
Brotherhood  replacing  nationalists  as  the  strongest 
source  of  opposition.  In  South  Asia,  nationalist 
tensions  between  Pakistan  and  India  remain  high, 
with  the  dispute  over  Kashmir  a  perennial  source 
of  conflict.  In  Sri  Lanka,  a  decades-long  civil  war 
recently  ended  with  the  defeat  of  the  Tamil  Tiger 
rebel  group,  but  reconciliation  still  seems  far  off  as 
there  is  little  indication  that  the  victorious  majority 
Sinhalese  are  willing  to  take  the  key  steps  neces¬ 
sary  to  make  the  country  more  tolerant  toward  the 
Tamil  minority.  In  Northeast  Asia,  conflicts  over 
disputed  islands  (as  well  as  the  wounds  left  from 
the  experience  of  World  War  II)  between  China 
and  Japan  remain  a  major  source  of  potential  con¬ 
flict  as  does  the  ongoing  dispute  between  China 
and  Taiwan,  with  the  former  warning  that  it  will 
invade  if  the  latter  declares  independence  (as  some 
Taiwanese  nationalist  activists  nonetheless  wish  to 
do).  In  Southeast  Asia,  the  military  government  in 
Burma/Myanmar  has  reached  accords  with  most 
rebel  groups  representing  non-Burman  nationali¬ 
ties  after  a  long-running  civil  war,  but  these  were 
based  on  its  own  despotic  rules,  not  a  genuine 
spirit  of  accommodation.  Indonesia  appears  to  be 
a  contrasting  case:  Releasing  East  Timor  (Timor- 
Leste)  into  independence  in  2002 — and  overcom¬ 
ing  ethnic  conflict  in  the  immediate  aftermath  of 


the  overthrow  of  the  long-ruling  dictator  Suharto 
in  1998 — has  helped  this  country,  the  world’s  larg¬ 
est  archipelago  state,  to  become  the  most  stable 
democracy  in  the  region. 

But  nationalist  movements  will  continue  to  thrive 
in  areas  in  which  inhabitants  feel  oppressed.  In  this 
sense,  nationalism  retains  its  liberationist  impulse 
that  it  so  famously  developed  during  the  French 
Revolution  but  also,  for  example,  in  the  Greek 
struggle  for  independence  from  the  Ottoman  Empire 
in  the  19th  century.  One  of  the  paradoxes  of  modern 
nationalism  is  how  progressive  causes  in  opposition 
to  ruling  empires  often  degenerate  into  new  forms  of 
political  oppression  once  control  over  the  state  is 
achieved.  Here,  the  experience  of  the  countries  of  the 
former  Soviet  Union  and  Yugoslavia  is  instructive. 
Nationalist  movements  were  at  the  forefront  of 
opposition  to  (post)totalitarian  states.  Once  these 
multinational  autocracies  had  collapsed,  however, 
the  new  nation-states  that  replaced  them  often 
proved  to  be  intolerant.  The  war  in  the  former 
Yugoslavia  along  ethnonational  lines  is  only  the 
most  extreme  example,  with  Serbian  nationalists 
justifying  war  in  the  name  of  greater  Serbia  (Croatian 
nationalism  was  less  destructive  only  because  its 
state  power  was  weaker).  But  in  the  Soviet  Union, 
even  some  of  the  most  enlightened,  democratic 
countries — the  Baltic  states  of  Estonia,  Lithuania, 
and  Latvia — have,  sometimes  openly  but  often 
covertly,  discriminated  against  their  ethnic  Russian 
populations.  Ethnic  heterogeneity  remains  the  spec¬ 
ter  that  haunts  many  states  claiming  to  be  homoge¬ 
neous  nations. 
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Natural  Law 


The  notion  of  “natural  law”  is  a  prime  and  endur¬ 
ing  one  in  the  Western  philosophical  and  political 
tradition.  In  antiquity,  the  doctrine  of  natural  law 
was  based  on  a  conception  of  nature  as  a  primary 
ordering  force  in  which  every  being  was  ascribed  a 
place.  In  contemporary  times,  the  doctrine  has  been 
reformulated  under  the  guise  of  “natural  rights,” 
meaning  that  laws  should  respect  inherent  human 
values.  Such  a  multimillennium  history  could  not 
but  entail  major  variations  over  time  in  the  mean¬ 
ing,  the  purpose,  and  the  place  of  natural  law.  For 
this  reason,  any  comprehensive  approach  to  it  must 
make  room  for  the  remarkable  permanence  of 
some  traits  of  what  is  called  natural  law  as  well  as 
for  dramatic  changes  in  its  object  and  definition. 

For  the  Greeks  and  then  the  Romans,  who 
crafted  the  notion  of  natural  law  in  a  doctrine 
sometimes  referred  to  as  jus  naturalism  (from  the 
Latin,  jus  naturale ,  which  means  natural  law),  the 
concept  of  law — nomos  in  Greek  and  lex  in 
Latin — is  grounded  in  a  cosmological  perspective 
according  to  which  the  universe  is  organized  on  a 
definite  pattern  that  everyone  has  to  follow.  Such 
a  conception  extends  its  reach  far  beyond  the 
realm  of  humanity  and  embraces  animals  and  even 
the  inanimate.  In  Aristotle’s  perspective,  for 
instance,  the  natural  place  of  a  stone  is  the  ground 
where  it  stays  naturally,  while  in  any  other  place  it 
would  fall.  Such  is  the  case  with  human  beings. 
The  city  is  organized  through  natural  ties  of  subor¬ 
dination  and  exchanges,  and  laws  should  respect 


nature  in  the  social  bonds  thus  dispatched.  The 
minutiae  with  which  Plato  in  The  Republic  or  in 
The  Laws  and  Aristotle  in  his  Politics  describe 
under  which  conditions — of  climate,  population, 
relations  between  sexes,  and  so  on — a  successful 
city  can  thrive  reflects  a  general  preoccupation. 
This  establishes  the  good  law  as  the  one  respecting 
natural  conditions,  which  range  from  the  demog¬ 
raphy  of  a  city  to  the  actual  ties  of  subordination 
between  men  and  women,  free  citizens  and  slaves, 
and  classes  in  society. 

Despite  pretensions  to  the  contrary,  ancient  con¬ 
ceptions  of  natural  law  did  not  form  an  actual 
coherent  body  of  doctrine.  Authors  were  keen  on 
disputing  what  was  according  to  nature  or  against 
nature;  while  the  principle  of  natural  law  was  well 
established,  what  it  tangibly  meant  was  open  to 
continual  discussion.  There  is  probably  not  a  single 
law,  rule,  or  moral  obligation  of  the  antique  world 
that  has  not  been  submitted  to  critical  scrutiny  and 
challenged  or  reasserted  on  the  grounds  of  what  is 
natural  or  not:  The  precedence  of  family  duties 
over  city  laws  (as  is  the  case  in  the  famous  Antigone 
myth,  where  Antigone  refuses  to  obey  her  uncle, 
the  local  tyrant,  who  forbade  the  proper  burial  of 
her  brother,  but  also  as  in  Plato’s  dialogue, 
Euthyphro,  where  Socrates  critiques  a  son  suing  his 
father  for  the  death  of  a  murderer),  the  issue  of 
slavery  (whether  it  is  natural  or  not,  and  under 
which  circumstances,  was  a  constant  debate),  the 
question  of  appropriate  leadership  (who  is  the 
natural  leader  in  a  city?),  the  separation  of  classes, 
or  what  it  is  proper  to  do  or  not  do — all  these  could 
be  subjected  to  dispute  in  the  name  of  nature. 

As  Leo  Strauss  contended,  these  continuing  and 
multiform  disputes  over  what  is  against  or  accord¬ 
ing  to  nature  supposed  that  nature  was  an  object 
of  inquiry  in  an  exercise  of  reflection  and  dialogue 
akin  to  philosophy.  Natural  law,  to  that  extent, 
favors  the  exercise  of  personal  critical  reason  by 
insisting  on  the  fact  that  human  beings  should  not 
be  forced  to  comply  with  rules  that  are  repugnant 
to  reason.  As  Cicero  put  it  in  a  famous  sentence  of 
the  third  book  of  his  De  Republica : 

There  is  a  true  law,  a  right  reason,  conformable 
to  nature,  universal,  unchangeable,  eternal, 
whose  commands  urge  us  to  duty,  and  whose 
prohibitions  restrain  us  from  evil.  Whether  it 
enjoins  or  forbids,  the  good  respect  its  injunctions, 
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and  the  wicked  treat  them  with  indifference.  This 
law  cannot  be  contradicted  by  any  other  law, 
and  is  not  liable  either  to  derogation  or 
abrogation.  Neither  the  senate  nor  the  people  can 
give  us  any  dispensation  for  not  obeying  this 
universal  law  of  justice. 

Natural  law  conforms  to  reason  because  it  is 
discovered  by  reason  and  satisfies  the  demands  of 
reason,  which,  in  its  turn,  is  nothing  else  but  the 
discovery  of  the  preexisting  order  of  nature. 

Quite  surprisingly,  considering  how  intimately 
it  was  initially  linked  to  a  pagan  conception  of 
the  world,  the  doctrine  of  natural  law  found  its 
place  among  Christian  conceptions.  Theoretically, 
though,  the  development  of  Christianity  constituted 
a  deadly  challenge  to  this  doctrine.  For  Christians, 
nature  is  no  more  than  a  creation  of  God,  who  is 
the  ultimate  reason  and  sense  of  the  world  as  the 
Gospel  of  John  bears  it:  “All  things  were  made  by 
[GodJ;  and  without  him  was  not  any  thing  made 
that  was  made”  (1:3).  Nature  as  “all  the  things”  is 
second  to  God’s  decision  to  create  it.  Far  from  rep¬ 
resenting  the  achieved  order  of  the  world,  nature  is 
flawed  with  imperfection  since  only  God  is  perfect. 
There  is  worse:  This  imperfection  means  that 
nature  can  be  regarded  as  the  source  of  sin.  For 
thinkers  like  Augustine  (354-430),  the  fact  that 
nature  in  man  is  marked  by  the  fate  of  death  is  a 
testimony  to  its  corruption  and  sinful  condition 
from  which  only  redemption  beyond  the  natural 
condition  of  humanity  can  save  us. 

Despite  this  potential  tension,  the  cultural  con¬ 
text  in  which  the  primitive  church  developed  was 
so  strongly  influenced  by  classical  theories  that  the 
reference  to  natural  law  was  spontaneously  main¬ 
tained,  as  shown,  for  instance,  in  Paul’s  Letter  to 
the  Romans:  “Sometimes  [Gentiles,  i.e.,  non-Jews 
or  pagans]  do  naturally  what  the  law  orders.  So 
they  have  a  law  in  themselves,  although  they  do 
not  have  the  law”  (2:14).  Obviously,  the  law  of 
God  may  be  followed  out  of  a  “natural,”  inner 
sense  of  good  and  right,  thus  bridging  the  gap 
between  revelation  and  natural  law.  Centuries 
later,  in  a  major  integration  of  natural  law  in 
Christian  conceptions,  Thomas  Aquinas  (1225- 
1274)  develops  the  point  in  his  Theological  Summa 
(Part  1,  Question  91):  Natural  law  is  the  “partici¬ 
pation”  of  the  rational  creature — man — in  the  law 
of  God,  and  reason  is  thus  the  way  to  participate 


in  divine  law  through  natural  law.  Far  from  dis¬ 
qualifying  it  as  a  model  of  law,  creation  establishes 
nature  as  God’s  order  in  the  world  and  must  be 
followed  by  men. 

The  absorption  of  the  conception  of  natural  law 
inherited  from  the  pagan  past  in  Christian  doc¬ 
trines  did  not  stabilize  a  consensual  understanding 
of  it  but  elicited  new  questions  eventually  provok¬ 
ing  a  divide  sometimes  referred  to  as  the  opposi¬ 
tion  between  two  conceptions  of  jus  naturalisms : 
classic  and  modern.  One  key  element  has  been  a 
radical  transformation  of  the  meaning  of  “nature,” 
progressively  denoting  the  inscription  of  nature  in 
man  and  no  more  the  insertion  of  man  in  the 
whole  order  of  the  universe.  Illustrating  the  evolu¬ 
tion  in  his  Spirit  of  the  Laws,  Baron  de  Montesquieu 
(1689-1755)  states  that  “the  laws  of  nature  are  the 
laws  that  derive  from  our  constitution.”  Natural 
law,  which  the  ancients  considered  as  part  of  a 
cosmology,  has  become  the  outcome  of  a  new 
anthropology  insisting  on  the  value  of  the  indi¬ 
vidual  and  his  personal  relation  to  nature. 

Spanning  centuries,  the  transformation  of  natu¬ 
ral  law  eventually  acquired  a  definition  during  the 
Enlightenment,  when  natural  law  slowly  made 
room  for  a  conception  of  individual  natural  rights, 
which  led  to  the  declarations  of  the  American  and 
French  revolutions  in  1776  and  1789.  The  causes 
and  the  forms  of  this  revision  of  the  sense  of  “natu¬ 
ral  law”  are  many.  It  has  long  been  considered  that 
the  change  occurred  briskly  in  the  18th  century. 
Modern  historiography  tends  rather  to  agree  on  an 
evolutionary  process  originating  in  the  Middle 
Ages,  with  William  of  Ockham  (1285-1349)  being 
often,  though  controversially,  cited  as  the  instiga¬ 
tor  of  this  development  in  the  wake  of  the  work  of 
the  French  historian  of  law  Michel  Villey  (1914- 
1988).  But  it  is  certain  that  in  the  18th  century, 
natural  law  eventually  came  to  mean  that  laws 
must  respect  equality  and  freedom  in  men  (the 
issue  of  women  was  debated),  which  are  consid¬ 
ered  to  be  natural  features  of  humanity.  This  is 
what  the  notion  of  “rights”  grants,  despite  infinite 
variations  over  centuries  in  the  definition  of  these 
rights  among  modern  and  contemporary  authors — 
no  less  keen  in  the  modern  age  on  discussing  natu¬ 
ral  rights  than  their  ancient  predecessors  had  been 
on  disputing  over  what  is  natural  or  not. 

Beyond  historical  fractures,  jus  naturalism  rests 
today  on  a  fundamental  set  of  convictions  regarding 
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the  essence  of  law,  the  first  one  being  that  laws  are 
not  arbitrary.  Opposing  positivist  juridical  concep¬ 
tions,  which  assert  that  a  law  is  no  more  than  a 
norm  regarded  as  legal,  jus  naturalism  is  a  reminder 
that  masses  could  be  massively  unjust,  and  the  will 
of  the  prince  or  the  leader  can  be  detached  from 
any  common  sense  of  justice.  The  theme  of  natural 
law  thus  assumes  a  normative  role  as  well  as  a 
critical  one.  On  the  normative  side  is  the  idea  that 
nature  commands  certain  actions  and  forbids  oth¬ 
ers:  The  appeal  to  natural  law  can  serve  the  pur¬ 
pose  of  justifying  laws  or  justifying  the  call  for  new 
laws  by  considering  them  as  requisites  from 
nature.  By  the  same  token,  the  reference  to  natural 
law  also  represents  a  powerful  critical  tool  to  con¬ 
test  the  validity  of  laws  that  would  appear  to  go 
against  nature.  This  is  the  basis  of  one  of  the  most 
important  features  in  jus  naturalist  conceptions: 
the  distinction  between  natural  law,  which  is  the 
source  of  justice,  and  conventional  laws — the  laws 
of  a  collectivity — which,  in  the  best  cases,  obey  or 
translate  natural  law  but  in  the  worst  situations 
may  well  violate  any  natural  sense  of  justice. 

In  all  its  conceptions,  natural  law  addresses  the 
fundamental  paradox  in  human  laws,  whether 
understood  in  a  specifically  legal  or  in  a  broader 
moral  sense.  On  the  one  hand,  norms  vary 
greatly:  What  seems  “natural”  to  one  human 
group  might  well  be  considered  deviant  or  even 
strictly  forbidden  in  another  one.  On  the  other 
hand,  rules  are  often  felt  to  be  “natural”  by  their 
followers,  a  fact  that  gives  way  to  uncertainty  or 
bewilderment  when  they  are  challenged  by  the 
existence  of  other  potentially  contradictory  rules. 
To  that  extent,  obedience  to  rules  corresponds  to 
a  natural  sense  of  justice.  For  centuries,  the  notion 
of  natural  law  has  translated  this  sense  in  the 
theory  of  law. 
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A  natural  resource  can  be  defined  as  a  feature  of 
the  natural  environment  that  has  some  sort  of  eco¬ 
nomic,  cultural,  aesthetic,  ecological,  or  spiritual 
value.  The  study  of  natural  resources  in  political 
science  is  ultimately  about  how  such — often 
conflicting — values  are  managed  and  distributed 
by  political  actors,  communities,  organizations, 
bureaucracies,  and  countries.  After  a  brief  histori¬ 
cal  background,  the  remainder  of  the  text  reviews 
current  research  themes  in  the  study  of  natural 
resource  management.  The  concluding  paragraph 
offers  some  thoughts  about  emerging  research 
trends. 

The  History  of  the  Study  of 
Natural  Resource  Management 

Although  natural  resources  and  politics  have  been 
intimately  connected  throughout  the  history  of 
human  civilization,  a  politics  of  natural  resource 
management  in  a  modern  sense  did  not  appear 
until  the  1960s.  The  “discovery”  of  a  wider  set  of 
environmental  problems  brought  with  it  a  growing 
realization  that  many  natural  resources  were  being 
exploited  in  ways  that  rapidly  undermined  their 
future  existence.  Out  of  this  realization  came  the 
perhaps  single  most  influential  article  ever  pub¬ 
lished  in  the  field  of  natural  resource  management: 
Garrett  Hardin’s  1968  Science  article,  “The 
Tragedy  of  the  Commons,”  has  had  a  substantial 
and  long-lasting  impact  on  how  problems  of  natu¬ 
ral  resource  management  are  conceived  of  and 
theorized.  By  modeling  the  usage  of  jointly  owned 
natural  resources — the  commons — as  a  collective 


1666  Natural  Resources 


action  problem,  Hardin  argued  that  finite  natural 
resources  that  were  jointly  used  would  eventually 
be  exhausted.  Since  no  single  resource  user  has  an 
incentive  to  limit  his  or  her  outtake  from  the 
resource,  the  resource  would  soon  suffer  from 
overexploitation.  Because  most  natural  resources 
are  collectively  owned,  Hardin’s  prediction  was 
that  the  world’s  natural  resources  were  on  the 
brink  of  depletion,  unless  they  could  be  brought 
under  either  private  or  governmental  ownership. 

The  situation  described  in  the  “Tragedy  of  the 
Commons”  continued  to  be  the  mainstream  model 
for  understanding  interactions  between  society  and 
natural  systems  until  the  next  seminal  work  in  natu¬ 
ral  resources  management  appeared  in  1990.  Elinor 
Ostrom’s  path-breaking  classic  Governing  the 
Commons:  The  Evolution  of  Institutions  for 
Collective  Action  built  on  the  fundamental  problem 
of  jointly  used  resources  and  the  resulting  overuti¬ 
lization  assumed  by  Hardin  but  argued  that  the 
depletion  of  the  resource  was  not  an  unavoidable 
outcome.  One  of  Ostrom’s  core  contributions  is  the 
conceptual  distinction  between  common  property 
(resources  owned  jointly)  and  common  pool 
resources  (CPRs — resources  that  are  highly  sub- 
tractable  and  have  high  exclusion  costs  associated 
with  them).  Relying  on  evidence  from  multiple  case 
studies  and  game  theoretical  models,  O  strom  pointed 
out  that  natural  resource  users  may  solve  the  com¬ 
mons  dilemma  through  self-organizing  institutions 
that  regulate  access  and  resource  extraction  activi¬ 
ties.  However,  simply  setting  up  an  institution  is  not 
a  panacea  for  the  problem  of  sustainable  natural 
resource  management.  Since  the  very  act  of  con¬ 
structing  an  institution  is  a  collective  action  problem 
in  itself,  creating  the  institution  in  many  cases  poses 
an  insurmountable  challenge.  Moreover,  even  with 
the  institution  in  place  there  is  still  a  large  amount  of 
variation  in  how  well  different  types  of  institutional 
configurations  are  able  to  achieve  sustainable 
resource  use.  Factors  such  as  power  asymmetries 
and  levels  of  trust  among  users,  the  ability  to  control 
access  to  the  resource,  whether  or  not  resource  units 
are  easily  distinguishable,  and  the  degree  of  volatility 
in  the  resource  system  itself  are  all  highly  influential 
factors  for  successful  management. 

The  combined  legacy  of  Hardin  and  Ostrom  has 
securely  placed  the  study  of  natural  resource  man¬ 
agement  in  an  overall  theoretical  and  methodolog¬ 
ical  framework  of  rational-choice  institutionalism. 


Most  of  the  work  that  followed  in  the  wake  of 
Governing  the  Commons  can  be  placed  within  this 
paradigm,  as  also  a  good  deal  of  the  policy  recom¬ 
mendations  that  were  based  on  the  findings  from 
CPR  studies.  Moreover,  most  contemporary  work 
in  natural  resource  studies  has  retained  many  of 
Ostrom’s  core  notions,  such  as  the  pivotal  role  of 
institutions  and  the  importance  of  considering  the 
characteristics  of  the  resource  itself. 

Contemporary  Directions  in 
Natural  Resource  Management 

At  least  two  broad  themes  can  be  discerned  in  con¬ 
temporary  research  on  natural  resource  manage¬ 
ment.  The  first  theme  emphasizes  the  inherent 
complexity  of  natural  resource  systems  and  argues 
that  this  precludes  any  strong  notions  of  actually 
governing  natural  resources.  Complete  and  exhaus¬ 
tive  knowledge  about  the  dynamics  of  a  given 
ecosystem  is  simply  beyond  the  reach  of  scientific 
inquiry,  and  it  is  therefore  not  possible  to  govern 
resource  systems  based  on  rationalistic  and  exact 
templates  such  as  sustainable  yield  or  carrying 
capacity  as  these  concepts  can  never  be  estimated 
in  any  precise  sense.  As  a  consequence,  new  man¬ 
agement  paradigms  such  as  adaptive  management 
or  adaptive  comanagement  are  put  forward  as 
necessary  readjustments  of  how  natural  resources 
should  be  governed.  Such  management  approaches 
typically  stress  the  role  of  learning,  innovation, 
local  knowledge,  stakeholder  participation,  and 
flexibility  as  core  virtues  of  resource  management. 
There  is  also  a  tendency  to  view  policy  making  as 
trial  and  error  or  as  a  constantly  ongoing  experi¬ 
ment,  rather  than  as  the  pursuit  of  specific  prede¬ 
termined  goals.  Critics  argue  that  adaptive  man¬ 
agement  and  adaptive  comanagement  are  built  on 
overly  optimistic  assumptions  about  the  effect  of 
stakeholder  involvement,  self-organization,  and 
participation  in  policy  making,  while  downplaying 
problems  such  as  power  and  resource  asymmetries 
and  the  need  for  accountability  and  predictability 
of  public  policies. 

The  second  theme  problematizes  the  dichoto¬ 
mous  conception  of  nature  and  society — in  which 
the  latter  governs  the  former — by  arguing  that  eco¬ 
logical  systems  and  social  and  political  systems  in 
reality  are  highly  interlinked,  constituting  so-called 
social-ecological  systems.  The  social-ecological 
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systems  view  of  natural  resource  management  pro¬ 
poses  that  rather  than  studying  the  effects  of  social 
organization  on  natural  resources,  analyses  must 
be  based  on  the  realization  that  society  and  nature 
are  locked  into  coupled  causal  relationships,  which 
means  that  they  cannot  be  studied  in  isolation 
from  each  other.  Closely  linked  to  this  approach 
are  studies  concerned  with  issues  of  adaptability 
and  vulnerability.  Against  the  backdrop  of  ongo¬ 
ing  processes  of  global  environmental  change,  such 
as  biodiversity  loss  and  climate  change,  scholars  in 
this  area  seek  to  understand  how  the  vulnerability 
of  people  situated  in  different  social-ecological 
systems  can  be  reduced  through  processes  of  adap¬ 
tation  to  novel  circumstances. 

Comparative  and  International  Perspectives 

Due  to  the  strong  emphasis  on  understanding  link¬ 
ages  between  institutional  factors  and  natural  sys¬ 
tems,  most  empirical  studies  of  natural  resource 
management  tend  to  take  the  form  of  comparative 
case  studies  of  small  or  medium-sized  CPRs.  There 
is,  however,  a  smaller  subset  of  large-N  compara¬ 
tive  studies  in  which  nation-level  political  factors 
are  used  to  explain  cross-national  variation  in 
natural  resource  management  performance.  An 
example  of  a  widely  discussed  topic  is  the  impact 
of  corruption  on  natural  resource  management. 
Some  studies  have  identified  a  negative  effect  of 
corruption  on  indicators  of  natural  resource  man¬ 
agement,  whereas  other  authors  argue  that  corrup¬ 
tion  obstructs  more  effective  resource  exploitation. 

Studies  of  the  effect  of  democracy  on  natural 
resource  management  show  a  similar  pattern: 
Some  studies  assert  that  stronger  and  more  encom¬ 
passing  democratic  regimes  generally  perform  bet¬ 
ter  in  managing  and  protecting  their  natural 
resources,  while  other  studies  claim  that  democ¬ 
racy,  due  to  its  intimate  relationship  with  market 
economies  and  the  need  for  sustained  economic 
growth,  inevitably  leads  to  mismanagement  of 
natural  resources. 

The  effect  of  decentralization  has  also  been 
studied  in  a  comparative  perspective,  focusing  on 
the  issue  of  a  possible  race-to-the  bottom  effect  of 
reforms  aiming  at  a  more  decentralized  manage¬ 
ment  of  natural  resources.  Some  studies  support 
the  existence  of  such  an  effect,  whereas  others 
argue  that  there  is  a  race  to  the  top. 


A  further  topic  of  comparative  natural  resource 
management  has  to  do  with  the  relationship 
between  the  market  economy  and  nature.  The  key 
notion  is  that  the  relationship  between  market 
economy  and  resource  degradation,  when  viewed 
over  time,  can  be  thought  of  as  an  inverted 
U-shaped  curve.  Economies  in  the  early  stages  of 
industrialization  rely  heavily  on  the  extraction  of 
natural  resources  for  growth  generation  and  there¬ 
fore  have  a  large  impact  on  ecological  systems. 
This  phase  of  economic  development  is  also 
marked  by  the  use  of  inefficient  and  crude  tech¬ 
nologies  that  contribute  to  further  environmental 
damage.  This  relationship  between  nature  and 
economy  persists  until  a  certain  point  of  economic 
wealth  is  reached,  beyond  which  the  curve  starts  to 
slope  downward.  In  this  new  phase,  economic 
growth  is  increasingly  generated  through  the  pro¬ 
duction  of  services  and  the  development  of  new 
technologies,  which  lessens  the  need  for  extraction 
of  crude  natural  resources.  In  addition,  some 
scholars  have  argued  that  people  in  more  affluent 
societies  start  to  value  things  other  than  material 
wealth,  a  process  that  again  serves  to  weaken  the 
relationship  between  economic  growth  and  envi¬ 
ronmental  degradation.  The  environmental 
Kuznets  curve  theory  (according  to  which  environ¬ 
mental  degradation  increases  while  a  country  is 
developing  but  decreases  as  the  GDP  rises)  has  not 
gone  unchallenged,  and  some  argue  that  it  is  only 
valid  for  a  smaller  class  of  end-of-pipe  type  of 
emissions,  whereas  other  negative  impacts  on  nat¬ 
ural  systems  such  as  biodiversity  loss,  carbon  diox¬ 
ide  emissions,  and  the  spread  of  toxins  follow  the 
level  of  economic  development  in  a  roughly  linear 
fashion.  Another  criticism  holds  that  the  environ¬ 
mental  Kuznets  curve  theory  might  be  less  appli¬ 
cable  to  currently  developing  countries  due  to  their 
access  to  already  available  clean  technologies. 

As  many  natural  resources  are  located  on 
regional  or  even  global  scales  (e.g.,  migratory  birds, 
sea  mammals,  and  global  fish  stocks),  the  politics 
of  natural  resources  is  also  to  be  found  in  the  inter¬ 
national  arena.  A  central  concept  for  many  schol¬ 
ars  studying  international  natural  resource  politics 
is  that  of  environmental  regimes.  This  concept  is 
similar  to  the  concept  of  institutions  in  CPR  studies 
but  is  normally  used  in  studies  of  how  international 
treaties  and  organizations  influence  the  behavior  of 
actors  in  the  international  environmental  arena. 
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Research  in  this  field  has  focused  on  understand¬ 
ing  mechanisms  underlying  regime  effectiveness  as 
well  as  on  how  regimes  are  constructed  and  evolve 
over  time. 

Emerging  Themes  in  Natural 
Resource  Management  Research 

A  number  of  significant  themes  are  currently 
emerging  in  natural  resource  management 
research.  An  already  ongoing  shift  is  the  gradual 
move  away  from  the  institution  as  the  central  unit 
of  analysis.  Instead,  theoretical  models  and 
research  designs  are  increasingly  framed  in  terms 
of  systems  theory  or,  more  specifically,  in  terms  of 
socioecological  systems.  The  heightened  interest  in 
how  social-ecological  systems  are  able  to  adapt  to 
novel  circumstances  caused  by  global  environmen¬ 
tal  change  in  turn  tends  to  play  down  focus  on 
action  problems  related  to  natural  resource  man¬ 
agement.  This  redirection  also  signals  a  renewed 
interest  in  complex  systems  theory  as  well  as  in 
notions  of  resilience  in  social-ecological  systems. 
Another  topic  of  recent  debate  concerns  method¬ 
ologies  in  CPR  research.  The  dominance  of  quali¬ 
tative  case  study  methods  has  recently  been 
complemented  with  more  quantitative  methods, 
including  systems  modeling  and  social  network 
analysis.  A  final  emerging  theme  in  natural 
resource  management  research  relates  to  ongoing 
shifts  toward  more  governance-oriented  forms  of 
policy  making,  which  is  likely  to  have  wide-ranging 
repercussions  for  the  way  in  which  natural  resources 
are  governed. 
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Neo-Corporatism 


Corporatism  is  an  idea  that  has  been  present  in 
political  and  social  life  for  at  least  9  to  10  centu¬ 
ries.  Its  origins  are  indisputably  European  and 
related  to  differing  conceptions  of  how  the  Roman 
Catholic  Church  and  medieval  cities  should  be 
governed.  Its  popularity,  however,  has  had  an 
erratic  fate — both  as  a  practice  in  political  life  and 
as  a  concept  in  political  theory.  It  has  been  her¬ 
alded  as  a  novel  and  promising  way  of  ensuring 
harmony  between  conflicting  social  groups,  and  it 
has  also  been  condemned  as  a  reactionary  and 
antidemocratic  formula  for  suppressing  the 
demands  of  autonomous  associations  and  move¬ 
ments.  In  other  words,  corporatism  has  always 
been  politically  controversial  and  conceptually 
ambiguous.  There  is  no  better  evidence  for  this 
than  the  frequency  with  which  it  is  so  often  pre¬ 
ceded  by  contradictory  qualifying  adjectives  or 
prefixes:  state  or  societal,  liberal  or  authoritarian, 
archeo-  or  neo-,  Catholic  or  secular,  macro-  or 
meso-,  voluntary  or  compulsory,  social  demo¬ 
cratic  or  conservative,  and,  most  recently,  national 
or  supranational. 

The  Sites  of  Neo-Corporatism 

After  the  defeat  of  fascism  in  Italy,  National 
Socialism  in  Germany,  and  various  other  authori¬ 
tarian  regimes  that  flourished  in  Europe  during 
the  interwar  period  (1919-1939) — almost  all 
of  which  claimed  to  be  practicing  some  form  of 
corporatism — the  concept  disappeared  from  the 
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lexicon  of  respectable  political  discourse,  except  in 
Franco’s  Spain  and  Salazar’s  Portugal,  where  the 
practice  was  left  anachronistically  on  display  until 
both  countries  transitioned  to  democracy  in  the 
mid-1970s. 

At  almost  the  same  time,  scholars  from  several 
countries  and  academic  disciplines  revived  the  con¬ 
cept  to  describe  certain  features  of  the  politics  of 
advanced  democratic  polities  that  did  not  seem 
adequately  accounted  for  by  the  dominant  model 
that  had  been  applied  to  state-society  relations — 
namely,  pluralism.  Austria,  Finland,  Norway, 
Sweden,  and  tiny  Luxembourg  were  singled  out  as 
archetypical  “neo-corporatist”  countries  that  had 
become  deeply  penetrated  by  this  type  of  interest 
politics  since  the  end  of  World  War  II.  Important 
traces  of  its  practice  on  a  less  centralized  or  more 
erratic  basis  have  been  observed  in  the  politics  of 
Belgium,  the  Federal  Republic  of  Germany, 
Denmark,  the  Netherlands,  Switzerland,  and  even 
in  postauthoritarian  Portugal  and  Spain.  Great 
Britain  and  Australia  have  attempted  to  establish 
similar  arrangements  without  success.  Elsewhere 
among  the  advanced  capitalist  democracies,  neo¬ 
corporatism  seems  confined  to  specific  sectors 
(especially  agriculture).  The  United  States,  Canada, 
and  New  Zealand,  for  example,  have  never  even 
tried  to  practice  it  at  the  macro  or  national  level; 
France  has  only  resorted  to  it  in  exceptional  circum¬ 
stances,  as  an  emergency  (and  short-lived)  measure. 

At  the  core  of  this  variation  in  practice  were  two 
political  prerequisites.  Unless  they  were  simultane¬ 
ously  present  (and  this  was  a  relatively  rare  occur¬ 
rence),  neo-corporatism  in  its  most  centralized  and 
effective  form  would  not  emerge:  (a)  a  relative  bal¬ 
ance  of  class  forces  between  capital  and  labor  and 
(b)  an  active  disposition  by  state  agencies  to 
promote  it.  War  and  its  aftermath  provided  one 
favorable  context,  as  did  the  proto-revolutionary 
situation  induced  by  protracted  economic  depres¬ 
sion.  Social-democratic  or  socialist  party  domi¬ 
nance  in  government,  especially  without  serious 
competition  from  far  left  communist  parties,  was 
another.  Both  of  these  contributed  to  overcoming  a 
major  impediment — namely,  the  reluctance  of  cap¬ 
italists  and  their  associations  to  participate  in  and 
to  be  bound  by  such  arrangements.  Only  when  the 
alternative  pluralist  course  of  action  was  not  viable 
did  they  embrace  neo-corporatism  and  even  will¬ 
ingly  seek  to  exploit  it  to  their  benefit.  Once  these 


constraints  waned,  as  both  did  in  the  1970s  and 
1980s  in  Western  Europe,  so  did  neo-corporatist 
policy  making — usually  due  to  the  defection  of  busi¬ 
ness  interests.  Neither  the  relative  balance  of  class 
forces  nor  the  simultaneous  success  of  left-wing  par¬ 
ties  characterized  North  America  or  the  countries  of 
the  White  Commonwealth  as  much  and,  hence,  its 
almost  complete  absence  in  the  United  States, 
Canada,  Australia,  and  New  Zealand — except, 
interestingly  enough,  for  a  brief  flirtation  with 
something  like  it  during  the  American  New  Deal. 

Definitions  of  Neo-Corporatism 

While  corporatism  has  been  defined  as  an  ideology, 
a  variant  of  political  culture,  a  type  of  state,  a  form 
of  economy,  or  even  as  a  kind  of  society,  the  most 
productive  approach  has  been  to  consider  it  as  one 
of  several  possible  arrangements  through  which 
organized  interests  can  intermediate  between  their 
members  (individuals,  families,  firms,  communities, 
groups)  and  various  interlocutors.  Representatives 
of  capital  and  labor  were  initially  assigned  the  most 
prominent  role,  although  more  recent  versions  have 
extended  participation  to  include  representatives  of 
other  interests  such  as  women,  consumers,  environ¬ 
mentalists,  neighborhoods,  youth,  and  so  forth. 
Agents  of  the  state  or  government  are  omnipresent 
in  such  arrangements  but  usually  in  a  facilitative 
rather  than  a  protagonistic  role.  They  may  or  may 
not  be  overtly  present  at  the  table,  but  those  who 
are  there  know  that  legitimate  coercion  may  be 
necessary  for  implementing  the  agreements  they 
reach  and  may  be  brought  to  bear  if  they  fail  to 
reach  a  consensus.  Central  to  all  these  negotiating 
processes  in  the  contemporary  context  is  the  role  of 
class,  sectoral,  or  professional  associations,  perma¬ 
nently  established  and  staffed,  that  specialize  in 
identifying,  advancing,  and  defending  the  interests 
of  their  members  by  negotiating  agreements  directly 
with  organized  representatives  of  conflicting  inter¬ 
ests  and/or  by  influencing  and  contesting  the  poli¬ 
cies  of  public  authorities.  Unlike  political  parties — 
the  other  principle  intermediaries  in  modern  poli¬ 
ties — these  organizations  neither  present  candidates 
for  electoral  approval  nor  accept  overt  responsibil¬ 
ity  for  forming  governments. 

The  definition  of  modern  corporatism  that  initi¬ 
ated  much  of  the  contemporary  discussion  is  that 
of  Philippe  Schmitter  (1974): 
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A  system  of  interest  representation  in  which  the 
constituent  units  are  organized  into  a  limited 
number  of  singular,  compulsory,  non-competitive, 
hierarchically  ordered  and  functionally 
differentiated  categories,  recognized  or  licensed 
(if  not  created)  by  the  state  and  granted  a 
deliberate  representational  monopoly  within  their 
respective  categories  in  exchange  for  observing 
certain  controls  on  their  selection  of  leaders  and 
articulation  of  demands  and  supports,  (pp.  93-94) 

This  approach  emphasizes  almost  exclusively  the 
input  side,  that  is,  the  organizational  structure  of 
interest  associations.  Gerhard  Lehmbruch  (1979) 
defined  what  he  called  “liberal  corporatism”  more 
from  the  perspective  of  decisional  outputs  as 

an  institutionalized  pattern  of  policy-formation 
in  which  large  interest  organizations  cooperate 
with  each  other  and  with  public  authorities  not 
only  in  the  articulation  (or  even  “intermediation”) 
of  interests,  but — in  its  developed  forms — in  the 
“authoritative  allocation  of  values”  and  in  the 
implementation  of  such  policies,  (p.  94) 

As  Alan  Cawson  (1986)  and  Peter  Williamson 
(1985)  note,  subsequent  definitions  tended  to 
combine  both  the  input  and  output  dimensions, 
and  it  became  virtually  axiomatic  that  successful 
neo-corporatism  required  their  coincidence.  Only 
where  the  interests  involved  were  organized  hierar¬ 
chically  into  monopolistic  and  comprehensive 
associations  was  it  thought  to  be  possible  to  reach 
and  to  implement  voluntary  agreements  on  such 
crucial  matters. 

The  Success  and  Failure 
of  Neo-Corporatism 

Once  the  authoritarian-fascist-statist  variety  of 
corporatism  had  been  virtually  extinguished — 
first,  by  the  post-World  War  II  wave  of  democra¬ 
tizations  and,  later,  by  the  post-1974  one — it 
became  increasingly  clear  which  were  the  polities 
most  successful  in  practicing  the  “societal”  version 
of  neo-corporatism:  small  European  countries 
with  well-organized,  relatively  centralized  class- 
based  associations  and  highly  vulnerable,  interna¬ 
tionalized  economies.  The  tendency  was  all  the  more 
marked  if  they  also  had  strong  social-democratic 


parties,  stable  electoral  preferences  and  ruling 
coalitions,  relative  cultural  homogeneity,  and 
neutral  foreign  policies.  Indeed,  those  that  had 
the  most  difficulty  sustaining  such  social  pacts 
had  weaker  social  democracies,  more  volatile 
electorates,  and  deeper  divisions  over  military 
and  security  issues,  for  example,  the  Netherlands 
and  Denmark.  Belgium’s  relative  lack  of  success 
in  reaching  such  voluntary  macrosocial  contracts 
could  be  traced  to  its  division  into  rival  linguistic 
groups. 

In  retrospect,  it  became  clear  that  neo-corporatism 
had  reached  its  apex  more  or  less  at  the  very  same 
time  when  it  was  discovered  and  labeled  as  such  by 
social  scientists.  During  the  rest  of  the  1970s,  the 
1980s,  and  well  into  the  1990s,  its  practice  seemed 
to  be  in  irrevocable  decline.  A  number  of  knowl¬ 
edgeable  scholars,  such  as  Scott  Lash  and  John 
Urry  (1987)  and  Mark  Gobeyn  (1993),  even 
declared  it  defunct  on  the  grounds  that  “post- 
Fordist”  systems  of  production,  the  decline  in 
political  commitment  to  full  employment,  the 
demise  of  Keynesian  economics,  the  rise  of  neolib¬ 
eral  ideology,  and  the  growing  weakness  of  trade 
unions — not  to  mention  the  growing  significance 
of  globalization  and  Europeanization — all  con¬ 
spired  against  its  viability.  When  Swedish  capitalists 
noisily  withdrew  in  1991  from  all  policy-making 
instances  they  shared  with  organized  labor,  its 
death  warrant  was  thought  to  have  been  issued. 
Prior  to  this — and  for  a  period  stretching  back  to 
the  late  1930s — Sweden  was  rightfully  regarded  as 
the  archetypal  practitioner  of  neo-corporatism. 

The  Resurgence  and  Transformation 
of  Neo-Corporatism 

And  then,  just  as  it  was  supposed  to  disappear  alto¬ 
gether,  neo-corporatism  dramatically  reemerged 
toward  the  end  of  the  1990s,  leading  at  least  two 
observers  to  predict  that  maybe  neo-corporatism 
was  a  cyclical  product  related  to  the  vicissitudes  of 
capitalist  performance  and  the  public’s  shifting 
involvement  with  public  and  private  goods  and, 
hence,  destined  to  revive — Sisyphus  like — every  20 
to  25  years  or  so,  only  to  fail  again  at  the  end  of  a 
similar  period.  European  integration,  which  had 
been  considered  a  negative  factor  because  it  tended 
to  impose  a  neoliberal  bias  on  policy  making  and 
reduced  the  degrees  of  freedom  available  to  national 
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politicians,  suddenly  became  a  positive  factor  as  a 
series  of  countries  used  neo-corporatist  mecha¬ 
nisms  to  prepare  themselves  for  entry  into  the  Euro 
zone  and  to  improve  their  competitive  advantage 
when  other  policy  alternatives  had  been  eliminated. 
Scholars  competed  with  each  other  to  explain  why 
policy  concertation  and  tripartism  had  again 
become  so  appealing  (Giuseppe  Fajertag  &  Phillipe 
Pochet,  1997). 

Closer  observation  of  this  revival  would  reveal 
some  important  differences.  In  a  few  cases,  it 
involved  reanimating  tripartite  bargaining  arrange¬ 
ments  that  had  existed  in  the  recent  past  with  the 
participation  of  more  or  less  the  same  actors.  The 
Netherlands,  Denmark,  and  Finland  were  cases  in 
point;  even  in  the  much  trumpeted  case  of  Sweden, 
elements  of  neo-corporatism  seemed  to  have  crept 
back  into  its  policy  practices  at  a  lower  and  less  vis¬ 
ible  level  of  aggregation,  especially  after  the  return 
of  the  Social  Democrats  to  power.  What  was  defi¬ 
nitely  novel,  however,  was  its  emergence  in  coun¬ 
tries  that  had  either  not  practiced  it  at  all  or  done  so 
on  an  erratic  basis.  Moreover,  the  participants  in 
these  cases  (Ireland,  Italy,  Spain,  Portugal,  and 
Greece)  were  not  organized  “properly”  according 
to  prevailing  theory.  The  associations  representing 
capital  and  labor  in  these  negotiations  were  often 
weak  in  membership  density,  poorly  coordinated  at 
the  national  level,  fragmented  into  competing  ideo¬ 
logical  units,  and  manifestly  less  capable  of  ensuring 
the  compliance  of  the  firms  and  individuals  in  their 
respective  categories.  And,  in  some  instances,  the 
old  tripartite  formula  gave  way  to  a  multipartite 
one  in  which  representatives  of  other  social  interests 
(and  even  passions)  were  allowed  to  sit  at  the  table: 
agriculturists,  feminists,  consumers,  environmental¬ 
ists,  local  governments,  youths,  and  other  assorted 
groups  of  “policy  takers.”  As  if  this  was  not  enough, 
the  subject  matter  being  negotiated  was  less  focused 
on  inflation,  wage  contention,  social  peace,  and 
redistributive  side  payments  than  with  things  such 
as  restructuring  welfare  systems,  introducing  more 
flexible  work  procedures,  reducing  national  budget 
deficits,  improving  competitiveness  at  the  sectoral 
or  plant  level,  meeting  environmental  standards, 
and  ensuring  gender  equality.  And  agents  of  the 
state  played  a  much  more  active  and  visible  role  in 
convoking  these  negotiations  and  pressuring  the 
participants  to  reach  agreement.  Increasingly,  they 
were  even  prepared  to  decree  the  results  and  make 


them  publicly  binding  when  faced  with  dissent, 
especially  by  trade  unions. 

The  change  in  policy  content  may  explain  why 
neo-corporatist  policy  making  could  produce  results 
after  the  revival  of  the  1990s  without  the  collabora¬ 
tion  of  neo-corporatist  interest  intermediaries.  The 
growing  imbalance  of  forces  between  capital  and 
labor  due  to  globalization  had  already  put  the  latter 
on  the  defensive  and,  therefore,  more  likely  to  agree 
to  negotiate  in  retreat  in  defense  of  existing  jobs, 
wages,  and  benefits  under  the  impending  threat  of 
“delocalization.”  It  may  also  have  lessened  their 
opposition  to  including  new  participants  on  the 
grounds  that  farmers,  consumers,  and  environmen¬ 
talists  might  prove  to  be  allies  on  specific  issues.  But 
the  real  difference  has  involved  the  increased  role  of 
firms  and  state  agencies.  The  burden  of  implement¬ 
ing  most  of  these  “new”  policies  no  longer  rests  on 
the  autonomous  capacity  of  class  associations  to 
deliver  the  compliance  of  their  members.  Many  are 
only  “recommended  standards”  or  “voluntary 
guidelines”  at  the  macro  or  meso  levels,  and  their 
implementation  is  largely  at  the  discretion  of  indi¬ 
vidual  enterprises  or  plant-level  works  councils  at 
the  microlevel.  Also,  those  that  are  intended  to  be 
binding  on  all  those  affected  can  be  imposed  by  the 
government  through  “legitimate  coercion.”  This 
has  become  much  more  credible  since  the  conver¬ 
gence  in  programs  and  policies  between  Right  and 
Feft  parties  ensures  that  these  policies  will  not  vary 
with  rotations  in  power.  The  former  association 
of  neo-corporatism  with  social-democratic  or  left- 
center  governing  coalitions  has  become  less  rele¬ 
vant.  Indeed,  analysts  have  argued  that  it  has  either 
become  indifferent  to  the  party  in  power  or  that  it 
is  the  favored  practice  when  weak,  multiparty  coali¬ 
tions  rule,  on  the  grounds  that,  under  such  centrist 
governments,  neither  capital  or  labor  can  rely  on 
especially  favored  treatment  by  the  legislature  or  the 
executive,  and  therefore  they  have  to  resort  to  the 
second-best  alternative  of  negotiating  compromises. 

So  substantial  have  been  these  changes  to  the 
practice  of  neo-corporatism  since  the  late  1990s 
that  it  is  certainly  legitimate  to  question  whether 
the  same  label  should  be  attached  to  it. 

The  Performance  of  Neo-Corporatism 

Protracted  neo-corporatism  at  the  national  or 
macroeconomic  level  has  been  convincingly  linked 
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to  certain  desirable  outcomes  during  the  post-war 
period  until  the  mid-1970s:  less  unruliness  of  the 
citizenry,  lower  strike  rates,  more  balanced  bud¬ 
gets,  greater  fiscal  effectiveness,  lower  rates  of 
inflation,  less  unemployment,  less  income  inequal¬ 
ity,  less  instability  at  the  level  of  political  elites, 
and  less  of  a  tendency  to  exploit  the  “political 
business  cycle” — all  of  which  suggested  that  coun¬ 
tries  scoring  high  on  this  property  were  likely  to  be 
more  governable.  Subsequent  econometric  studies 
with  more  recent  data  have  questioned  these  find¬ 
ings,  and  no  one  has  ever  been  able  to  show  that 
neo-corporatist  systems  have  been  capable  of 
higher  rates  of  economic  growth.  In  the  turbulent 
times  at  the  end  of  the  1990s  and  at  the  beginning 
of  this  century,  as  we  have  noted  above,  policy 
concentration  among  social  classes,  sectors,  and 
professions  has  shifted  toward  matters  such  as 
improving  productivity,  encouraging  worker  flex¬ 
ibility,  and  reforming  welfare  systems. 

Regardless  of  its  effects  on  the  economy 
and  society,  neo-corporatism  has  long  had  an 
ambiguous  and  contestable  relation  to  the  polity — 
especially  with  regard  to  democracy.  From  its 
rediscovery  in  the  mid-1970s,  corporatism  has 
borne  the  burden  of  its  past  association  with  fas¬ 
cism  and  other  forms  of  authoritarian  rule.  To 
describe  a  polity  or  practice  as  “corporatist”  was 
practically  synonymous  with  accusing  it  of  being 
undemocratic.  Moreover,  some  of  its  enduring  fea¬ 
tures  seemed  to  confirm  this  suspicion:  Organizations 
replaced  persons  as  the  principal  agents  in  political 
life;  specialized  professional  representatives  gained 
at  the  expense  of  generally  interested  citizens  and 
broadly  aggregative  political  parties;  privileged  (if 
not  exclusive)  access  was  accorded  to  particular 
associations;  monopolies  were  recognized  and  even 
extolled  at  the  expense  of  overlapping  and  compet¬ 
ing  intermediaries;  organizational  hierarchies  reach¬ 
ing  up  to  very  comprehensive  national  peak  asso¬ 
ciations  diminished  the  autonomy  of  more  local 
and  specialized  organizations;  and  decisions  were 
made  by  secretive  negotiations,  rather  than  by  pub¬ 
lic  tallying  of  votes. 

As  inquiry  into  corporatism  expanded,  how¬ 
ever,  judgment  about  its  impact  on  democracy 
shifted.  For  one  thing,  many  of  the  countries  that 
are  manifestly  corporatist  are  also  obviously  dem¬ 
ocratic  in  the  sense  that  they  protect  the  full  range 
of  civic  freedoms,  define  citizenship  in  the  broadest 


fashion,  hold  regular  competitive  elections  of 
uncertain  outcome,  render  political  authorities 
accountable  for  their  actions,  and  pursue  public 
policies  that  seem  responsive  to  popular  demands. 
Some  of  them,  especially  those  in  Scandinavia,  have 
even  been  in  the  vanguard  of  experimentation, 
with  such  advanced  democratic  measures  as  worker 
participation  in  management,  open  disclosure  of 
policy  processes,  ombudsman  arrangements  for 
hearing  citizen  complaints,  public  financing  of 
political  parties,  and  even  wage-earner  funds  for 
extending  popular  ownership  of  the  economy. 

Also,  it  soon  became  apparent  that  corporatist 
arrangements  have  a  substantial  impact  on  the 
conditions  under  which  competing  interests  can 
participate  to  influence  the  process.  The  spontane¬ 
ous,  voluntaristic,  and  episodic  relations  of  plural¬ 
ism  seem  freer  in  principle,  but  in  practice  they 
produce  a  greater  inequality  of  access  to  those  in 
power.  Privileged  groups  with  smaller  numbers, 
concentrated  resources,  and  more  compact  loca¬ 
tion  have  a  natural  advantage  over  larger  and  dis¬ 
persed  ones  such  as  workers  and  consumers. 
Corporatism  tends  to  even  out  the  distribution  of 
resources  across  more  comprehensively  organized 
categories  and  to  guarantee  at  least  a  formal  parity 
of  access  to  the  making  of  decisions.  Moreover,  the 
direct  incorporation  of  associations  into  subse¬ 
quent  implementation  processes  may  ensure  greater 
responsiveness  to  group  needs  than  the  “arms 
length”  relationship  that  separates  the  public  and 
the  private  realm  under  pluralism. 

Evaluating  the  impact  of  corporatism  on  democ¬ 
racy  depends  very  much  on  which  qualities  of 
democracy  one  chooses  to  use.  Seen  from  the  clas¬ 
sical  perspective  of  encouraging  the  participation  of 
individuals  in  the  decisions  that  collectively  affect 
them  and  of  ensuring  that  all  public  authorities 
accord  equal  accessibility  to  citizen  demands,  these 
arrangements  have  a  negative  effect.  But  when 
viewed  from  a  more  output-oriented  perspective 
that  asks  whether  those  in  power  can  be  held  effec¬ 
tively  accountable  for  their  actions  and  whether 
these  actions  are  likely  to  be  responsive  to  citizen 
needs,  the  judgment  of  corporatism  is  bound  to  be 
more  positive.  Its  impact  on  the  central  mechanism 
of  democracy — competitiveness — is  more  ambigu¬ 
ous.  On  the  one  hand,  this  is  diminished  by  elimi¬ 
nating  the  struggle  between  rival  associations  for 
membership  and  access.  On  the  other  hand,  it  is 
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enhanced  by  encouraging  rival  conceptions  of  com¬ 
mon  interest  to  express  themselves  within  the  same 
association.  One  can  conclude  that  modern  democ¬ 
racies  are  being  transformed  by  the  practice  of 
modern  corporatism.  Organizations  are  becoming 
citizens  alongside,  if  not  in  the  place  of,  individuals. 
Accountability  and  responsiveness  are  increasing 
but  at  the  expense  of  participation  and  access. 
Competitiveness  is  less  interorganizational  and 
more  intraorganizational.  The  pace  is  uneven,  the 
acceptance  is  unequal,  and  the  outcome  is  by  no 
means  unequivocal,  but  democracy  in  almost  all 
modern  societies  is  becoming  more  interested,  more 
organized,  and  more  indirect. 

The  Future  of  Neo-Corporatism 

With  the  dramatic  crash  of  late  2008,  the  condi¬ 
tions  that  have  previously  promoted  or  impeded 
neo-corporatism,  tripartism,  policy  concentration, 
and  social  pacting,  or  whatever  it  should  be  called, 
have  radically  altered.  After  years  of  decline  in  the 
balance  of  forces  between  capital  and  labor  in  favor 
of  the  former,  the  terms  of  encounter  are  suddenly 
no  longer  the  same.  The  hegemony  of  business 
interests  has  been  seriously  undermined  by  the  col¬ 
lapse  of  neoliberal  ideology  as  well  as  the  revela¬ 
tions  of  fraud  and  misconduct  by  financial  interests. 
Materially  speaking,  many  enterprises  have  been 
devastated  in  their  balance  sheets,  and  their  recov¬ 
ery  to  profitability  will  require  the  willing  coopera¬ 
tion  of  labor.  Whether  mass  unemployment  will 
reach  the  levels  of  the  1930s  and  trigger  a  poten¬ 
tially  disruptive  collective  response  by  workers  and 
citizens  has  yet  to  be  seen.  Moreover,  the  initial 
reaction  by  state  authorities — even  in  regimes 
dominated  by  conservative  parties — demonstrates 
that  they  are  not  just  disposed  but  anxious  to  inter¬ 
vene.  So  far,  their  emergency  measures  have 
involved  distributing  massive  welfare  to  capitalists 
and  no  high-level  negotiations  with  labor.  There 
has  simply  not  been  sufficient  time  for  tripartite 
policy  concentration.  But  eventually — if  the  past 
scenario  is  any  guide  to  the  future — this  combina¬ 
tion  of  factors  could  well  lead  to  yet  another 
revival  of  it,  especially  in  small,  relatively  homoge¬ 
neous  and  internationally  vulnerable  countries. 
The  only  safe  prediction,  however,  is  that,  if  and 
when  it  returns,  it  will  not  be  the  same  in  either 
form  or  substance.  Perhaps,  then,  we  can  finally 


declare  that  neo-corporatism  is  dead  and  gone, 

and  begin  calling  it  something  else. 
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Neoliberal  Institutionalism 


Neoliberal  institutionalism  (NLI)  is  an  umbrella 
term  for  liberal  research  programs  in  the  study  of 
international  relations  (IR)  that  focus  on  the  coop¬ 
erative  role  of  institutions.  At  its  core,  NLI  argues 
that  international  cooperation  is  possible  and 
most  readily  achievable,  with  the  creation  and 
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maintenance  of  international  institutions  broadly 
defined.  Both  formal  and  informal  institutional 
arrangements  are  the  subjects  of  NLI  analysis. 
Formal  institutions  include  multilateral  organiza¬ 
tions  such  as  the  United  Nations  (UN),  the  World 
Trade  Organization  (WTO),  and  the  European 
Union  (EU).  States  create  and  voluntarily  submit  to 
such  institutions,  which  possess  collective  goals 
and  establish  mechanisms  to  achieve  them.  Informal 
institutional  arrangements,  or  regimes,  are  also 
voluntarily  created  by  states  and  constitute  sets  of 
implicit  and  explicit  principles,  norms,  rules,  and 
procedures  around  which  actor’s  expectations  con¬ 
verge  in  a  particular  issue  area.  A  well-established 
regime  exists  in  the  issue  area  of  capitalist  free 
trade,  for  example,  since  both  formal  institutions 
and  implicit,  collectively  shared  principles,  norms, 
rules,  and  procedures  are  foundational  to  such 
activity. 

NLI  argues  that  both  formal  institutions  and 
regimes  have  the  potential  to  create  lasting  bonds 
between  nation-states.  They  do  so  by  facilitating 
iterated  interaction,  diffusing  information,  height¬ 
ening  transparency,  and  lessening  the  ability  of 
actors  to  defect  from  institutional  agreements.  By 
normalizing  rules  and  regulations  in  this  way, 
institutions  promote  an  environment  of  trust  in 
which  nation-states  can  obtain  a  variety  of  collec¬ 
tive  gains  that  they  would  otherwise  eschew.  Thus, 
institutions  aid  in  efficiently  solving  collective 
action  dilemmas,  particularly  in  areas  that  do  not 
involve  security  issues. 

As  the  term  indicates,  NLI  is  part  of  the  larger 
theoretical  category  of  IR  liberalism,  and  it  is  pre¬ 
mised  on  basic  liberal  assumptions  about  the 
importance  of  rationality,  information,  iteration, 
and  institutional  arrangements  to  cooperative  out¬ 
comes  in  IR.  NLI  also  constitutes  a  renewed  con¬ 
frontation  with  realism — hence  its  “neo”  status.  It 
emerged  in  the  late  1970s  as  a  theoretical  com¬ 
petitor  to  neorealism,  which  argues  that  there  are 
debilitating  constraints  on  the  efficacy  of  interna¬ 
tional  cooperation  in  an  anarchic  world  of  self- 
interested,  egocentric  nation-states.  In  neorealist 
analyses,  the  anarchic  international  environment 
fosters  uncertainty  and  suspicion,  which  then  pro¬ 
duce  frequent  security  dilemmas,  volatile  alliances, 
and  ongoing  trade  and  military  competition.  In 
such  an  environment,  international  cooperation  is 
difficult  to  obtain,  even  when  both  parties  would 


gain  from  the  effort,  due  to  fear  that  any  relatively 
greater  gains  will  be  employed  for  competitive 
purposes.  Thus,  NLI’s  theoretical  competitor  tends 
to  underscore  the  dangers  of  defection  by  high¬ 
lighting  the  distrust  between  actors  engaged  in 
potential  agreements  with  one  another. 

NLI  challenges  the  pessimistic  conclusions  of 
neorealist  analysis,  but  it  does  so  by  adopting 
many  of  the  same  analytical  assumptions.  It  is  for 
this  reason  that  neorealism  and  NLI  are  sometimes 
categorized  together  epistemologically  and  why 
NLI  can  be  distinguished  from  earlier  liberal  vari¬ 
ants  such  as  pluralism.  Both  NLI  and  neorealist 
scholars  agree  that  the  international  system  is 
anarchic,  which  means  that  there  is  no  interna¬ 
tional  authority  or  government  to  force  states  to 
comply  with  demands  or  cooperate  with  one 
another.  As  a  result,  states  must  help  themselves 
and  look  out  for  their  own  individual  survival. 
NLI  concurs  that  international  cooperation  will  be 
difficult  to  obtain  in  an  anarchic  international 
environment  that  induces  fear  and  uncertainty. 
Both  theories  focus  on  the  state  as  the  main  actor 
in  international  affairs.  While  NLI  and  neorealist 
scholars  recognize  that  other  international  actors 
do  exist  and  can  influence  international  affairs, 
each  theoretical  lens  views  the  state  as  the  primary 
mover  of  international  politics.  And  both  perspec¬ 
tives  view  the  state  as  a  rational,  utility-maximizing, 
and  self-interested  governing  entity.  In  doing  so, 
both  are  heavily  indebted  to  the  study  of  econom¬ 
ics,  and  NLI  is  sometimes  referred  to  as  rational- 
choice  institutionalism  as  a  result. 

NLI  and  neorealism  differ,  however,  over 
whether  the  fear  of  relative  gains  is  a  primary 
inhibitor  in  international  cooperation.  NLI  argues 
that  states  can  be  motivated  to  cooperate  in  order 
to  achieve  absolute  gains  (or  the  totality  of  gains 
achieved  by  an  actor  regardless  of  the  relative 
gains  of  others)  if  their  fears  of  being  cheated  by 
one  another  can  be  mitigated.  This  is  where  insti¬ 
tutions  play  a  key  role  in  NLI  analysis.  NLI  schol¬ 
ars  argue  that  institutions  mitigate  the  effects  of 
anarchy,  thereby  making  the  realization  of  abso¬ 
lute  gains,  and  hence  the  possibility  of  interna¬ 
tional  cooperation,  more  likely.  Institutions  do  so 
because  they  reflect  mutually  accepted  rules  and 
regulations  that  are  created  by  the  institutional 
actors  themselves.  Because  actors  have  a  say  in 
what  is  allowed,  and  disapproved  of,  there  is  an 
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assumption  that  actors  will  not  agree  to  terms  that 
they  cannot  meet,  thus  making  institutional  agree¬ 
ments  fairly  straightforward  and  easy  to  uphold. 
Institutions  also  foster  transparency  and  informa¬ 
tion  sharing.  They  provide  actors  with  access  to 
data  about  other  institutional  partners  to  which 
the  former  would  not  normally  be  privy.  The  more 
information  that  actors  possess  the  better,  as  this 
diminishes  feelings  of  anxiety,  apprehension,  and 
distrust.  And,  because  the  rules  of  the  game  are 
well  specified  in  institutional  arrangements,  actors 
are  well  aware  of  the  institutional  expectations 
that  exist.  The  more  information  actors  have,  the 
better  they  feel  about  taking  part  in  an  interna¬ 
tional  agreement  that  makes  them  dependent  on 
other  actors  and,  arguably,  more  vulnerable. 

NLI  also  stresses  the  importance  of  long-term 
gains.  NLI  scholars  note  that  institutions  provide  a 
forum  for  repeated  interaction,  and  they  argue  that 
actors  engaged  in  repeated,  continuous  interac¬ 
tions  are  less  likely  to  defect  from  cooperative 
arrangements.  In  making  this  argument,  NLI  bor¬ 
rows  from  game  theory  and  the  multiple  game 
scenarios  (prisoners’  dilemma,  chicken,  tit-for-tat) 
that  materialize  when  actors  seek  to  engage  in 
bilateral  and  multilateral  agreements.  These  can  be 
distinguished  from  one-shot-only  interactions, 
which,  NLI  argues,  is  the  assumption  of  neorealist 
analysis.  Alternatively,  NLI  argues  that  once  actors 
submit  to  an  institutional  agreement,  they  become 
locked  in.  Actor  interaction  becomes  more  fre¬ 
quent  and  common. 

Institutions  are  thus  viewed  as  mechanisms  that 
can  create  long-term  and  long-lasting  benefits  for 
states  in  areas  related  to  security,  human  rights, 
health,  and  the  international  political  economy. 
NLI  views  iterated  interaction  as  a  positive  spill¬ 
over  effect  of  institutional  agreements.  The  more 
actors  interact,  the  more  they  come  to  trust  one 
another  and  learn  the  preferences  that  others  pos¬ 
sess.  Iterated  interaction,  therefore,  catalyzes  coop¬ 
eration  among  divergent  actors.  Actors  are  less 
likely  to  engage  in  hostile  interaction  because  they 
“know”  one  another.  More  important,  institu¬ 
tional  arrangements  lead  actors  to  depend  on  one 
another.  This  dependence  lessens  the  desire  among 
actors  to  defect  or  cheat  because  they  too  can  be 
hurt — economically  and  politically — by  this  mis¬ 
behavior.  Consequently,  defectors  are  left  vulner¬ 
able  because  they  can  no  longer  depend  on  the 


actions  of  others  to  fill  the  institutional  needs  that 
had  been  previously  met. 

Finally,  NLI  argues  that  institutions  diminish  the 
ability  of  actors  to  free  ride.  Free  riding  occurs  when 
one  actor  benefits  from  the  actions  of  other  actors 
without  paying  the  price  or  anteing  up.  Institutions 
not  only  spell  out  requirements  that  actors  are 
expected  to  abide  by  and  uphold  but  also  delineate 
punishments  for  failing  to  comply  with  one’s  insti¬ 
tutional  mandate.  This  reduces  the  ability,  or  desire, 
of  other  actors  to  cheat  or  free  ride  in  turn  because 
there  are  known  consequences  for  noncompliance, 
like  sanctions  or  institutional  shunning.  Although  it 
can  be  difficult  to  enforce  institutional  punishments, 
the  mere  fact  that  such  mechanisms  exist  can  dis¬ 
suade  some  potential  institutional  dissenters. 

For  NLI  scholars,  institutional  agreements  do 
not  impede  state  sovereignty  because  states  still 
retain  final  authority  over  foreign  and  domestic 
policy  decisions.  Instead,  NLI  notes  that  institu¬ 
tions  reduce  the  autonomy  of  states  to  act  unilater¬ 
ally,  which  means  that  states  are  forced  to  consider 
the  needs  and  interests  of  other  actors  if  they  wish 
to  obtain  their  own  interests  in  turn.  This  creates  a 
more  pacifistic  and  harmonious  international  envi¬ 
ronment  because  states  will  not  make  irrational 
decisions  based  solely  on  their  own  desires  and 
wants.  Instead,  they  learn  to  compromise  and  take 
into  account  other  actors  while  at  the  same  time 
obtaining  their  own  long-term  interests. 

Although  institutions  can  mitigate  the  effects  of 
anarchy  in  these  ways,  NLI  acknowledges  that  IR 
prior  to  the  20th  century  conformed  to  neorealist 
expectations.  Two  historical  developments  created 
a  window  of  opportunity  for  the  spread  of  formal 
and  informal  institutions  in  the  20th  century.  The 
first  was  the  development  of  complex  interdepen¬ 
dence.  Thanks  to  modern  technological  and  indus¬ 
trial  advances,  nation-states  have  become  more 
entwined  at  almost  every  level  and  issue  area.  This 
has  produced  multiple  channels  of  interaction 
between  state  and  nonstate  actors,  and  this  allows 
actors  to  more  easily  identify  their  common  inter¬ 
ests,  overcome  barriers  to  collective  action,  and 
reach  cooperative  agreements  to  obtain  common 
interests.  It  has  also  encouraged  nonmilitary  prob¬ 
lem  solving  in  areas  such  as  environmental  degra¬ 
dation,  health  crises,  and  human  rights  abuses. 

The  second  historical  development  that  encour¬ 
aged  the  spread  of  international  institutions  was 
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that  of  the  United  States  playing  the  role  of  a  hege¬ 
mon  after  World  War  II.  Borrowing  again  from  the 
study  of  international  economics,  NLI  scholars 
argue  that  the  support  of  a  very  powerful  state  is 
usually  necessary  for  nation-states  to  have  confi¬ 
dence  in  and  engage  with  free  trade.  The  United 
States  created  and  promoted  a  variety  of  multilat¬ 
eral  institutions,  thereby  producing  a  period  of 
hegemonic  stability  in  the  latter  half  of  the  20th 
century.  While  this  promotion  was  driven  by  self- 
interest,  it  served  as  the  necessary  foundation  for  the 
subsequent  growth  of  international  institutions  and 
regimes  and  the  cooperative  benefits  they  provide. 

It  is  important  to  acknowledge  that  NLI  does 
not  posit  that  institutions  always  matter.  This  is 
because  institutions  are  not  infallible.  Institutions 
can  aid  in  achieving  but  do  not  guarantee  collec¬ 
tive  gains.  NLI  scholars  are  cognizant  of  the  fact 
that  institutions  can  break  down  or  fail  to  achieve 
a  desired  collective  action  outcome;  therefore,  how 
institutions  are  designed — or  the  rational  design  of 
institutions — is  an  important  focus  and  research 
agenda  within  NLI.  Another  emerging  branch  of 
NLI,  known  as  principal-agent  theory,  examines 
how  nation-states  delegate  tasks  and  authority  to 
international  institutions  and  whether  this  achieves 
the  desired  objectives. 
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Neoliberalism 


Neoliberalism,  as  the  prefix  neo  suggests,  is  an  old 
concept  that  reemerged  as  a  policy  response  to  the 
crisis  of  Keynesianism  and  was  made  popular  by 
the  political  ascendancy  of  Margaret  Thatcher  in 
Great  Britain  and  Ronald  Reagan  in  the  United 
States  in  the  late  1970s  and  early  1980s.  While 
neoliberalism  has  become  a  central  concept  in  the 
social  sciences  describing  the  structural  changes  in 
the  global  economy  since  the  1970s,  the  concept  is 
much  contested.  At  the  most  fundamental  level, 
neoliberalism  builds  on  the  classical  liberal  notion 
implying  the  triumph  of  market  forces  and  indi¬ 
vidual  autonomy  over  state  power.  But  there  is  a 
considerable  normative  divergence  between  advo¬ 
cates  of  neoliberal  ideas,  who  celebrate  the  ascen¬ 
dancy  of  the  market,  and  those  who  suggest  that 
the  policies  of  neoliberalism  are  associated  with 
global  inequality,  economic  disparity,  growth  of 
unemployment,  social  exclusion,  environmental 
destruction,  and  cultural  homogeneity.  Optimists 
stipulate  that  unfettered  market  forces  will  result 
in  global  prosperity,  freedom,  democracy,  and 
peace.  For  pessimists,  neoliberalism  has  become  an 
ideological  construct  associated  with  radical  mar¬ 
ket  fundamentalism  based  on  the  universal  imper¬ 
atives  of  competitive  deregulation,  liberalization, 
and  privatization.  This  latter  interpretation  is  often 
used  synonymously  with  the  concept  of  an  exploit¬ 
ative  form  of  neoliberal  economic  globalization. 

Defining  neoliberalism  is  all  the  more  difficult 
because  the  concept  as  it  emerged  first  in  the  1930s 
differs  fundamentally  from  the  form  in  which  it 
reemerged  in  the  1970s.  In  fact,  Andreas  Renner, 
of  the  Walter  Eucken  Institute  in  Germany,  sug¬ 
gests  that  there  are  two  neoliberalisms.  One  is  a 
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continental  European  (i.e.,  German)  version  and 
the  other  an  Anglo-Saxon  interpretation. 
Historically,  the  European  concept  of  neoliberalism 
originated  in  the  1930s  in  opposition  to  the  Anglo- 
Saxon  laissez-faire  liberalism  of  self-regulating 
markets.  The  best  account  of  such  a  laissez-faire 
economic  system  is  found  in  Karl  Polanyi’s  The 
Great  Transformation ,  in  which  he  argues  that 
the  collapse  of  the  international  economic  system  in 
the  1930s  was  a  direct  consequence  of  the  attempt 
to  organize  the  economy  on  the  basis  of  laissez-faire 
ideas  influenced  by  the  British  and  Austrian  schools 
of  liberal  (laissez-faire)  economics.  In  today’s  social 
sciences,  the  terms  laissez-faire  and  neoliberalism 
are  used  interchangeably,  referring  to  the  ascen¬ 
dancy  of  the  market  over  state  authority.  The  his¬ 
torical  origin  of  neoliberalism  tells  a  different  story. 
The  next  section  explores  the  origin  of  this  concept 
and  its  relations  to  laissez-faire  liberalism  of  the 
19th  century  before  turning  to  the  reemergence  of 
neoliberalism  in  the  late  1970s  and  early  1980s. 

Historical  Origin  of  Continental  European 
Neoliberalism  and  Anglo-Saxon 
Laissez-Faire  Liberalism 

According  to  the  German  economist  Wilhelm 
Ropke,  the  term  neoliberalism  was  coined  in  Paris 
in  1938  at  a  Colloque  Walter  Lippmann,  a  sympo¬ 
sium  held  to  discuss  Walter  Lippmann’s  recently 
released  book,  The  Good  Society.  The  participants 
at  the  Paris  meeting  chose  the  term  neoliberalism 
to  signal  the  creation  of  a  new  liberal  movement 
against  the  laissez-faire  liberalism  of  the  1 9th  cen¬ 
tury.  The  historical  importance  of  this  neoliberal 
circle — consisting  of  members  such  as  Wilhelm 
Ropke,  Alfred  Muller- Armack,  Alexander  Riistow, 
Walter  Eucken,  and  Franz  Bohm — is  that  these 
ideas  and  norms  became  the  basis  for  the  continen¬ 
tal  social  market  economy  of  the  1950s.  While  not 
all  members  endorsed  the  term  neoliberalism ,  it 
nevertheless  became  an  umbrella  designation  for 
different  trends  of  liberalism  developed  under  its 
roof,  of  which  the  Freiburger  school,  also  referred 
to  as  Ordoliberalism,  is  the  most  well-known 
group  with  Walter  Eucken  and  Franz  Bohm  as  its 
most  renowned  representatives. 

The  exponents  of  this  neoliberal  circle  united  in 
rejecting  the  economic  reductionism  they  perceived 
as  central  to  the  ideas  of  19th-century  laissez-faire 


liberalism.  Instead,  they  emphasized  a  normative- 
ethical  foundation  of  economics,  delineating  an 
important  role  for  governments  to  set  the  institu¬ 
tional  parameters  for  economic  competition  in 
order  to  serve  the  larger  interests  of  society.  These 
intellectual  proponents  of  neoliberalism  combined 
economic  efficiency  with  human  decency  to  achieve 
a  just  and  stable  social  order.  As  suggested  by  the 
term  social  market  economy,  which  developed 
from  the  earlier  neoliberal  circle  and  is  still  used 
today  to  describe  some  of  the  continental  European 
economic  models,  the  belief  in  the  self-regulatory 
capacity  of  the  market  was  rejected. 

In  contrast  to  the  continental  European  school 
of  neoliberalism,  Anglo-Saxon  laissez-faire  philoso¬ 
phers  and  economists — such  as  Jeremy  Bentham, 
David  Ricardo,  Thomas  Malthus,  Edmund  Burke, 
and,  from  the  Austrian  economic  school,  Ludwig 
von  Mises  and  Friedrich  August  von  Hayek — 
developed  the  theoretical  foundation  for  claiming 
the  superiority  of  economic  freedom  over  public 
intervention.  Characteristic  of  laissez-faire  liberal¬ 
ism  as  practiced  in  the  early  20th  century  was  a 
market  system  based  on  competitive  labor  markets, 
the  automatic  gold  standard,  and  free  trade.  Laissez- 
faire  intellectuals  postulated  that  unfettered  eco¬ 
nomic  competition  was  superior  to  any  form  of 
state  guidance  in  coordinating  human  efforts.  But 
precisely  the  very  belief  in  the  naturalness  of  the 
market  and  the  self-regulating  power  of  market 
forces  was  disputed  in  Karl  Polanyi’s  narrative  of 
the  historical  transformation  from  a  traditional 
socially  embedded  economy  to  a  laissez-faire  sys¬ 
tem  during  the  19th  century.  The  introduction  of  a 
market  economy  necessitated  deliberate  state  action 
and,  contrary  to  the  theories  of  laissez-faire,  did  not 
result  from  natural  market  forces. 

In  summary,  the  continental  European  develop¬ 
ment  of  neoliberalism  in  the  1930s  was  an  outright 
challenge  to  the  19th-century  Anglo-Saxon  belief 
in  self-regulating  markets.  In  rejecting  the  laissez- 
faire  liberalism  with  its  emphasis  on  creating  the 
largest  possible  space  for  the  self-determination  of 
individuals,  the  proponents  of  neoliberalism  ques¬ 
tioned  the  fundamental  separation  between  the 
political  spheres  and  economic  spheres.  Seen  from 
this  historical  perspective,  the  later  reemergence  of 
neoliberalism  in  the  1970s  has  more  in  common 
with  the  belief  system  of  laissez-faire  liberalism  of 
the  19th  century  than  with  the  original  meaning  of 
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neoliberalism  in  the  1930s.  In  fact,  when  Anthony 
Giddens  advocates  a  “third  way”  between  a  lais¬ 
sez-faire  neoliberal  orientation  and  a  top-down 
bureaucratic  state  management,  he  in  fact  comes 
close  to  the  norms,  ideology,  and  practices  cham¬ 
pioned  by  the  original  intellectuals  of  the  1930 
continental  European  neoliberalism. 

The  Reemergence  of  Neoliberalism 
in  the  1970s 

In  contrast  to  the  intellectuals  of  the  1930s  who 
united  under  the  umbrella  of  neoliberalism  and 
identified  themselves  as  part  of  a  new  neoliberal 
movement,  the  reemergence  of  neoliberalism  in  the 
latter  part  of  the  20th  century  lacks  any  group 
affiliation  or  identity  with  a  larger  neoliberal  move¬ 
ment.  Intellectuals  most  closely  identified  with  the 
new  norms  of  market  fundamentalism  are  Friedrich 
August  von  Hayek  and  his  student  Milton  Friedman. 
However,  Hayek’s  teachings  are  much  closer  to  the 
laissez-faire  ideas  of  his  teacher  and  mentor,  Fudwig 
von  Mises,  than  to  the  original  meaning  of  neolib¬ 
eralism  and  later  the  ordoliberalism  of  the  Freiburg 
school  in  Germany.  It  is  thus  not  surprising  that 
Friedrich  von  Hayek  and  Milton  Friedman  do  not 
use  the  term  neoliberalism  in  their  writings,  prefer¬ 
ring  instead  the  concept  of  laissez-faire  liberalism. 

Today  the  term  neoliberalism  is  used  to  describe 
global  economic  processes  of  governance  systems 
that  fundamentally  reconfigure  contemporary  eco¬ 
nomic  and  social  systems  around  the  globe. 
Neoliberal  economic  ideas  emerged  as  a  result  of 
the  economic  stagflation  of  the  1970s.  This  in  turn 
led  to  a  rejection  of  the  postwar  consensus  of 
Keynesian  demand  management.  Most  promi¬ 
nently,  Margaret  Thatcher  and  subsequently  Ronald 
Reagan  popularized  a  radical  market-oriented  sys¬ 
tem  based  on  supply  side  economics  and  rejecting 
state  intervention  in  the  economy.  The  closest 
approximation  today  of  a  neoliberal  socioeconomic 
model  in  the  real  world  is  the  United  States. 

Twenty-five  years  later,  there  is  still  no  shared 
consensus  on  the  meanings  of  neoliberalism  and 
neoliberal  globalization.  At  the  most  fundamental 
level,  these  terms  convey  a  rebalancing  of  state  and 
market,  tilting  to  the  privatization  of  cross-border 
governance.  But  these  processes  are  enormously 
contested.  Proponents  suggest  that  measures  such 
as  liberalization,  deregulation,  and  privatization 


remove  barriers  to  trade  and  financial  cross-border 
transactions  and  thereby  unleash  the  productive 
forces  of  capitalism.  In  this  context,  liberalization 
is  to  allow  the  unfettered  cross-border  movements 
of  capital,  labor,  services,  and  goods,  while  dereg¬ 
ulation  is  geared  to  remove  unwanted  government 
control  that  interferes  with  market  processes,  and 
privatization  transfers  previously  publicly  pro¬ 
vided  services  to  the  private  sector.  Critics  of  neo¬ 
liberal  discourse  and  practice,  on  the  other  hand, 
argue  that  the  rise  and  power  of  global  finance  and 
mobile  capital  as  the  dominant  force  in  this  gover¬ 
nance  framework  have  a  negative  impact  on  state- 
societal  relations.  The  diminished  capacity  of  the 
state  to  sustain  public  policies  has  increased  the 
progressive  detachment  of  individuals  from  social 
networks.  Since  public  authorities  are  faced  with 
dwindling  resources  to  fulfill  their  traditional  man¬ 
dates,  welfare  is  increasingly  provided  through 
market  mechanisms.  The  result  is  an  increase  in 
the  individualization  of  risks. 

One  of  the  most  contentious  intellectual  debates 
concerns  the  role  of  the  state  in  neoliberalism. 
Earlier  proponents  of  neoliberalism  envisaged  the 
retreat  of  the  state  since  they  assumed  that  the 
market  was  the  most  efficient  allocator  of  resources. 
However,  scholars  from  the  Feft  pointed  out  that 
the  state  was  a  central  actor  in  creating  the  emerg¬ 
ing  transnational  governance  system.  In  the  pro¬ 
cess  of  acting  as  a  supposed  midwife  to  a  new 
neoliberal  order,  the  state  also  changed  from  a 
distributive  to  a  more  internationally  competitive 
actor.  Other  key  players  who  are  identified  with 
the  norms  and  practices  of  market  fundamentalism 
are  the  World  Trade  Organization,  the  World 
Bank,  the  International  Monetary  Fund,  and  the 
Organisation  for  Economic  Co-operation  and 
Development.  These  organizations  were  maligned 
for  imposing  the  Washington  Consensus  in  devel¬ 
oping  countries  facing  liquidity  problems.  The 
policy  doctrine  of  structural  adjustment  involves 
macroeconomic  stabilization  through  fiscal  auster¬ 
ity  programs,  trade  liberalization,  and  removing 
barriers  to  capital  movements.  More  than  any 
other  policy,  the  Structural  Adjustment  Programs 
of  the  International  Monetary  Fund  and  the  World 
Bank  came  to  represent  all  that  is  wrong  with  neo¬ 
liberal  market  fundamentalism.  Critics  from  devel¬ 
oping  countries  point  to  the  damaging  effects  of 
neoliberal  transformation,  including  the  depletion 
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and  destruction  of  the  local  ecological  and  bio¬ 
logical  systems  that  sustain  life  and  nature.  Studies 
have  pointed  out  that  many  poor  and  low-skilled 
women  in  developing  countries  have  borne  the 
brunt  of  the  negative  effects  of  neoliberal  global 
transformation. 

Stephen  Gill  has  gone  the  farthest  to  theorize 
the  neoliberal  transformation  with  its  commitment 
to  liberal  governmental  and  market  reforms  as  a 
shift  from  embedded  liberalism  to  disciplinary 
neoliberalism.  The  change  toward  disciplinary 
neoliberalism  points  to  the  disciplinary  social 
mechanism  used  to  lock  in  the  market-based  com¬ 
mitments  to  prevent  future  governments  from 
undoing  the  reforms.  The  result  is  a  new  constitu¬ 
tionalism  as  a  means  to  consolidate  the  market- 
based  governance  framework.  Whether  the  2008 
financial  and  economic  crisis,  the  worst  since  the 
Great  Depression  of  the  1930s,  spells  the  demise  of 
the  neoliberal  norms,  ideas,  and  practices  domi¬ 
nant  since  the  1970s  is  an  open  question.  It  is  true 
that  even  erstwhile  proponents  of  neoliberal  pri¬ 
vate  governance  have  voted  to  “bring  the  state 
back  in”  in  order  to  stabilize  the  global  financial 
and  banking  systems.  But  whether  this  signals  a 
shift  from  neoliberalism  to  postneoliberalism  can¬ 
not  be  determined  at  this  time. 
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The  term  neo-patrimonialism  refers  to  a  hybrid 
mode  of  rule  in  which  informal  political  ties  and 
exchanges  suffuse  the  management  of  a  state.  In  a 
neo-patrimonial  regime,  the  political  chief  execu¬ 
tive  and  his  agents  exercise  authority  mainly 
through  personal  whim  and  material  incentive 
rather  than  through  ideology  or  the  rule  of  law. 
Within  the  state,  the  distinction  between  private 
and  public  interests  is  purposely  blurred,  and  offi¬ 
cials  occupy  bureaucratic  posts  less  to  deliver 
public  goods  and  services  than  to  acquire  personal 
wealth  and  status.  While  elements  of  such  self- 
serving  practice  can  be  found  under  various 
political  regimes,  neo-patrimonial  rule  is  a  defin¬ 
ing  characteristic  of  personal  autocracies  in  the 
world’s  most  underdeveloped  states. 

As  with  other  building  blocks  of  social  science, 
the  concept’s  origins  lie  in  the  ideas  of  great 
German  sociologist  Max  Weber.  In  his  opus 
Economy  and  Society,  Weber  sought  to  under¬ 
stand  how  political  leaders  justify  the  exercise  of 
political  power.  He  proposed  a  threefold  classifica¬ 
tion  of  types  of  authority  ( Herrschaft )  based  on  the 
sources  of  a  leader’s  claim  to  political  legitimacy: 
tradition,  charisma,  or  legality.  Implicit  in  the 
Weberian  schema  is  the  recognition  that  these 
types  of  authority  are  ideal  constructs  whose 
essential  features  are  abstracted  from  empirical 
observation  but  unlikely  to  exist  in  pure  forms  in 
the  real  world.  From  the  outset,  therefore,  Weber 
entertained  the  likelihood  that,  in  practice,  actual 
political  systems  would  operate  according  to 
mixed,  or  hybrid,  principles.  While  Weber  never 
spoke  of  neo-patrimonialism — the  concept  is  a 
late-20th-century  one  developed  by  Shmuel 
Eisenstadt  in  1973,  Jean-Franqois  Medard  in 
1982,  and  Christopher  Clapham  in  1985) — he  did 
anticipate  an  acute  form  of  bureaucratized  patri¬ 
monial  rule. 

In  short,  neo-patrimonialism  is  an  amalgam.  It 
mixes  traditional  (specifically,  patrimonial)  and  legal 
(or  bureaucratic)  authority.  The  first  component — 
patrimonial  authority — is  based  on  a  ruler’s  claim 
to  exercise  intimate  personal  control  over  members 
of  society,  much  as  a  father  does  over  a  household. 
As  a  form  of  traditional  authority,  patrimonial  rule 
is  justified  in  terms  of  inherited  customs  that  have 
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been  sanctified  since  time  immemorial.  To  the 
extent  that  patrimonial  authority  is  vested  in  older 
males,  it  is  also  associated  with  gerontocracy  and 
patriarchal  command  over  women  and  children. 
The  second  component  is  legal  authority,  a  form  of 
governance  based  on  written  constitutional  rules 
and  the  routines  of  bureaucratic  organization.  To 
Weber,  this  rational  (i.e.,  knowledge  driven)  form 
of  authority  was  quintessentially  modern.  Insofar  as 
citizens  owe  obedience,  they  do  so  not  to  individual 
leaders  but  to  professional  office  holders  in  a  rule- 
governed  political  order.  Administration  is  imper¬ 
sonal,  and  power  is  constrained  by  the  rule  of  law. 

Patrimonial  and  legal-rational  forms  of  author¬ 
ity  are  commonly  juxtaposed  in  postcolonial 
situations.  On  the  one  hand,  agrarian  societies — 
especially  in  sub-Saharan  Africa  but  also  in  parts  of 
Asia  and  Latin  America — are  governed  by  informal 
customary  norms  exercised  by  traditional  chief¬ 
tains  or  feudal  landlords.  On  the  other  hand,  newly 
independent  countries  possess  the  formal  apparatus 
of  statehood,  even  if  the  empirical  coverage  of  the 
state’s  legal  and  bureaucratic  rules  constitutes  little 
more  than  a  thin  veneer.  One  origin  of  such  dualis- 
tic  polities,  as  noted  by  Mahmood  Mamdani,  can 
be  found  in  the  colonial  policy  of  indirect  rule  as 
practiced  in  South  Asia  and  West  Africa,  in  which 
imperial  overlords  whose  resources  were  strained 
devolved  local  responsibilities  to  extant  traditional 
authorities.  In  short,  neo-patrimonial  rule  was  born 
in  the  political  encounter  between  industrial  empires 
and  agrarian  societies  and  is  manifest  today  in  the 
uneasy  coexistence  of  formal-legal  and  patrimonial 
modes  of  rule. 

Patrimony  and  bureaucracy  are  both  based  on  a 
principle  of  hierarchy  but,  when  they  coexist,  the 
ultimate  source  of  legitimacy  becomes  blurred  and 
contested.  Is  the  state  the  personal  property  of  the 
ruler  or  is  it  an  impersonal  instrument  of  constitu¬ 
tional  rule?  Are  citizens  required  to  obey  the 
supreme  leader  because  they  owe  him  political 
loyalty  or  because  they  are  required  to  defer  to  a 
rule  of  law? 

The  resolution  of  such  tensions  gives  rise  to 
neo-patrimonial  rule.  The  educated  generation  of 
nationalist  leaders  that  rose  to  power  in  the  mid- 
20th  century  with  the  dissolution  of  the  European 
empires  adapted  to  their  surroundings.  Unable  or 
unwilling  to  satisfy  mass  demands  for  rapidly 
ending  poverty,  these  leaders  instead  devised 


forms  of  rule  that  aimed  mainly  at  consolidating  a 
hold  on  office.  On  the  one  hand,  as  pointed  out  by 
Robert  Jackson  and  Carl  Rosberg  (1982),  leaders 
claimed  legal  authority  by  virtue  of  the  sovereign 
status  of  their  countries  under  international  law, 
and  they  greatly  expanded  the  scope  of  state 
responsibilities  by  embarking  on  ambitious 
national  development  plans  and  increasing  the 
size  of  the  public  bureaucracy.  On  the  other  hand, 
they  treated  the  state  as  if  it  were  a  private  pre¬ 
serve:  They  appropriated  public  resources  for 
personal  gain,  to  build  political  followings,  and  to 
establish  “official”  political  parties.  Some  leaders 
even  invested  in  a  cult  of  personality,  emphasizing 
themes  such  as  “father  of  the  nation.” 
Characteristically,  all  neo-patrimonial  leaders 
awarded  loyal  followers  with  special  favors,  both 
within  the  state  (public  sector  jobs)  and  across 
society  (licenses,  contracts,  and  projects).  In  return 
for  material  benefits,  clients  mobilized  political 
support  and  referred  all  decisions  upward  as  a 
mark  of  deference  to  patrons. 

This  new  (neo)  form  of  rule  differed  from  clas¬ 
sic  patrimonialism  in  that  its  venue  and  instrument 
was  a  modern  state.  But  the  process  of  construct¬ 
ing  neo-patrimonial  rule  also  distorted  beyond 
recognition  the  classic  features  of  Weberian 
bureaucracy.  The  top  leader  and  his  subordinate 
barons  seldom  make  important  decisions  on  the 
basis  of  written  rules  or  objective  knowledge,  pre¬ 
ferring  instead  to  maximize  their  own  subjective 
discretion.  They  routinely  violate  Weber’s  rule  that 
officials  should  be  separated  from  the  ownership 
of  the  means  of  administration,  instead  turning 
public  means  to  personal  ends.  Political  loyalty 
more  than  technical  qualification  becomes  the  cri¬ 
terion  for  appointing  office  holders.  And  these 
appointees  are  compensated  not  only  with  fixed 
salaries  but  also  with  opportunities  to  secure  illicit 
rents.  Most  important,  the  bureaucratic  principle 
of  equality  of  treatment  for  clients  is  narrowed  and 
targeted  to  political  favorites,  often  defined  along 
lines  of  ethnic  or  other  communal  solidarities. 

Indeed,  the  formal  architecture  of  the  state  is 
thoroughly  penetrated  and  often  superseded  by 
informal  networks  of  power  and  privilege.  The 
supremo  appoints  loyal  cronies  to  strategic  posi¬ 
tions  in  key  institutions  and  outlying  regions,  con¬ 
ceding  to  them  the  authority  to  deploy  resources 
from  their  own  fiefdoms.  As  long  as  these  barons 
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contribute  to  the  incumbent’s  goal  of  retaining 
political  power,  they  are  assured  of  impunity  from 
charges  of  corruption.  A  pyramid  of  such  local  net¬ 
works  stretches  down  from  the  center  to  the  local¬ 
ity  through  formal  structures  such  as  ruling  parties 
and  local  governments  and  through  informal  links 
to  traditional  authorities  and  even  youth  militias. 
As  Diana  Cammack  (2007)  notes,  to  the  extent  that 
political  loyalty  is  rewarded  with  material  benefit, 
neo-patrimonial  networks  operate  according  to  an 
informal  logic  that  is  resistant  to  well-meaning 
efforts  at  legal  reform  promoted  by  international 
donors,  opposition  parties,  or  civic  organizations. 

This  is  not  to  say  that  neo-patrimonial  rulers 
never  employ  the  formal  powers  of  public  law  when 
it  suits  them  to  do  so.  Often  lacking  the  capacity  to 
govern  all  corners  of  their  territories,  such  leaders 
insist  vociferously  on  the  international  legal  princi¬ 
ple  of  noninterference  in  the  internal  affairs  of  sov¬ 
ereign  states.  They  tend  to  retain  restrictive  colonial- 
era  constitutions  when  these  documents  allow  the 
concentration  of  powers  in  the  hands  of  a  political 
chief  executive  and  offer  instruments  for  suppress¬ 
ing  political  opposition.  At  the  same  time,  bowing 
to  the  waves  of  democratization  that  have  swept 
the  non-Western  world  in  recent  decades,  neo- 
patrimonial  leaders  seek  to  wrap  themselves  in  the 
legitimating  mantle  of  democracy.  Even  autocrats 
now  subject  themselves  to  election  on  regular  cycles 
mandated  in  national  constitutions.  To  be  sure,  they 
often  use  the  powers  of  incumbency  to  manipulate 
such  elections — through  gerrymandering,  intimida¬ 
tion,  vote  buying,  or  ballot  stuffing — but  they  none¬ 
theless  recognize  that,  in  a  modern  world,  political 
legitimacy  inexorably  rests  on  seeming  to  abide  by 
constitutional  and  electoral  laws.  In  sum,  neo- 
patrimonial  rulers  employ  universal  rules  in  defense 
of  particular  privilege. 

This  unorthodox  fusion  of  governance  strategies 
is  found  in  differing  degrees  in  various  types  of  con¬ 
temporary  political  regimes.  Neo-patrimonialism  is 
largely  absent  in  liberal  democracies  but  can  serve 
as  a  governing  strategy  in  electoral  democracies;  it 
is  the  core  modus  operandi  in  electoral  autocracies, 
and  it  resembles  original  forms  of  patrimonialism  in 
traditional  monarchies  and  unreformed  autocra¬ 
cies.  Neo-patrimonial  exchanges  can  be  found  in 
both  civilian  and  military  regimes,  especially  in  one- 
party  systems  with  a  dominant  leader  who  prefers 
adulatory  plebiscites  to  genuinely  competitive 


elections.  The  model  reached  its  zenith  in  the  per- 
sonalistic  administrations  of  strongman  presidents 
like  Mobutu  Sese  Seko  of  Zaire  (now  the  Democratic 
Republic  of  the  Congo),  1965-1997;  Ferdinand 
Marcos  of  the  Philippines,  1965-1986;  and 
Anastasio  Somoza  Debayle  of  Nicaragua,  1967- 
1979.  Contemporary  examples  include  the  elec¬ 
toral  authoritarian  regimes  of  Robert  Mugabe  of 
Zimbabwe  and  Hugo  Chavez  of  Venezuela  who,  by 
2011,  had  enjoyed  an  unbroken  hold  on  power 
since  1980  and  1998,  respectively. 
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The  term  neo-Weberian  has  a  variety  of  applica¬ 
tions  in  the  literature  of  political  science,  sociology, 
and  organization  theory  as  well  as  in  public  and 
business  administration.  In  most  cases,  it  is  based  on 
variants  of  the  Weberian  model  and  employs  the 
methods  of  analysis  used  by  the  German  sociologist 
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and  economist  Max  Weber.  The  term  neo-Weberian 
usually  refers  to  the  application  of  Weberian  prin¬ 
ciples  to  a  modern  state  or  organization.  The  label 
neo-Weberian  state  (NWS)  was  introduced  by 
Christopher  Pollitt  and  Geert  Bouckaert  in  2004 
in  the  second  edition  of  their  groundbreaking 
book,  Public  Management  Reform:  A  Comparative 
Analysis.  NWS  refers  to  traditional  (Weberian) 
administrative  systems  that  are  in  the  process  of 
modernization  but  retain  distinctive  public  service 
qualities. 

NWS  was  originally  intended  as  a  descriptive 
concept  based  on  the  empirical  evidence  of  public 
management  reforms.  It  was  an  attempt  at  finding 
common  denominators  in  Continental  European 
developments  while  acknowledging  numerous  varia¬ 
tions  within  the  region.  Turning  to  a  more  explana¬ 
tory  mode,  NWS  could  be  interpreted  as  a  political 
response  to  globalization  and  political  de-alignment 
in  the  rich,  strong  liberal-democratic  regimes  of 
Western  Europe.  Thus,  according  to  Pollitt,  the  pur¬ 
suit  of  an  NWS-like  solution  could  be  seen  as  an 
attempt  to  protect  the  “European  social  model” 
from  the  depredations  of  global  markets  and  neolib¬ 
eral  ideology.  Also,  NWS  appears  to  receive  the 
attention  of  the  European  Commission.  Having  been 
originally  set  up  according  to  the  French  model,  it 
began  to  reform  in  the  1990s,  aiming  to  make  itself 
more  externally  oriented  and  consultative,  more  pro¬ 
fessional  and  more  efficient,  and  yet  at  the  same  time 
refusing  to  abandon  key  Weberian  principles. 

NWS  challenges  the  Anglo-American  new  pub¬ 
lic  management  (NPM)  model,  which  aims  to 
import  business  and  market  principles  and  manage¬ 
ment  techniques  from  the  private  into  the  public 
sector  and  is  based  on  a  neoliberal  understanding 
of  the  state.  NPM,  an  initiative  launched  during 
the  1980s,  envisaged  an  entrepreneurial,  market- 
oriented  society,  with  a  tiny  government  on  top, 
and  is  characterized  by  marketization  and  com¬ 
petition,  including  large-scale  privatization  and 
contracting  out,  contractual  appointments,  “client- 
orientedness,”  and  performance-related  manage¬ 
ment  tools.  The  American  equivalent  of  these 
reforms  was  the  National  Partnership  for  Reinvent¬ 
ing  Government,  a  task  force  formed  during  the 
Clinton  administration. 

NWS  was  developed  in  a  context  of  concerns 
about  NPM  shortcomings  and  other  managerial 
reforms;  it  is,  therefore,  a  post-NPM  concept. 


Meanwhile,  NWS  co-opts  the  positive  characteris¬ 
tics  of  NPM  on  a  Weberian  foundation  (rather 
than  adding  Weberian  elements  to  NPM).  Table  1 
sums  up  the  NWS  model  by  incorporating  the 
“neo”  elements  (NPM  influence)  and  the  charac¬ 
teristics  of  traditional  public  administration 
(“Weberian”  foundation). 

The  purpose  of  NWS  is  not  simply  to  criticize 
NPM  but  to  present  a  coherent  and  up-to-date  alter¬ 
native,  which  appears  to  be  the  answer  of  continen¬ 
tal  European  governments  to  the  NPM  paradigm. 
According  to  Pollitt,  the  ideas  of  political  power  and 
modernization  are  the  two  main  dimensions  of  the 
original  NWS  concept.  On  the  one  hand,  compared 
with  Anglo-American  governments,  the  continental 
Europeans  hold  a  more  optimistic  attitude  regarding 
the  future  role  of  the  state.  Thus,  their  objective  is 
not  the  minimal  state  but  a  state  that  retains  a 
strong  steering  and  regulatory  presence  in  society. 
The  state  is  not  seen  as  a  burden  on  the  economy 
and  society  or  as  a  necessary  evil.  Rather,  it  is  the 
guarantor  and  partner  of  both  a  strong  economy 
and  a  socially  cohesive  society.  It  initiates  or  facilitates 
a  range  of  democratic  mechanisms,  both  representa¬ 
tive  and  direct.  Continental  European  governments 
continue  to  regard  the  state  as  the  irreplaceable 
integrative  force  in  society,  with  a  legal  personality 
and  operational  value  system  that  cannot  be  reduced 
to  the  private  sector  discourse  of  efficiency,  com¬ 
petitiveness,  and  client  satisfaction. 

On  the  other  hand,  the  state  is  steadily  modern¬ 
izing,  professionalizing,  and  improving  its  effi¬ 
ciency.  Such  changes  typically  include  results-based 
budget  reforms,  more  flexible  personnel  policies 
(which,  however,  do  not  amount  to  abandoning  a 
distinctive  career  public  service),  extensive  decen¬ 
tralization  and  devolution  of  authority  from  central 
ministries  and  agencies,  a  greater  emphasis  on  stra¬ 
tegic  planning,  selective  and  gradual  privatization, 
and  a  strengthened  commitment  to  improving  the 
quality  of  public  service  provision.  Yet  it  is  far  from 
an  assumption  that  copying  the  private  sector  is  the 
only  way  for  governments  to  improve  their  effi¬ 
ciency,  performance,  and  professionalism.  Private 
sector  methods  may  be  adopted  on  some  occasions, 
in  certain  types  of  organizations,  and  for  specific 
policies,  but  they  have  no  automatic  priority  or 
superiority.  Consequently,  NWS  has  led  to  some 
changes,  whose  effect,  however,  is  often  dampened 
by  existing  structures  and  traditions  and  which  are 
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Table  I  Neo-Weberian  State 


Neo-  Weberian 


Shift  from  an  internal  orientation  toward  bureaucratic 
rules  to  an  external  orientation  toward  meeting 
citizens’  needs  and  wishes;  the  primary  route  to 
achieving  this  is  not  the  employment  of  market 
mechanisms  (although  they  may  occasionally  come  in 
handy)  but  the  creation  of  a  professional  culture  of 
quality  and  service 

Supplementation  (not  replacement)  of  the  role  of 
representative  democracy  by  a  range  of  devices  for 
consultation  with,  and  direct  representation  of, 
citizens’  views 

In  the  management  of  resources  within  government,  a 
modernization  of  the  relevant  laws  to  encourage  a 
greater  orientation  on  the  achievements  of  results 
rather  than  merely  the  correct  following  of 
procedure;  this  is  expressed  partly  in  a  shift  from  ex 
ante  to  ex  post  controls  but  not  a  complete 
abandonment  of  the  former 

A  professionalization  of  the  public  service,  so  that  the 
bureaucrat  becomes  not  simply  an  expert  in  the  law 
relevant  to  his  or  her  sphere  of  activity  but  also  a 
professional  manager,  oriented  to  meeting  the  needs 
of  his  or  her  citizen/users 


[but:]  Reaffirmation  of  the  role  of  the  state  as  the  main 
facilitator  of  solutions  to  the  new  problems  of 
globalization,  technological  change,  shifting 
demographics,  and  environmental  threat 


[but:]  Reaffirmation  of  the  role  of  representative 
democracy  (central,  regional,  and  local)  as  the 
legitimating  element  within  the  state  apparatus 

[but:]  Reaffirmation  of  administrative  law — suitably 
modernized — in  preserving  the  basic  principles 
pertaining  to  the  citizen-state  relationship,  including 
equality  before  the  law,  legal  security,  and  the 
availability  of  specialized  legal  scrutiny  of  state 
actions 

[but:]  Preservation  of  the  idea  of  a  public  service  with 
a  distinct  status,  culture,  and  terms  and  conditions 


Source:  Adapted  from  Pollitt  &  Bouckaert  (2004,  pp.  99-100). 


more  concerned  with  democratization  and  mod¬ 
ernization  than  with  entrepreneurial  government  or 
blindly  copying  private  sector  practices. 

Although  the  NWS  was  originally  intended  as 
an  empirical-analytical  model  and  not  as  a  norma¬ 
tive  one,  in  recent  years  the  concept  has  also 
assumed  a  strong  normative  meaning,  in  particular, 
for  new  democracies  (which,  incidentally,  were  not 
the  focus  of  Pollitt  and  Bouckaert’s  original  analy¬ 
sis).  It  has  often  been  proposed  that  the  key  for 
administrative  development  in  new  democracies  is 
first  to  make  sure  that  the  Weberian  foundation  of 
the  NWS  model  is  present  and  only  then  to  start 
gradually  introducing  the  “neo”  elements  (modern 
management  tools).  Guy  Peters  argues  that  despite 
the  appeal  of  the  NPM  ideas  of  deregulation  and 
flexibility,  governments  attempting,  at  the  same 
time,  to  build  democracy  and  effective  administra¬ 
tion  must  place  a  much  greater  emphasis  on  formal 
requirements,  rules,  and  strong  ethical  standards. 
The  values  of  efficiency  and  effectiveness  are 


important,  but  in  the  short  run  they  should  take 
second  place  to  fostering  fairness  and  responsibil¬ 
ity.  The  NWS  paradigm  also  includes  a  reaffirma¬ 
tion  of  the  role  of  administrative  law.  In  addition  to 
preserving  the  basic  principles  pertaining  to  the 
citizen-state  relationship — such  as  equality  before 
the  law,  legal  security,  and  legal  scrutiny  of  state 
actions — the  law  should  be  an  anchor  of  predict¬ 
ability,  reliability,  and  legitimacy  in  the  chaotic 
political  and  administrative  environment  of  new 
democracies.  Of  particular  relevance  for  new 
democracies  are  the  NWS’s  (Weberian)  elements 
that  promote  the  unity  of  public  administration 
and  common  public  service  culture,  such  as  the 
preservation  (or  first  of  all,  the  creation)  of  a  public 
service  with  a  distinct  status,  culture,  and  condi¬ 
tions  as  well  as  the  recognition  of  the  need  for  a 
strong  state.  This  has  significant  policy  implica¬ 
tions  for  newly  independent  states  (e.g.,  in  Central 
and  Eastern  Europe)  who  may  be  easily  attracted  to 
NPM-like  reform  ideas  without  having  a  solid 
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Weberian  foundation.  This  makes  NWS  a  sound 
basis  for  reform  strategies  in  these  countries. 

The  NWS  model  presents  interesting  opportu¬ 
nities  for  both  policy  makers  and  researchers.  For 
example,  Lawrence  Lynn  has  developed  the  NWS 
model  further  from  a  theoretical  point  of  view  by 
proposing  NWS  as  either  an  independent  variable 
or  a  dependent  variable.  While  the  NWS  implies  a 
positive  synthesis  of  the  new  and  the  traditional, 
neo-Weberian  administration  has  also  been  viewed 
critically  by  Lynn  as  a  threat  to  liberal  democracy, 
by  which  organizations  could  be  subverted  by 
informal  and  illegitimate  patterns  of  authority  and 
decision  making,  thus  reducing  their  legitimacy. 
This  is  why  some  less  democratic  countries  could 
also  support  the  idea  of  a  strong  modernizing 
state  that  underlies  the  NWS  model.  Therefore,  it 
could  be  argued  that,  normatively  speaking,  the 
implementation  of  NWS  also  presupposes,  in 
addition  to  the  Weberian  bureaucracy,  a  viable 
liberal  democracy. 

The  economic  crisis  has  radically  altered  the 
context  in  which  public  administration  operates: 
The  state  is  again  playing  a  part  in  the  economy. 
The  crisis  thus  casts  public  institutions  as  enablers 
of  (rather  than  barriers  to)  growth  and  innovation. 
Peter  Evans  and  James  Rauch  have  demonstrated 
that  there  is  a  connection  between  Weberianism 
and  sustained  economic  growth.  Wolfgang 
Drechsler  has  developed  the  NWS  model  further  by 
linking  it  to  innovation,  economic  growth,  and 
information  technology.  Effective  public  adminis¬ 
tration  is  seen  as  a  sine  qua  non  condition  for  eco¬ 
nomic  growth  and  innovation,  as  innovation-based 
society  draws  on  and  requires  a  highly  competent, 
long-term-oriented,  and  dedicated  civil  service  to 
implement  it — features  aimed  for  by  the  NWS 
model.  In  addition,  recent  research  has  shown  that 
information  and  communication  technology  (ICT), 
especially  e-governance,  which  was  long  associated 
with  NPM  because  of  its  parallel  occurrence,  is 
actually  just  as  conducive  (if  not  even  more  so)  to 
the  NWS  and  in  turn  is  promoted  by  it  as  well.  And 
beyond  crisis  and  ICT,  whatever  the  future  leading 
technologies  will  be — nanotech,  biotech,  conver¬ 
gence,  or  something  completely  different — their  setup 
will  require  a  particularly  capable  state  actor  and  a 
science  and  technology  policy  implemented  by  a  civil 
service  that  is  characterized  by  long-term  thinking 
as  well  as  by  tolerance  of  mistakes — characteristics 


that  are  not  enabled  by  NPM.  In  other  words,  the 
period  in  history  that  we  are  now  entering  is  bound 
to  be  more  state  friendly  than  the  1990s,  and  NWS 
seems  to  be  one  of  the  most  interesting  theoreti¬ 
cal  and  normative  answers  to  the  question  of 
how  a  complex  innovation-based  society  should 
be  governed. 

Tima  Randma-Liiv 
Tallinn  University  of  Technology 
Tallinn,  Estonia 
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Network  Analysis 


Network  analysis  systematically  quantifies  and 
visualizes  relationships  between  actors,  such  as 
knowledge  transfer  or  resource  flow.  The  underly¬ 
ing  formal  concepts  of  graph  theory  make  network 
analysis  both  a  theoretical  approach  and  a  toolkit  of 
formal  approaches.  Social  network  analysis  is  an 
emerging  field  that  combines  contributions  from 
different  disciplines,  such  as  sociology,  anthropol¬ 
ogy,  mathematics,  statistics,  and,  recently,  physics 
and  biology.  The  research  approach  has  recently 
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inspired  several  books,  such  as  Duncan  Watts’s  Six 
Degrees  of  Separation,  Albert-Laszlo  Barabasi’s 
Linked,  and  Nikolas  Christakis  and  James  Fowler’s 
Connected.  This  entry  first  briefly  describes  the 
goals  and  historical  developments  of  network  anal¬ 
ysis.  Then,  the  research  questions,  research  design 
strategies,  data  collection  procedures,  and  basic 
formal  data  analysis  mechanisms  are  discussed.  The 
entry  concludes  with  a  look  at  recent  and  future 
developments  of  network  analysis. 

Definition  of  Networks 

Networks  are  social  relational  systems  character¬ 
ized  by  a  set  of  actors  and  their  social  ties.  They 
consist  of  a  finite  set  of  identifiable  nodes;  the  rela¬ 
tionships  among  these  nodes  (actors)  are  repre¬ 
sented  as  ties  (edges).  Ties  are  dichotomous:  pres¬ 
ent  or  absent,  or  unordered  (undirected)  or  ordered 
(directed  ties).  The  nodes  represent  a  single  entity 
that  potentially  may  take  part  in  the  relationships 
under  study.  A  network  analysis  takes  the  characteris¬ 
tics  of  nodes  and  the  characteristics  of  the  relations 
connecting  the  nodes  into  account.  Attributes — 
additional  information  about  the  actors — are  char¬ 
acteristics,  such  as  the  behavior,  attitudes,  or  other 
properties  of  actors.  Networks  are  represented  as  a 
(social)  graph,  which  is  defined  by  a  set  of  nodes 
together  with  the  set  of  pairwise  relationships 
among  them.  Networks  present  an  opportunity 
and  constraint  structure:  on  the  one  hand  offering 
access  to  resources  flowing  through  the  ties  and,  on 
the  other  hand,  restricting  choices  and  controlling 
behavior. 

Definition  and  Goals  of  Network  Analysis 

Network  analysis  is  an  interdisciplinary  field  of 
research  with  a  history  in  sociology  and  anthropol¬ 
ogy.  It  provides  the  formal  mechanisms  for  repre¬ 
sentation,  measurement,  and  modeling  of  relational 
structure  and  is  based  on  the  assumption  that 
actions  and  decisions  of  actors  are  dependent  on 
the  context  and  the  actions  of  other  actors.  The 
social  structure  is  an  area  of  (inter)action  in  which 
emergent  patterns  of  behavior  can  be  observed, 
and  structural  variables  help  analyze  the  resulting 
interactions.  Generally,  a  network  consists  of 
actors  (nodes)  that  are  connected  with  each  other 
through  ties  (edges).  The  actors  can  represent 


people,  organizations,  countries,  or  other  entities. 
The  relations  and  ties  are  conduits  for  the  flow  of 
resources  in  the  form  of  knowledge,  finances,  col¬ 
laboration,  and  so  on.  The  units  of  analysis  are, 
therefore,  the  interactions  between  the  actors. 

In  a  network  analytical  approach  different  types 
of  data  can  be  distinguished: 

•  attributes,  which  are  descriptors  of  the  individual 
actors  in  the  network,  such  as  age,  sex, 
profession,  or  political  affiliation; 

•  relational  data  that  are  derived  from  the 
interactions  of  at  least  two  actors,  such  as  Country 
A  imports  Product  x  from  Country  B;  and 

•  structural  characteristics  of  the  overall  network 
that  can  be  derived  from  the  relational 
characteristics  of  all  studied  actors,  such  as 
density  or  centralization. 

The  main  goal  of  network  analysis  is  to  under¬ 
stand  the  emergence  of  the  network  structure  and 
its  consequences  by  description,  visualization,  and 
statistical  modeling.  In  more  formal  terms,  net¬ 
work  analysts  analyze  which  independent  vari¬ 
ables  have  led  to  the  observed  structure  and  how 
the  social  structure  influences  other  emerging 
social  processes.  In  this  form  of  “structural  analy¬ 
sis,”  the  relationships  among  actors  become  the 
first  priority,  and  individuals’  properties  are  sec¬ 
ondary  in  the  study  of  the  flow  of  structural  regu¬ 
larities  that  might  influence  actors’  choices  and 
their  resulting  behavior.  The  result  is  an  approxi¬ 
mation  to  the  structure  of  a  more  complex  system 
for  purposes  of  studying  a  particular  property 
(such  as  the  diffusion  of  a  disease  in  a  community). 
Ultimately,  complex  situations  can  be  represented 
using  multiple  relationships  (multiplexity),  such  as 
group  membership,  friendship,  hate  relationships 
within  the  group,  hierarchy  and  reporting  struc¬ 
ture  among  the  group,  and  the  strengths  and  fre¬ 
quencies  of  interactions.  There  are  different  ways 
of  looking  at  the  social  structure:  The  researcher 
can  either  look  at  the  outcomes  of  the  existing 
network  structure,  at  the  emergent  behavioral  pat¬ 
terns,  or  the  characteristics  of  the  overall  structure. 

Brief  History  of  Network  Analysis 

The  first  accounts  of  network  analysis  can  be 
traced  back  to  the  1930s  when  Jacob  Moreno 
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used  sociograms  as  a  form  of  analysis  to  visualize 
the  complex  structure  of  social  interactions.  Socio¬ 
grams  are  a  method  of  mapping  or  graphically 
representing  individuals’  perceptions  of  a  complex 
social  structure.  Matrix  algebra  and  graph  theory 
were  used  in  the  1940s  and  1950s  as  a  mechanism 
of  formalizing  social  structure.  Alex  Bavelas  added 
new  insights  into  the  functioning  of  small  groups 
by  conducting  experiments  on  communication 
networks.  The  experiments  were  used  to  under¬ 
stand  the  speed  and  accuracy  of  centralized  and 
decentralized  structures  to  find  the  shortest  path  a 
message  can  take  through  different  forms  of  social 
structures.  Stanley  Milgram  added  his  “Six  degrees 
of  separation”  research  and  Harrison  White 
looked  at  structural  equivalence  as  a  form  of 
similar  incoming  and  outgoing  ties  to  form  a  new 
and  reduced  network  in  the  1970s.  Mark 
Granovetter  then  developed  his  strength-of-weak- 
ties  theory;  he  showed  that  strong  contacts  tend  to 
have  very  similar  information,  whereas  weak  con¬ 
tacts  could  be  the  source  of  innovative  informa¬ 
tion.  Based  on  this  theory,  Ron  Burt  developed  his 
notion  of  structural  holes  in  the  1980s  and  1990s 
and  applied  it  to  the  benefits  of  bridging  and 
bonding  ties. 

Basic  Assumptions  of  Network  Analysis 

The  basic  underlying  assumption  of  a  network  ana¬ 
lytical  approach  is  based  on  the  notion  that  differ¬ 
ent  types  of  ties  serve  as  conduits  for  flow  of 
resources  through  the  network.  Ties  are  either 
absent  or  present.  In  most  network  research,  an 
absent  tie  needs  to  be  interpreted  as  this  implies  that 
resources  are  not  flowing  through  the  nonexistent 
tie.  Moreover,  ties  can  either  be  directed  or  undi¬ 
rected:  Directed  ties  hint  at  asymmetric  relation¬ 
ships  between  actors,  with  resources  only  flowing  in 
one  direction.  Undirected  ties  are  symmetric  and 
represent  either  a  confirmed  relationship  or  a  rela¬ 
tionship  in  which  both  actors  have  to  be  present. 

The  common  assumption  of  independent  obser¬ 
vations  does  not  hold  in  networks.  Multiple  ties  to 
and  from  actors  are  related,  and  so  it  cannot  be 
assumed  that  the  tie  between  A  and  B  is  indepen¬ 
dent  of  C.  In  addition,  the  assumption  of  continu¬ 
ous,  normally  distributed  variables  does  not  hold 
when  tie  variables  are  binary,  nominal,  ordinal,  or 
count  variables.  Moreover,  two  observations  are 


usually  available  for  each  pair  of  actors  (dyads) 
and  are  assumed  to  be  correlated  actor-wise. 

There  are  two  different  forms  of  research  ques¬ 
tions.  First,  a  network  analytic  approach  can  aim 
to  use  independent  variables  to  explain  the  emer¬ 
gence  of  the  social  structure.  Second,  the  social 
structure  itself  can  also  be  used  as  an  explanatory 
variable  to  understand  specific  outcomes,  such  as 
voting  behavior.  Generally,  two  research  design 
strategies  can  be  distinguished:  The  researcher 
needs  to  decide  whether  to  use  ego-centered  per¬ 
sonal  networks  or  complete  (whole)  networks.  In 
an  ego-centered  network  approach,  network  data 
are  usually  collected  from  a  sample  of  actors  (egos) 
reporting  on  the  ties  with  and  between  other 
people  (alters).  Here,  it  is  important  to  obtain  as 
complete  a  picture  of  the  respondents’  networks  as 
possible.  Name  generators  are  used  to  collect  ego- 
centered  data  by  providing  a  clear  definition  of 
which  persons  known  by  ego  qualify  as  a  network 
member  (or  alter)  of  ego.  The  relational  system  is 
then  assumed  to  be  composed  of  the  sampled  egos 
and  reported  alters  and  their  ties  as  well  as  possible 
additional  actor  and  tie  information.  In  a  second 
phase,  the  alters  will  be  asked  about  their  relation¬ 
ships  among  each  other  and  with  ego.  A  complete 
or  whole  network  approach  includes  a  well- 
defined  group  of  actors  who  report  on  their  ties 
with  all  other  actors  in  this  preset  group.  A  high 
response  rate  of  actors  is  needed  to  get  a  close-to- 
complete  picture  of  the  interactions.  It  can  be 
assumed  that  the  ties  reported  by  actors  are  not 
usually  independent. 

After  defining  the  research  question  and  fram¬ 
ing  the  goal  and  empirical  context  of  the  network 
analysis,  the  next  step  is  to  define  who  should  be 
included  in  the  final  network  and  from  whom  net¬ 
work  data  will  be  solicited.  Simply  said,  this  deci¬ 
sion  will  set  the  limits  on  the  social  relations  to  be 
collected.  While  there  are  different  schools  of 
thoughts  on  this  topic,  two  approaches  seem  to  be 
dominant:  realist  and  nominalist  philosophies  of 
boundary  specifications  highlighted  by  David 
Knoke.  The  realist  approach  assumes  that  the  sub¬ 
jects  identified  by  the  researchers  know  best  who 
should  be  nominated  as  part  of  the  network.  In 
comparison,  a  nominalist  approach  assumes  an 
upfront  theoretical  or  practical  limit  to  the  bound¬ 
aries  of  a  network.  The  researcher  has  either  devel¬ 
oped  a  set  of  theoretical  hypotheses  that  define 
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clearly  who  is  in  and  who  is  outside  the  network 
boundaries,  or  it  is  not  practical  or  doable  to  col¬ 
lect  network  data  from  the  whole  universe.  An 
example  of  a  nominalist  approach  is  to  use  defined 
roles  in  organizational  hierarchical  settings,  such 
as  all  CEOs  of  Fortune  500  companies.  Both  strat¬ 
egies  have  advantages  and  disadvantages:  Clearly, 
a  too  extensive  inclusion  will  impose  a  burden  on 
the  data  collection  process,  and  a  forced  exclusive¬ 
ness  might  omit  important  actors  and  therefore 
bias  the  results. 

In  addition,  researchers  distinguish  between 
positional  and  relational  boundary  specification 
strategies  that  are  closely  linked  to  the  previous 
decisions.  A  positional  approach  defines  the  net¬ 
work  membership  based  on  the  formal  position  an 
actor  assumes  in  a  given  social  circle,  such  as  the 
formal  affiliation  as  a  team  member  of  a  specific 
department.  The  relational  approach — which  is 
sometimes  also  called  a  snowball  sampling 
approach — emerges  similar  to  the  realist  approach 
based  on  nominations  by  already  included  sub¬ 
jects:  Additional  subjects  are  recovered  by  the 
current  subject,  and  their  relationships  to  the 
emerging  network  actors  are  taken  into  account. 
Moreover,  a  researcher  might  decide  to  look  at 
participation  of  specific  actors  in  an  incident  to 
answer  a  research  question  such  as  “Which  orga¬ 
nizations  participated  in  the  Iran  election  pro¬ 
tests?”  This  event-based  strategy  is  different  from 
formal  affiliations  and  memberships  and  can  help 
identify  relevant  groups  and  subjects. 

In  a  network  analytical  approach,  four  different 
data  collection  strategies  are  distinguished: 

f.  collection  of  the  formal  structure  of  the 
network, 

2.  collection  of  data  on  the  content  of  the  ties, 

3.  collection  of  data  on  the  frequency  or  intensity 
of  interactions,  and 

4.  data  collected  that  focus  on  the  perceived 
connection. 

Complete  or  whole  networks  are  also  called  total 
personal  networks.  In  this  approach,  the  network 
boundaries  are  well  defined,  and  it  is  assumed  that 
all  listed  network  actors  either  know  each  other  or 
have  at  least  heard  of  each  other  (e.g.,  all  members 
of  one  department).  This  macrolevel  of  analysis 


has  the  goal  of  using  every  type  of  relationship  in 
a  given  network  and  of  representing  the  network 
in  its  entirety.  The  special  focus  in  whole  networks 
is,  therefore,  mainly  on  the  identification  of  spe¬ 
cific  roles  and  positions  or  on  the  similarities  of 
these  roles  played  by  different  actors. 

There  are  several  different  ways  in  which  net¬ 
work  data  of  a  whole  network  can  be  solicited: 

•  unrestricted  choices  (where  the  subjects  are 
asked  to  list  everyone  they  know  in  the 
network), 

•  ranking  with  respect  to  attractiveness  (the 
subjects  rank  the  most  important  contacts), 

•  paired  comparison  (a  range  of  choices  have  to  be 
compared), 

•  free  recall  rosters  (where  no  names  are  provided, 
and  the  subjects  have  to  nominate  the  actors 
they  remember), 

•  fixed  number  of  nominations  (“name  the  five 
most  important  actors  in  this  network”),  and 

•  name  roster  with  full  network  (where  all  actors 
are  provided,  and  the  subject  can  choose  among 
the  full  list). 

Data  Collection  of  Egocentric  Networks 

Egocentric  networks  describe  the  relationship  a 
focal  subject  (ego)  has  to  third  parties  (alters).  Ego- 
centered  networks  of  social  relations  around  a  par¬ 
ticular  individual  are  also  called  partial  networks 
and  focus  on  specific  types  of  relationships.  These 
relationships  can  be  multiplex,  where  there  are  mul¬ 
tiple  overlapping  relationships  that  can  be  used  to 
describe  the  quality  of  the  relationships.  The  stan¬ 
dard  approach  to  collecting  egocentric  network 
data  focuses  on  the  question  of  whom  the  focal 
person  is  connected  to  and  results  in  a  list  of  dyadic 
interactions.  In  addition,  attribute  data  on  each  of 
the  alters  can  be  collected,  and  the  ego  can  be  asked 
to  record  his  or  her  perceptions  about  the  interac¬ 
tions  among  his  or  her  alters  to  derive  the  cognitive 
structure  of  the  network.  Overlaps  among  different 
egocentric  networks  will  result  in  larger  networks 
than  just  the  star  formation  of  a  single  egocentric 
network.  The  goal  of  an  egocentric  network  analy¬ 
sis  is  to  understand  the  global  features  of  the  uni¬ 
verse.  Researchers  have  discovered  the  need  to 
conceptualize  specific  aspects  of  the  total  network 
that  resulted  in  ego-centered  network  research. 
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Data  collection  on  egocentric  networks  is  used 
to  identify  family  structures  or  group  composi¬ 
tions,  such  as  friendship  structures  or  problem¬ 
solving  groups.  The  challenges  of  an  egocentric 
approach  are  to  (a)  identify  the  appropriate  egos 
that  are  representative  so  as  to  describe  generaliz- 
able  dimensions  of  the  overall  network  and 
(b)  delimit  the  number  of  alters  at  the  right  upper 
boundary  without  excluding  too  many  potential 
alters  and  still  receive  meaningful  results.  Specific 
name  generators  are  usually  used  in  egocentric 
network  analysis  approaches  to  derive  the  per¬ 
ceived  network  structure  but  also  to  solicit  infor¬ 
mation  about  the  number  of  actors  that  have  to  be 
included  in  the  analysis. 

The  most  common  method  is  to  use  name  gen¬ 
erators,  where  the  names  of  the  alters  are  elicited 
from  the  ego  to  understand  from  his  or  her  per¬ 
spective  what  the  social  structure  around  him  or 
her  looks  like.  In  a  second  round  based  on  the 
names  collected  with  the  help  of  a  name  generator 
instrument,  a  name  interpreter  is  used,  and  the  ego 
is  asked  to  highlight  details  in  the  form  of  attri¬ 
butes  about  the  alters  (such  as  age  and  relationship 
type  to  ego).  As  opposed  to  asking  who  is  con¬ 
nected  to  whom  to  explore  the  formal  structure  of 
networks,  the  content  of  ties  can  be  identified  by 
asking  the  following  questions: 


What  are  people  talking  about? 

Why  are  they  connected? 

What  is  the  nature  of  their  relationship  ? 

Table  1  shows  the  different  qualities  of  interac¬ 
tions:  Question  1  asks  for  the  professional  support 
network,  Question  2  asks  for  the  social  support 
network,  and  Question  3  highlights  the  purely 
instrumental  network. 

Instead  of  asking  which  actors  in  a  focal  net¬ 
work  are  connected  to  each  other,  researchers 
might  want  to  find  out  how  intense  these  relation¬ 
ships  are.  The  interaction  is  already  established 
(1/0);  in  addition,  it  might  be  of  interest  what  the 
quality  of  the  interaction  is,  to  understand  whether 
an  interaction  constitutes  a  strong  or  weak  tie. 
Table  2  shows  an  example  of  a  questionnaire  that 
includes  the  frequency  of  the  interaction. 

The  result  of  each  of  the  questions  is  usually  a 
dichotomous  matrix  in  which  an  existing  or  pres¬ 
ent  tie  is  denoted  by  a  1  and  a  missing  tie  by  a  0. 
The  cells  indicating  the  relationships  between 
actors  x  and  y  can  also  be  filled  with  a  numerical 
value  indicating  the  frequency  or  intensity  of  the 
relationship.  In  this  symmetric  adjacency  matrix, 
the  diagonal  is  not  defined,  indicating  that  actors 
do  not  have  ties  with  themselves. 


Table  I  Examples  of  Questions  About  Content  of  Ties 


Question  1 

Question  2 

Question  3 

We  talk  about  problems 

We  talk  about  problems 

We  only  exchange 

at  work 

with  our  kids 

professional  information 

Actor  a 
Actor  b 
Actor  c 


Table  2  Example  of  a  Question  About  Frequency  of  Interactions 

How  often  do  you  talk  to 

the  following  actors  At  least  once  per  day  (3)  At  least  once  per  week  (2)  Only  once  per  month  (1) 

Actor  a 
Actor  b 


Actor  c 
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Network  Analytical  Concepts 

The  most  prominent  form  of  network  analysis  is 
descriptive,  following  Linton  Freeman’s  approach 
to  use  analysis  procedures  often  based  on  the 
decomposition  of  the  adjacency  matrix.  An  adja¬ 
cency  matrix  is  an  n  X  n  representation  of  the 
nodes  in  a  social  graph  that  are  next  to  each  other. 
Actors  are  compared  on  the  basis  of  their  tie  vari¬ 
ables  and  by  taking  their  actor  characteristics  (attri¬ 
butes)  into  account.  The  relationships — and  even 
nonexisting  ties — represent  nomination  patterns 
among  actors  in  a  network.  The  more  often  a  focal 
actor  is  nominated  by  other  actors  in  the  network, 
the  more  central  the  actor  becomes.  Centrality  is  an 
indicator  of  prominence,  importance,  reputation,  or 
power  within  the  overall  structure  and  generally 
describes  the  number  of  times  an  actor  is  chosen  by 
other  actors.  Centrality  shows  the  level  of  influence 
specific  actors  have  in  the  network  when  it  comes 
to  power  in  the  network.  Some  examples  of  net¬ 
work  research  questions  that  can  be  answered  using 
centrality  measures  are  the  following:  How  are 
political  opinions  adopted,  and  how  do  they  diffuse 
through  social  groups?  or  How  are  diseases  spread¬ 
ing  in  a  classroom,  school,  or  other  social  circles? 

Different  forms  of  centrality  can  be  distin¬ 
guished.  Degree  centrality  is  the  sum  of  the  num¬ 
ber  of  ties  a  node  has.  It  can  be  distinguished  into 
in-degree  centrality  (the  number  of  incoming  ties) 
and  out-degree  centrality  (the  number  of  times  the 
actor  itself  nominates  other  actors).  Closeness  cen¬ 
trality  represents  the  total  distance  of  a  node  from 
all  other  nodes.  A  larger  value  indicates  a  less  cen¬ 
tral  actor,  while  a  smaller  value  indicates  a  more 
central  actor.  On  the  whole  network  level,  this 
measure  is  called  centralization  and  is  usually  used 
as  a  comparative  measure.  The  Freeman  “between¬ 
ness  centrality”  measures  the  number  of  times  a 
node  needs  a  given  node  to  reach  another  node — 
or  in  other  words,  the  number  of  shortest  paths 
that  pass  through  a  given  node.  This  form  of  cen¬ 
trality  shows  how  strategically  important  an  actor 
is  to  connect  different  parts  of  the  overall  network 
that  might  otherwise  be  disconnected.  It  might  also 
indicate  the  resource  dependences  and  interdepen¬ 
dences  between  individual  actors. 

The  density  of  a  network  is  represented  by  the 
number  of  links  divided  by  the  number  of  possible 
links.  It  is  an  indicator  of  the  overall  connectedness 
of  the  social  graph.  In  a  complete  graph,  every  node 


is  connected  with  every  other  node,  and  the  density 
is  1.  Smaller  numbers  indicate  a  less  well-connected 
network.  Moreover,  it  might  be  interesting  for  a 
researcher  to  understand  whether  there  are  distinct 
subgroups  within  the  overall  network  that  restrict 
access  or  connectedness.  Components  in  a  network 
represent  a  subset  of  actors,  with  the  characteristic 
that  there  is  a  path  between  any  node  and  any  other 
one  in  this  subset.  The  whole  graph  forms  one  com¬ 
ponent  and  is  therefore  called  totally  connected. 
Cliques  are  subgraphs  in  which  any  node  is  directly 
connected  to  any  other  node  of  the  subgraph. 

Sociograms 

Another  way  to  interpret  the  social  structure  is  to 
use  sociograms — a  visual  two-dimensional  repre¬ 
sentation  of  the  relationships  between  all  actors 
in  the  network.  A  sociogram  is  a  reduction  of  the 
complex  relationships  to  a  single  dimension  at 
one  specific  point  in  time,  on  a  well-defined  set 
of  discrete  components  with  strictly  dyadic  rela¬ 
tionships.  A  sociogram  represents  the  formal 
properties  of  the  social  configuration.  Sociograms 
as  a  form  of  visual  exploration  of  the  social 
structure  are  one  form  of  network  analysis  and 
help  interpret  the  structure  without  necessarily 
using  formal  analytical  methods.  Figure  f  shows 
a  sociogram,  where  nodes  are  connected  to  each 
other  through  ties. 
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Applications  of  Network  Analysis 

Networks  can  be  useful  in  different  analytical 
settings.  Policy  networks  explain  outcomes  such 
as  policy  decisions  and  the  interactions  between 
the  different  parties  involved  in  the  preparation 
and  decision-making  processes.  Networks  can 
also  be  used  to  explain  the  spread  of  political 
opinions  and  how  these  might  result  in  specific 
voting  behavior.  The  spread  of  diseases  and  also 
the  most  central  hubs  in  a  diffusion  network  can 
be  identified,  and  measures  can  be  targeted 
directly  to  the  central  node.  Network  analysis 
has  recently  become  a  method  to  fight  terrorism 
and  increase  national  security.  The  capture  of 
Saddam  Hussein  is,  for  example,  attributed  to  a 
network  analytical  approach.  Recently,  network 
analysis  has  also  gained  some  momentum  in 
resilience  research  to  understand  how  well  con¬ 
nected  communities  are  and  how  they  might 
rebuild  or  restructure  when  central  nodes  are 
removed. 

In  the  future,  network  analysis  will  be  used 
more  and  more  to  predict  how  the  future  structure 
of  a  network  might  develop  based  on  the  past 
interactions  of  its  actors  and  what  might  influence 
the  social  structure.  First  models  have  been  devel¬ 
oped  but  are  not  widely  accepted  at  the  moment. 
Moreover,  network  analysis  is  used  for  new  forms 
of  networks  found  in  online  social  networking 
services,  mobile  phone  networks,  or  other  forms  of 
online  networks,  such  as  the  link  connections  of 
the  World  Wide  Web. 

hies  Mergel 
Syracuse  University 
Syracuse,  New  York,  United  States 
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Networks 


Networks  are  about  relations.  Accordingly,  rather 
than  the  properties  and  attributes  held  by  actors, 
it  is  the  ties  connecting  any  two,  three,  or  more 
individuals,  organizations,  or  institutions  that 
form  the  basic  unit  of  analysis. 

In  the  most  general  terms,  the  emergence  of  the 
network  paradigm  during  recent  decades  can  be 
attributed  to  the  following: 

•  transformations  in  the  reality  of  states,  markets, 
and  societies  as  perceived  by  competent 
observers  (e.g.,  the  emergence  of  organized 
society;  the  events  of  functional  differentiation, 
decentralization,  and  fragmentation;  and  the 
growing  interdependence  and  complexity  of 
virtually  all  societal  spheres); 

•  changes  in  conceptual  and  theoretical 
developments  in  the  respective  disciplines  dealing 
with  these  systems  of  order  (e.g.,  increasing 
attention  drawn  toward  informal  arrangements 
in  politics,  new  modes  of  governance  and 
public-private  alliances,  and  a  shift  from 
hierarchical  control  toward  horizontal 
coordination);  and 

•  the  development  of  a  methodological  apparatus 
for  relational  analysis  as  a  result  of  a  more 
pronounced  structural  approach  in  the  social 
sciences  (e.g.,  formal  analysis  of  relational 
configurations,  new  statistical  procedures, 
advancements  in  available  software  programs). 

The  boom  in  network  research  in  sociology, 
organization  theory,  and  political  science  must  be 
understood  as  part  of  a  general  shift,  beginning  in 
the  second  half  of  the  20th  century,  away  from  indi¬ 
vidualist,  essentialist,  and  atomistic  explanations 
toward  more  relational,  contextual,  and  systemic 


Networks  1691 


understandings.  This  entry  first  presents  the  histori¬ 
cal  background  of  this  trend  in  the  social  sciences 
and  then  discusses  some  of  the  methods  used  in  that 
area  of  inquiry.  Next,  it  turns  to  those  disciplines 
that  have  embraced  network  approaches  most  often 
and  successfully  and  addresses  the  use  made  of 
them  in  political  science  and  policy  analysis. 

Epistemological  and  Historical  Background 

In  the  history  of  science,  there  has  been  a  shift 
from  the  analysis  of  substance  and  essence  toward 
analysis  of  relations  and  connections.  More  than 
60  years  ago,  in  a  series  of  papers  on  epistemology 
and  logic,  two  eminent  political  philosophers/ 
scientists,  John  Dewey  and  Arthur  F.  Bentley, 
outlined  three  historic  levels  of  paradigmatic  orga¬ 
nization  of  the  social  sciences.  In  order  of  chrono¬ 
logical  appearance,  these  advancements,  presented 
almost  as  a  secular  trend,  underwent  three  major 
transformations  in  perspective — self-action,  inter¬ 
action,  and  transaction.  The  first  two  of  these  are 
labeled  substantialist,  while  the  latter  could  today  be 
recast  in  terms  of  relational  thinking.  The  perspec¬ 
tive  of  self-action  is  described  as  a  prescientific  con¬ 
cept  regarding  humans  and  things  as  possessing 
powers  of  their  own  that  initiate  or  cause  their 
actions.  It  was  most  characteristic  of  ancient  and 
medieval  philosophy  and  of  the  Christian  doctrine  of 
the  soul.  In  the  second  version  of  substantialism — 
interaction — the  relevant  action  takes  place  among 
the  entities  themselves.  Yet  much  like  billiard 
balls  or  the  particles  in  Newtonian  mechanics, 
these  entities  remain  fixed  and  unchanged 
throughout  such  interactions.  It  is  only  in  the  per¬ 
spective  of  transactionalism,  fundamentally 
opposed  to  both  variants  of  substantialism,  does 
scientific  inquiry  turn  to  aspects  and  phases  of 
action,  without  final  attribution  to  elements  or 
other  independent  entities  and  without  isolation 
of  presumably  detachable  relations  from  such 
independent  elements. 

By  reinterpreting  and  contextualizing  this  early 
contribution,  its  classificatory  scheme  was  later 
refurbished  by  Mustafa  Emirbayer  (1997)  in  a 
manifesto  for  a  relational  sociology.  In  particular, 
the  author  draws  attention  to  the  units  of  analysis 
inherent  in  and  dominating  social  science  models 
of  more  recent  derivations.  Methodological  indi¬ 
vidualism,  norm-based  approaches,  and  variants 


of  structuralism  are  all  assigned  to  the  paradigm 
of  self-action.  The  first,  in  its  rational-choice  ver¬ 
sion,  takes  individual  human  action  as  the  ele¬ 
mentary  unit  of  social  life.  In  the  game-theoretic 
version,  pregiven  entities  are  seen  to  generate 
self-action — that  is,  actors  engage  in  games  with 
others  without  their  underlying  interests  and 
identities  encountering  substantial  change.  In 
norm-based  approaches,  individuals  are  depicted 
as  self-propelling,  self-subsistent  entities  pursuing 
internalized  norms,  with  the  latter  actually  form¬ 
ing  the  basic  unit  of  analysis.  Finally,  structuralism 
does  not  posit  individuals  but  self-subsistent  soci¬ 
eties,  or  social  systems,  as  the  exclusive  sources  of 
action.  Accordingly,  it  is  durable,  coherent  entities 
that  constitute  the  starting  point  of  inquiry.  The 
second  variant  of  substantialism — interaction — is 
present  today  in  the  form  of  conventional  survey 
research  and  historical-comparative  analysis.  It  is 
the  so-called  variable-centered  approach,  includ¬ 
ing  methods  such  as  multiple  regression  and  fac¬ 
tor  analysis,  that  best  represents  this  variant. 
Providing  merely  the  empty  settings  within  which 
causation  occurs,  it  is  not  the  substances  or  actors 
in  question  that  do  the  acting  according  to  this 
particular  view  but,  rather,  the  very  variable  attri¬ 
butes  themselves. 

What  Dewey  and  Bentley  have  called  transac¬ 
tionalism  actually  represents  the  cornerstone  for 
the  type  of  relational  perspective  adopted  and 
methodologically  elaborated  by  various  strands 
of  network  theory  and  analysis.  These  strands 
have  a  number  of  origins.  The  prehistory  of  the 
approach  developed  through  a  process  of  accumu¬ 
lation  of  knowledge  over  approximately  40  to 
50  years.  Some  have  detected  its  origins  in  the 
empirical  and  conceptual  work  of  Bronislaw 
Malinowski  (1884-1942),  Georg  Simmel  (1858- 
1918),  Alfred  Radcliffe-Brown  (1881-1955),  and 
Claude  Levi-Strauss  (1908-2009),  while  others 
refer  to  important  methodological  contributions 
in  the  tradition  of  Jacob  Moreno  (1889-1974).  In 
any  case,  both  the  method  and  its  theoretical 
underpinnings  have  emerged  from  the  multidisci¬ 
plinary  efforts  of  a  whole  range  of  scholars  in 
anthropology,  social  psychology,  and  sociology. 
Key  advancements  and  topics  addressed  include 
structural  functionalism,  theories  of  social  struc¬ 
ture,  gift  giving,  systems  of  specific  and  general¬ 
ized  forms  of  reciprocity,  and  social  exchange 


1692  Networks 


circuits  (anthropology);  Gestalt  psychology,  field 
theory,  cognitive  balance,  and  sociometry  (social 
psychology);  and  tertium  gaudens  (“the  third 
who  benefits”),  friendship  and  informal  relations 
of  assistance  at  the  workplace,  community 
research,  and  social  exchange  theory  (sociology). 
This  prehistory  of  today’s  network  analysis  cul¬ 
minated  in  what  some  have  called  the  Harvard 
breakthrough,  achieved  by  scholars  such  as 
Harrison  White  and  his  associates  in  the  late 
1960s  and  early  1970s.  White’s  contribution  to 
relational  analysis  is  manifold  both  methodologi¬ 
cally  (structural  equivalence,  positional  and  role 
analysis,  block  modeling)  and  in  terms  of  the 
imprints  it  has  left  on  various  disciplines,  includ¬ 
ing  sociology,  political  science,  and  economics. 

Units  of  Analysis  and  Methodological  Tools 

Because  network  analysis  is  explicitly  interested  in 
the  interrelatedness  of  social  units,  network  data 
are  stored  in  a  rectangular  matrix  where  the  col¬ 
umns  generally  contain  the  same  variables  as  the 
rows.  These  can  be  individual  actors,  such  as  direc¬ 
tors  occupying  positions  on  different  company 
boards,  or  organizations,  such  as  government 
departments  or  private  corporate  actors.  The 
dependences  among  them  are  measured  using 
structural  variables.  Compared  with  conventional 
attribute-based  measures,  a  focus  on  such  vari¬ 
ables  opens  up  a  different  range  of  possibilities  for, 
and  constraints  on,  data  analysis  and  model  build¬ 
ing.  Leaning  on  specific  mathematical  and  alge¬ 
braic  procedures,  the  methods  employed  provide 
explicit  formal  statements  about  structural  phe¬ 
nomena  that  might  otherwise  be  defined  only  in 
metaphorical  terms.  Metaphors,  present  in  much  of 
the  qualitative  literature,  include  phrases  such  as 
webs  of  relationships,  closely  knit  networks  of  rela¬ 
tions,  social  role,  social  position,  group,  clique, 
popularity,  isolation,  prestige,  prominence,  and  so 
on.  In  social  network  analysis,  these  can  be  quanti¬ 
fied  and  made  subject  to  interpersonal  verification. 
Analysis  normally  starts  with  delimiting  a  particu¬ 
lar  group  of  people  or  organizations  according  to 
what  is  called  boundary  specification — a  procedure 
following  either  a  realist  (based  on  snowball  sam¬ 
pling  techniques  or  reputational  analysis)  or  a 
nominalist  (based  on  the  researcher’s  own  concep¬ 
tual  framework  and  understanding)  strategy. 


Indices  and  procedures  most  frequently  used  include 
measures  of  degree  and  betweenness,  centrality, 
density,  clique  analysis,  structural  equivalence  and 
block  modeling,  multidimensional  scaling,  hierar¬ 
chical  cluster  analysis,  and  so  forth.  The  most  useful 
and  widely  diffused  handbooks  of  network  analysis 
are  the  ones  by  John  Scott  (1991),  and  Stanley 
Wassermann  and  Katherine  Faust  (1994).  Important 
software  programs  for  the  storing,  processing,  and 
visualization  of  relational  data  include  Ucinet, 
Gradap,  Krackplot,  Netdraw,  Visone,  and  Pajek. 

Networks  in  Sociology 
and  Organization  Theory 

Sociology,  management,  and  organization  theory 
are  the  disciplines  where  network  models  have 
been  employed  most  successfully.  The  fields  of 
application  and  the  main  topics  addressed  by  that 
literature  comprise  a  whole  range  of  different 
dimensions.  These  include,  among  others,  the 
impact  of  urbanization  on  individual  well-being, 
the  world  political  and  economic  system,  commu¬ 
nity  elite  decision  making,  social  support,  commu¬ 
nitarian  forms  of  collective  action,  group  problem 
solving,  the  role  of  strong  versus  weak  ties  for 
group  and  intergroup  cohesion,  diffusion  and 
adoption  of  innovations,  corporate  interlocking, 
belief  systems,  cognition  or  social  perception, 
interfirm  alliances,  trust  among  competitors,  forms 
of  social  and  political  exchange  and  power,  and 
consensus  and  social  influence.  The  volume  of 
social  network  research  in  these  fields  has  increased 
radically  and  exponentially  in  recent  years,  as  pre¬ 
sented  in  Figure  1,  which  shows  articles  indexed  by 
Sociological  Abstracts  containing  the  words  “social 
network”  in  the  abstract  or  title. 

Political  scientists  and  network  analysts  interested 
in  gaining  insights  from  adjacent  disciplines,  where 
relational  thinking  is  both  more  pronounced  and 
longer  established,  are  advised  to  consult  the  hand¬ 
book  on  organizations  edited  by  Joel  Baum  (2002). 
Comprising  the  contributions  of  some  50  influential 
international  scholars,  the  volume’s  tour  de  force 
through  intraorganizational,  organizational,  and 
interorganizational  levels  represents  the  state-of-the- 
art  current  research  in  that  area.  Networks  figure 
prominently  in  this  all-encompassing  treatment 
where  relations  form  the  principal  units  of  analysis 
throughout. 
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Figure  I  Exponential  Growth  of  Network-Analytic 
Literature  in  Organizational  Sociology 

Source:  Stephen  Borgatti  and  Pacey  Foster  (2003,  p.  992). 

Political  Networks  and 
Policy  Domain  Networks 

These  developments  have  been  noted  relatively 
little  in  mainstream  political  science.  Even  scholars 
working  on  state-society  relations  and  in  interna¬ 
tional  relations  somehow  remained  unaffected  by 
transactionalist  thinking.  In  other  words,  although 
they  were  dealing  with  connections  between  enti¬ 
ties  and  actors  of  different  societal  spheres,  schol¬ 
ars  treated  these  spheres  separately.  The  reciprocal 
influence  exerted  by  entities  on  relations  and,  vice 
versa,  by  relations  on  entities,  has  not  formed  an 
explicit  part  of  these  research  programs  for  quite 
some  time.  A  major  breakthrough  then  occurred 
with  the  emergence  of  interorganizational  research 
and  the  neo-corporatist  paradigm  in  the  late 
1970s.  Focusing  on  entities  such  as  organized 
interests  and  the  state,  including  the  attributes  of 
these  entities,  and  on  relations,  especially  in  the 
form  of  interest  intermediation  and  political 
exchange,  neo-corporatist  authors  tended  to  adopt 
a  transactional  perspective.  This  was  most  obvious 
in  their  conceptualization  of  two  logics  to  which 
associations  of  both  capital  and  labor  had  to 
adhere  in  the  interest  of  organizational  reproduc¬ 
tion  and  survival — the  logic  of  influence  and  the 
logic  of  membership. 


Not  by  chance,  the  first  major  attempt  to  empir¬ 
ically  verify  the  relevance  of  large  corporate  actors 
in  state  policy  making  occurred  at  a  time  when  the 
neo-corporatist  debate  reached  its  height  toward 
the  mid-1980s.  It  was  Edward  O.  Laumann’s  and 
David  Knoke’s  seminal  study  on  The  Organizational 
State  (1987)  that,  for  the  first  time,  exposed  the 
power  of  empirical  network  analysis  to  a  wider 
public  in  political  science  and  political  sociology. 
The  authors  adopted  a  policy  domain  perspective — 
that  is,  they  aggregated  different  policies  and  policy 
events  in  the  health  (e.g.,  drugs  regulation,  Medicare 
and  Medicaid  funding)  and  the  energy  domains 
(e.g.,  nuclear  waste  disposal,  strip-mining  control) 
to  study  the  configuration,  power,  and  behavior  of 
diverse  actors  within  these  organizational  fields. 
These  actors’  issue  interests,  monitoring  resources, 
and  influence  reputation  were  treated  as  antecedent 
variables  ultimately  affecting  positions  in  communi¬ 
cation  and  resource  exchange  networks.  Together, 
these  five  variables  jointly  affect  the  range  of  core 
organizations’  efforts  to  influence  the  outcomes  of 
policy  events  in  the  two  domains.  What  was  impor¬ 
tant  was  that  the  volume  combined  structural  with 
dynamic  analysis  and  took  account  of  actors’ 
beliefs,  frame  disputes,  and  reinterpretations,  thus 
avoiding  the  indeterminacy  present  in  much  earlier 
work  while  at  the  same  time  anticipating  parts  of 
the  research  program  of  the  advocacy  coalition 
framework.  The  database  used  for  these  studies  was 
impressive.  After  boundary  specification  and  a 
reduction  of  the  lists  of  resulting  organizations  to  an 
empirically  manageable  number,  the  authors  ended 
up  with  a  total  of  333  public  and  private  actors, 
which  were  then  approached  with  the  help  of  ques¬ 
tionnaires.  The  number  of  analyzed  policy  events 
was  166.  Although  interview  schedules  and  lists 
comprised  about  80  pages  (see  the  appendix  to  the 
Laumann  and  Knoke  volume),  the  overall  response 
rate  amounted  to  92.3%.  Figures  like  this  remained 
unmatched  by  other  attempts  of  comparing  policy 
processes  across  domains  and  countries.  At  the 
same  time,  this  also  resulted  in  an  initial  reluctance 
of  established  types  of  research  vis-a-vis  network- 
analytic  approaches.  The  sheer  number  of  ties  and 
relations,  in  terms  of  contacts  and  resource 
exchanges  generated  by  such  a  high  number  of 
actors,  together  with  the  intricacy  of  analyzing  them 
in  algebraic  terms  led  many  scholars  to  question  the 
benefit  of  and  value  added  by  such  a  painstaking 
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procedure.  Overall,  the  study  of  political  and  policy 
domain  networks  considers  state  and  society  in 
their  complexity  and  comprises  the  entire  range  of 
policies,  politics,  and  the  polity.  Its  focus  is  essen¬ 
tially  on  influence  and  domination  as  determinants 
of  different  types  of  power  (coercive,  authoritative, 
egalitarian,  persuasive)  and,  in  its  most  advanced 
form,  on  the  role  of  power  differentials  in  the  pre¬ 
dictions  of  future  policy  outcomes  and  courses  of 
action  (Knoke,  1990). 

Policy  Network  Analysis 

Compared  with  the  analysis  of  political  networks 
in  the  broader  sense,  which  share  quite  a  number 
of  questions  and  of  terminology  with  organiza¬ 
tional  sociology,  there  is  a  considerably  higher 
number  of  publications  in  the  field  of  policy  net¬ 
work  analysis — a  field  that  started  to  boom  in  the 
early  1990s.  In  logical  terms,  the  latter  actually 
represents  a  subcategory  of  the  former.  It  also  rests 
on  a  different  research  tradition,  namely  on  public 
policy  analysis.  Beginning  in  the  early  1990s,  there 
has  been  a  steady  increase  in  the  number  of  both 
qualitative  and  quantitative  publications.  Compared 
to  the  absolute  figures  of  relevant  publications  in 
organizational  sociology  for  the  year  2000  (about 
440),  the  development  of  policy  network  analysis 
(about  50)  appears  to  be  relatively  modest.  Within 
the  narrower  field  of  political  science,  however,  its 
expansion  is  impressive. 

In  one  of  the  first  publications  opening  the 
debate  on  policy  networks,  a  research  program 
was  sketched  out  that  soon  became  a  growth 
industry  for  the  following  two  decades.  It  was  most 
vigorously  exposed  in  the  contribution  by  Patrick 
Kenis  and  Volker  Schneider  (1991)  to  an  edited 
volume  on  policy  networks  that  assembled  theo¬ 
retical,  methodological,  and  empirical  contribu¬ 
tions.  The  authors  argued  that  policy  analysis  and 
network  analysis  possess  an  important  potential 
for  cross-fertilization.  Network  analysis  was  intro¬ 
duced  as  a  technical  tool  applicable  to  any  form  of 
social  or  political  exchange.  At  the  same  time,  net¬ 
works  were  also  said  to  represent  a  kind  of  broad 
societal  governance  structure — that  is,  to  reflect 
real  changes  in  the  way  policy  processes  unfolded 
in  highly  modern  settings.  New  modes  of  gover¬ 
nance,  characterized  by  an  increasing  blurring  of 
boundaries  dividing  the  public  and  the  private 


spheres  in  a  number  of  advanced  countries,  have 
indeed  emerged  more  or  less  simultaneously  with 
the  network  paradigm.  After  20  years,  since  that 
early  observation  about  a  simultaneous  occurrence 
in  advancements  in  methods  and  transformations 
in  empirical  reality,  the  question  must  be  asked  as 
to  whether  such  a  compound  understanding  of 
networks  has  really  enriched  the  new  subdisci¬ 
pline.  At  least,  it  has  given  way  to  an  endless 
debate  on  the  actual  meaning  of  networks  in  poli¬ 
tics  and  on  the  usefulness  of  the  concept’s  applica¬ 
tion  to  concrete  cases. 

Views  by  most  scholars  today  converge  on  the 
image  of  a  semisovereign  state  of  increasing  ungov¬ 
ernability,  of  demand  overload,  and  of  the  subse¬ 
quent  emergence  of  public-private  alliances  and 
networks  in  agenda  setting,  decision  making,  and 
the  implementation  of  policies.  For  most  of  the 
1990s,  nonetheless,  a  large  fraction  of  the  literature 
became  entangled  in  attempts  at  categorizing  exist¬ 
ing  approaches,  mostly  according  to  rather  subjec¬ 
tive  criteria,  rather  than  embarking  on  the  empirical 
study  of  specific  policies.  Networks  were  thought  of 
by  many  as  reflecting  properties  alternative  to 
established  organizations  and  forms  of  organizing 
social  life  and,  hence,  were  held  to  be  empowering, 
enfranchising,  and  socially  desirable,  thus,  ulti¬ 
mately  undermining  inflexible,  deadlocked,  and 
irresponsive  authority  structures.  One  of  the  issues 
frequently  addressed  was  the  relationship  between 
the  so-called  issue  networks,  policy  communities, 
and  policy  networks — all  concepts  differently  char¬ 
acterized  and  classified  according  to  personal  taste 
and  proclivity.  Another  schism  was  linked  to  an 
assumed  dichotomy  of  vertically  versus  horizontally 
structured  configurations.  While  the  governance 
school  claimed  networks  to  be  equalizing  instru¬ 
ments  for  the  pursuit  of  efficiency  enhancement  in 
the  delivery  of  public  services,  empirical  analysis  all 
too  often  found  relational  structures  exhibiting 
power  differentials  and  vertical  forms  of  subordina¬ 
tion  of  all  sorts.  Some  authors  treated  networks  as 
quasi-organizations,  with  a  sort  of  membership 
statute  and  a  corporate  identity  of  their  own,  delib¬ 
erately  built  up  and,  therefore,  subject  to  network 
management  strategies.  Conversely,  others  main¬ 
tained  that  networks  could  only  be  identified  by  an 
outside  observer  with  the  help  of  boundary  specifi¬ 
cation  and  other  empirical  tools  since  sample  mem¬ 
bers  were  frequently  not  aware  of  forming  part  of  a 
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network  altogether.  Finally,  another  view  had  it 
that  policy  networks  would  only  emerge  under  very 
specific  conditions  most  often  linked  to  high  degrees 
of  societal  modernization,  while  others  claimed 
networks  to  be  a  ubiquitous  phenomenon  indepen¬ 
dent  of  any  particular  political  environment. 

Overall,  no  single  review  article  has  managed  to 
structure  the  field  in  a  coherent  and  empirically 
grounded  way.  This  unsustainable  condition  has 
now  been  rectified.  In  a  2007  study,  Leifeld  takes 
both  the  extensive  growth  rates  of  the  literature  and 
the  conceptual  disorder  characterizing  the  field  as  a 
rationale  for  presenting  a  systematic  and  empiri¬ 
cally  based  classification.  He  draws  on  a  structured 
bibliography  of  1,014  publications  related  to  polit¬ 
ical  networks.  With  a  view  to  empirical  (noncon- 
ceptual)  contributions  alone  (746  entries),  the  final 
analysis  yields  the  following  insights:  In  terms  of 
decreasing  relative  frequency,  the  areas  of  inquiry 
most  often  addressed  are  governance/public  policy; 
Europeanization;  local,  urban,  and  rural  studies; 
nongovernmental  organizations  (NGOs)  and  social 
movements;  international  relations;  electoral  sys¬ 
tems  and  voting;  civil  society;  elite  networks;  social 
capital;  globalization;  and  democracy.  Regarding 
the  policy  domains  studied  by  policy  network  ana¬ 
lysts,  the  rank  ordering  is  as  follows:  environment, 
health,  agriculture,  social  policy,  labor,  telecommu¬ 
nications,  economy,  regional  policy,  domestic 
affairs,  energy,  research  and  technology,  industry, 
science  and  education,  gender  issues,  chemicals, 
migration,  and  transport.  The  bulk  of  empirical 
research  focuses  on  Europe  rather  than  the  United 
States,  with  Germany  and  the  United  Kingdom 
attracting  the  most  attention.  Both  of  these  are 
closely  followed  by  studies  of  policy  processes  at 
the  level  of  the  European  Union.  Considering  only 
strictly  quantitative  contributions,  the  author  has 
found  that  the  following  methods  and  indices  are 
the  ones  that  are  most  often  used  (decreasing  rela¬ 
tive  frequency):  centrality,  density,  multidimen¬ 
sional  scaling,  block  modeling,  cluster  analysis,  and 
clique  analysis.  Administration,  governments,  and 
parliaments  lead  the  list  in  terms  of  main  govern¬ 
mental  targets  under  scrutiny,  while  trade  associa¬ 
tions,  trade  unions,  companies,  NGOs,  political 
parties,  think  tanks,  and  social  movements  figure 
most  prominently  in  the  category  of  private  actors. 
The  most  frequently  analyzed  types  of  relations 
are  information  exchange,  contact,  cooperation, 


influence  reputation,  resource  exchange,  alliance, 
mutual  issue  interest,  and  conflict. 

An  extensive  cocitation  analysis  of  the  quantita¬ 
tive  policy-analytic  literature,  itself  being  pursued 
in  a  network-analytic  fashion,  reveals  the  existence 
of  four  clearly  distinguished  thematic  clusters  (i.e., 
epistemic  communities  or  invisible  colleges):  first, 
political  exchange  in  the  organizational  state;  sec¬ 
ond,  elite  networks  in  North  America  and  world 
systems  theory;  third,  participation  and  social 
capital;  and  fourth,  governance  and  interest  inter¬ 
mediation.  As  Leifeld  notes,  a  further  and  slowly 
emerging  fifth  cluster  includes  culture-based 
approaches  essentially  centered  on  the  advocacy 
coalition  framework. 

Conclusion 

Even  if  there  should  be  something  like  a  secular 
trend  pointing  toward  an  increasing  relevance  of 
relational  research,  it  is  far  too  early  to  speak  of  a 
comprehensive  paradigm  shift  with  respect  to  the 
growing  attractiveness  of  network  analysis. 
Mainstream  political  science,  at  least,  continues  to 
be  concerned  with  heuristics,  being  more  tradi¬ 
tional,  and  prefers  working  within  clearly  delim¬ 
ited  disciplinary  boundaries  that  can  be  traced 
back  to  the  perspectives  of  self-action  and  interac¬ 
tion.  Nonetheless,  the  current  move  toward  the 
study  of  new  forms  of  governance,  in  conjunction 
with  network  models  and  approaches,  may  lay  the 
foundations  for  a  relational  political  science — or 
structural  “politology” — capable  of  addressing  the 
evolution  of  complex  organizational  ecologies 
(intraorganizational,  organizational,  and  interor- 
ganizational)  and  forms  of  societal  order  (state, 
market,  and  society)  at  the  subnational,  national, 
European,  and  international  levels.  The  theoretical 
and  methodological  bridges  and  connections  for 
the  emergence  of  such  a  theory,  at  least,  are  ready 
for  use.  They  just  need  to  be  picked  up. 
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Prague,  Czech  Republic 
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Neutrality 


Neutrality  refers  to  a  fundamental  foreign  and  secu¬ 
rity  policy  orientation  of  a  state:  A  neutral  state  will 
not  take  part  either  directly  or  indirectly  in  any 
forthcoming  war  between  third-party  states. 
Neutrality  consists  of  a  legal  core — neutrality  law — 
and  political  guidance  notes — neutrality  policy.  It 
contains  both  realistic  assumptions  about  state 
survival  and  idealistic  commitments  to  nonviolent 
means  of  conflict  resolution.  Until  the  rise  of  collec¬ 
tive  security,  neutrality  was  seen  as  the  only  viable 
security  policy  alternative  to  membership  in  a  mili¬ 
tary  alliance  available  to  small  states.  This  entry 
begins  with  a  historical  overview  and  an  introduc¬ 
tion  to  neutrality  law  and  neutrality  policy.  This  is 
followed  by  a  comparison  of  neutrality  and  collec¬ 
tive  security  as  well  as  some  final  reflections  on  the 


significance  of  neutrality  in  the  modern  era  of 
globalization. 

Historical  Origin 

Taken  literally,  neutrality  means  neither-nor  (from 
the  Latin  ne-uter).  Its  origin  as  a  foreign  and  secu¬ 
rity  policy  orientation  dates  back  to  a  time  when 
European  nation  states  were  commonly  fighting 
each  other.  The  concept  of  state  sovereignty 
emerged  in  the  same  period.  The  right  to  wage  war 
was  constitutive  of  that  international  order. 
Neutrality  correlated  to  this  right:  it  offered  the 
possibility  of  abstaining  from  war  in  an  environ¬ 
ment  in  which  not  taking  sides  would  otherwise  be 
viewed  as  opportunism  or  even  cowardice. 

Neutrality  was  mainly  a  survival  strategy  for 
especially  small  states,  which  were  in  permanent 
danger  of  being  overrun  by  larger  ones.  Smaller 
states  were  not  usually  primary  targets,  but  con¬ 
quering  them  was  a  way  of  preventing  them  from 
either  falling  to  one  side  or  freely  siding  with  the 
other.  Thus,  neutrality’s  original  purpose  was  to 
help  states  defend  their  interests  in  an  international 
environment  marked  by  interstate  conflicts.  Besides 
this  realistic  perspective,  neutrality  has  also  repre¬ 
sented  a  traditionally  idealistic  approach  to  inter¬ 
national  relations;  a  neutral  state’s  claim  to  abstain 
from  war  was  also  a  commitment  not  to  add  vio¬ 
lence  to  existing  conflicts.  Even  though  it  was 
originally  conceived  of  as  a  survival  strategy,  neu¬ 
trality  inspired  ways  of  thinking  linked  to  a  non¬ 
violent  approach  to  international  conflicts.  One 
example  was  the  concept  of  neo-neutrality  put 
forth  by  Georg  Cohn  in  the  period  between  World 
Wars  I  and  II.  It  advocated  active  and  collective 
disqualification  of  war  parties  by  all  neutral  states, 
including  the  use  of  sanctions.  Neutrals  would  stay 
outside  the  war,  and  it  was  stipulated  that  their 
sanctions  should  not  be  of  a  military  nature. 

This  idealistic  connotation  of  neutrality  was  rein¬ 
forced  through  the  tendency  of  neutral  states  to 
compensate  for  their  military  absenteeism  by 
strengthening  their  political  engagement  to  diminish 
the  suffering  caused  by  war.  This  engagement  could 
take  the  form  of  humanitarian  actions  or  attempts 
to  end  military  conflicts.  It  was  usually  not  pursued 
out  of  idealism  or  as  a  peace  strategy  but  out  of  a 
perceived  need  to  show  solidarity  with  the  war  par¬ 
ties  and  the  difficulties  and  costs  they  incurred. 
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Neutrality  Law 

The  legal  rights  and  obligations  of  neutrality  are 
rooted  in  the  Hague  Conventions  of  1907  and  in 
the  Geneva  Conventions  of  1949  on  international 
humanitarian  law.  Neutral  states  are  required  not 
to  participate  in  war  either  directly  or  indirectly. 
They  should  not  support  war  parties  with  military 
means  nor  should  they  make  their  territory  avail¬ 
able  to  such  parties,  supply  them  with  weapons  or 
funds,  or  restrict  private  weapon  exports  in  a  one¬ 
sided  way.  Neutrals  are  also  required  to  defend 
themselves  autonomously  against  violations  of 
their  neutrality.  Neutrality  excludes  the  right  to 
belong  to  a  military  alliance  since  such  a  member¬ 
ship  would  entail  the  exact  opposite  of  neutrality: 
While  neutrality  raises  the  expectation  that  a  state 
will  not  participate  in  a  future  military  conflict, 
participation  in  a  military  alliance  includes  the 
obligation  that  a  state  will  support  its  fellow  mem¬ 
bers  in  a  potential  military  conflict.  Thus,  accord¬ 
ing  to  international  law,  neutrality  is  a  clearly  and 
narrowly  defined  status.  If  a  state  adopts  an  exten¬ 
sive  neutrality  policy,  this  occurs  out  of  political 
considerations  and  not  out  of  legal  obligations. 

Neutrality  Policy 

To  fulfill  its  function,  neutrality  demands  accep¬ 
tance  by  third  parties.  This  leads  to  a  minimum 
degree  of  credibility.  Neutrality  policy  compre¬ 
hends  the  measures  a  state  takes  to  guarantee  the 
efficiency  and  credibility  of  its  neutrality.  In  con¬ 
trast  to  neutrality  law,  neutrality  policy  is  not 
legally  regulated.  Neutrality  policy  may  be  defined 
in  line  with  the  interests  of  each  individual  neutral 
state.  States  that  qualify  themselves  as  permanently 
neutral  obligate  themselves  to  conduct  a  foreign 
policy,  even  in  peace  time,  that  enables  them  to 
remain  neutral  in  a  potential  conflict.  This  is  based 
on  what  are  called  the  pre-effects  of  neutrality. 

Active  neutrality  refers  to  a  policy  stance  that 
underscores  that  neutrality  does  not  necessarily 
need  to  be  equated  with  foreign  policy  abstention. 
Neutral  states  have  often  interpreted  their  neutral¬ 
ity  as  providing  them  with  both  an  option  of 
assuming  and  an  obligation  to  assume  specific 
tasks  in  international  relations.  Switzerland  has 
followed  a  policy  of  so-called  good  offices  since 
the  end  of  the  19th  century.  Such  offices  attempt 
to  bring  parties  in  conflict  closer  together.  They 


exert  no  political  influence  or  pressure,  striving 
instead  to  assume  protective  power  mandates,  pro¬ 
vide  conference  space,  or  offer  arbitration  assis¬ 
tance.  Other  West  European  neutrals  such  as 
Austria,  Finland,  Ireland,  or  Sweden  have  a  long 
tradition  of  active  engagement  with  United  Nations 
(UN)  peacekeeping  operations.  During  the  Cold 
War,  all  neutral  states  played  an  important  role  in 
the  Conference  on  Security  and  Cooperation  in 
Europe.  Together  with  the  nonaligned  states,  they 
formed  the  so-called  Neutral  and  Nonaligned 
group,  which  strived  to  be  a  third  actor  between 
the  Western  and  Eastern  blocs. 

Neutrality  policies  have  varied  considerably 
across  time  and  states.  Switzerland  has  adopted  a 
rather  comprehensive  neutrality  policy.  It  did  not 
even  join  the  UN  until  2002  because  it  saw  the 
obligations  of  collective  security  as  incompatible 
with  its  neutrality  obligations.  The  other  Western 
European  neutrals,  however,  had  no  second 
thoughts  about  being  members  of  that  worldwide 
organization. 

The  situation  was  different  in  regard  to  the 
European  integration  process:  With  the  exception 
of  Ireland,  no  neutral  Western  European  state 
joined  the  European  Community  (EC)  during  the 
Cold  War.  They  considered  this  type  of  political 
and  trade  cooperation  as  incompatible  with  their 
neutrality  policy.  From  a  legal  point  of  view,  there 
would  have  been  no  obstacles  as  the  EC  was  not  a 
military  alliance.  After  the  Cold  War  had  ended, 
Austria,  Finland,  and  Sweden  joined  the  European 
Union  in  1995. 

Neutrality  and  Collective  Security 

The  core  objectives  of  collective  security  and  neu¬ 
trality  are  similar:  Both  want  to  regulate  and  limit 
the  use  of  force  in  interstate  relations.  But  they  dif¬ 
fer  in  regard  to  the  means:  Whereas  collective  secu¬ 
rity  builds  on  active  commitment  and  participation 
of  all  states,  neutrality  is  based  on  individual 
abstention. 

Neutrality  and  collective  security  may  be  seen  as 
concurrent  concepts:  If  there  were  a  perfectly 
working  system  of  collective  security,  there  would 
be  no  need  for  neutrality.  The  international  com¬ 
munity  would  deal  with  any  international  aggres¬ 
sion.  However,  as  no  perfect  collective  security 
system  has  yet  emerged,  neutrality  may  be  seen  as 
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complementary  to  collective  security:  The  better 
collective  security  works,  the  less  neutrality  is 
needed,  and  vice  versa.  In  history,  this  has  repeat¬ 
edly  confronted  neutral  states  with  difficult  choices. 

With  the  emergence  of  the  League  of  Nations  in 
1920,  neutral  states  had  to  decide  whether  to  join 
the  newly  created  organization  of  collective  secu¬ 
rity.  All  of  them  opted  for  participation  in  the 
League.  Switzerland  distinguished  between  its  read¬ 
iness  to  participate  in  the  League’s  economic  sanc¬ 
tions  and  its  reservations  toward  potential  military 
sanctions,  for  which  participation  was  excluded. 
The  term  differential  neutrality  was  coined  to  cover 
this  nuance.  After  the  failure  of  the  League,  the 
country  returned  to  its  integral  neutrality. 

Today,  all  neutral  states  are  members  of  the 
UN.  Military  interventions  led  or  approved  by  the 
UN  are  not  considered  as  traditional  interstate 
wars  but  as  supranational  police  actions  in  which 
the  international  community  sanctions  the  trans¬ 
gression  of  international  norms  by  one  or  several 
of  its  members.  Such  measures  do  not  qualify  as 
traditional  wars  and  therefore  neutrality  law  does 
not  apply  to  them.  However,  neutrality  remains 
relevant  in  the  case  of  international  interventions 
that  are  not  decided  by  the  UN.  Examples  of  such 
interventions  were  the  NATO-led  intervention  in 
Serbia/Kosovo  in  1999  and  the  U.S.  Iraq  interven¬ 
tion  in  2003. 

Neutrality  and  Globalization 

Neutrality  reflects  an  attitude  toward  military  con¬ 
flicts  between  third-party  states.  It  does  not  apply 
to  intrastate  conflicts  nor  to  transnational  forms  of 
conflicts  such  as  terrorism.  Neutrality  is  not  rele¬ 
vant  to  global  nonmilitary  perils:  It  provides  no 
answer  regarding  problems  of  global  health,  cli¬ 
mate  change,  or  international  financial  stability. 
Thus,  neutrality  has  lost  much  of  its  traditional 
core  significance  in  the  era  of  globalization.  It  pro¬ 
vides  no  guidance  regarding  some  of  the  most  rel¬ 
evant  issues  in  international  relations. 

However,  neutrality  still  qualifies  as  a  foreign 
policy  role  conception:  In  countries  with  a  neutral¬ 
ity  tradition,  it  provides  political  legitimacy  for  the 
respective  foreign  policy  inside  the  country.  It  also 
allows  third  parties  to  have  certain  expectations  in 
regard  to  the  country’s  foreign  policy  behavior: 
They  may  expect  that  a  neutral  state — compared 


with  other  countries — will  have  a  stronger  ten¬ 
dency  to  refrain  from  the  use  of  military  means  in 
international  affairs.  A  neutral  state  will  also  show 
a  natural  inclination  to  promote  international  con¬ 
flict  resolution.  Owing  to  their  track  records,  neu¬ 
tral  countries  have  comparative  advantages  as 
honest  brokers. 

These  attributes  of  neutrality  and  also  its  contri¬ 
bution  to  a  nation’s  political  identity  build  on 
neutrality’s  political  core  and  not  on  its  legal 
dimension.  Neutrality  was  at  the  origin  of  both 
realistic  and  idealistic  role  conceptions.  The  first 
emanated  from  neutral  states’  military  absentee¬ 
ism,  the  second  from  the  basic  commitment  of 
neutral  states  to  regulate  and  limit  the  use  of  force. 
The  relative  importance  of  both  categories  depends 
on  the  international  structure  in  which  neutral 
states  operate.  In  a  state  system  characterized  by  a 
clear  reduction  in  interstate  military  conflicts,  the 
idealistic  role  conception  has  gained  significance 
compared  with  the  realistic  one.  Neutrality’s  role 
as  a  foreign  policy  identity  provider  has  become  its 
most  important  function  since  the  end  of  the  Cold 
War.  It  is  based  on  what  used  to  be  neutrality’s 
secondary  function — namely,  its  idealism  or  mis¬ 
sionary  belief.  Neutral  states  used  to  be  experts  in 
compensating  for  their  lack  of  military  force  and 
engagement  with  other  power  and  activity  dimen¬ 
sions.  Neutrality  may  keep  this  function  as  long  as 
neutral  states  actively  contribute  to  the  promotion 
of  peaceful  international  relations.  It  should  not  be 
seen  as  encouraging  passive  contemplation  of 
injustice,  violence,  and  oppression.  Nor  was  neu¬ 
trality  defined  as  a  neutral  attitude  toward  attempts 
to  prevent  military  conflicts. 
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New  Public  Management 


New  public  management  (NPM)  is  a  general  con¬ 
cept  denoting  a  global  wave  of  administrative 
reform  that  has  had  an  impact  on  the  public  sectors 
of  many  countries  for  more  than  2  decades.  It  is 
inspired  by  a  broad  neoliberal  ideology  and  a  par¬ 
ticular  set  of  normative  values  whose  main  focus  is 
on  increasing  efficiency.  In  this  sense,  NPM  is  rather 
one  dimensional  and  produces  some  tensions  with 
other  norms  and  values  within  the  public  sector. 

The  concept  was  launched  by  Christopher 
Hood  in  an  article  published  in  1991.  Most  NPM 
reform  efforts  have  had  similar  goals:  to  improve 
the  effectiveness  and  efficiency  of  the  public  sector, 
enhance  the  responsiveness  of  public  agencies  to 
their  clients  and  customers,  reduce  public  expendi¬ 
ture,  and  improve  managerial  accountability. 

Definition  of  NPM 

One  primary  characteristic  of  NPM  involves  pub¬ 
lic  organizations  adopting  the  management  and 
organizational  forms  used  by  private  companies.  It 
challenges  two  traditional  doctrines  of  public 
administration:  that  public  sector  organizations 
are  “insulated”  from  the  private  sector  in  terms  of 
personnel,  structure,  and  business  methods  and 
that  they  operate  in  accordance  with  a  precise  set 
of  rules  limiting  the  freedom  of  public  officials  in 
handling  money,  staff,  contracts,  and  so  on.  In 
contrast,  the  NPM  movement  subscribes  to  the 
principle  that  the  formal  organization  of  the  public 
and  private  sectors  should  be  similar  and  that 
managers  in  public  sector  organizations  should 
have  discretion  and  leeway  in  their  daily  work,  so 
as  to  make  efficient  use  of  allocated  resources. 


Even  if  NPM  fundamentally  espouses  economic 
values  and  objectives,  as  a  concept,  it  is  loose  and 
multifaceted  and  encompasses  a  range  of  different 
administrative  doctrines.  It  offers  a  kind  of  “shop¬ 
ping  basket”  of  different  elements  for  reformers  of 
public  administration.  The  main  components  of 
NPM  are  hands-on  professional  management, 
which  allows  for  active,  visible,  and  discretionary 
control  of  an  organization  by  people  who  are  free 
to  manage;  explicit  standards  of  performance;  a 
greater  emphasis  on  output  control;  disaggregation 
of  units;  and  private  sector  management  techniques. 
It  also  includes  splitting  up  public  organizations 
through  horizontal  specialization  (single-purpose 
organizations)  and  vertical  specialization  (structural 
devolution);  increased  exposure  to  competition, 
contract  management,  and  market  orientation  (con¬ 
tracting  out,  purchaser-provider  models);  increased 
emphasis  on  service  orientation  and  user  participa¬ 
tion  (citizen  charters);  and  cost  cutting  and  budget¬ 
ary  discipline.  NPM  is  supposed  to  score  high  on 
managerial  autonomy  and  ex  ante  control  and  low 
on  policy  autonomy  and  ex  post  control. 

Ewan  Ferlie  distinguishes  among  four  different 
NPM  models:  the  efficiency  drive,  downsizing  and 
decentralization,  the  search  for  excellence,  and 
public  service  orientation.  NPM  promises  to  inte¬ 
grate  these  themes,  linking  efficiency  and  account¬ 
ability.  Other  distinctions  are  between  “hard  NPM 
tools,”  which  address  accounting,  auditing,  and 
performance  measurement,  and  “soft  NPM  tools,” 
which  include  things  such  as  human  factors,  user 
orientation,  quality  improvement,  and  individual 
development. 

NPM  in  a  Theoretical  Context 

NPM  reforms  are  not  based  on  one  theoretical 
foundation  but  are  a  collection  of  reform  elements 
sharing  several  common  characteristics.  When 
NPM  reforms  are  said  to  be  typically  theoretical, 
this  often  means  that  contractualism  and  economic 
theories  dominate.  Examples  are  public-choice 
theories,  principal-agent  models,  and  transaction- 
cost  models.  NPM,  with  its  economic  performance 
and  market  focus,  sees  other  values  and  consider¬ 
ations  embedded  in  the  civil  service  as  more  or  less 
unproblematic  and  not  generally  threatened  or 
negatively  affected  by  the  efficiency  and  economy 
focus  of  NPM. 
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The  second  set  of  ideas  comes  from  the  manage¬ 
rialist  school  of  thought,  which  focuses  on  the 
need  to  reestablish  the  primacy  of  managerial  prin¬ 
ciples  in  the  bureaucracy.  This  concentration  on 
enhancing  the  capacity  of  managers  to  take  action 
requires  attention  to  decentralization,  devolution, 
and  delegation. 

The  tensions  arising  from  the  hybrid  character 
of  NPM,  which  combines  economic  organization 
theory  and  management  theory,  are  well  known. 
These  tensions  result  from  the  contradiction 
between  the  centralizing  tendencies  inherent  in 
contractualism  and  the  devolutionary  tendencies 
of  managerialism.  NPM  is  something  of  a  hybrid, 
advocating  both  decentralization  (let  the  managers 
manage)  and  centralization  (make  the  managers 
manage).  NPM  thus  prescribes  both  more  auton¬ 
omy  and  more  central  control  at  the  same  time. 

Many  of  the  most  important  and  problematic 
reform  elements,  such  as  the  relationship  between 
public  managers  and  elected  officials,  reflect  the 
potential  tensions  in  the  way  these  reform  elements 
are  combined.  Through  devolution  and  contract¬ 
ing  out,  NPM  has  sought  to  separate  policy 
making  more  clearly  from  policy  administration  and 
implementation.  Policy  makers  make  policy 
and  then  delegate  its  implementation  to  managers 
and  hold  them  accountable  by  contract. 

Driving  Forces  Behind  NPM 

There  is  no  one-factor  explanation  for  the  emer¬ 
gence  of  NPM.  The  driving  forces  behind  such 
reforms  are  partly  ideological,  but  they  are  also 
marked  by  administrative  culture,  characteris¬ 
tics  of  the  political-administrative  system,  and 
the  fiscal  situation  of  the  country  in  question. 
The  institutional  dynamics  of  reforms  can  best 
be  interpreted  as  a  complex  mixture  of  environ¬ 
mental  pressure,  polity  features,  and  historical- 
institutional  context.  These  factors  define  how 
much  leeway  political  leaders  have  in  making 
choices  about  reforms — that  is,  they  both  fur¬ 
ther  and  hinder  NPM  reforms. 

One  school  of  thought  regards  NPM  primarily 
as  a  response  to  external  pressure.  This  environ¬ 
mental  determinism  can  be  of  two  kinds.  In  the 
first  instance,  a  country  may  adopt  internationally 
based  norms  and  beliefs  about  how  a  civil  service 
system  should  be  organized  and  run,  simply 


because  these  have  become  the  prevailing  doctrine. 
NPM  had  its  origins  in  certain  English-speaking 
countries  and  international  organizations,  like  the 
Organisation  for  Economic  Co-operation  and 
Development  (OECD),  where  a  kind  of  reform 
myth  took  hold,  became  ideologically  dominant, 
and  diffused  all  over  the  world.  This  diffusion  pro¬ 
cess  implied  isomorphic  elements,  creating  pres¬ 
sure  for  similar  reforms  and  structural  changes  in 
many  countries. 

In  the  second  instance,  NPM  may  be  seen  as 
the  optimal  solution  to  widespread  technical 
problems — that  is,  it  has  been  adopted  to  solve 
problems  created  by  a  lack  of  instrumental  perfor¬ 
mance  or  by  economic  competition  and  market 
pressure.  In  this  instance,  NPM  reforms  were 
adopted  not  because  of  their  ideological  hege¬ 
mony  but  because  of  their  technical  efficiency. 

Another  view  holds  that  reforms  are  primarily  a 
product  of  the  national  historical-institutional  con¬ 
text.  Different  countries  have  different  historical- 
cultural  traditions,  and  their  reforms  are  path 
dependent,  meaning  that  national  reforms  have 
unique  features.  The  reform  roads  taken  reflect  the 
main  features  of  national  institutional  processes, 
where  institutional  “roots”  determine  the  path  fol¬ 
lowed  in  a  gradual  adaptation  to  internal  and 
external  pressure.  This  view  stresses  institutional 
autonomy  and  internal  dynamics.  The  greater  the 
consistency  between  the  values  underlying  the 
reforms  and  the  values  on  which  the  existing 
administrative  system  is  based,  the  more  likely  that 
the  reforms  will  be  successful. 

A  third  view  emphasizes  that  different  countries 
have  different  constitutional  features  and  political- 
administrative  structures  and  that  these  factors  go 
some  way  in  explaining  how  they  handle  national 
problems  and  reform  processes.  The  main  features 
of  the  polity,  the  form  of  government,  and  the 
formal  structure  of  decision  making  within  the 
political-administrative  system  may  all  affect  a 
country’s  capacity  to  realize  administrative  reforms. 
From  a  structural  or  instrumental  point  of  view, 
the  reforms  may  generally  be  seen  as  conscious 
organizational  design.  This  perspective  is  based  on 
the  assumption  that  political  and  administrative 
leaders  use  the  structural  design  of  public  entities 
as  instruments  to  fulfill  public  goals.  The  major 
preconditions  for  this  are  that  the  leaders  have  a 
relatively  large  degree  of  control  over  change  or 
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reform  processes  and  that  they  score  high  on  ratio¬ 
nal  calculation  or  means-end  thinking. 

Thus,  external  reform  components  and  pro¬ 
grams  are  filtered,  interpreted,  and  modified  by  a 
combination  of  two  further  nationally  based  fea¬ 
tures.  One  feature  involves  the  national  political- 
administrative  history,  culture,  traditions,  and 
style  of  government.  The  other  involves  national 
polity  features,  as  expressed  in  constitutional  and 
structural  factors.  Within  these  constraints  politi¬ 
cal  and  managerial  executives  have  varying 
amounts  of  leeway  to  launch  NPM  reforms  via  an 
active  administrative  policy. 

NPM  Diffusion:  Convergence, 
Divergence,  or  Both? 

According  to  the  OECD,  NPM  represents  a  global 
change  of  paradigm  concerning  the  control  and 
organization  of  public  service.  This  convergence 
thesis  is,  however,  contested.  NPM  has  led  to  major 
changes  in  the  public  sector  in  many  countries. 
However,  the  process  of  reform  has  not  been  the 
same  everywhere.  In  some  countries  there  might  be 
a  strong  element  of  diffusion  of  NPM  ideas  from 
outside,  whereas  in  others  the  reform  process  might 
be  more  a  result  of  national  or  local  initiatives  that 
have  subsequently  acquired  an  NPM  label.  Thus, 
the  spread  of  NPM  is  a  complex  process,  going 
through  different  stages  and  packaged  in  different 
ways  in  different  countries,  with  each  country  fol¬ 
lowing  its  own  reform  trajectory  within  a  broader 
framework.  NPM  is  not  a  neat  package  of  reform 
elements  having  a  specific  starting  point  and  follow¬ 
ing  a  specific  path  toward  a  common  destination. 

NPM  ideas  have  been  implemented  to  different 
degrees,  at  different  paces,  and  with  differing 
emphases  on  the  various  elements  of  the  reform 
package  in  different  countries  and  sectors.  A  gen¬ 
eral  finding  is  that  the  degree  of  variation  between 
countries  and  also  between  policy  areas  increases 
when  we  move  away  from  the  world  of  ideas,  dis¬ 
cussion,  and  policy  programs  and  look  at  specific 
decisions,  and  this  increase  is  even  more  apparent 
when  we  consider  the  implementation  and  impact 
of  the  reforms.  It  is  debatable  whether  NPM  has 
led  to  the  convergence  of  administrative  systems  in 
different  countries,  but  there  is  much  to  suggest 
that  ideas  and  policy  programs  resemble  one 
another  more  than  the  corresponding  practices  do. 


Even  though  countries  to  some  extent  present 
their  reforms  in  similar  terms  and  support  some  of 
the  same  general  administrative  doctrines,  closer 
scrutiny  reveals  considerable  variation.  Christopher 
Pollitt  and  Geert  Bouckaert  distinguish  among 
four  groups  of  NPM  reformers:  the  maintainers, 
the  modernizers,  the  marketizers,  and  the  minimal 
state  category.  Countries  such  as  the  United  King¬ 
dom,  Australia,  and  New  Zealand  fit  the  mar- 
ketizer  profile.  Continental  Europeans  are  more 
skeptical  about  NPM,  except  for  the  Scandinavian 
countries,  which  are  somewhere  in  between. 

Having  begun  in  Britain,  the  reform  wave  has 
become  strongest  in  Anglo-American  countries.  In 
Westminster-style  parliamentarian  systems,  NPM 
reforms  fell  on  fertile  ground  and  were  therefore 
implemented  early  and  had  far-reaching  effects. 
This  was  due,  on  the  one  hand,  to  strong  external 
economic  and  institutional  pressure  and,  on  the 
other,  to  few  constitutional  and  administrative 
obstacles,  a  compatible  culture,  and  parliamentary 
conditions  that  favored  a  radical  strategy  and 
reform  entrepreneurs.  By  contrast,  the  Scandinavian 
countries  were  reluctant  to  implement  reforms. 
Environmental  pressure  was  weaker,  their 
Rechtsstaat  culture  and  strong  egalitarian  norms 
were  less  compatible  with  the  values  of  NPM, 
there  were  more  obvious  constitutional  obstacles, 
and  parliamentary  conditions  often  characterized 
by  minority  coalition  governments  made  a  radical 
reform  strategy  difficult  to  pursue. 

Thus,  there  is  no  consistent  movement  toward  a 
new  isomorphic  model  of  civil  service  systems. 
Most  governments  still  share  some  main  elements 
of  the  traditional  system  of  public  administration. 
However,  some  strong  common  trends  in  modern¬ 
izing  public  services  have  emerged  across  groups  of 
countries.  One  of  these  is  a  reduction  in  the  differ¬ 
ences  between  the  public  and  private  sectors. 
Nevertheless,  the  story  is  not  only  one  of  conver¬ 
gence,  meaning  that  all  countries  are  moving  in  the 
same  direction,  nor  is  it  a  story  only  of  divergence, 
whereby  all  countries  follow  their  own  trajectories 
constrained  by  their  specific  context,  legacy,  and 
tradition.  Instead,  what  we  are  seeing  is  a  complex 
mixture  of  robustness  and  flexibility  and  of  reform 
paths  that  can  hardly  be  explained  by  using  a 
single-perspective  approach. 

Variations  in  reform  practice  from  one  country 
to  another  are  the  rule  rather  than  the  exception. 
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Different  countries  and  governments  face  different 
contexts,  risks,  and  problems  and  start  out  with 
different  values  and  norms.  In  other  words,  they 
have  different  starting  points,  are  at  different 
stages  of  reform,  and  face  different  external  and 
internal  constraints.  What  we  might  see  is  two 
trajectories.  One  is  represented  by  civil  services 
that  have  been  modernized  within  state  traditions 
and  are  therefore  rather  closed  and  resistant  to 
external  pressure.  Here,  the  reform  process  is 
more  hesitant  and  does  not  involve  major  shifts. 
The  other  trajectory  is  represented  by  civil  services 
that  are  more  vulnerable  to  external  pressure  and 
more  open  to  NPM  concepts,  resulting  in  more 
radical  reforms.  The  first  instance  is  typical  of  the 
Scandinavian  welfare  state;  the  second  character¬ 
izes  Australia  and  New  Zealand. 

Effects  of  NPM  Reforms 

The  main  hypothesis  of  NPM  reforms  is  that 
increased  market  orientation  and  management 
focus  lead  to  increased  efficiency,  without  causing 
negative  side  effects  for  other  goals  and  concerns. 
But  this  hypothesis  is  far  from  having  been  con¬ 
firmed  as  evidence-based  knowledge.  It  might  be 
right  under  specific  conditions  but  not  as  a  general 
characteristic  of  NPM  reforms  everywhere  and  at 
all  times. 

Effects  are  often  assumed  or  promised,  but  there 
are  few  systematic  and  reliable  studies  of  the 
effects  of  NPM  reforms.  Hard  evidence  is  often 
lacking.  Often  strategies,  plans,  and  selective  suc¬ 
cess  stories  are  the  focus  of  attention  rather  than 
systematic  analyses  of  results.  Research  has  gener¬ 
ally  focused  more  on  answering  questions  about 
why  reform  happens  than  on  trying  to  reveal  the 
effects  of  initiatives. 

We  know  less  about  external  political  learning 
and  societal  effects  than  about  internal  administra¬ 
tive  effects  on  efficiency.  One  finding  is  that  verti¬ 
cal  and  horizontal  specialization  through  structural 
devolution  and  the  establishment  of  single-purpose 
organizations  have  led  to  fragmentation  in  public 
administration  and  thus  increased  coordination 
problems.  Another  finding  is  that  NPM  reforms  in 
some  cases  can  have  positive  effects  on  efficiency, 
but  the  efficiency  gains  also  vary  according  to  the 
tasks  and  services  in  question.  Competition  might 
bring  savings  on  costs,  more  efficient  production, 
and  more  flexible  and  user-friendly  services.  But 


there  might  also  be  negative  side  effects  in  the  form 
of  increased  social  problems. 

The  effects  on  efficiency  are  usually  less  than 
what  reform  advocates  have  predicted.  This  is 
partly  because  transaction  costs  as  well  as  admin¬ 
istration  and  operational  costs  of  the  new 
arrangements  may  not  have  been  taken  into  con¬ 
sideration.  NPM  reforms  have  led  to  increased 
efficiency  in  some  public  sector  organizations,  at 
least  in  the  short  term.  Responsiveness  to  users 
tends  to  be  improved  by  NPM  reforms,  but  there 
are  also  clear  indications  of  a  reduction  in  equal¬ 
ity.  It  appears  that  improved  efficiency  and 
responsiveness  have  been  achieved  at  the  expense 
of  equal  treatment.  On  the  other  hand,  NPM 
reforms  have  had  a  positive  effect  on  the  freedom 
of  choice  of  services. 

A  fundamental  dilemma  for  many  NPM  reforms 
is  the  tension  between  autonomy  and  control. 
Organizations  should  have  enough  freedom  to  be 
run  in  an  efficient  way,  yet  not  be  so  free  that  supe¬ 
rior  levels  of  leadership  lose  power  and  control.  The 
aim  is  to  achieve  more  freedom  and  greater  control 
simultaneously.  Whether  or  not  NPM  strengthens 
political  control  and  leads  to  better  political  steer¬ 
ing  is  an  open  question.  Some  claim  that  NPM 
reforms  will  reduce  political  steering  and  control 
and  transfer  power  and  influence  to  state-owned 
companies  and  autonomous  agencies,  and  there  is 
some  evidence  for  that.  But  there  are  also  studies 
indicating  that  NPM  tools  like  performance  man¬ 
agement  have  improved  top-level  executive  control. 
The  new  organizational  forms  have  led  to  changes 
in  how  public  organizations  are  controlled.  The 
traditional,  ex  ante ,  informal,  internal,  collegial, 
and  trust-based  forms  of  control  are  waning,  and 
the  more  ex  post,  formal,  external,  professional, 
and  distrust-based  forms  of  control  are  waxing. 

One  conclusion  to  be  drawn  is  that  the  design  of 
various  NPM  reforms  may  vary  considerably  across 
countries,  tasks,  sectors,  and  administrative  levels 
and  will  have  consequences  for  effects  studies.  The 
implication  is  that  discussions  of  the  effects  of 
reform  must  strive  for  exceedingly  precise  terminol¬ 
ogy  and  must  not  be  conducted  at  a  general  level. 

In  sum,  it  is  hard  to  say  unequivocally  what  the 
effects  of  NPM  reforms  are.  NPM’s  results  are  dis¬ 
puted  and  uncertain.  The  paradox  as  stated  by 
Pollitt  and  Bouckaert  is  that  these  kinds  of  reforms 
do  not  seem  to  need  results  to  fuel  their  onward 
march. 
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Future  Development  and  Application 

There  is  a  need  to  look  beyond  NPM — what  has 
been  termed  transcending  new  public  management. 
The  central  question  being  asked  is  whether  NPM 
is  finished.  While  some,  citing  the  crises  that  NPM 
has  experienced,  proclaim  that  it  is  indeed  dead, 
others  tend  to  support  the  view  that  this  is  by  no 
means  so.  It  has,  however,  been  challenged:  New 
types  of  reforms  have  been  added  to  those  already 
in  place,  and  there  have  been  some  reversals,  espe¬ 
cially  when  it  comes  to  the  disaggregation  compo¬ 
nents  of  NPM.  The  new  reforms  tend,  however, 
not  to  replace  the  old.  They  represent  readjust¬ 
ments,  modifications,  and  supplements  rather  than 
radical  change.  Priorities  have  shifted  from  a  drive 
to  create  agencies  and  autonomous  bodies  that 
enhance  efficiency  to  a  quest  to  find  the  right  bal¬ 
ance  between  accountability  and  autonomy  by 
focusing  on  weak  coordination  devices,  lack  of 
governing  capacity,  and  weak  accountability 
mechanisms. 

The  reforms  that  were  undertaken  under  the 
NPM  label  paved  the  way  for  further  reforms  and 
transformations  in  the  post-NPM  era.  Market 
solutions  and  market  ideology  now  seem  to  have 
become  more  or  less  institutionalized  within  the 
public  sector,  albeit  without  erasing  major  Weber¬ 
ian  features  of  the  Old  System,  and  a  certain 
amount  of  reregulation  has  taken  place  in  recent 
years.  The  trend  toward  single-purpose  organiza¬ 
tions  is  another  feature  of  NPM  reforms  that 
recent  reforms  have  modified  by  introducing  more 
coordination  and  collaboration  across  and  within 
political-administrative  systems.  A  third  element  of 
NPM  was  structural  devolution,  which  resulted  in 
the  autonomization  and  agencification  of  public 
sector  organizations.  However,  in  recent  years  this 
has  been  countered  by  a  reassertion  of  the  center 
and  a  strengthening  of  central  state  capacity. 

A  main  finding  in  the  research  in  this  area  is  that 
administrative  reforms  have  not  taken  place  along  a 
single  dimension.  In  practice,  we  face  mixed  models 
and  increased  complexity.  It  is  fair  to  say  that  NPM 
is  still  very  much  alive  in  many  countries  and  that 
NPM  reforms  have  normally  not  been  replaced  by 
new  reforms  but  rather  revised  or  supplemented  by 
post-NPM  reforms.  The  pace  and  comprehensive¬ 
ness  of  these  trends  has  varied  significantly  from 
one  country  to  another  and  from  one  policy  area  to 
another,  and  reform  activities  embrace  a  wide  spec¬ 
trum.  Even  though  NPM  in  certain  ways  has  been  a 


success,  it  is  too  early  to  conclude  that  the  old  pub¬ 
lic  administration  model  is  unsustainable.  It  has 
considerable  capacity  to  adapt  and  is  both  robust 
and  flexible,  even  after  a  long  period  of  NPM 
reforms  and  emerging  post-NPM  reforms. 

Typical  for  the  NPM  reforms  was  that  the 
formal  structural  system  changed  from  an  inte¬ 
grated  to  a  fragmented  one.  The  formal  levers  of 
steering  were  weakened,  the  distance  to  the  agen¬ 
cies  grew,  political  signals  became  weaker,  and 
horizontal  specialization  increased  according  to 
different  principles.  The  second  generation  of 
reforms  uses  formal  structures  to  regain  control  or 
modify  the  loss  of  political  influence  by  making 
them  more  centralized,  complex,  and  varied. 

Formal  structural  instruments  have  been  used  to 
modify  not  only  devolution  and  vertical  specializa¬ 
tion  but  also  horizontal  fragmentation  and  special¬ 
ization,  especially  in  Australia  and  New  Zealand. 
Vertical  control  and  levers  of  control  are  increas¬ 
ingly  being  applied,  while  a  “whole-of-government” 
approach  uses  new  coordination  instruments  and 
cross-sector  programs  and  projects  to  modify  hori¬ 
zontal  fragmentation. 

The  spread  and  diffusion  of  agency  or  regula¬ 
tory  agency  forms  across  countries  in  the  first  wave 
of  reforms  may  be  seen  as  an  institutional  standard, 
script,  or  prescription  with  symbolic  value.  This 
applies  just  as  much  to  the  second  generation  of 
modern  reforms,  where  the  fashion  is  now  to  have 
“whole-of-government”  or  “joined-up  govern¬ 
ment”  models  that  foster  more  coordination  and 
control.  The  question  is  whether  these  trends  are 
more  about  symbols  than  about  reality  and  whether 
they  really  constitute  a  clear  break  with  NPM. 

What  kind  of  further  trends  might  we  expect  in 
countries  that  have  implemented  elements  of 
NPM?  A  first  scenario  is  the  idea  of  a  linear  pro¬ 
cess  toward  more  market,  management,  and  effi¬ 
ciency.  A  second  scenario  is  a  cyclical  development 
implying  that,  after  a  period  of  NPM,  there  will  be 
a  reaction  to  the  norms  and  values  that  the  reform 
is  built  on  leading  to  a  return  to  some  of  the  main 
features  of  “old  public  management”  and  a  redis¬ 
covery  of  the  Weberian  bureaucracy. 

A  third  scenario  is  a  more  dialectical  process 
under  which  public  services  become  more  complex. 
Institutional  change  is  a  sedimentation  process  in 
which  new  administrative  reforms  do  not  replace  old 
forms  but  supplement  them.  There  is  a  layering  and 
mixture  of  old  public  administration,  NPM,  and 
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post-NPM  reforms  producing  hybrid  organizational 
forms.  The  changes  we  have  seen  in  recent  years  tend 
to  come  closer  to  the  third  scenario  than  to  the  first. 

Per  Lcegreid 
University  of  Bergen 
Bergen,  Norway 
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Nongovernmental 
Organizations  (NGOs) 

The  origin  of  the  term  nongovernmental  organiza¬ 
tion  lies  in  the  need  for  those  drafting  the  United 


Nations  (UN)  Charter  in  1945  to  make  a  distinc¬ 
tion  between  the  procedures  for  the  UN’s  relations 
with  other  intergovernmental  organizations  linked 
to  the  UN  as  specialized  agencies  and  its  relations 
with  international  organizations  that  had  not  been 
established  by  intergovernmental  agreement. 
Under  Article  70,  the  Economic  and  Social  Council 
of  the  UN  (ECOSOC)  was  to  make  arrangements 
for  representatives  of  the  agencies  “to  participate, 
without  vote,  in  its  deliberations,”  whereas  under 
Article  71  NGOs  would  have  a  secondary  status: 

The  Economic  and  Social  Council  may  make 
suitable  arrangements  for  consultation  with  non¬ 
governmental  organizations  which  are  concerned 
with  matters  within  its  competence.  Such 
arrangements  may  be  made  with  international 
organizations  and,  where  appropriate,  with 
national  organizations  after  consultation  with 
the  Member  of  the  United  Nations  concerned. 
(Article  71  of  the  UN  Charter) 

Gradually  the  term  nongovernmental  organiza¬ 
tion  passed  from  the  diplomatic  world  into  general 
usage.  However,  its  meaning  at  the  UN  and  its 
meaning  in  popular  discourse  are  not  the  same. 

At  the  UN,  an  NGO  is  any  organization  that 
has  been  granted  consultative  status  by  ECOSOC 
or,  by  extension,  any  organization  that  may  be 
eligible  for  consultative  status.  In  the  early  years  of 
the  UN,  various  criteria  were  adopted  by  the  coun¬ 
cil  to  establish  which  organizations  would  be 
accredited  and  what  participation  rights  they 
would  have.  The  initial  decisions  were  reviewed 
and  consolidated  into  an  NGO  Statute  by  ECOSOC 
Resolution  288  B  (X)  of  February  27,  1950.  It  was 
revised  by  Resolution  1296  (XLIV)  of  May  23, 
1968,  and  again  by  Resolution  1996/31  of  July  25, 
1996.  Most  of  the  text  of  the  current  statute  has 
remained  unchanged  since  the  first  resolutions  in 
1946.  The  implied  assumption  in  1945  was  that 
ECOSOC  would  accredit  a  small  number  of  large, 
well-established  global  organizations.  In  particu¬ 
lar,  for  50  years,  very  few  exceptions  were  made  to 
the  rule  than  only  international  NGOs  rather  than 
national  NGOs  should  be  accepted.  Despite  this 
restriction,  it  was  soon  apparent  that  many  more 
NGOs  than  were  initially  expected  would  wish  to 
participate  in  the  UN’s  work.  The  numbers 
increased  steadily  from  197  in  1950,  to  334  in 
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1960,  and  419  in  1970.  They  then  increased  more 
substantially  in  the  1970s,  reaching  608  in  1980 
and  893  in  1990,  and  increased  even  more  after 
1990  to  1,995  in  2000  and  3,413  in  2010.  As 
practice  has  evolved,  it  has  become  apparent  that 
the  official  criteria  are  not  applied  and  conversely 
some  criteria  not  in  the  statute  are  consistently 
applied.  Thus,  with  very  few  exceptions,  any  pri¬ 
vate  organization  will  gain  consultative  status 
provided  that  it  is  not  a  commercial  company,  it 
does  not  use  or  advocate  the  use  of  violence,  and  it 
does  not  violate  the  diplomatic  norm  of  noninter¬ 
ference  in  the  internal  affairs  of  individual  coun¬ 
tries.  This  norm  finds  one  expression  in  not  accept¬ 
ing  a  political  party  for  accreditation  as  an  NGO. 
The  restrictions  on  companies  and  political  parties 
do  not  prevent  non-profit-making  international 
associations  representing  commercial  interests  or 
international  federations  of  political  parties  from 
being  accepted. 

The  norm  of  noninterference  was,  until  the 
1970s,  a  severe  restriction  on  the  discussion  of 
human  rights  in  all  bodies  of  the  UN.  A  second 
expression  of  this  norm  was  added  to  the  NGO 
statute  in  1968:  Human  rights  NGOs  “should 
have  a  general  international  concern  with  this  mat¬ 
ter,  not  restricted  to  the  interests  of  a  particular 
group  of  persons,  a  single  nationality  or  the  situa¬ 
tion  in  a  single  State  or  restricted  group  of  States” 
(ECOSOC  Resolution  1296,  para.  17).  In  1996, 
this  text  was  replaced  by  a  more  vague,  general 
assertion  that  they  should  act  “in  accordance  with 
the  spirit  of  the  Charter  of  the  United  Nations,  the 
Universal  Declaration  of  Human  Rights  and  the 
Vienna  Declaration  and  Programme  of  Action.” 
Despite  this  change,  the  1968  principles  are  still 
invoked  when  any  government  attempts  to  block 
the  accreditation  of  an  NGO  that  is  critical  of  their 
human  rights  record.  As  a  result,  the  Indian  gov¬ 
ernment  has  been  able  to  block  the  World  Sikh 
Congress,  and  the  Chinese  government  has  blocked 
any  NGO  that  focuses  on  Tibet.  Nevertheless,  such 
governments  cannot  prevent  criticism  in  the  UN 
from  the  large  global  human  rights  NGOs,  who 
have  consultative  status.  The  accreditation  of 
human  rights  NGOs  has  often  been  a  matter  of 
great  controversy;  even  when  they  are  initially 
rejected,  however,  they  do  eventually  win  a  vote  in 
their  favor  in  ECOSOC,  provided  they  conform  to 
the  statute,  including  the  discarded  text  from  1968. 


In  all  versions  of  the  statute,  there  have  been  two 
vague  general  clauses  specifying  that  an  NGO  must 
be  concerned  with  questions  within  the  competence 
of  ECOSOC,  and  its  aims  “shall  be  in  conformity 
with  the  spirit,  purposes  and  principles”  of  the  UN 
Charter.  This  might  have  been  a  basis  for  a  process 
of  prohibiting  access  to  certain  types  of  NGOs  on 
specific  political  grounds.  The  only  field  in  which 
such  censorship  has  been  applied  is  in  the  field  of 
human  sexuality.  The  International  Planned 
Parenthood  Federation  was  not  able  to  gain  access 
to  the  UN  system  until  the  mid-1960s,  a  decade 
after  it  was  founded.  Human  Life  International,  an 
antiabortion  organization,  was  rejected  on  the 
technical  grounds  that  it  was  hostile  to  UNICEF, 
but  a  few  antiabortion  NGOs  have  now  been 
accepted.  Most  dramatically  of  all,  a  special  session 
of  ECOSOC  was  convened  in  1994  solely  for  the 
purpose  of  withdrawing  consultative  status  from 
the  International  Lesbian  and  Gay  Association, 
when  it  became  public  that  one  of  their  constituent 
organizations  promoted  pedophilia. 

Since  1945,  there  has  been  a  slow  but  consistent 
expansion  in  the  role  of  NGOs  in  policy  making 
within  the  UN  system,  to  the  point  where  the  out¬ 
comes  in  all  areas  of  multilateral  diplomacy, 
including  security  questions,  can  no  longer  be 
explained  without  analyzing  the  influence  exer¬ 
cised  by  NGOs.  They  have  also  greatly  increased 
their  status  in  international  law,  to  the  point  where 
it  can  be  argued  that  those  with  ECOSOC  consul¬ 
tative  status  have  international  legal  personality. 
Their  most  significant  impact  has  been  in  the  cre¬ 
ation  of  the  Internet.  NGOs  made  two  essential 
contributions  to  the  conversion  of  communications 
technology  from  disparate  unconnected  private 
networks  to  the  contemporary  Internet  as  a  global 
public  communications  system.  In  the  1980s,  they 
established  the  first  Internet  service  providers  and 
linked  them  together  to  provide  global  coverage. 
Also,  they  were  the  first  to  create  gateways  to  link 
all  the  diverse  networks  to  each  other. 

In  popular  discourse,  an  NGO  is  often  pre¬ 
sumed  to  be  a  voluntary  organization  acting  in  the 
public  interest.  Sometimes,  distinctions  are  made 
between  operational  NGOs,  which  raise  money 
and  spend  it  on  projects  to  assist  the  needy  and  the 
vulnerable,  and  campaigning  NGOs,  which  seek  to 
articulate  the  concerns  of  disempowered  and  mar¬ 
ginalized  people.  Such  an  approach  brings  to  mind 
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NGOs  concerned  with  development,  humanitarian 
relief,  women’s  rights,  human  rights,  peace,  and 
environmental  questions.  However,  it  is  confusing 
and  misleading  to  conceptualize  NGOs  in  such  a 
restricted  manner.  All  these  fields  of  global  politics 
are  also  influenced  by  other  NGOs  that  would  not 
be  primarily  identified  either  as  operational  or 
campaigning  NGOs  in  such  fields.  At  the  UN,  the 
3,000-plus  organizations  that  have  consultative 
status  as  NGOs  include  religious  bodies,  trade 
unions,  business  groups,  scientific  and  technical 
bodies,  professional  associations,  youth  organiza¬ 
tions,  and  parliamentarians.  Many  of  these  also 
make  their  own  specialist  contributions  on  global 
issues.  Some,  such  as  religious  bodies  (also  calling 
themselves  faith  communities)  and  trade  unions, 
actively  object  to  being  called  NGOs.  Nevertheless, 
within  the  UN,  all  legitimate  nonstate  actors  gain 
access  through  becoming  accredited  as  NGOs. 

In  the  domestic  politics  of  individual  countries, 
NGOs  are  more  commonly  referred  to  as  interest 
groups,  pressure  groups,  lobbies,  or  private  volun¬ 
tary  organizations.  However,  as  noted  above,  the 
term  NGO  may  still  be  used  to  refer  to  a  more 
limited  range  of  public  interest  groups  and/or 
groups  that  engage  in  transnational  activities. 
NGOs  vary  greatly  in  the  geographical  scope  of 
their  structure  and  their  activities.  Some  may  be 
highly  local,  community  groups  in  which  all  those 
involved  know  each  other  personally.  Others  are 
organized  in  individual  towns  or  cities,  within  a 
larger  area,  or  over  a  whole  country,  while  some 
cover  a  continent  or  have  a  presence  throughout 
the  world.  The  scope  of  their  structure  does  not 
necessarily  relate  to  the  scope  of  their  activities. 
Local  groups  may  be  completely  parochial,  or  they 
may  concern  themselves  with  issues  affecting  the 
other  side  of  the  world — for  example,  when  a 
small  community  decides  to  raise  money  for  the 
victims  of  an  earthquake.  National  NGOs  often 
have  a  minimal  organizational  structure  outside 
their  home  country  but  engage  in  extensive  trans¬ 
national  cooperation  with  other  NGOs  around  the 
world.  One  of  the  most  important  features  of 
NGOs  is  that  many  of  them  break  down  the  dis¬ 
tinction  between  domestic  and  international  poli¬ 
tics  by  simultaneously  operating  at  both  levels. 
Initially,  local  and  national  NGOs  could  influence 
global  politics  only  through  the  membership  of 
international  NGOs.  With  the  expansion  of  global 


communications,  in  particular  the  development  of 
the  World  Wide  Web  since  1993,  any  NGO  can 
now  easily  have  a  transnational  impact.  In  addi¬ 
tion,  the  only  major  amendment  to  the  UN’s  NGO 
statute  was  the  decision  in  1996  to  allow  accredi¬ 
tation  to  national  NGOs.  The  prime  reason  for 
this  decision  was  to  encourage  participation  by 
NGOs  from  developing  countries. 

There  is  great  variety  in  the  structures  of  NGOs. 
Some  decide  policy  through  democratic  processes 
open  to  all  their  members,  but  others  rely  on  sup¬ 
porters  who  merely  have  an  indirect  influence  on 
policy,  by  increasing  or  reducing  their  level  of  sup¬ 
port;  some  are  formed  by  individual  people,  but 
some  consist  of  separate  independent  NGOs  who 
form  an  umbrella  organization  to  cooperate;  some 
relate  to  members  and/or  supporters  directly 
through  a  centralized  structure,  but  others  have 
complex  hierarchies  or  a  federal  structure.  NGOs 
also  have  a  diverse  range  of  political  roles.  Some 
are  altruistic,  but  others  represent  the  interests  of 
their  members;  some  have  charitable  status  and 
obtain  tax  concessions  from  governments,  but  oth¬ 
ers  are  not  eligible;  some  represent  a  very  small 
number  of  people,  but  others  have  an  active  mem¬ 
bership  measured  in  millions;  some  are  highly 
specialized,  but  others  are  concerned  with  issues 
that  affect  a  wide  range  of  social,  political,  and 
economic  questions;  some  have  no  public  profile, 
but  others  obtain  regular  media  coverage;  some 
have  no  desire  to  engage  in  politics,  but  others  are 
campaigning  organizations;  some  are  insiders  in 
government  policy  making,  but  others  are  outsid¬ 
ers,  concentrating  on  mass  mobilization;  some  are 
progressive  or  Left  wing,  some  identify  themselves 
as  being  apolitical,  and  some  are  Right  wing.  It  is 
impossible  for  any  individual  to  support  all  NGOs, 
except  by  arbitrarily  defining  some  as  not  being 
true  NGOs.  There  is  no  such  thing  as  a  typical 
NGO. 

Peter  Willetts 
City  University,  London 
London,  United  Kingdom 
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In  political  science,  as  in  many  other  disciplines, 
linear  regression  is  the  workhorse  tool  for  statisti¬ 
cal  analysis.  It  is  easy  to  interpret,  and  under 
many  conditions  the  estimates  it  provides  are 
unbiased  and  efficient.  Unfortunately,  many  of  the 
theories  in  the  social  sciences  imply  a  nonlinear 
relationship  between  variables.  In  those  cases,  it  is 
inappropriate  to  use  a  classic  linear  regression 
model  because,  at  a  minimum,  one  of  the  assump¬ 
tions  of  the  model  would  be  violated. 

This  entry  discusses  two  types  of  linear  models: 
first,  those  that  through  a  relatively  simple  process 
can  be  transformed  in  a  way  that  allows  running 
of  a  classical  linear  regression  model;  second,  those 
that  are  essentially  nonlinear,  for  which  a  transfor¬ 
mation  is  not  possible.  In  that  situation,  nonlinear 
least  squares  estimation  is  necessary. 

The  linear  regression  model  specifies  a  linear 
relationship  between  a  response — or  dependent — 
variable  and  an  explanatory — or  independent — 
variable.  It  is  assumed  that  a  vector  of  response  vari¬ 
ables,  y,  can  be  approximated  by  a  linear  function 
of  the  vector  of  explanatory  variables  X,: 

y  =  Xp  +  e,  (1) 

which  can  be  expressed  more  generally  as 

y  =  F(X)  +  8.  (2) 

While  in  the  linear  model  F  is  a  linear  function 
and  the  error  is  additive,  in  nonlinear  models  F  can 


take  other  functional  forms  such  as  exponential, 
logistic,  or  other,  more  complicated  forms,  though 
the  setup  still  assumes  that  the  error  is  additive. 

It  is  perhaps  easier  to  understand  the  difference 
between  linear  and  nonlinear  models  if  we  think  of 
a  regression  model,  in  scalar  form,  with  only  one 
explanatory  variable  and  no  disturbance: 

y  =  a  +  px.  (3) 

The  marginal  effect  of  the  explanatory  variable, 
x,  on  the  response  variable,  y,  or,  in  other  words, 
the  effect  of  a  one-unit  increase  of  x  in  y,  can  be 
estimated  by  taking  the  partial  derivative  with 
respect  to  x: 


In  the  classical  linear  model,  when  x  increases 
by  one  unit,  the  effect  on  y  is  always  a  constant,  p, 
regardless  of  the  current  level  of  x.  In  contrast,  in 
a  nonlinear  model,  the  marginal  impact  of  the 
explanatory  variable,  x,  on  the  response  variable, 
y,  is  dependent  on  the  level  of  x.  In  other  words,  p 
is  not  necessarily  a  constant,  but  instead  dy/dx  is  a 
function  of  x. 

A  more  complicated  model  has  y  as  a  nonadditive 
function  of  the  independent  variables  and  the 
disturbances,  that  is, 

y  =  E(X,e).  (5) 

While  in  general  this  can  cause  problems,  there  are 
some  cases  where  this  can  be  transformed  away. 
For  example,  if  we  assume  that  the  response  func¬ 
tion  that  describes  y  is, 

y  =  exp(a  +  Px  +  s),  (6) 

we  can  linearize  the  model  by  first  taking  the  loga¬ 
rithms  obtaining  a  linear  response  function  and 
rewrite  the  equation  such  that 

ln^j-^ — ^  =  a  +  px  +  s.  (7) 

The  response  variable  is  now  In  (y/(  1  —  y),  a 
quantity  known  as  the  logit,  and  the  response 
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function  is  linear.  Note  here  that  y  is  continuous; 
if  it  is  binary,  the  standard  logit  model  is  not  the 
same.  The  procedure  is  similar  for  other  transfor¬ 
mations  that  fall  in  the  category  of  generalized 
linear  models. 

However,  there  are  some  models  for  which 
slightly  more  complicated  procedures  are  neces¬ 
sary  to  make  the  linear  model  appropriate.  One  of 
the  most  common  is  the  Box-Cox  transformation, 
where  transformations  are  indexed  by  X,  an 
unknown  parameter,  and  all  the  parameters  in  the 
model  are  estimated  by  standard  methods  of  infer¬ 
ence.  The  transformation  is 


when  X  A  0  and 

yx  =  ln(y)  (9) 

when  X  =  0.  It  is  assumed  that  for  each  X,  yxis  a 
monotonic  function  of  y  over  the  admissible 
range  and  that  for  some  unknown  X,  the  trans¬ 
formed  observations,  yx ,  satisfy  the  linear  model 
assumptions. 

The  Box-Cox  transformation  can  also  be  used 
to  discriminate  among  alternative  functional  forms 
(log  linear,  quadratic,  etc.).  The  process  works  by 
estimating  the  values  of  the  transformation 
parameters.  If  X  is  one,  a  linear  model  is  appropriate. 
If  it  is  near  zero,  a  logarithmic  model  is  appropriate. 
And  if  it  is  near  two,  a  quadratic  model  is 
appropriate.  Thus,  analysts  no  longer  have  to 
prespecify  the  nature  of  the  nonlinear  transformation 
since  the  Box-Cox  model  can  be  estimated  using 
nonlinear  squares. 

Unfortunately,  some  of  our  theories  lead  us  to 
models  that  cannot  be  made  linear  in  parameters. 
In  this  case,  it  is  appropriate  to  use  nonlinear  least 
squares  (NLS)  regression.  With  the  exception  of 
linearity,  all  the  other  assumptions  of  ordinary 
least  squares  (OLS)  regression  apply.  Conceptually, 
the  process  is  the  same:  to  find  the  values  of 
parameters  that  minimize  the  sum  of  squared 
residuals.  Numerically,  the  process  is  much  more 
intensive.  If  we  assume  the  same  model  notation 
from  Equation  1,  then 

(10) 


We  estimate  (3  by  finding  the  (3  values  that  mini¬ 
mize  the  sum-of-squares  residuals: 

S  =  £(y-X|3).  (11) 

For  any  estimate  of  p,  Equation  2  allows  the 
calculation  of  a  residual,  and  we  can  minimize  the 
sum  of  squared  residuals.  While  the  maximization 
process  is  more  complicated  than  for  linear  regres¬ 
sion,  the  underlying  process  is  similar,  and  the 
estimated  coefficients  have  similar  properties.  In 
particular,  they  are  consistent  and  asymptotically 
normal,  so  the  usual  testing  procedures  apply.  This 
process  is  a  part  of  most  statistical  packages  used 
by  social  scientists. 

There  are  some  caveats  about  NLS.  First,  it 
is  possible  to  get  similarly  good  fits  with  very 
different-looking  functions.  Also,  different  models 
will  produce  different  predictions  when  extrapo¬ 
lating,  and  multiple  minima  are  possible. 

As  a  general  conclusion,  for  real-world  data  it  is 
hard  to  believe  that  linear  specifications  can  char¬ 
acterize  the  relationships  between  all  variables  in 
political  science  studies.  However,  through  the 
transformation  of  variables — either  to  a  general¬ 
ized  linear  model  or  a  Box-Cox  transformation — 
or  by  using  modified  modeling  strategies  such  as 
nonlinear  least  squares  regression,  it  is  possible  to 
test  many  of  the  theories  that  do  not  easily  fit  a 
linear  model. 

Mariana  Medina 
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Nonparametric  Methods 


Nonparametric  methods  are  a  class  of  statistical 
techniques  that  use  minimal  assumptions  for  both 
testing  and  estimation.  Here,  standard  statistical 
assumptions  are  often  replaced  with  computation¬ 
ally  intensive  calculations.  Nonparametric  meth¬ 
ods  provide  a  valuable  alternative  to  classical 
parametric  techniques.  They  are  often  called  weak 
assumption  statistics  because  the  assumptions 
required  for  validity  are  quite  general  compared 
with  classical  parametric  techniques.  In  many 
cases,  even  the  weak  assumptions  made  can  be 
further  relaxed.  As  such,  conclusions  based  on 
nonparametric  techniques  need  not  be  tempered 
by  qualifying  statements  about  the  underlying 
assumptions.  Other  advantages  are  that  they 
(a)  are  often  easy  to  understand  and  apply,  (b)  are 
especially  good  for  small  samples,  (c)  are  fre¬ 
quently  appropriate  for  discrete  data,  and  (d)  may 
be  robust  with  incomplete  or  imprecise  data. 
Nonparametric  techniques  date  back  to  1710 
when  John  Arbuthnott  introduced  the  sign  test. 
More  widespread  development  of  nonparametric 
techniques  did  not  occur  until  the  1940s  when 
Frank  Wilcoxon  developed  rank-based  tests. 
Nonparametric  statistical  techniques  come  in 
many  different  forms.  Below,  three  different  types 
of  nonparametric  techniques  are  highlighted.  The 
first  is  the  bootstrap,  a  method  for  replacing  dis¬ 
tributional  assumptions  in  statistical  tests.  The 
second  is  an  example  of  a  classical  nonparametric 
test.  The  final  technique  is  that  of  nonparametric 
regression. 

Bootstrap 

The  bootstrap  relies  on  resampling.  While  it  has 
many  applications,  it  is  often  used  when  errors  in  a 
regression  model  are  nonnormal.  Assume  that  one 
estimates  the  following  simple  regression  model: 

y  —  a  +  (3%  +  e. 

In  this  model,  we  assume  that  the  errors,  8,  follow 
a  normal  distribution.  We  can  relax  this  assump¬ 
tion  using  the  bootstrap.  The  bootstrap  simulates 
the  sampling  distribution  for  3  through  sampling 
from  the  original  data  with  replacement.  In  short, 
we  treat  the  sample  as  the  population  and  then 


sample  from  it.  For  example,  assume  that  the 
sample  size  for  this  regression  model  is  100  cases. 
For  the  bootstrap,  we  would  take  a  random  sam¬ 
ple  of  size  100  with  replacement  from  the  original 
data.  Since  we  sample  with  replacement,  some 
data  points  will  appear  in  this  sample  more  than 
once.  We  then  reestimate  the  original  model  using 
this  new  sample  and  save  the  new  estimate  of  (3. 
We  repeat  this  process  a  large  number  of  times. 
Typically,  we  would  resample  and  estimate  the 
model  from  1,000  to  5,000  times.  This  results  in 
1,000  to  5,000  estimates  of  (3.  This  new  set  of  p 
estimates  serves  as  an  empirical  sampling  distribu¬ 
tion  for  this  parameter.  Percentiles  of  this  distribu¬ 
tion  can  serve  as  confidence  intervals  for  the 
parameter.  With  minor  adjustments,  one  can  also 
calculate  a  p  value  to  test  hypotheses  about  (3.  The 
bootstrap  is  nonparametric  since  we  use  an  empir¬ 
ical  estimate  for  the  sampling  distribution  instead 
of  assuming  that  the  test  statistic  is  from  a  t  distri¬ 
bution.  One  strength  of  the  bootstrap  stems  from 
the  fact  that  this  basic  algorithm  can  be  used  to 
provide  inferences  for  a  wide  class  of  statistics. 

Classical  Nonparametric  Test 

Another  common  nonparametric  technique  is  the 
rank-sum  test,  which  is  an  alternative  to  the  classi¬ 
cal  two-sample  t  test  of  location.  Consider  an 
experiment  where  we  select  seven  students  to  play 
a  divide-the-dollar  game  with  a  computer  program. 
Three  of  the  students  are  randomly  chosen  to 
receive  the  treatment,  a  prime  expected  to  make 
them  more  altruistic.  If  the  treatment  is  effective, 
we  would  expect  that  the  students  who  receive  it 
would  give  away  more  of  their  money.  If  the  treat¬ 
ment  is  without  effect,  we  would  expect  no  such 
difference  across  the  two  sets  of  students.  Relying 
on  the  classical  parametric  techniques,  we  might 
translate  these  expectations  into  a  t  test,  comparing 
the  mean  difference  we  observe  with  a  null  of  no 
mean  difference.  We  would  then  calculate  a  p  value 
using  the  critical  value  from  a  t  distribution  with 
five  degrees  of  freedom.  For  this  inference,  we  must 
assume  that  the  test  statistic  follows  a  parametric  t 
distribution — a  distribution  that  can  be  character¬ 
ized  with  parameters  for  the  mean,  variance,  and 
other  higher  moments.  As  an  alternative,  we  can 
use  the  nonparametric  rank-sum  test  to  derive  the 
test  from  the  randomization  of  treatment  assign¬ 
ment  and  avoid  the  parametric  assumption. 
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Our  expectation,  again,  is  that  treated  students 
should  give  away  more  than  nontreated  students. 
The  best  evidence  that  the  prime  increases  altruis¬ 
tic  behavior  would  be  obtained  if  the  three  stu¬ 
dents  who  received  the  treatment  rank  1,  2,  and  3 
in  terms  of  the  amount  given  away.  So  we  might 
ask:  What  is  the  probability  that  the  three  students 
in  the  treatment  group  would  happen  to  rank  1,  2, 
and  3  in  terms  of  the  amount  given  away  if  the  null 
hypothesis  were  true?  To  develop  a  probability 
statement  about  this  hypothesis  that  hinges  on  the 
composition  of  the  treatment  and  control  groups — 
the  feature  that  was  assigned  by  a  random 
process — we  turn  to  basic  combinatorics.  Knowing 
that  the  number  of  ways  of  selecting  r  objects  from 
a  set  of  n  is  n\/[r\(n  —  r)!J  tells  us  that  there  are 
35  ways  to  select  three  students  from  a  set  of 
seven.  If  the  treatment  had  no  effect,  and  simple 
chance  were  the  only  factor  governing  which  stu¬ 
dents  were  in  the  control  group  and  which  were  in 
the  treatment  group,  then  each  of  these  35  combi¬ 
nations  would  be  equally  likely.  Since  only  one  of 
the  35  outcomes  is  congruent  with  ranks  of  1,  2, 
and  3  in  the  treatment  group,  this  tells  us  that  there 
is  a  1/35  or  approximately  a  0.0286  chance  that 
we  would  observe  the  best  evidence  case  if  the  null 
were  true.  That  is,  if  the  treatment  has  no  effect, 
the  chance  that  random  assignment  will  produce 
this  exact  outcome  is  1/35.  This  p  value  indicates 
that  the  best  evidence  outcome  indeed  enables  us 
to  be  fairly  confident  that  the  prime  has  an  effect 
on  student  behavior  in  our  divide-the-dollar  game. 

We  can  make  this  approach  to  hypothesis  test¬ 
ing  more  general  by  introducing  a  summary  statis¬ 
tic  that  enables  us  to  translate  ranks  into  a  single 
measurement  of  the  outcome  among  the  treatment 
subjects.  One  possible  statistic  for  this  purpose  is 
the  sum  of  the  ranks  of  the  treated  subjects.  This 
statistic  will  be  lower  if  the  treated  subjects  are 
generally  higher  in  their  giving  than  the  control 
subjects  and  higher  if  they  are  not.  The  subjects 
could  just  as  easily  be  ranked  in  the  opposite  man¬ 
ner,  and  then  higher  rank  sums  would  be  associ¬ 
ated  with  higher  amounts  given.  Using  this  statis¬ 
tic,  we  can  answer  the  question  of  what  the  chance 
is  of  observing  an  outcome  of  a  specific  degree  (or 
smaller/larger)  among  the  treatment  subjects  if  the 
treatment  actually  has  no  effect.  For  example,  sup¬ 
pose  the  outcome  we  observed  among  the  treated 
subjects  was  the  ranks  1,  2,  and  7.  Our  summary 


statistic  would  be  lOf^  1  +  2  +  7).  This  seems 
close  to  the  best  evidence  outcome  we  just  consid¬ 
ered,  where  the  sum  of  the  ranks  would  be  1  +  2 
+  3  =  6,  but  is  it  close  enough  to  be  convincing 
evidence  of  a  treatment  effect?  To  answer  this 
question,  we  work  out  the  probability  of  observing 
a  rank  sum  of  the  same  amount  or  less  than  the 
one  we  obtained  under  the  null  hypothesis — that 
the  treatment  had  no  effect  on  giving — by  return¬ 
ing  to  our  enumeration  of  all  35  possible  combina¬ 
tions  of  the  three  ranks  and  calculating  the  rank 
sum  for  each  combination.  Four  of  the  35  possible 
rank  combinations  produce  a  sum  of  10,  3  more 
produce  a  sum  of  9,  2  result  in  a  sum  of  8,  1  set 
sums  to  7,  and  another  to  6.  Thus,  if  the  treatment 
had  no  effect,  the  chance  of  observing  an  outcome 
like  the  one  we  did  or  smaller  would  be  p  =  11/35 
or  approximately  0.314.  Put  another  way,  if  the 
prime  has  no  effect,  we  could  expect  to  see  a  value 
for  the  summed  ranks  as  low  as  or  lower  than  the 
one  we  observed  31  out  of  every  100  times  we 
randomly  assigned  the  treatment  to  these  particu¬ 
lar  subjects.  Using  the  traditional  threshold  of  .05, 
the  p  value  we  calculated  would  not  allow  us  to 
reject  the  null  hypothesis;  the  observed  outcome 
did  not  provide  sufficient  evidence  that  the  prime 
had  an  effect  on  the  behavior  of  our  subjects.  This 
demonstrates  the  logic  of  the  rank-sum  test  and 
nonparametric  methods  more  generally,  where  we 
are  able  to  test  a  hypothesis  about  a  treatment 
effect  without  the  parametric  assumption  neces¬ 
sary  for  the  t  test. 

Nonparametric  Regression 

Smoothed  regression  is  a  common  nonparametric 
estimation  technique.  This  approach  to  regression 
is  based  on  the  belief  that  standard  parametric 
regression  models  are  often  misspecified  due  to  an 
incorrect  functional  form.  Smoothing  and  non¬ 
parametric  regression  are  generally  interchange¬ 
able  terms  for  a  set  of  statistical  techniques  used  to 
summarize  bivariate  relationships  in  scatterplots. 
With  more  commonly  used  parametric  statistical 
techniques,  the  relationship  between  two  vari¬ 
ables,  x  and  y,  is  summarized  with  a  parameter 
such  as  a  regression  coefficient.  With  nonparamet¬ 
ric  regression,  there  is  no  single  parameter;  instead, 
the  statistical  relationship  between  x  and  y  is  sum¬ 
marized  with  a  line  drawing.  Given  that  there  is  no 
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single  parameter  produced  by  the  statistical  model, 
these  models  are  nonparametric.  A  more  specific 
comparison  between  the  parametric  and  nonpara¬ 
metric  approaches  to  regression  clarifies  the  differ¬ 
ences  between  these  methods. 

Assume  that  we  have  y  and  x,  two  continuous 
variables,  and  we  wish  to  estimate  the  mean  of  y 
conditional  on  the  regressor  x.  We  can  write  this 
relationship  formally  as 

ylx  =  fix)  +  s. 

The  term  f  refers  to  the  functional  form  for  the 
relationship  between  y  and  x,  and  we  assume  that 
f  is  some  function  that  is  smooth.  The  familiar 
linear  functional  form  is  a  special  case  where  the 
following  is  true: 

f  —  a  +  px. 

Since  a  linear  relationship  is  a  smooth  function, 
it  is  a  part  of  the  family  of  smooth  functions  that 
comprise  f.  The  linear  regression  model  is  para¬ 
metric  because  the  parameter  p  summarizes  the 
statistical  dependence  between  x  and  y. 

With  nonparametric  regression,  we  estimate  the 
functional  form  f  from  the  data.  Therefore,  the  a 
priori  assumption  of  linearity  is  replaced  with  the 
much  weaker  assumption  of  a  smooth  population 
function.  The  usual  parametric  estimate  is  consid¬ 
ered  global  since  a  single  parameter,  p,  based  on 
all  the  data  summarizes  the  statistical  relationship. 
There  are  a  number  of  methods  for  estimating 
nonparametric  regression  models.  All  of  these 
methods  use  a  series  of  local  estimates,  estimates 
based  on  a  subset  of  the  entire  sample,  to  form  a 
nonparametric  estimate. 

The  following  provides  a  simple  example  of  one 
method  for  estimating  nonparametric  regression 
models.  Say  we  are  interested  in  estimating  the 
association  between  incumbent  vote  share  and 
incumbent  spending.  We  suspect  that  the  relation¬ 
ship  is  nonlinear  and  thus  the  usual  linear  regres¬ 
sion  model  will  be  misspecified  since  the  functional 
form  is  incorrect.  One  form  of  nonparametric 
regression  we  might  use  is  the  moving  average 
smoother.  First,  the  data  must  be  sorted  according 
to  x — here  incumbent  spending.  We  can  obtain  the 
nonparametric  estimate  by  calculating  the  mean  of 
the  incumbent  vote  share  within  ordered  intervals 


of  spending.  That  is,  we  calculate  the  average 
incumbent  vote  share  for  a  small  range  of  x,  per¬ 
haps  the  first  5%  of  the  data.  We  repeat  this  across 
the  range  of  incumbent  spending.  Once  this  pro¬ 
cess  is  complete  we  have  a  series  of  local  averages. 

Plotting  these  local  averages  in  a  line  graph 
summarizes  the  statistical  dependence  between 
incumbent  vote  share  and  incumbent  spending. 
These  local  averages  may  increase  linearly.  If  so, 
the  nonparametric  estimate  is  nearly  equivalent  to 
an  ordinary  least  squares  estimate.  What  is  impor¬ 
tant  is,  however,  that  these  local  averages  are  not 
constrained  to  be  linear  and  can  produce  a  highly 
nonlinear  statistical  estimate.  Matching  estima¬ 
tors,  which  have  gained  popularity  as  of  late,  are 
another  form  of  nonparametric  regression. 

The  local  estimator  need  not  be  based  on  the 
mean.  Least  squares  regression  can  replace  the 
mean  as  the  local  estimator.  Here,  the  predicted 
value  from  a  regression  model  becomes  the  local 
point  estimate.  Nonparametric  regression  methods 
such  as  LOESS  and  LOWESS  are  based  on  local 
regression  models.  Basing  the  local  estimate  on  a 
regression  model  produces  a  nonparametric  esti¬ 
mate  with  less  bias.  Splines  are  another  frequently 
used  form  of  nonparametric  estimators  based  on  a 
series  of  joined  local  regression  estimates. 

Due  to  the  weak  assumptions  needed  for  non¬ 
parametric  tests,  they  are  less  efficient  than  classi¬ 
cal  parametric  tests  if  the  assumptions  for  the 
parametric  tests  hold.  For  example,  the  rank  test 
will  be  less  efficient  than  a  t  test  if  the  data  are 
normal.  However,  if  the  data  are  nonnormal,  the 
nonparametric  alternative  will  be  more  powerful. 
Nonparametric  tests  are  common  in  the  statistical 
literature  on  causal  inference.  For  example,  non¬ 
parametric  statistics  are  critical  to  the  development 
of  sensitivity  analyses  for  matching  estimators.  In 
the  causal  inference  literature,  close  attention  is 
paid  to  statistical  assumptions,  and  the  goal  is  to 
weaken  assumptions  where  possible.  Nonpara¬ 
metric  statistics  allow  this  to  a  great  extent. 

Luke  Keele 
Ohio  State  University 
Columbus,  Ohio,  United  States 
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Nonstate  Actors 


The  phrase  nonstate  actors  arose  in  the  study  of 
world  politics  during  the  1970s  in  the  context  of 
a  transnationalist  critique  of  the  then  prevailing 
realist  orthodoxy.  Realist  theory  of  international 
relations  holds  that  only  states  are  and  can  be 
actors  in  politics  beyond  the  national  realm.  In 
contrast,  transnationalist  analysis  argues  that 
other  entities  besides  states — such  as  business 
enterprises,  mass  media  organizations,  civil  soci¬ 
ety  associations,  and  political  parties — can  also 
operate  as  actors  in  world  politics. 

An  actor  is  a  behavioral  unit  that  can  engage 
and  influence  its  situation.  A  social  actor  can  be 
either  an  individual  person  or  a  group  of  people 
assembled  in  a  formal  or  informal  collective  body. 
Realism  maintains  that  only  one  kind  of  actor — 
the  state  (i.e.,  a  national-territorial  government) — 
can  affect  relations  between  and  among  countries. 
Transnationalism  affirms  that  multiple  types  of 
agents,  including  a  variety  of  nonstate  actors,  can 
shape  world  politics.  Some  scholars  therefore  draw 
a  distinction  between  international  relations 
(among  states)  and  transnational  relations  (involv¬ 
ing  a  plurality  of  actors,  both  state  and  nonstate). 

Realism  and  Transnationalism 

From  a  realist  perspective,  nongovernmental  bodies 
are  always  subject  to  state  power  in  world  politics. 
Hence,  on  a  realist  premise,  business  corporations 


cannot  operate  outside  their  base  country  except 
insofar  as  home  and  host  states  permit  them  to  do 
so.  Companies  such  as  Coca-Cola,  LUKOIL,  and 
Microsoft  would,  on  a  realist  understanding,  always 
be  subject  to  the  full  control  of  national  govern¬ 
ments.  Likewise,  for  realists,  civil  society  associa¬ 
tions  such  as  the  human  rights  organization 
Amnesty  International,  the  ecological  lobby 
Greenpeace,  and  the  religious  movement  A1  Qaeda 
have  no  impact  on  world  politics  except  when 
states  allow  it.  Indeed,  some  realist  analyses  suggest 
that  nonstate  entities  operate  wholly  and  solely  as 
tools  of  state  policy.  Thus,  for  example,  a  realist 
could  argue  that  the  humanitarian  relief  agency 
World  Vision  only  acts  inasmuch  as  it  serves  the 
interests  of  donor  states  in  the  Global  North  and 
recipient  states  in  the  Global  South. 

The  realist  (or  “statist”)  approach  to  action  in 
world  politics  has  never  gained  acceptance  by  all 
scholars.  For  example,  from  Karl  Marx  onward, 
historical  materialists  have  maintained  that  states 
serve  capital  rather  than  the  other  way  around  and 
that  workers  need  to  unite  across  borders  in  order 
to  achieve  social  transformation.  Liberal  interna¬ 
tionalists,  too,  suggested  already  in  the  19th  and 
early  20th  centuries  that  citizen  movements  (e.g., 
of  feminists,  pacifists,  religious  revivalists)  could 
affect  the  course  of  interstate  relations.  In  the 
1920s  and  1930s,  political  and  sociological 
research  on  world  affairs  regularly  considered 
nongovernmental  as  well  as  governmental  actors. 

Between  the  1940s  and  the  1960s,  however, 
most  students  of  international  relations  took  the 
realist  position  that  nonstate  actors  play  no  auton¬ 
omous  role  in  world  politics.  In  the  context  of 
World  War  II  and  the  subsequent  Cold  War,  it 
appeared — particularly  to  scholars  in  North 
America  and  Western  Europe — that  world  affairs 
were  reducible  to  state  action.  After  all,  entire  soci¬ 
eties  were  subordinated  to  state  direction  in  World 
War  II.  Similarly,  both  sides  in  the  Cold  War  mar¬ 
shaled  their  respective  business  sectors,  mass 
media,  political  parties,  and  civil  societies  in  the 
bipolar  struggle. 

Yet  even  during  this  period  of  heightened  inter¬ 
state  conflict,  it  was  not  always  evident  that  states 
were  the  sole  actors  in  world  politics.  For  example, 
many  anticolonial  movements  at  this  time  success¬ 
fully  exploited  transnational  links  to  promote  their 
causes.  Likewise,  the  continued  global  expansion 
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of  major  oil  corporations  in  the  third  quarter  of 
the  20th  century  was  not  merely  a  reflection  of 
state  policy.  Indeed,  then  as  now,  firms  sometimes 
used  states  to  further  their  investments  abroad.  For 
instance,  the  U.S.  government  intervention  in 
Guatemala  in  1954  came  largely  at  the  behest  of 
the  United  Fruit  Company. 

Such  circumstances  where  nonstate  entities 
exercised  influence  in  world  politics  became 
increasingly  visible  as  colonial  empires  receded 
and  Cold  War  tensions  eased.  From  the  early 
1970s,  a  number  of  scholars  began  to  assert  that 
the  realist  conception  of  actors  was  overly  narrow. 
The  term  transnational  relations  gained  popularity 
as  a  description  of  a  situation  where  plural  actors 
engage  in  world  politics.  Similarly,  researchers 
who  have  since  the  1980s  adopted  concepts  of 
“global  politics”  and  “globalization”  usually  argue 
that  nonstate  players  figure  importantly  in  world 
affairs. 

Today,  in  the  early  21st  century,  it  seems  incon¬ 
trovertible  that  nonstate  entities  in  world  politics 
can  have  powers  beyond  those  of  national  govern¬ 
ments.  Many  companies,  media  organizations, 
civil  society  associations,  political  parties,  and 
individuals  can  act  on  the  world  stage  at  least 
partly  in  their  own  right.  If  the  statist  premises  of 
realist  theory  were  questionable  even  at  the  height 
of  the  Cold  War,  they  are  certainly  unsustainable 
in  current  world  politics. 

Types  of  Nonstate  Actors 

According  to  the  World  Investment  Report  2008, 
transnational  corporations  (TNCs)  now  number 
79,000  (as  against  around  200  states).  Between 
them,  these  companies  own  some  790,000  subsid¬ 
iaries  outside  their  base  country.  TNCs  collectively 
hold  about  US$15  trillion  in  foreign  direct  invest¬ 
ment  (FDI)  and  generate  annual  sales  of  US$31 
trillion.  The  largest  TNCs  have  a  yearly  turnover 
that  exceeds  the  GDP  of  a  majority  of  countries.  A 
substantial  proportion  of  cross-border  commerce 
(estimates  range  from  a  quarter  to  a  third  of  the 
total)  involves  intrafirm  trade  within  TNCs.  In  the 
light  of  this  significance  of  TNCs,  all  states  now,  to 
one  degree  or  another,  adjust  their  policies  on  regu¬ 
lation  and  taxation  with  a  view  to  attracting  and 
retaining  FDI.  States  also  struggle  to  monitor — let 
alone  manage — the  several  trillion  dollars’  worth 


of  liquid  financial  flows  that  banks,  securities 
firms,  and  hedge  funds  move  (mostly  electroni¬ 
cally)  around  the  globe  each  day.  Moreover,  some 
transnational  companies  are  illicit  traffickers  of 
goods  and  people,  and  they  make  it  their  business 
to  evade  state  laws  altogether. 

Mass  media  outlets,  too,  figure  as  important 
nonstate  players  in  contemporary  world  politics. 
News  agencies,  such  as  Reuters  and  Inter  Press 
Service,  shape  much  of  the  information  on  world 
affairs  that  circulates  in  contemporary  society. 
Certain  newspapers  and  magazines,  such  as  the 
Financial  Times  and  Cosmopolitan,  are  distributed 
across  the  planet.  World  service  radio  stations 
have  broadcast  intercontinentally  from  the  1930s, 
while  satellite  television  operators  have,  since  their 
launch  in  the  1960s,  reached  hundreds  of  millions 
of  households.  Starting  in  1992,  the  suitably 
named  World  Wide  Web  has  offered  a  platform 
for  countless  online  global  information  providers. 
Although  states — and  particularly  stronger  states — 
can  exert  substantial  influence  over  transnationally 
operating  mass  media,  it  is  hardly  credible  to  argue 
on  realist  lines  that  CNN  and  YouTube  are  wholly 
subject  to  state  control  and  nothing  more  than 
tools  of  foreign  policy. 

Likewise,  civil  society  constitutes  an  important 
sphere  of  nonstate  actors  in  world  politics.  Civil 
society  organizations  (CSOs)  are  associations  of 
citizens  that  seek,  from  outside  political  parties,  to 
shape  the  rules  that  govern  society.  Many  CSOs 
are  formally  organized,  legally  registered,  and  pro¬ 
fessionally  staffed  advocacy  bodies.  Examples 
include  business  forums  like  the  International 
Chamber  of  Commerce,  consumer  groups  like 
Consumers  International,  democracy  promoters 
like  the  Open  Society  Institute,  development  sup¬ 
porters  like  Oxfam,  environmental  activists  like 
Friends  of  the  Earth,  faith  assemblies  like  the 
Roman  Catholic  Church,  farmer  lobbies  like  Via 
Campesina,  health  action  networks  like  Medecins 
Sans  Frontieres,  human  rights  advocates  like  the 
International  Lesbian  and  Gay  Association,  labor 
unions  like  Public  Services  International,  philan¬ 
thropic  foundations  like  the  Rockefeller  Brothers 
Fund,  and  think  tanks  like  the  Club  of  Rome. 
Other  civil  society  initiatives  in  world  politics  take 
shape  as  informal  social  movements,  for  instance, 
of  anarchists,  diaspora  networks,  indigenous  peo¬ 
ples,  peasants,  religious  revivalists,  street  vendors, 
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and  youth.  Indeed,  some  of  the  more  prominent 
civil  society  actions  in  recent  world  politics  have 
been  ephemeral  public  demonstrations.  Examples 
include  street  protests  at  the  Ministerial  Conference 
of  the  World  Trade  Organization  in  Seattle  in 
1999  and  across  the  globe  on  February  15,  2003, 
against  the  impending  U.S.  invasion  of  Iraq. 

In  contrast  to  CSOs,  who  aim  to  influence  poli¬ 
tics  from  positions  outside  official  circles,  political 
parties  have  as  their  overriding  goal  the  occupation 
of  public  office.  Although  these  actors  have  tradi¬ 
tionally  centered  their  efforts  on  capturing  state 
power  (locally,  provincially,  and  nationally),  over 
recent  decades,  they  have  also  held  seats  in  several 
suprastate  regional  assemblies  in  Africa,  Europe, 
and  Latin  America.  Moreover,  political  parties 
with  broadly  shared  visions  have  formed  global 
associations  such  as  the  Socialist  International 
(with  origins  in  1889),  the  Trotskyist  Fourth 
International  (founded  in  1938),  the  Liberal 
International  (created  in  1947),  the  International 
Democrat  Union  (1983),  and  the  Global  Green 
Network  (2001). 

Other  nonstate  actors  with  influence  in  world 
politics  operate  as  individuals  rather  than  collec¬ 
tive  agents.  For  example,  some  persons  such  as 
Mahatma  Gandhi,  Che  Guevara,  and  Wangari 
Maathai  have  shaped  world  affairs  as  charismatic 
visionaries  from  outside  states.  In  addition,  so- 
called  celebrity  diplomacy  has  figured  with  consid¬ 
erable  prominence  in  several  contemporary  global 
campaigns  regarding  ecology,  human  rights,  and 
poverty.  Out  of  the  limelight,  countless  other  non¬ 
state  individuals  have  affected  world  politics  as 
artists,  athletes,  bloggers,  carers,  entertainers,  mer¬ 
cenaries,  migrants,  patients,  pilgrims,  program¬ 
mers,  scholars,  terrorists,  and  tourists. 

As  the  example  of  mercenaries  indicates,  nonstate 
actors  can  in  some  instances  complement  or  even 
rival  states  in  the  execution  of  governance  functions 
in  world  affairs.  It  is  often  assumed  that  governance 
(i.e.,  the  formulation  and  implementation  of  societal 
rules)  occurs  only  through  the  public  sector  of  gov¬ 
ernments  and  intergovernmental  agencies.  However, 
in  contemporary  world  politics,  many  regulatory 
processes  are  conducted  at  least  partly  through  non¬ 
state  bodies.  The  many  instances  of  private  transna¬ 
tional  governance  include  the  International  Court  of 
Arbitration  of  the  International  Chamber  of 
Commerce  (for  settlement  of  disputes  involving 


TNCs),  the  Wolfsberg  Group  (for  guidelines  to 
prevent  money  laundering),  and  the  Forestry 
Stewardship  Council  (for  certification  of  sustain¬ 
able  logging). 

In  numerous  other  cases,  governance  of  world 
affairs  transpires  through  multistakeholder  combi¬ 
nations  of  state  and  nonstate  actors.  Thus,  for 
example,  the  International  Labour  Organization 
(ILO)  has  since  1920  operated  as  a  triangular  col¬ 
laboration  among  governments,  employer  federa¬ 
tions,  and  trade  unions.  More  recently,  created 
multistakeholder  governance  bodies  include  the 
Global  Reporting  Initiative  (GRI,  for  corporate 
social  responsibility  schemes),  the  Internet  Corpor¬ 
ation  for  Assigned  Names  and  Numbers  (ICANN, 
for  the  regulation  of  domain  names),  and  the 
Global  Fund  to  Fight  AIDS,  Tuberculosis  and 
Malaria.  In  addition,  most  major  interstate  gather¬ 
ings  (such  as  United  Nations  conferences  and 
Group  of  Eight  summits)  are  nowadays  accompa¬ 
nied  by  parallel  meetings  of  nonstate  actors. 

The  Future  of  Nonstate  Actors 

To  be  sure,  nothing  in  the  preceding  discussion 
points  to  an  approaching  irrelevance  of  states  in 
world  politics.  Especially  from  the  1970s  to  the 
1990s,  certain  transnationalists  and  globalists 
have  suggested  that  the  proliferation  and  growth 
of  nonstate  actors  entail  a  decline  or  even  a  demise 
of  the  state.  Yet  the  situation  is  clearly  not  a  zero- 
sum  game.  As  the  example  of  multistakeholder 
forums  illustrates,  states  and  nonstate  actors  are 
often  in  relationships  of  cooperation  and  mutual 
reinforcement.  Indeed,  many  states  have  expanded 
and  gained  increased  capacities  at  the  same  time 
that  nonstate  actors  have  grown  and  become  more 
prominent.  Bigger  states  in  particular  continue  to 
be  great  powers  in  world  affairs.  Although  some 
analysts  have  highlighted  the  prospect  of  so-called 
failed  states,  thus  far  only  a  handful  of  countries 
have  even  temporarily  experienced  a  comprehen¬ 
sive  state  collapse.  On  the  whole,  states  remain 
highly  robust  in  a  world  that  is  also  populated  by 
nonstate  actors. 

Hence  the  issue  is  less  whether  nonstate  entities 
matter  in  world  politics  and  whether  states  will 
survive  but  more  how  and  why  state  and  nonstate 
bodies  interact  in  the  ways  that  they  do.  Dispensing 
with  unrealistic  statism,  as  the  above  discussion 
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has  done,  is  rather  straightforward.  However, 
determining  more  precisely  how  and  why  nonstate 
actors  are  relevant  in  world  affairs  is  more  prob¬ 
lematic  and  controversial.  Widely  divergent 
answers  to  this  question  are  available,  depending 
on  whether  one  adopts  a  liberalist,  constructivist, 
Marxist,  feminist,  poststructuralist,  or  other  theo¬ 
retical  perspective  on  world  politics.  In  particular, 
the  different  approaches  relate  actors  (both  state 
and  nonstate)  to  contrasting  conceptions  of  social 
structure  and  structural  power.  However,  debates 
about  agent-structure  relations  go  beyond  the 
scope  of  the  present  entry.  Suffice  it  to  say  here 
that  establishing  that  nonstate  actors  matter  is 
only  one  step  in  a  much  larger  endeavor  of  build¬ 
ing  knowledge  of  world  politics. 
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Normative  Political  Theory 


Political  theory  deals  with  the  creation  of  good 
governing  institutions  according  to  some  principle 


and  the  assessment  of  what  the  institutions  do. 
Virtually  by  definition,  assessment  of  institutions 
requires  resolving  issues  in  political  theory  and 
more  or  less  simultaneously  issues  in  social  sci¬ 
ence.  Although  in  our  time  the  two  streams  sel¬ 
dom  merge,  both  are  essential  to  the  task  of 
political  theory.  Normative  political  theory  offers 
a  framework  for  the  evaluative  component  of  this 
assessment.  This  entry  first  reviews  the  modern 
origins  of  normative  political  theory  and  its 
explanatory  basis  and  then  examines  its  main 
issues:  equality,  civil  society,  civil  liberties,  justice, 
democracy,  and  constitutionalism.  Because  these 
issues  are  often  discussed  in  substantial  isolation 
from  each  other  or  with  the  principal  focus  on  one 
major  issue,  as  in  John  Rawls’s  book  A  Theory  of 
Justice,  one  might  conclude  that  there  is  no  gen¬ 
eral  normative  theory  that  covers  all  of  these. 
Most  of  them  are  often  used  purely  descriptively 
without  any  normative  weight.  For  example, 
democracy  is  a  form  of  government  that  can  be 
well  described;  or  rather,  it  is  several  different 
forms,  each  of  which  can  be  described  indepen¬ 
dently  of  its  normative  justification. 

One  could  consider  normative  political  thought 
in  the  classical  era  of  Plato  and  Aristotle;  in  the 
medieval  thought  of  Aquinas,  Marsilius  of  Padua, 
and  others;  or  in  modern  thought  from  Thomas 
Hobbes  to  the  present.  However,  because  of  space 
limitations,  this  entry  focuses  on  modern  thought, 
which  is  the  most  relevant  to  an  understanding  of 
contemporary  politics.  Normative  political  theory 
must  be  empirically  grounded  if  it  is  to  have  any 
bearing  on  the  study  of  politics  and  political  sci¬ 
ence;  although  classical  and  medieval  theories  may 
have  been  somewhat  grounded  in  their  own  time, 
they  are  far  less  so  today. 

There  are  two  overriding  concerns  in  contempo¬ 
rary  political  theory:  (1)  that  citizens  be  formally 
equal  in  various  ways  and  (2)  that  government  and 
collective  decision  making  be  democratic.  These 
concerns  are  essentially  modern.  There  is  virtually 
universal  agreement  on  these  two  normative  posi¬ 
tions,  with  one  major  exception:  that  the  supposi¬ 
tion  of  equality  may  not  apply  to  economic  status 
or  resources,  as  is  centrally  at  issue  in  theories  of 
distributive  justice  and  in  economic  libertarianism. 
There  is,  however,  strong  agreement  that  citizens 
should  be  formally  equal  in  politics,  as  in  the  slo¬ 
gan  “One  person,  one  vote.”  John  Stuart  Mill 
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argued  that  more  intellectually  and  educationally 
qualified  citizens  should  have  extra  votes;  how¬ 
ever,  virtually  no  political  or  democratic  theorist 
would  defend  this  view  today. 

There  are  further  normative  commitments, 
many  of  which  are  social  libertarian  constraints  on 
government  actions.  Articulate  social  libertarian¬ 
ism,  such  as  that  of  Mill,  however,  is  on  the  wane, 
especially  in  the  United  States,  whose  constitution 
can  rightly  be  characterized  as  one  of  the  truly 
great  social  libertarian  documents  and  one  that  has 
had  a  massive  impact  around  the  world.  Social 
libertarianism  was  earlier  very  nearly  the  American 
creed,  although  it  never  dominated  any  major 
political  party,  other  than  perhaps  the  Republican 
Party  with  William  Howard  Taft  and  later  Robert 
Taft.  Its  popular  decline  in  the  United  States  today 
is  not  reflected  in  the  writings  of  political  theorists. 

The  modern  era  of  political  theory  continues  to 
contribute  to  the  growth  of  explanatory  political 
science  as  well  as  normative  political  theory. 
Indeed,  because  of  the  issues  it  must  address,  it 
must  take  into  account  the  social  science  enterprise 
of  explanation.  In  the  struggle  to  master  more  or 
less  normative  issues,  especially  church-state  rela¬ 
tions,  there  is  no  well-defined  boundary  between 
normative  and  causal  explanatory  issues. 

Starting  the  discussion  of  modern  theory  with 
Hobbes  has  an  analytic  point,  not  merely  a  his¬ 
torical  one.  Hobbes  is  the  first  major  theorist  who 
deliberately  and  systematically  assumes  that  we  are 
motivated  primarily  by  our  individual  interests.  He 
is  committed  to  the  centrality  of  individualism  and 
self-interest.  Contrary  to  many  common  views,  he 
assumes  that  the  only  values  of  concern  are  indi¬ 
vidual  interests  and  that  there  are  no  collective 
interests  other  than  those  that  can  be  seen  as  aggre¬ 
gations  of  individual  interests.  This  individualism 
has  since  been  the  basis  of  many  arguments  and 
counterarguments,  to  such  an  extent  that  the  pros 
and  cons  of  the  individualist  vision  often  dominate 
discussion  in  political  theory.  No  one,  therefore, 
can  fully  understand  any  major  contemporary 
position  in  normative  political  theory  without 
addressing  individualism.  Rawls  shares  this  view, 
although  he  phrases  the  assumption  in  his  own 
distinctive  way.  He  supposes  that  we  are  “mutually 
disinterested”  in  the  welfare  of  each  other.  The 
main  point  of  this  claim  is  to  rule  out  envy  over 
others’  shares  of  the  collective  provision. 


Although  the  religious  vision  of  politics  has 
been  important  in  the  past,  today  this  perspective 
is  found  mainly  in  Islam  and  in  the  Catholic,  not 
the  Aristotelian,  half  of  the  natural-law  tradition. 

Ironically,  the  classical  era  arguably  has  greater 
relevance  to  contemporary  thought  than  does 
medieval  thought  because  the  former  is  de  facto 
secular  and  in  it  there  is  presumed  to  be  no  higher 
authority  than  man.  Hobbes  dismisses  reliance  on 
supernatural  inspiration,  arguing  that  faith  comes 
by  hearing — that  is  to  say,  by  being  taught.  In  a 
sentence,  he  convincingly  dismisses  several  centu¬ 
ries  of  political  theorizing.  Political  theory  had  to 
begin  almost  de  novo  after  leaving  religious  justifi¬ 
cations  behind.  Although  there  are  forerunners,  we 
may  take  Hobbes  as  the  de  novo  creative  thinker 
who  puts  us  on  the  path  to  modern  political  theory. 

Normative  political  theory  is  in  some  respects 
an  odd  endeavor  in  that  it  is  often  carried  on  at  the 
strictly  theoretical  level  with  little  or  no  serious 
reference  to  our  political  experience,  as  for  exam¬ 
ple  in  Plato’s  Republic.  Some  authors  refer  to  their 
work  as  “ideal  theory.”  This  is  apparently  meant 
to  be  a  defense  of  their  approach.  At  its  strongest, 
they  can  say  that  their  position  is  aspirational.  For 
example,  one  could  concede  that  widespread  egali¬ 
tarian  dispositions  are  not  likely  in  our  world  at 
this  time  but  still  hold  that  egalitarian  theorists 
should  push  our  understanding  of  how  to  make 
egalitarianism  work  and  how  to  refine  the  theory. 

Modern  Beginnings 

Let  us  begin  with  two  giants  of  social  theory  in  the 
modern  era:  Thomas  Hobbes  and  David  Hume. 
Both  Hobbes  and  Hume  think  that  humans  are 
primarily  motivated  by  self-interest.  Hume  states 
this  view  with  characteristic  force  in  his  Treatise  on 
Human  Nature :  “Nothing  is  more  certain,  than 
that  men  are,  in  a  great  measure,  governed  by  inter¬ 
est,  and  that  even  when  they  extend  their  concern 
beyond  themselves,  ’tis  not  to  any  great  distance” 
(Pt.  2,  sec.  7).  This  is  Hume’s  dominant  claim 
about  human  nature  in  the  Treatise.  Holding  this 
view  seemingly  must  make  his  program  to  explain 
moral  or  mutually  beneficial  outcomes  in  the  world 
of  human  and  institutional  action  implausible.  But 
that  is  his  extraordinary  program — to  explain  why 
it  is  that  a  large  collection  of  substantially  self- 
interested  people  could  achieve  justice  and  other 
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good  social  results,  all  without  a  primary  or  a  reli¬ 
gious  commitment  to  acting  morally. 

Some  moralists  in  Hume’s  time  assumed  that 
moral  views,  moral  character,  and  moral  commit¬ 
ments  were  God-given.  Hume,  however,  believed 
that  our  nature  is  not  fundamentally  moral;  he 
further  doubts  all  arguments  for  the  truth  of  reli¬ 
gious  claims.  Book  3  of  Hume’s  Treatise  attempts 
to  explain  how  we  can  act  as  though  we  were 
moral  and  how  we  can  establish  social  rules, 
norms,  and  laws  evidently  without  the  necessary 
moral  commitment  to  their  working.  A  clue  to  his 
argument  is  that  if  we  had  characters  such  as  the 
religious,  moralistic,  and  pure-reason  schools  of 
thought  suppose,  we  would  not  be  much  con¬ 
cerned  with  law  because  we  would  be  well-ordered 
without  its  additional  incentives.  Hume  argues 
that  debate  over  law  and  moral  issues  is  essentially 
proof  that  these  visions  are  wrong  and  that  we 
need  tougher  analysis. 

Both  Hobbes  and  Hume  enunciate  laws  of 
nature ,  by  which  they  both  mean  essentially  socio¬ 
logical  conclusions  by  which  we  must  abide  if  we 
are  to  prosper.  For  example,  both  theorists  hold 
that  it  is  a  law  of  nature  that  we  should  keep  our 
promises  (Hobbes  more  often  speaks  of  covenants 
and  contracts).  Hobbes  and,  more  clearly,  Hume 
argue  that  it  us  in  our  interest  to  keep  our  prom¬ 
ises.  Why?  Because  if  we  do  not,  we  will  find  it 
difficult  to  develop  cooperative,  promising  rela¬ 
tions  with  people  whose  cooperation  would  occa¬ 
sionally  serve  us  well. 

Both  philosophers  focus  on  the  problem  of 
social  order,  without  which  there  will  be  turmoil, 
anarchy,  and  even  violence,  leading  to  a  world  in 
which  all  will  be  relatively  destitute.  Hume  bases 
his  own  theory  of  social  order  on  spontaneous 
coordination  over  centuries  of  social  development. 
In  essence,  he  invents  and  applies  one  of  the  great¬ 
est  and  most  powerful  theories  in  all  of  social 
theory:  his  theory  of  convention. 

The  most  important  difference  between  Hobbes 
and  Hume  is  in  how  a  sovereign  is  empowered  ab 
initio.  Hobbes  recognizes  that  this  is  a  difficult 
problem  for  him  and  his  hypothetical  account  of 
the  creation  of  government  by  collective  agreement, 
or  a  “social  contract.”  In  his  discussion  of  the 
metaphor  of  the  social  contract,  Hobbes  does  not 
address  the  issue  of  how  individuals  would  transfer 
their  power  as  physical  selves  to  an  all-powerful 


sovereign.  Hume  is  notoriously  hostile  to  the  idea 
of  a  social  contract,  which  he  thinks  is  a  philoso¬ 
pher’s  perversely  unrealistic,  false,  and  implausible 
device.  Hobbes  himself  soon  abandons  his  own 
social  contract. 

Explanatory  Theory 

Hobbes  does  not  present  a  general  moral  theory. 
He  sees  himself  as  the  scientist  of  social  order. 
John  Locke,  writing  after  Hobbes,  assumes  that 
people  have  natural  rights,  especially  a  right  to 
own  the  property  (land)  with  which  they  have 
mixed  their  labor.  This  intuitionist  vision  has  had 
an  enormous  impact  ever  since,  as  in  the  work  of 
Robert  Nozick,  although  it  has  been  repeatedly 
dismissed  as  ungrounded  and  irrelevant.  It  is  a  rare 
instance  of  intuitionism  in  political  theory, 
although  intuitionism  is  a  major  part  of  moral 
theory.  On  this  score,  political  theory  is  far  more 
solidly  grounded.  This  account,  however,  cannot 
readily  generalize  into  a  fuller  moral  theory. 
Moreover,  in  a  view  that  most  of  us  might  share, 
Hobbes  and  Hume  reject  any  notion  of  property 
or  ownership  if  there  is  no  state  to  define  and 
regulate  them.  Indeed,  it  is  only  by  law  that  prop¬ 
erty  is  defined.  In  Locke’s  vision,  when  you  mix 
your  labor  with  a  bit  of  unowned  land,  it  becomes 
yours  even  before  there  is  a  state  to  define  owner¬ 
ship.  The  idea  of  mixing  one’s  labor  may  long 
predate  Locke’s  account.  In  old  English  property 
law,  when  you  sell  your  land  to  me,  I  must  shovel 
some  of  its  soil  as  though  symbolically  to  acknowl¬ 
edge  its  being  transformed  into  my  property. 
Locke’s  theory  elevates  this  odd  English  legal  prac¬ 
tice  from  a  mere  custom  to  the  status  of  a  moral 
principle.  If  the  land  has  no  prior  owner,  you  may 
appropriate  it  according  to  the  laws  of  the  nation 
of  which  it  is  a  part. 

One  of  the  great  strengths  of  political  philoso¬ 
phers  such  as  Hobbes,  Hume,  Mill,  and  Rawls  is 
that  they  take  seriously  and  try  to  understand  the 
world  to  which  their  theories  are  to  apply,  although 
Rawls  largely  passes  on  the  burden  of  understand¬ 
ing  the  political  and  institutional  world  to  others 
who,  one  hopes,  wish  to  apply  his  theory  of  justice. 
Those  others  would  have  to  include  social  scientists 
with  deep  understandings  of  institutions,  how  they 
work,  and  what  they  can  be  expected  to  accom¬ 
plish.  So  far,  those  others  have  primarily  been 
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moral  philosophers  who  focus  almost  exclusively 
on  the  moral  aspects  of  Rawls’s  theory  and  claims 
rather  than  on  the  workability  of  the  theory  as  a 
system  of  justice. 

Rawls  recognizes  that  institutional  analysis  is 
based  on  a  normative  theory.  He  supposes  that 
relevant  institutions  will  manage  redistribution 
and  inculcate  a  commitment  to  justice  in  the  popu¬ 
lation.  But  these  institutions  must  first  be  designed. 
There  is  a  circular  quality  to  this  hope  because 
without  the  widespread  commitment  to  a  just  dis¬ 
tribution,  relevant  institutions  are  unlikely  to  arise 
or  be  created.  What  set  of  just  institutions  could 
push,  say,  the  U.S.  Congress  or  the  British  Parlia¬ 
ment  aside?  Is  there  a  living  Republican  or  Tory  in 
government  who  would  support  policies  of  redis¬ 
tribution  at  such  a  level  as  to  achieve  a  high  degree 
of  equality  in  American  or  British  society?  Some 
revolutionary  movements  would  achieve  near 
equality  by  impoverishing  everyone,  as  Winstanley 
and  the  Diggers  argued  for  England  in  their  time. 
So  far,  no  one  has  proposed  a  serious  way  to 
achieve  greater  equality  without  massive  negative 
trade-offs.  The  workability  of  Rawls’s  theory  is 
therefore  deeply  in  question.  Suppose  that  the 
leadership  of  an  advanced  democratic  industrial 
state  tried  to  implement  Rawls’s  theory  of  justice. 
Is  there  any  chance  at  all  that  it  would  significantly 
advance  the  society  toward  justice  as  fairness?  If 
this  empirical  question  cannot  be  answered,  the 
enterprise  is  in  default. 

Hobbes  tries  to  ground  his  theory  in  an  account 
of  how  individuals  act  and  an  explanation  of  why 
they  act  this  way,  and  he  assertively  sees  himself  as 
a  scientist.  He  uses  his  account  of  human  incen¬ 
tives  to  explain  how  the  world  works.  This  is  an 
astonishingly  grand  move  on  his  part.  On  the  other 
hand,  he  sets  himself  a  very  restricted  purpose — to 
ground  social  order  and  then  to  leave  individuals 
to  make  their  own  “investments”  in  their  own 
prosperity  once  there  is  order.  Stylized  brief  read¬ 
ings  of  Hobbes  restrict  his  concern  to  survival, 
thereby  trivializing  his  theory.  In  fact,  he  gives  far 
more  space  to  order  as  the  necessary  enabler  of 
individual  prosperity  through  protecting  individu¬ 
als’  efforts  on  their  own  behalf  against  plundering 
and  theft.  In  this  vision,  he  foreshadows  the  later 
economic  arguments  of  Adam  Smith  and  others. 
His  draconian  dictator  reduces  the  risks  of  plant¬ 
ing  our  gardens  or  otherwise  investing  for  our 


future.  Even  today,  we  might  see  Hobbes’s  concern 
for  order  as  an  incontrovertible  element  of  good 
government.  People  in  the  North  Atlantic  commu¬ 
nity,  however,  can  typically  take  such  an  order  for 
granted  and  may  assume  rather  than  explain  it. 

Equality  and  Inequality 

It  is  commonly  presumed  that  equality  is  prima 
facie  preferable  to  inequality  (this  is,  of  course,  a 
moral  claim)  but  that  it  should  be  balanced  against 
other  considerations  that  may  causally  trade-off 
with  it  (an  explanatory  or  causal  claim).  Moral 
and  explanatory  theories  therefore  come  sharply 
together  in  the  analysis  of  distributive  justice  and 
egalitarianism.  There  are  at  least  five  standard 
arguments  about  the  interaction  between  inequal¬ 
ity  and  trade-offs  with  production.  In  some  of 
these,  inequality  enhances  production,  either 
because  unequal  incentives  (such  as  higher  sala¬ 
ries)  or  unequal  power  (as  in  hierarchy)  is  needed 
to  organize  the  production  of  beneficial  goods, 
both  personal  and  collective.  These  two  arguments 
generalize  to  macrolevel  arguments  about  the 
organization  of  society: 

1.  Equality  entails  reduced  incentives  to  those 
who  are  especially  productive  and  leads  to  a  trade¬ 
off  between  equality  and  efficiency  of  production 
and  therefore  lower  average  welfare. 

2.  Hierarchy,  and  likely  therefore  inequality,  is 
virtually  necessary  for  achieving  many  desirable 
social  goals. 

3.  Those  who  have  greater  resources  than  oth¬ 
ers  can  be  trailblazers  who  support  innovations 
that  eventually  benefit  almost  everyone. 

These  three  relationships  all  involve  direct  trade¬ 
offs  through  increasing  the  overall  welfare  at  the 
price  of  reduced  equality,  or  vice  versa.  There  are 
also  trade-offs  of  greater  equality  with  other  con¬ 
siderations.  Two  that  are  well  argued  in  extensive 
literatures  are  as  follows: 

4.  The  political  power  to  attempt  to  achieve 
equality  entails  the  power  to  do  much  else,  includ¬ 
ing  very  undesirable  things  such  as  suppression  of 
thought  and  dissent.  In  1934,  Joseph  Stalin 
declared  that  socialist  realism,  a  gross  misnomer, 
was  the  only  legitimate  style  for  artists.  After  not 
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enjoying  a  performance  of  Shostakovich’s  music, 
he  banned  the  gloriously  patriotic  composer  for  a 
while.  Stalin’s  tastes  ruled.  Many  artists  died  or 
ended  in  the  Gulag,  and  many  of  the  great  painters 
and  poets  of  the  era  were  suppressed.  The  effort  to 
achieve  equality  has  often  gone  badly  awry  in 
similar  ways  even  in  supposedly  humane  places, 
such  as  Tanzania  under  its  founding  prime  minis¬ 
ter,  Julius  Nyerere.  Nyerere’s  program  of  African 
socialism  included  an  economic  policy,  ujamaa, 
that  called  for  the  collectivization  of  farms  and 
forced  relocation.  Although  the  intention  was  to 
foster  economic  equality,  the  policy  exacerbated 
poverty  and  proved  to  be  unviable. 

5.  Equality  in  a  single  society  requires  autarky 
and  risks  the  selective  emigration  of  the  especially 
productive  individuals  to  nations  where  they  can 
thrive  better.  The  emigration  is  likely  to  be  eco¬ 
nomically  crippling  for  the  society  being  aban¬ 
doned,  as  it  was  in  the  former  East  Germany  and 
Czechoslovakia  under  Soviet  hegemony.  In  both 
these  nations,  as  also  in  politically  liberal  India, 
deliberate  policies  of  autarky  hampered  economic 
development.  When  India  ended  autarkic  control 
of  its  economy  in  1991,  its  economy  exploded  into 
very  rapid  growth.  China’s  economy  has  similarly 
boomed  since  it  turned  to  economic  liberalism 
while  maintaining  political  autocracy. 

With  respect  to  the  first  three  arguments,  inequal¬ 
ity  de  facto  can  typically  benefit  people  other  than 
merely  those  who  are  better-off.  The  negative  side 
of  inequality  in  the  last  two  cases  comes  from  the 
side  effects  of  attempting  to  achieve  equality  and 
not  from  a  direct  trade-off  between  equality  and 
welfare.  But  these  patterns  are  all  potentially  rele¬ 
vant  for  the  institutional  structures  for  achieving 
distributive  justice.  The  last  two  are  at  the  institu¬ 
tional  or  societal  level  and  should  therefore  be  espe¬ 
cially  interesting  to  political  scientists.  They  are  at 
the  normative  core  of  political  theory. 

Each  of  these  relationships  involves  a  trade-off 
of  equality  for  something  else:  productivity,  suc¬ 
cessful  organization,  freedom  of  expression,  eco¬ 
nomic  viability,  or  innovation.  All  of  these  involve 
productivity  in  some  way.  In  each  of  the  five  cases, 
although  there  might  be  disagreements  about  the 
scale  of  the  trade-off  of  equality  that  we  should 
bear,  there  is  not  likely  to  be  disagreement  that 
some  trade-off  is  desirable  or  even  almost  logically 


necessary.  These  issues  are  especially  important  in 
the  microcontexts  of  differential  power  of  groups, 
such  as  those  defined  by  gender  and  ethnicity,  and 
in  the  macrocontexts  of  national  and  developing 
economies. 

Civil  Society 

It  is  a  broadly  held  view  that  democracy  and  liber¬ 
alism  require  civil  society.  Discussions  of  civil 
society  are  both  normative  and  causal.  The  norma¬ 
tive  claims  are  that  we  will  be  better  people  and 
that  we  will  constitute  better  polities  or  societies  if 
we  have  civil  society.  This  sometimes  sounds  like 
nothing  more  than  a  definitional  claim,  but  it  is 
also  sometimes  a  causal  claim,  such  as  those  John 
Stuart  Mill,  Carole  Pateman,  and  others  make  for 
a  connection  between  liberal  democracy  and  per¬ 
sonal  autonomy  and  development.  But  the  most 
challenging  and  potentially  interesting  of  the  claims 
of  exponents  of  civil  society  is  the  grand  causal 
claim  that  we  need  it  if  we  are  to  cohere  politically 
and  socially.  This  is  often  accompanied  by  the 
claim  that  we  need  broad  trust  in  government  if 
government  is  to  serve  us  well. 

Against  these  theses,  there  is  a  long  tradition  of 
distrust  in  government,  especially  in  the  United 
States.  Why  distrust?  Because  the  experiences  both 
of  England  for  centuries  before  the  U.S.  Constitution 
and  of  the  13  states  during  their  brief  union  under 
the  Articles  of  Confederation  were  rent  by  govern¬ 
ment  attempts  to  control  the  economy  in  often 
destructive  ways.  Often  there  were  identifiable 
beneficiaries  of  the  controls.  Hence,  given  the 
power  to  intervene,  one  can  be  fairly  sure  that 
governments  will  often  do  so.  The  straightforward 
incentives  of  government  agents  are  to  arrange 
benefits  for  themselves  through  impositions  on 
others.  Such  incentives  are  a  recipe  for  distrust  in 
the  sense  that  those  on  the  wrong  end  of  the  inter¬ 
ventions  can  see  that  their  own  interests  are  sacri¬ 
ficed  for  those  of  others  merely  because  someone 
has  the  power  to  intervene.  Distrust  is  therefore 
not  merely  a  theoretical  stance;  it  is  a  clear 
response  to  experience.  One  need  not  have  a  the¬ 
ory  of  why  any  kind  of  government  exceeds  its 
bounds  when  one  has  extensive  evidence  that  it 
does.  In  the  20th  century,  technological  innova¬ 
tions  radically  increased  the  scope  of  governmental 
excess  to  include  massive,  murderous  brutality. 
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Even  before  James  Madison’s  arguments  for  the 
U.S.  Constitution,  the  recognition  that  govern¬ 
ments  were  prone  to  abusing  people  in  various 
ways  was  a  central  part  of  the  development  of 
liberal  thought,  especially  in  the  work  of  Locke, 
Hume,  and  Smith.  The  original  contributions  of 
Madison  to  this  long  tradition  were,  first,  to  create 
a  government  that  was  self-regulated  so  that  it 
could  not  easily  overreach  its  authority  and,  sec¬ 
ond,  to  give  that  government  very  little  authority 
while  also  diminishing  the  authority  of  the  indi¬ 
vidual  states.  In  Madison’s  view,  the  states  were 
the  principal  threat  to  individual  liberty,  both 
economic  and  social. 

Civil  Liberties 

Liberty  is  the  antithesis  (and  victim)  of  despotism. 
Securing  civil  liberties  is  the  main  point  of  consti¬ 
tutionalism  and  of  court  review  of  legislative  and 
administrative  acts.  Historically,  the  defining  issue 
in  civil  liberties  in  the  nascent  United  States  was 
the  separation  of  church  and  state.  One  might  even 
say  that  religion  was  the  most  important  issue  in 
English  and  U.S.  political  history  and  remains 
among  the  most  important  today,  although  appar¬ 
ently  more  in  the  United  States  than  in  the  United 
Kingdom.  England  slowly  followed  the  American 
solution  despite  having  a  state  church  that  now  has 
virtually  no  political  role.  The  issue  is  perhaps 
most  often  cast  as  theoretical  or  normative.  The 
principal  issue  on  the  ground  for  several  centuries 
was,  however,  pragmatic.  It  was  how  to  stop  inter¬ 
ference  in  politics  and  governance  from  the  clergy, 
the  church,  and  especially  the  Pope,  all  of  whom 
acted  as  though  they  were  hierarchically  superior 
to  secular  political  leaders,  whom  they  even 
thought  they  had  the  God-given  right  to  remove  or 
force  from  office.  Many  English  citizens  and  lead¬ 
ers  long  wanted  such  religious  interference  out  of 
politics,  and  many  were  eventually  willing  to 
change  their  religious  commitment  in  order  to 
force  the  Catholic  Church  out  of  the  political  life 
of  England. 

Many  thinkers  also  address  the  abuse  of  indi¬ 
viduals  by  the  church  in  its  efforts  to  impose  par¬ 
ticular  beliefs.  Hobbes  and  Locke  suppose  that 
such  efforts  cannot  succeed  because  they  think  that 
beliefs  are  not  subject  to  choice,  so  that  it  is  point¬ 
less  to  punish  “wrong”  beliefs.  Hume  would  go 


further  to  say  there  is  no  way  to  establish  the  truth 
of  religious  views.  In  any  case,  punishing  beliefs  as 
wrong  is  an  epistemological  sin.  If  you  have  not 
chosen  your  Catholic  or  Protestant  beliefs,  you 
cannot  sensibly  be  held  responsible  for  them. 

If  they  are  politically  empowered,  conflicting 
religious  views  get  in  the  way  of  social  order,  as  in 
the  European  wars  of  religion.  They  also  compli¬ 
cate  constitutional  arrangements,  as  Madison  and 
Thomas  Jefferson  recognized.  With  a  multiplicity 
of  religions  seeking  special  status,  the  only  sensible 
move  is  to  deny  recognition  to  all  sects.  This  is 
both  normatively  and  pragmatically  a  compelling 
move.  In  the  views  of  Hobbes,  Locke,  and  many 
others,  it  is  also  epistemologically  compelling. 
Reaction  to  centuries  of  brutal  treatment  of  people 
for  holding  wrong  beliefs  is  one  of  the  fundamen¬ 
tal  causes  of  the  rise  of  liberalism,  along  with  reac¬ 
tion  to  the  frequent  arrogance  and  arbitrariness  of 
government  officials,  especially  monarchs. 

John  Locke  and  Samuel  von  Pufendorf  largely 
address  theoretical  issues  of  religion  and  gover¬ 
nance,  as  opposed  to  pragmatic  issues  of  actual 
problems  on  the  ground.  Many  English  thinkers 
before  and  after  them  have  been  far  more  con¬ 
cerned  with  the  latter,  especially  with  papal  and 
clerical  interference  in  domestic  politics.  After 
much  bloodshed,  the  church  was  subdued  enough 
to  be  transformed  into  the  far  less  intrusive 
Anglican  church.  The  first  major  move,  on  largely 
pragmatic  grounds,  to  take  government  out  of  reli¬ 
gious  hands  altogether  was  the  separation  of  state 
and  church  implied  in  the  First  Amendment  to  the 
U.S.  Constitution.  French  constitutional  provisions 
were  even  stronger  after  the  revolution  and  have 
led  to  banning  the  hijab  in  public  facilities,  such  as 
schools,  and  even  in  public.  These  policies  have 
overwhelming  support  in  France,  in  part  because 
overt  religious  display  is  severely  contentious  and 
has  been  since  the  revolution,  which  was  as  anti- 
Catholic  as  anti-aristocratic. 

Nations  in  which  a  particular  religion  is  given 
substantial  political  influence  generally  tend  to  be 
illiberal,  as  in  the  cases  of  Iran  under  the  ayatollahs 
and  Saudi  Arabia  under  Wahabbist  hegemony.  In 
the  latter,  possessing  a  Bible  is  a  capital  offense. 
Under  both  these  regimes,  the  harshest  principles 
of  sharia  law  are  imposed. 

The  substantial  separation  of  church  and  state 
seems  to  be  a  necessary  step  for  creating  a  liberal 
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society.  This  is  likely  not  a  theoretical  but  only  a 
pragmatic  claim,  but  it  fits  the  contemporary  situ¬ 
ation  in  many  nations.  Again,  philosophy  must  be 
informed  by  causal  understandings  if  it  is  to  be 
relevant  to  contemporary  debate  on  theory  or 
practice. 

There  is  another  pragmatic  objection  to  state 
sanction  of  individual  beliefs.  The  multiplicity  of 
faiths  in  the  United  States  and  many  other  nations 
makes  establishment  of  a  state  religion  very  con- 
flictive,  although  this  is  not  an  issue  for  Locke, 
who  would  establish  a  state  church  but  would  not 
punish  those  who  do  not  follow  its  beliefs.  As 
though  the  issue  were  strictly  a  matter  of  norma¬ 
tive  theory,  too  much  of  what  is  written  today  still 
neglects  pragmatic  objections  to  state  involvement 
in  religion,  especially  Madison. 

Given  all  the  debate  in  the  United  States  sur¬ 
rounding  the  Establishment  Clause  of  the  First 
Amendment,  debate  that  suggests  that  this  is  a 
severely  fraught  issue,  it  is  remarkable  that  there 
was  apparently  no  constitutional  test  of  any  law 
on  religious  grounds  for  nearly  a  century  after  the 
adoption  of  the  constitution.  The  case  that  finally 
went  to  the  Supreme  Court  to  test  the  meaning  and 
scope  of  the  principle  of  the  separation  of  church 
and  state  was  Reynolds  v.  United  States  (1879),  in 
which  the  court  allowed  the  constitutionality  of 
laws  that  penalized  bigamy  even  for  those  men 
who  claimed  that  multiple  marriages  were  part  of 
their  Mormon  religious  duty.  The  court  quoted 
Jefferson  as  holding  that  “the  legitimate  powers  of 
the  government  reach  actions  only,  and  not 
beliefs.”  Hence,  the  government  could  regulate 
behavior,  including  behavior  motivated  and  even 
supposedly  required  by  religious  beliefs.  This  posi¬ 
tion  is  causally  consonant  with  the  view  of  Hobbes 
that  the  state  can  act  against  behavior  but  that  it 
should  not  attempt  to  “correct”  beliefs,  which  one 
perhaps  cannot  even  know  a  person  has  and  which 
one  cannot  force  the  person  to  change,  even  under 
threat  from  the  Grand  Inquisitor. 

The  constitutional  protection  of  positive  indi¬ 
vidual  rights,  perhaps  of  the  freedom  of  religion 
foremost,  has  changed  over  the  centuries,  and 
therefore,  the  legal  conception  of  the  content  of 
religion  has  changed  and  broadened.  Naturally,  this 
has  been  especially  true  after  Reynolds.  For  exam¬ 
ple,  the  doctrine  of  separation  of  church  and  state 
stands  uneasily  behind  conscientious  objection  to 


military  service.  It  is  argued  that  one’s  religious 
commitment  to  pacifism  or  objecting  to  killing  on 
the  battlefield  is  violated  by  requirements  to  serve 
in  the  military.  Proof  of  one’s  religious  beliefs  on 
this  issue  is  prima  facie  demonstrated  by  member¬ 
ship  in  a  church  that  has  long  preached  pacifism, 
such  as  the  Quakers  and  Jehovah’s  Witnesses,  but 
the  idea  has  been  broadened  to  include  nonreli¬ 
gious  moral  objections. 

Distributive  Justice 

Hume  famously,  if  too  easily,  dismisses  the  quest 
for  “exact”  equality.  As  Brian  Barry  argues  in 
Theories  of  Justice,  Rawls  treats  fairness  as  per¬ 
haps  a  secondary  concern  with  welfare  and  its 
enhancement,  as  in  his  difference  principle,  which 
allows  inequalities  if  they  are  produced  by  institu¬ 
tions  that  enhance  the  welfare  of  the  worst-off 
class  of  people.  Indeed,  the  issues  in  equality  and 
its  trade-offs  with  efficiency  are  at  the  core  of 
Rawls’s  theory.  Thus,  there  are  two  parts  of  the 
theory:  fairness  and  mutual  advantage.  The  authors 
of  the  U.S.  Constitution  (1787)  were  worried  that 
democracy  would  bring  the  majority,  who  were 
poor  farmers,  to  power  and  that  they  would  con¬ 
fiscate  or  tax  away  property  to  redistribute  its 
value  to  themselves.  In  the  most  advanced  nations, 
to  which  Rawls  supposes  his  theory  applies,  there 
are  virtually  no  remaining  subsistence  farmers, 
who  a  century  ago  still  defined  poverty  for  most 
people  in  many  societies. 

Recall  Rawls’s  theory  of  mutual  disinterest  in 
each  other’s  welfare.  If  the  best  distribution  we  can 
manage  gives  me  fewer  resources  than  it  gives  you, 
I  am  not  concerned  that  you  have  done  better  than 
I  have.  If  that  is  not  true,  we  might  degenerate  into 
caring  about  how  each  of  us  ranks  rather  than  our 
welfare  per  se. 

If  Rawls’s  preferred  social  organization  is  to  be 
achieved  voluntarily  rather  than  by  forceful  impo¬ 
sition,  it  seems  implausible  that  anyone  now  living 
will  see  any  significant  progress  toward  it. 
Therefore,  his  theory  seems  at  best  to  be  an  ideal 
theory  in  our  time,  and  we  can  be  morally  certain 
that  we  will  not  see  it  put  to  work.  Voluntary 
moves  toward  just  distributions  face  a  grand 
instance  of  the  logic  of  collective  action.  The  logic 
of  collective  action  is  devastating  for  the  voluntary 
achievement  of  any  large-scale  provision.  As  Hume 
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says,  it  is  impossible  that  people  would  contribute 
to  a  massive  collective  provision.  Impossible  is  too 
strong  a  term,  but  only  just  barely  so.  That  logic 
coupled  with  strong  leanings  toward  self-interest 
implies  that  Rawls’s  theory  could  be  put  into  effect 
only  through  coercion  or  through  radically  alter¬ 
ing  human  nature  to  override  self-interest. 

Representative  Democracy 

Democracy  has  taken  many  forms  from  its  earliest 
rise  as  direct  democracy  with  all  citizens  in  the  same 
forum,  making  decisions  together.  In  contemporary 
societies,  this  is  of  course  not  feasible  for  national 
governance.  Hence,  the  basic  form  that  democracy 
must  take  is  representative  democracy,  which  is  a 
new  invention  to  handle  large  societies.  Strictly 
speaking,  only  direct,  face-to-face  democracy  is 
republican.  The  most  important  theorists  of  repre¬ 
sentative  democracy  are  John  Stuart  Mill  and 
Bernard  Manin.  Any  ideal  version  of  republicanism 
is  doomed  under  modern  conditions.  Large  size 
requires  representative  government  with  represen¬ 
tatives  who  cannot  possibly  represent  all  of  the 
diverse  elements  of  their  constituencies;  frequent, 
short  parliaments  are  desirable  for  republican  val¬ 
ues,  with  state  and  local  elections  every  year.  But  all 
of  this  is  impractical  in  our  geographic  and  demo¬ 
graphic  conditions.  Unfortunately,  in  a  representa¬ 
tive  system,  a  strong  parliament  meeting  frequently 
exacerbates  the  separation  of  the  parliamentarians 
from  the  people,  and  representatives  may  come  to 
resemble  the  rulers  rather  than  the  ruled. 

Democratic  practice  is  not  without  major  flaws 
and  serious  failings.  Briefly,  let  us  consider  a  few 
problems  in  the  working  of  representative  democ¬ 
racy:  First,  deliberative  democracy  makes  demands 
that  go  well  beyond  what  is  possible  and  at  its 
worst  descends  to  yelling  and  name  calling  when 
activists  mobilize  crowds  to  vent  their  ire  at  their 
opponents  and  elected  officials.  At  its  best,  signifi¬ 
cant  participation  and  deliberation  beyond  voting 
and  talking  with  associates  is  restricted  to  propor¬ 
tionately  very  few  people.  Second,  Anthony  Downs 
argues  convincingly  that  individuals  generally  have 
no  interest  in  going  to  the  trouble  of  voting.  There 
is  a  vast  literature  both  for  and  against  his  claims. 
Third,  issues  of  equality  and  inequality  suggest  that 
the  egalitarian  core  principle  of  democracy — one 
person,  one  vote — is  de  facto  violated  by  massive 


spending  and  by  differential  mobilization,  as  in  the 
antidemocratic  disruptions  of  vote  counting  in 
Florida’s  presidential  election  in  2000. 

Finally,  pause  a  moment  to  consider  the  perver¬ 
sion  of  the  role  of  governors,  as  argued  by  John 
Calhoun.  After  leaving  the  national  political  stage 
a  decade  before  the  U.S.  Civil  War,  Calhoun  wrote 
A  Disquisition  on  Government ,  which  was  an 
analysis  of  forces  distorting  the  U.S.  political  sys¬ 
tem.  In  it,  he  stated  that  “those  who  exercise 
power  and  those  subject  to  its  exercise — the  rulers 
and  the  ruled — stand  in  antagonistic  relations  to 
each  other.”  Calhoun  described  the  rulers  and  the 
ruled  as  two  hostile  classes:  “The  advantages  of 
possessing  the  control  of  the  powers  of  govern¬ 
ment,  and,  thereby,  of  its  honors  and  emoluments, 
are,  of  themselves,  exclusive  of  all  other  consider¬ 
ations,  ample  to  divide  [any]  community  into  two 
great  hostile  parties.”  The  division  here  is  not  only 
one  of  status  but  also  one  of  emoluments.  In 
essence,  Calhoun  argues  that  political  officehold¬ 
ers  are  separated  from  the  rest  of  the  community; 
they  are  a  class  apart.  The  supposed  trustee 
becomes  independent  from  the  persons  reposing 
their  trust.  The  creature  is  stronger  than  the  cre¬ 
ator;  hence,  we  suffer  a  lack  of  confidence  in  the 
representative  system. 

Constitutionalism 

The  mechanical  point  of  a  constitution  is  to  estab¬ 
lish  institutions  of  governance  and  to  protect  citi¬ 
zens  from  arbitrary  state  intrusions.  In  any  given 
context,  institutions  could  take  many  forms.  Without 
prior  order,  a  constitution  cannot  likely  be  made  to 
work;  indeed,  without  order,  it  might  even  be  hard 
to  engage  some  political  body  to  write  a  constitu¬ 
tion,  although  one  might  be  promulgated  by  a  cur¬ 
rent  government,  even  an  autocratic  government. 
Having  a  prior  history  of  successful  government 
might  be  the  best  predictor  of  constitutional  success. 
For  example,  the  13  American  colonies  had  work¬ 
ing  governments  and  even,  in  some  cases,  articulate 
constitutions  before  the  federal  constitution  of 
1787.  This  prior  experience,  as  well  as  the  experi¬ 
ence  of  living  under  a  relatively  orderly  English 
government,  enabled  the  constitutionalists  of  1787 
to  write  their  constitution  in  relative  quiet. 

The  main  focuses  of  constitutionalism  are  to 
enable  government  and  to  constrain  it.  These 


Normative  Political  Theory  1723 


purposes  sound  contradictory,  and  they  com¬ 
monly  are.  Why  create  government  to  constrain 
government?  As  odd  as  the  question  sounds,  it  is 
easy  to  answer,  so  easy  that  its  answers  were 
likely  taken  for  granted  by  the  authors  of  the  U.S. 
Constitution  in  1787  and  by  its  ratifiers  in  subse¬ 
quent  state  conventions.  In  this  particular  case, 
the  federal  constitution  was  needed  to  override 
state  constitutions,  especially  in  economic  policy, 
with  free  trade  among  the  states  and  exclusive 
federal  regulation  and  taxing  of  international 
trade.  Thereby,  the  constitution  established  a 
nation.  In  many  other  cases,  extant  governments 
have  been  altered  or  replaced  by  newly  created 
constitutions.  Many  of  these  extant  governments 
were  autocracies,  usually  monarchical  or  military. 

The  main  constraint  on  government  action 
from  a  strong  constitution  is  the  rule  of  law.  The 
rule  of  law  against  arbitrary  government  is  the 
high  dogma  of  17th-century  English  constitution¬ 
alists.  It  essentially  involves  limitations  on  the 
exercise  of  official  power.  This  is  now  often  seen 
as  the  main  point  of  a  constitution  and  the  central 
principle  of  Anglo-American  law.  Nonarbitrariness 
implies  equal  treatment,  in  part  by  blocking  arbi¬ 
trary  action  by  monarchs  and  political  office  hold¬ 
ers.  It  appears  that  the  rule  of  law  does  not  require 
a  written  constitution,  because  England  has  none 
and  yet  its  rulers  have  generally  followed  the  lib¬ 
eral  principle.  Moreover,  from  the  earliest  times, 
Saxons  in  Germany  lived  under  an  unwritten 
“constitution”  of  liberty,  a  fact  that  distinguishes 
English  political  developments. 

There  are  two  general  strategic  approaches  to 
constitutionalism,  treating  the  issue  as  primarily 
one  of  contracting  or  one  of  coordinating.  That 
there  could  be  varied  forms  that  the  constitution 
for  a  particular  society  could  take  already  suggests 
that  the  problem  is  at  least  partly  one  of  coordina¬ 
tion.  With  only  modest  changes,  the  very  brief  and 
often  vague  U.S.  Constitution  or  the  massively 
detailed  Indian  Constitution  might  both  work  in 
the  sense  that  they  define  institutions  that,  when 
put  in  place,  successfully  govern  their  societies. 

Perhaps  the  dominant  school  of  thought  on 
how  government  may  be  morally  justified  has  been 
contractarianism,  to  which  an  enormous  array  of 
philosophers  have  contributed,  including  classical 
thinkers  as  well  as  Thomas  Hobbes,  John  Locke, 
Jean-Jacques  Rousseau,  Immanuel  Kant,  and 


countless  others.  There  are  subtle  variations  in 
contractarian  arguments,  but  essentially  they  rely 
on  a  claim  of  consent  that  obligates  us  to  obey  the 
government  to  which  we  have  consented.  This  is 
commonly  framed  as  a  moral  argument.  We  are 
morally  obligated  by  our  prior  consent.  This  is 
perhaps  most  clearly  asserted  by  Locke,  who  is 
likely  the  most  influential  proponent  of  the  social 
contract  in  the  Anglo-Saxon  world.  Hobbes  holds 
more  simply  that  we  collectively  agree  on  election 
of  a  tyrant  or  dictator  to  coerce  us  into  decent 
behavior.  His  is  not  an  argument  from  ostensibly 
moral  consent  but  from  rational,  self-interested 
consent.  After  taking  this  limited  step  of  establish¬ 
ing  government,  there  is  no  further  project  for  the 
contractors  to  take  on,  and  they  have  no  further 
role  in  governing  themselves. 

Calling  this  contract  theory  can  be  seen  as  a 
grand  instance  of  persuasive  definition.  If  you  and 
I  contract  legally,  we  commonly  are  morally 
bound  to  perform,  unless  we  both  agree  not  to  do 
so.  Hume  mercilessly  demolishes  the  implicit  his¬ 
torical  arguments  of  the  development  from  a  state 
of  nature  to  a  contracted  government,  as  Hobbes 
posits  while  saying  that  such  a  state  probably 
never  existed.  Hume  notes  simply  that  if  we  had 
contracted  for  or  consented  to  our  government,  we 
would  remember  the  event.  Evidently  none  of  us 
does  remember,  nor  is  there  much  of  a  record  of 
our  forebears  having  agreed.  Worse  still,  many  of 
us  can  hardly  understand  the  contractarian  claim. 
Contemporary  constitutions  are  generally  written 
by  committees  and  voted  into  effect  by  some  far 
less  than  unanimous  part  of  the  national  popula¬ 
tion.  Opponents  of  contractarians  would  argue 
that  the  masses  cannot  be  morally  obligated  by 
such  a  process. 

Having  demolished  the  going  theory  in  his  time, 
Hume  offers  an  alternative  account,  stating  that 
government  arises  by  convention.  There  was  no 
start  date,  and  there  was  most  surely  no  conference 
room  agreement.  Our  government  evolves  over 
time  and  may  become  dramatically  different  in 
form  from  one  generation  to  the  next.  The  form  of 
the  government  is  generally  an  unintended  conse¬ 
quence  of  the  disparate  interests  and  inputs  of  the 
citizens,  as  portrayed  in  Charles  Mcllwain’s  his¬ 
tory  of  constitutionalism. 

In  this  view,  the  “contract”  in  the  social  contract 
is  generally  seen  as  morally  binding  by  analogy 
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with  a  standard  ordinary  contract  agreed  to  by  the 
parties  and  backed  by  the  law.  Coordination  as  in 
Hume’s  theory  is  not  morally  binding;  either  it 
works  or  it  does  not.  If  there  is  a  coordination  con¬ 
vention,  it  might  be  immoral  not  to  comply  with  it. 
For  example,  not  driving  on  the  right  when  there  is 
a  convention  of  driving  on  the  right  would  be  mur¬ 
derous  and  immoral  as  well  as  stupid.  If  I  go  to 
Australia,  I  will  do  my  best  to  follow  the  Australian 
driving  convention.  But  this  choice  does  not  turn 
on  any  prior  agreement.  My  connection  with  the 
coordination  is  not  a  moral  but  a  rational,  self- 
interested  commitment.  We  all  act  from  our  own 
interest  by  driving  right  in  North  America;  without 
agreement  or  negotiation,  we  create  a  beneficial 
order  for  all.  Similarly,  Hume’s  convention  account 
of  the  state  is  not  normative,  and  any  state  that 
arises  by  convention  is  neither  right  nor  wrong  in 
principle.  In  Hume’s  vision,  this  is  a  plus.  It  is  his 
great,  persuasive  move:  creating  collective  benefits 
and  good  institutions  from  the  self-interested  moti¬ 
vations  of  individuals.  Prior  theorists  almost  uni¬ 
formly  think  that  self-interest  is  an  obstacle  to 
good  government  and  social  cooperation.  In  con¬ 
trast,  Hume  sees  it  as  central  to  the  explanation  of 
achieving  good  government. 

In  sum,  a  reason  for  or  an  appeal  of  social  con¬ 
tractarianism  is  that  it  seemingly  yields  a  moral 
reason  to  comply  with  its  dictates.  As  Francois 
Guizot  says,  the  hypothesis  “of  a  primitive  contract, 
as  the  only  legitimate  source  of  social  law,  rests 
upon  an  assumption  that  is  necessarily  false  and 
impossible”  (Pt.  1,  Lecture  4).  Unfortunately,  on 
this  issue,  the  theory  fails.  Its  failure  seems  to  be  a 
source  of  nostalgia  for  many  theorists  who  would 
like  contractarianism  to  work  and  to  cohere.  Alas,  it 
does  neither.  The  vision  of  general  consent  is  sweet 
but  contrary  to  the  possibilities  of  human  nature  in 
a  world  of  scarcity  of  both  status  and  goods.  Again, 
Guizot  states  that  the  “necessary  coexistence  of  soci¬ 
ety  and  government  shows  the  absurdity  of  the 
hypothesis  of  the  social  contract”  (Pt.  1,  Lecture  6). 
It  is  astonishing  that  this  cooperative  vision  is  most 
famously  argued  by  the  nearly  antisocial  Rousseau. 
Guizot,  who  has  done  the  massive  work  of  catalog¬ 
ing  the  complicated,  back-and-forth  development, 
over  more  than  a  millennium,  of  constitutionalism 
and  representative  government  in  Europe,  seems 
morally  offended  by  the  simplistic  claims  for  a  social 
contract  as  the  standard  of  the  right. 


Conclusion 

We  have  only  three  major,  broadly  developed  nor¬ 
mative  schools  of  thought  on  this  question:  the  lib¬ 
eral,  utilitarian,  and  egalitarian  schools.  There  is 
also  a  developing  fairness  school  that  has  been  clas¬ 
sified  as  a  theory  only  recently  in  the  Rawlsian 
theory  of  distributive  justice.  We  have  yet  to  see 
how  it  will  stand  the  test  of  time.  Rawls’s  followers 
often  seem  far  more  to  fragment  than  to  develop  the 
theory.  As  noted  above,  what  the  theory  most  needs 
now  is  social-scientific  analysis  of  institutions. 

Utilitarianism  is  a  general  moral  and  political 
theory — in  fact,  the  only  one,  Rawls  says. 
Liberalism  is  a  richly  developed  political  theory 
that  can  live  with  almost  any  extant  major  moral 
theory;  indeed,  many  of  the  great  liberal  writers, 
such  as  Mill,  Hume,  and  Smith,  generally  brought 
individual  morality  and  political  theory  together. 
In  this  combination  seemingly  across  the  two  lev¬ 
els,  utilitarianism  and  liberalism  are  especially 
richly  developed.  Many  other,  often  ad  hoc,  argu¬ 
ments  do  not  catch  on  to  become  genuine  schools 
of  thought.  At  its  height,  in  the  work  of  John 
Stuart  Mill,  Henry  Sidgwick,  and  Leonard  T.  Hob- 
house,  liberalism  was  generally  seen  as  at  one  with 
utilitarianism.  Libertarianism  is  a  limited  version 
of  liberalism,  which  has  many  followers,  such  as 
Nozick.  Alternative  approaches  make  little  head¬ 
way.  For  example,  variants  of  Kantian  liberalism, 
as  in  the  Victorian  writings  of  Thomas  Hill  Green, 
have  virtually  no  strong  contributors  today.  The 
extraordinarily  deep  moral  vision  of  Kant  and  his 
school  does  not  readily  carry  over  to  the  study  of 
political  institutions,  which  seem  not  to  interest 
most  Kantians.  Marxism  is  a  partial  political  the¬ 
ory  that  still  has  many  proponents,  but  its  main 
focus  has  been  economic,  and  its  economic  theory 
largely  fails.  It  has  essentially  no  moral  theory 
adjunct;  indeed,  many  Marxists  reject  standard 
views  of  morality,  as  though  to  say  that  all  social 
relations  are  caused  by  natural  forces,  not  by 
choices  of  free  agents.  Hence,  in  their  view,  choice 
and  morality  are  not  relevant  to  our  explanations. 

It  would  hardly  make  sense  to  discuss  some 
problems  in  our  social  world  without  the  use  of  at 
least  verbal  game  theory,  strategic  analysis,  or  eco¬ 
nomic  models.  Debates  over  how  a  constitution 
comes  into  being  or  how  it  works  once  it  is  enacted 
are  unlikely  to  be  fruitful  without  the  clarity  of  the 
game-theoretic  argument  for  dividing  the  two 
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main  positions  on  this  issue  into  contract  and 
coordination.  Hume  had  a  nearly  miraculous  gift 
for  grasping  strategic  interactions  entirely  verbally. 
Thomas  Schelling  says  he  is  not  a  game  theorist, 
meaning  one  who  contributes  to  the  theory. 
Rather,  he  is  a  practitioner,  using  game  structures 
to  help  clarify  patterns  of  social  interaction. 
Political  theory  needs  such  practitioners. 

The  beginnings  of  liberalism  and  liberal  society 
are  in  the  resolution  of  the  dreadful  conflict 
between  civil  order  and  religious  beliefs.  The 
church’s  pervasive  meddling  in  English  politics 
provoked  arguments  that  became  general  claims 
for  further  attaining  civil  liberties.  Monarchs  in 
England  fought  off  the  church,  but  they  could  not 
indefinitely  fight  off  their  own  citizens.  English  his¬ 
tory  then  informed  debates  in  Philadelphia  in 
1787.  Had  the  Church  followed  its  own  ideology, 
its  minions  would  have  rendered  unto  Caesar  what 
was  Caesar’s.  Instead,  they  hastened  the  growth  of 
liberty  and  liberalism,  which  went  far  toward 
destroying  the  Church.  The  ultimate  move  was  the 
separation  of  church  and  state,  which  we  might 
justly  call  the  necessary  constitutional  liberty. 
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Normative  Theory  in 
International  Relations 


The  theory  of  international  relations  (IR)  is  subdi¬ 
vided  into  an  analytical  field,  which  describes  and 
explains  international  events,  and  a  normative 
field,  which  prescribes  conduct  and  formulates 
moral  judgments.  This  second  field  aims  at  pro¬ 
ducing  ideals  regarding  what  IR  ought  to  be. 

This  entry  first  reviews  the  evolution  of  norma¬ 
tive  theory  as  technological  advances  and  the 
emergence  of  nation-states  created  the  need  for 
new  norms  to  govern  IR.  During  the  past  3 
decades,  norms  in  IR  have  undergone  a  second 
period  of  expansion,  as  a  result  of  increased  inter¬ 
est  on  the  part  of  academics,  along  with  factors 
outside  academia,  including  globalization,  the  end 
of  bipolarity,  and  terrorist  attacks.  After  exploring 
this  development,  the  entry  examines  the  dilemmas 
associated  with  war,  including  the  question  of  how 
to  define  war  and  issues  related  to  decisions  in  the 
aftermath  of  war.  Is  a  military  intervention  neces¬ 
sary  to  protect  victims  of  genocide?  Should  inno¬ 
cents  be  killed  in  the  name  of  political  community? 
Should  soldiers’  duties  end  after  victory?  Next, 
problems  arising  in  the  context  of  global  justice  are 
discussed,  with  a  focus  on  corrective,  distributive, 
and  environmental  justice.  The  origin,  the  nature, 


and  the  extent  of  obligations  in  humanitarian  law 
and  in  social  redistribution  are  discussed,  as  well 
as  new  realms  such  as  protection  of  the  environ¬ 
ment  and  the  use  of  international  sanctions.  The 
entry  concludes  with  a  look  at  the  possibility  of  a 
global  world  order  in  which  national  identities  can 
be  transcended  in  a  postnational,  global  commu¬ 
nity  characterized  by  cosmopolitan  democracy. 
There  are  various  conceptions  of  the  global  politi¬ 
cal  order  regarding  the  creation  of  a  cosmopolitan 
democracy:  Is  the  disjunction  between  modern 
liberty  (focused  on  the  private  sphere)  and  the 
republican  public  sphere  in  democratic  states  even 
greater  in  a  cosmopolitan  democracy?  Is  the  polit¬ 
ical  unity  of  the  world  possible  despite  the  plural¬ 
ity  of  cultures  and  of  ways  of  life? 

In  taking  a  philosophical  approach  to  IR,  nor¬ 
mative  IR  plays  several  roles.  The  first  function  of 
normative  theory  is  to  argue  for  epistemological 
transparency.  Any  scientific  theory  expresses  the 
more  or  less  hidden  normative  preferences  of  the 
analyst,  and  this  can  give  rise  to  the  development 
of  ideologies  in  the  public  sphere.  For  many,  this 
first  function  often  leads  to  a  deadlock.  The  second 
function  of  normative  theory,  which  this  entry 
explores,  is  related  to  orientation.  The  philosopher 
Immanuel  Kant,  to  illustrate  what  he  means  by 
“orientation  in  thinking,”  uses  spatial  indicators — 
the  north,  the  south,  the  east,  and  then  the  west. 
Afterward,  the  same  reasoning  is  applied  to  the 
world  of  logic.  Orientation  in  thinking  means  to 
come  of  age,  to  be  able  to  judge  by  oneself,  to  use 
one’s  common  sense  to  make  the  right  choice.  To 
achieve  this,  shadow  zones  or  fault  lines  have  to  be 
identified.  This  is  a  major  purpose  of  political 
theory,  according  to  John  Dunn  (1996).  He  insists 
that  political  theory  should  consist  in  indicating  “a 
series  of  sharp,  and  as  yet  very  poorly  recognized 
disjunctions”  (p.  33).  The  issues  discussed  in  this 
entry  reveal  points  of  tension  between  the  univer¬ 
sal  and  the  particular  and  among  liberals,  commu¬ 
nitarians,  socialists,  and  pragmatists,  with  the  aim 
of  finding  the  right  relationship  among  them. 

The  Resurgence  of  Normative  Theory 

In  the  1960s,  Stanley  Hoffmann  insisted  that 
political  philosophy  was  in  a  lamentable  state. 
According  to  him,  this  weakness  was  the  result  of 
philosophers’  difficulties  in  applying  their  ideals  at 
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the  international  level.  Martin  Wight  emphasizes  the 
same  issue  when  he  distinguishes  between  a  theory 
of  good  life  ( inside  political  entities)  and  a  theory  of 
survival  ( between  political  entities).  Today,  the  situ¬ 
ation  is  quite  different  for  philosophy,  which  appears 
as  an  “old  thought”  in  IR. 

An  Old  Thought 

The  term  international  was  first  used  by  Jeremy 
Bentham  in  1789.  However,  modern  philosophies 
of  IR  were  born  in  the  Renaissance  under  the  influ¬ 
ence  of  two  processes.  The  first  corresponds  to  the 
transition  from  feudalism  to  the  sovereign  state. 
The  question  of  resort  to  force  became  of  great 
interest.  Private  wars  conducted  without  the 
endorsement  of  the  monarch  were  forbidden,  and 
at  the  same  time  potentas  was  transferred  from  the 
Church  to  temporal  power  (through  the  creation 
of  an  interstate  Law  of  War).  The  principle  of  sov¬ 
ereignty  was  built  against  foreign  powers  (the  Pope 
and  the  Emperor),  and  rules  were  set  up  to  imple¬ 
ment  a  legitimate  practice  of  war  between  states. 
The  second  process  is  related  to  the  “great  discov¬ 
erers.”  Galileo’s  telescope,  Copernicus’s  astro¬ 
nomical  system,  and  Columbus’s  explorations  led 
to  a  much  broader  understanding  of  human  beings 
and  their  world.  With  this  new  recognition  of 
human  diversity  came  questions  about  the  rela¬ 
tionships  among  people.  The  entry  into  modernity 
was  marked  by  a  new  awareness  that  the  time  of 
autarky  was  over. 

Nevertheless,  philosophy  did  exist  before  the 
construction  of  modern  states.  Even  in  ancient  and 
medieval  times,  philosophers  considered  how  to 
define  otherness  through  the  boundaries  of 
churches,  cities,  and  communities.  Stoicism 
defended  a  moral  universalism  that  transcended 
ethnic  differences.  In  the  Middle  Ages,  the  concept 
of  empire  involved  a  pacific  function  that  was 
theologically  justified.  Civic  humanism,  which  first 
appeared  in  the  Italian  peninsula,  would  become 
one  of  the  sources  of  foreign  policies  even  in  Great 
Britain  and  the  United  States.  Thus,  foreign  rela¬ 
tions  came  to  be  theorized  by  normative  dis¬ 
courses,  which  reveal  different  ways  of  considering 
the  boundaries  between  inside  and  outside. 

With  modernity  and  the  rise  of  the  nation-state, 
relations  between  states  become  a  more  complex 
issue  that  became  visible  through  the  constitution 


of  political  science  as  a  new  discipline.  In  1919,  the 
birth  of  IR  as  a  science  brought  back  these  norma¬ 
tive  orientations.  The  first  debate  opposed  liberals 
and  realists.  Liberals  promoted  not  only  Aufklarung 
butalso  “ doux commerce”  (Baron de Montesquieu), 
historical  progress  through  education,  and  the 
peaceful  virtues  of  democracy.  These  were  the  tar¬ 
get  of  realists’  criticism,  which  stressed  the  role  of 
the  sovereign  states  and  their  competition  and 
overshadowed  biological  and  spiritual  factors. 
However,  the  moral  dimension  of  liberal  thought 
was  acknowledged  by  Hans  Morgenthau  himself, 
who  is  known  as  the  father  of  an  approach  that 
separates  morality  from  politics.  As  observers  and 
judges,  political  scientists  have  to  denounce  wrong 
choices  (as  Morgenthau  did  during  the  Vietnam 
War).  As  practitioner,  the  statesman  must  adopt 
an  ethic  of  “lesser  evil,”  which  breaks  with  an  ide¬ 
alist  perspective  without  opting  for  an  ethic  that 
claims  the  absolute  good,  which  is  inaccessible. 
Contrary  to  the  English  school  (Martin  Wight),  the 
major  part  of  the  following  generation  was  strongly 
influenced  by  positivism,  which  was  at  its  peak 
from  the  1980s;  the  behavioralist  moment  has 
departed  from  these  normative  trends. 

Intra-Academic  Factors  in  the 
Resurgence  of  Normative  Theory 

The  first  intra-academic  factor  in  favor  of  the 
development  of  normative  theory  lies  at  the  heart 
of  Western  political  theory.  John  Rawls’s  opus 
magnus,  A  Theory  of  Justice  (1971),  gave  to  nor¬ 
mative  theory  a  second  birth.  Although  Rawls  did 
not  directly  tackle  the  issue  of  international  justice 
in  this  early  work,  several  commentators  did,  thus 
generating  a  broad  consideration  of  the  idea  of 
global  distributive  justice. 

The  second  factor  lies  within  IR  itself.  In  the 
1980s,  critics  of  new  research  on  scientific  realism 
(structural  realism  and  neorealism)  or  liberalism 
broadened  the  basis  of  their  critique  and  ques¬ 
tioned  the  scientific  authority  of  these  works  by 
means  of  philosophy.  The  return  of  norms,  nar¬ 
rowly  speaking  (rules  of  international  law)  and  in 
the  wider  sense  (standards  of  conduct  for  public 
and  private  actors),  is  one  of  their  best  arguments. 
These  approaches  tended  to  explain  the  nature  of 
international  norms  (their  origin,  implementation, 
and  national  integration  and  change)  as  well  as 
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their  impact  on  states’  behavior.  But  above  all, 
they  produced  an  ethical  discourse  on  what  global 
regulation  ought  to  be  that  nourishes  several  sec¬ 
tors  of  the  field — for  example,  humanitarian  inter¬ 
ventions  and  strategic  cultures.  This  is  why  they 
deserve  the  designation  of  ideational  turn.  Such 
tendencies  can  be  considered  as  a  return  to  the 
origin  of  the  field,  as  the  first  interparadigmatic 
debate  between  liberals  and  realists  was  taking 
into  consideration  the  issue  of  norms. 

Extra-Academic  Factors  in  the 
Resurgence  of  Normative  Theory 

Three  extra-academic  factors  are  especially 
noteworthy  in  the  resurgence  of  normative  IR. 
First,  the  process  of  globalization  became  stronger. 
If  the  abolition  of  distances  originates  from  the 
first  phase  of  modernity  (the  Renaissance),  wars 
and  development  in  communication  technologies 
have  strengthened  this  process.  In  this  regard,  glo¬ 
balization  is  characterized  by  a  density  of  net¬ 
works,  a  greater  velocity  of  information  flows,  and 
a  complex  interdependence  that  favors  transna¬ 
tional  participation.  In  such  a  context,  at  what 
level  must  the  accurate  meaning  of  political  action 
be  defined — at  the  national  level  or  beyond? 

Furthermore,  the  end  of  bipolarity,  which  had 
structured  identities  since  1917,  strengthened  the 
already  existing  uncertainties  regarding  the  iden¬ 
tity  of  human  beings,  and  the  delineation  between 
self  and  other  became  increasingly  ambiguous. 
David  Boucher  (1997)  sums  up  this  situation  in  his 
famous  question  “Who  am  I?”  and  adds,  “the 
consequent  politics  of  recognition,  inclusion  and 
exclusion  bring  into  focus  not  only  politics  of  the 
nation,  state  and  cosmopolitanism,  but  also  of 
gender,  race,  religion  and  ethnicity”  (p.  173). 

Finally,  terrorist  attacks  brought  a  new  urgency 
to  normative  IR.  When  an  unpredictable  event 
happens  and  seems  to  brutally  interrupt  the  politi¬ 
cal  and  social  processes,  reflection  is  needed  to  find 
out  to  what  extent  “the  thread  of  tradition  has 
been  broken,”  to  use  Hannah  Arendt’s  expression. 
The  attacks  of  September  11,  2001,  as  well  as  the 
terrorist  assaults  in  Bali  and  Mombasa  (2002)  and 
Madrid  (2004)  are  evidence  of  such  a  breakdown. 
Of  course,  it  would  be  presumptuous  to  consider 
that  the  attacks  in  the  United  States  have  had  the 
same  consequences  that  Nazism  and  the  Holocaust 


had  on  a  whole  generation  of  intellectuals,  such  as 
Theodor  Adorno  and  Hannah  Arendt.  However, 
the  global  media  coverage  of  these  events  has  con¬ 
fronted  both  analysts  and  citizens  with  something 
tragic  that  calls  for  a  meaning.  In  Robert  Keohane’s 
view,  the  9/11  attacks  reemphasized  the  impor¬ 
tance  of  integrating  normative  issues  into  political 
philosophy  and  research  on  IR.  Keohane  empha¬ 
sizes  that  a  basic  function  of  a  liberal  state  is  to 
protect  its  citizens  from  the  fear  of  cruelty  and  that 
helping  people  come  to  terms  with  terrorism  is 
part  of  that  task. 

Dilemmas  of  War 

Contrary  to  pessimistic  expectations,  the  bipolar 
system  of  the  Cold  War  era  ended  without  military 
struggle.  However,  the  collective  euphoria  van¬ 
ished,  and  fears  regarding  the  feasibility  of  setting 
up  a  peaceful  world  order  have  appeared.  Today, 
irregular  wars,  civil  wars  that  evolved  into  ethnic 
cleansing,  and  increasing  military  interventions 
outside  the  United  Nations  (UN)  Security  Council’s 
authorization  all  indicate  a  need  for  moral  judg¬ 
ments.  Thomas  Hobbes’s  world  is  coming  back, 
whereas  it  seemed  that  we  were  living  in  Kant’s. 
But  is  war  still  the  ultima  ratio  legem  between 
political  communities?  We  can  distinguish  three 
kinds  of  dilemmas  regarding  the  nature,  the  justifi¬ 
cation,  and  the  termination  of  war. 

Defining  War 

Modernity  draws  on  a  specific  definition  of 
war:  a  struggle  between  enemies  conceived  as 
political  entities.  For  example,  Jean-Jacques 
Rousseau  gives  a  definition  of  war  in  his  Social 
Contract:  It  is  “a  relation,  not  between  man  and 
man,  but  between  state  and  state,  and  individuals 
are  enemies  only  accidentally,  not  as  men,  nor  even 
as  citizens,  but  as  soldiers”  (Book  1,  chap.  4).  This 
definition  is  the  opposite  of  current  conceptions  of 
war,  which  go  far  beyond  its  first  usage.  Today, 
one  speaks  of  a  war  against  terrorism,  war  against 
poverty,  or  war  between  civilizations.  First,  the 
number  of  violent  actors  has  increased.  The  state  is 
not  the  unique  gladiator  anymore.  Moreover, 
some  states,  and  even  intergovernmental  institu¬ 
tions,  call  on  private  military  societies  to  provide 
security.  Such  a  practice  is  ambiguous:  Do  private 
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actors  benefit  from  a  right  to  kill?  Several  libertar¬ 
ians  subscribe  to  this  possibility  and  are  delighted 
to  see  that  security  is  becoming  a  good,  among 
others.  They  stand  in  opposition  not  only  to  clas¬ 
sical  liberals  but  also  to  radical  anarcho-capitalists, 
who  agree  with  Adam  Smith  on  this  point — that 
wealth  alone  cannot  ensure  peace  because  har¬ 
mony  of  interests  between  states  cannot  be 
achieved.  However,  in  Milton  Friedman’s  opinion, 
states  must  develop  their  own  defense  posture  in 
the  nuclear  era. 

In  addition,  Clausewitzian  logic,  which  subordi¬ 
nates  the  military  to  politics,  would  also  be  called 
into  question.  New  actors  would  put  aside  negotia¬ 
tions  and  compromises  and  instead  advocate  an 
apocalyptic  rhetoric.  States,  particularly  Western 
states,  would  also  resort  to  a  new  approach  to  war. 
They  would  act  not  to  restrain  their  enemies’  will 
but  to  socialize  them  by  punishing  them,  with  the 
aim  of  changing  their  values.  These  post-bipolar 
interventions  are  known  as  policing  wars.  The  “war 
against  terrorism”  as  well,  which  critics  have  called 
a  false  expression  at  a  conceptual  and  an  opera¬ 
tional  level  alike,  fits  in  with  this  logic  of  a  “postna¬ 
tional  war.”  The  underlying  goal  would  consist  in 
achieving  a  global  political  homogeneity  on  the 
basis  of  the  dominant  parties’  values.  This  debate 
on  the  nature  of  war  has  taken  a  critical  aspect 
across  the  Atlantic  with  the  multinational  military 
campaign  known  as  Operation  Iraqi  Freedom. 
Because  this  strategic  action  originated  in  a  neocon¬ 
servative  ideology,  some  analysts  have  pointed  out 
the  influence  of  Leo  Strauss.  Such  interpretation  is 
based,  however,  on  a  misunderstanding  of  Strauss. 
His  assessment  of  modern  liberal  democracy,  espe¬ 
cially  in  the  United  States,  is  deeply  pessimistic,  and 
this  frame  of  mind  is  the  opposite  of  the  current 
neoconservatives’  optimistic  and  demiurgic  mood. 

Justifying  War 

When  is  a  military  intervention  justified?  Is  it 
moral  to  resort  to  force  in  order  to  protect  popula¬ 
tions  that  are  victims  of  oppression?  A  tension 
appears  between  the  principle  of  noninterference  in 
the  internal  affairs  of  a  sovereign  state  and  the  prin¬ 
ciple  of  protection  of  human  rights.  This  tension  is 
all  the  more  acute  when  resort  to  force  in  the  name 
of  universal  values  happens  to  be  selective;  for 
instance,  the  type  of  international  intervention 


considered  in  Bosnia  will  not  be  considered  for 
Chechnya.  How  can  the  dilemma  between  universal 
principles  and  specific  cases  be  resolved?  Does  the 
just  war  tradition,  which  refuses  to  separate  ethics 
from  politics,  still  provide  accurate  orientations? 

Elaborated  by  Saint  Augustine,  the  doctrine  of 
just  war  sets  up  criteria  regarding  the  beginning 
and  the  progress  of  military  interventions:  (a)  just  a 
causa  (intervention  should  aim  at  protecting  life), 
(b)  auctoritas  principi  (only  duly  constituted 
authorities  may  wage  war),  and  (c)  recta  intentio 
(rules  of  law  must  be  observed).  To  adapt  it  to  the 
20th  century,  Michael  Walzer  undertook  a  revi¬ 
sion  of  just  war  theory.  First,  he  refocused  justa 
causa  on  self-determination  and  resistance  to 
aggression  (against  the  territorial  integrity  and 
political  sovereignty  of  an  independent  state).  The 
highest  purpose  of  a  state — or  of  a  community 
threatened  by  a  metropolitan  government — is  to 
defend  a  territory  and  a  way  of  life  (shared  values). 
Thus,  Walzer  subscribes  to  the  British  utilitarian 
ideals  of  the  beginning  of  the  19th  century,  accord¬ 
ing  to  which  democracy  cannot  be  imposed  from 
outside  but  arises  out  of  particular  experiences. 
Moreover,  when  a  community  is  threatened  and 
has  no  power  to  struggle,  as  in  Kosovo  in  1999, 
military  intervention  is  required.  This  is  the  only 
type  of  just  war  acknowledged  by  Walzer. 
Humanitarian  intervention  should  be  a  form  of 
mutual  help  that  occurs  only  after  the  identifica¬ 
tion  of  an  “unbearable”  aggression  and  sufficient 
inadequacies  in  self-defense. 

Second,  Walzer  argues  that  only  a  “supreme 
emergency  exemption”  can  justify  aggression  that 
violates  moral  norms  such  as  the  principle  of  dis¬ 
crimination  between  civilians  and  combatants. 
The  best  example  of  such  justifiable  but  immoral 
action  (“dirty  hands”)  remains  Winston  Churchill’s 
decision  to  bomb  German  towns  in  1940  to  1941 
in  order  to  create  intolerable  conditions  for  the 
population.  Neither  absolutist  nor  utilitarian 
moral  theories  can  explain  such  cases.  Absolutist 
theories  hold  that  moral  rules  or  principles  do  not 
admit  of  any  exceptions;  hence,  there  can  be  no 
justification  in  the  name  of  “innocent  victims.” 
Utilitarian  theories  deny  the  relevance  of  inno¬ 
cence  applied  to  a  part  of  a  population  as  long  as 
the  consequences  of  an  act  can  benefit  the  greatest 
number.  Both  approaches  reveal  weaknesses.  The 
real  question  is  “How  can  we  support  criminal 
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acts  decided  on  by  leaders  without  forgetting  the 
ethical  issues?”  As  long  as  the  action  is  taken  by 
duly  elected  political  leaders  and  the  survival  of  the 
community  is  at  stake,  such  actions  can  be 
endorsed.  However,  this  limits  the  application  of 
the  emergency  exemption;  it  cannot  be  applied  to 
terrorist  practices  and  can  be  used  only  by  duly 
elected  political  leaders.  Even  though  terrorist 
movements  may  claim  to  be  acting  in  the  name  of 
just  a  causa,  they  cannot  endorse  any  legitimate 
authority  or  assume  responsibility  for  actions  that 
are  morally  unjustified.  Thus,  states  have  the 
monopoly  on  “dirty  hands.”  This  pro-state  bias  is 
one  of  the  major  criticisms  formulated  against  this 
theory.  Critics  argue  that  Walzer  is  applying  a 
double  standard,  denying  to  terrorists  what  is 
acceptable  when  done  by  political  leaders.  Thus, 
critics  have  questioned  whether  such  an  approach 
is  based  on  too  much  confidence  in  the  moral  rec¬ 
titude  of  our  leaders. 

After  9/11,  just  war  and  its  underlying  moral 
dilemmas  reappeared  at  two  different  levels.  The 
first  one  regards  the  interpretation  of  Operation 
Iraqi  Freedom:  Was  this  a  humanitarian  interven¬ 
tion  to  end  a  tyrannical  form  of  government? 
Several  analyses  challenge  this  view  on  the  ground 
that  humanitarian  interventions  cannot  be  carried 
out  to  change  the  nature  of  a  regime  but  only  when 
great  atrocities  are  committed.  Beyond  the  ques¬ 
tion  of  the  support  brought  by  the  other  demo¬ 
cratic  states  (desirable  but  not  mandatory  for  those 
who  hold  that  intervention  to  end  tyranny  is 
acceptable,  necessary  for  those  who  believe  that  it 
is  justified  only  in  the  case  of  great  atrocities), 
what  is  called  into  question  here  is  the  goal  of  such 
intervention.  Several  interpreters  have  disagreed 
with  the  rhetoric  of  a  crusade  or  crusading  spirit  to 
justify  the  Iraqi  operation,  invoking  one  of  the 
main  features  of  just  war  theory.  The  case  of  Iraq 
promotes  in-depth  discussion  on  the  ethical  aspects 
of  the  resort  to  force  against  an  indirect  threat. 
International  law  distinguishes  between  preemp¬ 
tive  and  preventive  war.  In  moral  philosophy, 
there  is  a  similar  debate  on  preventive  war  and 
anticipatory  attack.  According  to  Walzer,  anticipa¬ 
tory  attack  would  be  morally  acceptable  only  if 
three  cumulative  elements  are  present:  ( 1 )  an  intent 
to  cause  harm  by  a  “determined  enemy”  (whether 
in  the  present  or  in  the  past),  (2)  a  degree  of  active 
preparation  that  turns  this  intent  into  a  positive 


danger  (e.g.,  building  up  offensive  forces  along  the 
border),  (3)  and  a  situation  in  which  waiting  or 
doing  anything  other  than  fighting  increases  the 
risk  of  being  attacked.  Preventive  war,  on  the  con¬ 
trary,  is  based  on  speculative  arguments  and  is 
aimed  at  maintaining  a  balance  of  power.  It  can¬ 
not,  therefore,  rely  on  moral  values. 

The  second  level  concerns  the  possibility  of 
applying  just  war  theory  to  the  fight  against  terror¬ 
ism.  In  the  Augustinian  tradition,  some  thinkers 
define  terrorist  actions  as  acts  of  war  that  justify 
resort  to  force  against  their  authors.  Again,  the 
“dirty  hands”  issue  arises.  When  a  terrorist  refuses 
to  reveal  information  about  an  attack  against  a 
civilian  population,  the  use  of  torture  has  been 
approved  according  to  the  paradigm  of  the  “tick¬ 
ing  bomb.”  It  stands  for  an  extreme  case  where 
torture  is  justified  so  as  to  prevent  harm  to  citizens. 
This  argument  is  consequentialist  in  its  appeal  to 
the  prevention  of  an  imminent  catastrophe  and  its 
application  of  the  principle  of  cost-benefit  analy¬ 
sis.  Such  a  procedure  must  be  visible,  and  those 
responsible  must  be  held  accountable  to  minimize 
the  frequency  and  the  severity  of  the  practice. 
However,  despite  these  measures  of  control,  the 
practice  of  “dirty  hands”  can  be  excessive  or  may 
favor  double  standards  (discriminatory  interpreta¬ 
tion  of  torture  by  the  actor). 

The  deployment  of  armed  forces  abroad  and  the 
struggle  against  terrorism  in  the  name  of  just  war 
are  not  instances  of  actions  taken  in  accordance 
with  consensual  principles.  Are  we  required  to 
think  on  a  “case-by-case”  basis?  Pragmatism 
argues  that  we  must  do  so. 

Ending  War 

Kant  argues  that  victory  is  not  synonymous 
with  moral  superiority.  Military  victory  changes 
the  balance  of  power  but  not  one’s  moral  commit¬ 
ment.  Kant  adopts  a  symmetric  reasoning:  If  rules 
of  jus  bellum  are  not  respected  at  the  beginning  of 
a  war,  they  will  not  be  respected  after  the  war 
either.  However,  just  war  theory  is  rather  silent 
with  respect  to  the  end  of  war.  What  does  the  post¬ 
conflict  situation  require  from  the  winner,  and 
what  kind  of  dilemmas  are  highlighted  here? 

Once  a  war  has  ended,  five  principles  should  be 
applied  to  avoid  grievances  leading  to  acts  of 
revenge: 
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1.  the  proportionality  (the  fairness  of  the  terms  of 
a  treaty)  and  the  publication  of  peace 
agreements; 

2.  the  recognition  of  civil  rights; 

3.  discrimination  among  leaders,  soldiers,  and 
civilians  when  imposing  sanctions; 

4.  compensation  (financial  restitution,  material 
reconstruction,  etc.);  and 

5.  civil  and  military  rehabilitation  (demilitarization 
and  disarmament,  police  and  judicial  retraining, 
and  human  rights  education). 

However,  can  these  obligations  guarantee  a  just 
peace  and  renew  the  reflection  on  positive  peace? 
A  just  peace  implies  a  “thin”  recognition  of  the 
other,  conceived  as  an  autonomous  entity  from  a 
liberal  standpoint;  a  “thick”  recognition  of  the 
cultural  identity  of  the  other;  the  renunciation  of 
armed  violence;  and  the  establishment  of  rules, 
rights,  and  duties  and  their  formalization  in  a  legal 
document.  The  point  is  to  create  a  new  common 
language  between  different  communities.  Liberals 
have  criticized  this  idea  because  they  define  peace 
in  strictly  legal  terms.  In  other  words,  peace  can 
only  have  a  legal  dimension  in  the  framework  of 
the  existing  international  rules.  This  restrictive 
definition  is  grounded  in  the  “cultural  neutrality” 
of  liberalism  in  the  public  sphere.  Identity  is  there¬ 
fore  a  concept  that  belongs  to  private  life  and  is 
not  acknowledged  in  the  framework  of  diplomatic 
negotiations. 

The  danger  of  neocolonialism  represents  a  sec¬ 
ond  dilemma  if  the  change  of  political  system 
becomes  the  goal  of  the  post-World  War  II  era. 
Historical  examples  (the  Japanese  and  the  German 
experience)  as  well  as  consequentialist  arguments 
(a  government’s  change  is  the  best  way  to  ensure 
security)  may  rule  out  this  danger.  However,  this 
danger  will  always  exist.  The  imposition  of  democ¬ 
racy  by  force  appears  to  be  a  revolutionary  prac¬ 
tice,  all  the  more  dangerous  since  it  is  based  on  a 
strategy  of  exportation  that  excludes  any  other 
way  of  implementing  democracy. 

Dilemmas  of  Global  Justice 

According  to  the  Greco-Roman  tradition,  both 
corrective  and  distributive  justice  can  flourish  only 


inside  political  entities.  By  integrating  jus  post- 
bellum,  the  just  war  theory  has  opened  a  new 
perspective  in  favor  of  international  corrective 
justice,  as  the  creation  of  the  International  Criminal 
Court  (ICC)  shows.  The  road  to  global  justice  gets 
wider  as  other  directions  (e.g.,  environment,  sanc¬ 
tions)  are  being  explored.  Unfortunately,  none  of 
them  have  elaborated  their  own  theory  of  justice. 

Corrective  Justice 

The  20th  century  underwent  two  World  Wars, 
colonial  wars,  and  a  good  deal  of  civil  warfare. 
Perhaps  the  main  consequence  of  this  consists  in 
the  development  of  public  international  law  to 
regulate  the  relationships  between  states  (the 
Geneva  Convention)  and,  after  conflicts,  to  judge 
those  responsible  (e.g.,  the  Nuremberg  and  Tokyo 
trials,  ad  hoc  tribunals,  and  the  ICC).  This  process 
bears  out  Kantian  views  according  to  which  the 
painful  experience  of  war  arouses  the  need  for 
unity  relying  on  a  morally  grounded  system.  In  the 
opinion  of  the  first  president  of  the  Criminal  Court 
for  the  former  Yugoslavia,  Antonio  Cassese,  it  is 
more  and  more  shameful  not  to  be  involved  in  this 
corrective  justice  effort.  However,  this  trend  has 
been  called  into  question.  Behind  this  kind  of  jus¬ 
tice  lies  a  paradox  that  the  German  jurist  Carl 
Schmitt  was  the  first  to  criticize. 

Modernity  produced  a  Jus  Publicum  Europaeum, 
a  set  of  European  rules  produced  in  the  16th  cen¬ 
tury  to  put  an  end  to  the  wars  of  the  Middle  Ages 
and  regulate  the  resort  to  force.  This  transforma¬ 
tion  created  a  new  humanized  “nomos  of  earth”; 
that  is,  a  permanent  principle  of  space  distribution 
that  usually  consists  in  seizing,  dividing,  and 
exploiting  territories.  Modernity  does  allow  hostil¬ 
ity  in  the  acquisition  of  new  territory,  but  this 
practice  must  be  regulated.  Thus,  within  the  new 
international  law  of  the  20th  century,  war  becomes 
illegal  as  a  method  for  resolving  the  competition 
among  states.  The  enemy  becomes  a  criminal:  On 
the  one  hand,  no  negotiations  can  take  place  with 
him;  on  the  other  hand,  he  must  be  punished.  The 
relation  that  Carl  Schmitt  sees  is  the  exact  opposite 
of  the  league  of  states  described  by  Kant.  This 
change  within  the  law  has  given  rise  to  total  wars; 
the  denial  of  hostility  is  the  root  of  absolute  hostil¬ 
ity.  The  goal  of  war  is  not  to  fight  an  enemy  but  to 
annihilate  it  under  the  assumption  that  the  enemy 
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is  guilty.  In  this  view,  Schmitt  sees  a  correlation 
between  the  criminalization  of  the  enemy  and  the 
increasing  resort  to  depersonalized  aerial  wars, 
which  leads  to  the  suppression  of  the  Other. 
Corrective  justice  becomes,  therefore,  an  ideology 
that  promotes  inhumanity  on  behalf  of  humanity. 

Distributive  Justice 

According  to  Dunn  (1996),  a  sharp  contrast  can 
be  seen 

between  a  world  of  mutually  recognizing  Nation 
States  and  conscientious  agencies  of  putative 
international  beneficience,  created,  funded  and 
defended  by  those  states,  and  a  world  of  brutally 
unequal  suffering  and  enjoyment  in  which  vast 
masses  of  the  poor  have  little,  if  any,  prospect  of 
a  happier  existence  even  for  their  children’s 
children,  (p.  33) 

Two  bridges  are  conceived  to  fill  up  this  gap 
and  establish  new  transnational  obligations  toward 
the  weakest  states.  The  first  one  was  conceived 
after  the  1972  food  crisis  in  Bangladesh  by  propo¬ 
nents  of  utilitarianism,  who  argued  that  responsi¬ 
bility  is  based  not  only  on  moral  considerations 
but  also  on  utilitarian  ones.  Passivity  toward  the 
pain  of  another  moral  agent  involves  a  responsibil¬ 
ity,  which  is  identical  to  the  pain  we  would  feel  if 
we  had  caused  it  ourselves.  The  pain  of  other 
moral  agents  creates  a  responsibility  among  oth¬ 
ers.  This  moral  postulate  is  criticized  because  of  its 
excessive  requirements,  which  go  far  beyond 
“agent-centered  prerogatives”  and  free  consent. 
Yet  such  minimalist  reactions  define  international 
aid  as  a  question  of  charity  rather  than  of  justice. 
Other  political  analysts  hold  intermediary  views 
and  draw  on  Kant’s  concept  of  imperfect  obliga¬ 
tion:  Helping  people  in  foreign  countries  does  not 
depend  on  an  agent’s  arbitrary  will,  even  though 
these  agents  decide  the  moment  and  conditions  of 
its  application.  Once  again,  this  third  option 
entails  imperfections:  The  decision  to  act  is  most  of 
all  individual  here;  hence,  it  will  not  necessarily 
meet  the  emergency.  To  tackle  such  issues,  deci¬ 
sions  must  be  taken  by  several  actors. 

The  second  bridge  is  proposed  by  left  Rawlsians. 
They  intend  to  overcome  a  weakness  within  the 
first  postulate,  which  does  not  allow  moral  agents 


to  meet  each  other  (e.g.,  those  who  give  and  those 
who  get).  The  ground  of  the  argument  here  is  the 
application  of  the  entire  Rawlsian  theory  at  the 
international  level.  In  this  view,  states  find  them¬ 
selves  in  an  “original  position”  from  which  they 
must  determine  principles  of  justice  between  states 
in  the  same  way  that  individuals  are  to  choose 
principles  of  justice  independently  of  their  specific 
position  in  society  (i.e.,  independent  of  whether 
states  are  rich,  poor,  etc.).  However,  the  frame¬ 
work  used  in  the  theory  of  justice  is  too  narrow 
and  needs  to  be  enlarged  to  include  public  actors. 
Moreover,  the  two  Rawlsian  principles  of  maxi¬ 
mal  equal  freedom  and  the  differences  that  would 
be  allowed  by  individuals  behind  a  “veil  of  igno¬ 
rance”  are  transferred  to  the  international  level. 
This  extension  is  demanded  by  the  historical  con¬ 
text  and  brings  out  the  development  of  constrain¬ 
ing  institutional  mechanisms.  In  light  of  the  unfair 
economical  inequalities  to  which  they  contribute, 
well-off  states  have  to  subscribe  to  a  consequen- 
tialist  posture  that  compels  them  to  set  up  new 
rules,  especially  in  the  field  of  taxation. 

These  two  last  perspectives,  both  of  which  are 
part  of  the  cosmopolitan  standpoint,  meet  the  same 
challenge:  Can  distributive  justice  emerge  outside  a 
specific  political  framework?  Liberals  and  commu¬ 
nitarians  are  cautious,  and  their  answer  is  “No.” 
Rawls  attacks  these  views  as  misinterpretations: 
Distributive  justice  as  defined  in  Rawls’s  A  Theory 
of  Justice  must  be  experienced  inside  society;  its 
setting  cannot  be  global.  Some  communitarians 
share  this  point  of  view.  They  insist  on  the  idea  of 
affective  proximity.  No  moral  relationship  can 
develop  between  agents  without  this  element.  They 
also  stress  that  each  mode  of  distribution  depends 
on  the  community’s  identity.  Therefore,  interna¬ 
tional  redistribution  does  not  result  from  any  obli¬ 
gation  but  relies  on  the  goodwill  of  rich  people. 

New  Frontiers 

Global  justice  enlarges  its  boundaries  in  three 
directions.  The  first  means  to  integrate  new 
dimensions,  such  as  environmental  justice,  which 
is  based  on  the  “ecological  footprint”  principle.  In 
this  view,  each  of  us  leaves  a  footprint  in  the  envi¬ 
ronment  that  is  determined  by  the  amount  of 
resources  we  require  and  the  pollution  we  contrib¬ 
ute.  This  footprint  may  be  very  large  in  the  case  of 
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armed  conflict,  which  is  often  very  destructive  of 
the  environment.  Understanding  our  moral  obli¬ 
gation  to  the  environment  requires  both  deonto- 
logical  and  consequentialist  approaches.  It 
strengthens  Hans  Jonas’s  “responsibility  princi¬ 
ple”  regarding  future  environmental  degradation 
and  natural  disasters  resulting  from  the  develop¬ 
ment  of  new  techniques.  Such  a  principle  draws 
on  a  “heuristic  of  fear,”  which  conveys  moral 
awareness  of  anticipated  human  threats  to  the 
environment.  Nevertheless,  one  source  of  environ¬ 
mental  justice — political  romanticism  that  pro¬ 
motes  a  “return  to  nature” — remains  a  major 
issue.  Indeed,  this  doctrine  is  restricted  to  the 
damages  caused  by  human  beings  only.  Thus,  this 
anthropocentrist  bias  tends  to  emphasize  human 
pain  resulting  from  environmental  degradation. 

The  second  direction  relies  on  a  post-bipolar 
trend:  the  increasing  use  of  sanctions  against  states 
and  transnational  actors.  Must  we  resort  to  sanc¬ 
tions,  such  as  embargos,  to  compel  agents  to 
respect  the  rules  of  international  law?  Sanctions 
represent  a  third  way,  which  lies  between  not 
responding  to  violence  and  launching  a  military 
intervention.  Morality  speaks  in  favor  of  sanctions. 
According  to  Woodrow  Wilson,  a  sanction  embod¬ 
ies  a  virtuous  foreign  policy.  It  is  the  ideal  tool  for 
the  censor,  and  it  can  have  a  deadly  impact  on  the 
target.  However,  its  detractors  put  forward  three 
arguments  based  on  classical  moral  doctrines.  First, 
sanctions  call  into  question  the  discrimination  prin¬ 
ciple  of  just  war  theory  because  they  cause  pain 
(physically  and  psychologically)  to  innocents. 
Second,  civilians  are  transformed  into  a  means  of 
exerting  pressure  to  shape  political  decisions,  which 
contradicts  the  deontological  ethic  because  such 
pressure  is  coercive.  Third,  sanctions  cause  dam¬ 
ages  to  humans  while  not  achieving  their  political 
goals,  which  is  incompatible  with  utilitarian  logic. 
All  these  critics  benefit  from  a  large  audience, 
which  leads  today  to  an  “abolitionist”  standpoint 
with  respect  to  sanctions.  Most  of  all,  they  express 
the  vivacity  of  contemporary  ethical  reflections 
regarding  international  punishment  measures. 

The  third  dimension  of  the  current  analysis 
aims  at  enlarging  the  boundaries  between  caution 
and  cosmopolitanism.  Could  these  two  postures 
meet  regarding  the  concept  of  negative  obliga¬ 
tions?  That  is  the  aim  of  the  “no-harm”  propo¬ 
nents.  In  IR,  “Do  no  harm  or  cause  prejudice” 


provides  the  basis  for  a  moral  consensus  between 
cultures.  These  statements  echo  Judith  Shklar’s 
“liberalism  of  fear,”  which  also  involves  universal 
features,  as  each  of  us  can  share  an  emotion  that 
paralyzes  the  whole  being  in  the  face  of  cruelty. 
The  do-no-harm  principle  is  criticized  because  it  is 
sometimes  considered  incomplete.  It  lacks  positive 
obligations,  such  as  the  duty  to  rescue.  Several  of 
its  defenders  emphasize  that  no  harm  is  a  first  step 
to  building  benevolence  (positivity  is  therefore 
delayed  but  not  ruled  out).  Besides,  the  fact  of 
being  aware  of  the  vulnerability  of  “distant  for¬ 
eigners”  helps  develop  an  attachment  to  humanity. 
Thus,  cosmopolitanism  sets  up  a  link  between  the 
principle  of  justice  and  the  implementation  of  a 
world  community,  as  discussed  in  the  following 
section. 

Dilemmas  of  Global  Political  Order 

Political  theory  of  IR  deals  with  the  emergence  of 
a  world  community:  how  a  political  community 
becomes  more  and  more  inclusive.  Can  it  eventu¬ 
ally  embody  the  whole  of  humanity?  As  such,  the 
very  idea  of  a  world  community  calls  into  question 
the  Westphalian  order  as  well  as  the  international 
order  defined  by  Hobbes.  This  theory  also  conveys 
to  us  a  universalism  that  transcends  the  diversity  of 
cultural  expressions. 

Universum/Pluriversum: 

Two  Emblematic  Dualities 

When  refering  to  a  civitas  gentium  maxima  in 
the  18th  century,  Christian  Wolf  had  in  mind  a 
fiction  gathering  all  human  beings  inside  the  same 
political  unit.  Three  centuries  later,  his  dream 
remains  attractive.  This  emergence  of  a  concrete 
world  community  gives  rise  to  two  main  philo¬ 
sophical  debates. 

Kojeve/Strauss 

After  Georg  Hegel,  Alexandre  Kojeve  describes 
the  “end  of  history,”  which  means  the  end  of  poli¬ 
tics,  defined  as  a  state  of  conflictual  interaction 
between  states.  A  universal  and  homogeneous 
state  emerges,  and  individual  states  become  its 
federated  components.  Public  international  law 
becomes  domestic  law  for  members  who  share  the 
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same  way  of  being  and  thinking.  The  wise  man  is 
involved  in  this  centralization  process.  He  becomes 
the  prince’s  advisor  because,  historically,  thinking 
is  synonymous  with  improving  efficiency.  Leo 
Strauss  challenges  this  idea  in  two  ways.  First,  he 
tries  to  keep  philosophy  apart  from  history,  espe¬ 
cially  from  the  instrumentalist  deviation  of  politi¬ 
cal  leaders.  Indeed,  the  rulers  of  the  universal  state 
may  attempt  to  provide  wisdom  to  all  and  practice 
tyranny.  Second,  the  idea  of  the  universal  state 
meets  with  two  obstacles:  (1)  the  irreducible  plu¬ 
rality  of  political  systems  (they  arise  out  of  specific 
morals  that  cannot  be  imported)  and  (2)  that  an 
ideal  system  is  not  achievable  at  the  world  level  (it 
remains  a  regulating  idea). 

Schmitt/Kelsen 

These  two  lawyers  often  disagree  regarding  con¬ 
stitutional  issues  in  modern  states.  They  also  have 
opposite  views  regarding  IR.  Hans  Kelsen  criticizes 
the  “dogma  of  sovereignty,”  a  concept  doomed  to 
be  overshadowed  by  the  history  of  justice.  Kelsen’s 
normativism  draws  a  parallel  between  relation¬ 
ships  among  states  and  interindividual  relation¬ 
ships.  The  creation  of  a  third  party  in  primitive 
tribes  and  of  courts  of  law  in  states  allowed  the 
solving  of  disputes  between  individuals.  These 
procedures  stand  for  the  foundation  of  the  com¬ 
munity.  Likewise,  the  creation  of  international 
criminal  justice  mechanisms  is  the  mainspring  of 
the  international  community.  The  project  pro¬ 
posed  by  Kelsen  to  the  UN,  therefore,  gives  prece¬ 
dence  to  a  jurisdiction  and  not  to  the  Security 
Council.  Carl  Schmitt  states  that  such  ideas  draw 
on  an  Anglo-Saxon  (belonging  to  English-speaking 
countries)  framework  that  calls  into  question  the 
jus  publicum  Europaeum.  Theologically,  the  “one 
world”  embodies  the  Antichrist.  Indeed,  univer- 
sum  means  to  achieve  what  cannot  be  achieved 
from  a  human  standpoint.  The  Tower  of  Babel  is 
an  illusion,  and  we  have  to  get  rid  of  it.  It  is  an 
expression  of  the  secularization  of  the  mind  often 
pointed  out  by  Carl  Schmitt.  Through  universum , 
human  beings  try  to  put  themselves  in  God’s  place. 
At  a  political  level,  Schmitt  contends  that  it  is 
impossible  to  create  a  federal  state  at  a  world  level. 
Such  federal  state  lacks  the  most  important  feature 
of  a  state:  homogeneity.  Moreover,  this  project  of 


unity  forgets  the  underlying  elements  of  politics: 
the  struggle  for  power  between  small  unities 
(empires  in  the  20th  century). 

These  two  approaches  have  nourished  debates 
among  political  scientists  and  have  given  rise  to  a 
wide  range  of  reactions,  (e.g.,  Arendt).  They  also 
show  how  political  views  on  universal  and  plural¬ 
ity  can  be  different  inside  the  same  culture.  These 
debates  are  still  going  on,  especially  regarding  the 
idea  of  cosmopolitan  democracy. 

A  Cosmopolitan  Democracy 

Cosmopolitan  democracy’s  program  aims  at  re¬ 
actualizing  our  understanding  of  Kant  by  inviting 
us  to  go  beyond  it.  The  Kantian  project  of  perpet¬ 
ual  peace  consists  of  three  articles.  First,  a  civil 
constitution  must  be  republican.  Contrary  to  des¬ 
potism,  which  imposes  its  political  ideas  and  con¬ 
ception  of  happiness  on  everyone,  the  republic  is 
synonymous  with  separation  of  powers,  protection 
of  freedom,  and  guarantees  of  peace,  thanks  to  the 
general  will  of  the  people.  The  second  article  aims 
at  implementing  a  peace  treaty  between  states 
(mutual  nonagression  through  institutional  means). 
Such  a  treaty  does  not  lead  to  the  creation  of  a 
supranational  state  but  rather  to  the  recognition 
by  each  state  of  the  sovereignty  of  the  others:  in 
other  words,  a  confederation  of  free  states.  The 
third  and  last  article  introduces  the  idea  of  a  cos¬ 
mopolitan  law  based  on  universal  hospitality. 
Thus,  Kant  insists  that  foreigners  have  the  right  to 
be  treated  without  hostility  in  another  state. 

In  Jurgen  Habermas’s  view,  this  project  is  con¬ 
tradictory,  inconsequential,  and  inadequate.  If  Kant 
intends  to  create  a  cosmopolitan  state,  he  does  not 
call  into  question  states’  autonomy.  Kant  does  not 
foresee  the  adoption  of  a  constitution  that  would 
make  rules  legally  binding.  The  inconsistency  of  the 
Kantian  position  results  from  the  lack  of  coherence 
between  the  cosmopolitan  state  and  the  confederal 
architecture.  Ultimately,  the  nature  of  the  latter  is 
quite  different  from  what  Kant  had  first  in  mind:  a 
world  citizenship.  Individuals  keep  depending  on 
the  state.  Kant  does  not  remain  faithful  to  his  indi¬ 
vidualistic  and  liberal  philosophy.  Finally,  accord¬ 
ing  to  Habermas,  Kant’s  views  are  inadequate 
because  they  are  mainly  the  expression  of  the  con¬ 
ceptual  and  political  reality  of  the  1 8th  century. 


Normative  Theory  in  International  Relations  1735 


Habermas  revises  this  project  in  three  ways:  (1)  he 
conducts  a  criticism  of  the  nation-state,  (2)  he  draws 
on  the  creation  of  a  postnational  identity,  and 
(3)  he  transfers  this  identity  to  the  global  level.  The 
nation-state  is  condemned  as  a  political  organiza¬ 
tion.  It  is  unadapted  to  the  globalization  process 
and  is  a  source  of  bloody  conflicts.  Moreover,  the 
link  between  republic  and  nation  is  more  sociopsy- 
chological  than  conceptual.  It  arises  out  of  a  his¬ 
torical  process  (a  catalyst)  and  the  individual’s 
mentality  (sacrifice  to  protect  the  homeland). 

Habermas  explores  the  link  between  republic 
and  nationality.  He  compares  the  liberty  of  the 
state  with  individual  freedom.  The  latter  is  a  right 
that  is  separate  from  the  national  conscience,  as 
modern  natural  law  suggests  it.  This  separation 
allows  the  formation  of  a  new  supranational  iden¬ 
tity  that  articulates  national  identity  (culture  with¬ 
out  any  political  dimension)  and  constitutional 
patriotism  (universalization  of  human  rights  and 
democracy).  Habermas  contends  to  transform  this 
identity  into  an  association  of  free  and  equal  cos¬ 
mopolitans.  An  ethics  of  discussion  is  developed 
beyond  borders  thanks  to  intersubjectivity.  This 
ethics  is  based  on  concrete  situations  (to  see  and  to 
manage  the  state  of  unequality  and/or  distress 
experienced  by  individuals).  Influenced  by  these 
views,  David  Held  aims  at  strengthening  a  “robust 
political  cosmopolitism”  grounded  on  a  multisca¬ 
lar  democracy,  which  occurs  at  two  levels:  decision 
taking  on  the  one  hand,  citizenship  on  the  other. 
So  as  to  be  legitimized  but  also  efficient,  public 
decisions  must  be  taken  at  the  more  adapted  level. 
Three  criteria  allow  verification  of  (1)  expanse  (to 
assess  the  field  of  individuals  whose  life  expec¬ 
tancy  and  chance  of  survival  are  affected  by  public 
choices),  (2)  intensity  (to  measure  the  impact  of  a 
public  policy  on  a  given  group),  and  (3)  compared 
efficiency  (to  determine  the  necessity  of  a  regional 
or  world  intervention  compared  with  an  interven¬ 
tion  at  a  local  level).  A  concept  of  “differentiated 
sovereignty”  emerges  from  Held’s  thought.  The 
subsidiary  principle  must  be  called  on  to  determine 
what  level  is  the  most  accurate.  Held  also  intro¬ 
duces  the  idea  of  a  multilevel  citizenship  that  relies 
not  solely  on  territorial  community.  Voting  is  no 
longer  the  unique  expression  of  citizenship.  To 
carry  weight  on  decisions  and  to  practice  account¬ 
ability,  for  instance,  are  other  ways  of  exerting 


citizenship.  Therefore,  Held’s  cosmopolitanism 
maintains  the  nation-state  but  invites  going  beyond 
it  through  new  public  commitments. 

Reserves  and  Alternative  Conceptions 

Several  criticisms  have  been  formulated  against 
this  cosmopolitan  democracy.  These  criticisms 
have  not  been  limited  to  the  utopian  features 
underlying  the  latter  and  come  from  pluralistic 
traditions  (liberal,  communitarian,  or  socialist). 
From  a  practical  standpoint,  the  deliberative  activ¬ 
ity  characteristic  of  democracy  arose  out  of  a  com¬ 
mon  political  culture  that  has  developed  over  time 
and  has  required  many  sacrifices.  Considered  in 
the  light  of  the  national  construction  process  of  the 
1 9th  century,  it  appears  that  a  political  culture 
results  from  the  political  institutions  that  produce 
culture  and  not  the  reverse.  Outside  this  frame¬ 
work,  citizenship  is  fragile  and  appears  as  an 
abstraction.  In  the  second  case,  cosmopolitism 
would  hide  imperial  temptations. 

Ontologically,  society  is  synonymous  with  plu¬ 
rality.  Rawls  holds  this  view  when  he  analyzes  the 
extension  of  the  justice  principle  outside  liberal 
societies.  He  challenges  the  idea  of  a  unity  of  the 
world.  There  will  always  be  different  peoples  (e.g., 
liberal  people,  decent  people),  and  rogue  states  will 
always  exist.  Therefore,  a  law  of  peoples  could  be 
implemented  but  not  cosmopolitism  itself. 

Alternative  conceptions  draw  on  other  frame¬ 
works.  Thus,  Walzer  claims  a  pluralism  of  high 
density  that  controls  states’  action  through  three 
kind  of  agents — international  institutions,  non¬ 
governmental  organizations,  and  regional  unions. 
This  order  is  midway  between  a  unified  world 
state  (unrealistic  and  dangerous  in  case  of  author¬ 
itarian  deviation)  and  a  state  of  savage  interna¬ 
tional  anarchy  (as  much  a  threat  as  the  unification 
of  the  world).  Departing  from  this  conception, 
Amitai  Etzioni  states  that  the  international  order 
must  be  built  on  persuasion.  In  his  opinion,  the 
world  dynamic  will  achieve  what  he  calls  a  “com¬ 
munity  of  communities,”  which  is  a  source  of 
order.  Only  imagined  until  now,  this  global  com¬ 
munity  is  likely  to  be  effective.  Indeed,  states  are 
adopting  a  new  logic  of  responsibility  and  new 
diplomatic  practices,  which  favor  a  transnational 
moral  dialogue  between  individuals.  Ultimately, 
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global  safety  authorities  are  called  on:  They  share 
not  only  moral  values  and  costs  but  also  the 
power  to  struggle  against  traditional  and  nontra- 
ditional  threats.  However,  such  a  trend  has  to 
cope  with  the  U.S.  leadership — often  tempted  by 
imperial  adventures — and  with  the  lack  of  power 
of  the  UN,  characterized  by  Etzioni  as  the  “old 
regime.” 

Republicanism  is  at  the  source  of  another  trend. 
The  very  idea  of  the  republic  challenges  that  of 
dominium  (to  possess  the  subjects’  or  citizens’ 
bodies  as  the  feudal  lord  did)  and  of  imperium  (the 
power  of  authority  that  relied  on  force).  The 
republic  aims  at  achieving  civic  liberty  and  at 
developing  participation  in  the  public  sphere.  This 
political  model  was  associated  with  a  defensive 
foreign  policy  by  Abbe  de  Saint-Pierre,  Montes¬ 
quieu,  and  Rousseau  in  their  description  of  a  con¬ 
federation  of  small  republics.  Too  large,  and 
republics  are  exposed  to  domestic  discords.  Too 
small,  and  they  are  threatened  by  their  powerful 
neighbors.  How  can  republics  maintain  their 
political  unity?  Through  the  restriction  of  their  ter¬ 
ritory  and  the  reinforcement  of  their  military 
capacities,  which  are  the  main  advantages  of  these 
confederations.  Their  foreign  policies  are  reactive 
rather  than  aggressive.  Thus,  this  ideal  remains 
very  close  to  the  autarky  of  the  Ancients.  Moreover 
in  Rousseau’s  opinion,  its  transfer  outside  a  spe¬ 
cific  setting — the  Helvetic  confederation — remains 
hazardous.  Nevertheless,  republicanism  is  still  a 
source  of  inspiration  today.  In  an  economic  global¬ 
ization  context,  its  critique  of  liberalism  creates 
new  interest.  Overall,  republicanism  promotes 
values  of  public  virtue  and  civic  conscience  that 
minimize  the  arbitrary  exercise  of  power  and  the 
impacts  on  foreign  policy,  which  becomes  non¬ 
dominating.  Thus,  a  global  civic  republicanism 
could  provide  new  rules  for  interstate  cooperation. 

Conclusion 

Fault  lines  regarding  war,  justice,  or  global  order 
nourish  many  debates.  Nevertheless,  two  main 
trends  have  appeared:  The  first  claims  to  revise 
major  political  and  ethical  issues,  while  the  second 
aims  to  create  new  ideas  regarding  current  chal¬ 
lenges.  Nowadays,  a  third  trend  consists  in  decen¬ 
tering  our  philosophical  standpoint  by  integrating 
ideas  from  non-Western  cultures.  But  all  these 


inquiries  share  the  same  questions  and  challenges: 
What  is  humanity?  Does  it  stand  for  an  idea  that 
is  becoming  a  norm,  a  set  of  concrete  rules,  or  an 
ideal  that  influences  action?  Is  it  a  source  of  besti¬ 
ality  or  an  appeal  to  mutual  kindness?  A  group 
called  to  build  a  world  political  culture  or  a 
reduced  community  that  must  remain  particular? 
Normative  theories  of  IR  will  continue  to  provide 
different  answers.  Thus,  these  debates  should  not 
be  tackled  by  Greek  “garden  philosophers,”  with¬ 
drawn  into  the  private  sphere,  but  by  philosophers 
open  to  the  public  sphere,  who  get  to  the  heart  of 
contemporary  issues. 
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Oligarchy 


The  concept  of  oligarchy  repeatedly  appears  in 
political  and  common  discourse  from  ancient 
Greece  and  up  through  modern  times.  Practically 
all  reference  sources  provide  similar  definitions  of 
oligarchy — rule  of  the  few  in  their  own  interests 
and  not  in  the  interest  of  the  majority  or  the  pub¬ 
lic  good.  Etymology  confirms  this  connotation: 
oligos  —  few,  arche  —  rule  (ancient  Greek).  From 
Aristotle  comes  the  initial  meaning  of  oligarchy — 
it  is  a  “degeneration”  of  aristocracy  as  the  form  of 
rule;  oligarchy  is  an  “incorrect”  and  “spoilt”  aris¬ 
tocracy  when  the  few  rule  exclusively  in  their  own 
interests.  The  notion  of  oligarchy  can  be  found  in 
the  writings  of  Plato,  Polybius,  and  their  contem¬ 
poraries  and  later  in  Niccolo  Machiavelli.  It  has 
been  used  in  modern  social  science  by  Moisei 
Ostrogorsky,  Vilfredo  Pareto,  Gaetano  Mosca, 
Robert  Michels,  and  others.  The  term  oligarchy  is 
applied  in  political  and  nonpolitical  contexts — 
political  regimes  and  political  parties,  trade 
unions,  church  organizations,  educational  estab¬ 
lishments.  This  entry  presents  the  contemporary 
and  classic  meanings  of  the  concept,  the  main 
existing  theory,  and  its  further  development  in 
post-Communist  countries. 

Who  Are  the  “Few”? 

From  today’s  point  of  view,  one  can  say  that  the 
“few”  who,  according  to  the  logic  of  the  notion, 
form  the  backbone  of  oligarchy  may  actually 


represent  very  different  groups — slave  owners, 
landlords,  nobility,  the  rich  and  wealthy,  the  top 
brass  of  any  kind,  party  bureaucracy,  and  so  on. 
Rule  of  the  rich  and  wealthy  (“plutocracy”)  may 
be  only  one  of  the  meanings  of  the  notion  of  oli¬ 
garchy.  In  economic  science,  the  closest  analogue 
of  oligarchy  is  oligopoly:  a  situation  in  which  only 
a  few  large-scale  “players”  dominate  an  economy 
and  are  practically  independent  of  others  who  can¬ 
not  compete  with  them  on  an  equal  footing. 

Oligarchic  rule  is  not  necessarily  dictatorship 
and  arbitrariness.  While  one  of  the  major  charac¬ 
teristics  of  oligarchy  is  nonaccountability  of  power, 
it  may  in  a  way  rely  on  law  (i.e.,  be  legal);  how¬ 
ever,  this  law  is  in  the  interests  of  the  few.  Similar 
to  monarchies,  oligarchic  regimes  may  even  be 
considered  by  some  as  legitimate,  although  their 
legitimacy  does  not  rest  on  popular  consent  and 
support.  An  oligarchic  regime,  unlike  personal  dic¬ 
tatorship,  may  be  partly  predictable;  it  defines  the 
limits  of  the  politically  permissible  and  the  forbid¬ 
den  in  terms  of  the  critique  and  opposition.  It  is 
usually  said  that  the  oligarchs  “rule  but  not  man¬ 
age” — they  would  seldom  occupy  official  positions 
in  government,  but  their  opinion  is  crucial  because 
of  their  power  and  influence  (economic  and  other). 
There  may  be  internal  tensions  and  conflicts 
within  the  oligarchic  system;  this  points  to  the 
importance  for  the  oligarchs  of  the  existence  of  an 
external  arbiter  who  is  accepted  as  such  by  them. 

The  term  oligarchy  is  used  widely  today:  how¬ 
ever,  more  likely  not  as  an  analytical  concept  but 
as  a  descriptive  image.  From  the  point  of  view  of 
modern  political  science,  it  is  a  relatively  vague 
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and  amorphous  notion  with  clear  negative  evalua¬ 
tive  connotations.  Many  critics  consider  the  notion 
of  oligarchy  to  be  too  reductive  or  even  too  poor 
for  effective  political  analysis,  with  insufficient 
theoretical  base. 

In  modern  social  science  and  historical  litera¬ 
ture,  this  notion  is  often  used  to  describe  very  dif¬ 
ferent  historical  and  political  phenomena — such  as 
medieval  oligarchy  in  China;  the  Venice  republic  in 
the  1 3th  to  1 6th  centuries,  ruled  by  a  few  patrician 
families;  England  from  the  Glorious  Revolution 
until  the  middle  of  the  19th  century,  with  the 
Magna  Carta  guaranteeing  the  rights  of  the  nobility 
(i.e.,  the  rights  of  the  oligarchy);  colonial  America 
until  the  1776  revolution  as  a  combination  of 
democratic  and  oligarchic  elements;  the  Meiji 
Restoration  in  Japan,  and  so  on.  Some  authors 
apply  the  notion  of  oligarchy  to  the  political 
regime  in  the  former  USSR  after  Stalin  (and  espe¬ 
cially  under  Brezhnev’s  so-called  collective  leader¬ 
ship),  which  was  no  longer  a  personal  dictatorship 
but  the  rule  of  the  privileged  “nomenclatura.” 
South  Africa  under  the  apartheid  system  provides 
an  example  of  racial  oligarchy  based  on  the  rule  of 
the  White  minority — about  20%  of  the  popula¬ 
tion.  In  Latin  America,  various  military-latifundist 
political  regimes  are  often  described  as  oligarchic. 
Some  neo-Marxists  argue  that  the  majority  of 
Western  democracies,  especially  the  United  States, 
are  actually  “oligarchic  democracies”  (or  “repre¬ 
sentative  oligarchies”)  where  “big  money”  de  facto 
determines  the  outcome  of  elections. 

There  are  impressive  case  studies  of  political 
dynamics  in  various  countries  of  Southeast  Asia 
(Indonesia,  Thailand,  Malaysia,  Singapore,  the 
Philippines,  Hong  Kong,  South  Korea)  that  dem¬ 
onstrate  how  oligarchic  systems  based  on  the 
economic  power  of  several  dozen  families  or  cor¬ 
porate  clans  adapt  to  political  change  and  eco¬ 
nomic  reforms  and  reproduce  themselves  within  a 
new  context.  The  case  of  Indonesia  after  the  col¬ 
lapse  of  Suharto’s  New  Order  at  the  end  of  the 
1990s  may  be  a  good  example  of  the  survival  and 
reproduction  of  some  basic  structures  of  the  oli¬ 
garchic  system  of  rule  and  power  relationships 
irrespective  of  the  ongoing  reforms.  Some  research 
has  registered  oligarchic  trends  in  a  variety  of 
large-scale  organizations — such  as  political  parties, 
trade  unions,  professional  associations,  educa¬ 
tional  systems,  and  clerical  organizations. 


The  Existing  Theories 

To  the  extent  that  a  “theory  of  oligarchy”  exists, 
its  classic  form  is  that  of  Michels  (1876-1936).  In 
his  analysis  of  the  formation  and  evolution  of  the 
Social  Democratic  Party  of  Germany,  he  formu¬ 
lated  the  so-called  iron  law  of  oligarchy:  “Who 
says  organization,  says  oligarchy.”  According  to 
Michels,  oligarchy  is  an  unavoidable  consequence 
of  the  functioning  of  all  and  sundry  large  social 
and  political  organizations.  His  logic  is  as  follows: 
For  effective  management  of  any  large  organiza¬ 
tion,  a  professional  bureaucratic  elite  with  special 
managerial  competencies  is  needed;  this  elite 
acquires  specific  social  status  and  interests  and 
becomes  estranged  from  the  rank-and-file  mem¬ 
bers  and  nonaccountable  to  them;  this  elite  believes 
that  it  knows  better  what  is  good  and  what  is  bad 
for  the  masses.  Even  when  elected,  the  bureau¬ 
cratic  elite  will  try  to  keep  its  position  or  will  pass 
it  only  to  other  elites  (the  influence  of  Mosca  and 
particularly  Pareto  with  his  “circulation  of  elites” 
is  obvious  here).  Michels  argues  that  even  demo¬ 
cratic  organizations  eventually  degenerate  into 
oligarchy  since  they  cannot  exist  without  bureau¬ 
cratic  structures  of  management  that  eventually 
turn  oligarchic.  This  thesis  actually  questions  the 
basic  principle  of  democratic  representation,  since, 
according  to  Michels,  any  elected  representative 
will  sooner  or  later  turn  into  a  master  estranged 
from  the  electors. 

For  Michels,  the  roots  of  the  emergence  of  oli¬ 
garchy  are  not  in  personal  psychological  motiva¬ 
tion  or  ill  will  but  in  general  organizational  logic 
and  managerial  technology:  Professionalism  of 
management  and  administration  requires  specific 
skills;  those  who  acquire  them  become  estranged 
from  the  masses  and  strive  for  preservation  of 
their  status.  Thus,  according  to  Michels,  any  orga¬ 
nization,  due  to  its  internal  logic,  will  sooner  or 
later  degenerate  into  oligarchy. 

Some  of  Michels’s  ideas  were  further  devel¬ 
oped  by  Maurice  Duverger,  who  used  a  compara¬ 
tive  approach  to  the  analysis  of  different  political 
parties  in  order  to  confirm  the  generic  logic  of 
“oligarchic  degeneration” — leadership  in  all  par¬ 
ties  eventually  becomes  oligarchic;  an  isolated 
and  closed  “ruling  strata”  of  professional  bureau¬ 
crats  emerges  and  tends  to  reproduce  itself,  irre¬ 
spective  of  the  ideological  or  political  nature  of 
the  organization. 
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Today’s  academic  community  responds  to 
Michels’s  theoretical  heritage  somewhat  ambigu¬ 
ously.  Some  empirical  research  of  large  organizations 
seems  to  confirm  his  idea  of  oligarchic  degenera¬ 
tion;  other  studies  do  not.  However,  even  some 
empirically  proved  exceptions  undermine  the  uni¬ 
versal  nature  of  the  declared  “iron  law  of  oligar¬ 
chy.”  It  is  hardly  possible  to  falsify  this  “law,” 
since  it  is  based  on  the  probabilistic  assumption 
that  any  organization  would  “eventually”  turn 
into  oligarchy.  Furthermore,  today’s  criticism  of 
Michels’s  theory  underlines  other  vulnerable 
points,  such  as  the  amorphous  nature  of  the  notion 
of  the  “few”  as  a  definition  of  the  group  of  oligar¬ 
chic  leaders  and  the  vagueness  of  the  concept  of 
“oligarchic  tendencies”  and  its  insufficiency  for 
effective  political  analysis.  Conceptual  reevalua¬ 
tion  of  Michels  in  recent  decades  contributed  to 
the  growing  interest  in  the  theories  of  organiza¬ 
tions,  organizational  logic,  and  organizational 
behavior  and  in  the  detailed  analysis  of  mecha¬ 
nisms  of  influence,  leadership,  and  submission. 

The  Theory  Updated  in 

the  Postcommunist  Era 

The  notion  of  oligarchy  has  acquired  new  circula¬ 
tion  and  popularity  in  the  early  1990s  as  applied 
to  a  description  of  economic  and  political  dynam¬ 
ics  in  Russia  and  some  other  post-Soviet  Union 
countries.  Its  connotation  became  not  simply 
negative  but  even  pejorative  and  abusive:  History’s 
largest  predatory  privatization  of  state  property 
(“grabization”)  in  Russia  was  made  possible  only 
through  governmental  channels  because  of  the 
“appointments”  of  a  few  new  oligarchs  by  Presi¬ 
dent  Boris  Yeltsin’s  “clan”  (the  so-called  family) 
and  their  merging  with  state  power.  The  outcome 
of  this  process  was  the  emergence  in  Russia  of 
“oligarchic  capitalism” — a  common  noun  for  the 
fusion  of  superbig  capital  with  the  machinery  of 
the  state.  The  peculiarity  of  the  situation  in  Russia 
is  emphasized  by  the  fact  that  state  power  itself 
has  created  the  opportunity  for  the  oligarchic  sys¬ 
tem  to  emerge,  since  it  was  the  state  that  had  given 
out  its  former  property  to  a  few  “confidants,” 
counting  on  their  subsequent  financial  support. 
This  support,  indeed,  became  one  of  the  impor¬ 
tant  factors  for  the  reelection  of  Yeltsin  in  1996. 
Thus,  the  emergence  of  oligarchic  capitalism  in 


Russia  is  often  perceived  as  very  specific:  In  the 
Russian  case,  you  have  not  the  supremacy  of  big 
capital  over  state  power  but  the  other  way 
around — state  power  is  creating  the  superbig  pri¬ 
vate  owners  for  the  sake  of  its  reproduction.  Some 
analysts  argue  that  the  oligarchic  system  of 
patronage  and  clientelism  in  Russia  at  the  federal 
and  regional  levels  was  created  by  the  regime 
itself  for  the  main  purpose  of  securing  financial 
support  for  its  reproduction  in  democratic,  even 
imitative,  forms.  Also,  this  was  made  possible  in 
Russia  during  a  period  of  economic  recession,  not 
during  a  time  of  economic  growth,  as  in  other 
contemporary  cases  of  oligarchy  coming  into 
being.  Various  researches  stress  the  absence  of 
formal  institutions  and  the  informal  character  of 
ties  between  Russian  oligarchs  and  state  power. 
They  also  point  to  the  fragmentary  nature  of 
Russian  oligarchy  with  different  and  often  con¬ 
flicting  interests,  its  noninstitutionalized  charac¬ 
ter,  and  other  aspects. 

At  the  end  of  the  1990s,  some  political  analysts 
even  started  to  talk  about  the  danger  of  the  “oli¬ 
garchic  coup”  in  Russia.  At  least  an  important 
part  of  the  massive  support  for  Vladimir  Putin  in 
Russia  in  the  early  2000s  came  from  his  “antioli- 
garchic”  rhetoric.  However,  many  observers 
argue  that  the  outcome  of  Putin’s  so-called  antio- 
ligarchic  revolution  was  the  replacement  of 
Yeltsin-era  oligarchs  by  new  ones — largely  origi¬ 
nating  from  the  security  services  and  other 
“power”  structures  (so-called  silogarchs — from 
the  Russian  word  sila  —  power — or  securocrats, 
i.e.,  coming  from  various  security  agencies).  Some 
argue  that  the  paradigm  of  “oligarchic  capital¬ 
ism”  that  emerged  under  Boris  Yeltsin  is  repro¬ 
duced  in  Russia  under  Vladimir  Putin  and  Dmitry 
Medvedev.  Be  that  as  it  may,  one  should  note  that 
this  concept  of  “oligarchic  capitalism”  with  refer¬ 
ence  to  Russia  and  some  other  post-Soviet  coun¬ 
tries  needs  further  theoretical  and  experimental 
study. 
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One-Party  Dominance 


There  is  no  consensus  on  what  constitutes  one- 
party  dominance,  but  two  features  stand  out. 
First,  a  party  becomes  dominant  when,  over  time, 
it  is  much  more  successful  in  elections,  in  parlia¬ 
ment,  and  in  the  government  than  any  other 
party.  How  such  success  is  measured  and  for  how 
long  the  party  needs  to  be  successful  in  order  to 
qualify  as  a  dominant  party  is  debated.  The 
threshold  for  time  ranges  from  two  elections  won 
in  a  row  to  never  losing  a  single  election.  The 
threshold  for  electoral  success  ranges  from  a  plu¬ 
rality  of  votes  to  a  supermajority  of  seats.  Second, 
it  matters  how  dominance  is  established.  Does  the 
ruling  party  enjoy  genuine  support  in  the  popula¬ 
tion,  or  does  it  maintain  its  hold  on  power 
through  nondemocratic  means?  In  other  words, 
we  need  to  distinguish  between  dominant  parties 
and  dominant  authoritarian  parties.  In  the  classic 
definition  of  the  Italian  political  scientist  Giovanni 
Sartori,  a  dominant-party  system  exists  when  the 
same  party  wins  at  least  three  consecutive  elec¬ 
tions  that  are  free,  fair,  and  competitive.  The 
dominant  party  needs  to  gain  an  absolute  major¬ 
ity  in  parliament  and  govern  alone.  In  case  of  a 
presidential  system  of  government,  the  dominant 
party  also  needs  to  win  the  presidential  elections. 
If  the  elections  were  not  free,  fair,  or  competitive, 
we  are  dealing  with  a  dominant-authoritarian- 
party  system. 


Clearly,  there  are  not  many  parties  that  manage 
to  govern  alone  with  a  parliamentary  majority 
behind  them  for  more  than  12  years.  The  classic 
case  is  the  Liberal  Democratic  Party  in  Japan, 
which  has  governed  for  almost  the  entire  period 
after  World  War  II.  The  Christian  Democrats  in 
Italy  were  the  dominant  force  for  4  decades, 
always  being  in  the  government  until  the  Second 
Republic  ended  in  corruption  scandals.  However, 
these  were  coalition  governments,  so  Italy  is  not  a 
pure  case  of  one-party  dominance.  Neither  is 
Sweden,  because  the  Socialists  often  had  to  form  a 
minority  government.  The  Canadian  Liberals, 
although  in  power  most  of  the  time  at  the  federal 
level,  nonetheless  lost  at  regular  intervals.  There 
are  more  cases  of  dominance  at  the  regional  level, 
for  example,  the  Democratic  South  in  the  United 
States  in  the  first  half  of  the  20th  century  or  the 
position  of  the  Christian  Social  Union  in  Bavaria. 
Unfortunately,  there  are  not  many  studies  of  sub¬ 
national  one-party  dominance. 

Even  this  brief  overview  of  the  best  known 
examples  of  dominant  parties  suggests  that  one- 
party  dominance  is  uncommon,  and  sustained  one- 
party  dominance  is  even  more  rare.  In  comparison, 
it  is  easier  for  dominant  authoritarian  parties  to 
prolong  their  rule.  The  classic  case  is  the  Insti¬ 
tutional  Revolutionary  Party  in  Mexico,  which 
managed  to  stay  in  power  for  most  of  the  20th 
century.  When  a  dominant  authoritarian  party 
loses  an  election,  it  amounts  to  a  democratic  tran¬ 
sition.  Other  recent  examples  are  the  Kuomintang 
in  Taiwan  and  Kenya  African  National  Union  in 
Kenya. 

The  global  spread  of  democracy  since  the  mid- 
1970s  has  resulted  in  a  proliferation  of  dominant 
parties,  especially  in  Africa.  Paradoxically,  the 
(re)introduction  of  multiparty  elections  in  sub- 
Saharan  Africa  has  resulted  in  the  dominance  of 
dominant-party  systems.  The  emblematic  case  is 
South  Africa,  where  the  African  National  Congress 
(ANC)  has  won  all  four  elections  since  the  end  of 
apartheid  with  overwhelming  victories,  even  to  the 
point  where  it  could  change  the  constitution  by 
itself.  This  has  led  observers  to  worry  about  the 
effect  of  one-party  dominance  on  the  prospects  of 
democratic  consolidation.  Even  in  mature  democ¬ 
racies,  there  are  indications  that  domination  by 
one  party  and  the  resulting  lack  of  political  com¬ 
petitiveness  has  consequences  for  the  quality  of 
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democracy.  The  dominant  parties  of  Japan  and 
Italy  are/were  notorious  for  their  factionalism  and 
corruption.  If  parties  are  in  power  for  so  long,  the 
lines  between  party  and  state  are  blurred,  compla¬ 
cency  replaces  responsiveness,  and  infighting  over¬ 
shadows  competition  with  the  often  fragmented 
and  powerless  opposition.  If  this  is  true  for  mature 
democracies  (there  is  in  fact  little  systematic  evi¬ 
dence  to  support  these  accusations),  how  much 
more  damaging  will  the  effects  of  one-party  domi¬ 
nance  be  for  new  democracies? 

However,  while  dominant-party  systems  exhibit 
a  low  level  of  competitiveness,  they  do  not  neces¬ 
sarily  lack  political  competition,  understood  as 
potential  competitiveness,  and  the  frequent  talk 
about  dominant  parties  as  “de  facto  one-party 
states”  is  often  misleading.  It  is  too  simple  to 
equate  one-party  dominance  with  lack  of  democ¬ 
racy  as  done  by  scholars  who  use  election  out¬ 
comes  to  measure  democracy.  The  ruling  party  in 
Botswana,  in  power  since  independence,  has  never 
lost  an  election.  One  influential  measure  of  democ¬ 
racy  takes  this  lack  of  alternation  in  government  as 
proof  that  the  election  results  were  manipulated  by 
a  nondemocratic  regime.  It  considers  dominant 
parties  as  undemocratic  until  they  lose  power.  This 
coding  scheme  puts  the  ANC  in  South  Africa  and 
Robert  Mugabe’s  Zimbabwe  African  National 
Union-Patriotic  Front  in  Zimbabwe  in  the  same 
category.  Clearly,  such  a  measure  fails  to  distin¬ 
guish  between  dominant  parties  and  dominant 
authoritarian  parties.  While  the  ANC  may  be 
voted  out  of  power  at  the  next  elections,  Mugabe 
has  shown  that  he  will  hang  on  to  power  by  any 
means  necessary. 

There  is  some  confusion  about  dominant  parties 
and  dominant-party  systems.  There  is  no  doubt 
that  the  Italian  Christian  Democrats  constituted  a 
dominant  party  in  the  loose  sense  of  a  party  that 
outdistances  all  others  in  popular  support. 
However,  Italy  did  not  have  a  dominant-party  sys- 
tem.  The  pattern  of  interactions  between  the 
political  parties  was  shaped  by  the  rivalry  between 
the  Christian  Democrats  and  the  Communists, 
who  were  regarded  as  an  antisystem  party.  The 
Italian  Communists  always  came  in  second  place 
and  were  never  invited  to  join  the  government  but 
because  of  their  sheer  size  forced  other  parties  to 
form  a  coalition.  The  party  system  therefore  has 
been  characterized  as  “polarized  pluralism.” 


Likewise,  even  if  the  British  Conservatives  or 
Labour  Party  manage  to  win  three  elections  in  a 
row,  as  they  have  both  done  in  the  past  30  years, 
this  does  not  change  the  character  of  the  British 
two-party  system.  The  key  question,  which  has 
been  insufficiently  acknowledged  and  explored,  is 
to  what  extent  dominant  parties  create  a  domi¬ 
nant-party  system.  In  other  words:  When  and  how 
do  dominant  parties  come  to  shape  the  pattern  of 
political  competition? 

No  unified  theory  explains  the  emergence  and 
longevity  of  dominant  parties,  but  partial  expla¬ 
nations  have  been  offered,  often  tightly  bound  to 
a  particular  case  or  class  of  cases.  First,  access  to 
state  resources  plays  a  role.  This  theory  is  espe¬ 
cially  helpful  to  explain  the  continued  success  of 
dominant  authoritarian  parties,  such  as  the 
Institutional  Revolutionary  Party  in  Mexico. 
Second,  dominant  parties  may  benefit  from  posi¬ 
tioning  themselves  on  key  social  cleavages.  The 
ANC  in  South  Africa  would  be  a  case  in  point. 
Third,  political  institutions,  especially  the  elec¬ 
toral  system,  have  an  effect.  The  Liberal  Demo¬ 
cratic  Party  in  Japan  benefited  from  an  unusual 
electoral  system  called  the  single  nontransferable 
vote.  Against  this,  dominant  parties  have  thrived 
in  a  variety  of  electoral  systems,  from  propor¬ 
tional  representation  (the  ANC  in  South  Africa 
and  the  SWAPO  Party  in  Namibia)  to  plurality 
(the  Botswana  Democratic  Party  in  Botswana). 
Fourth,  there  may  be  something  special  about 
dominant  parties — an  aura  of  invincibility,  the 
sense  of  inevitably,  an  association  with  an 
epoch — that  is  hard  to  capture  empirically  but 
easily  recognized  by  those  familiar  with  the 
cases. 

There  is  growing  pluriformity  in  approaches  to 
conceptualizing,  measuring,  and  explaining  one- 
party  dominance.  On  the  one  hand,  this  analytical 
diversity  may  be  regarded  as  disappointing  after 
decades  of  theorizing  and  research  on  one-party 
dominance.  On  the  other  hand,  the  growing  litera¬ 
ture  on  this  subject  attests  to  the  continuous  devel¬ 
opment  of  our  thinking  and  even  a  new  vitality  in 
the  study  of  dominant  parties.  A  particularly  inter¬ 
esting  development  is  the  use  of  voting  power 
indexes  to  measure  the  extent  of  dominance,  espe¬ 
cially  when  combined  with  the  classic  typologies. 
This  innovation  has  the  potential  of  bridging  the 
divide  between  quantitative  and  qualitative 
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research  on  dominant  parties  and  dominant-party 

systems. 

Matthijs  Bogaards 
Jacobs  University  Bremen 
Bremen,  Germany 
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Opposition 


The  term  opposition  comes  from  the  Latin  word 
opponere,  meaning  to  put  something  in  front  of 
something  else.  An  actor’s  opponents  are  those  who 
want  to  block  his  way.  Such  a  conflicting  situation 
may  be  a  game  with  established  rules  in  which  the 
opponent  accepts  defeat.  Some  opponents  may, 
however,  try  to  end  the  game  and  set  up  new  rules 
that  work  for  their  (exclusive)  advantage.  When  the 
rules  of  game  allow  everyone  to  enter  and  to  com¬ 
pete  under  fair  conditions,  players  with  limited  skill 
may  lose  repeatedly  or  be  excluded  for  not  abiding 
by  the  rules.  In  other  cases,  games  may  exclude 
players  arbitrarily,  or  the  rules  may  be  unfair. 

Against  this  background,  we  can  distinguish 
four  types  of  fighting  against  a  competitor: 

1.  Opposition  refers  to  those  trying  to  defeat  a  so 
far  victorious  competitor  but  following  all  rules 
of  the  game. 


2.  Extremists  are  those  wanting  to  overthrow  a 
fair  and  inclusive  game. 

3.  Members  of  resistance  are  those  trying  to 
overthrow  an  unfair  or  exclusive  game  in  order 
to  establish  a  fair  and  inclusive  one. 

4.  Rebels  are  those  fighting  for  victory  in  an  unfair 
or  exclusive  game  without  desiring  to  replace  it 
by  an  inclusive  and  fair  one. 

When  “game”  is  understood  as  a  polity  and 
“rules  of  the  game”  as  a  constitution  or  set  of 
laws,  then  opposition  is  a  generic  concept  for  those 
engaging  in  politics  to  fight  against  unwelcome 
policies  and  politicians  and  accepting  the  constitu¬ 
tion  and  laws  even  in  spite  of  possible  defeat.  This 
entry  analyzes  the  context  and  the  actors  of  oppo¬ 
sition;  it  also  discusses  Robert  Dahl’s  analytic 
framework,  the  functions  of  opposition,  and 
finally,  the  most  recent  research  developments. 

Contexts 

It  is  not  self-evident  that  one  should  fight  for 
political  goals  while  accepting  defeat.  Regimes 
with  a  loyal  opposition  are,  therefore,  an  excep¬ 
tion.  Certainly  political  dissent  and  nonviolent 
opposition  were  common  in  Democratic  Athens, 
in  Republican  Rome,  in  some  republics  of  medi¬ 
eval  Italy,  and  within  European  Estate  assemblies. 
But  most  of  that  disappeared  with  the  advent  of 
strong  monarchic  power. 

The  exception  was  England.  First,  the  principle  of 
“king  in  parliament”  allowed  even  conflicts  between 
the  Crown  and  parliament  to  be  understood  as  com¬ 
petition  for  the  best  of  one  single  “body  politic.” 
Second,  the  decline  of  monarchic  power  due  to  the 
civil  war,  the  Glorious  Revolution,  and  the  specific 
relations  of  Sir  Robert  Walpole  (1676-1745)  with 
the  two  first  kings  of  the  Hanoverian  dynasty  led  to 
the  emergence  of  the  new  role  of  a  “prime  minister” 
with  a  personal  power  base  in  parliament.  Patronage 
and  benefits  for  allies  were  the  central  means  to  cre¬ 
ate  support  for  the  prime  minister.  Such  “govern¬ 
ment  by  corruption”  made  parliament  split:  One 
side  benefited  from  “their”  prime  minister  and  his 
policies;  the  other  side  advocated  different  policies 
and  hoped  to  be  part  of  the  spoils  system  under  a 
new  prime  minister. 

In  this  situation,  Walpole’s  political  adver¬ 
sary  Henry  St.  John  1st  Viscount  Bolingbroke 
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(1678-1751)  presented  a  first  theory  of  political 
opposition  that  was  very  influential.  According  to 
this  theory,  opposition  serves  liberty  and  the  com¬ 
mon  good  by  continually  criticizing  selfish  or 
incompetent  politics  of  the  government;  opposing 
the  government  does  not  mean  disloyalty  to  the 
Crown  or  the  constitution  but  only  the  desire  to 
replace  bad  policies  or  politicians  with  better  ones. 
Actually  it  has  been  one  of  the  most  important 
inventions  in  human  history  to  implement,  and  to 
acknowledge,  the  practice  of  a  legitimate  and  loyal 
opposition.  Unfortunately,  the  concept  was  subse¬ 
quently  narrowed  to  a  form  of  parliamentary 
activity  and  did  not  encompass  phenomena  such  as 
political  pluralism  and  separation  of  power. 
Moreover,  it  was  put  in  the  context  of  “ethically 
better  alternatives,”  claiming  counterfactually  a 
superior  moral  status  for  oppositional  parties. 

Who  Is  “the  Opposition”? 

Under  the  assumption  that  opposition  proper  is 
“parliamentary  opposition,”  typologies  were  cre¬ 
ated  that  distinguish  parliamentary,  extraparlia¬ 
mentary,  and  antiparliamentary  opposition.  But 
any  split  between  opposition  inside  and  outside 
parliament  is  artificial,  and  there  is  not  even  a  clear 
line  between  the  governing  majority  and  the  oppo¬ 
sition  in  presidential  systems.  One  more  realistic 
typology  was  suggested  by  Philip  Norton  (2008). 
According  to  him,  “the  Opposition”  means  either 
the  principle  of  opposition  or  the  largest  opposi¬ 
tional  party  in  a  parliamentary  system.  Opposition 
parties  are,  in  a  parliamentary  system,  parties  that 
expressly  do  not  support  the  government. 
Opposition  covers  different  forms  of  conflicting 
relationships  between  the  legislative  and  executive 
branches  of  government.  Among  them,  as  also 
noted  by  Norton,  five  “modes”  can  be  distin¬ 
guished,  according  to  Anthony  King  (1976).  In  the 
opposition  mode,  cohesive  parliamentary  parties 
confront  a  governing  party  or  coalition.  In  the 
intraparty  mode,  it  comes  to  opposition  within  a 
(governing)  party;  that  is,  “faction  building.”  In  the 
nonparty  mode,  parliamentarians  cooperate  out¬ 
side  party  contexts,  which  allows  for  majority 
building  while  ignoring  party  lines.  In  the  cross¬ 
party  mode,  governing  and  oppositional  parties 
cooperate  on  some  issues  while  disagreeing  on  oth¬ 
ers.  Extraparliamentary  opposition  means  all  forms 


of  political  dissent  and  government-opposing  activ¬ 
ities  outside  (a  chamber  of)  parliament. 

The  last  category  is,  however,  quite  overcrowded. 
First,  there  are  extraparliamentary  party  structures. 
These  cannot  reasonably  be  detached  from  parlia¬ 
mentary  party  groups,  because  local  and  regional 
party  organizations  are  often  led  by  members  of 
parliament  and  work,  throughout  the  country,  as 
their  combat  groups.  Second,  there  may  be  “higher 
houses”  that  act  as  a  “second  opposition,”  in  par¬ 
ticular,  if  elections  to  them  are  used  for  expressing 
dissatisfaction  with  the  government.  Third,  constitu¬ 
tional  courts  perform  like  an  opposition  if  opposi¬ 
tional  parties  get  them  ruling  on  politically  contro¬ 
versial  issues.  Fourth,  actors  from  other  levels  of 
government  may  impede,  or  veto,  a  cabinet’s  policy 
making  in  federal  systems.  This  gives  even  small  par¬ 
ties  with  regional  strongholds  significant  opposi¬ 
tional  power  at  a  superior  level  of  government.  Fifth, 
nongovernmental  organizations  can  accumulate  and 
exert  very  significant  oppositional  power  in  liberal 
societies.  Sixth,  there  is  even  potential  for  opposi¬ 
tional  power  within  the  executive  branch  of  govern¬ 
ment  if  it  comes  to  implement  policy  programs  that 
are  not  regarded  as  administrable  by  civil  servants. 

Dahl’s  Analytic  Framework 

Contemporary  opposition  theory  started  with  Dahl 
(1966).  Coining  the  concept  of  polyarchy,  he  put 
legitimate  opposition,  together  with  free  political 
participation,  at  the  center  of  liberal  democracy. 
None  of  his  analytical  categories  is  restricted  to  par¬ 
liamentary  opposition  (see  Dahl,  1966,  pp.  332-386). 
Going  by  his  concept,  the  following  questions  must 
be  asked  to  determine  the  nature  of  opposition: 

1.  How  cohesively,  or  organizationally 
concentrated,  do  oppositional  groups  act? 

2.  How  much  loss  of  governmental  power  can  be 
effectuated  by  how  much  increase  in 
oppositional  power? 

3.  What  sites  are  available  for  encounters  between 
the  opposition  and  government?  How  fair  is 
competition  in  which  arena,  and  how  important 
is  which  site  for  actual  power  politics? 

4.  How  identifiable  are,  depending  on  those  factors, 
oppositional  groups  in  a  political  system? 
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5.  What  are  the  goals  of  which  oppositional 
groups  ? 

6.  What  are  their  strategies? 

In  addition,  five  “primary  conditions”  are  claimed 
to  explain  which  patterns  of  opposition  will  emerge: 

1.  constitutional  structure  and  the  electoral  system; 

2.  widely  shared  cultural  premises,  in  particular 
about  cooperation  and  problem  solving; 

3.  specific  subcultures  in  a  given  society  and  its 
political  culture  (inclined  toward  violence  and 
secession  or  toward  proportional  share  of  public 
goods); 

4.  the  record  of  grievances  against  the  government, 
out  of  which  people  learn  about  the  regime’s 
responsiveness;  and 

5.  the  kind  or  extent  of  social  and  economic 
differences/cleavages  in  which  the  party  system 
is  rooted. 

Two  more  specific  factors  intervene:  a  society’s 
individual  pattern  of  cleavage,  conflict,  and  agree¬ 
ment  in  attitudes  and  opinions  and  its  extent  of 
polarization.  These  primary  conditions  vary  inde¬ 
pendently  from  each  other,  but  not  beyond  certain 
limits;  they  may  reinforce  each  other;  and  a  big 
change  in  one  of  them  will  likely  make  an  existing 
pattern  of  opposition  change  as  well. 

Based  on  similar  considerations,  Heinrich 
Oberreuter  (1975,  p.  20)  has  suggested  a  parsimo¬ 
nious  typology  of  oppositional  behavior  patterns: 
issue-oriented  ad  hoc  opposition,  cooperative 
opposition,  and  competitive  opposition,  all  of 
them  possible  inside  or  outside  parliament. 

Oppositional  Functions 

A  function  is  a  service  rendered  by  a  system  for 
its  environment.  Criticism  of  a  government’s 
programs,  policies,  and  actors  is  the  first  service 
of  the  opposition  for  its  regime:  Issues  are 
brought  to  public  attention,  incentives  for  policy 
learning  are  set,  and  otherwise  neglected  prob¬ 
lems  are  put  on  the  agenda.  The  next  function  is 
control,  unfolded  both  as  scrutiny  of  single  mea¬ 
sures  and  as  checking  the  government’s  general 


course.  Effective  control  depends  on  the  govern¬ 
ment’s  responsibility  before  parliament  and 
makes  rational  politicians  avoid  actions  that 
might  otherwise  be  undertaken.  Offering  alterna¬ 
tives  encompasses  different  programs,  policies, 
and  persons  to  citizens  and  voters,  thereby  estab¬ 
lishing  pluralism.  In  this  perspective,  integration 
allows  even  those  whose  positions  and  leaders 
are  minoritarian  to  identify  with  the  polity  and 
have  the  chance  to  become  influential  in  the 
future.  Without  these  advantages  produced  by  an 
influential  opposition,  political  systems  will  work 
less  well  than  those  with  a  legitimate  and  loyal 
opposition. 

Research  on  Opposition 

A  straightforward  framework  such  as  Dahl’s  should 
have  inspired  a  large  body  of  comparative  empiri¬ 
cal  research,  providing  us  with  reliable  knowledge 
on  how  to  establish  effective  opposition  as  a  means 
for  better  governance.  However,  as  Klaus  von 
Beyme  (1987)  has  shown,  research  on  opposition 
was  scarce  for  a  long  while,  although  “protest 
movements”  (oppositional),  “new  social  move¬ 
ments,”  or  minorities  struggling  for  a  better  legal  or 
social  status  have  attracted  much  scholarly  atten¬ 
tion.  In  the  1960s  and  later,  one  reason  for  such 
neglect  was  the  revival  of  the  mistaken  thesis  of 
“declining  parliamentary  power,”  in  the  perspec¬ 
tive  of  which  parliamentary  opposition  becomes 
uninteresting  as  well  (pp.  30-31).  As  recalled  by 
Ludger  Helms  (2008,  p.  8),  the  “governance  turn” 
in  comparative  politics  came  next,  with  its  focus  on 
cooperative  and  corporatist  forms  of  interaction 
between  state  and  society.  In  this  perspective, 
opposition  looks  rather  negligible.  Finally,  confin¬ 
ing  opposition  to  parliamentary  opposition  deprives 
this  concept  of  comparative  power  beyond  parlia¬ 
mentary  systems  of  government.  For  all  these  rea¬ 
sons,  a  more  comprehensive  theory  of  opposition 
should  be  worked  out,  and  broad-scope  compara¬ 
tive  empirical  research  on  opposition,  seen  as  an 
element  of  pluralism,  division  of  power,  and  good 
governance,  should  be  encouraged. 

Werner  J.  Patzelt 
Technical  University  of  Dresden 
Dresden,  Germany 
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Organization  theory  is  a  field  of  study  in  which 
the  phenomena  of  interest  are  related  to  organiza¬ 
tions  and  organizing.  The  general  aim  is  to  under¬ 
stand  what  determines  organizational  forms  and 
processes  and  the  consequences  of  different  types 
and  aspects  of  organizational  forms  and  pro¬ 
cesses.  It  is  somewhat  contested  whether  organiza¬ 
tion  theory  should  be  regarded  as  an  academic 
discipline  in  its  own  right,  since  it  draws  heavily 
on  concepts  and  ways  of  thinking  from  disciplines 
such  as  anthropology,  economics,  political  sci¬ 
ence,  psychology,  and  sociology.  Nevertheless, 
organization  theory  may  also  contribute  to  the 
development  of  political  science  and  other  disci¬ 
plines.  In  North  America,  a  distinction  is  often 
made  between  organizational  behavior  and  orga¬ 
nization  theory,  roughly  corresponding  to  an 
exploration  of  micro-  and  macro-organizational 
phenomena,  respectively.  Elsewhere,  a  broad 
notion  of  organization  theory  is  more  common, 
and  it  is  also  adopted  here.  Thus,  organization 
theory  is  seen  as  the  academic  field  (or  discipline) 


specializing  in  the  study  of  organizational  phe¬ 
nomena  (both  micro  and  macro)  and  used  as  a 
synonym  for  organization  studies.  After  a  brief 
overview  of  the  development  of  organization  the¬ 
ory  as  an  academic  field,  this  entry  reviews  some 
important  phenomena  and  relationships  being 
examined  in  contemporary  organization  theory. 
Some  notes  on  the  relationship  between  organiza¬ 
tion  theory  and  political  science  are  then  made, 
and  finally  some  thoughts  on  future  developments 
are  offered. 

Historical  Development 

The  design  and  management  of  specific  types  of 
organizations  have  been  practiced  for  several  thou¬ 
sands  of  years  and  studied  for  a  couple  of  centuries. 
Nevertheless,  generalizations  about  organizations 
and  the  development  of  organization  theory  as  an 
academic  field  are  restricted  to  the  post-World 
War  II  period.  The  actual  phrase  organization 
theory  was  most  actively  promoted  by  Herbert 
Simon  from  1950  onward,  being  seen  as  a  broad 
category  that  included  many  existing  approaches, 
such  as  scientific  management  and  industrial  psy¬ 
chology.  However,  while  organizations  in  these 
approaches  were  primarily  seen  as  settings  within 
which  work  was  carried  out,  they  were  now  seen 
as  units  of  interest  in  their  own  right  and  analyzed 
as  distinctive  social  systems  and  collective  actors. 

In  what  has  retrospectively  become  known  as 
classical  organization  theory,  the  emphasis  was  on 
universal  principles  of  administration  and  manage¬ 
ment  that  could  lead  to  goal  achievement.  While 
produced  about  a  hundred  years  ago,  elements  of 
contributions  from  people  such  as  Frederick  Taylor 
on  scientific  management,  Herbert  Fayol  on 
administrative  theory,  and  Max  Weber  on  the 
theory  of  bureaucracy  are  still  evident  in  today’s 
organizations.  However,  since  the  1950s,  this 
rational-  and  closed-systems  perspective  on  organi¬ 
zations  has  been  supplemented  by  other  types  of 
perspectives.  While  classical  organization  theory 
indicated  that  there  is  one  best  way  of  organiza¬ 
tional  design  and  practice,  contingency  theory 
from  the  1960s  onward  suggests  that  this  depends 
on  the  characteristics  of  each  situation.  Thus, 
according  to  this  kind  of  rational-  and  open- 
systems  perspective,  variations  in  the  environ¬ 
ments,  tasks,  and  technologies  of  organizations 
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imply  variations  in  organizational  forms  and 
processes. 

The  human  relations  approach  from  the  1930s 
onward  represented  a  natural-systems  perspective, 
where  the  emphasis  was  not  primarily  on  achiev¬ 
ing  goals  but  on  organizational  survival.  Here,  too, 
since  the  1960s,  characteristics  of  the  environ¬ 
ments  of  organizations  have  become  increasingly 
important,  representing  a  turn  from  closed-  to 
open-systems  perspectives.  Moreover,  in  this  latest 
period,  social  constructionist  approaches  have 
become  more  prominent.  According  to  these 
approaches,  the  organization  and  its  forms  and  pro¬ 
cesses  do  not  exist  as  objects  separate  from  people 
but  are  created  and  maintained  as  organizational 
members  talk  about  what  they  think  is  happening 
and  what  needs  to  be  done.  Recently,  postmodern 
approaches  have  also  become  popular  among  some 
organizational  researchers;  for  example,  in  the 
study  of  language  games  and  discourses  involving 
organizations  and  organizing. 

Phenomena  and  Relationships 

Contemporary  organization  theory  is  character¬ 
ized  by  a  large  variety  of  units  of  analysis,  perspec¬ 
tives,  and  themes.  Many  researchers  examine 
individual  organizations,  for  example,  how  differ¬ 
ent  organizational  aspects  affect  performance. 
Some  focus  on  organizations  in  a  certain  field, 
examining  how  they  interact  and  affect  each  other. 
Moreover,  population  ecologists  examine  the 
dynamics  of  entire  populations  of  organizations. 
In  conferences,  books,  and  journals  in  the  field, 
there  is  a  lively  debate  on  the  strengths  and  weak¬ 
nesses  of  the  various  theoretical  positions  and  the 
relevance  of  the  various  topics.  Despite  some  dis¬ 
agreements,  the  phenomena  and  relationships 
reviewed  here  are  commonly  regarded  as  being 
among  the  most  important  ones. 

Organizational  Structure 

The  importance  of  structure  for  organizational 
behavior  has  been  a  central  topic  in  the  study 
of  organizations  throughout  the  history  of  this 
academic  field.  Questions  such  as  how  a  given 
structure  may  constrain  and  enable  instrumental 
rational  action  in  and  by  organizations,  and  how  it 
is  possible  to  affect  actual  organizational  behavior 


indirectly  by  structural  design,  have  been  discussed 
for  a  long  time  and  remain  highly  relevant. 
According  to  the  structural-instrumental  perspec¬ 
tive,  organizations  are  regarded  as  tools  for  achiev¬ 
ing  certain  goals.  From  this  perspective,  it  is 
assumed  that  organizations  and  their  members  act 
with  instrumental  rationality  in  carrying  out  tasks 
and  that  the  organizational  structure  is  designed  in 
accordance  with  means-ends  assessments. 

The  formal  structure  of  an  organization  consists 
of  positions  and  rules  that  determine  who  shall  or 
can  do  what  and  define  how  various  tasks  should 
be  executed.  Organizations  are  composed  of  a  set 
of  positions  and  subordinated  units  and  can  them¬ 
selves  fall  under  other  larger  units.  In  addition, 
organizational  units  can  be  divided  up  and  coordi¬ 
nated  in  different  ways.  Vertical  specialization  con¬ 
cerns  whether  and  how  they  consist  of  subordinate 
bodies,  as  well  as  how  they  are  related  to  superior 
bodies.  Horizontal  specialization  refers  to  how  dif¬ 
ferent  tasks  are  thought  to  be  allocated  on  a  certain 
level  by  means  of  organizational  structure.  Thus, 
organizations  may  be  based  on  certain  principles  of 
specialization,  such  as  purpose,  process,  clientele, 
or  geography.  Through  the  superior-subordinate 
relationship  between  different  levels  in  a  hierarchy, 
there  will  be  a  great  degree  of  vertical  coordination 
within  and  between  organizations.  Hierarchy  can 
also  involve  vertical  specialization,  in  that  different 
types  of  tasks  are  assigned  to  different  levels  in  the 
organization  or  to  organizations  at  different  levels. 
Routines  can  constitute  a  form  of  coordination 
both  vertically  and  horizontally.  Procedural  rules 
can  be  used  as  tools  within  an  organization,  but 
they  can  also  be  used  to  coordinate  activities  in  a 
way  that  cuts  across  organizations. 

A  bureaucratic  organizational  form,  as  Max 
Weber  described  it,  is  marked  by  a  high  extent  of 
hierarchy,  horizontal  specialization,  and  routines. 
Many  types  of  organizational  forms  can  be  thought 
of  as  an  alternative  or  as  supplementary  to  a 
bureaucratic  organizational  form.  At  one  extreme 
is  a  completely  flat  structure,  that  is,  an  organiza¬ 
tion  without  hierarchical  ordering  but  with  several 
positions  and  subunits  at  the  same  level.  In  a  col¬ 
legial  structure,  a  board  of  directors  or  an  advisory 
council  can  be  set  up  instead  of,  or  in  addition  to, 
the  top  leadership  in  the  hierarchy.  Another  alter¬ 
native  is  a  matrix  structure.  Here,  a  position  or 
subunit  is  subordinated  to  several  superior  units 
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simultaneously.  These  superior  units  usually  oper¬ 
ate  according  to  different  principles  of  specializa¬ 
tion.  In  addition  to  arrangements  such  as  these, 
which  are  without  time  limitations,  organizations 
may  have  temporary  arrangements  that  extend 
beyond  the  bounds  of  bureaucratic  organizational 
forms,  such  as  task  forces  and  project  organiza¬ 
tions.  Boards  of  directors,  advisory  councils,  task 
forces,  and  project  organizations  are  all  various 
forms  of  network  structure  that  supplement  the 
bureaucratic  organizational  form.  This  is  not 
merely  the  case  within  individual  organizations  but 
also  applies  to  relationships  between  organizations. 

Organizational  Culture 

Organizational  culture  is  concerned  with  the 
informal  norms  and  values  that  are  important  for 
the  activities  of  organizations.  The  emphasis  is  not 
on  goals  but  on  informal  norms,  values,  and  iden¬ 
tities  that  develop  gradually.  The  classic  distinction 
made  by  Philip  Selznick  is  between  the  organiza¬ 
tion,  as  an  instrument  for  achieving  goals,  and  the 
institution,  whereby  an  organization,  through  the 
process  of  institutionalization,  is  “infused  with 
value  beyond  the  technical  requirements  of  the 
task  at  hand”  (1957,  p.  17).  Thus,  in  addition  to 
solving  tasks  in  an  instrumental  sense,  an  organi¬ 
zation  has  then  become  a  value-bearing  institution 
with  its  own  distinct  identities  and  opinions  about 
what  the  relevant  problems  and  solutions  are.  This 
makes  for  a  more  complex  organization,  less  flex¬ 
ible  or  adaptable  to  new  demands  but  also  one 
equipped  with  new  and  necessary  qualities  that 
will  potentially  help  the  organization  to  solve  tasks 
more  expediently  and  function  well  as  a  socially 
integrated  unit.  Thus,  the  organizational  culture 
demarcates  organizations  from  one  another  and, 
through  mechanisms  of  socialization,  strengthens 
cohesion  and  commitment  among  its  members. 
Culture  provides  members  of  the  organization 
with  a  sense  of  belonging  to  a  community  with  a 
shared  goal  and  mission. 

Organizational  Environments 

In  the  study  of  organizations  and  the  develop¬ 
ment  of  open-systems  perspectives,  a  distinction  is 
commonly  made  between  technical  and  institu¬ 
tional  environments.  Technical  environments  can 


be  defined  either  broadly  as  all  aspects  of  the  envi¬ 
ronment  potentially  relevant  to  goal  setting  and 
goal  achievement  or  more  narrowly  as  the  sources 
for  inputs  and  markets  for  outputs,  competitors, 
and  regulators.  In  the  academic  literature,  several 
dimensions  of  technical  environments  affecting 
organizational  uncertainty  and  dependence  have 
been  proposed  and  examined — for  example,  the 
degree  of  homogeneity/heterogeneity,  the  degree  of 
stability/variability,  and  the  degree  of  concentra¬ 
tion/dispersion. 

The  technical  environments  of  an  organization 
are  to  a  large  extent  determined  by  the  type  of 
tasks  it  performs.  As  with  the  case  for  the  other 
aspects,  the  existing  tasks  may  constrain  and 
enable  what  an  organization  and  its  members  are 
doing,  and  over  time,  tasks  may  be  changed  as  a 
result  of  purposeful  action. 

Since  the  1970s,  an  emphasis  on  what  has  been 
called  the  institutional  environments  has  been 
added,  stressing  the  importance  of  the  symbolic 
aspects  of  environments.  Institutional  environments 
are  characterized  by  the  development  of  socially 
created  norms  to  which  individual  organizations 
must  conform  in  order  to  receive  legitimacy  and 
support.  Thus,  in  institutional  environments,  orga¬ 
nizations  are  rewarded  for  using  what  is  seen  as 
correct  organizational  forms  and  processes  and  not 
for  the  quantity  and  quality  of  their  outputs,  as  in 
technical  environments. 

Organizational  Decision  Making 

Studies  of  decision-making  behavior  within  and 
by  organizations  have  traditionally  emphasized 
purposeful  choice  in  accordance  with  a  logic  of 
consequences.  According  to  this  logic  of  action, 
organizations  and  their  members  are  seen  as  assess¬ 
ing  their  existing  goals,  the  alternatives  for  actions, 
and  the  future  consequences  in  relation  to  the  goals 
that  might  follow  from  each  alternative.  The  con¬ 
cept  of  full  instrumental  rationality  refers  to  an 
organization’s  having  clear  and  consistent  goals,  a 
full  overview  of  all  the  alternatives,  and  full  insight 
into  the  consequences  that  these  alternatives  will 
bring  in  relation  to  its  goals.  From  this,  it  often  fol¬ 
lows  that  the  organization  chooses  the  alternative 
that  gives  the  maximal  degree  of  goal  achievement. 
Even  so,  many  empirical  studies  of  how  organiza¬ 
tions  act  show  that  this  is  realistic  only  to  a  certain 
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degree.  This  is  expressed  through  the  concept  of 
bounded  rationality,  which  implies  that  an  organi¬ 
zation’s  goals  are  diffuse,  inconsistent,  or  unstable 
and  that  the  problems  it  faces  are  complex.  The 
concept  also  includes  the  idea  that  an  organization 
has  incomplete  information  about  alternatives  and 
consequences.  From  this,  it  follows  that  the  organi¬ 
zation  chooses  an  alternative  that  yields  a  satisfac¬ 
tory  degree  of  goal  achievement. 

Moreover,  different  individuals  or  groups  within 
an  organization  can  be  committed  to  different  goals 
and  interests,  and  the  organization  or  its  individual 
parts  must  relate  to  other  organizations  that  may 
have  other  goals  and  interests.  Thus,  organizations 
can  be  understood  as  coalitions.  Every  actor  acts  in 
a  purposeful  way,  is  motivated  by  interests,  and 
can  also  enter  into  coalitions  with  actors  outside 
the  organization,  who,  according  to  their  interests, 
act  in  similar  ways.  Interest  distribution  may  be 
rooted  in  formal  structures  within  and  between 
organizations — for  example,  related  to  actors  car¬ 
rying  out  specialized  tasks.  Their  resources  for 
articulating  their  own  interests  may  also  be  rooted 
in  the  formal  structure,  such  as  through  superordi¬ 
nation  or  subordination  of  actors  and  through 
horizontal  coordination. 

Conflicts  of  interest  within  and  between  organi¬ 
zations  can  be  dealt  with  in  different  ways.  First,  a 
dominant  coalition  can  choose  between  relevant 
alternatives  of  action  and  assert  its  own  goals  and 
interests.  Second,  the  actors  can  negotiate  a  com¬ 
promise  between  different  interests,  which  in  turn 
provides  the  basis  for  purposeful  choice  based  on 
knowledge  about  alternatives  and  consequences. 
Third,  the  competing  goals  can  be  addressed 
sequentially,  so  as  not  to  come  into  conflict  with 
one  another.  Fourth,  goals  in  different  parts  of  an 
organization,  or  in  different  organizations,  do  not 
need  to  be  viewed  vis-a-vis  each  other  but  can  be 
addressed  independently.  Conflicts  of  interest  can 
also  be  dealt  with  by  actors  who  come  to  an  agree¬ 
ment  on  means. 

More  radical  modifications  of  purposeful  choice 
in  organizational  decision  making  are  introduced 
in  the  garbage  can  model.  Here,  decisions  are  to  a 
large  extent  seen  as  being  produced  by  temporal 
linkages.  Thus,  time  and  attention  are  limited 
resources,  and  the  arrival  and  coupling  of  indepen¬ 
dent,  exogenous  streams  of  problems,  solutions, 


decision  makers,  and  opportunities  for  choice  are 
important  for  determining  how  decisions  are  made 
and  interpreted. 

The  importance  of  organizational  culture  for 
decision  making  is  related  to  its  impact  on  what  is 
seen  as  appropriate  behavior  in  and  by  organiza¬ 
tions.  Over  time,  organizations  develop  distinctive 
identities.  Faced  with  a  new  situation,  they  have  to 
recognize  the  nature  of  that  situation  and  find  a 
rule  for  action  that  is  consistent  with  their  identity. 
Organizational  culture  entails  a  relatively  consis¬ 
tent  set  of  rules  and  identities,  so  such  links  are 
simple  to  make.  What  makes  an  action  appropri¬ 
ate  in  a  certain  organization  may  be  highly  diver¬ 
gent  from  what  is  found  in  other  organizations, 
depending  on  how  an  organizational  culture  has 
evolved  and  what  its  dominant  informal  norms 
and  values  are. 

Organizational  Change 

In  the  academic  literature  on  organizational 
change,  a  distinction  is  often  made  between  change 
through  adaptation  and  through  selection  and 
between  the  transformation  and  evolution  of  orga¬ 
nizations.  To  a  large  extent,  the  perspectives  on 
change  being  employed  reflect  the  various  theo¬ 
retical  positions  for  the  study  of  organizations  in 
general. 

Most  students  of  organizational  change  exam¬ 
ine  how  individual  organizations  adapt  in  response 
to  changing  circumstances,  but  these  adaptations 
may  take  different  forms.  For  example,  contin¬ 
gency  theory  focuses  on  the  alignment  between 
various  situational  features  (e.g.,  technical  envi¬ 
ronment)  and  structural  features  (e.g.,  horizontal 
and  vertical  specialization).  Thus,  organizational 
change  is  expected  if  changes  in  the  technical  envi¬ 
ronments  imply  a  misalignment  with  organiza¬ 
tional  structure.  Rational  adaptation  is  also 
emphasized  in  purposeful  action  approaches 
where  organizations  are  seen  as  having  some 
capacity  to  handle  their  dependency  on  environ¬ 
ments  or  even  change  the  environments.  Moreover, 
approaches  that  emphasize  the  importance  of 
institutional  environments  focus  on  how  organi¬ 
zations  adapt  their  structures  to  prevailing  norma- 
tively  endorsed  modes  of  organizing.  This  also  leads 
to  a  large  extent  of  isomorphism  in  an  organizational 
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field.  On  the  other  hand,  there  may  be  a  decoupling 
between  organizational  structures  and  organiza¬ 
tional  activities,  where  changes  in  organizational 
structures  reflect  the  myths  embedded  in  the  insti¬ 
tutional  environments  and  are  not  meant  to  affect 
organizational  activities. 

Organizational  change  through  selection  is  most 
central  in  population  ecology  theory.  Here,  the 
focus  is  on  the  development  of  organizational 
forms  at  the  population  level.  While  variation 
through  the  birth  of  new  organizational  forms  may 
happen  by  chance,  those  that  survive  are  selected 
by  the  environment,  based  on  their  ability  to  com¬ 
pete  with  others  in  a  niche.  Finally,  retention  refers 
to  the  forces  that  maintain  and  perpetuate  certain 
organizational  forms. 

While  some  researchers  applying  rational- 
systems  perspectives  analyze  whether  and  how 
change  agents  can  accomplish  radical  transforma¬ 
tions  of  organizations,  others  emphasize  the  evolu¬ 
tionary  and  incremental  character  of  structural 
change  due  to  bounded  rationality  or  organiza¬ 
tions  as  coalitions.  Some  also  combine  purposeful 
transformation  and  evolution  of  organizational 
structure  through  ideas  of  punctuated  equilibrium, 
where  long  periods  of  convergent  change  are  inter¬ 
rupted  by  short  periods  of  abrupt  divergent  change. 
Stability  and  evolution  of  organization  are  also 
focused  on  by  researchers  emphasizing  the  impor¬ 
tance  of  organizational  culture  and  appropriate 
behavior.  When  an  attempt  is  made  to  introduce 
new  organizational  solutions,  they  will  often  be 
revealed  as  incompatible  and  unsuitable.  Thus,  if 
they  are  seen  as  clashing  with  the  values  an  orga¬ 
nization  cares  about  and  is  committed  to,  they  risk 
being  rejected.  Here,  too,  there  may  be  a  combina¬ 
tion  of  transformation  and  evolution  through 
ideas  of  punctuated  equilibrium,  where  long  peri¬ 
ods  of  stability  are  interrupted  by  abrupt  change  in 
the  case  of  major  critical  events. 

Organization  Theory  and  Political  Science 

Many  of  the  most  prominent  early  scholars  in 
organization  theory  came  from  political  science, 
and  many  of  the  earliest  studies  of  organizations 
were  studies  of  public  administration.  Thus,  ideas 
of  bounded  rationality,  pluralistic  competition,  mud¬ 
dling  through,  cooptation,  resource  dependence, 


garbage  can  decision  processes,  and  loose  coupling 
all  emerged  from  studies  of  organizations  in  the 
public  sector.  However,  from  the  1960s  onward, 
political  science  and  organization  theory  were 
mainly  characterized  by  parallel  agendas  but 
mutual  disregard.  This  may  be  related  to  develop¬ 
ments  in  the  study  of  organizations:  Most  scholars 
in  organization  theory  were  located  in  business 
schools,  and  most  research  was  on  private  sector 
organizations.  It  may  also  be  related  to  develop¬ 
ments  in  political  science  and  the  study  of  public 
administration,  where  other  topics  and  perspec¬ 
tives  became  more  prominent. 

Nevertheless,  in  recent  years,  there  are  some 
tendencies  toward  rediscovering  a  lost  tradition  in 
public  administration  that  combines  three  commit¬ 
ments:  (1)  understanding  administrative  practice, 
(2)  contributing  to  a  general  understanding  of 
organizations,  and  (3)  illuminating  political  phi¬ 
losophy.  Moreover,  it  may  be  argued  that  an  orga¬ 
nization  theory  for  the  public  sector  should  be 
delimited  from  general  organization  theory.  First, 
an  organization  theory  for  the  public  sector  should 
contribute  to  clarifying  the  key  organizational 
forms  that  exist  within  public  administration,  as 
well  as  those  that  exist  between  the  public  admin¬ 
istration  and  various  groups  in  society.  Second,  an 
organization  theory  of  this  kind  should  help  clarify 
the  types  of  selection  different  organizations  make. 
This  means  the  extent  to  which  they  attend  to,  are 
neutral  toward,  or  opposed  to  values,  situations, 
or  interests  within  society.  Third,  it  should  help 
explain  the  existence  of  different  forms  of  organi¬ 
zation,  with  an  emphasis  on  examining  to  what 
extent  such  forms  are  determined  by  public  policy. 
Seen  from  the  vantage  point  of  an  organization 
theory  rooted  in  political  science,  it  is  not  enough 
to  concentrate  attention  on  economy  and  effi¬ 
ciency.  The  way  the  public  sector  operates  must  be 
described,  analyzed,  and  evaluated  from  a  demo¬ 
cratic-political  vantage  point.  This  means  directing 
the  focus  toward  the  sector’s  basis  of  values, 
knowledge,  and  power. 

Future  Development 

The  academic  field  of  organization  theory  is  embed¬ 
ded  in  its  times.  Just  as  the  significant  features  of 
the  field  have  been  molded  by  certain  critical 
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events,  future  developments  will  also  be  affected  by 
coming  events.  This  makes  it  a  bit  difficult  to  for¬ 
mulate  expectations  of  what  will  happen. 

Nevertheless,  based  on  recent  trends  in  organi¬ 
zation  theory  and  actual  organizational  forms 
and  processes,  some  ideas  on  future  developments 
may  be  presented.  First,  the  increase  of  networks, 
public-private  partnerships,  and  other  organiza¬ 
tional  forms  crossing  the  boundaries  between  the 
public  and  private  sectors  may  stimulate  an 
understanding  of  organizations  and  organizing 
beyond  markets  and  hierarchies.  Second,  the 
global  spread  of  certain  organizational  forms — in 
public  sector  organizations  related  to  new  public 
management  or  post-new  public  management 
doctrines — may  stimulate  an  understanding  of 
the  adaptation  of  these  forms  across  countries 
and  policy  areas.  Third,  the  development  of  more 
complex  organizational  forms  may  stimulate  the 
development  of  more  complex  theories  of  how 
organizations  change  and  of  the  consequences  of 
organizational  change.  Thus,  there  may  be  a 
mutual  interplay  of  structural,  cultural,  and  envi¬ 
ronmental  features  and  also  a  two-way  dynamic 
between  organizational  forms  and  actions. 
Fourth,  there  may  be  a  two-way  relationship 
between  the  development  of  organization  theory 
and  the  development  of  new  organizational  forms 
as  well  as  the  popular  understandings  of  organi¬ 
zations  and  organizing. 

Summing  up,  this  means  that  organization  the¬ 
ory  and  political  science  may  benefit  from  increased 
cross-fertilization.  Empirical  studies  of  decision 
making  and  change  in  public  sector  organizations 
will  provide  some  observations  and  theoretical 
ideas  that  may  be  useful  to  students  of  politics. 
Increased  attention  toward  organizations  from 
students  of  politics  is  also  likely  to  change  organi¬ 
zation  theory.  By  focusing  on  organizational  char¬ 
acteristics  of  parliaments,  public  administration, 
political  parties,  interest  organizations,  and  social 
movements,  as  well  as  on  themes  such  as  conflict, 
power,  justice,  and  equality  and  the  development 
of  norms,  goals,  and  meanings,  the  academic  field 
of  organization  theory  may  expand  in  ways  that 
are  relevant  for  political  scientists. 
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Orientalism 


The  word  orientalism  has  several  meanings. 
Specifically,  it  is  the  science  that  has  for  its  subject 
matter  “Oriental”  languages  and  civilizations 
and,  by  extension,  the  taste  for  Oriental  objects 
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and  arts,  as  well  as  their  use  in  Western  artistic 
fields,  particularly  decoration,  painting,  and 
music.  Orientalism  was  born  in  the  18th  century, 
although  the  word  was  not  commonly  used  before 
the  19th  century.  In  the  current,  postcolonial  era, 
the  term  orientalism  and  the  concept  of  the 
Oriental  have  taken  on  pejorative  connotations, 
especially  among  people  of  Asian  and  Middle 
Eastern  descent.  Recent  scholarship,  perhaps  most 
notably  that  of  the  late  Edward  Said,  points  out 
the  social  psychological  function  of  “Oriental”  as 
a  label  whose  use  tacitly  assumes  the  absolute 
primacy  of  a  Eurocentric  perspective,  thereby 
categorically  objectifying  what  is  not  European, 
rendering  it  exotic  and  essentially  “other.”  This 
entry  examines  the  history  of  this  concept,  the 
different  kinds  of  orientalism  that  must  be  taken 
into  account,  and  the  debates  that  have  taken 
place  around  this  concept. 

Origins  of  Orientalism 

Orientalism  originated  as  the  extension  of  Renais¬ 
sance  humanism  to  Oriental  fields,  which  were 
then  Turkish,  Arabic,  and  Persian.  Its  aim  was  to 
constitute  a  universal  literature  by  merging  Eur¬ 
opean  and  Oriental  contributions.  Its  tremendous 
success  was  the  translation  of  One  Thousand  and 
One  Nights  at  the  beginning  of  the  18th  century, 
thus  creating  the  Oriental  literary  genre.  Mean¬ 
while,  Catholic  missionaries  created  the  first  sinol¬ 
ogy,  while  travelers  described  the  power  and 
organization  of  large  bureaucratic  Oriental  states 
under  the  name  of  despotism. 

The  great  novelty  of  the  18th  century  was  a 
willingness  to  accept  the  possibility  of  a  compara¬ 
tive  universal  history  of  all  human  societies.  The 
Orientalists  of  the  Enlightenment  were  first  and 
foremost  philologists  and  scholars,  but  their  works 
were  taken  up  and  popularized  by  philosophers. 
The  advent  of  social  sciences  at  the  end  of  the  cen¬ 
tury  introduced  an  increasingly  critical  aspect  of 
Oriental  realities.  While  they  had  a  scientific  foun¬ 
dation,  they  were  also  useful  in  legitimizing  the 
calling  into  question  of  an  Oriental  despotism, 
which  from  then  on  was  defined  as  a  source  of 
powerlessness  and  decadence. 

Comparative  studies  were  linked  to  the  appear¬ 
ance  of  the  idea  of  progress.  It  turned  the  past  of 
Europe  and  of  the  East  into  landmarks  according 


to  which  one  could  define  the  movement  of  his¬ 
tory.  After  1750,  the  East  was  defined  as  a  sort  of 
“past  in  the  present”  insofar  as  it  showed  a 
bygone  state  of  society  compared  with  the  new 
realities  of  Europe.  As  early  as  1780,  it  was  said 
that  Europe  was  the  future  of  the  East.  The  former 
was  now  able  to  take  over  or  dominate  the  big 
“Oriental”  countries  from  the  Mediterranean  to 
India  and  soon  to  China  and  Japan.  Its  expansion 
was  justified  as  being  either  a  liberation  or  an 
improvement  in  the  lot  of  the  peoples  subjected  to 
Oriental  despotism. 

While  the  “practical”  orientalism  of  the  “drag¬ 
omans” — translators  and  interpreters  of  Oriental 
realities — already  existed,  European  expansion 
implied  the  management  of  Oriental  populations 
and  thus  the  constitution  of  a  corpus  of  linguistic, 
legal,  and  social  knowledge,  crucial  for  European 
civil  and  military  administrators. 

Types  of  Orientalism 
Classical  Orientalism 

At  the  beginning  of  the  19th  century,  oriental¬ 
ism  was  divided  into  several  distinct  but  closely 
intertwined  types.  The  first  one  was  “classical” 
orientalism,  gathering  scholarly,  philological,  and 
archaeological  knowledge  about  ancient  Oriental 
societies.  It  was  the  product  of  several  phenome¬ 
nal  discoveries  of  lost  languages  and  civiliza¬ 
tions — deciphering  Egyptian  hieroglyphs  and 
cuneiform;  the  archaeological  discovery  of  Assyria, 
Sumer,  and  the  Hittites;  learning  of  Sanskrit;  and 
the  birth  of  modern  sinology  and  then  of  Japan 
studies. 

Universal  history  was  endowed  with  a  few  more 
millennia.  More  than  ever,  the  medieval  idea  of 
Egypt  and  the  Near  East  as  the  birthplace  of  human 
culture  prevailed.  The  history  of  human  progress 
had  several  versions.  According  to  the  first  one,  the 
arts  and  sciences  were  born  in  Egypt;  moved  to  the 
Near  East,  then  to  Greece  and  to  Rome,  and  from 
there  to  the  Arabs;  and  ended  up  in  Europe.  The 
second  version  started  from  the  discovery  of  a  kin¬ 
ship  between  European  and  Indian  languages  that 
served  to  define  the  presence  of  a  group  of  peoples 
called  Indo-German,  Aryan,  or  Indo-European. 
Simultaneously,  the  existence  of  a  “Semitic”  group 
was  constructed,  followed  by  that  of  a  “Touranian” 
group.  Those  definitions  quickly  took  on  a  racial 


1754  Orientalism 


nature  since  language  was  supposed  to  contain  a 
whole  definition  of  the  world.  The  opposition 
between  Semites  and  Indo-Europeans  was  thus 
used  to  explain  both  the  history  of  religions  and  the 
superiority  of  modern  Europe. 

Though  the  peoples  of  the  Far  East  were  not 
included  in  this  pattern  of  analysis,  the  existence 
of  two  large  cultural  areas  was  recognized — 
India  and  China — and  within  them,  so  was  the 
existence  of  distinct  national  personalities.  Thus, 
Vietnam  was  part  of  the  Chinese  sphere  of  civili¬ 
zation,  and  Cambodia  was  part  of  the  Hindu 
one,  but  each  had  its  respective  specific  identity. 
In  the  same  way,  French  Orientalists  discovered 
the  wonders  of  Angkor  and  gave  to  Cambodians 
a  sense  of  historical  continuity  that  they  did  not 
have  before. 

Modern  Orientalism 

The  second  type  is  modern  orientalism,  studying 
existing  Oriental  civilizations.  Their  precariousness 
was  known,  hence  the  necessity  of  describing 
them  before  their  potential  destruction  caused  by 
modernization/Westernization.  Scholars  rushed  to 
take  stock  of  the  large  corpus  of  texts  already 
defined  by  those  involved  as  “classics”  and  to  turn 
them  into  the  first  printed  scientific  editions.  If 
those  texts  were  already  known,  they  were  given  a 
more  precise  historical  context,  particularly  thanks 
to  philology  and  archaeology.  This  heritage- 
oriented  form  of  orientalism  ended  in  the  setting 
up  of  museums  for  Oriental  objects,  first  in 
Western  countries  and  then  in  the  countries  where 
the  works  originated. 

By  successive  waves,  the  Oriental  literary  and 
artistic  heritage  was  massively  introduced  into 
19th-century  European  and  Western  culture.  It 
was  one  of  the  prime  movers  of  major  aesthetic 
transformations  and  artistic  revolutions.  The  same 
applied  a  little  later  to  the  arts  of  Black  Africa. 
Thus,  artistic  modernity  will  largely  claim  an 
Oriental  element  for  itself. 

Practical  Orientalism 

The  third  type  is  the  practical  orientalism  of 
colonial  administrators.  If  the  latter  presented 
themselves  as  the  modernizers  of  the  societies  they 


dominated,  they  also  sought  to  unify  the  tradi¬ 
tional  legal  systems  by  resorting  to  the  ancient 
sources,  considered  as  the  most  authentic.  Thus, 
the  French  limited  the  field  of  common  laws  and 
undertook  the  codification  of  a  “Muslim”  law 
that  had  never  existed  as  such.  In  India,  the 
British  systematically  used  Sanskrit  sources,  which 
had  been  largely  forgotten  or  neglected.  More 
generally,  practical  Orientalists,  for  administra¬ 
tive  convenience,  tended  to  freeze  realities  that 
otherwise  would  have  been  naturally  subject  to 
continuing  change,  particularly  religious  and  eth¬ 
nic  definitions.  To  do  this,  they  relied  on  the 
works  of  modern  Orientalists,  while  defining 
their  superiority  over  the  peoples  they  adminis¬ 
tered  through  the  racial  discourse  stemming  from 
classical  orientalists. 

This  racial,  philology-based  discourse,  also 
called  linguistic  ethnology,  was  first  used  to 
understand  the  origins  of  Europe  and  of  Christi¬ 
anity.  It  was  then  used  as  a  classifying  tool  so  as 
to  define  the  different  ethnic  groups  as  they  were 
being  reviewed — hence  the  quasi-indefinite  exten¬ 
sion  of  the  Semitic  group  toward  Black  Africa  and 
of  the  Touranian  group  toward  Eastern  Asia.  A 
complete  correspondence  was  established  between 
linguistic  subgroups  and  cultural  entities,  by 
finally  endowing  them  with  a  history  of  their 
own.  The  prevailing  scientism  of  the  late  19th 
century  added  to  it  the  measures  established  by 
physical  anthropology,  particularly  by  craniol- 
ogy.  Some  of  the  premises  of  craniology  were 
pseudoscientific,  making  a  simple  equation 
between  measures  of  cranial  capacity  and  “intel¬ 
ligence,”  with  departures  from  cranial  measure¬ 
ments  of  Europeans  taken  as  evidence  of  racial 
primitivism  or  inferiority. 

The  racial  discourse  claimed  to  have  a  universal 
impact.  It  was  first  applied  to  Europe  even  if  it  was 
secondarily  used  to  lay  the  foundations  of  the 
legitimacy  of  Western  superiority  over  the  rest  of 
the  world. 

Nineteenth-century  orientalism  has  since  been 
reproached  as  having  an  “essentialist”  vision  of 
Oriental  peoples,  considered  as  unable  to  really 
change.  By  definition,  orientalism  is  a  “culturalist” 
approach,  because  it  is  founded  on  the  literary  and 
artistic  heritage  of  the  societies  involved.  At  the 
same  time,  it  sought  to  give  the  latter  a  meaning  by 
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striving  to  historicize  the  contents  of  this  heritage, 
thus  redefining  the  historical  paths  of  the  societies 
involved.  More  often  than  not,  those  paths  were 
defined  along  criteria  borrowed  from  European  his¬ 
tory — religious  tolerance,  promotion  of  sciences, 
and  so  on.  At  the  same  time,  it  was  also  necessary 
to  determine  the  causes  of  the  “backwardness  of  the 
East”  and  European  superiority,  that  is  to  say,  con¬ 
temporary  deficits.  The  usual  pattern  was  a  combi¬ 
nation  of  past  golden  age  and  present  failings. 

However  absorbed  in  attaining  a  mastery  of 
modern  knowledge,  the  19th-century  Orientalist 
was  also  a  creature  of  dialogue,  constantly  in  con¬ 
tact  with  his  informants,  who  were  Oriental 
learned  men,  his  partners  in  the  reading  of  the 
ancient  texts.  He  was  the  one  who  introduced  and 
accompanied  the  “modernist”  representatives  of 
Oriental  societies,  particularly  in  the  latter’s  travels 
in  the  West. 

Current  Issues 

Facing  the  social  sciences,  Orientalists  can  hardly 
hide  their  uneasiness.  They  have  the  feeling  that 
concepts  that  were  elaborated  to  fit  Western  realities 
are  almost  mechanically  applied  without  taking 
heed  of  Oriental  specificities.  Indeed,  Orientalists 
tend  to  reproduce  rather  than  criticize  the  dis¬ 
courses  that  Orientals  make  about  themselves, 
which  is  inevitable  in  any  culturalist  approach. 

While  Orientalists  have  given  all  the  required 
justifications  to  colonial  expansion,  they  have  also 
provided  the  arms  to  fight  it.  It  is  by  starting  from 
historical  definitions  produced  by  orientalism  that 
Oriental  nationalisms  were  constructed.  National¬ 
isms,  be  they  Arab,  Indian,  Turkish,  Persian,  or 
Indochinese,  have  largely  borrowed  their  historical 
and  national  discourses  from  European  Oriental¬ 
ists.  The  ancient  golden  ages  were  turned  into 
hopes  of  renewal,  and  current  failings  served  as  the 
basis  of  governmental  programs. 

In  the  20th  century,  the  role  of  Orientalists  con¬ 
stantly  diminished.  While  classical  orientalism  still 
lived,  it  was  mainly  about  heritage,  and  public 
access  was  largely  limited  to  exhibitions,  touristic 
travels,  and  translations  of  masterpieces  of  ancient 
literature.  Today,  many  of  its  missions  have  been 
taken  up  by  academics  of  the  countries  involved, 
and  Westerners  are  only  partners,  even  if  they  have 


sometimes  kept  a  training  function.  The  most 
burning  issue  is  the  demand  that  museums  and 
similar  institutions  in  Western  countries  return 
objects  of  cultural  significance  to  their  countries  of 
origin. 

Modern  orientalism  is  caught  in  a  contradictory 
position.  It  continues  its  work  of  reassessment  of 
historical  paths,  particularly  that  of  the  times 
directly  preceding  European  domination,  no  lon¬ 
ger  defined  as  decadent.  Historians  broaden  their 
concerns  to  economic  and  social  fields  and  also 
integrate  the  research  of  anthropology  and  social 
sciences.  It  is  no  longer  possible  to  talk  about  a 
purely  philological  and  textual  orientalism. 

Decolonized  countries  were  initially  plunged 
into  a  premature  and  turbulent  modernity,  often 
with  Marxist  overtones.  Orientalists  had  to  recon¬ 
sider  their  categories  of  analysis  to  be  able  to 
interpret  these  revolutionary  ruptures.  Newly 
independent  nations  came  to  see  the  moment  of 
independence  as  the  end  of  a  long  history  and  the 
promise  of  a  new  future. 

A  few  years  later,  the  majority  discourses  of  the 
countries  involved  have  shifted  toward  the  defense 
of  a  largely  imaginary  cultural  authenticity,  with  a 
discourse  about  the  superiority  of  the  endogenous 
over  the  exogenous,  the  restoration  of  Islamic, 
Indian,  or  Asian  values.  Islamist  movements, 
through  their  willingness  to  go  back  to  their  roots, 
have  reinvested  fields  that  were  formerly  those  of 
Islamologists.  Often,  Orientalists  are  faced  with 
reified  categories  of  their  own  knowledge,  which 
they  had  judged  to  be  obsolete  and  had  aban¬ 
doned.  Faced  with  Orientals  who  themselves 
resort  to  essentialism,  they  tend  either  to  remain 
silent  or  to  practice  contextualism  so  as  to  reject 
any  continuity  between  the  past  and  the  present 
time.  The  word  orientalism  has  become  the  equiv¬ 
alent  of  an  insult.  In  its  turn,  it  has  become  an 
ahistorical  essence  and  a  polemical  accusation. 

Practical  orientalism  disappeared  with  colonial 
administrators.  It  is  not  certain  whether  the  experts 
working  for  the  states  and  nongovernmental  orga¬ 
nizations  as  well  as  those  responsible  for  develop¬ 
ment  have  a  better  knowledge  of  the  societies 
involved.  Globalization  seems  to  bring  peoples 
together,  but  the  progress  of  communications 
reduces  long-term  stays  and  “immersions”  “on  the 
spot.”  International  English  and  its  impoverished 
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vocabulary  are  not  the  equivalent  of  the  former 
knowledge  of  “indigenous”  languages. 

In  its  lasting  existence,  orientalism  has  claimed 
to  be  a  tool  in  the  constitution  of  the  universal. 
Comparative  studies  introduced  the  delicate  ques¬ 
tion  of  the  interpretation  of  what  is  different.  Of 
course,  Orientalists  have  served  politicians,  but 
they  have  also  significantly  shaped  our  vision  of 
our  globalized  universe. 

Henry  Laurens 
College  de  France 
Paris,  France 

See  also  Buddhism;  Culturalism;  Hinduism;  Islam 
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Pacifism 


Pacifism  is  both  a  political  doctrine  aimed  at  pro¬ 
moting  or  preserving  peace  and  a  political  behavior 
adopted  by  opponents  of  violence  and  particularly 
of  war  as  a  means  of  settling  disputes.  The  belief 
that  violence  is  unacceptable  and  should  be 
avoided  has  presumably  existed  throughout 
history  and  has  been  theorized  over  the  past  2  mil¬ 
lennia  by  diverse  political,  philosophical,  and 
theological  thinkers.  Organized  political  opposi¬ 
tion  to  violence  has  existed  since  the  generaliza¬ 
tion  of  conscription  during  the  19th  century  and 
took  place  during  both  World  Wars.  The  elabora¬ 
tion  of  the  concept  of  pacifism  dates  back  to  the 
beginning  of  the  20th  century  and  encompasses  a 
variety  of  political  thoughts  and  attitudes. 

Pacifists  differ  on  issues  such  as  the  inevitability 
and,  more  generally,  the  causes  of  war;  the  accept¬ 
ability  of  violence  as  a  suitable  means  toward 
peace;  how  to  prevent  war  and  resolve  conflicts; 
and  the  very  definition  and  conditions  of  peace 
itself.  Those  described  as  unconditional  or  abso¬ 
lute  pacifists  are  committed  to  a  principled  rejec¬ 
tion  of  violence,  considered  as  deontologically 
wrong.  Relative,  conditional,  contingent,  or  prag¬ 
matic  pacifists — sometimes  called  pacificists — 
adopt  a  consequentialist  view  of  pacifism;  they 
consider  violence  to  be  counterproductive  but 
believe  that  as  a  last  resort,  force  is  occasionally 
justified  to  advance  the  cause  of  peace.  Besides 
the  diversity  of  pacifist  conceptions  and  behav¬ 
iors,  their  historical  evolutions,  and  their  mutual 


influences  and  intricacies,  one  can  distinguish 
between  thinkers  who  concentrate  on  the  individ¬ 
ual  and  conceive  pacifism  as  an  appeal  to  people’s 
sense  of  fundamental  human  values  and  virtues 
and  those  who  focus  on  pacifism  as  a  goal  for  sys¬ 
temic,  societal  change,  be  it  at  a  national,  regional, 
or  global  level. 

The  Appeal  to  Individual  Morality 

According  to  thinkers  privileging  an  individual 
conception  of  peace,  violence  and  war  are  inherent 
to  human  nature  but  are  morally  wrong  and  can 
be  reduced  by  promoting  individual  virtues  and 
morality.  This  conception  is  shared  by  many  polit¬ 
ical  and  religious  thinkers. 

Major  religions  advocate  peace  and  mutual 
respect  among  human  beings.  However,  both  his¬ 
torical  circumstances  and  religious  exclusiveness 
have  exacerbated  tensions  between  and  within 
religious  communities  and  greatly  influenced  theo¬ 
logical  dogmas.  No  religion  has  been  exempted 
from  violence  justified  by  religious  purpose,  and 
their  positions  range  from  absolute  pacifism  and 
nonviolence  to  just  and  even  holy  war. 

All  dharmic  religions  promote  nonviolence 
( ahimsa ),  embodied  by  Gandhi,  but  to  different 
extents.  Buddhism  and  Jainism  condemn  killing  in 
any  form,  be  it  human  or  not,  and  Buddhism 
defends  nonresistance  through  tolerance  to  the 
enemy.  Hinduism — and  later  Sikhism — justify  war 
in  defense  of  good,  justice,  and  righteousness. 
When  order  and  law  have  failed  to  maintain  peace 
and  security,  just  war  (dharma  yuddha )  is  to  be 
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waged  against  those  responsible,  either  Hindus  or 
non-Hindus,  in  order  to  maintain  peace  and  secu¬ 
rity  within  cosmic  order.  Warriors  must  belong  to 
specific  castes  devoted  to  military  action  and  have 
to  respect  the  laws  of  war  designed  to  ensure  that 
warfare  is  conducted  in  a  fair  way.  The  use  of 
weapons  that  cause  unnecessary  pain  is  prohibited, 
civilians  shall  not  be  harmed,  and  territory  shall 
not  be  annexed. 

Taoist  religions  do  not  rely  on  an  absolute  pre¬ 
cept  of  nonviolence  but  rather  consider  violence  to 
be  a  form  of  weakness  that  induces  more  violence 
and  is  thus  counterproductive.  Inaction  (wu  ivei) 
is  hence  privileged,  and  if  conflict  cannot  be 
avoided,  one  should  try  to  resolve  it  without  a 
direct  confrontation.  War  can,  however,  be  justi¬ 
fied  in  self-defense  or  when  a  morally  evil  leader 
must  be  punished  to  deliver  oppressed  and 
exploited  people. 

Abrahamic  religions  praise  peace  while  provid¬ 
ing  ambiguous  answers  to  the  question  of  violence, 
and  they  have  developed  the  concept  of  just  war  as 
a  major  issue.  The  Hebrew  Bible  conveys  an  omni¬ 
present  violence  sent  by  God  and  addressed  in  ritu¬ 
als  and  laws,  as  well  as  longings  for  peace.  Judaism 
is  committed  to  the  ideal  of  peace  and  well-being 
(shalom)  and  provides  containment  rules  such  as 
the  lex  talionis  of  “an  eye  for  an  eye,”  which  aims 
at  preventing  escalation  of  violence,  and  prohibits 
murder  (“You  shall  not  kill,”  Exodus  20:13). 
However,  God  sanctions  and  even  commands  holy 
wars  ( milhemet  mitzvab)  that  shall  lead  to  the 
annihilation  of  the  enemy  of  the  people  of  Israel 
{herein).  Violence  and  war  thus  remain  suitable 
options  for  self-defense  and  survival  but  should 
not  be  engaged  in  for  proselytizing,  revenge,  or 
unprovoked  aggression. 

The  New  Testament  emphasizes  individual 
morality  and  relates  the  teachings  of  Jesus,  such  as 
those  typified  by  the  Sermon  on  the  Mount 
(“Blessed  are  the  peacemakers,  for  they  shall  be 
called  sons  of  God,”  Matthew  5:9).  Early 
Christianity  promoted  nonviolent  resistance  and 
martyrdom  until  it  became  the  official  religion  of 
the  Roman  Empire  under  Emperor  Constantine. 
Theologians  such  as  Saint  Augustine  and  later 
Francisco  de  Vitoria  developed  the  just  war  theory, 
which  establishes  the  right  to  wage  war  in  certain 
circumstances  (jus  ad  bellum)  and  specifies  accept¬ 
able  forms  of  warfare  (jus  in  bello).  If  defensive 


wars  are  just  wars  by  nature,  offensive  wars  can  be 
considered  so  if  they  are  waged  by  legitimate 
authorities,  for  the  righting  of  wrongs,  and  with 
common  good  intentions.  This  doctrine  has  been 
used  to  justify  holy  wars  conducted  to  recover  the 
Holy  Land  from  Muslim  domination  during  the 
Reconquista  and  the  Crusades,  persecutions  of 
heretics  during  the  Protestant  Reformation,  and 
imperial  conquests.  It  has  been  contested  by  many 
reformed  churches  such  as  Brethren,  Anabaptists, 
and  Quakers,  who  advocate  unconditional  paci¬ 
fism  through  conscientious  objection  and  nonresis¬ 
tance.  Over  the  past  decades,  the  Roman  Catholic 
Church  has  narrowed  just  war  doctrine  to  the 
legitimation  of  defensive  wars  and  the  promotion 
of  the  right  to  interfere  in  case  of  human  rights 
violations,  but  some  theologians  commend  abso¬ 
lute  pacifism  on  the  grounds  that  modern  tech¬ 
nology,  including  nuclear  weapons,  prevents 
proportionate  confrontations  and  generates  intol¬ 
erable  casualties. 

The  Koran  rests  on  the  notion  of  effort  (jihad). 
All  Muslims  must  spiritually  fight  to  overcome 
self-centeredness  (jihad  akbar,  or  greater  jihad). 
They  also  have  to  extend  the  Islamic  community 
through  preaching  and  education,  but  holy  war 
for,  or  in  defense  of,  Islam  (jihad  bis-saif)  must  be 
carried  out  according  to  strict  rules  and  is  not  rec¬ 
ognized  by  some  Muslim  movements  such  as 
Sufism.  Believers  shall  make  every  effort  to  ending 
conflict,  including  restorative  justice  and  peace¬ 
making  ( sulh ). 

The  significance  of  individual  morality  in  pre¬ 
serving  and  defending  peace  is  shared  by  many 
Western  political  thinkers,  but  most  of  them,  from 
Plato  to  Aristotle  or  later  Kant  and  Baron  de 
Montesquieu,  include  it  in  a  broader  reflection  on 
natural  law,  society,  state,  or  humanity.  Writers 
such  as  Henry  Thoreau,  Leo  Tolstoy,  and  Romain 
Rolland  laud  individual  nonviolence.  Psycho¬ 
analysts  like  Sigmund  Freud  urge  that  aggressive¬ 
ness  be  channeled  through  the  development  of 
emotional  and  most  of  all  cultural  links  between 
individuals.  Some  absolute  pacifists  condemn  all 
kinds  of  taking  of  life,  including  capital  punish¬ 
ment,  euthanasia,  abortion,  meat  eating,  animal 
cruelty,  or  killing  in  self-defense.  On  the  other 
hand,  philosophers  such  as  Niccolo  Machiavelli, 
Thomas  Hobbes,  and  Jean-Jacques  Rousseau 
argue  that  war  can  sometimes  be  useful,  and  a  few 
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glorify  it  as  a  potential  for  exalting  either  the  indi¬ 
vidual  (Friedrich  Nietzsche)  or  the  nation  and/or 
the  state  (Johann  Gottlieb  Fichte,  Georg  Wilhelm 
Friedrich  Hegel). 

The  Need  for  Systemic  Change 

Many  thinkers  conceive  of  pacifism  at  a  more  sys¬ 
temic  level  and  insist  on  the  necessity  of  transform¬ 
ing  society  at  various  levels  to  alleviate  or  eliminate 
violence.  They  differ  on  the  means  to  be  used  for 
such  a  transformation,  with  proposals  ranging 
from  free  trade  to  political  regime,  federalism, 
international  law,  collective  security,  disarma¬ 
ment,  class  solidarity,  or  social  mobilizations. 

In  opposition  to  mercantilists  who  argue  that 
war  can  be  necessary  for  the  consolidation  of  states, 
utilitarian  philosophers  such  as  David  Hume  or 
Jeremy  Bentham  insist  on  the  cost  of  war  and  asso¬ 
ciate  virtue  and  interest  to  show  how  free  trade  is 
useful  both  for  stimulating  economic  health  and  for 
pacifying  behaviors,  as  exchanges  promote  a  shared 
belonging  to  the  same  community. 

As  political  systems,  empires  can  ensure  an 
enduring  peace  but  are  not  considered  to  be  paci¬ 
fist  political  systems  because  they  maintain  a 
domination  of  their  subjects.  Kant  grounds  his 
project  for  a  “perpetual  peace”  on  three  levels  of 
action,  as  peace  occurs  from  a  moral  individual 
choice,  best  guaranteed  by  a  republican  regime, 
which  represents  and  respects  people’s  choice,  and 
at  a  more  global  level  by  a  federation  of  republican 
states  on  a  cosmopolitan  basis. 

This  insistence  on  political  regimes  is  shared  by 
later  advocates  of  the  democratic  peace  theory, 
according  to  which  democracies  do  not  go  to  war 
with  one  another,  unlike  potentially  warlike  auto¬ 
cratic  regimes  driven  by  dynastic  ambitions  or 
power  politics.  However,  some  elitist  thinkers 
regard  democracies  as  potentially  bellicose  because 
their  use  of  conscription  and  the  irrationality  of 
the  masses  can  support  nationalism  and  thus  go 
beyond  their  recognition  of  a  people’s  right  to  self- 
determination.  Federalism  is  also  seen  by  numer¬ 
ous  political  thinkers  as  an  answer  for  preserving 
and  promoting  regional  or  universal  peace  through 
a  federation  of  states  that  share  executive,  legisla¬ 
tive,  and  judicial  powers  and  in  some  cases  a  com¬ 
mon  army,  giving  rise  to  contemporary  reflections 
that  could  ultimately  lead  to  regional  integration. 


International  law  extends  the  just  law  doctrines 
of  jus  ad  bellum  and  jus  in  bello  to  establish  com¬ 
mon  rules  to  prevent,  reduce,  or  judge  violence 
between  international  actors.  Since  the  end  of  the 
19th  century,  various  strategies  have  been  tested, 
such  as  arbitration,  conciliation,  mediation,  trea¬ 
ties  (e.g.,  the  Briand-Kellogg  Pact,  signed  in  1928, 
which  forbids  war),  or  international  courts  (e.g., 
the  International  Criminal  Court).  Pacifist  associa¬ 
tions  from  various  countries  actively  supported  the 
creation  of  the  League  of  Nations  (1919)  and  of 
the  United  Nations  (1945),  based  on  the  principle 
of  collective  security,  and  incomplete  disarmament 
initiatives. 

Claims  for  solidarity  among  the  people  across 
state  boundaries  spread  among  leftist  workers 
movements  and  trade  unions  throughout  the  19th 
century.  For  most  of  them,  war  mainly  constrains 
the  working  classes  and  benefits  the  dominant 
classes.  Their  pacifism  therefore  associates  interna¬ 
tionalism  with  antimilitarism,  since  they  advocate 
general  strike  against  militarism  as  conveyed  by  the 
lyrics  of  The  Internationale  anthem,  written  in  1871 
( “The  kings  make  us  drunk  with  their  fumes  /  Peace 
among  ourselves,  war  to  the  tyrants!  /  Let  the 
armies  go  on  strike  /  Guns  in  the  air,  and  break 
ranks”).  However,  Marxists  believe  that  peace  is  a 
remote  goal,  as  the  working-class  mobilization  for  a 
classless  society  will  abolish  war  through  the  decline 
of  the  state,  but  only  through  the  use  of  violence. 

Since  1945,  social  mobilizations  have  developed 
new  arguments  for  peace  on  the  grounds  of  the 
existence  of  a  nuclear  threat  endangering  the  very 
existence  of  humanity,  be  it  through  political  mobi¬ 
lizations,  some  of  them  led  by  communist  interna¬ 
tionalism  (World  Peace  Council)  during  the  Cold 
War,  or  the  creation  of  environmental  nongovern¬ 
mental  organizations  concerned  with  broader  eco¬ 
logical  purposes. 

Rejection  of  violence  may  be  considered  as  one 
of  the  most  commonly  shared  political  ideas 
throughout  history,  though  it  takes  widely  differ¬ 
ent  shapes,  which  make  its  identification  complex 
and  probably  contributes  to  its  not  playing  a 
major  role  within  political  theory. 
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Panel  data  consist  of  a  cross-section  of  “individu¬ 
als”  for  which  there  are  repeated  observations 
over  time.  Individuals  can  be  any  cross-sectional 
unit  of  analysis,  such  as  states,  dyads,  or  survey 
respondents.  Panel  data  sets  are  typically  dichoto¬ 
mized  between  long  panels,  which  have  many 
measurement  occasions  relative  to  the  size  of  the 
cross  section,  and  short  panels,  which  have  many 
individuals  in  the  cross  section  relative  to  the 
number  of  repeated  measurement  occasions,  or 
“waves.”  In  general,  the  methods  associated  with 
the  term  panel  data  analysis  or  longitudinal  analy¬ 
sis  focus  on  short  panels,  while  methods  under  the 
time-series  cross  section  umbrella  focus  more  on 
analyzing  long  panels.  The  key  advantage  of  panel 
data  is  that  such  data  offer  the  opportunity  to  bet¬ 
ter  evaluate  causal  propositions  than  strictly 
cross-sectional  data.  Whereas  cross-sectional  data 
only  allow  the  researcher  to  observe  covariances, 
panel  data  further  allow  the  researcher  to  observe 


whether  a  change  in  an  input  precedes  a  change  in 
the  outcome.  In  other  words,  since  panel  data 
consist  of  the  same  individuals  over  time,  the  ana¬ 
lyst  can  observe  a  shift  in  responses  as  a  reaction 
to  an  input.  One  example  would  be  an  evaluation 
of  whether  a  state’s  present  behavior  responds  to 
the  prior  behavior  of  its  neighbors.  Another  exam¬ 
ple  might  be  using  a  survey  panel,  such  as  those 
often  incorporated  into  the  American  National 
Election  Studies,  to  assess  how  partisan  strength 
influences  campaign  interest  over  time.  Panel  data 
have  a  number  of  features  that  can  pose  chal¬ 
lenges  to  analysts.  These  issues  include  unit 
effects,  serial  correlation,  heteroskedasticity,  and 
contemporaneous  correlation.  Panels  also  have 
special  problems  of  missingness.  The  remainder  of 
this  entry  focuses  on  these  issues  and  some  reme¬ 
dies  for  each. 

Unit  Effects 

Whenever  individuals’  mean  responses  differ,  unit 
effects  are  present  in  the  data.  Unit  effects  can  pose 
serious  problems  for  inference  as  failure  to  account 
for  them  in  some  way  can  produce  bias  in  esti¬ 
mates  akin  to  omitted  variable  bias.  If  the  mean 
response  varies  cross-sectionally  via  unobserved 
unique  means,  but  this  difference  is  not  modeled 
(and  thereby  left  in  the  error  term),  then  any  cross- 
sectionally  varying  covariate  will  correlate  with  the 
error  term.  Such  a  situation  produces  endogeneity 
bias,  that  is,  the  independent  variable  is  correlated 
with  the  error  term  in  the  model’s  coefficients. 

In  the  econometric  tradition,  two  approaches 
are  widely  used  to  handle  unit  effects.  One  is  the 
fixed-effects  model,  typically  estimated  with  least 
squares  dummy  variables  (LSDV).  This  approach 
estimates  the  desired  model  using  ordinary  least 
squares  (OLS),  including  dummy  variables  for  each 
individual,  save  a  reference  individual.  This 
approach  has  the  advantage  of  being  computation¬ 
ally  simple  and  accounting  for  a  known  source  of 
variance  in  the  model  specification.  However,  indi¬ 
vidual  dummies  are  perfectly  collinear  with  any 
variable  that  varies  only  cross-sectionally.  Hence, 
LSDV  precludes  the  inclusion  of  time-invariant 
variables  in  a  model.  An  alternative  that  does  allow 
time-invariant  covariates  is  a  generalized  least 
squares  (GLS)  model  with  a  compound  symmetry 
covariance  structure,  known  as  a  random-effects 
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model.  This  model  recognizes  that  repeated  obser¬ 
vations  will  covary,  so  the  estimator  accounts  for 
this  structure  by  including  a  term  that  forces  all 
repeated  observations  to  correlate  at  a  constant 
level  with  each  other. 

It  should  be  noted  that  the  terms  fixed  and  ran¬ 
dom  effects  have  multiple  meanings.  Econometri¬ 
cians  typically  call  a  LSDV  model  a  fixed-effects 
model  and  a  GLS  compound  symmetry  model  a 
random-effects  model.  These  terms  take  a  different 
meaning  when  analyzing  data  from  the  view  of 
hierarchical  modeling.  Specifically,  a  fixed  effect 
refers  to  any  model  quantity  estimated  in  the  fitting 
of  a  model  (i.e.,  obtained  via  least  squares  or  max¬ 
imum  likelihood),  while  a  random  effect  refers  to 
any  parameter  that  is  unique  to  the  individual  but 
can  be  predicted  separately.  Mixed-effects  models 
contain  both  fixed  and  random  effects.  Confusion 
can  arise  because  a  random-effects  model  is  a  spe¬ 
cial  case  of  a  mixed-effects  model.  For  example,  the 
general  form  of  a  linear  mixed-effects  model  is 

Yjj  =  X[j?>  +  z\pi  +  e,y, 

where  Y;/  is  the  response  value  for  individual  i  at 
time  /,  X.  is  the  vector  of  all  covariate  values  for 
individual  i  at  time  /,  3  is  a  vector  of  fixed  effects — 
coefficients  that  apply  to  all  individuals,  Z;.  is  a 
subset  of  X  that  may  include  any  time-varying 
covariate  or  a  constant,  b ,  is  a  vector  of  random 
effects  for  individual  i,  and  etj  is  the  error  term  for 
individual  i  at  time  /.  One  special  case  would  be  a 
model  in  which  there  is  only  a  random  intercept, 
which  becomes 

Y„  =  X'fi  +  b,  +  eir 

Each  b{  is  not  estimated  directly  in  the  fitting  of 
the  model  but  can  be  predicted  using  empirical 
Bayes  techniques.  By  decomposing  the  unexplained 
variance  into  b,  and  etj,  which  are  independent  of 
each  other,  the  model  successfully  accounts  for  dif¬ 
ferences  in  the  mean  responses  for  individuals  and 
the  necessary  correlation  among  observations. 
Hence,  the  random-effects  model  is  seen  as  a  spe¬ 
cial  case  of  the  more  general  mixed-effects  model. 

There  are  several  practical  considerations  when 
deciding  how  to  control  for  unit  effects  in  a  longi¬ 
tudinal  model.  Again,  fixed-effects  models  cannot 
include  time-invariant  covariates.  Further,  when 


the  number  of  individuals  is  large,  especially  rela¬ 
tive  to  the  number  of  waves,  then  estimating  a 
LSDV  model  is  inefficient.  An  alternative  fixed- 
effects  estimator  to  LSDV  is  the  within  estimator, 
wherein  the  outcome  variable  and  the  covariates 
are  all  rescaled  as  deviations  from  an  individual’s 
mean  of  the  variable.  The  within  estimator  avoids 
the  inefficiency  of  estimating  unique  intercepts  for 
each  individual  and  yields  the  same  coefficient  esti¬ 
mates  at  LSDV;  however,  just  like  LSDV,  it  cannot 
accommodate  time-invariant  covariates. 

The  model  of  random  effects  for  units  allows 
for  time-invariant  covariates  and  avoids  the  inef¬ 
ficiency  problem  that  could  emerge  from  LSDV. 
Hence,  with  especially  short  panels  or  any  model 
for  which  the  effects  of  time-invariant  covariates 
are  to  be  estimated,  random-effects  models  are 
probably  the  most  practical  option.  However,  this 
model  assumes  that  unit  effects  are  independent  of 
all  covariates.  If  the  unit  effects  are  correlated  with 
any  of  the  input  variables,  then  the  random-effects 
model  is  biased  and  inconsistent.  Whether  or  not 
independent  unit  effects  is  a  fair  assumption  can  be 
evaluated  with  a  Hausman  test,  under  which  the 
null  hypothesis  is  that  the  unit  effects  are  indepen¬ 
dent,  implying  that  a  random-effects  model  is  con¬ 
sistent.  Rejection  of  this  null  hypothesis  implies 
that  the  random-effects  model  has  problems  of 
endogeneity  bias.  As  a  final,  practical  point  on 
random-effects  models,  GLS  models  such  as  this 
require  the  analyst  to  specify  how  the  errors  of  the 
model  are  correlated.  However,  the  true  correla¬ 
tion  between  the  errors  of  individuals’  repeated 
measurements  is  unknown,  so  feasible  GLS  must 
be  used.  Feasible  GLS  (FGLS)  is  estimated  with  a 
multistep  procedure  whereby  residuals  of  an  initial 
model  are  used  to  estimate  the  correlation  of 
errors,  which  is  then  inserted  into  a  GLS  estimator. 
For  instance,  the  Cochrane-Orcutt  FGLS  estimator 
repeats  this  iterative  process  until  the  estimate  of 
correlation  of  errors  ceases  to  improve.  All  of  this 
suggests  that  analysts  must  carefully  weigh  the 
structure  of  their  data  and  the  goals  of  their  model 
when  choosing  how  best  to  handle  unit  effects. 

Serial  Correlation,  Heteroskedasticity, 
and  Contemporaneous  Correlation 

Serial  correlation  refers  to  the  fact  that  repeated 
observations  on  the  same  individual  are  highly 
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correlated.  In  general,  this  correlation  tends  to  be 
large  and  positive  but  diminishes  as  the  time 
between  measurements  increases.  Serial  correlation 
violates  the  OLS  assumption  of  uncorrelated  errors. 
The  solutions  to  this  problem  resemble  the  fixes  for 
unit  effects.  One  solution  is  to  include  a  lagged 
response  as  a  covariate,  as  this  term  often  accounts 
for  serial  correlation  and  makes  the  remaining 
errors  independent.  Lagged  outcome  variables  are 
more  commonly  used  for  long  panels  because  the 
first  wave  of  observations  cannot  be  modeled  with 
this  approach,  which  is  more  costly  when  repeated 
observations  are  scarce.  (It  should  be  noted  that 
while  many  argue  that  a  lagged  response  most 
effectively  accounts  for  unit  effects  and  serial  cor¬ 
relation,  others  maintain  that  an  endogeneity  bias 
can  occur  if  the  lagged  term  does  not  filter  all  of  the 
serial  correlation.)  Another  solution  is  to  estimate 
a  GLS  model  that  includes  a  covariance  pattern 
matrix,  which  estimates  the  covariances  between 
each  pair  of  time  waves:  The  matrix  may  be 
unstructured  or  defined  by  a  clear  pattern,  such  as 
first-order  autoregressive.  Finally,  mixed-effects 
models  produce  correlation  matrices  based  on  the 
variances  and  covariances  of  the  random  effects. 
Thus,  a  pattern  of  correlation  can  also  be  captured 
by  random  effects.  It  should  be  noted  that  in  par¬ 
ticularly  short  panels  (e.g.,  three  waves),  serial  cor¬ 
relation  can  be  hard  to  account  for  with  any  of 
these  methods:  A  lagged  dependent  variable  costs 
one  wave  of  data,  covariance  pattern  matrices 
more  complex  than  a  simple  random-effects  model 
can  be  difficult  to  estimate,  and  very  short  panels 
do  not  allow  for  a  lot  of  random  parameters. 

The  methods  for  covariance  patterns  and  ran¬ 
dom  effects  also  can  be  incorporated  into  the  gen¬ 
eral  linear  model  framework,  which  means  that 
remedies  for  unit  effects  and  serial  correlation  also 
can  be  used  for  limited  dependent  variables  (e.g., 
counts  or  binary  outcomes).  Marginal  models, 
estimated  with  the  generalized  estimating  equa¬ 
tions,  require  the  analyst  to  specify  how  repeated 
observations  are  associated  and  thereby  resemble 
the  covariance  pattern  GLS  model  for  continuous 
outcomes.  General  linear  mixed-effects  models 
incorporate  random  effects  into  the  specification 
and  account  for  the  correlation  of  repeated  obser¬ 
vations  through  the  random  effects. 

Heteroskedasticity  can  be  present  in  panel  data 
if  the  unmodeled  variance  in  outcomes  differs  from 


one  individual  to  the  next.  This  problem  can  be 
addressed  through  a  GLS  estimator  that  allows  for 
unique  variances  among  individuals,  in  addition  to 
the  correlation  pattern.  Contemporaneous  correla¬ 
tion  arises  when  individuals  have  similar  errors  at 
particular  times.  This  may  arise  because  some 
time-dependent  factor  is  simultaneously  influencing 
all  individuals.  In  the  presence  of  contemporane¬ 
ous  correlation,  the  error  variance  of  linear  coef¬ 
ficient  estimates  increases  relative  to  the  estimates 
of  coefficients  when  Gauss-Markov  assumptions 
hold.  Regular  standard  errors  do  not  account  for 
this  inefficiency,  however.  Rather,  panel-corrected 
standard  errors  will  better  account  for  the  larger 
error  variance,  thereby  making  statistical  inference 
on  coefficient  estimates  less  prone  to  Type  I  errors 
(rejecting  the  null  hypothesis  when  it  is  true). 

Missing  Data 

With  panel  data,  a  key  concern  is  that  measuring 
each  individual  at  every  wave  of  observations  may 
not  be  possible.  One  reason  for  this  may  be  censor¬ 
ing  that  arises  from  the  structure  of  the  study.  For 
example,  if  different  individuals  were  recruited  to 
participate  in  a  study  with  staggered  start  times, 
but  the  study  ended  simultaneously  for  all,  then 
late  joiners  would  have  fewer  repeated  observa¬ 
tions.  In  this  situation,  the  nonobservance  of  later 
waves  for  late  joiners  would  be  missing  completely 
at  random  (MCAR),  as  qualities  of  the  individuals 
had  no  bearing  on  how  often  they  were  observed. 
In  this  case,  as  with  any  panel  data  with  observa¬ 
tions  MCAR,  the  data  could  be  analyzed  by  com¬ 
plete-case  analysis  (analyzing  only  cases  for  which 
all  waves  are  observed)  or  available-data  analysis 
(i.e.,  methods  that  do  not  require  response  vectors 
of  equal  length). 

Another,  more  serious  cause  of  missing  data  in 
panels  is  attrition  (also  called  dropout  or  panel 
mortality).  Individuals  who  are  part  of  the  study 
may  choose  not  to  participate  after  a  few  observa¬ 
tions,  or  the  researcher  may  lose  track  of  individu¬ 
als  and  be  unable  to  reach  them  for  further  study. 
Dropout  specifically  refers  to  the  situation  where, 
once  an  individual  goes  unobserved  in  one  wave, 
he  or  she  is  not  observed  in  any  future  wave. 
Whereas  data  that  are  missing  due  to  censoring  are 
nearly  always  MCAR,  data  missing  due  to  dropout 
may  not  fit  this  criterion.  If  the  data  are  at  least 


Paradigms  in  Political  Science  1763 


missing  at  random  (MAR,  meaning  that  the  prob¬ 
ability  of  missingness  is  conditional  only  on  observ¬ 
able  information),  then  imputation  methods  can 
yield  unbiased  estimates  of  model  quantities.  Many 
studies  impute  missing  values  from  dropout  by 
assuming  all  missing  values  of  a  response  are  equal 
to  the  last  observed  value.  This  method  assumes, 
however,  that  the  responses  would  not  have 
changed  since  dropout,  which  is  usually  unrealistic. 

A  better  alternative  is  multiple  random  imputa¬ 
tion.  One  technique  for  multiple  random  imputa¬ 
tion  is  to  model  the  probability  of  missingness  and 
match  missing  observations  with  observed  obser¬ 
vations  that  have  similar  probabilities  of  being 
missing,  randomly  drawing  several  observations 
with  similar  probabilities  of  being  missing  to 
impute  the  missing  value.  A  second  technique  for 
multiple  random  imputation  is  to  model  the  value 
of  an  observation  at  a  particular  time  with  the 
observed  data  and  impute  a  value  for  the  missing 
observation  that  is  computed  with  known  infor¬ 
mation  about  the  subject  plus  a  random  distur¬ 
bance.  For  individuals  who  later  return  to  the 
study,  observations  at  later  waves — as  well  as  early 
waves — should  be  used  to  impute  missing  middle 
values. 

As  a  final  consideration  for  attrition,  a  researcher 
may  choose  to  refresh  the  sample  by  adding  new 
observations  toward  the  end  of  the  study,  these 
new  observations  being  called  a  refreshment  sam¬ 
ple.  Though  this  strategy  does  not  directly  remedy 
the  problem  of  uneven  panels,  it  does  prevent  the 
sample  size  from  shrinking  too  much  when  con¬ 
structing  an  overall  response  profile.  Furthermore, 
refreshment  allows  the  researcher  to  diagnose  the 
severity  of  panel  effects.  This  can  be  done  by  com¬ 
paring  variable  means  from  the  refreshment  sam¬ 
ples  with  the  means  of  those  still  in  the  panel  at  a 
given  wave  to  see  how  dropout  is  influencing  the 
makeup  of  the  sample.  Furthermore,  the  process  of 
being  part  of  a  panel  study  may  influence  an  indi¬ 
vidual’s  response  over  time,  a  process  called  panel 
conditioning.  Refreshment  samples  allow  for  the 
possibility  of  adjusting  for  panel  effects  through 
techniques  such  as  fractional  pooling  or  two-stage 
auxiliary  instrumental  variables. 

James  E.  Monogan  III 
Washington  University  in  St.  Louis 
St.  Louis,  Missouri,  United  States 
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Paradigms  in  Political  Science 


A  paradigm  is  a  constellation  of  scientific  theories, 
values,  and  methods  shared  by  the  members  of  the 
scientific  community,  forging  a  disciplinary  matrix 
and  excluding  any  other  theory.  The  word  is  of 
Greek  origin  and  emerged  in  philosophical  litera¬ 
ture  in  the  15th  century.  The  concept  of  a  para¬ 
digm  was  formulated  by  Thomas  Kuhn  in  his 
book  The  Structure  of  Scientific  Revolutions 
(1962).  Today,  this  concept  is  accepted  in  the  his¬ 
tory  of  natural  sciences  and  in  the  sociology  of 
science.  It  is  also  frequently  used  in  political  sci¬ 
ence  as  in  all  social  sciences  in  spite  of  the  fact  that 
Kuhn  explained  in  the  preface  of  his  book  that  its 
use  is  not  justified  in  the  social  sciences.  In  1965, 
three  years  after  the  publication  of  The  Structure 
of  Scientific  Revolutions,  the  American  Political 
Science  Association  (APSA)  president,  David 
Truman,  thought  that  the  paradigmatic  explana¬ 
tion  of  scientific  progress  was  not  applicable  to 
political  science.  Since  then,  the  concept  of  a  para¬ 
digm  has  nonetheless  seduced  many  political  scien¬ 
tists  but  much  less  so  among  comparativists  than 
among  other  scholars.  The  word  has  even  spread 
to  mass  media.  Today  it  may  be  too  late  to  expel 
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this  word  from  the  lexicon,  but  it  can  be  specified. 
The  basic  question  is  the  following:  In  political  sci¬ 
ence,  are  there  sudden  overturnings,  scientific 
breakthroughs  involving  a  tabula  rasa,  compara¬ 
ble  with  those  engendered  by  Nicolaus  Copernicus, 
Isaac  Newton,  Charles  Darwin,  or  Louis  Pasteur? 
For  the  following  reasons,  the  answer  to  such  a 
question  cannot  be  affirmative.  What  is  involved 
here  is  the  identity  of  political  science  as  a  science 
focusing  on  social  diversity,  social  change,  and 
cumulative  growth  and  knowledge. 

In  all  sciences,  the  extent  of  the  innovation  that 
any  individual  can  produce  is  necessarily  limited, 
for  each  individual  must  employ  in  research  the 
tools  that  he  or  she  acquires  and  cannot  in  his  or 
her  own  lifetime  replace  them  all  (Kuhn,  1970). 
Progress  does  not  arise  in  a  vacuum  but  develops 
out  of  the  scientific  patrimony.  It  is  difficult  to 
find  in  the  social  sciences  a  virgin  domain.  Every 
decade  that  passes  adds  layer  on  layer  to  the 
patrimony.  New  explanations  supersede  older  inter¬ 
pretations.  Many  examples  of  cumulative  progress 
can  be  given.  Even  giants  rely  on  patrimony.  Karl 
Marx  refers  to  Adam  Smith  in  his  Theory  of 
Surplus  Value.  In  this  book,  he  draws  on  the  work 
of  his  predecessor  in  more  than  one  page  out  of 
every  six.  Ralf  Dahrendorf  cites  Marx  160  times 
in  his  book  Society  and  Democracy  in  Germany. 
Max  Weber  does  not  cite  Marx,  but  many  of  his 
writings  were  in  response  to  the  thesis  of  his  pre¬ 
decessor.  He  once  noted  that  he  could  not  have 
accomplished  crucial  parts  of  his  work  without 
the  contributions  of  Marx  and  Friedrich  Nietzsche. 
Darwin  had  recognized  his  debt  to  Thomas 
Robert  Malthus;  John  Maynard  Keynes  would 
not  have  been  able  to  write  one  of  the  most 
famous  books  of  the  20th  century  without  the 
incremental  advances  achieved  by  several  genera¬ 
tions  of  economists.  The  theory  of  development 
consists  of  a  long  chain  of  accumulated  contribu¬ 
tions  in  several  disciplines.  The  literature  on  elites 
is  a  good  example,  among  others,  of  cumulative 
knowledge.  Many  contributions  emphasize  the 
sedimentation  of  layers  of  knowledge.  The  work 
of  Gaetano  Mosca  and  Vilfredo  Pareto  on  elitism 
has  become  consolidated  groundwork,  and  since 
then,  important  books  have  built  an  impressive 
patrimony. 

The  scope  of  social  sciences  is  rarely  the  discov¬ 
ery  of  laws  with  universal  validity;  rather,  it  is  the 


explanation  of  social  diversity.  In  the  social  sci¬ 
ences,  truth  is  not  universal,  it  is  contextual  and 
plural.  Social  scientists  do  not  make  inventions  and 
rarely  make  discoveries;  what  they  do  best  is  to 
observe  regularities  and  exceptions  (anomalies).  It 
is  for  this  reason  that  the  most  imaginative  strategy 
in  the  social  sciences  is  the  comparative  method. 
The  patrimony  is  common  property.  Although 
every  concept  or  theory  or  method  has  its  creators 
and  developers,  they  do  not  need  to  be  cited  every 
time  the  term  is  used.  Even  the  identity  of  the 
originator  may  disappear  into  anonymity.  We  do 
not,  and  cannot,  remember  who  used  terms  such  as 
role ,  revolution,  or  social  mobility  for  the  first  time. 
Given  such  a  patrimony,  scholars  today  can  start 
their  research  at  a  much  higher  level  than  did  their 
predecessors.  Graduate  students  today  know  more 
than  the  founders  of  their  field — even  if  they  do  not 
have  the  same  capabilities  of  their  forebears. 
Knowledge  is  largely  acquired  by  accumulation. 

Because  the  patrimony  of  political  science  results 
from  progressively  accumulated  knowledge  rather 
than  from  revolutionary  leaps  or  bounds,  because 
the  discipline  is  deeply  divided  in  specialties  and  in 
schools,  and  because  one  of  its  most  productive 
research  strategies  is  the  comparative  method,  the 
mainstream  of  the  discipline  of  political  science 
cannot  be  labeled  as  paradigmatic.  Other  expres¬ 
sions  are  available:  general  framework,  basic  theo¬ 
ries,  encompassing  synthesis,  lasting  postulates, 
and  so  on.  None  of  the  major  trends  in  contempo¬ 
rary  political  science  has  the  pretention  of  flying 
the  paradigmatic  flag.  Many  schools  were  and  are 
innovative  and  seminal,  but  they  directly  contra¬ 
dict  the  concept  of  a  paradigm  dominating  the 
entire  discipline  or  its  prominent  domains. 

There  is  a  basic  incompatibility  between  para¬ 
digmatic  monoliths  and  comparative  analyses.  All 
great  theories,  concepts,  and  strategies  in  political 
science  implicitly  reject  paradigmatic  assumptions 
(except  maybe  the  rational  theory).  The  history  of 
political  science  is  a  history  of  multiple  competitive 
theories  and  methods:  plural  societies,  behavior¬ 
ism,  structural  functionalism,  development,  depen¬ 
dency,  the  role  of  the  state  and  of  civil  society,  the 
changing  roles  of  institutions,  electoral  cleavages, 
religion  and  politics,  civil  culture,  nationalism, 
social  stratification,  social  class,  alienation,  legiti¬ 
macy,  trust,  ethnocentrism,  imperialism,  hege¬ 
mony,  the  clash  of  civilizations,  and  so  on.  The 
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thesaurus  of  the  disciplines  is  impermeable  to 
paradigmatic  oversimplifications.  The  Kuhnian 
overview  of  the  history  of  astronomy,  physics,  or 
biology  cannot  be  extrapolated  to  the  social  sci¬ 
ences,  according  to  Mattei  Dogan  and  Robert 
Pahre  (1990). 

The  borrowing  and  lending  of  concepts,  theo¬ 
ries,  and  methods  from  one  discipline  to  another  of 
the  social  sciences  over  generations  also  shows  the 
continuous  accumulation  of  knowledge.  Using 
various  encyclopedias  and  dictionaries,  it  is  possi¬ 
ble  to  compile  an  inventory  of  more  than  200 
concepts  that  political  science  “imported.” 

•  From  sociology:  accommodation,  aggregate, 
assimilation,  elite  circulation,  clique,  cohesion, 
collective  behavior,  hierarchy,  ideal  type, 
individualism,  legitimacy,  mass  media,  mass 
society,  militarism,  nationalism,  pattern 
variables,  Protestant  ethic,  secular,  segregation, 
social  class,  social  control,  social  integration, 
social  structure,  socialization,  status 
inconsistency,  working  class,  gemeinschaft, 
gesellschaft 

•  From  psychology:  affect,  alienation, 
ambivalence,  aspiration,  attitude,  behavior, 
consciousness,  dependency,  empathy, 
personality,  social  movement,  stereotype,  gestalt 

•  From  economics:  allocation  of  resources,  cartel, 
corporatism,  diminishing  returns,  industrial 
revolution,  industrialization,  liberalism, 
mercantilism,  gross  national  product,  scarcity, 
undeveloped  areas 

•  From  philosophy  and  Greek  writers:  anarchism, 
aristocracy,  consensus,  democracy,  faction, 
freedom,  general  will,  idealism,  monarchy, 
oligarchy,  phratry,  pluralism,  tyranny,  value, 
Weltanschauung 

•  From  anthropology:  acculturation,  affinity,  caste, 
nepotism,  patriarchy,  plural  society,  rites  of 
passage 

•  From  theology:  anomie,  charisma 

•  From  journalists  and  politicians:  imperialism, 
internationalism,  isolationism,  the  Left  and  the 
Right,  lobbying,  neutralism,  nihilism,  patronage, 
plebiscite,  propaganda,  socialism,  syndicalism 

In  the  process  of  adoption  and  adaptation 
the  semantic  meaning  of  many  concepts  has 
changed.  Many  concepts  have  multiple  origins. 


Authoritarianism  has  two  roots,  one  psychological 
and  one  ideological.  It  is  often  inadvertently  inter¬ 
changeable  with  despotism,  autocracy,  absolutism, 
dictatorship,  and  so  on.  Authority  has  been  ana¬ 
lyzed  from  different  disciplinary  perspectives  by 
Bronislaw  Malinowski,  Max  Weber,  Talcott 
Parsons,  Harold  Lasswell,  Abraham  Kaplan, 
Bertrand  de  Jouvenel,  and  Carl  Friedrich,  among 
others.  The  concept  of  culture  (civic,  political,  and 
national)  has  many  variants:  cultural  convergence, 
cultural  configuration,  cultural  evolution,  cultural 
integration,  cultural  lag,  cultural  parallelism,  cul¬ 
tural  pluralism,  cultural  relativity,  cultural  system, 
postmaterialist  culture.  In  the  past  2  decades, 
political  scientists  have  been  very  productive  in 
this  subfield. 

Paradigms  do  not  dialogue,  they  are  mortal 
enemies;  new  ones  replace  old  ones  entirely. 
Introductory  textbooks  in  physics  or  chemistry 
published  40  years  ago  rest  now  in  the  cemetery  of 
books.  On  the  contrary,  in  political  science,  dozens 
of  books  published  long  ago  are  still  inspiring  con¬ 
temporary  scholars:  Max  Weber,  Karl  Marx, 
Adam  Smith,  Vilfredo  Pareto,  Anthony  Downs, 
Kenneth  J.  Arrow,  Harold  Lasswell,  John  Stuart 
Mill,  Joseph  Schumpeter,  and  many  others,  even 
Aristotle  and  Machiavelli,  are  still  preeminent  in 
the  patrimony  of  political  science,  which  advances 
by  a  process  of  accumulation  of  work  within  sub¬ 
fields  and  develops  out  of  the  patrimony  by  the 
scientist’s  own  insights.  The  history  of  political 
science  is  not  a  history  of  breakthrough  discoveries 
as  in  physics  or  chemistry.  It  is  an  uninterrupted 
chain  of  advances,  with  many  competitive  schools, 
controversies,  temporary  syntheses,  predecessors 
and  successors,  and  methodological  disputes. 

Mattei  Dogan 

National  Center  for  Scientific  Research  (CNRS) 

Paris,  France 

See  also  Normative  Political  Theory;  Political  Philosophy; 

Political  Theory 


Further  Readings 

Almond,  G.  A.  (1990).  A  discipline  divided:  Schools  and 
sects  in  political  science.  Newbury  Park,  CA:  Sage. 

Bell,  W.  (1990).  What  do  we  mean  by  paradigm?  American 
Sociological  Association  Footnotes,  18(9),  17. 


1766  Parliamentary  Systems 


Dogan,  M.  (1996).  Political  science  and  the  other  social 
sciences.  In  R.  E.  Goodin  &  H.  D.  Klingenmann 
(Eds.),  A  new  handbook  of  political  science 
(pp.  97-132).  Oxford,  UK:  Oxford  University  Press. 

Dogan,  M.  (1997).  The  new  social  sciences:  Cracks  in  the 
disciplinary  walls.  International  Social  Science  Journal, 
153,  429-443. 

Dogan,  M.,  &  Pahre,  R.  (1990).  Creative  marginality: 
Innovation  at  the  intersections  of  social  sciences. 
Boulder,  CO:  Westview  Press. 

Holt,  R.  T.,  &  Richardson,  J.  M.  (1970).  Competing 
paradigms  in  comparative  politics.  In  R.  T.  Holt  & 

J.  E.  Turner  (Eds.),  The  methodology  of  comparative 
research  (pp.  21-72).  New  York:  Free  Press. 

Kuhn,  T.  S.  (1962).  The  structure  of  scientific  revolutions. 
Chicago:  University  of  Chicago  Press. 

Kuhn,  T.  S.  (1970).  Reflections  on  my  critics.  In 

I.  Lakatos  &  A.  Musgrave  (Eds.),  Criticism  and  the 
growth  of  knowledge  (pp.  231-278).  Cambridge,  UK: 
Cambridge  University  Press. 


Parliamentary  Systems 


Parliamentary  government  is  a  form  of  govern¬ 
ment  where  the  executive  derives  its  political 
legitimacy  from  the  parliamentary  representative 
body.  From  a  historical  point  of  view,  it  is  a 
European  invention  and  continues  to  be  the  most 
common  form  of  government  on  this  continent. 
Outside  Europe,  its  diffusion  has  been  limited, 
and  presidential  or  semipresidential  forms  have 
often  been  preferred.  This  entry  discusses  the  ori¬ 
gins  of  parliamentary  systems,  the  various  forms 
they  take,  and  current  problems  with  respect  to 
this  form  of  government. 

The  Origins  of  Parliamentary  Government 

Parliamentary  government  is  one  of  the  possible  solu¬ 
tions  to  a  crucial  liberal  and  democratic  problem — 
the  problem  of  how  to  ensure  both  the  political 
legitimacy  of  the  executive  and,  at  the  same  time, 
its  accountability.  The  basic  idea  behind  this  form 
of  government  is  that  the  parliamentary  institu¬ 
tion,  as  the  highest  representative  body  of  the 
political  community,  should  hold  the  keys  to  the 
life  of  the  executive.  This  constitutional  idea  is 
essentially  implemented  by  giving  the  parliament 
the  power  to  express  a  vote  of  no  confidence  in  the 


executive  if  it  does  not  approve  its  behavior  and  by 
obligating  the  executive  to  resign  when  faced  with 
such  a  manifestation  of  political  distrust. 

The  origins  of  this  principle  and  of  the  institu¬ 
tional  mechanisms  used  to  implement  it  can  be 
easily  detected  in  European  history  of  the  past 
2  centuries.  Within  the  framework  of  limited 
monarchical  government,  that  is,  a  political  condi¬ 
tion  whereby  the  monarchy  was  balanced  by  a 
representative  body,  the  increasingly  strong  politi¬ 
cal  role  of  the  latter  has  progressively  produced  a 
situation  in  which  the  executive  nominated  by  the 
monarch  could  not  survive  without  the  political 
approval  of  the  parliament.  Criticisms,  which  the 
parliamentary  body  could  not  address  against  the 
king  himself  without  getting  into  a  situation  of 
lese-majeste  and  thus  creating  an  unsol vable  prob¬ 
lem  of  political  legitimacy,  could  be  more  freely 
directed  against  the  government.  In  this  case,  a 
mutually  acceptable  solution  could  be  for  the  king 
to  dismiss  the  government,  attributing  to  it  all 
political  responsibilities  and  thus  preserving  his 
own  supremacy.  On  its  side,  the  representative 
body  could  “obtain  the  head”  of  the  prime  minis¬ 
ter  and  of  his  or  her  ministers  while  at  the  same 
leaving  the  authority  of  the  king  untouched.  In  this 
way,  the  executive,  while  originally  chosen  by  the 
monarch  and  deriving  from  him  its  political  legiti¬ 
macy,  could  be  gradually  drawn  into  the  sphere  of 
the  parliament  until  the  monarchy  ultimately  lost 
every  substantive  political  control  over  the  execu¬ 
tive  and  only  retained  a  symbolic  role. 

With  this  transformation,  the  government  and 
its  components  have  been  increasingly  drawn  into 
the  sphere  of  electoral  and  party  politics  and  have 
gained  independence  from  the  sphere  of  the  court 
and  of  the  state  bureaucracy.  The  process  leading 
to  a  full  parliamentary  control  over  the  executive 
has  been  often  difficult  and  nonlinear.  The  idea 
that  the  government  should  answer  to  elected 
politicians  and  become  responsive  to  the  opinions 
and  passions  of  the  populace  was  difficult  to 
accept — both  for  the  monarchy  and  for  the  state 
establishment  that  was  loyal  to  the  throne.  For  an 
extended  period  of  time,  in  most  European  coun¬ 
tries,  the  monarchs  attempted  to  maintain  some 
degree  of  political  control  over  the  formation  of 
governments  and  over  the  selection  of  prime  min¬ 
isters  and  some  ministers  (particularly  in  the  fields 
of  foreign  affairs  and  defense).  They  also  sought  to 
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influence  some  of  the  policy  decisions.  On  a  num¬ 
ber  of  occasions,  they  even  resorted  to  prematurely 
dissolving  the  parliament  with  the  purpose  of 
obtaining  a  more  favorable  position  of  the  repre¬ 
sentatives  with  regard  to  their  preferred  cabinets. 
This  resistance  has  delayed  but  not  stopped  the 
process  of  parliamentarization;  in  some  cases, 
however,  it  has  led  to  the  downfall  of  the  monar¬ 
chy  and  to  the  (permanent  or  temporary)  adoption 
of  republican  rule  (e.g.,  in  France,  Germany, 
Greece,  Italy,  Portugal,  and  Spain).  Where  the 
monarchy  has  more  smoothly  accepted  the  new 
developments,  it  has  survived,  although  with  a 
more  limited  role.  In  these  cases,  it  has  accompa¬ 
nied  the  consolidation  of  a  stable  parliamentary 
government  (Belgium,  the  Netherlands,  the  Scan¬ 
dinavian  countries,  and  the  United  Kingdom  [UKJ 
are  good  examples  of  such  a  development). 

The  Basic  Features  of 
Parliamentary  Government 

As  previously  mentioned,  the  crucial  and  defining 
feature  of  a  parliamentary  government  is  the 
power  of  the  parliament  to  express  a  vote  of  no 
confidence  against  the  government.  Corresponding 
to  this  power  is  the  duty  of  the  government  to 
resign  after  such  an  event.  As  it  had  already  been 
clearly  noticed  by  Walter  Bagehot  in  The  English 
Constitution  (1867),  this  produces  a  “close  union 
of  the  executive  and  legislative  powers.”  The  cabi¬ 
net  and  the  parliament,  or  more  precisely  the 
parliamentary  majority,  become  closely  intercon¬ 
nected.  The  other  features  of  this  form  of  govern¬ 
ment  derive  from  the  first  feature.  The  second  is 
that  the  executive  does  not  have  a  fixed  term  of 
office.  Its  duration  may  coincide  with  a  full  parlia¬ 
mentary  term,  or  even  beyond,  if  a  newly  elected 
parliament  continues  to  ensure  its  political  sup¬ 
port,  but  it  may  also  be  interrupted  beforehand. 
The  principle  of  collegial  responsibility  that  typi¬ 
cally  characterizes  parliamentary  government  is 
also  strictly  linked  to  the  power  of  the  parliament 
to  express  a  vote  of  no  confidence  against  the  gov¬ 
ernment.  The  government  is  a  collective  decision¬ 
making  body  and  is  collectively  responsible  in 
front  of  the  parliament.  This  principle  can,  how¬ 
ever,  coexist  with  significant  variations  in  the 
internal  structure  of  the  executive.  The  role  and 
the  authority  of  the  head  of  the  government  can 


vary,  as  well  as  the  degree  of  autonomy  of  the 
other  ministers.  In  most  cases,  the  duration  of  par¬ 
liament  is  not  fixed:  The  parliament  can  be 
dissolved  before  its  normal  conclusion  and  early 
elections  can  be  called.  This  possibility  is  generally 
viewed  by  constitution  makers  as  an  exit  strategy 
in  situations  where  providing  the  required  parlia¬ 
mentary  support  for  a  cabinet  has  proven  to  be  a 
difficult  task.  With  regard  to  the  dissolution  of 
parliament,  it  is  particularly  relevant  to  establish 
who  has  the  substantive  power  to  decide  this.  A 
further  institutional  feature  of  this  form  of  govern¬ 
ment  is  that  the  head  of  state  is  clearly  separated 
from  the  head  of  the  government  and  that  it  mainly 
has  a  representative  and  procedural  role  rather 
than  a  governing  one.  Parliamentary  government 
is,  however,  compatible  with  a  variety  of  possibili¬ 
ties  with  regard  to  the  selection  of  the  head  of  state. 
Especially  in  Europe,  a  good  number  of  parliamen¬ 
tary  monarchies  survive  where  the  head  of  state  is 
designated  by  the  monarchical  principle  of  inheri¬ 
tance  within  an  established  dynasty.  In  most  of  the 
other  cases,  when  a  republican  solution  is  adopted, 
a  president  is  elected  by  the  parliament.  In  a  few 
cases,  a  popular  election  takes  place  but,  as  opposed 
to  presidential  or  semipresidential  forms,  this  does 
not  attribute  to  the  president  a  governing  role. 

A  crucial  aspect  of  the  parliamentary  form  is  the 
formation  of  the  government;  unlike  the  presiden¬ 
tial  form  of  government,  it  is  not  formally  decided 
by  elections  and  thus  requires  a  more  complex 
procedure.  Typically,  this  procedure  entails  the 
nomination  of  the  government  so  that  the  parlia¬ 
ment  can  express  its  support  either  explicitly  (if  a 
confidence  vote  is  required  at  its  inception)  or 
implicitly  (when  such  a  vote  is  not  required  and 
support  is  presumed  until  proof  of  the  contrary). 
With  few  exceptions,  the  role  of  nominating  is  in 
the  hands  of  the  head  of  state,  who  is  called  to 
perform  this  function  after  an  election  or  the  resig¬ 
nation  of  the  government  in  office.  Constitutional 
rules  are  usually  not  very  specific  about  what  the 
head  of  state  (or  another  authority  endowed  with 
this  power)  should  do,  but  it  is  rather  obvious  that 
the  designation  must  be  oriented  toward  produc¬ 
ing  a  cabinet  capable  of  obtaining  the  support  of 
the  parliament.  Otherwise,  it  could  later  be  struck 
by  a  negative  vote  of  the  representative  body  and 
thus  nullified.  The  task  of  the  head  of  state  is, 
therefore,  to  find  out  the  will  of  the  parliamentary 
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majority,  in  terms  of  both  the  composition  of  the 
majority  itself  and  the  identification  of  the  person 
to  lead  the  new  government.  Depending  on  the 
electoral  system  and  the  configuration  of  the  party 
system  in  the  parliament,  this  step  may  be  more  or 
less  simple.  In  some  cases,  electoral  results  so 
clearly  define  which  party  or  coalition  of  parties 
has  won  and  to  whom  the  task  of  cabinet  leader 
should  be  assigned  that  the  head  of  state  has  prac¬ 
tically  no  need  to  conduct  a  search  and  can  pro¬ 
ceed  immediately  to  the  nomination  of  the  prime 
minister.  In  other  cases,  especially  in  multiparty 
systems,  the  process  is  less  clear.  After  new  elec¬ 
tions  or  after  the  fall  of  an  existing  government,  it 
may  not  be  immediately  obvious  which  parliamen¬ 
tary  majority  will  be  formed  that  is  able  to  sustain 
a  new  government.  To  establish  which  parties  will 
compose  it  and  which  leader  will  be  acceptable 
may  require  a  long  and  complicated  process  of 
negotiation.  The  head  of  state  will  typically  assist 
the  process  through  consultations  with  the  politi¬ 
cal  forces  until  a  viable  solution  emerges.  Under 
special  circumstances,  this  role  of  facilitator  may 
acquire  a  somewhat  greater  political  significance. 
When  parties  cannot  come  to  an  agreement,  the 
intervention  of  the  head  of  state  may  help  guide 
the  process  by  either  favouring  one  solution  or 
creating  obstacles  for  another  one. 

Institutional  Variations  in 

Parliamentary  Government 

Within  this  general  scheme,  there  is  great  space  for 
institutional  variations.  With  regard  to  parliamen¬ 
tary  confidence  in  the  government,  it  has  already 
been  mentioned  that  while  the  power  of  the  repre¬ 
sentative  assembly  to  express  a  vote  of  no  confi¬ 
dence  is  the  common  and  defining  feature  of 
parliamentary  governments,  a  positive  confidence 
vote  is  not  always  required  when  a  new  govern¬ 
ment  is  formed.  In  many  cases,  the  support  is  pre¬ 
sumed  unless  proven  otherwise.  Other  aspects 
deserve  to  be  noted.  When  the  parliament  is 
bicameral,  the  power  to  provide  or  to  deny  the 
confidence  to  the  government  is  generally  reserved 
to  only  one  of  the  chambers — the  one  with  the 
broader  democratic  base.  Only  exceptionally,  as  in 
Italy  under  the  constitution  of  1948,  is  the  positive 
confidence  of  both  chambers  required  for  a  gov¬ 
ernment  to  survive.  Given  its  crucial  influence  on 


the  stability  of  the  executive,  the  vote  of  no  confi¬ 
dence  has  been  regulated  with  variable  degrees  of 
restrictions.  Some  of  these  rules,  such  as  the 
requirement  of  a  temporal  delay  before  a  motion 
of  no  confidence  is  put  to  a  vote,  are  directed  at 
preventing  parliamentary  “ambushes.”  A  more 
strict  mechanism  is  the  one  introduced  with  the 
German  Basic  Law  of  1949,  which  has  since  found 
some  imitations.  The  so-called  constructive  vote  of 
no  confidence  prescribes  that  the  vote  by  the  par¬ 
liament  against  the  government  in  office  should 
simultaneously  designate  a  new  head  of  govern¬ 
ment  (the  chancellor  in  the  German  version).  The 
purpose  is  to  prevent  negative  coalitions  that  are 
only  united  by  the  purpose  of  bringing  down  a 
government  and  do  not  possess  the  willingness  to 
unite  in  the  formation  of  a  new  one.  Differences 
can  also  be  found  in  the  process  of  government 
formation.  If  the  normal  rule  is  that  the  head  of 
state  nominates  the  prime  minister,  this  duty  may 
be  transferred,  as  in  Sweden  after  the  constitu¬ 
tional  reform  of  1974,  to  the  parliament’s  speaker. 
Another  solution,  adopted  in  the  German  Basic 
Law,  is  to  have  the  chancellor  individually  elected 
by  the  parliament  (in  fact  by  the  Bundestag)  on  the 
proposal  of  the  head  of  state. 

With  regard  to  legislative  matters,  further  varia¬ 
tions  may  concern  the  powers  of  the  government 
and  the  head  of  state.  The  head  of  state  may  have 
the  power  to  send  back  to  the  legislature  a  piece  of 
legislation  approved  by  parliament  (in  most  of  the 
cases  of  governmental  initiative).  Governments 
may  have,  as,  for  instance,  in  Italy,  the  power  to 
enact  decrees  that  enjoy  immediately  the  same 
authority  as  laws  but  that  must  be  subsequently 
ratified  by  parliament  within  a  given  period  of 
time.  The  role  of  the  government  within  parlia¬ 
ment  may  also  vary,  especially  with  regard  to  the 
governance  of  the  lawmaking  process.  Of  special 
interest  are  its  agenda-setting  powers  and  the  con¬ 
trol  it  has  of  the  parliamentary  calendar. 

The  most  important  dimension  of  variation 
concerns  the  relative  power  and  roles  of  ministers 
and  prime  minister.  Parliamentary  governments 
may  oscillate  between  an  extreme  where  the  prime 
minister  has  a  dominant  role  in  guiding  the  cabinet 
and  another  extreme  where  his  or  her  authority  is 
much  more  limited  and  he  or  she  is  confined  to  a 
role  of  mediation  among  powerful  “cabinet  bar¬ 
ons”  (Giovanni  Sartori,  1994).  This  only  partially 
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depends  on  formal  constitutional  rules;  other  fac¬ 
tors  that  are  essentially  related  to  the  electoral  and 
party  aspects  of  government  formation  have  a 
much  greater  impact,  and  it  is  these  factors  that 
now  need  to  be  examined. 

The  Working  of  Parliamentary  Government 

On  the  basis  of  the  constitutional  rules  defining  it, 
the  parliamentary  form  of  government  can  be 
interpreted  as  a  fairly  straightforward  “chain  of 
delegation  and  accountability”  (Kaare  Strom, 
2000).  Through  elections,  citizens  delegate  the 
members  of  parliament;  these  in  turn  delegate  the 
government,  which  then  delegates  the  ministerial 
bureaucracies.  Accountability  runs  in  the  opposite 
direction.  Things,  however,  are  a  bit  less  simple 
and  linear  when  political  factors  are  introduced 
into  this  picture.  Of  crucial  importance  is  the  party 
variable  with  its  two  main  dimensions:  (1)  the 
internal  structure  of  parties  and  (2)  the  nature  of 
the  party  system.  The  relationship  between  parlia¬ 
ment  and  government  can  change  significantly 
depending  on  these  two  aspects.  The  main  reason 
is  that  the  nature  of  the  party  system  and  the  inter¬ 
nal  structure  of  parties  have  a  direct  bearing  on 
the  form  and  the  cohesion  of  the  parliamentary 
majority — the  crucial  linchpin  in  this  form  of  gov¬ 
ernment.  The  potential  variations  in  the  practical 
workings  of  the  parliamentary  government  can  be 
broad.  At  one  extreme,  we  can  find  the  situation, 
common  in  many  countries  at  the  origins  of  parlia¬ 
mentary  government,  in  which  parties  were  more 
or  less  nonexistent  as  organizations.  In  front  of 
parliamentary  aggregations  with  a  weak  degree  of 
cohesion,  the  government  (especially  when  backed 
by  a  monarch  who  had  not  yet  abandoned  the 
ambition  to  exert  political  influence)  could  easily 
manipulate  majorities  using  the  instruments  of 
political  patronage  and,  if  needed,  the  threat  of 
early  elections.  It  is  true  that  the  government  could 
also  unexpectedly  face  a  parliamentary  ambush 
due  to  the  volatility  of  parliamentary  support.  The 
consolidation  of  highly  organized  parties  during 
the  20th  century  has  deeply  changed  the  relation¬ 
ship  between  executives  and  parliaments  in  most 
countries.  Parties  (and  especially  their  extraparlia¬ 
mentary  organizations  and  leadership)  have 
become  the  crucial  linchpin  of  this  relationship.  In 
practical  terms,  both  the  members  of  parliament 


and  the  executive  became  the  delegates  of  parties’ 
leaderships.  Parliamentary  government  became 
truly  “party  government”  (Richard  Katz,  1986), 
whereby  party  leaders  define  to  a  large  extent  the 
policy  platforms  to  be  followed  by  the  government 
and  select  the  persons  to  occupy  ministerial  posi¬ 
tions.  In  this  context,  the  parliamentary  connota¬ 
tion  means  essentially  that  it  is  the  “parliamentary 
size”  of  parties  resulting  from  their  electoral  per¬ 
formance  that  determines  their  weight  in  this 
process.  Depending  on  the  variety  of  party  organi¬ 
zation  models,  the  degree  of  parliamentarization 
of  the  party  leadership  can  also  vary,  and  this  also 
reflects  on  the  parliamentarization  of  the  cabinet 
members.  The  current  decline  of  external  party 
organizations  is  opening  the  way  to  a  situation 
where  the  government  less  strictly  depends  on 
party  orders  and  in  fact  influences  internal  party 
dynamics.  Thus,  there  is  more  of  a  two-way  rela¬ 
tionship  between  parties  and  government. 

The  structure  of  the  party  system  and  electoral 
dynamics  combine  to  produce  other  important 
effects.  Particularly  relevant  is  the  impact  that 
these  elements  can  have  on  the  relationship  between 
the  electoral  and  the  parliamentary  moment  of  the 
process  of  delegation.  A  good  example  is  the 
British  one.  Normally,  the  existence  of  a  (de  facto) 
two-party  system,  combined  with  the  fact  that 
elections  determine  which  party  controls  a  parlia¬ 
mentary  majority  and  the  fact  that  the  leader  of  a 
party  is  also  in  principle  the  candidate  for  head  of 
government,  produces  a  kind  of  short  circuit  in  the 
process  of  delegation.  Although  voters  in  principle 
only  select  the  members  of  parliament,  they  also 
determine  de  facto  the  party  that  will  govern  and 
its  leader.  A  “popular  investiture”  of  the  govern¬ 
ment  is  thus  produced  and  the  parliament  cannot 
do  anything  but  ratify  it.  Of  course,  this  does  not 
detract  from  other  aspects  of  the  parliamentary 
model:  The  members  of  parliament  maintain  the 
power  to  renege  on  their  support  for  the  govern¬ 
ment  (and  the  government  knows  that  it  must 
work  toward  maintaining  this  support).  Electoral 
mandate  and  parliamentary  support  are  thus  com¬ 
bined.  Changes  in  the  configuration  of  the  party 
system  and  electoral  results  may,  however,  negate 
the  possibility  of  a  one-party  government  “man¬ 
dated”  by  the  electoral  result  and  thus  open  the 
way  to  a  coalition  negotiated  among  the  parties/ 
parliamentary  groups.  In  fact,  this  happened  in 
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2010  in  the  UK,  and  it  is  the  “normal  way”  in 
multiparty  systems.  In  such  situations,  elections 
are  not  decisive;  a  process  of  coalition  making  can 
only  begin  among  the  parties  represented  after  the 
parliament  is  formed.  In  this  case,  the  delegation 
from  the  parliament  (the  parties)  to  the  govern¬ 
ment  becomes  more  evident.  The  government  will 
receive  its  policy  mandate  and  derive  its  personal 
composition  from  this  process.  The  relationship 
between  the  government  and  its  supporting  coali¬ 
tion  is  then  crucial.  The  government  can  only  last 
if  its  supporting  coalition  holds  together.  The 
instruments  for  stabilizing  the  majority  and  regu¬ 
lating  its  internal  working  acquire  an  obvious 
importance,  according  to  Wolfgang  Muller  and 
Kaare  Strom  (2000).  Among  them,  the  coalition 
agreement  deserves  special  attention.  It  is  a  docu¬ 
ment  that  specifies  the  broad  goals  of  the  coalition, 
its  specific  policy  objectives,  the  allocation  of  min¬ 
istries,  and  the  instruments  and  the  rules  for  the 
solution  of  conflicts.  Other  instruments  can  also 
help  solve  coalition  conflicts  during  the  life  of  the 
cabinet,  such  as  coalition  committees — “summits” 
of  party  leaders  that  are  held  to  interpret  or  rene¬ 
gotiate  the  original  agreement. 

Two  further  situations  must  be  considered.  The 
first  is  when  a  coalition  is  formed  before  the  elec¬ 
tions.  The  second  is  when  the  government,  based 
on  one  or  more  parties,  does  not  have  in  parlia¬ 
ment  the  support  of  a  stable  coalition  holding  the 
majority  of  seats.  The  first  case  can  be  seen  in 
some  way  as  an  intermediate  situation  between  a 
single-party  government  and  a  normal  coalition 
government.  If  a  preelectoral  coalition  wins  the 
majority  required  to  govern,  the  formation  of  the 
cabinet  will  be  much  speedier  than  is  the  case  with 
a  postelectoral  coalition.  Most  of  the  policy  and 
distributional  questions  will  have  been  solved  in 
advance.  Moreover,  the  coalition  and  its  leader 
will  enjoy  a  sort  of  “direct  investiture”  by  the 
people.  However,  as  with  any  coalition  govern¬ 
ment,  this  one  will  also  be  faced  with  the  problem 
of  maintaining  its  cohesion.  This  problem  may 
become  more  serious  if,  with  the  approaching  of 
new  elections,  any  of  the  coalition  members  fears 
that  its  participation  may  not  be  rewarded  by  the 
voters  and  sees  its  chances  better  served  by  pursu¬ 
ing  a  different  political  path.  The  second  case 
derives  from  the  fact  that  in  a  parliamentary  gov¬ 
ernment  the  cabinet  must  not  necessarily  rely  on  a 


supporting  majority.  In  fact,  the  most  important 
thing  is  that  there  should  not  be  a  majority  against 
it.  This  enables  the  creation  of  minority  govern¬ 
ments,  made  up  of  one  or  more  parties  that  do  not 
enjoy  a  majority  in  parliament  but  explicitly  or 
implicitly  receive  the  support  of  other  parties 
either  in  a  more  stable  way  or  on  an  ad  hoc  basis. 
The  frequency  of  such  governments  is  greater  than 
expected:  This  suggests  that  participating  in  a 
cabinet  with  full  ministerial  responsibilities  is 
sometimes  more  of  a  cost  than  a  benefit. 

Whenever  a  one-party  government  is  not  possi¬ 
ble,  the  formation  of  (preelectoral  or  postelectoral) 
coalitions  is  obviously  a  crucial  aspect  in  the  work¬ 
ing  of  parliamentary  government.  This  has  gener¬ 
ated  a  whole  sector  of  studies  devoted  to  coalitions 
that,  starting  with  the  pathbreaking  study  of 
William  Riker  (1962),  have  analyzed  the  different 
types  of  coalitions  with  regard  to  their  size  (mini¬ 
mal  winning,  minority,  oversize,  etc.)  and  their 
ideological  composition  (connected/unconnected). 
Furthermore,  these  studies  have  explored  different 
explanations  (utilitarian  calculus,  ideological  pref¬ 
erences,  and  policy  orientations)  for  their  forma¬ 
tion  and  their  ends. 

Advantages  and  Disadvantages 

Is  the  parliamentary  form  of  government  a  better 
or  worse  form  than  the  others,  in  particular  the 
presidential  form?  This  question  has  recurrently 
been  at  the  center  of  discussions  and  has  received 
varying  answers.  This  is  partly  explained  by  the 
different  points  of  view  adopted  and  by  the  chang¬ 
ing  criteria  of  evaluation.  It  has  also  been  due  to 
the  lack  of  systematic  comparative  empirical  evi¬ 
dence;  only  in  recent  times  has  this  obstacle  been 
gradually  overcome.  Governmental  instability  has 
traditionally  been  one  of  the  main  points  of  criti¬ 
cism  for  parliamentary  governments.  The  experi¬ 
ences  of  a  number  of  European  countries  between 
the  two  World  Wars  seemed  to  suggest  that  parlia¬ 
mentary  cabinets  tend  to  be  unstable  and  weak  and 
that  heads  of  government  may  lack  authority,  thus 
creating  a  situation  that  could  open  the  road  to 
nondemocratic  consequences.  In  contrast,  a  presi¬ 
dential  or  semipresidential  form  of  government  has 
often  been  presented  as  a  better  solution.  Putting 
the  selection  of  the  head  of  the  executive  directly 
into  the  hands  of  the  voters  and  establishing  a 
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fixed  term  for  its  duration  seemed,  to  many  schol¬ 
ars  and  observers,  to  be  a  recipe  for  overcoming 
the  failures  of  parliamentary  government.  More 
recent  contributions  have  retorted  that  parliamen¬ 
tary  government,  rather  than  being  responsible  for 
creating  these  problems,  reflects  a  situation  exist¬ 
ing  at  the  level  of  parties  that  would  be  dangerous 
to  ignore.  Moreover,  it  can  be  said  that  in  such 
situations  the  parliamentary  form  of  government 
allows  a  flexibility  in  the  formation  and  change  of 
governments  that  may  reduce  personalized  zero- 
sum  conflicts  and  open  the  road  to  compromises. 
On  the  contrary,  presidential  forms  of  government 
often  are  characterized  by  a  rigidity  that  cannot 
easily  provide  alternative  solutions  if  the  adopted 
one  fails.  Whatever  the  conclusions  of  this  schol¬ 
arly  and  political  debate  are,  it  is  worth  noting  that 
recently  established  democracies  have  increas¬ 
ingly  attempted  to  combine  elements  of  the  parlia¬ 
mentary  and  presidential  forms  of  government, 
adopting  some  form  of  semipresidentialism  (or 
semiparliamentarianism). 

Maurizio  Cotta 
University  of  Siena 
Siena,  Italy 
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The  notion  of  parliament  has  existed  for  almost  as 
long  as  people  have  gathered  to  debate  alternative 
viewpoints  and  take  associated  decisions.  As  the 
complexity  and  formality  ascribed  to  these  gather¬ 
ings  increased,  the  institution  of  parliament  devel¬ 
oped.  It  comes  under  a  variety  of  names,  from  the 
most  common  assembly,  legislature,  or  parliament 
to  more  regionally  specific  ones  such  as  Storting 
or  Diet.  Parliaments  are  institutions  that  bring 
together  formally  recognized  members  to  discuss 
and  legitimize  decisions  on  matters  affecting  the 
community  at  large.  The  level  of  formality  and 
power  credited  to  those  discussions  and  decisions 
varies,  as  does  the  level  of  representation.  This 
entry  first  describes  how  the  role  of  parliaments 
has  changed  considerably  since  the  19th  century. 
The  factors  affecting  the  role  and  power  of  parlia¬ 
ments  are  considered,  and  the  key  roles  performed 
by  these  institutions  are  identified.  Each  of  these 
roles — legislation,  scrutiny,  and  representation — 
is  assessed,  and  the  organization  of  parliaments  is 
described.  The  entry  concludes  by  outlining  the 
main  areas  on  which  research  on  parliaments  has 
focused. 

The  history  of  parliaments  is  intrinsically  linked 
to  that  of  democracy.  Each  wave  of  democratization 
identified  by  Samuel  Huntington  has  corresponded 
to  stages  in  the  development  of  parliaments.  The 
first  wave  encompasses  the  spread  of  liberalism  in 
19th-century  Europe.  With  this  came  the  creation 
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of  new  parliamentary  institutions,  as  well  as  the 
legitimization  of  their  representative  value  and  role 
in  the  decision-making  process.  As  the  value  of 
representative  democracy  grew,  parliaments 
became  central  institutions  of  political  systems, 
particularly  in  Europe.  With  the  20th  century 
came  considerable  changes  that  affected  the  power 
and  role  of  parliaments.  As  democracy  expanded 
with  the  emergence  of  newly  independent  coun¬ 
tries  (the  second  wave  of  democratization),  so  did 
its  complexity.  The  changes  affecting  parliament 
were  twofold.  First  was  the  development  of  mass 
parties;  the  expansion  of  the  electoral  franchise 
saw  parties  becoming  key  institutions  in  the  repre¬ 
sentation  process,  acquiring  an  important  role  in 
the  expression  of  policies  and  mediation  of  inter¬ 
ests,  which  had  been  key  roles  of  the  original  lib¬ 
eral  parliaments.  Second,  there  was  an  expansion 
of  the  state;  the  emergence  of  the  welfare  state  saw 
legislation  growing  more  complex,  increasingly 
developing  into  a  matter  for  the  executive  rather 
than  parliament.  By  the  mid-20th  century,  some 
parliamentary  roles  and  powers  had  been  trans¬ 
ferred  to  other  institutions.  However,  as  the  third 
wave  of  democratization  started,  in  the  1970s, 
many  authoritarian  regimes  turned  to  democracy, 
putting  parliaments  at  the  core  of  their  transitions. 
Simultaneously,  a  wider  range  of  parliamentary 
roles  and  powers  started  to  be  more  fully  recog¬ 
nized.  This  was  a  time  when  the  paradigm  of  the 
decline  of  parliament  gave  way  to  the  paradigm  of 
the  role  of  parliament.  Besides  representation  and 
legislation,  parliaments  play  a  variety  of  other 
roles,  from  education  to  providing  a  safety  valve, 
legitimation,  and  scrutiny.  Never  before  have  there 
been  so  many  parliaments,  not  only  at  the  national 
level  but  also  locally,  regionally,  and  supranation- 
ally.  Since  the  end  of  the  20th  century,  parliaments 
have  expanded  considerably  in  number  and  in 
complexity. 

The  powers  and  roles  of  parliaments  differ 
according  to  their  political  context — the  type  of 
political,  electoral,  and  party  systems.  Being  part  of 
a  parliamentary,  hybrid,  or  presidential  political 
system  has  implications  for  the  separation  of  pow¬ 
ers  and  the  relationship  between  parliament  and 
other  institutions,  especially  the  executive.  Different 
electoral  systems,  and  associated  party  systems,  also 
affect  parliaments  by  determining  the  number  of 
parties  present  in  parliament,  the  potential  support 


base  for  government,  the  strength  of  party  disci¬ 
pline,  the  type  of  representative  mandate,  and  the 
parliament’s  organization.  Indeed,  some  parlia¬ 
ments  are  not  elected  at  all  and  are  not  composed 
of  party  members.  Therefore,  parliaments  have 
differing  powers  and  roles  according  to  the  charac¬ 
teristics  of  these  structural  variables,  though  all,  in 
varying  degrees,  make  a  link  between  citizens  and 
governance. 

The  relationship  with  government  is  one  of  the 
key  dimensions  to  understanding  parliaments. 
Depending  on  the  type  of  political  system,  parlia¬ 
ment  may  have  different  tools  with  which  to  hold 
government  to  account,  such  as  a  motion  of  cen¬ 
sure  that  could  bring  the  government  down  if 
approved.  But  the  relationship  with  the  executive 
goes  well  beyond  accountability.  The  main  focus 
of  that  relationship  usually  revolves  around  the 
production  of  legislation,  but  it  also  concerns  the 
scrutiny  of  government  activity.  It  is  generally 
accepted  in  the  literature  that  the  executive  domi¬ 
nates  this  relationship,  due  to  the  support  ensured 
through  party  discipline  and  governmental  legisla¬ 
tive  powers.  That  said,  the  more  checks  and  bal¬ 
ances  the  system  includes,  the  less  likely  it  is  that 
the  executive  will  dominate  parliament. 

The  main  function  traditionally  associated  with 
parliaments  is  legislation.  However,  most  legislation 
produced  today  has  little  contact  with  parliament. 
Legislation  has  become  so  complex  that  parliaments 
have  little  capacity  to  deal  with  the  detail  and  extent 
of  today’s  regulatory  frameworks.  Notwithstanding, 
parliaments  do  still  hold  a  role  in  the  legislative 
process.  This  role  varies  according  to  structural 
characteristics,  as  explained  above,  resulting  in 
varying  degrees  of  power  in  the  capacity  to  initiate, 
amend,  approve,  and/or  reject  legislation.  Many 
parliaments  only  have  a  legitimizing  role  of  the  leg¬ 
islative  process,  with  little  power  to  introduce  legis¬ 
lation,  let  alone  change  it.  They  often  act  as  the  final 
stamp  of  approval,  which  explains  why  authors 
refer  to  rubber-stamp  parliaments.  As  David  Olson 
has  famously  stated,  there  is  a  general  90%  rule  in 
the  legislative  process,  whereby  the  executive  pro¬ 
poses  90%  of  the  legislation  and  gets  90%  of  what 
it  wants.  Within  the  10%  leeway,  many  parliaments 
have  the  capacity  and  ability  to  change  legislation, 
through  formally  approving  amendments  or  infor¬ 
mally  influencing  the  final  outcome.  Occasionally, 
parliaments  have  the  ability  to  reject  government 
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legislation.  When  parliament  regularly  rejects  gov¬ 
ernment  legislation,  the  relationship  between  the 
two  institutions  can  break  down,  leading  to  new 
elections  (or  even  a  regime  change).  Regardless  of 
the  percentage  of  legislation  on  which  parliament 
can  act,  one  must  also  consider  the  focus  and  level 
of  legislation  (primary,  secondary,  or  delegated). 
Some  parliaments  reserve  the  right  to  legislate  on 
specific  matters,  which  means  that  the  whole  of 
the  bill  is  introduced,  discussed,  and  approved  in 
parliament;  while  in  other  cases  the  government 
has  areas  of  reserved  legislation  through  which 
it  can  legislate  without  having  to  go  through 
parliament.  Additionally,  other  institutions  can 
intervene  in  the  legislative  process,  such  as  supra¬ 
national  parliaments. 

Scrutiny  is  another  of  parliament’s  main  func¬ 
tions.  It  comes  under  different  names  such  as  over¬ 
sight  or  parliamentary  control  and  expresses  itself 
in  a  wide  range  of  activities  and  tools.  Scrutiny 
plays  a  key  part  in  the  relationship  between  parlia¬ 
ment  and  government  and  has  increased  in  impor¬ 
tance  as  parliament’s  role  in  the  legislative  process 
has  diminished.  Scrutiny  can  focus  on  the  applica¬ 
tion  of  legislation,  on  seeking  information,  or  sim¬ 
ply  on  keeping  a  check  on  governmental  activity. 
Most  parliaments  have  provision  for  asking  ques¬ 
tions  of  government,  in  writing  and/or  orally  in  the 
plenary  chamber  and/or  in  committee.  The  fre¬ 
quency  of  these  questions  and  their  response  rate 
are  key  indicators  to  assess  the  effectiveness  of 
these.  Parliaments  often  provide  for  several  for¬ 
mats  for  questions  to  the  government  according  to 
the  types  of  request  and  matters.  Interpellations 
are  another  popular  scrutiny  tool.  Usually  more 
general  in  scope,  these  involve  a  small  debate  on 
the  topic,  with  the  participation  of  a  wider  number 
of  representatives  besides  the  proponent.  Finally, 
committees  of  enquiry  form  the  other  main  scru¬ 
tiny  tool  used  by  parliaments.  Despite  exceptions, 
these  tend  to  be  set  on  an  ad  hoc  basis  when  there 
is  a  serious  matter  to  investigate.  The  ability  to 
scrutinize  and  investigate  government  adequately 
depends  greatly  on  parliament’s  power  to  request 
information  from  the  government  and  to  summon 
the  presence  of  members  of  the  government  and  of 
the  public  administration,  as  well  as  the  degree  to 
which  the  opposition  can  use  these  powers. 

As  a  key  function  for  most  parliaments,  repre¬ 
sentation  takes  many  forms  and  is  closely  linked  to 


the  type  of  electoral  system  through  which  the 
parliament  is  elected.  Representation  can  be  collec¬ 
tive,  where  the  party’s  parliamentary  group  is  the 
main  representative  entity,  or  individual,  through 
members  of  parliament  (MPs,  also  called  deputies, 
legislators,  or  senators).  The  representative  man¬ 
date  is  often  linked  to  constituencies  but  can  have 
a  national  basis.  The  strength  of  the  link  between 
representatives  and  constituencies  depends  on 
these  factors.  Parliaments  elected  through  a  first- 
past-the-post  electoral  system  tend  to  encourage  a 
more  individual  and  constituency-based  type  of 
representation.  Besides  this,  representatives  may 
also  follow  a  trustee  approach  to  their  mandate  or 
a  delegate  one.  In  the  trustee  model,  representa¬ 
tives  make  decisions  by  following  their  own  judg¬ 
ment,  whereas  in  the  delegate  model,  MPs  act 
foremost  on  behalf  of  a  particular  group.  The  del¬ 
egate  model  tends  to  dominate  today,  and  the 
debate  is  mainly  about  which  group  should  prevail 
in  that  representation — the  party,  the  constituency, 
or  the  nation. 

With  respect  to  their  organization,  parliaments 
can  be  unicameral,  with  only  one  chamber,  or 
bicameral,  with  two  chambers.  Bicameral  parlia¬ 
ments  tend  to  exist  in  countries  with  a  large  terri¬ 
tory,  a  federation,  and/or  great  ethnic  diversity. 
Some  countries  do  not  fall  in  any  of  these  catego¬ 
ries  but  have  a  second  chamber  for  historical  rea¬ 
sons.  The  role  and  powers  of  second  chambers 
vary  considerably,  but  they  tend  to  complement 
the  first  chamber  and  to  act  as  a  check  and  bal¬ 
ance.  A  key  consideration  is  whether  the  second 
chamber  is  nominated,  directly  elected,  or  indi¬ 
rectly  elected.  Often,  parliamentary  activity  is 
divided  between  the  plenum — a  chamber — and 
committees.  The  plenum  is  usually  reserved  for 
larger  debates  and  committees  for  more  detailed 
review  and  scrutiny.  Normally,  plenum  debates 
are  public,  whereas  committees  can  hold  meetings 
behind  closed  doors.  Today,  committee  work 
plays  a  considerable  part  in  parliamentary  activ¬ 
ity.  One  final  point  to  consider  in  parliamentary 
organization  is  the  level  of  resources  and  staffing. 
These  affect  parliaments’  ability  to  play  any  of 
their  roles. 

Political  scientists  still  know  little  about  parlia¬ 
ments  around  the  world.  Most  research  focuses 
on  a  very  few  case  studies,  normally  from  West¬ 
ern  democracies,  and  on  limited  dimensions  of 
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parliamentary  activity.  The  relationship  between 
parliament  and  government,  parliament’s  role  in 
the  legislative  process,  and  parliamentary  voting 
behavior  constitutes  the  main  areas  of  research. 

Cristina  Leston-Bandeira 
University  of  Hull 
Hull,  United  Kingdom 
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Anyone  doing  empirical  research  into  social  activ¬ 
ities  and  social  reality  has  three  basic  options: 
(1)  evaluating  documents,  (2)  conducting  a  written 
or  oral  survey  to  gain  information  on  the  knowl¬ 
edge  and  the  activities  of  subjects,  or  (3)  partici¬ 
pating  in  the  everyday  practices  of  the  people  of 
interest.  The  final  method  is  called  participant 
observation  or  ethnography.  It  has  its  roots  in 
anthropology  and  ethnology;  it  is  rarely  used  in 
political  science.  After  a  brief  overview  of  the  his¬ 
torical  and  theoretical  development  of  participant 
observation,  this  entry  demonstrates  the  opportu¬ 
nities  and  restrictions  of  participant  observation  as 
a  research  method  in  political  science.  It  concludes 
with  some  advice  on  putting  participant  observa¬ 
tion  into  practice. 

Historical  Development 

The  methodological  development  of  participant 
observation  dates  back  to  the  1920s  with  the 


research  of  the  anthropologist  Bronislaw  Malin¬ 
owski  as  well  as  Robert  Park,  William  Thomas, 
and  Florian  Znaniecki  of  the  Chicago  School  of 
Sociology.  Classics  such  as  Marienthal:  The 
Sociography  of  an  Unemployed  Community 
(1933)  by  Marie  Jahoda  and  Paul  Lazarsfeld  or 
Street  Corner  Society  (1943)  by  William  Whyte 
indicate  that  participant  observation  has  been  part 
of  the  methodological  instruments  used  in  the 
social  sciences  from  the  start.  However,  in  the 
beginning  of  the  1950s,  such  observation  methods 
fell  out  of  favor  due  to  the  further  development  of 
survey  research  and  standardized  statistical  meth¬ 
ods.  In  the  1980s,  when  the  discussion  about  the 
usefulness  of  a  qualitative  research  approach 
increased,  interest  in  participant  observation  was 
revived. 

In  comparison  with  other  disciplines  such  as 
sociology  or  education,  participant  observation 
has  until  now  been  neglected  in  political  science. 
Not  only  has  it  been  rarely  used  in  political  sci¬ 
ence  research,  but  political  scientists  have  not 
been  involved  in  its  methodological  development. 
The  work  of  Richard  Fenno  is  one  of  the  few 
exceptions.  His  description  of  his  research  strat¬ 
egy  as  “soaking  and  poking”  has  become  stan¬ 
dard  terminology.  Fenno  observed  members  of  the 
U.S.  House  of  Representatives  in  their  constituen¬ 
cies  and  inductively  developed  a  theory  of  their 
working  styles.  He  later  also  accompanied  U.S. 
senators  on  their  campaigns,  and  in  several  of  his 
works,  he  reflected  on  the  experiences  he  had 
while  observing  politicians  in  Washington,  D.C., 
and  in  their  home  districts. 

Theory  of  Scientific  Observation 

At  the  most  basic  level,  observation  orients  us  to 
our  environment — it  is  something  we  do  every 
day.  It  becomes  scientific  observation  if  it  allows 
us  to  systematically  examine  certain  assumptions 
with  results  that  are  repeatable  and  intersubjec- 
tively  comprehensible.  Scientific  methods  of 
observation  can  be  divided  into  the  following  five 
categories: 

1.  covert  versus  open  observation, 

2.  participant  versus  nonparticipant  observation, 

3.  structured  versus  unstructured  observation, 
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4.  observation  in  the  field  versus  laboratory 
observation,  and 

5.  introspection  versus  external  observation. 

In  political  science,  participant  observation  is 
the  most  common  variant.  In  general,  this  type  of 
research  is  conducted  openly,  in  a  natural  environ¬ 
ment,  and  in  the  form  of  external  observation.  The 
extent  of  participation  can  be  thought  of  as  a  con¬ 
tinuum,  since  it  can  take  a  variety  of  forms,  and 
the  influence  of  the  observer  on  the  field  can  vary. 
For  example,  as  a  guest  in  committee  meetings  the 
political  scientist  is  a  quiet  listener,  but  as  a  pas¬ 
senger  in  a  politician’s  car  on  the  way  to  the  next 
appointment  he  or  she  can  also  be  asked  for 
advice.  Accordingly,  participant  observation  is  not 
to  be  understood  only  literally,  since  in  many  insti¬ 
tutional  contexts  action  primarily  consists  of 
verbal  communication.  Observing  means  commu¬ 
nicating.  In  most  cases,  participant  observation  is 
therefore  combined  with  other  methods  of  data 
collection.  Take,  for  example,  the  observation  of  a 
parliamentary  committee.  Here,  observation  does 
not  only  mean  direct  participation  in  the  meeting 
for  the  purpose  of  describing  its  course  and  the 
actions  of  the  legislators.  Essential  to  the  observa¬ 
tion  is  researching  background  information  on 
committee  members  or  studying  the  agenda  and 
the  motions,  informal  conversations,  as  well  as 
later  interviews  with  representatives.  After  the 
meeting,  the  observations  must  be  examined  to 
determine  whether  they  confirm  previous  assump¬ 
tions  and  earlier  observations  or  whether  new 
hypotheses  must  be  generated. 

Participant  observation  is  in  many  respects  a 
process.  On  the  one  hand,  the  political  scientist  has 
to  gain  access  to  the  field.  On  the  other,  the  obser¬ 
vation  increasingly  has  to  concentrate  on  those 
aspects  that  are  essential  to  the  research  question. 
James  Spradley  distinguishes  three  phases:  descrip¬ 
tive,  focused,  and  selective  observation.  In  the  first 
phase,  the  researchers  get  their  bearings  and  collect 
unspecific  descriptions  of  the  field.  The  complexity 
of  the  field  is  grasped  as  comprehensively  as  possi¬ 
ble,  and  concrete  research  questions  are  developed. 
During  focused  observation,  the  perspective  is  nar¬ 
rowed  and  focused  on  the  problems,  processes,  and 
persons  that  are  of  particular  importance  for  the 
problem  under  investigation.  At  the  end  of  the 
observation  process,  selective  observation  serves  to 


find  further  proof  and  further  examples  for  the 
previously  found  behavior. 

Participant  observation  aims  to  understand  its 
research  subject  from  within.  Through  participant 
observation,  subjective  views,  and  interpretations 
(symbolic  interactionism),  interactive  processes  in 
which  social  reality  is  constructed  (ethnomethod- 
ology)  or  the  cultural  and  social  rules  that  shape 
the  perception  and  construction  of  social  reality 
(structuralism)  can  be  understood.  The  research 
principle  that  underlies  participant  observation  is 
understanding.  Typically,  the  starting  point  of  par¬ 
ticipant  observation  is  a  singular  case,  from  which 
one  progresses  to  general  or  comparative  state¬ 
ments.  First,  the  individual  case  is  reconstructed. 
Then,  the  analyses  and  results  from  other  cases  are 
checked  for  comparison,  and  finally  a  typology  is 
developed.  What  is  understood  as  a  singular  case 
depends  on  the  theoretical  viewpoint:  subjects  and 
their  points  of  view,  interactions,  or  social  and 
cultural  contexts.  In  all  these,  one  message  is  per¬ 
vasive:  Reality,  as  it  presents  itself  to  the  observer, 
is  never  a  given  but  is  constructed  by  the  various 
(individual  or  collective)  players. 

Advantages  and  Restrictions 
of  Participant  Observation 

The  role  of  the  observing  political  scientist  is  not 
very  different  from  that  of  the  ethnologist  who 
works  in  alien  cultures.  By  direct  participation,  the 
political  scientist  attempts  to  understand  typical 
organizational  conventions  and  the  rules  and 
everyday  routines  that  form  the  cultural  bases  of 
functioning  political  institutions.  The  field  re¬ 
searcher’s  aim  is  the  understanding  of  political 
institutions  from  an  endogenous  perspective. 
Observations  of  legislators  show  that  the  social 
processes  and  rules  guiding  communicative  pro¬ 
cesses  or  everyday  behavior  are  difficult  to  research 
through  interviews  because  something  that  is 
taken  for  granted  often  remains  unconscious.  If 
you  ask  politicians  which  rules  guide  their  work  or 
which  strategies  they  pursue  to  achieve  their  goals, 
the  answers  are  often  nondescript.  For  example, 
many  legislators  are  not  aware  of  the  informal 
rules  of  parliamentary  business,  and  they  have  sel¬ 
dom  reflected  on  “how  it  is  done”  to  enforce  their 
positions.  Participant  observation  makes  it  possi¬ 
ble  to  discover  the  behavioral  patterns  of  everyday 
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actions  in  addition  to  what  the  subject  of  the 
observation  reveals. 

Examining  the  influence  of  informal  rules  is 
another  advantage  of  participant  observation. 
Politicians  are  used  to  formulating  their  positions 
with  the  public  in  mind;  this  behavior  comes 
through  in  interviews,  from  which  the  observer 
often  learns  more  about  the  ideas  of  the  politician 
interviewed  and  about  how  something  ought  to  be 
rather  than  about  his  or  her  actual  behavior  and 
how  things  are.  This  is  particularly  true  when 
examining  sensitive  areas,  including  informal 
actions,  because  these  may  be  considered  to  be 
synonymous  with  “fiddling”  and  “shady  deals.” 
The  combination  of  interviews  and  observations 
offers  the  advantage  of  being  able  to  correlate  and 
connect  the  sphere  of  perceptions  with  that  of 
actions.  The  results  of  the  observations  can  be  con¬ 
trasted  and  compared  with  those  of  the  interviews. 
Participant  observation  makes  it  possible  to  expe¬ 
rience  social  players  as  they  actually  behave  in 
social  and  political  reality. 

Political  scientists  involved  in  field  research  are 
not  immune  from  the  danger  described  in  ethnol¬ 
ogy  as  “going  native”:  a  process  in  which  the 
observer  loses  the  outside  perspective  of  the  alien 
in  his  fieldwork  and  adopts  the  views  of  the  sub¬ 
jects  under  observation.  This  makes  the  systematic 
evaluation  of  gained  insights  more  difficult. 
However,  if  the  researcher  is  self-consciously 
aware  of  this  problem,  the  closeness  to  the  research 
object  is  of  immense  value.  No  other  method 
allows  a  researcher  to  come  so  close  to  the  object 
of  his  or  her  enquiry  as  participant  observation. 
The  research  process  is  not  clouded  by  theories 
developed  beforehand,  traditional  views,  or  stan¬ 
dardized  procedures.  Participant  observation 
makes  it  possible  to  see  the  progress  and  the  con¬ 
tradictions  of  political  processes  and,  as  Fenno  put 
it,  to  see  the  world  the  way  the  agents  see  it. 

Participant  observation  also  has  its  disadvan¬ 
tages,  and  this  explains  why  only  a  few  political 
scientists  use  it.  First,  the  method  is  highly  time- 
consuming  and  therefore  costly.  Second,  restricted 
access  to  the  field  limits  the  opportunities  for 
scientific  political  field  research.  Third,  the  for¬ 
malization  of  participant  observation  methodol¬ 
ogy  is  only  possible  to  a  limited  degree.  The 
method  of  participant  observation  is  difficult 
to  divide  into  systematic  steps  and  to  teach 


systematically.  Because  of  this,  it  can  be  burden¬ 
some  to  researchers. 

Advice  for  Conducting  Practical  Research 

In  conducting  participant  observations,  the  follow¬ 
ing  recommendations  may  be  helpful. 

f .  Participant  observation  is  not  about  the  casual 
collection  of  impressions.  Therefore,  the  implemen¬ 
tation  of  theoretical  research  questions  has  to  be 
done  carefully.  An  observation  guideline  (but  not  a 
detailed,  limiting  observation  scheme)  sensitizes  the 
observers  to  the  observed  situation,  ensures  concen¬ 
tration  on  the  research  questions,  and  facilitates 
coordination  between  various  researchers. 

2.  Restrictions  to  field  access  are  an  everyday 
occurrence  for  political  scientists  using  the  partici¬ 
pant  observation  method,  due  to  the  fact  that 
political  decisions  are  often  taken  behind  closed 
doors.  However,  sample  selection  should  not  be 
done  with  the  anticipation  of  potential  access 
problems.  A  hybrid  approach  is  recommended 
here — that  is,  taking  a  random  sample  based  on 
theoretical  criteria  that  ought  to  be  flexible  enough 
to  be  extended  and  modified  in  the  research  pro¬ 
cess.  Here,  Barney  Glaser’s  and  Anselm  Strauss’s 
idea  of  “theoretical  sampling”  is  instructive.  The 
following  methods  may  increase  access  to  the  field: 

•  being  perseverant  and  obstinate; 

•  identifying  “gatekeepers,”  using  hierarchies  and 
mediators  to  open  doors; 

•  winning  over  members  of  institutions  to  function 
as  mentors  to  mediate  in  the  solution  of 
problems  that  occur  in  the  field; 

•  avoiding  academic  jargon,  because  politicians  are 
not  interested  in  the  formulation  of  scientific 
theories,  and  instead  communicating  meaning  by 
formulating  the  usefulness  of  the  research  results 
to  be  expected  in  the  language  of  politics;  and 

•  acting  competently,  sensitively,  honestly,  and 
neutrally  because  safe  field  access  depends  on 
how  the  researchers  are  perceived  in  the  field. 

3.  Political  scientists  usually  do  research  in  con¬ 
texts  involving  many  documents  and  papers,  so 
that  drawing  up  field  notes  will  go  unnoticed.  If 
the  opportunity  to  write  down  extensive  notes  is 
not  taken,  one  misses  the  chance  to  collect  detailed 
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observation  data,  unadulterated  by  gaps  in  one’s 
memory.  To  ensure  data  quality,  explicit  observa¬ 
tion  records  have  to  be  made  from  the  notes  close 
to  the  actual  time  of  the  events.  If  too  much  time  is 
allowed  to  elapse,  the  records  will  be  increasingly 
imprecise.  Thus,  spare  time  for  recording  data 
should  already  be  set  aside  when  planning  the  field¬ 
work.  In  the  observation  of  institutional  bodies, 
common  behavioral  patterns  will  be  discovered, 
depending  on  the  organizational  context,  from 
which  a  recording  guideline  can  be  derived,  which 
will  structure  and  simplify  the  following  recording. 

4.  Observation  by  several  researchers  offers  an 
opportunity  to  ensure  the  reliability  of  observation 
data.  One  observer  compiles  an  initial  record, 
which  is  then  worked  over  by  the  second  one.  That 
way,  information  can  be  complemented,  contra¬ 
dictions  reconciled,  and  interpretations  checked. 
With  the  help  of  pretests,  coordination  among  the 
observers  can  be  improved.  Another  possibility  for 
validating  the  results  is  to  get  feedback  from  the 
persons  observed  in  the  field. 

Helmar  Scboene 
University  of  Education  Scbwaebiscb  Gmuend 
Scbwaebiscb  Gmuend,  Germany 
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Participation 


No  society,  be  it  traditional  or  modern,  can  exist 
and  survive  without  the  active  engagement  of  its 


members  in  the  various  domains  of  life  that  have 
resulted  from  processes  of  social  differentiation 
and  that,  in  toto,  constitute  societies  and  polities 
at  large.  This  does  not  imply  that  every  member  of 
any  given  society  has  to  be  active  in  each  domain. 
What  it  means,  however,  is  that  each  domain 
encompasses  individuals  (the  microlevel),  an  inter¬ 
mediary  structure  (the  mesolevel)  linking  individ¬ 
uals  and  institutions  in  that  domain,  and  systemic 
elements  crystallized  in  rules  and  institutions  regu¬ 
lating  the  particular  domain  in  question  (the  mac¬ 
rolevel).  Especially  in  modern  times,  these  three 
levels  are  complemented  by  a  fourth  overarching 
level  linking  units  in  various  ways  (globalization 
is  a  particular  phenomenon  resulting  from  such 
linking  processes).  Notwithstanding  the  fact  that 
presently  there  is  an  ongoing  debate  in  political 
science  on  the  shrinking  role  of  the  nation-state, 
the  nearly  200  nation-states  in  existence  are  the 
major  macro-units  of  analysis  in  comparative 
political  science,  whether  they  are  considered  in 
full  or  as  subsets  and  whether  the  core  emphasis 
in  systematic  analyses  is  on  the  micro-,  the  meso-, 
or  the  macrolevel  or  any  combination  of  them. 
Below,  first,  a  definition  of  various  aspects  of  par¬ 
ticipation  is  provided.  This  is  followed  by  a  dis¬ 
cussion  of  the  underlying  methods  and  theoretical 
concepts.  The  various  dimensions  of  political 
participation  are  then  discussed  in  greater  detail. 
In  conclusion,  some  of  the  major  current  develop¬ 
ments  and  consequences  for  contemporary  democ¬ 
racies  are  pointed  out. 

Participation  refers,  first,  to  activities  by  indi¬ 
vidual  members  of  any  given  meso-  or  macro-unit 
of  analysis.  Second,  in  the  core  of  participation  is 
the  action  itself — that  is,  individual  behavior — 
even  if  attitudes  as  the  antecedents  of  such  behav¬ 
ior,  as  obtained  in  survey  research,  may  also  be  of 
interest.  Third,  individuals  never  act  in  a  social 
void;  therefore,  to  understand  why  people  act  at  all 
and  in  the  way  they  do,  one  has  to  consider  the 
embeddedness  of  individuals  in  a  context  condu¬ 
cive  to  action.  This  context  can  be  the  institutional 
arrangements  on  the  macrolevel  (e.g.,  for  voting, 
the  electoral  law  in  a  given  country)  or  the  social 
environment  that  an  individual  is  part  of.  Fourth, 
modern  social  science  is  not  only  about  structure 
and  persistence  but  also  about  dynamics  and 
change,  with  obvious  implications  for  research 
designs.  Fifth,  and  returning  to  the  concept  of 
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domains  as  subunits  of  sociopolitical  systems 
resulting  from  processes  of  differentiation  in  con¬ 
temporary  societies,  participation  for  the  purpose 
of  this  entry  is  divided  into  a  part  on  political  par¬ 
ticipation  and  a — smaller — part  on  social  partici¬ 
pation  (there  are  other  fields  worth  looking  at, 
e.g.,  cultural  participation,  but  here  the  data  situa¬ 
tion  is  unsatisfactory,  and  to  reach  out  beyond  the 
two  fields  just  alluded  to  would  overstretch  the 
scope  of  the  entry). 

Methodology 

As  mentioned  above,  participation  of  individual, 
collective,  or  institutional  actors  is  a  constituent 
feature  of  any  kind  of  sociopolitical  structures  and 
processes,  including  nation-states.  To  study  par¬ 
ticipation  empirically  as  an  individual  property 
requires  a  particular  set  of  research  instruments  of 
an  obtrusive  or  unobtrusive  nature,  and  among 
those  undoubtedly  the  most  prominent  one  is  the 
representative  sample  survey  that  acquires  infor¬ 
mation  from  individual  members  of  any  given 
meso-  or  macro-unit.  These  individuals  need  to  be 
selected  at  random  based  on  probability  theory  in 
order  to  permit  generalizations  from  the  sample  to 
the  population  from  which  the  sample  was  drawn. 
By  now,  survey  research  can  be  regarded  as  a  well- 
established  research  methodology  and  does  not 
require  any  elaboration  here,  although  using  this 
methodology  comparatively  is  a  challenge.  It  is  not 
by  chance  that  the  worldwide  spread  of  this  meth¬ 
odology,  which  started  in  the  1950s,  has  helped 
make  individual  political  participation  one  of  the 
best  researched  fields  in  political  science. 

But  there  are  other  important  factors  conducive 
to  this  state  of  affairs.  The  first  is,  on  the  macro¬ 
level,  the  economic  growth  and  modernization  that 
have  contributed  to  the  development  of  a  survey 
research  industry,  both  academic  and  commercial, 
that  is  an  important  tool  for  self-observation  in 
societies.  Second,  surveys  in  the  early  days  of  this 
research,  especially  in  the  academic  realm,  were  a 
rare  and  costly  resource;  this  is  not  the  least  of  the 
reasons  for  the  establishment  of  academic  data 
archives.  This  situation  did  not  simply  improve 
over  time  as  survey  studies  became  more  numerous; 
a  more  important  change  was  the  transition  from 
one-shot  cross-sectional  surveys  to  systematic  repli¬ 
cations  over  time  to  create  longitudinal  databases 


capable  of  analyzing  sociopolitical  change,  at  least 
in  the  aggregate  (panels  that  require  the  same  indi¬ 
viduals  to  be  surveyed  over  time  are  a  separate  and 
more  complicated  issue  and  are  therefore  still  quite 
rare).  Third,  starting  in  the  1980s,  the  scholarly 
community  has  conducted  a  broad  set  of  compara¬ 
tive  longitudinal  academic  cross-sectional  surveys 
from  which  participation  research  has  benefited 
enormously,  such  as  the  World  Values  Studies 
(WVS)  and  European  Values  Studies  (EVS),  the 
International  Social  Survey  Programme  (ISSP),  the 
European  Social  Survey  (ESS),  and  the  Cross- 
National  Study  of  Electoral  Systems  (CNES),  data 
sets  from  which  many  of  the  current  analyses  and 
findings  have  originated.  In  addition  to  these  aca¬ 
demically  driven  surveys,  other  comparative  sur¬ 
veys  had  already  been  established  already  in  the 
1970s,  such  as  the  Eurobarometer  surveys  funded 
by  the  European  Commission,  surveys  that  were 
later  complemented  by  other  Barometer  surveys  in 
Africa  ,  South  America,  and  Asia. 

Fourth,  with  the  establishment  of  social  science 
data  archives  since  the  1960s  that  now,  organized 
in  the  International  Federation  of  Data  Archives 
(IFDO),  cover  large  parts  of  the  world,  the  col¬ 
lected  data  are  increasingly  becoming  quickly 
available  to  the  research  community  (in  part 
already  through  the  Internet)  in  a  well-documented 
fashion  and  at  little  or  no  cost,  with  a  major 
impact  on  research,  publications,  PhD  disserta¬ 
tions,  and  student  training  in  social  science  research 
methodology.  Finally,  as  the  outreach  of  compara¬ 
tive  surveys  covers  more  and  more  countries  (e.g., 
the  WVS,  which  is  by  now  conducted  in  more  than 
70  countries),  the  theoretical  and  analytical  scope 
of  research  is  beginning  to  be  extended  substan¬ 
tially  through  so-called  multilevel  analyses  that 
permit  the  systematic  assessment  of  the  impact  of 
macrofactors  on  individual  behavior. 

Participation  and  Theories  of  Democracy 

The  Austrian-born  sociologist  Paul  Lazarsfeld 
escaped  from  the  threat  of  fascism  in  the  1930s  to 
the  United  States  and  became  one  of  the  major 
driving  forces  in  the  development  of  systematic 
theory-guided  empirical  social  research.  In  addi¬ 
tion  to  his  work  in  media  studies,  he  became  inter¬ 
ested  in  elections  and  conducted,  at  Columbia 
University,  the  first  major  election  studies  in  the 
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United  States.  After  World  War  II,  a  group  of  four 
political  scientists  at  the  Institute  for  Social 
Research  at  the  University  of  Michigan  started  a 
series  of  election  studies  that  later  developed  into 
the  still  existing  American  National  Election 
Studies  (NES)  project.  Similar  national  longitudi¬ 
nal  election  study  projects  have  come  into  being, 
especially  in  some  parts  of  Europe  and  for  studying 
elections  to  the  European  Parliament;  these  data 
are  also  freely  available  through  the  respective 
data  archives.  The  Michigan  scholars  and  their 
works  paved  the  way  for  contextualizing  them  in 
empirical  democratic  theory  and  squared  well  with 
the  variant  of  election-centered  elite  democratic 
theory  formulated  by  Joseph  Schumpeter  in  his 
classic  book  Capitalism,  Socialism  and  Democracy. 
Basically,  Schumpeter’s  notion  was  that  voters’ 
most  important  role  was  to  choose,  in  general  elec¬ 
tions,  among  those  elites  who  stood  for  political 
office.  Thus,  it  is  not  by  chance  that  research  on 
political  participation  initially  was  very  much 
related  to  the  electoral  process  and  the  role  citizens 
play  in  this  process.  In  contemporary  social  science 
parlance,  the  key  normative  issues  involved  here 
for  democracy  are  not  only  how  responsive  the 
political  system  is  to  the  needs  and  requests  of  the 
people  (Jan  Teorell,  2006)  but  also  which  political 
institutions  safeguard  accountability  for  the  citi¬ 
zens  (Pippa  Norris,  2007). 

This  clearly  limited  citizen  role  was  challenged 
in  the  late  1960s  and  early  1970s  in  the  wake  of 
the  anti-Vietnam  War  demonstrations  in  the 
United  States  and  student  protests  in  many  parts  of 
the  democratic  world.  Requests  for  an  extended 
political  role  of  the  citizenry  became  louder;  they 
culminated,  at  least  in  Europe,  in  expectations 
about  the  demise  of  the  “late-capitalist  state” 
(Max  Kaase  &  Kenneth  Newton,  1995).  Thus, 
liberal-elitist  democratic  theory  was  extended  in 
scope  through  the  notion  of  participatory  democ¬ 
racy,  a  notion  that  claimed  direct  political  chan¬ 
nels  of  influence  for  citizens  going  beyond  the 
established  modes  of  representative,  indirect  par¬ 
ticipation  through  parties  and  elections.  In  the 
seminal  study  Political  Action ,  Samuel  Barnes  et 
al.  (1979)  spoke  of  an  extended  repertory  of  polit¬ 
ical  action  available  to  citizens  and  coined  the 
terms  conventional  and  unconventional  political 
participation  to  combine  elitist  and  participatory 
theories  of  democratic  political  participation. 


The  most  recent  variant  of  thinking  about 
democracy  and  democratic  participation  comes 
under  the  label  of  deliberative  democracy  and 
owes  much  to  the  work  of  Jurgen  Habermas  and 
his  followers  as  advocates  of  domination-free  dis¬ 
course.  Teorell  (2006)  sums  things  up  when  he 
writes  that  deliberation  is  to  be  considered  “as  a 
process  of  opinion  formation  rather  than  a  proce¬ 
dure  for  decision  making”  (p.  791).  It  is  not  by 
chance  that  one  of  the  few  viable  instrumental 
efforts  along  such  lines  is  the  concept  of  delibera¬ 
tive  polls,  where  a  representative,  randomly 
selected  group  of  citizens  gets  together  for  a  couple 
of  days  to  deliberate,  with  pre-  and  postmeasure¬ 
ments,  on  critical  issues  to  unfold  the  impact  of  a 
dynamic  free  political  discourse  on  improving  and 
legitimating  political  decision  making. 

Any  normative  theory  of  democracy  has  to  be 
closely  linked  to  the  notions  of  equal  representa¬ 
tion  and  general  accountability;  this  is  why  the 
liberal-elitist  theory  of  democracy  is  attractive  to 
many  in  politics  and  in  political  science.  Moving 
beyond  institutionalized  political  participation  in 
the  electoral  process  creates  obvious  theoretical 
and  practical  problems  for  participatory  democ¬ 
racy  on  those  two  accounts,  and  this  becomes  ever 
more  visible  as  politics  is  forced  to  overcome 
national  barriers  and  to  deal  with  various  institu¬ 
tionalized  (e.g.,  the  United  Nations,  the  World 
Bank)  or  noninstitutionalized  transnational  institu¬ 
tions  and  networks  of  action  (e.g.,  nongovernmen¬ 
tal  organizations  [NGOsj).  On  the  international 
level,  though,  the  actors  or  institutions  as  units  of 
analysis  are  usually  quite  clear  and  easily  identifi¬ 
able.  The  same  cannot  be  said  for  those  involved 
in  deliberative  discourses,  not  to  speak  of  the  prob¬ 
lems  of  legitimating  who  is  represented  in  such 
discourses  and  in  relating  deliberative  discourses 
to  accountable  political  decision  making. 

Recently,  in  an  interesting  reflection  on  problems 
of  deliberative  democracy,  another  facet  to  this 
debate  has  been  added.  It  is  argued  that  in  delibera¬ 
tion,  not  only  individuals  or  groups  but  also  the  full 
array  of  discourses  on  a  particular  problematic 
should  be  represented,  for  instance,  through  a 
Chamber  of  Discourses,  formal  or  informal. 
Obviously,  a  variety  of  important  problems  is  involved 
here,  such  as  authorization,  accountability,  and  crite¬ 
ria  for  selecting  representatives  (who,  as  actors,  have 
to  satisfy  criteria  related  to  discursive  democracy, 
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which  cannot  do  without  individual  actors).  The 
scholarly  discussion  on  such  ideas  has  at  best  just 
started;  these  ideas  have  not  yet  found  entry  into 
the  debate  on  (political)  participation  and  are, 
therefore,  not  yet  represented  in  the  empirical 
study  of  participation. 

This  discussion  should  make  one  thing  clear: 
The  empirical  study  of  participation  is  closely 
linked  to  normative  theories  of  democracy  and  has 
to  be  understood  in  such  terms.  The  voluntary, 
free,  and  consequential  political  involvement  of 
citizens  is  a  minimal  conditio  sine  qua  non  for  any 
legitimate  democratic  political  system,  however 
developed  it  may  be  and  however  well  it  may  sat¬ 
isfy  the  demands  of  particular  strands  of  demo¬ 
cratic  theory.  Needless  to  say,  political  and  social 
participation  also  take  place  in  authoritarian  or 
totalitarian  systems.  But  this  is  not  the  subject  to 
be  addressed  in  this  entry,  not  the  least  because  of 
the  fact  that  scientific  as  well  as  commercial  empir¬ 
ical  social  research  in  such  systems,  as  historic 
examples  have  shown,  is  unlikely  or  even  impos¬ 
sible  when  citizens  cannot  freely  express  their 
opinions. 

Political  Involvement 

For  political  scientists  as  well  as  for  all  others 
involved  directly  or  indirectly  in  the  political  pro¬ 
cess,  interest  in  political  matters  is  a  given.  The 
same  can,  though,  not  be  said  for  all  citizens  even 
in  democratic  polities.  Surveys  such  as  the  2002 
Round  of  the  ESS  (see  Miki  Kittilson,  2007),  com¬ 
paring  the  relative  importance  of  various  life 
domains,  have  demonstrated  time  and  again  that 
politics  ranks  very  low  compared  with  domains 
such  as  family,  work,  health,  or  even  leisure.  This 
makes  it  even  more  interesting  to  look  at  those 
who  show  political  involvement  and  what  conse¬ 
quences  this  has  for  political  participation  as  active 
acts  of  engagement.  In  normative  democratic  the¬ 
ory,  political  involvement  is  the  raison  d’etre  of  a 
citizen;  nevertheless,  as  was  pointed  out,  such 
involvement  cannot  be  taken  for  granted. 

Participation  research  concentrates  on  acts  of 
participation — that  is,  on  concrete  behavior 
directed  toward  having  influence  on  various  out¬ 
comes  of  the  political  process  (see  below).  By  con¬ 
trast,  political  involvement  must  be  regarded  as  an 
individual  psychological  predisposition,  and  it  is 


an  empirical  question  to  what  extent  and  under 
which  conditions  such  involvement  precedes  polit¬ 
ical  action.  In  their  political  action  study,  Barnes 
et  al.  (1979)  analyzed  five  out  of  the  set  of  eight 
countries  participating  in  the  study  and,  in  the 
concluding  chapter,  looked  at  those  respondents 
who  showed  no  political  interest  but  showed  a 
tendency  to  act  politically  anyway.  This  was  called 
an  expressive  mode,  and  it  was  found  that  between 
18%  and  32%  of  the  respondents  fell  into  this 
category  (p.  528).  In  their  seminal  study  Voice  and 
Equality,  Sidney  Verba,  Kay  Schlozman,  and 
Henry  Brady  (1995)  also  emphasize  the  impact  of 
political  engagement — as  they  call  it — on  partici¬ 
pation  in  the  sense  of  being  mutually  reinforcing, 
although  they  mention  that  the  closeness  of  this 
relationship  makes  it  less  interesting  for  the  study 
of  participation. 

In  recent  empirical  studies,  political  involve¬ 
ment  is  operationalized  in  similar  and  fairly  com¬ 
mon  ways,  although  standardization  for  the 
empirical  assessment  of  this  concept  has  not  yet 
been  established.  Jan  Van  Deth  (2008)  proposes, 
based  on  19  countries  in  Round  1  of  the  ESS, 
looking  at  four  separate  elements  of  the  involve¬ 
ment  concept:  (1)  political  interest,  (2)  frequency 
of  engaging  in  political  discussions  with  friends 
and  family,  (3)  the  personal  importance  of  poli¬ 
tics,  and  (4)  the  saliency  of  politics  (politics  is  the 
most  important  of  the  seven  life  domains).  These 
four  indicators  are  obviously  more  or  less 
“demanding,”  and  it  is  not  surprising  that  interest 
in  politics  is  the  least  and  saliency  of  politics  the 
most  demanding  factor.  One  important  finding  is 
that  the  analyzed  countries  differ  substantially  in 
the  degree  of  political  involvement  of  their  citi¬ 
zens,  with  Southern  European  countries  (Greece, 
Spain,  Italy,  and  Portugal)  the  least  involved  (van 
Deth,  2008,  pp.  196-199).  Such  differences  stimu¬ 
late  the  search  for  macro  and  contextual  factors  as 
explanatory  variables.  However,  “the  attempts  to 
account  for  cross-national  differences  in  involve¬ 
ment  with  macro-level  factors  do  not  seem  to  be 
very  promising  .  .  .  The  multi-level  models  tested 
here  underline  the  relevance  of  conventional  indi¬ 
vidual-level  factors,”  such  as  education,  age,  gen¬ 
der,  and  external  efficacy  beliefs  (van  Deth,  2008, 
pp.  204-205,  215-216). 

The  Citizen,  Involvement,  Democracy  (CID) 
survey  conducted  between  1999  and  2002  in 
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12  European  countries  (Germany  for  historical 
reasons  was  counted  as  two  countries:  West 
Germany  and  the  area  of  the  former  German 
Democratic  Republic)  addresses  a  whole  range  of 
sociopolitical  participatory  activities  (Jan  van 
Deth,  Jose  Montero,  &  Anders  Westholm,  2007). 
There  also  the  topic  of  political  involvement  is 
covered,  although  in  a  slightly  different  vein  as  the 
analysis  is  only  based  on  responses  to  two  ques¬ 
tions  on  political  interest  and  political  importance. 
First,  four  ideal  types  of  citizenship  based  on 
democratic  theory  are  developed:  a  decisionist,  a 
liberal-representative,  a  participatory,  and  a  uni¬ 
tary  type.  This  typology  is  then  operationalized  by 
cross-tabulating  the  two  involvement  items  of 
interest  and  importance:  low  importance  and  low 
interest  make  for  the  decisionist  type,  low  impor¬ 
tance  and  medium  interest  make  for  the  liberal- 
representative  type,  high  importance  and  high 
interest  make  for  the  participatory  type,  and  high 
importance  and  low  interest  make  for  the  unitary 
type.  Whether  this  typological  approach  will  sur¬ 
vive  the  tides  of  time  is  an  open  question  given  the 
fact  that  the  two  involvement  items  are  highly  cor¬ 
related,  thereby  challenging  the  empirical  basis  of 
the  above  typology.  There  are  nevertheless  two 
findings  worth  emphasizing:  Looking  at  country 
clustering,  the  CID  study,  too,  finds  Southern 
European  countries  to  be  the  least  politicized 
(Newton  &  Montero,  2007),  and  the  distinction 
between  the  liberal-representative  type  and  the 
participatory  type  in  terms  of  antecedents  and  level 
of  participatory  engagement  is  very  clear-cut. 

Both  involvement  studies  fall  short  on  one 
important  dimension:  changes  over  time.  This  is  a 
very  interesting  problematic  because  the  postwar 
period  can  be  characterized  not  the  least  through 
the  extension  of  educational  systems  to  supply 
more  and  more  people  with  a  higher  level  of  formal 
education,  and  it  is  well-known  from  all  participa¬ 
tion  studies  that  political  and  social  participation  is 
positively  related  to  higher  levels  of  formal  educa¬ 
tion.  Russell  Dalton  (2008)  brings  the  concept  of 
increasing  cognitive  mobilization  into  play.  His 
data  for  four  countries  (France,  Germany,  Great 
Britain,  and  the  United  States)  indeed  point  to  an 
increasing  interest  in  politics  from  the  1950s 
onward.  However,  other  analyses  looking  at  levels 
of  political  discussion  (which  is,  of  course,  a  more 
demanding  activity)  in  Europe  for  the  1973  to  1991 


period  based  on  Eurobarometer  data  no  longer 
show  an  upward  trend.  In  fact,  there  is  no  neces¬ 
sary  relationship  between  rising  levels  of  education 
and  higher  levels  of  political  involvement  because 
period  effects  may  also  interfere  substantially. 

Political  Participation 

Political  participation  refers  simply  to  activity  that 
is  intended  to  influence  government  action — either 
directly  by  affecting  the  making  or  implementation 
of  public  policy  or  indirectly  by  influencing  the 
selection  of  people  who  make  those  policies. 
Voluntary  activity  means  participation  that  is  not 
obligatory — no  one  is  forced  to  volunteer — and 
that  receives  no  pay  or  only  token  financial  com¬ 
pensation.  This  is  an  operationalization  that  is 
shared  by  a  great  deal  of  other,  established  partici¬ 
pation  studies.  In  this  operationalization,  the  target 
of  the  activities  covered  is  and  remains  policymak¬ 
ers,  although  by  including  protest  activities  such  as 
participating  in  political  demonstrations  the 
authors  extend  the  repertoire  of  activities.  At  this 
point,  the  scholars  involved  in  the  CID  project  pro¬ 
pose  an  extension  based  on  David  Easton’s  classi¬ 
cal  notion  of  politics  as  the  authoritative  allocation 
of  values.  On  this  basis,  political  participation  is 
conceptualized  to  also  capture  activities  by  ordi¬ 
nary  citizens  not  targeted  at  political  authorities. 
The  innovation  the  study  proposes  is  to  go  beyond 
voting,  party  activity,  protest  activity,  and  contact¬ 
ing  and  to  include  consumer  participation  as  a  new 
category  in  the  form  of  boycotts — declining  to  buy 
certain  products — and  buycotts — buying  certain 
products,  although  the  concept  of  politics  here 
appears  to  be  ambiguous  and  not  very  precise. 

In  the  following,  the  various  dimensions  of 
political  participation  are  discussed  in  greater 
detail. 

Voting 

In  the  spirit  of  liberal  democratic  theory,  encom¬ 
passing  participation  in  free,  equal,  and  secret 
political  elections  is  the  core  legitimating  mecha¬ 
nism  by  which  representatives  are  chosen  for  a 
limited  period  of  time  to  participate,  in  various 
functions,  in  accountable  political  decision  mak¬ 
ing.  Given  the  practical  importance  of  elections  for 
selecting  those  actors  with  the  right  to  legitimately 
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exercise  power  and,  therefore,  the  enormous  pub¬ 
lic  visibility  of  and  interest  in  elections,  it  is  not 
surprising  that  political  sociology  has  early  on 
studied  electoral  participation — election  turnout 
and  voting  behavior.  However,  the  early-20th- 
century  studies  usually  concentrated  on  single  elec¬ 
tions,  were  based  on  aggregate  voting  statistics, 
and  could,  therefore,  not  contribute  much  to  the 
understanding  of  why  individual  people  voted  or 
abstained  in  such  elections  and  why  they  voted  for 
a  particular  political  party.  This  situation  changed 
significantly  with  the  establishment  of  the  survey 
method,  producing  individual  data  that  permitted 
a  much  better  understanding  of  electoral  participa¬ 
tion  and,  as  time  went  on,  of  the  electoral  process 
and  its  dynamics  at  large.  Further  enrichment 
came  through  international  collaboration  stimu¬ 
lated  by  American  psephologists  at  the  Institute  for 
Social  Research  at  the  University  of  Michigan  since 
the  1960s,  which  over  time  helped  create  world¬ 
wide  scholarly  networks  with  integrated  theoreti¬ 
cal  approaches. 

An  excellent  documentation  of  the  payoffs  from 
a  longitudinal  comparative  study  of  turnout  is 
Voter  Turnout  and  the  Dynamics  of  Electoral 
Competition  in  Established  Democracies  since 
1945  (Mark  Franklin,  2004).  This  book  addresses 
three  major  questions:  (1)  Why  do  people  vote  at 
all  given  that  according  to  rational  choice  theory, 
individual  votes  have,  for  all  practical  purposes,  no 
influence  on  the  aggregate  outcome  of  elections? 
(2)  Is  there  a  general  decline  in  turnout,  and,  if  so, 
what  can  one  learn  with  respect  to  whether  such  a 
decline  has  happened  because  of  changing  civic 
virtues  and  political  disaffection?  (3)  What  are  the 
major  macro-  and  microfactors  that  influence 
variations  in  turnout  within  and  across  countries? 

These  questions  cannot  be  answered  here  in  any 
detail.  A  very  important  finding  is  that  turnout  in 
established  democracies  increased  between  1950 
and  1965  and  then  has  gradually  declined  until  the 
present  period.  Andre  Blais  (2007)  calculates  an 

8- percentage-point  decline  for  106  countries  and  a 

9- percentage-point  decline  for  29  established 
democracies.  The  statistically  sophisticated  analy¬ 
ses  by  Franklin  shed  a  reliable  light  on  the  reasons 
for  this  decline  in  attributing  it  to  three  macrode¬ 
velopments:  ( 1 )  changes  in  the  size  of  the  electorate 
through  generational  replacement  with  young 
cohorts  less  inclined  to  vote,  (2)  lowering  of  the 


voting  age  in  many  countries  in  the  late  1960s,  and 
(3)  the  degree  and  nature  of  party  competition. 
These  are  excellent  examples  of  processes  that 
happen  at  the  macrolevel  of  societies  and  then 
trickle  down  to  aspects  of  individual  behavior. 
This,  in  turn,  indicates  that  micronotions  of  declin¬ 
ing  trust  in  institutions,  decreases  in  the  civic 
mindedness  of  citizens,  and  disaffection  with 
democracy  do  not  contribute  to  the  observed 
decline. 

A  different  story  can  be  told  with  respect  to  dif¬ 
fering  average  levels  of  turnout  for  any  given  coun¬ 
try  and  for  within-country  variations  over  time. 
Regarding  levels  of  turnout,  there  is  a  lot  of  path 
dependency  in  the  sense  that  major  ad  hoc  varia¬ 
tions  in  turnout  are  unlikely,  due  to  stable  institu¬ 
tional  factors  such  as  electoral  laws,  registration 
rules,  or  compulsory  voting.  By  contrast,  within- 
country  turnout  changes  are  greatly  affected,  espe¬ 
cially  by  the  competitiveness  of  any  given  election, 
which  makes  the  “every  vote  counts”  notion  more 
plausible.  That  this  effect  is  weaker  for  established 
than  for  young  cohorts  cannot  come  as  a  surprise 
given  the  fact  that  socialization  into  politics  takes 
time  and  is  especially  difficult  for  younger  people, 
for  whom  a  lot  of  other  priorities  such  as  finishing 
an  education  and  starting  a  family  take  prece¬ 
dence.  To  sum  up,  then,  to  understand  voting 
participation  along  the  time  dimension  requires 
modeling  the  complex  interaction  of  individual 
and  systemic  factors,  requires  a  longitudinal  data¬ 
base,  and  benefits  most  strongly  from  a  combina¬ 
tion  of  micro-  and  macrolevel  information. 

Other  Dimensions  of  Political  Participation 

Research  on  political  participation  going  beyond 
turnout  and  party  choice  started  in  the  1960s  with 
Political  Participation:  Plow  and  Why  Do  People 
Get  Involved  in  Politics ?  (Lester  Milbrath,  1965). 
Milbrath  conceptualized  political  participation  in 
the  form  of  a  unidimensional  pyramid  of  electorally 
based  attitudes,  with  the  “easier,”  more  frequent 
items  at  the  bottom  and  the  more  demanding  ones 
toward  the  top.  It  is  interesting  to  note  that  Milbrath 
already  mentioned  demonstrations  but  argued  at 
the  time  that  they  “do  not  fit  into  the  hierarchy  of 
political  involvement  in  the  United  States”  (p.  18). 
In  a  later  edition  of  the  book,  the  reach  has  been 
extended  to  also  include  demonstrations. 
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A  major  step  forward  in  the  empirical  analysis 
of  political  participation  was  a  seven-nation  study 
looking  at  the  structure,  antecedents,  and  conse¬ 
quences  of  political  participation  (Sidney  Verba, 
Norman  Nie,  &  Jae-On  Kim,  1978).  In  this  study, 
the  concept  of  the  unidimensionality  of  the  par¬ 
ticipatory  space  was  overcome,  through  factor 
analysis,  by  establishing  four  separate  sets  of 
activities:  voting,  campaign  activity,  (particular¬ 
ized)  contacting  of  public  officials,  and  communal 
activity.  But  this  study  did  not  yet  cover  the 
domain  of  protest  politics;  the  authors  state  explic¬ 
itly  that  “our  concern  is  with  activities  ‘within  the 
system’ — ways  of  influencing  politics  that  are  gen¬ 
erally  recognized  as  legal  and  legitimate”  (Verba 
&  Nie,  1972,  p.  3). 

The  first  large  comparative  survey  study  that 
explicitly  reached  out  into  the  protest  dimension 
was  a  political  action  study  (Barnes  et  ah,  1979)  in 
eight  Western  democracies;  of  those,  the  Nether¬ 
lands,  (West)  Germany,  and  the  United  States  were 
revisited,  in  a  panel  study,  between  1979  and  1981 
(Kent  Jennings  &  Jan  van  Deth,  1989).  These  stud¬ 
ies  were  triggered  by  the  rise  in  protest  activities,  in 
particular  petitions  and  demonstrations,  beginning 
in  the  mid-1960s  especially  in  the  United  States, 
Western  Europe,  and  Japan.  In  the  history  of  state 
building,  protest — often  violent  in  nature — was 
not  unusual;  however,  political-democratic  con¬ 
solidation  after  World  War  II  emphasized  a  “civic 
culture”  and  not  a  strongly  participatory  one 
(Gabriel  Almond  &  Sidney  Verba,  1963)  until  a 
protest  culture  began  to  build  up  in  the  1960s. 

The  political  meaning  of  this  development  in 
the  beginning  was  not  quite  clear.  A  great  theo¬ 
retical  and  methodological  difficulty  for  a  survey 
study  looking  at  cross  sections  of  the  voting-age 
population  was  the  relative  scarcity  of  activities 
beyond  the  electoral  realm  at  the  time  of  the  sur¬ 
vey.  This  was  the  reason  for  the  approach  by  the 
political  action  group — an  approach  later  criti¬ 
cized  by  other  scholars — to  not  only  concentrate 
on  concrete  activities  but  also  look  at  attitudes 
toward  the  behavior  the  group  was  interested  in, 
resulting  in  the  concept  of  protest  potential.  But 
the  major  achievement  of  the  project  was  to  rede¬ 
fine  the  participatory  space  by  adding  a  dimension 
of  uninstitutionalized  participation,  thus  juxtapos¬ 
ing  the  two  dimensions  of  conventional  and 
unconventional  participation.  One  of  the  first 


pertinent  findings  was  that  these  two  dimensions 
were  positively  correlated  in  all  countries,  thereby 
suggesting  an  increase  in  the  political  action  reper¬ 
toire  of  citizens  and  not  the  demise  of  liberal 
democracy.  It  is  remarkable  that  this  finding  from 
1974  was  fully  corroborated  by  other  studies 
almost  3  decades  later. 

The  second  result  was  that  the  most  important 
antecedents  of  protest  potential  were  high  levels  of 
education  and  young  age.  In  particular,  the  posi¬ 
tive  impact  of  education  on  protest  attitudes 
squared  well  with  the  socioeconomic  standard 
(SES)  model  developed  by  Verba  and  Nie  (1972), 
which  emphasized  high  levels  of  education,  high 
income,  and  high  social  status  as  pertinent  ante¬ 
cedents  of  political  participation.  Whether  youth¬ 
fulness  was  a  permanent  corollary  of  protest 
potential  was  impossible  to  assess  at  that  time 
without  the  availability  of  longitudinal  data.  The 
second  wave  of  the  political  action  study  (Jennings 
&  van  Deth,  1989)  then  corroborated  the  finding 
that  education  is  a  factor  conducive  to  all  modes 
of  potential  and  real  participation.  However,  the 
initial  findings  on  the  positive  role  of  youthfulness 
had  to  be  qualified  in  the  sense  that  young  age 
became  less  important  as  one  moved  from  attitude 
closer  to  action,  thereby  pointing  to  the  role  of 
different  elements  in  contextual  mobilization.  In  a 
more  recent  study  of  participation  in  demonstra¬ 
tions  in  Belgium,  the  reduced  role  of  youthfulness 
in  protest  participation  was  not  confirmed,  while 
the  impact  of  education  remained  relevant.  What 
is  important  to  learn  from  the  Belgian  study  is 
that  the  sociostructural  and  attitudinal  partici¬ 
pation  profiles  of  demonstrators  vary  substan¬ 
tially  depending  on  the  event  that  triggered  the 
demonstration. 

The  juxtaposition  of  conventional  and  uncon¬ 
ventional  participation  in  political  action  owed 
much  to  the  relative  scarcity  of  acts  such  as 
petitions,  demonstrations,  citizen  initiatives,  and 
boycotts  at  the  time  of  the  survey.  However,  in 
participation  research  there  is  now  agreement  that 
especially  the  term  unconventional  participation  is 
no  longer  meaningful  given  the  fact  that  through 
processes  of  sociopolitical  change  these  acts  have 
become  a  regular  and  legitimate  part  of  citizens’ 
action  repertoires.  To  overcome  this  problem, 
Norris  (2007)  proposes  to  introduce  a  distinction 
“between  citizen-oriented  actions,  relating  mainly 
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to  elections  and  parties,  and  cause-oriented  reper¬ 
toires  which  focus  attention  upon  specific  issues 
and  policy  concerns”  (p.  639).  Edeltraud  Roller 
and  Tatjana  Rudi  (2008)  speak  of  electoral  and 
nonelectoral  participation;  Kaase  prefers  the  dis¬ 
tinction  between  institutionalized  and  noninstitu- 
tionalized  forms  of  participation.  Ronald  Inglehart, 
already  in  1977,  suggested  the  terms  of  elite- 
directed  action  (related  to  elections  and  parties) 
and  elite-challenging  action.  The  most  recent  pro¬ 
posal  for  terminological  innovation  comes  from 
the  CID  group,  which,  based  on  a  factor  analysis 
of  their  participation  items,  finds  four  participa¬ 
tory  dimensions  (contacting,  party  activity,  protest 
activity,  consumer  participation)  and  adds  voting 
as  the  fifth  dimension  and  organizes  these  types  in 
a  fivefold  table,  distinguishing  between  mecha¬ 
nisms  of  influence  (exit  or  voice)  and  channels  of 
expression  (representational  or  extrarepresenta- 
tional).  These  are  all  terminological-conceptual 
efforts  without  greatly  changing  the  dimensional 
structure  of  the  participatory  space  known  from 
many  previous  studies,  except  for  finding  a  place 
for  a  new  type  of  involvement  no  longer  exclu¬ 
sively  directed  at  political  authorities:  consumer 
participation  measured  as  buycotts  or  boycotts. 
All  in  all,  the  famous  saying  by  Mao  Zedong,  “let 
a  hundred  flowers  bloom,”  may  be  good  for 
culture;  in  political  science,  greater  continuity  in 
theoretical  approaches  and  empirical  work  would 
certainly  be  of  advantage,  leading  to  a  cumulative 
growth  of  knowledge. 

Levels,  Antecedents,  and  Consequences 
of  Political  Participation 

One  of  the  charms  of  the  liberal-elitist  model  of 
democracy  is  its  emphasis  on  equal  participatory 
rights  through  general  voting.  If  everyone  or  at 
least  a  large  part  of  the  population  does  indeed 
exercise  the  right  to  vote,  it  is  a  logical  conse¬ 
quence  that  no  particular  individual  factors  can  be 
detected  that  influence  participation.  As  soon  as 
one  goes  beyond  the  electoral  realm,  though,  the 
differences  between  modes  of  action  within  and 
between  countries  become  very  large.  In  the  CID 
study,  for  example,  between  4%  and  21%  of  the 
citizenry  around  the  year  2000  engaged  in  at  least 
one  act  of  contacting,  between  1%  and  6%  in  a 
party-related  activity,  between  1%  and  8%  in  an 


act  of  protest,  and  between  5%  and  44%  in  at 
least  one  act  of  consumer  buycott  or  boycott.  A 
similar  picture  emerges,  although  with  consider¬ 
able  differences  due  to  the  choice  of  indicators, 
with  regard  to  both  acts  of  participation  and  the 
countries  of  concern  from  the  analysis  of  Round  1 
of  the  ESS  (Newton  &  Montero,  2007). 

Concentrating  first  on  individual-level  factors, 
the  initial  SES  model  has  now  been  replaced  by  a 
more  sophisticated  model:  the  civic  voluntarism 
model  (CVM).  In  a  detailed  step-by-step  analysis, 
Verba  et  al.  (1995)  go  beyond  the  SES  model  to 
analyze  the  explanatory  links  between  SES  and 
participation.  The  CVM  includes  the  explanatory 
dimensions  of  education  and  language,  income 
and  time,  civic  skills  and  political  engagement: 
“Political  participation,  then,  is  the  result  of  politi¬ 
cal  engagement  and  resources”  (Verba  et  ah,  1995, 
p.  354).  The  authors  regard  the  CVM  as  a  power¬ 
ful  instrument  to  predict  political  activity  in  gen¬ 
eral,  and  they  further  argue  in  favor  of  the  strength 
and  validity  of  the  CVM  because  it  applies  to  all 
modes  of  political  activity  and  finds  different 
weights  for  its  predictors  depending  on  the  partic¬ 
ular  kind  of  activity  to  be  explained. 

As  soon  as  one  moves  beyond  the  study  of  one 
individual  country,  however,  one  has  the  chance 
also  to  look  at  the  impact  of  institutional  (macro) 
variables  and  to  study  the  effects  of  eventual  inter¬ 
actions  between  micro-  and  macrovariables  in  a 
multilevel  research  design.  Two  such  analyses  are 
of  interest  here,  both  with  data  coming  from  the 
Round  1  of  the  ESS.  In  the  first  of  these  analyses, 
the  authors  have  collected  22  macroproperties 
assessing  the  dimensions  of  political  order,  demo¬ 
cratic  effectiveness,  wealth,  government  expendi¬ 
ture,  and  population  and  have  regressed  those 
against  an  encompassing  composite  country  par¬ 
ticipation  score.  Of  those  macro  indicators,  rule  of 
law  (a  complex  measure  provided  by  the  World 
Bank)  has  turned  out  to  be  the  most  important  vari¬ 
able  for  influencing  levels  of  conventional  political 
participation,  whereas  protest  is  most  strongly 
related  to  a  high  level  of  economic  development, 
according  to  Newton  and  Montero.  Both  findings 
are  well  in  line  with  existing  theoretical  approaches 
to  specify  systemic  conditions  for  the  emergence 
and  existence  of  various  modes  of  participation. 

The  multilevel  analysis  by  Roller  and  Rudi 
(2008)  starts  out  with  the  individual  variables  of 
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the  CVM  plus  measures  of  social  capital  to  opti¬ 
mally  exploit  the  potential  for  the  micro  explana¬ 
tion  of  electoral  and  nonelectoral  participation. 
This  analysis  first  confirms  the  findings  of  Verba 
et  al.  (1995)  that  the  CVM  can  be  used  to  explain 
both  forms  of  participation,  although  the  various 
components  of  the  model  do  not  equally  contrib¬ 
ute  to  explain  the  two  forms.  For  the  multilevel 
analysis,  three  macrofactors  measured  by  eight 
indicators  are  introduced:  (1)  socioeconomic  mod¬ 
ernization,  (2)  the  electoral  system,  and  (3)  measures 
of  social  capital  composed  from  the  aggregation  of 
individual-level  indicators.  The  findings  for  elec¬ 
toral  and  nonelectoral  participation  regarding  the 
impact  of  systemic  variables  are,  however,  disap¬ 
pointing:  The  share  of  the  total  variance  explained 
by  contextual  factors  for  electoral  participation  is 
just  8%  and  for  nonelectoral  participation,  an 
even  lower  6%,  speaking  again  for  the  power  of 
the  individual  CVM  model.  Nevertheless,  modern¬ 
ization  has  the  expected  effect  on  levels  of  nonelec¬ 
toral  participation,  while  political-institutional 
factors  have  an  impact  on  levels  of  electoral  par¬ 
ticipation.  These  findings  square  well  with  the 
theoretical  expectations. 

Participatory  theories  of  democracy  emphasize 
the  right  and  the  need  for  citizens  to  have  a  direct 
and  not  only  an  indirect  say  in  political  decision 
making.  One  aspect  derived  from  empirical  par¬ 
ticipation  studies,  however,  has  been  from  the 
beginning  that,  other  than  the  vote,  citizen  partici¬ 
pation  is  biased  along  the  lines  of  the  CVM  model; 
it  is  the  most  resourceful  and  engaged  citizens  who 
dominate  involvement  in  acts  of  participation 
beyond  the  vote.  Early  on  this  has  already  created 
concerns  about  the  fairness  of  the  democratic  pro¬ 
cess  (Verba  et  ah,  1978;  Verba  &  Nie,  1972). 
What  if  the  citizen  and  group  voices  heard  most 
loudly  in  politics  are  biased  against  the  needs  and 
preferences  of  the  population  at  large?  In  Voice 
and  Equality  (Verba  et  ah,  1995),  the  authors 
address  this  problematic  in  a  very  detailed  fashion. 
They  conclude, 

According  to  a  liberal  model  of  American 
democracy,  the  principal  role  of  citizen 
participation  is  to  transmit  information  to  public 
authorities  about  activists’  self-interested 
objectives.  This  chapter  demonstrated  a  consistent 
pattern  when  it  comes  to  the  participatory 


politics  of  self-interest.  Whether  we  are 
considering  attitudes  on  the  economy,  actual 
economic  circumstances  and  needs,  or  opinions 
on  government  efforts  to  assist  Blacks  or  Latinos, 
the  process  operates  to  bias  participatory  input 
in  the  direction  of  the  needs  and  the  preferences 
of  the  advantaged,  (p.  506) 

The  question  to  be  addressed  next  is  to  what 
extent  the  reported  findings  for  the  United  States 
also  pertain  to  a  broader  array  of  countries. 
Comparing  the  effect  of  education  as  a  proxy  for 
the  command  of  resources  for  electoral  and  non¬ 
electoral  participation,  the  findings  are  very  much 
in  line  with  those  for  the  United  States:  There  is 
almost  no  effect  of  education  on  electoral  partici¬ 
pation,  and  a  substantial  effect  on  nonelectoral 
participation  (Roller  &  Rudi,  2008).  Moving  to 
the  multilevel  analysis,  contextual  variables  do  not 
add  much  to  explain  the  modes  of  participation 
studied.  But  there  is  one  result  that  challenges  tra¬ 
ditional  wisdom  of  political  science:  It  is  majority 
voting  and  majoritarian  political  institutions  that 
reduce  political  inequality  for  both  modes  of  par¬ 
ticipation  (Roller  &  Rudi,  2008). 

There  are  at  least  two  weaknesses  in  participa¬ 
tion  research  that  need  to  be  addressed.  The  first — 
a  minor  one — is  the  increasing  use  of  the  Internet 
as  a  means  of  political  communication  and  mobi¬ 
lization,  which  will  require  more  attention  now. 
The  second,  more  consequential  one  refers  to  the 
fact  that  inequality  of  participatory  input  into  the 
political  system  does  not  per  se  justify  the  conclu¬ 
sion  that  the  policy  output  suffers  systematically 
from  a  similar  bias.  A  further  necessary  differentia¬ 
tion  concerns  the  levels  of  the  political  system  to 
which  participatory  activities  are  directed.  What 
will  be  necessary  in  the  future,  according  to  Norris, 
will  be  to  assess  reliably  the  relationship  between 
participatory  input  from  various  groups  of  the 
citizenry  and  the  policy  output  created  by  the  insti¬ 
tutionalized  policy  process  on  the  various  system 
levels. 

There  is,  incidentally,  one  aspect  in  the  analyses 
reported  above  that  permits  a  friendlier  interpreta¬ 
tion  of  the  observed  inequalities.  Since  those  citi¬ 
zens  who  are  politically  the  most  active  are  also 
well  embedded  in  social  networks  and  are  highly 
resourceful,  their  engagement  may  help  improve 
the  overall  quality  of  the  democratic  process,  a 
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consideration  leading  back  to  aspects  of  delibera¬ 
tive  democracy.  This  is  another  challenge  for 
future  research  on  political  participation. 

Changes  in  Preferences  for  Different 
Modes  of  Political  Participation 

While  participation  research  in  general  does  not 
suffer  from  a  paucity  of  data  and  findings,  there  is 
one  aspect  that  is  not  covered  sufficiently:  the  lon¬ 
gitudinal  comparative  study  of  developments  in 
preferences  for  the  various  acts  and  modes  of  par¬ 
ticipation  (one  exception  is  turnout,  but  this  field 
profits  from  the  availability  of  aggregate  voting 
statistics).  There  are  some  longitudinal  data, 
though,  that  relate  to  items  of  elite-challenging 
actions  derived  from  the  1974  political  action 
study  and  replicated  in  four  waves  of  the  WVS 
between  1981  and  2000  (for  details,  see  Inglehart 
&  Catterberg,  2003).  These  data  point  to  a  notice¬ 
able  increase  over  the  past  3  decades  in  elite- 
challenging  action  in  the  eight  political  action 
countries  and  France — all  Western  democracies — 
an  increase  the  authors  attribute  to  an  ongoing 
process  of  economic  modernization.  Unfortunately, 
the  data  situation  does  not  permit  an  analysis  of 
possible  replacement  effects  between  the  various 
dimensions  of  political  engagement.  In  particular, 
it  would  have  been  very  interesting  to  see  whether 
nonelectoral  activities  may  have  grown  at  the 
expense  of  electoral  activities. 

Social  Participation 

Other  than  political  participation,  social  participa¬ 
tion  has  not  yet  achieved  the  status  of  a  well- 
defined  social  science  subfield.  Analogous  to 
political  participation,  social  participation  can  be 
defined  as  all  activities  by  individual  citizens  in 
social  contexts  going  beyond  the  inner  circle  of 
family  and  friendship  relations  that  are  not  pri¬ 
marily  directed  toward  influencing  political  out¬ 
comes.  As  such,  social  participation  constitutes  a 
core  element  of  civil  society.  For  a  long  time,  in 
social  research,  the  core  representation  of  civil 
society  was  voluntary  associations  integrating 
individuals  into  the  intermediary  structures  of 
society — the  mesolevel.  For  example,  the  study  of 
membership  in  organizations  such  as  churches  and 
trade  unions  was  a  normal  part  of  electoral 


research  under  the  assumption  that  social  cleav¬ 
ages  had  been  translated  into  party  systems  through 
the  constant  interaction  of  group  and  party  elites. 
However,  the  major  breakthrough  in  making 
social  participation  an  independent  field  of  study — 
even  if  closely  related  to  concerns  about  demo¬ 
cratic  governance — came  through  the  study 
Making  Democracy  Work  (Robert  Putnam,  1993), 
which  looked  at  conditions  favoring  or  hindering 
the  overall  administrative  and  economic  perfor¬ 
mance  of  Italian  regions.  In  pointing  especially  to 
the  organizational  underpinning  of  high  economic 
performance  through  established  cultural  and  eco¬ 
nomic  groups  working  together  in  an  environment 
of  mutual  trust  and  solidarity,  Putnam  employed 
the  term  social  capital  (a  term  already  previously 
used  by  Pierre  Bourdieu  and,  later,  James  Coleman) 
to  characterize  the  reasons  for  the  diversity  of 
regional  performance. 

The  concept  of  social  capital  became  a  major 
incentive  for  political  science  when  this  was  related 
to  the  observation  that  in  the  United  States  a 
decline  in  social  bonding  was  occurring  that — 
recalling  the  Tocquevillean  tradition  of  American 
democracy — was  a  challenge  to  American  democ¬ 
racy  and  democracy  at  large.  Social  capital  can  be 
regarded  as  a  macroproperty  of  societies  in  the 
sense  of  systemic  capital.  The  concept,  however, 
has  in  the  meantime  also  been  transferred  to 
microresearch  through  surveys.  While  there  is  no 
standard  operationalization  in  empirical  research, 
there  is  agreement  that  social  capital  has  three 
major  dimensions:  (1)  the  degree  to  which  people 
trust  each  other,  (2)  the  acceptance  of  norms  of 
reciprocity  as  cultural  elements  of  social  capital, 
and  (3)  the  involvement  in  social  networks,  be  it 
through  voluntary  associations  or  through  other 
less  formal  regular  contacts. 

With  regard  to  longitudinal  comparative  data 
sets  reliably  measuring  social  capital,  the  situation 
is  not  fully  satisfactory,  although  information  accu¬ 
mulated  from  a  variety  of  data  sources  indicates 
that  a  general  decline  in  social  capital  cannot  be 
observed.  In  another  study  it  is  argued  that  elite- 
challenging  participation  is  at  least  as  important  a 
constituent  of  social  capital  as  is  membership  in 
voluntary  associations.  Longitudinal  data  from  the 
WVS  for  12  postindustrial  societies  support  the 
argument  that  there  is  no  measurable  decrease  in 
overall  organizational  membership  between  1980 
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and  2000,  although  within  this  overall  category 
sociotropic  organizations  have  gained  in  member¬ 
ship,  whereas  membership  in  traditional  organiza¬ 
tions  (e.g.,  trade  unions)  has  slightly  decreased  or 
remained  constant.  Particularly  interesting  in  the 
context  of  participation  research,  though,  is  the 
finding  that  good  governance  is  much  more  strongly 
related  to  the  level  of  elite-challenging  action  than 
to  the  level  of  associational  membership. 

One  step  away  from  the  link  between  social  par¬ 
ticipation  and  social  capital  is  taken  in  the  context 
of  the  CID  study  with  the  new  concept  of  small- 
scale  democracy  in  the  sense  of  the  potential  option 
for  citizens  to  control  their  own,  personal  life  situa¬ 
tion.  There,  small-scale  (local)  democracy  is  consid¬ 
ered  to  be  an  important  field  because  it  offers 
insights  into  the  overall  state  of  any  given  society, 
including  aspects  of  democratic  governance;  looks 
at  trade-offs  between  small-scale  and  large-scale 
democracy;  and  pursues  questions  important  also 
in  social  capital  and  participatory  theory  terms  to 
what  extent  involvement  in  small-scale  democracy 
activities  enhances  a  sense  of  identity,  self-esteem, 
and  personal  skills  that  may  have  payoffs  also  for 
large-scale  democracy.  To  answer  these  questions,  a 
broad  roster  of  incentives,  actions,  and  action  con¬ 
sequences  for  the  three  fields  of  education,  health 
care,  and  working  life  is  looked  at,  fields  that  are 
highly  relevant  for  the  citizens’  daily  lives.  It  will 
remain  to  be  seen  whether  along  those  lines  in  the 
future  a  new  field  of  systematic  research  will 
develop  given  the  finding  that  involvement  in  the 
three  domains  is  quite  frequent  across  the  CID 
countries.  Such  efforts  may  be  encouraged  by  the 
fact  that  participation  in  small-scale  democracies 
and  in  large-scale  democracies,  especially  in  the 
fields  of  contacting  and  consumer  participation, 
according  to  the  CID  data,  are  positively  correlated. 
Thus,  the  conclusion  may  be  justified  that  in 
contemporary  societies  social  and  political  partici¬ 
pation  are  indeed  fields  related  by  an  underlying 
general  tendency  to  become  involved. 
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Bremen,  Germany 
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Participation,  Contentious 


In  the  most  conventional  usage,  contentious  par¬ 
ticipation  refers  to  forms  of  political  participation 
that  employ  nonconventional,  confrontational 
means  of  action  in  expressing  collective  interests, 
such  as  demonstrations,  strikes,  and  boycotts. 
Contentious  participation  here  represents  an  alter¬ 
native  to  institutionalized  channels  of  participation, 
such  as  election,  interest  group  activities,  and 
political  parties.  However,  a  growing  number  of 
scholars  have  been  starting  to  use  the  concept  of 
contentious  participation  to  illustrate  the  conten¬ 
tious  nature  inherent  in  all  forms  of  political  action, 
which  range  from  peaceful  acts  such  as  voting  and 
lobbying  to  disruptive  and  oftentimes  violent  pro¬ 
test.  This  latter  view  highlights  aspects  of  continuity 
in  what  have  been  conventionally  understood  to  be 
categorically  different  forms  of  participation. 


This  changing  interpretation  of  contentious  par¬ 
ticipation  mirrors  the  change  in  how  the  literature 
has  come  to  understand  protests  and/or  social 
movements.  Previously,  a  social  movement  was 
commonly  understood  to  be  an  action  form  carried 
out  by  outsiders  of  a  political  establishment — that 
is,  those  who  lack  power  and  resource  and  those 
who  do  not  have  regular  access  to  the  decision¬ 
making  process.  Those  inside  the  system,  many 
believed,  would  seek  reform  or  policy  change  by 
expressing  their  voices  through  interest  groups  or 
political  parties.  Gradually,  however,  a  new  frame¬ 
work  arose  that  drew  attention  to  the  fuzzy  bound¬ 
ary  dividing  the  two  and  to  how  social  movements 
and  institutionalized  forms  of  participation  are  not 
mutually  exclusive  but  complementary. 

This  latter  view  is  predicated  on  the  idea  that,  in 
essence,  the  basic  principles  of  contentious  partici¬ 
pation  and  established  forms  of  participatory  insti¬ 
tutions,  such  as  elections,  are  not  dissimilar.  In 
fact,  many  established  political  parties,  such  as  the 
Social  Democratic  parties  in  Western  Europe,  have 
had  historical  roots  in  social  movements. 
Nowadays,  many  social  movement  groups  employ 
conventional  forms  of  action  (e.g.,  lobbying  and 
petitioning)  in  conjunction  with  their  typical  street 
repertoire,  while  many  interest  groups  organize 
large  public  demonstrations  to  add  leverage  to 
their  lobbying  effort.  They  may  look  different  on 
the  surface,  but  in  essence  both  contentious  and 
institutionalized  participation  aim  at  getting  the 
voices  of  the  public  to  be  heard  and  to  be  taken 
seriously  by  decision  makers. 

It  seems  as  if  there  is  a  growing  consensus  over 
the  overlap  between  contentious  and  institutional¬ 
ized  methods  of  participation.  Still,  it  is  important 
to  note  that  not  all  societies  are  equipped  with  the 
same  levels  of  institutionalized  channels  of  partici¬ 
pation.  As  a  result,  contentious  participation  may 
take  different  forms  and  carry  different  meanings 
across  varying  contexts.  This  is  because  forms  of 
contentious  participation  are  often  mirror  images 
of  the  political  environment  in  which  they  operate. 

Varying  Contexts  of 
Contentious  Participation 

In  authoritarian  regimes  where  channels  of  partici¬ 
pation  barely  exist,  any  attempts  at  expressing  a 
collective  voice  in  the  public  arena  are  often  driven 
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underground.  The  violent  forms  many  social 
movements  take  under  these  circumstances  reflect 
the  highly  exclusive  and  repressive  nature  of 
authoritarian  regimes.  The  risks  are  high,  but 
when  a  social  movement  mobilizes  significant  sup¬ 
port  and  grows  stronger,  it  often  leads  to  demo¬ 
cratic  change  that  opens  up  opportunity  for  more 
equal  and  broader  participation.  This  was  the  case 
in  South  Africa  and  a  number  of  Latin  American 
countries  in  the  1980s. 

Nowadays,  many  authoritarian  regimes  offer 
limited  political  space,  if  only  because  they  cannot 
command  absolute  control  over  the  flow  of  infor¬ 
mation  and  movement  of  people.  This  is  particu¬ 
larly  so  in  authoritarian  regimes  with  a  relatively 
large  economy  and  a  population  that  is  relatively 
highly  educated.  In  these  contexts,  ordinary  peo¬ 
ple  try  to  take  advantage  of  the  limited  political 
spaces  in  various  attempts  to  voice  their  concerns 
and  influence  important  policies.  For  example, 
women,  students,  teachers,  and  workers  in  Iran 
have  created  organizations  and  have  worked  with 
sympathetic  allies  in  various  public  sectors  to 
defend  and  to  proactively  claim  new  rights  since 
the  death  of  the  charismatic  leader  Khomeini  in 
1989,  which  led  to  small  cracks  within  the  ruling 
elites.  In  China,  since  the  late  1990s,  workers  and 
peasants  have  also  been  taking  advantage  of  the 
limited  openings  made  available  by  the  post- 
Maoist  reforms,  which  allowed  them  to  lodge 
complaints  to  higher  level  officials  and  stage 
various  forms  of  collective  action  to  draw  the  gov¬ 
ernment’s  attention  to  local  corruption  and  the 
hardships  caused  by  the  detrimental  forces  of  the 
market  economy. 

In  either  case,  participatory  efforts  were  highly 
contentious  but  not  to  the  extent  of  that  of  the  anti¬ 
apartheid  movement  in  South  Africa.  At  the  same 
time,  these  efforts  were  not  submissive  supplications 
asking  the  power  holder  for  benign  policies.  On  the 
contrary,  they  were  meticulously  designed  cam¬ 
paigns  to  maximize  the  available  political  resources 
and  were  aimed  at  influencing  the  formation  of 
crucial  policies  germane  to  their  lives  amid  adverse 
political  environments.  These  examples  serve  as  the 
archetypes  of  contentious  participation  serving  as 
an  alternative  to  institutionalized  channels,  if  only 
because  such  channels  are  highly  limited.  They  also 
show  that  popular  mobilization  and  institutional¬ 
ized  channels  are  not  mutually  exclusive  means  of 


political  participation.  This  becomes  more  evident 
in  Western  democracies. 

One  of  the  characteristics  of  liberal  democracy 
is  the  institutionalization  of  conflict  through  guar¬ 
anteeing  various  rights  and  access  points  to  deci¬ 
sion  making.  Citizens  in  liberal  democracies  are 
guaranteed  the  rights  to  choose  their  own  leaders 
or  run  for  office  through  elections,  form  associa¬ 
tions  that  serve  as  an  organizational  vehicle  for 
pursuing  the  common  interests  of  like-minded  fel¬ 
lows,  and  have  the  right  to  freely  express  their 
opinions  through  various  venues,  including,  but 
not  limited  to,  publications,  the  media,  and  the 
Internet.  Contentious  forms  of  participation,  such 
as  demonstration,  are  also  protected  by  law  and 
are  also  considered  legitimate  means  of  political 
participation  as  a  way  to  publicly  express  political 
views.  With  the  various  means  of  political  partici¬ 
pation  available,  explicitly  contentious  forms  of 
participation,  often  the  only  means  of  participation 
in  more  restricted  political  systems,  are  but  one 
among  the  many  options  in  liberal  democracies. 

As  a  result,  various  forms  of  contentious  partici¬ 
pation  are  often  combined  with  the  routine  work¬ 
ing  of  formal  institutions,  and  a  greater  overlap 
between  contentious  and  institutionalized  politics 
is  observed.  For  example,  in  lobbying  and  petition¬ 
ing  their  representatives  in  seeking  their  interests, 
many  interest  groups  organize  large  public  demon¬ 
strations  to  display  their  commitment  and  power 
and  pressure  decision  makers  to  take  their  demands 
seriously.  Likewise,  radical  activist  groups  may  also 
adopt  interest  group  tactics  and  engage  in  lobbying 
or  petitioning  while  mobilizing  thousands  of  people 
on  the  street.  As  these  action  patterns  become  insti¬ 
tuted,  solid  connections  between  political  parties 
and  social  movements  often  emerge.  In  the  United 
States,  the  steady  alliance  between  the  Democratic 
Party  and  various  social  movements,  including  the 
civil  rights  movement,  the  women’s  movement,  and 
the  environmental  movement,  and  between  the 
Republican  Party  and  the  evangelical  movement 
and  prolife  movements  provide  but  a  few  examples. 
In  many  similar  cases,  various  social  groups  and 
political  parties  enter  an  exchange  relationship  in 
which  social  groups  provide  the  parties  with  votes 
and  funds  while  the  parties  deliver  on  the  policies 
these  social  groups  favor. 

The  institutional  devices  and  patterns  of  politi¬ 
cal  participation  in  liberal  democracies  are  meant 
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to  absorb  and  neutralize  the  potentially  harmful 
impacts  of  large-scale  conflicts  within  the  institu¬ 
tional  setting  by  providing  various  ways  for  dis¬ 
gruntled  groups  to  express  their  grievances  and 
political  demands.  However,  the  use  of  institu¬ 
tional  channels  does  not  necessarily  mean  that  the 
processes  are  peaceful  and  without  conflict. 

Contentious  Nature  of  Participation 

Political  participation  is  necessarily  a  contentious 
process  because  there  are  competing  interests  in 
any  society  and  any  claim  made  in  public  will  bear 
on  someone  else’s  interest.  It  is  extremely  difficult 
to  have  a  policy  that  everyone  can  agree  on,  and 
legislation  often  becomes  a  focal  point  of  conten¬ 
tion.  The  landmark  U.S.  Supreme  Court  ruling  in 
Roe  v.  Wade  acknowledging  partial  constitutional 
rights  for  abortion  in  1973  and  the  subsequent 
contention  over  the  issue  of  abortion  in  the  United 
States  represent  a  prime  example.  Despite  the 
highly  institutionalized  court  processes,  the  ruling 
was  interpreted  by  many  as  a  partial  result  of  the 
strenuous  campaigns  by  women  activists  who  had 
voiced  their  demands  for  women’s  right  to  make 
decisions  concerning  their  own  bodies.  This  trig¬ 
gered  a  countermovement  against  abortion  and 
ignited  a  new  cycle  of  contention  on  the  issue.  The 
debate  over  the  issue  took  place  in  various  public 
forums  and  in  Congress  but  often  involved  disrup¬ 
tive  and  violent  clashes  on  the  streets  as  well. 
When  social  groups  of  competing  convictions 
make  participatory  efforts  at  the  same  time,  the 
process  can  turn  highly  contentious. 

Similarly,  despite  the  original  intention  to 
channel  participation  in  a  peaceful  and  orderly 
way,  elections  too  often  become  focal  points  of 
conflict,  especially  when  many  hold  suspicion 
over  the  fairness  of  the  process.  This  was  the  case 
in  Kenya  after  the  election  in  late  2007,  where 
unprecedented  violence  followed  accusations  of  a 
rigged  election.  Many  nascent  democracies  have 
suffered  from  damaging  conflicts  related  to  elec¬ 
tions,  but  contest  over  electoral  results  may  also 
emerge  in  established  democracies,  as  was  the 
case  in  the  United  States  after  the  presidential 
elections  in  2000.  The  specific  ways  in  which  con¬ 
tention  over  electoral  results  materialized  in  the 
United  States  and  in  Kenya  may  have  been  differ¬ 
ent,  but  they  both  show  how  ostensibly  peaceful 


participatory  mechanisms  may  be  highly  conten¬ 
tious  in  nature. 

Another  aspect  that  renders  the  nature  of  par¬ 
ticipation  inherently  contentious  lies  in  the  nature 
of  democratic  institutions,  which  are  inclusive  and 
exclusive  at  the  same  time.  However  inclusive  they 
are  at  one  point  in  time,  democratic  institutions 
tend  to  favor  some  actors  over  others  and  leave 
unheard  some  voices.  In  addition,  as  a  society 
moves  toward  progress  in  democratic  inclusion, 
there  will  always  emerge  new  groups  that  will 
demand  serious  attention.  The  expansion  of  suf¬ 
frage  offers  a  good  example.  The  expansion  of 
voting  rights  to  all  males  in  Western  Europe  came 
only  as  a  result  of  unrelenting  contentious  chal¬ 
lenges  to  the  order  by  those  who  were  initially 
excluded,  namely,  the  workers.  When  it  happened, 
it  left  out  women,  who  then  started  to  organize 
campaigns  to  claim  their  rights.  When  women 
were  allowed  suffrage  in  most  Western  democra¬ 
cies  by  the  early  19th  century,  many  found  that 
there  were  serious  barriers  for  participation  among 
groups  along  the  lines  of  race,  ethnicity,  and  sexual 
orientation.  Even  to  this  day,  many  of  these  groups 
engage  in  contentious  politics,  calling  for  an  end  to 
discrimination  and  for  full  participation  in  many 
parts  of  the  Western  world  and  beyond.  Participa¬ 
tion  invites  more  participation,  and  whenever  this 
happens,  it  first  takes  the  form  of  a  contentious 
challenge  to  the  status  quo. 
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Parties 


The  word  party  refers  to  one  of  the  oldest  con¬ 
cepts  used  in  political  science.  Depending  on  the 
era  chosen  to  determine  the  beginning  of  scientific 
analysis  of  political  facts  in  the  modern  sense — for 
example,  if  one  goes  back  to  Arthur  Bentley, 
James  Bryce,  Robert  Lowell,  or  Andre  Siegfried, 
that  is,  to  the  beginning  of  the  20th  century — the 
concept  of  party  can  be  older  than  that  of  political 
science.  Its  use  in  historical,  philosophical,  or 
polemical  vocabulary  appeared  in  the  17th  cen¬ 
tury  with  the  memoirs  of  Cardinal  de  Retz  in 
France,  Viscount  Bolingbroke  in  England,  and, 
above  all,  David  Hume,  who  in  the  early  18th 
century  initiated  what  was  to  become  the  analysis 
of  parties.  Nonetheless,  the  word  has  been  used 
since  the  Middle  Ages  to  refer  to  the  opposite  sides 
in  a  civil  war,  for  example,  York  and  Lancaster 
during  the  War  of  the  Roses,  and  consequently  has 
a  strong  connotation  of  conflict.  Even  the  etymol¬ 
ogy  of  the  word  party  is  telling:  parti  in  French, 
Partei  in  German,  partido  in  Spanish,  and  even 
partia  in  Russian  and  Polish  and  in  many  other 
languages — derived  from  the  verb  partir,  which  in 
medieval  French  meant  to  split  into  parts  or 
divide.  All  European  languages,  including  Slavic 
ones  that  use  other  terms,  such  as  strana  in  Czech 
or  stanka  in  Croat  or  Serbian,  use  words  meaning 
“side.”  The  idea  is  the  same:  to  take  sides  or  to 
choose  one’s  side,  or  one’s  camp,  in  a  political 
conflict.  The  definitions  of  the  concept  of  party 
are,  therefore,  older  and  more  numerous  than  for 
the  concept  of  social  class:  There  are  more  than 
100  of  them,  provided  by  authors  from  Edmund 
Burke  to  Alan  Ware,  including  Leon  Epstein, 
Joseph  LaPalombara,  and  Myron  Weiner.  All 
definitions  can  be  grouped  into  three  broad  cate¬ 
gories,  which  are  sometimes  combined.  First  of  all, 
following  Burke,  parties  can  be  defined  according 
to  the  ideas  that  they  convey.  Then,  following 


Max  Weber,  Robert  Michels,  and  Maurice 
Duverger,  one  can  define  parties  as  organizations. 
Finally,  the  trend  since  the  end  of  the  20th  century 
has  been  to  use  the  criterion  of  elections  and  the 
existence  of  a  representative,  or  at  least  demo¬ 
cratic,  regime.  A  remark  attributed  to  Max 
Weber — “parties  are  the  children  of  democracy 
and  universal  suffrage” — is  put  forward  to  sup¬ 
port  this  thesis.  One  should  not,  nevertheless, 
forget  the  classic  definition  given  during  the  reign 
of  George  III  by  Edmund  Burke:  “A  party  is  a 
body  of  men  united  for  promoting  by  their  joint 
endeavors  the  national  interest  upon  some  particu¬ 
lar  principle  in  which  they  are  all  agreed.”  This 
definition  remains  the  best,  even  if  the  subsequent 
evolution  of  political  systems  has  made  it  impre¬ 
cise,  as  it  is  now  incomplete.  Here,  we  propose  to 
use  the  term  in  the  following  way: 

A  party  is  an  organization  of  individuals  engaged 
in  collective  action,  in  order  to  mobilize  as  many 
individuals  as  possible  against  other  equally 
mobilized  individuals  in  order  to  accede,  either 
alone  or  in  coalition,  to  the  exercise  of  government 
functions.  This  engagement  and  this  claim  for 
power  are  justified  by  a  particular  conception  of 
the  national  interest. 

Below,  this  entry  first  discusses  this  definition  in 
greater  detail,  next  it  turns  to  the  historical  origins 
and  conceptualization  of  the  concept  in  the 
European  context,  and,  finally,  it  discusses  con¬ 
temporary  forms  of  party  organization  and  recent 
developments. 

Definitional  Elements 

The  above  definition  contains  four  major  elements: 

1.  Parties  are  the  product  of  a  collective  orga¬ 
nized  action  that  is  permanent  and  continuous  in 
time.  It  is  therefore  intended  to  outlive  the  action 
of  its  founders  and  continue  throughout  history  as 
long  as  it  is  able  to  mobilize  the  supporters  that 
keep  it  alive.  As  an  institution,  parties  present  char¬ 
acteristics  common  to  all  organizations  studied  by 
organizational  theory.  Parties  are  in  the  category  of 
association-type  organizations — that  is,  based  on 
voluntary  membership  and  the  choice  of  the  actors: 
members,  militants,  elected  representatives,  and 
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leaders.  If  membership  is  automatically  granted 
on  the  basis  of  birth,  family,  or  clan,  it  is  not  a 
party. 

2.  Any  organization  is  structured  according  to 
an  objective,  which,  in  the  case  of  a  party,  is  to 
accede  to  the  different  functions  of  government: 
national,  regional,  and  local.  Parties  can  exist  that 
are  limited  to  one  or  more  of  these  levels  of  gov¬ 
ernment,  as  is  often  the  case  in  Canada,  for  exam¬ 
ple.  A  political  organization  that  does  not  strive 
for  power  but  merely  for  influence  is  not  a  party. 

3.  Claiming  power  is  not  an  end  in  itself;  it  is 
justified  for  the  sake  of  the  national  interest  that 
the  party  intends  to  defend  or  promote  depending 
on  the  particular  conception  of  the  actors  involved. 
Claiming  power  in  the  name  of  a  particular  con¬ 
ception  of  the  national  interest  constitutes  the  rai¬ 
son  d’etre  of  a  party  and  a  condition  sine  qua  non 
for  a  political  organization  to  be  a  party. 

4.  The  way  to  reach  the  objective  of  the  party  to 
which  its  organization  is  rationally  conditioned  is 
the  mobilization  of  as  many  individuals  as  possi¬ 
ble.  The  most  frequently  used  means  is  electoral 
mobilization,  and  most  parties  were  born  with  the 
establishment  of  more  or  less  competitive  repre¬ 
sentative  political  systems. 

Democratization  whether  gradual  (e.g.,  United 
Kingdom  [UK])  or  brutal  (e.g.,  France)  gave  rise  to 
the  development  of  parties.  By  contrast,  in  author¬ 
itarian  systems,  nonelectoral  modes  of  organiza¬ 
tion  and  mobilization  exist.  These  can  be  peaceful 
(e.g.,  meetings,  demonstrations,  strikes,  and  peti¬ 
tions,  e.g.,  Chartism  in  Great  Britain,  Solidarity  in 
Poland,  and  the  Civic  Forum  in  former  Czecho¬ 
slovakia)  or  violent  (e.g.,  uprisings,  revolutions). 
The  common  objective  shared  by  electoral  mobili¬ 
zation  and  other  forms  of  party  mobilization  is  the 
need  to  appeal  to  the  popular  masses,  that  is,  to 
strive  for  popular  support,  according  to  LaPalom- 
bara  and  Weiner.  Partisan  mobilization  is  carried 
out  against  individuals  who  are  also  organized 
with  a  view  to  acceding  to  government  in  the  name 
of  a  different,  often  opposite,  conception  of 
national  interest.  As  we  have  seen,  party  means 
“part”  (division)  and  therefore  implies  conflict. 
Jean  Blondel  (1978)  sees  behind  every  party  “a 
protracted  social  conflict.” 


As  a  corollary,  both  conflict  and  competition 
are  essential: 

1.  It  can  be  said  that  there  are  no  parties  without 
conflicts;  they  always  convey  either  a  current,  active 
conflict  of  which  they  are  agents  or  a  past  conflict 
of  which  they  are  witnesses,  for  example,  the  oppo¬ 
sition  between  Fianna  Fail  and  Fine  Gael  in  Ireland, 
which  corresponds  historically  to  the  struggle 
between  anti-Treaty  and  anti-Free  State  Republicans 
in  the  bloody  civil  war  of  1921  to  1922.  One  of  the 
major  aspects  of  the  seminal  contribution  by 
Seymour  Martin  Lipset  and  Stein  Rokkan  (1967)  is 
to  have  assigned  to  parties  the  function  that  sum¬ 
marizes  all  the  others:  agents  of  conflict  and  instru¬ 
ments  of  integration.  The  dialectic  is  the  following: 
By  expressing  conflict,  parties  thus  allow  negotia¬ 
tion  and  contribute  at  the  end  of  a  more  or  less  long 
evolution  to  pacifying  political  life. 

2.  Conflict  and  integration  as  well  as  the  name 
party — or  part — imply  ipso  facto  plurality  and 
competition  between  parties.  The  term  parties 
means  a  system  of  parties  and  consequently  that 
there  are  at  least  two  of  them.  A  single  party  is  a 
contradiction  in  terms:  It  is  impossible  to  be  at  the 
same  time  a  single  entity  and  a  part.  This  obvious 
fact  was  stated  at  the  beginning  of  the  last  century 
by  Max  Weber.  The  contradictory  concept  of  a 
single  party  or  a  one-party  system  was  nonetheless 
used  at  the  time  of  the  Cold  War  by  eminent 
political  scientists  such  as  Gabriel  Almond  and 
Jean  Blondel.  The  latter  speaks  of  a  “party  of 
mobilization,”  while  Almond  distinguishes 
between  parties  that  are  “one-party  pluralistic,” 
modeled  on  the  Institutional  Revolutionary  Party 
(Partido  Revolucionario  Institucional,  PRI)  in 
Mexico  before  democratization,  and  those  that  are 
“one-party  revolutionary,  centralizing.”  While 
contesting  the  logical  pertinence  of  the  concept  of 
one-party  system,  Giovanni  Sartori  nonetheless 
admits  this  category  as  the  first  level  of  his  seven- 
rank  typology. 

To  clarify  this  debate,  which  has  retained  its 
historical  pertinence,  it  is  necessary  to  distinguish 
two  totally  different  situations.  On  the  one  hand, 
there  is  the  situation  Max  Weber  described  with 
the  example  of  the  Guelph  party  (parte  Guelfa)  in 
the  medieval  republic  of  Florence,  where  one  party 
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eliminated  its  rivals  to  become  incorporated  in  the 
apparatus  of  the  state — the  party  changes  and 
ceases  to  be  a  political  party.  It  then  falls  under 
some  other  sociological  concept.  Raymond  Aron, 
a  disciple  of  Weber,  applies  this  approach  to  the 
cases  of  fascism,  Nazism,  and  Stalinism  and  speaks 
of  “monopolistic  parties,”  which  as  soon  as  they 
come  to  power  hasten  to  eliminate  all  the  other 
parties  and  thus  change  their  nature.  Monopolistic 
parties  that  had  changed  their  nature  by  eliminat¬ 
ing  their  rivals  have  in  most  cases  gone  back  to 
their  original  nature  with  the  loss  of  power  and  the 
return  to  democracy.  Thus  today,  Kuo  Min  Tang 
in  Taiwan  alternated  in  government  with  the  inde¬ 
pendence  movement,  while  the  PRI  in  Mexico  and 
the  Communist  Party  in  the  Russian  Federation 
embody  the  loyal  opposition.  Similarly,  commu¬ 
nist  parties  in  Eastern  Europe  have  been  able  to 
reconvert  themselves. 

On  the  other  hand,  there  are  so-called  parties 
founded  after  a  military  or  nonmilitary  clique  has 
taken  over  power.  These  are  sham  parties,  such  as 
those  of  certain  totalitarian  regimes  formed  to  con¬ 
trol  the  population,  for  example,  the  Popular 
Movement  of  Zaire  at  the  time  of  President 
Mobutu  (1967-1997).  With  the  changes  of  regime 
and  the  return  to  democratic  forms  of  power,  these 
so-called  single  parties  disappeared  with  the 
regimes  that  created  them.  This  was  not  the  case, 
however,  with  Antonio  de  Oliveira  Salazar’s 
National  Union  in  Portugal  or  the  National 
Movement  created  by  Francisco  Franco  to  support 
his  dictatorship  in  Spain. 

History 

The  history  of  the  appearance  and  development  of 
parties  corresponds  to  that  of  the  scientific  study 
of  the  phenomenon.  Lipset  and  Rokkan  note  four 
thresholds  in  the  evolution  of  a  party:  legitimiza¬ 
tion,  incorporation,  representation,  and  majority 
power.  One  can  apply  these  to  every  party  and  to 
every  stage  of  the  party  systems. 

Parties  were  not  always  considered  to  be  legiti¬ 
mate.  As  a  sign  of  conflict  in  societies  seeking  bal¬ 
ance  and  harmony,  they  were  associated  with  a 
form  of  evil.  Political  modernity,  which  developed 
with  the  disintegration  of  the  feudal  order  in  the 
Renaissance,  was  embodied  in  absolutist  states,  and 
parties  only  emerged  in  times  of  crisis  and  during 


civil  or  religious  wars.  Even  with  the  establishment 
of  a  representative  regime,  the  party  was  first  per¬ 
ceived  as  something  that  divides  and  was  equated 
with  a  faction.  The  timing  of  the  legitimization  of 
parties,  the  very  idea  of  a  party  system,  depends  on 
the  country.  Thus,  three  men  of  action  who  were 
also  political  thinkers  reflect  the  same  perplexity 
toward  parties  in  three  countries  during  three  dif¬ 
ferent  periods:  Viscount  Bolingbroke  in  early- 18th- 
century  England,  James  Madison  at  the  time  of  the 
American  Revolution,  and  Charles  de  Gaulle  in 
France  in  the  mid-20th  century.  All  three  of  them 
in  their  own  way  were  deeply  concerned  about 
parties  and  their  struggles  and  advocated  national 
unity  against  all  the  divisions,  yet  they  finally 
became  involved  in  the  struggles  of  the  parties 
themselves.  As  a  necessary  evil,  the  party  is  always 
the  party  of  the  other,  and  the  temptation  to  find 
other  democratic  paths  was  present  for  a  long  time 
but  was  doomed  to  failure.  This  was  the  case  of  the 
“era  of  good  feelings”  desired  by  President  James 
Monroe  after  1816  to  put  an  end  to  the  opposition 
between  the  Federalists  and  the  Republican- 
Democrats.  For  the  United  States,  the  American 
historian  Richard  Hofstadter  situates  the  turning 
point  between  1780  and  1840,  after  which  the  idea 
of  legitimate  opposition  between  parties  was  taken 
for  granted.  In  Great  Britain,  the  phenomenon 
appears  earlier,  and  the  writings  of  Hume  bear  wit¬ 
ness  to  this  fact,  but  here  it  remains  within  the  elite, 
whereas  in  the  United  States  it  concerned  the 
masses.  The  distinction  between  parties  and  fac¬ 
tions  is  the  determining  criterion,  and  it  is  with 
Hume  that  this  was  clearly  established. 

The  origin  of  parties  and  their  existence  before  a 
representative  regime  depends  on  the  definition.  If 
we  retain  the  three  proposed  criteria — a  particular 
conception  of  the  national  interest,  free  organiza¬ 
tion,  and  mobilization — the  Guelphs  (13th  century) 
were  a  party,  even  if  their  means  of  action  were 
different  from  those  of  modern  parties.  Their  fight 
against  the  Ghibellines,  however,  degenerated  into 
a  struggle  between  factions.  The  Cavaliers  and  the 
Roundheads,  the  Whigs  and  the  Tories  were  also 
parties.  Moreover,  the  process  is  not  linear,  and  in 
the  oligarchy  that  existed  in  England  in  the  18th 
century,  as  Sir  Lewis  Namier  remarks,  during  the 
reign  of  George  III  a  large  number  of  members  of 
parliament  (MPs)  were  neither  Whigs  nor  Tories. 
By  contrast,  “parties”  at  the  time  of  the  Ancient 
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Roman  republic  or  those  evoked  by  Niccolo 
Machiavelli  in  Florence  at  the  time  of  the  Medicis, 
and  that  existed  in  numerous  other  Italian  cities  dur¬ 
ing  the  Renaissance,  were  factions  and  family  clans. 

When  they  were  still  badly  organized,  parties 
were  discovered  and  studied  by  social  scientists  at 
first  as  carriers  of  ideas.  Then,  with  the  extension 
of  the  electoral  franchise  and  civil  rights,  they  were 
studied  as  organizations-.  James  Bryce,  Robert 
Michels,  Moisey  Ostrogorsky,  and,  above  all,  Max 
Weber  laid  down  the  foundations  in  the  late  19th 
and  early  20th  centuries.  Finally,  the  study  of  par¬ 
ties  as  the  mobilization  of  actors  began  in  the  20th 
century  with  Siegfried  on  electoral  geography  and 
Duverger  on  circles  of  participation  in  partisan 
activity.  It  branched  out  in  many  directions — 
militants,  members,  sympathizers,  and  voters — 
and  was  favored  by  the  development  of  the  various 
forms  of  sociological  survey  research,  from  opin¬ 
ion  polls  to  the  deep  analyses  of  political  attitudes. 
These  divergent  approaches  are  necessary  and 
have  to  converge  if  one  is  to  understand  a  given 
political  party,  a  national  system  of  parties,  or,  in 
a  comparative  manner,  the  classification  of  parties. 

Raison  d’Etre  and  Identity  of  Parties 

Devoted  to  the  defense  and  the  promotion  of  a 
particular  conception  of  the  national  interest  in 
Burke’s  sense,  many  19th-  and  20th-century  histo¬ 
rians  likened  parties  to  the  great  schools  of  politi¬ 
cal  thought:  conservatism,  liberalism,  socialism, 
Christian  democracy,  communism,  fascism,  and  so 
on.  The  links  between  the  two  phenomena  are 
obvious  except  that  parties,  agents  of  conflict  but 
also  instruments  of  integration,  are  led  in  majority 
governments  to  betray  their  initial  ambition  and 
adapt  themselves  to  the  constraints  of  the  exercise 
of  power,  to  become  institutionalized,  and  to 
change  their  program  and  sometimes  their  ideol¬ 
ogy  in  order  to  become  catchall  parties,  according 
to  Otto  Kirchheimer.  One  must,  therefore,  observe 
the  social  interests  that  are  expressed  through 
these  ideas  and  justify  the  “particular  conceptions 
of  national  interest.” 

Socioeconomic  Bases 

Duverger  saw  in  parties  the  translation  of  two 
successive  class  struggles:  the  conflict  between  the 


land-owning  nobility,  expressed  by  the  conserva¬ 
tives,  and  the  capitalist  bourgeoisie,  represented  by 
the  liberals,  on  the  one  hand,  and  that  between  the 
bourgeoisie  and  the  proletariat,  organized  by  the 
socialists,  social  democrats,  and  labor  parties,  on 
the  other.  Liberals  were  confronted  with  the 
dilemma  whether  to  ally  themselves  with  their  for¬ 
mer  conservative  enemies  against  the  peril  repre¬ 
sented  by  workers’  parties  or  to  accept  alliances 
with  the  social  democrats,  which  was  the  case  with 
the  so-called  radical  parties.  Duverger  notes  that 
some  Christian  Democratic  parties  in  Catholic 
Europe  or  agrarian  ones  in  some  Nordic  countries 
remained  outside  these  class  struggles  on  which  the 
dualism  between  left  and  right  is  founded.  It  is  the 
superimposition  of  dualisms  that  generates  multi¬ 
party  systems.  Duverger  thinks  that  using  the 
Anglo-American  majority  vote — “first  past  the 
post” — facilitates  the  establishment  of  a  two-party 
system  opposing  conservative  liberals  and  social 
democrats. 

The  same  inspiration  can  be  found  in  Lipset, 
who  in  Political  Man  sees  in  parties  the  expression 
of  social  classes  of  which,  for  him,  there  are  three: 
(1)  the  upper  class,  supported  by  the  Church, 
which  is  expressed  in  conservative  parties;  (2)  the 
secular  middle  class,  expressed  in  liberal  parties; 
and  (3)  the  working  class,  expressed  in  labor, 
socialist,  and  social-democratic  parties.  The  right, 
center,  and  left  form  a  democratic  spectrum  to 
which  corresponds  an  antidemocratic,  extremist 
spectrum,  including  the  communist  extreme  left 
and  the  authoritarian  monarchist,  clerical,  and 
reactionary  extreme  right  as  the  expression  of  a 
refusal  of  change  by  a  threatened  upper  class,  of 
which  Antonio  Salazar  in  Portugal,  Miklos  Horthy 
in  Hungary,  Engelbert  Dollfuss  in  Austria,  and 
Francisco  Franco  in  Spain  were  illustrations.  The 
originality  of  Lipset  is  to  show  that  the  middle  class 
also  engendered  an  extreme  center  with  Italian  fas¬ 
cism  and  German  Nazism,  which  was  opposed 
to  the  extreme  left  and  extreme  right.  Today,  the 
concept  of  extreme  center  is  enlightening  in  the 
description  of  parties  such  as  the  Freedom  Party 
(Freiheitliche  Partei  Osterreichs,  FPO)  in  Austria 
or  the  National  Front  (Front  National,  FN)  in 
France  and  in  the  definition  of  a  clear  relationship 
between  fascism  and  the  National  Alliance 
(Alleanza  Nazionale,  AN)  in  Italy.  It  is  more  pre¬ 
cise  and  more  scientific  than  that  of  populism. 
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Lipset  attenuated  his  position  somewhat  by 
remarking  that  certain  traditional  parties,  such  as 
Catholic  parties,  combine  cultural  conservatism 
with  socioeconomic  reformism  and  that  new 
forces  such  as  the  green  parties  mix  cultural  liber¬ 
alism  and  anti-industrialist  reaction,  thus  consti¬ 
tuting  a  neo-bourgeois  ideology. 

The  approach  exposed  in  Political  Man  remains 
the  most  fruitful  but  leaves  aside  the  existence  of 
interclass  parties,  which  are  nonetheless  not  catch¬ 
all  parties.  The  most  fitting  example  was  the 
Italian  Christian  Democrats  (Democrazia  Christi¬ 
ana,  DC),  backed  by  a  workers’  union,  the  Italian 
Confederation  of  Trade  Unions  (Confederazione 
Italiana  Sindacati  Lavoratori,  CISL),  but  there 
were  also  certain  employers  grouping  people  with 
nothing  in  common  other  than  the  defense  of  the 
interests  and  values  of  the  Catholic  community. 
This  approach  included  as  many  as  nine  tendencies 
(correnti)  spreading  from  the  pro-Marxist  left  to 
the  traditionalist  extreme  right.  This  party,  which 
was  born  out  of  antifascist  resistance  and  anticom¬ 
munism  after  World  War  II,  broke  up  in  an  inter¬ 
esting  way  in  the  1990s:  The  right  was  recovered 
by  Silvio  Berlusconi;  its  center  right  refused  to  join 
the  people’s  freedom  party  People  of  Liberty  (II 
Populo  della  Liberta,  PDL),  created  by  Berlusconi 
in  2009;  while  its  center-left  and  left  merged  with 
the  former  members  of  the  Italian  Communist 
Party  (Partito  Comunista  Italiano,  PCI)  to  found 
the  Democratic  Party  in  2008.  In  fact,  the  DC  met 
the  same  fate  as  the  French  Popular  Republican 
Movement  (Mouvement  Republicain  Populaire, 
MRP)  30  years  earlier.  The  latter  was  also  formed 
during  the  resistance  and  dispersed  its  forces  to  the 
right,  the  center,  and  the  left.  The  Christian 
Democratic  parties  of  the  Benelux  countries  and 
Switzerland  correspond  to  the  same  model  as  the 
Italian  one.  The  same  can  be  said  about  the 
German  Zentrum  party  from  1871  to  1933  but  not 
for  the  German  Christian  Democratic  Union 
(Christlich  Demokratische  Union  Deutschlands, 
CDU),  which  is  no  more  clerical  than  Nicolas 
Sarkozy’s  Union  for  a  Popular  Movement  (Union 
pour  un  movement  populaire,  UMP)  in  France.  By 
contrast,  in  Bavaria,  the  Christian  Social  Union 
(Christlich-Soziale  Union  in  Bayern,  CSU)  belonged 
to  the  tradition  of  classic,  clerical  conservatism, 
but  clericalism  also  became  contested  within  the 
party  itself. 


Historical  Cleavages 

Finally — if  only  to  understand  multiparty  sys¬ 
tems  with  more  than  six  parties — it  is  necessary  to 
use  a  multidimensional  space  as  in  the  systematic 
model  of  the  origin  of  parties  in  Europe  set  up  by 
Lipset  and  Rokkan.  For  them,  cleavages  are  neither 
ephemeral  oppositions  nor  contingent  divisions  but 
structural  effects  resulting  from  the  political  transla¬ 
tion  of  profound  traumatic  changes  that  affected  the 
history  of  a  country  or  a  group  of  countries.  These 
conflicting  effects  are  exerted  along  two  axes:  the 
functional  axis  and  the  territorial-cultural  axis.  In 
the  particular  case  of  Europe,  originally  marked  by 
Catholic  Christianity,  two  revolutions — national  rev¬ 
olutions  and  the  Industrial  Revolution — engendered 
four  cleavages.  The  former  broke  the  unity  between 
countries  born  of  the  Reformation  and  those 
marked  by  the  Catholic  Counter-Reformation.  It 
engendered  two  cleavages:  (1)  along  the  unitary 
center/periphery  axis,  opposing  modernization  to  the 
resistance  of  the  subjected  cultures  from  the  prov¬ 
inces  and  peripheral  areas,  and  (2)  along  the  func¬ 
tional  axis  of  Church  versus  State,  opposing  the 
modernizing,  secular  elite  to  the  defense  of  the  inter¬ 
ests  of  the  Church  in  the  fields  of  education  and 
values.  The  Industrial  Revolution  generated  the  fol¬ 
lowing  cleavages:  (3)  along  the  territorial  axis,  pri¬ 
mary  versus  secondary  economy  opposing  landed 
interests  to  the  rising  class  of  industrial  entrepreneurs, 
and  (4)  along  the  functional  axis  of  employers  versus 
owners,  opposing  the  interests  of  property,  capital, 
and  business  to  the  labor  union  movement,  which 
defended  the  interests  of  wage  earners. 

These  four  cleavages  are  conveyed  in  the  short 
term  through  the  issues  that  oppose  the  parties  and 
in  the  long  term  through  the  party  systems.  In 
Catholic  and  Protestant  Europe  on  both  sides  of 
each  cleavage,  there  arose,  according  to  the  coun¬ 
try,  families  of  parties  that  became  established  with 
the  extension  of  suffrage  and  democratization. 
Since  then,  these  cleavage-based  party  systems  had 
become  “frozen”  for  a  considerable  time.  Depending 
on  the  period,  one  may  add  that  a  cleavage  can 
dominate  electoral  parliamentary  debate:  Church 
versus  State  in  Catholic  countries  in  the  19th  cen¬ 
tury  and  the  primary  versus  secondary  sector  in 
Sweden  at  the  same  time,  and  the  center  versus 
periphery  cleavage  in  the  Basque  country  or  in 
Ireland.  However,  the  most  important  cleavage  in 
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Western  Europe  since  the  crisis  of  1929,  and  even 
before,  was  owners  versus  workers,  which,  apart 
from  the  two  cases  cited  previously,  forms  the  axis 
of  the  most  frequent  parliamentary  constellations. 

At  the  beginning  of  the  21st  century,  a  majority 
of  parties  are  based  on  the  functional-economic 
owners  versus  workers  cleavage.  In  other  words, 
on  the  one  hand,  there  are  parties  for  the  defense 
of  owners  that  have  formal  and/or  informal  links 
with  employers,  companies,  and  the  business  com¬ 
munity  in  general  but  with  a  much  broader  elec¬ 
toral  base  that  includes  the  middle  classes.  This 
family  unites  former,  previously  opposed  parties 
such  as  conservatives  and  liberals  in  Protestant 
countries,  Switzerland,  and  the  Benelux  countries. 
It  also  includes  parties  of  other  origins  from  for¬ 
mer  Christian  Democratic  groups  such  as  CDU/ 
CSU  in  Germany  or  former  nationalists  such  as 
UMP  (this  party  includes  former  Gaullists,  conser¬ 
vatives,  and  liberals),  as  well  as  new  parties  such  as 
the  Social  Democratic  Party  (Partido  Social 
Democrata,  PSD)  in  Portugal,  the  post-Franco 
People’s  Party  (Partido  Popular)  in  Spain,  and 
Forward  Italy  (Forza  Italia),  which  merged  with 
the  postfascist  AN  to  form  the  people’s  freedom 
party,  PDF,  a  unified  right-wing  party. 

On  the  other  hand,  since  the  1 9th  century,  there 
has  been  a  systematic  development  of  parties  for  the 
defense  of  workers ,  which,  historically,  constitute 
the  labor  movement  and  maintain  special  links  with 
labor  unions.  Their  voters  are  salaried  employees, 
mainly  working  class  but  also  some  white-collar 
workers  and  civil  servants.  They  were  born  in  the 
wake  of  the  Industrial  Revolution  from  the  conver¬ 
gence  of  four  forces:  two  ideologies,  (1)  the 
Jacobinism  of  the  French  Revolution  and  radical 
philosophy  and  (2)  social  Christianity  in  Protestant 
countries,  and  two  forms  of  political  organization, 
(3)  the  labor  unions  and  cooperative  movements 
and  (4)  the  internationalism  of  Karl  Marx,  Friedrich 
Engels,  and  their  disciples.  The  combination  of  these 
four  ingredients,  in  variable  proportions  depending 
on  the  country,  created  three  genetic  models  in  the 
sense  of  Angelo  Panebianco,  which  developed  into 
four  traditions  that  are  very  visible  today: 

1.  The  Fabour  tradition,  born  out  of  the  failure 
of  Chartism,  was  translated  into  parties  of  ideo¬ 
logical  and  religious  pluralism,  dominated  organ¬ 
ically  by  the  labor  unions  in  which,  ideologically, 


social  Christianity  is  slightly  more  important  than 
radicalism,  whereas  the  Socialist  International  and 
Marxism  are  in  minority  or  even  marginal. 

2.  The  social-democratic  tradition,  in  which 
trade  unionism  emanated  from  the  party,  was  born 
in  Germany  and  dominated  by  the  Socialist 
International  and  Marxism.  These  parties  have 
kept  a  controlled  and  solid  organization  that  has 
particularly  subsisted  in  the  Swedish  Social 
Democratic  Party  (Sveriges  socialdemokratiska 
arbetareparti,  SAP)  and  in  the  Austrian  SPO. 

3.  The  social-democratic  tradition  of  the  French 
Revolution  of  1848  was  marked  by  Jacobin  radi¬ 
calism  and  the  republican  and  anticlerical  struggle. 
These  parties  from  the  start  came  up  against  the 
distrust  of  the  anarcho-syndicalist  movement, 
which  was  hostile  to  any  collaboration  with  par¬ 
ties.  The  anarcho-syndicalists  combined  theoreti¬ 
cal  anarchism  founded  on  the  rejection  of  the  state 
and  electoral  politics,  advocating  a  mutual  benefits 
system,  self-management,  and  federalism  with 
practical  bread-and-butter  reformism  within  com¬ 
panies.  Moreover,  the  union  movement  was 
divided  by  a  new  Christian  labor  movement  that 
was  equally  wary  of  party  politics  and  a  commu¬ 
nist  labor  movement  that  would  ultimately  sup¬ 
plant  anarcho-syndicalism.  The  socialist-democratic 
tradition  became  embodied  in  weaker,  intellectual 
parties  and  was  neither  controlling  the  labor 
movement  nor  being  controlled  by  it.  These  social¬ 
ist  parties — French,  Italian,  and  Spanish — would 
practice  ideological  extremism  and  give  more  than 
their  due  to  Marxism  while  practicing  shortsighted 
reformism. 

4.  The  communist  tradition  is,  in  fact,  a  variant 
of  the  social  democracy  that  was  implanted  in 
France  and  Italy,  where  it  failed.  It  is  a  kind  of  radi¬ 
cal  social  democracy  that  has  accentuated  its  specific 
features:  orthodox  Marxism;  a  centralized  organiza¬ 
tion  adapted  to  the  political  struggle  in  authoritar¬ 
ian  regimes  (“democratic  centralism”);  control  over 
the  unions,  which  had  become  the  “driving  belts”; 
and,  above  all,  the  primacy  of  the  Socialist 
International.  With  the  Komintern,  communist  par¬ 
ties  were  the  only  party  with  an  international 
dimension,  devoted  for  many  years  to  the  interests 
of  the  USSR  and  presented  under  Joseph  Stalin  as 
the  fatherland  of  socialism.  The  social-democratic 
sociology  of  communist  parties  favored  their 
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incorporation  into  party  systems,  their  unofficial 
social  democratization  including  membership  of 
the  Socialist  International. 

The  PCI,  the  Italian  Communist  Party,  was  the 
first  to  take  this  path,  soon  to  be  joined  with  the 
dissolution  of  the  former  USSR  by  the  Hungarians, 
Bulgarians,  and  Lithuanians.  The  development  of  a 
Marxism  sui  generis  adapted  to  Italy  and  the  West 
thanks  to  Antonio  Gramsci  and  to  the  strategic 
intelligence  and  political  savoir  faire  of  its  leaders — 
Palmiro  Togliatti  and  Enrico  Berlinguer — anxious 
to  promote  “a  national  path  toward  socialism” 
helped  this  transformation  of  the  PCI  first  into  the 
Democratic  Party  of  the  Left  (Partito  Democratico 
della  Sinistra,  PDS)  and  then  into  the  Democrats  of 
the  Left  (Democratici  di  Sinistra,  DS)  and  the 
Democratic  Party  (Partito  Democratico,  or  PD) 
with  the  gradual  help  of  the  Christian  Democrats. 
The  French  Communist  Party  (Parti  Communiste 
Franqais,  PCF)  remained  Stalinist  for  a  long  time 
but  was  headed  by  leaders  of  no  great  stature — 
Maurice  Thorez  and  Georges  Marchais — and  did 
not  take  the  opportunity  to  return  to  social  democ¬ 
racy.  In  the  Scandinavian  countries,  where  social 
democracy  is  strong,  the  extreme  left  is  also  of 
communist  origin,  with  the  Danish  Socialist 
People’s  Party  (Socialistisk  Folkeparti,  SF),  which 
broke  off  links  with  Moscow  in  1956,  and  in 
Sweden  the  Communist  Party  became  the  Left 
Party.  The  situation  has  been  the  same  in  Finland 
since  the  end  of  the  Soviet  Union.  By  contrast,  the 
left-wing  socialists  in  Norway  are  a  dissident 
movement  of  the  Labor  Party.  The  case  of  the  Left 
(Die  Linke)  in  Germany  is  more  ambiguous:  The 
former  Communist  Party  of  East  Germany  with  its 
strong  organization  merged  with  social-democratic 
dissidents  from  the  Social  Democratic  Party  of 
Germany  (Sozialdemokratische  Partei  Deutsch- 
lands,  SPD).  The  most  original  of  all  parties  situ¬ 
ated  “on  the  left  of  the  left”  is  the  Dutch  Socialist 
Party  (Socialistische  Partij,  SP),  an  anticapitalist 
protest  party  composed  of  former  Maoists,  which 
can  win  up  to  10%  of  the  votes. 

Center-Periphery  Cleavages 

Outside  the  dominant  socioeconomic  cleavage, 
the  other  important  cleavage  is  center-periphery, 
composed  of  parties  for  territorial  defense,  which 


is  divided  into  two  opposed  families.  On  the  one 
hand,  there  are  the  parties  of  centralized  state 
nationalism  corresponding  historically  to  a  unifying, 
imperialistic  state  nationalism  that  is  economically 
protectionist  and  that,  socially,  carries  policies  that 
are  favorable  to  a  protective  state.  One  is  reminded 
of  Bismarckism  in  Germany,  which  during  the 
Weimar  Republic  became  the  German  National 
People’s  Party  (Deutschnationale  Volkspartei, 
DNVP),  and  of  Bonapartism  in  France,  then  the 
republican  current  from  which  Gaullism  origi¬ 
nated.  These  parties,  concerned  about  the  author¬ 
ity  of  the  State,  are  inclined  to  deviate  toward 
authoritarianism  and  have  engendered  some 
extreme  versions:  the  total  state  that  identifies 
nation,  state,  and  leader  in  fascist  totalitarianism 
in  Italy  and  Nazism  in  Germany  and  has  been 
emulated  by  many  others.  These  extreme-center 
parties  are  wrongly  assimilated  to  the  extreme 
right  and  characteristically  attract  not  only  voters 
from  the  working  class  but  also  leaders  from  left- 
wing  parties,  such  as  Benito  Mussolini,  a  former 
socialist  in  charge  of  the  newspaper  Avanti;  Jacques 
Doriot,  a  former  communist  deputy  and  the 
founder  of  the  French  Popular  Party  (Parti 
Populaire  Franqais,  PPF);  as  well  as  the  less  con¬ 
ventional  Oswald  Mosley,  a  former  Labor  MP  and 
the  founder  of  the  British  Union  of  Fascists.  World 
War  II,  with  its  cortege  of  horror  and  crimes 
against  humanity,  has  discredited  fascism,  which  is 
no  longer  portrayed  as  such,  except  by  marginal 
groups  that  are  not  considered  as  parties. 

Nevertheless,  the  ideological  ground  has 
remained  fertile  and  able  to  produce  analogous 
parties  that  out  of  caution  tone  down  their  dis¬ 
course.  Globalization,  immigration,  and  the  crisis 
since  the  end  of  the  1970s  have  favored  the  rebirth 
of  postfascism  in  places  where  a  previous  tradition 
existed:  the  National  Front  of  Jean-Marie  Le  Pen 
in  France,  the  FPO  (heir  of  the  Pan-Germanic 
nationalist  current)  in  Austria,  and  in  some 
Lander  in  Germany,  the  National  Democratic 
Party  (Nationaldemokratische  Partei  Deutsch- 
lands,  NPD).  We  may  add  in  Flanders  Vlaams 
Belang,  the  new  name  of  Vlaams  Blok,  heir  of  the 
pro-Nazi  Flemish  National  Union  (Vlaamsch 
Nationaal  Verbond,  VNV),  which  existed  between 
the  two  World  Wars.  In  other  countries  where 
there  was  no  fascist  tradition  before,  new  move¬ 
ments  have  emerged  with  a  similar  sociology  and  a 


1798  Parties 


less  articulate  discourse  than  fascism.  Their  creed 
resides  in  a  radical  xenophobia  enhanced  since  the 
beginning  of  the  21st  century  by  anti-Islamism. 
The  longest  lasting  case  is  the  Danish  People’s 
Party  (Dansk  Folkeparti,  DF),  but  the  most  spec¬ 
tacular  is  to  be  found  in  the  Netherlands.  The  fact 
that  the  founder  and  leader  of  the  DF  is  a 
woman — Pia  Kjaersgaard — and  her  Dutch  coun¬ 
terpart  in  the  Pirn  Fortuyn  List  (Lijst  Pirn  Fortuyn, 
LPF)  was  a  millionaire  and  a  militant  homosexual 
(he  was  assassinated  by  an  animal  rights  activist) 
constitutes  a  break  with  fascism.  However,  it  is  a 
mistake  to  consider  them  as  new  parties:  Their 
organization  is  new,  yet  the  issues  that  fuel 
them — xenophobia  and  racism — have  political 
roots  that  go  back  to  the  19th  century.  Whether 
moderate  or  extremist,  democratic  or  authoritar¬ 
ian,  state-nationalist  parties  are — or  were — 
authentic  catchall  parties. 

The  historical  opponents  of  centralism  are  par¬ 
ties  for  the  defense  of  the  periphery,  sometimes 
regionalist  and  federalist,  sometimes  nationalist 
and  separatist.  They  are  the  expression  of  ethnic  or 
linguistic  minorities  and  have  a  territory  that  is 
quite  easily  definable.  Their  existence  is  not  recent 
and  most  often  corresponds  to  countries  with  an 
imperial  structure:  Austria-Hungary  before  1919 
had  many  such  parties.  Today,  the  oldest  party  to 
defend  the  periphery  is  the  Basque  Nationalist 
Party  (Partido  Nacionalista  Vasco,  PNV),  founded 
in  1895,  which  identified  with  the  cause  of  the 
language,  culture,  and  democratic  traditions  of  the 
Basque  country,  where  it  is  the  main  party.  The 
Swedish  Popular  Party  in  Finland  is  also  an  old 
organization — early  20th  century — that  holds  the 
monopoly  of  representation  of  the  Swedish¬ 
speaking  minority  in  Finland,  and  for  this  reason  it 
has  participated  in  almost  all  government  coali¬ 
tions.  The  Scottish  National  Party,  founded  in 
1925,  only  managed  to  break  through  in  the 
1960s,  when  it  became  alternately  the  second  or 
third  party  in  Scotland.  Wales  also  has  its  national¬ 
ist  organization,  which  is  less  strong  and  has  fewer 
seats  in  Westminster:  Plaid  Cymru.  Two  parties 
that  were  created  after  World  War  II  enjoy  a 
majority  in  their  region,  though  they  are  insignifi¬ 
cant  on  a  national  scale:  the  popular  party  of 
South  Tyrol  (South  Tyrolean  People’s  Party)  in 
Alto  Adige  and  the  Val  d’ Aosta  Union  in  Italy.  In 
Flanders,  the  party  defending  the  periphery, 


People’s  Union  (Volksunie),  split  in  two,  creating 
the  more  centrist  New  Flemish  Alliance  (Nieuw- 
Vlaamse  Alliantie,  NVA)  and  SPIRIT,  which  is 
more  social  libertarian.  Spain  counts  the  largest 
number  of  parties  of  this  type.  In  Euskadi  as  well 
as  PNV,  we  must  add  the  nationalist,  left-wing 
Basque  Solidarity  (Eusko  Alkartasuna,  EA).  In 
Catalonia,  both  the  autonomist-centrist  demo¬ 
cratic  convergence  of  Catalonia  and  the  left-wing 
separatists  of  the  Republican  Left  of  Catalonia 
(Esquerra  Republicana  de  Catalunya,  ERC)  can  be 
included.  Numerous  Spanish  regions  have  their 
autonomists.  The  periphery  also  has  extremists, 
sometimes  violent  ones  but  that,  unlike  the  terror¬ 
ist  movements  of  the  1970s,  such  as  the  Red 
Brigades,  are  endowed  with  a  legal  electoral  voice 
that  shows  popular  and  even  widespread  support. 
This  is  the  case  in  Northern  Ireland,  where  the 
Irish  Republican  Army  (IRA)  is  linked  to  Sinn 
Fein,  and  in  the  Basque  country,  where  Basque 
Homeland  and  Freedom  (Euskadi  Ta  Askatasuna, 
ETA)  was  related  with  Batasuna  until  the  party 
was  prohibited  by  the  Spanish  courts.  In  Corsica, 
the  situation  is  the  same,  but  the  nationalists  are 
only  represented  at  the  regional  and  local  levels. 
There  is  one  special  case:  that  of  Northern  League 
(Lega  Nord)  in  Italy,  which  has  gone  from  the 
defense  of  federalist  positions  to  the  instrumental- 
izing  of  xenophobia  and  from  enthusiastic  Eur- 
opeanism  to  staunch  Euroscepticism. 

Church  and  State  Cleavage 

By  contrast  to  the  center-periphery  opposition, 
another  cleavage  resulting  from  national  revolu¬ 
tion  as  defined  by  Rokkan  is  the  cleavage  between 
Church  and  State,  which  used  to  be  of  prime 
importance  but  now  belongs  to  history.  The 
Christian  Democratic  parties  whose  role  was 
essential  for  European  integration  are  experiencing 
an  existential  crisis:  The  major  one,  in  Italy,  broke 
up  and  now  only  exists  as  the  Center  Democratic 
Union  with  a  marginal  role.  Their  area  of  strength 
is  limited  to  the  Netherlands  and  the  Benelux  coun¬ 
tries,  where  the  Christian  Democrats  have  lost  a 
great  deal  of  electoral  weight,  whereas  they  have 
collapsed  dramatically  in  Switzerland.  They  were 
and  still  remain  the  best  examples  of  interclass, 
horizontal  parties,  that  is,  covering  all  the  ground 
from  the  right  to  the  left — from  fundamentalism  to 
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progress  through  centrism.  In  fact,  they  reflected 
the  sociology  as  much  as  the  ideological  structure 
of  the  Catholic  subculture.  They  are  not  catchall 
parties  because,  even  when  they  are  nonconfes¬ 
sional,  they  embody  the  political  will  of  believers 
and  citizens  steeped  in  Catholic  culture.  Christian 
Democratic  programs  as  a  result  of  a  dialogue 
between  their  bourgeois,  agricultural,  and  work¬ 
ing-class  tendencies  constitute  a  useful  compro¬ 
mise  for  setting  up  government  coalitions  with 
either  the  liberals  or  the  social  democrats.  The 
latter  explains  their  unparalleled  longevity  in  gov¬ 
ernment:  Democrazia  Christiana  participated  in  all 
the  coalitions  of  the  first  Italian  republic,  but  the 
record  is  held  by  the  Luxemburg  Christian 
Democrats,  with  more  than  a  century  in  power, 
followed  closely  by  the  Belgians  and  the  Dutch, 
with  almost  100  years.  They  are  the  axis  both  of 
the  center-right,  so-called  conservative  coalitions 
in  Belgium  and  the  Roman  Blue  in  the  Netherlands 
and  of  the  so-called  labor  center-left  coalitions  in 
Belgium  and  the  Roman  Red  in  the  Netherlands. 
Even  the  most  conservative  party  in  Austria,  the 
Austrian  People’s  Party  (Osterreichische  Volks- 
partei,  OVP),  has  most  often  participated — since 
1945 — in  so-called  Red-Black  governments  along 
with  the  Social  Democrats  in  spite  of  its  question¬ 
able  alliance  with  the  xenophobic  nationalists  of 
the  FPO.  It  is  interesting  to  note  that  in  the  Czech 
Republic  the  only  noncommunist  party  to  have 
lived  through  the  Soviet  era  is  the  Czech  People’s 
Party  (Ceskoslovenska  strana  lidova,  CSL),  which 
participates  in  all  coalition  governments  as  it  used 
to  before  1938,  its  foundation  dating  back  to  the 
Austrian-Hungarian  empire. 

New  Cleavages 

In  addition  to  the  large  families  of  parties,  there 
is  one  small  one — the  greens.  One  thesis  by 
Thomas  Poguntke  claims  that  the  greens  are  the 
embodiment  of  New  Politics  based  on  postmateri¬ 
alist  issues  such  as  quality  of  life,  protection  of 
nature,  and  libertarian  individualism — in  opposi¬ 
tion  to  the  supporters  of  Old  Politics,  based  on 
materialistic  issues  and  values  such  as  wage 
increases  and  bread-and-butter  issues  in  general. 
This  idea  was  influenced  by  the  work  of  Ronald 
Inglehart,  who  developed  his  theory  of  postmateri¬ 
alism  based  on  the  proposition  that  the  generation 


marked  by  the  Great  Depression  followed  by  war 
and  reconstruction  was  succeeded  by  a  generation 
socialized  in  a  context  of  prosperity,  the  “affluent 
society”  of  the  “golden  sixties.”  The  new  post- 
industrial  Revolution  era  is  said  to  have  given  rise 
to  a  new  cleavage,  that  is,  materialists  versus  post¬ 
materialists,  with  the  Greens  occupying  the  post¬ 
materialist  side  and  the  extreme  right,  the  side  of 
the  materialists.  The  ecologists  represent  a  new 
force,  but  the  parties  qualified  as  far  right  are  as 
old  as  parliamentary  democracy  itself.  In  the 
1980s,  Lipset  insisted  on  the  importance  of  post- 
materialism  and  saw,  on  the  one  hand,  the  emer¬ 
gence  of  a  libertarian-style,  neo-bourgeois  ideology 
that  divided  left-wing  parties  upheld  by  intellec¬ 
tual  circles  from  the  more  authoritarian  tradition¬ 
alist,  materialist  workers  who  valued  morals.The 
postmaterialism  adopted  by  social-democratic 
parties  partly  explains  the  success  of  extreme-cen¬ 
ter  parties  among  workers.  On  the  other  hand, 
Lipset  noted  that  the  ecologists’  claims  such  as 
cultural  liberalism  are  a  luxury  enjoyed  by  well-off 
citizens.  In  the  same  way,  Lipset  remarked  that  the 
Greens  combine  avant-garde  themes  opposed  to 
traditional  values  with  anti-industrialist  themes 
for  the  defense  of  nature,  which  take  part  in  a 
revolt  against  modernity  that  he  qualifies  as  a 
“backlash.”  Stefano  Bartolini  and  Peter  Mair 
noted  that  the  Greens  participated  in  what  the  two 
authors  called  intrablock  mobility  within  the  left, 
that  is,  the  workers’  side  of  the  owners-versus- 
workers  cleavage.  Later  events  seem  to  have 
proved  them  right,  with  the  experiences  of  the 
Greens  in  government  first  in  Finland,  then  in 
Belgium,  and  finally  in  Germany  and  France  in 
coalitions  with  social  democrats.  In  France,  they 
owe  the  few  seats  they  won  in  parliament  to  elec¬ 
toral  alliances  with  the  socialists.  However,  in 
Belgium  and  Germany,  local  coalitions  with  the 
right  have  occurred  since  the  beginning  of  this 
century.  An  intermediary  hypothesis  could  be  put 
forward,  namely,  that  the  Greens  stem  from  a 
restructuring  of  the  territorial-economic  cleav¬ 
age — primary  sector  versus  secondary  sector — 
setting  the  industrialized  world  in  opposition  to 
nature,  which  explains  the  mixture  of  postmodern 
and  traditionalist  features  in  the  discourse. 
Concerning  their  closeness  to  the  left,  it  can  be 
considered  as  the  result  of  their  hostility  to  capital¬ 
ism,  which  destroys  the  balance  of  nature. 
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Parties  as  Organizations 

Parties  are  not  biological  organisms,  nor  do  they 
have  a  lifetime  association  with  a  cleavage:  They 
are  autonomous  forces.  The  German  conservatives 
in  the  CDU  are  an  excellent  example  of  a  change 
of  cleavage  and  of  realignment.  It  was  created  as  a 
Christian  Democratic  Party  and  heir  of  the  pre- 
1933  Catholic  Zentrum.  Its  founding  programs 
defined  a  third  way  between  Marxist  collectivism 
and  liberal  capitalism  by  a  Christian  socialism 
founded  on  personalism  and  a  respect  for  prop¬ 
erty,  the  quest  for  the  common  good,  and  the  prin¬ 
ciple  of  subsidiarity.  The  CDU  and  its  Bavarian 
sister  party,  the  CSU,  had  accepted  anti-Nazi 
Protestant  intellectuals  from  the  “Confessing 
Church”  (“Bekennende  Kirche,”  a  Protestant  anti¬ 
fascist  movement).  Under  pressure  from  the  Allies, 
who  were  concerned  about  the  Soviet  threat,  and 
fearing  the  Marxism  displayed  by  the  SPD  of  1945 
to  1946,  the  CDU  was  forced  to  open  its  doors  to 
Protestant  conservatives  who  were  not  always  for¬ 
mer  members  of  the  Resistance.  In  the  person  of 
Konrad  Adenauer,  the  former  mayor  of  Cologne, 
a  moderate,  the  CDU  provided  itself  with  a  leader 
who  turned  out  to  be  a  true  visionary  both  on  the 
question  of  European  integration  and  on  the  future 
of  Germany.  Certain  that  Germany  would  sooner 
or  later  be  reunified  and  that  in  this  context 
Catholics  who  were  roughly  equal  in  number  to 
the  Protestants  in  the  West  would  once  again 
become  a  minority,  he  reoriented  the  CDU  along  a 
more  conservative  line  to  take  the  place  of  the  old 
Zentrum  party  and  the  German  Right,  which  had 
been  discredited  for  its  support  of  Nazism.  The 
party  then  allied  itself  with  and  later  assimilated 
the  conservatives  of  Lower  Saxony  in  the  German 
Party  (Deutsche  Partei,  DP);  it  then  did  the  same 
with  the  party  representing  refugees  from  former 
German  territories  in  the  east,  the  League  of 
Expellees  and  Deprived  of  Rights  (Block  der 
Heimatvertriebenen  und  Entrechteten,  BHE).  This 
was  possible  thanks  to  a  moderate  liberal  program 
inspired  by  the  new  economic  mentor  of  the  CDU, 
Ludwig  Erhard.  This  program,  the  social  market 
economy,  came  up  against  the  opposition  of  the 
social  Christian  Democrats.  Adenauer  won  his  bet, 
and  the  CDU/CSU  became  a  party  with  a  majority 
vocation,  inspired  by  the  catchall  party  defined  by 
Otto  Kirchheimer.  As  early  as  1953,  as  Peter  Merkl 


noted,  the  CDU/CSU  became  a  conservative  party 
dedicated  to  defending  the  interests  of  industry, 
business,  and  agriculture,  with  its  popular  voters 
(blue-  and  white-collar  workers)  taking  advantage 
of  the  spin-off  from  prosperity.  As  for  Christian 
socialism,  it  became  a  mere  memory,  and  to  many 
of  the  party  leaders,  it  was  not  a  pleasant  one. 
However,  the  CDU  was  the  first  example  of  politi¬ 
cal  realignment  where  the  logic  of  organization 
took  precedence  over  ideological  considerations 
and  program.  The  CDU/CSU  was  able  to  increase 
considerably  the  number  of  its  new  voters,  while  to 
a  great  extent  keeping  its  traditional  electorate, 
and  was  not  affected  by  the  growing  secularization 
of  society.  It  increased  its  score  from  31%  in  the 
1949  elections  to  50%  in  1957.  For  the  change  to 
be  successful,  the  cleavage  had  to  be  followed  by 
lasting  electoral  alignment  as  defined  by  V.  O. 
Key.  This  requires  good  partisan  organization. 
When  within  a  party  the  requirements  of  organiza¬ 
tion — and  therefore  accession  to  power — are  in 
contradiction  to  the  preservation  of  its  principles 
and  identity,  it  either  undergoes  a  refoundation  or 
changes  its  identity  and  undergoes  realignment 
toward  an  electorally  more  beneficial  cleavage. 

The  organization  of  parties,  therefore,  conveys 
a  different  logic  than  ideas  and  cleavages  and 
needs  to  be  studied  separately.  The  concept  most 
commonly  used  to  classify  partisan  organization  is 
the  opposition  between  mass  parties  and  cadre 
parties.  It  is  often  attributed  to  Duverger,  but  he 
borrowed  it  from  Weber  and  developed  it. 
Duverger’s  great  contribution  was  to  distinguish 
between  parties  of  inside  creation  and  parties  of 
outside  creation,  depending  on  whether  the  found¬ 
ers  were  in  parliament — a  typical  example  is  that 
of  Whigs  and  Tories — or  were  outsiders  who  had 
no  access  to  power,  not  even  to  parliament.  The 
groups  that  can  exist  before  the  organization  of  the 
party  can  be  labor  unions,  associations,  Masonic 
lodges,  or  leagues — including  terrorist  ones.  The 
cadre  parties  are  therefore  parliamentary  parties 
resulting  from  the  widening  of  the  electorate, 
aimed  at  inciting  new  voters  to  enroll  on  the  elec¬ 
toral  register  and  support  the  party  and  the  elec¬ 
toral  committees  of  the  candidates.  Mass  parties 
are  created  outside  the  spheres  of  power,  and  their 
only  means  of  access  is  to  have  the  largest  possible 
number  of  voluntary  activists  and  regular  financial 
contributors. 
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More  than  the  number  of  members,  it  is  the 
criterion  of  funding  that  distinguishes  mass  par¬ 
ties.  The  regularity  and  registration  of  contribu¬ 
tions  is  very  important  in  mass  parties  whose 
internal  legitimacy  is  embodied  by  its  members 
rather  than  its  electorate.  It  is  necessary  to  add  a 
characteristic  that  Duverger  does  not  mention:  the 
stability  of  leadership  indifferent  to  the  vagaries  of 
the  economic  situation.  Thus,  during  the  whole  of 
the  20th  century,  the  Swedish  social  democrats 
had  only  five  leaders,  whereas  other  parties  had 
many  more.  The  Industrial  Revolution;  the  expan¬ 
sion  of  means  of  transport,  communication,  and 
propaganda  leading  to  the  development  of  inex¬ 
pensive  newspapers  and  enabling  national  elec¬ 
toral  campaigns;  and  the  running  of  centralized 
national  bodies  favored  the  action  and  develop¬ 
ment  of  mass  parties  until  the  1960s.  Some  cadre 
parties  adapted  to  their  competition,  often  parties 
organized  later,  such  as  the  British  and  Scandinavian 
conservatives,  who  were  organized  by  penetration 
from  the  center  to  the  periphery,  becoming,  accord¬ 
ing  to  Angelo  Panebianco,  electoral  parties — with 
professionals  endowed  with  strong  leadership 
qualities,  recruiting  large  numbers  of  members, 
and  above  all  oriented  toward  the  opinion  of  their 
potential  voters.  The  difference  with  mass  parties 
is  threefold.  First,  and  regardless  of  the  number  of 
members — under  Harold  Macmillan,  the  British 
Conservative  party  had  almost  1  million  mem¬ 
bers — the  funding  of  the  party  depends  on  gifts 
from  business  or  from  rich  contributors  and  not  on 
the  members,  who  sometimes  do  not  pay  their  fees 
but  are  not  excluded.  Second,  legitimacy  and 
power  belong  to  parliamentarians  and  are  vested 
in  them  by  the  voters,  whose  opinions  are  more 
important  than  those  of  members.  Finally,  the  sur¬ 
vival  of  the  leaders  depends  on  their  success  in  the 
general  elections. 

Some  parties  have  kept  a  more  archaic  organi¬ 
zation:  a  federation  of  electoral  committees  com¬ 
posed  of  local  personalities,  headed  by  a  much 
more  undisciplined  parliamentary  party  and  with  a 
weak  leadership.  These  less  developed  cadre  par¬ 
ties  are  to  be  found  in  countries  such  as  France, 
Spain,  Portugal,  and,  to  a  lesser  degree,  Italy.  Jean 
Chariot  suggests  calling  them  partis  de  notables. 
Originally,  they  were  the  earliest  parties  to  be 
organized,  which  corresponds  to  a  model  of  orga¬ 
nization  by  diffusion,  that  is,  they  were  created  on 


the  initiative  of  constituency  committees  and  by 
becoming  closer  and  closer  ultimately  became  fed¬ 
erated  from  the  bottom  up.  During  the  100  years 
from  1860  to  1960,  technical  development  favored 
mass  parties  that  in  certain  cases — Catholic 
Zentrum  and  social  democrats  in  Germany;  Cath¬ 
olic  and  socialist  parties  in  Austria,  Belgium,  and 
the  Netherlands;  French  and  Italian  Communists — 
managed,  in  the  words  of  Siegmund  Neumann,  to 
“take  charge  of  voters  from  the  cradle  to  the 
grave.”  These  “rigid  mass  parties,”  to  use  Duver- 
ger’s  terminology,  had  a  very  large  membership 
involved  in  a  network  of  parallel  organizations  for 
women,  children,  young  people,  cooperatives, 
travel  agencies,  sport  clubs,  choirs,  not  to  mention 
a  party  press.  According  to  Neumann,  these  are 
“social  integration  parties,”  and  in  Germany  dur¬ 
ing  the  Weimar  republic,  they  were  even  qualified 
as  Sozialgbettoparteien  (social  ghetto  parties). 

Not  all  mass  parties  reached  such  organiza¬ 
tional  perfection,  and  several  were  content  to  live 
off  the  voluntary  work  of  their  militants  and  are 
what  Duverger  calls  “flexible  mass  parties.”  But, 
as  Blondel  remarks,  not  attaining  a  large  member¬ 
ship  is  always  a  failure  for  a  party.  In  fact,  whereas 
parties  of  social  integration  have  a  strong  organi¬ 
zational  culture  that  generates  party  patriotism  or 
the  attachment  of  members,  and  even  voters  and 
are  more  concerned  with  the  organization  itself 
than  the  idea  that  it  embodies,  flexible  mass  par¬ 
ties  are  in  a  perpetual  debate  over  ideas,  a  source 
of  divisions,  and  this  is  unproductive  from  an 
electoral  point  of  view.  Since  the  1960s  and  the 
development  of  political  communication  centered 
on  television  and  now  the  Internet,  mass  parties, 
and  especially  the  most  powerful  among  them, 
have  experienced  a  crisis  of  adaptation  brought 
about  by  a  decreasing  membership  that  has  hit  all 
the  large  social  and  political  organizations  charac¬ 
terized  by  the  omnipotence  of  image  and  enter¬ 
tainment,  ensuring  a  domination  of  form  and 
appearance  over  substance,  style  over  ideas,  and 
the  personality  of  the  leader  over  the  political 
party.  Duverger  thought — rightly  at  the  time — 
that  cadre  parties  were  doomed,  but  thanks  to  a 
ruse  of  history,  it  is  the  mass  parties  that  are  the 
dinosaurs,  and  hie  et  nunc,  it  is  the  professional 
electoral  parties,  according  to  Panebianco,  that 
have  the  wind  in  their  sails  in  Europe:  They  are 
more  reactive  to  variations  in  public  opinion  and 
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more  personalized,  and  while  easily  disposing  of 
leaders  who  fail,  they  are  on  the  same  wavelength 
as  the  media.  As  they  lose  more  and  more  mem¬ 
bers,  the  old  parties  of  integration  tend  to  be 
reduced,  to  the  advantage  of  their  functionaries,  to 
a  large,  permanent  staff  paid  by  the  party.  The  lat¬ 
ter  can  be  qualified  by  the  concept  of  “bureau¬ 
cratic  mass  party,”  proposed  by  Panebianco. 

Moreover,  the  increasingly  high  costs  of  elec¬ 
tion  campaigns,  along  with  the  development  of 
the  media  and  particularly  television,  have  pro¬ 
voked  a  change  in  the  organization  of  parties. 
First,  whatever  the  social  organization,  individu¬ 
als  are  less  engaged  in  long-term  action  but  are 
more  willing  to  commit  themselves  in  a  limited 
way  to  a  precise  objective:  Parties  lose  members, 
but  so  do  labor  unions,  and  there  are  fewer  prac¬ 
ticing  members  of  the  Church.  This  poses  a  vital 
problem  to  mass  parties  confronted  with  a  decrease 
in  the  number  of  their  contributors  on  the  one 
hand  and  with  the  increasing  cost  of  electoral 
campaigns  on  the  other.  They  resort  increasingly 
to  funding  by  private  enterprises,  as  other  parties 
did,  which  changes  their  nature.  As  the  mass  par¬ 
ties  were  often  unpopular  with  the  business  com¬ 
munity,  they  were  driven  to  use  practices  that  in 
some  countries  are  considered  as  corrupt  and 
condemned  by  law.  Consequently,  states  not  only 
developed  their  legislation  on  parties  but  also 
replaced  or  limited  private  funding  with  public 
funds.  Last,  as  television  does  not  lend  itself  to  a 
deepening  of  political  discourse  or  to  nuances, 
parties  are  forced  to  put  on  performances,  which 
they  have  done  since  the  1 9th  century  but  now  do 
for  television.  That  is,  they  have  to  embody  them¬ 
selves  in  a  leader,  who  has  to  build  an  image  of  a 
prime  minister  or  of  a  president,  as,  for  example, 
in  France.  Countries  such  as  Belgium,  the  Nether¬ 
lands,  or  Italy,  which  were  governed  by  coalitions 
of  parties  and  appointed  their  prime  minister  by  a 
process  of  negotiation  and  arbitration  between 
parties  and  the  currents  within  a  party,  have  had 
to  resign  themselves  to  personalization.  The  lead¬ 
ing  German  parties  were  the  first  to  go  down  this 
path  by  designating  their  candidates  for  the  chan¬ 
cellery.  In  the  UK,  around  1965,  the  Conservative 
Party  decided  to  have  its  leader  elected  by  the 
members  of  parliament,  whereas  previously  the 
appointment  was  the  result  of  a  secret  process  in 
which  the  outgoing  leader  played  a  substantial 


role.  From  the  1970s  and  1980s,  more  and  more 
parties,  which  until  then  had  had  their  leaders 
elected  by  delegates  at  their  congress  or  party  con¬ 
ference,  changed  to  direct  election  by  paid-up 
party  members.  In  1995,  the  French  Socialist 
Party  did  the  same  to  choose  its  candidate  for  the 
presidential  election,  and  so  did  the  British 
Conservatives.  In  Italy  and  France,  the  left-wing 
coalition  and  the  socialists  tried  to  import  the 
American  system  of  primaries.  The  model  setup  in 
Italy  appears  to  be  the  closest  to  the  original,  with 
the  center,  left,  and  extreme  left  taking  part — but 
the  ballot  was  organized  privately  within  the 
offices  of  the  parties.  For  the  French  Socialist 
Party,  it  was  an  internal  election  enlarged  to 
accommodate  members  admitted  for  the  circum¬ 
stance  in  return  for  a  reduced  financial  contribu¬ 
tion.  In  both  cases,  the  analogy  with  the  American 
primaries  resides  in  the  fact  that  there  was  an 
internal  election  and  even  debates  between  the 
candidates.  The  aim  was  to  attract  the  attention  of 
the  media  and  thus  favor  the  campaigns  of  the 
parties  concerned.  This  way  of  functioning  does 
not  correspond  to  the  ideal  of  mass  parties. 

The  distinction  between  professional  electoral 
parties  on  the  one  hand  and  bureaucratic  mass 
parties  on  the  other  has  become  blurred  in  numer¬ 
ous  countries.  They  have  become  publicly  financed 
institutions  oriented  toward  the  media  and, 
according  to  Peter  Mair,  could  even  manage  with¬ 
out  members.  Mair  suggests  that  the  reasons  for 
maintaining  the  role  of  members  in  certain  parties 
are  of  a  symbolic  nature.  He  has  proposed  a  new 
mode  of  partisan  organization,  the  cartel  party, 
which  since  the  1970s  has  succeeded  the  hege¬ 
mony  of  catchall  parties,  which  had  taken  over 
from  mass  parties  after  replacing  the  elitist  parties. 
The  fact  is  that  what  Mair  calls  “parties  in  opin¬ 
ion”  have  taken  precedence  over  the  “central 
apparatus  of  the  party”  in  an  arena  of  dialogue 
with  the  “party  in  public  office” — leader,  govern¬ 
ment,  parliamentary  group — all  by  political  pro¬ 
fessionals.  Are  there  any  alternatives  to  cartel 
parties?  There  is  one  in  Italy — Forza  Italia,  devoted 
to  the  promotion  and  the  defense  of  the  interests 
of  its  founder,  Silvio  Berlusconi.  Such  a  phenom¬ 
enon  was  only  made  possible  thanks  to  two  “acci¬ 
dents,”  the  deregulation  of  television,  which 
opened  the  doors  to  the  establishment  by  Berlus¬ 
coni  of  a  television  empire,  and  the  collapse  of  the 
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first  Italian  Republic,  which  freed  some  space  on 
the  right  of  the  political  spectrum,  that  is,  on  the 
side  of  the  defense  of  liberal  capitalism. 

Both  the  right  and  the  democrats  in  Italy  as  well 
as  a  large  number  of  parties  in  Europe  are  greatly 
influenced  by  the  organizational  models  of  the 
United  States.  However,  the  latter  remain  different 
and  are  based  on  various  autonomous  strata, 
either  for  the  Democrats  or  for  the  Republicans. 
Since  the  seminal  contribution  of  Key,  scholars 
have  analyzed  American  parties  as  tripartite  struc¬ 
tures:  (1)  the  party  in  the  electorate,  (2)  the  party 
organization,  and  (3)  the  party  in  government.  The 
party  in  the  electorate  refers  to  the  loyalty  and 
identification  of  the  voters,  and  the  party  in  gov¬ 
ernment  refers  to  public  office  holders  from  the 
president  to  local  councilors;  the  party  organiza¬ 
tion  is  structured  in  a  manner  defined  by  Sam 
Eldersveld  (1982)  as  a  stratarchy,  which  “is  an 
organization  with  layers,  or  strata  of  control, 
rather  than  centralized  leadership  from  the  top 
down”  (p.  106).  American  party  organizations  are 
far  older  than  the  European  ones,  and  in  spite  of 
the  decline  of  party  identification,  they  still  remain 
adapted  to  the  various  and  numerous  changes  and 
evolutions  affecting  the  practice  of  democratic 
government.  This  is  obviously  not  the  case  in 
Europe,  where  political  scientists  may  speak  of  a 
“crisis  of  parties.” 
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Parties  need  money  if  they  are  to  be  able  to  carry 
out  their  main  political  functions:  to  run  election 
campaigns,  to  maintain  organizations  both  within 
the  legislature  and  outside  it,  and  to  carry  out 
research  into  future  policy.  Party  finance  is  a 
major  component  of  political  finance.  The  latter 
includes  in  addition  the  funding  of  lobby  groups 
and  of  other  aspects  of  political  life.  Party  finance 
includes  any  costs  of  election  campaigns  that  are 
the  responsibility  of  party  organizations.  Where 
candidates  run  for  public  office  independently  of 
parties,  election  campaigns  too  are  an  aspect  of 
political  finance  not  covered  by  the  term  party 
finance.  This  entry  discusses  problems  of  party 
finance  in  terms  of  normative  democratic  criteria, 
current  research,  and  attempts  of  regulation. 

Problems  of  Party  Finance 

Party  finance  raises  problems  for  three  main  rea¬ 
sons.  First,  parties  with  access  to  disproportion¬ 
ately  large  amounts  of  money  for  their  election 
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campaigns  potentially  have  a  large  advantage  over 
poorer  rivals.  Insofar  as  high  expenditure  enables 
parties  to  gain  votes,  money  may  create  unfair 
electoral  advantages  for  parties  with  rich  support¬ 
ers.  In  a  considerable  number  of  countries  in  the 
Third  World  (including  Thailand,  the  Philippines, 
and  countries  in  Latin  America),  vote  buying  is  a 
common  feature  of  elections.  In  Nigeria,  electoral 
officials  reportedly  have  been  bribed  to  falsify  vot¬ 
ing  results.  In  countries  such  as  the  United  States, 
money  is  characteristically  spent  on  buying  adver¬ 
tising  time  on  television.  Studies  on  electoral 
results  in  congressional  elections  in  the  United 
States  and  in  parliamentary  elections  in  the  United 
Kingdom  (UK)  have  suggested  a  clear  link  between 
the  amount  spent  and  the  number  of  votes 
obtained. 

A  second  problem  is  that  parties’  need  for 
money  may  make  them  reliant  on  donors  who 
demand  illegitimate  benefits.  Typically,  these  con¬ 
sist  of  public  contracts,  licenses,  or  legislation 
favorable  to  the  givers.  The  demand  for  donations 
is  a  major  cause  of  political  corruption,  both  at  the 
national  and  at  the  local  level.  Major  scandals  con¬ 
cerning  party  funding  occur  so  regularly  that  any 
attempt  to  list  them  will  soon  be  outdated.  Scandals 
not  only  involve  justified  accusations  against  poli¬ 
ticians  and  their  aides  but  also  unfounded  but 
nevertheless  powerful  allegations. 

Third,  those  desiring  to  become  candidates  for 
public  office  may  need  to  spend  considerable  sums 
in  obtaining  party  nominations.  Primary  elections 
and  other  internal  party  contests  can  be  costly  for 
those  involved.  The  more  expensive  it  is  to  gain  a 
party  nomination  the  lower  will  be  the  chances  for 
persons  without  private  resources  to  embark  on 
political  careers. 

It  is  not  only  the  demand  for  resources  that 
affects  political  parties;  the  control  of  money  has  a 
significant  impact  on  the  structure  of  power  within 
political  parties.  Robert  Michels  stressed  in  his 
classic  study  of  the  German  Social  Democrats  in 
the  early  20th  century  that  party  leaders  tend  to 
gain  control  of  the  purse  strings  of  the  party  and 
that  this  has  the  effect  of  centralizing  power.  There 
is  no  single  pattern  of  internal  control  over  fund¬ 
ing.  In  some  countries,  such  as  the  United  States, 
central  party  organizations  do  not  have  an  over¬ 
whelming  control  where  it  comes  to  raising  and 
spending  money. 


Issues  in  the  Academic 
Study  of  Party  Finance 

The  study  of  party  funding  and  of  political  finance 
generally  has  become  an  increasingly  significant  and 
popular  field  of  political  science.  Members  of  the 
International  Political  Science  Association’s 
Research  Committee  on  Political  Finance  and 
Political  Corruption  (RC  20)  have  carried  out  some 
of  the  most  significant  work  in  this  field.  Herbert 
Alexander  devoted  his  professional  life  to  a  long 
series  of  studies  of  the  funding  of  presidential  and 
other  election  campaigns  in  the  United  States. 
Among  the  most  significant  works  in  the  compara¬ 
tive  study  of  party  finance  are  those  of  Arnold 
Heidenheimer,  Khayyam  Paltiel,  Karl-Heinz  Nass- 
macher,  and  Kevin  Casas-Zamora.  Heidenheimer’s 
pioneering  study  of  1963  attempted  to  compare  the 
overall  level  of  spending  in  a  number  of  different 
countries.  He  then  tried  to  explain  why  party  poli¬ 
tics  costs  so  much  more  in  some  countries  (e.g.,  the 
Philippines  or  Israel)  than  in  others  (e.g.,  the  UK).  In 
2009,  Nassmacher  found  that  the  pattern  of  highl¬ 
and  low-spending  countries  was  largely  unchanged. 
Like  Casas-Zamora  and  Pinto-Duschinsky,  he  also 
cast  doubts  on  the  popular  notion  that  the  costs  of 
politics  have  greatly  risen  as  the  result  of  spending 
by  parties  on  modern  campaigning  techniques,  in 
particular  on  television  advertising. 

However,  the  attempt  to  make  reliable  com¬ 
parisons  between  the  levels  of  spending  in  different 
countries  has  raised  serious  methodological  and 
practical  problems: 

1.  Though  the  financial  accounts  of  parties  are 
now  published  in  many  more  countries  than  in  the 
past,  they  often  are  inaccurate  (works  of  fiction,  as 
the  French  scholar  Yves-Marie  Doublet  has  called 
them).  Moreover,  accounts  are  drawn  up  in  differ¬ 
ent  ways,  thereby  making  comparisons  technically 
difficult. 

2.  The  term  finance  needs  to  be  considered  care¬ 
fully,  for  donations  may  come  in  the  form  of  gifts 
in  kind.  Rather  than  supply  money  to  enable  a 
party  to  purchase  vehicles  or  other  forms  of  trans¬ 
port  for  electioneering  by  party  leaders  and  candi¬ 
dates,  donors  may  supply  vehicles  or  the  use  of 
helicopters.  Clearly,  their  value  needs  to  be  taken 
into  account  when  reckoning  the  income  and 
expenditures  of  a  party. 
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In  a  large  number  of  countries  in  the  former 
Soviet  Union,  Africa,  and  elsewhere,  in-kind 
donations  to  political  parties  come  from  the  gov¬ 
ernment.  This  is  known  in  Eastern  Europe  as 
“administrative  resources.”  Typically,  the  govern¬ 
ment  in  office  employs  officials  supposedly  to 
carry  out  public  duties  but  actually  to  provide 
political  services  to  the  party.  Office  facilities,  tele¬ 
phones,  and  vehicles  in  government  ministries,  in 
the  offices  of  legislators,  or  mayors’  offices  are 
common  forms  of  administrative  resources. 

The  partisan  use  of  public  resources  by  political 
officeholders  creates  problems  of  research  and 
measurement  for  political  scientists.  It  also  is  a 
source  of  unfairness  between  governing  parties 
and  opposition  parties.  This  imbalance  is  espe¬ 
cially  important  in  poor  countries  dependent  on 
foreign  assistance.  Here  the  private  sector  of  the 
economy  frequently  is  very  weak,  and  control  of 
governmental  resources  is  crucial. 

3.  Academic  studies  also  confront  the  problem 
of  definition  of  the  term  political  party.  In  practice, 
there  are  “offshore  islands”  of  parties:  These  are 
organizations  that  are  legally  independent  of  par¬ 
ties  but  are  in  practice  linked  to  a  party.  An  impor¬ 
tant  example  of  organizations  with  such  a  status  is 
the  “political  foundations”  attached  to  each  of  the 
main  parties  in  Germany.  The  Konrad  Adenauer 
Foundation  is  “close  to”  the  Christian  Democrats, 
the  Friedrich  Ebert  Foundation  is  “close  to”  the 
Social  Democrats,  and  so  on.  Where  newspapers 
are  strongly  connected  to  a  political  party,  there 
arises  the  question  of  whether  or  not  their  incomes 
and  expenditures  are  to  be  included  in  academic 
studies  of  party  finance. 

4.  It  is  hard  to  measure  the  effects  of  expendi¬ 
ture  on  election  results.  In  the  case  of  spending  by 
or  on  behalf  of  candidates  for  the  legislature  in 
countries  that  both  have  first-past-the-post  elec¬ 
toral  systems  and  require  the  disclosure  of  expen¬ 
ditures,  it  is  possible  to  make  statistical  analyses 
of  the  relationship  between  spending  and  votes. 
The  difficulties  are  that  the  official  spending  sta¬ 
tistics  may  be  incomplete  or  inaccurate  and  also 
that  factors  other  than  money  need  to  be  consid¬ 
ered.  For  example,  a  local  party  organization 
with  more  members  or  more  active  members  may 
raise  relatively  large  sums  of  money.  However, 
electoral  success  may  stem  from  the  work  of  the 


local  activists  rather  than  from  the  amount  of 
money  spent. 

As  a  result  of  these  problems,  the  main  progress 
by  political  scientists  has  been  in  studies  of  party 
funding  in  single  countries  (e.g.,  Marcin  Walecki’s 
study  of  Poland)  and  in  the  study  of  political 
finance  laws  (e.g.,  by  the  Congressional  Research 
Services  of  the  U.S.  Fibrary  of  Congress  and  by  the 
GRECO  [Group  of  States  Against  Corruption] 
unit  of  the  Council  of  Europe). 

Regulation  and  Subsidy 

In  recent  decades,  there  has  been  a  strong  interna¬ 
tional  tendency  to  increase  legal  regulation  of  both 
party  funding  and  public  subsidies.  Arguably, 
there  has  been  too  much  law  and  too  little  enforce¬ 
ment.  This  is  partly  because  laws  themselves  are 
loosely  worded  and  leave  loopholes  that  can  be 
exploited  by  donors  and  by  political  parties  wish¬ 
ing  to  evade  them.  It  is  partly  because  electoral 
management  bodies  and  law  enforcement  bodies 
have  neither  the  technical  capacity  nor  the  will  to 
challenge  powerful  political  figures  in  major  polit¬ 
ical  parties.  Where  a  body  separate  from  the  elec¬ 
toral  management  body,  such  as  an  anticorruption 
commission,  is  made  responsible  for  the  enforce¬ 
ment  of  laws  relating  to  party  funding,  there  may 
be  a  greater  chance  of  active  enforcement. 

The  main  types  of  regulation  include  (a)  require¬ 
ments  on  parties  and/or  donors  to  declare  financial 
accounts  and  donations  to  an  electoral  manage¬ 
ment  authority;  (b)  requirements  about  the  publica¬ 
tion  of  such  information  (including,  increasingly, 
publication  on  the  World  Wide  Web);  (c)  require¬ 
ments  for  disclosure  of  in-kind  donations  and  of 
loans  to  parties;  (d)  requirements  for  elected  politi¬ 
cians  to  declare  their  personal  assets;  (e)  bans  on 
particular  sources  of  donations:  for  example,  for¬ 
eign  donations  and  donations  by  companies,  trade 
unions,  or  public  contractors;  (f)  limits  on  permit¬ 
ted  spending  by  parties  or  by  candidates  on  election 
campaigns;  (g)  limits  on  the  amounts  that  a  donor 
is  permitted  to  give — regulations  concerning  forms 
of  disclosure  are  the  most  common  around  the 
world;  and  (h)  bans  on  or  regulation  of  party 
spending  on  advertising  on  television  or  radio. 

The  main  forms  of  subsidy  are  (a)  provision  of 
free  or  subsidized  time  on  television  and  radio 
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(and,  more  occasionally,  free  or  subsidized  news¬ 
paper  advertising)  for  parties  and  candidates  to 
convey  their  campaign  messages  to  the  electors, 
(b)  direct  financial  subsidies  to  parties  for  elec¬ 
tion  campaign  expenses  and/or  for  routine  orga¬ 
nizational  expenses,  (c)  financial  and  in-kind 
allowances  for  party  groups  in  the  legislature  or 
for  individual  legislators,  and  (d)  in-kind  elec¬ 
toral  facilities  for  parties  from  public  funds — for 
example,  use  of  public  halls  for  electoral  meet¬ 
ings  and/or  postage  for  electoral  communications 
without  cost. 

The  most  common  form  of  subsidy  is  the  provi¬ 
sion  of  free  broadcasting  opportunities.  More 
than  half  the  countries  have  some  form  of  direct 
public  funding.  Political  parties  debate  the  effects 
of  such  subsidies.  One  fear  is  that  they  create  an 
incentive  for  rival  political  parties  to  unite  in 
defense  of  their  common  financial  privileges.  In 
the  words  of  Richard  Katz  and  Peter  Mair,  this 
creates  “cartel  parties.”  Another  criticism  is  that 
public  funding  fails  to  stem  the  demand  of  politi¬ 
cal  parties  for  money  and  thus  fails  to  provide  a 
cure  for  corrupt  funding.  Public  funding  is  popu¬ 
lar  among  professional  politicians — its  main 
beneficiaries — but  is  generally  unpopular  with 
ordinary  electors.  Supporters  of  public  funding 
argue  that  it  provides  greater  fairness.  They  argue 
also  that,  combined  with  improved  regulation, 
public  funding  does  help  decrease  corruption  even 
if  it  does  not  eliminate  it. 

International  Standards 

Several  bodies,  including  the  Council  of  Europe, 
have  set  out  standards  for  the  regulation  and  sub¬ 
sidy  of  political  parties.  Such  standard-setting 
exercises  present  problems,  though  they  may  have 
their  uses.  There  are  essential  differences  of  politi¬ 
cal  philosophy  between  those  who  give  priority  to 
the  need  for  fairness  between  parties  and  those 
whose  main  aim  is  to  allow  freedom  of  expression. 
Such  freedom  may  mean  allowing  some  parties  to 
outspend  others,  thus  creating  a  tension  between 
the  aims  of  freedom  and  fairness. 

Even  when  the  aim  of  transparency  in  party 
funding  is  considered,  there  are  arguments  in  some 
circumstances  for  permitting  donor  privacy,  espe¬ 
cially  in  regimes  where  anyone  known  to  give  finan¬ 
cial  support  to  a  party  opposing  the  government 


may  be  subjected  to  harassment  and  even  to 
violence. 
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Brunei  University 
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Parliamentary  and  presidential  elections  are  regu¬ 
larly  held  every  3  to  5  years,  and  the  behavior  of 
the  electorate  is  characterized  by  a  certain  inertia, 
sometimes  more  and  sometimes  less.  This  inertia 
at  the  aggregate  level  is  the  consequence  of  deci¬ 
sions  of  individual  voters  whose  party  attach¬ 
ments  predispose  them  to  elect  candidates  of  the 
same  party  in  a  series  of  elections.  Various  terms 
are  used  as  synonyms  for  this  general  idea  of  party 
attachment,  such  as  partisanship,  party  closeness, 
party  loyalty,  and  party  identification.  This  type 
of  enduring  attachment  has  to  be  distinguished 
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from  short-term  party  evaluations  or  from  current 
party  preference  as  the  immediate,  direct  cause  of 
voting  behavior.  Whereas  party  attachment  is  usu¬ 
ally  measured  as  attachment  to  one  particular 
party,  current  party  preference  is  sometimes  oper¬ 
ationalized  as  a  preference  order  for  a  set  of  par¬ 
ties,  from  one’s  most  to  least  preferred  party. 

As  used  in  voting  behavior  research,  party  identi¬ 
fication  differs  from  the  other  terms  because  the 
general  idea  of  party  attachment  is  embedded  in  a 
special  theoretical  approach.  Its  definition  is  more 
specific,  and  its  causes  and  consequences  are  theo¬ 
retically  grounded.  Party  identification  as  a  theoreti¬ 
cal  concept  was  developed  by  the  Michigan  social 
psychological  school  of  political  behavior.  This 
concept  is  introduced  in  the  first  section,  followed  by 
a  section  on  an  alternative,  revisionist  concept.  In 
the  third  section,  a  new  conceptualization  in  the 
classical  spirit  and  new  challenges  are  discussed.  The 
entry  concludes  with  a  fourth  section  on  the  applica¬ 
bility  of  the  concept  outside  the  United  States. 

The  Michigan  Social  Psychological  Concept 

Party  identification  was  originally  defined  in  The 
American  Voter  by  Angus  Campbell,  Philip  Con¬ 
verse,  Warren  Miller,  and  Donald  Stokes  (1960)  as 
“the  individual’s  affective  orientation  to  an  impor¬ 
tant  group  object  in  his  environment”  (p.  121).  In 
U.S.  politics,  with  its  stable  two-party  system,  the 
Republican  and  Democratic  parties  function  as 
these  “important  group  objects,”  offering  political 
orientation  to  their  supporters.  Campbell  et  al.  did 
not  assume  that  this  is  self-evident  but  conceded 
that  other  groups  are  able  to  function  as  political 
reference  groups  as  well,  such  as  unions  and  reli¬ 
gious  or  ethnic  groups.  Whether  parties  do  indeed 
fulfill  this  orientation  function  for  a  large  part  of 
the  national  electorate  is  one  of  the  questions  that 
have  to  be  answered  when  the  original  American 
concept  is  applied  in  other  countries.  The  basic 
assumption  of  the  Michigan  approach  is  first  of  all 
that  voters  depend  on  such  groups  for  political 
orientation,  this  being  originally  derived  from  ref¬ 
erence  group  theory  in  mainstream  social  psychol¬ 
ogy  of  the  1940s  and  1950s. 

The  classical  Michigan  concept  of  party  identi¬ 
fication  became  one  of  the  cornerstones  of  voting 
research,  first  in  the  United  States  but  to  so  some 
extent  also  in  Europe  and  Japan.  Its  standard 


measure  is  asked  in  all  American  National  Election 
Studies:  “Generally  speaking,  do  you  think  of 
yourself  as  a  Republican,  a  Democrat,  an  Indepen¬ 
dent,  or  what?”  The  independents  are  then  asked, 
whether  they  think  of  themselves  as  closer  to  the 
Republican  or  Democratic  Party,  leading  to  the 
support  type  of  “leaners.”  The  self-identified 
Republicans  and  Democrats  are  further  catego¬ 
rized  as  strong  or  weak,  depending  on  their  answer 
as  to  whether  they  think  of  themselves  as  strong  or 
not  very  strong  Republicans  or  Democrats.  The 
end  result  is  a  categorization  of  the  American  elec¬ 
torate  into  seven  groups,  running  from  strong  to 
weak  Republicans,  to  Independents  leaning  toward 
the  Republican  Party,  to  Independents,  to  the 
equivalent  groups  on  the  Democratic  side.  Much 
of  the  literature  focuses  on  the  reliability  of  this 
measurement,  which  combines  direction  and 
strength  of  attachment  in  one  scale.  Not  denying 
the  importance  of  the  respective  methodological 
controversy,  the  basic  question  concerns  the  con¬ 
cept  validity  of  the  directional  component:  Do  the 
various  measures  of  identification  with  a  party 
measure  what  they  are  supposed  to  measure? 

There  are  some  easy  conditions  that  these  mea¬ 
sures  have  to  fulfill  in  cross-sectional  surveys.  Thus 
one  expects  that  the  political  attitudes  of  party 
identifiers  are  more  in  line  with  their  party  than 
those  of  leaners  and  that  party  identification  has  a 
consistent  impact  on  voting  behavior.  But  party 
identification  is  only  a  predisposition  to  elect  a 
Republican  or  a  Democratic  candidate,  so  that  elec¬ 
tions  differ  in  terms  of  this  impact.  In  the  American 
context,  with  changing  presidential  candidate  con¬ 
stellations,  vote  deviations  of  identifiers  from  their 
party  occur  more  frequently  than  in  European  par¬ 
liamentary  systems,  where  under  proportional  rep¬ 
resentation  the  electorate  often  has  only  a  choice 
between  closed  party  lists.  These  deviations  in  the 
United  States  are  seen  not  as  a  weakness  but  as  a 
strength  of  the  concept  if  voters  return  to  the  party 
they  identify  with  in  later  elections. 

The  crucial  test  of  concept  validity  is  the  exog¬ 
enous  status  of  party  identification  in  relation  to 
short-term  electoral  forces  such  as  issue  and  candi¬ 
date  orientations.  People  become  passively  attached 
to  a  party  during  their  primary  political  socializa¬ 
tion  in  their  teens,  as  a  result  of  the  overwhelming 
influence  of  their  parents,  and  the  ties  to  their 
party  become  strengthened  by  a  continuous  voting 
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record.  Thus,  younger  voters  tend  to  deviate  more 
than  older  ones.  But  in  general,  the  inertia  of  the 
electorate  in  the  medium  term  should  be  quite 
large.  The  theory  allows  only  two  exceptions.  The 
first  is  sudden  changes  in  critical  elections  when 
many  voters  reorient  themselves  politically,  react¬ 
ing  to  important  events  such  as  the  world  eco¬ 
nomic  crisis  in  the  early  1930s,  and  the  other 
concerns  gradual  readjustments  in  parts  of  the 
electorate  as  in  the  U.S.  South  since  the  late  1960s. 
Here  the  successful  civil  rights  movement  led  to  the 
enfranchisement  of  the  African  American  popula¬ 
tion,  which  voted  overwhelmingly  for  the  Demo¬ 
cratic  Party;  as  a  result,  many  traditional  Southern 
White  Democrats  crossed  the  party  divide  to  main¬ 
tain  a  racial  divide. 

The  Revisionist  Concept 

In  the  U.S.  context,  the  validity  of  the  classical 
concept  was  well  corroborated  by  empirical  evi¬ 
dence  collected  by  the  Michigan  school.  Compared 
with  other  political  attitudes,  party  identification 
proved  to  be  more  stable  over  time  even  at  the 
individual  level,  as  shown  by  long-wave  panels 
spanning  4  years.  But  there  remained  doubts 
whether  party  identification  was  really  the  sup¬ 
posed  unmoved  mover,  being  stable  and  exoge¬ 
nous  itself  but  influencing  the  evaluation  of  new 
political  events,  issues,  and  candidates.  The  find¬ 
ing  that  vote  deviations  from  party  identification 
weaken  this  predisposition  and  prevent  it  from 
returning  to  its  former  strength  led  to  a  slight  revi¬ 
sion  of  the  concept.  As  soon  as  deviations  are 
understood  as  more  than  mere  episodes,  the  ques¬ 
tion  arises  as  to  what  caused  deviations  in  the  first 
place.  One  possible  answer  was  that  there  were 
retrospective  evaluations  of  parties  in  govern¬ 
ment,  rewarding  them  for  good  performance  and 
punishing  them  for  bad  performance.  Voters  will 
remember  parties’  performance  not  only  at  the 
moment  of  voting  but  more  permanently.  This 
consideration  led  Morris  Fiorina  (1981)  to  the 
reinterpretation  of  party  identification  “as  a  run¬ 
ning  tally  of  retrospective  evaluations”  (p.  89). 
Party  identification  at  time  t  influences  identifica¬ 
tion  at  time  t  +  I;  this  is  the  aspect  of  continuity, 
but  identification  at  t  +  1  is  also  influenced  by 
retrospective  evaluations  that  are  themselves  not  a 
simple  consequence  of  the  biased  worldview  of  the 


identifiers.  As  a  process  of  Bayesian  updating  of 
beliefs  about  parties,  the  affective  component  of 
the  concept  is  downgraded  in  favor  of  cognitive 
content  and  information  that  is  not  selectively 
perceived  to  confirm  one’s  own  attitudes.  To  what 
degree  this  is  possible  is  a  problem  that  is  contro¬ 
versially  discussed  in  the  literature. 

The  implications  of  the  revisionist  concept  for 
the  aggregate  level  of  the  electorate  are  short-term 
fluctuations  beyond  sample  errors  that  are  not 
compatible  with  the  classic  version.  As  already 
mentioned,  the  latter  does  not  postulate  complete 
stability  either  but  allows  rare  realigning  elections 
or  gradual  movements  due  to  generational  turn¬ 
over  or  reorientations  in  parts  of  the  electorate 
such  as  the  U.S.  South.  But  fluctuations  over  time 
that  correlate  with  performance  indicators  of  the 
president  or  with  the  situation  of  the  economy  are 
definitely  excluded.  Not  so  by  revisionist  theory. 
Time-series  analyses  of  the  proportions  of  Repub¬ 
lican  and  Democratic  identifiers  were  introduced 
as  studies  of  macropartisanship.  Building  on  the 
revisionist  concept,  the  task  was  to  show,  on  the 
one  hand,  that  its  ups  and  downs  were  reduced 
compared  with  the  performance  indicators  due  to 
the  stickiness  of  party  identification  and,  on  the 
other,  that  they  correlated  with  the  collective 
memory  of  the  performance  history  of  the  parties’ 
presidents  and  of  the  respective  economic  situa¬ 
tion.  Applying  an  error  correction  model,  Robert 
Erikson,  Michael  MacKuen,  and  James  Stimson 
(2002)  showed  for  the  United  States  not  only  that 
current  changes  in  macropartisanship  are  the  con¬ 
sequences  of  current  changes  of  performance  indi¬ 
cators  but  that  macropartisanship  returns  to  its 
medium-term  equilibrium  and  that  these  equilibria 
are  themselves  changing  over  time.  They  interpret 
this  latter  change  as  “the  enduring  impact  of 
political  and  economic  performance”  (p.  132). 
This  result  did  not  remain  unchallenged,  one  argu¬ 
ment  being  that  medium-term  changes  are  the 
consequences  of  classical  realignment  processes, 
such  as  those  during  the  Reagan  administration 
that  favored  the  Republican  Party. 

New  Theoretical  Foundations 
and  New  Challenges 

Reference  group  theory  being  outdated  in  social 
psychology,  the  classic  concept  deserves  a  new 
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theoretical  foundation  that  keeps  its  content  as  an 
affective  party  attachment.  Such  a  new  foundation 
is  possible  with  social  identity  theory  as  developed 
by  Henri  Tajfel,  who  stresses  that  people  bolster 
their  self-esteem  by  membership  in  groups  that  they 
value  positively  and  that  are  emotionally  signifi¬ 
cant  to  them.  Based  on  these  assumptions,  the 
social  identity  of  a  person  is  more  than  a  mere 
cognitive  self-categorization.  A  strong  motivational 
content  should  guarantee  selective  perception  of 
political  events  in  favor  of  the  respective  political 
ingroup,  accompanied  by  outgroup  hostility.  This 
theory  fits  well  with  the  original  concept  of  party 
identification,  the  only  problem  now  being  which 
party  group  people  have  in  mind  when  “they  think 
of  themselves  as  Republicans,  Democrats,  etc.”  or 
when  they  think  of  the  party  as  an  “important 
group  object  in  their  environment,”  the  party  as  an 
organization,  or  other  identifiers  of  the  same  party. 
Donald  Green,  Bradley  Palmquist,  and  Eric  Schick- 
ler  (2002)  argue  for  the  latter  interpretation  and 
insist  that  it  agrees  best  with  the  classic  meaning. 
People  are  supposed  to  have  an  image  of  their  fel¬ 
low  partisans,  with  whom  they  identify  even  if  they 
have  no  contact  whatsoever  with  the  party  organi¬ 
zation  or  the  party  elite.  Otherwise  one  could  not 
explain  the  high  percentages  of  regular  voters  who 
identify  with  parties. 

This  new  theoretical  foundation  of  the  classical 
concept  is  important  because  it  helps  formulate 
adequate  survey  questions  that  tap  the  identity 
character  of  party  identification.  This  is  less  a 
problem  for  the  standard  American  question  that 
qualifies  well  as  an  identity  question  than  for  the 
type  of  questions  asked  in  some  other  countries. 
Questions  asking  whether  one  feels  close  to  a  party 
or  whether  one  is  an  adherent  of  a  party  are  less 
plausible  as  measures  of  identification  with  fellow 
partisans. 

The  identity  approach  is  sometimes  contrasted 
in  the  literature  with  the  so-called  attitudinal 
approach.  John  Bartle  and  Paolo  Bellucci  (2009) 
subsume  the  revisionist  concept  under  this  heading 
as  well  as  strong  party  attitudes  combining  cogni¬ 
tive,  emotional,  and  behavioral  aspects  of  this 
positive — or  negative — disposition  toward  a  party 
as  an  attitude  object.  Indicators  of  the  strength  of 
an  attitude  are  its  durability  and  its  impact  on 
behavior  or  on  other  attitudes.  These  are  the  crite¬ 
ria  that  were  also  applied  to  check  the  construct 


validity  of  the  classic  concept.  Thus,  the  general 
idea  holding  various  versions  of  party  identifica¬ 
tion  together  is  its  function  as  a  political  predispo¬ 
sition  influencing  short-term  political  preferences 
and  evaluations.  In  a  funnel  of  causality  leading 
from  background  characteristics  such  as  religion  or 
class  membership  to  the  final  vote,  political  predis¬ 
positions  enter  early,  being  influenced  themselves 
only  by  the  background  characteristics.  A  more 
general  version  of  partisanship  such  as  the  revi¬ 
sionist  concept  allows  for  regular  updating  caused 
by  new  information,  but  it  also  implies  higher 
durability  than  short-term  political  evaluations 
and  an  impact  on  evaluations  of  current  political 
cues.  Larger  permissiveness  concerning  long-term 
stability  means  at  the  same  time  fewer  challenges 
from  the  research  front.  Thus,  it  makes  sense  to 
focus  on  the  challenges  the  identity  approach  has 
to  cope  with  since  it  is  the  continuation  of  the 
demanding  classic  version  of  party  identification. 

From  the  beginning,  the  stability  of  party  iden¬ 
tification  over  time  was  the  most  demanding  pre¬ 
supposition  of  the  classic  concept.  Only  durable 
identifications  are  able  to  function  as  unmoved 
movers  for  short-term  evaluations  and  issue  pref¬ 
erences.  Compared  with  other  political  attitudes, 
party  identification  turned  out  to  be  indeed  rela¬ 
tively  stable  in  the  United  States,  and  the  stability 
is  about  at  the  same  high  level  as  for  various  eth¬ 
nic,  religious,  or  social  class  identities.  The  auto¬ 
correlations  of  identities  over  panel  waves  can  be 
made  even  higher  if  one  takes  measurement  error 
into  account.  An  answer  to  a  survey  question  is  a 
mixture  of  the  true  value  and  an  error  term  due  to 
inattention,  lack  of  understanding,  coding  mis¬ 
takes,  and  so  on.  Multiwave  panel  data  allow  the 
identification  of  these  errors  and  therefore  the 
computation  of  correlations  that  are  not  attenu¬ 
ated  by  errors.  Such  autocorrelations  reach  values 
well  above  .90  for  party  identification  in  the 
United  States. 

A  new  challenge  in  this  situation  is  posed  by 
competing  political  predispositions,  which  can  be 
as  stable  as  party  identification  once  measurement 
errors  are  taken  into  account  for  them,  too.  Such 
competing  predispositions  are  ideological  self¬ 
categorizations  or  policy-relevant  value  orienta¬ 
tions.  These  concepts  are  more  abstract  than 
social  identifications  but  are  often  measured  in 
mass  surveys  with  simple  questions,  one  item  for 
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each  concept.  State-of-the-art  measurement  would 
demand  multiple  items.  There  exist  first  results 
showing  that  issue  preferences  can  be  very  stable 
over  time  if  one  can  construct  indices  based  on 
multiple  items  and  if  one  takes  measurement 
errors  into  account.  Such  results  challenge  the 
monopoly  of  party  identification  as  the  only 
promising  candidate  for  the  function  of  a  political 
predisposition. 

Another  challenge  related  to  this  are  doubts 
about  the  applicability  and  fruitfulness  of  the  iden¬ 
tity  concept  outside  the  United  States.  The  best 
answer  to  this  question  can  be  given  by  acknowl¬ 
edging,  first,  the  importance  of  political  predispo¬ 
sitions  for  opinion  formation  and  voting  behavior 
and  by  finding  out,  second,  which  type  of  predis¬ 
position  has  the  best  explanatory  power  in  a  spe¬ 
cific  country,  given  its  political  history  and  party 
system.  Once  more  candidates  for  the  predisposi¬ 
tion  part  of  the  funnel  of  causality  are  available, 
there  arises,  of  course,  the  new  challenge  of  deci¬ 
phering  the  mutual  influences  among  them. 

Party  Identification  Outside 
the  United  States 

The  most  serious  challenge  of  the  classic  or  the 
identity  concept  of  party  identification  accrues 
from  research  results  on  European  party  systems 
and  voting  behavior.  After  early  attempts  to  trans¬ 
fer  the  core  concept  of  the  Michigan  school  to 
Britain,  France,  the  Netherlands,  and  other 
European  countries  in  the  1960s,  one  conclusion 
was  that  the  concept,  successful  as  it  is  in  the 
United  States,  is  less  powerful  or  even  nonappli- 
cable  in  Europe.  Some  authors  argue  that  it  is 
indistinguishable  from  vote  choice  or  is  much  less 
stable  over  time  than  in  the  United  States  or  that 
there  exist  better  alternatives  that  European  voters 
can  use  for  political  orientation.  Other  researchers 
strongly  disagree,  coming  to  the  opposite  conclu¬ 
sion  that,  depending  on  certain  conditions,  the 
concept  passed  critical  tests  of  construct  validity. 

A  first  condition  is,  of  course,  to  come  up  with 
an  equivalent  question  that  taps  the  affective  iden¬ 
tification  or  the  identity  aspect.  Richard  Johnston 
(2006)  compiled  an  overview  of  the  questions  used 
in  various  countries  and  found  that  only  the  ques¬ 
tions  in  the  English-speaking  countries  and  in 
Japan  fall  back  on  the  phrase  whether  one  thinks 


of  oneself  as  “Conservative,  Labor,  Liberal,  or 
what,”  as  the  British  question  is  formulated.  In 
other  countries  respondents  are  asked  whether 
they  consider  themselves  as  adherents  of  a  party 
(Denmark,  Sweden,  and  the  Netherlands),  feel 
close  to  a  party  (France),  or  lean  toward  a  party 
(Germany).  The  identity  aspect  decreases  in  this 
sequence.  In  most  countries,  researchers  have 
agreed  on  a  standard  formulation  that  is  asked 
regularly  in  national  election  studies. 

For  the  crucial  validity  check  of  stability,  panel 
studies  are  necessary,  and  in  this  respect  most  of 
the  early  results  were  disappointing  compared 
with  the  stability  of  party  identification  in  the 
United  States.  For  the  Netherlands,  Jacques 
Thomassen  even  reported  a  higher  stability  of  the 
vote  than  of  party  identification  and  corroborated 
this  finding,  together  with  Martin  Rosema,  recently 
(cited  in  Bartle  &  Belluci,  2009).  But  taking 
measurement  error  into  account  and  applying  sta¬ 
tistical  models  appropriate  for  the  respective  mea¬ 
surement  level  of  the  data  can  change  the  overall 
impression  of  instability.  Thus,  Green  and  Schickler 
(cited  in  Bartle  &  Belluci,  2009)  concluded  that 
many  of  the  reports  on  the  instability  of  party 
identification  in  Europe  are  grossly  exaggerated. 
They  note  at  the  same  time  that  the  identity 
concept  does  not  imply  stability  under  all  circum¬ 
stances.  As  one  would  expect,  the  party  identifica¬ 
tions  of  the  Italian  electorate  became  fluid  in  the 
1990s,  due  to  major  changes  in  the  Italian  party 
system.  A  final  conclusion  concerning  the  stability 
of  party  identification  in  Europe,  however,  is  not 
yet  reached.  The  answers  given  depend  on  the 
methods  used  to  measure  stability.  Error  term  cor¬ 
rections  rely  on  classical  measurement  theory, 
which  postulates  random  errors  in  an  otherwise 
homogeneous  population.  Alternatively,  one  can 
assume  a  heterogeneous  population  consisting  of 
movers  and  stayers  that  can  be  tested  by  latent 
class  analysis.  Such  analyses  show  substantial 
dynamics  and  a  class  of  movers  in  Britain,  for 
example,  of  one  third  to  two  fifths  of  the  elector¬ 
ate.  These  results  do  confirm  the  revisionist  con¬ 
cept  that  Harold  Clarke,  David  Sanders,  Marianne 
Stewart,  and  Paul  Whiteley  call  valenced  partisan¬ 
ship  (2004,  p.  211).  The  transition  probabilities 
between  British  parties  are  not  random  but  show 
trends  parallel  to  the  “accumulated  party  and 
party  leader  performance  evaluations.” 
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A  condition  at  the  macrolevel  of  the  political 
system  that  may  have  an  impact  on  partisanship, 
especially  if  some  voters  have  multiple  party  iden¬ 
tifications,  is  the  coalition  situation  in  a  country. 
In  some  countries,  there  exist  rather  stable  options 
for  government  formation.  In  Germany,  for  exam¬ 
ple,  a  left-wing  government  coalition  is  an  option, 
but  one  can  also  have  a  grand  coalition  between 
Social  Democrats  and  Christian  Democrats,  a  gov¬ 
ernment  of  the  Social  Democrats  and  the  Liberals, 
or  finally  the  option  of  a  bourgeois  ( biirgerlich ) 
government  of  Christian  Democrats  and  Liberals. 
These  types  of  coalition  also  serve  as  attitude 
objects  competing  with  parties  in  their  importance 
for  the  electorate. 

The  most  discussed  alternative  of  party  identifi¬ 
cation  as  a  political  predisposition  is  ideology, 
often  asked  as  ideological  self-categorization  as 
conservative  or  liberal  in  the  United  States  or  as 
left  or  right  leaning  in  Europe.  European  voters 
have  few  problems  in  classifying  parties  as  well  as 
themselves  on  self-anchoring  left-right  scales.  In 
some  countries  such  as  France,  parties  either 
change  their  names  from  time  to  time  or  enter  and 
leave  the  party  system  altogether,  so  ideology  can 
serve  as  a  more  stable  orientation  device  than 
party  identification.  Both  party  identification  and 
ideological  self-categorization  exist  in  France,  the 
question  being  their  reciprocal  influences  and  their 
comparative  impact  on  the  vote.  Recent  research 
results  based  on  panel  surveys  show  that  ideology 
influences  the  legislative  vote  (first  round)  more 
than  party  identification. 

Such  country-specific  findings  are  sometimes 
criticized  as  distractions  from  a  general  theory  of 
party  identification.  This  critique  is  unjustified  if 
the  goal  is  to  identify  the  conditions,  especially  at 
the  macrolevel,  that  strengthen  or  dilute  party 
identification  as  a  political  predisposition.  The 
identity  concept  is  based  on  firm  theoretical 
grounds  postulating  the  predispositional  character 
of  party  identification,  which  is  challenged  empiri¬ 
cally  by  findings  on  the  dynamics  of  identification. 
Valenced  partisanship,  by  contrast,  is  always  in 
danger  of  crossing  the  divide  between  predisposi¬ 
tions  and  short-term  political  attitudes. 

Franz  Urban  Pappi 
Mannheim  University 
Mannheim,  Germany 
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Party  Linkage 


The  concept  of  party  linkage  refers  to  the  ways  in 
which  political  parties  provide  linkage  to  the 
state.  In  democratic  theory,  parties  link  citizens  to 
the  state,  but  linkage  theory  encompasses  as  well 
how  parties  link  specific  entities  such  as  special 
interests  and  campaign  donors  to  the  state  and 
how  they  sometimes  develop  stronger  linkages  to 
the  state  for  themselves  alone  rather  than  for  any 
other  entities.  Depending  on  what  kind  of  linkage 
they  provide,  parties  contribute  to  participatory 
democracy,  responsive  democracy,  responsive  oli¬ 
garchic  government,  or  coercive  authoritarian 
government.  They  may  also  serve  as  agencies  of 
prerevolutionary  linkage  or  of  market  linkage.  It 
is  often  possible  to  find  various  kinds  of  linkage 
by  party  within  the  same  nation-state.  This  entry 
briefly  examines  each  form  of  linkage  by  party  in 
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turn  and  then  examines  work  that  helps  explain 
why  certain  forms  have  become  increasingly  com¬ 
mon  while  others  seem  to  be  falling  into  decline. 

The  use  of  the  term  party  linkage  is  relatively 
new  but  is  in  fact  an  elaboration  of  the  older  con¬ 
cept  of  the  party  as  a  broker  between  citizens  and 
the  state,  capable  of  aggregating  specific  interests, 
forming  programs,  and,  when  successful  in  elec¬ 
toral  competition,  placing  representatives  in  office 
to  carry  them  out.  A  similar  approach  conceived  of 
parties  as  transmission  belts ,  moving  ideas  through 
the  body  politic,  transforming  them  into  proposals 
for  legislation  and  eventually  into  laws  for  imple¬ 
mentation  in  the  halls  of  governance. 

These  early  metaphors,  drawn  from  the  boister¬ 
ous  world  of  industrial  finance  and  production  in 
the  heyday  of  the  Industrial  Revolution  and  its 
aftermath,  presented  a  uniquely  positive  and 
American  point  of  view.  It  was  not  until  after 
World  War  II  that  the  term  linkage  was  developed 
to  extend  the  idea  of  parties  as  connectors  to  the 
state  and  to  make  it  possible  to  consider  the  wider 
range  of  the  kinds  of  connections  that  can  be  made 
by  parties.  The  first  effort  to  do  so  systematically 
was  made  in  1980  and  identified  four  kinds  of  link¬ 
age  by  party:  (1)  participatory  linkage,  in  which 
parties  serve  as  agencies  through  which  citizens  can 
participate  in  government;  (2)  responsive  linkage, 
in  which  parties  serve  as  agencies  that  strive  to 
ensure  that  government  officials  will  be  responsive 
to  the  views  of  rank-and-file  voters;  (3)  clientelistic 
linkage,  in  which  parties  serve  as  channels  for  the 
exchange  of  votes  for  favors;  and  (4)  coercive  link¬ 
age,  where  parties  help  authoritarian  governments 
maintain  coercive  control  over  their  subjects. 

Over  time,  research  on  parties  as  agencies  of 
linkage  has  explored  the  applicability  of  this  typol¬ 
ogy,  often  concentrating  on  the  extent  to  which 
either  participatory  or  responsive  linkage  is  in  fact 
being  provided.  Other  studies  have  elaborated  the 
typology,  pointing  out,  for  example,  that  parties 
may  engage  in  selective  linkage  rather  than  collec¬ 
tive  linkage,  may  offer  representative  linkage  to 
other  organizations  in  exchange  for  electoral  sup¬ 
port,  and  may  develop  different  forms  and  pro¬ 
cesses  of  linkage  within  complex  and  multilayered 
polities. 

A  particularly  modern  form  of  these  kinds  of 
linkage  is  market  linkage.  Market  linkage  is  link¬ 
age  for  sale.  In  one  sense,  it  is  not  new:  Once  in 


power,  parties  in  government  have  always  favored 
those  who  helped  them  to  office  and  rewarded 
them  for  doing  so.  What  makes  market  linkage 
different  today  is  the  overwhelming  power  of 
advertising  and  the  consequent  need  for  vast  sums 
of  money  to  win  votes.  In  this  form  of  linkage,  par¬ 
ties  link  groups  to  the  state  by  collecting  funds  and 
waging  campaigns  on  behalf  of  the  candidates 
selected  by  those  groups;  they  seek  to  place  their 
patrons’  candidates  in  office  and  thus  become 
indispensable  partners  in  power.  Ordinary  citizens 
are  encouraged  to  believe  that  they  will  be  the  ben¬ 
eficiaries  of  responsive  linkage,  but  their  hopes  and 
needs  will  be  attended  to,  if  at  all,  only  after  those 
of  the  patrons  have  been  met.  Their  only  hope  in  a 
system  where  market  linkage  is  well  entrenched  is 
to  form  well-financed  mass  movements,  that  is,  for 
citizens  to  become  patrons  themselves  and  for 
movements  to  take  the  place  of  parties  as  agencies 
of  linkage  (although  movements  never  themselves 
move  into  positions  of  governance  as  parties  suc¬ 
cessful  in  elections  do). 

The  forms  of  linkage  discussed  so  far  focus,  by 
and  large,  on  contemporary  parties  in  established 
democracies.  More  recent  studies  examining  the 
relationship  between  parties  and  democracy  in  the 
past  as  well  as  in  nondemocratic  states  have 
revealed  another  form  of  linkage  by  party,  revolu¬ 
tionary  linkage.  Such  linkage  takes  place  in  nations 
where  there  is  neither  democratic  governance  nor 
a  legal  party  system,  and  citizens  who  find  the 
autocratic  rule  intolerable  join  parties  that  have 
been  outlawed  or  are  forced  to  go  underground  or 
else  form  new,  embryonic  parties.  The  linkage  the 
party  provides  in  such  a  case  is  not  to  a  state  per 
se  but  to  an  imagined  one,  one  that  is  seen  as 
embodied  in  the  words  and  deeds  of  its  leader.  For 
the  followers,  linkage  to  a  legitimate  state  is  the 
dream,  but  linkage  to  a  charismatic  leader  and  his 
or  her  entourage  is  the  presently  available  reward 
for  joining  the  battle  for  liberty. 

Revolutionary  linkage  is  found  throughout  his¬ 
tory,  including  at  the  present  time.  A  period  of 
organized  struggle  for  liberty  is  found  in  the  his¬ 
tory  of  every  currently  democratic  state  except 
those  on  whom  democracy  has  been  imposed  by 
external  conquest.  As  a  contemporary  phenome¬ 
non,  it  appears  in  states  where  democracy  has  not 
yet  been  established  or  where  it  has  fallen  and 
must  be  fought  for  all  over  again.  In  the  latter  case, 
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the  illegal  movements  pursuing  the  dream  may  be 
parties  that  have  been  outlawed. 

The  six  forms  of  linkage  identified  can  readily 
be  divided  into  those  that  foster  democracy  and 
those  that  work  against  it.  Parties  building  linkage 
in  prerevolutionary  settings  and  those  fostering 
participatory  and  responsive  linkage  in  established 
democracies  all  offer  prodemocracy  forms  of  link¬ 
age  (or  at  least  purport  to  be  doing  so),  whereas 
those  that  link  only  by  clientelistic  rewards,  abet¬ 
ting  agencies  of  oppression,  or  favoring  their  most 
generous  contributors,  use  antidemocratic  meth¬ 
ods  of  creating  linkage. 

Recent  studies  of  the  relationship  between  par¬ 
ties  and  democracy  suggest  that  the  latter  types  of 
linkage  are  on  the  rise  and  that  the  former  are 
weakening.  Throughout  the  world,  in  long- 
established  democracies,  in  new  democracies,  and 
in  nations  still  only  partially  or  not  at  all  democra¬ 
tized,  opportunities  for  participatory  linkage  via 
parties  have  diminished,  responsive  linkage  has 
weakened,  new  forms  of  clientelistic  and  directive 
linkage  have  evolved,  struggles  for  liberation  via 
prerevolutionary  linkage  have  been  baffled  and 
stalled,  and  market  linkage  has  become  ever  more 
common. 

External  forces  such  as  market  globalization 
and  the  rising  costs  of  effective  campaigning  with 
modern  forms  of  communication,  as  well  as  stron¬ 
ger  and  more  sophisticated  oppression  by  dictato¬ 
rial  regimes,  have  provided  pressure  on  parties  to 
shift  to  (or  continue  to  limit  themselves  to)  the  less 
democratic  forms  of  linkage,  and  parties  have 
responded.  Recent  studies  of  this  process  of  de¬ 
democratization  by  party  have  identified  many  of 
the  steps  taken  to  move  in  this  direction.  (Specific 
examples,  some  of  which  are  noted  below,  have 
been  found  in  46  nations,  in  the  parties  of  North 
and  Latin  America,  West  and  Eastern  Europe, 
post-Soviet  states,  Asia,  Africa,  Oceania,  and  the 
Middle  East.) 

In  nondictatorial  regimes,  the  shift  by  parties 
from  participatory  or  at  least  responsive  linkage  to 
less  democratic  forms  of  linkage  is  often  difficult  to 
isolate  and  identify,  because  the  role  parties  play  in 
de-democratization  is  frequently  combined  with 
other  activities  by  the  same  parties  that  contribute 
to  the  maintenance  of  democratic  forms  of  linkage. 
In  such  hybrid  systems,  parties  are  routinely  con¬ 
sidered  accountable  to  a  democratic  electorate  and 


may  in  fact  still  be  so  to  a  considerable  extent.  In 
addition,  only  a  portion  of  a  party’s  leadership  and 
supporters  may  engage  in  de-democratizing  behav¬ 
ior,  and  its  deleterious  effects  on  democracy  may  be 
difficult  to  recognize.  However,  certain  easily  rec¬ 
ognizable  de-democratizing  forms  of  party  behav¬ 
ior,  augmenting  clientelistic  and  market  forms  of 
linkage  within  established  democracies,  have  been 
pointed  out  and  have  been  categorized,  according 
to  whether  their  effect  is  direct  or  indirect,  as 
proactive  and  complicit.  Examples  of  proactive 
linkage-changing  behavior  by  parties  include  delib¬ 
erately  reducing  the  powers  of  the  party  base, 
overtly  sponsoring  or  supporting  legislation  or  judi¬ 
cial  decisions  that  eliminate  regulations  protecting 
rights  of  political  participation,  and  campaigning 
dishonestly,  with  no  intention  of  implementing  the 
promises  made  to  the  majority  of  supportive  voters. 

Parties  are  complicit  in  facilitating  de-democra- 
tization  when  they  aid  those  who  proactively  seek 
its  accomplishment  by  seeking  other  goals.  There 
are  three  subtypes  of  complicity:  unconscious, 
venal,  and  ideological.  Unconscious  complicity 
refers  to  failure  to  recognize  the  likely  de- 
democratizing  effect  of  the  proposals  one  sup¬ 
ports.  A  legislator  who  votes  for  a  new  electoral 
law  without  recognizing  that  the  implementation 
of  one  of  its  clauses  will  lead  to  the  effective  dis¬ 
enfranchisement  of  a  portion  of  the  population 
provides  an  example  of  unconscious  complicity. 
Venal  complicity  is  manifest  in  the  behavior  of 
party  representatives  who  accept  roles  and  rewards 
for  themselves  and  in  exchange  engage  in  no 
actions  contrary  to  the  wishes  of  those  who  grant 
such  perquisites.  It  is  also  seen  in  the  behavior  of 
those  who  vote  strictly  according  to  the  wishes  of 
their  largest  donors  (a  form  more  difficult  to  iden¬ 
tify,  as  such  votes  may  in  fact  be  in  accordance 
with  the  legislator’s  own  opinions).  Ideological 
complicity  takes  place  when  a  party  rejects  funda¬ 
mental  precepts  of  representation  and  account¬ 
ability  on  principle.  Extremist  and  racist  parties 
preaching  the  exclusion  of  minority  populations 
can  be,  without  ever  winning  office,  ideologically 
complicit  in  facilitating  the  shift  to  nondemocratic 
forms  of  linkage. 

In  dictatorial  regimes,  parties  reduce  the  possi¬ 
bility  of  establishing  democratic  forms  of  linkage 
when  they  engage  in  venal  complicity,  accepting 
personal  rewards  for  not  pursuing  the  interests  of 
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their  supporters;  offer  clientelistic  rewards  in 
exchange  for  votes;  or  assist  the  regime  in  tactics 
of  oppression  (coercive  linkage).  They  may  also 
contribute  to  the  entrenchment  of  nondemocratic 
forms  of  linkage  with  retreative  behavior,  as  when 
parties  that  support  democracy  disband  when 
defeated,  heavily  oppressed,  or  outlawed,  behavior 
that  may  appear  thoroughly  reasonable  and  justi¬ 
fied  but  nonetheless  weakens  the  struggle  to  initi¬ 
ate  democratic  governance  or  maintain  any  prog¬ 
ress  made  in  that  direction.  All  these  forms  of 
behavior  have  been  clearly  identified  in  many 
states  in  the  contemporary  Arab  world  but  have 
also  been  noted  throughout  the  world  at  various 
points  in  time. 

These  recent  efforts  to  combine  the  study  of 
de-democratization  with  the  study  of  party  linkage 
offer  new  perspectives  on  the  nature  of  linkage 
today,  a  rich  field  of  study  for  students  of  party  as 
well  as  students  of  democracy. 
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A  party  manifesto  or  platform  is  the  authoritative 
statement  of  party  policy  at  the  time  of  a  general 
election.  Many  other  pamphlets  and  policy 


documents  may  be  issued  by  leaders,  candidates, 
or  different  party  organs  around  this  time.  How¬ 
ever,  the  manifesto  is  unique  in  that  it  is  the 
authoritative  statement  of  the  party  position  for 
that  election,  usually  endorsed  by  a  representative 
conference  of  party  members,  according  to  recog¬ 
nized  procedures.  This  has  implications  for  its 
content  and  presentation:  Otherwise,  they  might 
not  have  been  approved  as  authoritative.  Mani¬ 
festos  are  not,  therefore,  just  an  arbitrarily  selected 
policy  document  out  of  many  available  at  that 
time.  They  are  the  documents  that  have  been 
crafted  for,  and  received,  official  approval.  In  the 
following,  the  significance  of  such  documents  is 
assessed  for  elections  and  the  formation  of  coali¬ 
tions,  of  party  policies,  and  of  democratic  theory 
as  a  whole.  In  a  few  cases,  this  authoritative  elec¬ 
tion  statement  of  policy  is  made  verbally  by  an 
authorized  spokesman — party  chair  or  leader — 
sometimes  as  a  radio  or  TV  speech  (in  Australia) 
or  an  interview  for  a  leading  newspaper  (in 
Japan).  In  Scandinavia,  the  document  may  not  be 
published  as  a  whole;  instead,  different  sections 
may  be  distributed  to  different  groups.  However, 
all  these  processes  of  transmission  are  carried  out 
by  designated  spokespersons  or  party  bodies. 
Fundamental  programs — statements  of  party 
principle — are  also  approved  from  time  to  time  by 
some  parties  but  should  not  be  confused  with  the 
action  program  or  campaign  platform  endorsed 
for  an  election. 

Usually  issued  as  a  booklet  or  pamphlet,  the 
manifesto  is  normally  presented  at  the  beginning 
of  the  election  campaign  at  a  press  conference 
designed  to  publicize  its  leading  concerns.  Not 
many  electors  read  it;  its  influence  is  felt  mainly 
through  press  and  TV  discussions  where  party  rep¬ 
resentatives  will  be  confronted  with  its  statements. 
The  manifesto  thus  functions  as  an  equilibrium 
point  around  which  party  policy  oscillates  and  to 
which  it  returns  during  the  campaign. 

Parties  are  the  only  organizations  that  issue 
such  comprehensive  medium-term  programs  for 
the  whole  of  society.  Until  the  middle  of  the  post¬ 
war  period,  conventional  wisdom  was  that  mani¬ 
festos  did  not  count  for  much,  as  electors  did  not 
read  them  and  parties,  once  elected,  forgot  about 
them  and  certainly  did  not  wish  to  be  bound  by 
them.  Were  this  critique  true,  it  would  constitute  a 
devastating  critique  of  representative  democracy, 
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defined  as  a  political  system  that  makes  a  neces¬ 
sary  connection  between  popular  preferences  and 
public  policy.  This  is  because  party  policies  as 
outlined  in  their  manifestos  are  what  voters  are 
asked  to  choose  between.  If  manifestos  do  not  get 
their  message  through,  then  the  expression  of 
popular  preferences  in  the  vote  will  be  wayward 
and  ill  informed.  Similarly,  if  parties  ignore  them 
once  they  are  in  office,  popular  preferences  will 
not  shape  public  policy. 

A  more  positive  evaluation  of  manifestos  has 
emerged  over  the  past  30  years.  The  mass  media 
need  them  to  structure  their  election  discussions  so 
that  their  main  features  and  concerns  get  through 
to  electors.  Parties  too  need  a  basis  for  policy  coor¬ 
dination  and  innovation  once  they  get  into  gov¬ 
ernment,  which  they  are  generally  too  busy  to 
generate  once  they  are  there.  They  have,  therefore, 
to  fall  back  on  the  manifesto  as  a  readily  available 
and  authoritative  basis  for  action. 

This  is  true  even  where  coalition  governments 
are  formed.  Either  each  party  has  carte  blanche  to 
do  as  it  wishes  inside  the  ministries  and  policy  sec¬ 
tors  assigned  to  it  (in  which  case  its  manifesto 
serves  as  a  basis  for  action  there)  or  an  overall 
coalition  agreement  is  developed  from  the  constit¬ 
uent  party  manifestos.  Either  way,  the  latter  feed 
into  government  policy. 

These  reevaluations  of  the  role  of  the  manifesto 
have  come  about  partly  through  the  work  of  the 
Manifesto  Research  Group  (MRG)  and  its  succes¬ 
sor  project,  the  Comparative  Manifestos  Project 
(CMP),  based  at  the  Wissenschaftszentrum,  Berlin. 
The  MRG,  a  grouping  of  about  20  mainly  European 
political  scientists,  counted  the  (quasi)  sentences  of 
the  manifestos  of  all  significant  parties  in  25  coun¬ 
tries  in  56  categories  covering  the  full  range  of 
party  policy.  Quasi  sentences  are  defined  as  argu¬ 
ments  or  phrases  that  are  the  verbal  expression  of 
one  idea  or  meaning.  Where  they  do  not  coincide 
with  a  natural  sentence,  they  are  often  marked  off 
by  bullet  points,  commas,  or  semicolons.  Raw 
counts  of  these  for  each  election  manifesto  are 
expressed  as  percentages  to  allow  comparison  of 
documents  of  varying  lengths.  These  percentages 
can  then  be  used  to  compare  the  policy  concerns  of 
different  parties,  to  distinguish  them  ideologically, 
or  to  trace  the  evolution  of  their  thinking  over  time. 

Statistical  analyses  of  these  manifesto-derived 
data  have  also  been  useful  for  giving  credibility 


and  substance  to  the  idea  of  a  basic  left-right  divi¬ 
sion  underlying  democratic  politics.  This  has 
emerged  from  several  different  types  of  analysis 
applied  to  the  data.  The  MRG-CMP  Left-Right 
scale  contrasts  policy  concerns  of  the  Left  (inter¬ 
vention,  welfare,  and  peace)  with  those  of  the 
Right  (freedom,  traditional  values,  and  military 
strength).  By  subtracting  the  summed  references  to 
Left  topics  from  those  to  Right  topics,  party  posi¬ 
tions  and  movements  can  be  mapped  across  54 
countries  for  the  postwar  period  (Hans-Dieter 
Klingemann,  Andrea  Volkens,  Judith  Bara,  &  Ian 
Budge,  2007). 

Quantifying  manifesto  texts  in  this  way  has  also 
been  essential  in  establishing  relationships  between 
party  election  programs  and  subsequent  govern¬ 
ment  action.  Emphases  in  the  manifestos  can  be 
linked  to  government  expenditure  to  see  if  they  get 
carried  through.  The  answer  is  clearly  that  they  do, 
thus  upholding  the  validity  of  election  results  as  a 
guide  for  government  action. 

For  a  more  limited  range  of  countries,  other 
studies  have  examined  the  extent  to  which  specific 
pledges  made  in  the  manifesto  are  actually  fulfilled 
by  governments.  Again  it  appears  that  a  substan¬ 
tial  number  support  the  idea  of  a  party  mandate 
from  the  electorate  based  on  the  manifesto. 

Manifestos  and  the  data  describing  them  are 
very  important  for  the  development  of  systematic 
political  science  because  they  are  the  main  source 
of  information  on  the  preferences  of  the  main 
political  actors  across  countries  and  over  time.  The 
ability  to  measure  such  preferences  in  a  fully  com¬ 
parable  way  enables  political  scientists  to  check 
the  major  theories  and  models  of  representation 
and  party  competition  systematically  in  a  way  that 
was  never  conceived  of  before.  Rational  choice 
theories  in  particular,  which  depends  crucially  on 
assumptions  about,  and  measurement  of,  prefer¬ 
ences,  can  now  be  put  directly  to  the  test.  Derived 
measures  from  the  manifestos  can  be  used  to  esti¬ 
mate  government  intentions  and  the  preference  of 
the  median  voter  (Hee-Min  Kim  &  Richard 
Fording,  chap.  8,  in  Ian  Budge,  Hans-Dieter 
Klingemann,  Andrea  Volkens,  Judith  Bara,  &  Eric 
Tanenbaum,  2001).  It  is  particularly  useful  to  have 
the  latter  where  direct  survey  evidence  is  lacking. 
Since  manifestos  state  the  party’s  position  in  its 
own  words  and  from  its  own  point  of  view,  they 
provide  more  direct  evidence  on  its  position  than 
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experts’  judgments  or  electors’  perceptions  of 
party  positions  do. 

So  far,  the  only  quantified  data  from  manifestos 
have  been  the  MRG-CMP  time  series,  based  on 
manual  coding  of  the  documents.  Because  of  their 
research  importance,  increasing  efforts  have  been 
made  to  develop  computerized  analyses  of  the 
texts.  Indeed,  manifestos  have  become  the  focus 
and  growth  point  for  computerized  analyses  of 
political  texts,  not  the  least  because  the  efforts  of 
the  MRG-CMP  in  collecting,  storing,  and  scanning 
them  make  them  easily  accessible  for  all  users. 
Clearly,  therefore,  manifestos  offer  a  major  source 
for  political  analyses  of  the  central  democratic  pro¬ 
cesses,  particularly  for  representational  processes. 
Political  parties  are  so  crucial  to  these  (both  in 
organizing  election  choices  for  voters  and  in  effect¬ 
ing  the  majority-supported  preference  in  govern¬ 
ment)  that  the  authoritative  policy  document  they 
issue  must  be  of  central  interest  in  studying  both 
elections  and  governments.  The  combination  of  its 
major  importance  to  the  democratic  political  pro¬ 
cess  and  the  fact  that  it  can  be  relatively  easily 
quantified  using  modern  techniques  gives  it  a  key 
role  in  the  future  development  of  systematic  politi¬ 
cal  science.  Central  concepts  and  relationships  that 
can  be  measured  using  the  policy  positions  con¬ 
veyed  by  manifestos  in  combination  with  other 
data  (bracketed)  include 

•  the  median  voter  (party  votes), 

•  congruence  between  popular  preferences  and 
government  intentions  (cabinet  posts), 

•  long-term  bias  in  government  intentions 
compared  with  popular  preferences,  and 

•  the  relationship  between  settled  popular 
preferences  and  enacted  government  policy. 

As  indicated,  representational  processes  are 
strongly  shaped  by  the  way  parties  behave  in  pol¬ 
icy  terms.  Here  again,  manifestos,  particularly  in 
their  quantified  form,  are  important  in  systemati¬ 
cally  tracing  policy  movement  over  time  and  in 
relation  to  other  parties.  Particularly  important  in 
this  regard  have  been  the  maps  of  left-right  move¬ 
ment  produced  by  the  MRG-CMP  published  in 
recent  years.  Generally,  these  show  little  sign  of 
party  policy  convergence  on  the  center  or  on  the 
median  elector  as  predicted  by  some  influential 
models  of  party  behavior.  Rather,  they  show 


parties  as  sticking  fairly  consistently  to  their  own 
ideological  space  and  thus  maintaining  the  policy 
differentiation  that  is  important  in  offering  voters 
the  kind  of  choice  between  alternative  programs 
envisaged  in  the  theory  of  the  party  mandate 
(Michael  McDonald  &  Ian  Budge,  2005).  Some 
strategic  or  other  forms  of  variation  also  occur  in 
their  positions  over  time.  This  has  the  effect  of 
varying  the  policy  options  offered  from  election  to 
election,  another  important  element  in  democratic 
choice.  It  is  paradoxical  that  political  science,  as 
opposed  to  normative  political  theory,  has  for  so 
long  neglected  the  systematic  study  of  texts.  Both 
the  main  inputs  and  the  major  outputs  of  political 
science  are  textual — reports,  loans,  budgets,  circu¬ 
lars,  and  declarations.  Central  on  the  input  side  is 
the  election  program  or  manifesto.  Perhaps  the 
manifesto  research  described  above  will  spill  over 
into  the  analysis  of  other  political  texts  in  the  not 
too  distant  future. 

Ian  Budge 
University  of  Essex 
Colchester,  United  Kingdom 

See  also  Coalitions;  Democracy,  Theories  of;  Electoral 
Behavior;  Parties 
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To  study  party  organizations  it  is  essential  to 
draw  all  necessary  analytical  suggestions  from 
organization  theory  and  to  adapt  them  to  the  spe¬ 
cial  case  of  parties.  Of  course,  this  is  not  the  only 
way  of  studying  political  parties.  Many  different 
points  of  view  can  be  adopted.  But  this  particular 
way  provides  the  opportunity  to  explore  the  dif¬ 
ferent  ways  in  which  the  internal  rules  of  the 
game,  that  is,  the  system  of  organizational  incen¬ 
tives  and  opportunities,  influence  the  actions  of 
party  members  at  the  top  levels,  at  the  grassroots 
levels,  and  at  the  intermediate  hierarchical  levels. 

An  innovative  definition  of  party  organization 
is  not  needed.  Political  parties  are  formal  organiza¬ 
tions.  Therefore,  we  can  begin  with  a  standard 
definition  of  formal  organization  as 

a  group  of  people  formally  constituted  and 
endowed  with  an  official  mission,  a  hierarchy 
(more  or  less  elaborated),  as  well  as  a  structure  of 
internal  coordination,  boundaries  (more  or  less 
open),  and  some  kind  of  task  specialization 
(more  or  less  developed). 

A  political  party  is  a  formal  organization  spe¬ 
cialized  in  the  presentation  of  candidates  in  local 
and/or  national  elections.  In  this  perspective,  the 
first  and  most  important  difference,  according  to 
Giovanni  Sartori,  is  among  parties  operating  in  a 
competitive,  namely,  democratic  environment  and 
parties  operating  in  a  noncompetitive  environ¬ 
ment — that  is,  some  modern  authoritarian  single¬ 
party  system. 

This  entry  investigates  some  aspects  of  the 
complex  question  of  party  organizations.  It  begins 
with  some  general  suggestions  about  the  historical 
evolution  of  political  parties:  their  formation, 
their  institutionalization,  and  path  dependency 
effects.  Next,  it  considers  parties’  organizational 
structures  (different  kinds  of  hierarchies,  different 
kinds  of  power  structure)  and  the  linkages  between 
a  party’s  official  goals  and  its  organizational 
structure,  the  relations  between  parties’  organiza¬ 
tions  and  their  external  environment,  and  the 
causal  mechanisms  at  work  when  political  parties 
experiment  with  organizational  changes.  Finally, 
this  entry  briefly  examines  some  typologies  of 


parties,  summarizing  the  most  relevant  features  of 
recent  developments  in  party  organization. 

Historical  Evolution 

In  a  historical-institutionalist  perspective,  under¬ 
standing  party  organization  requires  an  analytical 
reconstruction  of  each  political  party’s  origin  and 
specific  institutionalization.  The  features  of  par¬ 
ties’  organizations  depend  on  past  history:  how  the 
organizations  originated  and  how  they  consoli¬ 
dated.  Path  dependency  rules  explain  why  every 
organization,  and  political  parties  too,  bears  the 
mark  of  its  origin  and  consolidation  (institutional¬ 
ization)  even  several  decades  later.  Reconstructing 
the  genetic  model  (Angelo  Panebianco,  1988)  of 
political  parties  means  considering  three  elements: 

f.  The  organizational  development:  The  birth 
of  a  party  can  be  due  to  territorial  penetration  or 
territorial  diffusion,  or  their  combination.  Pene¬ 
tration  means  that  a  “center”  organizes,  controls, 
and  directs  the  development  of  a  territorial  “periph¬ 
ery.”  Diffusion  means  that  party  organization  is 
the  product  of  the  aggregation/federation  of  previ¬ 
ous  local  groups  and  elites.  In  the  first  case,  the 
party  will  probably  become  a  strong,  centralized 
organization  controlled  by  a  unified  central  oligar¬ 
chy.  In  the  second  case,  the  party  will  be  a  decen¬ 
tralized  organization  with  many  diversified  and 
competing  groups:  a  stratarchy,  as  described  by 
Samuel  Eldersveld  in  1964,  in  which  every  sub¬ 
group  fights  for  power,  making  precarious  and 
instable  compromises  with  other  subgroups. 

2.  The  presence  or  absence  of  an  external  spon¬ 
sor  of  an  institution  (a  church,  trade  unions,  the 
Comintern)  as  actual  founder  of  the  party:  If  an 
external  sponsor  exists,  the  party  is  its  “political 
weapon.”  The  external  sponsor  is  the  main  center 
of  loyalties  and  identifications  for  party  followers 
and  members  as  well  as  the  source  of  legitimation 
for  party  leaders.  Therefore,  externally  legitimated 
parties  (confessional  parties,  labor  parties,  commu¬ 
nist  parties)  and  internally  legitimated  parties  can  be 
distinguished.  This  circumstance  will  influence  all 
aspects  of  the  future  organizational  developments. 

3.  The  presence  or  absence  of  a  charismatic 
leader  as  founder  of  parties:  Charismatic  parties 
have  very  special  features.  The  leader  holds  the  full 
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control  of  the  party’s  dominant  coalition.  He  or 
she  is  the  de  facto  owner  of  the  party. 

The  characteristics  of  the  genetic  model  influ¬ 
ence  the  manner  of  institutionalization,  the  process 
of  structural  consolidation  of  parties.  Institution¬ 
alization  is  the  process  by  which  an  organization 
incorporates  its  founder’s  values  and  aims,  by 
which  it  becomes  an  institution — develops  bound¬ 
aries,  an  internal  career  system,  a  consolidated 
hierarchy,  and  a  professionalized  leadership.  Two 
ideal  types  can  be  distinguished:  strong  institution¬ 
alization  and  weak  institutionalization.  Strong 
institutionalization  means  high  autonomy  from 
the  environment  and  high  interdependencies  and 
coherence  among  its  internal  components.  Weak 
institutionalization  means  low  external  autonomy 
and  a  low  degree  of  internal  interdependence.  In 
the  first  case,  the  party  will  be  a  centralized, 
bureaucratic,  organization  led  by  a  strong  central 
oligarchy.  It  will  hold  the  control  of  many  external 
organizations  (unions,  interest  groups,  etc.),  and  it 
will  adopt  an  aggressive,  expansionist,  policy 
toward  the  external  environment. 

In  the  second  case,  the  party  will  be  a  decentral¬ 
ized  organization,  controlled  by  external  groups 
(external  organizations)  and/or  local  notables, 
with  a  poorly  developed  internal  administration 
system.  It  usually  will  be  unable  to  develop  aggres¬ 
sive  policies  toward  the  external  environment. 

Genetic  model  and  institutionalization  are 
related.  A  strong  institution  is  mainly  associated 
with  territorial  penetration  and  the  absence  of  an 
external  sponsor  (with  the  notable  exception  of 
the  communist  parties  in  the  20th  century).  Weak 
institutionalization  is  mainly  related  to  territorial 
diffusion  and  the  existence  of  external  sponsors. 
Finally,  the  presence  of  charismatic  leaders  as 
party  founders  is  rarely  associated  with  institu¬ 
tionalization  (the  routinization  of  charisma,  in 
Max  Weber’s  terms).  Only  a  few  charismatic  par¬ 
ties  become  institutions  (e.g.,  the  Gaullist  party  in 
the  French  Fifth  Republic).  The  majority  of  them 
disappear  with  the  end  of  the  human  and  political 
journey  of  the  leader. 

Physiognomy  of  Party  Organizations 

The  official  mission  comprises  the  ideological  goals, 
the  organizational  constitution,  and  the  power 


structure,  which  are  the  three  (related)  aspects  that 
define  the  physiognomy  of  party  organizations. 

The  official  mission  of  the  party,  its  manifest 
ideological  goals,  influences  both  its  organizational 
structure  and  its  culture  (and  its  relations  with  the 
environment  too).  Many  formal  and  informal  rules 
depend  on  the  features  of  the  official  mission.  But 
the  official  mission  is  also  too  vague  an  indicator  of 
the  characteristics  of  party  organizations.  Moreover, 
the  original  official  mission  is  usually  transformed 
during  the  process  of  institutionalization  and  after. 
Robert  Michels’s  (1911)  idea  of  “substitution  of 
goals”  may  be  exaggerated,  given  that  a  complete 
process  of  substitution  of  goals  in  a  party  is  a  rare 
occurrence.  But  usually,  in  the  course  of  time,  an 
“adaptation  of  goals”  takes  place:  The  goals  are 
adjusted  to  the  environmental  circumstances,  and 
as  a  consequence,  they  become  vaguer. 

Party  members  need  to  believe  in  those  goals, 
and  the  capability  of  mobilization  of  followers  by 
party  leaders  depends  on  their  capacity  to  demon¬ 
strate  themselves  as  zealous  defenders  of  the  ideo¬ 
logical  goals  (as  well  as  their  capacity  to  distribute 
or  promise  material  incentives  and  resources).  But 
the  role  of  the  official  mission  will  vary.  For  exam¬ 
ple,  when  parties  are  in  power,  there  is  less  need  of 
mobilizing  members  and  followers  and  the  official 
mission  becomes  less  important.  On  the  contrary, 
when  parties  are  in  the  opposition,  there  is  a 
greater  urgency  to  mobilize  people.  In  this  case,  the 
official  mission,  the  ideological  goals,  will  be 
emphatically  affirmed. 

The  second  aspect  is  the  organizational  consti¬ 
tution.  The  constitution  defines  the  rules  of  the 
game:  the  distribution  of  formal  authority  in  the 
party,  the  ways  of  coordination  among  the  official 
party  roles,  the  type  of  task  specialization,  and  the 
organizational  boundaries  (who  is  a  member  and 
who  is  not).  Sometimes,  students  of  party  organi¬ 
zations  reduce  the  organizational  analysis  of  par¬ 
ties  to  the  description  of  their  formal  constitutions. 
But  the  constitution  is  only  a  fragment  of  a  com¬ 
plex  organizational  whole. 

The  third  aspect  regards  the  power  structure.  In 
every  party,  there  is  a  dominant  coalition,  a  group 
of  leaders  who  control  the  organization.  The 
physiognomy  of  the  dominant  coalition  is  an 
essential  defining  feature  of  party  organizations.  In 
the  case  of  the  internally  legitimated  party,  the 
dominant  coalition  comprises  only  party  members. 
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In  the  case  of  externally  legitimated  parties,  it 
includes  the  leaders  of  the  external  sponsor  organi¬ 
zations:  For  example,  the  top  officials  of  the 
British  trade  unions  were,  for  a  long  time,  mem¬ 
bers  of  the  Labour  Party’s  dominant  coalition. 

Moreover,  the  dominant  coalition  leaders  can 
hold  positions  of  authority  in  the  party  (the  gen¬ 
eral  secretary,  the  members  of  the  party  national 
central  committee,  etc.),  or  they  can  be  parliamen¬ 
tarians.  Finally,  the  dominant  coalition,  in  some 
cases,  can  include  not  only  national  leaders  but 
also  regional  or  local  leaders  (e.g.,  the  secretaries 
of  some  important  local  federations). 

Dominant  coalitions  can  also  be  classified  on 
the  basis  of  their  levels  of  cohesion/division  and 
stability/instability.  Both  aspects  are  related  to  the 
power  competition  within  the  dominant  coalition. 
If  the  subgroups  of  the  dominant  coalition  are  not 
organized,  if  they  are  only  tendencies,  as  Richard 
Rose  put  it,  the  level  of  cohesion  of  the  dominant 
coalition  will  be  high.  If  the  subgroups  are  strongly 
organized  (factions),  the  level  of  cohesion  will  be 
low,  and  the  dominant  coalition  will  be  divided. 

The  level  of  stability/instability  refers  to  the 
capacity  of  members  and  subgroups  of  the  domi¬ 
nant  coalition  to  stipulate  durable  compromises 
among  them.  Therefore,  cohesive-stable  dominant 
coalitions  (e.g.,  the  communist  parties),  divided- 
stable  dominant  coalitions  (the  French  socialist 
party  of  the  time  of  Jean  Jaures  and  Leon  Blum, 
the  Christian  Democratic  Union  [CDUJ  under 
Konrad  Adenauer),  and  divided-instable  dominant 
coalitions  (the  Italian  Christian  Democracy,  1953- 
1993)  can  be  distinguished. 

Furthermore,  dominant  coalitions  can  be  oligar¬ 
chies  (cohesive  and  stable,  without  a  single  promi¬ 
nent  leader),  monocracies  (a  single  leader,  usually 
of  the  charismatic  type,  controlling  the  dominant 
coalition  and,  by  consequence,  the  party),  or 
poliarchies  (divided  and  instable,  usually  a  collec¬ 
tion  of  factions). 

Parties  and  Their  External  Environment 

Like  all  other  formal  organizations,  political  parties 
interact;  that  is,  they  exchange  vital  resources  with 
their  external  environment.  The  portions  of  the 
environment  that  are  relevant  for  the  party  are  its 
“task  environment”  or  domain.  First  of  all,  there  is 
a  special  portion  of  a  party’s  environment  that 


the  party  identifies  as  its  “hunting  ground,”  its 
privileged  classe  gardee,  or  reserved  territory;  it  is 
that  portion  comprising  the  electors  of  the  party 
and  from  which  the  party  draws  the  majority  of  its 
members.  The  party’s  official  mission  identifies  the 
domain.  Middle-class  parties,  manual  workers’ 
parties,  ethno-regional  parties,  and  confessional 
parties  are  all  denominations  that  make  reference 
to  those  portions  of  the  social  territory  that  the 
official  mission  identifies  as  the  privileged  classe 
gardee  of  the  organization.  Changes  and  transfor¬ 
mations  of  the  social  territory  usually  cause  changes 
and  transformations  in  the  party.  Alternatively,  the 
leaders  can  deliberately  manipulate  and  change  the 
original  official  mission  of  the  party  to  enlarge  or 
change  its  domain. 

The  traditional  sociology  of  political  parties  is 
usually  concentrated  on  these  aspects  of  the  rela¬ 
tions  between  parties  and  the  environment.  But  the 
identification  of  the  classe  gardee  does  not  entirely 
cover  the  topic.  We  should  examine  the  question 
of  the  relations  among  parties  and  environments 
from  a  different  perspective.  In  the  case  of  parties 
operating  in  democratic,  competitive  regimes,  it 
can  be  useful  to  divide  the  party  environment  into 
three  main  (interconnected)  arenas:  the  electoral, 
the  parliamentary,  and  the  interorganizational  (the 
set  of  relations  between  the  party  and  interest 
groups,  unions,  etc.). 

Different  levels  of  complexity  and  different  lev¬ 
els  of  stability/turbulence  can  be  identified  in  each 
arena.  On  complexity,  organization  theory  assumes 
that  there  are  isomorphic  pressures:  The  more 
complex  the  environment,  the  more  complex  the 
organization  becomes.  An  increase  in  the  complex¬ 
ity  of  the  environment  forces  the  organization  to 
increase  its  internal  specialization.  In  a  complex 
arena  (electoral,  parliamentary,  or  interorganiza¬ 
tional),  the  party  is  obliged  to  specialize  its  internal 
offices  in  order  to  face  the  different  external  pres¬ 
sures  and  threats. 

As  regards  stability/turbulence,  organization 
theory  predicts  (ceteris  paribus)  more  decentraliza¬ 
tion  in  a  stable  environment  and  more  central¬ 
ization  in  a  turbulent  environment.  In  a  stable, 
predictable  environment,  there  are  few  external  dan¬ 
gers:  Parties  adapt  to  the  environment,  the  different 
subsections  of  the  party  have  stable  exchanges  with 
different  portions  of  the  environment,  and  concen¬ 
tration  of  power  is  not  a  necessity.  In  a  turbulent 
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environment,  unpredictability  is  very  high,  and 
there  are  many  external  dangers:  The  organization 
must  concentrate  power  at  the  top  of  the  hierarchy 
to  survive. 

During  the  period  from  1920  to  1960,  the  elec¬ 
toral  arenas  of  Western  Europe  were  highly  stable. 
In  consequence,  party  organizations  too  were  sta¬ 
ble  or  changed  very  slowly.  By  the  late  1960s,  the 
picture  had  changed:  The  European  electoral  are¬ 
nas  became  more  volatile  and  turbulent,  and  party 
organizations  underwent  important  changes. 
Above  all,  concentration  of  power  at  the  top  level 
of  the  hierarchy  became  a  necessity  for  all  parties. 
For  instance,  the  so-called  personalization  of  party 
politics,  the  new  relations  between  party  leaders 
and  the  public,  was  not  only  brought  about  by  the 
new  role  of  the  mass  media  but  was  also  an  aspect 
of  power  concentration  processes  in  turbulent  elec¬ 
toral  environments.  The  theory  predicts  decadence 
(in  the  case  of  parties,  severe,  permanent,  electoral 
losses)  if  the  organization  fails  to  adapt  itself  to  the 
new  environmental  conditions. 

Organizational  Changes 

Every  organization  can  experience  two  different 
kinds  of  changes:  (1)  continuous,  small,  and  incre¬ 
mental  changes  that  do  not  modify  the  essential 
features  of  the  organization  (official  mission, 
organizational  constitution,  power  structure)  and 
(2)  rare  and  extraordinarily  big  transformations 
that  happen  suddenly,  deeply  modifying  those 
features.  The  punctuated  equilibrium  model  is  a 
stylized  representation  of  the  process. 

Major  transformations  of  political  parties  are 
rare  but  do  happen.  The  process  can  be  broken 
down  into  three  phases.  The  first  phase  is  the  rise 
of  an  organizational  crisis.  The  crisis  is  usually  the 
effect  of  environmental  pressures.  In  the  case  of 
parties,  a  severe  electoral  defeat  is  the  most  fre¬ 
quent  external  challenge  that  can  give  rise  to  the 
organizational  crisis.  The  second  phase  is  a  change 
in  the  composition  of  the  dominant  coalition:  Old 
leaders  are  discredited  and  removed,  and  new  lead¬ 
ers  enter  the  coalition.  There  is  a  more  or  less  brisk 
“circulation  of  elites.”  The  third  phase  sees  a 
simultaneous  change  in  the  organizational  consti¬ 
tution  and  the  official  mission.  The  (partially) 
changed  dominant  coalition  must  consolidate 
itself.  Usually,  it  will  introduce  change  in  the 


physiognomy  of  the  organizational  structure  and 
(partially)  modify  the  official  mission.  A  succes¬ 
sion  of  ends  (partial  substitution  of  the  old  goals 
with  new  ones)  is  a  consequence  of  the  change  in 
the  composition  of  the  dominant  coalition.  The 
final  effect  of  the  process  will  be  a  more  or  less 
deep  transformation  of  the  relations  between  the 
party  and  its  task  environments. 

Two  (related)  regularities  can  be  identified: 
( 1 )  the  stronger  the  environmental  pressure,  that  is, 
the  more  dangerous  the  external  threat,  the  more 
severe  is  the  organizational  crisis,  and  (2)  the  more 
severe  the  organizational  crisis  the  deeper  is  the 
change  in  the  composition  of  the  dominant  coali¬ 
tion  and  the  transformations  of  both  the  organiza¬ 
tional  structure  and  the  official  mission  as  well. 
This  ideal-typical,  highly  stylized  representation  of 
parties’  changes  may  be  assumed  as  a  quite  useful 
analytical  tool  for  empirically  observing  the  trans¬ 
formations  that  parties  sometimes  experiment  with. 

Typologies 

In  his  classic  Political  Parties  (1951),  Maurice 
Duverger  proposed  a  famous  classification  of 
party  organizations.  In  the  Western  historical 
experience,  he  identified  four  fundamental  types: 
(1)  the  cadre  party,  (2)  the  mass  party,  (3)  the  cell 
party,  and  (4)  the  militia  party.  The  first  two  types 
were  the  most  important  and  diffuse.  But 
Duverger’s  analysis  was  not  original.  It  followed 
the  classical  works  of  Moisey  Ostrogorski,  Robert 
Michels,  Max  Weber,  and  James  Bryce. 

The  cadre  party  is  the  traditional  bourgeois 
party:  a  loose  electoral  organization,  without  party 
discipline,  financed  by  notables  and  controlled  by 
the  parliamentary  elite.  The  mass  party  is  a  very 
strong  organization.  It  is  a  membership  party.  Its 
organizational  “inventions”  are  the  territorial  sec¬ 
tion,  the  membership  card,  the  party  bureaucracy, 
and  the  periodical  congresses  in  which  the  leaders 
are  officially  selected  and  the  political  strategy  is 
approved.  An  “inner  circle”  (the  general  secretary, 
the  party  headquarters)  controls  the  mass  party. 
Usually,  the  parliamentarians  are  dependent  on  the 
inner  circle.  The  mass  party  is  the  organization  that 
is  able  to  proselytize  among  the  popular  classes  of 
the  society:  manual  workers,  peasants,  and  artisans. 

Duverger’s  prophecy  is  well-known:  The  mass 
party  would  become  the  dominant  type  of  party  in 
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the  mature  Western  democracies.  Like  Michels 
40  years  earlier,  Duverger  was  influenced  by  the 
history  of  the  European  socialist  parties.  Sixteen 
years  later,  Otto  Kirchheimer  (1966)  reversed  the 
perspective.  A  new  form  of  party  was  emerging: 
the  catchall  party.  The  catchall  party  was  different 
from  the  mass  party  of  the  past.  Its  communicative 
style  was  pragmatic,  not  ideological,  and  its  link¬ 
ages  with  the  traditional  classe  gardee  (the  manual 
workers,  the  religious  voters)  were  declining.  The 
transformation  of  the  mass  party  into  a  catchall 
party  was  an  effect  of  the  social  and  political  trans¬ 
formations  of  European  societies:  the  economic 
development,  the  rise  of  mass  education  levels,  the 
new  role  of  the  mass  media,  and  so  on. 

Party  organizations  were  changed  too.  New 
types  of  professional  figures  slowly  took  over  the 
old  mass  party  bureaucracy:  mass  media  experts 
and  marketing  and  fund-raising  specialists,  among 
others.  The  traditional  role  of  the  membership,  so 
important  in  the  old  mass  party,  was  declining. 
From  an  organizational  viewpoint,  the  passage 
from  the  mass  party  to  the  catchall  party  has  been 
synthesized  as  the  transformation  of  the  bureau¬ 
cratic  party  into  the  professional  electoral  party 
(Panebianco,  1988). 

After  Duverger  and  Kirchheimer,  there  have 
been  many  attempts  to  identify  the  recent  transfor¬ 
mations  of  Western  parties.  The  cartel  party  model 
(Richard  Katz  &  Peter  Mair,  1995)  is  one  of  these. 
In  this  perspective,  the  most  important  change  is 
with  regard  to  the  “statization”  of  parties,  their 
new  symbiotic  relationship  with  state  agencies  and 
its  impact  on  the  traditional  party  organization. 
Some  empirical  analyses  confirm  that  cartelization 
is  one  of  the  possible  transformations  of  Western 
political  parties  (Klaus  Detterbeck,  2005). 

In  another  interpretation,  the  Western  political 
parties  are  becoming  franchise  systems:  A  central 
organization  provides  ideological  arguments  and 
material  services  to  a  lot  of  autonomous  subparty 
organizations.  The  franchise  model  implies  the  end 
of  the  traditional  internal  party  hierarchy. 
Stratarchies  are  everywhere  replacing  the  tradi¬ 
tional  oligarchies  (R.  Kenneth  Carty,  2004). 

All  these  attempts  to  analyze  contemporary 
trends  and  transformations  are  useful,  but  pru¬ 
dence  is  needed.  It  is  inaccurate  to  imagine  (like 
Duverger  or  Kirchheimer)  that  some  type  of  party 
organization  is  becoming  the  dominant  type  and 


that  all  the  existing  parties  will  imitate  that  type. 
On  the  contrary,  a  plurality  of  very  different  party 
organizations  always  coexists  in  democracies.  As 
we  have  seen,  parties  are  influenced  by  their  origi¬ 
nal  missions,  by  the  personality  and  roles  of  their 
founding  leaders,  and  by  the  crucial  organizational 
decisions  that  accompanied  their  birth  and  institu¬ 
tionalization.  After  the  institutionalization,  path 
dependency  processes  reduce  both  the  leaders’ 
menu  of  choices  and  the  probability  of  radical 
changes  in  the  organization.  Moreover,  political 
parties  are  born  in  different  historical  moments, 
and  all  organizations  are  strongly  influenced  by 
the  cultural  models  prevailing  in  those  moments. 

A  second  reason  for  prudence  has  to  do  with 
new  political  parties’  organizations  in  the  new 
democracies.  The  picture  is  very  differentiated, 
and  a  lot  of  empirical  and  comparative  work  is 
needed  to  understand  the  features  and  the  evolu¬ 
tions  of  these  new  political  organizations. 

In  the  case  of  Europe,  there  is  a  third  prudential 
consideration.  The  European  integration  process 
exerts  pressures  on  national  party  organizations, 
but  the  effect  is  still  largely  unknown. 

The  organizational  analysis  of  political  parties 
is  a  “structural”  kind  of  analysis.  The  assumption 
is  that  the  organizational  structure  influences  the 
behavior  of  actors  (leaders,  members,  and  follow¬ 
ers).  This  assumption  is  common  to  the  traditional 
organization  theory  and  to  the  most  recent  neo- 
institutional  theory.  In  any  case,  despite  its  correct¬ 
ness,  this  perspective  tells  only  a  part  of  the  story. 
One  must  also  take  into  consideration  the  freedom 
of  action  of  the  organizational  actors. 

An  organization  is  a  system  of  roles  equipped 
with  incentive  and  punishment  mechanisms  that 
influence  actors’  behavior.  It  is  also  an  arena  in 
which  actors  compete  for  power,  and  the  rules  of 
competition  depend  on  the  arena  characteristics. 
But  “to  influence”  is  not  “to  determine”:  In  real 
life,  different  actors  can  react  differently  to  the  same 
incentives.  The  organizational  analysis  is  a  useful 
starting  point,  but  “structures”  (including  organiza¬ 
tional  structures)  do  not  substitute  for  social  actions. 
A  theory  of  action  is  the  indispensable  complement 
of  any  analysis  of  a  party’s  organization. 

Angelo  Panebianco 
University  of  Bologna 
Bologna,  Italy 
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Party  System  Fragmentation 


A  party  system  is  fragmented  if  it  contains  more 
than  two  parties,  none  of  which  comes  close  to 
obtaining  an  absolute  majority  in  the  representa¬ 
tive  assembly.  Party  system  fragmentation  thus  has 
two  aspects:  the  number  of  parties  in  the  system 
and  their  relative  size.  A  pure  two-party  system 
fails  to  meet  both  of  these  defining  criteria.  A  sys¬ 
tem  with  many  parties  of  which  one  is  large 
enough  to  approach  an  absolute  majority  on  its 
own  also  cannot  be  considered  fragmented.  In 
empirical  terms,  fragmentation  in  practice  refers  to 


those  parties  that  gain  representation.  Parties  with¬ 
out  official  representation  are  often  difficult  to 
observe  and  document.  Moreover,  most  theories 
of  the  causes  and  consequences  of  party  system 
fragmentation  deal  with  parties  with  seats  in  a 
representative  body.  If  the  representative  assembly 
of  a  country  contains  many  fairly  small  parties  and 
no  party  approaching  majority  status,  the  party 
system  of  that  country  can  be  considered  frag¬ 
mented.  Several  measures  are  available  to  deter¬ 
mine  varying  degrees  of  fragmentation;  they  are 
strongly  intercorrelated  and  can  therefore  be  used 
interchangeably.  The  Laakso-Taagepera  Index  of 
the  Effective  Number  of  Parties  is  the  most  widely 
used  measure.  It  is  calculated  as  follows: 

N  = 

where  N  stands  for  the  effective  number  of  parties 
and  pt  denotes  the  fraction  of  the  seats  held  by 
each  party  i  in  the  assembly.  All  parties  are 
accorded  some  weight,  but  the  index  weights  the 
largest  parties  most.  Theoretically,  if  there  are  2 
parties  of  exactly  the  same  size,  the  effective  num¬ 
ber  of  parties  is  2.0.  Three  parties  of  equal  size 
render  the  index  value  3.0,  10  such  parties  10.0, 
and  so  on.  The  party  systems  of  the  United  States, 
Sweden,  and  Finland  can  be  used  as  empirical 
illustrations.  The  mean  value  of  the  Laakso- 
Taagepera  Index  for  the  U.S.  House  of  Rep¬ 
resentatives  since  1960  is  1.9.  For  Sweden,  with  a 
multiparty  system  where  the  Social  Democrats 
have  held  around  or  more  than  40%  of  the 
Riksdag  seats,  the  corresponding  figure  is  3.6.  In 
Finland,  no  party  normally  wins  more  than  a 
quarter  of  the  parliamentary  seats;  the  mean  effec¬ 
tive  number  of  parties  is  5.2.  The  three  cases 
represent  a  two-party  system,  a  moderately  frag¬ 
mented  multiparty  system,  and  a  highly  frag¬ 
mented  system,  respectively. 

The  numerical  aspects  of  party  systems  have 
always  been  an  important  factor  in  the  compara¬ 
tive  literature  on  political  parties.  Claims  that  sys¬ 
tems  with  many  parties  were  less  governable  than 
two-party  systems  were  presented  already  in  the 
19th  century.  As  the  interwar  years  saw  the  demise 
of  democracy  in  a  large  number  of  European 
states,  the  criticism  against  multiparty  government 
gained  in  prominence.  Nevertheless,  it  is  only  with 
the  emergence  of  comprehensive  comparative 
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databases  during  recent  decades  that  robust  empir¬ 
ical  generalizations  have  become  possible. 

The  dynamics  of  electoral  systems  go  a  long 
way  toward  explaining  why  some  countries  have 
fragmented  party  systems  while  others  do  not. 
Single-member  plurality  (SMP)  systems  effectively 
reduce  the  number  of  parties  that  gain  representa¬ 
tion,  while  proportional  (PR)  list  systems  are  con¬ 
ducive  to  fragmentation,  especially  when  electoral 
districts  are  large.  Among  democratic  countries 
with  SMP  systems,  the  mean  effective  number  of 
parties  since  1960  is  2.1,  while  the  corresponding 
figure  for  democracies  with  PR  list  systems  is  3.9. 
Countries  with  other  electoral  formulas,  including 
mixed-member  systems,  usually  attain  values 
between  these  extremes.  It  is  very  difficult  for  par¬ 
ties  other  than  the  two  largest  ones  to  win  seats  in 
SMP  systems,  while  proportional  systems  fre¬ 
quently  allow  even  quite  small  parties  to  gain  rep¬ 
resentation.  Voters’  awareness  of  this  mechanism 
reinforces  its  effect.  If  voters  deem  it  highly 
unlikely  that  a  vote  for  a  minor  party  will  result  in 
the  party’s  winning  a  seat — as  is  the  case  in  SMP 
systems — their  propensity  to  vote  for  it  becomes 
smaller.  By  the  same  token,  a  higher  probability  of 
a  small  party’s  winning  representation  in  PR  sys¬ 
tems  will  mean  that  voters  are  more  likely  to  stay 
with  their  preferred  party  even  if  it  is  a  minor  one. 

The  number  of  politically  relevant  social  cleav¬ 
ages  is  another  important  determinant  of  party 
system  fragmentation.  In  most  democracies,  politi¬ 
cal  parties  can  readily  be  placed  on  a  left-right  scale; 
this  scale  frequently  coincides  with  the  major  socio¬ 
economic  divisions  in  the  population.  These  divi¬ 
sions  may  give  rise  to,  for  example,  conservative, 
left-wing,  and  centrist  parties.  If,  however,  other 
cleavages  such  as  religion,  language,  or  region  cut 
across  the  left-right  dimension,  the  result  may  be  a 
considerably  higher  degree  of  party  system  frag¬ 
mentation.  The  effect  of  multiple  cleavages  is 
particularly  pronounced  if  the  electoral  system  per¬ 
mits  numerous  parties  to  gain  representation.  A 
good  example  of  this  is  Belgium,  where  the  division 
between  French  and  Flemish  speakers  cuts  right 
across  the  left-right  divide  and  where  the  PR  list 
system  presents  few  obstacles  to  minor  parties’  gain¬ 
ing  parliamentary  seats.  The  mean  effective  number 
of  parties  in  Belgium  since  1960  is  as  high  as  6.3. 

The  size  of  countries  and  other  political  units, 
whether  measured  by  population  or  by  geographic 


area,  has  an  independent  effect  on  party  system 
fragmentation  even  after  the  effects  of  electoral 
systems,  socioeconomic  diversification,  and  cul¬ 
tural  heterogeneity  have  been  controlled  for.  This 
rule  seems  to  apply  at  both  the  cross-national  and 
the  subnational  levels.  Large  size  is  related  to 
greater  social  distances,  functional  specialization, 
as  well  as  organizational  diversity  and  complexity. 
All  these  factors  are  conducive  to  greater  attitudi- 
nal  diversity,  which  in  turn  may  enhance  party 
system  fragmentation. 

Several  other  factors  have  been  proposed  as 
explanations  of  the  variation  in  party  system  frag¬ 
mentation.  It  has  been  suggested  that  bicameralism 
fosters  party  system  fragmentation.  The  timing  of 
elections  at  various  governmental  levels  may  also 
have  an  effect.  If  parliamentary,  presidential, 
regional,  and  local  elections  are  held  at  different 
points  in  time,  the  variety  of  issues  and  interests  at 
stake  in  these  elections  may  foster  partly  separate 
sets  of  political  parties.  Moreover,  it  has  been  pro¬ 
posed  that  strong  presidential  executives  may 
reduce  the  incentive  to  maintain  large,  unified  par¬ 
ties.  On  all  these  points,  the  empirical  evidence 
cannot  be  considered  conclusive  yet. 

The  effects  of  party  system  fragmentation  are 
particularly  related  to  government  formation  and 
stability.  While  the  winning  party  in  a  two-party 
system  can  form  the  postelection  government 
based  on  a  majority  of  its  own,  this  alternative  is 
not  available  in  fragmented  party  systems.  Instead, 
the  choice  must  be  made  between  a  coalition  of 
several  parties  or  a  minority  cabinet.  The  incidence 
of  both  minority  governments  and  coalition  cabi¬ 
nets  is  positively  associated  with  the  degree  of 
party  system  fragmentation;  the  latter  association 
is  particularly  strong.  Thus,  although  ideological 
and  political  factors  naturally  influence  the  way 
parties  cooperate  and  interact  with  each  other,  the 
numerical  criterion  is  an  important  general  deter¬ 
minant  of  government  type. 

The  internal  functioning  and  prospects  of  a  gov¬ 
ernment  are  strongly  affected  by  its  type  and  status 
and  are  thus  linked  to  party  system  fragmentation. 
One-party  majorities  are,  ceteris  paribus,  the 
strongest  government  type.  They  have  the  neces¬ 
sary  strength  to  get  their  agreed-on  policies  enacted 
and  implemented;  at  the  same  time,  they  are  not  as 
frequently  plagued  by  major  internal  differences 
like  other  types  of  governments.  As  fragmented 
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systems  present  the  choice  between  minority  status 
or  coalition  government  (or  both),  the  combina¬ 
tion  of  strength  and  political  unity  is  much  rarer  in 
these  systems.  While  internally  unified,  one-party 
minorities  are  at  all  times  at  the  mercy  of  the 
political  majority  outside  the  government.  Coali¬ 
tion  cabinets  may  find  it  difficult  to  reconcile  the 
differences  between  the  government  parties,  which 
frequently  results  in  government  crises.  Conse¬ 
quently,  cabinets  in  countries  with  fragmented 
party  systems  tend  on  the  whole  to  be  more  short¬ 
lived  than  cabinets  in  countries  where  one-party 
majorities  are  the  rule. 

From  the  point  of  view  of  the  citizen,  party  sys¬ 
tem  fragmentation  creates  a  problem  when  it 
comes  to  holding  parties  and  politicians  account¬ 
able  for  their  accomplishments  and  failures.  When 
governments  are  coalitions  of  several  parties,  it  is 
not  easy  for  voters  to  know  whom  to  punish  or 
reward  for  government  policy.  All  government 
parties  tend  to  claim  credit  for  popular  policies, 
while  failures  or  unpopular  decisions  are  claimed 
to  be  the  fault  of  their  partners  in  government.  In 
fact,  alternative  coalitions  are  not  always  available 
in  fragmented  systems.  If  attaining  majority  status 
requires  the  participation  of  several  parties,  the 
chances  are  that  at  least  some  of  the  parties  from 
the  preceding  coalition  will  be  present  in  the  next 
one  as  well.  On  the  other  hand,  fragmented  party 
systems  present  the  voters  with  a  wide  range  of 
partisan  choices  in  elections.  In  these  systems,  vot¬ 
ers  can  support  the  parties  that  they  genuinely 
prefer  without  risking  a  large  number  of  wasted 
votes — that  is,  votes  for  parties  that  do  not  gain 
any  seats  at  all.  In  terms  of  democratic  qualities, 
fragmented  systems  tend  to  perform  poorly  on 
accountability  but  well  on  representativeness. 

Due  to  their  tendency  toward  cabinet  crises  and 
short-lived  governments,  fragmented  systems  are 
often  claimed  to  be  vulnerable  to  pressures  from 
nondemocratic  political  forces.  Besides  the  feeble¬ 
ness  of  governments  as  such,  the  fact  that  anti¬ 
system  parties  easily  gain  representation  in 
fragmented  systems  is  sometimes  mentioned  as  a 
risk  from  the  point  of  view  of  democratic  viability. 
In  connection  with  the  economic  and  political  cri¬ 
sis  between  the  World  Wars,  party  system  frag¬ 
mentation  was  among  those  factors  that  made  it 
more  difficult  to  sustain  democracy  in  the  face  of 
pressure  from  extreme  political  movements.  To  be 


sure,  examples  of  long-standing  democratic  stabil¬ 
ity  despite  a  high  degree  of  party  system  fragmen¬ 
tation  are  not  difficult  to  find:  Belgium,  Denmark, 
Finland,  and  the  Netherlands.  Still,  countries 
where  the  democratic  form  of  government  is  not 
yet  entirely  consolidated  would  be  well  advised  to 
design  their  electoral  institutions  so  as  to  discour¬ 
age  excessive  party  fragmentation,  especially  if 
there  are  numerous  deeply  entrenched  social  and 
cultural  cleavages  to  be  overcome. 

Lauri  Karvonen 
Abo  Akademi 
Abo,  Finland 
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Party  Systems 


Approaches  to  the  study  of  party  systems  are  mul¬ 
tiple,  as  are  their  definitions.  In  its  simplest  form, 
the  party  system  is  conceived  of  as  a  set  of  pat¬ 
terned  relationships  between  political  parties  com¬ 
peting  for  power  in  a  given  political  system.  Such  a 
notion  assumes  the  existence  of  rules,  norms,  and 
regularities  in  party  interactions,  concerning  mainly 
coalition-building  efforts  and  electoral  competition. 
This  implies  also  that  a  party  system  is  composed, 
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as  any  other  system,  of  distinguishable  parts  and 
the  empirically  testable  quality  of  its  “systemness.” 
Below,  some  major  features  of  such  systems,  for 
example,  the  kinds  and  numbers  of  parties  involved, 
their  evolution  and  organization  over  time,  their 
social  bases  and  dynamic  processes,  and  their  rela¬ 
tionship  to  institutional  aspects  of  electoral  systems 
and  government  formation,  are  discussed. 

An  ambitious  understanding  of  a  system  assumes 
that  it  displays  features  that  do  not  belong  to  a 
single  entity,  that  is,  to  one  element  alone  of  the 
system,  and  that  the  system  exists  only  if  it  hinges 
on  the  patterned  relationships  of  its  component 
parts,  hence  creating  systemic  boundaries,  clearly 
indicating  what  belongs  to  it  and  what  is  located 
outside  it.  A  system  is  always  composed  of  inter¬ 
acting  participant  units,  in  which  the  action  of 
each  and  every  participant  affects  the  actions  of  all 
other  participants.  A  systemic  approach  reveals 
how  everything  at  the  level  of  the  system  and  its 
structural  properties  differs  from  the  level  of  the 
interacting  units.  Systemic  traits  structure  the 
interactions  between  parties  and  the  games  that 
parties  play.  These  interactions  hinge  on  party 
resources,  preferences,  and  the  constraints  they 
face.  Moreover,  this  implies  that  the  interdepen¬ 
dence  of  the  units  or  their  attributes  is  ordered,  in 
the  sense  that  developments  are  predictable  and 
that  not  just  anything  can  happen.  This  order 
tends  toward  self-maintenance,  which  supports  the 
delineation  and  freezing  of  the  boundaries  of  the 
system  within  which  patterned  relationships  take 
place.  Consequently,  parties  create  a  system  only 
when  their  component  parts  interact  in  a  patterned 
systematic  way.  The  system  works  when  parties 
within  the  system  are  dependent  on  each  other  and 
each  party,  as  Giovanni  Sartori’s  (1976)  classic 
work  puts  it,  “is  a  function  (in  the  mathematical 
sense)  of  the  other  parties  and  reacts  competitively 
or  otherwise,  to  the  other  parties”  (p.  44).  In  short, 
a  party  system  is  a  system  of  interactions  resulting 
from  interparty  competition. 

Analyses  of  parties  have  been  made  prior  to 
deliberations  on  party  systems.  Parties  as  political 
institutions  began  to  draw  the  attention  of  the  social 
sciences  from  the  second  half  of  the  19th  century, 
whereas  interest  in  party  systems  increased  in  the 
second  half  of  the  20th  century.  Without  an  under¬ 
standing  of  the  ontology  of  parties,  the  systemic 
features  of  their  interactions  cannot  be  depicted. 


Three  aspects  of  party  systems  should  be  men¬ 
tioned.  First,  behind  almost  every  classification  of  a 
party  system  is  the  idea  that  they  are  structures  of 
representation.  Second,  an  equally  frequent  pro¬ 
posal  submits  that  they  are  institutionalized  chan¬ 
nels  of  articulation.  Finally,  they  are  conceived  as 
means  of  governability.  In  other  words,  the  three 
most  important  functions  are  (1)  to  recruit  and 
present  candidates  for  public  office  who  are  likely 
to  be  accepted  by  voters;  (2)  to  offer  distinguishable 
policy  packages,  programmatic  and  ideological 
alternatives  that  take  into  account  the  preferences 
and  values  of  citizens;  and  (3)  to  implement  policies 
effectively  and  contribute  to  the  overall  stability  of 
the  system.  The  composite  units  of  party  systems 
are  the  most  influential  manifestation  of  popular 
organizations  that  articulate  and  press  demands 
and  deliver  support  to  those  who  govern  in  contem¬ 
porary  democracies.  These  diverse  functions  offer 
challenges  as  often  they  are  in  conflict  with  one 
another.  Achieving  quality  representation  calls  for 
a  large  number  of  parties  satisfying  the  aspirations 
of  even  small  social  groups  with  clearly  defined  nar¬ 
row  issue  profiles  and  specific  aspirations.  The 
more  parties  there  are,  the  better  the  policy  fit 
between  groups  of  voters  and  their  institutional 
representatives.  At  the  same  time,  such  a  party  sys¬ 
tem  contributes  to  polarization,  divisiveness,  and 
ultimately  to  poor  governability.  If,  in  turn,  we 
wish  to  achieve  effective  governability,  we  should 
opt  for  one-party  dominance,  with  a  consensus- 
driven,  pragmatic  party  oriented  toward  the  mobi¬ 
lization  of  resources  to  satisfy  the  needs  of  the 
majority.  Such  a  solution,  with  a  single-party 
monopoly,  would  satisfy  the  needs  of  political 
effectiveness  but  at  the  same  time  violate  other 
important  democratic  values:  plurality,  choice,  and 
open  competition.  Consequently,  in  reality,  party 
systems  ought  to  compromise  on  maximizing  each 
of  their  basic  functions  and  adopt  more  moderate, 
optimal  solutions  guaranteeing  the  accomplishment 
of  different  goals. 

Number  and  Size  of  Parties 

Descriptions  of  party  systems  rely  on  their  widely 
accepted,  numerous  attributes.  One  finds  a  rich 
array  of  proposals  concerning  these  attributes  and 
their  relevance  and  importance  for  party  systems, 
depicting  at  the  same  time  both  the  essence  and  the 
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dynamics  of  a  given  party  system.  The  most  popu¬ 
lar  describe  a  party  system  with  reference  to  the 
number  and  the  size  of  parties  considered  relevant. 
Which  parties  should  be  considered  relevant  and / 
or  successful  enough  for  inclusion  into  the  system 
is  the  single  most  important  question  students  of 
party  systems  seek  to  answer.  This  is  important 
because  it  has  a  direct  effect  on  other  significant 
systemic  features,  such  as  the  relative  size  of  par¬ 
ties  and  consequently  the  dominant  logic  and 
mechanisms  of  competition,  the  nature  and  dura¬ 
bility  of  cooperation  and  alliances,  the  degree  of 
polarization  of  the  system,  its  center  of  ideological 
gravity,  its  stability  and  thus  its  predictability,  and 
the  way  social  strata  and  important  social  groups 
are  represented  in  the  overall  political  system. 

Most  scholars  distinguish  between  one-party, 
two-party,  and  multiparty  systems.  The  first  type 
is  questionable;  for  many  it  is  an  oxymoron, 
although  scores  of  political  systems  of  the  world 
are  still  based  on  the  existence  of  a  single  party, 
because  other  parties  and  representative  organiza¬ 
tions  are  legally  forbidden.  There  exist,  however, 
systems,  in  which  one-party  hegemony  over  a  pro¬ 
longed  period  of  time  is  the  rule,  although  legally 
opposition  parties  are  not  banned.  Their  perma¬ 
nent  marginalization  is  determined  by  a  combina¬ 
tion  of  discriminatory  practices,  the  harassment  of 
citizens,  and  clientelistic  relationships.  Furthermore, 
several  countries  with  clearly  democratic  creden¬ 
tials  (e.g.,  Japan  and  Sweden)  and  a  multiparty 
system  design  in  reality  experience  a  system  in 
which  one  party  dominates  for  a  long  period  of 
time,  even  though  the  functioning  of  opposition 
parties  is  not  constrained  legally  or  in  practice. 

A  fundamental  distinction  was  made  in  the  mid- 
20th  century  between  two-party  and  multiparty 
systems,  their  causes,  and  consequences.  The 
major  cause  was  seen  in  the  respective  electoral 
laws:  Simple  plurality  rules  (first-past-the-post) 
applied  in  single-member  districts  tend  to  produce 
two-party  systems,  while  two-round  majoritarian 
and  proportional  rules  favor  the  creation  of  multi¬ 
party  constellations.  The  consequences,  it  was 
assumed,  were  essential  as  well:  A  two-party  sys¬ 
tem  was  believed  to  create  a  stable  political  system, 
with  moderate  centripetal  competition,  based  on 
clarity  of  responsibility  and  accurate  attribution  of 
accountability.  Multiparty  systems,  by  contrast, 
were  believed  to  enhance  extremism,  centrifugal 


competition,  limited  alternation  of  governing 
parties,  unclear  accountability  due  to  complex 
coalition  formation  procedures,  and  vague  respon¬ 
sibility  for  the  policies  implemented. 

This  simple  numerical  criterion  used  in  distin¬ 
guishing  party  systems  faces  many  problems,  the 
crucial  one  being  how  parties  should  be  counted. 
It  is,  and  always  has  been,  obvious  that  parties 
cannot  be  treated  equally,  mainly  because  of  their 
divergent  electoral  support,  legislative  strength, 
and  potential  for  entering  coalitions,  and  the  pecu¬ 
liarities  of  their  social  following.  Scholars  dealing 
with  this  problem  are  aware  of  the  difficulty  in 
finding  adequate  criteria  by  which  to  include  or 
exclude  parties  or,  alternatively,  assign  them  a 
proper  “weight.”  In  the  mid-1970s,  a  solution 
seemed  to  be  found;  Giovanni  Sartori  offered  the 
criterion  of  relevance  of  parties.  To  be  relevant,  a 
party  had  to  disclose  its  coalition  and/or  blackmail 
potential.  The  first  feature,  the  coalition  potential, 
depends  on  whether  a  party  is  attractive  enough, 
because  of  its  size  or  a  unique,  pivotal  position  in 
the  space  of  political  competition,  to  effectively 
join  governmental  coalitions  and  share  executive 
responsibility.  The  second,  party  blackmail  poten¬ 
tial,  is  less  obvious  as  it  refers  to  a  specific  factor 
that  depends  on  our  interpretations:  A  party  exerts 
blackmail  potential  if  it  can  influence  the  behavior 
of  other  systemic  and  relevant  parties,  despite  the 
fact  that  it  cannot  itself  participate  in  a  coalition 
government.  This  applies  to  systems  in  which  size¬ 
able  parties  are  excluded  from  mainstream  politics 
by  the  other  parties  because  of  alleged  antisystemic 
features  or  traits  pointing  to  their  radicalism  and 
destabilizing  potential.  Good  examples  of  such 
parties  are  the  former  Italian  Communists  or  the 
contemporary  Czech/Moravian  Communist  Party 
(Komunisticka  Strana  Cech  a  Moravy,  KSCM).  In 
both  instances,  the  parties  enjoyed  substantial, 
double-digit  percentages  of  social  support  but 
were  never  allowed  to  participate  in  any  executive 
responsibility  at  the  national  level. 

Irrespective  of  how  important  the  relevance  cri¬ 
terion  is,  Sartori’s  proposal  still  focuses  on  the 
number  of  parties  and  their  respective  ideological 
and  programmatic  distance  as  the  main  features  of 
a  party  system.  Combining  these  two  criteria,  the 
proposal  allows  us  to  distinguish  between,  what  he 
calls,  moderate  (limited)  pluralism  and  polarized 
(extreme)  pluralism.  In  the  first  instance,  a  party 
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system  consists  usually  of  three  to  five  parties  and 
reveals  relatively  little  ideological  distance  between 
them.  In  the  second  case  (of  polarized  pluralism), 
the  party  system  is  usually  composed  of  six  or 
more  parties  and  manifests  significant  ideological 
distance  between  the  parties.  The  clear  virtue  of 
this  proposal  is  its  dynamic  nature.  The  variables 
used  interact  with  each  other,  allowing  one  to  pre¬ 
dict  more  or  less  accurately  the  development  of  a 
system  in  practice.  For  instance,  moderate  plural¬ 
ism  not  only  reveals  a  smaller  number  of  relevant 
parties  and  lesser  ideological  distance,  but  it  is  also 
very  likely  to  develop  a  centripetal  direction  of 
systemic  competition  between  two  clearly  distin¬ 
guishable  blocs,  whereas  polarized  pluralism  is 
most  likely  to  develop  centrifugal  competition 
with  various  opposition  parties,  leading  to  their 
irresponsible  behavior,  poor  accountability  mecha¬ 
nisms,  and  destabilizing  effects. 

Policies,  Programs,  Ideological  Divisions 

Party  systems  are  frequently  depicted  using  spatial 
metaphors,  indicative  of  policy  or  ideological 
dimensions  and  divisions:  Among  them  the  left- 
right  dimension  is  the  most  popular  one.  It  is  pre¬ 
cisely  this  aspect  of  party  systems  that  distinguishes 
one  from  another:  the  substance  of  policy  and  the 
number  of  salient  dimensions  that  need  to  be  taken 
into  account  in  understanding  and  interpreting  the 
logic  of  party  system  structure,  development,  and 
change. 

Many  scholars  define  the  party  system  not  only 
by  referring  to  the  number  of  parties  and  their  pat¬ 
terned  relationship  but  also  by  indicating  their 
belonging  to,  or  leaning  toward,  a  particular  party 
family.  The  latter,  in  turn,  is  defined  as  a  group  of 
parties  in  different  countries  that  have  similar  ide¬ 
ologies  and  party  programs.  Each  country  has  a 
unique  party  system:  a  unique  combination  of  par¬ 
ties,  ideological  and  programmatic  profiles,  size  of 
electoral  support,  and  coalitions.  Nevertheless, 
certain  patterns  across  countries  are  discernable. 

The  description  of  party  systems  from  this  point 
of  view  uses  the  language  of  social  divisions , 
dimensions  of  competition,  political  divides,  and 
cleavages.  The  terminological  universe  is  complex 
and  blurred  by  the  inconsistent  way  scholars  use 
these  notions.  At  this  point,  let  us  reiterate  that 
indeed  society  is  differentiated  by  many  factors 


and  that  these  distinctions  are  more  or  less  salient 
and  durable  and/or  conflicting.  If  in  addition  to 
this  salience  and  durability,  they  are  politicized, 
one  can  speak  of  cleavages.  Cleavages  are  a  rela¬ 
tively  stable  product  of  social  conflicts,  but  not  all 
conflicts  generate  cleavages;  the  ones  that  do  are 
intense  and  prolonged.  For  cleavages  to  exert 
structuring  power  over  political  developments, 
party  formations,  their  stability,  and  patterned 
interactions,  political  camps  have  to  be  established 
on  the  basis  of  strong  identifications  and  a  vision¬ 
ary  leadership  that  is  able  both  to  attract  public 
attention  with  the  appeals  of  their  manifesto  and 
to  maintain  the  effective  functioning  of  the  party 
organization. 

Only  certain  social  divisions  become  politicized, 
where  parties  represent  opposite  sides.  Parties  con¬ 
centrate  on  divisions  and  policy  areas  that  show 
electoral  volatility,  that  is,  where  voters,  for  differ¬ 
ent  reasons,  are  responsive  to  party  appeals.  Other 
divides  are,  however,  frozen;  that  is,  voters  are 
entrapped  within  a  social  relationship  where  there 
is  no  room  for  individual  rationality  and  free 
choice.  In  such  instances,  cleavage  mechanisms, 
group  membership  ties,  and  partisanship  deter¬ 
mine  party  support. 

One  of  the  crucial,  though  still  unresolved, 
issues  is  the  relationship  between  the  number  of 
parties  and  the  number  of  societal  divides  or  cleav¬ 
ages.  In  principle,  the  higher  the  number  of  salient 
divisions,  the  more  room  there  is  for  the  existence 
of  divergent  parties.  This  relationship,  however,  is 
far  from  universal;  its  fluidity  hinges  on  the  extent 
to  which  these  divisions  overlap  and  on  the  ability 
of  parties  to  simplify  the  multidimensionality  of 
the  space  of  competition  by  reducing  its  number  to 
two  or  three  of  the  most  salient  ones.  The  limited 
cognitive  abilities  of  citizens  to  process  complex 
information  create  the  necessity  for  such  simplifi¬ 
cations,  and  only  those  parties  that  effectively 
reduce  the  number  of  articulated  policy  alterna¬ 
tives  enjoy  an  electoral  advantage.  Policy  simplifi¬ 
cation  arises  also  because  of  certain  macrosocietal 
processes,  that  is,  the  growing  middle  class  and/or 
redistributive  successes  of  the  welfare  state,  which 
allow  for  the  simpler,  one-dimensional  appeal  of  a 
party  manifesto. 

Another  essential  distinction  for  understanding 
the  different  modes  of  party  competition  is  the 
one  between  valence  and  positional  issues.  The 
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former  concerns  universally  prized  values  or 
goods  that  most  politicians  and  citizens  agree  on. 
The  latter  assumes  clear  differences  between 
groups  of  people  in  policy  stances  and  value  ori¬ 
entations.  Due  to  this,  parties  may  offer  divergent 
policies  concerning  some  substantive  issues,  for 
example,  taxes,  health  services,  and  welfare  bene¬ 
fits.  The  relationship  of  the  dominance  of  valence 
or  positional  modes  of  political  competition  with 
quality  of  representation  and  accountability  as 
well  as  with  clientelistic  or  programmatic  politi¬ 
cian-voter  linkages  is  still  vague  and  underscruti¬ 
nized.  Nevertheless,  some  claim  that  “candidate 
appeals  play  out  in  most  instances  into  valence 
competition  and  only  rarely  as  positional  competi¬ 
tion.  Clientelistic  accountability  works  mostly  as 
valence  competition  among  parties”  (Herbert 
Kitschelt,  2009,  p.  623). 

Party  System  Institutionalization 

The  well-known  saying  “No  parties,  no  democ¬ 
racy”  has  strong  empirical  support.  So  far,  no 
example  of  a  working  modern  democracy  without 
parties  exists.  Contemporary  global  developments, 
since  new  Third  Wave  democracies  in  large  num¬ 
bers  increased  the  family  of  democratic  polities, 
call  for  more  scrutiny  and  clear-cut  indicators  of 
what  counts  as  a  consolidated  system  of  political 
parties.  One  of  the  most  widely  accepted  proposals 
was  introduced  in  the  mid-1990s  by  scholars  com¬ 
paring  the  development  of  Latin  American  polities. 
Currently,  the  idea  is  widely  shared  by  scholars 
focusing  on  other  newly  democratized  countries 
and  their  inchoate  party  systems.  At  the  most 
abstract  level,  institutionalization  is  a  process  by 
which  organizations  and  procedures  acquire  value, 
stability,  and  consequently  predictability.  There 
are  several  major  features  of  an  institutionalized 
party  system: 

1.  stability  in  interparty  competition, 

2.  the  existence  of  parties  that  have  stable  roots  in 
society, 

3.  broad  acceptance  of  elections  and  parties  as 
legitimate  institutions  deciding  who  governs, 
and 

4.  the  existence  of  parties  as  organizations 
governed  by  stable  rules  and  structures. 


A  more  detailed  portrait  of  an  institutionalized 
party  system  ought  to  emphasize  the  fact  that  the 
stability  of  interparty  competition  signifies  relative 
stability  and  does  not  amount  to  a  stagnant,  frozen 
relationship.  The  stable  roots  of  parties  in  society 
imply  that  some  parties  enjoy  overproportional 
support  from  certain  social  strata,  the  labor  mar¬ 
ket,  and  social  or  ethnic  groups  and  that  this 
support  is  stable  from  election  to  election.  Parties 
differ  in  the  extent  to  which  voters  identify  with 
them  as  well  as  in  the  quality  of  the  linkage 
between  citizens  and  parties.  Apart  from  this,  par¬ 
ties  still  have  to  manifest  firm  and  enduring  ideo¬ 
logical,  programmatic,  or  policy  positions  for  citi¬ 
zens  to  understand  what  they  stand  for.  The  broad 
acceptance  of  elections  and  the  decisions  taken  by 
parties  after  the  election  (concerning,  e.g.,  the  for¬ 
mation  of  coalitions)  as  the  only  legitimate  means 
of  taking  responsibility  for  governing  and  con¬ 
trolled  access  to  power  concerns  mainly  political 
elites  and  their  attitudes  and  preferences.  Finally, 
the  stability  of  rules  and  structures  points  to  the 
importance  of  the  autonomy  of  parties,  both  from 
populist  leaders  and  from  other  organizations  or 
powerful  groups,  even  those  that  might  have  his¬ 
torically  been  instrumental  in  their  creation  (e.g., 
the  churches,  trade  unions,  and  businesses).  Parties 
also  have  to  be  independent  financially  and  to 
operate  on  the  basis  of  highly  routinized  norms 
and  procedures. 

In  short,  this  approach  emphasizes  party  inter¬ 
actions  that  reveal  stability,  based  on  a  well- 
developed  connection  between  politicians  and 
their  voters;  it  also  considers  electoral  volatility  to 
be  a  fundamental  indicator  of  the  systemic  nature 
of  the  party  system. 

There  is,  however,  another  approach  to  the 
structuring  of  the  party  system  that  concentrates 
on  its  most  important  attribute,  namely,  the  com¬ 
petition  for  power.  It  comprises  the  core  element 
and  the  main  criterion  for  evaluating  the  mode  of 
patterns  of  competition,  that  is,  its  open  or  closed 
nature.  Whether  a  party  system  reveals  stable  and 
predictable  party  competition  depends  on  several 
easily  observable,  empirical  factors.  Alternation  in 
government  is  one,  and  it  takes  three  forms: 
(1)  wholesale  alternation,  when  a  governmental 
party  is  totally  replaced  by  an  opposition;  (2)  par¬ 
tial  alternation,  when  the  newly  established  gov¬ 
ernment  includes  at  least  one  party  of  the  outgoing 
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government;  and  (3)  nonalternation ,  when  there  is 
no  change  at  all.  The  second  factor  is  familiarity 
with  government  or  innovation.  This  could  indi¬ 
cate  the  degree  to  which  parties  tend  to  be  in 
government  in  the  same  configurations,  which 
signifies  familiarity,  or  their  combinations  are  new 
every  time,  which  indicates  innovation.  Access  to 
government  is  the  third  crucial  factor;  it  focuses  on 
whether  all  relevant  parties  in  the  system,  over  a 
long  time  period,  had  access  to  executive  offices  or 
whether  there  are  some  parties  permanently 
excluded  from  power.  It  indicates  whether  access 
to  government  is  widely  or  narrowly  distributed. 
Based  on  these  criteria,  we  arrive  at  two  funda¬ 
mental  types  of  party  competition.  The  first  is  the 
open  structure,  where  alternations  of  governments 
take  place  and  no  stable  governmental  coalitions 
exist,  accompanied  by  a  highly  dispersed  and  easy 
access  to  governmental  positions  for  all  or  most 
relevant  parties.  The  second  type  is  the  closed 
structure,  which  is  in  place  when  the  reverse  pat¬ 
tern  occurs — that  is,  either  there  are  no  alterna¬ 
tions  or  durable  coalitions  remain  in  power,  with 
some  relevant  parties  being  permanently  excluded. 

Party  System  Change 

Analyzing  system  change  calls  for  terminological 
precision.  The  first  problem  concerns  the  distinc¬ 
tion  between  changes  within  the  system  as  opposed 
to  a  change  of  the  system.  The  second  problem, 
related  to  the  first,  pertains  to  the  scope  and  level  of 
change;  in  this  regard,  four  abstract  manifestations 
can  be  distinguished:  (1)  incidental  swings,  (2)  lim¬ 
ited  change,  (3)  general  change,  and  (4)  alternation 
of  the  system.  Incidental  swings  are  usually  tempo¬ 
rary  distortions  in  the  patterned  way  a  party  system 
operates;  this  might  have  to  do  with  the  occurrence 
of  a  new  (small)  party,  a  change  in  the  polarization 
of  the  system,  the  appearance  of  new  salient  issues, 
and  similar  factors.  The  particular  trait  of  such  a 
change  is  that  it  is  minor  in  its  effects;  it  does  not 
alter  the  traditional  logic  of  party  competition  and 
coalition  formation.  These  swings  are  indeed  inci¬ 
dental  and  short-lived,  having  a  negligible  effect  on 
the  main  actors  of  the  system. 

Limited  change  is  prolonged  or  even  permanent; 
the  change  is  nevertheless  restricted  either  to  one 
area  or  confined  to  the  emergence  of  a  party  that 
replaces  another  one.  Sometimes  a  party  simply 


disappears;  nevertheless,  a  new  one  is  created  simul¬ 
taneously,  which  serves  as  its  functional  equivalent. 
Limited  change  might  also  mean  a  modification  in 
the  dynamic  features  of  the  system,  for  instance,  an 
increase  in  its  net,  gross,  or  within-block  volatility. 
Or  the  system  may  become  more  polarized. 
Nevertheless,  all  these  “limited”  changes  do  not 
alter  the  previous  logic  and  stability  of  the  system  as 
a  whole;  it  still  falls  into  the  same  category. 

General  change  is  more  serious  and  relates  to 
several  aspects,  that  is,  the  fact  that  changes  are 
multifaceted,  deep,  and  prolonged  and  that  they 
concern  salient  features  of  the  system.  Moreover, 
these  changes  indicate  the  destabilization  of  the  sys¬ 
tem  and  problems  in  achieving  a  (new)  equilibrium. 
Such  a  change  can  lead  to  a  significant  shift  in  the 
relative  strength  of  parties,  the  shape  of  coalitions, 
their  fragmentation,  and  their  polarization.  Overall, 
such  general  changes  raise  the  question  of  whether 
the  system  can  still  be  categorized  as  before.  Finally, 
the  alternation  of  the  system  signifies  a  dramatic 
change  in  most  of  its  aspects:  the  party  composition; 
its  strength,  alliances,  and  leadership;  and  all  other 
mechanisms  by  which  it  was  governed  before  the 
alternation  occurred.  Transformations  of  this  sort 
may  also  concern  the  nature  of  parties  themselves 
(mass  to  catchall  or  catchall  to  cartel).  Total  alter¬ 
nations  happen  rarely  and  are  usually  related  to 
extraordinary  conditions  or  events  (wars,  revolu¬ 
tions,  a  new  constitution,  new  electoral  laws,  a 
severe  economic  crisis,  the  forming  of  an  interna¬ 
tional  alliance,  and  fundamental  changes  in  social 
structure  or  similar  occurrences). 

Consequently,  party  system  change  may  take  a 
variety  of  forms,  from  marginal  change  to  the 
alternation  of  essential  features,  necessitating  its 
reclassification.  In  other  words,  change  of  the 
original  nucleus  of  the  party  system  can  be  seen  as 
party  system  change,  while  marginal  changes  (usu¬ 
ally)  are  not.  Empirically,  such  a  change  signifies 
alternation  of  the  direction  of  competition,  major 
changes  in  structured  alliances  and  cooperation,  or 
the  occurrence  of  a  new  governing  formula. 

Party  systems  also  change  their  format  due  to 
long-term  social  and  economic  developments.  The 
processes  of  dealignment  or  realignment  of  party 
affiliations  result  from  structural  demographic 
changes,  accompanied  by  culture  shifts.  Previously 
excluded  voters,  new  generations,  and  newly  cre¬ 
ated  sectors  of  the  labor  force  come,  at  some  point, 
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to  decide  on  public  policies  and  general  values  as 
well  as  lifestyle  preferences.  These  processes  affect 
parties;  their  programs,  policy  stances,  and  pro¬ 
files;  the  nature  of  their  leadership;  and  ultimately 
the  nature  and  format  of  the  entire  party  system. 
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Path  Dependence 


Path  dependence,  a  highly  popular  concept  in 
social  science,  simply  refers  to  a  dynamic  pattern 
or  continuity  that  evolves  as  a  result  of  its  own 
past.  As  a  historically  sensitive  approach,  path 
dependence  emphasizes  the  role  of  the  timing  and 
sequence  of  events  in  the  social  and  political 
world.  Small,  accidental,  or  random  occurrences 
happening  at  a  certain  time  are  expected  to  have 
long-lasting,  self-reproducing  patterns  or  paths.  In 
other  words,  when  things  happen  in  a  chain  of 


events  affects  how  they  take  place.  Below,  the 
importance  and  also  some  of  the  limitations  of 
this  concept  are  discussed. 

Although  path  dependence  simply  asserts  that 
particular  choices  and  events  that  occur  in  the  past 
mold  the  unfolding  sequence  of  events  (i.e.,  history 
matters),  any  continuity  or  sequence  of  events  may 
not  constitute  a  path-dependent  process.  Path 
emergence,  maintenance,  and  breaking  in  a  path- 
dependent  process  should  have  specific  features. 
Regarding  path  emergence,  contingent,  accidental, 
brief  occurrences  or  nonoccurrences  launch  event 
chains.  In  other  words,  initial  conditions  are  treated 
as  stochastic.  Otherwise,  path  dependence  would  be 
nonfalsifiable  because  one  can  easily  link  any  event 
to  an  antecedent  or  a  temporarily  distant  cause.  As 
a  result,  any  outcome  might  be  marked  as  path 
dependent.  To  avoid  such  a  problem  of  endless  flow 
of  causally  connected  events  (i.e.,  the  trap  of  infinite 
regress),  the  historical  event  that  launches  the  path 
is  expected  to  have  properties  of  contingency. 

Once  historical  contingencies  or  junctures  set 
into  motion  certain  patterns  or  sequences,  they 
lock  themselves  in,  leading  to  inertia.  In  other 
words,  paths  become  inflexible  or  rigid  over  time, 
making  it  difficult  to  shift  to  another  path  or 
return  to  the  initial  conditions.  The  processes  or 
mechanisms  that  reproduce  a  path  might  be  rather 
different  from  the  processes  producing  the  path. 
Although  stochastic  factors  start  a  path,  certain 
causal  mechanisms  reproduce  it.  A  highly  empha¬ 
sized  mechanism  of  path  reproduction  is  increas¬ 
ing  returns,  defined  as  s elf-reinforcing  positive 
feedback  processes.  The  idea  of  increasing  returns 
suggests  that  the  growing  benefits  that  a  certain 
path  cultivates  with  its  continued  adoption  create 
further  incentives  for  path  maintenance,  leading  to 
dormancy  over  time.  Learning  and  coordination 
effects,  adaptive  expectations,  and  large  setup  or 
fixed  costs  constitute  some  sources  of  positive 
feedback  processes.  Thus,  a  path  may  reproduce 
itself  even  in  the  absence  of  the  dynamics  and  fac¬ 
tors  responsible  for  its  creation. 

With  respect  to  change,  path  dependence 
expects  it  to  be  incremental  and  evolutionary  (i.e., 
path  following  or  change  within  a  path).  Sub¬ 
stantial,  pathbreaking  changes  (i.e.,  change  to  a 
new  path)  rarely  happen  after  long  periods  of 
continuity.  Such  major  changes  require  critical 
junctures  or  periods  (also  known  as  branching 
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points)  in  which  some  externally  driven  contin¬ 
gent  events,  such  as  wars,  economic  crises, 
dramatic  technological  developments,  natural 
disasters,  or  epidemics,  make  the  extant  path  less 
attractive  to  follow.  Thus,  critical  junctures  unlock 
the  existing  path  or  equilibrium  and  generate  a 
new  one,  leading  to  a  new  stasis  (also  referred  to 
as  punctuated  equilibrium). 

Polya  urn  process,  a  simplified  mathematical 
illustration,  delineates  the  logic  of  path  depen¬ 
dence  quite  well.  Imagine  an  urn  containing  two 
colored  balls,  one  yellow  and  one  red.  Let  us  draw 
one  ball  and  then  return  it  to  the  urn  with  an  addi¬ 
tional  ball  of  the  same  color.  If  we  repeat  this 
process  until  we  fill  the  urn  completely,  which 
balls  will  be  the  most  frequent  ones  in  the  urn? 
Obviously,  to  a  great  extent  this  depends  on  the 
outcome  of  the  initial  draws  (i.e.,  initial  condi¬ 
tions).  If  we  draw  the  red  ball  in  the  first  round, 
then  the  likelihood  of  drawing  a  red  ball  increases 
in  the  following  draw.  Over  time,  the  distribution 
would  settle  down  to  a  majority  of  red  balls.  In 
other  words,  the  ratio  will  eventually  reach  an 
equilibrium  (i.e.,  the  red  one).  This  process  indi¬ 
cates  that  sequence  matters.  In  the  beginning,  we 
cannot  say  with  certainty  which  color  will  lock 
itself  in.  We,  however,  know  that  the  results  of 
early  draws  in  the  sequence  will  have  a  substantial 
impact  on  which  of  the  balls  will  be  the  dominant 
one  at  the  end.  Thus,  early  developments  or  events 
not  only  involve  a  substantial  degree  of  random¬ 
ness  but  also  produce  a  large,  determinative  impact 
on  subsequent  developments.  In  other  words,  the 
impact  of  random  occurrences  early  in  a  sequence 
of  events  does  not  cancel  out.  Instead,  they  exert  a 
long-lasting  influence  on  the  evolution  of  the 
sequence  of  events  (also  known  as  nonergodicity) . 

It  is  important  to  indicate  at  this  point  that 
actors  may  not  always  follow  the  most  efficient  or 
beneficial  path.  In  other  words,  despite  conven¬ 
tional  economic  models  in  which  rational  actors 
are  assumed  to  make  efficient  decisions  and  maxi¬ 
mize  profits,  suboptimal  outcomes  or  paths  might 
also  lock  themselves  in.  A  classical  example  of 
path  inefficiency  is  the  story  of  the  QWERTY  key¬ 
board  layout.  Although  it  is  widely  acknowledged 
that  this  keyboard  is  less  efficient  than  the  one 
developed  by  August  Dvorak,  it  was  able  to  lock 
itself  in.  Why?  It  is  put  forward  that  the  domi¬ 
nance  of  QWERTY  was  simply  due  to  a  historical 


accident.  This  keyboard  layout  was  introduced  in 
the  late  19th  century,  while  Dvorak’s  simplified 
keyboard  was  initiated  in  the  1930s.  Due  to  tech¬ 
nical  interrelatedness,  economies  of  scale,  and  the 
quasi  irreversibility  of  investment,  it  became  diffi¬ 
cult  to  switch  from  the  previously  introduced  key¬ 
board  to  the  new  one,  regardless  of  its  relative 
efficiency.  This  case,  too,  indicates  that  the  timing 
and  sequence  of  events  matter. 

The  idea  of  path  dependence  is  a  popular  yet 
highly  questioned  notion  on  many  accounts.  Several 
studies,  for  instance,  draw  attention  to  its  weak¬ 
nesses  in  dealing  with  change.  It  is  observed  that 
path  dependence  has  a  bias  toward  stability.  Other 
than  that,  it  relies  on  an  exogenous  deus  ex 
machina  to  explain  substantial,  pathbreaking 
change.  In  other  words,  change  is  explained  by 
exogenously  induced  variation.  This  understanding 
is  based  on  the  assumption  that  institutions  have 
self-reinforcing  features,  and  changes  in  such  struc¬ 
tures  should  be  driven  by  exogenous  factors  and 
dynamics.  Critiques  contend  that  such  an  under¬ 
standing  remains  limited  in  accounting  for  the 
timing,  direction,  and  mechanisms  of  the  change 
process.  For  instance,  it  is  asserted  that  the  source 
of  change  may  also  be  endogenous  to  institutions 
or  sometimes  both  endogenous  and  exogenous.  It  is 
further  argued  that  path  dependence  underesti¬ 
mates  the  role  of  incremental  changes.  Critiques 
propose  that  minor,  small  changes  might  be  quite 
important  because  they  may  accumulate  over  time 
and  create  a  sudden  pathbreaking  change.  As  a 
remedy  for  such  weaknesses,  path  dependence  is 
advised  to  have  a  stronger  sense  of  agency. 

This  approach  is  also  criticized  for  its  material¬ 
ist  propensity.  Although  path  dependence  acknowl¬ 
edges  that  it  is  not  always  guaranteed  to  reach  and 
maintain  Pareto  optimality  (i.e.,  a  state  in  which 
no  one  can  be  made  better  off  without  making 
someone  worse  off),  it  treats  the  cost-benefit  cal¬ 
culus  (i.e.,  increasing  returns,  positive  feedback)  as 
the  main  dynamic  behind  the  continuation  or  the 
persistence  of  a  path.  The  utilitarian,  materialist 
bias,  however,  substantially  limits  the  explanatory 
power  of  path  dependence  because  there  are  also 
several  cases  in  the  institutional  world  where  non¬ 
material  factors  and  considerations  (i.e.,  ideas, 
values,  and  legitimacy  concerns)  constitute  a  path. 
It  is  suggested  that  once  successfully  institutional¬ 
ized  (i.e.,  taken  for  granted  by  actors),  ideas  shape 


1832  Patriotism 


actors’  perceptions,  goals,  behaviors,  and  conse¬ 
quently  political  outcomes.  As  a  result,  some 
choices  will  be  regarded  as  more  favorable  or 
appropriate  than  others.  This  process,  in  the  long 
run,  would  lock  in  certain  actions  or  policies  while 
locking  out  others  (also  referred  to  as  cognitive 
locking).  Thus,  actors  may  maintain  a  certain  path 
not  because  of  an  expected  utility  down  the  path 
but  because  of  simply  believing  in  the  appropriate¬ 
ness  of  certain  ideas  and  values.  Put  differently,  the 
main  mechanism  of  self-reinforcement  or  path 
reproduction  in  such  ideational  continuities 
becomes  taken  for  granted  or  legitimate  rather 
than  being  due  to  cost-benefit  assessments  (i.e., 
increasing  returns). 

In  response  to  these  criticisms,  recent  studies 
employing  path  dependence  have  paid  more  atten¬ 
tion  to  incremental,  endogenous  changes  and  the 
role  of  agency  in  path  emergence,  path  mainte¬ 
nance,  and  path-breaking  processes.  Despite  its 
shortcomings,  it  is  important  that  a  researcher  have 
path  dependence  in  his  or  her  tool  kit.  The  syn¬ 
chronic  approaches,  which  are  concerned  with  the 
impact  of  variance  in  current  variables  on  political 
outcomes,  provide  us  a  “snapshot”  view  of  politi¬ 
cal  life.  For  a  much  better  understanding,  however, 
these  approaches  should  be  complemented  with 
diachronic  approaches,  which  are  more  sensitive  to 
historical  causes  or  conditions,  triggering  particular 
self-reproducing  sequences  of  events. 
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Patriotism 


As  Maurizio  Viroli  notes,  patriotism  (love  of  coun¬ 
try)  and  nationalism  (loyalty  to  one’s  nation)  are 
often  taken  to  be  synonymous.  Indeed,  for  writers 
such  as  Ernest  Geller,  nationalism  constitutes  a 
distinctive  expression  of  patriotism.  Yet  the  origin 
of  the  concept  can  be  traced  back  to  the  notion  of 
patria  in  Greek  and  especially  Roman  antiquity, 
which  links  patriotism  with  the  preservation  of  res 
publica  and  the  protection  of  common  liberty.  It  is 
only  with  the  rise  of  nationalism  in  the  19th  cen¬ 
tury  that  patriotism  becomes  associated  with  the 
exclusive  attachment  to  a  prepolitical,  homoge¬ 
neous,  national  community.  Because  of  this  asso¬ 
ciation,  patriotism  has  been  frequently  regarded  as 
incompatible  with  a  typically  liberal  commitment 
to  universal  human  rights.  However,  recent  schol¬ 
arship  has  sought  to  disassociate  patriotism  from 
nationalism  by  emphasizing  new  forms  of  loyalty 
compatible  with  universal  values,  respect  for 
human  rights,  and  tolerance  of  ethnic  and  national 
differences,  giving  rise  to  concepts  such  as  consti¬ 
tutional  or  cosmopolitan  patriotism  as  well  as 
contemporary  restatements  of  the  classical  repub¬ 
lican  tradition. 

Greek  and  especially  Roman  antiquity  provide 
the  roots  for  a  political  patriotism  that  conceives 
of  political  loyalty  to  the  patria  as  loyalty  to  a 
political  conception  of  the  republic  and  is  associ¬ 
ated  with  the  love  of  law  and  common  liberty,  the 
search  for  the  common  good,  and  the  duty  to 
behave  justly  toward  one’s  country.  This  classical 
Roman  meaning  of  patria  reemerges  in  the  context 
of  the  Italian  city  republics  of  the  15th  century. 
Here,  patria  stands  for  the  common  liberty  of  the 
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city,  which  can  only  be  safeguarded  by  the  citizens’ 
civic  spirit.  For  Niccolo  Machiavelli,  the  love  of 
common  liberty  enables  citizens  to  see  their  private 
and  particular  interests  as  part  of  the  common 
good  and  helps  them  to  resist  corruption  and  tyr¬ 
anny  and,  if  necessary,  to  fight  to  protect  and 
preserve  the  political  community.  While  this  love 
of  the  city  is  typically  intermixed  with  pride  in  her 
military  strength  and  cultural  superiority,  it  is  the 
political  institutions  and  way  of  life  of  the  city  that 
form  the  distinctive  focal  point  of  this  kind  of 
patriotic  attachment.  To  love  the  city  is  to  be  will¬ 
ing  to  sacrifice  one’s  own  good — including  one’s 
life  and,  if  need  be,  one’s  soul — for  the  protection 
of  common  liberty. 

In  contrast  to  the  classical  republican  conception 
of  patriotism,  Jean-Jacques  Rousseau’s  “Consider¬ 
ations  on  the  Government  of  Poland”  can  be  seen 
as  an  early  example  of  the  link  between  national¬ 
ism  and  patriotism.  Yet  while  Rousseau  advocates 
the  love  of  the  nation  and  the  celebration  of 
national  culture,  he  believes  that  national  culture  is 
valuable  primarily  because  it  helps  foster  loyalty  to 
the  political  fatherland.  Thus,  Rousseau’s  national¬ 
ism  stems  from  and  serves  his  typically  republican 
emphasis  on  securing  citizens’  loyalty  to  their 
political  institutions.  A  more  explicit  link  between 
nationalism  and  patriotism  can  be  found  in  the 
work  of  Johann  Gottfried  Herder.  In  Herder’s 
work,  patriotism  refers  not  to  a  political  virtue  but 
to  a  spiritual  attachment  to  the  nation.  In  this  con¬ 
text,  fatherland  becomes  synonymous  with  the 
nation  and  its  distinct  language  and  culture,  which 
give  it  unity  and  coherence.  Thus,  instead  of  linking 
patriotism  to  the  preservation  of  political  liberty, 
Herder  associates  love  of  one’s  country  with  the 
preservation  of  a  common  culture  and  the  spiritual 
unity  of  a  people.  While  in  the  classical  republican 
tradition,  “fatherland”  is  synonymous  with  politi¬ 
cal  institutions,  for  Herder,  the  nation  is  prepoliti¬ 
cal  and  love  of  one’s  national  culture  is  a  natural 
inclination  that  allows  a  people  to  express  their 
distinctive  character.  On  this  account,  patriotism  is 
associated  with  the  exclusive  attachment  to  one’s 
own  culture  and  thus  stands  in  opposition  to  cos¬ 
mopolitanism  and  cultural  assimilation.  Freedom  is 
equated  not  with  the  fight  against  political  oppres¬ 
sion  but  with  the  preservation  of  a  unique  people 
and  patriotic  sacrifice  with  the  desire  to  secure  the 
long-term  survival  of  the  nation. 


This  association  between  patriotism  and  the 
exclusive  attachment  to  one’s  nation  has  led  crit¬ 
ics  like  Martha  Nussbaum  to  view  the  sentiment 
of  patriotic  pride  as  morally  dangerous,  giving 
rise  to  a  chauvinism  that  is  incompatible  with 
cosmopolitan  aspirations  and  the  recognition  of 
the  equal  moral  worth  of  all  human  beings.  Yet 
recent  approaches  to  patriotism  have  sought  to 
ground  patriotism  in  new  forms  of  loyalty  that 
are  compatible  with  universal  values,  respect  for 
human  rights,  and  tolerance  of  ethnic  and  national 
differences.  At  the  heart  of  this  renewed  interest 
in  patriotism  lies  the  belief  that  to  be  stable, 
democratic  societies  require  a  strong  sense  of  alle¬ 
giance  on  the  part  of  their  citizens.  Not  only  does 
the  high  degree  of  pluralism  that  characterizes 
contemporary  societies  potentially  give  rise  to 
tensions  and  disagreements  among  citizens  that 
may  destabilize  the  polity,  modern  democratic 
states  committed  to  a  degree  of  equality  rely  on 
the  willingness  of  citizens  to  make  sacrifices  for 
the  common  good,  be  it  in  terms  of  the  everyday 
redistribution  of  income  to  meet  welfare  needs  or 
the  provision  of  collective  goods  and  services  such 
as  education  or  health  care.  Hence,  in  the  eyes  of 
advocates  of  new  forms  of  patriotism,  stable 
democratic  societies  require  a  strong  sense  of 
solidarity. 

The  most  prominent  example  of  this  search  for 
new  forms  of  solidarity  is  Jurgen  Habermas’s 
notion  of  constitutional  patriotism  (Verfassungs- 
patriotismus ),  which  seeks  to  ground  the  loyalty  of 
citizens  not  in  the  idea  of  a  prepolitical,  homoge¬ 
neous  community  but  in  a  commitment  to  universal 
liberal  principles  as  enshrined  in  the  constitution  of 
the  modern  liberal  state.  To  ensure  that  citizens 
who  subscribe  to  different  cultural,  ethnic,  and  reli¬ 
gious  forms  of  life  can  coexist  in  and  identify  with 
their  own  country  on  equal  terms,  Habermas 
argues  that  the  modern  constitutional  state  must 
ensure  that  its  political  culture  does  not  favor  or 
discriminate  against  any  particular  subculture.  To 
achieve  this,  it  is  vital  to  differentiate  the  majority 
culture  from  a  shared  political  culture  grounded  in 
respect  for  fundamental  constitutional  principles 
and  basic  law.  On  this  account,  membership  of  a 
nation  of  citizens  no  longer  rests  on  an  appeal  to  a 
shared  language  or  a  common  ethical  and  cultural 
origin  but  merely  reflects  a  shared  political  culture 
based  on  standard  liberal  constitutional  principles. 
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Habermas’s  attempt  to  ground  patriotism  in  an 
attachment  to  universal  liberal  principles  is  also 
associated  with  what  is  at  times  referred  to  as 
cosmopolitan  patriotism,  which  seeks  to  construct 
a  postnational  identity  based  on  the  recognition  of 
democratic  values  and  human  rights  as  conceptu¬ 
alized  within  a  particular  constitutional  tradition. 
Such  cosmopolitan  patriotism  is  said  by  advocates 
such  as  Kwame  Anthony  Appiah  to  give  rise  to  a 
rooted  cosmopolitanism  that  couples  attachment 
to  one’s  homeland  and  cultural  particularities 
with  an  appreciation  of  different  places  and  differ¬ 
ent  people  and  a  robust  respect  for  the  equal 
moral  worth  of  all  human  beings.  Advocates  of 
forms  of  constitutional  patriotism  often  cite  the 
United  States  as  an  example  of  a  nonnational  pol¬ 
ity  held  together  by  an  expressly  political  patrio¬ 
tism.  John  Schaar,  for  instance,  refers  to  American 
patriotism  as  “covenanted  patriotism,”  a  form  of 
patriotic  attachment  characterized  by  a  commit¬ 
ment  to  the  principles  and  goals  set  out  in  the 
founding  covenant  and  the  duty  to  carry  on  the 
work  of  the  Founding  Fathers.  Another  strand  of 
contemporary  thought  appeals  to  the  classical 
republican  principles  of  love  of  liberty,  active  citi¬ 
zenship,  and  self-sacrifice  for  the  common  good  in 
their  attempt  to  formulate  new  forms  of  solidarity 
that  do  not  depend  on  the  idea  of  a  prepolitical, 
ethically  homogeneous  nation.  Maurizio  Viroli 
writes  in  this  tradition  when  he  advocates  a 
republican  patriotism  characterized  by  a  political 
culture  of  liberty  that  emphasizes  active  citizen¬ 
ship  and  civic  virtue  and  gives  rise  to  a  reflexive, 
self-critical  love  of  country  that  aims  to  ensure 
that  the  polity  lives  up  to  its  highest  traditions  and 
ideals. 

However,  critics  of  such  attempts  to  generate 
new,  nonexclusionary  forms  of  solidarity  have 
expressed  doubts  about  the  extent  to  which  patriotic 
sentiments  can  be  reconciled  with  a  commitment  to 
universal  principles.  While  critics  of  constitutional 
patriotism  have  questioned  the  feasibility  of 
Habermas’s  attempt  to  decouple  the  political  culture 
from  the  wider  majority  culture,  pointing  to  the 
extent  to  which  the  political  culture  of  even  as  cul¬ 
turally  diverse  a  society  as  America  draws  on 
national  symbols  and  myths  that  are  laden  with 
prepolitical  meanings,  commentators  like  Margaret 
Canovan  have  argued  that  classical  republican 
patriotism  was  much  more  illiberal  and  hostile  to 


outsiders  than  modern  proponents  of  the  republi¬ 
can  tradition  suggest.  According  to  Canovan,  not 
only  is  the  patriotic  virtue  celebrated  in  the  classi¬ 
cal  republican  tradition  primarily  a  military  virtue, 
the  republican  preoccupation  with  the  education 
and  socialization  of  citizens  to  systematically 
instill  loyalty  and  commitment  to  the  state  is  liable 
to  be  seen  by  many  contemporary  liberals  as  an 
unacceptable  form  of  manipulation  and  indoctri¬ 
nation.  Furthermore,  advocates  of  both  constitu¬ 
tional  and  modern  republican  patriotism  typically 
presuppose  the  existence  of  established  political 
boundaries  and  common  political  institutions  that 
have  their  origins  in  the  rise  and  consolidation  of 
the  nation-state.  Thus,  the  extent  to  which  patri¬ 
otism  can  be  reconciled  with  a  commitment  to 
universal  values,  respect  for  human  rights,  and 
tolerance  of  ethnic  and  national  differences  remains 
contested. 
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Pay  for  performance  (PFP;  also  known  as  merit 
pay  or  performance-related  pay)  involves  the  pro¬ 
vision  of  a  financial  reward  contingent  on  some 
indicator  of  individual  (or  less  frequently,  group) 
success.  PFP  has  been  a  consistent  reform  in  gov¬ 
ernment  since  the  1980s  and  remains  popular. 

In  an  era  when  elected  officials  call  for  evidence- 
based  policy  and  rigorous  evaluations  of  program 
performance,  PFP  represents  both  an  affirmation 
and  an  uncomfortable  anomaly.  On  the  one  hand, 
it  perfectly  captures  basic  themes  of  the  current 
era:  Public  actors  should  be  held  accountable  for 
performance,  rewarded  for  success,  and  punished 
for  failure.  At  the  same  time,  however,  there  is 
considerable  evidence  that  PFP  as  implemented 
thus  far  has  been  ineffective.  For  example,  the  U.S. 
federal  government  has  tried  multiple  iterations  of 
PFP  since  1978.  New  variations  of  PFP  are  pro¬ 
posed  but  with  little  effort  to  fix  the  fundamental 
problems  that  undermined  the  previous  version. 

Reasons  for  Failure 

There  are  a  variety  of  reasons  why  PFP  systems 
have  struggled  in  the  public  setting: 

Attribution  difficulty.  For  any  moderately  complex 
task,  outcomes  are  generally  the  result  of  group 
effort,  and  the  relative  contribution  of  a  single 
individual  is  difficult  to  discern. 

Measurement  difficulty.  Attribution  difficulty 
makes  it  difficult  to  come  up  with  individual-level 
performance  measures  that  connect  well  with 
organizational  goals.  As  a  result,  PFP  systems 
generally  rely  on  subjective  ratings  of  performance 
as  determined  by  a  supervisor.  It  is  often  unclear  to 
employees  how  to  do  well  on  such  scales  beyond 
staying  on  the  good  side  of  their  supervisor.  From 
the  supervisors’  perspective,  they  often  lack  the 
information  necessary  to  provide  an  informed 
assessment. 

Grade  inflation-.  Employee  ratings  generally  follow 
a  fixed  scale,  for  example,  1  is  excellent  and  5  is 
below  expectations.  The  subjectivity  of  these  scales, 
the  lack  of  an  incentive  to  critically  differentiate 
between  good  and  bad  performers,  and  the 


discomfort  of  critically  rating  an  individual  result 
in  the  vast  majority  being  graded  as  above  average. 

Resources:  PFP  systems  have  generally  been 
underfunded.  It  seems  that  although  politicians 
argue  for  high  risk  and  high  reward,  in  practice  they 
cannot  tolerate  the  potential  media  scrutiny  that 
might  arise  if  public  servants  earn  genuinely  high 
rewards.  Public  service  unions,  generally  critical  of 
PFP  systems,  also  work  to  keep  the  size  of  the 
rewards  small  in  order  to  ensure  that  members  do 
not  actually  lose  out  under  new  systems.  As  a  result, 
the  rewards  associated  by  PFP  systems  are  generally 
too  small  to  be  effective  motivators. 

Incorrect  assumptions :  PFP  systems  are  based  on 
the  assumption  that  individuals  are  motivated  by 
financial  rewards.  But  there  is  a  substantial 
literature  documenting  the  relatively  high 
prevalence  of  intrinsic  motivation  among  public 
sector  workers.  At  the  very  least,  it  seems 
reasonable  to  assume  that  there  will  be 
heterogeneity  in  reward  preferences  among  public 
employees  and  PFP  systems  will  not  be  a  strong 
motivator  for  many. 

At  a  broader  level,  the  poor  record  of  PFP  in  the 
public  sector  may  be  the  result  of  (a)  failure  to 
implement  it  properly,  (b)  a  poor  fit  with  the  pub¬ 
lic  setting,  or  (c)  the  fact  that  the  theory  simply 
does  not  work  well  in  any  setting.  While  some  of 
the  problems  outlined  above  might  be  remedied 
with  better  implementation  (e.g.,  better  measure¬ 
ment  systems  or  more  resources),  the  nature  of  the 
public  setting  often  makes  such  changes  unlikely. 
Most  public  services  are  complex  and  hard  to  mea¬ 
sure,  and  a  reluctance  to  provide  appropriate 
resources  is  tied  to  the  political  context  of  public 
services.  There  is  also  evidence  that  the  presumed 
success  of  PFP  in  the  private  setting  is  overstated, 
which  questions  the  basic  validity  of  the  theory 
behind  PFP.  The  empirical  evidence  on  PFP  in  the 
private  setting  is  quite  mixed,  with  success  gener¬ 
ally  contingent  on  the  factors — ease  of  measure¬ 
ment  and  attribution,  resource  adequacy — that 
make  it  difficult  to  implement  in  the  public  sector. 

Costs  of  Pay  for  Performance 

In  addition  to  evidence  that  PFP  tends  to  work 
only  under  some  fairly  specific  circumstances  that 
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are  unusual  in  the  public  sector,  there  is  also  evi¬ 
dence  that  it  generates  costs.  The  most  obvious  are 
the  transaction  costs  involved  in  creating  and 
implementing  an  individualized  performance 
appraisal  system. 

There  are  also  unanticipated  costs.  One  is  dis¬ 
trust  and  conflict.  In  a  standard  salary  system, 
supervisors  have  limited  financial  rewards  to  offer, 
which  reduces  the  potential  for  complaints  about 
favoritism  or  abuse  of  power.  But  in  PFP  systems, 
the  reliance  on  subjective  assessments,  the  limited 
availability  of  rewards,  and  the  general  belief 
among  employees  that  they  deserve  rewards  create 
the  conditions  where  some  employees  will  feel  that 
favoritism  is  employed  and  that  neither  appraisal 
systems  nor  pay  reflect  their  effort.  PFP  systems 
may  also  create  tension  between  employees  who 
view  themselves  as  competing  for  a  limited  number 
of  rewards. 

Another  potential  cost  is  the  crowding-out 
effect,  which  is  centered  on  evidence  that  financial 
incentives  can  undermine  intrinsic  motivations.  As 
individuals  lose  their  sense  of  self-determination, 
and  find  that  public  service  performance  is  mone¬ 
tized,  some  may  become  discouraged  and  with¬ 
draw  effort.  Over  time,  a  reliance  on  extrinsic 
incentives  also  creates  a  selection  process  that 
reinforces  crowding  out,  as  those  driven  by  intrin¬ 
sic  motivation  become  less  likely  to  join  or  stay  in 
contexts  where  PFP  is  dominant. 

The  final  potential  cost  of  PFP  is  that  it  can 
encourage  perverse  behavior,  in  the  form  of  gaming 
and  goal  displacement.  Faced  with  high-powered 
incentives,  individuals  may  seek  to  manipulate  per¬ 
formance  measures  (gaming)  or  to  improve  metrics 
at  the  expense  of  some  unmeasured  but  important 
value  (goal  displacement).  For  example,  many  of 
the  studies  of  PFP  that  show  positive  results  are  set 
in  health  care  settings  that  allowed  piece-rate  com¬ 
pensation  programs,  where  providers  were  rewarded 
for  the  number  of  services  provided.  While  health 
care  providers  may  approve  of  such  system,  it  has 
been  criticized  as  creating  broader  social  costs,  since 
it  encourages  the  provision  of  unnecessary  services 
and  increases  the  overall  cost  of  health  care. 

As  awareness  of  unanticipated  consequences 
grows,  the  tendency  to  underresource  PFP  systems 
identified  above  may  not  be  a  weakness  but  an 
unintended  protection  against  the  type  of  misbehav¬ 
ior  that  is  encouraged  by  high-powered  incentives. 


And  some  have  argued  that  lower-powered  incen¬ 
tives  are  actually  optimal  for  individuals  in  the 
public  sector. 

Reasons  for  Adoption 

Given  its  poor  record  of  success,  why  does  PFP 
remain  durable?  PFP  has  high  intuitive  and  sym¬ 
bolic  appeal.  It  is  easy  to  understand,  is  believed  to 
be  successful  in  the  private  sector,  and  is  assumed 
to  provide  an  element  of  risk  and  competition  that 
runs  contrary  to  the  image  of  bureaucracies.  It 
communicates  a  desire  to  hold  inefficient  bureau¬ 
crats  accountable  for  performance.  Past  failures  of 
PFP  can  be  dismissed  as  implementation  failures. 
Of  course,  if  the  primary  motivation  is  symbolic, 
concern  about  the  actual  success  of  the  system  is  of 
little  consequence. 

PFP  may  also  be  viewed  as  a  means  by  which 
political  principals  control  agents.  There  is  some 
circumstantial  evidence  to  support  this  argument. 
PFP  systems  are  more  likely  to  be  found  in 
Organisation  for  Economic  Co-operation  and 
Development  (OECD)  countries  where  there  is  no 
tradition  of  crossover  between  politics  and  bureau¬ 
cracy,  and  political  appointees  tend  to  be  support¬ 
ive  of  PFP  systems.  But  the  functional  utility  of  PFP 
as  a  means  of  control  is  limited  when  information 
asymmetry  and  measurement  problems  pervade 
conditions  that  underpin  principal-agent  problems 
in  the  first  place. 

As  PFP  has  matured,  it  appears  to  have  found 
greater  application  in  performance  contracts  with 
third-party  providers.  In  such  settings,  government 
principals  do  not  face  the  same  constraints  in 
terms  of  civil  service  rules  or  bonus  size,  and  so  can 
create  high-powered  incentives.  As  with  PFP  sys¬ 
tems  in  government,  much  of  the  success  of  such 
systems  depends  on  the  capacity  to  measure  and 
specify  complete  contracts,  with  objectives  that  tie 
to  reward.  When  contracts  are  incomplete,  third- 
party  providers  have  stronger  incentives  to  engage 
in  gaming  and  goal  displacement,  and  fewer  con¬ 
straints  to  stop  them. 

As  we  look  to  the  future,  it  is  tempting  to  cate¬ 
gorize  PFP  as  an  example  of  what  H.  F.  Henry 
Mencken  observed:  “There  is  always  a  well-known 
solution  to  every  human  problem — neat,  plausible, 
and  wrong.”  It  is  an  appealing  and  durable  idea, 
despite  its  record  of  failure.  There  will  always  be  a 
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market  for  such  ideas.  If  the  conditions  are  right, 

PFP  might  even  work,  though  such  conditions  are 

not  generally  conducive  to  the  public  setting. 
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Peace 


Peace  has  long  been  understood  in  political  science 
in  two  quite  distinct  ways.  The  dominant  concep¬ 
tion  of  peace  has  been  the  absence  of  war.  The 
very  word  peace  (and  the  French  paix)  derives 
from  the  Latin  pax ,  meaning  an  agreement — a 
pact — to  refrain  from  hostilities.  This  is  sometimes 


characterized  as  “negative  peace”  as  opposed  to 
“positive  peace,”  with  the  latter  denoting  not  just 
the  absence  of  direct  violence  but  also  the  absence 
of  indirect  or  structural  violence,  sometimes 
described  as  the  presence  of  justice.  This  entry 
surveys  the  religious  and  normative  context  within 
which  peace  came  to  be  understood,  the  various 
ways  in  which  peace  has  been  seen  as  a  subject  of 
research,  and  the  diverse  explanations  offered  by 
political  science  on  the  conditions  for  peace. 

Given  the  implicit  link  between  peace  and  war, 
peace  was  long  understood  largely  through  the 
lens  of  conflict  and  its  management.  During  the 
Cold  War,  however,  the  “management”  of  conflict 
through  mutually  assured  destruction  and  detente 
led  to  research  on  peace  that  meant  more  than  just 
the  avoidance  of  annihilation.  This  often  embraced 
a  constructivist  perspective  on  peace,  while  realists 
tended  to  conceive  of  peace  in  terms  of  the  inter¬ 
ests  of  states,  and  idealists  emphasized  the  role  that 
norms  and  institutions  could  play. 

Norms  and  Religion 

Rules  governing  the  conduct  of  war  have  existed  in 
virtually  every  culture.  The  Aztecs,  for  example, 
developed  elaborate  rituals  that  preceded  conflict, 
including  dispatching  ambassadors,  providing  an 
opportunity  for  a  prospective  foe  to  submit  peace¬ 
fully  to  Aztec  rule.  (Failure  to  accept  three  such 
offers  was  required  before  military  operations 
could  commence — even  though  the  process  meant 
abandoning  all  possibility  of  surprise  and,  since 
the  ambassadors  traditionally  brought  gifts  of 
weapons,  the  opponent’s  military  capacity  was 
increased.)  War  itself,  however,  was  typically 
regarded  as  a  natural  state.  Early  Western  philoso¬ 
phers  such  as  Aristotle  and  Plato  gave  relatively 
little  thought  to  war  as  an  atypical  phenomenon. 
The  seeds  of  what  is  now  the  just  war  tradition, 
limiting  not  only  the  conduct  of  war  but  the 
recourse  to  war  itself,  lay  in  the  transformed  per¬ 
ception  of  war  from  being  the  norm  to  being  an 
exceptional  state  that  required  justification. 

Christian  theology  contributed  much  of  the 
content  of  the  just  war  tradition  but  only  after  lit¬ 
eral  interpretations  of  Christ’s  injunction  to  “turn 
the  other  cheek” — on  its  face  a  doctrine  of  radical 
pacifism — could  be  rationalized  by  St.  Ambrose 
and  St.  Augustine  into  a  dualistic  framework  that 
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separated  inward  disposition  from  outward  action. 
This  dualism  lives  on  today  in  the  principles  of 
double-effect,  right  intention,  and  proportionality, 
each  important  to  the  modern  legal  framework  for 
the  use  of  force. 

Religion  has  been  significant  at  both  ends  of  the 
spectrum  of  peace  and  war.  Gandhi,  among  oth¬ 
ers,  stuck  to  the  utopian  position  of  extreme 
nonviolence — adopted  as  a  way  of  life  rather  than 
simply  a  tactic.  Similar  pacifist  tendencies  continue 
to  inform  much  research  on  peace,  despite  some 
views  relegating  it  to  the  margins  of  international 
affairs.  At  the  other  extreme,  radical  Islam  has 
elevated  the  traditional  Muslim  notion  of  jihad 
(literally  “struggle”)  to  a  holy  war  against  all  infi¬ 
dels  and  any  Muslim  who  collaborates  with  them. 

The  role  of  norms  in  constraining  or  shaping 
behavior  is  a  contested  area  of  political  science.  Yet 
in  the  area  of  war  and  peace,  the  most  important 
normative  transformation  of  the  20th  century  was 
the  outlawing  of  war.  The  Kellogg-Briand  Pact  of 
1928  “condemnjedj  recourse  to  war.”  The  United 
Nations  (UN)  Charter  of  1945  more  explicitly  pro¬ 
hibited  the  “threat  or  use  of  force,”  allowing  excep¬ 
tions  only  in  the  case  of  self-defense  against  an 
armed  attack  or  authorization  by  the  UN  Security 
Council  to  deal  with  a  threat  to  or  breach  of  “the 
peace.”  Though  this  manifestly  did  not  end  war,  it 
is  noteworthy  that  states  attempted  to  justify  virtu¬ 
ally  every  use  of  force  from  1945  under  one  of 
these  exceptions. 

The  late  20th  and  early  21st  centuries  saw  the 
UN  Security  Council  expanding  its  understanding 
of  “threats  to  international  peace  and  security”  as 
not  merely  embracing  traditional  interstate  conflict 
but  in  some  circumstances  extending  to  internal 
disputes  that  might  spill  over  into  other  states  or 
are  of  such  gravity  in  their  humanitarian  conse¬ 
quences  that  they  warrant  international  interven¬ 
tion.  This  expansive  interpretation  of  the  role  of 
supranational  institutions  was  accompanied  by  the 
emergence  of  “human  security”  and  the  “responsi¬ 
bility  to  protect” — doctrines  that  further  advanced 
the  notion  of  peace  as  meaning  much  more  than 
merely  the  absence  of  traditional  interstate  conflict. 

Human  security  was  an  attempt  to  challenge  tra¬ 
ditional  conceptions  of  security  by  arguing  that  the 
proper  referent  for  security  was  the  individual  rather 
than  the  state.  Building  on  the  UN  Development 
Programme’s  Human  Development  Report,  first 


published  in  1994,  human  security  argues  that  con¬ 
flict  and  deprivation  are  linked;  addressing  insecu¬ 
rity  thus  requires  addressing  freedom  from  fear  as 
well  as  freedom  from  want.  In  more  elaborate  for¬ 
mulations,  this  is  said  to  include  protection  as  well 
as  empowerment  strategies.  Critics  of  human  secu¬ 
rity  challenge  this  approach  on  the  basis  that  the 
concept  is  vague  and  not  helpful  in  understanding 
the  causes  of  conflict  or  formulating  responses. 

Responsibility  to  protect  (frequently  abbreviated 
to  R2P)  was  the  product  of  long-standing  efforts  to 
get  beyond  the  humanitarian  intervention  debates  of 
the  1990s.  First  coined  by  an  International  Commis¬ 
sion  in  2001,  the  doctrine  was  later  adopted  unani¬ 
mously  by  the  UN  General  Assembly  in  2005 — a 
remarkably  swift  endorsement  of  a  once  controver¬ 
sial  subject.  In  its  watered-down  form,  it  merely 
affirms  that  a  state  has  the  primary  responsibility  for 
protecting  its  population  from  genocide,  war  crimes, 
crimes  against  humanity,  and  ethnic  cleansing.  The 
failure  of  a  state  to  act  on  this  responsibility  might 
be  grounds  for  the  Security  Council  to  act,  in  excep¬ 
tional  circumstances,  in  the  name  of  the  interna¬ 
tional  community — something  the  Council  has 
already  done  on  a  handful  of  occasions. 

Peace  as  a  Subject  of  Research 

Though  war  and  peace  had  long  been  central  to  the 
study  of  international  relations,  the  idea  of  study¬ 
ing  peace  as  such  gained  traction  only  during  the 
Cold  War.  This  was  not  only  driven  by  the  unique 
security  threats  posed  by  nuclear  weapons  but  also 
found  encouragement  in  U.S.  universities,  in  par¬ 
ticular  by  opposition  to  the  Vietnam  War.  Much 
early  research  took  place  in  or  was  funded  by  Scan¬ 
dinavia:  Norway’s  International  Peace  Research 
Institute  in  Oslo  was  founded  in  1959  and  began 
publishing  the  Journal  of  Peace  Research  in  1964; 
the  Stockholm  International  Peace  Research 
Institute  was  established  in  1966.  Peace  studies 
became  more  popular  in  the  1980s,  and  today  there 
are  hundreds  of  such  programs  in  universities  and 
think  tanks  around  the  world. 

The  relevant  literature  encompasses  much  work 
on  positive  peace,  with  the  dominant  questions 
tending  to  remain  those  connected  with  explaining 
the  conditions  necessary  to  prevent  conflict.  As 
conflicts  from  the  1990s  were  increasingly  internal 
rather  than  international,  the  distinction  between 
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positive  and  negative  peace  has  to  some  extent 
been  elided  by  the  finding  that  half  of  all  countries 
emerging  from  civil  war  fall  back  into  it  within  5 
years  of  a  peace  agreement  being  signed.  The  ques¬ 
tion  of  resolving  one  conflict  has  therefore  become 
linked  with  the  question  of  preventing  another. 

This  in  turn  led  to  an  expansive  literature  on 
peace  building,  an  imprecise  term  that  at  times  is 
used  to  describe  virtually  all  forms  of  international 
assistance  to  countries  that  have  experienced  or  are 
at  risk  of  armed  conflict.  State  building  (sometime 
rendered  as  nation  building  in  the  United  States) 
more  narrowly  refers  to  the  construction  or  recon¬ 
struction  of  institutions  of  governance  capable  of 
providing  citizens  with  physical  and  economic  secu¬ 
rity.  Most  often  conducted  as  part  of  UN  peace 
operations,  such  activities  have  included  many  quasi- 
governmental  activities  ranging  from  electoral  assis¬ 
tance  and  technical  assistance  to  maintain  human 
rights  and  the  rule  of  law  to  the  actual  administra¬ 
tion  of  territory  in  Kosovo  and  Timor-Leste. 

Explanations  of  the  Conditions  for  Peace 

Despite  increased  attention  to  positive  peace,  a 
disproportionate  amount  of  the  literature  focuses 
on  conditions  for  peace  in  the  narrow  sense  of 
avoiding  war.  The  most  important  and  widely 
debated  tends  to  be  the  democratic  peace  thesis, 
but  much  has  also  been  written  on  economic  fac¬ 
tors  and  their  link  with  conflict.  The  shift  from 
international  to  internal  conflict  as  a  major  preoc¬ 
cupation  has  reduced  the  prominence  of  strategic 
considerations,  though  the  “post-post-Cold  War” 
era  and  the  rise  of  militarily  powerful  China  and 
Russia  may  suggest  a  return  to  strategic  studies 
and  great-power  politics  in  the  near  future.  Finally, 
the  growing  importance  attributed  to  asymmetric 
threats  such  as  those  from  terrorists  groups  of 
global  reach  has  raised  the  question  of  what 
“peace”  means  when  dealing  with  nonstate  groups 
that  may  or  may  not  behave  as  rational  actors. 

Democratic  Peace 

Much  quantitative  energy  has  been  spent  on 
the  democratic  peace  thesis,  which  holds  that 
democracies  are  statistically  less  likely  to  go  to  war 
than  states  that  are  undemocratic.  Overemphasis 
on  this  empirical  argument  (which  has  itself  been 


contested)  obscures  a  secondary  finding  in  the  lit¬ 
erature  that  an  autocratic  state  in  the  process  of 
democratization  may  in  fact  be  more  likely  to 
descend  into  conflict,  especially  internal  conflict. 
The  more  robust  explanations  of  what  scholars 
such  as  Michael  Doyle  term  the  liberal  peace  theory 
link  it  to  three  pillars:  republican  representation,  an 
ideological  commitment  to  fundamental  human 
rights,  and  transnational  interdependence.  These 
correspond  to  the  arguments  first  made  by  Immanuel 
Kant  in  his  1795  essay  “Perpetual  Peace.” 

Economics 

There  is  a  long  tradition  of  attributing  economic 
causes  to  conflict.  One  of  the  most  direct  linkages 
that  is  often  made  is  between  the  economic  imposi¬ 
tions  on  Germany  following  World  War  I  and  the 
planting  of  the  seeds  of  World  War  II.  Marxist  theo¬ 
ries  of  international  relations  drew  more  systemati¬ 
cally  on  economics  to  explain  the  unfolding  of 
historical  stages,  certain  inevitable  conflicts,  influenc¬ 
ing  dependency  theorists  in  particular.  More  recent 
work  on  civil  wars  offered  counterintuitive  sugges¬ 
tions  that  an  abundance  of  natural  resources  may 
increase  the  probability  and  duration  of  conflict. 

In  the  context  of  international  armed  conflict, 
economic  activity  in  the  form  of  cross-border  eco¬ 
nomic  flows  has  also  been  linked  to  peace — either 
because  it  raises  the  costs  of  severing  those  links  or 
because  it  offers  a  more  efficient  path  to  growth 
than  conquest.  An  early  advocate  of  trade  as  a 
path  to  peace  was  the  19th-century  British  states¬ 
man  Richard  Cobden,  though  it  has  subsequently 
been  discussed  in  the  context  of  interdependence 
and,  more  recently,  globalization. 

The  link  between  economics  and  peace  is  now 
well  accepted.  Since  1997,  the  Development  Assis¬ 
tance  Committee  of  the  Organisation  for  Economic 
Co-operation  and  Development  (OECD)  has  been 
at  the  forefront  of  efforts  to  “mainstream”  conflict 
prevention  into  donor  development  policies.  What 
is  now  termed  structural  prevention  (as  opposed  to 
operational  prevention )  seeks  to  bring  development 
perspectives  to  bear  on  what  were  traditionally 
regarded  as  political  and  security  problems. 

Nevertheless,  the  “greed”  versus  “grievance” 
debates  in  the  empirical  literature  revealed  the  dan¬ 
gers  of  reductive  theories  of  conflict  prevention. 
One-dimensional  analysis  has  now  largely  given  way 
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to  more  nuanced  accounts  that  acknowledge  a  mix 
of  economic,  political,  ethnic,  and  security  dynamics. 

Strategic  Studies 

Though  much  of  the  focus  on  peace  today  con¬ 
cerns  positive  peace  and  the  problem  of  civil  wars, 
strategic  studies  was  a  fertile  area  of  research  in  the 
Cold  War.  Doctrines  such  as  mutual  assured 
destruction  relied  on  the  presumed  rationality  of 
one’s  opponent  and  maintained  the  peace  through  a 
balance  of  terror. 

Such  perspectives  enjoyed  less  prominence  after 
the  end  of  the  Cold  War,  but  with  the  rise  of  China 
and  Russia  as  military  powers  and  the  potential  for 
conflict  over  diminishing  natural  resources,  such 
fields  might  experience  a  resurgence. 

Asymmetric  Threats 

A  phenomenon  that  is  not  new,  but  that  has 
come  to  dominate  discussion  of  conflict  in  many 
circles,  is  terrorism.  The  question  of  whether  ter¬ 
rorist  threats  are  appropriately  addressed  as  a 
military  rather  than  criminal  matter  is  the  subject 
of  significant  division  in  the  literature  and,  indeed, 
the  practice  of  states.  The  “war  on  terror”  of  the 
Bush  administration  in  the  United  States  was  at 
times  presented  as  an  intergenerational  conflict. 
Such  language  was  subsequently  dropped,  among 
other  reasons  because  of  the  difficulty  of  determin¬ 
ing  when  such  a  “war”  has  been  “won.” 

Conclusion 

One  of  the  most  important  factors  affecting  the 
study  of  war  and  peace  across  historical  periods 
has  been  whether  war  or  peace  was  in  fact  the 
“normal”  state  of  human  affairs.  It  is  perhaps 
ironic  that  the  century  that  rendered  meaningless 
the  formal  legal  category  of  “war” — indicating  a 
state  of  affairs  where  two  sides  may  lawfully  use 
force  against  one  another — was  the  bloodiest  in 
human  history.  At  the  beginning  of  a  century  that 
threatens  to  be  no  less  bloody,  this  is  a  disquiet- 
ingly  uncertain  conclusion. 
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Peacekeeping 


Theories  of  war  try  to  explain  why  war  occurs, 
given  that  it  is  often  an  inefficient  way  to  resolve 
disputes.  Scholars  have  proposed  several  explana¬ 
tions,  among  them  the  lack  of  information  about 
relative  capabilities  or  resolve  and  the  inability  of 
the  parties  to  reach  a  credible  commitment  to  end 
their  dispute  without  resorting  to  violence.  Some 
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argue  that,  once  war  occurs,  the  only  way  to  end 
it  and  ensure  that  there  is  no  recurrence  of  vio¬ 
lence  is  to  have  a  decisive  victory  of  one  side  as 
most  negotiated  settlements  generate  credible 
commitment  problems.  This  is  an  argument  of 
particular  relevance  to  civil  (or  intrastate)  wars, 
which  are  today  the  most  common  form  of  large- 
scale  armed  conflict.  One  of  the  striking  stylized 
facts  emerging  from  the  literature  on  civil  wars  is 
that  these  wars  last  a  long  time,  and  when  they 
end,  they  frequently  restart.  These  insights  from 
rationalist  theories  of  interstate  and  intrastate  war 
point  to  an  important  role  for  third  parties  in 
mediating  and  resolving  disputes.  Third-party 
intervention  can  help  defuse  conflicts  before  they 
become  violent  by  providing  information  or  exter¬ 
nal-security  guarantees.  When  wars  do  occur, 
security  guarantees  to  help  support  a  transition  to 
peace  often  take  the  form  of  peacekeeping. 
Peacekeeping  is  how  the  international  community, 
through  its  multilateral  organizations,  can  reduce 
the  risk  that  conflicts  will  escalate. 

Today,  peacekeeping  is  the  multidimensional 
management  of  a  complex  peace  operation,  usu¬ 
ally  in  a  post-civil  war  context.  Peacekeeping, 
authorized  under  Chapter  VI  of  the  United  Nations 
(UN)  Charter,  is  designed  to  provide  interim  secu¬ 
rity  and  assist  parties  to  make  those  institutional, 
material,  and  ideational  transformations  that  are 
essential  to  make  a  peace  sustainable.  It  was  not 
always  this  way.  The  record  of  UN  peacekeeping 
begins  during  the  Cold  War  with  a  limited  activity, 
monitoring  the  performance  of  a  truce  by  two  hos¬ 
tile  parties. 

Contemporary  peacekeeping  doctrine  is  embod¬ 
ied  in  the  Report  of  the  Panel  on  UN  Peace 
Operations  (2000;  also  known  as  the  Brahimi 
Report)  and  the  Report  of  the  Secretary-General 
on  “No  Exit  Without  Strategy:  Security  Council 
Decision-Making  and  the  Closure  or  Transition  of 
United  Nations  Peacekeeping  Operations”  (2001; 
also  known  as  the  “No  Exit  Without  Strategy” 
Report ),  which  expanded  on  Secretary- General 
Boutros  Boutros-Ghali’s  1992  report  Agenda  for 
Peace  and  its  1995  Supplement.  The  goal  of 
peacekeeping  today  is  to  help  parties  achieve  sus¬ 
tainable  peace.  This  goal  can  be  achieved  by  address¬ 
ing  the  underlying  root  causes  of  hostility  and  by 
enhancing  local  capacities  for  institutional  change 
and  economic  development.  But  peacekeeping 


requires  the  cooperation  of  the  parties,  which 
must  realize  that  they  cannot  win  a  quick  and 
decisive  military  victory.  Once  the  parties  have 
reached  a  settlement  that  defines  the  contours  of  a 
postwar  political  system,  there  is  still  danger  that 
the  implementation  of  the  peace  will  fail.  It  is  at 
that  point  that  international  capacities — a  multi¬ 
dimensional  peacekeeping  force  with  a  UN 
mandate  and  resources  to  help  implement  the 
settlement — can  help  allay  the  parties’  concerns  by 
providing  transparency  on  the  peace-building 
process  and  each  other’s  observance  of  the  terms 
of  the  settlement,  by  policing  violations  of  the 
settlement  by  noncooperative  parties  or  spoilers, 
and  by  helping  jump-start  the  economy  and  offer¬ 
ing  technical  expertise  in  the  design  of  transforma¬ 
tional  postwar  political  institutions. 

These  goals  represent  a  significant  expansion  of 
the  scope  of  UN  intervention  in  the  post- 1990 
world.  The  Agenda  for  Peace  envisioned  a  role  for 
the  UN  in  preventive  diplomacy,  to  engage  the 
parties  before  they  get  to  war;  peacemaking,  to 
help  warring  parties  reach  an  agreement;  peace¬ 
keeping,  to  help  police  an  agreement  once  it  is  in 
place;  peace  enforcement,  to  convince  reticent 
parties  to  stop  fighting  through  an  application  of 
Chapter  VII  of  the  UN  Charter;  and  postconflict 
reconstruction,  to  help  parties  rebuild  their  econo¬ 
mies  when  the  war  ends.  In  the  early  years  of  UN 
peace  operations,  the  UN  mainly  aimed  to  sepa¬ 
rate  combatants  through  the  establishment  of  a 
buffer  zone  that  it  policed  in  what  is  now  called  a 
traditional  peacekeeping  mandate.  Traditional 
peacekeeping  referred  to  a  UN  presence  in  the 
field,  with  the  consent  of  all  the  parties  concerned, 
as  a  confidence-building  measure  to  monitor  a 
truce  while  diplomats  negotiated  a  comprehensive 
peace.  Peacekeeping  was  therefore  designed  as  an 
interim  arrangement  where  there  was  no  formal 
determination  of  aggression,  and  it  was  frequently 
used  to  monitor  a  truce,  establish  and  police  a 
buffer  zone,  and  assist  the  negotiation  of  a  peace. 
Typically,  it  involved  less  well-armed  troops,  and 
it  often  resulted  in  freezing  conflict  in  time  rather 
than  helping  resolve  the  underlying  causes. 

Today,  following  a  process  of  evolution  of  UN 
mandates  and  new  strategic  thinking  about  UN 
intervention,  the  model  peace  operation  combines 
elements  of  diplomacy,  peacemaking,  peacekeep¬ 
ing,  postconflict  reconstruction,  and  at  times  also 
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peace  enforcement,  aiming  to  help  build  long¬ 
term  foundations  for  stable,  legitimate  govern¬ 
ment  (see  Table  1).  To  do  this,  UN  peace  missions 
have  a  multidimensional  role,  which  ranges  from 
disarming  noncooperative  parties  and  repatriat¬ 
ing  refugees  to  monitoring  elections  and  design¬ 
ing  school  curricula.  Between  1987  and  1994,  the 
Security  Council  quadrupled  the  number  of  reso¬ 
lutions  it  issued,  tripled  the  peacekeeping  opera¬ 
tions  it  authorized,  and  multiplied  by  seven  the 
number  of  economic  sanctions  it  imposed  per 


year.  Military  forces  deployed  in  peacekeeping 
operations  increased  from  fewer  than  10,000  to 
more  than  70,000.  The  annual  peacekeeping  bud¬ 
get  skyrocketed  correspondingly  from  $230  mil¬ 
lion  to  $3.6  billion  in  the  same  period,  thus 
reaching  about  3  times  the  UN’s  regular  operat¬ 
ing  budget  of  $1.2  billion.  Yet  even  these  high 
costs  represent  a  fraction  of  the  economic  costs 
of  a  failed  peace  or  a  long  war.  Peacekeeping 
is  not  only  potentially  lifesaving,  but  it  is  also 
cost-effective. 


Table  I  Principal  United  Nations  Peacekeeping  Missions,  1 947-2003 


Mission 

Date 

Peak  Force  Size 

Function 

UNSCOB 

1947-1952 

36 

Monitor  violations  of  Greek  border 

UNCI 

1947-1951 

63 

Observe  Indonesian  ceasefire  and  Dutch  troop 
withdrawal 

UNTSO 

1948-present 

SI  2 

Report  on  Arab-Israeli  ceasefire  and  armistice 
violations 

UNMOGIP 

1949-present 

102 

Observe  Kashmir  ceasefire 

UNEF  1 

1956-1967 

6,073 

Observe,  supervise  troop  withdrawal,  and 
provide  buffer  between  Israeli  and  Egyptian 
forces 

UNOGIL 

1958 

591 

Check  on  clandestine  aid  from  Syria  to  Eebanon 
rebels 

ONUC 

1960-1964 

19,828 

Maintain  order  in  the  Congo,  expel  foreign 
forces,  and  prevent  secession  and  outside 
intervention 

UNSF 

1962-1963 

1,576 

Maintain  order  during  transfer  of  authority  in 
New  Guinea  from  Netherlands  and  Indonesia 

UNYOM 

1963-1964 

189 

Supervise  military  disengagement  in  Yemen 

UNFICYP 

1964-present 

6,411 

Prevent  internal  conflict  in  Cyprus  and  avert 
outside  intervention 

DOMREP 

1965-1966 

2 

Report  ceasefire  between  domestic  factions 

UNIPOM 

1965-1966 

96 

Observe  India-Pakistan  border 

UNEF  II 

1973-1979 

6,973 

Supervise  ceasefire  and  troop  disengagement 
and  control  buffer  zone  between  Egypt  and 
Israel 

UNDOF 

1974-present 

1,450 

Patrol  Syria-Israel  border 

UNIFIF 

1978-present 

7,000 

Supervise  Israeli  troop  withdrawal,  maintain 
order,  and  help  restore  authority  of  Lebanese 
government 

UNGOMAP 

1988-1990 

50 

Monitor  Geneva  Accords  on  Afghanistan  and 
supervise  Soviet  withdrawal 
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Mission 

Date 

Peak  Force  Size 

Function 

UNIMOG 

1988-1991 

399 

Supervise  ceasefire  and  mutual  withdrawal  of 
forces  by  Iran  and  Iraq 

UNAVEM  I 

1989-1991 

70 

Verify  withdrawal  of  Cuban  troops  from 

Angola 

UNTAG 

1989-1990 

4,493 

Assist  Namibia’s  transition  to  independence  and 
ensure  free  and  fair  elections 

ONUVEN 

1989-1990 

120 

Monitor  Nicaraguan  elections 

ONUCA 

1989-1992 

1,098 

Verify  compliance  by  Costa  Rica,  El  Salvador, 
Guatemala,  Honduras,  and  Nicaragua  with 
agreement  to  disarm  and  neutralize  irregular 
forces  in  the  area 

ONUVEH 

1990-1991 

260 

Observe  elections  in  Haiti 

UNIKOM 

1991-2003 

1,440 

Monitor  demilitarized  zone  between  Kuwait 
and  Iraq,  removed  with  the  occupation  of 

Iraq  by  an  American-led  coalition  (small 
observer  group  remains  but  technically 
nonfunctioning  and  awaiting  Security  Council 
action) 

UNAVEM  II 

1991-1995 

476 

Verify  compliance  with  Peace  Accord  to  end 
civil  strife  in  Angola 

ONUSAL 

1991-1995 

1,003 

Monitor  ceasefire  and  human  rights  agreements 
in  El  Salvador’s  civil  war 

MINURSO 

1991-present 

375 

Conduct  referendum  in  Western  Sahara  on 
independence  or  union  with  Morocco 

UNAMIC 

1991-1992 

380 

Assist  Cambodian  factions  to  keep  ceasefire 
agreement 

UNPROFOR 

1992-1995 

21,980 

Encourage  ceasefire  in  Croatia  and  Bosnia- 
Herzegovina  and  protect  relief  programs 

UNTAC 

1992-1993 

19,500 

Demobilize  armed  forces  of  Cambodian 
factions,  supervise  interim  government,  and 
conduct  free  elections 

UNOSOM  I 

1992-1993 

550 

Monitor  ceasefire  between  Somali  parties  and 
protect  shipments  of  relief  supplies 

ONUMOZ 

1992-1994 

7,500 

Supervise  internal  peace  accord  in  Mozambique, 
disarm  combatants,  establish  a  nonpartisan 
army,  hold  national  elections,  and  conduct 
humanitarian  program 

UNOMIG 

1993-present 

120 

Verify  ceasefire  agreement  with  Abkhazia  and 
observe  CIS  peacekeeping  force 

UNOMUR 

1993-1994 

100 

Observer  mission  in  Uganda-R wanda  and 
monitoring  arms  shipments 

UNOSOM  II 

1993-1995 

UN  mission  in  Somalia  and  peacemaking 
operations 

(Continued) 
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Table  I  (Continued) 


Mission 

Date 

Peak  Force  Size 

Function 

UN  AMIR 

1993-19 96 

5,500 

Stop  the  massacre  of  the  defenseless  population 
of  Rwanda,  assist  refugees,  and  report 
atrocities 

UNMIH 

1993-1996 

900 

Mission  in  Haiti,  pacification,  and  monitoring 
elections 

UNOMIL 

1993-1997 

91 

Observer  group  in  Liberia  and  monitoring  OAS 
peacekeeping 

UNASOG 

1994 

25 

Observer  group  in  Aouzou  Strip  and  Libya- 
Chad  border 

UNMOT 

1994-2000 

24 

Investigate  ceasefire  violations  and  work  with 
OSCE  and  CIS  missions  in  Tajikistan 

UNMIBH 

1995-2002 

1,584 

Monitor  law  enforcement  in  Bosnia  and 
Herzegovina 

UNPREDEP 

1995-1999 

1,150 

Preventive  deployment  force  for  the  former 
Yugoslav  Republic  of  Macedonia 

UNCRO 

1995-1996 

20 

Confidence  restoration  in  Croatia 

UNAVEM  III 

1995-1997 

5,560 

Angola  verification  in  Mission  of  the  Peace 
Accords  (1991),  the  Lusaka  Protocol  (1994), 
and  relevant  Security  Council  resolutions 

UNMOP 

1996-2002 

28 

Monitor  demilitarization  in  Prevlaka  Peninsula, 
Croatia 

UNTAES 

1996-1998 

5,257 

Facilitate  demilitarization  in  Eastern  Slavonia 
(Croatia) 

UNSMIH 

1996-1997 

1,549 

Support  Mission  in  Haiti 

UNTMIH 

1997 

300 

Transition  Mission  in  Haiti 

MINUGUA 

1997 

155 

Verification  Mission  in  Guatemala 

MIPONUH 

1997-2000 

290 

Civilian  Police  Mission  in  Haiti 

MONUA 

1997-1999 

5,560 

Observer  Mission  in  Angola  and  a  follow-on  to 
UNAVEM  III 

MINURCA 

1998-2000 

1,350 

Help  maintain  and  enhance  security  and 
stability  in  the  Central  African  Republic 

UNAMSIL 

1998-present 

109 

Observe  and  report  to  the  Security  Council  the 
military  conditions  in  Sierra  Leone 

MONUC 

1999-present 

5,537 

Monitor  ceasefire  agreement  and  provide 
humanitarian  assistance  to  the  Democratic 
Republic  of  Congo 

UNMIK  Kosovo 

1999-present 

40,000  (KFOR) 

Combine  efforts  in  pacification  of  Kosovo  with 
KFOR/NATO  forces  (essentially  a 
humanitarian  assistance) 

UNMEE  Ethiopia  and 
Eritrea 

2000-present 

4,300 

Monitor  cessation  of  hostilities 
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Mission 

Date 

Peak  Force  Size 

Function 

UNAMA  Afghanistan 

2002-present 

450 

Work  with  International  Security  Assistance 

Force  and  provide  humanitarian  aid  (not 
technically  a  peacekeeping  mission) 

UNMISET  East 

Timor  (Timor 

Leste) 

2002-present 

5,000 

Transitional  security  for  the  new  Timor  Leste 
government 

MINUCI 

2003-present 

0 

Oversee  implementation  of  Linas-Marcoussis 
Agreeement  with  ECOWAS  and  French 
troops 

UNMIL 

2003-present 

15,000 

Oversee  implementation  of  ceasefire  and  peace 
agreement,  provide  police  training,  and  assist 
in  formation  of  a  new  restructured  military 

Sources:  Compiled  by  author  from  data  in  Ziring,  L.,  Riggs,  R.,  &  Plano,  J.  (2005).  The  United  Nations  (pp.  216-219).  Belmont, 
CA:  Wadsworth;  United  Nations  Peacekeeping  Operations,  Police,  Troops  and  Military  Observers — Contributors  by  Mission 
and  Country,  UN  Peacekeeping  home  page,  December  2002;  Center  for  International  Relations,  Current  UN  Peace-Keeping 
Operations,  Zurich,  Switzerland,  PKO  webmaster,  January  1998. 


The  UN  has  a  commendable  record  of  success 
in  multidimensional  peace  operations  as  diverse 
as  those  in  Namibia  (UNTAG  [United  Nations 
Transition  Assistance  Group]),  El  Salvador 
(ONUSAL  [United  Nations  Observation  Mission 
in  El  Salvador]),  Cambodia  (UNTAC  [United 
Nations  Transitional  Authority  in  Cambodia]), 
Mozambique  (ONUMOZ  [The  United  Nations 
Operations  in  Mozambique]),  and  Eastern  Sla¬ 
vonia  (UNTAES  [United  Nations  Transitional 
Authority  in  Eastern  Slavonia,  Baranja,  and 
Western  SirmiumJ).  Success  is  of  course  an  ambig¬ 
uous  and  contested  term,  and  here,  it  implies  an 
end  to  large-scale  violence,  undivided  sovereignty 
for  the  government  of  the  host  state,  and  a  mod¬ 
est  degree  of  political  openness.  Using  this  stan¬ 
dard,  the  best  available  evidence  suggests  that  UN 
multidimensional,  second-generation  missions 
have  been  successful  in  helping  countries  transi¬ 
tion  from  war  to  peace.  The  UN  has  been  less 
successful  in  fighting  wars,  however.  Enforcement 
operations,  which  draw  on  the  authority  of 
Charter  Article  42,  present  a  challenge  for  the 
UN,  and  the  experiences  in  Bosnia  and  Somalia  in 
the  1990s  exposed  the  practical  limitations  that  a 
multilateral  organization  such  as  the  UN  faces  in 
peace  enforcement.  An  international  organization 
without  an  army  must  rely  on  the  ongoing  coop¬ 
eration  of  troop-contributing  states  and  donors 
for  financing.  This  makes  the  organization  too 


inflexible  and  underresourced  to  fight  wars.  The 
UN’s  comparative  advantage  lies  with  the  peace¬ 
enhancing  role  of  its  multidimensional  missions. 
On  average,  these  have  been  more  successful  than 
peacekeeping  missions  by  other  agencies  and 
smaller  regional  organizations.  Regional  organi¬ 
zations  (with  the  exception  of  the  North  Atlantic 
Treaty  Organization  [NATO])  may  lack  the 
technical  expertise  or  resources  for  such  complex 
missions,  or  they  may  be  perceived  as  biased 
mediators  by  the  parties. 

If  political  conflict  reaches  the  level  of  war,  and 
if  the  war  causes  high  numbers  of  deaths  and 
extensive  physical  and  economic  destruction,  and 
if  parties  to  the  war  are  many  and  incoherent, 
unable  to  implement  a  negotiated  peace  settle¬ 
ment,  then  UN  peacekeeping  may  help  shore  up 
support  for  a  lasting  peace  that  is  based  on  a  mutu¬ 
ally  acceptable  compromise. 

Nicholas  Sambanis 
Yale  University 
New  Haven,  Connecticut,  United  States 


Author’s  Note:  This  chapter  draws  on  Michael  W.  Doyle 
and  Nicholas  Sambanis’s  Making  War  and  Building  Peace: 
United  Nations  Peace  Operations  (Princeton,  NJ:  Princeton 
University  Press,  2006). 
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Peasants’  Movements 


Peasants’  movements  have  been  important  expres¬ 
sions  of  the  formation  of  the  modern  world,  and 
they  still  are.  Whether  resisting  colonization  of  the 
commons,  protesting  capture  of  their  granaries  by 
merchants  and  rulers,  disputing  land  claims,  con¬ 
tending  with  centralizing  states  over  the  unequal 
terms  of  their  incorporation,  or  joining  millenar- 
ian  struggles,  peasants  have  been  compelled  to 
respond  to  the  array  of  forces  attending  the  devel¬ 
opment  of  capitalist  property  relations.  Why 
should  this  not  be  the  case  today?  Conventional 
wisdom  would  have  it  that  the  world’s  peasantries 
are  destined  to  disappear  in  the  face  of  competi¬ 
tive  pressures  from  capital-intensive  agriculture, 
expelling  them  into  burgeoning  urban  labor  forces. 
This  development  scenario  embodies  a  modernity 
trope  that  modern  civilization  is  centered  on  the 
city,  as  an  escape  from  parochialism  and  the 
drudgery  of  rural  life.  Peasants,  here  understood 
in  a  wider  sense  as  agricultural  smallholders  or 
laborers,  but  also  including  pastoralists  or  fisher¬ 
men,  are  still  a  social  force  to  reckon  with.  In  the 
following,  some  of  the  regional  and  transnational 
forms  are  discussed  that  have  gained  a  voice 
addressing  major  issues  such  as  food  security,  eco¬ 
logical  sustainability,  and  biodiversity. 

While  certainly  the  demographic  composition  of 
the  world’s  population  is  now  majority  urban,  and 
peasant  agriculture  is  evidently  increasingly  mar¬ 
ginal  in  global-market  terms,  each  of  these  indica¬ 
tors  is  deeply  problematic.  And  it  has  taken  the  rise 
and  consolidation  of  a  contemporary  peasants’ 
movement  to  underline  the  shortcomings  of  such 
seemingly  commonsense  observations.  At  present, 
there  are  more  than  1  billion  slum  dwellers  across 


the  world,  accounting  for  one  in  six  of  the 
world’s  population.  While  many  of  these  seem¬ 
ingly  redundant  people  do  work  in  sweatshops  at 
the  extreme  end  of  corporate  supply  chains,  or 
recycle  urban  waste,  they  are  not  absorbed  by  the 
urban-industrial  jobs  predicted  by  the  develop¬ 
ment  paradigm.  Furthermore,  the  displacement  of 
peasant  agriculture  is  largely  due  to  the  with¬ 
drawal  of  protections  and  the  opening  up  of 
domestic  markets  to  cheap,  subsidized  (at  below 
production  cost)  foodstuffs  from  the  cereal¬ 
exporting  regions  of  the  global  North  and  the  role 
of  Southern  states  in  diverting  land  use  for  agro¬ 
exporting  as  a  condition  of  debt  rescheduling. 

The  role  of  agro- exporting  has  shaped  agricul¬ 
tural  policy  in  a  neoliberal  world,  where  food 
security  has  been  redefined  from  a  social-contract 
function  to  a  global-market  function.  The  short¬ 
coming  of  this  policy  shift  became  clear  during  the 
2007-2008  food  crisis,  when  governments  found 
national  food  security  compromised  by  the  inter¬ 
ruption  of  their  food  import  dependence.  But  the 
transnational  peasants’  movement  had  already 
identified  this  problem,  opposing  neoliberal  argu¬ 
ments  for  food  security  provided  by  food  traders 
with  the  concept  of  food  sovereignty,  which  pro¬ 
claimed  hunger  as  a  problem  of  rights  not  of 
means.  Initially,  food  sovereignty  was  defined  as  a 
territorial  right  to  produce  food  premised  on  the 
notion  that  food  is  a  source  of  nutrition  first  and 
only  secondarily  a  trade  item.  This  intervention 
underlined  the  vulnerability  of  agricultural  produc¬ 
ers  to  subsidy  schemes  favoring  corporate  agricul¬ 
ture  in  a  highly  uneven  global  market,  maintaining 
that  neoliberal  multilateral  and  national  policies 
privilege  the  rights  of  food  traders  and  agribusiness 
over  peasants  and  low-income  consumers.  And  it 
made  visible  the  dramatic  dispossession  of  small¬ 
holders  everywhere  by  the  liberalization  of  agricul¬ 
tural  trade. 

The  transnational  peasant  (pastoralist,  fisher- 
folk,  and  rural  workers)  movement,  La  Via 
Campesina,  or  the  Peasants’  Way,  articulated  the 
concept  of  food  sovereignty  in  the  1996  Rome 
World  Food  Summit,  proclaiming  the  right  of 
nations  to  self-sufficiency,  and  respect  of  cultural 
and  productive  diversity,  as  a  direct  critique  of  the 
market-driven  definition  of  food  security  and  its 
unrealized  claim  to  feed  the  world  with  Northern 
food  surpluses.  Representing  millions  of  farmers 
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through  more  than  150  organizations  across  more 
than  50  countries,  La  Via  Campesina  is  the  promi¬ 
nent  voice  of  a  social  force  advocating  for  the 
survival  of  small  producers  claiming  the  right  to 
produce  staple  foods,  to  reproduce  society,  and  to 
manage  local  resources.  In  making  these  claims,  the 
peasants’  movement  calls  into  question  a  market 
episteme  that  governs  the  development  vision.  From 
the  movement’s  perspective,  it  is  an  episteme  that 
assumes  scale  efficiency  on  questionable  grounds: 
highly  subsidized  energy-intensive  industrial  agricul¬ 
ture  that  externalizes  environmental  costs  (e.g.,  soil 
and  genetic  erosion,  greenhouse  gas  emissions,  and 
chemical  pollution)  and  social  impacts  (e.g.,  dis¬ 
placed  populations,  food  dependence,  and  pesti¬ 
cide  contamination),  supported  by  asymmetrical 
trade  rules.  In  contrast,  the  United  Nations  (UN)- 
sponsored  International  Assessment  of  Agricultural 
Knowledge,  Science  and  Technology  for  Develop¬ 
ment  (IAASTD)  Report  (2008)  noted,  for  example, 
that  a  half-hectare  plot  in  Thailand  produces  70 
species  of  vegetables,  fruits,  and  herbs,  improving 
nutrition  and  feeding  more  people  than  a  half-hectare 
plot  of  a  high-yielding  rice  variety,  and  such  biodi- 
verse  farming  is  far  less  energy  intensive,  replenishing 
soil  and  water  nutrients  and  cycles  and  sustaining 
farming  as  a  culture  rather  than  supporting  a  remote 
business.  This  ontological  distinction  is  central  to  the 
vision  of  La  Via  Campesina,  arguably  the  most 
politicized  of  the  peasants’  movements  today. 

While  highly  diverse  (respecting  locally  specific 
social  and  ecological  projects),  La  Via  Campesina 
nevertheless  has  forged  a  historic  and  common 
politics  of  resistance  to  the  commodification  of 
land,  seed,  and  food  and  to  a  World  Trade 
Organization  (WTO)  trade  regime  premised  on  the 
efficiency  calculus.  As  a  transnational  coalition,  for 
example,  it  ranges  from  Brazil’s  Landless  Workers 
Movement  (Movimiento  dos  Trabalhadores  Rurais 
Sem  Terra,  MST),  with  its  emphasis  on  resettling 
displaced  rural  people  and  favela  dwellers  in  landed 
cooperatives;  through  the  movement  of  rich  farm¬ 
ers:  the  Karnataka  State  Farmers’  association 
(Karnataka  Rajya  Raita  Sangha,  KRRS)  in  India, 
opposed  to  biotechnology  and  global  agribusiness 
as  well  as  land  reform,  and  the  foundational 
Peasant  Movement  of  the  Philippines  (Kilusang 
Magbubukid  ng  Pilipinas,  KMP)  in  the  Philippines; 
to  the  European  Farmer  Co-ordination  (Coordin¬ 
ation  Paysanne  Europeenne,  CPE),  dedicated  to 


preserving  seed  sovereignty  as  the  basis  of  farmer 
autonomy.  The  coalition  is  dedicated  to  respecting 
the  different  needs  and  conditions  of  its  chapters, 
even  as  it  unites  on  global  campaigns.  For  example, 
the  Global  Campaign  for  Agrarian  Reform  (GCAR) 
has  directly  challenged  the  World  Bank’s  Market- 
Led  Agrarian  Reform  (MLAR)  development  strat¬ 
egy,  with  some  success  in  reshaping  debates  in  the 
development  institutions  and  maintaining  pressure 
through  land  occupations,  urban  protests,  and 
anti-genetically  modified  (anti-GM)  crop  activities. 
Networking  allies  such  as  the  Food  First  Information 
and  Action  Network  (FIAN),  focusing  on  land 
rights,  and  the  Land  Research  and  Action  Network 
(LRAN)  constitute  a  combined  mobilization  around 
land  and  food  sovereignty. 

The  principle  of  food  sovereignty,  while  appeal¬ 
ing  to  a  Westphalian  concept  as  a  strategic  inter¬ 
vention,  arguably  forms  a  vision  of  a  different 
future.  First,  it  politicizes  the  global  food  system, 
revealing  its  capture  by  corporations  for  which 
food  is  just  another  investment  portfolio.  Second, 
it  challenges  the  ontology  of  economism,  which 
erases  the  viability  of  small  producers,  physically 
expelling  them  from  their  territories  and  epistemo¬ 
logically  removing  them  from  history.  By  linking 
the  food  trade  with  dispossession,  the  peasants’ 
movement  reveals  the  relationship  between  de- 
peasantization  and  the  formation  of  a  flexible 
surplus  labor  force  for  global  industry.  Third,  the 
ontology  of  the  food  sovereignty  movement 
encourages  the  revaluation  of  agriculture,  rurality, 
and  food  as  foundational  to  general  social  and 
ecological  sustainability.  And  fourth,  the  social 
force  leading  this  charge  is  a  politicized  peas¬ 
antry — not  simply  reviving  “tradition”  but  draw¬ 
ing  on  tacit  knowledge  of  ecological  relations  and 
customary  principles  of  social  cooperation  as  the 
bedrock  of  human  survival  in  an  era  of  climatic 
and  environmental  emergency. 

Of  course,  the  transnational  peasants’  move¬ 
ment  cannot  change  the  world  on  its  own.  But  this 
movement  is  changing  the  way  we  think  about  the 
world,  and  its  future  possibilities.  By  giving  voice 
to  the  world’s  remaining  peasants,  the  movement 
denaturalizes  the  vision  of  food  security  via  agro¬ 
industrialization.  This  in  turn  resonates  with  grow¬ 
ing  concern  over  food  safety,  food  miles,  and  the 
environmental  degradation  associated  with  energy- 
intensive  agriculture.  The  peasants’  movement 
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thus  connects  with  new  social  movements 
concerned  with  reconfiguring  societies  around 
reducing  social  distance  and  carbon  emissions, 
eco-agriculture,  and  renaturalizing  food.  Here,  the 
alternative  political  coordinates  of  the  bioregion 
concept  depends  on  reducing  the  social  and  physical 
divide  between  urban  and  rural  and  reconnecting 
social  reproduction  with  its  natural  foundations. 
Food  sovereignty,  as  a  political  slogan  and  plat¬ 
form,  reasserts  both  the  self-organizing  capacity  of 
communities  of  people  and  the  centrality  of  agri¬ 
culture  to  human  survival. 

The  transnational  peasants’  movement  is 
uneven  and  differentiated.  Its  strongest  regions  are 
Latin  America  and  Southeast  Asia,  whereas 
African  farmers  are  still  developing  organizational 
capacity.  Organizations  rise  and  fall — for  exam¬ 
ple,  the  Central  American  Peasant  Coalition 
(Asociacion  de  Organizaciones  Campesinas  de 
Centroamerica  para  la  Cooperation  y  el  Desarrollo, 
ASOCODE)  was  a  powerful  regional  coalition  in 
the  1990s,  funded  by  European  nongovernmental 
organizations  (NGOs)  and  dedicated  to  forming 
indigenous  peasant  networks  and  coordinating 
activities  with  the  Central  American  Farmer  to 
Farmer  (Campesino  a  Campesino)  Movement 
(peasant-based  extension).  La  Via  Campesina 
emerged  in  this  context,  its  durability  depending 
on  its  political  autonomy  and  its  distrust  of  NGO 
alliances.  This  stemmed  from  the  need  for  self¬ 
representation  in  an  environment  of  skepticism 
toward  smallholders — reinforced  by  the  conserva¬ 
tive,  rich-farmer  organization,  the  International 
Federation  of  Agricultural  Producers,  which  has 
until  now  monopolized  access  to  organizations 
such  as  the  Food  and  Agriculture  Organization 
(FAO)  and  its  branch,  the  International  Fund  for 
Agricultural  Development  (IFAD).  The  slogan 
“not  about  us  without  us”  became  essential  to  the 
insertion  of  a  peasant  voice  in  global  civil  society, 
especially  as  neoliberal  development  projects 
impinged  on  peasant  futures.  Thus,  the  recent 
Plan  Puebla-Panama,  an  industrial  corridor  link¬ 
ing  southern  Mexico  to  Panama  to  mobilize  indig¬ 
enous  labor,  spawned  two  quite  different  peasant 
networks  (representing  various  peasant  unions)  to 
negotiate  or  contest  it,  respectively:  the  Meso- 
american  Initiative  for  Trade,  Integration  and 
Sustainable  Development  and  the  Mesoamerican 
Peasant  Platform. 


Farther  south,  in  Brazil,  the  Movement  of  Small 
Farmers  (Movimiento  de  Pequenos  Agricul tores),  a 
peasant  union,  has  mobilized  more  than  10,000 
families  in  more  than  a  dozen  states  to  oppose  agri¬ 
business  plantations  of  soy,  sugar,  and  eucalyptus 
works,  while  the  more  substantial  and  iconic 
Landless  Rural  Workers’  Movement  (Movimento 
Dos  Trabalhadores  Rurais  Sem  Terra,  MST)  in 
Brazil  has  managed  to  settle  roughly  half  a  million 
peasants  and  landless  workers  on  seized,  unused 
land  to  produce  food  for  Brazil’s  working  poor  and 
broaden  the  meaning  of  citizenship  to  agrarian 
social  projects.  Meanwhile,  in  Africa,  the  Network 
of  Farmers’  and  Agricultural  Producers’  Organisa¬ 
tions  of  West  Africa  (Reseau  des  Organisations 
Paysannes  et  de  Producteurs  Agricoles  de  L’Afrique 
de  Le’Ouest,  ROPPA)  is  developing  methods  of 
documenting  flexible  seed  selection  by  farming 
women  in  West  Africa  as  a  form  of  drought  man¬ 
agement  (with  climate  change),  and  the  Agriculture 
Peasant  and  Modernisation  Network-Mondial 
(Reseau  Mondial  Agricultures,  Paysannes,  Ali¬ 
mentation  et  Mondialisation,  APM  Mondial), 
formerly  a  regional  network  in  Africa  and  now 
organizationally  transnational,  is  developing  ties 
with  the  Chinese  peasantry,  the  last  of  the  peasant¬ 
ries  to  be  drawn  into  the  movement. 

Under  pressure  everywhere,  peasants  are  self¬ 
organizing  to  protect  their  rights  and,  with  allies, 
are  building  networks  and  consolidating  a  voice  in 
global  development  fora,  including  demanding  a 
reformed  UN  and  alternative  multilateral  institu¬ 
tions  such  as  a  Convention  on  Food  Sovereignty 
and  Trade  in  Food  and  Agriculture  and  a  World 
Commission  on  Sustainable  Agriculture  and  Food 
Sovereignty.  Today,  peasants’  movements  are 
strengthening  through  mobilization  as  well  as  a 
public  realization  of  environmental  limits.  In  this 
sense,  the  significance  of  peasants’  movements  lies 
in  the  recentering  of  agriculture  as  a  process  of 
reterritorializing  modernity — that  is,  reversing  the 
marginalization  of  land-based  culture  and  direct¬ 
ing  the  world’s  attention  to  securing  its  ecological 
nest  as  a  survival  strategy. 

Philip  McMichael 

Cornell  University 
Ithaca,  New  York,  United  States 
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Performance 


At  the  most  general  level,  performance  refers  to 
two  dimensions  of  an  action:  the  doing  of  it 
(descriptive  aspect)  and  how  well  or  badly  the 
actor  is  doing  something  (evaluative  aspect).  In 
political  science  literature,  political  performance  is 
defined  in  broad  and  narrow  ways,  both  building 
on  the  general  definition  outlined  above.  Broadly 
defined,  political  performance  comprises  the  activ¬ 
ities  of  political  actors  (descriptive)  as  well  as  the 
evaluation  of  these  activities  and  their  outcomes. 
Hence,  it  concerns  the  description  of  particular 
activities  such  as  passing  bills  and  spending  money 
as  well  as  the  assessment  of  whether  political 
actors  make  public  policies  (the  activity),  for 
example,  in  efficient  ways  and  whether  they 
achieve  intended  goals  (outcomes)  such  as  wealth 
or  liberty.  Narrowly  understood,  political  perfor¬ 
mance  is  only  an  evaluative  concept,  referring  to 
the  evaluation  of  what  political  actors  do  and  what 
the  outcomes  of  their  actions  are.  This  narrow 
definition,  dominating  in  scientific  research,  is  the 
one  adopted  here.  Some  authors  suggest  a  further 
narrowing  of  the  concept  by  focusing  on  the  evalu¬ 
ation  of  governments  or,  more  specifically,  on  the 
evaluation  of  democratic  governments.  In  this 
case,  the  terms  quality  of  government  or  good  gov¬ 
ernment  are  synonyms  for  political  performance. 
Here,  such  a  specification  is  renounced  in  favor  of 
a  more  comprehensive  understanding  of  the  con¬ 
cept  applicable  to  all  political  actors. 

This  entry  outlines  the  different  theoretical 
and  historical  contexts  of  research  on  political 
performance,  classifies  criteria  for  evaluating 


political  performance,  discusses  issues  of  mea¬ 
surement,  reviews  theories  and  research  explain¬ 
ing  political  performance,  and,  finally,  suggests 
future  research  perspectives. 

Theoretical  and  Historical  Contexts 

The  idea  of  evaluating  political  systems  is  a  core 
concern  in  classical  political  science  literature. 
Criteria  of  political  performance  such  as  liberty  and 
equality  dominated  the  writings  of  classical  political 
theorists  (e.g.,  Thomas  Hobbes  and  Jean-Jacques 
Rousseau)  in  their  search  for  good  types  of  govern¬ 
ment.  Additionally,  performance  criteria  such  as  the 
common  good  and  stability  served  as  central  yard¬ 
sticks  for  scholars  of  comparative  government, 
from  Aristotle  to  Karl  Loewenstein,  in  describing 
and  comparing  different  nondemocratic  and  demo¬ 
cratic  regimes.  This  preliminary  stage  in  the  debate 
on  political  performance,  however,  is  characterized 
by  the  fact  that  its  protagonists  did  not  make 
explicit  use  of  the  term  political  performance. 
Furthermore,  they  mainly  relied  on  theoretical 
arguments  and  unsystematic  empirical  observations 
to  assess  the  merits  of  different  systems. 

The  first  systematic  theoretical  and  empirical 
work  on  political  performance  was  not  presented 
until  the  1970s.  At  this  point,  two  crucial  obsta¬ 
cles  had  been  overcome.  The  first  obstacle  con¬ 
cerns  the  availability  of  cross-national  data.  It 
became  less  of  an  obstacle  with  the  collection  and 
documentation  of  a  wide  range  of  social  and 
political  indicators  for  many  independent  coun¬ 
tries  of  the  world.  First  editions  of  several  data 
handbooks  appeared  during  the  1960s  (e.g., 
World  Handbook  of  Political  and  Social 
Indicators).  The  second  obstacle  concerns  the 
deliberate  avoidance  of  evaluation  by  empirically 
oriented  political  scientists.  This  was  revised  when 
political  scientists  dared  to  make  explicit  and  sys¬ 
tematic  appraisals  of  concrete  political  systems  on 
the  basis  of  empirical  data.  The  outset  of  this 
phase  can  be  dated  to  1971,  when  Harry  Eckstein 
developed  and  justified  theoretical  criteria  for 
evaluating  political  systems  and  when  Ted  Gurr 
with  Muriel  McClelland  made  a  first  systematic 
attempt  to  empirically  translate  and  apply  these 
criteria  to  a  sample  of  democratic  and  nondemo¬ 
cratic  countries.  In  1978,  Gabriel  Almond  and  G. 
Bingham  Powell  developed  the  most  influential 
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and  lasting  concept  of  political  performance 
emerging  from  this  phase.  In  the  context  of  sys¬ 
tems  theory,  they  suggested  their  concept  of 
political  productivity,  enclosing  eight  different 
political  goods:  (1)  system  maintenance,  (2)  sys¬ 
tem  adaptation,  (3)  participation,  (4)  compliance 
and  support,  (5)  procedural  justice,  (6)  welfare, 
(7)  security,  and  (8)  liberty.  Additionally,  they 
empirically  studied  the  attainment  of  some  of  these 
goods  in  several  democratic  and  nondemocratic 
countries.  This  first  and  early  stage  of  performance 
research,  however,  provoked  only  a  few  isolated 
studies.  These  studies  mainly  used  systems  theory 
as  a  frame  of  reference  and  analyzed  empirically 
democratic  and  nondemocratic  systems. 

In  the  aftermath,  research  on  political  perfor¬ 
mance  advanced  significantly  and  resulted  in  the 
establishment  of  an  independent  research  area  in 
the  1990s.  At  least  two  reasons  account  for  this 
advancement — one  historical,  the  other  theoretical. 
Historically,  the  collapse  of  the  state  socialist  sys¬ 
tems  in  Central  and  Eastern  Europe,  resulting  in  the 
abolition  of  the  most  important  alternative  to 
democracy,  turned  the  attention  of  comparative 
politics  to  quality  differences  within  democracies, 
more  specifically  to  quality  differences  between 
various  types  of  democratic  governments.  Theor¬ 
etically,  the  rise  of  new  institutionalism  in  the 
1980s,  with  its  central  premise  that  different  institu¬ 
tional  arrangements  produce  different  results  (insti¬ 
tutions  matter),  directed  the  attention  to  political 
performance.  Robert  Putnam’s  Making  Democracy 
Work  (1993)  and  Arend  Lijphart’s  Patterns  of 
Democracy  (1999)  are  among  the  most  important 
and  influential  works  in  this  period.  Putnam  sug¬ 
gested  responsiveness  and  effectiveness  as  central 
criteria  in  evaluating  the  performance  of  democratic 
government;  he,  furthermore,  introduced  social 
capital  in  addition  to  the  already  existing  factors 
explaining  political  performance — namely,  political 
institutions  and  socioeconomic  modernity.  Lijphart, 
on  the  other  hand,  is  interested  in  the  performance 
of  so-called  majoritarian  and  consensus  democracy. 
He  asks  which  type  works  best,  and  he  offers  the 
most  comprehensive  empirical  analysis,  covering  36 
democracies  and  32  performance  indicators  catego¬ 
rized  into  four  areas  (macroeconomic  management, 
control  of  violence,  quality  of  government,  and 
kinder  and  gentler  policy  areas:  welfare  state,  envi¬ 
ronmental  policy,  criminal  justice,  and  foreign  aid). 


While  these  and  other  studies  mainly  focus  on  the 
policy  performance  of  democracies,  the  issue  of 
democratic  quality  or  the  quality  of  democracy  itself 
gained  interest  since  2000.  Since  then,  democratic 
criteria  such  as  accountability,  responsiveness,  free¬ 
dom,  and  equality  are  increasingly  considered  and 
studied  empirically  in  the  research  on  political 
performance. 

Presently,  political  performance  is  an  estab¬ 
lished  field  of  research  in  political  science.  It  is 
mainly  concerned  with  evaluating  different  types 
of  democratic  governments  such  as  majoritarian 
and  consensus  democracies  or  presidential  and 
parliamentary  democracies,  on  the  one  hand,  and 
in  evaluating  specific  democratic  institutions  such 
as  electoral  systems,  party  systems,  and  federalism, 
on  the  other.  At  first  sight,  this  focus  on  evaluating 
political  institutions  seems  to  contradict  the 
adopted  definition  of  political  performance  refer¬ 
ring  to  political  actors.  This  definition,  however,  is 
consistent  with  new  institutional  theory.  Accord¬ 
ing  to  this  theory,  only  actors — not  political 
institutions — can  act.  Political  institutions  exert  an 
influence  on  individual  or  collective  actors  by  con¬ 
stituting  constraints  on  their  actions. 

A  Typology  of  Criteria 

of  Political  Performance 

Normative  criteria  are  essential  for  an  explicit  and 
systematic  evaluation  of  political  performance; 
they  function  as  yardsticks  against  which  the  activ¬ 
ities  of  political  actors  can  be  assessed.  To  be  able 
to  evaluate  different  political  systems,  these  norma¬ 
tive  criteria  should  represent  universal  values. 
Furthermore,  there  should  be  consensus  that  these 
values  embody  legitimate  obligations  of  political 
systems,  which  means,  on  the  one  hand,  that  these 
values  guide  what  political  actors  do  and,  on  the 
other  hand,  that  citizens  demand  from  government 
the  attainment  of  these  values.  Finally,  these  uni¬ 
versal  values  should  be  justified  theoretically. 

Most  authors  propose  individual  concepts  of 
performance  consisting  of  a  list  of  criteria  without 
referring  to  those  of  others.  As  a  result,  a  multiplic¬ 
ity  of  heterogeneous  criteria  of  political  perfor¬ 
mance  exists,  and  there  is  little  convergence  between 
the  different  lists  proposed  by  different  authors. 
With  the  help  of  a  typology  of  performance  criteria 
suggested  by  Edeltraud  Roller  (2005),  the  most 
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important  types  of  criteria  can  be  identified.  The 
typology  refers  to  the  performance  of  liberal 
democracies  and  is  based  on  two  dimensions: 

•  The  first  dimension  distinguishes  between  goal- 
oriented  and  general  performance.  Goal- 
oriented  or  substantive  performance  aims  at 
the  attainment  of  particular  goals  such  as  wel¬ 
fare  or  liberty.  General  or  procedural  perfor¬ 
mance  is  independent  of  these  particular  goals; 
it  helps  promote  the  attainment  of  any  particu¬ 
lar  goals.  Examples  are  stability  or  efficient  use 
of  resources. 

•  The  second  dimension  differentiates  between 
systemic  and  democratic  performance.  It  con¬ 
ceptualizes  different  normative  expectations 
that  exist  with  regard  to  political  systems  in 
general  and  with  regard  to  democratic  systems 
in  particular.  Systemic  performance  refers  to 
achievements  every  political  system  must  gen¬ 
erate  for  society,  such  as  economic  growth  and 
stability.  Democratic  performance  refers  to 
specific  criteria  that  are  to  be  ensured  by  demo¬ 
cratic  systems,  such  as  liberty,  responsiveness, 
and  accountability. 

By  combining  these  two  dimensions,  four  types 
of  performance  criteria  can  be  established.  Both 
goal-oriented  and  general  political  performance 
can  be  distinguished  according  to  whether  they  are 
to  be  provided  by  democracies  either  in  their  func¬ 
tion  as  political  system  in  general  or  in  their  func¬ 
tion  as  democratic  system  in  particular.  Examples 
for  goal-oriented,  systemic  criteria  are  security  and 
welfare;  examples  for  goal-oriented,  democratic 
criteria  are  liberty  and  equality;  examples  for  gen¬ 
eral,  systemic  criteria  are  efficiency  and  stability; 
and  examples  for  general,  democratic  criteria  are 
accountability  and  participation. 

Applying  this  typology,  one-dimensional  and 
multidimensional  concepts  of  political  perfor¬ 
mance  can  be  distinguished.  While  one-dimensional 
concepts  rely  on  a  single  type  of  performance  cri¬ 
teria,  multidimensional  concepts  combine  different 
types  of  criteria.  For  example,  Harry  Eckstein  sug¬ 
gested  a  one-dimensional  concept  of  general  per¬ 
formance  valid  for  all  political  systems.  It  includes 
durability,  civil  order,  legitimacy,  and  decisional 
efficacy.  Roller’s  normative  model  of  political 
effectiveness  is  also  a  one-dimensional  concept 


aiming  at  a  complete  list  of  goal-related,  systemic 
performance  criteria.  It  covers  international  secu¬ 
rity,  domestic  security,  wealth,  socioeconomic 
security  and  socioeconomic  equality,  and  envi¬ 
ronmental  protection.  Larry  Diamond  and 
Leonardo  Morlino  (2005)  developed  a  two- 
dimensional  concept  of  quality  of  democracy 
covering  two  general  (rule  of  law  and  account¬ 
ability)  and  three  goal-oriented  (responsiveness, 
liberty,  and  equality)  criteria.  Finally,  Almond 
and  Powells’s  concept  of  political  productivity 
(1978)  is  a  truly  multidimensional  concept  cover¬ 
ing  all  four  types  of  criteria:  (1)  goal-related, 
systemic  criteria  (welfare  and  security);  (2)  goal- 
related,  democratic  criteria  (liberty);  (3)  general, 
systemic  criteria  (system  maintenance,  system 
adaptation,  compliance,  and  support);  and  (4) 
general,  democratic  criteria  (participation  and 
procedural  justice). 

These  and  other  suggested  concepts  of  political 
performance  share  the  assumption  that  different 
values  cannot  be  attained  simultaneously.  They 
assume  trade-offs  or  conflictual  relationships, 
meaning  that  one  value  can  be  attained  only  at  the 
cost  of  another  value.  Trade-offs  are  proposed 
between  different  types  of  performance  criteria 
(e.g.,  policy  performance  and  democratic  quality) 
as  well  as  within  types  of  performance  criteria 
(e.g.,  wealth  and  socioeconomic  equality).  Authors 
differ,  however,  with  respect  to  the  proposed 
trade-offs.  Their  ideological  position  often  deter¬ 
mines  whether  specific  performance  criteria  are 
seen  as  incompatible  or  mutually  compatible. 
Irrespective  of  the  assumed  type  of  relationship, 
any  assessment  of  performance  should  entail  an 
analysis  of  the  trade-offs  between  different  values. 

Finally,  as  far  as  the  theoretical  justification  of 
performance  criteria  is  concerned,  most  authors 
tend  to  select  their  criteria  arbitrarily.  If  they  jus¬ 
tify  them  at  all,  they  refer  to  different  theories 
depending  on  the  type  of  performance  criteria.  In 
the  case  of  democratic  criteria,  whether  goal  ori¬ 
ented  or  general,  normative  democratic  theory 
(e.g.,  liberal  democracy)  is  used.  In  the  case  of 
systemic  criteria,  especially  goal-oriented  criteria 
such  as  security,  wealth,  and  environmental  pro¬ 
tection,  empirical  political  theories,  especially 
theories  on  the  development  of  policies  or  the 
expansion  of  the  role  of  government,  are  used  to 
justify  performance  criteria. 
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Measurement 

The  measurement  of  political  performance  neces¬ 
sarily  requires  a  comparative  approach.  Political 
performance  is  either  assessed  in  relation  to  theo¬ 
retically  defined  normative  criteria  (e.g.,  a  literacy 
rate  of  100%)  or  in  comparison  with  empirically 
defined  maximum  or  minimum  values  (e.g.,  rela¬ 
tive  to  the  country  with  the  best  and/or  the  worst 
literacy  rate).  Most  often,  the  second  type  of  com¬ 
parison  is  used  in  empirical  research. 

In  general,  the  measurement  of  political  perfor¬ 
mance  raises  several  issues,  which  are  discussed  as 
follows. 

Structure  Versus  Process 

Although  authors  agree  that  political  perfor¬ 
mance  refers  to  the  evaluation  of  the  political 
process — that  is,  what  political  actors  actually  do — 
some  authors  measure  political  performance  at  the 
level  of  political  structure.  This  is  especially  true  in 
the  case  of  democratic  performance.  While  mea¬ 
sures  of  democratic  structure  identify  whether  a 
democratic  structure  exists  and  is  actually  working 
in  a  country  or  not  (e.g.,  universal  suffrage  and 
competitive  elections),  measures  of  democratic 
quality  presuppose  such  a  democratic  structure  and 
assess  to  what  extent  democratic  performance  is 
given  (e.g.,  responsiveness  and  accountability). 
Obviously,  there  is  a  fundamental  theoretical  differ¬ 
ence  between  structure  and  process,  and  valid  mea¬ 
sures  of  political  performance  should  refer  to  the 
political  process.  Nevertheless,  while  the  difference 
is  clear  in  conceptual  terms,  it  is  rather  difficult  to 
separate  between  both  dimensions  empirically. 

Outputs  Versus  Outcomes 

Conceptually,  performance  does  not  refer  to 
actions  or  efforts  to  reach  goals,  whether  in  the 
form  of  laws,  personal  spending,  or  state  spending, 
but  to  the  outcomes  or  actual  results  of  these 
actions.  Rather  than  using  output  indicators  (e.g., 
the  degree  of  redistribution  measured  on  the  basis 
of  social  expenditures),  outcome  indicators  should 
be  used  (e.g.,  poverty  rate).  While  outcomes  are 
the  real  test  as  to  whether  the  outputs  have  pro¬ 
duced  the  intended  results  or  not,  usually  they  can¬ 
not  be  directly  controlled  by  political  actors. 
Hence,  the  question  has  been  raised  whether  one 


can  speak  of  political  performance  here  at  all.  To 
avoid  this  problem,  some  authors  consciously 
decide  to  use  outputs  rather  than  outcomes  as 
measure  for  political  performance.  For  example, 
Putnam’s  Index  of  Institutional  Performance  pri¬ 
marily  measures  outputs  such  as  the  number  of 
day  care  centers  and  health  expenditures  rather 
than,  say,  mortality  rates.  This  is  convincing  only 
on  condition  that  outputs  could  serve  as  valid 
proxies  for  outcomes.  But  this  is  a  much  disputed 
assumption.  In  their  attempts  to  measure  perfor¬ 
mance,  other  scholars  deliberately  focus  on  actual 
results  or  outcomes  rather  than  mere  efforts  to 
reach  goals.  For  example,  Roller’s  Index  of 
Effectiveness  of  Democracies  is  designed  as  a  pure 
measure  of  outcomes  covering  indicators  such  as 
poverty,  infant  mortality,  and  unemployment  rate. 

Objective  Versus  Subjective  Measures 

In  principle,  political  performance  can  be  mea¬ 
sured  either  on  the  basis  of  objective  measures 
(e.g.,  the  rate  of  unemployment  and  the  extent  of 
congruence  between  voters’  policy  preferences  and 
government’s  ideological  stances)  or  on  the  basis 
of  subjective  evaluations  of  citizens  (e.g.,  citizens’ 
satisfaction  with  the  functioning  of  democracy  and 
citizens’  perception  of  the  degree  of  government’s 
responsiveness  to  people’s  demands).  Whether 
objective  or  subjective  measures  should  be  used  is 
a  theoretical  decision  depending  on  the  research 
question.  It  is  equally  valid  to  ask  for  the  quality  of 
government  on  the  basis  of  objective  criteria  and 
to  ask  for  citizens’  evaluation  of  the  quality  of 
government.  Due  to  the  substantial  difference 
between  both  dimensions,  however,  objective  mea¬ 
sures  of  political  performance  cannot  be  used  as 
proxies  for  subjective  measures  and  vice  versa. 

Single  Versus  Composite  Measures 

Normally,  studies  on  political  performance  are 
interested  not  only  in  the  assessment  of  several 
specific  criteria  (e.g.,  unemployment  rate  and  pov¬ 
erty)  but  also  in  the  overall  level  of  performance  of 
political  actors.  The  most  efficient  method  to 
gauge  the  overall  level  of  political  performance  is 
to  construct  a  composite  measure  integrating  and 
summarizing  evaluations  of  several  performance 
dimensions.  The  available  composite  measures 
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mainly  refer  to  policy  performance.  The  well- 
known  Human  Development  Index  (HDI)  of  the 
United  Nations  Development  Programme  (UNDP) 
represents  one  example  for  such  a  measure.  It  sub¬ 
sumes  three  dimensions  of  welfare — a  long  and 
healthy  life,  knowledge,  and  a  decent  standard  of 
living — and  relies  on  four  indicators  (life  expec¬ 
tancy,  adult  literacy  rate,  gross  enrollment  in 
school,  and  gross  domestic  product).  It  assesses  the 
average  achievements  in  a  country  in  these  dimen¬ 
sions  of  welfare.  Roller’s  Index  of  Effectiveness  of 
Democracies  represents  another  composite  measure 
aiming  at  the  policy  level.  It  takes  a  broad  view, 
encompassing  effectiveness  in  all  domestic  policy 
areas:  domestic  security  policy,  economic  policy, 
social  policy,  and  environmental  policy.  Each  policy 
area  is  measured  by  a  composite  measure,  and  these 
four  measures  are  integrated  into  an  overall  Index 
of  Effectiveness.  Altogether,  it  is  composed  of  14 
indicators  (e.g.,  murder  and  manslaughter,  gross 
domestic  product,  infant  mortality,  emissions  of 
sulfur  oxides,  etc.).  Examples  of  composite  mea¬ 
sures  aiming  at  general  or  procedural  performance 
are  the  good  governance  indicators  of  the  World 
Bank.  These  indicators  cover  composite  measures 
for  six  performance  dimensions — (1)  voice  and 
accountability,  (2)  political  stability  and  absence  of 
violence,  (3)  government  effectiveness,  (4)  regula¬ 
tory  quality,  (5)  rule  of  law,  and  (6)  control  of 
corruption — and  an  overall  composite  measure 
comprising  these  six  dimensions.  The  Worldwide 
Governance  Indicators  (WGI)  are  based  on  several 
hundred  individual  variables  measuring  perceptions 
of  governance  by  experts  and  ordinary  citizens  as 
well  (e.g.,  press  freedom  index,  risk  of  political 
instability,  quality  of  bureaucracy,  foreign  invest¬ 
ment,  violent  crime,  frequency  of  corruption,  etc.). 
Constructing  composite  measures  implies  decisions 
about  the  techniques  of  standardization,  weighting, 
and  aggregation  of  individual  values.  Different  sta¬ 
tistical  procedures  of  standardization  (e.g.,  z-score 
transformation  and  indexing),  weighting  (e.g.,  equal 
and  unequal),  and  aggregation  (e.g.,  arithmetic 
mean  and  unobserved-components  model)  are  used, 
and  their  relative  merits  are  discussed  intensively. 

Explaining  Political  Performance 

Research  on  political  performance  also  aims  to 
explain  differences  between  countries  and  why 


some  governments  are  more  successful  than  others. 
National  political  institutions  take  center  stage 
with  regard  to  this  question.  This  is  for  two  differ¬ 
ent  reasons,  one  theoretical  and  the  other  practical. 
Knowledge  about  the  effect  of  political  institutions 
is  of  theoretical  interest  because  the  idea  that  insti¬ 
tutions  matter  is  the  central  premise  of  the  new 
institutionalism  paradigm.  This  knowledge  is  also 
of  great  practical  importance.  If  political  institu¬ 
tions  reveal  a  direct  and  unidirectional  effect  on 
political  performance,  recommendations  can  be 
made  as  to  which  kinds  of  constitutions  should  be 
implemented  in  new  democracies  (constitutional 
engineering)  and  what  kind  of  institutional 
reforms  should  be  conducted  in  established 
democracies  to  enhance  political  performance. 
Besides  political  institutions,  cultural  factors  (e.g., 
social  capital),  socioeconomic  modernity  (e.g., 
wealth),  globalization  (e.g.,  foreign  trade),  and, 
particularly,  preferences  of  political  actors  (e.g., 
the  ideological  orientation  of  governmental  par¬ 
ties)  are  suggested  as  other  factors  explaining 
political  performance. 

Empirical  research  focuses  on  the  effect  of  demo¬ 
cratic  institutions,  either  on  the  effect  of  types  of 
democracies  (e.g.,  majoritarian  or  consensus  democ¬ 
racies,  presidential  or  parliamentary  democracies) 
or  on  the  effect  of  specific  democratic  institutions 
(e.g.,  majoritarian  or  proportional  electoral  sys¬ 
tems,  federal  or  unitary  states).  The  core  question  is 
whether  diffusion  or  concentration  of  power  results 
in  higher  levels  of  political  performance.  Numerous 
studies  dealt  with  the  performance  of  consensus  and 
majoritarian  democracies.  Empirical  analyses  of 
Lijphart  indicate  that  consensus  democracies  enjoy 
higher  levels  of  political  performance  than  majori¬ 
tarian  democracies  and  that  these  institutional  set¬ 
tings  themselves  display  a  direct  and  unidirectional 
effect  on  political  performance.  Replications  and 
additional  studies  can  also  show  that  democratic 
institutions  indeed  matter  for  political  performance 
(e.g.,  Roller,  2005).  But  they  matter  only  sometimes 
and  to  a  limited  degree,  and  their  effect  is  less  direct 
and  less  unidirectional  than  assumed  so  far.  It  seems 
that  political  institutions  interact  with  other  factors, 
especially  with  the  preferences  of  political  actors 
such  as  governments.  Therefore,  the  theoretically 
and  practically  attractive  hypothesis  stating  direct 
and  unidirectional  effects  of  political  institutions  on 
political  performance  is  open  to  challenge. 


1854  Performance  Management 


Research  Perspectives 

Although  we  are  able  to  now  witness  an  established 
research  area  of  political  performance,  its  achieve¬ 
ments  are  far  from  being  solid,  comprehensive,  and 
cumulative.  This  is  true  for  several  reasons.  First, 
the  available  studies  refer  to  a  multiplicity  of  het¬ 
erogeneous  criteria  of  political  performance;  a  con¬ 
sensus  on  relevant  criteria  is  still  to  evolve.  Second, 
theoretical  work  and  empirical  work  on  political 
performance  are  only  loosely  coupled.  Theoretical 
work  focusing  on  the  deduction  and  justification  of 
criteria  of  performance  tends  to  ignore  the  task  of 
specifying  empirical  indicators  that  could  flow 
from  its  normative  criteria.  In  turn,  empirical  work 
tends  to  disregard  theoretical  reasoning  on  perfor¬ 
mance  criteria.  Third,  empirical  research  is  mainly 
devoted  to  the  evaluation  of  democracies,  especially 
to  Western  democracies.  It  rarely  pays  attention  to 
the  performance  of  Asian,  African,  and  Latin 
American  democracies.  Fourth,  systematic  com¬ 
parisons  of  contemporary  democratic  and  non- 
democratic  systems  are  still  in  a  very  early  stage. 

To  make  progress,  it  might  be  helpful  to  focus 
on  the  four  types  of  performance  criteria  presented 
above.  This  could  open  up  the  possibility  of  devel¬ 
oping  common  theoretical,  methodological,  and 
empirical  foundations  for  each  of  these  four  types, 
and  this  could  provide  a  basis  for  cumulative 
knowledge.  Additionally,  one  question  should  be 
investigated  more  in  depth  in  future:  How  political 
institutions  exert  their  influence  on  political  per¬ 
formance  and  how  they  interact  with  other  factors, 
political  actors  in  particular.  Up  to  now,  theoreti¬ 
cal  and  empirical  knowledge  about  this  central 
question  is  too  provisional  and  incomplete. 

Edeltraud  Roller 
Johannes  Gutenberg  University  Mainz 
Mainz,  Germany 
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Performance  Management 


Performance  management  involves  managing 
organizations  as  well  as  managing  the  entire 
machinery  of  government.  More  precisely,  it  is  a 
process  of  establishing  goals  and  regularly  check¬ 
ing  the  progress  made  toward  achieving  those 
goals.  It  incorporates  continual  feedback  and  is 
based  on  the  principle  of  management  by  con¬ 
tract  rather  than  by  command.  This  entry  exam¬ 
ines  the  concept  of  performance  management,  its 
theoretical  context,  and  related  developments 
and  applications. 

Conceptual  Overview 

Performance  management  includes  activities  to 
ensure  that  goals  are  being  met  in  an  effective  and 
efficient  manner;  thus,  it  becomes  a  mechanism  for 
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accountability.  The  managerial  tradition  of  the 
past  several  decades  has  focused  on  performance 
as  a  part  of  management  practice  in  the  public  sec¬ 
tor  of  Western  countries.  Governments  have  sup¬ 
ported  administrative  reforms  from  old  public 
administration  rules  and  procedures  to  modern 
managerial  models,  within  the  framework  of  the 
new  public  management  doctrine. 

As  public  values,  efficiency,  productivity,  and 
service  orientation  have  become  more  important. 
Bureaucrats  have  been  forced  to  transform  from 
their  traditional  roles  into  more  managerial  roles 
and  to  become  more  concerned  with  performance. 

Generally  speaking,  many  ideas  concerning 
reform  have  been  borrowed  from  the  private  sec¬ 
tor.  Evaluation  has  focused  more  on  economic 
issues  than  on  political  ones.  Egoistic  interests, 
competition  and  contracting,  cost-effectiveness, 
and  the  role  of  customers  are  discussed  more  and 
more  in  the  core  of  public  domain. 

Performance  management  is  important  in  both 
the  private  and  the  public  sectors.  By  using  perfor¬ 
mance  management,  governments  are  searching 
for  better  performance  in  their  public  sector  orga¬ 
nizations.  The  use  of  market  mechanisms  in  public 
administration  is  one  means  of  increasing  competi¬ 
tion  with  the  private  sector  and  improving  the 
efficiency  of  public  services.  Service  delivery  is 
expected  to  be  more  transparent  and  accountable. 
In  the  redefined  and  new  governance,  administra¬ 
tive  control  is  transformed  from  traditional  action 
controls  to  modern  output  controls. 

However,  performance  management  is  cur¬ 
rently  faced  with  many  challenges.  Sometimes  it  is 
difficult  to  implement  in  the  public  sector  due  to 
politically  unstable  and  multiple  goals,  vague  out¬ 
puts,  and  complexity  of  services.  These  challenges 
raise  the  question  of  whether  performance  man¬ 
agement  is  more  a  myth  than  a  reality  in  modern¬ 
izing  public  organizations.  Another  issue  is  how  to 
reform  performance  measurement,  which  should 
be  more  than  simply  accounting  exercises.  As  John 
Halligan  and  Geert  Bouckaert  (2009)  ask,  is  there 
a  gap  between  rhetoric  and  practice  with  respect  to 
performance  management? 

In  the  performance  management  debate,  warn¬ 
ings  are  given  about  the  dysfunctions  and  unin¬ 
tended  consequences  of  performance  management 
practices  that  may  result  in  a  decline  rather  than  an 
improvement  in  the  performance  of  organizations. 


According  to  Frank  Verbeeten  (2008,  p.  430),  such 
effects  may  include  the  following: 

a.  growth  of  internal  bureaucracy, 

b.  lack  of  innovation, 

c.  reduction  of  system  or  process  responsibility, 

d.  tunnel  vision, 

e.  suboptimization, 

f.  gaming  of  performance  measures,  and 

g.  measure  fixation. 

How  should  we  understand  performance  man¬ 
agement?  In  performance  management,  targets  are 
defined,  expectations  are  set,  processes  are  mea¬ 
sured  and  evaluated,  and,  if  all  these  are  fulfilled, 
top  management  is  rewarded.  Performance  man¬ 
agement  is  both  a  management  tool  and  an 
umbrella  for  a  variety  of  different  management 
techniques.  Performance  management  is  connected 
to  several  other  management  techniques,  such  as 
the  stakeholder  approach,  a  balanced  scorecard, 
and  even  quality  management.  In  national  applica¬ 
tions,  performance  management  is  called  manage¬ 
ment  for  results  or  management  by  objectives  and 
results.  In  some  cases,  the  separate  nature  of  steer¬ 
ing  and  managing  is  being  emphasized. 

As  Zoe  Radnor  and  Mary  McGuire  (2004)  note, 
the  two  terms  performance  measurement  and  per¬ 
formance  management  are  often  used  interchange¬ 
ably.  The  basic  idea  in  performance  management  is 
measuring  performance  and  defining  aims  and 
goals.  Through  the  management  process,  strategies 
are  selected  to  achieve  the  goals  in  an  efficient  way. 
Performance-related  pay  for  groups  or  individuals 
is  also  used  in  performance  management. 

In  the  process  of  performance  management,  an 
agency  involves  its  employees,  as  individuals  and 
members  of  a  group,  in  improving  organizational 
effectiveness  in  the  accomplishment  of  the  agency 
mission.  According  to  John  Mwita  (2000),  the  fac¬ 
tors  that  affect  performance  are  personal  factors 
(e.g.,  motivation  and  commitment),  leadership  fac¬ 
tors  (encouragement,  guidance,  and  support),  team 
factors  (support  of  colleagues),  system  factors 
(instruments  of  labor),  and  contextual  factors 
(changes  of  internal  and  external  environment). 

Performance  management  is  basically  a  control 
system  as  well,  with  both  the  process  and  the 
outputs  being  controlled.  Different  opinions  are 
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presented  about  the  nature  of  the  control  func¬ 
tion.  For  example,  the  control  function  is  consid¬ 
ered  to  be  either  a  universal  and  highly  general  or 
a  detailed,  highly  specialized  function. 

Theoretical  Context 

In  public  administration  literature,  the  doctrine  of 
performance  management  is  referred  to  by  several 
theorists,  such  as  Amitai  Etzioni,  Herbert  Simon, 
James  March,  and  Henry  Mintzberg.  For  instance, 
Max  Weber’s  model  of  bureaucracy  is  considered  as 
a  machine  organization.  A  bureaucratic  organiza¬ 
tion  is  efficient,  routinized,  and  predictable. 
Sometimes  customer  organizations,  both  in  the  pub¬ 
lic  and  the  private  sector,  are  designed  like  machines, 
and  their  employees  are  expected  to  behave  as  if 
they  were  components  of  machines.  Thus,  currently, 
key  concerns  in  performance  management  are  the 
change  from  measuring  performance  to  managing 
performance  and  expanding  the  performance  man¬ 
agement  from  an  economic  perspective  to  others, 
such  as  social  and  citizen  perspectives. 

In  a  politico-administrative  process,  governance 
and  management  are  broader,  including  different 
stakeholders  in  the  democratic  system,  such  as 
elected  officials  and  citizens.  Commonly  shared 
values  are  based  on  the  public  interest  and  political 
accountability.  If  the  engagement  of  citizens  is  very 
limited  in  performance  processes,  performance 
management  as  a  system  will  not  know  what 
people  care  about  in  their  different  roles  as  voters, 
taxpayers,  customers,  and  so  on. 

As  Amber  Wichowsky  and  Donald  Moynihan 
state,  in  a  well-functioning  democracy,  citizens  are 
capable  of  self-governance,  they  are  treated  with 
equal  respect  and  dignity,  and  they  are  responsible 
to  their  communities  and  participate  actively  in 
political  life. 

According  to  Paul  Epstein,  Lyle  Wray,  and 
Courtney  Harding  (2006),  in  a  performance  man¬ 
agement  process,  potential  citizen  roles  are,  for 
instance,  framing  issues  and  setting  the  agenda 
(planning  and  setting  goals  and  performance  tar¬ 
geting),  as  stakeholders  and  collaborators  (imple¬ 
menting  policies),  and  as  evaluators  (measuring 
and  reporting  results). 

Public  sector  leaders  encounter  numerous 
requirements  in  managing  the  performance  mea¬ 
surement  of  government  services.  These  include 


(a)  understanding  better  the  real  nature  of  service 
delivery  systems;  (b)  having  the  capability  of  agree¬ 
ing  on  strategic  objectives,  providing  incentives, 
and  evaluating  programs;  and  (c)  collecting,  ana¬ 
lyzing,  and  presenting  performance  management 
data. 

In  the  workplaces  of  public  organizations,  per¬ 
sonnel  are  aware  of  the  purposes  of  performance 
management.  By  measuring  and  managing  perfor¬ 
mance,  it  is  expected  that  managers  and  employees 
are  learning  to  improve  their  performance.  Citizens 
also  want  to  know  for  what  purposes  their  money 
is  spent.  Besides,  in  large-scale  branches  of  admin¬ 
istration,  massive  amounts  of  performance  data 
are  collected.  Effective  exploitation  of  performance 
information  is  a  challenge  because  expectations 
might  be  very  high.  Effective  ways  of  implement¬ 
ing  and  rational  decisions  on  improving  perfor¬ 
mance  are  both  needed. 

Development  and  Application 

An  example  of  the  development  of  performance 
management  is  the  system  existing  in  Finland, 
which  is  in  line  with  practices  in  other  Western 
countries.  Within  the  public  administration  estab¬ 
lishment  in  Finland,  reform  began  in  the  late 
1980s.  These  public  sector  reforms  were  mainly 
influenced  by  administrative  and  management 
reforms  in  other  Organisation  for  Economic 
Co-operation  and  Development  (OECD)  coun¬ 
tries.  Steering  and  budgetary  systems,  including 
laws,  rules,  and  procedures,  were  reformed.  New 
management  techniques  were  implemented  in  the 
central,  regional,  and  local  government. 

According  to  the  definition  in  the  Finnish  Hand¬ 
book  on  Performance  Management  (2006),  per¬ 
formance  management  is  an  agreement-based 
interactive  control  model.  Its  operational  core  lies 
in  the  ability  of  the  involved  parties  to  find  the 
appropriate  balance  between  the  available  resources 
and  the  results  to  be  attained  using  them.  The  basic 
idea  of  performance  management  in  operations  is 
to  balance  resources  and  targets,  on  the  one  hand, 
and  efficiency  and  quality,  on  the  other,  and  to 
ensure  that  the  desired  effects  are  achieved  in  a 
cost-effective  manner. 

In  formal  performance  management  systems, 
performance  targets  are  discussed,  agreed  on,  and 
set  between  the  ministry  and  the  subordinate 
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agency.  The  performance  criteria  are  used  in  setting 
targets  and  in  reporting  operations.  If  the  system  is 
to  have  legitimacy,  the  targets  must  be  measurable, 
clear,  and  based  on  systematic  and  reliable  evalua¬ 
tion.  Performance  criteria  consist  of  policy  effec¬ 
tiveness,  operational  efficiency,  outputs,  quality 
management,  and  management  of  human  resources. 

Another  challenge  for  performance  applications 
is  the  use  of  performance  criteria.  Performance 
criteria  should  be  developed  by  applying  incentive 
systems.  In  other  words,  who  receives  praise  or 
criticism?  If  the  criteria  are  determined  for  evaluat¬ 
ing  the  performance  of  lower  level  public  servants 
in  their  daily  interactions  with  users  of  public  ser¬ 
vices,  the  decision  process  will  be  much  easier  than 
it  is  for  higher  level  officials. 

In  Finland,  performance  management  is  imple¬ 
mented  through  performance  agreements  between 
the  ministry  and  the  agency.  The  agreement  is  a 
control  tool  in  the  interaction  of  the  parties  involved. 
Because  performance  is  reported,  there  are  conse¬ 
quences  for  both  good  and  poor  performance.  For 
instance,  in  the  Finnish  university  sector,  the  Ministry 
of  Education  and  all  20  state-run  universities  sign  a 
performance  agreement  defining  the  objectives:  the 
number  of  degrees  to  be  awarded,  the  development 
projects,  and  the  level  of  funding.  The  agreement  is 
signed  for  3  years  but  is  reviewed  in  the  yearly  nego¬ 
tiations  between  the  ministry  and  a  single  university. 
In  line  with  the  ministry-university  negotiations, 
each  university  arranges  its  own  performance  nego¬ 
tiations  between  the  rector  and  the  faculties. 

The  example  of  Finnish  universities  illustrates 
the  fundamental  character  of  performance  man¬ 
agement.  Despite  the  problems  of  measurement 
and  implementation,  this  approach  to  manage¬ 
ment  has  developed  as  a  major  method  for  man¬ 
agement  and  accountability. 

Ari  Salminen 
University  of  Vaasa 
Vaasa,  Finland 
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Personalization  of  Politics 


Personalization  of  politics  refers  to  a  shift  from 
collective  to  individual  actors  and  institutions.  Its 
scope  and  degree  can  vary  according  to  regime 
type,  national  and  cultural  environments,  and  the 
time  span  under  consideration.  However,  person¬ 
alization  is  commonly  considered  as  an  emerging 
trait  in  a  number  of  relevant  arenas:  from  voting 
behavior  to  governmental  steering  and  from  elec¬ 
toral  campaigning  to  party  organization.  Most 
studies  emphasize  the  role  of  leadership  and  media 
as  the  key  determinants  of  political  personaliza¬ 
tion,  though  it  certainly  partakes  of  the  more 
general  trend  toward  an  individualized  society. 
This  entry  discusses  recent  trends  and  major 
implications  for  contemporary  democracies. 

The  most  visible  aspect  of  political  personaliza¬ 
tion  concerns  the  transformation  of  electoral  cam¬ 
paigning,  with  the  rise  of  candidate-centered 
politics.  This  transformation  affects  the  whole  con¬ 
sent  mobilization  process.  The  media,  in  the  first 
place,  have  become  the  main  intermediary  channel 
between  candidates  and  their  constituencies.  With 


1858  Personalization  of  Politics 


traditional  party  structures  losing  their  grip  on  the 
electorate,  candidates  tend  to  rely  on  old  as  well  as 
new  media  to  spell  out  their  programs  and  contact 
potential  voters,  especially  in  large-scale  elections — 
for  governors,  prime  ministers,  or  presidents — 
where  TV  and  the  Internet  offer  invaluable  short¬ 
cuts  to  reaching  mass  audiences.  To  best  exploit  the 
media,  candidates  use  media  language  and  logic, 
which  put  a  premium  on  individual  personality. 

This  has  led  to  a  thorough  reorganization  of 
campaign  strategies  and  funding,  with  the  main 
focus  on  the  media  image  and  performance  of  the 
candidates.  In  the  wake  of  the  famous  TV  debate 
in  1960  between  U.S.  presidential  candidates 
Richard  Nixon  and  John  F.  Kennedy,  which  was 
held  responsible  for  the  victory  of  Kennedy,  the 
Democratic  nominee,  the  media  have  become  the 
key  investment  of  all  major  electoral  campaigns. 
Several  empirical  studies  have  questioned  whether 
the  actual  impact  of  personalized  politics  fully 
matches  the  vision  and  aims  of  campaign  manag¬ 
ers  and  spin  doctors.  However,  while  issues  have 
remained  important  and  party  identification  still 
plays  a  role  in  voting  behavior,  it  is  a  widespread 
opinion  that  voters,  today,  are  more  heavily  influ¬ 
enced  by  the  personality  factor  than  they  were 
20  years  ago. 

Though  less  visible  and  less  debated  than  the 
personalization  of  elections,  a  no  less  important 
area  of  political  personalization  is  the  strengthen¬ 
ing  of  monocratic  government.  Whether  in  presi¬ 
dential,  semipresidential,  or  parliamentary  regimes, 
the  chief  executive  has  gained  more  power  and 
authority  with  respect  to  three  arenas:  ( 1 )  the  gov¬ 
ernment  as  a  whole,  (2)  the  party,  and  (3)  the 
wider  public.  Personalization  in  government  has 
been  described  as  a  process  of  presidentialization. 
That  is,  prime  ministers  in  old  as  well  as  new 
democracies  are  now  stronger  with  respect  to  the 
collegial  bodies  they  chair,  performing  several 
functions  that  were  once  the  prerogative  of  the 
U.S.  presidential  model.  Increase  in  prime  ministe¬ 
rial  power  concerns  the  normative  as  well  as  the 
organizational  sphere.  With  the  expansion  of  del¬ 
egated  legislation  and  emergency  regulation,  a 
large  share  of  legislative  activity  has  shifted  from 
parliament  to  the  executive  and,  within  the  execu¬ 
tive,  to  the  prime  ministerial  office.  This,  in  turn, 
has  grown  both  in  human  resources  and  in  the 
scope  of  intervention,  a  trend  that  in  many  ways 


resembles  the  transformation  of  the  U.S.  presi¬ 
dency  following  Franklin  D.  Roosevelt’s  reforms. 
Prime  ministers  have  also  gained  better  control 
over  the  governmental  agenda  and  the  overall 
decision-making  process,  thus  taking  more  direct 
responsibility  for  their  electoral  mandate. 

In  only  a  few  instances,  the  emerging  of  prime 
ministerial  dominance  is  the  result  of  outright 
constitutional  changes.  In  most  cases,  the  trans¬ 
formation  has  occurred  through  new  rules  and 
procedures  regulating  the  relationship  between 
the  legislative  and  the  executive  branch,  as  well  as 
through  the  adoption  of  laws  providing  for  an 
overall  reorganization  of  the  prime  minister’s 
office.  As  a  consequence,  the  traditional  constitu¬ 
tional  divide  between  presidential  and  parliamen¬ 
tary  regimes  has  been  to  some  extent  blurred  by 
what  could  be  considered  a  silent  institutional 
revolution. 

A  major  contribution  to  this  end  has  come  from 
the  personalization  of  party  control.  Parties  have 
long  been  the  stronghold  of  collective  decision 
making  and  responsibility,  largely  concurring  with 
the  preeminence  of  legislatures  as  collegial  bodies. 
In  many  Western  democracies  as  well  as  in  the 
Soviet  bloc,  the  prevalent  form  of  party  rule  has 
been  oligarchic,  by  a  system  of  elite  cooptation.  In 
the  past  2  decades,  party  leadership  has  become 
more  associated  with  individual  personalities.  The 
change  has  been  heralded  by  countries  as  different 
as  Margaret  Thatcher’s  United  Kingdom  and  Mik¬ 
hail  Gorbachev’s  Soviet  Union,  soon  to  become  the 
format  of  most  political  regimes. 

Party  personalization  can  vary  according  to  sev¬ 
eral  factors.  In  old,  established  parties,  the  process 
is  contrasted  by  the  entrenched  nomenclature.  It 
took  12  years  for  Tony  Blair  to  successfully  take 
over  the  Labour  party  in  the  United  Kingdom  and 
impress  his  own  brand  on  the  New  Labour. 
Helmut  Kohl’s  primacy  in  the  Christian  Democratic 
Union  (Christlich  Demokratische  Union  Deutsch- 
lands,  CDU)  in  Germany  was  mainly  related  to  the 
extraordinary  opportunities  of  Germany’s  reunifi¬ 
cation.  On  the  whole,  the  German  party  system, 
also  as  a  legacy  of  the  Nazi  era,  remains  a  hostile 
environment  to  one-man  rule.  Yet  whenever  new 
parties  are  formed,  there  is  a  diffused  tendency 
that  they  be  promoted  and  controlled  by  individ¬ 
ual  leaders.  The  ideal-typical  case  is  Silvio 
Berlusconi’s  Forward  Italy  (Forza  Italia).  Banking 
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on  the  favorable  conditions  created  by  the  collapse 
of  the  governing  parties,  Berlusconi  immensely 
benefited  from  his  corporate  empire  to  found  his 
own  personal  party.  As  a  media  tycoon  and  one  of 
the  wealthiest  men  in  the  world,  he  could  rely  on  a 
highly  skilled  professional  apparatus  as  well  as  on 
huge  financial  means  to  set  up,  within  less  than 
6  months,  a  vote-generating  machine. 

A  key  aspect  of  Berlusconi’s  success  was  his 
extensive  use  of  TV,  with  the  new  infomercial 
techniques  he  borrowed  from  Ross  Perot’s  1992 
presidential  campaign  in  the  United  States,  thus 
bypassing  journalists’  intermediation  and  directly 
targeting  his  audiences.  Direct  appeal  to  the  masses, 
together  with  a  constant  focus  on  charismatic 
leadership,  is  the  main  trait  of  the  so-called  media 
populism,  a  phenomenon  strongly  associated  with 
party  personalization.  From  Hugo  Chavez’s  Ven¬ 
ezuela  to  Vladimir  Putin’s  Russia,  political  and/or 
economic  control  of  the  media  has  become  a  deci¬ 
sive  factor  for  the  rise  of  one-man-dominated  par¬ 
ties  with  a  large  popular  following. 

The  Internet,  too,  has  greatly  contributed  to 
political  personalization.  While  the  broadcasting 
media,  such  as  radio  and  TV,  require  big  budgets 
and  complex  scheduling  and  cannot  be  aimed  at 
specific  population  segments,  creating  one’s  own 
website,  blog,  and  direct  mailing  system  is  a  much 
easier  and  less  costly  undertaking.  More  and  more 
politicians  use  the  Internet  to  promote  their  mes¬ 
sage  and  interact  with  their  constituencies  on  a 
personal  basis,  thus  contributing  to  rendering  party 
channels  obsolete.  In  most  cases,  Internet  personal¬ 
ization  operates  on  a  smaller  scale  than  the 
TV-based  marketing  of  political  leadership. 
However,  the  two  media  can  be  used  to  feed  and 
reinforce  one  another,  as  in  Barack  Obama’s  2008 
presidential  campaign.  Thanks  to  the  web,  Obama’s 
organization  reached  millions  of  supporters  with 
daily  updates  on  the  most  controversial  issues  and 
invitations  to  join  virtual  or  real-life  debates,  thus 
motivating  participation  and  spurring  donations. 
The  money  raised  through  the  Internet  was,  in  turn, 
devoted  to  a  massive  use  of  TV  advertising. 

In  spite  of  its  overwhelming  presence  in  contem¬ 
porary  politics,  measuring  the  real  impact  of  politi¬ 
cal  personalization  on  strictly  empirical  grounds  is 
not  easy.  Research  frameworks  have  been  slow  to 
adapt  to  a  phenomenon  that  affects  so  many  are¬ 
nas.  With  a  few  relevant  exceptions,  there  is  a  lack 


of  comparative  and  longitudinal  studies,  the  more 
so  as  the  most  striking  trait  of  political  personaliza¬ 
tion  consists  of  its  cutting  across  different  disciplin¬ 
ary  subdomains.  Studies  of  electoral  behavior 
rarely  venture  into  the  analysis  of  the  transforma¬ 
tion  of  party  organizations,  and  most  accounts  of 
the  rise  of  stronger  executives  only  focus  on  the 
governmental  institutional  setting. 

This  is  not  to  say  that  empirical  investigation  is 
not  possible  or  necessary.  The  relevance  of  this 
topic  looms  even  larger  in  a  wider  theoretical  and 
historical  perspective.  In  what  has  become  the 
mainstream  definition  of  political  power,  Max 
Weber  distinguished  three  types  of  legitimate 
authority.  Two  types,  the  traditional  and  the  char¬ 
ismatic,  were  grounded  in  personal  resources.  Yet 
Weber  considered  the  third  type  of  legal-rational 
authority,  with  its  emphasis  on  bureaucratic  pro¬ 
cedures  and  collective  decision  making,  as  the  pil¬ 
lar  of  modern  society,  both  in  the  political  and  in 
the  economic  realms.  Traditional  and  charismatic 
power  was  mainly  treated  as  a  remnant  of  the  past, 
inevitably  bound  to  be  superseded  by  the  advent  of 
legal-rational  authority.  Moreover,  in  dealing  with 
traditional  and  charismatic  power,  Weber’s  ten¬ 
dency  was  to  present  these  two  patterns  as  sepa¬ 
rate,  occurring  in  different  historical  and  cultural 
contexts.  Quite  to  the  contrary,  one  of  the  major 
characteristics  of  contemporary  personal  power  is 
the  interaction  and  cumulation  of  these  two  ele¬ 
ments.  The  modern  process  of  personalization  of 
politics  results,  to  a  large  extent,  from  the  combi¬ 
nation  of  patrimonial  and  charismatic  resources. 
This  combination  becomes  all  the  more  stronger 
when  it  can  use — and  possibly  merge  into — the  key 
central  institutions  of  a  democratic  regime. 

Political  personalization  thus  appears  as  a  mix  of 
old  and  new  forces.  The  main  novelty  consists  in 
the  role  of  the  media  and  their  natural  focus  on 
individuals  and  their  personality.  This  has  led  to  a 
revival  of  charisma  in  an  electronic  form.  Media 
charisma  may  have  little  if  any  of  the  extraordinary 
gift  of  grace  that  characterized  charismatic  leaders 
in  the  past,  with  their  mass  religious  followings.  Yet 
on  various  occasions,  media  charisma  has  proved 
to  be  a  key  factor  in  the  rise  of  powerful  leaders. 
The  comeback  of  personal  power  also  reflects  the 
resilience,  in  industrialized  societies,  of  cultural 
traits  long  considered  bygones.  The  tendency  to 
rely  once  again  on  personal  rather  than  collective 
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and  impersonal  attributes  is  a  sign  of  weaker  social 
cohesion  and  increased  instability.  On  the  positive 
side,  one  may  consider  that  the  process  of  political 
personalization,  today,  largely  falls  within  the  rou¬ 
tines,  discourse,  and  boundaries  of  democratic 
politics.  The  nightmare  of  personal  power  getting 
out  of  control  seems  to  belong  to  the  past. 

Mauro  Calise 
University  of  Naples  Federico  II 
Naples,  Italy 
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Pillarization 


Pillarization  of  society  means  the  vertical  integra¬ 
tion  of  a  subcultural  community  in  a  system  of 
political  representation.  More  often  than  not,  this 
kind  of  integration  is  considered  as  necessary  for  a 
minority  to  politically  participate  and  survive 
when  democratization  of  a  polity  is  under  way  or 
more  or  less  completed  by  means  of  electoral  rights 
(e.g.,  universal  suffrage)  and  the  establishment  of  a 
party  system  (at  the  national  level).  Pillarized 


political  systems  emerged  in  a  limited  number  of 
cases  only  and  strongly  correlate  with  a  specific 
type  of  parliamentary  democracy:  “consocia- 
tional”  systems  (Arend  Lijphart)  in  which  agree¬ 
ments  among  the  elites  of  the  respective  pillars 
bridge  their  divisions.  In  the  following  sections,  the 
characteristics  of  such  pillars  as  they  occur  in  the 
major  cases,  their  political  relevance,  and  recent 
tendencies  of  depillarization  are  discussed. 

Characteristics  and  Political  Relevance 

The  term  pillar  ( verzuiling  in  Dutch  and  Flemish 
and  Lager  in  German)  is  sociologically  defined  as 
strong  political-cultural  submilieus  based  on  eth¬ 
nic-linguistic,  religious,  or  similar  cleavages  with  a 
strong  internal  organization  and  a  relatively  high 
level  of  external  autonomy.  Under  such  circum¬ 
stances,  you  can  spend  much  of  your  life  within  the 
networks  of  such  pillars,  often  including  schools, 
media,  political  parties,  and  so  on.  These  organized 
submilieus  are  politically  relevant  for  understand¬ 
ing  the  stability  and  survival  of  democracy  in 
(deeply)  divided  societies.  Lijphart  mentions  five 
criteria  to  measure  the  degree  of  pillarization: 

1.  the  role  of  ideology  (more  often  than  not 
cleavage  related)  within  the  pillar, 

2.  the  size  and  density  of  the  organized  network 
representing  the  pillar, 

3.  the  (institutionalized)  cohesiveness  of  the  pillar’s 
network, 

4.  the  extent  of  “enclosure”  of  the  pillar  (or  the 
absence  of  cross-cultural  relations),  and 

5.  the  extent  to  which  pillarized  behavior  is 
encouraged  and  directed  by  its  elites. 

The  application  of  these  criteria  to  West  European 
democracies  shows  that  there  are  four  polities  that 
indeed  qualify  as  being  pillarized:  Austria,  Belgium, 
the  Netherlands,  and  Switzerland.  The  last  is  seen 
as  the  least  pillarized.  Others,  for  example,  Gerhard 
Lehmbruch,  have  claimed  that  the  Federal  Republic 
of  Germany  belongs  to  the  pillarized  community, 
too.  This  is  questionable  because  Lehmbruch 
focused  on  the  organization  of  the  German  postwar 
party  system  and  its  ramifications  for  decision  mak¬ 
ing  at  the  federal  level.  His  explanation  rested  more 
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on  the  cleavage-related  mechanisms  and  the  com¬ 
plexities  of  organizing  representation  in  a  federa¬ 
tion.  Hence,  the  working  of  the  German  polity 
showed  some  of  the  features  of  a  pillarized  system. 

Switzerland  can  also  be  considered  as  a  border¬ 
line  case.  Rather  than  being  the  well-organized  and 
cohesive  pillars  (as  defined  here),  the  extant  cleav¬ 
ages  in  Swiss  politics — religious/secular,  language- 
cum-territory  divisions,  and  the  rural/urban  divide — 
accounted  for  the  development  of  consociational 
practices  through  the  mechanisms  of  the  party  sys¬ 
tem  dynamics.  For  example,  Pascal  Sciarini  and 
Simon  Hug  claim  that  pillarization  in  Switzerland  is 
weak  because  it  only  plays  a  role  in  finding  amica¬ 
ble  agreement  based  on  cleavages  and  by  means  of 
proportionality  on  the  federal  level.  Pillarized  net¬ 
works  are  not  the  main  agents  but  rather  national 
political  parties  that  are  in  turn  checked  and  bal¬ 
anced  by  institutional  safeguards  through  cantonal 
representation  and  the  referendum  instrument. 

The  other  cases  mentioned  also  show  differ¬ 
ences.  Belgium  is  complex  in  terms  of  its  pillariza¬ 
tion  because  it  is  strongly  influenced  by  two 
cleavages.  One  is  secular/religious  and  the  other  is 
language/territory.  While  the  former  is  not  more 
relevant  than  in  many  other  (European)  party  sys¬ 
tems,  the  latter  is:  Since  the  1960s,  this  conflict  is 
rising  high  on  the  political  agenda;  it  has  been 
conducive  to  the  federalization  of  the  Belgian  state 
and  was  preceded  by  the  bifurcation  of  the 
nationwide  party  system. 

Austria  and  the  Netherlands  stand  out  as  the 
strongest  examples  of  politicized  pillarization.  This 
can  be  explained  by  two  factors:  One,  the  minori¬ 
ties  are  not  geographically  concentrated,  but  their 
“members”  reside  throughout  the  entire  territory. 
Precisely  for  this  reason,  a  socially  cohesive  net¬ 
work  reinforcing  the  control  of  membership  was 
deemed  necessary.  Two,  only  by  means  of  elite 
politics  and  the  proportional-representation  (PR) 
electoral  system — registering  the  relative  strength 
of  each  pillar — was  it  feasible  to  establish  accom¬ 
modation  between  the  pillarized  communities. 

Other  cases  of  pillarization  have  been  men¬ 
tioned,  but  mostly  defined  post  hoc,  to  explain  the 
relative  stability  of  the  emerging  political  system. 
Examples  are  Czechoslovakia  (1989-1993),  Leba¬ 
non  (during  the  1960s),  and  Colombia. 

Pillarization,  as  it  has  emerged  and  existed, 
is  strongly  correlated  to  a  particular  form  of 


democracy:  consociationalism.  This  can  explain  the 
paradox  of  the  occurrence  of  stable  democracies 
with  strong  political  divisions  between  minorities. 
A  consociational  democratic  system  is  character¬ 
ized  by  the  fact  that  there  is  no  majority  party  but 
only  minority  parties  that  are  by  and  large  repre¬ 
senting  pillarized  communities.  In  contrast  to  the 
theory  of  pluralism  that  predicts  political  instability 
under  such  circumstances,  many  (but  not  all) 
democracies  appear  capable  of  consensual  and 
effective  decision  making.  This  paradox  was  solved 
by  developing  institutional  mechanisms  and  con¬ 
ventions  that  allowed  for  cooperation  between 
elites  (representing  the  various  pillars):  (a)  the  intro¬ 
duction  of  a  proportional  electoral  system  that 
secured  access  for  each  pillarized  party;  (b)  securing 
the  autonomy  of  each  pillar  within  society  (often  by 
means  of  basic  laws,  as  is  the  case  in  Austria  and 
the  Netherlands,  or  by  carefully  describing  the 
degrees  of  self-regulation  by  subnational  territories, 
as  in  Germany  and  Switzerland  as  well  as  in 
Belgium  of  late);  and  (c)  a  careful  but  often  com¬ 
plex  procedure  regarding  the  formation  of  govern¬ 
ment  coalitions  (which  were  often  all-inclusive  or  at 
least  broad  in  composition).  These  mechanisms 
enabled  elites  to  find  agreement  and  to  avoid  open 
conflict  over  substantial  issues  that  directly  con¬ 
cerned  the  ideology  or  subcultural  differences  that 
had  led  to  pillarization.  Hence,  the  political  conse¬ 
quences  of  pillarization — that  is,  deep-seated 
conflict — could  be  overcome  by  means  of  elite 
appeasement  and  (often  secret)  agreements,  pro¬ 
portional  payoffs,  and  (often)  depoliticization  of 
contested  policy  issues. 

It  should  be  noticed,  however,  that  consocia¬ 
tional  democracy  cannot  be  considered  as  a  uni¬ 
versal  recipe.  Lijphart,  for  instance,  showed  that  in 
other  cases,  primarily  outside  Western  Europe,  this 
type  of  democracy  did  not  develop  or  last  long 
under  the  same  circumstances  (e.g.,  Lebanon, 
Colombia,  and  perhaps  contemporary  Belgium). 
Furthermore,  critics  of  the  consociational  model 
have  pointed  to  certain  flaws,  namely,  the  fact  that 
the  minorities  had  to  be  roughly  of  the  same  size 
or  that  these  subcultural  groups  were  insufficiently 
organized  and  therefore  less  loyal  to  their  political 
elites.  Finally,  the  political  practice  often  depended 
on  other  factors  such  as  social  and  economic  devel¬ 
opment  (e.g.,  the  effective  equalization  of  resources 
across  segments  of  the  population).  Finally,  the 
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question  arose  whether  or  not  consociational 
success — that  is,  a  stable  and  effective  govern¬ 
ment  for  all  communities — did  in  fact  produce  its 
own  disappearance  over  time.  This  last  argument 
seems  to  be  correct,  for  consociationalism  can  be 
considered  as  a  historical  phenomenon  for  the 
simple  reason  that  most  pillarized  political  sys¬ 
tems  have  indeed  faded  away  or  failed. 

Depillarization 

Depillarization  is  a  sociological  process  that  has 
taken  place  since  the  1970s  and  is  considered, 
first  of  all,  as  a  result  of  modernization  (e.g.,  in 
terms  of  developments  in  communication  such  as 
information  and  communication  technologies 
[ICTJ,  which  impair  forms  of  social  control), 
secularization  (e.g.,  de-confessionalization,  which 
weakens  the  loyalty  of  members  of  a  pillar),  glo¬ 
balization  (e.g.,  due  to  de-industrialization  being 
conducive  to  a  changing  class  composition),  and 
changes  in  elite  behavior — all  resulting  in  less 
cohesiveness  and  relaxing  organizational  ties 
within  the  pillars. 

Politically,  this  process  can  be  observed  in  terms 
of  high  degrees  of  electoral  volatility,  indicating 
changing  behavior  and  a  willingness  to  vote  on  the 
basis  of  motives  other  than  being  a  member  of  a 
pillar.  It  is  illustrative  that  during  the  1960s  the 
cross-European  level  of  electoral  volatility  was 
8%,  but  in  pillarized  countries,  this  was  3.3% 
lower.  Yet  from  1990  onward,  this  has  changed 
dramatically:  On  average,  electoral  volatility 
increased  by  3  points  in  Europe,  but  in  the  five  pil¬ 
larized  countries,  it  rose  to  an  unprecedented  level 
of  22.3%,  a  growth  of  16.8%  (Peter  Mair,  2008). 

Second,  the  emergence  of  new  parties  that  chal¬ 
lenge  pillarized  parties  and  are  mainly  nonpolar¬ 
ized  has  contributed  to  these  developments. 
Examples  are  the  Greens  and  right-wing  parties: 
Both  party  families  have  become  particularly 
strong  in  Austria,  Belgium,  the  Netherlands,  and 
Switzerland.  During  the  1960s,  electoral  support 
for  all  new  parties  in  pillarized  societies  was  below 
the  cross-European  average,  being  around  4.4%, 
whereas  their  support  grew  to  10%  above  the 
cross-European  level  in  2000.  Third,  the  politics  of 
accommodation  as  indicated  inter  alia  by  the  type 
of  government  coalitions  has  also  changed:  Instead 
of  broad  coalitions,  one  finds  more  often  than  not 


minimal  winning  coalitions  (apart  from  Switzer¬ 
land,  due  to  its  magic  formula,  which  is,  however, 
under  pressure),  with  parties  included  in  govern¬ 
ment  coalitions  that  had  not  been  participating 
before,  such  as  in  Austria,  the  Netherlands,  and 
Germany. 

All  this  points  to  a  process  of  depillarization 
affecting  a  changing  profile  of  organizational 
cohesiveness,  electoral  representation,  and  tradi¬ 
tional  cleavage  politics.  Finally,  the  impact  of 
(party)  elites  diminished  as  well.  Pillarization, 
taken  in  its  political-sociological  meaning,  seems 
to  have  faded  away  and  with  it  the  typical  features 
of  consociational  democracy. 

Hans  Keman 
Vrije  Universiteit  Amsterdam 
Amsterdam,  Netherlands 
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Planning  is  deciding  about  future  decisions,  either 
in  the  sense  of  defining  what  has  to  be  decided  or 
in  the  sense  of  deciding  how  to  decide.  This  entry 
discusses  alternative  conceptions  of  planning,  its 
place  in  social  science  theories,  and  some  of  the 
applications  of  planning  in  governing.  After  a 
consideration  of  planning  as  a  fundamental 
human  activity,  the  entry  explains  the  rational- 
comprehensive  model  of  planning  and  its  limita¬ 
tions  and  then  examines  alternative  approaches  to 
planning,  including  procedural  models,  systems 
theory,  and  strategic  planning.  It  then  discusses  the 
place  of  planning  in  social-scientific  theories  and 
some  of  the  applications  of  planning  to  governance. 

The  Activity  of  Planning 

Planning  is  a  very  common  activity.  We  plan 
courses  of  action  so  that  we  can  achieve  goals. 
However,  it  is  important  to  distinguish  between 
the  concept  of  decision — the  choice  between  differ¬ 
ent  courses  of  actions  whose  consequences  it  is 
possible  to  forecast — and  the  notion  of  planning. 
The  latter  concept,  in  fact,  encompasses  several 
elements,  namely,  that 

•  to  reach  the  goal  or  purpose  of  the  activity,  the 
decision  maker  has  to  take  into  consideration  a 
set  (and  possibly  a  sequence)  of  different 
decisions; 

•  usually  there  is  the  need  to  allocate  scarce 
resources  between  different  goals;  and 

•  central  to  the  notion  of  planning  is  the  concept 
of  public  policy,  alternatively  defined  as  the  set 
of  actions  linked  to  the  solution  of  a  collective 
problem,  a  program  of  action,  or  a  system  of 
goals. 


Planning,  then,  can  be  defined  as  the  attempt  to 
secure  the  coherence  of  a  set  or  a  sequence  of  deci¬ 
sions  in  relation  to  a  specific  problem — that  is,  a 
public  policy.  Therefore,  even  if  we  often  find  a 
plan  or  a  program — that  is,  a  document  spelling 
out,  in  more  or  less  detail,  the  content  of  the  deci¬ 
sions  to  be  taken  and/or  the  procedures  to  adopt  to 
this  end — planning  in  itself  cannot  be  identified 
with  the  decision  to  approve  the  said  document; 
rather,  it  is  a  continuous  activity,  a  process  through 
which  the  future  actions  are  organized  and 
described. 

The  theory  and  practice  of  planning  affect  a 
plurality  of  different  policy  fields.  For  instance,  the 
first  university  chair  of  urban  planning  was  created 
in  Liverpool  in  the  United  Kingdom  in  1909,  thus 
recognizing  the  existence  of  a  specific  discipline, 
distinct  from  architecture,  geography,  and  the 
social  sciences.  But  the  socialist  and  communist 
movements  in  the  1920  to  1940  period  gave  popu¬ 
larity  to  the  concept  and  practice  of  economic 
planning — that  is,  the  adoption  by  the  political 
power  of  a  detailed  and  binding  blueprint  of  the 
way  in  which  the  economic  life  has  to  be  orga¬ 
nized,  by  prescribing  the  type  and  quantity  of 
goods  and  services  to  be  produced  in  a  given  soci¬ 
ety  in  a  given  period  of  time.  In  the  years  following 
World  War  II,  planning  became  widespread  and 
was  equated  with  rational  and  “scientific”  decision 
making.  This  was  also  a  consequence  of  the  devel¬ 
opment  of  new  methodologies  of  analysis  (from 
cost-benefit  analysis  to  operation  research  and 
linear  programming)  holding  the  promise  of  being 
able  to  inform  public  decision  making.  The  mete¬ 
oric  rise  of  the  Planning,  Programming,  Budgeting 
System  (PPBS)  in  the  U.S.  federal  government  in 
the  1960s  (and  the  subsequent  demise  of  the  PPBS 
program  in  the  U.S.  military  in  the  early  1970s)  is 
a  good  example  of  the  importance  of  this  approach 
in  that  period.  Even  today,  planning  is  sometimes 
still  presented  as  the  true  answer  to  all  policy  prob¬ 
lems,  as  the  “right  way”  to  organize  the  exercise  of 
political  power  in  public  policies. 

It  is  interesting  to  note  that  even  politicians  and 
scholars  who  share  a  common  belief  in  the  virtues 
of  market  and  democracy,  and  consequently 
strongly  oppose  any  form  of  socialism  and  planned 
economy,  seem  to  advocate  more  planning  when  it 
comes  to  public  policy  making,  possibly  as  a  way 
to  constrain  political  and  administrative  discretion 
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and  certainly  to  increase  the  level  of  coherence  and 
predictability  of  governmental  interventions.  This 
basically  means  that  the  value  of  planning  is  seen 
as  a  way  to  improve  the  content  of  the  decisions  by 
making  them  more  “rational.” 

Alternative  Conceptions  of  Planning 
The  Rational-Comprehensive  Model 

The  starting  point  of  the  whole  debate  on  the 
virtues  of  planning  in  policy  making  and  policy 
implementation  is  therefore  the  idea  that  it  is  pos¬ 
sible,  indeed  desirable,  to  base  all  decisions  on  a 
scientific  analysis  of  the  consequences  of  the 
available  alternatives.  This  implies  the  following 
postulates: 

1.  It  is  possible  to  analyze  the  costs  and  benefits  of 
all  the  different  courses  of  action  open  to  the 
public  decision  maker. 

2.  It  is  possible  in  a  democratic  society  to  use 
intellectual  abilities  in  order  to  identify  a  system 
of  goals  related  to  the  public  interest  or  the 
common  good. 

It  is  clear  that  these  two  elements  represent  the 
basis  of  what  Charles  Lindblom  has  called  the 
rational-comprehensive  or  rational-synoptic  model 
of  decision  making,  whose  ideological  roots  are  to 
be  traced  back  to  the  18th-century  Enlightenment 
and  19th-century  positivist  philosophy,  with  their 
faith  in  the  reason  and  in  the  intellectual  capacities 
of  the  citizens  or  at  least  of  the  elites. 

The  two  aforementioned  postulates,  however, 
have  been  widely  criticized.  Starting  from  the  first 
postulate,  as  early  as  1947,  Herbert  Simon  has 
pointed  out  how  the  idea  that  all  alternatives  and 
all  their  consequences  are  known  is  highly  unreal¬ 
istic.  No  decision  maker  (and  no  planner)  has  all 
the  time,  the  information,  and  the  cognitive  abili¬ 
ties  required  by  the  rational  model  of  decision 
making.  The  consequence  is  that  the  maximum 
that  can  be  achieved  is  to  strive  for  a  limited  form 
of  rationality,  organizing  the  decision  through  a 
sequential  search  of  the  different  alternatives  and 
choosing  the  solution  that  is  considered  “good 
enough” — that  is,  that  meets  the  “satisficing”  cri¬ 
terion.  Often  this  implies  the  necessity  of  not  tak¬ 
ing  into  consideration  the  consequences  of  the 


preferred  alternative  for  all  the  desirable  goals,  and 
therefore,  it  contravenes  one  of  the  basic  require¬ 
ments  of  rational  planning — that  is,  the  idea  that 
everything  should  be  coordinated.  But  even  if  the 
subjective  aspect  of  the  postulate  is  satisfied  (i.e., 
the  decision  makers  have  all  the  relevant  knowl¬ 
edge),  one  has  to  remember  that  because  planning 
is  about  future  decisions  and  behaviors,  an  inherent 
level  of  uncertainty  exists,  as  external  shocks  can 
alter  some  basic  elements  and  forecasts  can  prove 
radically  wrong.  And  finally,  especially  in  contem¬ 
porary  welfare  states,  citizens  have  the  tendency  to 
raise  issues  or  problems  for  which  there  is  no  clear- 
cut  solution  available  (e.g.,  drug  addiction  or  long¬ 
term  unemployment).  In  such  cases,  it  is  obvious 
that  a  process  of  rational  choice  is  not  possible  and 
that  the  attempt  of  the  planners  to  allocate  the 
“right”  amount  of  resources  required  for  the 
achievement  of  the  goals  is  particularly  difficult. 

Preferences-Guided  Decision  Making 

These  examples  of  intractable  problems  call  into 
question  the  second  postulate  mentioned  above — 
that  it  is  possible  to  use  the  knowledge  and  the 
intellectual  skills  of  the  ruling  elite  in  order  to  iden¬ 
tify  a  set  of  goals  able  to  maximize  the  common 
good.  The  objection  here  has  to  do  with  the  distinc¬ 
tion,  again  introduced  by  Lindblom,  between  two 
different  views  of  the  society:  the  intellectually 
driven  society  and  the  preferences-guided  society. 
In  the  first  type  of  society,  the  faith  in  human  intel¬ 
lect  is  such  that  a  theory  of  social  change  is  shared, 
there  is  no  conflict  between  the  needs  of  the  indi¬ 
viduals  composing  the  society,  and  therefore  syn¬ 
optic  planning  is  possible  and  indeed  necessary  to 
secure  progress.  In  the  second  type  of  society,  the 
recognition  of  the  limitation  of  human  intellect  and 
the  fact  that  conflicts  are  not  only  inevitable  but 
also  valuable  in  securing  progress  makes  it  impos¬ 
sible  to  use  the  same  type  of  approach.  In  the  latter 
type  of  societies,  the  poliarchies,  the  solution  of  the 
collective  problems  can  be  informed  by  analysis  but 
is  eventually  determined  through  the  processes  of 
social  interaction,  based  on  the  preferences 
expressed  by  the  members  of  the  society.  In  the 
preferences-guided  society,  in  which  political  power 
is  shared  by  a  plurality  of  actors  and  institutions, 
the  basic  criterion  for  policy  choice  is  the  agree¬ 
ment  between  the  relevant  actors — in  Lindblom’s 
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words,  the  mutual  adjustment  between  them — and 
therefore,  planning  as  described  above  is  virtually 
impossible.  Not  only  is  reliable  knowledge  about 
the  expected  and  likely  outcomes  of  the  decisions 
absent,  but  even  more  fundamentally,  there  is  not 
necessarily  agreement  on  the  very  purpose(s)  of  the 
planning  exercise(s) — that  is,  the  goals  and  priori¬ 
ties  to  be  attained  through  public  actions.  Actually, 
Lindblom  contends  that  this  does  not  mean  that  in 
the  preferences-guided  society  there  is  no  planning 
but  that  a  different  form  of  planning,  labeled  stra¬ 
tegic  planning,  is  needed.  However,  from  the 
examples  he  proposes,  it  is  apparent  that  the  abil¬ 
ity  to  unequivocally  shape  future  choices — a  basic 
element  of  the  definition  of  planning  proposed 
above — is  drastically  limited.  According  to  his 
vision,  policy  making  in  pluralistic  societies  is  nec¬ 
essarily  incremental,  each  decision  departing  only 
slightly  from  the  status  quo.  To  steer  the  process  of 
decision  making  in  the  desired  direction,  the 
“planner”  can  only  try  to  anticipate  the  reactions 
of  the  other  actors,  adapting  his  or  her  own 
choices,  possibly  by  adopting  a  piecemeal  approach 
to  complex  problems.  In  this  way,  it  will  be  pos¬ 
sible  to  minimize  the  oppositions  in  the  short  run 
and  reach  a  more  advanced  equilibrium  from 
which  to  start  the  construction  of  the  next  stage  of 
policy  development.  As  one  Dutch  minister  once 
said — and  one  has  to  remember  that  in  the 
Netherlands  a  highly  sophisticated  machinery  for 
policy  planning  has  been  in  operation  for  a  long 
time — the  true  definition  of  a  plan  is  “what  we  can 
agree  upon” — the  task  of  political  power  consist¬ 
ing  in  the  ability  to  pave  the  road  toward  the  next 
plan.  Therefore,  even  if  Lindblom’s  strategic  plan¬ 
ning  is  fundamentally  different  from  what  is  gener¬ 
ally  understood  as  planning,  from  its  proponent 
point  of  view,  it  is  far  from  being  a  useless  exercise. 
This  style  of  policy  making,  as  shown  in  Lindblom’s 
The  Intelligence  of  Democracy,  is  not  only  consid¬ 
ered  more  desirable  from  the  normative  point  of 
view  but  also  is  seen  as  being  more  effective  in 
securing  progress  and  development  than  any  other 
alternative,  since  it  is  more  adaptable  to  the  com¬ 
plexities  of  contemporary  society. 

Systems  Theory 

An  even  more  radical  critique  of  rational 
planning — albeit  developed  at  a  different  and 


higher  level  of  abstraction — is  the  one  put  forward 
by  Niklas  Luhmann  (1995)  in  the  context  of  his 
systems  theory.  Basically,  says  Luhmann,  planning 
will  never  work  as  intended  because  “in  planning 
the  system  reacts  not  only  to  the  conditions  that 
are  attained,  to  the  success  or  failure  of  planning, 
but  also  to  the  planning  itself”  (p.  470).  The  mere 
existence  of  planning  modifies  the  behaviors  that 
it  was  aiming  to  shape  by  establishing  their  prem¬ 
ises  and,  therefore,  is  structurally  unable  to  reach 
its  goals.  Furthermore,  planning  introduces  a  nec¬ 
essarily  simplified  model  of  the  complexity  of  the 
system  into  the  system  itself. 

Considering  these  devastating  critiques,  one 
could  be  tempted  to  simply  give  in  and  abandon  the 
very  idea  of  planning  defined  as  the  attempt  to 
secure  the  coherence  of  a  set  or  a  sequence  of  deci¬ 
sions  in  relation  to  a  public  policy,  following  the 
conclusion,  reached  by  Aaron  Wildavsky  already  at 
the  beginning  of  the  1970s  that  “if  planning  is 
everything,  may  be  it’s  nothing.”  The  problem, 
however,  remains:  As  already  pointed  out,  planning 
is  a  very  common  activity,  but  it  is  also  somewhat 
necessary  because  of  the  need  to  give  a  meaning  to 
the  present  activities  by  shaping  future  choices. 
Hence,  several  attempts  have  been  made  to  propose 
alternative  forms  of  planning  that  are  able  to  over¬ 
come  the  previous  objections. 

Procedural  Approaches  to  Planning 

One  set  of  ideas  is  linked  to  the  possibility  of 
focusing  the  planning  exercises  not  so  much  on  the 
content  of  the  future  decisions  but  on  the  way(s)  in 
which  they  should  be  reached,  by  prescribing  the 
procedures  of  the  decision-making  process.  This 
can  be  done  with  two  different  purposes  in  mind. 
In  the  first  place,  and  more  in  line  with  tradition, 
the  prescribed  procedures  might  try  to  force  the 
participants  to  consider  more  carefully  the  likely 
consequences  of  their  preferred  choices  through, 
for  instance,  cost-benefit  analysis  or  similar  tech¬ 
niques.  Using  such  tools  in  a  multi-actor  context,  it 
is  assumed,  can  have  the  effect  of  decreasing  the 
danger  of  negative  externalities,  improving  policy 
coordination,  and,  in  general,  reaching  more  “ratio¬ 
nal”  decisions.  Unfortunately,  as  Fritz  Scharpf 
(1986)  remarked  more  than  20  years  ago,  the 
“improvement  of  the  policy  making  through  better 
meta-policy  ...  is  littered  with  the  skeletons  of  too 
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many  practical  failures  of  perfectly  logical  improve¬ 
ments  of  policy  making  institutions  and  proce¬ 
dures”  (p.  182).  The  reason,  he  contends,  quoting 
Wildavsky,  is  that  “no  amount  of  procedural  inno¬ 
vation  [isj  able  to  exorcise  the  inherently  political 
character  of  public  policy  choices”  (p.  182). 

Applications  of  Planning  in  Governing 

The  second  purpose  of  a  procedural  approach  to 
planning  is  more  subtle.  Bearing  in  mind  the  obser¬ 
vation  made  by  Lindblom  that  procedures  are 
highly  valuable  in  pluralistic  political  systems,  as  a 
way  of  preserving  personal  liberties  and  popular 
control,  Melvin  Webber  has  redefined  the  effec¬ 
tiveness  of  planning  by  saying  that  it  does  not  lie 
in  the  fact  that  the  plan  gives  the  “right  answers” 
to  policy  problems  but  rather  in  its  ability  to 
improve  the  process  of  public  debate  and  public 
decision.  Its  purpose  is  therefore  to  help  poliar- 
chies  make  policy  choices  in  an  acceptable  (and 
accepted)  way.  The  value  of  planning  as  a  set  of 
procedural  rules  adapted  to  the  problematic  situa¬ 
tions  they  have  to  tackle  is  therefore  also  an  end  in 
itself,  as  it  enhances  the  democratic  character  of  a 
given  society.  This  approach  has  been  influential  in 
recent  years — in  its  different  versions  variously 
labeled  as  communicative  planning,  participatory 
planning,  collaborative  planning,  and  deliberative 
planning — and  has  gained  widespread  acceptance 
in  the  urban  and  territorial  planners  community. 
However,  even  the  most  enthusiastic  supporters  of 
this  approach  cannot  deny  that  this  opening  up  of 
the  planning  process  might  actually  entail  the  dan¬ 
ger  of  powerful  self-interested  actors  being  able  to 
influence  the  process  away  from  the  pursuit  of 
common  good  in  order  to  maximize  their  own 
short-term  goals,  thus  frustrating  the  democratic 
value  of  the  entire  exercise. 

Strategic  Planning 

At  a  different  level,  an  attempt  to  revive  the 
discourse  on  planning  as  a  way  of  shaping  future 
choice  is  the  notion  of  strategic  planning  recently 
proposed  in  organizational  theory  and  in  territo¬ 
rial  planning.  This  proposal  is  rather  different 
from  the  one  put  forward  by  Lindblom,  being  basi¬ 
cally  centered  on  the  concept  of  framing.  Frames 
are,  according  to  Donald  Schon  and  Martin  Rein, 


systems  of  meaning  that  organize  what  we  know 
and  provide  conceptual  coherence,  direction  of 
action,  and  a  basis  of  persuasion.  Strategies,  there¬ 
fore,  are  emergent  social  products  and  conscious 
attempts  to  create  and  discover  such  frames  and 
are  able  to  shape  future  actions  through  the  per¬ 
suasive  power  of  their  core  concepts,  as  pointed 
out  by  Patsy  Healey,  following  the  work  of  Henry 
Mintzberg  in  organizational  theory. 

Planning,  in  its  original  meaning  of  deciding 
about  future  decisions,  becomes  again  possible, 
taking  into  consideration  both  the  problem  of 
insufficient  knowledge  and  the  complexity  of  con¬ 
temporary  political  systems.  The  clearest  applica¬ 
tion  of  this  line  of  reasoning  is  likely  found  in  the 
practice  of  strategic  planning  at  the  metropolitan 
level  as  it  emerged  in  the  past  20  years.  By  and 
large,  and  with  some  variation  across  the  cities, 
strategic  plans  are  exercises  based  on  the  mobiliza¬ 
tion  of  all  relevant  actors  in  a  given  urban  area,  on 
the  production  of  a  shared  long-term  vision  of  the 
desirable  future,  and  on  an  action  plan  for  immedi¬ 
ate  action.  The  idea  is  that  cities  are  not  able  to 
innovate  and,  therefore,  run  the  risk  of  decline, 
both  because  they  lack  a  clear  direction  and 
because  of  diffuse  veto  powers  and  a  lack  of  trust 
between  the  relevant  actors.  The  ability  to  produce 
a  shared  vision — that  is,  a  sufficiently  precise  and 
distinctive  image  of  the  desirable  future,  whose 
value  is  immediately  tested  in  the  preparation  of 
the  action  plan  (i.e.,  a  list  of  projects  coherent  with 
the  vision  and  needing  the  cooperation  of  the  main 
institutional,  economic,  and  social  actors) — can 
overcome  the  inertia  and  solve  at  the  same  time  the 
knowledge  problem  (the  uncertainty  about  conse¬ 
quences)  and  the  political  problem  (the  need  to 
make  the  decision).  The  value  of  linking  long-term 
vision  with  short-term  projects,  in  fact,  lies  not 
only  in  testing  the  viability  of  a  coherent  and  dis¬ 
tinctive  idea  of  a  desirable  future  but  also  in  the 
projects  of  the  action  plan  that  can  represent  quick 
wins  that  are  able  to  show  the  value  of  coopera¬ 
tion  and,  therefore,  to  overcome  distrust  and 
increase  social  capital. 

This  reemergence  of  planning  in  the  context  of 
modern  complex  governance  systems  should  not 
be  overemphasized.  These  attempts  are  very  diffi¬ 
cult,  certainly  fragile  because  of  unanticipated 
external  shocks,  and  necessarily  based  on  a  seg¬ 
mentation  of  the  problems  either  by  policy  sector 
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or  territorially,  as  in  the  case  of  urban  plans.  The 
strategic-framing  approach,  according  to  its  pro¬ 
ponents  and  followers,  is  in  no  way  devoid  of  risk 
and  uncertainty,  and  its  practitioners  should  there¬ 
fore  act  with  modesty  and  prudence. 

But  the  idea  of  the  rational,  comprehensive  plan 
that  is  able  to  allocate  scarce  resources  in  the  most 
efficient  way  to  the  relevant  societal  priorities  has 
been  shown  as  what  it  is,  a  mythological  notion, 
impossible  to  achieve,  dangerous  in  its  conse¬ 
quences,  and,  if  one  has  to  follow  the  postmodern¬ 
ist  school  of  thought,  a  mere  way  of  suffocating 
conflict  by  concealing  the  all-pervasiveness  of 
political  power  and  special  interests. 

The  Future  of  Planning 

However,  like  the  mythological  phoenix,  planning 
in  its  more  top-down,  synoptic,  and  comprehensive 
version  is  perpetually  reborn  from  its  ashes,  as  one 
can  easily  see  in  business  studies,  new  public  man¬ 
agement  discourse,  and  political  rhetoric.  Despite 
more  than  50  years  of  accumulation  of  knowledge, 
and  the  very  apparent  failures  of  those  exercises, 
the  myth  of  the  “intellectually  driven  society,”  to 
use  Lindblom’s  terms,  in  which  comprehensiveness 
and  perfect  coordination  of  policy  efforts  can  and 
indeed  should  be  achieved,  continues.  This  possibly 
means  that,  on  the  one  hand,  the  “desirable  imper¬ 
fections”  of  democracy  in  contemporary  complex 
governance  systems  have  not  been  fully  appreciated 
and,  on  the  other,  that  the  quest  for  neutral  guid¬ 
ance  through  science  and  technology  is  deeply 
embedded  in  modern  society  and  cannot  be  exor¬ 
cized  by  any  amount  of  social  and  political  inquiry. 
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Pluralism 


Pluralism,  a  term  first  used  by  Christian  Wolff  and 
Immanuel  Kant,  gained  currency  as  a  philosophical 
view  of  the  universe  in  the  early  20th  century. 
American  pragmatism,  developed  by  William 
James,  conceived  the  world  in  pluralistic  rather 
than  in  monistic  terms:  interconnected  but  irreduc¬ 
ible  to  unity,  its  parts  self-governed,  not  a  “block 
universe”  but  rather  a  “federal  republic.”  Philo¬ 
sophical  pluralism,  as  John  Dewey  had  observed 
earlier,  acknowledged  the  possibilities  of  variety,  of 
freedom,  and  of  change.  The  concept’s  applicability 
to  political  and  cultural  contexts  was  rapidly 
noticed  by  English  and  American  thinkers:  James’s 
pluralistic  universe  could  be  construed  as  a  polity 
where  groups,  possessing  inherent  rights  not  con¬ 
ceded  by  the  state,  elicited  individual  loyalties  and 
pursued  social  ends.  The  approach  came  to  assume 
central  significance  both  for  Harold  Laski’s  plural¬ 
istic  theory  of  the  state  and  for  Horace  Kallen’s 
vision  of  a  commonwealth  of  different  ethnic 
groups  and  their  cultures.  Pluralism  offered  an  ana¬ 
lytical  and  a  normative  notion  that  responded  to 
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the  increasingly  associative  character  of  society,  the 
rise  of  governmental  interventionism,  the  lobbying 
activities  of  organized  groups,  the  nascence  of 
immigrant  subcultures,  and  the  maldistribution  of 
political  resources.  This  entry  explains  the  concept 
of  pluralism  as  it  is  used  in  political  and  cultural 
contexts.  It  then  considers  alternative  conceptions 
that  lead  further  into  the  theory  of  interest  repre¬ 
sentation.  The  entry  concludes  with  a  look  at  the 
relevance  of  pluralism  in  the  21st  century,  particu¬ 
larly  with  respect  to  ethno-cultural  groups  and  to 
all  forms  of  inequality. 

Positively,  the  concept  establishes,  first,  the  exis¬ 
tence  of  a  plurality  of  interests  and  corresponding 
social  groups  that,  as  latent  centers  of  power,  may 
organize  into  associations  and,  second,  the  trans¬ 
formation  of  this  diversity  into  public  policies 
through  pressure  exerted  on  each  other  and  on 
governments.  Normatively,  the  concept  endorses 
the  process  of  individuals  turning  to  organization, 
the  resulting  formation  of  interest  groups  as  sub¬ 
jects  of  democratic  politics,  and  the  sequence  of 
group  conflict,  bargaining,  and  compromise  that 
characterizes  the  shaping  of  public  policies,  on 
condition  that  basic  rights  and  the  principles  of 
justice  remain  respected  and  protected. 

Pluralism  may  thus  be  defined  as  both  a  descrip¬ 
tive  and  a  prescriptive  theory  of  individual  partici¬ 
pation  by  social  association  in  the  political  process. 
Groups,  in  other  words,  are  envisaged  to  operate  as 
instruments,  representing  individuals  rather  than 
replacing  them  in  the  political  process,  thereby 
enhancing  their  chances  for  individual-centered 
democracy  in  a  world  of  increasingly  complex 
sociopolitical  interaction.  Political  man  acting 
through  and  in  control  of  his  freely  established  asso¬ 
ciations:  That  has  been  pluralism’s  normative  vision. 

As  with  every  representative  arrangement,  how¬ 
ever,  inbuilt  tensions  remain  between  the  original 
individual  interests  and  eventual  group  (i.e.,  lead¬ 
ership)  action.  To  the  extent  that  such  group  action 
today  has  come  to  be  treated  as  a  substitute  for 
individual  action  in  reaching  political  outcomes, 
the  chances  for  individual-centered  democracy  are 
diminished.  This  inference  is  especially  relevant 
because  political  resources  enter  the  picture  as  a 
second  pivotal  element.  Their  grossly  unequal  dis¬ 
tribution  skews  the  political  process  in  favor  of 
powerful  minorities.  More  often  than  not,  from  the 
viewpoint  of  the  theory  and  practice  of  democracy, 


noncommitted  or  indoctrinated  citizens  who  are 
caught  in  a  web  of  hierarchical  organizations  are 
indicative  of  a  less  than  satisfactory  reality. 

Over  time,  pluralism  has  vacillated  between 
diagnosing  severe  democratic  deficits  in  the  politi¬ 
cal  process  and  accepting  the  ongoing  results  of 
that  process.  Discussions  of  ethno-cultural  plural¬ 
ism  have  evolved  largely  separately  from  the 
debate  on  political  pluralism  and  have  raised  addi¬ 
tional  thorny  problems,  which  are  discussed  below. 

Political  Pluralism 

Borrowed  from  James’s  pragmatist  philosophy, 
the  term  pluralism  was  first  introduced  into  politi¬ 
cal  science  by  the  British  Labour  Party  intellectual 
Harold  Laski  in  a  1915  lecture  delivered  at 
Columbia  University.  Even  before  World  War  I, 
business  corporations  and  industrial  combines — 
the  first  multinationals  among  them — had  risen  to 
prominence,  underscoring  the  unequal  distribution 
of  power  between  labor  and  capital.  However,  if 
continuing  entrepreneurial  hegemony  seemed 
assured,  a  labor  movement  had,  after  bitter  strug¬ 
gles,  also  emerged.  Unions  had  been  organized, 
and  in  most  industrialized  countries,  the  right  to 
strike  had  been  won. 

Concurrently,  the  19th  century’s  rigid  class 
structure  had  started  to  dissolve.  The  working 
class  was  segmenting  into  numerous  blue-  and 
white-collar  strata — groups,  in  fact — differentiated 
by  vocation  and  attitude,  by  income  and  educa¬ 
tion,  and,  again,  by  grossly  unequal  influence  and 
control  both  economically  and  politically. 

The  intellectual  climate  seemed  to  be  ready  for 
a  “new”  political  concept,  reformulating  the 
notions  of  freedom  and  democracy  in  a  determined 
attempt  to  attain  the  “good  society”  in  the  context 
provided  by  organized  capitalism  and  the  large 
nation-state.  The  answer  was  a  theory  of  groups 
and  associations;  of  positive,  interventionist 
government;  and  of  industrial  democracy  as  a 
complement  of  political  democracy,  which  Laski 
put  forward  between  1915  and  1925. 

Laski’s  thinking  was  considerably  influenced  by 
the  ideas  of  the  Fabian  Society,  an  intellectual  circle 
of  “respectable”  socialists — established,  among  oth¬ 
ers,  by  Sidney  and  Beatrice  Webb  and  George 
Bernard  Shaw — that  had  taken  part  in  the  forma¬ 
tion  of  the  British  Labour  Party.  Another  significant 
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influence  was  the  argument  of  the  guild  socialists 
(“young  rebels”  in  the  Fabian  ranks,  building  on 
French  anarcho-syndicalism),  particularly  George 
D.  H.  Cole,  who  held  that  associations  sprang  up  in 
society  according  to  the  logic  of  functional  differen¬ 
tiation  and  that  self-government,  consequently,  was 
identical  with  functional  representation  on  every 
social  level.  That  definitely  included  the  workplace, 
the  factory,  the  enterprise;  in  a  nutshell,  it  repre¬ 
sented  the  control  of  production  by  worker  organi¬ 
zations,  since  individuals  (having,  by  steps,  been 
enfranchised  in  the  political  sphere)  had  remained 
“enslaved”  by  industrial  autocracy  in  the  economic 
sector. 

Like  Cole,  Laski  remained  convinced  that  no 
political  democracy  could  be  real  without  being 
underpinned  by  an  economic  democracy.  His  con¬ 
cept  focused  on  control  (rather  than  on  ownership) 
of  the  means  of  production  and  on  the  enfran¬ 
chisement  of  the  citizens  (blue-  and  white-collar 
employees)  of  the  political  body  that  was  and  is  the 
modern  enterprise — its  proper  “stakeholders,”  in 
today’s  parlance.  Nationalizations  would  leave  the 
position  of  these  employees  unchanged.  What  they 
needed  was  a  share  in  the  actual  decision  making. 

It  required  the  Great  Depression  of  1929  and  the 
circumstances  of  the  formation  of  the  British 
National  Government  in  1931  for  Laski  to  move 
more  clearly  in  a  Marxist  direction,  without,  how¬ 
ever,  as  has  been  erroneously  suggested,  “rejecting” 
pluralism.  Rather,  by  combining  pluralism  and 
Marxism,  he  proposed  in  1937  to  transcend  the 
capitalist  system,  envisaging  not  violent  action  but, 
in  a  term  Laski  (1948)  was  to  coin  during  World 
War  II,  a  revolution  by  consent.  Only  in  a  classless 
society,  he  now  contended,  could  authority  be 
“pluralistic  both  in  form  and  expression”  (p.  xii). 

Cultural  Pluralism 

Likewise  inspired  by  James’s  philosophy,  Kallen 
introduced  the  term  cultural  pluralism  into  the 
American  debate  in  1924.  Immigrant  subcultures 
were  by  then  flourishing  in  the  eastern  United 
States,  after  nearly  15  million  immigrants — mostly 
from  Southern  and  Eastern  Europe — had  been 
admitted  to  the  country  between  1901  and  1920. 
Writing — not  unlike  Laski — between  1915  and 
1924  and  arguing  against  assimilating  pressures  and 
melting  pot  conformity,  Kallen  offered  his  vision  of 


a  federated  republic  of  different  nationalities. 
Convinced  that  self-government  was  impossible 
without  self-realization,  that  the  latter — in  the 
sense  of  personal  identity — hinged  on  the  assertion 
of  ethnic  differences,  and  that  society’s  creativity 
would  benefit  from  such  heterogeneous  strains,  he 
proposed  granting  equal  treatment  to  each  ethno¬ 
cultural  tradition. 

Both  Kallen’s  and  Laski’s  agendas  favored  a 
politics  of  inclusion,  in  the  sense  that  policy  mak¬ 
ing  in  pluralist  democracies  should  embrace,  on  an 
equitable  basis,  as  many  societal  interests  as  pos¬ 
sible.  Both  were  rediscovered — in  substance,  if  not 
through  explicit  recourse — after  World  War  II. 
Laski’s  was  rediscovered  by  the  American  political 
scientist  Robert  Dahl,  who  in  his  later  works 
returned  to  pluralism  the  democratizing  dimension 
that  the  British  thinker  had  first  supplied  (indus¬ 
trial  self-government — an  employee-controlled 
economy — as  the  main  prerequisite  of  a  more  par¬ 
ticipatory  democracy).  Kallen’s  ideas  were  redis¬ 
covered  by  a  succession  of  theorists,  such  as  Tariq 
Modood  in  Britain  and  Will  Kymlicka  in  Canada, 
who  supported  progressing  toward  multicultural 
acceptance,  avoiding  both  fragmentation  and  con¬ 
formity  (a  plural  state,  informed  by  a  “politics  of 
recognition,”  as  the  vehicle  for  achieving  true  mul¬ 
ticultural  citizenship). 

Even  before  Laski  and  Kallen,  Arthur  Bentley 
had  presented  an  approach  that  essentially  reduced 
human  behavior  to  group  action.  Describing  his 
effort  as  “strictly  empirical,”  rejecting  both  indi¬ 
vidualism  and  institutionalism,  Bentley  in  his  1908 
treatise  The  Process  of  Government  aimed  at  intro¬ 
ducing  the  group  as  the  central  analytical  category. 
If  the  immediate  impact  of  his  book  was  negligible, 
that  situation  would  change  after  a  considerable 
number  of  pressure  group  studies  had  been  pub¬ 
lished  in  the  United  States  between  World  War  I 
and  World  War  II  and  after  the  New  Deal  reform 
period  had  finally  established  organized  labor  and 
organized  agriculture  as  political  players  alongside 
business  in  bargaining  for  political  benefits.  Bentley’s 
work  was  resuscitated  in  the  early  1950s  by  David 
Truman,  Earl  Latham,  and  other  group  theorists 
who  judged  that  organized  interest  groups  made  up 
the  principal  ingredient  of  present-day  government. 

At  the  same  time,  against  the  backdrop  of  the 
Cold  War,  the  need  was  felt  for  a  “legitimating 
discourse,”  according  to  Richard  Merelman, 
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designed  to  explain  and  justify  the  political  sys¬ 
tems  of  the  free  world,  meaning  the  United  States 
and  Western  Europe.  Stripped  of  most  of  its  pre¬ 
scriptive  implications,  elevated  to  the  status  of  an 
antitotalitarian  (rather  than  antimonist)  public 
philosophy,  the  concept  of  pluralism  seemed  to 
serve  the  purpose  perfectly.  The  1950s  to  1970s 
were  the  heyday  of  the  academic  and  political  dis¬ 
course  on  pluralist  democracy. 

In  its  entirety,  the  approach  resembled  nothing 
more  than  a  modified  marketplace  model  ( “interest- 
group  liberalism,”  in  Theodore  Lowi’s  designation), 
with  the  individual  replaced  by  organizational 
competition.  Such  “liberal”  pluralism,  while  claim¬ 
ing  to  disregard  considerations  of  a  normative  kind 
(and  thus  contrasting  with  Laski’s  “radical”  vari¬ 
ety),  was  admittedly  biased  toward  group  leader¬ 
ship.  It  was,  in  fact,  an  elite  concept,  according  to 
which  leaders  of  associations  conducted  the  process 
of  organized  pressure  and  bargaining,  thereby  con¬ 
trolling  each  other.  Control  among  leaders,  how¬ 
ever,  provided  but  one  significant  attribute  of  the 
political  process.  A  second  was  control  of  leaders 
by  means  of  periodic  elections,  holding  them 
accountable  to  party  or  interest-group  members 
and  to  the  electorate  at  large. 

Both  Truman  and  Dahl  (even  in  his  early  works) 
conceded  that  control  of  group  leaders  and  access 
to  government  were  determined,  to  a  considerable 
extent,  by  income,  education,  and  status.  Cross¬ 
pressures,  resulting  from  conflicting  group  loyal¬ 
ties,  might  lead  to  political  apathy.  Political 
resources  were  unequally  distributed,  most  con¬ 
spicuously  between  business  and  labor.  Capitalist 
democracies  offered  extensive  opportunities  for 
“pyramiding”  such  resources  into  structures  of 
social  power  and  political  influence. 

Conceptual  Alternatives 

In  view  of  these  limitations,  two  avenues  for  fur¬ 
ther  theorizing  suggested  themselves.  Both  were 
pursued  from  the  mid-1970s.  One  was  to  unequiv¬ 
ocally  embrace  “realistic”  Schumpeterianism — that 
is,  fitting  groups  into  a  model  of  elitist  democracy 
where  governments  would  explicitly  privilege  the 
organizations  of  capital  and  labor  as  “partners”  in 
policy  formulation  and  implementation  over  asso¬ 
ciations  with  weaker  political  resources.  Group 
leaders  would  manufacture  consent  for  policies 


resulting  from  institutionalized  “interest  interme¬ 
diation,”  by  which  group  leaders  are  able  to 
reshape  social  interests  in  order  to  make  them  com¬ 
patible  with  public  policies.  Group  members  would 
consequently  be  mobilized  and  controlled  in  a  top- 
down  process.  Such  conceptions  of  liberal  (or  neo-) 
corporatism,  as  they  came  to  be  labeled,  were  par¬ 
ticularly  developed  in,  and  focused  on,  Western 
Europe,  with  Philippe  Schmitter  and  Gerhard 
Lehmbruch  in  the  forefront  of  neo-corporatist 
writers. 

Consequently,  the  neo-corporatist  school  of 
thought  has  insisted  that  the  socioeconomic  attri¬ 
butes  around  which  interests  organize  are  indeed 
unequally  distributed;  that  existing  inequalities  are 
further  reinforced  by  the  politics  of  interest  asso¬ 
ciations,  not  the  least  because  certain  associations, 
particularly  organized  business  and  organized 
labor,  are  granted  privileged  access  to  governmen¬ 
tal  decision  making;  and  that  political  processes 
typically  take  the  form  of  oligarchically  structured 
interest  intermediation  through  bargaining  among 
leaders. 

Proponents  of  corporatism  and  liberal  pluralism 
alike  have  usually  conceded  that,  more  often  than 
not,  pluralist  and  corporatist  arrangements  may  be 
found  in  mixed  combinations  (Germany  provides 
a  typical  example).  Both  varieties  may,  in  fact,  not 
perform  too  differently,  considering  what  liberal 
pluralism  had  to  say  about  bargaining  among  lead¬ 
ers  and  about  the  possible  pyramiding  of  unequally 
distributed  political  resources. 

Because  trade  unions  typically  have  difficulty  in 
gaining  the  support  of  their  members  on  agree¬ 
ments,  the  involvement  of  social-democratic  par¬ 
ties  as  predominant  players,  able  to  secure  trade 
union  support,  has  been  considered  a  salient 
requirement  of  corporatist  structures.  To  the 
extent  that  governments  led  by  social-democratic 
parties,  accepting  the  notion  of  international  com¬ 
petitiveness  as  the  new  social  and  economic  ortho¬ 
doxy,  have  joined  conservative  administrations  in 
attempts  at  mediating  the  consequences  of  global¬ 
ization  by  the  pursuit  of  deregulatory  and  privati¬ 
zation  policies,  the  social-democratic/trade  union 
alliance  has  been  eroded. 

Neo-corporatism,  moreover,  always  remained 
very  much  a  Europeanist  concept.  With  regard  to 
the  United  States,  observers  continued  to  note 
that  because  of  the  political  strength  of  corporate 
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business  and  the  corresponding  weakness  of  labor, 
no  genuine  interest  in  such  an  arrangement  existed 
on  the  part  of  the  American  business  elite. 

The  second  conceptual  alternative,  according  to 
Dahl,  consisted  in  raising  the  normative  question  of 
how  one  might  remedy  the  defects  of  pluralism.  This 
meant  conceiving  pluralism  in  terms  of  a  normative 
concept  of  democratic  transformation  as  it  had  ear¬ 
lier  emerged  in  England.  Looking  for  possible  solu¬ 
tions  to  what  “authority  in  a  good  society”  might 
be  like,  Dahl — ever  more  critical  of  institutional 
rigidity,  social  inequality,  and  political  apathy — 
suggested  a  radical  alternative  to  the  present  status 
quo.  The  large  business  corporation  and  the  politi¬ 
cally  privileged  position  of  corporate  executives — 
the  latter,  as  held  by  Dahl  and  his  fellow  critic 
Charles  Lindblom,  being  subject  to  neither  effective 
internal  control  by  stockholders  nor  effective  exter¬ 
nal  control  by  governments  and  markets — became 
the  major  targets  for  the  proposed  participatory 
reforms.  In  a  way  reminiscent  of  Laski’s  earlier 
efforts,  but  more  systematically,  Dahl  argued  that 
control  over  the  political  body,  which  the  modern 
business  enterprise  has  in  fact  become,  should  no 
longer  rest  with  investors  or  managers  but  rather 
with  blue-  and  white-collar  employees.  Starting 
from  the  premise  that  unequal  social  resources  will 
translate  into  unequal  political  resources,  Dahl’s 
program  focused  on  diminishing  the  discretionary 
exercise  of  organizational  power  by  the  economi¬ 
cally  privileged  minorities. 

Ethno-Cultural  Pluralism 

Such  radical  pluralist  propositions  have  been  merg¬ 
ing  into  the  larger,  more  comprehensive  debate  on 
democratization,  a  spillover  of  democratic  norms 
onto  economy  and  society.  The  attribute  “politi¬ 
cal”  has  thus  been  reconceptualized,  now  relating 
to  any  form  of  group  decision  making,  whose 
democratization  would  be  intended  (as  Amitai 
Etzioni  has  emphasized)  to  make  all  sectors  of  soci¬ 
ety  more  responsive  to  their  members.  As  a  caveat, 
it  should  be  added,  however,  that  the  extent  to 
which  such  visions  of  increased  citizen  competence 
and  control  might  be  accepted,  internalized,  and 
practiced  by  increasing  segments  of  present-day 
consumption-oriented  societies  is  far  from  clear. 

On  top  of  that,  neoliberal  maxims  have  become 
firmly  entrenched  with  the  progress  of  financial 


and  economic  globalization.  Bent  on  cutting  busi¬ 
ness  regulation  and  welfare  expenditure,  and  opting 
for  the  privatization  of  public  services,  governmen¬ 
tal  and  market  players  alike  have  worked  at  reduc¬ 
ing  the  size  of  government.  The  “reform”  label  has 
been  put  to  service  as  a  facade  for  such  programs. 
Ongoing  globalization  has  not  simply  diminished 
governance  capabilities  due  to  the  pressure  of  mul¬ 
tinational  investors  and  foreign  competition — it  is 
governments  themselves  that  have  determinedly 
been  restricting  their  performance.  Such  promarket 
state  intervention  urgently  calls  for  institutional  and 
attitudinal  changes  that  would  again  reinforce 
political  pluralism. 

Concurrently,  the  migration  component,  which 
the  process  of  globalization  involves,  has  been  fur¬ 
ther  pluralizing  and  diversifying  Western-type  soci¬ 
eties  along  ethno-cultural  lines.  Once  again,  the 
pattern  of  societal  cleavages  and  linkages  has  been 
changing.  The  fragmentation  of  interests  is  being 
advanced,  and  traditional  institutional  loyalties  are 
put  in  jeopardy.  Socioeconomic  inequalities  and 
insecurities  rank  high  among  the  factors  that  have 
been  fueling  the  rediscovery  of  ethnic-based  identi¬ 
fication  as  a  source  of  belonging  among  these 
increasingly  multicultural  populations. 

When  Kallen  proclaimed  a  “democracy  of 
nationalities”  as  the  desired  goal  of  cultural  plural¬ 
ism,  he  wanted  individuals  in  every  ethnic  group  to 
realize  their  inherent  possibilities  according  to  that 
group’s  cultural  traditions  rather  than  being  dis¬ 
criminated  against  by  culturally  privileged  majori¬ 
ties.  Two  generations  later,  Giovanni  Sartori 
stressed  that  pluralism,  because  it  considered 
diversity  a  pivotal  value,  should  first  and  foremost 
focus  on  cultural  diversity,  aiming  at  a  “cross- 
fertilized”  rather  than  a  “tribalized”  culture. 
Pluralist  policies  should  accept  (recognize)  the  cul¬ 
tural,  religious,  and  linguistic  heterogeneity  of 
different  ethnic  groups,  without  inviting  further 
societal  segmentalization. 

Strategies  of  differentiated  group  treatment 
(e.g.,  affirmative  action  procedures  in  employment 
and  education,  introduction  of  official  multilin¬ 
gualism,  a  composition  of  political  bodies  reflect¬ 
ing  the  existence  of  various  ethnic  groups)  may 
work  to  reinforce  cultural  pluralism.  However, 
these  policies  do  not  maintain  neutrality  toward 
ethno-cultural  differences.  Remaining  neutral 
would  imply  that  discrimination  on  ethnic  grounds 
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is  legally  prohibited  while  benefits  are  solely  pro¬ 
vided  according  to  individual  eligibility. 

Most  of  the  relevant  debate  has  centered  on  the 
controversial  question  of  how  to  balance  individ¬ 
ual  against  group  rights.  Might  not  any  determined 
movement  in  the  direction  of  group  rights  work  to 
endanger  individual  autonomy  and  to  bar  indi¬ 
viduals  from  opting  out  of  their  group  if  they  wish 
to  adopt  ideas  and  practices  running  counter  to 
their  ethno-cultural  heritage?  To  be  effective,  need 
not  policies  of  differentiated  group  treatment  show 
an  awareness  of  the  connection  between  economic 
and  cultural  power?  A  more  equitable  distribution 
of  social  and  political  resources  might  help  indi¬ 
viduals  make  meaningful  choices,  including  the 
choice  to  exit  a  group  perceived  as  confining. 

A  pluralist  politics  informed  by  a  spirit  of  both 
political  participation  and  social  justice  clearly 
would  concede  ethno-cultural  groups  some  politi¬ 
cal  standing  and  legal  rights — but  to  what  extent? 
Available  options  for  overcoming  ethnic  cleavages 
include  legal  protection  and  public  funding  for  the 
expression  of  cultural  peculiarities,  federalism  as  a 
form  of  self-government,  and  finally,  group-based 
political  representation,  up  to  the  complex  arrange¬ 
ments  of  consociationalism  (in  which  representa¬ 
tives  of  varied  groups  share  decision-making 
power).  However,  consociational  government  has 
been  shown  to  favor  elite  (group  leader)  predomi¬ 
nance,  arcane  negotiations,  and  political  immobil- 
ism  rather  than  a  more  vibrant  pluralist  democracy. 

Pluralism  in  the  21st  Century 

Persisting  inequalities  of  socioeconomic  (and,  in 
addition,  ethno-cultural)  influence  and  control, 
equivalent  to  so  many  embedded  participatory  bar¬ 
riers,  have  keep  resurfacing  during  this  overview  of 
research  on  pluralism  and  democracy.  Resulting  in 
unequal  chances  for  the  organized  representation 
of  interests  and,  hence,  in  limits  to  more  robust 
redistributive  and  regulatory  public  policies,  they 
will  remain  among  the  major  issues  bedeviling 
21st-century  democracy.  As  a  powerful  approach 
to  inquiring  into  these  issues,  pluralism  will  be  of 
continuing  relevance,  provided  it  retains  the  role  of 
a  critical  political  theory — critical,  as  it  sporadi¬ 
cally  has  been,  of  the  status  quo  of  concentrated 
economic,  political,  and  cultural  power.  This,  in 
turn,  requires  that  two  problems  remain  foremost 


among  the  concerns  addressed  by  political  science: 
first,  securing  broader  societal  participation — and 
thus,  again,  a  more  equitable  representation  of 
social  interests — in  the  shaping  of  public  policies 
and,  second,  reducing  disparities  in  control  over 
political  resources  to  ensure  the  accessibility, 
accountability,  and  (in  the  final  instance)  legiti¬ 
macy  of  a  representative  democratic  government. 
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Pluralist  Interest 
Intermediation 


Modern  democracies  are  determined  by  a  great 
number  of  diverse  interests.  That  is  why  structures 
and  actors  that  show  and  assert  individual  social 
interests  are  necessary.  Nevertheless,  although 
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interests  can  be  articulated  individually,  a  collec¬ 
tive  and  organized  representation  of  interests, 
called  interest  intermediation,  promises  greater 
success.  Because  of  this,  organized  interest  inter¬ 
mediation  plays  an  increasingly  decisive  role  in  the 
political  process.  It  belongs  to  democracy  “like  the 
piston  belongs  to  the  cylinder”  (Ralf  Kleinfeld, 
2007,  p.  7),  because  without  organized  aggrega¬ 
tion  and  articulation  of  interests,  modern  societies 
and  democratic  forms  of  government  would  not 
be  able  to  act.  Modern  democracies  show  very  dif¬ 
ferent  structures  of  interest  intermediation. 
Corporate  and  pluralistic  systems  of  interest  inter¬ 
mediation  can  be  distinguished.  The  interest  of 
political  science  in  this  process  lies,  in  particular, 
in  the  identification  of  regular  patterns  of  interest 
intermediation  and  conflict  resolution.  However, 
before  the  process  of  interest  intermediation  is 
dealt  with  in  more  detail,  the  concept  of  interest 
must  first  be  defined.  This  is  followed  by  a  discus¬ 
sion  of  major  actors,  their  patterns  of  interaction, 
and  the  respective  concepts  in  political  science. 

According  to  Carola  Schulz  (1984),  interests  can 
be  defined  as  “action-relevant  consolidations  of 
needs  which  arise  from  the  subjective  perception  of 
situations  where  a  distinct  feeling  of  want  is  pres¬ 
ent”  (p.  15).  For  citizens’  needs  to  find  their  way 
into  the  political  system,  they  must  be  articulated 
and  organized.  It  is  exactly  here  that  political  inter¬ 
est  intermediation  applies.  Since  political  systems 
are  social  systems  that  make  collectively  binding 
decisions,  it  is  helpful  to  bring  in  a  theoretical  per¬ 
spective  from  systems  theory  in  order  to  illustrate 
the  process  of  interest  intermediation  more  closely. 
Here,  the  classic  model  developed  by  David  Easton 
can  serve  as  a  basis.  To  be  sure,  Easton’s  model 
expresses  a  concept  of  democratic  ideals,  but  it 
facilitates  a  brief  representation  of  the  procedural 
character  of  interest  intermediation.  For  Easton,  the 
decisive  actors  of  the  political  system  are  the  parties 
and  interest  groups,  on  the  one  hand,  and  govern¬ 
ment  and  the  administration,  on  the  other. 
Interactions  between  these  actors  are  aimed  at  gen¬ 
erating  collectively  binding  decisions.  Demands  for 
such  decisions  are  brought  into  the  political  system 
as  inputs,  processed  where  appropriate,  and  con¬ 
verted  into  outputs.  At  best,  these  outputs  result  in 
the  fulfillment  of  these  demands  and  thus  enhance 
the  legitimization  of  the  political  actors  and  there¬ 
fore  the  political  system  itself.  In  turn,  this  may  lead 


to  new  demands  being  brought  into  the  political 
system  in  a  feedback  cycle.  For  this  reason,  Easton 
places  both  political  demands  and  political  support 
on  the  input  side.  In  this  context,  politics  can  be 
defined  as  the  process  of  the  production  and 
enforcement  of  obligatory  cross-societal  decisions. 

Political  demands  are  the  necessary  raw  mate¬ 
rial  for  the  political  process.  During  their  process¬ 
ing,  however,  problems  can  occur  since,  on  the  one 
hand,  not  all  demands  can  be  integrated  and,  on 
the  other,  not  each  demand  for  a  collectively  bind¬ 
ing  decision  can  be  fulfilled.  Demands  can  contra¬ 
dict  each  other,  and  a  state’s  resources  are  limited. 
According  to  Easton,  for  demands  to  enter  the 
political  system  at  all,  structural  mechanisms  such 
as  parties  and  organized  interest  groups  are  neces¬ 
sary.  As  such,  they  have  an  intermediate  function 
between  the  political  system  and  society  at  large. 

Actors  of  Political  Interest  Intermediation 

In  modern  democracies,  various  actors  apart  from 
political  parties  assume  the  task  of  aggregating  and 
articulating  the  interests  of  the  population.  In  this 
way,  it  can  be  ensured  that  the  political  system 
remains  stable  and  can  adapt  to  new  tasks  at  the 
same  time.  Modern  democracies  rely  on  highly 
effective  forms  of  interest  intermediation  from  both 
a  normative  and  a  functional  point  of  view.  The 
high  complexity  of  political  interests  and  social 
demands  makes  the  work  of  specialized  actors  of 
interest  intermediation  indispensable.  An  efficient 
system  of  political  interest  intermediation  is  the  cen¬ 
tral  precondition  for  the  legitimacy  of  collectively 
binding  decisions.  Ideally,  political  interest  interme¬ 
diation,  therefore,  connects  the  contents  of  politics 
to  the  will  of  the  sovereign  and  facilitates  political 
decision  making  by  reducing  complexity. 

Political  parties  continuously  fulfill  the  function 
of  interest  intermediation  since  they  are  the  only 
ones  to  work  on  cross-societal  problems;  but  asso¬ 
ciations,  churches,  unions,  social  movements,  non¬ 
governmental  organizations  (NGOs),  and  other 
groups  also  represent  important  interests — even 
more  so  if  they  succeed  in  representing  the  total 
interest  of  their  organized  members.  While  indi¬ 
vidual  interest  associations  usually  attempt  to  influ¬ 
ence  the  political  process  through  lobbying,  social 
movements  attempt  to  prevent  or  reverse  decisions 
or  create  social  change  through  unconventional 
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actions.  In  addition,  since  the  1990s,  in  many  coun¬ 
tries,  NGOs  have  played  an  increasingly  stronger 
role  in  the  intermediation  of  social  interests.  Similar 
to  social  movements,  many  NGOs  claim  to  be  com¬ 
mitted  to  broader  social  values,  but  they  also  can 
be  distinguished  from  social  movements  by  their 
higher  degree  of  organization. 

The  conditions  of  political  interest  intermedia¬ 
tion  have  been  changing  more  recently,  however. 
Both  exogenous  and  endogenous  changes  are 
responsible  for  this.  On  the  one  hand,  the  number 
of  political-administrative  actors  has  expanded 
(and  thus  the  possibility  for  influence)  through  the 
formation  of  supranational  institutions,  such  as 
the  European  Union.  On  the  other  hand,  individu¬ 
alization  and  pluralization  of  social  interests  com¬ 
plicate  their  integration  and  organization.  Thus,  it 
is  no  surprise  that  a  pluralization  of  the  representa¬ 
tions  of  interests  can  be  observed,  which  is  accom¬ 
panied  by  a  certain  loss  of  importance  of  major 
traditional  intermediate  actors  such  as  unions  or 
employers’  associations. 

Political  Interest  Intermediation 

as  a  Subject  of  Political  Science 

If  we  want  to  abstract  from  the  actors  and  analyze 
interest  intermediation  from  a  political  science 
point  of  view,  it  is  useful  to  distinguish  aspects  of 
policy,  polity,  and  politics.  Since  the  intermedia¬ 
tion  of  interests  can  be  regarded  as  a  permanent 
political  process,  it  falls  into  the  procedural  dimen¬ 
sion  of  politics.  Yet  the  basis  for  interest  interme¬ 
diation  lies  in  the  principle  of  a  liberal  political 
order — the  polity  dimension.  In  this  respect,  basic 
freedoms  are  the  freedoms  of  expression  and  asso¬ 
ciation  as  well  as  the  right  to  vote.  The  policy 
dimension  is  equally  affected  by  social  interest 
intermediation  since  government  policies  are 
strongly  influenced  by  the  interests  of  society 
(Ulrich  von  Alemann,  1987). 

Due  to  the  variety  and  conflicting  nature  of 
interests  and  the  scarcity  of  resources,  a  pluralist 
democratic  political  process  is  continuously  antag¬ 
onistic.  Although  the  disputes  concerning  the  dis¬ 
tribution  of  benefits,  the  satisfaction  of  needs,  or 
the  pursuit  of  ideals  thus  mean  permanent  con¬ 
flicts,  these  must,  however,  not  be  conducted  in 
violent  ways,  endangering  the  system  as  a  whole. 


This  is  an  essential  task  of  political  interest  inter¬ 
mediation.  Thus,  in  political  science,  a  conflict¬ 
regulating  function  is  ascribed  to  the  process  of 
interest  intermediation  from  which  stability  and 
legitimacy  in  Easton’s  sense  can  arise. 

Pluralist  Interest  Intermediation 

A  general  distinction  in  political  science  is  made 
between  pluralism  and  corporatism  as  the  two 
essential  strategies  of  political  interest  intermedia¬ 
tion.  The  concept  of  pluralism  regards  interest 
articulation,  aggregation,  and  the  fulfillment  of 
demands  as  a  continuous  process.  In  the  long  run, 
this  dynamic  process  leads,  in  the  view  of  the  pro¬ 
tagonists  of  such  concepts,  to  the  achievement  and 
maintenance  of  the  common  good  for  society. 
Pluralism  in  this  sense  can  be  defined  according  to 
Philippe  Schmitter  (1974)  as 

a  system  of  interest  representation  in  which  the 
constituent  units  are  organized  into  an  unspecified 
number  of  multiple,  voluntary,  competitive,  non- 
hierarchical,  ordered  and  self-determined  (as  to 
type  of  scope  of  interest)  categories  (and),  which 
are  not  subsidized,  specially  licensed,  created  or 
otherwise  controlled  in  leadership  selection  or 
interest  articulation  by  the  state  and  which  do 
not  exercise  a  monopoly  of  representational 
activity  within  their  respective  categories,  (p.  96) 

On  the  one  hand,  Schmitter  thus  emphasizes  the 
structure  of  interest  intermediation  determined  by 
competition.  On  the  other  hand,  he  stresses  the 
fact  that  such  actors  are  neither  supported  nor 
controlled  by  the  state.  Pluralistic  interest  interme¬ 
diation  is,  therefore,  characterized  by  the  volun¬ 
tarism,  spontaneity,  and  autonomy  of  the  interest 
groups.  Unlike  corporatism,  pluralist  interest  inter¬ 
mediation  is  not  based  on  the  agreed-on  forms  of 
negotiation  between  interest  groups  and  the  polit¬ 
ical-administrative  bodies  but  rather  on  competi¬ 
tion  and  conflict.  From  a  pluralistic  point  of  view, 
politics  can  therefore  be  regarded  as  resulting  from 
social  interests  and  the  respective  power  of  the 
mediating  interest  groups.  In  this  process,  the 
actors  can  compete  relatively  autonomously  for 
the  enforcement  of  their  interests  and  take  advan¬ 
tage  of  the  resources  available  to  them. 
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In  this  respect,  interests  that  are  capable  of  deal¬ 
ing  with  conflict  and  easy  to  organize  have  an 
advantage.  This  in  turn  leads  to  a  decisive  problem 
from  the  point  of  view  of  a  normative  theory  of 
democracy:  Interests  that  are  difficult  to  organize 
and  the  interests  of  groups  of  lower  significance 
have  only  a  small  chance  of  representation  and 
intermediation.  So,  due  to  real  existing  pluralistic 
structures,  capital  and  labor  interests  are  favored 
in  particular,  and  interests  are  not  represented 
according  to  their  importance  for  society  as  a 
whole.  As  opposed  to  the  thesis  of  balanced  inter¬ 
est  implementation  as  a  basis  of  the  pluralist  the¬ 
ory  of  democracy  proposed  by  Robert  Dahl,  in 
particular,  the  fact  remains  that  representation  of 
interests  in  modern  democracies  is  not  balanced — 
and  cannot  be  balanced  due  to  their  complexity 
and  variety.  Thus,  it  is  not  possible  in  the  process 
of  collective  decision  making  for  every  citizen  to 
have  the  same  opportunity  to  express  his  or  her 
interests.  Such  a  representation  (as  complete  as  is 
possible)  of  interests  of  different  social  groups  is, 
however,  a  condition  of  pluralist  interest  interme¬ 
diation.  Considerable  legitimization  problems  can, 
therefore,  result  from  the  unbalanced  interest  rep¬ 
resentation  of  pluralistic  structures.  So  the  possi¬ 
bility  exists  that,  on  the  one  hand,  the  results  of 
political  decision  making  are  no  longer  accepted 
and,  on  the  other,  that  this  disappointment  can 
lead  to  the  refusal  of  established  forms  of  partici¬ 
pation  and  intermediation  mechanisms. 

Here,  it  must  be  considered  that  the  greater  suc¬ 
cess  of  interests  that  are  well  organized  is  not  neces¬ 
sarily  a  dysfunction  of  the  political  system  but  is 
rather  the  product  of  a  pluralistic  competitive 
democracy.  The  political  administrative  addressees 
of  pluralistic  interest  intermediation  have  them¬ 
selves  the  ability  to  guarantee,  however,  by  means 
of  law,  a  minimum  of  fairness  in  this  competition 
for  influence.  Conditions  of  action  for  interest 
groups  can  be  legally  regulated,  for  instance,  in  the 
procedure  of  consultation  in  Switzerland.  This  pro¬ 
cedure  is  part  of  the  legislative  process  in  which  all 
important  interest  groups  are  to  be  consulted  inde¬ 
pendently  of  their  negotiating  power. 

For  functional  reasons,  a  corporate  interest 
intermediation  also  can  fulfill  only  a  selective  con¬ 
sideration  of  interests  and  thus  perform  only  an 
inadequate  representation.  Here,  the  negotiation 


process  is  also  determined  by  those  interest  groups 
having  the  strongest  influence.  The  difference  lies 
only  in  the  mutual  cooperation  of  interest  organi¬ 
zations  with  the  political-administrative  actors. 
The  problem  of  underrepresentation  of  certain 
interests,  therefore,  remains  not  only  unsolved 
in  this  way  but  would  moreover  become  firmly 
institutionalized. 

The  difference  with  regard  to  the  pluralist  inter¬ 
est  intermediation  lies  rather  in  the  enforcement  of 
interests  between  the  actors — which  has  less  poten¬ 
tial  for  conflict  due  to  lack  of  competition.  Because 
of  this,  an  integrative  effect  for  society  can  also  be 
achieved  by  corporate  interest  intermediation, 
which  is  not  subject  to  majority  decisions.  Further¬ 
more,  this  mode  of  interest  intermediation  suc¬ 
ceeds  in  providing  a  peaceful  solution  to  class 
conflicts  via  collective  negotiations  between  the 
state,  employers,  and  trade  unions.  By  contrast, 
the  sluggishness,  inflexibility,  and  status  quo  ori¬ 
entation  of  corporate  interest  intermediation  is 
often  criticized  since  it  can  lead  to  a  blocking  of 
reforms  with  many  veto  players.  Corporate  struc¬ 
tures  are,  however,  only  partially  possible  in  situ¬ 
ations  where  social  interests  can  be  integrated  on 
a  high  organizational  level.  As  a  result,  the  avail¬ 
able  corporate  structures  are  often  part  of  a  more 
comprehensive  pluralist  system  of  mediation.  In 
the  literature,  pluralism  is  usually  regarded  as  a 
strategy  superior  to  corporatism  for  the  overcom¬ 
ing  of  socioeconomic  crises  and  as  a  relative  stabil¬ 
ity  guarantee  for  democratic  governments. 

Since  pluralism  and  corporatism  are  not  only 
procedural  strategies  but  are  integrated  into  a 
country’s  political  culture  via  the  context  of  ideo¬ 
logical  acceptance,  a  concrete  form  of  interest 
intermediation  cannot  be  imposed  without  consid¬ 
eration  of  the  political  culture.  Yet  the  guiding 
values  of  compromise  and  consensus  anchored  in 
the  political  culture,  and  thus  also  in  political 
socialization,  further  complicate  interest  interme¬ 
diation  in  the  sense  of  pluralistic  competition. 

Ulrich  von  Alemann 
University  of  Diisseldorf 
Diisseldorf,  Germany 
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The  police  is  the  institution  in  charge  of  protecting 
public  order  and  repressing  crime,  and  it  is  enti¬ 
tled  to  use  physical  force  in  order  to  meet  these 
functions.  Police  institutions  are  the  product  of  a 
long-term  historical  trend  toward  differentiation 
and  specialization.  Before  the  beginning  of  the 
19th  century,  differentiation  as  a  contemporary 
function  of  policing  (e.g.,  keeping  peace  and  com¬ 
bating  crime  and  disorder)  was  in  the  hands  of 
watchmen  appointed  from  households  or  by  fam¬ 
ily  or  tribal  constables.  This  “private”  policing 
occurred  in  several  areas  (e.g.,  China,  main  parts 
of  Africa,  and  South  America)  before  the  Europe- 
led  colonization  process.  Autonomous,  profes¬ 
sional,  permanent — and  in  most  cases  local — 
police  forces  arose  in  the  wake  of  urban  growth 
and  urban  disorders,  industrial  disputes  and  riots, 
and  crime  fears.  This  institutionalization  process 


accompanies  a  specialization  process  in  the  coun¬ 
tries  of  continental  Europe,  where  police  imposed 
an  ensemble  of  norms,  rules,  and  measures  to 
regulate  the  entire  urban  existence,  from  public 
hygiene  to  passport  control,  from  milk  inspection 
to  supervision  of  libraries,  and  so  on.  If  not  in  the 
hands  of  watchmen  or  private  forces,  such  as  dock 
bosses’  militia,  the  control  of  crime  and  disorder 
was  only  a  small  part  of  the  duties  of  what  then 
constituted  the  police.  During  the  19th  century  in 
continental  Europe,  police  could  specialize  only  in 
these  functions  because  administrative  law  and 
jurisdiction  regulated  other  aspects  of  social  life  at 
that  time.  This  entry  discusses  the  main  distinctive 
features  of  police,  especially  in  connection  with 
the  use  of  force;  the  trend  toward  specialization, 
especially  with  relation  to  efficiency;  some  of  the 
most  recent  transformations;  and  the  organiza¬ 
tional  differences  between  a  central  organization 
and  a  local  organization. 

Key  Distinctive  Features 

From  a  more  theoretical  point  of  view,  the  police 
institution  has  two  distinctive  features.  First,  it  is 
wholly  instrumental:  It  is  meant  to  perform  a  defi¬ 
nite  task,  linked  to  crime  and  disorder,  under  civil 
supervision  (i.e.,  the  functional  dimension  of  the 
police).  Second,  it  is  defined  by  its  capacity  to  use 
physical  force  (i.e.,  the  substantive  dimension  of 
the  police).  A  crucial  dimension  of  police  relies 
therefore  on  its  intimate  relationship  with  the 
monopoly  on  physical  force  that  Max  Weber  sees 
as  the  distinctive  feature  of  the  state.  But  this  cru¬ 
cial  dimension  is  at  the  same  time  oxymoronic, 
since  (contrary  to  the  military  use  of  force),  police 
force  is  expected  to  be  legitimate  (i.e.,  reasonable, 
proportionate,  and  based  on  consent).  Robert 
Reiner  (2000)  asserts  that,  with  respect  to  police 
action,  the  two  terms  consent  and  force  are  antag¬ 
onists.  The  police  can  use  force  either  in  the  con¬ 
text  of  a  local  breach  of  order  (a  contested  arrest, 
an  unlawful  strike,  or  a  riot)  or  in  the  context  of  a 
broader  social  or  political  breakdown  resulting  in 
great  hostility  against  the  regime  in  place.  However, 
police  legitimacy  relies  on  consent  from  the  public 
and/or  the  ruling  regime. 

On  a  more  sociological  level,  the  substantial 
affinity  between  force  and  the  police  has  been 
questioned.  Empirical  evidence  clearly  shows  that 
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the  actual  use  of  force  is  rather  rare.  Some  police 
officers  never  use  physical  force  throughout  their 
career.  On  the  other  hand,  trying  to  list  and  ratio¬ 
nalize  all  the  tasks  ever  performed  by  police  offi¬ 
cers  is  nearly  impossible.  Therefore,  some  theorists 
like  John-Paul  Brodeur  (2007)  are  inclined  to 
define  the  police  as  the  only  public  organization 
aimed  at  “doing  everything.”  To  do  so,  they  use 
means  that  are  illegal  or  unlawful  if  taken  by  ordi¬ 
nary  citizens  (from  driving  the  wrong  way  on  a 
one-way  street  to  bugging  phone  devices). 

Nevertheless,  use  of  physical  force  is  a  central 
issue  in  police  organizations  because  cases  of  abuse 
of  force  weaken  the  political  legitimacy  of  the 
police.  The  beating  of  the  African  American 
motorist  Rodney  King  in  Los  Angeles  in  1991  by 
White  police  officers  (and  their  acquittal  on 
charges  of  excessive  force  1  year  later)  led  to  urban 
riots  in  the  United  States.  The  death  of  two  minor¬ 
ity  youths  in  France  in  2005,  who  allegedly  fled 
from  the  police  into  a  power  substation,  where 
they  were  accidentally  electrocuted,  led  to  a 
3-week-long  period  of  urban  unrest.  Abusive 
arrests  and  stops-and-searches  in  London,  Birming¬ 
ham,  and  Liverpool  also  led  to  urban  riots  in 
British  cities  from  the  end  of  the  1970s  to  the  mid- 
1980s.  It  is  difficult  to  assess  if  substantial  changes 
occurred  as  a  result  of  these  episodes  of  scandal 
and  protest.  Nevertheless,  police  organizations 
faced  a  legitimation  crisis  in  the  course  of  the 
1970s,  due  to  the  emergence  of  urban  disorders 
and/or  race  riots  (in  the  United  States,  England, 
and  France),  on  the  one  hand,  and  to  the  rise  of 
property  crime,  on  the  other. 

Specialization  Versus  Efficiency? 

Contemporary  evolutions  of  police  organizations 
contribute  to  the  empirical  and  theoretical  dispute 
over  the  accurate  definition  of  police.  First,  from 
the  initial  wave  of  modernization  and  profession¬ 
alization  (from  the  early  1970s  onward),  police 
institutions  have  experienced  both  an  increased 
specialization  and  a  commodification  of  the  ser¬ 
vices  they  are  expected  to  deliver.  Both  phenomena 
are  closely  linked  with  a  legitimation  crisis  that 
occurred  during  the  1970s.  Overspecialization  has 
been  implemented  to  cope  with  different  aspects  of 
crime  and  delinquency,  leading  police  organiza¬ 
tions  to  become  an  archipelago  of  diverse  and 


isolated  occupations  (from  drug  squads  to  envi¬ 
ronmental  crime  enforcement  and  from  juvenile 
delinquency  units  to  transnational  police  experts). 
The  increased  use  of  surveillance  devices  and  tech¬ 
nologies  has  played  a  major  role  in  this  process. 
This  evolution  toward  more  segmentation  into 
specialized  operations  and  the  increased  impor¬ 
tance  of  surveillance  and  proactive  activities  tend 
to  make  physical  force  a  secondary  if  not  purely 
metaphorical  aspect  of  police  work.  Some  analysts 
have  underlined  this  phenomenon  in  stating  that 
police  are  not  violence  workers  but  “knowledge 
workers,”  noting  the  massive  amount  of  data  and 
information  police  officers  are  expected  to  sort  out 
and  analyze  in  order  to  conduct  their  tasks. 

This  trend  toward  an  overspecialization,  specifi¬ 
cally  in  the  most  diverse  domains  of  crime  fighting, 
has  not  prevented  a  decline  of  police  efficiency. 
Therefore,  police  organizations  have  communalized 
a  certain  amount  of  tasks,  the  implementation  of 
which  may  differ  among  nations.  Since  the  1990s, 
there  has  been  a  trend  toward  a  kind  of  network 
policing  in  which  diverse  private  and  public  institu¬ 
tions  or  agencies  are  brought  together  to  combat 
and  prevent  crime  and  urban  disorder.  Police  offi¬ 
cers  are  now  civil  servants  confronted  with  private 
and  public  agents  in  different  kinds  of  negotiation 
arenas.  Several  labels  have  been  employed  to 
describe  these  evolutions.  The  notion  of  “problem¬ 
solving  policing”  was  first  introduced  at  the  begin¬ 
ning  of  the  1980s  to  encourage  the  police  to  tackle 
daily  local  problems  such  as  loitering  or  drinking  in 
public  spaces,  domestic  violence,  noise  distur¬ 
bances,  and  so  on.  Thus,  police  institutions  began 
to  focus  on  broader  social  issues  rather  than  solely 
on  major  crimes.  Facing  the  difficulty  of  imple¬ 
menting  internal  police  reforms,  local  or  (more 
rarely)  central  governments  developed  programs 
such  as  community  policing  and  neighborhood 
policing,  aimed  at  integrating  police  organizations 
into  larger  institutional  structures  and,  therefore,  at 
resisting  police  organizations’  tendency  toward 
insularity  and  self-agenda.  Moreover,  the  increasing 
pluralization  of  policing  (the  growth  of  private 
security  forces,  informal  policing,  vigilantism,  the 
introduction  of  local  police  forces  competing  with 
state  police  forces,  etc.)  threatens  the  monopoly  of 
the  police  in  the  prevention  and  repression  of  disor¬ 
der.  Network  policing  programs,  such  as  commu¬ 
nity  policing,  provide  an  opportunity  for  the  police 
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to  become  an  integral  part  of  the  larger  institu¬ 
tional  structures  and  to  work  collaboratively 
within  these  structures  to  combat  crime. 

Recent  Transformations 

Other  transformations  have  cropped  up  in  the 
course  of  these  evolutions.  The  first  has  been  the 
introduction  of  regulations  and  guidelines  issued 
from  the  new  public  management  perspective  into 
police  organizations  during  the  1990s,  in  the  wake 
of  the  broader  public  policy  reforms  initiated  by  the 
Clinton  administration  in  the  United  States  and  the 
government  of  Tony  Blair  in  the  United  Kingdom. 
A  major  focus  has  been  control  over  police  officers, 
the  stress  of  individual  accountability,  and  the  dif¬ 
fusion  of  “better  with  less”  management  policies. 
As  a  result,  police  chiefs  and  captains  devote  a 
major  part  of  their  time  to  crime  and  activity  statis¬ 
tics  to  comply  with  political  and  public  expecta¬ 
tions.  Such  management  systems,  whose  real  ability 
to  cope  with  crime  and  security  has  been  widely 
discussed  in  the  literature  on  policing,  have  been 
exported  throughout  the  world  and  adopted  by 
countries  and  cities  in  very  different  ways.  As  in 
other  agencies,  one  effect  of  this  management  sys¬ 
tem  is  the  development  of  strategies  that  may  lead 
to  an  increase  in  the  bureaucratic  insularity  of 
police  organizations  and  to  a  further  public  concern 
with  the  ability  of  police  organizations  to  cope  with 
real  crime. 

The  second  trend  is  the  rising  importance  of 
law-and-order  policies  in  the  political  agenda  of 
many  Western  countries.  Proving  one’s  ability  to 
deal  with  crime  and  lead  police  forces  tends  to  be 
a  major  personal  asset  in  the  political  game.  The 
fight  against  terrorism  and  international  crime  is 
vital  to  this  trend.  In  combination  with  the  gaming 
strategies  developed  within  police  organizations, 
this  tendency  sheds  light  on  the  fact  that  the  police 
have  a  symbolic  role  that  seems  more  and  more 
important  in  today’s  political  systems. 

Central  and  Local  Organization 

Police  are  in  fact  a  part  of  the  political  system,  in 
which  they  help  consolidate  the  political  legitimacy 
and  resources  of  the  (local  or  national)  govern¬ 
ment.  One  must  distinguish  here  between  regimes 
in  which  police  organizations  are  centralized  under 


the  direct  command  of  the  government  (as  in 
France)  and  countries  in  which  police  forces  are 
essentially  municipal  (as  in  United  States)  or 
regional  forces  (as  in  Germany  and  Great  Britain), 
with  some  specialized  forces  devoted  to  federal 
issues  (e.g.,  organized  crime,  terrorism,  and  intel¬ 
ligence).  Contrary  to  a  widespread  assumption  in 
political  science,  the  degree  of  centralization  of 
forces  is  not  correlated  to  the  degree  of  corruption 
of  the  police  by  political  forces.  The  Napoleonic 
model  in  France  offers  a  striking  example  of 
police  organizations  heading  numerous  and  opu¬ 
lent  political  intelligence  and  antiriot  sections 
under  the  immediate  leadership  of  the  Ministry  of 
Interior,  which  is  always  devoted  to  the  political 
tasks  of  repressing  protests,  preventing  public  dis¬ 
order,  and  influencing  or  shaping  the  political 
landscape.  Municipal  police  systems  in  the  United 
States,  on  the  other  hand,  are  characterized  by  a 
high  level  of  what  the  U.S.  National  Commission 
on  Law  Observance  and  Enforcement  (the 
Wickersham  Commission)  in  1931  called  “law¬ 
lessness  in  law  enforcement,”  because  such  police 
forces  are  considered  to  be  part  of  the  local  politi¬ 
cal  system  and  are  under  the  direction  of  the 
mayor  or  supervisors  of  the  municipality.  The 
wave  of  reform  of  police  forces  during  the  1950s 
resulted  in  a  high  level  of  “professionalism” — that 
is,  quasi-military  police  departments  with  high 
standards  of  integrity.  As  it  appears,  the  nature  of 
the  political  involvement  of  police  organizations  is 
strongly  linked  to  the  organization  of  the  political 
system. 

Fabien  Jobard 
Centre  de  Recherche  Sociologique  sur  le 
Droit  et  les  Institutions  Penales,  CNRS 
Guyancourt,  France 
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Social  constructivists  depart  from  the  assumption 
that  reality  is  what  Peter  Berger  and  Thomas 
Luckmann  call  social  reality.  To  say  that  reality  is 
socially  constructed  is  to  say  that  social  reality  is 
the  product  of  human  ways  of  knowing  and  com¬ 
municating.  Social  constructivists,  therefore,  draw 
attention  to  the  processes  and  ways  through  which 
the  world  is  represented  in  language.  They  feel  that 
the  access  we  have  to  a  reality  outside  human  sym¬ 
bol  systems  is  highly  problematic.  This  entry 
describes  the  relevance  of  this  perspective  for 
understanding  public  policy. 

Language,  to  take  the  most  obvious  symbol  sys¬ 
tem,  does  not  simply  report  facts.  Things  get  their 
meanings  through  language,  irrespective  of  whether 
they  exist  outside  language.  And  the  meaning  of 
anything  always  exists  in  particular  social  contexts; 
meaning  is  always  contextual,  contingent,  and  his¬ 
torical.  Since  human  beings  always  depend  on  their 
symbol  systems — and  the  theoretical  frames  they 
build  with  the  help  of  these — to  make  sense  of  the 
world  that  surrounds  them,  the  way  they  interact 
with  the  world  is  the  result  of  socialization  and, 
more  generally,  of  human  history.  We  act  toward 
things  on  the  basis  of  the  meaning  they  have  to  us. 
We  tend  to  forget  the  contingencies  of  the  social 
constructs  and  the  forces  that  shaped  them. 
Therefore,  constructivists  remind  us  that  social  real¬ 
ity  is  always  in  flux,  even  if  the  processes  through 
which  change  comes  about  might  be  slow  at  times. 

Kinds  of  Social  Constructivism 

Social  constructivist  thinking  started  from  the  soci¬ 
ology  of  knowledge  and  has  spread  through  the 
social  sciences.  Since  the  1960s,  it  has  flourished  and 
has  been  further  elaborated  on  with  the  help  of  a 
wide  variety  of  (sometimes  conflicting)  insights 


from — among  others — Ferdinand  de  Saussure’s  lin¬ 
guistics,  the  phenomenology  and  hermeneutics  of 
Edmund  Husserl  and  Hans-Georg  Gadamer,  the 
poststructuralist  thinking  of  Jacques  Derrida  and 
Michel  Foucault,  and  the  pragmatism  of  Richard 
Rorty.  There  are  “stronger”  and  “weaker”  forms  of 
constructivism,  differing  in  their  theories  on  realities 
beyond  the  social.  Strong  constructivism  claims  that 
a  reality  beyond  the  socially  constructed  ones  might 
exist  but  that  it  is  not  relevant  to  talk  about  it  in 
terms  of  an  objective  reality  because  we  cannot  have 
direct,  unmediated  access  to  it;  it  believes  in  “mul¬ 
tiple  truths.”  A  weaker  form  of  constructivism  holds 
on  to  the  idea  of  an  objective  reality  (of  brute  facts 
and/or  of  “one  truth”)  that  is  within  reach  of  human 
knowing.  It  is  important  to  keep  in  mind  that  it  is 
not  the  (material)  world,  as  such,  that  is  referred  to 
when  we  talk  about  social  constructions  but  the 
perceptions  of  this  (material)  world.  Material 
objects,  including  all  of  nature  and  human  artifacts, 
are  assigned  a  certain  place  in  the  social  process  of 
the  construction  of  meaning. 

Social  constructivism  can  also  be  situated  in 
terms  of  the  position  of  the  subject  who  knows 
and  the  object  that  can  be  known.  If  one  claims 
that  the  knower  and  knowledge  are  somehow 
separable,  one  takes  a  dualist  position.  Social  con¬ 
structivists  often  take  a  monistic  position.  That  is, 
they  argue  that  the  knower  and  the  world  that  the 
knower  tries  to  understand  are  part  of  the  same. 
Put  differently,  the  findings  are  the  result  of  an 
interaction  between  the  knower  and  what  there  is 
to  be  known.  Crucial  in  this  interaction  is  the  pro¬ 
cess  through  which  the  knower  crafts  observations 
into  meaningful  fact  with  the  help  of  a  theoretical 
framework.  This  leads  social  constructivists  to 
suggest  that  there  is  no  Archimedean  point,  “no 
view  from  nowhere,”  from  which  we  could  see  the 
essence  of  the  things  that  make  up  our  social  reali¬ 
ties.  We  can  only  see  reality  from  a  cultural,  social, 
political,  historical,  or  other  perspective.  This  is 
why  we  encounter  multiple  social  realities,  each  of 
which  is  created  and  sustained  with  the  help  of  a 
particular  set  of  values  and  shared  only  by  the 
members  of  a  certain  interpretive  community. 

Social  Constructivist  Research 

Various  kinds  of  constructivism  are  in  use  in  policy 
science.  The  most  well-known  are  frame  analysis, 
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narrative  analysis,  and  discourse  analysis.  Although 
the  levels  of  analysis  vary,  all  three  have  a  common 
focus  on  sense  making  and  a  special  interest  in 
language.  Their  aim  is  to  uncover,  understand,  and 
possibly  explain  sense-making  practices  (e.g.,  vot¬ 
ing  and  warfare),  processes  (e.g.,  crises),  and  insti¬ 
tutions  (e.g.,  agencies,  governments,  and  nations) 
in  political  life. 

Framing  is  the  effort  of  drawing  attention  to 
some  parts  of  situations  while  ignoring  others.  As 
such,  it  becomes  a  central  part  of  the  political  act 
of  problem  setting.  Problem  definitions  inevitably 
predispose  certain  solutions,  and  vice  versa. 
Narrative  analysis,  often  inspired  by  literary  the¬ 
ory  and  philosophers  like  Mark  Johnson  and 
Alasdair  MacIntyre  and  historians  like  William 
Whyte,  starts  from  the  idea  that  sense  making  is 
often  done  with  the  help  of  storytelling;  through 
stories,  people  enact  themselves  and  their  context. 
In  stories,  characters  who  engage  in  actions  in  a 
certain  setting  can  be  identified.  Some  characters 
might  be  given  the  role  of  villains,  while  others  are 
depicted  as  heroes,  victims,  or  innocent  bystand¬ 
ers.  At  the  same  time,  acts  are  attributed  to  human 
beings  or  other  entities.  The  depiction  of  the  rele¬ 
vant  setting,  for  instance,  a  crisis  situation,  also 
helps  shape  the  story.  The  way  the  various  ele¬ 
ments  of  a  story  are  ordered  creates  a  plot  that  can 
help  the  audience  grasp  what  is  going  on.  With  the 
help  of  frames  and  narratives,  political  actors 
describe  and  explain  reality  in  conflicting  ways, 
arguing  for  often  opposite  courses  of  action.  In 
policy  analysis,  scholars  like  Emery  Roe  and 
Michel  van  Eeten  designed  and  used  tools  (e.g., 
reframing  problems  and  creating  metanarratives) 
that  help  actors  in  the  policy  process  to  set  new 
agendas,  create  understanding  of  other  actors,  and 
overcome  conflicts  and  intractable  policy  issues. 

Discourse  analysts,  often  building  on  poststruc¬ 
turalist  efforts  such  as  that  of  Foucault  or  on  post- 
Marxism  versions  developed  by  Ernesto  Laclau 
and  Chantal  Mouffe,  have  engaged  in  critical 
analyses  of  political  practices,  often  at  a  more 
abstract  level  than  those  using  frames  or  narrative. 
A  discourse  can  be  defined  as  a  set  of  ideas,  con¬ 
cepts,  or  categories  that  is  produced  in  certain 
social  practices.  A  discourse  analysis  shows  which 
discursive  objects  and  subjects  emerge  in  social 
practices  and  which  conceptualizations  are  used. 
Consequently,  what  is  left  out  in  social  practices 


also  emerges.  Since  it  is  not  the  purpose  of  dis¬ 
course  analysis  to  retrieve  what  actors  exactly 
meant  or  felt  when  writing  or  speaking  but  more  to 
focus  on  the  effects  of  texts  on  other  texts,  there  is 
usually  not  much  space  for  agency;  this  is  called 
decentering  of  the  subject.  Many  discourse  analy¬ 
ses  are  historical,  like  Foucault’s  archaeological 
and  genealogical  studies.  There  are  also  social  con¬ 
structivists  who  have  worked  on  metaphor  analysis 
and  category  analysis — for  instance,  in  the  decon¬ 
struction  of  ethnicity  categories  in  policy  making. 

Social  Constructivist  Policy  Making 

Social  constructivism,  nowadays,  plays  a  role  in 
various  social  science  disciplines,  such  as  political 
science,  public  administration,  and  policy  studies.  In 
public  administration,  it  was  fueled  by  postmodern¬ 
ism,  which  was  introduced  in  1988  by  a  group  of 
scholars  who  were  members  of  Public  Administration 
Theory  Network  (PAT-Net)  and  published  in  the 
journal  Administrative  Theory  and  Praxis.  Feminist 
thinking  has  also  been  a  strong  stand  in  constructiv¬ 
ist  thinking.  The  rise  of  social  constructivism  in  the 
policy  sciences  can  be  traced  to  roughly  the  same 
period.  While  Egon  Guba  and  Yvonna  Lincoln 
introduced  a  constructivist  approach  to  evaluation, 
Deborah  Stone  and  Giandomenico  Majone  kicked 
off  the  “argumentative  turn”  in  policy  sciences  in 
their  books  on  the  role  of  rhetoric  in  policy.  This 
perspective  was  further  elaborated  on  in  a  volume 
edited  by  Frank  Fischer  and  John  Forester  and  has 
been  developing  ever  since. 

But  how  does  one  put  a  social  constructivist 
view  on  policy  making  into  practice?  John  Dryzek 
was  one  of  the  first  to  answer  that  question.  In  a 
seminal  contribution  to  the  field,  he  opposed 
instrumental  rationality  (i.e.,  the  belief  that  it  is 
possible  to  find  the  right  means  to  the  just  ends) 
and  the  related  objectivism  (i.e.,  the  belief  that  neu¬ 
tral  standards  exist  for  doing  so).  Instrumental 
rationality  and  objectivism  not  only  lead  to  Max 
Weber’s  iron  cage  and  Jurgen  Habermas’s  coloni¬ 
zation  of  the  lifeworld  but  are  also  unable  to  supply 
usable  policy  analyses,  and  they  actually  obstruct 
progress  in  the  social  sciences,  in  Dryzek’s  view. 
The  alternative  method  of  policy  making  is  what 
Dryzek  calls  discursive  and  democratic  rationality 
based  on  practical  reason  instead  of  on  power. 
Participatory  democracy  should  make  this  possible. 
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Anne  Schneider  and  Helen  Ingram  think  that 
“degenerate  policy  making”  is  the  root  of  all  evil. 
Degenerate  policy  making  is  the  manipulation  of 
images  for  the  politician’s  personal  interests.  It 
leads  to  the  construction  and  punishing/rewarding 
of  ingroups  and  outgroups  and  to  the  abuse  of  sci¬ 
ence  for  political  reasons.  Invoking  science  also 
leads  to  apathy  and  alienation  among  the  citizens, 
as  the  experts  come  to  monopolize  the  debate. 
Schneider  and  Ingram’s  own  approach,  which  they 
called  policy  design,  is  based  on  seven  principles 
that  should  mend  these  faults. 

Also  concerned  with  the  dysfunctional  role  of 
the  expert  in  the  policy  process  is  Fischer.  Based 
on  John  Dewey’s  argument,  Fischer  argues  that 
the  role  of  the  expert  is  to  help  the  citizen  decide 
and  not  to  decide  in  his  or  her  place.  The  expert 
should  try  to  design  deliberative  forums  in  which 
citizens,  with  the  help  of  experts,  can  participate 
in  constructing  policy.  More  recent  contribu¬ 
tions  to  deliberative  policy  analysis  argue  for  a 
“decentered  governance”  by  networks  of  stake¬ 
holders.  Those  engaged  in  this  project  strive  to 
map  such  policy  making  in  all  its  varieties,  ana¬ 
lyzing  it  without  looking  for  general  laws  or 
essences. 
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Discourse  analysis  draws  on  central  insights  from 
post-Marxism,  poststructuralism,  and  radical  ver¬ 
sions  of  interpretative  analysis  and  pragmatist  phi¬ 
losophy.  Discourse  analysis  aims  to  transcend  the 
objectivist,  reductionist,  and  rationalist  bias  in 
mainstream  social  science  theory  by  emphasizing 
the  role  of  discourse  in  shaping  social,  political, 
and  economic  relations.  Discourse  is  commonly 
defined  as  a  more  or  less  stable  and  well-defined 
system  of  rules,  norms,  and  significant  differences 
that  is  produced  in  and  through  political  struggles 
and  provides  a  contingent  horizon  for  the  construc¬ 
tion  of  any  meaningful  subject,  object,  or  action. 
Many  discourse  theorists  have  been  preoccupied 
with  developing  a  sophisticated  theoretical  frame¬ 
work,  but  a  growing  number  of  discourse  analysts 
have  become  interested  in  applying  the  concepts 
and  arguments  of  discourse  theory  in  problem- 
driven  empirical  analysis,  and  the  study  of  policy 
and  policy  making  is  at  the  top  of  the  agenda. 

A  discourse  approach  to  policy  analysis  can  help 
us  better  understand  the  discursive  conditions  of 
possibility  for  formulating  particular  policies,  and 
it  draws  our  attention  to  the  role  of  identity  con¬ 
struction  and  the  negotiation  of  meaning  in  policy 
implementation.  In  contrast  to  mainstream  policy 
analysis,  discourse  analysis  highlights  the  semantic, 
performative,  and  rhetorical  aspects  of  public 
policy  making.  Moreover,  it  insists  that  policy 
problems,  policy  solutions,  and  governmental  ratio¬ 
nalities  are  discursively  constructed  and  therefore 
contingent.  Finally,  it  aims  to  uncover  the  power 
struggles  and  political  conflicts  that  shape  the  dis¬ 
cursive  conditions  for  the  formulation  and  imple¬ 
mentation  of  public  policy.  Whereas  discourse 
models  inspired  by  the  works  of  the  German  phi¬ 
losopher  Jurgen  Habermas  tend  to  focus  on  the  role 
of  public  deliberation  and  tend  to  see  power  and 
conflict  as  a  source  of  distorted  communication, 
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discourse  models  inspired  by  the  French  philoso¬ 
pher  Michel  Foucault  tend  to  view  power  and  con¬ 
flicts  as  constitutive  of  policy  discourse. 

Traditional  approaches  to  policy  analysis  hold 
that  interests  and  institutional  constraints  pro¬ 
vide  the  key  to  understanding  the  formulation 
and  implementation  of  policy.  As  such,  it  is  often 
assumed  that  policy  can  be  explained  in  terms  of 
the  interests  of  rational  and  resourceful  actors 
and  the  institutional  rules,  norms,  and  procedures 
that  constrain  the  choice  of  rational  actors. 
Discourse  models  of  policy  take  a  different 
approach  in  arguing  that  interests  are  incomplete, 
ambiguous,  and  shaped  by  the  contingent  dis¬ 
courses  in  which  they  are  embedded.  Likewise, 
discourse  models  tend  to  define  institutions  not 
only  as  a  recursively  validated  system  of  regula¬ 
tive  and  normative  constraints  but  also  as  a  broad 
set  of  values,  symbols,  rituals,  forms  of  knowl¬ 
edge,  codes,  and  vocabularies  that  facilitate  and 
guide  action.  This  means  that  interests  cannot  be 
taken  for  granted  and  that  institutions  are  not 
only  staging  the  choice  and  interaction  of  the  rel¬ 
evant  policy  actors  but  also  seem  to  be  scripting 
their  actions. 

There  are  several  important  precursors  to  the 
development  of  a  constructivist  discourse  approach 
to  policy  analysis.  Some  rational  choice  theorists 
have  relaxed  the  classical  assumptions  about  full 
information  and  unlimited  cognitive  capacities  and 
have  elaborated  the  traditional  model  of  individual 
choice  subject  to  institutional  constraints.  Douglas 
North,  for  example,  has  introduced  the  notion  of 
“mental  models”  to  explain  the  choice  and  action 
of  the  entrepreneurs  who  are  responsible  for  con¬ 
structing  and  changing  institutions.  The  feasible 
options  and  their  payoffs  are  determined  not  only 
by  institutional  rules  and  norms  but  also  by  the 
subjective  perceptions  (mental  models)  of  the  insti¬ 
tutional  entrepreneurs.  Mental  models  are  acquired 
through  processes  of  socialization  and  learning, 
and  they  help  rational  actors  interpret  and  explain 
their  changing  environment.  However,  the  relation 
between  the  objective  interest  of  rational  actors 
and  their  subjective  perceptions  remains  unclear, 
and  if  the  latter  is  constitutive  of  the  former,  it 
becomes  difficult  to  maintain  the  basic  assumption 
that  choice  is  derived  from  interests.  Moreover,  if 
the  mental  models  are  shaped  by  collective  ideas 
and  ideologies,  the  methodological  individualism 


at  the  heart  of  rational  choice  institutionalism  is 
undermined. 

While  most  rational  choice  institutionalists  are 
reluctant  to  embrace  the  idea  of  objective  interests 
being  shaped  by  subjective  perceptions,  Paul 
Sabatier  (1988)  goes  all  the  way  in  viewing  policy 
output  as  a  result  of  political  decisions  informed 
by  competing  advocacy  coalitions  that  are  com¬ 
posed  of  public  and  private  actors  sharing  a  set  of 
normative  and  causal  beliefs.  The  value  priorities 
and  causal  assumptions  that  unify  a  particular 
advocacy  coalition  are  transformed  by  policy 
learning,  which  is  instrumental  for  generating  new 
insights  that  will  only  be  incorporated  into  the 
core  beliefs  of  an  advocacy  coalition  if  they  further 
its  main  policy  objectives.  Although  the  notion  of 
advocacy  coalitions  clearly  emphasizes  the  role  of 
ideas  and  beliefs  in  interactive  policy  making,  there 
is  hardly  any  account  of  the  discursive  construc¬ 
tion  of  the  ideas  and  beliefs.  Hence,  there  is  no 
explanation  of  what  regulates  the  production  and 
advancement  of  particular  ideas  and  beliefs. 

Historical  institutionalists  have  also  emphasized 
the  role  of  ideas  in  public  policy.  Ideas  tend  to 
replace  interests  as  drivers  of  political  action,  and 
institutions  are  seen  as  a  constraint  on  the  impact 
of  ideas  on  policy  making.  New  ideas  are  adopted 
to  the  extent  that  they  respond  to  concrete  policy 
problems,  resonate  with  the  interest  and  ideas  of 
key  actors,  and  are  brought  to  the  attention  of 
relevant  public  agencies  that  can  implement  new 
ideas.  As  such,  the  reception  of  new  ideas  is 
explained  with  reference  to  objective  socioeco¬ 
nomic  and  sociopolitical  conditions.  However,  the 
discursive  embeddedness  of  new  policy  ideas,  the 
struggles  about  the  content  of  the  new  ideas,  and 
their  rhetorical  and  argumentative  framing  are  not 
really  taken  into  account. 

Sociological,  or  organizational,  institutionalism 
breaks  with  the  idea  that  social  and  political  action 
is  driven  by  the  actors’  calculation  of  the  conse¬ 
quences  of  alternative  options  and  offers  a  cultural 
approach  that  emphasizes  the  role  of  normative 
and  cognitive  frames  and  meaning  systems  in  defin¬ 
ing  appropriate  action.  Policy  change  is  seen  as  a 
result  of  a  bricolage,  through  which  concepts,  ritu¬ 
als,  scripts,  and  values  are  recombined.  In  the  devel¬ 
opment  of  new  policies,  legitimacy  plays  a  pivotal 
role.  The  basic  assertion  that  interests,  rationality, 
and  information  are  socially  constructed  within  a 
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particular  institutional  context  is  shared  with  the 
new  discourse  models  of  policy,  but  a  major  differ¬ 
ence  is  that  whereas  sociological  institutionalism 
tends  to  treat  cultural  conditions  of  action  as  a 
static  equilibrium,  the  discourse  approach  to  pol¬ 
icy  analysis  is  more  interested  in  accounting  for  the 
formation  and  transformation  of  discursive  struc¬ 
tures  through  conflicts. 

Ideational  institutionalism  is  a  new  branch  of 
institutionalism  that  focuses  on  the  substantive  con¬ 
tent  of  ideas  and  the  interactive  processes  through 
which  these  ideas  are  generated  and  communicated 
to  the  public  (Colin  Hay,  2001).  Rather  than  seeing 
ideas  as  constituted  by  specific  cultures,  ideational 
institutionalists  view  ideas  in  relation  to  the  dis¬ 
courses  that  provide  the  resources  for  legitimizing 
particular  ideas.  Ideas  are  shaped  by  discourses, 
which  are  in  turn  shaped  by  institutions  that  deter¬ 
mine  who  talks  to  whom  about  what,  when,  and 
where.  In  state-centered  polities,  ideas  and  dis¬ 
courses  are  communicated  to  the  public  to  secure 
support,  whereas  in  decentered  polities  the  relevant 
actors  must  first  establish  consensus  about  the  con¬ 
tent  of  the  policy  discourse  and  its  embedded  ideas. 
By  placing  ideas  within  a  broader  discourse,  ide¬ 
ational  institutionalism  provides  a  crucial  stepping 
stone  to  discourse  models  of  policy.  However,  fore¬ 
grounding  singular  ideas  and  the  conception  of 
institutions  as  an  extradiscursive  constraint  sepa¬ 
rates  ideational  institutionalism  from  a  discourse 
theoretical  approach  that  focuses  on  the  impact  of 
discourses  constructed  through  hegemonic  strug¬ 
gles,  which  take  place  in  a  sedimented  institutional 
context  that  is  dislocated  by  new  events. 

Discourse  models  of  policy  focus  on  the  struc¬ 
tural  power  exercised  when  the  formulation  and 
implementation  of  policy  are  conditioned  by  hege¬ 
monic  discourses  that  are  shaped  and  contested  by 
political  strategies.  Foucault’s  governmentality  the¬ 
ory  provides  a  good  starting  point  for  analyzing  the 
discursive  conditions  for  policy  making.  Govern¬ 
mentality  theory  aims  to  answer  the  key  question  of 
“how  to  govern  and  be  governed”  by  studying  how 
government,  in  the  sense  of  the  “the  conduct  of 
conduct,”  has  been  problematized  in  modern  soci¬ 
ety  and  how  successive  problematizations  have 
transformed  the  collective  and  institutionalized 
mentalities,  rationalities,  and  technologies.  Govern¬ 
mentality  defines  a  particular  “art  of  government” 
that  conditions  the  concrete  “acts  of  government.” 


The  object  of  regulation,  the  regulating  and  regu¬ 
lated  subjectivities,  the  implicit  tel  os  of  regulation, 
and  the  available  governance  techniques  are  formed 
by  the  ruling  governmentality  (Mitchell  Dean, 
1999).  Foucauldians  use  the  archaeological  method 
to  analyze  governmental  discourse  and  deploy  the 
genealogical  method  to  analyze  how  governmen- 
talities  are  constructed  in  and  through  myriad 
struggles  that  include  and  exclude  particular  con¬ 
cepts,  identities,  and  practices. 

Another  important  discourse  approach  to  pol¬ 
icy  analysis  is  provided  by  critical  discourse  analy¬ 
sis  (CDA),  which  has  been  used  in  studies  of  policy 
responses  to  globalization  and  the  politics  of  New 
Labour  in  Britain.  According  to  Norman  Fairclough 
(2001),  CDA  aims  to  address  social  and  political 
issues  and  problems  by  conducting  a  close  study  of 
texts,  interactions,  and  other  forms  of  semiotic 
material  in  order  to  detect  the  changes  taking  place 
around  key  issues.  CDA  emphasizes  the  role  and 
impact  of  ideology  in  policy  making.  Ideology  is 
not  defined  in  terms  of  the  comprehensive  and 
coherent  worldviews  but  in  terms  of  a  discursive 
naturalization  of  contingently  constructed  mean¬ 
ings  and  identities.  Particular  groups  and  elites 
may  have  an  interest  in  reproducing  and  propagat¬ 
ing  particular  ideologies  as  they  might  help  them 
maintain,  or  even  enhance,  their  political  power. 

Whereas  Foucault’s  governmentality  theory  and 
Fairclough’s  ideology  analysis  study  affect  the  dis¬ 
cursive  contexts  of  policies,  the  theories  associated 
with  the  “argumentative  turn”  in  policy  analysis 
focus  on  discursive  strategies  that  shape  the  form 
and  content  of  public  policy  and  guide  its  imple¬ 
mentation.  Discourse  frames  policy  problems  and 
policy  solutions,  and  the  political  actors  struggle  to 
include  and  exclude  particular  themes,  concepts, 
and  ideas.  In  the  discursive  battle,  institutionally 
situated  actors  advance  particular  arguments  and 
aim  for  acceptance  of  these  arguments  from  other 
actors  by  mobilizing  their  ability  to  argue  a  persua¬ 
sive  case  (logos),  their  reputation  and  credibility  as 
speakers  (ethos),  and  the  rhetorical  skills  that 
arouse  particular  emotions  (pathos).  A  hegemonic 
policy  discourse  can  structure  the  context  in  which 
particular  phenomena  are  understood  and  defined 
(“discourse  structuration”).  Over  time,  the  hege¬ 
monic  discourse  may  become  sedimented  into  a  set 
of  concepts  and  organizational  practices  that  are 
taken  for  granted  by  social  and  political  actors 
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(“discourse  institutionalization”).  The  crux  of  the 
argument  is  that  hegemonic  discourses  often  hinge 
on  the  construction  of  a  story  line  that  provides  a 
short,  condensed,  and  often  metaphorical  expres¬ 
sion  of  how  the  policy  discourse  defines  problems 
and  solutions.  A  story  line  simplifies  the  discursive 
space  and  helps  bring  about  discursive  closure. 
Social  and  political  actors  who  subscribe  to  a  par¬ 
ticular  story  line  form  a  discourse  coalition. 
Adherence  to  a  particular  story  line  tends  to  facili¬ 
tate  mutual  learning  and  compromises  while 
excluding  competing  problem  definitions  and 
alternative  policy  solutions. 

Poststructuralist  discourse  theory,  synthesized 
by  Ernesto  Laclau  and  Chantal  Mouffe  and  sum¬ 
marized  by  Jacob  Torfing  (1999),  offers  a  concep¬ 
tual  and  argumentative  framework  that  further 
develops  the  ideas  implicit  to  the  argumentative 
turn.  First,  the  notion  of  “floating  signifiers”  helps 
analyze  instances  of  conceptual  ambiguity  where 
an  expression  has  different  contents  and  meanings 
and  therefore  enables  policy  actors  to  use  the  same 
vocabulary  to  talk  about  different  things.  Second, 
the  notion  of  “articulation”  provides  an  under¬ 
standing  of  how  meaning  is  partially  fixed  within 
discourse  through  practices  that  establish  a  rela¬ 
tion  between  elements  such  that  their  identity  is 
mutually  modified.  Finally,  the  concept  of  “social 
antagonism”  identifies  how  policy  discourses  are 
unified  with  reference  to  a  threatening  otherness 
that  both  tends  to  stabilize  the  discursive  system 
and  provides  a  source  of  potential  disruption. 

Jacob  Torfing 
Roskilde  University 
Roskilde,  Denmark 

See  also  Advocacy  Coalition  Framework;  Policy  Analysis 
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The  realm  of  employment  policy  includes  a  diverse 
set  of  public  programs  and  regulations  that  seek  to 
improve  a  population’s  employment  experience.  In 
particular,  employment  policy  attempts  to  reduce 
employment-related  risks  for  the  population,  to 
improve  the  overall  quantity  of  employment  oppor¬ 
tunities  available,  and  to  promote  the  quality  of 
established  employment  relationships.  Within 
employment  policy,  one  commonly  distinguishes 
between  “passive”  and  “active”  labor  market  poli¬ 
cies:  While  passive  policies  focus  on  attenuating  the 
consequences  of  market  rule,  for  example,  through 
offering  unemployment  insurance  that  guarantees 
income  support  during  periods  of  unemployment, 
active  policies  intervene  in  the  labor  market  to  help 
the  unemployed  gain  employment.  Conceptually  in 
between,  labor  laws  regulate  diverse  aspects  of  the 
employment  relationship,  including  issues  such  as 
nondiscrimination  and  unfair  dismissal. 

Seen  through  a  sociological  lens,  employment 
policies  are  an  integral  part  of  any  stable  labor 
market.  Given  the  strong  empirical  and  conceptual 
links  of  employment  policies  with  education  and 
social  security  policies,  as  well  as  with  the  regula¬ 
tion  of  industrial-labor  relations,  the  boundaries 
between  these  policy  realms  are  quite  fluid.  This 
entry  first  reviews  the  functional  needs  for  employ¬ 
ment  policy  in  theoretical  terms  and  then  describes 
the  evolution  of  employment  policy  making  in  the 
advanced  industrialized  countries  from  its  early 
beginnings  in  the  19th  century  through  the  tumul¬ 
tuous  interwar  period,  the  golden  age  of  welfare 
capitalism  in  the  1960s,  the  stagflation  of  the 
1970s,  and,  finally,  the  contemporary  period. 
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Functional  Needs  for  Employment  Policy 

In  contrast  to  simple  product  markets,  the  labor 
market  requires  special  employment  policies  that 
help  sustain  the  market’s  legitimacy  in  the  eyes  of 
the  population  and  produce  a  high  level  of  eco¬ 
nomic  performance.  Indeed,  while  standard  prod¬ 
ucts  can  easily  be  allocated  as  commodities  through 
market  mechanisms,  this  is  far  more  difficult  in  the 
case  of  labor.  The  units  of  the  labor  market  neither 
are  created  for  the  purpose  of  exchange  nor  can 
they  be  treated  as  disembedded  from  their  social 
surroundings;  instead,  they  are  human  beings  with 
social  and  physical  needs.  As  such,  labor  is  a  rather 
imperfect  “commodity,”  with  a  supply  function 
that  is  very  different  from  standard,  less  socially 
embedded  commodities.  For  instance,  at  both  ends 
of  the  supply  spectrum,  labor  supply  tends  to  show 
a  negative  correlation  between  prices  and  quantity, 
which  is  the  opposite  of  the  standard  relationship: 
At  the  bottom  end,  wage  reductions  can  prompt 
workers  to  work  longer  hours  in  pursuit  of  the 
particular  income  level  necessary  for  a  desired  liv¬ 
ing  standard;  at  the  higher  end,  wage  increases  can 
induce  workers  to  choose  more  leisure.  But  such 
behavior  is  not  the  only  deviation  from  standard 
commodity  markets.  Equally  crucial  for  policy 
making  and  unusual  compared  with  other  com¬ 
modities,  employers  cannot  autonomously  decide 
how  to  use  their  purchased  labor;  instead,  they 
always  depend  on  the  repeated  and  active  collabo¬ 
ration  of  their  workers. 

Given  these  characteristics,  all  advanced  capitalist 
countries  have  created  social  protection  mecha¬ 
nisms  in  their  labor  markets  to  reduce  the  disloca¬ 
tions  associated  with  market  allocation,  safeguard 
the  stability  of  society  at  large,  and  thus  support 
the  long-term  viability  of  the  labor  market  as  an 
allocation  device.  Moreover,  governments  have 
also  tended  to  allow  for  the  formation  of  unions  as 
workers’  collective  organizations,  the  establish¬ 
ment  of  collective  wage-bargaining  systems,  and 
the  creation  of  mechanisms  for  exercising  stake¬ 
holder  voice  within  the  firm.  Such  measures  have 
reduced  the  frequency  and  scale  of  worker  revolts, 
which  were  once  common. 

In  capitalist  democracies,  the  functional  require¬ 
ments  for  employment  policy  are  particularly  high, 
because  employment  policy  plays  an  important 
role  in  “decontesting”  the  organizing  principles  of 
capitalism  and  those  of  democracy:  Employment 


policy  serves  as  a  means  to  make  compatible 
(a)  employers’  attempts  to  effectively  and  efficiently 
use  human  resources  and  (b)  the  upholding  of  civil 
and  social  rights  that  democracies  guarantee  to  all 
their  citizen.  At  times,  policy  solutions  can  be  found 
that  simultaneously  serve  both  democratic  and 
capitalist  logics,  such  as  with  governments’  recent 
attempts  to  broaden  access  to  higher  education, 
which  serves  to  increase  the  equality  of  citizens’ 
opportunity  as  well  as  to  provide  companies  with 
more  much  needed  human  capital.  Flowever,  in 
other  instances,  democratic  goals  have  taken  clear 
precedence,  such  as  when  some  countries  sought  to 
promote  “workplace  democracy”  and  the  “human¬ 
ization  of  work”  or  when  citizens  are  uncondition¬ 
ally  guaranteed  basic  income  support  that  might 
undercut  work  incentives. 

Finally,  employment  policies  have  an  important 
role  to  play  in  improving  economic  performance. 
Consider,  for  example,  workers’  investments  in  the 
acquisition  of  skills  that  can  only  be  used  in  par¬ 
ticular  economic  activities.  The  success  of  individ¬ 
ual  companies  and  whole  sectors  depends  on  the 
availability  of  such  specific  skills;  however,  with¬ 
out  institutional  guarantees  by  companies  or  gov¬ 
ernments  that  reduce  the  risks  associated  with 
specific-skill  investments,  specific  skills  might  be 
undersupplied,  and  workers  might  choose  to  invest 
in  general  skills  instead.  Institutional  guarantees 
that  can  encourage  specific-skill  investments 
include  employment  protection  rules  supporting 
long-term  employment,  a  collective  wage-bargain¬ 
ing  system  buttressing  real-wage  stability,  and  an 
unemployment  insurance  system  providing  work¬ 
ers  with  wage-related  unemployment  benefits  in 
case  of  job  loss  and  releasing  them  from  the  need 
to  accept  any  available  job. 

In  summary,  by  helping  the  population  deal 
with  employment-related  risks,  employment  poli¬ 
cies  are  the  basis  for  a  stable  labor  market,  and  if 
designed  well,  they  can  contribute  to  the  effective 
and  efficient  employment  of  the  population. 

A  Short  History  of  Employment  Policy 

Public  employment  policies  have  their  roots  in  the 
establishment  of  large-scale  labor  markets  during 
industrialization.  The  goals  and  content  of  these 
policies  have  since  then  evolved,  driven  by  the 
development  of  new  political  answers  to  changing 
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socioeconomic  challenges.  As  such,  the  evolution 
of  employment  policies  has  been  at  the  center  of 
the  changing  character  of  statehood  and  the  shift¬ 
ing  regulation  of  large  parts  of  social  life. 
Importantly,  even  though  individual  countries’ 
responses  to  common  challenges  have  always 
included  similar  features,  each  country  has  trav¬ 
eled  on  a  distinct  and  contingent  national  develop¬ 
mental  trajectory,  in  which  feedback  from  earlier 
policies  has  tended  to  bound  policy  innovations  in 
later  periods.  The  next  section  highlights  both 
common  trends  and  national  particularities. 

Building  Welfare  States 

Industrialization  during  the  19th  century  gave 
rise  to  the  “social  question”  of  how  economic  prog¬ 
ress  could  be  maintained  in  the  face  of  the  political 
and  ethical  threats  emanating  from  the  abysmal 
conditions  under  which  the  working  class  was 
employed.  The  answer  was  the  progressive  develop¬ 
ment  of  welfare  states,  with  employment  policy 
being  a  core  part  of  a  new  active  form  of  statehood. 
Over  time,  and  framed  around  the  nation  as  the 
community  of  fate,  prudent  political  leaders  set  up 
social  protection  institutions  to  provide  collective 
mechanisms  for  covering  the  costs  of  progress  and 
to  manage  employment-related  risks.  The  first  labor 
market  regulations  and  social  insurance  programs 
were  set  up  before  the  end  of  the  19th  century;  by 
the  1930s,  unemployment  insurance  and  the  first 
active  labor  market  policies  were  established. 

Some  governments  addressed  new  social  needs 
more  quickly  and  some  more  slowly,  some  to  pre¬ 
empt  pressure  from  below,  some  in  reaction  to 
such  pressure,  and  some  out  of  ethical  conviction. 
A  leader  in  the  expansion  of  suffrage,  Britain  had 
established  labor  market  regulations,  such  as  on 
working  conditions  in  certain  industries,  maxi¬ 
mum  working  hours,  and  age  thresholds  for  legal 
employment  by  the  later  part  of  the  century.  At  the 
same  time,  Germany’s  monarchist  government 
sought  to  calm  revolutionary  impulses  through 
carrot-and-stick  policies  that  included  both  the 
outlawing  of  the  Social  Democratic  Party’s  activi¬ 
ties  (1878-1890)  and  the  introduction  of  statutory 
social  insurance  schemes  for  health  (1883),  acci¬ 
dents  (1884),  and  disability  and  old  age  (1889). 

The  establishment  of  national  unemployment 
insurance  systems  took  a  little  longer.  In  the 


United  Kingdom,  a  mandatory  public  unemploy¬ 
ment  scheme  was  established  in  1911.  In  Germany, 
an  unemployment  insurance  scheme  was  intro¬ 
duced  in  1927,  replacing  a  system  of  income  sup¬ 
port  set  up  in  1911.  In  the  United  States,  national 
unemployment  insurance  was  established  in  the 
mid-1950s  as  part  of  the  Social  Security  Act,  a 
piece  of  legislation  conceived  to  limit  what  were 
seen  as  the  dangers  of  modern  American  life.  But 
the  eligibility  and  disbursement  of  unemployment 
benefits  can  also  be  organized  in  another  way: 
Named  after  the  Belgian  city  where  it  was  first 
implemented,  the  Ghent  system  is  a  voluntary 
scheme  that  grants  unions  the  responsibility  to 
administer  unemployment  benefits.  With  the 
exception  of  Norway,  the  Scandinavian  countries 
run  Ghent  unemployment  insurance  systems  to 
this  day,  although  the  regulation  and  financing  of 
these  systems  has  changed  over  the  years,  with  the 
countries’  national  governments  regulating  and 
subsidizing  union-administered  funds. 

Active  labor  market  policies  were  introduced  on 
a  larger  scale  in  the  interwar  years,  prompted  by 
widespread  unemployment  in  the  wake  of  the 
Great  Depression.  For  instance,  in  the  United 
States,  several  New  Deal  agencies  were  set  up  to 
provide  employment  until  the  economy  improved. 
One  of  these  agencies,  the  Works  Progress  Admin¬ 
istration  (WPA),  became  the  nation’s  largest 
employer.  Offering  almost  8  million  jobs  between 
1935  and  1943,  the  WPA  carried  out  public  works 
projects  all  over  the  country,  from  constructing 
new  public  buildings  and  roads  to  running  arts 
and  literacy  projects.  Only  the  country’s  entry  into 
World  War  II  brought  an  end  to  the  efforts  of  the 
WPA  and  other  agencies. 

Parallel  developments  were  under  way  in 
Europe,  albeit  under  very  different  political  aus¬ 
pices.  In  Germany,  a  democratic  government 
introduced  a  voluntary  labor  service  in  1931  as  a 
means  to  reduce  unemployment.  A  few  years  later, 
the  National  Socialists  established  a  6-month  man¬ 
datory  scheme  that  all  young  men,  and  after  1939 
also  all  women,  had  to  join.  Finally,  facing  totali¬ 
tarian  threats  from  both  the  East  and  the  South, 
the  Scandinavian  countries  completely  reoriented 
their  national  economic  policy  regimes  around  a 
more  interventionist  state  during  the  1930s.  Based 
on  explicit  political  deals  between  Social  Democrats 
and  agricultural  parties,  policy  solutions  were 
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developed  that  served  both  the  struggling  working 
class  and  farmers.  For  instance,  in  Denmark,  the 
so-called  Kanslergade  Agreement  laid  the  founda¬ 
tion  for  extending  the  length  of  collective  agree¬ 
ments  for  currently  employed  workers  without 
wage  reductions,  initiating  public  works  programs 
and  winter  relief  for  the  unemployed  and  their 
families,  granting  farmers  both  an  export-stimulat¬ 
ing  currency  devaluation  and  agricultural  price 
supports,  and  consolidating  the  existing  social 
insurance  and  poverty  relief  measures  into  a  com¬ 
prehensive  program  of  old-age  pensions  and 
health,  unemployment,  and  accident  insurance. 

The  Golden  Age 

The  postwar  period  brought  a  30-year-long 
economic  boom  and  the  golden  age  of  welfare 
capitalism.  Nurtured  by  a  broad  discourse  about 
social  rights,  of  which  the  1944  Beveridge  Report 
in  Britain  was  just  one  example,  and  sustained  by 
system  competition  with  communism,  the  workers 
in  Western  democracies  received  their  peace  divi¬ 
dend  in  the  consolidation  of  the  mixed-economy 
welfare  state,  which  sought  to  combine  economic 
growth,  full  employment,  social  partnership 
between  capital  and  labor,  and  universal  welfare 
programs.  Based  on  the  efficiency  increases  and 
employment  opportunities  provided  by  the  wide¬ 
spread  adoption  of  Fordist  mass  production  tech¬ 
niques  within  industry  and  supported  by  political 
commitments  to  wage  compression  in  the  economy 
more  generally,  the  pleasures  of  middle-class  life 
became  accessible  to  broad  majorities  of  the  popu¬ 
lation,  even  for  those  workers  without  higher  lev¬ 
els  of  education. 

Within  the  realm  of  employment  policy,  macro- 
economic  Keynesian  demand  management  was  put 
into  the  service  of  guaranteeing  the  full  employ¬ 
ment  of  all  willing  man  power.  This  was  feasible 
because  the  newly  founded  international  economic 
regime  around  the  Bretton  Woods  institutions  tol¬ 
erated  policy  making  geared  toward  domestic 
concerns  and  allowed  for  cheap  public  borrowing 
at  rates  of  interest  below  those  of  economic 
growth.  Moreover,  in  part,  Keynesian  demand 
management  was  also  necessary:  Having  replaced 
the  traditional  craft  organizations  with  Taylorist 
principles  internally,  companies  recast  themselves 
as  vertically  integrated  corporations,  geared  to 


making  use  of  economies  of  scale  and  “pushing” 
standardized  products  into  the  market  over  long 
production  runs.  Such  large-scale  manufacturing 
came  with  a  degree  of  rigidity  in  the  economy, 
with  drops  in  demand  being  difficult  to  absorb  for 
companies,  leaving  the  potential  for  initial  down¬ 
turns  in  demand  to  accumulate  into  bigger  crises. 

Reacting  to  Stagflation 

With  the  collapse  of  the  international  financial 
regime  and  simultaneous  increases  in  unemploy¬ 
ment  and  inflation,  the  1970s  heralded  the  end  of 
the  Keynesian  era.  A  series  of  increases  in  oil  prices 
provided  the  advanced  economies  with  negative 
supply  shocks,  which  traditional  methods  of  eco¬ 
nomic  policy  making  could  not  effectively  address. 
Keen  to  get  wage-push  inflation  under  control, 
many  governments  experimented  with  income 
policy  that  sought  to  provide  a  ceiling  for  wage 
increases.  Moreover,  in  exchange  for  wage  mod¬ 
eration  in  the  short  term,  some  governments 
offered  increased  pension  levels  that  would  provide 
workers  with  delayed  benefits  in  the  long  term.  In 
these  quests,  countries’  relative  success  depended 
very  much  on  how  much  their  macroeconomic 
institutions  facilitated  strategic  coordination 
between  the  government,  organized  labor,  and 
capital,  as  well  as  the  country’s  interest  rate-setting 
central  bank.  Here,  countries  with  strong  macro- 
corporatist  institutions  had  certain  advantages. 

Particularly,  in  the  pluralist  English-speaking 
countries,  which  were  institutionally  least  equipped 
for  effective  tripartite  coordination  between  the 
government  and  the  organized  social  partners, 
conservatives’  ideas  about  how  to  proceed  became 
increasingly  popular.  Conservatives  located  the 
source  of  current  economic  problems  in  welfare 
states’  institutions  themselves.  Moreover,  the 
advanced  democracies’  populations  were  increas¬ 
ingly  tired  of  shouldering  the  tax  increases  associ¬ 
ated  with  states’  weakening  finances,  and  tax 
revolts  occurred  in  places  as  far  apart  as  California 
and  Denmark.  Talk  about  sclerotic  institutions 
and  the  problems  of  bureaucracy  eventually 
became  mainstream.  On  that  basis,  government 
action  was  increasingly  seen  as  the  problem  rather 
than  the  solution;  moreover,  rather  than  being 
perceived  as  positive  agents  for  increasing  worker 
welfare,  unions  came  to  be  seen  as  self-interested, 
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rent-seeking,  and,  ultimately,  harmful  lobbies. 
These  ideas  helped  bring  Margaret  Thatcher  and 
Ronald  Reagan  to  power;  in  moderated  form,  they 
animated  the  election  victories  of  the  political 
center-right  in  other  countries. 

Breaking  with  demand-side-oriented  policies 
during  the  postwar  period,  the  advanced  democra¬ 
cies’  governments  sought  to  improve  their  econo¬ 
mies’  supply  sides  through  deregulation  throughout 
the  1980s.  Often,  however,  liberating  market  forces 
and  retrenching  the  welfare  state  proved  to  be  a 
more  difficult  undertaking  than  imagined,  because 
electorates  continued  to  support  many  social  insur¬ 
ance  programs.  In  this  context,  it  was  symptomatic 
that  the  U.S.  Social  Security  retirement  program  in 
the  United  States  became  known  as  the  “third  rail 
of  American  politics,”  promising  certain  political 
death  to  those  who  would  seek  to  reduce  it.  While 
cuts  were  eventually  made  even  in  Social  Security, 
these  remained  comparatively  moderate  in  contrast 
to  the  cuts  in  employment  policy  programs  that 
primarily  benefited  the  poorer  and  politically  less 
vocal  strata  of  the  population.  Labor  market  pro¬ 
grams  that  were  linked  to  the  U.S.  War  on  Poverty 
during  the  1960s  and  to  the  public  employment 
initiatives  launched  in  the  1970s  fared  particularly 
badly,  as  they  came  to  be  easy  targets  for  politicians 
who  framed  the  former  as  special-interest  policies 
for  racial  minorities  and  the  latter  as  vehicles  for 
corruption. 

While  it  was  broadly  agreed  that  welfare  states 
had  grown  to  their  limits,  countries  diverged  in 
how  staunchly  they  walked  the  deregulation  walk. 
For  instance,  the  policy  record  of  Germany’s  cen¬ 
ter-right  government  at  the  time  was  quite  differ¬ 
ent  from  that  of  Thatcher  or  Reagan.  In  line  with 
some  other  countries,  Germany  strongly  embraced 
early-retirement  policies  as  a  means  to  reduce 
long-term  unemployment  and  to  buy  peace  in 
labor  relations.  In  the  early  1980s,  policymakers 
introduced  the  possibility  for  people  aged  60-plus 
to  claim  a  full  pension,  as  long  as  they  had  been 
unemployed  for  more  than  a  year.  Helped  by  such 
regulations,  overall  employment  in  Germany 
declined  by  an  average  of  0.3%  annually  from 
1980  to  1990;  by  1988,  less  than  one  out  of  every 
three  German  men  between  the  ages  of  60  and  64 
still  worked.  Finally,  when  hit  by  the  seismic  shock 
of  unification  in  1990,  the  government  followed  a 
familiar  adjustment  pattern,  unburdening  the  East 


German  labor  market  of  excess  labor  through 
early  retirement,  job  creation,  and  training  schemes 
that  brought  down  the  official  East  German  unem¬ 
ployment  rate  to  half  of  what  it  would  otherwise 
have  been. 

Increasing  Labor  Market  Flexibility 

Driven  by  the  continued  deepening  of  economic 
globalization,  economies’  turn  toward  services, 
and — maybe  above  all  other  reasons — persistent 
long-term  unemployment,  the  debates  about  nec¬ 
essary  reforms  of  employment  policies  further 
intensified  throughout  the  1990s.  Inspired  by  neo¬ 
classical  economic  analysis  that  saw  labor  market 
rigidities  at  the  root  of  Europe’s  unemployment 
problems,  international  experts — including  those 
from  the  Organisation  for  Economic  Co-operation 
and  Development  (OECD) — recommended  in¬ 
creasing  the  flexibility  of  labor  markets.  Given 
that  the  future  of  economic  growth  depended 
strongly  on  further  increases  in  labor  productivity, 
some  analysts  called  for  the  removal  of  regula¬ 
tions  that  stood  in  the  way  of  the  most  efficient 
employment  of  labor  resources,  including  employ¬ 
ment  protection  rules.  Others  stressed  the  impor¬ 
tance  of  increasing  labor  market  participation, 
partially  as  a  means  to  support  economic  growth 
but  also  to  increase  the  ratio  of  the  active  to  the 
inactive  population.  The  latter  was  seen  as  neces¬ 
sary  to  sustain  the  working  population’s  contin¬ 
ued  ability  to  finance  social  security  benefits  and 
services  for  those  unable  to  work  in  the  context  of 
rapid  population  aging.  Finally,  while  most  dis¬ 
cussions  about  flexibility  tended  to  stress  the 
importance  of  expanding  employers’  choice  sets, 
changing  employment  patterns  had  also  prompted 
workers  to  demand  employment  regulations  that 
offered  increased  flexibility  for  themselves:  For 
instance,  with  double-earner  households  having 
replaced  male-breadwinner  arrangements  as  the 
norm,  parents  have  been  looking  for  innovative 
ways  to  combine  work  and  family  life. 

These  discussions  prompted  a  series  of  complex 
changes,  the  breadth  of  which  is  probably  best  illus¬ 
trated  in  the  European  Employment  Strategy: 
Through  a  continued  monitoring  and  benchmark¬ 
ing  process,  the  European  Union  seeks  to  encourage 
its  member  states  to  increase  the  role  of  preventive 
labor  market  policy,  put  in  place  measures  that 
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strengthen  workers’  employability  and  adaptabil¬ 
ity,  equalize  employment  opportunities  for  men 
and  women,  as  well  as  invest  in  education  and 
lifelong  learning. 

The  biggest  change  in  employment  policy  across 
all  rich  countries  has  occurred  in  how  states  treat 
their  unemployed  workers,  with  governments  of  all 
colors  having  embraced  new  measures  that  seek  to 
activate  the  unemployed  and  reduce  the  time  they 
spend  in  unemployment.  Moreover,  the  changes 
have  been  so  far-reaching  that  many  sociologists 
see  them  as  heralding  a  lasting  transformation  in 
the  character  of  the  state,  that  is,  one  in  which  the 
state  withdraws  from  its  commitment  to  guarantee 
comprehensive  welfare  independent  of  the  market 
and  moves  toward  a  commitment  to  support  indi¬ 
viduals’  attempts  to  succeed  in  the  market. 

Emphasizing  that  citizens  do  not  only  have 
rights  to  claim  but  also  responsibilities  to  fulfill, 
politicians  strengthened  job  search  requirements 
for  the  unemployed  and  introduced  stronger  pen¬ 
alties  for  the  repeated  rejection  of  job  offers.  In 
many  countries,  unemployment  benefits  have 
become  less  of  an  entitlement  geared  toward  sus¬ 
taining  a  family’s  lifestyle  and  ever  more  a  basic 
job  seeker’s  allowance  (to  use  the  official  British 
term).  Finally,  workers  on  benefits  are  increasingly 
expected  to  quickly  accept  job  offers  that  pay 
below — or  offer  fewer  responsibilities  than — what 
former  employment  positions  had  provided.  This 
is  an  important  change  from  earlier  practices  that 
gave  workers  more  time  to  look  for  employment 
appropriate  to  their  professional  status  and  level 
or  focus  of  education. 

Importantly,  the  implementation  of  activation 
policies  differs  strongly  between  countries.  While 
activation  in  some  countries  can  often  be  quite 
punitive,  more  progressive  versions  of  “workfare” 
policies  seek  to  empower  individuals  through 
providing  extensive  placement  services,  setting  up 
individual  action  plans,  and  offering  training 
opportunities.  For  instance,  in  the  United  States 
with  its  (neo)liberal  institutions,  the  welfare-to- 
work  regime  set  up  in  1996  has  strong  elements  of 
control  and  punishment,  which  some  critics  view  as 
being  harsh  on  benefit  claimants.  In  contrast,  run¬ 
ning  a  workfare  policy  regime  within  the  context  of 
a  social  democratic  universal  welfare  state,  Denmark 
seeks  to  reach  all  potential  workers,  not  just  benefit 
claimants;  moreover,  caseworkers  are  supposed  to 


offer  activation  options  that  can  increase  individu¬ 
als’  capabilities.  Finally,  Germany  as  a  Christian 
democratic  country  falls  probably  somewhere  in 
the  middle,  offering  more  substantial  income  sup¬ 
port  than  the  U.S.  system  but  having  strongly  cut 
back  on  training  offers  for  the  long-term  unem¬ 
ployed.  Common  to  activation  approaches  in  all 
countries  is  the  adoption  of  new  public  manage¬ 
ment  techniques  in  the  public  employment  services 
that  administer  activation  policies. 

As  this  overview  about  employment  policy  has 
sought  to  show,  the  history  of  employment  policy 
has  been  both  long  and  full  of  changes.  Cutting- 
edge  political  science  work  on  employment  policy 
seeks  to  uncover  the  political  drivers  of  diverging 
national  developments  and  to  conceptualize  the 
links  between  the  neighboring  policy  realms  of 
employment,  welfare,  and  education. 
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Policy  advice  does  not  exist  in  a  vacuum.  If  it  did, 
it  would  be  irrelevant  to  political  affairs,  of  only 
abstract  interest.  Instead,  policy  advice  can  only 
be  understood  in  relational  terms.  It  is  offered  by 
some  to  others.  Therefore,  a  number  of  central — 
and  closely  interrelated — questions  can  be  asked: 

1.  What  is  public  policy  advice? 

2.  To  whom  is  such  advice  provided  and  why? 

3.  Who  offers  policy  advice? 

4.  Why  is  policy  advice  needed? 

5.  How  is  policy  advice  provided? 

6.  What  constitutes  “good”  policy  advice? 

What  Is  Public  Policy  Advice? 

Policy  advice  is  a  generic  term  referring  to  a  wide 
range  of  activities  within  the  processes  of  govern¬ 
mental  policy  making.  These  include  research,  both 
formal  and  informal;  the  writing  and  presentation  of 
policy  memoranda  laying  out  policy  choices  and 
options;  discussions  and  negotiations  on  public  pol¬ 
icy  issues;  and  the  monitoring  and  evaluation  of  the 
effectiveness,  efficiency,  and  economy  of  policies. 

Policy  advice  may  usefully  be  distinguished 
from  political  advice,  though  there  is  no  clear  and 
unambiguous  distinction  but  rather  shades  of  gray. 
Whereas  policy  advice  relates  to  the  content  of 
public  policy,  and  may  often  address  complex 
technical  issues,  political  advice  comprises  counsel 
that  addresses  considerations  of  political  costs  and 
benefits  that  may  be  borne  by  those  policymakers 
who  hold  elected  office  in  liberal  democratic  soci¬ 
eties.  All  high-level  policy  advisers  are  usually 
required  to  be  sensitive  to  the  political  context  of 
the  advice  they  provide. 


For  heuristic  purposes,  the  so-called  public 
policy  process  is  often  depicted  as  a  policy  cycle 
moving  sequentially  from  problem  definition,  to 
policy  formulation,  to  policy  implementation,  to 
policy  evaluation,  and  then  perhaps  back  to  prob¬ 
lem  redefinition  and  policy  reformulation,  and  so 
on.  In  reality,  the  process  is  usually  not  that  tidy, 
and  various  stages  of  the  cycle  can  occur  simulta¬ 
neously  rather  than  sequentially.  As  the  American 
political  scientist  Carl  Friedrich  (1940)  famously 
put  it,  “Public  policy  is  being  formed  as  it  is  being 
executed,  and  it  is  likewise  being  executed  as  it  is 
being  formed”  (p.  6).  Nevertheless,  policy  advice 
can  be  directed  at  one  or  all  of  these  stages  of  pub¬ 
lic  policy  making,  and  advisers  may  be  specialists 
in  particular  areas,  most  notably  perhaps  in  policy 
formulation,  in  policy  implementation,  or  in  policy 
evaluation. 

To  Whom  Is  Policy  Advice  Provided? 

The  principal  answer  is  that  policy  advice  is  pro¬ 
vided  to  those  whose  formal  responsibility  it  is  to 
make  public  policy.  Generally  speaking,  in  parlia¬ 
mentary  systems,  this  is  the  main  function  of  the 
political  executive,  headed  by  a  prime  minister 
whose  government  must  enjoy  the  confidence  of 
the  legislature,  from  which  it  is  drawn;  and  in 
republican  constitutional  systems,  in  which  the 
executive  and  legislative  arms  of  government  are 
formally  separate,  it  is  the  president  and  his  or  her 
top  executives.  It  also  includes  members  of  the 
legislature  in  both  types  of  system,  whose  role  is  to 
scrutinize  and  debate  public  policy  and  to  pass  the 
laws  that  enact  it. 

Again,  as  indicated  by  Friedrich’s  observation, 
some  policy  advisers  may  simultaneously  be  the 
recipients  of  advice,  especially  those  who  hold 
senior  bureaucratic  offices.  They  themselves  will 
often  be  receiving  policy  advice  from  their  organi¬ 
zational  subordinates. 

Who  Offers  Policy  Advice? 

In  Western  democracies,  just  about  anyone  can 
offer  policy  advice.  A  policy  advice  continuum 
could  show  that  at  one  extreme  are  citizens  who 
make  their  views  known  to  governments  on  public 
policy  issues  and  at  the  other  extreme  are  those 
who  earn  their  living  by  offering  policy  advice  to 
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governments.  The  former  may  have  no  expertise  in 
a  policy  field  except — and  it  can  be  an  important 
exception — the  “expertise”  embodied  in  their  own 
understanding  of  their  interests  and  aspirations 
and  their  personal  familiarity  with  a  particular 
policy’s  effect  on  them.  The  latter  is  dominated  by 
bureaucratic  officials  and  policy  analysts  and 
advisers  working  in  government  agencies. 

Between  these  two  loosely  defined  extremities 
can  be  found  a  wide  range  of  people  who  offer 
policy  advice  to  government  decision  makers. 
These  people  include,  in  particular,  academics  in 
tertiary  education  institutions,  researchers  in  vari¬ 
ous  nongovernmental  policy  think  tanks,  analysts 
and  lobbyists  working  for  interest  groups,  and 
independent  private  consultants.  In  the  case  of 
academic  researchers,  those  most  likely  to  offer 
policy  advice  to  governmental  authorities  are  those 
who  engage  in  what  James  Coleman  (1972)  calls 
“decision-oriented,”  as  distinct  from  “conclusion- 
oriented,”  research.  The  former  is  an  explicit  guide 
to  action,  attempting  to  connect  the  academic 
world  with  the  world  of  political  action.  This  is 
often  a  fraught  endeavor  for  both  parties. 

Why  Is  Policy  Advice  Needed? 

The  total  absence  of  public  policy  advice  in  any 
political  system  would  imply  the  existence  of  a 
single  dictator  dependent  solely  on  his  or  her  own 
counsel  for  the  decisions  that  he  or  she  takes.  That 
has  probably  never  been  the  case  anywhere. 

Instead,  policy  advice  is  needed  to  better  inform 
the  policy  decisions  taken  and  to  try  to  ensure  that 
the  prospects  for  policy  effectiveness  are  maximized 
and  the  risk  of  policy  failure  minimized.  This  is 
because  public  policy  is  purposive  in  nature,  at  least 
in  its  most  explicit  form,  rationally  designed  to 
attain  intended  and  finite  outcomes  or  to  achieve, 
modify,  or  maintain  certain  standards  of  perfor¬ 
mance  over  time.  Governments  universally  are 
required  to  formulate  public  policies,  often  in  the 
face  of  high  levels  of  technical  complexity  and 
uncertainty,  and  in  doing  so,  they  must  depend 
heavily — but  by  no  means  exclusively — on  often 
sophisticated  bodies  of  theoretical  knowledge, 
much  of  which  will  be  generated  by  technological 
and  scientific  research.  For  these  reasons,  there  has 
been  a  huge  growth  over  the  past  several  decades  in 
most  Western  democracies,  particularly  in  the 


United  States,  of  what  might  be  called  the  “policy 
analysis  industry” — producing  rapidly  increasing 
numbers  of  policy  analysts  and  advisers  in  public 
organizations  who  have  been  trained  and  educated 
in  public  policy  and  associated  disciplines  in  tertiary 
education  institutions.  Whereas  before  World  War 
II  the  study  of  public  policy  focused  largely  on  why 
and  how  governments  made  policy,  analysis  of 
policy,  since  then  the  focus  has  become  much  more 
normative  and  prescriptive,  analysis  for  policy. 

How  Is  Policy  Advice  Offered? 

Policy  advice  can  be  offered  formally  or  infor¬ 
mally.  Most  of  what  has  been  said  above  refers  to 
the  formal  offering  of  policy  advice  through  estab¬ 
lished  and  managed  institutional  arrangements, 
such  as  those  between,  in  parliamentary  systems, 
cabinet  ministers  and  their  senior  bureaucratic 
advisers,  or  in  a  republican  system,  such  as  that  of 
the  United  States,  between  the  president  and  his  or 
her  top  appointees  in  executive  agencies.  Advice 
can  be  offered  in  formal  memoranda  or  verbally 
(which  has  perhaps  become  a  more  common  prac¬ 
tice  because  what  was  once  considered  confiden¬ 
tial  advice  is  now  more  publicly  accessible  through 
official  disclosure  legislation). 

Decision  makers  seeking  policy  advice  can, 
however,  do  so  outside  the  formal  institutional 
arrangements,  at  their  discretion  (and  with  sensi¬ 
tivity  to  political  risk).  The  bureaucratic  reforms 
that  took  place  in  several  Westminster  parliamen¬ 
tary  systems  in  the  1980s  and  1990s — such  as 
New  Zealand  and  Australia — were  intended, 
among  other  things,  to  render  policy  advice  more 
“contestable,”  that  is,  to  ensure  that  cabinet  min¬ 
isters  were  less  dependent  on  the  counsel  provided 
by  their  top  bureaucratic  advisers. 

What  Is  “Good”  Policy  Advice? 

Policy  advice  in  liberal  democracies  ideally  aspires 
to  satisfy  two  main  criteria:  to  be  open  and  politi¬ 
cally  contestable,  on  the  one  hand,  and  to  be  ratio¬ 
nal,  on  the  other.  The  two  are  not  necessarily 
always  compatible.  In  his  seminal  article,  “The 
Science  of  Muddling  Through,  ”  Charles  Lindblom 
(1959)  recognized  this  tension  in  arguing  that 
the  test  of  “good”  policy  itself  can  be  that  it 
achieves  its  objectives  or  that  those  contributing  to 
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its  formulation  can  agree  to  act  together,  even 
though  they  may  not  agree  on  the  objectives  a 
policy  is  intended  to  achieve. 

Thus,  good  policy  advice  can  be  demonstrably 
instrumental  in  shaping  “successful”  policy,  in 
that  the  desired  objectives  are  achieved;  or  in 
securing  political  coalitions  of  the  “policy  willing” 
to  ensure  that  action  itself  takes  place;  or  in  achiev¬ 
ing  some  combination  of  both.  Policy  advice  may 
be  seen  to  be  good  in  political  terms — by  a  govern¬ 
ment,  by  a  cabinet  member,  or  by  a  minister — but 
not  by  others,  whether  they  be  policy  analytical 
peers  or  people  affected  by  or  who  have  an  interest 
in  the  particular  policy. 

As  Lindblom  (1977)  shrewdly  observed,  “A 
good  defense  for  a  bad  policy  will  be  that  it  is  the 
result  of  a  process  in  which  everyone  was  heard” 
(p.  256).  This  insight  speaks  to  the  importance  of 
genuine  consultation  in  democratic  policy  pro¬ 
cesses,  in  the  interests  of  effective  coalition  build¬ 
ing.  Consultation  can  itself  be  an  important  form 
of  research,  involving  the  interviewing  and  survey¬ 
ing  of  interested  and  affected  parties,  carried  out 
by  policy  analysts  and  advisers. 

The  word  policy  itself  may  suggest  that  in  con¬ 
tradistinction  to  politics,  policy  is  apolitical.  But 
such  a  claim  cannot  be  sustained.  Peter  Jackson 
(2007)  forcefully  argues  that  “policy  advice  and  its 
knowledge  bases  are  ideological  in  nature  and  it  is 
pure  obscurantism,  pretence  and  conceit  to  suggest 
that  it  is  ‘objective’  knowledge,  that  it  is  indepen¬ 
dent  of  political  or  social  interests  and  without 
moral  considerations”  (p.  263).  While  this  asser¬ 
tion  may  be  seen  by  those  of  a  more  technocratic, 
or  antipolitical,  disposition  as  extravagant,  it  is  not 
difficult  to  defend. 

The  clients  of  policy  advisers  more  often  than 
not  have  to  march  to  the  beat  of  a  political  drum, 
which  is  not  always  well  understood  by  those  who 
find  it  difficult  to  understand  the  inevitable  tensions 
between  the  norms  of  democracy  and  rationality. 
Often  politicians  may  be  less  inclined  to  hear  argu¬ 
ments  that  run  counter  to  what  they  “know”  to  be 
true  or  desirable,  even  when  those  arguments  are 
supported  by  logical  reasoning  and  impeccable  evi¬ 
dence.  These  latter  qualities,  as  desirable  as  they  are 
in  themselves,  cannot  always  carry  the  day. 

Policy  making  is  generally  based  as  much  on 
beliefs  as  it  is  on  evidence.  The  politicians  who  are 
held  publicly  responsible  for  it  are  required  to  act, 


more  often  than  not,  rather  than  to  reflect  and 
may — for  reasons  perfectly  rational  in  themselves — 
bemoan  what  they  see  as  “paralysis  by  analysis.” 
For  their  part,  policy  advisers  soon  learn  that  their 
craft  demands  a  willingness  to  make  acceptable 
trade-offs  between  the  need  to  be  technically  dis¬ 
passionate  and  objective  and  the  requirement  that 
they  also  be  politically  relevant.  They  may  not 
always  admire  politicians,  but  they  must  learn  to 
understand  them.  The  converse  is  equally  true. 

Neither  quantitative  nor  qualitative  methods  of 
analysis  and  research  have  any  monopoly  on  the 
quality  of  policy  advice.  Rigorous  analysis  that  can 
form  the  basis  of  sound  policy  advice,  however 
assessed,  is  not  confined  to  the  application  of 
quantitative  techniques.  Disciplined  and  rigorous 
thinking  is  also  a  central  requirement  in  qualitative 
research — for  example,  in  applying  the  “lessons” 
of  history  to  current  policy  issues  (Richard 
Neustadt  &  Ernest  May,  1986).  In  any  event, 
good  policy  advice  will  depend  on  grounded  expe¬ 
rience,  discerning  judgment,  and  intuitional  wis¬ 
dom  as  much  as  on  rigorous  analysis. 

Finally,  policy  advice  must  be  kept  open  to  revi¬ 
sion  in  the  light  of  changing  circumstances, 
unfounded  assumptions,  and  poor  judgment.  It 
should  embody  a  capacity  for  ongoing  learning, 
resisting  any  tendency  to  become  self-fulfilling  in 
its  effects.  And  while  it  may  aspire  to  wisdom,  as  a 
category  that  transcends  data,  information,  and 
knowledge,  it  may  also  reflect  some  understanding 
that  wisdom  itself  inheres  in  an  intuitive  sense  of 
the  interconnectedness  of  things  in  the  complex 
and  uncertain  world  of  public  policy  making. 

Robert  Gregory 
Victoria  University  of  Wellington 
Wellington,  New  Zealand 
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Policy  analysis  is  a  term  that  is  applied  to  a  wide 
variety  of  practices  and  intellectual  approaches  and 
that  cannot  be  reduced  to  a  neat  definition.  The 
relationship  between  intellectual  approaches  and 
practice  is  not  clear.  Certainly,  there  is  a  dominant 
paradigm  that  sees  policy  analysis  as  an  exercise  in 
applied  social  science,  in  which  the  analytic  method 
of  the  social  sciences  is  mobilized  to  enable  govern¬ 
ments  to  make  the  optimal  choice,  and  graduate 
programs  in  policy  (particularly  in  North  America) 
elaborate  and  refine  the  knowledge  and  skills 
required  for  policy  analysis  in  this  sense.  But  even 
those  who  are  trained  and  employed  as  policy  ana¬ 
lysts  find  a  disjunction  between  the  methodology  in 
which  they  have  been  trained  and  the  demands  of 
the  job,  and  there  is  an  undercurrent  of  anxiety 
among  analysts  about  the  extent  to  which  their 
analysis  is  used  and  uncertainty  about  whether, 
and  in  what  way,  they  should  involve  themselves  in 
policy  practice.  So  it  is  difficult  to  give  an  unam¬ 
biguous  definition  of  policy  analysis  as  a  field  of 
study  or  of  practice.  It  is  conventional  to  distin¬ 
guish  between  analysis  for  policy  and  the  analysis 
of  policy,  and  we  will  begin  with  this  distinction. 
But  it  will  become  clear  that  the  diversity  of  percep¬ 
tions  of  policy  analysis  reflects  the  different  ways  in 
which  we  think  about  policy,  mobilizing  multiple 
and  overlapping  frameworks  of  meaning  to  make 
sense  of  what  we  see  and  what  we  do,  so  we  will 
need  to  dig  deeper  to  get  a  fuller  understanding  of 
policy  analysis.  Working  through  these  various 
usages  of  the  term  will  be  like  peeling  an  onion, 
though  we  hope  to  do  it  without  too  many  tears. 


Analysis  of  Policy  or  for  Policy? 

The  distinction  between  analysis  of  and  analysis  for 
reflects  an  attempt  to  separate  observation  from 
practice  and,  in  particular,  to  distinguish  policy 
analysis  as  a  specific  form  of  applied  social  science 
from  the  broader  field  of  the  study  of  government. 
Analysis  of  has  tended  to  be  the  less  significant 
branch,  perhaps  because  its  focus  on  the  analysis  of 
process  made  it  hard  to  distinguish  from  traditional 
political  science.  At  one  point,  there  was  a  tendency 
for  the  study  of  government  to  be  renamed  “public 
policy,”  with  the  work  of  government  being  seen  as 
the  production  of  policy  and  the  study  of  the  pro¬ 
cess  being  “policy  analysis.”  This  rested  on  a  per¬ 
ception  of  policy  as  a  process  of  official  problem 
solving,  which  was  widely  shared  but  which  later 
came  under  scrutiny  from  what  was  called  the 
critical  or  interpretive  approach  to  policy  analysis 
(which  is  discussed  further  in  this  entry).  It  was  also 
challenged  (less  directly)  by  the  governmentality 
approach  deriving  from  the  work  of  Michel 
Foucault,  which  looked  beyond  the  implied  ques¬ 
tion  of  “what  is  the  best  way  for  the  government  to 
solve  this  problem”  to  ask  how  situations  became 
problems,  who  could  or  should  address  them,  what 
modes  of  practice  are  seen  as  appropriate  responses, 
and  in  what  way  these  situations  recede  from  the 
ranks  of  problems  that  claim  public  attention. 

Policy  Fields 

Much  of  the  analysis  of  was  focused  on  specific 
policy  fields:  health  policy,  transportation  policy, 
environment  policy,  and  so  on.  This  interpretation 
of  policy  analysis  shares  the  assumption  that  gov¬ 
ernment  is  a  process  of  official  problem  solving, 
and  the  task  of  policy  analysis  is  to  identify  the 
problems,  the  participants,  the  responses,  and  the 
outcomes.  The  analyst  is  seen  as  the  source  of 
expert  knowledge  on  what  has  been  done  and 
could  be  done,  and  the  focus  is  on  what  is  described 
as  the  “substance”  of  policy — problems  and  puta¬ 
tive  solutions — rather  than  on  the  process.  Here, 
the  distinction  between  analysis  of  and  analysis  for 
has  tended  to  fade,  as  the  outside  observers  who 
become  expert  in  what  has  been  done  are  often 
seen  as  sources  of  advice  about  what  should  be 
done  (and  in  any  case,  tend  to  have  strong  views 
on  this  topic  whether  they  are  asked  or  not).  At  the 
same  time,  those  employed  as  policy  analysts  tend 
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to  become  more  specialized,  experts  in  a  particular 
field  (and  increasingly,  engaged  in  it  and  commit¬ 
ted  to  particular  approaches)  rather  than  detached 
technicians  used  to  compare  policy  alternatives 
with  one  another  and  concerned  to  give  an  “objec¬ 
tive”  answer.  This  perception  of  policy  analysis  as 
“knowing  about”  a  particular  policy  field  is  widely 
shared — among  academics,  practitioners,  and  the 
wider  public — and  underlies  much  of  the  writing 
with  policy  in  the  title. 

The  Call  for  a  Policy  Science 

But  the  main  game  in  policy  analysis  has  not  been 
about  the  subject  matter  but  about  method.  The 
origins  of  policy  analysis  as  an  intellectual  approach 
and  a  field  of  practice  are  often  traced  back  to  the 
United  States  in  the  aftermath  of  World  War  II, 
when  Harold  Lasswell  called  for  a  “policy  science” 
that  would  be  problem  focused,  interdisciplinary, 
and  explicitly  normative.  The  focus  on  policy  and 
analysis  would,  it  was  felt,  “lift  the  game”  of  the 
American  political  system  so  that  it  would  be 
driven  by  a  focus  on  problems  and  outcomes 
rather  than  by  techno-bureaucratic  inertia  and 
partisan  allocation  and  become  both  more  trans¬ 
parent  and  effectively  democratic.  The  policy 
analyst,  by  applying  the  social  sciences,  would 
produce  expert  and  objective  advice  on  the  ques¬ 
tions  confronting  government.  At  the  same  time, 
the  existence  of  this  technology  of  critique  would 
empower  the  “outsiders”  in  government,  strength¬ 
ening  their  ability  to  challenge  the  operations  of 
government  by  creating  a  discourse  of  instrumen¬ 
tal  rationality  in  relatively  transparent  terms. 

Policy  Analysis  as  a  Field  of  Study 

Through  the  1950s  and  1960s,  policy  analysis  took 
root  in  the  United  States,  both  as  a  body  of  knowl¬ 
edge  and  as  a  field  of  practice.  The  early  policy 
analysts  were  likely  to  have  PhDs  in  economics  or 
operations  research  and  to  come  from  institutions 
such  as  RAND  in  California.  But  soon  graduate 
schools  of  public  policy  emerged,  offering  master’s 
degrees  (comparable  with  MBA).  These  graduate 
programs  were  more  oriented  to  political  science, 
but  the  core  courses  in  policy  analysis  that  they 
usually  offered  elaborated  a  technology  of  choice 
grounded  in  economics,  with  the  task  of  the  policy 


analysis  being  to  identify  a  number  of  options;  to 
estimate  their  feasibility,  cost,  and  likely  impact; 
and  by  making  the  appropriate  calculations,  to 
determine  the  optimal  course  of  action.  The  meth¬ 
odology  was  grounded  in  (usually  unspoken) 
assumptions — that  policy  activity  consists  of  deal¬ 
ing  with  discrete  and  context-free  problems,  that 
government  is  composed  of  “decision  makers” 
who  are  recognized  as  being  responsible  for  any 
given  problem,  that  their  sole  concern  is  to  solve 
the  problem,  and  that  calculations  from  welfare 
economics  are  recognized  as  determinant  in  argu¬ 
ments  about  the  best  course  of  action.  The  work  of 
the  policy  analyst  was  to  use  this  methodology  to 
determine  the  optimal  course  of  action  and  to  com¬ 
municate  this  to  the  “policymakers” — “speaking 
truth  to  power,”  as  Aaron  Wildavsky,  the  founding 
dean  of  the  Graduate  School  of  Public  Policy  at  the 
University  of  California,  Berkeley,  put  it.  The 
teaching  of  these  institutions  was  supplemented  by 
conferences,  journals,  and  the  formation  of  a  pro¬ 
fessional  association,  the  Association  for  Public 
Policy  and  Management  (APPAM).  APPAM  con¬ 
ferences  became  major  meeting  places  for  the  inter¬ 
section  of  policy  analysis  as  a  field  of  study  with 
policy  analysis  as  a  field  of  practice,  and  its  journal 
has  been  a  showcase  for  exercises  in  the  systematic 
evaluation  of  options.  By  the  turn  of  the  century,  it 
was  confidently  claimed  that  policy  analysis  in  the 
United  States  had  “come  of  age.” 

Policy  Analysis  as  a  Field  of  Practice 

What  was  less  clear,  though,  was  just  what  it  was 
that  these  policy  analysts  were  doing.  The  initial 
perception  had  been  that  they  would  be  detached 
scientific  experts  who  could  be  called  in  when 
required,  or  if  they  were  to  have  a  permanent  pres¬ 
ence,  it  would  be  in  a  small  unit  close  to  the  top, 
where  they  would  be  “advising  the  Prince.”  This 
was  the  way  Robert  McNamara  sought  to  use  pol¬ 
icy  analysis  as  U.S.  Secretary  of  Defense  in  the 
1960s,  seeing  it  as  offering  a  systematic  and  impar¬ 
tial  appraisal  of  the  ambitions  of  the  military  chiefs. 
But  this  could  set  in  motion  a  dispersal  of  policy 
analysis  through  the  wider  world  of  governing:  If 
chief  executives  were  using  policy  analysis  as  the 
basis  for  their  choices,  it  would  be  only  sensible  for 
executives  at  the  next  lower  level  to  employ  their 
own  policy  analysts  to  try  to  negotiate  with  the 
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top-level  analysts  or  at  least  to  second-guess  them. 
For  this  reason,  policy  analysts  became  common  at 
lower  levels  in  the  organization,  and  in  business 
associations  and  nonprofit  organizations,  to  facili¬ 
tate  their  dealings  with  government.  Increasingly, 
policy  analysts  found  themselves  not  so  much  advis¬ 
ing  the  prince  as  negotiating  with  policy  analysts 
from  other  organizations.  Rather  than  being  experts 
in  choosing  between  courses  of  action,  they  became 
experts  in  particular  fields  of  action,  such  as  child 
care  or  urban  mass  transportation,  and,  increas¬ 
ingly,  advocates  of  particular  approaches  to  these 
areas  of  concern.  The  technologies  of  choice  in 
which  they  had  been  trained  became  not  recognized 
ways  of  identifying  the  optimal  course  of  action  but 
“dueling  swords”  to  be  used  as  resources  in  encoun¬ 
ters  with  policy  analysts  and  others  from  other 
organizations.  And  the  work  of  policy  analysts  was 
often  only  loosely  connected  to  advising  decision 
makers,  covering  everything  from  background 
research  for  legislators  to  public  education  cam¬ 
paigns  to  lobbying  for  particular  public  policies. 

Policy  Analysis  and  Policy  Advising 

In  the  development  of  policy  analysis,  the  focus 
was  on  the  application  of  the  canons  of  the  social 
sciences  to  generate  the  basis  for  a  decision  maker 
to  make  the  optimal  choice.  It  was  assumed  that 
this  would  be  what  the  decision  maker  wanted,  and 
policy  analysts  expressed  frustration  when  their 
carefully  crafted  solution  was  not  adopted.  But 
leaders  in  government  found  that  “problems”  were 
not  neatly  set  up  for  them  to  address  like  a  golf  ball 
on  a  tee  but  were  part  of  a  continuing  flow  of  activ¬ 
ity  involving  a  range  of  participants  who  might 
have  divergent  perceptions  of  the  nature  of  the 
problem,  its  significance,  and  what  response  would 
be  appropriate  and  whose  responses  were  in  any 
case  shaped  by  their  experience  of  past  dealings 
with  the  other  participants  and  their  expectations 
about  future  dealings.  These  leaders  might  feel  that 
the  knowledge  that  they  needed  was  not  so  much  a 
calculation  of  the  optimal  outcome  as  guidance 
about  the  best  way  to  manage  an  ambiguous,  con¬ 
tested,  and  continuing  field  of  activity.  Policy 
workers  found  that  they  were  concerned  as  much 
with  the  strategic  management  of  the  policy  issue 
as  with  the  systematic  comparison  of  options,  and 
in  many  cases,  their  work  was  described  as  “policy 


advising.”  Some  writers  have  stressed  the  impor¬ 
tance  of  distinguishing  between  analysis  (seen  as 
detached  and  scientific)  and  advising  (seen  as  more 
engaged,  experiential,  and  judgmental).  Others 
built  the  diversity  and  conflict  into  the  analysis, 
looking  for  formulae  that  would  incorporate  the 
divergent  perceptions  and  agendas  of  the  partici¬ 
pants  into  the  calculations  in  the  hope  that  they 
would  accept  the  outcome  because  their  views  had 
been  “taken  into  account,”  although  it  was  com¬ 
monly  found  that  trying  to  resolve  conflict  through 
formulae  was  not  successful  because  the  conflicting 
parties  simply  used  the  data  from  the  analysis  as 
ammunition  in  the  continuing  conflict.  But  this 
extension  of  policy  analysis  into  the  management 
of  policy  conflict  was  done  through  formulae  or 
advising  based  on  criteria  drawn  from  experience: 
It  raised  for  policy  workers  the  question  of  whether 
they  saw  policy  analysis  as  a  technical  input  into 
the  policy  process  or  as  a  continuing  and  commit¬ 
ted  engagement  with  it. 

For  some  policy  workers  (and  writers),  the  tech¬ 
nical  skill  of  policy  analysis  is  seen  in  terms  of  the 
analyst’s  understanding  of  the  process — the  situa¬ 
tion  as  well  as  the  problem — and  the  task  for 
analysis  as  “in  these  circumstances,  how  can  we 
achieve  the  most  satisfactory  outcome?”  Policy 
analysis  becomes  less  the  selection  of  the  optimal 
outcomes  and  more  an  exercise  in  skilful  maneu¬ 
vering  in  a  crowded  terrain,  involving  an  under¬ 
standing  of  the  participants,  stances,  and  struc¬ 
tured  processes  as  well  as  the  substance  of  the 
policy  issue,  with  policy  analysis  as  a  more  or  less 
formal  exercise  that  might  be  mobilized  in  sup¬ 
port.  In  this  perspective,  books  on  policy  analysis 
are  less  concerned  with  analytic  method  than  with 
street-smart  guidance  for  the  practitioner,  though 
recently  there  has  been  more  research  (particularly 
in  Europe)  on  how  policy  workers  approach  the 
task  of  making  these  strategic  appreciations  of  the 
question  and  the  extent  to  which  this  generates  an 
experience-based  analysis  that  might  counter  the 
instrumental  calculations  and  projects  of  tradi¬ 
tional  American  policy  analysis. 

Evaluation:  The  Policy 
Analysis  That  Got  Away 

The  first  generation  of  policy  analysts  in  the 
United  States  was  trained  in  the  expert  appraisal  of 


1896  Policy  Analysis 


alternative  courses  of  action,  and  the  methodology 
for  making  these  appraisals  became  a  standard 
part  of  the  graduate  programs  in  policy  that 
emerged  in  the  United  States.  It  was  essentially 
based  on  welfare  economics  and  sought  to  calcu¬ 
late  the  net  utility  gain  from  each  of  a  limited 
number  of  alternatives — usually  before  the  event 
(ex  ante )  and  sometimes  after  it  (ex  post).  This 
developed  into  a  distinct  field  of  knowledge  and 
practice  called  evaluation.  It  can  be  argued  that 
this  had  its  origins  in  the  assessment  of  educational 
practice,  but  it  gained  its  impetus  in  the  1960s  and 
1970s,  particularly  in  the  United  States,  as  the 
instrumental  use  of  grants  by  governments 
(particularly  central  governments)  became  more 
common  and  was  increasingly  accompanied  by 
requirements  that  the  performance  of  the  grantee 
agency  be  subject  to  independent  evaluation.  This 
reflected  the  increasing  appeal  of  principal-agent 
theory  in  government.  It  had  been  common  for 
governments  to  give  financial  support  in  various 
ways  to,  for  instance,  religious  or  community  bod¬ 
ies  carrying  out  charitable  works,  but  in  the  later 
years  of  the  20th  century,  this  support  was  seen  as 
the  means  for  government  to  contract  with  agents 
to  achieve  known  and  specified  outcomes.  The 
charitable  bodies  were  defined  as  the  agents  of 
government,  and  evaluation  was  seen  as  the  means 
whereby  governments  could  check  that  what  they 
had  paid  for  had  been  delivered.  The  methodology 
of  these  evaluations  drew  broadly  on  the  research 
methods  of  the  social  sciences,  applied  in  a  frame¬ 
work  of  instrumental  rationality:  What  were  the 
objectives  of  the  program?  How  could  these  be 
measured?  Were  the  specified  outcomes  achieved? 
In  this  way,  it  was  argued,  the  systematic  analysis 
of  the  social  sciences  could  be  mobilized  in  support 
of  the  pursuit  of  policy  goals. 

But  as  with  policy  analysis,  beneath  the  devel¬ 
opment  of  methodologies  for  evaluation  lay  the 
tension  between  detachment  and  engagement:  Is 
the  evaluator  to  be  an  inspector  (a  detached  part  of 
the  apparatus  of  control)  or  a  coach  (a  committed 
part  of  the  team)?  The  central  officials  who  com¬ 
mission  evaluation  may  see  it  as  a  means  of  exer¬ 
cising  control  over  the  evaluated,  but  on  the 
ground,  inspectors  tend  to  enter  into  trading  rela¬ 
tionships  and  develop  shared  understandings  with 
the  subjects  of  their  scrutiny.  Egon  Cuba  and 
Yvonna  Lincoln  (1989)  have  argued  that  there  has 


been  a  shift  over  time,  from  a  detached  inspectorial 
form  of  evaluation,  which  they  called  “third- 
generation  evaluation,”  to  a  more  supportive  and 
negotiated  “fourth-generation”  type,  but  while 
there  is  some  evidence  to  support  this  argument,  it  is 
challenged  by  the  increased  use  of  quasi-contractual 
relationships  in  government  and  the  demand  for 
“objective”  measures  of  performance,  which  sug¬ 
gest  that  the  inspectorial  evaluation  is  alive  and 
well.  The  explanation  seems  to  be  that  there  is  an 
inherent  tension  between  inspection  and  collabo¬ 
ration,  but  both  have  their  part  to  play  in  the 
action,  with  the  internal  logic  pointing  to  practices 
of  collaboration  and  the  external  logic  calling  for 
the  presentation  of  evaluation  in  terms  of  detached 
inspection. 

Policy  Analysis  as  Applied  Knowledge 

The  last  subcategory  of  analysis  for  might  be  called 
applied  knowledge — that  is,  where  policy  analysis 
is  seen  as  generating  the  sort  of  knowledge  that  is 
needed  for  decision  making.  Certainly,  there  are 
often  research  bureaus  established  within  govern¬ 
ment  in  specific  areas  such  as  health  policy,  indus¬ 
try  policy,  or  agriculture  policy,  and  this  can  be 
seen  as  evidence  of  a  need  for  specialized  analysis 
of  policy  areas.  In  some  cases,  though,  there  seems 
to  be  a  tendency  to  wind  down  specialist  units 
within  government  and  rely  more  on  the  ad  hoc 
use  of  consultants.  Also,  the  evidence  that  political 
leaders  seek  the  advice  of  the  subject  experts 
within  the  tent,  or  that  it  constitutes  a  distinct 
form  of  analysis,  is  not  very  strong,  and  institu¬ 
tional  analysts  like  James  March  and  Johan  Olsen 
would  argue  that  this  sort  of  explicit  gathering  of 
knowledge  is  not  so  much  to  inform  the  decision 
makers  as  to  reassure  the  audience  that  the  deci¬ 
sion  was  appropriately  made. 

Perhaps  more  significant  is  policy  analysis  as 
oversight:  the  mobilization  of  analysis  for  a  more 
comprehensive  and  overarching  interrogation  of 
the  work  of  government.  Sometimes  high-level 
policy  analysis  and  review  units  are  established  in 
government,  such  as  the  United  Kingdom  (UK)’s 
Central  Policy  Review  Staff  in  the  1970s.  The  argu¬ 
ment  is  that  a  small  unit,  not  involved  in  the  opera¬ 
tions  of  government  and  reporting  to  the  top-level 
leaders,  will  be  able  to  generate  an  analysis  that 
takes  in  the  whole  of  government  activity,  identifies 
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the  interconnections,  and  focuses  on  outcomes  and 
alternatives.  This  may  be  so,  but  there  are  not 
many  examples  of  these  units,  they  tend  not  to  be 
very  long-lived,  and  it  is  difficult  to  discern  a  dis¬ 
tinctive  mode  of  analysis  emerging  from  them. 
Often,  governments  prefer  to  commission  a  one-off 
independent  review  of  a  policy  area  (to  which  they 
can  pay  as  much  or  as  little  attention  as  they  like) 
rather  than  to  have  systematic  review  built  into  the 
normal  expectations  of  government.  It  can  also  be 
argued  that  now  the  claims  for  the  merits  of  over¬ 
arching  policy  analysis  carried  out  ad  hoc  by  high- 
level  special  units  have  been  supplanted  by  the 
broader  based  advocacy  of  evidence-based  policy 
as  part  of  normal  expectations. 

Practicing  Analysis,  Analyzing  Practice 

Clearly,  policy  analysis  is  a  term  applied  to  a 
diversity  of  intellectual  approaches  and  forms  of 
practice.  Because  it  is  a  concept  that  is  mobilized 
in  practice,  its  meaning  will  reflect  its  utility  in 
facilitating  practice  as  well  as  in  its  contribution 
to  the  analysis  of  practice.  The  idea  of  policy 
analysis  is  important  in  validating  the  activity  of 
government,  part  of  a  set  of  rituals  by  which  soci¬ 
ety  assures  itself  that  human  existence  is  built 
around  intelligent  choice.  But  the  forms  of  prac¬ 
tice  that  count  as  policy  analysis  vary,  reflecting 
the  fact  that  “policy”  itself  is  made  up  of  a  com¬ 
plex  overlap  of  meanings  that  are  manifest  in  an 
interplay  of  accounts.  Hal  Colebatch  argues 
(2009)  that  it  is  helpful  to  see  policy  in  terms  of 
the  interplay  of  accounts  of  authoritative  choice , 
structured  interaction ,  and  social  construction. 
Authoritative  choice  accounts  present  policy  in 
terms  of  governments  making  decisions,  struc¬ 
tured  interaction  accounts  see  it  more  as  the  out¬ 
come  of  continuing  interaction  by  stakeholders, 
and  social  construction  accounts  focus  on  the 
“collective  puzzling”  about  what  is  normal  and 
what  is  problematic,  what  demands  action,  and 
what  sorts  of  action  are  appropriate.  Each  of  these 
accounts  of  policy  makes  meaningful  and  legiti¬ 
mate  a  distinct  array  of  practices,  to  all  of  which 
the  term  policy  analysis  can  legitimately  be 
applied.  To  make  sense  of  this  diversity  of  prac¬ 
tices,  then,  we  need  to  consider  the  nature  of  these 
accounts  and  the  sort  of  policy  analysis  that  they 
validate. 


Policy  Analysis  in  an  Authoritative 
Choice  Account 

The  dominant  account  of  policy  is  that  it  is  an 
exercise  in  authoritative  choice:  It  is  whatever  gov¬ 
ernment  decides  to  do  or  not  to  do.  This  focuses 
on  the  episodes  of  attention  in  governing  that  give 
rise  to  rituals  of  choice — making  decisions,  autho¬ 
rizing  programs,  allocating  resources,  and  so  on — 
and  underpins  the  perception  of  policy  analysis  as 
a  technology  of  intelligent  choice:  gathering  data 
on  the  problem,  identifying  alternative  ways  of 
responding  to  it,  calculating  the  costs  and  benefits 
of  these  alternatives,  systematically  comparing 
options,  and  advising  the  decision  makers  of  this 
information  to  enable  them  to  make  the  optimal 
choice.  Once  the  choice  has  been  made,  the  same 
technology  can  be  applied  to  assess  whether  the 
action  solved  the  problem — evaluation. 

In  general,  policy  analysis  is  seen  as  an  activity 
that  precedes  the  choice  (or  that,  in  the  case  of 
evaluation,  follows  closely  in  its  footsteps).  Its 
knowledge  base  is  the  technology  of  selection,  and 
its  focus  is  on  the  decision  process,  not  on  the  func¬ 
tional  knowledge  of  the  operational  staff  that  tends 
to  be  relegated  to  a  secondary  place:  The  func¬ 
tional  experts  may  propose,  but  the  policy  analyst 
is  needed  to  evaluate  their  proposals,  drawing  on 
skills  of  selection  and  choice.  But  the  separation  of 
the  technology  of  choice  from  operational  knowl¬ 
edge  can  never  be  complete,  and  the  demand  for 
evidence-based  policy  links  the  field  knowledge  of 
the  functional  expert  to  the  choice  expertise  of  the 
specialist  policy  analyst,  imposing  on  the  func¬ 
tional  specialist  a  requirement  to  demonstrate  the 
instrumental  efficacy  of  the  proposed  course  of 
action  and  on  the  analyst  a  demand  to  identify  the 
optimal  course  of  action.  The  complexity  of  gov¬ 
ernment  also  gives  rise  to  a  supporting  role  in 
policy  analysis  for  the  “process  manager,”  the 
expert  in  procedure  who  can  identify  the  steps  that 
need  to  be  taken  in  order  to  translate  a  decision  to 
act  into  a  correctly  formulated  and  approved  act  of 
state. 

Policy  Analysis  in  a  Structured 
Interaction  Account 

The  authoritative  choice  account  sees  the  task 
for  analysis  as  enabling  the  policymakers  (unspeci¬ 
fied)  to  make  the  optimal  informed  decision.  The 
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structured  interaction  account,  though,  focuses  on 
the  variety  of  participants,  all  of  whom  contribute 
to  the  process,  but  few  can  bring  it  to  a  close  with  a 
“decision.”  These  participants  are  likely  to  have 
distinct  perceptions  of  the  nature  of  the  problem 
being  addressed,  what  could  be  done  about  it,  and 
how  important  it  is.  In  this  account,  the  task  for 
analysis  is  to  identify  the  participants  and  their 
stances,  discover  the  extent  of  shared  meanings  and 
understandings  and  divergent  and  conflicting  ones, 
and  explore  the  discourses,  practices,  and  locations 
through  which  mutually  acceptable  outcomes  could 
be  pursued.  It  is  more  like  diplomacy  than  labora¬ 
tory  science,  seeking  to  discover  a  basis  for 
concerted  action  among  participants  who  are  not 
necessarily  seeking  to  solve  the  same  problem.  It  is 
concerned  to  facilitate  consultation  and  negotiation, 
and  here  it  overlaps  with  the  process  work  of  inter- 
organizational  communication,  which  has  in  many 
cases  become  a  cottage  industry  in  its  own  right, 
with  consulting  firms  becoming  adept  at  organizing 
“public  consultation”  exercises  in  which  policy  con¬ 
cerns  or  (more  commonly)  proposals  are  subject  to 
structured  discussion  by  nonofficials. 

Here,  we  can  see  that  the  cognitive  and  the 
social  element  of  policy  analysis  are  closely  related 
to  one  another:  Sharing  an  understanding  of  the 
policy  problem  makes  it  easier  to  work  together 
toward  an  agreed  outcome,  and  people  who  are 
accustomed  to  working  together  are  likely  to 
develop  shared  understandings.  In  the  1970s,  sci¬ 
entific  concern  about  the  climatic  implications  of 
the  emerging  hole  in  the  earth’s  ozone  layer  led, 
extremely  quickly,  to  international  agreements  to 
prohibit  the  use  of  chlorofluorocarbons  in  refrig¬ 
erators.  Peter  Haas  argued  that  this  remarkable 
piece  of  policy  development  was  due  to  the  policy 
activity  of  an  international  network  of  scientists 
who  shared  an  understanding  of  the  problem  and 
were  all  able  to  convince  their  home  governments 
to  act  in  concert  with  the  rest  of  the  world;  he 
called  this  an  “epistemic  community”  (from  the 
Greek  episteme — “knowledge”):  The  knowledge 
defined  the  community.  But  we  can  also  see  that  in 
any  policy  area,  there  will  be  a  range  of  partici¬ 
pants  who  are  all  interested  in  the  policy  area  but 
from  diverse  perspectives  and  knowledge  bases.  In 
agriculture  policy,  for  instance,  there  will  be  agri¬ 
culture  bureaucrats,  farmers’  organizations,  unions 
of  agricultural  workers,  transporters,  marketers, 


journalists,  academics,  and  so  on.  Groupings  such 
as  this  are  not  epistemic  communities  in  Haas’s 
terms,  because  they  do  not  start  from  a  common 
body  of  knowledge,  but  the  more  they  work 
together  over  time,  the  more  they  develop  shared 
understandings  about  who  should  be  involved  in 
the  policy  field,  how  to  talk  about  it,  and  what 
sorts  of  action  can  be  contemplated,  and  as  this 
happens,  they  come  to  form  a  “policy  community” 
of  participants  who  recognize  one  another’s  legiti¬ 
mate  expectation  of  a  place  at  the  table  and  who 
share  an  interest  in  securing  a  mutually  acceptable 
outcome  and  a  form  of  discourse  that  is  likely  to 
lead  to  this.  In  cases  such  as  this,  the  forces  for 
social  cohesion  lead  to  the  development  of  modes 
of  discourse  that  in  turn  strengthen  the  cohesion. 

The  recognition  of  the  importance  in  the  policy 
process  of  shared  understandings  has  sparked  a 
distinct  school  of  interpretive  or  discursive  policy 
analysis,  which  argues  that  much  of  the  work  of 
policy  analysis  is  concerned  with  the  negotiation  of 
meaning,  often  in  contested  and  ambiguous  situa¬ 
tions,  and  focuses  attention  on  the  way  that  mean¬ 
ing  is  constructed  and  communicated  in  policy 
work.  Studies  of  relatively  low-level  workers  (e.g., 
town  planners,  police,  or  social  workers)  find  them 
engaged  in  constructing  the  links  and  negotiating 
the  meaning  through  which  governing  is  accom¬ 
plished  and  argue  that  this  communication  and 
meaning  construction  is  an  integral  part  of  policy 
making.  Beryl  Radin  found  that  while  U.S.  policy 
analysts  tend  to  define  their  role  as  “advising  the 
Secretary,”  they  often  spend  most  of  their  time 
negotiating  with  policy  analysts  from  other  agen¬ 
cies,  with  the  policy  analysis  techniques  they 
learned  in  graduate  school  being  deployed  as 
“dueling  swords”  in  these  encounters,  and  she 
argues  that  policy  analysis  has  moved  from  an 
ambition  to  “speak  truth  to  power”  to  developing 
the  capacity  to  speak  truths  to  multiple  powers. 

Policy  Analysis  in  a  Social  Construction  Account 

Policy  cannot  be  reduced  to  decisions  by  “the 
authorities”  (authoritative  choice)  or  deals  between 
stakeholders  (structured  interaction)  but  rests  on 
shared  perceptions  of  what  is  normal  and  what 
demands  action,  who  can  speak  with  authority, 
what  sorts  of  action  are  appropriate,  and  who  can 
be  held  responsible — that  is,  it  rests  on  a  bed  of 
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socially  constructed  problematization.  The  devel¬ 
opment  of  policy  on  climate  change,  for  instance, 
has  not  been  a  matter  of  governments  simply  rec¬ 
ognizing  a  problem  but  of  an  extended  process  of 
argumentation  among  scientists,  activists,  officials, 
industrialists,  and  commentators  about  how  to 
interpret  phenomena  and  to  relate  them  to  pat¬ 
terns  of  governing.  It  has  been  a  continuing  and 
contested  process  across  constitutional  and 
national  boundaries,  a  pattern  of  meaning  con¬ 
struction  to  which  government  players  could  con¬ 
tribute  but  that  they  cannot  control.  Indeed,  when 
the  former  U.S.  Vice  President  A1  Gore  won  the 
Nobel  Peace  Prize  for  his  efforts  in  raising  the  pub¬ 
lic  consciousness  on  climate  change,  there  was  a 
demand  for  him  to  return  to  politics  as  a  candidate 
for  the  presidency,  but  he  resisted  this,  and  it  was 
suggested  that  he  had  come  to  realize  that  even  the 
president  could  not  change  U.S.  policy  on  climate 
change  until  public  opinion  had  come  to  accept 
both  the  policy  problem  and  the  implications  for 
corporate  and  individual  practice. 

In  this  context,  then,  policy  analysis  (both  of 
and  for  policy)  has  to  do  with  the  construction  and 
maintenance  of  the  problematic.  Analysis  for  is 
concerned  with  the  development  of  an  account  of 
practice  that  focuses  attention  on  a  problem,  vali¬ 
dates  ways  of  thinking  about  it  and  appropriate 
responses,  and  allocates  responsibility  for  action. 
In  this  respect,  the  work  of  policy  analysis  is  more 
like  that  of  a  lawyer  than  a  laboratory  scientist:  It 
is  finding  good  reasons  for  doing  things.  When 
Tony  Blair  was  prime  minister  of  the  UK,  there 
was  in  his  press  office  an  official  called  the  Head 
of  Story  Development,  whose  task  was  to  ensure 
that  what  was  being  done  in  government  made  “a 
good  story.” 

But  to  say  that  policy  analysis  is  concerned  with 
framing  the  action  suggests  that  there  may  be  more 
than  one  way  of  doing  this  and  that  the  analysis 
will  have  to  contend  with  competing  frameworks 
of  meaning.  Policy  workers  in  health,  for  instance, 
recognizing  that  the  available  resources — hospitals 
and  doctors — are  preempted  by  the  ever-expand¬ 
ing  claims  of  “illness  response”  at  the  expense  of 
activities  that  would  contribute  more  to  health,  try 
to  strengthen  the  discourses  on  health  maintenance 
and  self-care  (rather  than  illness  treatment)  and  the 
position  of  the  participants  who  engage  in  these 
discourses.  Policy  workers  closely  analyze  the 


wording  in  official  discourse  and  try  to  make  sure 
that  the  words  used  will  bear  the  interpretation 
that  they  want  to  put  on  them  and,  if  not,  will  seek 
to  replace  them  with  words  that  are  more  conve¬ 
nient.  And  the  framing  is  not  only  done  with 
words:  Titles,  uniforms,  and  the  architecture  of 
buildings  can  be  powerful  symbols  that  convey  the 
meaning  of  policy. 

So  from  a  social  construction  perspective,  anal¬ 
ysis  for  policy  is  not  a  technical  exercise  conducted 
in  advance  but  part  of  the  continuing  strategic 
judgment  of  the  participants,  as  they  frame  the 
world  (and  react  to  the  framings  of  others)  in  a 
way  that  makes  sense  both  to  themselves  and  to 
other  participants.  This  means  that  they  must  be 
sensitive  to  the  different  framings  that  are  in  use  at 
any  point  and  make  the  appropriate  responses. 
Tim  Tenbensel,  drawing  on  Aristotle,  argues  that 
within  the  one  policy  context,  there  are  likely  to  be 
discourses  grounded  in  knowledge  from  study 
(Aristotle’s  episteme ),  in  knowledge  from  practice 
( techne ),  and  in  ethical-practical  judgment  (pbro- 
nesis)  and  shows  that  an  important  skill  in  policy 
work  is  the  ability  to  analyze  the  policy  setting, 
discern  the  nature  of  the  discourse  in  use,  and 
engage  in  it  with  the  appropriate  language.  This 
brings  us  back  to  a  recurring  theme  in  the  discus¬ 
sion  of  analysis  for  policy:  Is  the  task  to  do  the 
analysis  or  to  bring  it  into  use  in  the  shaping  of 
policy  outcomes? 

Conclusion 

Policy  analysis  is  a  concept  in  practice,  and  what  it 
means  reflects  the  way  in  which  it  is  used.  The  intel¬ 
lectual  foundations  of  the  concept  lay  not  only  in 
political  science  but  also  in  economics  and  opera¬ 
tions  research,  and  as  it  developed  as  a  conceptual 
field,  concerns  about  organizational  dynamics, 
problematization,  and  the  construction  of  meaning 
were  added.  As  a  component  of  practice,  it  is  used 
to  describe  a  wide  variety  of  activities.  The  question 
is  not  so  much  “What  does  policy  analysis  mean?” 
as  “How  is  the  term  policy  analysis  used  to  charac¬ 
terize  and  validate  activity?” 

Initially,  it  was  seen  as  a  distinct  and  prior  stage 
in  the  process  of  governing,  a  form  of  intellectual 
activity  that  would  precede  the  exercise  of  author¬ 
ity.  This  presentation  was  part  of  a  modernist 
account  of  government  as  a  form  of  systematic 
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collective  instrumentality,  in  which  the  govern¬ 
ment  identified  problems  and  decided  what  to  do 
about  them.  The  argument  ran  that  by  hitching  the 
investigative  techniques  of  social  science  to  the 
operational  capacity  of  government,  governments 
would  be  better  informed  about  the  nature  of  the 
problem  and  the  likely  outcomes  of  alternative 
responses  open  to  government.  Its  advocates  saw  it 
as  a  source  of  scientific  and  unbiased  answers  to 
contested  questions  in  government. 

In  this  respect,  the  experience  was  somewhat 
disappointing.  Formal  modes  of  the  identification 
and  comparison  of  alternative  courses  of  action 
were  elaborated,  taught  in  graduate  programs,  and 
reported  in  the  literature  (e.g.,  “Optimal  Compen¬ 
sating  Wages  for  Military  Personnel”  or  “The 
Impact  of  State  Tax  Credits  in  Stimulating  R&D”). 
But  often,  those  doing  the  analysis  felt  that  it  had 
little  impact  on  the  decision  process  and  that  some¬ 
times  the  decision  had  been  made  before  the  analy¬ 
sis  was  called  for.  One  response  sees  this  as  a 
“two-cultures”  problem  and  calls  for  policy  ana¬ 
lysts  to  be  more  “user-friendly,”  less  concerned 
with  adding  to  the  systematic  body  of  academic 
knowledge,  and  more  concerned  with  the  quick 
delivery  of  answers  to  urgent  policy  questions, 
even  if  this  means  ignoring  the  methodological 
canons  of  good  social  science:  Analysis  that  is 
“quick  and  dirty”  might  be  what  is  needed  in 
policy.  This  response  also  calls  for  policy  analysts 
to  pay  more  attention  to  the  communication  of 
their  findings,  writing  shorter  and  more  accessible 
texts  that  focus  on  the  immediate  implications  for 
policy.  This  raises  the  question  of  the  extent  to 
which  policy  analysts  should  involve  themselves  in 
“marketing”  their  analysis  to  increase  the  likeli¬ 
hood  of  it  being  used  in  the  policy  process.  Should 
they  be  advocates  for  their  own  findings?  Should 
they,  in  carrying  out  their  analysis,  cultivate  rela¬ 
tionships  that  are  likely  to  strengthen  the  chances 
of  its  acceptance  or  to  neutralize  potential  opposi¬ 
tion?  In  other  words,  are  they  to  be  detached  sci¬ 
entists  or  active  participants  in  the  policy  process? 

This  has  not  been  a  topic  for  abstract  debate  but 
has  reflected  the  way  in  which  policy  analysis 
became  institutionalized  in  and  around  govern¬ 
ment,  particularly  in  the  United  States.  Radin,  who 
has  been  both  a  participant  in  and  an  observer  of 
this  process,  found  that  policy  workers  tended  to 
become  specialists  in  particular  policy  fields  rather 


than  general-purpose  analysts  and  that  while  they 
have  been  trained  in  the  systematic  comparison  of 
policy  alternatives  and  see  this  as  the  real  work  of 
policy  analysis,  they  find  that  they  are  likely  to  be 
spending  more  of  their  time  talking  not  to  policy¬ 
makers  looking  for  guidance  but  to  policy  analysts 
in  other  organizations,  each  primarily  concerned 
with  protecting  the  interests  of  his  or  her  own  orga¬ 
nization.  She  argues  that  in  practice,  policy  analysis 
has  moved  from  being  a  purportedly  objective  judg¬ 
ment  that  could  resolve  policy  arguments  to  being 
the  vehicle  through  which  they  could  be  continued. 
It  has  become  a  mode  of  discourse  mediating  rela¬ 
tions  between  the  various  participants  in  the  policy 
process — not  simply  between  policy  analysts  from 
different  agencies  but  also  between  elected  leaders, 
officials,  and  activists.  Officials  call  for  analysis  to 
question  and  delegitimize  the  projects  of  elected 
leaders,  leaders  commission  analysis  to  defend  these 
projects  against  bureaucratic  scrutiny,  and  activists 
table  analysis  to  support  their  demands  and  to 
establish  their  credibility  as  serious  players  in  the 
policy  process.  Policy  participants  were  interested 
in  policy  analysis  not  because  it  would  deliver  the 
self-evidently  correct  answer  to  a  policy  problem 
but  because  it  could  facilitate  interaction  by  giving 
the  contending  participants  a  shared  language. 

This  evident  development  in  policy  practice 
(analysis  for)  has  been  paralleled  by  a  growing 
interest  in  interpretive  modes  of  policy  analysis 
(analysis  of),  which  focus  attention  on  the  con¬ 
struction  of  meaning  and  the  way  it  is  related  to 
patterns  of  interaction.  The  relationship  is  (as 
Anthony  Giddens  puts  it)  “recursive”:  The  accepted 
construction  of  meaning  indicates  who  the  appro¬ 
priate  participants  are  and  how  they  should  act  and 
interact,  and  who  participates  and  how  they  inter¬ 
act  determines  the  meaning  of  the  question. 
Interpretive  policy  analysis  directs  attention  to  the 
ways  in  which  meaning  is  constructed  from  the 
competing  interpretations  (story  lines)  available: 
What  is  the  policy  about?  Is  policy  on  child  care, 
for  instance,  about  the  reconstruction  of  gender 
relations,  increasing  workforce  participation,  pro¬ 
moting  socialization,  or  the  education  of  the 
young?  These  are  all  valid  policy  aspirations,  but 
how  significant  they  are  in  any  specific  context  is 
likely  to  depend  on  which  participants  are  involved, 
and  the  various  participants  are  all  concerned  with 
ensuring  that  their  perception  of  the  problem  is 
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reflected  in  the  policy  statement.  Interpretive  policy 
analysis  draws  attention  to  the  way  in  which  par¬ 
ticipation  and  meaning  interact  with  one  another. 

At  the  same  time  that  analysis  of  policy  has 
been  broadening  its  gaze  and  recognizing  the 
ambiguity  of  purpose  in  policy,  there  has  been 
among  some  practitioners  a  strong  move  in  the 
other  direction,  back  in  search  of  more  clarity  and 
a  policy  analysis  that  will  identify  the  right  policy 
to  choose — in  a  sense,  returning  to  the  spirit  of 
Lasswell’s  1951  call  for  a  policy  science  and  the 
American  graduate  programs  that  it  inspired.  This 
perspective  on  the  function  of  analysis  has  been 
boosted  by  the  application  of  principal-agent  the¬ 
ory  in  government,  which  underlies  the  enthusiasm 
for  purchaser-provider  splits,  the  contracting  out 
of  functions,  and  quasi-market  relations  between 
the  center  and  the  operational  arms  of  govern¬ 
ment.  In  this  context,  the  function  of  policy  analy¬ 
sis  was  to  clarify  what  it  was  that  the  center  should 
be  seeking  and  to  verify  whether  the  operational 
arms  were  delivering  it. 

By  the  turn  of  the  century,  this  model  of  arm’s- 
length  trading  was  being  supplanted  by  discourses 
of  partnership  and  governance,  but  there  was  a 
growing  demand  for  evidence-based  policy,  which 
also  looked  to  policy  analysis  to  reduce  uncer¬ 
tainty  in  the  policy  process.  This  followed  pressure 
in  the  health  sector  for  evidence-based  medicine, 
stemming  from  a  realization  that  while  systematic 
comparison  of  alternatives  had  played  a  large  part 
in  the  advance  of  medical  science,  there  was  little 
comparative  evaluation  of  the  standard  operating 
procedures  of  medical  practice.  This  concern 
resulted  in  the  formation  of  the  Cochrane  Collab¬ 
oration  as  a  forum  for  scrutinizing  the  evidence  on 
the  efficacy  of  different  forms  of  treatment,  with 
the  randomized  controlled  trial  as  the  gold  stan¬ 
dard  of  analytic  methods.  This  movement  was 
paralleled  in  policy  analysis  by  the  establishment 
of  the  Campbell  Collaboration,  which  aimed  to 
establish  definitively  what  works  in  policy,  and 
this  gave  new  life  to  traditional  American  policy 
analysis,  with  its  focus  on  the  definition  of  the 
problem,  the  identification  of  options,  and  the  sys¬ 
tematic  comparison  of  alternatives. 

So  policy  analysis  is  characterized  by  two  major 
tensions.  One  is  the  tension  between  scholarship 
and  practice:  To  what  extent  is  policy  analysis  the 
generation  of  scientific  knowledge,  and  to  what 


extent  is  it  a  contribution  to  governmental  prac¬ 
tice?  The  second  tension  is  between  the  two  diver¬ 
gent  paradigms  of  policy  on  which  it  draws.  One 
is  the  paradigm  of  instrumental  rationality  from 
which  policy  analysis  emerged,  which  portrays  a 
world  in  which  the  problem  is  clear  and  there  is  a 
single  decision  maker,  a  known  hierarchy  of  values 
and  objectives,  and  certainty  of  outcomes.  In  this 
world,  the  role  of  the  policy  analyst  is  to  calculate 
which  course  of  action  will  best  achieve  the  values 
of  the  policymaker.  This  is  countered  by  an  alter¬ 
native  paradigm  that  sees  policy  as  emerging  from 
a  world  characterized  by  diversity:  multiple  par¬ 
ticipants,  conflicting  agendas,  dispute  over  values 
and  objectives,  and  uncertainty  about  the  out¬ 
comes  that  might  be  achieved  or  how  these  would 
be  evaluated.  In  this  paradigm,  policy  analysis  is  a 
part  of  the  game,  mobilized  by  the  participants  as 
part  of  the  continuing  struggle  for  support.  These 
two  paradigms  are  not  entirely  separate;  rather, 
one  tends  to  assume  the  other.  There  is  an  aware¬ 
ness  of  diversity  and  also  a  yearning  for  cohesion. 
Policy  participants  recognize  the  multiple  voices  in 
the  process  of  governing  but  at  the  same  time  find 
it  difficult  not  to  use  a  single-voiced  narrative  in 
which  an  actor  called  the  government  makes  deci¬ 
sions  that  resolve  conflicts  by  the  creation  of  pol¬ 
icy.  The  question  for  participants  is  to  what  extent 
policy  analysis  is  seen  as  a  vehicle  for  accomplish¬ 
ing  the  purposes  of  the  government  and  to  what 
extent  it  operates  as  a  device  for  facilitating  the 
achievement  of  some  degree  of  concerted  action 
among  the  diverse  and  fractious  players  in  the 
game,  with  statements  of  objectives  (or  about  the 
problem  itself)  being  means  for  the  achievement  of 
collaboration  rather  than  evidence  of  its  existence. 
And  for  the  academic  observers,  the  question  is 
how  much  they  take  policy  analysis  on  its  own 
terms,  as  the  careful  documentation  of  an  exercise 
in  instrumental  rationality,  and  how  much  they  see 
it  as  a  part  of  the  process  that  they  are  attempting 
to  analyze. 
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In  current  conversational  use,  policy  community 
refers  to  the  population  of  organizations  with  a 
stake  in  an  area  of  public  policy.  In  its  more 
technical  sense,  it  refers  to  interorganizational 
structures  exhibiting  a  close,  stable,  cooperative 
relationship  between  a  limited  number  of,  mainly 
self-selected,  interest  groups  and  “partnering” 
elements  of  the  governmental  machinery.  This 
entry  describes  the  development  of  this  term  and 
also  discusses  the  numerous  critiques  that  have 
been  made  concerning  it. 

Developments 

The  term  policy  community  appeared  in  the  litera¬ 
ture  independently  in  several  sources  in  the  1970s 
(e.g.,  Jeremy  John  Richardson  &  A.  Grant  Jordan, 
1979),  and  this  timing  reflected  the  shift  in  focus 
in  political  science  away  from  formal  legal  and 


legislature-based  study  to  tracking  empirical  policy 
biography.  Empirical  research  found  that  the  effec¬ 
tive  locus  of  decision  making  was  not  the  legisla¬ 
ture,  cabinets,  or  politician-led  committees  but 
group/bureaucrat  arenas.  The  term  recorded  the 
conclusion  that  much  policy  evolution  was  not  the 
result  of  ideological  struggle  between  conflicting 
political  parties  with  distinctive  ideologies  and 
agendas  but  of  apolitical  (or  at  least  nonpartisan) 
discussion  and  information  exchange. 

It  was  thus  one  of  a  family  of  concepts  captur¬ 
ing  segmented  policy-making  features — for  exam¬ 
ple,  group  subgovernment,  corporate  pluralism, 
and  iron  triangles.  The  characteristics  of  a  generic 
subsystem  approach  include  the  following:  the 
expectation  of  bargaining  in  stable  (sectoral)  envi¬ 
ronments,  the  evolution  of  stable  coalitions,  the 
low  visibility  of  decisions,  well-defined  and  uncon¬ 
tested  policy  jurisdiction,  the  narrow  and  low 
scope  of  conflict,  a  small  number  of  participants, 
and  some  restriction  of  access  to  the  process. 

In  real-life  settings,  policy  communities  are  not 
black-and-white  arrangements  with  gatekeepers 
and  badges  of  admission.  The  policy  community 
was  not  intended  as  a  one-size-fits-all  policy  expla¬ 
nation.  What  it  intended  to  capture  was  a  sense  of 
surprise  (reflecting  a  wave  of  case  study  research) 
that  policy  making  was  to  a  substantial  degree  the 
result  of  non-legislature-based  bargaining. 

Whereas  in  party  and  parliamentary  venues  pol¬ 
icy  differences  are  exaggerated  in  adversarial  fash¬ 
ion  for  political  advantage,  in  policy  subsystems, 
there  is  a  premium  on  minimizing  differences  and 
underplaying  the  apparent  significance  of  the  out¬ 
come.  Participants  resolve  issues  within  the  network 
in  the  belief  that  it  is  counterproductive  to  highlight 
grievances  (attracting  other  competing  perspec¬ 
tives).  The  suppression  of  competing  interests  is 
reinforced  by  presenting  issues  as  humdrum  or  tech¬ 
nical.  Trumpeting  the  importance  of  one’s  concerns 
might  just  attract  attention.  So  it  would  be  difficult 
to  get  an  impression  of  the  number  and  importance 
of  these  arrangements  from,  say,  a  sampling  of  press 
articles  or  ministerial  speeches.  The  policy  commu¬ 
nity  premium  is  in  avoiding  such  attention. 

The  Main  Critiques 

The  concept  has  found  little  empirical  application 
and  instead  has  generated  debate  about  what  it 
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implies  rather  than  demonstrations  of  its  useful¬ 
ness  or  otherwise. 

Its  original  usage  in  the  United  States  and  the 
United  Kingdom  (UK)  was  remarkably  congruent. 
In  the  UK,  Richardson  and  Jordan’s  Governing 
Under  Pressure  sought  to  reorient  attention  in 
policy-making  studies  toward  a 

special  type  of  stable  network  which  has 
advantages  in  encouraging  bargaining  in  policy 
resolution  .  .  .  where  there  are  effective  shared 
“community”  views  on  the  problem.  Where 
there  are  no  such  shared  attitudes  no  policy 
community  exists.  (Jordan,  1990,  p.  327) 

However,  while  the  policy  community  was  seen  as 
being  a  set  of  participants  with  shared  understand¬ 
ings,  this  did  not  mean  a  lack  of  conflict: 
Community  deliberately  implied  a  sort  of  “circum¬ 
scribed  conflict,”  not  its  absence. 

Its  usage  in  the  UK  stood  as  an  implicit  rejection 
of  Arendt  Lijphart’s  majoritarianism  that  sug¬ 
gested  that  British  policy  making  was  top-down — 
using  stable  parliamentary  majorities  to  drive 
through  manifesto-based  policy.  It  suggested 
instead  that  policy  is  normally  built  up  through 
agreement  in  constituencies  of  affected  interests.  A 
theoretical  capacity  for  policy  imposition  is  actu¬ 
ally  marginalized  by  the  political  benefits  of  build¬ 
ing  consensus  and  implementing  them  through 
cooperation. 

The  general  assumption  in  these  sources  is  that 
much  policy  making  is  in  effect  subcontracted  to 
informal  institutions  of  the  relevant  civil  servants 
and  affected  interests.  For  example,  Randall  Ripley 
and  Grace  Franklin,  writing  about  the  United 
States,  observed  that  most  of  the  policy  making 
with  which  subgovernments  engage  consists  of 
routine  matters  resolved  in  fairly  restricted  and 
consensual  settings.  Thus,  an  assumption  under¬ 
pinning  these  policy  resolution  “institutions”  is 
that  the  breadth  and  complexity  of  the  government 
agenda  require  the  disaggregation  of  policy  into 
more  manageable  subissues  involving  a  smaller 
number  of  relevant  and  knowledgeable  partici¬ 
pants.  The  fragmentation  of  policy  takes  decisions 
to  a  level  of  particularity  where  few  are  interested 
enough  to  mobilize  or  have  the  time  or  resources 
to  become  involved.  Those  with  an  interest  look 
for  acceptable  outcomes. 


As  suggested  by  Keith  Dowding,  the  original 
concept  was  no  more  than  metaphorical,  useful  in 
signaling  as  important  a  particular  set  of  (nonparty 
political)  participants  and  suggesting  a  commu- 
nity-like  quality  to  these  relationships.  Others  have 
been  more  ambitious  for  the  concept — but  mainly 
those  exaggerating  its  ambitions  to  prove  a  weak¬ 
ness.  Critics  have  generated  a  caricaturing  process. 
Martin  Smith  criticized  as  being  “too  vague” 
Richardson  and  Jordan’s  assertion  that  policy 
communities  developed  in  response  to  complexity 
in  the  wider  political  system.  Smith’s  claim  fails  to 
distinguish  between  a  definition  that  is  vague  and 
a  reasonably  precise  definition  of  uncertainty. 

The  attention  by  politicians  to  a  handful  of 
high-profile  areas  (not  necessarily  the  most  vital) 
implies  that  they  have  to  neglect  the  vast  majority 
of  the  policy  mix.  It  is  argued  in  the  policy  com¬ 
munity  literature  that  many  (but  by  no  means  all) 
of  the  numerous  remaining  areas  fall  to  policy 
community-type  resolution.  There  is,  however,  a 
preliminary  assumption  that  it  makes  sense  for 
participants  to  try  to  deal  with  matters  in  the 
milieu  of  consultation  rather  than  electoral  poli¬ 
tics.  It  is  not  a  deficiency  of  a  model  of  routine 
decision  making  that  it  does  not  very  well  accom¬ 
modate  nonroutine  decisions. 

Another  criticism  of  the  policy  community 
model  is  that  it  fails  to  account  for  change.  For 
example,  Wyn  Grant  (2005)  cites  Richardson  to 
the  effect  that  the  main  weakness  of  the  concept  is 
the  implication  of  stable  policies,  relationships, 
and  membership.  This  does  not  seem  a  particularly 
strong  criticism — akin  to  complaining  that  a  map 
of  Washington  is  a  poor  guide  to  Montreal:  The 
policy  community  sketch  was  not  meant  to  account 
for  change  but  to  explain  enduring  stability. 

Frank  Baumgartner  and  Bryan  Jones  (1993,  p.  6) 
used  the  term  policy  monopoly  to  label  informal 
policy  structures  very  similar  to  policy  communities. 
Their  term  referred  to  the  “monopoly  on  political 
understandings,”  that  is,  the  ability  of  certain 
groups  to  maintain  a  dominant  policy  framing.  The 
policy  community  is  in  essence  the  venue  or  institu¬ 
tional  arrangement  that  reflects  and  reinforces  the 
dominant  understandings  of  which  Baumgartner 
and  Jones  wrote  in  their  work  on  agenda.  They  sug¬ 
gested  that  policy  making  is  often  unlike  the  plural¬ 
ist  idea  of  mobilization  and  countermobilization, 
and  instead  monopoly/community  arrangements 
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are  seen  primarily  as  instruments  of  mutual  nonin¬ 
terference  in  the  “absence  of  serious  opposing 
voices”  (p.  14). 

Frank  Baumgartner  and  Beth  Leech  supplied  a 
sense  of  scale  about  what  they  call  “policy  band¬ 
wagons” — political  issues  generating  widespread 
group  interest.  These  they  proved  to  be  atypical. 
They  identified  a  random  sample  of  137  cases  of 
lobbying  and  looked  at  high-  and  low-conflict  pro¬ 
portions.  They  found  extreme  polarization,  with 
the  top  four  issues  accounting  for  more  than  a 
third  of  all  interest-group  activity.  Twenty-six 
issues  out  of  137  captured  81%  of  the  lobbying. 
The  reverse  is  of  course  that  111  cases  reflected 
19%.  This  confirms  two  things:  (1)  that  not  all 
lobbying  events  are  the  same  and  (2)  that  many  of 
them  are  low  profile,  with  few  people  participat¬ 
ing.  It  would  be  a  leap  to  say  that  all  the  cases  with 
a  small  number  of  participants  were  examples  of 
policy  communities,  but  the  data  at  least  underline 
that  much  political  processing  is  low-participant 
activity. 

As  argued  by  Grant  Jordan  and  William 
Maloney,  the  arrangements  exist  because  there  is  a 
“logic”  to  building  cooperation  about  delivery  by 
preconsulting  relevant  interests.  Organizations 
trade  or  exchange  information/advice  to  secure 
access  to,  and  influence  within,  decision  procedures. 
The  logic  of  this  relationship  holds,  regardless  of 
the  political  persuasion  controlling  the  govern¬ 
ment.  Therefore,  radical  policy  shift  is  rare  and 
does  not  routinely  follow  a  change  of  government. 

Critics  tend  to  define  the  policy  community  con¬ 
cept  in  a  comprehensive,  rigid  way  that  can  be 
demonstrated  as  not  fitting  real-life  cases.  In  fact, 
the  articulation  may  have  itself  overencouraged 
ideas  about  institutional  formality.  In  real-life  pol¬ 
icy  development,  participants  are  not  clearly  identi¬ 
fied.  It  is  conspicuous  and  significant  that  propolicy 
community  authors  fail  to  give  definitive  lists  of 
their  examples.  This  may  not  be  because  the  con¬ 
cept  is  vague  but  because  it  tries  to  capture  a  phe¬ 
nomenon  that  lacks  formality  and  certainty  about 
a  list  of  participants.  Policy  community  “struc¬ 
tures”  may  be  less  important  than  the  “logic  of 
accommodation”  procedures  that  they  exhibit  but 
do  not  monopolize.  Even  where  the  policy-making 
arrangements  do  not  make  a  policy  community 
description  appropriate,  the  process  “borrows” 
some  of  the  conflict  reduction  features  because 
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Policy  cycle  refers  to  the  process  whereby  political 
actors  attempt  to  shape  the  definition  of  problems, 
the  setting  of  a  policy  agenda,  the  formulation  of 
policy  alternatives,  the  adoption  and  implementa¬ 
tion  of  policy  decisions,  and  the  evaluation  of  pol¬ 
icy  outcomes.  This  perspective  is  underpinned  by  a 
linear,  problem-oriented,  rationalist,  and  systemic 
view  of  the  policy  process.  In  the  conventional  con¬ 
ception  of  the  policy  cycle,  policy  problems  are 
identified  by  policy  advocates  and  other  political 
actors  and  then  put  onto  the  policy  agenda;  politi¬ 
cians  and  officials  then  develop  policy  measures 
to  address  these  problems  and  adopt  the  best  pos¬ 
sible  alternative  to  tackle  the  problems  after 
considering  the  likely  impact  of  these  alternatives; 
and  evaluation  would  be  done  after  the  policy  is 
implemented,  and  the  responses  and  reactions  to 
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such  evaluation  would  be  fed  back  into  the  policy 
process. 

Policy  Process 

The  emphasis  of  the  policy  cycle  perspective  is  on 
the  process  of  policy  actions  and  interactions 
involving  many  different  political  and  bureaucratic 
players.  Hence,  it  has  gone  beyond  the  study  of 
formal  political  institutions  or  the  input  side  of 
politics,  such  as  interest-group  activities  and  polit¬ 
ical  participation,  which  were  once  the  main  con¬ 
cerns  of  political  science.  The  conception  of  the 
policy  cycle  could  be  traced  to  Harold  Lasswell’s 
seven  prescriptive  steps  in  policy  making,  namely, 
(1)  intelligence,  (2)  promotion,  (3)  prescription, 
(4)  invocation,  (5)  application,  (6)  termination, 
and  (7)  appraisal.  Nonetheless,  his  prescriptive 
conception  focuses  mainly  on  policy  making  within 
government,  without  taking  into  account  the 
impact  of  other  political  actors  and  the  external 
environment.  Major  texts  published  in  the  1970s 
and  1980s  by  Garry  Brewer  and  Peter  DeLeon, 
Charles  O.  Jones,  and  James  E.  Anderson  have 
further  contributed  to  the  policy  cycle  framework. 
While  the  most  commonly  recognized  stages  of  the 
policy  cycle  include  problem  definition  and  agenda 
setting,  policy  making,  implementation,  and  evalu¬ 
ation,  other  formulations  have  proposed  some¬ 
what  different,  but  still  largely  similar,  stages,  such 
as  initiation,  estimation,  selection,  adoption,  legiti¬ 
mation,  implementation,  evaluation,  and  termina¬ 
tion.  The  terms  policy  cycle  and  policy  stage  are 
often  used  interchangeably  in  the  literature,  hence 
this  is  also  called  the  stages  approach  to  the  policy 
process.  The  cyclical  metaphor,  however,  high¬ 
lights  the  role  of  feedback  in  linking  the  input  and 
output  phases  of  policy  making,  as  in  David 
Easton’s  system  model,  and  hence  emphasizes  the 
continuous  evolution  of  the  policy  process. 

Criticisms  of  the  Policy  Cycle  Concept 

The  policy  cycle  perspective  is  perhaps  the  most 
widely  adopted  and  long-standing  framework  used 
in  organizing  major  texts  on  policy  making  and 
policy  analysis  because  it  simplifies  the  compli¬ 
cated,  amorphous  processes  into  distinct  and  easily 
identifiable  stages.  Its  contribution  to  the  conceptual 
organization  of  the  policy  process  should  not  be 


underestimated.  However,  despite  its  widespread 
usage,  it  has  come  under  criticism  from  various 
angles,  ranging  from  critique  of  its  empirical  rele¬ 
vance  to  its  promise  as  a  policy  theory.  First,  one 
key  criticism  is  that  the  policy  cycle  is  not  a  causal 
theory  or  model.  It  does  not  offer  an  explanation  of 
the  entire  policy  process,  even  though  the  perspec¬ 
tive  encompasses  the  most  prominent  aspects  of 
public  policy  making.  It  fails  to  clearly  identify  a  set 
of  key  variables  that  explains  the  process,  which  is 
considered  not  conducive  to  the  further  develop¬ 
ment  of  policy  theory.  Nor  does  it  specify  the  rela¬ 
tionship  between  these  variables  or  generate  a 
hypothesis  for  testing  its  claims.  In  the  eyes  of  the 
critics,  the  policy  cycle  at  best  provides  a  descrip¬ 
tive  account  and  an  analytical  framework  of  the 
policy  process  rather  than  a  theoretical  explanation 
of  how  the  policy  process  proceeds  in  the  way  sug¬ 
gested  by  its  proponents.  Hank  Jenkins-Smith  and 
Paul  Sabatier  prefer  to  call  it  the  “stages  heuristic,” 
namely,  just  a  learning  device. 

Second,  another  major  criticism  argues  that  the 
policy  cycle  presumes  a  linear,  unidirectional  con¬ 
ception  of  the  policy  process,  as  if  one  stage  will 
lead  to  another,  which  does  not  resemble  the  real 
political  world.  In  reality,  interactions  among  dif¬ 
ferent  stages  of  the  policy  process  often  take  place, 
and  the  interplay  of  politics  and  the  influence  of 
the  external  environment  are  critical  in  shaping  the 
twists  and  turns  of  the  policy  process.  Third,  the 
sequence  of  stages  in  the  policy  cycle  is  often  con¬ 
sidered  as  empirically  inaccurate  because  different 
phases  of  the  cycle  may  take  place  at  the  same  time 
and  policies  do  not  always  proceed  as  suggested  in 
the  framework.  For  instance,  the  evaluation  of 
existing  policies  affects  agenda  setting  and  policy 
making,  and  the  unintended  outcome  of  one  policy 
may  pose  a  problem  that  needs  to  be  addressed  by 
another  policy.  While  the  different  phases  or  stages 
of  the  policy  cycle  are  conceived  as  discrete  steps 
clearly  differentiated  from  each  other,  it  is  often 
not  easy  to  delineate  the  different  phases  neatly  in 
reality.  Policies  are  proposed  in  the  context  of 
other  existing  policies,  and  they  are  usually 
amended  or  revised  continuously. 

Fourth,  the  policy  cycle  takes  into  account  pol¬ 
icy  evaluation  but  not  policy  learning,  although 
many  studies  have  studied  how  policy  analysis  and 
policy  learning  can  be  integrated  into  the  policy 
process.  Fifth,  some  critics  further  argue  that  the 
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framework  may  incorrectly  highlight  the  policy 
cycle  as  the  unit  of  analysis.  Instead  of  having  only 
one  cycle,  many  different,  interacting  policy  cycles 
concerning  a  multitude  of  diverse  policy  measures 
initiated  at  different  governmental  levels  may  be 
proposed  at  the  same  time,  especially  in  big,  fed¬ 
eral  political  systems  such  as  the  United  States. 

Sixth,  a  top-down,  legalistic  bias  is  also  implicit 
in  the  policy  cycle  because  its  focus  is  on  the  adop¬ 
tion,  promulgation,  and  implementation  of  govern¬ 
ment  legislation  rather  than  the  interactions  between 
different  aspects  of  the  policy  process  or  the  politics 
between  governmental  and  societal  players.  Politics, 
whether  within  the  government  in  the  form  of 
bureaucratic  politics  or  in  the  community  at  large, 
however,  are  often  powerful  determinants  of  the 
dynamics  of  the  policy  process.  Such  a  deficiency 
was  later  explicitly  addressed  in  other  competing 
theoretical  frameworks  of  the  policy  process. 

Last  but  not  least,  the  policy  cycle  cannot  fully 
capture  the  multifaceted  political  dynamics  of  the 
policy  process,  which  is  more  critical  than  the 
rational  calculus  in  determining  policy  outputs  and 
outcomes.  The  seminal  contribution  on  agenda 
setting  by  John  Kingdon  (1984),  for  instance,  aptly 
emphasizes  the  importance  of  the  coupling  of  the 
three  streams,  namely,  the  problem,  the  policy, 
and  political  streams,  in  shaping  the  policy  pro¬ 
cess.  According  to  Kingdon,  the  problem  stream 
refers  to  the  perception  and  definition  of  problems 
as  worthy  of  further  attention  and  action  by  policy 
actors;  the  policy  stream  refers  to  the  ongoing 
policy  deliberation  among  the  policy  actors  and 
the  emergence  of  policy  proposals  within  the  pol¬ 
icy  community;  and  the  political  stream  refers  to 
the  major  developments  in  the  political  arena,  such 
as  the  changes  in  the  national  mood,  turnovers  in 
administration,  and  personnel  reshuffling.  The 
combination  of  these  three  forces  constitutes  a 
powerful  factor  in  opening  the  policy  window  that 
will  usher  in  policy  decisions  or  policy  change.  By 
emphasizing  the  interplay  of  political  as  well  as 
ideational  factors,  Kingdon’s  view  contrasts 
sharply  with  the  more  ideal-typical,  simplified  pro¬ 
cession  of  stages  in  the  policy  cycle  perspective. 

Contributions  to  the  Study  of  Public  Policy 

The  policy  cycle  framework  is  clearly  not  embedded 
in  a  casual  theory,  but  is  it  completely  inadequate  as 


an  important  scholarly  contribution  to  the  study  of 
public  policy  except  in  its  service  as  a  learning 
device?  On  balance,  the  framework  has  helped 
organize  research  by  highlighting  significant,  generic 
features  of  the  policy  process,  even  though  it  has 
not  postulated  a  policy  theory.  Indeed,  the  specifi¬ 
cation  of  the  key  stages  of  the  policy  process  in  this 
framework  is  both  a  strength  and  a  weakness.  If  a 
sound  theoretical  explanation  of  policy  evolution  is 
to  be  formulated,  there  is  a  need  to  put  the  frame¬ 
work  within  specific  institutional  contexts  in  order 
to  take  into  account  the  impact  of  political  struc¬ 
tures,  the  multiple  layers  of  government  and  their 
interactions,  or  the  specific  constitutional  arrange¬ 
ments  in  shaping  policy  politics  in  different  national 
contexts.  At  the  same  time,  the  overarching  con¬ 
cepts  of  the  policy  cycle  do  highlight  comparable 
features  of  the  policy  process  that  can  be  easily 
identified  in  different  political  systems,  even  though 
the  actual  characteristics  of  each  phase  vary  across 
nations.  Such  a  conceptual  space  is  especially  criti¬ 
cal  in  fostering  comparative  research,  particularly 
because  the  literature  on  public  policy  primarily 
focuses  on  the  United  States  and  other  developed 
democracies. 

Most  major  scholarly  works  on  public  policy  in 
the  1990s  do  not  attempt  to  develop  a  theory  of 
the  entire  policy  process;  rather,  they  tend  to 
examine  the  dynamics  of  each  of  the  constituent 
phases  of  the  policy  cycle  in  much  greater  depth 
and  with  increasing  methodological  sophistication. 
For  instance,  many  studies  investigate  how  prob¬ 
lems  come  to  the  attention  of  policymakers  (e.g., 
John  Kingdon),  how  political  systems  process 
information  and  prioritize  problems  (e.g.,  Bryan 
Jones  and  Frank  Baumgartner),  and  how  policy 
agendas  have  evolved  and  changed  over  time  (e.g., 
Jones  and  Baumgartner).  Others  have  analyzed 
how  competing  advocacy  coalitions  sharing  basic 
policy  beliefs  within  a  policy  subsystem  have  orga¬ 
nized  to  change  policies  (e.g.,  Paul  Sabatier  and 
Hank  Jenkins-Smith)  and  how  public  policies  are 
implemented  (e.g.,  Michael  Hill  and  Peter  Hupe). 
Scholars  have  also  studied  how  policy  design  has 
affected  the  behavior  and  interests  of  target  clients 
of  policies  (e.g.,  Anne  Schneider  and  Helen  Ingram). 
Moreover,  while  investigating  one  policy  phase 
involving  a  piece  of  policy  legislation  may  require 
a  relatively  shorter  period  of  time,  some  major 
studies  have  taken  a  long  view  of  the  policy  cycle 
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to  take  into  account  the  effects  of  implementation 
and  policy  learning.  For  instance,  research  on  the 
advocacy  coalition  framework  has  studied  policy 
change  and  learning  over  a  decade  (researchers 
include  Jenkins-Smith  and  Sabatier),  and  the  punc¬ 
tuated  equilibrium  theory  studies  changes  in  the 
American  policy  agenda  over  many  decades 
(researchers  include  Bryan  Jones  and  Frank 
Baumgartner).  The  duration  of  a  policy  cycle  is 
critical  if  a  more  thorough  analysis  of  the  entire 
policy  cycle  is  to  be  attempted. 

Since  the  actual  policy  process  concerns  highly 
complex,  fluid,  and  strategic  interactions  between 
many  political  actors  over  time,  the  simplicity  of 
the  policy  cycle  continues  to  offer  not  only  a  sound 
heuristic  device  but  also  a  very  effective  frame¬ 
work  to  stimulate  thinking  and  organize  research 
on  public  policy.  In  fact,  disaggregating  the  policy 
process  into  stages  constitutes  a  critical  first  step 
that  facilitates  further  conceptual  refinement,  theo¬ 
retical  inquiry,  and  empirical  research.  Research 
hypotheses  within  specific  stages  of  the  policy 
cycle,  such  as  agenda  setting  and  policy  making,  or 
across  these  stages,  for  instance,  could  be  gener¬ 
ated  for  further  theory  building.  Such  hypotheses 
would  then  be  made  amenable  for  empirical  test¬ 
ing  or  refutation.  Therefore,  in  spite  of  the  lack  of 
a  core  theoretical  argument,  the  policy  cycle  per¬ 
spective  has  not  limited  the  conceptual  and  meth¬ 
odological  advancement  in  the  study  of  the  policy 
process. 
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Policy  Evaluation 


Policy  evaluation  concerns  the  estimation  of  the 
effectiveness  of  public  policies,  programs,  or  proj¬ 
ects  and  their  management  and  implementation. 
Because  policy  is  about  change,  policy  evaluation 
should  explain  what  factors  or  determinants  led 
citizens  or  groups  to  do  things  or  take  actions  that 
otherwise  they  would  not  do  (e.g.,  quit  smoking, 
recycle  products,  practice  sustainable  behavior, 
end  domestic  violence,  retain  students  in  school, 
attract  investments,  reduce  unemployment).  To 
implement  such  policies,  governments  have  created 
programs  with  specific  achievement  goals.  The  role 
of  policy  evaluators  is  to  give  educated  estimates  of 
the  effects  of  changes  in  government  policies.  From 
a  public  administration  perspective,  program  eval¬ 
uation  is  a  scientific  activity  that  entails  the  use  of 
scientific  methods  to  measure  the  outputs,  imple¬ 
mentation,  and  outcomes  of  programs  for  deci¬ 
sion-making  purposes.  But  beyond  all  this,  the  key 
question  in  policy  evaluation  is  the  following: 
What  does  government  do  to  improve  the  quality 
of  life  of  individuals-citizens?  It  is  also  important 
for  policy  evaluators  to  decide  if  it  is  better  to  do 
something  than  do  nothing  to  resolve  a  problem. 
This  entry  examines  approaches  to  policy  evalua¬ 
tion,  the  political  role  of  evaluation,  and  some  of 
the  techniques  used  for  policy  evaluation. 

Historically,  the  role  of  social  scientists  and  pro¬ 
gram  evaluators  in  governments  has  been  at  the 
forefront  of  the  debate  on  the  relevance  of  norma¬ 
tive  versus  empirical  research  in  the  decision-making 
process.  The  opposition  between  those  who  favor 
the  “scientifization”  of  politics  and  those  for  the 
“politicization”  of  science  has  created  two  classes  of 
program  evaluators:  those  who  think  that  we  can 
bring  social  changes  by  looking  at  and  comparing 
the  action  taken  by  several  governmental  entities  to 
resolve  social  problems  and  those  who  believe  that 
everything  remains  normative  and  the  support  of 
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stakeholders  is  essential  for  the  implementation 
purposes.  Looking  back  on  the  New  Deal  of 
Franklin  D.  Roosevelt,  the  Beveridge  Report,  the 
Coleman  Report  of  education,  and  the  War  on 
Poverty,  all  these  policy  initiatives  led  to  several 
actions  and  new  programs  to  respond  to  social 
issues. 

Therefore,  policy  evaluation  refers  also  to  the 
broad  idea  of  social  engineering,  meaning  the 
capacity  of  evaluators  to  resolve  social  problems 
and  to  find  superior  solutions.  The  evaluators’  role 
in  governing  is  also  to  suggest  the  best  policy  alter¬ 
natives  to  resolve  a  social  problem.  Decision  mak¬ 
ers  need  to  understand  the  impact  of  their  actions; 
therefore,  policy,  program,  and  project  evaluations 
as  a  knowledge  activity  contribute  to  the  rational¬ 
ization  of  decision  making  by  determining  the  best 
course  of  action  and  the  limitations  of  governmen¬ 
tal  interventions  (Lee  Sechrest  &  Aurelio  Figueredo, 
1993;  Evert  Vedung,  1997).  Thus,  evaluators  are 
not  mere  technicians  measuring  what  government 
has  been  doing  but  also  are  intimately  connected 
to  making  and  improving  public  policies. 

The  Role  of  Policy/Program  Evaluators 
in  a  Changing  Environment 

The  concept  of  “experimenting  societies,”  that  is, 
trying  to  test  social  theories  and  hypotheses  in  our 
social  environment,  in  many  ways  poses  a  chal¬ 
lenge  and  a  burden  on  program  evaluators  (Donald 
Campbell  &  Jean  Russo,  1999).  How  can 
program  evaluators  improve  the  well-being  of  citi¬ 
zens?  How  can  they  reduce  the  social  risks  associ¬ 
ated  with  unemployment  and  slow  economic 
growth,  poverty,  and  income  inequalities?  How 
can  they  improve  the  social  status  of  unprivileged 
groups?  To  bring  about  such  changes,  evaluators 
realized  that  the  state  is  only  one  of  many  sources 
of  power  in  society.  Governments  have  to  compete 
with  many  actors — interest  groups,  social  groups, 
nongovernmental  organizations — who  want  to 
participate  in  the  policy  process.  Furthermore,  in 
addition  to  competition,  the  public  sector  also  now 
engages  in  extensive  cooperation  with  actors  in  the 
private  sector  in  the  provision  of  public  services. 

One  difficult  task  for  program  evaluators  is  to 
convince  decision  makers  that  the  social  and  polit¬ 
ical  risks  of  inaction  are  often  more  costly  than 
taking  action  to  provide  a  remedy  to  a  problem. 


Evaluators  have  to  tell  politicians  not  only  what 
they  are  supposed  to  do  but  also  what  they  ought 
to  do.  If  governments  wait  too  long  to  repair 
bridges,  the  lives  of  citizens  can  be  threatened  and 
important  economic  and  social  consequences  may 
ensue.  Evaluators  have  to  establish  clearly  the  con¬ 
sequences  of  change  or  no  change — which  are 
often  the  cause  of  fear,  inertia,  and  conservatism — 
on  the  future  of  society.  Social  changes  or  policy/ 
program  changes  cannot  be  implemented  without 
an  overall  social  or  organizational  consensus 
about  the  necessity  of  such  a  transformation.  The 
determination  of  the  appropriate  change  criteria 
based  on  the  goals  and  objectives  of  programs  is  at 
the  heart  of  establishing  the  indicators  of  perfor¬ 
mance  and  both  the  economic  and  the  political 
feasibility  of  programs. 

From  a  methodological  viewpoint,  a  discussion 
about  the  “best”  research  protocol  or  research 
design  and  the  use  of  reliable  measuring  instru¬ 
ments  are  essential  elements  of  any  evaluation  to 
establish  either  the  success  or  the  failure  of  any 
governmental  program.  The  task  of  evaluators  is 
to  operationalize  the  program  goals,  to  specify  the 
indicators  to  measure  the  social  changes  associated 
with  the  program,  and  to  suggest  the  appropriate 
policy  instruments  to  achieve  the  programs  goals. 
Evaluators  are  concerned  not  only  with  outcomes 
measurement  but  also  with  the  implementation 
process.  Therefore,  putting  in  place  within  govern¬ 
ments  or  organizations  a  monitoring-tracking  sys¬ 
tem — to  evaluate  regularly  changes  in  the  program 
outcomes  and  the  processes  through  which  results 
are  supposed  to  be  obtained — is  also  a  crucial  ele¬ 
ment  of  the  evaluation  function. 

Thus,  evaluators  are  first  of  all  researchers,  with 
policy  evaluation  remaining  a  scientific  activity. 
However,  some  scholars  have  suggested  that  eval¬ 
uators  can  also  become  “agents”  of  social  change 
and,  in  some  cases,  policy  advisers  do  devise  the 
most  effective  strategies  for  achieving  program 
goals.  From  this  perspective,  evaluators  constitute 
a  class  of  “new  social  scientists”  who  understand 
what  governments  do  (governance),  what  are  the 
key  determinants  of  the  decision-making  process 
(public  administration),  what  are  the  best  policy 
instruments  to  use,  and  how  to  evaluate  the  impact 
or  outcomes  of  decisions  (public  policy).  The  task 
of  evaluators  therefore  is  enormous  and  focuses  on 
the  multifaceted  nature  of  governmental  activities. 
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Governance  and  Policy  Evaluation  Processes 

If  in  the  United  States,  program  evaluation  was 
developed  largely  outside  government,  being  firmly 
rooted  in  the  experimental  tradition  of  the  social 
sciences  and  at  the  border  between  public  policy 
and  public  administration,  this  was  not  the  case  in 
most  industrial  countries.  In  the  United  Kingdom, 
Canada,  and  Australia,  the  evaluation  function  has 
developed  and  evolved  within  governmental  insti¬ 
tutions,  departments,  and  agencies.  In  other  countries, 
such  as  France  and  Germany,  program  evaluation 
is  still  perceived  as  a  means  of  administrative  con¬ 
trol,  and  the  line  between  program  evaluation  and 
audit  remains  fuzzy.  Therefore,  the  policy  evalua¬ 
tion  processes  have  followed  different  paths  and 
have  been  carried  out  by  different  stakeholders. 

Sometimes,  program  evaluation  is  perceived  not 
as  a  part  of  the  field  of  public  policy  or  of  public 
administration  but  simply  as  a  function  within  the 
government  management  and  monitoring  system. 
It  is  a  means  used  by  decision  makers  to  move 
toward  good  governance.  The  development  of 
policy  evaluation  was  also  perceived  as  a  part 
of  the  budgetary  process  since  the  implementation  of 
the  Programming,  Planning,  and  Budgeting  System 
(PPBS)  by  the  Lyndon  B.  Johnson  administration 
in  the  United  States;  it  is  also  considered  as  an  ele¬ 
ment  of  the  accountability  of  government.  That  led 
to  confusion  between  the  functions  of  evaluation 
and  audit  and  between  cost-effectiveness  and  cost- 
benefit  analysis.  If  program  evaluation  is  a  careful 
assessment  of  government  interventions,  internal 
audit  is  more  concerned  about  the  use  of  the  finan¬ 
cial  resources  of  the  public  sector.  Nevertheless, 
both  can  play  complementary  roles  in  determining 
the  criteria  leading  to  good  governance  and 
improve  the  delivery  of  programs  to  citizens. 

In  recent  years,  program  evaluation  as  a  field  of 
research  has  been  under  heavy  criticism,  with  sev¬ 
eral  scholars  arguing  that  over  the  years  no  sub¬ 
stantive  knowledge  has  been  attached  to  program 
evaluation.  The  main  reason  for  a  situation  of  this 
kind  to  have  developed  was  that  program  evalua¬ 
tion  operates  in  a  highly  politicized  context  and 
has  become  part  of  the  political  process  where 
program  evaluators  are  embedded  in  the  program 
itself  (Eleanor  Chelimsky,  1997).  Therefore,  it  was 
proposed  to  distinguish  formative  from  summative 
evaluation,  the  former  leading  to  managerial 
change  in  the  delivery  of  programs  while  the  latter 


looks  at  the  factors  leading  to  the  success  of  pro¬ 
grams.  Furthermore,  when  new  governments  take 
over,  the  origins  and  outcomes  of  the  public  pro¬ 
grams  of  the  previous  government  are  forgotten. 

The  Science  of  Program  Evaluation 

Policy  evaluation  was  clearly  an  outgrowth  of  the 
development  of  the  welfare  state  and  one  of  its  prin¬ 
cipal  consequences:  the  increase  of  governmental 
expenditures.  Even  if  there  are  several  theoretical 
elements  beyond  policy  evaluation,  the  central  the¬ 
ory  is  to  model  governmental  change  (Marvin  Alkin, 
2004).  In  1959,  when  Charles  Lindblom  wrote  his 
seminal  article  “The  Science  of  Muddling  Through,” 
he  suggested  that  in  any  incremental  analysis,  social 
scientists  should  distinguish  small  budgetary  changes 
(simple  incrememental  analysis)  from  the  complex 
(disjointed  incrementalism)  and  very  complex  situa¬ 
tions  (strategic  analysis).  For  Lindblom,  incremen¬ 
talism  is  simply  a  method  of  analysis  and  action — 
the  most  effective  way  is  to  respond  to  bureaucrats’ 
and  voters’  demands.  From  his  perspective,  nonin- 
cremental  policies  are  simply  irrational  and  politi¬ 
cally  irrelevant  because  their  consequences  are 
unforeseeable  for  any  political  system.  Leaders  who 
want  to  stay  in  power  will  be  wiser  to  bring  small 
budgetary  changes  and  introduce  small  adjustments. 
In  some  ways,  incrementalism  can  be  perceived  as 
an  outbreak  of  rational  decision  making. 

Therefore,  incrementalism  became  a  central  con¬ 
cept  of  public  policy  and  decision-making  processes, 
and  it  was  defined  as  a  mutual  adjustment  process 
that  may  produce  small  ranges  of  budgetary  out¬ 
puts.  If  this  concept  is  widely  accepted  among  schol¬ 
ars,  it  is,  however,  inadequate  to  explain  policy 
changes,  especially  in  periods  of  rapid  political  and 
social  transformations.  The  study  of  incremental 
change  also  led  several  studies  to  determine  if  the 
type  of  political  and  party  system  influences  the  bud¬ 
getary  process.  Also,  questions  related  to  the  impact 
of  newly  elected  governments  and  the  increase  of 
departments’,  ministries’,  or  agencies’  budgets  as  a 
factor  leading  to  changes  in  public  policies  (Niskanen 
theory  on  bureaucracies)  became  part  of  the  task  of 
explaining  governmental  growth.  Thus,  with  the 
election  of  new  leaders  and  political  parties,  some 
policy  adjustments  and  some  partisan  mutual  adjust¬ 
ment  processes  in  the  policy  goals  and  objectives  can 
be  expected.  If  the  traditional  incrementalist  school 
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suggests  that  political  elites  have  a  small  influence 
on  the  policy  process  and  its  outputs,  some  schol¬ 
ars,  on  the  contrary,  think  that  to  measure  the 
impact  of  policy,  political  analysts  should  look 
beyond  the  “muddling  through”  process.  Therefore, 
the  purpose  of  policy  evaluation  is,  basically,  to 
understand  the  role  and  functions  of  governments 
in  our  societies. 

Incrementalism  and  Social  Transformations 

The  postulate  of  incrementalism  is  that  the 
observed  policy  change  in  a  given  time  period  is 
determined  by  the  preceding  time  period.  In  other 
words,  a  good  part  of  the  change  observed  is  highly 
dependent  on  the  existing  situation.  In  the  case  of 
budgetary  change,  annual  budgets  represent  incre¬ 
mental  decisions  because  political  leaders,  bureau¬ 
crats,  and  appropriate  committees  are  concerned 
only  with  small  changes  in  the  budget  year  after 
year.  Therefore,  governmental  policies,  programs, 
and  budgets  are  never  completely  reviewed  as  a 
whole  because  of  the  incremental  calculations  from 
the  actors,  and  the  value  of  all  programs  is  never 
completely  compared  with  other  policy  alterna¬ 
tives.  In  the  case  of  the  budgetary  process,  the  pre¬ 
vious  year’s  budget  is  the  main  reference  to  fix  the 
appropriations  of  the  next  year.  The  incremental¬ 
ists  do  not  assume  that  cuts  and  reductions  in  bud¬ 
gets  and  programs  are  impossible  from  year  to  year 
but  that  decision  makers  are  only  interested  in  the 
increments — small  increases  or  decreases.  Then,  it 
is  possible  to  observe  some  incremental  cuts  in 
departmental  budgets  or  programs. 

From  a  public  administration  perspective,  the 
increment  approach  refers  to  the  difference  between 
what  the  administrators  of  a  department  or  pro¬ 
gram  request  in  a  specific  year  and  what  they  actu¬ 
ally  received  in  the  prior  year.  If  a  department  or 
program  receives  less  than  it  had  in  the  previous 
year,  policy  analysts  talk  about  a  decrement  instead 
of  an  increment.  However,  such  decrements  are 
rarely  observed  directly  and  in  some  ways  represent 
deviant  cases  in  public  administration.  In  other 
words,  since  most  policies  and  program  budgets  are 
rarely  controversial,  the  budgetary  base  does  not 
really  change  from  one  budget  to  the  successive 
ones.  However,  if  the  budget  of  a  program  remains 
the  same  over  time,  one  can  argue  that  there  are 
actually  incremental  cuts  because  program  expenses 


are  not  following  the  normal  inflation  growth  pat¬ 
tern.  Aaron  Wildavsky,  for  example,  gives  another 
definition  of  the  budgetary  base.  For  him,  the  policy 
analyst  should  take  into  account  the  expectations  of 
the  stakeholders.  Rarely,  decision  makers  will  ask 
for  a  decrease  of  their  program  budgets,  and  there¬ 
fore  the  base  represents  “the  commonly  held  expec¬ 
tations  among  participants  in  budgeting  that 
programs  will  be  carried  out  at  close  to  the  going 
level  of  expenditures”  (Wildavsky,  1986,  p.  11). 

If  these  concepts  (e.g.,  incremental  change, 
increment-decrement,  budgetary  base)  are  accepted 
among  policy  analysts,  it  is  more  their  applicability 
for  decision-making  purposes  that  is  challenged. 
Some  authors  have  criticized  the  normative  char¬ 
acter  of  the  incrementalist  theory  because  of  the 
perceived  conservative  trend  in  this  approach  and 
the  absence  of  any  reflection  about  the  policy 
implementation  process  of  new  policies.  The  the¬ 
ory  and  the  models  proposed  are  essentially  too 
stochastic.  Also,  because  changes  are  only  margin¬ 
ally  different  from  the  status  quo,  the  hypothesis 
that  policy  administrators  and  politicians  will 
adopt  incremental  behavior  is  merely  descriptive. 
It  should  also  be  proven  that  the  base  actually  rep¬ 
resents  the  status  quo.  In  others  words,  measuring 
change  is  not  easy  and  depends  on  the  evaluators’ 
reference  standards  before  the  evaluators  can 
establish  the  size  of  the  change  observed. 

The  last  issue  concerning  incrementalism  is  to 
define  it  empirically.  It  is  difficult  to  determine 
what  an  incremental  (small)  change  is.  At  which 
level  can  budgetary  changes  be  considered  signifi¬ 
cant — 5%,  10%,  or  30%  of  the  program  budget? 
And  what  are  the  characteristics  of  such  changes?  It 
is  suggested  that  policy  evaluation  should  be  able  to 
make  the  distinction  between  mandatory  requests — 
the  new  expenditures  needed  to  keep  a  program 
operating  at  the  same  level — and  programmatic 
requests  for  initiating  new  programs.  The  first  type 
of  budgetary  change  is  perceived  as  incremental  but 
not  the  second  one.  It  is  also  suggested  that  in  any 
budget,  there  is  a  part  that  is  controllable  and  a 
part  that  is  uncontrollable  and  includes  all  expendi¬ 
tures  not  regulated  by  the  legislation. 

Policy  Processes  and  Bureaucratic  Changes 

According  to  Lindblom  (1959),  “democracies 
change  their  policies  almost  entirely  through 
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incremental  adjustments”  (p.  84).  In  democracies, 
it  is  easier  for  political  leaders  to  focus  just  on 
incremental  changes.  The  effect  of  such  behavior 
is  that  the  value  of  the  status  quo  is  constantly 
reinforced.  A  high  degree  of  social  stability  and 
continuity  is  necessary  in  any  political  system  to 
allow  bureaucrats  to  follow  the  muddling  through 
approach;  in  periods  of  rapid  social  changes,  this 
method  becomes  inadequate.  If  incrementalism 
confines  the  bureaucrats  in  the  day-to-day  routine 
and  reduces  their  motivation  and  their  capacity  to 
innovate,  then  incrementalism  is  a  way  to  rein¬ 
force  the  pro-inertia  ideology  and  support  the 
tendency  of  organizations  to  resist  change. 
Therefore,  the  central  question  is  how  to  define 
the  status  quo  (the  baseline  measurement)  and 
how  to  measure  small  changes  from  the  status 
quo.  Also,  do  small  changes  really  reinforce  the 
status  quo? 

However,  Lindblom  remains  skeptical  about 
the  argument  that  his  policy  model  supports  the 
antimotivation  forces  existing  in  any  organiza¬ 
tion.  If  the  rational-comprehensive  model  seeks  to 
stimulate  the  administrators  and  bureaucrats,  he 
cannot  foresee  how  his  model  can  be  seen  as  a 
tactic  for  conservatism.  He  argued  that  fast- 
moving  sequences  of  small  changes  can  bring 
about  policy  changes  more  rapidly  than  can  only 
infrequent  redefinition  of  policies.  Lindblom 
argues  that  in  periods  of  political  crises,  the  value 
of  incremental  changes  is  that  they  can  be  imple¬ 
mented  rapidly  and  that  decision  makers  and 
bureaucrats  are  in  general  less  ideologically  ori¬ 
ented  since  their  main  goals  is  to  serve  the  public 
interest.  Consequently,  one  important  element  of 
the  incrementalist  theory  is  that  the  bureaucracy  is 
relatively  autonomous  within  the  political  system 
and  immune  to  pressure  coming  from  interest 
groups  and  stakeholders. 

Therefore,  incremental  politics  offers  to  demo¬ 
cratic  societies  the  best  way  to  introduce  the 
changes  requested  by  citizens.  The  model  proposed 
by  Lindblom  and  others  suggests  that  governmen¬ 
tal  budgets  and  the  changes  observed  (reductions, 
increases)  are  the  result  of  incrementalism.  For 
stakeholders,  when  the  decisions  made  are  only 
incrementally,  it  is  not  only  easy  for  them  to  antic¬ 
ipate  the  policy  direction,  but  they  can  also  develop 
their  own  strategy  accordingly.  In  the  case  of  com¬ 
plex  social  problems,  when  opposite  values  are  at 


stake,  decisions  makers  and  bureaucrats  will  focus 
mainly  on  the  feasible  changes  in  the  short  run.  A 
strategic  policy  approach  will  imply  the  develop¬ 
ment  of  the  best  possible  alternatives  to  respond  to 
the  stakeholders’  expectations  through  a  mutual 
adjustment  process.  It  then  can  be  argued  that 
incrementalism  is  an  outgrowth  of  pluralism. 

As  some  critics  have  mentioned,  the  incremen¬ 
tal  approach  does  not  describe  the  policy  evalua¬ 
tion  process  by  which  public  policies  or  programs 
are  modified  and  implemented.  Several  authors 
have  also  argued  that  one  difficulty  with  incre¬ 
mentalism  is  that  it  detaches  administrative  con¬ 
cepts  from  the  context  of  constitutional  and 
political  theory.  With  regard  to  budget  appropria¬ 
tions,  it  is  clear  that  over  the  years  this  function 
has  become  much  more  specialized,  and  in  many 
countries,  the  budgetary  process  has  become  more 
isolated  within  governments  and  in  some  way 
removed  from  political  and  judicial  controls. 
Policy  evaluation  becomes  a  complex  enterprise  in 
light  of  the  different  programs  and  activities  devel¬ 
oped  within  each  governmental  entity.  It  was  also 
noted  that  national  priorities  have  often  been 
determined,  based  on  evaluations,  analyses,  and 
reports,  by  bureaucrats,  who  are  isolated  from  the 
political  sphere.  This  has  raised  several  important 
questions  about  good  governance  and  the  account¬ 
ability  of  governments. 

Performance  Measurement 

The  issue  of  how  to  measure  the  effect  of  a  policy, 
a  program,  or  a  social  intervention  has  been  a 
recurrent  methodological  question  in  policy  evalu¬ 
ation.  In  1963,  Chester  Harris  published  a  semi¬ 
nal  work  titled  Problems  in  Measuring  Change , 
where  contributors  propose  several  approaches  on 
how  to  study  change.  By  definition,  the  study  of 
change  involves  two  or  more  measurements  of  the 
same  variable  over  time  to  provide  the  basis  for 
inferring  whether  change  has  or  has  not  taken 
place.  The  general  principle  is  to  predict  status  in 
the  absence  of  policy  intervention  and  to  calculate 
the  added  value  of  such  action.  In  practice,  the 
idea  of  policy  modeling  is  to  offer  policy-making 
elements  for  judging  the  nature  and  expected 
impact  of  policy  intervention.  This  also  raises  the 
question  about  the  linear,  curvilinear,  or  exponen¬ 
tial  nature  of  the  expected  impact.  Fundamentally, 
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any  decision  should  be  based  on  real  observations 
and  data. 

One  of  the  first  problems  indentified  was 
related  to  the  lack  of  valid  and  reliable  sources  of 
data.  If  measurement  is  defined  as  the  “process  of 
linking  abstract  concepts  to  empirical  indicators,” 
the  first  task  of  policy  evaluators  is  to  provide 
policymakers  accurate  representation  of  the  con¬ 
cepts  used.  If  the  goals  of  programs  are  to  reduce 
poverty  and  inequalities,  increase  productivity, 
improve  students’  performance,  improve  quality  of 
life,  and  so  on,  then  all  these  abstract  notions 
should  be  translated  into  questions  that  reflect  a 
domain  of  content.  Over  time,  several  critics  have 
asserted  that  theoretical  concepts  in  policy  analysis 
have  not  been  described  with  the  required  exact¬ 
ness.  This  issue,  which  has  been  termed  construct 
validity,  refers  to  the  efforts  by  evaluators  to  use 
impact  indicators  that  can  provide  an  accurate 
measure  of  the  effect  of  a  policy.  The  measure  of 
the  true  social  attitude  or  policy  impact  can  also  be 
affected  by  the  social  or  political  desirability  of  an 
intervention.  Therefore,  the  choice  of  the  right  cri¬ 
teria  for  determining  the  success  or  failure  of  a 
policy  intervention  is  also  at  the  heart  of  policy 
evaluation. 

In  the  field  of  policy  performance  measurement, 
which  is  the  regular  measurement  of  performance 
indicators,  many  issues/problems  have  been  identi¬ 
fied  that  might  lead  to  faulty  conclusions  in  policy 
evaluation  (Carolyn  Heinrich,  2007).  The  inade¬ 
quate  definition  of  a  concept,  the  sensitivity  of 
measures,  and  the  use  of  a  single  measure  (mono¬ 
operationalization)  or  a  single  line  of  evidence 
(monomethod  bias)  are  all  threats  to  conclusion 
validity.  In  recent  years,  governments  have  made 
efforts  to  put  in  place  some  results-based  measure¬ 
ment  frameworks  by  asking  departments  to 
develop  their  own  indicators  of  performance  and 
to  establish  what  they  judge  to  be  a  successful 
intervention.  In  other  words,  research  protocol 
should  be  sufficiently  precise  and  powerful  to  be 
able  to  detect  the  effects  of  policies,  programs, 
and/or  projects.  To  put  it  simply,  policy  evaluation 
can  be  a  risky  business,  especially  if  evaluators 
conclude  that  a  program  has  some  tangible  effect 
when  in  fact  it  does  not  (Type  I  error)  or  conclude 
that  a  program  effect  does  not  exist  when  it  does 
(Type  II  error).  The  reliability  of  performance  indi¬ 
cators  is  also  crucial.  Do  the  research  design  and 


the  methods  used  yield  the  same  results  on 
repeated  interventions?  Is  it  possible  that  the  mea¬ 
surement  instruments  (indicators,  scales)  used 
have  some  systematic  biases,  underestimating  or 
overestimating  the  net  impact  of  a  policy?  To 
assess  the  reliability  of  the  observed  results,  evalu¬ 
ators  should  compare  evaluation  results  with  the 
results  of  similar  studies.  Measurement  is  a  key 
element  in  policy  evaluation,  and  systematic  effort 
to  define  the  concepts  and  develop  the  appropriate 
performance  indicators  remains  at  the  heart  of 
evaluation. 

Since  the  early  works  on  measuring  changes,  the 
field  of  policy  evaluation  has  seen  an  increase  in  the 
number  of  books  and  articles  on  methods  and 
approaches.  In  general,  three  methods  have  been 
suggested.  The  first  approach  is  to  randomize  par¬ 
ticipants  between  experimental  and  control  groups 
in  order  to  be  able  to  differentiate  the  expected 
normal  growth  patterns  from  the  net  impact  of  a 
policy/program.  Second,  some  baseline  measure¬ 
ments  can  be  taken  prior  to  a  policy/program  inter¬ 
vention  to  evaluate  if  there  are,  for  example,  age  or 
income  variations  and  to  be  able  to  determine  the 
value  added  of  an  intervention  over  specific  groups. 
A  third  approach  is  to  use  reference  standards  such 
as  health  statistics.  For  example,  it  is  known  that 
growth  patterns  differ  for  boys  and  girls.  If  a  child 
has  a  growth  deficiency,  doctors  are  able  to  com¬ 
pare  the  individual  situation  with  what  is  known  as 
the  “normal”  growth  pattern.  The  challenge  for  a 
policy  evaluator  is  therefore  to  differentiate  what  is 
“normal”  change  over  time  and  to  see  if  a  govern¬ 
mental  intervention  can  accelerate  or  decelerate  the 
patterns  of  change. 

Finally,  many  social  and  environmental  phe¬ 
nomena  do  not  occur  in  a  linear  way.  In  general, 
people  expect  change  to  occur  in  a  linear  fashion. 
A  well-known  example  of  this  point  was  the  study 
of  governmental  expenditures.  Contrary  to  expec¬ 
tations,  especially  in  the  postwar  period,  the 
growth  of  governmental  expenditure  follows  a 
nonlinear  pattern.  Data  should  be  properly  ana¬ 
lyzed,  otherwise  there  may  be  a  systematic  bias 
that  policy  evaluators’  statistical  methods  will  not 
be  able  to  capture.  In  recent  years,  this  became 
even  more  evident  in  the  case  of  environmental 
policy  because  many  patterns  of  growth  are  expo¬ 
nential.  It  is  essential  to  know  the  exact  pattern  of 
change  before  conducting  an  evaluation. 
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Measuring  the  Impact  of 
Government  Policies 

Policy  Evaluation  Design 
and  Research  Strategy 

The  purpose  of  policy/program  evaluation  is  to 
develop  research  tools  to  be  able  to  measure  the 
relationships  between  the  objectives  of  a  program, 
its  activities,  and  its  impact.  Of  the  issues  related  to 
the  development  of  systematic  approaches  or  ana¬ 
lytical  procedures,  the  evolution  of  program  evalu¬ 
ation  as  a  scientific  activity  has  been  critical.  For  an 
evaluator,  it  is  essential  to  distinguish  between  the 
condition  that  led  to  the  implementation  of  a  pro¬ 
gram,  the  type  of  activities  proposed  to  achieve  the 
program  goals,  and  the  particular  factors  or 
threats  that  might  limit  the  evaluator’s  capacity  to 
get  an  unbiased  estimate  of  the  “real”  versus  the 
“observed”  impact  of  a  program  (Richard  Hurteau, 
Vincent  Lachapelle,  &  Guillame  Houle,  2006). 

The  first  step  before  conducting  any  policy 
evaluation  is  to  have  a  clear  indication  of  the  pol¬ 
icy  goals.  Over  the  years,  evaluators  have  com¬ 
plained  about  governmental  officials  and  their 
inability  to  define  precisely  the  goals  and  expected 
impact  of  their  policies.  Imprecise  and  diffuse 
goals — some  of  which  may  not  even  be  stated — 
make  evaluators’  work  extremely  difficult.  Govern¬ 
ments  might  want  to  reduce  the  level  of  poverty  or 
put  in  place  a  minimum-income  policy  without 
specific  achievement  goals  to  indicate  whether  the 
policy  has  attained  its  objectives.  Before  any  evalu¬ 
ation  is  conducted,  evaluators  have  to  communi¬ 
cate  with  administrators  to  establish  the  expected 
effects  of  the  policy/program. 

The  second  step  before  deciding  what  type  of 
research  design  should  be  used  is  to  have  a  clear 
knowledge  about  the  definition  of  a  particular  con¬ 
cept  and  the  conceptual  universe  related  to  it 
(Richard  Bingham  &  Claire  Felbinger,  1989;  John 
Owen  &  Patricia  Rogers,  1999).  If  the  goal  of  a 
program  is  to  reduce  inequalities,  to  measure  policy 
innovation,  or  to  improve  participants’  abilities,  a 
clear  and  undisputed  definition  of  the  concepts  as 
well  as  general  acceptance  of  the  idea  (content  and 
construct  validity)  are  prerequisites  for  conducting 
any  fieldwork.  Issues  such  as  measure  sensitivity 
and  the  relationship  between  concepts  and  mea¬ 
surement  lead  to  this  fundamental  question:  Does 
the  measurement  tool  measure  what  it  was  intended 


to  measure?  Extraneous  sources  of  error  might 
result  in  biased  estimates. 

The  third  step  is  to  look  at  plausible  rival  hypoth¬ 
eses  or  explanations  of  the  observed  impacts  of  a 
program.  This  is  never  an  easy  task;  it  is  quite  often 
difficult  to  determine  cause-and-effect  relationships 
or  how  societal  change  has  been  caused  by  policy 
action.  Several  outside  explanations — which  evalua¬ 
tors  have  described  as  “validity  threats” — should  be 
studied  to  determine  how  they  can  affect  or  have 
affected  the  conduct  of  the  evaluation.  The  evalua¬ 
tion  design  should  minimize  the  impact  of  these 
threats  to  validity  to  be  able  to  conclude  that  the 
statistical  association  observed  between  the  program 
intervention  and  its  measured  impact  can  reason¬ 
ably  be  considered  a  causal  relationship.  As  Thomas 
Cook  and  Donald  Campbell  (1979)  have  empha¬ 
sized,  the  evaluator  has  to  systematically  think 
through  how  each  of  the  internal  validity  threats 
may  have  influenced  the  data  and  to  test  which 
threats  can  be  ruled  out.  Threats  such  as  history, 
maturation,  and  selection  have  been  well  docu¬ 
mented  as  factors  that  might  affect  the  outcome 
measurement.  Sometimes,  pre-evaluation  or  the 
pretest  experience  (the  halo  effect)  of  participants 
may  produce  attitudinal  or  behavioral  changes  that 
can  be  confounded  with  the  policy/program  impact. 

Nevertheless,  different  strategies  can  be  used  in 
policy  evaluation.  This  entry  now  describes  the 
four  main  approaches  or  research  strategies  gener¬ 
ally  used  in  policy  evaluation:  (1)  observational 
study,  (2)  strategic  management  and  meta-analysis, 
(3)  experimental  and  quasi-experimental  studies, 
and  (4)  postevaluation  and  monitoring  studies. 
Each  of  these  four  has  some  real  advantages  and 
also  some  potential  costs  that  should  be  considered 
when  choosing  strategies  for  evaluation  as  well  as 
for  policy  interventions. 

Level  1  Evaluation:  Observational  Study 

The  goal  of  observational  study  is  to  evaluate 
the  policy  objectives  and  means  among  a  small 
number  of  citizens,  usually  up  to  30  participants. 
As  Lawrence  Mohr  (1995)  pointed  out,  observa¬ 
tional  study  is  defined  as  one  in  which  “no  central 
authority  decides  which  subjects  are  to  receive  the 
treatment  and  which  are  not,  or  what  intensity  of 
treatment  each  is  to  receive”  (p.  229).  The  goal  of 
such  Level  1  evaluation  is  to  determine  if  the  course 


1914  Policy  Evaluation 


of  action  is  appropriate,  that  the  policy/program 
does  not  harm  participants  in  any  way  (socially, 
politically,  morally,  or  physically),  how  they  react 
with  the  policy  tools  used,  and  if  the  relative  impact 
of  a  policy/program  project  can  be  estimated.  At 
this  stage,  evaluators  might  want  to  measure  the 
effects  of  different  policy  instruments  (e.g.,  level  of 
taxation,  financial  incentives,  and  training  meth¬ 
ods)  on  the  policy  goals  or  to  look  over  the  “best” 
case  study.  Methods  such  as  benchmarking  can  be 
used  to  assess  why  some  programs  were  successful 
while  others  were  unable  to  reach  their  objectives. 

The  evaluator  can  observe  and  record  the  reac¬ 
tion  of  different  cohorts  or  use  case-control  to 
study  and  determine  the  expected  benefits  for  par¬ 
ticipants.  At  this  point,  evaluators  are  interested  in 
observing  the  associations/correlations  between 
the  policy/program  effects  on  participants  and  the 
outcome  measures.  Evaluators  can  then  propose  to 
relevant  decision  makers  the  appropriate  course  of 
action  to  be  followed.  In  such  a  situation,  there  are 
two  key  questions:  (1)  Are  there  better  ways  to 
resolve  and/or  prevent  a  policy  problem?  (2)  What 
approaches  should  be  implemented? 

Participatory  evaluation  can  certainly  be  used 
as  a  means  to  gather  information  from  the 
stakeholders — the  key  social  and  political  actors 
who  should  be  involved  in  a  policy/program.  For 
several  authors,  policy  evaluation  can  be  a  means  to 
improve  the  democratic  debate  about  governmental 
decisions,  with  the  goal  of  achieving  social  consen¬ 
sus  and  offering  new  perspectives  and  solutions. 

Level  2  Evaluation:  Strategic 

Management  and  Meta- Analysis 

As  soon  as  decision  makers  have  chosen  a  strat¬ 
egy  to  follow  and  have  determined  the  type  of 
policy/program  (treatments)  that  will  be  imple¬ 
mented,  evaluators  have  the  task  of  assessing  its 
implementation.  At  this  point,  it  is  important  to 
determine  whether  various  types  of  policy  inter¬ 
ventions  are  working  or  not,  and  why.  For  this 
type  of  study,  evaluators  usually  look  for  larger 
groups  of  respondents. 

Probabilistic  measures  are  often  employed  at 
this  stage  to  measure  the  potential  effects  and  con¬ 
sequences  of  programs.  As  such,  feasibility  studies 
seek  to  determine  if  a  policy/program  is  capable  of 
achieving  its  goals  and  which  factors  can  impede  the 


achievement  of  the  goals.  Once  a  tentative  policy/ 
program  plan  has  been  developed,  decision  makers 
will  often  invest  in  this  type  of  evaluation  to  ensure 
that  the  assumptions  underlying  the  program  plan 
are  correct  and  that  there  are  no  unforeseen  issues 
that  might  affect  the  program  outcomes.  Economic 
feasibility  studies  can  provide  a  clear  assessment  of 
the  capacity  of  a  program  to  achieve  its  desired  out¬ 
comes  given  the  resources  available  to  the  program. 
Is  the  allocated  budget  sufficient  to  allow  the  pro¬ 
gram  to  function  as  intended?  For  example,  if  gov¬ 
ernments  want  to  launch  a  vaccination  campaign  to 
respond  rapidly  to  the  challenges  of  the  FI1N1  virus 
epidemic,  this  would  mean  ascertaining  the  cost  and 
availability  of  the  vaccine  and  the  required  medical 
equipment,  the  cost  of  hiring  personnel  (doctors 
and  nurses),  and  any  other  costs.  The  public  safety 
authorities  may  implement  the  vaccination  program 
on  a  small  scale  to  evaluate  the  most  efficient  strat¬ 
egy  and  to  identify  unexpected  expenses.  Another 
type  of  study  is  risk  analysis,  the  identification  and 
measurement  of  factors  that  could  potentially  pre¬ 
vent  a  program  from  achieving  its  goals.  Risk 
involves  the  possibility  of  harm/loss  and  uncer¬ 
tainty.  The  conduct  of  risk  analysis  requires  the 
prioritization  of  policy/program  goals,  identifica¬ 
tion  of  the  potential  risks  of  the  adopted  strategy, 
and  measurement  of  such  risk. 

Finally,  one  useful  research  method  for  policy 
evaluation  is  a  meta-evaluation  or  meta-analysis  of 
all  relevant  randomized  evaluation  (John  Chang  et 
ah,  2004).  Evaluators  might  judge  that  a  review  of 
all  the  studies  pertinent  to  a  specific  policy  prob¬ 
lem  is  necessary  for  a  clear  appreciation  of  the 
solutions  to  a  social  problem.  One  strategy  is  to 
look  at  all  successful  programs  and  measure  the 
“effectiveness  factor”  of  each  strategy  on  specific 
groups  or  cohorts.  This  type  of  approach  provides 
additional  insight  about  the  effect  of  a  policy  by 
applying  a  global  multivariate  model,  allowing  for 
assessment  of  the  relative  effectiveness  of  each 
intervention  component  while  controlling  for  the 
effect  of  other  components  in  multifactor  interven¬ 
tions  across  all  studies. 

Level  3  Evaluation:  Experimental 

and  Quasi-Experimental  Studies 

The  use  of  experimental  protocol  or  random¬ 
ized  controlled  trial  remains  the  ideal  in  evaluation 
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methodology.  Participants  are  randomly  assigned 
to  an  experimental  group  (those  who  participate  in 
the  program)  or  to  a  control  group  (those  who  do 
not  participate  in  the  program).  To  be  able  to  assess 
the  observed  change  due  to  the  program  (treat¬ 
ment),  the  two  groups  should  be  comparable 
throughout  the  evaluation  experimentation.  What 
have  been  described  as  validity  threats  are  the  plau¬ 
sible  rival  hypotheses  or  explanations  of  the 
observed  impacts  of  a  program  that  can  explain  the 
change  observed  between  participants  (experimen¬ 
tal  group)  and  nonparticipants  (control  group)  in 
the  program.  For  this  type  of  study,  evaluators  usu¬ 
ally  perform  multicenter  analysis,  comparing  out¬ 
comes  in  different  settings  (e.g.,  schools,  hospitals, 
and  communities)  on  larger  groups  of  respondents. 

The  purpose  of  an  evaluation  design  is  to  mini¬ 
mize  the  threats  to  validity  to  be  able  to  assess  the 
real  impact  of  the  program.  This  type  of  design  has 
an  advantage  in  political  terms:  It  is  scientifically 
approved  and  therefore  can  be  justified  to  the  pub¬ 
lic.  Research  protocols  or  terms  of  references 
describe  in  general  the  objectives  of  the  evaluation 
study,  the  research  designs  that  will  be  employed — 
the  use  of  multiple  lines  of  evidence  strategy — the 
methodology,  and  other  organizational  aspects 
concerning  the  timing  of  the  policy/program  evalu¬ 
ation.  Several  strategies  can  be  employed  in  design¬ 
ing  the  evaluation  protocol.  Research  design  such 
as  randomized  controlled  evaluation,  double-blind 
study,  or  placebo-controlled  evaluation  can  be 
used  to  measure  the  policy/program  impact.  The 
goal  is  to  compare  the  recipients  of  a  program 
(treatment  group)  with  a  comparable  group  (con¬ 
trol  group)  to  be  able  to  measure  the  changes  that 
occur  due  to  the  intervention. 

In  the  case  of  quasi  experimentation,  policy/ 
program  evaluators  want  to  approximate  the 
experimental  method  by  trying  to  identify  the  con¬ 
sequences  of  social  changes,  to  provide  controls 
for  confounding  variables,  and  then  to  probe  for 
causal  dependencies.  There  are  several  types  of 
research  designs  that  can  be  used  to  infer  the 
impact  of  policy  and  programs  on  society.  Among 
them  are  the  nonequivalent  control  group  design, 
the  field  experiment  design,  and  the  interrupted  or 
multiple-time-series  design.  In  these  cases,  policy 
researchers  manipulate  the  independent  variables 
to  identify  the  main  determinants  and  to  look  for 
patterns  of  causality.  Policy  research,  however, 


always  is  conducted  in  a  politicized  environment, 
in  which  it  is  difficult  to  control  many  of  the  fac¬ 
tors  that  threaten  validity. 

In  addition  to  these  results  from  evaluation 
studies,  the  evaluator  has  to  consider  the  unin¬ 
tended  consequences  of  policy  interventions.  Most 
evaluation  research  does  not  consider  these  unin¬ 
tended  consequences,  although  they  are  certainly 
significant  for  the  politics  of  policy.  There  are, 
however,  relatively  few  effective  means  for  com¬ 
bining  the  intended  and  unintended  consequences 
of  programs,  except  for  economic  instruments  that 
combine  costs  and  benefits.  The  political  conse¬ 
quences  of  programs  may  be  more  significant  than 
even  the  intended  ones  if  they  are  more  visible  and 
more  immediate,  so  the  politics  of  evaluation 
requires  careful  consideration  of  the  full  range  of 
outcomes. 

Level  4  Evaluation:  Postevaluation  Study  and 

the  Implementation  of  a  Policy/Program 

Monitoring  System 

At  this  level,  evaluators  can  conduct  a  summa- 
tive  evaluation,  which  is  a  comprehensive  assess¬ 
ment  of  the  degree  of  success  or  failure  achieved 
by  policies-programs,  to  discuss  the  overall  bene¬ 
fits  of  the  program  in  light  of  its  initial  goals.  At 
this  point,  evaluators  should  be  more  concerned 
about  the  impacts  of  a  program  on  its  stakehold¬ 
ers.  The  evaluators  should  also  be  able  to  conduct 
a  formative  evaluation  assessing  whether  the  pro¬ 
gram  has  been  implemented  and  whether  improve¬ 
ments,  radical  change,  or  termination  is  required 
(Peter  DeLeon  &  Linda  DeLeon,  2002).  The  focus 
of  such  comprehensive  evaluation  is  on  the  four 
Es  of  any  policy/program — efficiency,  effective¬ 
ness,  equity,  and  equality — in  comparison  with 
alternative  strategies  to  those  being  implemented. 
Such  an  evaluation  should  also  provide  additional 
information  and  periodic  and  continuous  feedback 
concerning  the  implementation  of  the  program. 

Program  Monitoring:  The  Short-  and 
Long-Term  Impact  of  Policies-Programs 

This  analysis  of  evaluation  also  raises  an  impor¬ 
tant  question  about  the  short-  versus  long-term 
impact  of  a  policy/program.  Fundamentally,  poli¬ 
cymakers  need  to  know  how  to  answer  the  follow¬ 
ing  question  often  asked  by  politicians,  journalists, 
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and  the  public:  How  long  will  it  take  to  find  out  if 
a  program  has  achieved  its  objectives?  First  of  all, 
some  policy  initiatives  have  an  immediate  impact — 
for  example,  increases  in  consumer  taxes  and  uni¬ 
versity  tuition  fees — while  some  policy  initiatives 
may  take  several  years  before  significant  change  is 
observed.  The  impact  of  free  trade  policies  or  envi¬ 
ronmental  policies  (e.g.,  the  Kyoto  Protocol)  is 
difficult  to  measure  in  the  short  term  because  such 
policies  imply  a  multi-action  strategy  over  a  longer 
period  of  time  to  achieve  their  goals.  Some  people 
may  condemn  such  policies  because  they  do  not 
see  their  immediate  effects  or  because  the  political 
benefits  cannot  be  identified  during  an  electoral 
mandate.  Governments  may  choose  to  focus  on 
policies  that  have  rapid  outcomes  since  they  are 
seeking  reelection  in  the  short  term.  Nevertheless, 
the  role  of  policy  evaluators  is  not  to  predict  but  to 
offer  an  educated  analysis  of  the  short-,  medium-, 
and  long-term  benefits  of  any  policy  intervention. 
Therefore,  several  governments  have  decided  that 
all  policies  or  interventions  be  revised  at  least  once 
every  5  years.  The  external  validity  of  the  policy 
research  refers  to  the  idea  that  the  strategy  adopted 
to  resolve  a  social  problem  can  be  used  in  different 
settings  and  is  replicable  and  that  observations  can 
be  generalized  to  other  policies  or  programs. 

Finally,  at  this  level,  it  is  essential  for  future 
evaluations  to  have  some  periodic  collection  of 
data  about  the  policy/program’s  processes  and 
outcomes.  Program  monitoring  involves  not  only 
collecting  data  during  and  after  the  implementa¬ 
tion  of  a  program  but  also  providing  basic  facts 
about  the  services  provided  and  its  participants 
(Huey-Tsyh  Chen,  2005).  The  main  objective  of 
program  monitoring  is  to  determine  whether  the 
participants’  well-being  has  improved,  deterio¬ 
rated,  or  remained  the  same  after  the  implementa¬ 
tion  of  the  program.  However,  it  can  hardly 
explain  why  this  happens  or  if  the  observed  results 
should  be  essentially  attributed  to  the  policy/ 
program’s  intervention.  Monitoring  also  implies  a 
more  or  less  continuous  collection  of  data  about 
the  programs  in  question  so  that  feedback  and 
adjustments  can  also  be  relatively  rapid. 

Two  main  methods  are  suggested  to  monitor 
government  programs:  (1)  process  monitoring  and 
(2)  outcome  monitoring.  Process  monitoring 
involves  the  collection  of  data  about  the  basic  ele¬ 
ments  of  the  program’s  implementation,  such  as 


the  participants’  age  and  socioeconomic  status, 
risk  factors,  and  the  services  delivered.  The  objec¬ 
tive  for  decision  makers  is  to  determine  whether 
the  program  considers  the  targeted  population, 
their  needs,  and  their  success  rates  in  completing 
the  expected  tasks.  Based  on  this  information, 
decision  makers  can  decide  about  the  future  of  a 
policy/program.  In  the  case  of  outcome  monitor¬ 
ing,  the  goal  is  to  assess  the  state  of  the  partici¬ 
pants  before  and  after  the  program  and  if  they  are 
doing  better,  worse,  or  the  same  after  receiving  the 
services  provided  by  a  program. 

These  tasks,  to  regularly  collect  information 
about  the  processes  and  outcomes  of  programs, 
should  normally  be  given  to  a  monitoring  board 
and  be  institutionalized  within  the  evaluation 
branch  of  the  government  or  department.  The 
functions  of  such  boards  are  first  to  see  if  within 
an  organization  the  appropriate  measures  have 
been  implemented,  and  in  the  case  of  an  evaluation 
if  all  the  fundamental  research  principles  have 
been  followed  by  the  policy/program  evaluators. 
Collecting  input  from  participants,  stakeholders, 
and  decision  makers  may  also  be  judged  to  be 
important  for  determining  future  policy  action. 

Conclusion 

Evaluation  is  a  difficult  and  demanding  enterprise. 
It  calls  for  substantial  imagination,  tenacity,  a  sys¬ 
tematic  research  protocol,  and  the  development  of 
research  skills  and  ability.  Effective  evaluation  also 
involves  a  good  deal  of  political  skill  because 
evaluation  involves  working  within  the  political 
system  to  improve  the  programs.  Even  if  different 
applied  research  strategies  or  frameworks  can  be 
employed  for  policy  evaluation,  the  central  func¬ 
tion  of  evaluators  is  to  keep  citizens  and  govern¬ 
ments  informed  about  how  well  programs  are 
working.  They  should  also  suggest  ways  of  improv¬ 
ing  the  existing  programs.  Policy  evaluation  has 
become  a  means  and  a  tool  for  decision  makers 
not  only  to  respond  to  citizens’  demands  for 
greater  accountability  but  also  to  support  the 
action  of  those  who  are  involved  in  their  own 
social  environment  (Frank  Fischer,  2003).  Citizens 
want  governments  to  offer  better  programs,  to  be 
more  effective  in  their  social  interventions,  and  to 
not  waste  resources  from  the  public  purse.  For  all 
these  reasons,  policy  evaluation  is  a  tool  for  both 
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policymakers  and  the  public  to  understand  what 

governments  do. 
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Policy  Formulation 


In  plain  language,  a  policy  is  a  statement  of  intent 
or  goal,  or  simply  a  decision  to  act.  The  crucial 
questions  in  public  policy  making  are  as  follows: 
Who  make(s)  policy  and  how?  Where  does  power 
lie  in  the  policy  process?  What  interests  and 
demands  are  taken  into  account?  Who  ultimately 
benefits  from  policy  decisions  and  outcomes? 

Different  policy  models  and  theories  capture  the 
understanding  and  reality  of  policy  formulation  dif¬ 
ferently.  Some  doubt  whether  policy  occurs  entirely 
prior  to  action  or,  at  times,  is  a  post  hoc  rational¬ 
ization  of  action  already  taken.  Others  question  the 
scientific  rationality  of  policy  formulation.  Aaron 
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Wildavsky  considers  policy  analysis  an  art:  While 
seeking  to  delineate  a  field  of  discipline  for  policy 
science,  public  policy  needs  to  incorporate  more 
awareness  of  the  human  aspect  of  policy  making, 
emphasizing  the  political  choices  made  in  a  com¬ 
petitive  environment  and  the  social  relations  that 
sustain  them.  In  practice,  policy  formulation  has  to 
be  both  a  science  and  an  art.  It  has  to  pay  attention 
to  a  genuine  process  of  identifying  problems  and 
issues,  looking  for  cost-effective  solutions,  and 
formulating  deliverable  implementation  measures 
to  get  things  done.  At  the  same  time,  it  needs  to 
recognize  the  reality  that  public  policy  is  an  inter¬ 
active  process  among  political  actors  in  the  selec¬ 
tion  of  goals  and  the  means  to  achieve  them  within 
institutional,  resource,  and  power  constraints. 
Policy  actors  and  institutions  are  both  important 
and  so  are  the  agenda-setting  processes  of  incorpo¬ 
rating  problems  and  ideas  into  policy  formulation 
and  the  subsequent  implementation  of  decisions  by 
the  bureaucracy. 

Scope  of  Policy  Process 

The  study  of  policy  varies  from  the  analysis  of 
policy  (to  understand  its  nature  and  content,  how 
it  is  made  and  implemented,  and  policy  as  a  politi¬ 
cal  process)  to  the  analysis  for  policy  (i.e.,  policy 
research,  information  and  advice  that  can  lead  to  a 
case  for  action  to  be  taken).  The  identification  and 
recognition  of  problems  in  the  policy  formulation 
process  can  be  conceptualized  as  a  predecision,  as 
agenda  setting  serves  to  limit  the  number  of  sub¬ 
jects  attracting  attention  and  getting  on  the  deci¬ 
sion  makers’  agenda.  Exclusion  of  a  subject  from 
the  agenda  is  in  itself  a  negative  decision  as  far  as 
the  subject  is  concerned,  since  agenda  setting  is  as 
important  as,  if  not  more  important  than,  decision 
making  itself. 

Conceptually,  policy  formulation  can  be  sepa¬ 
rated  from  policy  execution  or  implementation,  but 
there  is  a  growing  literature  that  sees  the  two  as 
highly  intermingled  and  being  part  of  a  policy  con¬ 
tinuum.  The  rationalist  approach  sees  policy  as  an 
outcome  of  a  rational — even  scientific,  technical, 
and  nonproblematic — decision-making  process 
within  a  relatively  controlled  environment  based 
on  an  instrumental  “means-ends”  causality,  lead¬ 
ing  problems  to  their  solutions  and  decisions  to 
their  implementation.  Implementation  is  similarly 


taken  as  nonproblematic,  technical,  and  highly 
directed  and  controlled.  In  contrast,  the  realist  or 
political  approach  sees  policy  as  essentially  an  out¬ 
come  of  negotiation  among  political  actors  and 
institutions  based  on  their  competing  values,  inter¬ 
ests,  agendas,  strategies,  and  power  resources 
within  an  uncertain,  conflictual,  and,  at  times, 
turbulent  environment.  In  the  real  world,  the 
actual  process  of  policy  formulation  is  shaped  and 
defined  by  the  policy  institutions  and  bound  by  the 
external  policy-making  environment  and  prevail¬ 
ing  trends — local,  national,  regional,  as  well  as 
global.  Policy  formulation  is  triggered  by  the  need 
to  either  formulate  new  policy  or  change  existing 
policy  in  order  to  deal  with  newly  identified  or 
redefined  problems.  Policy  change  thus  captures  a 
spectrum  of  types  from  policy  innovation  and  suc¬ 
cession  to  maintenance  and  termination.  Sometimes 
policy  learning,  diffusion,  and  transformation  also 
generate  the  need  for  policy  formulation. 

Policy  analysts  also  study  the  policy  capacity  of 
any  polity,  government,  and  public  institution, 
which  refers  to  its  ability  or  competence  to  bring 
together  information  and  decision-making  power 
to  make  intelligent  choices  and  set  strategic  direc¬ 
tions.  Choices  are  related  to  values.  In  the  formula¬ 
tion  of  public  policy,  apart  from  the  necessity  to 
find  solutions  to  solve  problems,  it  is  often  argued 
that  it  has  to  adhere  to  some  core  values  that  are 
relevant  to  all  public  policies,  such  as  the  five  Es 
(economy,  efficiency,  effectiveness,  efficacy,  and 
equity);  some  would  add  a  sixth  E  (electability)  to 
indicate  the  need  for  any  policy  to  respond  to  pub¬ 
lic  demands  and  hence  secure  the  support  of  the 
electorate.  These  values  may  conflict  with  one 
another;  achieving  the  best  balance  tests  the  politi¬ 
cal  skills  as  well  as  priorities  of  policymakers. 
Hence,  policy  formulation  is  not  an  exact  science 
or  purely  a  matter  of  economic  rationality. 

Policy  Models 

Broadly  speaking,  there  are  two  diverse  ways  of 
interpreting  decision  making:  (1)  the  rationalist 
versus  (2)  the  realist  understanding  of  policy  mak¬ 
ing  and  implementation.  The  former  sees  policy  as 
intention,  while  the  latter  considers  policy  more  as 
outcome.  The  former  sees  policy  formulation  as  a 
linear  process,  whereas  the  latter  considers  it  as  an 
interactive  process.  The  former  confines  policy 
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formulation  to  policy  choice  in  response  to  a  prob¬ 
lem,  but  the  latter  also  includes  problem  recognition 
and  agenda  setting,  as  well  as  policy  implementa¬ 
tion,  which  delivers  the  real  impact  of  the  policy 
decision  made.  Within  this  spectrum,  four  major 
approaches  to  policy  formulation  have  been  devel¬ 
oped  in  the  policy  literature  over  the  past  several 
decades,  namely,  (1)  the  rational  model,  (2)  the 
incrementalist  model,  (3)  the  garbage  can  model, 
and  (4)  the  institutionalist  model. 

Rational  Model 

The  purest  form  of  rationalism  is  the  rational 
comprehensive  model,  which  assumes  some  “per¬ 
fect”  conditions  for  rational  policy  making  to  be 
scientific,  somewhat  akin  to  rational  choice  in 
economic  theory.  Such  conditions  are  premised  on 
a  clearly  instrumental  logic  and  sequence  of  rea¬ 
soning — namely,  defining  and  ranking  values, 
specifying  objectives  and  subobjectives  with  regard 
to  the  problem,  identifying  alternative  means  or 
options  for  solving  the  problem,  calculating  all  the 
consequences  of  adopting  each  option,  ranking  the 
various  options  according  to  set  criteria  consistent 
with  specified  values  and  objectives,  and  finally, 
choosing  the  most  effective  option  that  maximizes 
benefits  and  minimizes  costs.  The  rational  model 
possesses  attractive  theoretical  clarity  but  lacks 
behavioral  realism.  Its  comprehensive  form  suffers 
from  cognitive  limitations  (lack  of  knowledge, 
imagination,  and  skills),  psychological  limitations 
(fear  of  uncertainty,  selective  perceptions),  organi¬ 
zational  limitations  (limited  goals  and  horizons 
due  to  fixed  jurisdictions  and  standard  operating 
procedures),  and  resource  limitations  (lack  of 
resources  and  time). 

Herbert  Simon  and  others  argue  that  these 
shortcomings  could  not  be  made  up  by  more  dili¬ 
gent  and  comprehensive  analysis.  Decision  makers 
are  limited  in  computational  capacity  and  can  only 
search  very  selectively  through  large  volumes  of 
possibilities  to  discover  what  alternatives  of  action 
are  available  and  what  the  consequences  of  each  of 
these  alternatives  are.  The  search  is  bound  to  be 
incomplete  and  inadequate,  based  on  uncertain 
information  and  partial  ignorance,  and  is  usually 
terminated  with  the  discovery  of  satisfactory,  not 
optimal,  courses  of  action.  Hence,  they  should  go 
for  the  bounded  rationality  of  homo  pyschologicus 


in  place  of  the  full  rational  choice  of  homo  eco- 
nomicus  and  seek  to  “satisfice”  rather  than  to 
“maximize.” 

Incrementalist  Model 

The  opposite  of  the  rationalist  policy  model  is 
the  incrementalist  model,  made  famous  by  Charles 
Lindblom’s  science  of  muddling  through.  This 
model  sees  decision  making  as  interaction  among 
actors,  hence  it  is  a  political  rather  than  a  rational 
process  per  se.  It  regards  the  future  as  a  linear 
function  of  the  past,  somewhat  similar  to  path 
dependence  theory.  According  to  Lindblom,  goals 
and  values  are  rarely  clear  or  predetermined.  They 
change  according  to  available  resources  and  as  we 
learn  about  the  problem  through  analysis.  Hence, 
in  practice,  we  do  not  review  and  analyze  all  pos¬ 
sible  options  but  make  successive  limited  compari¬ 
sons  with  current  policy  and  situation.  Lindblom’s 
theory,  sometimes  known  as  disjointed  incremen¬ 
talism,  seeks  not  only  to  describe  the  reality  of 
policy  making  but  also  to  prescribe  incrementalism 
as  a  better  science.  Problem  solving  is  seen  as  itera¬ 
tive  (step  by  step).  His  pragmatic  advice  is  to  avoid 
being  comprehensive  and  exhaustive  but  to  look 
after  your  part  of  the  problem  and  let  others  look 
after  those  parts  that  are  assigned  to  them. 
Policymakers  are  urged  to  define  and  redefine  val¬ 
ues  and  objectives  in  light  of  experience  and  to 
deal  only  with  solvable  problems;  they  should  look 
at  solutions  only  marginally  different  from  the 
status  quo,  apply  “feasibility”  as  the  key  decision 
criterion,  and  adopt  a  remedial  approach  (to  fix 
what  is  wrong  with  current  policy). 

Critics  of  incrementalism  contend  that  it  is 
inherently  conservative  and  pro  status  quo,  so  it 
can  cope  only  with  stable,  not  changing,  situations 
or  crises  and  is  ill  equipped  to  help  with  new,  unfa¬ 
miliar  problems  or  to  define  problems  or  solutions 
that  are  unconventional  or  that  fall  outside  the 
circle  of  proximate  policymakers.  Some  even 
accuse  it  of  accurately  describing  the  “worst  prac¬ 
tice”  but  not  prescribing  the  “best  practice”  and 
being  at  best  applicable  to  policy  succession 
instead  of  policy  innovation  activities. 

Between  the  utopian  and  realist  poles,  some 
have  sought  to  find  a  middle  ground  to  embrace 
both  the  innovative  side  of  the  rational  model  and 
the  pragmatic  side  of  incrementalism.  For  example, 
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Amitai  Etzioni  argues  for  a  third  approach  in  the 
form  of  mixed  scanning.  Scanning  refers  to  the 
search,  collection,  processing,  and  evaluation  of 
information,  as  well  as  the  drawing  of  conclusions. 
In  mixed  scanning,  policy  making  takes  place  at 
two  levels — a  broad,  high-level  overview  of  prob¬ 
lems  and  directions  (the  “fundamental”  problems, 
which  call  for  the  rational  approach)  and  a  detailed 
examination  of  specific  problem  areas  (the  “incre¬ 
mental”  problems,  which  call  for  the  incremental¬ 
ist  approach).  This  approach  has  the  benefit  of 
both  a  comprehensive  overview  that  requires  full- 
range  and  in-depth  analysis  (back  to  first  princi¬ 
ples)  and  taking  incremental  steps,  which  requires 
incremental  and  low-level  adaptations  and  “fixing 
up.  ”  It  is  less  demanding  than  the  full  search  for  all 
possible  options  under  rationalism  but  more  stra¬ 
tegic  and  innovative  than  incrementalism.  While  it 
seems  to  have  an  intuitive  appeal,  even  Etzioni 
himself  admits  that  the  merit  of  mixed  scanning 
remains  to  be  empirically  substantiated. 

Garbage  Can  Model 

The  garbage  can  model — advanced  by  Michael 
Cohen,  James  March,  and  Johan  Olsen  (1972) — is 
sometimes  known  as  the  temporary  sorting  model. 
It  is  premised  on  certain  assumptions  about  the 
ambiguity  of  goals  and  values,  the  uncertainty  in 
knowledge  and  technology,  and  the  political  sym¬ 
bolism  of  policy  making,  such  as  political  rules  and 
procedures.  It  emphasizes  organizational  irratio¬ 
nality  or  what  Cohen  et  al.  call  organized  anar¬ 
chies.  According  to  them,  an  organization  is  a 
loose  collection  of  ideas  rather  than  a  coherent 
structure.  It  discovers  preferences  through  action 
more  than  it  acts  on  the  basis  of  preset  preferences, 
as  implied  in  linear  rational  planning.  In  the  pro¬ 
cess,  it  is  often  choices  looking  for  problems,  issues 
and  feelings  looking  for  decision  situations,  and 
solutions  looking  for  issues,  rather  than  the  other 
way  around.  In  this  model,  four  variables  are  pre¬ 
sumed  to  be  interacting  in  a  rather  complex  or 
even  random  manner:  (1)  a  stream  of  choices,  (2)  a 
stream  of  problems,  (3)  a  rate  flow  of  solutions, 
and  (4)  a  stream  of  energy  from  participants.  The 
garbage  can  process  is  one  in  which  problems, 
solutions,  and  participants  move  from  one  choice 
opportunity  or  situation  to  another.  Such  concep¬ 
tion  of  policy  formulation  may  well  score  high  on 


model  realism  and  applicability;  however,  it  is 
weak  in  terms  of  deductive  power. 

Institutionalist  Model 

Finally,  the  institutionalist  model  tries  to  steer  a 
more  independent  role  for  political  institutions  in 
policy  making  without  playing  down  the  impor¬ 
tance  of  both  the  social  context  of  politics  and  the 
motives  and  active  intervention  of  individual 
actors.  It  sees  organizational  change  as  a  contested 
process  involving  accidental  outcomes  and  ran¬ 
dom  activity,  suggesting  that  results  cannot  be 
predicted  and  change  cannot  be  controlled  by  fiat, 
thus  claiming  institutional  coherence  and  auton¬ 
omy.  Institutions  matter  not  only  because  they 
dispose  authority  and  power  but  also  because  they 
provide  the  physical,  cognitive,  and  moral  frames 
for  joint  action,  as  well  as  the  capacity  for 
intervention.  At  the  same  time,  institutions  also 
represent  certain  collective  conceptions  of  interpre¬ 
tation,  agenda,  memory,  rights  and  duties,  and 
even  values,  symbolism,  and  justice.  As  such,  there 
is  a  relevant  degree  of  institutional  consistency  and 
continuity  that  affects  the  process  and  outcome  of 
policy  formulation. 

Policy  and  Polity 

Different  theories  of  the  state  (or  polity)  provide 
different  interpretations  of  the  power  dimension  of 
the  policy  process.  The  pluralist  theory  sees  society 
as  made  up  of  diverse  individual  and  group  inter¬ 
ests,  some  organized  and  others  more  diffused. 
Government  decisions  are  an  outcome  of  competi¬ 
tion  and  bargained  compromise  or  consensus 
among  competing  demands  and  interests  operating 
within  a  liberal  democratic  system.  Policy  making 
is  thus  incremental  and  a  process  of  mutual  accom¬ 
modation  of  interests.  Most  would  agree  that  the 
political  real  world  is  more  elitist  and  less  inclusive 
than  is  implied  or  advocated  by  the  pluralist  doc¬ 
trine.  The  elitist  theory  sees  modern  industrial 
society  as  built  around  large,  hierarchical,  and  usu¬ 
ally  corporate  interests.  Politics  operate  in  the 
form  of  an  oligarchy;  political  or  policy  elites  exist 
who  engage  in  the  actual  bargaining  and  policy 
making. 

As  a  particular  form  of  elitist  theory  popular  in 
the  1980s,  the  corporatist  theory  sees  the  state  as 
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part  of  a  structure  of  interest  group  representation, 
where  it  accords  exclusive  corporate  groups  (usu¬ 
ally  peak  employers  and  employee  associations)  a 
monopoly  over  representing  dominant  social  inter¬ 
ests  (viz.,  capital  and  labor,  and  sometimes  also 
agriculture),  resulting  in  a  de  facto  arrangement 
existing  alongside  a  constitutional  system  of  repre¬ 
sentative  government.  Decision  making  and  con¬ 
sent  building  take  place  through  consultation, 
negotiation,  and  coordination  among  such  peak 
bodies.  Government  boards  and  commissions,  and 
“iron  triangles”  in  the  case  of  the  United  States 
(i.e.,  the  nexus  existing  among  congressional  com¬ 
mittees,  government  departments,  and  major  inter¬ 
est  bodies),  would  provide  the  formal  channels  for 
the  processing  and  intermediation  of  corporate 
interests  to  reach  a  collective  decision  or  a  consen¬ 
sus.  The  success  of  corporatist  policy  making 
depends  on  voluntary  cooperation  between  elites 
who  are  capable  of  withdrawing  from  any  bar¬ 
gains  struck.  Norms  and  informal  rules  emerge 
from  such  voluntary  bargaining  that  ought  to  be 
observed  by  the  corporate  interests  concerned. 

Opposed  to  the  liberal  democratic  political  tra¬ 
dition  is  the  Marxist  theory,  which  was  popular  in 
the  1960s  and  1970s  but  has  since  waned.  It  is 
premised  on  class-based  rationality  and  argues  that 
the  capitalist  state  exists  to  serve  the  interest  of 
capital  (though  not  necessarily  the  interests  of  indi¬ 
vidual  capitalists)  by  creating  the  conditions  for 
capital  accumulation  through  support  for  markets 
and  social  control  (the  instrumentalist  model). 
Neo-Marxists  have  added  that  the  capitalist  state 
also  supplies  goods  and  services  for  collective  con¬ 
sumption  so  as  to  manufacture  the  “legitimation” 
of  the  capitalist  order  (the  functionalist  model). 
The  state  autonomy  model,  on  the  other  hand,  pre¬ 
sumes  a  state-independent  policy-making  approach, 
where  state  officials  act  on  their  own  preferences 
with  regard  to  what  is  best  for  society,  based  on 
technocratic  and  bureaucratic  rationality.  This  may 
depict  a  more  authoritarian  form  of  polity  of  either 
the  extreme  right  (fascist,  military  dictatorship)  or 
the  extreme  left  (communist)  of  the  ideological 
divide. 

From  Problems  and  Ideas  to  Policy 

According  to  John  Kingdon,  problems,  politics, 
and  participants  all  contribute  to  agenda  setting. 


Not  all  adverse  conditions  get  recognized  as  prob¬ 
lems  that  warrant  something  be  done  about  them. 
Problem  identification  is  about  perception  and 
interpretation.  It  entails  political  competition 
among  various  interests  seeking  their  “preferred” 
condition  to  be  recognized  and  defined  as  a  prob¬ 
lem.  The  act  of  translating  a  condition  into  a  prob¬ 
lem  makes  use  of  the  changing  values  in  society  (the 
degree  of  mismatch  between  the  observed  condi¬ 
tions  and  one’s  conception  of  an  ideal  state),  com¬ 
parisons  (the  relative  gap  or  disparity  as  compared 
with  others,  such  as  cross-national  comparisons 
and  international  benchmarking,  to  give  it  a  sense 
of  legitimacy),  and  recategorization  (so  that  the 
previously  unrecognized  condition  can  be  reconsti¬ 
tuted  to  structure  people’s  perception  of  it). 
Problems  may  arise  as  a  result  of  a  focusing  event 
(e.g.,  a  major  disaster  or  crisis),  social  indicators, 
change  of  government  or  leadership,  or  a  combina¬ 
tion  of  these  factors.  However,  there  is  a  limit  to 
the  span  of  attention  of  the  public  as  well  as  poli¬ 
cymakers,  which  is  normally  short-lived.  Like  fash¬ 
ion,  problems  come  and  go.  If  there  is  a  failure  to 
address  the  problem,  the  result  may  be  frustration, 
and  after  a  while,  people  would  tend  to  shift  their 
focus  to  other  competing  issues  and  problems. 

Ideas  are  important  in  policy  formulation 
because  they  provide  new  ways  of  looking  at  the 
existing  situation  (e.g.,  making  some  conditions 
previously  considered  acceptable  as  unacceptable 
according  to  the  new  standards)  as  well  as  innova¬ 
tive  solutions  to  problems.  A  good  example  is  the 
rise  of  privatization  policies  in  the  1980s  due  to 
new  economic  rationalist  ideas  and  the  work  of 
neoliberal  think  tanks  that  operated  actively  on 
governments  and  parties.  The  real  world  of  politics 
abounds  in  policy  entrepreneurs  of  all  kinds — con¬ 
sultants,  politicians,  campaigners,  policy  advocates, 
and  think  tanks.  These  people  engage  in  idea  pro¬ 
motion  and  problem  redefinition  all  the  time 
through  publicity,  advocacy,  persuasion,  and  bar¬ 
gaining.  To  the  extent  that  a  condition  only  gets 
recognized  as  a  problem  if  a  solution  is  available, 
policy  entrepreneurs  who  come  up  with  ready  solu¬ 
tions  help  facilitate  building  the  consensus  needed 
for  problem  recognition.  Policy  advising  is  becom¬ 
ing  a  booming  activity  in  modern-day  policy  mak¬ 
ing,  but  it  is  effective  only  if  the  policy  climate  is 
conducive  and  receptive  policymakers  are  willing 
to  take  up  external  advice  and  prescriptions. 
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The  Politics  of  Decision 

The  garbage  can  model  suggests  that  policy  formu¬ 
lation  very  often  takes  the  form  of  solutions  chasing 
the  problems;  unless  solutions  are  readily  available, 
policymakers  would  be  reluctant  to  recognize  con¬ 
ditions  as  problems.  Kingdon,  too,  argues  that 
policy  making  depends  on  the  coupling  of  problems 
and  policy  solutions  and  of  problems  and  politics. 

Problems  are  recognized  and  defined  according 
to  the  processes  that  are  different  from  the  ways 
policies  are  developed  or  political  events  unfold. 
Policy  proposals  emerge  according  to  their  own 
incentives  and  selection  criteria,  irrespective  of 
whether  they  are  solutions  to  problems  or  respon¬ 
sive  to  particular  political  consideration.  On  the 
other  hand,  political  events  move  on  their  own 
schedule  and  according  to  their  own  rules  and 
dynamics,  with  no  necessary  or  even  logical  rela¬ 
tionship  to  problems  or  proposals.  The  three 
streams  (problem,  policy,  and  politics)  run  parallel 
and  independently  of  each  other  until  a  window  of 
opportunity  occurs  in  a  situation  where  they  are 
joined  or  coupled — a  pressing  problem  demands 
attention  and  a  policy  proposal  is  somehow 
around  that  is  coupled  to  the  problem  as  a  solu¬ 
tion;  or  an  event  in  the  political  stream  (e.g.,  a 
change  of  government  or  a  major  focusing  event  in 
the  form  of  a  calamity)  calls  for  different  direction. 
At  that  point,  proposals  that  fit  with  that  political 
event  come  to  be  recognized  and  are  coupled  with 
the  ripe  political  climate.  Similarly,  problems  that 
fit  available  policies  are  acknowledged,  while  oth¬ 
ers  remain  to  be  neglected  politically. 

Given  that  policy  formulation  is  an  interactive 
process,  the  policy  demanders  (i.e.,  clients  and  the 
interest  groups),  policy  entrepreneurs/advocates, 
and  policy  implementors  (i.e.,  the  bureaucracy) 
have  a  role  to  play  in  shaping  the  policy  process 
and  its  outcome.  Contemporary  policy  discourse 
recognizes  the  strong  influence  of  pressure  groups, 
policy  communities,  policy  and  issue  networks, 
and  what  are  known  as  iron  triangles  or  subgov¬ 
ernments  in  the  United  States.  A  policy  community 
can  be  either  a  formal  structure  or  informal  net¬ 
work  characterized  by  restricted  membership,  sta¬ 
ble  relationships,  vertical  interdependence,  and 
insulation  from  other  networks  and  institutions. 
Policy  and  issue  networks,  on  the  other  hand,  com¬ 
prise  a  larger  number  of  participants  with  varying 
degrees  of  mutual  commitment  or  dependence  on 


others  in  their  environment.  Most  policy  issues 
and  disputes  are  resolved  in  the  relatively  private 
and  specialized  worlds  of  the  policy  sectors  through 
the  relevant  policy  “professionals”  from  both  gov¬ 
ernment  agencies  and  interest  groups.  In  other 
words,  there  has  been  a  bureaucratization  or  pro¬ 
fessionalization  of  policy  formulation,  whereby 
civil  servants  and  interest  group  officials  work 
together  as  policy  elite  to  produce  mutually  satis¬ 
factory  outcomes. 

The  notion  of  iron  triangles  and  subgovern¬ 
ments  presumes  a  stable  and  largely  autonomous 
set  of  participants,  coalesced  to  control  fairly  nar¬ 
row  policy  programs  and  operating  to  further  the 
direct  economic  interest  of  each  party  to  the  alli¬ 
ance  of  this  small  circle.  A  related  model  is  the 
advocacy  coalition  framework,  which  recognizes 
two  to  four  advocacy  coalitions,  typically  active  in 
a  particular  policy  domain  where  groups  coalesce 
around  a  shared  set  of  values  and  interests.  Policy 
brokers  would  mediate  the  competition  among 
these  coalitions  within  relatively  stable  systems 
and  based  on  certain  ground  rules.  Similar  to  cor- 
poratist  policy  making,  decisions  tend  to  be  reached 
through  a  process  of  negotiation,  by  accommodat¬ 
ing  established  interests  and  reaching  a  consensus, 
while  maintaining  existing  forms  of  relations 
between  interest  groups  and  government  agencies 
within  the  particular  policy  community.  Hence,  as 
A.  Grant  Jordan  and  Jeremy  Richardson  observe, 
changes  in  policy  are  always  marginal,  and  deci¬ 
sion  making  is  essentially  incremental. 

State  managers  (whether  general  or  professional 
bureaucracies)  are  a  key  party  to  such  an  alliance, 
because  without  their  involvement,  it  would  be  dif¬ 
ficult  to  secure  any  problem  recognition  in  the 
agenda-setting  process.  They  are  also  the  ones  who 
eventually  implement  policy  decisions  and  are  thus 
in  a  position  to  influence  policy  formulation  in  a 
backward-mapping  manner.  For  these  state  actors, 
entering  into  formal  or  informal  alliance/network 
with  other  stakeholders  helps  ensure  that  the  pol¬ 
icy  process  is  managed  and  within  control,  giving 
them  greater  clout  when  advising  their  political 
masters.  As  well  documented  in  the  literature  on 
bureaucracy,  state  managers  are  not  just  passive 
instruments  of  implementation.  They  seek  to  influ¬ 
ence  policy  making  through  defining  the  problems 
and  the  range  of  policy  options,  determining  what 
can  be  done  and  what  cannot  be  implemented  for 
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various  technical  and  administrative  reasons,  and 
in  general  tendering  policy  advice  in  anonymity. 

Finally,  the  recent  rise  of  civil  society  activists 
and  new  Internet  platforms  such  as  Facebook  and 
Twitter  has  brought  about  large  numbers  of 
unconventional  and  unorganized  participants, 
breaking  down  the  existing  rules  and  norms  that 
govern  policy  communities  and  networks.  This 
makes  the  policy  scene  more  diverse  and  over¬ 
crowded  and  poses  an  unexpected  challenge  to  the 
hitherto  corporatist  nature  of  policy  making. 

Anthony  B.  L.  Cheung 
Hong  Kong  Institute  of  Education 
Tai  Po,  New  Territories,  Hong  Kong 
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Policy  Framing 


Policy  framing  is  a  concept  used  in  public  policy 
and  social  movement  theory  to  explain  the  process 
by  which  actors  seek  to  understand  and  act  on 
complex  situations.  The  policy  framing  process 
involves  policy  actors  (a)  confronting  a  situation 
where  the  understanding  is  problematic  and 
uncertain,  (b)  creating  an  understanding  or  story 
that  helps  analyze  and  make  sense  of  the  situation, 
and  (c)  then  acting  (and  persuading  others  to  act) 
on  it.  Its  basic  premise  refutes  the  notion  that  dif¬ 
ferent  individuals  can  observe  the  same  social  and 
natural  phenomena  and  necessarily  arrive  at  the 
same  conclusions.  Because  the  framing  of  the  situ¬ 
ation  requires  the  assessment  of  the  potential  roles 
of  other  policy  actors,  framing  will  define  the 
degree  to  which  other  potential  actors  are  included 
and  benefit  from  the  policy  process  and  policy 
decisions.  Marginalized  groups  are  more  likely 
to  contest  a  particular  frame  and  promote  a 
counterframe. 

Policy  framing  is  a  midrange  ideational  concept 
that  focuses  on  how  humans  conceptualize  the 
world  and  how  the  elements  of  the  world  engage 
with  each  other  and  the  policy  actors  themselves. 
It  captures  a  notion  of  ideas  that  is  more  focused 
on  the  inter  subjective  understanding  by  a  group  of 
policy  actors  as  compared  with  ideas,  observa¬ 
tions,  and  arguments  held  and  posed  by  individu¬ 
als.  It  is  also  focused  on  particular  policy  problems 
and  situations  when  compared  with  the  more 
encompassing  focus  on  wider  ideologies  and 
philosophies. 

Policy  framing  is  a  concept  that  places  an 
emphasis  on  the  dynamics  of  change  and  the  con¬ 
stantly  changing  and  contested  notion  of  policy 
reality.  In  this,  it  contrasts  strongly  with  policy 
concepts  such  as  institutions,  which  tend  to  empha¬ 
size  rules  and  norms  that  are  slow  to  change  over 
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time.  Conflicting  policy  frames  mobilize  policy 
actors  to  contest  the  status  quo.  Policy  frame 
analysis  also  has  a  fundamentally  different  set  of 
assumptions  from  the  rational  choice  perspectives 
that  tend  to  assume  that  while  circumstances  may 
change,  how  actors  calculate  their  interest  will  not. 
The  framing  approach  would  not  dismiss  the 
importance  of  interest  calculation,  as  will  be 
explained  below,  but  it  would  emphasize  how 
actors  may  “key”  on  different  core  elements  at  dif¬ 
ferent  times. 

Framing  has  been  used  in  a  wide  range  of  disci¬ 
plines,  including  psychiatry  and  psychology.  The 
concept  of  framing  has  attracted  considerable 
interest  in  public  policy  analysis  as  well  as  social 
movement  studies  in  sociology  and  politics  because 
it  incorporates  constructivist  understandings  of 
how  actors  shape  the  meanings  of  their  experi¬ 
ences.  At  the  same  time,  it  operates  on  a  level  that 
is  not  much  distant  from  more  positivist  method¬ 
ologies  and  approaches  to  political  and  social 
action.  Thus,  some  scholars  have  used  the  frame 
approach  in  combination  with  other,  more  construc¬ 
tivist  or  positivist  approaches,  such  as  institution¬ 
alism  and  advocacy  coalitions,  as  well  as  positivist 
methodologies.  For  example,  there  is  a  tradition  in 
the  study  of  political  behavior  of  examining  fram¬ 
ing,  focusing  on  topics  such  as  quantitative  studies 
of  how  the  media  shape  public  opinion. 

In  the  public  policy  field,  scholars  have  used  the 
framing  concept  to  study  efforts  by  decision  mak¬ 
ers  to  shape  understanding  and  courses  of  action  in 
areas  where  knowledge  is  uncertain  and/or  where 
there  are  substantial  differences  and  conflicts  over 
interpretation  and  action.  Thus,  policy  framing 
has  featured  significantly  in  studies  of  environmen¬ 
tal  policy,  contested  social  policies  such  as  refugee 
and  women’s  rights,  economic  policy,  knowledge- 
intensive  areas,  media  and  public  opinion,  health 
policy,  racial  policy,  foreign  policy,  agricultural 
policy,  poverty  and  education,  and  so  forth. 

This  entry  examines  the  two  key  strands  of  the 
framing  approach  (the  public  policy  and  social 
movement  conceptualization  of  framing),  focusing 
on  the  key  intellectual  propositions  and  methodol¬ 
ogies.  While  not  strictly  devoted  to  policy  analysis, 
the  social  movement  research  provides  a  number  of 
insights  on  how  politics  and  policy  are  shaped  by 
groups  in  contestation.  Because  of  the  focus  on 
essentially  the  same  concept,  these  two  literatures 


provide  insights  for  each  other.  At  the  same  time, 
scholars  from  both  communities  sometimes  can 
speak  past  each  other  because  their  main  reference 
points  are  in  their  own  groups,  so  it  is  helpful  to 
understand  the  development  and  arguments  of 
both  approaches. 

The  Evolution  of  the  Framing  Concept 

John  Noakes  and  Hank  Johnston  give  credit  to 
Gregory  Bateson  for  first  using  the  term  in  a  dis¬ 
cussion  of  epistemology  and  animal  behavior. 
Erving  Goffman’s  1974  monograph  first  brought 
the  concept  of  framing  onto  the  social  science 
agenda.  For  Goffman,  frames  were  cognitive  struc¬ 
tures  that  humans  generated  to  explain  events  and 
their  involvement  in  them.  Frame  approaches  have 
evolved  in  analytical  depth  and  focus  since 
Goffman’s  original  statement  of  the  concept.  This 
formulation  assumed  that  humans  were  for  the 
most  part  unconscious  of  the  framing  process, 
which  was  constituted  by  rituals  of  everyday  life 
that  defined  individuals  and  their  relations  to  oth¬ 
ers.  Much  analysis  focuses  on  how  individuals  key 
on  specific  elements  of  an  event  to  understand 
what  is  going  on  and  how  they  should  behave;  it 
nevertheless  also  emphasizes  that  humans  have  the 
ability  to  transform  their  realities. 

Both  in  sociology  and  political  science,  the 
emphasis  has  moved  toward  understanding  how 
actors  actively  and  consciously  seek  to  build  and 
defend  frames  and  how  they  project  their  frames 
onto  others.  This  enables  the  policy  frame  analysis 
to  be  used  as  a  critical  theory,  highlighting  actors’ 
efforts  to  refute  a  particular  representation  of 
social  reality. 

Social  Movement  Framing 

Arguably,  the  academics  using  this  sociological 
approach,  when  compared  with  public  policy 
scholars,  have  made  earlier  advances  in  developing 
a  more  systematic  elaboration  of  the  concepts  of 
framing.  Social  movement  theorists  use  the  frame 
concept  to  explain  how  actors  in  a  movement 
define  what  is  happening  in  a  situation  to  mobilize 
collective  action  by  the  movement.  This  approach 
supplanted  much  of  the  rationalist  and  organiza¬ 
tional  approaches  in  the  1980s  and  became  one  of 
the  core  approaches  to  social  movements  in  the 
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1990s.  Noakes  and  Johnston  credit  William 
Gamson,  David  Snow,  Robert  Benford  and  their 
various  collaborators  who  have  sought  to  system¬ 
atize  this  process. 

In  their  1988  statement  on  framing,  Snow  and 
Benford  provided  the  most  prevalent  typology  of 
frames.  To  mobilize  individuals  to  act  in  a  social 
movement,  the  framing  process  involves  three 
framing  activities.  First,  there  must  be  a  diagnostic 
framing  of  current  events  that  seeks  to  discredit  the 
prevailing  framing  and  offer  a  new  interpretation. 
Second,  prognostic  framing  involves  the  creation 
of  a  solution  to  the  problem/situation  that  the 
movement  can  take  forward.  Finally,  motivational 
framing  focuses  on  the  conceptualization  that 
actually  triggers  people  to  join  the  social  move¬ 
ment,  making  the  assumption  that  such  communi¬ 
cation  and  persuasion  are  necessary  beyond  the 
first  two  framing  actions. 

Using  such  conceptual  tools,  social  movement 
scholars  expect  entrepreneurial  leaders  to  take  on 
such  framing  roles  to  arouse  the  wider  public.  In 
doing  so,  these  leaders  will  seek  to  link  particular 
actions  and  events  as  evidence  of  injustice  or  some 
other  unsatisfactory  reality  and  to  frame  an  appro¬ 
priate  response.  The  leadership  will  also  endeavor 
to  highlight  particular  aspects  of  the  frames  to 
make  them  resonate  more  with  the  potential  fol¬ 
lowers  of  the  movement.  This  notion  of  social 
movement  framing  resonating  with  the  followers 
depends  on  a  number  of  different  elements,  includ¬ 
ing,  for  example,  how  credible  the  arguments  and 
linkage  seem  to  the  potential  membership  and  how 
closely  this  social  movement  frame  chimes  with  the 
belief  systems  of  the  targeted  audience. 

Various  kinds  of  research  have  been  under¬ 
taken  to  study  the  framing  processes  at  work.  The 
work  has  tended  to  take  the  form  of  case  studies 
of  particular  social  movements  at  particular  junc¬ 
tures.  The  overall  tendency  reflects  the  epistemo¬ 
logical  assumptions  of  much  of  this  field,  with  its 
attempt  to  understand  collective  beliefs  about 
reality,  as  well  as  the  ambiguity  inherent  in  such 
assumptions. 

Accordingly,  there  has  been  a  general  concern 
to  push  for  a  more  systematic  examination  of  pro¬ 
cesses  while  acknowledging  the  tendency  toward 
proliferation  of  concepts,  often  used  in  a  largely 
descriptive  fashion.  Nevertheless,  in  the  early  21st 
century,  a  significant  body  of  literature  on  political 


behavior  sought  to  empirically  isolate  these  frames 
and  processes  and  determine  their  impact  through 
quantitative  analysis  of  data  from  surveys  or 
experiments. 

Policy  Framing 

Concepts 

In  the  1980s  and  early  1990s,  Martin  Rein  and 
Donald  Schon  provided  many  of  the  key  state¬ 
ments  that  triggered  the  interest  in  policy  analysis 
for  framing.  The  context  of  their  effort  was  a  belief 
that  the  more  rationalist  orientation  of  policy  stud¬ 
ies  was  unable  to  get  at  core  aspects  of  policy  deci¬ 
sion  making.  A  core  puzzle  for  them  was  the 
intractable  policy  controversy  that  no  presentation 
of  facts  and  evidence  could  reconcile. 

Rein  and  Schon  attribute  this  inability  to  recon¬ 
cile  such  policy  positions  to  the  existence  of  con¬ 
flicting  policy  frames.  These  frames  involve  stories 
or  structures  of  understanding  defined  by  human 
belief  and  perception.  Faced  with  a  contested  or 
uncertain  policy  problem,  different  actors  will 
seize  on  different  elements  and  linkages  between 
these  elements  to  construct  very  different  views  of 
reality.  Part  of  the  difficulty  in  inducing  a  recon¬ 
ciliation  of  such  frames  is  their  tacit  nature,  which 
prevents  them  from  being  actively  scrutinized  and 
assessed  by  both  their  proponents  and  opponents. 
Furthermore,  because  the  frames  will  view  differ¬ 
ent  events  and  observations  as  central,  there 
appears  to  be  no  objective  reality  and,  therefore, 
no  potential  for  falsifying  a  particular  frame  in 
favor  of  another.  This  reality  gives  one  insight  into 
the  difficulty  that  climate  change  policy  activists 
face  in  persuading  climate  change  “deniers”  to 
accept  the  evidence  of  the  anthropogenic  changes 
to  Earth’s  atmosphere  and  the  difficulty  the  deniers 
face  in  winning  over  the  activists.  Actors  tend  to 
take  such  frames  for  granted  and  are  unaware  of 
how  they  define  the  policy  position  they  adopt. 

Of  course  this  notion  of  contesting  realities 
existed  long  before  in  policy  analysis  under  a  dif¬ 
ferent  terminology.  Since  the  1970s,  researchers  on 
agenda  setting,  such  as  Robert  Cobb  and  Charles 
Elder,  have  emphasized  the  importance  of  under¬ 
standing  how  the  definition  of  a  problem  shapes 
the  subsequent  policy  outcome.  In  the  1990s,  a 
number  of  scholars  found  a  window  of  opportu¬ 
nity  for  the  policy  frame  concept  as  a  way  to 
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bridge  the  constructivist  and  positivist  divide,  an 
approach  that  was  furthered  by  Rein  and  Schon’s 
incorporation  of  notions  of  discourse. 

Delving  into  Rein  and  Schon’s  1994  conceptu¬ 
alization  more  deeply,  the  authors  accept  that 
there  is  an  important  role  for  actor  interests  and 
how  they  relate  to  actor  frames;  this  inherent 
ambiguity,  the  relationship  between  interests  and 
ideas,  is  an  enduring  question  for  all  policy  analy¬ 
sis.  Rein  and  Schon  do  not  assume  that  these  con¬ 
cepts  are  identical,  but  they  accept  that  frames  may 
help  define  interests  and  that  frames  may  be  under¬ 
taken  to  benefit  particular  interests.  Nevertheless, 
the  core  reality  is  that  the  frames  will  be  instru¬ 
mental  to  defining  what  is  in  the  interest  of  the 
actors.  Institutions  and  other  organizations  push  a 
particular  frame  or  set  of  frames  because  individ¬ 
ual  members  sponsor  those  frames. 

Policy  discourse  is  the  mechanism  by  which 
frames  contest  the  understanding  of  a  policy  prob¬ 
lem  within  any  policy  forum.  Rein  and  Schon 
define  this  as  essentially  the  dialogue  that  exists 
between  different  parties  over  a  particular  issue. 
They  differentiate  two  forms  of  policy  discourse 
and  two  associated  frames.  The  first  discourse 
involves  the  policy  debate  in  which  the  different 
policy  contestants  seek  to  prevail  with  their  policy 
stories  and  frames  through  various  rhetorical 
efforts  involving  persuasion,  evidence,  and  sym¬ 
bols.  Rhetorical  frames  will  underpin  the  argu¬ 
ments  in  the  policy  debate.  The  second  discourse 
involves  the  debate  over  policy  practice  in  which 
the  actors  argue  and  develop  policy  stories  that 
influence  the  creation  of  procedures,  particular 
policy  instruments,  and  so  forth  to  deliver  the 
policy.  Action  frames  inform  this  discourse. 

Within  action  frames,  Rein  and  Schon  differen¬ 
tiate  three  types:  (1)  policy  frames,  (2)  institutional 
action  frames,  and  (3)  metacultural  frames.  Policy 
frames  are  those  created  by  institutional  actors  to 
define  the  problem  in  a  particular  policy  situation. 
Institutional  action  frames  represent  the  more  gen¬ 
eral  action  frame  used  by  institutional  actors  to 
structure  a  wide  range  of  policy  situations — this 
institutional  action  frame  concept  bears  some 
resemblance  to  policy  styles  and  even  standard 
operating  procedures.  The  institution  has  charac¬ 
teristic  ways  of  addressing  policy  problems  that 
the  public  can  anticipate.  A  metacultural  frame, 
the  broadest  concept,  is  essentially  the  worldview 


adopted  by  a  particular  culture  or  community. 
These  metacultural  frames  shape  the  policy  stories 
that  inform  both  the  rhetorical  and  the  action 
frames. 

Rein  and  Schon  view  the  key  policy  act  as  indi¬ 
viduals  becoming  aware  of  their  rhetorical  and 
action  frames,  a  process  they  defined  in  1994  as 
the  “construction”  of  the  frame.  They  urge  policy 
actors  to  develop  a  higher  degree  of  reflection, 
seeking  to  learn  more  about  their  actions  during 
the  process  of  making  decisions.  This  should  lead 
actors  to  a  process  of  reframing  in  which  they  use 
reflection  and  discourse  in  a  manner  that  leads  to 
a  resolution  of  policy  controversies  and  conflict. 
When  actors  engage  with  their  situation  and  with 
each  other  through  discourse,  they  may  recognize 
incongruities,  creating  a  situation  that  forces  peo¬ 
ple  to  reflect  on  their  beliefs  and  frames. 

As  Falk  Daviter  has  noted,  this  emphasis  on  the 
potential  for  rational  consensus  building  contrasts 
with  policy  approaches  that  focus  on  the  policy 
outcomes  occurring  through  conflict  and  competi¬ 
tion.  Rein  and  Schon  do  not  take  this  rational 
process  of  frame  reflection  for  granted.  They 
expect  it  to  be  extremely  difficult  and  recognize 
that  frame  reflection  may  not  lead  to  reframing, 
nor  will  reframing  necessarily  require  it.  The 
authors  formulate  the  possibility  that  academics 
can  aid  practitioners  in  developing  reciprocal 
frame  reflective  processes  as  well  as  the  mutual 
trust  required  for  practitioners  to  expose  them¬ 
selves  to  such  processes. 

Methods 

The  core  of  understanding  in  any  given  frame 
involves  deconstructing  key  elements  from  the 
policy  rhetoric  and  actual  policy  documents  and 
instruments  as  well  as  accounts  of  individual  play¬ 
ers  (and  also  the  characteristics  and  interests  of 
these  players).  In  these  sources,  students  can  dis¬ 
cern  the  policy  stories  that  the  various  actors  use  to 
analyze  particular  policy  situations  and  isolate  the 
frames  accordingly.  Because  any  given  policy  frame 
and  policy  situation  (particularly  conflictual  ones) 
are  too  complex  to  be  understood  by  one  individ¬ 
ual  and  are  subject  to  multiple  interpretations, 
researchers  must  develop  an  account  of  the  policy 
situation  from  a  number  of  perspectives.  As  one 
would  expect,  the  inherent  difficulty  of  isolating 
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intersubjective  constructions  of  reality  by  its  very 
nature  poses  enormous  practical  problems  for  the 
researcher.  How  does  one,  for  instance,  differenti¬ 
ate  a  dissembling  rhetorical  frame  from  the  core 
action  frame  when  the  actors  themselves  may  come 
to  believe  their  own  rhetorical  devices?  What  hap¬ 
pens  when  frames  are  translated  and  reshaped  at 
various  different  levels  of  a  policy  system — or  when 
there  are  conflicts  between  those  who  operate 
within  a  particular  frame?  An  overarching  frame 
does  not  have  to  be  uniform  and  indeed  is  unlikely 
to  be  so.  Perhaps  equally  significant,  the  research¬ 
ers  themselves  necessarily  bring  their  own  frames, 
which  shape  how  they  perceive  and  analyze  these 
phenomena,  into  such  a  research  endeavor. 

The  basic  policy  approach  that  scholars  have 
taken  involves  isolating  both  the  policy  rhetoric 
and  policy  actions  over  time  and  across  all  the 
relevant  levels  of  governance.  These  demands  for 
policy  detail  and  nuance  have  led  to  the  bulk  of  the 
policy  research  using  the  qualitative  method  of  a 
detailed  case  study.  Given  the  effort  involved  in 
studying  such  processes  over  time  and  multiple 
levels,  the  research  in  this  area  has  generally  taken 
the  form  of  a  very  limited  number  of  case  studies, 
which  are  also  in  evidence  in  the  social  movement 
research.  Much  of  this  research  is  clear  in  its  view 
that  framing  indeed  matters  and  drives  policy. 

However,  as  also  witnessed  in  the  social  move¬ 
ment  research,  a  substantial  body  of  literature 
tends  to  use  the  framing  terminology  to  express  a 
basic  metaphor,  much  in  the  same  way  that  some 
scholars  use  the  concept  of  networks  as  descriptive 
shorthand  to  refer  to  people  who  interact  over  time 
in  a  particular  political  setting.  As  noted  earlier, 
numerous  scholars  tend  to  use  framing  as  part  of  a 
larger  analytical  construct,  such  as  institutional 
analysis,  advocacy  coalitions,  garbage  can  models, 
and  so  forth.  This  entry  emphatically  does  not  dis¬ 
miss  such  efforts  as,  for  instance,  framing  and 
reframing  seem  to  require  more  theorizing  about 
intersubjective  activity  between  like-minded  parties 
and  more  conflicting  actors.  How  are  frames 
inserted  into  institutional  processes  by  various 
actors  and  groups  of  actors?  Whether  this  takes  the 
form  of  discourse  coalitions,  epistemic  communi¬ 
ties,  advocacy  coalitions,  or  some  other  concept, 
some  form  of  networking  processes  is  required. 

While  the  descriptive  and  the  multiconcep- 
tual  approaches  can  enhance  our  insights  about 


particular  policy  problems,  there  is  a  danger  that 
the  elaboration  and  evaluation  of  the  concept  will 
remain  largely  at  a  general  conceptual  stage. 
Systematic  studies  of  multiple  cases  seem  to  be 
the  next  step  to  isolate  key  elements  of  the  fram¬ 
ing  process  and  determine  how  particular  condi¬ 
tions  shape  their  evolution. 
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Policy  Instruments 


The  study  of  policy  instruments  dates  from  the 
early  1970s,  though  much  had  been  written  previ¬ 
ously,  especially  in  economics,  about  government 
intervention  in  relation  to  market  imperfections. 
A  policy  instrument  refers  to  the  means  of  govern¬ 
ment  intervention  in  markets  or,  from  a  broader 
perspective,  society  to  accomplish  goals  or  to 
solve  problems.  The  behavioral  assumption  under¬ 
lying  a  policy  instrument  is  that  it  attempts  to  get 
people  do  things  that  they  might  not  otherwise 
have  done.  Since  the  1960s,  there  has  been  a 
transformation  not  only  in  the  scope  and  scale  of 
the  role  of  the  government  but  also  in  the  prolif¬ 
eration  of  tools  that  it  has  at  its  disposal  for  public 
action.  In  retrospect,  a  distinction  can  be  made 
between  three  partly  overlapping  arenas  in  the 
study  of  policy  instruments. 

This  entry  starts  with  a  discussion  of  several 
approaches  to  defining  what  policy  instruments 
are.  Policy  instruments  are  not  isolated  but  exist  in 
relationship  to  objectives  as  well  as  resources;  tak¬ 
ing  this  into  account,  the  next  section  of  the  entry 
considers  theories  about  how  a  policy  instrument 
should  be  chosen.  A  classification  of  policy  instru¬ 
ments  is  presented,  differentiating  between  three 
families  of  policy  instruments.  Next,  the  entry 
examines  the  choice  of  policy  tools  (policy  design), 
the  application  of  policy  tools  (policy  implementa¬ 
tion),  and  the  second  generation  of  less  coercive 
and  more  indirect  policy  instruments  inspired  by 
the  limited  impact  of  earlier  policy  tools.  The 
assessment  of  the  impact  of  policy  instruments 
(policy  evaluation)  is  complicated  by  the  fact  that 
they  often  come  as  part  of  a  policy  mix;  the  entry 
describes  policy  evaluation,  noting  that  effective¬ 
ness  is  just  one  of  the  many  criteria  by  which  to 
measure  performance.  The  entry  concludes  with  a 
look  at  the  way  ahead. 

Approaches  to  Policy  Instruments 

The  classical  approach  to  defining  policy  instru¬ 
ments  (1970-1985),  rooted  in  the  instrumentalist 
school  of  thought,  has  been  largely  supplanted 
because  of  its  top-down  and  mechanical  view  of 
the  world.  It  put  quite  some  effort  into  a  rather 
semantic  discussion  about  the  definition  of  a  policy 


instrument,  the  risk  of  metaphors  and  reification, 
and  the  classification  of  policy  instruments  on  the 
basis  of  their  intrinsic  characteristics.  It  claimed 
that  the  selection  and  consequently  the  application 
of  instruments  are  made  mainly  on  the  basis  of  the 
characteristics  of  a  specific  instrument  and  its 
effects  in  terms  of  goal  attainment.  Soon  it  became 
clear  that  goal  attainment  cannot  be  attributed 
solely  to  the  characteristics  of  instruments  but  that 
the  characteristics  of  the  context  should  also  be 
taken  into  account. 

The  instrument-context  approach  (1985-1995) 
focused  on  the  development  of  a  theory  of  policy 
instruments  that  would  enable  policymakers  to 
select  the  appropriate  policy  instrument  for  the 
problem  at  hand.  The  basis  for  selection  of  a  policy 
instrument  is  not  only  the  “logic  of  consequence” 
but  also  the  “logic  of  appropriateness,”  which 
includes  “goodness  of  fit.”  The  attention  for  the 
context  has  led  to  what  may  be  labeled  “refined 
instrumentalism.  ” 

Finally,  instruments  are  considered  one  of  the 
many  variables  in  the  contextual  approach  (1995- 
present)  that  takes  policy  implementation  as  a 
point  of  departure.  The  study  of  policy  instru¬ 
ments  merges  with  the  study  of  implementation. 
The  study  of  policy  instruments  has  been  affected 
as  such  by  the  developments  in  the  field  of  imple¬ 
mentation,  notably  the  shift  from  implementation 
to  governance. 

Policy  Theory 

A  policy  instrument  comes  not  in  isolation  but  as 
part  of  intervention  theory.  It  can  be  further  speci¬ 
fied  by  identifying  the  form  of  the  intervention, 
that  is,  a  policy,  a  program,  or  a  reform.  In  this 
context,  attention  is  paid  to  a  policy  theory,  that  is, 
the  set  of  assumptions  underlying  a  policy  instru¬ 
ment.  A  policy  theory,  following  Andries  Hooger- 
werf,  consists  of  assumptions  about  three  kinds  of 
relations: 

1.  normative  relations  between  what  “is”  and 
what  “ought  to  be,” 

2.  relations  between  the  objectives  and  means  to 
accomplish  these  objectives,  and 

3.  more  in-depth  relations  between  causes  and 
effects. 
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The  study  of  policy  considers  the  causal  rela¬ 
tionship  between  a  policy  instrument  and  the  pur¬ 
pose  for  which  it  is  used.  To  put  it  differently,  goal 
attainment  is  considered  to  be  the  dependent  vari¬ 
able  while  instruments  are  seen  as  one  of  the  inde¬ 
pendent  variables.  It  should  be  underscored, 
though,  that  a  policy  theory  does  not  reflect  a  sci¬ 
entific  theory  in  the  traditional  sense  of  a  universal 
law  that  has  been  empirically  tested.  The  external 
validity  of  the  findings  of  a  policy  theory  is  fairly 
low.  A  policy  theory  provides  guidelines  for  action 
and  is,  as  such,  ideographic  rather  than  nomologic 
in  nature.  As  they  are  often  in  conflict,  a  policy 
theory  is  often  the  outcome  of  a  trade-off  and  con¬ 
sequently  suboptimal  in  nature.  The  quality  of  a 
policy  theory — coherence,  reliability,  validity — is 
an  important  source  of  a  policy  fiasco. 

Shapes  and  Sizes 

The  means  of  tackling  policy  problems  are  often 
called  policy  instruments  or  policy  solutions.  The 
labels  are  not  without  risks.  A  comparison  of  a 
policy  instrument  with  a  hammer,  a  pair  of  pin¬ 
cers,  or  a  screwdriver  may  be  misleading  because  it 
attributes  qualities  to  means  that  they  do  not  have 
in  reality.  Consequently,  it  may  lead  to  unintended 
and  unforeseen  problems  that  may  even  over¬ 
shadow  the  original  problems.  The  same  applies 
for  the  term  policy  solutions  as  it  gives  the  wrong 
impression  that  problems  may  be  solved  once  and 
for  all.  In  reality,  problems  are  solved  at  best  partly 
and  temporarily.  Moreover,  the  solution  of  one 
problem  may  create  another  problem,  as  Aaron 
Wildavsky  has  argued:  Policies  deal  less  with 
events  in  society  than  with  the  consequences  of 
policies  from  the  past.  Instead  of  permanent  solu¬ 
tions,  one  should  think  of  permanent  problems  in 
the  sense  that  one  problem  always  succeeds  and 
replaces  another.  A  policy  is,  according  to  Deborah 
Stone,  more  like  an  endless  game  of  Monopoly 
than  a  bicycle  repair. 

Three  Families  of  Instruments 

The  study  of  policy  instruments  has  been  long 
dominated  by  efforts  to  put  together  a  classification 
of  policy  instruments  on  the  basis  of  their  charac¬ 
teristics.  One  of  the  first  is  a  typology  by  Chris¬ 
topher  Hood  that  makes  a  distinction  between 


instruments  for  the  collection  of  information 
(detectors)  and  instruments  directed  to  influence 
development  in  society  (effectors).  A  quick  scan  of 
the  literature  reveals  that  the  study  of  policy  instru¬ 
ments  is  almost  exclusively  focused  on  the  latter.  A 
survey  provided  by  Frans  van  der  Doelen  identifies 
three  families  of  policy  instruments,  also  referred  to 
as  “sticks,  carrots  and  sermons.”  The  first  family 
consists  of  regulatory  instruments ,  such  as  orders 
and  prohibitions  (licenses,  permits,  and  regula¬ 
tions).  Rules  may  also  establish  rights.  Rights  must 
rest  on  authoritative  rules  from  the  state,  but  they 
are  distinctive  in  their  reliance  on  citizens  for 
enforcement.  The  second  family  embraces  financial 
means ,  providing  incentives.  They  may  be  positive 
(grants,  subsidies)  as  well  as  negative  (taxes,  user 
charges)  from  a  consumer’s  perspective.  The  third 
family  includes  communicative  tools,  the  develop¬ 
ment  of  which  has  received  a  boost  with  the 
advance  of  the  digital  age.  Such  tools  may 
be  directed  at  increasing  as  well  as  decreasing  the 
amount  of  information  on  the  other  party. 

In  addition,  a  fourth  family,  organizations,  can 
be  distinguished,  referring  to  direct  government, 
that  is,  the  provision  of  goods  and  services  includ¬ 
ing  treatment.  The  classification  of  policy  instru¬ 
ments  is  further  elaborated  by  Michael  Howlett  in 
a  spectrum  of  “substantive”  policy  instruments  on 
the  basis  of  the  level  of  state  provision,  with  direct 
provision  of  goods  and  services  by  the  state  at  one 
end  of  the  continuum  and  provision  of  goods  and 
services  by  the  family  and  community  with  no 
state  involvement  at  the  other.  In  addition,  “proce¬ 
dural”  instruments  can  be  distinguished  that  are 
primarily  intended  to  alter  the  policy  process 


Table  I  Three  Families  of  Policy  Instruments 


Policy 

Instrument 

Positive 

Negative 

Regulatory 

Prescriptions 

Prohibitions 

Economic 

Grants, 

Taxes,  user 

subsidies 

charges 

Communicative 

Information 

Propaganda 

Source:  Adapted  from  Bemelmans-Videc,  M.  L.,  Rist,  R.  C., 
&  Vedung,  E.  (Eds.).  (1998).  Carrots,  sticks  and  sermons: 
Policy  instruments  and  their  evaluation.  New  Brunswick, 
NJ:  Transaction  Publishers. 
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rather  than  the  mix  of  goods  and  services  provided 
to  society. 

The  typology  of  policy  instruments  is  not  with¬ 
out  controversy;  for  instance,  the  use  of  symbols 
as  policy  instruments — “words  that  succeed  and 
policies  that  fail”  to  quote  the  subtitle  of  Murray 
Edelman’s  book  Political  Language — does  not  fit 
into  the  typology.  In  most  cases,  hardly  any  atten¬ 
tion  is  paid  to  the  internal  instruments  used  to 
influence  administrative  processes,  even  though 
they  might  be  preferable  from  a  control  or  man¬ 
agement  perspective.  The  focus  is  almost  exclu¬ 
sively  on  external  instruments  geared  to  a  change 
of  the  environment  of  the  government — that  is, 
the  developments  in  society.  Besides,  there  is  no 
place  for  informal  instruments  to  shape  a  policy 
according  to  a  government’s  preferences  for 
changes  in  the  membership  or  size  of  the  decision¬ 
making  body,  through  bargaining  and  negotia¬ 
tion,  and  by  mobilization  of  political  support.  In 
response  to  these  and  other  drawbacks,  several 
other  typologies  have  been  offered  in  the  litera¬ 
ture,  for  instance,  on  the  basis  of  the  degree  of  the 
following: 

•  Coercion:  The  extent  to  which  a  policy  tool 
restricts  behavior  as  opposed  to  being  merely 
encouraging  or  discouraging 

•  Directness:  The  extent  to  which  a  public  agency 
is  involved  in  all  stages  of  the  policy  process 

•  Automaticity:  The  extent  to  which  a  policy  tool 
uses  the  existing  administrative  structure 

•  Visibility:  The  extent  to  which  the  resources 
attributed  to  a  policy  tool  show  up  in  the 
budget-end  program  review 

The  degree  of  coercion  is  considered  to  be  the 
main  criterion;  according  to  Lester  Salamon,  a 
variety  of  factors  have  put  a  premium  on  tools  that 
are  automatic  but  indirect  and  invisible.  Unfortu¬ 
nately,  no  criterion  is  completely  exhaustive  or 
mutually  exclusive. 

Last  but  not  least,  policy  instruments  rarely 
appear  in  a  pure  form.  They  come  bundled  in  pro¬ 
grams  combining  various  tools,  each  being  a  pack¬ 
age  containing  a  type  of  good  or  activity,  a  vehicle 
for  the  delivery  of  this  good  or  activity,  and  a 
delivery  system.  A  prohibition,  for  instance,  may 
be  delivered  by  a  public  agency  in  the  form  of  a 
rule. 


The  diversity  of  policy  instruments  should  be 
considered  as  a  benefit  rather  than  a  drawback, 
according  to  Ross  Ashby’s  law  of  requisite  variety — 
the  greater  the  variety  within  a  system,  the  greater 
is  its  ability  to  absorb  differences  without  system 
failure;  in  this  case,  variety  in  policy  instruments 
increases  the  effectiveness  of  government  interven¬ 
tion  by  enabling  it  to  address  a  greater  variety  of 
environmental  situations. 

Policy  Design:  The  Selection 

The  selection  of  policy  instruments  is  often  associ¬ 
ated  with  a  tool  kit  from  which  the  government  is 
free  to  choose  a  policy  instrument.  In  practice,  there 
is  no  such  tool  kit,  and  at  the  same  time,  the  gov¬ 
ernment  is  not  always  free  to  choose.  On  the  con¬ 
trary,  a  subsidy  is  not  simply  a  “bag  of  money”  but 
represents  an  organizational  unit,  too.  A  third  party 
may  have  preferences  of  its  own.  A  specific  policy 
instrument,  for  instance,  may  provide  discretionary 
power  and,  therefore,  room  to  maneuver.  The  same 
applies  for  the  target  group,  because  interest  groups 
that  oppose  the  implementation  of  a  policy  have 
tools  at  their  disposal  that  may  frustrate  a  poten¬ 
tially  effective  policy  instrument. 

The  choice  of  policy  tools,  according  to  B.  Guy 
Peters,  is  driven  by  what  he  calls  the  Live  I’s,  which 
combine  together  to  constitute  a  checklist  of  fac¬ 
tors  that  should  be  taken  into  consideration: 

1.  ideas  and  ideologies  that  may  shape  the 
selection  of  policy  instruments,  giving  preference 
to  a  specific  tool  above  other  tools; 

2.  interests  that  refer  to  the  assumption  that 
individuals  try  to  maximize  their  personal  utilities; 

3.  institutions  that  may  have  preferences  for 
instruments  over  which  they  have  control  and 
with  which  they  are  familiar; 

4.  individuals  pointing  at  the  role  of  political 
entrepreneurs  to  win  support  for  a  new  policy; 
and 

5.  international  environment  as  shown  by  the 
consequences  of  the  process  of  globalization. 

In  reality,  policy  instruments  are  often  chosen 
for  all  but  rational  arguments,  that  is,  the  assess¬ 
ment  of  their  positive  and  negative  effects  (impact 
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assessment).  A  policy  instrument  is  often  chosen 
because  of  extrarational  arguments.  The  following 
are  two  such  arguments: 

The  normative  or  political  dimension  of  a  policy 
instrument:  A  policy  instrument  is  not  just  a  tool. 
Contrary  to  the  well-known  statement  by  Niccolo 
Machiavelli,  the  ends  do  not  justify  the  means,  as 
Arthur  Ringeling  has  argued  correctly.  The  choice 
of  a  policy  instrument  is  not  value  free  as  it  may 
reflect  an  ideological  stance.  Besides,  a  policy 
instrument  may  strengthen  or  weaken  the  position 
of  an  actor. 

Path  dependency:  The  institutionalization  of  policy 
instruments  as  the  selection  of  a  policy  instrument 
may  be  subject  to  path  dependency.  A  policy 
instrument  that  works  is  also  adopted  in  other 
fields.  The  bias  a  policymaker  has  toward  a  specific 
policy  instrument  may  even  go  so  far  that  the 
problem  at  hand  is  shaped  in  such  a  way  that  it 
can  be  solved,  at  least  potentially,  by  the  policy 
instruments  of  his  or  her  choice. 

Last  but  not  least,  a  policy  instrument  may  be 
chosen  because  it  is  in  vogue,  as  illustrated  by 
the  deregulation  movement  of  the  1980s,  which 
seems  to  have  been  more  an  expression  of  a 
prejudice  against  the  government  or  a  preference 
for  the  market  than  the  outcome  of  a  careful 
scrutiny  of  the  pros  and  cons  of  regulation. 
Rules  and  standards  have  been  abolished  in 
favor  of  financial  incentives.  Only  recently  has 
there  been  a  movement  backward  in  favor  of 
re-regulation. 

Policy  Implementation:  The  Application 

The  application  of  instruments,  often  referred  to 
as  policy  implementation,  has  been  long  directed 
by  what  Herman  van  Gunsteren  calls  the  rational 
central-rule  approach,  based  on  the  superior  ana¬ 
lytical  insights  of  an  elite  and  on  centrally  con¬ 
trolled  implementation.  It  assumes  that  actors 
who  are  involved  in  the  enforcement  of  a  rule  are 
puppets  on  strings.  In  reality,  they  are  neither  will- 
less  nor  powerless,  as  illustrated  by  the  famous 
study  by  Jeffrey  Pressmann  and  Aaron  Wildavsky 
on  the  implementation  of  a  program  to  hire  the 
hardcore  unemployed  minorities  of  Oakland, 


titled  “Implementation:  How  Great  Expectations 
in  Washington  Are  Dashed  in  Oakland;  or  Why 
It’s  Amazing  That  Federal  Programs  Work  at  All, 
This  Being  a  Saga  of  the  Economic  Development 
Administration  as  Told  by  Two  Sympathetic 
Observers  Who  Seek  to  Build  Morals  on  a 
Foundation  of  Ruined  Hopes.” 

The  study  of  policy  instruments  has  been 
affected  by  developments  in  the  field  of  policy 
implementation.  The  confrontation  between  the 
top-down  and  the  bottom-up  approach  has  induced 
a  more  horizontal  mode  of  policy  implementation 
often  referred  to  as  governance.  The  shift  from 
hierarchy  to  markets  and  networks  has  substan¬ 
tially  changed  the  role  of  the  government.  The 
government  is  just  one  of  the  actors,  though  a 
prominent  one — the  government  has  a  monopoly 
on  the  utilization  of  power  and  more  resources  at 
its  disposal  and  often  serves  as  network  manager — 
each  pursuing  its  own  interests  and  strategy.  Being 
dependent  on  each  other  for  the  accomplishment 
of  their  objectives,  actors  are  involved  in  a  multi¬ 
actor  game  of  wheeling  and  dealing.  A  policy  is,  as 
such,  a  compromise,  the  outcome  of  the  interac¬ 
tion  between  these  actors  or,  to  put  it  differently, 
the  result  of  the  coproduction  of  various  actors 
rather  than  something  imposed  by  one  single 
actor — that  is,  the  government. 

The  Second  Generation 
of  Policy  Instruments 

The  shift  in  governance  from  hierarchy  to  markets 
and  networks  gave  birth  to  what  De  Bruijn  and 
Ten  Heuvelhof  have  called  the  second  generation 
of  policy  instruments.  A  multi-actor  game  requires 
other  features  beyond  the  conventional  tools, 
notably  multilateral  instruments  such  as  cove¬ 
nants,  public-private  partnerships,  and  voluntary 
agreements.  Contrary  to  the  first  generation  of 
policy  instruments,  the  second  generation  of  policy 
instruments  anticipates  the  potential  resistance  of 
the  stakeholders  by  taking  into  account  the  charac¬ 
teristics  of  a  network,  notably  the  autonomy  or 
isolation  of  the  target  group  and  the  interdepen¬ 
dency  of  the  actors,  as  well  as  the  pluriformity  of 
society. 

With  the  second  generation  of  policy  instru¬ 
ments,  the  government  takes  a  more  remote  stance. 
Instead  of  directing,  the  government  facilitates 
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development  in  society  (steering  from  a  distance). 
The  new  tools  are  highly  indirect  and  rely  heavily 
on  a  wide  assortment  of  third  parties  for  the 
implementation  of  a  policy.  In  addition,  they  are 
characterized  by  a  high  degree  of  relativism  and 
voluntarism.  It  is  left  to  the  target  group  to  take 
action  and  to  change  its  behavior. 

The  development  of  a  second  generation  of 
policy  instruments  does  not  mean  that  the  first 
generation  of  policy  instruments  suddenly  becomes 
obsolete.  To  the  contrary,  second-generation 
instruments  supplement  rather  than  replace  the 
first-generation  ones.  In  many  cases,  vertical 
instruments  are  still  appropriate,  even  in  a  net¬ 
work  setting.  The  introduction  of  more  horizontal 
instruments  may  be  seen  as  fine-tuning  for  a  new 
situation.  In  practice,  the  difference  is  not  that 
great.  The  “new”  policy  instruments  often  go 
under  the  same  label  as  the  “old”  ones,  although 
the  emphasis  is  more  on  the  bilateral  or  even  mul¬ 
tilateral  aspect  than  on  the  unilateral  aspect  of 
governance. 

Policy  Evaluation 

The  performance  of  the  government  is  often  mea¬ 
sured  in  terms  of  goal  attainment  and  effective¬ 
ness.  The  effectiveness  of  tools — the  contribution 
of  policy  instrument  to  goal  attainment — may  be 
blurred  by  external  or  intervening  variables,  such 
as  the  financial  crisis,  that  are  beyond  the  control 
of  the  government.  Consequently,  an  effective  tool 
may  not  lead  to  goal  attainment,  whereas  goal 
attainment  may  conceal  an  ineffective  tool. 

The  review  of  policy  instruments  is  further  com¬ 
plicated  by  a  number  of  circumstances  that  make 
it  difficult  to  be  conclusive: 

1.  Goals  are  not  set  once  and  for  all  but  may  be 
subject  to  goal  displacement.  The  assessment  of  a 
policy  instrument  in  terms  of  the  original  goals 
may  be  methodologically  sound  but  not  of  much 
practical  relevance. 

2.  It  may  be  hard  to  measure  the  effect  of  a 
policy  instrument  as  they  often  come  and  go  as 
part  of  a  policy  mix.  A  subsidy,  for  instance,  is 
often  shaped  as  a  regulation  containing  additional 
conditions.  A  regulation  is  usually  backed  up  by 
sanctions,  and  inducements  are  predicated  on  rules 


for  handing  out  rewards  or  punishments.  The 
packaging  could  be  threefold: 

•  horizontal  packaging,  which  is  used  if  two  or 
more  policy  instruments  are  directed 
simultaneously  at  the  same  agent; 

•  vertical  packaging,  which  refers  to  the  application 
of  a  policy  instrument  to  facilitate  the 
implementation  of  another  policy  instrument;  and 

•  chronological  packaging,  which  implies  a 
sequence  in  the  selection  or  application  of 
diverse  policy  tools. 

The  packaging  of  instruments  makes  it  extremely 
difficult,  if  not  impossible,  to  sort  out  the  precise 
effects  of  an  individual  policy  instrument  and, 
therefore,  its  contribution  to  the  accomplishment 
of  objectives. 

3.  The  effect  of  a  policy  instrument  may  be 
“hollowed-out,”  according  to  Roeland  in ’t  Veld, 
by  the  law  of  diminishing  effectiveness,  which 
states  that  a  policy  instrument  may  become  obso¬ 
lete  after  some  time  because  stakeholders  learn  how 
to  cope  with  its  negative  or  unwelcome  effects. 

4.  Last  but  not  least,  it  should  be  noted  that  the 
effectiveness  of  a  policy  instrument  is  just  one  of 
the  many  criteria  that  may  be  used  to  evaluate  a 
policy.  The  call  for  improvement  in  the  efficiency 
of  the  public  sector  in  the  past  few  years  has  been 
at  the  expense  of  equity  in  the  distribution  of 
goods  and  services.  Lately,  the  accountability, 
legitimacy,  and  responsiveness  of  government  have 
become  issues. 

The  Way  Ahead:  Old  Wine  in  New  Bottles? 

The  reinvention  of  government  not  only  has  put 
the  government  back  in  the  spotlight  but  also  has 
induced  other  changes  (steering,  not  rowing,  is  the 
device).  Borrowing  insight  from  the  new  public 
management  movement,  attention  has  shifted 
away  from  the  primary  toward  the  secondary  pro¬ 
cess  in  public  organizations.  Public  management 
has  replaced  public  policy  at  the  heart  of  public 
administration.  It  has  induced  a  change  in  vocabu¬ 
lary.  In  today’s  jargon,  instruments  are  referred  to 
as  outputs,  underscoring  that  the  provision  of 
goods  and  services  may  also  be  considered  a  policy 


Policy  Learning  1933 


instrument.  A  lot  of  effort  is  now  put  into  the 
development  of  indicators  to  measure  perfor¬ 
mance.  The  purpose  is  twofold.  On  the  one  hand, 
allocation  and  efficiency  may  be  fostered  by  link¬ 
ing  inputs  to  outputs.  On  the  other  hand,  effective¬ 
ness  may  be  promoted  by  relating  outputs  and 
outcomes.  It  is  still  too  early  to  judge  the  perfor¬ 
mance  movement,  but  the  focus  on  the  develop¬ 
ment  and  utilization  of  indicators  may  cause  a 
“performance  paradox,”  that  is,  contrary  or  even 
perverse  effects  generated  by,  inter  alia,  focusing 
on  what  we  can  measure  and  leaving  out  what  we 
cannot  measure.  The  results,  therefore,  should  be 
treated  with  care. 

Frans  K.  M.  van  Nispen 
Erasmus  University  Rotterdam 
Rotterdam,  Netherlands 

See  also  Governance;  Implementation;  Performance; 
Policy  Evaluation;  Policy  Network 
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Policy  Learning 


Learning  is  a  process  whereby  cognitive  frame¬ 
works  or  behavior  changes  occur  as  a  result  of 


new  information,  knowledge,  and  experiences. 
The  various  levels  at  which  learning  happens  are 
individual,  organizational,  governmental,  and 
even  societal.  Policy  learning  involves  all  these 
levels,  but  it  is  typically  focused  on  governmental 
or  societal  learning  about  specific  policy  issues  or 
broad  conceptual  frameworks  that  guide  govern¬ 
ment  action  in  different  policy  fields.  There  is  a 
vast  literature  on  the  psychology  of  individual 
learning,  as  well  as  a  management  literature  on 
the  dynamics  of  organizational  learning.  The  lit¬ 
erature  on  policy  learning  started  in  the  1960s, 
with  the  growing  realization  that  policy  making 
was  not  simply  about  power  but  also  about  collec¬ 
tive  puzzling  under  conditions  of  uncertainty. 

Learning  in  Public  Policy 

Learning  is  a  preoccupation  in  the  study  of  public 
policy  for  several  reasons.  First,  there  is  a  presump¬ 
tion  that  policy  making  is  not  entirely  a  matter  of 
self-interested  politics  or  power.  Obviously,  the 
policy  process  involves  interest  groups,  political  par¬ 
ties,  social  movements,  classes,  and  other  forces — 
and  they  battle  for  the  supremacy  of  their  agendas 
and  their  interests.  But  there  is  also  the  sense  that 
policy  making  is  about  pursuing — in  some  large 
measure — the  public  interest.  This  means  that  policy 
is  intended  to  contribute  to  the  public  good  and  that 
therefore  it  should  be  monitored  (evaluated)  regu¬ 
larly  as  conditions  and  information  change,  and 
improved  if  possible.  This  requires  learning.  Second, 
every  model  of  the  policy  process  assumes  that  poli¬ 
cies  (and  their  constituent  programs)  are  intended  to 
deal  with  public  problems  and  therefore  require  a 
careful  definition  of  exactly  what  the  key  features  of 
the  problem  are,  as  well  as  its  underlying  causal  fac¬ 
tors.  This  requires  reflection,  research,  and  learning. 
Third,  every  model  of  the  policy  process  also 
assumes  a  stage  of  evaluation,  where  policies  are 
assessed  for  their  effectiveness  and  efficiency  and  are 
modified  and  improved  if  possible.  Evaluation  is 
intrinsically  a  learning  function. 

The  successful  avoidance,  or  at  least  minimiza¬ 
tion,  of  mistakes  is  crucial  for  public  policy  and 
requires  a  capacity  to  learn.  Part  of  the  challenge  of 
policy  learning  is  that  capacity  is  often  blocked  by  a 
combination  of  factors.  Uncertainty  marks  all  pol¬ 
icy  fields,  some  more  than  others.  Climate  change 
science,  for  example,  is  enormously  complicated 
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and  was  being  seriously  debated  even  into  the 
1990s.  When  governments  around  the  world  began 
massive  bailouts  and  spending  programs  in  2009  to 
deal  with  the  global  financial  crisis,  no  one  was 
entirely  sure  what  the  impacts  would  actually  be. 
Sometimes  the  evidence  to  assess  policy  interven¬ 
tions  takes  years  to  appear  (e.g.,  poverty  reduction 
programs  in  developing  countries).  Other  policy 
fields,  such  as  abortion  or  same-sex  marriage,  are 
driven  more  by  moral  or  ethical  perspectives  than 
by  scientific  research  and  data.  For  these  and  other 
reasons,  policy  actors  have  to  rely  on  policy  issue 
frames  or  belief  systems  that  consist  of  a  reason¬ 
ably  consistent  mix  of  data,  information,  research, 
values,  interests,  and  deeply  held  assumptions  or 
beliefs.  These  can  operate  in  various  ways,  as 
broadly  consistent  worldviews  of  competing  play¬ 
ers  in  the  policy  process  (e.g.,  environmental  vs. 
economic  development  groups),  as  paradigms  that 
are  broadly  accepted  by  most  stakeholders  in  a 
policy  field,  or  even  broadly  at  the  societal  level 
(e.g.,  the  reality  and  danger  of  global  warming).  So 
policy  learning  is  not  simply  a  matter  of  processing 
clear  information  or  unambiguous  experiences. 
Belief  systems  and  paradigms  are  often  deeply 
embedded  among  social  actors  or  institutions  and 
are  internally  consistent  enough  that  they  can  resist 
contradictory  evidence. 

Organizational  Learning 

This  raises  the  question  of  whether  organizations 
can  actually  learn,  as  distinct  from  the  individuals 
that  make  them  up.  It  is  important  not  to  anthro¬ 
pomorphize  institutions — they  do  not  exist  as 
separate  entities.  However,  enormous  efforts  take 
place  in  almost  all  institutions  to  provide  both  col¬ 
lective  memory  and  learning.  Policy  manuals  and 
standard  operating  procedures  (SOPs)  mark  almost 
every  organization.  Organizations — from  local 
day  care  centers  to  the  military — have  retreats  and 
special  meetings  to  breathe,  get  focused,  and  act  as 
one.  The  larger  the  organization,  the  more 
challenging  is  the  learning  process.  Can  states — 
possibly  the  largest  organizations  we  know — learn, 
and  can  they  learn  policy  lessons?  The  current 
record  is  mixed.  The  financial  crisis  of  2008-2009 
had  the  U.S.  Federal  Reserve  and  other  national 
banks  intervene  to  keep  their  economies  afloat. 
Ben  Bernanke,  the  Chairman  of  the  Federal 


Reserve,  who  was  a  student  of  the  Great  Depression 
of  the  1930s,  learned  those  lessons  and  was  in  a 
position  to  apply  them  with  a  massive  stimulus 
package.  He  was  an  individual  in  charge  of  an 
organization,  and  the  organization  and  the  state 
absorbed  the  lesson  to  a  degree.  By  2010,  the 
new  challenge  was  deficits,  and  various  state 
organizations — primarily  central  banks  and  finance 
departments — were  attempting  to  shift  from  stimu¬ 
lus  to  cuts.  Learning  is  not  a  static  condition — it 
requires  adaptation,  shift,  and  change.  That  is  why 
it  is  difficult. 

So  can  states  learn?  In  principle,  yes,  if  we  de- 
anthropomorphize  the  organizations,  ministries, 
and  offices  that  make  them  up  and  see  leaders  who 
look  dispassionately  at  the  evidence  and  move  on. 
On  the  other  hand,  policy-making  systems  and  the 
states  within  which  they  are  embedded  face  at  least 
six  impediments  to  learning.  First,  while  making 
mistakes  is  the  best  path  to  learning  how  to  do  bet¬ 
ter,  mistakes  are  often  political  suicide.  Second,  the 
larger  the  organization,  the  more  reliant  it  is  on 
SOPs.  States  are  big — SOPs  are  important  and  often 
helpful,  but  they  become  impediments  to  fresh 
thinking.  Third,  stakeholders  fight  for  and  cling 
grimly  to  the  status  quo.  They  are  not  necessarily  the 
enemies  of  learning,  but  they  are  stalwart  defenders 
of  any  conventional  wisdom  that  serves  their  inter¬ 
ests.  Fourth,  many  political  systems  do  not  have 
well-developed  “cognitive  capacities”  (e.g.,  think 
tanks,  foundations,  and  university  departments  of 
political  science  or  public  policy).  Fifth,  there  is  the 
invisible  human  dimension  of  state  organizations 
(ministries) — the  fluid  companionship  of  experi¬ 
enced  seniors  and  enthusiastic  recruits.  Mentorship — 
much  understudied  in  the  literature — is  a  crucial 
mechanism  of  organizational  learning,  but  it  often 
happens  haphazardly.  Sixth,  in  policy  areas  that 
concern  the  safety  or  security  of  the  public,  if  things 
go  wrong,  there  is  almost  unbearable  pressure  to 
“do  something.”  In  other  words,  agencies  have  to 
react  immediately  rather  than  learn  from  the  experi¬ 
ence — terrorist  attacks  are  a  perfect  illustration.  The 
policy  reactions  to  the  attempt  to  blow  up  a  plane 
destined  to  the  United  States  from  Amsterdam  on 
Christmas  Day  in  2009  were  clearly  designed  to 
“tighten  security,”  even  if  that  was  largely  meaning¬ 
less  in  terms  of  a  real  capacity  to  stop  terrorists. 

However,  at  the  same  time,  the  Obama  admin¬ 
istration  ordered  a  review  of  agency  practices  in 
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light  of  the  fact  that  there  had  not  been  an  intelli¬ 
gence  failure  but  instead  a  failure  to  “connect  the 
dots”  of  existing  information.  Interestingly,  the 
first  response  by  the  U.S.  Department  of  Homeland 
Security  was  defensive — the  “system  had  worked.” 
In  subsequent  days,  the  administration  made  it 
clear  that  it  wanted  to  know  (to  learn)  what  had 
gone  wrong. 

Theoretical  Approaches 

Since  learning  and  policy  change  evidently  take 
place,  various  theoretical  approaches  to  under¬ 
standing  policy  learning  have  emphasized  the  idea 
of  “levels”  of  belief  systems  or  paradigms  that  are 
more  or  less  susceptible  to  alteration.  At  the  most 
microscopic  level,  there  is  a  steady  stream  of  indi¬ 
cators  and  data  that  wash  through  the  policy  sys¬ 
tem  and  trigger  attention  if  they  seem  anomalous 
or  unexpected.  This  represents  learning  from  the 
environment,  from  feedback  signals,  and  may  not 
generate  substantial  changes  in  policy  but  simply 
intensified  responses  based  on  preexisting  policy 
(e.g.,  new  cases  of  avian  flu).  Another  approach 
sees  social  learning  composed  of  three  orders. 
First-order  learning  absorbs  new  information  as  a 
basis  for  making  changes  in  policy  settings,  such  as 
incremental  adjustments  in  tax  rates  if  the  econ¬ 
omy  seems  to  be  slowing  down.  Second-order 
learning  occurs  when  it  appears  that  the  policy 
environment  has  changed  in  a  substantial  way, 
when  it  seems  that  existing  policy  interventions  are 
not  achieving  their  objectives.  In  this  case,  govern¬ 
ments  will  change  the  kinds  of  policy  instruments 
that  they  use — for  example,  from  direct  welfare 
payments  to  vouchers.  Third-order  learning  is  the 
deepest  and  most  challenging  and  reflects  such  an 
accumulation  of  anomalies  and  unexplained  or 
unexpected  events  and  circumstances  that  an 
entire  policy  paradigm — a  combination  of  deep 
assumptions  and  the  policy  instruments  that 
express  them — collapses  and  is  replaced  by  another. 
A  key  example  is  the  ascendancy  of  monetarism 
over  Keynesianism  in  the  1980s.  A  more  recent 
example  is  the  global  financial  crisis  of  late  2008. 
Governments  around  the  world  that  had  favored 
markets  and  had  been  skeptical  of  state  interven¬ 
tion  suddenly  championed  massive  public  spend¬ 
ing,  nationalization,  and  effectively  state-managed 
capitalist  economies. 


The  most  elaborate  version  of  this  multilayered 
learning  model  is  the  advocacy  coalition  framework 
(ACF).  The  ACF  oversees  policy  change  and  policy 
learning  usually  requiring  a  decade  or  more  and 
views  change  and  learning  in  terms  of  “policy  sub¬ 
systems,”  which  are  dominated  by  competing  advo¬ 
cacy  coalitions  that  share  causal  and  normative 
beliefs.  Most  important,  it  interprets  policies  and 
programs  as  belief  systems  consisting  of  value  pri¬ 
orities  and  causal  assumptions.  A  distinctive  feature 
of  the  ACF  is  its  emphasis  on  the  role  of  ideas  and 
values  in  the  policy  process.  The  ACF  assumes  that 
both  policy  actors  and  the  policies  themselves  can 
be  understood  in  terms  of  the  structure  of  their 
belief  systems.  These  systems  have  three  key  ele¬ 
ments.  The  first  is  the  deep  or  normative  core, 
which  consists  of  fundamental  axioms  about  human 
nature,  justice,  and  priorities  among  values  such  as 
security,  health,  and  life.  These  ideas  are  very  diffi¬ 
cult  to  change  through  policy  arguments.  The  sec¬ 
ond  set  of  ideas  is  the  near  (policy)  core,  and  it 
comprises  notions  about  the  proper  scope  of  gov¬ 
ernment  activity,  distributions  of  power  and  author¬ 
ity,  orientations  on  substantive  policy  conflicts,  and 
basic  choices  about  policy  instruments.  These  are 
difficult  to  change  but  can  be  altered  if  experience 
seriously  differs  from  theory.  The  final  set  contains 
secondary  aspects  and  consists  of  instrumental  deci¬ 
sions  needed  to  implement  the  policy  core,  such  as 
decisions  about  administrative  rules,  budgetary 
allocations,  and  statutory  interpretation.  These  are 
comparatively  easy  to  shift  or  change  and  constitute 
the  bulk  of  technical  policy  argumentation. 

The  ACF  also  has  several  distinct  hypotheses 
about  how  policy  subsystems  operate.  Among  them 
are  the  following:  (a)  in  any  subsystem,  the  lineup 
of  allies  and  opponents  is  stable  over  periods  of  a 
decade  or  so;  (b)  there  is  more  consensus  within 
coalitions  on  core  beliefs  than  on  secondary  ones; 
(c)  government  policies  rarely  change  if  the  original 
sponsoring  coalition  is  still  in  power;  (d)  policies 
for  which  there  are  quantitative  data  are  more  ame¬ 
nable  to  policy  learning  than  areas  distinguished  by 
qualitative  data;  and  (e)  policy  learning  across 
belief  systems  is  more  likely  when  there  exists  a 
prestigious  forum  that  forces  professionals  from  all 
sides  to  participate.  This  highlights  the  way  in 
which  policy  learning  can  take  place  at  different 
rates  and  intensity,  depending  on  the  characteristics 
of  policy  subsystems. 
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The  approaches  above  focused  on  policy  learn¬ 
ing  and  change  within  a  single  political  system.  In 
the  1990s,  research  began  to  focus  on  policy  learn¬ 
ing  between  and  among  different  political  jurisdic¬ 
tions  (also  referred  to  as  policy  transfer  and  policy 
diffusion).  There  are  varieties  of  diffusion  and 
learning:  lesson  drawing  (where  governments  see  a 
problem  and  borrow  an  existing  solution),  legiti¬ 
mation  (referring  to  other  international  examples 
to  satisfy  domestic  critics),  and  harmonization. 
The  latter  is  facilitated  by  international  govern¬ 
mental  institutions  such  as  the  Organisation  for 
Economic  Co-operation  and  Development  (OECD) 
and  the  World  Bank.  Researchers  have  emphasized 
that  “borrowing”  or  “learning”  can  be  voluntary 
but  sometimes  can  be  imposed  if  third  parties  have 
financial  or  other  forms  of  leverage  (such  as  the 
International  Monetary  Fund).  Moreover,  in  a 
globalized  and  competitive  world,  countries 
increasingly  aspire  to  meet  or  exceed  common 
governance  or  policy  standards.  The  OECD  is  a 
distinctive  example.  Even  without  any  coercive 
powers  over  its  30  member  states,  it  establishes 
through  its  research  and  publications  a  variety  of 
policy  gold  standards  (e.g.,  in  educational  perfor¬ 
mance  or  science  and  technology  policy),  which 
countries  study  and  then  often  use  as  guides  for 
their  own  policy  development. 

More  recent  work  in  critical  policy  studies  has 
emphasized  the  microdynamics  of  policy  diffusion 
and  learning.  The  original  diffusion  literature 
thought  of  policies  as  “things”  or  “packages”  that 
were  transported  and  implemented  in  different 
regimes.  But  policies  are  complex  constructs,  and 
their  penetration  and  adoption  involves  a  process 
of  “translation”  and  epistemic  recalibration.  The 
transfer  of  a  “model,”  for  example  (whatever  it 
might  be — education,  city  transport,  or  refuse  col¬ 
lection),  involves  persuasion,  coalition  building, 
organizational  development,  and  mobilization. 
Diffusion,  from  this  perspective,  is  much  more 
fluid,  episodic,  fragmented,  discursive,  and  contin¬ 
gent.  It  is  not  as  much  about  “learning”  as  it  is 
about  active,  social  recogitation  and  realignment. 

Effective  Policy  Learning 
in  a  Globalized  World 

A  final  issue  is  a  more  practical  one:  how  to 
enhance  effective  policy  learning?  One  of  the  best 


ways  to  learn  is  by  making  mistakes,  but  this  is 
hardly  politically  attractive,  and  governments  are 
naturally  reluctant  to  publicize  their  errors. 
Consequently,  it  is  important  to  institutionalize 
transparency  and  challenge  functions  within  a 
political  system  and  within  policy-making  organi¬ 
zations.  Policy-making  institutions  are  made  up  of 
people,  and  learning  will  be  enhanced  if  those 
people  are  encouraged  to  exchange  information 
and  be  continually  learning  at  the  individual  level. 
Within  policy  subsystems,  accessibility  to  outside 
actors,  standards  of  open  discussion  and  debate, 
and  the  development  of  reasonably  consensual 
policy  measures  and  indicators  will  enhance  learn¬ 
ing.  Generally  speaking,  in  both  organizations  and 
subsystems,  network  structures,  flexibility,  and 
adaptability  are  key  ingredients  in  effective  learn¬ 
ing.  Elierarchy  in  the  face  of  a  crisis,  stress,  or 
emergency  is  inferior  to  network  structures  that 
both  push  decision  making  down  and  keep  infor¬ 
mation  flow  open.  Modern  armies,  for  example, 
are  increasingly  functioning  this  way  to  enhance 
the  collective  capacity  to  respond  to  threats  on  the 
ground  and  learn  from  accumulated  information. 

Policy  learning  remains  hugely  difficult.  At  the 
human,  individual  level,  people  seem  to  make  mis¬ 
takes  all  the  time  and  hardly  seem  capable  of  learn¬ 
ing  from  them.  Organizations  struggle  constantly 
to  induce  their  members  to  learn,  in  the  face  of 
inertia  and  SOPs.  States  and  their  policy-making 
systems  are  riven  with  politics  and  all  the  forces 
that  encourage  routine  and  stasis.  The  interna¬ 
tional  system  is  pressing  more  and  more  countries 
to  “learn,”  but  the  transfer  process  is  complicated. 
Many  “receiving”  states  are  doing  so  only  to  cash 
in  on  development  aid.  Most  others  are  “translat¬ 
ing”  rather  than  simply  transferring.  Most  impor¬ 
tant,  many  states — developed  and  developing — are 
engaged  in  a  global  conversation  that  engages  them 
collectively  in  both  learning  and  transferring  ideas. 
The  Copenhagen  Climate  Change  conference  is  a 
case  in  point — it  did  not  achieve  an  “agreement”  in 
any  meaningful  sense,  but  it  contributed  to  a  messy 
and  disjointed  conversation  that  will  continue  for 
decades. 

Policy  learning  is  a  key  factor  in  effective  policy 
change  and  program  improvement  in  democracies. 
While  it  is  constrained  by  both  politics  and  the 
limits  of  human  cognition,  it  is  becoming  increas¬ 
ingly  important  in  a  globalized  world  where 
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stresses  coming  from  the  environment — natural, 
social,  economic — are  mounting  and  demand  a 
reasoned  response. 

Leslie  A.  Pal 
Carleton  University 
Ottawa,  Ontario,  Canada 
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Policy  networks  comprise  international  and 
national  governmental  and  nongovernmental  inter¬ 
connected  actors  in  a  given  policy  space.  The  cli¬ 
mate  change  policy  network,  for  example,  would 
include  United  Nations  (UN)  agencies,  government 
departments  of  the  environment,  Greenpeace,  A1 
Gore  (founder  and  current  chair  of  the  Alliance  for 
Climate  Protection),  and  David  Suzuki  (Japanese 
Canadian  academic,  science  broadcaster,  environ¬ 
mental  activist,  and  founder  of  David  Suzuki 
Foundation).  Network  members  need  not  agree 
with  each  other.  They  share  a  common  language 
and  a  set  of  reference  points  and  so  can  argue  or 
disagree  within  a  consensual  frame  of  discourse. 

The  backdrop  to  the  origins  of  network  analy¬ 
ses  is  a  concern  with  understanding  the  relation¬ 
ship  between  state  and  society  and,  in  particular, 


the  organization  of  interests  in  society.  The  early 
postwar  history  of  political  science  and  public 
policy  wrestled  with  the  best  way  to  theorize  the 
connections.  While  the  Marxist  literature  addressed 
itself  to  the  importance  of  class,  the  non-Marxist 
literature  eventually  settled  on  the  notion  of  inter¬ 
est  group  pluralism.  Work  in  the  1950s  focused  on 
the  limitless  array  of  interests  that  could  mobilize 
around  the  equally  limitless  range  of  policy  issues. 
If  people  shared  interests,  they  would  likely  form 
groups.  If  policy  issues  arose  that  affected  those 
interests,  then  the  groups  would  politicize  and 
lobby  government.  The  pluralist  tradition  de- 
emphasizes  the  state  or  policy-making  institutions 
and  stresses  the  influence  of  lobbies  and  interest- 
groups  politics. 

There  are  four  major  sources  of  inspiration  for 
the  concept  of  networks  in  political  science.  One  of 
the  first  breaks  with  pluralism  was  over  its  portrait 
of  the  associational  system.  Empirical  studies  in 
that  period  showed  that  associational  patterns 
were  much  more  stable  and  relationships  far  more 
closed  than  the  pluralists  had  suggested.  Policy 
making  took  place  not  in  the  legislature  or  the 
executive  but  in  “iron  triangles,”  which  truly  were 
subgovernments  in  that  they  might  all  be  operating 
according  to  different  principles,  with  different 
rhythms  and  often  conflicting  outcomes.  By  the 
mid-1970s,  the  notion  of  iron  triangles  seemed  like 
a  caricature  too,  and  a  new  concept  of  “issue  net¬ 
works”  arose. 

A  second  important  source  of  work  on  net¬ 
works  came  from  comparative  research  on  indus¬ 
trial  performance  and  economic  policy.  A  key 
conditioning  factor  of  foreign  policy  was  the  struc¬ 
ture  of  domestic  interests  and  institutions.  This 
branch  of  research  had  offshoots  that  remain 
highly  relevant  to  the  work  on  policy  networks 
today.  Work  on  state  structures  argued  that  states 
had  a  clear  pattern  of  associational  and  state  insti¬ 
tutions.  For  example,  corporatist  states  had  highly 
centralized  associational  systems,  working  in  tan¬ 
dem  with  governments  to  develop  and  implement 
policy. 

A  third  source  of  inspiration  for  network  analy¬ 
sis  was  the  growing  work  on  new  social  move¬ 
ments  and  public  interest  groups.  Social  movement 
organizations  rarely  act  alone,  and  they  connect 
through  various  types  of  networks.  The  distinction 
between  the  movement  and  the  organizations  built 
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on  it  is  important  and  gives  a  clue  as  to  why  the 
network  idea  spontaneously  arose  in  this  field  of 
research.  Any  movement  (e.g.,  environmental, 
consumers’,  women’s)  is  bound  to  spawn  a  variety 
of  organizations  that  address  different  aspects  of 
its  agenda,  but  those  organizations  will  have  a 
common  cause  and  will  seek  to  cooperate  in  order 
to  maximize  their  policy  impact. 

A  final  source  of  inspiration  for  the  network 
concept  has  been  modernity  itself.  Policy  domains 
are  increasingly  crowded  with  organizations,  and 
they  spontaneously  connect  to  each  other,  both  as 
antagonists  and  as  allies.  The  line  between  public 
and  private  is  increasingly  blurred,  particularly 
since  the  global  financial  crisis  of  2009.  Domestic 
politics  increasingly  is  leveraged  to  the  interna¬ 
tional  plane,  from  whaling  to  the  environment.  The 
result  is  that  states  increasingly  rely  on  networks  of 
external  actors  that  are  not  under  their  control. 

Policy  Network/Community  Analysis 

Early  models  of  policy  communities/networks 
divided  them  into  subgovernments  and  the  atten¬ 
tive  public.  Actual  decision  making  takes  place  in 
the  subgovernment,  which  is  dominated  by  large 
institutions,  groups,  and  core  government  agen¬ 
cies.  Players  in  the  subgovernment  often  try  to 
limit  participation  from  outsiders.  The  attentive 
public  is  the  outsiders,  whose  main  influence  on 
the  process  is  to  generate  ideas  and  discussion 
through  conferences,  publications,  and  occasional 
lobbying.  There  are  several  limitations  to  this  way 
of  thinking  about  policy  communities:  It  is  largely 
static  and  does  not  travel  well  across  policy  fields. 
Some  areas  are  indeed  dominated  by  government 
agencies.  But  many  others  are  increasingly  open  to 
pressure  from  the  attentive  public,  and  that  public 
is  not  prepared  to  be  polite.  In  social  and  educa¬ 
tional  policy,  and  increasingly  in  municipal  policy, 
for  example,  fundamental  assumptions  about  the 
role  of  governments  are  constantly  being  posed. 

Some  of  these  limitations  have  been  addressed 
in  more  recent  models  of  policy  networks.  There  is 
a  structural  approach  to  network  analysis,  for 
example,  that  focuses  on  patterns  of  relations 
among  actors,  patterns  that  can  be  mapped  and 
are  to  some  degree  distinct  from  the  beliefs  or  ideas 
of  policy  actors.  The  degree  of  organization  here 
means  analytic  capacity,  access  to  important  data 


and  information,  the  ability  to  act  unilaterally, 
coordination,  focus  on  long-  or  short-term  issues, 
and  a  reactive  or  anticipatory  policy  stance.  This 
depiction  of  policy  network  types  has  the  advan¬ 
tage  of  variety. 

Most  of  the  policy  network  literature  has  been 
applied  to  economic  policy  fields  and  assumes  that 
concentration  plus  organization  equals  policy- 
capable  systems.  While  this  makes  some  intuitive 
sense,  it  needs  to  be  treated  cautiously.  For  one 
thing,  it  has  a  vaguely  undemocratic  flavor.  The 
more  hierarchical,  coordinated,  and  tidy  the  policy 
sector,  the  fewer  opportunities  there  will  be  for  the 
attentive  public  to  get  into  the  act.  Current  policy 
thinking  is  that  the  wider  the  networks  and  the 
more  competition  among  players,  the  better  policy 
outcomes  will  be.  For  another,  as  policy  sectors  get 
more  complex  and  more  globalized,  the  demand 
for  information  from  all  sectors  and  connections 
among  the  players  rises  exponentially.  The  tightly 
coordinated  policy  networks  recommended  in  this 
literature  may  not  be  adequate  to  the  new  dynam¬ 
ics  of  modern  policy  process. 

Practical  Applications 

The  network  idea  itself  can  be  seen  as  a  response 
to  changing  political  realities.  Concepts  related  to 
policy  communities  and  networks  began  to  multi¬ 
ply  and  develop  around  the  time  that  associational 
systems  were  becoming  more  complex.  The  idea  of 
issue  networks  was  designed  to  capture  the  idea  of 
a  more  fluid,  information-based  policy  system  in 
which  government  departments  and  industry  play¬ 
ers  were  no  longer  dominant.  While  there  are  no 
reliable  data  on  broad  trends  in  the  past  decade, 
there  is  no  doubt  that  groups  continue  to  multiply 
across  most  sectors. 

The  range  of  global  policy  actors  is  increasing 
exponentially.  The  most  visible  are  international 
governmental  organizations  such  as  the  UN,  the 
Organisation  for  Economic  Co-operation  and 
Development,  and  the  World  Trade  Organization. 
Yet  another  layer  consists  of  nongovernmental 
organizations  (NGOs)  active  on  the  global  level 
but  connected  to  domestic  concerns  (the  environ¬ 
ment,  human  rights).  The  result  is  a  diffuse  system 
of  transnational  networks  of  loosely  coordinated 
governance,  as  well  as  of  networks  of  protest  and 
opposition  to  that  governance. 
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Technology  makes  these  networks  possible — 
the  technology  of  travel,  of  communications,  or  of 
the  transfer  and  circulation  of  information.  Before 
the  advent  of  cell  phones,  laptops,  PDAs,  Google, 
Blackberrys,  Twitter,  and  e-mail,  NGO  activists 
attending  UN  conferences  would  have  had  to 
queue  at  the  phones  to  make  long-distance  calls  to 
their  colleagues  at  home.  That  has  all  obviously 
changed,  and  the  next  wave  will  build  on  these 
platforms  and  create  even  more  empowered  net¬ 
work  capabilities. 

Policy  networks  are  important  today  not  only 
because  they  represent  interests  that  have  to  be 
integrated  into  the  policy  process  or  information 
that  is  crucial  to  analysis  but  because  they  are 
important  sinews  for  implementation  and  delivery. 
The  concept  of  working  partnerships  for  the  devel¬ 
opment  and  delivery  of  services  implies  a  very 
different  set  of  relationships  than  is  typically  envis¬ 
aged  in  the  communities/networks  literature.  That 
literature  focuses  more  on  the  political  dynamics 
of  interest  representation,  whereas  the  challenges 
of  partnerships  focus  more  on  the  logistics  of  joint 
action  to  achieve  common  goals. 

The  preceding  suggests  a  somewhat  confusing 
array  of  forces  that  serve  to  make  policy  communi¬ 
ties  and  networks — domestic  and  global — even 
more  important  than  they  have  been  in  the  past 
but  also  perhaps  more  challenging  to  integrate  into 
the  policy  process.  The  associational  system  shows 
no  signs  of  shrinking,  and  some  elements  of  it, 
such  as  those  involved  in  the  delivery  of  public 
services,  may  face  considerable  pressure  to  expand. 
Information  technologies  make  possible  even 
wider,  global  connections  of  interests  and  commu¬ 
nities.  Movements  such  as  human  rights,  environ¬ 
ment,  and  women’s  issues  are  truly  global  in  scope. 
At  the  same  time,  some  policy  issues  get  driven 
further  down,  and  so  some  networks  that  would 
have  had  their  center  of  gravity  at  the  national 
level  now  become  truly  local  or  regional. 

The  contemporary  importance  of  policy  net¬ 
works  and  communities  has  not  diminished; 
indeed,  it  has  grown.  However,  the  realities  of  the 
policy  process  continue  to  change  the  nature  and 
dynamic  of  those  communities  and  networks,  pos¬ 
ing  challenges  for  policymakers.  A  core  responsi¬ 
bility  for  any  public  manager  is  the  improvement 
of  learning  and  adaptive  capacities,  leading  to 
higher  levels  of  policy  debate  and  relevant  policy 


expertise.  What  this  entails  in  practice  depends  on 
the  type  of  policy  community  or  network  in  ques¬ 
tion  and  its  specific  needs.  Capacity  building  for 
intellectual  communities  may  mean  enhancing 
informational  resources  and  communication  abili¬ 
ties.  For  communities  involved  in  policy  delivery,  it 
may  mean  development  of  organizational  capacity 
through  training.  The  governance  dynamics  of 
small,  dense  networks  will  be  different  from  the 
dynamics  of  networks  with  a  large  number  of 
members.  Thinking  about  the  policy  networks  and 
communities  relevant  to  one’s  policy  responsibili¬ 
ties  should  be  a  key  focus  for  public  managers. 

Leslie  A.  Pal 
Carleton  University 
Ottawa,  Ontario,  Canada 
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The  term  model  has  multiple  meanings.  This  being 
so  in  general,  in  political  science  the  varying  use  of 
the  term  reflects  the  diversity  of  stances  on  philo¬ 
sophical  and  methodological  issues  in  this  disci¬ 
pline.  In  its  most  narrow  definition,  the  term 
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model  stands  for  a  mathematical  formula.  More 
broadly  conceived,  the  term  involves  some  kind  of 
metaphor  used  to  picture  a  phenomenon  in  social 
reality.  In  the  study  of  government,  both  sorts  of 
definitions  occur,  in  several  variants.  All  share  the 
aim  of  offering  a  cognitive  representation  of  real- 
world  phenomena. 

The  stages  model  of  the  policy  process  struc¬ 
tures  analytically  the  range  of  activities  involved  in 
the  collective  endeavors  to  turn  public  intentions 
into  public  achievements.  In  both  the  theory  and 
the  practice  of  modern  government,  this  model — 
as  it  is  called — has  been  widely  adopted.  Because 
of  its  prevalence,  the  stages  model  of  the  policy 
process  is  the  point  of  departure  in  this  entry.  The 
outline  of  the  latter  can  be  summarized  as  follows. 
First,  questions  are  explored  about  which  variants 
of  the  stages  model  can  be  distinguished  and  what 
kind  of  functions  these  fulfill.  Next,  the  method¬ 
ological  status  of  the  stages  model  as  a  representa¬ 
tion  of  the  policy  process  is  addressed.  In  a  more 
narrow  definition,  it  turns  out  to  be  a  metatheo- 
retical  heuristic.  It  is  an  analytical  framework, 
rather  than  a  causal  model.  However,  other  ways 
of  studying  the  policy  process  can  be  identified. 
Under  the  heading  of  governance  research,  ana¬ 
lytical  frameworks  are  available  that  can  serve  as 
metatheoretical  alternatives  to  the  stages  model. 
What  the  frameworks  addressed  in  the  governance 
section  have  in  common  is  that  they  enable  the 
systematic  study  of  government  in  action.  Two  of 
them  stem  from  the  field  called  policy  studies, 
policy  science,  or  the  policy  sciences,  itself.  In 
particular,  the  multiple  stages  and  multiple  gover¬ 
nance  frameworks  have  been  developed  as  a 
critique  to  the  stages  model.  The  other  two  frame¬ 
works  come  from  institutionally  different  parts 
of  the  study  of  government,  particularly  public 
management  ( nested  games  and  the  logic  of 
governance )  and  institutional  rational  choice  (insti¬ 
tutional  analysis  and  development).  These  alterna¬ 
tive  frameworks  allow  the  decomposition  of  the 
policy  process  into  different  elements  rather  than 
in  stages  and  in  a  greater  variety  rather  than  as  just 
sets  of  activities.  In  fact,  the  four  metatheoretical 
frameworks  provide  other  ways  of  structuring  the 
multiplicity  of  aspects  in  the  analysis  of  the  prac¬ 
tice  of  modern  government.  The  conclusion  then  is 
that  such  an  analysis  has  evolved  beyond  the  stages 
model  of  the  policy  process. 


The  Stages  Model 

Variants 

The  term  stages  implies  a  picture  of  the  process 
of  public  policy  making  as  a  chronological  sequence 
of  distinct  sets  of  activities.  First,  problems  in  soci¬ 
ety  are  brought  to  governments  for  solution.  Next, 
governmental  institutions  conceptualize  the  prob¬ 
lems,  formulate  alternatives,  and  select  policy  solu¬ 
tions.  Accordingly,  those  solutions  are  imple¬ 
mented.  When  finally  evaluated,  they  may  be 
revised.  These  are  the  successive  elements  Paul 
Sabatier  distinguishes  when  describing  the  stages 
model.  The  first  element  concerns  agenda  setting. 
Then,  policy  design  and  decision  making  take 
place,  together  usually  addressed  as  policy  forma¬ 
tion.  What  follows  is  implementation.  The  sequence 
is  concluded  by  evaluation.  Each  of  these  nouns 
refers  to  an  identifiable  cluster  of  activities.  The 
nouns  are  the  common  nominators  for  the  major 
stages  in  what  since  Harold  Lasswell  has  been 
labeled  as  the  policy  process.  As  the  founding  father 
of  policy  studies,  Lasswell  uses  the  term  phases  or 
stages  while  indicating  a  range  of  separate  and  suc¬ 
cessive  steps.  Policy  is  thought  of  as  being  made,  in 
principle,  in  a  chronological  order.  That  order 
starts  with  initiative  and  goes  via  formulation  and 
decision  on  to  evaluation  and  possibly  termination. 
Lasswell  distinguishes  between  seven  stages  of  what 
he  calls  the  decision  process.  As  such,  he  identifies 
(1)  intelligence,  (2)  promotion,  (3)  prescription, 
(4)  invocation,  (5)  application,  (6)  termination,  and 
(7)  appraisal.  The  last  stage  offering  feedback 
inputs  to  the  process  again  indicates  a  cyclical  char¬ 
acter  of  the  latter.  Therefore,  the  terms  policy  pro¬ 
cess  and  policy  cycle  are  synonyms. 

Lasswell  speaks  of  a  conceptual  map,  meant  to 
provide  a  guide  to  obtaining  a  generalized  image  of 
the  major  phases  of  any  collective  act.  Since 
Lasswell’s  sevenfold  distinction,  there  have  been 
many  variants  of  the  stages  model  of  the  policy 
process.  Each  specifies  the  separate  stages  and/or 
their  number  a  little  differently.  In  fact,  all  these 
definitions  concern  variants  of  what  James 
Anderson  views  as  a  sequential  pattern  of  action 
involving  a  number  of  functional  categories  of 
activity  that  can  be  analytically  distinguished.  In  his 
and  in  many  other  textbooks,  the  stages  model  has 
been  used  to  structure  the  range  of  accounts  on  the 
study  of  the  policy  process.  Given  the  complexities 
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of  the  object,  the  contemporary  study  of  the  policy 
process  appears  to  go  in  various  directions.  Wayne 
Parsons  calls  the  field  rich,  both  in  academic  disci¬ 
plines  and  in  approaches.  It  is  wealthy  in  models 
(heuristic  and  causal)  as  well  as  in  metaphors  and 
maps.  The  stages  model  then  offers  a  structure  to 
bring  analytical  order  to  this  wealth. 

Characteristics 

The  succession  of  steps  thought  of  in  a  chrono¬ 
logical  order  as  implied  by  the  stages  model  has 
aspects  that  can  be  called  universal.  As  such,  the 
stages  notion  has  occurred  before  and  beyond  the 
study  of  public  policy.  For  instance,  Herbert  Simon 
distinguished  intelligence,  design,  and  choice  as  the 
successive  stages  in  the  way  people  in  general  reach 
decisions.  What,  however,  makes  the  stages  model 
of  the  policy  process  more  than  a  reference  to  a 
chronological  or  logical  order,  thought  of  as  uni¬ 
versal,  is  its  connection  with  the  modern  state  and 
its  institutions.  Policy  implementation  is  supposed 
to  follow  policy  formation.  Means  are  chosen, 
while  ends  are  given.  Administration  takes  care  of 
those  means,  after  politics  has  spoken.  Woodrow 
Wilson  was  once  president  of  the  United  States.  He 
is  also  seen  as  founder  of  the  modern  study  of  pub¬ 
lic  administration,  having  coined  the  classic  poli¬ 
tics/administration  dichotomy.  In  this  context,  that 
dichotomy  can  be  seen  as  a  stagist  model  avant  la 
lettre.  Despite  the  cyclical  logic,  normatively,  a 
hierarchy  is  at  stake.  It  embodies  a  view  about  the 
primacy  of  politics. 

The  policy  orientation  was  introduced  by 
Lasswell  in  the  1950s.  Policy  is  directed  on  what 
John  Dewey  once  described  as  the  combination  of 
the  public  and  its  problems.  This  problem  focus 
distinguishes  the  public  policy  perspective  from 
that  of  public  administration  or  public  manage¬ 
ment.  Connected  with  the  problem-solving  objec¬ 
tives  of  the  welfare  state,  this  orientation  implies 
both  a  rational  view  on  knowledge  and  an  instru¬ 
mental  use  of  it.  In  the  policy  orientation,  knowl¬ 
edge  from  social  science  can  be  used  to  improve 
society.  It  also  should  be  used,  via  public  policies, 
as  a  form  of  social  engineering.  Michael  Howlett 
and  M.  Ramesh  show  how  the  separate  stages  in 
the  policy  cycle  each  refer  to  a  part  of  applied 
problem  solving.  Agenda  setting  then  corresponds 
with  problem  recognition.  Policy  formulation 


stands  for  the  proposal  of  a  solution.  Decision 
making  means  the  choice  of  a  specific  solution. 
Policy  implementation  implies  putting  the  chosen 
solution  into  effect;  and  policy  evaluation  refers  to 
monitoring  results.  Policy  making  thus  means 
problem  processing.  Deborah  Stone  speaks  of  a 
policy  process  as  a  production  process.  Public 
policy  is  the  result  of  what  happens  in  what  may 
be  likened  to  an  assembly  line. 

Its  widespread  use  makes  the  stages  model  of 
the  policy  process  universal  as  well  as  modern. 
Especially  during  the  period  of  growth  of  welfare 
states  after  World  War  II,  the  notion  of  stages  has 
accompanied  the  term  public  policy  like  a  shadow. 
It  has  since  been  adopted  widely,  both  in  theory 
and  in  practice.  An  element  of  that  widespread 
adoption  is  what  Michael  Hill  and  Peter  Hupe  call 
the  primacy  of  policy  on  paper.  The  assumption  is 
that  once  one  knows  the  good  intentions  as  laid 
down  in  a  law  or  official  policy  document,  one  can 
infer  the  results  from  it  to  be  realized  in  the  real 
world  in  a  one-to-one  relationship.  This  assump¬ 
tion  forms  part  of  the  explanation  of  the  functions 
the  stages  model  of  the  policy  process  fulfils. 

Functions 

As  a  comprehensive  picture  of  the  policy  pro¬ 
cess,  the  stages  model  is  the  most  prevalent  one.  It 
fulfils  societal,  practical,  analytical,  and  program¬ 
matic  functions.  In  the  public  debate  and  in  society 
in  general,  the  stages  model  of  the  policy  process 
stands  for  an  orientation  toward  problem  solving, 
or  at  least  problem  processing.  Linking  the  public 
and  its  problems,  as  Dewey  indicated,  in  the  public 
discourse,  it  has  an  enduring  normative  appeal.  In 
political-administrative  practice,  the  stages  model 
invites  and  legitimizes  a  certain  division  of  labor. 
This  particularly  involves  the  relation  between  two 
categories  of  actors.  The  first  category  concerns 
the  actors  and  institutions  formulating  and  legiti¬ 
mately  deciding  on  public  intentions.  The  second 
category  comprises  the  ones  responsible  for  turn¬ 
ing  those  intentions  into  factual  public  achieve¬ 
ments.  When  the  problem  has  been  stated  and  the 
decisions  on  necessary  measures  have  been  taken, 
the  rest  is  implementation.  That  implementation  is 
expected  to  follow  the  policy  as  documented  in  a 
one-to-one  relationship.  If  things  turn  out  to  be 
different,  the  logical  distinction  between  the  stages 
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appears  to  be  reified.  Especially  in  situations  of  a 
crisis  or  disaster,  the  stages  model  in  the  practice 
of  public  administration  may  function  as  a  way  of 
shifting  the  blame.  Journalists  revealing  what  they 
label  as  policy  fiascos  mirror  this  labor  division. 
For  instance,  in  the  case  of  an  employment  policy, 
the  results  of  the  implementation  stage  may  be 
obviously  disappointing.  Journalists  may  relate 
these  results  to  the  policy  intentions  concerning 
job  growth  as  laid  down  in  the  policy  statutory 
concerned — the  result  of  the  policy  formation 
stage.  In  the  practice  of  public  administration,  now 
and  then,  such  politics  of  blaming  can  be  observed. 
Actually,  the  stages  are  logical  constructions.  In 
situations  such  as  these,  however,  the  stages  have 
become  reified. 

With  its  (chrono)logical  character,  the  stages 
model  fulfils  an  analytical  function.  It  provides  a 
way  of  rationally  ordering  the  complexities  of  the 
reality  of  government.  This  being  so  for  political- 
administrative  actors  and  the  public  in  general,  in 
academia,  the  functionality  of  the  stages  model 
primarily  is  a  didactic  one.  In  teaching  modules 
and  handbooks,  it  functions  as  a  way  to  structure 
the  presentation  of  insights  on  the  policy  process. 
Because  of  this  use,  the  model  is  called  the  text¬ 
book  approach.  The  stages  model  also  has  a  func¬ 
tion  in  academic  research.  Similar  to  its  function  in 
the  practice  of  public  administration,  the  stages 
model  enables  a  labor  division  in  academic  work. 
Each  of  the  stages,  from  agenda  setting  to  evalua¬ 
tion,  has  grown  into  a  scholarly  theme  of  its  own. 
With  its  programmatic  function,  the  stages  model 
has  set  the  agenda  for  research,  leading  to  a  variety 
of  classic  texts  on  each  of  these  themes.  One  could 
think,  for  example,  of  the  studies  of  John  Kingdon 
on  agenda  setting,  Charles  Lindblom  on  policy 
formation  and  William  Dunn  on  methods  of  policy 
analysis,  Jeffrey  Pressman  and  Aaron  Wildavsky 
on  implementation,  Michael  Lipsky  on  the  related 
theme  of  street-level  bureaucracy,  and  Frank 
Fischer  on  evaluation.  The  stages  model  enhanced 
the  development  of  distinguished  approaches  such 
as  the  advocacy  coalition  framework  developed  by 
Paul  Sabatier,  Hank  Jenkins-Smith,  and  others. 
Focusing  on  the  analysis  of  what  is  happening  in  a 
specific  stage  of  the  policy  process,  theoretical 
approaches  such  as  these  are  not  an  alternative  to 
the  stages  model.  The  advocacy  coalition  model, 
for  instance,  instead  concerns  theory  formation  on 


what  happens  particularly  in  one  stage — in  this 
case,  agenda  setting. 

Limitations 

The  functions  the  stages  model  of  the  policy 
process  fulfils  involve  a  broad  usage  of  the  term 
model.  That  usage  goes  beyond  the  more  narrow 
methodological  requirements  the  term  model  in 
science  may  imply.  It  is  exactly  this  that  has  invited 
severe  criticism  from  scholars.  Peter  John,  for 
instance,  is  positive  about  the  functions  the  stages 
model  fulfils  in  society.  Where  public  policy  is 
often  justified  in  rational  terms,  the  metaphor  of 
stages  is  helpful.  Journalists  and  citizens  try  to 
make  sense  of  the  complexities  of  public  policies. 
They  expect  the  policy  process  to  work  in  stages. 
John  also  acknowledges  that  the  stages  model  still 
can  be  used  as  a  heuristic  or  learning  device.  At  the 
same  time,  however,  John  sees  serious  constraints 
in  using  the  stages  metaphor  in  research.  There  the 
stages  idea  confuses  more  than  it  illuminates. 
There  are  no  neat  divisions  between  different  types 
of  activities;  policy  is  continuous.  For  analysis, 
John  sees  the  stages  metaphor  as  too  simplified  to 
capture  enough  of  reality. 

In  several  publications,  Sabatier  has  expressed 
similar  criticisms.  It  made  him  prefer  the  term 
stages  heuristic  as  proposed  by  himself  and  Jenkins- 
Smith.  Those  criticisms  entail  the  following.  First, 
the  stages  model  is  not  a  causal  theory.  It  does  not 
enable  the  causal  explanation  of  empirical  varia¬ 
tion.  Second,  the  assumed  sequence  of  stages  is 
inadequate  in  a  descriptive  sense.  Implementing 
ambiguous  laws,  for  instance,  may  imply  actual 
policy  formulation  and  even  decision  making. 
Third,  the  stages  model  has  a  top-down  bias  and 
focuses  on  major  pieces  of  legislation.  It  neglects 
the  interaction  among  a  variety  of  such  pieces, 
including  smaller  ones.  Fourth,  the  focus  on  the 
cycle  of  one  policy-making  process  shows  indiffer¬ 
ence  to  actual  empirical  reality.  Within  a  given 
policy  domain,  multiple  interacting  cycles  can  be 
observed.  This  complexity  involves,  as  Sabatier 
stresses,  numerous  policy  proposals  and  statutes 
emerging  from  multiple  layers  of  government. 

It  is  in  theoretical-empirical  research  that  a 
model  is  ultimately  put  to  the  test.  In  the  case  of 
the  stages  model,  then,  some  methodological  limi¬ 
tations  appear  to  be  inherent.  The  first  one  was 
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already  indicated  previously:  the  reification  of 
each  separate  stage.  Instead  of  empirically  observ¬ 
ing  that  policy  making  in  fact  goes  on  in  the  sub¬ 
sequent  stages,  and  of  investigating  how,  the  sepa¬ 
ration  between  the  stages  is  taken  for  granted  on 
normative  grounds.  Furthermore,  a  certain  stage 
as  a  specific  set  of  activities  is  automatically  treated 
as  connected  with  a  specific  spot — the  second 
methodological  limitation.  When,  for  instance,  the 
legitimate  formulation  of  a  specific  policy  has 
taken  place  on  the  layer  of  the  ministry  concerned, 
the  subsequent  implementation  of  that  policy  is 
seen  as  consequentially  taking  place  on  the  layer  of 
local  government  under  it.  It  may  be  the  case,  how¬ 
ever,  that  on  the  latter  layer  policy  formation  (for¬ 
mulation  plus  decision  making)  can  be  observed. 
This  concerns  an  empirical  question.  The  stages 
model  induces  research  in  which  what  actors  do 
and  where  they  do  it,  a  priori,  are  supposed  as 
coinciding.  In  such  instances,  Hill  and  Hupe  speak 
of  the  methodological  fallacy  of  the  wrong  layer. 
Instead,  activities  (focus)  and  layers  (locus)  could 
be  observed  independently. 

As  a  third  methodological  limitation  of  the 
stages  model,  the  same  authors  speak  of  the  con¬ 
trol  trap.  With  that  label,  they  highlight  a  specific 
implication  of  the  stages  model,  the  inducement  of 
quick  judgments  on  implementation  failures  and 
policy  fiascos.  The  stages  discourse  implies  a  nor¬ 
mative  hierarchy.  To  what  extent  empirically 
observed  action  can  be  judged  as  legitimate  is  as  a 
normative  matter.  Researchers  may  not  always 
reflect  on  the  methodological  consequences  of  the 
top-down  bias  stemming  from  that  fact.  This  may 
be  the  case,  for  example,  when  seeking  to  do  con¬ 
tract  research.  The  implicit  adoption  of  a  view 
from  the  top  may  lead  to  confusing  the  normative 
and  the  empirical.  Observations  of  behavior  are 
treated  as  judgments  on  deviance  from  the  stan¬ 
dard  norm.  Instead,  an  open  observation  of  what 
happens  is  to  be  distinguished  from  a  judgment  on 
the  legitimacy  of  what  is  observed.  After  all,  for 
public  servants  and  other  practitioners,  there  is 
always  a  necessity  to  interpret  ambiguous  legisla¬ 
tion.  Also,  in  circumstances  that  may  have  been 
unforeseen  by  the  legislator,  there  is  a  need  to  act. 
Hupe  and  Hill  speak  of  an  action  imperative  here. 
In  the  research  of  policy  processes,  it  calls  for  dis¬ 
tinguishing  between  clinical  observation  and  nor¬ 
mative  judgments. 


Representations  of  the  Policy  Process 
A  Model 

The  Shorter  Oxford  English  Dictionary  pro¬ 
vides  three  main  categories  of  meanings  for  the 
noun  model.  The  term  can  refer  to  a  representation 
of  structure,  a  type  of  design,  and  an  object  of 
imitation.  Within  a  range  of,  in  total,  13  defini¬ 
tions,  the  ones  standing  for  both  the  narrow  and 
the  broad  meanings,  as  referred  to  above,  can  be 
found.  In  a  narrow  sense,  the  term  model  is  defined 
as  a  set  of  entities  that  satisfies  all  the  formulae  of 
a  given  formal  or  axiomatic  system.  In  the  broad 
sense,  the  term  model  is  used  as  a  conceptual  or 
mental  representation  of  something.  The  Dictionary 
speaks  of  a  simplified  description  of  a  system,  pro¬ 
cess,  and  so  on,  put  forward  as  a  basis  for  theo¬ 
retical  or  empirical  understanding. 

In  his  textbook  on  public  policy,  Parsons 
addresses  the  variety  of  meanings  of  the  term 
model  and  other  related  terms  as  an  issue  in  itself. 
He  states  that  in  a  complex  world,  people  need  to 
simplify.  They  seek  to  organize  their  ideas  and  con¬ 
cepts.  Models,  maps,  and  metaphors — as  the  title 
of  the  section  expresses — provide  ways  to  do  so. 
Parsons  calls  them  frameworks,  which  enable  peo¬ 
ple  to  think  and  explain.  The  activity  of  theorizing 
about  public  policy  is  like  drawing  a  map.  People 
need  such  a  map,  but  it  must  in  some  respect  dis¬ 
tort  their  impression  of  the  territory  it  represents. 
Parsons  stresses  that  the  way  people  see  and  inter¬ 
pret  the  world  of  public  policy  depends  on  the 
kinds  of  models  and  frameworks  they  employ.  A 
map  may  claim  to  explain  but  may  at  the  same  time 
be  heavily  normative  and  prescriptive. 

Parsons  refers  to  the  way  Graham  Allison  ana¬ 
lyzed  American  decision  making  in  the  1962 
Cuban  missile  crisis.  In  Essence  of  Decision 
(1971),  the  latter  uses  three  models  or  lenses. 
Allison  analyzes  this  one  decision-making  process 
in  terms  of  the  rational  actor,  in  terms  of  organi¬ 
zational  processes,  and  in  terms  of  bureaucratic 
politics.  Each  provides  a  very  different  kind  of 
interpretation  of  the  same  events  in  reality. 
Allison’s  study  shows  how  the  analysis  of  decision 
making  involves  an  awareness  of  the  different 
frames  that  can  be  employed  to  interpret  events. 
Such  awareness  concerns  the  way  the  observed 
object  of  study  exists  in  the  context  of  a  variety  of 
contending  interpretations. 
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Models,  Theories,  and  Frameworks 

The  most  important  criticism  of  the  stages  model 
is  that  it  is  not  a  causal  theory.  It  is  also  necessary  to 
ask  what  it  means  to  call  it  a  model.  Like  Parsons, 
Elinor  Ostrom  has  addressed  this  issue  of  the  mean¬ 
ing  of  the  term  model.  Parsons  makes  a  point  of  the 
constructivist  effect  of  using  specific  lenses — reality 
is  not  just  out  there.  He  treats  the  labels  for  such 
lenses  largely  as  synonyms.  Ostrom,  however,  makes 
a  plea  for  distinguishing  explicitly  between  models, 
theories,  and  frameworks.  Speaking  of  an  often 
overlooked  distinction,  she  positions  these  three  in  a 
specific  way.  In  her  view,  models  make  precise 
assumptions  about  a  limited  set  of  parameters  and 
variables.  Theories,  according  to  Ostrom,  enable  the 
analyst  to  specify  which  elements  of  a  broader 
framework  are  particularly  relevant  to  certain  kinds 
of  questions.  A  framework  helps  identify  the  ele¬ 
ments  needed  for  more  systematic  analysis.  It  pro¬ 
vides  a  list  of  variables  and  a  vocabulary  that  can  be 
used  to  compare  theories.  Several  theories  are  usu¬ 
ally  compatible  with  any  framework. 

Models,  theories,  and  frameworks  vary  on  the 
generality  scale.  Models  are  the  least  comprehensive 
but  the  most  specified.  They  entail  particular  sets  of 
expected  relationships  between  limited  variables. 
Frameworks  are  the  most  robust  but  substantively 
are  rather  empty.  They  enable  theory  formation  and 
model  building  by  providing  a  metatheoretical  lan¬ 
guage.  Theories  are  in  between.  They  address  a 
certain  phenomenon  in  empirical  reality  with  a  more- 
or-less  universal  character.  Inviting  assumptions  to 
be  made  about  elements  of  that  phenomenon,  theo¬ 
ries  enable  the  analyst  to  diagnose  a  phenomenon, 
explain  its  processes,  and  predict  outcomes. 

By  positioning  the  stages  model  in  this  threefold 
classification,  some  of  the  criticisms  mentioned 
above  can  be  tested.  First,  regarding  treatment  as  a 
model,  in  the  narrow  sense,  critics  are  surely  right 
that  it  is  not.  The  stages  model  may  fulfill  functions 
as  a  heuristic  device,  but  for  acting  as  a  causal 
model,  it  is  too  general.  It  leaves  too  many  param¬ 
eters  implicit.  At  the  same  time,  the  stages  model  is 
not  a  theory.  To  meet  Ostrom’s  definition,  it  would 
need  more  explanatory  power.  The  stages  model 
does  enhance  the  identification  of  certain  elements 
and  variables  relevant  for  the  study  of  public  policy. 
Therefore,  it  can  be  called  an  analytical  framework. 

At  a  metatheoretical  level,  the  stages  model  in 
research  functions  as  a  map.  Around  it,  theory 
formation  and  model  building  may  take  place.  We 


then  need  to  ask  for  the  requirements  for  an  ade¬ 
quate  framework.  Comprehensiveness,  enhancing 
middle-range  theory  formation,  and  empirical 
openness  can  be  identified  as  relevant  criteria  here. 
First,  an  analytical  framework  must  have  the  capac¬ 
ity  to  encompass  conceptually  the  multidimensional 
character  of  public  policy  processes.  It  must  enable 
the  identification  of  the  variety  of  actors,  actions, 
and  interactions  involved.  The  stages  model  indeed 
aims  at  comprehensiveness.  It  claims  to  conceptual¬ 
ize  an  entire  policy  process.  However,  the  focus  is 
singularly  on  activities,  while  the  locus  of  a  specific 
set  of  such  activities  (e.g.,  implementation)  corre¬ 
spondingly  is  presupposed.  Second,  not  being  a 
theory  itself,  the  stages  model  as  a  heuristic  tends  to 
be  seen  as  inviting  theory  formation.  The  classic 
texts  on  each  of  the  stages  are  proofs  of  that 
function.  Third,  meeting  the  criterion  of  empirical 
openness  for  the  stages  model  is  difficult.  Policy 
implementation,  for  instance,  is  addressed  as  nor- 
matively  subordinate  to  policy  formation.  Such  a 
hierarchical  view  may  prevent  the  analyst  from 
observing  what  happens  and  explaining  why,  before 
expressing  judgments  on  the  legitimacy  of  what  is 
recorded.  An  analysis  normatively  biased  in  this 
way  may  lead  to  less  than  well-grounded  advice  to 
practitioners.  To  the  extent  this  is  the  case,  it  may 
be  counterproductive  in  the  end.  Neither  the  scien¬ 
tific  nor  the  normative  goals  may  then  be  achieved. 

Thus,  the  stages  model  meets  the  criteria  of  com¬ 
prehensiveness,  to  a  certain  degree,  and  of  enhanc¬ 
ing  theory  formation.  It  has  difficulty,  however,  in 
meeting  the  criterion  of  empirical  openness.  The 
conclusion  is  that  it  is  a  general  framework  in  Ost¬ 
rom’s  sense,  albeit  one  with  shortcomings.  It  is  this 
character  that  lies  at  the  basis  of  its  functioning  in 
academia.  The  so-called  model  functions  as  a  ro¬ 
bust,  rather  empty,  framework  and  heuristic  device 
facilitating  teaching  and  research.  As  a  general  map 
for  the  analysis  of  policy  processes,  it  is  conceptu¬ 
ally  neither  multidimensional  nor  empirically  open 
enough  to  do  much  more  than  provide  a  starting 
point  for  systematic  theoretical-empirical  research. 

Theoretical  Approaches 

Public  policy  scholars  have  sought  to  develop 
various  forms  of  representations  of  phenomena  in 
and  around  public  policy.  Some  of  those,  such  as 
the  advocacy  coalition  framework  mentioned 
above,  imply  theoretical  approaches.  They  focus 
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on  aspects  of  one  particular  stage  rather  than  pro¬ 
viding  an  alternative  to  the  stages  model  as  an 
analytical  framework  at  the  metatheoretical  level. 

Sabatier  searched  for  alternatives  to  the  stages 
model.  In  Theories  of  the  Policy  Process,  he 
collected  eight  contributions  he  calls  “more  prom¬ 
ising.”  This  edited  volume  contains  chapters  on 
(1)  the  institutional  analysis  and  development 
framework,  (2)  the  multiple  streams  framework, 
(3)  social  construction  and  policy  design,  (4)  the 
networks  approach,  (5)  punctuated-equilibrium 
theory,  (6)  the  advocacy  coalition  framework, 
(7)  innovation  and  diffusion  models,  and  (8)  the 
policy  process  and  large-N  comparative  studies. 
What  the  the  contributions  pictured  in  the  eight 
chapters  have  in  common  is  that  they  aim  at 
(causal)  explanation,  in  particular  by  relating  ele¬ 
ments  of  policy  processes  to  other  elements. 
Besides,  all  have  led  to  subsequent  empirical 
research.  This  being  so,  the  contributions  show 
differences.  First,  there  is  variety  in  what  is  seen  as 
needing  explanation.  For  some  contributions,  this 
explanandum  regards  policy  change;  for  others,  it 
regards  variation  in  policy  outputs.  The  innova¬ 
tion  and  diffusion  models  look  at  policy  adoption. 
Second,  the  level  or  unit  of  analysis  differs  in  a 
range  going  from  that  of  a  single  policy,  via  a 
policy  domain,  to  that  of  a  (nation-)state.  Third, 
and  most  important  here,  the  eight  contributions, 
collected  as  theories,  are  labeled  differently.  One 
of  them  is  called  a  theory  (punctuated  equilibrium) 
and  four  a  framework  (institutional  rational 
choice,  multiple  streams,  social  construction  and 
policy  design,  and  advocacy  coalition).  The  head¬ 
ing  of  one  chapter  includes  the  term  models,  that 
of  another  one  the  term  studies,  while  there  is  also 
a  chapter  with  the  term  approach  in  its  title. 

The  variety  of  labels  for  these  contributions  as 
expressed  in  the  chapter  titles  do  not  preclude 
similarities.  At  the  same  time,  however,  there  is  a 
difference  between  the  common  denominator  of 
theories,  on  the  one  hand,  and  the  variety  of  contri¬ 
bution  headings,  on  the  other.  This  does  underline 
Ostrom’s  plea  for  precision.  A  systematic  applica¬ 
tion  of  her  requirements  for  each  type  of  academic 
work  would  specify  the  particular  characteristics  of 
these  collected  theoretical  contributions.  Some  of 
them  then  might  appear  to  involve  the  elaboration 
of  a  framework  in  Ostrom’s  sense — like  the  one 
developed  by  herself — and  some  to  provide  a  model. 
For  several  contributions,  the  term  theoretical 


approaches  or  conceptual  lenses  would  seem  appro¬ 
priate.  These  are  steps  toward  the  formation  of  a 
theory.  In  any  case,  all  the  contributions  in 
Sabatier’s  volume  differ  from  the  stages  model, 
assessed  as  it  is  above  as  a  metatheoretical  frame¬ 
work  with  shortcomings. 

Governance  Research 

Relatively  early  in  research  on  the  policy  pro¬ 
cess  Jon  Pierre  and  Guy  Peters  have  explored  the 
consequences  of  the  concept  of  governance.  Related 
to  the  Latin  verb  gubernare  (steering),  the  origins 
of  the  term  lay  in  the  past.  In  the  last  decade  of  the 
20th  century,  however,  the  concept  moved  to  the 
forefront  of  both  public  discourse  and  the  study  of 
government.  Public  tasks  today  are  being  fulfilled 
in  a  broader  variety  of  forms  than  before.  As  Pierre 
and  Peters  indicate,  the  term  governance  covers  the 
whole  range  of  institutions  and  relationships 
involved  in  the  process  of  governing.  Nevertheless, 
in  a  democracy,  the  rule  of  law  is  working. 
Therefore,  it  is  still  government  that  normatively 
has  the  appropriate  legitimacy  to  steer  society. 
Providing  direction  to  society,  whether  directly  or 
indirectly,  is  what  Pierre  and  Peters  consider  to  be 
the  activity  of  central  governance. 

Looking  at  shared  sets  of  problem  definitions 
and  proposed  solutions,  Hill  and  Hupe  speak  of  a 
current  governance  paradigm.  This  is  seen  as  being 
different  from  previous  ones,  including  the  policy 
implementation  paradigm,  as  prevalent  particularly 
in  the  1970s.  As  characteristics  of  the  governance 
paradigm,  the  authors  distinguish  an  awareness  of 
context,  a  reassessment  of  traditional  public  values, 
a  rehabilitation  of  the  hierarchical  mode  of  govern¬ 
ing,  and  a  renewed  attention  to  issues  of  account¬ 
ability.  Perhaps  the  most  prevalent  feature  of  the 
governance  paradigm  is  the  attention  to  action. 
Public  officials  do  not  just  rely  on  the  intentions 
stated  by  agents  and  other  actors  in  the  public 
domain.  Rather,  they  want  to  secure  a  view  on  what 
those  actors  are  actually  doing  and  on  what  works. 

To  a  certain  extent,  labels  such  as  governance 
imply  the  use  of  different  lenses  to  look  at  the 
world.  The  term  governance  is  meant  to  capture  the 
reality  of  government  in  the  beginning  of  the  3rd 
millennium.  The  Oxford  Handbook  of  Public 
Management  and  similar  publications  reflect  the 
state  of  the  art  in  the  study  of  government.  It  is  there 
that  authors  try  to  assess  changes  in  the  object  of 
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study,  different  ways  of  looking  at  that  object,  and 
the  implications  of  relating  the  two.  In  particular, 
scholars  like  Geert  Bouckaert,  George  Frederickson, 
Christopher  Hood,  Laurence  Lynn,  and  Christopher 
Pollitt  have  addressed  the  issue  of  positioning  para¬ 
digmatic  concepts  such  as  new  public  management 
and  governance  in  the  actual  study  of  government. 
Their  assessments  are  attempts  to  position  new 
developments  in  the  field  of  study  overall. 

Next  to  this,  several  academics  have  reflected  on 
the  operational  implications  the  concept  of  gover¬ 
nance  might  have  for  theoretical-empirical  research. 
As  in  the  present  practice  of  public  administration, 
researchers,  too,  are  inclined  to  search  behind  the 
policy  on  paper.  This  may  certainly  be  the  case 
when  using  an  empirical  research  design  aimed  at 
causal  explanation.  In  the  view  of  Laurence 
O’Toole,  the  distinguishing  feature  of  the  concept 
of  governance  is  that  it  incorporates  a  more  com¬ 
plete  understanding  of  the  multiple  levels  of  action 
and  kinds  of  variables  that  can  be  expected  to  influ¬ 
ence  governmental  performance.  As  a  consequence, 
the  analysis  of  government  in  action  becomes  mul¬ 
tidimensional.  Such  an  analysis  fundamentally 
comprises  both  perceived  action  and  rhetorical 
justification.  It  looks  at  unintended  consequences  as 
well  as  official  documents.  The  focus  is  on  both 
observed  actual  situations  and  original  good  inten¬ 
tions.  Almost  by  definition,  levels  of  analysis  dif¬ 
ferent  from  the  one  looked  at  play  a  role.  Hence, 
the  study  of  government  in  action  implicates 
governance  research,  with  its  multidimensional 
perspective. 

The  consequence  then  is  that  researchers  see 
themselves  confronted  with  an  overwhelming  array 
of  factors.  Each  of  these  can  be  expected  to  contrib¬ 
ute,  one  way  or  another,  to  the  explanation  of  what 
is  observed  as  happening.  Therefore,  the  aim  to 
explain  empirical  variation  implies  a  need  to 
structure  the  vast  range  of  candidate  explanatory 
variables.  This  is  a  characteristic  of  multiple  multi¬ 
dimensionality  in  particular.  It  seems  to  make  social 
science  at  least  as  complex  as  natural  science.  In 
research,  the  isolation  of  a  few  variables,  necessary 
and  unavoidable,  goes  hand  in  hand  with  leaving 
aside  the  impact  of  factors  held  constant  under  a 
ceteris  paribus  clause.  Studying  government  in 
action  entails,  by  definition,  more  than  one  level  of 
analysis.  In  a  nested  configuration,  factors  play  a 
role  from  different  scales  of  aggregation.  In  a  given 


research,  focus  may  be  on  explaining  the  relations 
between  only  a  few  variables.  Even  then,  explicitly 
or  implicitly,  the  impact  of  a  vast  range  of  interme¬ 
diary  factors  on  various  dimensions  is  presupposed. 
This  variety  may  be  nominated  as  contextual  fac¬ 
tors.  They  may  have,  however,  a  possible  influence 
reaching  far  beyond  that  general  label,  and  it  is 
often  difficult  to  treat  them  as  held  constant. 

Aiming  at  enhancing  scientific  rigor,  analysts  of 
government  in  action  explore  various  options.  In 
implementation  theory  and  research,  authors  tried 
to  close  the  top-down/bottom-up  controversy  by 
making  a  case  for  developing  synthesizing  ap¬ 
proaches.  The  sources  for  identifying  relevant  fac¬ 
tors  were  no  longer  to  be  confined  to  either  the  top 
or  the  bottom  of  the  policy  process.  Scholars  like 
O’Toole  and  Kenneth  Meier  have  pointed  to  the 
fact  that  the  number  of  such  variables  hence  grew 
to  unmanageable  proportions.  Their  plea  for  parsi¬ 
mony  made  them  develop  a  formal  model  of  public 
management  containing  hypotheses  on  relations 
between  a  limited  number  of  variables.  Since  then, 
Meier  and  O’Toole  have  been  testing  this  model 
successfully  on  large  aggregates  of  data. 

Studying  the  Policy  Process 
as  Governance  Research 

The  Study  of  Government  in  Action 

The  concept  of  governance  can  be  seen  as  an 
umbrella  concept  aimed  at  capturing  the  complex¬ 
ities  of  public  policy  beyond  the  rationalist  assump¬ 
tions  of  the  stages  model.  If  that  stance  is  adopted, 
the  study  of  the  policy  process  in  the  contemporary 
world  can  be  conceived  as  governance  research. 

In  addressing  the  question  of  what  is  needed  for 
such  research,  the  three  criteria  for  an  analytical 
framework,  mentioned  above,  need  to  be  elaborated. 
Comprehensiveness  then  refers  to  an  orientation 
toward  capturing  the  whole  range  of  elements 
involved  in  the  collective  endeavors  to  turn  public 
intentions  into  public  achievements.  The  short¬ 
comings  of  the  stages  model  have  been  identified 
above.  Serving  as  an  alternative,  an  analytical 
framework  should  provide  the  possibility  of  cate¬ 
gorizing  and  positioning  relevant  factors  that  both 
influence  and  are  influenced  by  actions  taken  by 
actors  involved  in  governmental  affairs.  Parsons 
speaks  of  the  need  to  deal  in  research  with  multi¬ 
framed  activity  while  mapping  the  wider  contexts 
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of  problems,  social  processes,  values,  and  institu¬ 
tions  in  which  policy  making  takes  place. 

Enhancing  theory  formation,  the  second  crite¬ 
rion  for  an  analytical  framework,  means  the  pur¬ 
suit  of  scientific  rigor  by  grounding  and  testing 
theoretical  suppositions  while  explaining  causal 
relations.  The  third  criterion,  empirical  openness, 
implies  specification  and  differentiation  regarding 
all  aspects  of  the  empirical  cycle.  Most  analysts  in 
social  science  will  concur  with  the  normative  prin¬ 
ciples  of  the  rule  of  law,  democracy,  and  perhaps 
also  the  welfare  state.  Also,  many  analysts  will 
underline  the  necessity  of  observing  and  explaining 
social  reality  in  academic  research,  in  a  way  as 
normatively  unbiased  as  possible.  After  all,  improv¬ 
ing  the  world  can  benefit  from  understanding  it. 

Both  in  practice  and  in  theory,  the  notion  of 
stages  may  remain  prevalent.  It  is  a  metaphorical 
picture  fitted  to  sketch  the  range  of  activities  actors 
in  governmental  affairs  engage  in.  Nevertheless, 
some  authors  have  attempted  to  provide  alterna¬ 
tive  general  frameworks.  With  those  frameworks, 
questions  for  research  can  be  asked  and  answered 
while  enabling  the  systematic  study  of  government 
in  action.  When  put  alongside  the  criteria  men¬ 
tioned  above,  four  of  such  alternative  frameworks 
can  be  identified.  The  contributions  to  be  addressed 
here  come  from  different  scholarly  backgrounds. 
In  particular,  these  include  the  study  of  public 
management  (Laurence  Lynn,  Carolyn  Heinrich, 
and  Carolyn  Hill),  public  choice  (Ostrom),  and 
public  policy  (Parsons,  Hill,  and  Hupe). 

Multiple  Stages 

Parsons  divides  his  almost  700-page  textbook 
Public  Policy  into  four  parts.  He  calls  them  meta¬ 
analysis,  meso-analysis,  decision  analysis,  and  deliv¬ 
ery  analysis.  The  first  part  is  on  analyzing  analysis 
(meta-analysis).  After  that,  Parsons  distinguishes 
three  levels  or  dimensions  of  analysis,  both  broad 
and  partly  overlapping.  Meso-analysis  deals  with 
the  way  in  which  issues  and  problems  are  defined 
and  policy  agendas  are  set;  theoretical  approaches 
categorized  as  meso  link  the  input  side  and  output 
side  of  the  policy-making  process.  Decision  analysis 
refers  to  approaches  looking  at  how  decisions  are 
taken,  policies  are  formed,  and  analysis  is  used. 
In  Parsons’s  conceptualization,  delivery  analysis 
refers  to  how  policies  are  administered,  managed, 


implemented,  evaluated,  and  possibly  terminated. 
The  author  stresses  that  the  parts  of  the  book  are 
not  to  be  read  as  a  set  of  stages.  There  is  no 
sequence,  because  in  the  real  world  there  are  no 
defined  or  distinct  phases.  Parsons  suggests  reading 
his  textbook  in  an  iterative  rather  than  linear  way, 
by  going  backward  and  forward. 

Nested  Games  and  the  Logic  of  Governance 

In  The  Power  Elite,  C.  Wright  Mills  (1956)  dis¬ 
tinguishes  between  the  top,  the  middle,  and  the  bot¬ 
tom  levels  of  power.  Characterizing  governmental 
activity,  Lynn  refers  to  that  insight  while  distin¬ 
guishing  three  different  elements  he  calls  policy 
games.  They  relate  to  each  other  in  a  nested  con¬ 
figuration.  The  high  game  involves  deciding  whether 
there  is  a  role  for  government  in  a  given  situation.  In 
the  middle  game,  the  direction  of  the  policy  is  deter¬ 
mined.  The  low  game  concerns  the  practical  side  of 
policy  making,  where  implementation  is  central. 

More  recently,  Lynn  has  related  these  levels  of 
the  game  to  governance.  With  Carolyn  Heinrich 
and  Carolyn  Hill,  he  provides  what  they  call  a  logic 
for  governance  research.  The  authors  have  devel¬ 
oped  an  aggregated  framework,  aimed  at  directing 
attention  to  the  dynamic  relations  within  and 
between  the  institutional,  managerial,  and  opera¬ 
tional  levels  of  governance.  Lynn  and  his  colleagues 
argue  that  the  logic  of  governance  involves  a  hierar¬ 
chy  of  relationships  that  carry  over  from  one  level 
to  the  next:  those  between  (a)  citizen  preferences 
and  legislative  choice;  (b)  legislative  preferences  and 
formal  structures  and  processes  of  public  agencies; 
(c)  formal  authority  and  the  organization  and  man¬ 
agement  of  agencies,  programs,  and  administrative 
activities;  (d)  organization,  management,  and 
administration  and  the  core  technologies  and  pri¬ 
mary  work  of  public  agencies;  (e)  primary  work  and 
consequences,  outputs,  or  results;  (f)  outputs  or 
results  and  stakeholder  assessments  of  agency  or 
program  performance;  and  (g)  stakeholder  assess¬ 
ments  and  political  preferences  and  interests.  The 
results  of  governmental  activity  mentioned  in  (f)  are 
measured  in  terms  of  the  availability,  quality,  and 
cost  of  publicly  sponsored  goods  and  services.  Lynn 
et  al.  stress  that  this  is  a  heuristic  framework.  It 
enables  the  identification  of  relationships  poten¬ 
tially  influencing  governmental  performance.  When 
the  authors  first  presented  the  framework,  they 
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explicitly  stated  that  it  should  not  be  called  a  unified 
theory  of  governance.  They  stressed  that  making 
statements  about  the  actual  influence  of  the  identi¬ 
fied  relationships  requires  theory,  models,  and  data. 

Institutional  Analysis  and  Development 

In  a  lifelong  career,  Ostrom  has  been  interested 
in  how  institutions  affect  the  incentives  confronting 
individuals  and  their  resultant  behavior.  Standing 
in  the  tradition  of  institutional  analysis  from  a 
microeconomic  perspective,  in  an  article  written 
with  Larry  Kiser,  she  presented  the  institutional 
analysis  and  development  framework.  Kiser  and 
Ostrom  speak  of  a  multitier  conceptual  map. 
Indeed,  it  is  a  metatheoretical  framework  meant  to 
order  various  sorts  of  institutional  analysis.  The 
authors  refer  to  three  tiers  of  decision  making:  con¬ 
stitutional  choice,  collective  choice,  and  operational 
decisions.  Together,  these  form  what  is  addressed 
as  the  three  worlds  of  action.  The  constitutional 
level  explains  the  design  of  mechanisms  of  collec¬ 
tive  choice.  At  the  level  of  collective  choice  itself, 
collective  decisions  are  made  by  officials,  including 
citizens  acting  as  officials.  These  decisions  are 
aimed  at  influencing  actions  in  one  way  or  another. 
The  decision  making  at  this  level  includes  the 
authority  to  impose  sanctions.  At  the  operational 
level,  individuals  take  direct  action,  sometimes 
without  prior  agreement  with  other  individuals. 
This  framework  has  a  nested  structure.  Decisions  at 
one  level  are  linked  to  those  at  the  other  levels. 
Constitutional  decisions  establish  institutional 
arrangements.  Within  given  institutional  arrange¬ 
ments,  collective  choices  regarding  specific  issues 
are  made.  These  decisions  are  meant  to  guide  indi¬ 
vidual  action,  in  particular  at  the  operational  level. 
Collective  decisions  are  enforceable  against  non- 
conforming  individuals. 

Multiple  Governance 

The  distinction  between  the  three  tiers  of  deci¬ 
sion  making  refers  to  different  sets  of  actions.  It 
leaves  open,  however,  where  and  by  whom  this 
action  is  performed.  This  characteristic  of  the  insti¬ 
tutional  analysis  and  development  framework  was 
adopted  by  Hill  and  Hupe  and  applied  to  the  study 
of  governance.  Seeking  to  develop  an  analytical 
framework  as  an  alternative  to  the  stages  model, 


they  propose  the  following  metaquestion  to  precede 
governance  research:  Who  acts  where,  doing  what, 
on  which  scale,  and  how?  A  policy  process  is 
viewed  as  positioned  in  a  multidimensional  setting 
of  government  in  action.  Each  part  of  the  question 
refers  to  an  element  of  that  setting.  Successively — 
although  not  in  a  chronological  order — these  ele¬ 
ments  concern  actors  (who),  administrative  layers 
(where),  sets  of  activities  (what),  action  situations 
and  action  scales  (on  which  scale),  and  political- 
administrative  craftsmanship  (how).  Thus,  the 
authors  frame  the  policy  process  in  terms  of  gover¬ 
nance.  They  do  so  taking  their  lead  from  both  the 
three  worlds  of  action  as  conceptualized  by  Kiser 
and  Ostrom  and  the  governance  perspective  of 
Pierre  and  Peters.  In  the  meaning  of  government  in 
action,  governance  consists  of  three  broad  sets  of 
activities  called  constitutive,  directional,  and  opera¬ 
tional  governance.  Constitutive  governance  con¬ 
cerns  the  fundamental  decisions  about  the  creation 
of  institutional  settings.  Within  given  settings, 
actors  provide  direction.  Directional  governance 
stands  for  the  formulation  of  and  decision  making 
about  collectively  desired  outcomes.  Operational 
governance  concerns  the  actual  managing  of  the 
realization  of  those  outcomes.  Hence,  the  multiple 
governance  framework  gives  attention  to  dimen¬ 
sions  of  structure,  content,  and  process.  The  dis¬ 
tinction  between  creating  settings,  giving  direction, 
and  getting  things  done  is  addressed  as  the  trinity  of 
governing,  or  trias  gubernandi. 

The  multidimensional  character  of  the  frame¬ 
work  implies  that  in  research,  the  specific  focus 
and  locus  are  to  be  expressed  separately.  The  posi¬ 
tion  of  and  relations  between  the  sets  of  activities 
comprising  government  in  action  are  neither  hier¬ 
archically  nor  chronologically  predetermined. 
Those  activities  can  be  performed  at  any  spot  in 
the  public  domain.  A  series  of  empirical  questions 
follows:  Who  are  the  actors,  looked  at  on  which 
scale  (individuals,  organizations,  aggregate  sys¬ 
tems),  acting  on  which  administrative  layer,  de 
facto  engaged  in  what  kind  of  activities  here  and 
now,  and  with  which  degree  of  craftsmanship? 
When  implementation  is  conceived  of  as  opera¬ 
tional  governance,  research,  for  instance,  entails 
the  question  whether  the  concretely  observed 
action  concerns  rule  application  or,  rather,  rule 
setting.  Thus,  factual  policy  formation  may  be 
traced  where  policy  implementation  was  expected. 


Policy  Process,  Models  of  1949 


Alternatives  to  the  Stages  Model 

Above,  the  theoretical  approaches  of  the  pol¬ 
icy  process  were  addressed  that  Sabatier  charac¬ 
terized  as  more  promising  than  the  stages  model. 
All  show  a  sophisticated  level  of  elaboration  and 
have  been  applied  in  several  research  projects.  As 
such,  they  meet  the  requirements  for  being 
included  in  the  edited  volume.  Ostrom’s  contri¬ 
bution  appears  to  meet  her  own  criteria  for  a 
general  analytical  framework.  That  is  why  her 
institutional  analysis  and  development  frame¬ 
work  were  included  above,  as  well.  Also,  the 
logic  of  governance  framework  and  the  multiple 
governance  framework  show  the  characteristics 
of  general  analytical  frameworks.  Parsons’s  dis¬ 
tinction,  labeled  as  multiple  stages,  seems  to  have 
been  primarily  intended  to  function  in  a  didactic 
setting.  Aiming  at  comprehensiveness  (the  first 
criterion),  the  four  frameworks  share  a  metathe- 
oretical  character  rather  than  providing  a  sub¬ 
stantive  grand  theory  themselves.  At  the  same 
time,  they  may  function  as  a  heuristic  toward  the 
formation  of  such  theories  (the  second  criterion). 
As  far  as  empirical  openness,  the  third  criterion, 
is  concerned,  the  frameworks  invite  specifying 
what  needs  explanation.  All  four  enable  the  iden¬ 
tification  and  differentiation  of  different  sorts  of 
actors,  activities  or  actions,  layers,  and  other  ele¬ 
ments  (comprehensiveness).  In  particular,  the 
frameworks  metatheoretically  specify  the  rela¬ 
tions  between  those  elements,  even  if  they  are 
thought  of  as  a  hierarchy  (theory  formation). 
Finally,  the  frameworks  make  it  possible  to 
structure  the  multiple  dimensions  of  social  reality 
in  an  analytical  rather  than  a  prescribing  way 
(empirical  openness).  In  research,  understanding 
implementation  deficits  prevails  over  condemn¬ 
ing  them.  In  any  case,  the  observation  and  expla¬ 
nation  of  specified  behavior  precede  judgments 
on  general  shortcomings. 


Conclusion 

O’Toole  described  the  subject  matter  of  implemen¬ 
tation  studies  as  what  happens  between  the  estab¬ 
lishment  of  policy  and  its  impact  in  the  world  of 
action.  He  did  so  in  an  assessment  of  research  on 
policy  implementation.  In  a  certain  sense,  this 
description  goes  for  the  object  of  what  has  been 


presented  earlier  as  governance  research,  overall. 
Thus,  describing  the  object  of  the  study  of  govern¬ 
ment  in  action  in  general  presupposes  a  specific 
conceptualization  of  governance.  In  the  O’Toole 
definition,  the  latter  concept  is  designed  to  incor¬ 
porate  a  more  complete  understanding  of  the  mul¬ 
tiple  levels  of  action  and  kinds  of  variables  that  can 
be  expected  to  influence  performance.  When  gov¬ 
ernance  is  conceptualized  in  this  way,  it  has  conse¬ 
quences  for  research.  Among  others,  it  opens  the 
view  on  perspectives  beyond  policy  studies  in  the 
narrow  sense. 

In  the  realm  of  “what  happens,”  so  many  factors 
and  mechanisms  are  active  that  a  device  is  needed  to 
structure  the  multiple  varieties  of  dimensions  of  the 
research  object.  This  being  true  in  general,  it  is  even 
more  applicable  inasmuch  as  analysts  of  govern¬ 
ment  aim  at  enhancing  scientific  rigor.  Against  that 
background,  the  analytical  frameworks  identified 
here  can  function  as  alternatives  to  the  stages 
model.  They  enable  researchers  to  explore  more 
systematically  what  they  want  to  know,  how  to 
frame  that,  and  which  accumulated  insights  can  be 
deemed  relevant  to  incorporate.  The  orientation  in 
such  research  may  shift  away  from  merely  a  vertical 
one.  Confronting  public  achievements  with  the  pub¬ 
lic  intentions  as  laid  down  in  the  concerned  official 
policy  document  will  not  always  be  the  research 
objective.  Governance  research  implies  explaining 
variation  in  actual  behavior  and  its  impact  on  all 
sorts  of  governmental  performance.  Questions 
aimed  at  explanation  will  be  at  the  forefront.  They 
will  not  replace  judgments  on  public  policy  results 
as  disappointing  but  may  sometimes  postpone 
them.  At  least,  answering  such  questions  will  pro¬ 
vide  a  more  solid  ground  for  any  judgment. 

Analytical  frameworks  such  as  the  ones  pre¬ 
sented  earlier  may  not  immediately  appeal  to  prac¬ 
titioners  as  broadly  as  the  stages  model  has  done 
thus  far.  At  the  same  time,  the  frameworks  respond 
to  the  criticisms  of  that  model  concerning  its  useful¬ 
ness  in  research.  The  alternative  frameworks  may 
enhance  the  formation  of  (middle  range)  theories 
and  the  development  and  testing  of  causal  models. 
Then,  they  would  contribute  to  making  the  study  of 
government  more  theory  driven.  In  that  perspec¬ 
tive,  the  discovery  of  the  nested  character  of  the 
study  of  government  in  action  can  be  welcomed  as 
an  important  insight.  The  gain  of  explanatory 
power  particularly  will  be  greater  to  the  degree 
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researchers  succeed  in  the  coweighing  of  factors 
from  more  than  one  level  of  analysis. 

The  range  of  activities  involved  in  the  collective 
is  structured  analytically  with  the  intention  of 
turning  public  intentions  into  public  achievements. 
Thus,  the  stages  model  of  the  policy  process  was 
defined  at  the  beginning  of  this  entry.  That  so- 
called  model  appears  to  fulfill  functions  going  far 
beyond  causal  explanation.  In  fact,  this  academic 
function  proved  to  be  the  least  developed.  At  the 
same  time,  alternatives  appeared  to  be  available. 

A  pertinent  element  in  the  notion  of  the  stages 
model  regards  the  wish  to  have  complexity  reduced 
to  linearity.  Since  Lasswell  introduced  the  policy 
orientation,  particularly  in  the  1990s,  the  multidi¬ 
mensional  character  of  action  in  government  has 
been  acknowledged.  This  is  the  case  not  only  in 
policy  studies  but  also  in  institutionally  different 
parts  of  social  science.  The  aim  for  methodical  rigor 
is  now  more  widely  embraced.  Multidimensional! ty- 
cum-nonlinearity  seems  to  have  become  the  leading 
principle  in  governance  research. 

Both  Dewey’s  focus  on  the  public  and  its 
problems  and  Lasswell’s  policy  orientation  seek 
to  help  government  in  dealing  with  societal 
issues.  The  current  developments  in  research 
stress  academic  specificities.  Therefore,  the  wish 
to  contribute  to  those  practical  aims  seems  to 
have  been  put  aside.  In  fact,  this  is  not  the  case. 
The  gains  can  be  deduced  not  only  scientifically 
but  from  a  normative  perspective  as  well.  Society 
looks  to  becoming  ever  more  complex.  In  such  a 
context,  improving  government  needs  a  better 
understanding  of  that  government  in  action. 
Although  partly  perhaps  under  different  head¬ 
ings,  the  best  years  for  the  study  of  the  policy 
process  are  likely  still  to  come. 

Peter  L.  Hupe 
Erasmus  University  Rotterdam 
Rotterdam,  Netherlands 
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The  concept  of  political  class  is  based  on  the  idea 
that  politicians  constitute  a  group  with  special 
characteristics  that  distinguish  them  from  other 
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social  groups.  It  also  contends  that  the  members 
of  this  group  share  common  views  and  interests 
and  act  with  a  relatively  high  degree  of  cohesive¬ 
ness.  The  theory  of  the  political  class  posits  the 
study  of  this  phenomenon  as  a  crucial  instrument 
for  the  understanding  of  political  life.  The  main 
questions  that  have  been  developed  within  this 
broad  theoretical  field  are  as  follows:  Who  are  the 
members  of  the  political  class?  What  determines 
their  special  status?  How  and  on  what  bases  are 
they  recruited?  What  are  the  links  that  keep  them 
united?  What  are  the  forces  that  drive  their  behav¬ 
ior?  And  what  determines  the  decline  of  a  political 
class  and  its  substitution  by  another  one?  In  this 
entry,  the  origins,  definitions,  empirical  studies, 
and  contemporary  relevance  of  the  concept  of 
political  class  are  discussed. 

Origin  of  the  Theory  of  Political  Class 

The  concept  of  political  class  has  acquired  a  well- 
established  status  among  crucial  concepts  of 
modern  political  science  since  Gaetano  Mosca 
(1858-1941).  In  his  first  major  work,  Teorica  Dei 
Governi  E  Governo  Parlamentare  (the  title  can  be 
literally  translated  as  “A  Theory  of  Political 
Regimes  and  the  Parliamentary  Government”), 
published  in  1883  when  he  was  only  24  years  old, 
the  Italian  scholar  proposed  the  concept  of  political 
class  as  the  key  to  a  new  innovative  approach  to  the 
study  of  political  phenomena.  Mosca’s  ambitions 
were  not  only  of  a  substantive  nature — to  explore 
a  political  phenomenon  that  supposedly  was  not 
yet  well  understood — but  also  epistemological  and 
methodological.  An  admirer  of  the  extraordinary 
developments  of  the  natural  sciences  during  the 
19th  century,  he  aimed  to  overcome  the  unsatisfac¬ 
tory  state  of  political  studies  and  to  attain  a  more 
scientific  level  that  would  put  them  on  the  same 
level  with  the  more  established  disciplines.  The  key 
to  this  was  to  be  found  first  in  the  development  of 
a  sound  base  of  empirical  observations  to  support 
the  formulation  of  scientific  propositions  and  then 
in  the  ability  to  capture  the  true  substance  of  the 
political  phenomena  and  move  beyond  the  mere 
appearances  that  often  mislead  the  observers. 

To  achieve  the  first  objective,  Mosca  set  out  to 
exploit  the  rich  treasures  of  historical  knowledge 
that  in  the  course  of  the  19th  century  had  made 
enormous  advances  and  to  draw  from  them  the 


empirical  materials  for  a  comparative  analysis  of 
political  phenomena.  With  regard  to  the  second 
objective,  which  was  to  become  the  focal  point  of 
his  research,  the  proposal  he  advanced  was  to  go 
beyond  the  images  of  politics  currently  offered  by 
the  dominant  ideologies  (both  democratic  and  tra¬ 
ditionalist)  and  to  look  for  the  real  power  holders, 
that  is,  the  small  minorities,  or  what  he  defined  as 
the  “political  class.”  That  power  is  always  in  the 
hands  of  small  groups  of  people  was  the  funda¬ 
mental  postulate  of  Mosca’s  theory  of  the  political 
class.  According  to  his  analysis,  neither  large  num¬ 
bers  of  people,  such  as  a  democratic  demos,  nor 
single  individuals,  such  as  monarchs,  who  are 
often  described  by  current  interpretations  of  poli¬ 
tics  (and  especially  by  ideologies)  as  the  rulers  of 
different  political  regimes,  are  the  real  power  hold¬ 
ers.  They  always  need  a  relatively  small  number  of 
organized  people  to  conduct  (in  their  name)  the 
business  of  government. 

The  concept  of  class,  which  Mosca  sought  to 
adapt  to  the  analysis  of  political  reality,  had 
become  a  frequently  used  instrument  for  the  analy¬ 
sis  of  society  in  the  19th  century,  thanks  to  the 
works  of  sociologists  and  economists.  Among 
them,  Karl  Marx  (1818-1883)  and  Friedrich 
Engels  (1820-1895)  had  gained  special  promi¬ 
nence  because  of  their  intellectual  influence  on  the 
socialist  movement  and  because  of  the  enormous 
political  impact  of  some  of  their  ideas.  The  same 
had  happened  to  their  theory,  which  centered  on 
the  idea  of  class  as  a  social  group  based  on  the 
relationship  between  individuals  and  the  economic 
processes  of  production  but  also  stressed  the  role 
played  in  defining  such  a  group  by  the  conscious¬ 
ness  achieved  by  its  members  of  their  specific  con¬ 
dition.  Marx  and  Engels  were  not  the  only  scholars 
working  with  this  concept.  Two  other  important 
figures  of  modern  sociology,  Henri  de  Saint  Simon 
(1760-1825)  and  Auguste  Comte  (1798-1857), 
had  already  given  the  concept  of  class  an  impor¬ 
tant  place  in  their  social  theories.  The  attention  of 
the  two  French  authors  was  directed  toward  the 
new  upper  strata  of  the  industrializing  society, 
which  were  viewed  as  being  in  the  process  of  tak¬ 
ing  the  place  of  the  old  aristocratic  and  clerical 
ruling  classes.  In  particular,  they  considered  the 
technical  and  managerial  skills  developing  with  the 
modern  model  of  industrial  production  to  be  the 
crucial  features  of  the  new  ruling  class.  As  these 
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two  authors  focused  their  attention  on  the  role  in 
society  of  relatively  small  groups  in  the  top  posi¬ 
tions,  they  contributed  even  more  than  Marx  and 
Engels  to  paving  the  way  for  the  theory  of  the 
political  class  as  developed  by  Mosca  and  other 
social  scientists  of  his  time.  However,  similar  to 
Marx’s  theory,  their  interpretation  left  little  room 
for  the  political  dimension  and  was  fundamentally 
inspired  by  a  reductionism  of  politics  to  economics. 

Other  authors  had  paid  more  attention  to  the 
political  dimension  and,  often  combining  the 
descriptive  and  the  prescriptive  perspective,  had 
made  use  of  similar  concepts  as  instruments  for 
understanding  (and  evaluating)  politics.  Among 
them,  one  might  recall  Edmund  Burke  (1729-1797) 
and  his  analysis  of  “natural  aristocracies”;  Lorenz 
von  Stein  (1815-1890),  who  not  only  theorized  but 
also  promoted  the  reform  of  the  governing  elites  of 
Prussia;  or  Hippolyte  Taine  (1828-1893),  with  his 
criticism  of  professional  politicians  and  of  their  cor¬ 
ruption  and  the  exaltation  of  a  class  of  natural 
leaders.  With  his  discussion  of  the  party  machine 
and  the  role  of  the  political  managers,  the  American 
scholar  and  politician  John  Calhoun  (1782-1850) 
anticipated  many  future  analyses  of  the  new  demo¬ 
cratic  elites.  One  may  remember  also  the  American 
Commonwealth  (1888)  by  the  British  scholar 
James  Bryce  (1838-1922),  who  began  to  system¬ 
atically  analyze  the  new  type  of  professional  politi¬ 
cians  operating  in  the  American  parties  and  saw 
their  role  as  not  incompatible  with  the  principles 
of  democracy.  The  writings  of  these  and  other 
authors  suggest  that  the  idea  was  starting  to  gain 
ground;  none  of  them,  however,  gave  to  this  theme 
such  a  central  place  as  Mosca  did  and  as  Vilfredo 
Pareto  and  Robert  Michels  would  do  later  on. 
While  scholars  such  as  Saint  Simon,  Comte,  and 
Marx  looked  at  society  in  general,  in  his  first 
work,  Mosca  was  already  focusing  his  analysis  on 
the  specific  developments  of  the  political  system. 
His  attention  was  particularly  attracted  to  the 
development  of  parliamentary  government  in  Italy 
as  in  many  other  European  countries  and  to  the 
striking  contradictions  between  the  (democratic) 
theory  of  the  regime  and  the  (oligarchic)  practice  it 
displayed.  At  the  same  time,  a  central  preoccupa¬ 
tion  of  Mosca,  when  he  started  developing  his 
theory  of  the  political  class,  was  to  establish  the 
study  of  politics  on  more  solid  grounds.  Writing  at 
the  time  of  great  advances  in  the  natural  sciences 


and  when  economics  was  well  on  its  way  to 
becoming  an  increasingly  professionalized  disci¬ 
pline,  Mosca  deplored  the  underdevelopment  of  a 
true  science  of  politics.  On  the  one  hand,  he  criti¬ 
cized  the  predominant  weight  of  “vulgar  beliefs”: 
a  priori  judgments  based  on  prejudices  and  non- 
systematic  evidence.  On  the  other  hand,  he 
expressed  his  disapproval  of  the  excessive  influ¬ 
ence  of  the  legal  disciplines,  with  their  attention  on 
the  purely  formal  dimension  of  rules,  or  of  ideolo¬ 
gies  (in  particular  the  democratic  ones).  In  his 
view,  the  scientific  study  of  politics  needed  a  pro¬ 
found  renewal  and  should  aim  at  identifying  the 
general  laws  of  political  phenomena.  This  would 
require  a  rich  empirical  basis  of  observations  as 
well  as  more  sophisticated  conceptual  and  theo¬ 
retical  tools.  With  regard  to  the  problem  of  empir¬ 
ical  evidence,  Mosca  thought  that  the  great 
advances  made  by  historical  studies  during  the 
19th  century  could  be  instrumental  in  providing  a 
systematic  source  of  data.  The  idea  that  political 
science  could  directly  produce  its  own  data  was 
not  yet  in  his  mind.  The  problem  was  to  make 
good  use  of  these  empirical  materials,  and  the  key 
to  accomplishing  this  was  to  discover  behind 
appearances  and  purely  legal  schemes  what  the 
true  mechanisms  of  political  life  were. 

Starting  from  a  criticism  of  the  traditional 
Aristotelian  classification  of  regimes  based  on  the 
number  of  rulers,  Mosca  formulated  a  sort  of  iron 
law  of  politics.  According  to  this  “law,”  all 
regimes,  in  spite  of  their  variable  legal  formula¬ 
tions  and  institutional  arrangements,  were  ruled 
by  a  “political  class.”  What  characterized  the 
political  class  was  the  relatively  small  number  of 
its  members,  the  superiority  of  their  moral 
resources,  and,  in  particular,  the  organizational 
factor  that  enabled  a  cohesive  minority  to  prevail 
over  the  “great  numbers”  who  were  lacking  the 
organizational  instruments  needed  to  act  purpo- 
sively  and  cohesively  (Teorica  dei  Governi,  chap. 
1).  Having  affirmed  that  these  were  the  common 
features  of  all  political  classes,  Mosca  then  speci¬ 
fied  that  variations  were  essentially  determined  by 
the  peculiar  qualities  of  each  political  class.  If 
membership  to  a  political  class  is  always  motivated 
by  a  natural  passion  for  power,  the  special  resources 
or  qualities  that  enable  an  individual  to  be  a  part 
of  that  class  vary  according  to  the  stages  of  civili¬ 
zation.  Different  types  of  political  classes  are 
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defined  by  the  specific  resources — military  force, 
economic  wealth,  family  ties,  personal  merit,  and 
ability  to  win  the  support  of  the  voters — that  char¬ 
acterize  them.  In  his  first  work,  Mosca  developed 
also  another  important  theme  for  the  theory  of  the 
political  class,  what  he  called  the  “political  for¬ 
mula”  ( formula  politico).  With  this  concept,  he 
designated  the  specific  discourse  that  any  political 
class  uses  to  justify  its  political  power  and  to  estab¬ 
lish  its  legitimacy.  Mosca  was  keen  to  affirm  that, 
contrary  to  commonly  held  beliefs,  these  legal, 
moral,  and  political  arguments  were  not  the  real 
bases  of  political  power.  In  fact,  they  served  more 
as  instruments  for  disguising  the  real  source  of 
power.  At  the  same  time,  they  played  a  far  from 
negligible  role  in  promoting  broader  bases  of  con¬ 
sent  for  an  existing  power  arrangement. 

These  original  ideas  were  further  developed  by 
Mosca  in  his  later  work  (Elementi  di  Scienza 
Politico  [The  Ruling  Class],  1895/1923)  but  with¬ 
out  many  fundamental  changes.  There,  he  more 
clearly  stated  his  criticism  against  traditional  clas¬ 
sifications  of  regimes  such  as  those  that  consider 
the  possibility  that  one  person  (monarchy)  or  the 
majority  of  the  people  (democracy)  rule  a  country. 
In  both  cases,  this  is  only  the  appearance,  while  the 
reality  is  that  of  a  government  by  a  small  group 
that  either  enables  the  monarch  to  actually  govern 
a  country  or  takes  the  place  of  the  people  in  the 
real  conduct  of  decisions  (Pt.  1,  chap.  2).  The  cir¬ 
culation  of  political  classes  is  another  important 
topic  in  the  theory  of  the  political  class.  If  old 
political  classes  decline  and  new  ones  take  their 
place,  how  is  this  process  to  be  explained? 
According  to  Mosca’s  perspective,  this  discussion 
hinges  on  the  analysis  of  the  specific  resources  that 
characterize  each  political  class.  The  crucial  factor 
for  the  continuity  of  a  political  class  is  the  ability 
of  the  ruling  class  to  maintain  its  control  over 
critical  resources.  Conversely,  what  determines  its 
substitution  is  the  ruling  class’s  inability  to  face  the 
challenge  of  a  new  class,  which  in  turn  proves 
more  able  to  exploit  new,  more  effective  resources. 
Political  classes  tend  to  close  themselves  to  exter¬ 
nal  influences  and  thus  run  the  risk  of  a  progres¬ 
sive  impoverishment;  this  leaves  space  for  the  rise 
of  new  groups  that  control  the  new  resources  that 
have  become  more  important  following  the  changes 
in  the  state  of  civilization.  In  this  discussion,  the 
temporal  perspective  adopted  is  grandiose:  It 


stretches  from  the  antiquity  of  Assyrians,  Greeks, 
and  Romans  to  the  feudal  ages,  to  recent  develop¬ 
ments  of  representative  democracy.  It  is  not  the 
small  changes  taking  place  over  the  years  or 
decades  but  the  macrotransformations  developing 
over  centuries  that  stimulated  the  interest  of 
Mosca. 

In  the  second  part  of  his  later  and  more  mature 
work,  the  Elementi  di  Scienza  Politico ,  Mosca 
addressed  a  possible  criticism  of  his  theory  of  the 
political  class — that  is,  that  it  was  based  on  an 
extreme  level  of  generalization.  If  every  political 
system  is,  by  definition,  ruled  by  a  political  class 
that  also  determines  its  life  and  death,  the  risk  of 
ending  up  in  a  circular  and  dogmatic  reasoning 
may  indeed  be  significant.  To  avoid  this,  Mosca 
suggested  that  further  systematic  studies  be  done 
to  analyze  both  the  constant  and  the  variable  fea¬ 
tures  of  political  classes.  Mosca  felt  that  such  stud¬ 
ies  would  have  highlighted  the  differences  among 
political  classes  and  would  have  revealed  the  rea¬ 
sons  for  their  duration  or  their  downfall  ( Elementi , 
Pt.  2,  chap.  1).  In  the  chapters  that  followed, 
Mosca  started  to  explore  the  development  of  dif¬ 
ferent  political  classes  but  still  on  a  very  broad 
historical  scale.  In  his  analysis,  Mosca  used  four 
conceptual  categories  (autocratic,  liberal,  aristo¬ 
cratic,  and  democratic)  and  proposed  two  pairs  of 
alternatives  (autocratic/liberal  and  aristocratic/ 
democratic).  With  the  first  pair,  Mosca  distin¬ 
guished  between  a  regime  based  on  power  origi¬ 
nating  from  above  (autocracy)  and  one  based  on 
power  originating  from  below  (liberalism);  with 
the  second  pair,  he  distinguished  between  regimes 
based  on  the  tendency  of  the  ruling  class  to  be 
closed  toward  the  outside  (the  aristocratic  princi¬ 
ple)  or  to  be  open  to  new  entrants  (the  democratic 
principle). 

In  about  the  same  period,  another  Italian 
scholar  proposed  a  not  too  different  version  of  the 
theory  of  the  political  class.  The  sociologist 
Vilfredo  Pareto  (1848-1923),  a  former  business 
manager  with  a  scientific  background,  developed 
an  ambitious  general  theory  of  society  in  his 
Trattato  di  Sociologia  Generate  (The  Mind  and 
Society;  1916).  Some  of  the  basic  points  had  been 
already  anticipated  by  him  in  a  1903  book,  Les 
systemes  socialistes  (The  Socialist  Systems),  as  well 
as  in  other  articles,  and  he  and  Mosca  contended 
rather  polemically  about  the  original  authorship  of 
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the  idea.  Pareto,  who  preferred  the  term  elite  to 
that  of  political  class,  attempted  a  more  formalized 
and  systematic  theory  of  societies  and  of  the  cor¬ 
responding  elites.  His  theory  started  from  an 
analysis  of  individual  behavior  and  incorporated 
many  elements  of  a  psychology  of  social  action. 
Yet,  in  spite  of  all  the  differences,  he  shared  with 
Mosca  (and  with  some  of  his  contemporaries) 
some  central  features  and  themes,  such  as  the  need 
to  distinguish  between  reality  and  the  ideological 
embellishment  of  it,  the  “grand  theory”  approach 
embracing  the  whole  known  history  of  mankind  as 
a  source  of  evidence,  and  many  ideas  about  the 
circulation  of  elites/political  classes.  While  Mosca 
had  stressed  the  significance  of  the  organizational 
factor  for  the  predominance  of  the  political  class, 
Pareto  was  keener  to  discuss  the  different  types  of 
qualities  (residui  as  he  defined  them)  that  different 
elites  would  share.  Not  very  dissimilar  from 
Mosca,  he  too  developed  the  theme  of  the  instru¬ 
ments  of  justification  (which  he  called  derivazioni ) 
that  elites  use  to  manipulate  the  masses  and  to 
strengthen  their  domination  over  them. 

The  grand  theory  approach  adopted  by  both 
Mosca  and  Pareto  was  indeed  fascinating  and 
probably  contributed  to  the  appeal  of  their  con¬ 
cepts.  However,  it  also  produced  some  significant 
shortcomings  that  reduced  the  final  impact  of  their 
contribution  to  contemporary  political  science. 
Probably  the  most  important  one  is  that  the  level 
of  abstraction  of  their  analysis  was  such  that  a  true 
empirical  testing  and  refutation  of  their  “laws” 
proved  almost  impossible.  Moreover,  the  evidence 
used  by  the  two  authors  was  essentially  secondary 
and  was  drawn  from  historical  researches  of  a  very 
broad  range.  Their  analysis  thus  depended  essen¬ 
tially  on  “data”  that  had  not  been  collected  with 
their  specific  purpose  in  mind  and  therefore  were 
not  always  really  suitable  for  offering  a  sound 
empirical  support  to  their  theories.  This  lack  of 
empirical  data  collected  explicitly  for  the  analysis 
of  the  political  class  was  a  particularly  serious 
handicap  for  a  correct  interpretation  of  the  new 
developments  of  political  life  that  were  taking 
place  as  a  result  of  the  processes  of  democratiza¬ 
tion  and  parliamentarization  (which  both  Mosca 
and  Pareto  observed  with  a  high  degree  of  skepti¬ 
cism).  It  must  also  be  added  that  their  theory, 
which  started  from  a  well-grounded  criticism  of 
the  legalistic  analysis  of  institutions,  went  too  far 


in  the  opposite  direction  and  therefore  missed  a 
crucial  element  for  the  understanding  of  the  new 
democratic  regimes:  the  interrelationships  between 
institutions  and  political  actors.  The  seminal 
importance  of  their  ideas  remains,  however,  unaf¬ 
fected  by  these  shortcomings,  and  the  theory  of  the 
political  class  contributed  to  producing  a  true 
revolution  in  the  study  of  politics. 

Around  the  same  time,  other  authors  were 
applying  their  basic  ideas  to  a  more  middle-range 
level  of  analysis  and  thus  brought  the  theory  of  the 
political  class  closer  to  empirical  reality.  Moisei 
Ostrogorski  ( La  Democratic  Et  Les  Partis  Poli- 
tiques  [Democracy  and  Political  Parties],  1902) 
and  Robert  Michels  (Soziologie  des  Parteiwesens 
[On  the  Sociology  of  Political  Parties],  1911)  most 
significantly  contributed  to  the  development  of  a 
systematic  analysis  of  the  new  phenomenon  of  the 
organized  mass  parties  that  were  beginning  to 
leave  a  strong  mark  on  political  life  in  the  United 
States  and  in  some  countries  of  Europe.  In  the 
analysis  of  these  new  political  actors,  these  authors 
paid  special  attention  to  the  “political  class”  that 
was  produced  within  the  parties  and  to  the  novel 
features  it  displayed.  Michels’s  contribution  had  a 
particularly  significant  impact  on  the  growing 
debate  about  the  nature  and  working  of  modern 
democracy  and  the  role  within  it  of  one  of  its  cru¬ 
cial  actors,  the  party.  His  book  on  the  sociology  of 
the  political  party  was  a  work  drawing  on  the 
direct  observation  of  real  parties  and  thus  based  on 
empirical  evidence  that  was  much  closer  to  real-life 
politics  than  that  used  by  Mosca  and  Pareto.  Taking 
the  socialist  parties  and  in  particular  the  German 
Social  Democratic  Party  (Sozialdemokratische 
Partei  Deutschlands,  SPD)  as  the  main  source  of  his 
evidence,  the  German  Italian  author  sought  to  dem¬ 
onstrate  that  even  in  the  most  apparently  demo¬ 
cratic  parties,  the  predominance  of  a  small  group  of 
more  powerful  people  could  not  be  avoided.  He 
thus  found  a  sounder  empirical  support  for  Mosca’s 
and  Pareto’s  ideas.  Michels  highlighted  that  while 
“in  theory”  the  whole  governing  process  in  the 
socialist  parties  was  regulated  according  to  demo¬ 
cratic  principles,  in  practice  the  elected  officials 
who  ruled  these  large  organizations  were  “driv¬ 
ing”  the  democratic  process  of  selection  and, 
thanks  to  their  resources  and  position,  could 
manipulate  the  choices  of  the  great  mass  of  party 
members.  Organization  and  its  logic  were,  in  fact, 
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prevailing  over  democracy  in  the  real  working  of 
the  party.  Michels,  then,  making  good  use  of  his 
deep  insider’s  knowledge  of  the  working  of  the 
socialist  parties  of  Europe,  could  show  the  con¬ 
crete  mechanisms  that  were  offering  the  leaders  an 
overwhelming  position  of  advantage  vis-a-vis  the 
common  members.  Elements  such  as  duration  in 
office,  better  education,  and  technical  competence 
all  contributed  to  the  creation  of  a  strong  power 
differential  between  the  party  professionals  and 
the  rank  and  file. 

Some  of  the  consequences  for  democracy  that 
Michels  had  inferred  from  his  findings  have  been 
since  then  seriously  questioned.  In  particular,  the 
skeptical  conclusions  about  the  possibility  of 
democracy  that  he  derived  on  the  basis  of  his 
observation  of  the  inner  life  of  parties  have  been 
found  to  be  insufficiently  grounded  (e.g.,  by 
Seymour  Lipset  in  his  1962  introduction  to  the 
American  translation  of  Michels’s  book).  An 
important  stream  of  reflections  on  the  working 
mechanisms  of  contemporary  democracy,  devel¬ 
oped  in  the  wake  of  Joseph  Schumpeter  by  Robert 
Dahl,  Giovanni  Sartori,  and  Juan  Linz,  have  high¬ 
lighted  the  need  to  distinguish  between  “democ¬ 
racy  within  a  party”  and  “democracy  among  the 
parties.”  If  democracy  within  the  parties  may 
indeed  be  contradicted  by  the  existence  of  an  inter¬ 
nal  oligarchy,  democracy  as  a  system  of  responsi¬ 
ble  and  responsive  government,  instead,  depends 
on  the  existence  of  healthy  competition  among 
parties,  which  makes  them  responsive  to  the 
demands  of  the  electorate.  The  theory  of  competi¬ 
tive  democracy  can  therefore  incorporate  the  exis¬ 
tence  of  elites/political  classes,  provided  that  their 
pluralism  is  ensured  and  that  all  the  institutional 
mechanisms  of  electoral  competition  are  fully 
working.  Whatever  the  shortcomings  of  his  analy¬ 
sis,  it  can  be  rightly  said  that  Michels’s  work  has 
made  a  crucial  contribution  to  the  empirical  study 
of  parties  and  of  their  organization,  a  field  that  has 
become  one  of  the  major  areas  of  contemporary 
political  science. 

Along  not  too  dissimilar  lines  and  around  the 
same  time,  in  his  lecture  at  the  University  of  Munich 
on  Politik  als  Beruf  (Politics  as  a  Vocation ,  1918), 
Max  Weber  (1864-1920)  was  bringing  into  light 
the  role  of  the  professional  politician.  In  his  view, 
both  the  modern  state  and  representative  govern¬ 
ment  increasingly  needed  a  stratum  of  politicians 


fully  dedicated  to  the  performance  of  public  func¬ 
tions.  In  modern  representative  systems,  to  count 
on  “rentiers”  and  “amateurs”  as  the  preferential 
recruitment  pool  of  politicians,  as  had  been  com¬ 
mon  in  the  past,  was  for  a  number  of  reasons 
becoming  both  impossible  and  unacceptable.  The 
situation  required  the  development  of  a  class  of 
professional  politicians  living  not  only  “for  poli¬ 
tics”  but  also  “off  politics” — drawing  their  eco¬ 
nomic  resources  from  paid  public  office.  In  the 
requirements  of  the  large  organizations  of  modern 
times,  Weber  saw  the  development  of  a  new  class  of 
politicians  who  would  find  in  politics  not  only  their 
main  engagement  but  also  a  very  concrete  economic 
interest.  Politics  was,  in  many  ways,  becoming  a 
profession,  thus  coming  to  share  some  of  the  typical 
features  of  any  other  professional  career. 

After  the  “Classics” 

In  Europe,  these  early  developments  of  a  new 
political  science  and  of  themes  crucial  for  the 
understanding  of  modern  political  regimes  were  to 
a  large  extent  stymied  by  the  advent  of  totalitarian 
or  authoritarian  regimes,  which  made  the  life  of 
this  scientific  discipline  increasingly  difficult.  In  the 
United  States,  however,  the  development  of  politi¬ 
cal  science  could  continue  uninterrupted,  and 
some  of  the  stimuli  of  the  theories  of  the  political 
class  could  find  a  fertile  terrain.  Harold  Lasswell’s 
book  Politics:  Who  Gets  What,  When,  How 
(1936)  and  James  Burnham’s  The  Machiavellians 
(1943)  were  very  much  under  this  influence.  After 
World  War  II,  a  rediscovery  of  the  “elitists,”  as 
they  were  sometimes  collectively  called,  also 
became  possible  in  Europe.  This  rediscovery  had 
to  overcome  some  not  entirely  unfounded  criti¬ 
cisms  about  their  alleged  sympathies  for  authori¬ 
tarian  leaders  and  their  more  well-documented 
skepticism  about  democracy.  But  the  themes  they 
had  addressed  soon  regained  the  foreground  in 
contemporary  political  science. 

The  new  developments  of  what  we  may  call  the 
“postclassic  studies  of  the  political  class”  took  two 
different  directions.  On  the  one  hand,  there  was  a 
continuation  of  macropolitical  analyses  of  contem¬ 
porary  societies  guided  by  the  aspiration  to  detect 
which  were  their  “ruling  minorities”  and  to  dis¬ 
cuss  how  much  this  phenomenon  could  be  seen  as 
compatible  with  democracy.  This  line  of  research 
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has  had  important  spillovers  on  the  scientific 
debate  about  the  interpretation  of  contemporary 
democracy  and  the  understanding  of  its  central 
mechanisms.  On  the  other  hand,  a  new  stream  of 
micropolitical  analyses  of  the  components  of  the 
political  classes  of  democratic  regimes  (but  also  of 
some  nondemocratic  regimes)  and  of  their  distin¬ 
guishing  features  began  to  burgeon.  This  second 
stream  could  take  advantage  of  the  behavioral 
revolution  and  of  its  program  of  quantitatively 
oriented  empirical  analysis  conducted  on  the  indi¬ 
vidual  level. 

In  the  first  direction,  a  series  of  studies  flourished 
in  the  1950s  and  1960s  that  were  intent  on  detect¬ 
ing  who  were  the  real  rulers  of  Western  societies 
behind  the  appearances  of  democracy.  C.  Wright 
Mills’s  book  The  Power  Elite  (1956)  had  a  special 
influence  on  this  perspective.  Contrary  to  the  plu¬ 
ralist  theories  describing  an  equilibrium  among  a 
broad  range  of  interests,  Mills  tried  to  demonstrate 
that  a  coalition  of  top  managers  of  economic  firms, 
high-level  military  officers,  and  government  politi¬ 
cians,  closely  linked  by  similar  social  backgrounds 
and  institutional  proximity,  was  creating  a  new 
“superelite”  that  was  able  to  dominate  the  political 
life  of  the  United  States.  This  type  of  studies  had  to 
face  criticisms  that  are  not  too  dissimilar  from 
those  addressed  to  the  classics  of  elitism.  Among 
them,  particularly  relevant  is  the  criticism  pointing 
at  the  difficulty  of  empirically  validating  or  invali¬ 
dating  the  central  hypothesis  of  these  studies — that 
is,  the  existence  of  a  united  group  of  rulers  acting 
cohesively.  Other  works  have  tried  to  challenge 
Mills’s  theory  by  way  of  a  more  empirical  analysis 
of  concrete  political  situations.  An  example  of  spe¬ 
cial  importance  was  the  book  Who  Governs ?  by 
Dahl  (1961),  which  moved  the  analysis  more 
clearly  toward  the  empirical  ground  and  could  be 
seen  as  a  response  to  Mills.  Other  books  (e.g., 
Who  Rules  America ?  by  G.  William  Domhoff, 
1967)  and  discussions  followed,  but  this  stream  of 
studies  has  progressively  exhausted  itself. 

Partially  linked  to  this  discussion  is  the  debate 
about  the  nature  of  contemporary  democracy  that 
developed  after  World  War  II.  The  central  theme 
of  the  debate  has  concerned  the  defining  elements 
of  this  political  regime  and  the  compatibility 
between  the  existence  of  a  relatively  small  number 
of  individuals  (elite/political  class)  who  occupy  the 
positions  of  power  and  thus  rule  over  the  greatest 


mass  of  people  and  the  deontological  tenets  of  the 
democratic  idea.  Against  the  positions  of  those 
who  see  in  the  existence  of  elites  a  denial  of  the 
principles  of  equality  and  popular  government  and 
consequently  of  democracy  itself,  there  emerges 
the  theory  of  “competitive  democracy.”  This  the¬ 
ory  argues  for  the  need  to  reformulate  the  concept 
of  democracy  from  that  of  “a  government  of  the 
people”  to  that  of  “a  government  controlled  by  the 
people,”  wherein  the  existence  of  a  political  class 
is  not  an  obstacle  provided  that  it  has  a  plural 
character.  When  these  conditions  are  satisfied, 
competition  among  the  components  of  the  politi¬ 
cal  class  enables  the  citizens  to  have  a  choice  and 
thus  to  trigger  the  mechanism  of  responsiveness  by 
which  the  rulers  are  compelled  to  respond  to  the 
demands  of  the  people.  The  activation  of  this 
response  mechanism  becomes  the  real  substance  of 
modern  democracy.  This  new  concept  of  democ¬ 
racy,  which  finds  its  foundations  in  Schumpeter’s 
original  formulation  ( Capitalism ,  Socialism  and 
Democracy,  1942)  and  is  further  developed  by, 
among  others,  Sartori  ( Theory  of  Democracy 
Revisited,  1987),  has  become  the  predominant 
paradigm  in  contemporary  political  science  and 
has  originated  a  large  stream  of  theoretical  and 
empirical  research  elaborating  on  the  relationship 
between  rulers  and  the  ruled. 

On  a  somewhat  less  general  level  of  analysis,  we 
must  also  highlight  the  persisting  exchanges  between 
the  theory  of  the  political  class  and  the  study  of  par¬ 
ties.  This  is  not  new,  as  the  two  domains  of  research 
have  shown  a  significant  degree  of  proximity  since 
Ostrogorski  and  Michels.  The  reasons  are  rather 
evident:  Parties  as  crucial  actors  in  contemporary 
democracies  contribute  significantly  to  the  forma¬ 
tion,  selection,  and  structuring  of  the  political  class. 
While  Michels  concentrated  his  attention  on  the 
(oligarchic)  structure  that  party  elites  acquired  in 
the  mass  parties,  other  approaches  have  focused  on 
the  variable  features  that  the  leading  party  person¬ 
nel  display.  These  approaches  have  used  this  facet 
as  a  crucial  element  for  identifying  different  party 
types.  Starting  with  Weber  and  his  analysis  of  the 
party  of  notables  (the  Honoratioren  Partei),  a 
stream  of  political  scientists  from  Maurice  Duverger 
to  Otto  Kirchheimer,  Angelo  Panebianco,  and 
finally  Richard  Katz  and  Peter  Mair  have  high¬ 
lighted  the  parallels  between  changes  in  the  modes 
of  electoral  representation,  patterns  of  social 
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embeddedness,  and  linkages  with  democratic  insti¬ 
tutions  that  characterize  the  different  types  of  par¬ 
ties  (the  parliamentary  party,  the  organized  mass 
party,  the  catchall  party,  the  electoral-professional 
party,  and  the  cartel  party)  and  the  types  of  politi¬ 
cians  they  produce  (notables,  party  bureaucrats, 
professionals  of  electoral  and  media  techniques, 
politicians  entrenched  in  public  office,  etc.).  Such 
typological  exercises  have  opened  the  way  to  inter¬ 
esting  explorations  on  the  mutable  resources,  con¬ 
straints,  perspectives,  and  goals  that  characterize  the 
different  types  of  party  politicians  and  contribute  to 
explaining  their  behavior  and  that  of  their  parties. 

Empirical  Studies  of  the  Political  Class 

In  a  more  strictly  empirical  direction,  the  theory  of 
the  political  class  has  stimulated  a  large  stream  of 
studies  that  have  engaged  in  the  analysis  of  the 
components  of  the  stratum  of  politicians  who 
occupy  the  top  places  of  democracies  and  other 
regimes.  From  the  high  peaks  of  the  grand  theory 
of  the  “classics,”  these  studies  have  descended  to 
explore  the  terrain  of  the  specific  individuals  who 
compose  the  political  class  and  of  their  observable 
features.  The  highly  abstract  hypotheses  and  gen¬ 
eralizations  of  Mosca  and  Pareto  have  been  trans¬ 
lated  into  empirical  puzzles.  For  understandable 
reasons  and  particularly  for  their  greater  openness 
to  public  scrutiny,  the  political  classes  of  demo¬ 
cratic  regimes  have  received  a  comparatively 
greater  coverage.  But  some  important  studies  of 
the  communist  elites,  or  of  military  rulers,  have 
also  been  conducted. 

A  crucial  problem  that  empirical  studies  of  the 
political  class  have  had  to  face  is  that  of  the  iden¬ 
tification  of  its  components.  Who  are  the  members 
of  the  political  class?  Where  are  the  borders  to  be 
drawn  between  those  who  belong  to  it  and  those 
who  do  not?  Flow  large  is  its  membership?  These 
questions  have  an  obvious  methodological  dimen¬ 
sion:  When  we  want  to  study  the  political  class  of 
a  country  and  analyze  its  specific  features,  how  can 
we  identify  its  components?  To  solve  these  dilem¬ 
mas,  empirical  studies  have  typically  adopted  one 
of  three  methods:  the  positional,  the  reputational, 
and  the  decisional  one.  According  to  the  first 
method,  members  of  the  political  class  are  essen¬ 
tially  defined  by  the  position(s)  they  hold:  a  seat  in 
parliament,  a  ministerial  office,  membership  of  a 


party  executive  committee,  and  so  on.  According 
to  the  second,  the  reputation  is  the  best  way  to 
ascertain  who  are  the  members  of  the  political 
class:  By  asking  people  who  are  knowledgeable 
and  close  to  the  top  to  indicate  those  who  exert  the 
greatest  influence  in  a  given  country  and  by  select¬ 
ing  the  names  that  obtain  the  greatest  numbers  of 
nominations,  it  is  possible  to  identify  the  members 
of  the  ruling  elite.  The  third  method  identifies  the 
members  of  the  political  class  by  looking  at  the 
decision-making  processes  and  finding  out  who 
takes  a  dominant  part  in  it.  The  three  approaches 
are  based  on  somewhat  different  theoretical  inter¬ 
pretations  of  politics  itself,  and  each  of  them  has 
practical  advantages  and  shortcomings.  The  posi¬ 
tional  method  is  based  on  the  idea  that  political 
power  in  advanced  societies  is  typically  embedded 
in  institutions  and  that  institutional  positions  are 
thus  fairly  dependable  indicators  of  a  powerful 
role.  An  obvious  advantage  of  this  method  is  that, 
in  general,  it  is  fairly  easy  to  identify  the  positions 
and  the  persons  occupying  them.  This  method, 
however,  underestimates  the  potential  relevance  of 
noninstitutional  positions,  of  informal  power,  and 
may  then  run  the  risk  of  overestimating  the  impor¬ 
tance  of  some  formal  positions.  The  reputational 
method  rests  on  the  assumption  that  influence  is 
based  on  an  informal  relational  system  rather  than 
on  the  holding  of  formal  positions  of  authority  and 
that  its  reality  is  well  perceived  by  informed 
observers.  This  method  seems  better  suited  to 
catch  the  “real”  dimension  of  power  and  to  avoid 
some  of  the  pitfalls  of  the  other  method;  it  enables 
the  observer  to  go  beyond  the  formal  appearances 
that  exist  in  every  regime  and  get  to  the  true  sub¬ 
stance  of  power.  However,  it  is  much  more  labori¬ 
ous  than  the  positional  method,  and  the  quality  of 
its  results  obviously  depends  on  the  good  selection 
of  those  asked  to  nominate  the  elite  members. 
Reputations  can  also  be  inflated  and  may  not 
always  fully  correspond  to  the  reality  of  power.  As 
for  the  decisional  method,  it  is  based  on  the  idea 
that  power  is  best  evidenced  by  the  ability  to  take 
part  in  decisions  and  to  influence  them.  A  careful 
analysis  of  decision-making  processes  generally 
enables  us  to  identify,  with  sufficient  precision, 
those  who  play  a  leading  role.  The  main  problem 
with  this  method  is  that  the  number  of  decisions 
taken  in  any  political  system  is  high  and  scrutinizing 
all  of  them  (or  even  a  sample)  is  a  very  demanding 
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task.  In  the  end,  in  spite  of  some  of  its  weaknesses, 
the  positional  approach  has  in  fact  dominated  the 
empirical  studies  of  the  political  class,  particularly 
when  they  are  applied  on  a  large,  comparative,  and 
diachronic  scale.  The  other  methods  have  found 
greater  usefulness  for  studies  of  an  intensive  nature 
conducted  on  a  smaller  scale,  such  as  community 
studies  or  policy  studies. 

When  concerned  with  democratic  political  sys¬ 
tems,  empirical  studies  have  predominantly, 
although  not  exclusively,  concentrated  their  atten¬ 
tion  on  the  holders  of  positions  in  elected  assemblies. 
Because  of  their  seats  in  the  highest  democratic  insti¬ 
tution,  endowed  with  functions  of  representation, 
lawmaking,  and  executive  oversight,  and  because  of 
their  role  in  the  electoral  process,  members  of  par¬ 
liaments  are  quite  obviously  an  interesting  target 
for  the  study  of  the  political  class.  At  the  same 
time,  because  of  the  public  role  of  parliamentary 
institutions,  information  about  the  backgrounds, 
careers,  personal  features,  and  preferences  of  mem¬ 
bers  is  generally  quite  abundant  and  easy  to  exploit 
for  the  purpose  of  large-scale  studies.  As  a  conse¬ 
quence,  in  most  of  the  advanced  democracies,  sys¬ 
tematic  studies  of  legislators  have  been  conducted 
from  the  1960s  onward.  These  studies  have  typi¬ 
cally  explored  the  variables  pertaining  to  personal 
information  and  the  social  lives  of  politicians,  such 
as  age,  gender,  education,  professional  background, 
regional  origin,  and  religious  affiliation,  or  to 
aspects  of  their  political  professionalization,  such  as 
party  affiliation,  steps  in  the  local  and  national 
political  career,  seniority  in  office,  and  so  on 
(Robert  Putnam,  1976).  The  availability  of  a  rich 
series  of  comparative  and  longitudinal  studies  of 
members  of  parliament  has  enabled  us  to  discover 
the  existence  of  significant  variations  in  the  features 
of  the  members  of  the  political  class  across  coun¬ 
tries  and  political  parties  over  time.  The  empirical 
data  derived  from  these  studies  can  be  used  to 
identify  and  explore  the  factors  at  work  in  the 
recruitment  process  through  which  the  political 
class  is  reproduced  (Pippa  Norris,  1997)  as  well  as 
its  degree  of  stability  and  change.  In  a  more  general 
sense,  they  provide  useful  indicators  for  exploring 
the  transformations  of  political  representation  and 
of  the  relationships  between  citizens  and  rulers 
(Maurizio  Cotta  &  Heinrich  Best,  2007). 

Whereas  studies  adopting  what  we  may  call  the 
“input  perspective”  (who  are  the  members  of  the 


political  class  and  to  what  extent  do  they  represent/ 
mirror  society)  have  been  numerous  and  system¬ 
atic,  at  least  for  established  democracies,  the 
empirical  coverage  of  the  “output  perspective” 
(what  they  think  and  how  they  behave)  has  been 
much  less  complete.  The  attitudes  and  preferences 
of  elected  politicians  have  been  the  object  of  empir¬ 
ical  research  but  typically  on  a  single-country  basis 
and  without  longitudinal  continuity:  Such  studies 
are  obviously  more  costly  as  they  require  inter¬ 
views  or  content  analysis.  Studies  systematically 
comparing  attitudes  across  countries  are  much 
rarer.  The  same  can  be  said  about  the  attempts  to 
explicitly  study  the  output  side  and  to  analyze  the 
consequences  in  the  field  of  policy  making  of  the 
variable  features  of  the  parliamentary  elites.  At  this 
level,  our  knowledge  is  still  extremely  limited,  but 
some  interesting  advances  have  been  made  recently 
with  regard  to  the  effects  of  the  presence  of  women 
in  parliamentary  assemblies  on  policy  making. 

Empirical  studies  have  also  explored  the  fea¬ 
tures  of  other  groups  that  may  be  legitimately 
included  in  the  concept  of  political  class,  for 
instance,  party  leaders  and  party  cadres  that  do 
not  have  parliamentary  positions  or  the  high  strata 
of  public  bureaucracies  (Joel  Aberbach,  Robert 
Putnam,  &  Bert  Rockman,  1981).  Such  studies, 
however,  have  typically  been  less  comprehensive 
than  those  dealing  with  parliamentarians  as  the 
information  about  these  categories  is  much  less 
handy  and  systematic. 

Conclusion 

A  comprehensive  evaluation  of  the  impact  of  the 
concept  and  theory  of  the  political  class  on  the 
study  of  politics  would  require  specific  and  system¬ 
atic  research,  which  is  yet  to  be  done.  Some  tenta¬ 
tive  remarks  can,  however,  be  advanced  here. 
There  are  not  many  doubts  that  the  basic  idea  that 
“the  men  [and  increasingly  the  women]  at  the  top” 
are  a  crucial  object  of  study  for  anyone  who  wants 
to  understand  the  functioning  of  political  life  has 
been  a  milestone  in  political  science.  And  the  same 
can  be  said  about  the  contention  that  we  cannot  be 
happy  with  the  study  of  the  legal  definition  of  gov¬ 
erning  institutions  alone  and  that  we  must  look  at 
the  individuals  who  make  them  work.  In  this 
sense,  the  classical  theorists  of  the  political  class 
have  anticipated  some  of  what,  half  a  century  later, 
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has  been  among  the  central  components  of  the 
“manifesto”  of  the  behavioral  revolution  of  the 
1950s.  The  aspiration  of  Mosca  and  Pareto  to 
build  a  general  theory  of  politics  based  on  the  con¬ 
cept  of  political  class  has,  however,  proven  less 
easy  to  attain  than  expected.  To  move  from  the 
high  level  of  generality  of  the  basic  proposition 
that  the  “organized  few”  are  able  to  prevail  over 
the  “nonorganized  many”  to  a  set  of  intercon¬ 
nected  propositions,  establishing  the  relationships 
between  the  features  of  the  political  class  and  the 
different  models  of  political  life,  has  proven  much 
more  exacting  than  anticipated.  We  do  not  have 
yet  a  general  empirical  theory  of  politics  based  on 
this  idea.  However,  thanks  to  this  stimulus,  we 
have  gained  a  much  wider  knowledge  about  who 
the  members  of  the  political  class  are,  where  they 
come  from,  how  they  are  recruited,  what  their 
views  and  ambitions  are,  and  how  they  behave.  We 
have  also  gained  a  much  better  view  of  the  degree 
of  variation  in  the  basic  features  of  the  political 
classes  across  different  periods,  countries,  and 
regimes.  We  know  also  a  lot  more  about  the  inter¬ 
nal  dynamics  of  pluralist  political  classes.  Moreover, 
important  elements  of  a  theory  of  the  political  class 
have  been  incorporated  into  the  predominant 
interpretation  of  contemporary  democracy  as  a 
regime  based  on  competition  among  the  elites. 

Finally,  and  with  regard  to  more  recent  theo¬ 
retical  developments  in  political  science,  it  is  not 
completely  unwarranted  to  draw  some  significant 
connections  between  the  theory  of  the  political 
class  and  the  rational  choice  approach,  which  in 
the  past  decades  has  acquired  an  increasing  impor¬ 
tance  in  political  science.  This  may  not  be  immedi¬ 
ately  clear,  but  a  closer  look  at  some  of  the  crucial 
aspects  of  the  rational  choice  theory  can  show 
some  of  the  connections  between  this  more  recent 
development  in  political  science  and  the  “old  the¬ 
ory”  of  the  political  class.  On  a  rather  general  level, 
we  can  observe  that  in  both  cases  there  is  the  ambi¬ 
tion  of  building  an  encompassing  theoretical  para¬ 
digm  for  the  understanding  of  political  life. 
Moreover,  rational  choice  also  prescribes  that  we 
should  probe  beyond  the  formal  crust  of  political 
entities  to  analyze  the  motives  and  objectives  of  the 
concrete  individuals  existing  behind  them.  On  a 
more  specific  point,  we  can  add  that  rational  choice 
theory  has  in  fact  found  its  greatest  successes  when 
applied  to  the  sphere  of  politicians,  of  those  who 


live  for  politics — that  is,  the  political  class — rather 
than  for  the  common  people.  This  has  to  do  with 
the  basic  assumption  on  which  this  theory  has  been 
built,  that  of  rational  action:  The  general  (but  in 
itself  empty)  principle  that  the  behavior  of  indi¬ 
viduals  is  guided  by  the  choice  of  the  means  that 
will  maximize  utility  becomes  much  more  relevant 
when  the  preferences  (and  the  goals)  of  the  indi¬ 
viduals  can  be  well  specified  and  when  it  is  possible 
to  assume  that  the  individuals  will  have  a  sufficient 
knowledge  of  the  situation  (i.e.,  of  the  constraints, 
the  means  that  can  produce  the  expected  ends, 
etc.).  This  is  clearly  a  much  more  realistic  descrip¬ 
tion  of  the  situation  of  the  political  class  (i.e.,  of  the 
most  professionalized  section  of  the  political  sys¬ 
tem)  than  that  of  the  great  mass  of  other  individu¬ 
als  who  devote  much  more  limited  attention  to  the 
political  processes  and  have  much  lower  levels  of 
purposiveness  in  their  actions.  The  theory  of  ratio¬ 
nal  choice  is  then  particularly  interesting  for  inter¬ 
preting  the  games  the  political  class  plays:  the 
coalition-making  games  in  parliamentary  govern¬ 
ments  with  multiparty  systems  and  the  lawmaking 
games  within  legislatures  or,  on  the  international 
scene,  the  strategic  games  between  countries.  If  this 
is  true,  the  relevance  of  the  theory  of  the  political 
class  is  still  alive. 
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It  is  not  easy  to  define  the  boundaries  of  the  field  of 
political  communication.  The  existing  literature 
tries  to  limit  the  field  as  being  characterized  by  the 
interaction  of  three  main  actors:  political  actors, 
the  mass  media,  and  the  public.  A  fourth  dimension 
is  usually  added  to  better  define  the  field:  The  sub¬ 
jects  of  the  exchange  that  takes  place  in  political 
communication  are  related  to  matters  of  general 
interest.  But  even  with  this  last  addition,  the  field 
remains  uncertain  and  difficult  to  be  captured 
within  a  single  definition.  Indeed,  all  the  different 
dimensions  that  are  used  remain  vague  and  ambig¬ 
uous  and  may  be  the  subj  ects  of  different  approaches 
and  theoretical  fields:  How  do  we  define  political 
actors?  Who  are  they?  Are  union  leaders  political 
actors?  And  what  about  interest  group  spokesper¬ 
sons?  Are  they  political  actors?  Does  what  these 
actors  accomplish  concern  the  field  of  political 
communication?  Or  does  it  belong  to  the  field  of 
public  relations?  Similar  questions  may  be  raised 
with  regard  to  each  of  the  other  defining  dimen¬ 
sions.  This  entry  discusses  the  development  of 


political  communication  as  a  subfield  of  political 
science,  its  relationship  to  neighboring  disciplines, 
and  recent  tendencies  with  regard  to  technological 
changes  and  their  impact  on  different  spheres  of 
political  life. 

Because  of  the  vagueness  of  the  dimensions  that 
have  been  mentioned  above,  very  often  political 
communication  is  defined,  even  if  with  different 
words  in  relation  to  the  specific  approaches  that 
are  chosen,  as  a  field  placed  at  the  intersection  of 
the  four  dimensions  just  listed:  That  is,  political 
communication  regards  the  exchange  of  contents 
on  matters  of  public  interest  that  involve  citizens, 
social  actors,  and  the  mass  media.  The  components 
of  this  tentative  definition  are  not  self-exclusive — 
that  is,  even  if  one  of  these  components  is  missing, 
we  can  still  be  facing  “a  political  communication 
event.  ”  Citizens  can  be  absent  in  a  particular  event, 
and  notwithstanding  that,  media  and  political 
actors  may  interact  on  issues  of  general  interest. 
Similarly,  political  actors  may  be  involved  in  an 
exchange  with  citizens  on  issues  of  general  interest, 
but  this  interaction  may  take  place  without  the 
mass  media. 

In  spite  of  its  general  character  that,  as  we  shall 
see  later,  leaves  the  field  open  to  different  interpre¬ 
tations  that  may  include  a  plurality  of  social  events 
and  situations,  this  definition  is  marked  by  a  char¬ 
acteristic  that  seems  to  be  more  important  than 
that  of  the  others:  Political  communication  deals 
with  issues  of  general  interest  or  at  least  with  issues 
that  may  gain  some  sort  of  relevance  in  public 
interest.  It  may  be  too  easy  to  state  that  political 
communication  deals  with  “ruling  the  commu¬ 
nity,”  but  this  seems  to  be  its  inner  nature.  Affairs 
of  general  interest  can  be  treated  by  different 
actors  and  in  different  ways  so  that  the  other  defin¬ 
ing  components  may  be  useful  as  they  help  restrict 
and  define  better  the  field  of  pertinence. 

The  Nerves  of  Government  is  the  title  of  a 
seminal  work  by  Karl  Deutsch  published  in  1963. 
This  book  clearly  shows  the  importance  of  com¬ 
munication  in  government,  not  just  as  an  expres¬ 
sion  of  conflicting  interests  but  as  an  essential 
instrument  to  rule  the  community,  to  produce  an 
informed  citizenry,  and  to  reach  a  higher  level  of 
social  efficiency.  Even  if  some  scholars  do  not  see 
this  book  as  such,  The  Nerves  of  Government  can 
be  viewed  as  a  text  of  political  communication  that 
without  any  doubt  demonstrates  the  narrow  links 
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existing  between  political  communication  and 
themes  of  general  interest:  Ruling  the  community 
implies  necessarily  a  continuous  activity  of  com¬ 
munication  either  when  it  implies  conflict  or  when 
it  is  aimed  at  a  better  level  of  social  efficiency. 
Government  needs  an  appropriate  communication 
strategy  that  may  help  in  solving  the  problems  of 
the  society.  The  Nerves  of  Government  may  have 
a  dubious  placement  within  the  field  of  political 
communication  as  the  matters  it  deals  with  do  not 
imply  conflict.  Indeed,  for  most  scholars,  political 
communication  presents  another  important  fea¬ 
ture:  It  deals  with  controversial  issues  about  which 
there  are  different  and  conflicting  views  and  posi¬ 
tions.  For  some  scholars,  this  is  what  differentiates 
political  communication  from  other  forms  of  com¬ 
munication  and  specifically  from  what  is  defined 
to  be  “public  communication”:  Political  communi¬ 
cation  deals  with  controversial  issues  about  which 
conflicting  views  exist,  whereas  public  or  institu¬ 
tional  communication  regards  issues  about  which 
there  is  a  wider  general  agreement. 

To  conclude  this  defining  attempt,  political  com¬ 
munication  is  about  power.  It  deals  with  the  distri¬ 
bution  of  power  among  different  competing  views, 
actors,  and  interests.  Power  is  what  mostly  differen¬ 
tiates  political  communication  from  other  forms  of 
communications:  Advertising  is  about  goods  and 
audience,  journalism  is  about  news  (even  if  with 
many  overlaps  with  political  communication),  and 
so  on.  Political  communication  is  about  power  in 
society.  The  narrow  relationship  of  political  com¬ 
munication  with  power  implies  either  a  struggle  or, 
as  happens  mostly  in  democracies,  some  sort  of 
negotiation  and  reciprocal  adjustment  between 
competing  needs,  requests,  and  interests.  Therefore, 
political  communication  always  implies  an  exchange: 
Either  it  prepares  the  ground  for  possible  negotia¬ 
tions  or  its  final  product  is  an  agreement  that  comes 
out  of  a  process  of  negotiation  that  takes  place 
through  different  means  of  communication. 

This  brief  discussion  highlights  one  main  point: 
Political  communication  is  a  very  broad  field  that 
can  be  dealt  with  from  different  points  of  view  and 
that  may  involve  different  aspects.  From  a  schol¬ 
arly  point  of  view  (i.e.,  from  the  point  of  view  of 
those  who  study  political  communication),  there 
are  three  main  areas  within  the  broader  field:  com¬ 
munication  processes,  structural  tendencies,  and 
production  aspects.  The  area  of  communication 


processes  represents  the  most  traditional  and  deeply 
rooted  area  as  it  deals  with  the  analysis  of  commu¬ 
nicative  exchanges.  Indeed,  it  is  possible  to  include 
here  all  those  exchanges  that  take  place  between 
the  three  actors  mentioned  at  the  beginning:  politi¬ 
cal  actors,  citizens,  and  mass  media,  even  if  all  of 
them  do  not  necessarilyhave  to  be  included  at  the 
same  time  in  each  exchange  situation. 

The  result  of  each  communication  event  may 
influence  the  allocation  of  power  in  society,  and 
this  is  why  this  specific  area  has  always  been  the 
core  of  every  political  communication  approach. 
Indeed,  it  is  possible  to  include  within  this  area  all 
the  most  prominent  aspects  of  the  field,  from 
political  socialization  to  election  campaigns,  from 
persuasion  to  rhetoric,  and  from  interpersonal 
communication  to  chat  groups  on  the  Internet.  In 
particular,  two  main  domains  seem  to  constitute 
the  core  of  this  area:  (1)  the  study  of  the  content 
and  (2)  the  study  of  the  effects.  Both  these  domains 
have  featured  most  prominently  in  political  com¬ 
munication  research  since  its  inception  following 
different  approaches:  The  forms,  structures,  and 
content  of  political  communication  have  been  the 
main  subjects  of  many  scholars.  The  research  by 
Harold  Lasswell  on  the  symbols  and  the  language 
of  politics  has  strongly  contributed  to  establish  the 
field  of  political  science  as  such.  More  recently, 
Murray  Edelman  has  further  developed  this  kind 
of  research  on  the  importance  of  symbols  in  poli¬ 
tics,  while  other  scholars  have  focused  their  inter¬ 
est  on  quantitative  content  analysis  of  political 
messages. 

The  study  of  the  effects  of  messages  has  occa¬ 
sioned  the  establishment  of  a  new  discipline: 
communication  studies.  The  effects  of  political 
communication  were  among  the  fields  most  cov¬ 
ered  by  those  who  contributed  to  establishing  the 
new  discipline:  Paul  Lazarsfeld,  for  example,  is  a 
prominent  figure  in  the  history  of  the  social  sci¬ 
ences.  His  works  largely  focused  on  political  com¬ 
munication:  The  People’s  Choice,  written  with 
Bernard  Berelson  and  Hazel  Gaudet  in  1944,  and 
Personal  Influence,  written  with  Elihu  Katz  in 
1995,  are  milestones  not  just  in  political  communi¬ 
cation  research  but  in  media  studies  and  sociology 
in  general.  In  the  first  book,  Lazarsfeld,  Berelson, 
and  Gaudet  point  out  the  essentially  confirmatory 
role  that  mass  communication  has  on  voters’ 
behavior,  while  the  second  book  contributed  to 
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overcoming  the  previous  view  of  the  so-called 
powerful  media,  pointing  out  how  the  mass  media 
were  acting  in  a  more  complicated  way  than  was 
previously  thought:  The  message  does  not  reach 
the  citizen  directly  but  involves  other  figures  as 
well,  in  particular  the  so-called  opinion  leaders. 

Under  the  label  structural  tendencies ,  it  is  pos¬ 
sible  to  list  all  the  research  that  focuses  on  the 
changes  that  political  communication  is  undergo¬ 
ing.  The  main  tendency  appears  to  be  toward  the 
so-called  mediatization  of  politics:  The  logic  of 
the  mass  media  strongly  affects  the  structures  and 
the  discourse  of  politics  in  a  way  that  is  even 
changing  political  participation.  In  the  view  of 
Gianpetro  Mazzoleni  and  Winfried  Schultz,  medi¬ 
atized  politics  is  politics  that  has  lost  its  autonomy, 
has  become  dependent  in  its  central  functions  on 
mass  media,  and  is  continuously  shaped  by  inter¬ 
actions  with  mass  media. 

In  the  most  recent  period,  scholars  have  stressed 
different  evolutions  related  to  the  mediatization  of 
politics:  Political  communication  is  changing 
because  a  process  of  professionalization  is  taking 
place.  This  is  a  very  dramatic  change  because  it 
stresses  also  an  even  more  important  change  in  the 
way  in  which  social  interests  are  organized  and 
represented.  Indeed,  professionalization  of  political 
communication  is  taking  place  as  the  traditional 
mass  parties  are  weakening  and  almost  disappear¬ 
ing  in  some  cases:  Politics  itself  is  changing. 
Political  parties  often  are  no  longer  able  to  support 
candidates  during  election  campaigns,  and  they  are 
not  linked  any  more  to  citizens/voters  through  the 
“affiliation  vote,”  which  established  solid  and 
long-lasting  channels  of  communication  between 
mass  parties  and  citizens.  Angelo  Panebianco  sum¬ 
marized  this  shift  under  the  label  “electoral- 
professional  party”:  Progressively,  the  most  tradi¬ 
tional,  Western-type  mass  parties  were  losing 
ground  in  the  face  of  a  new  form  of  political  aggre¬ 
gation,  the  electoral-professional  party.  The  inter¬ 
vention  of  professionals  particularly  skilled  in 
communication  and  polling  is  required  to  reach  the 
citizen/voter,  who  acts  and  decides  to  no  longer  be 
bound  to  mass  parties  through  stable  links  but  to 
essentially  be  placed  in  a  situation  that  many  define 
as  a  “market,”  in  which  the  traditional  instruments 
of  the  market  are  required  and  used.  Marketing 
strategies  and  professionals  able  to  apply  these 
strategies  are  required. 


Other  tendencies  emerge  as  well.  First  of  all, 
there  is  general  agreement  about  the  increasing  per¬ 
sonalization  of  politics.  Individual  politicians  are 
taking  the  place  so  far  occupied  by  political  parties. 
This  tendency,  too,  highlights  the  process  of  trans¬ 
formation  of  mass  parties:  Their  weakening  leaves 
room  for  the  personalization  of  politics,  which  is 
enhanced  also  by  the  increasing  role  of  the  mass 
media,  and  particularly  of  television,  as  agents  of 
political  socialization.  This  process  of  personaliza¬ 
tion  of  politics  is  said  to  be  a  part  of  the  so-called 
Americanization  of  politics,  which  affects  different 
political  communication  activities  linked  to  the 
electoral  choice.  Together  with  the  process  of  per¬ 
sonalization  of  politics,  many  scholars  have  devoted 
their  attention  to  a  similar  process  taking  place  in 
the  equilibrium  of  political  power:  The  process  of 
presidentialization  highlights  how  powers  are  shift¬ 
ing  from  parliament  and  government  to  the  single 
figure  of  the  president.  Beyond  the  reasons  related 
to  the  structural  transformations  in  government, 
many  scholars  stress  the  increasing  role  of  the  mass 
media  that  allows  the  president  to  talk  and  interact 
directly  with  the  people,  which,  in  turn,  focuses  the 
main  attention  of  the  mass  media  on  the  president. 

Production  aspects  include  all  research  that 
looks  at  how  political  communication  is  conceived 
and  produced.  Production  aspects  concern  mainly, 
but  not  exclusively,  the  area  of  journalism  and  its 
relationship  with  the  political  system.  News  gath¬ 
ering  and  news  processing  are  important  moments 
in  the  production  of  political  news:  the  moment  of 
the  interaction  with  sources  of  information  (news 
gathering)  and  the  moment  in  which  news  items 
are  selected  and  become  stories  for  the  print  and 
electronic  media  (news  processing). 

Many  other  studies  have  been  conducted  on 
how  election  campaigns  are  run:  which  communi¬ 
cation  strategies  are  adopted,  how  decisions  are 
taken,  which  kinds  of  professionals  are  involved, 
and  how  the  messages  are  decided,  conceived,  and 
passed  on  to  the  public.  Much  of  this  research  has 
a  very  important  comparative  focus,  trying  to 
stress  the  differences  and  similarities  among  coun¬ 
tries.  In  this  area  as  well,  Americanization  seems  to 
be  a  very  widespread  interpretive  hypothesis:  Elec¬ 
tion  campaigns  worldwide  seem  to  have  become 
more  and  more  similar  as  they  are  applying  all  the 
strategies  and  procedures  already  used  in  the 
United  States. 
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The  Approaches 

From  what  has  been  said  so  far,  it  may  appear 
clear  that  political  communication  research  repre¬ 
sents  an  interdisciplinary  endeavor  as  it  deals  with 
different  areas  of  society  and  with  more  general 
processes  of  change.  Since  its  beginnings,  scholars 
from  different  disciplines  had  devoted  their  atten¬ 
tion  to  the  field  that  today  is  called  political  com¬ 
munication.  Indeed,  the  first  look  at  this  field  was 
by  psychologists  and  students  of  propaganda.  The 
seminal  study  by  Sergei  Tchackhotine  originally 
published  in  1938  may  be  considered  as  the  first 
work  entirely  devoted  to  the  analysis  of  propa¬ 
ganda  techniques;  it  was  strongly  affected  by 
behaviorism,  an  approach  that  in  those  years  was 
influencing  the  entire  field  of  psychology.  Many 
works  by  other  psychologists  were  then  devoted  to 
what  in  those  years  was  not  yet  called  political 
communication.  The  propaganda  label  was  indeed 
coming  from  the  perception,  very  diffuse  in  those 
years,  of  the  enormous  power  of  the  new  propa¬ 
ganda  techniques  that  political  structures  and  sys¬ 
tems  in  different  countries  were  developing.  This 
perception  was  the  fruit  of  both  the  behaviorist 
approach  that  was  used  to  observe  this  phenome¬ 
non  (which  stressed  the  causal  relationship  between 
a  single  stimulus  and  its  consequence)  and  the 
experience  that  the  entire  world  was  passing 
through  in  those  years.  Indeed,  the  works  by 
Tchackhotine  and  those  by  many  other  scholars 
were  devoting  a  lot  of  attention  to  the  propaganda 
machines  of  Soviet  Russia  and  Nazi  Germany.  To 
many  observers,  these  experiences  seemed  to  stress 
the  enormous  power  of  persuasion  reached  through 
the  use  of  new  techniques  such  as  cinema  and 
radio,  with  the  support  of  suggestions  coming 
from  developments  in  psychology. 

In  those  first  years,  persuasion  and  propaganda 
were  the  catchall  words  used  to  designate  what 
today  is  called  political  communication.  Of  course, 
there  were  differences  both  in  the  way  in  which 
these  phenomena  were  observed  and  studied  and 
in  the  way  the  communication  process  itself  was 
taking  place.  Psychologists  were  among  the  first 
scholars  to  study  how  political  power  was  trying 
to  influence  the  life  of  citizens,  and  for  many  years, 
psychology  was  the  main  approach  employed  to 
observe  communication  processes  in  government 
and  politics.  The  sociological  approach  to  political 
communication  was  born  out  of  psychology.  As 


already  mentioned,  the  founding  fathers  of  the 
discipline  themselves  were  deeply  affected  by  psy¬ 
chology:  Paul  Lazarsfeld  in  his  first  works  was 
influenced  by  the  works  of  the  psychologists 
Harold  Lasswell  and  Carl  Howland.  All  of  them 
devoted  great  attention  to  political  communication 
while  studying  other  forms  of  communication  as 
well.  Both  psychology  and  sociology  look  at  the 
interactions  between  single  persons  and  between 
persons  and  means  of  communication,  but  from 
different  points  of  view:  Psychology  is  interested  in 
looking  at  the  inner  motives  that  push  single  per¬ 
sons  and  groups  to  react  in  a  particular  way  to  the 
message;  sociology  gives  to  these  processes  a  more 
cumulative  view,  looking  for  the  variables  existing 
in  the  social  structure  that  may  affect  a  particular 
reaction.  Moreover,  sociology  looks  at  the  conse¬ 
quences  and  the  transformations  produced  by 
political  communication  in  the  society  at  large. 

Political  scientists,  too,  devote  great  attention  to 
political  communication.  In  a  way,  it  can  be  said 
that  the  field  of  political  communication  research 
itself,  as  we  know  it  today,  was  established  by 
political  scientists.  Indeed,  their  focus  has  been  on 
the  relationship  between  political  institutions  and 
communication:  They  have  been  interested  in  the 
consequences  produced  by  political  communica¬ 
tion  on  political  institutions  and  in  the  way  in 
which  political  institutions  communicate  and  orga¬ 
nize  to  communicate.  While  sociologists  studying 
political  communication  look  at  variables  derived 
from  the  observation  of  society  at  large,  including 
political  variables,  political  scientists  give  commu¬ 
nication  a  more  focused  attention:  They  look 
essentially  at  the  way  in  which  communication 
affects  the  distribution  of  power  in  society. 

Semiotics  and  linguistics  also  have  paid  atten¬ 
tion  to  political  communication.  In  particular, 
both  these  approaches  have  been  interested  in 
looking  at  the  content  and  structure  of  political 
messages,  developing  specific  methods,  such  as 
discourse  analysis,  aimed  at  the  analysis  of  these 
kinds  of  messages.  While  both  these  approaches 
have  been  able  to  highlight  specific  features  of 
political  communication,  their  interaction  with 
other  scientific  domains  has  been  limited  in  spite  of 
the  fact  that  their  results  could  be  very  useful  to 
integrate  and  complete  the  outcome  of  other 
approaches.  Indeed,  political  communication  is 
undoubtedly  an  interdisciplinary  field  placed  at  the 
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intersection  of  different  social  forces  and  activities 
producing  consequences  that  relate  to  different 
scientific  domains. 

Three  Phases  of  Political  Communication 

Political  communication  has  dramatically  changed 
over  the  years.  This  change  has  become  even  more 
rapid  following  the  introduction  of  new  technolo¬ 
gies.  In  1999,  Jay  Blunder  and  Dennis  Kavanagh 
proposed  a  distinction  between  three  main  phases 
in  the  evolution  of  political  communication  that  has 
been  adopted  by  many  other  scholars  and  that  can 
also  illustrate  the  more  recent  tendencies  in  the  field 
even  if  the  original  Blunder  and  Kavanagh  formula¬ 
tion  was  not  able  to  take  into  account  the  dramatic 
and  fast  changes  created  by  the  Internet.  The  first 
phase  covers  the  first  2  decades  after  World  War  II, 
which  many  political  scientists  also  define  as  “the 
golden  age  of  mass  parties.”  Indeed,  this  period  was 
characterized  by  the  important  role  played  by  mass 
parties:  These  were  organizational  machines  with 
thousands  of  employees  and  activists,  with  offices 
spread  all  over  the  national  territory.  These  organi¬ 
zations  also  performed  important  communication 
functions:  publishing  newspapers,  organizing  meet¬ 
ings  and  demonstrations,  and  carrying  out  general 
networking.  During  this  period,  interpersonal 
communication  was  the  main  way  to  reach  citizens 
and  party  activists,  and  party  employees  played  the 
main  role  in  diffusing  the  ideas  of  political  parties 
and  in  mobilizing  consensus.  Even  if  other  means 
of  communication  (e.g.,  radio  or  newspapers,  very 
often  bound  to  the  party  organizations  themselves) 
existed,  nevertheless,  local  circuits  of  relationships 
were  very  important  and  interpersonal  relations 
represented  the  main  channel  to  reach  citizens  who 
were  linked  to  party  organizations  by  deep-rooted 
and  well-established  ideological  and  cultural  affili¬ 
ations.  Party  offices  and  meetings  and  organiza¬ 
tions  in  charge  of  a  wide  range  of  activities,  from 
sports  to  culture  and  from  leisure  organizations  to 
educational  structures,  represented  the  occasions 
for  establishing  and  reinforcing  networks  that  were 
mostly  based  on  interpersonal  knowledge  and  rec¬ 
ognition.  In  these  instances,  communication  was 
mostly  aimed  at  reinforcing  the  existing  links  and 
opinions  as  voters  rarely  changed  their  electoral 
decisions.  Messages  had  little  innovative  content: 
They  were  limited  to  repeating  what  citizens 


already  knew  and  shared;  they  also  attacked  and 
painted  in  the  worst  possible  manner  the  competi¬ 
tors,  reinforcing  the  already  existing  feelings  and 
opinions. 

In  spite  of  the  widespread  use  of  what  today  is 
called  “negative  campaigning”  and  determined  by 
this  continuous  attempt  to  reinforce  preexisting 
feelings,  political  institutions,  both  government 
and  political  parties,  enjoyed  a  high  level  of  confi¬ 
dence  and  trust.  They  were  not  just  important 
organizational  machines  but  also  represented  trust¬ 
worthy  organizations  to  which  citizens  turned  for 
support  in  a  plurality  of  different  situations.  Voter 
volatility  was  limited,  and  ideological  changes  were 
not  very  frequent:  Political  values  were  quite  stable 
and  well  rooted  in  political  imagery. 

Just  as  the  first  phase  has  been  termed  the  golden 
age  of  mass  parties,  the  second  one  may  be  called 
“the  age  of  television.”  While  progressively  the 
mass  media  have  replaced  interpersonal  communi¬ 
cation  as  the  principal  sources  of  political  socializa¬ 
tion,  television  has  achieved  prominence  among 
these  sources.  Structures  and  forms  of  political  com¬ 
munication  change  dramatically  because  society  is 
changing  as  well:  The  process  of  secularization  and 
greater  social  mobility  weaken  the  role  of  mass  par¬ 
ties  that  transfer  to  the  mass  media,  and  particularly 
to  television,  the  communication  and  socialization 
functions  they  were  performing  previously.  This  has 
a  major  consequence:  The  existing  links  between 
citizens  and  party  organizations  lose  ground,  and 
therefore,  communication  strategies  that  up  to  that 
moment  were  basically  aimed  at  reinforcement 
change  direction.  Now,  the  main  aim  of  communi¬ 
cation  is  to  reach  and  persuade  voters  who  are  no 
longer  firmly  linked  to  a  particular  party.  The 
notion  of  a  voters’  market  enters  the  political  arena: 
Citizens  become  willing  to  buy,  among  a  number  of 
similar  products,  the  best  “political  offers”  in  the 
same  way  as  they  buy  goods.  Communication  strat¬ 
egies  follow  this  change:  Messages  are  essentially 
aimed  at  conquering  a  “buyer”  who  is  linked  to 
particular  information  sources. 

This  change  is  not  limited  to  elections,  but  the 
entire  political  life  and  government  activities  are 
affected  by  this  change;  the  intrusiveness  of  the 
mass  media  forces  many  organizations  to  be  better 
equipped  to  face  the  competition  from  other  insti¬ 
tutions  and  the  mass  media.  The  first  consequence 
is  the  process  of  professionalization  of  political 
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communication:  The  traditional  figures  of  political 
activists  and  party  bureaucrats  are  now  substituted 
by  professionals  who  are  trained  in  different  com¬ 
munication  branches.  Their  expertise  is  no  longer 
political;  their  links  with  political  and  government 
institutions  are  not  based  on  their  own  political 
preferences.  They  are  hired  because  they  possess 
those  skills  and  expertise  that  party  machines  are 
no  longer  able  to  offer  to  candidates  competing  for 
public  positions.  Government  and  public  institu¬ 
tions,  too,  look  for  professionals  to  support  their 
desire  to  win  public  support.  This  tendency  is 
greatly  enhanced  by  technological  developments 
that  make  possible  many  new  activities  but  that 
require  specific  skills  that  are  no  longer  provided 
by  traditional  political  organizations.  Words  such 
as  political  consultant,  pollster,  fund-raiser,  spin 
doctor,  and  so  on  become  common  in  political 
communication  as  political  parties  lose  ground 
and  single  candidates  need  to  be  supported  by  their 
own  electoral  machines.  Scholars  have  described 
this  as  a  shift  from  “labor-intensive  campaigns”  to 
“capital-intensive  campaigns”;  while  in  the  first 
phase,  the  support  of  party  members  and  activists 
was  essential  to  run  a  campaign,  in  the  second 
phase,  money  becomes  essential  to  pay  all  those 
professionals  who  are  able  to  get  the  best  possible 
result  out  of  new  technologies  and  media  commu¬ 
nication.  Most  important,  money  becomes  essen¬ 
tial  to  buy  airtime  for  political  advertisements. 

These  changes  do  not  concern  election  cam¬ 
paigns  alone,  but  the  entire  spectrum  of  political 
and  government  institutions  is  deeply  affected  by 
the  increasing  role  of  mass  media  and  particularly 
of  television.  At  the  same  time,  the  idea  of  “perma¬ 
nent  campaign”  emphasizes  that  the  new  media 
environment  does  not  allow  communication  strat¬ 
egies  to  be  limited  to  the  electoral  momentum: 
candidates,  political  and  government  institutions, 
and  public  administrations  are  placed  under  the 
control  of  mass  media  at  all  times  and  not  just 
when  they  are  competing  for  public  offices.  The 
continuous  flow  of  opinion  polls  also  represents  an 
instrument  through  which  all  those  who  are  inter¬ 
ested  in  gaining  public  support  can  gauge  how  they 
are  perceived  by  the  people  and  then  take  the 
appropriate  initiatives.  Permanent  campaigning 
indicates  that  support  building  is  a  continuous 
process  that  cannot  be  limited  just  to  specific 
moments  in  time,  such  as  elections. 


The  word  Americanization  has  often  been  used 
to  indicate  these  changes.  Americanization  can 
have  a  double  meaning:  On  the  one  hand,  it  indi¬ 
cates  that  these  changes  took  place  first  in  the 
American  political  scene  and  that  they  have  then 
been  adopted  in  other  social  contexts;  on  the  other 
hand,  it  indicates  that  there  is  some  kind  of  influ¬ 
ence  either  explicitly  or  implicitly  played  by  the 
United  States  in  exporting  their  cultural  and  polit¬ 
ical  practices  to  other  parts  of  the  world,  thus 
changing  local  cultures.  Many  studies  have  been 
carried  out  on  the  adoption  of  “American”  tech¬ 
niques  abroad  and  mostly  on  the  involvement  of 
American  political  consultants  in  other  parts  of  the 
world,  essentially  in  countries  that  are  culturally 
and  politically  close  to  the  United  States. 

Another  tendency  that  emerged  in  the  second 
phase  of  political  communication  and  that  becomes 
even  more  important  in  the  third  one  is  personal¬ 
ization.  The  progressive  weakening  of  mass  parties 
pushes  the  focus  of  political  communication 
toward  individual  politicians.  The  ideological  links 
that  existed  between  party  apparatuses  and  citi¬ 
zens  are  now  replaced  by  the  emotional  and  sym¬ 
pathetic  feelings  that  leaders  may  establish  with 
the  citizens.  The  entire  political  life  becomes  orga¬ 
nized  around  the  figures  of  individual  politicians: 
Political  communication  reinforces  this  tendency, 
which  moves  the  focus  of  the  political  debate  from 
ideology  and  programs  to  the  individual  features 
of  those  who  animate  public  life.  The  existing  bor¬ 
ders  between  private  and  public  life  become 
blurred,  and  the  electoral  decision  seems  to  be 
moved  essentially  by  personal  leadership  charac¬ 
teristics.  Political  candidates  are  no  longer  judged 
on  the  basis  of  their  political  competencies  and 
skills  and  on  their  leadership  in  running  the  gov¬ 
ernment  but,  essentially,  on  their  ability  to  meet 
the  requirements  of  television  discourse. 

There  are  different  reasons  for  this  change: 
Institutionally,  personalization  is  fostered  by  the 
changes  taking  place  in  electoral  systems  that  in 
many  countries  place  political  figures  at  the  center 
of  electoral  choice.  For  many  scholars,  mass  com¬ 
munication  plays  a  major  role  in  this  shift:  The 
development  of  televised  political  advertisements 
makes  it  possible  for  individual  candidates  to  enter 
the  electoral  arena  even  without  the  support  of 
political  parties.  Candidates  with  their  own  money 
or  with  money  coming  from  their  supporters  can 
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buy  directly  from  the  networks  the  airtime  that  is 
necessary  to  reach  the  voters,  and  in  this  way,  they 
can  talk  directly  to  them  without  the  intermedia¬ 
tion  of  party  organizations.  In  many  cases,  candi¬ 
dates  are  even  competing  against  the  choices  made 
by  their  own  parties.  In  this  way,  the  role  of 
political  parties  is  undermined,  and  the  road 
toward  personalization  is  fostered.  The  market- 
driven  logic  that  moves  the  mass  media  system  is 
another  reason  for  the  personalization  of  politics, 
as  they  are  able  to  reach  a  wider  audience. 
Personalization  represents  another  component  of 
the  Americanization  of  politics  and  more  precisely 
the  Americanization  of  political  communication. 
Indeed,  while  in  Europe  political  parties  still  play  a 
major  role  in  influencing  citizens’  behavior  and 
choice,  weak  political  parties  are  a  particular  fea¬ 
ture  of  the  American  political  system,  influencing 
the  evolution  of  other  political  systems  worldwide. 
All  communication  strategies  are  built  around 
single  political  figures  in  a  way  that  was  first  expe¬ 
rienced  in  the  United  States. 

For  Blunder  and  Kavanagh,  the  third  phase  of 
political  communication  is  characterized  by  the 
fragmentation  of  both  the  means  of  communica¬ 
tion  and  the  audiences.  First  of  all,  television  offers 
many  more  channels:  Everywhere,  satellite  TV, 
cable  TV,  and  other  forms  of  television  come 
alongside  the  traditional  terrestrial  networks. 
There  is  also  an  increase  in  the  television  formats 
within  which  political  communication  takes  place: 
Talk  shows,  infotainment,  and  docudramas  offer 
politicians  new  opportunities  to  address  their  audi¬ 
ence.  For  politicians,  it  is  just  a  matter  of  choosing 
in  which  transmissions  to  appear,  but  at  the  same 
time,  the  opportunity  to  exercise  control  over 
other  politicians  also  increases.  It  is  essentially  the 
development  of  the  Internet  that  characterizes  the 
third  phase  and  that  offers  citizens  many  more 
opportunities  to  receive  news  and  get  in  touch  with 
other  people.  The  increase  in  the  number  of 
sources  of  information  determines  the  fragmenta¬ 
tion  of  the  audience  with  the  development  of  dif¬ 
ferent  “publics”  either  interested  in  specific  topics, 
attitudes,  and  problems  or  particularly  linked  to 
specific  sources  of  information.  The  idea  and  the 
practice  of  a  coherent  mass  audience  that  already 
during  the  second  phase  had  undergone  a  process 
of  fragmentation  into  different  target  groups  are 
now  almost  completely  disappearing:  There  are 


many  publics  in  relationship  to  the  increasing 
number  of  communication  outlets.  Each  of  these 
has  different  messages,  producing  an  even  more 
dramatic  volatility  of  party  affiliation  and  ideo¬ 
logical  and  symbolical  constructions. 

Is  Everything  Alright  With  Political 
Communication  Today? 

In  liberal  theory,  the  major  goal  of  political  com¬ 
munication  is  to  construct  an  informed  citizenry 
that  is  able  to  consciously  take  the  decisions 
regarding  the  ruling  of  the  community.  Is  political 
communication  able  to  accomplish  this  task? 
Political  communication  research  demonstrates 
that  there  exist  several  problems  in  reaching  this 
goal.  An  initial  problem  arises  from  the  abundance 
of  sources  of  information:  There  are  too  many 
messages  in  circulation  that  compete  with  each 
other  to  gain  the  attention  of  the  citizens.  In  many 
cases,  the  consequence  is  a  “lost”  citizen  unable  to 
select  among  the  plurality  of  messages  that  he  or 
she  gets  every  day.  Very  frequently,  these  messages 
are  contradictory;  they  do  not  just  compete  with 
each  other  as  is  necessary  in  a  democracy.  Citizens 
are  lost  and  confused  because  they  are  flooded 
with  a  very  large  number  of  messages  from  which 
they  are  not  able  to  select  the  proper  one  or  judge 
their  reliability.  The  problem  becomes  even  more 
serious  with  the  Internet,  which  increases  the  num¬ 
ber  of  sources  of  information  whose  reliability  is 
even  less  verifiable. 

To  be  noticed  among  this  enormous  quantity, 
each  message  tends  to  be  increasingly  simple  and,  at 
the  same  time,  emotional:  Amusing  Ourselves  to 
Death  was  the  title  of  a  book  by  Neil  Postman  that 
has  become  one  of  the  most  important  reference 
books  in  all  communication  sciences.  Even  if 
Postman  was  not  dealing  just  with  political  com¬ 
munication,  his  book  gave  a  very  valuable  insight 
on  how  the  tendency  toward  entertainment,  derived 
from  the  increasingly  important  role  of  the  mass 
media,  and  particularly  television,  was  changing  the 
entire  society.  Following  this  line,  other  scholars 
have  stressed  the  tendency  toward  dramatization 
and  spectacularization  of  political  communication. 
Citizens  are  transformed  into  spectators:  Progres¬ 
sively  they  lose  any  active  role,  instead  being  trans¬ 
formed  into  passive  spectators  of  something  staged 
in  front  of  them  by  political  competitors.  Political 
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argumentation  becomes  simpler,  shorter,  and  more 
and  more  filled  with  emotional  discourse  now 
made  available  by  the  electronic  media. 

Negative  campaigning  is  another  tendency  that 
in  the  view  of  many  scholars  undermines  the  role 
of  political  communication  in  fostering  democracy. 
Very  often,  political  competitors  find  it  much  easier 
to  denigrate  the  opponent  than  to  propose  pro¬ 
grams  or  solutions  for  existing  problems.  Denigra¬ 
tion  is  not  just  more  emotionally  involving,  but  it 
can  catch  the  attention  of  the  news  media  very 
easily  and  therefore  get  a  wider  attention  from  the 
general  public.  Tom  Patterson  clearly  demon¬ 
strated  how  negative  campaigning  meets  the  needs 
of  the  media  to  reach  a  wider  audience.  At  the  same 
time,  the  mass  media  should  exercise  their  control 
function,  which  is  supposed  to  be  among  the 
founding  principles  of  liberal  democracy.  At  the 
end,  because  of  this  wave  of  negativism,  this  con¬ 
trol  function  is  “out  of  work”;  it  does  not  support 
the  development  of  a  more  mature  democracy. 

Jurgen  Habermas,  in  his  seminal  work  Structural 
Transformation  of  the  Public  Sphere,  has  opened  a 
field  for  scholars  of  political  communication, 
directing  their  attention  to  the  way  the  public 
sphere  is  organized  in  each  society  and  the  way  in 
which  it  may  foster  democracy.  Today,  a  great 
deal  of  political  communication  research  is  focused 
on  his  concept  of  the  public  sphere.  At  the  same 
time,  Habermas  has  clearly  shown  that  the  com¬ 
mercialization  of  the  public  sphere  can  undermine 
a  rational  debate  on  affairs  of  public  interest,  and 
this  seems  to  be  the  most  important  challenge  that 
political  communication  faces  today. 
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Political  culture  consists  of  a  relatively  coherent 
repertoire  of  cognitive  and  evaluative  models  that 
enable  members  of  a  political  community  to  give 
sense  to  their  role  as  political  actors,  to  other 
political  actors,  to  the  community  they  belong  to, 
and  to  the  institutional  structure  in  which  they 
live.  Thanks  to  this  framework,  they  can  decide 
which  objectives  to  pursue  and  shape  their  actions 
and  behaviors  accordingly.  Political  culture  has 
the  following  features: 

a.  it  is  a  shared  legacy  accumulated  over  time; 

b.  it  consists  of  a  collection  of  solutions  that  have 
proved,  with  experience,  to  be  effective  in 
solving  problems  concerning  human  survival, 
adaptation  to  the  external  environment,  and 
internal  integration;  and 

c.  it  is  transmitted  to  new  members  of  the  political 
community  through  socialization. 

Political  culture  has  two  fundamental  constituents: 

f.  Cognitive  models — that  is,  a  population  of 
concepts  enabling  the  imposition  of  an  order  on 
the  world  through  a  rational  process  of  critical 
objectivization — and 

2.  Evaluative  models,  which  make  it  possible  to 
attribute  meaning  to  the  world  through 
identification  with  particular  values  that 
separate  what  is  good,  right,  and  desirable  from 
what  is  bad,  wrong,  and  to  be  avoided. 
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The  first  section  of  this  entry  deals  with  the  sci¬ 
entific  context  in  which  the  concept  of  political 
culture  was  introduced  into  political  science,  the 
questions  it  seeks  to  answer,  and  the  operational 
definitions  used  in  empirical  studies.  In  the  second 
section,  there  is  a  description  of  the  main  theoreti¬ 
cal  and  empirical  developments  in  comparative 
studies  on  political  culture,  conducted  by  means  of 
representative  sample  surveys  in  an  increasing 
number  of  countries.  This  is  followed  by  a  section 
devoted  to  a  discussion  of  such  research  design  and 
the  methodological  and  epistemological  presup¬ 
positions  that  are  involved.  The  fourth  and  fifth 
sections  cover  different  research  designs  regarding 
political  culture  and  related  concepts.  A  critical 
discussion  of  the  analogies  and  differences  between 
various  research  designs  is  presented  in  the  final 
section,  where  emphasis  is  placed  on  the  need  to 
consider  the  depth  of  the  historic  roots  of  every 
cultural  pattern  and,  thus,  of  each  political  culture. 

Like  broader  cultural  orientations,  political  cul¬ 
ture  is  largely  experienced  unconsciously  by  indi¬ 
viduals,  who  are  first  and  foremost  carriers  and 
users.  To  put  it  in  another  way,  the  set  of  cognitive 
and  evaluative  models  that  make  up  political  cul¬ 
ture  are,  according  to  Edgard  Schein’s  definition, 
“assumptions  taken  for  granted.”  Individuals  who 
share  a  specific  political  culture  consider  such  cog¬ 
nitive  and  evaluative  models  to  be  common  sense — 
the  obvious  and  natural  way  to  give  meaning  to 
the  political  sphere,  to  its  actors  and  institutions, 
as  well  as  to  its  boundaries — what  is  politics  and 
what  is  not.  As  anthropologists  have  observed  in 
relation  to  cultural  models  in  general,  there  is 
nothing  objective  or  natural  in  the  way  in  which 
the  content  of  political  culture  is  defined.  It  has  a 
pragmatic  basis  and  depends  on  the  challenges  and 
problems  that  human  beings  have  to  handle.  So 
political  culture  is  not  just  a  mental  construction  of 
assumptions  that  are  taken  for  granted.  These 
assumptions  form  the  background  and  the  basis 
for  the  political  behavior  of  actors — that  is,  the 
framework  within  which  individuals  act  in  what  is 
considered  a  politically  appropriate  way.  This 
involves  excluding  actions  considered  to  be  inap¬ 
propriate  or  deplorable  and  deciding  whether  or 
not  to  take  part  in  elections,  to  cooperate  with 
institutions  or  act  in  a  clandestine  fashion,  or  to 
organize  peaceful  demonstrations  or  take  part  in 
violent  protests. 


Political  culture  is,  therefore,  molded  by  the 
accumulated  experience  of  a  political  community 
and  is  a  constraint  that  is  very  durable  over  time. 
There  is  debate  among  scholars  as  to  how  far  one 
must  go  back  to  find  the  roots  of  today’s  political 
culture.  This  problem  is  discussed  in  more  detail 
below.  However,  there  is  a  general  consensus  that 
change  in  political  culture,  as  in  all  forms  of  cul¬ 
tural  change,  is  a  slower  and  more  difficult  process 
than  institutional,  economic,  and  social  change. 
This  is  the  reason  why  political  culture  is  character¬ 
ized  by  a  certain  amount  of  ambiguity.  On  the  one 
hand,  it  is  a  valuable  collective  resource  in  that  it 
makes  perceptions,  beliefs,  and  individual  attitudes 
toward  political  institutions  and  actors  relatively 
homogeneous.  On  the  other  hand,  it  represents  an 
obstacle  in  the  face  of  social  and  economic  changes. 
In  such  cases,  political  culture  may  offer  solutions 
that  prove  ineffective  when  coping  with  problems 
of  adaptation  to  new  challenges  from  outside  or 
from  within  a  given  society. 

Special  consideration  is  given  to  the  analytic, 
methodological,  and  empirical  contribution  made 
by  studies  of  social  capital,  a  concept  that  in  the 
past  2  decades  has  prompted  extensive  research 
into  the  relationship  of  norms,  beliefs,  and  social 
organization  with  the  effectiveness  of  democracy 
and  economic  development.  The  concept  of  politi¬ 
cal  culture  is  also  compared  with  that  of  nation 
building,  as  used  in  comparative  studies  of 
European  political  development,  according  to  a 
research  design  that  is  complementary  in  many 
respects  to  the  one  based  on  representative  sample 
surveys.  Finally,  this  entry  discusses  the  concept 
that  in  a  political  culture  that  is  adequate  for  an 
effective  democracy,  there  needs  to  be  a  balance 
between  two  different  components — on  the  one 
hand,  an  emphasis  on  individual  well-being  and 
self-realization  and,  on  the  other,  a  commitment 
and  fairness  toward  institutions  and  a  moral  obli¬ 
gation  toward  one’s  local  community  and  nation. 

A  New  Scientific  Concept  and 
Its  Operational  Definitions 

Unlike  conceptual  innovations  in  everyday  life,  in 
the  field  of  scientific  research,  each  conceptual 
innovation  is  carefully  recorded  due  to  the  com¬ 
plete  institutionalization  of  science.  The  concept  of 
political  culture  was  introduced  by  Gabriel  Almond 
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and  Sidney  Verba  at  the  beginning  of  the  1960s  in 
their  book  The  Civic  Culture.  This  work  was  part 
of  an  extensive  program  of  theoretical  discussion 
and  empirical  research  into  the  major  processes  of 
political  development  that  had  started  in  the 
1950s.  The  tragedy  of  the  two  World  Wars  and  of 
the  totalitarianisms  in  Europe,  the  birth  of  new 
democracies  in  the  three  nations  defeated  in  World 
War  II  (Germany,  Italy,  and  Japan),  the  establish¬ 
ment  of  communist  regimes  in  Eastern  Europe,  the 
founding  of  new  states  in  Africa  and  Asia  follow¬ 
ing  the  end  of  the  French  and  British  colonial 
empires — all  these  gave  rise  to  a  series  of  impor¬ 
tant  research  issues  regarding  the  stability  of  the 
new  political  regimes,  in  particular  the  new  democ¬ 
racies.  According  to  the  new  functionalist  and 
behaviorist  approaches  of  this  period,  it  was 
imperative  for  political  science  to  study  the  cul¬ 
tural  orientations  of  individuals,  especially  their 
attitudes  to  democracy,  as  these  are  crucial  for  the 
stability  of  democratic  political  systems. 

The  Civic  Culture  stressed  the  importance  of 
political  culture  as  a  dimension  capable  of  influenc¬ 
ing,  if  not  determining,  the  stability  and  perfor¬ 
mance  of  democratic  regimes.  The  new  concept 
built  on  an  illustrious  tradition.  In  the  history  of 
political  thought,  many  authors  have  emphasized 
the  importance  of  the  cultural  and  moral  orienta¬ 
tions  of  citizens  for  the  prosperity  and  power  of 
states.  The  terms  may  vary,  but  the  meanings  are 
similar:  civic  virtues  (Aristotle),  values  and  feelings 
of  identity  and  commitment  (Niccolo  Machiavelli), 
morality  and  customs  (Jean-Jacques  Rousseau), 
and  above  all  the  “habits  of  the  heart,”  which, 
according  to  Alexis  de  Tocqueville,  animated  the 
citizens  of  the  United  States  in  the  first  few  decades 
after  independence  and  were  the  foundation  of 
American  democracy. 

To  shed  light  on  the  subjective  components  of 
politics,  the  new  concept  of  political  culture  drew 
on  these  classic  contributions,  producing  a  new 
paradigm  for  empirical  research  in  political  science 
that  still  underpins  the  majority  of  studies  in  polit¬ 
ical  culture.  This  new  paradigm  is  based  on  four  of 
the  most  significant  theoretical  and  analytic  sources 
for  the  social  sciences  in  the  first  half  of  the  20th 
century: 

1.  The  contributions  of  Max  Weber’s  sociology 
in  the  theory  of  action  and  in  defining  a  typology 


of  criteria  according  to  which  individuals  consider 
political  authority  legitimate  and  agree  to  comply 
with  its  rules:  On  the  one  hand,  the  importance  of 
values  in  orienting  individual  behavior  is  stressed 
by  Weber’s  distinction  between  goal  rationality — 
that  is,  decisions  based  on  a  calculation  of  possible 
individual  benefits — and  value  rationality,  namely, 
decisions  based  on  value  orientations,  irrespective 
of,  or  contrary  to,  one’s  own  interests.  On  the 
other  hand,  Weber  defines  three  ideal  types  of 
political  authority:  traditional,  legal-rational,  and 
charismatic.  Each  is  supported  by  different  beliefs: 
the  belief  in  its  conformity  to  the  past,  the  belief  in 
its  conformity  to  established  rules,  and  the  belief 
in  the  particular  personal  qualities  of  a  leader.  In 
other  words,  legitimation  depends  on  one  of  three 
different  values:  tradition,  the  institutional  struc¬ 
ture  in  use,  or  a  single,  extraordinary  person.  The 
first  two  values  support  political  continuity,  while 
the  latter  can  contribute  to  political  change. 

2.  The  four  conceptual  pairs  ( the  so-called  pat¬ 
tern  variables)  defined  by  Talcott  Parsons  in  rela¬ 
tion  to  the  theoretical  foundations  of  his 
functionalist  approach:  universalism  versus  particu¬ 
larism,  achievement  motivation  versus  ascriptive- 
ness,  specificity  versus  diffuseness,  and  affective 
neutrality  versus  affectivity:  The  first  term  in  each 
pair  is  considered  a  typical  trait  of  modernity,  while 
the  second  is  a  feature  of  traditional  orientations. 
This  set  of  opposing  categories  is  the  basis  for  all 
subsequent  studies  of  modernization  processes  in 
both  the  political  and  the  economic  realm. 

3.  The  empirical  analysis  of  attitudes  developed 
by  social  psychologists  such  as  Louis  Guttman, 
Rensis  Likert,  and  Charles  Osgood  in  the  context 
of  the  new  behavioral  approach:  Different  attitude 
scales  were  designed  to  collect  systematic  and  com¬ 
parable  data  on  mass  opinions,  beliefs,  and  value 
orientations  through  face-to-face  interviews. 

4.  The  influence  of  Freudian  theories  on  Ameri¬ 
can  psycho-anthropology,  with  the  notion  of  the 
“basic  personality  structure,”  and  the  importance 
attributed  to  socialization  processes  not  only  in 
childhood  but  throughout  the  life  cycle. 

These  four  analytic  contributions  were  then 
combined  with  the  new  methodology  of  public 
opinion  polls,  which  make  it  possible  to  collect 
data  on  opinions  and  attitudes  in  representative 
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samples  of  citizens.  Indeed,  the  research  design  of 
The  Civic  Culture  applies  the  operational  defini¬ 
tion  of  political  culture  in  sample  surveys  in  five 
democracies:  the  United  States,  Mexico,  Great 
Britain,  Germany,  and  Italy. 

According  to  the  new  paradigm  established  in 
The  Civic  Culture,  political  culture  has  four  char¬ 
acteristics:  (1)  it  consists  of  the  set  of  subjective 
orientations  toward  politics  of  the  individual  citi¬ 
zens  of  a  nation;  (2)  it  consists  of  knowledge  and 
beliefs  about  politics  and  a  commitment  to  certain 
political  values;  (3)  it  is  the  result  both  of  a  social¬ 
ization  process  that  begins  in  childhood  and  con¬ 
tinues  through  one’s  education  and  exposure  to 
the  mass  media  and  also  of  direct  experience 
acquired  during  adulthood  with  regard  to  the  per¬ 
formance  of  political  institutions  and  actors;  and 
(4)  it  has  an  influence  on,  even  if  it  does  not  deter¬ 
mine,  the  performance  of  political  institutions,  due 
to  a  two-way  causal  link  between  culture  and 
institutional  performance.  In  general,  political  cul¬ 
ture  has  an  impact  on  the  quality  of  democracy, 
but  the  latter  also  contributes  to  orienting  the 
political  culture  of  a  nation’s  citizens. 

The  study  revealed  the  existence  of  three  differ¬ 
ent  types  of  political  culture:  parochial,  subject, 
and  participant.  Parochial  culture  is  characterized 
by  a  prevalence  of  attitudes  based  on  particular¬ 
ism,  localism,  short-range  trust,  and  a  subjective 
separation  from  the  state  and  politics.  The  main 
features  of  the  second  ideal  type  are  compliance 
and  confidence  in  the  legal  authority  of  the  state, 
its  administrative  order,  and  its  decisions — the 
output,  according  to  a  systemic  view.  Participant 
political  culture  is  based  on  the  active  political 
engagement  of  citizens  who  fuel  the  input  side 
through  the  creation  of  free  associations,  in  keep¬ 
ing  with  Tocqueville’s  classic  reflections. 

“Civic  culture,”  which  consists  of  a  balance 
between  these  ideal  types,  is  considered  to  be  the 
most  suitable  cultural  foundation  for  a  stable 
democracy.  Of  the  five  political  systems  taken  into 
consideration,  the  United  States  and  Great  Britain 
had  a  civic  culture,  while  Germany  and  Italy  were 
considered  democracies  with  a  high  risk  of  insta¬ 
bility  at  the  time  when  the  data  for  the  The  Civic 
Culture  had  been  collected  toward  the  end  of  the 
1950s.  Germany  was  deemed  to  have  a  predomi¬ 
nantly  subject-based  political  culture,  while  Italy 
was  largely  parochial. 


Changes  in  Political  Culture: 

The  Rise  of  Postmaterialist  Values 

Some  of  the  most  important  findings  of  The  Civic 
Culture  were  reviewed  and  criticized  20  years  later 
by  Almond  and  Verba  themselves.  They  pointed  to 
the  growth  of  a  participatory  culture  in  Germany, 
the  reduction  of  subject  attitudes,  and  an  increase 
in  the  levels  of  dissatisfaction  and  distrust  in  Britain 
and  the  United  States.  In  the  meantime,  a  host  of 
other  investigations  had  been  conducted,  revealing 
a  drop  in  the  degree  of  confidence  in  democratic 
institutions  and  increasing  disaffection  and  politi¬ 
cal  protest  in  Western  democracies  and  Japan. 

The  observed  changes  in  value  orientations  are 
of  particular  interest.  On  the  basis  of  a  compara¬ 
tive  study  of  six  European  nations  carried  out  in 
1970,  Ronald  Inglehart  noted  that  the  youth  pro¬ 
test  movements  were  primarily  concerned  with 
issues  neglected  by  the  traditional  political  parties, 
for  instance,  environmental  conservation,  disarma¬ 
ment,  and  needs  associated  with  individual  self- 
fulfilment  rather  than  economic  improvement. 
Inglehart  considered  these  value  orientations  to  be 
the  effects  of  the  situation  of  economic  well-being 
in  which  the  socialization  of  young  people  had 
taken  place  in  Western  European  countries,  which 
had  reached  an  unprecedented  level  of  wealth  since 
World  War  II.  With  the  modification  of  political 
priorities,  cultural  change  was  fueled  by  the  demo¬ 
graphic  replacement  of  the  population  due  to  the 
arrival  of  generations  with  more  postmaterialist 
orientations  than  the  older  ones,  which  gradually 
disappeared  from  the  scene. 

Inglehart  based  his  thesis  on  the  theory  of  moti¬ 
vation  developed  by  the  psychologist  Abraham 
Maslow,  who  considers  the  gratification  of  needs 
to  be  as  decisive  for  human  action  as  the  classic 
principle  of  deprivation.  According  to  this  theory, 
the  fundamental  needs  of  human  beings  are  orga¬ 
nized  into  a  hierarchy  consisting  of  four  ascending 
levels:  (1)  basic  physiological  needs;  (2)  the  need 
for  safety  and  stability;  (3)  the  need  for  affection, 
self-esteem,  and  a  sense  of  belonging;  and  at  the 
highest  level  (4)  the  need  for  self-realization. 
Satisfaction  of  a  need  pertaining  to  a  lower  level 
brings  to  the  fore  the  one  relating  to  the  next  level. 
According  to  Maslow,  this  framework  is  an  orga¬ 
nizational  model  of  the  individual  personality. 

Inglehart  turned  it  into  an  explanatory  model 
of  the  changes  in  political  culture:  The  older 
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generations,  who  grew  up  amid  the  poverty  and 
insecurity  generated  before  and  during  the  two 
World  Wars,  were  oriented  toward  the  materialist 
values  induced  by  survival  and  safety  needs.  By 
contrast,  young  Europeans  born  after  World  War 
II  are  oriented  toward  postmaterialist  values — that 
is,  values  such  as  belonging,  self-esteem,  and  self- 
realization.  Having  grown  up  in  a  period  of 
unprecedented  economic  prosperity,  they  tend  to 
take  a  certain  level  of  material  comfort  for  granted 
and  therefore  develop  the  value  priorities  typical  of 
higher  levels  of  prosperity.  As  a  result,  they  are 
more  oriented  toward  themes  such  as  personal  ful¬ 
filment,  individual  freedom,  and  the  conservation 
of  nature.  This  difference  between  young  people’s 
values  and  those  of  their  parents  leads  to  cultural 
change  since,  according  to  the  Freudian  concept  of 
the  “basic  personality  structure”  developed  by 
Ralph  Linton  and  Abraham  Kardiner,  individuals 
tend  to  maintain  in  the  course  of  their  adult  life  the 
value  priorities  adopted  in  a  deep  sense  during  the 
formative  phases  of  their  childhood  and  youth. 

The  operational  definition  used  by  Inglehart  to 
collect  data  on  materialist  and  postmaterialist 
value  priorities  is  an  inventory  of  12  possible 
political  goals.  Representative  samples  from 
Western  countries  were  asked  to  choose  the  most 
important  political  goals  from  the  following  items: 

Material  goals:  maintain  order  in  the  nation;  fight 
rising  prices;  maintain  a  high  rate  of  economic 
growth;  make  sure  the  country  has  strong  defense 
forces;  maintain  a  stable  economy;  and  fight 
against  crime 

Postmaterial  goals:  give  people  more  say  in  the 
decisions  of  government;  protect  freedom  of  speech; 
give  people  more  say  in  how  things  are  decided  at 
work  and  in  their  community;  try  to  make  our  cities 
and  countryside  more  beautiful;  move  toward  a 
friendlier,  less  impersonal  society;  and  move  toward 
a  society  where  ideas  count  more  than  money 

Respondents  were  then  classified  as  materialists 
or  postmaterialists  depending  on  whether  they 
favor  one  of  the  two  kinds  of  goals  consistently. 

A  Worldwide  Research  Program 

The  operational  definition  used  by  Inglehart  to 
measure  value  change  has  become  a  standard  tool 


in  the  proliferating  studies  of  political  culture,  along 
with  questions  aimed  at  surveying  interpersonal 
and  institutional  trust,  preference  for  democracy  or 
autocracy,  life  satisfaction,  and  other  similar  issues. 
This  series  of  studies  followed  the  research  design 
originally  adopted  in  The  Civic  Culture.  The  design 
has  three  main  characteristics:  (1 )  a  sample  survey. 
data  collection  on  political  opinions,  attitudes, 
beliefs,  and  values  conducted  by  means  of  struc¬ 
tured  interviews  with  representative  samples  of 
citizens;  (2)  a  comparative  design:  the  same  ques¬ 
tionnaire  is  applied  in  different  political  systems  in 
the  same  period.  In  other  words,  the  same  opera¬ 
tional  definitions  are  used  in  different  countries, 
favoring  comparability  of  data  and  permitting  the 
testing  of  hypotheses  at  an  individual  as  well  as  a 
national  level;  and  (3)  a  longitudinal  design:  if  pos¬ 
sible,  the  same  questionnaires — or  the  same  subsets 
of  closed  questions — are  applied  in  the  same  coun¬ 
tries  in  different  years,  creating  a  rising  number  of 
time  series  for  many  political  culture  variables,  such 
as  levels  of  institutional  trust,  satisfaction  with 
democratic  performance,  support  for  leaders, 
national  pride,  and  so  on. 

A  number  of  agencies  have  been  established  in 
recent  decades  to  monitor  public  opinion  orienta¬ 
tions  and  political  attitudes.  The  Eurobarometer 
program  of  the  European  Union  was  set  up  in  1973 
and  since  1974  has  supplied  twice-yearly  data  on 
opinions  and  attitudes  for  each  member  or  candi¬ 
date-member  of  the  Union.  Similar  survey 
programs  have  recently  been  set  up.  The  New 
Democracies  Barometer,  established  in  1991,  cov¬ 
ers  12  East  European  countries;  the  Latinobarometer 
covers  19  countries  from  1996  onward;  while 
Afrobarometer  covers  more  than  12  states  since 
1999.  Cooperation  between  different  research  cen¬ 
ters  around  the  world  has  led  to  an  increase  in  the 
number  of  nations  for  which  data  on  political  cul¬ 
ture  indicators  are  available.  Increasingly,  extensive 
networks  have  been  built  up,  making  it  possible  to 
conduct  the  same  research  project  at  the  same  time 
in  an  ever  greater  number  of  nations.  In  particular, 
the  World  Values  Survey  and  the  European  Values 
Survey  have  conducted  five  waves  in  a  steadily  ris¬ 
ing  number  of  countries.  After  the  first  wave  in 
1981,  successive  waves  of  data  collection  were  car¬ 
ried  out  in  1990,  1995,  2000,  and  2005,  covering 
countries  on  all  continents  (with  more  than  100  in 
the  most  recent  wave).  A  further  advantage  has 
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been  the  setting  up  of  efficient  data  archives. 
Coupled  with  new  data  transmission  tools,  these 
archives  facilitate  secondary  analyses — that  is,  the 
research  and  empirical  testing  of  hypotheses  by 
researchers  who  have  not  taken  part  in  gathering 
and  analysing  the  original  data. 

All  these  developments  have  resulted  in  a  strong 
growth  in  the  degree  of  institutionalization  and 
standardization  of  research  into  mass  orientations. 
One  of  the  most  recent,  and  ambitious,  findings  of 
this  research  programme  is  illustrated  in  Figure  1. 
It  was  produced  by  Ronald  Inglehart  and  Christian 
Welzel  on  the  basis  of  data  collected  in  the  first 
four  waves  of  the  World  Values  Survey. 

Figure  1  shows  the  position  of  80  states  with 
respect  to  two  variables.  The  vertical  axis  is  an  index 
of  the  quality  of  democracy  in  the  different  countries 
and  takes  into  account  not  only  the  existence  or 


otherwise  of  free  elections  but  also  the  moral  integ¬ 
rity  of  the  political  elites  as  measured  by  the  “con¬ 
trol  of  corruption”  scores  provided  by  the  World 
Bank  and  other  organizations.  Corruption  among 
elites  is  in  fact  the  main  factor  preventing  respect  for 
the  equality  of  rights  and  the  law  and  therefore  for 
an  effective  democracy.  The  lower  values  on  the 
scale  relate  to  nondemocratic  countries  with  corrupt 
elites,  while  the  higher  scores  are  obtained  by 
democracies  with  political  elites  that  guarantee  the 
rule  of  law  and  equal  rights. 

The  horizontal  axis  shows  the  average  values  for 
each  country  on  an  index  that  measures  the  spread 
of  a  cultural  orientation  based  on  the  predominance 
of  values  associated  with  individual  self-realization 
and  self  well-being.  It  is  a  direct  evolution  of 
Inglehart’s  first  postmaterialism  scale.  This  new 
cultural  syndrome  is  surveyed  by  five  indicators: 
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Self-expression  values 

(Residuals  unexplained  by  “solid”  democrats) 


Figure  I  The  Impact  of  Self-Expression  Values  on  Effective  Democracy,  Controlling  for  Each  Country’s  Percentage 
of  Solid  Democrats 

Source-.  Inglehart,  R.,  &  Welzel,  C.  (2005).  Modernization,  cultural  change,  and  democracy.  Cambridge,  UK:  Cambridge 
University  Press. 
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1.  postmaterialist  liberty  aspirations  (give  people 
more  say  in  the  decisions  of  government, 
protect  freedom  of  speech,  and  give  people 
more  say  in  how  things  are  decided  at  work  and 
in  their  community); 

2.  forms  of  political  protest,  such  as  signing 
petitions; 

3.  tolerance  of  homosexuality  and  sexual  liberty; 

4.  interpersonal  trust;  and 

5.  life  satisfaction. 

The  relation  between  the  two  variables  in  Figure 
1  is  measured  controlling  for  the  percentage  of 
respondents  who  prefer  democracy  over  autocracy, 
to  exclude  spurious  effects  due  to  merely  instrumen¬ 
tal  prodemocratic  motives.  Countries  where  self- 
expression  values  are  relatively  less  widespread  than 
mere  support  for  democracy  would  suggest  are 
those  where  the  political  regime  violates  the  rule  of 
law  and  equal  rights  more  than  levels  of  mere  sup¬ 
port  for  democracy  would  suggest  (see  bottom  left- 
hand  corner).  In  the  opposite  corner,  countries 
where  self-expression  values  are  relatively  more 
widespread  than  mere  support  for  democracy  would 
suggest  are  those  where  democracy  is  more  effective 
than  mere  support  for  democracy  would  suggest. 

In  short,  Figure  1  shows  the  strong  linkage 
between  a  peculiar  syndrome  of  political  culture — 
self-expression  values — and  the  level  of  effective 
democracy:  Near  the  top  right-hand  corner  are  the 
small  democracies  of  Protestant  Northern  Europe, 
with  England  and  the  English-speaking  democra¬ 
cies  (the  United  States,  Canada,  and  Australia). 
Near  the  opposite  corner  are  some  African  and 
Asian  states,  with  Yugoslavia  (at  the  time  formed 
by  Serbia  and  Montenegro)  ranking  as  the  lowest 
European  country  on  both  variables. 

The  Limits  of  the  Comparative 
Survey  Approach 

The  application  and  extension  of  the  research  design 
of  The  Civic  Culture  in  the  40-year  period  since  it 
was  first  published  has  enabled  the  international 
political  science  community  to  build  up  a  large  num¬ 
ber  of  comparable  data  sets  on  a  growing  number  of 
nations.  The  availability  of  statistical  packages  and 
powerful,  low-cost  computers,  combined  with  the 


Internet  infrastructure,  has  facilitated  the  empirical 
testing  of  many  hypotheses  by  means  of  complex 
multivariate  models.  Important  changes  in  politi¬ 
cal  attitudes  and  beliefs  have  been  monitored  over 
time  and  compared  in  different  countries.  Like  any 
scientific  method,  technique,  or  decision,  this 
research  design  also  has  certain  limitations.  As 
with  all  research  tools,  the  capacity  of  sample  sur¬ 
veys  to  achieve  objectives  depends  on  the  degree  to 
which  they  offer  a  simplification  of  the  world,  the 
complexity  of  which  cannot  be  grasped  by  any  one 
tool. 

The  sample  survey  research  design  in  the  com¬ 
parative  study  of  political  culture  is  no  exception 
to  this  rule.  In  particular,  it  tends  to  emphasize  the 
orientations  of  mass  political  culture  rather  than 
those  of  the  elites.  Similarly,  the  wide-ranging 
comparative  design  makes  it  easier  to  concentrate 
on  mass  attitudes  relating  to  the  polity  and  politics 
levels  instead  of  the  policy  level,  which  is  more 
context  dependent.  Scholars  have  stressed  these 
limits  together  with  others  that  stem  largely  from 
the  basic  assumptions  of  the  two  main  approaches 
that  gave  rise  to  this  paradigm:  functionalism  and 
behaviorism. 

The  functionalist  approach  has  two  limits.  On 
the  one  hand,  there  is  a  tendency  to  regard  politics 
as  a  clearly  defined  sphere  with  respect  to  society 
and  the  economy,  which  is  easily  recognizable  even 
in  very  different  social  systems.  On  the  other  hand, 
functionalism  tends  to  favor  a  synchronic  perspec¬ 
tive,  with  a  consequent  reduction  in  the  attention 
devoted  to  the  diachronic  dimension  and  in  piecing 
together  the  historic  origins  of  the  observed  pro¬ 
cesses.  As  seen  above,  in  the  paradigm  of  the  com¬ 
parative  research  survey,  the  temporal  dimension 
is  only  taken  into  account  through  the  collection  of 
successive  “snapshots” — that  is,  the  various  waves 
of  sample  surveys. 

The  behaviorist  approach,  whose  roots  lie  in 
experimental  psychology,  has  greatly  stimulated 
the  operational  definition  of  citizens’  opinions,  atti¬ 
tudes,  and  value  orientations.  However,  it  is  based 
on  the  individualist  and  atomist  assumption  that 
the  whole  equals  the  sum  of  its  individual  parts. 
The  critical  point  is,  therefore,  the  link  between  the 
microlevel  (a  sample  of  individuals  interviewed) 
and  the  macrolevel.  The  political  culture  of  a  coun¬ 
try  is  viewed  as  a  statistical  aggregation  of  the 
opinions  and  attitudes  of  individual  citizens. 
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In  addition  to  these  limitations,  which  derive 
from  the  epistemological  features  of  functionalism 
and  behaviorism,  representative  sample  surveys 
also  have  several  methodological  limitations: 

•  Actual  behaviors,  which  are  the  overt  output  of 
cultural  orientations,  are  not  observed  but  only 
inferred  by  verbal  answers  to  questions. 

•  It  is  assumed  that  the  meanings  of  questions  and 
answers  are  the  same  in  different  countries  and 
languages — a  necessary  assumption  if  one  is  to 
consider  the  answers  of  interviewees  to  be 
comparable. 

•  The  number  of  interviews  in  national  samples  is 
usually  too  small  to  guarantee  the  statistical 
representativeness  of  subnational  samples.  This 
makes  it  impossible  to  explore  the  regional 
differences  in  political  culture  within  a  given 
country. 

•  Surveys  also  often  have  difficulties  of  grasping 
the  unconscious  and  “things  taken  for  granted,” 
as  pointed  out  above. 

The  following  sections  deal  with  two  different 
contributions  to  the  study  of  political  culture  that 
can  be  regarded  as  complementary  to  the  para¬ 
digm  established  by  The  Civic  Culture ,  in  that  they 
pursue  different  research  designs  or  strategies  of 
inquiry,  each  of  which  overcomes  some  of  the  lim¬ 
its  described  above,  though  in  different  ways. 

Social  Capital  and  Democracy 

One  of  the  principal  conceptual  innovations  in 
political  science  and  sociology  over  the  past  20 
years  is  the  notion  of  social  capital,  introduced  in 
the  1960s  by  the  economists  Gary  Becker  and 
James  Loury.  The  concept  of  social  capital  became 
popular  in  political  science  as  a  result  of  the  ana¬ 
lytic  work  of  James  Coleman,  who  related  it  to 
social  networks,  and  the  research  of  Robert  Putnam 
into  the  institutional  performance  of  Italian 
regional  governments  in  Making  Democracy 
Work.  According  to  Putnam’s  definition,  social 
capital  “refers  to  features  of  social  organization, 
such  as  trust,  norms,  and  networks  that  can 
improve  the  efficiency  of  society  by  facilitating 
coordinated  actions”  (p.  167). 

In  other  words,  social  capital  is  a  collective 
resource  and  has  some  of  the  features  of  a  public 


good:  It  offers  advantages  to  all  the  members  of  a 
group,  but  no  one  can  appropriate  it  in  an  exclu¬ 
sive  way.  If  one  person  benefits  from  social  capital, 
this  does  not  reduce  its  availability  for  others.  On 
the  contrary,  social  capital  has  a  radically  anti- 
economic  feature:  The  more  it  is  used,  the  more  of 
it  becomes  available  for  the  entire  community. 
Social  capital,  therefore,  offers  a  solution  to  the 
dilemmas  of  collective  action  posed  by  scholars 
such  as  Mancur  Olson  and  Elinor  Ostrom.  As  a 
form  of  social  organization,  social  capital  also  has 
positive  effects  on  economic  development  in  that  it 
contributes  to  creating  a  favorable  environment 
for  market  exchanges  thanks  to  cooperation,  trust¬ 
worthiness,  honesty,  and  compliance  with  formal 
and  informal  rules.  The  notion  of  social  capital  is 
closely  related  to  that  of  civic  culture.  They  have  a 
common  ancestor,  Tocqueville,  who  attributed 
great  importance  to  the  trustworthiness  of  citizens 
and  the  significance  of  free  associations.  The  two 
concepts  also  share  concern  about  the  relation 
between  culture  and  effective  democracy. 

Putnam’s  research  on  Italian  regions  exploited 
the  opportunity  to  apply  an  experimental  design. 
In  the  1970s,  the  newly  introduced  regional  insti¬ 
tutions  began  to  operate  within  the  same  nation¬ 
state.  The  research  question  was  which  variables 
explain  the  differences  in  the  output  of  the  new 
regional  governments  in  the  common  context. 
According  to  Putnam’s  findings,  the  marked  differ¬ 
ences  in  the  economic  development  of  the  Northern 
and  Southern  regions  tend  to  coincide  with  great 
differences  in  institutional  efficiency.  However, 
this  turned  out  to  be  a  spurious  correlation.  The 
independent  variable,  when  introduced  as  a  con¬ 
trol  variable  that,  in  fact,  explains  the  differences 
in  both  economic  development  and  institutional 
performance  is  the  amount  of  social  capital  present 
in  the  various  Italian  regions  at  the  end  of  the  19th 
century.  The  Northern  regions  had  the  same  level 
of  poverty  as  the  Southern  regions  but  appreciably 
higher  levels  of  social  capital.  Seventy  years  later, 
in  postwar  Italy,  social  capital  or  the  “civic  com¬ 
munity”  (Figure  2)  explains  both  the  difference  in 
economic  development  and  the  difference  in  insti¬ 
tutional  performance.  The  well-known  economic 
cleavage  between  the  North  and  South  (i.e.,  the 
Mezzo gior no)  is  only  one  aspect  of  a  multifaceted 
divide  that  sets  regions  with  a  high  social  capital 
and  high  institutional  performance  apart  from 
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regions  with  limited  social  capital  and  inefficient 
local  institutions. 

As  Figure  2  suggests,  Putnam’s  point  of  depar¬ 
ture  is  similar  to  Inglehart  and  Welzel’s,  but  the 
research  design  differs  on  many  significant  points: 

1.  There  is  no  comparison  between  different 
nations  but  an  analysis  of  a  single  nation,  Italy, 
with  various  research  techniques.  The  compara¬ 
tive  design  regards  the  Italian  regions,  thereby 
emphasizing  within-state  differences  and  reduc¬ 
ing  the  risk  of  comparing  cases  that  are  too 
heterogeneous. 

2.  The  nature  of  the  civic  community  is  mea¬ 
sured  not  only  by  means  of  elite  and  mass  surveys 
but  also  by  gathering  data  on  observable  and 
documentable  behavior  (involvement  in  voluntary 
or  other  associations,  newspaper  circulation  fig¬ 
ures,  and  election  turnout). 

3.  Data  gathered  from  official  documents  and 
historical  archives  also  make  it  possible  to  make 
intertemporal  comparisons  during  almost  a  century. 

4.  Differences  within  the  same  country  are  high¬ 
lighted  and  their  historic  origins  reconstructed 


Figure  2  The  Civic  Community  and  Institutional 
Performance 

Source :  Putnam,  R.  (with  Leonardi,  R.,  &  Nanetti,  R.). 
(1993).  Making  democracy  work:  Civic  traditions  in  modern 
Italy  (p.  98).  Princeton,  NJ:  Princeton  University  Press. 


using  data  and  findings  relating  to  the  Italian  tradi¬ 
tion  of  electoral  studies. 

5.  By  conducting  a  comparative  analysis  within 
a  single  nation,  it  is  possible  to  make  reliable  pre¬ 
dictions  on  the  basis  of  dynamic  models.  The  dif¬ 
ferences  between  the  Northern  and  Southern 
regions  are  not  only  significant  but  above  all  are 
hard  to  eliminate,  in  that  they  tend  to  create  two 
opposing  conditions  of  equilibrium.  Regions  with 
civic  communities  display  virtuous  circles  of  trust, 
participation,  effective  institutions,  and  economic 
development.  By  contrast,  regions  with  uncivic 
communities  are  entrapped  in  a  vicious  circle  of 
distrust,  defection,  inefficient  institutions,  and  eco¬ 
nomic  stagnation. 

Political  Culture,  Nation  Building, 
and  State  Formation 

Putnam’s  conclusions  confirm,  on  the  one  hand, 
the  importance  of  political  culture  for  the  quality 
of  democracy  and,  on  the  other  hand,  specify  the 
particularity  of  Italy,  as  had  been  observed  in  the 
1950s  and  1960s  not  only  in  The  Civic  Culture  but 
also  in  the  contemporary  ethnographic  fieldwork 
of  Edward  Banfield  in  The  Moral  Basis  of  a 
Backivard  Society  (referring  to  a  community  in 
Southern  Italy),  which  shed  light  on  the  syndrome 
of  “amoral  familism.”  Italy  is  a  case  of  a  divided 
political  culture  in  which  the  divisions  are  to  a 
large  extent  geographical.  In  Northern  Italy,  the 
civic  community,  which  corresponds  to  a  partici¬ 
pant  political  culture,  tends  to  prevail.  In  the 
Southern  regions,  there  tends  to  be  a  prevalence  of 
parochialism  characterized  by  localist  and  familis- 
tic  loyalties — that  is,  the  vicious  circle  of  the 
uncivic  community. 

This  latter  set  of  concepts  emphasizes  a  further 
aspect  of  political  culture,  namely,  that  political 
culture  also  consists  of  beliefs  and  attitudes  that  do 
not  have  an  explicit  political  content.  The  political 
meaning  and  consequences  of  familism  and  paro¬ 
chialism  are  implicit  and  embedded.  Nonetheless, 
they  are  just  as  important  as  the  explicit  political 
content  of  participant  and  subject  cultures.  More 
specifically,  it  can  be  said  that  parochial  culture  is 
the  consequence  of  historic  processes  marked  by 
limited  social  and  political  mobilization  on  the 
part  of  the  elites. 
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As  Karl  Deutsch  argued,  mobilization  is  a  pro¬ 
cess  of  change  that  involves,  entirely  or  partially, 
the  population  of  countries  that  are  undergoing 
modernization.  In  the  early  stages,  this  tends  to 
lead  to  changes  in  the  employment  and  residence  of 
individuals  while,  subsequently,  it  also  radically 
modifies  their  perceptions,  expectations,  beliefs, 
memories,  and  sense  of  identity.  In  other  words, 
the  process  of  mobilization  changes  the  assump¬ 
tions  that  people  take  for  granted,  in  that  it  changes 
behavior  and  the  problems  that  need  to  be  coped 
with.  Sectors  of  the  population  that  are  less  affected 
by  this  process  maintain  to  a  greater  degree  the 
traditional  set  of  assumptions  and  beliefs  that 
political  scientists  label  as  parochialism  or  familism. 

Social  and  political  mobilization  also  lies  at  the 
heart  of  Stein  Rokkan’s  study  of  state  formation 
and  nation-building  processes  in  Europe,  starting 
from  the  collapse  of  the  Roman  Empire.  In  general, 
the  political  development  of  Western  Europe  has 
taken  place  in  four  phases,  which  can  be  summed 
up  as  follows:  (1)  state  formation ,  which  involves 
the  establishment  of  politically  centralized  control 
over  a  given  territory,  which  is  defended  from 
external  or  internal  attack  and  administered  in  a 
uniform  way  by  civil  and  military  bureaucracies; 
(2)  nation  building,  promoted  through  a  process  of 
cultural  standardization  with  the  imposition  of  a 
common  language,  a  single  religion,  and  rituals 
and  myths  that  lend  legitimacy  to  the  power  of  the 
monarch  or  elites;  (3)  democratization,  through 
the  granting  of  suffrage  to  increasingly  large  por¬ 
tions  of  the  population;  and  (4)  the  creation  of  the 
welfare  state — that  is,  a  state  that  looks  after  its 
citizens,  guaranteeing  them  health  care,  education, 
and  protection  against  the  risks  of  poverty. 

The  ways  in  which  this  occurred  and  the  time  it 
has  taken  for  different  states  to  meet  these  four 
challenges  has  had  a  lasting  effect  on  the  quality 
and  stability  of  democratic  regimes.  The  older 
states,  formed  prior  to  the  Peace  of  Westphalia 
(1648),  have  proved  to  be  the  most  stable  democ¬ 
racies  in  the  20th  century.  Typical  cases  are 
England,  Sweden,  and  Denmark.  At  the  opposite 
extreme,  there  is  the  case  of  Italy  and,  to  a  lesser 
extent,  Germany,  which  only  became  unified  states 
in  the  1860s  and  1870s,  after  the  beginning  of  a 
process  of  social,  political,  and  economic  mobiliza¬ 
tion  fueled  throughout  the  continent  by  the  French 
Revolution  and  the  Industrial  Revolution.  These 


new  states  have  had  to  overcome  the  four  chal¬ 
lenges  in  less  than  a  century.  In  both  cases,  the 
liberal  and  democratic  state  collapsed  in  the  period 
between  the  two  World  Wars,  with  the  rise  of  the 
fascist  and  Nazi  regimes. 

According  to  this  line  of  research,  in  Europe,  the 
culture  of  a  country  derives  from  the  interaction 
between  three  fundamental  components:  ethnic- 
linguistic  identity,  religious  faith,  and  the  outcome 
of  processes  of  cultural  standardization  activated 
by  nation  builders  through  the  education  system, 
compulsory  military  service,  and  so  on.  Political 
culture  is,  therefore,  profoundly  influenced  by  the 
timing  and  the  modalities  of  state  formation  and 
nation-building  processes.  Moreover,  the  outcomes 
of  these  two  processes  create  the  patterns  that 
define  a  feature  of  political  culture  that  Weber  had 
already  considered  as  decisive  for  the  stability  of  a 
regime:  the  degree  of  legitimacy  enjoyed  by  politi¬ 
cal  institutions.  These  may  in  fact  be  regarded  as 
positive  values — symbols  with  which  to  identify  as 
members  of  a  nation — or  negative  values — that  is, 
symbols  of  a  political  regime  that  has  little  legiti¬ 
macy  and  arouses  distrust  and  suspicion.  This  top- 
down  schema  of  the  relationship  between  political 
culture  and  institutional  architecture  assigns  a 
fundamental  role  to  the  elites  of  nation  builders 
and  places  the  origins  of  modern-day  political  cul¬ 
tures  much  farther  back  in  the  past. 

Rokkan’s  schema  of  political  development  also 
highlights  cases  of  countries  with  nonuniform 
political  cultures.  This  lack  of  uniformity  may  be 
due  to  resistance  on  the  part  of  some  peripheral 
areas  to  the  process  of  cultural  standardization 
promoted  by  the  center  or  to  shortcomings  or 
delays  in  the  state  formation  process.  Spain,  the 
Netherlands,  and  Italy  are  three  examples  of  coun¬ 
tries  that  have  experienced  nation-building  difficul¬ 
ties  as  a  result  of  cultural  differences  within  their 
frontiers.  Spain  is  a  case  of  early  state  formation, 
but  it  has  been  unable  to  overcome  the  resistance 
of  peripheral  areas  with  considerable  economic 
resources,  which  have  managed  to  maintain  lin¬ 
guistic  and  cultural  autonomy.  After  the  transition 
to  democracy  in  the  1970s,  the  new  constitution, 
introduced  in  1978,  recognizes  the  existence  of  a 
plurality  of  nations  within  the  Kingdom  of  Spain. 

The  Netherlands  are  an  example  of  a  country 
with  different  subcultures  resulting  from  linguistic 
or  religious  cleavages.  These  cultural  cleavages 
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have  been  successfully  bridged  by  the  concept  of 
consociational  democracy  founded  on  “polariza¬ 
tion,”  involving  cooperation  of  the  elites  across 
cleavages.  This  institutional  accommodation  is 
effective  to  the  extent  that  it  recognizes,  confirms, 
and  reassures  each  of  the  different  cultural  identi¬ 
ties  (Catholic,  Protestant,  and  secular).  Incidentally, 
the  dramatic  developments  following  the  collapse 
of  the  Republic  of  Yugoslavia  in  the  1990s  testify 
that  this  accommodation  is  very  difficult  to  adopt, 
even  within  21st-century  Europe,  if  the  countries 
involved  do  not  have  a  democratic  political  culture. 

Italy  is  an  example  of  late  state  formation  car¬ 
ried  out  by  a  secularized  elite  of  nation  builders 
that  was  weak  and  isolated  with  respect  to  the 
twofold  opposition  of  the  Catholic  and  socialist 
movements.  These  movements  created  strong  anti- 
state  subcultures,  thus  contributing  to  the  demo¬ 
cratic  breakdown  in  1922.  Here,  too,  there  is  a 
general  lesson  to  be  learned:  The  historic  legacy  of 
an  element  of  weakness — strong  antistate  subcul¬ 
tures — can  become  a  resource  once  the  political 
and  institutional  framework  and  the  international 
context  have  changed.  In  postwar  Italy,  it  was  the 
networks  of  these  two  subcultures — the  unions, 
cooperatives,  voluntary  associations,  religious 
groups,  and  local  savings  banks — that  provided 
the  organizational  basis  of  the  civic  community  of 
Northern  Italy  as  described  by  Putnam. 

Two  Necessary  Components 
of  Political  Culture 

The  question  about  the  relationship  between 
political  culture  and  democracy  has  been  answered 
in  many  ways  over  the  past  few  decades  and  has 
opened  up  various  research  perspectives.  Three 
different  research  strategies,  among  the  most  influ¬ 
ential  and  well-known,  have  been  presented  in 
detail.  Of  the  three,  the  paradigm  of  comparative 
survey  research  into  political  culture  orientations 
adopts  methods  and  techniques  that  differ  greatly 
from  those  used  by  Rokkan  in  his  theory  about  the 
conditions  and  processes  that  led  to  the  birth  of 
democracy  in  Western  Europe.  There  are  four 
main  differences: 

1.  With  each  new  wave,  the  World  Values 
Survey  has  tended  to  enlarge  the  number  of  coun¬ 
tries  covered  by  the  representative  sample  surveys, 


without,  however,  taking  into  account  the  growing 
variance  in  the  level  of  economic  development,  the 
literacy  of  the  population,  and  the  duration  of  the 
democratic  regime.  By  contrast,  Rokkan  focused 
on  a  more  limited  area  that  is  relatively  homoge¬ 
neous  in  terms  of  historical,  cultural,  political,  and 
economic  development. 

2.  The  first  paradigm  seeks  to  explain  the  differ¬ 
ences  in  opinions,  beliefs,  and  values  by  resorting 
to  multivariate  models  capable  of  producing  high 
correlation  coefficients.  The  second  pieces  together 
the  complex,  centuries-old  web  of  social,  political, 
economic,  and  cultural  processes  with  typologies 
that  can  explain  the  individual  national  versions  of 
European  democracy. 

3.  The  first  paradigm  alternates  between  indi¬ 
vidual-  and  state-level  analyses.  The  second  is 
interested  in  detecting  the  existence  of  specific  ter¬ 
ritorial  cultures  within  states,  interpreting  them  as 
aspects  of  a  peculiar  path  of  state  formation  and 
national  building. 

4.  The  first  paradigm  recognizes  the  two-way 
relation  between  political  culture  and  the  effec¬ 
tiveness  of  democracy.  However,  the  rising  num¬ 
ber  of  countries  considered  under  a  synchronic 
perspective  tends  to  privilege  the  spread  of  civic 
and  self-expression  values  as  the  causal  factor  that 
makes  democracy  work,  according  to  a  bottom- 
up  schema.  The  second  paradigm  tends  instead  to 
emphasize  a  top-down  schema,  analyzing  the  dif¬ 
ferent  nation-building  capacities  of  state  institu¬ 
tions — that  is,  their  ability  to  define  political 
culture. 

On  all  four  points,  the  work  of  Putnam  and  his 
colleagues  lies  in  an  intermediate  position,  resort¬ 
ing  as  it  does  to  a  comparative  approach  limited  to 
a  homogeneous  area,  stressing  the  historical  roots 
of  the  differences  between  the  cases  considered  and 
recognizing  virtuous  or  vicious  circles  between 
effective  institutions  and  civic  community. 

Almost  paradoxically,  the  two  most  distant 
paradigms  yield  analogous  results.  The  countries 
that  the  series  of  World  Values  Surveys  have  shown 
to  be  characterized  by  the  greatest  democratic  effec¬ 
tiveness  and  a  more  self-expression-oriented  politi¬ 
cal  culture  include  the  European  countries  that,  on 
the  basis  of  Rokkan’s  analysis,  were  the  first  to 
achieve  a  stable  democracy,  having  concluded  the 
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state  formation  and  nation-building  phases  prior 
to  the  French  Revolution.  These  are  the  two  oldest 
and  most  powerful  Protestant  monarchies  of 
Northern  Europe  (England  and  Sweden),  and  the 
consociational  democracies  are  situated  at  the 
source  and  the  estuary  of  the  Rhine  in  Switzerland 
and  the  Netherlands,  respectively.  More  generally, 
the  West  European  nations  reveal  a  greater  pres¬ 
ence  of  postmaterialist  values  and  more  democratic 
effectiveness  than  the  East  European  countries. 
Rokkan’s  schema,  which  in  the  1970s  aimed  to 
explain  the  historical  process  of  democratization  in 
Europe,  is  therefore  a  good  predictor  of  the  results 
obtained  by  Inglehart  and  Welzel  in  sample  surveys 
conducted  over  the  past  2  decades  to  measure  the 
current  content  of  political  culture  and  recent 
changes.  This  convergence  of  results  suggests  that 
the  two  research  strategies,  though  they  differ 
greatly,  are  not  alternatives  but  are  supplementary 
with  regard  to  the  characteristics  required  of  a 
political  culture  in  terms  of  democratic  effective¬ 
ness.  There  is  a  continuity  between  early  and  suc¬ 
cessful  state  formation  and  nation  building  and  the 
rise  of  postmaterialist  and  self-expression  values. 

Such  a  convergence  is  by  no  means  obvious, 
because  the  process  of  nation  building  involves  the 
spread  of  altruistic  values  and  the  subordination  of 
individual  interests  to  those  of  the  community — 
which  runs  counter  to  the  emphasis  placed  by 
postmaterialist  values  on  the  primacy  of  individual 
liberty  and  self-expression. 

Analytically,  the  opposition  between  the  cul¬ 
tural  outcomes  of  successful  nation  building  and 
the  syndrome  of  self-expression  values  becomes 
evident  if  one  bears  in  mind  the  process  that  led  to 
the  expansion  of  citizenship  rights,  as  charted  by 
Thomas  Marshall.  According  to  a  cumulative 
schema,  citizens’  rights  first  saw  the  light  of  day  in 
the  18th  century  with  the  establishment  of  the  rule 
of  law,  whereby  the  civil  rights  of  all  citizens  were 
recognized  and  guaranteed  by  impartial  courts. 
The  following  century  saw  the  development  of 
political  rights,  quintessentially  symbolized  by  the 
increasing  role  of  independent  parliaments.  The 
20th  century  was  marked  by  the  introduction  of  a 
new  family  of  rights:  social  rights,  with  the  estab¬ 
lishment  of  compulsory  education,  public  health, 
and  so  on. 

This  threefold  typology  corresponds,  signifi¬ 
cantly  but  partially,  to  the  four  stages  of  political 


development  defined  by  Rokkan,  as  shown  in 
Table  1. 

The  succession  of  the  three  types  of  rights  cor¬ 
responds  to  three  of  Rokkan’s  four  phases,  with 
the  lower  row  corresponding  to  the  institutional 
structure  of  Western  European  democracies  after 
World  War  II:  a  generous  welfare  state  that  guar¬ 
antees  social  rights  and  satisfies  the  material  and 
security  needs  of  its  citizens.  There  is  an  empty  box 
alongside  the  nation-building  phase,  for  which 
there  is  no  corresponding  family  of  rights.  In  fact, 
nation  building  does  not  presuppose  the  recogni¬ 
tion  of  citizenship  rights.  On  the  contrary,  indi¬ 
viduals  have  duties  and  obligations  toward  the 
nation.  There  is,  then,  a  shift  from  the  preeminence 
of  individuals  to  that  of  the  community  as  a  whole. 
The  establishment  of  individual  rights  is  a  funda¬ 
mental  feature  of  European  political  identity.  The 
concept  of  the  nation  relates  not  so  much  to  an 
individualistic  as  to  a  holistic  perspective,  accord¬ 
ing  to  which  the  whole  is  greater  than  the  sum  of 
its  parts,  as  is  well  expressed  by  the  value  of  frater- 
nite  (fraternity)  in  the  motto  of  the  French 
Revolution. 

The  United  States  does  not  lend  itself  to  com¬ 
parison  with  Western  Europe.  But  even  in  the  case 
of  American  political  culture,  scholars  have  found 
a  similar  equilibrium  between  opposing  values: 
individual  freedom  on  the  one  hand  and  commu¬ 
nitarian  bonds,  loyalty,  and  commitment  to  insti¬ 
tutions  on  the  other.  As  Robert  Bellah  has  observed, 
Americans  consider  individualism  to  be  the  pre¬ 
eminent  and  distinguishing  value  of  their  culture. 
However,  this  individualism  is  counterbalanced  by 


Table  I  European  Political  Stages  and  Citizenship 


Rights 

Stages  of  European 

Political  Development 

Citizenship  Rights 

(Stein  Rokkan) 

(Thomas  Marshall) 

State  formation  (rule 

Civil  rights  (courts) 

of  law) 

Nation  building 

— 

Democratization 

Political  rights 
(parliaments) 

Welfare  state 

Social  rights  (schools) 

Source:  Cartocci,  R.  (2007).  Mappe  del  tesoro  [Treasure 
maps]  (p.  121).  Bologna,  Italy:  Mulino. 
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two  opposing  moral  orientations:  civic  republican¬ 
ism  and  the  Biblical  tradition.  Both  value  sets 
relate  to  the  holistic  nature  of  the  community — 
respect  for  the  dignity  of  all  human  beings  and  an 
invocation  of  the  moral  goals  that  guided  the 
Founding  Fathers,  which  place  on  each  citizen 
responsibility  for  the  common  good. 

In  both  Western  Europe  and  the  United  States, 
freedom  and  individual  rights  are  accompanied  by 
solidarity  values  and  subordination  to  the  common 
good.  A  political  culture  in  which  just  one  of  these 
components  prevails  becomes  a  risk  for  democratic 
stability.  An  effective  democracy  needs  a  political 
culture  with  a  balance  between  postmaterialist  val¬ 
ues,  which  stress  the  participation,  tolerance,  and 
self-expression  of  individuals,  and  the  values  of 
successful  nation  building,  such  as  loyalty  toward 
institutions,  considered  an  effective  means  of  guar¬ 
anteeing  the  safety  and  well-being  of  citizens. 

Participant  postmaterialist  citizens  stimulate  the 
renewal  of  democracies  and  prompt  them  to  find 
effective  institutional  solutions  to  deal  with  new 
forms  of  inequality  relating  to  gender,  ethnic  ori¬ 
gin,  sexual  orientation,  and  so  on.  However,  these 
value  orientations  are  associated  with  the  privi¬ 
leged  sectors  of  society,  especially  educated  young 
people.  Seymour  Lipset  and  Jason  Lakin  have 
recently  observed  that  an  excessive  number  of  par¬ 
ticipant  citizens  create  the  risk  of  provoking  a 
dangerous  overload  of  political  demands,  thereby 
generating  zero-sum  conflicts.  By  contrast,  schol¬ 
ars  such  as  Robert  Putnam,  Theda  Skocpol,  and 
Russell  Dalton  have  stressed  the  decline  of  civic 
engagement  in  Western  democracies,  even  in 
younger  cohorts. 

Democracies  have  to  find  a  way  to  adapt  to  the 
new  attitudes  and  behavior  of  Western  citizens, 
who  are  more  critical  and  less  confident  than 
before.  Democracies  owe  their  legitimacy  to  their 
ability  to  simultaneously  guarantee  both  the  self- 
fulfilment  needs  of  the  more  educated,  secularized, 
and  postmaterialist  sectors  of  society  and  the 
safety  and  physical  needs  of  the  majority  of  citi¬ 
zens,  who  share  the  more  traditional  and  material¬ 
ist  values  and  are  unwilling  to  engage  in  stronger 
forms  of  political  commitment. 

Roberto  Cartocci 
University  of  Bologna 
Bologna,  Italy 


See  also  Democratization;  Nation  Building;  Path 
Dependence;  Pillarization;  Political  Socialization; 

Public  Opinion;  Social  Capital;  State  Formation; 
Survey  Research;  Values 

Further  Readings 

Almond,  G.,  &  Verba,  S.  (1963).  The  civic  culture: 
Political  attitudes  and  democracy  in  five  nations. 
Princeton,  NJ:  Princeton  University  Press. 

Almond,  G.,  &  Verba,  S.  (Eds.).  (1980).  The  civic  culture 
revisited.  Boston:  Little,  Brown. 

Banfield,  E.  (1958).  The  moral  bases  of  a  backward 
society.  Glencoe,  IL:  Free  Press. 

Bellah,  R.,  Madsen,  R.,  Sullivan,  W.  M.,  Swidler,  A.,  & 
Tipton,  S.  M.  (1985).  Habits  of  the  heart: 
Individualism  and  commitment  in  Americal  life. 
Berkeley:  University  of  California  Press. 

Dalton,  R.  (2004).  Democratic  challenges,  democratic 
choices:  The  erosion  of  political  support  in  advanced 
industrial  democracies.  Oxford,  UK:  Oxford 
University  Press. 

Inglehart,  R.  (1990).  Culture  shift  in  advanced  industrial 
society.  Princeton,  NJ:  Princeton  University  Press. 
Inglehart,  R.,  &  Welzel,  C.  (2005).  Modernization, 
cultural  change,  and  democracy.  Cambridge,  NJ: 
Cambridge  University  Press. 

Lipset,  S.,  &  Lakin,  J.  (2004).  The  democratic  century. 

Norman:  University  of  Oklahoma  Press. 

Putnam,  R.  (with  Leonardi,  R.,  &  Nanetti,  R.).  (1993). 
Making  democracy  work:  Civic  traditions  in  modern 
Italy.  Princeton,  NJ:  Princeton  University  Press. 
Rokkan,  S.  (1999).  State  formation,  nation-building,  and 
mass  politics  in  Europe:  The  theory  of  Stem  Rokkan. 
Oxford,  UK:  Oxford  University  Press. 

Schein,  E.  (1988).  Organizational  culture  (Working  Paper 
No.  2088).  Cambridge:  MIT,  Sloan  School  of 
Management. 

Skocpol,  T.,  &  Fiorina,  M.  P.  (Eds.).  (2000).  Civic 
engagement  in  American  democracy.  Washington, 

DC:  Brookings  Institution  Press. 


Political  Economy 


Political  economy  can  be  understood  in  numerous 
ways,  depending  on  the  discipline  of  study. 
Economists  and,  more  recently,  some  political  scien¬ 
tists  define  political  economy  as  the  study  of  politics 
using  economics.  Yet  within  political  science,  it  is 


1980  Political  Economy 


more  commonly  understood  as  the  study  of  the 
relationship  between  states  and  markets.  The  two 
approaches  are  not  conflicting  but  can  be  quite 
dissimilar.  Here,  political  economy  is  understood 
as  the  study  of  the  interdependency  of  economics 
and  politics,  as  this  is  the  approach  used  by  the 
majority  of  political  scientists.  This  entry  empha¬ 
sizes  the  interdependency  of  politics  and  econom¬ 
ics  and  its  impact  on  several  aspects  of  political 
performance. 

How  economics  and  politics  determine  each 
other  has  been  a  driving  question  in  political  econ¬ 
omy  for  centuries.  According  to  John  Roemer, 
Adam  Smith  was  well  aware  of  the  mutual  interde¬ 
pendency  of  economics  and  politics.  In  The  Wealth 
of  Nations,  he  discusses  how  politically  determined 
tax  mixes  (a  range  of  sources  of  income,  with  a 
mix  of  part-time  jobs,  self-employment,  and  full¬ 
time  jobs)  on  wages  and  goods  have  diverse  eco¬ 
nomic  effects.  The  political  causes  of  tax  mixes, 
and  of  economic  and  social  welfare  policies,  in 
general,  are  still  at  the  heart  of  the  study  of  politi¬ 
cal  economy,  especially  as  the  provision  of  public 
services  and  goods  has  reached  unprecedented  lev¬ 
els.  Scholars  within  the  field  of  political  economy 
study  the  impact  of  political  competition  on  eco¬ 
nomic  outcomes  and  in  turn  on  how  underlying 
economic  conditions,  such  as  income  and  skill 
distribution  or  the  generosity  of  social  insurance, 
affect  political  behavior  and  political  competition. 
Common  questions  raised  within  the  field  of  polit¬ 
ical  economy  are  as  follows:  Why  do  taxes,  state 
pensions,  or  levels  of  public  debt  vary  between 
countries?  Do  political  parties  deliver  distinguish¬ 
able  economic  policies?  Under  what  conditions  do 
politicians  raise  taxes?  In  turn,  how  do  economic 
policies  and  the  underlying  economic  relations 
affect  preferences  for  redistribution  and  institu¬ 
tional  change,  such  as  electoral  system  reforms? 

Theoretical  models  and  empirical  research  have 
come  a  long  way  in  answering  these  questions. 
However,  controversies  still  exist,  primarily  due  to 
the  lack  of  extensive  data  outside  the  relatively 
small  group  of  the  rich  Organisation  for  Economic 
Co-operation  and  Development  (OECD)  coun¬ 
tries.  In  the  rest  of  this  entry,  major  theoretical  and 
empirical  contributions  in  the  field  of  political 
economy  are  reviewed.  The  first  part  looks  at  how 
the  economy  shapes  policy  preferences.  The  sec¬ 
ond  part  is  an  examination  of  how  the  institutional 


characteristics  of  polities  (their  electoral  and  party 
systems)  aggregate  preferences  into  political  action. 
Third,  distribution  and  redistribution  as  direct 
outcomes  of  political  competition  are  described. 
Finally,  in  the  last  part,  causation  from  economics 
to  politics  is  reversed  to  explain  the  origins  of  elec¬ 
toral  institutions  on  the  grounds  of  underlying 
economic  relations. 

Economic  Preferences 

Voters’  or  political  parties’  policy  preferences  are 
the  starting  point  of  formal  political  economy 
models.  Relying  on  the  assumption  of  self-interested 
actors  who  wish  to  maximize  the  utility  they  get 
from  consumption  and  leisure,  political  econo¬ 
mists  commonly  assume  that  voters  with  higher 
incomes  prefer  less  redistribution  than  citizens 
with  lower  incomes.  Similarly,  personal  income 
endowments  determine  voters’  preferences  over 
macroeconomic  outcomes  such  as  inflation  and 
unemployment.  Kenneth  Scheve  has  shown  that 
those  who  have  more  savings  prefer  lower  infla¬ 
tion,  even  at  the  cost  of  higher  unemployment,  in 
contrast  to  those  who  do  not  have  savings. 

Thus,  the  distribution  of  income  in  a  society  is 
a  determining  factor  of  fiscal,  monetary,  and  tax 
policies.  For  example,  the  Meltzer-Richard  model 
predicts  that  in  societies  with  high  levels  of  income 
inequality,  taxes  and,  thus,  redistribution  should 
be  higher  than  in  societies  with  low-income 
inequality.  Karl  Marx  and  Friedrich  Engels  relied 
on  similar  assumptions  since  they  believed  that 
once  workers  were  given  the  right  to  vote,  dra¬ 
matic  redistribution  would  be  achieved. 

Yet  reality  is  more  complex  than  that.  Among 
the  economically  developed  countries,  income 
inequality  is  highest  in  the  United  States  and  redis¬ 
tribution  is  among  the  lowest.  Why  is  there  not 
more  pressure  from  voters  to  redistribute  more? 
To  start  with,  it  is  possible  that  some  societies 
value  economic  equality  more  than  other  societies. 
Indeed,  Alberto  Alesina,  Edward  Glaeser,  and 
Bruce  Sacerdote  show  that  there  is  a  strong  corre¬ 
lation  between  a  nation’s  belief  that  luck  deter¬ 
mines  income  and  the  levels  of  social  spending  as 
a  percentage  of  the  gross  domestic  product  (GDP). 
In  countries  where  people  believe  that  the  poor 
have  been  unlucky,  social  spending  (and  thus 
taxes)  is  higher.  Thus,  assuming  that  voters  simply 
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want  to  maximize  their  utility  from  consumption 
is  too  simplistic. 

Similarly  simplistic  is  the  assumption  that  one’s 
vote  is  solely  driven  by  financial  position.  Often 
voters  care  about  other  issues  than  redistribution, 
such  as  abortion,  the  environment,  the  role  of 
women  in  the  economy,  and  so  no.  If  this  is  the 
case,  then  one’s  income  does  not  determine  one’s 
vote,  and  in  fact,  it  is  possible  that  redistribution  is 
not  placed  highly  on  the  policy  agenda  of  compet¬ 
ing  political  parties.  Another  complication  is  that 
in  unequal  societies,  rich  voters  are  likely  to  have 
more  political  influence  on  political  parties  than 
poorer  voters.  For  example,  it  is  established  that 
voter  turnout  is  higher  among  high-income  earners 
than  among  low-income  earners. 

Things  get  even  more  complicated  when  we 
take  into  account  the  underlying  economic  rela¬ 
tions  in  a  polity.  Production  systems  can  shape 
economic  preferences  independently  of  the  distri¬ 
bution  of  income.  Not  all  holders  of  capital  prefer 
low  redistribution,  and  not  all  workers  prefer 
more  redistribution.  Redistribution  preferences, 
and  particularly  preferences  for  social  protection, 
are  rather  formed  by  the  type  of  economic  produc¬ 
tion.  The  varieties  of  capitalism  approaches  have 
moved  the  discipline  forward  by  showing  that 
preference  formation  among  economic  actors  and 
citizens  is  not  universal  across  different  economic 
systems.  The  type  of  the  economy  and,  particu¬ 
larly,  the  mode  of  production  in  an  economy 
largely  determine  policy  preferences. 

In  economies  where  workers  are  required  to 
invest  in  firm-  or  industry-specific  skills,  employers 
have  incentives  to  lobby  for  higher  social  protection 
and  thus  side  with  the  trade  unions.  In  contrast,  in 
economies  where  workers  have  general  skills  and 
employers  do  not  invest  in  their  training,  employers 
tend  to  lobby  for  lower  social  protection  than  the 
trade  unions.  More  generally,  it  has  been  shown 
that  distinguishing  between  general  and  special 
skills  helps  explain  variations  in  the  levels  and  type 
of  social  protection.  For  example,  Torben  Iversen 
and  David  Soskice  show  that  citizens  who  have 
specific  skills  favor  more  generous  unemployment 
insurance  than  do  those  who  have  general  skills,  as 
the  latter  can  more  easily  switch  and  find  jobs. 

To  recap,  the  discipline  has  made  tremendous 
progress  in  identifying  the  factors  that  shape  voters’ 
preferences.  Yet  this  is  only  the  beginning  of  putting 


together  all  the  constitutive  parts  of  a  political- 
economic  model.  One  still  needs  to  study  the  condi¬ 
tions  under  which  political  parties  are  responsive  to 
the  existing  distribution  of  preferences. 

Formal  Models  of  Political  Party 
Competition  in  Two-Party  Systems 

Do  citizens’  economic  preferences  affect  policy  out¬ 
comes?  According  to  a  view  of  democracy  where 
the  majority  rules,  economic  outcomes  should 
reflect  the  will  of  the  electoral  majority.  In  other 
words,  if  the  majority  of  voters  demands  lower 
taxes,  politicians  should  deliver  lower  taxes. 
However,  one  cannot  assume  this  straightforward 
association;  a  number  of  issues  need  to  be  clarified, 
and  most  important,  the  role  of  political  parties 
must  be  considered.  Assumptions  about  the  nature 
of  political  parties,  whether  they  represent  particu¬ 
lar  economic  classes,  interest  groups,  or  simply 
their  own  interests,  need  to  be  made.  Moreover, 
other  characteristics  of  the  political  party  system 
are  also  critical:  How  many  political  parties  com¬ 
pete  for  office?  Is  political  competition  centered 
only  on  the  question  of  taxes?  And  how  are  votes 
translated  into  parliamentary  seats?  Without  taking 
all  these  matters  into  account,  one  cannot  reliably 
predict  policy  outcomes  given  a  certain  distribution 
of  preferences  within  the  electorate. 

Formal  models  of  political  party  competition 
are  relatively  recent.  Erik  Lindahl  was  the  first  to 
show  that  the  supply  of  public  goods  is  determined 
by  the  relative  power  of  political  parties  in  a  polity 
where  two  political  parties  represent  two  classes  of 
citizens  with  heterogeneous  marginal  utilities  over 
a  public  good.  In  his  seminal  paper,  he  briefly  men¬ 
tions  the  possibility  of  partisan  policy  outcomes  as 
power  shifts  from  one  political  party  to  the  other. 
Nonetheless,  Lindahl’s  work  did  not  lead  to  fur¬ 
ther  development  of  bargaining  models  on  the 
distribution  of  public  goods  until  much  later. 

Instead,  the  most  well-known  formal  model  in 
political  science  is  the  median  voter  theorem.  The 
theorem,  popularized  by  Anthony  Downs  in  his 
Economic  Theory  of  Democracy,  predicts  that 
when  two  parties  compete  on  a  single  policy  dimen¬ 
sion,  they  will  both  converge  to  the  median  voter’s 
most  ideal  policy  point.  If  the  policy  under  question 
is  the  level  of  income  tax  rate,  then  both  parties  will 
propose  the  tax  rate  that  is  preferred  by  the  median 
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voter.  The  result  is  easily  explained:  Parties  are 
office  seeking  (which  means  that  they  are  primarily 
driven  by  their  objective  to  win  office  rather  than  by 
their  ideology)  and  thus  adopt  whichever  policy  will 
win  them  the  majority  of  votes.  When  voters’  pref¬ 
erences  are  normally  distributed,  the  only  reason¬ 
able  position  for  parties  to  move  to  is  the  median. 

The  median  voter  theorem  initiated  a  new  era  of 
research  in  political  economy.  For  example,  the 
Meltzer-Richard  model  of  redistribution  and  theo¬ 
ries  of  electoral  business  cycles  rely  on  the  median 
voter  theorem.  In  practice,  however,  parties  tend  to 
diverge  in  their  policy  choices  rather  than  converge. 
To  start  with,  it  is  rare  that  parties  will  propose 
similar  tax  rates  at  elections.  If  anything,  when 
elections  are  contested  primarily  on  economic 
issues,  social-democratic  parties  propose  higher 
taxes  for  the  delivery  of  better  public  services, 
while  parties  on  the  right  propose  lower  taxes  and 
smaller  governments.  Similarly,  public  opinion 
research  testifies  that  voters  identify  with  parties  on 
the  grounds  of  personal  characteristics,  such  as 
their  socioeconomic  status,  religion,  and  so  on.  For 
example,  those  with  savings  who  prefer  lower 
inflation  tend  to  self-identify  with  right-wing  par¬ 
ties,  while  those  who  prioritize  employment  over 
low  inflation  tend  to  identify  with  left-wing  parties. 

Donald  Wittman  proposed  a  model  of  party 
competition  where  parties  are  solely  policy  seek¬ 
ing:  They  compete  on  the  premise  of  distinguish¬ 
able  policies.  TTis  model  then  predicts  that  parties 
will  diverge  in  their  policies  as  long  as  there  is  some 
level  of  uncertainty  as  to  who  the  winner  in  the 
next  election  will  be.  While  Wittman  assumes  that 
parties  are  exogenous  (i.e.,  they  are  not  the  agents 
of  citizens),  his  model  has  been  the  main  alterna¬ 
tive  to  the  median  voter  theorem.  Empirically,  the 
literature  has  found  evidence  of  partisan  policy 
outcomes,  for  example,  with  respect  to  inflation 
and  social  spending,  but  these  findings  remain  con¬ 
tested.  It  has  been  shown  that  policies  are  often 
conditioned  by  other  economic  and  political  insti¬ 
tutions  as  well  as  by  the  underlying  economic  con¬ 
ditions,  both  domestically  and  abroad. 

Formal  Models  of  Political  Party 
Competition  in  Multiparty  Systems 

While  the  median  and  partisan  models  of  two- 
party  competition  discussed  above  have  made  the 


largest  impact  in  the  discipline  with  respect  to  party 
competition,  the  majority  of  democracies  are  multi¬ 
party  systems.  This  means  that  the  two  most  well- 
known  models  of  political  competition  are  not 
suitable  for  the  analysis  of  party  competition  in  the 
majority  of  cases.  More  recently,  new  models  of 
political  party  competition  have  been  developed  to 
include  three  political  parties  competing  at  elections. 
For  example,  David  Austin-Smith  and  Jeffrey  Banks 
developed  a  model  where  three  office-seeking  par¬ 
ties  compete  and  coalition  governments  are  formed 
after  elections.  Their  model  predicts  that  parties 
take  divergent  policy  positions  to  attract  votes,  and 
as  a  result,  governments  consisting  of  a  large  and  a 
small  party  deliver  partisan  policies.  Another  nota¬ 
ble  example  is  David  Baron’s  model  of  electoral 
competition  of  three  policy-seeking  parties.  Baron’s 
model  is  a  two-stage  model:  In  the  first  stage,  parties 
compete  at  elections,  and  in  the  second,  postelec- 
toral  stage,  they  bargain  over  policy.  The  coalition 
governments  that  form  as  a  result  of  this  two-stage 
process  deliver  ideologically  partisan  policies. 

Postelectoral  bargaining  in  parliamentary 
democracies  is  of  critical  significance  in  policy  out¬ 
comes  in  multiparty  governments.  Given  that  the 
majority  of  parliamentary  countries  do  not  form 
single-party  governments,  postelectoral  bargaining 
delivers  to  a  great  extent  the  government’s  future 
policy  program.  Models  of  government  formation 
are  prominent  in  the  field  of  political  economy. 
Not  only  do  they  address  the  important  theoretical 
and  empirical  question  of  which  government  we 
should  expect  to  form  after  elections  have  failed  to 
provide  a  clear  winner,  they  also  provide  predic¬ 
tions  on  the  policy  outcomes  that  one  should 
expect  in  a  given  government,  as  the  Austin-Smith 
and  Banks  and  Baron  models  illustrate. 

One  of  the  most  well-known  models  of  govern¬ 
ment  formation  in  multiparty  systems  is  the  Laver- 
Shepsle  model,  based  on  the  concept  of  the  core 
within  the  tradition  of  cooperative  game  theory. 
The  Laver-Shepsle  model  predicts  that  policies  will 
reflect  the  policy  preferences  of  the  ministers  who 
hold  the  specific  ministerial  departments  and  who 
represent  the  median  voter’s  preference  in  that 
specific  dimension.  In  other  words,  if  the  policy 
space  is  multidimensional,  the  party  that  repre¬ 
sents  the  median  voter  in  each  dimension  will  con¬ 
trol  the  relevant  ministry;  thus,  policy  will  be 
located  at  the  multidimensional  median. 
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The  Laver-Shepsle  model  was  a  breakthrough  in 
the  study  of  parliamentary  democracies  as  it  offered 
a  unified  framework  of  analysis  of  government  for¬ 
mation,  duration,  and  policy  implementation.  Yet 
it  has  its  limitations,  both  on  the  theoretical  front 
(since  it  relies  on  the  restrictive  concept  of  the  core, 
which  can  be  found  only  in  a  three-party,  two- 
dimensional  space)  and  on  the  empirical  front 
(since  the  model  assumes  that  ministers  and  parties 
do  not  negotiate  prior  to  or  during  the  govern¬ 
ment’s  life).  Alternative  models  of  government  for¬ 
mation  based  on  noncooperative  game  theory 
prove  that  when  parties  are  policy  and  office  seek¬ 
ing,  different  governments  can  form  as  a  result  of 
bargaining  over  policy  and  side  payments.  Yet  they 
do  not  make  any  specific  claims  about  the  portfolio 
allocation  process,  like  Laver  and  Shepsle  do.  As  a 
result,  most  of  our  knowledge  regarding  portfolio 
allocation  and  its  impact  on  policy  comes  from 
empirical  studies  conducted  by  scholars  who  study 
coalition  governments. 

The  Electoral  System:  A  Critical  Intervening 
Factor  Between  Preferences  and  Policies 

The  electoral  system  is  another  intervening  factor 
that  significantly  determines  the  distribution  of 
political  power  and  thus  indirectly  determines  eco¬ 
nomic  policies.  The  main  direct  effect  of  the  elec¬ 
toral  system  on  policy  is  via  its  effect  on  electoral 
competition  at  the  district  level.  In  single-member 
district  plurality  (SMDP)  electoral  systems,  elec¬ 
tions  are  won  over  marginal  districts  because  par¬ 
ties  do  not  have  to  worry  about  safe  districts.  This 
motivates  parties  to  target  voters  and  interest 
groups  in  these  districts.  In  contrast,  in  propor¬ 
tional  electoral  systems  where  electoral  districts 
are  large  and  can  be  as  few  as  one  (e.g.,  in  the 
Netherlands),  political  parties  have  incentives  to 
target  groups  of  voters  instead  of  districts.  As  a 
result,  in  SMDP  systems,  there  is  less  social  spend¬ 
ing  but  more  public  spending  in  the  form  of  tar¬ 
geted  goods,  such  as  roads  and  hospitals. 

The  effects  of  electoral  system  on  spending  go 
beyond  the  district  level.  Since,  as  Maurice  Duverger 
showed,  the  electoral  system  largely  determines  the 
number  of  parties  in  the  political  system,  the  elec¬ 
toral  system  has  an  indirect  effect  on  policy  prefer¬ 
ence  aggregation  via  the  number  of  parties  elected 
in  the  parliament  and  in  government.  In  more 


proportional  electoral  systems,  more  voices  are 
represented  in  the  government  via  multiparty 
governments  as  well  as  via  strong  parliamentary 
committees.  G.  Bingham  Powell  shows  that  the 
median  voter  is  better  represented  in  countries 
with  proportional  electoral  systems,  which  have 
multiparty  governments  and  give  a  voice  to  oppo¬ 
sition  parties  in  the  parliament. 

Another  important  empirical  finding  is  that  mul¬ 
tiparty  governments  are  larger  governments  (having 
higher  public  spending)  simply  because  they  repre¬ 
sent  more  social  groups  than  single-party  govern¬ 
ments.  For  this  reason,  some  researchers  have 
argued  that  multiparty  governments  are  less  eco¬ 
nomically  efficient  than  single-party  governments.  In 
multiparty  governments,  every  party  has  an  incen¬ 
tive  to  spend  on  its  own  voters  as  much  as  it  can 
since  everyone  draws  from  the  same  pool  of  money. 
Yet  other  researchers,  such  as  Peter  Katzenstein  or 
Arend  Lijphart,  have  argued  that  multiparty  govern¬ 
ments  can  more  successfully  undertake  unpopular 
economic  reforms  because  they  can  better  achieve 
political  and  societal  consensus. 

The  work  briefly  reviewed  in  the  preceding 
paragraphs  is  certainly  not  exhaustive  of  the  rich 
literature  on  the  role  of  formal  political  and  elec¬ 
toral  institutions  in  the  aggregation  of  interests 
and  in  policy  outcomes.  It  is  rather  indicative  of 
the  complexity  and  interdependence  of  the  pro¬ 
cesses  that  take  place  at  the  economic  and  political 
spheres.  The  following  section  of  this  entry  focuses 
on  how  economic  conditions  affect  the  choice  of 
formal  political  institutions. 

Types  of  Economy  and  the  Origins 
of  Formal  Political  Institutions 

One  of  the  growing  areas  of  research  investigates  the 
source  of  electoral  institutions.  Currently,  it  is 
believed  that  political  parties,  primarily  of  the  right, 
chose  the  electoral  institutions  that  would  best 
ensure  their  continued  power,  based  on  their  knowl¬ 
edge  of  the  effects  of  electoral  institutions  on  elec¬ 
toral  behavior  and  government  formation.  The  best 
known  argument,  made  by  Stein  Rokkan  and,  more 
recently,  by  Carles  Boix,  purports  that  when  the  rul¬ 
ing  right-wing  parties  were  united  and/or  confronted 
with  a  weak  opposition,  they  chose  to  keep  the  exist¬ 
ing  plurality  electoral  systems.  This  favored  single¬ 
party  governments.  On  the  contrary,  right-wing 
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parties  that  were  afraid  that  they  would  be  ousted 
out  of  power  due  to  growing  socialist  dominance 
chose  proportional  representation  (PR).  Under  PR 
systems,  the  Left  would  not  be  able  to  form  strong 
majority  single-party  governments,  and  thus  the 
Right  would  still  have  power  in  the  political  system. 
These  arguments  then  suggest  that  the  underlying 
social  cleavages  and  political  parties’  survival  strate¬ 
gies  determined  the  choice  of  electoral  institutions. 

More  recently,  alternative  theories  of  the  origins 
of  electoral  systems  have  been  put  forward.  Here, 
we  focus  on  a  theoretical  account  that  links  the 
underlying  production  and  labor  relations  with  the 
origins  of  electoral  systems.  According  to  this 
account,  advanced  by  Thomas  Cusack,  Torben 
Iversen,  and  David  Soskice,  the  ruling  right-wing 
parties  acted  as  agents  of  the  groups  they  repre¬ 
sented,  which  were  employers’  organizations.  In 
economies  where  employers’  organizations  had 
already  established  networks  of  cooperation  and 
coordination  with  trade  unions,  the  ruling  right- 
wing  parties  chose  PR  electoral  systems.  The  rea¬ 
son  was  that  since  PR  systems  encourage  coalition 
governments  and  consensus  building  between  the 
government  and  the  opposition,  both  employers 
and  trade  unions  would  be  guaranteed  representa¬ 
tion  in  the  policy-making  process.  Only  under  a 
PR  system  could  the  existing  cooperation  between 
the  holders  of  capital  and  workers  be  further  fos¬ 
tered.  Thus,  in  countries  that  had  established  coor¬ 
dinated  forms  of  capitalism  (corporatist  systems), 
the  ruling  parties  chose  PR  electoral  systems.  In 
contrast,  where  unions  and  employers  did  not 
cooperate,  the  ruling  right-wing  parties  chose  to 
retain  the  existing  plurality  electoral  systems. 

Other  and  Future  Research 
in  Political  Economy 

The  debate  on  the  origins  of  formal  political  institu¬ 
tions  is  likely  to  continue  in  the  future.  In  the  mean¬ 
time,  other  important  questions  still  need  to  be 
addressed  within  the  field  of  political  economy. 
Questions  of  the  democratic  legitimacy  of  economic 
and  political  institutions  that  make  up  our  contem¬ 
porary  expert  democracies  (such  as  central  banks 
and  various  regulatory  agencies)  have  not  been 
addressed  sufficiently.  Does  economic  efficiency 
legitimize  reducing  democratic  rights  to  vote  on  the 
economy?  This  question  is  becoming  particularly 


relevant  as  economic  globalization  empowers  trans¬ 
national  institutions  such  as  the  International 
Monetary  Fund  or  the  European  Central  Bank. 
Scholars  of  political  economy  will  have  to  seriously 
study  the  role  of  such  institutions  in  domestic  poli¬ 
tics  as  they  are  becoming  relevant  players  and  even 
“partners”  in  governments’  policy  decisions  in  areas 
such  as  taxation  and  social  welfare. 

If  the  conduct  of  economic  policy  is  changing 
thanks  to  economic  globalization,  do  voters’  prefer¬ 
ences  and  evaluations  change  as  well,  or  do  they 
remain  strongly  determined  by  their  local  and 
national  realities?  How  do  voters  evaluate  their 
leaders  when  they  implement  policies  that  have  been 
“imposed”  on  them?  Do  we  see  a  new  cosmopolitan 
versus  national  cleavage  in  electoral  politics?  Such 
questions  cannot  be  adequately  answered  until  the 
role  of  international  actors  is  integrated  into  the 
study  of  domestic  economic  decision  making. 

Scholars  who  work  primarily  within  the  field  of 
international  political  economy  have  looked  at 
how  domestic  politics  affect  foreign  economic 
policy  (e.g.,  Beth  Simmons)  or  the  international 
financial  architecture  (e.g.,  Barry  Eichengreen).  In 
addition,  important  work  has  been  done  by  schol¬ 
ars  who  work  in  the  intersection  of  international 
and  comparative  political  economy,  such  as  Bill 
Clark  or  Ronald  Rogowski,  who  study  how  the 
global  economy  constrains  national  economic 
policy  making.  Unfortunately,  there  is  no  space 
here  to  present  this  important  work.  The  challenge 
for  political  economy  in  the  21st  century  is  to 
devise  models  that  capture  the  tension  between 
global  economics  and  national  politics  and,  par¬ 
ticularly,  the  tension  between  global  market  forces 
and  domestic  political  competition. 
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Political  integration  refers  to  the  integration  of 
components  within  political  systems;  the  integra¬ 
tion  of  political  systems  with  economic,  social, 
and  other  human  systems;  and  the  political  pro¬ 
cesses  by  which  social,  economic,  and  political 
systems  become  integrated.  The  context  in  which 
the  term  political  integration  is  used  indicates 
which  of  these  three  dimensions  of  political  inte¬ 
gration  is  being  referenced.  A  country  may  have  a 
highly  integrated  political  system  and  yet  have 
little  control  over  its  economic,  social,  and  regional 
systems.  Dictatorships  often  have  tightly  con¬ 
trolled  political  elites  but  cannot  dominate  the 
economy  and  society.  Regions  made  up  of  several 
countries  engaged  in  political  processes  of  creating 
common  institutions  for  both  security  and  trade 
are  often  poorly  integrated  as  political  systems. 
The  Southern  Common  Market  of  four  countries 


of  Latin  America  (Mercado  Comun  del  Sur 
[Mercosur] :  Argentina,  Brazil,  Paraguay,  and 
Uruguay,  founded  in  1991)  is  in  the  process  of 
strengthening  by  electing  a  parliament  and  expand¬ 
ing  membership,  while  the  North  American  Free 
Trade  Association  of  three  countries  (NAFTA; 
Canada,  Mexico,  and  the  United  States,  formed  in 
1993)  is  stagnating  or  weakening. 

The  core  meaning  of  integration  is  connected¬ 
ness.  The  theoretical  context  is  systems  and  the 
integration  of  its  components.  Integration  is  a  gen¬ 
eral  characteristic  of  the  relationships  among  the 
components  of  all  systems:  physical,  living,  and 
social. 

Human  systems  have  dynamic  processes  of 
becoming  more  or  less  integrated,  and  that  hap¬ 
pens  not  only  through  political  processes  but  also 
through  social  and  economic  ones  accompanied  by 
patterns  of  stabilization  and  destabilization.  More 
than  80%  of  human  societies  identified  by  archae¬ 
ological  research  have  disintegrated.  Integration  is 
part  of  discussions  of  the  decline  and  fall  of 
empires,  states,  and  civilizations. 

Connectedness  of  components  refers  to  the 
strength  of  the  relationships  among  the  compo¬ 
nents,  the  inclusiveness  of  each  of  the  component’s 
relationships,  and  the  proportion  of  the  properties 
of  each  component  that  is  affected  by  those  rela¬ 
tionships.  At  one  extreme  are  systems  where  a 
change  in  any  characteristic  of  any  component  will 
change  all  characteristics  of  all  other  components 
with  near  certainty.  An  example  is  a  finely  tooled 
clock.  At  the  other  extreme  are  disconnected 
components  where  each  component  behaves  inde¬ 
pendently  of  the  others.  This  is  a  set  of  random 
elements.  In  the  first  case,  the  system  is  nearly 
“perfectly”  integrated,  and  in  the  second  case, 
there  is  a  random  collection  of  items,  not  a  system. 
The  integration  of  systems  along  these  three 
dimensions  can  be  assessed  as  a  probability  rang¬ 
ing  from  0,  or  randomness,  no  relationships  at  all, 
to  1,  a  totally  determined  system. 

A  20th-Century  Concept 

The  idea  of  political  integration  is  old,  but  its 
“modern”  conceptual  foundations  were  estab¬ 
lished  in  the  20th  century.  Before  then,  the  concept 
of  political  integration  referred  to  several  kinds  of 
strengthening  political  systems.  Empire  building 
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through  force  from  a  center  to  the  peripheries  was  a 
dominant  form  of  political  organization  for  around 
8,000  years.  Empires  remain  shadows  of  a  past 
since  the  collapse  of  the  Soviet  Union  in  1991. 
Versions  of  European  states  with  “parliaments” 
and  “bureaucracies”  pervade  the  world,  using 
threats,  rewards,  communications,  and  nationalism 
to  enhance  their  control.  Federations  are  as  old  as 
our  knowledge  about  political  entities  attempting  to 
expand  beyond  assemblies  of  families,  clans,  and 
villages.  They  persist  into  the  21st  century,  if  only  in 
a  symbolic  form  of  sharing  authority  between 
localities  and  a  central  government.  Some  of  the 
most  influential  countries  of  this  century  are  federa¬ 
tions — Russia,  Australia,  Canada,  the  United  States, 
and  so  on,  the  United  States  being  among  the  oldest. 

The  concept  of  political  systems  had  taken  over 
from  that  of  states  as  the  defining  subject  matter  of 
modern  political  science  by  the  1950s.  The  idea  of 
political  integration  referred  to  cities,  countries, 
transnational  regions,  and  “security”  communi¬ 
ties,  such  as  the  North  Atlantic  Treaty  Organization 
(NATO).  Political  systems  were  more  general  than 
states,  and  most  social  systems  had  political  sys¬ 
tems — families,  schools,  universities,  churches — 
that  were  marked  by  identifiable  hierarchies  that 
could  make  binding,  authoritative-collective,  deci¬ 
sions.  The  concept  of  a  political  system  raised  the 
question  of  the  level  and  the  dynamics  of  its  inte¬ 
gration.  This  shift  to  general  theories  of  political 
systems  was  set  back  in  the  1970s  by  efforts  “to 
bring  the  state  back  in”  political  science  rather 
than  seeing  the  state  as  one  kind  of  a  variety  of 
political  systems. 

Hierarchical  Systems  and  Democracy 

Political  integration  is  theoretically  challenging 
because  political  systems  are  hierarchical  and  can 
never  be  highly  integrated  systems  as  though  they 
were  finely  tooled  machines  or  highly  cohesive 
societies.  All  hierarchical  systems  cut  and  direct 
relations  with  the  structural  principle  of  subordi¬ 
nation  and  superordination,  at  the  minimum  the 
rulers  (fewer)  and  the  ruled  (more),  attenuating 
information  through  vertical  flows  and  controls 
through  intermediaries.  The  component  at  the  top 
of  the  hierarchy  has  access  to  all  other  components 
through  intermediaries,  while  those  at  the  bottom 
have  only  one  or  two  connections,  just  a  level 


above  them.  The  challenge  is  democracy,  in  which 
information  and  control  should  be  nonhierarchical 
but  in  which  reliance  on  administrative  hierarchies 
to  pursue  democratic  decisions  effectively  is  also 
required.  The  democratic  aspiration  of  “self- 
governing  societies,”  “participatory  democracy,” 
and  elections  of  governmental  officials  have  not 
successfully  addressed  the  issues  of  complexity  and 
efficiencies  of  scale  necessary  for  the  responsive¬ 
ness  of  institutions  of  governance  to  democratic 
political  participation. 

Political,  Economic,  and  Social  Integration 

Since  the  16th  century,  the  conceptual  distinction 
between  the  society,  the  polity,  and  the  economy 
has  advanced  to  near  permanence  in  the  social  sci¬ 
ences.  This  tripartite  separation  of  political  sys¬ 
tems  of  countries  is  the  definition  of  the  liberal  or 
liberal-democratic  state,  distinguished  from  totali¬ 
tarian  states  with  their  asserted  monolithic  control 
of  all  individual  and  groups  and  from  socialist 
states  with  government  ownership  and  control  of 
the  economy  but  not  the  society. 

Social  integration  is  the  oldest  of  the  modern 
concepts  of  integration  attributed  to  the  “father” 
of  modern  sociology,  Emile  Durkheim  ( 1858— 
1917).  He  distinguished  between  two  basic  types 
of  social  integration:  mechanical  and  organic  soli¬ 
darity.  The  first  is  based  on  similarities  in  outlook, 
belief,  and  characteristics,  exemplified  by  feudal, 
peasant  societies.  The  second  derives  from  a  mod¬ 
ern  society  with  division  of  labor  and  “functional 
interdependence”  among  groups,  classes,  and  indi¬ 
viduals.  In  both  cases,  social  integration  is  social 
cohesion.  The  relationships  among  the  compo¬ 
nents  of  a  society  in  which  each  is  dependent  on 
others  in  ways  that  become  more  complex  consti¬ 
tute  social  development. 

Economic  integration,  like  political  integration, 
does  not  appear  in  writings  on  economics  until  the 
1950s.  Unlike  political  and  social  integration,  it 
has  a  clear  operational  meaning  in  that  it  is  defined 
as  the  absence  of  barriers,  primarily  political,  to 
exchange  and  trade  and  is  tantamount  to  the 
world  becoming  a  single  economic  system  with  an 
unencumbered  open  market. 

The  term  political  integration  is  not  found  in 
writings  on  nationalism  or  state  formation  until  the 
middle  of  the  1950s.  Nationalism  was,  however, 
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one  of  the  major  forces  for  political  integration  in 
the  150  years  or  so  before  the  end  of  World  War 
II,  after  which  it  spread  throughout  the  processes 
of  decolonization  in  Africa  and  Asia.  It  was  also  a 
strong  force  for  disintegration  under  the  principle 
of  “self-determination  of  peoples”  directed  against 
the  polyethnic  empires,  the  Soviet  Union  being  the 
last  of  the  large  ones.  The  idea  of  political  integra¬ 
tion  was  that  it  would  be  the  dominant  process  for 
creating  larger,  perhaps  more  viable,  effective, 
and,  indeed,  more  peaceful  political  entities  out  of 
thousands  of  localities  and  feuding  language  and 
ethnic.  States  would  emerge  through  persuasion 
rather  than  conquest,  which  defined  empires  that 
provided  peace  and  prosperity  in  the  past. 

Political  integration  confronts  more  complex 
phenomena  than  either  social  cohesion  or  open 
economic  systems.  Political  systems  are  defined  by 
a  string  of  several  characteristics,  at  least  one  of 
which  must  be  present — the  principle  of  superordi¬ 
nation  and  subordination — in  a  modern  context: 
hierarchy  with  authority.  For  theoretical  reasons, 
additional  characteristics  may  include  legitimacy, 
collective  decision  capacity,  and  rules  of  with¬ 
drawal.  Nevertheless,  political  systems  must  have 
hierarchy,  and  hierarchies  have  limits  to  their 
degree  of  integration  by  the  very  presence  of  inter¬ 
mediaries.  As  political  systems  expand  or  are 
involved  in  social  and  economic  changes,  their 
level  of  integration  decreases.  In  addition,  as  the 
economy  and  society  become  more  integrated  and 
more  responsive  to  each  other,  the  political  system 
will  lose  its  relative  control  over  them.  That  is,  the 
appeal  of  the  short-term  maxim  for  political  lead¬ 
ers  to  “divide  and  rule.”  Changes  as  well  as  social 
and  economic  integration  diminish  the  core  logic 
of  hierarchical  control  of  a  political  system  to 
select  and  pursue  collective  goals. 

Political  Development  and 
the  European  Union 

One  of  the  two  main  projects  of  political  science  to 
promote  political  integration  was  the  political 
development  of  the  newly  independent  states  to 
establish  the  post-World  War  II  order  of  a  decolo¬ 
nized  world.  Political  development  became  the 
focus  for  inducing  change  to  establish  political 
systems  in  newly  independent  states  that  could 
command  and  control  processes  of  economic  and 


social  development.  Political  integration  was  taken 
as  one  of  the  conditions  of  political  development. 
The  instruments  of  political  integration  were  a 
common  political  culture,  political  institutions, 
and  “modern”  values.  These  instrumentalities 
were  applied  to  authoritarian,  military,  as  well  as 
democratic  systems.  Nation  building,  however, 
took  time  and  patterns  of  success  were  mixed, 
especially  in  parts  of  Africa  and  Asia.  Political 
development  lost  its  theoretical  prominence  in 
political  science  in  the  late  1960s  as  the  tensions  of 
the  Cold  War  intensified.  It  might  regain  that  posi¬ 
tion  as  a  challenge  conceived  as  the  democratic 
political  development  of  all  political  systems  and 
not  just  of  new  democracies. 

The  second  political  integration  project  was 
European  integration  to  create  a  common  secu¬ 
rity  zone  both  among  European  countries  and 
between  them  and  an  ascendant,  threatening 
Soviet  Union.  That  project  attracted  a  variety  of 
theories  about  political  integration,  one  of  the 
better  known  of  which  was  the  spillover  theory  of 
learning  cooperation  from  narrow  sectors,  such 
as  steel  production,  that  could  expand  to  other 
areas  and,  eventually,  to  common  political  insti¬ 
tutions.  The  European  Union,  although  its  trajec¬ 
tory  of  political  integration  may  have  peaked  in 
the  first  decade  of  the  21st  century,  stands  as  one 
of  the  triumphs  of  political  thinking  and  theory  in 
changing  the  world.  That  knowledge  is  engaged 
in  addressing  the  problems  of  global  political 
integration. 
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Political  Philosophy 


Political  philosophy  is  the  part  of  philosophy  deal¬ 
ing  with  politics  and  government,  that  unstable 
mix  of  war  and  foedus  (treaty),  conflictual  divi¬ 
sion,  and  authoritative  union.  As  “philosophy,”  it 
is  part  of  the  vita  contemplative! ,  which  is  pure 
thinking;  as  “political,”  it  has  to  do  with  vita 
activa,  which  is  action  and  praxis  in  a  world  that 
exists  with  its  own  rules  and  language  games 
before  any  philosophical  attempt  is  made  to  make 
that  world  the  embodiment  of  a  theory,  as  did 
Plato.  Philosophy,  like  science,  is  about  truth, 
whereas  politics  is  about  power  or,  in  a  more 
sophisticated  view,  the  constrained  use  of  social 
power  that  reintroduces  truth  into  the  possible 
constraints  alongside  rhetoric,  persuasion,  com¬ 
promise,  and  negotiation.  Vita  activa  may  be 
viewed  either  as  a  part  of  the  philosophical  activ¬ 
ity,  the  “theory  of  praxis ,”  until  philosophy  disap¬ 
pears  only  to  be  “realized”  in  politics,  itself  bound 
to  wither  away  in  a  completely  new  age,  as  in 
Marx’s  construction,  or,  more  commonly,  as  a 
necessary  topic  of  political  philosophy  (Michael 
Walzer,  2007).  It  has  also  been  held  that  political 
philosophy  is  concerned  with  the  core  concepts 
that  issue  from  both  the  human  condition  and  the 
cultural  models  whose  change  is  beyond  the  scope 
of  conscious  collective  choices,  whereas  political 
theory,  closer  to  vita  activa,  should  deal  with  what 
is  the  object,  at  least  partially,  of  choice  and  willful 
collective  action.  In  any  case,  the  “real  talk”  in 
politics  is  far  away  from  the  “ideal  speech”  in  phi¬ 
losophy  as  well  as  from  the  “crucial  experiment” 
in  science;  it  is  endless  and  never  stops  (Walzer, 
2007)  and,  in  fact,  even  concludes  with  an  “evalu¬ 
ation,”  unless  a  stronger  power,  whether  it  is  a 
dictator,  a  majority,  or  the  “judgment  of  history,” 
decides  to  put  an  end  to  it  for  a  time.  After  defining 


the  field,  this  entry  examines  how  political  philoso¬ 
phy  got  its  autonomy  and  its  professionalization 
and  then  describes  its  epistemological  background. 
Finally,  this  entry  deals  with  the  challenges  to  an 
universalist  political  philosophy  represented  by  the 
claims  of  identities  and  the  slow  emergence  of  non- 
Western  political  philosophies. 

The  Domains  of  Political  Philosophy 

However  important  the  tensions  between  the 
respective  requirements  of  vita  contemplativa  and 
vita  activa,  political  philosophy  is  now  thriving  in 
political  science  as  “thought,”  “theory,”  or  “phi¬ 
losophy.”  Textbooks,  collections,  encyclopedias, 
treatises,  and  essays  about  political  philosophy 
flourish,  suggesting  a  need  for  connecting  the 
knowledge  gained  through  political  science  to  a 
concern  for  a  better  life  and  human  emancipation. 
This  work  suggests,  too,  that  political  scientists 
were  definitely  convinced  that  the  scientific  study 
of  politics  is  far  from  providing  a  sufficient  and 
adequate  knowledge  of  politics  and  that  critical 
theorists,  and  especially  Jurgen  Habermas,  had 
been  right  at  least  in  stating  that  a  mere  cognitive 
interest  in  objective  “facts”  divorced  from  any 
concern  for  a  better  life  and  human  emancipation 
produced  a  deeply  flawed  “science.”  Political  phi¬ 
losophies,  which  are  historically  variable,  share  in 
the  common  endeavor  to  elaborate,  make  explicit, 
and  assess  the  taken-for-granted  conceptions  of  an 
age,  with  political  philosophy  being  viewed  by 
Michael  Oakeshott  as  an  “abridgment  of  a  tradi¬ 
tion.”  The  historicists  hold  that  philosophies  are 
never  above  the  political  battles  but  are  part  and 
parcel  of  them.  Admittedly,  such  a  view,  albeit 
dominant,  gives  rise  to  many  debates,  even  among 
historians.  It  may  also  be  claimed  that  a  philoso¬ 
pher  can  and  must  reject  historicism,  escape  from 
the  confines  of  a  historical  context,  and  interpret 
all  historical  realities  by  taking  as  a  starting  point 
the  rejection  of  violence  and  the  liberty  of  each  one 
within  the  confines  of  the  satisfaction  of  all.  The 
“good”  is  not  to  be  considered  as  a  social  con¬ 
struction  but  as  an  objective  reality,  thus  putting 
some  rigorous  order  in  the  messy  state  of  political 
opinions,  however  thoughtful  (Leo  Strauss,  1959; 
Eric  Weil,  1971).  In  any  case,  philosophy,  pertain¬ 
ing  to  episteme,  is  not  the  servant  of  the  doxa  of 
opinions,  endowed  with  the  status  of  objective 
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reality  through  the  intensive  use  of  scientific  sur¬ 
veys,  since  its  aim  is  to  convince  people  that  they 
can  and  must  change  their  opinion,  an  arduous 
and  sometimes  very  perilous  task  (Strauss,  1959). 
The  combat  of  liberal  political  philosophy  can  be 
taken  as  an  example,  including  Muslim  philoso¬ 
phy’s  criticism  of  the  widely  popular  use  of  the 
most  rigid  interpretations  of  Islamic  shariah  in 
political,  criminal,  and  family  matters.  Even  the 
Straussian  brand  of  Islamic  philosophy  goes 
through  the  same  predicament  (Charles  Butter- 
worth,  2002). 

These  are  some,  among  many,  troubling  prob¬ 
lems  confronting  contemporary  political  philoso¬ 
phy  in  its  quest  for  the  relations  between  the  realm 
of  well-grounded  principles  and  the  realms  of 
actual  institutions  and  behaviors.  Political  philoso¬ 
phy,  like  empirical  political  science,  deals  with 
what  John  Searle  names  “institutional  reality.”  Yet 
that  does  not  mean  that  institutional  reality 
depends  on  our  linguistic  fancies.  It  has,  as  Max 
Weber  put  it,  a  “cultural  arbitrary  nature,”  but 
that  means  only  that  other  political  arrangements 
are  possible  and  the  current  ones  may  become 
unreal,  not  that  language  is  simply  a  way  to  take 
our  beliefs,  intentions,  or  desires  and  broadcast 
them  aloud.  Once  the  ontological  categories  are 
created  by,  and  accepted  in,  the  language,  the  real¬ 
ity  they  show  becomes  analogous  to  the  noninsti- 
tutional  reality,  and  the  Fregean  distinction 
between  meaning  and  reference  applies.  That  is 
why  political  philosophy  cannot  be  the  ideal  con¬ 
struction  of  a  well-meaning  solitary  philosopher. 

Since  the  1970s,  political  philosophy  has  been 
considered  as  the  part  of  political  theory  that, 
being  neither  empirical  nor  formal  (“positive”),  is 
not  scientific.  Bhikhu  Parekh  (1968)  sorts  out  its 
three  domains:  (1)  political  ontology  (e.g.,  Aristot¬ 
le’s  polis,  Jean-Jacques  Rousseau’s  general  will, 
and  Oakeshott’s  modes  of  experience),  (2)  political 
epistemology  (political  knowledge’s  structure, 
instruments,  and  practical  utility),  and  (3)  political 
logic  (argument,  language,  and  principles  of  strat¬ 
egy).  It  is  a  valid  mode  of  knowledge  different 
from  scientific  knowledge  yet  not  contradictory  to 
it.  It  is  not  disputed  that  in  certain  cases  it  is  even 
possible  to  derive  “ought”  from  “is.”  Political  phi¬ 
losophy  treats  empirical  data  in  a  nonempirical 
way;  it  elicits  their  essential  structure  and  puts 
them  within  a  coherent  framework.  It  is  both  a 


model  of  and  a  model  for  reality,  the  main  goal  of 
which  is  more  logic  than  empiricism,  so  its  prem¬ 
ises  and  goals  are  first  of  all  evaluative  regardless 
of  the  standards  of  evaluation  used.  This  raises  the 
issue  of  justification  when  the  idea  of  foundation, 
rational,  or  other,  is  missing.  Admittedly,  this  view 
is  not  shared  by  those  influenced  by  the  neurosci¬ 
ences  and  neo-Darwinism,  who  hold  that  science 
does  not  need  philosophy  to  promote  normative 
and  praxeological  principles,  nor  by  those  to 
whom  politics  cannot  be  an  object  of  science  and 
thus  does  not  need  it.  It  is,  however,  still  dominant 
despite  the  early  assault  of  analytical  philosophy. 
The  hard  version  of  analytical  philosophy  used  to 
deem  political  philosophy  meaningless:  If  philoso¬ 
phy  is  a  second-order  activity — that  is,  neither 
empirical  nor  evaluative  but  aiming  exclusively  at 
testing  the  logical  coherence  of  the  meaning  of 
concepts  and  statements — it  ensues  that  political 
principles,  precisely  because  they  are  evaluative — 
that  is,  neither  true  nor  false — cannot  be  philo¬ 
sophically  justified.  Political  philosophy  must  be 
confined  to  what  Felix  Oppenheim  calls  the 
“reconstruction”  of  concepts  and  must  get  rid  of 
illogical  assumptions  such  as  the  ideas  that  ethics 
has  no  relation  to  facts,  that  science  has  no  relation 
to  values,  and  that  it  is  contradictory  to  claim  that 
ethics  involves  both  value  judgments  and  scientific 
theory.  Yet  since  John  Rawls’s  integration  of  a 
normative  perspective  in  his  analysis  of  justice, 
analytical  philosophers  have  accepted  political 
philosophers  into  their  ranks,  and  conversely, 
empirical  theorists  are  no  longer  declaring  political 
philosophy  “dead”  for  lack  of  empirical  content, 
as  was  suggested  in  the  1950s  by  David  Easton, 
Peter  Easlett,  and  Robert  Dahl. 

Philosophy  was  long  considered  to  be  synony¬ 
mous  with  true  science.  Fikewise,  political  phi¬ 
losophy  was  the  fount  of  theoretical  political 
knowledge  and  thus  of  political  science.  In  the 
ancient  and  medieval  conception  of  “practical 
philosophy,”  this  knowledge  was  also  practical 
since  it  aimed  at  the  synthesis  of  philosophical 
principles  and  empirical  observations,  distinct 
from  both  the  analysis  of  processes  as  “objects” 
and  the  abstract  deduction  more  geometrico  from 
first  principles  taken  as  the  foundation  of  reality. 
Politics  was  considered  basically  as  a  process  of 
government  to  be  both  empirically  known  and 
normatively  assessed  in  order  to  improve  through 
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education  the  rulers’  moral  virtues  and  ability  to 
perform  their  task  and  the  contenders’  moral  and 
technical  bases  to  contest  the  incumbents’  posi¬ 
tions.  Political  philosophy  was  then  the  perfor¬ 
mance  of  that  endeavor,  whether  or  not  the 
members  of  a  polity,  be  it  a  tribe,  a  city,  a  kingdom, 
or  an  empire,  used  the  term  politics  or  thought  of 
its  rules  as  a  separate  domain,  distinct  from  religious 
or  ethnic  rules.  By  the  same  token,  political  phi¬ 
losophy  need  not  have  been  named  as  such  since 
the  term  philosophy  points  to  the  rational  quest 
for  wisdom  and  has  to  be  ignored  or  demeaned  in 
societies  ruled  by  the  belief  in  a  natural  order  of 
things  or  a  supreme  knowledge  submitting  reason 
to  the  revelation  of  a  god-given  law.  Political 
thought  was  never  absent  from  the  world  of  clas¬ 
sical  Islam,  yet  falsafa  (Islamic  thought  founded 
on  interpretations  of  Aristotelianism  and  Neo¬ 
platonism)  was  always  less  developed  and  author¬ 
itative  than  sunna  (the  knowledge  derived  from 
the  study  of  the  Holy  Koran),  despite  the  great 
names  of  Averroes  (Ibn  Rushd)  or  Al-Farabi  being 
associated  with  it.  In  the  Catholic  Church’s  doc¬ 
trine,  philosophy  was  deemed  “the  servant  of 
theology,”  and  though  Rene  Descartes  in  a  very 
Aristotelian  fashion  put  philosophy  at  the  root  of 
all  kinds  of  knowledge,  it  was  because  it  still 
appeared  possible  to  marry  a  specific  philosophy 
to  Christian  theology,  despite  Blaise  Pascal’s 
admonitions  against  philosophies.  Admittedly,  a 
problem  had  to  arise  if  religions  were  torn  apart 
by  religious  ( fitna  in  Islamic  parlance)  and  not 
merely  by  economic,  ethnic,  and  dynastic  deadly 
conflicts,  together  with  the  symmetric  problem  of 
the  relations  between  peoples  of  different  religions 
that  claim  to  be  universal  and  aim  for  the  conver¬ 
sion  of  nonbelievers  to  the  religion  reigning  in  a 
definite  area.  It  was  thus  necessary  to  coexist  and 
haggle  with  the  “significant  others”  since  they 
could  not  (and  for  certain  theologians,  such  as  the 
Italian  Jesuits  in  China,  should  not)  be  converted 
right  away.  In  those  cases,  a  philosophy  of  some 
sort,  as  well  as  a  corpus  of  laws  (the  Roman  jus 
gentium),  was  needed  to  justify  different  practices, 
from  war  to  peaceful  cooperation,  on  a  basis 
sounder  than  the  mere  empirical  fact  that,  until 
the  final  conversion  of  the  whole  world  to  the  true 
faith,  a  political  compromise  had  to  be  passed 
between  empires  even  though  they  were  waging 
war  on  one  another.  Usually,  such  a  philosophy 


took  the  form  of  some  natural  law  common  to 
different  religions,  a  view  still  largely  held  today, 
when  “religions”  are  complemented  or  replaced 
by  “cultures”  and  “civilizations.” 

Contemporary  Political  Philosophy 
and  Its  Professionalization 

The  fundamental  change  occurred  as  the  outcome 
of  two  processes:  the  autonomization  of  political 
reason,  contemporary  to  the  European  religious 
wars,  and  the  autonomization  of  philosophy,  fol¬ 
lowing  the  evolution  of  the  concept  of  science. 

In  the  realm  of  government,  after  the  age  of 
god-kings  in  large  units  came  the  age  of  an  onto¬ 
logical  vision,  including  the  notion  of  a  basic  ten¬ 
sion  between  the  transcendental  and  mundane 
orders  articulated  by  elites  that  carried  models  of 
a  cultural  order  centered  on  the  belief  in  the  cre¬ 
ation  of  the  world  according  to  some  transcenden¬ 
tal  vision  and  command  (Shmuel  Eisenstadt, 
1986).  Hence,  the  “clerical”  idea  that  the  “sword,” 
albeit  responsible  for  restructuring  the  political 
order,  was  accountable  before  a  higher  authority, 
be  it  God  or  divine  law,  represented  by  the  “pen.” 
This  model  was  slowly  thrown  down  with  the 
emergence  of  the  sovereign  state,  which  had  to  be 
justified  and  legitimated  for  its  own  sake.  In 
Strauss’s  curious  and  creative  interpretation  of 
Thomas  Hobbes,  nature  and  God  went  radically 
beyond  the  reach  of  Reason’s  intelligence,  and  the 
only  “natural  law”  (the  “desire  of  liberty” — i.e., 
of  survival)  that  makes  the  world  of  the  “state  of 
nature”  unbearable  by  each  one  also  makes  the 
political  world  of  “civil  society”  bearable  by  all. 
This  led  to  the  de  jure  separation  of  the  legitimacy 
of  an  order  given  and  revealed  by  a  “comprehen¬ 
sive  moral  doctrine”  (in  Rawls’s  parlance)  and  the 
legitimacy  of  a  constructed  artificial  order  founded 
on  the  necessity  of  peaceful  coexistence  between 
individuals  and  groups  divided  by  an  irrepressible 
mutual  distrust  and  prone  to  resort  to  violence  to 
avoid  being  unjustly  subjugated  by  others.  The 
safety  of  physical  bodies  came  to  be  seen  as  more 
important  than  the  salvation  of  souls.  The  move 
was  spotted  by  Strauss  in  Nicolo  Machiavelli’s 
work.  Strauss  thought  that  such  a  sea  change  jus¬ 
tified  political  “ferociousness,”  whereas  Isaiah 
Berlin  found  it  morally  valid  as  a  part  of  civic 
humanism.  This  marked  the  beginning  of  a  trend 
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leading  to  the  assertion  that  the  two  principles  of 
legitimacy  (religious  and  political)  should  be  kept 
separated  as  a  consequence  of  the  necessary 
supremacy  of  the  state’s  sovereignty.  In  the  politi¬ 
cal  realm,  the  logic  of  sovereignty  had  to  prevail, 
and  the  sovereign’s  authority,  according  to 
Hobbes,  was  jure  divino  (divine  law),  whereas  the 
clerks’  authority  was  only  jure  civili  (civil  law). 

Be  it  the  mark  of  the  new  age  of  “self-assertion,” 
breaking  with  any  revealed  theology  or  founda¬ 
tional  philosophy,  or,  as  it  is  still  held  by  some,  the 
permanence  of  a  political  theology  masquerading 
under  a  new  guise  (Carl  Schmitt,  1988),  the  origi¬ 
nal  feature  of  the  new  ontology  was  not  first  secu¬ 
larization,  since  religion  did  not  immediately  lose 
its  grip  on  public  life  to  be  replaced  by  an  improb¬ 
able,  even  to  this  day,  “naked  public  space,”  but 
the  progressive  appearance  of  a  public  sphere  gov¬ 
erned  by  the  primacy  of  politics  and  a  public 
morality  within  the  limits  of  the  political  order. 
This  does  not  mean  that  moral  requirements  disap¬ 
peared  altogether  from  the  depths  of  society,  since 
morality  is  a  universal  feature,  although  it  is 
doubtful  whether  a  substantive  “thick”  universal 
morality  does  exist,  or  that  the  sovereign  governed 
by  “reason  of  state”  is  devoid  of  any  morality.  Nor 
did  morality  depend  on  the  sovereign’s  self-interest 
since  norms  and  values  do  not  exist  just  for  effi¬ 
ciency  reasons. 

For  these  reasons,  the  ontology  of  the  axial  age 
faded  away  in  Western  political  spheres  and  only 
there.  It  is  still  more  alive  and  deep-rooted  than 
ever  in  Islamic  modernity,  where  a  host  of  self- 
styled  revolutionary  reformers  maintain  the  sub¬ 
jection  of  politics  to  justice — that  is,  a  God-given 
morality  under  which  the  individual  cannot  but  be 
genuinely  free  since  Islamic  law  and  human  nature 
are  in  perfect  harmony,  which  requires  a  type  of 
Islamic  politics  where  the  individual  could  be  both 
free  and  governed.  In  the  Western  world,  it  did  not 
wither  away  in  the  moral  sphere,  divine  law  being 
at  times  merely  replaced  by  human  rights,  global 
justice,  or  biodiversity,  all  things  supposed  to  be 
located  above  politics  yet  interacting  with  it  and 
imposing  duties  on  peoples  and  governments 
(which  may  induce  the  revival  of  the  old  and  per¬ 
manent  temptation  to  devise  a  monist  political 
theory  similar  to  the  Islamic  one).  When  Baron  de 
Montesquieu,  later  followed  by  Jean-Jacques 
Rousseau  and  Alexis  de  Tocqueville,  diagnosed  the 


emergence  of  a  “new  political  science,”  outlined  in 
L’Esprit  des  lois  (The  Spirit  of  the  Laws),  he  both 
rejected  the  direct  criticism  of  political  practice  for 
moral  reasons  and  advocated  a  morality  not  to  be 
left  to  the  decision  of  sentiments.  He  emphasized 
openly  the  need  for  a  political  philosophy  that  had 
been  slowly  developing  and  expanding  since  the 
16th  century,  namely,  the  search  for  the  relation¬ 
ship  between  power  politics  and  moral  character. 
Hence,  the  great  divide,  still  very  topical,  between 
the  “civil  Enlightenment”  of  Hobbes  and  Samuel 
Pufendorf,  where  political  authority  is  based  on  its 
desacralization  and  on  conventions  binding  physi¬ 
cal  bodies  governed  by  the  conatus  sese  preser- 
vandi  (self-preservation),  and  the  “metaphysical 
Enlightenment”  of  Gottfried  Wilhelm  Leibniz  and 
Immanuel  Kant,  “resacralizing”  the  political  realm 
by  basing  it  not  on  theology  but  on  a  philosophy 
of  the  “kingdom  of  ends”  (Ian  Hunter,  2001).  The 
first  Enlightenment  gives  preeminence  to  the  legal 
system  and  leaves  ethics  to  the  private  sphere, 
which  may  pave  the  way  for  moral  relativism  since 
many  legal  rules  disciplining  practices  need  not  be 
based  on  deep  philosophical  reasoning  but  only  on 
historical  conventions  and  political  convenience, 
defining  for  a  time  what  kind  of  behavior  cannot 
be  tolerated.  The  second  Enlightenment  expounds 
the  necessary  relation  between  the  legal  and  moral 
systems,  a  point  stressed  today  by  the  varieties  of 
republicanisms,  which  may  lead  to  the  determina¬ 
tion  of  a  single  moral  system  as  the  universally  best 
way,  a  subject  of  most  of  the  controversies  on  the 
relationship  between  multiculturalism,  universal- 
ism,  and  equality  in  liberal  democracy.  Therefore, 
political  philosophy  is  today  always  more  or  less  in 
tension  with  moral  philosophy  and  more  generally 
with  different  realms — aesthetic,  religious,  eco¬ 
nomic,  and  so  on. 

With  the  appearance  of  a  new  conception  of 
natural  science  and  its  later  extension  to  the  ide¬ 
ational,  linguistic,  material,  and  strategic  interac¬ 
tions  between  human  beings  in  specific  physical, 
ecological,  and  economic  contexts,  philosophy 
acquired  a  new  meaning. 

Science  has  always  been  viewed  as  a  cumulative 
way  to  reach  the  objective  truth  through  formal 
argument  and  regulated  observation  made  avail¬ 
able  to  everyone  (depending  on  the  political  con¬ 
texts)  rather  than  by  revelation,  persuasion,  or 
rhetoric,  but  it  was  still  conceived  as  a  way  to  be 
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the  mirror  of  reality,  a  “natural  philosophy”  con¬ 
firming  the  order  and  beauty  of  a  god-made 
world.  Yet  science  was  becoming  divorced  from 
this  concern,  and  in  the  realm  of  politics  it  came 
to  be  seen  as  no  longer  paving  the  way  for  the 
discovery  and  prediction  of  what  a  “good  polity” 
was  and  should  be  in  every  conceivable  case. 
Admittedly,  there  are  still  attempts  to  solve 
Montesquieu’s  problem  by  reducing  philosophy 
to  scientific  psychology,  as  advocated  by  Dahl  in 
the  1950s;  to  devise  a  theory  of  democracy  by 
bringing  empirical  and  formal  theories  closer;  to 
use  the  social  welfare  function  to  determine  the 
proper  domain  of  a  good  government;  to  elicit  the 
conditions  making  majority  rule  both  logical  and 
legitimate;  and  above  all  to  explain  why  democ¬ 
racy  is  the  only  good  regime  today,  a  statement 
that  for  the  moment  lacks  an  indisputable  scien¬ 
tific  foundation,  democracy  being  actually  less 
valued  than  the  security  brought  about  by  a  real 
and  effective  state  and  the  economic  prosperity 
brought  about  by  good  institutions.  However, 
several  psychological  studies  have  to  admit  that  it 
is  impossible  to  disentangle  the  logics  of  “appro¬ 
priation”  (avoiding  the  “gaffes”)  and  utility  max¬ 
imization,  and  it  is  necessary  to  rely  on  a  “sense  of 
fairness”  close  to  Aristotle’s  concept  of  justice, 
which  can  be  observed  through  surveys  but,  given 
its  variegated  meanings  in  specific  contexts,  can¬ 
not  be  conceptualized  without  resorting  to  a  phi¬ 
losophy  taking  into  account  the  “public  culture” 
of  a  given  society  (Rawls,  1971,  1993,  2001). 

For  that  reason,  most  of  these  attempts  are 
based  on  implicit  or  explicit  philosophical  founda¬ 
tions  despite  their  use  of  empirical  or  formal 
(rational  choice)  scientific  research.  Several  demo¬ 
cratic  empirical  theories  actually  depend  on  grand 
narratives  that  treat  history  as  a  whole  as  the 
grand  theories  of  big  science  treat  the  evolutionary 
nature  of  physical  realities,  and  they  do  so  by  mor¬ 
alizing  the  events.  It  is  because  political  science 
today  has  built-in  democratic  features — roughly, 
the  “bottom”  explains  the  “top,”  the  “micro” 
explains  the  “macro,”  and  the  individual  is  a  well- 
meaning  and  rational  being — that  it  comes  to  hold 
that  democracy  is  the  best  political  regime  and  not 
because  the  “value-free”  scientific  research  (a  con¬ 
tested  concept)  has  decisively  proved  that  democ¬ 
racy  is  the  one  best  way,  which  is  merely  confusing 
the  causes  and  the  consequences  of  the  statement. 


The  reverse — a  “neutral”  position  on  the  evalua¬ 
tion  of  democracy — seems  closer  to  the  facts 
because  both  the  “strong  program”  of  sociology  of 
science  and  liberalism  (as  both  an  intellectual 
climate  and  a  philosophy)  hold  that  empirical  polit¬ 
ical  science  can  flourish  only  when  there  is  a  plural¬ 
ism  that  does  not  treat  democracy  as  the  best  way. 

Besides,  with  the  waning  of  general  theories 
aiming  at  lawlike  generalizations  applicable  to  the 
whole  society  without  leaving  anything  outside 
their  explanatory  system  (organicism  and  holism — 
both  of  them  still  upheld  in  the  natural  sciences — 
and  functionalism,  systems  analysis,  and  histori- 
cism),  political  science  has  come  to  be  perceived  as 
the  systematic  study  of  institutions  and  social 
mechanisms  in  a  state  of  permanent  disequilibrium 
and  provisional  equilibrium,  viewed  as  a  game’s 
stable  outcome  achieved  through  institutions 
grown  by  rational  actors  in  a  context  not  finalized 
by  the  market.  The  mechanisms  and  models  are 
simplifications  of  reality,  not  because  they  are  not 
observable  in  real  life  but  because  they  are  always 
embedded  in  a  specific  combination  of  other 
mechanisms.  So,  between  the  mere  historical  nar¬ 
rative  and  the  nomological  ambition,  social  science 
aims  at  the  fine-grained  analysis  of  a  given  society 
and  regime  through  the  use  of  abstract  causal 
models  that  may  be  exported  from  a  given  context 
to  another  but  cannot  be  considered  as  “laws” 
since,  in  real  life,  two  mechanisms  may  logically 
coexist  while  being  contrary  to  each  other.  For 
example,  a  state  may  have  a  formal  commitment 
to  democracy,  yet  it  might  exhibit  authoritarian 
behaviors  in  its  operation.  Moreover,  mechanisms 
are  always  “local,”  that  is,  particular  and  limited 
by  borders,  and  so  causality  is  equally  local  and 
cannot  be  replicated  but  only  adapted  to  fit  empir¬ 
ical  observations.  Jon  Elster’s  two  laws  of  pseudo¬ 
science — “everything  is  a  little  bit  like  everything 
else”  and  “everything  is  causally  related  to  every¬ 
thing  else” — are  no  longer  taken  seriously.  Only 
functionalism  still  withstands  the  assault  of  ana¬ 
lytical  sociology,  at  least  in  so  far  as  it  is  carefully 
“situated.” 

This  move  toward  a  science  of  embedded  and 
combined  mechanisms  has  deprived  political  sci¬ 
ence  of  the  claim,  still  maintained  in  certain  depart¬ 
ments  of  economics,  of  being  able  to  forecast  the 
future  as  a  whole.  But  lest  the  science  of  politics 
become  the  art  of  entertainment  and  storytelling,  it 
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had  to  revive  the  plea  for  political  relevance,  long 
ago  taken  for  granted,  then  thriving  in  the  19th- 
century  “noble  science  of  politics”  and  later  openly 
voiced  as  a  reaction  to  behavioralism.  Not  that 
empirical  science  lacks  a  certain  political  relevance. 
For  example,  the  elucidation  of  mechanisms  and 
the  findings  of  behavioral  economics,  by  helping 
us  better  understand  the  causes  and  (often  unin¬ 
tended)  consequences  of  our  actions,  may  lead  to 
an  assessment  of  the  conditions  of  the  possibility 
of  political  actions  and  then  to  an  intelligent  prac¬ 
tical  action.  But  unless  those  prescriptions  are 
based  on  a  comprehensive  philosophical  view  of 
the  world  as  a  whole,  they  remain  limited  to  spe¬ 
cific  and  local  cases. 

So  the  revival  of  political  philosophy  was 
marked  by  “the  return  of  grand  theory”  (Quentin 
Skinner,  1990).  The  main  obstacle  to  philosophy 
as  a  rational  endeavor  was  long  erected  by  the 
natural  sciences  since,  as  Bernard  Williams  has 
noted,  the  pursuit  of  scientific  truth  alone  leads  to 
a  disconnection  between  reason,  on  the  one  hand, 
and  individual  and  collective  goals,  on  the  other. 
But  if  social  sciences  do  not  pursue  that  fantastic 
endeavor  or  pursue  it  in  a  more  modest  way,  phi¬ 
losophy  regains  some  ground.  If  the  good  polity 
made  by  good  citizens,  good  institutions,  and  good 
rulers  is  to  be  from  now  on  beyond  the  reach  of 
empirical  political  science  despite  the  valuable 
efforts  made  to  work  out  new  theories  of  govern¬ 
ment,  both  descriptive  and  prescriptive,  it  remains 
nevertheless  a  highly  relevant  endeavor  and  phi¬ 
losophy.  At  least  in  its  practical  and  prudential 
form,  it  has  to  thrive  again  to  restructure  political 
space  in  order  to  annul  for  a  moment  political  time 
by  encompassing  it  in  a  worldview.  So  political 
philosophy  has  revived  what  was  called  with  some 
contempt  the  “traditional  theory,”  combining 
perennial  questions  (What  should  man  be  for  the 
political  order  to  be  livable?  What  should  the 
political  order  be  for  man  to  live  a  “good  life”?) 
and  historical  contexts  (the  diagnosis  of  a  social 
disease,  outline  of  a  better  future  order,  or  reinser¬ 
tion  of  politics  into  a  wider  order).  Today,  political 
philosophy  figures  in  some  introductory  courses  of 
political  science  as  a  part  of  the  political  theory 
canon  identifying  the  main  domains  of  scientific 
research  of  which  it  is  a  necessary  input:  order, 
democracy,  community,  equality,  legitimacy,  jus¬ 
tice,  and  narrativity. 


Political  philosophy  stays  in  some  respect  aloof 
from  apocalyptic  or  revolutionary  utterances  since, 
contrary  to  the  opinion  of  Max  Weber,  the  differ¬ 
ent  quests  for  a  better  world  and  an  axiological 
rationality  are  not  doomed  to  lead  to  a  “war  of 
gods.”  Even  though  we  have  long  and  too  well 
known  that  the  collision  of  ultimate  values  may 
lead  to  murderous,  and  unfortunately  necessary 
and  even  “just,”  wars  (suffice  it  to  recall  the  real 
stake  of  World  War  II),  such  a  violent  outcome  is 
by  no  means  fatal  since  such  a  collision  is  also  a 
permanent  feature  of  a  pluralist  and  peaceful  view 
of  politics  (Berlin,  1962),  and  it  may  be  restricted 
and  kept  under  control  in  a  “cold  war.”  If  it  is  true 
that  several  worlds  are  indeed  conceivable  and 
equally  good,  it  is  impossible  to  have  the  best  of  all 
of  them,  and  political  philosophy  has  to  put  up 
with  this  imperfect  state  of  affairs  without  lapsing 
into  the  perspective  of  “the  extreme  case”  through 
the  subjection  of  politics  to  a  “methodological 
exceptionalism”  of  some  sort,  where  the  “excep¬ 
tion”  (the  crisis)  is  part  of  the  “rule”  (ordinary 
politics) — that  is,  a  “warlike”  framework  from 
which  all  the  other  political  states  should  be 
derived  (Schmitt,  1972,  1988). 

Philosophy  can  perform  that  task  in  two  entirely 
different  ways  that  have  long  been  the  watershed 
of  the  discipline.  The  first  one,  from  Plato  to  Karl 
Marx,  Friedrich  von  Hayek,  and  Robert  Nozick, 
considers  politics  as  something  awkward,  superfi¬ 
cial,  and  empty  since,  like  the  state  itself,  it  cannot 
be  reduced  by  an  iconography  of  order.  Therefore, 
the  genuine  reality  must  be  located  elsewhere,  in 
“society,”  biology,  religion,  law,  economy,  phi¬ 
losophy,  and  so  on.  Politics  is  then  a  social  misfor- 
mation  that  should,  and  could,  disappear  some 
day  from  this  world.  The  alternative,  dating  back 
to  Aristotle,  is  to  view  politics  with  all  its  “pat¬ 
terned  disorder”  as  a  constitutive  dimension  of  the 
human  condition  (Walzer,  2007),  the  frontiers  and 
values  of  which  may  widely  vary  and  be  expanded 
or  displaced,  as  evidenced  today  by  the  various 
feminist  political  philosophies  insisting,  for 
instance,  on  completing  “justice”  by  replacing  it 
with  “care”  as  the  foundation  of  a  fair  political 
community.  This  entails  substantial  changes  in  the 
assessment  of  past  political  philosophies  and  polit¬ 
ical  science  itself.  A  third  way  has  recently  appeared, 
stating  that  we  are  currently  witnessing  the  trans¬ 
formation  of  the  idea  of  political  community 
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inherited  from  European  history  and  its  Wesphalian 
turn,  which  may  lead  to  a  revival  of  cosmopolitics, 
with  notions  worked  out  by  empirical  as  well  as 
philosophical  theories  such  as  global  citizenship, 
global  civil  society,  transnational  governance, 
world  government,  global  state,  and  world-state, 
reminiscent  of  Dante’s  universal  “Empire” 
expounded  in  Monarchia  and  Alexandre  Koj  eve’s 
comparable  concept  of  a  universal  homogeneous 
state.  Although  the  idea  of  a  world-state  is  some¬ 
times  rejected  by  cosmopolitics  and  has  been 
strongly  criticized  by  Walzer  as  a  dystopian  top- 
down  tyranny  (the  opposite  of  the  Kantian  advo¬ 
cacy  of  a  league  of  small  republics),  those  ideas  are 
alive  since  many  of  them  stem  from  the  contention 
that  politics  should  not  be  viewed  as  the  taming  of 
the  social  war  and  the  containment  of  violent  con¬ 
flicts  opposing  virtual  enemies.  Rather,  striving  to 
address  problems  such  as  climate  change  and  pov¬ 
erty  by  addressing  the  factors  generating  them  is 
the  only  proper  task  of  politics  since  politics  is  first 
of  all  about  the  production  of  public  goods  and  the 
implementation  of  global  justice.  Although  the  last 
position  is  strongly  denied  by  those  holding  that 
social  justice  can  only  be  conceptualized  within 
domestic  politics — that  is,  the  state — globalism 
requires  other  virtues  such  as  the  Kantian  duty  to 
assist  those  who  need  it  the  most.  The  same  prob¬ 
lem  surfaces  when  one  moves  from  social  to  eco¬ 
nomic  justice,  raised,  among  others,  by  “the  more 
Rawlsian  than  Rawls,”  such  as  Thomas  Pogge. 
Actually,  the  plausibility  of  such  a  turn  does  not 
depend  on  philosophical  arguments  but  on  the 
psycho-sociological  consequences  of  those  global 
changes:  If,  as  it  seems  almost  certain,  they  entail  a 
vastly  unequal  distribution  of  costs  and  benefits 
among  different  areas,  then  the  “common  enemy” 
will  not  unite  humankind  but  will  turn  certain 
frustrated  human  groups  against  others  and  phi¬ 
losophy  shall  be  back  to  square  one. 

This  leads  at  times  to  the  fading  of  the  Hobbesian 
divide.  On  the  one  hand,  the  realist  theory  of  inter¬ 
national  relations  holds  that,  whatever  the  prefer¬ 
ences  of  reason,  conflict  is  the  order  itself  and, 
accordingly,  virtual  wars  are  always  possible  in  a 
metaphoric  state  of  nature,  which  does  not  mean 
that  realism  is  averse  to  international  law  and 
morality.  So  war  may  be  legitimized  by  self-defense 
against  aggression,  which  keeps  alive  the  issue  of 
the  moral  legitimacy  of  war,  a  very  ancient  concern 


of  Christian,  Muslim,  Indian,  Buddhist,  and 
Chinese  political  thought,  especially  in  the  thorny 
issues  of  the  preventive  use  of  force  and  armed 
humanitarian  interventions  without  the  mandate 
of  the  United  Nations.  On  the  other  hand,  the  lib¬ 
eral  philosophy  of  domestic  relations,  where  war  is 
banished,  civility  within  conflict  should  reign,  and 
conflict,  albeit  a  necessary  component  of  a  vibrant 
polity,  has  its  magnitude  and  severity  limited  by 
the  requirements  of  order,  may  question  the  neces¬ 
sity  of  a  sovereign  to  reach  that  goal,  one  of  the 
most  pressing  issues  of  contemporary  political  sci¬ 
ence  and  philosophy  since  it  encompasses  all  the 
claims  to  the  obsolescence  of  the  Hobbesian 
divide.  Rawls  has  attempted  to  take  into  account 
the  resilience  of  the  Hobbesian  divide  by  devising 
another  “veil  of  ignorance”  applicable  to  the 
construction  of  justice  in  the  “international  com¬ 
munity”  while  getting  rid  of  the  concept  of  (an 
implicitly  “sovereign”)  “state”  (replaced  by  “peo¬ 
ple”)  (Rawls,  1999). 

Whatever  the  life  chances  of  those  opposing 
viewpoints,  the  professionalization  of  political  phi¬ 
losophy,  different  according  to  each  historical 
context,  should  not  be  construed  as  making  irrele¬ 
vant  other  forms  of  political  reflections  derived 
from  religious  beliefs  and  above  all  from  works  of 
art,  such  as  Homer’s  epic  poetry,  Greek  tragedy, 
Shakespeare’s  plays,  or,  closer  to  our  times,  the 
works  of  Fyodor  Dostoyevsky,  Robert  Musil, 
George  Orwell,  or  Alexandr  Solzhenitsyn.  It  may 
even  be  claimed  that  political  philosophy  and  any 
form  of  political  thought  have  a  “literary  dimen¬ 
sion,”  with  Martha  Nussbaum  stressing  that  the 
texts  always  express  the  linkage  between  what  is 
said  and  hoiv  it  is  said  and  Stanley  Cavell  viewing 
autobiography  as  intrinsic  to  all  interesting  philo¬ 
sophical  writing.  Of  course,  philosophy  remains 
bound  to  seek  self-understanding  and  the  under¬ 
standing  of  human  culture  by  using  logical  reason¬ 
ing  instead  of  an  edifying  narrative.  Yet  it  remains 
possible  that  in  such  a  sense  it  may  miss  something 
vital  in  politics  if  it  is  a  knowledge  not  born  of  pain 
and  passion  since  those  passions  and  emotions  can¬ 
not  be  tamed  by  pure  reason  or  even  instrumental 
rationality,  so  reason  cannot  rule  unfettered  in  the 
human  world.  As  Williams  noticed,  philosophy’s 
shortcoming  may  be  having  kept  the  traces,  dating 
back  to  the  5th  century  BCE  and  earlier,  of  an 
Epicurean  conscience  not  yet  superseded  by  Plato’s 
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and  Aristotle’s  endeavors  to  make  our  relation  to 
the  world  fully  intelligible. 

Although  a  significant  part  of  contemporary 
political  philosophy  appears  dedicated  to  keeping 
political  tragedy  at  bay,  a  sense  of  tragedy  lingers, 
due  to  the  Senecaian  conflict  between  the  individ¬ 
ual  will  to  excel  in  the  practice  of  self-domination 
and  the  political  and  institutional  conventions  of 
the  city  and  also  to  a  conflict  involving  the  mean¬ 
ing  and  foundation  of  different  values  within  the 
same  set  of  conventions.  In  this  respect,  the  warn¬ 
ings  of  green  political  theory  are,  sometimes 
unwittingly,  not  devoid  of  a  tragic  character,  and, 
especially  but  not  only  in  international  relations, 
the  practical  tensions  between  restorative  justice 
(righting  the  wrongs  of  the  past)  and  peace  (ensur¬ 
ing  harmony  in  the  present)  are  more  complex 
than  several  philosophers  seem  to  think.  An  ideal¬ 
ist  and  heroic  effort  to  deny  any  place  to  tragedy 
lies  at  the  heart  of  many  contemporary  philoso¬ 
phies  of  democracy:  such  as  economic,  delibera¬ 
tive,  judicialized,  communication  based  and 
Kantian  republican,  educative,  without  enemies, 
strong,  and  cosmopolitan. 

Most  of  them  (the  economic  theories  being  the 
obvious  exception)  extol  the  virtues  of  rational 
and  peaceful  dialogue  as  a  necessary  and  some¬ 
times  sufficient  condition  not  only  of  philosophical 
practice  but  also  of  real-world  democratic  politics, 
forgetting  too  often  that  an  ideal  dialogue  leading 
to  consensus  is  only  one  of  the  many  features  of  a 
complex  process  that  brings  about  interaction, 
understanding,  and  mutual  agreement  through  a 
host  of  creative  proceedings,  majority  decisions, 
negotiation,  compromise,  lawmaking,  jurispru¬ 
dence,  socialization,  economic  innovation,  and  so 
on  (Walzer,  2007).  Likewise,  the  opposite  concept, 
the  will  of  the  majority,  does  not  exist  as  a  plain 
empirical  fact,  and  Dahl’s  concept  of  polyarchy  is 
closer  to  facts,  although  it  leaves  out  the  issue  of 
the  necessary  decision  stressed  by  Weber.  It  may 
even  be  dangerous  if  taken  as  the  dogma  of  the  will 
of  the  people.  The  people  as  a  plurality  (the 
Aristotelian  multitude)  has  no  unified  will  except 
the  universal  (and  universalist)  banality  that  every 
human  being  wants  to  live  in  the  tranquillitas  ordi- 
nis  (tranquility  of  order)  of  a  just  and  safe  society 
protected  against  “outsiders,”  which  leads  more  to 
interpretive  divisions  than  to  political  consensus, 
as  evidenced  in  collapsing  states  where  different 


peoples  want  to  make  up  a  people  without  agreeing 
on  its  identity,  one  of  the  main  issues  of  secession 
and  self-determination.  Yet  a  majority  is  neither 
useless  nor  devoid  of  validity.  It,  along  with  delib¬ 
eration,  is  one  of  the  many  practical  devices 
invented  to  reach  an  acceptable  decision.  Both  of 
them  need  prior  conditions  to  function  smoothly. 

A  Tentative  Normative  Epistemology 

of  Contemporary  Political  Philosophy 

That  leads  one  to  venture  out  into  a  bit  of  norma¬ 
tive  epistemology.  First,  like  historical  sociology, 
political  philosophy  must  eschew  both  the  illusion 
of  “historical  inevitability”  (the  present  is  already 
written  in  the  past)  and  the  “fallacy  of  discontinu¬ 
ity”  (the  present  is  radically  new  yet  bound  to 
happen). 

Second,  although  philosophy  resembles  science 
in  so  far  as  it  has  to  rely  on  argument  without 
claiming  any  longer  to  be  the  mirror  of  nature,  the 
validity  of  political  philosophy’s  discoveries  should 
be  viewed  as  different  from  scientific  discoveries. 
Because  philosophical  “truths”  cannot  claim  the 
monopoly  held  by  vindicated  scientific  theories  (at 
least  provisionally  until  they  are  falsified  by  a  new 
scientific  discovery),  they  do  have  the  same  status 
as  scientific  ones.  Political  philosophy  seeks  a  bal¬ 
ance,  depending  on  practical  contexts,  between 
values,  for  instance,  “negative”  and  “positive” 
liberty  or  the  “politics  of  faith”  and  the  “politics 
of  skepticism.”  Far  from  being  a  weakness,  this  is 
the  enduring  strength  of  contemporary  political 
philosophy:  not  to  edict  a  theory  of  everything  and 
to  accept  as  a  truth  the  plurality  of  reality  and  so 
the  reasonable  pluralism  of  different  philosophies, 
by  nature  partial  and  incomplete,  without  falling 
into  the  relativism  of  mere  opinions.  As  Williams 
says  about  Berlin,  being  “truthful”  is  perhaps  the 
only  way  to  be  true. 

Third,  political  philosophy  should  stay  away 
from  both  sociological  and  philosophical 
reductionisms  (Maurice  Merleau-Ponty,  1983). 
Sociological  reductionism  makes  philosophical 
statements  and  their  reception  a  secondary  effect 
of  social  determinations;  therefore,  they  need  not 
be  studied  in  themselves  but  can  be  explained 
away  as  prescientific.  Yet  sociology  properly 
understood  may  account  for  the  emergence  of 
ideas  and  their  context  of  discovery,  but  it  does  not 
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follow  that  the  context  of  justification  and  justifica¬ 
tion  itself  can  be  disposed  of  by  the  same  method. 
On  the  other  hand,  philosophical  reductionism  as  a 
narrative  drawing  a  stylized  and  idealized  picture  of 
politics  is  flawed,  not  so  much  because  it  may  pre¬ 
suppose  a  human  being  “floating”  above  history 
and  untied  from  any  social  relation  but  because  it 
excludes  from  empirical  reality  everything  that  does 
not  conform  to  its  philosophical  requisites.  As 
Walzer  (2007)  puts  it,  “Habermas  argues  for 
unconstrained  communication,  but  he  means  only 
to  exclude  the  constraints  of  force  and  fraud,  defer¬ 
ence,  fear,  flattery  and  ignorance”  (p.  26). 
Paradoxically,  such  a  reductionism  substitutes 
opinions  and  impressions  for  sound  empirical 
knowledge.  This  does  not  by  any  means  entail  the 
banishment  of  utopia  from  political  philosophy: 
Utopia  is  not  the  depiction  of  an  actual  perfect 
future  but  the  present  depiction  of  what  shall  never 
be.  It  is  a  “transcendental  horizon”  we  should 
always  keep  in  mind  to  help  us  stay  alert  when  deal¬ 
ing  with  the  actual  political  arrangements  of  today. 

Last,  political  philosophers  should  be  aware 
that  they  operate  in  different  contexts  and  tempo¬ 
ralities  than  political  actors.  Walzer  (2007)  sug¬ 
gests  that  perhaps  philosophy  is  politics  construed 
in  tranquility,  while  politics  is  philosophy  imple¬ 
mented  in  confusion.  This  does  not  mean  that 
philosophers  should  ignore  real  politics  or  sym¬ 
metrically  agree  to  become  politicians  (or  claim  to 
be  their  masters).  Quite  the  opposite — according 
to  Walzer,  they  should  be  aware  that  their  dis¬ 
course  has  no  value  if  they  do  not  care  for  the 
details  of  the  problems  and  situations  they  are 
commenting  on,  provided  they  refrain  from  play¬ 
ing  the  philosopher-king.  For  once,  it  is  not  the 
Devil  who  is  in  the  details;  it  is  humility  and  the 
absence  of  arrogance. 

Universalism,  Localism,  and 
the  Challenge  of  Identities 

Contemporary  political  philosophy,  constrained 
by  the  duty  to  be  contemplative  and  reflexive,  yet 
practical  and  prescriptive,  is  also  facing  the 
unavoidable  challenge  to  say  something  universally 
valid  at  least  “in  reach,”  in  Rawls’s  parlance,  while 
being  “municipal”  and  interpretative  of  the  local 
conventions  of  an  age  or  a  civilization.  Western 
philosophy  with  its  various  brands — contractarian, 


utilitarian,  republican,  even  communitarian,  or 
pragmatic  and  ironic — occupies  most  of  the  field, 
and  most  controversies  take  place  within  the  gen¬ 
eral  framework  of  political  liberalism  and  democ¬ 
racy.  Some  important  issues  include  the  require¬ 
ments  of  a  moral  life  in  liberalism,  the  nature  of 
political  obligation,  the  justification  of  economic 
liberalism,  the  logical  consistency  of  a  liberal  social 
justice  and  democratic  equality,  the  logical  requi¬ 
sites  of  justice  and  equality,  the  possibility  of  a 
“civic  liberalism,”  the  rules  governing  institutional 
design  in  a  democratic  context,  the  role  of  random 
selection  (“sortition”)  in  resource  allocation  and 
more  generally  in  the  improvement  of  democratic 
processes  in  various  domains,  and,  finally,  the 
debates  around  the  right  normative  framework  for 
justifying  democracy,  whereby  collective  decision¬ 
making  procedures  should  be  justified  in  terms  of 
their  epistemic  value — that  is,  their  ability  to  pro¬ 
duce  the  “right”  solution  for  collective  problems 
to  achieve  political  equality,  the  fundamental  stan¬ 
dard  of  evaluation  of  procedures  and  outcomes 
being  political  equality.  Two  requisites  are 
opposed:  epistemic  proceduralism  versus  a  partic¬ 
ular  substantive  theory  of  justice,  the  first  one 
being  conceived  as  escaping  the  accusation  of  eth- 
nocentrism  confining  democracy  in  a  predeter¬ 
mined  set  of  standards. 

One  should  mention  here  the  powerful  critiques 
of  Alastair  MacIntyre,  a  critic  of  analytical  phi¬ 
losophy  coming  from  its  ranks,  and  the  decon¬ 
structionist  assaults  of  Michel  Foucault.  Foucault 
is  more  appreciated  for  his  insistence  on  the  micro¬ 
foundations  of  power  in  the  depths  of  modern 
society,  disciplined  and  subjected  to  governmental- 
ity,  than  for  his  philosophical  ideas  on  the  status  of 
truth  and  ethics.  His  statements  on  “a  political 
history  of  truth”  are  ambiguous  enough  to  go 
beyond  a  mere  social  history  of  science  (and 
morality)  to  be  suspected  of  unfettered  relativism, 
which  would  render  void  his  own  statements. 
Actually,  Foucault  has  an  ethical  sensitivity.  He 
may  be  right  in  criticizing  the  definitive  finding  of 
substances  such  as  transcendental  truth  and  moral¬ 
ity  beyond  the  contingency  of  identities,  but  that 
does  not  exonerate  him  from  the  duty  to  search  for 
the  truth,  without  which  any  agreement  should  be 
considered  an  agreement  about  truth.  As  for 
Jacques  Derrida,  Gilles  Deleuze,  and  Emmanuel 
Levinas  (the  latter  being  more  included  by  others 
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in  political  philosophy  than  claiming  to  belong  to 
the  tribe),  they  oscillate  between  the  deconstruc¬ 
tion  of  liberal  philosophy,  a  metapolitical  philoso¬ 
phy,  and  a  philosophy  of  metapolitics. 

Therefore,  the  dominant  Western  philosophy 
remains  the  legatee  of  the  diverse  kinds  of 
Enlightenment.  One  of  the  only  noticeable  excep¬ 
tions  is  the  antiliberalism  represented  by  the 
Straussian  pocket  of  resistance.  Strauss,  in  his 
American  period,  had  moved  to  an  untimely  liber¬ 
alism  and,  albeit  careful  to  distinguish  between 
“ancient”  and  “modern”  liberalisms,  paid  his 
respects  to  modern  liberal  education.  The  other 
two  exceptions  are  the  legacy  of  Joseph  de  Maistre, 
and  Donoso-Cortes  and  Carl  Schmitt  and  the  con¬ 
servative  thought.  The  first  one  is  almost  outside 
the  borders  of  “professional”  political  philosophy, 
and  the  second  one  may  claim  that  being  “conser¬ 
vative”  it  does  not  deign  to  be  a  philosophy.  As  for 
nationalism,  flourishing  under  various  guises,  secu¬ 
lar  or  religious,  it  must  be  taken  seriously  by  a 
political  philosophy  not  entranced  by  the  charms 
of  cosmopolitanism.  There  may  be  important  argu¬ 
ments  about  the  different  types  of  nationalism  or 
the  liberal  flavor  of  certain  nationalisms  and  their 
compatibility  with  liberal  values,  but  it  cannot  be 
the  core  of  a  political  philosophy  since  a  nationalist 
philosophy  would  be  a  contradiction  in  itself. 

This  apparent  dominance  of  a  philosophy  born 
in  areas  that  do  not  make  up  more  than  15%  of 
the  world  population  and  to  which  the  fall  of  the 
Berlin  Wall  marks  the  beginning  of  an  epochal 
change  in  political  philosophy  brings  about  its 
logical  and  sociological  backlash,  namely,  its 
indictment  of  spurious  universalism  based  on  a 
local  conception  of  what  an  individual  is  and  what 
his  or  her  concept  of  liberty,  quest  for  justice,  and, 
more  broadly,  rationality  may  mean.  To  be  sure, 
this  small  part  of  the  world  had  once  dominated 
the  entire  world  by  founding  huge  overseas 
empires,  the  mere  existence  and  success  of  which 
incited  many  intellectuals  belonging  to  colonized 
peoples  to  desert  the  ranks  of  traditional  thinkers 
and  join  the  troop  of  “organic  intellectuals,” 
espousing  the  basic  tenets  of  the  conquerors’  pub¬ 
lic  philosophy  to  turn  it  against  their  imperial 
endeavor.  But  despite,  or  perhaps  more  accurately 
because  of,  the  universal  success  of  decolonization 
and  its  major  theme,  anticolonialism,  the  dual 
nature  of  the  West,  liberal  inside  and  dictatorial 


outside,  came  to  be  seen  not  as  a  logically  inconsis¬ 
tent  yet  sociologically  explicable  combination  of 
two  sets  of  contrary  values  but  as  a  whole  package 
of  values.  This  gave  rise  to  Occidentalism  (the 
term  was  coined  after  Edward  Said’s  famous 
Orientalism  to  depict  a  distorted  image  of  the  real 
object) — all  the  more  so  as  Marxism,  which  had 
long  been  a  powerful  tool  to  convince  the  colo¬ 
nized  elites  that  they  could  stick  to  the  Enlightenment 
while  getting  rid  of  its  racist  and  colonialist  side, 
had  lost  most  of  its  appeal,  hence  the  search  for 
new  public  philosophies  to  pursue  the  unfinished 
struggle  against  Western  intellectual  hegemony 
and  to  gain  a  better  understanding  of  what  the 
subaltern  needs  and  visions  actually  are. 

Such  an  offhand  yet  plausible  sociology  should 
have  nothing  to  do  with  a  serious  philosophical 
and  scientific  debate.  Yet  it  is  a  powerful  input  that 
paves  the  way  for  new  inquiries  into  the  meaning 
of  concepts  in  areas  until  now  left  out  of  Western 
philosophical  concerns  and  confined  to  anthropo¬ 
logical  and  ethnological  researches  not  devoid  of 
biases — hence,  the  renewed  interest  in,  for  exam¬ 
ple,  civil  society  or  freedom.  Even  more  important 
is  the  surge  of  concern  for  “identities”  that  deeply 
modify  the  usual  debates  about  justice  by  shifting 
them  from  redistribution  to  recognition.  The  quar¬ 
rels  over  “positive  discrimination”  lie  at  the  heart 
of  the  dilemma  of  the  universal  welfare  state, 
trapped  in  the  dual  duty  of  providing  services  to  all 
on  an  equal  footing  and  promoting  particular  cat¬ 
egories.  They  become  bitterer  when  the  relevant 
groups  are  characterized  by  their  location  in  a 
social  stratification  not  based  on  the  division  of 
labor  and  material  reward  but  on  ascriptive  traits 
(religion,  ethnicity,  and  even  gender).  This  may 
lead  to  a  justification  based  on  a  compensating 
rationality  and  restorative  justice,  sometimes  curi¬ 
ously  put  forward  to  claim  that  groups  whose 
identities  have  been  excluded,  suppressed,  or  mar¬ 
ginalized  have  a  right  to  the  recognition  and  public 
toleration  of  their  own  intolerant  practices.  So  a 
combination  of  cultural  relativism  and  respect  for 
the  rights  of  cultural  communities  pretends,  under 
the  guise  of  empirical  descriptions,  to  be  the  polit¬ 
ical  philosophy  of  our  age. 

Still  more  debatable  are  the  assaults  against  cur¬ 
rent  political  philosophy  and,  by  implication, 
political  science.  Starting  with  the  now  (rightly  or 
wrongly)  taken-for-granted  demise  of  sovereignty 
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as  a  sociological  concept  (which  it  has  never  been, 
being  a  legal  concept)  and  of  the  “old”  theory  of 
the  modern  nation-state,  caricatured  as  the  super¬ 
position  of  one  territory,  one  people,  one  sover¬ 
eignty,  one  constitution,  and  one  homogeneous 
political  space  relating  directly  the  individual  to 
the  state,  they  deny  most  of  the  value  of  liberal  and 
republican  philosophy  when  applied  to  other  areas 
than  those  where  they  were  born.  Parekh,  after 
exposing  the  cultural  particularity  of  liberal  democ¬ 
racy  and  excoriating  the  poverty  of  contemporary 
Indian  political  thought,  has  set  himself  the  task  of 
explaining  why  Western  philosophy  stands  no 
chance  of  succeeding  in  countries  where,  for  the 
historical  reasons  already  mentioned  above,  politi¬ 
cal  institutions  of  the  past  are  reenacted  as  alive  or 
revived.  Thus,  in  these  societies,  according  to 
Parekh  (2003),  the  political  imagination  is  less 
“disciplined”  (a  Foucaldian  allusion),  bolder,  more 
reckless,  and  prone  to  explore  a  broader  range  of 
possibilities  than  in  the  West. 

Parekh’s  eulogy  can  be  seen  as  nothing  other 
than  the  reverse  of  the  famous  “Orientalist” 
descriptions  (perhaps  grounded  in  Hegelian  phi¬ 
losophy)  that  considered  non-Western  peoples  as 
alien  peoples  deprived  of  the  values  of  modern 
civilization  and  thus  unable  to  contribute  to  his¬ 
torical  development.  A  thesis  such  as  Parekh’s  suf¬ 
fers  from  two  basic  flaws.  It  does  not  give  us  any 
idea  about  what  such  new  philosophies  might  look 
like,  once  we  have  disposed  of  the  Foucaldian 
fancy  of  a  new  “political  spirituality,”  to  which  he 
refers  when  interpreting  the  Iranian  revolution. 
One  can  ask  whether  the  idea  of  Islamic  human 
rights  betters  the  usual  idea  of  universal  human 
rights  or  which  is  the  more  creative  and  imagina¬ 
tive  of  the  two  conceptions  of  the  relations  between 
religion  and  politics  and  the  modern  role  of 
Muslim  legacy  that  mark  the  epistemological 
divide  in  Islamic  political  thought:  the  transcen¬ 
dence  of  justice  as  an  intangible  and  eternal  trait  of 
the  good  society  versus  the  historicity  of  reason. 
The  first  epistemology  favors  revolutionary  con¬ 
ceptions  of  Islam  based  on  ontological  premises 
adverse  to  political  freedom,  whereas  the  second 
relies  precisely  on  freedom  to  criticize  the  first 
one’s  tenets. 

In  addition,  Parekh  neglects  the  necessary  com¬ 
bination  of  universalism  and  particularism  offered 
by  Walzer’s  (2007)  concept  of  reiteration.  Walzer 


begins  with  a  distinction  between  two  universal- 
isms  or  rather  two  dimensions  of  universalism.  In 
the  covering  law  universalism,  there  is  one  and 
only  one  law,  one  justice,  and  one  correct  under¬ 
standing  of  the  good  life,  the  good  society,  or  the 
good  regime.  In  the  reiterative  universalism,  taking 
seriously  what  it  means  to  have  a  history,  every 
universalist  principle  is  inevitably  reiterated  in  a 
particular  way  in  its  historical  existence;  even  the 
will  to  reproduce  a  principle  and  to  replicate  its 
application  brings  about  some  historical  singular¬ 
ity.  It  ensues  that  the  second  universalism  makes 
no  prediction  about  the  substance  of  the  successive 
reiterations.  Such  a  line  of  reasoning  provides  a 
solid  basis  for  a  moral  universalism  opposing  the 
obligation  to  recognize  and  include  groups  whose 
morality  is  radically  at  odds  with  the  requirements 
of  a  Rawlsian  reasonable  pluralism  and  thus  free¬ 
ing  the  state  from  its  duty  of  indifference  while 
recognizing  at  the  same  time  that  liberalism  is  not 
averse  to,  and  even  promotes,  the  rights  of  minor¬ 
ity  cultures.  By  the  same  token,  political  (nonmeta¬ 
physical)  secularism  can  be  judged  as  a  universal 
value,  provided  that  it  is  not  restricted  to  a  particu¬ 
lar  conception  sometimes  attributed  to  Western 
secularism  and  mainly  conceived  from  a  Protestant 
culture  (a  position  held  by  Hobbes,  Locke,  and 
Voltaire  but  certainly  not  by  Kant,  Tocqueville,  or 
Rawls)  and  that  it  does  not  rely  on  a  rationalist 
conception  of  reason,  ignoring  its  plurality  and 
imposing  unjustified  limits  on  the  manners  in 
which  issues  are  to  be  brought  into  the  public 
domain  (Rajeev  Bhargava,  1998). 

To  be  fair,  let  us  recognize  that  Parekh’s  posi¬ 
tion  is  not  restricted  to  an  excessive  indictment  of 
Western  political  theory  when  it  ventures  overseas. 
Not  only  does  the  empirical  basis  of  Parekh’s  the¬ 
ory  seem  quite  defensible  and  supply  a  theoretical 
justification  for  its  call  to  take  into  account  the 
comprehensive  moral  doctrines  and  reconsider  the 
foundations  of  an  autonomous  political  philoso¬ 
phy,  but  it  also  offers  some  wise  advice  to  do  so. 
Philosophers  must  confront  those  doctrines,  expose 
their  logical  flaws  and  their  inadmissible  moral 
and  political  implications,  and  found  their  own 
political  theory  on  a  reasonably  convincing  con¬ 
ception  of  man  and  the  world.  However,  this  is  a 
very  difficult  task:  If  the  philosopher  is  involved  in 
politics,  he  or  she  cannot  start  a  deep  theoretical 
inquiry  for  lack  of  time.  According  to  Parekh 
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(2003),  if  the  philosopher  wants  to  escape  from 
vita  activa,  the  intellectual  liberty  required  by 
political  theory  is  missing  in  societies  dominated 
by  the  religious,  ethnic,  or  ideological  orthodoxies, 
and  even  in  free  societies  such  as  India,  the  inhibi¬ 
tion  stems  from  the  fear  to  question  the  dominant 
ideological  consensus  and  so  to  encourage  various 
forms  of  extremism.  The  point  is  very  well  taken, 
and  Strauss  would  have  been  sensitive  to  those 
disenchanted  reflections. 

At  least  in  countries  that  are  “disciplined”  by 
other  means  and  where  a  relative  freedom  of  opin¬ 
ion  and  expression  still  reigns,  one  could  expect 
political  science  and  political  philosophy  to  eschew 
the  strange  dual  efforts  to  (1)  accommodate  the 
idea  of  singular  cultural  outlooks,  irreducible  to 
one  another,  and  the  idea  of  a  world  community 
endowed  with  common  values  (short  of  reducing 
them  to  market  economy  rules)  and  (2)  substanti¬ 
ate  the  idea  that  the  static  identities  engendering 
territorial  and  cultural  boundaries  are  nothing 
other  than  the  outmoded  legacies  of  nationalism 
and  imperialism  while  holding  at  the  same  time  the 
opposite  idea  that  the  cultural  minorities  and  the 
borderless  nations  must  be  institutionally  pro¬ 
tected,  which  supposes  that  identities  are  some¬ 
what  static  and  worth  perpetuating. 
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Political  Psychology 


The  relationships  between  psychology  and  politics 
are  complex  and  have  generated  both  controversy 
and  a  substantial  amount  of  research.  This  entry 
discusses  these  relationships,  which  have  been  the 
focus  of  research  in  the  subfield  of  political 
psychology  in  recent  decades.  They  include  per¬ 
sonality  aspects,  group  and  mass  phenomena, 
emotional  intelligence,  and  political  symbolism. 

In  simplistic  terms,  sociology  refers  to  collective 
behaviors  and  psychology  to  individual  ones. 
Therefore,  it  might  be  expected  that  political 
researchers  who  adopt  a  collective  or  structural 
approach  toward  political  relationships  would  be 
the  most  reluctant  to  acknowledge  the  relevance  of 
political  psychology,  while  those  who  support  an 
individualistic  approach  would  be  more  likely  to 
recognize  its  usefulness  or  even  to  claim  its  necessity. 
But  this  has  not  occurred.  Political  psychology  has 
met  with  much  opposition  from  both  sides.  Why? 
After  all,  Alexis  de  Tocqueville,  whose  work  is  still 
highly  valued  by  political  scientists  in  both  America 
and  Europe,  devoted  significant  attention  to  psycho¬ 
logical  considerations  of  the  way  in  which  political 
systems  work.  In  the  1930s,  Harold  Lasswell 
creatively  investigated  the  links  between  psychopa¬ 
thology  and  politics,  particularly  as  regards  the 
motivations  behind  political  involvement. 

For  a  long  time,  however,  strong  objections  to 
political  psychology  prevailed  as  political  science 
became  more  and  more  meticulous  and  rigorous  in 
its  methodological  practices.  Many  works  in  polit¬ 
ical  psychology  were  speculative  essays  without 
valuable  data  or  detailed  theoretical  frameworks. 
In  Europe,  these  weaknesses  were  most  apparent 
in  the  macrolevel  literature  devoted  to  crowd  psy¬ 
chology  or  the  ethnic  temper  of  particular  nations. 


Among  the  concepts  in  this  literature  that  have 
been  rejected  by  political  scientists  are  the  notion  of 
“collective  unconscious”  coined  by  Carl  Jung,  a 
psychoanalyst  who  competed  with  Sigmund  Freud; 
the  cliche  of  the  irrational  crowd  moved  by  its 
“collective  soul”  put  forward  by  Gustave  Le  Bon, 
who  considered  himself  the  founder  of  crowd  psy¬ 
chology;  the  “herd  mentality,”  which  was  described 
as  a  human  instinct  by  Wilfred  Trotter;  and  the 
approximations  of  the  psychology  of  various  peo¬ 
ples  and  “national  character”  that  so  often  appeared 
in  works  of  the  first  half  of  the  20th  century.  All  of 
these  superficial  considerations  tended  to  cause 
confusion  and  suspicion  over  the  use  of  psychology 
in  politics.  As  for  the  psychology  of  individual 
actors,  most  sociologists  have  denied  its  relevance, 
arguing  either  that  the  real  motivations  of  personal 
behaviors  remain  inaccessible  to  knowledge  by  sci¬ 
entific  standards  or  that  investigating  these  is 
pointless  since  social  structures  or  the  valorization 
of  social  roles  (a  set  of  expectations  involved  in  an 
individual’s  social  status)  are  the  major  explana¬ 
tory  factors.  Some  of  these  arguments  against  the 
use  of  psychology  have  merit,  but  it  must  be  added 
that  a  lack  of  information  or  a  culture  predisposed 
against  the  psychological  field  may  also  explain,  at 
least  partially,  the  refusal  of  political  researchers  to 
borrow  frameworks  from  this  discipline. 

What  is  political  psychology?  It  is  not  a  genuine 
discipline  in  its  own  right,  nor  is  it  a  branch  of 
psychology  because  it  shows  no  special  interest  in 
generalizing  about  human  behavior  beyond  the 
political  arena.  Rather,  it  is  an  interdisciplinary 
academic  field  intended  to  provide  insights  into  the 
way  in  which  politics  functions.  In  fact,  each  kind 
of  collective  behavior,  each  political  event  is  an  out¬ 
come  of  individual  actions  aggregated  and  interact¬ 
ing,  as  Max  Weber  strongly  asserted.  So  it  is  not 
possible  to  avoid  some  theory  about  the  cognitive 
and  emotional  processing  of  man  in  social  life  when 
thinking  about  politics.  But  if  social  structures 
emerge  from  personal  behaviors,  personalities  are 
also  socially  shaped.  Moreover,  the  macrosocial 
dimensions  of  politics  strengthen  the  importance  of 
institutional  factors  to  the  detriment  of  free  will. 
This  implies,  as  Erving  Goffman  has  stated,  that  we 
need  a  psychology  for  sociologists,  very  different 
from  that  for  psychologists  and,  in  many  respects, 
quite  restricted.  Nevertheless,  it  is  useful  and  even 
necessary  to  avoid  neglecting  contributions  and 
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concepts  from  the  neurosciences,  cognitive  psychol¬ 
ogy,  developmental  or  community  psychology,  and, 
maybe,  psychoanalysis.  But  the  relevance  of  each  of 
these  approaches  depends  on  the  nature  of  the  fields 
perceived  as  psychosociological.  And  it  just  so  hap¬ 
pens  that  these  are  very  different. 

Personality  Psychology  and  Politics 

Personality  is  usually  defined  as  an  organized, 
dynamic  but  relatively  enduring  set  of  characteris¬ 
tics  that  affects  how  an  individual  responds  in 
various  situations.  In  the  study  of  politics,  a  focus 
on  this  concept  seems  to  make  sense  above  all 
when  a  limited  number  of  key  persons  are  involved 
in  a  process  of  decision  making.  For  example,  it 
would  be  important  to  know  whether  political 
leaders  are  proactive  or  reactive,  conciliatory  or 
aggressive,  open-minded  or  close-minded.  But  per¬ 
sonality  psychology  strongly  matters  if,  and  only 
if,  it  is  believed  that  the  will  of  a  few  policymakers — 
and  particularly  that  of  the  political  leadership — 
can  overcome  the  hurdles  imposed  on  the  policy¬ 
making  process  and  successfully  implement  their 
will.  This  rarely  occurs.  In  most  cases,  leaders  are 
compelled  to  search  for  compromises,  to  face 
internal  power  struggles  as  well  as  demands  from 
pressure  groups,  and  to  come  to  terms  with  the 
expectations  imposed  on  them  by  their  elite  status. 
So  they  are  obliged  to  curb  their  desires,  and 
instead  of  expressing  their  genuine  motivations  (or 
following  what  drives  them),  they  adopt  behaviors 
strictly  dictated  by  the  power  games  at  play.  Thus, 
it  is  unsurprising  that  many  sociologists  see  such 
psychological  approaches  as  groundless.  The  real 
explanatory  factors  may  be  elsewhere. 

This  is  all  true,  but  three  additional  points  must 
be  taken  into  consideration.  First,  as  Fred 
Greenstein  noticed,  it  may  happen  that  the  com¬ 
peting  forces  of  power  are  so  well  balanced  and 
under  such  fluid  circumstances  that  there  is  space 
for  a  key  decision  maker  to  make  a  genuinely  free 
choice  among  policy  options.  In  this  case,  psycho¬ 
logical  characteristics  matter.  Of  course,  this  situa¬ 
tion  is  much  more  infrequent  in  democratic  regimes 
than  in  dictatorships.  Furthermore,  in  either  gov¬ 
ernment  structure,  leaders  are  dependent  to  some 
degree  on  the  will  of  their  followers  and  the  nature 
of  the  alliances  they  have  forged.  Second,  even  if 
free  choice  is  highly  restricted,  the  fact  remains 


that  personal  psychology  affects  the  style  of  gover¬ 
nance  adopted  and  particularly  the  way  in  which 
leaders  appeal  to  citizens,  supporters,  and  oppo¬ 
nents.  This  has  a  great  deal  of  influence  on  the 
course  of  political  life.  A  charismatic  style  versus  a 
sober  tone  or  a  bright  versus  a  stoic  appearance 
can  make  all  the  difference  in  the  electoral  process — 
as  we  saw  in  the  competition  between  Silvio 
Berlusconi  and  Romano  Prodi  in  Italy  (2008)  or 
among  Barack  Obama  and  several  of  his  challeng¬ 
ers  in  the  United  States  primaries  of  the  same  year. 
Third,  personality  factors  are  relevant  for  leaders 
acting  in  situations  where  strong  emotions  are  trig¬ 
gered,  as  Betty  Glad  claims,  citing  the  case  of 
Jimmy  Carter  handling  the  Iranian  hostage  crisis 
or  Ronald  Reagan’s  inability  to  directly  confront 
people  he  liked,  which  hindered  his  dealings  with 
Menachem  Begin  at  the  time  of  the  Israeli  invasion 
of  Lebanon  in  1982.  Investigating  the  effects  of 
character  on  role  performance  is  never  pointless. 

Throughout  the  20th  century,  research  in  per¬ 
sonal  psychology  dealt  with  the  motivations  of 
politicians  and/or  their  followers.  Since  the  1950s, 
many  biographical  works  have  emphasized  the 
idea  that  politicians  are  qualitatively  different  from 
other  humans  because,  as  Carol  Barner-Barry 
noted,  political  behavior  is  strongly  related  to  mak¬ 
ing  choices  between  disputed  alternatives,  strongly 
oriented  toward  losing  or  winning,  and  often  moti¬ 
vated  by  feelings  of  justice  or  fairness.  So  in  these 
biographers’  minds,  political  involvement  derives 
from  special  incentives.  James  Payne  (and  his  asso¬ 
ciates)  have  uncovered  five  main  political  motiva¬ 
tions:  the  quest  for  status  or  prestige,  the  need  to 
work  on  concrete  issues,  the  need  to  have  convivi¬ 
ality  and  friendship,  the  fulfillment  of  moral  obli¬ 
gations,  and  the  pleasure  of  competing  aggressively 
in  political  games.  Their  research  shows  how  skep¬ 
tical  citizens  ought  to  be  about  the  assertions  of 
politicians,  who  tend  to  idealize  their  motivations. 
But,  even  if  Payne’s  list  is  useful  to  later  analysis,  it 
is  too  short.  Obviously,  activists  and  politicians 
share  other  common  emotional  drives,  such  as  the 
desire  to  overcome  low  self-esteem  by  public  self- 
realization  or  to  escape  the  monotony  of  life 
through  stimulating  social  games,  not  just  aggres¬ 
sive  ones.  In  the  last  few  decades,  several  authors 
also  tried  to  build  ideal  types  of  personality.  When 
applied  to  leaders,  such  classifications  aim  not  only 
to  make  their  behaviors  more  predictable  to 
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observers  but  also  to  uncover  the  mechanisms  by 
which  they  charm  their  followers  (or  threaten  their 
opponents).  Of  course,  the  appeal  of  revolutionary 
ascetics  or  moral  masochists  differs  from  that  of 
absolute  narcissists  or  catchall  seducers.  Moreover, 
we  may  expect  that  revolutionary  (and  stressing) 
situations  produce  different  types  of  leaders  than 
do  ordinary  democratic  times. 

Other  authors  were  interested  in  designing 
typologies  of  ordinary  citizens.  In  this  field,  the 
most  famous  investigation  was  carried  out  by 
Theodor  Adorno  and  a  team  of  psychologists  and 
psychoanalysts;  their  book  The  Authoritarian 
Personality  (1950)  later  inspired  much  subsequent 
field  research  in  political  science.  In  their  view, 
authoritarian  personalities  are  characterized  by 
traits  that  result  from  childhood  experiences  and 
become  stabilized  for  life.  Harsh  and  punitive  par¬ 
ents  produce  adults  whose  personalities  are  defined 
by  authoritarian  submission,  conventionalism, 
destructiveness,  superstition  and  stereotypy,  and 
fear  of  intraception  (the  ability  to  delve  into  one’s 
deep  feelings  and  attitudes).  They  used  this  syn¬ 
drome  to  help  explain  right-wing  ideology,  prefas¬ 
cism,  and  fascism.  This  thesis  was  widely  criticized 
for  methodological  flaws,  particularly  blindness 
about  the  characteristics  of  left-wing  authoritarian 
personalities.  But  the  main  objection  here  is  the 
very  use  of  the  concept  of  personality.  Adorno’s 
research  implies  that  political  opinions,  choices, 
and  behaviors  are  primarily  the  outcome  of  psy¬ 
chological  factors  since  personality  is  perceived  as 
the  center  at  which  such  choices  are  made.  It  also 
tends  to  exaggerate  the  consistency  of  individual 
responses  to  the  challenges  of  life.  Individuals 
sometimes  undergo  sudden  psychological  changes 
in  attitudes,  related  to  changing  circumstances 
such  as  fear  of  war,  local  unrest,  or  economic 
decline.  For  instance,  in  such  trying  circumstances, 
individuals  may  abandon  cooperation  strategies  in 
favor  of  more  competitive  or  defensive  ones.  In  his 
classic  monograph,  Personality  and  Assessment 
(1968),  Walter  Mischel  asserts  that  personal  psy¬ 
chology  searches  for  consistency  in  the  wrong 
places  (e.g.,  seeking  stable  responses  in  cross- 
situational  behaviors).  Because  the  individual’s  behav¬ 
ior  is  highly  dependent  on  situational  cues,  his  or 
her  consistency  is  found  in  stable  patterns  of  the 
“if-then”  variety — doing  A  when  X  but  B  when  Y. 
So,  for  instance,  similar  aggressive  personalities 


can  differ  dramatically  in  the  types  of  situations  in 
which  they  are  aggressed.  So  without  dismissing 
some  of  the  psychological  insights  put  forward  by 
Adorno  or  Milton  Rokeach  (author  of  the  famous 
The  Open  and  Closed  Mind,  1960),  it  is  better  to 
see  personality  psychology  as  complementing 
rather  than  supplanting  sociological  approaches. 
Too  much  attention  to  personal  psychology  can 
result  in  too  much  attention  being  paid  to  personal 
responsibility,  while  in  fact  political  life  is  made  up 
of  collective  processes  and  interactions. 

Today,  there  is  a  shift  in  political  psychology 
toward  other  conceptualizations  borrowed  from 
behaviorist  or  cognitive  psychology.  Both  under¬ 
score  the  role  of  external  stimuli  and  their  effects 
on  personal  behavior.  Burrhus  Skinner,  a  very 
influential  behaviorist,  has  demonstrated  that  there 
is  a  mutual  interaction  of  the  ego  with  its  environ¬ 
ment.  According  to  his  three-step  contingency 
model  (stimuli/responses/consequences),  everyone 
develops  habits  of  thinking  and  acting  that  are 
reinforced  by  their  effectiveness  in  earlier  situa¬ 
tions.  Cognitive  processes  involve  adaptive  selec¬ 
tivity.  In  the  political  field,  this  means,  for  instance, 
that  the  development  of  Adolf  Hitler’s  or  Joseph 
Stalin’s  personality  was  itself  a  process,  strongly 
related  to  the  course  of  their  political  rise  and 
sharpening  instrumental  tendencies  with  every  vic¬ 
torious  step.  Moreover,  it  can  be  said  that  the 
economical,  cultural,  and  political  circumstances 
of  their  days — a  humiliating  defeat  in  World  War 
I  for  Germany,  and  civil  war  and  the  triumph  of 
the  one-party  system  in  Russia — played  a  role  in 
the  later  selection  of  these  types  of  personalities  as 
omnipotent  leaders.  These  personality  types  fit 
well  with  the  kinds  of  crises  that  their  respective 
countries  were  confronting.  Many  political  scien¬ 
tists  may  argue  that  the  critical  question  is  not 
“Which  personality?”  but  “Under  which  circum¬ 
stances  did  particular  persons  establish  themselves 
as  the  leader  of  a  country?”  Though  this  argument 
underlines  the  importance  of  social  and  political 
considerations,  it  would  be  wrong  to  entirely  dis¬ 
miss  the  role  of  personal  psychology  because  it  has 
a  significant  impact  in  other  ways. 

Interactionist  Psychology  and  Groupthink 

Rather  than  focusing  solely  on  factors  shaping  a 
person,  political  psychology  must  also  take  into 
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account  interactions  between  thoughts  and  feel¬ 
ings  that  link  individuals  to  each  other  in  a  given 
situation.  Cognitive  processes  such  as  thinking  and 
judging  political  events  are  not  affected  only  by 
macrosocial  influences.  Of  course,  early  socializa¬ 
tion,  economic  and  cultural  status,  and  religious, 
ethnic,  or  political  affiliations  all  shape  the  ways  in 
which  citizens  make  up  their  minds  in  a  voting 
booth  or  engage  in  political  activities.  However, 
the  inner  circle — family,  friends,  highly  respected 
figures — matters,  too.  And,  in  this  circle,  emo¬ 
tional  relationships  play  a  significant  role.  Feelings 
of  confidence  or  mistrust,  a  tendency  to  judge  oth¬ 
ers,  the  degree  of  empathy  or  ambivalence  toward 
relatives,  or  a  fear  of  annoying  them — these  kinds 
of  relations  can  modify  a  specific  behavior  like 
choosing  a  candidate  in  an  election.  It  is  for  this 
reason  that  when  family  or  tribal  links  are  strong, 
as  is  the  case  in  rural  areas  or  in  many  non- 
Western  cultures,  electoral  processes  are  shaped  by 
factors  other  than  those  in  more  individualistic 
cultures.  Pressure  from  other  members  of  the 
“tribe”  may  compel  individuals  to  endorse  partic¬ 
ular  opinions.  In  addition,  the  appearance  of  com¬ 
petitors  may  awaken  emotional  tensions  resulting 
from  relationships  between  parents  and  children, 
especially  between  fathers  and  sons.  A  great  deal 
of  research  shows  that  these  tensions  in  turn  influ¬ 
ence  the  reproduction  process  of  inherited  political 
attitudes. 

The  influence  of  these  psychological  interrela¬ 
tions  at  a  microsocial  level  is  most  visible  and 
most  consequential  for  the  course  of  political  life 
when  studying  decision-making  processes.  With 
his  theory  of  groupthink,  Irving  Janis  describes 
how  systematic  errors  result  from  psychological 
biases  held  by  small  groups  of  policymakers  tak¬ 
ing  collective  decisions  in  a  very  closed  (and 
stressful)  environment.  The  more  a  group  is 
cohesive,  due  to  its  homogeneity  of  social  back¬ 
ground  and  ideology,  the  more  its  members  value 
unanimity  of  thoughts,  which  in  turn  may  over¬ 
ride  a  realistic  perception  of  facts  and  alternative 
interpretations  of  issues.  The  risk  occurs  when 
the  situational  context  is  highly  provocative 
because  of  recent  policy  failures  or  very  serious 
external  threats,  noticeably  in  the  field  of  secu¬ 
rity.  The  debate  can,  in  these  cases,  become 
overloaded  with  implicit  (or  sometimes  explicit) 
emotion.  Wrong  decisions,  Janis  claims,  result 


from  self-censorship  of  ideas  that  deviate  from 
the  apparent  group  consensus — each  of  the  mem¬ 
bers  fearing  being  dubbed  “disloyal”  to  the 
leader — and  from  channels  that  shield  the  group 
from  disturbing  information  in  order  to  prevent 
too  much  anxiety.  To  get  rid  of  these  biases, 
Janis  advocates  some  kind  of  reorganization  of 
the  decision-making  process,  particularly  by  set¬ 
ting  up  several  independent  groups  working  on 
the  same  issue  and  by  increasing  the  involvement 
of  outside  experts.  So  we  can  see  that  certain 
methods  of  political  management  are — or  should 
be — responses  to  psychological  challenges.  Much 
subsequent  research  has  tried  to  test  Janis’s  thesis 
in  laboratory  experiments  or  in  the  field.  In  a 
broad  survey  of  post-Janis  research,  Robert 
Baron  claims  that  groupthink  is  even  more  ubiq¬ 
uitous  than  Janis  asserted.  In  particular,  it  occurs 
even  if  there  are  neither  external  threats  nor 
much  anxiety  present.  So  in  Baron’s  opinion, 
such  an  approach  must  not  be  underestimated 
for  explaining  political  mistakes  and  misunder¬ 
standings  of  the  challenges  to  be  resolved. 

Interactionist  psychology  investigates  phenom¬ 
ena  such  as  the  way  in  which  political  judgment  is 
shaped  among  members  of  think  tanks,  associa¬ 
tions,  and  parties.  In  political  life,  engagement 
means  sharing  values  and  goals  with  others  within 
an  organization  and,  above  all,  self-identification 
with  those  ideas.  Politicians  and  grassroots  activ¬ 
ists  claim  an  identity  that  supposedly  summarizes 
what  they  think  and  what  they  are  going  to  do. 
They  call  themselves  either  Republican  or  Dem¬ 
ocrat,  right  wing  or  left  wing.  Even  within  a  party, 
they  often  claim  an  affiliation  to  one  faction.  This 
self-categorization  or  political  identity  puts  big 
constraints  on  their  actions:  first  on  the  kind  of 
labels  with  which  they  can  describe  a  situation 
and,  second,  on  the  set  of  issues  they  can  take  for 
consideration.  The  rank  and  file  have  to  represent 
the  positions  traditionally  taken  by  their  organiza¬ 
tion  in  order  to  avoid  accusations  of  disloyalty, 
prevent  reproaches  from  their  entourage,  and 
avoid  being  marginalized — which  may  be  painful. 
This  pressure  is  stronger  where  organizations  are 
well  established  and  their  members  allegedly  share 
mutual  friendships,  based  on  a  past  history  of 
political  struggles.  Sometimes  blind  allegiance 
causes  a  process  of  real  de-individuation:  for 
example,  a  loss  of  self-awareness  and  personal 
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evaluation  made  about  political  events  and 
leaders.  In  this  regard,  if  the  authoritarian  man¬ 
agement  of  a  party  mostly  attracts  persons  who 
completely  identify  with  the  leader,  this  generates 
feedback  reinforcing  authoritarian  management. 
A  process  of  isolation  and  insulation  from  society 
may  result,  as  many  examples  of  extremist  parties 
(right  wing  or  left  wing)  demonstrate. 

As  for  leaders,  if  they  have  space  to  undertake 
initiatives  and  formulate  political  judgments  of 
their  own,  they  can  work  in  a  more  or  less  polar¬ 
ized  world.  If  the  outgroups  they  confront  are 
particularly  despised  (which  is  the  case  when  com¬ 
petition  is  high),  positions  issued  by  political  lead¬ 
ers  may  be  greatly — and  negatively — influenced  by 
their  opponents’  stances.  “What  differentiates 
your  program  from  your  competitor’s?”  is  a  com¬ 
mon  question  leaders  have  to  address.  But  the 
distinctions  these  individuals  demonstrate  from 
each  other  may  not  only  be  political.  We  may  con¬ 
sider  that,  in  democratic  regimes,  political  action  is 
as  much  affected  by  emotions  as  formal  political 
allegiances.  The  degree  to  which  one  sees  his  or  her 
opponents  as  friends  or  enemies  or  the  degree  to 
which  one  treats  them  with  deep  respect  or  equally 
deep  disgust  colors  political  life  with  a  touch  of 
psychology. 

Mass  Belief  Systems  in  Electoral  Processes 

How  do  citizens  make  decisions  in  the  voting 
booth?  This  classic  question  in  political  science 
draws  much  attention.  Some  researchers  argue  that 
votes  are  generally  consistent  with  the  beliefs  and 
political  frames  with  which  the  voter  grew  up.  In 
this  regard,  the  so-called  Michigan  paradigm,  iden¬ 
tified  with  the  publication  of  the  influential  The 
American  Voter  (1960),  was  seminal.  This  book 
insists  on  the  long-standing  effects  of  socialization 
by  neighborhood,  education,  ethnic,  religious,  and 
class  affiliation.  It  puts  forward  party  identification 
as  a  decisive  variable  in  voting.  A  second  line  of 
thinking  sees  citizens  as  relatively  naive  individu¬ 
als,  relying  heavily  on  wrong,  irrelevant,  or  incom¬ 
plete  information.  This  research  is  interested  in  the 
way  volatile  (or  nonexistent)  opinions  are  influ¬ 
enced  by  media  coverage  in  electoral  campaigns, 
above  all  when  real  political  knowledge  is  lacking. 
A  third  stream  of  research,  closely  related  to  the 
theory  of  public  choice,  underscores  the  tactical 


ability  of  voters  to  pursue  rational  goals.  Voters 
are  portrayed  as  consumers  searching  for  the  most 
efficient  costs/benefits  choice. 

All  these  explanations  rely  on  implicit  or,  much 
less  often,  explicit  psychological  considerations, 
even  if  they  are  tightly  intertwined  with  sociologi¬ 
cal  and  political  ones.  Contributors  to  The  American 
Voter  like  Philip  Converse,  a  social  psychologist, 
accept  this.  Whether  people  have  “sticky”  prefer¬ 
ences  that  discriminate  according  to  partisan  affili¬ 
ations  or  remain  independent  and  open-minded 
and  whether  they  are  prone  to  trust,  or  not  to  trust, 
political  parties,  governmental  agencies,  union 
leaders,  and  so  on,  are  significantly  related  to  psy¬ 
chological  factors.  What  may  be  at  stake,  for 
instance,  are  the  degree  of  self-protection  involved 
and  the  ways  and  means  of  keeping  oneself  safe. 
Those  who  score  high  on  attitudinal  scales  measur¬ 
ing  anxiety  are  more  likely  to  adopt  vigilant  behav¬ 
ior  and  show  a  strictly  selected  confidence  in  a  few 
authorities.  As  soon  as  the  political  situation 
becomes  tense,  they  view  the  public  arena  as 
divided  and  feel  confronted  by  merciless  foes.  By 
contrast,  a  strong  self-esteem  makes  an  individual 
more  open  to  opposite  views  without  fearing  desta¬ 
bilization,  more  able  to  understand  others’  points 
of  views,  and  eventually,  more  likely  to  adopt  con¬ 
ciliatory  or  balanced  opinions  and  behavior. 

Political  scientists  have  always  known  that 
political  choices  are  related  to  ideological  values  in 
a  country’s  cultural  environment.  But  they  seldom 
investigate  this  subject  in  great  depth,  making  it 
more  difficult  for  them  to  understand  all  the  vari¬ 
ous  roles  ideology  plays  in  thinking  and  judging. 
Rationalizing  an  action  through  purported  values 
rewards  the  Ego,  helping  individuals  keep  a  sense 
of  self-consistency  and  self-esteem.  This  idealizes  a 
given  behavior  with  rationalizing  explanations; 
conceals  less  glorious  motivations  for  action,  such 
as  envy,  jealousy,  and  hate;  and  confirms  linkages 
between  individuals — which  are  of  the  utmost 
importance  for  individuals  participating  in  politi¬ 
cal  organizations.  In  this  respect,  ideological  and 
psychological  factors  are  strongly  related  to  each 
other  but  in  a  complex  way  that  needs  to  be 
explored.  Voters  may  cast  a  ballot  for  a  party  can¬ 
didate  because  they  think  that  he  or  she  is  the  best 
or  the  least  distasteful  among  the  choices;  but  they 
may  also  have  in  mind  a  desire  to  show  what  kind 
of  citizen  they  are — a  responsible  democrat,  an 
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enthusiastic  patriot,  a  justice  seeker,  or  even  an 
inflexible  rebel — no  matter  what  their  real  person¬ 
ality  may  be.  Framed  early  on  by  socialization  and 
strengthened  by  the  daily  influence  of  an  individu¬ 
al’s  social  location,  this  self-affirmation,  ostensibly 
relying  on  internalized  values,  induces  a  ballot 
choice  that  is  relatively  independent  from  the  real 
stakes  of  the  polling  day  (such  as  policy  relevance 
or  candidate  competence).  It  is  the  reason  why 
public  images  (of  a  party  or  a  candidate)  matter  so 
much.  Politicians,  in  turn,  who  make  identification 
processes  easier  will  more  easily  obtain  votes.  Issue 
positions  are  not  enough;  a  candidate’s  whole  life 
must  testify  in  his  or  her  favor. 

Political  psychology,  used  by  political  scientists 
interested  in  the  ways  in  which  mass  opinion  is  cre¬ 
ated,  is  hardly  psychological  in  the  eyes  of  “genu¬ 
ine”  psychologists  because  it  considers  external 
structures  of  psychology  as  well  as  internal  factors. 
Cultural  linkages,  as  well  as  the  cognitive,  ideo¬ 
logical,  and  institutional  structures  of  the  political 
arena,  are  all  taken  into  account  when  considering 
why  citizens  make  a  particular  choice.  In  electoral 
studies,  this  kind  of  political  psychology  has  been 
drawing  more  and  more  attention.  Since  the  later 
1970s,  research  based  on  surveys  or  various  attitu- 
dinal  scales  has  attempted  to  give  a  more  precise 
picture  of  different  types  of  voters  by  taking  into 
account  psychological  factors  such  as  whether  one 
is  pessimistic  or  optimistic,  self-directed  or  other- 
directed,  or  confident  or  suspicious.  In  a  survey, 
counting  likes  and  dislikes  about  issues  and  candi¬ 
dates  has  become  of  the  utmost  importance  in 
predicting  electoral  choices.  These  studies  have 
focused  less  on  personality  factors  than  on  psycho¬ 
logical  styles.  They  are  the  product  of  responses 
extracted  from  aggregated  statistical  data,  so  that 
certain  general  categorizations  can  be  correlated 
with  the  appeal  of  public  figures  in  a  highly  per¬ 
sonalized  competition.  When  Jack  Doppelt  and 
Ellen  Shearer  identify  five  types  of  nonvoters — 
doers,  unplugged,  irritated,  don’t  know,  and 
alienated — they  shed  some  new  light  on  the  phe¬ 
nomenon  of  abstention  and  the  reasons  why  such 
behavior  can  be  volatile  from  one  election  to 
another.  These  analyses  tend  to  depreciate  the  idea 
that  electoral  motivations  are  always  based  on 
pure  rationality  and  controlled  economic  calculus. 

The  ways  in  which  psychology  matters  in  the 
voting  process  are  well  known.  Electoral  campaigns 


mostly  activate  psychological  predispositions. 
Beliefs  and  emotions  like  fear  or  hope  and  anger  or 
enthusiasm  influence  perceptions  and  evaluations 
of  public  figures  (parties  or  candidates)  that,  in 
turn,  determine  the  final  choice.  But  news  media 
attention  to  particular  incidents  or  hitches  in  the 
campaign  can  also  modify  the  evaluation  process, 
depending  on  what  problems  are  being  stressed:  for 
example,  threats  to  national  security,  economic 
crises,  immigration,  taxes,  or  purchasing  power. 
Political  scandals  can  also  have  significant  negative 
effects.  So  voting  should  be  understood  not  only  as 
a  political  outcome  but  also  as  a  mediated  psycho¬ 
logical  process. 

People  cannot  be  locked  into  precise  categori¬ 
zations.  Many  citizens  change  their  mind  as  situ¬ 
ations  develop,  in  accordance  with  changes  in  the 
political  arena.  Even  the  Michigan  model  never 
claimed  that  party  identifications  remain  indefi¬ 
nitely  stable.  So  snapshots  taken  at  different 
times  of  the  fluid  mood  of  an  electorate  enable  us 
to  better  understand  the  way  in  which  beliefs  and 
fantasies  cause  opinions  and  emotions  and  even¬ 
tually  lead  to  a  political  choice.  An  interesting 
discovery  in  the  United  States,  valuable  to  some 
extent  for  other  Western  democracies,  is  that  a 
large  range  of  public  likes  and  dislikes,  extracted 
from  frequent  surveys,  can  be  aggregated  in  a 
“policy  mood”  (James  Stimson)  that  has  swung, 
over  four  decades,  from  a  “liberal”  high  to  a 
low,  in  regular  increments.  Politicians  cannot 
take  this  finding  lightly.  It  remains  to  be  empha¬ 
sized  again  that  pure  rationality  and  controlled 
calculus  will  never  successfully  explain  any  par¬ 
ticular  behavior.  But  people  with  higher  levels  of 
political  information  (and  education)  know  bet¬ 
ter  than  novices  or  lower  educated  persons  how 
to  express  through  strict  political  rationaliza¬ 
tions  their  candidate  or  issue  preference. 

Emotional  Intelligence 

and  Related  Concepts 

Since  the  later  1980s,  emotional  intelligence  has 
been  given  increasing  attention  in  a  wide  range  of 
academic  publications  that  draw  on  observations 
from  political  scientists.  Just  as  the  neurosciences 
underscore  the  links  between  reason  and  emotion  in 
the  way  the  human  brain  works,  these  psycholo¬ 
gists  dismiss  the  idea  of  pure  intelligence.  Political 
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scientists  may  borrow  some  crucial  assertions  from 
Daniel  Goleman’s  book  Emotional  Intelligence 
(1995) — above  all,  the  fact  that  all  people  harness 
their  emotions,  even  negative  ones,  to  understand 
and  decipher  external  information  and  manage 
these  emotions  to  achieve  their  intended  goals. 
Some  cross-cultural  research  suggests  that  there 
are  many  universally  recognized  emotions,  which 
means  they  are  biological  in  origin  even  if  cultures 
interfere  in  their  expression.  Of  this  list,  which 
ranges  from  6  to  15  in  Paul  Ekman’s  works  (the 
most  noteworthy  being  What  the  Face  Reveals, 
1999),  some  are  of  greater  interest  for  political 
studies:  anger  and  fear,  disgust,  shame  and  con¬ 
tempt,  all  are  often  described  as  negative  emotions; 
hope  and  pride,  relief,  and  compassion  are  all  per¬ 
ceived  as  positive  ones.  But  this  opposition  (nega¬ 
tive/positive),  most  clearly  expressed  by  the  pairing 
of  satisfaction  and  frustration,  needs  to  be  reap¬ 
praised  even  if  it  is  still  employed  in  some  field 
research.  Why  should  shame  or  pride  be  always 
seen  negatively?  More  important,  each  of  these 
emotions  is  dynamic,  and  its  intensity  is  strongly 
related  to  the  internalized  ability  of  each  individual 
to  maintain  self-control  (which  develops  through 
early  socialization)  as  well  as  to  the  nature  of  the 
challenges  faced  in  his  or  her  environment. 

The  first  lesson  of  this  literature  is  that  we  can¬ 
not  avoid  taking  into  account  the  primacy  of  feel¬ 
ings  in  political  evaluations — about  issues  as  well 
as  actors.  Affects  and  emotions  interfere  with 
political  value  judgments  and  contribute  to  a 
selective  memory  of  past  events,  whether  they 
were  pleasant  or  unpleasant.  The  fact  is  well- 
known  to  politicians  but  continues  to  be  underes¬ 
timated  by  social  scientists.  One  must  expect  a 
complex  interaction  of  emotion  and  cognition 
during  any  deliberation  over  political  decisions.  In 
stressful  circumstances,  affective  states  such  as 
fear  or  anger  may  unsettle  otherwise  sound  politi¬ 
cal  judgment,  resulting  in  a  misperception  of  the 
adequate  response  to  a  given  situation.  Noticeably, 
this  is  the  case  when  high-level  violence  occurs, 
targeting  personalities,  assets,  or  even  public  val¬ 
ues.  Islamist  terrorism  has  led  to  suspicion  toward 
Muslim  populations,  strong  war  threats  have 
resulted  in  an  overwhelming  denouncement  of 
foes,  and  ethnic  disturbances  reduce  those  affected 
to  a  single  trait  of  their  identity,  whether  this  be 
origin,  religion,  or  language.  But  less  noticed,  even 


in  day-to-day  political  life,  is  that  emotions  dimin¬ 
ish  or  stimulate  an  individual’s  capacity  to  judge 
actions  or  actors.  Satisfaction  prompts  people  to 
distance  themselves  from  disturbing  or  unpleasant 
information  or  to  adopt  convenient  interpreta¬ 
tions  of  it,  which  in  turn  make  it  easier  for  them 
to  remain  satisfied.  Frustration  causes  a  symmetri¬ 
cal  mechanism  of  increasing  frustration.  The  more 
this  process  remains  unconscious,  the  more  it 
weighs  in  political  evaluations  and,  particularly, 
voting  decisions.  Hope  and  fear,  and  pride  and 
humiliation,  whether  well  founded  or  imagined, 
are  the  main  emotions  interfering  with  rational 
evaluations  of  citizens.  In  electoral  campaigns, 
this  phenomenon  is  greatly  reinforced  either  by 
the  scarcity  or,  conversely,  the  excess  of  informa¬ 
tion  available,  which  is  in  all  cases  hard  to  con¬ 
trol.  Uncertainty  in  cognition  gives  way  to  more 
emotional  investment. 

The  second  lesson  is  that  emotions  are  the 
engine  of  behavior.  Where  you  stand  depends  on 
what  you  feel.  When  taking  any  action,  most 
people  search  to  build  their  self-esteem  and  earn  a 
good  reputation  among  the  individuals  or  groups 
whose  opinions  matter  to  them.  This  is  at  the  root 
of  both  conformism  and  rebellion.  What  differs 
between  people  are  the  values  or  guides  to  which 
they  refer.  Understanding  the  intensity  and  direction 
of  such  inclinations  is  decisive  for  the  interpreta¬ 
tion  of  political  choices.  But  such  an  investigation 
may  be  misguided  if  too  much  importance  is 
attached  to  authors’  statements  of  their  own  inten¬ 
tions.  If  circumstances  compel  people  to  take 
actions  of  which  they  are  not  proud,  their  real 
motives  will,  if  at  all  possible,  be  hidden  behind 
more  noble  legitimizations.  In  some  cultures, 
ambition,  outrage,  or  compassion  are  more  or  less 
valued,  even  if  everywhere  they  are  decisive  factors 
that  determine  the  degree  of  political  engagement 
and  its  relationship  to  the  quest  for  self-esteem.  So 
political  psychology  must  study  the  complex  fac¬ 
tors  that,  in  any  society,  tend  to  stimulate  or 
diminish  the  public  acknowledgment  of  these 
motivations.  Other  people  are  mainly  directed  by 
their  fears  or  even  by  paranoid  hatred.  In  these 
cases,  it  is  necessary  to  delve  into  the  conditions 
under  which  these  feelings  arise  or  even  affect  an 
entire  population.  We  know  that  people  who 
express  strongly  held  anger  as  a  result  of  difficult 
challenges  they  cannot  properly  face  want  to  hold 
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external  agents  responsible  as  scapegoats  for  their 
troubles.  But  what  explains  the  trajectory  that 
leads  to  mass  murder  or  genocidal  violence?  A 
combination  of  sociological  and  psychological  fac¬ 
tors  must  be  acknowledged  here. 

To  take  into  account  these  dimensions  of  the 
political  life,  it  is  useful  to  refer  to  the  theory  of 
rational  choice  but  understood  in  an  untraditional 
way.  The  basic  assertion  of  this  economic  para¬ 
digm  is  that  people  evaluate  the  costs  and  benefits 
of  the  choices  they  have  to  make  and  generally 
prefer  the  object  that  provides  the  greatest  reward 
(utility,  attractiveness)  at  the  lowest  price.  But  in 
political  life,  this  kind  of  calculus,  when  restricted 
to  material  costs  and  rewards,  is  a  myth.  This  is 
not  only  because  information  is  restricted — 
individuals  are  unwilling  to  pay  the  high  costs  for 
complete  information — but  also  because  there  are 
costs  and  rewards  of  another  kind  to  be  taken  into 
consideration.  Rational  choice  theory  sets  emo¬ 
tionality  and  rationality  against  each  other.  This  is 
an  error.  Even  if  emotions  easily  spin  out  of  con¬ 
trol  under  certain  circumstances,  they  are  always 
part  of  the  rational  calculus.  Individuals  tend  to 
choose  the  option  that  will  diminish  the  emotional 
costs  of  fear  or  humiliation.  They  want  to  feel 
more  secure  as  well  and  search  to  heal  any  frustra¬ 
tions,  regardless  of  whether  this  may  take  them 
down  a  favorable  or  unfavorable  path.  But  indi¬ 
viduals  are  also  attracted  by  intangible  rewards, 
such  as  pride,  self-esteem,  and  even  surpassing 
their  image  of  themselves.  All  these  considerations 
are  just  as  rational  as  purely  material  estimations 
of  gains  and  losses.  And  as  they  weigh  heavily  in 
political  life,  researchers,  therefore,  should  be 
interested  in  studying  what  is  called  (in  contrast  to 
ego-psychology)  “situation  psychology” — that  is, 
the  study  of  emotional  costs  and  rewards  that  are 
predictable  in  a  specific  environment.  Economic 
crises  create  particular  concerns  and  fears;  the  rise 
of  new  leaders  with  a  reputation  of  efficiency  or 
integrity  awakens  new  hopes  for  economic 
improvement,  political  uprightness,  and  so  on. 
Politicians  value  emotional  appeals  when  these 
further  their  own  goals.  In  dictatorships  or  in 
populist  discourses,  they  will  not  hesitate  to  mobi¬ 
lize  hatred,  suspicion,  or  aversion.  In  democracies, 
fear  or  hope,  and  pride  or  outrage  remain  instru¬ 
mental  but  in  a  way  that  encourages  sufficient 
monitoring  of  their  effects.  Nevertheless,  even  in 


these  regimes,  it  happens  that  “rational  calculus,” 
including  material  and  emotional  rewards  or 
costs,  is  somewhat  suspended  because  the  level  of 
emotional  intensity  can  spiral  out  of  control.  This 
is  the  reason  why  it  may  be  said — under  these 
circumstances  but  only  under  them — that  emotion 
can  become  “irrational.” 

It  should  not  be  forgotten  that  people  prob- 
lematize  their  own  levels  of  emotional  comfort  in 
accordance  with  their  early  socialization  and  expec¬ 
tations  derived  from  their  actual  position  in  social 
life.  On  the  one  hand,  some  research  strongly  sug¬ 
gests  that  individuals  acquire,  in  childhood  or  late 
adolescence,  lasting  predispositions  that  shape  their 
evaluations  of  the  situations  they  confront.  Racial 
prejudices,  ideological  identifications,  or  an  aver¬ 
sion  to  communism  or  capitalism  all  may  persist 
throughout  life.  On  the  other  hand,  short-term 
considerations  also  influence  attitudes  and  subse¬ 
quent  responses  to  challenges.  For  instance,  Linda 
Putnam  refers  to  the  concept  of  “bounded  emo¬ 
tionality”  to  suggest  that  “interrelatedness”  helps 
shape  individual  expectations  in  organizations.  In 
any  situation,  many  factors  that  can  be  perceived  as 
opportunities — bringing  hopes  or  threats,  or  induc¬ 
ing  fear — matter  as  well.  So  in  these  instances, 
attitudes  and  choices  reveal  some  kind  of  “rational¬ 
ity,”  even  if  (or,  more  precisely,  because)  this  ratio¬ 
nality  is  based  on  emotions.  Even  if  sometimes  it 
happens  that  emotions  get  out  of  control,  more 
often  than  not  they  are  useful  in  clarifying  the  real 
interests  and  aspirations  in  a  given  environment. 

Symbolic  Politics 

The  power  of  political  symbols  lies  in  their  strong 
capacity  for  evocation — that  is,  to  create  associative 
meanings  that  enrich  the  way  in  which  people  react 
to  them.  This  can  occur  first  of  all  with  knowledge 
more  or  less  forgotten  but  open  to  being  revived 
when  appropriately  stimulated.  But  above  all,  reac¬ 
tions  are  elicited  from  the  emotional  charges  con¬ 
tained  in  the  symbols — because  of  their  historical 
origins  and/or  added  content  since  then.  Some 
words  in  political  discourses  are  not  purely  referen¬ 
tial  but  engage  strong  connotations  under  special 
circumstances.  They  are  cognitively  and  affectively 
loaded.  That  is  the  case,  for  instance,  when  a  highly 
respected  politician  is  abruptly  charged  with  cor¬ 
ruption  or  treason  by  an  authoritative  agency. 
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Everybody  can  understand  the  destructive  stigma  of 
such  a  powerful  allegation.  When  used  in  all  seri¬ 
ousness,  some  terms  that  refer  to  positive  values, 
such  as  liberty  or  human  rights,  or  those  that  nega¬ 
tively  describe  political  foes,  such  as  fascists,  terror¬ 
ists,  plutocrats,  or  communists,  appear  to  contain 
heavy  emotional  associations.  Choosing  an  effective 
label  for  a  political  competition  or  defining  appro¬ 
priately  a  social  mobilization  can  give  an  organiza¬ 
tion  or  a  political  figure  a  distinct  advantage  in 
politics.  Take  the  prochoice  and  prolife  movements: 
Both  attempt  to  associate  their  position  with  indis¬ 
putable  values  while  denouncing  their  opponents  as 
being  antichoice  or  antilife. 

Strong  cognitive  and  emotional  connotations  are 
linked  not  only  to  single  words  but  also  to  elabo¬ 
rate  arguments.  Storytelling  discourses,  which  often 
incorporate  ethical  messages,  whether  strongly 
negative  or  strongly  positive,  can  spark  any  num¬ 
ber  of  emotions,  from  praise  to  reproach  or  even  to 
outrage.  This  is  often  the  way  history  is  taught  at 
school  to  very  young  children,  with  the  aim  of  hav¬ 
ing  them  embrace  the  heroism  of  their  country’s 
forefathers  and  feel  connected  to  the  accomplish¬ 
ments  of  their  nation  while  being  repulsed  by  the 
actions  of  their  enemies.  This  can  have  a  lasting 
effect  throughout  their  lives.  Political  parties  may 
similarly  refer  to  historical  figures  or  invoke  long- 
lasting  doctrines  to  make  their  assertions  more 
authoritative.  They  may  also  recall  past  disasters  to 
extract  painful  lessons.  So  some  events,  personali¬ 
ties,  and  accomplishments  can  become  overloaded 
with  affective  cognition. 

Within  these  storytelling  narratives,  there  are 
codified  lines  of  argumentation  that  use  a  common 
thread  to  make  sense  of  the  past  and  the  present  or 
to  help  decipher  the  future.  To  identify  them,  some 
academics  (Ibarra,  Kitsuse)  have  put  forward  the 
concept  of  “rhetorical  idioms.”  These  are  com¬ 
mon-sense  constructions  of  moral  competence. 
Their  deployment  tends  to  presume  that  the  listen¬ 
ers  are  obliged  to  acknowledge  the  importance  of 
the  values  expressed.  Moreover,  rhetorical  idioms 
are  useful  in  enlisting  people  to  make  sympathetic 
moves  in  a  particular  language  game.  For  example, 
the  “rhetoric  of  loss”  or  the  “rhetoric  of  endanger- 
ment”  prioritizes  facts  and  behavior  that  show,  in 
a  more  or  less  simplified  way,  how  people  should 
fear  the  extinction  or  devaluation  of  something 
highly  valuable  and  cherished,  whether  this  be 
national  pride,  ethnic  or  religious  identity,  or 


political  ethics.  Such  rhetoric  appeals  to  emotional 
responses:  mourning,  apprehension,  or  even  dread 
but,  above  all,  admiration  for  the  potential  rescuer 
and  a  desire  to  identify  with  such  a  wise  prophet. 

Symbolism  is  not  restricted  to  discourses  but  is 
also  found  in  both  material  objects  and  rituals 
performed  in  the  political  arena.  A  flag  is  much 
more  than  a  simple  piece  of  cloth:  it  symbolizes  the 
nation,  its  identity,  and  glory.  Burning  a  flag  is 
the  utmost  form  of  abuse  that  can  be  inflicted  on 
the  people  it  represents.  The  architecture  of  gov¬ 
ernmental  palaces  similarly  exhibits  power  or 
glory.  Statues  and  monuments,  particularly  war 
memorials,  signify  that  they  deserve  a  central  place 
in  peoples’  imagination  and  memory.  Political  ritu¬ 
als  such  as  the  opening  of  a  parliamentary  session, 
the  appointment  of  a  new  prime  minister,  or  dip¬ 
lomatic  formalities  and  protocols — all  of  these 
highly  codified  ceremonies  intend  to  arouse  intimi¬ 
dation  and  deference,  to  exhibit  the  gap  between 
ruling  personalities  and  ruled  citizens,  to  signify 
where  the  power  is  and  who  is  to  be  viewed  as 
being  at  the  center  of  the  social  order  (Clifford 
Geertz).  But,  of  course,  the  substance  of  the  sym¬ 
bolism  does  not  lie  within  these  material  objects  or 
ceremonies.  Their  ability  to  evoke  emotional  cog¬ 
nition  is  contingent  on  the  attitudes  and  comments 
they  arouse  in  a  given  cultural  environment.  So  it 
may  be  said  that  symbolism  results  from  an  ongo¬ 
ing  process  of  emotional  construction,  undertaken 
by  authorities  perceived  as  entitled  to  do  so:  intel¬ 
lectuals,  social  leaders,  and  politicians  interacting 
with  each  other.  If  this  process  fails,  it  may  be  that 
the  symbolic  appeal  has  been  exhausted.  This  may 
occur  when  indifference,  or  even  disrespect  toward 
such  materials  or  rites,  has  developed  over  time. 

What  are  the  uses  of  symbols  and  symbolic  ges¬ 
tures  in  politics?  They  can  possibly  fulfill  three  func¬ 
tions,  which  are  all  decisive  in  political  life.  First, 
many  symbols  help  develop  a  feeling  of  mutual  lik¬ 
ing  and  appreciation — what  is  called  “solidarity”  in 
political  language.  Governments  must  necessarily 
try  to  stimulate  solidarity  because  life  in  society 
compels  them  to  impose  obligations  and  sacrifices, 
which  may  be  more  easily  accepted  if  people  think 
they  have  much  in  common  beyond  strict  material 
necessity.  A  sense  of  allegiance  to  a  community, 
within  which  the  members  supposedly  share  the 
same  values  and  history,  makes  exercising  power 
much  easier.  Constructing  collective  identities, 
national  or  otherwise,  relies  on  emotional  narratives 
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that  emphasize  the  great  things  the  people  have  real¬ 
ized  together  and  the  highly  prized  goals  they  may 
achieve  if  they  remain  united.  The  reality  of  this 
“imagined  community”  (Benedict  Anderson)  is 
exhibited  through  the  symbols  everyone  learns  to 
love:  keywords  such  as  republic,  national  state,  and 
liberty  and  equality,  material  objects  such  as  flags, 
war  memorials,  and  other  emblems  of  the  commu¬ 
nity  and  its  legacy;  historical  figures  that  are  end¬ 
lessly  praised;  economic,  cultural,  and  sports 
achievements  as  a  source  of  communal  pride;  and 
so  on. 

A  second  function  of  symbols  in  politics  is  to 
control  the  need  for  peace  and  stability.  In  interna¬ 
tional  relations,  a  policy  of  recognition — one  that 
accepts  the  principle  of  equal  dignity  between  states 
and  recognizes  the  necessity  of  taking  the  blame  for 
injuries  and  damages  inflicted  on  another  by  the 
state — is  a  major  factor  permitting  appeasement 
between  peoples.  For  this  purpose  of  building  better 
relations  with  other  people,  words  and  gestures  mat¬ 
ter  a  lot,  as  suggested  by  the  respective  German 
Chancellors’  kneeling  down  at  the  Ghetto  Memorial 
in  Warsaw  (1970)  or  a  visit  to  Yad  Vashem  (2008), 
both  highly  charged  with  emotions.  Within  any  par¬ 
ticular  state,  symbolism  appears  to  be  much  more 
intense  when  institutions  become  more  vulnerable — 
which  predictably  occurs  at  certain  stages  of  institu¬ 
tional  life.  This  is  the  case,  for  example,  with  the 
formation  or  the  replacement  of  a  government. 
Constitutions  provide  explicit  rules  for  such  transi¬ 
tions.  It  is  necessary  to  ensure  that  these  rules  are 
uniformly  respected  in  order  to  create  a  consensus 
about  the  way  in  which  power  must  be  exercised. 
When  people  go  to  the  polls,  it  is  of  the  utmost 
importance  that  the  outcome  be  honored.  If  not, 
disorder  and  instability  may  quickly  appear  and 
destroy  the  rule  of  law.  It  is  in  these  times  of  uncer¬ 
tainty,  when  current  governments  are  challenged  by 
opposition  figures,  that  comments  on  the  value  of  an 
emotional  attachment  to  democratic  principles  will 
intensify  on  all  sides.  Any  fear  of  death  will  similarly 
evoke  symbolic  displays,  whether  this  be  the  deaths 
of  soldiers  on  the  battle  front  or  the  unforeseen 
death  of  a  prominent  leader.  Elaborate  ceremonies 
and  rituals  are  used  to  restrain  confusion  and  to 
prevent  the  situation  from  getting  out  of  control. 

A  third  function  of  symbols  is  to  mobilize  citi¬ 
zens  to  support  a  “good  cause.”  Purely  rational 
considerations  are  not  always  sufficient  to  get 
people  involved  in  political  life,  even  when  their 


own  interests  are  at  stake.  Invoking  the  breach  of 
equality  or  the  infringement  of  human  rights,  both 
highly  prized  values,  is  an  effective  way  to  awaken 
their  moral  sensibilities  and  provoke  some  kind  of 
outrage,  which  may  possibly  lead  to  a  commitment 
to  action.  More  broadly,  social  problems  to  be 
managed  by  those  in  power  are  coded  through 
contextually  grounded  discourses  and  vocabularies 
that  designate  and  dramatize  in  the  same  way. 
Noticeably,  they  are  inhabited  by  the  concept  of 
victim,  a  putative  person  being  subject  to  harmful 
conditions  of  which  a  victimizer  is  the  causal 
agent.  For  example,  common  victims  include  an 
unemployed,  impoverished  populace  or  powerless 
minorities,  while  the  victimizers  may  be  the 
wealthy,  capitalism,  or  even  the  “system”  itself. 
All  of  these  words,  when  related  to  a  victimization 
process,  become  charged  with  emotional  connota¬ 
tions  set  up  by  ideologies  and,  thus,  tend  to  appear 
as  signals  for  action.  In  this  way,  symbolic  politics 
constructs  victim-and-victimizer  categories  within 
particular  universes  of  morality  where  there  is 
good  to  be  loved  and  evil  to  be  condemned. 

Short  Methodological  Considerations 

Political  science  needs  scientific  precision.  This 
presents  a  particularly  strong  challenge  in  politi¬ 
cal  psychology.  First,  because  emotions  are  typi¬ 
cally  volatile  or  subtle  phenomena  and,  second, 
because  psychological  costs  and  rewards  are 
often  far  from  being  transparently  displayed.  In 
the  past,  the  key  obstacles  included  a  false  con¬ 
sensus  on  imprecise  core  definitions,  a  diversity 
of  underlying  psychological  theories,  and  above 
all,  insufficient  techniques  for  data  collection. 
Dramatic  advances  have  recently  been  achieved 
with  the  development  of  rich  interview  materials 
using  projective  questions,  ingenious  experi¬ 
ments  (the  first  of  them  being  Stanley  Milgram’s), 
the  construction  of  attitudinal  scales  and  EQ 
(emotional  quotient)  to  measure  preferences  and 
subjective  reactions  with  performance  metrics 
(Likert,  Altemeyer),  and  even  computer  simula¬ 
tions.  But  these  methods  must  avoid  traps  such 
as  an  abundance  of  missing  data  points,  sample 
bias,  poor  psychometric  techniques,  and  the  pos¬ 
sible  contradictions  between  attitudinal  factors 
and  real  behavior.  Certain  phenomena  will  never 
be  well  understood  without  longitudinal  research 
that  still  remains  quite  scarce.  Furthermore,  even 
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if  the  positivist-empirical  leaning  of  modern 
political  psychology  indisputably  favors  strong 
breakthroughs  in  better  understanding  the  emo¬ 
tional  dimensions  of  political  life,  there  are  pos¬ 
sible  negative  implications  as  well.  One  may  be 
tempted  to  give  up  questions  that  are  too  diffi¬ 
cult  to  translate  into  solid  empirical  inquiries  or 
to  use  exaggerated  simplifications.  Less  sophisti¬ 
cated  observations  may  produce  more  richness, 
even  if  the  findings  cannot  be  so  strongly  asserted. 

At  this  point,  it  is  necessary  to  discuss  what 
room  may  be  left  to  psychoanalysis.  In  mainstream 
political  psychology,  this  theory  is  widely  ruled  out 
as  a  method  of  analysis.  This  is  right  because  its 
techniques  are  far  from  being  compatible  with 
social  science  requirements  of  intersub jectively 
transmissible  and  controllable  findings.  Neverthe¬ 
less,  if  something  is  to  be  borrowed  from  psy¬ 
choanalysis,  it  should  be  the  theory  of  defensive 
attitudes  in  a  protection  of  the  self.  The  blocking 
of  cognitive  representation  and  affect  (denial),  a 
sudden  reversal  to  take  the  opposite  position,  a 
projection  leading  to  rationalization  or  identifica¬ 
tion  with  the  other — all  these  processes  matter 
greatly  when  dealing  with  psychological  or  socio¬ 
logical  material.  The  more  a  researcher  seriously 
engages  with  these  processes,  the  more  likely  it  is 
that  he  or  she  will  be  able  to  identify  both  the  bias 
that  may  interfere  with  his  or  her  interview  or  dis¬ 
course  material  and  the  bias  he  or  she  may  experi¬ 
ence  when  setting  up  a  research  project. 
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Paris,  France 
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Political  risk  analysis  is  rooted  in  the  intersection 
between  politics  and  business.  It  analyses  the  prob¬ 
ability  that  political  decisions,  events,  or  condi¬ 
tions  will  significantly  affect  the  profitability  of  a 
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business  actor  or  the  expected  value  of  a  given 
economic  action.  This  definition  incorporates  three 
different  approaches  among  early  theorists  about 
the  sources  of  political  risk — namely,  a  focus  on 
foreign  national  governments,  the  recognition  of 
the  impact  of  actors  from  both  government  and 
nongovernment  circles,  and  an  emphasis  on  his¬ 
torical  and  cultural  environments.  A  wide  spec¬ 
trum  of  political  risks  may  affect  business,  and 
political  risk  analysts  use  both  qualitative  and 
quantitative  frameworks  to  analyze  and  assess  the 
risks  to  business. 

Cross-border  traders  and  investors  are  often 
involved  in  forms  of  political  risk  analysis.  How¬ 
ever,  political  risk  analysis  only  became  recogniz¬ 
able  as  an  institutionalized  business  practice  in  the 
United  States  in  the  1970s.  Several  factors  enhanced 
the  prevalence  of  international  business  activities 
at  the  time.  Simultaneously,  the  Organization  of 
Petroleum  Exporting  Countries  (OPEC)  oil  boy¬ 
cott  in  1973  resulted  in  a  partial  institutionaliza¬ 
tion  of  the  political  risk  function  in  more  U.S. 
companies.  This  function  became  full-time  and 
more  firmly  centered  in  many  businesses  after  the 
Iranian  Revolution  of  1979.  Most  initial  academic 
research  on  political  risk  analysis  was  also  done  in 
the  1970s  and  1980s. 

Political  risk  analysts  identify  political  risks  and 
their  variables,  assess  their  significance  and  the 
relationships  between  them,  and  make  recommen¬ 
dations  regarding  the  management  and  mitigation 
of  political  risks.  Social  science  research  and 
nonacademic  interpretations  of  current  affairs 
influence  all  three  phases — namely,  the  analysis, 
assessment,  and  management  of  political  risk.  In 
principle,  political  risk  could  also  be  useful  in  ana¬ 
lyzing  the  general  judgment  and  policy  design  of 
politicians  under  conditions  of  uncertainty. 
However,  political  risk  analysis  is  undertheorized 
in  this  regard  and  currently  remains  rooted  in  the 
intersection  between  politics  and  business. 

Although  political  risk  analysis  could  apply  to 
domestic  business,  in  practice  it  mostly  comes  into 
play  when  a  business  is  considering  business  activ¬ 
ities  in  other  countries.  In  the  academic  literature, 
the  focus  tends  to  be  on  political  risk  analysis 
related  to  foreign  direct  investment  (FDI)  rather 
than  relatively  passive  portfolio  investment.  The 
exposure  of  assets  or  personnel  in  FDI  reinforces 
the  relevance  of  political  risk  analysis.  However, 
political  risk  can  also  affect  the  expected  profits 


and  market  stakes  of  exporters,  contractors,  and 
licensors. 

Sources  of  Political  Risk 

Several,  sometimes  overlapping,  government  func¬ 
tions  can  have  an  impact  on  business.  In  many 
industrialized  countries,  its  role  as  a  regulator  is 
especially  extensive,  resulting  in  legislation  related 
to  the  environment,  health  and  safety,  employ¬ 
ment,  trade  unions,  and  consumers.  A  government 
can  also  serve  as  a  restrictor  (tariffs  and  trade  quo¬ 
tas),  redistributor  (taxation  and  welfare  policies), 
director  (training,  regional  and  sectoral  develop¬ 
ment,  and  human  resource  policies),  customer 
(procurement),  or  sponsor  (subsidies). 

Some  authors  contend  that  there  is  an  inbuilt 
potential  bias  in  political  risk  analysis,  whereby 
any  government  intervention  in  the  economy  is 
seen  as  potentially  negative,  even  though  such  an 
intervention  may  be  motivated  by  relevant  local 
interests  and  aspirations.  It  is  in  any  case  meaning¬ 
ful  to  locate  the  particular  relationships  between 
multinational  business  and  national  governments 
or  other  political  actors  when  assessing  the  actual 
political  risk.  The  particular  cultural  and  historical 
context  may  also  influence  political  risk — for 
example,  where  energy  or  mineral  companies  are 
associated  with  earlier  colonial  projects  in  Africa 
or  the  Middle  East. 

The  most  familiar  relationship  between  business 
and  political  authorities  is  a  cooperative  arrange¬ 
ment,  where  negotiations  are  ongoing  and  a  nor¬ 
mal  part  of  operations.  While  the  government 
would  not  act  deliberately  to  affect  operations,  the 
company  would  only  use  lobbying,  either  singly  or 
through  participation  in  probusiness  pressure 
groups  and  associations.  A  second  relationship 
would  be  collaborative,  consisting  of  joint-venture 
relationships  with  public  sector  companies  or  pri¬ 
vately  owned  companies  with  a  strong  governmen¬ 
tal  presence. 

An  authoritative  relationship  applies  when  a 
multinational  corporation  and  a  government  are  at 
loggerheads.  Mostly,  a  government  can  impose 
new  rules,  which  may  result  in  divestment  by  the 
company.  Two  other  relationships  are  far  less  fre¬ 
quent.  A  home  government  may  use  a  multina¬ 
tional  company  to  promote  its  political  objectives. 
Alternatively,  in  the  case  of  subversion,  a  multina¬ 
tional  company  may  actively  work  to  undermine  a 
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host  government,  partly  with  the  covert  encourage¬ 
ment  of  the  company’s  home  government.  In  the 
latter  two  cases,  the  conduct  of  business  can  also 
constitute  a  source  of  political  risk. 

Risks  to  business  in  a  country  may  not  only 
ensue  from  a  government  in  a  foreign  country  but 
also  from  actions  by  the  governments  in  its  neigh¬ 
boring  or  other  countries.  Transnational  or  inter¬ 
national  actors,  opposition  groups  and  other 
domestic  stakeholders,  and  the  particular  political 
field  in  a  country  may  become  linked  to  political 
risk.  In  some  countries,  due  to  the  power  or 
authority  of  informal  networks  linked  to  the  gov¬ 
ernment,  these  groups,  rather  than  the  govern¬ 
ment,  may  be  the  main  source  of  political  risk  to  a 
particular  business. 

Types  of  Political  Risk 

Political  risk  may  vary  at  different  business 
levels — that  is,  for  all  foreign  business  actors,  a 
particular  industry  or  company,  or  a  particular 
project.  It  also  depends  on  the  type  of  investment, 
its  methods  of  financing,  its  location,  and  the  time 
frame  involved.  Political  risk  may  have  an  impact 
on  one  or  more  aspects  of  a  business  actor,  includ¬ 
ing  personnel,  assets,  contracts,  operations,  trans¬ 
fers,  company  goals,  and  business  continuity.  The 
impact  may  be  directly  or  indirectly  in  the  form  of 
opportunity  costs  forgone. 

Risks  to  personnel  and  operations  may  include 
intimidation,  kidnapping,  sabotage,  and  terrorism, 
if  these  risks  are  motivated  by  political  concerns. 
However,  some  of  these  risks  may  also  ensue  from 
nonpolitical  actors  and  constitute  a  general  secu¬ 
rity  risk  only,  requiring  a  different  set  of  preventive 
measures  and  incident  responses.  Asset  risks  may 
include  general  nationalization  and  specific  expro¬ 
priation,  restrictions  on  ownership,  and  an  insis¬ 
tence  on  locally  owned  shareholdings  or  local 
directorships.  Contractual  risks  may  include 
changes  in  contractual  conditions  due  to  legislative 
or  bureaucratic  action,  or  the  frustration  of  con¬ 
tracts  due  to  violent  or  political  change,  including 
a  revolution,  civil  war,  secession,  interstate  war, 
coup  d’etat,  or  peaceful  succession. 

Risks  to  operations  are  a  wide  category  and 
include  all  host  country  regulations  that  affect 
business  operations.  These  may  include  labor 
relations,  taxation,  restrictions  on  labor  or 


technology  transfer,  and  local  product  content 
regulations.  Some  other  examples  include  quotas 
and  tariffs,  environmental  and  consumer  protec¬ 
tion,  antitrust  and  merger  laws,  discrimination  in 
awarding  contracts,  and  bureaucratic  nepotism. 
Transfer  risks  could  include  exchange  controls, 
profit  repatriation,  and  restrictions  on  royalty 
payments.  Local  variations  in  these  risks  are  also 
possible  in  countries  where  the  regional  author¬ 
ity  of  an  area  is  at  loggerheads  with  the  central 
government  of  the  country  or  where  a  local 
power  broker  is  the  actual  authority  on  the 
ground. 

Differentiating  Political  Risk  Analysis 
From  Country  Risk  Analysis 

Political  risk  analysis  partly  grew  out  of  the  coun¬ 
try  risk  analysis  conducted  by  major  banks  and 
international  economic  agencies.  The  analysis  of 
country  risk  and  political  risk  differs  but  may 
sometimes  overlap.  Country  risk  analysis  tends  to 
include  political  risk  but  also  economic  and  opera¬ 
tional  risk.  Some  of  the  economic  factors  included 
under  country  risk,  for  example,  a  bad  balance  of 
payments  and  low  creditworthiness,  may  reflect  an 
inability  to  pay  debt  but  may  also  result  in  a  polit¬ 
ical  risk — namely,  an  unwillingness  to  pay  debt. 
Political  risk  may  also  overlap  with  some  of  the 
country  risk  factors,  where  events,  foreign  confi¬ 
dence,  and  capital  inflows  meet. 

Political  stability  as  an  indicator  is  included  in 
comparative  country  risk-rating  systems  such  as 
the  Peren-Clement  Index  or  the  Business  Environ¬ 
ment  Risk  Intelligence  (BERI)  Index.  However,  the 
management  of  political  risk  may  differ  from  that 
of  country  risk.  For  example,  many  political  risks 
are  insurable,  whereas  many  economic  and  finan¬ 
cial  risks  included  under  country  risk  are  not. 

Political  risk  analysis  may  distinguish  between  a 
microlevel  and  a  macrolevel  political  risk.  A  micro¬ 
level  political  risk  is  a  risk  specific  to  a  business  or 
some  sectors  only.  A  macrolevel  political  risk  is  a 
non-project-specific  risk  that  affects  all  participants 
in  a  given  country.  However,  it  does  not  include 
country-level  political  risk  alone,  but  it  may  link 
local,  national,  and  regional  political  forces,  events, 
and  environments.  Depending  on  the  requirement 
of  the  particular  business,  political  risk  analysis  can 
focus  on  both  or  one  of  the  two  levels. 
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The  Methodology  of  Analysis 

Some  major  business  actors  have  in-house  analysts, 
while  others  at  least  partially  outsource  the  task  of 
analysis  to  specialist  providers.  A  company’s  need 
for  political  risk  analysis  may  differ  at  different 
times.  The  perceived  need  for  political  risk  analysis 
tends  to  be  higher  around  the  decision  to  enter  or 
avoid  a  particular  country’s  marketplace,  but  dif¬ 
ferent  forms  of  political  risk  analysis  are  also  used 
as  a  regular  form  of  early  warning,  to  periodically 
review  in-country  operations,  or  sporadically  in 
response  to  new  uncertainties  or  setbacks. 

Political  risk  itself  is  a  subjective  category  and 
will  vary  according  to  the  actor  defining  the  risk 
and  the  field  of  action.  While  pure  risk  only  entails 
loss  or  a  chance  of  loss,  speculative  political  risk 
can  entail  the  chance  of  both  loss  and  opportunity. 
In  this  regard,  political  risk  analysis  is  not  neces¬ 
sarily  a  linear  process  of  unilateral  communication 
but  may  become  part  of  an  ongoing  dialogue 
between  the  analysts  and  (other)  actors  within  the 
particular  business,  who  may  have  a  better  appre¬ 
ciation  of  the  particular  business  or  project  than 
the  analyst. 

Analysts  use  both  quantitative  and  qualitative 
models  for  analysis,  and  there  is  no  consensus  on 
the  methodology.  A  model  is  an  extended  repre¬ 
sentation  to  better  understand,  adapt  to,  manage, 
and  control  identified  political  risk  factors.  The 
number  and  nature  of  variables,  their  combina¬ 
tions,  and  the  weights  assigned  to  them  by  the 
model  builders  are  based  on  the  interpretative 
frameworks  used  by  political  risk  analysts. 

Quantitative  assessment  models  include  complex 
econometric  models  and  simpler  macropolitical  risk 
indexing  models  that  purport  to  assess  various  indi¬ 
ces  such  as  political  stability,  based  on  nominal, 
ordinal,  or  interval  variables.  Some  models  have 
been  designed  for  particular  sectors,  for  example, 
the  financial  or  energy  sector,  and  most  models  also 
include  an  element  of  qualitative  judgment. 

The  main  qualitative  techniques  are  judgmental 
forecasting,  for  example,  the  so-called  Delphic 
method  or  accumulation  of  expert  opinion  under 
controlled  conditions  and  involving  expert  feed¬ 
back.  Informal  brainstorming  between  experts  is 
also  used,  especially  where  time  is  of  the  essence.  A 
more  systematic  scenario  model  may  be  used  to 
identify  key  assumptions  and  key  drivers  to  then 
construct  several  alternative  futures  within  different 


time  frames  and  to  estimate  the  likelihood  of  dif¬ 
ferent  outcomes  and  their  impact  on  particular 
business  concerns.  Political  risk  analysis  aims  to 
provide  insight  into  where  a  business  needs  to 
intervene  in  the  political  process  if  it  wants 
to  change  the  environment,  mitigate  its  potential 
risks,  and  maximize  its  potential  opportunities. 

Heinrich  Matthee 
Black  Hall  College 
London,  United  Kingdom 

See  also  Risk  and  Public  Policy;  Stability 

Further  Readings 

Brink,  C.  H.  (2004).  Measuring  political  risks:  Risks  to 
foreign  investment.  Aldershot,  UK:  Ashgate. 

Howell,  L.  D.  (Ed.).  (200f).  The  handbook  of  country 
and  political  risk  analysis  (3rd  ed.).  East  Syracuse, 

NY:  PRS  Group. 

Kobrin,  S.  J.  (1979).  Political  risk:  A  review  and 
reconsideration.  Journal  of  International  Business 
Studies,  10(1),  67-80. 

Moran,  T.  H.  (Ed.).  (2001).  International  political  risk 
management:  Exploring  new  frontiers.  Washington, 
DC:  World  Bank. 

Rogers,  J.  (Ed.).  (2003).  Global  risk  assessments:  Issues, 
concepts  and  applications.  Book  5.  Riverside,  CA: 
Global  Risk  Assessments. 


Political  Science 


See  Introduction  to  Political  Science 
(Volume  1) 


Political  Science, 

International 

Institutionalization 


Contemporary  political  science  has  its  ancient 
roots  in  the  legislations,  teachings,  and  writings  of 
Greek  and  Roman  statesmen,  travelers,  historians, 
and  philosophers.  The  issues  that  concerned  them, 
their  reflections,  and  their  advice  dominated 
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European  thinking  on  politics  to  the  Enlighten¬ 
ment,  enriched  by  medieval,  Renaissance,  and 
early  modern  authors.  They  are  still  considered 
relevant  by  a  substantial  number  of  political  scien¬ 
tists — not,  quite  clearly,  by  a  majority.  Political 
science  began  to  be  recognized  as  a  legitimate  aca¬ 
demic  discipline,  on  the  same  plane  as  history  and 
economics,  and  later  sociology  and  psychology,  in 
the  second  half  of  the  19th  century,  in  Europe  and 
the  United  States,  and  with  surprising  speed  in  the 
latter  country.  This  institutional  recognition  is 
considered  first.  The  second  part  of  this  entry  deals 
with  the  birth,  programs,  and  impact  of  the  inter¬ 
national  institutions,  communication  media,  and 
exchanges  that  have  shaped  the  present  political 
science  community  since  the  end  of  World  War  II. 
This  account  draws  on  an  extensive  literature,  too 
abundant  to  be  quoted,  and  also  on  the  author’s 
close  connection  with  the  International  Political 
Science  Association  (IPSA)  since  1952. 

Institutional  Institutionalization 
Through  Recognition 

The  first  professorships  and  chairs  dealing  with 
politics  were  created  in  the  Netherlands,  at  the 
University  of  Leiden  (1613);  in  Sweden,  at  the 
University  of  Uppsala  (1627);  and  at  the  Abo 
Akademi  (1640),  now  in  Finland.  Several  profes¬ 
sorships  and  departments,  including  in  their  title 
political  science  or  government,  were  created  in  the 
United  States  in  the  second  half  of  the  19th  cen¬ 
tury,  starting  in  1857  at  Columbia  University.  In 
Europe,  political  topics  started  to  be  taught  at 
about  the  same  time,  as  part  of  the  training  of 
future  civil  servants  and  members  of  the  political 
elite.  In  France,  the  Ecole  libre  des  Sciences 
Politiques  [Free  School  of  Political  Sciences],  set  up 
in  1872  as  a  private  institution  in  reaction  to 
France’s  defeat  in  1871  in  the  war  with  Prussia, 
used  in  its  name  the  plural,  thus  including  the  study 
of  politics  in  the  social  sciences.  The  Ecole  inspired 
the  creation  of  the  Columbia  School  of  Political 
Science  (1880)  and  the  London  School  of  Economics 
and  Political  Science  (1895),  both  of  which  used 
the  singular;  similar  institutions  that  followed  the 
Ecole’s  model  were  later  created  in  Belgium, 
Switzerland,  Italy,  Germany,  and  Czechoslovakia. 

The  main  impetus  came,  however,  from  the 
United  States,  where  a  substantial  number  of 


chairs  and  departments  of  political  science,  and 
also  of  government  or  public  administration,  were 
established  in  the  fourth  quarter  of  the  19th  cen¬ 
tury.  The  existence  of  an  American  community  of 
political  scientists  led  to  the  creation  in  1903  of  the 
American  Political  Science  Association,  the  first  of 
its  kind,  and  of  the  American  Political  Science 
Review  (1905).  National  political  science  associa¬ 
tions  were  later  created  in  Canada  (1913),  in  con¬ 
junction  with  economics,  and  in  China  (1932), 
Finland  (1933),  and  India  (1938). 

In  Europe,  with  few  exceptions,  political  science 
topics  were  investigated,  analyzed,  and  taught  in 
the  1920s  and  1930s  by  journalists  and  civil  ser¬ 
vants  and  by  academics  from  disciplines  such  as 
history,  public  and  constitutional  law,  sociology, 
geography,  and  philosophy.  Politics  was  meant  to 
provide  flesh  to  the  abstract  approaches;  thus,  the 
study  of  political  parties  and  elections  added  reality 
to  the  traditional  state  doctrine  and  constitutional 
law.  Analysis  of  political  ideologies  led  to  the  study 
of  political  movements  and  revolutions,  for  which 
traditional  political  philosophy  did  not  account. 
Political  science  was  not  recognized  as  a  discipline 
and  certainly  not  as  a  “science.”  At  best,  political 
topics  were  seen  as  belonging  to  an  interdisciplin¬ 
ary  field  and  politics  as  a  “crossroads”  discipline. 

At  the  same  time,  in  the  United  States,  partly 
under  the  influence  of  American  sociologists,  social 
psychologists,  and  empirically  minded  economists, 
political  scientists  were  increasingly  attracted  by 
theoretical  rigor,  quantitative  analysis,  and  system¬ 
atic  comparisons.  The  influx  of  German  and 
Central  European  social  scientists,  including  refu¬ 
gee  scholars  before  and  during  World  War  II,  con¬ 
tributed  to  that  transformation.  The  study  of  mass 
phenomena,  such  as  elections,  public  opinion,  com¬ 
munications,  authoritarian  regimes,  and  interna¬ 
tional  relations,  justified  innovative  approaches, 
enriched  by  the  views  and  the  experience  of  the 
refugees,  who  found  a  hospitable  haven  in  American 
universities  at  a  time  when  Europe  was  isolated  and 
European  social  science  stifled,  when  not  sup¬ 
pressed,  except  in  the  United  Kingdom  (UK).  Cross¬ 
breeding  was  important,  as  these  European  scholars 
discovered  the  unfamiliar  realities  of  American 
politics  and  America’s  political  traditions. 

When  the  war  ended,  emigre  social  scientists, 
some  of  whom  had,  at  U.S.  and  Canadian  universi¬ 
ties,  embraced  political  science,  were  invited  to 
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teach  at  universities  of  their  home  countries, 
together  with  native  American  colleagues,  and  laid 
the  ground  for  the  recognition  of  their  discipline  in 
Germany,  Austria,  France,  Italy,  and  many  other 
countries.  Their  courses  and  some  of  the  research 
fields  that  they  opened  up  or  investigated  in  new 
ways  contributed  greatly  to  interest  in  political  sci¬ 
ence  as  a  discipline.  European  graduate  students 
crossed  the  Atlantic,  professors  met  foreign  col¬ 
leagues  at  professional  conferences,  and  the  gap 
was  progressively  closed.  Their  contributions  to  a 
better  and  richer  understanding  of  politics  and  the 
use  of  more  rigorous  approaches  and  research 
methods  were  increasingly  accepted,  not  without 
resistance,  however,  from  traditional  scholars.  The 
same  applied  to  Asian  countries  such  as  Japan  and 
later  Taiwan  and  South  Korea. 

The  recognition  of  political  science  as  a  disci¬ 
pline  outside  of  the  United  States  was  generally  a 
slow  process.  It  depended  very  much  on  the 
strength  of  contrary  academic  traditions,  on  the 
sympathy  or  hostility  of  scholars  in  other  fields  of 
learning,  and  on  the  institutional  setting.  It  was 
understandingly  more  difficult  in  educational  sys¬ 
tems  where  decisions  concerning  the  curricula  and 
recruitment  procedures  were  made  at  the  ministe¬ 
rial  level  than  in  those  where  universities  enjoyed 
substantial  autonomy;  this  was  more  rapid  in  the 
United  Kingdom  and  in  countries  where  the 
knowledge  of  English  made  for  easier  and  more 
intense  exchanges  and  access  to  the  relevant  politi¬ 
cal  science  works.  The  UK  itself,  however,  was  for 
many  years  a  hotbed  of  such  resistance. 

Thus,  even  in  countries  where  political  science 
topics  were  commonly  analyzed  and  taught,  the 
obstacles  to  recognition  proved  difficult  to  over¬ 
come.  In  the  UK,  with  its  long  and  glorious  tradi¬ 
tion  of  political  inquiry  and  reflection,  there  was 
thus  still  no  political  science  department  at  the 
close  of  the  1940s,  and  it  is  significant  that  the 
discipline’s  academic  association,  founded  in  1950, 
is  still  called  the  Political  Studies  Association  of  the 
UK  and  its  official  journal,  Political  Studies  (1953); 
the  rival  British  Journal  of  Political  Science  was 
launched  only  in  1971.  Political  topics  were  long 
taught  by  historians,  sociologists,  and  sometimes 
very  eminent  constitutional  lawyers  and  political 
philosophers,  even  at  the  London  School  of 
Economics  and  Political  Science,  rather  than  by 
political  scientists. 


The  situation  was  partly  similar  in  France,  with 
an  additional  institutional  obstacle:  While  in  the 
UK,  the  Scandinavian  countries,  or  the  Netherlands 
universities  enjoyed  great  autonomy,  French  uni¬ 
versities  had  only  a  nominal  existence  until  1968; 
higher  education  was  administered  by  a  govern¬ 
ment  department  and  organized  at  the  level  of 
facultes.  Of  the  various  social  sciences,  history  and 
sociology  as  well  as  philosophy  and  geography 
belonged  to  the  Facultes  des  Lettres  (schools  of 
liberal  arts),  and  law  and  economics  to  the  Facultes 
de  Droit  (law  schools).  The  proponents  of  modern 
political  science  were  thus  divided  between  two 
separate  schools  in  the  universities  in  which  they 
were  active.  Introductory  political  science  courses, 
which  became  mandatory  in  the  law  schools’  cur¬ 
riculum  in  the  1950s,  were  taught  mainly  by  pro¬ 
fessors  of  public  law  who  had  had  no  training  in 
political  science  and  often  not  much  respect  for  it. 
For  many  years  after  1945,  the  main  institutions 
that  developed  both  teaching  and  research  in 
political  science  were  the  Paris  Institut  d’Etudes 
Politiques  [Institute  of  Political  Studies]  and  the 
Fondation  Nationals  des  Sciences  Politiques 
[National  Foundation  of  Political  Science],  the 
heirs  of  the  Ecole  libre  des  Sciences  Politiques, 
jointly  known  as  Sciences  Po,  where  research  cen¬ 
ters  on  French  politics,  area  studies,  and  interna¬ 
tional  relations  were  created  in  the  late  1950s  and 
the  1960s.  The  resistance  of  the  law  schools, 
finally  overcome  by  public  law  professors  who 
considered  themselves  political  scientists,  pre¬ 
vented,  until  1971,  the  creation  of  political  science 
doctorates  and  the  adoption  of  a  procedure  for  the 
recruitment  of  political  science  professors.  With  a 
distinct  chronology,  conditions  were  not  very  dif¬ 
ferent  in  Germany,  Italy,  and  Spain,  unlike  the  UK, 
where  academic  autonomy  allowed  a  spectacular 
development  of  disciplinary  teaching  and  research, 
with  ever  more  professorships  and  departments 
attracting  an  increasing  number  of  students. 

American  political  science  has  remained  domi¬ 
nant,  thanks  to  the  number  and  variety  of  U.S. 
university  institutions  and  the  recognition  of  the 
relevance  of  the  discipline  for  nonacademic  careers. 
It  has  been  supported  by  the  major  foundations 
concerned  with  the  development  of  the  social  sci¬ 
ences  and  by  the  creation  of  cooperative  institu¬ 
tions  such  as  the  Interuniversity  Consortium  for 
Political  and  Social  Research  established  in  1962 
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at  the  University  of  Michigan,  which  has  provided 
access  to  a  huge  international  collection  of  social 
science  data.  The  annual  American  National 
Election  Studies,  also  based  at  the  University  of 
Michigan,  and  since  1997  considered  a  national 
resource  by  the  National  Science  Foundation,  have 
provided  since  1948  invaluable  data  on  the  elector¬ 
ate  and  public  opinion.  Institutions  linked  to  the 
military,  such  as  the  RAND  Corporation,  have 
contributed  to  the  development  of  strategic  studies. 
Many  new  theoretical  approaches  and  method¬ 
ological  innovations  or  improvements  have  thus 
originated  in  the  United  States  and  spread  every¬ 
where.  Some  proved  to  be  fads,  but  the  “behavioral 
revolution”  of  the  1960s,  which  emphasized  rigor¬ 
ous  analysis  of  political  processes — sophisticated 
treatment  of  quantitative  data,  comparative  analy¬ 
sis,  and  the  study  of  new  fields — decisively  sup¬ 
ported  by  nongovernmental  bodies  such  as  the 
Social  Science  Research  Council  and  the  Ford  and 
Rockefeller  Foundations,  has  had  a  lasting  influ¬ 
ence,  despite  its  excesses.  A  large  number  of 
increasingly  specialized  journals  were  created;  the 
advent  of  the  information  technology  revolution, 
which  has  transformed  both  research  and  teaching, 
resulted  also  in  what  many  political  scientists  con¬ 
sider  an  unhealthy  fragmentation  of  the  discipline. 

By  the  1960s,  it  became  obvious  that  for  its 
acceptance  and  development,  political  science 
needed  an  environment  that  favored,  or  at  the  very 
least  allowed,  freedom  of  investigation  and  thought. 
Authoritarian  regimes  do  not  encourage  or  even 
tolerate  analyses  that  highlight  their  shortcomings. 
The  political  science  map  coincided  very  clearly 
with  that  of  democracy.  But  there  were  hidden 
exceptions:  The  rulers  of  powers  such  as  the  Soviet 
Union  understood  that  for  policy-making  pur¬ 
poses,  they  needed  reliable  information  and  analy¬ 
ses.  Strongly  controlled  centers  were  set  up,  where 
broad  access  to  foreign  publications  was  provided. 
The  collapse  of  communism  revealed  their  exis¬ 
tence,  and  starting  in  the  1990s,  there  appeared  in 
many  countries  of  the  former  Soviet  bloc  political 
science  journals  and  books  of  a  quality  close  to  the 
internationally  recognized  standards.  A  new  gen¬ 
eration  of  well-informed  political  scientists,  secretly 
self-trained,  became  visible.  Political  science  began 
to  be  taught  in  universities,  and  the  map  changed 
rapidly.  The  same  phenomenon  had  earlier  hap¬ 
pened  in  European  countries  such  as  Spain  and 


Portugal  and  in  various  Fatin  American  countries, 
where  dictatorships  had  prevented  the  open  devel¬ 
opment  of  political  science.  High-quality  scholars, 
who  had  found  refuge  in  the  United  States  and 
Europe,  went  back  when  conditions  changed  and 
contributed  to  the  academic  acceptance  and  devel¬ 
opment  of  their  discipline. 

International  Institutionalization  Through 
Organizations  and  Associations 

A  new  and  important  factor  of  institutionalization 
at  the  international  level  was  the  creation  in  1949 
of  the  IPSA,  to  which  most  of  this  second  section 
will  be  devoted. 

Before  World  War  II,  political  science  was 
highly  developed,  recognized,  and  taught  in  the 
United  States  in  a  majority  of  colleges  and  in  most 
research  universities  at  the  graduate  level.  It  was 
also  taught  under  various  names  and  in  various 
forms  in  certain  European  and  Asian  countries. 
There  were,  however,  almost  no  organized  inter¬ 
national  contacts  other  than  visits  and  exchanges 
of  scholars  and  students.  The  Political  Science 
Congress  held  in  Paris  in  1900  was  not  followed 
by  any  large  international  meetings  except  for  a 
few  regional  ones,  such  as  the  Scandinavian 
Political  Science  Congress  held  in  Stockholm  in 
1930. 

The  only  organization  of  any  importance  was 
active  in  the  field  of  international  relations. 
Sponsored  by  the  Teague  of  Nations’  International 
Institute  of  Intellectual  Cooperation  (IIIC),  located 
in  Paris,  a  Conference  of  Institutions  for  the 
Scientific  Study  of  International  Relations  had 
been  created  in  1928;  it  changed  its  name  in  1933 
to  International  Studies  Conference  (ISC).  The  ISC 
used  for  its  operations  the  Secretariat  of  the  IIIC 
and  held  annual  conferences,  the  last  of  which  was 
held  on  the  eve  of  the  war,  on  August  30,  1939.  It 
was  unable  to  survive  the  postwar  creation  of  the 
broader  IPSA. 

A  few  other  bodies  had  existed.  The  International 
Institute  of  Political  and  Social  Sciences  in  their 
Application  to  Countries  With  Different  Civiliza¬ 
tions  did  survive  but  devoted  itself  mostly  to  the 
study  of  colonized  areas.  The  International  Institute 
of  Political  and  Constitutional  History,  renamed 
the  International  Academy  of  Political  Science  and 
Constitutional  History,  was,  despite  efforts  by 
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some  of  its  leading  members,  active  mainly  in  the 
field  of  history. 

The  organization  responsible  for  the  concep¬ 
tion  and  birth  of  IPSA  was  the  United  Nations 
Educational,  Scientific  and  Cultural  Organization 
(UNESCO).  Implementing  a  recommendation 
made  at  its  very  first  General  Conference  (Paris, 

1946) ,  the  second  General  Conference  (Mexico, 

1947)  instructed  the  Director  General  to  promote 
a  study  of  political  science.  The  UNESCO  Social 
Science  Department  had  singled  out  that  discipline 
because  political  phenomena  were  considered 
major  factors  of  tensions  and  because  intellectual 
cooperation  was  hopefully  seen  as  contributing  to 
the  maintenance  of  peace. 

The  process  soon  gained  impetus.  A  massive 
international  study  of  political  science,  or  its 
absence,  was  undertaken  by  UNESCO  in  1948 
under  the  leadership  of  an  American  scholar, 
William  Ebenstein,  of  Princeton  University.  The 
huge  resulting  volume,  including  no  less  than  51 
national  reports,  was  published  by  UNESCO  in 
1950.  At  a  coordination  meeting  held  in  1948  at 
UNESCO’s  headquarters  in  Paris,  political  science 
was  tentatively  defined  as  comprising  political 
theory;  central,  regional,  and  local  institutions; 
parties,  groups,  and  public  opinion;  international 
policy  and  organization;  and  international  law.  It 
was  also  decided  to  call  a  conference  to  examine 
the  possibility  of  creating  an  IPSA. 

At  that  conference,  held  in  September  1949,  at 
the  initiative  of  UNESCO’s  Director  General,  23 
scholars — political  scientists,  political  theorists, 
sociologists,  constitutional  lawyers,  and  histori¬ 
ans — considered  and  approved  a  draft  constitution 
for  the  future  body,  constructed  as  a  federation  of 
national  associations,  and  elected  a  provisional 
executive  committee.  UNESCO  was  similarly 
active  in  the  creation  of  associations  in  other  social 
sciences  and  the  International  Social  Science 
Council. 

IPSA  thus  came  into  existence  officially  at  the 
end  of  1949,  as  a  “foreign  association,”  under 
French  law  and  regulations,  with  its  legal  seat  in 
Paris.  Its  founders  were  four  associations,  those 
already  active  in  the  United  States,  Canada,  and 
India,  plus  a  French  association  set  up  largely  for 
that  purpose.  Its  provisional  executive  committee 
elected  as  chairman  Quincy  Wright,  a  professor  at 
the  University  of  Chicago,  and  a  Frenchman, 


Franqois  Goguel,  the  secretary-general  of  the 
Fondation  Nationale  des  Sciences  Politiques,  as 
executive  secretary.  A  first  World  Congress, 
attended  by  80  participants,  was  held  in  Zurich  in 
1950  and  the  second  one  at  The  Hague  in  1952. 
Subsequent  congresses  were  organized  triennially, 
at  first  in  Europe  only,  then  in  more  exotic  places 
such  as  Montreal,  Rio  de  Janeiro,  Washington, 
D.C.,  and  Buenos  Aires.  From  1997  (Seoul)  to 
2008  (Santiago  de  Chile),  all  congresses  were  held 
outside  Europe.  Attendance  has  grown  to  about 
2,000  participants,  with  IPSA  funding  many  travel 
grants  for  young  scholars  and  scholars  from 
developing  countries. 

Over  its  60  years  of  existence,  the  number  of 
IPSA’s  collective  members  (national  associations) 
has  grown  from  the  initial  4  to  more  than  40. 
IPSA’s  existence  has  contributed  to  the  creation  of 
associations  in  almost  all  Western  European 
countries;  in  a  few  cases,  such  as  that  of  Italy, 
IPSA  has  had  to  recognize  a  new  association  to 
replace  a  previous  collective  member  considered 
too  traditional  by  the  younger  generation  of 
political  scientists. 

Never  a  Cold  War  instrument,  IPSA  empha¬ 
sized  inclusion  and  dialogue,  and  the  rulers  of  the 
Soviet  bloc  preferred  to  have  delegates  participate 
in  it,  as  in  UNESCO.  A  Polish  association  was  thus 
created  as  early  as  1950  and  was  soon  admitted  to 
IPSA.  Associations  were  later  set  up  in  Yugoslavia 
(1954),  the  Soviet  Union  (1960),  Czechoslovakia 
(1964),  Bulgaria,  Hungary,  and  Romania  (1968). 

Against  strong  opposition,  especially  in  the 
United  States,  IPSA  stood  by  its  decision  to  hold  its 
1979  Congress  on  strict  conditions  in  Moscow, 
despite  the  Soviet  invasion  of  Afghanistan  at  the 
end  of  1978.  It  was  later  acknowledged  that  the 
Moscow  Congress  had  helped  young  political  sci¬ 
entists  establish  the  discipline  both  in  the  Soviet 
Union  and  in  several  satellite  countries.  The  “mis¬ 
sionary”  purpose  of  IPSA  also  inspired  assistance 
to  political  scientists  in  Asia,  Africa,  and  Latin 
America,  especially  where  dictatorial  regimes  were 
hostile  to  political  analysis. 

A  major  problem  arose  when  Taiwan  was,  in 
1989,  admitted  as  a  collective  member,  under  the 
compromise  name  of  Chinese  Political  Science 
Association  (Taiwan),  although  its  association 
claimed  to  be  the  legitimate  heir  of  the  Chinese 
Association  created  in  1932.  The  compromise 
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accepted  by  the  delegate  of  the  Chinese  Political 
Science  Association  (People’s  Republic)  was  soon 
rejected,  and  as  of  2010,  the  Mainland  Chinese 
Political  Science  Association  had  not  returned  to 
IPSA. 

Starting  in  the  mid-1960s,  IPSA  was  strength¬ 
ened  and  enriched  in  its  operations  by  the  creation 
of  Study  Groups  and  Research  Committees,  later 
merged  under  the  second  denomination.  Number¬ 
ing  about  50,  the  committees  meet  at  each 
Congress  and  at  least  once  between  congresses, 
with  participation  by  scholars  from  different 
countries.  They  deal  with  both  traditional  topics 
and  innovative  issues  and  approaches.  Several 
research  committees  have  their  own  journals. 
Associate  members — about  100  political  science 
departments  and  research  organizations — and 
individual  members — nearly  2,500 — find  IPSA’s 
activities  valuable. 

IPSA’s  first  journal,  International  Political 
Science  Abstracts,  launched  in  1951,  2  years  after 
IPSA’s  creation,  and  prepared  since  then  in  Paris  at 
Sciences  Po,  is  still  the  standard  reference  tool  of  the 
discipline.  From  1951  to  1962,  it  published  annu¬ 
ally,  in  four  issues,  about  1,400  abstracts  of  journal 
articles.  It  now  provides  each  year  more  than  8,000 
abstracts,  in  six  issues,  reflecting  the  explosion  in 
the  number  of  journals;  it  has  since  2007  been  pub¬ 
lished  by  SAGE  Publications.  Its  database  goes  back 
to  1951;  at  the  end  of  2009,  it  contained  270,000 
abstracts  and  is  distributed  by  both  EBSCO  and 
Ovid.  The  second,  started  in  1980,  is  the 
International  Political  Science  Review,  also  pub¬ 
lished  by  Sage,  which  has  been  recognized  as  one  of 
the  leading  journals  in  the  discipline.  In  recent  years, 
the  two  journals  have  also  contributed  substantially 
to  IPSA’s  budget.  To  the  journals  must  be  added  the 
IPSA  Portal,  created  and  maintained  at  the 
University  of  Naples,  which  selects,  describes,  and 
evaluates  the  top  300  political  science  websites,  and 
IPSA’s  newsletter,  Participation. 

Ever  since  its  inception,  IPSA  has  sought  to 
participate  in  the  diffusion  of  political  science  as  a 
rigorous  discipline  and  also  to  serve  as  a  bridge 
between  American  political  science,  in  many  ways 
still  central,  and  the  rest  of  the  world  while 
respecting  the  distinct  national  traditions  and  dif¬ 
ferent  approaches,  sometimes  strongly  critical  of 
those  dominant  in  the  United  States.  Until  rela¬ 
tively  recently,  the  landscape  seemed  pretty  clear, 


with  IPSA  playing  the  leading  role.  It  has  since 
then  become  increasingly  complex  and  somewhat 
more  competitive.  Other  international  bodies 
have  appeared  in  some  regions.  American  organi¬ 
zations  also  sometimes  supplement  IPSA  or  com¬ 
pete  with  it. 

Regional  associations,  of  somewhat  uncertain 
strength,  have  in  recent  decades  been  organized  in 
Africa  (1973)  and  Asia-Pacific  (1983).  In  Europe, 
the  creation  in  1970  of  the  European  Consortium 
for  Political  Research,  based  at  the  University  of 
Essex  and  assisted  in  its  first  years  by  the  Ford 
Foundation,  has  proved  spectacularly  successful. 
From  8  founding  institutions,  it  has  grown  to 
more  than  300,  together  with  associate  institu¬ 
tions  throughout  the  world.  Its  workshops,  con¬ 
ferences,  and  summer  schools  have  contributed 
greatly  to  the  emergence  of  a  European  political 
science  community,  and  the  reputation  of  its  jour¬ 
nals — the  European  Journal  of  Political  Research, 
started  in  1973,  to  which  have  been  added 
European  Political  Science  and  the  European 
Political  Review — and  its  three  book  series  is  envi¬ 
able.  It  entertains  warm  relations  with  IPSA. 

More  recently,  in  2007,  23  European  national 
and  international  associations  have  joined  in  a 
European  Confederation  of  Political  Science 
Associations,  whose  main  aim,  according  to  its 
website,  is  to  “promote  the  interests  of  the  disci¬ 
pline”  and  “make  it  more  meaningful  in  public 
debate  and  policy-making.”  The  Confederation 
has  a  promising  future,  as  it  reflects  the  vitality  of 
political  science  associations  in  some  countries 
where  none  existed  until  recently  as  well  as  that  of 
the  established  ones. 

Challenges  have  also  come  from  the  United 
States.  Thus,  in  the  field  of  international  relations, 
the  U.S.  International  Studies  Association  and  its 
journals  have  progressively  become  representative 
of  International  Relations  scholars  throughout  the 
world.  The  same  can  be  said  of  area  studies  orga¬ 
nizations  such  as  the  U.S.  Latin  American  Studies 
Association,  which  attracts  and  welcomes  many 
Latin  American  political  scientists. 

Finally,  the  powerful  American  Political  Science 
Association,  which  numbered  more  than  15,000 
members  in  2010,  at  least  twice  as  many  as  there 
are  political  scientists  in  the  rest  of  the  world,  has 
increasingly  become  a  magnet  for  foreign  political 
scientists.  Its  conventions  have  the  dual  function  of 
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a  forum  for  scholarly  exchanges  and  an  unrivalled 
job  market  in  the  discipline.  They  attract  many 
participants  from  outside  the  United  States,  some¬ 
times  assisted  by  travel  grants. 

The  international  institutionalization  of  politi¬ 
cal  science  may  thus  be  at  a  crossroads.  IPSA  has 
proved  its  ability  to  play  a  major  role  in  it  by  con¬ 
tinuously  adapting  to  a  changing  landscape.  Thus, 
it  has  of  necessity  practically  forsaken  French  as  its 
second  language  for  its  meetings  and  publications. 
Its  secretariat,  which  until  recent  years  used  to 
move  to  the  city  of  residence  of  its  secretary- 
general  whenever  that  changed,  is  now  perma¬ 
nently  established  in  Montreal,  where  the  staff  is 
bilingual.  It  also  maintains  IPSA’s  archives,  now 
assembled  in  a  single  location. 

In  this  increasingly  complex  and  competitive 
scholarly  landscape,  new  activities  and  organiza¬ 
tional  schemes  will  need  to  be  experimented  with. 
Major  gaps  will  need  to  be  filled  in  Africa,  Latin 
America,  and  Asia-Pacific,  concerning  obviously 
China  but  also  countries  such  as  Indonesia  and 
the  Philippines.  The  relevance  and  the  vitality  of 
the  discipline  seem,  however,  to  be  ensured  for 
many  years  to  come,  certainly  much  better  than  a 
generation  ago. 

Serge  Hurtig 

International  Political  Science  Association 

Paris,  France 
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Political  socialization  is  a  form  of  socialization. 
An  individual’s  biological  birth  is  followed  by  a 
second  phase,  described  as  a  “social-cultural 
birth,  ”  which  turns  him  or  her  into  a  social  being. 
This  procedure  is  called  socialization.  It  is  a  mul¬ 
tidimensional  process  that  significantly  shapes  the 
relation  of  the  individual  toward  the  social  envi¬ 
ronment.  A  great  number  of  people  (e.g.,  educa¬ 
tors),  institutions  (e.g.,  schools),  and  factors  in  the 
social  environment  (e.g.,  mass  media)  are  involved. 
In  the  following,  the  major  forces  shaping  this 
process  in  modern  societies  are  discussed. 

Socialization  can  be  defined  as  the  following: 

•  a  process  of  forming  an  individual  into  a 
sociocultural  personality  through  the  shaping  of 
those  cognitive,  linguistic,  motivational, 
emotional-affective,  and  similar  features  that 
enable  persons  to  understand  the  social 
environment  and  to  participate  in  its  shaping 
(primary  socialization); 

•  a  process  of  imparting  social  values,  norms,  and 
abilities  for  the  accomplishment  of  social 
existence,  occurring  outside  the  family,  mainly  in 
peer  groups  and  schools  (secondary 
socialization);  and 

•  a  lifelong  process  of  social  experience,  on  the 
basis  of  primary  and  secondary  socialization 
(lifelong  learning). 

In  a  wider  definition  of  politics,  all  processes  of 
socialization  can  be  seen  as  political.  The  separa¬ 
tion  of  political  and  general  socialization  is  unnec¬ 
essary  if  politics  is  defined  by  all  relations  of  social 
power  (including  the  family,  the  workplace,  etc.). 
If  the  members  of  a  society  are  subjected  to  such 
relations  and  are  thereby  shaped  politically,  social¬ 
ization  always  develops  political  effects  as  well. 

In  a  narrower  sense,  political  socialization  is 
regarded  as  a  part  of  the  general  process  of  social¬ 
ization.  This  contains  purposive  and  functional 
dimensions.  The  purposive  part  regards  political 
socialization  in  the  context  of  pedagogical  institu¬ 
tions  and  methods  (education,  tuition).  The  func¬ 
tional  side  looks  at  political  goals,  values,  and 
norms  that  are  being  socialized  and  form  individu¬ 
als  into  political  beings. 
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Political  socialization  is  not  free  of  values.  It  is 
defined  as  the  entirety  of  all  learning  processes  that 
form  a  person  into  a  political  being  (citizen).  The 
learning  processes  can  be  measured  in  terms  of 
how  far  they  promote  or  impede  political  maturity. 
Maturity  in  this  context  means  all  abilities  neces¬ 
sary  to  meaningfully  participate  politically — the 
competence  of  participation.  Political  socialization 
gains  its  specific  political  content  from  an  under¬ 
standing  of  democracy,  which  underlies  this  value 
orientation.  The  aim  of  socialization  is  maturity,  in 
the  sense  of  self-determination,  and  political  deci¬ 
sion  making  and  responsibility.  Democracy 
determines  the  political-social  process  in  which 
politically  mature  individuals  develop.  The  struc¬ 
tural-organizational  basis  of  this  ability  lies  in  the 
institutions  and  policies  of  political  socialization, 
whereas  the  subjective-personal  part  is  found  in  the 
competence  of  participation.  This  is  perceived  as  a 
goal  of  political  learning.  The  question  of  how  this 
goal  of  learning  can  be  founded  theoretically  and 
how  it  can  be  empirically  achieved,  for  example, 
not  only  in  political  education  but  also  in  processes 
of  participation,  pervades  considerable  areas  of 
work  in  the  research  on  political  socialization. 

Political  Socialization  as 
Political  Learning/Education 

Political  education  is  a  historical  phenomenon. 
“Political”  thinking,  learning,  and  acting  (in  a 
sense  of  politics  as  a  struggle  or  competition  for 
influence  and  power)  was  only  established  when  a 
sphere  of  civil  life  developed  besides  the  state. 
Civil  life  needed  and  claimed  individual  freedom 
from  governmental  and  absolutistic-feudal  power, 
beginning  in  Europe  in  the  18th  century.  The 
struggle  for  autonomy  of  the  newly  established 
class — the  middle  classes — was  borne  by  ideas  of 
the  Enlightenment  (“liberty,  equality,  fraternity” 
as  the  slogan  of  the  French  Revolution).  The  aim 
was  to  replace  feudal  arbitrariness  of  power  with 
a  constitutional  framework  that  ensured  freedom 
and  equality  before  the  law.  The  guarantee  of 
equal  rights  for  all  citizens  made  political  learning 
necessary  for  the  first  time.  Political  learning  in  its 
beginnings  was  more  focused  on  the  idea  of  civil 
freedom.  Not  long  after  the  collapse  of  the  medi¬ 
eval  social  order,  political  education  turned  into  a 
problem  from  a  historical  perspective. 


The  social-historical  background  of  political 
learning  reveals  different  phases  of  development: 

•  Education  of  classes  in  the  feudal  society:  Being 
politically  educated  meant  knowing  the  rights 
and  duties  of  one’s  class. 

•  Education  toward  the  ideal  of  civil  liberty  in  the 
early  middle-class  society:  It  ought  to  serve  as  an 
instrument  for  the  enforcement  of  civil  rights  as 
provided  by  the  constitutional  state. 

•  Means  of  controlling  the  fourth  estate  of  the 
society  in  the  era  of  1 9th-century  capitalism:  In 
this  phase,  political  education  was  an 
intellectual-political  weapon  for  the  suppression 
of  the  upcoming  industrial  proletariat; 

•  Political  education  in  nationalistic-authoritarian 
systems:  Political  education  is  focused  on  the 
subordination  of  all  people  to  the  state. 

•  Education  toward  a  racist  ethnic  community 
under  National  Socialism  in  Germany: 

Education  served  as  a  means  for  the  forming  of 
the  national-socialist  man. 

•  Education  toward  the  “new  man”  under 
communist  systems:  It  helped  people  live 
harmoniously  in  a  classless  society. 

•  Education  toward  democratic  behavior  in 
contemporary  democratic  states:  Political 
learning  takes  part  in  the  challenge  of  enabling 
the  development  and  stability  of  democratic 
conditions. 

Political  learning  continuously  tries,  in  its  con¬ 
tents  as  well  as  in  its  methods,  to  consolidate  or  to 
criticize  existing  forms  of  authority.  In  a  democracy, 
its  objectives  include  different  concepts  of  democ¬ 
racy.  Political  learning  that  is  focused  on  the  impart¬ 
ing  of  political  decision  making  and  responsibility  is 
interested  in  more  than  just  the  imparting  of  knowl¬ 
edge.  It  is  geared  toward  certain  states  of  awareness, 
abilities  for  political  action,  and  attitudes  and  moti¬ 
vations  for  a  democratic  political  commitment. 

The  achievements  of  socialization  on  a  microdi- 
dactic  level  (political  education/instruction)  are 
limited,  though.  They  are  not  able  to  compensate 
the  deficits  of  the  political  system  and  the  structural 
basis  of  political  socialization.  However,  it  is  possi¬ 
ble  to  implement  the  goals  of  political  socialization 
on  a  macrodidactic  level.  Political  learning  can  then 
be  described  as  functional.  This  is  how  it  can  be 
distinguished  from  intentional  political  education. 
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Processes  of  learning  that  shape  the  potential  of 
qualification  and  actions  of  political  decision  mak¬ 
ing  and  responsibility  are  processes  of  participation. 
Competence  in  participation  can  be  acquired 
through  participation  itself.  In  this  way,  participa¬ 
tion  serves  as  an  educational  goal  as  well  as  an  edu¬ 
cational  means.  Political  learning  takes  place  when 
political  consciousness  can  be  applied  and  political 
participation  can  be  reflected  (learning  by  doing) — 
the  basis  of  which  is  political  communication. 

Political  Socialization  Through 
Political  Communication 

Political  systems  and  their  social  environment  are 
in  constant  communicative  exchange.  The  basis  of 
such  communication  is  the  aggregation  and  con¬ 
version  of  social  interests  into  political  decisions 
(interest  intermediation)  as  well  as  the  transmis¬ 
sion  of  political  values  and  norms  into  society 
(intermediation  of  politics).  Political  communica¬ 
tion  consists  of  the  exchange  of  information  about 
politically  relevant  topics.  It  can  take  place  because 
constitutional  regulations  organize  this  exchange 
of  information  and  groups  of  citizens  take  part  in 
it.  Political  communication  can  also  take  place 
without  the  exchange  of  information  being  orga¬ 
nized  by  the  governmental  system.  Effective  politi¬ 
cal  communication  is  established  if,  from  the  side 
of  the  political  system,  there  is  a  high  degree  of 
transparency  of  its  institutions  and  decisions. 
Political  communication  is  the  basis  of  participa¬ 
tion  in  political  decision  making  and  decision  pro¬ 
cesses.  Political  communication  acts  as  a  social¬ 
izing  process  if,  in  the  system  and  the  system 
environment,  the  efficiency  of  exchange  of  infor¬ 
mation  between  institutions  and  participants  is 
ensured  by  steering  processes  (through  norms, 
culture,  sanctions,  etc.). 

Transparency,  participation,  and  efficiency  are 
regarded  as  the  “magic  triangle  of  social  science.” 
They  are  principles  of  organization  that  determine 
the  success  of  political  communication.  The  rele¬ 
vance  of  the  principle  of  participation  determines 
the  extent  to  which  political  communication 
enables  the  learning  of  participation  skills. 

As  far  as  political  communication  is  organized 
by  a  political  system,  its  institutions  act  as  agents 
for  political  socialization.  The  accomplishments 
in  socialization  are  the  result  of  the  practice  of 


organized  political  communication  between  the 
system  and  the  environment  of  the  system.  Through 
passing  on  of  prevailing  norms  and  values  and 
thereby  the  culture  of  the  system  (political  culture, 
organizational  culture,  etc.),  it  allows  the  possibil¬ 
ity  of  evaluation  of  the  system  output  (laws,  party 
platforms,  etc. )  and  ensures  the  return  of  informa¬ 
tion  (input)  into  the  system,  whereby  innovation 
(e.g.,  political  reforms)  is  made  possible.  The 
socialization  of  the  political  system  thus  increases 
the  learning  capacity  of  the  political  system  and  its 
facilities.  It  is  the  basis  for  a  successful  conveyance 
of  politics. 

Conclusion 

The  learning  capacity  of  the  political  system  and 
thereby  its  survivability  depend  on  an  organized 
exchange  of  information  between  politics  and  soci¬ 
ety  as  well  as  between  political  and  administrative 
facilities  and  citizens.  In  a  democracy,  this  exchange 
is  organized  by  parliament  and  through  intermedi¬ 
ate  actors  of  civil  society  (parties,  organizations, 
associations,  etc.).  Their  achievements  are  mea¬ 
sured  in  terms  of  how  far  they  are  able  to  exercise 
political  communication.  In  a  democracy,  the 
medium  of  political  communication  is  the  political 
public.  In  the  medium  of  the  political  public, 
socialization  obtains  its  empirical  location,  and 
from  the  political  public,  it  derives  its  normative 
power. 

Leo  Kissler 
Philipps-  Universitat  Marburg 
Marburg,  Germany 
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Political  Sociology 
as  a  Field  of  Study 


Political  sociology  deals  with  the  nexus  between 
social  and  political  life.  Traditionally,  the  political 
consequences  of  social  arrangements  are  stressed, 
but  attention  has  also  been  paid  to  the  social  con¬ 
sequences  of  political  arrangements.  Combining 
these  two  perspectives,  political  sociology  studies 
links  political  and  social  phenomena;  that  is,  it  is 
an  interdisciplinary  endeavor  combining  social 
and  political  factors  to  explain  distributions  of 
power  and  dominance  in  state  and  society.  Almost 
by  definition,  intermediary  actors  and  institutions 
such  as  interest  groups,  political  parties,  and  vol¬ 
untary  associations  play  an  important  role.  The 
main  research  topics  of  political  sociologists  are 
voting  behavior,  new  social  movements,  parties, 
civil  society,  and  interest  groups  (usually  input- 
oriented  microlevel  approaches),  on  the  one  hand, 
and  state  formation,  transformations  of  political 
systems,  and  political  reform  processes  (usually 
output-oriented  macrolevel  approaches),  on  the 
other. 

In  the  past  decades,  the  distinction  between 
state  and  society  gradually  disappeared  and  has 
been  replaced  by  a  melding  and  blending  of  politi¬ 
cal  and  social  phenomena.  The  rise  of  multina¬ 
tional  corporations,  international  organizations, 
nongovernmental  organizations  (NGOs),  and  the 
spread  of  international  interdependencies  and 
supranational  arrangements  changes  the  position 
and  nature  of  the  state,  whereas  rising  levels  of 


education,  welfare,  social  security  provisions,  secu¬ 
larization,  and  individualization  imply  similar 
changes  for  society.  As  a  consequence,  political 
sociology  as  a  field  of  study  seems  to  have  lost 
much  of  its  traditional  position  to  more  specific 
approaches.  The  first  question  to  be  dealt  with  here 
concerns  the  distinction  between  the  use  of  social 
factors  to  explain  political  phenomena  (a  “sociol¬ 
ogy  of  politics”)  and  the  explanation  of  social  phe¬ 
nomena  using  political  factors  (a  “political  science 
of  society”).  Political  sociologists  stress  the  recipro¬ 
cal  nature  of  these  relationships  and  the  need  for 
integrated  approaches.  Second,  the  main  theories 
used  in  political  sociology — modernization  theo¬ 
ries,  variants  of  Marxism,  rational  choice  theo¬ 
ries — are  presented  by  following  the  debates  since 
World  War  II.  For  each  of  these  approaches,  the 
major  goals  and  attainments  are  presented  and 
illustrated  with  a  closer  look  at  one  or  two  seminal 
works.  As  will  become  clear,  these  newer  approaches 
still  rely  on  the  presumption  that  a  distinction — or 
at  least  an  analytical  distinction — between  political 
and  social  phenomena  makes  sense.  Finally,  the 
consequences  of  the  massive  shifts  in  the  relation¬ 
ships  between  state  and  society  for  political  sociol¬ 
ogy  as  a  field  of  study  are  scrutinized. 

An  Interdisciplinary  Hybrid 

Political  sociologists  study  topics  such  as  interest 
groups,  state  formation  processes,  old  and  new 
social  movements,  class-based  power,  public  opin¬ 
ion,  elites,  trade  unions,  civil  society,  the  spread  of 
governance  practices,  and  social  and  political  par¬ 
ticipation.  What  do  those  very  diverse  subject  mat¬ 
ters  have  in  common  that  arouses  the  curiosity  of 
political  sociologists?  Ordinary  answers  to  this 
question  usually  point  to  power  and  domination  as 
core  concepts  to  draw  a  line  of  demarcation.  In 
this  way,  typical  broad  definitions  of  political  soci¬ 
ology  presented  in  social  science  encyclopedias 
refer  to  the  “operation  of  power  in  social  life”  and 
the  distribution  of  power  at  the  various  levels 
(individuals,  organizations,  communities,  coun¬ 
tries,  etc.).  Other  definitions  stress  the  “social 
causes  and  consequences  of  given  power  distribu¬ 
tions”  or  the  “study  of  power  and  domination  in 
social  relationships”  as  the  defining  characteristics 
of  political  sociology.  In  this  way,  it  could  include 
studies  of  the  distribution  of  power  in  families,  the 
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mass  media,  universities,  parliaments,  trade  unions, 
and  so  on.  These  broad  definitions  underline  the 
wide  range  of  topics  studied  by  political  sociolo¬ 
gists.  Referring  to  power  and  domination  is  cer¬ 
tainly  helpful  to  characterize  the  field  in  general 
terms,  but  the  use  of  these  terms  is  a  necessary  but 
not  a  sufficient  condition  to  define  political  sociol¬ 
ogy.  Since  virtually  every  topic  studied  by  social 
scientists  is  related  to  the  operation  and  distribu¬ 
tion  of  power,  further  specifications  are  required. 
Such  a  more  specific  definition  of  political  sociol¬ 
ogy  as  a  field  of  study  implies,  first,  a  closer  look  at 
the  distinction  between  the  political  and  the  social 
and,  second,  a  specification  of  the  presumed  causal 
directions  in  our  explanations  and  interpretations. 

Any  study  of  the  relationships  between  political 
and  social  life — between  state  and  society — is 
based  on  the  conjecture  that  these  two  areas, 
indeed,  can  be  distinguished.  Yet  in  the  history  of 
ideas,  this  is  a  fairly  recent  invention.  In  his  essay 
on  the  origins  of  political  sociology,  Walter 
Runciman  (1963)  points  to  the  national  state  and 
the  rise  of  the  bourgeoisie  in  the  18th  century  as 
the  main  factors  stimulating  a  distinction  between 
the  political  and  the  social.  Exactly  that  distinction 
made  political  sociology  possible  and  sensible,  and 
it  initiated  a  wealth  of  studies  focusing  on  the  rela¬ 
tionships  between  state  and  society  from  a  wide 
variety  of  perspectives.  More  recently,  a  distinc¬ 
tion  between  state  and  society  or  between  the 
political  and  the  social  has  become  increasingly 
problematic.  The  developments  in  the  last  few 
decades  show  a  remarkable  extension  of  politics 
and  political  power.  Besides,  the  distinction 
between  political  and  nonpolitical  activities  or 
areas  became  blurred  or  disappeared  completely. 
Yet  characterizing  political  sociology  on  the  basis 
of  a  distinction  between  social  and  political  life 
does  not  presume  that  these  two  areas  can  be  dis¬ 
tinguished  empirically  unequivocally.  In  fact,  using 
this  distinction  analytically  and  conceptually 
allows  political  sociologists  to  point  out  the  his¬ 
torical  roots  of  their  discipline  and  to  analyze 
actual  developments  in  terms  of  a  melding  and 
blending  of  political  and  social  phenomena. 

Accepting  an  analytical  and  conceptual  distinc¬ 
tion  between  social  and  political  life  gives  rise  to  a 
second  complication.  Traditionally,  specific  demar¬ 
cations  of  political  sociology  are  obtained  by 
drawing  attention  to  the  relationships  between 


state  and  society.  In  this  approach,  political  sociol¬ 
ogy  deals  with  the  ways  in  which  social  and  societal 
factors  have  an  impact  on  political  phenomena,  or 
vice  versa,  it  deals  with  the  ways  in  which  political 
factors  influence  social  phenomena.  The  first  per¬ 
spective  might  be  labeled  as  a  “sociology  of  poli¬ 
tics”  and  is  common  among  American  sociologists. 
For  instance,  the  major  aim  of  the  Political  Sociology 
Section  of  the  American  Sociological  Association  is 
to  promote  activities  of  those  concerned  with  a 
sociological  understanding  of  political  phenomena. 
The  combined  Committee  on  Political  Sociology  of 
the  International  Sociological  Association  and  the 
International  Political  Science  Association  (IPSA) 
relies  on  a  somewhat  broader  approach.  In  their 
view,  since  political  sociology  is  about  the  nexus 
between  social  and  political  life,  the  focus  is  on 
organizations  or  individual  acts  that  seek  to  take  or 
influence  political  power.  Both  approaches  share  an 
unambiguous  interest  in  the  relationships  between 
state  and  society — they  differ,  however,  in  the 
extent  to  which  the  focus  is  on  social  factors. 
Whereas  a  “sociology  of  politics”  stresses  the  rele¬ 
vance  of  social  factors  for  political  phenomena,  a 
“political  science  of  society”  reverses  the  argument. 
In  practice,  the  disciplinary  roots  of  sociologists 
and  political  scientists  appear  to  be  highly  relevant 
for  the  selection  of  one  of  these  perspectives. 

In  his  seminal  article  on  sociology,  political  sci¬ 
ence,  and  political  sociology  Giovanni  Sartori  (1969) 
explicitly  rejects  the  idea  that  political  sociology  is 
characterized  by  one  of  these  single-directional  per¬ 
spectives.  For  him,  political  sociology  is  an  interdis¬ 
ciplinary  hybrid  whose  main  goal  is  to  combine 
social  and  political  explanatory  factors  as  suggested 
by  both  sociologists  and  political  scientists.  It  is  not 
the  direction  of  impacts  presumed  in  the  analyses 
(social  factors  explaining  political  phenomena  vs. 
political  factors  explaining  social  phenomena)  but, 
more  generally,  the  nexus  between  social  and  politi¬ 
cal  life  that  defines  political  sociology  as  a  distinct 
subfield. 

As  in  every  major  area  of  the  social  sciences, 
political  sociologists  do  not  agree  on  a  straightfor¬ 
ward  definition  of  their  activities  or  the  objects  they 
study.  A  minimum  consensus,  however,  includes 
the  focus  on  power  and  domination  in  the  relation¬ 
ships  between  social  and  political  phenomena. 
Political  sociologists  use  this  last  distinction  for 
analytical  and  conceptual  purposes  and  presume  a 
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reciprocal  relationship  between  political  and  social 
life — that  is,  between  the  state  and  society.  Claims 
by  either  sociologists  or  political  scientists  to  seize 
political  sociology  as  an  exclusive  subdiscipline  are 
not  only  gratuitous  but  also  do  not  teach  us  much 
about  the  distribution  of  power  and  domination. 

The  Rise  and  Fall  of 
Modernization  Approaches 

Although  its  roots  go  back  to  the  work  of  19th- 
and  early-20th-century  social  scientists  (Karl 
Marx,  Alexis  de  Tocqueville,  Emile  Durkheim, 
Max  Weber,  and  Vilfredo  Pareto,  to  mention  only 
a  few),  the  heydays  of  political  sociology  started 
after  World  War  II.  The  spread  of  modernization 
approaches  especially  contributed  to  this  advance. 
These  approaches  consider  the  development  of 
societies  as  an  evolutionary  progressive  process 
that  is  driven  by  economic  and  technological 
forces.  Its  basic  ideas  were  spelled  out  by  early 
social  scientists  and  philosophers  (Herbert  Spencer, 
Karl  Marx).  Every  society  evolves  from  simpler  to 
more  complex  ones  and  passes  through  various 
phases  or  stages  depending  on  the  available 
resources  and  challenges  confronted.  In  this  way, 
“primitive”  societies  will  gradually  become  more 
complex,  heterogeneous,  and  differentiated. 
Political  arrangements,  especially  state  formation 
and  nation  building,  develop  accordingly  and  are 
attuned  to  the  social  requirements  of  each  evolu¬ 
tionary  stage  or  phase.  Modern  industrial 
society — with  its  capitalist  ownership,  market  allo¬ 
cation,  and  division  of  labor — establishes  the  cul¬ 
mination  of  this  progressive  evolutionary  process. 
Conceptualizing  social  and  political  developments 
in  this  way  almost  directly  leads  to  the  study  of 
reciprocal  relationships  between  social  and  politi¬ 
cal  phenomena,  that  is,  to  the  study  of  political 
sociology. 

Modernization  theories  obtained  their  strongest 
impulses  from  the  spread  of  structural-functional 
system  theory  as  a  universal  framework  for  study¬ 
ing  the  evolution  of  societies.  Mainly  following  the 
work  of  Max  Weber,  Talcott  Parsons  distinguished 
between  four  major  social  subsystems:  the  eco¬ 
nomic  system,  the  political  system,  the  community 
system,  and  the  sociocultural  system.  These  four 
subsystems  are  interdependent  and  each  contrib¬ 
utes  to  the  persistence  of  the  system  as  a  whole  by 


performing,  respectively,  four  functions:  adapta¬ 
tion,  goal  attainment,  integration,  and  latent- 
pattern  maintenance  (the  AGIL  scheme). 
Approaching  the  relationships  between  the  social 
and  the  political  on  the  basis  of  the  Parsonian  sys¬ 
tem  theory  has  many  evident  advantages.  First,  a 
universal  framework  is  presented  that  offers  (func¬ 
tional)  explanations  for  the  development  of  entire 
societies  as  an  evolutionary  process.  Furthermore, 
political  phenomena  are  handsomely  conceptual¬ 
ized  as  a  distinct  subsystem  whose  features  and 
developments  are  systematically  integrated  in  the 
system  as  a  whole.  Third,  Parsonian  system  theory 
underlines  the  progressive  character  of  the  evolu¬ 
tionary  processes  studied:  A  structural-functional 
explanation  is  offered  to  show  that  primitive  soci¬ 
eties  inevitably  will  become  more  complex,  hetero¬ 
geneous,  and  differentiated.  In  the  end,  all  societies 
will  converge  to  a  liberal-capitalist  society  of  the 
U.S.  and  Western  European  type. 

Early  examples  of  the  application  of  moderniza¬ 
tion  approaches  by  political  sociologists  can  be 
found,  for  instance,  in  the  works  of  Gabriel 
Almond,  Seymour  Martin  Lipset,  Stein  Rokkan,  or 
Robert  Dahl.  These  authors  mainly  focus  on  the 
consequences  of  social  arrangements  for  the  distri¬ 
bution  of  power  to  assess  the  chances  for  democ¬ 
racy  in  capitalist  society.  Major  variants  include 
elite  theories,  pluralist  theories,  and  class-based 
theories. 

The  idea  that  specific  groups  obtain  privileged 
and  leading  positions  in  society  directly  follows 
from  the  basic  presumption  of  modernization 
approaches  that  societies  increasingly  become 
more  complex,  heterogeneous,  and  differentiated. 
As  such,  the  rise  to  power  of  specific  groups  (elites) 
is  well  founded  in  these  theories.  Traditional 
examples  can  be  found  in  the  works  of  Gaetano 
Mosca,  Vilfredo  Pareto,  and  especially  Robert 
Michels.  For  Mosca,  the  driving  force  behind  soci¬ 
etal  developments  is  the  continuous  struggle 
between  groups  to  gain  dominance  and  power.  As 
a  result,  society  always  consists  of  two  groups 
only:  the  dominating  and  the  dominated,  the  pow¬ 
erful  and  the  powerless.  As  Mosca,  Pareto  consid¬ 
ers  elites  to  be  unavoidable.  Since  human  beings 
are  very  unequal,  society  is,  by  definition,  not 
homogeneous — and  elites  will  not  be  homoge¬ 
neous  either.  Following  this  argument,  Pareto  con¬ 
cludes  that  social  change  is  always  based  on  elite 
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circulation.  The  replacement  of  elites  is  also  an 
important  aspect  of  the  work  of  Michels.  Instead 
of  continuous  battles  or  circulation  of  elites, 
Michels  presumes  that  a  permanent  amalgamation 
of  members  of  old  and  new  elites  takes  place.  Elite 
amalgamation  is  ascertained  in  organizations,  and 
so  Michels  focuses  his  attention  on  developments 
within  organizations  and,  in  particular,  within 
political  parties.  On  the  basis  of  this  work,  he  for¬ 
mulated  his  famous  Iron  Law  of  Oligarchy  already 
in  1911:  “Who  says  organization,  says  oligarchy.” 

Pluralist  theorists  accept  the  idea  that  in  modern 
societies  elites  are  inevitable.  For  pluralists,  how¬ 
ever,  this  does  not  imply  that  in  each  area  of  social 
and  economic  life  the  very  same  elite  group  is  in 
power.  In  fact,  increasing  complexity,  heterogene¬ 
ity,  and  differentiation  of  societal  arrangements 
make  it  highly  unlikely  that  a  single  group  would 
be  able  to  dominate  several  different  areas  at  the 
same  time.  Pluralist  approaches  presume,  first, 
that  modern  societies  are  complex  and  highly  frag¬ 
mented  and,  second,  that  different  elites  gain 
power  in  each  area.  As  a  result,  social  life  is  char¬ 
acterized  by  conflicts  between  elites  from  various 
areas,  which  are  based  on  different  groups. 
Pluralists  do  not  simply  consider  these  elites  or 
conflicts  between  elites  as  a  threat  to  democracy. 
On  the  contrary,  especially  because  of  elite  compe¬ 
tition,  progress  is  possible,  and  group  interests  will 
be  taken  into  account.  To  avoid  the  destructive 
consequences  of  severe  conflicts  between  elites,  the 
social  groups  on  which  they  are  based  should  over¬ 
lap;  that  is,  they  should  not  be  completely  distinct. 
If  these  structural  arrangements  are  fulfilled,  suc¬ 
cessful  democratization  will  be  the  outcome  of 
modernization.  An  example  of  a  pluralist  approach 
is  the  seminal  study  of  Robert  Dahl  on  the  distri¬ 
bution  of  power  in  an  American  community  in  the 
early  1960s.  Conscientiously,  he  studied  decision¬ 
making  processes  in  various  policy  areas  in  a  small 
town  (New  Haven,  Connecticut)  and  was  able  to 
show  the  existence  of  distinct,  competing  elites  in 
different  areas.  The  existence  of  these  elites  does 
not  endanger  democracy;  it  is  an  essential  precon¬ 
dition  for  its  functioning. 

Most  class-based  approaches  accept  the  idea 
that  societies  increasingly  become  more  complex, 
heterogeneous,  and  differentiated.  Instead  of  focus¬ 
ing  on  elites,  class-based  theories  prefer  a  broader 
conceptualization  of  the  consequences  of  ongoing 


modernization  for  power  struggles  between  social 
groups  and  for  the  resulting  social  inequalities. 
Broadly  speaking,  class  as  a  term  to  depict  some 
group  of  people  is  used  in  two  ways:  by  referring, 
first,  to  functional  contributions  in  (industrial) 
production  processes  (e.g.,  working  class)  or,  sec¬ 
ond,  to  positions  in  some  hierarchy  (e.g.,  middle 
class  or  ruling  class).  Directly  following  Marx, 
political  sociologists  in  the  Marxist  tradition  use 
class  concepts  based  on  contributions  to  the  pro¬ 
duction  process.  Other  social  scientists — among 
them  Max  Weber — prefer  relational  concepts. 
Since  positions  in  a  production  process  and  in  a 
social  hierarchy  are  evidently  related,  actual 
research  usually  deals  with  both  class  concepts.  In 
his  seminal  article  on  the  “social  requisites  of 
democracy,”  Lipset  (1959)  pointed  to  the  fact  that 
economic  development  is  an  important  precondi¬ 
tion  for  democracy.  Modernization  does  not  only 
result  in  complexity,  heterogeneity,  and  differenti¬ 
ation  but  also  implies  massive  expansion  of  pro¬ 
duction  capacities  and  wealth.  Increased  wealth 
dampens  class  struggles  and  permits  broad  support 
for  distributional  compromises.  Besides,  a  rising 
middle  class  will  claim  political  rights  and  promote 
democratic  values.  These  combined  effects  of  mod¬ 
ernization,  Lipset  argued,  strongly  favor  democra¬ 
tization  in  countries  with  high  levels  of  economic 
development. 

Whereas  Lipset  searched  for  the  mechanism 
behind  democratization,  Stein  Rokkan  attempted 
to  explain  the  varieties  of  modernization  in  Europe. 
Political  developments  in  Europe  are,  according  to 
Rokkan,  mainly  a  consequence  of  the  deep-rooted 
cleavages  that  arose  in  the  aftermath  of  several 
invasive  events  (the  Reformation,  nation  building, 
the  French  Revolution,  the  Industrial  Revolution). 
Since  these  events  occurred  differently  in  different 
places  and  times,  European  societies  show  both 
similarities  and  differences.  Accordingly,  political 
arrangements  show  similarities  and  differences  too. 
Together  Rokkan  and  Lipset  analyzed  these  simi¬ 
larities  and  differences,  especially  for  party  systems 
and  voter  alignments  in  Europe:  Evident  similari¬ 
ties  (such  as  the  existence  of  social-democratic  and 
Christian  Democratic  parties)  are  combined  with 
striking  differences  (such  as  the  spread  of  liberal 
parties).  Rokkan  and  Lipset  underlined  the  strong 
impact  of  cleavages  by  pointing  out  the  fact  that 
party  systems  and  voter  alignments  in  Europe 
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remained  “frozen”  from  the  early  extension  of  suf¬ 
frage  until  the  mid-1960s;  that  is,  the  impact  of 
social  factors  survived  two  world  wars  and  severe 
economic  turmoil. 

The  closeness  of  political  sociology  to  modern¬ 
ization  theories  in  the  first  2  decades  after  World 
War  II  provided  the  upcoming  discipline  with 
many  advantages.  Nonetheless,  from  the  very 
beginning,  modernization  theories  provoked 
strong  criticism — criticism  that  applied  also  to 
political  sociology  as  its  main  intellectual  ally.  A 
major  concern  of  many  critics  is  the  idea  that 
(American)  capitalist  society  and  liberal 
democracy  should  be  seen  as  the  end-stage  of  a 
progressive  evolution  of  humanity.  This  teleologi¬ 
cal  tendency  in  modernization  approaches  is 
rejected  as  being  ideologically  biased  and  episte¬ 
mologically  unwarranted.  On  a  closer  look,  the 
presumed  progressive  evolutionary  process  in 
many  societies  appeared  to  be  fragile,  inconsistent, 
nonmonotonous,  and  not  necessarily  following  a 
European  or  Western  model.  Furthermore,  the 
basic  idea  that  economic  and  political  develop¬ 
ments  are  interdependent  was  challenged  and 
replaced  by  a  much  more  open  approach  concep¬ 
tualizing  economic  and  political  modernization  as 
two  distinct  processes  whose  potential  interdepen¬ 
dencies  should  be  studied  carefully  instead  of 
simply  being  taken  for  granted.  Furthermore, 
methodological  criticisms  were  raised.  Since  mod¬ 
ernization  approaches  are  based  on  general  frame¬ 
works  and  claims  of  universality,  the  usual  methods 
applied  were  quantitative  and  comparative  (typi¬ 
cally,  statistical  analyses  of  cross-national  data  in 
large-N  designs).  Newer  approaches  challenged 
these  ideas  and  considered  in-depth  historical 
analyses  and  case-oriented  methods  (usually 
hermeneutic-interpretative  and  cultural-historical 
approaches  in  small -N  designs)  as  much  more 
appropriate  to  the  study  of  complicated  reciprocal 
relationships  between  the  social  and  the  political. 

Main  Approaches 

In  a  somewhat  exaggerated  way,  one  might  regard 
many  developments  in  political  sociology  in  the 
last  half  century  as  attempts  to  present  alternatives 
for  the  apparent  limitedness,  erroneousness,  and 
ideological  bias  of  the  modernization  approaches 
of  the  1940s  and  1950s.  Alternative  approaches 


challenge  the  capitalist  distribution  of  power  and 
its  implicated  social  inequalities  as  well  as  the 
strong  emphasis  on  structural-functional  argu¬ 
ments.  These  disputes  reflect  a  more  general 
change  in  the  social  sciences  away  from  social- 
scientific  approaches  (emphasizing  causal  explana¬ 
tions  based  on  regularities  and  mainly  applying 
quantitative,  comparative  methods)  toward  cul¬ 
tural-scientific  approaches  (emphasizing  specific 
cultural  meanings  and  constructions  of  meanings; 
mainly  applying  qualitative,  case-oriented  meth¬ 
ods).  The  three  major  groups  of  alternatives 
presented  for  liberal-capitalist  modernization  ap¬ 
proaches  consist  of  Marxist  theories,  critical  and 
conflict  theories,  and  postmodern  and  poststruc- 
tural  theories.  But  agents  of  social-scientific 
approaches  did  not  simply  abscond  from  the  battle 
field.  Following  mainstream  microeconomic  the¬ 
ory,  deductive  approaches  based  on  the  idea  of 
human  rationality  resulted  in  the  blossoming  of 
rational  choice  approaches  as  the  key  to  under¬ 
standing  social  and  political  developments.  Besides, 
a  revival  of  modernization  theories — based  on 
social-scientific  as  well  as  on  cultural-scientific 
approaches — can  be  observed  currently. 

Marxism 

Fiberal-capitalist  modernization  approaches 
emphasize  structural-functional  interdependencies 
between  the  political  subsystem  and  other  parts  of 
the  system.  In  this  sense,  American  society  is  usu¬ 
ally  depicted  as  the  end-stage  of  progressive  evolu¬ 
tionary  developments.  These  propositions  have 
been  challenged  by  political  sociologists  working 
in  a  Marxist  tradition.  According  to  these 
approaches,  the  capitalist  state  is  not  some  “sub¬ 
system”  that  adjusts  itself  in  neutral  ways  to  the 
functional  requirements  of  other  subsystems.  The 
genesis  of  the  late-medieval  idea  of  a  “state,”  with 
its  territorial-based  claim  on  sovereignty,  can  only 
be  understood  by  looking  at  the  requirements 
enabling  the  accumulation  of  capital — that  is,  by 
looking  at  property  rights  and  the  protection  of 
markets  at  home  and  abroad.  In  this  way,  social 
and  political  developments  are  based  on  develop¬ 
ments  in  the  production  process  and  the  accompa¬ 
nying  arrangements  to  acknowledge  rights  and  to 
legitimate  the  distribution  of  wealth.  Tike  modern¬ 
ization  theories,  Marxist  approaches  depict  liberal 
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capitalism  as  a  stage  in  the  progressive  evolution  of 
societies.  Contrary  to  many  modernization  theo¬ 
rists,  however,  Marxists  regard  capitalism  and 
liberal  democracy  as  a  transitory  phase  and  not  as 
the  end-stage  of  this  evolutionary  process. 

The  renaissance  of  Marxist  approaches  in  polit¬ 
ical  sociology  reaffirmed  the  idea  that  politics  is 
based  on  social,  in  particular  on  economic,  devel¬ 
opments.  Two  main  areas  of  research  benefited 
especially  from  this  advance:  (1)  state  formation 
and  state  building  and  (2)  the  distribution  of 
power  and  social  inequalities.  In  his  detailed  study 
on  the  Social  Origins  of  Dictatorship  and 
Democracy,  Barrington  Moore  showed  that  vari¬ 
ous  countries  took  different  routes  to  come  to  the 
modern  industrial  world.  According  to  his  view, 
the  power  of  bureaucracies  in  raising  taxes  and 
granting  privileges  was  countervailed  by  the  nobil¬ 
ity  and  the  upcoming  bourgeoisie,  which  laid  the 
basis  for  democracy  in  England  and  France.  Since 
these  countervailing  forces  were  largely  suppressed 
by  a  coalition  of  bureaucrats  and  aristocrats  in 
Germany  (Prussia),  dictatorship  was  more  likely 
than  democracy  in  Central  Europe.  Although 
Moore  did  not  concentrate  on  state  formation,  his 
book  highly  stimulated  the  revival  of  analyses 
dealing  with  that  topic,  gradually  moving  the 
emphasis  from  Marxist  theories  based  on  eco¬ 
nomic  primacy  toward  more  complex  approaches. 
Charles  Tilly,  especially,  attempted  to  overcome 
the  limitations  of  retrospective  arguments  and  the 
neglect  of  noneconomic  factors  in  explanations  of 
the  formation  of  national  states  in  Europe.  In  his 
view,  nation-states  proved  to  be  the  most  effective 
way  to  mobilize  and  organize  the  resources 
required  for  the  severe  political  conflicts  and  wars 
in  medieval  Europe;  that  is,  arms  and  warfare 
strongly  stimulated  the  rise  of  the  state.  The 
renewed  debate  also  provided  the  opportunity  to 
move  well  beyond  the  eurocentrism  of  many 
approaches  and  to  strongly  stimulate  interest  in 
state  developments  in  Asia,  Africa,  and  Latin 
America. 

Economic  causes  also  are  at  the  heart  of 
Marxist  approaches  dealing  with  the  distribution 
of  power  in  capitalist  society  and  the  implied 
social  inequalities.  George  Domhoff,  in  a  contro¬ 
versial  publication,  answered  the  question  “Who 
rules  America?”  unambiguously:  The  country  is 
dominated  by  political  and  economic  elites.  He 


stressed  that  the  dominant  pluralist  approaches  of 
the  1950s  and  1960s  tended  to  overlook  class 
consciousness  and  class  struggles  in  capitalist 
society  and  are  slanted  toward  political  decision¬ 
making  processes  instead  of  the  resulting  inequal¬ 
ities.  Although  he  did  not  consider  himself  to  be  a 
Marxist,  Domhoff’s  approach  and  terminology 
clearly  reflect  the  economic  primacy  characteristic 
of  Marxist  approaches. 

Debates  about  the  power  structure  in  capitalist 
society  were  strongly  influenced  by  the  publication 
of  Nicos  Poulantzas’s  work  on  classes  and  political 
power  in  the  late  1960s.  The  capitalist  state,  he 
argued,  is  not  simply  an  instrument  in  the  hands  of 
the  capitalists  who  use  it  for  their  own  interests. 
Instead,  the  capitalist  state  is  a  relatively  autono¬ 
mous  actor  ensuring  the  smooth  operation  of 
capitalist  society.  It  is  the  structure  of  the  system — 
not  the  short-term  interests  and  preferences  of  the 
capitalist  class — that  works  to  the  benefits  of  that 
class.  Therefore,  major  problems  in  capitalist  soci¬ 
ety  consist  of  the  clashes  between  the  demands  of 
various  classes,  on  the  one  hand,  and  the  need  for 
the  system  to  provide  social  stability  to  reproduce 
itself,  on  the  other.  Authors  such  as  Claus  Offe 
expanded  this  line  of  reasoning  by  focusing  on  the 
unavoidable  tensions  produced  by  the  actions  of 
capitalist  states  to  overcome  class  divisions  with¬ 
out  endangering  the  long-term  benefits  of  the 
capitalist  class. 

With  the  theories  of  Tilly,  Domhoff,  and  Offe, 
we  have  left  orthodox  Marxism.  The  relevance  of 
economic  factors  is  almost  universally  accepted; 
likewise  economic  determinism  and  economic 
reductionism  are  widely  rejected  as  being  one¬ 
sided  and  insufficient.  With  a  few  exceptions, 
political  sociologists  focus  on  reciprocal  relation¬ 
ships  between  social  and  political  phenomena  and 
try  to  avoid  one-way  lines  of  reasoning  in  explana¬ 
tions  of  power  and  dominance. 

Neo-Marxism:  Conflict  Theories 
and  Critical  Theories 

Neo-Marxism  is  a  rather  loose  term  referring  to 
approaches  that  amend  Marxist  theory,  usually  by 
incorporating  elements  from  other  traditions  to 
overcome  the  deficiencies  of  orthodox  Marxism, 
especially  economic  determinism  and  reduction¬ 
ism.  Major  blends  of  neo-Marxism  consist  of 
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combinations  of  Marxist  approaches  with  conflict 
theories  and  critical  theories.  Conflict  theories 
stress  that  resources  are  scarce.  Therefore,  conflict 
and  competition,  rather  than  cooperation  and  con¬ 
sensus  are  characteristic  of  human  relationships  in 
all  societies.  Struggles  to  maximize  benefits  and  to 
defend  interests  depend  on  the  ability  and  resources 
to  exercise  power  and  dominance  in  a  society. 
Powerful  groups  use  their  power  to  exploit  groups 
with  less  power — through  brute  force  and  suppres¬ 
sion  if  necessary  but  also  by  securing  ideological 
hegemony  and  structural  advantages.  Inevitably, 
permanent  social  struggles  deeply  affect  the  politi¬ 
cal  and  social  order  and  imply  societal  and  political 
changes.  Conflict  theories  are  unmistakably  based 
on  the  work  of  Marx  and  Weber  and  are  easily 
discernable  in  almost  each  and  every  political- 
sociological  theory:  pluralists,  elitists,  and  class 
theorists  all  agree  that  conflicts  and  struggles 
between  social  or  political  groups  are  crucial  to 
understand  the  reciprocal  relationships  between  the 
social  and  the  political.  Conflict  theorists,  however, 
claim  that  these  struggles  are  the  exclusive  driving 
force  behind  social  and  political  developments. 

Analyses  of  the  distribution  of  power  and  domi¬ 
nance  are  the  main  objects  of  conflict  theories. 
From  this  perspective,  not  only  the  work  of  Domhoff 
but  also  the  classic  study  of  C.  Wright  Mills  depict¬ 
ing  the  American  ruling  class  as  a  Power  Elite  can 
be  seen  as  an  important  neo-Marxist  and  conflict- 
theoretical  contribution  to  political  sociology.  In  his 
book,  Mills  points  out  the  strong  overlapping  of 
interests  of  the  military,  business,  and  political  elites 
in  the  United  States.  Behind  the  faqade  of  pluralism 
and  interest  differentiation,  the  aims  of  elites  in 
various  areas  appear  to  be  remarkably  attuned  and 
congruent  to  each  other.  By  contrast,  the  ordinary 
citizen  is  perceived  as  relatively  powerless  and  an 
easy  subject  of  manipulation  by  those  elites.  Power, 
conflict,  and  social  inequality,  then,  appear  to  be 
robustly  related  in  the  world’s  largest  liberal- 
capitalist  democracy. 

Class  provides  the  major  way  to  distinguish 
competing  social  groups  in  conflict-theoretical 
approaches.  As  we  have  seen,  Marxists  define 
classes  based  on  the  functional  contributions  of 
groups  in  production  processes,  whereas  many 
other  social  scientists  use  the  concept  to  refer  to 
positions  of  groups  in  a  hierarchy.  Both  conceptu¬ 
alizations  usually  refer  to  deep  and  persistent 


social  distinctions.  Yet  class  is  certainly  not  the 
only  distinction  offering  a  basis  for  conflicts  and 
struggles  between  various  groups.  Race  and  eth¬ 
nicity  definitely  play  an  important  role  in  many 
societies  and  are  a  cause  of  virulent  conflicts  both 
within  and  between  states.  In  a  similar  way,  geo¬ 
graphical  and  regional  differences  are  used  to 
define  conflicting  interests.  Furthermore,  the  social 
definition  of  distinct  roles  for  men  and  women 
{gender)  and  the  resulting  inequalities  lead  to  vari¬ 
ous  conflicts  and  struggles.  For  centuries,  religion 
has  proven  to  be  an  extremely  vigorous  basis  for 
social  and  political  conflicts.  Religion,  and  not 
class,  played  the  key  role  in  the  political  history  of 
many  European  countries,  and  numerous  contem¬ 
porary  conflicts  in  the  world  are  based  on  religion. 
The  list  of  social  distinctions  that  can  be  articu¬ 
lated  in  conflicts  between  groups  for  power  and 
dominance  is  virtually  endless.  As  we  have  seen, 
Rokkan’s  theory  of  long-standing  social  cleavages 
covers  many  diverse  social  distinctions  and  is  an 
example  of  the  openness  of  conflict-based 
approaches.  In  the  early  1990s,  Samuel  Huntington 
advanced  the  proposition  that  world  politics  would 
be  increasingly  dominated  by  conflicts  based  on  a 
Clash  of  Civilizations:  People  are  likely  to  see  “us” 
versus  “them”  in  the  relations  between  themselves 
and  people  of  different  ethnicity  or  religion. 
According  to  Huntington,  the  resulting  clash  will 
take  place  not  only  between  people  and  groups 
within  states  but  also  between  states  and  between 
groups  of  states. 

Conflict  theories  and  critical  theories  have  a  lot 
in  common.  The  main  difference  does  not  lie  in  the 
depiction  of  capitalist  society  as  conflictual  and 
contradictory  but  in  the  emphasis  placed  on  the 
rejection  of  the  positivist  background  of  modern¬ 
ization  approaches  and  conflict  theories. 
Positivists — especially  logical  positivists — attempt 
to  understand  and  explain  social  phenomena  by 
searching  for  regularities  and  general  statements. 
Besides,  they  strictly  observe  a  distinction  between 
facts  and  values  and  consider  the  normative  conse¬ 
quences  of  their  work  as  exogenous.  Critical  theo¬ 
rists  reject  that  model  of  science  and  support 
emancipatory  (usually  anticapitalist)  causes;  that 
is,  emancipation,  freedom,  and  decreasing  domi¬ 
nation  are  explicitly  stated  as  the  normative  bases 
for  social  research.  According  to  critical  approaches, 
the  social  sciences  are  not  concerned  with  isolated 
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social  phenomena  but  with  society  as  a  whole  as 
well  as  its  historical  specificity.  Consequently, 
critical  theories  cannot  rely  on  work  in  distinct 
disciplines  alone — understanding  society  requires 
the  integration  of  all  the  major  social  sciences. 

Critical  theory  originated  with  the  Frankfurt 
School  in  the  1930s.  In  political  sociology,  impor¬ 
tant  representatives  are,  among  many  others, 
Pierre  Bourdieu  and  Jurgen  Habermas.  Bourdieu’s 
main  interests  are  the  mechanisms  that  reproduce 
social  hierarchies  (inequalities).  The  reproduction 
of  the  social  order  and  the  simultaneous  domi¬ 
nance  of  specific  groups  over  others  cannot  be 
explained  with  economic  factors  only.  Instead, 
Bourdieu  stresses  the  relevance  of  educational  and 
cultural  factors  for  the  active  engagement  of  peo¬ 
ple  in  the  (re)production  of  culture.  In  each  society, 
people  are  struggling  to  reach  their  goals  in  distinct 
social  arenas  (fields  such  as  the  economy,  sports,  or 
the  family).  These  struggles,  however,  depend  not 
only  on  the  objective  conditions  in  each  arena  but 
also  on  the  persistent  skills  and  dispositions  (habi¬ 
tus)  of  the  people  involved.  The  skills  and  disposi¬ 
tions  generated  are  compatible  with  the  objective 
conditions  (including  the  development  of  different 
tastes  for  art  or  food  in  different  arenas)  and 
legitimize  existing  social  structures.  Moreover, 
they  exclude  improbable  practices  as  unthinkable 
or  unnatural.  By  focusing  on  the  close  interdepen¬ 
dencies  between  objective  and  subjective  factors, 
Bourdieu  presents  explanations  for  both  the  exis¬ 
tence  of  social  hierarchies  as  well  as  the  reproduc¬ 
tion  of  these  hierarchies  as  social  and  political 
inequalities. 

Already  in  the  1960s,  Jurgen  Habermas  started 
his  attempts  to  liberate  critical  theory  from  its 
Marxist  and  Frankfurt  School  roots.  Whereas 
Bourdieu  presents  a  sociological  analysis  of  social 
processes  of  power  distribution  and  the  reproduc¬ 
tion  of  inequalities,  one  of  Habermas’s  main  inter¬ 
ests  concerns  the  opportunities  for  democracy  in 
the  modern  world.  In  his  early  study  on  The 
Structural  Transformation  of  the  Public  Sphere,  he 
explores  the  historical  development  of  public  con¬ 
tacts,  characterized  by  an  expansion  of  public 
interactions  (from  families  to  coffee  houses  and 
parliaments)  and  by  a  contraction  through  the 
market  and  bureaucracies.  Public  interactions, 
Habermas  stresses,  should  enable  people  to  inter¬ 
act  with  each  other  in  critical  ways  on  the  basis  of 


equality.  Through  self-reflection,  critical  knowl¬ 
edge  can  offer  an  important  contribution  to  ascer¬ 
tain  the  public  character  of  these  interactions  by 
enabling  people  to  emancipate  themselves  from 
dominance.  Basically,  Habermas  approaches  mod¬ 
ern  society  positively  since  its  ongoing  complexity 
and  functional  differentiation  provide  opportuni¬ 
ties  for  democratic  forms  of  self-organization.  In 
other  words,  the  very  fact  that  complexity  and 
differentiation  make  total  control  over  social  life  in 
the  modern  world  illusory  enables  people  to  gain 
control  over  their  own  life  on  the  basis  of  mutual 
recognition  and  democratic  interactions. 

It  is  not  always  easy  to  distinguish  between  neo- 
Marxist,  conflict-theoretical,  or  critical-theoretical 
approaches,  and  these  labels  are  not  used  consis¬ 
tently.  Moreover,  many  conflict  theorists  rely  on 
critical  approaches,  whereas  virtually  every  critical 
theorist  stresses  the  importance  of  conflict  between 
various  groups  in  society.  Yet  they  all  dismiss 
Western-centered  structural-functionalist  modern¬ 
ization  approaches  as  well  as  orthodox  Marxism. 
Liberal  capitalism  is  not  seen  as  the  end-stage  of  a 
progressive  evolutionary  process  but  as  a  system 
whose  inherent  social  and  political  inequalities 
have  to  be  confronted  and  battled  against. 

The  Cultural  Turn:  Postmodernism 
and  Poststructuralism 

The  rise  of  conflict  and  critical  theories  was 
mainly  based  on  attempts  to  overcome  the  short¬ 
comings  of  positivism,  with  its  epistemological 
proximity  to  the  natural  sciences  and  its  claims  on 
normative  neutrality.  In  addition,  the  economic 
determinism  and  reductionism  of  Marxism  as  well 
as  the  Western-centeredness  of  most  approaches 
in  early  political  sociology  were  criticized.  Gener¬ 
ally  speaking,  conflict  and  critical  theories  do  not 
question  the  close  interdependencies  between 
structural  (objective)  and  cultural  (subjective)  fac¬ 
tors.  Especially  among  French  social  scientists  in 
the  1970s,  attention  shifted  from  structure  to  cul¬ 
ture,  and  a  new  wave  of  post-Marxist  approaches 
developed  under  labels  such  as  postmodernism 
and  poststructuralism. 

Postmodernism  and  poststructuralism  are  part 
of  the  “cultural  turn”  in  political  sociology.  In 
these  approaches,  cultural  factors  are  considered 
to  be  more  important  for  the  study  of  power  and 
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dominance  in  a  society  than  structural  ones.  The 
emphasis  is  shifted  toward  (the  construction  of) 
meaning,  and  culture  is  considered  to  be  more  rel¬ 
evant  than  politics  or  economics.  The  rationale  for 
this  “turn”  lies  in  the  developments  in  modern 
societies  as  they  could  be  especially  observed  in  the 
United  States  in  the  late  1950s  and  1960s. 
According  to  postmodernists  and  poststructural- 
ists,  life  is  defined  on  the  basis  of  cultural  factors: 
The  cultures  of  consumerism,  leisure,  lifestyle, 
fashion,  arts,  and  so  on  are  much  more  important 
than  positions  in  the  production  process  or  in  some 
hierarchy.  The  spread  of  mass  media  strongly 
stimulates  these  cultures,  and  these  are,  in  fact, 
important  parts  of  the  fundamental  changes  in 
society.  Exactly  because  modern  society  is  charac¬ 
terized  by  complexity,  heterogeneity,  and  differen¬ 
tiation,  people  increasingly  rely  on  cultural  factors 
to  define  their  life  and  their  position  within  society. 
As  a  consequence,  no  common  or  general  distinc¬ 
tions  between  groups  of  people  can  be  presumed  or 
predicted — it  is  exactly  the  apparent  lack  of  com¬ 
mon  and  general  distinctions  that  characterizes 
postmodern  societies. 

Almost  by  definition,  it  is  hard  to  find  a  common 
understanding  of  postmodernism.  Literally,  the 
term  refers  to  the  passing  of  “modernism”  only.  A 
regular  feature  of  many  postmodernists  is  that  they 
clearly  criticize  “modern”  society  as  it  developed  in 
the  Western  world  in  the  past  2  centuries,  especially 
its  claims  of  being  based  on  universal  principles  of 
progress,  equality,  and  freedom.  Postmodernists  do 
not  challenge  the  meaning  or  importance  of  each 
one  of  these  principles.  Instead,  they  reject  the  pos¬ 
sibility  of  universal,  normative,  and  ethical  judg¬ 
ments  in  general  and  stress  the  relative  nature  of  all 
such  statements.  From  this,  it  follows  that  truth  and 
knowledge  depend  on  the  social  and  historical  con¬ 
text.  Depicting  scholars  as  postmodernists  is  a  dif¬ 
ficult  task  since  most  of  them  reject  such  general 
labels  as  being  inconsistent  with  their  theories. 
Jacques  Lacan,  Michel  Foucault,  Jacques  Derrida, 
Jean-Franqois  Lyotard,  and  Anthony  Giddens  are 
among  the  most  prominent  authors  being  typified 
as  postmodernist.  For  political  sociology,  Foucault 
and  Giddens  probably  are  the  most  influential. 

The  nature  and  distribution  of  power  and  domi¬ 
nance  in  societies  are  the  major  concerns  of  Foucault. 
He  studies  these  topics  in  various  institutions  (such 
as  psychiatric  hospitals  and  prisons)  and  introduces 


a  number  of  new  concepts  to  deal  with  power  and 
dominance.  The  concept  discourse  refers  to  the 
way  in  which  language  is  used  to  express  accept¬ 
able,  institutionalized  ways  of  thinking;  it  defines 
what  can  be  said  about  a  topic  with  the  vocabulary 
and  expressions  available.  Since  every  idea  is 
phrased  in  some  terms,  discourses  affect  our  views 
on  everything.  In  that  way,  discourses  construct 
the  phenomena  about  which  we  speak.  In  other 
words,  ideas,  concepts,  attitudes,  and  ideologies  do 
not  only  arise  within  specific  contexts,  but  they 
also  define  the  way  in  which  we  consider  the  vari¬ 
ous  aspects  of  those  contexts.  The  study  of  power 
and  dominance,  then,  is  the  study  of  discourses 
about  power  and  dominance  (such  as  texts, 
speeches,  policies,  and  practices)  in  various  histori¬ 
cal  and  social  settings.  Therefore,  to  understand 
power  and  dominance,  it  is  necessary  to  study  both 
power  and  dominance  themselves  and  the  systems 
of  knowledge  that  are  used  to  produce  power  and 
dominance.  According  to  Foucault,  power  is  not 
possessed  by  individual  people  or  groups  but  is  an 
activity  that  all  people  can  engage  in.  Exercising 
power  will  always  provoke  resistance  and  competi¬ 
tion  by  challenging  the  dominant  discourse. 
Starting  with  the  discourse  concept,  Foucault’s 
actual  work  became  the  study  of  how  knowledge  is 
produced.  Since  knowledge  production  underlies 
the  same  principles  of  cultural  conditioning  and 
competing  meanings  as  any  other  area,  it  cannot  be 
based  on  structural  determinants  only.  For  that 
reason,  these  approaches  are  summarized  under 
the  label  poststructural  theories. 

The  interdependencies  of  structural  and  cultural 
factors  also  play  an  important  role  in  the  work  of 
Giddens.  With  his  theory  of  structuration  he 
attempts  to  take  into  account  that,  although  people 
are  not  free  to  do  what  they  want  and  their  knowl¬ 
edge  is  imperfect,  there  are  acts  of  individuals  that 
reproduce  the  social  structure  and  enable  social 
change.  Therefore,  what  we  call  “society”  can  be 
understood  neither  as  a  set  of  institutions  nor  as  a 
simple  aggregation  of  all  human  acts.  Relying  more 
on  Weber  than  on  Marx,  Giddens  approaches 
social  and  political  actions  from  the  perspective 
that  individual  thoughts  and  behavior  are  struc¬ 
tured  by  social  institutions,  conventions,  and  ethi¬ 
cal  codes.  Besides,  the  social  structure  is  reinforced 
and  reproduced  by  the  continuous  repetition  of 
human  acts;  that  is,  social  action  creates  structures 
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that,  at  the  same  time,  make  social  action  possible. 
In  this  way,  individual  acts  also  provide  the  key  for 
understanding  social  change:  If  people  start  to 
break  from  repetitive  acts,  the  social  structure  will 
be  reproduced  differently. 

Like  Foucault,  Giddens  rejects  a  distinction 
between  structure  and  culture.  But  whereas 
Foucault  considers  the  two  hopelessly  entangled 
by  the  very  same  discourse  they  rely  on,  Giddens 
returns  to  the  notion  of  mutual  dependency.  The 
idea  that  actions  are  constrained  by  structures, 
which  are,  in  turn,  created  and  reproduced  by 
those  very  actions,  avoids  the  fallacies  of  social 
determinism  and  reductionism.  In  modern  socie¬ 
ties,  new  information  constantly  challenges  and 
modifies  social  practices,  altering  their  character 
constantly,  too.  Since  a  sound  foundation  for 
knowledge  is  missing  and  the  expansion  of  the 
social  sciences  increasingly  results  in  differences  of 
opinion,  people  more  and  more  “reflect”  on  their 
own  situation  and  the  social  structures  they  live  in. 
As  a  result,  uncertainty  gradually  drives  out  confi¬ 
dence  and  certitude.  According  to  Giddens,  we  are 
observing  not  the  rise  of  some  postmodern  society 
but  merely  a  “radicalized  modernity.” 

The  Economic  Turn:  Rational  Choice 

Neo-  and  post-Marxist  theories  revitalized  theo¬ 
retical  thinking  in  political  sociology  and  stimulated 
new  directions.  Postmodernist  and  poststructuralist 
scholars,  especially,  presented  proposals  for  radi¬ 
cally  different  approaches.  Their  extensive  episte¬ 
mological  and  ideological  criticisms  of  positivist 
social-scientific  theories  and  methods  are  accompa¬ 
nied  by  appeals  for  social-cultural  approaches  and 
the  use  of  appropriate  strategies.  Yet  research  based 
on  these  approaches  is  usually  characterized  by 
rather  narrow  definitions  of  the  actual  objects  stud¬ 
ied  in  terms  of  power  and  dominance.  Neo-  and 
post-Marxists  drew  attention  to  the  old  questions 
about  the  relationships  between  structure  and  cul¬ 
ture  and  did  present  new  approaches  to  study  these 
interdependencies.  Not  all  social  scientists  are  con¬ 
vinced,  however,  that  approaches  to  deal  with  these 
questions  necessarily  have  to  be  based  on  epistemo¬ 
logical  criticism,  the  rejection  of  positivism,  or  the 
application  of  radically  new  research  methods. 

Presumed  causal  links  between  social  entities 
such  as  interest  groups  or  states  are  difficult  to 


analyze.  What  do  we  mean  when  we  say,  for 
instance,  that  “parties  try  to  win  elections”  or  that 
“economic  development  leads  to  democracy”? 
Usually,  these  phrases  are  not  based  on  ontological 
presumptions  about  actual  “behavior”  of  parties  or 
economic  systems  but  are  shorthand  for  the  activi¬ 
ties  of  party  leaders  or  middle-class  people  only.  In 
fact,  one  could  argue  that  causal  explanations  are 
only  possible  for  human  behavior — all  other  state¬ 
ments  are  interpretations  based  on  the  aggregation 
of  the  behavior  of  individuals  and  its  consequences. 
By  concentrating  on  the  behavior  of  individual 
consumers,  citizens,  producers,  and  so  on,  macro¬ 
social  and  macropolitical  phenomena  can  be 
explained  as  aggregations  of  microsocial  behavior. 
This  so-called  methodological  individualism  res¬ 
cues  the  idea  of  regularities  underlying  social  phe¬ 
nomena  from  neo-  and  post-Marxist  attacks  and 
reconfirms  the  search  for  regularities  as  an  intel¬ 
lectually  fruitful  goal  for  the  social  sciences. 

At  least  since  the  19th  century,  deductive  rea¬ 
soning  starting  with  straightforward  axioms  about 
individual  behavior  is  the  typical  field  of  main¬ 
stream  microeconomic  theory.  It  is  clear  that  many 
sociologists  and  political  scientists  are  fascinated 
by  the  way  in  which  economics  proceeds  and  relies 
on  rigorous  deductive  ways  of  thinking.  This  allure 
resulted  in  the  rapid  annexation  of  large  parts  of 
the  social  sciences  by  so-called  rational  choice 
approaches  in  the  past  decades.  Individuals  are 
presumed  to  have  persistent  “preference  func¬ 
tions”  that  guide  their  actions:  Each  person 
attempts  to  reduce  “costs”  and  maximize  “gains” 
under  specific  constraints;  that  is,  each  individual 
tries  to  maximize  his  or  her  “utility.”  Rational 
choice  approaches  rely  on  methodological  indi¬ 
vidualism  and  the  presumption  that  individual 
behavior  and  its  consequences  can  be  aggregated 
by  using  formal  modeling  and  mathematical  rea¬ 
soning.  Most  rational  choice  theorists  do  not  use 
the  rationality  of  individuals  as  a  necessarily  cor¬ 
rect  description  of  human  behavior.  Instead,  they 
stress  the  need  for  strict  deductive  reasoning  start¬ 
ing  with  elementary  statements  and  leading  to  the 
formulation  of  falsifiable  hypotheses.  Successful 
hypotheses  are  not  necessarily  based  on  intuitively 
plausible  presumptions;  successful  hypotheses  are 
those  that  survive  rigid  empirical  testing.  In  this 
way,  rational  choice  approaches  provide  prescrip¬ 
tions  for  both  theoretical  arguing  (deductive  and 


2032  Political  Sociology  as  a  Field  of  Study 


formalized)  and  empirical  work  (falsification  of 
hypotheses). 

The  application  of  rational  choice  theories  in 
political  science  and  sociology  started  in  the  1950s 
and  1960s.  Clearly  working  in  the  tradition  of 
descriptive  democratic  theory  presented  by  Joseph 
Schumpeter  in  his  book  An  Economic  Theory  of 
Democracy ,  Anthony  Downs  argued  that  citizens 
vote  on  the  basis  of  a  rational  calculation  of  which 
party  is  most  likely  to  meet  their  preferences. 
Voting  decisions  are  similar  to  those  of  consumers 
in  a  market  who  calculate  the  costs  (taxes)  and 
benefits  (public  services)  of  choosing  one  political 
party  rather  than  another.  To  win  elections,  par¬ 
ties  try  to  locate  themselves  and  their  policies  close 
to  the  median  voter.  In  this  way,  both  the  behavior 
of  individual  voters  and  the  strategies  and  policies 
of  political  parties  are  explained.  From  a  socio¬ 
logical  perspective,  the  advantages  of  using  deduc¬ 
tive  reasoning  started  with  the  idea  of  individual 
rationality  evident  in  Mancur  Olson’s  book,  The 
Logic  of  Collective  Action:  Public  Goods  and  the 
Theory  of  Groups ,  published  in  the  mid-1960s. 
Why  would  rational  individuals  contribute  to  the 
production  of  collective  goods  that,  by  definition, 
are  freely  available  to  everybody?  Olson  argues 
that  rational  individuals  will  only  join  groups  and 
contribute  to  collective  goods  if  separate  and 
“selective”  incentives  are  provided  that  are  strictly 
available  for  group  members  only.  In  other  words, 
the  provision  of  collective  goods  by  rational  indi¬ 
viduals  is  only  possible  if  the  free  rider  problem  is 
solved.  Olson’s  work  bridges  the  gap  between 
economical  and  sociological  explanations  of  joint 
actions  and  group  activities.  As  did  Downs’s  work 
on  democracy,  Olson’s  work  initiated  a  wealth 
of  research  on  the  consequences  of  individual 
behavior. 

Rational  choice  approaches  provide  the  instru¬ 
ments  to  study  the  microfoundations  of  macroso¬ 
cial  and  macropolitical  phenomena  in  rigorous  ways. 
Even  clearer  than  neo-  and  post-Marxism — which 
usually  focus  on  power  and  dominance — rational 
choice  does  not  provide  an  object  specification  but 
a  specific  method  and  research  strategy  principally 
based  on  positivist  epistemology.  Due  to  this  open¬ 
ness,  rational  choice  approaches  have  been  applied 
to  an  enormous  variety  of  themes  and  topics,  many 
of  which  are  in  the  field  of  political  sociology: 
voting  behavior,  bureaucracies,  state  formation, 


interest  groups,  new  social  movements,  social 
mobility,  class  reproduction,  participation,  and 
social  capital  are  only  a  few  of  the  most  prominent 
examples.  The  almost  unrestricted  opportunities 
to  apply  rational  choice  approaches  to  various 
questions  are  clearly  underlined  by  the  use  of  ratio¬ 
nal  choice  as  a  basis  for  Marxist  theories  of  class 
and  exploitation.  Jon  Elster,  especially,  strongly 
rejects  the  functional  explanations  typical  of 
Marxism  and  many  other  social  sciences.  Instead, 
he  uses  the  analytical  tools  of  rational  choice 
approaches  (especially  methodological  individual¬ 
ism)  to  provide  microfoundations  for  the  explana¬ 
tion  of  social  and  political  phenomena. 

Rigorous  deductive  reasoning  usually  takes 
individuals  as  a  starting  point,  but — as  phrases 
such  as  rational  choice  institutionalism  already 
suggest — this  does  not  imply  a  restriction  to  psy¬ 
chological  explanations  only.  However,  it  is  clear 
that  rational  choice  simply  takes  “preferences”  for 
granted  and  runs  the  risks  of  being  circular,  non- 
falsifiable,  or  even  tautological.  How  can  we  tell 
when  people  are  not  acting  in  their  own  self- 
interest,  and  what  counts  as  nonrational  behavior? 
Some  people  define  their  preferences  in  terms  of 
the  public  good  and  are  prepared  to  risk  their  life 
for  others  and  their  own  beliefs,  but  whatever  they 
do,  this  can  always  be  depicted  as  a  “rational” 
calculation  of  self-interest.  Critics  of  rational 
choice  theories,  therefore,  do  not  reject  the  need 
for  rigorous  deductive  reasoning  in  the  social  sci¬ 
ences  but  point  to  the  limited  advantages  of 
restricting  explanations  of  social  and  political 
phenomena  to  the  consequences  of  individual 
behavior  only.  Preferences,  interests,  utility,  or  the 
application  of  rational  strategies  are  all  embedded 
in  historical,  cultural,  political,  and  societal  con¬ 
texts  and  cannot  merely  be  taken  for  granted  and 
reduced  to  axioms  in  deductive  arguments. 

The  Return  of  the  State  and 
Modernization  Approaches 

The  cultural  turn  in  political  sociology  mainly 
followed  the  general  shift  in  many  social  sciences 
away  from  social-scientific  approaches  toward 
cultural-scientific  approaches.  Yet  not  every  politi¬ 
cal  sociologist  was  convinced  by  the  arguments  of 
neo-  and  post-Marxists  to  abandon  modernization 
and  positivism.  Moreover,  the  cultural  turn  had 


Political  Sociology  as  a  Field  of  Study  2033 


taught  us  a  lot  about  power  and  dominance  in 
areas  such  as  sexual  relations  and  national  identi¬ 
ties,  but  it  contributed  relatively  little  to  our  under¬ 
standing  of  the  reciprocal  relationships  between 
the  social  and  the  political.  Postmodern  theorists 
usually  simply  reject  the  usefulness  of  such  con¬ 
cepts,  and  many  authors  inspired  by  conflict  and 
critical  theories  would  consider  the  relationships 
between  social  and  political  phenomena  as  a  rela¬ 
tively  unimportant  specimen  of  more  general  ques¬ 
tions  about  power  and  dominance.  By  contrast, 
rational  choice  theories  have  proven  their  useful¬ 
ness  in  many  areas  of  human  behavior  but  do  not 
seem  to  offer  much  for  the  explanation  of,  say,  the 
rise  of  the  nation-state  or  civic  engagement  in  revo¬ 
lutionary  eras. 

Criticism  of  structural-functional  system  theo¬ 
ries  and  modernization  approaches  has  not  just  led 
to  a  cultural  turn  of  the  neo-  and  post-Marxist 
type.  In  fact,  some  political  sociologists  continued 
to  improve  modernization  theories  and  to  apply 
them  to  social  and  political  developments.  A  gen¬ 
eral  feature  of  these  approaches  is  that  they  are  all 
highly  stimulated  by  actual  major  social  and 
political  events  in  the  past  decades  and  not  by  the 
desire  to  contribute  to  epistemological  and  ideo¬ 
logical  debates  or  to  develop  a  new  research  meth¬ 
odology.  Faced  with  the  wave  of  political  unrest 
sweeping  Western  countries  in  the  late  1960s,  the 
collapse  of  the  Soviet  Union  and  the  success  of 
European  integration  in  the  1990s,  and  the  ongo¬ 
ing  processes  of  economic  and  political  globaliza¬ 
tion,  political  sociologists  renewed  their  interest  in 
evolutionary  approaches.  Furthermore,  the  nation¬ 
state  apparently  did  not  disappear  but  appears  to 
be  surprisingly  flexible  and  able  to  attune  itself  to 
the  new  challenges  of  a  globalized  world  and  the 
rise  of  many  competitors.  Within  states,  ongoing 
complexity,  heterogeneity,  and  differentiation  do 
not  seem  to  result  in  converging  social  and  politi¬ 
cal  arrangements  but  in  perceptibly  different  ones. 
Between  states,  we  observe  a  continuous  rise  in  the 
number  of  states  and  a  further  differentiation  of 
their  nature  all  over  the  world. 

Pursuing  his  goals  with  determination  and  well 
aware  of  the  cultural  turn  proposed  by  neo-  and 
post-Marxists,  Ronald  Inglehart  has  been  urging 
for  a  very  different  cultural  turn  ever  since  he  pub¬ 
lished  his  theory  of  postmaterialist  value  change  in 
the  early  1970s.  Confronted  with  the  startling 


wave  of  political  unrest  in  many  Western  countries 
in  the  late  1960s,  he  points  to  the  rise  of  a  new 
generation  with  value  orientations  that  are  differ¬ 
ent  from  those  of  their  predecessors.  As  a  conse¬ 
quence  of  ongoing  modernization,  the  generation 
born  after  World  War  II,  especially,  rejects  author¬ 
ity,  material  advantage,  and  tradition.  Instead,  it 
gives  priority  to  goals  such  as  self-fulfillment,  par¬ 
ticipation,  and  lifestyle  issues.  As  newer  genera¬ 
tions  replace  old  ones,  deep-seated  social  and 
political  changes  will  take  place  (a  “silent  revolu¬ 
tion”).  Evidently,  Inglehart  uses  a  modernization- 
theoretical  approach  and  positivist  methods,  but 
he  does  not  presume  that  developments  in  modern 
society  necessarily  will  be  progressive.  Nor  does  he 
take  value  priorities  for  granted.  The  share  of  post¬ 
materialists  among  new  generations  is  likely  to 
increase  only  if  economic  growth  is  secured  and 
social  unrest  remains  absent.  Originally  starting 
with  a  focus  on  new  modes  of  participation  in 
Western  countries,  Inglehart  expanded  his  research 
to  almost  every  country  in  the  world  and  to  many 
aspects  of  social  and  political  developments.  Using 
standardized  comparative  surveys  of  representative 
population  samples,  his  work  also  shows  that  epis¬ 
temological  criticism  of  positivist  approaches  does 
not  necessarily  mean  that  highly  interesting  infor¬ 
mation  about  social  and  political  developments 
cannot  be  obtained.  Moreover,  methodological 
individualism  and  deductive  reasoning  are  not 
exclusive  privileges  of  rational-choice  approaches. 

Modernization  theories  experienced  a  strong 
revival  with  the  collapse  of  the  Soviet  Union  and 
the  rise  of  democratic  political  systems  around  the 
world  in  the  early  1990s.  At  first,  the  question 
about  the  causes  or  favorable  conditions  for  demo¬ 
cratic  changes  attracted  renewed  attention.  Was 
the  old  Lipset  thesis  about  the  social  requisites  of 
democracy  still  valid?  On  the  basis  of  a  strong  cor¬ 
relation  between  democracy  and  economic  devel¬ 
opment,  many  authors  concluded  that  economic 
development  provides  sufficient  (and  probably  also 
necessary)  conditions  for  democracy.  Yet  deviate 
cases  are  easy  to  find.  For  instance,  Nazi  Germany 
was  a  dictatorship  in  spite  of  being  economically 
advanced,  and  India  is  a  vibrant  democracy  despite 
its  socioeconomic  problems,  which  casts  doubts  on 
the  general  validity  of  the  argument.  Economic 
development  plays  an  important  role,  indeed,  but 
is  certainly  not  the  only  motivating  force.  Returning 
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to  the  work  of  Tocqueville,  Robert  Putnam  (1994) 
showed  that  democracy  relies  on  an  active  “civil 
society”  and  much  less  on  economic  development. 
His  theory  unequivocally  underlines  the  need  for 
social  capital  (especially  trust  and  confidence)  to 
produce  collective  goods  efficiently.  As  Tocqueville 
had  observed  in  the  United  States  in  the  early  19th 
century,  voluntary  associations  are  extremely 
important  for  the  functioning  of  democracy  since 
in  these  clubs,  groups,  organizations,  alliances, 
associations,  and  the  like,  people  develop  and 
maintain  social  networks.  Established  social  net¬ 
works,  in  turn,  facilitate  social  trust,  which  enables 
the  production  of  goods  without  coercion.  The 
functioning  of  democracy  is  such  a  major  collec¬ 
tive  good.  Putnam’s  research  in  Italy  and  the 
United  States  initiated  a  lively  debate  on  the 
“social  requisites  of  democracy” — depicting  social 
capital  as  the  crucial  determinant  and  curtailing 
the  importance  of  economic  factors. 

Whereas  Putnam’s  work  focuses  on  require¬ 
ments  for  the  functioning  of  democracy,  the  estab¬ 
lishment  of  democracy  is  a  different  topic.  Analyses 
of  a  transition  (or  transformation)  of  a  political, 
social,  and  economic  system  require  encompassing 
approaches  based  on  explicit  specifications  of  the 
reciprocal  relationships  between  social  and  politi¬ 
cal  developments.  For  the  transition  of  authoritar¬ 
ian  systems  into  democratic  ones,  evolutionary 
approaches  distinguishing  between  distinct  phases 
or  stages  of  democratization  rely  heavily  on 
Samuel  Huntington’s  idea  of  a  “third  wave.”  The 
first  wave  of  democratization  (from  the  mid-19th 
century  to  the  end  of  World  War  I)  coincided  with 
the  rise  of  the  nation-state,  whereas  the  second 
one,  starting  after  World  War  II  through  the  early 
1960s,  was  mainly  the  result  of  decolonization. 
According  to  Huntington,  the  third  wave,  from 
about  1975  to  the  end  of  the  20th  century,  consists 
of  the  spread  of  democracy  in  Latin  America  and 
Asia  and  the  disintegration  of  the  Soviet  Union. 
Economic  growth,  the  declining  legitimacy  of 
authoritarian  rule,  and  the  changing  role  of  the 
Catholic  Church  as  well  as  international  structural 
factors  such  as  the  rise  of  the  European  Union  and 
the  agreement  on  human  rights  included  in  the 
Helsinki  Treaty  in  1975  all  contributed  to  the 
spread  of  democracy.  By  the  end  of  the  20th  century 
democracy  had  reached  all  regions  of  the  world. 
South  America,  all  of  Europe,  and  considerable 


parts  of  Asia  and  Africa  have  been  turned  into 
democracies.  At  the  same  time,  it  is  clear  that  the 
three  waves  are  characterized  by  different 
processes  and  that  no  general  explanation  for 
democratization  is  available.  As  Huntington  and 
numerous  other  scholars  have  shown,  democracy 
can  be  reached  through  many  different  paths. 

Modifying  the  modernization  framework  of  the 
1950s  proved  to  be  very  helpful  in  studying  demo¬ 
cratic  transformations.  The  idea  of  evolutionary 
social  and  political  developments  with  distinct 
phases  or  stages  appeared  to  be  even  more  helpful 
for  the  study  of  democratic  consolidation.  In  each 
phase  or  stage,  the  changing  relationships  between 
social  and  political  factors  are  specified.  In  the  ini¬ 
tial  phase,  opposition  toward  the  ruling  elite  and 
undemocratic  arrangements  is  mobilized.  The 
request  for  more  liberty  is  broadly  accepted  and 
generally  seen  as  the  main  goal.  The  next  phase  is 
characterized  by  the  establishment  of  institutional 
arrangements  to  replace  the  old  undemocratic  ones. 
A  new  constitution  is  adopted,  and  general  elections 
are  organized  for  the  first  time.  An  easy  return  to 
the  Old  System  is  no  longer  feasible.  In  the  advanced 
phase,  attention  shifts  toward  the  achievements  of 
the  new  democracy  to  satisfy  group  interests,  and 
economic  performance  becomes  crucial.  Finally,  the 
phase  of  democratic  consolidation  is  reached  where 
the  new  arrangements  are  institutionalized,  and  the 
system  is  able  to  meet  the  demands  and  expecta¬ 
tions  of  large  parts  of  the  population. 

The  study  of  democratic  transformation  and 
consolidation,  once  again,  shows  the  advantages 
and  pitfalls  of  modernization-theoretical  ap¬ 
proaches  to  study  links  between  social  and  political 
phenomena.  Especially,  the  use  of  phases  and  stages 
easily  runs  the  risk  of  backsliding  into  teleological 
and  ideological  prejudices.  Just  as  American  liberal- 
capitalism  was  the  highest  stage  of  social  develop¬ 
ment  in  the  evolutionary  modernization  theories  of 
the  1940s  and  1950s,  various  transformation  and 
consolidation  theories  implicitly  take  democracy  as 
the  “highest”  or  “most  sophisticated”  system.  All 
other  systems  are  lumped  together  as  negative 
deviations  from  this  ideal,  under  terms  such  as  illib¬ 
eral  democracy ,  pseudodemocracy ,  partial  democ¬ 
racy,  or  defective  democracy.  More  recent 
approaches  reject  teleological  interpretations  and 
treat  political  systems  with  a  mixture  of  authoritar¬ 
ian  and  democratic  features  not  as  “deviant”  cases 
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but  as  novel  systems.  Obviously,  these  approaches 
do  not  exclude  the  chance  that  the  direction  of 
democratization  processes  can  be  reversed.  Only  a 
close  look  at  the  reciprocal  relationships  between 
state  and  society  can  provide  explanations  for  a 
return  to  authoritarian  rule. 

The  introduction  of  concepts  such  as  postmate¬ 
rialism,  social  capital,  or  transformation  and  con¬ 
solidation  establishes  a  cultural  turn  in  political 
sociology  that  is  quite  different  from  the  cultural 
turn  initiated  by  neo-  and  post-Marxists.  The  new 
variants  of  modernization  approaches  do  not  pre¬ 
sume  that  increasing  complexity,  heterogeneity, 
and  differentiation  inevitably  prepare  the  way  for 
a  liberal-capitalist  society  as  the  highest  stage  of 
human  development;  they  do,  however,  presume 
that  the  opportunities  for  democratic  transforma¬ 
tion  and  consolidation  strongly  depend  on  the 
social  and  economic  development  of  a  society. 
Further,  the  revival  of  modernization  approaches 
did  not  follow  the  shift  in  the  social  sciences  from 
social-scientific  approaches  toward  cultural- 
scientific  approaches  and  its  accompanying  change 
in  research  methodologies.  Discussions  about  the 
character  of  political  systems  are  usually  based  on 
quantitative,  standardized,  and  comparative  indi¬ 
cators  (e.g.,  the  Freedom  House  Index  or  large 
cross-national  surveys  among  populations),  and 
terms  such  as  social  capital  and  the  third  ivave 
clearly  underline  attempts  to  summarize  divergent 
phenomena  under  general  concepts  and  to  stress 
regularities.  In  a  similar  manner  to  rational  choice 
approaches,  recent  modernization  approaches  are 
not  based  on  a  rejection  of  positivism  or  appeals 
for  a  radically  different  epistemology. 

The  End  of  the  State? 

Political  sociology  has  come  a  long  way  since  its 
start  in  the  19th  century.  The  heydays  of  modern¬ 
ization  approaches  in  the  1940s  and  1950s  seemed 
to  mark  the  definite  breakthrough  and  establish¬ 
ment  of  political  sociology  as  an  independent  sub- 
discipline.  Yet  the  list  of  criticisms  and  objections 
against  conventional  political  sociology  is  long, 
and  many  modifications  and  alternatives  have 
been  presented.  Furthermore,  the  main  object  of 
political  sociology — power  and  dominance  in  the 
reciprocal  relationships  between  state  and  society — 
changed  dramatically  in  the  last  few  decades.  As  a 


consequence,  political  sociology  has  been  con¬ 
fronted  with  the  dual  task  of  dealing  with  severe 
epistemological  and  ideological  criticism  and  with 
deep-seated  changes  of  its  object  at  the  same  time. 
Several  appeals  for  a  paradigm  shift,  a  new  political 
sociology,  and  a  political  sociology  for  the  21st 
century  have  been  presented.  For  instance,  Kate 
Nash  (2000)  pleads  for  a  new  political  sociology 
that  should  not  be  mainly  concerned  with  states  or 
class-based  approaches  to  narrowly  defined  politi¬ 
cal  phenomena  but  with  cultural  politics.  These 
“politics”  should  be  understood  in  the  broadest 
possible  sense,  and  the  focus  is  on  conflicts  about 
social  identities  and  structures  and  the  opportuni¬ 
ties  to  change  them.  Although  this  depiction  does 
not  exclude  the  state  as  a  major  object  of  political 
sociology,  it  is  clear  that  power  and  dominance  are 
the  key  features  of  such  a  new  political  sociology. 

For  discussions  of  actual  and  desirable  modifica¬ 
tions  of  political  sociology,  a  reappraisal  of  the 
changing  position  of  the  state  is  required.  Tradition¬ 
ally,  political  sociologists  have  a  complicated 
relationship  with  the  state  as  their  pet  topic.  They 
celebrate  the  concept  as  the  main  subject  matter  of 
their  studies  of  state  formation,  the  development  of 
the  state  (especially  the  nation-state),  and  the 
chances  for  democratic  consolidation.  They  study 
the  ongoing  blending  and  melding  of  social  and 
political  phenomena  within  states  and  observe  the 
blurring  of  the  distinctions  between  the  two  in  an 
era  of  radicalized  modernity.  Some  of  them  examine 
the  disappearance  of  the  state  in  a  world  character¬ 
ized  by  globalization  and  increasing  interdependen¬ 
cies.  Others  observe  the  rise  of  new  forms  of  the 
state  (particularly  in  Europe)  and  study  the  evapo¬ 
ration  of  states  confronted  with  powerful  multina¬ 
tional  corporations,  criminal  gangs,  or  NGOs.  By 
the  mid-1980s,  the  diminishing  position  of  the  state 
as  a  central  topic  for  political  sociologists  was  coun¬ 
terbalanced  by  attempts  to  “bring  the  state  back 
in.”  Following  appeals  by  Theda  Skocpol  and  her 
collaborators,  the  state  was  rediscovered  and  reac¬ 
knowledged  as  an  autonomous  actor.  Formal  polit¬ 
ical  institutions,  especially,  attracted  renewed  atten¬ 
tion  as  important  factors  for  the  distribution  of 
power  and  dominance.  Not  only  the  behavior  of 
politicians  but  also  that  of  citizens  or  elites  is 
relevant  for  the  relationships  between  state  and 
society:  The  constitutional  framework,  electoral 
laws,  the  composition  of  government,  and  many 
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other  institutional  arrangements  are  evidently  rele¬ 
vant  as  well.  In  short,  institutions  matter. 

Among  the  divergent  depictions  of  the  position 
of  the  state,  all  agree  that  the  domain  of  politics 
lost  its  characteristic  features  and  government  its 
preeminent  position  in  the  distribution  of  wealth 
and  the  management  of  public  services  in  society  in 
the  past  decades.  For  many,  the  distribution  of 
power  and  dominance  changed  fundamentally 
with  the  changing  nature  and  position  of  the  state. 
The  observation  that  nowadays  there  is  no  escape 
from  politics  is  correct  but  does  not  imply  that  the 
social  and  the  political  cannot  be  distinguished 
analytically  and  conceptually.  With  its  traditional 
focus  on  the  reciprocal  relationships  between  the 
political  and  the  social,  political  sociology  is  essen¬ 
tially  open  to  all  these  observations  and  interpreta¬ 
tions.  The  question  remains,  however,  as  to 
whether  the  changes  in  the  nature  and  position  of 
the  state  require  a  new  political  sociology — and  if 
so,  what  such  a  renewal  should  look  like. 

The  rapid  and  permanent  changes  of  the 
relationships  between  state  and  society  are  highly 
relevant  for  political  sociologists,  and  none  of  the 
processes  referred  to  can  or  should  be  abandoned 
as  study  topics.  Examining  the  consequences  of 
these  developments,  Irving  Horowitz  (1999)  ob¬ 
served  that  classical  political  sociological  ap¬ 
proaches  are  not  very  useful  since  we  are  confronted 
by  a  “larger  scale,”  which  lies  very  far  beyond  any¬ 
thing  that  could  have  been  imagined  by  Montesquieu, 
Tocqueville,  Durkheim,  or  Weber.  Although  impor¬ 
tant,  the  idea  of  differences  in  scale  between  tradi¬ 
tional  and  actual  states  and  societies  does  not  cover 
the  fundamental  changes  experienced.  Studying 
these  changes  and  their  implications  for  the  rela¬ 
tionships  between  state  and  society  is  only  possible 
if  political  sociologists  stick  to  a  broad  conceptual¬ 
ization  of  the  political  and  the  social  and  avoid  any 
a  priori  restrictions.  Major  developments  relevant 
for  making  political  sociology  more  amenable  to 
study  are  discussed  as  follows. 

Identity 

The  consequences  of  the  ongoing  process 
of  blending  and  melding  of  social  and  political 
phenomena  for  citizens  are  hard  to  summarize. 
Many  authors  have  pointed  out  changes  in  social 
identity — that  is,  the  ways  in  which  individuals 


label  themselves  as  members  of  particular  groups. 
Social  identity  can  be  based  on  nation,  class,  eth¬ 
nicity,  gender,  and  so  on  and  has  important  conse¬ 
quences  both  for  the  individual  concerned  and  for 
the  distribution  of  power  and  dominance  in  soci¬ 
ety.  For  political  sociologists,  the  fact  that  societies 
increasingly  become  more  complex,  heteroge¬ 
neous,  and  differentiated  always  implied  a  con¬ 
stant  change  in  social  groups.  Especially,  critical 
theorists  emphasize  that  in  postmodern  societies, 
the  development  of  social  identities  cannot  be 
explained  by  structural  factors  alone.  Instead,  it  is 
the  process  of  acquiring  identities  itself  that  has 
changed.  Although  usually  starting  with  quite  dif¬ 
ferent  assumptions,  much  of  the  work  on  social 
capital  and  political  culture  is  also  based  on  the 
recognition  that  blending  and  melding  of  social 
and  political  developments  have  radically  changed 
the  ways  in  which  people  identify  with  groups. 
Social  identity — and  not  the  structural  aspects  of 
the  social  or  the  political — becomes  increasingly 
relevant  for  political  sociology. 

Civil  Society 

Closely  related  to  the  rising  relevance  of  social 
identity  is  the  need  for  a  reassessment  of  the  role 
and  function  of  groups  and  associations  in  societ¬ 
ies,  where  social  and  political  processes  are  hard  to 
distinguish.  Considered  to  be  intermediaries  be¬ 
tween  state  and  society,  groups  and  associations 
have  been  textbook  topics  in  political  sociology  for 
a  long  time.  More  recently,  several  authors  point  to 
the  fact  that  a  shift  from  state-centered  governance 
toward  self-governing  associations  in  civil  society 
can  be  observed.  Groups  and  associations  still  per¬ 
form  many  of  their  traditional  functions — such  as 
interest  mediation  and  mobilization — but  instead 
of  being  participants  among  many  other  partici¬ 
pants,  they  gradually  developed  into  more  inde¬ 
pendent  entities,  claiming  to  be  essential  elements 
of  the  fabric  of  domestic  society.  Clearly,  in  the 
spirit  of  many  political  sociologists,  starting  with 
Tocqueville  and  Marx,  democracy  and  civil  society 
are  seen  as  two  sides  of  the  same  coin.  In  a  world 
characterized  by  the  blending  and  melding  of  the 
social  and  the  political,  the  rise  of  civil  society  and 
its  drive  to  replace  conventional  interest  groups 
and  associations  is  one  of  the  most  interesting 
challenges  for  political  sociologists. 
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States  and  Other  Actors 

Acknowledging  the  end  of  the  dominant  posi¬ 
tion  of  the  state  implies  the  recognition  of  the 
increasing  relevance  of  other  actors  and  partici¬ 
pants.  This  expansion  can  be  easily  accepted  and 
discussed  in  terms  of  the  erosion  of  sovereignty  or 
the  loss  of  regulatory  control  by  the  state.  Important 
aspects  include  struggles  between  states  and  big 
companies  (limits  of  private  initiatives,  antitrust 
legislation,  etc.)  and  competition  between  multina¬ 
tional  corporations.  The  most  threatening  danger 
here  is  that  the  main  advantage  political  sociolo¬ 
gists  realized  in  the  past  few  decades — the  rejection 
of  attaching  a  priori  causal  direction  to  either  soci¬ 
ety  or  the  state  in  their  analyses — is  given  up 
effortlessly  and  replaced  by  a  fashionable  depic¬ 
tion  of  the  state  as  the  main  loser  in  a  world  con¬ 
trolled  by  multinational  corporations  and  NGOs. 
Unless  political  sociologists  come  up  with  fruitful 
conceptualizations  of  the  relationships  between 
states  and  other  actors,  we  will  see  a  revival  of 
society-centered  approaches  and  a  relapse  to  a 
sociology  of  politics  of  the  pre-Sartori  era. 

Democracy 

Democracy  is  closely  linked  to  the  rise  of  the 
national  state  and  clearly  defined  within  its  bor¬ 
ders.  States  can  do  very  well  without  being  demo¬ 
cratic,  but  so  far,  it  has  taken  the  organized  and 
limited  forms  of  political  power  of  the  conven¬ 
tional  state  to  promote  democracy:  no  state,  no 
democracy.  Furthermore,  democracy  continues  to 
spread  around  the  world.  Freedom  and  liberty  are 
not  restricted  to  wealthy  countries,  and  many  poor 
and  developing  states  have  a  record  of  respecting 
political  rights  and  civil  liberties.  The  problem 
with  this  development,  however,  is  that  it  appears 
to  be  rather  easy  to  create  a  blend  of  formal 
democracy  and  political  corruption,  civil  rights 
abuses,  and  autocratic  rule.  While  only  a  few 
countries  have  slid  backward  into  military  rule, 
many  more  seem  to  reach  a  standoff  or  cease-fire 
between  democratic  and  nondemocratic  forces, 
where  elected  governments  fail  to  regulate  or  take 
control  of  the  most  powerful  social  and  economic 
groups  in  society.  For  political  sociologists,  very 
interesting  cases  and  questions  develop  that  bring 
us  back  to  the  heart  of  the  traditional  discussions 


about  social  conditions  for  democratic  rule — and 
for  undemocratic  rule. 

Globalization 

Since  democracy  is  closely  linked  to  the  nation¬ 
state,  many  scholars  emphasize  that  globalization 
presents  new  challenges  to  the  study  of  the  recipro¬ 
cal  relationships  between  state  and  society.  If 
political  power  is  no  longer  concentrated  in  states, 
then  democratic  control  should  be  expanded 
beyond  the  borders  of  the  state.  A  global  civil  soci¬ 
ety  is  presumed  to  fill  the  gaps  here.  The  struggles 
between  competing  groups  in  a  globalized  world, 
however,  are  a  familiar  topic  for  political  sociolo¬ 
gists,  and  no  radical  changes  are  required  to  deal 
with  questions  about  democracy  and  globaliza¬ 
tion.  Yet  the  old  focus  on  reciprocal  relationships 
between  the  social  and  political  within  states  is  no 
longer  appropriate,  and  the  global  connections 
and  interdependencies  of  social,  economic,  and 
political  actors  should  also  be  considered. 

New  Technologies 

It  is  not  just  advocates  of  modernization 
approaches  who  presume  that  technical  develop¬ 
ments  in  communication  and  transportation  have  a 
clear  impact  on  power  and  dominance  between 
individuals  and  groups.  As  the  cliche  goes,  the 
world  has  become  smaller  in  many  respects  in  the 
past  few  decades.  Important  consequences  of  this 
development  are  already  included  above  under 
headings  such  as  changing  identities  and  globaliza¬ 
tion.  New  technologies  further  reduce  the  tradi¬ 
tional  lines  of  demarcation  between  social  and 
political  phenomena  by  making  information  univer¬ 
sally  available  and  communication  a  routine  matter. 
There  does  not  seem  to  be  much  need  to  expand 
studies  on  the  impact  of  technological  innovations 
in  general,  but  political  sociologists  should  include 
the  spread  of  new  technologies  as  an  important 
aspect  of  each  of  the  aforementioned  five  topics. 

Recent  social-end  political  developments — the 
rise  of  postmodern  civil  society,  democratization, 
globalization,  and  new  technologies — have  important 
consequences  for  political  sociology  because  they 
fundamentally  change  the  distribution  of  power  and 
dominance.  The  conventional  distinction  between 
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the  social  and  the  political  as  the  main  object  of 
interest  for  political  sociologists,  however,  is  abso¬ 
lutely  indispensable  for  analytical  purposes. 
Precisely  because  the  state  appears  to  change  its 
nature  and  position  rapidly,  a  coherent  conceptual 
framework  is  required  to  study  these  developments. 
Appeals  for  a  new  political  sociology  lay  too  much 
emphasis  on  the  ways  in  which  the  world  is  chang¬ 
ing  and  tend  to  undervalue  the  analytical  and  con¬ 
ceptual  clarity  provided  by  the  old  approaches. 
New  directions  for  political  sociology,  then,  should 
be  based  on  the  acceptance  of  fundamental  changes 
and  of  evident  continuities  in  the  role  and  position 
of  the  state  as  well  as  on  the  recognition  of  the  need 
for  theoretical  and  conceptual  innovations  and  on 
the  demonstrated  usefulness  of  available  approaches. 

Political  Sociology  as  a  Field  of  Study? 

The  heydays  of  political  sociology  appear  to  be 
over  at  first  glance.  Newer  approaches  and  vari¬ 
ants — such  as  neo-institutionalism,  comparative 
sociology,  political  economy,  comparative  politics, 
and  postmodernism — gradually  seem  to  seize  the 
intellectual  arenas  that  used  to  be  reserved  for 
political  sociologists.  The  almost  euphoric  postwar 
period  of  widely  shared  structural-functional  mod¬ 
ernization  approaches  was  followed  by  vivacious 
controversies  about  the  object  of  the  discipline  as 
well  as  its  epistemological  foundations  and  meth¬ 
odological  performance.  Two  cultural  turns,  one 
economic  turn,  and  a  renaissance  of  moderniza¬ 
tion  theories  apparently  left  the  field  dispersed  and 
divided.  Feasible  accounts  for  this  ostensible 
decline  are,  first,  the  disappearance  of  a  clear-cut 
distinction  between  the  social  and  the  political 
and,  second,  a  general  move  toward  more  special¬ 
ized  subfields  in  the  social  sciences. 

Already,  Runciman  depicted  the  enormous 
“expansion  of  the  political”  as  the  most  important 
change  in  the  history  of  the  modern  state.  This 
development  still  continues.  Important  as  this 
extension  is  in  quantitative  terms,  we  experience 
not  only  a  strengthening  of  the  position  of  the  state 
but  also  a  change  in  the  nature  of  the  relationships 
between  the  social  and  the  political.  The  distinction 
between  state  and  society — the  main  premise  of 
political  sociology — has  gradually  disappeared 
with  the  expansion  of  the  state  and  has  been 
replaced  by  a  much  more  ambiguous  melding  and 


blending  of  state  and  society.  The  reluctance  to 
deal  with  the  changing  distribution  of  political 
power  within  and  between  states  probably  con¬ 
tributed  considerably  to  the  decline  of  political 
sociology  as  a  self-reliant  field  of  study.  Power  and 
dominance  can  be  studied  in  many  spheres  of 
life — hospitals,  neighborhoods,  executive  boards, 
and  so  on — without  referring  explicitly  to  some 
reciprocal  relationships  between  the  social  and  the 
political.  However,  it  is  precisely  the  disappear¬ 
ance  of  a  clear  distinction  between  state  and  soci¬ 
ety  that  makes  the  study  of  power  and  dominance 
much  more  interesting  and  relevant  by  focusing  on 
the  connections  between  the  two  phenomena;  that 
is,  power  and  dominance  are  not  the  main  objects 
of  interest  in  themselves  but  as  part  of  the  pro¬ 
cesses  underlying  the  dissolution  of  the  conven¬ 
tional  distinctions  between  state  and  society. 
Political  sociology  as  a  field  of  study  offers  excel¬ 
lent  opportunities  to  deal  with  the  melding  and 
blending  of  the  social  and  the  political. 

A  second  explanation  for  the  apparent  decline 
of  political  sociology  is  related  to  the  move  toward 
more  specialized  subfields  in  the  social  sciences. 
This  change  is  part  of  the  more  general  change 
away  from  social-scientific  approaches  (emphasiz¬ 
ing  causal  explanations  based  on  regularities, 
mainly  applying  quantitative,  comparative  meth¬ 
ods)  toward  cultural-scientific  approaches  (empha¬ 
sizing  specific  cultural  meanings  and  constructions 
of  meanings,  mainly  applying  qualitative,  case- 
oriented  methods).  With  the  economic  turn  and  the 
renaissance  of  modernization  approaches,  this 
general  change  is  counterbalanced.  As  a  result, 
many  subfields  flourish,  while  the  label  political 
sociology  increasingly  appears  to  be  too  unspecific. 
The  apparent  decline  of  political  sociology  as  a 
field  of  study,  then,  is  at  least  partly  due  to  the 
evident  accomplishments  of  the  initial  enterprise:  If 
many  more  specialized  subsubfields  carry  on  suc¬ 
cessfully,  the  broader  idea  loses  much  of  its  appeal. 

Political  sociologists  continue  to  contribute  to 
our  understanding  of  the  distribution  of  power 
and  dominance  in  society.  Many  of  the  actual  top¬ 
ics  suggested  by  advocates  of  new  approaches — 
identities,  citizenship,  new  social  movements,  and 
so  on — do  not  differ  from  topics  studied  by  tradi¬ 
tional  envoys  of  political  sociology.  Instead  of 
debating  claims  for  new  approaches  or  paradigm 
shifts,  the  consensus  about  the  selection  of  relevant 
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topics  should  be  stressed.  Political  sociologists 
have  been  mainly  concerned  with  input-oriented 
microlevel  approaches  (voting  behavior,  new  social 
movements,  parties,  interest  groups,  etc.),  on  the 
one  hand,  and  with  output-oriented  macrolevel 
approaches  (state  formation,  transformations  of 
political  systems  and  political  reform  processes, 
etc.),  on  the  other.  The  very  rapid  spread  of  ratio¬ 
nal  choice  approaches  in  many  social  sciences  in 
the  past  few  decades  reflects  a  need  for  more  pre¬ 
cise  theoretical  arguments  than  is  usually  provided 
by  available  approaches.  Therefore,  the  most 
important  theoretical  enhancements  in  political 
sociology  as  a  field  of  study  focus  on  the  reformu¬ 
lation  of  existing  approaches  in  more  rigorous 
ways — that  is,  in  more  deductive  ways.  Note  that 
this  does  not  necessarily  imply  a  reduction  of  all 
behavior  to  naive  utility  maximizing  nor  does  it 
require  the  use  of  formal  modeling  or  quantifica¬ 
tion.  Institutions,  norms,  and  values;  historical 
peculiarities;  collective  goods;  and  frustrations  all 
can  have  a  place  in  deductive  theories  attempting 
to  provide  microfoundations  for  macrophenom¬ 
ena.  Forthcoming  theoretical  approaches  in  politi¬ 
cal  sociology  as  a  field  of  study,  then,  will  mainly 
consist  of  attempts  to  provide  these  foundations. 
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University  of  Mannheim 
Mannheim,  Germany 
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Political  Systems,  Types 


The  term  political  system  is  an  abstract  concept 
and  is  used  widely  and  often  with  different  mean¬ 
ings.  In  this  entry,  a  political  system  is  defined  as 
a  form  of  governing  society  that  is  embedded  in  a 
legal  (constitutional),  economic,  and  cultural  envi¬ 
ronment.  The  essence  of  a  political  system  relies 
on  the  interdependence  of  its  institutions  (rules  of 
the  political  game)  and  collective  actors  (political 
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parties,  organized  interests,  governments,  and 
bureaucracies)  that  operate  within  such  a  system. 
From  this  definition,  it  follows  that  political  sys¬ 
tems  are  seen  as  a  whole:  Authority  is  exercised 
over  a  territory  through  the  body  politic,  includ¬ 
ing  its  state  format,  organization  of  public 
decision  making,  and  related  processes  of  policy 
formation.  The  study  of  types  of  political  systems 
is — almost  by  definition — comparative  by  means 
of  developing  typologies.  One  can  distinguish 
between  two  directions  in  the  study  of  types  of 
political  systems:  one,  the  descriptive  approach 
and,  two,  the  analytical  approach  by  developing 
typologies. 

The  descriptive  approach  has  a  long-standing 
history  in  political  science  and  focuses  on  typolo¬ 
gies  of  political  systems — often  on  the  basis  of  tax¬ 
onomies,  that  is,  a  (more  or  less)  logical  ordering  of 
types  that  are  hierarchically  organized.  An  example 
of  such  a  taxonomy  is  Aristotle’s  classification  of 
political  regimes.  On  the  one  hand,  Aristotle 
divided  the  politics  of  his  time  into  two  types:  good 
versus  corrupt  governance.  Many  of  these  typolo¬ 
gies  have  been  developed  over  time,  and  more  often 
than  not,  they  were  directed  by  normative  ideas 
(e.g.,  Montesquieu,  the  separation  of  powers). 
Other  typologies  have  been  driven  by  forms  of  cul¬ 
ture,  economic  systems,  or  stages  of  societal  devel¬ 
opment,  and  so  on.  All  these  classifications  are  not 
only  subject  to  normative  belief  systems  but  are 
also  limited,  as  they  describe  the  state  of  affairs 
rather  than  explain  how  and  why  these  differences 
have  emerged  or  what  they  pertain  to. 

The  analytical  approach  aims  at  developing 
comparative  typologies  that  are  not  only  (or  only 
indirectly)  normative  but  also  evidence  based.  One 
of  the  earliest  attempts  was  made  by  Lord  Bryce, 
who  compared  the  democracies  of  his  time  by 
means  of  seeking  the  commonalities  between 
democratic  systems  that  made  them  different  from 
other  political  systems  (e.g.,  direct  vs.  indirect 
forms  of  representation  and  decision  making). 
Others  tried  to  develop  classifications  on  the  basis 
of  theory  (e.g.,  Max  Weber’s  trichotomy  of  tradi¬ 
tional,  charismatic,  and  rational  rule  and  Gabriel 
Almond  and  Sidney  Verba’s  idea  of  variations  in 
political  culture)  or  by  means  of  evidence-based 
induction  (e.g.,  Arend  Lijphart,  1999). 

The  use  of  typologies  in  political  science  is 
widespread  and  considered  as  useful  for  theory 


development,  descriptive  analysis,  and  reducing. 
This  entry  first  elaborates  the  method  of  typology 
construction  and  offers  some  examples.  As  will 
become  clear,  the  use  of  a  typology  is  more  often 
than  not  an  instrument  for  analysis  in  comparative 
political  science  (see  also  Paul  Pennings,  Hans 
Keman,  &  Jan  Kleinnijenhuis,  2006).  Second,  this 
entry  discusses  a  number  of  existing  typologies  of 
political  systems  that  can  be  labeled  as  descriptive 
and  have  figured  in  comparative  political  science. 
Third,  the  focus  is  on  typologies  where  the  aim  is 
to  explain  the  patterned  variation  in  politics  that 
emerges  from  empirical  investigation.  For  exam¬ 
ple,  Robert  Dahl  and  Arend  Lijphart  have  devel¬ 
oped  typologies  to  highlight  the  intradifferences 
within  democratic  polities.  Whereas  the  former 
focuses  on  the  institutional  configuration  of  rules 
and  rights  resulting  in  polyarchy,  the  latter  devel¬ 
oped  a  typology  (i.e.,  majoritarian  vs.  consensus 
democracy)  to  understand  the  actual  working  of  a 
democratic  political  system.  Finally,  the  entry 
moves  to  a  specific  approach  in  political  science:  A 
Systems  Analysis  of  Political  Life  (1965)  as  devel¬ 
oped  by  David  Easton.  This  approach  is  deductive 
and  claims  to  explain  the  variations  across  diverse 
systems.  In  this  approach,  the  organization  of  gov¬ 
ernment  is  the  core  of  each  system.  This  approach 
is  elaborated  by  examining  patterned  variations  of 
government. 

Developing  a  Typology 

Although  typologies  are  not  exclusive  to  compara¬ 
tive  politics,  they  are  frequently  used  in  this  subdis- 
cipline  to  sustain  and  develop  arguments  as  regards 
the  organization  and  institutional  design  of  politi¬ 
cal  systems.  This  is  not  surprising  since  political 
systems  are  seen  as  closed  entities  (like  states,  poli¬ 
ties,  governments,  etc.).  To  reduce  the  real-world 
complexities,  comparativists  have  always  made 
attempts  to  translate  their  ideas  on  political  sys¬ 
tems  into  typologies.  Aristotle  did  so,  as  did 
Polybios  (comparing  various  Greek  city-states) 
and,  later,  Montesquieu  and  Lord  Bryce. 

Typologies  are  seen  to  be  useful  as  a  proto¬ 
theory.  They  are  used  not  only  to  reduce  complex¬ 
ity  but  also  to  inspect  certain  (hypothetical) 
relationships.  For  instance,  one  can  develop  a 
typology  that  combines  two  dimensions  such  as 
democracy  (yes/no)  and  economic  development 
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(high/low)  and  inspect  the  hypothesis  that  democ¬ 
racy  and  welfare  correlate.  It  is  easy  to  understand 
that  such  relatively  simple  typologies  are  more 
often  than  not  victim  of  the  choices  made  by  the 
researcher  and,  therefore,  lead  to  biased  perspec¬ 
tives.  However,  as  a  proto-theory,  a  typology  can 
certainly  help  refine  and  develop  concepts. 
Furthermore,  a  typology  assists  in  observing  to 
what  extent  concepts  function  empirically.  In 
short,  a  properly  developed  typology  is  conducive 
to  concept  formation,  theory  development,  and 
validation  in  view  of  empirical  evidence  (Mattei 
Dogan  &  Dominique  Pelassy,  1990). 

Typology  development  is  in  some  ways  a 
Scylla  and  Charybdis  problem:  On  the  one  hand, 
simple  typologies  can  enhance  clarity  in  a  system¬ 
atic  fashion;  on  the  other  hand,  however,  over¬ 
elaboration  lurks  around  the  corner  (by  develop¬ 
ing  too  many  categories  or  subdivisions  within 
one  category).  Although  a  refined  typology  can 
be  useful  and  points  to  all  logical  possibilities,  it 
also  tends  to  produce  confusion  and  complexity. 
There  is  no  solution  to  this  problem  other  than 
empirical  verification.  As  a  methodological  prin¬ 
ciple,  the  researcher  has  to  follow  as  a  rule  of 
thumb  that  a  classification  scheme  as  applied  to  a 
typology  should  include  as  many  real  cases  as 
feasible,  and  at  the  same  time,  these  cases  should 
exclusively  fit  only  one  of  the  cells  of  the  typol¬ 
ogy.  This  means  that  arbitrary  cases  are  to  be 
avoided  as  much  as  possible.  This  not  only  avoids 
confusion  but  also  prevents  the  occurrence  of 
hybrid  systems. 

Take  for  example  Lijphart’s  typology  of  democ¬ 
racies  (Lijphart,  1968).  There  are  two  dimensions: 

(1)  elite  behavior  (cooperation  vs.  competition)  and 

(2)  whether  society  is  divided  or  not  (homogeneous 
vs.  fragmented  societies).  This  leads  to  a  2  X  2 
typology  with  four  types:  centripetal,  centrifugal, 
depoliticized,  and  consociational.  In  this  typology, 
the  basic  argument  is  that  under  divisive  societal 
conditions — where  sociocultural  cleavages  are 
politically  organized  (by  parties  or  organized  inter¬ 
ests) — the  key  condition  for  stability  is  elite  behav¬ 
ior  (being  the  first  dimension  of  the  typology)  to 
regulate  political  behavior  at  the  mass  level.  Lijphart 
argues  that  whether  elites  produce  stability  depends 
on  the  type  of  conflict  in  plural  societies. 

So  far,  so  good:  There  is  an  explanation,  and  it 
appeared  plausible  (at  the  time).  However,  others 


have  also  attempted  to  use  this  typology  for  other 
systems  and  found  it  difficult  to  decide  for  all  rel¬ 
evant  cases  where  to  locate  them  in  this  fourfold 
table  or  to  accept  that  the  proposed  mechanism 
did  indeed  explain  the  political  process  for  all  the 
cases  in  each  separate  cell.  This  is  not  the  place  to 
discuss  Lijphart  further.  The  main  point  is  that 
typologies  often  flounder  in  view  of  empirical  evi¬ 
dence  or  are  biased  in  terms  of  case  selection  and 
measurement  problems.  In  addition,  one  problem 
with  this  is  that  many  typologies  tend  to  be  exclu¬ 
sively  focused  on  the  commonalities  among  the 
cases  rather  than  taking  into  account  the  differ¬ 
ences  that  exist.  Finally,  sometimes  cases  do  not  fit 
the  defined  cells  because  of  additional  circum¬ 
stances  that  defy  the  hypothesized  relationship.  In 
other  words,  typology  construction  is  certainly  a 
useful  tool  for  comparing  political  systems,  but  it 
is  also  vulnerable  to  misplacement  and  biased 
results. 

In  summary,  typologies  are  considered  as  a  useful 
instrument  to  develop  a  proto-theory  or  a  concep¬ 
tual  design  and  can  serve  as  operational  controls  on 
how  far  they  travel  in  reality.  In  addition,  typologies 
help  reduce  the  complexities  of  researching  political 
systems  in  order  to  create  a  systematic  account  of 
how  the  real  world  can  be  ordered.  Even  taking  into 
account  the  pitfalls  and  the  caveats  mentioned  here, 
the  typology  has  been  and  continues  to  be  one  of  the 
foremost  tools  of  comparative  political  science. 
Hence,  and  this  is  the  topic  of  the  next  section, 
typologies  are  an  essential  analytical  step  to  analyze 
and  investigate  types  of  political  systems. 

Descriptive  Typologies  of  Political  Systems 

Throughout  the  development  of  political  science, 
classifications  have  been  developed,  and  those  of 
Polybios,  Aristotle,  Montesquieu,  and  others  have 
already  been  mentioned.  A  number  of  these  are 
basically  (unidimensional)  listings  according  to  a 
feature  considered  to  be  a  central  one,  such  as,  for 
example,  the  “state”  or  “forms  of  government,”  as 
has  been  developed  by  Samuel  Finer  (1997).  Other 
examples  are  developmental  classifications  that 
define  historical  stages  of  society,  each  producing 
a  specific  type  of  political  authority. 

A  classic  example  is  Friedrich  Engels’s  description 
of  the  change  and  development  from  a  primitive 
society  to  a  communist  society.  This  anthropological 
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approach  related  the  production/consumption  pat¬ 
terns  of  society  to  political  authority  (from  absence 
of  rule  to  authoritarian  rule  to  self-rule).  This  teleo¬ 
logical  model  can  be  seen  as  an  almost  conditional 
set  of  political-economic  stages  predicting  the  type 
of  political  system  to  emerge  in  the  course  of  history, 
based  on  Marxist  ideas  regarding  societal  develop¬ 
ment  culminating  in  “classless  communism.” 

Max  Weber,  a  famous  German  sociologist,  has 
earned  a  reputation  by  defining  the  state  as  well  as 
developing  a  typology  of  ruling  systems.  Whereas 
his  definition  of  the  state  essentially  rested  with  the 
exercise  of  (legitimate)  power  over  a  territory,  his 
typology  concerned  an  ideal  type  that  served  as  a 
proto-theory,  arguing  that  the  degree  of  state 
development  is  associated  with  a  certain  type  of 
political  rule,  of  which  the  rational-legal  rule  (i.e., 
bureaucracy)  is  the  modern  one. 

Contrary  to  Engels’s  teleological  approach,  the 
Weberian  approach  can  be  considered  as  a  proto¬ 
theory.  Most  approaches  before  the  1960s  were  in 
fact  descriptive,  were  based  on  broad — if  not 
vague — analytical  distinctions,  and  often  tended  to 
be  close  to  teleological  reasoning  (i.e.,  a  future 
goal-oriented  argument).  In  particular,  develop¬ 
mental  studies  focusing  on  the  poorer  regions  suf¬ 
fered  from  this  bias. 

More  recently,  classification-based  typologies 
of  wholesale  political  systems  are  becoming  less 
popular  among  comparativists.  The  analytical 
focus  of  researchers  concerned  with  the  study  of 
political  systems  has  shifted  to  highlighting  specific 
features  of  political  systems.  This  change  in  focus 
also  meant  that  typologies  were  confined  to  spe¬ 
cific  types  of  systems  such  as  democracies. 
Examples  are  party  systems,  electoral  systems, 
cleavage  systems,  parties,  governmental  features, 
and  so  on  (see,  e.g.,  Dogan  &  Pelassy,  1990). 

Jean  Blondel  (1982)  developed,  for  instance,  a 
simple  comparative  typology  of  party  systems  by 
counting  the  number  of  parties  that  are  repre¬ 
sented  on  average  (from  a  two-party  system  to  a 
multiparty  system).  This  would  explain  the  differ¬ 
ences  between  legislatures  in  terms  of  interactions 
between  parties  and  government.  His  contempo¬ 
rary,  Maurice  Duverger,  argued  that  the  effect  of  a 
party  system  is  related  to  government  stability: 
The  fewer  the  number  of  parties,  the  more  stable 
party  government  would  be.  Hence,  according  to 
Maurice  Duverger,  the  nexus  between  the  electoral 


system  and  the  party  system  could  be  considered  as 
a  predictor  of  how  a  (democratic)  political  system 
would  function.  In  fact,  Duverger’s  law  (as  it  is 
also  known)  is  conducive  to  a  typology  of  electoral 
laws  and  system  stability.  The  so-called  first-past- 
the-post  electoral  system  would  be  superior  to 
proportional  representation  because  it  produces  a 
stable  single-party  government  as  opposed  to  an 
unstable  coalition  government.  Most  of  the  classi¬ 
fications  made  regarding  electoral  and  party 
systems,  however,  remain  simple,  but  as  regards 
questions  of  voting  and  party  behavior,  they  are 
merely  descriptive. 

Stein  Rokkan  is  also  well-known  and  respected 
for  his  work  on  modeling  (the  term  Rokkan  is  used 
for  developing  a  typology  of  political  systems)  the 
emergence  of  national  states  in  Europe.  His  con¬ 
cern  was  to  understand  how  political  systems 
developed  to  take  on  the  contemporary  shape  and 
organization  of  the  nation-state  in  Europe.  To  this 
end,  he  developed  grids  of  reference  or,  in  fact, 
dimensions  to  account  for  the  functional  and  terri¬ 
torial  differentiation  across  the  European  area  from 
a  historical  perspective.  In  his  view,  the  only  way  to 
do  so  is  to  make  macromodel  comparisons  that  are 
subsequently  specified  by  means  of  region-specific 
models  and  highlighted  by  individual  cases. 

Rokkan’s  typology  (see  Peter  Flora,  Stein 
Kuhnle,  &  Derek  Urwin,  1999)  contains,  first,  a 
trichotomy:  economy-territory-culture.  From 
these  master  dimensions,  specific  indicators  to 
analyze  pathways  to  the  contemporary  form  of  the 
state  are  developed:  religious  diversity,  linguistic 
variation,  ethnic  distinctiveness,  dispersal  of  land 
ownership,  urbanization,  industrialization,  trade, 
geographical  location  and  size  (including  center/ 
periphery  situations),  (de)centralization  of  author¬ 
ity,  and  polity  formation  (constitutional  develop¬ 
ment).  This  grid  of  reference  served  the  purpose  of 
accounting  for  the  cross-system  variation  of  state 
formation  over  time  and  ultimately  of  explaining 
the  Europe-wide  democratization  within  the 
emerging  nation-state. 

Rokkan’s  model  is — in  retrospect — less  a  model 
and  more  a  typology  since  the  classification  is  con¬ 
ducive  to  certain  types  (or  paths)  of  democratiza¬ 
tion  that  are  more  favorable  than  others  (earlier 
fully  accomplished  democratization).  It  enables  the 
researcher  to  examine  both  the  similarities  and  the 
differences  in  the  development  of  political  systems 
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(in  Europe).  In  addition,  it  was  the  basis  for  under¬ 
standing  not  only  how  political  systems  emerged 
but,  foremost,  how  they  shaped  the  democratic 
state  in  the  early  20th  century.  Although  Rokkan’s 
work  is  widely  acknowledged,  it  is  nowadays  more 
often  politely  referred  to  than  actually  used. 

Another  attempt  to  develop  a  diachronical 
typology  of  political  systems  stems  from  Finer 
(1997).  His  attempt  focused  on  regime  types  that 
identify  over  time  the  organization  of  political 
authority  (or  who  governs).  Finer  listed  four 
“pure”  types:  palace,  forum,  nobility,  and  the 
church,  representing  the  type  of  rule.  In  addition, 
the  four  types  could  well  develop  as  hybrids.  The 
resulting  10  cells  serve  as  an  empirical  grid  to  be 
filled  by  real-world  cases;  for  example,  the  pure 
church  category  only  concerns  two  cases:  the 
Vatican  and  Tibet,  whereas  forum  includes  all 
modern,  secular  democracies.  However  impressive 
Finer’s  book  is,  the  typology  is  purely  descriptive 
and  fails  to  pass  the  mentioned  methodological 
rule  of  thumb  on  typologies:  In  many  cases,  one 
may  question  whether  there  are  overlaps  in  the 
located  position  of  cases  or  whether  cells  remain 
(almost)  empty. 

The  typologies  presented  so  far  are  mixtures  of 
description  and  modest  explanation.  Second,  they 
represent  either  whole-system  approaches  or  sys¬ 
temic  (intrasystem)  typologies  (e.g.,  electoral  sys¬ 
tems  or  types  of  government).  Third,  it  must  be 
noted  that  many  of  these  typologies  of  political 
systems  are  often  forgotten  or  only  sparsely  used  at 
present.  Yet  there  is  an  exception  to  this:  In  1996 
(originally  1989),  Denis  and  Ian  Derbyshire  pub¬ 
lished  Political  Systems  of  the  World.  In  this 
monumental  and  ambitious  book,  the  authors  set 
out  to  cover  all  existing  political  systems  (N  =  192 
at  the  time),  classify  them  according  to  social  and 
economic  influences,  and  highlight  a  number  of 
particular  political  features  common  to  many  but 
not  all:  a  constitutional  design,  an  ideological  base, 
executive/legislative  relations,  an  electoral  system, 
and  political  parties.  In  other  words,  Derbyshire 
and  Derbyshire  follow  the  traditional  pattern  of 
descriptive  analysis  of  political  systems  (i.e.,  the 
unit  of  comparison  is  the  independent  state). 
Although  the  book  contains  some  cross-tables  that 
hint  at  more  elaboration,  most  of  the  (useful) 
information  is  based  on  one-dimensional  classifi¬ 
cations.  In  fact,  it  concerns  a  kind  of  political  map 


of  the  (contemporary)  world,  and  only  the  sections 
on  ideology  and  political  parties  can  be  considered 
analytically  novel. 

The  authors  argue  that,  in  addition  to  parties 
competing  for  office  or  representing  the  popula¬ 
tion  as  a  whole  (as  under  communist  rule),  there 
are  pressure  groups  and  interest  groups  that  are 
functionally  equivalents  of  parties  within  the 
political  system.  Therefore,  Derbyshire  and  Derby¬ 
shire  introduced  the  concept  of  corporatism  as  an 
alternative  mode  of  interest  representation  vis-a- 
vis  pluralism.  In  fact,  they  introduce  a  new  type  of 
state:  the  corporatist  state.  Apart  from  the  fact  that 
this  type  of  state  (as  far  as  it  [hasj  existed)  is  lim¬ 
ited  to  Western  Europe  with  some  similar  practices 
elsewhere,  it  is  questionable  whether  or  not  this 
type  of  polity  is  relevant  to  understanding  political 
systems  all  over  the  world  in  a  comparative 
perspective. 

The  ideological  base  of  a  political  system  is  con¬ 
sidered  by  Derbyshire  and  Derbyshire  (1996)  as  “a 
body  of  ideas  which  reflects  the  beliefs  and  values 
of  a  nation  and  its  political  system”  (p.  23).  Hence, 
in  their  view,  the  ideological  base  of  a  political 
system  (i.e.,  national  state)  refers  to  a  (often  under¬ 
lying)  shared  belief  system  on  dominating  values 
within  a  society  (including  religion).  As  the  authors 
admit,  the  labeling  of  nation-states  by  ideological 
base  is  bound  to  be  arbitrary,  and  hybrid  forms  are 
to  be  found  as  well.  Nevertheless,  it  is  interesting 
to  examine  how  this  dimension  is  distributed 
across  the  world. 

Derbyshire  and  Derbyshire  distinguish  eight  dif¬ 
ferent  ideological  bases  (see  Table  1).  In  fact,  some 
of  these  are  closer  to  being  a  type  of  state  than  a 
belief  system  per  se.  In  this  table,  the  ideological 
divisions  by  Derbyshire  and  Derbyshire  are  reported 
in  connection  with  socioeconomic  factors  and  indi¬ 
cators  of  quality  of  life.  The  idea  is  obviously  that 
there  is  a  relation  between  economic  development 
and  social  conditions,  on  the  one  hand,  and  the 
ideological  base  of  a  state,  on  the  other. 

As  is  known  from  the  literature,  the  relationship 
between  types  of  political  systems  and  social  and 
economic  development  is  not  straightforward,  nor 
is  it  that  statistically  strong  (Keman,  2002).  The 
only  solid  observation  is  that  fully  fledged  demo¬ 
cratic  systems  are  almost  always  among  the  pros¬ 
perous  countries  and  are  highly  developed  in 
socioeconomic  terms.  The  other  categories  may 
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Table  I  Distribution  of  Ideological  Base  of  the  State  and  Social  and  Economic  Development 

Gross  Domestic 


Ideological  Base 

No.  of  Political 
Systems 

Product  (% 
World  Share ) 

GDP  per 
Capita  ($) 

Literacy 
Rate  (%) 

Human  Rights 
Index 

Liberal 

73  (38.0%) 

86.1 

8.475 

88 

80 

democratic 

Emerging 

73  (38.0%) 

8.0 

1.490 

66 

63 

democratic 

Communist 

5  (2.6%) 

2.4 

810 

87 

25 

Nationalist 

8  (4.2%) 

0.6 

1.500 

57 

45 

Religious 

3  (1.6%) 

0.6 

1.200 

39 

25 

Authoritarian3 

20  (10.4%) 

0.7 

490 

60 

40 

Absolutist 

10  (5.2%) 

1.0 

8.235 

66 

44 

Source:  Derbyshire,  J.  D.,  &  Derbyshire,  I.  (1996).  Political  systems  of  the  tvorld  (pp.  25-26).  New  York:  St.  Martin’s  Press, 
a.  Distinction  military  and  authoritarian  collapsed  by  this  author. 


perhaps  tell  us  something  about  the  background  of 
the  form  of  government  that  has  emerged  around 
the  world,  but — given  the  broad  categories — very 
little  more.  In  addition,  the  defining  categories 
remain  vague  (what  distinguishes  nationalist  from 
authoritarian ?).  Probably,  there  are  various  cases 
that  are  hybrids  or  that  simply  overlap  (e.g.,  Kenya 
appears  to  be  authoritarian  and  nationalist, 
whereas  Tanzania  is  labeled  as  socialist  and 
nationalist  by  the  authors). 

In  particular,  the  distinction  between  liberal 
democracies  and  emergent  democracies  is  confus¬ 
ing  and  debatable.  First,  the  definition  tells  us  little 
about  what  the  ideological  differences  are  between 
the  two  (apart  from  the  fact  that  the  former  have 
had  an  undisturbed  history  [as  a  state]  in  terms  of 
coups  d’etat,  whereas  the  latter  emerged  in  the 
various  postwar  waves  of  democratization  and 
often  experienced  political  disruption).  Second, 
comparing  the  listings  of  Derbyshire  and  Derby¬ 
shire  (1996)  with  others,  it  appears  that  a  number 
of  cases  are  misplaced  as  regards  being  democratic, 
or  emerging  or  not.  All  in  all,  classifications  as 
developed  by  Derbyshire  and  Derbyshire  are  cer¬ 
tainly  useful  as  reference  sources  but  have  little 
added  value  in  terms  of  understanding  the  differ¬ 
ences  between  types  of  political  systems  and  are 
limited  in  their  descriptive-analytical  value  (e.g.,  as 
a  proto-theory  or  in  explaining  comparative  differ¬ 
ences).  In  some  scholars’  view,  a  typology  should 


add  to  existing  knowledge  rather  than  only  recycle 
existing  knowledge  regarding  the  similarities  of 
and  differences  between  political  systems.  Hence, 
typologies  are  useful  but  only  if  they  are  system¬ 
atic,  two-  or  multidimensional,  and  related  to  a 
theory-driven  research  question  (e.g.,  Lijphart’s 
typology  of  democratic  variations  and  Rokkan’s 
models  of  state  development). 

This  section  has  discussed  various  types  of 
typologies  as  they  have  been  developed  and  elabo¬ 
rated  in  comparative  political  science.  On  the  one 
hand,  pure  descriptive  classifications  and  typolo¬ 
gies  have  been  presented  (Finer;  Derbyshire  & 
Derbyshire).  On  the  other  hand,  a  number  of  ana¬ 
lytical  typologies  have  been  put  forward.  It  should 
be  noted  that  typologies  either  aspire  to  classify  all 
political  systems  of  the  world  (like  Finer)  or  con¬ 
fine  themselves  to  regions — such  as  Europe 
(Rokkan) — and  to  specific  regime  types — for 
example,  democracies  (Lijphart).  In  addition,  it 
was  argued  that  after  the  1970s,  another  kind  of 
typology  emerged  in  the  form  of  systemic  or  intra¬ 
system  classifications  (e.g.,  party  systems  and  type 
of  government;  see  Blondel,  1982). 

Hence,  the  present  strategy  for  developing  types 
of  political  systems  involves  going  from  descriptive 
to  analytical  approaches  and  from  focusing  on 
whole  systems  to  attributes  of  political  systems. 
The  focus  on  attributes  of  political  systems  can  be 
applied  to  comparative  politics  and  enhance  insight 
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for  the  student  regarding  both  the  cross-system 
and  intrasystem  variation  around  the  world  or 
within  certain  regions  or  regime  types.  One  useful 
way  of  developing  further  theory-driven  typologies 
is  to  derive  them  from  Easton’s  A  Systems  Analysis 
of  Political  Life. 

Variations  of  Political  Systems: 

Types  of  Political  Governance 

Systems  theory  is  derived  from  evolutionary  bio¬ 
logical  models  that  emphasize  the  interdependence 
of  the  elements  that  are  considered  to  be  essential 
for  the  survival  of  the  system.  Systems  theory 
applied  to  analyze  political  systems  is  in  the  form 
of  a  set  of  interactions  between  political  actors 
whose  behavior  is  structured  by  institutions  and  is 
embedded  in  a  wider  environment.  It  departs  from 
the  idea  that  these  institutionalized  interactions 
shape  the  dynamics  of  a  system  toward  more  or 
less  an  equilibrium  situation.  This  approach  was 
already  developed  in  the  19th  century  (e.g.,  Herbert 
Spencer)  and  has  been  applied  to  political  science 
since  the  late  1950s.  Easton  (1965,  1981)  has  been 
the  main  advocate  of  this  approach,  and  others 
like  Almond  and  Verba  applied  it  to  compare 
polities  (institutionalized  rule  of  a  society). 

Easton  conceived  of  politics  in  terms  of  its  rela¬ 
tionship  with  society  by  means  of  the  authoritative 
allocation  of  material  and  “immaterial”  for  a  soci¬ 
ety  (i.e.,  by  means  of  public  policy  formation).  The 
political  system  receives  inputs  from  society  in  terms 
of  demands  and  support,  such  as  policy  preferences 
of  organized  interests  and  political  parties,  and  elec¬ 
toral  support,  for  example,  for  parties  in  govern¬ 
ment  or  the  junta  in  power.  The  political  system 
converts  these  into  outputs  in  the  form  of  decisions 
and  enforceable  policies  that  feed  back  to  society.  If 
and  when  demand  and  support  are  (more  or  less)  in 
balance,  it  would  imply  an  equilibrium  situation — 
that  is,  political  stability.  Of  course,  this  process  is 
not  an  automatic  one  but  is  driven  by  the  types  of 
institutions  and  political  actors.  Gatekeepers,  like 
parties  or  interest  groups,  cumulate  various  prefer¬ 
ences  from  the  public  and  direct  the  system  of  gov¬ 
ernance  (i.e.,  the  conversion  process).  The  resulting 
outputs  (policy)  feed  back  into  support  and  demand 
for  the  actors  that  are  responsible  for  government. 

More  often  than  not,  the  Eastonian  approach 
of  systems  theory  has  been  criticized  for  being 


mechanistic,  teleological,  static,  and  not  fit  for 
empirical  analysis.  However,  the  practitioners  of 
comparative  politics  have  helped  refute  these 
criticisms.  First,  they  applied  systems  theory  to 
compare  countries  as  political  systems;  second,  by 
elaborating  Easton’s  model  by  specifying  political 
actors  and  institutions,  they  analyzed  the  “black 
box”  of  the  conversion  process  (or  governance); 
third,  with  the  data  available  now,  which  allow 
for  closer  scrutiny  while  comparing  political  sys¬ 
tems,  they  developed  whole-system  and  systemic 
classifications  of  political  systems. 

In  the  remainder  of  this  section,  the  differences 
and  similarities  between  political  systems  are 
elaborated  by  means  of  the  Eastonian  approach  to 
demonstrate  the  extent  to  which  types  of  political 
systems  do  vary.  This  allows  for  assessing  the 
extant  typologies.  Recall  that  a  distinction  was 
made  between  descriptive  and  analytical  typolo¬ 
gies,  on  the  one  hand,  and  between  whole-system 
comparisons  and  systemic  or  intrasystem  typolo¬ 
gies,  on  the  other.  The  Eastonian  approach  allows 
for  an  analytical  systemic  typology. 

In  what  follows,  the  focus  is  first  on  the  institu¬ 
tional  variation  of  polities  (in  this  case,  the  countries 
of  the  world)  in  terms  of  their  types  of  political 
rule,  both  representative  and  nonrepresentative. 

Institutional  Variations  of  Political  Systems 

As  we  have  observed,  there  have  been  and 
always  will  be  classifications  of  political  regimes. 
The  term  regime  simply  means  a  system  of  ruling 
society.  Numerous  indicators  are  used,  but  here 
we  shall  concentrate  on  the  institutional  configu¬ 
ration  of  government.  As  has  already  been  stated, 
political  institutions  are  basically  the  rules  of  the 
political  game,  and  these  rules  shape  and  direct 
the  behavior  of  the  players  involved  (e.g.,  parties, 
movements,  interest  groups,  bureaucrats,  people, 
and  also  the  members  of  government)  with 
respect  to  the  political  process  of  governing.  The 
main  types  of  representative  government  are 
introduced  in  the  section  that  follows.  In  addi¬ 
tion,  the  nondemocratic  types  are  also  discussed. 
This  enables  us  to  define  the  relationship  between 
government  as  the  conversion  agency  (making 
decisions  on  policies)  and  the  eventual  policy  per¬ 
formance  in  terms  of  its  societal  impact  (or  feed¬ 
back  process). 
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Representative  Government 

This  type  is  related  to  the  political-ideological 
basis  regarding  the  “liberal-democratic”  and 
“emerging  democratic”  types  of  governance  (see 
Table  1).  The  connection  lies  with  the  idea  of 
democracy.  Central  to  democracy  is  that  the  exec¬ 
utive  is  elected  indirectly  by  the  population  and, 
thus,  the  population  (or  more  precisely,  the  elec¬ 
torate),  through  its  representation  in  the  Assembly, 
directs  and  controls  government.  Hence,  the  way 
in  which  the  relationship  between  the  executive 
and  legislative  is  institutionalized  shapes  the  role 
and  position  of  government.  This  relationship  is 
constitutionally  driven  almost  everywhere  or  laid 
down  in  a  basic  law.  In  addition,  there  are  the 
“conventions”  that  shape  the  structure  of  repre¬ 
sentative  government.  Hence,  institutions  are 
determining  the  “room  for  manoeuvre”  for  gov¬ 
ernment  and  more  often  than  not,  will  be  condu¬ 
cive  to  its  “leadership  and  optimisation”  (see 
Blondel,  1982).  For  instance,  differences  in  the 
formal  relationship  between  the  executive  and  leg¬ 
islative,  on  the  one  hand,  and  the  type  of  electoral 
system,  on  the  other,  influence  the  composition  of 
government  and  its  policy-making  capacities.  Four 
types  of  government  can  be  distinguished  within 
the  category  of  representative  government: 

1.  presidential  government, 

2.  parliamentary  government, 

3.  dual-power  government,  and 

4.  single-party  government. 

According  to  Derbyshire  and  Derbyshire,  the 
cross-national  distribution  of  these  types  is  as 
shown  in  Table  2. 

Table  2  Types  of  Representative  Government 


The  parliamentary  type  of  representative  gov¬ 
ernment  is  quite  dominant  in  Europe.  Only  one 
out  of  five  government  types  outside  Europe  is 
parliamentary.  Presidential  government  is  the 
dominant  type  within  both  the  consolidated  and 
the  emergent  democracies  in  the  rest  of  the  demo¬ 
cratic  world.  Yet  since  the  late  1980s,  the  number 
of  dual-power  governments  has  doubled  (particu¬ 
larly  in  Central  and  Eastern  Europe).  This  typol¬ 
ogy  of  representative  government  is  driven  by  the 
constitutional  features  of  the  polity. 

Presidential  Government.  The  role  and  position 
of  the  head  of  state  is  crucial  for  each  type. 
Although  most  states  have  a  president  as  head  of 
state,  in  most  cases,  the  presidency  is  merely  sym¬ 
bolic,  and  its  main  function  is  to  represent  the 
sovereignty  of  the  nation  and  government.  In  this 
sense,  presidents  are  comparable  to  constitutional 
monarchs.  However,  in  other  systems — for  exam¬ 
ple,  the  United  States — the  president  has  been 
assigned  the  role  of  executive  (as  head  of  govern¬ 
ment),  and  the  related  power  is  separated  from  the 
legislative  powers  of  the  (elected)  assembly.  In 
most  cases,  the  executive  head  of  state  cannot  be 
removed  by  the  legislative  (he  or  she  appoints  the 
other  members  of  government)  and  derives  his  or 
her  legitimacy  from  popular  election  to  office  (i.e., 
elected  leadership). 

The  foremost  feature  is  that  of  the  one-person 
executive  who  dominates  the  politics  of  govern¬ 
ment.  He  or  she  can  call  on  the  public,  by  whom 
he  or  she  is  elected,  and  represents  national  gov¬ 
ernment  at  home  and  abroad.  This  feature  of  a 
one-person  executive  also  reinforces  the  position 
of  the  bureaucracy,  which,  at  least  in  many  cases, 
is  indirectly  subservient  to  the  president.  Of 
course,  this  differs  from  system  to  system,  but  if 


Region 

n 

Presidential 

Parliamentary 

Dual  Power 

Europe 

49  (31%) 

15  (30%) 

24  (48%) 

10  (22%) 

Americas 

43  (27%) 

29  (58%) 

12  (35%) 

2  (7%) 

Rest  of  the  world 

67  (42%) 

43  (64%) 

19  (28%) 

5  (8%) 

Totals 

159  (100%) 

87  (52%) 

55  (36%) 

17  (12%) 

Source:  Derbyshire,  J.  D.,  &  Derbyshire,  I.  (1996).  Political  systems  of  the  ivorld  (p.  40).  New  York:  St.  Martin’s  Press. 
Note:  Percentages  of  types  of  representative  government  are  row  totals. 
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well  organized,  the  president  can  direct  the  bureau¬ 
cracy  effectively  and  even  strengthen  the  coordina¬ 
tion  of  policy  implementation. 

Typical  for  presidentialism  is  the  executive  com¬ 
mand  of  the  head  of  state,  who  is  also  head  of 
government.  Conversely,  parliament  plays  an  indi¬ 
rect  role  by  exerting  legislative  control.  It  is  clear 
that  the  structure  of  presidential  government  is 
characterized  by  quite  a  few  lines  of  command. 
Conversely,  the  lines  of  control  are  either  recipro¬ 
cal  or  indirect.  This  implies  that  the  institutional 
configuration  of  presidential  government  can  be, 
and  often  is,  conducive  to  conflict  (e.g.,  think  of 
“divided  government” — i.e.,  the  political  “color” 
of  the  president  is  different  from  the  majority  in 
the  legislature)  and  considered  to  be  prone  to 
political  instability. 

Parliamentary  Government.  Parliamentary  gov¬ 
ernment  is  organically  linked  to  the  legislature,  or 
parliament.  The  government  emerges  from  the 
assembly  and  can  be  dismissed  by  a  vote  of  no  con¬ 
fidence  (and  often  also  needs  a  vote  of  investiture 
by  the  same  parliament).  At  the  same  time,  govern¬ 
ment  can — often  after  consultation  with  the  head  of 
state — dissolve  parliament  and  call  for  a  new  elec¬ 
tion.  Whereas  presidential  government  appears 
strong  and  relatively  independent,  parliamentary 
government  is  often  considered  to  be  weak  because 
of  the  mutual  dependence  of  the  executive  and  leg¬ 
islature.  In  other  words,  a  typical  consequence  of 
parliamentary  government  is  that  both  powers  are 
fused  and  bargaining  eventually  directs  the  outputs 
of  the  system.  Hence,  the  institutional  means  of 
command  and  control  are  distributed  across  the 
executive  and  legislative. 

Given  these  differences  from  presidentialism,  it 
is  usual  to  differentiate  between  parliamentary 
governments  by  means  of  their  conventional  shape 
and  working.  On  the  one  hand,  there  is  the  one- 
party  government,  where  the  majority  party  in 
parliament  forms  the  government  (e.g.,  in  the 
United  Kingdom  until  recently).  On  the  other 
hand,  there  is  the  coalition  government,  where  a 
combination  of  parties  forms  a  government  that  is 
supported  by  a  majority  in  parliament.  A  subtype 
is  the  minority  governments  (one-party  or  coali¬ 
tion,  occurring  often  in  Scandinavia).  In  short,  in 
parliamentary  types  of  government,  the  political 
representation  is  mediated  by  means  of  parties. 


All  in  all,  in  parliamentary  systems,  government 
is  structured  differently  from  presidential  govern¬ 
ment.  The  formal  powers  of  the  executive  and 
legislative  are  largely  fused;  consensus  formation 
between  parties  in  government  and  in  parliament 
is  a  prerequisite  to  make  policies,  and  therefore, 
negotiations  take  place  in  both  government  and 
parliament,  where,  in  the  final  instance,  parties  do 
really  matter. 

What  should  also  be  clear  is  that  the  electorate 
only  indirectly  influences  government.  Hence,  as  is 
often  argued,  parliamentary  systems  of  governance 
are  indeed  an  indirect  form  of  democracy.  Whereas 
in  presidential  systems  the  head  of  government  is 
primarily  directly  elected,  this  is  not  the  case  in 
parliamentary  systems.  To  push  this  argument  far¬ 
ther,  in  parliamentary  democracies,  parties  are  the 
key  factor  linking  the  electorate  to  parliamentary 
government. 

Dual-Power  Government.  This  type  of  govern¬ 
ment  has  often  been  considered  as  an  anomaly  or 
as  a  residual  category.  But,  as  a  consequence  of  the 
criticisms  raised  versus  both  presidential  and  par¬ 
liamentary  government,  dual-power  government  is 
taken  much  more  seriously  nowadays,  and  more 
attention  is  paid  to  this  type  of  government  (usu¬ 
ally  labeled  semipresidentialism )  as  an  alternative 
to  both  presidentialism  and  parliamentarism.  The 
majority  of  the  cases  can  be  found  in  Europe: 
France,  Estonia,  Lithuania,  Slovenia,  and,  to  some 
extent,  Portugal,  Finland,  the  Czech  Republic,  and 
Poland.  It  should  be  noted  that  most  of  these  states 
have  developed  this  dual  system  only  recently.  This 
may  well  be  an  expression  of  a  growing  dissatisfac¬ 
tion  (given  the  apparent  disadvantages)  with  both 
the  straightforward  presidential  and  parliamentary 
systems  of  democratic  and  constitutional  govern¬ 
ment.  The  third  type  of  representative  government 
is  presented  in  Figure  1. 

Dual-power  government  is  strongly  influenced 
by  constitutional  rules  (the  direct  arrows  in  Figure 
1)  and  the  multiple  relations  that  exist.  The  main 
disadvantage  is  the  delicate  interrelations  in  terms 
of  command  and  control  between  all  powers.  This 
may  well  imply  that  imbalances  and  disruption 
lead  to  stalemates,  gridlocks,  and,  thus,  govern¬ 
mental  instability.  However,  the  fact  that  the  elec¬ 
torate  has  a  more  direct  influence  on  both  the 
executive  and  the  legislative  than  in  the  other 
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Figure  I  Dual-Power  Government 

Note:  Dotted  arrows  represent  indirect  forms  of  command  or  control. 


types  of  representative  government  appears  to  be 
an  advantage. 

Nonrepresentative  Government 

These  types  of  government  overlap  to  a  large 
extent  with  the  ideological  bases  that  have  been 
distinguished  in  Table  2  and  that  do  not  belong  to 
the  democratic  family.  Here,  the  distinction  between 
one-actor  government  and  autocratic  governance  is 
made.  The  first  type  is  characterized  by  the  fact  that 
one  political  actor — be  it  a  party,  movement,  or  a 
(charismatic)  leader — rules  the  state  in  the  name  of 
the  whole  nation.  Often,  these  regimes  are  ideo¬ 
logically  inspired  by  communism  or  variations  of 
nationalism.  Autocratic  governments  can  also  be 
characterized  by  one  political  actor  who  rules  but 
not  in  the  name  of  the  collective  interest.  The  same 
line  of  reasoning  can  be  applied  to  military  and 
religious  rule.  The  polity  is  not  the  framework  of 
reference,  but  rather,  a  specific  interest  or  idea  pro¬ 
vides  an  ideological  basis  of  and  justification  for 
autocratic  rule.  Autocratic  government  concerns 
about  15.6%  of  all  regimes  in  the  world,  and 
together  with  the  one-actor  government,  this  covers 
24%  of  all  the  regimes  of  the  world. 

One-Actor  Government.  The  basic  feature  of  a 
one-actor  government  is  that  a  party  or  movement 
or,  eventually,  a  leader  represents  the  “will  of  the 
people”  and  governs  for  the  people  but  by  no 
means  is  the  state  governed  by  the  people.  Hence, 
insofar  as  there  exists  representation,  it  is  limited 
to  a  “top-bottom”  type  of  indirect  representation 


(see  Figure  2).  In  most  cases,  these  ideas  are  laid 
down  in  a  constitution  or  in  a  set  of  basic  laws  by 
one  party  or  movement.  In  addition,  the  constitu¬ 
tion  preamble  serves  to  justify  the  nature  of  the 
system  in  terms  of  ideology  (e.g.,  socialism  or 
patriotism)  and  the  need  for  a  unified  actor  steer¬ 
ing  the  state  and  society.  In  essence,  this  implies 
that  both  the  executive  and  the  legislature  (which 
almost  always  formally  exist  in  this  type  of  govern¬ 
ment)  are  formed  by  the  same  party  or  movement 
(hence,  there  is  no  competition  between  parties).  In 
fact,  government  is  formed  by  an  “elite”  that  is 
either  confined  to  party  membership  (like  in  com¬ 
munist  regimes)  or  to  having  a  function  in  the 
movement.  In  particular,  this  pattern  can  be 
observed  in  the  developing  and  postcolonial  world. 

It  would  appear  that  the  charismatic  style  of 
leadership,  the  avoidance  of  personality  cults,  and 
self-interested  behavior  of  the  elite  are  require¬ 
ments  for  smooth  change  and  adequate  societal 
performance  by  means  of  such  a  type  of  govern¬ 
ment.  If  these  requirements  are  not  met  or  external 
pressure  mounts,  a  fundamental  regime  change  is 
almost  inevitable.  Many  of  these  changes  (in  par¬ 
ticular,  in  the  past  decades)  are  going  in  two  direc¬ 
tions:  either  toward  emergent  democracy  or  toward 
an  autocratic  type  of  government. 

Autocratic  Government.  The  fundamental  features 
that  shape  this  category  of  governance  are  that  not 
only  is  it  not  representative  but  also  that  there  is  no 
explicit  link  with  society  at  large  and  hardly  any 
formal  or  constitutional  organization  of  the  polity. 
Instead,  the  ideological  basis  is  often  derived  from 
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Figure  2  One-Actor  Government 


absolutism  or  religion,  on  the  one  hand,  or  based 
on  military  rule  (operating  by  suspending  the  basic 
laws),  on  the  other. 

The  structure  of  this  form  of  government  is  that 
it  is  completely  self-organized,  and  command  and 
control  are  organized  by  force,  ruling  out  any  form 
of  political  and  civil  rights  for  the  population.  The 
most  extreme  form  is  tyranny,  more  often  than  not 
by  means  of  (personalized)  dictatorship  (e.g., 
Joseph  Stalin  [Iosif  Vissarionovic  DzugasviliJ ) . 
This  means  that  government  equals  the  “rule  of 
the  day”  and  that  force  and  fear  prevail  over  the 
well-being  of  the  nation  and  its  population.  At  the 
end  of  the  day,  neither  the  leadership  nor  its  sub¬ 
servient  elite  is  accountable  to  anybody  but  itself. 

Nonrepresentative  government  is  hardly  struc¬ 
tured  by  formal  rules.  Informal  rules  tend  to  be 
beneficial  and  shaped  by  the  leadership  and  its 
concomitant  elites  and  are  often  insufficient  for 
stable  and  enduring  government.  The  major  differ¬ 
ence  between  both  types  distinguished  here  is  the 
absence  of  abuse  and  outright  perversion  of  human 
rights  in  combination  with  attempts  to  enhance  the 
public  welfare  of  the  nation  by  one-actor  govern¬ 
ments.  Yet  in  reality,  it  is  difficult  to  draw  a  line 
between  the  two  types.  Often  the  one  develops  into 
the  other. 

Toward  a  Typology  of  Contemporary 
Forms  of  Governance 

The  different  forms  of  government  that  have 
been  discussed  are  all  derived  from  one  theoretical 


model:  Easton’s  systems  theory  of  political  life. 
The  focus  has  been  on  the  institutional  configura¬ 
tion  of  each  type:  presidential,  parliamentary,  dual 
power,  one-actor,  and  autocratic  government.  It 
has  been  shown  how  the  central  actors  represent¬ 
ing  the  systems  of  governance  are  interrelated 
(directly,  indirectly,  or  not)  through  the  institu¬ 
tions  that  are  typical  for  each  political  system.  In 
this  way,  one  can  fill  in  the  real  cases  that  belong 
to  each  type  as  well  as  consider  how  in  each  of 
these  political  systems,  the  central  actors  are 
ordered  in  terms  of  (supposed)  influence  and  their 
powers  to  command  or  control  (or  both). 

Of  course,  not  all  systems  within  each  category 
are  perfectly  the  same;  there  is  variation  in  presi¬ 
dential  systems  as  there  is  in  autocratic  systems.  In 
addition,  all  systems  are  more  or  less  in  flux: 
Institutional  arrangements  are  changed  and  some¬ 
times  completely  altered  (e.g.,  the  transition  of  the 
Fourth  French  Republic  to  the  Fifth  in  1958  or  the 
degradation  of  Kenya  from  a  one-party  state  to  an 
autocratic  system  in  the  1980s).  Finally,  there  are 
cases  that  tend  to  be  hybrids — that  is,  some  fea¬ 
tures  of  two  of  the  main  types  appear  in  one  sys¬ 
tem:  Finland,  for  example,  showed  strong  features 
of  presidentialism  until  the  1980s,  but  at  the  same 
time,  it  could  well  be  considered  to  have  a  parlia¬ 
mentary  type  of  representative  government. 
Finally,  it  is  fair  to  say  that  in  reality,  it  is  a  thin 
line  that  separates  autocratic  systems  from  one- 
party  systems  (think  of  the  absolutist  kingdoms 
that  exist  in  the  Arabic  world). 

At  the  end  of  the  day,  it  is  up  to  the  comparative 
political  scientist  to  make  a  decision  on  the  basis  of 
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evidence,  on  the  one  hand,  and  depending  on  the 
research  question  asked,  on  the  other.  This  could 
well  mean  that  a  specific  typology  is  developed,  for 
instance,  to  describe  how  democratic  a  political 
system  is  or  how  well  different  systems  perform 
(Keman,  2002).  Developing  typologies  of  political 
systems  can  help  answer  such  questions  and 
enhance  comparative  analysis. 

Conclusion 

The  different  types  of  political  systems  have  been 
discussed  throughout  history,  and  many  typologies 
have  been  developed.  The  construction  of  a  typol¬ 
ogy  is  more  challenging  if  the  researcher  not  only 
wishes  to  use  it  as  a  descriptive  tool  to  systemati¬ 
cally  reduce  the  comparative  complexities  but  also 
aims  to  arrive  at  an  analytically  driven  typology. 
As  a  methodological  rule  of  thumb,  this  entry 
emphasized  that  a  proper  typology  of  political  sys¬ 
tems  should  contain  those  cases  that  not  only 
belong  in  one  cell  but  are  also  empirically  mutually 
exclusive  across  cells.  In  addition,  this  entry  distin¬ 
guished  between  typologies  that  are  whole-system 
oriented  or  intrasystem  focused.  Whole-system 
typologies  of  political  systems  had  been  quite  fre¬ 
quent  up  to  the  1970s,  whereas  the  intrasystem 
ones  came  more  into  use  after  World  War  II. 

The  variation  in  both  descriptive  and  analytical 
typologies  of  political  systems  was  also  treated.  In 
addition,  this  entry  also  discussed  that  types  of 
political  systems  may  well  be  enhanced  on  the  basis 
of  Easton’s  idea  of  systems  analysis  of  political  life. 
To  this  end,  systems  of  representative  and  nonrep¬ 
resentative  government  were  discussed  as  an  exam¬ 
ple  of  constructing  a  typology  of  political  systems 
in  terms  of  their  institutional  design.  This  resulted 
in  an  analytical  typology  that  is  neither  static  nor 
teleological  and  allows  for  interpreting  the  process 
of  government  as  the  crucial  component  of  any 
political  system  together  with  its  systemic  features. 

Hans  Keman 
Vrije  Universiteit  Amsterdam 
Amsterdam,  Netherlands 
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Political  Theory 


Theory  is  defined  as  a  set  of  propositions  that  is 
internally  consistent  and  based  on  a  certain  set  of 
axioms  and  assumptions.  Theory  must  be  logi¬ 
cally  derivable  in  all  cases.  It  must  be  politically 
persuasive,  especially  when  it  deals  with  norms.  It 
must  be  empirically  verifiable  when  it  deals  with 
reality.  Therefore,  political  theory  is  defined  as  a 
set  of  propositions  about  “who  gets  what,  when, 
how”  (Harold  Lasswell)  and  “the  authoritative 
allocation  of  values  in  society”  (David  Easton). 
Thus  defined,  political  theory  covers  a  very  wide 
range  of  subjects  in  the  form  of  propositions. 

In  what  follows,  first  the  origins  and  types  of 
political  theory  are  examined.  Among  the  types  of 
political  theory,  this  entry  takes  into  account  the 
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following  three:  (1)  classical  philosophy,  (2)  empir¬ 
ical  political  theory,  and  (3)  formal  political  theory. 
Next,  this  entry  examines  schools  of  political  the¬ 
ory  and  their  evaluation.  Under  this  heading, 
10  schools  are  evaluated:  4  schools  of  normative 
political  theory  (conflict,  shared  values,  exchange, 
and  coordination)  and  6  schools  of  empirical  polit¬ 
ical  theory  (systems  theory,  behavioralism,  rational 
choice  theory,  institutionalism,  neuroscience,  and 
globalism).  Third,  this  entry  discusses  the  need  to 
enhance  conversations  between  normative  political 
theory  and  empirical  political  theory,  or  between 
“ought”  and  “is.”  In  other  words,  normative 
political  theory  should  talk  more  about  the  plausi¬ 
bility,  feasibility,  and  self-sustainability  of  the  nor¬ 
mative  order  it  advances,  and  empirical  political 
theory  should  discuss  normative  implications  more 
seriously.  With  the  aim  of  providing  for  more  con¬ 
versations  and  interactions  within  political  theory, 
concise  and  concrete  illustrations  of  such  proposed 
conversations  and  interactions  are  given. 

Origins  and  Types  of  Political  Theory 

Although  political  theory  as  a  part  of  the  modern 
discipline  of  political  science  emerged  only  in  the 
20th  century,  the  origins  of  political  theory  are 
unquestionably  in  ancient  thought — whether  in 
the  philosophy  of  Greece,  Mesopotamia,  Egypt, 
India,  or  China.  One  can  cite  passages  from 
Aristotle,  Kautilya,  or  Confucius  easily  to  demon¬ 
strate  that  they  are  political  scientists  indeed  and 
that  political  science,  although  modern,  has  very 
ancient  roots.  One  of  the  major  differences  the 
discipline  of  political  science  can  claim  to  have  vis- 
a-vis  other  social  science  disciplines  such  as  eco¬ 
nomics  and  sociology  is  that  the  ancient  versions 
of  political  science,  such  as  those  of  the  three  phi¬ 
losophers  mentioned  above,  retain  much  power 
and  relevance  to  the  current  reality  in  politics. 
Aristotle  is  often  mentioned  in  contemporary  writ¬ 
ings  on  politics  and  political  science.  Kautilya  is 
referred  to  in  discussions  on  rulers’  state  craft  both 
at  home  and  vis-a-vis  their  rivals.  Confucius  con¬ 
tinues  to  be  taken  up  as  providing  possible  ver¬ 
sions  of  authoritarian  politics.  It  is  rare  for 
Aristotle  to  be  discussed  in  relation  to  economics, 
although  his  ideas  on  this  subject  are  significant 
for  the  history  of  economics.  All  these  disciplines 
are  relatively  young,  since  they  have  grown  as 


modern  social  science  disciplines  in  the  West,  espe¬ 
cially  in  the  20th  century.  Yet  one  can  argue  that 
political  science  has  retained  its  ancient  origins 
even  at  the  dawn  of  the  2 1st  century.  Thus,  its 
development  is  very  complex. 

In  Western  Europe,  the  major  distinction 
between  the  sacred  and  the  secular  was  made 
gradually  but  quite  steadily  during  the  Enlighten¬ 
ment  and  the  Reformation,  and  it  diffused  to 
European  settlements  and  later  to  the  rest  of  the 
world  as  well,  at  least  superficially;  secularism  is 
therefore  closely  related  to  Western  cultures. 
Religion  and  politics  are  said  to  have  been  distin¬ 
guished  in  the  West  since  the  modern  age.  The 
same  can  be  said  about  the  relationship  between 
religion  and  science.  William  of  Ockham  gave  an 
early  epistemological  foundation  for  what  would 
be  called  modern  science  by  making  a  clear  distinc¬ 
tion  between  realism  and  nominalism.  Realism 
refers  to  the  school  of  thought  that  believes  that 
God  does  exist  in  reality  and  that  reality  was  con¬ 
ceived  as  the  basis  of  that  knowledge;  nominalism 
refers  to  the  view  that  God  exists  insofar  as  the 
concept  of  God  is  imagined.  With  this  stance,  sci¬ 
ence  was  able  to  separate  itself  from  the  cosmos  of 
the  sacred.  With  this  separation,  modern  science 
was  able  to  make  spectacular  progress  in  the  West. 

Needless  to  say,  the  separation  between  God 
and  science  and  between  God  and  politics  did  not 
come  about  so  neatly  or  once  and  for  all;  rather, 
the  picture  is  complex.  The  separation  between 
religion  and  politics  has  been  tenuous  at  best.  At 
any  rate,  for  our  purposes,  the  separation  of  poli¬ 
tics  from  God  was  a  step  forward  in  distinguishing 
political  theory  from  philosophy.  Also  the  separa¬ 
tion  of  science  from  God  was  a  step  forward  in 
distinguishing  political  theory  from  political  phi¬ 
losophy.  This  distinction  tries  to  separate  God’s 
judgment  from  that  of  scientists  or  political  leaders. 

At  the  crux  of  political  theory  is  the  mixture  of 
the  normative  and  the  empirical — that  is,  what 
ought  to  be  versus  what  is.  The  normative  has  to 
do  with  the  judgment  by  which  a  verdict  on  justice 
is  determined.  While  in  medieval  times,  such  judg¬ 
ments  came  from  the  Catholic  Church  or,  in  some 
cases,  duels  were  fought  between  opposing  parties, 
in  modern  times,  they  come  from  the  courts  where 
secular  matters  are  concerned.  Courts  have  been 
conceived  like  God.  The  concept  of  the  empirical 
refers  to  something  that  can  be  experienced  or 
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tested  in  the  daily  lives  of  people.  It  was  not  until 
modern  times,  with  the  increasingly  sharp  distinc¬ 
tion  between  the  sacred  and  the  secular — which 
made  “reality  checks”  possible — that  empirical 
testing  became  common. 

Political  theory  comprises  all  this  under  one 
umbrella,  including  both  classical  philosophy  and 
empirical  political  theory.  To  complicate  matters 
further,  political  theory  contains  within  its  territory 
what  is  called  formal  political  theory.  Classical  phi¬ 
losophy  refers  to  prescientific  and  pre-empiricist 
statements  about  what  justice  is,  how  it  should  be 
achieved,  and  how  it  should  be  conceptualized. 
Empirical  political  theory  refers  to  statements  made 
from  the  scientific  and  empiricist  viewpoints  on 
how  politics  is  played  out.  Formal  political  theory 
refers  to  logically  and/or  mathematically  derivable 
statements  that  may  or  may  not  be  amenable  to 
empirical  testing.  Dealing  with  the  normative  and 
the  empirical  is  inherently  not  easy.  Compounding 
the  problem  is  the  recent  tendency  toward  overspe¬ 
cialization  and  mutual  isolation,  rather  than  mutual 
engagement,  among  political  scientists  dealing  with 
the  normative  and  the  empirical.  Mutual  isolation 
between  classical  normative  theory  and  empirical 
theory  has  gone  sometimes  too  far  as  their  border¬ 
lines  are  not  always  very  clear. 

In  practice,  normative  political  theory  and 
empirical  political  theory  are  far  apart  for  several 
reasons.  First,  their  knowledge  base  differs.  Classical 
political  theorists  often  are  concerned  with  philoso¬ 
phy,  theology,  and  intellectual  history,  whereas 
empirical  political  theorists  are  often  interested  in 
other  empirical  social  sciences  such  as  economics, 
sociology,  and  social  psychology  or  in  other  applied 
empirical  sciences  such  as  neuroscience  in  politics. 
Second,  their  methods  of  training  are  very  different. 
Classical  political  theory  focuses  on  text  critique 
and  robust  argumentation.  All  study  is  based  on 
careful  reading  and  argumentation.  Empirical  polit¬ 
ical  theory  focuses  on  hypothesis  testing  conducted 
according  to  positivistic  practices  or  a  systematic 
reality  check.  Yet  these  differences  are  not  strong 
enough  to  undermine  their  disciplinary  identity  as 
political  science  and  to  split  it  into  two  or  more 
subdisciplines.  Although  no  solid  and  systematic 
evidence  exists,  it  looks  as  if  the  shared  fascination 
with  how  power  is  built  and  exercised  seems  to  give 
many  political  scientists  identity  and  solidarity  to 
band  together. 


The  types  of  political  theory — classical  philoso¬ 
phy,  empirical  political  theory,  and  formal  politi¬ 
cal  theory — are  discussed  in  turn  in  the  remainder 
of  this  section. 

Classical  Philosophy 

Classical  philosophy  comprises  almost  every¬ 
thing  from  the  normative  to  the  empirical,  the  pre¬ 
scriptive,  and  their  mixture.  In  The  Great  Learning 
(Daxue),  one  of  the  famous  Confucian  teachers 
instructs  his  disciples  to  do  the  following:  “Tackling 
things,  seeking  truth,  nurturing  yourself  (morally), 
sorting  the  family,  governing  the  state,  and  pacify¬ 
ing  the  world  under  heaven.  ”  This  sentence  instructs 
the  disciples  that  to  stand  above  people  they  must 
start  studying  many  things  and  knowing  a  lot. 
Then,  they  must  discipline  themselves  morally. 
That  leads  them  to  sort  out  things  in  their  family, 
and  this  process  becomes  the  basis  of  governing  the 
country.  Only  by  going  through  all  this  can  disci¬ 
ples  envisage  “pacifying  the  world  under  heaven.” 

This  philosophical  statement  is  described  in  a 
number  of  ways:  (a)  the  authoritarian  conception 
of  governing  by  the  sage,  (b)  the  moralistic  concep¬ 
tion  of  governing,  and  (c)  the  familial  conception 
of  the  state.  The  way  in  which  the  argument  is 
constructed  is  bottom  up.  But  the  argument  itself 
is  replete  with  authoritarian,  paternalistic,  and 
personalistic  overtones. 

The  no  less  famous  classical  philosophy  of  Niccolo 
Machiavelli,  author  of  The  Prince,  is  also  prescrip¬ 
tive  and  empirical.  He  warns  his  readers  that  politics 
is  determined  by  virtue  and  fortune.  By  virtue  is 
meant  a  range  of  strengths  including  moral  strength. 
Such  an  exercise  of  strength  is  needed  to  deal  with 
the  effects  of  fortune.  This  classical  philosophy  was 
meant  to  teach  the  monarch  to  act  properly  and 
prudently.  Thus,  it  is  largely  prescriptive.  At  the 
same  time,  it  is  sometimes  speculated  that  since 
Machiavelli  was  republican,  his  preaching  was 
meant  to  subvert  monarchism  by  deliberately  encour¬ 
aging  the  monarch  to  act  most  “monarchically” — 
that  is,  always  assuming  one’s  superiority,  not  trust¬ 
ing  anyone  but  oneself,  and  acting  mostly  on  the 
basis  of  strength — thus  undermining  the  basis  of 
popular  support,  which  was  becoming  increasingly 
significant. 

Classical  philosophy  includes  the  Federalist 
Papers,  in  which  precursors  of  empirical  political 
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theory  are  often  found.  The  view  that  federalism  is 
a  viable  alternative  to  a  unitary  state  is  an  exam¬ 
ple.  This  proposition  has  generative  affinity  with 
the  work  of  William  Riker  (1964)  and  his  Rochester 
School  on  formal  political  theory.  Riker’s  brand  of 
formal  political  theory  on  democracy  and  demo¬ 
cratic  choice  has  some  affinity  with  both  classical 
philosophy  and  empirical  political  theory. 

Immanuel  Kant’s  famous  essay  Perpetual  Peace 
has  been  empirically  tested  lately,  with  quite  stim¬ 
ulating  debates  arising  from  such  efforts.  In  1795, 
Kant  argued  that  three  conditions  should  be  nur¬ 
tured  to  bring  about  eternal  peace.  First,  com¬ 
merce  should  be  invigorated,  with  free  passage  and 
free  trade  ensured  among  nations.  Second,  one 
should  encourage  the  republican  form  of  politics 
rather  than  the  monarchical  form,  to  make  war 
more  difficult.  Third,  international  institutions 
should  be  created  to  allow  for  the  expression  of 
voices  of  varying  assertions,  to  facilitate  discus¬ 
sion,  and  to  come  up  with  formulas  for  conflict 
resolution.  Kant’s  formulation  has  been  reformu¬ 
lated  in  the  tradition  of  empirical  political  theory 
by  Michael  Doyle,  Bruce  Russett,  and  others.  The 
first  is  called  liberal  peace.  The  second  is  called 
democratic  peace.  The  third  is  sometimes  called 
peace  by  consortium.  The  second  is  the  most  popu¬ 
lar,  and  U.S.  Presidents  Bill  Clinton  and  George  W. 
Bush  used  this  doctrine  to  justify  U.S.  military 
interventions.  Kant’s  affinity  with  empirical  politi¬ 
cal  theory  is  abundantly  clear,  even  if  it  is  not 
universally  accepted  as  such. 

Thomas  Hobbes  is  arguably  the  most  frequently 
mentioned  author  in  relation  to  the  state  in  the 
social  sciences,  along  with  Max  Weber  and  Karl 
Marx.  Facing  what  he  saw  as  anarchy  both  at 
home  and  abroad  in  16th-century  England,  Hobbes 
argued  that  absolutism  and  an  absolutist  state 
should  be  the  answer  to  these  anarchies — what 
people  wish  to  achieve  cannot  be  accomplished 
unless  anarchical  situations  are  overcome.  In  dis¬ 
cussions  of  failed  states  and  rogue  states,  therefore, 
Hobbes  is  one  of  the  philosophers  most  frequently 
referred  to.  Describing  and  analyzing  what  is  seen 
as  anarchy  in  places  and  time  points  such  as 
Cambodia  in  the  1980s  and  1990s,  Somalia  since 
the  early  1990s,  Sudan  in  the  1990s  and  2000s, 
Afghanistan  for  the  past  40  years,  the  Democratic 
Republic  of  Congo  for  most  of  the  1990s  and 
2000s,  Rwanda  in  the  1990s  and  2000s,  and  the 


West  Balkans  in  the  1990s  and  2000s,  many 
authors  point  to  the  need  to  establish  a  monopoly 
of  violence  and  the  legitimate  use  of  power  in  the 
initial  and  yet  critical  phase  of  state  building,  along 
with  the  concord  forged  with  the  international 
community.  A  similar  diagnosis  and  prescription 
are  offered  to  show  the  process  of  state  building 
that  may  evolve  from  a  democratic  spirit  and 
under  globalizing  circumstances  and  the  process  of 
absolutism  arising  from  claims  of  state  sover¬ 
eignty.  Needless  to  say,  the  yearning  for  state  sov¬ 
ereignty  cannot  be  suppressed  fully — one  form  of 
which  is  expressed  by  the  concept  of  “sovereign 
democracy”  coined  by  Vladislav  Surkov,  chief  of 
staff  to  former  Russian  President  Vladimir  Putin 
and  now  President  Medvedev.  Sovereign  democ¬ 
racy  implies  a  sovereign  state  whose  representative 
heads  are  chosen  democratically  but  that  disallows 
foreign  interference  from  abroad,  even  if  by  demo¬ 
cratic  means — for  example,  the  attempts  at  “col¬ 
ored  revolutions”  in  Ukraine,  Georgia,  and 
Kyrgyzstan.  Chinese  leaders  view  “peaceful 
change”  (beping  yanbian )  as  anathema  because  it 
means  foreign  interference  to  force  a  regime 
change  on  the  basis  of  human  rights  and  democ¬ 
racy  as  universally  shared  values.  Thus,  on  the 
anniversary  of  the  Tiananmen  massacre  (June  4, 
1989),  China  was  placed  under  high-level  alert.  In 
short,  Hobbes  is  just  as  relevant  now,  at  the  dawn 
of  the  21st  century,  as  in  16th-century  England. 

Empirical  Political  Theory 

Empirical  political  theory  aims  at  generating 
hypotheses  that  can  be  empirically  tested  and  that 
are  also  capable  of  generating  a  higher  level  set  of 
generalizations.  In  other  words,  empirical  political 
theory  places  utmost  importance  on  two  aspects  of 
research:  (1)  empirical  validation  (or  falsification) 
and  (2)  empirical  generalization  (or  theorization). 
Thus,  empirical  political  theory  first  proposes  an 
empirically  verifiable  hypothesis  and  then  tests  it. 
If  it  is  confirmed,  it  can  then  be  generalized.  A 
good  example  of  this  type  of  theory  is  voting  in 
U.S.  presidential  elections.  To  link  with  political 
theory,  the  characteristically  American  utilitarian 
model  of  electoral  victory  based  on  the  death  toll 
of  American  troops  in  combat  situations  and  on 
per  capita  net  income  level  change  over  the  preced¬ 
ing  year(s)  serves  as  an  example.  As  pointed  out  by 
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Douglas  Hibbs  (1989),  the  question  of  whether  the 
candidate  of  the  party  that  occupies  the  executive 
office  is  elected  in  the  presidential  elections  is  likely 
to  be  determined  by  the  combination  of  the  death 
toll  of  U.S.  troops  in  combat  situations  and  per 
capita  net  income  level  change  over  the  preceding 
year(s).  This  model  hypothesizes  that  voters  yearn 
for  peace  (i.e.,  having  no  Americans  killed  in  com¬ 
bat)  and  for  prosperity  (i.e.,  increasing  income).  If 
one  considers  the  2008  U.S.  presidential  race,  two 
Ss  were  important:  Operation  Surge  in  Iraq  and 
subprime  housing  loans.  Operation  Surge  reduced 
the  death  toll  of  American  troops  from  the  summer 
of  2007  through  Election  Day.  How  this  affected 
the  voter  equation  is  one  of  the  key  points.  The 
other  S  is  the  economic  setback  triggered  by  the 
subprime  housing  loans  crisis.  Not  only  were  stock 
prices  going  down,  but  the  U.S.  dollar  also  lost 
value.  Whether  the  government  could  prevent  a 
recession  by  stimulating  the  economy  through 
pumping  a  massive  amount  of  money  into  it  was 
also  a  key  issue  in  the  voter  equation.  Expecting 
the  economy  to  recover  seemed  premature.  This 
worked  against  the  Republican  presidential  candi¬ 
date,  John  McCain,  who  did  not  support  such  an 
economic  stimulus. 

A  number  of  studies  suggest  that  trust  and 
health  are  quite  significantly  related.  Those  who 
hold  concerns  about  social  institutions  such  as 
social  insurance  are  more  likely  to  report  bad 
health.  Similarly,  those  who  do  not  trust  others 
very  much  tend  to  report  their  own  bad  health. 
The  reasoning  is  as  follows:  Those  with  lower  ver¬ 
tical  trust,  that  is,  those  who  do  not  place  much 
confidence  in  social  institutions,  cannot  enjoy  the 
benefit  of  making  the  best  use  of  them,  one  of  the 
consequences  of  which  is  the  loss  of  health.  In  a 
similar  vein,  those  with  lower  horizontal  trust,  that 
is,  those  who  do  not  place  much  confidence  in 
other  persons,  cannot  enjoy  the  benefits  of  work¬ 
ing  together,  one  of  the  consequences  of  which  is 
the  loss  of  health.  Loss  of  health  is  measured  by 
self-reported  health  using  the  World  Health 
Organization’s  Quality  of  Life  questionnaire.  This 
proposition  is  interesting  from  a  public  policy  per¬ 
spective  in  that  keeping  public  confidence  in  social 
institutions  is  important  in  itself.  Otherwise,  the 
service  that  can  be  supplied  by  social  institutions 
cannot  be  fully  provided.  Thus,  from  a  more  con¬ 
ventional  public  policy  perspective,  public  policy 


performance  is  a  dependent  variable  and  is  to  be 
explained  by  a  number  of  factors.  However,  from 
the  version  of  public  policy  perspective,  the  depen¬ 
dent  variable  is  something  individuals  can  experi¬ 
ence  physically,  such  as  health,  or  emotionally, 
such  as  happiness  and  honor. 

The  electoral  system  can  be  either  divisive  or 
cohesive.  In  the  political  theory  of  representative 
democracy,  how  to  choose  electorates  is  of  key 
importance.  Representative  democracy  takes  into 
account  at  least  two  forms  of  justice.  “Representa¬ 
tive  democracy  requires  two  conditions  to  be  suc¬ 
cessful”:  (1)  fair  representation,  reflecting  electors’ 
preferences,  and  (2)  government  stability,  enabling 
government  to  execute  policy  pledges  to  the  elec¬ 
torate.  Two  major  systems  exist:  (1)  proportional 
representation ,  whereby  parliamentary  or  legisla¬ 
tive  seats  are  awarded  according  to  the  percentage 
of  votes  polled  by  a  party,  and  (2)  first  pass  the 
post ,  in  which  the  person  from  a  district  who 
receives  the  highest  number  of  votes  is  awarded  the 
seat;  this  method  awards  a  disproportionate  num¬ 
ber  of  seats  to  parties  that  get  a  larger  number  of 
votes  and  reduces  the  number  of  seats  awarded  to 
parties  with  a  smaller  share  of  votes.  The  propor¬ 
tional  representation  system  is  said  to  give  fair 
representation  but  not  regime  stability,  whereas  the 
first-past-the-post  method  is  said  to  give  moder¬ 
ately  unfair  representation  but  regime  stability.  The 
latter  is  widely  adopted  in  many  English-speaking 
countries,  such  as  the  United  States,  the  United 
Kingdom,  Australia,  Canada,  and,  until  recently, 
New  Zealand.  The  former  is  widely  adopted  by 
Continental  European  countries.  Lately,  the  mixed 
system  combining  both  methods  has  become  more 
popular,  especially  in  non-European  regions  such 
as  the  Asia-Pacific  region,  and  seems  to  provide  fair 
representation  and  regime  stability  to  a  reasonably 
satisfactory  degree. 

Proportional  representation  can  be  very  divi¬ 
sive,  as,  for  example,  in  Israel  or  in  Iraq.  The  state 
of  Israel  is  sometimes  said  to  have  committed  two 
institutional  mistakes  in  its  founding  days:  First,  it 
was  unable  to  promulgate  a  constitution,  largely 
because  of  the  fundamental  cleavages  manifested 
on  the  issue  of  religion  and  the  state,  and,  second, 
it  adopted  proportional  representation  as  the 
mode  of  election.  Parenthetically,  there  are  only 
three  states  in  the  world  that  do  not  have  a  consti¬ 
tution:  Israel,  the  United  Kingdom,  and  New 
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Zealand.  The  United  Kingdom  does  not  have  a 
written  constitution,  except  for  the  Magna  Carta, 
and  it  seems  that  New  Zealand  has  inherited  that 
tradition.  Interestingly,  the  Iraqi  constitution 
adopted  a  proportional  representation  electoral 
system,  whereby  ethno-religious  cleavages  have 
been  amplified  and  invigorated.  The  one-person- 
from-one-district,  or  first-pass-the-post,  system, 
common  in  the  United  States  and  the  United 
Kingdom,  tends  to  exaggerate  the  number  of  votes 
the  winning  party  gets  to  obtain  parliamentary 
seats.  In  other  words,  more  proportional  strength 
is  given  to  the  winning  party  so  that  government 
can  enjoy  at  least  a  minimally  stable  majority. 

Formal  Political  Theory 

Formal  political  theory  is  a  set  of  propositions 
that  are  logically  derived  from  a  set  of  assumptions 
about  politics.  A  formal  political  theory  of  elec¬ 
toral  democracy  was  first  formalized  by  Anthony 
Downs  (1957).  This  theory  relates  the  statistical 
distribution  of  electorates’  ideological  and  policy 
positions  on  a  number  of  dimensions  to  the  poli¬ 
cies  parties  make  to  capture  more  votes.  In  other 
words,  political  parties  shape  their  positions 
according  to  the  number  of  voters  who  are  likely 
to  vote  for  them.  Anthony  Downs  propounded  an 
economic  theory  of  democracy  that  has  become 
very  influential  for  those  political  scientists  who 
believe  that  political  science  should  be  able  to  gen¬ 
erate  theories  from  which  empirical  claims  about 
political  phenomena  can  be  deduced  (after  the 
theories  themselves  have  been  tested  as  described 
above).  Downs’s  key  insight  into  spatial  economics 
is  illustrated  by  his  example  of  gas  stations,  which 
he  explains  are  located  close  to  each  other  because 
the  spatial  distribution  of  consumers  of  gasoline  is 
that  of  a  statistically  normal  curve.  In  other  words, 
rather  than  opening  a  gas  station  miles  away  from 
another  gas  station,  one  might  as  well  open  it  close 
to  the  other  one,  with  the  chance  of  getting  many 
more  customers  for  both.  Downs  applied  this 
insight  to  the  dynamics  of  electoral  democracy. 
Suppose  there  are  two  major  parties,  one  right- 
wing  and  the  other  left-wing.  Further  suppose  that 
voters  who  support  extreme  views  are  fewer  than 
voters  who  support  moderate  views,  statistically 
speaking.  To  gain  more  votes,  candidates  will 
moderate  their  views,  whether  about  war  and 


peace,  bread  and  butter,  or  honor  and  humiliation. 
The  consequence  is  that  the  two  major  parties 
move  closer  to  each  other.  As  a  result,  such  parties 
start  to  look  alike.  Many  empirical  efforts  have 
been  made  to  validate  or  invalidate  empirically  this 
Downsian  theory  of  party  competition.  Thus,  for¬ 
mal  political  theory  has  been  quite  well  linked  in 
many  ways  to  empirical  political  theory.  By  using 
the  statistical  distribution  patterns  of  electorates  in 
a  multidimensional  space,  this  formal  political 
theory  shows  that  under  representative  democ¬ 
racy,  electorates  are  sovereign  whereas  candidates 
or  political  parties  are  the  subjects.  This  analysis  is 
one  example  of  how  formal  political  theory  has 
been  applied  to  explain  empirical  phenomena. 

Politics  is  played  out  most  commonly  in  and 
among  organizations.  How  people  react  to  the 
decline  of  organizational  life  is  one  of  the  key  ques¬ 
tions  in  politics.  Albert  Hirschman  (1970)  formu¬ 
lated  the  model  of  exit,  voice,  and  loyalty.  The 
binary  choice  is  between  loyalty  and  exit.  These 
binary  choices  are  most  commonly  observed  in  the 
market:  One’s  choice  is  between  purchase  and 
nonpurchase.  In  organizational  life,  if  one  is  loyal, 
one  will  remain  with  the  organization  in  spite  of 
decreasing  rewards.  The  exit  option  is  to  get  out 
without  procrastination.  Between  the  two  options 
is  a  third,  more  common  one:  raising  one’s  voice  to 
ask  others  to  join  forces  in  improving  organiza¬ 
tional  life.  This  is  more  common  in  politics.  But 
when  one  starts  thinking  about  the  major  conse¬ 
quences  of  each  option,  it  is  much  more  complex. 
Consider  an  exit-prone  country  whose  income  has 
not  increased  dramatically  in  a  long  time,  such  as 
the  Philippines.  Filipinos  earn  a  substantial  amount 
of  income  through  emigration — by  sending  doc¬ 
tors  to  the  United  States  and  maids  to  the  Gulf 
countries,  as  they  bring  back  a  substantial  amount 
of  their  earnings  to  their  country.  One  can  specu¬ 
late  that  a  consequence  of  a  large  migrant  popula¬ 
tion  is  the  lack  of  momentum  for  endogenous 
development.  In  contrast,  in  a  loyalty-prone  nation 
such  as  Japan,  not  leaving  the  country  is  a  com¬ 
mon  response  to  the  decline  of  organizational  life, 
so  migration  rates  are  low.  At  the  height  of  orga¬ 
nizational  decline,  as  conditions  become  intolera¬ 
ble  and  such  loyalty  fails  to  attract  attention  or 
admiration,  loyalty  may  take  on  the  character  of 
voice.  This  is  a  formal  political  theory,  but  it  could 
be  an  empirical  political  theory  as  well.  The  theory 
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of  exit,  voice,  and  loyalty  touches  on  complex 
manifestations  of  organizational  life  and  varied 
options  of  human  endeavor  to  improve  it.  In  other 
words,  the  exit  option  is  based  on  conflict,  the 
voice  option  is  based  on  coordination,  and  the 
loyalty  option  is  based  on  loyalty. 

Other  than  classical  philosophy,  empirical  polit¬ 
ical  theory,  and  formal  political  theory,  there  are 
two  major  genres  that  can  be  sometimes  treated 
under  the  umbrella  of  political  theory:  epistemol¬ 
ogy  and  methodology.  Epistemology  refers  to  the 
study  of  how  human  beings  recognize  what  they 
see  and  hear  as  knowledge.  Rene  Descartes,  a 
French  philosopher,  most  famously  in  Le  Discours 
de  la  Methode,  laid  down  what  might  be  called  the 
modern  positivistic  method.  It  is  a  set  of  advice 
and  instructions  that  would  be  helpful  to  obtain  a 
clearer  understanding  of  what  one  observes. 
Methodology  refers  to  the  study  of  various  instru¬ 
ments  through  which  reality  can  be  observed  and 
analyzed  effectively. 

Schools  of  Political  Theory 

Schools  of  political  theory  are  sometimes  messy  in 
part  because  empirical  political  theory  has  grown, 
at  least  initially,  out  of  other  disciplines  such  as 
psychology,  psychiatry,  sociology,  economics,  and 
anthropology.  One  of  the  direct  origins  of  empiri¬ 
cal  political  theory  is  to  be  found  in  the  deep  trans¬ 
formations  and  the  new  needs  following  World 
War  II.  The  application  of  methodologies  and 
concepts  from  these  disciplines  to  political  science 
was  first  done  by  Samuel  Stouffer  and  Harold 
Lasswell.  For  empirical  political  theorists,  histori¬ 
cal  and  institutional  descriptions  were  the  only 
methods  used  in  their  research.  With  the  new 
methodologies  and  concepts,  empirical  political 
theory  expanded  its  scope  dramatically.  Observing, 
measuring,  and  assessing  in  a  generalizable  fashion 
became  conventions  in  political  science,  which 
allowed  schools  of  empirical  political  theory  to 
proliferate.  Prior  to  World  War  II,  the  discipline  of 
political  science  was  concerned  mostly  with  consti¬ 
tutions  and  institutions,  on  the  one  hand,  and 
political  philosophy,  on  the  other.  World  War  II 
was  also  a  catalyst  for  empirical  political  theory, 
because  governments  were  interested  in  measuring 
and  assessing  the  morale  of  their  troops  and  the 
effectiveness  of  propaganda  and  of  military  actions. 


Stouffer  and  colleagues  conducted  a  morale  study 
that  contributed  immensely  to  the  development  of 
survey  research.  Similarly,  Harold  Fasswell,  Ithiel 
de  Sola  Pool,  and  colleagues  advanced  a  propa¬ 
ganda  study  that  contributed  immensely  to  the 
development  of  intelligence  analysis.  Robert 
MacNamara  and  colleagues  developed  a  bombing 
effects  study  that  contributed  immensely  to  the 
development  of  strategic  analysis  of  costs  and  ben¬ 
efits  associated  with  strategic  options.  Another 
example  is  the  military  occupation  study  by  Ruth 
Benedict,  which  was  instituted  through  a  psycho- 
cultural  analysis  of  the  Japanese  people.  It  took 
more  than  two  decades  after  World  War  II  for 
political  science  to  come  into  its  own  with  empiri¬ 
cal  political  theory. 

With  regard  to  normative  political  theory, 
schools  are  commonly  linked  to  great  philosophers 
like  Aristotle,  Machiavelli,  John  Focke,  Karl  Marx, 
Michel  Foucault,  and  Jurgen  Habermas.  But  the 
increased  interactions  with  other  disciplines  and 
related  methodologies  and  concepts  have  also 
helped  advance  schools  of  normative  political  the¬ 
ory.  One  of  the  schemes  adopted  here  is  that  of 
Russell  Hardin,  which  distinguishes  four  schools 
of  normative  political  theory. 

Four  Schools  of  Normative  Political  Theory 

Hardin  classified  normative  political  theory  in 
terms  of  theory  based  on  conflict,  on  shared  val¬ 
ues,  on  exchange,  and  on  coordination. 

Conflict 

Normative  political  theory  focusing  on  conflict 
of  interest  includes  the  work  of  Carl  Schmitt 
(1922/1985).  Schmitt  defines  politics  as  a  friend- 
foe  relationship  within  and  across  nations.  His 
theory  is  commonly  categorized  as  ultra-conserva¬ 
tive  and  sometimes  fascist.  Its  explanatory  capac¬ 
ity  is  high  under  conflictual  situations  such  as 
wartime  but  not  in  more  peaceful  contexts. 
Although  it  is  not  necessarily  categorized  as  work 
in  the  area  of  normative  political  theory,  Carl  von 
Clausewitz’s  On  War  has  a  strong  normative  affin¬ 
ity  with  the  notion  of  the  friend-foe  relationship. 
Clausewitz  defines  war  as  no  more  than  the  con¬ 
tinuation  of  politics  by  other  means.  In  a  similar 
vein,  Field  Marshall  Boris  Shaposhnikov  defines 
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peace  as  no  more  than  the  continuation  of  war  by 
other  means.  The  fact  that  both  Clausewitz  and 
Shaposhnikov  were  military  officers  may  mean 
that  their  theories  may  be  regarded  not  as  norma¬ 
tive  political  theory  but  as  a  technical  guide  for 
action,  but  the  simplicity  and  clarity  of  their  works 
may  appeal  to  those  studying  war  and  conflict. 
Turning  to  a  Marxist  work  that  tends  to  focus  on 
class  conflict,  Barrington  Moore’s  (1966/1993) 
book,  Social  Origins  of  Dictatorship  and  Democ¬ 
racy,  can  be  regarded  as  normative  political  theory 
focusing  on  class  conflict.  His  argument  is  that 
agriculture  is  pivotal.  The  rise  of  democracy,  fas¬ 
cism,  and  communism  in  the  20th  century  may  be 
explained  by  studying  the  evolution  of  agriculture: 
Commercialization  of  agriculture  led  to  its  advance¬ 
ment,  as  in  England  and  France;  agriculture  stag¬ 
nated  due  to  the  indifference  of  the  land-owning 
class,  as  in  Germany,  Japan,  and  Italy;  and  agricul¬ 
ture  was  overexploited  by  state-led  capitalism,  as 
in  Russia  and  China — which  corresponded  to  the 
advent  of  democracy,  fascism,  and  communism  in 
these  countries,  respectively. 

Shared  Values 

Normative  political  theory  focusing  on  shared 
values  was  dominant  in  the  20th  century,  especially 
after  the  Cold  War.  The  ascendancy  of  shared  val¬ 
ues  as  a  normative  political  theory  has  much  to  do 
with  the  rise  and  spread  of  liberal  democracy  since 
the  past  century.  Liberalism  is  based  often  on  the 
utilitarian  calculation  of  free  individuals,  as  John 
Rawls  argued  (1971).  Democracy  is  based  on  the 
aggregation  of  the  preferences  of  citizens,  as 
described  by  Downs  (1957).  In  a  sense,  liberal 
democracy  demands  a  regime  in  which  shared 
values  are  key.  As  long  as  liberal  democracy  is  pre¬ 
mised,  the  type  of  normative  political  theory  focus¬ 
ing  on  shared  values  flourishes.  Even  the  latest 
definition  of  politics  by  Robert  Goodin  (2009),  as 
the  constrained  use  of  social  power,  reflects  this.  It 
is  important  to  note  that  liberalism  does  not  require 
either  the  knowledge  or  the  sharing  of  values  of 
other  individuals.  Downs’s  An  Economic  Theory 
of  Democracy  is  a  formal  political  theory  with  nor¬ 
mative  implications.  Values  are  often  illustrated  by 
the  left-right  ideology.  In  a  two-party  system,  the 
two  parties  tend  to  move  toward  the  center  from 
both  directions,  that  is,  from  left  to  center  and 


from  right  to  center.  Moderate  or  “centrist”  citizens 
share  values,  whereas  extreme  or  fringe  citizens 
have  few  fellows.  To  win  votes,  the  two  parties 
target  the  numerically  large  central  point.  This  leads 
the  parties  to  adjust  their  ideological  and  policy 
positions  and  to  compete  to  attract  the  large  num¬ 
ber  of  citizens  located  at  the  center.  The  explana¬ 
tory  capacity  of  normative  political  theory  on  the 
basis  of  shared  values  is  high,  especially  when  the 
tide  of  globalization  weakens  the  intermediate  and 
high-level  organizations  within  the  national  body 
politic,  so  that  individual  citizens  become  more 
important  than  ever  before. 

Exchange 

Normative  political  theory  based  on  exchange  is 
represented  by  Adam  Smith.  It  is  well  known  that  of 
Smith’s  two  major  works,  The  Theory  of  Moral 
Sentiments  and  The  Wealth  of  Nations,  the  former 
explains  the  sources  of  the  human  ability  to  make 
moral  judgments.  His  key  concept  is  sympathy, 
whereby  the  act  of  observing  others  makes  people 
aware  of  others’  behavior  and  the  morality  of  their 
own  behavior.  Without  sympathy  in  interpersonal 
relations,  exchange  loses  its  solid  base.  Even  when  it 
is  writ  large  in  national  and  global  markets,  exchange 
functions  well  only  if  it  is  grounded  in  sympathy  in 
social  relations.  Sympathy  is  sometimes  called  social 
capital  by  authors  such  as  James  Coleman  and 
Robert  Putnam,  among  others.  It  is  not  necessary  to 
note  that  human  life  cannot  function  sufficiently 
well  if  it  is  equipped  only  with  exchange  instruments 
and  mechanisms.  As  long  as  the  system  of  numerous 
exchange  relationships  called  markets  functions 
well,  those  normative  political  theories  can  be  evalu¬ 
ated  highly.  More  frequently,  political  life  calls  for 
schemes  and  instruments  that  go  beyond  exchange. 
To  the  extent  that  exchange  helps  resolve  conflict, 
normative  political  theory  based  on  exchange  does 
have  good  explanatory  capacity. 

Coordination 

It  may  be  surprising  to  find  that  the  notion  of 
coordination  has  not  played  a  major  role  in  the 
development  of  normative  political  theory.  It  is 
natural  that  normative  political  theory  based  on 
conflict  abounds,  given  that  all  politics  is  regarded 
as  dealing  with  conflict-of-interest  situations  and 
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friend-foe  relations.  It  is  also  natural  that  norma¬ 
tive  political  theory  based  on  exchange  abounds 
because  conflict  resolution  can  be  achieved  often 
by  making  use  of  exchange  on  a  small  to  large 
scale.  Hardin  (2009)  points  out  that  normative 
political  theory  on  the  basis  of  coordination  has 
been  insufficiently  advanced,  perhaps  because  of  a 
lack  of  awareness  of  coordination  as  a  scheme  and 
instrument  of  politics  in  a  normative  political  the¬ 
ory  framework.  Even  Hobbes’s  argument  on  abso¬ 
lutist  rule  by  a  sovereign  king  can  be  rendered  as 
normative  political  theory  articulated  by  the  notion 
of  coordination  if  Hobbesian  theory  is  reformu¬ 
lated  within  a  multilevel,  multistage  framework.  In 
such  a  framework,  a  powerful  concept  is  strategy. 
Determining  how  to  coordinate  when  you  take 
Strategy  A  and  your  adversary  takes  Strategy  B  is 
complex.  Implementing  a  two-party  coordination 
based  on  multilevel  and  multistage  strategies  makes 
normative  political  theory  more  complex.  Yet  this 
line  of  theorization  has  been  conducted  since  the 
mid-20th  century  in  other  disciplines,  such  as  mili¬ 
tary  science,  business  management,  and  different 
branches  of  engineering,  often  in  the  form  of  game 
theory.  Normative  political  theory  with  coordina¬ 
tion  as  a  key  concept  is  bound  to  grow,  since  poli¬ 
tics  relies  heavily  on  coordination. 

Six  Schools  of  Empirical  Political  Theory 

As  noted  before,  schools  of  empirical  political 
theory  are  difficult  to  classify  (see  Robert  Goodin, 
2009;  Robert  Goodin  &  Hans-Dieter  Klingemann, 
1996;  Fred  Greenstein  &  Nelson  Polsby,  1975). 
Two  yardsticks  are  the  behavioral  revolution  (the 
1950s  through  the  1960s)  and  the  postbehavioral 
revolution  (the  1970s  through  the  1980s),  fol¬ 
lowed  by  the  “perestroika”  movement  in  the 
American  Political  Science  Association  (the  1990s 
through  the  2000s).  With  the  behavioral  revolu¬ 
tion,  systems  theory  and  behavioralism  became 
prominent.  With  the  postbehavioral  revolution, 
new  attempts  were  made  beyond  the  behavioral 
revolution  in  political  science.  The  postbehavioral 
revolution  and  the  perestroika  movement  tried  to 
make  political  science  more  interpretative,  reflec¬ 
tive,  context  sensitive,  and  path  dependence 
attentive,  on  the  one  hand,  and  more  focused  on 
institutions  as  contrasted  to  individuals,  on  rigor¬ 
ous  utilitarian  calculus  versus  culturally  derived 


motivations,  on  neurophysical  movement  as 
opposed  to  manifested  human  behavior,  and  on  a 
global  outlook  as  against  the  perspective  of  the 
national  organic  whole,  on  the  other. 

Systems  Theory 

Dissatisfied  with  the  state  of  political  science  in 
the  1940s,  which  was  very  different  from  what  it  is 
today,  Easton  attempted,  during  the  1960s,  to 
make  political  science  a  scientific  discipline  whose 
theories  are  derived  from  empirical  testing  of  theo¬ 
retically  formulated  hypotheses  on  the  basis  of 
systematically  generated  data.  By  so  doing,  Easton 
aimed  at  creating  a  “general  theory”  of  politics 
with  a  systems  theory  framework.  Easton’s  famous 
definition  of  politics  as  the  “authoritative  alloca¬ 
tion  of  values  for  a  society”  was  born  of  this  sys¬ 
tems  theory  thinking.  It  was  refreshing  to  those 
political  scientists  who  were  dissatisfied  with  the 
political  science  of  that  period,  with  its  focus  on 
constitutions  and  institutions  on  the  one  hand  and 
ideas  and  ideologies  on  the  other.  Easton,  with  Jack 
Dennis,  embarked  on  the  study  of  a  political  system 
in  terms  of  the  political  socialization  of  children, 
whereby  children  learn  about  politics  from  parents, 
peers,  teachers,  and  preachers.  Norms,  values,  and 
rules  are  those  components  that  are  channeled  from 
one  generation  to  another  in  a  political  system. 
What  was  probably  felt  by  those  self-claimed  sys¬ 
tems  theory-influenced  political  scientists  was  that 
the  political  system  is  a  vastly  complex  set  of  inter¬ 
actions  of  actors  under  a  vast  array  of  rules  and 
norms  and  that  theorizing  it  at  a  systems  level  on 
the  basis  of  empirically  derived  evidence  is  defi¬ 
nitely  a  daunting  task.  The  behavioral  revolution 
was  raging  in  the  United  States  concurrently  with 
the  acceptance  of  systems  theory  thinking.  Then 
came  a  mild  disillusionment  with  both  systems 
theory  and  behavioralism  in  political  science,  con¬ 
current  with  the  turmoil  in  the  world  during  the 
late  1960s  and  early  1970s.  In  other  words,  the 
limitations  of  systems  theory  thinking  were  deeply 
felt.  Easton  himself  confessed  later  that  his  systems 
analysis  and  behavioralism  had  clear  limits. 

Behavioralism 

Calls  for  behavioral  persuasion  were  hailed  as 
revolutionary  by  enthusiasts,  who  proclaimed  that 
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it  would  transform  political  science  from  an  old, 
rusty  discipline  to  one  of  the  newest,  acclaimed 
disciplines,  with  its  focus  on  action  rather  than 
intention  and  motivation  and  on  analysis  rather 
than  interpretation.  Despite  the  short-lived  enthu¬ 
siasm  for  behavioralism,  the  spirit  and  style  of 
behavioralism  were  consolidated  in  highly  reputed 
journals.  Returning  to  the  explanatory  capacity  of 
empirical  political  theory,  we  can  take  up  Bruce 
Russett’s  (1993)  democratic  peace  theory.  The 
hypothesis  tested  is  one  of  the  arguments  made 
first  by  Kant:  A  republican  regime  (as  contrasted 
to  a  monarchical  regime)  is  less  prone  to  waging 
war  against  another  republic  than  against  a  nonre¬ 
publican  regime.  In  contemporary  parlance,  the 
hypothesis  states  that  democracies  rarely  fight 
each  other.  Russett  made  use  of  pooled  time-series 
data  composed  of  pairs  of  all  the  sovereign  states 
each  year  from  1815  through  1989. 

Rational  Choice  Theory 

Rational  choice  theory  was  born  of  formal 
political  theory.  Yet  it  is  normally  attached  to  the 
testing  of  propositions  that  are  logically  derived 
from  a  certain  set  of  assumptions  and  axioms.  Its 
rise  was  due  in  part  to  the  steady  influence  of  eco¬ 
nomics  in  political  science.  Also,  among  the  subdis¬ 
ciplines  of  political  science,  political  economy,  in 
which  economics-trained  academics  like  Anthony 
Downs  (1957),  Mancur  Olson,  and  Gerald  Kramer 
exerted  considerable  influence,  was  widely  studied 
in  the  1970s  and  1980s.  It  was  also  due  in  part  to 
the  reaction  against  the  kind  of  behavioralism  that 
was  criticized  as  blind  and  barefooted  empiricism. 
Daran  Acemoglu  and  James  Robinson’s  (2005) 
Economic  Origins  of  Dictatorship  and  Democracy 
is  an  example.  It  starts  with  the  presence  of  differ¬ 
ent  social  groups.  They  prefer  different  political 
institutions  with  different  ways  of  allocating  power 
and  resources.  The  highly  resourceful  groups  want 
to  monopolize  power.  The  rest  want  democracy.  In 
due  course,  democracy  prevails  because  the  major¬ 
ity  wants  it.  In  Thailand,  the  old  elites  and  the 
newly  growing  masses  fight  each  other  intermit¬ 
tently,  using  violence.  The  old  elites  have  not  acqui¬ 
esced  in  the  capture  of  power  by  the  masses  and 
have  recaptured  power  by  resorting  to  a  military 
coup  d’etat.  But  their  reasoning  is  lucid  and  gen¬ 
eral,  unlike  the  largely  descriptive  political  science 


accounts  of  such  struggles  between  democracy  and 
dictatorship. 

Institutionalism 

As  distinguished  from  the  institutionalism  of  the 
past,  institutionalism  today  is  called  neo-institution¬ 
alism.  Its  key  features  are  (1)  a  strong  empiricism 
using  detailed  institutional  mechanisms  and  (2)  an 
ardent  comparativism  of  institutions,  which  is  used 
to  highlight  the  strength  of  the  argument  being 
made.  The  following  contrast  is  intended  to  make 
the  general  orientation  and  product  of  institutional¬ 
ism  much  clearer.  If  Acemoglu  and  Robinson  are 
the  representative  authors  of  rational  choice  theory, 
John  Ferejohn  is  the  representative  author  of  institu¬ 
tionalism.  Acemoglu  and  Robinson  formulate  and 
test  in  a  more  general  way.  Ferejohn  formulates  and 
tests  in  a  more  comparative  fashion,  making  the  best 
use  of  a  comparative  exercise  in  hypothesis  formula¬ 
tion  and  testing.  These  features  are  forcefully  pre¬ 
sented  in  his  work  on  federalism  and  on  war  and 
state  building.  The  limit  of  institutionalism  can  be 
said  to  depend  on  the  scope  and  angle  of  compara¬ 
tive  institutional  screening  and  investigation, whether 
it  examines  the  electoral  consequences  of  electoral 
rules  such  as  proportional  representation  and  one- 
person-from-one-district  or  the  social  policy  conse¬ 
quences  of  the  taxation  system. 

Neuroscience  During  the  revolutionary  period 
of  behavioralism,  Easton’s  A  Framework  for 
Political  Analysis  and  A  Systems  Analysis  of 
Political  Life  may  be  called  the  bible  of  behavior¬ 
alism.  Another  work  that  might  be  considered 
seminal  is  Karl  Deutsch’s  (1963)  The  Nerves  of 
Government.  It  highlights  the  importance  of  the 
nerves  of  the  body  politic,  which  enable  flows  and 
feedbacks  of  information.  The  work  can  be 
regarded  as  a  forerunner  of  the  neuroscience 
school  of  politics  in  a  sense,  although  his  frame¬ 
work  did  not  delve  into  neurons  and  other  neuro¬ 
physical  components  and  functions.  Since 
Deutsch’s  interest  moved  from  social  communica¬ 
tion  and  nationalism  in  a  divided  country  to 
regional  integration  across  the  Atlantic,  he  is  often 
categorized  as  a  behavioralist.  The  steady  advances 
in  neurophysical  science  since  the  1990s  may 
make  Deutsch  a  true  forerunner  of  neuroanalysis. 
If  Tip  O’Neill  is  right  when  he  says,  “All  politics 
is  local,”  a  neurophysician  is  right  when  he  or  she 
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says,  “All  politics  is  neural.”  Neuroanalysis  is 
sharply  contrasted  to  behavioral  analysis  in  that 
the  former  focuses  on  intention  and  motivation  as 
revealed  by  changes  in  the  neurons  in  the  brain 
whereas  the  latter  focuses  on  concrete,  visible 
action.  The  former  does  not  probe  into  real  inten¬ 
tions  and  motivations.  Scientific  advances  such  as 
magnetic  resonance  imaging  and  other  devices 
have  made  it  possible  to  interpret  and  understand 
human  intentions  and  motivations.  In  detecting 
suspected  criminals,  both  assembling  material  evi¬ 
dence  and  deciphering  the  motivations  of  crime 
suspects  are  indispensable.  Neuroanalytical  data 
are  increasingly  used  to  fathom  politics.  Giacomo 
Rizzolatti  and  Corrado  Sinigaglia’s  (2008)  Mirrors 
in  the  Brain:  How  Our  Minds  Share  Actions, 
Emotions,  and  Experience  reveals  how  political 
science  can  benefit  from  neuroanalysis.  Although 
neuroanalysis  is  not  limited  to  political  science,  its 
use  is  bound  increase  because,  after  all,  the  com¬ 
plexity  of  politics  lies  in  the  fathomability  of  the 
movement  of  brain  neurons. 

Globalism 

Political  science  at  the  dawn  of  the  2 1st  century 
is  moving  in  two  opposite  directions  simultane¬ 
ously:  One  is  a  neuronal  direction,  while  the  other 
is  a  global  direction.  Advances  in  science  and  tech¬ 
nology  have  enabled  human  beings  to  connect 
with  each  other  far  and  fast  in  order  to  “macro¬ 
grasp”  politics  and  dig  deep  into  the  brain  in  order 
to  “micrograsp”  politics.  The  tide  of  globalization 
was  energized  by  the  technological  advances  in  the 
20th  century.  One  of  them  is  the  electronic  revolu¬ 
tion  in  communications,  which  enabled  the  move¬ 
ment  of  money  to  go  “mad,”  in  the  words  of  Susan 
Strange.  It  now  moves  incredibly  fast,  and  along 
with  money,  many  other  things  (e.g.,  commodities, 
education,  medicine,  migrants,  viruses,  crimes, 
drugs,  weapons,  and  information)  move  fast  on  a 
global  scale.  Politics  is  not  an  exception  to  this 
irresistible  and  irreversible  tide  of  globalization. 
David  Held  (1995)  is  the  representative  globalist. 
His  Democracy  and  the  Global  Order  builds  the 
normative  stand  of  cosmopolitan  democracy 
founded  on  various  democratic  theories.  The 
extent  to  which  normative  political  theory  is  really 
universal  and  global  at  the  same  time  was  ques¬ 
tioned  until  recently  because  a  large  bulk  of 


normative  political  theory  originated  from  modern 
Europe,  where  state  building  and  political  theory 
construction  were  both  carried  out  mostly  on  a 
national  scale. 

Conversations  Between  Normative 
and  Empirical  Theories 

So  far,  this  entry  has  summarized  three  kinds  of 
political  theory:  classical,  empirical,  and  formal.  It 
has  also  shown  that  they  are  intimately  related  to 
each  other.  In  this  section,  a  few  illustrations  are 
used  to  show  that  normative  and  empirical  political 
theories  can  conduct  fruitfully  their  conversations 
in  ways  that  would  help  articulate  them  more 
sharply  and  precisely  and  identify  their  blind  spots, 
thus  enriching  each  other.  Examples  drawn  from  to 
make  this  point  include  warlike  democracies  and 
bottom-up  regime  typology.  Both  of  these  draw 
their  propositions  from  classical  political  theories, 
such  as  those  of  Kant,  Machiavelli,  Aristotle,  and 
Charles  de  Montesquieu,  and  conduct  empirical 
testing  of  such  propositions  to  see  how  much  revi¬ 
sion  is  desirable  in  both  classical  and  empirical 
theories. 

Normative  political  theory  is  meant  to  transcend 
the  particular  time  and  space  in  which  it  is  gener¬ 
ated  in  its  argument  about  how  justice  is  to  be 
achieved.  That  is  its  raison  d’etre  in  a  sense.  At  the 
same  time,  it  is  recognized  that  in  normative  politi¬ 
cal  theory,  issues  based  on  conflict,  shared  values, 
and  exchange  tend  to  dominate.  Those  based  on 
shared  values  have  been  especially  numerous.  This 
is  in  part  because  in  the  latter  half  of  the  20th  cen¬ 
tury,  political  science  publications  and  their  reader- 
ship  have  been  dominated  by  academics  in  the 
United  States  who  have  shown  an  enduring  affinity 
with  liberalism.  Empirical  political  theory  is  also 
meant  to  go  beyond  the  particular  temporal  and 
spatial  settings  so  as  to  be  valid  under  universal 
circumstances.  At  the  same  time,  it  is  recognized 
that  the  bulk  of  empirical  political  theory  has 
tended  to  be  focused  on  the  latter  half  of  the  20th 
century  and  on  the  United  States  and  the  European 
Union.  These  two  facts  together  represent  a  serious 
problem:  Empirical  political  theory  is  heavily  biased 
to  present  the  West  as  a  fountain  of  universal  truth 
and  justice. 

It  is  remarkable  that  Kant,  living  in  Koenigsberg 
his  entire  life  and  witnessing  the  vicissitudes  of  life 
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in  European  environments,  grasped  the  wave  of 
the  future  in  a  most  succinct  way.  Although 
democracies  are  peace  loving  among  themselves, 
they  are  war  prone  toward  nondemocracies,  which 
they  fear  are  a  threat  to  their  existence.  The  right- 
wing  Kantians  were  born  in  the  2000s,  in  the  wake 
of  the  September  11,  2001,  terrorist  attacks  on  the 
United  States  and  the  Afghan  and  Iraq  wars.  They 
argued  that  if  Kant  had  been  alive  and  had 
observed  these  events,  he  would  have  supported 
the  Iraq  war  to  prevent  the  weapons-of-mass- 
destruction  (WMD)  state  of  Iraq  from  exploding 
externally.  They  argued  that  democracies  cannot 
be  just  passive,  sitting  idly  by  while  innocent  civil¬ 
ians  are  being  killed  and  when  democracies  are 
challenged  by  the  threat  of  force.  Instead,  democ¬ 
racies  should  fight  against  antidemocratic  forces. 
In  stark  contrast  to  the  democratic  interventions  of 
right-wing  Kantians,  the  left-wing  Kantians  called 
for  democracy  that  rejects  interference  from  out¬ 
side  in  the  internal  democratic  affairs  of  a  country, 
termed  sovereign  democracy  by  Vladislav  Surkov. 
This  refers  to  the  incidents  that  took  place  in  rela¬ 
tion  to  the  democratization  and  secessionist  move¬ 
ments  in  those  societies  that  used  to  be  united  in 
the  former  Soviet  Union,  such  as  Chechnya, 
Georgia,  Ukraine,  and  Kyrgyzstan. 

In  the  late  18th  century,  Europe  saw  the  French 
Revolution  and  the  Napoleonic  War.  Kant  believed 
that  the  advent  of  a  republican  regime,  free  trade, 
and  multilateral  treaties  and  international  organiza¬ 
tions  would  herald  eternal  peace  in  a  fledgling  form. 
It  is  clear  that  Kant’s  ideas  are  surely  bound  by  time 
and  space.  In  the  dawn  of  the  21st  century,  we  saw 
the  9/11  terrorist  attacks  and  the  Afghan  and  Iraq 
wars,  leading  the  right-wing  Kantians  and  the  left- 
wing  Kantians  to  argue  as  to  whether  humanitarian 
interventions  are  justified  (the  right-wing  Kantians) 
or  not  (the  left-wing  Kantians)  to  propagate  demo¬ 
cratic  regimes  from  the  outside.  Thus,  we  can 
conclude  that  normative  political  theory  has,  not 
surprisingly,  a  context  boundedness. 

Along  a  different  line  of  argument,  Kant’s 
republics  have  two  types  of  checks-and-balances 
mechanisms,  according  to  John  Ferejohn  and 
Frances  Rosenbluth  (2008).  Horizontal  checks  are 
those  mechanisms  working  at  the  higher  level  of 
the  legislature  and  the  executive.  Fawmakers  and 
law  executors  are  different  and  separate.  Neither 
can  dictate  to  the  other,  and  thus,  a  regime’s 


restraining  mechanisms  work  better  than  otherwise 
would  be  the  case.  Vertical  checks  are  those  mecha¬ 
nisms  working  between  the  elite  and  citizen  levels. 
Kant’s  republican  democracies  distinguish  between 
the  decision-making  elites  and  the  decision-shaping 
citizens  but  only  on  the  condition  that  they  interact 
with  each  other.  In  other  words,  elites  take  into 
account  citizen  preferences  in  their  decision  mak¬ 
ing,  while  citizens  express  their  preferences  verbally 
and  demonstrably  to  elites  in  their  decision  shap¬ 
ing.  Ferejohn  and  Rosenbluth  argue  that  Kant  must 
be  differentiated  from  Machiavelli  in  that  the 
republican  democracies  equipped  with  checks-and- 
balances  mechanisms  in  Kant’s  argument  prescribe 
tangentially  against  Machiavelli’s  democratic  mobi¬ 
lization  theory  and,  thus,  his  war-prone  democracy 
theory.  Machiavelli’s  argument  is  based  on  the 
comparison  between  republican  Rome  and  monar¬ 
chical  Florence.  Republican  Rome  was  endowed 
with  soldiers  whose  war-fighting  motivation  and 
capacity  were  high  because  it  was  a  politically 
inclusive  regime.  Monarchical  Florence  was  plagued 
by  mercenaries  whose  war-fighting  motivation  and 
capacity  were  not  high.  Ferejohn  and  Rosenbluth 
use  Machiavelli  to  make  sense  of  war-prone  democ¬ 
racies  as  witnessed  in  the  late  20th  and  early  2 1st 
centuries.  Therefore,  according  to  them,  peace- 
loving  and  war-prone  democracies  are  not  separate 
but  different  sides  of  the  same  coin. 

In  1835,  focusing  on  the  armed  forces  in 
democracies,  Alexis  de  Tocqueville  observed  and 
pointed  to  the  inherent  danger  of  the  military  con¬ 
ducting  activities  that  might  give  a  bad  reputation 
to  democracies.  Tocqueville  observed  that  the 
army  in  America  defended  the  invaders,  who  plun¬ 
dered  the  land  originally  possessed  by  natives  and 
broke  armistice  and  peace  treaties  with  the  natives. 
The  army  was  hugely  supported  by  the  Americans, 
who  advanced  to  the  west  from  the  initial  patches 
of  land  on  the  Atlantic  coast  as  if  it  were  their 
“manifest  destiny,”  before  this  phrase  was  coined 
later.  Tocqueville  was  apprehensive  of  the  danger 
of  the  military  having  its  conduct  legitimated  by 
democracies  whose  ideas  he  emphatically  approves 
and  expressly  admires.  In  the  wake  of  9/11  and  the 
acts  of  revenge  the  United  States  engineered,  Reiji 
Matsumoto  sensitizes  this  aspect  of  American 
democracy  fully,  citing  Tocqueville.  In  other 
words,  democratic  peace  and  war  proneness  are 
different  sides  of  the  same  coin. 


2062  Political  Theory 


American  authors  like  Max  Boot  and  Robert 
Kagan  make  the  same  set  of  observations  of 
American  being  prone  to  the  use  of  force  in  settling 
conflicts  of  interest  abroad  even  before  its  indepen¬ 
dence  from  England.  Both  authors  are  called  neo¬ 
conservative  in  the  United  States;  in  Russia,  they 
are  called  the  Bolsheviks  of  the  21st  century.  The 
American  neoconservatives  and  the  Russian 
Bolsheviks  led  by  Vladimir  Ilyich  Lenin  during  the 
previous  century  have  two  things  in  common: 
They  are  idealists,  pursuing  their  ideals  with  com¬ 
mitment  and  compassion,  and  they  are  realists, 
legitimating  the  use  of  force  if  their  casus  belli  is 
deemed  justifiable  in  light  of  their  ideals,  such  as 
democracy  and  human  rights.  Perhaps  Tocqueville 
was  right  in  his  instinctive  apprehension  since  he 
had  experienced  democratic  imperialism  in  Europe 
in  the  form  of  the  revolutionary  war  waged  by 
Napoleon  Bonaparte. 

Classical  political  theorists  often  talk  about 
regime  types.  Aristotle,  in  his  Nicomachean  Ethics , 
argues  that  ethics  is  the  basis  of  politics.  Thus, 
depending  on  the  prevailing  ethics  in  society,  three 
regime  types  are  identified:  monarchy,  aristocracy, 
and  politeia.  Monarchy  is  shaped  by  the  kind  of 
ethics  that  prevail  between  a  father  and  a  son;  aris¬ 
tocracy  is  shaped  by  the  kind  of  ethics  that  prevail 
between  a  husband  and  a  wife;  and  politeia  is 
shaped  by  the  kind  of  ethics  that  prevail  between 
an  elder  brother  and  a  younger  brother.  Aristotle’s 
theory  is  monadic  in  the  sense  that  once  the  pre¬ 
vailing  ethics  is  identified,  the  regime  type  is  auto¬ 
matically  determined.  Montesquieu’s  (1748)  Spirit 
of  Law  is  also  monadic.  Three  regime  types — 
republicanism,  monarchy,  and  autocracy — are 
determined  by  the  driving  spirit  of  a  regime. 
Republicanism  is  driven  by  virtue,  aristocracy  is 
driven  by  honor,  and  autocracy  is  driven  by  fear.  It 
is  clear  that  the  relationship  between  citizens  and 
the  state  is  monadic  and  that  once  the  regime  type 
is  specified,  the  prevailing  ethics  of  citizens  is  also 
specified.  In  the  late  19th  and  early  20th  centuries, 
Weber  talked  about  regime  types  via  means  of 
regulation — that  is,  charisma,  force,  and  legal  ratio¬ 
nality.  Again,  the  monadic  determination  is 
assumed.  Once  the  means  are  specified,  regime 
types  are  determined,  and  vice  versa.  The  state  of 
affairs  has  not  changed  much  even  in  the  late  20th 
and  early  21st  centuries.  For  instance,  Stein  Rokkan, 
during  the  1960s,  articulated  the  formation  of 


democratic  states  in  European  history.  Yet  Rokkan 
talks  about  only  regime  types,  not  citizens.  One 
may  wonder  why  there  has  been  little  interest  in 
citizens  and  their  relationship  with  the  state  when 
opinion  polls  are  conducted  all  over  the  world, 
including  in  many  authoritarian  societies.  Until 
recently,  one  could  explain  the  paucity  of  discus¬ 
sion  on  citizens  in  theories  about  the  state  as  the 
main  theories  deal  with  the  institutions  rather  than 
with  the  social  actors. 

Aristotle’s  Nicomachean  Ethics  makes  ethics 
shared  and  practiced  by  a  certain  set  of  people  the 
determinant  of  regime  types — thus  his  regime 
types  of  monarchy,  aristocracy,  and  politeia. 
Within  politeia ,  he  has  three  regime  subtypes: 
autocracy,  oligarchy,  and  democracy.  How  are  the 
three  regime  subtypes  of  politeia  determined? 
Some  scholars  conjecture  that  those  regimes  sub- 
types  are  not  well  “disciplined”  by  a  certain  set  of 
ethics  the  way  monarchy  and  aristocracy  are  deter¬ 
mined  by  what  is  called  the  societal  regime  pre¬ 
vailing  at  the  bottom.  Ethics  here  is  broadly 
understood  as  the  prevailing  political  culture, 
whose  key  components  are  defined  as  identity,  con¬ 
fidence,  and  satisfaction.  These  few  societal  regime 
types  are  not  quite  formulated  at  an  abstract  level 
that  may  be  comfortable  to  some  political  theorists. 
But  it  is  important  to  stress  the  need  to  initiate  con¬ 
versations.  The  classical  theorists  may  start  from 
anywhere  they  wish.  The  empirical  theorists  may 
start  from  the  point  where  they  are  most  comfort¬ 
able.  What  is  important  is  that  they  must  move  in 
new  directions,  which  could  bring  them  to  carry 
out  conversations  tete-a-tete,  instead  of  digging 
holes  on  both  sides  of  what  might  be  called  the 
Maginot  line  of  both  schools  of  theorists. 

With  survey  and  nonsurvey  data  being  continu¬ 
ously  collected,  empirical  political  theorists  must 
grapple  with  the  bottom-up  determination  of  a 
regime  type.  So  must  normative  political  theorists. 
Neither  normative  nor  empirical  political  theorists 
should  discuss  a  regime  type  without  examining 
the  bottom  level,  that  is,  the  citizens.  The  theoreti¬ 
cal  problem  does  not  end  here.  Even  in  the  general 
discussion  of  a  regime  type,  say  democracy,  fuzzi¬ 
ness  abounds  in  the  usage  of  this  term. 
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Politicization  of  Bureaucracy 


Politicization  of  the  bureaucracy  is  best  under¬ 
stood  in  reference  to  two  phenomena  that  are 
sometimes  related.  The  first  is  the  attempt  of 
political  principals  to  exercise  control  over  the 
bureaucracy.  Because  politicization  thus  under¬ 
stood  is  in  the  first  place  an  aspect  of  particular 
institutional  structures,  it  may  be  called  institu¬ 
tional  politicization.  The  second  form  of  politici¬ 
zation  occurs  when  this  control  is  exploited — that 
is,  when  the  bureaucracy  behaves  in  a  manner 
responsive  to  politicians.  Because  the  entailed 
notion  of  politicization  refers  primarily  to  patterns 
of  behavior  rather  than  features  of  institutions,  it 
is  significantly  distinct  from  institutional  politici¬ 
zation.  This  second  notion  is  called  behavioral 
politicization.  Although  both  types  of  politiciza¬ 
tion  are  often  criticized,  their  social  consequences 
vary.  Thus,  while  a  given  instance  of  politicization 
can  be  evaluated,  a  broad  a  priori  claim  about 
politicization  of  the  bureaucracy  is  difficult. 

Institutional  Politicization 

Politicians  have  many  means  at  their  disposal  for 
influencing  and  controlling  the  bureaucracy.  These 
include  administrative  procedures,  budget  controls, 
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oversight,  and  prior  review  of  agency  action.  The 
mechanism  most  commonly  associated  with  politi¬ 
cization  is  staffing  an  agency  with  appointees  cho¬ 
sen  by  politicians,  usually  in  the  executive  branch; 
the  higher  the  proportion  of  political  appointees 
within  the  agency  and  the  greater  the  extent  of 
their  penetration  of  the  agency  hierarchy,  the  more 
politicized  the  agency  is.  Thus,  the  United  States 
Office  of  Management  and  Budget,  with  more 
than  7%  of  its  staff  appointed  by  the  executive,  is 
more  politicized  than  the  United  Kingdom’s 
Treasury  Ministry,  with  only  seven  political 
appointees,  or  0.5%  of  its  staff.  This  entry  focuses 
on  political  appointees  rather  than  the  other  mech¬ 
anisms  of  institutional  politicization. 

A  politicized  bureaucracy  can  be  contrasted  with 
one  that  is  neutral,  autonomous,  or  insulated — that 
is,  one  free  from  influence  by  the  political  branches 
and  able  to  pursue  its  own  agenda.  Agencies  may 
be  free  to  do  so  by  virtue  of  stalemates  between 
political  actors,  each  actor  seeking  to  influence 
agency  decisions,  having  cultivated  a  sufficiently 
influential  constituency  of  its  own,  or  monopoliz¬ 
ing  the  information  necessary  for  effective  over¬ 
sight.  Political  appointees  within  the  bureaucracy 
can  also  be  contrasted  with  careerists  who  enter  the 
civil  service  and  work  their  way  through  its  ranks. 
This  distinction  is  particularly  important  because 
careerists  typically  enjoy  tenure,  promotion,  and 
salary  protections.  Careerists  are  thus  not  chosen 
by  politicians.  Rather,  they  are  typically  selected  on 
the  basis  of  objective  measures,  such  as  a  competi¬ 
tive  exam  or  special  education  (e.g.,  the  Ecoles 
Nationale  d’ Administration  and  Polytechnique  in 
France),  and  are  insulated  from  politicians.  This 
insulation  should  not,  however,  be  overstated: 
Politicians  can  still  influence  careerists  within  many 
systems  by  offering  transfers  to  prestigious  posts 
and  manipulating  budgets. 

There  are  two  general  motivations  for  institu¬ 
tional  politicization.  The  first  is  patronage,  where 
political  appointments  are  created  and  filled  as 
rewards  to  political  allies  or  in  exchange  for 
favors.  Patronage  appointments  are  often  “spoils” 
distributed  by  winners  to  those  who  aided  in 
the  campaign.  The  second  is  policy  oriented. 
Politicians,  usually  executives,  can  politicize  an 
agency  to  acquire  greater  control  over  it,  staffing 
it  with  personnel  of  their  choosing  and  whom  they 
can  dismiss.  In  this  manner,  politicization  is  a 


means  by  which  executives  steer  policy.  Similar 
reasoning  informs  the  extent  of  institutional  politi¬ 
cization  when  programs  are  enacted  or  imple¬ 
mented.  If  they  anticipate  losing  political  control, 
supporters  of  a  new  bureaucratically  administered 
program  may  try  to  place  it  in  an  insulated  agency 
dominated  by  careerists,  whereas  opponents  may 
do  the  opposite,  favoring  increased  politicization 
of  the  agency  as  a  means  of  managing  the  program 
when  they  come  to  power.  Similarly,  a  legislature 
faced  with  a  hostile  executive  may  prefer  careerist 
administration  of  policy  rather  than  politicization 
as  a  means  of  cabining  the  executive’s  influence. 

Behavioral  Politicization 

Institutional  politicization  is  frequently  a  determi¬ 
nant  of  behavioral  politicization;  structures  that 
grant  politicians  influence  over  the  bureaucracy 
will  make  it  more  responsive  to  them.  Institutional 
politicization  is  not,  however,  a  necessary  precon¬ 
dition  of  behavioral  politicization.  The  bureau¬ 
cracy  can  adjust  its  behavior  out  of  an  internal 
norm,  for  example,  or  in  anticipation  of  a  threat  of 
institutional  politicization.  Likewise,  as  indicated 
by  the  patronage  motivation  for  political  appoint¬ 
ments,  institutional  politicization  does  not  always 
lead  to  or  seek  to  implement  behavioral  politiciza¬ 
tion.  Institutional  and  behavioral  politicization,  as 
defined  here,  have  a  close  relationship,  but  one 
does  not  necessarily  imply  the  other. 

Analytically,  behavioral  politicization  can  be 
further  divided  into  two  types.  Behavioral  politici¬ 
zation  can  indicate  a  shift  in  policy  mediated 
through  the  bureaucracy.  A  new  administration 
can  usher  in  a  new  set  of  priorities  and  programs, 
and  agencies  may  alter  their  behavior  to  better  real¬ 
ize  these  goals.  It  can  also  indicate  using  agency 
discretion  for  purely  political  ends,  such  as  direct¬ 
ing  government  funds  toward  political  allies  or 
targeting  opponents  for  investigation  and  scrutiny 
by  enforcement  agencies.  The  first,  more  general 
type  of  behavioral  politicization  is  not  normatively 
suspect,  while  extreme  or  egregious  examples  of  the 
second  are  often  made  unlawful,  such  as  under  the 
Hatch  Act  in  the  United  States,  which  forbids  many 
government  employees  from  using  their  official 
authority  to  influence  or  interfere  with  elections. 
An  a  priori  normative  evaluation  of  behavioral 
politicization  in  general  is  therefore  difficult. 
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Effects  of  Institutional  Politicization 
on  Agency  Performance 

Some  research  indicates  that  increased  institutional 
politicization  in  the  form  of  political  appointees 
decreases  agency  performance.  However,  this  find¬ 
ing  hinges  on  the  presence  of  several  key  conditions. 
First,  it  requires  that  bureaucratic  management 
expertise  is  site  specific,  so  that  it  is  not  enough  that 
the  manager  understands  the  policy  area,  the  politi¬ 
cal  environment  the  agency  operates  within,  and  so 
on.  Instead  or  in  addition,  it  assumes  that  an  effec¬ 
tive  agency  requires  staff  with  particular  knowledge 
about  agency  structure,  budget,  internal  culture, 
and  so  on.  A  variation  on  this  assumption  is  that 
public  management  is  idiosyncratic,  so  that  other 
management  experience — which  political  appoin¬ 
tees  tend  to  have — does  not  transfer  well.  The  con¬ 
tention  is  that  appointees  are  generally  less  familiar 
with  navigating  the  bureaucratic  environment  or 
marshaling  coalitions  to  support  their  agenda.  The 
second  condition  is  that  appointees  with  the  rele¬ 
vant  expertise  and  experience  cannot  be  found  or 
that  the  costs  of  doing  so  are  unreasonably  high. 
The  third  is  that  political  appointees  do  not  offer 
their  own  countervailing  advantages.  The  alterna¬ 
tive  being  assumed  away  is,  for  instance,  that  a 
political  appointee,  who  is  far  more  likely  to  have 
served  in  the  White  House  or  Congress,  could  not 
use  those  connections  to  facilitate  achieving  the 
agency’s  goals  within  its  political  environment. 

If  all  these  conditions  hold,  then  ceteris  paribus, 
the  greater  the  extent  of  institutional  politicization 
through  political  appointees,  the  less  effective  the 
agency  will  be  at  its  allotted  tasks.  But  these  condi¬ 
tions  are  demanding.  While  there  exists  some 
empirical  support  for  the  first  and  second  condi¬ 
tions,  they  may  not  hold  universally  even  in  the 
archetypal  case  of  a  patronage  appointment  where 
a  neophyte  is  given  a  position  as  a  reward  or  polit¬ 
ical  favor.  Furthermore,  the  alternative — careerist 
bureaucrats — can  have  its  own  drawbacks.  Career¬ 
ists,  especially  if  they  are  insulated  from  political 
principals,  can  shirk  their  duties,  become  captured 
by  the  interests  they  are  charged  to  regulate,  or 
cultivate  their  own  influence  over  policy. 

Civil  service  career  protections  may  help  encour¬ 
age  bureaucrats  to  develop  expertise,  which  alone 
would  support  concerns  about  institutional  politi¬ 
cization.  However,  the  most  systematic  arguments 
for  this  view  demonstrate  that  bureaucrats  only 


develop  expertise  if  they  have  policy  preferences 
and  are  given  some  control  over  such  issues.  This 
implies  that  perhaps  the  ideal  system  is  a  hybrid  of 
institutional  politicization  and  the  civil  service  pro¬ 
tections  typically  enjoyed  by  careerists:  Politicians 
can  staff  agencies  with  those  who  care  about  the 
relevant  policies,  rather  than  using  an  objective 
staffing  mechanism  such  as  an  exam,  and  then  pro¬ 
vide  agency  employees  with  job  protection.  So  long 
as  the  bureaucrats  are  given  some  influence  over 
policy,  they  will  invest  in  developing  expertise. 

Politicization  and  Democracy 

In  addition  to  its  possible  impact  on  agency  perfor¬ 
mance,  institutional  politicization  can  strengthen 
democratic  control  over  policy  by  making  the 
bureaucracy  more  responsive  to  (elected)  politi¬ 
cians.  There  is  thus  the  potential  for  trade-offs 
between  efficiency  or  expertise  and  democratic 
accountability. 

As  previously  noted,  institutional  politicization 
can  lead  to  behavioral  politicization.  If  voters  exer¬ 
cise  effective  control  over  their  representatives,  then 
behavioral  politicization  does  not  threaten  demo¬ 
cratic  control.  The  politicized  bureaucracy  is 
responsive  to  politicians  and  acts  accordingly,  and 
in  this  case,  the  politicians  are  in  turn  responsive  to 
the  voters.  The  result  is  essentially  the  same  as  if  the 
bureaucrats  were  elected  directly.  Put  another  way, 
if  the  principal-agent  problems  between  the  voters 
and  elected  officials  are  resolved,  then  politicization 
need  not  undermine  democratic  control  of  policy. 
Institutional  politicization  is  a  possible  means  of 
resolving  the  principal-agent  problems  between 
politicians  and  bureaucrats,  so  that  the  politicized 
bureaucracy’s  behavior  (i.e.,  behavioral  politiciza¬ 
tion)  is  ultimately  responsive  to  the  voters. 

This  analysis  assumes,  however,  that  bureau¬ 
cratic  policy  is  a  salient  issue  for  voters,  that  is,  that 
they  are  willing  and  able  to  condition  their  vote  for 
the  political  controller  of  the  agency  on  the  bureau¬ 
cracy’s  actions.  If  out  of  ignorance  or  other  reasons 
they  cannot,  then  behavioral  politicization  can  be 
used  to  deliver  targeted  benefits  (penalties)  to  allies 
(enemies)  in  order  to  gain  political  advantage. 

Nicholas  Almendares 

New  York  University 
New  York  City,  New  York,  United  States 


2066  Politicization  of  Civil  Service 


See  also  Accountability,  Electoral;  Agencies;  Bureaucracy; 
Bureaucracy,  Rational  Choice  Models;  Effectiveness, 
Bureaucratic;  Politicization  of  Civil  Service; 
Principal-Agent  Theory 

Further  Readings 

Besley,  T.,  &  Coate,  S.  (2003).  Elected  versus  appointed 
regulators:  Theory  and  evidence,  journal  of  the 
European  Economic  Association,  1(5),  1176-1206. 

Gailmard,  S.,  &  Patty,  J.  W.  (2007).  Slackers  and  zealots: 
Civil  service,  policy  discrertion,  and  bureaucratic 
expertise.  American  journal  of  Political  Science,  51(4), 
873-889. 

Gordon,  S.  C.  (2009).  Assessing  partisan  bias  in  federal 
public  corruption  prosecutions.  American  Political 
Science  Review,  103(4),  534-554. 

Hatch  Act,  5  U.S.C.  §7321-7326. 

Lewis,  D.  E.  (2008).  The  politics  of  presidential 
appointments:  Political  control  and  bureaucratic 
performance.  Princeton,  NJ:  Princeton  University 
Press. 

Moe,  T.  M.  (1989).  The  politics  of  bureaucratic 

structure.  In  J.  E.  Chubb  &  P.  E.  Peterson  (Eds.),  Can 
the  government  govern?  (pp.  267-329).  Washington, 
DC:  Brookings  Institution. 


Politicization  of  Civil  Service 


The  term  politicization  of  civil  service  refers  to  the 
introduction  of  political  considerations  into  actions 
traditionally  carried  out  by  the  civil  service,  thus 
changing  political-administrative  relations.  There 
are  three  established  ways  of  studying  this  process, 
reflecting  somewhat  different  understandings  of 
the  basic  concept.  The  first  tradition  concerns  how 
civil  servants  are  appointed  and  promoted.  In 
politicized  appointment  and  promotion  processes, 
merit-based  criteria  are  replaced  by  political  crite¬ 
ria.  Scholars  in  the  second  tradition  study  the 
political  preferences  of  civil  servants,  often  using 
attitudinal  data,  with  the  aim  of  answering  ques¬ 
tions  such  as  whether  the  civil  service  is  dominated 
by  the  political  left  or  the  political  right.  In  the 
third  tradition,  the  actions  of  civil  servants  are  ana¬ 
lyzed  to  assess  to  what  extent  civil  servants  are 
directly  involved  in  political  decision  making.  If 
they  are  involved,  it  is  seen  as  an  indication  of 
politicization.  In  this  entry,  political-administrative 


relations  are  first  described  generally,  followed  by 
a  discussion  of  the  three  traditions  just  mentioned. 

Underlying  the  concept  of  politicization  is  the 
normative  ideal  of  the  separation  of  politics  and 
administration.  The  argument  is  that  to  prevent 
corruption  and  patronage  and  to  shield  the  exper¬ 
tise  of  the  civil  service,  the  two  spheres  should  not 
be  merged.  The  scholarly  interest  in  political- 
administrative  relations  dates  back  to  the  late  1 9th 
and  early  20th  centuries,  when  influential  authors 
such  as  Woodrow  Wilson  and  Max  Weber  advo¬ 
cated  a  clear  distinction  between  policy  making 
and  administration.  The  dominant  view  at  the  turn 
of  the  20th  century  was  that  politicians  should  be 
responsible  for  policy  making,  while  civil  servants 
should  execute  the  decisions  taken. 

The  sharp  distinction  between  the  activities  of 
politicians  and  civil  servants  has  been  questioned 
by  many  scholars.  It  has  been  pointed  out,  for 
example,  that  elected  politicians  have  a  legitimate 
interest  in  controlling  what  government  organiza¬ 
tions  do.  From  a  politician’s  point  of  view,  having 
party  loyalists  implementing  policies  ensures  that 
policies  are  not  changed,  or  in  any  other  way 
obstructed,  on  the  way  from  decision  to  implemen¬ 
tation.  The  basic  idea  of  this  line  of  argument  is 
that  neutral  competence  is  not  the  only  important 
virtue  of  the  civil  service  in  a  democratic  society. 
The  neutrality  should  be  complemented  by  respon¬ 
siveness  to  democratically  elected  leaders.  From 
this  point  of  view,  some  degree  of  control,  even  if 
it  is  imposed  by  politicization  of  the  civil  service, 
could  therefore  very  well  be  advocated. 

However,  although  it  has  been  claimed  that  the 
distinction  between  politics  and  administration 
has  been  overstated,  the  ideal  of  a  separation  of 
activities  for  politicians  and  civil  servants  is  still  very 
important.  Most  students  of  political-administrative 
relations  would  today  agree  that  a  collapse  of 
political  and  administrative  activities  would  have 
severe  consequences  for  both  democracy  and  the 
efficiency  of  the  civil  service. 

Political  Appointments 

The  common  view  is  that  political  appointments 
and  promotions  have  increased  dramatically  dur¬ 
ing  the  past  decades.  There  have  been  numerous 
reports  from  countries  belonging  to  different 
Western  administrative  traditions,  for  instance,  the 
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United  States,  Sweden,  Belgium,  and  the  United 
Kingdom  (UK),  of  an  increasing  politicization  of 
the  civil  service.  There  is  also  evidence  pointing  to 
widespread  politicization  of  the  civil  service  in  the 
developing  world.  These  studies  have  created  a 
growing  and  often  critical  debate  regarding  the 
move  toward  a  more  politicized  civil  service, 
which  has  engaged  also  international  organiza¬ 
tions  such  as  the  Organisation  for  Economic 
Co-operation  and  Development  (OECD)  and  the 
World  Bank. 

Nevertheless,  even  if  scholars  have  suggested 
that  there  is  a  common  trend  of  politicization  of 
the  civil  service,  one  should  be  aware  of  the  large 
differences  among  countries  in  terms  of  the  meth¬ 
ods,  levels,  and  trends  of  politicization. 

Generally,  there  are  two  methods  of  imposing 
political  control  over  the  civil  service  using  politi¬ 
cal  appointees.  The  most  common  method  in  the 
Western  world  is  not  to  directly  replace  civil  ser¬ 
vants  with  political  appointees  but  rather  to  add  a 
layer  of  political  appointees  to  the  civil  service. 
Political  appointees  within  this  new  layer  take  on 
advisory,  public  relation,  or  managerial  functions. 
These  functions  could  otherwise  have  been  carried 
out  by  the  civil  service.  In  some  cases,  for  example, 
during  the  Blair  administration  in  the  UK,  political 
appointees  mix  these  roles  and  serve  as  spin  doctors 
involved  in  policy-making  processes,  the  implemen¬ 
tation  of  policies,  and  public  relations.  The  system 
with  a  layer  of  political  appointees  has  a  long  tradi¬ 
tion  in  the  United  States,  historically  rooted  in  the 
so-called  spoils  system,  where  party  loyalists  fill 
important  functions  in  the  executive  branch  and  in 
federal  agencies.  Another  example  of  a  similar  strat¬ 
egy  is  found  in  Belgium,  where  ministers  in  the 
government  have  large  private  offices,  so-called 
ministerial  cabinets,  that  duplicate  civil  service  func¬ 
tions  and  give  ministers  a  political  apparatus  to  turn 
to  for  advice.  There  are,  however,  also  examples 
where  political  appointments  are  used  directly 
within  the  civil  service.  Germany  has,  for  example, 
the  institution  of  political  civil  servants,  which  refers 
to  the  top  two  ranks  of  the  civil  service  in  Germany. 
Other  examples  with  a  more  widespread  and  direct 
substitution  of  civil  servants  can  be  found  in 
Southern  European  countries  such  as  Greece, 
Portugal,  and  Spain  and  on  the  African  continent. 

Not  only  do  the  methods  differ  among  coun¬ 
tries  but  also  the  levels  of  politicization  of  the  civil 


service.  Generally,  the  levels  of  politicization  of  the 
civil  service  are  higher  in  developing  societies,  the 
two  main  reasons  being  a  political  demand  for  a 
committed  civil  service  and  the  fact  that  the  civil 
service  not  only  functions  as  a  provider  of  public 
goods  but  is  also  a  valuable  asset  for  the  employee. 
Scholars  studying  administrative  reforms  in  India 
have,  for  example,  observed  that  the  neutral  civil 
service  was  seen  as  “a  hindrance  rather  then  a 
help”  and  noted  the  politicians’  demand  for  a 
more  committed  civil  service.  Another  example 
can  be  found  in  sub-Saharan  Africa,  where  the 
state  often  is  the  main  employer.  Some  scholars 
argue  that  the  primary  objective  of  the  civil  service 
in  sub-Saharan  Africa  is  not  to  provide  service  to 
the  general  public  but  to  give  rewards  to  the  sup¬ 
porters  of  the  political  leader.  Turning  to  the 
Western  democracies,  they  can  be  divided  into 
several  administrative  traditions  with  different  lev¬ 
els  of  politicization.  Countries  such  as  Italy  and 
Belgium,  which  are  influenced  by  the  French 
administrative  tradition,  have  the  highest  levels  of 
politicization.  This  group  is  followed  by  countries 
in  continental  Europe  that  are  influenced  by  the 
German  administrative  tradition.  Then,  there  are 
the  Scandinavian  countries,  together  with  coun¬ 
tries  belonging  to  the  Anglo-Saxon  administrative 
tradition,  such  as  Ireland,  New  Zealand,  and  the 
UK,  with  relatively  low  levels  of  politicization. 
There  are,  of  course,  several  exceptions  to  this  very 
general  observation,  but  it  should  especially  be 
noted  that  the  United  States,  belonging  to  the 
Anglo-Saxon  administrative  tradition,  is  fairly 
politicized  in  comparative  terms. 

Political  Attitudes 

Studies  of  the  political  attitudes  of  civil  servants 
have  been  used  as  an  indication  of  the  politiciza¬ 
tion  of  the  civil  service.  There  are  at  least  two  dif¬ 
ferent  ways  in  which  the  attitudes  of  civil  servants 
are  important  in  the  political  process.  First,  the 
civil  service  can  make  up  a  substantial  part  of  the 
electorate  and  therefore  be  an  important  actor  if  it 
is  politically  mobilized.  Second,  and  maybe  more 
important  in  this  context,  the  attitudes  of  civil  ser¬ 
vants  might  sometimes  conflict  with  the  attitudes 
of  the  political  party  in  government,  and  this  can 
create  obstacles  for  policy  implementation  even  if 
the  civil  servants  are  not  mobilized.  Sweden  can 
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serve  as  an  example.  In  1976,  for  the  first  time  since 
1936,  a  government  not  led  by  the  Social  Democratic 
Party  was  elected.  It  is  often  claimed  that  the  new 
center-right  Swedish  government  had  problems 
implementing  new  policies  because  of  the  social- 
democratic  attitudes  of  senior  civil  servants  (even  if 
they  were  not  politically  appointed).  To  avoid  simi¬ 
lar  situations,  most  countries  have  subsequently 
created  some  kind  of  legal  or  normative  framework 
stating  the  political  neutrality  of  the  civil  service 
and/or  limiting  the  political  involvement  of  civil 
servants.  Another  example,  from  the  Thatcher  years 
in  the  UK,  can,  however,  illustrate  that  conflicts 
between  the  political  leadership  and  the  civil  service 
are  not  always  about  party  politics.  The  Conservative 
Prime  Minister  Margaret  Thatcher  tried  to  imple¬ 
ment  a  radical  reform  program  in  the  public  sector. 
Studies  have  shown  that  the  civil  service  was  not 
trusted  to  carry  out  these  reforms,  and  therefore, 
civil  servants  in  key  positions  were  replaced  by  per¬ 
sons  committed  to  the  reform  program.  Politicization 
was  based  on  attitudes  to  that  reform  program 
rather  than  to  the  Conservative  Party. 

Political  Actions 

It  is  inevitable  that  civil  servants  take  part  in  the 
political  processes  to  some  extent,  since  they  are 
parts  of  politically  led  machineries.  In  most  coun¬ 
tries,  top  civil  servants  are  also,  to  some  extent, 
involved  in  giving  policy  advice  to  ministers. 
However,  there  are  large  variations  in  how  the 
civil  service  is  involved  in  political  actions. 

Participation  in  the  policy-making  process  is 
regarded  as  a  critical  indication  of  politicization. 
Generally,  the  civil  service  can  participate  in  three 
different  ways:  (1)  directly,  (2)  indirectly  as  actors 
in  the  political  decisions,  or  (3)  as  advisors  to  the 
elected  politicians. 

One  example  of  direct  involvement  is  the  Japanese 
tradition,  where  the  civil  service  does  not  limit  its 
role  to  policy  implementation  or  technical  advice 
but  is  considered  to  be  a  powerful  actor  in  the  pol¬ 
icy-making  process.  Until  the  end  of  the  1990s,  top 
civil  servants  even  took  part  in  discussions  in  the 
Japanese  legislature,  the  Diet,  something  that  would 
be  unthinkable  in  most  other  countries.  In  Spain, 
civil  servants  are  indirectly  involved  in  the  policy¬ 
making  process.  Spain  traditionally  has  a  close  con¬ 
nection  between  the  political  and  administrative 


elites.  The  political  elite  is  largely  recruited  from  the 
Spanish  administrative  corps.  It  could,  therefore,  be 
argued  that  the  civil  service  is  involved  in  the  policy¬ 
making  process,  although  not  in  the  same  direct 
way  as  it  is  in  Japan. 

A  third  example,  illustrating  the  advisory  func¬ 
tions  of  the  civil  service,  can  be  found  in  Denmark. 
It  is  one  of  the  countries  in  the  world  with  the  few¬ 
est  political  appointments  in  the  civil  service.  This 
puts  the  politically  elected  leaders  in  a  situation 
where  the  civil  service  is  the  only  body  outside  the 
party  organization  where  they  can  turn  for  advice. 
Civil  servants  in  Denmark,  therefore,  play  a  signifi¬ 
cant  role  in  the  policy-making  process  as  advisors, 
and  this  is  paradoxically  due  to  Denmark’s  low- 
level,  rather  than  high-level,  political  appointments. 
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state  formation,  8:2500 
Zionism  as,  8:2767 

See  also  Decolonization;  Empires;  Imperialism;  Indigenous 
peoples’  rights;  Orientalism;  Postcolonialism 
Columbia  University,  1:139,  5:1534,  5:1535, 

6:1778-1779,  6:2014 
Comecon,  8:2458 

Comintern.  See  Communist  International 
Commission  on  Global  Governance,  4:1011 
Common  goods,  2:307-310 
common  pool  resources,  2:307,  2:308,  2:309,  5:1666, 
7:2159,  7:2204,  8:2643-2646 


definition,  2:307 

empirical  research,  8:2645-2646 

global,  7:2160-2161 

institutions,  8:2646-2648 

managing,  2:308-309,  7:2205,  8:2644-2646 

prisoners’  dilemma,  7:2132 

types,  2:307 

See  also  Natural  resources;  Public  goods;  Tragedy  of  the 
commons 

Common  pool  resources  (CPRs),  2:307,  2:308,  2:309, 
5:1666,  7:2159,  7:2204,  8:2643-2646 
See  also  Tragedy  of  the  commons 
Communal  rolls,  3:751 
Communication 
during  crises,  2:496 
logics,  4:1188,  4:1196 
See  also  Political  communication 
Communication  media,  5:1530 
See  also  Media,  electronic;  Media,  print 
Communication  studies,  6:1961-1962 
Communication  technology,  5:1684,  6:1939,  6:2060 
Communicative  action,  2:499,  3:685,  4:1196,  7:2327-2328 
Communicative  discourse.  See  Discourse  analysis 
Communism,  2:310-314 
anarcho-,  1:76 

distinction  from  socialism,  2:310,  8:2450-2451 
equality,  8:2450 

evolution  of  theory,  2:310-311,  2:314-315 
as  ideal,  2:313-314 

ideology,  2:311,  2:314-315,  2:320,  4:1145 
intellectuals  and,  4:1208 
legacy,  2:313 

Marx  on,  2:310,  2:311-312,  2:314-315,  5:1499 
nationalism  and,  2:313 
pacifism  and,  6:1759 
as  political  organization,  2:311-312 
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Politics  of  Language 


Political  language  narrowly  defined  refers  to  a 
specific  use  of  language  for  political  means.  In  a 
broader  sense,  political  language  refers  to  lan¬ 
guage  policy  or  political  linguistics — that  is,  the 
ways  in  which  governments  attempt  to  shape  the 
linguistic  structure  of  the  society  or  the  claims 
issued  by  linguistic  groups  to  change  existing  lan¬ 
guage  arrangements  or  legislations.  Neither  of 
these  working  definitions  fully  captures  the  rele¬ 
vance  of  language  issues  to  politics  and  political 
science.  This  entry  therefore  reviews  three  types  of 
related  but  not  overlapping  issues:  (1)  the  way 
social  sciences  address  the  relationship  between 
language  and  politics,  (2)  classical  language  poli¬ 
cies  and  legislation,  and  (3)  the  “normative  turn.” 

Until  recently,  language  matters  in  social 
sciences  have  been  addressed  in  anthropology, 
sociolinguistics,  and  sociology,  focusing  almost 
exclusively  on  language  identity  (class  or  group 
identity)  and  language  as  an  expression  of  a  spe¬ 
cific  culture — hence  its  intrinsic  value  as  a  unique 
cultural  vehicle.  These  approaches  are  conse¬ 
quently  committed  to  language  diversity.  But  lan¬ 
guage  diversity  generally  hinders  efficient  political 
administration,  and  comparative  politics  have 
shown  that  the  most  classical  sequence  of  nation 
building  is  language  rationalization,  that  is,  the 
imposition  of  one  or  more  official  national  lan¬ 
guages.  Monolingualism  is  not  the  norm;  most 
countries  have  to  manage  language  diversity, 
either  by  admitting  a  set  of  official  languages  or  by 
tolerating  language  diversity. 

Language  and  Politics 

To  the  Greeks,  language  and  politics  are  so  closely 
related  that  the  practice  of  democracy  is  impossi¬ 
ble  without  a  theory  of  language.  “Nature  makes 
nothing  in  vain  .  .  .  man  is  the  only  animal  with 
the  gift  of  speech”  (Aristotle,  Politics,  I,  2).  Politics 
is  language,  because  language  enables  deliberation 
and  debate  and  rules  out  violence,  as  Norbert  Elias 
has  shown  in  his  Civilizing  Process.  Language  is 
therefore  central  to  any  political  activity. 

Language  is  also  central  to  social  interaction, 
and  social  scientists  have  been  indebted  to  pioneer 
linguists  such  as  Ferdinand  de  Saussure,  who 


claimed  that  language  is  the  epistemological  model 
of  society.  In  other  words,  society  is  structured  like 
a  language.  Structuralism  indeed  claims  that  there 
is  a  “social  grammar,”  and  social  sciences  in  gen¬ 
eral  became  fond  of  the  idea  that  the  implicit  and 
explicit  code  ruling  society  resembles  a  constrain¬ 
ing  grammar  of  interaction.  Social  theory  in  the 
1970s  understood  social  action  as  either  conform¬ 
ing  to  that  grammar  or  violating  its  rules. 

Three  main  movements  characterize  the  past 
decades  of  research: 

1.  The  first  is  the  reception  of  nominalism  in 
analytical  philosophy  and  its  durable  influence  on 
constructivism  in  general.  Nominalism  holds  that 
there  are  no  universal  concepts,  only  empirical 
observations;  referred  to  language,  reality  pertains 
to  our  ways  of  constructing/naming  it;  John 
Austin’s  illocutionary  acts,  for  instance,  are  means 
of  “doing  things  with  words”:  A  minister  saying, 
“I  baptize  this  child,”  ipso  facto  baptizes  it. 

2.  The  second  involves  studies  on  language  and 
domination,  with  two  subfields.  One  is  concerned 
with  routinized  forms  of  domination,  as  in  Pierre 
Bourdieu’s  Language  and  Symbolic  Power  (1982, 
1991),  for  whom  the  linguistic  market  is  similar  to 
any  other  market,  or  Basil  Bernstein’s  Class,  Codes 
and  Control  (1971),  where  a  sharp  line  is  drawn 
between  the  “elaborated  speech  codes”  of  the 
middle  classes  seeking  social  mobility  and  the 
“restricted  speech  codes”  of  the  working  classes. 
Speech  responds  to  strong  cultural  pressures:  An 
elaborated  code  summarizes  general  social  means 
and  ends,  whereas  a  restricted  code  is  limited  to 
local  means  and  ends.  The  other  subfield  is  con¬ 
cerned  with  the  exacerbated  form  of  domination: 
propaganda  and  totalitarian  language.  George 
Orwell  depicts  the  deterioration  of  language  in  his 
novel  1984,  and  Viktor  Klemperer’s  diary  ( Lingua 
Tertii  Imperii )  describes  the  sheer  truth  of  how  the 
Nazis  systematically  destroyed  the  German  lan¬ 
guage,  not  only  by  evacuating  the  bald  words  but 
by  substituting  them  with  euphonized  codes,  such 
as  “final  solution”  or  “special  treatment.” 

3.  The  last  movement  is  a  legacy  from  philoso¬ 
phy  and  anthropology  and  relates  to  the  profound 
skepticism  concerning  the  written  form  of  lan¬ 
guage  and  its  potential  link  to  illiberal  policies 
(Plato,  accusing  the  logographs,  wrote  that  script 
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lies  and  that  speech  is  sincere;  script  is  compared 
to  a  pharmakon — poison  and  medicine  alike). 
Anthropology  confirms  that  script  is  historically 
linked  to  despotism,  as  the  original  use  of  script 
was  for  accountability  and  domination  of  vast  ter¬ 
ritories,  not  poetry.  Classification  might  be  a  natu¬ 
ral  property  of  language;  it  enables  governments  to 
control  society  through  rules  enabled  by  script. 

Language  Policies 

Language  policy  is  an  attempt  to  weigh  collective 
language  choices  by  institutional  means,  to  pre¬ 
scribe  the  public  use  of  one  (or  more)  language(s), 
and  to  adopt  language  legislation.  Historically,  cre¬ 
ating,  rationalizing,  or  maintaining  a  language  is  the 
classical  (European)  sequence  of  language  policy, 
mostly  congruent  with  nation  building  in  the  19th 
century.  Official  languages  are  not  always  national 
languages  (an  official  language  can  coexist  with  a 
set  of  national  languages),  and  substate  national 
communities  or  groups  with  a  strong  regional  iden¬ 
tity  may  challenge  the  official  language  and  make 
new  language  claims.  Official-  and  national- 
language  policies  are  only  efficient  alongside  com¬ 
pulsory  education,  a  wide  interest  in  learning  and 
using  the  official/national  language,  and  some  kind 
of  reward  for  doing  so  (professional,  symbolic) — 
the  latter  is  particularly  true  for  national-language 
policies:  “What  the  eye  is  to  the  lover,  language  is 
to  the  patriot”  (Benedict  Anderson,  1983,  p.  154). 

The  French  example  is  telling.  From  the  16th 
century  (the  Edit  de  Villers-Cotterets,  1539)  to  the 
1992  constitutional  revision,  France  has  always 
aimed  at  rationalizing,  spreading,  and  maintaining 
the  French  language  as  a  symbol  of  common 
belonging,  a  means  of  effective  administration  in  a 
centralized  state,  a  key  element  of  republican  iden¬ 
tity,  and  a  sign  of  equal  citizenship.  Ironically,  it 
was  an  abbot  during  the  peak  of  the  French 
Revolution  (1790-1794),  Gregoire,  who  was  con¬ 
vinced  that  revolutionary  ideas  could  only  be 
thought  of  in  French  and  who  implemented  the 
first  true  language  policy:  Political  praxis  and  lan¬ 
guage  practice  are  inseparable. 

Language  Legislation 

The  next  step  is  language  legislation  and  conse¬ 
quently  language  rights.  All  states  “speak,”  issue 


laws,  and  administer;  language,  therefore,  cannot 
be  benignly  neglected  as  religion,  for  instance.  In 
monolingual  settings,  the  public  sphere  is  entirely 
ruled  in/by  one  language.  In  multilingual  states, 
mostly  federations,  legislators  have  a  choice 
between  two  principles:  the  territoriality  and  the 
personality  principles.  The  first,  and  most  wide¬ 
spread,  principle  is  based  on  territorial  rights 
(Belgium,  Switzerland,  and  Cameroon):  It  legally 
recognizes  a  monolingual  territory.  Variants  are 
territorialized  individual  rights  (Catalonia  and 
South  Tyrol),  special  policies  for  minorities 
(Australia,  the  United  States,  and  Germany, 
Hungary),  and  territorial  bilingualism  for  minori¬ 
ties  (Estonia,  Bosnia,  and  Pakistan).  Territoriality 
is  usually  associated  with  administrative  bilingual¬ 
ism  (civil  servants  speak  all  or  part  of  the  official 
languages)  to  ensure  statewide  communication;  it 
provides  language  stability  and  language  security 
(small  languages  are  protected  on  their  territory, 
and  language  scales  are  relatively  stable)  but 
obliges  all  to  speak  the  official  language  in  its  ter¬ 
ritory  of  reference.  Territoriality  generally  leads  to 
juxtaposed  unilingualisms  and  may  disrupt  inter¬ 
community  communication,  such  as  in  Belgium. 
The  personality  principle,  on  the  other  hand,  is 
best  described  by  institutional  multilingualism: 
The  state  acknowledges  and  recognizes  individual 
language  choices.  Regardless  of  where  one  is  in  the 
territory,  civil  administration  has  to  cater  to  the 
individual’s  language  choices.  Most  of  Canada  fol¬ 
lows  this  principle,  with  both  French  and  English 
as  official  languages;  however,  in  response  to 
Quebec’s  desire  to  preserve  the  French  language, 
legislation  (Bill  101,  1977)  was  passed  that  makes 
French  the  sole  official  language  in  the  province. 

Fanguage  legislations  are  mostly  a  blend  of  dif¬ 
ferent  types  of  territoriality  associated  with  indi¬ 
vidual  rights.  Strategic  multilingualism  in  India  or 
South  Africa  hints  at  political  rather  than  linguistic 
difficulties  in  a  postcolonial  or  postapartheid  era, 
respectively.  Very  few  states  have  achieved  “libera¬ 
tion”  from  the  colonial  tongue.  The  Arabization 
movement  in  Algeria  (the  return  to  Arabic  as  the 
only  official  language)  was  a  linguistic  and  politi¬ 
cal  failure:  The  Algerian  upper  classes  continued  to 
speak  French,  the  token  of  upward  mobility.  One 
interesting  feature  of  language  policy  is  interna¬ 
tionalization  (spreading  a  national  language  world¬ 
wide).  Colonial  empires  did  so  in  Africa,  Australia, 
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and  India;  and  half  a  century  after  decolonization, 
means  of  social  mobility  are  still  linked  to  “colo¬ 
nial”  language  skills. 

Almost  all  states  enforce  linguistic  assimilation 
policies,  and  very  few  states  decide  not  to  intervene 
in  language  matters,  favoring  as  such  the  dominant 
group(s).  A  complete  language  shift  is  generally 
achieved  in  the  third  generation;  diglossia  and  bilin¬ 
gualism  are  common  among  immigrant  children. 

Language  and  Fairness 

Noam  Chomsky’s  theory  about  the  grammatical 
similarity  of  all  languages  within  a  universal  mind- 
brain  set  had  expelled  sociolinguistics  and  diver¬ 
sity  studies  from  the  linguistic  departments  in  the 
1970s.  A  similar  change  of  paradigm  took  place  in 
the  1990s  with  the  normative  turn  in  language 
studies.  Until  recently,  and  despite  a  fair  amount  of 
language  conflicts,  normative  literature  in  political 
science  had  not  paid  much  attention  to  language 
and  even  less  to  the  linguistic  dimension  of  democ¬ 
racy.  Post-Rawlsian  political  theory  has  been  dis¬ 
cussing  many  culture-  or  identity-related  topics  but 
has  barely  considered  language.  Neither  liberals  or 
liberal-culturalists,  nor  communitarians  have  really 
addressed  language  equity.  It  is  only  in  the  1990s 
that  scholars  in  comparative  politics  and  econom¬ 
ics  started  to  invest  in  language  issues.  However, 
shedding  a  new  light  on  linguistic  diversity  and 
linguistic  justice  in  a  system  dominated  by  power¬ 
ful  global  languages  rapidly  led  to  a  wider  discus¬ 
sion  on  the  usefulness  and  the  threats  of  a  common 
language,  a  lingua  franca  (in  Europe  but  also  in 
multilingual  societies  and  sometimes  even  on  a 
global  level),  the  founding  idea  being  that  some¬ 
how  a  healthy  democracy  depends  on  a  common 
tool  of  communication.  The  path  taken  by  recent 
literature  is  an  attempt  to  reconcile  the  polity’s 
systemic  and  linguistic  capacity  alongside  the  citi¬ 
zen’s  effectiveness  or  voice.  The  “deliberative 
turn”  of  the  1990s  and  the  following  “talk-centric” 
theories  of  democracy  indeed  focused  on  commu¬ 
nication  and  debating  rather  than  on  voting  but 
without  mentioning  the  precondition  of  a  success¬ 
ful  public  debate:  a  common  language.  The  discus¬ 
sion  on  such  a  common  language  as  a  necessary 
condition  of  a  more  substantial  democracy  has 
been  particularly  vivid  in  Europe  and  poses  inter¬ 
esting  questions:  Do  we  need  a  common  language 


for  a  supranational  or  even  a  global  democracy? 
Do  we  need  a  common  tool  to  discuss  global  con¬ 
cerns  (environmental  issues,  for  instance)?  Would 
social  mobility  and  employability  be  enhanced  if 
we  all  spoke  the  same  language?  The  possible 
answers  are  contained  in  three  different  models. 
The  identity  model  argues  for  some  kind  of  restor¬ 
ative  justice  after  centuries  of  nation  building  and 
linguistic  rationalization:  The  claim  for  language 
diversity  is  morally  relevant,  and  states  ought  to 
acknowledge  those  claims  instead  of  reducing  lan¬ 
guage  variety  by  imposing  global/official  languages. 
The  procedural  model  is  not  outcome  oriented: 
People  should  fairly  discuss  the  language  choices  of 
their  polity.  The  economy  or  utility  model  focuses 
on  the  social  and  economic  usefulness  of  one  or 
more  linguae  francae,  the  costs  and  benefits  of 
learning/spreading  common  language(s). 

Many  political  theorists  would  argue  that  given 
the  huge  translation  costs  in  the  European  Union, 
the  most  efficient  policy  would  be  a  fair  combina¬ 
tion  of  three  official  languages  (English,  French, 
and  German)  and  a  three-communication-languages 
repertoire,  including  one  of  the  official  languages. 
Such  a  language  coordination  policy  would  rest  on 
an  official  language  set  and  a  free  language  market 
for  the  citizens;  it  would  cost  less,  enable  govern¬ 
ments  to  invest  in  language  training,  enhance  the 
possibilities  of  communication  among  European 
citizens,  and  maintain  the  possibilities  of  connect¬ 
ing  European  repertoires  with  extra-European  lan¬ 
guage  constellations. 

Astrid  von  Busekist 
Sciences  Po 
Paris,  France 

See  also  European  Integration;  Identity,  Social  and 
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State  Formation;  Territory 
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Popular  Culture 


The  term  popular  culture  has  three  levels  of  mean¬ 
ing,  which  though  interrelated  are  mutually  dis¬ 
tinct.  It  refers  first,  in  a  generalized  way,  to  the 
form  taken  by  the  totality  of  everyday  practices 
followed  by  a  society  or  a  group  within  a  society. 
In  this  sense,  the  term  is,  from  the  cultural  history 
perspective,  a  descriptive  one,  devoid  of  any  fur¬ 
ther  theoretical  implications.  Historically,  a  sec¬ 
ond,  more  narrowly  defined  meaning  of  popular 
culture  first  came  into  use.  It  was  applied  to  the 
culture  and  lifestyle  of  a  population’s  lower  social 
strata  and  referred — mostly  within  a  historical 
perspective — to  the  perceived  polarity  between  the 
culture  and  lifestyle  of  the  general  population, 
seen  as  the  uneducated  masses,  and  the  dominant 
high  culture  of  the  elites.  The  third  meaning  of  the 
term  popular  culture — which,  in  the  English- 
speaking  world  in  particular,  equates  popular 
culture  with  pop  culture  and  thus  denotes  cultural 
industry  products  distributed  by  the  mass  media 
unrelated  to  the  canon  of  classical  high  culture — 
arose  from  the  specific  dynamics  of  the  breakdown 
of  Marxist  class  antagonism  in  the  intellectual 
circles  close  to  the  Birmingham  Centre  for 
Contemporary  Cultural  Studies  (established  in 
1964).  In  effect,  the  vertically  and  hierarchically 
conceived  juxtaposition  of  two  cultures  was  now 
replaced  by  a  horizontal  array  of  multiple  equal- 
status  “cultures”  subsisting  within  the  polarities 
of  class,  race,  and  gender.  Below,  the  historical 


developments  and  the  interaction  of  these  con¬ 
cepts  are  discussed. 

Early  Developments 

The  concept  of  a  (popular,  indigenous)  “folk”  cul¬ 
ture  (V olkskultur)  existing  in  its  own  right  origi¬ 
nated  in  the  second  half  of  the  18th  century,  born 
partly  of  the  physiocrats’  interest  in  the  population 
at  large  and  partly  of  the  romantic  notion  of  a 
“folk  soul”  (Volksseele)  characterized  by  supra- 
individual,  nation-specific  features.  During  the 
Napoleonic  period,  and  in  the  context  of  the 
nascent  discourse  of  nationalism,  this  concept  of 
Volk  underwent  reformulation;  the  relationship 
between  popular  culture  and  high  culture  came  to 
be  conceived  of  as  an  organic  unity,  its  elements 
defined  in  terms  of  each  other.  Yet  at  the  same 
time,  in  the  context  of  industrialization  and  of  the 
developing  social  crisis,  the  mutual  antagonism  of 
the  two  cultural  levels  became  an  increasingly 
prominent  feature  of  socialist  and  Marxist  cultural 
theory.  The  origins  of  the  concept  of  popular  cul¬ 
ture  at  the  time  of  the  Enlightenment  are  linked  to 
the  physiocratic  interest  in  resources,  generally — 
to  agricultural  production,  mineral  resources,  and 
proto-industrial  production — linked  ultimately, 
that  is  to  say,  to  the  discovery  of  the  productive 
capacity  of  a  territory’s  working  population  to 
increase  its  wealth.  Michel  Foucault  used  the 
term  biopolitique  (biopolitics)  to  denote  this  new, 
calculating  concern  with  the  productivity  poten¬ 
tial  of  the  population  as  a  feature  of  the  new 
power  structure,  directed  against  the  ancien 
regime. 

This  same  interest  was  accompanied,  and  digni¬ 
fied  ideologically,  by  the  notion  of  a  productive 
national  soul  ( Volksseele ),  in  which  the  quintes¬ 
sential  nature  of  a  people  was  supposed  to  find  its 
expression.  It  was  this  latter  notion,  too,  that 
underlay  the  romantic  preoccupation  with  collect¬ 
ing  nation-specific  texts — folksongs  (Johann  Herd¬ 
er,  William  Wordsworth),  fairytales  (Jacob  & 
Wilhelm  Grimm),  proverbs,  and  diverse  idiomatic 
materials.  Among  those  thinking  in  terms  of  the 
nation-state,  from  about  1800  onward,  the  biopo¬ 
litical  interest  became  ideological:  The  idea  of  a 
nation  as  an  entity,  at  one  with  itself  and  clearly 
distinct  from  other  nations,  was  no  mere  matter 
of  drawing  geographical  frontiers.  Much  more 
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important,  it  called  for  cultural  demarcation  to  take 
place — a  process  complicated  by  the  conflicting 
pulls  of  a  nation’s  high  culture  and  its  “cultural 
substrate”  (Friedrich  Jahn  and  Wilhelm  Riehl).  The 
term  folklore,  coined  by  William  Thoms  in  1846, 
denotes  the  attempt  to  record  the  cultural  substrate 
of  individual  nations,  national  minorities,  or  groups 
in  genealogical  form  and  to  subject  the  recorded 
materials  to  comparison  with  a  view  to  identifying 
what  is  universal  in  human  socialization,  along 
with  what  is  specific  to  the  respective  groups.  The 
positivist  collector  activity  focused  on  folklore  was 
now  accompanied  by  scientific  analysis  and  sorting 
of  the  material  under  the  disciplines  of  ethnography 
and  ethnology.  During  the  ensuing  development  of 
the  cultural  and  social  sciences,  the  culture  of  the 
people  (Volk),  of  “the  nameless  masses,”  was  iden¬ 
tified  for  the  first  time  as  a  subject  meriting 
research.  As  such,  it  embraced  the  material  and 
ritual-performative  patterns  of  existence  and  action 
peculiar  to  the  lower  social  strata  of  people  or 
genealogical  groups  and  their  ideas  on  religious, 
juridical,  and  everyday  practical  matters,  along 
with  their  approach  to  positioning  the  individual  in 
society  (relative  to  ancestral  descent,  family,  and 
sequence  of  generations). 

Depending  on  the  political  and  social  conditions 
prevailing  in  national  groups,  cognizance  of  the 
everyday  perspectives  and  material  culture  of  the 
lower  strata  will  tend  toward  one  or  the  other  of 
two  postulates:  comparative  and  open-minded  or 
ethnocentric  and  hypostatizing.  It  is  unlikely  that 
there  has  ever  been  a  clearer  example  of  the  latter 
than  the  direction  taken  by  German  ethnology 
from  about  1900,  which  amounted  to  disowning 
the  tradition  of  a  comparative  ethnology  or  anthro¬ 
pology  concerned  both  with  industrialized  socie¬ 
ties  and  with  nonstate  groups.  However,  neither 
line  of  development  succeeded  in  demonstrating  a 
robust  link  between  high  culture  and  cultural  sub¬ 
strate.  It  was  only  20th-century  sociology  that  first 
developed  sophisticated  models  for  such  a  linkage. 
The  behavioral  standards  in  Norbert  Elias’s  theory 
of  the  civilizing  process  and  the  habitus  variants 
represented  by  Pierre  Bourdieu  must  both  be  seen 
as  distinction-drawing  processes,  as  deliberate  dif¬ 
ferentiation  of  the  elites  from  the  “broad  masses” — 
though  this  was  later  rationalized  from  the  psycho¬ 
logical  angle  in  the  contexts  of  social,  medical,  or 
public  hygiene  debates. 


Nationalist  and  Marxist  Approaches 

Thus,  in  its  early  stages  and  subsequent  develop¬ 
ment,  the  concept  of  popular  culture  marked  off 
the  sociocultural  living  patterns  of  what  was  ini¬ 
tially  the  agrarian,  peasant  population,  and  later 
included  the  proletariat  and  lower  middle  classes, 
as  separate  and  distinct  from  those  of  the  higher 
and  socially  influential  classes.  Two  conflicting 
inheritances  can  be  traced,  one  nationalistic  and 
focused  on  origins,  the  other  Marxist  and  antago¬ 
nistic.  The  nationalistic,  origins-focused  line 
reaches  its  culmination  in  National  Socialist  ideol¬ 
ogy;  here,  the  existence  of  cultural  antagonism 
within  a  society  is  denied,  and  a  countermodel, 
based  on  ideas  such  as  national  character  and  the 
“bonding”  of  a  nation’s  various  cultural  levels,  is 
proclaimed:  the  V olksgemeinsch aft .  This  was  seen 
as  a  closed  continuum,  within  which  cultural  sub¬ 
strate  and  high  culture  cross-fertilize  and  inter¬ 
weave,  since  both  are  rooted  in  the  same  humus 
and  are  inseparably  linked.  The  soil  metaphor  is 
complemented  by  that  of  blood — blood  standing 
here  for  the  vitality  and  strength  of  the  current 
generation  at  any  given  time. 

The  background  to  this  blood-and-soil  fixation 
can  be  traced  to  19th-century  organic  models  of 
society:  Nationalistic  thinkers  generated,  for  the 
benefit  of  the  nation’s  imagination,  the  self-image 
of  a  solitary  tree,  its  roots  reaching  deep  down  to 
the  first  beginnings  of  the  nation’s  separate  cultural 
identity  and  the  new  blossoms  seen  on  it  each  year 
(i.e.,  individuals  who  have  to  put  the  community 
before  themselves)  representing  national  uniqueness 
and  vigor.  In  this  figurative  thinking,  every  nation 
constituted  a  self-contained  entity,  a  clearly  demar¬ 
cated  “body”  of  the  people.  Being  the  social  stra¬ 
tum  seen  as  tillers  of  the  soil  and  as  essentially 
enduring,  farmers  became  the  ideological  standard- 
bearers  for  this  ideology  of  Volkstum,  or  national 
distinctiveness,  which  postulated  a  continuity 
stretching  from  early  origins  to  the  present  level  of 
development.  The  continued  efficacy  of  the  strength¬ 
giving  pure  origin  was  exposed  to  many  and  vari¬ 
ous  dangers:  nonrecognition  and  relinquishment  of 
the  We  as  a  consequence  of  fascination  with  the 
Other  (or  Alien)  or  the  sustained  infiltration  of  the 
“body”  of  the  people  (or  nation)  by  “alien  ele¬ 
ments,”  the  existence  of  which  was  perceived  as 
parasitical  (e.g.,  anti-Semitism).  It  has  to  be  empha¬ 
sized  that  all  postulates  of  Volkstum  and  folklore 
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regarding  industrialized  societies  and  the  research 
on  them,  both  the  more  radical  and  the  moderate 
variants,  are  ideologically  motivated.  While  it  is 
true  that  this  readiness  of  the  upper  strata  of  society 
to  turn  to  the  Volkstum  and  folklore  of  the  lower 
strata  embraces  a  certain  element  of  compensation 
for  modernization  Angst ,  the  ideologically  trans¬ 
mitted  postulate  of  pure  origins  with  minimal  con¬ 
tamination  from  outside  is  nevertheless  preeminent. 
In  other  words,  the  everyday  perspectives  of  the 
lower  social  strata  were  almost  lost  from  sight  in 
the  ideological  canon  centered  on  origins  and  race. 

The  Marxist  approach,  focused  as  it  was  on  class 
antagonism,  led  to  the  evolution  of  the  theory  of 
“two  cultures”  in  “the  same  society”  (Lenin).  The 
antithesis  inherent  in  the  power  disparity  will  prove 
to  be  a  productive  stimulus  for  both  sides;  high  cul¬ 
ture,  having  become  weak  and  decadent,  will  be 
goaded  into  repressive  action,  while  proletarian 
counterculture,  with  hitherto  untapped  reserves  of 
energy  to  draw  on,  will  indeed  lose  some  of  its  impe¬ 
tus  but  will  prove  unstoppable  in  the  long  run.  With 
the  concept  of  the  struggle  for  cultural  hegemony  as 
outlined  by  Antonio  Gramsci,  this  black-and-white 
characterization  of  the  cultural  confrontation  (sub¬ 
ordinate  to  the  economic)  becomes  less  simplistic 
and  more  applicable  empirically.  So  Gramsci’s  con¬ 
cept  of  cultural  hegemony  being  “negotiated”  within 
society  after  the  demise  of  both  the  origin-focused 
and  the  class  warfare-motivated  models  of  national 
culture  was  to  prove  exceptionally  fruitful  in  both 
historical  and  contemporary  social  research. 

Modern  Approaches 

Inquiry  into  such  negotiation  (there  are  a  host  of 
parallel  developments  besides  those  of  Gramsci) 
implies  knowledge  of  the  specific  thinking  of  the 
actors  concerned.  Thus,  while  the  special  fascina¬ 
tion  of  the  term  folklore  lay  in  its  inbuilt  tension 
between  “knowledge  of  the  people”  and  “the 
people’s  knowledge,”  the  need  that  now  arose  for 
empirical  study  of  the  multifarious  “culture  and 
way  of  life”  of  the  population’s  lower  strata 
ensured  that  it  was  their  viewpoint  that  now 
became  the  central  concern.  The  common  feature 
of  all  these  more  modern  approaches  is  that  they 
seek  to  collect  and  to  focus  on  the  inside  views 
peculiar  to  the  various  social  environments  and 
thus — here  revealing  an  anthropological  cast  of 


mind — to  supplement  the  “grand  historical  narra¬ 
tives”  with  detailed  studies  at  the  grassroots  level 
(microhistory).  The  now  widely  established  pur¬ 
suit  of  historical  research  on  popular  culture  owes 
much  to  a  history  that  started  from  the  bottom. 

Among  the  points  of  departure  for  this  historical 
anthropology  was  the  school  that  developed  around 
the  French  journal  Annales  d’bistoire  economique 
et  sociale  (launched  in  1929).  Its  founders,  Marc 
Bloch  and  Lucien  Febvre,  announced  a  historiogra¬ 
phy  that  would  subordinate  event  history  to  a 
structural  history  concerned  with  everyday  life  and 
the  mentalities  that  it  engendered.  The  long  dura¬ 
tion  (Ferdinand  Braudel’s  longue  duree)  of  produc¬ 
tion  methods,  ideas,  and  behavioral  patterns  now 
became  no  less  a  central  focus  than  single  events: 
violently  erupting  rebellions  (as  described,  e.g.,  by 
E.  LeRoy  Ladurie,  Carnival  in  Romans )  or  ideas 
about  the  nature  of  the  world,  put  forward  at 
Inquisition  hearings  but  running  counter  to  con¬ 
temporary  trends  (e.g.,  the  worldview  of  a  miller 
around  1600  in  the  benandanti,  the  work  of  Carlo 
Ginzburg).  The  ethnological  line  taken  by  the 
Prague  School  applied  the  structuralist  method  of 
meaning-constitutive  opposition — this  was  still 
before  Claude  Levi-Strauss — to  elements  of  popu¬ 
lar  culture  (P.  Bogatyrev).  Mikhail  Bakhtin,  with  a 
background  in  Russian  formalism,  developed  his 
theory  of  folk  laughter  in  late  medieval  popular 
culture  from  the  antithetical  pairing  of  work  cul¬ 
ture  and  feast-day  culture. 

Particularly  in  academic  research  in  the  fields  of 
family,  childhood,  women’s,  and  gender  studies 
(and,  not  least,  feminist  studies),  but  also  in  the 
field  of  everyday  living  patterns  and  of  power  rela¬ 
tionships  in  agrarian,  early-modern  urban,  and 
proletarian  industrialized  cultural  phenomena, 
popular  culture  researchers  have  set  new  and  last¬ 
ing  standards  in  historiography.  In  its  early  stages, 
still  adhering  to  the  Marxist  class  antagonism  con¬ 
cept,  the  Birmingham  School  of  Cultural  Studies 
later  came  to  place  the  dynamics  of  the  hierarchi¬ 
cally  subordinated  cultural  forms  qua  countercul¬ 
ture  and  subculture,  possessing  their  own  symbolic 
language,  at  the  very  heart  of  its  activities:  “culture 
as  a  whole  way  of  life”  (R.  Williams;  R.  Hoggart). 
This  assumed  the  recognition  of  individual  identity 
positioned  on  criteria  of  social  background,  occu¬ 
pation,  and  philosophical-political,  religious,  and 
sexual  orientation  as  a  decision  in  favor  of  a  form 
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of  living  that  constitutes  a  cultural  sector  in  its 
own  right.  Under  the  influence  of  Gramsci’s  theory 
of  hegemony,  the  counterculture  and  subculture 
theory  evolved — with  youth  culture  and  music 
culture  in  the  forefront — into  a  new  theory  that 
accords  parity  of  status  to  a  range  of  cultures;  the 
erudition-laden  high  culture  of  earlier  times  is  now 
just  one  culture  among  many. 

The  most  complete  transformation  has  been  in 
the  area  of  mass  media  cultural  products.  The 
Adorno/Horkheimer  thesis  of  a  culture  industry 
manipulating  the  scene  from  the  top  has  been  chal¬ 
lenged,  in  S.  Hall’s  encoding/decoding  model,  by 
consumer  appropriation  of  the  narrative.  John 
Fiske,  taking  all  categories  of  mass  media- 
transmitted  pop  culture  products  into  his  purview, 
demonstrated  the  creative  resistance  potential  of 
this  appropriation  process.  From  a  feminist  per¬ 
spective,  Angela  McRobbie  has  emphasized  the 
gender  specificity  of  the  manner  of  appropriation. 
Cultural  studies  had  already  gone  separate  ways 
by  the  time  the  Birmingham  Center  was  wound  up 
in  2002.  Influenced  by  accelerated  media  trans¬ 
mission  and  communication,  by  the  increasing 
dominance  of  the  visual  in  cultural  output,  by 
consumer-centeredness  (among  young  shoppers  no 
less  than  others)  in  the  developed  societies  con¬ 
trasting  with  abject  poverty  in  the  underdeveloped 
world,  and  by  the  enormous  differences  globally 
with  regard  to  women’s  autonomy  of  action,  the 
lines  of  research  pursued  today  in  cultural  and 
postcolonial  studies  are  characterized  by  a  bring¬ 
ing  together  of  class,  race,  and  gender. 

A  detailed  study  of  popular  culture  (Facebook, 
blogs,  discussion  forums,  etc.)  on  the  World  Wide 
Web  is  beginning,  but  it  will  take  time  until  web 
culture  research  becomes  an  established  discipline 
in  its  own  right. 

Karl  Braun 
Pbilipps-UniversitLit  Marburg 
Marburg,  Germany 
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A  distinction  must  be  made  between  more  or  less 
coherent  populist  movements  on  the  one  hand  and 
populism  as  a  generic  theoretical  concept  referring 
to  a  variety  of  demagogic  messages  and  attitudes 
on  the  other.  In  the  following,  the  various  phases 
of  populist  movements  and  ideas  are  discussed  in 
their  historical  contexts.  The  conclusion  points  to 
current  concerns. 

Phases  of  Populist  Movements 

Scholars  conventionally  identify  the  first  historical 
manifestation  of  populism  in  the  Russian  intellec¬ 
tual  populism  of  the  late  1840s  Narodnikis  (i.e., 
populists  in  Russian;  the  word  narodnitchestvo — 
populism — appeared  in  the  1870s).  Influenced  by 
the  ideas  of  Aleksandr  Herzen,  Mikhail  Bakunin, 
and  Nicolai  Tchernichevski  but  devoid  of  any 
elaborate  ideology,  these  Narodnikis  emerged  as  a 
revolutionary  movement  adhering  to  some  kind  of 
idolatry  of  the  peasantry  and  the  rural  village 
(obchtchina).  They  were  ultraleftist,  prone  to  the 
use  of  violence  and  even  political  assassinations. 
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This  led,  later  on,  to  nihilism — terrorism  as  propa¬ 
gated  by  Nicolai  Nekrassof  and  Sergei  Netchaiev. 
They  were  also  fiercely  anti-Western  (Orientalists) 
and  proslavist  (an  extremely  radical  variety  of 
revolutionary  Slavophiles). 

Thereafter,  during  the  last  quarter  of  the  19th 
century,  populism  made  a  second  appearance  in 
the  very  different  world  of  the  European  and 
North  American  industrialized  societies.  It  thor¬ 
oughly  transformed  itself  into  an  anti-elitist,  anti¬ 
parliamentarian,  and  nationalist-chauvinistic 
movement,  especially  in  France  between  1886  and 
1889  (Boulangism,  derived  from  the  name  of  its 
charismatic  leader  General  Boulanger)  and  in  the 
United  States  during  the  1890s  (with  the  xenopho¬ 
bic  Protestant  and  anticapitalist  People’s  Party  of 
the  small  White  Anglo-Saxon  Protestant  [WASP] 
farmers  and  miners).  Under  these  circumstances, 
populism  shifted  from  the  most  extreme  Left  to  the 
radical  nationalist  Right,  appearing  to  some  extent 
as  the  ancestor  of  today’s  xenophobic  and  anti¬ 
immigrant  Western  European  populist  parties. 
Both  French  and  U.S.  populism  of  this  second 
period  only  enjoyed  a  short  time  span.  In  this  way, 
they  introduced  a  feature  of  populist  movements 
that  remained  for  a  long  time  quite  typical — that 
is,  their  short  duration,  due  to  the  ephemeral 
excesses  of  popular  fever  generated  by  a  recurrent 
hostility  to  the  existing  ruling  elite  or  to  a  newly 
established  democratic  government  (in  Europe  of 
the  late  19th  century  in  particular).  They  also  inau¬ 
gurated  populism  as  an  unsteady  form  of  mass 
electoral  movements  cemented  by  an  orientation 
toward  the  “citizens  of  true  stock.” 

Between  1925  and  1955,  populism  made  a  third 
appearance.  This  time,  it  emerged  in  Latin  America, 
with  the  semidictatorships  of  Getulio  Vargas  in 
Brazil  and  Juan  Peron  in  Argentina  (notwithstand¬ 
ing  other  populist  governments,  e.g.,  Carlos 
Ibanez’s  in  Chile  or  General  Rojas  Pinilla’s  in 
Colombia,  alongside  their  much  more  recent  revival 
in  Hugo  Chavez’s  Venezuela  or  Evo  Morales’s 
Bolivia).  In  this  region,  populism  seems  to  revive  in 
part  the  older  tradition  of  caudillismo,  that  is,  that 
of  a  predestinate  Savior  of  the  People,  wielding 
power  by  virtue  of  Providence  against  the  lofty  and 
corrupt  oligarchy  of  big  landowners.  This  inheri¬ 
tance  did  not  prevent  Latin  American  populism 
from  becoming  the  most  accomplished  of  all  popu¬ 
list  manifestations  till  now,  a  complete  political 


regime  of  some  sort,  offering  an  actual  alternative 
to  West  European  or  North  American  representa¬ 
tive  democracy.  This  alternative  could  be  called 
“incarnate  democracy,”  considering  its  embodi¬ 
ment  of  the  Sovereign  People  by  a  charismatic 
leader.  Institutionally,  these  regimes  are  based  on 
state-controlled  labor  unions  for  the  incipient 
working  class  rather  than  on  a  single  party  inspired 
by  European  totalitarian  states.  Though  rather 
authoritarian  and  by  no  means  liberal,  they,  indeed, 
introduced  for  the  first  time  some  kind  of  political 
participation  of  the  Latin  American  masses  into  the 
political  arena  to  gain  support  from  this  popular 
base.  They  did  so  to  such  an  extent  that  they 
deserve  to  be  considered  in  some  countries  as  the 
real  forerunners  of  democracy  in  this  part  of  the 
world.  This  particularity  makes  Latin  American 
national  populist  regimes  unique. 

After  World  War  II,  other  forms  of  even  more 
dictatorial  charismatic  populism  occurred  in  East 
Asia  (Kusno  Sukarno  in  Indonesia,  Ferdinand 
Marcos  in  the  Philippines)  and  in  the  decolonized 
countries  of  Africa  (Gamal  Abdel  Nasser  in  Egypt, 
Ahmed  Sekou  Toure  in  Guinea,  and  Kwame 
N’Krumah  in  Ghana).  These  personalized  regimes 
generally  gave  birth  to  a  Third  World  populism  in 
which  the  unity  of  the  nation  was  expressed  in  a 
single  party). 

In  a  fourth  period,  populism  has  dramatically 
evolved  in  Western  Europe  during  the  past  30  years. 
Between  1980  and  2000,  it  first  resuscitated  and 
modernized  at  the  same  time  the  hitherto  customary 
form  of  factious  agitators  acting  as  outsiders  oppos¬ 
ing  the  “political  correctness”  and  hypocrisy  of 
representative  democracy.  In  Denmark  and  Norway, 
to  begin  with,  “progress”  parties  were  born  as  anti¬ 
taxation  groups  committed  to  denounce  the 
excessive  fiscal  pressure  of  the  Scandinavian  social- 
democratic  welfare  states.  Later  on,  mainly  in  Jean- 
Marie  Le  Pen’s  France,  Jorg  Haider’s  Austria, 
Christoph  Blocher’s  Switzerland,  Bruno  De  Winter’s 
and  Vlaams  Belang’s  Belgium,  or  Pirn  Fortuyn’s 
Netherlands,  this  precursory  kind  of  populism, 
whose  slogan  was  “less  state,”  turned  principally 
xenophobic.  It  depicted  the  dangers  of  an  extra- 
European  “invasion”  and  denounced  multicultural 
values  and  devotees. 

These  xenophobic  parties  now  have  toned 
down  their  discourse  in  many  cases.  Some  of  them 
at  least,  particularly  in  Scandinavia,  are  trying  to 
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get  a  better  position  inside  the  democratic  party 
system.  To  this  end,  they  do  their  best  to  touch  a 
more  sympathetic  chord  in  the  middle  classes  and 
authenticate  the  new  political  respectability  they 
are  trying  to  gain.  Simultaneously  and  more  inter¬ 
estingly,  a  number  of  established  politicians 
belonging  to  the  great  parliamentarian  parties  now 
imitate  a  part  of  the  campaign  techniques  of  the 
formerly  infamous  populists  (without  admitting  it, 
of  course).  This  is  the  great  novelty  that  makes  the 
populist  performance  becoming  commonplace 
instead  of  being  irregular,  temporary,  and  subver¬ 
sive.  In  a  first  mutation,  a  few  respectable 
politicians  claimed  to  do  so  in  order  to  fight  the 
populists  of  an  orthodox  persuasion  with  their 
own  weapons  (this  partly  was  the  case  with 
Jacques  Chirac  in  France  or  the  People’s  Party 
[Osterreichische  Volkspartei,  OVPJ  in  Austria). 
But  now,  they  are  geared  to  use  these  methods 
indiscriminately  in  order  to  face  the  competition  of 
any  other  party — whether  populist  or  not.  From 
1994  onward,  Silvio  Berlusconi  in  Italy  outper¬ 
formed  everybody  else  in  this  unprecedented  art  of 
improving  his  popularity  by  giving  up  the  conde¬ 
scension  of  old-style  politicians.  Later  on,  the  2007 
French  presidential  campaign  spectacularly  went 
along  this  path,  even  though  Tony  Blair’s  and 
Edmund  Stoiber’s  2002  electoral  campaigns  can 
be  considered  as  the  first  examples  of  this  populist 
contagion  in  Britain  and  Germany. 

Current  Concerns 

The  term  populism  has  so  often  been  misused  as 
an  insult  by  pouring  scorn  on  almost  any  politician 
capable  of  winning  an  election  that  difficulties 
arise  to  define  it  more  strictly  as  a  social  science 
concept.  If  we  do  so,  we  would  argue  that  its  core 
classical  feature  relates  to  the  strong  and  emo¬ 
tional  direct  relationship  that  exists  between  the 
mass  of  a  population  and  the  populist  charismatic 
leader — or  sometimes  a  party  as  in  Mexico  before 
2000 — who  pretends  to  embody  the  People  and 
whose  ideas  and  feelings  are  supposed  to  corre¬ 
spond  to  the  People  without  any  mediation  of 
representative  democracy.  According  to  Max 
Weber,  the  term  charisma  applies  to  a  certain  qual¬ 
ity  of  a  personality,  by  virtue  of  which  this  person 
is  treated  as  endowed  with  exceptional  powers  or 
qualities.  “We  believe  in  the  common  sense  of  the 


common  people,”  as  stated  on  page  9  of  the 
Reform  Party  of  Canada’s  Principles  and  Policies: 
The  Blue  Book  in  1991.  Populism  is  also  antipo- 
litical  inasmuch  as  it  assails  the  art  of  using  time  as 
a  resource  for  delaying  and  cooling  down  prema¬ 
ture  or  counterproductive  popular  demands,  an  art 
that  commonly  represents  the  core  technique  of 
moderate  liberal  governments.  In  this  way,  populist 
leaders  dismiss  the  “pseudodemocratic”  classical 
management  of  the  political  agenda  by  professing 
that  they  are  able  to  keep  their  promises  overnight. 
Furthermore,  Ghita  Ionescu,  Ernest  Gellner,  and 
Edward  Shils  in  particular  distinguished  a  number 
of  additional  features  for  a  description  of  populism 
in  general: 

1.  Populism  is  moralistic  rather  than 
programmatic. 

2.  Populism  is  generally  loosely  organized  and  ill 
disciplined,  a  movement  rather  than  a  party. 

3.  Populism  is  based  on  a  state  of  mind  rather 
than  on  an  explicit  doctrine  or  ideology. 

4.  Populism  is  anti-intellectual. 

5.  Populism  is  strongly  opposed  to  the 
Establishment  and  to  the  alleged  treachery  of 
the  elites,  most  commonly  with  an  anticapitalist 
pretence. 

In  addition,  populism  is  first  of  all  a  polemical 
style  of  political  communication  that  places  empha¬ 
sis  on  the  alleged  will  of  the  “True  People”  and 
that  defends  a  culture  of  decency  against  a  culture 
of  irresponsibility.  This  is  perhaps  the  reason  why 
it  increasingly  permeates  the  behavior  of  politi¬ 
cians  and  the  political  stage  of  the  old  democra¬ 
cies.  Since  for  inhabitants  of  emerging  countries 
often  a  revolution  in  practical  lifestyles  has  priority 
over  general  human  rights  and  political  freedom,  it 
seems  reasonable  to  reckon  under  such  conditions 
that  populism  still  has  a  promising  future  in  the 
postindustrial  societies  as  well  as  in  the  less  afflu¬ 
ent  parts  of  the  world. 

Guy  Hermet 
Sciences  Po 
Paris,  France 

See  also  Charisma;  Democratization;  Social  Movements 
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Positive  Peace 


Positive  peace  implies  more  than  the  absence  of 
war.  It  suggests  creating  social  relationships  that 
contribute  to  mutual  well-being  and  human  flour¬ 
ishing.  The  concept  goes  beyond  the  negative  defi¬ 
nition  of  peace  as  the  absence  of  war,  which  is 
upheld  in  international  law.  According  to  Johan 
Galtung,  the  Norwegian  sociologist  and  father  of 
peace  studies,  peace  is  not  necessarily  the  opposite 
of  war.  Indeed,  peace  and  war  are  not  necessarily 
exclusive  states.  Galtung’s  innovation  was  to  define 
negative  peace  as  the  absence  of  direct  violence  and 
positive  peace  as  the  absence  of  structural  violence 
or,  to  put  it  positively,  the  presence  of  conditions  in 
which  people  can  realize  their  potential.  This  wider 
understanding  of  peace  has  been  controversial  but 
has  fostered  an  enriched  understanding  of  the  rela¬ 
tionship  between  building  peace  and  ending  war. 

Concepts  of  Positive  and  Negative  Peace 

A  positive  understanding  of  peace  has  been  implicit 
in  the  term  for  peace  in  many  cultures.  The  Greek 
word  eirene  meant  not  only  the  cessation  of  hos¬ 
tilities  between  states  but  an  organized  common 
peace.  The  Roman  pax  meant  an  order  governed 
by  laws  and  common  citizenship.  The  Hebrew 
word  shalom  comes  from  a  root  meaning  “whole¬ 
ness”  and  has  connotations  of  well-being,  social 


peace,  and  living  in  accordance  with  God’s  will. 
The  Christian  sense  of  the  peace  of  Christ  carries 
this  forward  and  evokes  the  idea  of  the  community 
of  Christendom.  The  close  idea  of  reconciliation 
means  “to  make  whole  again.”  The  Muslim  word 
salaam  also  echoes  the  Hebrew  concept.  It  means 
peace  with  Allah,  with  one’s  neighbors,  and  one¬ 
self.  The  Indian  shanti  embraces  inner  harmony 
and  avoidance  of  harm,  while  the  Chinese  heping 
denotes  harmony  in  society  and  with  nature.  A 
positive,  though  variously  interpreted,  meaning  to 
peace  seems  to  be  part  of  our  common  heritage. 

As  Europe  became  secular  and  organized  in 
sovereign  states,  the  understanding  of  peace  began 
to  change.  At  the  domestic  level,  the  king’s  peace 
implied  a  state  of  law  and  order.  Between  states, 
relations  were  either  in  a  state  of  war  or  a  state  of 
peace.  Hugo  Grotius  elaborated  this  distinction 
and  international  law  embodied  it.  Complete 
peace  became  a  distant  goal.  While  Desiderius 
Erasmus  had  sought  to  ground  peace  and  tolerance 
between  Lutherans  and  Catholics  on  reason  and 
common  Christianity,  Immanuel  Kant  looked  to  a 
future  state  of  perpetual  peace,  outside  the  frame 
of  existing  state  relations.  Peace  required  volun¬ 
tary  consent  to  an  agreement  to  abstain  from  wars 
between  states,  on  the  basis  of  reason  and  a  moral 
imperative.  Thomas  Hobbes  similarly  saw  peace 
as  far  from  the  present  state  of  affairs.  Peace  meant 
only  the  time  when  states  were  neither  fighting  nor 
being  predisposed  to  fight.  Given  the  nature  of 
sovereignty,  this  predisposition  was  ever  present  in 
anarchic  systems,  and  so  the  normal  state  of 
Europe  was  one  of  war.  Thus,  political  theorists 
relegated  positive  peace  to  a  remote  future  in 
which  people  either  consented  to  renounce  war  or 
accepted  a  common  sovereign. 

In  the  deadly  context  of  European  wars,  how¬ 
ever,  negative  peace  seemed  enough  to  strive  for.  A 
generation  shocked  by  the  destruction  of  World 
War  I  founded  international  relations  as  an  aca¬ 
demic  inquiry  into  the  conditions  of  war  and 
peace.  They  wanted  to  understand  the  causes  of 
wars  and  find  a  basis  for  peaceful  settlement  of 
disputes  and  peaceful  change.  After  World  War  II, 
when  realists  became  dominant  in  international 
relations,  this  became  the  central  research  question 
for  peace  studies.  The  most  urgent  concern  was 
how  to  prevent  a  global  war,  wrote  the  editors  of 
the  Journal  of  Conflict  Resolution,  in  the  first  issue 
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in  1957.  Researchers  on  the  correlates  of  war 
aimed  to  reduce  the  frequency  of  wars.  Others, 
such  as  Anatol  Rapoport,  aimed  to  eliminate  war 
as  an  institution.  In  the  early  years  of  the  nuclear 
age,  when  warfare  appeared  to  threaten  human 
survival,  the  top  priority  was  to  secure  the  absence 
of  war. 

But  critical  voices  in  peace  studies  questioned 
this  emphasis  on  negative  peace.  They  pointed  to 
the  war  of  “pacification”  in  Vietnam  and  the  wide¬ 
spread  inequalities  and  injustices  between  the 
developed  and  the  developing  worlds.  Was  peace 
between  states  real  peace  if  it  rested  on  relations  of 
dominance  between  hegemonic  powers  and  their 
clients  and  coexisted  with  exploitation  of  the  poor 
by  the  rich?  Could  a  cold  peace  be  acceptable  if  it 
was  maintained  by  the  threat  of  nuclear  weapons? 
It  was  in  the  context  of  such  criticisms  that 
Galtung  launched  his  idea  of  positive  peace. 

Galtung  had  already  put  forward  a  first  notion 
of  positive  peace  in  1964,  in  his  editorial  to  the  first 
edition  of  the  Journal  of  Fence  Research.  Here,  he 
saw  negative  peace  and  positive  peace  as  two 
dimensions,  with  the  task  of  peace  research  to 
understand  progress  in  both  dimensions — both  to 
eliminate  war  and  to  secure  solidarity,  community, 
and  empathy — in  short,  “the  integration  of  human 
society.”  In  1969,  he  sharpened  this  idea  with  his 
distinction  between  direct  and  structural  violence 
and  their  opposites,  negative  and  positive  peace. 
Galtung  retained  the  idea  of  peace  as  absence  of 
violence  but  broadened  the  meaning  of  violence  to 
denote  the  difference  between  the  actual  and  the 
potential.  Any  deprivation  of  life  chances,  whether 
deliberate  or  not,  constituted  a  form  of  violence. 
Direct  violence  is  harm  directly  imposed  by  an 
actor  on  a  victim.  Indirect  or  structural  violence  is 
imposed  by  a  whole  social  system,  with  no  con¬ 
scious  intention  to  harm.  Negative  peace  is  the 
absence  of  direct  violence  and  positive  peace,  the 
absence  of  structural  violence.  Galtung  left  the  rela¬ 
tionship  between  positive  peace  and  negative  peace 
indeterminate.  His  goal  was  general  and  complete 
peace,  achieved  by  peaceful  means,  but  progress 
could  be  made  in  either  dimension  independently. 

Galtung  later  added  cultural  violence  to  direct 
and  structural  violence.  The  corresponding  state 
of  absence  of  cultural  violence  was  a  culture  of 
peace  in  which  people  regard  each  other  with 
respect  and  understanding.  Feminists  insisted  that 


an  understanding  of  positive  peace  also  needed  to 
include  equal  relations  between  men  and  women 
and  the  upholding  of  human  rights.  Thus,  positive 
peace  is  emancipation  from  oppression,  in  gender 
as  well  as  in  other  social  relations.  Others  extended 
positive  peace  to  balance  with  the  environment. 
Peace  between  people  could  hardly  be  at  the 
expense  of  other  species;  so  Gaia  peace  was  juxta¬ 
posed  with  ecological  violence.  Finally,  inner  peace 
was  included  as  the  opposite  of  violence  against 
oneself.  All  of  this  gave  a  rich  spectrum  of  mean¬ 
ings  to  positive  peace  but  at  the  same  time  made  its 
achievement  an  all-encompassing  task. 

Critique  of  Galtung’ 's  Concept  of  Positive  Peace 

The  main  critique  of  Galtung’s  idea  of  positive 
peace  came  from  the  British  political  scientist  Peter 
Lawler.  Lawler  argued  that  Galtung’s  definition  of 
structural  violence  was  intrinsically  value  laden 
and  devoid  of  substance.  It  could  be  taken  to  mean 
anything  one  dislikes.  To  define  peace  as  the  full 
realization  of  human  potential  and  the  absence  of 
war  was  tantamount  to  making  the  study  of  peace 
a  quest  for  nirvana.  Instead,  he  argued,  it  was  more 
important  to  focus  on  direct  violence  and  to  see 
peace  studies  as  a  way  of  articulating  wider  devel¬ 
opments  in  social  and  political  theory  with  the  nar¬ 
rower  analysis  of  violence  and  its  conditions. 
Lawler  argued  that  the  concept  of  positive  peace 
made  peace  studies  “a  black  hole”  from  which 
nothing  could  be  excluded  and  left  it  with  no  dis¬ 
tinctive  identity  or  purpose.  The  economist  and 
peace  researcher  Kenneth  Boulding  made  a  similar 
point  when  he  argued  that  poverty  and  violence 
were  distinct  phenomena  with  different  causes.  It 
blurred  the  distinction  between  them  to  extend  the 
meaning  of  violence  to  cover  both.  The  Peace 
Research  Institute  at  Oslo  (PRIO),  which  Galtung 
had  founded,  resisted  his  broad  agenda  and  contin¬ 
ued  to  focus  on  the  causes  and  conditions  of  wars. 

Positive  peace  has  remained  controversial  and  is 
interpreted  in  different  ways.  Critical  theorists 
argue  that  peace  depends  on  a  form  of  emancipa¬ 
tion  that  must  include  liberation  from  discourses 
based  on  power  and  hegemony.  Peace  is  constituted 
by  an  intersubjective  dialogue  in  which  the  Other  is 
included.  Some  poststructuralists  see  it  as  a  plural¬ 
istic  encounter  through  agonistic  dialogue  between 
actors  who  recognize  and  accept  their  differences. 
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The  Way  Forward 

The  notions  of  positive  and  negative  peace  had  a 
strong  influence  on  the  literature  on  peace  build¬ 
ing,  which  combines  the  aim  of  avoiding  a  return 
to  war  with  the  need  for  new  institutions,  relation¬ 
ships,  and  structures  on  which  to  found  a  durable 
peace.  It  was  clear  that  effective  peace  building 
could  not  be  limited  to  the  repression  of  fighting 
but  must  involve  the  creation  of  positive  associa¬ 
tions  and  healing  of  social  injustices. 

This  highlights  the  importance  of  understanding 
the  relationship  between  steps  toward  positive 
peace  and  the  ending  or  prevention  of  wars,  which 
returns  to  the  earlier  preoccupation  with  the 
causes  and  conditions  of  wars.  Most  schools  of 
thought  in  international  relations  accept  that  peace 
must  be  more  than  the  absence  of  war.  Even  real¬ 
ists  emphasize  the  importance  of  establishing  a 
stable  and  legitimate  international  environment 
that  reflects  the  realities  of  power  but  allows  for 
peaceful  change.  The  issue  that  divides  different 
schools  in  international  relations  is  what  condi¬ 
tions  are  necessary  for  durable  peace — or  to  put  it 
in  other  terms,  what  the  constituents  of  positive 
peace  really  are. 

A  way  forward  beyond  the  terminological  dis¬ 
agreements  is  to  consider  the  idea  of  peace  not  as 
a  state  but  as  a  process  in  which  initiatives  or 
“seeds  of  peace”  have  peaceful  consequences.  The 
philosopher  Gray  Cox  suggested  that  peace  is  a 
practice  of  cultivating  agreements.  Adam  Curie, 
Professor  of  Peace  Studies  at  Bradford  University, 
argued  that  peace  involves  the  coming  together  of 
people  to  achieve  goals  they  could  not  have 
achieved  separately.  In  this  sense,  positive  peace  is 
a  process  in  which  responses  to  other  people  and 
to  suffering  lead  to  greater  empathy,  cooperation, 
and  solidarity  (following  Galtung’s  1964  defini¬ 
tion).  If  this  becomes  self-sustaining,  it  constitutes 
a  form  of  peace  that  can  be  created,  even  at  times 
of  war.  Such  associative  and  integrative  processes 
also  limit  the  incidence  of  violence.  Seen  in  these 
terms,  the  relationship  between  peaceful  processes 
and  the  ending  of  war  is  a  matter  for  research  and 
analysis  and  not  only  a  matter  of  definition. 

Hugh  Miall 
University  of  Kent 
Canterbury,  United  Kingdom 
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Positivism 


The  concept  of  positivism  has  had  different 
careers  in  the  philosophy  of  science  and  in  the 
social  sciences  and  political  science.  Whereas  in 
the  philosophy  of  science,  it  is  often  used  to 
denote  a  specific  school  in  the  history  of  philo¬ 
sophical  thought  of  its  epoque,  the  concept  is  still 
used  in  the  social  sciences  to  refer  to  a  broad  range 
of  theses,  attitudes,  and  dispositions  generally 
associated  with  the  scientific  method.  As  a  school 
in  the  history  of  philosophical  thought,  positivism 
is  usually  characterized  by  its  research  of  the  iden¬ 
tification  and  explanation  of  universal  laws  in 
unity,  or  in  conformity,  within  the  practice  of  the 
natural  sciences  and  especially  physics.  These  laws 
should  be  discovered  not  through  induction  but 
by  theoretically  oriented  experimentations.  In  the 
social  sciences,  there  is  a  tendency  among  some 
tenants  and  critics  of  positivism  to  characterize  it 
as  a  homogeneous  epistemological  tradition.  This 
entry  nuances  this  social  representation. 

Auguste  Comte  and  the 

Early  Positivist  Program 

Many  thinkers  (Francis  Bacon  [1561-1626J, 
Madame  de  Stael  [1766-1817J,  Henri  de  Saint- 
Simon  [1760-1 825J)  have  been  associated  with 
different  philosophical  and  epistemological  posi¬ 
tions  later  associated  with  positivism.  However,  it 
is  generally  the  French  sociologist  Auguste  Comte 
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(1798-1857)  who  is  regarded  as  the  father  of 
positivism.  In  the  history  of  philosophical  thought, 
late- 19th-century  positivism  was  a  theoretical 
movement  in  reaction  to  different  branches  of  ide¬ 
alisms,  Kantism,  and  Hegelianism,  to  which  it 
proposed  a  greater  emphasis  on  empirical  knowl¬ 
edge.  In  his  Cours  de  philosophie  positive  (The 
Course  in  Positivist  Philosophy ,  1830-1842), 
Comte  described  his  positivist  perspective  as  an 
approach  seeking  to  ground  theoretical  models  on 
empirical  knowledge  and  observations.  Once 
acquired  through  cognitive  processes,  knowledge 
had  to  be  verified  through  empirical  testing. 
Empiricism,  therefore,  was  brought  back  to  the 
forefront  of  epistemology.  To  the  extent  that  it 
was  led  by  an  adequate  conception  of  science,  un 
esprit  positif  (a  positive  spirit),  Comte  was  tremen¬ 
dously  confident  as  regards  the  future  of  modern 
societies.  His  writings  were  no  stranger  to  a 
national  context  that  had  celebrated  a  culte  de  la 
raison  (cult  of  reason)  after  the  Revolution.  One 
could  still  find  a  similar  optimism  in  the  work  of 
Marie  Curie  (1867-1934). 

Comte’s  philosophical  project  was  much  broader 
than  a  mere  philosophical  or  epistemological  inter¬ 
vention.  Following  Stael  and  many  of  his  contem¬ 
poraries,  Comte’s  conception  of  science  was 
embedded  in  a  teleological  philosophy  of  history 
where  the  triumph  of  science  was  linked  to  the 
triumph  of  humanity.  The  core  of  his  philosophy 
of  history  comprised  the  description  of  three  evo¬ 
lutionary  stages:  theological,  metaphysical,  and 
positivist.  With  the  last  stage,  humanity  was  seen 
as  realizing  its  full  cognitive  and  political  potential 
with  the  elimination  of  anterior,  primitive,  or  chi¬ 
merical  schemes  of  cognition.  Positivism’s  political 
project  was  also  conceived  as  an  antidote  to  social 
“disorder,”  the  latter  being  an  important  preoc¬ 
cupation  for  the  emerging  social  sciences  of  the 
late  19th  century.  Comte’s  espousal  of  a  form  of 
social  evolutionism  was  not  unusual  in  this  con¬ 
text.  Social  Darwinists  such  as  Herbert  Spencer 
(1820-1903)  had  a  similar  conception  of  history, 
and  so  did  Karl  Marx  (1818-1883),  Ferdinand 
Tonnies  (1855-1936),  and  Max  Weber  (1864- 
1920),  who  adopted  a  teleological  reading  of  his¬ 
tory,  even  though  they  did  not  locate  the  motor  of 
history  in  the  same  social  processes.  Comte  spent 
the  end  of  his  life  trying  to  build  a  religion  of 
humanity  in  accordance  with  positivist  principles. 


Historically,  positivists  have  been  accurately 
depicted  as  the  tenants  of  an  opaque  opposition 
between  sentences  of  fact  and  sentences  of  value. 
Social  and  political  scientists,  according  to  the  ten¬ 
ants  of  this  opposition,  should  hold  back  from 
deriving  normative  propositions  from  factual  ones. 
Positivists,  however,  have  not  been  alone  in  defend¬ 
ing  this  standpoint.  In  the  philosophy  of  social 
sciences,  the  neo-Kantian  sociology  of  Max  Weber 
shares  it  without  agreeing  with  an  epistemological 
position  hostile  to  the  development  of  interpreta¬ 
tive  strategies  in  the  social  sciences. 

Another  important  posture  generally  associated 
with  positivism  is  the  adhesion  to  the  project  of  a 
unified  scientific  program,  or  the  defense  of  a  unity 
of  science.  As  a  philosophy  of  the  social  sciences, 
therefore,  positivism  is  usually  associated  with  a 
commitment  to  either  methodological  naturalism 
or  ontological  naturalism.  While  methodological 
naturalists  argued  that  the  social  sciences  should 
adopt  the  same  methods  as  the  natural  sciences, 
ontological  naturalists,  or  physicalists,  defended 
the  naturalization  of  the  social  sciences.  Through 
the  20th  century,  different  variants  of  this  program 
or  project  have  been  formulated  via  different  epis¬ 
temological  or  ontological  positions,  not  all  of 
which  were  enunciated  by  positivists. 

Toward  the  dawn  of  the  19th  century  and  the 
beginning  of  the  20th  century,  the  optimistic  phi¬ 
losophy  and  teleology  of  history  at  the  core  of 
Comte’s  Cours  de  philosophie  positive  had  been 
given  up  by  several  European  practitioners  of 
positivist  methods  and  other  philosophical  cur¬ 
rents  such  as  romanticism  and  phenomenology. 
The  German  philosopher  and  historian  Wilhelm 
Dilthey  (1833-1911)  is  among  the  important  fig¬ 
ures  to  have  defended  an  antipositivist  dualism 
between  the  human  and  natural  sciences  in  this 
context.  Among  other  things,  Dilthey  developed 
two  arguments  often  found  in  the  camp  of  the  ten¬ 
ants  of  such  a  dualism:  (1)  the  rejection  of  evolu¬ 
tionism  as  an  adequate  prism  to  analyze  social 
processes  and  (2)  an  emphasis  on  the  category  of 
meaning  and  the  use  of  interpretative  strategies  in 
the  social  sciences. 

The  often  heard  claim  that  the  triumphalist  con¬ 
ception  of  social  and  cognitive  progress  was  a 
shared  feature  of  modernity  in  general  should  be 
nuanced.  Positivism  was  an  important  chapter  of 
the  intellectual  history  of  the  19th  century.  Yet  it 
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was  fostered  by  specific  social  and  intellectual  con¬ 
texts  that  do  not  represent  a  complete  picture  of 
the  contradictory  intellectual  trends  of  the  modern 
era.  It  remained  influential  and  a  source  of  politi¬ 
cal  inspiration  in  parts  of  the  world  where  local 
liberal  movements  sought  to  overcome  traditional 
religious  elites  by  importing  France’s  institutions. 

The  Vienna  Circle 

In  1929,  a  handful  of  scientists  and  philosophers 
of  science  (Moritz  Schlick  [1882—1 936J,  Rudolf 
Carnap  [1891-1970J,  Otto  Neurath  [1882-1945J) 
published  a  manifesto  titled  Wissenschaftliche 
Weltauffassung:  Der  Wiener  Kreis  (A  Scientific 
Conception  of  the  World:  The  Vienna  Circle).  The 
manifesto  became  a  central  artifact  in  the  history 
of  positivism  and  the  platform  of  logical  positiv¬ 
ists.  At  the  confluence  of  the  empirical  tradition 
and  the  analytical  philosophy  of  Bertrand  Russell, 
logical  positivism  was  characterized  by  three  ele¬ 
ments:  (1)  its  critique  of  speculative  metaphysics, 
(2)  its  quest  for  a  foundation  of  empirical  sciences, 
and  (3)  the  project  of  unifying  the  scientific  lan¬ 
guage  (Robert  Nadeau,  2000). 

Logical  positivists  adhere  to  a  theory  of  mean¬ 
ing  grounding  the  meaningfulness  of  a  sentence  on 
its  empirical  conditions  of  verification.  As  a  corol¬ 
lary,  all  propositions  whose  conditions  of  truthful¬ 
ness  could  not  be  described  were  considered 
“meaningless.”  This  argument  was  notoriously 
put  forward  by  Rudolf  Carnap  (1891-1970)  in 
The  Elimination  of  Metaphysics  Through  Logical 
Analysis  of  Language  (1959).  This  position  was  a 
declaration  of  war  against  Martin  Heidegger,  his 
disciples,  and  idealists  in  general.  During  the  fol¬ 
lowing  decades,  most  of  the  core  elements  associ¬ 
ated  with  logical  positivism  were  contested  in  turn 
by  its  former  exponents  or  by  its  heirs.  Carl  Gustav 
Hempel  (1905-1997)  explicitly  renounced  the 
program  of  a  scientific  unity  of  science.  He  gave  up 
the  belief  that  specific  theories  could  be  captured 
through  the  same  analytical  vocabulary  as  broader 
general  theories  (ontological  naturalism). 

Even  though  the  tenants  of  logical  positivism 
were  against  the  idea  that  science  was  morally  or 
politically  laden,  several  signatories  of  the  manifesto 
believed  that  the  struggle  against  metaphysics  was 
entirely  reconcilable  with  socialist  political  views. 
Therefore,  while  logical  positivism  is  associated 


with  a  reaffirmation  of  the  fact-value  dichotomy, 
in  practice  several  members  of  the  circle  were 
socialists  and  critics  of  the  characteristically  meta¬ 
physical  ethos  of  the  German  and  Austrian  nation¬ 
alist  jargons.  Unsurprisingly,  the  Circle  was  banned 
by  the  Nazis  in  1938. 

The  Rejection  of  Verificationism, 

Popper,  and  Critical  Rationalism 

Neo-positivism,  or  critical  realism,  is  an  epistemo¬ 
logical  posture  that  was  developed  by  Karl  Popper 
(1902-1994)  as  a  critique  of  some  of  the  weak¬ 
nesses  of  the  logical  positivism  associated  with  the 
Vienna  Circle.  Among  other  things,  it  sought  to 
address  the  problem  of  induction,  or  raw  empiri¬ 
cism,  unsatisfactorily  dealt  with  by  the  logical 
positivist  theory  of  verification.  Popper  revolution¬ 
ized  the  positivist  tradition  by  adopting  the  prin¬ 
ciple  of  refutation,  or  falsifiability,  instead  of  the 
principle  of  verification.  This  epistemological 
model  distinguishes  theoretical  from  experimental 
laws.  Theoretical  laws  are  formulated  as  universal 
laws.  They  are  of  a  higher  level  of  abstraction. 
Experimental  laws  direct  how  to  carry  out  an 
experiment  so  that  it  leads  to  an  eventual  valida¬ 
tion  or  invalidation  of  the  theory’s  predictions.  A 
theory  is  articulated  by  means  of  an  ensemble  of 
interlinked  propositions,  the  theoretical  laws,  via 
logical  connectors.  Theoretical  laws  can  only  be 
put  to  the  test  through  the  experimental  laws  that 
are  derived  from  them.  To  be  falsifiable,  a  theory 
must  be  potentially  false.  That  is,  there  must  be 
potential  empirical  cases,  the  observation  or  occur¬ 
rence  of  which  would  go  against  the  predictions 
provided  by  the  theoretical  frame.  If  it  is  not  pos¬ 
sible  to  point  to  a  fact  whose  occurrence  would 
mean  the  invalidity  of  the  theory,  the  theory  is  not 
falsifiable  and  is  thus  not  part  of  the  scientific 
domain.  According  to  Popper,  one  cannot  prove 
that  a  hypothesis  has  been  verified  as  true;  one  can 
only  contend  that  it  has  yet  not  been  falsified.  For 
instance,  the  empirical  observation  of  one  white 
swan  does  not  prove  the  conjecture  that  all  swans 
are  white.  On  the  contrary,  this  proposition  would 
be  refuted  by  the  occurrence  of  a  black  swan.  To 
“test”  this  theory  is  a  question  of  looking  for  a 
nonwhite  swan  rather  than  accumulating  the 
occurrences  of  white  swans.  As  the  author  of  The 
Logic  of  Scientific  Discovery  puts  it,  “Now  I  hold 
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that  scientific  theories  are  never  fully  justifiable  or 
verifiable,  but  that  they  are  nevertheless  testable.  I 
shall  therefore  say  that  the  objectivity  of  scientific 
statements  lies  in  the  fact  that  they  can  be  inter- 
subjectively  tested”  (Popper,  1992,  p.  22). 

Science,  therefore,  according  to  Popper  had  to 
be  conceived  like  a  never-ending  process  of 
attempts  to  falsify  conjectures.  Accordingly,  a  sci¬ 
entific  theory  had  to  be  considered  “true,”  along  a 
regulatory  concept  of  truth,  until  it  had  been  defi¬ 
nitely  falsified.  He  attributes  a  character  that  is  in 
principle  objective,  universal,  and  inter  subjective 
to  the  results  of  research,  insofar  as  each  subject 
must  be  able  to  achieve  the  same  results  through 
experimentation.  The  principle  of  falsifiability 
remained  consistent  with  the  positivist  objective  of 
achieving  a  unity  of  science.  Here,  the  ground  of 
this  unity  was  not  ontological  naturalism,  or 
physicalism,  but  methodological  naturalism.  Thus, 
the  unity  of  science  resides  in  a  specific  methodol¬ 
ogy:  falsificationism.  It  relies  on  a  processual  con¬ 
ception  of  truth  rather  than  a  physicalist  ontology. 
As  a  philosopher  of  the  social  sciences,  Popper 
thought  that  sociology  was  the  discipline  more  apt 
to  achieve  such  a  methodological  unity.  He  was 
suspicious  of  behaviorism  in  psychology,  which  he 
deemed  incompatible  with  his  epistemology. 

Popper  was  not  an  apolitical  animal.  He  devel¬ 
oped  a  conception  of  liberalism  in  continuity 
with  his  epistemology  and  with  liberal  theories  of 
totalitarianism.  His  feelings  for  Marxism  were 
the  complete  opposite  of  those  of  some  members 
of  the  Vienna  Circle  and  especially  from  those  of 
the  sociologist  Otto  von  Neurath.  In  The  Open 
Society  and  Its  Enemies ,  published  in  1945, 
Popper  argued  that  Plato,  Hegel,  and  Marx  had 
formulated  proto-totalitarian  philosophical  sys¬ 
tems.  The  core  philosophical  assumptions  of  their 
political  thought,  he  argued,  were  teleological 
and  could  not  be  submitted  to  falsifications. 
Hegel,  here,  was  presented  as  an  inspiration  of 
National  Socialism.  It  is  generally  admitted  today 
that  there  was  something  naive  and  polemical  in 
Popper’s  crusade  against  these  three  pillars  of 
Western  philosophical  thought,  not  only  because 
few  research  programs  in  the  social  sciences 
would  survive  the  criteria  put  up  front  by  Popper’s 
philosophy  of  science  but  also  because  some  have 
raised  serious  doubts  regarding  the  adequacy  of 
Popper’s  philosophy  of  science  to  describe  the 


practices  of  natural  science  itself  (Thomas  Kuhn, 
1996).  Critiques  of  the  adoption  of  Popper’s 
standards  in  the  social  and  political  sciences  have 
argued  that  whereas  in  the  natural  sciences  there 
are  situations  where  only  one  research  program  is 
in  a  position  of  quasi  monopoly  in  a  given  disci¬ 
pline,  thus  creating  the  conditions  of  possibility 
of  a  period  of  normal  science,  in  the  social  and 
political  sciences,  it  is  very  unlikely  that  only  one 
theoretical  paradigm  could  dominate  the  entire 
discipline.  Social  and  political  scientists  have 
come  to  accept  that  this  theoretical  pluralism  is 
an  important  characteristic,  rather  than  a  weak¬ 
ness,  of  their  discipline. 

Neo-positivism  had  its  hours  of  glory  after  the 
translation  and  publication  in  English  of  The  Logic 
of  Scientific  Discovery  in  1959.  More  important,  it 
dominated  the  perceptions  that  some  social  scien¬ 
tists  had  of  the  natural  sciences  until  even  later. 
When  the  international  relations  theoretician  and 
father  of  neorealism,  Kenneth  Waltz  (1924-  ) 
formulated  his  critique  of  induction  and  of  the 
criterion  of  predictability  in  the  literature  on  inter¬ 
national  relations,  he  was  very  close  to  Popper’s 
conception  of  science.  In  the  field  of  Anglo-American 
international  relations,  the  multiplication  of  con¬ 
tending  approaches  to  the  mainstream  of  the  disci¬ 
pline  was  labeled  as  a  “postpositivist”  era  by  several 
observers  (Steve  Smith,  Ken  Booth,  &  Marysia 
Zalewski,  1996).  Most  of  the  time,  what  was  at 
stake  was  less  the  rejection  of  positivism,  as  tradi¬ 
tionally  understood  in  the  philosophy  of  science, 
than  the  rejection  of  a  narrowly  conceived  defini¬ 
tion  of  the  discipline  and  its  methodology  by  neo¬ 
realism  and  institutional  neoliberalism. 

From  the  Positivismusstreit  to  Theoretical 
Pluralism  in  the  Social  Sciences 

While  the  end  of  the  positivist  hegemony  in  the 
social  sciences  is  often  associated  with  the  rise  of 
postmodernism,  it  would  be  more  accurate  to 
locate  it  within  the  debate  notoriously  labeled  the 
positivismusstreit ,  or  the  German  dispute  in  social 
sciences  (1961-1969).  During  the  dispute,  Popper’s 
conception  of  the  philosophy  of  the  social  sciences, 
critical  rationalism,  was  challenged  by,  among 
others,  Theodor  Adorno,  Ralf  Dahrendorf,  and 
Jurgen  Habermas.  It  is  not  possible  here  to  sum¬ 
marize  the  block  of  arguments  that  were  raised 
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against  Popper  in  the  context  of  this  epic  disagree¬ 
ment.  They  often  came  from  different,  and  contra¬ 
dictory,  traditions  of  the  social  sciences  from 
scholars  of  different  political  inclinations.  What 
matters,  however,  is  that  most  of  the  epistemo¬ 
logical  positions  that  were  injected  in  the  dispute 
were  developed  and,  sometimes,  radicalized  during 
the  1970s  and  1980s.  Since  then,  there  has  virtu¬ 
ally  been  no  pause  to  the  development  of  new 
arguments  in  defense  of  historicism,  of  interpreta¬ 
tive  strategies  or  hermeneutics  (Hans-Georg 
Gadamer,  2004),  of  standpoint  epistemologies 
(Sandra  Harding,  2004),  of  a  critique  of  the  rela¬ 
tion  between  positivism  and  social  engineering  and 
power  (Michel  Foucault,  1980),  or  of  a  more 
reflexive  approach  to  the  social  sciences  (Pierre 
Bourdieu,  2004).  Moreover,  most  epistemologists 
and  methodologists  have  abandoned  the  idea  that 
to  practice  good  science,  social  and  political  scien¬ 
tists  should  imitate  the  natural  sciences.  For  a  few 
decades,  one  of  the  epistemological  controversies 
emerging  in  political  science  has  been  to  what 
extent  political  scientists  should  borrow  the 
methods  and  models  of  the  economists,  not  of 
physicists. 

One  of  the  central  critiques  of  the  covering  law 
model  often  associated  with  tenants  of  positivism 
is  that  it  cannot  escape  reifying  the  observable 
regularity  of  a  specific  historical  context  resulting 
from  petrified  relations  of  power.  What  if,  for 
instance,  the  concepts  of  state,  nation,  sovereignty, 
or  power  meant  very  different  things  in  different 
sociohistorical  contexts?  What  if  the  theoretical 
hypotheses  of  a  general  covering  law  model  in  the 
social  sciences  were  guiding  social  researches  in 
systematically  ahistorical  directions  (Habermas, 
1988)?  And  what  if  these  intellectual  practices  had 
the  impact  of  silencing,  marginalizing,  or  negating 
alternative  social  experiences  and  modes  of  being? 
Are  positivists  not  guilty  of  an  implicit  favoritism 
in  favor  of  a  defense  of  a  given  social  order?  These 
are  some  of  the  questions  that  recent  scholarship 
by  critical  theorists,  postmodernists,  feminists,  and 
postcolonial  scholars  has  raised. 

One  should  note  that  the  critique  of  positivism 
has  been  especially  important  during  the  1980s  and 
1990s  in  the  subfields  of  the  discipline  of  political 
science  where  the  use  of  quantitative  methods  had 
become  synonymous  with  real  science.  The  tradi¬ 
tion  of  American  international  relations,  for 


instance,  is  one  subfield  where  the  critics  of  posi¬ 
tivism  have  been  notoriously  active  in  the  1980s 
and  1990s.  In  the  field  of  development  studies, 
researchers  working  with  the  dependency  theory 
have  been  extremely  skeptical  regarding  the  appli¬ 
cation  of  ahistorical  and  nation-centered  models  of 
economic  and  political  developments  along  the 
lines  of  modernization  theory.  Among  other  things, 
they  have  argued  that  the  idea  that  economic  and 
political  development  can  be  reproduced  with  the 
precision  of  a  laboratory  experiment  was  a  prob¬ 
lematic  remnant  of  a  positivist  conception.  Against 
this  conception,  they  have  argued  that  economic 
and  political  institutional  developments  had  to 
study  within  the  broader  context  of  the  global  his¬ 
torical  development  of  capitalism,  where  specific 
national  institutional  developments  never  occurred 
in  national  conceits. 

Today,  most  philosophers  of  science  have  given 
up  the  concept  of  positivism.  The  different  lines  of 
demarcation  that  several  philosophers  of  science 
have  historically  tried  to  draw  between  science  and 
nonscience  have  not  proven  decisive.  Among  the 
core  arguments  underlying  a  certain  conception  of 
positivism,  the  claim  that  there  could  be  a  clear 
line  of  demarcation  between  judgments  of  facts 
and  judgments  of  value  has  been  under  heavy  fire 
(Hilary  Putnam,  2002).  Some  philosophers  of  sci¬ 
ence  and  social  scientists,  notoriously  scientific 
realists,  have  also  challenged  the  positivist  empha¬ 
sis  on  directly  observable  entities,  leading  to  an 
inversion  of  the  hierarchy  between  ontology  and 
epistemology  (Roy  Bhaskar,  1989).  The  former 
argued  that  if  one  had  to  rely  only  on  a  methodol¬ 
ogy  with  an  emphasis  on  direct  observation,  one 
would  have  to  reject  the  validity  of  new  scientific 
territories  where  the  progress  of  knowledge 
depends  on  the  observation  of  the  effects  of  unob¬ 
servable  entities  on  other  entities. 

Even  though  positivism  does  not  have  the  intel¬ 
lectual  authority  that  it  once  had,  one  cannot 
conclude  from  what  precedes  that  the  positivist 
tradition  did  not  have  an  important  impact  on 
contemporary  research  practices  and  method¬ 
ological  dispositions  in  social  and  political  sci¬ 
ences.  Most  practitioners  of  political  science  are 
still  working  along  standards  that  were  developed 
by  the  positivist  tradition.  For  instance,  few  prac¬ 
titioners  would  give  up  the  idea  that  the  theoretical 
laws  of  a  theory  have  to  be  internally  consistent. 
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Furthermore,  practitioners  of  the  social  or  politi¬ 
cal  sciences  by  and  large  agree  that  a  theory  that 
fails  to  find  support  in  empirical  evidence  should 
be  either  revised  or  abandoned.  In  addition,  the 
argument  that  the  result  of  social  research  should 
be  accessible  through  intersubjective  processes 
remains  a  central  claim  held  by  the  majority  of 
social  and  political  scientists.  Along  this  line, 
while  most  tenants  of  reflexive  social  theory 
would  renounce  the  quest  for  an  opaque  line  of 
demarcation  between  judgments  of  fact  and  judg¬ 
ments  of  value,  they  tend  to  agree  that  judgments 
of  value  should  be  submitted  to  a  similar  intersub- 
jectively  oriented  ethic  of  discussion  rather  than 
judgments  of  facts.  However,  the  early  positivist 
claim  that  the  development  of  science  and  technol¬ 
ogy  will  necessarily  lead  to  the  achievement  of  a 
greater  humanity  has  been  thoroughly  challenged 
by  the  history  of  the  20th  century. 

Frederick  Guillaume  Dufour 
Universite  du  Quebec  a  Montreal 
Montreal,  Quebec,  Canada 

See  also  Behavioralism;  Causality;  Comparative  Methods; 
Critical  Theory;  Epistemic  Communities; 
Epistemological  and  Methodological  Foundations; 
Experiments,  Laboratory;  Experiments,  Natural; 
Hermeneutics;  Measurement;  Paradigms  in  Political 
Science;  Prediction  and  Forecasting;  Quantitative 
Versus  Qualitative  Methods;  Statistics:  Overview 

Further  Readings 

Adorno,  T.  W.,  Albert,  H.,  Dahrendorf,  R.,  Habermas,  J., 
Pilot,  H.,  &  Popper,  K.  R.  (1976).  The  positivist 
dispute  in  German  sociology.  New  York:  Harper  & 
Row. 

Bhaskar,  R.  (1989).  Reclaiming  reality:  A  critical 
introduction  to  contemporary  human  sciences. 

London:  Verso. 

Bourdieu,  P.  (2004).  Science  of  science  and  reflexivity. 

Chicago:  University  of  Chicago  Press. 

Carnap,  R.  (1959).  The  elimination  of  metaphysics 
through  logical  analysis  of  language.  In  A.  J.  Ayer 
(Ed.),  Logical  positivism  (pp.  60-81).  New  York:  Free 
Press. 

Foucault,  M.  (1980).  Power/knowledge.  New  York: 
Harvester  Press. 

Gadamer,  H.-G.  (2004).  Truth  and  method.  New  York: 
Continuum. 


Habermas,  J.  (1988).  On  the  logic  of  the  social  sciences. 
Cambridge:  MIT  Press. 

Harding,  S.  (Ed.).  (2004).  Feminist  standpoint  theory 
reader.  New  York:  Routledge. 

Kuhn,  T.  (1996).  The  structure  of  scientific  revolutions 
(3rd  ed.).  Chicago:  University  of  Chicago  Press. 
Nadeau,  R.  (2000).  Vocabidaire  technique  et  analytique 
de  Vepistemologie  [Technical  and  analytical 
vocabulary  of  epistemology].  Paris:  Presses 
Universitaires  de  France. 

Popper,  K.  R.  (1971).  The  open  society  and  its  enemies 
(Vols.  1-2).  Princeton,  NJ:  Princeton  University  Press. 
Popper,  K.  R.  (1992).  The  logic  of  scientific  discovery. 
New  York:  Routledge. 

Putnam,  H.  (2002).  The  collapse  of  the  fact-value 
dichotomy  and  other  essays.  Cambridge,  MA: 
Harvard  University  Press. 

Smith,  S.,  Booth,  K.,  &  Zalewski,  M.  (Eds.).  (1996). 
International  theory:  Positivism  and  beyond. 
Cambridge,  UK:  Cambridge  University  Press. 

Waltz,  K.  (1979).  Theory  of  international  politics.  New 
York:  McGraw-Hill. 


Postcolonialism 


There  is  no  consensus  regarding  what  the  term 
postcolonial  connotes;  indeed,  both  the  range  of 
meaning  of  the  term  and  of  the  issues  covered  by  it 
are  immense.  However,  despite  the  contestations 
around  the  range  of  meaning  and  issues,  certain 
fundamental  assumptions  are  subscribed  to  postco¬ 
lonialism.  First,  postcolonialism  suggests  that  the 
colonial  juncture  played  a  critical  role  in  the  shap¬ 
ing  of  the  modern  world.  Second,  although  there 
have  been  both  continuities  and  ruptures  from  the 
world  of  classical  colonialism,  its  presence,  lega¬ 
cies,  and  hierarchies  still  endure  in  the  modern 
world.  It  is  thus  important  to  note  that  the  post  in 
postcolonialism  does  not  allude  to  an  era  in  which 
colonialism  is  dead.  Nor  does  it  imply  an  uninter¬ 
rupted  classical  colonialism.  Rather,  postcolonial¬ 
ism  suggests  that  the  modern  world  exemplifies 
both  historical  continuities  and  ruptures  with  clas¬ 
sical  colonialism.  This  entry  provides  an  introduc¬ 
tion  to  the  ways  in  which  these  complex  shifts  and 
continuities  manifest  themselves  in  the  variety  of 
meanings  assigned  to  postcolonialism,  its  geneal¬ 
ogy,  and  the  criticisms  leveled  against  it.  Finally, 
it  discusses  the  usefulness  of  postcolonialism 
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for  understanding  the  politics  of  the  modern 
world. 

Meanings,  Controversies,  and  Criticisms 

Although  it  is  unusual  to  begin  the  discussion  of  a 
topic  with  the  controversies  and  criticisms  sur¬ 
rounding  it,  highlighting  at  least  three  controver¬ 
sies  that  haunt  postcolonialism  will  help  clarify 
what  postcolonialism  means.  The  first  controversy 
focuses  on  the  dissimulations  and  the  political 
acceptability  of  the  term  postcolonial  in  Western 
academic  circles.  Some  scholars  suggest  that  the 
prefix  post  in  postcolonial  implies  that  we  are  now 
past  the  era  of  colonialism,  global  dominance,  and 
hegemony.  The  term  is  thus  preferred  and  has 
gained  currency  in  Western  academia  because  it  is 
politically  less  threatening  than  terms  such  as  neo¬ 
colonialism,  imperialism,  and  the  Third  World. 
Postcolonial  scholars  have  challenged  this  miscon¬ 
ception  and  stated  in  no  uncertain  terms  that  post¬ 
colonial  does  not  imply  the  end  of  colonization: 
Rather,  postcolonialism  enables  us  to  examine 
both  the  historical  continuities  and  ruptures  with 
classical  colonialism. 

The  second  controversy  raises  questions  regard¬ 
ing  the  spatial  parameters  of  postcolonialism. 
Should  all  societies  that  were  once  colonized  be 
included  under  the  umbrella  term  postcolonial ? 
Should  the  United  States,  Australia,  and  South 
Africa,  which  had  an  experience  of  being  both  colo¬ 
nized  and  colonizers,  be  included  with  other  post¬ 
colonial  societies  such  as  India,  Senegal,  and  Brazil? 
In  other  words,  are  there  specific  features  of 
the  “truly”  postcolonial  countries  that  help  us  dis¬ 
tinguish  them  from  non-postcolonial  ones,  or  is 
postcolonial  a  catchall  word?  On  this  issue,  some 
postcolonial  scholars  suggest  that  while  it  is  impor¬ 
tant  to  discuss  the  differences  in  the  particular  colo¬ 
nial  histories,  all  colonizing  practices,  including  the 
contemporary  colonizing  practices  of  nation-states, 
can  be  included  under  the  term  postcolonial. 

The  third  controversy  focuses  on  the  allegedly 
culturalist  focus  of  the  postcolonialists  and  their 
neglect  of  materialist  concerns.  Critics  accuse  post¬ 
colonial  scholars  of  neglecting  traditional  Marxist 
issues  such  as  capitalism,  poverty,  and  inequality 
and  focusing  too  much  on  issues  of  concern  to  the 
poststructuralists,  such  as  representation,  identity, 
and  culture.  While  some  postcolonial  scholars  are 


guilty  of  focusing  solely  on  culturalist  concerns, 
most  suggest  that  culturalist  and  material  issues  are 
mutually  embedded.  For  them,  isolating  materialist 
from  culturalist  concerns  would  be  a  serious  ana¬ 
lytical  shortcoming  of  any  scholarship.  For  exam¬ 
ple,  scholars  like  Edward  Said,  demonstrating  the 
cultural  underpinnings  of  classical  imperialism, 
have  suggested  that  the  success  of  imperialism 
would  not  have  been  possible  without  the  dissimu¬ 
lations  affected  by  the  cultural  constructions  of 
African,  Asian,  and  American  peoples  as  the  sav¬ 
age,  uncivilized  “others.”  Calls  for  mission  civilisa- 
trice  (the  civilizing  mission)  and  “the  White  man’s 
burden”  not  only  obscured  the  violence  and  exploi¬ 
tation  engendered  by  colonization,  they  were  also 
successful  in  legitimizing  the  colonial  enterprise. 

Even  as  postcolonial  scholars  continue  to  be 
influenced  by  Karl  Marx,  they  seek  to  wrest 
Marxist  analysis  from  its  heavy  Eurocentrism. 
Postcolonialism  “historicizes”  Marx.  In  other 
words,  it  contextualizes  Marx’s  scholarship  in  the 
changes  of  19th-century  Europe  and  argues  that 
despite  the  usefulness  of  Marxist  analysis,  Marxism 
cannot  be  imported  in  its  wholesale  Eurocentric 
form  to  other  situations.  For  example,  the  focus  on 
class  in  Marxist  analyses,  which  subsumed  other 
categories  of  social  stratification,  has  led  to  the 
neglect  of  important  factors  such  as  race,  gender, 
caste,  and  ethnicity.  Offering  a  corrective,  postco¬ 
lonial  scholars  stress  the  importance  of  these  cate¬ 
gories  of  analyses,  at  the  same  time  stressing  their 
mutual  imbrication  with  each  other  and  class. 

The  Genealogy  of  Postcolonialism 

The  origins  of  postcolonialism  cannot  be  traced  to 
a  single  event  or  a  single  author.  Scholars  trace  the 
origins  of  postcolonialism  from  the  postcolonial 
fiction  of  writers  from  ex-colonial  countries,  such 
as  Chinua  Achebe,  Ngugi  W’Thiongo,  Wole 
Soyinka,  and  Ousmane  Semebene;  through  the 
literary  criticism  of  the  1980s  and  1990s;  to  the 
new  historiography  of  the  Subaltern  Studies  Group. 
In  addition,  others  point  to  certain  events  in  decol¬ 
onization  such  as  the  Bandung  conference  of  1955 
and  the  anticolonial  movements  in  Egypt,  Ghana, 
India,  Cuba,  Vietnam,  and  China  as  foundational 
to  the  emergence  of  postcolonialism. 

The  anticolonial  movements  of  the  20th  cen¬ 
tury  are  among  the  first  organized  institutional 
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challenges  to  European  myths  about  European 
superiority,  laying  the  foundations  for  postcolo¬ 
nial  scholarship.  For  example,  Mohandas  Karam- 
chand  Gandhi’s  challenges  to  British  imperialism 
are  as  foundational  to  postcolonialism  as  are  the 
Marxist/Maoist-inspired  liberationist  movements 
of  Cuba,  Vietnam,  and  China  and  the  Algerian 
anticolonial  revolution  in  which  Frantz  Fanon 
participated.  These  decolonization  movements  are 
foundational  to  postcolonial  narratives  as  they 
systematically  challenge  European  narratives  of 
modernity,  race,  gender,  and  the  like.  Several  key 
figures  of  the  decolonization  struggles,  like 
Gandhi,  Fanon,  and  their  counterpart  Albert 
Memmi,  recognize  that  the  decolonization  strug¬ 
gle  is  not  merely  the  physical  removal  of  the  colo¬ 
nizers  from  colonial  territory — it  should  also 
include  the  struggle  for  cultural  decolonization. 
They  suggest  alternative  paths  for  nationalist 
futures,  recognizing  that  breaking  from  the  stran¬ 
glehold  of  the  colonial  imagination  is  difficult. 
The  postcolonial  condition  is  marked  by  a  para¬ 
dox;  the  colonized  both  fear/loath  and  admire  the 
colonizers.  They  want  to  exhume  the  presence  and 
memories  of  colonization,  yet  they  desire  to 
become  “modern”  like  the  colonizers.  These 
insights  frame  postcolonialism. 

The  postcolonial  fiction  of  writers  from 
ex-colonial  countries  is  also  critical  in  the  geneal¬ 
ogy  of  postcolonialism.  Since  colonial  occupation 
rested  on  racialized  and  gendered  constructions/ 
representations  of  Africa,  Asia,  and  Fatin 
America,  the  first  wave  of  postcolonial  fiction 
sought  to  both  deconstruct  received  national 
identities  as  well  as  to  reconstruct  them  in  more 
“positive”  ways.  This  project  of  forgetting  and 
recovery  was  complicated,  and  it  continues  to  be 
so,  by  several  legacies  of  colonization.  First,  the 
politics  of  using  a  colonial  language  to  re-create 
national  identity  haunted  several  postcolonial 
writers;  some  like  Soyinka  opted  for  local  lan¬ 
guages,  while  others  like  Sembene  opted  for  mak¬ 
ing  films  in  local  languages.  Second,  the  desire  for 
modernity,  as  defined  by  the  West,  in  the  nation¬ 
alist  imagination  has  led  to  mimicry  as  well  as 
hybridity,  both  of  which  are  discussed  later  in 
this  entry. 

The  iconic  scholarship  of  Edward  Said  has  been 
enormously  influential  in  shaping  postcolonialism 
as  it  has  brought  to  the  forefront  the  importance  of 


discursive  formations  to  the  imperial  project. 
Influenced  by  Michel  Foucault  and  Antonio  Gram- 
sci,  Edward  Said’s  orientalism  highlights  the 
mutual  imbrications  between  colonial/Eurocentric 
knowledge  and  colonial/Western  power.  According 
to  Said,  intellectual  and  cultural  activity  regarding 
the  “Orient”  is  intimately  linked  to  the  “West’s” 
colonial  project  of  producing,  managing,  and  con¬ 
trolling  the  Orient.  These  cultural  productions  not 
only  are  critical  in  the  production  of  the  orient  in 
the  Western  imaginary,  they  are  also  central  to  the 
production  of  Western  identity  that  is  produced  in 
opposition  to  the  non-Western  world.  Postcolonial 
debt  to  Said’s  scholarship  on  the  imbrications  of 
knowledge  with  power;  the  sociality  and  material¬ 
ity  of  texts,  which  suggests  that  all  texts  are  politi¬ 
cal  and  social  commentaries  and  are  embedded  in 
relations  of  power;  the  salience  of  discourse  to 
ideological  formations;  and  the  importance  of 
agency  to  resistance  cannot  be  underscored 
enough. 

The  influence  of  the  school  of  Subaltern  Studies 
on  postcolonial  scholarship  has  also  been  signifi¬ 
cant.  Influenced  by  Antonio  Gramsci’s  scholarship 
on  the  subaltern  classes  (those  who  are  at  the  mar¬ 
gins  of  state  power  such  as  peasants,  the  working 
class,  etc.),  this  school  was  created  by  historians  of 
South  Asia  to  produce  an  alternative  to  the  elite 
focus  of  South  Asian  history.  Although  the  many 
volumes  produced  by  the  Subaltern  Studies  project 
have  provided  inspiration,  it  has  also  raised  impor¬ 
tant  questions  about  subalternity  that  are  dis¬ 
cussed  in  the  following  section. 

What  Is  Postcolonialism? 

Although  it  is  not  possible  to  identify  a  singular 
postcolonial  approach,  there  are  certain  founda¬ 
tional  characteristics  that  can  be  ascribed  to 
postcolonialism.  The  following  section  identifies 
characteristics  of  postcolonialism,  including 
postcolonial  feminism,  that  may  enable  us  to  think 
about  what  postcolonial  scholarship,  in  all  its  vari¬ 
ance  and  vitality,  means.  Clearly  it  is  not  possible 
to  highlight  every  key  postcolonial/postcolonial 
feminist  concept  in  such  a  brief  space.  The  follow¬ 
ing  points  highlight  what  many  postcolonial  schol¬ 
ars  consider  some  of  the  more  prominent  concepts 
in  postcolonialism.  It  is  important  to  note  that  the 
discussion  under  each  concept  is  limited  in  scope 
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as  the  length  constraints  of  this  entry  do  not  permit 
a  detailed  discussion. 

1.  Postcolonialism  begins  with  the  assumption 
that  the  imperialist  juncture  was  critical  in  the 
shaping  of  the  modern  world.  Although  the  mod¬ 
ern  world  exhibits  both  continuities  and  ruptures 
from  its  colonial  past,  what  is  clear  is  that  the 
legacies  of  imperialism  continue  to  haunt  us. 

2.  Postcolonialism  focuses  on  the  imbrications 
of  knowledge  with  power;  For  example,  it  suggests 
that  what  passes  for  expert  and  universal  knowl¬ 
edge  in  the  humanities,  social  sciences,  and  other 
disciplines  is  deeply  embedded  in  the  colonial 
enterprise  and  is  essentially  Eurocentric.  This  body 
of  knowledge  developed  in  juxtaposition  with 
colonization  and  was  used  (and  continues  to  be 
used)  in  instrumental  ways  to  bolster  Euro- 
American  power.  The  aim  of  postcolonial  scholars 
is  to  “provincialize”  this  knowledge — that  is,  to 
uncover  its  universal  pretensions  and  to  always 
historicize  and  contextualize  these  claims. 

3.  The  interconnections  between  European 
political  and  material  power  and  their  power  to 
represent  “the  other”  is  highlighted  by  Edward 
Said,  Chandra  Mohanty,  Gayatri  Spivak,  and 
Trinh  Minh-ha,  among  others.  In  her  classic  “Under 
Western  Eyes:  Feminist  Scholarship  and  Colonial 
Discourses,”  Chandra  Mohanty  suggests  that  a 
discursive  colonialism,  a  sort  of  neo-orientalism,  is 
at  play  in  some  Western  feminist  texts  discussing 
the  “Third  World  woman.”  In  many  such  dis¬ 
courses,  a  monolithic  image  of  the  poor,  victimized, 
oppressed,  and  other  such  “Third  World  woman” 
is  created  in  contrast  to  the  Western,  liberated,  edu¬ 
cated  woman,  who  is  free  and  in  control  of  her  own 
destiny.  The  authoritative  power  of  these  represen¬ 
tations  is  enormous,  as  is  their  influence.  As  stated 
earlier,  the  allegedly  civilizing  mission  of  the  colo¬ 
nizers  was  seen  as  legitimate  in  part  because  of  the 
success  of  the  racialized  and  gendered  images  of 
inferiority  and  savagery  that  came  to  be  associated 
with  the  peoples  of  Africa,  Asia,  and  Latin  America. 
Similarly,  images  of  Third  World  women  are  con¬ 
temporarily  used  to  legitimize  various  types  of 
interventions  into  the  Third  World. 

4.  Despite,  and  maybe  because  of,  their  Marxist 
leanings,  many  postcolonial  scholars  focus  on 
the  importance  of  race,  ethnicity,  gender,  caste, 


religion,  nation,  and  class  in  the  construction  of 
colonial,  postcolonial,  and  national  power. 
Interrogating  these  constructs  of  stratification  and 
their  deployment  by  global,  national,  and  local 
structures  of  power  enables  postcolonial  scholars 
to  “world”  these  issues — that  is,  to  interrogate  the 
ways  in  which  dimensions  of  power  are  embedded 
in  global,  national,  and  local  discourses. 

5.  Some  scholars  prefer  to  identify  postcolonial 
theories  as  interrogations  of  modernity.  Postcolonial 
and  other  scholars  trace  the  origins  of  modernity  to 
the  ideas  that  emerged  in  Europe  during  the 
Renaissance,  their  consolidation  in  Europe  during 
the  16th  century,  and  efforts  at  spreading  them 
throughout  the  world  during  European  coloniza¬ 
tion  of  Africa,  Asia,  and  the  Americas.  The  use  of 
modernity  in  colonial  discourse  can  be  seen  in  its 
efforts  to  explore,  conquer,  categorize,  and  map  the 
peoples  of  Africa,  Asia,  and  America.  The  discourse 
of  modernity  is  thus  associated  with  historical 
developments  in  Europe  and  North  America.  In  this 
discourse,  modernity  also  becomes  a  tool  for  the 
construction  of  new  identities;  to  be  modern  is  to  be 
superior,  more  rational,  more  in  charge  of  one’s 
own  destiny,  and  more  Western  than  one  who  is 
described  as  traditional.  In  addition,  the  identity  of 
the  “modern”  West,  the  “developed”  world,  is  con¬ 
trasted  to  the  “traditional,”  “underdeveloped,”  or 
“developing”  world.  In  addition,  the  idea  of  moder¬ 
nity  has  become  a  tour  de  force  for  nationalist 
dreams  and  aspirations  as  well  as  the  spread  of 
globalization.  Postcolonial  scholars  interrogate  the 
discourse  of  modernity  to  highlight  its  seductions  as 
well  as  its  slights  and  marginalizations. 

6.  The  concepts  of  nation  and  nationalism  are 
important  in  the  lexicon  of  postcolonial  scholar¬ 
ship.  The  first  wave  of  postcolonial  scholarship 
focused  on  decolonization  and  the  creation  of  inde¬ 
pendent  nations,  the  recovery  of  national  identity 
through  the  creation  of  nationalist  histories  that 
challenged  colonialist  representations  and  histo¬ 
ries,  and  the  development  of  nationalist  conscious¬ 
ness.  However,  as  the  euphoria  of  national 
independence  subsided  and  it  became  obvious  that 
the  newly  independent  nations  bred  their  own 
exclusions,  the  very  idea  of  nation  came  into  ques¬ 
tion.  Contemporary  postcolonial  scholarship  high¬ 
lights  this  disillusion  with  the  nation.  Focusing  on 
the  social  construction  of  nation  as  an  “imagined 
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community,”  postcolonial  scholars,  like  many 
scholars  of  nation  and  nationalism,  suggest  that  this 
community  is  forged  around  certain  markers  of 
identity  such  as  ethnicity,  religion,  race,  gender,  and 
sexuality.  However,  these  same  markers  are  also 
used  to  identify  those  who  do  not  belong  or  are  mar¬ 
ginal  to  the  body  of  the  nation.  According  to  Partha 
Chaterjee,  this  dynamic  of  inclusion/exclusion,  cor¬ 
responding  with  the  liberal  and  illiberal  faces  of  the 
nation,  reflects  the  inherent  contradiction  within 
the  nation  as  it  is  currently  imagined. 

The  construction  of  nationalist  discourses  is 
critical  for  the  creation  and  maintenance  of  nations. 
Dominant,  idealized,  and  elite  narratives  of  the 
nation’s  history,  including  the  history  of  decoloni¬ 
zation  and  originary  myths,  are  often  constructed 
that  leave  out  the  histories  of  subaltern  groups 
within  the  nation.  The  Subaltern  Studies  collective 
and  postcolonial  scholarship  focus  on  highlighting 
the  histories  and  lives  of  the  subaltern  within  the 
nationalist  project.  According  to  Homi  Bhabha, 
there  are  two  types  of  nationalist  discourses  that 
work  simultaneously  to  construct  the  undivided, 
unified  nation:  pedagogical  and  performative. 
Pedagogical  discourses  about  the  nation  claim  a 
fixed  origin  for  the  nation  and  an  essentialist 
national  identity.  These  discourses  establish  the 
nation  as  an  unquestioned  and  legitimate  political, 
social,  cultural,  and  so  on,  unit  in  which  people,  as 
national  subjects,  “naturally”  belong.  In  these  dis¬ 
courses,  nation,  national  subjects,  and  their  histo¬ 
ries  are  genealogically  developed,  linking  in  a 
“natural  way”  the  past  with  the  present.  In  con¬ 
trast,  performative  nationalist  discourses  refer  to 
the  need  for  nationalism  to  be  constantly  created 
and  recreated,  or  in  other  words,  nationalism 
needs  to  be  constantly  performed.  While  perfor¬ 
mative  modes  may  reinscribe  old  nationalist  sym¬ 
bols  and  signs,  they  may  also  provide  an  apertura 
in  which  the  subaltern  may  be  able  to  perform  new 
counternarratives  about  the  nation  that  disrupt  its 
essentialist  identities.  This  is  what  Bhabha  calls  the 
“disruptive  double  narrative  movement.” 

7.  Hybridity  and  mimicry  are  two  interrelated 
and  widely  used  concepts  in  postcolonial  scholar¬ 
ship.  As  stated  in  Point  4,  there  is  a  growing  disil¬ 
lusionment  within  some  postcolonial  scholarship 
with  a  nationalism  in  which  cultural  essentialisms 
and  difference  become  the  basis  for  national  identity 


and  national  borders.  For  example,  while  an  antico¬ 
lonial  narrative  with  its  rigid  national  essentialisms 
may  be  “strategically  essential”  for  subverting  the 
colonial  narrative,  many  postcolonial  scholars  like 
Fanon,  Said,  and  Bhabha  are  uncomfortable  with 
the  simple  and  exclusive  identitarian  politics  this 
may  engender.  These  scholars  suggest  that  rather 
than  being  based  on  exclusions  and  uniqueness, 
cultures  of  the  colonized  and  colonizers  are  also 
mutually  embedded.  For  Bhabha,  the  cultural 
impact  of  colonization  (and  globalization)  may 
manifest  itself  in  mimicry  and/or  hybridity.  While 
mimicry  may  appear  at  a  very  basic  level  to  be  imi¬ 
tation,  it  may  also  include  some  form  of  mockery 
and  can  work  to  subvert  colonial  power.  Hybridity 
is  seen  as  an  in-between  stage,  in  which  colonial 
and/or  globalization  narratives/texts  are  “adopted” 
but  in  new  forms  that  reflect  the  intermixing  of  the 
global  and  local. 

8.  While  the  commitment  of  postcolonial  schol¬ 
ars  to  resistance  and  agency  is  strong,  their  focus 
varies.  As  stated  earlier,  for  the  first  wave  of  post- 
colonial  scholars,  resistance  and  agency  not  only 
meant  participation  in  decolonization  struggles,  it 
also  meant  the  development  of  counternarratives 
to  recover  history,  identity,  and  the  “native”  self 
from  the  degenerative  effect  of  the  colonial  narra¬ 
tive.  For  the  “postnational”  wave  of  postcolonial 
scholars  disenchanted  with  the  marginalizations 
engendered  through  nationalisms,  it  has  meant 
recuperating  the  stories  of  the  subaltern  as  well  as 
arguing  for  a  postnational,  global  politics.  How¬ 
ever,  the  politics  of  recuperating  subaltern  histories 
and  voices  became  the  subject  of  controversy  when 
Spivak  raised  the  fundamental  question,  “Can  the 
Subaltern  Speak?”  While  some  have  focused  on 
the  conclusion  by  Spivak  to  support  or  critique  her 
argument  that  the  subaltern  cannot  speak,  others 
have  suggested  that  her  question  is  better  under¬ 
stood  as  rhetorical  and  an  effort  to  make  research¬ 
ers  aware  of  the  difficulties  inherent  in  representing 
the  subaltern  “other.”  For  example,  through  this 
question  Spivak  alerts  us  to  several  questions 
regarding  voice,  authority,  representation  and  the 
subaltern:  Who  is  the  “authentic”  subaltern?  Is  it 
possible  for  the  researcher  to  avoid  presenting 
himself  or  herself  as  the  authoritative  voice  of  the 
subaltern?  Can  the  voices  of  the  subaltern  be  heard 
through  these  authority  claims? 
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Politics  and  Postcolonialism 

Although  postcolonialism  has  only  recently  started 
to  assert  its  presence  in  the  field  of  politics  and 
international  affairs,  the  essentially  political  lenses 
of  postcolonialism  provide  critical  vantage  points 
from  which  to  understand  politics.  As  the  discus¬ 
sion  above  demonstrates,  postcolonial  approaches 
provide  essential  tools  for  interrogating,  among 
others,  the  politics  of  race,  class,  and  gender  in  the 
making  of  the  modern  world.  In  addition,  they 
help  us  understand  the  inclusions  and  exclusions 
of  nation  and  nationalism,  modernity,  globaliza¬ 
tion,  and  diaspora.  Most  of  all,  postcolonialism’s 
focus  on  the  interface  between  materialism  and 
culturalism  in  the  maintenance  of  power  provides 
us  with  a  more  holistic  understanding  of  the  exer¬ 
cise  of  power  globally.  All  these  insights  from 
postcolonialism  are  fundamentally  political  trea¬ 
tises  about  power  and  hegemony  that  deepen  our 
understanding  of  the  “modern”  world. 

Geeta  Chowdhry 
Northern  Arizona  University 
Flagstaff,  Arizona,  United  States 
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Postcommunist  Regimes 


The  concept  of  postcommunism  is  disputed  because 
it  merely  describes  the  previous  regime  in  a  given 
country  and  says  nothing  about  the  newly  formed 
regimes  subsequent  to  the  collapse  of  communism. 
As  such,  the  concept  of  postcommunism  is  impre¬ 
cise,  both  as  a  descriptive  and  as  an  analytical  cat¬ 
egory  in  social  science.  It  can  be  applied  only  to  the 
relatively  short  period  of  time  between  the  collapse 
of  the  old  and  the  establishment  of  the  new 
regime — in  other  words,  to  the  period  in  which  the 
social  and  political  relationships  are  defined  far 
more  by  the  legacy  of  the  previous  regime  than  by 
internal  and  external  factors.  The  essence  of  post¬ 
communism  is  to  stress  the  domination  of  a  non- 
democratic  political  legacy  in  new  settings  after 
dictatorship.  It  is  for  this  reason  that  the  concept 
had  been  used,  first  and  foremost,  in  the  interna¬ 
tional  political  science  literature  in  the  1990s.  This 
entry  discusses  the  uses  of  this  concept  and  its 
reduced  utility  in  recent  years. 

Communist  regimes  operated  in  the  countries 
that  are  now  termed  postcommunist ,  representing 
a  systemic  alternative  to  liberal  democracies.  These 
regimes  were  based  on  Marxist-Leninist  ideologi¬ 
cal  foundations,  or  referred  to  these  ideologies, 
while  striving  to  attain  global  communism  vis-a- 
vis  Western  capitalist  democracies.  The  Cold  War 
era,  which  emerged  after  1945,  was  characterized 
by  the  confrontation  between  “two  world  sys¬ 
tems”  and  the  potential  threat  of  a  third  world 
war.  The  “First  World,”  led  by  the  United  States, 
saw  the  systems  of  Soviet  Union  and  China,  who 
dominated  the  “Second  World,”  as  communist. 

The  extent  to  which  these  regimes  were  com¬ 
munist  in  the  original,  Marxist  sense  is  disputable, 
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given  that  market  coordination  was  not  replaced 
by  the  voluntary  collaboration  of  social  communi¬ 
ties  but  rather  by  the  domination  of  the  party- 
state.  Left-leaning  critics  of  these  regimes,  precisely 
for  this  reason,  claimed  that  the  systems  that  had 
emerged  in  the  Second  World  were  not  in  fact 
communist  but  rather  state-capitalist.  Others  had 
considered  these  systems  as  state-socialist,  arguing 
that  nationalization  had  only  represented  the  first 
step  in  building  communism,  after  which  the  pro¬ 
cess  of  social  reconstruction  would  have  to  follow. 
One  could  speak  of  communist  systems,  collec¬ 
tively,  in  the  sense  that  the  political  leadership  in 
these  countries  had  referred  to  themselves  as 
Communists:  They  shared  similar  ideologies; 
employed  similar  political,  police,  and  military 
methods;  and  endorsed  similar  goals  for  social 
transformation.  Attaining  these  goals  had  failed 
not  due  to  a  lack  of  will  but  because  these  political 
goals  were  simply  unrealistic. 

A  study  of  posttotalitarian  systems  reveals  that 
despite  the  fact  that  the  term  postcommunism  is 
imprecise,  it  has  nonetheless  achieved  a  successful 
career  in  the  international  political  science  litera¬ 
ture;  the  same  cannot  be  said  for  the  term  postfa- 
cism.  Often,  many  ex-communist  countries,  even 
20  years  after  the  democratic  transition,  are  still 
called  postcommunist;  however,  it  can  be  recalled 
that  ex-fascist  countries  20  years  after  World  War 
II  were  not  labeled  postfascist.  The  evolution  of 
these  two  terms  is  clearly  related  to  the  Cold  War 
and  the  confrontation  of  the  two  world  systems,  in 
which  the  majority  of  the  once-fascist  regimes  had 
become  democratic  and  joined  the  Western  politi¬ 
cal  and  military  organizations,  thus  confronting 
the  Soviet  Union-led  political  and  military  bloc. 
Fascist  regimes  were  relatively  short-lived,  and 
their  collapse  was  followed  by  the  global  confron¬ 
tation  between  the  forces  of  communism  and 
democracy.  Communist  systems,  by  contrast, 
endured  for  longer  periods,  and  their  collapse  was 
not  followed  by  a  confrontation  that  generated  a 
threat  of  a  world  war.  The  image  of  a  common 
enemy  and  the  lack  of  alternative  world  systems 
enabled  the  perception  of  the  legacy  of  commu¬ 
nism  to  endure,  which  in  turn  gave  space  for  the 
term  postcommunism  to  blossom.  East  Germany 
was  the  first  among  the  ex-communist  countries  to 
shed  the  stigma  of  postcommunism  and  in  1990, 
with  the  reunification  of  Germany,  ceased  to  exist 


as  an  independent  state.  It  is  interesting  to  note 
that  researchers  do  not  label  Germany  as  postcom¬ 
munist,  and  the  integration  of  the  East  German 
territories  appears  as  a  domestic  policy  issue,  much 
more  so  than  a  question  of  political  system. 

In  the  1990s,  the  international  political  science 
literature  used  the  terms  postcommunist  countries 
and  new  democracies  interchangeably,  which  for  a 
long  time  nourished  the  hope  that  the  end  of  com¬ 
munism  would  eventually  lead  to  democracy  in  all 
ex-communist  countries.  It  took  nearly  a  decade 
for  researchers  to  arrive  at  a  consensus  and  agree 
that  postcommunist  countries  cannot  be  treated  as 
a  unitary  bloc.  The  expression  postcommunism ,  as 
the  term  communism  had  in  the  past,  highlighted 
the  fundamental  commonalities  among  countries 
ranging  from  Poland  to  Mongolia,  or  from  Slovenia 
through  Turkmenistan.  In  other  words,  it  har¬ 
nessed  a  false  image  that  the  political  structures  in 
these  countries  were  essentially  similar.  For  this 
reason,  the  concept  of  postcommunism  could  not 
evolve  to  become  a  valid  analytical  category  for  all 
countries  that  for  shorter  or  longer  periods  had 
experienced  regimes  based  on  communist  ideolo¬ 
gies.  The  concept  suggested  that  there  are  far 
greater  similarities  between  these  countries  than 
differences.  Moreover,  as  we  know  today,  existing 
similarities  do  not  necessarily  stem  only  from  the 
communist  legacy:  Precommunist  historical  tradi¬ 
tions,  and  their  long-term  consequences,  are 
equally  important. 

Nevertheless,  some  fundamental  commonalities, 
at  least  in  the  beginning,  could  be  observed.  In 
addition  to  the  transition  from  communist  rule,  a 
fundamental  restructuring  of  the  economic  and 
social  systems,  and  thus  a  “triple  transformation,” 
has  been  taking  place.  All  this  had  to  be  done  in  a 
relatively  short  period  of  time,  and  indeed,  in  the 
apt  formulation  by  Jon  Elster,  Claus  Offe,  and 
Ulrich  Preuss  (1998),  it  required  “rebuilding  the 
ship  at  sea.  ”  This,  generally,  went  into  the  direction 
of  more  market-oriented  capitalist  economies  but 
with  distinct  forms  of  transferring  state-owned 
enterprises  into  private  hands.  Not  rarely,  this  was 
accompanied  by  the  emergence  of  “newly  rich”  and 
“oligarchs”  who  benefited  from  their  positions  in 
the  old  regimes  in  these  transformations.  By  con¬ 
trast,  the  populations  at  large  often  suffered  from 
deindustrialization  processes,  increasing  unemploy¬ 
ment,  and  rising  costs  of  living,  contributing  to  the 
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emergence  of  strong  social  inequalities  on  a  scale 
that  was  unknown  before  in  these  societies. 

Similarly,  different  ways  of  coping  with  the 
communist  past  and  often  considerable  human 
rights  violations  by  the  security  apparatuses  of  the 
previous  regimes  created  new  tensions.  In  a  num¬ 
ber  of  countries,  “lustration”  procedures  (i.e., 
procedures  for  screening  persons  seeking  public 
positions  in  the  new  democratic  government  to 
avoid  inducting  former  communists,  especially 
those  who  worked  for  the  secret  police  in  previous 
regimes)  were  carried  out.  In  others,  former  perpe¬ 
trators  of  human  rights  violations  and  those 
politically  responsible  remained  largely  untouched. 
The  former  communist  parties  often  acted  under 
new  labels  such  as  “socialist”  or  “social  demo¬ 
cratic”  ones.  In  the  course  of  time,  however,  they 
became  integrated  in  the  new  parliamentary  sys¬ 
tems.  At  the  other  side  of  the  political  spectrum, 
some  strongly  nationalist  parties  emerged.  On  the 
positive  side  of  the  communist  legacy,  a  high  level 
of  literacy  of  the  population  and  of  education  in 
general  must  be  noted.  Nevertheless,  levels  of 
political  interest  and  participation  still  show  a 
rather  mixed  picture. 

The  use  of  the  expression  postcommunism  sub¬ 
sided  somewhat  after  the  change  of  the  millen¬ 
nium.  This  decreased  popularity  of  the  term  can  be 
linked  to  events  such  as  the  September  1 1  terrorist 
attacks  in  New  York  and  Washington,  D.C.,  in 
2001,  the  ex-communist  East-Central  European 
countries  joining  NATO  (North  Atlantic  Treaty 
Organization),  or  the  accession  of  10  ex-commu¬ 
nist  countries  to  the  European  Union  in  2004  and 
2007.  Today,  the  new  member  states  of  the 
European  Union,  which  include  Bulgaria,  the 
Czech  Republic,  Estonia,  Hungary,  Latvia,  Lith¬ 
uania,  Poland,  Romania,  Slovakia,  and  Slovenia, 
are  usually  no  longer  described  as  postcommunist 
countries  but  rather  as  new  member  states. 
Freedom  House  has  classified  these  countries  as 
full  democracies  for  more  than  a  decade,  and 
according  to  the  Human  Development  Index,  they 
belong  to  the  most  advanced  group  of  countries  on 
a  global  scale.  Most  of  the  Balkan  countries,  with 
the  exception  of  Bosnia  and  Kosovo,  are  on  their 
way  in  joining  the  league  of  the  Baltic  republics 
and  the  Central  European  countries  in  terms  of 
democratic  development.  These  countries  have 
become,  or  are  increasingly  becoming,  a  part  of 


Western  political,  military,  and  economic  integra- 
tional  frameworks,  in  a  world  in  which  religious 
fundamentalism  and  large-scale  immigration  from 
Third  World  countries  have  become  important 
issues.  Faced  with  these  challenges,  the  West  has 
found  allies  in  the  East-Central  European  coun¬ 
tries,  enabling  them  in  turn  to  shed  the  term  post¬ 
communism  and  its  derogatory  connotations. 

The  term  postcommunism  continues  to  be 
employed  widely  across  those  countries  in  which 
the  process  of  democratization  was  stalled,  or  had 
been  reversed,  after  1991.  For  the  most  part,  these 
countries  had  been  part  of  the  ex-Soviet  Union: 
They  range  from  Belarus  to  Kazahkstan,  Georgia 
through  Kyrgyzstan,  and  Russia  to  Armenia. 
When  applied  to  these  countries,  the  term  post¬ 
communism  not  only  serves  to  describe  a  common 
past,  it  also  highlights  the  fact  that  the  process  of 
democratization  has  come  to  a  halt.  The  term  post¬ 
communism  here  indicates  the  most  convenient 
common  denominator,  based  on  which  the  various 
new,  hybrid  regimes,  ranging  from  electoral 
democracies  through  competitive  authoritarianism 
to  almost  full  authoritarian  systems,  can  be  men¬ 
tioned.  The  systematic  study  of  these  regimes 
began  only  after  the  year  2000  and  had  produced 
significant  findings  in  understanding  and  describ¬ 
ing  the  political  and  economic  transformations  in 
the  Caucasian  states  and  the  Central  Asian  coun¬ 
tries  as  well  as  in  Ukraine  and  Russia.  Yet  again, 
the  concept  of  postcommunism  has  proven  impre¬ 
cise  and  much  too  broad  an  umbrella  term,  by 
bringing  very  dissimilar,  nondemocratic  systems 
under  the  same  rubric.  Until  a  broad  consensus 
concerning  the  academic  classification  of  the  new 
hybrid  regimes  emerges,  it  is  likely  that  the  concept 
of  postcommunism  will  continue  to  be  applied  to 
these  countries. 

The  concept  of  postcommunism  cannot  serve 
as  a  legitimate  analytical  category  for  a  compre¬ 
hensive  description  of  consolidated  regimes. 
However,  as  a  modifier  it  can  be  broadly  applied 
for  a  better  understanding  of  certain  social  and 
political  processes.  Contemporary  historians, 
political  scientists,  and  sociologists  will  most 
probably  analyze,  for  a  long  time  to  come,  the 
various  aspects  of  the  historical  transformation, 
which  as  a  modifier  can  be  labeled  as  postcom¬ 
munist  transition  and  which  in  scientific  thinking 
and  historical  awareness  will  doubtlessly  remain 
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one  of  the  fundamental  historical  narratives  of  the 

20th  century. 
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Postmaterialism 


The  term  postmaterialism  refers  to  a  specific  set  of 
modern  values  that  form  a  contrast  to  traditional 
“materialistic”  values.  In  his  seminal  work  The 
Silent  Revolution:  Changing  Values  and  Political 
Styles  in  Western  Publics,  Ronald  Inglehart  intro¬ 
duced  this  term  to  the  scientific  discussion  about 
changing  value  orientations  in  advanced  industri¬ 
alized  societies.  In  earlier  publications,  Inglehart 
had  initially  used  the  term  postacquisitive  values 
instead  of  postmaterialism.  Postmaterialism  pri¬ 
marily  covers  values  such  as  environmental  protec¬ 
tion,  quality  of  life,  citizen  participation,  gender 
equality,  autonomy,  and  self-actualization.  In  con¬ 
trast,  materialistic  or  acquisitive  values  focus  on 


internal  and  external  security,  obedience,  material 
affluence,  and  economic  growth.  In  Inglehart’s 
understanding,  the  term  postmaterialism  (as  well 
as  the  term  materialism )  concerns  social  and  not 
private  values.  Values  are  defined  as  conceptions 
of  the  desirable;  therefore,  postmaterialism 
describes  a  specific  concept  of  a  desirable  type  of 
society  and  not  a  concept  of  the  desirable  order  of 
private  life.  As  abstract  conceptions,  social  values 
are  not  part  of  the  belief  systems  of  individuals. 
They  are  cultural  objects  that  exist  independently 
of  their  supporters.  But  individuals  can  internalize 
such  values,  so  that  they  become  individual  value 
orientations.  A  value  orientation  then  is  a  value 
that  a  person  has  accepted  as  a  moral  guideline.  In 
contrast  to  attitudes,  value  orientations  are  con¬ 
ceptualized  as  being  more  central  and  more  deeply 
rooted  in  the  belief  systems  of  individuals.  Thus, 
value  orientations  determine  attitudes  and  are 
more  resistant  to  change.  This  relative  stability  of 
value  orientations  is  one  of  their  most  central 
definition  elements.  In  this  entry,  the  essentials  of 
Inglehart’s  theory  of  the  silent  revolution  are  pre¬ 
sented  first.  This  is  followed  by  a  short  description 
of  the  most  important  survey  instrument  for  the 
measurement  of  postmaterialism.  Finally,  the  main 
arguments  of  Inglehart’s  critics  are  summarized. 

Inglehart’s  theory  claims  that  postmaterial  value 
orientations  became  gradually  more  widespread 
among  the  publics  in  advanced  industrialized  soci¬ 
eties  after  World  War  II.  His  theory  is  based  on  two 
simple  hypotheses:  the  scarcity  hypothesis  and  the 
socialization  hypothesis.  The  first  states  that  an 
individual’s  priorities  reflect  one’s  socioeconomic 
environment,  while  the  second  postulates  that  to  a 
large  extent,  one’s  basic  values  reflect  the  condi¬ 
tions  that  prevailed  during  one’s  preadult  years.  To 
put  Inglehart’s  theory  in  a  nutshell,  the  need  struc¬ 
ture  of  human  beings  develops  as  Abraham  Maslow 
predicts  in  his  theory  of  the  need  hierarchy.  There 
are  lower  needs,  such  as  hunger,  thirst,  and  the  need 
for  housing  and  security.  Inglehart  calls  these  needs 
material.  Furthermore,  there  are  higher  needs,  such 
as  the  desire  for  social  integration,  esteem,  and  self- 
realization.  Inglehart  calls  these  needs  postmaterial. 
In  the  logic  of  the  need  hierarchy,  higher  needs  do 
not  gain  priority  as  long  as  lower  needs  are  not 
satisfied.  Inglehart  further  assumes  that  in  the  pre¬ 
adult  years,  the  fundamental  need  structure  of  a 
person  is  cognitively  transformed  into  social  value 
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orientations,  which  remain  relatively  stable  in  the 
adult  years.  The  next  step  in  his  argument  is  that 
after  World  War  II,  the  great  majority  of  the  popu¬ 
lation  grew  up  under  conditions  of  peace,  security, 
and  economic  prosperity.  Their  material  needs 
were  satisfied,  so  they  developed  higher  order 
needs  that  were  cognitively  transformed  into  post¬ 
material  value  orientations.  Even  if  Inglehart 
allows  for  period  effects,  that  is,  short-term  value 
fluctuations  in  response  to  altered  environmental 
circumstances,  he  sees  the  major  force  of  value 
change  in  the  generational  change.  In  his  theoreti¬ 
cal  perspective,  the  postwar  generations  are  the 
demographic  basis  of  postmaterialism,  and  when 
the  older  generations  disappear,  the  society  as  a 
whole  develops  toward  postmaterial  values. 
Inglehart  calls  this  process  the  “silent  revolution” 
because  generational  change  works  only  slowly. 

One  important  peculiarity  of  Inglehart’s  theory 
is  that  it  deals  with  value  priorities.  Inglehart  is  not 
interested  in  the  absolute  importance  people  give 
to  different  values  but  in  their  relative  importance. 
He  is  especially  interested  in  the  question  whether 
a  person  gives  greater  priority  to  material  or  to 
postmaterial  concerns.  This  focus  on  value  priori¬ 
ties  has  several  causes.  First  of  all,  it  follows 
directly  from  the  Maslowian  need  hierarchy, 
which  is  the  theoretical  basis  of  Inglehart’s  argu¬ 
ment  and  deals  with  relative  priorities.  Further, 
Inglehart  argues  that  in  the  field  of  politics,  priori¬ 
ties  are  crucial.  Most  people  will  find  material  and 
postmaterial  concerns  important:  They  want  to 
have  economic  growth  and  a  clean  environment, 
wealth  and  greater  possibilities  to  participate  in 
politics.  But  because  politics  is  about  choices  and 
it  is  not  possible  to  have  everything — so  goes 
Inglehart’s  argument — people  have  to  decide 
between  different  desirable  political  goals.  The  fact 
that  Inglehart’s  theory  deals  with  value  priorities 
implies  that  the  value  space  in  his  concept  is  neces¬ 
sarily  a  one-dimensional  continuum  with  the  poles 
of  materialism  and  postmaterialism.  People  are 
either  materialists  or  postmaterialists,  but  they 
cannot  give  priority  to  both  material  and  postma¬ 
terial  values. 

The  most  important  and  widespread  survey 
instrument  for  the  measurement  of  postmaterial- 
ism  is  the  so-called  Inglehart  Index.  Inglehart  con¬ 
structed  it  to  measure  value  priorities  in  which  he 
is  primarily  interested.  For  that  reason,  the 


Inglehart  Index  forces  respondents  to  rank  order 
four  political  goals,  each  of  which  by  itself  would  be 
rated  as  very  important  by  the  respondents.  The 
underlying  assumption  is  that  the  respondents  rank 
order  these  goals  according  to  their  basic  value  pri¬ 
orities.  This  instrument  has  been  regularly  included 
in  the  biannual  Eurobarometer  surveys  of  the 
European  Commission  since  the  beginning  of  the 
1970s.  In  the  Eurobarometer  surveys,  the  Inglehart 
Index  is  implemented  in  the  following  form: 

There  is  a  lot  of  talk  these  days  about  what 
[name  of  the  Nation] ’s  goals  should  be  for  the 
next  ten  or  fifteen  years.  On  this  card  are  listed 
some  of  the  goals  that  different  people  say  should 
be  given  top  priority.  Would  you  please  say 
which  one  of  them  you  yourself  consider  to  be 
most  important  in  the  long  run? 

o  Maintaining  order  in  the  country 
o  Giving  the  people  more  say  in  important 
government  decisions 
o  Fighting  rising  prices 
o  Protecting  freedom  of  speech 

And  what  would  be  your  second  choice? 

In  the  empirical  analysis,  respondents  who  give 
top  priority  to  the  two  materialistic  goals  (first  and 
third  goal)  are  classified  as  materialists,  while  those 
who  give  top  priority  to  the  two  postmaterial  goals 
(second  and  fourth  goal)  are  coded  as  postmateri¬ 
alists.  Finally,  respondents  who  name  a  material 
and  a  postmaterial  goal  on  top  of  their  priority  list 
or  vice  versa  are  classified  as  a  mixed  type. 

Inglehart’s  numerous  books  and  articles  on  post¬ 
material  value  change  as  well  as  his  measurement 
instrument  have  been  extremely  influential  in  the 
scientific  discussion  about  changing  value  orienta¬ 
tions  in  Western  publics.  Nevertheless,  Inglehart’s 
work  generated  a  lot  of  theoretical  and  method¬ 
ological  criticism.  From  a  theoretical  point  of  view, 
it  was  criticized  very  early  that  Inglehart  did  not 
distinguish  clearly  between  the  concepts  “individ¬ 
ual  need”  and  “social  value  orientation”:  Maslow’s 
hierarchy  of  needs,  which  is  a  psychological  model 
of  personality  development,  may  not  necessarily  be 
adequate  for  the  analysis  of  the  escalation  of  public 
wants  and  political  demands.  Furthermore,  there 
is  no  reason  why  individuals  must  set  priorities 
among  material  and  postmaterial  goals  and  values, 
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since  these  options  are  not  at  all  exclusive  of  each 
other  in  real-world  politics.  There  is,  for  instance, 
no  natural  trade-off  between  giving  people  more 
say  in  government  and  fighting  inflation.  In  addi¬ 
tion,  there  is  a  lot  of  empirical  evidence  that  value 
orientations  can  and  do  change  over  the  life  course 
and  that  they  do  this  in  a  predictable  manner. 
Inglehart’s  socialization  hypothesis  has  therefore 
been  repeatedly  challenged  by  a  rival  life  cycle 
hypothesis.  Finally,  the  validity  of  the  Inglehart 
Index  cannot  be  taken  for  granted.  When  there  is 
no  natural  trade-off  between  material  and  postma¬ 
terial  goals,  forcing  the  respondents  to  rank  order 
these  political  goals  is  not  appropriate.  Empirically, 
the  Inglehart  Index  reacts  very  sensitively  to  the 
rate  of  inflation:  When  the  inflation  rate  is  high,  the 
percentage  of  materialists  in  a  country  is  high  too; 
when  the  inflation  rate  is  low,  the  issue  “fighting 
rising  prices”  is  not  of  importance  and  the  percent¬ 
age  of  materialists  is  consequently  low.  These 
short-term  fluctuations  do  not  fit  in  with  the  theo¬ 
retical  status  of  value  orientations  as  relatively  sta¬ 
ble  constructs.  So,  possibly  the  Inglehart  Index  does 
not  measure  value  but  issue  priorities. 

During  the  past  2  decades,  the  focus  of  Ingle¬ 
hart’s  work  has  gradually  shifted  away  from  post¬ 
materialism.  He  now  interprets  postmaterial  value 
change  as  an  important  but  by  far  not  the  only 
component  of  a  broader  process  of  postmodern¬ 
ization.  In  his  latest  work  coauthored  with 
Christian  Welzel,  he  puts  postmaterialism  in  the 
even  broader  context  of  the  so-called  human  devel¬ 
opment  sequence. 

Markus  Klein 
Leibniz  Universitdt  Hannover 
Hannover,  Germany 
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Postmodernism  in 
International  Relations 


Postmodernism  names  an  increasingly  varied  set 
of  intellectual  strategies  that,  in  contradistinction 
to  the  rational  empiricist  method  of  mainstream 
international  relations  thinking,  in  particular,  and 
political  science,  in  general,  foregrounds  the  inter¬ 
pretive,  ethical,  and/or  political  nature  of  interna¬ 
tional  relations  scholarship  (see  particularly, 
Robert  Walker,  1993).  Its  general  aim  is  to  test 
the  limits  and  assumptions  of  our  forms  of  knowl¬ 
edge  of  the  political  world  and  open  up  alternative 
ways  of  conceiving  and  practicing  the  political. 
With  its  focus  on  contingency,  uncertainty,  and 
plurality,  postmodernism  considers  itself  an 
appropriate  form  of  political  thinking  for  a  global 
political  age.  In  comparison  to  the  schools  of  lib¬ 
eralism,  realism,  and  Marxism  in  international 
relations  theory,  postmodernism  does  not  present 
a  school  of  thought  however.  According  to  David 
Campbell  (1998),  no  scholar  of  a  postmodern 
disposition  accepts  the  term  uncritically,  and  a 
wide  variety  of  practices  are  covered  by  the  term. 
That  said,  the  above  definition  underscores,  in 
Wittgensteinian  terms,  a  basic  “family  resem¬ 
blance”  between  the  arguments  of  these  scholars. 
This  resemblance  has  allowed  the  concept  of 
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“postmodernism”  to  acquire  in  the  past  25  years 
intellectual  consistency  in  the  disciplinary  field  of 
international  relations  (IR)  and  to  designate  some¬ 
thing  like  a  specific  approach  to  IR.  In  this  light, 
the  following  considers  (a)  the  intellectual  origins 
of  the  term  and  its  application  to  the  discipline  of 
IR,  (b)  its  major  strategies  and  engagements 
within  the  field,  and  (c)  future  perspectives. 

Origins 

Postmodernism  in  IR  can  be  traced  back  to  the 
major  moves  of  French  thought  in  the  1960s  and 
1970s.  Amalgamating  a  set  of  arguments  from 
modern  linguistics,  anthropology,  psychoanalysis, 
post-Kantian  philosophy  and  phenomenology,  and 
literary  theory,  intellectuals  like  Jacques  Derrida, 
Gilles  Deleuze,  Michel  Foucault,  Jacques  Lacan, 
Jean-Francois  Lyotard,  Julia  Kristeva,  and  Roland 
Barthes  launched  a  series  of  critiques  of  the  major 
tenets  of  humanism  in  the  humanities  and  soft  sci¬ 
ences.  They  questioned,  in  particular,  the  authority 
of  the  modern  subject  qua  the  ground  of  modern 
knowledge,  ethics,  politics,  and  aesthetics;  repre¬ 
sentative  theories  of  truth  and  reality;  and  progres¬ 
sive  theories  of  history.  Although  these  critiques 
were  different  in  kind  and,  at  times,  mutually 
incompatible,  all  the  above  thinkers  engaged,  to 
greater  or  lesser  degrees,  with  the  following:  to 
look  to  differences,  otherness,  and  the  nonhuman 
where  the  modern  period,  within  a  larger  meta¬ 
physical  movement  or  not,  secures  identity  through 
reason;  to  undermine  any  straightforward  corre¬ 
spondence  between  representation  and  reality,  the 
knowing  subject  and  the  known  object;  and,  by 
showing  the  constructed  nature  of  our  representa¬ 
tions  of  the  world,  to  prise  open  the  limits  of  these 
constructions  and  allow  marginalized  and  alterna¬ 
tive  voices  to  emerge.  All  have  been  concerned,  in 
other  words,  with  undermining  and  transcending 
Cartesian-inspired  epistemology,  with  expositing 
forms  of  critique  that  do  not  rely  on  a  final 
instance  of  truth  (God,  humanity,  the  modern  sub¬ 
ject,  the  proletariat,  and  nature),  and  with  rehears¬ 
ing  a  distinction  between  ethical  transcendence 
and  political  determination.  (The  work  of  Jacques 
Derrida  and  Michel  Foucault — together  with  the 
that  of  the  Heideggerian  philosopher  Giorgio 
Agamben — has  foremost  importance  in  this 
respect.) 


From  the  1980s  onward,  this  thought  began  to 
influence  the  subdiscipline  of  IR  within  political 
science.  Scholars  like  Richard  Ashley,  David 
Campbell,  James  Der  Derian,  Michel  Dillon,  Jim 
George,  Michael  Shapiro,  and  Robert  Walker  are 
important  figures  in  this  translation.  The  aforemen¬ 
tioned  critiques  of  language  and  representation 
help,  notably,  to  critique  the  empirical  rationalism 
of  mainstream  IR  thought:  The  belief  that  theory 
describes  the  world  and  is  validated  through 
empirical  evidence.  The  above  refusal  of  an  ulti¬ 
mate  referent  in  language  and  of  the  concomitant 
“performative”  nature  of  all  knowledge  allows  IR 
scholars,  on  the  one  hand,  to  historicize  the  domi¬ 
nant  objects  of  international  relations  thought 
(interstate  relations,  the  sovereignty  of  the  state, 
security,  peace,  and  interdependence)  contra  the 
perceived  abstractions  of  rational  choice  theory 
and  the  empirical  research  methods  of  neorealism 
and  neoliberalism.  On  the  other  hand,  it  allows 
them  to  reveal  relations  of  interest  and  power 
within  any  theoretical  construction  of  political 
reality.  Since  knowledge  is  not  neutral  (represent¬ 
ing  its  object  as  such),  it  always  already  involves 
hierarchies  of  interpretation.  As  a  result  of  these 
two  gestures,  the  field  of  international  relations  is 
pulled  away  from  explanatory  analysis  and  related 
back  to  political  theory  and  political  sociology, 
and  its  objects  are,  in  principle,  opened  up  to  alter¬ 
native  possibilities  of  political  knowledge,  power, 
and  organization. 

Engagements 

Since  the  establishment  of  these  strategies  within 
IR  theory,  postmodern  IR  thought  has  increas¬ 
ingly  weighed  theoretical  understanding  with 
empirical  investigation.  Some  standard  themes 
and  authorships  are  briefly  listed  here.  Postmodern 
topics  in  IR  range  from  genealogies  and  decon¬ 
structions  of  the  IR  tradition  (Walker,  Campbell), 
through  focused  attention  on  modern  delimita¬ 
tions  of  sovereignty  (Ashley,  Walker,  Shapiro),  to 
theorizations  of  contemporary  modalities  of  war¬ 
fare  and  security  (Michel  Dillon  and  Julian  Reid, 
Maja  Zehfuss),  of  “liberal”  regimes  of  governance 
and  surveillance  (Didier  Bigo,  Claudia  Aradau, 
Louis  Amoore,  and  Marieke  de  Goede),  of  Amer¬ 
ican  and  British  foreign  policy  (David  Campbell, 
Michael  Hardt,  Toni  Negri,  and  Michael  Shapiro), 
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of  “humanitarian”  interventions  (Michel  Dillon 
and  Julian  Reid,),  and  of  migration,  asylum,  and 
refugeeship  (Jenny  Edkins).  Since  postmodern 
approaches  to  these  topics  simultaneously  open 
up  questions  of  patriarchy  and  Western  domina¬ 
tion,  they  have  also  dovetailed  with  respective 
concerns  in  IR  with  feminist  and  postcolonial 
issues  (e.g.,  Kimberly  Hutchings  and  Vivienne 
Jabri).  Given  constraints  of  space,  rather  than 
expositing  these  arguments,  this  entry  focuses  on 
the  topic  of  “modernity”  as  such  in  postmodern 
IR  theory.  All  the  above  engagements  presuppose 
something  like  a  common  approach  to  it.  To 
understand  the  “how”  of  these  postmodern  posi¬ 
tions  in  the  field  of  IR  entails,  in  other  words,  first 
understanding  this  presupposition. 

In  its  strongest  formulations,  the  “post”  of 
“postmodern”  designates  neither  an  “after”  nor 
an  “against”  of  the  modern  period,  of  moderniza¬ 
tion  processes,  or  of  the  general  descriptive 
“modernity.”  It  refers,  rather,  to  a  critical,  ambig¬ 
uous  relation  to  the  general  tenets  of  the  modern. 
These  well-known  tenets  consist  in  a  belief  in  prog¬ 
ress,  in  the  emancipating  powers  of  reason,  in  the 
political  centrality  of  subjectivity,  and  in  ethical 
and — in  historical  tension  with  nation-state  par¬ 
ticularism — political  universalism.  Together  with 
these  tenets,  the  discourse  of  modernity  affirms  the 
exemplarity  of  Western  processes  of  moderniza¬ 
tion:  the  distinction  between  religion  and  the  state, 
the  separation  of  civil  society  from  the  state,  the 
emergence  of  the  individual  and  the  categories  of 
ethical  and  political  personality  and  solidarity, 
democracy  and  the  rule  of  law,  industrialization 
and  urbanization,  and  so  on.  For  postmodern 
thought,  following  Foucault  (1980),  these  pro¬ 
cesses  dominate  and  discipline  the  individual  as 
much  as  they  individualize  and  empower  him  or 
her:  Subjectivity  implies,  also,  subjection.  Following 
Derrida  (1992),  modern  law  is  considered  to  cover 
over  its  own  violent  emergence  and  is  consequently 
unable  to  reflect  on  its  own  ethical  and  political 
limits.  Fiberalism  affirms,  for  example,  universal 
tolerance  and  general  assembly.  But  the  spaces  of 
both  are  necessarily  exclusive  and  violent,  organiz¬ 
ing  the  “others”  of  liberalism  in  self-legitimating 
ways.  Finally,  for  postmodern  thought,  following 
Emmanuel  Fevinas’s  attention  to  ethical  singular¬ 
ity,  the  immanent  relation  between  theoretical  and 
practical  reason  in  modernity  codifies  ethics  and 


turns  the  Enlightenment  principles  of  freedom  into 
those  of  abstract  domination  with  little  eye  to 
locality  and  contingency  (the  major  critique  of 
Western  interventionism).  In  these  three  respects, 
modernity’s  promise  of  a  generalizable  axiomatics 
of  freedom  hides  practices  of  discipline  and  domi¬ 
nation.  Modernity  is  to  be  neither  avoided  nor 
transcended;  its  limits  of  violence  are,  however,  to 
be  critiqued  and  negotiated. 

Within  the  field  of  IR  this  kind  of  argument  has 
meant  the  following.  IR  turns  on  the  “anarchy”  of 
relations  that  befall  the  state  outside  the  parame¬ 
ters  of  its  own  sovereignty.  For  postmodernists, 
sovereignty  and  anarchy  are  mutually  constitutive, 
violent  constructions  whose  respective  delimita¬ 
tions  suppress  the  multiple  differences  and  plurali¬ 
ties  of  the  real.  Modern  political  organization  does 
wrong  to  these  differences.  Contra  cosmopolitan¬ 
ism,  however,  postmodern  IR  thought  does  not 
affirm  either  the  individual  against  the  modern 
state  or  the  transnational  human  rights  regime  that 
increasingly  underpins  supranational  juridical  per- 
sonhood.  For  the  limits  of  the  individual  are  as 
constructed  as  those  of  the  modern  state.  Modern 
liberal  individualism  and  its  international  equiva¬ 
lent,  cosmopolitan  liberalism,  codify  subjects  in 
particular  ways  as  bearers  of  rights,  as  fully  delim¬ 
ited  and  conscious  persons,  as  beings  prior  to 
social  interaction  and  meaning.  These  philosophies 
risk  depoliticizing  social  agents.  Therefore,  aware 
of  the  freedoms  gained  through  modernity  (in  dis¬ 
tinction  to  the  hierarchies  of  the  premodern  age), 
postmodern  IR  thought  considers  the  state  as  only 
one  actor  in  the  plural  fields  of  global  politics  and 
sees  the  individual  and  universalism  as  the  legacies 
of  Western  Enlightenment.  This  legacy  is  particu¬ 
lar.  It  cannot  consequently  theorize  for  humanity 
as  a  whole.  When  it  does,  non-Western  social 
actors  risk  being  perceived  in  a  nonpolitical  con¬ 
text;  the  consequences  of  universalism  can  be 
utterly  self-contradictory  (see  particularly  Mah- 
mood  Mamdani,  2009).  Postmodern  thought  con¬ 
verges  here  with  neo-Marxist  and  postcolonial 
critiques  of  Western  imperialism.  Modernity  can¬ 
not  be  a  straightforward  counterforce  to  such 
imperialism,  for,  however  much  it  distinguishes 
itself  from  authoritarianism,  it  is  also  complicit 
with  strategies  of  domination. 

What  emerges  from  the  postmodern  analysis  of 
modernity?  Any  countersubject  to  replace  Western 
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modernity  would  simply  reproduce  the  tenets  of 
modernity.  Postmodernists  do  not  consequently 
seek  a  new  political  subject.  They  seek  to  untie  and 
foreground  what  is  repressed  by  political  subjectiv¬ 
ity  as  such,  hence  their  practical  concern  with  the 
excluded,  the  marginal,  the  dissenting,  and  an 
(often  too)  implicit  engagement  with  political 
organizations  that  are  more  inclusive,  less  exclu¬ 
sive.  Postmodern  thought’s  ultimate  concerns  are, 
therefore,  with  ethics  and  (more  long  term)  with 
the  overall  rethinking  of  modern  political  catego¬ 
ries.  The  former  ethics  is  not  that  of  rational  uni- 
versalism,  however;  it  addresses,  as  reasonably  as 
possible,  what  political  reason  and  subjectivity  will 
necessarily  exclude.  It  is  an  ethics  of  contingency, 
of  uncertainty,  of  singularity  in  the  interstices  of 
political  arrangement  and  institution.  Given  the 
recent  economic  and  technological  globalization, 
postmodernism  in  IR  befits,  it  is  argued,  a  global 
age.  For  in  this  age,  plurality  and  difference  are 
increasingly  critical  values.  The  postmodern 
engagement  with  modernity  leads,  as  a  result,  to 
critical  and  dissenting  approaches  to  all  interna¬ 
tional  topics,  approaches  that  anticipate,  some 
way  down  the  road,  a  new  politics  of  freedom. 

Immediate  Futures 

Postmodernism  is  now  settled  within  international 
relations  theory.  Its  position  remains,  however, 
marginal.  Just  as  French  thought  has  been  unable 
to  dislodge  scientific  rationalism  in  the  social  sci¬ 
ences  (remaining  effective  in  the  humanities  and 
soft  social  sciences  alone),  so  postmodern  thought 
in  IR  has  not  perturbed,  let  alone  dislodged,  main¬ 
stream  IR  thought  (political  realism,  rational 
choice  theory,  game  theory,  and  empirical  research 
methods  in  general).  Although  its  present  critiques 
of  domestic  and  global  liberal  regimes  are  impor¬ 
tant,  future  perspectives  for  postmodernism  in  IR 
should  be  considered  in  this  light.  Two  possibilities 
seem  convincing  here.  Given  its  postpositivist 
roots  and  ethical  orientation,  postmodernism  in  IR 
will  refuse  to  become  more  “mainstream,”  resist¬ 
ing  empirical  procedure  and  attending  to  the  mar¬ 
ginal — postmodernism  will  remain  a  marginal 
critique  in  this  sense.  Or  it  will  theorize  how  to 
think  and  practice  less  violent  political  organiza¬ 
tion  in  an  age  of  globalization  and  fragmentation. 
Although  both  options  are  intellectually  justifiable, 


the  second  requires  on  the  part  of  postmodern 
scholars  more  rehearsal  of  postmodernism’s  own 
assumptions  (especially  around  the  concept  and 
practice  of  freedom).  This  entry  concludes  with 
two  examples.  First,  if  all  theory  is  interpretive  to 
a  greater  or  lesser  extent,  the  criteria  by  which  one 
theory  is  interpretatively  better  than  another  should 
be  formalizable  and,  therefore,  testable.  If  so,  at 
the  methodological  level,  postmodernism  and 
empiricism  could  talk  to  each  other.  At  the  politi¬ 
cal  level,  postmodernism  could  exposit  more  what 
the  lesser  violence  entails  in  global  political  life. 
Second — and  consequently — for  postmodern  the¬ 
ory,  freedom  tends  to  be  situated  beyond  liberal 
law,  with  the  latter  seen  as  coercive,  disciplinary, 
and,  at  worst,  depoliticizing.  A  clearer  negotiation 
with  the  inextricable  relation  between  freedom  and 
law  is,  however,  needed  if  postmodern  thought  is 
to  help  foster  more  progressive  forms  of  gover¬ 
nance.  In  an  age  of  uncertainty  regarding  the  future 
of  global  political  configurations,  both  positions 
are  of  interest  to  progressive  reorganizations  of 
international  relations.  The  future  of  postmodern 
thought  in  IR  will  probably  depend  on  a  careful 
balance  struck  between  the  two. 

Richard  John  Beardsworth 
American  University  of  Baris 
Paris,  France 
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Power 


Power  is  a  concept  broadly  used  both  in  personal 
and  political  communication  and  in  scientific  dis¬ 
course.  Everybody  claims  to  know  what  power  is. 
Yet  looking  at  the  wealth  of  meanings  given  to 
this  concept  in  everyday  life  and  among  social 
scientists,  the  confusion  is  striking.  There  are  sev¬ 
eral  reasons  for  this  confusion.  By  way  of  intro¬ 
duction,  only  two  reasons  need  to  be  mentioned. 

One  reason  for  this  confusion  is  that  the  use  of 
the  concept  of  power  in  common  and  scientific 
language  is  highly  morally  loaded.  Power  research¬ 
ers  committed  to  the  Weberian  epistemological 
norm  of  Werturteilsfreiheit  (freedom  from  value 
judgment)  are  under  extreme  pressure  both  from 
the  members  of  their  own  scientific  community, 
who  follow  the  contrary  norm  of  critical  analysis, 
and  from  the  public  to  take  a  clear  moral  position 
on  power.  Critical  theorists  who  have  committed 
themselves  to  fighting  power  with  the  means  of 
science,  on  the  other  hand,  have  been  accused  of 
normative  bias  when  dealing  with  power  analysis. 
Ever  since  power  has  been  on  the  social  scientific 
research  agenda,  it  has  provided  one  of  the  most 
prominent  battlegrounds  for  conflicting  epistemo¬ 
logical  approaches,  political  interests,  and  cultural 
beliefs.  What  is  left  after  all  these  battles  is  not  easy 
to  put  together  again  into  one  picture  of  social 
scientific  power  research. 

There  is  a  second  reason  why  power  research 
constitutes  a  particularly  slippery  ground  for  social 
scientists.  Scholars  of  power  are,  like  laymen  in 
general,  prone  to  being  influenced  by  cultural 
images  and  ideologies  of  power.  In  Western  culture, 


there  are  mainly  two  contrasting  images  of  power 
that  have  left  their  imprint  on  social  scientists’ 
efforts  to  deal  analytically  and  empirically  with  this 
widespread  phenomenon.  According  to  the  first 
image,  power  is  a  negative  phenomenon.  It  is  seen 
as  a  social  force  suppressing  the  weakest  members 
of  society  and  making  them  dependent  on  the 
power  holders.  According  to  this  cultural  image, 
power  is  considered  “bad”  because  it  is  believed 
that  the  concentration  of  power  undermines  gener¬ 
ally  accepted  social  norms  and  values  of  democracy 
and  disturbs  the  smooth  functioning  of  its  institu¬ 
tions.  Popular  Western  culture  continually  feeds 
this  negative  image  of  power.  Western  mass  media 
produce  and  reproduce  stereotypes  of  bad  power: 
The  power  holders  are  mainly  males,  meeting  other 
powerful  men  in  secret  sessions  and  gathering 
around  the  top  power  holder,  their  boss.  Violence 
is  their  ultimate  power  resource.  Implicit  in  this 
stereotype  of  bad  power  are  the  beliefs  that  secret 
power  in  general  is  stronger  than  public  power, 
that  power  has  a  gender  bias  in  favor  of  the  males, 
and  that  the  strongest  power  resource  is  violence. 

The  imprint  left  by  this  cultural  stereotype  on 
social  scientific  research  is  threefold.  First,  follow¬ 
ing  the  first  element  of  the  cultural  image  of  power 
according  to  which  “real”  power  is  invisible,  schol¬ 
ars  of  power  have  insisted  on  the  imperative  to 
discover  hidden  power  as  their  main  research  goal. 
Second,  the  gender  bias  implicit  in  the  cultural 
image  of  power  is  reproduced  by  power  research. 
Power  is  mainly  a  male  affair.  The  overwhelming 
majority  of  power  researchers  are  males,  and  their 
research  subjects,  the  power  holders,  as  a  rule  are 
males  as  well.  There  is  practically  no  research  on 
female  (political,  social,  economic,  cultural,  and 
religious)  power  and  their  peculiarities,  if  any,  as  to 
its  exercise.  There  is  a  third  and  contradictory 
imprint  that  the  popular  image  of  power  has  left 
on  power  research  in  the  social  sciences.  Although 
it  is  generally  believed  that  physical  muscular  com¬ 
ponents  are  the  ultimate  instruments  of  power, 
analyses  on  violence  do  not  constitute  a  self- 
evident  part  of  conventional  power  research. 
Leaving  aside  those  cases  when  force  and  violence 
represent  the  last  instance,  as  in  riots,  revolutions, 
and  war,  conventional  power  research  has  usually 
explored  cases  in  which  the  exercise  of  power 
appears  to  be  a  bloodless,  largely  psychological, 
and  mainly  intellectual  affair. 


2100  Power 


Whereas  the  negative  image  of  power  is  strongly 
represented  in  Western  cultural  industry,  its  coun¬ 
terpart  is  less  pronounced.  When  power  is  consid¬ 
ered  “good,”  it  is  generally  associated  with  a  small 
group  of  gifted  and  qualified  men  constituting  an 
elite  in  a  specific  societal  segment.  As  long  as  these 
fragmented  elites  do  not  coalesce  into  one  and  the 
same  elite,  they  produce  the  positive  effects  of 
power:  know-how,  expertise,  trust,  and  vision.  If 
power  is  good,  it  works  on  the  basis  of  nonphysi¬ 
cal  and  mainly  intellectual  means  of  convincing 
others.  According  to  the  institutionalized  division 
of  labor  within  the  social  sciences,  elite  research  is 
separated  from  power  research.  This  illustrates 
that  the  cultural  image  of  positive  or  good  power 
is  underrepresented  not  only  among  laymen  but 
also  among  students  of  social  science. 

All  said,  it  becomes  quite  easy  to  understand 
why  power  is  one  of  the  most  elusive,  yet  most 
widely  used  concepts  in  the  social  sciences.  Some 
scholars  have  suggested  getting  rid  of  it,  but  the 
concept  of  power  is  and  remains  an  essential  tool 
of  analysis.  Power  has  always  been  the  object  of 
many  controversies,  both  definitional  and  analyti¬ 
cal.  Accordingly,  there  are  many  different  ways  of 
how  to  define  this  concept  and  how  to  study 
power. 

Definitions  and  Constituent  Elements 

As  for  many  other  social  science  concepts,  Max 
Weber  (1978)  provides  an  excellent  point  of 
departure:  “‘Power’  ( Macht )  is  the  probability 
that  one  actor  within  a  social  relationship  will  be 
in  a  position  to  carry  out  his  own  will  despite 
resistance,  regardless  of  the  basis  on  which  this 
probability  rests”  (p.  53).  Weber  stresses  that 
there  are  a  wide  variety  of  power  bases:  “All  con¬ 
ceivable  qualities  of  a  person  and  all  conceivable 
combinations  of  circumstances  may  put  him  in  a 
position  to  impose  his  will  in  a  given  situation” 
(p.  53).  It  is  with  reference  to  this  wealth  of  pos¬ 
sibilities  to  mobilize  and  exert  power  that  Weber 
has  qualified  this  phenomenon  as  “sociologically 
amorphous.” 

Departing  from  Weber’s  definitions,  four  con¬ 
stituent  elements  of  power  can  be  distinguished: 
first,  power  is  relational,  second,  it  is  intentional, 
third,  it  is  based  on  social  resources-,  and  fourth, 
power  is  related  to  specific  contexts. 


Relational 

Many  authors  have  followed  in  Weber’s  foot¬ 
steps  and  see  power  not  as  a  specific  quality  of  a 
person  but  as  a  relation  between  two  actors  (A  and 
B)  or  entities.  For  instance,  according  to  Richard 
Tawney  (1931),  “power  may  be  defined  as  the 
capacity  of  an  individual,  or  group  of  individuals, 
to  modify  the  conduct  of  other  individuals  or 
groups  in  the  manner  which  he  desires”  (p.  230). 
Robert  Dahl’s  (1957)  elegant  definition  highlights 
the  relational  character  of  power:  “A  has  power 
over  B  to  the  extent  that  he  can  get  B  to  do  some¬ 
thing  that  B  would  not  otherwise  do”  (pp.  202- 
203).  If  power  is  understood  as  a  relation  in  which 
not  only  can  A  modify  B’s  intentions  and  will,  the 
question  arises  of  how  B  can  “relate”  to  A.  On  its 
part,  B  can  try  to  resist  A’s  will  by  mobilizing 
counterpower.  According  to  Heinrich  Popitz 
(1992),  the  ultimate  chance  to  break  A’s  power 
consists  in  B  physically  withdrawing  from  the 
power  relation  either  by  escaping  from  the  situa¬ 
tion,  by  committing  suicide,  or  by  being  prepared 
to  be  killed.  The  willingness  to  die  or  to  commit 
suicide  is  the  strongest  counterpower  B  can  mobi¬ 
lize  against  A.  Both  in  conventional  and  in  terrorist 
warfare,  it  is  a  commonly  used  weapon. 

Intentional 

The  definition  of  power  is  limited  to  cases  in 
which  A  intends  to  carry  out  his  will  vis-a-vis  B.  In 
contrast,  if  B  conforms  with  A’s  wishes  even  with¬ 
out  A’s  intention  to  impose  them  on  B,  we  are  deal¬ 
ing  with  the  phenomenon  of  influence.  However, 
the  borderline  between  influence  and  power  is 
fuzzy.  Carl  Friedrich  (first  articulated  in  1941,  then 
in  1963)  has  suggested  that  a  power  relationship 
may  exist  even  when  no  visible  pressure  or  influ¬ 
ence  is  applied.  There  may  be  cases  when  B  seems 
to  implement  A’s  will  without  pressures  being  put 
on  B.  B  does  so  because,  on  the  basis  of  B’s  and 
others’  previous  experiences,  B  is  aware  that  A 
desires  that  specific  type  of  behavior  and  is  in  a 
position  to  impart  sanctions  if  it  is  not  fulfilled.  For 
instance,  a  U.S.  senator  votes  in  a  certain  way,  con¬ 
trary  to  his  or  her  previously  manifested  views, 
because  he  or  she  knows  that  otherwise  the  presi¬ 
dent  has  the  power  to  punish  any  contrary  behav¬ 
ior  by  withdrawing  some  precious  resources  and/or 
acting  in  such  a  way  as  to  exclude  him  or  her  from 


Power  2101 


prestigious  positions  and  appointments.  Behaviors 
of  this  type  are,  in  Friedrich’s  language,  instances 
of  “anticipated  reactions.” 

Not  all  scholars  agree  that  (manifest  or  antici¬ 
pated)  intentionality  is  the  definitional  distinction 
between  influence  and  power.  As  shown  below,  a 
heated  debate  has  taken  place  from  the  mid-1950s 
to  the  1970s  focusing  on,  among  other  things,  this 
distinction.  Other  controversies  have  dealt  with 
the  question  of  how  the  intention  to  use  power 
manifests  itself.  Generally  speaking,  the  expression 
of  one’s  intention  to  use  power  depends  on  the 
type  of  resources  available  and  the  type  of  sociopo¬ 
litical  context  in  which  power  relations  take  place. 
Political  scientists  have  given  special  prominence 
to  the  realm  of  decision  making.  Harold  Lasswell 
and  Abraham  Kaplan  (1950)  define  power  as 
“participation  in  the  making  of  decisions:  G  has 
power  over  H  with  respect  to  the  values  K  if  G 
participates  in  the  making  of  decisions  affecting 
the  K-policies  of  H”  (p.  75). 

Control  is  another  concept  often  used  synony¬ 
mously  with  the  concept  of  power.  The  distinction 
between  these  two  concepts,  however,  is  important 
and  useful  for  grasping  two  different  phenomena. 
The  powerful  person  or  entity  uses  different 
resources  including,  if  necessary,  violence  to 
enforce  his  or  her  intention.  In  contrast,  the  con¬ 
cept  of  control  is  used  when  the  other’s  behavior  is 
supervised  and/or  guided  without  wanting  to 
change  it.  In  politics,  both  informal  and  formal 
mechanisms  of  control  are  at  play.  Among  the  lat¬ 
ter  ones,  the  institutionalization  of  competition 
between  political  parties  and  the  institutionaliza¬ 
tion  of  opposition  to  the  governing  party  or  party 
coalition  are  of  crucial  importance  for  making 
democracies  work.  The  institutionalization  of  con¬ 
trol  mechanisms  has  the  function  to  prevent  the 
abuse  of  power  and  to  stimulate  power  holders  to 
use  their  resources  according  to  the  generally 
accepted  goals  and  values.  However,  there  is  a 
tricky  relationship  between  control  and  power. 

Control  without  power  can  turn  out  to  be  inef¬ 
ficient  when  it  is  confronted  with  strong  power 
holders.  But  who  controls  the  controllers  if  they 
are  given  power  resources?  On  the  other  hand, 
powerless  controllers  typically  run  the  risk  of 
being  drawn  into  the  very  power  constellation  they 
have  been  given  the  function  to  supervise.  United 
Nations  peacekeeping  and  observer  forces,  for 


example,  have  often  enough  become  part  of  the 
conflicting  groups  they  were  supposed  to  control. 
Obviously,  a  necessary  precondition  for  control 
mechanisms  to  perform  their  function  of  limiting 
power  abuses  is  the  knowledge  of  who  uses  power 
where  and  when.  But,  evidently,  not  all  power 
relations  are  open  and  accessible.  Secret  power 
networks  infiltrating  official  power  centers  are  dif¬ 
ficult,  if  not  impossible,  to  check  by  institutional¬ 
ized  control  mechanisms.  If  power  works  in  the 
underground,  also  controllers  have  to  move  under¬ 
ground,  taking  the  role  of  spies,  concealed  investi¬ 
gators,  or  secret  agents.  The  power  of  power 
vis-a-vis  its  controllers  consists  in  its  capacity  to 
dictate  the  means  of  control.  Just  because  the  rela¬ 
tionship  between  power  and  control  is  so  highly 
flexible,  it  is  important  to  keep  these  two  phenom¬ 
ena  conceptually  apart  from  each  other. 

The  intention  to  exert  power  can  also  be 
expressed  explicitly  as  an  order  to  obey.  This  is 
especially  the  case  when  and  if  the  power  relation 
takes  place  within  a  system  of  authority.  In  this 
sense,  David  Easton  (1965)  speaks  about  the 
“power  of  the  authority  roles”  (p.  207).  In  more 
informal  social  contexts,  the  powerful  person  can 
express  his  or  her  intention  with  the  help  of  more 
or  less  subtle  means  of  body  language,  with  a  sign 
of  the  hand  or  a  raise  of  the  eyebrow  (Erving 
Goffman,  1963).  Typically,  only  a  limited  range  of 
persons  are  able  to  interpret  these  signs  according 
to  the  intentions  of  the  power  holder.  The  inten¬ 
tion  to  use  power  and  the  translation  of  this  inten¬ 
tion  into  action  can  be  divided  between  A,  the 
power  holder,  and  persons  who  function  as  the 
staff  of  A.  If  this  division  of  “labor”  is  given, 
power  analysis  has  to  shift  from  the  analysis  of 
dyadic  relations  between  A  and  B  to  the  analysis  of 
triadic  relations  between  A,  C,  and  B.  Whenever 
dyadic  power  relations  are  transformed  into  tri¬ 
adic  power  relations,  power  is  undergoing  the  first 
stage  in  its  transformation  into  authority  (Herr- 
schaft;  Popitz,  1992,  p.  232). 

Resources 

According  to  Weber,  power  relations  can  be 
built  on  a  variety  of  possible  resources.  Efforts 
have  been  made  to  classify  the  types  of  resources 
that  lend  themselves  most  likely  to  be  exploited  as 
power  bases.  M.  Rainer  Lepsius  (1968,  1990)  has 
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proposed  three  different  bases  of  power:  the  mar¬ 
ket,  the  association,  and  the  state.  Power  based  on 
the  market  refers  to  Karl  Marx’s  insight  into  the 
power  potential  related  to  the  ownership  of  the 
means  of  production.  Understandably,  the  owners 
of  the  means  of  production  in  general  have  a 
higher  chance  to  press  through  their  will  on  others 
than  nonowners  do.  Power  based  on  association  or 
organization  refers  to  the  chances  A  has  to  realize 
his  will  if  this  will  has  been  transformed  into  an 
organizational  goal  of  a  political  party,  trade 
union,  or  voluntary  association.  Another  lively 
debate  fought  out  in  the  1960s  and  1970s  refers  to 
the  role  performed  by  organizations  as  a  power 
basis.  As  Elmer  Schattschneider  (1960)  aptly  put  it, 
“Organisation  is  the  mobilization  of  bias”  (p.  71). 

Power  based  on  the  state  refers  to  the  (legiti¬ 
mate)  use  of  violence  as  the  ultimate  means  of 
compliance.  Power  can  be  strengthened  if  it  is 
based  on  a  combination  of  all  three  power 
resources.  Actors  who  are  situated  in  a  privileged 
market  position,  whose  power  intentions  are  orga¬ 
nized,  and  who  have  access  to  (legitimate)  means 
of  violence  have  a  greater  chance  to  succeed  than 
nonorganized  actors  in  an  unprivileged  market 
situation  without  access  to  (legitimate)  means  of 
violence.  Strong  power  resources  have  the  ten¬ 
dency  to  attract  each  other,  whereas  weak  power 
resources  tend  to  remain  fragmented.  Different 
patterns  of  power  distribution  within  a  society 
have  been  identified.  Two  extreme  models  of 
power  distribution,  the  “elitist”  and  the  “plural¬ 
ist,”  have  especially  attracted  the  attention  of  U.S. 
scholars  and  been  in  the  focus  of  a  highly  ideo¬ 
logically  loaded  controversy.  Although  this  debate 
has  taken  place  more  than  4  decades  ago,  it  is  still 
of  importance,  especially  for  scholars  dealing  with 
the  tension  between  democracy  and  power. 

Taking  into  consideration  the  plurality  of  power 
resources  and  the  ubiquity  of  power  in  social  inter¬ 
action  processes,  system  theorists  have  conceptual¬ 
ized  power  more  abstractly.  Within  his  complex 
structural-functional  framework,  Talcott  Parsons 
(1969)  has  suggested  treating  power  “as  a  specific 
mechanism  operating  to  bring  about  changes  in 
the  action  of  other  units,  individual  or  collective, 
in  the  processes  of  interaction”  (p.  353).  Even 
more  abstractly,  system  theorists  also  speak  about 
power  as  a  medium  of  communication  comparable 
with  money  in  economic  relations.  As  such,  it 


structures  social  relations  by  selecting  the  types 
and  limiting  the  amount  of  action  alternatives. 
Power  conceived  of  as  a  medium  of  communica¬ 
tion  has  the  function  to  regulate  the  dynamics  of 
interaction  processes  (see  the  classic  study  by  Peter 
Blau,  1964).  Like  money,  power  obeys  the  law  of 
diminishing  returns.  If  inappropriately  used,  it  can 
be  wasted  and  may  lose  value.  There  is  no  fixed 
relationship  between  the  increase  in  power  and  the 
increase  in  the  selection  of  action  alternatives.  The 
system  theoretical  point  of  view  of  power  allows 
for  opening  the  scope  of  power  analysis  and  inte¬ 
grating  both  positive  and  negative  functions  of 
power  understood  as  a  resource  of  the  system  into 
the  empirical  research. 

Context 

Power  relations  vary  according  to  the  type  of  the 
context  within  which  they  take  place.  These  con¬ 
texts  can  be  more  or  less  formal  and/or  have  more 
or  less  systemic  character.  In  informal  contexts,  the 
values,  norms,  and  sanctions  regulating  power  rela¬ 
tions  are  not  defined  precisely  and  explicitly. 
Therefore,  they  tend  to  be  more  arbitrary,  and  the 
risk  is  that  they  will  escape  the  control  of  both 
power  holders  and  subordinates.  If  it  is  left  to  the 
personal  whims  of  a  powerful  person  how  and 
when  to  press  through  his  or  her  will,  then  the  sub¬ 
ordinates  have  no  chance  to  appeal  to  common 
values,  norms,  or  sanctions  in  order  to  limit  the 
scope  of  their  submission.  In  these  cases,  power 
tends  to  become  “despotic.”  The  more  the  context 
is  formalized  and  the  more  it  is  constituted  by  sys¬ 
temic  characteristics,  the  more  the  power  relations 
become  predictable  and,  therefore,  manageable. 

Weber’s  (1978,  p.  212)  famous  distinction  be¬ 
tween  three  types  of  authority  has  been  used  to 
distinguish  between  three  types  of  contexts  within 
which  power  relations  develop:  (1)  the  context  of 
charismatic  authority,  (2)  the  context  of  tradi¬ 
tional  authority,  and  (3)  the  context  of  rational  or 
legal  authority.  In  all  three  cases,  the  decisive  crite¬ 
rion  of  distinction  between  power  and  authority  is 
the  claim  on  legitimacy  by  A  and  the  belief  in 
legitimacy  by  B.  Speaking  in  terms  of  ideal  types,  B 
who  is  confronted  with  A’s  power  has  to  comply 
willy-nilly  with  A’s  intention;  B  who  is  confronted 
with  A’s  authority  obeys  A’s  order  because  B 
believes  in  its  legitimacy. 
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In  addition  to  the  type  of  context  within  which 
power  relations  take  place,  it  is  also  important  to 
specify  on  which  level  power  is  studied:  on  the 
microlevel  of  power  relations  in  everyday  life,  on 
the  medium  level  of  power  relations  in  communi¬ 
ties,  or  on  the  macrolevel  of  power  relations  on  the 
top  of  authority  systems  or  regimes.  The  more  the 
power  relations  are  globalized,  the  greater  the  need 
for  power  analyses  both  (horizontally)  between 
different  power  centers  and  (vertically)  between 
different  interconnected  levels  of  power. 

In  summary,  it  can  be  said  that  the  concept  of 
power  consists  of  four  constituent  elements: 

1.  it  is  relational, 

2.  it  is  intentional, 

3.  it  is  based  on  resources,  and 

4.  it  is  related  to  specific  sociopolitical  contexts. 

The  concept  of  power  has  two  counterconcepts, 
influence,  on  the  one  hand,  and  authority,  on  the 
other.  The  distinctive  criterion  of  definition 
between  influence  and  power  is  the  intention  to 
exercise  power;  the  distinctive  criterion  of  defini¬ 
tion  between  power  and  authority  is  the  claim  and 
the  belief  in  the  legitimacy  of  power.  These  distinc¬ 
tions  help  in  presenting  some  of  the  most  impor¬ 
tant  debates  among  mainly  American  scholars  of 
power  and  their  contributions  to  power  analysis 
and  research. 

Debates  and  Controversies 

Between  the  1950s  and  the  1970s  in  the  United 
States,  heated  controversies  concerning  power 
raised  significant  questions  from  both  substantive 
and  methodological  perspectives.  In  the  research 
on  power,  substance  and  methodology  have  proved 
to  be  intimately  connected  with  each  other  and 
have  to  be  taken  into  consideration  simultaneously 
in  the  presentation  of  these  debates.  Political  scien¬ 
tists  have  applied  three  methods:  the  reputational, 
the  positional,  and  the  decisional.  With  reference 
to  the  patterns  of  distribution  of  power  resources, 
two  outstanding  models  have  been  formulated:  the 
elitist  and  the  pluralist. 

To  discover  the  powerful  ones  in  a  community, 
Floyd  Hunter,  who  is  positively  responsible  for 
launching  the  wave  of  research  and  controversies, 


resorted  to  the  reputational  method.  In  1953  he 
asked  a  panel  of  persons  supposedly  possessing  a 
fair  amount  of  information  concerning  the  city  of 
Atlanta  the  names  of  who  they  thought  were  the 
most  powerful  individuals  in  the  affairs  of  the  city. 
Though  he  supplemented  the  information  obtained 
this  way  with  his  own  acquired  knowledge,  his 
method  was  criticized  as  having  several  draw¬ 
backs.  Not  necessarily  are  those  who  are  visible 
automatically  powerful.  Not  always  will  prestige 
be  translated  into  power.  Moreover,  often  the  very 
powerful  ones  succeed  in  keeping  themselves  out 
of  the  limelight.  Interestingly,  however,  he  reached 
the  conclusion  that,  indeed,  in  the  city  of  Atlanta, 
a  few  powerful  individuals,  most  of  them  not  nec¬ 
essarily  occupying  political  or  other  types  of  office, 
were  considered  capable  of  guiding  and  control¬ 
ling  significant  activities.  There  was  a  dominant 
elite.  Aware  of  some  of  the  pitfalls  of  the  reputa¬ 
tional  method,  other  scholars  decided  to  rely  on 
the  positional  method.  That  is,  they  acquired 
information  on  all  those  individuals  occupying 
important  offices  in  the  city  government,  in  the 
business  community,  in  the  mass  media,  and  in 
religious  associations.  In  a  few  cases,  they  found 
some  overlapping.  In  general,  their  conclusion  was 
that  many  of  those  holding  important  offices, 
political  offices  as  well,  could  influence  in  a  sig¬ 
nificant  way  the  activities  of  their  community. 
There  did  not  exist  a  single  unified  and  dominant 
elite  but  groups  of  elites  in  competition.  Hence,  the 
structure  of  power  could  be  defined  as  pluralist. 
The  positional  method  too  was  subject  to  well- 
founded  criticism.  Wealthy  people,  especially,  may 
neither  need  nor  want  visible  offices.  In  many 
cases,  the  politicians  may  appear  in  charge,  but 
they  are  to  a  great  extent  dependent  on  individuals 
who,  for  instance,  finance  their  electoral  cam¬ 
paigns;  who  control  the  most  important  economic 
activities,  as  occurs  in  many  company  towns;  and 
who  manipulate  the  mass  media  and  the  presenta¬ 
tion  of  special  technical  knowledge  indispensable 
to  justify  some  choices  and  to  defeat  opposing 
alternatives.  The  positional  method  was  accused  of 
almost  inevitably  but  unconvincingly  leading  to  a 
pluralist  conclusion. 

Largely  relying  on  the  positional  method,  a 
highly  provocative  book  made  a  thunderstorm 
irruption  into  the  debate.  Rightly  and  wrongly, 
The  Power  Elite  by  C.  Wright  Mills  (1956)  was 
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intended  to  challenge  the  received  wisdom  of  the 
common  perception  that  the  U.S.  republic  is  based 
on  an  irrepressible,  competitive,  and  fruitful  demo¬ 
cratic  pluralism.  In  the  wake  of  McCarthyism  and 
in  the  midst  of  the  Cold  War,  Mills  bluntly  stated 
that  the  decision-making  process  at  the  federal 
level  was  dominated,  if  not  monopolized,  by  a 
restricted  combination  of  politicians  (chief  execu¬ 
tives),  military  officers  (warlords),  and  business¬ 
men  (corporate  rich):  the  power  elite.  Interestingly, 
when  leaving  the  presidency  in  1960,  former 
General  Dwight  H.  Eisenhower  himself  warned 
against  the  exaggerated  power  of  the  “military- 
industrial  complex.”  The  question  raised  and  the 
conclusions  reached  seemed  so  important  and  deli¬ 
cate  that  the  response  to  Mills’s  analysis  was  pro¬ 
vided  by  a  score  of  the  most  outstanding  social 
scientists  of  that  period:  Talcott  Parsons,  Daniel 
Bell,  and  Robert  Dahl. 

Indeed,  Dahl  and  some  of  his  students,  espe¬ 
cially  Raymond  Wolfinger  and  Nelson  Polsby, 
engaged  in  a  prolonged  battle  against  both  the 
method  used  by  Mills  and  the  implications  of  his 
substantive  analysis.  The  battle  was  continued  by 
several  scholars  well  after  Mills’s  death  in  1962.  At 
issue  were  two  extremely  important  political  and 
methodological  problems.  The  political  problem 
concerned  the  very  nature  of  American  democracy: 
Was  it  still  open  and  pluralist,  as  Dahl  and  his 
disciples  attempted  to  prove,  or  it  had  come  to  be 
dominated  by  a  few  groups  as  maintained  by  Mills 
(and  the  neo-Marxists)?  The  methodological  prob¬ 
lem  was  how  to  test  the  distribution  of  power  and 
the  way  power  was  being  exercised. 

In  response  to  Mills’s  The  Power  Elite,  Dahl 
formulated  the  decisional  approach  to  the  study  of 
political  power.  Focusing  on  one  community,  New 
Haven,  Connecticut,  where  his  university,  Yale,  is 
located,  he  chose  three  issue  areas  that  he  consid¬ 
ered  the  most  important  ones:  political  appoint¬ 
ments,  public  education,  and  urban  redevelopment. 
The  guiding  assumption  was  that  if  a  cohesive 
power  elite  existed  in  New  Haven,  its  members 
would  always  be  present  and  prominent  in  all 
three  areas.  If  different  groups  of  decision  makers 
were  present  and  influential  only  in  some  of  the 
three  issue  areas,  but  not  in  all  of  them,  the  theory 
of  the  pluralists  would  be  vindicated.  At  the  end  of 
a  long  and  detailed  study,  Dahl  (1961)  came  to  the 
conclusion  that  there  was  no  or  little  overlapping 


among  the  decision  makers  and  that  no  single 
group  of  elites  controlled  political  power  in  New 
Haven.  The  decisional  approach  seemed  to  be 
more  appropriate  than  both  the  reputational  and 
the  positional  in  identifying  those  effectively  hold¬ 
ing  and  exercising  political  power.  According  to 
Dahl,  it  was  also  superior  and  preferable  to  the 
other  approaches  because  it  provided  for  better 
empirical  testing.  Nevertheless,  the  debate  was  by 
no  means  exhausted  and  closed. 

In  an  article  published  in  the  American  Political 
Science  Review  (originally  in  1962,  then  collected 
in  a  book  in  1970),  which  became  the  single  most 
quoted  one,  Peter  Bachrach  and  Morton  Baratz 
questioned  the  method  and  the  conclusions  reached 
by  Dahl  and  by  the  pluralists.  Their  fundamental 
thesis  is  that  there  are  “two  faces  of  power.”  One 
is  a  visible  face  that  finds  expression  in  decision¬ 
making  activities.  It  is  not  necessarily  the  most 
influential.  There  is  a  second  face  that  may  often 
be  decisive  and  that  the  decisional  method  cannot 
illuminate.  It  is  the  invisible  face  that  prevents 
important,  perhaps  the  most  important,  issues 
from  coming  before  the  decision  makers.  Therefore, 
those  issues  cannot  be  discovered  by  the  decisional 
method  because  they  imply  or  are  “nondecisions.” 
Nor  will  the  powerful  ones  be  easily  identified 
exactly  because  they  operate,  so  to  speak,  in  the 
dark. 

Bachrach  and  Baratz’s  critics  immediately 
pointed  out  two  problems  concerning  their 
approach  and  conclusions.  Who  is  in  a  position  to 
convincingly  claim  that  there  are  important  issues 
in  community  life  that  are  not  even  taken  into  con¬ 
sideration?  The  pluralists  argued  that  all  truly 
important  issues  are  eventually  articulated.  In  any 
case,  as  to  the  method,  how  can  scholars  study 
nondecisions?  Inherently,  nondecisions  are  not 
researchable  and  not  subject  to  any  empirical  test. 

The  contrast  between  elitist  and  pluralist  think¬ 
ing  about  community  power  is  very  stark  and 
revolves  around  all  the  most  important  aspects  of 
the  ownership,  distribution,  and  exercise  of  power. 
The  pillars  of  the  two  competing  views  have  been 
summarized  in  an  excellent  and  impartial  manner, 
without  any  bias  either  political  or  methodologi¬ 
cal,  by  T.  R.  Dye  (1970,  pp.  36-37;  but  see  also 
Andrew  McFarland,  1969).  The  following  para¬ 
graphs  provide  a  comparative  synthesis  of  the  two 
views  (Dye,  1970,  pp.  36-38). 
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Whereas  elitists  believe  that  “political  power  is 
inextricably  bound  together  with  social  and  eco¬ 
nomic  power”  (Dye,  1970,  p.  36)  and,  therefore,  it 
is  likely  that  “men  of  wealth  and  social  position  in 
the  community  are  also  men  of  power”  (p.  36), 
pluralists  consider  power  an  attribute  of  individu¬ 
als  that  finds  expression  in  the  ability  to  induce 
other  individuals  to  act  in  ways  they  would  not 
otherwise  do  if  power  were  not  applied.  Where  elit¬ 
ists  stress  that  power  is  “structured”  and,  there¬ 
fore,  bound  to  last  as  long  as  the  men  having  power 
continue  in  their  roles,  pluralists  believe  that  power 
is  bound  to  pass  and  that,  in  any  case,  it  may  lapse 
away  after  some  specific  decisions  are  made. 

Whereas  elitists  state  that  there  is  a  clear  and 
persistent  distinction  between  the  elites  and  the 
masses  and  that  the  members  of  the  masses  will 
rarely  succeed  in  joining  the  ranks  of  the  elites,  plu¬ 
ralists  almost  rule  out  the  distinction  between  elites 
and  masses  and  believe  that  it  is  possible  for  many 
individuals  to  exercise  decision-making  power  at 
different  points  in  time.  Elitists  stress  that  power 
derives  fundamentally  from  the  unequal  distribution 
of  economic  resources  and  that  politicians  enjoy 
limited  autonomy  vis-a-vis  business  and  financial 
leaders,  while  pluralists  hold  that  the  power  to  par¬ 
ticipate  in  decision  making  derives  from  the  amount 
of  interest  individuals  have  in  some  specific  issues. 

Elitists  consider  that  the  powerful  are  a  small 
group  in  any  and  all  communities  because  political 
power  is  correlated  with  higher  degrees  of  educa¬ 
tion,  higher  levels  of  income,  and  more  prestigious 
occupations;  in  contrast,  pluralists  believe  that 
power  is  acquired  through  superior  leadership 
skills,  greater  managerial  capabilities,  and  better 
knowledge  of  the  democratic  process.  Wealth  may 
help,  but  it  is  never  decisive.  For  elitists,  the  image 
of  power  in  a  community  is  that  of  a  pyramid:  very 
few  on  the  top  having  very  much  power.  For  plu¬ 
ralists,  power  is  better  depicted  in  the  shape  of  an 
onion:  a  limited  concentration  of  power  on  the 
top,  much  power  given  to  those  in  the  large  middle 
of  the  onion,  and  very  few  powerless  individuals  at 
the  bottom.  For  elitists,  men  of  power  share  the 
same  basic  values  and  outlook,  while  pluralists 
point  at  the  existence  of  a  competition  among  dif¬ 
ferent  leaders  seeking  divergent  policies  and  being 
obliged  to  bargain  and  to  compromise. 

Finally,  according  to  elitists,  the  elites  do  not 
have  to  be  responsible  to  the  masses,  who  are 


misinformed  and  unable  to  exert  pressures  on  the 
elites.  To  avoid  potential  tensions  and  conflicts, 
however,  the  elites  may  behave  in  a  benevolent 
manner  out  of  a  sense  of  duty  to  their  respective 
communities.  According  to  pluralists,  however, 
community  leaders  are  responsible  to  a  variety  of 
associations.  Elections  enforce  an  additional  sense 
of  accountability  and  allow  some  elite  circulation. 

The  next  stage  of  the  debate  focused  on  the 
question  of  how  nondecisions  could  be  defined 
more  precisely  and  whether  they  could  in  fact  be 
identified  and  made  amenable  to  empirical 
research.  To  rescue  the  idea  lying  behind  nondeci¬ 
sions,  Frederick  Frey  (1971)  suggested  speaking  of 
“suppressed  issues,”  that  is,  issues  that  do  exist, 
are  important,  and  would  deserve  attention. 
However,  dominant  groups  are  capable  of  vetoing 
them  and  preventing  them  from  even  getting  on 
the  agenda,  for  fear  of  diverting  their  political 
path.  The  implication  is  that  the  structure  of 
power  in  a  given  community  would  make  it  rather 
unlikely  or  well-nigh  impossible  that  some  contro¬ 
versial  issues  be  raised  and  find  a  place  in  the 
decision-making  process.  Avoiding  any  reference 
to  Marxist  scholars,  the  authors  engaged  in  stress¬ 
ing  the  possibility  of  nondecisions  have  been  fond 
of  quoting  a  sentence  by  Schattschneider  (1960): 

All  forms  of  political  organisation  have  a  bias  in 
favour  of  the  exploitation  of  some  kinds  of 
conflict  and  the  suppression  of  others,  because 
organisation  is  the  mobilization  of  bias.  Some 
issues  are  organised  into  politics  while  others  are 
organised  out.  (p.  71) 

Of  course,  not  even  the  recognition  of  the  pos¬ 
sible  existence  of  a  mobilization  of  bias  offers  a 
completely  satisfactory  solution  key  to  the  analy¬ 
sis  of  the  suppressed  issues  of  when,  how,  by 
whom,  and  with  what  consequences.  A  convinc¬ 
ing  explanation  is  still  required  of  why  there  is  a 
specific  structure  of  power,  how  it  came  into 
being,  and  how  it  succeeded  in  consolidating  itself 
and  in  persisting.  If  the  answer  is  that  the  struc¬ 
ture  of  power  itself  has  come  into  being  thanks  to 
the  suppression  of  some  not  visible  alternatives, 
then  scholars  would  find  themselves  engaged 
potentially  in  an  infinite  regress.  It  will  also  be  of 
not  minor  interest  to  know  whether  and  how  that 
structure  of  power  has  withstood  what  kind  of 
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challenges  and  whether  and  how  it  has  been 
transformed. 

Reanalyzing  and  summarizing  the  entire  debate 
in  an  incisive  essay,  Steven  Lukes  (1974)  tried  to 
go  beyond  both  approaches.  He  referred  to  the 
decisional  approach  as  one-dimensional,  to  the 
nondecisional  approach  as  two-dimensional,  and 
then  he  offered  his  own  three-dimensional  view. 
Lukes’s  view  (1974)  combines  and  redefines  all  the 
elements  of  the  two  other  approaches,  focusing  on 
“(a)  decision-making  and  control  over  the  political 
agenda  (not  necessarily  through  decisions); 
(b)  issues  and  potential  issues;  (c)  observable  (overt 
or  covert)  and  latent  conflict;  (d)  subjective  and 
real  interests”  (p.  25).  This  highly  ambitious  blue¬ 
print  has  not  found  any  practitioner  so  far. 
Moreover,  one  cannot  refrain  from  remarking  that, 
since  Lukes  refers  to  covert  and  latent  conflicts  and 
to  subjective  and  real  interests,  his  research  pro¬ 
posal  automatically  falls  under  the  criticism  of  all 
scholars  who  consider  of  the  utmost  relevance  the 
identification  of  power  through  observable  behav¬ 
ior  and  the  drafting  of  an  operational  definition. 

In  a  way,  however,  though  unwillingly,  Lukes 
exemplifies  the  major  problem  involved  in  the 
analysis  of  power.  There  are  many  faces  of  power. 
While  the  one  that  refers  to  decision  making  is  easy 
to  identify,  scholars  know  that  decisions  are  only 
the  last  stage  in  a  long  process/procedure.  At  the 
beginning  of  that  procedure,  power  may  be  expressed 
in  writing  the  agenda  and  in  initiating  or  vetoing 
some  issues.  Of  course,  this  face  of  power  requires 
the  difficult  acquisition  of  information  that  the  pow¬ 
erful  may  want  to  suppress  and  may  succeed  in 
doing  so.  Therefore,  the  scholarly  (and  the  political) 
attention  should  turn  to  the  analysis  of  the  overall 
social  and  political  structure  to  detect  whether  that 
structure  has  incorporated  some  bias  and,  in  case  it 
has,  what  kind  of  bias  it  is.  The  inescapable  conclu¬ 
sion  is  that  power  must  be  studied  with  reference  to 
an  existing  system  (political,  social,  economic,  reli¬ 
gious,  cultural,  and  media).  As  a  consequence,  the 
overall  characteristics  of  the  system  acquire  major 
importance  in  drawing  the  boundaries  of  its  exercise 
and  in  assessing  its  impact. 

Power  in  the  International  System 

At  this  point,  a  digression  is  needed  to  briefly 
address  power  in  the  international  system. 


Unfortunately,  the  interaction  between  political 
scientists  interested  in  the  distribution  and  exercise 
of  power  within  specific  political  systems  and  inter¬ 
national  relations  analysts  has  been  poor,  infre¬ 
quent,  and  sterile.  However,  it  should  be  clear  that 
power  among  nations  constitutes  a  significant  field 
for  analysis.  For  more  than  a  century,  the  debate 
among  historians  on  what  kind  of  equilibrium 
existed  in  European  politics  was  dominated  by  the 
expression  and  the  theme  of  “balance  of  power.” 
That  such  a  balance  existed  so  that  no  single 
European  state  should  acquire  control  of  Europe 
was,  indeed,  the  goal  successfully  pursued  by  the 
United  Kingdom  throughout  the  19th  century. 
Historians  and  international  relations  analysts  are 
perfectly  aware  that  the  ways  some  states  exercise 
power  over  other  states;  control,  overtly  and/or 
covertly,  their  behavior;  shape  the  international 
agenda;  and  organize  the  international  system  are 
all  themes  worthy  of  analysis  in  terms  of  power. 
Some  (see,  especially,  A.  F.  K.  Organski  &  Jacek 
Kugler,  1980)  have  also  come  to  the  realization 
that  even  in  the  domain  of  international  politics, 
power  is  neither  simply  nor  exclusively  a  matter  of 
physical  resources.  It  consists  more  specifically  of 
the  capability  in  mobilizing,  organizing,  deploying, 
and  orienting  those  resources.  Not  surprisingly  in 
war,  democracies  have  shown  more  power  than 
expected:  far  superior  to  the  supposed  power  of 
authoritarian  regimes  of  all  varieties. 

Several  of  the  tools  and  the  approaches  used  in 
national  contexts  can  be  put  to  work  when  dealing 
with  the  international  system.  Some  truly  signifi¬ 
cant  historical  phenomena  affected  by  the  resort  to 
power  and  that  cannot  be  convincingly  explained 
without  taking  into  account  several  forms  of  power 
are,  for  example,  colonialism  and  imperialism. 

The  reappearance  of  a  debate  on  whether  and 
how  the  United  States,  revealingly  defined  as  a 
superpower  (the  only  remaining  one),  can  be  char¬ 
acterized  as  an  empire  or  not  has  brought  to  light 
a  distinction  between  hard  power  and  soft  power 
(Joseph  Nye,  2004).  Hard  power  is  based  on  mili¬ 
tary  force  leading  to  outright  war;  soft  power  relies 
on  diplomacy,  persuasion,  manipulation  of  cir¬ 
cumstances,  and  cultural  influence.  A  superpower 
may  launch  and  perhaps  win  some  or  even  most 
battles,  but  it  will  be  unable  to  conquer  the  minds 
of  individuals  if  it  resorts  only  to  violent  actions. 
On  the  contrary,  hard  power  is  likely  to  arouse 
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and  excite  hatred.  Though  from  many  points  of 
view  the  United  States  is,  indeed,  a  superpower,  in 
the  not  too  distant  future  likely  to  be  challenged  in 
terms  of  power  by  China,  there  are  scholars  who 
believe  that  at  the  international  level  it  has  become 
quite  difficult  to  identify  who  really  has  power. 
Globalization  seems  to  a  be  a  process  dominated 
by  impersonal  factors  and  hidden  protagonists, 
affecting,  of  course,  in  very  different  ways,  all 
states  and  actors,  whose  responsibility  and  power 
cannot  be  easily  (or  at  all)  ascertained  and  attrib¬ 
uted  (Susan  Strange,  1996).  Even  at  the  level  of  the 
international  system,  power  seems  to  be  every¬ 
where,  but  its  detection  and  its  analysis  have 
become  and  remain  extremely  complicated  tasks. 
And,  of  course,  when  dealing  with  events  taking 
place  at  the  level  of  the  international  system,  there 
are  scholars  who  believe  that  most  events  are 
affected  by  one  player,  the  United  States,  and 
scholars  who  believe  that  the  situation  is  extremely 
complex  and  that  there  are  a  plurality  of  actors, 
including  several  international  organizations, 
affecting  the  distribution  and  the  exercise  of  power 
in  the  international  system. 

There  has  always  been  another  important  sector 
of  power  research,  identified  mainly  in  Europe  in 
the  late  1960s.  It  is  less  ideologically  loaded  than 
the  debates  among  U.S.  scholars  and  more  aimed  at 
a  historical  and  sociological  understanding  of  the 
functioning  above  all  of  totalitarian  regimes  in  the 
past.  This  research  has  been  focused  on  regime 
types — democratic,  authoritarian,  and  totalitarian, 
roughly — and  on  the  formal  and  informal  rules 
presiding  over  all  the  processes  from  the  acquisition 
to  the  transmission  of  political  power.  Especially 
noteworthy  in  the  three  regime  types  must  be  con¬ 
sidered  the  relationship  between  power  and  the 
existence  of  organized  groups  and  their  competi¬ 
tion  and  transformation  through  time.  The  disper¬ 
sion  of  power  among  several  competing  groups  is  a 
crucial  feature  of  democratic  regimes.  Limited,  not 
competitive,  and  not  responsible  pluralism,  as 
defined  by  Juan  Linz  (1964),  is  the  hallmark  of 
authoritarian  regimes,  while  the  power  monopoly 
of  one  actor,  the  single  party,  characterizes  totali¬ 
tarian  regimes. 

With  reference  to  the  Nazi  regime,  M.  Rainer 
Lepsius  (1968)  has  analyzed  the  structure  of  inter¬ 
mediary  groups  and  its  relation  to  charismatic  lead¬ 
ership.  The  process  the  Nazis  called  Entmachtung 


(de-powerization)  “meant  a  double  process  of 
reducing  and  relocating  social  power  in  society” 
(p.  297).  Powerful  interest  groups  were  destroyed, 
and  the  remaining  less  powerful  ones  transformed 
into  agencies  of  the  Nazi  party  competing  with 
each  other  for  their  survival.  Under  the  condition 
“that  there  exists  an  atmosphere  of  competition 
and  suspicion”  (p.  63),  no  bonds  of  solidarity 
between  the  different  power  groups  were  created, 
thus  leaving  a  space  of  direct  control  to  the  charis¬ 
matic  leader.  The  combination  of  a  fragmented 
and  weak  power  structure  on  the  intermediary 
level  with  the  centralization  of  power  on  the  top 
level  of  the  regime  was  the  structural  precondition 
for  the  excessive  and  arbitrary  (mis)use  of  power 
by  Hitler. 

When  looking  at  more  complex  chains  of  power 
structures  on  the  international  level,  an  interesting 
observation  has  been  made  by,  among  others,  John 
Gerard  Ruggie  (1998).  Along  with  the  formation 
of  international  regimes,  the  role  of  power  has 
changed.  In  international  regimes,  the  capability  to 
cooperate  and  to  make  compromises  has  become 
more  important  in  making  politics  than  the  capa¬ 
bility  to  mobilize  power  resources  and  to  exercise 
power.  Accordingly,  power  research  has  been  inte¬ 
grated  into  game  theory,  bargaining  theory,  and 
theories  of  rational  choice.  This  is  why  it  may  seem 
that  power  research  has  disappeared  from  the 
social-scientific  agenda.  But  with  the  change  of  the 
role  and  the  appearance  of  power,  social-scientific 
research  on  power  also  has  changed  its  theoretical 
approach  and  conceptual  labels.  The  more  power  is 
spread  worldwide  into  “multiple  overlapping  and 
intersecting  socio-spatial  networks  of  power” 
(Michael  Mann,  1986,  p.  1),  the  more  also  power 
research  is  spread  into  multiple  overlapping  research 
areas.  The  question  “Who  has  how  much  power 
over  whom?” — which  had  dominated  power 
research  in  the  last  decades  of  the  19th  century — 
has  been  replaced  by  the  questions:  “Which  intended 
and  unintended  effects  does  power  have?”  and 
“Which  functions  does  power  fulfill  for  the  devel¬ 
opment  of  complex,  interrelated  supranational 
regimes?”  The  integration  of  nation-states  into 
supranational  systems  can  have  the  effect  of  neu¬ 
tralizing  the  power  the  nation-state  has  over  its  citi¬ 
zens.  This  is,  for  example,  the  case  when  the 
European  Court  of  Justice  intervenes  in  the  domes¬ 
tic  legal  practices  of  its  member  states.  The  principle 
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of  “subsidiarity”  can  be  understood  as  the  member 
states’  attempt  to  institutionalize  counterpower 
against  what  Christopher  Booker  (1996,  p.  203) 
polemically  has  coined  the  “new  totalitarianism” 
of  the  European  Union  (EU).  This  principle  is, 
however,  overruled  by  the  European  Court  of 
Justice.  With  the  increasing  importance  of  supra¬ 
national  systems,  power  research  is  confronted 
with  the  demanding  task  of  analyzing  two-  and 
multiple-level  power  games  (Andrew  Moravcsik, 
1996,  p.  299).  What  is  gained  in  terms  of  power 
on  one  level  can  be  lost  on  the  next  level  and  vice 
versa.  The  possibility  of  binding  allocation  of 
power  may  get  lost  in  multilevel  power  games. 
Moreover,  the  example  of  the  history  of  the  forma¬ 
tion  of  the  EU  shows  that  successful  bargaining 
power  can  produce  unintended  consequences  and 
even  paradoxes.  In  this  sense,  Fritz  Scharpf  has 
coined  the  concept  of  the  “joint-decision  trap”: 
“What  is  missed  is  the  possibility  that  authority 
might  not  be  allocated,  in  zero-sum  fashion,  to 
either  one  or  the  other  level  of  government,  but  it 
might  be  shared  by  both”  (1996,  p.  271).  The 
institutional  development  of  the  EU  is,  according 
to  Scharpf,  blocked  by  joint-decision  traps. 
Although  Parsons  did  not  develop  his  idea  of 
power  with  relation  to  the  empirical  case  of  the 
institution  building  of  the  EU,  his  insights  about 
the  positive  function  of  power  can  be  applied  to 
this  example.  If  the  institutional  context  fails  to 
allocate  power  in  a  binding  way,  power  cannot 
function  as  a  mechanism  to  reduce  action  alterna¬ 
tives  and  to  speed  up  the  decision-making  process. 

The  study  of  power  on  and  between  levels  of 
complex  political  regimes  makes  it  possible  to  lib¬ 
erate  the  notion  of  power  from  its  negative  norma¬ 
tive  bias,  which  characterized  the  debates  in  the 
1970s.  Power  was  mainly  seen  under  the  perspec¬ 
tive  of  its  threatening  potential  for  democratic 
institutions.  There  is  no  doubt  that  power  research 
under  this  critical  perspective  is  necessary  if  social 
scientists  desire  to  fulfill  their  task  and  to  detect 
violations  against  the  institutionalized  norms  and 
values  of  democratic  rule.  But  it  is  equally  impor¬ 
tant  for  social  scientists  to  investigate  the  condi¬ 
tions  under  which  binding  power  allocation  fails 
and  prevents  the  working  and  development  of 
institution  building. 

One  must  recall  Max  Weber’s  realist  definition 
of  politics:  “He  who  is  active  in  politics  strives  for 


power  either  as  a  means  in  serving  other  aims, 
ideal  or  egoistic,  or  as  ‘power  for  power’s  sake,’ 
that  is,  in  ‘order  to  enjoy  the  prestige-feeling  that 
power  gives’”  (Hans  Gerth  &  Charles  Mills,  1948, 
p.  78).  A  realist  approach  to  the  study  of  power 
should  take  into  consideration  the  multiple  func¬ 
tionality  of  power  for  complex  political  regimes. 
This  is  especially  urgent  under  the  conditions  of 
globalization.  Curiously,  the  processes  of  global¬ 
ization  have  seemed  to  suggest  that  economic  and 
political  power  exists  and  manifests  itself  in  sig¬ 
nificant  ways  and  doses  but  that  it  cannot  be  pre¬ 
cisely  located  and  assessed.  The  truth  of  the  matter 
is  that  the  power  of  power  consists  also  in  being 
and  remaining  quite,  if  not  substantially,  elusive. 

Gianfranco  Pasquino 
University  of  Bologna 
Bologna,  Italy 
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Power  and  International 
Politics 


Different  aims  inform  the  study  of  power. 
Simplifying  somewhat,  power  plays  a  role  in  two 
distinct,  if  related,  domains.  In  the  field  of  political 
theory  as  understood  here,  the  purpose  of  analyses 
of  power  is  to  capture  the  nature  of  the  polity  in 
which  questions  of  the  organization  of  (organized) 
violence,  the  common  good,  and  freedom  are  para¬ 
mount.  In  these  studies,  power  stands  for  “govern¬ 
ment”  or  “governance”  and  political  “order,”  as 
well  as  personal  “autonomy.”  The  logic  in  the  field 
of  explanatory  theories,  on  the  other  hand,  is  to 
think  of  power  in  terms  of  a  theory  of  action  first 
and  a  theory  of  domination  subsequently.  Here, 
power  is  central  to  the  explanation  of  behavior  and 
the  outcomes  of  social  action.  It  refers  to  “agency” 
and  “influence,”  if  not  “cause,”  and  to  rule  or 
domination.  This  entry  considers  various  concep¬ 
tions  of  power  in  international  relations,  including 
the  realist,  institutionalist,  structuralist,  and  con¬ 
structivist  views,  and  discusses  how  they  approach 
the  nature  of  the  polity  and  the  explanatory  use  of 
the  concept  of  power. 

Classical  realists  argue  that  because  there  is  no 
mechanism  for  controlling  the  competition  among 
sovereign  states,  power  is  simply  the  possession  of 
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resources  and  their  use  to  exercise  control  at  the 
international  level.  For  classical  realist  thinkers  in 
international  relations  (IR),  the  particular  context 
of  world  politics  suggests  that  these  domains  could 
be  fruitfully  merged — if  there  is  no  governance  at 
the  international  level,  the  first  description  of 
power  as  a  political  order  can  be  merged  with  a 
theory  of  action.  With  no  world  government,  the 
international  system  seems  to  lack  not  only  an 
ordering  authority  but  a  polity  altogether.  In  a 
sense,  therefore,  it  is  possible  to  think  of  world 
politics  as  the  simple  aggregation,  and  balance,  of 
agent  capacities  to  influence,  such  that  no  further 
theory  was  needed.  In  the  explanatory  domain, 
power  then  becomes  a  central  variable  in  a  double 
causal  link.  Power  understood  as  resources  or 
capabilities  is  an  indicator  of  the  strength  of  actors 
and,  consequently,  of  the  capacity  to  affect  or  con¬ 
trol  events.  Likewise,  a  general  capacity  to  control 
outcomes  has  been  used  as  an  indicator  for  the 
ruling  of  the  international  system.  Rather  than  see¬ 
ing  the  two  domains  as  separate,  the  special  nature 
of  world  politics  could  combine  them  in  an  explan¬ 
atory  sequence:  By  knowing  who  can  be  expected 
to  win  conflicts,  we  would  also  know  who  or  what 
governs  international  politics,  which,  given  the 
absence  of  a  world  polity,  is  all  there  is  to  know 
about  power  as  order  and  government. 

Power  analyses  in  recent  decades  have  challenged 
these  tacitly  assumed  links  and  ultimately  also  the 
neglect  of  the  concerns  of  political  theory.  Criticizing 
realism’s  “lump”  concept  of  power  (according  to 
which  all  the  elements  of  power  can  be  combined  so 
that  the  degree  of  a  state’s  power  can  be  measured 
by  a  single  indicator),  neo-institutionalism  has  tried 
to  redefine  the  link  between  resources  and  out¬ 
comes.  Similarly,  with  regard  to  the  understanding 
of  “rule”  and  “governance,”  different  structural 
power  approaches  have  demonstrated  the  need  to 
conceive  of  more  encompassing  power  concepts  so 
as  to  capture  important  but  otherwise  neglected 
facets  of  international  rule.  Poststructuralist  and 
constructivist  approaches  focus  on  power  as  author¬ 
ity  and  legitimacy,  established  not  through  an  open 
social  contract  but  in  the  habitual  working  of  dis¬ 
courses  and  practices  that  disempower/empower 
agents.  When  aiming  at  an  understanding  of  the 
world  polity,  they  also  refer  back  to  the  domain  of 
political  theory.  But  by  starting  from  a  historical 
reconceptualization  of  politics  and  order,  they  do 


this  by  stepping  altogether  outside  an  analysis  in 
terms  of  these  two  tacitly  assumed  links. 

The  Concept  of  Power  in  Realist  Theories 

Although  classical  realism  does  make  strong  polit¬ 
ical  assumptions  about  human  nature  and  the  role 
of  power  in  politics,  the  two-step  analysis  of  power 
mentioned  above,  which  is  driven  by  the  explana¬ 
tory  domain,  has  become  dominant.  There,  on  the 
macrolevel,  realist  theory  relies  on  the  concept  of 
the  balance  of  power.  This  presupposes  a  common 
denominator  for  power  in  which  all  its  aspects  can 
be  coherently  aggregated.  On  the  microlevel,  real¬ 
ist  theory  relies  on  the  idea  that  states  are  inter¬ 
ested  in  relative  gains  in  power.  Each  of  these 
concepts  presupposes  that  power  is  measurable. 
Indeed,  such  theories  require  a  concept  of  power 
akin  to  the  concept  of  money  in  economic  theory. 
In  this  analogy,  the  striving  for  utility  maximiza¬ 
tion  expressed  and  measured  in  terms  of  money 
parallels  the  national  interest  (i.e.,  security) 
expressed  in  terms  of  (relative)  power. 

This  central  assumption  has  been  challenged  by 
early  realist  critiques  and  more  recent  institutional¬ 
ist  approaches.  In  an  early  argument  that  also 
anticipates  and  implicitly  criticizes  the  economic 
analogy  in  neorealist  theory,  Raymond  Aron 
opposed  this  aggregated  concept  of  power  and  the 
underlying  power-money  analogy.  The  different 
degrees  of  the  fungibility  of  money  and  power 
resources  make  this  impossible.  The  term  fungibil¬ 
ity  refers  to  the  idea  of  a  moveable  good  for  which 
another  good  in  the  same  class  can  be  freely  substi¬ 
tuted.  Fungible  goods  are  universally  applicable  or 
convertible,  in  contrast  to  those  that  retain  value 
only  in  a  specific  context.  Whereas  fungibility 
seems  a  plausible  assumption  in  monetarized  econ¬ 
omies,  it  is  not  so  in  world  politics;  even  apparently 
ultimate  power  resources  such  as  weapons  of  mass 
destruction  may  not  be  of  great  help  in  getting 
another  state  to  change  its  monetary  policies. 

Aron  recognized  that  economic  theory  can  be 
used  to  model  behavior  on  the  basis  of  a  variety  of 
conflicting  preferences.  But  for  him,  with  the 
advent  of  money  as  a  general  standard  of  value 
within  which  these  competing  preferences  can  be 
situated  on  the  same  scale,  compared,  and  traded- 
off,  economists  were  able  to  reduce  the  variety  of 
preferences  to  one  utility  function.  Because  power 
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in  world  politics  lacks  a  real-world  fungibility,  it 
cannot  play  a  corresponding  role  as  a  standard  of 
value.  Therefore,  power  cannot  be  the  currency 
of  world  politics,  and  national  security  in  terms  of 
power  is  not  equivalent  to  a  utility  function. 

In  response,  realists  insisted  that  diplomats  had 
repeatedly  been  able  to  find  a  measure  of  power, 
and  hence,  the  difference  is  just  one  of  degree,  not  of 
kind.  Yet  even  if  actors  can  agree  on  some  approxi¬ 
mations  for  carrying  out  exchanges  or  establishing 
power  rankings,  this  is  a  social  convention  that  by 
definition  can  be  challenged  and  exists  only  to  the 
extent  that  it  is  agreed  on.  Power  resources  do  not 
come  with  a  standardized  price  tag. 

With  the  link  between  resources  and  outcomes 
forgone,  the  realist  chain  of  causes  for  understand¬ 
ing  the  international  structure  is  broken,  for  a  sin¬ 
gle  international  power  structure  relies  either  on 
the  assumption  of  a  single  dominant  issue  area  or 
on  a  high  fungibility  of  power  resources — neither 
of  which  are  realistic. 

Neo-Institutionalism:  Redefining  the  Link 
Between  Resources  and  Outcomes 

When  the  United  States  lost  the  war  in  Vietnam, 
some  scholars  tried  to  explain  this  power  paradox 
away  by  identifying  the  lack  of  “will”  on  the  part 
of  the  United  States  to  use  its  resources — that  is, 
so-called  conversion  failures.  According  to  such  an 
explanation,  the  war  did  not  indicate  the  relative 
weakness  of  the  United  States  (in  spite  of  its  mili¬ 
tary  capabilities)  but  simply  its  inability  to  use  its 
strength.  Obviously,  such  an  explanation  can  rein¬ 
terpret  any  outcome  ex  post  to  suit  any  power 
distribution.  Often,  the  trouble  with  this  type  of 
power  analysis  is  not  that  it  is  wrong  but  that  it 
cannot  go  wrong.  Neo-institutionalist  analysis 
offers  two  responses. 

One  conceptual  way  out  consisted  in  accepting 
the  apparent  lesson  of  the  Vietnam  War.  Con¬ 
sequently,  control  over  resources,  even  issue  area- 
specific  ones,  does  not  necessarily  translate  into 
control  over  outcomes.  Power  no  longer  functions 
as  a  determining  cause.  In  Robert  Keohane’s  anal¬ 
ysis,  for  instance,  determinacy  in  the  explanation 
shifts  from  interests  defined  in  terms  of  the  distri¬ 
bution  of  power  to  rational  choice  made  on  the 
basis  of  given  interests  defined  in  terms  of  power, 
expectations,  values,  and  conventions.  Hence,  only 


predictions  of  a  very  limited  kind  are  possible, 
with  a  secondary  role  for  power. 

Another  solution  to  the  paradox  of  unrealized 
power  has  been  proposed  by  David  Baldwin,  who 
has  taken  the  issue  of  power  fungibility  most  seri¬ 
ously.  His  approach  keeps  a  strong  causal  role  for 
power  by  further  specifying  the  relational  and  situ¬ 
ational  context  that  defines  which  policy  instru¬ 
ments  can  count  as  actual  power  resources  in  the 
first  place.  Baldwin’s  conception  is  shaped  by  his 
relational  understanding  of  power.  If  power  is  about 
the  capacity  to  get  someone  else  to  do  what  he  or 
she  would  not  have  otherwise  done,  then  threaten¬ 
ing  a  suicide  candidate  with  a  gun  implies  that  the 
person  holding  the  gun  has  no  power.  In  other 
words,  power  resources  have  no  intrinsic  value  or 
effect  but  depend  on  the  actual  value  systems  of 
human  beings  in  their  relations  with  each  other. 

Hence,  the  major  difference  from  utilitarian 
action  theories  is  that  personal  value  systems  can¬ 
not  be  simply  assumed  in  the  empirical  power 
analysis.  Instead,  the  researcher  has  first  to  analyze 
the  value  systems  of  the  interacting  parties  in  order 
to  establish  whether  there  are  any  power  resources 
in  the  first  place.  For  this  reason,  Baldwin  insists 
that  one  can  only  study  power  if  understood  as  a 
causal  variable  in  well-circumscribed  policy  con¬ 
tingency  frameworks.  Any  assessment  of  power 
independent  of  such  situational  factors  is  errone¬ 
ous,  and  any  generalization  beyond  such  cases  is 
contingent  and  has  to  be  established  separately. 

The  price  for  this,  however,  is  that  power  analy¬ 
sis  must  potentially  be  very  narrowly  confined  to 
particular  instances,  where  no  prediction  is  possi¬ 
ble.  Whereas  Keohane’s  institutionalist  move 
retained,  in  a  limited  way,  the  predictive  capacity 
of  a  theory  based  on  rational  choice  and  not  on 
power,  Baldwin’s  move  saves  a  central  causal  role 
for  power  at  the  price  of  predictability  in  IR/IPE 
(international  political  economy).  Keohane’s  solu¬ 
tion  points  to  the  direction  of  a  rationalist 
neo-institutionalism  and  Baldwin’s  to  a  less  gener- 
alizable,  contingent,  and  situational  solution  to 
historical  institutionalism. 

Structural  Power  in  the 
Global  Political  Economy 

With  the  link  between  resources  and  control  weak¬ 
ened,  the  micro-macro  link  between  control  over 
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outcomes  and  international  rule  might  not  be 
worthwhile  studying  at  all.  Yet  this  is  where  inter¬ 
national  political  economy  (and  constructivism 
and  poststructuralism;  see  below)  have  made  their 
most  important  contributions  to  the  analyses  of 
power.  In  fact,  concepts  of  structural  power  rede¬ 
fine  the  context  within  which  strategic  interaction 
takes  place,  the  resources  considered  important  for 
assessing  capabilities  in  the  first  place,  and  the 
outcomes  that  should  be  included  in  power  analy¬ 
sis.  Their  common  claim  is  that  the  sole  reference 
to  the  first  link,  as  made  by  neo-institutionalists,  is 
insufficient,  if  not  biased,  for  understanding  rule  in 
the  international  system.  It  is  the  second  link, 
between  outcomes  and  rule,  that  becomes  the 
starting  point  in  analyses  of  power. 

A  first  version  of  structural  power  might  be 
called  indirect  institutional  power.  This  refers  to 
the  conscious  manipulation  of  the  institutional  set¬ 
ting  within  which  bargaining  relations  take  place. 
Many  important  issues  are  decided  before  they 
reach  the  bargaining  stage — indeed,  often  because 
they  never  reach  it.  For  understanding  the  distribu¬ 
tion  of  power,  it  is  as  important  to  see  who  pre¬ 
vails  in  decision  making  as  it  is  to  analyze  which 
“nondecisions”  were  made.  Despite  occasional 
claims  to  the  contrary,  this  version  is  perfectly 
compatible  with  neo-institutionalist  approaches. 

Structural  power  has  also  been  conceptualized 
as  nonintentional  power.  Susan  Strange’s  concept 
of  structural  power  stresses  both  the  diffusion  of 
the  origins  of  power  (and  the  variety  of  power 
resources)  and  the  diffusion  of  its  effects.  Here, 
there  is  no  reason  to  exclude  from  power  analysis 
all  those  crucial  effects  that  might  not  have  been 
intended.  As  an  old  Chinese  saying  has  it,  it  makes 
little  difference  to  the  trampled  grass  beneath 
whether  the  elephants  above  it  are  making  love  or 
war.  This  analytical  shift  from  intentions  to  effects 
diminishes  the  importance  of  the  neo-institutional- 
ist  approach  for  understanding  power  based  on 
resources,  interests,  and  rationality.  It  focuses  on 
the  systematic  and  structural  aspects  of  power,  not 
on  the  chosen  ones. 

Third  and  finally,  structural  power  can  also  be 
understood  as  systematic  bias  or  impersonal 
power.  This  refers  to  an  impersonal  “mobilization 
of  bias”  whereby  social  structures  systematically 
favor  certain  agents.  Such  an  understanding  of 
power  is  common  currency  in  both  Marxist  and 


non-Marxist  writing  on  dependency,  as  well  as  in 
neo-Gramscian  approaches;  it  also  applies  to  con¬ 
structivist  and  poststructuralist  approaches  that 
emphasize  nonmaterialist  structural  biases  (see 
below).  Such  a  conceptualization  has  been  criti¬ 
cized  for  deducing  power  from  rewards,  the  so- 
called  benefit  fallacy  of  power.  We  usually  do  not 
consider  a  free  rider  powerful  because  the  free 
rider  typically  has  no  control  over  the  systematic 
arrangement  that  makes  free  riding  possible.  But 
the  benefit  fallacy  exists  only  within  a  causal 
framework.  To  say  that  a  system  systematically 
benefits  certain  people  does  not  mean  that  they 
have  created  that  benefit  or  that  they  control  it.  It 
simply  means  that  to  understand  power  in  a  social 
system,  we  need  to  take  into  account  the  links 
between  rules  and  outcomes  that  systematically 
benefit  some  actor. 

Structural  power  analysis  in  international  polit¬ 
ical  economy  tries  to  overcome  the  difficulty  of 
conceiving  power  along  the  resource-outcome- 
rule  line  by  starting  from  the  other  end.  These 
approaches  run  into  two  types  of  problem,  how¬ 
ever.  First,  they  tend  to  overplay  the  causal 
strength  of  their  analysis.  Moving  backward  from 
rule  to  outcomes  faces  similar  problems  to  moving 
from  resources  to  outcomes.  “The  United  States 
won  because  of  its  structural  power”  faces  the 
same  translation  or  conversion  questions  as  classi¬ 
cal  resource-based  analysis.  It  often  appears  to 
offer  an  answer  when  in  reality  it  begs  the  ques¬ 
tion.  The  second  risk  is  related  to  this.  IPE 
approaches  tend  to  understate  the  nonmaterialist 
aspects  of  rule  or  governance,  particularly  values 
and  beliefs,  and,  indeed,  the  extent  to  which  struc¬ 
tures  affect  events  only  though  the  meaning  given 
to  them. 

Rule  in  World  Politics:  The  Social 
Construction  of  Legitimacy  and  Order 

Still  staying  with  IR’s  emphasis  on  the  explanatory 
domain  of  power  analysis,  constructivism  redefines 
power  at  the  systemic  and  agent  levels  in  IR.  Its 
systemic  analysis  of  power  often  looks  at  the  ori¬ 
gins  of  consent  in  terms  of  practices  of  tacit  legiti¬ 
macy.  It  is  therefore  close  to  power  concepts  of  the 
family  of  “authority.”  But  rather  than  looking  at 
formal  or  institutional  authority,  constructivism  is 
interested  in  the  intersubjective  practices  of 
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power — not  in  the  position  of  authority,  therefore, 
but  rather  in  what  “authorizes,”  “legitimates,”  or 
“empowers”  it.  Moreover,  it  is  not  necessarily 
looking  at  intentional  or  agent  power  but  at  the 
impersonal  effects  of  discourses  and/or  habits  for 
the  production  and  reproduction  of  order,  particu¬ 
larly  in  cases  where  practices  go  without  saying, 
appear  natural,  and  are  therefore,  perhaps,  the 
most  effective  power  relations  there  are. 

At  the  actor  level,  such  a  view  implies  an  empha¬ 
sis  on  the  process  of  interest  formation  as  a  pri¬ 
mary  locus  for  power  relations.  For  constructivists, 
interests  cannot  be  understood  outside  their  inter- 
subjective  contexts  in  terms  of  shared  constitutive 
norms,  of  shared  knowledge  and  understandings, 
and  also  through  the  effects  that  practices  have  on 
self-understandings  or  identity.  For  constructivists, 
what  we  want  follows  from  who  we  are  (or  want 
to  be). 

One  larger  power  research  agenda  therefore 
concerns  the  background  knowledge  or  constitu¬ 
tive  “rules  of  the  game”  that  mobilize  certain 
biases  and  define  the  competent  players  and  their 
effective  moves.  Naturalized  understandings  evoke 
certain  actions  and  empower  certain  agents. 
Understanding  an  event  as  analogous  to  “Munich” 
mobilizes  a  collective  memory  that  authorizes 
some  actions  and  undermines  the  legitimacy  of 
others.  Whether  or  not  the  end  of  the  Cold  War 
has  ushered  in  a  “clash  of  civilizations,”  such  an 
understanding  mobilizes  and  is  empowered  by  pre¬ 
existing  Cold  War  scripts  in  which  totalitarianism 
has  been  replaced  by  fundamentalism  in  security 
discourse.  It  gives  it  the  potential  to  become  a  self- 
fulfilling  prophecy.  Just  when  the  security  sector 
started  to  be  demilitarized,  the  increased  use  of 
private  military  companies  such  as  Executive 
Outcomes  gave  them  “epistemic  power”  because 
their  practices  shape  the  understanding  of  security 
and  the  self-understanding  of  its  actors.  These 
practices  have  the  effect  of  authorizing  an  increas¬ 
ingly  technical  and  military  understanding  of  the 
field  while  being,  in  turn,  authorized  through 
them. 

With  regard  to  their  focus  on  identity,  con¬ 
structivist  scholars  have  looked  not  only  at  the 
impersonal  effect  of  discourses  and/or  practices  on 
self-understandings  but  also  on  the  “power  poli¬ 
tics  of  identity.”  If  identity  is  crucial  for  interest 
formation,  then  it  is  only  a  small  step  to  analyzing 


how  diplomatic  practices,  sometimes  intended,  can 
be  used  for  blackmailing  actors  by  profiting  from 
the  contradictions  in  the  actor’s  self-representations. 

As  a  result,  some  constructivist  research  agendas 
converge  with  Foucauldian  approaches  in  their 
understandings  of  order  as  diffused  practices  of  rule 
rather  than  as  clear  and/or  formal  hierarchies.  Such 
convergences  can  be  seen,  for  instance,  in  the  analy¬ 
sis  of  how  international  standards,  which  are  often 
established  by  private  actors,  are  practices  of  rule 
once  they  become  accepted  convention  and  interact 
with  the  actors  and  issues  they  were  supposedly 
only  neutrally  measuring  (e.g.,  credit  rating). 

But  at  the  same  time,  such  analyses  link  power 
in  explanatory  theory  back  to  political  theory  or, 
particularly  in  the  post-Foucauldian  vein,  actually 
put  the  latter  first.  The  understanding  of  power  is 
not  primarily  about  cause  and  effect,  or  influence 
and  outcomes.  Instead,  it  embeds  power  in  a  his¬ 
torical  analysis  of  the  changing  nature  of  the  mod¬ 
ern  and  liberal  order,  which,  so  the  thesis  goes, 
increasingly  works  by  making  the  subjects  of  order 
active  subjects  of  their  own  ordering.  Ole  Jakob 
Sending  and  Iver  Neumann,  for  instance,  analyze 
the  role  of  nonstate  actors  in  this  vein,  understand¬ 
ing  them  not  so  much  as  civil  society  in  opposition 
to  the  political  power  of  states  but  as  part  and 
parcel  of  a  decentralized  and  self-disciplining  logic 
of  a  global  order  (i.e.,  governmentality). 

The  Politics  of  Power  Analysis 

Even  if  careful  scholarly  discussion  can  discard 
some  conceptualizations  of  power,  there  is  no  one 
root  concept  that  we  can  unravel  simply  by  digging 
deeper,  neither  in  the  domain  of  political  theory 
nor  in  explanatory  theory.  Power  concepts  derive 
their  meanings  from  the  theories  in  which  they  are 
embedded,  and  there  they  meet  the  metatheoretical 
or  normative  divides  that  plague  and  enrich  our 
theorizing.  At  the  same  time,  the  debate  has  come 
full  circle.  Initially,  realist  writings  combined  the 
domains  of  political  theory,  centered  on  the  under¬ 
standing  of  order  in  the  polity,  with  the  domain  of 
explanatory  theory  by  assuming  that  in  the  absence 
of  a  genuine  world  polity  the  analysis  of  capabili¬ 
ties  and  influence  was  all  there  could  be.  By  attack¬ 
ing  the  double  link  between  agent  influence  and 
the  balance  of  power,  later  studies  redefined  a 
more  or  less  causal  role  for  power,  be  it  at  the 
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agent  or  the  structural  level.  Hence,  they  stressed  the 
explanatory  domain  of  power  at  the  expense  of  the 
political  theory  of  power.  But  this  can  work  only  so 
far,  since  the  two  domains  intrude  into  each  other; 
structural  power  is  necessary  to  understand  not  only 
outcomes  but  also  autonomy  in  a  polity;  inversely, 
the  analysis  of  the  changing  nature  of  global  gover¬ 
nance  and  order  provides  the  background  against 
which  the  very  processes  of  power  can  be  under¬ 
stood  in  the  first  place.  So,  to  close  the  circle  with  the 
poststructuralist  and  constructivist  turn,  the  analysis 
of  power  returns  again  to  show  the  links  between 
the  two  domains.  But  contrary  to  early  realism,  it 
does  so  by  assuming  the  existence  of  a  genuine 
world  polity  within  which  power  has  to  be  under¬ 
stood.  Yet  so  far  it  cannot  pretend  to  have  a  theory 
of  power  capable  of  combining  the  two  domains. 

Finally,  power  analysis  is  not  only  tied  to  the 
understanding  of  politics;  it  is  itself  political.  The 
reason  is  that  concepts  such  as  power  have  a  spe¬ 
cial  status  in  our  political  discourse.  They  are  used 
for  a  variety  of  purposes.  For  power,  two  are  par¬ 
ticularly  important.  Power  is  used  in  practical 
contexts  in  which  we  are  interested  in  what  we  can 
do  to  others  and  what  others  can  do  to  us.  It  is  also 
important  in  moral  and  legal  contexts  where  it 
functions  as  an  indicator  of  effective  responsibility: 
If  actors  could  not  have  done  an  act  (if  they  did  not 
have  the  capacity  to  do  so),  they  cannot  be  found 
guilty  of  it.  The  first  indicates  the  realm  of  action: 
Power  becomes  an  indicator  of  politics  as  the  “art 
of  the  possible.”  The  second  assesses  possible 
blame.  Since  power  is  often  conceived  of  as  a  coun- 
terfactual — that  is,  about  things  that  could  have 
been  otherwise — invoking  power  is  to  call  for  a 
justification  of  why  things  were  done  the  way  they 
were.  As  a  result,  choosing  concepts  of  power  that 
are  relatively  narrow  diminishes  the  realm  where 
“something  can  be  done”  and  in  which  action 
needs  to  be  justified;  unintended  effects,  for 
instance,  are  handled  as  regretful  but  unavoidable 
collateral  damage.  Inversely,  wider  concepts  of 
power  suggest  realms  for  action  even  where  there 
may  be  none.  The  fundamental  point  for  such  a 
performative  analysis  of  power — not  what  it  means 
but  what  it  does — is  that  invoking  the  presence  of 
power  politicizes  issues. 

This  also  explains  a  curious  paradox.  Scholars 
and  practitioners  often  engage  in  debates  about 
where  power  “really”  lies,  for  example,  whether  it 


is  hard  or  soft.  By  doing  this,  they  must  appeal  to 
an  underlying  idea  that  we  can  know  this  in  a 
somewhat  objective  manner.  If  power  were  mea¬ 
surable,  however,  such  debates  would  be  quite 
pointless.  Precisely  because  power  is  not  as  fungi¬ 
ble  as  money  and  its  understanding  is  to  some 
extent  conventional,  observers  try  to  shape  the 
common  understanding  and  fix  the  meaning  of 
what  power  is  and  where  the  power  really  lies:  For 
such  understandings  have  authoritative  effects  on 
national  security  and  foreign  policy  doctrines 
when  used  to  define  the  national  interest,  as  well 
as  on  actual  political  rank  and  standing  when  an 
actor’s  main  potential  power  resource  comes  to  be 
considered  insignificant. 
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Political  forecasting  and  prediction  refers  to  any 
effort  to  anticipate  events  that  will  occur  at  a 
future  point  in  time,  usually  with  the  intention  of 
determining  actions  that  will  either  exploit  or 
mitigate  the  effects  of  those  events.  Forecasting  is 
implicit  in  almost  all  political  decision  making, 
but  the  techniques  vary  widely  in  their  level  of 
explicitness,  ranging  from  simple  intuition  to 
detailed  historical  analogies  to  elaborate  statistical 
models.  This  entry  discusses  the  sources  of  regu¬ 
larity  in  political  behavior;  conditional  and  uncon¬ 
ditional  forecasts;  the  concepts  of  forecast  horizons, 
windows,  and  specificity;  modes  of  traditional 
(nontechnical)  forecasting;  and  simulation  and 
statistical  approaches  to  technical  forecasting. 

Evidence  for  the  importance  of  forecasting  to 
political  elites  can  be  found  in  some  of  the  earliest 
available  records  of  organized  human  behavior, 
such  the  biblical  story  of  Joseph  interpreting  the 
dreams  of  an  Egyptian  Pharaoh  found  in  Genesis 
41.  A  wide  variety  of  divination  methods — for 
example,  augury,  astrology,  dream  interpretation, 
and  consultation  with  oracles — were  nearly  univer¬ 
sal  across  cultures  in  premodern  times,  and  practi¬ 
tioners  in  these  fields  often  enjoyed  high  official 
status.  These  supernatural  techniques  were  gradu¬ 
ally  supplemented  with  the  use  of  historical  analo¬ 
gies  and  collections  of  aphorisms  or  “wisdom 
literature,”  but  prediction  techniques  remained 
nontechnical  until  relatively  recently. 

The  prospect  of  systematic  forecasting  of  polit¬ 
ical  behavior  became  attractive  following  the 


development  of  highly  accurate  forecasting  mod¬ 
els  of  physical  phenomena,  particularly  the  exam¬ 
ple  of  Isaac  Newton’s  laws  of  motion  in  the  17th 
century,  and  the  next  2  centuries  saw  the  produc¬ 
tion  of  a  number  of  works  on  “laws”  of  political 
and  economic  behavior  that  were  effectively  fore¬ 
casting  models.  However,  the  systematic  develop¬ 
ment  of  reasonably  accurate  numerical  forecasts 
did  not  occur  until  the  development  of  modern 
statistics  in  the  early  20th  century.  With  the 
increase  in  communication  and  computing  power, 
accurate  technical  forecasts  in  some  domains, 
most  conspicuously  election  outcomes,  were  com¬ 
mon  by  the  late  20th  century,  and  the  methods  are 
now  being  extended  to  other  types  of  behavior, 
notably  forecasts  of  political  instability. 

Forecasting  in  Political  Decision  Making 

Forecasting  is  ubiquitous  in  political  decision  mak¬ 
ing  for  the  simple  reason  that  to  decide  between 
multiple  courses  of  action,  a  decision  maker  must 
have  some  sense  of  the  likely  outcome  of  each 
option.  These  outcomes  are  rarely  deterministic — 
that  is,  entirely  determined  at  the  time  the  choice  is 
made — and  predictions  are  further  complicated  by 
the  fact  that  many  political  decisions  occur  in  a 
context  where  actors  are  working  to  achieve  oppos¬ 
ing  objectives  (a  topic  explored  in  detail  in  game 
theory).  Political  forecasting  may  be  dispersed 
throughout  an  organization,  with  every  agent 
responsible  for  assessing  the  likely  consequences  of 
policy  choices,  or  there  may  be  specialists  in  fore¬ 
casting,  for  example,  intelligence  agencies  project¬ 
ing  the  future  behavior  of  other  states  or  the 
emerging  commercial  field  of  political  risk  analysis, 
which  assesses  the  possible  economic  impacts  of 
political  changes  (e.g.,  whether  a  country  will 
default  on  a  loan). 

Because  humans  exhibit  free  will  and  are  capa¬ 
ble  of  a  wide  range  of  behaviors,  human  behavior 
(political  or  otherwise)  will  always  remain  less 
predictable  than  the  behavior  of  a  closed  physical 
system  such  as  an  object  travelling  through  the 
vacuum  of  space,  which  can  be  predicted  to  a  high 
level  of  accuracy  decades  in  advance.  However, 
political  behavior  tends  to  be  constrained  in  ways 
that  makes  it  considerably  more  predictable  than, 
for  example,  deciding  what  a  randomly  chosen 
household  will  be  having  for  dinner. 
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Three  factors  are  particularly  important  in 
establishing  predictability.  First,  most  political 
activity  is  strongly  constrained  by  material  circum¬ 
stances.  For  example,  an  unemployed  plumber 
deciding  to  run  for  mayor  has  a  considerably  differ¬ 
ent  set  of  options  than  does  a  billionaire  running  a 
third  mayoral  campaign.  Second,  political  forecasts 
frequently  deal  with  mass  behaviors,  and  these 
have  a  substantially  lower  probability  of  changing 
rapidly  than  does  the  behavior  of  a  single  individ¬ 
ual:  If  a  candidate  is  leading  by  20%  in  the  polls, 
the  probability  of  a  set  of  individual  decisions  (or 
polling  errors)  that  would  cause  a  loss  is  consider¬ 
ably  less  than  if  they  are  leading  by  0.5%.  Likewise, 
the  likelihood  that  a  critical  mass  of  individuals  will 
decide  to  rise  in  rebellion  against  a  democratically 
elected  government  with  a  stable  economy  is  con¬ 
siderably  less  than  the  probability  of  rebellion 
against  an  aging  ruler  in  the  midst  of  an  economic 
crisis.  Finally,  many  political  institutions  that  are 
stable  over  long  periods  of  time — notably  bureau¬ 
cracies  and  legal  systems — are  explicitly  designed  to 
produce  predictable  behavior:  A  set  of  given  inputs 
to  the  organization  is  expected  to  produce  a  consis¬ 
tent  outcome. 

Unconditional  and  Conditional  Forecasts 

Forecasts  can  be  divided  into  “unconditional”  and 
“conditional”  forecasts.  An  unconditional  forecast 
is  one  that  assumes  nothing  important  in  the  infor¬ 
mation  used  to  make  the  forecast  will  change.  The 
nature  of  that  information  may  or  may  not  be 
explicitly  specified  in  nontechnical  forecasts;  it  is 
usually  explicit  in  the  variables  and  equations  found 
in  technical  forecasts.  A  conditional  forecast,  in 
contrast,  assumes  that  certain  things  will  change: 
The  most  common  use  of  conditional  forecasts  is 
for  what-if  analyses  of  the  possible  changes  pro¬ 
jected  to  occur  if  a  specific  policy  is  adopted.  For 
example,  a  model  might  attempt  to  predict  the 
change  in  unemployment  in  a  city  if  the  city  does  or 
does  not  receive  funding  for  a  job  training  program. 

The  accuracy  of  conditional  forecasts  is  closely 
linked  to  the  issue  of  causality.  The  high  degree  of 
accuracy  of  Newton’s  laws  of  motion  is  due  to  the 
fact  that  within  a  known  set  of  boundary  condi¬ 
tions  (e.g.,  Newton’s  equations  break  down  for  an 
object  travelling  near  the  speed  of  light),  they  com¬ 
pletely  specify  behavior.  In  contrast,  forecasts  of 


political  behavior  usually  have  a  much  higher  level 
of  uncertainty  than  that  found  in  simple  physical 
systems,  though  not  necessarily  less  than  the 
uncertainty  of  complex  physical  systems;  for 
example,  many  political  forecasts  are  considerably 
more  accurate  than  weather  predictions  at  compa¬ 
rable  time  horizons. 

Three  key  factors  affect  this  accuracy.  The  first 
is  the  well-known  observation  that  correlation  does 
not  imply  causation:  The  fact  that  two  variables 
covary  does  not  mean  that  one  necessarily  causes 
the  other.  If  both  are  caused  by  a  third  variable,  the 
correlation  may  be  very  useful  for  prediction — that 
is,  the  existence  of  the  correlation  supplies  informa¬ 
tion  that  increases  forecast  accuracy — but  unless 
there  is  a  true  causal  linkage,  changing  one  of  the 
variables  will  not  change  the  other.  For  example, 
for  a  number  of  years  the  linkage  between  cigarette 
smoking  and  lung  cancer  was  dismissed  as  simply 
correlation;  both  were  assumed  to  be  linked  to 
other  lifestyle  issues  (e.g.,  individuals  working  in 
polluted  industrial  environments  were  more  likely 
to  be  smokers),  and  only  after  extensive  research 
isolated  smoking  as  a  near-certain  causal  factor 
were  changes  made  in  public  policy. 

A  second  issue  is  the  degree  to  which  the  pro¬ 
jected  change  is  within  the  range  of  prior  experi¬ 
ence  with  the  system.  A  forecast  for  the  effects  of 
increasing  the  funding  of  a  job  training  program  in 
a  city  that  has  previously  had  a  series  of  such  pro¬ 
grams  at  a  variety  of  levels  of  funding  is  more 
likely  to  be  accurate  than  the  forecast  for  a  city 
where  no  job  training  programs  had  ever  been 
tried. 

A  third  source  of  error  is  the  possibility  that  the 
human  object  of  the  prediction  will  change  its 
behavior  to  either  reinforce  or  negate  the  predic¬ 
tion,  a  possibility  not  occurring  in  physical  sys¬ 
tems.  Such  situations  are  generally  known  as 
“self-fulfilling”  and  “self-negating”  predictions. 
Consider  the  case  where  an  intelligence  agency 
makes  a  widely  publicized  prediction  that  the  gov¬ 
ernment  in  a  country  is  likely  to  fail.  If,  as  a  conse¬ 
quence,  allies  of  the  government  withhold  aid  and 
rebel  groups  inside  the  country  are  emboldened 
and  increase  their  activity,  the  government  may 
fail  in  part  because  of  the  prediction  itself:  a  self- 
fulfilling  prediction.  This  makes  the  prediction 
conditionally  accurate,  but  not  unconditionally 
accurate,  since  in  the  absence  of  the  prediction 
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itself,  the  government  might  have  survived. 
Conversely,  in  response  to  the  prediction,  the  threat¬ 
ened  government  might  change  its  policies  and,  as  a 
consequence,  not  fail:  a  self-negating  prediction.  In 
this  case,  the  prediction  appears  incorrect,  even 
though  unconditionally  it  may  have  been  correct. 

The  possibility  of  predictions  influencing  out¬ 
comes  is  particularly  critical  in  election  forecasting, 
where  predictions  can  have  the  effect  of  suppress¬ 
ing  or  increasing  turnout,  increasing  the  amount  of 
campaign  contributions  to  the  projected  winner,  or 
decreasing  the  vote  for  candidates  who  are  likely  to 
lose.  As  a  consequence,  many  democracies  prohibit 
the  publication  of  polling  results  during  some 
period  prior  to  an  election  or  on  polling  day. 
Similarly,  intelligence  projections  of  possible  insta¬ 
bility  in  other  states,  particularly  allies,  are  often 
considered  politically  sensitive  and  thus  classified. 

Forecast  Horizons,  Windows,  and  Specificity 

All  forecasts — either  implicitly  or  explicitly — have 
three  characteristics:  a  horizon  (how  far  ahead  in 
time  the  forecast  is  being  made),  a  window  (the 
period  of  time  for  when  the  forecasted  event  will 
occur),  and  specificity  (the  extent  to  which  an 
exact  prediction  is  made).  In  general,  the  accuracy 
of  forecasts  will  decline  with  longer  forecast  hori¬ 
zons  because  insufficient  information  is  available 
on  the  effects  of  current  variables  on  the  future, 
and  unanticipated  events  may  occur  to  substan¬ 
tially  change  the  projected  outcome.  For  example, 
the  accuracy  (and  specificity)  of  an  election  fore¬ 
cast  typically  will  be  lower  6  months  prior  to  the 
election  than  the  day  before  the  election,  and  even 
a  short-term  forecast  could  become  irrelevant  if 
the  projected  winner  dies  before  the  election. 
Many  statistical  forecasting  models  provide  explicit 
estimates  of  this  uncertainty. 

For  some  forecasting  problems — notably  elec¬ 
tions — the  time  horizon  is  simply  determined  by 
the  known  date  of  a  future  event.  In  some  other 
domains — for  example,  policies  affecting  long¬ 
term  public  health  or  the  environment — the  hori¬ 
zon  may  be  on  the  order  of  decades.  However,  the 
typical  policy-relevant  time  horizon  is  on  the  order 
of  6  months  to  5  years:  Shorter  times  are  irrelevant 
because  a  policy  will  not  have  had  time  to  be 
implemented  and  take  effect,  and  longer  times  tend 
both  to  accumulate  uncertainties  and  may  be 


beyond  the  professional  interests  of  those  con¬ 
cerned  with  the  forecast  (e.g.,  elected  officials). 

The  forecast  window  is  only  relevant  in  situa¬ 
tions  where  there  is  uncertainty  as  to  when  an 
event  might  occur.  Predicting  the  term  of  office  for 
a  prime  minister  in  a  parliamentary  system,  pre¬ 
dicting  whether  a  government  will  be  overthrown 
in  a  revolution,  or  predicting  when  a  country  will 
suffer  a  major  currency  crisis  would  be  examples. 
In  some  cases,  a  change  is  inevitable — no  prime 
minister  is  immortal — and  in  others,  the  projected 
event  may  never  occur. 

Finally,  predictions  differ  widely  in  their  speci¬ 
ficity.  The  predictions  of  oracles  and  astrologers 
were  notorious  for  their  lack  of  specificity:  The 
classical  Greek  historian  Herodotus  reports  the 
Delphic  Oracle  advising  the  Lydian  king  Croesus 
that  if  he  attacked  the  Persians,  “a  great  empire 
will  be  destroyed.”  On  this  advice,  Croesus 
attacked,  but  the  Persians  destroyed  bis  empire: 
The  oracle  showed  good  accuracy,  but  from  the 
perspective  of  Croesus,  the  advice  was  less  than 
optimal  in  terms  of  specificity. 

Beyond  this  extreme,  many  nontechnical  fore¬ 
casts  are  relatively  vague  as  to  exactly  what  has 
been  predicted,  because  of  the  informality  of  the 
techniques  and  the  use  of  natural  language  in 
reporting  the  predictions.  Technical  forecasts,  in 
contrast,  are  usually  much  more  precise,  with  com¬ 
plete  probability  distributions  across  a  range  of 
possible  outcomes,  whether  a  numerical  range 
such  as  forecasting  the  percentage  of  votes  for  a 
candidate  or  the  range  of  probabilities  across  a 
fixed  set  of  categorical  outcomes,  such  as  whether 
or  not  a  government  will  remain  in  power.  This 
precision  does  not  necessarily  translate  into  accu¬ 
racy  but  at  least  allows  competing  technical  fore¬ 
casts  to  be  unambiguously  compared  and,  after  the 
time  of  the  forecast  horizon,  their  accuracy  to  be 
calculated. 

Nontechnical  Forecasting 

Nontechnical  forecasting  predominantly  takes 
three  forms:  (1)  intuition,  (2)  the  use  of  rules  of 
thumb  or  aphorisms,  and  (3)  the  use  of  narrative 
archetypes  and  historical  analogies.  Intuition  is  the 
use  of  subcognitive  mental  processing — the  brain 
is  engaged  in  a  great  deal  of  work,  typically  pattern 
recognition,  but  this  occurs  in  a  manner  that  the 
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thinker  is  not  consciously  aware  of  (much  as  an 
uneducated  native  speaker  of  a  language  will  have 
perfectly  mastered  the  complex  grammar  of  the 
language  but  cannot  systematically  articulate  its 
rules).  A  typical  explanation  for  an  intuitive  fore¬ 
cast  is  “I  have  a  feeling  ...”  or  “I  know  it  when  I 
see  it.  ”  While  social  pattern  recognition  in  humans 
and  other  primates  is  highly  developed  and  hence 
frequently  accurate,  the  disadvantage  of  the  intui¬ 
tive  forecast  is  that  it  is  dependent  on  the  skill  and 
experience  of  the  individual  making  the  forecast, 
and  only  the  forecast  itself,  not  the  process  of  deter¬ 
mining  the  forecast,  can  be  shared  and  evaluated. 
To  the  extent  that  they  were  successful  in  making 
unambiguous  forecasts,  oracles,  astrologers,  dream 
interpreters,  and  other  premodern  forecasters  were 
probably  good  at  intuitive  methods. 

Aphorisms  and  rules  of  thumb  are  a  form  of 
succinct  and  very  general  if-then  models  that  link 
specific  observable  situations  with  outcomes  and 
summarize  frequently  observed  regularities;  all  cul¬ 
tures  have  a  large  number  of  these  for  dealing  with 
common  social  situations.  Unlike  intuition,  apho¬ 
risms  are  explicit,  though  the  evidence  backing 
them  is  usually  not.  The  disadvantage  of  aphorisms 
is  that  multiple  aphorisms  frequently  contradict 
each  other — for  example,  the  common  English- 
language  aphorisms  “Birds  of  a  feather  flock 
together”  and  “Opposites  attract” — and  apho¬ 
risms  are  often  vague  in  both  their  antecedents  and 
their  consequents. 

Narrative  archetypes  and  historical  analogies 
are  the  most  sophisticated  of  the  nontechnical  fore¬ 
casting  methods  and  are  very  commonly  (if  often 
implicitly)  found  in  policy  analysis  and  political 
rhetoric.  In  both  approaches,  a  set  of  observed 
events  and  circumstances  is  matched  to  a  set  of 
events  and  circumstances  in  a  series  of  idealized 
narratives  or  historical  cases,  and  a  prediction  is 
made  on  the  basis  of  the  outcome  that  occurred  in 
the  best  matching  case.  Historical  analogies  use 
situations  that  actually  occurred  in  the  past  (per¬ 
haps  with  adjustments  for  technological  and  cul¬ 
tural  changes);  with  an  archetypical  narrative,  the 
comparison  is  with  a  hypothetical  event  that  has 
the  general  characteristics  of  a  large  number  of 
historical  cases  (e.g.,  “peasant  revolt,”  “populist 
electoral  campaign”)  or  a  counterfactual  that 
seems  plausible  but  has  never  actually  occurred 
(“crisis  leading  to  a  nuclear  missile  exchange”). 


Nontechnical  forecasting  is  widespread  and 
prior  to  the  availability  of  statistical  methodologies 
and  computers  was  virtually  the  only  form  of  fore¬ 
casting  used  in  political  analysis.  Systematic  research 
has  shown,  however,  that  it  is  not  particularly 
accurate:  Many  “experts”  do  barely  better  than 
chance  on  difficult  forecasting  problems,  and  the 
expert’s  self-confidence  and  public  reputation  is 
sometimes  negatively  correlated  with  their  accu¬ 
racy.  These  failings  have  long  been  recognized,  and 
a  wide  variety  of  methods  have  been  suggested  for 
improving  the  accuracy  of  nontechnical  forecasts, 
particularly  by  gaining  improved  accuracy  through 
aggregation  of  the  forecasts  of  multiple  experts. 
Among  the  better  known  of  these  are  the  Delphi 
method,  devil’s  advocate,  brainstorming  methods, 
and,  recently,  prediction  markets. 

Simulation 

Simulation  is  a  method  of  forecasting  that  bridges 
the  nontechnical  and  technical  approaches.  In  the 
earliest  versions,  pioneered  by  the  war  games  of 
the  Prussian  military  in  the  18th  and  19th  centu¬ 
ries,  individuals  play  the  roles  of  various  actors, 
constrained  by  an  oftentimes  elaborate  set  of 
rules.  These  “games”  can  be  as  simple  as  those 
played  on  a  table  or  as  elaborate  as  extended  field 
exercises  involving  real  persons  and  equipment. 
The  method  is  still  widely  used,  although  now  it 
usually  combines  training  and  experimentation 
with  specific  tactics  rather  than  being  used  for 
strategic  forecasting. 

From  the  1960s  forward,  computer  simulations 
have  been  used:  The  two  most  common  methods 
are  (1)  systems  dynamics,  where  equations  are 
specified  for  the  aggregate  behavior  of  key  variables 
in  the  system,  and  (2)  agent-based  models,  where 
equations  or  algorithms  are  specified  for  the  behav¬ 
ior  of  simulated  individual  actors  in  the  model,  and 
the  behavior  of  the  system  as  a  whole  is  derived  by 
simulating  a  large  number  of  these.  Both  forms  of 
simulation  usually  involve  multiple  runs  with  dif¬ 
ferent  values  for  random  variables  so  that  a  full 
probability  distribution  of  the  predicted  outcomes 
can  be  obtained.  The  advantage  of  simulation  is 
that  systems  of  almost  unlimited  complexity  can  be 
specified  and  evaluated;  the  disadvantage  is  that 
determining  the  appropriate  values  of  the  parame¬ 
ters  of  the  equations  is  very  difficult,  and  in  some 
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simulations  those  results  are  largely  determined  by 
a  small  set  of  key  parameters. 

An  important  type  of  sensitivity  that  affects 
dynamic  simulations  containing  nonlinear  interac¬ 
tions  is  the  effect  of  chaos  theory,  where  the  predic¬ 
tions  of  a  model  can  be  highly  sensitive  to  values  of 
critical  parameters  and  to  the  initial  values  of  some 
variables.  In  such  models,  some  combinations  of 
parameter  values  and  initial  conditions  can  differ 
by  arbitrarily  small  numerical  amounts — differences 
far  more  precise  than  could  ever  be  estimated  from 
actual  data — and  yet  produce  widely  divergent 
outcomes.  Even  chaotic  systems,  however,  may 
provide  some  level  of  predictability — these  usually 
exhibit  multiple  “basins  of  attraction”  where  the 
system  is  likely  to  find  a  long-term  equilibrium — 
and  systems  that  are  chaotic  for  some  parameter 
values  can  be  well  behaved  and  predictable  for 
other  values. 

A  comparable  problem  can  be  found  in  agent- 
based  models  when  the  simulated  agents  continu¬ 
ally  adapt  their  behaviors  to  the  strategies  of  other 
actors,  and  consequently  patterns  of  behavior 
found  in  one  period  of  time  may  be  completely  dif¬ 
ferent  from  those  found  in  another  because  of  the 
different  strategies  being  used.  This  phenomenon 
has  been  observed  empirically  in  stock  market  trad¬ 
ing  strategies  and  in  military  strategies — a  new 
technique  that  is  highly  successful  when  first  intro¬ 
duced  will  become  less  effective  over  time  as  other 
actors  adapt  to  it.  In  evolutionary  biology,  this  is 
known  as  the  Red  Queen  hypothesis — named  after 
the  eponymous  figure  in  Lewis  Carroll’s  fantasy 
Through  the  Looking  Glass — whereby  organisms 
continually  have  to  adapt  their  strategies  to  main¬ 
tain  the  same  level  of  fitness  vis-a-vis  their  com¬ 
petitors.  Such  systems  may  be  predictable  in  the 
short  term — some  adaptations  are  more  likely  to  be 
successful  than  others — but  the  nature  of  the  regu¬ 
larity  can  change  substantially  at  longer  horizons. 

Statistical  Forecasting 

By  far  the  most  common  form  of  technical  fore¬ 
casting  involves  the  use  of  statistical  models. 
Models  used  in  political  forecasting  were  origi¬ 
nally  borrowed  from  basic  mathematical  statistics 
and  then  adopted  from  economics,  and  now 
increasing  use  is  made  of  models  originally  devel¬ 
oped  for  the  prediction  of  medical  outcomes,  as 


well  as  some  models  developed  specifically  for 
political  and  policy  analysis.  A  wide  variety  of  sta¬ 
tistical  methods  are  now  in  use,  and  many  can  be 
estimated  using  standard  statistical  packages, 
though  some  require  specialized  time-series  analy¬ 
sis  software.  Among  the  more  commonly  used 
methods  are 

•  autoregressive  time  series,  which  predict  future 
values  of  a  variable  based  on  a  linear 
combination  of  past  values — common 
approaches  include  autoregressive  moving 
average  (ARMA)  and  autoregressive  integrated 
moving  average  (ARIMA)  models; 

•  autoregressive  conditional  heteroskedasticity 
(ARCH)  models,  which  predict  the  future 
variance  of  an  indicator  rather  than  specific 
values — these  are  useful  in  predicting  future 
uncertainty; 

•  event  history  or  survival  models,  which  predict 
the  probability  of  a  specific  event  as  a  function 
of  time — variables  are  assessed  on  the  degree  to 
which  they  are  risk  factors  that  raise  or  lower 
these  probabilities; 

•  vector  autoregression  (VAR),  which  predicts  the 
response  to  specific  numerical  shocks  to  the 
system;  and 

•  cross-sectional  time  series,  which  takes  into 
account  variation  both  across  time  and  within 
groups  of  cases. 

The  advantage  of  statistical  models  is  that  they 
are  completely  transparent  in  their  specifications, 
they  are  replicable,  and  they  are  grounded  in  a 
formal  theory  of  statistics  and  probability. 
Statistical  models  also  provide  forecasts  with  a 
high  level  of  detail — for  example,  probabilities  of 
a  specific  numerical  or  categorical  outcome  at  a 
specific  point  in  time.  The  disadvantage  is  that 
they  may  require  a  great  deal  of  data  (which  may 
vary  in  quality  and  availability)  to  be  accurately 
estimated  and  they  are  completely  dependent  on 
the  appropriate  specification  of  both  the  mathe¬ 
matical  form  of  the  model  and  the  variables  used 
as  input  to  the  forecast. 

Three  other  issues  are  relevant  to  statistical 
time-series  models.  First,  the  accuracy  of  a  model 
on  the  data  used  to  estimate  the  model — the  in- 
sample  accuracy — is  usually  higher  than  the  accu¬ 
racy  on  new  data — the  out-of-sample  accuracy. 
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Furthermore,  raising  the  in-sample  accuracy  to 
account  for  all  of  the  idiosyncratic  data  points — 
fitting  the  error — will  usually  decrease  out-of- 
sample  accuracy.  Consequently,  the  development 
of  robust  statistical  models  may  require  a  great 
deal  of  experimentation,  and  determining  the 
“best”  model  for  forecasting  the  out-of-sample 
future  requires  elements  of  art  as  well  as  science. 

Second,  many  political  variables  are  strongly 
autoregressive — their  values  change  little  (or 
change  very  predictably)  from  time  point  to  time 
point.  This  means  that  the  models  are  very  accu¬ 
rate,  but  departures  from  the  autoregressive 
pattern — for  example,  a  sudden  swing  in  public 
opinion  or  unemployment — are  difficult  to  pre¬ 
dict.  Such  changes,  however,  are  likely  to  be  of 
greatest  interest  to  analysts.  Granger  causality 
analysis,  for  example,  looks  at  the  effects  of  vari¬ 
ables  only  after  the  autoregressive  factors  have 
been  taken  into  consideration. 

Finally,  in  some  domains,  particularly  those 
involving  major  political  or  economic  crises,  the 
number  of  observations  of  the  event  of  interest  is 
very  small — this  is  referred  to  as  a  rare-events  esti¬ 
mation  problem — and  it  is  difficult  for  a  statistical 
model  to  extract  sufficient  information  to  predict 
such  occurrences.  Because  of  the  importance  of 
rare  events,  this  is  an  active  research  area  and 
mathematical  methods  can  be  modified  to  at  least 
partially  adjust  for  this,  but  the  problem  is  not 
fully  solved. 

Philip  A.  Schrodt 
Pennsylvania  State  University 
University  Park,  Pennsylvania,  United  States 
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Preemptive  War 


The  concept  of  preemptive  war  refers  to  a  legiti¬ 
mate  anticipatory  war  waged  by  a  state  in  order 
to  counter  a  threat  emanating  from  an  adversary. 
Like  many  other  notions  used  as  much  by  politi¬ 
cians  as  by  scholars,  preemptive  war  is  an  essen¬ 
tially  contested  concept,  all  the  more  so  as  it 
cannot  easily  be  dissociated  from  the  concept  of 
preventive  war. 

The  idea  of  a  legitimate  anticipatory  war  goes 
back  to  the  founders  of  the  modern  just-war  tradi¬ 
tion  in  the  17th  and  18th  centuries.  Writing  about 
the  lawfulness  of  wars,  Hugo  Grotius  and  Emer 
De  Vattel  claimed  that  natural  law  as  well  as  the 
law  of  nations  permitted  states  to  repel  force  by 
force  in  order  to  defend  themselves  against  danger. 
They  acknowledged  two  types  of  danger:  First,  a 
state  is  in  danger  once  it  has  been  attacked,  and  in 
this  case  resorting  to  arms  is  legitimate  for  the  sake 
of  self-defense;  second,  a  state  is  in  danger  when  an 
attack  against  it  is  imminent,  and  in  this  case 
resorting  to  arms  is  legitimate  to  forestall  the 
intended  attack.  The  criteria  put  forward  by 
Grotius  and  De  Vattel  introduced  a  breach  in  the 
then  prevailing  Hobbesian  conception  of  interna¬ 
tional  politics — considering  any  war  to  be  justified 
per  se  and  postulating  that  a  danger,  and  an  ensu¬ 
ing  right  to  prevent  such  a  danger,  existed  as  soon 
as  there  was  a  real  or  a  perceived  increase  of 
another  state’s  power  capacities.  During  the  so- 
called  Caroline  incident  opposing  the  United  States 
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and  the  British  empire  at  the  Canadian  border  in 
1837,  the  criteria  put  forward  by  Grotius  and  De 
Vattel  were  revisited  by  U.S.  Secretary  of  State 
Daniel  Webster;  according  to  Webster,  an  antici¬ 
patory  war  could  be  considered  to  be  legitimate  on 
the  condition  that  it  was  undertaken  as  a  necessity 
of  self-defense,  instant,  overwhelming,  and  leaving 
no  choice  of  means  and  no  time  for  deliberation. 

From  that  time  on,  it  has  been  generally  acknowl¬ 
edged  that  a  distinction  must  be  made  between 
legitimate  anticipatory  wars,  called  preemptive 
wars,  and  illegitimate  wars,  called  preventive  wars. 
Although  both  relate  to  the  same  better-now-than- 
later  logic  and  designate  an  anticipatory  and  win- 
nable  war  waged  by  a  state  eager  to  forestall  a 
threat  at  time  t  in  order  not  to  have  to  wage  a 
riskier  war  under  less  favorable  circumstances  at 
times  t  +  1,  t  +  2, .  .  .  ,  t  +  n,  the  material  dissimi¬ 
larity  of  the  threat  concerned  induces  a  normative 
difference  between  them. 

Concerning  the  material  dimension,  the  first  dif¬ 
ference  lies  in  the  temporality  of  the  threat  coun¬ 
tered  by  an  anticipatory  war;  a  preemptive  war  is 
undertaken  to  confront  a  threat  perceived  to  be 
imminent,  whereas  a  preventive  war  aims  at  fore¬ 
stalling  a  threat  likely  to  become  effective  only  after 
a  certain  period  of  time.  In  other  words,  a  preemp¬ 
tive  action  is  a  tactical  riposte  to  a  short-term 
threat,  while  a  preventive  action  is  a  strategic  reply 
to  a  long-range  threat,  such  as  a  future  imbalance 
of  forces.  The  second  material  difference,  comple¬ 
mentary  to  the  first  one,  relates  to  the  source  of  the 
threat.  In  a  preemptive  war,  State  A  faces  an  irre¬ 
sistible  threat:  It  attacks  at  the  point  in  time  t  when 
an  adversary,  State  B,  is  about  to  mobilize  actual 
military  capacities  against  A,  whereas  a  preventive 
war  launched  by  State  A  seeks  at  time  t  to  impede 
State  B  from  building  up  military  capacities  that 
might  be  mobilized  against  A  at  times  t  +  1 ,  t  + 
2,  .  .  . ,  t  +  n.  In  other  words,  a  preemptive  war 
aims  at  countering  a  real  threat,  whereas  a  preven¬ 
tive  war  strives  to  forestall  a  hypothetical  threat. 

A  normative  opposition  ensues  from  these  mate¬ 
rial  differences.  In  the  legalist  paradigm  underpin¬ 
ning  the  United  Nations  (UN)  Charter,  resort  to 
war  is  permitted  only  under  two  narrow  condi¬ 
tions.  The  first  is  the  traditional — both  individual 
and  collective — self-defense  principle  according  to 
which  a  country  that  is  attacked  has  the  right  to 
defend  itself  in  a  defensive  war  in  order  to  repel  its 


attacker  and  to  secure  its  security  and  survival, 
either  alone  or  with  the  help  of  other  states  joining 
in  such  a  defensive  war  to  guarantee  the  respect  of 
the  prevailing  norms  of  appropriate  international 
conduct.  The  second  type  of  legitimate  resort  to 
arms  refers  to  preemptive  war,  considered  to  be 
the  only  kind  of  legitimate  anticipatory  war  pre¬ 
cisely  because  it  is  a  war  launched  in  response  to  a 
threat  that  is  at  once  overwhelming  and  so  immi¬ 
nent  as  to  allow  no  time  for  deliberation  and  no 
choice  of  means.  Conversely  then,  resorting  to 
preventive  war  is  viewed  as  illegitimate,  as  was 
made  clear  by  the  diverging  reactions  in  the  world 
to  Israel’s  air  strikes  against  Iraq’s  Osirak  nuclear 
plant  in  June  1981  and  against  Egyptian  and 
Syrian  military  airports  in  June  1967.  The  1967 
attack  was  considered  to  be  a  legitimate  preemp¬ 
tive  strike,  because  the  Israeli  government  knew 
with  certainty  that  the  Egyptian  and  Syrian  mili¬ 
tary  air  forces  were  about  to  bomb  Israel’s 
territory:  Tel  Aviv  could  not  afford  to  rely  on  dip¬ 
lomatic  negotiations;  the  only  option  at  its  dis¬ 
posal  was  to  destroy  its  enemies’  bombers  on  the 
ground  before  they  could  take  off.  The  situation 
was  totally  different  on  June  19,  1981,  when  the 
Security  Council  of  the  United  Nations  in  its 
Resolution  487  condemned  the  military  attack  by 
Israel  in  clear  violation  of  the  norms  of  interna¬ 
tional  conduct.  Though  Iraq  was  allegedly  build¬ 
ing  a  nuclear  facility  that  might  one  day  be  used  to 
produce  nuclear  weapons  likely  to  threaten  Israel’s 
security,  such  a  threat  was  neither  imminent  nor 
overwhelming  at  the  time  the  Israeli  government 
decided  to  launch  its  bombing  campaign. 

This  consensus  was  broken  on  the  occasion  of 
the  decision  by  the  United  States  and  some  of  its 
allies  to  go  to  war  against  Iraq  in  2003.  In  Sep¬ 
tember  2002,  the  National  Security  Strategy  of  the 
United  States  claimed  that  the  United  States  has 
long  maintained  the  option  of  preemptive  actions 
to  counter  a  sufficient  threat  to  its  national 
security;  it  also  announced  its  intention  to  act  pre¬ 
emptively  in  order  to  forestall  hostile  acts  by  its 
enemies.  In  March  2003,  the  United  States  under¬ 
took  such  an  action,  launching  Operation  Iraqi 
Freedom,  which  it  justified  as  a  preemptive  war, 
given  that  the  only  course  of  action  against  terrorists 
was  to  defeat  them  abroad  before  they  attacked 
America  at  home.  The  vast  majority  of  scholars 
reacting  to  the  military  action  against  Saddam 
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Hussein’s  regime,  however,  refused  to  label 
Operation  Iraqi  Freedom  as  a  preemptive  war.  In 
September  2002,  Michael  Walzer  wrote  that  a  pos¬ 
sible  war  undertaken  by  the  United  States  against 
Baghdad  would  be  a  preventive  rather  than  a  pre¬ 
emptive  war.  Commenting  on  the  war  in  2008, 
Michael  Doyle  asserted  that  the  so-called  Bush 
Doctrine  of  preemption  actually  was  a  doctrine  of 
prevention. 

In  light  of  the  values  shared  by  the  members  of 
contemporary  international  society,  there  is  little 
doubt,  among  the  majority  of  scholars  as  well  as 
international  observers,  that  Operation  Iraqi 
Freedom  did  not  match  the  criteria  of  a  preemptive 
war.  On  the  one  hand,  the  United  States  faced  no 
instant  or  overwhelming  threat  emanating  from 
Iraq,  neither  against  itself  nor  against  its  regional 
allies.  On  the  other  hand,  it  could  have  chosen 
other  means  to  cope  with  Iraq’s  revisionist  behav¬ 
ior,  such  as  a  vigilant  containment  strategy.  After 
all,  inspections  by  agents  of  the  International 
Atomic  Energy  Agency  (IAEA)  were  taking  place 
on  the  Iraqi  territory  to  find  a  pacific  solution  to  the 
problem  of  the  possible  weapons  of  destruction 
possessed  by  Baghdad,  and  the  pursuit  of  diplo¬ 
matic  negotiations  within  the  legal  UN  framework 
was  favored  by  many  states,  including  the  allies  of 
the  United  States.  The  Bush  administration,  while 
eliding  the  differences  between  preemption  and 
prevention,  did  not  ignore  the  criteria  defining  a 
legitimate  anticipatory  war,  all  the  more  so  as  Jeane 
Kirkpatrick,  its  UN  ambassador  during  the  Osirak 
crisis,  had  voted  for  the  resolution  condemning 
Israel’s  air  strike  and  as  in  1986  the  International 
Court  of  Justice  had  refused  to  consider  that  the 
United  States  had  acted  in  collective  self-defense  of 
El  Salvador  when  it  had  used  force  against 
Nicaragua,  accused  of  providing  weapons  to 
Salvadoran  rebels.  Actually,  the  United  States  con¬ 
sciously  proposed  a  new  conception  of  preemption, 
based  on  the  idea  of  the  necessary  adaptation  of  the 
concept  of  imminent  threat  to  the  objectives  and 
capacities  of  unpredictable  rogue  states  and  terror¬ 
ist  networks  likely  to  target  civilian  populations 
and  to  use  weapons  of  mass  destruction  rather  than 
to  resort  to  those  conventional  means  enumerated 
in  the  traditional  preemption  doctrine,  that  is,  visi¬ 
ble  mobilizations  of  armies,  navies,  or  air  forces. 

Beyond  the  Iraqi  Freedom  case,  what  is  at  stake 
today  in  the  preemption  versus  prevention  debate 


is  the  potential  emergence  of  new  norms  of  state 
conduct.  Such  an  evolution  would  not  merely  be 
tantamount  to  a  departure  from  the  prevailing  just 
war  tradition  because  of  the  weakening  of  moral 
restraints.  It  would  also  break  with  those  realpo- 
litical  considerations  that  in  the  past  led  policy¬ 
makers  such  as  Otto  von  Bismarck  to  equate 
preventive  war  with  suicide  from  fear  of  death  and 
to  reject  it  as  a  rational  means  of  foreign  policy 
given  that  it  is  impossible  to  know  things  before¬ 
hand.  If  indeed  the  Bush  Doctrine  were  adopted 
universally,  it  would  probably  permit  any  state  to 
resort  to  arms  by  invoking  its  own  definition  of 
imminent  threats.  International  instability  would 
be  the  logical  result  of  a  universal  diffusion  of 
Bush’s  unilateral  way  of  foreign  policy  making. 

This  said,  international  norms  are  not  laid 
down  once  and  for  all;  their  meaning  is  fixed  by 
usage.  While  ultimately  reflecting  power  distribu¬ 
tion,  they  are  what  states,  and  actors  shaping 
states’  behavior,  make  of  them.  Many  researchers 
believe  that  international  adoption  of  a  new  doc¬ 
trine  of  legitimate  anticipatory  war  will  require 
that  such  a  doctrine  be  revisited  and  accepted 
within  the  United  States. 

Dario  Battistella 
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Pessac,  France 
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Presidentialism 


Presidentialism  is  one  of  the  three  major  organiza¬ 
tional  models  for  government  institutions  in  con¬ 
temporary  representative  democracies.  This  entry 
discusses  its  conceptualization,  reviews  its  origins 
and  diffusion,  summarizes  its  different  varieties, 
provides  an  outline  of  the  debates  about  its  advan¬ 
tages  and  disadvantages,  and  evaluates  its  current 
relationship  with  democracy. 

Definition 

Presidentialism  can  be  defined  by  the  simultaneous 
presence  of  three  characteristics.  The  first  is  the 
existence  of  two  agents  of  the  electorate:  a  chief  of 
government — known  as  president — and  a  legisla¬ 
tive  assembly — usually  known  as  congress — both 
of  which  are  elected  separately  and  by  popular 
vote.  The  second  characteristic  is  that  both  the 
president  and  members  of  congress  are  elected  for 
preestablished  fixed  terms,  which  means  that  each 
agent’s  survival  does  not  depend  on  the  other’s 
confidence.  The  president  cannot  be  removed  by  a 
legislative  vote  of  no  confidence  on  political 
grounds.  While  the  president  can  be  removed 
through  impeachment  proceedings,  impeachment 
is  intended  as  an  exceptional  response  to  illegal  or 
improper  presidential  behavior,  and  it  requires  the 
support  of  a  qualified  congressional  majority. 
Likewise,  the  president  cannot  dissolve  the  legisla¬ 
tive  assembly  and  call  for  parliamentary  elections 
at  his  or  her  discretion. 

The  third  characteristic  of  presidentialism  is  the 
blending  of  the  roles  of  head  of  government  and 
chief  of  state  in  the  same  institution,  the  presi¬ 
dency.  As  the  head  of  government,  the  president 
has  authority  over  the  overall  direction  of  govern¬ 
ment,  appoints  cabinet  ministers  and  the  upper 
echelons  of  public  bureaucracy,  may  influence  the 
legislative  process  through  the  introduction  of 
bills,  and  has  veto  and  decree  powers.  As  head  of 
state,  the  president  is  also  the  symbolic  representa¬ 
tion  of  the  state,  signs  international  treaties,  com¬ 
mands  the  armed  forces,  and  has  the  authority  to 
grant  pardons. 

Under  parliamentarism  and  semipresidential- 
ism,  the  chief  of  government  is  not  directly  elected 
and  does  not  serve  a  fixed  term.  Rather,  the  chief 


of  government  is  chosen  by  a  legislative  majority 
and  remains  in  power  subject  to  maintaining  the 
confidence  of  such  a  majority.  In  addition,  in  these 
two  systems  the  government  can  dissolve  (or  ask  to 
dissolve)  the  parliament  and  call  for  legislative  elec¬ 
tions  before  the  legislative  term  ends.  The  combina¬ 
tion  of  these  mechanisms  functionally  provides  that 
the  party  or  coalition  that  enjoys  a  parliamentary 
majority  is  also  the  one  in  charge  of  government.  In 
contrast,  under  presidentialism,  situations  of 
“divided  government”  can  exist,  in  which  the  party 
(or  coalition)  of  the  chief  of  government — the 
president — does  not  enjoy  a  majority  in  congress. 
Under  divided  government,  cooperation  between 
the  executive  and  the  legislative  majority  is  crucial 
to  avoid  deadlock  and  stalemate. 

In  addition,  both  parliamentarism  and  semi- 
presidentialism  are  dual-executive  systems.  Under 
both  arrangements,  the  prerogatives  of  the  execu¬ 
tive  are  divided  between  a  chief  of  government 
(usually  called  a  prime  minister)  and  a  chief  of 
state  (a  monarch  or  a  president  with  limited  and/ 
or  symbolic  powers  under  parliamentarism  and  a 
president  with  extensive  powers  under  semipresi- 
dentialism). 

Origins  and  Diffusion  of  Presidentialism 

Presidentialism’s  origins  can  be  traced  to  the  con¬ 
stitutional  convention  that  took  place  in  Philadel¬ 
phia  and  produced  the  U.S.  Constitution  of  1787. 
The  main  theoretical  underpinnings,  motives,  and 
aspirations  of  the  constitution  were  expressed  by 
the  writings  of  Alexander  Hamilton,  James  Madi¬ 
son,  and  John  Jay,  published  as  The  Federalist 
Papers. 

The  most  important  elements  of  this  institu¬ 
tional  design  were  novel  at  the  time:  a  written 
constitution  that  aimed  at  defining  and  limiting 
political  authority,  a  government  based  on  popular 
sovereignty,  a  chief  of  state  named  “president” 
instead  of  a  hereditary  king,  and  a  system  of 
checks  and  balances  between  the  executive,  legisla¬ 
tive,  and  judicial  branches  designed  to  guarantee 
order  and  security  and  avoid  tyranny. 

The  inspiration  that  grew  from  the  origins  and 
characteristics  of  the  U.S.  Constitution  legitimized 
later  ruptures  with  other  premodern  monarchical 
regimes  that  came  soon  after  the  American  struggle 
for  independence.  Indeed,  the  presidential  model 
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expanded  widely  throughout  Latin  America  with 
the  end  of  Spanish  rule,  the  advent  of  wars  of  inde¬ 
pendence  (1808-1824),  and  the  process  of  state 
formation  that  followed.  Outside  the  Americas, 
presidentialism  never  found  the  same  conditions  to 
expand  and  thrive.  In  fact,  Latin  American  is  really 
the  continent  of  presidentialism  today;  there  are 
only  a  handful  of  non- American  countries  employ¬ 
ing  the  system,  including  Benin  and  Nigeria  in 
Africa  and  the  Philippines  and  South  Korea  in  Asia. 

Types  of  Presidentialism 

The  term  presidentialism  does  not  necessarily  imply 
that  the  executive  is  always  and  in  all  cases  the 
most  powerful  of  the  branches  of  government. 
Researchers  have  identified  different  types  of  par¬ 
liamentary  governments  depending  on  the  primacy 
of  the  government  or  the  parliament  (i.e.,  cabinet 
government,  Kanzlerdemokratie  [chancellor  dem¬ 
ocracy],  or  assembly  government).  A  similar  dis¬ 
tinction  can  be  made  between  at  least  two  types  of 
presidential  systems:  those  based  on  “equilibrium 
among  branches”  and  those  based  on  “executive 
dominance.”  Three  variables  differentiate  these 
categories:  ( 1 )  the  powers  of  the  president  vis-a-vis 
congress,  (2)  the  role  of  the  Supreme  Court  as  an 
arbiter  of  executive-legislative  conflict,  and  (3)  how 
parties  and  the  party  system  are  structured.  The 
United  States  is  the  paradigmatic  example  of  presi¬ 
dentialism  based  on  “equilibrium  among  branches,” 
while  most  of  the  other  cases  fall  into  the  “execu¬ 
tive  dominance”  category. 

The  U.S.  president  faces  a  powerful  legislative 
branch  with  significant  legislative  and  oversight 
prerogatives  and  complete  autonomy  with  respect 
to  its  agenda  and  its  session  calendar.  On  the  other 
hand,  the  majority  of  the  remaining  presidential 
constitutions  establish  a  more  powerful  president 
and  a  weaker  legislature  than  exists  in  the  United 
States.  In  contrast  to  the  U.S.  president,  Latin 
American  presidents  not  only  enjoy  reactive  powers 
(such  as  veto  power)  but  also  proactive  and  agenda¬ 
setting  powers.  Among  the  latter  are  the  following: 
(a)  power  to  legislate  through  emergency  decree 
powers,  (b)  exclusive  right  of  legislative  initiative  in 
some  areas  (such  as  the  budget  or  the  organization 
of  public  administration),  (c)  ample  powers  over 
the  budgetary  process,  (d)  authority  to  initiate  a 
referendum  on  general  topics  or  specific  laws, 


(e)  authority  to  convene  congress  for  special  ses¬ 
sions,  and  (f)  authority  to  ask  congress  to  prioritize 
executive  initiatives.  Notwithstanding  these  pow¬ 
ers,  Latin  American  legislatures  are  not  mere  rub¬ 
ber  stamps.  They  do  exercise  their  legislative  and 
oversight  powers  but  to  a  lesser  extent  than  their 
U.S.  counterpart. 

In  addition,  the  Supreme  Court  historically  has 
played  a  crucial  role  in  the  working  of  U.S.  presi¬ 
dentialism,  acting  as  arbiter  in  conflicts  between  the 
executive  and  the  legislature.  The  combination  of 
the  Court’s  recognition  as  an  arbiter  of  interbranch 
conflict  and  its  significant  institutional  prestige  has 
resulted  in  the  Court  operating  as  a  third  branch  of 
government.  This  is  distinct  from  other  presidential 
countries  where  Supreme  Courts  have  often  been 
unable  to  adequately  exercise  their  role  as  arbiter 
due  to  undue  influence  from  the  executive. 

Historically,  in  executive-dominant  presidential 
systems,  the  usual  way  of  limiting  presidents’ 
power  has  been  through  the  adoption  of  non- 
reelection  clauses  (either  lifelong  or  nonimmedi- 
ate).  These  measures  produce  presidents  with 
ample  ruling  powers  but  without  the  ability  to 
extend  their  terms  of  office.  In  spite  of  term  limits, 
a  number  of  Latin  American  presidents  have 
attempted  to  remove  these  clauses,  provoking  an 
increase  of  executive-legislative  conflict,  a  weak¬ 
ening  of  the  mechanisms  of  horizontal  account¬ 
ability  and  oversight,  and,  more  generally,  increased 
levels  of  political  instability. 

The  nature  and  structure  of  parties  and  party 
systems  also  differentiate  the  types  of  presidential¬ 
ism  in  functional  terms.  Two-party  systems  with 
low  internal  party  discipline  such  as  that  of  the 
United  States  not  only  provide  the  president  a 
wider  margin  to  rule  under  divided  government 
conditions  but  have  also  contributed  to  limiting 
the  power  of  presidents  who  enjoy  a  majority  in 
congress.  In  any  case,  while  the  existence  of  a  two- 
party  system  with  weak  parties  might  allow  presi¬ 
dents  to  more  successfully  push  their  agendas,  it 
also  obliges  presidents  to  pay  careful  attention  to 
congressional  autonomy,  power,  and  will. 

On  the  other  hand,  presidential  forms  of  gov¬ 
ernment  that  coexist  with  multiparty  systems  or 
two-party  systems  with  disciplined  parties,  as  is 
the  case  in  much  of  Latin  America,  have  produced 
both  cases  of  executive  dominance  and  cases  of 
compromise  between  executive  and  congress. 
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Clear  examples  of  dominance  situations  include 
those  cases  in  which  presidents,  supported  by  a 
disciplined  legislative  majority,  take  full  advantage 
of  the  situation  and  push  their  own  agendas  with¬ 
out  seeking  agreements  with  opposition  parties. 
Presidents  who  lack  congressional  majorities  may 
also  decide  to  use  (and  abuse)  their  proactive  pow¬ 
ers  and  rule  alone,  circumventing  congress. 
Alternatively,  presidents  facing  a  minority  situa¬ 
tion  can  build  coalitions  to  avoid  deadlock  and 
stalemate,  offering  cabinet  positions  to  parties 
with  legislative  representation  in  exchange  for  leg¬ 
islative  support  for  executive  initiatives.  This  is  an 
example  of  consensual  interbranch  relations. 

Criticisms  and  Debate 

During  the  1980s,  a  large  number  of  countries  (first 
in  Latin  American  and  then  in  Eastern  European) 
moved  toward  democracy,  requiring  them  to 
restore,  modify,  or  even  create  constitutions  and 
rules  for  political  competition.  This  context  pro¬ 
vided  an  opportunity  to  discuss  different  govern¬ 
ment  designs,  to  compare  the  advantages  and 
disadvantages  of  presidentialism  and  parliamen¬ 
tarism,  and  to  revise  and  eventually  modify  how 
political  institutions  were  organized.  The  political 
scientist  Juan  Linz  occupied  a  prominent  position 
in  this  debate  as  he  took  the  lead  in  a  fierce  schol¬ 
arly  criticism  of  presidentialism,  basing  his  criticism 
on  the  numerous  presidential  democracies  in  Latin 
America  that  underwent  institutional  crises  and 
eventually  broke  down  during  the  1960s  and 
1970s.  In  short,  critics  argued  that  presidentialism 
was  less  efficient  than  parliamentarism  in  guaran¬ 
teeing  political  stability  and  democratic  governabil- 
ity.  They  identified  four  perils  or  problems  inherent 
in  the  logic  of  presidential  design. 

First,  the  dual  legitimacy  created  by  presiden¬ 
tialism  is  prone  to  deadlock  and  stalemate.  Under 
presidentialism,  both  the  agents  of  the  popular 
vote  (the  president  and  the  legislature)  have  a 
legitimate  claim  to  power  and  popular  support 
without  an  official  procedure  to  solve  conflicts 
between  the  two.  Situations  of  divided  government 
constitute  a  serious  difficulty  as  opposition  parties 
in  congress  have  few  incentives  to  cooperate  if 
credit  for  success  is  not  attributed  to  them,  while 
at  the  same  time  they  will  likely  bear  the  cost  of 
failure.  Presidents  may  even  encounter  problems  in 


marshaling  support  from  their  own  parties,  as  leg¬ 
islators  of  the  president’s  party  can  resist  or  reject 
bills  from  the  executive,  risking  neither  their  posi¬ 
tions  nor  the  dissolution  of  congress.  This  problem 
does  not  exist  under  parliamentarism,  where  gov¬ 
ernments  can  maintain  themselves  in  office  only  if 
they  enjoy  legislative  support. 

A  second  problem  with  presidentialism  is,  accord¬ 
ing  to  its  critics,  the  rigidity  implied  in  fixed  terms. 
Fixed  terms  make  institutional  responses  to  critical 
situations  much  more  difficult.  The  possibilities  for 
replacing  an  unpopular  or  moribund  president  (e.g., 
resignation,  impeachment)  are  complicated,  and  the 
mere  intention  of  applying  them  might  result  in 
further  complications  and  a  deepening  of  the  crisis. 
On  the  contrary,  under  parliamentary  arrange¬ 
ments,  the  replacement  of  government  and  the  call 
for  anticipated  elections  are  normal  institutional¬ 
ized  political  solutions  that  are  always  available  and 
easy  to  use  in  times  of  crisis. 

The  third  shortcoming  of  presidentialism  is  the 
majoritarian  and  zero-sum  character  of  presiden¬ 
tial  elections:  Elections  with  only  one  winner  gen¬ 
erate  strong  political  tensions.  What  makes  the 
zero-sum  nature  of  presidentialism  even  more  seri¬ 
ous  is  that  under  certain  electoral  rules,  a  candi¬ 
date  with  little  popular  support  or  one  who  is 
strongly  deviant  from  the  median  voter  can  win, 
potentially  generating  crises  of  legitimacy  and  situ¬ 
ations  of  conflict,  polarization,  and/or  crisis.  Critics 
of  presidentialism  invoke  as  the  quintessential 
example  the  Chilean  elections  of  1970:  The  leftist 
candidate  Salvador  Allende  won  the  presidency 
with  only  36.2%  of  votes  against  the  center-right 
candidate  Jorge  Alessandri  with  34.9%  and  the 
Christian  Democrat  candidate  Radomiro  Tomic 
with  27.8%.  Despite  this  close  margin  of  victory, 
and  facing  a  hostile  congress,  Allende  began  to 
carry  out  radical  policies.  In  a  context  of  increas¬ 
ing  political  polarization,  the  government  of 
President  Allende  was  overthrown  by  the  military 
in  a  coup  d’etat  on  September  11,  1973. 

The  last  shortcoming  that  critics  note  is  the  dual 
character  of  the  presidential  office,  generated  by 
the  system’s  fusion  of  the  roles  of  head  of  state  and 
head  of  government.  The  symbolic  and/or  ceremo¬ 
nial  duties  of  the  chief  of  state  often  collide  with 
the  tasks  of  a  head  of  government.  To  its  critics, 
the  symbolic  functions  of  a  chief  of  state  are  less 
legitimate  when  carried  out  by  an  individual  who 
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is  the  leader  of  a  party  and  therefore  involved  in 
partisan  power  struggles. 

The  contributions  of  Linz  and  other  critics  of 
presidentialism  sparked  a  revival  in  interest  in  gov¬ 
ernmental  institutions  and  their  effect  on  stability 
and  democratic  performance  as  well  as  several 
responses  that  challenged  Linz’s  argument.  For 
example,  scholars  like  Scott  Mainwaring  and 
Matthew  Shugart,  while  admitting  some  difficul¬ 
ties  associated  with  presidential  systems,  presented 
counterarguments  noting  some  of  the  advantages 
of  presidentialism  over  parliamentarism. 

First,  according  to  these  scholars,  presidential¬ 
ism  offers  more  electoral  options  to  voters,  who 
can  cast  votes  for  both  the  legislature  and  the 
executive  and  can  decide  to  split  their  vote,  a  pos¬ 
sibility  parliamentarism  precludes. 

Second,  presidentialism  offers  citizens  more 
opportunities  to  exercise  accountability  and  express 
their  preferences  in  public  policy  neatly.  Under 
presidentialism,  citizens  can  choose  between  at 
least  two  clearly  different  alternatives  (the  incum¬ 
bent  and  the  opposition)  for  president  and  a  num¬ 
ber  of  alternatives  from  among  which  they  can 
select  the  candidate  closest  to  their  preferences  for 
congress.  In  contrast,  in  Westminster-type  parlia¬ 
mentarisms,  government  and  opposition  are  clearly 
defined  and  it  is  possible  for  voters  to  determine 
governmental  responsibility  for  policies,  but  there 
is  no  room  for  minority  representation.  Under  mul¬ 
tiparty  parliamentarism,  the  exact  opposite  takes 
place:  Citizens  do  not  know  in  advance  how  their 
vote  will  reflect  on  the  final  composition  of  govern¬ 
ment.  In  these  cases,  changes  in  the  partisan  com¬ 
position  of  the  ruling  coalition  can  take  place  even 
between  elections,  and  parties  originally  in  the 
opposition  might  be  invited  to  join  the  government. 

The  third  argument  in  favor  of  presidentialism 
is  that  it  better  facilitates  legislative  control  over 
lawmaking  and  oversight  of  the  executive.  As  the 
survival  of  government  is  not  at  stake,  members  of 
congress  enjoy  wider  margins  of  maneuver  to 
adequately  analyze  bills  and  look  for  broader  sup¬ 
port  for  an  initiative,  ultimately  having  a  greater 
effect  on  legislation. 

Fourth,  presidential  fixed  terms  generate  higher 
democratic  stability  in  countries  with  governabil- 
ity  problems  due  to  weak  parties  or  highly  frag¬ 
mented  or  polarized  party  systems,  where  it  would 
be  difficult  to  form  coalitions  in  a  parliamentary 
context. 


Finally,  some  scholars  argue  for  caveats  to  the 
notion  of  the  zero-sum  nature  of  presidentialism. 
The  existence  of  two  agents  of  the  electorate 
results  in  the  possibility  of  more  than  two  win¬ 
ners.  A  party  that  does  not  win  the  presidency 
but  obtains  a  considerable  number  of  seats  in  the 
assembly  can  limit  the  executive  and  block  its 
initiatives,  especially  if  the  president  possesses 
only  reactive  legislative  powers  (or  limited  proac¬ 
tive  and  agenda-setting  powers).  In  contrast, 
Westminster-style  parliamentarisms  are  good 
examples  of  one  party  taking  a  dominant  posi¬ 
tion  in  both  branches  of  government  and  exercis¬ 
ing  power  with  few  limitations. 

Presidential  Democracies  Today 

For  many  years,  the  combination  of  presidentialism 
and  democracy  only  succeeded  in  the  United  States. 
In  no  other  place  did  presidentialism  coexist  with  a 
stable  democratic  regime.  For  example,  at  the  begin¬ 
ning  of  the  1980s,  only  three  Latin  American  coun¬ 
tries  were  ruled  by  presidents  elected  in  competitive 
elections:  Colombia,  Costa  Rica,  and  Venezuela. 
The  remaining  countries  were  ruled  by  different 
varieties  of  nondemocratic  regimes,  including  hege¬ 
monic  party  systems  (Mexico)  or  military  dictator¬ 
ships  (Argentina,  Brazil,  Chile,  and  Uruguay). 

In  2010,  however,  presidential  democracies  pre¬ 
vailed  across  the  Americas.  Many  of  them  have 
suffered  periods  of  political  crisis  and  instability, 
but  all  of  them  have  shown  unanticipated  resilience 
in  recent  years.  There  have  been  major  setbacks, 
such  as  the  autogolpes  (self-inflicted  coups  by  rul¬ 
ing  presidents  to  close  the  congress)  by  the  Peruvian 
president  Alberto  Fujimori  in  1992  and  by  the 
Guatemalan  president  Jorge  Serrano  in  1993.  In 
other  cases,  presidential  democracies  survived 
thorny  circumstances  including  presidential  im¬ 
peachments,  as  was  the  case  in  Brazil  (1992)  and 
Venezuela  (1993),  or  presidential  resignations  amid 
usually  violent  social  protest  and  uprisings,  as  in 
the  case  of  the  Paraguayan  president  Raul  Cubas 
Grau  in  March  1999,  the  Ecuadorian  Jamil  Mahuad 
in  February  2000,  the  Argentinean  Fernando  de  la 
Rua  in  December  2001,  and  the  Bolivian  Gonzalo 
Sanchez  de  Lozada  in  October  2003. 

In  spite  of  scholarly  suggestions  and  initiatives 
by  some  politicians  to  modify  presidential  arrange¬ 
ments,  all  the  constitutional  conventions  convened 
throughout  Latin  America  since  the  late  1980s 
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have  maintained  or  strengthened  presidentialism’s 
basic  logic  and  tenets.  As  a  consequence,  today  the 
combination  of  presidentialism  and  democracy  is 
not  an  oddity  but  a  common  feature  of  the  politi¬ 
cal  systems  of  a  large  number  of  countries. 
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Principal- Agent  Theory 


Principal-agent  theory  (PAT)  was  developed  in 
the  field  of  economics  in  the  1970s  to  understand 
the  prevailing  problems  that  appear  every  time 
Person  A  (the  “principal”)  asks  Person  B  (the 


“agent”)  to  do  something  on  his  or  her  behalf  for 
a  given  price.  The  basic  assumption  of  PAT  is  the 
existence  of  a  double  asymmetry  between  the 
principal  and  the  agent,  since  they  have  different 
preferences  (e.g.,  the  customer/principal  wants 
healthy  teeth,  while  the  dentist/agent  wants  to 
receive  good  pay)  as  well  as  different  levels  of 
information  (e.g.,  the  customer/principal  can 
observe  the  outcome  but  not  the  costs  of  the 
action  that  the  dentist/agent  takes).  The  solution 
to  this  double  asymmetry  is  an  outcome-based 
contract  in  which  it  is  stated  that  the  agent 
receives  a  bonus  in  case  a  certain  outcome  is 
reached.  PAT  explores  how  these  contracts  should 
be  designed  according  to  the  variations  in  the 
informational  asymmetries  between  principals 
and  agents  in  different  settings. 

Similar  to  many  other  economic  tools,  PAT  has 
been  incorporated  into  political  science  with  the 
general  aim  of  creating  more  parsimonious  mod¬ 
els.  Ironically,  PAT  has  only  marginally  been  used 
for  understanding  one  of  the  most  important  prin¬ 
cipal-agent  relationship  in  politics — the  one 
between  citizens,  or  constituencies  (as  principals), 
and  elected  politicians  (as  agents).  The  reason  is 
that  the  nature  of  this  relationship  differs  too 
much  from  the  basic  premises  of  PAT,  in  which  the 
principal  holds  a  dominant  position  and  is  endowed 
with  all  the  bargaining  power.  As  a  consequence, 
the  principal  makes  a  take-it-or-leave-it  offer  to 
the  agent,  which  becomes  the  main  object  of  study 
for  the  standard  PAT.  On  the  contrary,  in  the  cen¬ 
tral  political  delegation  chain,  the  citizen/principal 
has  a  subordinate  role  vis-a-vis  a  politician/agent, 
who  can  single-handedly  write  or,  at  least,  sub¬ 
stantially  modify  the  terms  of  the  “contract” 
between  them.  In  simple  words,  governments  are 
the  only  agents  in  the  world  that  tell  their  princi¬ 
pals  what  to  do. 

PAT  has  nevertheless  been  applied  extensively 
for  understanding  the  second  delegation  chain  in  a 
polity — that  of  politicians  as  principals  and  bureau¬ 
crats  as  agents.  The  main  reason  is  that  bureau¬ 
cracy  is  one  of  the  fields  in  political  science  that 
has  resisted  more  theoretical  progress,  and  PAT 
offers  a  useful  tool  to  develop  highly  analytical 
models  with  clear  testable  propositions.  In  addi¬ 
tion,  PAT  seems  particularly  well  suited  to  address 
one  of  the  most  long-lasting  puzzles  in  bureau¬ 
cratic  politics:  the  so-called  Weberian  asymmetry. 
As  Max  Weber  indicated,  a  profound  problem  of 
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democratic  polities  is  that  “political  masters”  find 
themselves  in  the  position  of  dilettantes  who 
stand  opposite  of  expert  bureaucrats.  This  asym¬ 
metry  has  initiated  recurrent  scientific  and  ideo¬ 
logical  debates — most  notable  among  them  being 
the  discussion  on  “state  failures.”  For  the  public 
choice  literature  in  general,  and  for  Willian  Nis- 
kanen  in  particular,  bureaucracies  are  too  big  and 
too  inefficient. 

Since  it  was  imported  from  economics  in  the 
early  1980s,  PAT  has  offered  creative  answers  to 
these  debates  surrounding  public  bureaucracies 
and,  while  answering  them,  has  also  undergone  an 
important  evolution.  This  entry  surveys  the  main 
aspects  of  this  transformation  of  a  fundamentally 
economic  PAT  into  a  more  political  theory. 

From  Ex  Post  to  Ex  Ante  Theories 

The  first  setting  in  which  PAT  flourished  was  the 
American  literature  on  congressional  control  of 
bureaucracy.  Prior  to  PAT,  the  prevailing  view 
was  that  of  an  unresponsive  bureaucracy — based 
on  the  fact  that  Congress  devoted  only  a  small 
fraction  of  its  resources  to  the  direct  oversight  of 
bureaucratic  actions.  Against  this  perspective, 
PAT  authors  argued  that  the  lack  of  monitoring 
activity  was  actually  a  signal  that  control  was 
working  properly.  It  is  the  possibility  of  ex  post 
sanctions  (e.g.,  hearings,  investigations,  limited 
budgetary  appropriations)  that  acts  as  an  effective 
incentive  for  the  bureau  to  serve  congressional 
interests.  Similar  to  how  a  car  lot  owner’s  prop¬ 
erly  designed  commission  system  can  ensure  that 
a  car  sales  agent  performs  well  without  any  need 
for  direct  monitoring,  Congress  may  possess  suf¬ 
ficient  ex  post  rewards  and  sanctions  to  control 
public  bureaucracies. 

This  “congressional  dominance”  approach 
contributed  to  the  bureaucratic  literature  by  pro¬ 
viding  numerous  empirical  tests  derived  from 
rigorous  theoretical  predictions — mostly  showing 
the  congruence  between  federal  regulatory  agen¬ 
cies  and  changes  in  congressional  ideological 
preferences.  The  ultimate  empirical  proof  that 
bureaucratic  agents  follow  legislators’  preferences 
would  be  the  latter’s  high  incumbent  reelection 
rate  (more  than  90%  in  the  U.S.  House  of 
Representatives,  according  to  some  calculations/ 
estimations). 


Nevertheless,  many  authors  have  cast  doubts 
about  the  usefulness  of  ex  post  PAT  sanctions  in 
the  political  sphere.  Unlike  what  happens  in  the 
private  sector,  where  the  principal  may  openly 
expose  an  agent’s  misbehavior,  a  politician  who 
denounces  bureaucratic  noncompliance  runs  the 
risk  of  being  accused  by  the  electorate  of  lack  of 
previous  monitoring  zeal.  Since  politicians  cannot 
credibly  rely  on  ex  post  controls,  they  may  develop 
ex  ante  mechanisms  of  bureaucratic  oversight, 
such  as  imposing  detailed  administrative  proce¬ 
dures  that  “stack  the  deck”  in  favor  of  congres¬ 
sional  constituencies.  For  example,  members  of 
Congress  can  establish  transparency  requirements 
or  order  the  bureau  to  consult  some  interest 
groups  before  entering  into  any  relevant  decision. 
That  would  also  be  the  PAT  explanation  of  the 
1946  Administrative  Procedure  Act  that  was  bur¬ 
dened  with  multiple  requirements  that  limited  ex 
ante  bureaucratic  discretion.  In  general,  PAT  argu¬ 
ments,  based  on  politicians’  electoral  interests, 
challenged  the  prevailing  view  in  the  law  and  eco¬ 
nomics  literature  that  had  traditionally  explained 
administrative  procedures  as  being  the  result  of 
normative  concerns  for  fairness  and  equity.  This 
shift  of  focus  from  ex  post  to  ex  ante  mechanisms 
represents  the  first  step  away  from  economic 
orthodoxy  in  the  political  evolution  of  PAT  mod¬ 
els  of  bureaucracy. 

From  Dyadic  to  Complex  Theories 

PAT,  as  directly  imported  from  economics,  ini¬ 
tially  focused  on  the  dyadic  relationship  between 
Congress  and  bureaucracy.  Nevertheless,  from  the 
mid-1980s  onward,  more  realistic  models  of  polit¬ 
ical  control  of  bureaucracy  were  developed  in 
which  the  dyadic  relationship  became  a  more  com¬ 
plex  one,  with  Congress  sharing,  and  frequently 
competing  for,  bureaucratic  oversight  with  other 
principals,  such  as  the  presidency,  federal  and  state 
courts,  and  interest  groups. 

The  main  picture  that  is  emerging  from  these 
complex  models — some  of  them  involving  up  to 
14  principals — is  more  pessimistic  than  the  one 
from  the  early  dyadic  PAT  models  of  congressional 
dominance.  In  general,  political  control  of  bureaus 
is  poor  because  they  can  play  some  political  prin¬ 
cipals  (e.g.,  a  Republican  Congress)  off  against 
others  (e.g.,  a  Democratic  president).  In  addition, 
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the  political  control  hypothesis  seems  to  fail  even 
in  the  most  favorable  conditions,  which  has  been 
demonstrated  by  the  case  of  the  probusiness 
Reagan  administration  and  the  Environmental 
Protection  Agency  (EPA).  Despite  that  all  possible 
political  control  tools  were  mobilized,  the  Reagan 
administration  was  unable  to  significantly  reduce 
EPA’s  activities. 

From  Control  to  Organizational  Theories 

Most  public  administration  scholars  agree  that 
while  PAT  has  successfully  explored  the  issue  of 
political  control  of  bureaucracies,  it  has  been  at  the 
expense  of  treating  bureaucracies  as  black  boxes 
that  mysteriously  transform  inputs  into  outputs. 
The  (extremely  varied)  organizational  characteris¬ 
tics  of  bureaucracies  have  been  mostly  left  out.  One 
reason  is  that  PAT,  which  emerged  in  the  congres¬ 
sional  control  debate,  initially  became  more  of  a 
theory  of  legislatures  than  a  proper  theory  of 
bureaucracies.  Nevertheless,  researchers  started  to 
acknowledge  that  the  study  of  the  political  control 
of  bureaucracies  presents  serious  problems  of  scien¬ 
tific  validity,  chief  among  them  being  the  dilemma 
of  “observational  equivalence.”  The  same  empiri¬ 
cal  observation,  the  lack  of  political  monitoring 
and  sanctions  for  bureaus,  may  be  an  indication  of 
bureaucratic  independence  and,  at  the  same  time, 
of  its  exact  opposite — that  is,  a  bureaucracy  so 
compliant  that  there  is  no  need  for  supervision. 

Accordingly,  in  what  has  been  considered  to  be 
the  most  important  single  development  in  the 
modern  theory  of  bureaucracy,  PAT  scholars  have 
increasingly  shifted  their  attention  from  how  poli¬ 
ticians  control  bureaucrats  to  how  politicians 
design  bureaucratic  organizations.  One  of  the 
most  explored  organizational  features  is  the  vari¬ 
ance  in  the  autonomy  of  public  employees:  why 
some  public  bureaucracies  are  like  their  private 
counterparts — that  is,  the  principal  (government) 
is  free  to  choose  its  agents  (public  employees) — 
and,  on  the  contrary,  other  (most,  but  far  from  all) 
public  bureaucracies  have  autonomous  civil  ser¬ 
vice  systems  that  limit  the  capacity  of  government 
to  choose  the  public  employees  that  are  working 
for  it.  The  study  of  this  puzzle  has  forced  PAT 
scholars  to  seriously  question  the  source  and 
nature  of  the  problems  that  exist  in  a  principal- 
agent  relationship. 


From  Agent-as-the-Problem  to 
Principal-as-the-Problem  Theories 

The  canonical-economic  PAT  had  focused  on  the 
problems  created  by  the  agent,  who  may  take 
advantage  of  his  or  her  “hidden  information”  and 
“hidden  actions.”  The  credible  commitment  prob¬ 
lem  of  a  car  lot  owner  (principal)  not  paying  the 
promised  commission  to  a  salesman  (agent)  is 
quite  unrealistic  in  the  private  sector,  given  the 
existence  of  an  external  third  party  (public  institu¬ 
tions)  with  the  ability  to  enforce  contracts. 
Nevertheless,  in  politics,  the  principal  is  at  the 
same  time  the  third  party — or,  at  the  least,  may  be 
able  to  affect  the  third  party’s  behavior  through 
legislative  measures.  As  numerous  scholars  have 
shown,  the  problems  of  credible  commitment  cre¬ 
ated  by  all-powerful  principals  may  overshadow 
the  traditionally  explored  informational  advan¬ 
tages  of  agents.  One  of  the  most  significant  exam¬ 
ples  is  monetary  policy,  where  governments,  no 
matter  how  benevolent  they  may  appear,  face  an 
unavoidable  problem  of  credibility  in  the  eyes  of 
investors.  Just  before  elections,  governments  will 
always  find  an  advantage  in  granting  opportunistic 
short-term  benefits  to  their  constituents. 

A  more  political  strand  of  PAT  has  explored 
how  different  mechanisms,  such  as  the  delegation 
of  monetary  policy  to  an  agent  that  is  known  to 
have  different  monetary  preferences,  may  mitigate 
this  inherent  problem  of  credibility.  One  para¬ 
doxical  result  of  this  literature  is  that  multiple 
principals  overseeing  a  bureaucracy,  considered  as 
a  recipe  for  disaster  by  the  standard  economic 
PAT,  actually  have  beneficial  effects  for  monetary 
policy  because  a  safe  “core”  of  economic  policies 
is  protected  from  politicians’  opportunism.  The 
loss  of  political  control  over  the  bureaucracy — at 
least  over  bureaucracies  dealing  with  potentially 
time-inconsistent  policies — can  thus  be  a  socially 
desirable  outcome. 

All  in  all,  two  main  patterns  emerge  from  this 
review  of  the  influence  of  PAT  in  political  science. 
First,  the  parsimonious  models  of  PAT  have  revo¬ 
lutionized  the  study  of  politics  and  particularly  of 
public  bureaucracies,  illuminating  numerous 
empirical  puzzles.  At  the  same  time,  the  study  of 
politics  has  also  changed  the  core  assumptions 
and  predictions  of  canonical  PAT.  In  sum,  thanks 
to  the  importation  of  the  economics-born  PAT, 
political  science  has  evolved  into  a  more  economic 
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discipline,  but  economics  has  also  become  more 

political.  As  a  result,  the  accumulated  knowledge 

in  both  disciplines  has  benefited  extensively. 

Victor  Lapuente 
University  of  Goteborg 
Goteborg,  Sweden 
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Prisoners’  Dilemma 


The  prisoners’  dilemma  (PD)  is  usually  presented 
with  the  following  narrative.  Two  men  are  arrested 
after  committing  a  crime.  Only  a  confession  by 
one  or  both  of  them  can  lead  to  a  conviction  for 
the  crime.  If  both  of  them  remain  silent,  each  will 
be  charged  with  a  lesser  offense  and  serve  a  light 
sentence.  If  one  confesses  while  the  other  remains 
silent,  the  one  who  confesses  will  be  set  free  in 
exchange  for  his  testimony  against  the  other,  and 
the  one  who  remains  silent  will  be  convicted  of  the 
crime  and  receive  a  full  sentence.  If  both  confess, 
both  of  them  will  be  convicted  of  the  crime  but 
will  receive  a  reduced  sentence.  The  dilemma  here 
is  that  regardless  of  what  the  other  chooses  to  do, 
it  will  be  better  for  each  of  them  to  confess. 


However,  if  they  both  confess,  they  will  be  worse 
off  than  they  would  have  been  if  both  of  them 
had  remained  silent.  In  this  entry,  various  game- 
theoretical  strategies  and  possible  applications  in 
political  science  are  discussed. 

The  basic  structure  of  the  game  is  presented  in 
Table  1,  a  2  X  2  matrix  that  specifies  the  payoff 
each  player  receives,  where  T  >  R  >  P  >  S. 

The  game,  thus,  has  a  unique  equilibrium  in 
which  both  players  defect  and  receive  payoff  P. 
They  would  have  received  a  higher  payoff,  R,  had 
each  of  them  chosen  “cooperate,”  but  this  is  not 
possible  since  each  of  them  has  a  dominant  strat¬ 
egy  to  “defect.” 

In  a  repeated  PD,  the  choices  made  today  not 
only  determine  today’s  outcome  but  can  also  influ¬ 
ence  the  choices  made  in  the  future.  If  the  players 
were  to  repeat  the  game  for  a  finite  and  known 
number  of  times,  backward  induction  would  imply 
that  both  players  defect  in  every  round.  In  the  last 
round,  neither  player  fears  the  consequences  of  cur¬ 
rent  choices,  so  they  will  both  choose  “defect.”  As 
a  result,  both  players  end  up  choosing  “defect”  in 
each  round,  expecting  the  other  player  to  “defect” 
in  the  following  round.  However,  if  the  players 
were  to  play  an  infinitely  repeated  PD  or  a  finite 
repeated  PD  for  an  unknown  number  of  times, 
backward  induction  no  longer  applies.  When  a 
game  is  repeated  infinitely  or  repeated  for  a  finite 
but  unknown  number  of  times,  players  use  weighted- 
average  payoffs  to  determine  which  strategy  to 
choose.  To  calculate  the  weighted-average  payoffs 
in  a  given  round,  future  payoffs  are  multiplied  by  a 
discount  factor  q,  either  to  reflect  that  future  pay¬ 
offs  are  valued  less  than  present  payoffs  or  as  a 
probability  that  there  will  be  another  round  in  the 
future.  The  value  of  cooperation  in  a  given  round 
depends  on  the  value  of  q,  increasing  as  q  approaches 
one.  As  long  as  q  is  not  zero,  both  players  are  better 
off  if  they  both  choose  “cooperate”  in  every  round. 

Table  I  Prisoners’  Dilemma 

Player  2 

Cooperate  Defect 
Cooperate  R,  R  S,  T 

Player  1 

Defect  T,  S  P,  P 

Source:  Axelrod,  R.  (1984).  The  evolution  of  cooperation. 
New  York:  Basic  Books. 
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There  is  an  equilibrium  in  which  both  players 
defect  in  every  round  because  each  player  chooses 
a  strategy  to  “always  defect”  and  such  an  equilib¬ 
rium  exists  for  any  value  of  q.  However,  it  is  no 
longer  the  only  possible  equilibrium  outcome  for 
both  players  to  defect  in  every  round  because  play¬ 
ers  can  play  reciprocal  strategies  that  condition 
their  choices  in  a  given  round  on  the  course  of  the 
game.  For  instance,  players  can  choose  to  play  a 
grim  trigger  strategy,  which  punishes  the  defector 
by  playing  defect  for  all  future  rounds  of  the  game. 
This  reciprocal  strategy  can  produce  an  equilib¬ 
rium  in  which  both  players  cooperate  in  every 
round  for  certain  values  of  q.  In  fact,  according  to 
the  Folk  theorem,  a  wide  range  of  payoffs  can  be 
supported  in  infinitely  repeated  games  or  repeated 
games  with  finite  but  unknown  number  of  rounds 
as  long  as  q  is  not  zero,  and  consequently,  the  set 
of  strategies  that  support  mutual  cooperation  in 
every  round  can  be  quite  large  depending  on  the 
value  of  q.  There  exists  as  yet  no  consensus  with 
respect  to  the  reasons  that  may  be  invoked  for 
favoring  one  reciprocal  strategy  over  the  other 
when  any  of  them  can  support  mutual  cooperation 
in  every  round. 

Games  of  incomplete  information  offer  another 
scenario  in  which  players  might  achieve  mutual 
cooperation  even  in  a  repeated  PD  with  a  finite 
and  known  number  of  rounds.  For  instance,  sup¬ 
pose  there  is  a  probability  z  that  Player  2’s  payoffs 
are  different  from  those  depicted  in  Table  1  and 
that  his  dominant  strategy  is  a  tit-for-tat  strategy. 
A  tit-for-tat  strategy  player  cooperates  in  the  first 
round  and  then  does  whatever  his  or  her  opponent 
did  in  the  previous  round.  When  this  uncertainty  is 
introduced,  a  “cooperate”  by  Player  2  in  a  given 
round  may  be  interpreted  as  a  sign  that  Player  2  is 
a  tit-for-tat  strategy  player.  If  Player  2  is  a  tit-for- 
tat  strategy  player,  Player  1  is  better  off  if  he  or  she 
chooses  “cooperate”  in  the  following  round  and 
achieves  mutual  cooperation  in  future  rounds. 
There  is  a  chance  that  the  “cooperate”  choice  by 
Player  2  is  a  mistake  and  that  Player  2  will  choose 
“defect”  in  the  following  round.  Then,  Player  1 
will  receive  S,  which  is  worse  than  the  P  that  he  or 
she  would  have  received  if  he  or  she  had  kept  on 
playing  “defect.”  However,  if  z  is  high  enough  and 
there  are  enough  rounds  left  in  the  game,  Player  1 
will  be  willing  to  risk  this  possibility.  At  the  same 
time,  there  is  an  incentive  for  Player  2  to  pretend 


to  be  a  tit-for-tat  strategy  player  even  if  he  or  she 
is  not  when  Player  1  faces  such  an  uncertainty 
because  then  both  players  can  achieve  mutual 
cooperation  and  receive  payoff  R  for  some  rounds. 
As  they  near  the  last  round,  they  will  be  less  will¬ 
ing  to  choose  “cooperate”  since  the  fact  that  they 
will  both  choose  “defect”  in  the  last  round  has  not 
changed.  They  will  not  be  able  to  achieve  mutual 
cooperation  in  every  round,  but  they  will  be  able 
to  achieve  mutual  cooperation  in  some  rounds. 

Since  Merrill  Flood  and  Melvin  Dresher  devised 
PD  as  part  of  a  RAND  Corporation  experiment  on 
game  theory,  experiments  have  been  a  major  part  of 
PD  literature.  The  most  frequently  examined  depen¬ 
dent  variable  in  PD  experiments  is  the  number  of 
mutual  cooperations  achieved  relative  to  the  num¬ 
ber  of  games  played.  However,  the  results  will  vary 
depending  on  the  specific  environment  of  the 
experiment.  It  has  been  established  that  that  share  is 
never  zero,  but  it  is  never  100%  either.  The  fact  that 
observations  of  real  people  have  shown  that  people 
usually  manage  to  achieve  mutual  cooperation  even 
in  repeated  games  with  a  finite  and  known  number 
of  rounds  motivated  the  research  on  finitely  repeated 
PD  with  uncertainty,  which  appears  to  be  more 
consistent  with  the  experimental  results. 

Computer  simulation  is  another  way  researchers 
experiment  with  PD  games.  They  are  particularly 
useful  in  exploring  the  evolutionary  aspect  of 
repeated  PD,  such  as  the  best  strategy  for  players’ 
survival,  the  change  in  strategies  among  popula¬ 
tions,  or  the  stability  of  strategy  composition. 
Robert  Axelrod’s  computer  tournaments  conducted 
in  1980  are  part  of  a  study  on  the  emergence  of 
cooperation.  A  number  of  strategies  were  submit¬ 
ted  to  a  tournament  of  repeated  PD,  in  which  each 
strategy  was  matched  against  all  the  other  pro¬ 
grams  submitted,  against  itself,  and  against  a 
program  that  chooses  cooperation  or  defection  ran¬ 
domly.  Axelrod  tallied  how  well  each  strategy  did, 
and  the  winner  was  a  tit-for-tat  strategy. 

The  PD  is  generally  applied  to  demonstrate  the 
difficulty  involved  in  achieving  cooperative  out¬ 
comes  when  there  exists  an  incentive  to  cheat  and 
take  advantage  of  the  other.  International  politics 
has  been  an  area  in  which  PD  has  been  applied 
actively  because  states  engage  in  bilateral  interac¬ 
tions  without  a  formal  enforcement  mechanism. 
For  example,  it  is  difficult  for  two  states  in  an  arms 
race  to  make  and  implement  an  agreement  to  stop 
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the  arms  race,  although  they  will  be  worse  off  than 
if  they  are  able  to  cooperate  and  stop  it.  Each  state 
fears  that  it  will  be  taken  advantage  of  should  it 
stop  while  the  opponent  continues.  Each  state  is 
also  tempted  to  take  advantage  of  the  opponent 
that  stops  by  continuing  to  accumulate  arms.  The 
same  logic  applies  to  any  international  relations  in 
which  states  can  achieve  favorable  outcomes 
through  international  agreements  but  will  be 
tempted  to  cheat  on  implementation.  It  can  also 
been  applied  to  civil  conflict  situations. 

Although  PD  is  generally  applied  in  the  context 
of  a  two-player  game,  games  with  the  payoff  struc¬ 
ture  of  PD  have  been  applied  in  the  context  of  an 
?z-player  game  (i.e.,  n  >  2).  For  example,  some 
have  interpreted  a  game  of  common-pool  resources 
with  n  players  who  have  PD  incentives  to  be  what 
Garrett  Elardin  referred  to  as  the  tragedy  of  the 
commons.  All  the  people  in  a  community  have  a 
collective  interest  in  sustaining  common  resources. 
Elowever,  as  people  are  self-interested,  each  will  be 
tempted  to  overexploit  the  resources.  The  tragedy 
arises  because,  at  any  point  in  time,  each  individ¬ 
ual  finds  it  in  his  or  her  interest  to  exploit  the  com¬ 
mon  resources  (i.e.,  “defect”)  no  matter  what  the 
others  do.  This  can  be  seen  to  explain  the  need  for 
and  the  emergence  of  a  formal  enforcement  device, 
government  or  a  government  agency  in  most  cases, 
to  provide  public  goods  or  to  enforce  contracts 
among  individuals.  Work  by  the  Nobel  Prize  win¬ 
ner  Elinor  Ostrom  and  her  collaborators  explore 
other,  private  ways  in  which  individuals  can  solve 
the  common-resources  problem. 

Jinny  Bang 
New  York  University 
New  York  City,  New  York,  United  States 
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Privatization 


According  to  Stuart  Butler,  one  of  its  chief  advo¬ 
cates,  privatization  can  be  thought  of  as  the  shift 
of  a  function  in  whole,  or  in  part,  from  the  public 
sector  to  the  private  sector.  This  can  refer  to  a 
variety  of  policies,  including  the  sale  of  state 
assets,  the  contracting  out  of  public  services  to 
private  providers,  the  deregulation  of  various 
market-based  activities,  or  even  the  affixing  of 
user  fees  for  places  that  might  earlier  have  been 
open  access  (parks,  museums,  schools,  or  high¬ 
ways).  This  entry  discusses  alternative  concep¬ 
tions  of  privatization,  assesses  the  arguments  for 
and  against  privatization,  and  examines  some 
attempts  by  governments  to  withdraw  from 
involvement  in  some  aspects  of  their  economies. 

Alternative  Conceptions  of  Privatization 

Privatization  is  not  a  new  phenomenon.  While 
most  economists  have  normally  assumed  the  pri¬ 
vate  sector  to  be  something  like  a  default  position, 
with  government  activities  being  invented,  added 
on,  and  often  at  the  expense  of  the  private  sector, 
in  fact  the  reverse  is  the  case.  Oliver  Letwin,  a 
British  conservative  theorist,  observed  that  the 
private  sector  had  to  be  invented.  This  occurred 
with  the  creation  of  the  great  European  trading 
companies,  such  as  the  British  and  Dutch  East 
India  companies,  founded  in  the  17th  century. 
Notions  of  property  before  the  Renaissance 
assumed  that  different  actors  had  different  rela¬ 
tions  to  the  same  property.  Manorial  estates  were, 
true  enough,  the  domain  of  the  local  lord,  but  he 
was  required  to  provide  security,  order,  and  jus¬ 
tice  to  the  inhabitants  of  the  domain,  while  they 
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had  certain  rights  to  the  usufructs  of  the  property 
and  owed  the  lord  fealty  and  obedience.  The  lord 
in  turn  owed  fealty  to  the  king.  This  changed 
slowly.  For  example,  only  with  the  rise  of  the 
enclosures  of  common  pastureland  did  Britain 
develop  modern  notions  of  property.  Private 
property  did,  of  course,  exist  in  earlier  eras,  and 
much  of  its  modern  conception  is  owed  to  the 
rediscovery  of  Roman  law,  which  had  a  well- 
articulated  set  of  rules  governing  such  property. 
However,  as  the  economic  abuses  associated  with 
the  collapse  of  communism  demonstrate,  private 
property  cannot  exist  without  a  political  system 
that  defines  its  existence,  its  use,  and  the  condi¬ 
tions  of  its  exchange.  That  is,  private  property  is 
defined  by  and  exists  only  because  of  politics. 

Although  state  asset  sales  occurred  in  many 
European  countries  during  the  1950s,  when  con¬ 
servative  parties  took  over  from  left-of-center  par¬ 
ties,  these  were  relatively  limited  in  scope.  Privatizing 
government  assets  and  services  became  a  major 
policy  fashion  in  the  1980s  and  continued  through 
the  first  decade  of  the  21st  century  until  the  finan¬ 
cial  crisis  of  late  2008,  when  many  of  the  conserva¬ 
tive  policy  trends  of  the  late  20th  century  were 
called  into  question.  This  is  not  to  say  that  only 
conservatives  advocated  privatization.  Regimes  as 
different  as  Margaret  Thatcher’s  Britain,  Bill 
Clinton’s  United  States,  and  Franqois  Mitterrand’s 
France  all  actively  pursued  privatization  policies. 
They  often  had  different  reasons  for  pursuing  such 
policies,  and  these  are  explored  as  follows. 

At  the  end  of  the  20th  century,  the  privatization 
movement  was  rooted  in  the  collapse  of  Keynesian 
economics  in  the  late  1970s.  The  oil  crises  of  1973 
to  1974  and  then  1979  were  followed  by  periods 
of  economic  stagnation  and  high  inflation. 
Keynesian  orthodoxy  saw  employment  and  price 
levels  as  trade-offs,  described  by  what  was  called 
the  Phillips  curve.  Keynesians  thought  that  infla¬ 
tion  only  occurred  in  economies  experiencing  full 
employment  and  had  no  explanation  for  the  stag¬ 
flation  that  followed  the  oil  crises.  Their  inability 
to  explain  the  behavior  of  the  economy  created  a 
window  of  opportunity  for  neoclassical  economists 
to  step  in  and  stipulate  that  these  problems  were 
due  to  public  sector  distortions  that  had  accumu¬ 
lated  over  the  postwar  years.  The  solution  to 
economic  stagnation  was,  they  asserted,  less  gov¬ 
ernment.  These  were  the  advocates  of  privatization. 


Arguments  for  and  Against  Privatization 

Belief  in  the  superiority  of  markets  became  wide¬ 
spread  as  the  best  way  to  organize  not  only  the 
production  but  also  the  allocation  of  all  goods  and 
services,  with  “market  failure”  being  considered 
rare  and  the  only  justification  for  public  interven¬ 
tion.  Even  governments  were  thought  to  work  better 
if  they  were  organized  using  market  principles.  This 
latter  version  of  privatization  became  known  as  “the 
new  public  management.”  Here  government  was  to 
be  organized  as  a  private  enterprise,  while  citizens 
were  to  be  treated  as  “customers.” 

While  privatization  was  something  of  an  inter¬ 
national  trend  in  the  late  20th  century,  it  was  not 
adopted  everywhere  for  the  same  reasons.  Harvey 
Feigenbaum,  Jeffrey  Henig,  and  Chris  Hamnett 
(1998)  argued  that  privatization  was  essentially  a 
political  phenomenon  and  that  one  could  distin¬ 
guish  three  types  of  privatization  policies.  The 
types  of  privatization  were  differentiated  by  the 
motives  and  intentions  of  the  policymakers.  The 
least  political  kind  of  privatization  might  be  called 
“pragmatic.”  Pragmatic  privatizations  fill  immedi¬ 
ate  needs  without  political  motives,  such  as  selling 
a  state  asset  to  fill  an  immediate  budgetary  short¬ 
fall  or  contracting  out  to  a  private  provider 
because  the  government  simply  lacks  the  expertise. 
“Tactical”  privatizations  serve  short-term  political 
goals,  such  as  rewarding  supporters  by  awarding  a 
no-bid  contract  or  offering  discounted  shares  in  a 
public  enterprise  privatization  to  allies  of  the  party 
in  power.  “Systemic”  privatizations  are  intended 
to  permanently  transform  the  political  landscape: 
The  sale  of  public  enterprises  to  create  a  nation  of 
share  holders  was  intended  to  create  new  and 
enduring  constituencies  for  the  Conservative  Party 
in  Britain,  while  “shock  therapy,”  the  rapid  sale  of 
public  enterprises  in  Russia,  was  intended  to  per¬ 
manently  block  a  return  to  socialism  by  selling  off 
most  of  the  state’s  assets.  In  developing  countries, 
the  impetus  for  privatization  often  comes  from 
international  aid  and  lending  institutions  that 
require  the  policy  as  a  prerequisite  to  aid.  These 
institutions  usually  favor  privatization  because 
they  are  staffed  by  neoclassical  economists  or 
influenced  by  large,  conservative  contributors  such 
as  the  United  States.  In  other  cases,  international 
lenders  are  simply  seeking  to  pragmatically  avoid 
nepotism  and  other  inefficiencies  in  recipient 
countries. 
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Often,  of  course,  privatizations  are  supported 
by  different  groups  for  different  reasons.  Bureau¬ 
crats  might  favor  a  particular  privatization  for 
pragmatic  reasons;  the  ruling  party  might  see 
political  advantages,  while  outside  consultants 
based  in  think  tanks  might  be  motivated  by  ideol¬ 
ogy.  Thus,  the  typology  offered  above  might  be 
better  thought  of  as  Weberian  ideal  types  rather 
than  hard  and  fast  categories  in  the  real  world. 

Government  Attempts  to  Withdraw 
From  Economic  Involvement 

Whatever  the  motives  of  the  privatizers,  the  impact 
of  privatization  has  also  been  quite  varied.  The 
goals  may  have  been,  as  stated  by  Daniel  Golden, 
a  one-time  administrator  of  the  U.S.  National 
Aeronautics  and  Space  Administration,  “faster, 
better,  cheaper!”  However,  realistically,  privatiza¬ 
tion  policies  could  at  best  accomplish  two  out  of 
the  three.  While  there  were  some  successes  at  the 
municipal  level,  cost  savings  due  to  privatization 
were  rare.  Where  they  were  achieved,  this  was 
often  because  the  private  providers  had  hired 
lower-wage  workers.  Hence,  privatization  policies 
have  normally  been  opposed  by  labor  unions. 

In  a  number  of  cases,  privatization  policies 
drove  up  costs  and  reduced  service.  The  privatiza¬ 
tion  of  British  Rail  led  to  significant  reductions  in 
service,  though  part  of  this  was  because  the 
Conservative  government  had  anticipated  privati¬ 
zation  and  necessary  investments  in  infrastructure 
were  not  made  before  the  railroad  was  sold. 
Privatization  of  water  utilities  in  Britain  followed  a 
similar  pattern.  Perhaps  the  most  spectacular 
increases  in  cost  were  associated  with  privatization 
in  the  military.  Profiteering  often  accompanies 
military  contracts  and  is  alleged  to  have  reached 
spectacular  proportions  in  the  case  of  U.S.  con¬ 
tracts  during  the  Iraq  War.  It  should  be  noted  that 
privatization  of  the  military  may  be  pursued  not 
because  the  governments  have  any  illusions  about 
efficiency  but  because  the  use  of  private  contrac¬ 
tors  is  an  expedient  way  to  expand  unpopular 
military  activities. 

Experience  with  privatization  since  the  1980s 
suggests  that  there  is  less  abuse  where  privatization 
leads  to  a  genuinely  competitive  market,  where 
private  service  providers  are  carefully  supervised 
by  public  authorities,  and  where  well-articulated 


systems  of  accountability  are  in  place.  Even  when 
these  conditions  obtain,  privatization  does  not  nec¬ 
essarily  result  in  greater  efficiencies  or  cost  savings, 
although  this  is  a  matter  continually  under  debate. 
It  also  seems  clear  that  privatization  policies  fre¬ 
quently  have  serious  distributional  consequences. 
Thus,  cost  savings  due  to  new  efficiencies  are  fre¬ 
quently  negated  by  the  need  for  new  subsidies  to 
those  citizens  adversely  affected. 

In  recent  times,  privatizations  have  not  had  the 
popularity  that  was  evident  in  the  1980s  and 
1990s.  This  is  partially  because  there  is  now  sig¬ 
nificant  experience  with  privatization  and  the 
promised  savings  and  efficiencies  have  not  been 
realized.  Moreover,  the  shocks  occurring  in  finan¬ 
cial  markets  after  the  collapse  of  Lehman  Brothers 
in  September  2008  have  led  to  a  serious  reevalua¬ 
tion  of  the  philosophy  that  underpinned  the  priva¬ 
tization  movement. 

Harvey  B.  Feigettbaum 
The  George  Washington  University 
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Process  Tracing 


Process  tracing  is  a  key  method  of  qualitative 
analysis  of  case  studies.  As  discussed  in  this  entry, 
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it  is  defined  as  the  empirical  analysis  of  the 
hypothesized  observable  implications  of  alterna¬ 
tive  theoretical  processes  that  purport  to  explain 
the  outcome  of  a  single  case.  As  such,  it  is  a 
method  of  within-case  analysis  that  can  be  used  to 
develop  and  test  a  historical  explanation  of  an 
individual  case,  although  it  can  also  be  used 
together  with  cross-case  comparative  methods  to 
assess  whether  similar  or  different  processes  are  at 
work  in  different  cases.  This  entry  reviews  the 
epistemological  and  methodological  context  of 
process  tracing,  discusses  standards  of  doing  it 
well,  and  identifies  its  limitations  as  a  means  of 
causal  inference. 

The  Epistemology  and  Methodology 
of  Process  Tracing 

As  outlined  by  Alexander  George  and  others,  pro¬ 
cess  tracing  can  be  used  to  develop  and  test  histori¬ 
cal  explanations  of  a  single  case.  George  and  other 
proponents  of  process  tracing  often  use  the  meta¬ 
phor  of  detective  work  to  illustrate  the  technique. 
Just  as  a  detective  assesses  alternative  suspects’ 
potential  means,  motives,  and  opportunities  to  have 
committed  the  crime  under  investigation,  a  social 
science  researcher  can  use  evidence  to  assess  the 
alternative  explanations  proposed  for  the  outcome 
of  a  historical  case.  This  involves  close  attention  to 
the  hypothesized  causal  mechanisms,  processes, 
and  sequences  that  alternative  theories  predict  must 
be  evident  in  the  history  of  a  case  if  these  theories 
are  to  explain  the  outcome  of  the  case. 

The  emphasis  that  process  tracing  places  on 
causal  processes  and  mechanisms  makes  its  episte¬ 
mological  assumptions  similar  to  those  of  the  sci¬ 
entific-realist  school  of  thought  in  the  philosophy 
of  science.  As  Fred  Chernoff  has  noted,  there  are 
many  variants  of  scientific  realism.  Most  self- 
identified  scientific  realists  agree,  however,  that 
there  are  entities  in  the  world  with  causal  powers 
and  that  these  entities  act  through  causal  mecha¬ 
nisms  to  generate  the  phenomena  that  we  observe. 
In  this  view,  although  causal  mechanisms  are  ulti¬ 
mately  not  directly  observable,  our  hypotheses 
about  how  these  mechanisms  operate  generate 
observable  implications  that  we  can  test  against 
actual  observations  to  make  (potentially  fallible) 
inferences  on  the  validity  and  scope  conditions  of 
our  hypotheses.  The  observable  implications  of 


hypothesized  causal  mechanisms  and  processes 
may  concern  predictions  on  population-wide  pat¬ 
terns,  which  can  be  assessed  through  statistical 
means.  According  to  Henry  Brady  and  David 
Collier,  this  analysis  involves  inferences  from  “data 
set  observations”  or  from  measurements  of  vari¬ 
ables  across  many  different  cases.  The  observable 
implications  of  hypothesized  causal  mechanisms 
may  also  involve  detailed  predictions  about  pro¬ 
cesses  and  sequences  of  events  within  individual 
cases.  It  is  these  within-case  observable  implica¬ 
tions  that  process  tracing  assesses  through  what 
Brady  and  Collier  term  causal  process  observa¬ 
tions,  or  evidence  on  sequences  and  events  within  a 
single  case. 

Process  tracing  proceeds  through  a  combination 
of  deduction  and  induction,  with  the  mix  varying 
depending  on  the  state  of  development  of  theories 
on  the  phenomenon  under  study  and  the  availabil¬ 
ity  of  relevant  evidence.  Deductively,  the  researcher 
uses  extant  theories  to  generate  observable  implica¬ 
tions  within  a  case,  which  can  then  be  tested 
against  the  evidence.  In  this  mode,  the  researcher 
asks  questions  such  as  “Who  should  have  known 
or  who  did  what,  where,  when,  and  how  if  this 
explanation  of  the  case  is  true?”  Inductively,  the 
researcher  develops  detailed  knowledge  of  the 
sequence  of  events  in  a  case  and  uses  this  knowl¬ 
edge  to  generate  additional  potential  explanations 
of  the  case.  These  inductive  explanations,  once 
translated  into  more  rigorous  and  deductive  theo¬ 
ries,  can  generate  additional  testable  implications 
within  the  case  or  within  other  cases,  thereby  help¬ 
ing  the  researcher  guard  against  confirmation  bias. 
Even  when  the  outcome  of  a  case  is  already  known, 
there  are  usually  many  details  about  processes  and 
sequences  within  the  case  that  are  unknown  to  the 
investigator  prior  to  undertaking  process  tracing. 
These  previously  unknown  details  can  provide  an 
independent  check  on  the  investigator’s  theoretical 
assumptions. 

Process  tracing  also  proceeds  through  both 
affirmation  of  theories  that  prove  to  be  consistent 
with  evidence  from  the  case  and  eliminative  induc¬ 
tion  of  theoretical  explanations  that  prove  to  be 
inconsistent  with  the  evidence.  In  this  sense,  pro¬ 
cess  tracing  is  similar  to  Bayesian  analysis,  in 
which  the  analyst  uses  evidence  to  update  prior 
expectations  on  the  relative  likelihood  that  alter¬ 
native  theories  or  explanations  are  true.  As  in 
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Bayesian  analysis,  a  single  piece  of  evidence  might 
be  consistent  with  one  explanation  of  a  case  while 
simultaneously  disproving  many  alternative  expla¬ 
nations.  The  more  an  observable  implication  is 
likely  to  be  true  if  one  explanation  is  true,  and  the 
more  unlikely  it  is  to  be  true  if  any  of  the  alterna¬ 
tive  explanations  is  true,  the  more  this  evidence,  if 
found,  would  increase  our  confidence  in  the  one 
theory  it  fits  and  decrease  our  confidence  in  the 
alternatives.  Thus,  whether  an  instance  of  process 
tracing  is  convincing  and  determinate  in  narrowing 
the  field  to  one  explanation  of  a  case  depends  not 
on  the  number  of  cases,  pieces  of  evidence,  or  alter¬ 
native  explanations  but  on  the  probative  value  of 
the  evidence  regarding  the  alternative  explanations. 

Thus,  even  though  it  involves  only  one  case  and 
potentially  includes  many  variables  and  explana¬ 
tions,  process  tracing  does  not  raise  a  “degrees  of 
freedom”  problem,  which  arises  in  statistical  anal¬ 
ysis  when  there  are  too  many  variables  and  too  few 
cases  to  assess  alternative  explanations  with  any 
confidence.  Process  tracing  does  face  the  more  gen¬ 
eral  problem  of  indeterminacy,  of  which  the 
degrees  of  freedom  problem  is  one  variant,  as  pro¬ 
cess  tracing  may  be  indeterminate  if  several  incom¬ 
patible  explanations  are  equally  consistent  with  the 
evidence  of  a  case.  This  is  an  inherent  problem 
because  there  are  always  potential  additional 
explanations  that  the  researcher  may  not  have  con¬ 
ceived  of  or  examined,  a  challenge  that  Bayesians 
refer  to  as  identifying  the  “catchall  factor,”  or  the 
probability  one  should  place  on  the  possible  truth 
value  of  all  alternatives  to  the  explanation  of  inter¬ 
est.  For  these  reasons,  many  researchers  who  use 
process  tracing  would  agree  with  Bayesians  that 
one  should  never  put  100%  confidence  in  a  theory 
or  explanation. 

Although  it  is  a  within-case  method  of  analysis, 
process  tracing  can  supplement  cross-case  com¬ 
parisons  and  compensate  for  some  of  the  limits  of 
such  comparisons.  For  example,  a  most-similar 
case  comparison  involves  the  comparison  of  two 
cases  that  are  similar  in  all  but  one  independent 
variable  and  that  differ  in  their  outcomes.  A  stan¬ 
dard  challenge  in  this  research  design  is  that  cases 
are  never  perfectly  matched  on  all  but  one  indepen¬ 
dent  variable.  Process  tracing  can  help  determine 
whether  the  difference  in  the  independent  variable 
of  interest  is  related  to  the  outcome,  and  it  can  also 
help  assess  whether  the  residual  differences  between 


the  two  cases  might  also,  or  instead,  contribute  to 
the  differences  in  their  outcomes. 

Two  contentious  issues  are  whether  process 
tracing  can  lead  to  generalizable  knowledge  and 
whether  it  is  suited  to  interpretivist  or  constructiv¬ 
ist  epistemological  assumptions.  Nathaniel  Beck 
has  critiqued  process  tracing  for  producing  only 
explanations  of  individual  cases  rather  than  gener¬ 
alizable  knowledge.  Defenders  of  process  tracing 
respond  that  the  explanation  of  cases  of  historical 
significance  is  an  important  goal  in  its  own  right 
and  that  inferences  based  on  statistical  inferences 
are  more  convincing  if  they  can  be  shown  to  be 
consistent  with  the  mechanisms  and  processes  of 
individual  cases.  Moreover,  they  argue  that  while 
the  explanation  of  an  individual  case  may  not  gen¬ 
eralize  to  other  cases,  it  is  also  possible  to  use  a 
combination  of  induction  and  deduction  to  uncover 
explanations  via  process  tracing  that  may  prove 
widely  generalizable.  Charles  Darwin’s  detailed 
examination  of  a  few  species,  for  example,  led  to 
hypothesized  causal  mechanisms  on  evolution  and 
natural  selection  that  should  apply  not  just  to  the 
species  he  studied  but  to  all  living  things.  It  is 
nearly  impossible  to  know  whether  an  explanation 
of  a  case  will  prove  generalizable,  however,  until 
one  has  first  developed  and  tested  that  explanation 
through  process  tracing. 

Coming  from  another  perspective,  Jeffrey 
Checkel  has  critiqued  process  tracing  as  being  too 
neo-positivist  and  insufficiently  attuned  to  the 
recursivity  of  social  agents  and  social  structures. 
Proponents  of  process  tracing  would  respond  that 
it  has  much  in  common  with  some  forms  of  narra¬ 
tive  analysis,  event  history  analysis,  and  discourse 
analysis  but  that  it  does  aspire  to  verifiable  and 
generalizable  explanations.  If  one  believes  that 
agents  and  structures  are  inherently  recursive  and 
mutually  constitutive  “all  the  way  down,”  no  mat¬ 
ter  how  finely  one  slices  time  or  space,  then  one  is 
likely  to  remain  skeptical  of  process  tracing  and  all 
other  forms  of  inference  that  aspire  to  causal 
explanations. 

Standards  of  Good  Process  Tracing 

There  are  many  excellent  examples  of  process 
tracing  in  the  social  sciences,  a  number  of  which 
are  discussed  in  the  Further  Readings.  Good  pro¬ 
cess  tracing  shares  several  characteristics.  First,  it 
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considers  a  wide  range  of  the  potential  explana¬ 
tions  of  a  case  proposed  by  theorists,  historians, 
journalists,  and  participants,  and  it  gives  a  bal¬ 
anced  assessment  of  the  evidence  for  each.  Second, 
it  provides  a  fairly  continuous  explanation  of  the 
major  events  in  the  case,  with  a  convincing  expla¬ 
nation  of  each  important  turning  point.  It  is  both 
impossible  and  impractical,  of  course,  to  give  a 
fully  continuous  account  of  everything  in  a  case, 
and  even  on  important  turning  points,  the  relevant 
evidence  may  be  inaccessible,  but  the  more  the 
critical  steps  and  potential  anomalies  can  be  con¬ 
vincingly  explained,  the  better  it  is.  Third,  good 
process  tracing  weighs  the  probative  value  of  evi¬ 
dence  relative  to  alternative  explanations  and 
takes  into  account  the  potential  biases  of  the 
sources  of  that  evidence.  Fourth,  and  similar  to 
Bayesian  analysis,  diversity  of  evidence — analogous 
to  a  detective’s  search  for  evidence  on  means, 
motives,  and  opportunities — helps  provide  more 
convincing  explanations  via  process  tracing. 
Finally,  the  use  of  evidence  that  has  “use  novelty,” 
or  evidence  that  was  not  known  or  observed  by  the 
investigator  prior  to  deductively  predicting  its 
expected  value  or  measurement,  provides  protec¬ 
tion  against  confirmation  bias. 

The  Limits  of  Process  Tracing 

Like  all  forms  of  causal  inference,  process  tracing 
is  potentially  fallible.  A  researcher  can  never  be 
fully  confident  of  having  thought  of  all  the  relevant 
alternative  explanations.  Process  tracing  also  usu¬ 
ally  requires  a  great  deal  of  information  and 
research  and  can  be  intensely  time-consuming. 
Even  with  the  most  diligent  research,  there  may  be 
gaps  and  biases  in  the  availability  of  information 
on  key  steps  in  a  hypothesized  process,  especially 
on  sensitive  political  issues  on  which  actors  have 
an  incentive  to  hide  their  actions  and  motives.  The 
results  may  be  indeterminate  if  incompatible  expla¬ 
nations  are  equally  consistent  with  the  evidence. 
Extant  theories  are  often  not  sufficiently  developed 
to  make  detailed  predictions  on  the  processes  we 
should  expect  to  find  in  individual  cases,  and  dif¬ 
ferent  theorists  might  predict  different  processes 
within  a  case  using  the  same  theories.  The  results 
of  process  tracing  may  not  be  generalizable  beyond 
the  individual  case  in  any  particular  study,  and 
whether  the  results  will  prove  generalizable  cannot 


be  known  with  confidence  at  the  outset.  Finally, 
there  is  no  simple  answer  to  how  much  process 
tracing  is  enough.  How  detailed  should  one  be  in 
explaining  a  phenomenon,  or  how  far  back  should 
the  researcher  go  in  time?  Only  a  pragmatic 
Bayesian  answer  is  possible:  One  stops  when  one 
is  confident  that  the  currently  accessible  additional 
evidence  would  not  markedly  change  one’s  confi¬ 
dence  in  the  alternative  explanations. 

Conclusion 

All  methods  of  causal  or  explanatory  inference, 
including  process  tracing,  are  potentially  fallible. 
Despite  its  limitations,  process  tracing  remains  an 
important  method  for  developing  and  testing 
explanations  of  individual  cases.  As  methodolo¬ 
gists  continue  to  develop  techniques  of  process 
tracing,  they  might  focus  on  three  areas.  First,  they 
might  work  out  more  fully  the  similarities  and  dif¬ 
ferences  between  process  tracing  and  Bayesian 
inference.  Second,  they  might  address  more  fully 
critiques  on  whether  and  how  the  results  of  pro¬ 
cess  tracing  can  be  generalized  and  how  process 
tracing  relates  to  interpretivist  and  constructivist 
methods.  Third,  they  might  work  out  more  specific 
guidelines  on  the  kinds  of  evidence  to  look  for  to 
assess  and  adjudicate  among  different  kinds  of 
explanations,  including  rational  choice,  social- 
constructivist,  power,  institutional,  and  psycho¬ 
logical  explanations. 
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If  for  more  than  2  centuries  the  concept  of  prop¬ 
erty  appeared  to  be  understood,  it  is  no  longer 
possible  to  act  as  if  this  were  the  case.  In  common 
language  as  much  as  in  the  language  of  jurists  and 
economists,  the  term  property  was  associated  with 
the  notion  of  the  absolute  and  exclusive  right  of  a 
person  over  a  material  thing.  This  definition  is  no 
longer  accepted  by  experts,  a  fact  that  led  Thomas 
Grey  to  formulate  the  hypothesis  of  the  disintegra¬ 
tion  of  property. 

Therefore,  the  notion  of  property  has  presum¬ 
ably  suffered  a  rapid  philosophical  decline.  Until 
recently,  it  was  considered  essential  to  the  rele¬ 
vance  of  public  and  political  institutions,  a  core 
component  of  the  debate  surrounding  the  legiti¬ 
macy  of  capitalism — so  central  that  the  origins  of 
the  main  ideologies  and  schools  of  thought  that 
still  largely  structure  our  political  world  could  be 
traced  back  to  it.  Today,  however,  property  rights 
are  not  seen  as  central  to  issues  such  as  the  legiti¬ 
macy  of  liberal  democracies  and  the  institutions  of 
mature  capitalism;  instead,  protection  of  individ¬ 
ual  rights  and  of  the  rule  of  law  is  seen  as  a  pri¬ 
mary  concern  in  establishing  such  legitimacy. 

It  seems  hardly  questionable  that  the  notion  of 
property  as  the  absolute  right  of  a  person  over  a 
material  thing  has  lost  both  its  technical  relevance 
and  its  philosophical  importance.  But  it  is  equally 
incontestable  that  the  contemporary  evolution  of 
large  legal  systems  tends  toward  proprietarization, 
a  term  that  describes  the  expansion  of  the  range  of 
objects  that  can  be  owned,  the  spread  of  ownership 
mechanisms,  and  the  strengthening  of  legal  protec¬ 
tion  offered  to  owners.  If  the  aforementioned 


notion  of  property  has  indeed  disappeared,  prop¬ 
erty  itself  remains  central,  although  in  new  forms, 
to  how  institutions  of  mature  capitalism  function. 

History  of  the  Contemporary 
Notion  of  Property 

The  history  of  the  contemporary  notion  of  property 
can  be  divided  into  three  stages:  (1)  rise,  (2)  tri¬ 
umph,  and  (3)  fall.  The  emergence  of  the  notion  of 
property  can  be  traced  back  to  the  Second  Scholastic, 
which  took  place  during  the  17th  century  following 
and  commenting  on  the  Renaissance.  Its  final  defi¬ 
nition  as  the  natural,  absolute,  and  exclusive  right 
of  a  person  over  a  material  thing  was  formulated  by 
modern  natural  law  theorists,  especially  by  Thomas 
Hobbes  and  Samuel  Pufendorf.  A  more  legal,  less 
philosophical  genealogy  could  recount  the  slow 
rejection  of  the  divided-ownership  theory  (which 
distinguishes  between  direct  and  eminent  owners) 
and  would  put  forward  the  ideas  of  French  jurists 
such  as  Francois  Hotman,  Jean  Donneau,  or 
Charles  Dumoulin.  In  any  case,  the  idea  of  property 
as  an  absolute  right  nourished  physiocratic  writings 
and,  at  the  end  of  the  18th  century,  became  a  tru¬ 
ism  of  legal  thinking,  made  famous  by  Robert- 
Joseph  Pothier  or  William  Blackstone’s  well-known 
phrase  in  Book  2  of  the  Commentaries  on  the  Laws 
of  England:  “the  right  of  property  .  .  .  that  sole  and 
despotic  dominion  which  one  man  claims  and  exer¬ 
cises  over  the  external  things  of  the  world,  in  total 
exclusion  of  the  right  of  any  other  individual  in  the 
universe.  ” 

An  individual’s  subjective  property  rights  are 
acknowledged  by  the  Fifth  Amendment  of  the  U.S. 
Bill  of  Rights  ( “No  person  shall  ...  be  deprived  of 
life,  liberty,  or  property,  without  due  process  of 
law;  nor  shall  private  property  be  taken  for  public 
use,  without  just  compensation”),  by  Article  17  of 
the  Declaration  of  Human  Rights  of  1789  (“Since 
the  right  to  property  is  inviolable  and  sacred,  no 
one  may  be  deprived  thereof,  unless  public  neces¬ 
sity,  legally  ascertained,  obviously  requires  it,  and 
just  and  prior  indemnity  has  been  paid”),  and  by 
Article  544  of  the  Napoleonic  Code  (“Property  is 
the  right  to  enjoy  and  dispose  of  things  in  the  most 
absolute  way,  provided  this  usage  is  not  prohibited 
by  laws  or  regulations”).  In  19th-century  Europe, 
Article  544  became  the  symbol  of  a  unitary  and 
centralized  conception  of  property.  After  1804,  the 
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consensus  about  property,  at  least  in  continental 
law,  was  broad  enough  to  raise  this  notion  to  the 
level  of  a  legal  dogma.  On  that  basis,  a  general 
theory  of  property  was  progressively  constructed, 
defining  property  as  a  subjective,  unitary,  com¬ 
plete,  and  perpetual  right  over  material  goods  that 
gives  the  owner,  directly  and  without  any  personal 
mediation,  the  power  to  exclude  anyone  else  and 
to  use  those  things  both  physically  and  legally. 
Moreover,  this  dogmatic  consensus  was  coupled 
with  a  very  large  political  consensus  about  the 
legitimacy  of  the  institution  of  property. 

Not  until  1840  and  Pierre  Joseph  Proudhon’s 
famous  pamphlet  What  Is  Property ?  did  this 
political  consensus  collapse,  quickly  giving  rise  to 
a  violent  dispute — which  turned  into  civil  war — 
between  the  supporters  and  despisers  of  property. 
This  dispute  concerned  the  origins  (natural  or  con¬ 
ventional),  limits,  distribution,  and  legitimacy  of 
property  but  was  based  on  the  same  dogmatic 
definition  of  this  institution.  In  his  pamphlet, 
Proudhon  himself  wrote  that  property  was  the 
“ absolute ,  exclusive,  autocratic  domain  of  a  man 
over  a  thing.”  The  political  dispute  did  not  weaken 
the  prestige  of  the  legal  dogma. 

It  was  only  at  the  end  of  the  19th  century  that 
the  canonical  definition  of  property  was  finally  put 
into  question  because  of  the  evolution  of  property 
law  itself.  Although  this  evolution  is  quite  com¬ 
plex,  some  of  its  aspects  should  be  highlighted: 

•  The  strengthening  of  statutory  and  judicial 
limitations  related  to  property  law — the  concept 
of  the  abuse  of  right,  the  development  of  urban 
regulations,  the  expansion  of  state  powers  of 
expropriation  and  regulatory  takings,  and  so  on. 
This  evolution  leads  some  authors  to  conceive  of 
property  as  a  social  function  more  than  a 
subjective  right. 

•  The  emergence  of  so-called  corporate  properties 
or  large  companies’  collective  properties, 
especially  after  World  War  I — recorded  by  Adolf 
Berle  and  Gardiner  Means  in  a  well-known  book 
The  Modern  Corporation  and  Private  Property, 
published  in  1933.  According  to  the  authors,  the 
switch  from  individual  property  belonging  to 
physical  persons  to  social  property  belonging  to 
legal  bodies  leads  to  a  dissociation  between  the 
owner  and  the  manager — the  property  and  the 
power  of  control  that  it  bestowed. 


•  The  dematerialization  of  property — forcing  legal 
theorists  to  conceive  of  property  as  an  intangible 
object.  Gradually,  the  rights  of  authors  and 
inventors  over  their  creations  and  inventions  have 
come  to  be  characterized  as  a  “property  right.” 

Evolutions  specific  to  mature  capitalism  have  in 
fact  jeopardized  the  property  dogma.  At  the  end  of 
the  20th  century,  it  became  obvious  that  this 
dogma  did  not  reflect  the  reality  of  property  rights 
“in  action.”  Or  is  it  that  we  no  longer  know  what 
property  is? 

Theories  of  Property 

In  law,  a  theoretical  definition  is  self-fulfilling  only 
if  it  slowly  brings  about  a  consensus.  Equally,  a 
theory  of  property  will  be  empirically  validated 
only  if  it  is  commonly  accepted  and  applied  by 
political  representatives,  judges,  lawyers,  and  aca¬ 
demics.  As  of  today,  we  do  not  have  such  a  theory 
at  our  disposal.  Reconstruction  projects  are  numer¬ 
ous,  often  ingenious  and  exciting  (the  work  of 
Charles  Reich,  Richard  Posner,  Margaret  Radin, 
Jeremy  Waldron,  and  Stephen  Munzer,  among 
many  others,  should  be  mentioned  here),  but  none 
of  these  projects  has  succeeded  in  imposing  uni¬ 
form  beliefs  and  practices  on  legal  actors,  and 
some  researchers  such  as  Duncan  Kennedy  doubt 
that  it  is  even  worth  trying. 

However,  the  term  property  has  not  completely 
lost  all  significance,  even  among  experts.  Its  meaning 
can  be  established  by  enunciating  the  specificities 
of  property  rights  according  to  their  (a)  contents, 
(b)  characteristics,  (c)  object,  or  (d)  holder. 

Contents 

Property  is  frequently  associated  with  the  trip¬ 
tych  usus,  fructus,  abusus  (use,  enjoy  the  fruits  of, 
abuse)  in  Continental  law  or  still  sometimes  with 
the  right  to  occupy,  exclude,  transfer,  or  transmit,  to 
refer  to  one  of  Jeremy  Bentham’s  expressions.  Even 
more  frequently,  a  Hohfeldian  nomenclature  has 
been  used  to  enable  the  contents  of  the  rights  to  be 
analyzed  according  to  the  following  four  fundamen¬ 
tal  legal  relationships  of  opposition  and  correlation: 

1.  claim-rights  (^no-rights)  and  duties, 

2.  privileges  (=£  duties)  and  no-rights, 
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3.  powers  (^disabilities)  and  liabilities,  and 

4.  immunities  (Pliabilities)  and  disabilities. 

Having  a  claim-right  means  being  able  to  demand 
something  from  another,  with  the  latter  having  a 
duty  to  meet  the  demand;  a  privilege  exists  where 
there  is  no  duty;  a  power  refers  to  the  capability  of 
creating  legal  relationships  (e.g.,  marriage);  and  a 
person  has  immunity  when  he  or  she  has  no  duty 
to  follow  a  command  from  another. 

Property  should  therefore  be  considered  as  a 
bundle  of  rights,  a  defined  set  of  rights  and  duties, 
privileges  and  no-rights,  powers  and  liabilities,  and 
immunities  and  disabilities.  Thus,  according  to 
Wesley  Hohfeld,  with  absolute  ownership  over 
land  ( property  in  fee  simple ),  the  owner  has  the 
right  to  exclude  anyone  else,  as  well  as  some  legal 
privileges  (e.g.,  the  right  to  use  the  land)  and  the 
legal  power  to  alienate  his  or  her  property  or  some 
of  its  uses  by  giving  legal  privileges  to  third  parties 
(e.g.,  by  giving  temporary  access  to  a  neighbor). 
Finally,  he  or  she  benefits  from  legal  immunities 
(e.g.,  the  property  cannot  be  expropriated  except 
through  a  specific  procedure). 

This  nomenclature  reveals  the  difficulty  of  giv¬ 
ing  a  general  definition  of  property,  since  it  is 
impossible  to  specify  rights,  privileges,  powers,  dis¬ 
abilities,  and  immunities  that  could  be  universally 
and  necessarily  stated  in  every  relationship  charac¬ 
terized  by  legal  actors  as  a  property  relationship. 
The  best  one  can  do  is  to  note  that  the  composition 
of  the  bundle  of  rights  varies  considerably  accord¬ 
ing  to  situations  and  the  objects  concerned. 
Certainly,  it  might  be  useful  to  try  to  identify 
“types  of  bundles”  in  similar  legal  situations — 
“forms”  of  property — but  to  date,  no  theoretical 
proposition  has  succeeded  in  establishing,  in  a 
largely  convincing  way,  one  of  these  forms  as  an 
ideal  type  (as  defined  by  Max  Weber)  of  the  insti¬ 
tution  of  property. 

Indeed,  the  notions  of  property  and  exclusivity 
are  frequently  associated.  But  there  are  good  rea¬ 
sons  to  doubt  that  this  association  is  both  universal 
and  necessary.  First  of  all,  exclusivity  is  sometimes 
only  relative  (some  collective  properties  exclude 
third  parties  but  not  co-owners).  Some  rights  of 
ownership  are  inclusive  (the  holder  of  these  rights 
is  given  a  no-right  to  exclude  any  third  party):  for 
example,  in  French  administrative  law,  public 
property  of  things  assigned  to  free  public  use  or 


even  creative  commons  licenses — share-alike  or 
free  software — that  allow  the  creator  of  the  initial 
software  to  give  third  parties  the  right  to  access 
and  transform  the  initial  software,  coupled  with  a 
no-right  to  forbid  access  to  and  transformation  of 
the  upgraded  version  (this  amounts  to  giving  the 
creator  of  the  upgraded  version  a  no-right  of 
exclusion  for  this  version,  even  though  he  or  she 
owns  it). 

Similarly,  property  and  free  use  are  often  associ¬ 
ated.  Generally,  property  rights  are  composed  of 
legal  privileges,  including  the  right  to  use  the  thing, 
to  benefit  from  it,  and  so  on.  However,  there  are 
numerous  forms  of  ownership  that  do  not  include 
free  use  and  even  sometimes  exclude  it.  In  French 
law,  this  is  the  case  when  a  private  body  owns 
something  that  belongs  to  the  public  domain  and 
is  either  open  to  free  public  use  or  assigned  to  a 
public  service  holding  a  no-right  to  use  the  thing  in 
a  way  contrary  to  its  intended  purpose  (e.g.,  Paris 
airports).  It  is  then  possible  to  conceive  of  a  form 
of  private  ownership  that  would  be  applied  to 
things  that  are  strictly  inalienable. 

Characteristics 

In  Richard  Posner’s  version  of  the  economic 
analysis  of  legal  rules,  the  focus  shifts  from  their 
contents  to  the  conditions  that  they  must  fulfill  for 
a  system  of  private  property  to  function  efficiently — 
these  conditions  being  the  exclusivity,  universality, 
and  transferability  of  property  rights.  This  posi¬ 
tion  is  under  debate,  and  some  questions  have  been 
raised  about  the  real  efficiency  of  such  a  system, 
the  relevance  of  the  efficiency  criteria,  and  the  ben¬ 
efit  to  be  derived  from  the  widespread  use  of  such 
a  model,  given  that  the  model  only  applies  to  cer¬ 
tain  specific  situations  in  which  ownership  rights 
meet  the  required  conditions  and  that — after  all — 
those  situations  might  not  be  so  frequent. 

Object 

The  impossibility  of  defining  property  accord¬ 
ing  to  the  rights  it  confers  or  these  rights’  specifici¬ 
ties  leads  some  authors,  such  as  Charles  Donahue, 
to  state  that  what  distinguishes  property  law  from 
all  other  jural  relationships  is  that  property  law 
deals  with  things.  This  statement  has  been  criti¬ 
cized  on  the  basis  that  almost  anything  can  legally 
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qualify  as  a  “thing.”  Depending  on  legal  systems 
and  existing  laws,  manpower,  ideas,  the  moon, 
traditional  dances,  and  human  DNA  are,  could  be, 
or  are  not  “things,”  which  results  in  a  “thing” 
being  legally  a  thing  only  when  it  is  owned. 
Therefore,  the  object  of  a  property  right  is  a  thing 
but  a  thing  defined,  according  to  the  terms  of  the 
law,  by  the  fact  that  it  is  owned. 

This  was  not  always  the  case.  There  was  a  time 
when  things  were  considered  things  by  nature  and 
what  distinguished  them  was  the  fact  that  they 
were  tangible.  But  slowly,  jurists  have  designed 
complex  legal  mechanisms  for  the  ownership  of 
intangible  objects — rights  (e.g.,  stocks  ownership 
has  been  a  constitutional  right  in  France  since 
1982),  status  (e.g.,  a  person  owns  his  or  her  retire¬ 
ment  pension  according  to  the  European  Court  of 
Human  Rights),  and  even,  in  memory  of  the  late 
Jeremy  Bentham,  future  things  or  legitimate  wishes 
(which  have  been  recognized  as  things  by  the 
European  Court  in  its  2004  decision  in  the  Kopecky 
v.  Slovakia  case).  If  even  a  purely  “imaginary” 
thing  can  be  owned,  anything  that  can  be  some¬ 
what  useful  to  someone  can  be  the  object  of  prop¬ 
erty  rights.  This  approach  brings  to  an  end  the 
property  “dematerialization”  process  that  started 
in  the  19th  century  and  in  fact  amounts  to  replac¬ 
ing  the  notion  of  “thing”  with  the  notion  of  value. 
But  since  it  is  truly  possible  to  own  a  thing  without 
value,  this  distinction  does  not  help  advance  a  new 
and  satisfactory  definition. 

Holder 

A  strong  philosophical  current  associated  with 
the  political  tradition  of  “possessive  individual¬ 
ism”  has  tried  to  equate  subjective  freedom,  per¬ 
sonal  identity,  and  self-ownership,  property  rights 
being  consequently  the  proper  expression  of  indi¬ 
vidual  freedom.  During  the  first  half  of  the  19th 
century,  it  was  even  common  to  consider  that 
ownership  of  property,  beyond  being  a  right,  was 
a  real  political  and  moral  status  that  gave  access  to 
active  citizenship  (the  right  to  vote)  and  privileges 
before  the  courts  (“The  master  is  believed  on  the 
basis  of  his  own  affirmation,”  Article  1781  of  the 
Napoleonic  Code). 

In  contemporary  law,  the  full  legal  ability,  the 
right  to  have  rights,  including  the  right  to  prop¬ 
erty,  constitutes  the  freedom  of  the  subject.  But 


legal  theory  and  legal  practice  have  created  mere 
rights  holders,  meaning  subjects  capable  of  enjoy¬ 
ing  rights  that  they  are  unable  to  exercise  (minors, 
the  mentally  disabled,  animals,  corporate  owners, 
etc.).  Legal  imagination  has  even  produced  proper¬ 
ties  without  owners  through  the  notion  of  patri¬ 
mony  of  affectation  (patrimony  dedicated  to  a 
specific  legal  entity /Zweckvermogen),  invented  by 
the  German  doctrine  during  the  second  half  of  the 
19th  century.  Therefore,  we  cannot  make  the  dis¬ 
tinction  between  private,  public,  or  collective 
forms  of  ownership  by  only  taking  into  account 
the  jural  character  of  the  rights  holders. 

At  most,  we  can  say  that  the  notion  of  property 
is  frequently  associated  with  exclusivity  and  free 
use,  even  with  universal  opposability  and  transfer- 
ability;  that  the  definition  of  its  object  refers  to  the 
notions  of  thing  and/or  value;  and  that  it  is  often 
presumed  to  take  root  in  the  subjective  freedom  of 
owners.  It  is  also  possible  to  try  to  pinpoint  the 
coexistence,  within  the  same  legal  system,  of  sev¬ 
eral  contradictory  conceptualizations  of  property, 
all  of  which  are  used  by  legal  actors  simultane¬ 
ously  (and  often  implicitly)  in  their  reasoning  and 
argumentation,  or  even  more  generally  to  link 
certain  forms  of  property  to  certain  trends  (unitary 
ownership  and  the  unlimited  power  of  wealth 
accumulation,  private  property  and  selfishness, 
collective  property  and  inefficiency,  etc.),  although 
these  associations  are  quite  often  disputed  because 
of  their  ideological  bias. 

We  might  then  conclude  that  property  is  not  a 
purely  technical  notion  (neutral  and  objective)  but 
rather  a  “floating”  concept  whose  applications  are 
always  both  technical  and  ideological.  From  this 
point  of  view,  the  expression  bundle  of  rights  is 
helpful  because  it  expresses  the  various  types  of 
rights  that  the  term  property  can  imply.  It  might 
then  be  advisable  to  stop  wondering  what  property 
is  and  instead  try  to  determine  the  purpose  of  call¬ 
ing  certain  legal  prerogatives  “rights  of  property.  ” 

Why  Do  We  Talk  About  “Property”? 

Legal  actors  who  make  abundant  use  of  the  word 
property  have  goals  that  are  simultaneously  tech¬ 
nical  and  ideological,  as  shown  by  the  history  of 
copyrights  (but  there  are  other  examples  such  as 
modified  living  organisms,  material  cultural  goods, 
human  body  parts,  etc.). 
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Since  the  18th  century,  it  has  been  acknowl¬ 
edged  that  authors  own  an  exclusive  right  to 
reproduce  their  scientific  and  literary  productions 
and  that  this  right  can  be  transferred  to  a  third 
party  at  the  will  of  the  author — for  example,  to 
distributors  for  a  percentage  of  the  income  gener¬ 
ated  by  the  sales  of  these  copies.  Copyright  is 
limited  in  time  as  far  as  the  commercial  applica¬ 
tions  are  concerned;  its  object  is  intangible  (the 
work  is  an  “original  expression”),  and  multiple 
holders  have  simultaneous  rights  over  it  (the 
author  owns  the  copyright  and,  in  continental 
law,  the  moral  rights;  the  public  enjoys  its  intel¬ 
lectual  utilities).  In  the  past,  this  has  led  some  to 
conclude  that  copyright  was  not  a  property  right, 
the  latter  being  an  exclusive  and  perpetual  right 
over  a  material  object  ( Augustin-Charles  Renouard 
defends  this  position  in  his  Traite  des  droits 
d’auteur,  dans  la  litterature ,  les  sciences  et  les 
beaux -arts  [Treaty  of  copyright,  in  literature,  sci¬ 
ence  and  fine  arts],  Paris,  1838-1839) — or  even 
that  it  was  an  imperfect  property  right  that  could 
be  perfected  by  granting  authors  a  copyright  for 
perpetuity,  as  suggested  by  Charles  Comte  in  his 
Traite  de  la  propriete  [Property  treaty]  of  1834. 
That  copyright  should  be  characterized  as  a 
“property  right”  was  therefore  not  a  benign 
transformation;  it  tended  to  integrate  legal  copy¬ 
right  rules  into  the  general  system  of  property,  as 
it  was  understood  by  most  legal  actors.  In  the 
19th  century,  this  general  system  included  rights 
in  perpetuity. 

Since  today  it  is  quite  possible  to  conceive  of 
temporary  property  rights,  the  designation  of  copy¬ 
right  as  “intellectual  property”  is  no  longer  related 
to  the  question  of  unlimited  duration  but  rather  to 
a  willingness  to  increase  the  level  of  legal  protec¬ 
tion  for  copyright  holders  to  the  level  of  protection 
enjoyed  by  owners  of  tangible  things,  to  slowly 
unify  all  the  legal  rules  pertaining  to  various  intan¬ 
gible  creations  (works  of  arts,  inventions,  trade¬ 
marks,  etc.),  and  to  extend  the  scope  of  these  rights 
to  new  objects  (publicity  rights,  business  methods, 
etc.) — in  other  words,  to  reinforce  third-party 
exclusion  mechanisms,  to  add  new  legal  privileges 
for  owners,  and  to  expand  the  range  of  intangible 
things  that  can  be  owned.  Moreover,  using  the 
term  intellectual  property  when  referring  to  copy¬ 
right  helps  promote  a  political  belief  in  the  legiti¬ 
macy  of  property.  Property  being  a  prestigious 


institution  that  is  commonly  associated  with  indi¬ 
vidual  freedom,  the  “property  of  authors”  is  an 
easier  cause  to  defend  than  the  cause  of  their  “tem¬ 
porary  monopoly.”  In  turn,  some  experts  refuse  to 
talk  about  property  when  it  comes  to  copyright, 
either  because  they  reject  the  conception  of  law 
that  supports  those  rights  or  because  of  the  collec¬ 
tive  character  of  works  and  the  public’s  right  to 
access  them  freely. 

The  strength  of  these  critical  trends  has  reig¬ 
nited  the  “property  dispute”  by  reintroducing  the 
notion  of  “positive  community,”  which  has  theo¬ 
logical  origins.  It  has  contributed  to  the  creation  of 
legal  settings  of  “nonownership,”  even  of  “anti¬ 
ownership,”  as  is  the  case  for  some  software  with 
freeware  or  creative  commons  licenses. 

Today,  property  law  is  characterized  by  the  dis¬ 
integration  of  the  idea  of  an  absolute  right  of  a 
person  over  a  tangible  thing  and  by  the  number  of 
definitions  used  to  designate  the  institution  of 
property  and  the  diversification,  expansion,  and 
spreading  of  ownership  mechanisms.  Is  there  a  link 
between  the  destruction  of  the  dogmatic  frame¬ 
work  that  stands  around  the  notion  of  property 
and  the  current  growing  trend  of  proprietariza- 
tion?  As  long  as  it  was  collectively  impossible  for 
legal  actors  to  imagine  the  ownership  of  intangible 
objects,  property  was  limited  to  material  things.  As 
long  as  property  was  conceived  of  as  a  unitary 
right,  concentrated  in  the  hands  of  a  single  owner, 
legal  ownership  mechanisms  remained  basic  and 
did  not  allow  the  ownership  of  intangible  or  future 
things.  Today,  the  ownership  of  any  object  is 
legally  imaginable;  what  might  be  difficult  to 
imagine  are  the  limits  of  this  proprietarization 
trend.  For  some,  this  might  be  good  news,  and 
they  may  welcome  the  advent  of  a  broad  and  effi¬ 
cient  private  property  rights  system.  For  others, 
this  might  be  unfortunate  and  may  require  the 
design  of  a  new  dogmatic  framework  to  stop  the 
expansion  of  ownership.  A  third  position  may  be 
that  if  the  legal  imagination  has  largely  played  its 
role  these  past  few  years  in  inventing  original  own¬ 
ership  mechanisms,  it  may  work  to  conceive  and 
establish  a  large  number  of  human  activities  out¬ 
side  the  legal  preserve  of  property. 

Mikhail  Dorel  Xifaras 
Sciences  Po 
Paris,  France 
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This  entry  discusses  the  most  prominent  psycho¬ 
logical  theory  of  decision  making  under  conditions 
of  risk,  called  prospect  theory.  It  was  developed  by 
the  psychologists  Daniel  Kahneman  and  Amos 
Tversky  to  explain  decision  making  under  condi¬ 
tions  of  risk,  originally  published  in  1979  in 
Econometrica.  Kahneman  won  the  Nobel  Prize 
in  Economics  in  2002  for  the  work  he  conducted  in 
this  area  with  Tversky,  who  died  in  1996.  This 
work  represents  the  apex  of  models  of  cognitive 
psychology  in  both  timing  and  importance.  The 
model  has  been  imported  into  a  number  of  fields 
and  has  been  used  to  analyze  various  aspects  of 


political  decision  making,  especially  in  interna¬ 
tional  relations. 

Prospect  theory  was  developed  in  explicit  oppo¬ 
sition  to  theories  of  rational  choice  based  on  more 
normative  assumptions.  The  intuition  that  guided 
the  original  work  grew  out  of  insights  into  the 
nature  of  bounded  rationality  and  a  desire  to 
explore  the  nature  and  limits  of  those  constraints. 
Prospect  theory  was  based  on  a  series  of  experi¬ 
mental  empirical  demonstrations  of  actual  human 
choice  behavior;  it  was  explicitly  developed  to 
present  a  descriptively  accurate  model  of  human 
decision  making.  Nevertheless,  the  domain  in 
which  prospect  theory  explored  human  decision 
making  was  primarily  based  on  choices  among  a 
series  of  financial  bets  and  gambles.  It  was  not 
originally  intended  to  provide  wider  generalization 
beyond  that  domain,  although  it  has  subsequently 
been  invoked  to  explain  a  wide  variety  of  phenom¬ 
ena  across  many  fields,  including  law,  political 
science,  and  economics. 

Prospect  theory  states  that  decision  making 
depends  on  choosing  among  options  that  may 
themselves  rest  on  biased  judgments.  Thus,  it  built 
on  earlier  work  conducted  by  Kaheman  and 
Tversky  on  judgmental  heuristics  and  the  biases 
that  can  accompany  assessments  of  frequency  and 
probability.  Such  judgments  involve  evaluations  of 
the  external  world;  decisions  involve  more  funda¬ 
mentally  internal  choices  across  values.  Thus,  the 
essence  of  decision  making  involves  a  trade-off 
between  values.  Prospect  theory  encompasses  two 
distinct  phases:  (1)  an  editing  phase  and  (2)  an 
evaluation  phase. 

The  editing  phase  refers  to  the  way  in  which 
individuals  characterize  options  for  choice.  Most 
frequently,  these  are  referred  to  as  framing  effects. 
Framing  effects  demonstrate  the  way  in  which, 
contrary  to  the  axiomatic  assumptions  of  expected 
utility  models,  which  argue  that  choice  should 
remain  invariant,  the  substance  of  people’s  choice 
can  be  affected  by  the  order,  method,  or  wording  in 
which  it  is  presented.  The  classic  demonstration  of 
this  effect  took  place  in  the  so-called  Asian  flu 
paradigm,  in  which  people  were  asked  to  make  a 
choice  among  public  policy  plans  for  responding  to 
an  endemic  disease.  Although  the  actual  statistical 
probabilities  remained  identical,  the  percentage  of 
people  supporting  a  given  plan  changed  dramati¬ 
cally  based  on  whether  or  not  the  outcomes  were 
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presented  in  terms  of  the  number  of  people  who 
would  live  versus  the  number  of  people  who  would 
die.  In  perhaps  the  most  dramatic  demonstration  of 
this  effect,  real-world  patients  suffering  from  can¬ 
cer  made  different  choices  of  whether  to  undergo 
surgery  or  chemotherapy  for  treatment  of  their  ill¬ 
ness  based  on  whether  the  outcome  percentages 
were  presented  in  terms  of  survival  or  mortality. 
Once  people  are  presented  with  both  choices  side 
by  side,  they  can  easily  see  that  the  substance  of  the 
decision  remains  the  same,  even  if  the  psychic  pull 
to  perceive  them  differently  remains. 

Once  choices  are  framed  for  decision,  prospect 
theory  enters  its  second,  evaluation  phase.  This 
phase  involves  two  component  elements,  similar  in 
function,  but  not  identical  in  conception,  to  the 
notions  of  utility  and  probability  in  expected  util¬ 
ity  models.  The  first  element  is  graphically  repre¬ 
sented  by  the  value  function.  This  function  differs 
from  standard  normative  models  by  including  a 
left-hand  side  to  the  graph  that  represents  how 
people  respond  to  loss.  In  this  way,  prospect  the¬ 
ory  differs  from  standard  economic  models  that 
always  encourage  prudence  as  the  better  part  of 
valor.  Regardless  of  the  normative  imprimatur  of 
such  advice,  this  does  not  accurately  characterize 
how  most  people  make  decisions.  There  are  three 
important  aspects  to  the  value  function  that  effec¬ 
tively  distinguish  it  from  expected  utility  in  par¬ 
ticular.  First,  the  model  examines  the  way  in  which 
value  is  related  to  the  original  reference  point,  or 
the  start  of  action  or  choice.  In  most  situations, 
this  reference  point  is  assumed  to  refer  to  the  per¬ 
son’s  current  status  quo  position,  but  this  is  not 
necessarily  required  within  the  confines  of  the 
model.  Rather,  the  operative  reference  point  can 
be  defined  by  some  future  level  of  aspiration  or 
some  kind  of  social  comparison.  But  the  key 
insight  of  the  model  is  that  the  hedonic  value  of 
choice  options  is  assessed  by  the  way  in  which 
people  evaluate  change,  movement,  distance,  or 
difference  between  where  they  are,  or  where  they 
want  to  be,  and  the  outcome  offered  by  a  particu¬ 
lar  choice.  In  other  words,  relative  outcomes  mat¬ 
ter  more  than  absolute  outcomes.  This  intuition 
clearly  reflects  the  realist  world  of  international 
relations  over,  for  example,  the  more  cooperative 
image  supplied  by  neoliberal  models. 

The  second  important  insight  offered  by  the 
value  function  relates  to  the  central  prediction  of 


the  model.  People  tend  to  be  more  risk-averse 
when  in  a  domain  of  gains,  where  things  are  going 
well  and  appear  to  be  likely  to  continue  to  improve 
or  where  actors  confront  primarily  opportunities 
for  gains.  Simultaneously,  actors  tend  to  be  much 
more  risk  seeking  in  the  realm  of  losses,  where  they 
are  much  more  likely  to  take  risks  in  order  to 
recoup  previous  losses  or  to  recover  from  a  loss  in 
order  to  revert  to  a  previous  position. 

The  last  important  element  of  the  value  function 
recognizes  that  losses  hurt  more  than  equal  gains 
please.  Loss  aversion  has  indeed  become  the  most 
robust  finding  in  the  entire  model.  In  general, 
people  have  to  be  offered  about  two  and  a  half 
times  as  much  as  a  loss  in  order  to  prove  willing  to 
take  a  risk  for  the  chance  of  a  gain. 

The  second  element  of  the  evaluation  phase  is 
characterized  by  the  weighting  function.  This  func¬ 
tion  contains  two  critical  insights.  First,  people 
treat  outcomes  that  are  deemed  to  be  either  certain 
or  impossible  very  differently  than  those  whose 
changes  take  place  in  the  midrange  of  probability. 
In  other  words,  people  simply  assign  more  psycho¬ 
logical  weight  and  importance  to  outcomes  that 
they  can  characterize  with  greater  certainty.  While 
this  is  not  justifiable  from  a  normative  perspective, 
most  people  treat  quite  unlikely  events  as  though 
they  were  impossible  and  quite  likely  events  as 
though  they  were  certain  to  occur.  Second,  people 
tend  to  overweight,  or  attribute  more  importance 
than  normatively  justified,  to  low-probability 
events.  They  simultaneously  apply  less  psychologi¬ 
cal  weight  to  medium-  and  high-probability  out¬ 
comes  than  are  normatively  warranted. 

The  interaction  of  the  value  function  and  the 
weighting  function  lead  to  some  very  interesting 
and  counterintuitive  explanations  and  predictions 
for  phenomena  such  as  insurance  (taking  a  sure  loss 
against  the  small  possibility  of  a  larger  loss)  and  lot¬ 
teries  (taking  a  sure  loss  against  the  even  smaller 
possibility  of  a  large  gain).  Because  people  over¬ 
weight  small-probability  events,  the  main  prediction 
of  prospect  theory  reverses  close  to  the  reference 
point  as  individuals  become  risk  seeking  in  gains 
(lotteries)  and  risk-averse  in  losses  (insurance). 

Applications  in  Political  Science 

Prospect  theory  has  been  applied  to  a  number  of 
cases  in  political  science,  particularly  in  the  area  of 
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international  relations,  and  also  in  the  realm  of 
comparative  politics.  In  international  relations,  the 
theory  has  been  invoked  to  try  to  explain  decision 
making  involving  seemingly  irrational  risks,  such 
as  continuing  to  engage  in  sunk  costs.  In  this  way, 
prospect  theory  has  been  invoked  to  help  explain 
the  rescue  mission  of  the  hostages  in  Iran  and  the 
Cuban  missile  crisis,  among  other  cases.  In  addi¬ 
tion,  in  comparative  politics,  it  has  been  used  to 
explain  the  nature  of  risk  in  economic  restructur¬ 
ing  in  Latin  American  countries. 

The  primary  criticisms  of  prospect  theory  in 
political  science  regard  the  failure  to  provide  an 
adequate  theory  of  framing  that  explains  how  and 
why  actors  generate,  seek,  and  employ  the  frames 
they  use.  Additional  concerns  relate  to  the  external 
validity  of  the  original  experimental  paradigm. 
This  point  would  certainly  find  agreement  among 
the  original  designers;  Kahneman,  at  least,  did  not 
intend  the  work  to  be  applied  more  broadly  outside 
the  narrow  economic  domain  in  which  it  was 
originally  investigated.  More  recent  critiques  within 
psychology  have  focused  on  the  way  in  which  pros¬ 
pect  theory  provided  a  limited  representation  of 
human  emotional  and  affective  responses.  Kahne- 
man’s  more  recent  work  has  delved  more  heavily 
into  the  more  hedonic  aspects  of  human  decision 
making,  emphasizing  well-being  and  happiness. 

Some  work  has  attempted  to  compare  the  pre¬ 
dictions  offered  by  rational  choice  and  prospect 
theory  models  to  anticipate  the  outcome  of  real- 
world  political  events,  such  as  the  outcome  of  the 
Northern  Ireland  Good  Friday  peace  accords. 
Other  recent  work  has  attempted  to  synthesize  and 
integrate  the  more  cognitive  approach  advocated  by 
prospect  theory  with  a  more  evolutionary  under¬ 
standing  of  the  sources  of  risk-variant  preferences. 
Based  on  risk  optimal  foraging  theory,  subsistence 
goals  may  indeed  be  maximized  by  pursuing  a  strat¬ 
egy  that  prioritizes  risk  seeking  under  conditions  of 
threat  to  survival  in  particular. 
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Protectionism 


Protectionism  refers  to  trade  policies  that  attempt 
to  restrict  the  inflow  of  foreign  goods  into  a 
domestic  market.  These  policies  generally  fall  into 
two  categories:  tariff  and  nontariff  barriers  (NTBs) 
to  trade.  Protectionism  diverges  from  the  logic  of 
maximizing  economic  efficiency  characteristic  of 
classical  economic  liberalism,  instead  focusing  on 
advancing  individual  national  interests  more  char¬ 
acteristic  of  classic  mercantilism. 

Historically,  protectionism  has  been  the  norm 
in  the  international  political  economy.  Most  early- 
modern  European  states  followed  the  principles  of 
mercantilism,  starting  in  the  16th  century  by 
attempting  to  maintain  positive  balances  of  trade 
with  their  trading  partners,  most  readily  reflected 
in  their  gold  reserves.  A  large  stockpile  of  gold 
meant  that  the  state  was  a  winner,  while  others 
were  losers  in  this  zero-sum  approach  to  trade  and 
national  power.  Great  Britain’s  repeal  of  the  Corn 
Laws  in  1846  is  often  seen  as  the  first  significant 
step  toward  free  trade.  Thereafter,  protectionist 
policies  waxed  and  waned,  often  in  response  to 
economic  crises  such  as  the  Great  Depression  or 
the  need  by  late  industrializers  to  catch  up  in  the 
development  of  their  economies.  The  use  of  tariffs 
declined  significantly  after  World  War  II,  as  the 
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United  States  and  its  allies  launched  the  General 
Agreement  on  Tariffs  and  Trade  (GATT)  treaty 
framework,  which  would  eventually  become  the 
World  Trade  Organization  (WTO).  The  GATT 
and  WTO  have  successfully  reduced  tariffs  in  most 
areas  of  trade  and  now  concentrate  on  combating 
the  rise  of  nontariff  barriers  to  trade. 

Theories  of  Protectionism 

Most  theories  of  protectionism  focus  on  the 
demand  for  protection  raised  by  pressure  groups. 
In  pressure  group  models,  groups  seek  protection¬ 
ism  from  the  state  to  increase  their  incomes.  The 
task  for  scholars  has  been  to  try  to  explain  more 
specifically  which  groups  will  seek  protection  and 
under  what  conditions  they  will  be  successful  in 
getting  it.  The  theoretical  basis  for  the  source  of 
trade  preferences  is  usually  traced  to  one  of  two 
models:  Heckscher-Ohlin  or  Ricardo-Viner.  The 
Heckscher-Ohlin  model  argues  that  factors  of  pro¬ 
duction  (land,  labor,  capital)  have  very  low  speci¬ 
ficity — thus,  they  move  freely  among  sectors.  If  a 
country  moves  from  free  trade  to  protectionism, 
the  income  of  factors  in  which  a  country  is  rela¬ 
tively  scarce  will  rise,  while  the  income  will  decline 
for  factors  in  which  a  country  is  relatively  abun¬ 
dant.  According  to  the  Stolper-Samuelson  theorem, 
scarce  factors  will  then  demand  protectionism, 
while  abundant  factors  will  oppose  it.  The  Ricardo- 
Viner  model  posits  that  factor  specificity  is  very 
high,  thus  factor  income  is  industry  or  sector  spe¬ 
cific.  The  factors  involved  in  import-competing 
sectors  will  therefore  lose  from  free  trade,  while 
those  attached  to  export-oriented  sectors  will  gain 
income  from  free  trade.  Unfortunately,  neither 
model  explains  how  trade  policy  preferences  are 
actually  translated  into  political  action. 

Pressure  group  models  require  more  than  simi¬ 
lar  interests  in  increasing  income  to  overcome  the 
collective  action  problem  associated  with  securing 
protectionism.  If  collective  action  were  not  a  prob¬ 
lem,  then  consumers  who  largely  gain  from  free 
trade  should  easily  organize  to  defeat  any  move¬ 
ment  toward  protectionism.  Olson  has  suggested 
that  small  groups  with  specialized  interests  are 
easier  to  organize  to  secure  rents  from  the  state 
than  large  groups  with  diffuse  interests,  such  as 
consumers.  Geographic  concentration  and  firm 
concentration  are  sometimes  seen  as  useful  proxies 


for  collective-action  costs,  since  spatial  proximity 
may  reduce  the  costs  of  organizing  to  secure 
protectionism. 

The  supply  of  protectionism  from  the  state  is 
also  a  crucial  piece  of  the  puzzle,  since  it  may  help 
identify  the  conditions  under  which  pressure 
groups  may  be  successful  in  obtaining  protection. 
Institutional  features  of  the  state  along  with  poli¬ 
cymaker  preferences  also  affect  the  supply  of  pro¬ 
tectionism.  Studies  have  shown  that  policymakers’ 
beliefs  or  ideas  are  critical  in  determining  whether 
they  will  pursue  protectionist  policies.  Others  have 
shown  that  the  state  of  the  economy  is  critical  in 
influencing  the  supply  of  protection  from  policy¬ 
makers.  A  number  of  institutional  aspects  of  the 
state  may  also  influence  the  supply  of  protection¬ 
ism,  including  the  size  of  electoral  districts,  whether 
a  country  has  a  proportional  representation  or  a 
majoritarian  system,  and  whether  a  country  has  a 
parliamentary  or  a  presidential  system.  Large  elec¬ 
toral  districts,  proportional  representation,  and 
parliamentary  systems  are  thought  to  insulate 
policymakers  from  protectionist  pressures.  The 
nature  of  the  party  system  may  affect  changes  in 
the  status  quo  on  trade  policy  as  countries  with 
highly  fragmented  or  polarized  party  systems  may 
be  unable  to  initiate  trade  reform.  Protectionism 
may  also  be  more  likely  in  presidential  systems 
characterized  by  divided  government — when  one 
party  controls  the  legislature  and  another,  the 
executive.  The  administrative  capacity  of  a  state 
may  affect  its  proclivity  toward  protectionism,  as 
developing  countries  with  poor  capacity  to  extract 
direct  taxes  from  their  populations  often  rely  heav¬ 
ily  on  trade  taxes,  which  are  much  easier  to  collect 
from  limited  ports  of  entry.  Some  have  also  argued 
that  democratic  and  autocratic  regimes  may  have 
different  proclivities  for  protectionism,  though 
empirical  studies  are  mixed. 

Adherents  of  strategic  trade  theory  argue  that 
the  state  need  not  accept  the  dictates  of  compara¬ 
tive  advantage  under  conditions  of  free  trade. 
States  might  try  to  manipulate  comparative  advan¬ 
tage  to  produce  a  competitive  advantage  in  a  cer¬ 
tain  industry.  This  could  involve  the  formulation  of 
an  industrial  policy  and  the  use  of  protectionism  to 
support  domestic  producers  in  an  imperfect  market 
such  that  they  will  reap  the  benefits  of  production 
and  yield  spillover  effects  in  the  domestic  economy. 
This  type  of  policy  falls  under  the  general  category 
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of  neo-mercantilism,  with  the  state  taking  an 
active  role  in  selecting  industries,  often  so-called 
infant  industries,  that  need  protection  as  they 
mature  to  compete  in  global  markets.  Many  devel¬ 
oping  countries  experimented  with  import  substi¬ 
tution  industrialization  (ISI).  The  logic  was  to 
produce  a  domestic  industry  in  some  sector  where 
one  did  not  previously  exist  in  any  meaningful 
way.  The  main  problem  faced  by  these  neo¬ 
mercantilist  approaches  is  that  it  is  often  hard  for 
a  state  to  know  how  to  pick  an  industry  that  will 
eventually  be  competitive.  Many  ISI  ventures 
ended  in  mounting  government  debt  and  ineffi¬ 
cient  industries  that  never  could  compete  on  the 
world  market.  Once  protectionism  is  supplied  to 
support  such  ventures,  the  demand  for  them  also 
tends  to  be  locked  in  place,  since  they  will  be  reluc¬ 
tant  to  relinquish  the  economic  benefits. 

Types  of  Protectionism 

Despite  the  decline  in  the  number  and  costs  of  tar¬ 
iffs  due  to  multilateral  efforts  organized  under  the 
(i ATT/ WTO,  a  number  of  other  forms  of  NTB 
protectionism  remain,  including  subsidies,  quotas, 
exchange  rate  manipulation,  and  labor,  health, 
and  quality  control  protections.  Despite  the  gen¬ 
eral  reduction  in  tariffs,  countries  are  often  still 
able  to  enact  policies  to  protect  their  own  nation’s 
industries  from  foreign  competition. 

Tariffs 

Tariffs  are  the  most  common  and  best  known 
form  of  protectionism.  A  tariff  artificially  increases 
the  cost  of  a  good.  The  most  common  form  of 
tariff  is  an  import  tariff.  Import  tariffs  put  an  addi¬ 
tional  price,  either  a  percentage  of  the  good’s  value 
or  an  absolute  amount,  on  foreign  imports.  This 
increases  the  costs  of  foreign  goods  and  provides 
domestic  firms  with  an  advantage  in  cost  produc¬ 
tion.  Since  domestic  products  do  not  face  this  tax, 
they  are  able  to  sell  products  at  a  lower  price  and 
for  more  profit  than  foreign  competitors. 

Subsidies 

Subsidies  can  also  be  used  as  a  form  of  protec¬ 
tionism.  The  two  basic  forms  are  direct  or  indirect 
subsidies.  Direct  subsidies  are  payments  from  the 


government  directly  to  producers.  Indirect  subsi¬ 
dies  include  government  price  guarantees,  tax 
exemptions,  and  other  forms  of  payment  that  are 
not  direct  subsidies.  This  allows  producers  to  sell 
products  at  a  lower  price  than  foreign  competitors, 
making  them  artificially  competitive. 

Quotas 

Quotas  are  quantitative  limits  to  the  number  of 
items  that  are  permitted  to  be  imported  by  a  coun¬ 
try.  Quotas  can  be  compulsory  or  voluntary. 
Compulsory  quotas  are  enforced  by  government 
officials.  Actors  who  attempt  to  exceed  the  quota 
are  smugglers.  Voluntary  quotas  are  agreements 
between  a  government  and  foreign  manufacturers 
of  a  specific  number  of  goods  that  are  to  be  sold 
within  a  country.  Foreign  producers  may  agree  to 
this  in  order  to  extract  large  rents  from  the  limited 
quantity  of  goods  or  in  order  to  have  access  to  the 
market  for  fear  of  even  greater  protectionist  poli¬ 
cies.  Domestic  interests  may  agree  to  quotas  to 
limit  the  overall  competition.  Voluntary  quotas  are 
sometimes  known  as  voluntary  export  restraints 
(VERs). 

Exchange  Rate  Manipulation 

Countries  that  do  not  have  floating  currencies 
can  artificially  depress  or  inflate  the  value  of  their 
currency.  Long-term  attempts  to  depress  currency 
make  it  more  difficult  for  foreign  competitors  to 
compete  with  the  low  costs  of  domestic  goods 
while  at  the  same  time  increasing  the  costs  of  for¬ 
eign  goods. 

Labor,  Health,  and  Quality  Controls 

Labor,  health,  and  quality  controls  can  also  be 
used  as  a  form  of  protectionism.  These  standards 
are  sometimes  used  as  ways  to  restrict  foreign 
competition  from  entering  a  domestic  market. 
Claims  of  mistreatment  of  labor  can  be  used  by 
domestic  labor  groups  to  pressure  the  government 
to  restrict  the  importation  of  goods.  Health  stan¬ 
dards  may  be  employed  even  when  there  is  dubi¬ 
ous  scientific  risk  associated  with  the  imported 
products.  Quality  control  measures  can  be  simi¬ 
larly  employed.  In  addition,  quality  control  units 
may  be  understaffed,  protracting  the  length  of  time 
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between  a  product’s  arrival  and  its  sale.  While 
each  of  these  reasons  can  be  used  by  countries  in  a 
justifiable  manner,  it  can  often  be  difficult  to  parse 
out  the  difference  between  such  instances  and  the 
occasions  when  states  create  artificially  high  stan¬ 
dards  or  requirements  to  restrict  or  eliminate  for¬ 
eign  competition. 

Many  of  these  different  types  of  protectionism 
are  quite  controversial.  As  was  stated  with  labor, 
health,  and  quality  controls,  it  can  sometimes  be 
quite  difficult  to  differentiate  between  malicious 
and  virtuous  intentions.  Furthermore,  different 
countries  have  differing  values;  standards  or  assis¬ 
tance  at  a  level  that  is  acceptable  in  one  country 
may  be  perceived  as  clearly  subversive  among 
trade  partners.  As  such,  each  of  these  is  a  major 
point  of  contention  in  trade  negotiations. 

Last,  differing  types  of  protectionism  may  be 
employed  in  different  sectors.  Nearly  all  countries 
limit  foreign  competition  in  areas  of  national  inter¬ 
est.  What  those  national  interests  are  varies  by 
country.  These  can  include,  but  are  not  limited  to, 
areas  related  to  national  defense,  agriculture, 
resource  extraction,  and  others. 
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Psychological  Explanations 
of  International  Politics 


Psychological  explanations  of  international  poli¬ 
tics  focus  on  the  impact  of  cognition  and  emotion 
on  choice.  Through  the  analysis  of  decision  mak¬ 
ing,  political  psychologists  have  explored  a  wide 
range  of  topics  that  are  central  to  international 
politics:  the  onset  of  war,  nuclear  strategy,  deter¬ 
rence  and  reassurance,  signaling  and  bargaining, 
conflict  management  and  conflict  resolution,  and 
peace.  Recent  neuroscientific  research  on  emotion 
is  changing  the  analysis  of  foreign  policy  decision 
making  and  rational  choice.  This  entry  looks  first 
at  psychological  theories  of  decision  making. 
Second,  it  examines  the  consequences  of  new 
understandings  of  decision  making  for  strategic 
models  of  interaction.  Third,  the  entry  looks  at 
the  contribution  of  psychological  explanations  to 
the  analysis  of  collective  beliefs  and  moods  on 
international  political  issues. 

Cognition  and  Emotion  in  Decision  Making 

The  Cognitive  Revolution 

Forty  years  ago,  psychologists  started  a  “cogni¬ 
tive  revolution”  as  they  rejected  simple  behaviorist 
models  and  looked  again  at  the  way  cognition 
shaped  the  choices  people  make.  Although  this 
was  not  its  original  purpose,  the  cognitive  revolu¬ 
tion  can  be  understood  largely  as  a  commentary  on 
the  limits  of  rational  choice.  Much  of  the  early 
work  accepted  rational  choice  as  the  default  posi¬ 
tion  and  then  demonstrated  its  boundaries. 
Research  has  now  cumulated  to  show  that  people 
rarely  conform  to  the  expectations  of  the  rational 
model  and  that  these  deviations  are  not  random 
but  systematic. 

Cognitive  psychology  has  demonstrated  impor¬ 
tant  differences  between  the  expectations  of  rational 
decision  models  and  the  processes  of  attribution, 
estimation,  judgment,  and  choice  that  people 
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frequently  use.  It  explains  these  differences  by  the 
need  for  simple  rules  of  information  processing 
and  judgment  to  make  sense  of  environments  that 
are  both  uncertain  and  complex.  People  have  a 
preference  for  simplicity;  they  are  averse  to  ambi¬ 
guity  and  dissonance;  and  they  misunderstand 
fundamentally  the  essence  of  probability.  And 
people  have  risk  profiles  that  depart  from  what 
models  of  rational  choice  would  expect;  according 
to  Daniel  Kahneman  and  Amos  Tversky,  we  are 
far  more  averse  to  loss  than  we  are  to  gain-seeking. 
Together,  these  attributes  compromise  the  capacity 
for  rational  choice. 

Robert  Jervis  posits  that  political  leaders  mak¬ 
ing  decisions  about  the  world  need  to  order  their 
world  and  make  a  very  complex  world  somewhat 
simpler.  When  they  look  to  the  past  to  learn  about 
the  future,  political  leaders  tend  to  draw  simple 
one-to-one  analogies  without  qualifying  condi¬ 
tions.  In  1991,  President  George  H.  W.  Bush  called 
Saddam  Hussein  “another  Hitler,”  with  little 
attention  to  what  was  different  either  about  the 
two  men  or  about  Iraq  in  1990  and  Germany  in 
1938.  Yet  fitting  Saddam  into  an  existing  frame 
through  the  use  of  analogical  reasoning  gave  the 
president  a  readily  accessible  script  about  how  to 
respond  to  Iraq’s  invasion  of  Kuwait. 

Cognitive  psychologists  have  produced  robust 
evidence  that  people  strongly  prefer  consistency, 
that  they  are  made  uncomfortable  by  dissonant 
information,  and  that  they  consequently  deny  or 
discount  information  that  is  inconsistent  with 
their  beliefs  to  preserve  them.  This  drive  for  con¬ 
sistency  impairs  the  processes  of  estimation  and 
judgment.  The  well-established  tendency  to  dis¬ 
count  inconsistent  information  contributes  signifi¬ 
cantly  to  the  persistence  of  beliefs.  Political  leaders 
in  the  United  States  were  generally  resistant  to 
changing  their  beliefs  about  the  Soviet  Union  after 
Mikhail  Gorbachev  came  to  power  with  a  reform 
agenda.  Three  years  after  he  became  the  general 
secretary,  senior  policymakers  were  arguing  that 
his  strategy  was  to  lull  the  West  while  the  Soviet 
Union  recovered. 

Cognitive  processes  of  attribution  can  also  con¬ 
found  policy  making.  One  of  the  most  pervasive 
biases  is  the  fundamental  attribution  error,  where 
people  exaggerate  the  importance  of  dispositional 
over  situational  factors  in  explaining  the  disliked 
behavior  of  others  but  explain  their  own  behavior 


by  the  situational  constraints  that  they  face.  When 
explaining  behavior  that  they  like,  they  simply 
reverse  the  pattern  of  inference.  When  the  govern¬ 
ment  of  North  Korea  makes  a  concession  in  the 
talks  about  its  nuclear  program,  leaders  in 
Washington  see  that  concession  as  a  function  of 
the  situational  constraints  Pyongyang  faces  but 
explain  their  own  willingness  to  participate  in  the 
talks  as  evidence  of  their  disposition  to  search  for 
a  peaceful  compromise.  The  double  standard  in 
reasoning  is  clear. 

Cognitive  psychology  has  generated  robust  evi¬ 
dence  that  loss  is  more  painful  than  comparable 
gain  is  pleasant.  Daniel  Kahneman  and  Amos 
Tversky  found  that  leaders  tend  to  be  risk-averse 
when  things  are  going  well  and  relatively  risk 
acceptant  when  things  are  going  badly,  when  they 
face  a  crisis  in  which  they  are  likely  to  lose  or  have 
lost  something  that  matters  to  them.  Leaders  are 
also  likely  to  take  greater  risk  to  protect  what  they 
already  have — the  endowment  effect — than  to 
increase  their  gains.  They  are  also  likely  to  take 
greater  risk  to  reverse  losses,  to  recapture  what 
they  once  held,  than  they  would  to  make  new 
gains.  And  when  decision  makers  suffer  a  signifi¬ 
cant  loss,  they  are  far  slower  to  adjust  to  these 
losses  than  they  would  be  to  incorporate  gains. 
These  general  findings  apply  directly  to  foreign 
policy  choices.  President  Sadat  of  Egypt,  for  exam¬ 
ple,  never  “normalized”  for  the  loss  of  the  Sinai  to 
Israel  in  1967.  Even  though  Israel  had  an  obvious 
advantage  in  military  capabilities,  Sadat  was  unde¬ 
terred  and  highly  motivated  to  recapture  the  Sinai; 
thus,  he  launched  a  high-risk  war  in  1973,  which 
was  designed  to  begin  a  political  process  of  nego¬ 
tiation  with  Israel  that  would  restore  the  Sinai  to 
Egypt. 

Emotion,  Cognition,  and  Choice 

New  research  in  neuroscience  is  revolutionizing 
the  understanding  of  the  relationship  between 
emotion,  cognition,  and  decision.  Two  results 
stand  out.  First,  many  decisions  seem  to  be  the 
result  not  of  a  deliberative  thought  process  but  of 
preconscious  neurological  processes.  Second,  emo¬ 
tion  is  primary  and  plays  a  dominant  role  in 
choice. 

Emotions  are  adaptive  programs  of  action  that 
have  evolved  over  time  to  ensure  survival  and  then 
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reproduction.  A  useful  way  of  thinking  about  emo¬ 
tion  and  cognition  is  to  see  affective  processes  as 
those  that  address  the  go/no-go  questions,  while 
cognitive  processes  are  those  that  answer  true/false 
questions.  Choice  clearly  invokes  both  kinds  of 
processes.  Establishing  truth  claims  about  states  of 
the  world  is  usually  not  enough  for  people  to  make 
a  choice.  What  people  value  is  an  emotional  as  well 
as  a  cognitive  process,  and  both  are  important  in 
whether  they  decide  to  approach  or  avoid  making 
a  choice.  Hence,  emotion  carries  utility.  The  new 
field  of  neuro-economics  is  beginning  to  conceive 
utility  as  something  one  experiences  subjectively. 

There  is  an  ongoing  and  vigorous  debate  among 
psychologists,  neuroscientists,  and  behavioral 
economists  about  the  relationship  between  affect 
and  cognition.  Three  approaches  are  particularly 
relevant.  The  first  is  the  somatic  marker  hypothe¬ 
sis,  the  second  is  the  affect-as-appraisal  hypothesis, 
and  the  third  is  affect-as-information.  Each  under¬ 
stands  the  relationship  between  emotion  and  cog¬ 
nition  differently. 

The  first  argues  that  the  information  people 
receive  through  their  senses,  which  they  experience 
physiologically,  creates  emotions  that  then  cue 
decision  and  action.  These  physiological  experi¬ 
ences,  or  somatic  markers,  create  learned  emo¬ 
tional  responses  that  allow  people  to  decide  quickly 
what  should  be  approached  and  what  should  be 
avoided,  notes  Antonio  Damasio.  In  the  early 
stages  of  decision  making,  emotions  are  primary. 

The  appraisal  approach  to  emotion  is  quite  dif¬ 
ferent.  It  reflects  the  long-standing  view  that  cogni¬ 
tion  precedes  emotion.  It  is  thoughts  that  evoke 
feelings.  The  cognitive  process  of  appraising — a 
treaty,  a  leader,  a  new  institution — as  good  or  bad 
elicits  an  emotion.  This  approach  is  inconsistent 
with  that  of  neuroscientists  who  see  emotion  as 
primary  and  as  the  carrier  of  value. 

A  third  approach,  presented  by  Gerald  Clore 
and  Karen  Gasper,  treats  affect  as  information. 
Emotions  carry  information  to  people  about  their 
unconscious  processes,  which  then  become  con¬ 
scious  thoughts  and  feelings  and  an  input  into  the 
decisions  they  face.  In  this  sense,  emotion  does  not 
follow  cognitive  appraisal  but  creates  appraisals 
through  the  information  it  provides. 

Affective  and  cognitive  processes  can  collabo¬ 
rate  or  compete.  At  low  levels  of  intensity,  affect 
appears  to  play  a  largely  “advisory”  role;  it 


provides  information  about  cognitive  processes.  At 
intermediate  levels  of  intensity,  people  begin  to 
become  conscious  of  a  conflict  between  cognitive 
and  affective  inputs  and  the  struggle  for  self- 
control.  At  high  levels  of  intensity,  affect  can  be  so 
powerful  that  it  short-circuits  thought  and  moves 
to  action.  Emotion  precedes  cognition  and  can 
swamp  cognition.  Only  after  the  fact  can  people 
reflect  on  the  choices  that  they  made. 

Emotion  and  Cognition  as  Explanations 
of  Strategic  Interaction 

Emotion  is  a  core  driver  in  theories  of  decision 
making,  with  significant  consequences  for  the 
understanding  of  foreign  policy  decision  making. 
Psychological  explanations  of  foreign  policy  choice 
have  been  extended  to  strategic  interaction  in 
international  politics.  The  credibility  of  signals,  an 
essential  component  in  theories  of  deterrence, 
compellence,  and  bargaining,  is  not  only  a  prop¬ 
erty  of  the  sender,  as  some  formal  models  of  sig¬ 
naling  suggest,  but  also  a  function  of  the  beliefs  of 
the  receiver,  notes  Jonathan  Mercer.  These  beliefs 
are  not  only  cognitive  but  emotional  as  well.  The 
emotional  cues  that  signals  evoke — such  as  fear 
and  anger — matter  insofar  as  these  emotions  then 
prompt  beliefs  and  action  in  turn.  Research  dem¬ 
onstrates  that  fear  prompts  uncertainty  and  risk- 
averse  action,  while  anger  prompts  certainty  and 
risk  acceptance.  Threats  that  evoke  fear  are  likely 
to  prompt  hesitancy  and  a  risk-averse  response; 
that  is  the  purpose  of  most  deterrent  threats. 
Frightening  threats  are  less  likely  to  be  successful, 
however,  when  they  are  designed  to  compel  adver¬ 
sarial  leaders  to  act.  And  threats  that  humiliate 
are  likely  to  evoke  anger  and  provoke  the  risk- 
acceptant  response  that  a  threat-based  strategy  is 
designed  to  avoid.  Threat-based  strategies  conse¬ 
quently  become  much  more  complex  and  danger¬ 
ous  to  design  and  implement. 

Research  also  demonstrates  that  credibility,  a 
fundamental  component  of  theories  of  action  in 
international  politics,  is  emotional  as  well  as  cog¬ 
nitive.  Credibility  is  not  simply  a  function  of  either 
the  cost  of  the  signal  or  past  behavior,  as  some 
theories  of  bargaining  claim.  It  is  an  emotional 
belief  that  is  held  by  its  intended  receiver.  Russia’s 
credibility  is  not  only  a  function  of  what  its  leaders 
say  and  do,  or  have  said  and  done,  but  what 
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Georgia’s  leaders  believe  Moscow  will  say  and  do. 
Psychological  explanations  call  into  question  repu¬ 
tational  models  based  exclusively  on  the  past 
behavior  of  states  or  on  the  costliness  of  signals. 
They  give  theoretical  weight  to  the  pattern  of  infer¬ 
ence  leaders  make  and  build  in  emotion  as  the 
primary  driver  of  assessment  in  the  early  stages  of 
decision  making. 

Emotion,  Cognition,  and  Collective  Beliefs 

Psychological  models  explain  not  only  leaders’ 
choices  and  the  interactive  sequencing  of  these 
choices  across  a  range  of  domains  in  international 
politics  but  also  collective  beliefs  and  moods. 
Epidemiological  and  viral  models  help  explain  the 
diffusion  of  emotion  from  an  individual  to  a  larger 
group.  The  spread  of  emotions  from  one  individual 
to  another  is  similar  to  other  contagious  processes. 

Trust  and  confidence  are  central  to  workings  of 
the  international  order.  Confidence  and  trust  are 
emotional  states,  an  indicator  of  optimism  about 
the  future.  They  are  also  cognitive,  one  person’s 
sense  of  how  confident  others  are  and  their  percep¬ 
tions  of  how  confident  still  others  are.  An  indi¬ 
vidual’s  mood  is  in  part  a  function  of  the  mood  of 
others,  and  in  this  sense,  it  is  as  reasonable  to 
speak  of  a  collective  mood  as  it  is  to  speak  of 
shared  norms.  Psychological  models  speak  to  both 
collective  beliefs  and  collective  moods.  Emotions, 
political  psychologists  argue,  constitute  feelings 
and  moods  that  can  be  powerful  spurs  to  action  in 
international  politics  or  can  lead  to  panic  and 
withdrawal. 

Nationalism  is  an  emotional  belief,  an  attach¬ 
ment  to  one’s  society  that  is  evoked  by  emotional 
cues  and  expressed  through  emotional  identifica¬ 
tion  with  the  collective.  Mobilizational  appeals  to 
nationalist  loyalty  for  both  constructive  and 
destructive  purposes  are  emotional,  phrased  as 
“love  of  the  motherland  [or  fatherland].”  Slobodan 
Milosevic’s  appeal  to  Serb  nationalism,  for  exam¬ 
ple,  invoked  past  humiliations  and  grievances  in  an 
effort  to  mobilize  anger  and  support  for  action. 
The  onset  of  ethnic  war  in  the  former  Yugoslavia 
cannot  be  understood  without  referencing  the 
“emotional  entrepreneurship”  of  the  Serbian  polit¬ 
ical  leader. 

Psychological  models  explain  both  individual 
and  collective  behavior  in  international  politics. 


Research  in  neuroscience  is  bringing  emotion  back 
into  the  analysis  of  international  politics  across  a 
wide  variety  of  issues.  The  research  challenge  will 
be  to  connect  psychological  processes  to  political 
and  institutional  streams  and  to  identify  and 
explain  the  amplifying  feedback  loops. 
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Public  Budgeting 


Public  budgeting  allocates  public  resources.  This 
allocation  function  means  that  budgeting  lies  at 
the  very  heart  of  politics.  Who  gets  what  and  when 
from  the  government  in  financial  terms  is  what 
politicians  and  publics  alike  care  about.  It  is  dis¬ 
tinct  from  “private”  budgeting  in  several  respects. 
First,  in  all  but  one-person  dictatorships,  public 
budgeting  is  a  collective  decision.  The  processes 
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used  to  make  decisions  vary.  Second,  decision 
makers  do  not  spend  their  own  money.  They  must 
raise  these  funds  somehow.  The  most  common 
device  is  taxation.  Governments  may  also  make 
money  from  the  management  of  public  property, 
such  as  land  or  firms,  as  well  as  from  profits  that 
arise  from  the  sale  of  public  resources,  such  as 
copper  or  oil.  They  may  also  borrow  money  to 
fund  current  expenses.  Finally,  the  question  of 
whether  budgeted  funds  are  being  used  in  the  pub¬ 
lic  interest  is  an  ongoing  theme  that  a  private 
individual  or  firm  does  not  usually  confront. 

Decision  Making  in  Public  Budgeting 

The  fact  that  public  budgets  arise  from  a  collective 
decision  presents  several  challenges.  First,  consider 
the  lessons  from  the  formal  theory  literature  about 
the  multidimensionality  of  a  policy  space.  A  policy 
dimension,  to  be  clear,  is  simply  one  line  on  which 
one  can  rank  a  given  option.  The  traditional 
dimension  used  in  many  studies  is  a  simple  left- 
right  one,  but  in  budgeting,  one  can  imagine  many 
such  dimensions  that  correspond  to  spending  on  a 
given  subject.  One  person  may  want  the  govern¬ 
ment  to  spend  a  lot  on  health  care  but  only  a  little 
on  defense,  for  example.  The  formal  theory  litera¬ 
ture  tells  us  that  when  one  can  line  up  the  prefer¬ 
ences  of  actors  along  one  dimension,  there  is  a 
stable  outcome  that  arises  from  a  simple  majority 
vote.  When  one  has  multiple  dimensions,  however, 
several  outcomes  are  usually  possible.  Moreover, 
these  outcomes  are  not  “stable”;  that  is,  they  will 
lose  to  one  or  more  other  feasible  outcomes,  which 
in  turn  are  not  stable  either. 

All  this  means  that  the  rules  that  structure  the 
way  decisions  are  taken  on  the  budget  are  crucial. 
If  the  rule  is  to  stop  consideration  of  the  budget 
after  just  one  vote,  then  one  would  want  to  know 
who  gets  to  decide  the  content  of  that  vote.  This 
“agenda  setter”  will  have  the  ability  to  select  an 
outcome  closest  to  his  or  her  preferred  outcome 
even  though  there  may  be  other  outcomes  that  can 
beat  it  in  a  one-to-one  majority  vote.  To  move 
beyond  this  rather  abstract  discussion  to  practical 
implications,  a  student  of  public  budgeting  will 
want  to  know  who  has  the  right  to  propose  a  given 
budget,  what  the  rules  are  to  change  the  budget, 
how  high  the  threshold  is  to  pass  the  budget,  and 
what  happens  if  the  institutions  fail  to  pass  the 


budget.  For  example,  the  president  may  propose 
the  budget,  a  congress  will  then  consider  the  bud¬ 
get,  and  the  budget  may  then  have  to  be  passed 
with  only  a  simple  majority  to  become  law.  In  a 
country  such  as  Chile,  the  president  does  propose 
the  budget,  the  congress  does  not  have  the  right  to 
make  amendments  that  raise  aggregate  spending 
beyond  what  the  president  proposes,  it  takes  a 
simple  majority  in  congress  to  pass  the  budget,  and 
the  president’s  budget  simply  becomes  law  if  the 
congress  cannot  pass  it.  In  this  case,  the  president 
is  easily  the  most  important  player  in  the  budget 
process.  This  contrasts  with  a  country  such  as 
Paraguay,  where  the  congress  can  make  whatever 
amendments  it  wants  to  the  president’s  budget  and 
where  the  default  budget  is  the  previous  year’s  bud¬ 
get  rather  than  the  one  the  president  proposed.  In 
this  case,  the  congress  has  a  stronger  hand  to  play. 

The  decision-making  rules  have  additional 
importance  when  one  considers  that  government  is 
spending  not  its  money  but  someone  else’s  money. 
Assume  that  politicians  do  a  simple  cost-benefit 
analysis  for  the  people  who  support  them.  The 
politician  cares  about  the  benefits  of  additional 
spending  and  the  costs  of  additional  taxation  only 
with  respect  to  his  or  her  constituency.  Assume  as 
well  that  taxes  go  into  a  common  revenue  pool 
and  do  take  the  form  of  “user  fees”  so  that  there  is 
a  direct  correspondence  between  spending  and 
costs.  Under  such  conditions,  one  can  imagine  a 
situation  where  a  given  constituency  receives  the 
benefits  of  spending  but  pays  only  a  tiny  fraction 
of  the  costs.  For  example,  consider  an  agriculture 
minister  who  worries  only  about  the  interests  of 
farmers.  The  government  provision  of  subsidies  to 
the  farm  industry  may  equal  $1  billion,  but  the 
total  additional  tax  the  farmers  pay  for  this  is  only 
$10  million.  In  the  literature,  this  situation  is 
known  as  a  “common  pool  resource  problem.” 

Once  again,  the  decision-making  process  is  cru¬ 
cial.  If  the  agriculture  minister  determines  spend¬ 
ing  programs  for  farmers,  he  or  she  will  allocate 
more  spending  than  in  the  case  where  he  or  she 
considers  the  full  tax  implications  of  his  or  her 
decision.  If  the  budget  is  then  simply  the  sum  of 
the  requests  of  ministers  who  worry  only  about 
narrow  constituencies,  spending  will  be  higher 
than  the  ministers  themselves  would  want  if  they 
could  make  the  entire  budget  on  their  own.  One 
can  imagine  a  different  outcome  if  there  is  an  actor 
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in  the  process  whose  job  is  to  worry  about  the 
entire  tax  burden  and  not  just  the  burden  on  a 
specific  constituency.  In  practice,  this  person  is 
usually  the  finance  minister.  But  it  is  not  strictly 
necessary  to  give  all  budgetary  power  to  a  finance 
minister.  Negotiations  among  decision  makers  can 
also  lead  to  full  consideration  of  the  tax  burden  so 
long  as  they  do  not  take  the  form  of  “logrolls”  of 
different  proposals,  that  is,  so  long  as  the  parties 
involved  consider  the  full  burden  of  their  spending 
decisions  and  do  not  simply  vote  for  each  others’ 
proposals. 

One  technique  to  centralize  the  budget  process 
that  one  sees  increasingly  is  top-down  budgeting. 
The  idea  is  that  one  should  first  vote  on  a  total 
amount  of  spending  before  deciding  on  the  distri¬ 
bution  of  that  amount  according  to  different 
spending  categories.  This  is  in  contrast  to  bottom- 
up  budgeting,  where  one  votes  on  each  item  and 
then  simply  aggregates  them  all  to  get  the  total 
budget.  The  top-down  system,  for  example,  was 
instituted  in  the  Swedish  parliament  in  the  mid- 
1990s  for  all  budget  votes.  Whether  this  procedure 
really  leads  to  lower  spending  and — as  a  conse¬ 
quence — better  aggregate  fiscal  discipline  is  open  to 
question.  In  an  influential  theoretical  article,  John 
Ferejohn  and  Keith  Krehbiel  illustrate  that  the 
combination  of  preferences  among  the  relevant 
voters  decides  whether  the  top-down  or  the  bot¬ 
tom-up  process  leads  to  a  smaller  aggregate  budget. 

There  is  less  controversy  about  the  desirability 
of  budget  transparency.  When  it  is  easy  for  the 
public  to  follow  what  the  government  is  doing,  it 
is  difficult  for  policymakers  to  use  “tricks”  to 
make  figures  appear  rosier  than  they  really  are.  It 
is  also  easier  to  cut  spending  if  one  knows  about  it. 
James  Alt  and  David  Dryer  Lassen  indicate  that 
the  way  states  provide  information  on  the  budget, 
whether  there  is  independent  verification  of  budget 
figures,  whether  there  is  use  of  arbitrary  language, 
and  whether  there  is  required  text  that  justifies  dif¬ 
ferent  parts  of  the  budget,  are  connected  to  fiscal 
outcomes.  Countries  with  more  transparent  bud¬ 
get  systems  have  higher  levels  of  fiscal  discipline. 

Public  budgeting  also  involves  a  temporal  ele¬ 
ment  where  institutions  also  play  a  role.  For 
example,  it  is  common  for  politicians  to  promise 
immediate  spending  but  to  delay  the  “pain”  of  the 
costs.  One  way  to  do  this  is  through  public  bor¬ 
rowing.  The  government  in  this  case  runs  a  budget 


deficit  and  carries  debt  on  its  books  that  it  will 
have  to  repay  later.  Another  way  the  temporal  ele¬ 
ment  matters  is  when  a  benefit  is  promised  to  a 
specific  group  that  is  small  at  first  but  that  is  pro¬ 
jected  to  grow  over  time  so  that  the  costs  also 
grow.  Public  pensions  are  a  prime  example  of  this 
dynamic.  Many  countries  have  populations  that 
are  aging.  The  number  of  people  who  will  be 
entitled  to  a  public  pension  will  increase  while  the 
number  who  will  be  paying  into  a  given  system 
will  decline.  Such  a  scenario  is  one  where  the  pen¬ 
sion  system  is  designed  as  “pay  go,”  where  work¬ 
ers  pay  for  a  benefit  that  is  given  today  in  the 
expectation  that  they  will  receive  that  same  benefit 
at  some  future  date.  Public  budgeting  involves 
decisions  about  how  to  structure  such  systems. 
Australia,  for  example,  created  a  type  of  sovereign 
wealth  fund  that  invests  public  money  today  in  the 
expectation  that  it  will  be  able  to  cover  the  pen¬ 
sions  of  an  increased  number  of  retirees  tomorrow. 

An  increasingly  important  part  of  budgeting 
therefore  involves  multiannual  planning.  Annual 
budgets  with  no  planning  encourage  short¬ 
sightedness  and  overspending  for  the  reasons 
given  above.  They  also  tend  to  make  a  govern¬ 
ment  fairly  conservative  in  its  assessment  of  what 
it  is  doing — one  cannot  recognize  the  future 
impact  of  today’s  incremental  changes.  There  are 
also  types  of  spending  that  are  inherently  multi¬ 
annual.  Capital  expenditures,  for  example,  are 
often  on  items  that  take  several  years  to  build. 
There  may  also  be  contingencies,  such  as  loan 
guarantees  or  bank  recapitalizations,  that  do  not 
appear  in  an  annual  budget  but  that  should  be 
included  in  decision  making.  The  traditional  way 
to  plan  would  be  to  set  multiyear  plans  with  fixed 
projections.  The  plan  in  this  case  is  indicative 
only;  that  is,  it  forecasts  what  will  happen  if  there 
are  no  changes  in  current  spending  obligations. 

Trends  in  Public  Financing 

A  trend  in  the  public  finance  field  is  to  move  plan¬ 
ning  toward  medium-term  expenditure  frame¬ 
works  (MTEFs).  One  first  determines  an  overall 
constraint  for  the  period  that  is  based  on  antici¬ 
pated  revenues.  One  then  looks  at  the  expected 
spending  at  the  agency/ministerial  level.  There 
should  be  a  clear  distinction  between  ongoing  and 
new  programs.  There  should  also  be  measures  to 
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produce  savings  in  low-priority  areas  that  can  cre¬ 
ate  fiscal  “space”  for  ministries  to  reallocate  funds 
where  appropriate.  A  good  example  is  that  of  the 
Australian  government,  which  begins  with  a  base¬ 
line  projection  and  provides  forward  rolling  esti¬ 
mates  of  spending  for  the  next  3  years.  The  coun¬ 
try’s  Department  of  Finance  then  updates  these 
plans.  While  the  goal  is  to  get  decision  makers  in 
government  to  consider  the  temporal  implications 
of  their  actions,  the  law  also  requires  the  govern¬ 
ment  to  provide  an  update  of  this  framework  to 
voters  3  months  before  an  election. 

While  several  international  economic  organiza¬ 
tions,  such  as  the  International  Monetary  Fund, 
have  suggested  that  countries  adopt  such  medium- 
term  expenditure  frameworks,  one  should  also  be 
aware  of  the  prerequisites  for  such  planning.  A 
country  needs  to  have  enough  macroeconomic 
stability  so  that  revenue  and  expenditure  predic¬ 
tions  can  be  made  with  some  accuracy.  Politicians 
have  to  agree  to  use  the  frameworks.  There  also 
has  to  be  core  capacity  in  different  ministries  to  do 
the  planning. 

The  focus  so  far  has  been  on  the  decision¬ 
making  process  that  is  involved  in  making  the  pub¬ 
lic  budget,  but  how  the  money  is  spent  is  another 
important  topic  that  is  increasingly  making  its  way 
into  even  the  initial  decision-making  stages.  This 
brings  us  to  the  standards  that  one  uses  to  make 
the  evaluation.  The  first  standard  is  usually  a  fidu¬ 
ciary  one,  namely,  whether  the  budget  upholds 
fiscal  discipline  and,  ultimately,  whether  the  state 
is  solvent.  The  second  standard  concerns  allocative 
efficiency.  Does  the  state  budget  reflect  govern¬ 
ment  priorities?  The  third  standard  is  operational 
efficiency.  The  state  should  not  pay  too  much  for 
the  services  and  goods  it  provides  its  citizens. 

This  then  begs  the  question  of  what  one  should 
evaluate.  One  can  assess  a  budget  according  to  its 
inputs,  or  how  much  is  spent  on  each  budget  cat¬ 
egory,  but  increasingly  governments  are  assessing 
budgets  according  to  their  outputs,  or  even  out¬ 
comes.  The  examples  here  illustrate  the  differences 
among  the  three  concepts.  The  amount  of  money  a 
city  spends  on  its  police  force  is  an  input.  The 
number  of  police  personnel  would  be  the  output, 
while  the  overall  crime  level  would  be  the  out¬ 
come.  Concerns  about  fiscal  discipline  usually 
focus  on  inputs  and  discuss  the  budget  balance  as 
well  as  the  debt  burden  of  the  state.  Here  are  some 


of  the  questions  regarding  fiscal  discipline  that  one 
may  need  to  answer:  Who  finances  any  debt  that  is 
generated?  How  much  interest  does  the  state  pay 
on  that  debt?  Is  the  government  credible  enough  to 
be  able  to  repay  it  at  some  point  in  the  future? 
Discussions  of  allocative  efficiency  can  also  look  at 
inputs,  but  generally,  a  detailed  discussion  of  both 
forms  of  efficiency  requires  some  knowledge  about 
both  outputs  and  outcomes. 

“Performance  budgeting”  is  perhaps  the  hot 
topic  in  budget  circles  these  days,  and  countries 
have  in  practice  adopted  different  techniques 
under  this  heading  to  get  at  output  and  outcomes. 
The  simplest  form  of  this  technique  is  when  infor¬ 
mation  on  targets  or  results  is  included  as  part  of 
the  budget  documentation,  but  it  is  really  only 
background  information  in  practice.  Performance- 
informed  budget,  in  contrast,  means  that  informa¬ 
tion  enters  the  budget  process  but  it  does  not  have 
any  real  effect  on  how  resources  are  allocated.  A 
true  direct-performance  system  allocates  future 
resources  depending  on  the  achievement  of  specific 
goals.  For  example,  there  may  be  an  expectation 
that  a  school  district  educate  a  minimum  number 
of  students,  and  if  it  does  not,  it  will  lose  funds  to 
districts  that  educate  more  students. 

A  historical  example  illustrates  how  the  system 
can  work.  In  the  1950s,  the  different  branches  of 
the  U.S.  military  generally  decided  how  many 
tanks,  aircraft  carriers,  and  other  equipment  they 
needed.  President  John  F.  Kennedy’s  defense  secre¬ 
tary,  Robert  McNamara,  instituted  a  new  budget¬ 
ing  system  known  as  the  Planning,  Programming, 
and  Budgeting  System  (PPBS).  It  extended  the 
planning  horizons  from  1  to  5  years.  It  also  insti¬ 
tuted  planning  that  asked  what  the  threat  was  the 
U.S.  military  was  supposed  to  address,  what  the 
appropriate  force  mix  should  be  given  the  threat, 
and  what  funding  should  be  allocated  to  develop 
that  force  mix.  While  there  has  been  some  retrench¬ 
ment  on  this  system,  it  had  a  practical  political 
effect  of  moving  some  of  the  “power”  from  the 
joint  chiefs  of  staff,  which  was  composed  of  the 
heads  of  each  branch  of  the  military,  to  the  civilian 
leadership  in  the  Department  of  Defense. 

Some  variation  of  performance  budgeting  has 
appeared  in  many  countries  over  the  past  2  decades, 
and  their  experiences  have  raised  a  series  of  chal¬ 
lenges.  One  issue  concerns  the  information  that  is 
needed  to  do  performance  budgeting.  It  is  often 
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costly  to  generate,  and  some  items  are  easier  to 
assess  than  others.  Moreover,  one  would  ideally 
like  to  have  information  that  allows  one  to  estimate 
whether  different  levels  of  expenditure  generate  dif¬ 
ferent  levels  of  results,  and  such  information  is  hard 
to  get.  Finally,  there  are  monitoring  issues.  If  it  is  in 
the  best  interests  of  a  given  school  to  project  good 
performance,  for  example,  and  the  school  authori¬ 
ties  collect  the  information  themselves,  how  can 
one  be  sure  that  the  less  flattering  information  is 
indeed  passed  along  to  policymakers? 

The  number  and  type  of  output-  or  outcome- 
based  targets  can  also  overwhelm  policymakers. 
Richard  Hughes  indicates  that  the  United  Kingdom 
(UK)  had  more  than  300  performance  targets  and 
more  than  600  specific  targets  when  it  introduced 
the  Labour  government’s  version  of  performance 
budgeting  in  1998,  and  politicians  and  bureaucrats 
alike  were  overwhelmed.  Prime  Minister  Tony 
Blair  decided  to  focus  on  the  35  he  found  the  most 
important,  and  in  practice,  the  number  of  targets 
has  declined  after  each  spending  review  to  only  30 
in  2007.  While  France  included  more  than  1,000 
targets  in  its  first  budget  after  it  adopted  perfor¬ 
mance-based  budgeting  in  2006,  one  suspects  that 
the  number  of  targets  ultimately  will  decrease  in 
this  country  as  well. 

Finally,  to  collect  all  the  data  necessary  to  do 
performance  budgeting,  to  evaluate  performance, 
to  adjust  spending  given  that  performance,  and  to 
update  targets  takes  a  professional,  and  compe¬ 
tent,  bureaucracy.  Countries  that  have  adopted 
performance  budgeting  and  that  have  generally 
found  the  approach  worthwhile  have  had  some  of 
the  best  bureaucracies  in  the  world.  As  Allen 
Schick  notes,  such  countries  generally  have  decent 
budgeting  systems  anyway  and  are  probably  the 
least  in  need  of  performance  budgeting  in  the  first 
place.  According  to  Salvator  Schiavo-Campo, 
Ghana  is  one  country  where  such  budgeting  tech¬ 
niques  may  have  been  doomed  from  the  start.  The 
African  country  set  more  than  2,500  targets  in  the 
late  1990s,  or  several  times  the  number  of  accoun¬ 
tants  in  the  public  sector,  and  the  most  important 
item  in  the  budget,  personnel  expenditure,  was  not 
included  at  all. 

Joining  performance  budgeting  are  other  trends, 
such  as  private  finance  initiatives,  that  blur  the  dif¬ 
ference  between  “public”  and  “private”  budget¬ 
ing.  Consider  who  should  pay  for  the  construction 


and  maintenance  of  a  bridge.  Traditionally,  the 
government  would  build  the  bridge,  maintain  it, 
and — if  it  chose  to  charge  a  toll — receive  any  rev¬ 
enues.  This  meant  that  the  government  was  respon¬ 
sible  for  raising  the  money  for  bridge  construction. 
It  also  bore  any  risks  associated  with  the  bridge,  be 
they  bad  weather  that  delayed  the  finish  date  to  the 
effects  of  saltwater  on  the  durability  of  the  materi¬ 
als  used.  The  government  faces  a  constraint  from 
its  revenue  base  on  how  many  investment  projects 
it  can  build  and  maintain.  In  recent  years,  private 
finance  initiatives  have  brought  the  private  sector 
into  such  projects.  Generally,  the  private  sector 
arranges  the  financing  from  private  capital  mar¬ 
kets.  It  also  bears  the  risks  from  the  construction 
phase  onward.  Many  such  arrangements  also 
assign  the  maintenance  of  the  investment  to  the 
private  sector.  In  exchange,  the  private  part  of  the 
arrangement  gets  the  revenues  from  the  investment 
for  a  fixed  amount  of  time. 

Such  private  finance  initiatives  have  become 
widespread  in  the  UK,  but  one  also  finds  them  in 
other  countries,  such  as  Germany.  Because  the 
arrangements  do  require  fairly  large-scale  private 
financing,  they  depend  on  stable,  and  liquid,  capi¬ 
tal  markets.  The  financial  crisis  that  began  in  2007 
put  some  of  these  arrangements  in  jeopardy,  and  in 
the  UK,  the  state  had  to  bail  out  the  private  sector 
and  assume  the  remaining  risk  from  a  given  proj¬ 
ect.  This  development  begged  the  question  about 
how  “private”  such  initiatives  are  in  practice. 

Mark  Hallerberg 
Hertie  School  of  Governance 
Berlin,  Germany 
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Public  Employment 


Public  employment  has  many  aspects  depending 
on  the  level  one  is  considering.  It  has  a  role  in 
national  economy  and  national  employment,  in  the 
public  sector  generally,  and  at  the  level  of  public 
institutions  such  as  ministries  and  agencies.  This 
entry  describes  the  nature  of  public  employment 
and  its  significance  in  government  and  society. 

Public  Employment  in  the  Economy 

The  role  of  public  employment  varies  substantially 
depending  on  the  role  of  the  state  in  society.  In 
countries  following  the  model  of  a  welfare  state, 
such  as  the  Nordic  countries,  the  share  of  public 
sector  and  public  employment  can  be  quite  high — 
30%  of  the  national  economy  or  even  higher.  Any 
numerical  estimation  of  the  size  of  public  employ¬ 
ment  involves  a  number  of  assumptions.  For 
example,  it  is  important  to  know  what  percentage 


of  public  employment  is  composed  of  those 
employed  by  the  armed  forces,  because  this  share 
of  employment  is  considered  to  be  outside  the 
working  force.  Also,  given  the  complex  relation¬ 
ships  now  involved  in  the  delivery  of  many  public 
services,  it  has  become  more  difficult  to  determine 
who  is  in  the  public  sector  and  who  is  outside. 

In  less  developed  countries,  the  role  of  public 
employment  in  the  national  economy  is  usually 
much  smaller,  in  part  because  it  fulfills  different 
functions.  On  the  one  hand,  the  public  sector  plays 
a  more  significant  role  in  development  functions 
such  as  education  and  in  social  and  health  policy. 
On  the  other  hand,  the  limited  resources  available 
to  the  public  sector  do  not  permit  these  countries 
to  employ  many  public  servants.  Also,  patronage 
and  clientelism  in  many  of  these  systems  may  pre¬ 
vent  hiring  of  employees  through  the  merit  systems 
that  are  more  typical  of  industrialized  democra¬ 
cies.  The  politicization  of  public  service  creates 
both  management  problems  and  difficulties  in 
providing  equitable  public  services. 

Public  Employment  in  the  Political  System 

The  structure  of  the  public  sector  determines 
where  the  bulk  of  public  employment  will  occur. 
In  federal  states,  the  central  government  may  be 
relatively  small  compared  with  that  found  in  uni¬ 
tary  systems.  For  example,  in  Germany,  the  central 
government  has  small  policy-making  ministries. 
On  the  other  hand,  in  a  centralized  regime  such  as 
France,  central  government  employees  include 
elementary  and  secondary  school  teachers. 

Public  employment  reflects  the  internal  structure 
of  the  policy  sectors  in  the  public  sector.  Under  each 
ministerial  sector,  there  are  agencies,  regional  units, 
and  local  units,  which  together  form  administrative 
sectors.  For  instance,  under  the  Ministry  of  Interior, 
there  are  the  police  forces,  the  front  guards,  and  dif¬ 
ferent  kinds  of  organizations  that  are  responsible 
for  the  security  of  the  country.  These  organizations 
have  employees,  such  as  police  and  fire  personnel 
and  the  like,  who  belong  to  the  public  employment 
sector.  The  number  of  ministries  and  associated 
functions  vary  among  countries,  but  typically,  the 
number  of  ministerial  sectors  ranges  from  10  to  20. 
The  size  of  employment  varies  greatly  from  sector 
to  sector.  In  welfare  states,  the  largest  sectors  are 
social  affairs,  health  care  affairs,  and  education 
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(schools  and  universities).  As  each  country  has 
developed,  the  size  of  its  ministerial  sectors  has  also 
changed  as  a  result  of  the  modernization  or  devel¬ 
opment  process.  For  instance,  during  the  initial 
period  of  rail  and  public  road  construction,  the 
number  of  public  employees  in  the  traffic  and  com¬ 
munication  sectors  was  very  high,  but  today  it  has 
decreased.  Thus,  public  employment  to  some  extent 
can  be  used  as  an  indicator  of  the  priorities  of  gov¬ 
ernment  and  the  desire  of  the  public  sector  to  con¬ 
trol  important  policy  sectors. 

General  rules  and  steering  systems  within  the 
public  sector  and  governments  have  organized  pub¬ 
lic  employment  into  a  common  system.  These  pub¬ 
lic  employment  systems  generally  contain  several 
categories  of  public  employees,  which  are  more  or 
less  governed  by  formal  rules.  In  these  systems,  civil 
servants  play  a  major  role.  These  employees  tend  to 
be  in  a  defined  career  and  to  be  governed  by  laws 
that  are  distinct  from  other  labor  laws  in  the  soci¬ 
ety.  Unless  they  fail  woefully  in  their  tasks  or  are 
guilty  of  malfeasance,  civil  servants  in  particular 
may  have  secure  employment  for  a  lifetime.  In 
many  civil  service  systems,  rewards  for  office  are 
also  strictly  defined  by  law.  This  more  secure  posi¬ 
tion  of  the  civil  servant  follows  Max  Weber’s 
model  of  neutrality  of  civil  servants. 

Additionally,  there  are  other  public  employees 
whose  position  in  the  public  employment  system  is 
not  as  legally  formalized  as  that  of  civil  servants. 
School  teachers,  nurses,  and  doctors  usually  belong 
to  this  group  of  public  employees.  These  employees 
may  have  some  of  the  protections  of  civil  service 
law  but  often  are  employed  either  as  professionals 
or  under  ordinary  labor  law.  Furthermore,  many  of 
these  employees  have  only  a  tenuous  relationship 
with  the  public  sector  because  the  institutions  for 
which  they  work  (e.g.,  hospitals,  universities,  etc.) 
are  managed  as  autonomous  organizations  rather 
than  as  components  of  the  public  sector  per  se. 

Since  the  1970s,  in  modern  national  states  such 
as  most  European  countries  and  many  others 
around  the  world,  national-level  steering  systems 
have  been  created  to  tackle  public  employment  pol¬ 
icy  issues.  In  the  most  developed  countries,  the  for¬ 
mal  rules  of  public  employment  may  be  controlled 
by  both  legislation  and  negotiation  systems.  These 
systems,  which  include  rules  for  negotiating  salaries 
and  other  work  conditions  of  public  employees, 
complement  civil  service  laws  that  fix  the  legal 


position  of  public  employees.  These  systems  help 
develop  better  relationships  between  public  employer 
organizations  and  trade  unions  for  negotiations.  In 
many  countries,  such  negotiation  systems  have  pro¬ 
vided  not  only  public  employees  but  also  civil  ser¬ 
vants  the  right  to  strike.  Traditionally,  civil  servants 
did  not  have  the  right  to  strike. 

Especially  since  the  1980s,  new  public  manage¬ 
ment  (NPM)  has  created  a  more  flexible  framework 
for  public  sector  employees.  The  number  of  ordi¬ 
nary  civil  servants  has  decreased,  while  at  the  same 
time,  greater  numbers  of  public  employees  have 
been  hired  on  contracts,  following  private  sector 
labor  laws.  In  many  countries,  with  more  people 
moving  in  and  out  of  the  public  sector,  there  is  a 
decline  in  the  number  of  people  opting  for  a  career 
in  civil  service.  This  has  meant  increasing  similari¬ 
ties  and  closer  contacts  between  public  and  private 
working  life,  which  has  resulted  in  a  better  under¬ 
standing  of  the  obligations  of  public  employees. 

In  fact,  a  small  core  group  of  civil  servants,  and 
the  legislative  systems  built  around  their  position, 
provides  a  national  profile  for  public  employment. 
In  developed  countries,  there  are  competing  pro¬ 
files  of  the  core  civil  servants.  The  British  system  is 
famous  for  its  civil  servants,  who  are  politically 
neutral  but  are  increasingly  being  managed  by  rules 
more  similar  to  those  in  the  private  sector.  The 
French  system  is  widely  known  for  its  career  civil 
servants,  who  are  educated  in  the  best  Parisian  lyce- 
ums  and  universities  and  have  received  training  at 
the  publicly  run  Ecole  Nationale  d’Administration 
(ENA).  One  can  also  distinguish  the  German  pro¬ 
file,  in  which  a  legal  education  and  the  organization 
culture  of  Recbtstaat  have  played  a  prominent  role. 
In  the  Nordic  countries,  the  neutrality  and  legal 
basis  of  civil  servant  activities  are  emphasized,  but 
recruitment  and  career  development  are  moving 
toward  open  hiring,  by  which  anyone  who  has  the 
necessary  qualifications  can  be  appointed  to  the 
civil  service. 

In  addition  to  the  issues  of  politicization  in  pub¬ 
lic  employment  mentioned  above,  the  issue  of 
representativeness  also  can  undermine  the  effec¬ 
tiveness  of  the  public  sector.  Especially,  for  street- 
level  bureaucrats  in  contact  with  clients,  the  extent 
to  which  the  public  bureaucracy  is  representative 
of  society  in  social  and  gender  terms  will  influence 
the  success  of  government  in  delivering  programs 
as  well  as  the  legitimacy  of  the  political  system. 
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Administrative  Traditions 
and  Public  Employment 

Public  employment  systems  vary  from  continent  to 
continent.  The  traditional  system  in  Europe  has, 
however,  had  a  wide  impact  on  the  general  devel¬ 
opment  of  public  employment  traditions  all  over 
the  world.  Colonialization  has  also  had  a  role  in 
spreading  European  public  employment  systems, 
especially  in  the  African  and  Asian  countries  that 
once  belonged  to  the  British  Empire  or  had  been 
under  the  regime  of  France.  The  British  tradition 
of  civil  service,  however,  had  a  wide  impact  in 
countries  that  have  similar  or  related  administra¬ 
tive  traditions,  such  as  the  United  States,  Canada, 
Australia,  and  New  Zealand. 

In  today’s  world,  the  development  of  public 
employment  is  often  identified  with  the  British 
(including  that  of  the  British  Empire  and  its  former 
colonies)  and  the  continental  European  traditions. 
In  wider  historical  consideration,  one  must  not 
forget  the  Chinese  mandarin  tradition,  the  tradi¬ 
tions  of  Egypt,  and  those  of  ancient  Greece  or 
Rome.  Historical  traditions  of  France,  traditions 
from  the  Great  Revolution  and  the  so-called 
Napoleonic  tradition,  have  also  had  a  great  impact 
in  the  development  of  modern  public  employment 
traditions.  The  fundamentals  of  modern  public 
employment  from  cameralistic  administrative  arts 
to  Weberian  Recbtstaat  are  derived  from  German 
traditions.  The  Nordic  welfare  state  ideal  has 
brought  into  the  public  employment  tradition  the 
demands  of  public  service  orientation  and  the  role 
of  civil  servants  as  service-oriented  staff  responsi¬ 
ble  for  high-quality  public  services  in  social  and 
health  care  and  in  education. 
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Public  Goods 


Unlike  private  goods,  which  the  market  can  pro¬ 
duce  and  sell,  public  goods  are  ones  that  people 
can  freely  use  and  share  with  others.  A  pure  public 
good  is  nonexcludable  (people  cannot  be  excluded 
from  its  use)  and  joint  in  consumption  or  nonrival 
(use  by  one  person  will  not  reduce  its  availability 
to  others).  This  entry  describes  the  nature  of  pub¬ 
lic  goods  and  contrasts  the  approaches  to  them 
taken  by  economists  and  political  scientists.  When 
allocating  public  or  semipublic  goods  by  means  of 
the  coercive  powers  of  the  state — legislation,  bud¬ 
geting,  and  taxation — governments  have  to  handle 
the  problematic  of  these  goods,  including  tax  eva¬ 
sion  (free  riding),  oversupply  (rent  seeking),  and 
undersupply  (preference  distortions),  as  well  as 
allocation  and  distribution  in  trade-offs.  The  entry 
examines  these  phenomena  and  concludes  with  a 
look  at  the  allocation  of  common  goods  in  a 
global  context. 

Public  Goods  and  Public  Finance 

Since  the  identification  and  conceptualization  of 
this  special  set  of  goods  or  services  by  Italian  pub¬ 
lic  finance  scholars  in  the  19th  century,  public 
goods  have  attracted  a  lot  of  theoretical  interest 
from  both  economists  and  political  scientists. 
Whereas  in  economics  the  question  of  the  optimal 
supply  of  public  goods  has  dominated  the  concern 
with  these  goods  and  services,  which  differ  from 
private  goods  as  conceptualized  in  market  theory, 
political  scientists  have  looked  at  how  goods  and 
services  with  the  characteristics  of  public  goods  are 
allocated  in  domestic  as  well  as  international 
political  systems.  The  political  preoccupation  with 
public  goods  has  led  to  inter  alia  theories  of  collec¬ 
tive  action,  free  riding,  and  transaction  costs. 

The  theory  of  public  goods  offers  an  economic 
approach  to  public  policy  making  and  implementa¬ 
tion,  suggesting  criteria  that  public  programs  should 
fulfill,  derived  from  conceptions  of  market  failure 
and  collective-action  difficulties.  It  targets  the  allo¬ 
cation  of  public  services  as  well  as  the  regulation  of 
the  economy,  disregarding  distributive  aspects. 
Although  its  basic  model  of  an  optimal  supply  of 
public  goods  offers  an  ideal-type  construction 
bypassing  transaction  costs,  opportunism,  and 
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bounded  rationality,  it  still  captures  the  essence  of 
the  state  as  the  guardian  of  law  and  order,  financ¬ 
ing  its  operations  with  taxation. 

Nature  of  Public  and  Semipublic  Goods 

A  pure  public  good  is  characterized  by  nonexclud¬ 
ability  and  jointness  in  consumption,  or  “lumpi¬ 
ness,”  meaning  that  people  can  enjoy  it  without 
paying  for  it  and  share  it  whole  with  others.  The 
two  properties  of  nonexcludability  and  lumpiness 
create  market  failure,  as  pure  private  goods  are 
excludable  by  the  price  mechanism  and  completely 
appropriable.  Impure  or  mixed  public  goods  have 
either  nonexcludability  or  nonappropriability  but 
not  both.  Nonexcludability  is  also  called  external¬ 
ity  (external  effects),  and  nonappropriability 
involves  economies  of  scale  (natural  monopoly). 
Public  safety  and  defense  are  pure  public  goods, 
whereas  environmental  resources  (“common  pool” 
goods),  such  as  water  in  a  river,  which  are  open  to 
all  but  can  be  depleted  by  use,  are  semipublic 
goods,  as  are  those  supplied  through  infrastructure 
(“common  club”  goods),  such  as  toll  roads — 
individuals  must  pay  for  access,  but  use  does  not 
deplete  such  goods.  According  to  Richard  Cornes 
and  Todd  Sandler  (1996),  there  are  both  positive 
and  negative  externalities,  and  the  extent  of  scale 
economies  varies,  as  one  speaks  of  global,  national, 
regional,  and  local  public  goods. 

Problematic  With  Public  Goods 

No  single  person  has  any  incentive  to  supply  pub¬ 
lic  goods,  since  he  or  she  cannot  internalize  all  its 
benefits.  And  once  a  public  good  has  been  sup¬ 
plied,  the  best  strategy  for  an  individual  is  to  free 
ride  on  it.  Since  the  allocation  of  public  goods 
faces  market  failure,  they  invite  government  inter¬ 
vention.  The  key  question  is  “How  and  in  what 
form  should  government  support  or  manage  the 
allocation  of  public  goods?”  Government  tends  to 
allocate  pure  public  goods  through  its  bureau¬ 
cracy,  in  order  to  regulate  the  externalities  by 
agencies  and  to  undo  natural  monopolies  by  poli¬ 
cies  of  deregulation,  privatization,  and  leveling  the 
playing  field.  However,  when  government  corrects 
for  market  failures,  then  the  risk  of  government 
failure  arises,  as  the  public  good  is  captured  by 
rent-seeking  actors. 


Optimal  Supply  of  Public  Goods 

Efficient  allocation  of  public  goods  requires  that 
the  willingness  of  people  to  pay  for  the  lumpy 
good  cover  the  cost  of  its  provision.  This  can  be 
achieved  only  by  summing  individuals’  willingness 
to  pay  through  some  process  of  aggregation.  A 
voluntary  supply  of  public  goods  is  theoretically 
imaginable  (by  way  of  demand  revelation  schemes, 
through  which  people  indicate  their  desire  for  a 
good  by  indicating  how  much  they  are  willing  to 
pay  for  it),  but  when  the  number  of  people 
demanding  the  public  good  increases,  then  trans¬ 
action  costs  quickly  become  prohibitive.  To  avoid 
free  riding  and  preference  distortion  as  well  as 
delaying  tactics,  government  replaces  the  volun¬ 
tary  mechanism  with  state  authority.  One  may 
argue  that  the  first  theoretician  of  public  goods 
was  Thomas  Hobbes,  who  regarded  safety  as  a 
country’s  most  important  public  good. 

Real-Life  Distortions  With  Public  Goods 

The  allocation  of  public  goods — pure  or  impure — 
is  in  practical  terms  at  the  core  of  the  political 
process,  as  government  at  various  levels  in  the 
political  system  engages  in  policy  making  and 
implementation  of  programs  having  the  character¬ 
istics  of  public  goods.  The  aggregation  procedure 
for  summing  up  the  willingness  to  pay  for  public 
goods  is  in  reality  the  democratic  procedure,  or 
elections  as  well  as  legislative  budgeting  and  taxa¬ 
tion.  Thus,  party  politics  and  coalition  politics 
basically  provide  the  set  of  possible  demands  for 
public  goods,  whether  pure  or  impure. 

In  democratic  politics,  the  allocation  of  public 
goods  may  be  combined  with  distributional  poli¬ 
cies,  with  policy  combinations  that  are  conducive 
to  fiscal  illusions  involving  the  over-  or  undersup¬ 
ply  of  public  goods.  The  rational  strategy  for  the 
citizen  is  to  free  ride  on  the  supply  of  public  goods, 
minimizing  as  much  as  possible  his  or  her  contribu¬ 
tion  to  covering  their  costs.  The  efficient  prices  for 
public  goods — “Lindahl  tax  prices” — are  not  fea¬ 
sible  due  to  preference  revelation  distortions  and 
free  riding.  Thus,  government  imposes  taxes  and 
borrows  money  as  needed  to  pay  for  the  lumpy 
goods  allocated  to  the  entire  political  community. 
There  is  no  guarantee  that  what  a  citizen  is  willing 
to  pay  will  equal  what  he  or  she  is  forced  to  pay — 
the  Lindahl  tax  price.  The  allocation  of  public 
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goods  has  distribution  consequences  with  regard 
to  both  their  benefits  and  their  costs.  Thus,  public 
goods  involve  more  political  than  strictly  eco¬ 
nomic  concerns.  This  is  true  for  both  supply  and 
demand. 

On  the  supply  side,  public  goods  contracts  tend 
to  be  lucrative  for  private  entrepreneurs.  Deciding 
how  to  allocate  these  contracts,  worth  millions  or 
billions  of  dollars,  involves  a  tendering/bidding 
process  that  is  very  difficult  to  fully  institutionalize 
in  accordance  with  a  transparent  competition 
regime.  Politicians  are  easily  “captured”  by  vari¬ 
ous  enterprises,  looking  for  rent  in  the  construc¬ 
tion  of  lumpy  goods — see,  for  example,  Robert 
Caro’s  (2002)  descriptions  of  Lyndon  Baines 
Johnson’s  rise  to  power  and  Senate  dominance. 
Charles  Rowley,  Robert  Tollison,  and  Gordon 
Tullock’s  (1988)  theory  of  rent  seeking  suggests 
that  government  contractors  are  willing  to  pay 
almost  all  of  the  economic  rent  captured  in  cam¬ 
paign  contributions  and  the  like. 

Externalities  ( Nonexcludability ) 

External  effects  call  for  public  policies,  demot¬ 
ing  negative  ones  and  promoting  positive  ones. 
Since  the  social  benefits  or  costs  are  not  captured 
by  market  prices,  there  is  market  failure  and  thus  a 
reason  for  government  intervention.  The  key  ques¬ 
tion  is  how  much  should  government  spend  on 
generating  social  benefits,  and  how  much  should  it 
impose  as  social  costs?  The  theory  of  cost-benefit 
analysis  was  intended  as  a  rational  answer  to  this 
query,  but  it  has  proven  difficult  to  target  policy 
making  so  that  social  benefits  or  social  costs  are 
handled  in  an  efficient  manner.  The  problem  is  that 
there  may  be  no  limit  to  what  government  should 
do  using  externalities  as  the  criteria.  However,  spe¬ 
cific  measures  such  as  vaccination  or  control  of 
contagious  deceases  are  clearly  defensible  public 
goods  policy.  Externalities  may  be  promoted  as 
national  public  goods  or  local  public  goods  depend¬ 
ing  on  the  size  of  the  community  involved. 

Club  Goods  (Jointness ) 

The  theory  of  clubs  is  an  attempt  to  devise  a 
rational  approach  to  the  problem  of  dividing  com¬ 
petences  between  various  levels  of  government, 
taking  into  account  the  capacity  of  a  political 


community  of  a  certain  size  to  internalize  exter¬ 
nalities  as  much  as  possible,  especially  when  bene¬ 
fits  taper  off  or  congestion  arises.  Federalism  is  one 
version  of  such  a  club  theory  about  creating  juris¬ 
dictions  that  are  capable  of  internalization  of  as 
many  externalities  as  feasible.  It  has  received  a 
novel  interpretation  in  the  theory  of  multilevel  gov¬ 
ernance,  including  new  regional  organizations  such 
as  the  European  Union  (EU). 

Infrastructure  (Scale  Economies ) 

The  public  goods  aspect  of  infrastructure  used  to 
be  analyzed  as  natural  monopoly  regulation  with  a 
call  for  government  intervention.  However,  with 
the  deregulation  and  privatization  starting  in  the 
late  1970s,  both  monopoly  regulation  and  antitrust 
policy  have  lost  some  of  their  credentials.  Again,  it 
is  capture  theory  in  one  version  or  another  that 
favors  a  level-playing-field  policy  orientation  with 
maximum  open  entry  in  enlarged  markets  beyond 
national  borders.  Today,  regional  competition 
policies,  such  as  the  EU  regime  with  its  commis¬ 
sioners,  are  considered  appropriate  to  undo  monop¬ 
oly  and  invite  competition.  The  occurrence  of 
economies  of  scale  is  no  longer  considered  a  natu¬ 
ral  phenomenon  but  a  market-related  one. 
Increasing  the  size  of  the  market  is  a  better  response 
to  this  type  of  market  failure  with  club  goods  than 
entry  regulation  and  price  setting.  The  size  of  the 
market  is  a  function  of  the  extension  of  the  law  or 
common  legal  institutions  between  countries,  such 
as  EU  law. 

Global  Public  Goods  and  Commons  (Rivalry) 

Governments  may  handle  the  collective-action 
problems  that  arise  concerning  the  allocation  of 
public  goods  in  their  own  domestic  countries. 
Instead  of  relying  on  a  voluntary  mechanism  for 
the  supply  of  national  or  local  public  goods,  gov¬ 
ernments  employ  state  authority  to  make  bud¬ 
gets,  laws,  and  taxes.  But  the  problems  with  a 
voluntary  supply  of  public  goods  resurface  on 
the  global  scene,  where  externalities  such  as 
global  warming,  carbon  emissions,  and  the  envi¬ 
ronmental  destruction  of  common  pools  such  as 
the  oceans  and  endangered  species  call  for  a 
global  ecology  policy.  Kyoto  I  was  meant  to  set 
up  such  a  global  common  pools  policy,  but  it  ran 
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into  the  typical  collective  problems  of  preference 
misrepresentation,  free  riding,  and  reneging. 
Negotiating  a  new  global  policy  runs  up  against 
the  tremendous  transaction  costs  typical  of  a  vol¬ 
untary  approach  to  public  goods  provision. 
Elinor  Ostrom  (1990)  has  argued  that  small 
groups  may  overcome  the  collective-action  diffi¬ 
culty  of  protecting  a  commons,  but  in  most  such 
examples,  the  state  helps  the  groups  reduce 
transaction  costs  by  somehow  or  indirectly 
enforcing  the  rules  allocating  the  common  pool 
goods.  In  the  global  arena,  there  is  no  world  gov¬ 
ernment  or  voluntary  coordination  scheme  to  do 
just  that. 

Political  Relevance  of  Public  Goods 

While  economists  analyze  the  question  of  a  volun¬ 
tary  supply  of  public  goods  and  its  optimality, 
political  scientists  know  that  such  schemes  will 
hardly  be  practical.  Thus,  they  concentrate  on 
understanding  the  allocation  of  public  or  semipub¬ 
lic  goods  by  means  of  the  operations  of  state  insti¬ 
tutions.  In  a  representative  democracy,  political 
parties  handle  the  demand  and  supply  of  public 
and  semipublic  goods.  Due  to  bounded  rationality, 
opportunism,  and  voter  myopia,  the  political  pro¬ 
cess  will  not  arrive  at  an  optimal  allocation  of  these 
goods,  where  each  individual’s  marginal  benefit 
equals  his  or  her  marginal  cost  (Lindahl  equilib¬ 
rium).  It  is  widely  recognized  that  the  allocation  of 
public  goods  does  have  distributional  implications 
and  that  it  is  the  mixture  of  allocative  efficiency 
and  justice  in  distribution  that  makes  public  or 
semipublic  goods  inherently  political. 
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The  term  rewards  refers  to  any  kind  of  tangible 
income  and  intangible  compensation  or  honor 
attached  to  public  office.  Rewards  are  relevant  in 
political  science  because  they  link  the  adequate 
compensation  of  our  rulers  with  their  perfor¬ 
mance.  On  the  one  hand,  in  democratic  systems, 
proper  compensation  includes  the  reward  level 
and  the  transparency  of  the  reward  structure  for 
public  scrutiny.  Concerns  of  transparency,  cor¬ 
ruption,  and  public  trust  in  rulers  are  relevant  for 
creating  a  proper  reward  system.  On  the  other 
hand,  the  performance  of  public  officials  relates  to 
achievement  at  the  individual  and  organizational 
levels,  which  in  turn  are  linked  to  the  legitimacy  of 
democracy.  These  dimensions  have  been  tackled 
by  normative  and  positive  political  theories. 

Normative  theories  prescribe  what  political 
choices  ought  to  be  made  according  to  some  philo¬ 
sophical  values,  such  as  representativeness,  com¬ 
petitiveness,  and  functionality  of  rewards  for  public 
officials  in  relation  to  the  rest  of  society.  Positive 
theories  try  to  explain  the  particular  political 
choices  made  for  rewarding  public  officials. 
Different  theories  emphasize  the  role  of  institutions 
and  past  legacies,  cultural  values,  or  the  rent- 
seeking  strategies  of  individuals  in  accounting  for 
the  achieved  reward  level,  visibility,  and  value  of  the 
different  parts  of  the  reward  structure. 

The  structure  of  rewards  covers  financial  (salary), 
material  (with  monetary  value),  and  intangible  ele¬ 
ments.  The  combination  of  these  elements  varies 
among  different  categories:  countries;  public 
officials  from  the  legislative,  the  judiciary,  or  the 
executive;  the  legal  nature  of  institutions  (regulatory 
agencies,  independent  national  banks,  bureaucracies); 
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or  policy  areas  (public  safety,  education,  health, 
etc.).  Managerial  public  sector  reform  trends  and 
the  role  played  by  democratic  values  are  likely  to 
have  an  impact  on  how  the  structure  of  rewards 
changes  for  different  public  officials. 

This  entry  is  further  developed  into  four  sec¬ 
tions.  First,  the  main  normative  and  positive 
theories  are  described.  A  second  section  shows  the 
elements  of  the  reward  structure.  The  third  section 
is  devoted  to  the  recent  emphasis  on  managerial¬ 
ism  and  performance-based  pay.  The  final  section 
links  rewards  to  transparency. 

Political  Theories  on  Rewards 
Normative  Theories 

Normative  theories  of  the  18th  and  19th  centu¬ 
ries  define  how  public  officials  should  be  rewarded. 
Three  different  sets  of  values  have  been  promoted. 
First,  individualistic  values  have  been  fostered 
more  frequently.  Adam  Smith  (1723-1790)  advo¬ 
cated  for  the  exclusive  link  between  payment  and 
performance,  although  he  did  not  propose  a  spe¬ 
cific  reward  system.  Jeremy  Bentham  (1748-1832) 
devised  a  more  nuanced  system  for  rewarding 
performance,  indicating  the  degree  of  transparency 
required  in  defining  the  whole  reward  package, 
how  potential  corruption  practices  should  be 
avoided,  and  the  like.  Frederic  Taylor  (1856-1915) 
also  supported  the  idea  of  paying  for  performance 
when  advocating  his  “scientific  method”  of  work 
in  mass  production.  Second,  Georg  Hegel’s  (1770- 
1831)  view  on  rewards  was  more  functionalist, 
suggesting  a  level  of  pay  that  would  make  neutral 
and  expert  public  servants  focus  on  the  general 
interest  rather  than  the  interest  of  a  few.  Sufficient 
pay  was  needed  to  avoid  corrupt  practices.  A  simi¬ 
lar  view  was  held  by  Max  Weber  (1864-1920). 
Third,  Alexis  de  Tocqueville  (1805-1859)  hypoth¬ 
esized  that  the  strengthening  of  democracy  would 
result  in  a  decline  in  the  rewards  of  public  officials, 
thus  making  them  more  representative. 

These  normative  prescriptions  coincided  with 
the  initiation  of  macroprocesses  such  as  industrial¬ 
ization,  the  consolidation  of  modern  public  bureau¬ 
cracies,  recognition  of  individual  universal  rights, 
and  the  introduction  of  democracy.  These  dispa¬ 
rate  movements,  with  different  emphases  across 
countries,  are  likely  to  have  led  to  the  emergence 
of  the  aforementioned  criteria  on  rewards.  The 


introduction  of  mass  production  systems  brought 
about  a  concern  with  performance  and  private  sec¬ 
tor  benefits.  Unlike  in  aristocratic  times,  the  advent 
of  democracy  introduced  issues  of  public  account¬ 
ability  and  representativeness,  with  the  growth  of 
the  bureaucracy  creating  problems.  The  consolida¬ 
tion  of  public  administration  and  its  neutral  compe¬ 
tence  entailed  the  implementation  of  a  merit  system 
to  fight  politicization  practices,  derived  from  democ¬ 
ratization.  Around  the  same  time,  the  Northcote- 
Trevelyan  report  (1854),  the  Pendleton  Act  (1883), 
and  other  such  regulations  in  Western  societies  tried 
to  abolish  the  favoritism  of  political  appointments 
and  to  replace  them  by  merit  systems. 

At  present,  normative  issues  concerning  rewards 
are  ingrained  in  empirical  research.  The  work  of 
Christopher  Hood  and  Guy  Peters  in  the  1990s 
offered  three  different  criteria,  as  expounded  in 
earlier  philosophical  texts,  against  which  the  level 
of  rewards  and  their  contents  could  be  decided  on. 
First,  a  more  egalitarian  approach  recommends 
that  rewards  of  public  officials,  especially  those 
holding  high  office,  be  representative  of  the  popu¬ 
lation  at  large,  that  is,  they  should  take  into 
account  the  composition  of  the  population  accord¬ 
ing  to  the  most  obvious  criteria  (e.g.,  race,  lan¬ 
guage,  ethnicity).  A  second,  more  functionalist 
approach  advocates  paying  a  sufficient  reward  to 
prevent  officials  from  engaging  in  corrupt  prac¬ 
tices.  Finally,  a  market  approach  suggests  that  the 
level  of  rewards  should  be  on  par  with  the  salaries 
for  similar  positions  in  the  private  sector.  None  of 
the  approaches  can  be  found  in  its  pure  form  in 
any  country,  although  some  countries  seem  to 
have  more  representative  rewards,  while  others  use 
the  functional  principle  of  sufficient  compensation. 
In  any  case,  the  introduction  of  managerial  prac¬ 
tices  in  the  executive  during  the  1990s  and  2000s 
has  apparently  caused  a  shift  toward  more  indi¬ 
vidualistic  and  market-based  practices. 

Positive  Theories 

Since  the  early  1990s,  political  theories  focusing 
on  rewards  have  developed  explanatory  arguments 
trying  to  decipher  why  some  systems  have  made 
particular  choices  when  deciding  reward  levels  for 
public  officials  and  to  describe  the  particular  compo¬ 
sition  and  the  transparency  of  the  reward  structure. 
Christopher  Hood  and  Guy  Peters  have  fostered 
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several  publications  in  which  three  types  of  expla¬ 
nations  have  been  considered:  (1)  institutional, 
(2)  political  economy,  and  (3)  cultural  theories. 

Institutional  theories  claim  that  formal  and 
informal  rules  account  for  the  particular  evolution 
of  reward  systems.  Rules  define  whether  pay  is 
determined  through  collective  bargaining,  inde¬ 
pendent  review,  market  mechanisms,  or  specific 
regulation.  Rules  also  set  the  level  of  pay  at 
national,  sectoral,  or  agency  levels.  Moreover,  the 
proportion  of  the  salary  structure  is  decided  by 
rules  considering  the  basic  pay  and  other  allow¬ 
ances  and  the  degree  of  transparency.  Institutional 
theories  are  designed  to  explain  the  continuity  in 
the  working  of  certain  institutions  but  have  more 
problems  in  identifying  when  a  change  takes  place, 
because  the  institutions  by  themselves,  without  the 
contribution  of  actors,  are  noncredible  energizers 
of  institutional  change. 

Economic  theories  of  politics  consider  the 
constrained  institutional  settings  of  individual 
strategies,  but  they  focus  on  individuals  as  benefit 
maximizers,  that  is,  making  their  rewards  as  large 
as  possible.  Those  theories  are  normally  crafted  as 
games  for  which  two  sets  of  actors  have  different 
preferences  and  strategies.  To  study  rewards, 
games  can  be  modeled  to  confront  the  (dis)trust 
strategies  of  citizens  in  relation  to  the  strategies  of 
public  officials,  who  are  more  or  less  transparent 
in  getting  their  rewards.  Games  can  also  model  to 
what  extent  senior  civil  servants  have  more  or  less 
individualistic  pay  strategies  than  civil  servants  of 
lower  hierarchical  levels.  Although  embedded  in 
institutional  settings,  political  economy  theory 
tends  to  consider  the  preferences  of  individuals  as 
exogenous  to  the  institutions  and  somehow  uni- 
versalistic — that  is,  shaped  outside  the  institutional 
context  in  which  they  operate. 

Cultural  theory,  dismissing  this  universalistic 
view  of  exogenous  preferences,  goes  beyond  the 
classic  idea  that  national  cultures  matter  in 
anchoring  political  phenomena  to  cultural  views. 
Christopher  Hood,  either  alone  or  with  others, 
has  contributed  extensively  to  apply  cultural  the¬ 
ory  to  the  evolution  of  rewards  pattern  over  time. 
This  theory  is  based  on  the  “grid  group”  concepts 
of  Mary  Douglas.  “Group”  refers  to  what  extent 
individual  choices  are  determined  by  the  groups  in 
which  individuals  are  incorporated.  “Grid”  indi¬ 
cates  the  role  of  external  rules  and  prescriptions  in 


determining  the  life  of  individuals.  The  combina¬ 
tion  of  high-low  grid  and  high-low  group  offers 
four  dominant  cultural  types:  (1)  hierarchical, 
(2)  individualistic,  (3)  egalitarian,  and  (4)  hazardous. 

Hierarchical  systems  are  based  on  the  predict¬ 
ability  and  automaticity  of  rewards  progression 
within  organizations.  Individualistic  systems  are 
characterized  by  the  rent-seeking  behavior  of  public 
officials,  which  normally  operates  on  the  variable 
parts  of  the  salary  or  on  other  perks  of  substantial 
monetary  value,  discarding  the  automaticity  of 
rewards.  In  egalitarian  systems,  monetary  rewards 
are  accommodated  at  lower  levels  than  what  the 
individualistic  behavior  would  predict  because 
other  intangible  rewards  are  obtained.  Hazardous 
cultural  patterns  imply  the  introduction  of  chance 
elements  in  the  way  in  which  promotion  and 
rewards  happen,  making  them  nonautomatic  and 
unpredictable. 

Comparative  work  has  used  a  combination  of 
the  three  “lenses,”  as  they  are  not  entirely  alterna¬ 
tive  explanations  of  how  rewards  evolve.  Most 
country  case  studies  have  a  predominant  institu¬ 
tionalist  bias,  which  accounts  for  the  creation  of 
reward  systems  following  traditional  models.  The 
introduction  of  changes  in  this  account  is  normally 
referred  to  as  a  minor  adaptation  of  traditional 
practices.  In  any  case,  institutionalists  have  diffi¬ 
culty  explaining  why  there  are  shifts  within  the 
same  society  from  more  egalitarianism  (or  hierar- 
chism)  toward  more  individualism  over  time  in  the 
rewards  of  senior  civil  servants  or  why  the  salaries 
of  lower  level  civil  servants  erode  over  time,  while 
the  monetary  rewards  of  senior  civil  servants 
increase.  Political  economy  theorists  were  not  able 
to  explain  the  erosion  of  the  wages  of  politicians  in 
several  countries,  in  spite  of  them  trying  to  maxi¬ 
mize  their  rewards.  Cultural  theory  seems  to 
explain  the  shifts  throughout  time,  using  both  insti¬ 
tutionalist  and  political  economy  reasoning.  Further 
research  is  needed  to  test  these  theoretical  lenses. 

The  Structure  of  Rewards  and  Its  Level 

The  list  of  components  of  a  reward  system  is  con¬ 
siderable  and  varies  across  countries  and  types  of 
public  officials.  The  first  distinction  is  between 
financial  rewards,  material  rewards  with  monetary 
value,  and  intangible  rewards.  Financial  rewards 
constitute  the  pay  received  by  public  officials.  They 


2164  Public  Office,  Rewards 


can  be  classified  into  basic  pay  and  other  fringe 
benefits.  Basic  compensation  is  the  salary  paid  for 
a  position  that  includes  insurance,  tax  deduction, 
and  deferred  compensation.  Fringe  benefits  include 
considerations  pertaining  to  the  level  or  grade  of 
the  post,  risk  or  abnormal  working  conditions 
associated  with  the  post,  acknowledgment  of 
seniority,  special  duties  or  responsibilities,  and  per¬ 
formance  bonuses.  These  allowances  are  directly 
linked  to  the  job  performed.  However,  the  particu¬ 
lar  amount  is  not  included  in  the  basic  pay  in  some 
cases  because  they  compensate  the  job  particulari¬ 
ties  associated  with  a  particular  post.  In  other 
cases,  these  allowances  are  not  consolidated  in  the 
basic  pay  so  as  not  to  be  included  in  the  calcula¬ 
tions  of  the  future  pension,  which  will  burden  the 
public  budget. 

Material  rewards,  or  in-kind  benefits,  that  have 
direct  and  substantial  monetary  value  include  varie¬ 
gated  items  such  as  promotion,  inter-  and/or  intra¬ 
municipal  transportation,  housing,  meals,  travel, 
tax  deductions,  and  private  health  insurance,  to  list 
the  most  popular  items  across  countries.  In  most 
cases,  these  rewards  are  tax  free. 

Some  rewards  such  as  promotion,  pension,  and 
employment  in  a  better  paid  job  in  a  private  or 
(semi)public  corporation  are  deferred  in  time  and 
enjoyed  in  the  future.  Traditionally,  public  offi¬ 
cials  were  compensated  for  good  performance  by 
promoting  them  to  higher  positions  with  a  hike  in 
salary.  As  there  are  now  not  enough  positions  to 
promote  high-performing  officials,  good  perfor¬ 
mance  is  rewarded  with  extra  compensation. 
Employment  in  a  more  lucrative  position  may 
constitute  a  typical  path  in  the  system  but  is  at 
least  subject  to  some  kind  of  ethical  code. 

Finally,  public  recognition,  delegation  of 
authority,  social  respect,  job  security,  some  social 
privileges,  or  reputation  are  typical  examples  of 
intangible  rewards.  They  are  subjectively  and  con¬ 
textually  defined.  Individuals  bestow  subjective 
recognition  on  some  potential  rewards,  and  the 
context  may  deem  some  rewards  very  important 
or  irrelevant.  For  instance,  in  some  countries,  civil 
servants  have  high  status  and  are  well  respected, 
while  in  other  countries,  this  is  not  so.  Unless  they 
are  misused,  these  rewards  are  not  a  matter  of 
concern  for  public  accountability. 

Compensation  for  public  officials  includes  many 
different  financial  supplements  and  other  benefits, 
also  colloquially  known  as  perks,  that  make  its 


analysis  complex.  The  various  financial  rewards 
and  in-kind  tangible  benefits,  presently  enjoyed  by 
public  officials  or  deferred  to  the  future,  belong  to 
items  subject  to  more  or  less  transparency  and 
individual  negotiations.  The  managerialist  trend  in 
the  public  sector  in  the  past  30  years  has  also 
introduced  the  relevance  of  performance  bonus. 

Managerialism,  Rewards,  and  Performance 

Managerialism  implies  more  pay  flexibility,  that  is, 
the  ability  of  public  sector  organizations  to  match 
market  rates,  to  compete  for  scarce  skills  and  good 
candidates,  and  to  reward  individual  efforts.  The 
managerial  reforms  that  started  in  the  1980s, 
inspired  by  private  sector  practices,  have  stressed 
the  value  of  more  flexible  pay  arrangements  as 
well  as  the  use  of  performance  pay.  The  creation  of 
agencies  and  organizations  with  a  management 
ethos  that  is  different  from  that  of  traditional  min¬ 
isterial  departments  and  the  possibility  of  hiring 
personnel  under  distinct  contractual  conditions 
has  allowed  for  the  introduction  of  a  more  mana¬ 
gerial  culture  in  some  parts  of  the  public  sector.  In 
these  organizations,  where  direct  entry  from  the 
private  to  the  public  sector  has  been  possible,  sev¬ 
eral  countries  have  tried  to  match  private  sector 
rewards,  although  unsuccessfully. 

Apart  from  uniform  pay  scales,  performance  pay 
is  another  flexible  means  of  compensating  individ¬ 
ual  or  group  productivity.  When  properly  done, 
performance  pay  is  not  consolidated  in  the  salary. 
Extra  performance  is  explicitly  recognized  either 
through  the  comparison  of  achievements  against 
predefined  targets  or  through  the  appraisal  negotia¬ 
tions  between  managers  and  subordinates.  This 
financial  bonus  is  said  to  motivate  workers  to 
higher  levels  of  performance,  according  to  norma¬ 
tive  theories. 

Expectancy  and  reinforcement  theories  have 
been  associated  with  performance  pay.  Expectancy 
theory  predicts  that  employees’  motivation  is  led 
by  their  belief  that  making  an  effort  will  affect 
organizational  performance.  This  increase  in  per¬ 
formance  will  lead  to  an  increase  in  the  monetary 
benefits  of  the  employee,  who  specially  values 
these  benefits.  In  a  nutshell,  higher  performance- 
related  salary  motivates  employees.  Reinforcement 
theory  states  that  individuals  are  likely  to  choose 
responses  associated  with  positive  outcomes  of  the 
past.  In  positive  reinforcement,  valued  behavior 
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will  be  preferred  and  repeated  over  time  because  it 
yields  positive  results,  such  as  higher  salary. 

Managerialism,  present  in  many  countries, 
influences  differently  systems  and  groups  of  public 
officials.  Its  practices  have  likely  resulted  in  greater 
pay  differentials  between  civil  servants  and  politi¬ 
cians,  agency  chief  executives  and  senior  civil  ser¬ 
vants,  and  senior  civil  servants  and  lower  level  civil 
servants.  However,  empirical  research  has  often 
found  a  mismatch  between  the  potential  benefits  of 
performance  pay  for  individuals  and  for  the  orga¬ 
nization  and  staff  motivation.  An  overwhelming 
majority  of  studies  reviewed  by  James  Perry  do  not 
show  evidence  of  positive  effects  of  performance 
pay,  especially  for  organizational  performance  and 
for  higher  managerial  posts,  where  jobs  are  com¬ 
plex  and  tasks  are  not  normally  routinely  defined. 
Positive  effects  in  the  more  routine  jobs  are  due  to 
the  easiness  of  establishing  clear  goals  against 
which  performance  can  be  measured. 

Several  reasons  explain  this  mismatch.  First, 
civil  service  laws  and  procedures  try  to  be  uniform, 
while  individual  performance  pay  implies  excep¬ 
tions  to  uniformity.  Second,  budgetary  allocations 
for  performance  pay  cannot  grow  at  the  pace  of 
performance.  Unlike  the  private  sector,  higher  per¬ 
formance  does  not  necessarily  mean  concomitant 
higher  financial  benefits  for  the  organization. 
Third,  organizations  do  not  invest  much  energy  in 
performance  appraisals,  which  results  in  unsatis¬ 
factory  performance  assessments.  Fourth,  individ¬ 
ual  performance  cannot  be  properly  rewarded  if 
organizational  goals  are  not  clearly  defined,  as  is 
the  case  in  many  public  sector  bodies  with  multiple 
stakeholders.  Finally,  transparency  apparently 
makes  performance  pay  less  workable  in  the  public 
sector,  as  there  are  some  indications  that  secrecy  in 
private  organizations  is  helpful  for  performance 
schemes  to  succeed.  Furthermore,  transparency 
does  not  allow  for  a  substantial  increase  of  perfor¬ 
mance  bonuses  as  this  may  cause  a  public  outcry. 
In  sum,  performance  pay  does  not  seem  to  have 
increased  greatly  staff  motivation.  Therefore,  pub¬ 
lic  service  motivation  is  analyzed  by  including 
other  nonfinancial  components.  Success  stories  of 
performance  pay  have  been  more  prevalent  when 
individual  and  organizational  goals  were  aligned. 

In  any  case,  there  has  been  a  general  shift  away 
from  the  fixed  scales  advocated  by  the  functional¬ 
ist  approaches,  which  believe  in  the  principle  of 
adequate  compensation  and  the  need  for  a  uniform 


reward  system  across  the  board  for  similar  levels  of 
public  employees.  This  general  shift  has  represented 
more  individualization  of  salaries  for  senior  civil 
servants  in  the  executive,  but  not  so  much  in  the 
legislative  and  the  judiciary.  In  spite  of  the  limited 
success  of  pay  for  performance,  pay  differentials 
among  groups  have  materialized.  Civil  servant  sal¬ 
ary  levels  have  surpassed  parliamentarian  salaries 
in  many  countries,  and  the  differences  in  compen¬ 
sation  between  senior  civil  servants  and  civil  ser¬ 
vants  at  lower  levels  have  widened. 

Rewards  and  Transparency 

Transparency  refers  to  the  publicity  and  availabil¬ 
ity  of  data  on  rewards  of  public  officials  for  public 
scrutiny.  The  gradation  of  transparency  depends 
on  the  number  of  components  of  the  reward  struc¬ 
ture  that  are  open  to  scrutiny,  the  degree  of  public¬ 
ity  of  this  information  without  being  demanded, 
and,  alternatively,  the  availability  of  the  data  on 
request  by  individuals. 

Lack  of  transparency  is  related  to  at  least  two 
different  sets  of  practices.  First,  while  basic  pay 
may  be  frozen  or  pegged  to  annual  inflation  rates, 
other  variable  parts  or  perks  may  be  kept  away 
from  public  scrutiny.  Second,  there  may  be  a  poten¬ 
tial  for  unethical  behavior  by  claiming  expenses 
that  are  unrelated  to  the  incumbency  of  a  public 
office,  as  the  mass  media  are  occasionally  revealing 
in  different  countries. 

Normative  criteria  for  the  level  of  transparency 
of  a  system  are  similar  to  the  three  different  visions 
of  the  cultural  theory.  For  egalitarians,  rewards 
should  have  maximum  visibility,  and  the  amount  of 
hidden  perks  should  be  kept  to  the  minimum. 
Functionalists  suggest  that  rewards  should  reflect 
the  activities  pertaining  to  the  position.  If  travel  or 
special  risks  are  associated  with  the  job,  these 
should  be  paid  separately  in  tax-free  allowances. 
There  would  then  be  a  certain  justification  for  less 
visible  rewards.  Finally,  a  market  approach  would 
link  reward  to  performance.  Workable  individual 
performance  pay  seems  to  be  better  adapted  to  non¬ 
transparent  organizations,  as  in  the  private  sector. 

Empirical  research  in  Western  societies  shows 
that  the  levels  of  transparency  seem  to  be  declining 
over  the  years.  Transparency  level  differs  for  dis¬ 
tinct  types  of  public  officials.  While  rewards  for 
parliamentarians  seem  to  have  become  more  trans¬ 
parent  with  time,  the  availability  of  data  for  civil 
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servants,  especially  senior  civil  servants,  seems  to 
be  decreasing.  Furthermore,  the  analysis  of  salaries 
in  Latin  American  and  Australasian  countries  does 
not  support  the  functional  theory  that  high  rewards 
for  public  officials  entail  lower  levels  of  corruption 
in  the  system.  Further  research  is  needed,  as  it  is 
difficult  to  find  meaningful  connections  between 
aggregate  figures  of  corruption  levels  and  the  gen¬ 
eral  levels  of  rewards.  This  hypothesis  should  be 
tested  for  specific  groups  of  public  officials  (local 
officials,  security  forces,  teachers,  etc.). 

Managerialism  and  decreasing  levels  of  trans¬ 
parency  might  have  caused  the  increase  in  civil 
servants’  rewards  compared  with  the  reward  levels 
of  parliamentarians.  There  seems  to  be  an  implicit 
trend  whereby  in  democracies,  the  rewards  for 
politicians  tend  to  be  more  representative  or  reflect 
the  ideals  of  representation,  with  few  possibilities 
of  being  rewarded  for  performance,  except  for  the 
intensive  use  of  per  diems  as  a  consequence  of 
being  appointed  to  different  commissions,  whose 
attendance,  away  from  their  constituency,  allows 
for  extra  perks.  Civil  servants,  especially  senior 
civil  servants,  have  found  different  functional  and 
homogeneous  ways  to  raise  their  salary  level.  In 
some  countries,  a  senior  civil  service  category  has 
been  created,  separate  from  the  rest  of  the  civil  ser¬ 
vants.  Further,  individual  performance  pay  has 
been  introduced,  although  its  success  has  been 
questioned,  and  the  creation  of  arm’s-length  agen¬ 
cies  has  fostered  higher  rewards  for  their  managers. 
Despite  a  general  increase  of  salaries  in  the  execu¬ 
tive,  salaries  of  private  sector  enterprises  have  not 
been  matched,  even  in  those  countries  where  such 
efforts  have  been  more  persistent. 

Research  on  public  sector  rewards  has  been 
fostered  in  the  past  20  years  in  political  science, 
but  further  analysis  is  required.  Comparative 
country  studies  should  give  way  to  other  compari¬ 
sons,  especially  among  public  officials  working  for 
agencies  of  different  types  (hospitals,  regulatory 
agencies,  independent  banks,  bureaucracies)  and 
from  different  policy  sectors  (safety,  education, 
health,  etc.).  The  differences  and  similarities  across 
countries  within  these  sectors  could  yield  further 
results  and  inputs  for  political  theories. 
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Public  Opinion 


Public  opinion  is  a  notion  that  is  frequently 
invoked,  not  only  in  everyday  parlance  but  also  in 
political  and  scholarly  discussions.  Yet  it  is  a  con¬ 
cept  that  is  extremely  ambiguous  and  difficult  to 
define — indeed,  it  seems  that  a  common  definition 
does  not  exist.  When  the  scholar  Harwood  Childs 
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attempted  to  reconstruct  the  principal  meanings  of 
public  opinion  in  the  1960s,  he  identified  more  than 
50  different  definitions.  Since  then,  the  frequency  of 
use  of  the  concept  has  grown  substantially,  together, 
probably,  with  the  number  of  meanings  attributed 
to  it.  This  diversity  of  meanings — or  polysemy — is 
implicit  in  the  denomination  of  the  concept,  which 
associates  two  distinct,  and  somewhat  conflicting, 
ideas.  “Opinion”  refers  to  the  individual  sphere,  to 
reputation,  evaluation,  and  judgments  of  value; 
hence,  it  is  open  to  question,  or  debatable.  “Public” 
refers  to  the  collective  dimension,  to  subjects  and 
places  accessible  to  all,  of  common  interest,  adopted 
by  recognized  and,  therefore,  “public”  authorities. 
And  here  arises  the  question  at  the  very  heart  of  the 
debate  that  has  always  surrounded  the  topic:  how 
to  render  compatible,  consistent,  and  coinciding 
aspects  that  are  both  individual  and  collective,  sin¬ 
gular  and  common,  subjective  and  objective.  The 
question  is  further  complicated  by  the  increasingly 
widespread  and  entrenched  tendency  to  consider 
public  opinion  as  a  unitary  concept — an  “objec¬ 
tive”  and  “natural”  basis  for  political  choices  and / 
or  institutional  decisions,  in  competition  with  the 
electoral  response — and  the  tendency  to  consider 
public  opinion  as  an  indication  of,  or  even  synony¬ 
mous  with,  the  “will  of  the  people.”  This  entry 
discusses  these  various  meanings  and  their  implica¬ 
tions  for  contemporary  democracies. 

Definitions  of  Public  Opinion 

Looking  beyond  the  meanings  attributed  to  public 
opinion,  it  is  important  to  note  that  the  concept  is 
linked  to  democracy  and  its  affirmation.  Democracy 
becomes  a  condition  of  public  opinion,  a  component 
and  a  method.  Democracy  affects  and  characterizes 
the  viewpoint,  the  language,  and  every  discussion 
surrounding  public  opinion.  Consequently,  its  defi¬ 
nition  in  some  measure  also  involves  democracy. 

Among  the  various  definitions  of  public  opinion 
that  have  developed  and  intersected  over  time,  three 
prove  particularly  significant,  in  that  they  allow  one 
to  delineate  the  main  interpretive  elements,  which 
are  substantially  distinct  and  competing. 

The  first  approach  takes  its  inspiration  from  the 
Enlightenment  and  first  and  foremost  identifies 
public  opinion  with  the  “public  sphere”  (the  the¬ 
ory  of  Jurgen  Habermas),  the  area  in  which  the 
“enlightened  classes”  exercise  criticism  of  and 


provide  a  check  against  the  action  of  power.  In 
other  words,  the  public  sphere  is  the  open  debate 
around  the  decisions  and  issues  of  public  interest. 
Public  opinion  accompanies  the  formation  of 
bourgeois  society  and  mediates  the  separation  of 
the  state  from  the  publicly  important  “private 
sphere”  of  “civil  society.”  It  promotes  the  advent 
of  the  “market,”  of  rational  thought,  and  of  “lib¬ 
erty”  and  is  a  necessary  condition  of  democratic 
systems  that  have  been  established  in  the  West 
alongside  the  great  economic  and  national  revolu¬ 
tions.  According  to  this  reading,  public  opinion  is 
not  the  sum  of  individual  opinions,  as  other  inter¬ 
pretive  currents  sustain;  rather,  it  is  a  context  of 
relationships  and  communication  that  develops  at 
the  collective  level  (as  several  authors,  from  the 
Chicago  School  to  Herbert  Blumer,  argue).  It  is  a 
“public”  discussion  about  emerging  problems  that 
involves  different  actors  and  arrives  at  producing  a 
common  shared  position. 

The  second  approach  considers  public  opinion 
as  an  effect  of  the  influence  exercised  by  actors  and 
agencies  that  control  the  channels  of  information 
and  the  formation  of  ideas.  Consequently,  the  pre¬ 
vailing  opinions  in  society,  the  images  of  reality 
that  people  express,  actually  would  not  correspond 
to  reality;  they  do  not  emerge  from  free  discussions 
and  from  debate  among  intellectuals  but  would 
instead  depend  on  the  pressure  exercised  by  the 
media  and  by  propaganda.  Public  Opinion  (which 
Walter  Lippmann  writes  in  capital  letters — and  not 
by  chance),  hence,  is  fueled  by  stereotypes  and 
prejudices.  It  evokes  a  sort  of  conformism:  the 
adaptation  by  the  masses  to  prevailing  thought.  To 
a  certain  extent,  this  interpretation  is  echoed  in  the 
model  of  the  spiral  of  silence  proposed  by  Elisabeth 
Noelle-Neumann,  based  on  the  analysis  of  electoral 
behavior  in  Germany  in  the  1960s  and  1970s.  The 
study  reveals  that  large  segments  of  the  electorate 
who  did  not  express  their  preferences  during  the 
electoral  campaign  in  the  end  aligned  themselves 
with  the  positions  of  the  candidates  and  parties 
favored  to  win  and  voted  accordingly.  In  other 
words,  their  silence  reflected  not  only  uncertainty 
and  indifference  but  also  the  fear  of  deviating  from 
what  was  considered  to  be  the  predominant  orien¬ 
tation.  Interviewed  by  the  polling  agencies,  they 
preferred  to  remain  silent  rather  than  contradict  the 
beliefs  expressed  by  the  majority  and  emphasized 
by  the  media  and  in  the  public  debate.  That  is,  to 
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use  the  language  of  Noelle-Neumann,  they  pre¬ 
ferred  to  conform  to  the  “climate  of  opinion”  that 
characterizes  society  during  that  time  period.  This 
explanation  evokes  the  idea  of  common  sense ,  this 
being  another  polysemic  term  possessing  very  deep 
historical  and  cultural  roots.  It  is  discussed  and 
thematized  by,  among  others,  Antonio  Gramsci, 
who  cites  The  Betrothed  by  Alessandro  Manzoni, 
observing  that  “there  was  even  someone  who 
didn’t  believe  the  plague-spreaders,  but  couldn’t 
support  his  opinion  against  widespread  popular 
opinion,”  because,  as  Manzoni  writes,  “good  sense 
was  there,  but  it  remained  hidden  in  fear  of  com¬ 
mon  sense”  (or  general — i.e.,  public — opinion) 
(Gramsci,  1975,  p.  216). 

For  some  authors,  however  (in  particular, 
Niklas  Luhmann),  public  opinion  constitutes  a 
functional  necessity,  put  into  action  by  the  politi¬ 
cal  (sub)system  to  reduce  social  complexity.  In  this 
case,  it  is  an  area  that  is  structured  around  relevant 
and  important  themes,  conditioned  and/or  imposed 
by  public  communication.  This  reasoning  appears 
exactly  symmetric,  and  in  many  ways  contrary,  to 
that  of  Habermas,  who  holds  that  public  opinion 
coincides  with  the  public  sphere  generated  from 
“below,”  by  civil  society  and  by  intellectuals. 
Instead,  this  reasoning  stresses  the  role  of  the  insti¬ 
tutions  of  power,  political  actors,  and  the  media, 
which  aim  to  influence  and  direct  public  opinion. 

The  third  interpretation  is  defined  by  some  as 
populist  and  by  others  as  methodological.  It  does 
attribute  great  importance  to  people’s  popular 
opinions  and  at  the  same  time  to  the  possibility  of 
studying  them  through  the  technologies  of  public 
opinion  research  (polling).  Public  opinion,  in  this 
case,  is  made  to  correspond  with  the  sum  or  collec¬ 
tion  of  individual  opinions  gathered  and  analyzed 
by  means  of  surveys  conducted  on  representative 
samples  of  the  population.  The  first  and  most 
influential  exponent  of  this  position  is  George 
Gallup,  who  began  conducting  his  public  opinion 
polls  in  the  1930s  in  the  United  States,  correctly 
predicting  the  outcome  of  the  1934  presidential 
elections.  Gallup  felt  that  polls  are  useful — and 
reliable — instruments  that  measure  the  will  of  citi¬ 
zens  and  are  in  some  ways  better  than  voting  for 
discovering  and  understanding  citizen  demands, 
because  surveys  are  able  to  articulate  issues  and 
demands,  taking  into  account  individual,  social, 
cultural,  and  ecological  characteristics.  The  only 


problem  is  to  construct  a  sample  that  is  truly  rep¬ 
resentative  of  the  population  and  to  perform  sur¬ 
veys  through  well-formulated  questionnaires  and 
well-conducted  interviews.  Public  opinion,  in  this 
way,  is  traced  back  to  the  empirical  sampling  of 
aggregated  individual  opinions,  whose  meaning  is 
summed  up  according  to  a  majoritarian  logic,  as  in 
elections  where  the  popular  will  is  deduced  from 
the  vote  of  a  fraction.  This  may  be  majoritarian 
but  is  not  necessarily  the  majority — and  is  often  far 
from  the  totality  of  the  population  and/or  voters. 

Normative  Orientations 

The  three  ideas  that  orient  the  conception  of  pub¬ 
lic  opinion  are  summarized  by  three  different 
normative  orientations  that  aspire  to  prevail  as 
descriptive  or  real.  They  reflect  three  different 
ways  of  understanding  the  relationship  between 
citizens  and  power,  between  society  and  institu¬ 
tions,  in  representative  democracies.  In  the  first 
case,  public  opinion  is  conceived  of  as  a  jury,  an 
instrument  by  which  the  informed  citizen  judges 
decisions  and  those  who  make  the  decisions.  In 
this  case,  public  opinion  is  a  space  that  renders 
“public”  the  exercise  of  authority,  opening  up  to 
all  those  who  so  wish  the  chance  to  supervise  and 
participate.  In  the  second  case,  public  opinion  is 
conceived  of  as  a  consensus,  the  method  by  which 
to  conform  the  orientation  of  citizens  to  the  values 
and  wishes  of  those  who  govern,  or  as  a  climate  of 
opinion,  or  common  sense — an  assemblage  of  ste¬ 
reotypes  and  prejudices  that  inhibit  critical  thought 
and  push  the  less  aware  and  informed  members  of 
society  to  conform  to  dominant  beliefs.  In  the 
third  case,  public  opinion  is  considered  truly  as 
such,  which  is  to  say,  as  a  fact,  a  given,  an  actual 
entity  that  coincides  with  society,  voters,  and  citi¬ 
zens,  without  any  distinctions.  And  it  sums  them 
up  according  to  their  individual  opinions  into  an 
undifferentiated  aggregate,  recognized  as  having 
the  value  of  “reality”  because  it  is  observed  and 
observable,  measured  and  measurable,  by  using 
polling  technologies. 

These  three  ways  of  understanding  public  opin¬ 
ion  continue  to  coexist  and  interact  over  time. 
Media  and  public  opinion  agencies  compete  to 
shape  public  opinion  from  above.  Influenced  or 
commissioned  by  the  centers  of  the  political  and 
institutional  system,  they  seek  to  conform  citizens’ 
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opinions  to  the  prevailing  values  and  to  strengthen 
consensus  with  the  view  of  the  elite. 

In  the  same  way  and  at  the  same  time,  public 
opinion  is  a  critical  space.  It  intervenes  via  groups, 
newspapers,  social  movements — and,  in  more 
recent  times,  through  the  web,  via  Internet  blogs 
and  social  networks — to  counter  and  affect  public 
choices.  In  addition,  public  opinion  also  takes  on 
an  active  role  itself  in  forming  individual  opinions 
to  shape  the  general  beliefs. 

Undoubtedly  the  public  opinion  polling  and 
populist  conception  of  public  opinion  is  that  which 
has  progressively  won  out  and  today  tends  to 
dominate  over  the  others,  thanks  in  part  to  the 
contribution  of  the  media — especially  newspapers 
and  television.  The  media  have  amplified  and 
legitimized  polls  and  have  presented  and  imposed 
them  as  “reality.”  In  turn,  the  media  and  other 
actors  that  participate  in  presenting  the  polls — the 
pollsters,  journalists,  political  commentators,  and 
politicians  themselves — have  gained  and  continue 
to  gain  legitimization  from  the  surveys,  which 
additionally  offer  the  chance  for  a  “show,”  while 
the  political  scene  itself  appears  more  and  more 
spectacular,  as  we  will  see  further  on. 

Public  Opinion  and  Democracy 

At  this  juncture,  it  is  useful  to  shift  our  reflection  to 
the  relationship  between  public  opinion  and  democ¬ 
racy.  As  we  have  already  mentioned,  this  is  a  close 
and  inextricable  relationship.  Public  opinion  gains 
importance  with  the  establishment  of  democracy. 
First  of  all,  it  is  considered  as  a  condition,  in  that  it 
is  an  area  of  control  and  also  of  public  discussion. 
It  is  also  an  arena  for  the  conquest  of  consensus,  a 
field  in  which  different  actors  compete  and  clash  as 
they  aim  to  impose  their  issues  and  dictate  the 
agenda  of  problems  to  tackle,  to  influence  the  pri¬ 
orities  and  preferences  that  characterize  the  vision 
of  life  and  society  among  voters  and  citizens. 
Ultimately,  public  opinion  becomes  a  functional 
equivalent  of  the  electorate,  a  synonym  for  the 
people,  to  be  systematically  polled  so  as  to  under¬ 
stand  fundamental  orientations  and  to  constantly 
verify  the  degree  of  consensus  enjoyed  by  politicians 
and  their  policies.  It  could  almost  be  considered  an 
election  that  takes  place  each  and  every  day. 

Therefore,  the  connection  with  democracy 
seems  to  be  strong — a  sort  of  symbiosis  that  in 


recent  decades  has  become  systematic  and  consti¬ 
tutive.  In  this  regard,  we  can  cite  Bernard  Manin’s 
analysis  of  the  principles  of  representative  govern¬ 
ment.  In  proposing  a  periodization,  Manin  indi¬ 
cates  how  the  “democracy  of  the  public” — or  of 
the  audience — has  been  established  in  the  most 
recent  phase. 

Wherever  communication  takes  the  place  of 
organization  and  participation,  people  assume 
greater  importance  than  political  parties — or  they 
become  the  main  reference  of  the  parties — where 
the  mechanism  of  the  internal  relationship  among 
the  different  levels  of  the  party  and  the  relationship 
between  the  state  and  society  is  specifically  and 
especially  entrusted  to  the  polls.  The  voters  become 
“the  audience”  as  they  react  to  political  proposi¬ 
tions  that  escape  their  control;  citizens  become 
public  opinion.  Manin  notes  that  public  debate 
remains  a  condition  of  representative  government. 
However,  in  reality,  public  opinion  is  ordinarily 
evoked  to  legitimize  and  measure  the  consensus — 
or  dissent — concerning  political  actors.  It  is  forced 
to  coincide  with  the  practically  inseparable  duo  of 
the  media  and  the  polls.  This  brings  about  the 
problem  of  not  taking  for  granted  what  actually 
cannot  be — of  not  accepting  the  idea  of  public 
opinion  that  is  dominant  among  such  limited  social 
components  as  the  only  possible  public  opinion. 

Conversely,  if  public  opinion  is  summed  up 
through  surveys  communicated  by  the  media,  then 
it  is  impossible  to  avoid  confronting  the  funda¬ 
mental  criticisms  expressed  in  this  regard.  First 
and  foremost  among  these  criticisms  is  that 
expressed  by  Pierre  Bourdieu  in  his  noted  1973 
essay,  with  the  absolutely  explicit  title  “L’opinion 
publique  n’existe  pas”  [Public  opinion  does  not 
exist] .  The  target  of  his  unequivocal  critique  of  the 
concept  of  public  opinion  is  the  presumption  of 
forming  it  on  the  basis  of  surveys,  which  are 
imperfect  tools  for  numerous  reasons.  However, 
his  strongest  criticism  was  of  the  claim,  considered 
epistemologically  untenable,  of  producing  a  “gen¬ 
eral”  vision  based  on  “individual”  opinions:  indi¬ 
vidual  reactions  to  questions  posed  by  strangers 
(the  interviewers),  during  interviews  dealing  with 
mostly  “unfamiliar”  subject  matters,  posed  to 
people  who  are  for  the  most  part  not  very  inter¬ 
ested  or  who  may  be  completely  extraneous  to 
politics  and  the  issues  of  the  public  debate.  For  this 
very  same  reason,  it  is  considered  improper  to  use 
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an  aggregate  of  opinions  to  replace  something  that 
should  instead  emerge  from  public  debate  and  the 
initiatives  of  social  actors. 

It  is  considered  arbitrary — or  even  worse, 
deviant — to  treat  the  responses  of  interviewees  as 
equivalent,  without  distinctions  that  take  into 
account  the  differing  degrees  of  involvement  or 
expertise  of  the  people  surveyed.  It  is  as  if  everyone 
were  an  expert  in  all  matters;  as  if  there  were  no 
people  who  possessed  a  greater  degree  of  expertise; 
and,  most  important,  as  if  some  people  were  not 
more  involved  and  influential — over  the  “opinions” 
of  others — on  certain  specific  topics. 

Polling  as  a  Basis  for  the 
Concept  of  Public  Opinion 

Limitations  of  Polling 

We  can  better  articulate  the  limits  imposed  by  a 
concept  of  public  opinion — and  by  a  democracy — 
based  on  polls  by  referring  to  a  number  of  techni¬ 
cal,  methodological,  and  substantive  problems. 
First,  there  is  the  question  of  the  limits  of  represen- 
tativity  of  the  survey  samples,  which  increasingly 
include  fewer  numbers,  are  stratified  by  quotas, 
and  are  distorted  by  the  process  of  acquiring  the 
information  where  the  share  of  people  who  refuse 
to  be  interviewed  is  growing  larger  and  larger.  The 
techniques  of  weighting  the  data,  during  the  data 
analysis  phase,  should  serve  to  adjust  the  sample  to 
the  “real”  makeup  of  society.  However,  these  tech¬ 
niques  tend  to  distort  the  representativity  yet 
again,  to  the  point  of  transforming  the  conclusive 
data  into  artificial  products  of  sophisticated  math¬ 
ematical  models. 

The  second  problem  to  contend  with  is  the 
methods  and  instruments  of  the  interview.  Surveys 
are  now  conducted  mainly  by  telephone,  to  reduce 
time  and  costs.  Thus,  the  interviews  are  conducted 
in  an  impersonal  way,  often  by  specialized  market¬ 
ing  centers  with  little  expertise  in  the  subjects  being 
considered.  Furthermore,  the  home  telephone,  or 
landline,  is  becoming  less  and  less  popular,  and  so 
other  means  are  used  to  conduct  surveys — for 
example,  by  Internet,  through  the  use  of  e-mail  and 
social  networks,  or  via  mobile  telephones.  These 
new  means  create  new  problems  that  are  difficult 
to  resolve.  Surveys  conducted  using  the  Internet  are 
still  not  able  to  reach  “representative”  samples  of 
the  population,  for  the  simple  reason  that  the 


population  segments  that  use  the  Internet  are  still 
quite  specific — despite  the  fact  that  they  are  large 
and  continue  to  grow.  Certain  segments  of  the 
population,  in  particular  the  elderly  and  those  with 
less  education,  are  largely  excluded.  Telematic 
interviews  pose  problems  with  interpretation  (they 
are  not  conducted  directly,  nor  do  they  take  place 
in  a  shared  conversation).  And  last,  with  regard  to 
mobile  telephones,  complete  lists  of  numbers  are 
difficult  to  obtain  (given  the  numerous  providers) 
and  even  more  difficult  to  use  (as  access  to  the 
numbers  is  rightfully  limited  by  strict  privacy 
laws).  Furthermore,  interviewing  people  via  mobile 
telephone  brings  with  it  certain  inconveniences 
(the  locations  and  conditions  in  which  the  inter¬ 
view  takes  place  are  unpredictable).  And  in  any 
case,  the  responses  are  obviously  conditioned  by 
the  questions  asked,  by  the  style  of  the  interview, 
and  by  the  length  of  time  required  of  the  interview¬ 
ees.  To  sum  up,  it  is  very  difficult  to  consider  the 
results  of  surveys  as  “faithful”  and  representative 
of  the  population’s  beliefs. 

The  third  problem  raised  is  that  the  methods  of 
data  elaboration  and  analysis  naturally  and  inevi¬ 
tably  influence  the  results.  The  methods  of  weight¬ 
ing  the  data,  as  previously  mentioned,  increase  the 
discretion  of  the  researcher  and  the  statistician. 
Furthermore,  adequate  attention  is  not  always 
given  to  the  absence  of  responses  or  the  reticent 
responses  that  often  affect  the  “valid”  interviews 
to  a  great  extent.  Finally,  in  reading  the  results, 
there  is  the  tendency  to  present  the  distribution  of 
the  responses  according  to  a  majoritarian  logic, 
which  is  to  say  that  the  beliefs  expressed  by  the 
majoritarian  segments  are  emphasized,  though 
they  may  not  be  those  of  the  majority  of  the  popu¬ 
lation.  For  example,  an  issue  favored  by  one  third 
of  a  sample,  when  other  issues  are  chosen  by  lesser 
shares  of  interviewees,  risks  being  amplified  by 
media  communications  as  the  will  of  the  people. 

Methods  for  Addressing  Technical  Problems 

To  limit  these  technical  problems  that  invalidate 
the  claims  of  representativity  and  significance  of 
polling,  certain  methods  of  weighting  have  been 
proposed,  as  well  as  specific  and  alternative  means 
of  conducting  the  polls. 

To  begin  with,  there  is  the  practice  of  introduc¬ 
ing  some  questions  at  the  end  of  the  interview  or 
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after  a  few  questions,  in  order  to  determine  the 
expertise  or,  at  least,  the  significance  interviewees 
attributed  to  the  issues  and  questions  posed.  It  was 
James  Fishkin  who  elaborated  a  different  method 
of  conceiving  and  conducting  surveys,  known  as 
deliberative  polling.  It  entails  that  surveys  are  con¬ 
structed  in  such  a  way  that  the  “sample”  of  those 
interviewed  can  actually  be  informed  about  the 
subjects  of  the  research  and  takes  into  consider¬ 
ation  the  different  theories  and  interpretations  that 
relate  to  it.  The  polling  is  carried  out  in  such  a  way 
that  those  being  interviewed  can  dialogue  and  dis¬ 
cuss  among  themselves,  which  permits  the  “raw” 
opinion  to  be  registered,  as  well  as  the  “refined” 
opinion  (following  the  process  of  information  and 
discussion).  This  method  seeks  to  reconstruct  the 
foundations  of  public  opinion  at  its  origins,  recog¬ 
nized  by  Habermas  as  the  bourgeois  public  sphere 
and  by  others  as  the  Athenian  Polis  in  the  age  of 
Pericles. 

However,  the  correctives  proposed  have  only 
been  employed  conservatively  and  episodically.  In 
the  case  of  the  techniques  of  self-verification,  this 
has  been  true  more  out  of  carelessness  and  meth¬ 
odological  negligence  than  for  problems  related  to 
time  and  cost.  Inserting  qualitative  questions  to 
clarify  the  significance  and  the  motivations  of  the 
quantitative  responses  would  complicate  polling. 
This  is  even  more  so  the  case  following  the  wide¬ 
spread  use  of  computerized  techniques  in  telephone 
interviews.  As  for  deliberative  polling,  it  has  been 
carried  out  in  close  relationship  with  experiences 
of  deliberative  democracy,  therefore,  where  discus¬ 
sion  with  citizens  is  valued  in  decision  making, 
especially  at  the  local  level.  But  the  same  critical 
principle  that  inspired  this  design  at  the  same  time 
constitutes  a  limit  inasmuch  as  deliberative  polling 
is  a  means  of  forming  opinions,  in  a  conscious 
way.  Those  who  conduct  and  commission  surveys, 
however,  are  interested  in  finding  out  everyone’s 
opinions  rapidly — especially  the  opinions  of  the 
less  well-informed. 

All  in  all,  polls  are  imperfect  methods  of  learn¬ 
ing  individual  opinions,  which  assume  greater 
value  and  general  importance.  Polls  serve  and  are 
used  to  “objectify  subjective  opinions.”  For  this 
reason,  polls  are  important  in  democratic  play. 
They  are  important  not  so  much — or  not  only — 
because  they  define  public  opinion,  but  because 
they  help  (a)  orient  individual  opinions,  (b)  impose 


a  version  of  reality,  and  above  all  (c)  establish  the 
myth  of  a  public  opinion  that  actually  exists,  is 
scientifically  based,  and  is  therefore  measurable. 
They  assume  and  impose  a  substantial  equivalence 
between  opinions  and  people,  between  opinions 
and  voters,  and  between  opinions  and  votes. 

Polling,  Media,  and  the  Market 

However,  polls  and  the  surrounding  media, 
unlike  traditional  methods  of  propaganda  entrusted 
by  mass  parties  to  functionaries  and  internal  spe¬ 
cialists  or  to  those  with  political  affinities 
(militants  and  sympathizers),  are  conducted  by 
professionals  who  operate  in  the  market  and  who 
consequently  respect,  and  are  subject  to,  the  rules 
of  the  market.  This  means  that  they  are  sensitive  to 
the  needs  of  their  employer.  Polling  agencies  and 
the  media  are  businesses  that  are  often  under  the 
control  of  economic  groups  or  influenced  by  polit¬ 
ical  entities  of  the  government  (and  others). 

Thus,  public  opinion  based  on  the  media  and 
their  polls  is  very  different  from  the  public  opinion 
to  which  social  and  political  actors  contribute,  in 
that  it  does  not  constitute  an  arena,  or  the  result  of 
the  potentially  conflictive  interaction  between  dif¬ 
ferent  and  distinct  parties.  Rather,  it  aims  to  coin¬ 
cide  with  reality,  to  represent  and  measure  the  real 
situation,  because  it  is  produced  by  technologies 
and  technicians  (who  in  turn,  however,  are  the 
focus  of  pressures  and  involved  in  political  and 
economic  relationships). 

Consequently,  the  field  of  public  opinion  is 
complex  and  cannot  be  traced  back  to  a  single 
subject.  The  interests  of  political  and  economic 
actors  in  particular  are  in  competition  and  often 
in  contrast  with  those  of  the  media,  cultural  and 
scientific  actors,  polling  professionals,  political 
scientists,  and  journalists — who,  in  turn,  derive 
economic  compensation,  prestige,  and  power 
from  their  roles  as  “experts”  and  witnesses  of 
public  opinion.  At  times,  it  is  difficult  even  to 
determine  the  different  interested  parties  and  their 
roles:  The  politicians  need — and  at  the  same  time 
fear — the  polls  and  the  media;  just  as  the  media 
and  polling  professionals  need  the  politicians  to 
be  able  to  exercise  their  influence  in  order  to 
reach  their  audience  and  achieve  legitimacy.  In 
this  way,  they  themselves  also  become  political 
actors. 
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Public  Opinion  as  a  Social 
and/or  Political  Construct 

This  suggests  a  further,  and  different,  phenomeno¬ 
logical  interpretation,  which  holds  public  opinion  as 
a  social  and/or  political  construct,  whose  purpose  is 
not  so  much  to  guide  the  opinions  of  citizens  but 
rather  to  impose  a  representation  of  public  opinion 
that  is  recognized  as  true.  This  happens  by  fixing 
fundamental  references  around  which  it  is  structured: 
personal  or  social  uncertainty,  fear  of  others,  and  the 
vulnerability  of  the  economy  or  employment.  Thus, 
the  political  contest  becomes  a  contest  between  dif¬ 
fering  and  competing  models  of  public  opinion, 
which  are  proposed  and  even  more  so  imposed  as  the 
only  ones  possible.  These  ways  of  creating  a  vision  of 
reality,  taken  for  granted  as  such,  in  turn  influence 
relationships  with  power,  relationships  with  institu¬ 
tions,  and  political  and  electoral  choices. 

The  game  played  around  public  opinion  is  thus 
complex,  and  the  number  of  players  participating 
in  the  game  contributes  to  the  complexity.  It  sug¬ 
gests  that  in  the  era  of  democracy  of  the  public,  the 
political  struggle  is  primarily  a  symbolic  struggle, 
in  which  the  fight  is  to  prove  a  specific  vision  (or 
“narration”)  of  reality,  inspired  by  specific  values 
and  by  specific  images.  The  weapons  employed  are 
marketing  and  communication  techniques,  which 
use  the  media  and  polling  to  erase  the  distance 
between  everyday  life  and  public  life. 

It  suggests,  finally,  that  the  issue  at  stake  is  not 
only  public  opinion  but  also  the  very  meaning  of 
democracy.  On  the  other  hand,  democracy  has 
become  personalized,  and  communication  with 
citizens  happens  more  and  more  through  the 
media,  which  transforms  them  into  an  audience, 
into  spectators.  The  understanding  of  social 
demands  and  orientation  happens  not  so  much 
through  participation  but  through  polls,  which 
convert  people  into  opinions.  Therefore,  it  becomes 
difficult  to  isolate  the  discussion — and  criticism — 
around  the  prevailing  definition  of  public  opinion, 


without  involving  democracy  itself,  whose  prevail¬ 
ing  “version,”  in  this  phase,  uses  public  opinion  as 

a  base,  a  constitutive  principle — as  a  “fact.” 
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Qualitative  Comparative 
Analysis 


Qualitative  comparative  analysis  (QCA)  was  first 
introduced  to  the  social  sciences  by  Charles  Ragin 
to  identify  complex  patterns  of  causation  by  pro¬ 
cessing  social  science  data  with  Boolean  algo¬ 
rithms.  By  conducting  a  formalized  comparison, 
QCA  introduces  the  logic  of  qualitative  research  to 
the  study  of  a  larger  number  of  cases  than  are  usu¬ 
ally  examined  with  qualitative  methods.  QCA  usu¬ 
ally  is  employed  with  a  middle-sized  number  of 
cases  (10-30). 

The  approach  taken  by  QCA  has  been  described 
as  a  middle  course  between  case-oriented  and 
variable-oriented  methods.  In  his  initial  presenta¬ 
tion,  Ragin  (1987)  argues  that  variables  should  not 
be  examined  detached  from  the  cases,  as  this  is 
done  in  purely  variable-oriented  approaches.  Based 
on  the  assumption  that  a  causal  explanation  of 
social  phenomena  always  must  include  consider¬ 
ations  about  contextual  factors  as  well  as  the  pos¬ 
sibility  of  conjunctural  combinations  of  relevant 
conditions,  QCA  does  not  process  individual  vari¬ 
ables  but  includes  cases  as  configurations  in  the 
analysis  to  arrive  at  causal  explanations.  Described 
here  is  how  QCA  seeks  to  identify  the  necessary 
and  sufficient  conditions  of  a  specific  phenomenon 
(“outcome”  in  QCA  terminology)  as  well  as  to 
achieve  the  most  parsimonious  explanation  for  all 
instances  of  the  phenomenon. 

Since  the  publication  of  The  Comparative 
Method  (Ragin,  1987),  a  series  of  techniques 


based  on  the  ideas  of  QCA  have  been  developed, 
which  can  be  subsumed  as  “configurational” 
comparative  methods.  This  entry  first  focuses  on 
the  ideas  of  the  original  crisp-set  qualitative  com¬ 
parative  analysis  (csQCA)  technique  based  on 
binary  data  and  then  briefly  introduces  recent 
developments. 

Crisp-Set  QCA 

Originally,  QCA  performed  calculations  on  Bool¬ 
ean,  that  is  dichotomized,  data  only.  In  terms  of  set 
theory,  this  means  that  QCA  deals  with  “crisp 
sets”  as  opposed  to  “fuzzy  sets.”  All  data  included 
in  a  csQCA  analysis  must  therefore  be  coded 
dichotomously,  which  means  that  it  has  to  be  con¬ 
sidered  for  each  case  whether  the  case  fulfills  the 
condition  or  not.  The  outcome  has  to  be  coded 
dichotomously  as  well.  By  convention,  in  csQCA, 
an  uppercase  notation  indicates  the  presence  of  a 
condition,  whereas  a  lowercase  notation  signifies 
the  absence  of  this  condition.  By  using  Boolean 
algebra,  the  configuration  for  a  case  can  then  be 
written  by  concatenating  the  values  for  all  the  con¬ 
ditions  included  in  the  analysis.  For  example,  in  a 
study  concerning  the  conditions  of  development  of 
new  ecological  parties  in  Western  democracies  in 
the  1980s,  it  is  theoretically  feasible  that  the  suc¬ 
cess  of  these  parties  depended  on  conditions  such 
as  the  electoral  system  (proportional  representa¬ 
tion  or  not),  a  high  level  of  socioeconomic  devel¬ 
opment  (measured  by  gross  national  product 
[GNPJ),  the  construction  of  nuclear  power  plants, 
and  the  density  of  the  population.  In  the  case  of 
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Norway,  for  example,  a  high  level  of  socio¬ 
economic  development  can  be  observed,  as  well  as 
a  system  of  proportional  representation;  these  two 
conditions  are  met,  but  there  are  no  nuclear  power 
plants,  and  the  density  of  the  population  is  low. 
Thus,  for  Norway,  the  configuration  can  be  writ¬ 
ten  as  VOTING  *  GNP  *  nuclear  *  density 
(*  indicating  the  Boolean  AND). 

In  the  first  step,  all  cases  with  the  same  configura¬ 
tion  are  grouped  together.  For  each  configuration, 
the  outcomes  of  the  cases  included  are  compared. 
If  all  cases  show  the  same  outcome,  the  common 
value  (positive  or  negative)  is  assigned;  if  not,  the 
configuration  is  marked  as  contradictory.  In  the 
second  step,  for  all  configurations  with  the  same 
outcome,  the  minimization  rule  is  used  to  identify 
conditions  that  are  not  relevant  for  the  outcome 
to  occur:  “If  two  Boolean  expressions  differ  in 
only  one  condition  yet  produce  the  same  outcome, 
then  the  causal  condition  that  distinguishes  the 
two  expressions  can  be  considered  irrelevant  and 
can  be  removed  to  create  a  simpler,  combined 
expression”  (Ragin,  1987,  p.  93).  So  if,  for  exam¬ 
ple,  all  cases  with  the  configuration  VOTING  * 
GNP  *  nuclear  *  density  and  voting  *  GNP  * 
nuclear  *  density  show  the  same  outcome,  the 
condition  VOTING  can  be  considered  to  be 
superfluous.  The  minimization  rule  is  applied 
iteratively  to  the  identified  simpler  expression, 
minimizing  the  included  logical  expression  as  far 
as  possible.  This  produces  a  number  of  “prime 
implicants,”  each  explaining  only  cases  with  the 
outcome  examined. 

To  exemplify  the  use  of  the  minimization  rule, 
a  data  set  with  three  conditions  a,  b,  and  c  and  an 
outcome  o  is  used  here.  Each  possible  configura¬ 
tion  is  found  in  exactly  one  case,  producing  the 
truth  table  (Table  1). 

To  obtain  the  prime  implicants  for  cases  with 
Outcome  1,  the  minimization  rule  is  applied  to  all 
pairs  of  configurations  with  this  outcome.  Each 
configuration  in  Table  1  with  Outcome  1  is  repre¬ 
sented  in  the  left  column  of  Figure  1.  For  each  pair 
of  entries  in  this  column,  the  minimization  rule  is 
applied,  and  if  the  two  expressions  differ  in  the 
value  of  only  one  condition,  the  minimized  expres¬ 
sion  resulting  from  the  minimization  step  is  listed 
in  the  middle  column  (the  minimization  steps  are 
indicated  by  the  arrows).  This  procedure  is  itera¬ 
tively  repeated  for  the  shorter  expression  that  is 


Table  1 

Exemplary  Data  Set 

Case 

A 

B 

c 

o 

Cl 

0 

0 

0 

0 

C2 

0 

0 

1 

0 

C3 

0 

1 

0 

1 

C4 

0 

1 

1 

1 

C5 

1 

0 

0 

0 

C  6 

1 

0 

1 

1 

C7 

1 

1 

0 

1 

C8 

1 

1 

1 

1 

produced.  B  is  the  only  prime  implicant  consisting 
of  one  condition.  Prime  implicants  are  finally  iden¬ 
tified  in  Figure  1  as  implicants  that  have  not  them¬ 
selves  been  included  in  a  minimization  step  (as  aB 
and  AB  have  generated  B,  they  are  not  prime 
implicants).  Here,  AC  and  B  are  the  prime  impli- 
cants  for  Outcome  1. 

When  only  configurations  with  empirically 
existing  cases  are  included  in  the  minimization,  the 
limited  diversity  found  in  comparative  research 
can  pose  a  problem.  A  large  number  of  logically 
possible  configurations  may  exist,  which  are  not 
included  for  either  outcome.  To  allow  more  parsi¬ 
monious  results,  csQCA  can  include  such  logical 
remainders  in  the  minimization  process,  which 
makes  further  minimization  possible.  Such  logical 
remainders  must,  however,  be  included  with  cau¬ 
tion,  as  doing  so  implies  the  assumption  that  this 
configuration  does  not  contradict  the  examined 


Figure  I  Minimization  Steps 


Qualitative  Comparative  Analysis  2175 


outcome;  see  Charles  Ragin  and  Benoit  Rihoux 
(2004)  for  possible  strategies  on  how  to  deal  with 
this  problem. 

In  the  next  step,  csQCA  arrives  at  a  solution 
covering  all  configurations  with  their  respective 
outcomes  by  combining  prime  implicants  with  a 
logical  expression  that  is  as  parsimonious  as  pos¬ 
sible.  This  provides  a  short  logical  explanation  for 
all  cases  found  in  configurations  with  an  unam¬ 
biguous  outcome  but  does  not  include  cases  with 
contradictory  configurations. 

In  the  example,  the  solution  for  Outcome  1 
would  include  both  prime  implicants  and  would  be 
written  as  B  +  A  *  C  —>  O,  where  +  indicates  the 
Boolean  OR  and  *  indicates  the  Boolean  AND  in 
Boolean  algebra.  The  solution  is  read  as  “the  pres¬ 
ence  of  B  or  the  presence  of  A  combined  with  the 
presence  of  C  leads  to  the  outcome  O.” 

Necessary  and  Sufficient  Conditions 

One  key  feature  of  all  QCA  procedures  is  the  iden¬ 
tification  of  necessary  and  sufficient  conditions.  A 
condition  is  considered  necessary  for  an  outcome 
if  the  condition  is  present  in  all  configurations 
showing  the  outcome  and  does  not  occur  in  any 
configuration  without  this  outcome.  A  condition  is 
sufficient  for  an  outcome  to  occur  if  the  outcome 
occurs  in  all  configurations  with  this  condition. 
Nonetheless,  other  conditions  may  produce  the 
outcome,  too,  so  a  sufficient  condition  does  not 
have  to  occur  in  all  cases  with  the  same  outcome. 
The  analysis  of  the  conditions  of  success  of  green 
parties  in  the  1980s,  for  example,  produces  the 
csQCA  solution: 

SUCCESS  =  VOTING  *  NUCLEAR  *  GNP 
+  VOTING  *  DENSITY. 

This  means  that  the  presence  of  proportional  rep¬ 
resentation  combined  with  the  construction  of 
nuclear  power  plants  and  a  high  GNP  are  condi¬ 
tions  that  did  lead  to  the  presence  of  a  successful 
green  party,  as  did  the  presence  of  proportional 
representation  combined  with  the  presence  of  high 
population  density.  Here,  the  presence  of  propor¬ 
tional  representation  can  be  identified  as  a  neces¬ 
sary  condition,  as  all  configurations  of  cases  with 
successful  green  parties  share  this  condition.  In  the 
solution  for  countries  with  no  successful  green 


parties,  a  number  of  sufficient  conditions  for  the 
failure  can  also  be  identified: 

NO  SUCCESS  =  voting  +  gnp 

+  density  *  nuclear. 

Here,  the  absence  of  proportional  representation 
or  the  absence  of  a  high  GNP  or  the  absence  of 
high  population  density  combined  with  the  absence 
of  nuclear  power  plants  did  lead  to  the  absence  of 
green  parties.  Thus,  the  absence  of  proportional 
representation  as  well  as  the  absence  of  a  high 
GNP  was  sufficient  to  explain  the  absence  of  suc¬ 
cessful  green  parties;  see  Lasse  Cronqvist  (2007) 
for  details. 

Although  sufficient  and  necessary  conditions 
can  be  identified  with  csQCA,  many  conditions 
found  in  csQCA  solutions  are  INUS  conditions, 
which  are  insufficient  but  necessary  parts  of  a  con¬ 
dition,  only  contributing  to  the  explanation  of  an 
outcome  together  with  other  (INUS)  conditions. 

Within  groups  of  cases  with  the  same  configura¬ 
tion,  due  to  the  deterministic  procedure  used  to 
create  truth  tables,  csQCA  is  sensitive  to  single 
cases  that  conflict  with  the  outcome  of  the  other 
cases  within  the  group.  As  a  configuration  is 
defined  as  contradictory  even  if  only  one  case  dif¬ 
fers  from  the  outcome  of  the  other  cases,  larger 
data  sets  may  contain  a  greater  number  of  contra¬ 
dictions,  though  only  a  small  proportion  of  cases 
may  show  a  different  outcome.  Here,  the  minimi¬ 
zation  process  still  produces  a  solution,  but  it 
includes  only  relatively  few  cases.  These  contradic¬ 
tions,  therefore,  should  be  reduced  by  adding  more 
conditions  or  employing  more  refined  procedures 
such  as  multivalue  qualitative  comparative  analy¬ 
sis  (mvQCA)  or  fuzzy-set  qualitative  comparative 
analysis  (fsQCA). 

Further  Developments 

Another  problem  related  to  the  use  of  csQCA  is 
the  compulsory  dichotomization  of  data.  A  condi¬ 
tion  with  interval-scale  raw  data  can  only  be 
dichotomized  by  the  use  of  an  often  artificial 
threshold.  Two  approaches  have  been  developed 
that  share  the  main  ideas  of  csQCA  but  allow  the 
use  of  nondichotomous  data:  Whereas  fsQCA 
seeks  to  identify  necessary  and  sufficient  condi¬ 
tions  by  applying  measurements  of  coverage  and 
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consistency  to  the  relationships  between  interval- 
scale  conditions  and  also  allowing  for  the  analysis 
of  larger  data  sets,  mvQCA  changes  the  minimiza¬ 
tion  procedure  used  for  csQCA  to  allow  the  use  of 
multivalue  conditions  but  retains  the  main  features 
of  csQCA. 

Lasse  Cronqvist 
Universitat  Trier 
Trier,  Germany 
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Basic  Assumptions 


The  use  of  quantitative  methods  in  political  sci¬ 
ence  generally  means  the  application  of  a  statisti¬ 
cal  model  to  political  science  data,  and  a  statistical 
model  is  simply  a  set  of  compatible  probabilistic 


assumptions.  Fundamentally,  assumptions  are 
modeling  choices  made  by  a  researcher  concerning 
the  distribution  of  the  data  to  be  modeled,  how 
the  parameters  of  that  distribution  change  over 
observations  or  time,  and  the  dependence  of  one 
observation  on  another.  The  assumptions  serve 
the  dual  purpose  of  reducing  the  number  of 
parameters  in  the  model  that  must  be  estimated 
and  imbuing  potential  estimators  with  certain 
properties.  The  goals  of  the  modeling  process  are 
description  and  inference,  and  how  well  a  model 
accomplishes  these  goals  is  a  direct  function  of 
how  appropriate  its  assumptions  are  for  a  particu¬ 
lar  data  set. 

The  structure  of  the  entry  proceeds  as  follows. 
First,  the  relationship  between  assumptions,  mod¬ 
els,  and  estimators  is  discussed.  Second,  the 
assumptions  of  the  linear  regression  model  are 
discussed  in  detail  as  more  complex  models  are 
often  defined  as  departures  from  this  set  of 
assumptions.  Particular  attention  is  paid  to  the 
assumptions  necessary  for  attaining  correct  coef¬ 
ficient  estimates  and  standard  errors.  Threats  to 
these  core  assumptions  are  assessed  in  terms  of 
their  effects,  both  in  theory  and  in  practice.  A 
brief  discussion  of  possible  remedies  follows.  The 
final  sections  discuss  the  degree  to  which  the 
assumptions  of  the  linear  regression  model  are 
modified  for  use  in  the  generalized  linear  model 
(GLM)  and  how  the  assumption  of  a  random 
sample  is  dealt  with  in  a  discipline  where  random 
samples  are  rare. 

Assumptions,  Models,  and  Estimators 

A  statistical  model  is  a  mathematical  representa¬ 
tion  of  the  actual  process  in  the  world  that  gen¬ 
erated  the  data  (known  as  a  data-generating 
process,  or  DGP).  The  point  of  creating  a  statisti¬ 
cal  model  is  both  to  describe  the  data  produced 
by  the  DGP  and  to  make  inferences  about  fea¬ 
tures  of  the  DGP  that  are  unknown.  As  noted 
above,  the  model  itself  consists  of  a  set  of 
assumptions.  This  account  makes  it  clear  that 
assumptions  are  characteristics  of  models  and 
not  characteristics  of  data,  and  the  question  to 
ask  of  a  model  is  not  whether  it  is  true  or  false 
but  how  descriptively  useful  is  it.  Models  are 
more  or  less  useful  in  describing  a  given  data  set, 
and  when  an  assumption  fails  to  be  useful  in  the 
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process  of  description,  the  error  lies  with  the 
model,  not  with  the  data. 

Once  a  model  has  been  selected,  then  an  estima¬ 
tor  (a  function  of  the  sample  data  that  provides  an 
estimated  value  for  an  unknown  parameter)  is 
chosen.  Many  common  models  can  be  estimated 
by  any  number  of  estimators,  and  the  choice 
among  estimators  is  driven  by  the  model’s  assump¬ 
tions,  which  imbue  the  estimators  with  various 
properties.  An  estimator  with  good  (or  better) 
properties  under  the  assumptions  of  the  model  is 
chosen  over  an  estimator  with  bad  (or  worse) 
properties  under  the  assumptions  of  the  model. 
The  reference  to  “good  estimators”  and  not  to 
“good  estimates”  is  by  design.  In  frequentist  statis¬ 
tics,  a  good  estimate,  by  definition,  is  one  produced 
by  a  good  estimator.  Properties  of  estimators  that 
are  considered  good  include  unbiasedness  (the  esti¬ 
mator  is  on  average  neither  high  nor  low),  effi¬ 
ciency  (the  estimator  has  a  small  variance  around 
the  true  value),  and  consistency  (the  estimator  is 
near  the  true  value  almost  all  the  time  when  the 
sample  size  is  large).  Often  these  properties  are 
assessed  in  the  aggregate,  as  a  slightly  biased  esti¬ 
mator  with  a  small  variance  may  be  preferred  over 
an  unbiased  estimator  with  a  larger  variance. 

The  practice  and  art  of  data  analysis  lie  in  under¬ 
standing  which  assumptions,  and  therefore  which 
models  and  estimators,  are  appropriate  for  a  given 
data  set.  This  understanding  comes  from  three 
sources:  (1)  the  data  themselves,  (2)  theory,  and 
(3)  substantive  knowledge.  Some  assumptions  can 
be  tested,  but  such  tests  are  often  inconclusive, 
work  only  under  specific  conditions,  and  rarely 
provide  anything  more  than  vague  recommenda¬ 
tions.  Information  drawn  from  theory  and  substan¬ 
tive  knowledge  of  the  process  being  modeled  are  far 
more  reliable  guides.  The  discussion  that  follows 
necessarily  abstracts  from  the  particulars  of  a  spe¬ 
cific  data  set,  but  the  importance  of  using  informa¬ 
tion  from  outside  the  data  should  be  kept  in  mind. 

Assumptions  and  the  Linear 
Regression  Model 

Most  discussions  of  model  assumptions  begin  with 
the  linear  regression  model.  Although  linear  regres¬ 
sion  is  no  longer  the  workhorse  of  political  science — 
pride  of  place  goes  to  the  GLM — the  linear  regression 
model  serves  as  a  good  starting  point  as  the  various 


roles  that  the  assumptions  play  in  the  model  are 
clear  and  some  simple  results  can  be  established. 

The  discussion  of  the  linear  model  centers  on 
the  equation 

y  =  Xf3  +  e, 

where  y  is  a  n  X  1  vector  of  observations  on  the 
dependent  variable,  X  is  an  n  X  K  matrix  of 
regressors  (including  a  constant),  (3  is  a  K  X  1  vec¬ 
tor  of  coefficients,  and  e  is  an  n  X  1  vector  of  dis¬ 
turbances.  The  data  represent  a  random  sample 
from  a  population  of  interest. 

The  discussion  also  centers  on  the  most  com¬ 
mon  estimator  of  the  linear  regression  model,  the 
least  squares  estimator.  Replacing  the  vector  of 
unknown  coefficients  (3  with  an  estimate  (3  defines 
a  vector  of  residuals 

e  =  y  -  X|3. 

The  least  squares  estimator  is  found  by  choosing 
(3  to  minimize  the  sum  of  squared  residuals,  or  e'e_ 
The  first  step  in  this  process  is  to  take  the  derivative 
of  the  sum  of  squared  residuals  with  respect  to  (3: 

ds's  _  d(y  ~  X|3)'(y  -  Xp) 
lip  5(3 

_  d(y'y  ~  2(3'X'y'  +  P'X'Xp) 

”  3(3 

-  2X'X(3  -  2X'y. 

The  second  and  final  step  is  to  set  the  derivative 
equal  to  0  and  solve  for  (3: 

(3  =  (X'X)-1X'y. 

On  its  own,  the  result  of  the  least  squares  pro¬ 
cedure  has  no  properties;  it  is  simply  a  method  for 
fitting  a  line  to  data.  The  least  squares  method 
produces  an  estimator  when  paired  with  a  statisti¬ 
cal  model  that  makes  certain  assumptions.  Five 
assumptions  commonly  make  up  the  linear  regres¬ 
sion  model: 

1.  No  exact  linear  relationships  exist  among  the 
regressors  (X  has  full  column  rank). 

2.  X  is  nonstochastic  (the  regressors  are  fixed  in 
repeated  samples). 
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3. 


The  expectation  of  the  disturbance  term  is  zero 

(£[8]  =  0). 


the  estimator  above,  and  write  it  in  the  follow¬ 
ing  form: 


4.  There  is  homoskedasticity  but  no 
autocorrelation  (spherical  disturbances, 

E[ee']  =  ct2I). 

5.  The  disturbances  are  normally  distributed 
(s  ~  N  (0,  cr2)  )• 

The  linear  regression  model  makes  these 
assumptions  not  because  they  are  substantively 
likely  to  describe  a  data  set  (more  on  this  later) 
but  because  they  imbue  the  least  squares  estima¬ 
tor  with  certain  properties  that  are  considered 
good,  namely,  unbiasedness  and  efficiency.  The 
technical  reasons  for  these  assumptions  are  dis¬ 
cussed  first  to  reinforce  the  point  that  this  par¬ 
ticular  set  of  assumptions  is  driven  by  the  choice 
of  the  estimator  and  not  by  the  demands  of  the 
data. 

Technical  Reasons  for  the  Assumptions 

The  five  assumptions  of  the  linear  regression 
model  are  chosen  for  technical  reasons,  but  they 
also  have  substantive  implications.  This  section 
describes  what  part  each  assumption  plays  in 
deriving  the  good  properties  of  the  estimator,  and 
the  next  details  the  substantive  commitments  their 
use  demands. 

The  first  assumption,  that  no  exact  linear  rela¬ 
tionships  exist  among  the  regressors,  allows  the 
computation  of  the  least  squares  estimator: 

X'Xp  -  X'y 

(3  =  (X'X)_1X'y. 

If  an  exact  linear  relationship  did  exist  among 
the  regressors,  it  would  be  impossible  to  solve  for 
(3  by  premultipling  both  sides  of  the  above  equa¬ 
tion  by  the  inverse  of  (X'X).  The  inverse  of  a 
square  matrix  exists  only  if  the  columns  and  rows 
of  that  matrix  are  linearly  independent. 

Assumptions  2  and  3 — X  is  nonstochastic 
and  the  expectation  of  the  disturbance  term  is 
zero — allow  the  claim  that  the  least  squares  esti¬ 
mator  is  unbiased  (technically,  the  expected 
value  of  the  estimator  equals  the  parameter,  or 
E[q]  —  0).  To  see  this,  substitute  X(3  +  £  for  y  in 


(3  =  (X'X)_1X'y 
-  (X'X)_1X'[X(3  +  e] 

=  p  +  (X'X)_1X'e. 

The  estimator  is  unbiased  if  the  second  term 
above  goes  to  zero.  Taking  the  expectation  of  the 
estimator  gives 

E[ p]  -  £[( 3  +  (X'X^X'e], 

The  expectations  operator  passes  through  (3 
because  it  is  a  constant.  Assumption  2  allows  the 
operator  to  pass  through  X  as  well,  giving 

E[ p]  -  (3  +  (X'X)_1X'E[e]. 

Finally,  Assumption  3  causes  the  second  term 
above  to  be  zero,  and  the  claim  that  the  least 
squares  estimator  is  unbiased  follows,  E[p]  =  (3. 

Assumption  4 — homoskedasticity  and  no  auto¬ 
correlation — allows  estimation  of  the  estimator’s 
variance.  As  variance  is  defined  as  E[(X  —  E[XJ)2J, 
it  is  necessary  to  get  the  estimator  in  this  form.  Begin 
with  the  estimator  in  the  form  above,  and  move  the 
parameter  to  the  left-hand  side  of  the  equation: 

(3  -  p  4-  (X'X)_1X'e 
P  -  (3  -  (X'X)_1X'e. 

Square  both  sides  (remembering  that  the  inner 
product  is  the  sum  of  squares  and  the  outer  prod¬ 
uct  is  variance/covariance  matrix),  and  take  the 
expectation  (remembering  that  X  is  nonstochastic): 


Var(p)  =E[(  (3  — p)(p  — p)'] 

=  E[(X'X)“1X'££'X(X'X)“1] 
=  (X'X)“1X'E[88']X(X'X)“1. 


The  expectation  on  the  right-hand  side  includes 
n  variances  and  n(n  —  l)/2  covariances: 


£[88']  = 
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as  £[e2]  =  cr2  and  £ [e^J  =  cr..  It  is  not  possible  to 
estimate  this  many  variances  and  covariances  with 
only  n  observations.  Assumption  4,  however, 
reduces  the  number  of  items  that  must  be 
estimated  from  n  +  n(n  —  1)12  to  one,  making 
estimation  possible.  Assumption  4  states  that  the 
variances  are  uniform,  £  [of]  -  a2  ,  and  that  the 
covariances  are  zero,  £[e,e;]  =  o,;  =  0.  The  expec¬ 
tation  above  then  simplifies  to 


£[ee'] 


V  o 

0  CT2 


0 

0 


-  o2I, 


0  0 


and  the  expression  for  the  variance  is  then 


Var(p)  =  (X'X)“1X'£[ee']X(X'X)'1 
-  (X'X)“1X'ct2IX(X'X)“1 
=  ct2(X'X)_1. 

These  four  assumptions  are  enough  to  prove  the 
Gauss-Markov  theorem,  which  states  that  the  least 
squares  estimator  is  the  most  efficient  estimator 
among  the  class  of  linear  and  unbiased  estimators. 
The  theorem  does  not  say  that  the  least  squares 
estimator  is  the  most  efficient  estimator;  there  exist 
estimators  that  are  more  efficient,  but  they  happen 
to  be  biased  or  nonlinear,  or  both. 

Finally,  Assumption  5 — normally  distributed 
disturbances — allows  derivation  of  the  sampling 
distribution  of  the  least  squares  estimates.  Because 
the  estimated  coefficients  are  linear  combinations 
of  normally  distributed  disturbances,  the  estimated 
coefficients  are  normally  distributed: 

p-Nfa^x'xr1). 

Correct  t  tests,  £  tests,  and  x2  tests  follow. 


Substantive  Implications  of  the  Assumptions 

While  the  five  assumptions  discussed  above  are 
primarily  used  to  imbue  the  least  squares  estimator 
with  good  properties,  the  same  assumptions  have 


very  specific  substantive  implications  for  the  data 
being  modeled.  Rarely,  however,  do  these  assump¬ 
tions  accurately  describe  the  data  used  by  political 
scientists. 

The  assumption  that  no  exact  linear  relation¬ 
ships  exist  among  the  regressors  means  that  no 
regressor  or  set  of  regressors  can  be  a  linear  com¬ 
bination  of  any  other  regressor  or  set  of  regressors. 
Not  violating  this  assumption  is  mostly  a  matter  of 
asking  meaningful  questions  of  the  data.  Problems 
with  this  assumption  most  often  result  from  the 
careless  use  of  dummy  variables.  As  an  example, 
consider  a  regression  of  the  difference  in  votes 
obtained  by  two  presidential  candidates  in  a  dis¬ 
trict  on  an  intercept  and  a  set  of  independent  vari¬ 
ables.  The  regressors  include  a  dummy  variable  for 
districts  in  southern  states  and  a  dummy  variable 
for  districts  in  nonsouthern  states.  The  regression, 
in  part,  asks  the  question  “How  do  districts  in 
southern  states  and  districts  in  nonsouthern  states 
differ  from  other  districts?”  The  substantive  prob¬ 
lem,  of  course,  is  that  there  are  no  other  districts 
(all  districts  have  to  be  in  the  South  or  the  non- 
South).  The  question  is  not  a  meaningful  one.  (The 
technical  problem  is  that  these  dummy  variables 
would  be  collinear  with  the  intercept. ) 

The  assumption  that  the  regressors  are  fixed  in 
repeated  samples  is  equivalent  to  assuming  that  the 
analyst  has  performed  an  experiment  where  she 
has  control  over  the  inputs  to  the  experiment. 
Thus,  the  experiment  could  be  rerun  many  times 
with  exactly  the  same  values  for  the  regressors,  but 
with  new  values  for  the  disturbances  and  therefore 
new  values  for  the  dependent  variable.  Each  time 
the  regression  is  rerun,  the  different  values  for  the 
dependent  variable  would  produce  different  esti¬ 
mated  coefficients.  Discussion  of  the  distribution 
of  the  estimated  coefficients  is  therefore  possible. 

The  “fixed  X”  assumption  fails  to  describe  all 
but  a  very  few  political  science  data  sets.  The 
assumption  can  easily  be  relaxed,  however,  either 
by  assuming  the  regressors  to  be  exogenous  and 
conditioning  on  them  or  by  assuming  stochastic 
regressors.  What  is  important  is  how  the  “fixed  X” 
assumption  interacts  with  the  assumption  that  the 
expected  value  of  the  disturbances  is  zero,  £[e]  =  0. 
Combined,  the  two  assumptions  imply  that  the 
regressors  are  uncorrelated  with  the  disturbances. 
The  lack  of  correlation  between  the  regressors  and 
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the  disturbances  is  the  key  assumption  in  regres¬ 
sion  analysis.  When  the  assumption  holds,  the  least 
squares  estimator  is  unbiased.  When  the  assump¬ 
tion  does  not  hold,  the  least  squares  estimator  is 
biased.  Unfortunately,  there  are  a  variety  of  sub¬ 
stantive  reasons  why  this  key  assumption  is  unlikely 
to  hold.  The  three  biggest  concerns — measurement 
error,  omitted  variables,  and  simultaneity — are 
discussed  in  depth  in  the  next  section. 

The  assumption  of  homoskedasticity  and  no 
autocorrelation  is  perhaps  the  assumption  least 
likely  to  describe  a  political  science  data  set.  The 
assumption  implies  that  all  the  unmeasured  factors 
in  the  disturbance  affect  different  observations  in 
exactly  the  same  way.  It  implies  that  observations 
drawn  from  physically  adjacent  areas  are  no  more 
alike  each  other  than  observations  drawn  from 
nonadjacent  areas.  It  implies  that  political  pro¬ 
cesses  that  are  observed  over  time  are  not  sticky  or 
sluggish.  Finally,  the  assumption  implies  that  the 
model  has  not  been  misspecified  in  important 
ways.  Few  of  these  implications  accurately  describe 
the  majority  of  political  science  data  sets.  A 
detailed  discussion  follows. 

Getting  the  Coefficients  Right 

When  estimating  a  statistical  model,  political 
scientists  are  concerned  with  getting  the  estimated 
coefficients  right.  That  is,  a  researcher  would  like 
his  or  her  estimated  coefficients  to  be  close  to  the 
population  parameter  values.  Unfortunately,  there 
is  no  way  to  know  whether  an  estimate  is  close  to 
the  true  parameter  value  (short  of  adopting  a 
Bayesian  perspective,  which  is  yet  to  be  done  by 
the  majority  of  political  scientists).  All  that  can  be 
done  is  to  ascertain  whether  the  estimator  is 
good — and  then  assume  that  a  good  estimator  pro¬ 
duces  a  good  estimate. 

As  noted  above,  there  are  two  ways  to  think 
about  a  good  estimator  in  this  sense.  One  way  is 
to  say  that  an  estimator  is  good  if  it  is  on  average 
neither  high  nor  low,  a  property  that  is  known  as 
unbiasedness.  Formally,  an  estimator  is  unbiased 
if  its  expectation  is  equal  to  the  population 
parameter,  E[p]  =  (3.  The  other  way  is  to  say  that 
an  estimator  is  good  if  it  is  near  the  true  value 
almost  all  the  time  when  the  sample  size  is  large, 
a  property  that  is  known  as  consistency.  Formally, 
an  estimator  is  consistent  if  its  probability  limit, 


or  plim,  is  equal  to  the  population  parameter: 
plim  p  =  p. 

Of  the  two  definitions  of  the  term  good,  the  lat¬ 
ter  is  closer  to  what  political  scientists  actually 
mean.  An  unbiased  estimator  only  gives  informa¬ 
tion  about  the  average  estimate,  not  the  estimate 
actually  calculated.  Consistency,  on  the  other 
hand,  is  the  minimum  requirement  for  any  estima¬ 
tor;  with  a  large  sample  size,  the  calculated  esti¬ 
mate  should  be  close  to  the  truth. 

A  definitive  statement  choosing  between  unbi¬ 
asedness  and  consistency  is  not  necessary,  however, 
as  the  major  condition  for  achieving  both  is  the 
same.  The  key  assumption  is  that  the  disturbances 
have  mean  zero  and  are  uncorrelated  with  each 
regressor  or  E[X'e]  =  0.  (This  condition  is  weaker 
than  assuming  the  independence  of  X  and  e. 
Independence  implies  a  correlation  of  zero,  where¬ 
as  a  correlation  of  zero  does  not  imply  indepen¬ 
dence.)  Given  the  importance  of  this  assumption,  it 
would  seem  that  testing  this  assumption  would  be 
the  first  step  in  marshaling  evidence  for  the  statisti¬ 
cal  adequacy  of  a  proposed  model.  The  unknown 
disturbances,  s,  could  be  estimated  with  the  least 
squares  residuals,  e  =  y  —  X(3.  and  then  corre¬ 
lated  with  each  regressor.  Unfortunately,  this  test  is 
uninformative.  The  least  squares  residuals  are,  by 
construction,  uncorrelated  with  the  regressors.  A 
simple  manipulation  of  the  least  squares  estimator 
demonstrates  the  point.  Multiply  both  sides  of  the 
least  squares  estimator  by  X'X,  and  then  substitute 
xp  +  e  for  y  on  the  right-hand  side: 

p  =  (X'X)_1X'y 

X'Xp  =  X'(xp  +  e) 

X'Xp  =  X'Xp  +  X'e 
X'e  =  0. 

As  this  critical  assumption  is  not  directly  test¬ 
able,  we  need  to  consider  the  ways  in  which  the 
assumption  could  be  violated.  The  three  most 
common  violations  include  measurement  error, 
omitted  variables,  and  simultaneity.  We  will  exam¬ 
ine  each  in  turn. 

Measurement  Error 

Measurement  error  in  political  science  arises 
from  the  difficulty  of  measuring  the  theoretical 
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constructs  to  which  political  scientists  often  refer 
in  their  work.  Concepts  such  as  power,  democ¬ 
racy,  and  ideology  can  be  measured  in  a  variety  of 
different  ways,  but  each  measure  is  an  approxima¬ 
tion  of  an  abstract  idea.  Good  measures  include 
significant  parts  of  the  construct  of  interest,  but 
invariably,  they  also  include  parts  of  other  con¬ 
structs  (systematic  bias)  as  well  as  random  error. 
Measurement  error  is  likely  the  rule,  and  not  the 
exception,  in  most  political  science  data. 

Theory.  Let  the  DGP  include  a  single  regressor: 

y  =  x(3  +  e. 

Assume  that  the  single  regressor  has  been  mea¬ 
sured  with  some  error.  That  is,  the  variable  that  is 
actually  available  to  us  is  not  x,  but  w: 

w  =  x  +  u, 

where  u  is  the  random  error  and  w  thus  comprises 
the  true  value,  x,  plus  an  error  component.  In  addi¬ 
tion,  assume  that  u  has  mean  zero,  is  uncorrelated 
with  e,  and  is  uncorrelated  with  the  single  regres¬ 
sor,  x.  If  we  write  x  as  a  function  of  w  and  u,  the 
model  we  actually  estimate  is 

y  =  (w  -  u)P  +  e 
=  wp  +  v, 

where  v  —  e  —  u^.  The  least  squares  estimator  for 
this  model  is 

p  =  (w'w)  :w'y 
=  P  +  (w'w)  :w'v. 

The  problem  here  is  that  w  and  v  are  both  func¬ 
tions  of  the  random  error,  u.  Thus,  w  and  v  are 
correlated,  which  violates  the  assumption  that 
guarantees  consistency  and  unbiasedness. 

In  the  special  case  just  discussed  (a  regression 
with  a  single  mismeasured  regressor),  the  effect 
of  measurement  error  is  straightforward,  w  con¬ 
sists  of  a  systematic  part,  x,  and  a  random  part,  u. 
The  random  part  of  w  is  uncorrelated  with  the 
dependent  variable  by  construction.  The  larger 
the  variance  of  the  random  part  grows  relative 
to  the  systematic  part,  the  closer  the  estimated 


coefficient  is  to  zero.  This  effect  is  seen  in  the 
expression  for  p’s  inconsistency, 

plim(P  -  p)  =  -  2^7"  2- 

(T  +  (J 

X  u 

The  inconsistency  is  negative,  which  accounts  for 
the  bias  toward  zero.  The  inconsistency  is  small 
only  if  <rx  is  large  relative  to  cT.  As  the  latter  term 
increases  in  size,  the  greater  is  the  attenuation. 

Practice.  The  case  just  discussed  is  highly  stylized 
and  is  unlikely  to  be  encountered  in  practice.  The 
comforting  conclusions  of  these  kinds  of  theoretical 
discussions — for  example,  that  the  effects  are 
attenuated — generally  do  not  hold  in  more  realistic 
situations.  First,  most  regressions  are  likely  to 
include  multiple  regressors,  with  some  subset 
being  measured  with  significant  error.  In  this 
situation,  it  is  difficult  to  know  the  effects  of 
measurement  error.  Even  if  the  measurement  error 
is  confined  to  one  variable,  it  affects  the  estimated 
coefficients  of  the  variables  measured  without 
error.  Second,  the  error  in  the  above  discussion  is 
random  error;  it  is  uncorrelated  with  the 
disturbance,  e,  and  uncorrelated  with  the  lone 
regressor,  x.  Actual  measurement  error  is  far  more 
likely  to  be  systematic.  Any  operational  definition 
of  a  difficult  to  measure  theoretical  construct,  such 
as  power  or  democracy,  is  likely  to  pick  up 
elements  of  other,  unwanted  constructs.  The 
problem  can  be  severe.  It  has  been  demonstrated, 
for  example,  that  correlated  measurement  error 
between  two  variables  can  lead  to  an  incorrect 
sign  on  an  estimated  coefficient.  Including  long 
lists  of  unevenly  measured  control  variables  in  a 
regression,  therefore,  makes  little  sense.  Third, 
measurement  error  can  also  appear  in  the 
dependent  variable.  This  case  is  often  ignored,  as 
random  error  in  the  dependent  variable  is  simply 
added  to  the  error  component  of  the  model,  and 
consistency  is  unaffected.  Again,  systematic  error 
that  is  correlated  with  the  included  regressors  is 
more  likely,  and  the  result  is  inconsistency  and 
bias. 

Omitted  Variables 

Hidden  within  the  assumption  that  the  regres¬ 
sors  must  be  uncorrelated  with  the  error  term  is 


2182  Quantitative  Methods,  Basic  Assumptions 


the  claim  that  we  have  all  the  important  variables 
that  affect  the  dependent  variable  accounted  for  in 
the  model.  More  specifically,  the  claim  is  that  the 
model  includes  all  the  important  regressors  that 
are  correlated  with  the  other  included  regressors. 
Omitted  variables,  like  measurement  error,  are 
probably  the  rule  rather  than  the  exception  in 
political  science  research. 

Theory.  Let  the  true  DGP  be 

y  =  XiPj  +  x2p2  +  a, 

where  Xx  is  an  n  X  k  —  1  matrix  of  regressors  and 
x2  is  a  single  regressor.  Assume  that  x2  is  omitted 
from  the  model  either  because  it  cannot  be  mea¬ 
sured  or  because  it  is  unknown.  The  model  actu¬ 
ally  estimated,  then,  is 

y  =  x,p,  +  £*, 

where  £*  =  x2p2  +  £.  This  expression  makes  the 
problem  clear.  The  new  error  term,  £*,  includes  x2. 
If  that  variable  is  correlated  with  the  included  vari¬ 
ables,  X1S  the  assumption  that  the  regressors  are 
uncorrelated  with  the  disturbances  no  longer  holds. 

To  see  this,  begin  with  the  least  squares  estima¬ 
tor  of  the  misspecified  model,  and  substitute  the 
right-hand  side  of  the  true  model  for  y: 

Pi  -  (XiX1)_1Xiy 

-  (X'X1)'1Xl(X1p1  +  x2p2  +  E) 

=  Pi  +  (X'x1)^1x;x2p2  +  (xjxo-'xje. 

Taking  the  expectation  under  the  assumption 
that  the  included  variables  are  uncorrelated  with 
the  disturbance  term,  the  result  is 

E[\ Pi]  =  Px  +  (X'1X1)“1Xix2p2. 

Thus,  the  difference  between  the  expectation 
of  Pj  and  the  truth  depends  on  two  kinds  of  val¬ 
ues.  The  first  are  the  coefficients  from  the  regres¬ 
sion  of  the  excluded  variable  on  the  included 
variables,  (XiXt)  1Xlx2.  The  second  is  the  true 
effect  of  x2,  P2.  Using  this  information,  it  is  pos¬ 
sible  to  identify  the  direction  of  the  inconsistency 
and  bias. 


Including  irrelevant  variables  in  a  regression  is 
considered  a  lesser  problem.  Let  the  true  DGP  be 

y  =  XiPj  +  £ 

and  the  estimated  equation  be 

y  —  XiPt  +  X2p2  +  e. 

The  variables  in  X2  are  irrelevant  and  thus  have 
no  effect  on  the  dependent  variable.  When  this 
equation  is  estimated,  least  squares  correctly  esti¬ 
mates  P2  as  zero,  and  the  estimator  of  px  is  consis¬ 
tent.  The  only  downside  usually  noted  is  that  the 
variance  of  Pj  in  the  estimated  equation, 

ct2(X^M2X1)_1, 

where  M2  =  I  —  X2(X2X2)_1X2,  is  always  as  large 
as,  or  larger  than,  the  true  variance, 

^(xjXi  r1. 

This  increase  in  the  variance  is  considered  to  be 
a  fair  price  to  pay  for  the  promise  of  avoiding 
inconsistency  and  bias  caused  by  omitted  variables. 

Practice.  As  is  true  for  measurement  error,  the  theory 
and  practice  of  omitted  variables  diverge,  and 
comforting  conclusions  regarding  the  direction  of  the 
bias  or  inconsistency  apply  only  in  very  special  cases. 
For  example,  it  is  unlikely  that  a  single  variable  is 
omitted  from  a  regression.  A  set  of  omitted  variables 
is  more  likely,  making  the  expectation  above 

e[P,]  =  pa  +  (xix1)_1x;x2p2. 

Even  more  likely  than  a  set  of  omitted  variables  is 
die  situation  where  the  researcher  is  deciding  whether 
to  include  a  known  group  of  previously  omitted 
variables  while  still  omitting  a  group  of  unknown 
or  unmeasured  variables.  This  situation  can  be 
expressed  using  the  following  DGP  and  two  mis¬ 
specified  models.  Let  the  DGP  be  in  scalar  notation: 

Yt  —  Po  +  PiXn  +  P2X?2  +  p3X23  +  £( 

and  the  two  misspecified  models  be 

Model  1  :  Yt  —  P01  +  pnX(j  +  £tj, 

Model 2  :  Yt  —  P02  +  pi2X(j  +  P22Xi2  +  £f2. 
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Model  1  omits  both  X,  and  X3,  and  Model  2 
omits  just  X3  The  prevalent  view  in  political 
science  is  that  the  bias  on  (3n,  the  estimated  coef¬ 
ficient  on  Xj  in  model  1,  is  always  greater  than  the 
bias  on  fS12,  the  estimated  coefficient  on  X, 
in  model  2.  Letting  the  bias  on  (3n,  £[(3n]  —  (3l5 
be  denoted  as  and  the  bias  on 

3 12 5  £[Pi2]  -  Pi,  be  denoted  as  b(P12,Pi),  the 
mathematical  argument  is  that 

This  conclusion,  however,  does  not  hold  in 
general.  It  can  be  demonstrated  that  the  inclusion 
of  additional  relevant  variables  can  increase  or  de¬ 
crease  the  bias  on  the  X,  coefficient.  Without 
knowing  the  effects  of  the  still  omitted  variables 
on  the  newly  included  variables,  it  is  impossible 
to  know  whether  the  newly  included  variables 
increase  or  decrease  the  bias  and  inconsistency. 

Simultaneity 

Simultaneity  is  the  third  major  way  in  which  the 
key  assumption  necessary  for  unbiasedness  and 
consistency — the  regressors  and  the  disturbances 
are  uncorrelated — can  be  violated.  The  problem  of 
simultaneity  arises  when  one  of  the  right-hand-side 
regressors  is  determined  simultaneously  with  the 
dependent  variable. 

Theory.  Let  the  true  DGP  consist  of  two  equations 
with  two  dependent  variables,  yt  and  y2,  each  of 
which  is  a  function  of  the  other.  The  equations  also 
include  two  exogenous  regressors,  x,  and  x„  and 
two  sets  of  disturbances,  £j  and  e2, 

Yi  +  PuYi  +  7nxi  =  £i> 

y  2  +  p2iyi  +  7l2x2  =  G- 

To  see  the  problem  in  estimating  these  equa¬ 
tions  consistently,  rewrite  the  first  equation  as 

Yi  —  ZS  +  gj, 

where  Z  =  [y2  xj  and  8'  =  [— p12  — 7n]  .  The 
usual  ordinary  least  squares  (OLS)  estimator  is 

8  =  (Z'Z)  'Z'yp 


Substituting  Z8  +  for  yt  into  the  above  equa¬ 
tion  and  taking  expectations,  it  is  easy  to  see  that 
the  expected  value  of  8  does  not  equal  8: 

£[  8]  =  8  +  £[(Z'Z)_1Z'e1] . 

The  expected  value  on  the  right-hand  side  of  the 
above  equation  does  not  approach  zero  because  Z 
contains  an  endogenous  variable  y2,  which  is 
jointly  determined  with  y1  and  thus  is  correlated 
with  8|.  The  result  is  bias  and  inconsistency 
because  the  main  condition  for  achieving  unbiased 
and  consistent  estimates,  that  the  columns  of  Z  be 
uncorrelated  with  the  disturbance,  e15  is  violated. 

Faced  with  the  above  situation,  it  is  possible  to 
estimate  an  equation  known  as  the  reduced  form 
by  rewriting  the  original  two  equations  as 

By  +  Tx  =  s, 

where 


y'=[yi  Yi), 
r'  =  [*Yn  721  ]> 

x'  =  [Xi  x2], 
e'=  [El  £2  ]- 

Provided  B  is  nonsingular,  the  equation  can  be 
solved  in  the  following  way: 

yt  —  — B  xrx(  +  B  :e 
=  rix  +  v, 

where  II  =  —  B  :T  and  v  —  B_1e. 

Most  of  the  time,  however,  political  scientists  are 
interested  in  obtaining  estimates  of  the  original  co¬ 
efficients,  not  of  the  reduced-form  coefficients.  The 
big  question  is  whether  estimates  of  the  original 
coefficients  can  be  recovered  from  the  reduced-form 
estimates.  Consider  the  model  given  above,  but  this 
time  premultiplied  by  a  nonsingular  matrix  G: 

GBy  +  GTx  =  Ge. 

Solving  this  equation  for  the  reduced-form  coeffi¬ 
cients  produces  exactly  the  same  coefficient  estimates 
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as  the  model  that  is  not  premultiplied  by  the  G 
matrix: 

yt  =  -B_1G_1GIX  +  B_1G_1Ge 
=  nx  +  v, 

where  II  =  —  B  :r  and  v  —  B  1e. 

It  is  impossible,  then,  to  recover  estimates  of  the 
original  coefficients  because  both  models  produce 
exactly  the  same  reduced-form  coefficients.  The 
equivalence  of  these  two  models  is  known  as  the 
identification  problem,  and  much  of  the  practice  of 
simultaneous  equations  is  devoted  to  solving  it. 
Systems  of  equations  where  estimates  of  the  origi¬ 
nal  coefficients  can  be  recovered  from  the  reduced 
form  are  identified.  Systems  where  recovery  is  not 
possible  are  unidentified. 

Practice.  In  the  context  of  simultaneous  equations, 
the  identification  problem  concerns  the  ability  to 
recover  estimates  of  the  coefficients  of  interest  from 
the  reduced-form  coefficients.  Ensuring  that  a  system 
of  equations  is  identified  generally  requires  the  use 
of  a  priori  nonsample  information.  Such  information 
most  frequently  comes  in  the  form  of  exclusion 
restrictions — a  specification  that  certain  endogenous 
variables  and  certain  exogenous  variables  do  not 
appear  in  certain  equations.  The  point  is  to  make  it 
harder,  and  eventually  impossible,  to  find  a  G 
matrix  that  produces  the  same  set  of  reduced-form 
coefficients  as  the  original  model.  If  no  such  matrix 
is  found,  then  the  system  of  equations  is  identified. 

Where  does  the  nonsample  information  come 
from?  As  the  necessary  information  cannot  be 
deduced  from  the  data,  it  has  to  come  from  theory. 
Unfortunately,  little  theory  in  political  science  is 
detailed  enough  to  provide  justifiable  exclusion 
restrictions.  Decisions  about  restrictions,  then,  are 
often  made  on  ad  hoc  grounds.  The  consequences 
of  making  false  exclusions  are  the  familiar  ones  of 
bias  and  inconsistency. 

Finally,  it  should  be  noted  that  the  three  major 
sources  of  endogeneity — measurement  error,  omit¬ 
ted  variables,  and  simultaneity — do  not  occur  in 
isolation  from  one  another.  Systems  of  equations 
use  more  variables  than  single-equation  models 
and  are  therefore  more  likely,  ceteris  paribus,  to 
suffer  from  measurement  error.  The  use  of  false 
exclusion  restrictions  leads  to  omitted-variable 


bias.  Any  real-life  data  analysis  situation  is  likely 
to  suffer  from  all  three  problems,  making  any 
attempts  to  determine  the  direction  of  bias  futile. 

What  Does  Not  Need  to  Be  Assumed 

Getting  the  coefficients  “right”  requires  one  major 
assumption,  the  disturbance  has  mean  zero  and 
is  uncorrelated  with  each  regressor,  E[X'e]  =  0, 
and  one  minor  assumption,  no  regressor  or  set  of 
regressors  is  a  linear  combination  of  any  other 
regressor  or  set  of  regressors,  or  X  has  full  column 
rank.  The  list  of  assumptions  that  do  not  need  to 
be  made  is  much  longer.  For  example,  it  is  not 
necessary  to  assume  normality  of  any  variables, 
dependent  or  independent.  It  is  not  necessary  to 
assume  that  the  independent  variables  are  uncor¬ 
related  with  one  another  or  correlated  at  low 
levels.  It  is  also  unnecessary  to  assume  anything 
about  the  variance/covariance  matrix  of  the  distur¬ 
bances.  Neither  heteroskedasticity  nor  autocorrela¬ 
tion  affects  the  bias  or  consistency  of  the  estimator. 

Getting  the  Standard  Errors  Right 

Getting  the  standard  errors  right  is  not  neces¬ 
sary  for  getting  the  coefficients  right.  The  reverse, 
however,  is  not  true.  The  problems  that  can  plague 
estimates  of  the  coefficients — measurement  error, 
omitted  variables,  and  simultaneity — can  affect  the 
standard  errors.  When  trying  to  get  the  coefficients 
right,  it  is  often  difficult  in  practice  to  know  the 
direction  of  the  bias  or  inconsistency.  The  same 
is  not  true  for  the  standard  errors,  however.  In 
most  cases  involving  endogeneity,  the  estimated 
standard  errors  are  too  narrow,  leading  to  over- 
confidence  in  the  results  of  the  analysis. 

Consider  the  omitted-variable  case.  When  the 
true  DGP  is 

y  =  Xjfh  +  X2p2  +  e, 

the  true  variance  of  [3,  is  cr2  (XjM2Xj)-1.  If  the  vari¬ 
ables  in  X2  are  omitted  from  the  estimated  model, 
the  variance  of  fh  is  ct2(XJXi)-1,  which  is  always 
as  small  as,  or  smaller  than,  the  true  variance.  A 
similar  demonstration  can  be  made  in  the  case  of 
measurement  error.  As  the  amount  of  random 
error  in  x  increases,  the  estimated  standard  error  on 
the  coefficient  of  x  decreases.  The  problem  is  that 
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the  analyst  thinks  he  has  more  information  than  he 
actually  does. 

The  discussion  that  follows  assumes  that  endo¬ 
geneity  problems  do  not  exist,  so  that  the  other 
issues  that  lead  to  incorrect  standard  errors  can  be 
examined.  It  is  important  to  remember,  however, 
that  the  estimated  standard  errors  are  likely  to  be 
too  small  even  before  these  additional  problems 
are  discovered. 

Getting  estimates  of  the  regression  coefficients 
right  is  a  matter  of  making  a  single,  albeit  very 
important,  assumption:  E[X'e]  =  0.  Getting  the 
standard  errors  right  requires  making  another  very 
important  assumption  involving  the  disturbances. 
This  time  the  assumption  is  that  the  squared  dis¬ 
turbances  are  uncorrelated  with  each  regressor,  its 
square,  and  all  cross-products.  Formally,  the 
assumption  is  that  £[e2X'X]  =  cr2£[X'X],  where 
cr2  is  the  expectation  of  8.  This  assumption  is 
implied  by  the  more  common  assumption  that  the 
expectation  of  the  squared  disturbances  is  constant 
E[e2]  =  cr2.  Under  this  assumption,  as  well  as  the 
two  previous  assumptions — X  has  full  column  rank 
and  E[X'e]  =  0 — the  OLS  estimator  of  (3  is  nor¬ 
mally  distributed,  and  the  standard  errors,  t  statis¬ 
tics,  and  F  statistics  are  asymptotically  valid. 

As  with  the  previous  big  assumption,  however, 
violations  of  the  new  assumption  are  both  com¬ 
mon  and  likely. 

Heteroskedasticity 

The  assumption  of  homoskedasticity  means 
that  each  of  the  disturbances  is  drawn  from  a  dis¬ 
tribution  with  the  same  variance  as  the  other 
disturbances.  In  conjunction  with  previous  as¬ 
sumptions,  the  homoskedasticity  assumption 
is  E[e2]  =  cr2  .  Violations  of  this  assumption  can 
occur  in  one  of  two  ways:  as  a  result  of  misspecifi- 
cation  or  as  a  result  of  the  data  themselves. 

Misspecification.  Heteroskedasticity  due  to  mis- 
specification  can  arise  in  a  number  of  different 
ways.  If  the  true  DGP  is 

y  =  Xi3,  +  x202  +  e, 

and  the  analyst  omits  the  variables  in  X9  from  her 
model, 

y  =  X,(3,  +  s*, 


then  the  error  term  is  a  function  of  the  omitted 
variables,  8*  =  X20,  +  8.  As  the  omitted  variables 
vary,  so  does  the  error  variance. 

An  incorrect  functional  form  can  also  cause 
heteroskedasticity.  If  the  true  DGP  is  nonlinear, 

y,  =  0o  +  Pi*-  +  £«, 

and  the  analyst  estimates  a  linear  model, 
y,  =  00  +  01  Xj  +  Vi, 

then  the  disturbance  is  again  a  function  of  x,  u  = 
f  (xj,  8 ().  As  x  varies,  so  does  the  error  variance. 

A  coefficient  that  varies  across  observations  can 
also  be  a  source  of  heteroskedasticity.  Consider  a 
data-generating  process  where 

Vi  =  0o  +  0,V  +  £, 

and  0,  varies  randomly  around  some  fixed 
0:  0,  =  0  +  vr  Application  of  OLS  estimates  the 
model: 

X  =  00  +  (0  +  Vj)Xj  +  8, 

=  00  +  0X,  +  (v,Xi  +  8,), 

and  once  again,  the  error  term  is  a  function  of  x. 

A  multiplicative  error  term  and  an  incorrect 
data  transformation  are  other  specification  issues 
that  could  cause  heteroskedasticity. 

Data.  The  data  can  also  be  a  source  of  hetero¬ 
skedasticity.  In  the  simplest  case,  an  influential 
outlier  can  cause  heteroskedasticity.  By  the  same 
token,  a  heavily  skewed  regressor  can  also  cause  a 
nonconstant  error  variance. 

Specific  kinds  of  data  can  be  a  source  of  the 
problem.  Aggregate  data  are  a  common  source  of 
heteroskedasticity.  If  the  dependent  variable  is  an 
average  or  a  proportion,  the  variance  is  a  function 
of  the  number  of  observations  being  aggregated  in 
the  different  units.  If  the  estimated  model  is  based 
on  an  average, 

y,  =  00  +  0v  +  £„ 

where  s;  is  an  average  for  the  ;th  unit  over 
i  individuals,  the  variance  of  the  error  term  is 
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Var(e;)  =  it1  /Nr  The  error  variance,  then,  clearly 
varies  with  the  size  of  the  group.  A  similar  demon¬ 
stration  can  be  performed  for  a  dependent  variable 
based  on  a  proportion. 

Finally,  heteroskedasticity  most  commonly 
occurs  with  cross-sectional  data  where  units  of  dif¬ 
ferent  sizes,  whether  they  be  individuals,  firms, 
industries,  or  states,  are  taken  together.  It  is  easy 
to  imagine  larger  units  having  larger  absolute  error 
terms.  A  $900  billion  economy  is  more  likely  to  see 
an  absolute  error  of  $10  billion  than  a  $100  billion 
economy. 

Autocorrelation 

The  assumption  of  no  autocorrelation  means 
that  the  disturbance  associated  with  one  observa¬ 
tion  is  unrelated  to  the  disturbance  associated  with 
any  other  observation.  Formally,  the  assumption  is 
E  [£,£,]  =  0  for  i  ^  j.  As  is  the  case  with  heteroske¬ 
dasticity,  violations  are  the  result  of  misspecifica- 
tion  of  the  data  themselves. 

Misspecification.  Like  heteroskedasticity,  autocor¬ 
relation  can  be  the  result  of  omitted  variables, 
incorrect  functional  form,  or  data  manipulation.  An 
omitted  variable  that  happens  to  be  autocorrelated 
causes  the  disturbances  to  be  autocorrelated.  A 
special  case  of  this  result  occurs  when  the  omitted 
variable  is  the  lagged  value  of  the  dependent 
variable.  Autocorrelation  arises  in  this  case  due  to 
the  influence  of  the  lagged  value  on  the  current 
value.  If  a  linear  functional  form  is  used  to  model  a 
nonlinear  relationship,  there  will  be  sections  of  the 
relationship  where  the  estimated  regression  line 
consistently  underestimates  or  overestimates  the 
relationship.  Finally,  smoothing  techniques  such  as 
moving  averages  can  induce  a  periodicity  in  the 
disturbances  that  did  not  exist  prior  to  the  smoothing. 

Data.  Autocorrelation  is,  somewhat  obviously, 
most  likely  to  occur  in  data  that  are  observed  over 
time.  Such  variables  often  display  a  “stickiness” 
and  only  change  in  small  increments  over  time. 
State  budgets,  for  example,  only  change  marginally 
from  year  to  year.  Successive  observations,  then, 
are  likely  to  be  correlated.  Similarly,  random 
shocks  to  a  system,  such  as  war  or  a  market  crash, 
have  prolonged  effects  across  time  periods,  leading 
to  correlated  disturbances. 


A  more  subtle  form  of  autocorrelation  arises 
from  cross-sectional  data.  Spatial  autocorrelation 
can  occur  when  observations  are  taken  from  units 
that  are  physically  adjacent  to  one  another,  such  as 
states  or  countries.  The  behavior  of  individuals 
from  adjacent  West  European  countries  is  likely  to 
be  affected  by  similar  unmeasured  factors,  which 
leads  to  correlated  disturbances. 

What  to  Do 

The  conclusion  to  draw  from  the  discussion 
above  is  not  that  regression  can  never  be  trusted. 
All  the  problems  that  have  been  detailed  have  solu¬ 
tions  that  are  examined  elsewhere  in  this  encyclo¬ 
pedia.  The  discussion  that  follows  outlines  a  few 
broad  approaches  to  addressing  the  common 
assumption  violations  encountered  by  political 
scientists  using  the  linear  regression  model. 

Endogeneity  problems  can  often  be  dealt  with 
through  the  use  of  instrumental-variables  regres¬ 
sion.  Mismeasured  variables,  for  instance,  can  be 
replaced  with  “instruments,”  which  are  alternative 
variables  that  are  uncorrelated  with  the  distur¬ 
bance  term  and  yet  are  still  correlated  with  the 
mismeasured  regressors.  Any  regressor  that  is 
uncorrelated  with  the  disturbance  term  can  serve 
as  its  own  instrument. 

Let  the  matrix  of  instruments  be  Z,  which  has 
at  least  as  many  columns  (regressors)  as  the  orginal 
matrix  of  regressors,  X.  Next,  premultiply  the 
usual  linear  regression  equation  by  Z', 

Z'y  =  Z'Xfl  +  Z'£, 

where  the  variance  of  Z'e  is  a(Z'Z).  The  instru¬ 
mental-variables  (IV)  estimator  is,  then, 

PIV  -  (X'Z(Z'Z)_1Z'X)_1X'Z(Z'Z)'1Z'y, 

with  variance  Var(fliv)  =  cr“(X'Z(Z'Z)  *Z')X)  \ 
Estimates  are  easily  calculated  by  noting  that 
X  =  Z(Z'Z)_1Z'X  are  the  fitted  values  from  the 
regression  of  X  on  Z.  The  instrumental-variables 
estimator,  then,  is  just 

P,v=  (x'x)_X'y 

It  can  be  easily  shown  that  the  instrumental- 
variables  estimator  is  consistent  and,  therefore,  it 
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is  possible  to  get  correct  coefficients.  The  trade-off 
for  this  gain  in  consistency  is  loss  of  precision. 

Problems  relating  to  the  error  term,  such  as  het- 
eroskedasticity  and  autocorrelation,  can  be  dealt 
with  either  through  feasible  generalized  least  squares 
or  through  calculation  of  robust  standard  errors.  In 
the  case  of  heteroskedasticity,  the  challenge  is  to  esti¬ 
mate  the  correct  variance/covariance  matrix,  which  is 


Var(0)  =  (X'X)_1X'(t2QX(X'X)_1 


1  (\ 


n  \  n 


X'X)  (-X'[CT2n]X 


Estimating  a2 Cl,  the  dimensions  of  which  are 
n  X  n,  is  impossible  with  n  observations.  Estimating 

^-X'  [ct2Q]X^  ,  however,  is  not  impossible,  as  its 

dimensions  are  only  K  X  K,  where  K  is  the  number 
of  regressors  in  X.  It  can  be  shown  that  this 
expression  is  equal  to 


1 

n 


v  2  i 

Z/T  x>x<  3 


and  the  above  can  be  consistently  estimated  by 


c  1  v^~2  , 

s  =  w1ex-x- 

where  e,  is  the  residual  for  the  z’th  observation.  The 
correct  variance/covariance  is  consistently  esti¬ 
mated  by 


Var(p)  -  zz(X'X)_1S(X'X)“1. 

It  is  important  to  remember  that  these  tech¬ 
niques  bring  with  them  their  own  sets  of  assump¬ 
tions,  which  may  or  may  not  be  more  plausible 
than  the  assumptions  of  OLS.  The  key  assumption 
of  the  instrumental-variables  procedure  is  that  the 
instruments  are  uncorrelated  with  the  disturbance. 
This  assumption  cannot  be  tested,  and  the  use  of 
“quasi  instruments,”  instruments  that  are  only 
approximately  uncorrelated  with  the  disturbance, 
can  produce  incorrect  inferences.  Strong  theory  is 
required  to  make  such  assumptions  plausible. 

Assumptions  and  the  Generalized 
Linear  Model 

As  noted  in  the  introduction  to  the  linear  regres¬ 
sion  model  discussion,  GLM  replaced  the  linear 


regression  model  as  the  workhorse  of  political 
science.  A  GLM  is  one  that  assumes  a  distribu¬ 
tion  included  in  the  linear  exponential  family. 
These  distributions  include,  among  others,  the 
normal,  the  Bernoulli,  the  exponential,  and  the 
Poisson.  These  models  are  estimated  by  maxi¬ 
mum  likelihood,  and  their  good  properties  only 
hold  asymptotically. 

GLMs  require  very  strong  distributional 
assumptions.  In  addition,  they  require  many  of  the 
same  assumptions  as  the  linear  regression  model. 
Some  of  these  assumptions,  however,  work  very 
differently  in  this  setting.  Consider  the  probit 
model.  The  model  is  derived  by  assuming  that  a 
latent  or  unobserved  variable  y*  is  a  function  of 
some  regressors  and  an  error  term: 

y*  =  X(3  +  8. 

All  that  is  observed  of  the  latent  variable  is 
whether  it  is  greater  than  zero: 

f  1  if  y*  >  0 
y'  =  {0  if  y*  ^  0. 


Given  the  above,  the  probability  of  observing  a 
one  is 


Pr(y,  =  1)  =  Pr(y*  >  0) 

=  Pr(x,P  +  e,  >  0) 

-  Pr(e,  >  -x,(3) 

=  Pr(e,  <  x,p) 

=  b(x,P), 

where  F()  is  the  cumulative  distribution  function 
of  8 .  The  pro  bit  model  comes  from  assuming  that 
8  is  distributed  normally. 

The  probit  model  makes  the  same  key  assump¬ 
tion  that  the  linear  regression  model  does:  8  and  the 
regressors  must  be  uncorrelated  with  one  another. 
Unfortunately,  not  all  the  lessons  learned  in  the 
linear  regression  case  are  applicable  to  probit  mod¬ 
els.  For  example,  the  linear  regression  model  does 
not  require  normality  for  consistency.  The  probit 
model,  however,  does.  The  e,  must  be  drawn  from 
independent  and  identically  distributed  normal 
distributions.  For  the  linear  regression  model,  the 
OLS  estimator  is  unbiased  and  consistent  when  a 
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relevant  variable  is  omitted  from  the  regression, 
provided  that  the  omitted  variable  is  uncorrelated 
with  the  included  variables.  This  result  does  not 
hold  for  probit  models.  When  a  relevant  variable 
is  omitted  from  a  probit  specification,  the  esti¬ 
mated  coefficients  on  the  included  variables  are 
biased  and  inconsistent  whether  or  not  the  omitted 
variable  is  correlated  with  the  included  variables. 
By  the  same  token,  robust  standard  errors  are 
often  a  useful  way  of  dealing  with  heteroskedastic- 
ity  in  the  linear  regression  model.  Robust  standard 
errors,  however,  make  little  sense  for  the  probit 
model.  If  the  model  is  correct,  so  are  the  standard 
errors.  If  the  model  is  seriously  misspecified,  use  of 
robust  standard  errors  provides  asymptotically 
correct  variance  estimates  on  wrong  coefficient 
estimates.  Robust  standard  errors,  then,  are  of  lit¬ 
tle  help  in  this  context. 

The  lessons  learned  from  the  linear  regression 
model  have,  at  best,  heuristic  value  when  it  comes 
to  more  complex  models  such  as  GLMs.  New 
assumptions  have  to  be  made,  and  old  assump¬ 
tions  often  have  to  be  revisited  and  reevaluated. 
Solutions  to  problems  that  work  for  the  linear 
regression  case  may  not  work  for  GLMs.  It  is 
important  to  remember  that  complex  models  are 
more  than  just  fancy  linear  regressions. 

Assumptions  and  Inference 

Of  the  two  goals  of  quantitative  modeling  in 
political  science  mentioned  in  the  introduction — 
description  and  inference — the  latter  is  more  diffi¬ 
cult  to  achieve.  The  reason  for  this  inequity  is  that 
inference  depends  on  assumptions  that  are  far 
stronger  than  the  assumptions  on  which  consis¬ 
tency  and  unbiasedness  rest,  and  as  argued  above, 
these  assumptions  rarely  hold  in  political  science 
data  sets.  Although  these  assumptions  can  be 
relaxed  with  the  use  of  more  complicated  statisti¬ 
cal  models,  the  fact  remains  that  the  model  has  to 
be  nearly  right  for  correct  inference,  and  that  is  a 
tall  order  in  political  science. 

Of  all  the  assumptions  needed  for  correct  infer¬ 
ences,  the  most  fundamental  is  yet  to  be  discussed. 
When  introducing  the  linear  regression  model,  a 
random  sample  of  size  n  from  a  population  of  inter¬ 
est  was  assumed.  This  assumption  is  standard  in 
statistical  analysis  and  allows  traditional  statistical 
inference  to  work.  In  very  few  situations,  however, 


do  political  science  data  sets  resemble  anything 
even  close  to  a  random  sample  drawn  from  a 
population.  The  question  then  becomes  “To  what 
are  political  scientists  making  inferences?”  Whether 
the  data  comprise  a  set  of  international  wars, 
American  states,  or  Western  European  democra¬ 
cies,  there  is  no  actual  population  to  make  infer¬ 
ences  to,  and  these  samples  are  rarely  treated  as 
populations  in  and  of  themselves.  Political  scien¬ 
tists  pay  little  attention  to  this  issue,  and  most 
articles  that  make  use  of  quantitative  modeling  in 
the  discipline  make  no  mention  of  the  population 
of  interest,  how  the  sample  was  generated,  or  to 
what  the  statistical  inferences  refer. 

The  real  question  that  needs  to  be  addressed 
concerns  the  source  of  the  randomness  in  the  data. 
In  what  is  known  as  design-based  inference,  the 
population  is  seen  as  fixed,  and  the  sample  is  the 
result  of  a  stochastic  process  such  as  simple  random 
sampling  or  stratified  random  sampling.  Inferences 
are  made  in  traditional  fashion  from  the  sample  to 
the  population  from  which  it  was  drawn.  In  what  is 
known  as  model-based  inference,  the  observed  val¬ 
ues  are  seen  as  realizations  of  random  variables  and 
therefore  constitute  realizations  of  a  random  pro¬ 
cess.  Most  political  scientists  practice  model-based 
inference  as  true  random  samples  from  known 
populations  are  relatively  rare  in  the  discipline. 

Model-based  inference  comes  in  two  versions. 
Political  scientists,  when  asked  what  population 
they  are  making  inferences  to,  often  respond  by 
talking  about  a  superpopulation.  A  superpopula¬ 
tion  is  an  imaginary  population  from  which  the 
data  could  have  been  randomly  drawn  had  the 
imaginary  population  existed.  A  researcher  first 
assumes  the  existence  of  an  imaginary  population 
(the  superpopulation)  and  then  assumes  that  the 
sample  at  hand  is  drawn  randomly  from  the  imagi¬ 
nary  population.  There  are  situations  in  the  sciences 
where  assuming  a  superpopulation  makes  sense.  To 
borrow  an  example,  it  might  be  sensible  to  consider 
the  hurricanes  generated  in  the  Atlantic  Ocean  in  a 
given  year  under  certain  meteorological  conditions 
as  a  random  draw  from  the  population  of  hurri¬ 
canes  that  could  have  been  produced  in  the  Atlantic 
Ocean  in  that  year.  There  is,  in  such  a  circumstance, 
a  real  stochastic  process  that  generates  the  observa¬ 
tions.  The  same  cannot  be  said  of,  for  instance, 
election  data.  Political  scientists  are  fond  of  saying 
that  the  world  could  have  turned  out  differently, 
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and  thus,  the  data  are  like  a  random  draw  from  the 
superpopulation  of  elections  across  different  worlds. 
Unlike  the  process  that  generates  Atlantic  hurri¬ 
canes,  however,  there  is  no  actual  stochastic  process 
that  could  generate  these  imaginary  elections. 

The  second  version  of  model-based  inference  is 
closer  to  what  political  scientists  actually  do.  In  this 
version,  there  is  no  population,  and  no  pretense  is 
made  about  making  inferences  to  a  population. 
Instead,  a  model  is  proposed  that  accounts  for  the 
way  nature  produces  the  data.  Inferences  are  then 
made  back  to  features  of  the  model.  The  sample 
design  is  irrelevant  under  model-based  inference, 
which  is  why  it  fits  well  with  political  science.  The 
downside  of  model-based  inference  is  its  depen¬ 
dence  on  a  model,  which  may  be  misspecified  in 
any  of  the  ways  discussed  earlier.  If  the  model  is 
bad,  so  are  the  inferences  based  on  it.  An  aware¬ 
ness  on  the  part  of  political  scientists  that  they  are 
engaged  in  model-based  inference  is  important 
because  it  highlights  just  how  fundamental  models 
and  their  constituent  assumptions  are  to  the  prac¬ 
tice  of  data  analysis  in  political  science. 

Kevin  A.  Clarke 
University  of  Rochester 
Rochester,  New  York,  United  States 
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Quantitative  Versus 
Qualitative  Methods 

Perhaps  no  division  in  the  social  sciences  is  so 
persistent,  nettlesome,  and  poorly  understood  as 
the  division  between  quantitative  and  qualitative 
ways  of  knowing.  The  cleavage  can  be  traced  back 
to  the  first  applications  of  statistics  within  the 
disciplines  of  economics,  political  science,  and 
sociology,  and  it  became  increasingly  acute  in  the 
late  20th  century  as  quantitative  approaches 
gained  in  stature,  grew  in  complexity,  and  pushed 
qualitative  empirical  analysis  out  of  the  limelight. 
During  this  period,  the  division  between  qualita¬ 
tive  and  quantitative  methods  became  associ¬ 
ated — perhaps  inappropriately — with  the  rival 
epistemological  positions  of  positivism  and  inter- 
pretivism.  Charles  Smith  (1989)  summarizes  the 
now  familiar  standoff: 

On  the  one  hand,  there  are  those  who  argue  that 
only  through  the  application  of  quantitative 
measurements  and  methods  can  the  social  sciences 
ever  hope  to  become  “real”  sciences;  on  the  other 
hand,  there  are  those  who  claim  that  the  subject 
matter  of  the  social  sciences  is  simply  not 
amenable  to  quantification  and  all  attempts  to 
impose  such  measures  and  methods  upon  social 
behavior  is  just  so  much  nonsense,  (p.  29) 

This  entry  looks  at  the  strengths  and  weak¬ 
nesses  of  both  positions  in  terms  of  their  contribu¬ 
tions  to  the  comparability  of  findings. 

Debate  Positions 

While  there  have  been  many  attempts  to  shed  light 
on  this  persistent  division  in  the  social  sciences, 
work  on  this  question  is  generated  primarily  by 
writers  who  occupy  one  of  the  two  camps.  These 
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writers  tend  to  be  either  strong  partisans  or  vis¬ 
ceral  opponents  of  the  quantitative  worldview.  A 
chronic  dualism  besets  many  of  these  debates,  in 
large  part  because  the  distinction  between  quanti¬ 
tative  and  qualitative  forms  of  descriptive  and 
causal  inference  has  been  folded  into  the  much 
larger  and  more  “philosophical”  debate  between 
positivism  and  interpretivism.  Positivists  (or  natu¬ 
ralists),  usually  identified  with  quantitative  meth¬ 
ods,  present  their  perspective  as  hegemonic:  There 
is,  or  ought  to  be,  only  one  logic  of  inference.  The 
conclusion  of  these  scholars  is  that  either  there  are 
no  important  distinctions  between  qualitative  and 
quantitative  work  or,  to  the  extent  that  such  dis¬ 
tinctions  exist,  they  are  to  the  detriment  of  qualita¬ 
tive  scholarship.  “When  possible,  quantify”  is  the 
motto  of  this  camp.  Where  quantification  is  not 
possible,  this  camp  encourages  qualitative  scholars 
to  follow  the  logic  of  quantitative  reasoning. 
Defenders  of  qualitative  work  typically  emphasize 
the  limits  of  quantification  and  the  insights  that 
can  be  gained  through  an  interpretive  approach  to 
social  action.  Rather  than  a  unified  logic,  interpre- 
tivists  suggest  that  there  might  be  multiple  logics  at 
work  in  the  social  sciences.  These  multiple  logics 
stem  from  epistemological  or  ontological  commit¬ 
ments,  which  may  themselves  be  culturally  pre¬ 
scribed  and  political  or  historical  in  origin. 

A  third  position  runs  orthogonal  to  this  debate. 
According  to  this  view,  the  qualitative/quantitative 
division  is  inherently  ambiguous  and,  hence,  a  dis¬ 
traction  from  the  real  work  of  social  science.  In  this 
spirit,  a  recent  lexicon  focused  on  qualitative 
inquiry  notes,  in  the  entry  for  “Qualitative,”  that 
“the  adjective  does  not  clearly  signal  a  particular 
meaning”  (Thomas  A.  Schwandt,  1997,  p.  129). 
Rather,  it  is  used  as  a  “blanket  designation  for  all 
forms  of  [hermeneutic]  inquiry  including  ethnogra¬ 
phy,  case  study  research,  naturalistic  inquiry,  eth- 
nomethodology,  life  history  methodology,  narra¬ 
tive  inquiry,  and  the  like.”  In  sum,  concludes  the 
author  of  this  dictionary  of  qualitative  methods 
(with  no  apparent  sense  of  irony),  “‘qualitative 
research’  is  simply  not  a  very  useful  term  for  denot¬ 
ing  a  specific  set  of  characteristics  of  inquiry” 
(Schwandt,  1997,  p.  130).  The  inherent  fuzziness  of 
the  concepts  “qualitative”  and  “quantitative”  thus 
prompts  some  scholars  to  argue  that  the  so-called 
debate  must  be  a  red  herring,  that  is,  a  distraction 
from  the  real  issues  of  social  science  methodology. 


Thus,  one  finds  three  well-articulated  positions: 
(1)  naturalism  (one  logic  of  inquiry,  based  implic¬ 
itly  on  the  quantitative  template),  (2)  pluralism 
(multiple  logics  of  inquiry,  with  qualitative  and 
quantitative  work  founded  on  different  epistemol¬ 
ogies),  and  (3)  agnosticism.  Many  scholars  are 
sympathetic  to  the  naturalist  goal  of  a  single,  uni¬ 
fied  logic  of  inquiry;  only  in  this  fashion  can  the 
scientific  objective  of  knowledge  cumulation  be 
achieved.  However,  others  are  dubious  whether 
the  quantitative  template  provides  an  adequate 
framework  for  work  across  the  social  sciences. 
While  the  agnostic  position  is  attractive,  it  seems 
to  be  mostly  rhetorical.  Scholars  continue  to  divide 
themselves  into  camps — for  example,  departments, 
journals,  editorial  boards — and  these  camps  are 
often  classifiable  as  predominantly  qualitative  or 
quantitative.  These  two  concepts  refuse  to  be  ban¬ 
ished,  suggesting  that  there  must  be  something 
somewhere. 

As  a  way  of  cutting  into  this  subject  in  a  new 
fashion,  this  entry  proposes  a  new  way  of  thinking 
about  these  issues:  The  principal  factor  separating 
qualitative  observations  from  quantitative  obser¬ 
vations  is  the  assumed  comparability  of  the  evi¬ 
dence  at  hand.  Quantitative  observations  presume 
a  population  of  things  that  can  be  readily  mea¬ 
sured,  counted,  and  hence  compared.  Qualitative 
observations,  in  contrast,  presume  an  empirical 
field  where  individual  pieces  of  evidence  are  not 
directly  comparable  with  one  another,  at  least  not 
to  the  extent  that  they  can  be  precisely  measured. 
In  this  sense,  quantitative  work  is  appropriately 
labeled  “nomothetic”  and  qualitative  work,  “idio- 
graphic.”  The  key  point  is  that  the  difference 
between  these  two  kinds  of  observations  rests  on 
the  presumed  comparability  of  adjacent  observa¬ 
tions,  not  (at  least  not  directly)  on  the  size  of  the 
sample,  the  style  of  presentation  (numbers  or  nar¬ 
rative),  epistemology,  ontology,  or  the  formal 
structure  of  the  method. 

The  purpose  of  this  entry,  it  should  be  stressed, 
is  not  to  reinforce  existing  cleavages  in  the  social 
sciences.  Rather,  it  intends  to  draw  attention  to  an 
important  underlying  issue  in  these  debates,  an 
issue  that  has  not  received  much  recognition  in  the 
literature.  This  is  not  the  only  issue  underlying  the 
qualitative/quantitative  divide,  but  it  is  a  funda¬ 
mental  issue.  If  analysis  is  based  on  comparison, 
the  central  methodological  question  is  what 
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researchers  can  reasonably  compare  and  how  pre¬ 
cise  those  comparisons  can  be.  In  this  case,  there 
are  legitimate  differences  of  opinion,  and  they  are 
not  the  sort  that  can  be  empirically  proven. 

Comparability 

Comparable  observations  are  understood  as  mem¬ 
bers  of  the  same  population  and  therefore  poten¬ 
tial  members  of  the  same  sample.  They  are  exam¬ 
ples  of  a  similar  phenomenon.  They  are  apples  and 
apples,  rather  than  apples  and  oranges,  to  use  the 
time-honored  metaphor.  Note  that  comparing 
apples  and  oranges  is  not  prohibited;  however,  to 
do  so  one  must  adopt  a  higher-order  concept — 
such  as  fruit — according  to  which  apples  and 
oranges  are  similar.  This  commonsense  meaning  of 
comparability  is  widely  understood  and  agreed  on. 
But  what  does  it  mean  for  items  to  be  comparable 
within  the  context  of  social  science  research? 
Surely,  it  is  more  than  shared  membership  in  an 
arbitrary  linguistic  category. 

First,  comparable  observations  must  share  a  set 
of  relevant  descriptive  attributes  (dimensions). 
This  is  what  makes  them  comparable.  The  obser¬ 
vations  need  not  demonstrate  the  same  values  for 
those  attributes.  Each  observation  in  a  sample  may 
“score”  differently  on  each  attribute  in  either 
quantitative  or  qualitative  terms — high/low,  pres¬ 
ent/absent,  and  so  on.  But  each  observation  must 
be  scaleable,  and  the  attribute  must  have  the  same 
meaning  (roughly  or  precisely)  across  the  contexts 
in  which  it  is  being  compared.  This  can  be  labeled 
descriptive  comparability ,  suggesting  that  it  is  a 
fundamental  feature  of  conceptual  validity.  The 
defining  attributes  of  a  concept  must  be  valid 
across  the  designated  observations.  Otherwise,  the 
concept  is  being  “stretched.” 

A  second  kind  of  comparability  refers  to  the 
interrelationship  of  two  factors  in  a  causal  analy¬ 
sis,  the  cause  (or  vector  of  causal  factors)  X  and 
the  outcome  Y.  The  specified  X/Y  relationship 
must  hold  across  the  chosen  observations.  This  is 
labeled  causal  comparability  (a  looser  version  of 
unit  homogeneity ). 

Thus,  there  are  two  kinds  of  comparability: 
descriptive  and  causal.  The  first  is  presumed  in  the 
second.  If  a  sample  of  observations  is  assumed  to 
be  causally  comparable,  then  it  must  also  be 
descriptively  comparable.  In  statistical  research, 


the  assumption  of  unit  homogeneity  makes  this 
explicit,  but  it  must  also  be  true  more  generally, 
for  causal  comparability  can  exist  only  in  the  pres¬ 
ence  of  descriptive  comparability. 

Precision  and  Explicitness 

The  underlying  issues  in  the  enduring  qualitative/ 
quantitative  debate  are  tethered  to  the  problem  of 
comparability,  as  defined  above.  To  “quantify”  an 
observation  is  to  formulate  it  in  terms  that  can  be 
explicitly  and  precisely  compared  across  a  large 
number  of  observations,  that  is,  where  a  concept 
can  be  expressed  on  a  numerical  scale,  a  metric,  or 
a  variable  (herein  these  three  words  are  used  inter¬ 
changeably).  Quantitative  observations  combine 
natural-language  words  (nouns,  verbs,  or  adjectives) 
with  numbers  according  to  some  preassigned  met¬ 
ric.  It  is  a  question  of  measurement,  which  can  be 
regarded  as  the  assignment  of  numbers  to  phenom¬ 
ena  in  accordance  with  some  uniformly  applied  rule. 

Let  us  begin  with  a  discussion  of  the  concept  of 
precision.  Note  that  to  simply  recode  a  dichoto¬ 
mous  natural-language  category  as  a  series  of 
binary  numbers  does  not  make  it  any  more  precise. 
Thus,  0/1  is  no  more  precise  than  pregnant/not 
pregnant.  However,  numerical  scales  offer  the  pos¬ 
sibility  of  greater  precision  when  the  number  of 
categories  surpasses  the  categories  inherent  in 
natural  language,  as  well  as  in  circumstances 
where  these  categories  can  be  understood  as  posi¬ 
tions  on  a  continuous  (interval)  scale.  To  say  that 
one  room  is  warmer  than  another  is  comparative, 
but  it  is  less  precise  than  saying  that  one  room  is 
21  degrees  Celsius  and  the  other  is  16  degrees 
Celsius.  Thus,  in  many  situations,  the  use  of  a 
quantitative  idiom  allows  for  more  precise  com¬ 
parisons  across  units.  In  all  situations,  the  use  of  a 
scale  is  at  least  as  precise  as  natural  language  (in 
the  sense  that  no  precision  is  lost  in  the  translation 
of  words  to  numbers).  Again,  to  use  a  quantitative 
idiom  does  not  entail  great  precision;  it  entails  the 
possibility  of  great  precision  (as  well  as  a  more 
explicit  set  of  comparisons). 

Precision  should  not  be  confused  with  cer¬ 
tainty.  Quantitative  or  qualitative  statements  may 
be  uttered  with  more  or  less  confidence.  For 
example,  one  might  say,  “I  would  guess  that  the 
room  is  21  degrees  Celsius”  or  “I  would  guess  that 
the  room  is  warm.  ”  With  quantitative  statements, 
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a  mathematical  indicator  of  uncertainty  may 
accompany  the  point  score.  The  statement  that 
quantitative  observations  are  more  precise  refers 
to  the  point  estimate,  not  the  degree  of  uncertainty 
(or  dispersion). 

Quantitative  statements  are  both  more  precise  (at 
least  potentially)  and  more  explicit.  This  is  because 
the  very  act  of  creating  a  numerical  scale  requires  a 
set  of  explicit  comparisons  and  an  explicit  compari¬ 
son  set — a  domain.  Scales  cannot  be  developed  in 
highly  specific  contexts.  Suppose  that  we  classify  the 
terrorist  attacks  on  September  11,  2001,  as  a  single 
event  consisting  of  four  jointly  organized  attacks.  If 
we  treat  the  attacks  of  September  11  as  a  single 
event  rather  than  four  separate  events,  the  idea  of 
developing  a  barometer  for  9/11  is  nonsensical 
because  there  is  nothing  to  which  to  relate  it — scales 
make  sense  only  relative  to  classes  of  events  (unless, 
of  course,  that  event  is  being  used  as  a  metric  for 
understanding  other  events,  in  which  case  it  becomes 
a  comparative  metric).  Weather  can  be  measured 
precisely  because,  for  one  thing,  there  is  lots  of 
weather  to  measure,  and  temperature  is  thought  to 
have  the  same  general  meaning  in  many  different 
contexts.  Granted,  all  scales  are  bounded;  there  is 
no  universal  scale  (a  scale  applying  to  everything). 
Some  things,  such  as  metaphysics,  have  no  tempera¬ 
ture;  the  concept  of  temperature  (and  whatever 
scale  might  be  used  to  measure  it)  does  not  apply  in 
this  domain.  The  point  is  that  relative  to  words, 
which  are  only  loosely  and  implicitly  comparative, 
scales  are  precisely  and  explicitly  comparative,  and 
their  range  is  usually  quite  broad  (otherwise,  why 
bother  to  develop  a  systematic  scale?). 

Now,  it  is  true  that  some  natural-language 
adjectives,  such  as  warm,  are  explicitly  compara¬ 
tive.  But  most  words  are  more  ambiguous.  This  is 
apparent  in  the  extra  locutions  that  are  necessary 
to  render  ordinary  language  comparative.  One 
must  clarify  “warmer  than,”  “more  chairlike 
than,”  and  so  on;  whereas,  to  append  such  judg¬ 
ments  to  a  numerical  scale  is  redundant.  (One  does 
not  say,  “The  temperature  is  21  degrees  Celsius, 
warmer  than  18  degrees  Celsius.”)  Numerical 
scales  are  already  comparative,  and  no  matter  what 
one  does  with  a  numerical  observation,  it  cannot 
lose  its  precise,  explicitly  comparative  quality. 

To  be  sure,  if  one  labels  an  object  with  a  noun — 
for  example,  chair — one  is  implicitly  (if  not  explic¬ 
itly)  comparing  it  with  other  objects:  nonchairs. 


Language  has  this  universal  aspect;  if  we  call  some¬ 
thing  X,  we  imply  that  other  things  are  not  X,  or 
less  X  or  more  X.  However,  the  comparisons  are 
vague.  It  is  unclear,  for  example,  where  a  chair 
leaves  off  and  a  stool  begins,  for  few  words — and 
very  few  key  words  in  social  science — have  crisp 
boundaries.  More  important,  most  words  are  mul¬ 
tivalent;  they  have  more  than  one  attribute  and, 
consequently,  can  mean  more  than  one  thing. 
Thus,  to  say  that  an  object  is  “not  A”  could  mean 
a  number  of  different  things,  depending  on  the 
attribute)  s)  that  is  intended  by  the  author  or  under¬ 
stood  by  the  reader.  Moreover,  a  word  usually 
gains  meaning  by  its  context,  and  this  context  is 
undefined  in  settings  other  than  what  the  author  is 
studying.  Additionally,  the  other  objects  that  are 
not  X  are  typically  not  defined,  in  which  case  the 
larger  population  of  cases  (the  domain  of  the  infer¬ 
ence)  remains  implicit.  Finally,  words  are  contin¬ 
gent  on  a  particular  natural  language,  and  this 
imposes  another  sort  of  contextual  boundary 
against  comparison.  (In  contrast,  the  numeral  “5” 
and  the  operator  “=”  mean  the  same  thing  every¬ 
where — since  the  adoption  of  a  uniform  language 
of  mathematics — and  they  also  mean  the  same 
thing  in  all  the  contexts  that  they  might  be 
employed. ) 

Frequently,  natural-language  comparisons  are 
without  any  obvious  comparative  reference  point. 
The  statement  “Caesar  crossed  the  Rubicon”  is 
comparative  in  so  many  possible  ways  that  it 
might  be  considered  noncomparative:  He  did  not 
cross  it,  he  did  not  cross  the  Tiber,  it  was  not 
Brutus  who  crossed  the  Rubicon,  and  so  on.  If  this 
is  comparative,  it  is  so  in  the  most  minimal  sense. 
Yet  it  is  important  for  the  purposes  here  to  recog¬ 
nize  that  comparison  is  a  matter  of  degree. 
Qualitative  observations  can  be  more  or  less  com¬ 
parative,  but  quantitative  observations  are  almost 
always  more  precisely  and  explicitly  comparative. 

One  final  clarification  is  in  order.  This  entry  has 
stated  that  all  quantitative  statements  about  the 
world  invoke  a  class  of  events;  these  form  the  basis 
for  the  metric.  However,  it  does  not  follow  that 
quantitative  statements  about  the  world  are  neces¬ 
sarily  broader  in  scope  than  qualitative  statements. 
Indeed,  the  very  fuzziness  of  natural  language  gives 
us  the  license  to  generalize — for  one  can  avoid  saying 
anything  very  specific — while  the  exactness  of  quan¬ 
tification  may  rein  in  the  temptation  to  generalize.  It 
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follows  that  qualitative  statements  can  be  either 
very  restricted  in  scope  (as  in  the  previous  example 
about  the  singular  event  of  9/11)  or  extremely 
broad.  Saying  something  in  words  does  not  affect 
the  scope  of  the  inference.  Saying  something  with 
a  metric,  however,  presupposes  a  class  of  referents, 
which  is  to  say  it  must  make  reference  to  more 
than  one  discrete  event  (and  these  reference  points 
must  be  fairly  precise  and  explicit). 

Trade-Offs:  Thick  and  Thin  Description 

In  principle,  any  qualitative  observation  can  be 
converted  to  quantitative  form,  as  attested  by  the 
plethora  of  methods  and  software  designed  to 
perform  this  function,  for  example,  NUD*IST  soft¬ 
ware,  Computer-Assisted  Qualitative  Data  Analy¬ 
sis  Software  (CAQDAS),  and  various  narrative- 
based  methods,  as  well  as  more  generic  forms  of 
content  analysis.  There  is  no  such  thing  as  a  non- 
quantifiable  observation  because  any  single  state¬ 
ment  that  can  be  made  about  one  phenomenon 
could  also  be  made  about  another  phenomenon, 
thus  providing  the  possibility  of  some  sort  of  scale. 

Yet  it  is  not  clear  that  one  would  always  want  to 
make  this  transposition  from  words  to  words-with- 
numbers  (variables).  Indeed,  there  are  usually  costs 
associated  with  this  conversion.  The  trade-off  may 
be  understood  in  terms  of  precision  and  explicitness 
on  the  one  hand  and  depth  (or  richness)  on  the 
other.  More  concisely,  the  analyst  has  the  option  of 
describing  thinly  (with  less  depth)  or  describing 
thickly  (with  more  depth).  The  fact  is  that  words  are 
usually  multivalent;  they  generally  carry  a  variety  of 
attributes,  some  of  which  may  not  even  be  logically 
consistent.  This  is  particularly  true  of  key  words  in 
social  science — for  example,  democracy,  justice, 
corporatism,  political  party.  When  one  of  these 
words  is  converted  to  a  measurable  variable,  which 
is  to  say  to  a  precise  scale,  the  researcher  is  generally 
forced  to  drop  one  or  more  of  its  attributes.  For  not 
all  these  attributes  will  be  precisely  applicable  to  the 
class  of  phenomena  that  the  concept  is  now  (quite 
explicitly)  intended  to  cover.  This  does  not  necessar¬ 
ily  refer  to  an  expansion  in  scope,  for  natural  lan¬ 
guage  can  reach  as  far  as  mathematical  variables. 
But  in  making  the  comparison  precise  and  explicit, 
it  is  usually  necessary  to  narrow  the  definition  of  the 
natural-language  concept.  It  could  be  that  the  inten¬ 
sion  of  the  natural-language  concept  is  also  quite  a 


bit  narrower  than  the  full  set  of  attributes  normally 
(in  ordinary  language)  associated  with  the  term.  An 
author  is  free  to  define  a  term  as  he  or  she  sees  fit; 
qualitative  work  is  not  wedded  to  ordinary  lan¬ 
guage.  The  point  is,  in  creating  a  variable,  one  is 
forced  to  make  explicit  choices  about  which  defini¬ 
tional  attributes  apply  properly  to  a  class  of  phe¬ 
nomena  and  which  do  not.  This  is  likely  to  prompt 
some  narrowing  of  the  semantic  options.  And  this  is 
why  the  choice  to  quantify  a  concept  is  considered 
a  move  toward  thin  description.  More  explicit  com¬ 
parisons  can  be  made,  but  they  are  narrowed  down 
to  one  or  several  dimensions  (the  chosen  attributes 
of  the  core  concept). 

Similarly,  if  one  chooses  to  particularize  rather 
than  to  generalize,  natural  language  is  the  obvious 
vehicle  of  choice.  As  previously  pointed  out,  it  is 
inappropriate  to  construct  a  scale  when  the  class  of 
instances  under  investigation  is  one  or  several.  A 
scale  presupposes  a  population.  In  contrast,  a 
word  can  be  used  in  a  highly  specific  context;  it 
does  not  presuppose  an  explicit  comparison  with 
other  instances.  This  means  that  in  describing  the 
singularity  of  an  event,  one  is  drawn  toward  the 
implements  of  natural  language.  The  lack  of  per¬ 
fect  commensurability  between  words  used  in  one 
context  and  the  same  words  used  in  another  con¬ 
text  allows  the  researcher  the  facility  to  elucidate 
what  is  different — categorically  (qualitatively),  not 
marginally  (quantitatively) — about  that  phenome¬ 
non.  A  very  high  score  on  some  scale  can  be 
(indeed,  must  be)  indicated  with  a  quantitative 
metric.  But  a  very  different  kind  of  score  requires 
a  word,  perhaps  a  series  of  words. 

In  short,  there  are  gains  and  losses  in  the  trans¬ 
position  of  words  to  numbers,  and  vice  versa. 
What  is  interesting  about  this  classic  debate  is  that 
both  may  be  described  as  “reductionist.”  Quan¬ 
titative  studies  are  often  accused  of  reducing  reality 
and,  in  the  process,  distorting  that  reality  to  fit  the 
austere  requirements  of  the  quantitative  format. 
Each  piece  of  reality  must  be  sliced  up  into  vari¬ 
ables,  and  these  must  be  comparable  across  all 
observations.  Qualitative  studies  are  accused  of  a 
different  kind  of  reduction,  in  which  a  subject  is 
shrunk  down  to  a  highly  particular  context — the 
country,  neighborhood,  or  event  of  special  interest. 

These  contrasting  notions  of  reduction  and 
expansion  are  best  understood  as  arguments  over 
comparability.  Scholars  inclined  toward  the  tools 
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provided  by  natural  language  are  often  keen  to 
explore  a  wide  variety  of  different  aspects  in  a  par¬ 
ticular  setting.  They  wish  to  explore  multiple 
dimensions  of  one  thing.  ( Dimension  is  employed 
as  a  synonym  for  variable  in  this  context.)  Qual¬ 
itative  analysis  is  thus  often  focused  inward,  like  a 
vast  funnel.  Many  comparisons  are  made,  but  they 
are  all  understood  as  features  of  the  same  general 
topic,  existing  in  one  time  and  place.  Natural  lan¬ 
guage  is  adept  for  this  purpose,  for  it  is  rich,  tex¬ 
tured,  context  specific,  and  multivalent.  It  eluci¬ 
dates  a  wealth  of  details  about  a  person,  event,  or 
situation.  This  is  why  some  scholars  find  a  natural 
affinity  between  qualitative  tools  and  ethnographic, 
historical,  and — more  broadly — interpretivist  styles 
of  research.  In  contrast,  scholars  inclined  to  a 
numerical  understanding  of  the  world  are  drawn  to 
comparisons  that  are  broad  and  thin.  They  intend 
to  explore  one  particular  dimension  of  many 
things. 

The  interesting  aspect  of  this  familiar  contrast  is 
that  both  qualitative  and  quantitative  scholars  per¬ 
ceive  their  work  as  conforming  to  the  natural  bend 
of  the  universe.  Qualitative  scholars  usually  assume 
a  case-centered  approach.  Different  aspects  of  the 
same  cases  are  grouped;  they  go  together.  Quan¬ 
titative  scholars  are  drawn  toward  a  dimensional 
approach  to  comparability.  A  single  aspect  (dimen¬ 
sion)  of  an  entity  is  assumed  to  be  comparable 
across  multiple  cases.  This  is  sometimes  referred  to 
as  variable-centered  analysis.  While  for  a  qualitative 
scholar  it  would  seem  natural  to  explore  everything 
about  A,  for  a  quantitative  scholar  it  would  seem 
more  natural  to  explore  one  thing  about  A,  B,  and 
C.  Underlying  scholars’  choices  of  method  are  cer¬ 
tain  assumptions  about  cross-case  comparability. 
The  tools  scholars  choose — words  or  numbers — 
are,  in  part,  the  expression  of  their  relative  confi¬ 
dence  in  the  ability  to  compare  across  entities  in  a 
given  research  context.  This  is  not  to  deny  that 
many  scholars  use  both  words  and  numbers.  The 
point  is  that,  within  a  given  context,  the  likelihood 
of  choosing  one  or  the  other  strategy  is  influenced 
by  assumptions  about  case  comparability. 

Construing  Meaning  Versus 
Analyzing  Behavior 

The  importance  of  comparison  is  illustrated  in  one 
of  the  most  common  defenses  of  qualitative  work 


in  the  interpretive  mode.  It  is  often  said  that 
qualitative  analysis  focuses  on  human  meanings, 
while  quantitative  analysis  focuses  on  the  behav¬ 
ioral  components  of  human  reality — that  is, 
actions,  institutions,  or  events.  From  the  interpre¬ 
tivist  perspective,  the  business  of  social  science  is 
one  of  construing  meaning,  not  analyzing  behav¬ 
ior.  Yet  one  might  reasonably  inquire,  why  not 
study  human  meanings  quantitatively — that  is, 
with  scalar  measures — in  addition  to  studying 
them  qualitatively? 

One  rationale  is  that  human  meaning  is  con¬ 
structed  through  language  (which  establishes  the 
categories  by  which  we  understand  the  world); 
therefore,  it  makes  sense  to  study  these  linguistic 
categories  through  other  linguistic  categories  rather 
than  the  (somewhat  alien)  categories  of  numbers. 
Herbert  Blumer  (1969)  concludes. 

The  crucial  limit  to  the  successful  application  of 
variable  analysis  to  human  group  life  is  set  by  the 
process  of  interpretation  or  definition  that  goes 
on  in  human  groups.  This  process,  which  I 
believe  to  be  the  core  of  human  action,  gives  a 
character  to  human  group  life  that  seems  to  be  at 
variance  with  the  logical  premises  of  variable 
analysis,  (p.  132) 

The  problem,  though,  is  not  that  the  analysis  of 
linguistic  phenomena  must  be  carried  out  with 
linguistic  tools;  after  all,  the  contemporary  disci¬ 
pline  of  linguistics  is  heavily  quantitative.  Indeed, 
quantitative  observations  are  necessarily  rooted  in 
language  because  all  scales  must  be  expressible  in 
a  linguistic  category;  every  variable  must  have  a 
name.  What  is  it,  then,  that  seems  so  problematic 
about  collecting  quantitative  observations  about 
semantic  realities?  Why  is  there  no  quantitative 
hermeneutics? 

The  problem  is  at  the  core  of  the  qualitative/ 
quantitative  dispute:  the  problem  of  representing 
human  meaning  across  diverse  contexts  in  an  ex¬ 
plicit  and  precise  fashion.  It  is  a  problem  of  compa¬ 
rability,  not  of  language  per  se,  that  is  at  issue.  And 
what  makes  it  so  problematic  is  a  basic  feature  of 
human  life.  The  ways  in  which  we  make  sense  of 
our  lived  experience  are  extraordinarily  diverse — 
through  time,  across  cultures,  and  across  individu¬ 
als.  It  is  difficult  to  reduce  this  complexity  to  stan¬ 
dard  categories,  as  quantitative  knowledge  requires. 
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Consider  the  question  of  human  happiness 
(variously  understood  as  welfare  or  quality  of  life), 
which  has  attracted  increasing  attention  on  the 
part  of  psychologists,  sociologists,  and  economists. 
The  following  scale  was  developed  by  the  U.S. 
Bureau  of  the  Census  to  measure  the  quality  of  life: 
(a)  terrible,  (b)  unhappy,  (c)  mostly  dissatisfied, 
(d)  mixed,  (e)  mostly  satisfied,  (f)  mostly  pleased, 
(g)  delighted.  The  question  we  might  ask  ourselves 
is  whether  human  happiness  is  accurately  captured 
in  these  categories.  This  is  to  say,  does  a  person 
who  answers  b  =  unhappy  have  a  lower  quality  of 
life  than  a  person  who  responds  with  d  =  mixed ? 
There  are  potential  problems  of  conceptual  validity, 
and  the  issue  is  not  simply  a  lack  of  sophistication 
on  the  part  of  researchers.  More  fundamentally, 
the  problem  is  that  human  meanings — such  as  hap¬ 
piness — are  resistant  to  uniform  comparisons 
across  diverse  subjects.  One  feels  much  more  com¬ 
fortable  with  imprecise  comparisons  expressed  in 
the  looseness,  and  contextual  specificity,  of  natural 
language.  We  might  say,  for  example,  that  “Smith 
is  happy,”  implying  that  across  some  undefined 
population,  Smith’s  level  of  happiness  is,  let  us  say, 
somewhere  above  the  mean.  However,  we  are 
probably  reluctant  to  assign  a  precise  score  to 
Smith’s  happiness,  because  such  a  score  would  pre¬ 
sume  a  precise  comparison  with  all  others  in  the 
population. 

This  is  not  meant  to  imply  that  all  human  mean¬ 
ings  are  unquantifiable.  Nor  is  it  meant  to  imply 
that  social  scientists  should  abstain  from  quantify¬ 
ing  difficult  and  ambiguous  emotions  such  as 
human  happiness.  There  may  be  much  to  learn 
from  the  quantification  of  abstract  concepts  that 
summarize  a  great  deal  of  information  about 
human  experience.  The  point  is  simply  that  the 
profitability  of  quantifying  varies  with  the  topic, 
and  the  core  issue  is  the  comparability  of  the  phe¬ 
nomena  across  the  chosen  population.  Certain 
topics  are  more  recalcitrant,  and  these  tend  to  be 
tied  up  with  the  generation  of  meaning  (values, 
ideas,  intentions)  rather  than  observable  behavior. 
It  is  noteworthy,  for  example,  that  some  human 
intentions  seem  to  be  more  quantifiable  than  oth¬ 
ers,  and  these  tend  to  be  those  where  a  high  level 
of  comparability  across  individuals  or  across  cul¬ 
tures  can  be  assumed.  Thus,  scholars  routinely 
measure  the  concept  of  “economic  voting”  because 
the  notion  “pursuing  material  interests”  seems 


comparable,  and  hence  valid,  across  individuals 
and  populations.  In  contrast,  the  measurement  of 
religious  influences  on  voting  behavior  is  much 
more  difficult  to  quantify  because  it  is  more  diffi¬ 
cult  to  compare. 

Conclusion 

Approaches  in  the  social  sciences  are  often  under¬ 
stood  as  qualitative  or  quantitative.  Scholars  differ 
in  their  opinions  about  the  utility  of  this  distinc¬ 
tion.  Some  dismiss  it  as  a  red  herring;  others  feel 
that  there  is  good  justification  for  the  division.  Still 
others  suggest  that  there  is  no  plausible  way  of 
discarding  the  distinction  but  that  it  is  greatly  mis¬ 
understood,  and  the  key  to  this  misunderstanding 
is  to  be  found  in  assumptions  about  comparability. 

There  is  indeed  a  difference  in  basic-level  assump¬ 
tions  between  statements  that  are  quantitative  (i.e., 
understood  through  a  numerical  scale,  a  metric,  or 
a  variable)  and  those  that  are  qualitative  (i.e., 
expressed  in  natural  language).  Quantitative  de¬ 
scriptive  statements  presuppose  a  class  of  cases 
that  can  be  compared  in  an  explicit  and  precise 
manner.  To  measure  phenomenon  X  is  to  impose 
a  very  specific  metric  on  it,  one  that  is  explicitly 
comparative  (since  other  phenomena  in  this  same 
category  are  assumed  to  be  scoreable).  Qualitative 
descriptive  statements  do  not  make  any  such  pre¬ 
suppositions.  There  may  or  may  not  be  an  identifi¬ 
able  class  of  comparable  cases  that  can  be  mea¬ 
sured  along  some  set  of  dimensions.  Often,  the 
assumption  is  quite  the  reverse — particularizing 
rather  than  generalizing.  Thus,  to  quantify  some¬ 
thing  is  to  compare  in  an  explicit  and  precise  man¬ 
ner.  To  qualify  is  to  leave  such  comparisons  open; 
one  may  or  may  not  engage  in  explicit  compari¬ 
sons  with  adjacent  cases,  and  any  comparisons 
made  are  unlikely  to  be  very  precise.  While  this 
might  seem  to  indicate  a  distinct  advantage  for 
quantitative  work,  this  entry  also  shows  that  there 
are  costs  to  assuming  a  quantitative  idiom.  Not 
only  must  the  cases  be  (actually)  comparable,  but 
there  is  usually  some  loss  of  information  since 
words  are  usually  multivalent  and  metrics  are  usu¬ 
ally  unidimensional  (or  at  most  combine  several 
dimensions).  It  is  not  clear  that  scholars  always 
gain  in  analytic  leverage  by  moving  from  words  to 
numbers.  They  do,  however,  make  different  sorts 
of  comparisons. 
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The  choice  between  math  and  natural  language 
as  tools  of  social  science  is,  therefore,  a  highly  con¬ 
sequential  choice.  Methodological  tools  help  us 
reconstruct  the  empirical  world;  they  are  not  the¬ 
ory  neutral.  In  this  respect,  the  division  between 
math  and  language  is  akin  to  the  influence  that  the 
early  anthropologists  and  linguists  assigned  to  lan¬ 
guage.  Different  languages  divide  the  world  into 
different  packages;  they  encourage  us  to  visualize 
things  in  different  ways.  So,  arguably,  do  the  dif¬ 
ferent  “languages”  of  mathematics  and  ordinary 
speech.  Quantitative  tools  help  us  compare  and, 
hence,  generalize;  qualitative  tools  encourage  us  to 
differentiate. 

It  is  quite  another  thing,  however,  to  disentan¬ 
gle  the  causal  priority  of  methodologies  and  ontol¬ 
ogies.  Do  cultural  anthropologists  use  qualitative 
tools  because  they  envision  a  lumpy  universe,  or 
do  they  see  a  lumpy  universe  because  they  perceive 
it  with  qualitative  tools?  About  all  one  can  say 
with  any  degree  of  confidence  is  that  there  is  a 
strong  synergy  between  methods  and  ontologies. 
One  can  presume  that  they  are,  at  the  very  least, 
strongly  reinforcing.  This  may  help  account  for  the 
virulence  and  endurance  of  this  central  cleavage  in 
the  social  sciences  today. 

It  is  not  merely  a  matter  of  numbers  versus 
words  or  a  debate  about  what  can  or  cannot  be 
quantified.  More  fundamentally,  the  venerable 
debate  represents  fundamental  disagreements  over 
how  precise,  explicit,  and  extensive  social  science 
comparisons  ought  to  be.  Those  who  resist  numer¬ 
ical  analysis  are  dubious  about  the  validity  of  com¬ 
parisons.  They  see  no  need  to  enhance  the  precision 
or  explicitness  of  comparisons  because  they  do  not 
seek  to  compare  in  the  first  place,  or  they  seek  a 
more  restricted  ambit  of  comparative  reference 


points.  Those  who  embrace  quantification  are 
more  comfortable  with  such  comparisons.  This 
debate  has  been  engaged  at  many  levels — across 
individuals,  levels  of  government,  cultures,  and 
time  periods — and  over  many  years.  It  shows  no 
sign  of  dissipating. 
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Racism 


Racism  is  a  set  of  beliefs,  practices,  and  social  struc¬ 
tures  that  treats  groups  of  human  beings  socially 
defined  by  unalterable,  often  physical,  attributes 
(races)  as  inherently  unequal.  Racism  is  a  form  of 
subordination  and  exclusion.  It  is  a  part  of  the 
power  structure  of  institutions  and  social  relations. 
Racism  is  sustained  by  coercion  and  consent,  and  it 
is  expressed  in  prejudice,  discrimination,  oppres¬ 
sion,  violence,  or,  in  some  extreme  cases,  genocide. 
This  broader  perspective  on  racism,  epitomized  in 
the  work  of  William  Edward  Burghardt  Du  Bois 
(1868-1963),  is  dominant  in  political  science,  soci¬ 
ology,  and  social  anthropology. 

In  social  and  political  psychology,  the  term  is 
used  more  narrowly  to  describe  a  set  of  beliefs  or 
attitudes.  William  Graham  Sumner  (1840-1910) 
took  such  an  approach,  identifying  racism  as  a 
specific  form  of  ethnocentrism  related  to  the  per¬ 
ception  of  ingroup/outgroup  relations.  Racism  is  a 
set  of  prejudices  that  enables  and  legitimates 
exploitation  and  scapegoating  based  on  the  “over¬ 
generalization”  that  the  different  capacities  and 
characteristics  of  a  human  being  are  determined  by 
his  or  her  belonging  to  a  group.  This  standpoint  is 
often  associated  with  the  work  of  Gordon  Allport 
(1897-1967). 

In  everyday  life,  the  term  racism  is  often  used 
loosely  with  respect  to  subordination  or  hostility 
toward  a  group. 

From  a  historical  perspective,  racism  is  con¬ 
sidered  as  a  modern  phenomenon.  Besides  the 


discussion  on  the  existence  of  “proto-racism”  in 
ancient  Greek  or  Rome,  the  emergence  of  modern 
racism  is  connected  to  the  Enlightenment  and  the 
appearance  of  scientific  theories  of  the  evolution  of 
humankind.  The  search  for  the  biological  founda¬ 
tion  of  human  behavior  led  to  essentialist  interpre¬ 
tations  of  the  differences  between  groups  of  human 
beings,  although  the  analytical  distinction  between 
biology  and  culture  was  increasingly  used:  One  of 
the  characteristics  of  racism  is  that  the  cultural 
characteristics  and  potentials  of  groups  are  seen  as 
basically  determined  by  biological  differences. 

Politically,  racism  is  associated  with  conquest, 
colonialism,  enslavement,  and  genocide.  With  the 
emergence  of  nationalism,  racism  became  part  of 
the  social  construction  of  national  homogeneity. 
The  emphasis  on  the  cultural  uniqueness  of  a 
nation-state  led  to  certain  types  of  culturally  based 
exclusionary  ideologies — for  example,  in  romantic 
nationalism,  often  associated  with  the  ideas  of 
Johann  Gottfried  Herder  (1744-1803).  The  demo¬ 
cratic  revolutions  on  both  sides  of  the  Atlantic  led 
to  the  first  attempts  at  emancipation  of  Blacks 
from  slavery  and  Jews  from  the  ghettos.  Contrary 
to  those  efforts  at  emancipation,  racism  as  a 
system  of  beliefs  became  further  elaborated,  as 
exemplified  in  the  work  of  Arthur  de  Gobineau 
(1816-1882).  He  combines  the  idea  of  there  being 
essential  differences  between  human  “races”  with 
the  legitimization  of  the  political  subordination  of 
one  group  to  another  and  the  idea  that  “race  mix¬ 
ing”  contributes  to  the  decline  of  humankind. 

At  the  turn  of  the  20th  century,  scientific  racism 
became  an  accepted  part  of  academia,  associated 


2197 


2198  Racism 


with  new  sciences  such  as  eugenics  (Francis  Galton, 
1822-1911)  and  the  development  of  intelligence 
tests.  Especially,  newly  developed  statistical  meth¬ 
ods  were  used  to  “prove  scientifically”  the  worth 
of  different  races.  At  the  same  time,  racist  political 
regimes  stabilized,  such  as  the  institutionalized 
system  of  racism  embodied  in  the  “Jim  Crow” 
laws  of  the  U.S.  South.  The  cruel  climax  of  such 
racist  political  regimes  was  Nazi  Germany,  with  its 
legally  implemented  idea  of  “race  purity”  (Nurem¬ 
berg  Laws  of  1935;  German:  Niirnberger  Gesetze) 
and  the  attempt  to  exterminate  an  entire  part  of 
the  German  population. 

The  word  racism  came  into  use  in  English  in  the 
1930s;  at  first,  it  was  used  mainly  to  describe  the 
Nazi  regime.  After  World  War  II,  it  was  increas¬ 
ingly  used  to  describe  colonial  relations,  especially 
between  Africans  and  Europeans,  and  Black- 
White  relations  in  the  United  States  and  South 
Africa.  During  the  period  of  moral  revulsion 
against  racism  after  World  War  II,  scientific  racism 
was  finally  largely  disavowed.  International  orga¬ 
nizations  such  as  the  United  Nations  Educational, 
Scientific  and  Cultural  Organization  (UNESCO) 
started  to  mobilize  against  racism.  In  1950, 
UNESCO  declared  in  the  statement  “The  Race 
Question”  that  it  is  one  of  the  central  goals  of  the 
organization  to  fight  racism  and  suggested  abolish¬ 
ing  the  term  race  altogether: 

National,  religious,  geographic,  linguistic  and 
cultural  groups  do  not  necessarily  coincide  with 
racial  groups;  and  the  cultural  traits  of  such 
groups  have  no  demonstrated  genetic  connexion 
with  racial  traits.  Because  serious  errors  of  this 
kind  are  habitually  committed  when  the  term 
“race”  is  used  in  popular  parlance,  it  would  be 
better  when  speaking  of  human  races  to  drop  the 
term  “race”  altogether  and  speak  of  ethnic 
groups.  (UNESCO,  1952,  p.  99) 

As  noted  by  Ashley  Montagu  (1998),  this 
report  was  influenced  by  the  work  of  the  anthro¬ 
pologist  Franz  Boas  (1858-1942),  who  defined 
race  as  a  purely  social  construct.  Antiracism 
became  an  integral  part  of  social  movements  such 
as  the  anticolonial  Black  liberation  movement 
and  the  anti-apartheid  movement.  The  disaggre¬ 
gation  of  the  American  South  and  the  abolition 
of  apartheid  were  important  victories  in  the  anti¬ 
racist  struggle. 


Contemporary  Approaches  to  Racism 

Today,  the  following  four  different  foci  in  political 
research  on  racism  can  be  seen: 

1.  The  changing  faces  of  racism  entrenched  into 
the  global  order  constitute  an  important  topos. 
Racism  as  a  transnational  process  has  been  a  com¬ 
mon  theme  in  political  analysis  since  the  19th  cen¬ 
tury.  Today,  global  communication  and  economic 
networks,  together  with  the  heritage  of  colonialism, 
contribute  to  the  diffusion  of  what  is  sometimes 
called  the  “American  model  of  race  relations.”  In 
this  model,  races  are  seen  as  stable  internal  differen¬ 
tiations  of  the  nation-state,  a  structure  in  the  United 
States  famously  called  by  David  Hollinger  (1995) 
the  “ethno-racial  pentagon”  (Blacks,  Whites, 
Asians,  Hispanics,  and  Native  Americans).  This 
model  is  often  associated  with  neoliberal  ideals.  Its 
emphasis  on  individual  responsibility  contributes  to 
phenomena  such  as  “color-blind  racism.”  Color¬ 
blind  racism  sustains  racist  structures  through  the 
denial  of  the  importance  of  racism  for  social 
inequality.  The  concept  of  symbolic  racism  is  used 
in  political  psychology  to  denote  the  opposition  to 
policies  designed  to  support  specific  racial  groups. 

2.  Although  scientific  projects  such  as  the 
human  genome  project  emphasize  the  genetic  unity 
of  humankind,  new  types  of  biologically  based 
benevolent  racism  have  emerged — for  example,  in 
the  advertising  on  drugs  for  assumed  racial  groups 
and  the  interrogation  of  these  groups  for  the  bet¬ 
terment  of  medical  treatments.  Connected  with  the 
rapid  advances  in  genetics,  we  have  reached  a  new 
phase  of  research  on  the  biological  bases  of  human 
behavior,  strands  of  research  that  can  be  easily 
incorporated  into  racist  ideologies.  Further,  as 
pointed  out  by  Nadia  Abu  El-Haj  (2007),  new 
questions  are  raised  on  the  very  notion  of  “nature” 
in  its  relation  to  culture. 

3.  A  disturbing  facet  of  racism  today  is  that  it 
works  as  an  ideology  and  practice  often  without 
legal  or  institutional  support,  expressed,  for  exam¬ 
ple,  in  ongoing  segregation  or  discriminatory 
employer  practices.  These  topics,  together  with 
investigations  into  the  political  successes  of  right- 
wing  parties  and  right-wing  movements,  are  an 
ongoing  important  line  of  discussion.  Research 
here  is  based  on  a  long  tradition  of  attempts  to 
measure  racial  prejudice  and  right-wing  political 
attitudes,  starting  with  the  famous  work  of 
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Theodor  W.  Adorno  (1903-1969)  and  a  group  of 
social  psychologists  on  the  “authoritarian  person¬ 
ality”  and  continued  in  works  such  as  Bob 
Altemeyer’s  study  of  right-wing  authoritarianism 
in  the  1980s.  Today,  as  noted  by  Leonie  Huddy 
and  Stanley  Feldman  (2009),  the  concept  of  racial 
stereotypes  is  one  way  to  conceptualize  more 
clearly  the  relation  between  racism  and  politics. 

4.  The  analysis  of  racist  beliefs,  practices,  and 
social  structures  has  led  to  studies  of  whiteness  and 
the  different  dimensions  of  identity  politics.  In  this 
context,  racial  prejudices  are  also  taken  into  account 
but  more  as  hidden  or  subtle  forms  of  racism.  The 
concept  of  “unconscious  racism,”  advanced  by 
Hart  Blanton  and  James  Jaccard  in  2008,  is  one 
promising  attempt  to  measure  these  types  of  beliefs. 
The  mechanisms  of  social  and  political  exclusion 
are  interrogated  in  the  intersection  of  different 
dimensions  of  inequality,  such  as  class,  gender,  and 
nationality.  The  complexities  of  racism  ingrained  in 
modern  institutions,  giving  unequal  access  to  goods, 
services,  and  opportunities,  are  thematized  under 
the  rubric  of  “institutional  racism.”  So  one  of  the 
central  questions  in  current  political  research  is 
how  racism  and  racial  discrimination  are  repro¬ 
duced  on  the  national  and  the  world  level  irrespec¬ 
tive  of  the  intentions  of  the  actor. 

Mathias  Bos 
Pbilipps-Universitat  Marburg 
Marburg,  Germany 

See  also  Colonialism;  Culturalism;  Equality;  Ethnicity; 
Justice 
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Radical  Attitudes 


Like  many  other  concepts  in  political  science,  the 
notion  of  radicalism  dates  back  to  the  political 
conflicts  of  the  late  18th  and  early  19th  century. 
Even  then,  its  content  depended  on  the  political 
context  and  was  far  from  well  defined. 
Consequently,  being  “radical”  has  meant  different 
things  to  different  people  at  different  times  in  dif¬ 
ferent  countries.  Moreover,  radicalism  is  closely 
related,  if  not  identical  to,  a  number  of  (equally 
vague)  concepts,  such  as  extremism,  fundamental¬ 
ism,  and  populism.  As  of  today,  there  is  no  univer¬ 
sally  accepted  definition  of  radicalism  and,  by 
implication,  radical  attitudes. 

There  is,  however,  a  core  meaning  of  radicalism: 
Radicals  are  willing  to  challenge  the  ground  rules 
of  politics  to  get  to  the  root  (Latin:  radix)  of  what 
they  perceive  to  be  the  most  pressing  political 
problems.  In  any  given  context,  radicals  will  con¬ 
front  the  political  establishment  and  will  support 
policies  whose  implementation  would  trigger  sys¬ 
temic  change.  In  this  entry,  the  various  historical 
contexts  and  attempts  at  measurement  of  this  con¬ 
cept  are  discussed. 

Radicalism  in  the  18th  and  19th  Centuries 

In  the  last  third  of  the  18th  century,  a  heteroge¬ 
neous  group  of  philosophers,  writers,  and  politi¬ 
cians  began  to  campaign  for  a  thorough  reform  of 
Britain’s  political  system.  Among  the  goals  of  this 
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movement  were  abolition  of  slave  trade,  reform  of 
the  electoral  laws,  and  better  protection  of  citizens’ 
rights.  They  soon  gained  support  from  the  emerging 
middle  and  working  classes.  The  parliamentarian 
Charles  James  Fox  is  often  credited  with  coining  the 
name  for  this  new  movement  when  he  demanded  a 
“radical  reform”  of  the  electoral  system  in  1797, 
and  by  1819,  the  “radicals”  had  established  them¬ 
selves  as  a  separate  political  force  that  inspired  the 
Chartist  movement  and  played  an  important  role 
in  the  creation  of  both  the  Liberal  and  the  Labour 
parties. 

Similarly,  after  the  restoration  of  the  monarchy 
in  19th-century  France,  supporters  of  republican 
principles  called  themselves  “radicals.”  Over  the 
last  third  of  the  century,  they  drifted  to  the  left  and 
were  instrumental  in  the  foundation  of  the  coun¬ 
try’s  first  modern  left-wing  party,  the  Republican, 
Radical  and  Radical-Socialist  Party,  in  1901. 

In  Germany,  radical  was  initially  a  political 
label  chosen  by  those  liberals  who,  in  the  spirit  of 
the  French  Revolution,  demanded  civil  liberties, 
universal  male  suffrage,  and  parliamentary  repre¬ 
sentation.  In  the  second  half  of  the  19th  century, 
this  label  was  applied  to  those  members  of  the 
workers’  movement  who  favored  a  revolutionary 
change  of  government  (i.e.,  the  end  of  the  authori¬ 
tarian  monarchist  regime).  In  a  similar  fashion,  in 
many  other  European  and  Latin  American  coun¬ 
tries,  “radicalism”  became  shorthand  for  a  sub- 
type  of  liberalism  that  could  be  located  either  to 
the  left  or  to  the  right  of  the  political  center.  To  the 
present  day,  “radical”  parties  exist  in  many  coun¬ 
tries,  including  Argentina,  Chile,  Denmark, 
Ecuador,  France,  Italy,  Paraguay,  and  Switzerland. 
Most  of  them  are  today  classified  as  either  liberal 
or  socialist/social-democratic. 

Radicalism  in  the  20th  Century 

The  gradual  spread  of  liberal  democracy  and  its 
crisis  during  the  interwar  period  changed  the  mean¬ 
ing  of  the  concept  of  radicalism.  In  the  wake  of  the 
events  in  Germany,  Italy,  Russia,  and  many  other 
European  countries,  radicalism  became  a  collective 
term  for  the  forces  at  the  poles  of  the  political  spec¬ 
trum,  which  had  formerly  been  known  chiefly  as 
“ultras”  and  had  threatened  to  overthrow  liberal 
democracy:  Communists  on  the  one  side,  Fascists 
and  National  Socialists  on  the  other.  Consequentially, 


radicalism  was  transformed  into  a  primarily  spatial 
term  (location  on  the  left-right  axis)  with  a  conno¬ 
tation  that  was  directly  opposed  to  its  original 
meaning.  While  the  original  radicals  had  been 
champions  of  freedom  and  democracy,  the  radicals 
of  the  20th  century  were,  by  virtue  of  their  ideo¬ 
logical  preferences,  opposed  to  these  values.  Under 
the  postwar  consensus  of  the  1950s,  this  perspective 
on  radicalism  became  dominant. 

However,  less  than  2  decades  after  the  end  of 
World  War  II,  Seymour  Martin  Lipset  challenged 
the  prevailing  view  of  the  connection  between  cen¬ 
trism  and  support  for  democracy.  In  his  seminal 
study  Political  Man,  Lipset  (1960)  claimed  that 
fascism  and  national  socialism  were  neither  left- 
wing  nor  right-wing  ideologies.  Rather,  they  con¬ 
stituted  an  “extremism  of  the  centre.”  While  this 
statement  is  problematic  if  interpreted  in  purely 
sociological  terms — fascism  and  national  socialism 
appealed  both  to  the  middle  and  to  the  working 
classes — it  reflects  the  ambiguous  location  of  these 
regimes  on  the  traditional  left-right  spectrum.  On 
the  one  hand,  they  violently  suppressed  the  left- 
wing  unions  and  parties.  On  the  other  hand,  they 
were  hardly  champions  of  a  free  market  economy: 
Fascism  and  national  socialism  insulated  farmers 
and  small  businesses  from  competition,  engaged  in 
large-scale  economic  planning,  and  raised  govern¬ 
ment  spending  on  welfare  to  unprecedented  levels. 

More  generally,  Lipset  argued  that  attitudes 
toward  the  economy  and  attitudes  toward  democ¬ 
racy  could  vary  independently.  In  his  view,  any 
position  on  the  left-right  spectrum — radical  or 
centrist — can  be  combined  with  “the  repression  of 
difference  and  dissent,  the  closing  down  of  the  mar¬ 
ket  place  of  ideas”  (Lipset  &  Raab,  1970,  p.  6).  This 
“tendency  to  treat  cleavage  and  ambivalence  as 
illegitimate”  (p.  6)  is  what  Lipset  called  extremism. 

Lipset  fruitfully  applied  this  concept  to  right- 
wing  extremism  in  the  United  States.  In  his  view, 
the  insistence  on  free  market  principles  makes 
this  particular  breed  of  extremism  “right-wing,” 
whereas  anti-Semitism,  homophobia,  racism,  reli¬ 
gious  intolerance,  and  xenophobia  are  simply 
manifestations  of  the  same  underlying  generic 
phenomenon.  Indeed,  in  a  separate  work,  he  con¬ 
vincingly  demonstrated  that  these  traits  are  also 
prevalent  among  members  of  the  working  class, 
whose  criticism  of  free  market  principles  marks 
them  as  left-wingers. 
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Lipset’s  notion  of  extremism  is  so  broad  that  it 
resonates  with  the  even  more  general  concepts 
that  were  developed  around  the  same  time  by 
psychologists  such  as  Hans  Jurgen  Eysenck  (“tough¬ 
mindedness”)  and  Milton  Rokeach  (“closed¬ 
mindedness,”  “dogmatism”)  and  refer  to  a  tendency 
to  unconditionally  accept  norms,  prejudices,  and 
authorities.  Like  Lipset,  Eysenck,  and  Rokeach, 
many  other  scholars  treat  political  preferences  in 
general  and  political  radicalism  in  particular  as  an 
essentially  two-dimensional  phenomenon.  However, 
while  Lipset  argued  that  left-right  ideology  and  sup¬ 
port  for  democratic  values  and  institutions  can  vary 
independently,  other  authors  disagree. 

In  work  that  was  partly  inspired  by  Lipset,  Uwe 
Backes  and  Eckhard  Jesse  claim  that  there  is  a 
U-shaped  link  between  ideological  radicalism  and 
antidemocratic  extremism.  Although  they  acknowl¬ 
edge  that  radicalism  and  extremism  are  conceptu¬ 
ally  different,  they  argue  that  radical  ideological 
positions  have  implications  that  render  them 
incompatible  with  liberal  democracy  as  defined  by 
the  core  values  of  the  Lrench  Revolution:  liberty, 
equality,  and  fraternity.  According  to  Backes  and 
Jesse,  left-wing  radicalism  (communism)  overem¬ 
phasizes  equality  to  the  detriment  of  freedom, 
whereas  traditional  European  right-wing  radical¬ 
ism  (fascism)  as  well  as  American  right-wing 
radicalism  disregards  equality  in  favor  of  either 
fraternity  or  liberty.  In  Backes’s  and  Jesse’s  view, 
centrism  is  conducive  to  liberal  democracy,  while 
radicalism  is  a  necessary  and  sufficient  condition 
for  extremism.  In  a  sense,  the  20th-century  view  of 
radicalism  has  come  full  circle  in  their  work,  which 
has  influenced  many  European  scholars  directly  or 
indirectly.  However,  empirical  evidence  for  the 
U-shaped  link  between  radical  ideological  posi¬ 
tions  and  opposition  to  liberal  democracy  is 
sparse. 

Measurement  Issues 

If  radicalism  is  interpreted  in  a  purely  spatial  sense, 
it  simply  refers  to  the  end  points  of  the  ideological 
spectrum.  The  most  common  instrument  in  this 
context  is  the  general  left-right  scale  employed  in 
countless  comparative  and  single-country  studies. 
Since  the  left-right  scale  is  still  interpreted  chiefly 
in  economic  terms,  other,  more  specific  scales  that 
refer  to  the  appropriate  degree  of  government 


intervention  in  the  economy,  state  control  of  prices 
and  wages,  or  the  importance  of  trade  unions  have 
also  been  used.  On  the  other  hand,  more  inclusive 
attempts  at  measuring  radicalism  include  prefer¬ 
ences  on  “postmaterialist”  issues  such  as  the  envi¬ 
ronment,  minority  rights,  and  direct  democracy. 

The  logical  implications  of  extreme  positions 
notwithstanding,  most  researchers  would  agree, 
however,  that  a  position  at  the  end  points  of  any 
policy  scale  is  in  itself  of  little  importance  because 
people  frequently  hold  inconsistent  and  contradic¬ 
tory  attitudes.  Therefore,  a  number  of  items  and 
scales  have  been  proposed  to  directly  capture  sup¬ 
port  for  liberal  democracy. 

Arguably,  the  most  influential  among  these 
were  developed  by  Herbert  McClosky  in  his  work 
on  democratic  values.  In  his  1964  article,  McClosky 
distinguishes  between  three  subdimensions  of 
democratic  values:  respect  for  the  “rules  of  the 
game”  and  support  for  freedom  of  expression  as 
well  as  support  for  political,  economic,  social,  and 
ethnic  equality.  McClosky’s  first  dimension  pri¬ 
marily  refers  to  formal  compliance.  As  long  as  a 
majority  of  citizens  have  internalized  these  rules, 
they  will  support  democratic  institutions  even  if 
their  grasp  of  the  underlying  principles  is  patchy. 
His  second  and  third  dimensions,  however,  refer 
precisely  to  these  principles. 

A  model  (liberal)  democrat  would  subscribe  to 
both  the  principles  and  the  rules,  whereas  an  anti¬ 
democrat  would  despise  both.  Real-world  citizens 
usually  find  themselves  somewhere  in  between 
these  two  poles:  They  agree  with  the  rules  and 
abstract  principles  but  sometimes  struggle  with 
their  application.  Some  items  on  McClosky’s  scale 
were  specifically  designed  to  capture  these  con¬ 
flicts.  For  instance,  90%  of  his  respondents  believed 
in  “free  speech  for  all  no  matter  what  their  views 
might  be,”  yet  50%  agreed  that  books  containing 
“wrong  political  views”  did  not  deserve  to  be  pub¬ 
lished  and  25%  were  ready  to  suspend  due  process 
for  “dangerous  enemies  like  the  Communists.” 

To  the  present  day,  McClosky’s  work  has  had  a 
tremendous  impact  on  the  field,  but  there  are  some 
basic  problems  with  his  and  all  subsequent  attempts 
to  measure  support  for  democratic  values.  First, 
the  items  inevitably  reflect  the  political  and  his¬ 
torical  context  within  which  they  were  devised. 
For  McClosky  and  many  of  his  successors,  com¬ 
munism  was  the  main  threat  to  liberal  democracy. 
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With  the  advent  of  new  ideological  challenges  such 
as  Islamism  and  right-wing  populism,  this  is  obvi¬ 
ously  no  longer  true.  Second,  the  rules  and  some¬ 
times  even  the  principles  that  constitute  liberal 
democracy  are  bound  to  change  gradually  over 
time.  Political  behavior  and  issues  from  the  New 
Politics  agenda  that  were  considered  “radical”  in 
the  1960s — minority  rights,  the  environment,  sit- 
ins  and  human  chains — are  now  well  within  the 
political  mainstream.  Therefore,  finding  items  that 
work  well  in  all  countries  at  all  times  is  conceptu¬ 
ally  and  empirically  next  to  impossible.  Third, 
even  if  these  attitudinal  scales  generate  measure¬ 
ments  that  are  valid  across  time  and  space,  they 
lack  a  natural  cutoff  point.  At  best,  they  are  able 
to  identify  the  most  radical  persons  in  society. 
Where  the  boundary  lies  between  democrats  and 
radicals,  however,  is  an  entirely  different  question. 

Kai  Arzbeimer 
University  of  Essex 
Colchester,  United  Kingdom 
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Rational  Choice 


Rational  choice,  or  rationality,  has  two  meanings. 
First,  in  a  technical  sense,  rationality  implies  that 
an  individual’s  preferences  over  choices  possess 
two  properties:  completeness  and  transitivity.  The 
first  property  holds  that  the  individual’s  prefer¬ 
ences  are  well  defined  for  any  two  possible  alter¬ 
natives  in  the  set  of  available  choices:  Given  any 
pair  of  alternatives  x  and  y,  the  individual  prefers 
choice  x  over  y,  prefers  choice  y  over  x,  or  is  indif¬ 
ferent  between  them.  The  second  property  main¬ 
tains  that  if  the  individual  prefers  alternative  x  to 
y  and  y  to  z,  then  the  individual  prefers  x  to  z. 

Most  commonly,  when  scholars  mention  ratio¬ 
nal  choice,  they  refer  to  the  broad  set  of  work — the 
rational  choice  school — that  builds  on  this  techni¬ 
cal  definition  of  rationality.  Although  derived,  at 
the  core,  from  this  single  technical  definition  of 
rationality,  the  rational  choice  school  is  commonly 
associated  with  a  number  of  additional  views 
about  human  behavior.  Notably,  rational  choice  is 
commonly  associated  with  self-interested  behavior. 
This  association  reflects  a  common  set  of  assump¬ 
tions  about  the  structure  of  individuals’  prefer¬ 
ences,  such  as  that  people  care  only  about  their 
own  personal  welfare,  not  the  welfare  of  others. 
This  orientation  leads  many  critics  to  juxtapose 
rational  choice  with  theories  in  which  individuals 
behave  in  other-regarding  ways — for  example,  a 
Rousseauean  citizen  who  strives  for  the  common 
good.  This,  of  course,  puts  aside  the  thorny  issue  of 
how  to  define  or  determine  the  “common  good.” 

Second,  rational  choice  is  closely  associated  with 
the  idea  of  human  interdependence  and  with  a  sub¬ 
field  of  economics  known  as  game  theory.  Game 
theory  provides  a  set  of  tools  for  studying  the  inter¬ 
actions  among  people.  Crucially,  game  theory  rec¬ 
ognizes  that  one  individual’s  choices  often  depend 
on  the  behavior  of  other  individuals.  For  example, 
if  an  individual  is  stopped  at  a  four-way  stop  sign, 
his  or  her  decision  of  whether  to  wait  or  to  proceed 
depends  on  whether  he  or  she  expects  the  other  cars 
stopped  at  the  intersection  to  wait  or  to  proceed. 

Rational  Choice:  A  Social 
Theory  of  Human  Behavior 

Although  rational  choice  theory  builds  on  indi¬ 
vidual  decisions,  it  is  fundamentally  a  social  theory 
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of  human  behavior.  Indeed,  part  of  the  reason  for 
simple  assumptions  about  individuals  is  to  build 
tractable  models  of  social  interaction.  Rational 
choice  theory  is  social  in  its  approach  to  under¬ 
standing  individual  decisions,  contending  that  we 
condition  our  behavior  on  the  anticipated  behav¬ 
ior  of  other  people.  In  addition,  it  is  social  in  that 
it  allows  us  to  understand  how  groups  of  individu¬ 
als  reach  collective  decisions  and  produce  social 
outcomes. 

Human  Behavior  Is  Interdependent 

Most  rational  choice  theories  emphasize  the 
interdependence  of  individuals:  The  behavior  of 
one  individual  depends  on  how  this  person  expects 
other  people  to  behave.  This  fundamental  insight 
has  been  applied  by  scholars  to  a  wide  range  of 
questions  in  economics  and  political  science,  from 
the  entry  of  firms  into  new  markets  to  agenda  set¬ 
ting  and  voting  in  Congress. 

Charles  Cameron,  for  example,  studied  the 
presidential  veto  in  a  rational  choice  framework. 
As  an  empirical  fact,  American  presidents  rarely 
veto  legislation  from  Congress.  This  observation 
led  many  scholars  to  view  the  veto  as  unimportant 
to  the  legislative  process.  Congress,  they  argued, 
dominates  the  legislative  process,  and  the  president 
has  little  say  over  the  content  of  any  legislation. 
However,  Cameron  saw  that  the  veto  often  has  a 
profound  influence  on  legislation.  Members  of 
Congress,  he  observed,  anticipate  that  the  president 
will  veto  a  bill  if  it  is  too  far  from  his  preferred 
policy.  Members  of  Congress  design  legislation  so 
that  it  is  sufficiently  close  to  the  president’s  pre¬ 
ferred  policy  to  avoid  a  veto.  Using  rational  choice 
theory,  Cameron  demonstrates  how  the  veto  influ¬ 
ences  public  policy  outcomes,  even  if  the  president 
rarely  exercises  the  veto. 

In  another  application,  scholars  study  the  policy¬ 
making  behavior  of  executive  agencies  in  the 
American  separation-of-power  system.  The  statutes 
passed  by  Congress  often  delegate  to  agencies — 
such  as  the  Food  and  Drug  Administration  or  the 
Environmental  Protection  Agency — considerable 
discretion  in  implementing  the  law.  Some  observers 
have  called  such  delegation  of  power  to  agencies  an 
abdication  of  power  by  Congress,  leading  them  to 
question  policy  making  by  unelected  bureaucrats. 
In  contrast  to  this  conventional  perspective  on 


agencies,  rational  choice  scholars  such  as  John 
Ferejohn,  Charles  Shipan,  and  Mathew  McCubbins 
observe  that  agencies  must  anticipate  the  behavior 
of  Congress.  Just  as  Congress  delegates  authority  to 
agencies,  Congress  can  trim  this  authority,  transfer 
it  to  another  agency,  withhold  budgets,  or  invali¬ 
date  the  statute  wholesale.  If  the  agency  implemen¬ 
tation  of  a  statute  is  sufficiently  unsatisfactory  to 
Congress,  it  is  likely  to  override  the  agency.  To 
avoid  costly  interaction  with  Congress,  agencies 
must  anticipate  the  possible  reactions  by  Congress, 
adjusting  policy  to  avoid  negative  treatment  by  the 
parent  institution. 

This  example  of  congressional  delegation  is  an 
instance  in  a  broader  class  of  interactions  studied 
by  rational  choice  scholars,  known  as  principal- 
agent  problems.  This  type  of  interaction  is  charac¬ 
terized  by  a  principal  (e.g.,  Congress)  attempting 
to  induce  an  agent  (e.g.,  the  executive  agency), 
which  often  possesses  divergent  preferences  from 
the  principal,  to  perform  some  task  (e.g.,  imple¬ 
ment  a  statute).  Principal-agent  problems  have 
wide  applications  in  political  science,  including 
delegation  of  authority  by  citizens  to  elected  offi¬ 
cials,  by  officials  to  bureaucrats,  and  by  the  mili¬ 
tary  to  subordinates. 

Social  Outcomes  as  Products 
of  Individual  Choices 

One  of  rational  choice’s  most  important  contri¬ 
butions  is  the  idea  of  a  social  dilemma.  Individually 
rational  decisions  often  have  unintended  social 
outcomes;  paradoxically,  individual  maximization 
often  leads  to  socially  suboptimal  outcomes. 
Rational  choice  has  been  instrumental  in  identify¬ 
ing  and  clarifying  these  social  dilemmas  as  well  as 
in  proposing  solutions  to  them. 

Scholars  have  long  recognized  the  problems 
inherent  in  aggregating  individual  preferences  into 
social  outcomes.  In  the  18th  century,  the  Marquis 
de  Condorcet  noted  that  decisions  made  by  major¬ 
ity  vote  “cycle”;  that  is,  they  display  intransitivity. 
Consider  a  simple  version  of  this  cycling.  Suppose 
a  polity  exists  with  three  voters,  1,  2,  and  3,  and 
three  policy  alternatives,  A,  B,  and  C.  The  voters 
have  the  following  preferences:  Voter  1  prefers  A 
to  B  to  C,  Voter  2  prefers  C  to  A  to  B,  and  Voter  3 
prefers  B  to  C  to  A.  As  an  initial  social  choice, 
consider  a  vote  between  Alternatives  A  and  C.  In 
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this  case,  Voter  1  votes  for  A,  and  Voters  2  and  3 
vote  for  C,  giving  Alternative  C  a  majority  vote. 
Thus,  on  the  basis  of  majority  vote,  the  polity 
appears  to  favor  C  to  A.  Now  consider  a  vote 
between  B  and  C.  Given  these  alternatives,  Voters 
1  and  3  prefer  B,  and  Voter  2  prefers  C.  A  major¬ 
ity,  therefore,  appears  to  favor  B  to  C.  Finally, 
consider  a  vote  between  A  and  B.  With  this  choice 
set,  Voters  1  and  2  prefer  A,  and  Voter  3  prefers  B; 
the  polity  prefers  A  to  B. 

These  votes  result  in  an  intransitive  social  pref¬ 
erence  ordering.  A  majority  prefers  C  to  A,  B  to  C, 
and  A  to  B.  This  roughly  equates  to  saying  that  C 
is  greater  than  A,  B  is  greater  than  C,  and  A  is 
greater  than  B.  This  social  intransitivity  occurs 
despite  the  fact  that  the  individuals  in  the  polity 
have  well-defined  preferences  over  the  alternatives. 
Technical  studies  of  voting  show  that  this  example 
is  not  an  aberration  but  typical.  As  William  H. 
Riker  explains,  it  is  difficult  to  discuss  the  “demo¬ 
cratic  will,”  for  example,  if  the  social  preference 
ordering  cycles  through  alternatives. 

As  troubling  as  it  is,  Condorcet’s  cycling  result 
is  derived  purely  from  the  structure  of  individu¬ 
als’  preferences  and  a  simple  voting  rule.  His 
result  does  not  depend  on  individuals’  strategic 
behavior. 

In  contrast,  a  large  number  of  other  rational 
choice  applications  derive  social  dilemmas  on  the 
basis  of  strategic  interdependencies.  We  will  con¬ 
sider  four  canonical  examples  of  such  social  dilem¬ 
mas  illustrating  strategic  interdependence. 

The  prisoners’  dilemma  is  perhaps  the  longest- 
recognized  strategic  dilemma,  recorded  by  Thomas 
Hobbes  in  the  17th  century.  As  a  simple  example, 
consider  an  environment  with  two  individuals,  A 
and  B,  who  face  two  choices,  to  farm  (F)  or  to 
defect  (P).  The  value  for  Individual  A  of  farming  or 
preying  on  the  other  person  depends  on  whether 
Individual  B  farms  or  predates  (and  the  reverse). 
Using  numbers  to  simplify  the  example,  we  have 
the  following: 


Player  B 

F  (x,  y)  P  (x,  y) 

Player  A 

F  (x,  y) 

(3,3) 

(0,4) 

P  (x,  y) 

(4,  0) 

(1,  1) 

where  (x,  y)  represents  the  payoffs  for  Players  A 
and  B,  respectively. 

Each  player  must  decide  between  F  and  P.  If 
both  players  farm,  they  both  receive  a  payoff  of  3, 
maximizing  social  output.  However,  both  players 
face  a  unilateral  incentive  to  defect  from  farming: 
While  the  other  player  does  the  hard  work  of  farm¬ 
ing,  the  first  player  preys  on  his  crops.  This  unilat¬ 
eral  defection  produces  a  payoff  of  4  for  the  player 
who  predates  and  a  payoff  of  0  for  the  player  who 
is  victimized.  Because  both  players  face  the  same 
incentives,  neither  will  farm.  The  only  stable  out¬ 
come  of  the  prisoners’  dilemma  is  where  both  play¬ 
ers  predate  and  both  receive  a  payoff  of  1. 

In  Hobbes’s  view,  the  prisoners’  dilemma  mod¬ 
eled  a  world  in  which  life  is  short,  nasty,  and  brut¬ 
ish,  in  which  all  prey  on  all.  One  productive  way 
to  think  about  the  prisoners’  dilemma  is  in  terms 
of  the  ability  to  commit  to  a  certain  course  of 
action.  Both  players  prefer  to  live  in  a  world  in 
which  both  members  farm,  but  they  cannot  com¬ 
mit  to  farming  in  the  face  of  incentives  to  prey  on 
the  other  player. 

The  social  outcome — a  world  of  all  preying  on 
all — differs  from  the  social  outcome  that  every 
individual  wants  to  occur.  Nevertheless,  the  social 
outcome  is  a  product  of  individually  rational  deci¬ 
sions:  Each  member  acts  in  his  or  her  best  interest 
yet  produces  a  social  outcome  that  is  not  in  his  or 
her  best  interest. 

The  common  pool  problem ,  first  articulated  by 
William  Foster  Lloyd  in  the  19th  century  and 
sometimes  referred  to  as  the  tragedy  of  the  com¬ 
mons,  is  another  social  dilemma.  Consider  a  com¬ 
mon  resource,  such  as  land  for  grazing  cattle, 
available  to  a  society.  Each  individual  unilaterally 
decides  how  many  cattle  to  graze  on  the  land, 
receiving  some  benefit  for  each  animal  he  or  she 
grazes.  But  individuals  pay  only  part  of  the  cost  of 
grazing  each  animal:  The  cost  of  maintaining  the 
land  is  divided  among  all  members  of  the  society. 
This  structure  of  costs  and  benefits  provides  strong 
incentives  to  exploit  the  common  land,  grazing  as 
many  cattle  as  possible.  When  all  the  individuals 
do  so,  however,  overgrazing  occurs,  which  depletes 
the  quality  of  the  land.  Common-pool  problems 
have  an  enormous  range  of  applications,  including 
the  growth  of  government,  pollution,  population 
growth  (Lloyd’s  initial  concern),  and  the  overuse 
of  public  roads. 
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Another  class  of  dilemmas,  known  as  collective 
action  problems,  studied  by  rational  choice  theo¬ 
rists  focuses  on  public  goods,  or  goods  that  are  not 
excludable  (no  one  is  prevented  from  benefiting 
from  them)  and  are  nonrival  (one  person’s  con¬ 
sumption  does  not  influence  the  ability  of  another 
person  to  consume).  “Common  goods,”  such  as 
grazing  land,  differ  from  “public  goods.”  A  public 
good  is  characterized  by  nonrivalry — for  example, 
clean  air.  In  the  case  of  common  goods,  one  per¬ 
son’s  consumption  influences  the  ability  of  others 
to  consume  the  good.  Grazing  land,  for  example, 
becomes  depleted. 

Examples  of  collective  action  problems  include 
national  security  and  clean  air.  The  fundamental 
difficulty  in  a  collective  action  problem  is  that  the 
ability  of  an  individual  to  consume  the  public  good 
is  not  linked  to  any  effort  to  provide  the  good.  For 
example,  regardless  of  whether  an  individual  pays 
taxes,  he  or  she  benefits  from  the  national  security 
provided  by  the  standing  army.  This  decoupling  of 
benefits  and  costs  creates  an  incentive  to  free  ride 
off  the  contributions  of  others. 

Collective  action  problems,  therefore,  represent 
another  type  of  social  dilemma.  One  implication  of 
collective  action  problems  is  that  societies  often 
face  shortages  of  public  goods.  Even  though 
almost  all  members  of  a  society  would  benefit 
from  these  public  goods,  such  as  stronger  border 
security,  preventing  raids  from  the  neighboring 
society,  the  incentive  to  free  ride  results  in  an 
underfunded,  anemic  security  apparatus.  The 
problem  of  an  underdeveloped  security  apparatus 
was  endemic  for  most  of  human  history.  Only 
recently,  with  the  development  of  the  institutions 
of  the  modern  state,  have  security  forces  become 
well  funded — too  well  funded  in  the  eyes  of  some. 

As  a  final  example  of  a  class  of  problem  studied 
by  rational  choice  theorists,  consider  coordination 
problems.  In  this  type  of  problem,  the  members  of 
a  society  benefit  from  acting  in  concert  and  suffer 
when  they  fail  to  do  so.  A  classic  example  of  a 
coordination  problem  is  the  choice  of  whether  to 
drive  on  the  right  or  left  side  of  the  road.  If  we  all 
drive  on  the  left,  then  we  move  about  safely;  the 
same  holds  if  we  all  drive  on  the  right.  However,  if 
people  fail  to  drive  consistently  on  the  left  or  the 
right,  driving  becomes  extremely  hazardous. 

Political  science  has  applied  the  coordination 
problem  to  a  wide  range  of  political  questions. 


Consider  the  choice  of  whether  to  protest  or 
actively  work  against  an  authoritarian  regime.  If 
all  the  citizens  who  dislike  the  regime  act  simulta¬ 
neously,  they  can  overthrow  the  regime.  However, 
if  they  act  at  different  times,  with  only  small 
groups  on  the  street  at  any  time,  the  police  will 
suppress  their  movement.  The  group  of  discon¬ 
tented  citizens  faces  a  coordination  problem.  If 
they  coordinate  on  a  time  and  location,  they  ben¬ 
efit  from  a  political  transition  to  democracy;  if 
they  fail  to  coordinate,  they  receive  a  beating  by 
the  police. 

Although  we  have  focused  on  the  problems  that 
rational  choice  theory  has  helped  us  understand, 
this  perspective  has  also  generated  a  wide  range  of 
solutions.  Studying  collective  action  problems, 
Mancur  Olson  observed  that  members  of  groups 
have  the  incentive  to  free  ride  off  the  hard  work  of 
others.  As  a  partial  response  to  the  problem  of  free 
riding,  Olson  argues,  some  groups  provide  “selec¬ 
tive  benefits” — benefits  to  those  who  provide  their 
share  of  the  collective  good.  Elderly  people,  for 
example,  have  a  shared  interest  in  lobbying 
Congress  to  pass  laws  that  promote  health  care  and 
various  other  forms  of  public  policy  beneficial  to 
older  Americans.  However,  due  to  free  riding,  older 
Americans  benefit  from  these  lobbying  efforts  even 
if  they  make  no  effort  to  support  them,  such  as 
paying  dues  to  the  American  Association  of  Retired 
Persons  (AARP),  a  principal  interest  group  for  the 
elderly.  To  combat  the  incentive  to  free  ride,  the 
AARP  offers  a  number  of  benefits  that  accrue  only 
to  AARP  members  who  pay  dues,  such  as  newslet¬ 
ters,  discounts  on  medication,  magazines,  and 
other  consumer  products.  Thus,  selective  benefits 
offer  a  partial  solution  to  some  collective  action 
problems. 

In  another  example,  constitutions  can  help 
resolve  coordination  problems.  As  noted  above,  the 
choice  of  whether  or  not  to  protest  against  a  regime 
is  fraught  with  danger:  Coordinate  on  a  time,  and 
the  protestors  often  win;  fail  to  coordinate,  and  the 
protestors  often  face  suppression  by  the  police.  The 
rights  enumerated  in  a  constitution  serve  as  a  coor¬ 
dinating  mechanism.  When  a  regime  violates  a 
constitutional  right,  the  citizens  collectively  under¬ 
stand  that  a  transgression  has  occurred,  and  this 
triggers  coordinated  action  to  reverse  the  trans¬ 
gression.  Without  the  enumerated  rights,  citizens 
will  have  difficulty  determining  what  counts  as  a 
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transgression,  and  they  will  face  the  real  possibility 
of  coordination  failure  and  all  its  attendant  dangers. 

Objections  and  Limits  to 

Rational  Choice  Theory 

Rational  choice  theory  is  subject  to  several  criti¬ 
cisms,  which  fall  into  three  categories.  First,  critics 
suggest,  rational  choice  theory  is  unsuitably  restric¬ 
tive  because  it  typically  assumes  that  people 
behave  in  a  narrowly  self-interested  way.  Critics 
observe  that,  in  contrast  to  this  common  perspec¬ 
tive,  human  motivations  are  often  far  more  com¬ 
plex,  reflecting  a  concern  for  other  people. 

A  second  criticism  holds  that  rational  choice 
theories  are  not  sufficiently  constrained.  After  an 
initial  set  of  assumptions  about  the  structure  of 
preferences  and  the  interaction  between  individu¬ 
als,  rational  choice  theories  contain  rigorous,  highly 
constrained,  internally  and  logically  consistent 
analysis.  However,  scholars  have  discretion  in  their 
assumptions  about  the  preferences  and  how  they 
model  the  interaction  between  individuals.  For 
example,  altering  the  sequence  of  the  interaction — 
allowing  Player  A  to  move  before  Player  B,  say — 
often  appears  innocuous,  but  it  can  have  profound 
implications  for  the  results  of  a  rational  choice 
theory.  Similarly,  the  structure  of  individuals’  pref¬ 
erences — not  only  what  they  care  about  but  also 
how  their  payoffs  respond  to  increases  or  decreases 
of  what  they  care  about — can  dramatically  influ¬ 
ence  the  predictions  of  a  theory.  In  this  way, 
despite  the  appearance  of  rigor,  rational  choice 
theories  have  arbitrary  elements. 

A  third  common  criticism  notes  that  rational 
choice  theories  cannot  explain  important  elements 
of  human  behavior.  Voter  turnout,  for  example,  is 
exceptionally  difficult  to  explain  in  terms  of  ratio¬ 
nal  choice  theories.  Each  voter  has  only  a  trivial 
influence  over  the  outcome  of  an  election:  If  any 
single  voter  decides  not  to  turn  out  to  vote,  the 
probability  that  the  outcome  of  the  election  will  be 
altered  is  impossibly  small.  And  yet,  regardless  of 
whether  or  not  an  individual’s  vote  can  influence 
the  election  outcome,  each  voter  pays  some  cost 
for  voting — the  time  and  effort  it  takes  to  vote. 
The  combination  of  highly  uncertain  benefits  with 
certain  costs  suggests  that  voter  turnout  is  a  rare 
event.  Yet  people  clearly  vote  in  elections.  Thus, 
rational  choice  theories  fail,  in  this  criticism, 


because  they  cannot  account  for  important  aspects 
of  human  behavior. 

In  response  to  these  criticisms,  rational  choice 
scholars  often  provide  two  observations.  First, 
although  it  is  common  to  assume  the  narrow  self- 
interest  of  individuals,  this  assumption  is  not  nec¬ 
essary.  Rational  choice  theory  can  also  assume  that 
part  of  an  individual’s  welfare  depends  on  the 
welfare  of  other  people.  For  example,  many  people 
value  a  measure  of  equality  in  the  distribution  of 
wealth,  and  this  preference  often  induces  people  to 
give  to  charity  or  to  support  programs  that  redis¬ 
tribute  income.  Donations  to  charity  and  other 
apparently  selfless  acts  do  not  necessarily  represent 
irrational  behavior.  Similarly,  the  rational  choice 
perspective  easily  models  these  settings  where 
people  attach  some  value  to  being  viewed  as  good 
citizens  or  value  the  welfare  of  their  fellow  citizens. 

The  second  point  is  methodological.  The  goal  of 
rational  choice  theory  is  to  provide  a  way  of  under¬ 
standing  human  interaction  and  social  outcomes. 
Models  are  useful  because  they  capture  important 
elements  of  human  behavior  in  a  parsimonious  way. 
Creating  a  useful  model  necessarily  involves  focus¬ 
ing  attention  on  certain  motivations  to  the  exclu¬ 
sion  of  others.  As  the  examples  discussed  above 
indicate,  rational  choice  theory  helps  us  understand 
problems  of  social  importance.  This  contribution  is 
possible  because  of — not  in  spite  of — the  narrow 
focus  of  rational  choice  theories.  Moreover,  as  ratio¬ 
nal  choice  theories  matured,  they  gradually 
expanded  the  focus  to  include  less  restrictive 
assumptions.  The  core  insights,  however,  almost 
always  derive  from  the  study  of  a  highly  modeled 
environment  that  is  unrealistic  in  a  variety  of  ways. 

The  most  serious  criticism  of  rational  choice 
theories  is  that  they  are  not  sufficiently  con¬ 
strained.  As  suggested  above,  modeling  choices 
about  preferences  or  the  structure  of  interaction 
crucially  affect  the  model’s  implications.  The 
strongest  rational  choice  models,  therefore,  are 
those  that  build  on  others,  by  which  a  consensus 
emerges  on  how  to  study  a  given  problem,  such  as 
the  assumption  that  elected  officials  maximize  the 
probability  of  reelection. 

Frontiers 

Rational  choice  scholars  extend  this  approach  in  a 
variety  of  new  and  exciting  directions.  For  example, 
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an  important  literature  focuses  on  problems  related 
to  information — who  knows  what  and  when  they 
make  a  choice  as  well  as  the  incentives  for  a  player 
to  misrepresent  himself.  In  the  interest  of  space,  we 
focus  on  just  one  important  innovation:  behavioral 
choice. 

Students  of  behavioral  choice  attempt  to  com¬ 
bine  insights  from  psychology  with  standard  ratio¬ 
nal  choice  theory.  They  question  whether  it  is  pos¬ 
sible  for  humans  to  process  information  in  the  way 
implied  by  traditional  rational  choice  theories  or 
whether  emotions  or  cognitive  constraints  system¬ 
atically  bias  choices  away  from  purely  rational 
ones.  Rather  than  calculate  every  contingent  pos¬ 
sibility,  for  example,  behavioral  choice  scholars 
suggest  that  people  use  heuristics  and  rules  of 
thumb  to  reach  decisions.  Similarly,  the  way  in 
which  a  choice  is  presented  to  individuals  appears 
to  influence  their  ultimate  choice.  For  example, 
whether  people  are  asked  to  opt  in  or  opt  out  of  a 
program  can  control  whether  they  join  the  pro¬ 
gram,  a  finding  that  is  difficult  to  square  with 
conventional  rational  choice  theory.  This  approach 
remains  in  its  infancy,  but  it  promises  important 
extensions  and  generalizations  of  the  rational 
choice  approach. 

Conclusion 

Rational  choice  theory  relies  on  simplifying  assump¬ 
tions  about  individuals  in  large  part  because  it  is  a 
theory  of  social  interaction.  Assumptions  of  indi¬ 
vidual  rationality,  as  we  have  seen,  lead  to  many 
important  and  unobvious  conclusions  about  the 
behavior  of  groups  of  individuals  or  societies.  As 
illustrated  by  various  social  dilemmas,  maximizing 
individuals  do  not  imply  social  maximization. 
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Critical  rationalism  is  a  school  in  the  philosophy  of 
science  and  in  political  philosophy  centered  on  the 
works  of  the  Austrian-English  philosopher  Karl  R. 
Popper.  The  Social  Democrat  Popper  fled  from 
Vienna  and  the  Nazis  in  the  late  1930s  to  New 
Zealand  and  established  himself  in  London  after 
the  war.  Critical  rationalism  is  characterized  by  the 
method  of  trial  and  error,  applicable  in  all  scientific 
disciplines,  philosophy,  or  politics  according  to  its 
adherents.  In  a  problem  situation,  a  tentative  the¬ 
ory  is  advocated  to  solve  the  problem.  Thereafter, 
all  efforts  should  be  directed  toward  demonstrating 
the  falseness  of  this  theory  (falsification,  error 
elimination),  and  when  this  has  been  done,  we  are 
in  a  new  problem  situation  but  with  increased 
knowledge.  We  have  learned  from  our  mistake. 
This  optimistic  belief  in  the  possibility  of  the 
growth  of  knowledge  justifies  the  label  “rational¬ 
ism.”  Moreover,  this  rationalism  is  “critical,”  since 
a  prevailing  theory  should  always  be  exposed  to  a 
maximum  of  criticism.  This  is  a  particularly 
demanding  requirement  for  the  theory  constructor; 
however,  it  is  a  crucial  purpose  of  critical  rational¬ 
ism  to  depersonalize  research  and  its  theories.  This 
entry  presents  the  basic  tenets  of  critical  rational¬ 
ism,  reflecting  the  mainstream  of  contemporary 
social  science  and  beyond. 

In  a  political  science  perspective,  critical  ratio¬ 
nalism  is  crucial  both  from  a  methodological  view¬ 
point  and  as  a  substantial  political  philosophy. 
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Moreover,  critical  rationalism  indicates  how  the 
political  scientist  can  make  himself  or  herself  useful 
to  society.  Although  it  is  a  challenge  to  follow  them 
in  actual  research,  the  prescriptions  of  critical 
rationalism  have  become  mainstream  in  political 
science  or  social  science  methodology  textbooks, 
even  if  they  are  occasionally  contested.  Further, 
Popper’s  work  on  the  “open  society  and  its  ene¬ 
mies”  is  an  important  ingredient  in  the  political 
philosophy  of  the  20th  century,  known  mainly  for 
its  attacks  on  the  prophecies  and  societal  utopias  of 
Plato,  Georg  Wilhelm  Friedrich  I  legel,  and  Karl 
Marx.  It  is  actually  in  the  spirit  of  critical  rational¬ 
ism  that  not  even  a  classic  Greek  philosopher  like 
Plato,  normally  over  and  above  criticism,  is  spared. 
Subjecting  his  thought  to  criticism  is  a  way  of  tak¬ 
ing  it  seriously,  according  to  critical  rationalists. 
No  historical  or  spatial  barrier  should  protect 
against  criticism.  This  side  of  Popper’s  work 
belongs  to  normative  political  theory.  As  shown 
below,  these  two  aspects  of  Popper’s  philosophy, 
even  though  dealing  with  widely  different  matters, 
are  well  integrated  in  his  own  thought  as  they  fol¬ 
low  from  a  few  basic  assumptions. 

It  is  not  unreasonable  to  structure  20th-century 
philosophical  debates  around  Popper:  Popper  ver¬ 
sus  the  positivists  (empiricists),  Popper  versus  the 
relativists  (the  idealist  philosophy  of  language  or 
Thomas  Kuhn’s  philosophy  of  science),  and  Popper 
versus  the  Marxists  and  Neo-Marxists.  Critical 
rationalism  encompasses  not  only  Popper’s  philo¬ 
sophical  followers  (e.g.,  J.  W.  N.  Watkins,  David 
Miller,  Flans  Albert,  William  Bartley,  and  Imre 
Lakatos)  but  also  many  natural  scientists  and 
social  scientists.  Seven  theses  of  critical  rationalism 
are  formulated  below  and  put  in  their  proper  con¬ 
text.  Their  relevance  for  political  science  and  its 
debates  are  then  briefly  discussed. 

1.  Reality  exists  “out  there”  ( philosophical  real¬ 
ism).  Philosophical  realism  asserts  that  reality  exists 
independently  from  our  language  and  theories 
about  it.  A  corollary  of  realism  is  the  correspon¬ 
dence  theory  of  truth:  Truth  means  a  statement’s 
correspondence  with  reality  (as  opposed  to  the 
coherence  theory  of  truth,  which  stresses  its  coher¬ 
ence  with  previous  knowledge,  and  the  pragmatic 
theory  of  truth,  which  emphasizes  its  practical  use¬ 
fulness).  This  conception  of  truth  is  very  common- 
sensical:  Statements  are  about  something,  and  if 


they  fit  with  this  something,  the  statements  are 
true;  otherwise  they  are  not.  This  theory  of  truth 
makes  it  meaningful  to  compare  theoretical  expec¬ 
tations  with  observations  of  the  relevant  segment 
of  reality,  meaning  that  theories  can,  in  principle, 
be  evaluated  and  compared  mutually  as  to  how 
well  they  match  reality.  Therefore,  some  theories 
can  be  seen  as  better  than  others;  one  theory  may 
represent  progress  in  relation  to  another.  This  is  a 
privilege  compared  with  epistemologies,  according 
to  which  theory  and  reality  cannot  be  disentangled 
and  therefore  do  not  allow  for  any  confrontation 
between  them. 

Occasionally,  the  school  of  realism  in  interna¬ 
tional  relations,  focusing  on  states  and  their  rela¬ 
tive  power,  is  mistaken  for  philosophical  realism  as 
described  here.  Some  international  relations  real¬ 
ists  are  indeed  philosophical  realists,  whereas  oth¬ 
ers  are  not,  and  for  still  others  it  is  hard  to  tell.  It 
is  more  obvious,  however,  that  linguistically  ori¬ 
ented  schools  such  as  poststructuralism,  postmod¬ 
ernism,  or  constructivism  are  nonrealist  in  the 
philosophical  meaning  of  the  term.  Studying  the 
narratives  of  different  political  actors,  the  issue  is 
not  which  of  them  is  most  “true”  but  their  prag¬ 
matic  functioning  in  the  actors’  respective  con¬ 
texts.  Analysts  may  have  their  own  narratives  in 
the  name  of  science,  but  decoding  their  pragmatic 
or  political  agendas  is  crucial.  Objective  “truth”  is 
uninteresting,  if  it  exists  at  all.  Relativism  prevails. 

2.  Life  is  too  short  to  quarrel  about  the  meaning 
of  words  or  concepts  (anti-essentialism) .  A  conse¬ 
quence  flowing  from  the  above  disentangling  of 
language  and  reality  is  that  words  or  concepts  can 
be  defined  by  the  analyst.  They  are  decisions, 
although  they  are  not  arbitrary.  Research  strategic 
considerations  may  decide  how  to  define  a  con¬ 
cept.  Concept  essentialism  is  thereby  discarded: 
There  is  no  such  thing  as  what  a  concept  “really” 
means.  Popper  did  not  want  to  quarrel  over  words. 
Assertions  about  reality  are  important  and  should 
be  disputed — not  the  meanings  of  words  or  con¬ 
cepts.  In  other  words,  the  theory  that  poverty 
causes  revolution  is  significant  and  should  be  dis¬ 
cussed  and  disputed,  not  the  meanings  of  the  con¬ 
cepts  of  “poverty”  and  “revolution.”’  What  is  the 
essence  of  poverty  or  the  real  meaning  of  revolu¬ 
tion  are  fruitless  questions.  These  definitional 
questions  should  be  decided  relatively  quickly  as  a 
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preliminary  step  toward  the  interesting  discussion 
concerning  the  causal  relation  between  poverty 
and  revolution. 

Whether  and  how  politicians  use  a  concept  is  a 
part  of  political  language  and  thereby  reality. 
However,  that  should  not  affect  whether  and  how 
political  scientists  use  the  concept  for  analytical 
purposes.  For  instance,  it  may  be  reasonably 
argued  that  the  concept  of  “security”  is  worn  out 
for  analytical  purposes.  There  is  no  deeper  mean¬ 
ing  in  such  an  argument,  only  the  practical  consid¬ 
eration  that  the  concept  has  been  used  in  so  many 
different  senses  that  it  no  longer  conveys  any  clear 
meaning  but  rather  spreads  confusion.  Politicians’ 
use  of  this  concept  should  be  seen  as  political  lan¬ 
guage  with  other  purposes  than  analytical  ones 
(things  can  be  done  with  words  [performatives], 
but  that  requires  that  they  be  spoken  from  a  posi¬ 
tion  of  power  and  responsibility).  Such  use  is  inter¬ 
esting  as  an  empirical  object  of  study,  but  it  is 
irrelevant  for  the  analyst’s  conceptual  equipment. 
This  thesis  of  critical  rationalism  stands  in  direct 
opposition  to  the  “linguistic  turn”  of  political  sci¬ 
ence,  including  international  relations,  that  took 
place  from  the  1990s  onward  in  the  form  of  post¬ 
structuralism,  postmodernism,  or  constructivism. 

3.  Even  though  reality  exists,  it  is  difficult  to 
grasp;  there  are  no  “facts”  in  themselves  (antiposi¬ 
tivism  or  anti-inductivism) .  Although  reality  exists 
independently  of  our  theoretical  constructs  (Thesis 
1 ),  it  can  be  grasped,  according  to  critical  rational¬ 
ism,  only  through  our  conceptual  lenses  (critical 
rationalism  thereby  belongs  to  the  philosophical 
tradition  of  Immanuel  Kant).  This  means  that  there 
is  no  such  thing  as  facts  in  themselves — apart  from 
trivialities  such  as  the  numbers  in  a  telephone  direc¬ 
tory.  All  interesting  observation  is  theory  impreg¬ 
nated.  A  good  many  of  Popper’s  philosophical 
efforts  have  been  directed  against  the  positivist 
quest  for  certainty,  that  is,  that  theory  should  be 
built  inductively  “from  below”  on  a  secure  founda¬ 
tion  of  indisputable  facts  (this  includes  naive  real¬ 
ism,  i.e.,  that  things  are  what  they  look  like  and 
that  truth  is  manifested  through  them).  Knowledge 
cannot  start  from  nothing,  from  a  tabula  rasa,  or 
yet  from  observation.  The  growth  of  knowledge 
consists  mainly  in  the  modification  of  earlier 
knowledge.  Such  knowledge  should  not  be  under¬ 
stood  in  any  puristic  sense.  Even  if  political  science 


hardly  encompasses  established  theories,  there  are 
general  explanatory  principles  at  our  disposal, 
such  as  role,  rationality,  or  inertia,  that  can  be  inte¬ 
grated  into  theoretical  constructs.  The  main  thing 
is  that  the  expectations  that  we  attempt  to  falsify 
(cf.  Thesis  5)  are  theory  derived,  that  is,  derived 
from  explicit  sets  of  general  justifications.  In  other 
words,  a  priori  reasons  should  be  formulated  as  to 
why  we  believe  in  the  expectations. 

In  political  science  epistemological  debates 
(positivism  vs.  the  neo-Marxism  of  the  1970s  and 
1980s  or,  more  recently,  positivism  vs.  poststruc¬ 
turalism  or  other  linguistic  orientations),  Popper 
has  often  been  lumped  together  with  the  positiv¬ 
ists.  As  should  be  apparent,  this  is  erroneous.  His 
criticism  of  inductivism  obviously  hits  correlation- 
ally  oriented  branches  of  political  science.  For 
instance,  a  major  problem  with  U.S.  comparative 
foreign  policy  studies  was  their  lack  of  explicitly 
formulated  reasoning  as  to  why  various  hypothe¬ 
ses  deserved  to  be  tested.  Such  explicitness  would 
have  increased  the  opportunities  for  learning  from 
falsifications. 

4.  Theories  should  be  parsimonious;  among  two 
theories  with  equal  explanatory  power,  the  simpler 
one  should  be  preferred  (the  virtue  of  simplicity). 
Theories  should  preferably  be  simple  (parsimoni¬ 
ous):  They  should  be  able  to  explain  much  by 
little — by  one  or  a  few  unifying  explanatory  mecha¬ 
nisms  (be  it  rationality,  role,  or  a  unitary  actor).  We 
need  a  relatively  simple  starting  point  for  inquiry 
into  the  “world  of  chaotic  variety”  (Marx’s  formu¬ 
lation).  Parsimony  urges  one  to  include  only  those 
factors  in  a  theoretical  construct  that  a  priori  seem 
able  to  account  for  a  considerable  part  of  the 
empirical  variance.  Critical  rationalism  demands 
bold  conjectures  in  relation  to  what  we  already 
know  (our  background  knowledge);  in  other  words, 
science  should  aim  at  interesting  truth  rather  than 
just  truth.  This  demand  follows  from  the  virtue  of 
simplicity:  What  is  usually  called  the  simplicity  of  a 
theory  is  associated  with  its  logical  improbability 
and,  hence,  its  boldness. 

In  the  field  of  international  relations,  the  prefer¬ 
ence  for  parsimonious  theory  characterizes  the 
work  of  Kenneth  Waltz,  for  example,  who  is  one 
of  its  most  influential  authors.  However,  as  empha¬ 
sized  by  Waltz’s  critics,  his  assumptions  do  not 
correspond  sufficiently  to  reality  (Thesis  1),  and 
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the  falsifiability  of  his  theory  is  questionable  (see 
Thesis  5).  Climbing  down  the  “level  ladder”  in 
foreign  policy  explanation  is  also  based  on  respect 
for  parsimony:  starting  out  with  the  simplest  sys¬ 
temic  or  geopolitical  explanation,  which — if  unsuc¬ 
cessful — is  supplemented  by  factors  of  domestic 
politics  or,  if  necessary,  by  factors  in  the  decision¬ 
making  process.  In  other  words,  if  the  most  parsi¬ 
monious  explanation  does  not  sufficiently  account 
for  reality,  one  proceeds — with  some  resignation — 
toward  less  simple  modes  of  explanation. 

5.  Theories,  to  be  scientific,  should  in  principle  be 
falsifiahle  (falsificationism).  It  is  not  important 
how  a  theory  is  invented  or  discovered  by  a 
researcher;  it  may  be  inspired  by  societal  develop¬ 
ments  or  by  specific  academic  colleagues,  or  it  may 
be  dreamt  at  night.  Therefore,  research  psychology 
and  the  sociology  of  knowledge  are  redundant  dis¬ 
ciplines  from  the  viewpoint  of  distinguishing  good 
from  bad  theories  or  scientific  from  nonscientific 
ones.  Only  one  thing  counts:  the  theory’s  falsifi¬ 
ability.  It  is  easy  enough  to  find  “confirming” 
instances  of  almost  any  theory.  If  it  is  to  be  consid¬ 
ered  a  scientific  theory,  however,  it  should  run  the 
risk  of  being  false — unlike  Sigmund  Freud’s  psy¬ 
choanalysis,  for  instance;  in  short,  it  should  be 
falsifiahle.  This  thesis  is  probably  the  most  famous 
in  the  methodology  of  critical  rationalism.  The 
bolder  the  conjectures  (Thesis  4),  the  greater  the 
a  priori  likelihood  that  they  are  mistaken  and, 
hence,  the  more  falsifiahle  they  are.  Science  should 
progress  through  a  series  of  falsifications,  each 
time  leading  to  the  formulation  of  a  better  theory 
incorporating  what  has  been  learned  from  previous 
falsifications — that  is,  learning  from  past  mistakes. 

The  doctrine  of  falsificationism  was  developed 
in  opposition  to  the  positivists’  verificationism 
and,  as  the  latter  failed,  their  skepticism  toward 
the  growth  of  knowledge.  Whereas  it  is  impossible 
to  verify  the  law  “All  swans  are  white”  on  the 
basis  of  the  observation  of  a  finite  number  of 
swans,  it  is  possible  to  falsify  it  by  observing  only 
one  colored  (nonwhite)  swan.  Of  course,  the  sup¬ 
porter  of  this  law  may  object,  for  instance,  that  the 
swan  in  question  has  been  painted;  such  auxiliary 
hypotheses,  however,  should  be  used  with  the 
utmost  restraint  and  only  in  the  most  obvious 
cases.  Even  though  followers  of  Popper  such  as 
Imre  Lakatos  have  refined  falsificationism,  it  has 


faced  criticism  in  philosophy,  the  natural  sciences, 
and  the  social  sciences.  Not  surprisingly,  theory 
constructors  often  advocate  a  more  lenient  course 
vis-a-vis  theories  than  that  of  falsification.  Rather 
than  consciously  seeking  to  kill  theories  at  the  first 
occasion,  they  should  be  given  a  fair  chance  of 
“peaceful  development,”  at  least  at  the  outset.  The 
prevailing  view  in  political  or  social  science  today 
seems  to  be  that  a  certain  initial  “protectionism” 
of  theories  is  justified. 

6.  The  development  of  society  should  be  explained 
neither  on  the  basis  of  traits  or  motives  of  indi¬ 
viduals  or  groups  within  it  (reductionism)  nor  on 
the  basis  of  emergent  properties  of  society  as  a 
whole  (holism).  Instead,  revealing  unintended  con¬ 
sequences  is  the  way  forward  for  social  and  politi¬ 
cal  science.  Reductionism  means  that  unit-attributes 
are  used  to  explain  developments  of  the  whole,  the 
system.  A  special  version  of  it  is  psychologism, 
meaning  that  attributes  or  motives  of  individual 
human  beings  are  used  to  explain  societal  develop¬ 
ments.  Conspiracy  theories  are  reductionist.  Popper 
repeatedly  warned  against  conspiracy  theories  that 
claim  to  explain  various  unfortunate  societal  devel¬ 
opments  on  the  basis  of  the  characteristics  or 
motives  of  individual  human  beings  or  a  specific 
group,  functioning  as  scapegoats  (the  Jews,  immi¬ 
grants,  the  Communists,  etc.).  However,  we  should 
not,  according  to  critical  rationalism,  jump  to  the 
opposite  extreme  and  seek  to  explain  social  phe¬ 
nomena  solely  on  the  basis  of  emergent  properties 
of  the  whole  (holism).  The  target  here  could  be 
functionalism  in  sociology  or  political  science, 
explaining  developments  on  the  basis  of  societal 
“needs”  or  “functions.”  Instead,  the  preferred 
explanation  of  outcomes  in  the  social  sciences 
should  refer  to  mechanisms  producing  conse¬ 
quences  that  were  unintended  by  the  individual  or 
societal  actors  involved  (as  distinct  from  conspir¬ 
acy  theories  dealing  with  the  intended  conse¬ 
quences  of  “evil”  actions).  For  instance,  rational 
choice  theory  or  game  theory,  often  used  in  inter¬ 
national  relations,  may  be  an  excellent  instrument 
to  understand  such  unintended  consequences  in  the 
interaction  of  nation-states.  Each  state  in  a  con¬ 
flicting  dyad  proceeds  from  its  own  situational 
logic  and  wishes  to  safeguard  its  own  security.  An 
expensive  arms  race,  further  threatening  the  secu¬ 
rity  of  both  sides,  is  the  unintended  consequence  of 
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each  actor  following  its  own  situational  logic  in 
order  to  be  “safe.”  Or,  to  take  another  example, 
international  organizations  may  persist,  although 
their  original  good  purposes  have  been  fulfilled  or 
are  no  longer  relevant.  This  is  because  some 
nation-states  have  vested  interests  in  the  persis¬ 
tence  of  these  organizations,  typically  unintended 
at  the  time  of  their  creation. 

Both  Popper  and  Hans  Albert  took  up  a  fight  on 
these  issues  in  the  so-called  Positivismusstreit  in 
German  sociology  in  the  late  1960s.  The  oppo¬ 
nents  were  primarily  the  Neo-Marxists  of  the 
Frankfurt  school,  including  Jurgen  Habermas,  but 
it  was  likewise  seen  as  vital  to  keep  a  distance  from 
positivism. 

7.  Belief  in  historicist  laws  is  the  basis  of  modern 
totalitarianism,  be  it  of  a  Marxist  or  a  fascist 
nature.  In  contrast,  reform  of  the  open  society 
should  be  guided  by  piecemeal  social  engineering. 
Even  worse  than  functionalism,  holism  may  lead  to 
historicism,  the  view  that  the  whole  develops  deter¬ 
ministically  according  to  its  own  immanent  law  of 
historical  development  (in  contrast  to  scientific 
laws  of  the  if-then  type).  A  process  law  exists  for 
the  solar  system  because  there  are  few,  if  any,  fac¬ 
tors  disturbing  the  system.  A  human  society, 
however,  is  a  much  more  complex  system,  which 
cannot  be  explained  by  such  a  historicist  law.  There 
will  always  be  disturbing  factors,  be  they  external 
or  internal.  From  this  methodological  reasoning, 
Popper  develops  his  political  philosophy.  Its  target 
is  totalitarianism  operating  within  stages  of  human 
society,  through  which  a  determinist  development 
will  allegedly  lead  to  a  final  utopia  (such  as  the 
classless  society  or  Hitler’s  Thousand-Year  Reich). 
Characteristically,  The  Poverty  of  Historicism 
(Popper,  1957/1972),  the  book  in  which  these 
thoughts  were  formulated  during  World  War  II,  was 
dedicated  to  the  “countless  men  and  women  of  all 
creeds  or  nations  or  races  who  fell  victims  to  the 
fascist  and  communist  belief  in  inexorable  laws  of 
historical  destiny.”  Perhaps  it  was  unfair  to  ascribe 
this  determinist  belief  to  Marx  personally  in  The 
Open  Society  and  Its  Enemies  (Vol.  2).  However,  it 
can  in  any  case  be  ascribed  to  much  of  the  political 
Marxism  that  prevailed  in  the  20th  century. 

Popper,  in  the  preface  to  The  Poverty  of  Histori¬ 
cism,  has  added  a  further  argument  that  should 
refute  historicism:  The  course  of  human  history  is 


strongly  influenced  by  the  growth  of  human 
knowledge.  Since  we  cannot  predict,  by  rational  or 
scientific  methods,  the  future  growth  of  this 
knowledge  (then  we  would  already  have  it!),  it  is 
impossible  to  predict  the  future  course  of  human 
history. 

The  “closed  society”  is  held  together  by  “semi- 
biological  ties — kinship,  living  together,  sharing 
common  efforts,  common  dangers,  common  joys, 
and  common  distress”  (Popper,  1945,  chap.  10, 
para.  1).  It  is  a  semi-organic  unity,  to  which  the 
“so-called  organic  or  biological  theory  of  the  state 
can  be  applied  to  a  considerable  extent”  (Popper, 
1945).  The  closed  society  is  characterized  by  elite 
rule  and  the  belief  in  magical  taboos,  including 
dreams  of  a  future  utopia,  while  the  open  society 
is  one  in  which  men  have  learned  to  be,  to  some 
extent,  critical  of  taboos  and  to  base  decisions  on 
the  authority  of  their  own  intelligence — after  dis¬ 
cussion.  The  change  from  the  closed  to  the  open 
society  is  a  step  from  tribalism,  xenophobia,  and 
nationalism  to  humanitarianism. 

The  open  society  is  modeled  after  good  science, 
with  its  free  and  critical  discussion.  Rather  than  a 
belief  in  process  laws,  reform  of  the  open  society 
should  be  based  on  “piecemeal  social  engineering.” 
(This  label  turned  out,  of  course,  to  be  a  public 
relations  failure:  Few  social  or  cultural  scientists  or 
debaters  like  to  be  associated  with  such  a  “simplis¬ 
tic”  activity  as  engineering.)  The  role  of  social  or 
political  science  would  be  to  provide  the  inputs  to 
this  engineering.  Notably,  knowledge  of  unin¬ 
tended  mechanisms  would  be  valuable  in  the  pro¬ 
cess.  It  is  less  important  in  the  big  picture,  which 
specific  ideological  shading  Popper’s  open  society 
should  have.  With  his  emphasis  on  social  engineer¬ 
ing,  however,  it  seems  to  be  social  democratic  or 
social  liberal  rather  than  liberal  or  conservative. 
The  unique  feature  of  critical  rationalism  is  prob¬ 
ably  the  link  between  sophisticated  epistemological 
reasoning  and  a  forceful  political  philosophy. 
Distinct  from  his  positivist  colleagues  in  the  Vienna 
Circle,  with  whom  he  also  had  serious  epistemo¬ 
logical  disagreements,  Popper  wished  to  connect 
his  reasoning  to  the  ominous  political  develop¬ 
ments  taking  place  outside  the  seminar  room. 

Hans  Mouritzen 
Danish  Institute  for  International  Studies 
Copenhagen,  Denmark 
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Rationality,  Bounded 


Bounded  or  limited  rationality  is  a  term  coined  by 
Herbert  Simon  in  his  theory  of  decision-making 
processes  in  administrative  organizations.  It  focuses 
on  the  different  types  of  limitations  that  actors  or 
decision  makers  in  formal  or  administrative  orga¬ 
nizations  have  when  they  make  decisions.  Decision 
makers  have  to  make  a  selection  of  decision 
premises — goals,  preferences,  problems,  solutions/ 
alternatives,  and  potential  consequences — to  cope 
with  these  limitations  when  making  decisions.  The 
concept  of  bounded  rationality  relates  to  what 
kind  of  limitations  actors  have,  how  they  can  cope 


with  these  limitations,  and  what  the  effects  of 
bounded  rationality  are.  Bounded  rationality  is 
connected  to  the  concept  of  “administrative  man” 
and  can  be  seen  as  a  critique  of  and  modification 
of  the  concept  of  “economic  man” — in  other 
words,  the  theory  of  full  rationality.  Bounded 
rationality  relates  to  the  concept  of  “satisficing,” 
meaning  that  decision  makers  are  choosing  deci¬ 
sion  premises  that  are  “good  enough,”  while  eco¬ 
nomic  man  is  about  maximizing  or  choosing  the 
“best  possible”  action. 

Many  studies  of  decision  making  show  quite 
clearly  that  decision-making  processes  are  charac¬ 
terized  by  actors  not  having  all  goals  and  prefer¬ 
ences  available  or  unambiguously  defined,  not 
having  all  potential  problems  exposed,  not  hav¬ 
ing  all  alternatives  or  solutions  lined  up,  or  not 
having  all  potential  consequences  defined.  Actors  in 
the  real  world  seem  to  decide  based  on  a  few  selected 
goals/preferences,  problems,  solutions/alternatives, 
and  consequences.  Actors,  according  to  the  concept 
of  bounded  rationality,  intend  to  be  rational,  although 
they  are  constrained  in  these  efforts  by  limited  cogni¬ 
tive  capability  and  incomplete  information. 

Limitations  on  Decision  Making 

One  important  type  of  limitation  on  actors  is  prob¬ 
lems  of  attention.  Decision  makers  have  problems 
related  to  time  and  capabilities.  They  have  too 
many  decision  signals  to  attend  to  from  internal 
and  external  sources;  too  many  tasks  to  work  with; 
too  many  strategies,  plans,  and  policies  to  attend 
to;  and  too  many  contacts  to  cope  with.  And  some 
information  and  decision-making  premises  are  not 
ready  available,  so  they  have  to  search  for  them. 
Therefore,  how  they  allocate  their  scarce  resources 
is  a  critical  factor.  They  are  often  part-time  partici¬ 
pants  in  many  decision-making  processes  because 
they  move  around  between  decision  opportunities 
depending  on  what  is  brought  to  their  attention. 
And  when  they  focus  on  some  problems,  solutions, 
and  decisions,  they  have  to  defocus  on  others 
because  of  limited  capacity.  All  these  limitations 
indicate  that  decision  theories  should  be  centered 
more  on  theories  of  attention  and  search  than  on 
theories  of  choice  among  readily  available  goals, 
preferences,  alternatives,  and  consequences. 

Another  set  of  limitations  concerns  problems  of 
memory.  Both  individuals  and  organizations  may 
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have  problems  in  storing  and  retrieving  informa¬ 
tion.  Individuals  may  forget  or  may  have  problems 
of  capacity  concerning  keeping  records  of  actions 
and  experiences,  or  their  memories  may  be  dis¬ 
torted.  Organizations  may  have  problems  in  sys¬ 
tematically  collecting  information,  interpreting 
and  storing  it,  and  retrieving  information  or  mak¬ 
ing  it  fully  available  throughout  the  organization 
because  of  lack  of  capacity  and  competence  or 
organizational  barriers. 

A  third  set  of  limitations  is  problems  of  compre¬ 
hension.  Individual  actors  and  organizations  may 
have  problems  of  capacity  concerning  aggregating, 
organizing,  using,  and  inferring  from  information. 
It’s  therefore  often  difficult  to  learn  from  history 
about  the  relationships  between  actors  and  events, 
not  to  mention  predicting  further  development. 
The  information  may  be  there,  but  actors  may  fail 
to  put  it  together  in  a  comprehensive  way.  The 
more  complex  the  society  and  the  organizations 
themselves,  the  more  prevalent  such  problems  will 
be.  Actors  may  also  compete  in  defining  and  learn¬ 
ing  from  history,  partly  because  they  have  experi¬ 
ences  from  different  parts  of  an  organization  or  see 
their  experiences  through  different  lenses  of  norms 
and  values — for  example,  related  to  higher  educa¬ 
tion  and  professions.  There  may  also  be  some 
wishful  thinking  or  superstitious  learning  going 
on.  Actors  may  think  that  what  happened  had  to 
happen  or  that  all  good  things  in  their  organiza¬ 
tion  are  related  to  their  own  actions  and  all  bad 
things  to  the  actions  of  other  actors. 

A  fourth  set  of  limitations  deals  with  problems 
of  communication.  The  more  complex  and  special¬ 
ized  information  and  knowledge  are,  the  greater 
the  problems  of  communicating  information.  It’s 
difficult  to  share  information  and  knowledge 
across  generations,  cultures,  organizations,  levels, 
tasks,  professions,  and  organizational  units. 
Specialized  formal  structures  may  be  good  at  han¬ 
dling  information  internally  but  not  at  communi¬ 
cating  and  sharing  their  experiences  externally. 
Different  units  or  groups  in  an  organization  may 
use  different  information  or  different  methods  to 
simplify  the  world  surrounding  them. 

March’s  Dynamics  of  Decision  Making 

A  key  to  understanding  bounded  rationality  is  to 
understand  that  decision  makers  are  selecting 


decision  premises  to  simplify  the  decision  situa¬ 
tions  and  make  them  easier  to  cope  with,  so  a 
major  question  is  how  they  are  able  to  make  such 
a  selection.  James  G.  March,  in  interpreting  sev¬ 
eral  psychological  studies  of  individual  decision 
making,  says  that  there  seem  to  be  several  ways 
to  react  to  cognitive  limitations  and  constraints. 
One  is  to  use  stereotypes  to  interpret  actors  and 
events.  Decision  makers  construct  patterns  of 
attitudes  or  traits  and  accordingly  interpret  other 
actors’  behavior  based  on  the  typologies.  They 
imagine  and  define  intent  based  on  observing 
other  actors’  behavior  or  the  consequences  of  that 
behavior.  They  define  certain  aspects  or  elements 
of  a  problem  as  crucial  and  ignore  the  other  ele¬ 
ments.  Their  understanding  is  based  on  socially 
developed  theories,  scripts,  and  schemas,  where 
missing  information  is  filled  in  based  on  interpre¬ 
tations  of  events,  and  they  tend  to  ignore  inconsis¬ 
tencies  in  their  models  of  thought  and  action.  All 
this  makes  it  easier  to  comprehend  information 
and  to  act  systematically. 

The  simplified  interpretations  of  problems, 
solutions,  and  actors  that  decision  makers  have  is 
often  path  dependent,  because  their  search  for 
information  is  often  biased  and  limited,  meaning 
that  they  search  where  they  recently  have  found 
problems  and  solutions.  In  this  way,  they  save 
resources  and  modify  their  capability  restraints. 
They  see  what  they  like  to  see,  and  like  what  they 
see,  and  overlook  information  and  events  that  do 
not  fit  into  their  simplified  world.  In  that  respect, 
they  often  get  their  image  of  the  world  confirmed. 

March  summarizes  the  studies  on  simplification 
processes  by  describing  four  types  of  simplifica¬ 
tions  that  decision  makers  use.  One  type  is  editing, 
meaning  that  actors  tend  to  edit  and  simplify 
goals,  preferences,  problems,  alternatives,  and 
effects  before  entering  into  a  decision  process  or 
making  decisions,  leaving  only  a  small  number  of 
premises  or  decision  elements  left,  which  are  com¬ 
bined  in  a  simple  way.  Some  information  is  over¬ 
looked  or  not  processed,  decision  premises  and 
alternatives  are  attended  to  in  a  sequential  way,  or 
decision  elements  are  weighted  unequally.  Sequen¬ 
tial  attention  may  solve  problems  of  consistency  in 
the  short  run  but  increases  long-term  consistency 
problems. 

The  second  simplification  method  is  decomposi¬ 
tion,  meaning  that  actors  decompose  complex 
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problems  and  alternatives  into  simpler  elements 
that  are  easier  to  cope  with.  The  expectation  here 
is,  of  course,  that  attending  to  the  separate  ele¬ 
ments  will  eventually  lead  to  an  acceptable  com¬ 
mon  outcome.  One  way  of  decomposing  is  to 
work  backward,  because  some  problems  are  easier 
to  solve  that  way  as  it  gives  actors  the  opportunity 
to  define  an  end  state  and  come  up  with  reasons 
for  that  to  happen;  that  is,  they  can  reconstruct 
history  or  make  a  rationalization  afterward. 
Decomposing  is  often  easier  to  enact  in  a  loosely 
coupled  world  where  actors,  problems,  and  solu¬ 
tions  are  mostly  separate — for  example,  when 
organizational  units  are  semi-autonomous,  as  in  a 
large  university. 

The  third  method  is  labeled  heuristics,  meaning 
that  decision  makers  see  patterns  in  a  decision  situ¬ 
ation  and  apply  rules  or  procedures  for  appropri¬ 
ate  behavior  that  match  those  situations.  Heuristics 
are  rules  of  thumb  that  are  related  to  calculating 
probabilities  in  complex  decision  situations,  and 
the  more  differentiated  this  calculation  is  the  more 
easily  actors  may  cope  with  limitations  of  cogni¬ 
tion  and  information. 

The  fourth  of  March’s  types  is  called  framing, 
meaning  that  decisions  are  framed  by  ideas,  atti¬ 
tudes,  and  beliefs  that  define  problems  to  focus  on, 
information  to  collect,  and  dimensions  to  attend 
to.  Actors  have  paradigms  that  focus  and  simplify 
preferences  and  options,  something  that  makes 
sense  of  the  decision  situation.  A  problem  may  be 
framed  in  a  narrow  and  local  way,  with  limited 
trade-offs,  to  find  a  set  of  sufficient  conditions  for 
solving  it.  Frames  of  beliefs  and  cognition  are  part 
of  both  the  conscious  and  the  unconscious  reper¬ 
toires  of  decision  makers.  Frames  are  also  path 
dependent  and  have  a  tendency  to  be  used  again 
and  again,  something  that  increases  the  compe¬ 
tence  of  an  actor  and  often  the  actor’s  ability  to 
convince  other  actors. 

Effects  of  Formal  Structures 
on  Decision  Making 

Decision  makers  operate  inside  formal  organiza¬ 
tions,  and  the  formal  structure  is  an  important 
element  in  understanding  the  existence  of  and  cop¬ 
ing  with  bounded  rationality.  The  formal  structure 
both  helps  and  limits  decision  makers  at  the  same 
time.  It  helps  them  look  at  the  world  in  a  simplified 


way  but  at  the  same  time  limits  their  ability  to  see 
connections  and  complexities,  which  would  make 
them  better  equipped  to  have  a  broader  perspective 
on  their  own  tasks  and  roles.  So  where  decision 
makers  are  situated  in  a  formal  organization  deter¬ 
mines  what  kind  of  limitations  they  are  experienc¬ 
ing.  Leaders  are  supposed  to  design  the  formal 
structure  not  only  to  use  specialization  to  limit 
cognitive  problems  but  also  to  have  a  more  holistic 
perspective  on  how  the  specialized  roles  and  units 
fit  together  to  achieve  collective  goals. 

Luther  Gulick,  in  his  seminal  work  on  the 
design  and  organization  of  the  federal  governmen¬ 
tal  apparatus  in  the  United  States,  pointed  to  the 
fact  that  there  are  two  major  principles  of  formal 
organizations,  namely,  specialization  and  coordi¬ 
nation,  two  factors  that  are  in  a  dynamic  relation¬ 
ship.  Specialization  has  a  vertical  and  horizontal 
dimension  and  also  an  intra-  and  interorganiza- 
tional  dimension.  Combining  the  two  dimensions 
gives  us  four  types  of  specialization:  (1)  intra- 
organizational  vertical  specialization  means  the 
hierarchy  inside  an  organization  and  the  way 
authority  is  allocated  among  leadership  levels, 

(2)  interorganization  vertical  specialization  is 
exemplified  by  the  specialized  relationship  between 
ministries  and  agencies  in  the  public  sector, 

(3)  intra-organizational  horizontal  specialization 
denotes  the  specialization  of  roles  and  tasks/ 
functions  on  each  level  inside  a  formal  organiza¬ 
tion,  (4)  and  interorganizational  horizontal 
specialization  is  exemplified  by  the  specialization 
among  ministries  or  agencies. 

Organizational  Levels  and 
Bounded  Rationality 

The  relevance  of  this  typology  for  bounded  ratio¬ 
nality  should  be  evident.  As  regards  the  first  type — 
intra-organizational  vertical  specialization — James 
Thompson  makes  a  distinction  between  three 
levels  in  an  organization:  (1)  the  institutional  or 
leadership  level,  characterized  by  much  discretion 
because  of  many  partly  competing  or  ambiguous 
formal  and  informal  norms  and  values;  (2)  the 
middle-managerial  level,  characterized  by  cross¬ 
pressure  between  demands  from  both  the  leader¬ 
ship  level  and  the  grassroots  level;  and  (3)  the 
operative  or  grassroots  level,  characterized  by 
street-level  bureaucrats  often  following  formal 
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rules.  On  the  top  level,  leaders  have  both  more 
discretion  and  more  authority  to  define  their  own 
roles  than  on  the  other  levels,  but  at  the  same  time, 
there  are  often  more  acute  constraints  on  their 
rationality  because  of  their  having  to  attend  to  a 
more  complex  set  of  goals,  preferences,  problems, 
alternatives,  and  consequences.  On  the  middle 
level,  leaders  have  less  discretion  to  enact  their 
roles  than  at  the  top  level  but  more  than  for  actors 
at  the  operative  level.  But  their  limitation  of  ratio¬ 
nality  may  be  rather  problematic  to  cope  with 
because  they  are  in  a  cross-pressure  between  the 
top  and  lower  levels.  The  operative  bureaucrats 
occupy  a  role  with  much  less  discretionary  space. 
This  makes  their  rationality  limitations  much  easier 
to  cope  with,  even  though  they  may  have  problems 
seeing  their  roles  in  a  more  holistic  perspective. 

Vertical  interorganization  specialization,  as 
exemplified  by  specialization  between  ministries 
and  agencies,  may  also  illustrate  bounded  rational¬ 
ity.  Studies  of  such  public  organizations  have 
shown  that  ministries,  being  on  the  highest  hierar¬ 
chical  level,  have  the  broadest  and  most  compre¬ 
hensive  tasks,  such  as  strategic  and  policy-oriented 
tasks,  meaning  that  they,  like  Thompson’s  institu¬ 
tional-level  leadership,  will  have  to  cope  with  the 
combination  of  wide  discretionary  space  and  chal¬ 
lenging  limitations  on  their  rationality.  Agencies 
have  more  of  narrow  and  technical  tasks,  based  on 
highly  specialized  professional  competence,  mak¬ 
ing  their  discretionary  space  narrower  but  proba¬ 
bly  their  rationality  limitations  less  challenging. 

Gulick  defines  four  principles  of  horizontal  spe¬ 
cialization — based  on  (1)  purpose  (e.g.,  education 
or  defense),  (2)  process  (e.g.,  specialists  in  part 
of  a  process  or  coming  from  a  particular  higher- 
education  group),  (3)  clientele  (e.g.,  patient  groups 
in  hospitals),  or  (4)  geography  (certain  levels  or 
areas).  In  each  case,  the  different  principles  afford 
both  certain  rationality  constraints  and  opportuni¬ 
ties.  Civil  servants  working  with  a  specialized 
policy  will  have  advantages  concerning  cognitive 
limitations,  because  they  are  defined  as  specialists 
in,  for  example,  education,  defense,  health,  or 
social  services.  Civil  servants  such  as  jurists  work¬ 
ing  on  rule  development  will  have  the  same  advan¬ 
tages,  as  will  professionals  catering  to  certain  client 
groups.  And  bureaucrats  attending  to  control  tasks 
on  lower  levels  or  to  developing  certain  geographi¬ 
cal  areas  will  also  operate  under  the  same  type  of 


logic.  The  limitation  problems  for  all  these  special¬ 
ized  bureaucrats  may,  of  course,  be  related  to  hav¬ 
ing  an  overly  narrow  perspective  when  there  are 
demands  for  a  wider  perspective  and  collaboration 
across  tasks  and  sectors.  Specialization  among 
ministries  and  agencies  also  exemplifies  the  same 
logic. 

According  to  Gulick,  coordination  is  related  to 
either  formal  organization  or  ideas.  Organized 
coordination  could  be  divided  into  hierarchical 
and  collegial  coordination,  both  having  intra-  and 
interorganizational  features.  Hierarchical  coordi¬ 
nation  is  the  mirror  image  of  vertical  specializa¬ 
tion;  that  is,  what  is  vertically  specialized  must  also 
be  coordinated  to  secure  superior  central  control 
and  steering.  With  respect  to  cognitive  capacity 
and  eventual  limitations,  the  advantage  of  leaders 
on  a  central  level  is  that  they  can  see  how  the  dif¬ 
ferent  organizational  levels,  roles,  and  tasks  are 
fitting  together.  The  limitations  are,  of  course, 
related  to  capacity  and  knowledge  about  what  is 
going  on  at  the  lower  levels;  the  more  complexity 
there  is  across  levels,  the  larger  are  the  challenges 
of  rational  calculation.  Related  to  new  public  man¬ 
agement  (NPM)  as  a  reform  wave,  there  has,  for 
example,  been  a  lot  of  discussion  on  the  delegation 
of  authority  and  functions/tasks  to  lower  levels. 
Such  delegation  may  potentially  threaten  the  infor¬ 
mation  basis  of  higher-level  leaders  and  therefore 
increase  their  cognitive  challenges,  something  that 
may  limit  their  collective  efforts.  Delegation  of 
autonomy  increases  the  participation  of  lower- 
level  actors  and  may  change  the  patterns  of  influ¬ 
ence  to  the  disadvantage  of  the  top  leadership. 

Collegial  coordination  poses  other  challenges. 
Here,  there  is  no  hierarchical  tension  and  knowl¬ 
edge  distance,  but  there  is  the  problem  of  heteroge¬ 
neity  and  different  interests  horizontally  inside  or 
across  organizational  units.  The  challenges  con¬ 
cerning  rationality  limitations  and  constraint  are 
greater  across  than  within  organizations,  and  the 
larger  the  organization,  the  more  heterogeneity 
there  is  in  tasks,  roles,  and  professional  expertise. 
The  different  ways  to  reach  a  decision  under  such 
circumstances  also  give  insight  into  bounded  ratio¬ 
nality.  One  way  is  to  vote,  making  winners  and 
losers,  or  majorities  and  minorities.  This  may  cre¬ 
ate  unambiguous  solutions  but  also  more  problems 
of  legitimacy,  thereby  resulting  in  problems  of 
implementation,  so  the  solutions  will  be  bounded 
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accordingly.  Compromise  may  lead  to  increasing 
support  but  also  to  more  ambiguous  solutions, 
making  for  cognitive  challenges  to  implementa¬ 
tion.  Sequential  attention  to  goals  and  solutions, 
and  quasi  solution  of  conflicts,  may  solve  pressing 
and  current  problems  of  consistency,  but  it  will 
also  create  long-term  challenges  of  consistency, 
probably  posing  the  largest  problems  of  cognitive 
limitations  for  leaders. 

Role  of  Organizational  Culture 

Coordination  through  ideas  may  mean  a  strong 
organizational  culture,  as  illustrated  by  Herbert 
Kaufman’s  seminal  book  on  forest  rangers  in  the 
United  States,  which  described  how  rangers  in  dif¬ 
ferent  geographical  locations  held  approximately 
the  same  cultural  views,  norms,  and  attitudes.  The 
cultural  part  may  imply  that  information  and  cog¬ 
nitive  limitations  are  modified,  because  actors 
intuitively  know  how  to  act  in  a  culturally  appro¬ 
priate  way.  A  strong  culture  may  imply  “trained 
incapacity,”  a  lack  of  ability  to  adapt  to  new  con¬ 
ditions,  as  well  as  potential  tension  with  norms 
related  to  formal  structures.  In  a  modern  setting, 
however,  a  culture  may  appear  strong  because  of 
the  prominence  of  myths  and  symbols  that  bind 
members  together,  while  in  fact  these  elements  may 
be  superficial  rather  than  actually  influencing 
behavior  and  decisions.  Myths  or  symbols  have  the 
advantage  as  rules  of  thumb  to  simplify  the  world, 
as  shown  in  NPM  concerning  scripts  such  as  com¬ 
petitive  tendering,  benchmarking,  marketization, 
or  privatization.  Scripts  or  concepts  may  result  in 
real  problem  solving  but  often  function  as  window 
dressing,  creating  images  of  the  organization. 

Organizational  Slack 

Theorizing  about  bounded  rationality  has  some¬ 
thing  to  do  with  slack  in  formal  organizations, 
with  slack  defined  as  the  distance  between  avail¬ 
able  resources  and  demands.  High  slack  means  that 
performance  exceeds  goals;  that  is,  demands  are 
satisfied,  and  the  organization  has  still  more 
resources  to  use.  This  means  often  that  the  search 
for  new  alternatives  is  modest,  slack  is  increasing, 
and  aspiration  is  decreasing.  But  in  such  situations, 
slack  can  be  used  for  innovation  or  experimenta¬ 
tion,  which  makes  it  easier  to  respond  to  later  crises 


without  severe  problems  related  to  cognitive  limita¬ 
tions.  Low  slack,  meaning  that  performance  is 
worse  than  expected,  will  stimulate  a  search  for 
alternatives  and  new  solutions,  but  the  resources 
available  for  the  search  are  few,  increasing  the  pos¬ 
sibility  of  severe  information  and  cognitive  limita¬ 
tion  and  eventually  failure. 

Bounded  rationality  may  also  be  related  to 
whether  leaders  and  formal  organizations  are 
experiencing  routine  situations  or  crisis,  turbu¬ 
lence,  and  unexpected  events.  In  routine  situations, 
the  pressure  on  information  and  cognitive  limita¬ 
tions  is  rather  low,  and  traditional  procedures  and 
solutions  can  be  used  without  too  much  use  of 
resources.  But  crises  and  turbulence  impose  much 
more  pressure  on  rational  calculation.  Leaders  can 
try  to  use  procedures  and  solutions  developed  for 
emergencies  to  simplify  the  challenges  but  will 
often  have  to  improvise  because  the  fit  between 
ready-made  solutions  and  the  crises  is  problematic. 
In  improvising,  they  can  choose  from  a  repertoire 
of  alternatives  and  solutions,  which  is  better  than 
starting  from  scratch,  but  there  is  no  guarantee 
that  the  responses  will  reflect  successful  adapta¬ 
tions  to  the  turbulent  conditions. 

In  political-administrative  decision  processes,  it 
is  often  the  case  that  saliency  is  important,  meaning 
that  political  and  administrative  leaders  will  use 
more  resources  and  expertise  to  attend  to  some 
processes  and  issues  rather  than  others.  This  often 
means  that  the  challenges  of  information  and  cog¬ 
nitive  limitations  are  better  attended  to.  But  what  is 
politically  and  administratively  important  may  vary 
over  time,  and  it  is  challenging  to  refocus  on  issues 
and  cases  that  have  been  defocused.  This  returns  to 
the  question  of  slack  and  whether  the  organization 
has  the  resources  to  face  such  challenges. 

Effects  of  Using  Bounded  Rationality 

What  are  the  effects  of  using  bounded  rationality 
as  an  important  principle  or  logic  in  formal  orga¬ 
nizations?  First,  bounded  rationality  has  the  poten¬ 
tial  for  increasing  both  efficiency  and  effectiveness, 
because  preferences,  goals,  problems,  solutions, 
and  effects  are  framed  in  such  a  way  that  these  can 
be  handled  satisfactorily.  And  bounded  rationality 
may  imply  that  one  is  going  to  the  core  of  policies 
and  issues  in  a  complex  world,  without  wasting 
resources  on  unimportant  factors  and  aspects. 


Realism  in  International  Relations  2217 


Second,  bounded  rationality  may  be  related  to 
developing  a  logic  of  appropriateness  and  a  certain 
cultural  path.  Leaders  participate  in  defining  the 
appropriate  aspects  of  a  matter,  which  sets  the 
stage  for  other  actors  to  behave  accordingly  in  an 
appropriate  way. 

Third,  bounded  rationality  may  also  make  it 
easier  to  use  symbols  and  myths  to  enhance  the 
legitimacy  of  leaders,  institutions,  and  processes. 
The  simplification  and  selection  of  decision¬ 
making  premises  may  make  the  “pedagogical  chal¬ 
lenge”  less  severe  when  relating  to  other  actors  or 
a  wider  audience. 

Three  critical  remarks  could,  however,  be  added. 
In  fact,  bounded  rationality  may  lead  to  ineffective¬ 
ness  and  inefficiency.  The  reason  for  this  may  be  (a) 
lack  of  enough  knowledge  to  make  the  selection  of 
the  “right”  preferences,  goals,  solutions,  alterna¬ 
tives,  and  potential  consequences  or  (b)  that  such 
elements  are  more  the  result  of  a  negotiation  pro¬ 
cess  than  an  analytical  process,  making  it  harder  to 
handle  the  decision  processes  and  implement  poli¬ 
cies.  The  focus  may  be  more  on  certain  minor  or 
deviant  aspects  of  a  complex  problem  and  not  on 
the  core,  which  may  lead  to  reruns  of  the  processes 
and  a  lack  of  stability  in  solutions.  In  addition,  if 
this  is  the  case,  the  developing  logic  of  appropriate¬ 
ness  may  lead  to  dysfunctions  or  increasing  tension 
and  conflicts  with  the  environment.  Moreover,  a 
“failure”  in  the  process  of  selection  and  modifica¬ 
tion  of  limitations  on  rationality  may  be  covered 
up  with  extensive  use  of  symbols,  thus  potentially 
eroding  popular  support  for  the  leaders  and  mak¬ 
ing  the  followers  more  critical  and  cynical.  A  coun¬ 
terargument  to  these  three  critical  points  would,  of 
course,  be  that  there  is  no  such  thing  as  objectivity 
concerning  the  selection  of  decision  premises  in  the 
logic  of  bounded  rationality. 

Garbage  Can  Theory 

Bounded  rationality  is  the  central  part  of  instru¬ 
mental  decision-making  theory  and  is  often 
contrasted  with  a  more  economic  man-oriented 
rational  theory.  One  can,  however,  also  make 
another  and  different  connection  for  the  relevance 
of  bounded  rationality.  Garbage  can  theory,  devel¬ 
oped  by  Michael  D.  Cohen,  James  G.  March,  and 
Johan  P.  Olsen  in  the  early  1970s,  stresses  that 
decision  makers  are  part-time  participants  and 


decision  situations  represent  diverse  and  ambigu¬ 
ous  stimuli,  resulting  in  unpredictable  decision¬ 
making  processes  with  participants,  problems, 
solutions,  and  decision  opportunities  coming  and 
going,  selected  and  coupled  in  unexpected  ways. 
This  type  of  theory  could  be  seen  as  an  extension 
of  the  cognitive  parts  of  bounded  rationality;  that 
is,  the  rationality  of  actors  may  be  so  bounded  or 
limited  that  “organized  chaos”  is  the  result. 
Whereas  bounded  rationality  stresses  the  impor¬ 
tance  of  formal  structure  for  the  views  and 
thoughts  of  actors,  garbage  can  theory  focuses 
more  on  the  relative  lack  of  such  structures,  creat¬ 
ing  more  of  an  “organizational  irrationality.” 
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Realism  in  International 
Relations 


The  concept  of  realism  was  developed  in  the 
1930s  and  1940s  as  an  alternative  to  attempts  to 
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understand  and  conduct  international  relations 
based  on  international  law.  Its  founders,  Nicholas 
J.  Spykman,  Edward  Mead  Earle,  Frederick 
Schuman,  E.  H.  Carr,  John  Elerz,  William  T.  R. 
Fox,  and  Elans  J.  Morgenthau,  asserted  the  real¬ 
ism  of  their  power-based  approaches  to  foreign 
policy,  in  contrast  to  what  they  described  as  the 
dangerous  idealism  of  their  predecessors.  Writing 
in  response  to  the  appeasement  of  fascist  Italy  and 
Nazi  Germany,  which  they  held  responsible  in 
large  part  for  World  War  II,  realist  arguments  had 
great  resonance. 

In  the  United  States,  realism  appeared  to  offer 
intellectual  justification  for  the  Cold  War  and  a 
range  of  policies  pursued  by  the  Truman, 
Eisenhower,  and  Kennedy  administrations  that 
were  otherwise  at  odds  with  proclaimed  American 
values.  By  the  mid-1950s,  realism  had  become  the 
dominant  paradigm  in  international  relations  and 
remained  so  until  the  Berlin  Wall  came  down. 
Since  the  end  of  the  Cold  War,  realism  is  at  best 
primus  inter  pares,  as  liberalism,  constructivism, 
feminism,  philosophical  realism,  and  poststructur¬ 
alism  have  all  become  more  prominent.  Realism 
arguably  remains  the  dominant  frame  of  reference 
for  policymakers,  although  even  here,  liberalism 
especially  has  made  important  inroads. 

Three  Waves  of  Realism 

Looking  back  on  the  post-World  War  II  era,  we 
see  three  waves  of  realism,  each  with  important 
implications  for  the  study  and  practice  of  interna¬ 
tional  relations  and  foreign  policy. 

The  First  Wave:  Flans  Morgenthau 

The  first  wave,  represented  by  the  writings  of 
the  realists  mentioned  in  the  paragraph  above,  was 
far  and  away  the  most  influential.  Elans 
Morgenthau’s  Politics  Among  Nations,  its  princi¬ 
pal  text,  was  first  published  in  1948  and  went 
through  six  editions,  the  last  two  published  post¬ 
humously  in  1985  and  2006.  Within  the  United 
States,  and  to  a  lesser  extent  in  Europe,  it  became 
a  core  reading  in  undergraduate  international  rela¬ 
tions  courses.  Morgenthau  became  a  prominent 
public  figure  in  the  United  States  and  achieved  a 
certain  notoriety  for  his  early  support  of  the  civil 
rights  movement  and  opposition  to  the  Vietnam 


War.  In  “The  Purpose  of  Political  Science,” 
Morgenthau  (1966,  p.  77)  lamented  that  the  mak¬ 
ers  of  American  foreign  policy  had  “overlearned” 
the  lessons  of  realism  and  lost  sight  of  the  appro¬ 
priate  moral  and  legal  limitations  on  the  use  of 
power. 

In  the  first  edition  of  Politics  Among  Nations, 
Morgenthau  (1948)  maintained  that  power  is  the 
currency  of  politics  and  should  be  the  first  concern 
of  states.  He  divided  the  world  into  status  quo, 
revisionist,  and  prestige-seeking  states  (pp.  21-25). 
Status  quo  states  were  intent  on  maintaining  exist¬ 
ing  territorial  divisions  and  revisionist  states  on 
overturning  them.  Prestige-seeking  states  sought  to 
demonstrate  their  power  to  gain  more  influence. 
Morgenthau  insisted  that  state  goals  were  not 
essentialist  but  changed  over  time  and  were  not 
always  apparent  to  outside  observers.  He  thought 
that  peace  was  most  likely  to  be  preserved  when 
status  quo  powers  were  more  powerful  than  their 
revisionist  challengers  and  that  the  power  of  either 
group  could  be  enhanced  through  alliances. 
Morgenthau,  and  realists  more  generally,  speak  of 
the  balance  of  power,  which  can  mean  the  distribu¬ 
tion  of  power  in  the  system  at  any  given  moment, 
the  balance  between  status  quo  and  revisionist 
states,  or  an  advantage  in  favor  of  the  former. 
Morgenthau  used  it  all  three  ways,  to  the  conster¬ 
nation  of  his  critics. 

The  balance  of  power  assumes  some  capability 
of  measuring  power  and  its  distribution.  This  has 
proven  difficult  for  realists  of  all  persuasions. 
Morgenthau  conceived  of  power  as  an  intangible 
quality  that  had  diverse  material  and  political  com¬ 
ponents,  among  them  territory,  population,  national 
resources,  industrial  capacity,  military  prepared¬ 
ness,  national  character,  morale,  and  the  quality  of 
diplomacy  and  government.  None  of  these  attri¬ 
butes  translated  directly  into  power  because  power 
was  “a  psychological  relation  [ship]”  that  gave 
those  who  wield  it  control  over  certain  actions  of 
others  “through  the  influence  which  the  former 
exert  over  the  latter’s  minds.”  In  Politics  Among 
Nations,  first  edition,  Morgenthau  (1948)  asserted 
that  “of  all  the  factors  which  make  for  the  power 
of  a  nation,  the  most  important  is  the  quality  of 
diplomacy”  (p.  14).  The  other  attributes  of  national 
power  are  the  raw  materials  out  of  which  the 
power  of  a  nation  is  fashioned.  Diplomacy  “com¬ 
bines  those  different  factors  into  an  integrated 
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whole,  gives  them  direction  and  weight,  and  awak¬ 
ens  their  slumbering  potentialities  by  giving  them 
the  breath  of  actual  power”  (p.  105). 

Morgenthau  described  his  theory  as  scientific, 
but  his  description  of  the  nature  of  power  and  the 
balance  of  power  indicates  that  the  measurement 
and  exercise  of  power  and  the  construction  of  a 
successful  balance  of  power  is  more  art  than  sci¬ 
ence.  This  is  most  evident  in  his  analysis  of  bipo¬ 
larity  and  its  consequences.  The  term  bipolarity 
was  coined  by  William  T.  R.  Fox  in  1944  to 
describe  the  domination  of  the  postwar  interna¬ 
tional  system  by  two  “superpowers”:  the  United 
States  and  the  Soviet  Union.  Morgenthau  recog¬ 
nized  the  world  as  bipolar  in  the  mid-1950s 
because  the  power  of  these  two  states  had  become 
so  “overwhelming”  in  comparison  with  allies  and 
third  parties  that  “through  their  own  preponder¬ 
ant  weight  they  determine  the  balance  of  power 
between  them.”  The  balance  of  power  could  no 
longer  be  “decisively  affected”  by  changes  in  the 
alignments  of  their  allies,  at  least  in  the  foreseeable 
future.  Nor  could  lesser  powers  easily  defect  from 
alliances,  because  “the  two  giants”  had  the  power 
to  “hold  them  there  even  against  their  will” 
(Morgenthau  [Politics  Among  Nations],  1948, 
pp.  270-274;  1951,  pp.  48,  52-54).  The  flexibility 
of  the  balance  of  power  and  its  restraining  influ¬ 
ence  on  the  power  aspirations  of  the  main  protago¬ 
nists  had  disappeared.  The  superpowers  were  free 
to  define  their  respective  positions  as  vital  interests 
and  engage  each  other  with  every  means  at  their 
disposal  in  every  arena  in  which  they  competed.  In 
this  novel  situation,  “the  give  and  take  of  compro¬ 
mise  becomes  a  weakness  which  neither  side  is  able 
to  afford”  (Politics  Among  Nations,  1962  edition, 
p.  285). 

Morgenthau  was  clearly  uncomfortable  with 
the  pessimistic  implications  of  his  analysis  and 
sought  to  hold  out  a  ray  of  hope  for  the  future.  In 
the  first  edition  of  Politics  Among  Nations, 
Morgenthau  (1948)  argued  that  “the  changed 
structure  of  the  balance  of  power  has  made  the 
hostile  opposition  of  two  gigantic  power  blocs 
possible,  but  it  has  not  made  it  inevitable.” 
Bipolarity  has  the  potential  for  “unheard-of  good 
as  well  as  for  unprecedented  evil”  (pp.  285-286). 
In  the  last  resort,  peace  does  not  depend  on  the 
nuclear  balance  but  on  the  moral  quality  of  leaders 
and  their  willingness  to  place  the  common  goal  of 


survival  above  the  pursuit  of  unilateral  advantage. 
Morgenthau’s  theory,  in  contrast  to  those  of  his 
neorealist  successors,  stresses  the  independent 
power  of  agency. 

In  the  third,  1962,  edition  of  Politics  Among 
Nations,  Morgenthau  recognized  additional  incen¬ 
tives  for  superpower  restraint.  He  speculated  that 
the  experience  of  the  Korean  War  may  have  taught 
Moscow  and  Washington  that  they  have  to  adapt 
their  policies  to  the  wishes  of  their  allies  “if  they 
wanted  to  draw  the  maximum  of  strength  from 
their  support”  (pp.  351-352).  The  emergence  of  a 
number  of  newly  independent  and  unaligned  states 
might  also  serve  the  cause  of  restraint.  The  third 
edition  continued  to  describe  bipolarity  as  on  the 
whole  inimical  to  peace  (pp.  362-363).  In  the  fifth 
edition,  published  in  1972,  Morgenthau  expressed 
cautious  optimism  (pp.  355-356).  Detente,  explicit 
recognition  of  the  territorial  status  quo  in  Europe, 
a  corresponding  decline  in  ideological  confronta¬ 
tion,  the  emergence  of  third  forces  (e.g.,  Japan, 
China,  West  Germany),  and  the  damaging  effects 
of  the  Vietnam  War  on  American  power  had  made 
both  superpowers  more  cautious  and  respectful  of 
the  status  quo.  For  all  practical  purposes,  the  de 
facto  acceptance  of  the  postwar  division  of  Europe 
had  ended  the  Cold  War. 

For  Morgenthau,  the  success  of  the  balance  of 
power  for  the  better  part  of  three  centuries  was  less 
a  function  of  the  distribution  of  capabilities  than  it 
was  of  the  underlying  values  and  sense  of  commu¬ 
nity  that  bound  together  the  actors  in  the  system. 
When  the  European  value  consensus  broke  down, 
as  it  did  from  the  first  partition  of  Poland  through 
the  Napoleonic  Wars,  the  balance  of  power  no 
longer  functioned  to  preserve  the  peace  or  integrity 
of  the  members  of  the  system.  The  consensus 
broke  down  again  in  the  20th  century,  with  even 
more  disastrous  consequences.  At  midcentury, 
Morgenthau  was  deeply  pessimistic  about  the 
future.  The  balance  of  power  was  at  its  nadir. 
There  were  two  great  powers  instead  of  many, 
Britain  no  longer  had  the  capability  to  play  the  role 
of  balancer,  the  colonial  frontier  had  disappeared, 
and  one  of  the  principal  powers  rejected  the  very 
premises  of  the  international  order.  International 
politics  had  been  reduced  “to  the  primitive  spec¬ 
tacle  of  two  giants  eyeing  each  other  with  watchful 
suspicion”  (Morgenthau,  Politics  Among  Nations, 
first  edition,  1948,  p.  285). 
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Morgenthau’s  theory  is  descriptive  and  prescrip¬ 
tive.  “Realism,”  he  insists,  is  superior  to  “idealist” 
approaches  on  both  counts.  It  is  more  rigorous 
because  its  axioms  are  logically  derived  from  its 
starting  assumptions.  It  is  empirically  valid  because 
“the  facts  as  they  are  actually  lend  themselves  to  the 
interpretation  the  theory  has  put  upon  them” 
(Politics  Among  Nations,  1962  edition,  p.  1).  Mor- 
genthau  has  been  accused  of  making  contradictory 
claims:  He  justifies  his  theory  on  the  grounds  that  it 
provides  a  better  description  of  reality,  but  at  the 
same  time,  he  criticizes  French  and  British  policy¬ 
makers  in  the  1930s,  and  their  American  counter¬ 
parts  in  the  1960s,  for  acting  in  sharp  violation  of 
its  principles.  He  dismissed  this  criticism  as  beside 
the  point.  Politics  Among  Nations  did  not  aim  at  an 
“indiscriminate  description  of  political  reality”  but 
was  an  attempt  to  develop  a  “rational  theory  of 
politics.”  The  balance  of  power  was  “an  ideal  sys¬ 
tem,”  and  in  his  more  pessimistic  moments,  Mor- 
genthau  was  willing  to  admit  that  it  was  “scarcely 
found  in  reality.”  Realism  provided  a  benchmark 
against  which  actual  policies  could  be  understood 
and  evaluated.  For  the  same  reason,  it  contained  a 
strong  normative  element.  It  was  a  “theoretical 
construct”  of  a  fully  rational  and  informed  foreign 
policy  that  “experience  can  never  completely 
achieve”  but  that  can  be  used  as  a  guide  for  making 
and  assessing  policy  (p.  8). 

The  Second  Wave:  Neorealism 

Neorealism  or  structural  realism,  developed  by 
Kenneth  Waltz  (1979),  represents  an  attempt  to 
purge  the  tensions  and  contradictions  of  Morgenthau 
and  other  realists  of  his  generation  to  construct  a 
more  scientific  theory  based  entirely  at  the  system 
level.  The  starting  point  for  Waltz  is  the  alleged  anar¬ 
chy  of  the  international  system,  which  makes  it  a 
“self-help”  system.  Anarchy  requires  states  not  only 
to  make  the  acquisition  of  power  their  first  goal  but, 
as  Waltz’s  disciple  John  Mearsheimer  (2006)  insists, 
to  accumulate  sufficient  economic  and  military  capa¬ 
bilities  “to  make  sure  that  no  other  state  sharply 
shifts  the  balance  of  power  in  its  favour”  (p.  72).  For 
Neorealists,  the  yardstick  of  power  is  relative  not 
absolute  power  because  states  must  assess  their 
power  relative  to  that  of  other  political  units. 

Drawing  on  Nietzsche  and  Weber,  Morgenthau 
attributed  conflict  at  every  level  of  social  aggregation 


to  human  nature:  People  were  driven  by  a  desire  to 
dominate  others  ( animus  dominiandi)  and  to  use 
them  for  their  own  ends.  This  behavior  was  less 
constrained  at  the  international  level  due  to  the 
relative  absence  of  institutions,  rules,  and  mecha¬ 
nisms  for  channeling  this  drive  into  more  socially 
acceptable  channels.  Neorealists  find  the  explana¬ 
tion  for  interstate  conflict  entirely  at  the  system 
level.  They  invoke  the  “security  dilemma,”  a  phe¬ 
nomenon  first  described  by  John  Herz,  another 
prominent  first-generation  realist.  The  uncertainty 
of  the  international  environment  encourages  states 
to  seek  a  margin  of  safety  in  their  military  power. 
Their  buildups  and  deployments  threaten  other 
states,  who  respond  the  same  way.  Because  of  the 
security  dilemma,  international  conflict  would 
arise  even  in  a  system  in  which  every  power  had 
only  benign  intentions.  Unlike  Morgenthau,  for 
most  neorealists,  regime  type  and  regime  goals  are 
irrelevant.  As  the  system  itself  causes  conflict, 
actors,  in  the  words  of  Mearsheimer  (2006),  can 
be  treated  as  “black  boxes.” 

For  Waltz  (1979),  the  number  of  actors  and  the 
distribution  of  the  power  in  the  system  are  the  key 
variables  for  understanding  international  relations  as 
they  determine  the  polarity  of  the  system.  Unipolar 
systems,  dominated  by  a  single  hegemon,  are  the 
most  stable  because  the  hegemon  has  enough  power 
to  enforce  order,  as  Rome  did  for  centuries  through¬ 
out  the  Mediterranean  basin.  Bipolar  systems  also 
have  a  degree  of  stability  because  each  of  the  two 
dominant  powers  is  strong  enough  to  protect  itself 
against  any  possible  combination  against  it  and  is 
accordingly  less  worried  about  the  addition  or  defec¬ 
tion  of  allies  than  great  powers  in  a  multipolar  sys¬ 
tem.  Multipolarity  is  the  least  stable  system  because 
there  is  not  that  much  difference  in  strength  among 
the  leading  powers,  making  alliances,  and  additions 
to  and  defections  from  them,  critical  to  the  balance 
of  power.  The  balance  between  opposing  alliances  is 
also  much  more  difficult  to  calculate  because  it 
requires  estimates  of  the  strength  and  intentions  of 
multiple  powers.  Miscalculation  becomes  an  addi¬ 
tional  source  of  war  in  a  multipolar  system. 

Neorealism  is  not  monolithic.  Its  adherents  dif¬ 
fer  in  important  respects.  In  contrast  to  Waltz’s 
commitment  to  a  parsimonious  system-level  the¬ 
ory,  Steven  Walt  (1987)  maintains  that  states  bal¬ 
ance  against  perceived  threats,  not  power.  The 
United  States,  he  contends,  never  worried  about 
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British  or  French  nuclear  arsenals,  only  about 
Soviet  and  Chinese  arsenals  because  both  were 
considered  as  adversaries.  Mearsheimer  (2001), 
author  of  his  own  theory  of  international  relations, 
builds  on  the  distinction  between  offensive  and 
defensive  realism.  Traditional  realists,  whom  he 
describes  as  defensive  realists,  maintain  that  other 
powers  will  balance  against  any  state  that  becomes 
relatively  too  powerful.  Mearsheimer  advocates 
offensive  realism  on  the  grounds  that  other  powers 
often  bandwagon  (make  an  accommodation  with 
the  dominant  power)  or  buck-pass  (sit  on  the  side¬ 
lines)  in  preference  to  joining  a  balancing  coali¬ 
tion.  This  behavior  and  offensive  dominance — the 
likelihood  that  states  who  start  wars  will  win 
them — create  strong  incentives  for  aggression. 

Realists  and  neorealists  disagree  about  the 
implications  of  nuclear  weapons.  Whereas  Mor- 
genthau  recognized  that  they  had  revolutionized 
warfare,  and  with  it,  international  relations,  Waltz 
(1981)  regarded  them  as  an  unalloyed  benefit  as 
they  had  a  powerful  deterrent  effect.  Other  real¬ 
ists,  such  as  Scott  Sagan,  stressing  the  problems  of 
command  and  control  and  the  fear  of  being  the 
victims  of  a  first-strike  attack,  saw  them  as  poten¬ 
tially  destabilizing. 

Critics  have  identified  numerous  conceptual 
and  empirical  problems  with  neorealism.  Alexander 
Wendt  (1992)  argued  that  anarchy  does  not  auto¬ 
matically  lead  to  a  self-help  system;  the  character 
of  the  international  system  depends  very  much  on 
how  states  respond  to  anarchy.  Richard  Ned 
Lebow  (1994)  argues  that  neorealism’s  definition 
of  power  is  as  problematic  as  Morgenthau’s.  Waltz 
(1979)  offers  a  definition  of  power  almost  identi¬ 
cal  to  Morgenthau’s.  He  then  asserts  the  over¬ 
whelming  importance  of  material  capabilities, 
especially  military  capabilities,  because  “force 
remains  the  final  arbiter”  of  international  affairs 
(p.  131).  The  superpowers  are  “set  apart  from  the 
others  ...  by  their  ability  to  exploit  military  tech¬ 
nology  on  a  large  scale  and  at  the  scientific 
frontiers”  (pp.  180-181).  He  insists  that  the  deter¬ 
mination  of  polarity  can  be  answered  by  “common 
sense”  and  that  the  world  was  bipolar  in  1945. 
Efforts  by  scholars  to  determine  when  bipolarity 
began  range  from  Waltz’s  1945  estimate  to  Mor¬ 
genthau’s  mid-1950s  estimate,  as  well  as  assertions 
by  some  that  the  postwar  world  was  never  bipolar. 
The  end  of  the  Cold  War  has  created  a  similar 


controversy,  with  Waltz  (1990)  arguing  that  the 
world  is  still  bipolar,  Mearsheimer  (1990)  that  it 
has  become  multipolar,  and  other  realists  (e.g., 
Charles  Krauthammer,  William  Wohlforth,  and 
Michael  Mastaduno)  contending  that  it  is  now 
unipolar. 

The  most  fundamental  problem  with  Waltz’s 
theory  is  that  it  is,  by  his  own  admission,  a  theory 
of  international  relations,  not  of  foreign  policy,  and 
thus  says  little  to  nothing  about  how  individual 
states  will  behave.  Its  principal  substantive  propo¬ 
sition  is  that  multipolar  systems  are  less  stable 
(enduring)  and  more  prone  to  war  than  bipolar 
ones.  Such  a  proposition  is  not  testable  because  we 
would  require  a  large  sample  of  both  kinds  of  sys¬ 
tems  and  there  have  been  very  few  bipolar  systems 
in  history.  It  is  also  meaningless  because  what  any¬ 
one  is  really  interested  in  is  the  likelihood  of  war  in 
a  particular  system,  and  no  inferences  can  be  made 
about  single  cases  from  statistical  distributions. 

Mearsheimer’s  theory,  elaborated  in  his  2001 
book,  The  Tragedy  of  Great  Power  Politics,  has 
also  encountered  strong  criticism.  Offensive  domi¬ 
nance  has  little  empirical  support.  According  to  the 
critics,  it  is  simply  not  true,  as  Mearsheimer  claims, 
that  initiators  win  most  of  the  wars  they  have 
begun.  Lebow  shows  that  in  the  30  wars  fought 
since  1945,  the  initiators  won  only  7.  Mearsheimer’s 
(1990,  2001)  predictions  that  NATO  would  not 
survive  the  end  of  the  Cold  War  and  that  Germany 
and  Japan  should  and  would  acquire  nuclear 
weapons  have  (fortunately)  not  come  to  pass. 

Third  Wave:  Classical  Realism 

The  third  wave — classical  realism — was  ushered 
in  by  the  peaceful  end  of  the  Cold  War,  which 
appeared  to  contradict  the  expectations  of  neoreal¬ 
ism.  Classical  realism  was  also  a  response  to  the 
widespread  recognition  that  problems  of  identity, 
globalization,  justice,  and  world  order  were  at  least 
as  important  as  the  balance  of  power  for  under¬ 
standing  contemporary  international  relations. 
Scholars  have  accordingly  revisited  the  works  of 
Thucydides  (Lebow,  2003),  Hobbes  (Michael 
Williams,  2006),  and  Morgenthau  (Lebow,  2003; 
Williams,  2007)  in  search  of  insights  relevant  to 
these  problems. 

Lebow’s  The  Tragic  Vision  of  Politics  (2003)  is 
perhaps  the  most  extensive  attempt  to  reconstruct 
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the  wisdom  of  classical  realism  through  hermeneu¬ 
tic  readings  of  the  texts  of  Thucydides,  Carl  von 
Clausewitz,  and  Morgenthau.  He  contends  that  all 
three  authors  wrote  in  the  aftermath  of  destructive 
wars  and  sought  to  reconstruct  order  by  drawing 
on  the  best  of  old  practices  and  the  most  promising 
new  ideas.  In  sharp  contrast  to  neorealists,  they 
recognized  that  the  principal  threat  to  world  order 
came  from  dominant  powers  who  succumbed  to 
hubris  and  overreached  themselves.  Such  powers 
were  more  effectively  constrained  by  domestic 
restraints  and  external  practices  and  institutions 
than  by  the  balance  of  power.  Support  for  this 
contention  also  comes  from  Stuart  Kaufman, 
Richard  Little,  and  William  Wohlforth’s  compre¬ 
hensive  2007  study  of  the  balance  of  power  in  the 
ancient  world  and  non-Western  systems.  They  find 
that  it  rarely  worked  to  preserve  the  peace  as  most 
realist  theories  suggest. 

Lebow  further  contends  that  classical  realists 
were  more  interested  in  influence  than  in  power. 
They  recognized  that  material  capabilities  were 
only  one  component  of  power  and  power  was  only 
one  source  of  influence.  Influence  is  relational,  and 
the  least  efficient  way  of  exercising  it  is  through 
threats  and  rewards.  Rewards  consume  resources, 
and  threats  generate  antagonism  because  they  make 
other  actors  aware  of  their  subordination.  Influence 
is  most  effectively  exercised  through  common  proj¬ 
ects  that  respond  to  common  needs  and  build  com¬ 
mon  identities.  Influence  of  this  kind  is  limited  to 
projects  considered  to  be  in  the  common  interest, 
and  repeated  efforts  to  advance  common  interests 
can  earn  honor  for  dominant  powers  and  the  cor¬ 
responding  willingness  of  others  to  accept  their 
leadership.  In  a  more  recent  work,  A  Cultural 
Theory  of  International  Relations,  Lebow  (2008) 
explores  the  role  of  honor  and  develops  a  construc¬ 
tivist  theory  of  international  relations,  but  one  that 
incorporates  the  key  tenets  of  classical  realism. 

Morgenthau  (1962)  thought  that  nuclear  weap¬ 
ons  and  environmental  degradation  were  problems 
that  could  not  be  addressed  by  nation-states  acting 
unilaterally.  To  address  them,  the  world  required 
“a  principle  of  political  organization  transcending 
the  nation-state  and  commensurate  with  the  poten¬ 
tialities  for  good  or  evil  of  nuclear  power  itself” 
(p.  76).  While  Waltz  and  the  neorealists  sought  to 
explain  the  status  quo,  Morgenthau  looked  beyond 
it.  The  principal  task  of  international  relations 


theorists  was  to  facilitate  the  research  by  laying  the 
groundwork  “for  a  new  international  order  radi¬ 
cally  different  from  that  which  preceded  it” 
(p.  76).  Thucydides,  Morgenthau,  and  Herz  are 
attractive  to  classical  realists  above  all  because  of 
their  concern  for  justice  and  the  transformational 
nature  of  their  projects. 

Richard  Ned  Lebow 
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Hanover,  New  Hampshire,  United  States 
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Redistribution 


The  term  redistribution  refers  to  the  criteria  and  the 
mechanisms  shaping  the  authoritative  allocation  of 


valuable  assets  in  a  given  society.  In  the  narrow 
sense  used  in  political  science,  it  refers  to  the  value- 
driven,  commanding  intervention  of  the  state  in 
society  to  extract  resources  from  some  groups 
(mainly  in  the  form  of  taxes)  in  order  to  give  them 
to  other  groups  (using  public  expenditure  and 
other  redistributive  mechanisms).  Thinkers  on  the 
left  of  the  ideological  spectrum  argue  that  the  state 
should  play  an  active  role  in  this  process,  while 
those  on  the  right  hold  that  interaction  in  a  free 
market  is  the  most  efficient  mechanism  of  distribu¬ 
tion.  In  what  follows,  there  are  clarifications  on  the 
scope  and  scientific  usage  of  the  term  redistribu¬ 
tion,  an  account  of  the  historical  evolution  of  the 
concept,  a  classification  of  political  science’s  main 
theories  on  the  role  of  the  state  and  the  dynamics 
of  competition  among  groups,  and  a  brief  discus¬ 
sion  of  the  empirical  indicators  of  redistribution. 

The  Concept  of  Redistribution 

In  the  social  sciences,  the  concept  of  redistribution 
refers  both  to  the  normative  questions  about  the 
ideas,  values,  and  criteria  of  justice  that  ought  to 
illuminate  the  pursuit  of  fairness  in  a  given  society 
(from  whom  to  collect?  to  give  to  whom?  why?) 
and  to  the  practical  question  about  the  means  by 
which  authorities  should  reallocate  resources  and 
valuable  goods  (the  most  common  mechanisms 
being  tax  exemptions,  public  expenditure,  cash 
transfers,  subsidies,  and  social  programs).  The 
comprehensiveness  of  the  term  cannot  be  separated 
from  the  meaning  of  its  predecessor,  distribution. 
While  distribution  is  an  economic  concept  that 
refers  to  the  allocation  of  resources  that  occurs  in 
society  as  a  result  of  the  free  interaction  between 
private  agents  (mainly  in  the  market  but  also  by 
means  of  philanthropy,  community  organization, 
international  cooperation,  etc.),  the  term  redistri¬ 
bution  indicates  the  intervention  of  the  state,  as  a 
public  authority,  in  the  sanction  of  the  criteria  to 
redistribute  and  in  the  administration  of  the  mech¬ 
anisms  of  redistribution.  In  the  narrow  usage  of 
their  respective  disciplines,  economists  tend  to 
focus  on  the  distribution  of  monetary  income  (sala¬ 
ries),  while  political  scientists  and  sociologies 
adopt  a  broader  approach,  looking  at  the  redistri¬ 
bution  of  wealth,  including  not  only  material  assets 
but  also  other  opportunities  in  life  such  as  health 
and  education  as  well  as  entitlements  and  rights. 
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Another  conceptual  distinction  between  these  two 
terms  is  that  distribution  occurs  when  there  are 
changes  or  there  is  reallocation  of  resources  within 
the  existing  rules  of  the  game  (e.g.,  compared  with 
retailers,  farmers  obtained  more  income  than  they 
did  in  2009),  whereas  redistribution  takes  place 
when  there  are  changes  in  the  rules  of  the  game 
(e.g.,  Congress  passed  a  law  granting  tax  exemp¬ 
tions  to  farmers). 

Explaining  Redistribution 

Concerns  about  establishing  norms  for  the  redistri¬ 
bution  of  valuable  goods  have  accompanied 
humanity  since  agriculture  replaced  nomadic  hunt¬ 
ing  as  the  principal  food  source,  calling  for  an 
unprecedented  level  of  organization  of  tasks 
including  the  division  of  labor,  accumulation  of 
stocks,  and  redistribution  of  the  benefits.  Formulas 
for  fair  distribution  of  goods  and  concern  for  the 
weakest  are  present  in  the  foundational  writings  of 
all  religions,  placing  the  judgment  of  fairness  on 
the  will  of  a  divine  god  and  offering  detailed 
instructions  to  mortals  on  how  to  implement  that 
justice.  The  return  to  humanism  and  the  emer¬ 
gence  of  the  secular  state  in  the  Renaissance  period 
put  both  the  judgment  and  the  responsibility  of 
redistribution  in  human  hands,  setting  the  stage 
for  a  debate  that  would  prevail  throughout  moder¬ 
nity:  the  discussion  between  those  arguing  for 
active  intervention  of  the  state  in  the  redistribution 
of  wealth  and  those  arguing  that  the  state  should 
have  a  limited  role.  Philosophers  and  economists 
such  as  Adam  Smith,  Thomas  Malthus,  and  Karl 
Marx  led  the  scholarly  debates  on  distribution  and 
redistribution  during  the  birth  of  the  Western 
nation-states,  which  were  built  on  the  foundations 
of  the  egalitarian  beliefs  consecrated  by  the  French 
Revolution  of  1789. 

The  initial  success  of  the  communist  alternatives 
in  the  Soviet  Union  and  China  in  the  early  1900s, 
the  horror  of  World  War  I,  the  emergence  of 
authoritarian  regimes  in  Central  Europe,  and  the 
economic  crises  of  Western  capitalism  in  1929 
prompted  questions  about  the  political  and  institu¬ 
tional  dimensions  of  human  organizations.  New 
covenants  of  coexistence  and  new  models  of  inter¬ 
action  between  the  haves  and  the  have-nots  were 
needed  to  redress  the  previous  concentrations  of 
wealth  resulting  from  the  aristocratic  governments 


of  the  1 9th  century  at  the  domestic  level  and  by  the 
colonial  exploitation  of  Africa,  the  Americas,  and 
Asia  at  the  international  level.  The  development  of 
the  welfare  state  can  be  situated  between  the  late 
19th  century  and  the  1980s,  with  the  most  arche¬ 
typical  examples  located  in  the  Nordic  countries  of 
Sweden,  Norway,  Denmark,  and  Finland.  Finns, 
for  instance,  enjoy  state-funded  education,  health, 
and  welfare  services,  as  well  as  unemployment 
insurance  and  retirement  benefits,  at  no  cost  for 
the  families  other  than  the  general  taxes.  The  long- 
lasting  impacts  of  these  universal  provisions  are 
still  evident.  According  to  World  Bank’s  2002 
World  Development  Indicators,  at  the  beginning  of 
the  21st  century,  48%  of  the  national  income  in 
Brazil  was  concentrated  in  the  richest  10%  of  the 
population,  while  the  figure  was  30.5%  for  the 
United  States  and  21.6%  for  Finland.  The  presi¬ 
dency  of  Ronald  Reagan  in  the  United  States 
(1981-1989)  and  the  prime  ministership  of 
Margaret  Thatcher  in  Great  Britain  (1979-1990) 
marked  the  return  of  a  neoliberal  ideology  that 
attacked  the  welfare  state,  denouncing  its  over¬ 
spending  and  inefficiencies  and  shrinking  it  in 
favor  of  privatized  and  market-like  social  services. 

The  early  20th  century  marks  the  starting  point 
for  defining  a  specific  political  science  approach  to 
redistribution,  the  use  of  systematic  scientific 
research  to  address  the  impact  of  power,  politics, 
and  institutions  on  the  redistribution  of  public 
resources  or  “who  gets  what,  when  and  how,”  as 
Harold  Lasswell  put  it  in  1936.  The  New  Deal 
implemented  in  the  United  States  by  President 
Franklin  Roosevelt  as  a  response  to  the  economic 
crisis  of  1929  represents  a  paramount  object  of 
study  for  those  interested  in  the  dynamics  and  out¬ 
comes  of  public  redistribution  of  wealth.  More 
specific  questions  on  redistribution  address  the 
allocation  of  the  annual  budget  to  different  gov¬ 
ernmental  programs;  the  redistribution  of  reve¬ 
nues  among  subnational  governments  (states  or 
provinces);  the  redistribution  of  resources  across 
demographic,  ethnic,  religious,  or  tribal  lines;  and 
the  way  in  which  different  groups  in  society  use 
their  power  to  influence  the  outcomes  of  those 
redistributions  in  their  favor.  To  describe  these 
processes  of  redistribution,  political  scientists 
have  looked  at  forms  of  governments  (Are  they 
democracies  or  authoritarian  regimes?),  constitu¬ 
tional  and  legal  frameworks  (Is  it  a  republic,  a 
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constitutional  monarchy,  or  a  popular  republic?), 
organization  of  the  territory  of  the  state  (Is  it  a  fed¬ 
eral  or  a  unitary  system?),  tax  systems  and  laws 
(Are  they  progressive  or  regressive?),  electoral  pro¬ 
visions  and  enfranchising  (Who  is  entitled  to  stand 
for  election  and  to  vote?),  groups’  strategies  of 
influence  and  lobbying  (How  permeable  is  the 
political  system  to  the  organized  pressure  of  interest 
groups?),  the  welfare  state  and  public  expenditures 
(How  much  is  spent  on  social  programs?),  the  aim 
and  scope  of  social  programs  (Are  they  universal  or 
focalized?  who  are  the  beneficiaries?),  and  so  on. 

There  are  three  major  theories  in  political  sci¬ 
ence  on  the  role  that  the  state  plays  in  redistribu¬ 
tion.  The  first  group  of  theories  derives  from  the 
neoclassic  economists  of  the  17th  and  18th  centu¬ 
ries  (e.g.,  Adam  Smith,  David  Ricardo)  and  argues 
that  free  agents  pursuing  their  self-interests  in  free 
markets  will  achieve  the  best  possible  distribution 
of  resources,  therefore  the  state  should  limit  its 
intervention  to  guarantee  national  defense,  respect 
for  private  property,  and  the  administration  of  jus¬ 
tice.  On  the  other  hand,  a  group  of  theories  in  the 
tradition  of  Thomas  Hobbes  and  Jean-Jacques 
Rousseau  argue  that  the  market  is  full  of  imperfec¬ 
tions  that  justify  broad  intervention  by  the  state  in 
the  economy,  including  the  planning,  regulating, 
and  implementing  of  large-scale  economic  plans. 
While  liberal  thinking  prevailed  in  the  18th  and 
19th  centuries,  the  20th  century  was  dominated  by 
the  broad  intervention  of  the  state  both  at  the 
domestic  and  at  the  international  levels;  after 
World  War  II,  most  European  nations  and  the 
United  States  engaged  in  state-sponsored  recovery 
plans,  centralized  planning,  rigid  regulation  and 
intervention  in  the  most  productive  economic  sec¬ 
tors,  and  comprehensive  social  programs  to  help 
the  poor — in  other  words,  the  welfare  state.  A  third 
voice  in  this  debate  was  contributed  by  the  early 
utopian  socialism  of  Henri  de  Saint-Simon  and 
Charles  Fourier  and  later  on  by  Karl  Marx  and 
Friedrich  Engels,  who  saw  the  history  of  humanity 
as  a  permanent  class  struggle  and  proposed  the 
abolition  of  private  property,  termination  of  the 
state,  and  its  replacement  with  the  commons,  in  an 
ideal  communist  society  in  which  the  guiding  prin¬ 
ciple  of  redistribution  would  be  “from  each  accord¬ 
ing  to  his  abilities,  to  each  according  to  his  need.” 

Another  set  of  theories  in  political  science  focuses 
on  competition  between  groups  to  influence 


redistributive  outcomes  in  their  favor,  to  get  a  big¬ 
ger  portion  of  the  pie.  The  iron  law  of  oligarchy,  as 
formulated  by  Robert  Michels,  affirms  that  no 
matter  how  many  laws  or  institutions  are  in  place 
to  regulate  access  to  public  decisions,  there  will 
always  be  a  small  group  of  rich  and  resourceful 
people  who  will  manage  to  access  key  decision 
points  and  bias  public  policy  outcomes  in  their 
favor.  An  alternative  interpretation,  called  power 
resources  theory,  championed  by  Gosta  Esping- 
Andersen  and  others,  suggests  that  the  level  of 
inequality  and  the  redistributive  outcomes  in  a 
given  society  are  the  result  of  the  organizational 
resources  of  the  working  class  and  its  mobilization. 
A  third  argument  on  the  accessibility  and  dynamics 
of  redistribution  is  offered  by  Philippe  Schmitter’s 
theory  of  corporatism  and  its  American  variation, 
pluralism,  initiated  in  the  writings  of  the  Founding 
Fathers  of  the  United  States  and  most  recently 
reformulated  by  Robert  A.  Dahl;  these  theories 
hold  that  the  greatest  influence  on  the  redistribu¬ 
tive  outcomes  of  the  state  are  exerted  neither  by 
the  people  nor  by  the  elites  but  by  the  organized 
activity  of  interest  groups  and  factions.  A  fourth 
proposition  is  advanced  by  the  institutionalist 
(Karl  Deutsch  and  David  Easton)  and  neo-institu- 
tionalist  (Walter  W.  Powell  and  Paul  J.  DiMaggio) 
approaches,  which  combine  rational  choice  the¬ 
ory  with  institutional  analysis  to  conclude  that 
there  is  a  dialectic  influence  between  society  and 
its  political  institutions  so  that  the  latter  mirror 
the  balances  of  power  of  the  former  but  at  the 
same  time  limit  its  freedom. 

Even  when  the  most  fertile  studies  on  redistribu¬ 
tion  combine  cultural,  economic,  and  political 
approaches,  cross-disciplinary  efforts  have  been 
scarce,  due  to  the  difficulty  of  defining  solid  indi¬ 
cators,  the  scarcity  of  reliable  and  comparable 
data,  and  the  excessive  specialization  of  the  disci¬ 
plines.  The  Gini  coefficient  and  Theil  index  of 
inequality  are  used  as  proxies  to  analyze  the  results 
of  the  redistributive  outcomes  of  the  state,  but 
more  accurate  indicators  of  redistribution  are  the 
share  of  income  and  consumption  of  the  richest  (or 
the  poorest)  deciles  of  the  population  and  indica¬ 
tors  of  the  health  (life  expectancy  and  infant  mor¬ 
tality)  and  levels  of  education  (literacy  rate  and 
school  attendance)  of  the  poorest  social  groups. 
Political  scientists  also  look  for  indicators  of  polit¬ 
ical  participation  (voting,  petitions,  participation 
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in  rallies,  and  involvement  in  advocacy  groups)  as 
proxies  to  deduce  the  capacity  of  citizens  for  influ¬ 
encing  the  state  and,  conversely,  the  share  of  public 
resources  that  they  obtain  from  it. 
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Referenda 


A  referendum  is  a  device  of  direct  democracy  by 
which  the  people  are  asked  to  vote  directly  on  an 
issue  or  policy.  It  differs  from  an  election,  which  is 
a  vote  to  elect  persons  who  will  make  decisions  on 
behalf  of  the  people,  or  a  recall,  by  which  citizens 
are  given  the  opportunity  to  remove  from  office  an 
elected  representative.  Although  this  distinction 
between  issue  voting  and  person  voting  is  appar¬ 
ently  clear,  it  may  be  questioned,  such  as  when  the 
referendum  is,  formally  or  de  facto,  a  vote  of  con¬ 
fidence  or  about  the  accession  or  permanence  in 
power  of  a  person.  This  is  often  the  case  in  author¬ 
itarian  regimes,  but  it  also  happens  in  democratic 
contexts  (e.g.,  the  use  of  referenda  by  de  Gaulle  in 
France).  Such  referenda  are  often  qualified  as 
“plebiscites,”  although  the  word,  which  goes  back 


to  ancient  Rome,  literally  means  “a  law  enacted  by 
the  common  people”  ( plebis  scitum).  Because  a 
plebiscite  is  commonly  regarded  as  highly  manipu¬ 
lative,  the  term  has  a  negative  connotation.  The 
use  of  a  plebiscite  is  sometimes  extended  to  all 
government-initiated  referenda,  especially  if  ad 
hoc,  insofar  as  they  would  automatically  trigger  a 
vote  of  confidence.  But  the  word  has  also  tradi¬ 
tionally  been  used  in  a  more  neutral  way,  to  refer 
to  popular  votes  on  sovereignty  issues  (e.g.,  the 
plebiscites  proposed  by  the  League  of  Nations 
after  World  War  I  to  settle  boundary  disputes).  The 
word  referendum  appeared  much  later,  possibly  in 
16th-century  Switzerland,  to  indicate  the  proce¬ 
dure  by  which  delegates  to  cantonal  assemblies 
submitted  certain  issues  to  their  constituents  for 
ratification  (ad  referendum). 

Typologies 

A  distinctive  feature  of  the  referendum  is  the  great 
variety  of  forms  that  it  can  take.  This  introduces 
problems  of  comparability  and  makes  general 
statements  about  the  device  difficult.  Among  the 
numerous  modalities  of  the  referendum,  the  one 
called  initiative  is  considered  to  be  the  most  impor¬ 
tant,  as  it  indicates  the  maximum  extent  to  which 
legislative  power  is  delegated  to  the  people.  Most 
typologies  of  referenda  are  indeed  based  on  this 
criterion  and  distinguish  between  mandatory  (also 
termed  compulsory  or  obligatory)  referenda,  on 
one  side,  and  optional  (or  facultative)  referenda, 
on  the  other,  with  a  fundamental  divide  within  the 
latter  category  between  referenda  initiated  by  insti¬ 
tutional  actors  such  as  the  executive,  the  legislative 
branch,  or  a  parliamentary  minority  and  popular 
initiatives.  Many  authors  use  the  word  referendum 
for  mandatory  referenda  and  referenda  initiated 
from  within  institutions,  while  votes  demanded  by 
popular  minorities  are  referred  to  as  initiatives. 
Others  use  referenda  for  votes  on  legislative  texts 
(current  or  pending  legislation),  whether  they  are 
initiated  by  institutional  actors  or  by  popular 
minorities,  and  initiatives  for  popular  initiatives 
that  are  propositional  (i.e.,  dealing  with  proposals 
for  future  legislation).  The  formal  object  of  the 
referendum  (current,  pending,  or  proposed  legisla¬ 
tion),  the  category  of  legislative  act  it  deals  with 
(e.g.,  ordinary  legislation,  constitutional  reform, 
international  treaty),  the  subject  (e.g.,  institutional, 
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territorial,  moral,  economic),  the  legal  conse¬ 
quences  of  the  vote  (consultative — also  termed 
advisory — or  binding),  and  the  territorial  level  at 
which  it  is  organized  (e.g.,  local/regional/national) 
are  other  frequent  variables  included  in  the  typolo¬ 
gies,  creating  numerous  designations  for  the  refer¬ 
endum.  Here,  we  use  the  word  referendum  in  a 
general  sense,  which  includes  all  types  of  popular 
votes  bearing  formally  on  an  issue.  This  entry  does 
not  consider  agenda  initiatives,  whereby  a  number 
of  citizens  can  submit  a  proposal  that  must  be  con¬ 
sidered  by  the  legislature  but  is  not  necessarily  put 
to  a  vote  of  the  electorate. 

History 

The  history  of  the  modern  referendum  is  generally 
associated  with  three  countries.  On  one  side  are 
Switzerland  and  the  United  States  and  on  the  other, 
France.  In  the  first  two,  the  practice  of  referendum 
has  its  roots  in  a  tradition  of  direct  democracy  by 
popular  assemblies  at  the  local  level  (the  American 
town  meetings  and  the  Swiss  cantonal  Lands- 
gemeinde),  dating  back  to  the  Middle  Ages  in  the 
case  of  Switzerland.  In  America,  the  referendum 
experience  was  initiated  with  the  submission  of 
state  constitutions  to  the  people  (the  first  case  was 
the  rejection  of  the  Constitution  of  Massachusetts 
by  the  people  in  1778)  and  the  introduction  in 
many  states  of  the  obligatory  referendum  on  con¬ 
stitutional  amendments  proposed  by  the  legisla¬ 
ture.  But  it  was  never  extended  beyond  the  state 
level,  neither  in  the  federal  constitution  nor  in 
practice.  In  Switzerland,  the  first  major  develop¬ 
ment  of  the  referendum  occurred  at  the  cantonal 
level,  under  the  impulse  of  the  democratic  Liberals 
in  the  1830s,  although  early  forms  of  referenda 
were  found  before  this  period  (as  mentioned 
above).  At  the  time,  it  appeared  as  an  acceptable 
substitute  for  the  direct  democracy  assemblies, 
which  had  become  impractical.  In  addition,  the 
examples  of  the  United  States  and  France  were  very 
influential  in  promoting  the  constitutional  referen¬ 
dum  (the  first  nationwide  referendum  had  actually 
been  held  in  Switzerland  in  1802  to  approve  the 
Napoleonic  constitution).  During  these  years,  all 
cantonal  constitutions,  with  the  exception  of 
Friburg,  were  approved  through  referenda,  and 
provisions  for  popular  initiatives,  on  constitutional 
or  legislative  matters,  were  introduced  in  many  of 


them.  The  1848  federal  constitution  was  also  sub¬ 
mitted  to  the  people  in  a  majority  of  cantons, 
which  included  the  obligatory  referendum  for 
amendments  to  the  constitution  as  well  as  the  con¬ 
stitutional  popular  initiative  for  total  revision  of 
the  constitution.  In  both  countries,  a  decisive 
extension  of  the  referendum  was  achieved  under 
the  influence  of  political  reform  movements  in  the 
second  half  of  the  19th  century:  the  Democratic 
Movement  in  Switzerland  (the  1860s)  and  the 
Progressive  Movement  in  the  United  States  (1890- 
1920).  As  a  result  of  these  movements,  provisions 
for  popular  initiatives  were  enhanced  in  the  Swiss 
federal  constitution  (1874:  the  initiative  on  laws 
within  90  days  of  their  publication;  1891:  the  con¬ 
stitutional  initiative  for  partial  revisions)  and 
introduced  in  many  American  states,  especially  in 
the  west  (more  than  80%  of  the  24  states  that  have 
the  popular  initiative  adopted  it  during  the 
Progressive  era).  In  the  two  countries,  these  move¬ 
ments  drew  support  from  the  popular  dissatisfac¬ 
tion  with  representative  democracy,  with  politi¬ 
cians  being  accused  of  corruption  and  of  fostering 
the  interests  of  only  the  richest  sections  of  the 
population.  France  has  a  different  story  as  it  had 
no  tradition  of  direct  democracy.  Nevertheless,  its 
referendum  experience  started  much  as  it  did  in  the 
United  States,  with  referenda  on  the  revolutionary 
constitutions  of  1793  and  1795,  following  the  end 
of  monarchic  rule.  Moreover,  the  1793  constitu¬ 
tion  greatly  advanced  democratic  principles  by 
introducing  universal  male  suffrage  and  a  popular 
initiative  on  laws  within  40  days  of  their  adoption. 
This  constitution  was  actually  a  great  source  of 
inspiration  for  Switzerland.  Ultimately,  however,  it 
was  never  applied,  and  the  only  form  of  referen¬ 
dum  that  found  its  way  into  France  was  the  consti¬ 
tutional  referendum.  Overall,  France  would  soon 
take  a  different  road  with  the  plebiscitary  use  of 
the  referendum  by  Napoleon  I  and  Napoleon  III, 
to  some  extent  perpetuated  by  the  presidential  use 
of  the  referendum  under  the  fifth  republic. 

Constitutional  Provisions 

As  suggested  by  existing  data  sets  on  referenda  (see 
IDEA  and  c2d),  in  2008,  only  a  small  minority  of 
countries  (about  20%  of  the  193  countries  deemed 
independent  by  Freedom  House)  had  no  provision 
at  all  for  referenda  at  the  national  or  subnational 
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level;  these  were  mostly  in  Asia  (e.g.,  China  and 
India),  the  Middle  East,  and  Central  America. 
Provisions  for  nationwide  referenda  are  much  more 
widespread  (80%  of  the  countries)  than  provisions 
for  subnational  referenda  (40%  of  the  countries). 
At  the  national  level,  mandatory  referenda  and 
optional  referenda  (the  latter  category  including  in 
our  data  set  citizens’  demands  on  existing  or  pend¬ 
ing  legislation)  are  much  more  prevalent  (54%  and 
60%  of  all  countries,  respectively)  than  citizens’ 
initiatives  for  future  legislation  (16%).  Twenty  per¬ 
cent  of  the  countries  have  popular  initiatives  (either 
citizens’  demands  or  citizens’  initiatives).  As  may  be 
expected,  more  free  countries  (by  Freedom  House 
ranking)  have  provisions  for  referenda  (only  13% 
have  no  provisions  of  any  kind  at  any  level)  than  the 
other  countries  (24%  have  no  provisions  for  refer¬ 
enda).  This  is  partly  due  to  differences  with  regard 
to  popular  initiatives:  23%  of  free  countries  provide 
for  them  compared  with  only  15%  of  not  free  and 
partially  free  countries  (at  the  national  level).  It  is 
remarkable,  however,  that  such  a  device  to  chal¬ 
lenge  political  authorities  can  be  found  in  the  con¬ 
stitutions  of  countries  such  as  the  Russian  Federation, 
the  Asiatic  republics  of  the  former  USSR,  Togo,  or 
Uganda,  although  it  is  never  used.  But  the  major 
difference  concerns  provisions  for  subnational  refer¬ 
enda,  which  are  much  more  prevalent  in  free  coun¬ 
tries  (55%),  where  they  have  tended  to  increase  in 
recent  years,  than  in  the  other  countries  (26%). 

Practice  Around  the  World 

It  is  much  more  difficult  to  assess  the  effective 
practice  of  referenda,  especially  at  the  subnational 
level,  for  which  no  exhaustive  data  set  exists. 
However,  over  the  1980  to  2008  period,  it  is  pos¬ 
sible  to  classify  countries  according  to  the  intensity 
of  their  practice  of  national  referenda.  A  prelimi¬ 
nary  observation  should  be  that  frequent  use  of  the 
referendum  is  associated  with  the  popular  initia¬ 
tive  and  its  practice  on  a  wide  range  of  issues  (not 
strictly  constitutional  or  of  special  importance).  A 
first  group  of  very  frequent  users  includes 
Switzerland  (246  referenda),  with  Italy  (60), 
Fiechtenstein  (38),  Ecuador  (33),  and  Micronesia 
(31)  far  behind.  All  these  countries  have  provisions 
for  popular  initiatives,  which  represent  the  bulk  of 
the  practice  in  Switzerland,  Italy,  and  Fichtenstein. 
The  second  group  consists  of  frequent  users,  such 


as  Ireland  (21),  with  its  practice  of  mandatory  con¬ 
stitutional  referenda,  Palau  (19),  Colombia 
(19  since  it  became  free  in  1990),  and  Fithuania 
(18  referenda  since  independence),  the  latter  two 
showing  occasional  use  of  the  popular  initiative. 
The  third  group,  consisting  of  medium-frequency 
users  (7-13  referenda),  has  16  countries,  among 
which  are  some  occasional  practitioners  of  the 
popular  initiative  (e.g.,  Bolivia,  Hungary,  Slovakia, 
Slovenia,  New  Zealand)  and  a  more  frequent  user 
(Uruguay).  Unlike  the  previous  groups,  this  set  of 
countries  also  includes  not  fully  free  countries  (6), 
such  as  Egypt,  Belarus,  and  Morocco.  Nonfree 
countries  are  more  prevalent  in  the  next  two 
groups  of  occasional  users  (23  countries  with  4-6 
referenda)  and  rare  users  (71  countries  with 
1-3  referenda),  which  include  only  a  small  minor¬ 
ity  of  free  countries  (and  only  3  countries  with 
some  practice  of  the  popular  initiative:  Venezuela, 
Fatvia,  and  Macedonia).  This  suggests  that  the 
sporadic  use  of  the  referendum  often  has  to  do 
with  the  quest  for  popular  acclamation  of  authori¬ 
tarian  policies.  It  should  be  added  that  it  is  often 
difficult  in  the  case  of  nondemocratic  countries  to 
assess  whether  a  referendum  has  been  mandatory, 
optional,  or  ad  hoc.  A  prevalence  of  authoritarian 
regimes  is  not,  however,  found  in  the  last  group, 
consisting  of  nonusers  (70  countries),  which  has 
comparable  proportions  of  free  countries  and  par¬ 
tially  free  or  not  free  countries.  Among  the  most 
prominent  nonusers  of  the  referendum  are  the 
United  States,  Germany,  China,  India,  Japan,  and 
Israel.  The  United  States  and  Germany,  however, 
have  an  intense  practice  of  referenda,  especially 
popular  initiatives,  at  the  state  level,  albeit  with 
important  differences  from  one  state  to  another. 
Regarding  subnational  referenda,  it  should  be 
noted  that  federal  countries  and  decentralized 
countries  actually  have  a  major  propensity  for 
them  (Switzerland  being  an  exemplary  case).  It  is 
also  probably  true,  at  least  in  democratic  countries, 
that  the  decrease  in  territorial  level  (from  nation  to 
region  or  from  state  to  city)  will  likely  correlate 
with  a  higher  number  of  referenda.  From  a  dynamic 
perspective,  a  general  trend  toward  an  increase  in 
the  practice  of  referenda  is  clearly  observable.  The 
number  of  nationwide  referenda  between  1980  and 
2008  (close  to  900  referenda)  is  almost  three  times 
the  number  registered  for  1950  to  1979  (362  refer¬ 
enda).  Moreover,  the  use  of  referenda  dramatically 
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increased  during  the  1980  to  2008  period.  As  a 
matter  of  fact,  in  the  postwar  period,  there  was  a 
twofold  increase  in  the  use  of  referenda,  and  in  the 
1970s  and  1990s,  it  more  than  doubled  (compared 
with  the  previous  decade).  This  does  not  mean  that 
the  referendum  has  become  more  frequent  in  every 
country  or  is  practiced  with  the  same  intensity 
everywhere.  For  example,  in  Western  Europe,  the 
increase  in  referendum  use  has  been  much  more 
marked  in  Switzerland,  Italy,  and  Ireland  than  in 
other  countries.  Nonetheless,  the  referendum  has 
become  more  prevalent  in  many  countries  where  it 
had  never  been  practiced  before  (e.g.,  Great 
Britain,  the  Netherlands),  where  it  had  not  been 
practiced  since  1940  (e.g.,  Finland,  Luxembourg, 
Norway),  and  where  it  had  not  been  practiced  in  a 
democratic  context  (e.g.,  Austria,  Greece,  Portugal, 
Spain).  Much  of  this  has  had  to  do  with  the  sub¬ 
mission  to  the  people  of  the  different  steps  of 
European  integration  or  of  new,  cross-cutting 
issues  such  as  civil,  nuclear,  or  so-called  moral 
questions.  Clearly  enough,  from  a  world  perspec¬ 
tive,  the  increase  in  referendum  use  in  the  1970s 
and  1990s  also  reflects  the  rise  in  the  number  of 
independent  countries  and  the  use  of  the  referen¬ 
dum  during  the  process  of  nation-building  in  these 
countries,  as  well  as  the  spread  of  democratic 
regimes  around  the  world. 

Empirical  Evidence:  Causes, 
Democratic  Quality,  and  Effects 

Why  referenda  are  held  and  why  their  use  has 
spread  around  the  world,  however,  are  matters  for 
which  no  general  theory  has  been  articulated  up  to 
now.  Only  partial  theories  regarding  specific  coun¬ 
tries  or  types  of  referenda  (e.g.,  government- 
initiated  referenda)  have  been  formulated.  For  sure, 
the  reasons  for  the  use  of  referenda  are  different  in 
democratic  and  in  nondemocratic  countries,  and 
they  are  also  not  the  same  in  new  and  in  old 
democracies,  where  the  recent  increase  has  more  to 
do  with  the  crisis  of  representation  and  the  rising 
expectations  of  democracy  than  with  the  legitima¬ 
tion  of  the  new  state  or  political  regime.  The 
absence  of  a  general  theory  also  characterizes  more 
traditional  concerns  of  the  literature,  like  those 
pertaining  to  the  democratic  quality  of  referenda  or 
their  policy  and  political  effects.  Thus,  the  extent  to 
which  and  the  conditions  in  which  referenda  can 


be  democratic,  that  is,  promote  policies  that  accu¬ 
rately  reflect  the  popular  will  on  the  question 
raised,  remain  problematic.  Some  authors  believe 
that  representative  democracy  would  be  better 
than  direct  democracy  at  producing  such  policies, 
or  at  least  better  than  semidirect  democracy,  to 
which,  strictly  speaking,  the  referendum  belongs, 
as  it  is  only  a  mechanism  of  popular  decision,  not 
of  collective  elaboration  of  policies  (unlike  citizens’ 
assemblies).  Because  of  this,  people  would  almost 
inevitably  approve  legislation  reflecting  the  views 
of  a  few  policymakers  rather  than  their  own  pref¬ 
erences.  Social  choice  theorists  also  point  to 
inherent  structural  problems  that  would  dispose 
referenda  to  produce  only  arbitrary  majorities, 
while  others  believe  that  such  a  limitation  is  generic 
to  all  voting  procedures  and  so  affects  representa¬ 
tive  democracy  as  much  as  direct  democracy.  From 
a  different  point  of  view,  it  is  often  argued  that 
referenda  result  in  minority  decisions  because  of 
abstention,  which  has  proved  to  be  high  when 
their  use  becomes  frequent;  because  of  manipula¬ 
tion  by  minorities;  or  simply  because  voters  answer 
to  a  different  question  from  the  one  formally 
asked,  as  typically  occurs  when  they  express  a  vote 
of  confidence  in  the  incumbents.  To  a  large  extent, 
the  capacity  of  the  referendum  to  produce  true 
majority  decisions  is  related  to  the  level  of  interest 
and  competence  of  voters  on  the  issue,  which  itself 
depends  on  the  frequency  of  referenda  (or  the 
number  of  propositions  on  the  ballot),  the  type  of 
question  (its  complexity,  saliency,  importance  to 
daily  life,  etc.),  or  the  fairness  and  quality  of  the 
campaign.  But  there  is  no  articulated  empirical 
theory  on  the  influence  of  these  variables.  Studies 
on  voting  behavior  at  referenda,  which  form  the 
bulk  of  the  empirical  works,  rarely  go  beyond  the 
assessment  of  voting  motivations  or  the  influence 
of  party  recommendations  at  specific  polls. 
Research  on  the  effects  of  referenda  is  even  less 
developed.  Little  is  known,  beyond  particular 
country  cases,  about  their  policy  effects.  Do  refer¬ 
enda  favor  the  status  quo  and  hinder  change,  as  is 
often  argued  with  reference  to  the  Swiss  experi¬ 
ence?  Do  they  have  a  structural  bias  in  favor  of  the 
“No”  votes,  for  they  would  more  easily  aggregate 
dissents  than  create  consent?  Similarly,  does  the 
referendum  harm  minorities,  and  is  it  bound  to  do 
so  because  it  “knows  nothing  about  compromise,” 
as  Max  Weber  pointed  out?  Those  who  believe  so 
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generally  also  doubt  that  referenda  can  lead  to 
more  legitimate  decisions  and  solve  conflicts. 
Rather,  referenda  are  believed  to  enhance  divi¬ 
sions.  Finally,  do  referenda  lead  to  incoherent  and 
incompetent  policies,  as  their  opponents  most 
prominently  denounce?  The  political  effects  of 
referenda  are  no  better  ascertained.  A  classical 
argument  against  them  is  that  they  weaken  repre¬ 
sentative  democracy  by  undermining  the  role  and 
responsibility  of  political  parties  and  elected  repre¬ 
sentatives  and  that  when  used  too  frequently,  they 
generate  voter  fatigue  and  low  electoral  participa¬ 
tion.  Moreover,  it  is  argued,  popular  initiatives 
would  overload  the  political  system  by  continually 
introducing  new  demands.  Supporters  of  refer¬ 
enda,  on  the  contrary,  view  them  as  usefully  com¬ 
plementing  representative  institutions,  by  making 
them  more  democratic  and  more  legitimate. 
Referenda  would  enhance  both  citizens’  participa¬ 
tion,  in  a  large  sense,  and  citizens’  education. 
Moreover,  popular  initiatives  provide  an  alterna¬ 
tive  channel  for  raising  issues  and,  as  the  example 
of  Switzerland  shows,  encourage  representatives  to 
be  more  responsive  and  more  accommodative  in 
the  preparation  of  laws,  which  would  result  in  cre¬ 
ating  a  stronger  attachment  of  the  people  to  the 
political  system.  Altogether,  the  debate  on  refer¬ 
enda  remains  largely  inconclusive,  due  to  the  lack 
of  a  truly  comparative  study  dealing  with  a  suffi¬ 
cient  number  of  cases  and  variables.  Admittedly, 
such  a  theory  is  difficult  to  formulate  because  of 
the  great  variety  of  the  forms  of  referenda  and  the 
contexts  in  which  they  are  held,  which  stands  as  an 
impediment  to  any  kind  of  generalization.  For  the 
present,  it  should  be  noted  that  the  prevailing  view 
among  political  scientists  is  not  very  favorable  to 
referenda.  This  is  explained  by  the  fact  that  they 
are  criticized  by  both  elitists,  in  the  name  of  repre¬ 
sentative  democracy,  and  “participationists,”  in  the 
name  of  pure  direct  democracy,  which  is  regarded 
as  superior  because  it  allows  the  collective  elabora¬ 
tion  of  policies  as  well  as  more  compromise  and 
deliberation,  leading  to  more  enlightened  decisions. 
Proponents  of  the  referendum  argue  that  it  should 
not  be  evaluated  against  the  ideal  of  direct  democ¬ 
racy  but  against  the  actual  performance  of  repre¬ 
sentative  democracy,  which  can  be  more  severely 
criticized  on  the  same  grounds  as  referenda.  In 
conclusion,  further  research  on  referenda  is  all  the 
more  needed  since  they  have  become  more  frequent, 


and  the  issue  is  raised  today  of  whether  they  should 
or  should  not  be  part  of  the  current  development  of 
participatory  mechanisms.  In  such  a  perspective, 
recent  efforts  by  think  tanks  and  institutes  special¬ 
izing  in  the  improvement  of  democracy  to  advance 
referendum  “best-practice”  proposals  that  incor¬ 
porate  particular  institutional  designs  and  rules  for 
their  implementation  deserve  to  be  mentioned. 
Ultimately,  as  with  elections,  the  correctness  of 
referendum  practice  is  to  a  large  extent  a  matter  of 
rules  and  of  the  democratic  quality  of  the  regime  in 
which  it  occurs. 

Laurence  Morel 
University  of  Lille  II 
Lille,  France 
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Modern  societies  are  highly  dynamic.  Unlike  static 
traditional  societies,  they  constantly  evolve,  and 


Reform  2231 


progress  is  the  principal  goal.  Consequently,  a 
central  concept  in  these  societies  is  that  of  reform, 
meaning  an  intended  change  of  institutions  or 
policies  designed  to  overcome  perceived  deficits  or 
to  achieve  particular  goals.  If  successful,  the 
change  leads  to  a  positive  development:  improve¬ 
ment  of  an  existing  situation  or  better  perfor¬ 
mance.  Reforms  can  refer  to  a  government  as  a 
whole  or  its  institutions  and  policies,  but  political 
science  is  also  interested  in  the  restructuring  of 
nongovernmental  organizations  such  as  associa¬ 
tions  or  firms.  In  all  these  settings,  modernization 
is  tantamount  to  reform. 

The  concept  of  reform  is  used  to  designate 
extraordinary  political  decisions,  as  distinguished 
from  ordinary  decisions,  by  the  degree  of  change 
involved.  Moreover,  reforms  break  with  routine. 
As  a  rule,  political  leaders  announce  a  reform  to 
gain  public  attention  and  prove  their  ability  to 
govern.  In  this  way,  they  exploit  the  positive  con¬ 
notations  of  the  term  for  political  gain.  In  view  of 
the  dynamics  of  modern  societies  and  technolo¬ 
gies,  even  conservatives  no  longer  defend  the  status 
quo  but  propose  reforms  to  avoid  negative  devel¬ 
opments.  The  ability  to  reform  has  become  an 
indication  of  effective  governance,  while  stalemate 
and  stagnation  are  commonly  identified  with 
governance  failure.  Nevertheless,  an  analytical  dis¬ 
tinction  between  reform  and  ordinary  decisions  is 
difficult.  If  blockade  is  used  as  the  antonym  for 
reform,  the  concept  is  too  broad  for  analytical 
purposes,  because  not  all  policy  change  can  be 
construed  as  reform  and  not  all  opposition  to  such 
change  can  be  construed  as  obstructionist.  For  this 
reason,  reform  should  be  defined  as  a  policy  that  is 
particularly  ambitious  with  regard  to  change,  that 
is  highly  relevant  for  the  promoters  or  affected 
actors,  and  that  attracts  special  attention,  often 
among  the  general  public  and  the  media. 

In  contrast  to  revolutionary  change,  reform  poli¬ 
cies  maintain  a  legitimized  system  of  governance. 
Institutions  or  policies  are  not  abolished  but  trans¬ 
formed  in  a  more  or  less  incremental  process.  This 
process  follows  existing  rules,  which  usually  stipu¬ 
late  the  participation  of  actors  that  have  a  stake  in 
the  reform.  As  a  consequence,  the  intended  change 
causes  conflicts,  and  every  reform  is  confronted 
with  the  resistance  of  actors  interested  in  maintain¬ 
ing  the  status  quo  or  expecting  to  lose  power.  On 
the  other  hand,  actors  who  push  for  change  may 


veto  a  reform  if  they  are  not  satisfied  with  the  out¬ 
come.  For  these  reasons,  the  challenge  of  a  reform 
is  to  find  sufficient  supporters  and  arrive  at  deci¬ 
sions  on  change,  whereas  revolutions  generate  a 
self-enforcing  dynamic  that  is  difficult  to  control. 

Reform  is  an  intended  change  effected  by  explicit 
decisions  and  actions  of  responsible  policymakers. 
Usually,  reformers  pursue  a  design  of  renewed 
institutions  or  policies;  at  least  they  have  ideas 
about  the  direction  of  change.  Thus,  a  reform,  con¬ 
cerning  the  formal  institutions,  rules,  or  proce¬ 
dures,  differs  from  changes  driven  by  ongoing 
interaction  or  interpretation  of  existing  norms  and 
practices.  However,  emergent  patterns  of  interac¬ 
tion  and  interpretation  influence  the  outcome  of 
reforms.  In  political  systems,  as  in  the  private  sec¬ 
tor,  reform  policies  are  often  guided  by  designs 
derived  from  theories  or  copied  from  those  of  other 
governments  or  organizations.  But  more  often  than 
not,  these  plans  fail  due  to  open  opposition  from 
actors  attempting  to  maintain  their  powers  or  due 
to  the  endogenous  dynamics  of  collective  action 
interfering  with  the  intentions  of  reform  policies.  It 
is  the  interplay  of  intended  and  unintended  change 
that  explains  the  outcome  of  reforms. 

Depending  on  the  specific  situation  or  the 
desired  objectives,  the  challenges  and  difficulties  of 
reforms  vary.  Comprehensive  constitutional 
reforms  rarely  meet  with  complete  success.  As  the 
theories  of  historical  institutionalism  explain, 
changes  in  complex  institutional  settings  are  “path 
dependent”;  that  is,  they  are  constrained  by  previ¬ 
ous  decisions  on  constitution  making  or  creation 
of  political  systems.  Moreover,  because  reforms 
affect  the  allocation  of  power  in  political  systems, 
they  give  rise  to  intense  conflicts  and  provoke 
strong  resistance  by  actors  who  want  to  maintain 
their  position  or  who  profit  from  the  existing  rules. 
Rules  of  constitutional  amendment  require  deci¬ 
sions  with  qualified  majorities  either  in  parlia¬ 
ments  or  through  referenda,  if  not  both.  These 
rules  favor  coalitions  of  actors  opposing  a  reform. 
According  to  institutionalist  reasoning,  we  can 
expect  “implicit  change”  of  constitutional  norms 
through  court  decisions  or  reinterpretation  of  rules 
by  governments.  Also  possible  are  limited  amend¬ 
ments  designed  to  adjust  constitutions  to  the  devel¬ 
opment  of  prevalent  values  or  power  structures  in 
a  society.  But  major  constitutional  reforms  should 
be  regarded  as  unlikely  events. 
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Reforming  an  organization  gives  rise  to  similar 
problems,  even  if  a  reorganization  can  be  planned 
and  enacted  by  political  leaders.  Administrative 
reforms,  for  example,  often  are  initiated  by  govern¬ 
ments  or  parliaments.  By  using  their  power  to 
decide  on  competences,  procedures,  personnel,  and 
the  budget  of  administrations,  they  are  able  in 
principle  to  control  reform  processes.  However, 
radical  intervention  in  existing  structures  can 
destroy  standard  operating  procedures  or  under¬ 
mine  the  motivation  of  civil  servants.  Reforms 
enforced  from  above  often  thwart  the  collective 
“spirit”  or  culture  of  an  organization,  which  is  key 
to  maintaining  its  performance.  In  this  case,  those 
affected  by  the  change  may  resort  to  subversive 
activities  or  react  by  work-to-rule.  At  the  end,  gov¬ 
ernments  may  bring  about  changes  to  formal  orga¬ 
nizations  but  may  fail  to  achieve  the  expected 
improvements. 

Reforms  related  to  policies  face  resistance  from 
affected  interest  groups,  but  mostly,  they  get  the 
support  of  groups  profiting  from  the  change. 
Whether  they  succeed  or  not  depends  on  the  com¬ 
parative  power  and  influence  of  these  groups  in  the 
political  system.  As  George  Tsebelis  has  argued, 
intended  policy  change  can  be  explained  as  a  bar¬ 
gaining  process  the  outcome  of  which  is  deter¬ 
mined  by  the  status  quo,  the  preferences  of  the 
actors  or  coalitions  involved,  and  their  veto  pow¬ 
ers.  Simply  put,  reforms  are  more  likely  to  succeed 
in  changing  an  existing  policy  the  lower  the  num¬ 
ber  of  veto  players. 

Although  institutionalist  and  actor-centered 
theories  often  explain  the  failure  of  reforms,  they 
do  not  tell  us  much  about  the  conditions  of 
change.  Empirical  research  on  organizational 
change  has  shown  that  the  powers  and  prefer¬ 
ences  of  actors  and  the  history  of  institutional 
development  play  an  important  role  in  reform 
processes,  but  no  less  important  are  the  ideas  pro¬ 
moted  by  the  advocates  of  change.  As  a  rule, 
reform  processes  start  with  discussions  leading  to 
new  perceptions  of  problems  and  new  goals  and 
proposals  for  renovating  institutions  or  policies.  If 
the  different  opinions  merge  into  a  new  “para¬ 
digm,”  or  ideas  shared  among  the  players  in  the 
politics  of  reform,  change  in  the  proposed  direc¬ 
tion  is  likely.  Moreover,  strong  “advocacy  coali¬ 
tions”  or  “change  agents”  are  critical  to  trigger 
reforms. 


The  variables  emphasized  by  different  theories 
have  impacts  on  reforms  at  different  stages  of  the 
process.  This  is  why  we  regularly  can  observe  a 
cyclical  development.  Reforms  are  triggered  by 
new  ideas,  elaborated  and  communicated  by  advo¬ 
cates  of  change.  Public  discussions  in  the  con¬ 
cerned  policy  area  cause  a  shift  in  perceptions, 
which  tend  to  exaggerate  the  problems.  Furthers, 
they  generate  concepts  for  new  designs  and  create 
positive  attitudes  for  change.  Attention  from  the 
media  and  discourses  in  science  have  strong 
impacts  at  this  stage  of  the  reform  process.  If 
responsible  reformers  exploit  the  opportunities 
provided  by  this  development  with  bold  initiatives, 
they  can  achieve  significant  formal  changes. 
However,  such  decisions  provoke  reactions  by 
those  actors  that  oppose  the  change;  and  the  more 
ambitious  the  agenda,  the  more  intense  is  the 
response.  This  stage  of  the  process  is  characterized 
by  conflicts,  bargaining,  politicking,  and  increas¬ 
ing  negative  assessments  of  the  consequences  of  a 
reform.  As  the  media  begins  losing  interest  in  the 
issue  at  stake,  reformers  revise  their  ambitions, 
reduce  their  engagement  in  reform  policies,  and 
shift  their  attention  to  other  tasks.  At  the  same 
time,  the  actors  defending  their  interests  or  posi¬ 
tions  gain  influence,  obstructive  behavior  increases 
in  organizations  or  policy  areas,  and  the  formal 
changes  are  adjusted  to  emergent  rules  and  pat¬ 
terns  of  interactions.  This  reform  cycle  can  end  in 
the  restoration  of  former  structures  and  policies, 
but  in  most  cases,  it  results  in  moderate  changes. 

In  view  of  this  up-and-down  nature  of  reform 
processes,  scholars  have  debated  on  appropriate 
strategies  of  reforms.  According  to  one  opinion, 
successful  reforms  require  strong  leadership,  clear 
goals  and  designs,  bold  strikes  shaking  up  existing 
structures  of  powers  and  routines,  and  fast  action 
in  favorable  situations.  Others  argue  that  good 
preparation  of  reforms,  a  broad  consensus,  par¬ 
ticipation  of  the  affected  actors,  initiatives  from 
inside  organizations,  and  perseverance  in  long- 
lasting  processes  are  decisive  factors.  Presumably, 
the  strategies  of  reforms  have  to  be  adjusted  to 
institutional  frameworks  and  situations.  Moreover, 
comparative  studies  have  yielded  ample  evidence 
for  the  assumption  that  majority  democracies  can 
achieve  more  change  through  policy  reform  while 
consensus  democracies  tend  toward  incremental 
adjustment.  Whether  this  finding  also  holds  for 
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institutional  or  constitutional  reform  is  still  an 
open  question.  There  is  also  debate  as  to  whether 
crisis  situations  favor  or  constrain  reforms.  In  fact 
a  severe  crisis  increases  the  need  for  change  and 
even  can  turn  into  a  “critical  juncture”  if  existing 
institutions  break  down  or  existing  policies  no 
longer  work.  But  it  also  reduces  the  leeway  for 
reform  policies  due  to  the  pressure  for  immediate 
action.  Experiences  and  comparative  studies  in 
administrative  reforms  have  revealed  that  fiscal 
crises  limit  the  opportunities  to  reform  organiza¬ 
tions.  On  the  other  hand,  an  abundance  of 
resources  reduces  the  need  for  change  and  can  lead 
to  increasing  conservatism.  Competition  between 
governments  or  private  organizations  may  stimu¬ 
late  reform  processes,  in  particular  if  policymakers 
enter  into  an  exchange  of  best  practices. 
Nevertheless,  the  transfer  of  reform  blueprints  or 
abstract  concepts  can  have  negative  consequences 
on  the  reform  process  if  the  particular  conditions 
or  cultures  of  governments  or  organizations  are 
ignored  by  promoters  of  change.  Again,  the  admin¬ 
istrative  reforms  of  the  past  decades  provide 
examples  of  such  developments. 

Regardless  of  these  debates,  in  modern  states 
and  in  complex  organizations,  reforms  evidently 
arrive  within  limits.  Hardly  any  reform  follows  its 
initial  aims  or  ideas.  However,  in  view  of  the  inter¬ 
play  between  change  of  formal  rules  and  the 
dynamics  of  emergent  interaction,  what  reformers 
intend  cannot  be  taken  as  an  adequate  standard 
for  evaluation;  rather,  it  is  the  way  they  manage 
conflicts  and  turn  initiatives  into  a  process  of  col¬ 
lective  learning  among  affected  actors.  In  the  end, 
reforms  can  be  regarded  as  successful  if  they  bal¬ 
ance  change  and  stability  and  if  the  cyclical  devel¬ 
opment  leads  not  to  restoration  of  the  status  quo 
ante  but  to  ongoing  evolution.  Every  reform  is 
embedded  in  the  dynamics  of  institutions  and 
policy  making.  It  never  determines  change. 
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Regime  (Comparative  Politics) 


Regimes  are  the  various  types  of  political  sys¬ 
tems  found  in  the  sovereign  countries  of  the 
world.  These  can  be  defined  as  specific  individ¬ 
ual  regime  types  or  as  contrasting  types.  In  com¬ 
parative  politics,  the  concept  of  a  political  regime 
refers  to  the  formal  and  informal  structure  and 
nature  of  political  power  in  a  country,  including 
the  method  of  determining  office  holders  and  the 
relations  between  the  office  holders  and  the  soci¬ 
ety  at  large.  These  relations  could  or  could  not 
involve  accountability  of  office  holders  to  the 
demos  and  likewise  could  involve  various  free¬ 
doms  granted  to  society  or,  conversely,  none  at 
all.  The  distinction  as  to  whether  a  country  is  a 
democracy  or  not  is  thus  a  regime  distinction — 
and  indeed  (as  we  shall  see)  the  central  regime 
distinction. 

The  first  use  of  the  concept  in  modern  political 
science  was  by  David  Easton.  Easton  contrasted 
the  regime  with  the  ongoing  political  community 
(the  country),  emphasizing  that  there  can  be  and 
have  been  fundamental  changes  to  a  political 
system  (producing  a  different  regime)  even  while 
the  political  community  remains  basically  con¬ 
stant.  This  entry  defines  a  regime;  discusses  the 
key  distinction  within  a  regime,  that  is,  if  it  is  a 
democracy  or  not  (or  how  much  democracy  it 
has);  and  presents  more  generally  the  cases  of 
ambiguity. 
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What  Is  a  Regime? 

Regime  is  a  midrange  concept,  falling  between  the 
concept  of  specific  governments  (of  individuals) 
and  the  concept  of  the  state  (the  broader  adminis¬ 
trative-legal  structure  that  has  sovereign  control 
over  a  country  and  the  power  to  extract  resources 
from  it).  That  is,  governments  may  vary  (e.g.,  the 
shifts  back  and  forth  between  Conservative  and 
Labour  governments  in  the  United  Kingdom),  or 
presidential  administrations  may  change  (e.g., 
from  George  W.  Bush  to  Barack  Obama  in  the 
United  States),  but  these  are  not  changes  of  regime; 
the  democratic  regimes  in  each  case  remain.  Thus, 
over  time  a  given  regime  may  involve  many  gov¬ 
ernments.  Following  from  the  above,  for  there  to 
be  a  regime  change,  the  political  “rules  of  the 
game”  must  change  in  a  systemic  way,  not  merely 
the  winner  of  the  game.  Thus,  South  Africa  in 
1994  underwent  a  regime  change  from  a  racial 
oligarchy  to  a  liberal  democracy;  since  then,  there 
have  been  several  presidents,  starting  with  Nelson 
Mandela,  but  the  regime  has  remained  the  same. 
Conversely,  a  change  of  regime  may  occur  in  a 
specific  state,  but  the  state  itself  normally  remains 
the  same  (of  course,  states  do  get  formed  and  dis¬ 
solved,  but  the  point  here  is  that  over  the  very  long 
haul,  a  stable  state  is  likely  to  experience  multiple 
regime  types).  Key  state  institutions  and  actors  (the 
bureaucracy,  the  judiciary,  the  military,  etc.)  thus 
tend  to  carry  over  from  one  regime  to  another,  for 
example,  from  Imperial  Germany  to  the  Weimar 
Republic — in  this  example  posing  problems  for  the 
latter.  Sometimes,  though,  a  regime  change  will 
involve  the  creation  of  entirely  new  state  struc¬ 
tures,  such  as  with  the  creation  of  the  Soviet  Union 
(the  new  communist  bureaucracy,  the  Red  Army 
instead  of  the  Imperial  Army,  etc.).  Finally,  one 
must  distinguish  between  public  policies  that  speak 
to  the  nature  of  the  regime  (such  as  the  extent  of 
civil  liberties)  and  more  general  socioeconomic 
policies,  such  as  economic  policy,  health  care,  or 
same-sex  marriage — the  latter  types  of  policies  can 
change  (and  may  well  change  with  different  gov¬ 
ernments)  but  do  not  involve  regime  change. 

It  should  be  noted  that  the  word  regime  is  some¬ 
times  used  as  the  equivalent  of  “system,”  for 
example,  a  parliamentary  regime  versus  a  presi¬ 
dential  regime.  However,  this  is  too  broad  a  use  of 
the  concept  of  regime.  Thus,  if  a  democracy 
changes  from,  say,  a  parliamentary  system  to  a 


presidential  system,  as  Gambia  did  in  1970,  it  is 
not  a  regime  change.  A  similar  point  can  be  made 
about  unicameral  versus  bicameral  systems  or  uni¬ 
tary  versus  federal  systems. 

The  Key  Regime  Distinction:  Democracy 
or  Not  (or  How  Much  of  It)? 

There  are  many  different  specific  political  regimes 
in  the  world,  but  the  initial  assessment  of  a  regime 
normally  begins  with  whether  or  not  it  is  demo¬ 
cratic.  In  the  contemporary  world,  a  democratic 
regime  has  three  basic  aspects:  competitive  elec¬ 
tions,  more  or  less  universal  suffrage,  and  a 
responsible  government — that  is,  those  elected  in 
competitive  elections  both  are  accountable  to  the 
people  (the  voters)  and  have  the  actual  power  to 
govern,  as  opposed  to  effective  power  being  in  the 
hands  of  a  monarch,  the  military,  or  a  clerical  fig¬ 
ure.  (Implicit  in  this  definition  of  democracy  is 
some  sort  of  functioning  state.)  The  opposite  real¬ 
ity  here  is  that  of  an  autocratic  regime.  That  said, 
since  there  are  key  differences  among  autocratic 
regimes,  these  are  normally  identified  more  pre¬ 
cisely,  as  specific  regime  types.  The  main  types  of 
autocratic  regimes  are  totalitarian  regimes  (involv¬ 
ing  a  ruling  ideology,  mass  mobilization,  and  total 
state  control),  such  as  Nazi  Germany;  posttotali- 
tarian  regimes  (involving  less  emphasis  on  ideol¬ 
ogy  and  somewhat  greater  freedoms),  such  as  the 
Soviet  Union  under  Mikhail  Gorbachev;  sultanistic 
regimes  (involving  the  personal  rule  of  an  individ¬ 
ual  and  his  or  her  family),  such  as  Iraq  under 
Saddam  Hussein;  traditional  authoritarian  regimes 
(involving  rule  by  a  monarch),  such  as  Imperial 
Russia;  military  authoritarian  regimes,  such  as  the 
one  ruling  Myanmar;  theocratic  authoritarian 
regimes,  such  as  the  one  in  Iran  (at  least  through 
2009);  and  electoral  authoritarian  regimes  (more 
on  this  type  below).  There  are  some  autocracies, 
however,  that  straddle  these  specific  regime  types; 
for  example,  Cuba  and  North  Korea  have  elements 
of  both  totalitarianism  and  sultanism. 

Viewed  as  a  simple  dichotomy  between  a  demo¬ 
cratic  regime  and  an  autocratic  regime,  a  clear 
majority  of  the  194  or  so  countries  in  the  world 
today  have  democratic  regimes.  However,  a  stan¬ 
dard  further  distinction  is  made  between  electoral 
democratic  regimes  and  liberal  democratic  regimes. 
Electoral  democracies  have  the  three  basic  features 
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given  above.  Liberal  democracies  have  these  as 
well  (and  thus  all  liberal  democracies  are  also  elec¬ 
toral  democracies),  but  they  additionally  have  the 
liberal  features  of  broad  civil  liberties,  a  strong  rule 
of  law  (e.g.,  an  independent  judiciary),  horizontal 
accountability  of  elected  officials  to  other  state 
actors  (e.g.,  ombudsmen  and  state  auditors),  and  a 
well-functioning  state  with  limited  if  not  minimum 
corruption.  There  are,  thus,  many  democratic 
regimes  that  are  electoral  but  not  liberal.  This  is 
especially  true  in  Africa  and  Latin  America.  The 
fact  that  the  liberal  democratic  regimes  of  the 
world  are  overwhelmingly  found  in  Europe, 
European  settler  societies,  and  the  former  Anglo- 
American  colonies  has  led  to  the  criticism  that  this 
is  a  Western  rather  than  a  universal  concept. 

By  placing  liberal  democracies  ahead  of  elec¬ 
toral  democracies  (which  in  turn  are  ahead  of 
autocracies),  the  dichotomy  between  regimes  has 
been  transformed  into  a  continuum.  Indeed,  most 
scholarly  and  related  analyses  use  some  sort  of 
continuum  of  regimes.  In  terms  of  global  assess¬ 
ments,  Freedom  House  (an  international  nongov¬ 
ernmental  organization  that  conducts  research  and 
advocacy  on  democracy,  human  freedom,  and 
human  rights)  distinguishes  among  free,  partly 
free,  and  not  free  regimes,  based  ultimately  on 
7-point  scales  for  each  of  political  rights  and  civil 
liberties.  (Freedom  House  also  has  a  separate 
dichotomous  list  of  electoral  democracies.)  The 
Bertelsmann  Transformation  Index  uses  a  10-point 
scale  that  ultimately  produces  a  continuum  of  five 
categories:  democracies,  defective  democracies, 
highly  defective  democracies,  moderate  autocra¬ 
cies,  and  autocracies.  However,  it  classifies  failed 
states  separately  (within  the  broader  autocratic 
area). 

Continua  can  also  be  used  for  the  submeasures 
of  democracy.  There  are  various  measures  of  the 
rule  of  law,  including  that  of  the  World  Bank. 
Transparency  International,  a  nongovernmental 
organization,  publishes  a  Corruptions  Perception 
Index.  Yet  these  measures  do  not  claim  to  make  a 
clear  distinction  among  regimes,  perhaps  because 
they  deal  with  aspects  of  liberalism  rather  than 
democracy.  Regime  distinctions  can  be  made  based 
on  the  extent  of  the  right  to  vote;  this  would  lead 
to  pre-World  War  II  Belgium  and  France,  for 
example,  being  classified  as  male  democracies  due 
to  the  absence  of  female  suffrage.  Of  course,  in  the 


contemporary  world,  the  vast  majority  of  coun¬ 
tries  have  more  or  less  universal  suffrage  for  both 
genders,  so  suffrage  is  now  a  limited  measure  for 
defining  regimes.  In  terms  of  responsible  govern¬ 
ment,  scholars  have  produced  measures  of  the 
extent  of  monarchical  control,  and  especially  of 
military  control.  There  are  potential  gray  areas 
here,  especially  with  situations  of  monarchical  or 
military  tutelage;  normally  these  are  seen  as  being 
democratic  regimes,  but  just  so. 

Without  a  doubt,  however,  the  one  submeasure 
of  democracy  where  there  are  specific  continua 
leading  to  regime  classification  is  the  extent  of  free 
and  fair  elections.  By  “free  and  fair  elections,”  one 
means  many  things:  the  ability  of  most,  if  not  all, 
people  and  parties  to  run  for  election;  access  of 
candidates  to  voters;  equal  treatment  of  candidates 
and  parties  by  state  actors  such  as  the  police  and 
courts;  media  pluralism  so  that  opposition  candi¬ 
dates  have  media  access  (as  opposed  to  the  media 
being  controlled  by  government);  secret  ballot;  no 
coercion  of  voters;  neutral  and  transparent  ballot 
counting  with  independent  monitors  allowed;  no 
major  or  “mysterious”  delays  in  reporting  the  full 
results;  and  impartial  resolution  of  disputes.  In  a 
democratic  regime,  these  factors  exist  sufficiently 
to  allow  a  real  opportunity  for  the  government  to 
be  defeated;  in  contrast,  in  an  autocratic  regime, 
the  government  will  tilt  the  playing  field  so  that  it 
has  an  insurmountable  advantage  and/or  will  sim¬ 
ply  fabricate  the  results  (as  presumably  happened 
in  Iran  in  2009).  Yet  the  dividing  lines  here  are  not 
always  clear  and  decisive.  Certainly  if  a  govern¬ 
ment  is  defeated  and  hands  over  power,  then  one 
assumes  that  the  elections  were  fair  and  the  regime 
is  democratic.  However,  if  a  government  or  presi¬ 
dent  is  reelected,  does  that  reflect  unfairness  or 
simply  the  voters’  desire  to  reelect  them?  One 
point  here  is  whether  the  opposition  cries  foul  or 
accepts  the  results  as  legitimate.  However,  in  some 
cases,  the  opposition  may  not  bother  to  cry  foul  if 
it  sees  no  point  in  protesting. 

Even  if  the  opposition  is  not  allowed  to  win  by 
the  powers  that  be,  they  may  still  have  some  elec¬ 
toral  success.  At  a  minimum,  they  are  allowed  to 
exist.  This  gave  rise  to  the  concept  of  an  electoral 
authoritarian  regime,  which  conceptually  is  a  rela¬ 
tively  new  regime  type.  In  such  a  regime,  multi¬ 
party/multicandidate  elections  are  held  but  with 
no  chance  of  the  incumbents  losing  power  due  to 
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the  lack  of  fairness  and  perhaps  freedom  as  well. 
Yet  it  is  the  very  reality  of  election  “victory”  that 
is  used  to  give  legitimacy  to  the  regime,  if  not 
indeed  to  claim  it  as  a  democracy.  Singapore  is  a 
classic  case  here.  These  electoral  authoritarian 
regimes  can  be  contrasted  with  fully  closed  regimes, 
where  no  opposition  is  allowed  to  exist  (e.g., 
totalitarian  and  sultanistic  regimes  or  certain  tradi¬ 
tional  authoritarian  regimes  such  as  Saudi  Arabia). 
On  the  other  hand,  in  some  autocratic  regimes 
elections  are  not  just  a  rubber  stamp,  and  election 
outcomes  are  of  concern  to  the  government. 
Opposition  forces  may  win  a  significant  minority 
of  the  seats  or  even  some  control  of  regional/local 
governments.  Even  if  the  playing  field  is  not  level 
enough,  or  fair  enough,  for  the  government  to  be 
defeated  overall,  it  can  still  experience  a  worse  or 
better  result  from  election  to  election.  Malaysia  for 
decades  (through  the  time  of  writing)  fits  this  situ¬ 
ation.  Thus,  scholars  have  developed  the  concept 
of  a  competitive  authoritarian  regime  that, 
although  clearly  authoritarian  as  opposed  to  dem¬ 
ocratic,  allows  much  more  competition  and  plural¬ 
ism  (including  media  freedom)  than  (hegemonic) 
electoral  authoritarian  regimes,  not  to  mention 
fully  closed  regimes.  The  concept  of  a  competitive 
authoritarian  regime  thus  overlaps  with  the 
Bertelsmann  Transformation  Index’s  notion  of  a 
moderate  autocracy.  It  is  thus  possible  for  such 
competitive  authoritarian  regimes  to  become  com¬ 
petitive  enough  to  produce  a  transition  to  democ¬ 
racy,  as  happened  to  Ukraine  when  the  amount  of 
vote  rigging  needed  to  produce  victory  for  the 
regime’s  presidential  candidate  in  2004  led  to  the 
Orange  Revolution  and  the  ushering  in  of  a  demo¬ 
cratic  regime. 

These  distinctions  among  democracies,  com¬ 
petitive  authoritarian  regimes,  hegemonic  electoral 
authoritarian  regimes,  and  fully  closed  regimes 
essentially  or  ultimately  reflect  the  electoral  success 
of  the  opposition.  To  repeat,  in  democracies  the 
opposition  has  a  realistic  chance  of  defeating  the 
government  (and  if  this  does  not  happen  in  an 
individual  election,  it  is  the  result  of  voter  prefer¬ 
ences  and/or  poor  opposition  strategies  rather  than 
an  unfair  playing  field),  in  competitive  authoritar¬ 
ian  regimes  the  opposition  is  able  to  win  a  reason¬ 
ably  large  number  of  seats  (and  thus  also  hold  the 
government  to  some  account  in  the  legislature), 
and  in  hegemonic  electoral  authoritarian  regimes 


the  opposition  wins  only  a  tiny  and  largely  irrele¬ 
vant  number  of  seats  (but  it  still  exists,  unlike  in 
fully  closed  regimes).  These  differing  regime  clas¬ 
sifications  thus  draw  heavily  on  election  outcomes, 
but  the  general  level  of  freedom  in  a  country  (such 
as  its  Freedom  House  score)  usually  is  related  rea¬ 
sonably  well  to  this  specific  indicator  of  election 
outcomes.  There  are  also  liberal  autocratic  regimes 
where  the  civil  rights  are  much  greater  than  the 
effective  political  freedoms,  such  as  Monaco  and 
Tonga;  that  said,  in  these  countries  the  autocracy 
stems  from  a  lack  of  responsible  government  (and 
thus  “relevant”  elections)  rather  than  unfair  elec¬ 
tions  per  se. 

All  of  this  also  raises  the  question  of  whether 
empirically  there  is  a  maximum  level  of  support 
for  a  government/incumbent  beyond  which  the 
result  is  not  credible  and  the  regime  cannot  be 
considered  a  democracy.  Usually,  something  like 
70%  of  the  vote  is  suggested  here.  Yet  in  countries 
broadly  considered  to  be  democratic,  such  as 
Botswana  and  Namibia,  incumbent  parties  have 
won  over  70%  of  the  vote  in  elections  (however,  in 
South  Africa  the  African  National  Congress  has 
never  reached  this  level  of  support,  its  best  result 
being  69.7%  in  2004).  In  part,  this  may  reflect  the 
exceptional  support  given  to  parties  that  led  the 
nation’s  struggle  for  independence.  Certainly  in 
Western  democracies,  parties  (or  presidential  can¬ 
didates)  competing  in  national  elections  never  get 
much  more  than  60%  of  the  vote,  even  in  the  most 
successful  cases.  Indeed,  presidential  elections  in 
the  United  States  are  interesting  in  this  regard, 
since  there  does  appear  to  be  an  effective  “ceiling” 
just  above  60%  of  the  vote.  In  descending  order, 
the  largest  vote  shares  in  such  elections  have  been 
61.1%  (Lyndon  Johnson  in  1964),  60.8%  (Franklin 
Roosevelt  in  1936),  60.7%  (Richard  Nixon  in 
1972),  and  60.3%  (Warren  Harding  in  1920) — all 
very  close  values.  Outside  the  United  States,  one 
can  note  the  vote  share  of  64.7%  that  Jose  Figueres 
got  in  Costa  Rica  in  1953 — an  undisputed  out¬ 
come  in  a  democratic  regime.  Consequently,  it 
seems  clear  that  a  result  of  90%  or  80%  cannot  be 
consistent  with  democracy.  That  said,  many  dubi- 
ous/nondemocratic  outcomes  are  based  on  much 
less  lopsided  “official”  results,  such  as  63%  for 
President  Ahmadinejad  in  Iran  in  2009,  51% 
for  President  Salinas  in  Mexico  in  1988,  and  47% 
for  President  Kibaki  in  Kenya  in  2007 — all  of 
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which  were  nevertheless  (at  best)  competitive 
authoritarian  regimes. 

Ambiguous  and  Hybrid  Regimes 

Regardless  of  how  many  categories  there  are  in  a 
continuum  of  regime  types,  one  needs  to  ask 
whether  this  continuum  could  be  collapsed  into 
two  categories — democracies  and  autocracies.  In 
other  words,  is  every  country  ultimately  consid¬ 
ered  to  be  a  democracy  or  an  autocracy,  or  is 
there  some  unclear  middle  ground?  Scholarly  and 
related  research  differs  as  to  whether  or  not  there 
is  ever  any  such  unclear  middle  ground.  There  are 
two  rather  different  problems  here.  First  of  all,  a 
country  could  have  a  fluid  or  unstable  political 
system,  making  it  hard  to  pin  it  down.  In  this 
case,  one  option  is  the  concept  of  a  transitional 
regime.  Conversely,  the  previous  regime  could  be 
deemed  to  remain  in  place  (and  the  country  be 
classified  as  such)  until  there  is  a  clear  shift  to  a 
new  regime.  Second,  there  could  be  genuine 
scholarly  disagreement  about  the  political  reality 
in  a  country,  with  independent  analysts  having 
differing  opinions  as  to,  say,  the  level  of  competi¬ 
tiveness.  This  divergence  is  most  common  where 
the  government/incumbent  is  reelected,  and  seems 
to  be  genuinely  popular,  but  where  there  are  (seri¬ 
ous)  procedural  issues — for  example,  Russia 
under  Vladimir  Putin  or  Venezuela  under  Hugo 
Chavez.  Such  polities  could  be  considered  ambig¬ 
uous  regimes,  at  least  until  there  is  a  reasonable 
consensus  on  their  regime  classification  (which 
incidentally  tends  to  be  that  they  are  authoritar¬ 
ian,  leading  to  a  related  debate  as  to  the  point  at 
which  democracy  broke  down).  A  variant  of  this 
dilemma  is  where  a  country  just  places  into  a  cat¬ 
egory.  For  example,  the  Bertelsmann  Transformation 
Index  refers  to  highly  defective  democracies  as 
barely  meeting  its  minimum  criteria  for  democ¬ 
racy.  It  also  acknowledges  a  gray  area  between 
democracy  and  autocracy. 

These  situations  all  involve  ambiguity  in  classifi¬ 
cation.  The  resulting  notions  of  ambiguous  regimes 
or  transitional  regimes  are  meant  not  to  be  precise 
categories  but  rather  to  reflect  the  inability  to  be 
precise.  Conversely,  the  notion  of  a  hybrid  regime 
involves  combined  aspects  of  democracy  and  autoc¬ 
racy  in  a  fairly  stable,  ongoing  way.  This  concept  is 
more  likely  to  be  used  if  more  aspects  are  included 


in  the  assessment.  Narrow(er)  assessments  that 
focus  on  just  competition  and  responsible  govern¬ 
ment  are  likely  to  place  every  country  on  one  side 
or  the  other  of  the  divide — that  is,  as  either  a 
democratic  or  an  autocratic  regime.  However,  if 
there  are  multiple  criteria — and  if  these  are  weighted 
equally — then  the  reality  arises  of  countries  that  are 
democratic  in  some  aspects  and  autocratic  in  others 
or  that  average  out  as  being  in  the  middle.  This  is 
the  reality  of,  for  example,  the  Economist 
Intelligence  Unit’s  Index  of  Democracy,  which  has 
five  aspects  including  political  participation  and 
political  culture — indeed,  what  this  really  seeks  to 
measure  is  consolidated  democracy.  Consequently, 
a  substantial  number  of  countries  fall  into  its 
hybrid  regime  category,  defined  as  having  a  score 
from  4.0  to  5.9  out  of  10.  Decomposing  their  com¬ 
ponents  would  categorize  the  vast  majority  of  these 
countries  as  either  (unconsolidated)  electoral 
democracies  or  competitive  authoritarian  regimes, 
with  a  few  countries  scoring  as  they  do  because  of 
a  lack  of  sufficient  stateness. 
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Regional  Integration 
(Supranational) 

When  examining  the  macrostructure  of  the  global 
system,  three  observations  appear  highly  salient. 
Politically,  the  world  is  organized  into  200  or  so 
sovereign  nation-states,  each  an  aspiring  zone  of 
exclusive  authority.  Economically,  there  is  a  single 
though  differentiated  world  economy,  especially 
since  the  decline  of  the  global  socialist  bloc.  What 
lies  in  between  these  states  and  the  world  econ¬ 
omy  are  numerous  international  institutions, 
many  of  them  organized  on  a  regional  basis. 
Regional  integration  refers  to  the  process  of 
increasing  political  and  economic  cooperation 
among  states  in  close  geographic  proximity  to 
each  other.  The  growth  in  the  number  of  regional 
organizations  is  dramatic.  While  numerous 
regional  organizations  were  reported  to  the 
General  Agreement  on  Tariffs  and  Trade  (GATT) 
in  the  4 Vi  decades  after  World  War  II,  an  even 
more  dramatic  growth  spurt  has  taken  place  start¬ 
ing  in  1992.  While  the  number  of  new  regional 
organizations  leveled  off,  that  level  remained  very 
high  in  comparison  with  all  years  prior  to  1992. 

This  worldwide  trend  toward  regional  integra¬ 
tion  presents  two  important  puzzles.  First,  scholars 
debate  the  reasons  for  this  tendency  toward  region¬ 
alism  in  an  era  of  globalization.  Economic  theory 
pushes  in  the  direction  of  larger  units,  while  the 


demand  for  sovereignty  and  identity  suggest  smaller 
units.  Second,  why  does  one  experiment  in  regional 
integration,  the  European  Union  (EU),  stand  out, 
distinguished  not  so  much  by  the  level  of  economic 
integration  as  by  the  breadth  and  depth  of  its 
political  institutions?  The  distancing  of  Europe 
from  other  regional  organizations  has  had  the 
unfortunate  consequence  of  compartmentalizing 
scholarship.  Today,  comparative  studies  of  regional 
integration  with  the  EU  as  one  case  are  rare. 

The  New  Regionalism  in  a 
Comparative  Perspective 

The  New  Regionalism 

Regional  organizations  and  agreements  are  pro¬ 
liferating  all  over  the  world,  and  more  than  50% 
of  international  trade  is  now  conducted  within 
preferential  trading  agreements  organized  at  the 
regional  level.  Regional  integration  has  proceeded 
in  waves,  the  most  recent  of  which  began  in  the 
wake  of  the  Cold  War  in  the  early  1990s.  The  EU’s 
single  market  was  completed  in  1992,  and  the 
treaty  at  Maastricht  that  year  set  the  timetable  for 
achieving  economic  and  monetary  union  by  1999. 
The  North  American  Free  Trade  Agreement 
(NAFTA)  was  signed  by  Canada,  Mexico,  and  the 
United  States  in  1992.  The  Asia-Pacific  Economic 
Cooperation  (APEC)  was  established  in  1989  with 
the  goal  of  becoming  a  free  trade  area  (a  goal  not 
yet  realized),  and  the  Association  of  South  East 
Asian  Nations  (ASEAN),  though  established  in 
1967,  created  a  free  trade  area  in  1992.  It 
expanded  its  membership  and  its  integration  activ¬ 
ities  in  the  1990s,  and  since  the  East  Asian  finan¬ 
cial  crisis  of  1997.  The  three  regional  powers — 
China,  Japan,  and  South  Korea — have  been 
included  in  regular  ASEAN  summit  meetings. 

These  three  sites  of  economic  regionalism — 
Europe,  North  America,  and  East  Asia — are  home 
to  the  majority  of  the  world’s  intraregional  trade 
and  the  most  robust  regional  institutions,  but  steps 
toward  regionalization  have  been  taken  elsewhere 
as  well.  There  are  regional  organizations  among 
states  in  the  Middle  East  (Gulf  Cooperation 
Council),  Africa  (South  African  Customs  Union, 
West  African  Economic  and  Monetary  Union),  and 
South  America  (Southern  Common  Market  [Mer¬ 
cado  Comun  del  Sur,  MERCOSUR],  Andean 
Community),  and  there  are  free  trade  agreements  in 
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regions  all  over  the  world.  As  of  July  2010,  more 
than  474  regional  trading  agreements  had  been 
reported  to  the  World  Trade  Organization  (WTO). 
Of  the  agreements  already  in  force,  over  90%  are 
free  trade  agreements.  Almost  every  WTO  member 
is  party  to  at  least  one  regional  trade  agreement, 
and  these  agreements  often  intersect  in  complex 
ways.  At  the  same  time,  the  depth  and  scope  of 
regional  integration  are  highly  diverse,  with  the  EU 
representing  a  level  of  both  economic  and  political 
integration  not  seen  elsewhere  on  the  globe,  while 
many  regional  agreements  are  little  more  than  sym¬ 
bolic  acts  that  have  not  generated  any  meaningful 
regional  economic  or  political  cooperation. 

Given  the  institutions  already  in  place  at  the 
global  level  to  facilitate  multilateral  trade,  and  the 
widespread  membership  of  states  in  these  institu¬ 
tions,  the  “new  regionalism”  of  the  early  1990s  is 
an  interesting  puzzle,  for  which  scholars  have  pro¬ 
vided  several  explanations.  At  the  international 
level,  some  scholars  point  to  the  end  of  the  Cold 
War  as  a  catalyst  for  the  sudden  rise  in  the  number 
of  regional  agreements  signed  by  states  in  the  early 
1990s.  According  to  these  arguments,  the  move 
from  a  bipolar  world  characterized  by  stark  politi¬ 
cal  divisions  to  a  multipolar  world  characterized  by 
increasing  global  economic  competition  removed 
many  of  the  constraints  that  had  once  operated  on 
states’  foreign  policies,  both  politically  and  eco¬ 
nomically.  The  end  of  the  bipolar  division  restored 
regional  sovereignty  and  allowed  regional  powers 
to  develop.  Asia,  for  example,  had  been  geographi¬ 
cally  split  along  ideological  lines,  and  with  the  end 
of  the  Cold  War,  countries  such  as  China  and 
Vietnam  have  been  gradually  integrated  into 
regional  institutions.  The  prospect  of  a  more  multi¬ 
polar  world  and  fears  of  “Fortress  Europe”  encour¬ 
aged  states  to  organize  regionally  to  counter  the 
growing  economic  weight  of  the  EE!  and  NAFTA. 

Another  line  of  argument  focuses  on  the  role  of 
hegemony.  Some  theorize  that  the  primary  role  of 
a  global  hegemon  is  to  provide  public  goods;  in  the 
post-hegemonic  international  system,  states  began 
turning  to  regional  organizations  to  overcome 
collective-action  problems.  As  a  hegemon,  the 
Elnited  States  influenced  economic  relations  all 
over  the  globe,  and  it  traditionally  encouraged 
bilateral  trade  relationships  in  Asia,  whereas  in 
Europe,  it  supported  multilateral  integration 
efforts.  The  relative  decline  of  El.S.  hegemony  in 


certain  regions  since  World  War  II  has  afforded 
these  regions  more  freedom  to  pursue  their  own 
integration  projects.  For  example,  in  the  early 
1990s  Japan  overtook  the  United  States  as  the 
leading  provider  of  foreign  aid  in  Asia,  and  it 
began  to  challenge  U.S.  dominance  in  trade  and 
foreign  direct  investment  within  the  region.  With 
the  strong  economic  linkages  formed  during  this 
time  between  Japan  and  the  newly  industrializing 
economies  in  Asia,  the  door  was  opened  for  the 
development  of  stronger  regional  cooperation. 

The  third  reason  why  states  may  have  been 
motivated  to  pursue  regionalism  was  difficulties 
reaching  agreement  in  multilateral  trade  negotia¬ 
tions  in  the  early  1990s.  The  GATT  Uruguay 
Round  is  the  most  comprehensive  trade  agreement 
negotiated  to  date,  and  it  was  extremely  frustrating 
for  the  participants,  taking  8  years  to  complete  and 
leaving  many  developing  states  unsatisfied.  During 
this  same  time  period  (1986-1994),  NAFTA, 
MERCOSUR,  APEC,  and  the  EU’s  single  market 
were  initiated  and  completed  in  less  time  than  it 
took  to  complete  the  Uruguay  Round.  These  agree¬ 
ments  were  not  only  achieved  more  quickly,  but 
also  they  provided  solutions  to  issues  that  had 
come  to  a  standstill  in  the  GATT  negotiations. 

Comparative  Regional  Groupings 

While  regionalization  is  a  process  taking  place 
all  over  the  globe,  there  is  considerable  variation  in 
the  institutional  design  of  regional  agreements,  in 
the  depth  of  integration  they  achieve,  and  in  the 
success  with  which  they  meet  their  stated  objec¬ 
tives.  The  EU  is  exceptional  for  its  high  level  of 
both  economic  and  political  institutionalization.  In 
North  America,  the  three  parties  to  NAFTA  suc¬ 
ceeded  in  creating  a  free  trade  area  that  is  wide- 
ranging  in  its  trade  liberalization  agenda;  deep  in  its 
level  of  institutionalization,  especially  with  regard 
to  dispute  resolution  mechanisms;  and  prominent 
as  a  focal  point  for  other  subregional  agreements 
that  have  sprung  up  among  Latin  American  coun¬ 
tries.  MERCOSUR  is  the  largest  South  American 
trading  bloc,  but  it  remains  largely  intergovernmen¬ 
tal  in  nature,  with  weak  implementation  of  basic 
integration  measures  and  a  continued  preference 
for  bilateralism  among  some  of  its  members.  There 
are  many  subregional  trading  agreements  in  place 
among  African  states,  although  the  low  level  of 
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economic  development  and  the  absence  of  large 
cost  differences  in  tradable  goods  across  much  of 
the  continent  make  successful  regional  economic 
cooperation  exceedingly  difficult. 

East  Asian  Regionalism 

One  of  the  world’s  most  important  economic 
regions,  East  Asia,  has  been  a  relative  newcomer  to 
the  process  of  regionalization.  Until  very  recently, 
scholars  commonly  described  this  region  as  “under¬ 
institutionalized”  when  compared  with  the  other 
two  regional  economic  heavyweights,  North 
America  and  Europe.  While  the  EU  is  known  for  its 
robust  multilateral  and  supranational  institutions, 
in  East  Asia  cooperation  has  taken  a  more  bilateral 
and  intergovernmental  form  in  both  economic  and 
security  areas.  ASEAN  has  played  an  important 
role  in  relations  among  Southeast  Asian  countries 
since  its  founding  in  1967,  but  it  wasn’t  until  the 
late  1990s  that  ASEAN  expanded  to  meet  regularly 
with  China,  Japan,  and  South  Korea,  the  three 
regional  powers.  The  other  prominent  regional 
organization  in  Asia,  APEC,  has  21  member  states 
from  both  sides  of  the  Pacific  Ocean,  but  its  efforts 
at  trade  liberalization  have  stalled  in  recent  years 
and  left  it  largely  on  the  sidelines.  In  addition, 
Asian  countries  have  been  notably  reluctant  to  cre¬ 
ate  supranational  political  institutions  at  the 
regional  level.  Instead,  cooperation  is  largely  lim¬ 
ited  to  the  economic  realm,  and  it  is  informal,  often 
led  by  transnational  business  networks.  This  infor¬ 
mal,  business-led  form  of  economic  cooperation  in 
Asia  has  come  to  be  known  as  the  “ASEAN  Way.” 

Some  argue  that  the  diversity  among  East  Asian 
states  in  their  levels  of  economic  development  and 
government  capacity  to  implement  integration 
measures  have  hindered  meaningful  cooperation 
efforts  at  the  regional  level.  Still  others  contend 
that  the  delayed  regionalization  in  East  Asia 
reflects  the  traditional  U.S.  preference  for  bilateral 
cooperation  in  this  region.  While  the  United  States 
supported  multilateral  integration  in  Europe  in  the 
postwar  period,  it  pushed  the  development  of 
bilateralism  in  Asia,  partially  as  a  way  to  maintain 
its  hegemony  in  the  region.  In  the  post-Cold  War 
period,  the  continued  political  heterogeneity  across 
East  Asia  has  made  integration  difficult.  In  addi¬ 
tion,  Japan,  a  potential  regional  leader  in  the  early 
1990s,  prioritized  its  relationship  with  the  United 


States  over  the  pursuit  of  a  multilateral  integration 
agenda  with  its  neighbors.  Bowing  to  U.S.  pres¬ 
sure,  Japan  neither  led  integration  efforts  nor  lent 
its  full  support  to  proposals  by  other  Asian  states 
for  greater  integration  in  East  Asia. 

The  situation  in  East  Asia  dramatically  changed 
after  the  crippling  economic  crisis  in  1997.  The 
crisis  made  it  clear  to  Asian  political  leaders  just 
how  vulnerable  their  economies  were  to  the  flight 
of  global  capital.  Further,  there  was  deep  disap¬ 
pointment  with  the  response  of  the  United  States 
and  U.S. -led  international  financial  institutions, 
such  as  the  International  Monetary  Fund.  The 
United  States  continued  to  promote  neoliberal 
reforms  in  the  wake  of  the  crisis,  despite  the  long¬ 
standing  resistance  to  these  reforms  from  East 
Asian  governments,  many  of  whom  believed  that 
such  reforms  had  been  a  cause  of  the  crisis.  Japan’s 
proposal  for  an  Asian  Monetary  Fund  that  would 
help  maintain  regional  stability  in  the  future  was 
opposed  by  the  United  States,  a  move  widely 
viewed  in  East  Asia  as  a  U.S.  attempt  to  maintain 
its  influence  in  the  region,  despite  the  cost  to  stabil¬ 
ity.  Since  1997,  the  focus  has  shifted  away  from 
integration  at  the  Southeast  Asian  or  Pacific-wide 
level  to  wider  integration  across  East  Asia. 

Process  of  Regional  Integration  in  Europe 

Limitations  of  space  preclude  a  comprehensive 
review  of  regional  integration  theories,  let  alone  of 
the  wealth  of  regional  experiences  around  the 
world.  This  section  reviews  functionalism  and  neo¬ 
functionalism  (collapsing  most  distinctions)  and 
liberal  intergovernmentalism  as  the  two  dominant 
approaches.  The  section  concludes  with  some 
comments  on  new  departures  in  regional  integra¬ 
tion  theory  concerning  the  EU,  departures  that 
signal  a  break  with  the  questions  posed  by  existing 
theoretical  frameworks. 

Functionalism 

The  causal  factors  emphasized  by  functionalism 
fit  into  three  categories:  (1)  the  fertile  postwar  envi¬ 
ronment,  (2)  the  initial  integrative  successes,  and 
(3)  the  interconnected  nature  of  modern  capitalist 
societies,  which  in  turn  leads  to  the  expansion  of 
these  initial  successes.  The  fertile  postwar  envi¬ 
ronment  was  characterized  by  late  capitalism 
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(industrial  and  postindustrial),  specialized  and 
partly  autonomous  interest  groups,  and  democratic 
forms  of  governance.  In  telling  the  story  of  European 
integration,  it  is  easy  to  downplay  Europe’s  early 
postwar  history,  when  major  political  problems 
(how  to  incorporate  Germany,  how  to  provide  for 
European  security)  were  addressed  and  resolved, 
and  focus  only  on  the  economic  side  of  the  picture. 
The  economic  recovery  of  Germany,  aided  by  a 
multilateral  structure  set  up  by  the  Marshall  Plan, 
and  the  subsequent  integration  of  Germany  into 
the  North  Atlantic  Treaty  Organization  (NATO)  in 
1955  provided  the  necessary  preconditions  for  fur¬ 
ther  integration.  Analysts  attempting  to  compare 
Europe  with  other  areas  of  the  world  should  keep 
these  salient  facts  in  mind. 

The  second  explanatory  factor  offered  by  func¬ 
tionalism  has  to  do  with  the  hoped-for  initial  suc¬ 
cesses  of  the  integration  project.  One  could  argue 
that  the  initial  successes  were  contingent  rather  than 
a  predicted  outcome  of  functionalist  theory. 
However,  the  success  of  the  initial  attempts  to  inte¬ 
grate  was  endogenous  in  the  sense  that  the  practitio¬ 
ners  and  analysts  believed  that  by  focusing  on  the 
less  controversial  sectors,  they  would  stand  a  better 
chance  of  succeeding.  In  good  measure,  they  were 
right  about  this  prediction.  The  free  trade  area,  the 
customs  union,  and  much  of  the  common  agricul¬ 
tural  policy  were  in  place  by  1968.  There  is  little 
doubt  that  these  successes  paved  the  way  for  further 
integration,  even  though  there  was  little  movement 
on  the  policy  front  between  1968  and  1985. 

Third,  functionalists  argue  that  the  modern  capi¬ 
talist  society  is  interconnected  and  these  intercon¬ 
nections  all  but  ensured  that  the  initial  success 
implied  further  integration.  Several  kinds  of  link¬ 
ages  are  theorized.  There  are  linkages  among  sec¬ 
tors  (e.g.,  trade  and  monetary),  linkages  from  the 
economic  to  political  spheres  (e.g.,  market  partici¬ 
pation  generates  citizenship  rights),  and  purely  tac¬ 
tical  linkages.  This  last  category  offers  negotiators 
the  possibility  of  linking  sectors  that  are  not  inher¬ 
ently  connected;  for  example,  market  expansion 
may  be  linked  to  regional  aid  for  the  purpose  of 
making  the  former  more  attractive  to  suspicious 
participants.  These  linkages  are  mechanisms  that 
perhaps  will  lead  to  further  integration,  once  the 
initial  foundations  are  in  place.  In  functionalism 
jargon,  the  secondary  integrative  processes  set  in 
motion  by  these  mechanisms  are  spillover  processes. 


The  preceding  discussion  can  be  thought  of  as 
dealing  with  the  demand  side  of  integration.  But 
the  demand  for  integration  does  not  automatically 
lead  to  integration.  Political  market  failures  may 
occur.  Inadequate  institutional  structures  or  weak 
leadership  may  thwart  the  process.  For  integration 
to  occur,  even  when  key  social  actors  want  it  to 
occur,  collective-action  problems  have  to  be  over¬ 
come.  For  this  reason,  functionalists  assigned  a 
central  role  to  supranational  institutions,  from  the 
High  Authority  of  the  European  Coal  and  Steel 
Community  (ECSC)  to  the  Commission  and 
European  Court  of  Justice  of  the  EU. 

In  the  able  hands  of  first-generation  scholars 
such  as  Ernst  Haas,  Philippe  Schmitter,  and  Leon 
Lindberg,  the  basic  analytics  of  regional  integra¬ 
tion  theory  were  put  into  place.  A  second  genera¬ 
tion  of  neo-functionalist  scholars  (e.g.,  Alec  Stone 
Sweet,  Wayne  Sandholtz,  Paul  Pierson)  extended 
the  basic  research  program  to  account  for  deeper 
processes  of  integration,  including  the  completion 
of  the  single  market,  economic  and  monetary 
union,  and  subsequent  treaty  revisions  (Amsterdam, 
Nice,  Lisbon).  In  a  1998  work,  Stone  Sweet  and 
Sandholtz  increased  our  institutional  understand¬ 
ing  of  the  EU.  Their  work  on  the  European  Court 
of  Justice  takes  pains  to  link  active  interest  groups 
with  the  structure  of  litigation  and  legislation.  The 
Commission,  Council  of  Ministers,  and  European 
Parliament  are  all  integrated  into  one  comprehen¬ 
sive  picture  of  triadic  dispute  resolution. 

At  the  same  time  that  Stone  Sweet  was  build¬ 
ing  his  model  of  triadic  dispute  resolution, 
Sandholtz  was  questioning  the  assumption  of 
exogenous  preferences.  Sandholtz  argued  that 
preference  formation  was  partly  endogenous  to 
membership  within  the  EU  itself.  In  his  1996 
work  “Membership  Matters,”  Sandholtz  rea¬ 
soned  that  attitudes  were  not  formed  in  an  insu¬ 
lated  domestic  context  and  then  transported  to 
Brussels,  where  officials  acted  on  them.  Instead, 
the  experience  of  member  states  working  together 
in  the  EU,  solving  problems  collectively  through 
EU  institutions,  was  expected  to  contribute  to 
the  formation  of  preferences,  along  with  the 
forces  emanating  from  the  domestic  and  interna¬ 
tional  political  economy.  Sandholtz’s  contribu¬ 
tions  should  be  seen  as  part  of  a  longer  line  of 
research  on  socialization  and  the  formation  of 
preferences. 
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Finally,  with  regard  to  the  extension  of  neo¬ 
functionalism,  the  theory  gets  a  more  rigorous 
dynamic  dimension  by  reframing  some  old  ques¬ 
tions  within  a  principal-agent  framework.  Of 
what  importance  are  unintended  consequences? 
Under  what  conditions  do  they  occur?  What 
accounts  for  the  discrepancies  between  the  incen¬ 
tives  and  information  available  to  principals 
(national  executives)  and  agents  (their  appointees 
in  Brussels)?  Pierson  was  able  to  provide  a  sounder 
theoretical  foundation  to  neo-functionalism,  giv¬ 
ing  some  previously  theoretically  unanchored 
propositions  a  home. 

Liberal  Intergovernmentalism 

The  major  rival  of  neo-functionalism  is  liberal 
intergovernmentalism  (LIG).  With  a  series  of  arti¬ 
cles  published  during  the  early  1990s,  culminating 
in  his  1998  book  The  Choice  for  Europe,  Andrew 
Moravcsik  set  out  the  foundations  of  a  state¬ 
centric  theory  of  the  integration  process,  one  that 
relies  on  the  preferences  of  the  member  states, 
formed  by  processes  external  to  member  (social¬ 
ization)  pressures.  Moravcsik  theorizes  that  the 
formation  of  preferences  of  state  actors  is  due 
mostly  to  the  pressures  emanating  from  the  domes¬ 
tic  and  global  political  economy. 

In  Moravcsik’s  “Preferences  and  Power  in  the 
European  Community:  A  Liberal  Intergovernment¬ 
al  Approach”  (1993),  he  identifies  a  two-stage 
theory  of  integration  in  which  the  first  stage  is 
preference  formation  by  state  leaders  and  the  sec¬ 
ond  stage  is  intergovernmental  bargaining.  A  third 
stage  is  added  in  The  Choice  for  Europe  (1998),  in 
which  bargained  outcomes  are  institutionalized  in 
an  attempt  to  prevent  them  from  being  overturned. 
At  the  core  of  the  model  lie  the  social  and  eco¬ 
nomic  interests  of  key  producer  and  commercial 
groups. 

After  the  preferences  of  state  actors  are  formed, 
governments  engage  in  international  bargaining. 
Here,  state  power  (oddly  absent  from  much  integra¬ 
tion  theory)  plays  a  role.  The  relative  costs  of  non¬ 
agreement,  in  particular  the  ability  to  revert  easily 
to  the  status  quo,  are  the  main  source  of  power. 
Countries  that  can  do  without  an  agreement,  for 
example,  Germany  in  the  case  of  the  Economic  and 
Monetary  Union  (EMU),  can  hold  out  for  favorable 
bargaining  outcomes. 


LIG  has  come  in  for  its  share  of  criticism.  It 
limits  itself  to  grand  bargains,  those  momentous 
occasions  when  states  confront  each  other  over 
high-stakes  outcomes.  This  focus  misses  the  way  in 
which  small  things  may  accumulate  and  create 
pressure  leading  to  an  important  international 
negotiation.  Indeed,  this  is  a  central  functionalist 
point,  namely,  that  relatively  mundane  and  nonsa¬ 
lient  events  can  become  transforming  if  viewed  in 
the  long  run.  LIG  and  functionalism  are  seen  as 
two  separate  theories,  but  they  may  actually  be 
linked  through  a  tissue  of  events  tying  together 
grand  bargains. 

A  second  criticism  is  that  pressure  for  renewed 
change  need  not  come  from  exogenous  or  endog¬ 
enous  change  in  the  domestic  or  global  economy. 
It  may  come  from  a  disequilibrium  present  from 
the  moment  a  grand  bargain  is  struck.  If  state 
actors  can  bring  about  a  bargain,  and  if  they  vote 
by  consensus  (which  they  do),  then  by  definition 
they  are  satisfied  with  the  bargain.  But  this  may 
not  be  the  case  for  institutional  actors,  for  exam¬ 
ple,  the  Commission,  the  Council  of  Ministers,  the 
European  Court  of  Justice,  or  the  European 
Parliament.  Each  has  its  own  institutional  inter¬ 
ests,  which  are  different  from  the  interests  of 
national  executives.  These  actors  may  agitate  for 
change  the  moment  the  grand  bargain  is  struck. 

New  Departures,  Europeanization, 
and  Approaches  Based  on  the  Social 
Purpose  of  Integration 

While  functionalism  and  LIG  occupy  most  of  the 
debate  regarding  regional  integration  theory,  some 
new  approaches  have  appeared  that  ask  different 
questions  and  introduce  novel  directions  in  our 
research.  Significant  new  directions  are  provided 
by  the  Europeanization  and  Gramscian  approaches. 
Space  constraints  allow  only  a  brief  word  about 
each. 

The  Europeanization  approach  already  claims  a 
substantial  following.  In  some  sense,  the 
Europeanization  framework  was  a  logical  out¬ 
growth  of  the  success  of  European  integration 
itself.  When  regional  institutions  were  weak  and 
policy  was  decided  at  the  national  level  (in  the 
early  stage),  the  natural  question  was  “How  do  we 
account  for  delegation  of  authority  to  the  regional 
level?”  When  regional  institutions  became  much 
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more  developed,  the  problematic  shifted  to  how 
these  institutions  functioned  to  make  policies 
(hence  “Europe  as  a  polity”).  And  when  Europe 
was  sufficiently  established  as  a  polity,  it  became 
natural  to  ask,  “How  do  its  institutions  and  poli¬ 
cies  feed  back  onto  the  very  political  systems  that 
compose  it?”  In  a  sense,  the  Europeanization 
approach  closes  the  conceptual  circle. 

A  final  approach — labeled  Gramscian — is  sin¬ 
gled  out  because  of  the  novelty  of  the  questions 
asked.  The  main  departure  from  the  other 
approaches  is  that  it  asks  a  question  that  is  aston¬ 
ishingly  absent  from  the  functionalism/LIG  debate. 
This  debate  centers  mostly  on  the  questions  of  how 
much  authority  is  delegated  to  the  supranational 
level  (and  under  what  conditions)  and  which  actors 
are  influential  in  the  process.  The  Gramscian 
approach  asks  about  the  social  and  political  content 
of  integration.  What  goals  are  served,  and  what 
groups  or  class  fractions  are  dominant?  These  ques¬ 
tions  are  approached  not  as  a  pluralist  battle  among 
all  organized  interest  groups  but  as  a  rivalry  among 
the  various  “fractions”  of  productive  and  financial 
capital.  A  key  advocate  of  this  line  of  analysis  is 
Bastian  van  Apeldoorn  (2002),  whose  work  on  the 
role  of  classes  and  business  groups  charts  out  new 
directions  in  integration  theory.  Others,  such  as 
Fritz  Scharpf  (1999)  and  James  Caporaso  and 
Sidney  Tarrow  (2009),  adopt  a  slightly  broader 
emphasis.  All  of  these  researchers  ask  about  the 
content  of  the  integration  project  rather  than  the 
level,  scope,  or  depth  of  regional  integration. 

The  future  of  regional  integration  is  likely  to  be 
increasingly  involved  with  attempts  at  liberaliza¬ 
tion  of  international  trade,  especially  in  non- 
European  settings.  Efforts  to  find  the  “right” 
membership  to  pursue  regional  goals  will  be 
salient,  especially  in  the  Asian  setting,  where 
ASEAN,  APEC,  and  ASEAN  +  3  all  compete  for 
regional  leadership.  In  Europe,  the  functionalist/ 
intergovernmental  debate  will  continue,  though 
in  muted  form,  but  it  will  share  the  stage  with 
newer  theories  based  on  Europe  as  a  polity, 
Europeanization  theories,  and  theories  emphasiz¬ 
ing  the  social  purpose  of  European  regional 
integration,  including  the  relationship  among  inte¬ 
gration,  the  welfare  state,  and  democracy. 

As  shown,  regional  integration  processes  are 
vibrant  across  the  globe.  It  is  hoped  that  the  gap 
between  academic  theories  of  integration  in  Europe 


and  in  other  parts  of  the  world  will  be  bridged  by 
the  continuing  research  on  comparative  regional 
integration. 

Mary  Anne  Madeira  and  James  A.  Caporaso 
University  of  Washington 
Seattle,  Washington,  United  States 
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Regionalism 


Regionalism  refers  to  the  political  and  cognitive 
idea  of  forming  regions.  It  is  usually  associated 
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with  a  formal  program,  and  since  the  mid-1980s, 
there  has  been  an  explosion  of  such  regional  pro¬ 
grams  on  a  global  scale.  The  broadening  and 
deepening  of  the  European  Union  (EU)  is  perhaps 
the  most  obvious  example,  but  regionalism  is  also 
evident  in  the  revitalization  or  expansion  of  many 
other  regional  projects  around  the  world,  such  as 
the  African  Union  (AU),  the  Association  of 
Southeast  Asian  Nations  (ASEAN),  the  Economic 
Community  of  West  African  States  (ECOWAS), 
the  North  American  Free  Trade  Agreement 
(NAFTA),  the  Southern  African  Development 
Community  (SADC),  the  Southern  Common 
Market  (Mercado  Comun  del  Sur,  MERCOSUR), 
and,  more  recently,  the  Union  of  South  American 
Nations  (Union  de  Naciones  Suramericanas, 
UNASUR). 

Today’s  regionalism  is  closely  linked  with  the 
shifting  nature  of  global  politics  and  the  intensifi¬ 
cation  of  globalization.  Regionalism  is  character¬ 
ized  by  the  involvement  of  almost  all  governments 
in  the  world,  but  it  also  involves  a  wide  variety  of 
nonstate  actors.  This  results  in  a  multitude  of  for¬ 
mal  and  informal  regional  types  of  governance  and 
regional  networks  in  most  fields  of  politics.  This 
pluralism  and  multidimensionality  of  contempo¬ 
rary  regionalism  gives  rise  to  a  number  of  new 
puzzles  and  challenges  in  today’s  politics. 

This  entry  first  defines  the  terms  region ,  region¬ 
alism. ,  and  regionalization.  Second,  it  provides  an 
overview  of  the  continuities  and  discontinuities 
between  the  early  and  the  more  recent  debates  on 
regionalism.  Finally,  it  presents  a  brief  discussion 
of  regionalism  and  world  order. 

Regions,  Regionalism,  and  Regionalization 

The  concept  of  a  region  evolved  historically  to 
mean  the  space  between  the  national  and  the  local 
within  a  particular  state.  This  meaning  may  be 
captured  by  the  term  microregion ,  or  subnational 
region.  The  concept  may  also  be  used  to  refer  to 
macroregions  (the  so-called  world  or  international 
regions),  which  are  larger  territorial  units,  as 
opposed  to  nonterritorial  units  or  subsystems. 
They  exist  between  the  state  level  and  the  global 
system  level.  The  macroregion  has  been  the  most 
common  object  of  analysis  in  international  rela¬ 
tions,  while  microregions  have  more  commonly 
been  considered  in  the  study  of  domestic  politics. 


A  conventional  definition  of  a  macroregion, 
originally  suggested  by  Joseph  Nye,  is  a  limited 
number  of  states  linked  together  by  a  geographical 
relationship  and  by  a  degree  of  mutual  interdepen¬ 
dence.  Historically,  a  plethora  of  opinions  have 
been  advanced  regarding  which  mutual  interde¬ 
pendencies  matter  the  most:  economic,  political, 
and  social  interrelationships  or  historical,  cultural, 
and  ethnic  bonds.  It  has  been  argued  that  the  defi¬ 
nitions  of  what  constitutes  a  region  may  vary 
according  to  the  particular  issue  under  investiga¬ 
tion.  This  may  lead  to  the  identification  of  overlap¬ 
ping  types  of  regions,  such  as  economic  regions, 
security  regions,  environmental  regions,  and  cul¬ 
tural  regions. 

The  overwhelming  majority  of  studies  in  the 
field  of  political  science  and  international  relations 
have  focused  on  predefined  regions  in  the  form  of 
state-led  and  interstate  regional  organizations  and 
frameworks.  The  AU,  ASEAN,  EU,  SADC,  and 
ECOWAS  are  examples.  The  argument  that  the 
composition  of  regions  should  not  be  limited  to 
regional  intergovernmental  organizations  has  been 
stressed  in  recent  constructivist  and  poststructural¬ 
ist  scholarship.  From  this  perspective,  all  regions 
are  deemed  to  be  socially  constructed  and  hence 
politically  contested.  Emphasis  is  placed  on  how 
political  actors  perceive  and  interpret  the  idea  of  a 
region  and  notions  of  “regionness.”  According  to 
this  school  of  thought,  there  are  no  “natural” 
regions;  all  regions  are  (at  least  potentially)  hetero¬ 
geneous,  with  unclear  territorial  margins.  The 
focus  is  often  on  the  processes  of  region  building 
and  regionalization,  which  implies  that  the  distinc¬ 
tion  between  regionalism  and  regionalization  is 
emphasized. 

Regionalism  means  the  body  of  ideas,  values, 
and  objectives  that  contribute  to  the  creation, 
maintenance,  or  modification  of  a  particular  region 
or  type  of  world  order.  It  is  usually  associated  with 
a  formal  policy  and  project  and  often  leads  to  insti¬ 
tution  building.  Further,  regionalism  ties  agents  to 
a  specific  project  that  is  limited  spatially  or  socially 
but  not  in  time. 

Regionalization  refers  to  the  process  of  region 
formation.  It  implies  a  focus  on  the  process  by  which 
regions  come  into  existence  and  are  consolidated, 
their  “becoming”  so  to  speak.  In  its  most  basic 
sense,  the  term  may  imply  no  more  than  the  concen¬ 
tration  of  goods,  services,  investment,  peoples,  and 
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ideas  at  the  regional  level.  This  interaction  may 
lead  to  the  emergence  of  regional  actors,  networks, 
and  organizations.  When  they  are  active  at  the 
regional  level,  regional  actors  bring  about  region¬ 
alization,  so-called  region  building.  Regionalization 
may  be  caused  by  regionalism,  but  it  may  also 
emerge  in  the  absence  of  a  regionalist  project  and 
ideology.  Hence,  regionalism  may  not  always  have 
much  practical  significance  for  the  reality  of 
regionalization. 

Early  and  Recent  Debates  on  Regionalism 

Experts  on  the  subject  often  distinguish  between 
the  “old”  and  the  “new”  regionalism,  or  what  is 
more  appropriately  labeled  the  early  and  the  more 
recent  debate  on  regionalism.  The  early  debate 
covers  research  undertaken  between  the  1950s  and 
the  1970s,  and  the  key  term  was  regional  integra¬ 
tion.  These  studies  generally  viewed  regional 
integration  as  an  endogenous  process,  that  is,  a 
development  that  arose  from  conditions  internal  to 
and  specific  to  each  region  in  question.  With  some 
exceptions,  particularly  studies  conducted  in  Latin 
America  and  Africa,  most  early  research  dealt  with 
European  integration,  as  there  were  few  other 
regional  experiments  to  theorize  about  at  that 
time.  The  dominant  approach  was  neo-functional- 
ism,  which  mainly  considered  the  “spillover”  from 
the  economic  integration  of  Western  Europe  to  its 
political  unification.  Among  the  leading  authors 
who  wrote  on  such  early  regionalism  were  Ernst 
Haas  and  Karl  Deutsch. 

The  1970s  was  a  period  of  “Eurosclerosis” 
within  the  European  Community,  but  the  1985 
white  paper  on  the  internal  market  and  the  Single 
European  Act  resulted  in  a  new,  dynamic  process 
of  European  integration.  This  was  also  the  start  of 
what  has  often  been  referred  to  as  the  “new 
regionalism”  on  a  global  scale.  To  some  observers, 
regionalism  was  “new”  mainly  in  the  sense  that  it 
represented  a  revival  of  protectionism  or  neo¬ 
mercantilism.  However,  most  observers  highlighted 
the  fact  that  the  closure  of  regions  was  not  on  the 
agenda.  Indeed,  one  of  the  characteristic  features 
of  the  more  recent  debate  on  regionalism,  espe¬ 
cially  within  the  field  of  international  relations,  is 
its  focus  on  the  conditions  related  to  globalization. 
The  recent  debate  is  to  a  large  extent  generated  by 
the  transformation  of  the  Westphalian  nation-state, 


the  erosion  of  national  borders,  and  the  urgent 
question  of  how  to  find  an  alternative  post-West- 
phalian  order  in  the  context  of  globalization. 
Perhaps  the  richest  literature  in  this  regard  is  on 
the  role  of  regionalism  in  the  context  of  multilat¬ 
eral  trade  and  security. 

One  prominent  scholar  of  the  recent  debate, 
Bjorn  Hettne,  stresses  that  regionalism  needs  to  be 
understood  from  both  an  exogenous  perspective 
(outside  in)  and  an  endogenous  perspective  (inside 
out).  The  former  perspective  refers  to  the  fact  that 
regionalization  and  globalization  are  intertwined 
articulations  of  global  transformation,  whereas  the 
latter  implies  that  regionalization  is  shaped  from 
within  a  region  by  a  large  number  of  different 
actors.  The  exogenous  perspective  has  developed 
primarily  in  the  course  of  the  recent  debate, 
whereas  the  endogenous  perspective  finds  continu¬ 
ity  with  functionalist  and  neo-functionalist  theoriz¬ 
ing  about  integration  (of  Europe),  the  role  of 
agency,  and  the  long-term  transformation  of  terri¬ 
torial  identities.  In  contrast  to  Haas,  Deutsch,  and 
the  early  regional  integration  scholars,  today’s 
scholars  identify  many  regionalisms,  which  pro¬ 
vides  a  very  different  base  for  comparative  studies 
of  regionalism.  It  is  apparent  that  neither  the  object 
of  study  (ontology)  nor  the  way  of  studying  it 
(epistemology)  has  remained  static.  One  indication 
of  this  is  the  emergence  of  a  variety  of  theoretical 
frameworks  for  the  study  of  regionalism,  such  as 
constructivism,  liberal  institutionalism,  regional 
security  complex  theory,  governance  approaches, 
the  new  regionalism  approach  (NRA),  and  the 
region-building  and  regional  network  approaches. 
Indeed,  current  regionalization  may  be  seen  as  a 
new  political  landscape  in  the  making,  character¬ 
ized  by  an  expanding  cast  of  actors  (state  and  non¬ 
state)  operating  in  the  regional  arena  and  across 
several  interrelated  dimensions:  security,  develop¬ 
ment,  trade,  environment,  identity,  and  so  on. 

Historically,  the  study  of  regionalism  and 
regional  integration  has  emphasized  states  as 
actors  and  focused  on  sovereignty  transfer  or 
political  unification  within  interstate  regional 
organizations.  Since  the  late  1990s,  research  has 
placed  greater  emphasis  on  “soft,”  de  facto  or 
informal  regionalization,  acknowledging  the  fact 
that  a  range  of  nonstate  actors  has  begun  to  oper¬ 
ate  at  the  regional  level,  within  as  well  as  beyond 
state-led  institutional  frameworks.  For  instance, 
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not  only  are  business  interests  and  multinationals 
operative  in  the  global  sphere,  but  they  also  tend 
to  create  regionalized  patterns  of  economic  activ¬ 
ity.  Oft-cited  examples  include  the  regional  pro¬ 
duction  systems  in  East  and  Southeast  Asia  and 
South  African  business  expansion  in  Southern 
Africa.  Similarly,  the  role  of  civil  society  is  often 
neglected  in  the  study  of  regionalism,  despite  the 
fact  that  its  impact  is  increasing,  as  is  evident  in  the 
transnational  activist  networks  and  processes  of 
civil  society  interaction  emerging  at  the  regional 
level  around  the  world. 

The  causal  relationship  between  regionalism 
and  regionalization  (or  formal  and  informal 
regionalism  as  it  is  sometimes  called)  has  attracted 
considerable  attention  in  the  recent  debate.  Key 
issues  are  whether  or  not  formal  regionalism  pre¬ 
cedes  informal  regionalization  and  the  ways  in 
which  state,  market,  and  civil  society  actors  relate 
and  come  together  in  different  formal  and  infor¬ 
mal  coalitions,  networks  and  modes  of  regional 
and  global  governance.  Consequently,  the  study  of 
regionalism  is  inseparable  from  the  study  of  glo¬ 
balization  and  world  order. 

Regionalism  and  the  World  Order 

Many  regionalists  contend  that  regions  have 
become  critical  to  contemporary  word  politics.  As 
Peter  Katzenstein  (2005)  points  out,  “This  view  is 
in  stark  contrast  to  those  who  focus  on  the  pur¬ 
portedly  stubborn  persistence  of  the  nation-state 
or  the  inevitable  march  of  globalization”  (p.  306). 
It  is  generally  agreed  that  globalization  challenges 
the  Westphalian  nation-state.  However,  political 
intervention,  or  at  least  some  form  of  governance, 
is  still  needed.  One  way  in  which  “the  political” 
may  return  is  as  a  reformed  neo-Westphalian  order 
(i.e.,  building  on  the  nation-state  system),  gov¬ 
erned  either  by  a  reconstituted  state-driven  multi¬ 
lateral  system  or  by  a  more  loosely  organized 
global  “concert”  of  the  regional  hegemons  of  the 
world,  such  as  Germany,  Japan,  Brazil,  and  South 
Africa. 

Regionalism  may  also  be  part  of  a  post-Westpha- 
lian  governance  model  (i.e.,  transcending  the  nation¬ 
state  system)  of  coexisting  regional  communities.  By 
comparison  with  economic  and  market-led  global¬ 
ization,  regionalism  is  more  anchored  to  territorial 
domains.  By  comparison  with  multilateralism,  it  is 


a  more  exclusive  relationship  based  on  conscious 
political  strategies,  potentially  guided  by  world 
order  values  such  as  multiculturalism  and  interre¬ 
gional  dialogue  and  cooperation. 

Fredrik  Soderbaum 
University  of  Goteburg 
Goteburg,  Sweden 
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Regionalization 


Regionalization  is  the  program  or  process  of 
forming  regions.  This  implies  the  reorganization 
of  a  territory  according  to  various  criteria  or 
dimensions  and  with  different  aims.  Considering  a 
state  or  any  political  entity  with  a  unitary  essence, 
regionalization  can  be  viewed  as  the  process  of 
rearranging  or  subdividing  its  territory  into  smaller 
administrative  and/or  political  units.  These  would 
acquire  powers  and  competences  previously  held 
by  the  superior  unit.  Hence,  regionalization  is  a 
decentralized  reorganization  of  territorial  interests 
and  political  positions.  Normally,  both  subna¬ 
tional  entities  (in  the  case  of  unitary  states)  and 
substate  entities  (in  the  case  of  federal  states)  are 
called  regions;  however,  the  same  reasoning  could 
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be  applied  at  lower  levels  of  decentralization, 
although  in  this  case  the  term  used  to  refer  to  these 
entities  might  be  a  different  one.  Nonetheless, 
under  this  common  thread  and  with  a  wide  range 
of  terminology,  many  different  realities,  dynamics, 
and  political  or  cultural  traditions  can  be  found: 
France’s  Departements,  Spain’s  Comunidades 
Autonomas,  Italy’s  Regioni,  and  many  other 
denominations  that  are  rooted  in  the  local  tradi¬ 
tions.  This  entry  points  out  the  key  dimensions  of 
the  phenomenon,  the  various  approaches  to  terri¬ 
torial  politics,  and  the  varieties  of  regionalization. 
The  last  section  clarifies  the  ways  in  which  differ¬ 
ent  state  policies  unfold  into  different  types  of 
regionalization  processes. 

Dimensions  of  the  Regional  Phenomenon 

The  regional  phenomenon  took  a  leading  role  in 
politics,  economy,  and  spatial  planning — especially 
in  the  case  of  the  more  developed  Unitarian  states — 
during  the  second  half  of  the  20th  century.  Many 
states,  mainly  in  Europe,  underwent  regional 
and  decentralizing  reforms,  which  are  among  the 
most  important  political  and  institutional  innova¬ 
tions  of  the  past  decades.  In  these  regional  facts,  at 
least  three  meaningful  dimensions  can  be  identi¬ 
fied:  First,  there  is  the  sociopolitical  and/or  eco¬ 
nomic  dynamics  underlying  these  regionalizing 
reforms,  which  change  even  the  structure  of  the 
relationship  between  the  state  and  the  local  enti¬ 
ties.  Second  is  the  impact  that  globalization  and 
other  dynamics  that  promote  suprastate  integra¬ 
tion — especially  the  European  one — can  have  on 
this  regionalization  progress.  Finally,  there  are 
cultural  aspects  that  surround  or  impel  regional¬ 
ization,  either  rediscovering  or  reasserting  territo¬ 
rial  or  regional  identities,  mainly  in  those  arenas 
where  regionalist  and/or  nationalist  forces  play  an 
important  role. 

From  another  point  of  view,  there  are  three 
other  elements  not  to  be  ignored.  First,  the  politi¬ 
cal  and  administrative  decentralization  created  by 
regionalization  affects  the  institutions  and  their 
relationships  at  the  territorial  level.  Next,  there  is 
the  socioeconomic  ingredient  of  spatial  planning, 
with  an  impact  on  uneven  development  and  on 
interterritorial  balance  and  cohesion.  Last,  there  is 
a  cultural  or  identifying  element,  which  can  be 
more  or  less  challenging  and  self-assertive  and 


which  may  in  varying  degrees  constitute  a  mobili¬ 
zation  in  defense  of  territorial  claims. 

Approaches  to  Territorial  Politics 

All  things  considered,  if  territory  is  seen  as  an 
essential  part  of  the  state  and  the  authority  or 
political  sovereignty  the  latter  has  over  it,  the  way 
such  territorial  authority  is  managed  and  orga¬ 
nized  becomes  territorial  politics,  and  regionaliza¬ 
tion  develops  within  this  frame.  Regionalization  as 
territory  management  is,  above  all,  a  way  to  retain 
and  renovate  the  state’s  sovereignty,  mainly  in 
those  contexts  where  there  are  identity  conflicts  or 
peripheral  nationalistic  demands.  In  these  cases, 
regionalization  can  even  go  as  far  as  secession, 
with  a  territory  demanding  a  redrawing  of  territo¬ 
rial  boundaries  and  instituting  a  different  form  of 
territorial  authority. 

The  issue  has  been  approached  from  different 
perspectives  within  political  science,  which  reflect 
various  theories  about  the  formation  and  consoli¬ 
dation  of  states.  Salient  among  these  is  the  diffu- 
sionist  approach,  which  dominated  the  paradigm 
for  decades  with  its  “center-periphery”  model. 
(Modernization  and  integration  theories  consider 
the  irreversibility  of  the  homogenization  process 
more  plausible. )  The  key  concept  of  this  paradigm 
is  the  formation  of  societies  and  states  around  a 
dominant  center  that  radiates  its  hegemony  to  the 
periphery,  absorbing  it  and  thus  forming  a  com¬ 
mon  social,  economic,  and  political  system — we 
are  referring  here  to  state-building  and  nation¬ 
building  paradigms.  This  hegemony  builds  on  the 
values  of  the  elites  within  the  different  public 
spheres,  hence  legitimizing  the  distribution  of 
power  in  all  domains — political,  economic,  social, 
cultural,  educative,  and  so  on.  Obviously,  this 
approach  is  not  limited  to  a  narrow  geographic 
conception  of  the  core-periphery  relationship  but 
is  linked  with  an  enlightened  vision  of  economic 
and  social  development  as  well  as  social  modern¬ 
ization.  A  strong  socioeconomic  determinism 
underlies  many  of  the  different  versions  of  this  dif- 
fusionist  approach.  This  determinism  maintains 
that  the  politics  of  the  state  are  shaped  by  eco¬ 
nomic  and  social  forces.  The  orthodox  Marxist 
version  of  this  conception  does  not  question  this 
core  idea  but  instead  corrects  it  from  the  point  of 
view  of  historical  materialism,  in  which  the  class 
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struggle  is  the  power  behind  social  and  political 
transformation.  Therefore,  its  state-centrism  point 
of  view  dominates  to  the  detriment  of  any  territo¬ 
rial  differences  or  identities,  which  are  relegated  to 
a  second-tier  status.  Nonetheless,  the  center- 
periphery  model  has  been  revised  mainly  in  its 
deterministic  component.  The  processes  that  states 
undergo  during  territorial  unification  can  be  very 
different,  and  the  same  applies  to  the  moderniza¬ 
tion  of  societies.  As  a  consequence,  we  find  asym¬ 
metric  territorial  development,  territorial  conflicts, 
and,  in  all,  territorial  differentiation,  which  can  be 
more  or  less  chronic  or  more  or  less  acute  within 
national  states.  Therefore,  the  territorial  politics  or 
matrix  leading  to  regionalization  is  the  result  of 
the  combination  of  the  different  factors  that  par¬ 
ticipate  in  the  process  of  state  unification. 

However,  some  authors  question  the  center- 
periphery  paradigm  due  to  its  state  centrism:  There 
are  states  with  more  than  one  center,  such  as  the 
United  States  and  Canada;  others  may  not  have 
any  clear  center  or  periphery,  such  as  the  Federal 
Republic  of  Germany  and  the  Swiss  Republic;  fur¬ 
ther,  there  are  states  where  the  peripheries  relate  to 
more  than  one  center,  such  as  Northern  Ireland 
and  its  relation  with  London  and  Dublin,  or  the 
Basque  Country,  with  Madrid  and  Paris.  From  this 
new  point  of  view,  and  considering  the  existence  of 
strong  territorial  identities  that  have  been  politi¬ 
cized  and  activated  by  nationalist  and/or  regional- 
ist  movements,  a  new  paradigm  can  be  found 
based  on  societal  fragmentation,  territorial  asym¬ 
metry,  and  centrifugal  politics. 

Varieties  of  Regionalization 
in  Territorial  Politics 

Regionalization  processes  vary  depending  on  the 
characteristics  of  territorial  matrixes  and  the  inte¬ 
gration  processes  of  national  states,  as  well  as 
whether  there  are  nationalist  or  regionalist  periph¬ 
eral  demands,  which  can  be  more  or  less  powerful. 
States  undergoing  regionalization  take  into  account 
territorial  differences,  even  if  these  are,  on  the  one 
hand,  objective  or  material  ones  or,  on  the  other 
hand,  subjective  or  sociocultural  ones. 

The  former,  socioeconomic  dissimilarities,  are 
the  result  of  uneven  development  and  the  depen¬ 
dence  dynamics  stemming  from  the  processes  of 
modernization  and  industrialization  of  societies. 


They  can  generate  interregional  geopolitical  dis¬ 
cordances,  as  well  as  other  disparities  among  the 
central  and  peripheral  elites — political,  economic, 
social,  and  cultural. 

The  latter,  subjective  or  sociocultural  dissimi¬ 
larities,  are  normally  due  to  the  contradictory  per¬ 
ceptions  that  national  and  territorial  elites  have  of 
the  process  of  integration  and  homogenization  at 
the  nation-state  level.  These  differences  can  gener¬ 
ate  tensions  and  conflicts  of  identity,  especially 
when  there  are  regionalist  or  nationalist  move¬ 
ments  in  the  periphery  that  claim  their  own  signs 
of  identity — their  own  history,  ethnic  or  religious 
differences,  linguistic  struggles,  and  so  on — and 
are  responding  to  the  centralization  and  homoge¬ 
nization  process  at  the  nation-state  level.  Therefore, 
the  regional  differentiation  underlying  the  region¬ 
alization  process  can  be  due  to  political,  economic, 
or  sociocultural  reasons,  or  even  a  combination  of 
all  these,  and  always  seek  to  better  the  governance 
of  the  nation-state  territory. 

On  the  basis  of  this  varied  matrix,  the  different 
forms  and  strategies  of  the  regionalizing  politics 
that  state-nations  (territories  that  have  undergone 
regionalization  at  the  state  level)  apply  under  this 
new  governance  are  the  result  of  the  characteristics 
that  define  and  homogenize  their  regions,  whether 
real  or  potential.  These  policies  always  aim  at 
developing  and  improving  the  process  of  national 
integration  and  cohesion.  They  also  seek  to  ensure 
the  maintenance  of  territorial  sovereignty  of  the 
state,  which  is  endangered  either  by  an  excess  of 
centralized  homogenization  or  by  the  inefficiencies 
and  limitations  distinctive  of  economic  and  soci¬ 
etal  modernization  processes.  Hence,  depending 
on  those  state  policies,  different  types  of  regional¬ 
ization  processes  can  be  found. 

First,  there  is  the  sheer  arrangement  or  manage¬ 
ment  of  the  territory — applying  economic  and 
administrative  criteria — by  means  of  the  creation 
of  homogeneous  regions.  These  regions  assume  the 
decentralized  administrative  management  of  the 
state’s  responsibilities,  in  the  interest  of  a  better 
interterritorial  balance.  The  unitary  character  of 
the  state  is  not  questioned  at  any  point,  nor  is  the 
symmetry  of  the  new  regional  division,  which  is 
limited  to  the  amount  of  state  decentralization 
achieved. 

The  second  type  of  process  goes  a  step  further  in 
decentralizing  institutions  of  a  national  character 
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within  the  region,  as  well  as  government  functions 
that  are  recognized  as  properly  subject  to  local  sov¬ 
ereignty.  This  type  answers  the  political  claims  of 
regional  elites  and  politicians,  following  a  type  of 
federalizing  dynamic — either  symmetrical  or  not — 
that  does  not  question  the  unitary  nature  of  the 
state.  The  British  and  Spanish  examples  are  a  good 
illustration  of  this  second  type  of  regionalization. 

Finally,  we  find  federal  or  confederal  structures, 
either  symmetrical  or  not,  with  shared  sover¬ 
eignty.  Ffere,  Belgium  is  the  most  recent  example. 
The  extreme  case  is  always  territorial  secession, 
when  coexistence  under  the  roof  of  the  same 
nation-state  becomes  untenable  for  all  elites,  even 
after  other  forms  of  advanced  regionalization 
have  proved  to  be  impossible.  The  former  USSR, 
Yugoslavia,  and  Czechoslovakia  are  good  exam¬ 
ples  of  this  impossibility. 

Francisco  J.  Llera 
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Regression 


Regression  analysis  is,  by  far,  the  most  commonly 
used  statistical  technique  in  political  science. 


While  regression  analysis  can  be  defined  in  many 
different  ways,  it  is  a  tool  for  describing  the  rela¬ 
tionships  among  variables.  Regression  is  generally 
used  for  two  complementary  purposes.  First,  it 
measures  the  effects  of  one  or  more  independent 
variables  on  a  single  dependent  variable.  Second, 
it  can  be  used  to  predict  the  values  of  a  dependent 
variable  that  can  be  expected  to  occur  at  specified 
levels  of  the  independent  variables.  In  the  former 
capacity,  regression  is  a  useful  tool  for  theory  test¬ 
ing.  In  the  latter  capacity,  regression  is  a  forecast¬ 
ing  tool  that  is  useful  for  decision  making.  These 
two  uses  are  closely  connected.  Theories  are  eval¬ 
uated  by  their  ability  to  predict  as  yet  unobserved 
phenomena,  and  forecasting  is  carried  out  most 
effectively  when  predictions  are  based  on  a  sub¬ 
stantive  theory.  In  the  following,  the  major  fea¬ 
tures  and  applications  of  regression  analysis  are 
discussed. 

If  there  is  one  independent  variable,  the  analysis 
is  commonly  called  a  simple  or  bivariate  regres¬ 
sion.  If  there  is  more  than  one  independent  vari¬ 
able,  it  is  called  multiple  regression  analysis.  In 
traditional  nomenclature,  the  dependent  variable  is 
generically  represented  as  Y  and  the  independent 
variable  is  designated  as  X.  If  there  are  multiple 
independent  variables,  the  Xs  are  given  subscripts, 
say  Xj,  X2,  .  .  .  ,  Xp  .  .  . ,  Xk  for  k  variables  (note 
that  the  specific  order  of  the  X;s  generally  does  not 
matter).  For  the  moment,  we  will  assume  that  all 
variables  are  relatively  continuous  and  measured 
at  the  interval  level. 

The  immediate  objective  of  a  regression  analysis 
is  to  show  how  the  conditional  distribution  of  Y 
varies  across  the  values  of  X  (or  across  the  values 
of  the  X-s  in  a  multiple  regression).  Attention  is 
usually  focused  on  the  conditional  mean  of  Y 
rather  than  the  entire  conditional  distribution.  If 
the  conditional  Y  distributions  (and,  specifically, 
the  conditional  Y  means)  do,  in  fact,  differ  system¬ 
atically  across  the  X  values,  then  Y  is  said  to  be 
related  to  X.  If  the  conditional  Y  distributions  do 
not  vary  across  the  X  values,  then  X  and  Y  are 
unrelated  to,  or  independent  of,  each  other. 

Bivariate  Regression 

There  are  several  ways  in  which  a  regression  anal¬ 
ysis  can  be  carried  out.  For  example,  a  graphical 
approach  would  simply  plot  Y  against  X  and 
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superimpose  a  curve  that  traces  out  the  conditional 
Y  means  across  the  range  of  X.  While  this  strategy 
is  often  useful  for  exploring  bivariate  data,  it 
becomes  much  more  difficult  in  the  multiple 
regression  case.  And  there  are  some  practical  limi¬ 
tations  in  using  a  graph  as  the  final  output  of  the 
analysis  in  any  context.  Further,  the  sample  esti¬ 
mates  of  the  conditional  Y  means  are  likely  to  be 
somewhat  unreliable  and  unstable  since  there  are 
usually  only  a  small  number  of  observations  at 
each  distinct  value  of  X. 

An  alternative  approach  would  be  to  specify  a 
function  that  describes  the  hypothesized  relation¬ 
ship  between  Y  and  the  X  s.  The  empirical  analysis 
then  estimates  the  parameters  of  this  function  and 
determines  how  well  the  function  actually  repre¬ 
sents  the  data  at  hand.  Probably  the  simplest  func¬ 
tion  for  relating  the  variables  is  linear  in  form.  In 
the  bivariate  case,  a  linear  function  would  be 
shown  as  follows: 

Y,.  =  a  +  bX, 

The  subscript,  i,  that  appears  with  Y  and  X  in  the 
preceding  equation  is  an  index  for  the  observa¬ 
tions;  it  ranges  from  I  to  n  (the  total  number  of 
observations)  and  is  used  here  to  indicate  that  the 
values  of  the  two  variables  are  different  from  one 
observation  to  the  next.  The  other  two  terms,  a 
and  b,  are  the  coefficients;  they  do  not  have  sub¬ 
scripts,  indicating  that  their  numerical  values  are 
constant  across  the  n  observations  in  the  data. 
Notice  how  this  function  provides  a  simple  rule  for 
starting  with  any  observation’s  value  on  the  X 
variable  and  producing  the  observation’s  Y  value. 
If  this  function  actually  did  provide  an  accurate 
description  of  a  bivariate  data  set,  then  a  scatter- 
plot  of  Y  versus  X  would  produce  n  points  arrayed 
along  a  straight  line. 

The  b  coefficient  is  often  called  the  slope  of  the 
line.  The  absolute  value  of  the  number  associated 
with  b  gives  the  steepness  of  the  line.  The  sign  of 
the  number  (plus  or  minus)  indicates  whether  the 
line  runs  from  lower  left  to  upper  right  (a  positive 
relationship)  or  from  upper  left  to  lower  right  (a 
negative  relationship).  The  slope  shows  the  differ¬ 
ence  in  the  values  of  Y  that  is  associated  with  a 
single-unit  increase  in  X.  This  is  generally  inter¬ 
preted  as  Ys  responsiveness  to  the  X  variable.  The 
a  coefficient  is  often  called  the  intercept  of  the  line. 


It  gives  the  value  of  the  Y  variable  that  occurs 
when  X  =  0.  The  intercept  determines  the  vertical 
position  of  the  line  within  the  scatterplot  of  Y  ver¬ 
sus  X.  But  the  intercept  usually  does  not  have  a 
substantive  interpretation. 

The  simple  linear  function  depicted  above  is  not 
very  useful  by  itself  because  empirical  data  seldom 
(if  ever)  conform  to  such  a  clear-cut  pattern.  But  a 
scatterplot  of  Y  against  X  may  show  an  approxi¬ 
mately  linear  orientation  that  might  be  usefully 
summarized  by  a  linear  function.  In  that  case,  the 
previous  equation  would  be  modified  as  follows: 

Y,  -  Y,  +  £>„ 

where  Y,  =  a  +  bX,. 

Hence,  the  full  regression  equation  is 

Yj  —  ci  +  bX;  +  er 

In  the  preceding  equations,  Y  is  a  new,  “imagi¬ 
nary”  variable,  created  as  part  of  the  analysis. 
Unlike  Y  itself,  Y  is  perfectly  linearly  related  to  X. 
The  other  new  term  on  the  right  side  of  the  equals 
sign,  e;,  is  a  residual,  indicating  that  the  value  of  Y( 
generally  does  not  fall  exactly  on  the  line.  If  the 
value  of  the  residual  for  observation  i  is  a  positive 
number,  then  the  point  representing  i  in  the  scat¬ 
terplot  falls  above  the  line  relating  Y  to  X.  If  the 
residual  is  negative,  then  the  point  falls  below 
the  line.  If  the  residual  is  zero,  then  Y  =  Y,.  and  the 
point  falls  exactly  on  the  line. 

The  immediate  analytical  objective  in  the  regres¬ 
sion  analysis  is  to  define  Y  (i.e.,  select  numeric 
values  for  the  coefficients,  a  and  b)  such  that  the 
line  relating  Y  to  X  passes  through  the  middle  of 
the  point  cloud  in  the  scatterplot,  coming  as  close 
as  possible  to  as  many  of  the  points  as  possible. 
Here,  “closeness”  is  defined  as  the  vertical  distance 
from  each  data  point  to  the  line;  hence,  it  corre¬ 
sponds  to  the  residual  for  each  observation.  But 
residuals  can  be  either  positive  or  negative  in  value, 
so  they  are  typically  squared  to  provide  positive 
quantities  for  each  observation.  Then,  the  line  is 
fitted  such  that  it  produces  the  smallest  possible 
sum  of  squared  residuals  across  the  n  observations 
in  the  data  set.  This  is  called  the  least  squares  cri¬ 
terion  for  fitting  the  regression  line,  and  the  math¬ 
ematical  procedure  that  is  typically  used  to  find  the 
line  is  called  ordinary  least  squares  (OLS). 
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If  the  point  cloud  in  the  scatterplot  really  does 
conform  to  a  linear  pattern,  then  the  line  produced 
by  the  OLS  method  can  be  used  as  a  model,  or 
abstract  representation,  of  the  data.  It  provides  a 
succinct  summary  of  any  systematic  linear  struc¬ 
ture  that  exists  within  the  observations.  But  it  is 
important  to  develop  a  measure  of  how  well  the 
OLS  line  summarizes  the  observed  data.  To  do  so, 
we  begin  with  the  fact  that  Y and  the  residual  term 
will  be  uncorrelated  with  each  other  (this  is  a  con¬ 
sequence  of  the  OLS  procedure  and  our  require¬ 
ment  that  the  fitted  line  runs  through  the  middle  of 
the  point  cloud  in  the  scatterplot).  Therefore,  the 
variance  of  Y  can  be  broken  down  neatly  into 
the  sum  of  the  variance  of  Y  and  the  variance  of 
the  residual,  e.  A  goodness-of-fit  measure,  called 
R 2  (this  term  is  used  because  R 2  is  equal  to  the 
square  of  the  correlation  r  between  X  and  Y),  is 
defined  as  the  ratio  of  the  variance  of  Y  to  the  vari¬ 
ance  of  Y.  R2  is  often  interpreted  as  the  proportion 
of  variance  in  Y  that  is  “explained”  by  X,  since  it 
represents  the  part  of  Y’s  variance  that  is  linked  to 
X  through  Y.  Since  R2  is  a  proportion,  it  varies 
between  zero  and  one.  The  value  of  R2  indicates 
the  degree  to  which  the  data  conform  to  a  linear 
pattern.  If  R2  —  0,  then  there  is  no  linear  relation¬ 
ship  at  all  between  the  variables;  an  R2  of  1.0 
occurs  only  when  all  of  the  observations  fall  per¬ 
fectly  along  a  single  line  within  the  scatterplot  of 
the  bivariate  data. 

Multiple  Regression 

The  idea  of  fitting  a  linear  function  to  data  can  be 
generalized  very  easily  to  situations  with  more 
than  one  independent  variable,  producing  a  mul¬ 
tiple  regression  equation.  Once  again,  we  begin 
with 

Y,  =  Y,  +  ef. 

But  now,  Yis  defined  a  bit  differently  than  in  the 
bivariate  case: 

Yj  =  a  +  biXn  +  bzXa  +  •  •  •  +  bjXjj 

+  •  •  •  +  bkXik. 

Here,  the  Y  variable  is  a  linear  combination  of  k 
distinct  independent  variables.  The  precise  value  of 


k  is  determined  by  the  number  of  variables  the 
researcher  believes  to  be  influencing  the  dependent 
variable,  Y.  The  coefficient  associated  with  each 
independent  variable  (i.e.,  bj  for  X.)  shows  the  dif¬ 
ference  in  Y  that  occurs  when  X-  is  increased  by 
one  unit.  The  coefficient  a  is  once  again  an  inter¬ 
cept;  it  represents  the  value  of  Y  that  occurs  when 
all  of  the  independent  variables  are  equal  to  zero. 
The  full  multiple  regression  equation  is  formed  by 
combining  the  preceding  two  equations: 

Yi  —  a  +  b\Xt\  +  b2Xa  +  ■  ■  ■  +  bjX 
+  •  •  •  +  bkXlk  +  et. 

As  in  the  bivariate  case,  et  is  a  residual,  indicating 
that  Y  is  not  a  perfect  linear  function  of  the  X;s. 

Once  again,  the  least  squares  criterion  is  used  to 
estimate  the  coefficients  in  the  multiple  regression 
equation.  The  value  of  k  must  be  smaller  than  the 
number  of  observations;  otherwise,  there  will  not 
be  enough  information  to  calculate  numeric  values 
for  the  coefficients.  But  if  that  is  the  case,  the  OLS 
methodology  can  be  generalized  in  a  straightfor¬ 
ward  manner  to  produce  the  set  of  coefficients,  a, 
bv  b2,  .  .  . ,  bk,  that  minimize  the  sum  of  the 
squared  residuals  across  the  set  of  n  observations. 
Analogous  to  the  regression  line  in  the  bivariate 
case,  OLS  for  multiple  regression  produces  a  flat 
surface  in  the  multidimensional  scatterplot  with 
axes  formed  by  the  k  independent  variables  and 
the  dependent  variable.  Once  again,  the  OLS  coef¬ 
ficients  position  this  surface  so  that  it  comes  as 
close  as  possible  to  the  data  points.  Closeness  is  still 
defined  in  terms  of  the  squared  vertical  distances 
from  the  regression  surface  to  the  data  points.  The 
overall  fit  of  the  multiple  regression  equation  to  the 
data  is  still  measured  with  the  R2  statistic.  The  lat¬ 
ter  is  defined  as  before:  the  ratio  of  the  variance  in 
Y  to  the  variance  in  Y  itself.  This  gives  the  propor¬ 
tion  of  Y’s  variance  that  is  related  to  (or  explained 
by)  the  independent  variables  through  the  regres¬ 
sion  equation.  R2  is  equal  to  the  square  of  the  cor¬ 
relation  between  Y  and  Y ;  therefore,  it  is  sometimes 
called  the  squared  multiple  correlation  coefficient. 
The  interpretation  of  specific  R2  values  remains  the 
same  as  in  the  bivariate  case. 

In  the  multiple  regression  equation,  the  b-s  are 
often  called  multiple  regression  coefficients  or  par¬ 
tial  slope  coefficients.  They  are  generally  interpreted 
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as  measures  of  each  independent  variable’s  unique 
linear  effect  on  the  dependent  variable,  after  the 
linear  effects  of  the  remaining  k  —  1  independent 
variables  have  been  statistically  “removed.”  The 
OLS  procedure  estimates  all  k  +  1  coefficients  (i.e., 
the  k  partial  slopes  and  the  intercept)  simultane¬ 
ously;  therefore,  the  value  of  each  coefficient 
typically  is  affected  by  the  values  of  the  remaining 
coefficients.  The  only  exception  occurs  when  the 
independent  variables  are  perfectly  uncorrelated 
with  each  other — a  situation  that  seldom  (if  ever) 
exists  with  nonexperimental  data. 

Political  scientists  typically  use  a  regression 
equation  as  a  model  of  the  structural  relationships 
among  theoretically  relevant  variables.  Substantive 
theories  usually  specify  multiple  causal  agents  for 
any  given  dependent  variable.  And  there  are  often 
mediating  factors  that  affect  the  relationships 
between  variables.  Such  phenomena  need  to  be 
incorporated  into  the  empirical  analyses  that  test 
theoretical  propositions.  For  this  reason,  multiple 
regression  is  employed  more  frequently  than  bivar¬ 
iate  regression  in  substantive  applications.  In  addi¬ 
tion,  theories  are  broad  statements.  Therefore, 
researchers  are  often  concerned  with  statistical 
inference.  Rather  than  using  the  regression  equa¬ 
tion  as  a  tool  to  describe  the  structure  within  a 
given  data  set,  the  objective  usually  is  to  generalize 
from  the  observed  data  to  the  larger,  but  unob¬ 
served,  population  of  interest. 

Statistical  Inference 

Statistical  inference  begins  with  a  set  of  assump¬ 
tions  about  the  processes  that  generated  the 
observed  data.  First,  it  is  assumed  that  the  regres¬ 
sion  equation  accurately  mirrors  the  structural 
relationships  that  exist  within  the  population.  So 
for  a  context  in  which  Y  is  affected  by  k  indepen¬ 
dent  variables,  the  population  structure  is  assumed 
to  be 

Y,-  =  a  +  P|X,|  +  $2Xi2  +  •  •  •  +  p;.X* 

+  •  •  •  +  +  £<• 

The  variables  in  the  preceding  equation  are 
identical  to  those  in  the  regression  equation  pre¬ 
sented  earlier  (although  they  now  encompass  all 
objects  in  the  population,  not  just  a  limited  sample 
of  n  observations).  The  coefficients  (a  and  the  P;s) 


are  interpreted  as  before;  but  now,  they  are  fixed 
population  parameters  rather  than  statistics  whose 
values  vary  from  one  sample  to  the  next. 

The  last  term  on  the  right-hand  side  of  the  equa¬ 
tion,  8  ■,  is  the  error  or  disturbance  term.  This  can 
be  interpreted  as  the  sum  total  of  all  influences  on 
Y  that  are  not  explicitly  included  in  the  equation 
(i.e.,  apart  from  X,,  X2, .  .  . ,  Xk).  The  disturbance 
term  is  usually  assumed  to  consist  of  random 
“noise.”  That  is,  its  distribution  is  assumed  to  have 
a  mean  of  zero  and  a  variance  that  is  fixed  across 
all  possible  combinations  of  X;  values.  The  distur¬ 
bance  is  also  assumed  to  be  uncorrelated  with  all 
the  variables  (including  itself).  The  Xs  are  assumed 
to  be  independent  of  each  other  and  measured 
without  error.  For  convenience,  they  are  also  usu¬ 
ally  assumed  to  comprise  fixed  values  for  the  n 
observations,  meaning  that  they  do  not  vary  across 
repeated  samples  drawn  from  the  same  population. 

When  all  of  the  preceding  assumptions  hold,  the 
OLS  coefficient  estimates  possess  some  useful  sta¬ 
tistical  properties.  First,  the  multiple  regression 
coefficients  are  unbiased  estimates  of  the  corre¬ 
sponding  population  parameters;  that  is,  bj  will 
not  systematically  over-  or  underestimate  the  cor¬ 
responding  |3,  Second,  the  OLS  estimates  are  con¬ 
sistent  in  that  sampling  variance  decreases  as 
sample  size  increases;  in  other  words,  larger  sam¬ 
ples  generate  more  precise  estimates  of  the  popula¬ 
tion  parameters,  and  with  large  enough  samples, 
this  precision  becomes  almost  perfect.  And  third, 
the  OLS  estimates  are  efficient  in  that  their  sam¬ 
pling  variances  usually  are  smaller  than  those  of 
alternative  estimators  of  the  values.  A  result 
called  the  Gauss-Markov  theorem  proves  that  the 
OLS  coefficients  are  the  most  efficient  of  all  linear 
unbiased  estimators  (i.e.,  unbiased  estimators  that 
are  a  linear  combination  of  the  data). 

The  OLS  coefficients  are  linear  combinations  of 
the  sample  Y  values.  Therefore,  a  generalization  of 
the  central  limit  theorem  implies  that  the  coeffi¬ 
cients  are  normally  distributed.  Student’s  t  distribu¬ 
tion  can  be  used  with  the  estimated  standard  errors 
to  construct  confidence  intervals  for  and  test 
hypotheses  about  the  population  values  of  the 
respective  regression  coefficients.  One  can  also  test 
the  null  hypothesis  that  the  variance  of  Y  equals 
zero;  this  is  often  interpreted  as  a  test  of  the  statisti¬ 
cal  significance  of  the  overall  regression  equation.  In 
other  words,  it  is  the  functional  equivalent  of  testing 
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the  null  hypothesis  that  R2  —  0  in  the  population 
and  that  any  observed  nonzero  regression  coeffi¬ 
cients  are  due  to  the  effects  of  sampling  error  alone. 

Statistical  tests  of  the  joint  effects  of  several 
independent  variables  can  be  carried  out  in  several 
ways.  For  example,  a  t  test  can  be  performed  on  a 
linear  combination  of  regression  coefficients.  Or 
an  F  test  can  be  used  to  determine  whether  adding 
additional  variables  to  an  equation  significantly 
improves  the  fit  to  the  data.  And,  more  generally, 
the  F  test  can  be  used  to  test  hypotheses  about  sets 
of  linear  combinations  of  regression  coefficients. 
This  flexibility  in  the  strategies  used  for  statistical 
inference  is  one  of  the  most  attractive  features  of 
OLS  estimates  in  regression  analysis. 

Applications 

Another  reason  for  the  popularity  of  the  linear 
regression  model  is  its  adaptability  to  different 
research  situations.  For  example,  categorical  inde¬ 
pendent  variables  (in  which  observations  are 
distributed  across  m  discrete  categories)  can  be 
incorporated  into  an  equation  by  creating  m  —  1 
dummy  variables  representing  all  of  the  respective 
categories  except  one  (which  is  designated  the  “ref¬ 
erence”  category).  And  curvilinear  relationships 
can  often  be  modeled  by  applying  transformations 
to  the  independent  or  dependent  variables  or  by 
incorporating  polynomial  terms.  Similarly,  multi¬ 
plicative  terms  between  two  (or  more)  independent 
variables  can  be  used  to  represent  conditional 
effects,  wherein  the  impact  of  one  independent 
variable  on  the  dependent  variable  is  itself  affected 
by  the  value  of  another  independent  variable. 

Since  regression  analysis  is  used  primarily  as  a 
theory-testing  tool,  there  is  often  great  attention 
paid  to  the  question  of  exactly  which  independent 
variables  are  included  in  the  regression  equation. 
While  there  are  stepwise  procedures  that  select 
regressors  for  inclusion  from  a  pool  of  potential 
independent  variables,  they  are  not  typically 
employed  in  political  science  because  of  their  data- 
driven,  “atheoretical”  nature.  Instead,  substantive 
theory  is  usually  regarded  as  the  appropriate  guide 
for  the  specification  of  regression  models.  This  is  a 
critical  issue.  If  theoretically  relevant  variables  are 
excluded  from  the  equation,  then  the  coefficients  of 
the  included  independent  variables  are  likely  to  be 
biased  estimates  of  their  associated  population 


parameters.  This  occurs  because  the  included  vari¬ 
ables  “pick  up”  some  of  the  effects  on  the  dependent 
variable  that  properly  should  be  attributed  to  the 
omitted  independent  variables.  Alternatively,  includ¬ 
ing  extraneous  variables  in  the  regression  equation 
(i.e.,  X;s  that  have  no  real  effect  on  Y)  will  decrease 
the  precision  of  the  coefficient  estimates  since  they 
introduce  error  into  the  estimation  process. 

In  addition  to  theory  testing,  there  are  situations 
in  which  regression  analysis  is  used  as  a  forecasting 
tool,  to  predict  the  value  of  the  dependent  variable 
under  specified  conditions.  When  the  various 
assumptions  about  the  model  and  the  disturbance 
term  hold,  the  predicted  value  for  a  given  observa¬ 
tion,  Yh  can  be  interpreted  as  an  estimate  of  the 
conditional  mean  of  Y.  So  the  analyst  can  make 
predictions  about  the  average  level  of  the  dependent 
variable  that  will  occur  when  the  independent  vari¬ 
ables  are  held  at  specified  values.  Further,  a 
standard  error  for  the  conditional  mean  can  be  cal¬ 
culated  very  easily.  So  the  precision  of  the  prediction 
can  also  be  estimated,  using  a  confidence  interval  or 
a  hypothesis  test.  This  makes  regression  analysis  a 
useful  tool  to  provide  formal  guidance  for  decision 
making  in  a  wide  variety  of  substantive  contexts. 

As  mentioned  earlier,  regression  analysis  typi¬ 
cally  is  used  with  nonexperimental  data.  Therefore, 
nonzero  correlations  will  almost  always  exist 
among  the  independent  variables.  This  situation  is 
called  multicollinearity.  Multicollinearity  makes  it 
impossible  to  identify  the  specific  effects  of  each 
independent  variable  because  to  the  extent  that 
any  two  independent  variables  are  correlated,  they 
will  exert  a  nonseparable,  shared  impact  on  the 
dependent  variable.  In  practical  terms,  multicol¬ 
linearity  makes  the  coefficient  estimates  less 
precise  and  increases  the  standard  errors.  Multi¬ 
collinearity  is  a  condition  that  exists  in  the  data;  it 
is  not  a  feature  of  the  statistical  model.  There  are 
diagnostic  tests  that  can  indicate  the  degree  to 
which  multicollinearity  exists  in  a  regression  equa¬ 
tion.  But  there  is  no  way  to  “fix”  or  eliminate  its 
effects.  Fortunately,  multicollinearity  really  does 
not  cause  serious  problems  in  most  research  con¬ 
texts.  The  regression  coefficients  remain  unbiased, 
and  the  inflation  in  the  standard  errors  is  usually 
not  large  enough  to  compromise  the  process  of 
statistical  inference. 

Valid  interpretation  of  the  results  from  a  regres¬ 
sion  analysis  is  only  possible  if  the  assumptions 
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about  the  model  and  the  disturbances  reflect  the 
actual  conditions  that  exist  in  the  data.  Therefore, 
it  is  critical  to  employ  tools  called  regression  diag¬ 
nostics  to  check  the  veracity  of  the  assumptions. 
This  often  involves  examining  a  graph  of  the  residu¬ 
als  against  the  corresponding  Y,  values.  This  scat- 
terplot  shows  the  spread  of  the  observations  around 
the  fitted  regression  surface.  If  the  regression 
assumptions  hold,  then  the  residuals  are  similar  to 
the  disturbances,  and  they  should  appear  to  be  ran¬ 
dom  noise  with  no  systematic  structure.  To  the 
extent  that  the  researcher  can  perceive  patterns  or 
shapes  in  the  residual  plots,  it  indicates  that  the 
regression  model  is  “missing”  some  type  of  struc¬ 
ture  that  exists  within  the  data.  It  is  also  important 
to  identify  outliers,  or  observations  with  particu¬ 
larly  large  residuals;  in  certain  situations,  such 
observations  can  be  highly  influential  in  determin¬ 
ing  the  values  of  the  regression  coefficients.  When 
such  problems  are  diagnosed,  there  are  usually  cor¬ 
rective  actions  that  can  be  taken  to  alleviate  their 
effects  on  the  model’s  parameter  estimates;  this  is 
yet  another  strong  feature  of  regression  analysis. 
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Regulation 


Regulation  is  a  key  component  of  the  systems  that 
make  up  our  political,  social,  and  economic  lives. 


It  is  also  one  of  the  least  visible  and  transparent 
components  that  shape  our  governance  systems. 
To  regulate  is  to  legislate  in  public  and  private 
organizational  arenas,  that  is,  under  indirect  and 
often  ex  post  judicial  and  parliamentary  controls. 
In  democratic  and  pluralist  societies,  all  aim  to 
regulate  all,  and  thus,  all  contribute  to  “regulatory 
creep.”  The  amalgamation  of  these  regulations 
defines  our  liberties,  privileges,  opportunities,  and 
life  plans  in  ways  that  challenge  our  conventions 
about  politics  and  about  the  locus  of  democratic 
controls  and  accountabilities.  In  addition  to  defin¬ 
ing  the  concept  and  the  phenomenon,  this  entry 
discusses  regulation  as  a  key  activity  of  administra¬ 
tive  agencies,  regulation  beyond  the  state,  the  gov¬ 
ernance  of  regulation,  and  the  phenomena  of  both 
the  regulatory  state  and  regulatory  capitalism. 

Meaning  and  Empirical  Content 

Like  many  other  political  concepts,  regulation  is 
hard  to  define,  not  the  least  because  it  means  dif¬ 
ferent  things  to  different  people.  The  term  is 
employed  for  myriad  discursive,  theoretical,  and 
analytical  purposes  that  cry  out  for  clarification.  It 
is  also  a  highly  contested  term.  For  the  Far  Right, 
regulation  is  a  dirty  word  representing  the  heavy 
hand  of  authoritarian  governments  and  the  creep¬ 
ing  body  of  rules  that  constrain  human  or  national 
liberties.  For  the  Old  Left,  it  is  part  of  the  super¬ 
structure  that  serves  the  interests  of  the  dominant 
class  and  frames  power  relations  in  seemingly  civi¬ 
lized  forms.  For  progressive  democrats,  it  is  a  pub¬ 
lic  good,  a  tool  to  control  profit-hungry  capitalists 
and  to  manage  social  and  ecological  risks.  For 
some,  regulation  is  something  that  is  done  exclu¬ 
sively  by  government,  a  matter  for  the  state  and 
law  enforcement,  while  for  others,  regulation  is 
mostly  the  work  of  social  actors  who  monitor 
other  actors,  including  governments.  State-centered 
conceptions  of  regulation  define  it  with  reference 
to  state-made  laws,  while  society-centered  analysts 
and  scholars  of  globalization  tend  to  point  to  the 
proliferation  of  various  forms  of  civil  and  busi- 
ness-to-business  regulation. 

For  legal  scholars,  regulation  is  often  a  legal 
instrument,  while  for  sociologists  and  criminolo¬ 
gists,  it  is  yet  another  form  of  social  control.  For 
some,  it  is  the  amalgamation  of  all  types  of  laws — 
primary,  secondary,  and  tertiary  legislation — while 
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for  others  it  is  confined  to  secondary  legislation. 
For  economists  of  the  Chicago  school,  regulation 
is  usually  a  strategic  tool  used  by  private  and  spe¬ 
cial  interests  to  exploit  the  majority.  For  institu¬ 
tional  economists,  regulation  is  less  a  strategic  tool 
and  more  a  constitutive  element  of  the  market;  for 
them,  it  is  often  understood  as  the  mechanism  that 
constitutes  property  rights  or  even  as  a  source  of 
national  competitiveness.  The  French  Regulation 
School  seems  to  have  developed  a  similar  institu¬ 
tional  perspective  but  with  a  more  critical  tone  and 
without  the  functionalist  orientation  of  some 
English-speaking  economists  (Robert  Boyer,  1990). 
Whereas  scholars  of  public  administration  seem  to 
perceive  it  with  direct  and  in-depth  reference  to  the 
scope  of  state  authority  and  formal  regulatory 
organizations,  scholars  of  global  governance  tend 
to  focus  on  standards  and  soft  (nonbinding) 
norms.  While  some  seem  to  think  of  the  rise  of 
regulation  as  yet  another  indication  of  the  advance 
of  neoliberalism  and  the  retreat  of  the  welfare 
state,  others  tend  to  see  it  at  as  a  neo-mercantilist 
instrument  for  market  expansion,  high  modern¬ 
ism,  and  social  engineering.  In  European  parlance, 
for  most  of  the  20th  century,  regulation  was  syn¬ 
onymous  with  government  intervention  and, 
indeed,  with  all  the  efforts  of  the  state,  by  what¬ 
ever  means,  to  control  and  guide  the  economy  and 
society.  This  rather  broad  meaning  of  the  term 
seems  to  have  faded,  and  scholars  now  make  an 
effort  to  distinguish  rule  making  from  other  tools 
of  governance  and,  indeed,  from  other  types  of 
policy  instruments,  such  as  taxation,  subsidies, 
redistribution,  and  public  ownership. 

Regulation  not  only  is  a  distinct  type  of  policy 
but  also  entails  identifiable  forms  and  patterns  of 
political  conflict  that  differ  from  the  patterns  that 
are  regularly  associated  with  redistribution  and 
distribution.  In  addition,  while  other  types  of  policy 
such  as  distribution  and  redistribution  are  about 
relatively  visible  transfers  and  direct  allocation  of 
resources,  regulation  only  indirectly  shapes  the  dis¬ 
tribution  of  costs  in  society.  Government  budgets 
include  (relatively)  visible  and  clear  estimations  of 
the  overall  costs  of  distribution  and  redistribution 
but  hardly  any  of  the  cost  of  regulation  (with  the 
exception  of  the  administrative  costs  of  fact  find¬ 
ing,  monitoring,  and  implementation).  The  most 
significant  costs  of  regulation  are  compliance  costs, 
which  are  borne  not  by  the  government  budget  but 


mostly  by  the  regulated  parties.  The  wide  distribu¬ 
tion  of  these  costs  and  their  embeddedness  in  the 
regulatees’  budgets  make  their  impact,  effects,  and 
net  benefits  less  visible  and  therefore  less  transpar¬ 
ent  to  the  attentive  public. 

For  some,  regulation  is  a  risky  business  that  is 
prone  to  failure,  but  for  others,  the  business  of 
regulation  is  the  business  of  risk  minimization. 
Some  contend  that  regulation  comprises  mostly 
rule  making,  while  others  extend  it  to  include  rule 
monitoring  and  rule  enforcement.  For  some,  regu¬ 
lations  are  about  the  rules  and  functions  of  the 
administrative  agency  after  the  act  of  delegation; 
for  others,  as  already  observed,  regulation  includes 
every  kind  of  rule,  including  primary  legislation 
and  even  social  and  professional  norms.  The 
extensive  literature  on  regulation  in  the  United 
States  and  the  extensive  attention  paid  to  regula¬ 
tion  could  have  resulted  in  a  consensual  definition 
of  regulation.  Yet  this  is  not  the  case.  The 
American  Administrative  Procedure  Act  defines 
the  term  rule  but  not  the  term  regulation,  and 
what  it  defines  as  rule  is  confined  to  the  scope  of 
the  act  itself. 

Regulation  and  Administrative  Agencies 

One  important  aspect  of  any  discussion  of  the  dif¬ 
ferent  connotations  and  characteristics  of  regula¬ 
tion  is  the  relation  between  regulation  and  the 
existence  of  an  administrative  agency.  Rule  making 
and  rule-making  agencies  are  closely  connected. 
An  emphasis  on  the  workings,  characteristics,  fail¬ 
ures,  and  merits  of  regulation  by  administrative 
agencies  is  prevalent  in  the  literature  on  regulation. 
Indeed,  these  aspects  are  expressed  in  one  of  the 
most  widely  cited  definitions  of  regulation,  namely, 
as  “sustained  and  focused  control  exercised  by  a 
public  agency  over  activities  that  are  valued  by  the 
community”  (Selznick,  1985,  p.  363).  Not  only 
does  this  definition  include  an  explicit  reference  to 
public  agencies,  but  it  also  stresses  the  sustained 
and  focused  nature  of  regulation,  which  requires  a 
continuous  action  of  monitoring,  assessment,  and 
refinement  of  rules  rather  than  an  ad  hoc  opera¬ 
tion.  Implicit  in  this  definition  is  also  the  expecta¬ 
tion  that  ex  ante  rules  will  be  the  dominant  form 
of  regulatory  control.  The  definition  is  apt  also  in 
the  sense  that  it  recognizes  that  many,  perhaps  the 
most  important,  regulations  are  exercised  not  by 
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“regulatory  agencies”  but  by  a  wide  variety  of 
executive  organs.  This  definition  is  less  successful, 
however,  in  other  respects.  It  recognizes  regulation 
only  as  a  public  activity  by  a  public  agency  and 
thus  excludes  business-to-business  regulation  as 
well  as  civil  regulation.  It  also  does  not  clarify 
which  kinds  of  focused  control  the  public  agency 
applies  (is  it  rule  making  only  or  also  other  forms 
of  control  such  as  arbitrary  commands?);  and  the 
definition  unnecessarily  limits  regulation  to  those 
actions  that  are  valued  by  the  community. 

Regulation  Beyond  the  State 

The  focus  on  administrative  aspects  in  the  study 
of  regulation  might  be  less  useful  for  scholars 
who  emphasize  the  limits  of  “hard  law”  and  who 
are  aware  of  the  importance  of  social  norms  and 
other  forms  of  “soft  law”  in  the  governance  of 
societies  and  economies.  A  wider  definition  of 
regulation  that  captures  regulation  as  soft  law 
would  suggest  that  regulation  encompasses  all  the 
mechanisms  of  social  control,  including  uninten¬ 
tional  and  nonstate  processes.  Indeed,  it  extends 
to  “mechanisms  which  are  not  the  products  of 
state  activity,  nor  part  of  any  institutional  arrange¬ 
ment,  such  as  the  development  of  social  norms 
and  the  effects  of  markets  in  modifying  behavior” 
(Robert  Baldwin,  Colin  Scott,  &  Christopher 
Hood,  1998,  p.  4).  Thus,  the  notion  of  intention- 
ality  in  the  development  of  norms  has  been 
dropped  from  this  definition  of  regulation,  and 
anything  producing  effects  on  behavior  may  be 
considered  regulatory.  In  addition,  a  wide  range 
of  activities  that  may  involve  legal  or  quasi-legal 
norms,  but  without  the  mechanisms  for  monitor¬ 
ing  and  enforcement,  might  also  come  within  the 
definition.  This  definition  connects  widely  with 
the  research  agenda  on  governance,  “the  new 
governance”  and  the  “new  regulatory  state,” 
where  elements  of  steering  and  plural  forms  of 
regulation  are  emphasized  in  an  effort  to  capture 
the  plurality  of  interests  and  sources  of  control 
around  issues,  problems,  and  institutions.  This 
rather  wide  definition  of  regulation  also  allows  us 
to  “de-center”  regulation  from  the  state  and  even 
from  well-recognized  forms  of  self-regulation. 
De-centered  approaches  to  regulation  emphasize 
complexity,  fragmentation,  interdependencies, 
and  government  failures  and  suggest  the  limits  of 


the  distinctions  between  the  public  and  the  pri¬ 
vate  and  between  the  global  and  the  national. 

Scholars  of  regulatory  systems  often  point  to  the 
growth  in  the  number  of  civil  and  business  actors 
that  invest  in  regulation  and  accordingly  also  to  the 
growth  of  civil  and  business-to-business  regulatory 
institutions  and  instruments.  At  the  same  time, 
there  are  indications  of  the  transformation  of  the 
politics  of  interest  groups  and  nongovernmental 
organizations.  Civil  actors  are  often  associated  with 
advocacy  (e.g.,  lobbying)  and  service  provision 
(e.g.,  replacing  the  state  in  the  provision  of  wel¬ 
fare),  but  in  our  areas  of  study  they  also  produce, 
monitor,  and  enforce  regulation.  The  concept  of 
civil  regulation  aims  to  capture  this  evolving  feature 
of  civil  politics.  The  term,  as  noted  by  David  Vogel 
(2005),  refers  to  the  institutionalization  of  volun¬ 
tary  global  and  national  forms  of  regulation  through 
the  creation  of  private  (nonstate)  forms  of  regula¬ 
tion  intended  to  govern  markets  and  firms.  Civil 
regulations  attempt  to  embed  international  markets 
and  firms  in  a  normative  order  that  prescribes 
responsible  business  conduct.  They  include  old  and 
traditional  forms  of  self-regulation  but  go  beyond 
them  to  include  regulatory  techniques  such  as  third- 
party  accreditation  and  certification,  gatekeeping 
strategies,  metaregulation,  enforced  self-regulation, 
self-regulation,  and  league  tables.  Business-to- 
business  regulation  is  another  form  of  a  nonstate 
source  of  regulation.  Here,  the  growth  of  regulation 
is  driven  by  the  ability  of  some  businesses  (most 
often  big  business)  to  set  standards  for  other  busi¬ 
nesses  (most  often  smaller  ones).  One  relevant 
example  is  the  ability  of  big  supermarket  chains  to 
set  standards  for  food  manufacturing,  processing, 
and  marketing  all  over  the  world. 

The  Governance  of  Regulation 

Much  of  the  academic  and  public  discussion  of 
regulation  nowadays  deals  with  the  governance  of 
regulation  itself  (or  regulating  regulation)  rather 
than  governance  via  regulation.  The  expansion  of 
regulation  represents  two  major  challenges:  of 
effectiveness  and  of  democratic  control. 

Effectiveness 

The  first  challenge  focuses  on  the  alleged  weak¬ 
ness  of  command-and-control  systems  with 


Regulation  2257 


prescriptive  rules  that  tell  regulated  entities  what 
to  do  and  how  to  do  it.  These  prescriptive  rules 
tend  to  be  highly  particular  in  specifying  the 
required  actions  and  standards  and  are  usually 
backed  up  by  state  sanctions.  Peter  J.  May  (2007, 
p.  9)  observes  that  when  compared  with  the  alter¬ 
natives,  principles  and  norms,  regulations  are 
characterized  by  clear-cut  lines  of  responsibility 
and,  thus,  accountability.  Yet  clarity,  the  ability  to 
sanction,  and  direct  lines  of  accountability  all 
come  at  a  price.  Six  shortcomings  of  regulation  are 
important  to  note: 

1.  expensive  and  ineffective  regulatory  strategies; 

2.  inflexible  regulatory  strategies  that  encourage 
adversarial  enforcement; 

3.  legal  constraints  on  the  subjects,  procedures, 
and  scope  of  regulatory  discretion; 

4.  regulatees’  resentment,  which  leads  to 
noncompliance  or  “creative  compliance”; 

5.  strict  regulation,  which  often  presents  an 
obstacle  to  innovation;  and 

6.  regulation  that  often  serves  to  set  a  lowest 
common  denominator  for  regulatees  to  follow 
rather  than  supplying  incentives  for  improving 
standards. 

According  to  Neil  Gunningham  and  Peter 
Grabosky  (1998),  and  Steven  P.  Croley  (2008), 
there  are  five  major  strategies  of  response  to  these 
weaknesses.  The  first  and  the  most  controversial  is 
to  deregulate  existing  regulation  and  to  constrain 
new  rule  making.  These  “ossification”  strategies 
might  result  in  a  race  to  the  bottom,  or  degrada¬ 
tion  of  economic  and  environmental  performances, 
unmitigated  risk,  and  immoral  economies  and 
societies.  The  second  is  to  turn  to  “lite”  and  man¬ 
agement-based  regulation  and  to  harness  economic 
incentives  as  much  as  possible  toward  politically 
determined  public  goods.  The  third  is  to  promote 
responsive  forms  of  regulation.  The  fourth  is  to 
improve  the  regulatory  arsenal  (e.g.,  by  employing 
auctions  and  using  benchmarking)  as  well  as  the 
quality  and  training  of  the  regulators  and  the  quality 
of  the  regulatory  design.  The  fifth  is  to  institution¬ 
alize  regulatory  impact  analysis  and  cost-benefit 
techniques.  These  control  measures  are  becoming 
increasingly  popular,  and  some  countries  have 


even  established  regulatory  agencies  to  regulate 
regulation  itself  (e.g.,  the  British  Better  Regulation 
Executive,  and  the  Office  of  Information  and 
Regulatory  Affairs  in  the  United  States). 

Democratic  Control 

The  second  governance  challenge  that  regula¬ 
tion  represents  is  derived  from  the  democratic 
qualities  (or,  more  accurately,  weaknesses)  of  regu¬ 
lation.  The  belief  that  the  legislator  should  legis¬ 
late,  the  judiciary  should  adjudicate,  and  the 
executive  should  govern  via  the  bureaucracy  takes 
regulation  to  be,  at  best,  a  “necessary  evil.”  Yet 
this  necessary  evil  is  expanding  and  diversifying  to 
an  extent  that  raises  important  challenges  for 
democratic  theory  and  practice.  First,  regulators 
are  not  elected  and  are  accountable  to  the  people 
only  indirectly,  hence  the  arguments  about  the 
democratic  deficit  of  regulatory  systems.  Second, 
while  it  is  a  fundamental  idea  of  law  that  people 
should  be  subject  to  fixed,  known,  and  certain 
rules,  the  sheer  numbers  of  rules  and  the  frequency 
and  the  process  with  which  they  are  changed  cre¬ 
ate  a  situation  where  it  is  beyond  the  capacity  of 
most,  if  not  all,  individuals  to  act  without  legal 
advice.  The  sheer  volume  of  regulations  represents 
a  challenge  for  democratic,  judicial,  parliamentary, 
and  administrative  systems  of  control.  Third,  the 
growth  of  international  administrative  law  makes 
supposedly  sovereign  polities  into  rule  takers 
rather  than  rule  makers.  Regionalization,  interna¬ 
tionalization,  and  globalization  of  regulation  all 
raise  issues  of  legitimacy  and  may  lead  to  new  and 
innovative  forms  of  democratic  control  over  regu¬ 
latory  systems.  Fourth  is  the  emergence  of  “private 
regulatory  regimes”  and  “private  governments,” 
which  may  weaken  democratic  legitimacy  and  may 
change  the  balance  of  power  between  corporations 
and  states.  To  deal  with  these  democratic  chal¬ 
lenges,  it  is  necessary  to  develop  and  strengthen 
three  systems  of  control  over  bureaucratic  legisla¬ 
tion:  parliamentary,  judicial,  and  participatory. 

The  Regulatory  Agency 

and  the  Regulatory  State 

One  of  the  most  important  indicators  of  the 
growth  in  the  scope  and  depth  of  regulatory 
activities  in  modern  society  is  the  proliferation  of 
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regulatory  agencies  as  the  administrative  and 
intellectual  core  of  national  and  global  systems 
of  regulatory  governance.  Regulatory  agencies 
are  not  a  new  feature  of  modern  systems  of  gov¬ 
ernance,  but  they  have  become  a  highly  popular 
form  of  regulatory  governance  since  the  1990s. 
A  regulatory  agency  is  a  nondepartmental  public 
organization  mainly  involved  with  rule  making, 
but  it  may  also  be  responsible  for  fact  finding, 
monitoring,  adjudication,  and  enforcement.  It  is 
autonomous  in  the  sense  that  it  can  shape  its 
own  preferences;  of  course,  the  extent  of  the 
autonomy  varies  with  its  administrative  capaci¬ 
ties,  its  ability  to  shape  preferences  indepen¬ 
dently,  and  its  ability  to  enforce  its  rules.  The 
autonomy  of  an  agency  is  also  constituted  by  its 
establishment  as  a  separate  organization  and  by 
the  allocation  to  this  agency  of  a  policy  space 
where  it  is  expected  to  operate  by  way  of  exercis¬ 
ing  its  functions  and  responsibilities.  Note  that 
while  rule-making,  fact-finding,  monitoring, 
adjudication,  and  enforcement  capacities  are  the 
defining  characteristics  of  regulatory  agencies, 
other  organizations,  both  within  and  outside  the 
state,  can  also  acquire  and  successfully  deploy 
these  characteristics. 

The  expansion  in  the  number  of  regulatory  agen¬ 
cies,  and  arguably  also  in  the  scope  of  regulation 
and  in  the  policy  capacities  of  these  agencies,  has 
been  manifest  since  the  1990s  in  the  popularity  of 
the  notion  of  the  regulatory  state.  In  its  most 
straightforward  form,  the  term  regulatory  state 
“suggests  [that]  modern  states  are  placing  more 
emphasis  on  the  use  of  authority,  rules  and  stan¬ 
dard-setting,  partially  displacing  an  earlier  emphasis 
on  public  ownership,  public  subsidies,  and  directly 
provided  services”  (Hood  et  ah,  1999,  p.  3). 

As  noted  by  David  Levi-Faur  and  Sharon  Gilad 
(2004),  three  elements  are  especially  useful  in 
characterizing  the  regulatory  state.  First,  the 
bureaucratic  functions  of  regulation  are  being 
separated  from  service  delivery.  Second,  the  regu¬ 
latory  functions  of  government  are  being  sepa¬ 
rated  from  policy-making  functions,  and  thus,  the 
regulators  are  being  placed  at  arm’s  length  from 
their  political  masters.  In  this  way,  regulatory 
agencies  became  the  citadels  that  fortify  the 
autonomous  and  influential  role  of  the  regulocrats 
in  the  policy  process.  We  are  witnessing  the 
strengthening  of  the  regulators  at  the  expense  of 


politicians  on  the  one  hand  and  of  the  managerial 
elite  on  the  other.  Third,  and  as  a  result  of  the  first 
two  elements,  regulation  and  rule  making  emerge 
as  a  distinct  stage  in  the  policy-making  process. 
Accordingly,  regulation  is  emerging  as  a  distinct 
profession  and  administrative  entity.  Professional 
affiliation  to  global  networks  of  experts  is  becom¬ 
ing  a  major  source  of  innovations,  worldviews, 
accountability,  and  legitimacy. 

From  the  Regulatory  State 
to  Regulatory  Capitalism 

The  term  regulatory  capitalism  denotes  the  growth 
in  the  scope,  importance,  and  impact  of  regulation 
at  the  national  and  global  levels  and  the  growing 
investments  of  political  actors  in  regulation  in  gen¬ 
eral  and  regulatory  strategies  in  particular.  It  sug¬ 
gests  that  regulation  and  rule  making  are  the  major 
instruments  in  the  expansion  of  global  governance. 
The  notion  of  regulatory  capitalism  takes  regula¬ 
tion  theory  analysis  beyond  national  boundaries 
(hence  beyond  the  notion  of  the  regulatory  state) 
and  beyond  formal,  state-centered  rule  making 
(therefore  toward  civil  regulation  and  de-centered 
analysis  of  regulatory  systems).  It  also  denotes  a 
world  where  regulation  is  increasingly  a  hybrid  of 
different  systems  of  control:  Statist  regulation 
coevolves  with  civil  regulation;  national  regulation 
expands  with  international  and  global  regulation; 
private  regulation  coevolves  and  expands  with 
public  regulation;  voluntary  regulations  expand 
with  coercive  ones;  and  the  market  itself  is  used  or 
mobilized  as  a  regulatory  mechanism. 

David  Levi-Faur 
Hebrew  University  of  Jerusalem 
Jerusalem,  Israel 

See  also  Policy  Instruments;  Self-Regulation 
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Any  investigation  into  the  interdependencies 
between  religion  and  politics  first  of  all  calls  for  a 
definition  of  the  subject  matter  and  hence  for  a 
definition  of  religion.  However,  such  a  definition 
faces  considerable  difficulties  because  religion 
historically  is  a  universal  phenomenon  and 
encompasses  today  an  enormous  variety  of  expe¬ 
riences,  convictions,  and  practices  the  religious 
character  of  which  is  far  from  being  unequivocal. 
The  distinction  between  religious  phenomena  and 
other  social  phenomena  is  found  in  religious  soci¬ 
ology  and  anthropology,  mostly  with  regard  to 
specific  contents  or  functions.  As  to  content,  the 
experience  of  or  interaction  with  superhuman, 
preternatural,  or  transcendental  creatures,  pow¬ 
ers,  forces,  or  orders  often  is  defined  as  a  univer¬ 
sal  characteristic  of  religion.  Functionalist 
approaches  use  functions  such  as  the  individual 


coping  with  contingency  or  a  collective  founda¬ 
tion  of  identity.  Content  criteria,  however,  often 
prove  to  be  too  exclusive  and  functionalist  defini¬ 
tions,  too  inclusive.  The  attempt  to  determine 
criteria  for  the  identification  of  religion  as  a 
unique  area  of  activity  clearly  to  be  differentiated 
from  other  societal  or  cultural  spheres  of  action  is 
also  highly  selective  from  an  intercultural  point  of 
view,  because  it  refers  to  a  large  extent  to  Western 
Christianity,  with  its  characteristic  notion  of  a 
differentiation  between  the  worldly  and  spiritual 
spheres.  Nevertheless,  the  category  “religion”  has 
established  itself  as  a  global  concept  of  reference 
even  for  non-Western  religious  traditions,  and  it 
has  admittedly  been  rephrased  and  revised  in  the 
course  of  its  global  reception.  Because  of  this 
global,  pluralistic,  and  contextualized  debate 
about  the  term  religion ,  James  A.  Beckford  rec¬ 
ommends  a  constructivist  approach  to  its  defini¬ 
tion.  Constructivist  approaches  assume  that  the 
concept  of  religion,  like  other  concepts,  is  the 
result  of  conflictual  definition  processes  and 
advocate  following  this  everyday  practice  when 
defining  social  science  terms.  This  entry  sketches 
the  role  of  contemporary  religion  in  politics  and 
discusses  some  issues  of  current  research  and 
their  different  perspectives. 

Religious  traditions  and  religious  actors  have 
been  highly  relevant  in  politics  and  hence  also  in 
political  science.  After  all,  religious  traditions  have 
at  their  disposal  specific  notions  about  the  consti¬ 
tution  of  the  self  and  the  world  as  well  as  about 
proper  conduct  in  one’s  individual  and  collective 
life.  These  concepts  and  moral  guidelines  have  also 
left  their  mark  on  the  political  orientations  of  indi¬ 
vidual  and  collective  religious  actors.  However, 
not  only  do  religious  politics  exercise  an  influence 
on  the  constitution  of  the  political  community,  but 
religious  actors  and  traditions,  symbols,  and  prac¬ 
tices  in  nearly  all  historical  and  contemporary 
societies  have  also  been  instruments  and  objects  of 
political  action  and  control.  Often,  the  specific 
dynamics  of  the  religious-political  complex  result 
from  the  interaction  of  religious  politics  and  the 
politics  of  religion.  The  interactions  between  reli¬ 
gion  and  politics  are  currently  of  particular  interest 
because  since  the  end  of  the  1970s,  a  (revitaliza¬ 
tion  of  religious  traditions  and  the  politicization  of 
religious  actors  can  be  observed  in  many  parts  of 
the  world. 
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The  current  practice  of  researching  the  interac¬ 
tive  relationship  between  politics  and  religion  is 
also  due  to  its  rediscovery  by  political  science.  At 
least  since  the  1960s,  religion  was  regarded  in  large 
parts  of  the  discipline  as  a  social  complex  whose 
historical  destiny  had  been  settled.  The  reason  for 
this  was  the  underlying  conviction  of  secularization 
theory  that,  in  the  course  of  processes  of  modern¬ 
ization  and  functional  differentiation,  religious  tra¬ 
ditions  had  lost  many  of  their  former  societal  and 
political  functions  as  well  as  their  attractiveness  and 
persuasiveness.  They  would,  therefore,  be  relin¬ 
quished  to  the  private  sphere  if  not  disappear 
altogether.  Even  the  normative  issues  about  the  rela¬ 
tionship  between  politics  and  religion  seemed  to  be 
settled.  Many  traditions  of  political  theory,  such  as 
liberalism  and  republicanism,  agreed  on  the  basic 
tenet  that  religion  had  to  be  limited  to  the  private 
sphere  because  neither  was  it  suitable  for  forming  a 
universally  acceptable  normative  basis  for  demo¬ 
cratic  political  communities  nor  was  it  able  to 
supply  arguments  or  justifications  for  political  deci¬ 
sions  that  could  be  universally  agreed  on.  Moreover, 
large  parts  of  the  social  sciences  were  shaped  by  a 
“methodological  atheism,”  according  to  which 
experiences  with  or  interactions  with  supernatural 
powers  are  illusionary.  This  methodological  athe¬ 
ism  led  to  reductionist  patterns  of  explanation  that 
regarded  and  reconstructed  religion  as  a  mere  epi- 
phenomenon  of  social  or  political  processes.  Based 
on  these  underlying  convictions,  religion  was  found 
interesting  at  best  in  the  sense  of  a  relic  of  religious 
influence  on  politics,  which  could  be  found  in 
political  cleavages,  voting  behavior,  party  systems, 
or  types  of  welfare  states. 

The  relative  neglect  of  research  into  the  interac¬ 
tion  of  religion  and  politics  started  to  change  with 
the  globally  observable  (re)vitalization  of  religious 
traditions  and  the  mobilization  and  politicization 
of  religious  actors  since  the  late  1970s. 

(Re)Vitalization,  Mobilization, 
and  Politicization 

Phenomena  of  (re)vitalization  have  become  dis¬ 
cernible  in  nearly  all  religious  traditions.  The  num¬ 
ber  of  followers  of  Islam  as  well  as  of  Catholicism 
and  Protestantism  outside  the  Western  Hemisphere 
has  increased  considerably.  Even  the  Orthodox 
Church  in  Eastern  Europe  has  seen  an  unexpected 


renaissance  following  the  breakdown  of  the  com¬ 
munist  regimes.  Moreover,  new  religious  move¬ 
ments  are  forming  in  many  parts  of  the  world, 
especially  in  Asia.  Phenomena  of  politicization  also 
can  be  found  in  nearly  all  religious  traditions.  First 
and  foremost  among  them  is  the  formation  and 
political  mobilization  of  fundamentalist  or  conser¬ 
vative  religious  movements.  Since  the  attacks  of 
September  11,  2001,  the  phenomenon  of  religiously 
motivated  terrorism  has  increasingly  entered  the 
public  consciousness.  Also,  since  the  1990s,  the 
number  of  religiously  motivated  or  influenced  vio¬ 
lent  conflicts  and  civil  wars  has  increased. 

Particularly  in  the  Western  world,  with  its  increas¬ 
ing  religious  pluralism,  the  traditional  institutional 
arrangements  for  the  relationship  between  state  and 
church,  politics  and  religion,  have  become  the  cause 
for  and  object  of  political  debate  and  the  mobiliza¬ 
tion  of  religious  actors.  But  religious  actors,  organi¬ 
zations,  and  movements  have  also  played  an  impor¬ 
tant,  even  decisive,  role  in  the  processes  of  demo¬ 
cratic  transition.  All  over  the  world,  an  increasing 
political  activity  of  Christian  churches  and  organi¬ 
zations  can  be  registered.  This  increasing  presence 
of  religious  actors  in  the  public  sphere  is  also  an 
effect  of  the  transformation  of  politics  itself.  In  the 
course  of  the  comprehensive  politicization  of  cur¬ 
rent  societies,  almost  no  societal  situation  is  exempt 
from  becoming  the  object  of  political  decisions  and 
regulatory  actions.  Due  to  this  fading  distinction 
between  the  public  and  the  private  sphere,  the  for¬ 
merly  guaranteed  areas  of  a  self-determined  reli¬ 
gious  way  of  life  are  further  reduced.  Another  case 
in  point  is  that  due  to  cultural  pluralization  and 
technological  progress,  more  and  more  value- 
oriented  conflictual  issues,  such  as  the  relationship 
between  the  sexes,  sexual  identity,  and  biomedicine, 
have  been  put  on  the  political  agenda.  These  are 
questions  touching  the  religious  beliefs  of  many  citi¬ 
zens.  Finally,  globalization  leads  not  only  to  the 
internationalization  and  transnationalization  of  reli¬ 
gious  actors  but  also  to  an  accelerated  diffusion  of 
political  activity  patterns  and  problem  perceptions. 

Issues  of  Current  Research 

The  first  reaction  by  political  science  to  the  (re) 
vitalization  and  politicization  of  religion  consisted 
in  the  research  of  religious  fundamentalism. 
Similarly,  in  political  theory,  a  new  debate  began 
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about  the  principles  of  determining  the  relationship 
between  religion  and  politics.  Now  as  in  former 
times,  the  classic  liberal  and  republican  positions  of 
a  strict  separation  between  religion  and  politics  and 
the  referral  of  religion  to  the  private  sphere  are 
being  defended.  Alternative  approaches  do  uphold 
a  minimum  necessary  mutual  autonomy  of  politics 
and  religion,  but  with  regard  to  religious  and  cul¬ 
tural  pluralism,  they  no  longer  consider  the  exclu¬ 
sion  of  religious  points  of  view  from  politics  as 
mandatory.  At  the  same  time,  they  grant  politics 
considerable  leeway  in  determining  the  relation¬ 
ship  between  religion  and  politics.  This  debate  has 
also  led  to  an  increase  in  comparative  empirical 
research  into  the  genesis,  mechanisms,  and  effects 
of  different  models  for  representing  the  relation¬ 
ship  between  religion  and  politics. 

Another  important  focus  of  political  science 
research  into  the  interrelationship  of  religion  and 
politics  concerns  the  compatibility  of  democracy 
with  specific  religious  traditions,  especially  Islam. 
The  starting  point  of  this  debate  is  the  finding  that 
the  majority  of  current  democracies  have  a  pre¬ 
dominantly  Christian  population,  whereas  the 
majority  of  Islamic  countries  have  nondemocratic 
constitutions.  One  explanation  of  this  finding  is 
that  only  in  Latin  Christendom  has  there  been  the 
concept  of  the  fundamental  distinction  between 
the  worldly  and  the  divine.  Critics  of  this  argument 
refer  to  the  variety  and  changeability  of  religious 
traditions,  citing  among  others  the  example  of  the 
Catholic  Church,  which  only  after  the  Second 
Vatican  Council  changed  into  an  important  pro¬ 
moter  of  democracy.  The  nondemocratic  constitu¬ 
tion  of  many  Islamic  countries  is  ascribed  instead 
to  specific  historical,  regional,  or  cultural  factors. 
However,  there  are  still  considerable  deficits  of 
research,  for  example,  regarding  the  instruments 
and  strategies,  and  also  the  opportunities  and 
limitations,  of  religious  actors  exerting  a  political 
influence  at  the  national  and  international  levels. 
This  concerns,  for  example,  research  into  the  reli¬ 
gious  conditioning  of  welfare  states  or  on  the 
effects  of  religious  influences  on  public  policies. 

Perspectives  for  Future  Research 

Research  into  the  interaction  of  politics  and  reli¬ 
gion  is  a  current  trend.  The  dynamics  of  the 
religious-political  complex  will — together  with 


the  competitively  organized  national  and  interna¬ 
tional  research  in  political  science — open  up  new 
issues  and  debates.  It  can  be  expected  that  existing 
gaps  in  this  research  will  soon  be  filled.  But  what 
will  also  be  necessary  is  a  stronger  reflection  on  the 
term  and  the  phenomenon  of  religion  as  well  as  on 
the  discipline’s  underlying  convictions  about  reli¬ 
gion  and  its  interrelationship  with  politics.  The 
debate  about  the  concept  of  religion  is  irrefutable 
for  two  reasons.  On  the  one  hand,  the  discipline  is 
also  conducting  research  into  phenomena  such  as 
civil  religion  and  political  religion,  which  share  a 
number  of  characteristics  with  religions  without 
being  religions  in  the  classic  sense  of  the  word.  On 
the  other  hand,  the  religious  field  is  changing  due 
to  the  formation  of  new  individual  and  syncretistic 
forms  of  religiosity  and  spiritualism.  Reflecting  on 
the  underlying  convictions  is  necessary  because  the 
latter  predetermine  not  only  the  selection  of  issues 
but  also  the  interpretation  of  findings.  That  is 
especially  true  for  assumptions  about  the  capabil¬ 
ity  of  the  religious  field  for  transformation,  change, 
and  adaptation;  for  the  concept  of  religion  as  a 
(latent)  threat  to  the  peaceful  integration  of  societ¬ 
ies;  and  also  for  methodological  atheism  and  the 
reductionist  strategies  of  explanation  usually  asso¬ 
ciated  with  it. 

Ulrich  Willems 
University  of  Minister 
Minister,  Germany 
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Religiosity 


Religiosity  refers  to  the  strength  and  scope  of  reli¬ 
gious  activity  among  faith-based  communities. 
Although  “religion”  and  “religiosity”  are  often 
used  synonymously,  there  is  a  considerable  differ¬ 
ence  between  these  two  terms.  Whereas  religion 
sociologically  denotes  beliefs  and  practices  in  rela¬ 
tion  to  the  sacred,  religiosity  encompasses  a  vari¬ 
ety  of  representations  that  symbolize  religious 
involvement.  Within  the  religions  of  the  Abrahamic 
traditions,  for  example,  there  exist  monotheistic 
belief  systems  involving  a  multitude  of  practices 
that  adhere  to  the  teachings  of  literary  texts. 
However,  the  religiosity  of  these  traditions  would 
be  found  within  any  number  of  activities  includ¬ 
ing  patterns  of  religious  involvement,  forms  of 
religious  expression,  manifestations  of  religious 
structures,  and  the  like.  This  entry  discusses  some 
historical  and  contemporary  political  implica¬ 
tions  of  the  different  manifestations  of  religiosity. 

In  the  international  political  climate  of  the 
21st  century,  for  example,  religiosity  has  become 
a  crucial  feature  in  discourses  concerning  global 
security.  In  a  controversial  assessment  of  religios¬ 
ity’s  role  in  global  politics,  Samuel  P.  Huntington’s 
(1996)  book  The  Clash  of  Civilizations  and  the 
Remaking  of  World  Order  attributed  the  poten¬ 
tial  for  post-Cold  War  conflict  to  religious  and 
cultural  clashes  between  nations  and  differing 
civilizations.  Despite  Huntington’s  classification 
of  some  eight  civilizations  based  on  cultural 
lines,  these  civilizations  are  largely  bound  by  the 
demarcations  of  religiosity.  Given  this  type 
of  reasoning  and  categorization,  religiosity  rep¬ 
resents  a  critical  position  in  transnational 
conflicts. 

Notwithstanding  some  other  factors,  the  con¬ 
flicts  of  the  early  21st  century  generated  by  reli¬ 
gious  groups  suggest  that  Huntington’s  thesis  is 
supported  by  a  considerable  amount  of  evidence. 
Demonstrated  through  the  violent  incidents  of 
September  11,  2001,  the  19  hijackers  can  be  said 
to  have  been  defending  their  religiosity  through  a 
type  of  aggression  that  they  felt  would  stem  the 
spread  of  Western  values  and  ideologies.  Western 
encroachment  in  this  way  on  what  Huntington 
called  “Islamic  civilization”  may,  perhaps,  engen¬ 
der  the  greatest  amount  of  conflict. 


When  one’s  religiosity  becomes  systematically 
threatened,  contentions  appear  to  be  elevated  to 
another  level.  Mark  Juergensmeyer,  in  his  2000 
book  Terror  in  the  Mind  of  God,  coined  the  term 
cosmic  war  to  describe  a  situation  where  battles 
between  perceptions  of  good  and  evil  emerge.  In 
this  case,  a  person’s  or  group’s  religiosity  becomes 
the  basis  for  at  least  one  side  of  such  a  cosmic  con¬ 
flict.  For  example,  the  indifference  that  the  West 
often  displays  toward  the  contradictory  interplay 
between  modern  customs  and  traditional  sacred 
beliefs  and  practices  creates  an  atmosphere  where 
religious  expression  marks  a  quasi-ideological 
status.  In  this  way,  dealing  with  religiosity  in  a 
culturally  insensitive  manner  is  detrimental  to  the 
resolution  of  conflicts  of  this  magnitude. 

Given  the  importance  of  religiosity  in  global 
affairs,  measuring  its  attributes  is  particularly 
useful.  Religiosity  is  often  measured  in  terms  of 
practices  that  indicate  the  religiousness  of  a  popu¬ 
lation,  including  prayers,  worship,  service  atten¬ 
dance,  adherence  to  established  principles  of  faith, 
commitment  to  belief,  and  any  number  of  other 
activities  that  suggest  an  interaction  with  religious 
institutions.  Basic  examples  of  the  results  of  such 
measurements  may  yield  quantitative  data  such  as 
correlations  between  the  frequency  of  church 
attendance  and  physical  well-being  or  qualitative 
data  that  suggest,  for  example,  that  the  social 
mobility  and  economic  success  of  an  individual  is 
linked  to  his  or  her  commitment  to  a  particular  set 
of  religious  beliefs.  One  of  the  most  well-known 
examples  of  studies  pertaining  to  religiosity  is  Max 
Weber’s  ([1904/1905J  1998)  book  The  Protestant 
Ethic  and  the  Spirit  of  Capitalism.  Weber  focused 
on  an  insightful  finding  about  the  relationship 
between  religiosity  among  the  Calvinist  branch  of 
Protestantism  and  the  emergence  of  modern  capi¬ 
talism.  As  Weber  noted,  the  religious  doctrine  of 
predestination  led  to  an  ethic  of  hard  work  com¬ 
bined  with  an  innerworldly  asceticism  that  indi¬ 
rectly  resulted  in  an  early  accumulation  of  capital. 

In  one  of  the  most  widely  cited  pieces  of  litera¬ 
ture  on  the  topic  of  religiosity,  a  group  of  Utah- 
based  researchers  created  a  model  for  measuring 
religiosity  among  members  of  the  Church  of  Jesus 
Christ  of  Latter-day  Saints.  M.  Cornwall,  S.  L. 
Albrecht,  P.  H.  Cunningham,  and  B.  L.  Pitcher 
(1986)  used  the  components  of  religious  belief, 
religious  commitment,  and  religious  behavior, 
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combined  with  two  modes  of  personal  and  institu¬ 
tional  types  of  religious  practice,  to  construct  a 
cross-classification  for  measuring  religiosity.  This 
schema  generated  six  dimensions  of  religiosity: 
(1)  traditional  orthodoxy,  (2)  particularistic  ortho¬ 
doxy,  (3)  spiritual  commitment,  (4)  church  com¬ 
mitment,  (5)  religious  behavior,  and  (6)  religious 
participation.  Through  the  analysis  of  these  six 
dimensions,  the  categories  of  cognition,  affect,  and 
behavior  were  established  as  components  used  in 
measuring  religiosity. 

Such  a  multidimensional  approach  to  classify¬ 
ing  religiosity  suggests  similar  multidimensional 
approaches  to  the  more  general  study  of  religiosity. 
Rather  than  creating  standardized  models  for  all 
religious  activities,  much  of  the  literature  suggests 
a  movement  toward  case-by-case  assessments  of 
what  constitutes  religiosity  in  different  cultures, 
including  its  significance  and  meaning. 

Salvador  Jimenez  Murguia 
Miyazaki  International  College 
Miyazaki-gun,  Miyazaki,  Japan 
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A  religious  movement  is  an  organized  effort  that 
intends  to  bring  about  religious  reforms.  Many 
religious  movements  also  have  political  goals 
because  the  kinds  of  reforms  they  want  appear 


possible  only  by  altering,  sometimes  fundamen¬ 
tally,  political  and/or  social  contexts  and  arrange¬ 
ments  in  which  they  operate.  This  entry  discusses 
some  major  manifestations  of  such  movements  in 
contemporary  politics. 

In  recent  years,  social  scientists  have  consis¬ 
tently  noted  that  religion  can  influence  politics. 
Three  decades  ago,  the  Iranian  revolution  showed 
that  an  Islamic  religious  movement  could  over¬ 
throw  a  regime  once  seen  as  a  key  example  of  the 
modernizing  effects  of  secularization  in  the  Middle 
East.  More  recently,  the  rise  of  the  Christian  Right 
(CR)  in  the  United  States  demonstrated  how  reli¬ 
gious  movements  can  evolve  as  a  result  of  changing 
political  circumstances.  Further,  the  emergence  and 
consolidation  of  the  Bharatiya  Janata  Party  (BJP) 
in  India  was  made  possible  because  of  the  party’s 
foundations  in  Hindu  religious  movements. 
Similarly,  Jewish  fundamentalist  movements  have 
emerged  in  Israel  with  plenty  to  say  about  the 
Israel-Palestinian  issue. 

All  these  examples  point  to  the  inadequacies  of 
modernization  and  secularization  theories,  which 
predicted  that  the  importance  of  religion  in  politics 
would  decline.  Although  secularization  has  clearly 
occurred  in  many  countries,  especially  in  Western 
Europe,  this  has  taken  place  at  the  same  time  that 
religion  has  exhibited  sustained  presence — some 
would  call  it  a  resurgence — in  many  parts  of  the 
world.  Overall,  the  impact  of  religion  on  politics 
has  not  declined  but  instead  changed  in  complex 
ways,  including  the  political  involvement  of  some 
religious  movements. 

Until  recently,  such  issues  were  judged  to  be 
remote  from  central  political  questions.  The  expla¬ 
nation  for  this  neglect  lies  in  a  key  assumption 
embedded  in  the  social  sciences.  One  presupposi¬ 
tion,  especially  evident  in  theories  of  modernization 
and  political  development,  was  that  the  future  of 
the  integrated  nation-state  lay  in  secular  participa¬ 
tory  politics.  The  assumption  was  that  nation  build¬ 
ing  would  be  ill  served  by  perceived  “obscurantist” 
beliefs — such  as  religion.  Nearly  everywhere,  it 
seemed,  secular  political  leaders  dominated,  dis¬ 
placing  once  powerful  traditional  and  religious 
figures  from  prominence.  The  implication  was  that, 
to  successfully  build  nation-states,  political  leaders 
would  have  to  remain  as  neutral  as  possible  from 
the  entanglements  of  particularistic  claims,  espe¬ 
cially  those  derived  from  religion.  It  is  hoped  that 
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this  would  avoid  dogmatism  and  encourage  citi¬ 
zens’  tolerance,  a  crucial  prelude  to  building  viable 
nation-states  and  democracies.  Decades  of  appar¬ 
ently  unstoppable  movement  toward  increasingly 
secular  societies  in  many  parts  of  the  world  rein¬ 
forced  the  assumption  that  religion  and  piety 
would  inevitably  become  private  matters  every¬ 
where.  Consequently,  in  political  analysis,  religion 
was  often  regarded  as  an  increasingly  minor  prob¬ 
lem  of  little  or  no  significance  in  the  search  for 
national  unity  and  political  stability. 

Such  a  view  is,  however,  problematic.  If  it  were 
correct,  how  could  we  explain  and  account  for 
many  religious  movements’  current  political  sig¬ 
nificance?  Consider  again  the  examples  mentioned 
above: 

•  In  officially  secular  India,  the  Hindu  nationalist 
BJP  came  to  power  in  the  mid-1990s  after 
staging  a  10,000-kilometer  march  that  sought  to 
destroy  an  ancient  mosque  at  Ayodhya,  alleged 
to  have  been  built  on  the  remains  of  a  temple 
dedicated  to  Lord  Rama. 

•  In  the  United  States,  where  the  principle  of 
separation  of  church  and  state  is  incorporated  in 
the  nation’s  constitution,  the  organized  CR — a 
religious  movement  with  social  and  political 
goals — has  become  very  influential. 

•  The  continuing  conflict  between  Israel  and  Arab 
neighbors  encouraged  the  emergence  of  Jewish 
fundamentalist  movements  and  settlers  in 
occupied  territories  after  the  1967  war. 

•  The  Muslim  Brotherhood  (MB)  is  a  religious 
movement  that  emerged  in  Egypt  in  the  1920s, 
becoming  politically  and  socially  prominent  from 
the  1970s,  with  the  aim  of  establishing  an 
Islamic  state. 

These  cases  obviously  differ  greatly,  and  this 
entry  does  not  offer  a  comprehensive  theory  of 
politics  and  religious  movements.  Instead,  it  briefly 
examines  religious  movements  in  four  countries 
(India,  the  United  States,  Israel,  and  Egypt)  from 
four  religious  traditions:  Hinduism,  Christianity, 
Judaism,  and  Islam.  We  will  see  that  in  each  case, 
religious  movements  from  different  religious  tradi¬ 
tions  engage  significantly  and  consistently  with 
politics,  seeking  to  achieve  various  goals  that  relate 
to  the  nature  of  the  state  and  the  moral  and  ethical 
universe  that  informs  it. 


Hindu  Religious  Movements  in  India 

At  independence  in  1947,  the  victorious  Congress 
party  reluctantly  accepted  partition  (between  India 
and  East  and  West  Pakistan)  but  decisively  rejected 
the  ideology  of  Hindu  nationalism  (known  as 
Hindutva ).  From  the  1980s,  India  saw  tensions 
increase  between  Muslims  and  Hindus,  leading  to 
the  political  rise  of  the  BJP. 

In  officially  secular  India,  an  explosion  of  mili¬ 
tant  Hinduism  focused  on,  but  not  confined  to,  the 
incident  in  which  the  Babri  Masjid  mosque  at 
Ayodhya  was  demolished  by  a  Hindu  mob  in 
1992,  served  to  transform  India’s  political  land¬ 
scape.  Hindu  nationalists,  concentrated  in  the  BJP, 
achieved  significant  political  gains  in  the  1990s 
and  early  2000s.  They  were  reacting  against  the 
secular  visions  of  the  country’s  nationalist  found¬ 
ers,  Mahatma  Gandhi  and  Jawaharlal  Nehru. 
They  used  various  strategies  to  bring  about  change 
in  political  structures  and  processes.  What  was 
especially  striking  was  how  consistently  they  aimed 
at  political  targets  not  only  to  solve  religious  prob¬ 
lems  but  also  to  try  to  bring  about  the  consolida¬ 
tion  of  religious  identities  and  values. 

The  rise  of  the  politics  of  religious  identity  in 
India  underlines  a  central  problem:  How  can  reli¬ 
giously  plural  India  survive  the  creation  of  a  pow¬ 
erful  sense  of  identity  based  on  religion?  Of  the 
population  of  1  billion  Indians,  82%  are  Hindus, 
while  about  11%  are  Muslims,  2.5%  are  Christians, 
and  around  1.6%  are  Sikhs.  There  are  also  small 
numbers  of  Buddhists,  Parsis,  Jains,  and  followers 
of  tribal  religions.  This  religious  diversity  encour¬ 
aged  Indian  nationalists  to  pursue  a  development 
path  firmly  located  within  a  secular  sociopolitical 
and  cultural  milieu. 

Over  time,  the  issue  of  Hindu-Muslim  relations 
became  a  key  political  topic,  a  concern  emphasized 
by  the  rise  of  the  political  ideology  of  Hindutva 
(Hinduness).  Hindutva  is  a  Hindi  word,  a  neolo¬ 
gism  first  used  by  Vinayak  Damodar  Savarkar,  a 
politician  and  independence  movement  activist,  in 
his  1923  pamphlet  Hindutva:  Who  Is  a  Hindu ? 
The  term  now  generally  refers  to  Hindu  move¬ 
ments  that  advocate  Hindu  nationalism  in  India. 
The  leading  Hindu  nationalist  political  party,  the 
BJP,  which  dominated  Indian  politics  for  a  decade 
from  the  mid-1990s,  is  Hindutva’s  political  focal 
point.  Its  foundations  are  in  a  number  of  Hindu 
movements,  collectively  promoting  Hindutva, 
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known  as  the  Sangh  Parivar  (meaning  “family  of 
associations”).  Leading  organizations  within  the 
movement  include  the  Rashtriya  Swayamsevak 
Sangh,  the  Bajrang  Dal,  and  the  Vishwa  Hindu 
Parishad. 

The  overall  aim  of  the  Sangh  Parivar  is  to 
increase  the  predominance  of  Hinduism  in  India, 
in  relation  to  its  societal,  political,  and  cultural 
presence.  It  seeks  to  pursue  this  goal  through  vari¬ 
ous  means,  including  violence  and  terror.  The 
Hindutva  agenda  includes  attempts  to  suppress  or 
drive  out  Muslims  and  Christians.  This  is  because, 
for  the  Sangh  Parivar,  they  were  alien  faiths,  his¬ 
torically  introduced  to  India  by  external  conquer¬ 
ors.  Islam  was  patronized  by  the  Muslim  Moghuls 
in  the  16th  century  and  Christianity  by  the  British, 
mainly  from  the  19th  century  onward. 

The  Christian  Right  in  the  United  States 

The  CR  is  an  influential  religious  movement  with 
social  and  political  goals,  which  emerged  at  the 
time  of  the  presidency  of  Jimmy  Carter  (1977- 
1981).  The  CR  shared  many  of  the  ideals  and  goals 
of  Carter’s  successor,  Ronald  Reagan  (1981-1989), 
facilitating  its  development  into  a  significant  and 
influential  entity.  The  election  of  George  W.  Bush 
in  2001  saw  a  resurgence  of  the  CR.  Conservative 
Christian  leaders  were  able  to  arrange  sessions 
with  senior  White  House  aides.  This  was  widely 
understood  to  reflect  an  ideological  empathy 
between  leaders  of  the  CR  and  prominent  govern¬ 
ment  insiders,  including  the  President  himself. 

The  CR  has  key  concerns,  including  what  form 
of  Christianity  should  take  precedence  in  America, 
reshaping  American  society  according  to  their 
understanding  of  Christian  values,  the  importance 
of  “family  values,”  and  the  relationship  between 
these  values,  public  education,  and  public  policy 
generally.  After  George  W.  Bush’s  first  term  (2001- 
2005),  CR  leaders  were  emboldened  by  their  role 
in  reelecting  him  and  galvanized  by  their  success  in 
campaigning  for  constitutional  amendments  to  ban 
same-sex  marriage,  eventually  passed  in  18  states. 
The  aim  was  to  organize  and  build  on  these  early 
successes,  to  solidify  their  agenda-setting  role,  and 
to  help  elect  sympathetic  public  officials. 

It  used  to  be  said  that  every  4  years  at  the  time 
of  the  U.S.  presidential  elections  American  and 
foreign  journalists  rediscover  religion.  This  was 


the  periodic  occasion  when  the  media  scented  the 
electoral  possibilities  of  the  influence  of  the  CR,  a 
significant  domestic  political  lobby  group,  on  elec¬ 
toral  outcomes.  Initially,  the  CR  comprised  mainly 
Protestant  conservative  evangelicals.  Over  time, 
however,  its  makeup  became  more  eclectic,  referring 
to  a  broad  community  of  mostly  Christian  religious 
conservatives.  Prior  to  the  1970s,  the  CR  had  been 
a  subculture,  largely  keeping  its  distance  from  elec¬ 
toral  politics.  But  with  a  new  focus  on  social  conser¬ 
vatism,  around  the  time  of  the  presidency  of  Ronald 
Reagan,  Republican  Party  strategists — together 
with  neoconservatives  and  other  right-wing 
ideologues — encouraged  the  politicization  of  CR 
members  as  part  of  a  “New  Right”  fusionism  that 
saw  Ronald  Reagan  elected  as  president  in  1981 
and  reelected  4  years  later. 

Despite  ideological  and  political  differences 
among  the  different  organizations  that  make  up  the 
CR,  many  members  share  a  belief  that  secularism 
poses  a  serious  threat  to  liberty,  democracy,  and 
pluralism.  The  CR  is  “radical”  in  that  it  advocates 
dramatic  changes  in  society  and  “religious”  in  that 
its  members  and  leaders  tend  to  base  their  ideolo¬ 
gies  on  religious  doctrines  drawn  from  the  Bible.  In 
the  mid-2000s,  it  was  estimated  that  those  claiming 
identification  with  the  ideas  of  the  CR  comprised 
around  20%  (some  60  million  people)  of  the  adult 
population  of  the  United  States.  Today,  the  CR 
seeks  to  uphold  and  perpetuate  “Christian  values,” 
regarding  as  anathema  manifestations  of  “exces¬ 
sive  liberalism,”  including  legal  abortion,  absence 
or  downgrading  of  prayers  in  public  schools,  and 
science  teaching  that  adopts  a  rationalist,  rather 
than  a  “creationist,”  perspective. 

Jewish  Religious  Movements  in  Israel 

Since  Israel’s  founding  in  1948,  state  policy  has 
traditionally  favored  the  political  middle  ground. 
This  implies  that  neither  religious  nor  secular 
political  ideas  have  on  their  own  been  able  to 
dominate  the  political  agenda.  Over  time,  religious 
Jews  organized  themselves  through  a  number  of 
religious  movements  and  became  an  increasingly 
significant  political  voice.  They  were  especially 
vocal  in  opposition  to  the  policy  of  conceding 
parts  of  Biblical  Israel  to  the  Palestinians,  espe¬ 
cially  the  West  Bank  of  the  River  Jordan.  The  topic 
is  a  subject  of  intense  controversy  that  divides  the 
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country.  It  has  dominated  the  political  agenda 
since  the  early  1990s,  focused  in  the  divisive  Oslo 
peace  accords  of  1993,  Prime  Minister  Yitzhak 
Rabin’s  assassination  in  1995,  and,  following  pres¬ 
sure  from  the  government  of  the  United  States, 
handing  over  control  of  the  Gaza  Strip  to  the 
Palestinians  in  2005. 

In  addition  to  various  religious  political  parties, 
including  Shas  and  United  Torah  Judaism,  Israel’s 
political  landscape  also  includes  a  number  of  influ¬ 
ential  Jewish  movements,  including  Edah  Haredit 
(God  Fearful  Community),  Neturei  Karta 
(Guardians  of  the  City),  and  Gush  Emunim  (Bloc 
of  the  Faithful).  One  of  the  most  influential  is 
Gush  Emunim,  founded  in  1974  in  the  West  Bank 
settlement  of  Kfar  Etzion.  Gush’s  main  aim  is  to 
achieve  conquest  and  settlement  of  what  it  regards 
as  the  Biblical  land  of  Israel  (Eretz  Yisrael).  During 
the  1980s,  the  movement  grew  rapidly,  especially 
after  the  1978  Camp  David  agreement  between 
Israel  and  Egypt  that  led  to  the  return  of  the  latter 
to  the  Sinai  desert — grabbed  by  Israel  in  the  1967 
Six-Day  War.  Along  with  Gush,  other  extremist 
religious  movements,  such  as  the  late  Rabbi  Meir 
Kahane’s  organization,  Kach  (meaning  “Thus”) 
and  Kahane  Chai  (“Kahane  Lives”;  founded  after 
Kahane  was  assassinated  in  1990),  argue  on  reli¬ 
gious  grounds  against  giving  back  territory  not 
only  to  Egypt  but  also  to  the  Palestinians  or  any 
other  non-Jewish  entities.  This  is  because  they 
regard  such  a  policy  as  a  contradiction  of  God’s 
will  expressed  in  the  Torah. 

Organized  in  movements  such  as  Kach  and 
Kahane  Chai,  Jewish  religious  zealots  seek  to  be 
the  voice  of  the  mostly  religious  Jewish  settlers, 
who  try  to  influence  Israeli  policy  in  relation  to 
both  Egypt  and  the  Palestinians  so  as  not  to  hand 
land  back  to  non-Jews.  Following  the  1993  Oslo 
peace  accords  with  the  Palestinians,  involving  the 
latter  receiving  autonomy  in  the  Gaza  Strip  from 
August  2005  and  an  area  around  the  West  Bank 
city  of  Jericho,  religious  opposition  to  the  accord 
with  the  Palestinians  was  manifested  in  mass  mur¬ 
der.  A  Jewish  religious  zealot,  Baruch  Goldstein, 
linked  to  both  Kach  and  Kahane  Chai,  murdered 
29  people  and  injured  approximately  100  more  in 
a  dawn  attack  on  a  mosque  in  the  West  Bank  town 
of  Hebron  in  February  1994.  Following  the  mas¬ 
sacre,  the  Israeli  government,  as  a  sign  of  its  com¬ 
mitment  to  crush  Jewish  extremist  groups  that 


systematically  used  violence  to  try  to  achieve  their 
objectives,  banned  both  Kach  and  Kahane  Chai. 

The  political  significance  of  religious  parties  and 
movements  on  policy  making  in  Israel  is  unlikely  to 
fade  soon  for  several  reasons.  First,  the  basis  of 
both  nationality  and  the  creation  of  the  state  of 
Israel  remains  a  sense  of  religious  identity,  making 
the  issue  consistently  vulnerable  to  the  influence  of 
religious  Jews,  some  of  whom  are  also  political 
extremists.  Second,  there  has  been  strong  growth 
in  the  numbers  of  religious  Jews  since  the  early 
1970s,  including  recent  immigrants.  Now,  it  is 
claimed  that  up  to  half  of  Israeli  Jews  “respect  the 
religious  commands,”  while  1  in  10  belongs  to  the 
baredi  (ultra-orthodox)  community.  Around  60% 
of  the  baredi  population  is  younger  than  25  years 
of  age — and  the  proportion  of  the  ultra-orthodox 
will  grow  because  many  have  large  numbers  of 
children.  Many  such  people  form  the  core  support 
and  activist  base  of  the — sometimes  extremist — 
religious  movements  and  parties.  Third,  the  latter 
will  continue  to  have  major  political  influence 
because  of  the  nature  of  the  country’s  political  sys¬ 
tem,  which  is  based  on  proportional  representation 
and  a  very  low  minimal  threshold  of  1.5%  of  the 
vote  to  be  represented  in  the  Knesset  (Israeli  parlia¬ 
ment).  As  a  result,  such  parties  have  the  ability  to 
acquire  political  rewards  in  return  for  supporting 
the  main  secular  political  parties,  including  Kadima, 
Likud,  and  Labour,  in  the  context  of  the  formation 
of  coalition  governments.  Finally,  in  recent  years, 
there  has  been  a  dovetailing  of  secular  security 
concerns  (concerned  with  Israel’s  regional  national 
interests  and  power)  and  religious  interests  (an 
aversion  to  handing  over  land  to  the  non-Jewish 
Palestinians,  as  it  is  believed  to  be  against  God’s 
will).  Overall,  the  Jewish  movements  are  a  power¬ 
ful  coalition  of  interests,  often  able  to  apply  sig¬ 
nificant  pressure  on  Israel’s  government  via  both 
the  ballot  box  and  civil  society. 

The  Muslim  Brotherhood  in  Egypt 

Hassan  al-Banna  (1906-1949)  established  the  MB 
in  Egypt  in  1928  as  an  organization  providing  wel¬ 
fare  services,  aiming  to  encourage  and  defend 
morality.  The  aim  was  to  Islamize  society  “from 
below.”  Initially,  the  MB  was  just  one  of  a  number 
of  small  Islamic  associations  engaged  in  charity 
work  among  the  poor  of  the  country’s  fast-growing 
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cities.  MB  built  mosques,  schools,  and  clinics,  pro¬ 
viding  an  Islamic  welfare  framework  by  which 
Muslims  could  live  without  reference  to  the  Western 
and  secular  influences  around  them.  However,  the 
MB  was  soon  drawn  into  politics.  It  developed 
explicit  political  goals,  involving  a  radicalization  of 
its  philosophy  and  tactics  and  the  creation  of  an 
armed  wing.  Nonetheless,  the  MB’s  main  focus 
remained  education  from  below  and  the  infiltra¬ 
tion  of  Egypt’s  political  and  social  institutions.  The 
combination  of  its  welfare  services,  its  religious  cre¬ 
dentials,  and  the  government’s  increasing  authori¬ 
tarianism  combined  to  make  the  MB  Egypt’s  largest 
religious  movement,  with  tens  of  thousands  of  fol¬ 
lowers,  which  also  pursued  explicitly  political  goals. 

It  is  important  to  understand  that  the  brother¬ 
hood,  despite  its  own  rhetoric,  is  not  a  manifesta¬ 
tion  of  “traditional  Islamism.”  Instead,  the  MB  is 
a  key  example  of  a  modern  social  movement:  mass 
based,  populist,  supported  mainly  by  the  urban 
middle  and  lower  classes,  using  a  cell-based  struc¬ 
ture,  and  embracing  religious  reformism.  In  addi¬ 
tion,  the  MB  focuses  explicitly  and  primarily  on 
Egypt’s  domestic  situation:  It  wishes  to  reform  the 
country,  not  the  universal  Islamic  community  as, 
for  example,  the  transnationalist  group  A1  Qaeda 
aims  to  do.  Like  Egypt’s  secular  nationalists,  the 
MB  struggles  primarily  for  Egyptian  independence, 
with  a  rallying  cry  that  reflects  this  aim:  “Egypt  for 
the  Egyptians.” 

After  al-Banna’s  assassination  in  1949,  the  MB 
became  close  to  the  revolutionary  government  of 
Gamal  Abdul  Nasser,  following  its  accession  to 
power  in  1952.  By  the  mid-1950s,  however,  rela¬ 
tions  had  soured  following  an  unsuccessful  attempt 
on  Nasser’s  life.  As  a  result,  the  MB  was  proscribed, 
thousands  of  members  were  imprisoned,  and  its 
leader,  Sayyid  Qutb,  arrested.  He  was  executed  in 
1966.  During  his  captivity,  Qutb  produced  a  com¬ 
prehensive  commentary  on  Islamic  history,  arguing 
that  Egypt  was  not  an  Islamic  country,  as  it  was  in 
a  state  of  jahilya  (religious  ignorance).  Muslims 
could  not  live  a  proper  religious  life  in  such  circum¬ 
stances.  It  was  therefore  necessary  to  overthrow  the 
existing  political  order  by  any  means. 

Qutb’s  ideology  was  deeply  influenced  by  the 
revolutionary  radicalism  of  a  contemporaneous 
Indian  Islamist,  Sayyid  Abu’l-A’la  Mawdudi 
(1903-1979).  Qutb’s  ideological  development  fell 
into  two  distinct  periods:  before  1954  and  from 


1954  until  his  execution.  During  his  second,  radi¬ 
cal  phase,  Qutb  declared  “Western  civilization” 
the  enemy  of  Islam,  denounced  leaders  of  Muslim 
nations  for  not  following  Islam  closely  enough, 
and  sought  to  spread  the  belief  among  Sunni 
Muslims  that  it  was  their  duty  to  undertake  jihad 
to  defend  and  purify  Islam. 

Following  President  Anwar  Sadat’s  accession  to 
power  in  1970,  the  brotherhood  enjoyed  improved 
relations  with  the  government — because  it  was 
anxious  to  cultivate  the  MB’s  support  in  the  state’s 
fight  against  secular  leftists.  Taking  advantage  of 
Sadat’s  overtures,  the  brotherhood  built  mosques, 
schools,  and  clinics.  Its  financial  position  improved 
following  the  founding  of  a  range  of  profitable 
companies  and  financial  institutions.  Such  entre¬ 
preneurial  flair  was  not,  however,  appreciated  by 
all  brotherhood  members,  and  the  movement  split 
into  two  sections:  the  “radicals”  and  the  “moder¬ 
ates.  ”  The  latter  believed  in  a  gradualist  approach, 
to  Islamize  society  slowly  by  increasing  the  broth¬ 
erhood’s  influence  as  a  popular  movement,  while 
the  former  wanted  a  speedier,  more  radical  action. 
Later,  after  Sadat’s  assassination  in  1981  at  the 
hands  of  Islamists,  moderate  MB  members  entered 
electoral  politics.  Since  then  it  has  consistently 
won  seats  in  parliament,  in  line  with  President 
Mubarak’s  goal  of  “limited  democracy”  in  Egypt. 

The  radicals  were  too  impatient  for  change,  too 
opposed  to  what  they  perceived  as  a  sham  democ¬ 
racy  to  join  in  such  a  program.  Breaking  away  from 
the  moderates,  several  new  Islamist  movements — 
including  Takfir  wal-Hijra  and  al-Jihad — were 
founded.  Together  they  were  the  core  of  a  shifting 
set  of  religious-political  movements  under  the  gen¬ 
eral  title  of  al-Gama’at  al-Islamiyya  (The  Islamic 
Groups).  Stimulated  by  the  writings  of  Qutb,  they 
were  characterized  not  only  by  a  denial  of  conven¬ 
tional  electoral  politics  but  also  by  the  use  of  terror¬ 
ist  tactics  to  achieve  an  Islamic  state.  Members  of 
Takfir  wal-Hijra  were  responsible  for  a  number  of 
political  assassinations,  while  the  al-Jihad  group 
not  only  killed  President  Sadat  but  also  tried  to  kill 
two  government  ministers  and  the  parliamentary 
speaker  at  the  same  time.  The  state  then  decided  to 
crack  down  on  the  radicals.  More  than  300  mem¬ 
bers  of  al-Jihad  were  arrested  and  some  executed, 
specifically  for  their  alleged  involvement  in  the 
murder  of  members  of  the  state  security  forces  in 
the  town  of  Asyut  in  the  early  1980s. 
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The  divisions  between  the  secular  nationalists 
and  the  MB  in  Egypt  developed  in  the  1980s  into 
a  conflict  between  two  different  conceptions  of 
society  and  two  different  ideas  of  what  are  desir¬ 
able  social  and  political  changes.  The  conflict 
spilled  over  from  the  religious  sphere  into  the  fields 
of  politics,  economics,  and  social  affairs.  The  goal 
of  the  MB — to  build  an  Islamic  state — was  an 
attempt  to  transform  the  language  of  politics,  to 
build  as  broad  as  possible  a  coalition  of  opposition 
forces,  and  to  deliver  welfare  benefits,  especially  to 
the  underprivileged  urban  poor.  More  generally, 
the  Islamists  of  the  MB  wanted  to  reverse  what 
they  saw  as  a  cataclysmic  slide  into  godless  moder¬ 
nity,  exemplified  by  a  secular  society. 

Conclusion 

As  this  discussion  of  the  relationship  between 
selected  religious  movements  and  politics  has  shown, 
religious  movements  have  specific  aims  and  objec¬ 
tives  that  differ  according  to  their  specific  contexts, 
backgrounds,  and  history;  however,  they  share  an 
overt  disavowal  of  the  desirability  of  secularization 
and  associated  modernization  that  highlight  state 
attempts  to  restrict  religion  to  the  private  sphere. 
Religious  movements  work  at  the  level  of  civil  soci¬ 
ety.  Their  aim  is  not  usually  to  enter  conventional 
electoral  politics  and  try  to  achieve  the  changes  they 
want  to  see  by  involvement  in  political  society. 
Instead,  their  focus  is  on  the  community  or  grass¬ 
roots  level,  where  they  seek  to  build  mass  support, 
whether  working  alone  or  in  coalition  with  other 
like-minded  groups.  The  intention  is  to  change  both 
the  status  quo  and  the  prevailing  power  equation,  so 
as  to  build  a  society  with  more  involvement  of  reli¬ 
gion  in  everyday  matters  that  affect  the  well-being, 
norms,  and  values  of  the  mass  of  the  people. 
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Reorganization 


This  entry  discusses  the  meaning  of  the  term  reor¬ 
ganization  in  the  public  sector,  the  targets,  the 
argument,  and  the  models  for  reorganization  and 
concludes  by  assessing  the  consequences  of  these 
changes. 

The  Meaning  of  Reorganization 

In  political  science  and  public  administration, 
reorganization  usually  refers  to  government  reor¬ 
ganization.  But  one  can  talk  about  legislative  reor¬ 
ganization  and  judicial  reorganization  as  well. 
Reorganization  refers  to  the  imposition  of  a  new 
way  of  organizing,  or  “to  organize  differently,” 
which  often  involves  an  extensive  alteration  of  the 
structure  of  government. 

Any  such  reorganization  or  “shake-up”  intro¬ 
duced  must  by  definition  produce  extensive  and 
drastic  changes  to  government;  minor  changes  in 
government  are  not  considered  to  be  an  act  of 
reorganization.  In  addition,  there  may  be  a  wide 
range  of  goals  in  reorganization,  but  the  assump¬ 
tion  is  that  these  goals — efficiency,  effectiveness, 
and  changes  in  policy — are  all  being  addressed 
through  structural  changes  in  the  public  sector. 

The  expression  “government  reorganization” 
has  a  neutral  connotation.  It  does  not  directly  praise 
or  condemn  the  Old  System  as  being  good  or  bad. 
The  emphasis  is  on  the  introduction  of  a  different 
way  of  organizing  government.  On  the  other  hand, 
the  word  reform  is  value  laden.  Reform  refers  to  a 
change  for  the  better  as  a  result  of  correcting 
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abuses,  as  in  the  common  statement,  “Corruption 
in  government  is  rampant,  so  there  is  an  urgent 
need  to  reform  government.”  So  the  words  govern¬ 
ment  reorganization  and  government  reform  are 
different,  although  at  times  they  tend  to  be  used 
interchangeably. 

Recent  developments  in  the  field  of  political  sci¬ 
ence  and  public  administration  have  emphasized 
“reform”  over  “reorganization.”  The  new  buzz¬ 
words  are,  for  example,  “administrative  reform,” 
“government  reform,”  “governance  reform,” 
“NPM  reform,”  “reinventing  government,”  and 
“revitalizing  government.”  One  reason  why  the 
phrase  word  government  reorganization  is  being 
replaced  by  these  new  words  has  to  do  with  the 
beliefs  among  practitioners  and  scholars  that  the  old 
government  machines  are  in  an  urgent  need  of  being 
changed  drastically.  This  has  been  true  in  both  the 
developed  and  the  less  developed  countries.  The 
phrases  administrative  reform  and  government 
reform  are  very  broad  and  do  not  specify  the 
intended  nature  of  reform.  In  fact,  administrative 
reform  is  a  broader  term  than  government  reform 
because  the  latter  focuses  on  the  reform  of  govern¬ 
ment,  while  the  first  covers  broader  areas  such  as 
the  public  sector  and  civil  society.  Meanwhile,  the 
terms  governance  reform,  new  public  management 
(NPM)  reform ,  and  reinventing  government  are 
driven  by  specific  doctrines  or  principles  of  reform. 
We  will  return  to  this  topic  later  in  the  entry. 

The  Targets  of  Government  Reorganization 

There  are  many  ways  to  implement  programs  for 
government  reorganization.  The  most  common 
kind  of  government  reorganization  is  structural. 
For  example,  government  reorganization  may 
involve  the  creation  of  new  ministries  and  depart¬ 
ments,  the  merging  of  several  lower  level  units  into 
a  higher  one,  and  the  structural  expansion  of  a 
government  agency.  The  assumption  is  that  the 
manipulation  of  structural  variables  is  the  key  to 
improving  how  the  system  operates.  Other  targets 
of  reorganization  are  more  focused  on  procedures. 
For  example,  one  of  the  common  targets  for  reor¬ 
ganization  has  been  budget  processes.  Other  tar¬ 
gets  have  included  personnel  policy  in  government, 
policy  making,  ethics,  and  strategic  management. 

Changing  the  structure  of  government  organi¬ 
zation  involves,  for  example,  choices  between 


pyramid  or  flat  organizations,  centralization  or 
decentralization,  strong  or  weak  leadership,  small 
or  big  government,  the  variety  of  forms  of  public 
organizations  in  government,  the  size  of  the  public 
sector  in  relation  to  the  private  sector,  the  role  of 
the  civil  society  and  citizens  in  public  affairs,  and 
the  configurations  of  multilevel  governance.  One 
of  the  most  common  structural  reforms  during  the 
past  several  decades  has  been  the  creation  of 
executive  agencies  that  have  responsibility  for  a 
limited  number  of  functions  and  enjoy  a  good  deal 
of  autonomy  from  direct  controls  by  the  rest  of 
government.  The  contrary  pattern  of  reorganiza¬ 
tion  has  been  to  create  very  large  cabinet-level 
departments  that  are  designed  to  coordinate  a 
wide  range  of  public  functions  and  provide  more 
coherent  packages  of  services  to  the  public. 

The  systemic  choices  for  reorganizing  public 
budgeting  are,  for  example,  line  item  budgeting, 
performance  budgeting,  program  budgeting,  or 
zero-based  budgeting;  nonintegrated  or  integrated 
budget;  and  annual  or  multiyear  budget.  In  person¬ 
nel  reorganization,  the  system  choices  have  been, 
for  example,  spoils  or  merit  system;  for  recruit¬ 
ment;  position  classification  system  or  contract 
system;  and  performance  appraisal  or  seniority  for 
internal  promotions.  An  example  of  the  reorgani¬ 
zation  of  the  policy  process  may  involve  the  extent 
to  which  government  mechanisms  facilitate  citizen 
participation  in  the  policy  decision-making  process 
and  implementation.  Attempts  to  change  the  nature 
of  personal  morality,  professional  ethics,  organiza¬ 
tional  ethics,  and  social  ethics  can  also  be  seen  as  a 
reorganization  of  the  agency’s  normative  system 
that  is  used  to  control  the  behavior  of  individuals 
within  government.  Reorganization  in  strategic 
management  may  involve  the  implementation  of 
strategic  tools  such  as  best  practices,  benchmark¬ 
ing,  and  balanced  scorecards.  Again,  these  repre¬ 
sent  attempts  to  control  the  behavior  of  individuals 
within  the  public  sector. 

The  Arguments  for  Government 
Reorganization 

The  most  common  reason  given  for  carrying  out 
government  reorganization  is  to  improve  the  effi¬ 
ciency  of  the  government  machine.  Government 
officials  responsible  for  reform  will  usually  justify 
a  major  bureaucratic  reform  by  claiming  that  such 
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institutional  change  will  bring  in  better  services  to 
the  people  and  savings  to  government.  Efficiency  is 
defined  as  the  ratio  between  input  and  output: 
More  output  is  obtained  from  the  same  amount  of 
input.  There  are,  however,  hidden  agendas  in  every 
reorganization  effort. 

Scholars  have  pointed  out  that  there  are  vested 
interests  involved  in  any  attempt  to  reorganize 
government.  Reform,  as  a  matter  of  fact,  is  poli¬ 
tics.  After  government  reorganization,  there  are 
always  those  who  gain  and  those  who  lose  in  the 
reform  process.  For  example,  the  splitting  off  of  a 
new  ministry  from  an  old  one  opens  up  new 
opportunities  for  senior  government  officials  to  fill 
in  new  higher  positions,  and  there  may  be  a  better 
“target”  for  interest  groups  attempting  to  influ¬ 
ence  policy.  As  a  separate  entity,  the  new  ministry 
can  also  demand  more  allocation  of  budget.  New 
laws,  rules,  and  regulations  are  issued  to  provide 
legitimacy  and  authoritative  power  to  the  officials 
in  the  new  ministry.  Another  example  is  that 
reform  proposals  put  forward  by  central  agencies 
such  as  the  Civil  Service  Commission,  the  Budget 
Bureau,  the  Ministry  of  Finance,  and  the 
Government  Reform  Commission  have  tendencies 
to  consolidate  power  and  tighten  control  over  the 
entire  bureaucracy.  Such  organizations  will  claim 
that  central  executive  control  was  necessary  to 
promote  bureaucratic  responsiveness,  especially 
from  line  agencies.  Proposals  by  a  single  depart¬ 
ment  such  as  the  national  police  or  a  regulatory 
organization  usually  contain  clauses  to  increase 
their  autonomy  and  authoritative  power.  In  short, 
more  positions  are  requested,  new  suborganiza¬ 
tions  are  proposed,  new  legal  power  is  justified, 
and  bigger  budgets  are  requested. 

Further,  most  proposals  for  government  reorga¬ 
nization  designed  by  government  officials  involve 
suggestions  for  bureaucratic  structural  expansion 
and  power  aggrandizement.  Scholars  (such  as 
Anthony  Downs,  1967)  have  pointed  out  that,  in 
fact,  all  organizations  have  inherent  tendencies  to 
expand.  C.  Northcote  Parkinson’s  famous  law 
states  that  “work  expands  so  as  to  fill  the  time 
available  for  its  completion”  (quoted  in  Downs, 
1967,  p.  16).  Downs  summarized  the  reasons  why 
bureaucracy  inherently  seeks  to  expand  as  follows: 
(a)  an  organization  that  is  rapidly  expanding  can 
attract  more  capable  personnel  and  more  easily 
retain  its  most  capable  existing  personnel  than  one 


that  is  expanding  slowly,  stagnating,  or  shrinking; 
(b)  the  expansion  of  any  organization  normally 
provides  its  leaders  with  increased  power,  income, 
and  prestige;  (c)  growth  tends  to  reduce  internal 
conflicts  in  an  organization  by  allowing  its  mem¬ 
bers  to  increase  their  personal  status  without  low¬ 
ering  that  of  others;  and  (d)  increasing  the  size  of 
an  organization  may  also  improve  the  quality  of 
its  performance  (per  unit  of  output)  and  its 
chances  for  survival  (p.  17).  Subsequent  analysts 
have  pointed  out  that  increasing  the  size  of  an  orga¬ 
nization  may  create  more  managerial  problems 
than  it  creates  benefits,  but  many  bureaucratic 
organizations  persist  in  seeking  expansion. 

Faunching  a  major  government  reorganization 
plan  also  can  be  good  publicity  for  the  govern¬ 
ment.  It  creates  the  impression  that  the  problem 
lies  in  the  government  machine,  not  the  political 
executive  and  that  the  elected  government  is  seri¬ 
ously  addressing  the  problem  of  inefficiency  and 
corruption  in  the  bureaucracy.  This  kind  of  bureau¬ 
cratic  bashing  pleases  the  public  voters,  who  are 
bored  with  and  frustrated  by  their  bitter  experi¬ 
ences  with  government  bureaucrats.  Indeed,  when 
governments  have  few  opportunities  to  address  a 
policy  problem  directly,  they  often  reorganize  gov¬ 
ernment  in  order  to  appear  to  be  doing  something. 

The  experiences  of  countries  at  reorganization 
indicate  that  despite  all  the  publicity  on  govern¬ 
ment  reorganization,  at  the  end,  the  bureaucrats 
are  often  able  to  outsmart  the  political  executive. 
In  many  cases,  not  much  has  been  achieved  from 
government  reorganization.  Instead,  the  reform 
commissions  have  produced  mountainous  piles  of 
reports  and  studies  on  the  ideas  and  reform  plans 
that  never  got  implemented.  Not  much  bureau¬ 
cratic  reform  was  actually  undertaken,  and  the 
“ugly  old  bureaucracy”  remains  as  inefficient  and 
venal  as  ever.  Again,  reorganization  and  reform  are 
as  often  as  not  symbolic  policies  rather  than  effec¬ 
tive  means  of  solving  public  problems. 

Another  reason  why  government  reorganization 
is  needed  is  the  argument  of  the  turbulent  environ¬ 
ment.  Government  reorganization  is  needed 
because  of  the  volatile,  unpredictable,  and  rapidly 
changing  environment.  For  example,  the 
Minnowbrook  Conferences,  held  by  Syracuse 
University  in  1968  and  1988  for  scholars  in  public 
administration,  were  organized  to  bring  in  new 
ideas  about  government  reorganization  and 
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improvement.  In  the  age  of  rapid  globalization, 
there  is  an  urgent  need  to  reorganize  government 
in  line  with  this  new  international  system.  Another 
factor  in  support  of  government  reorganization  is 
the  revolution  in  information  technology  that  is 
drastically  changing  the  way  governments  work. 
Most  recently,  the  economic  crisis  of  the  early  21st 
century  is  forcing  substantial  rethinking  of  the  role 
of  government  and  of  the  means  of  organizing  the 
public  sector. 

Reorganization  in  the  public  sector  can  also  be 
used  to  alter  the  policies  adopted  and  implemented 
by  government.  While  efficiency  may  be  difficult 
to  attain,  given  the  numerous  barriers  that  exist 
within  the  public  sector,  reorganization  may  be 
more  effective  in  altering  policy,  simply  because  of 
the  different  priorities  that  may  be  involved  when 
an  organization  is  independent  or  when  it  is  con¬ 
tained  within  a  different  department  or  ministry. 
For  example,  the  U.S.  Coast  Guard  moved  from 
the  Department  of  Transportation  to  Homeland 
Security  after  the  terrorist  attacks  of  September  11, 
2001.  After  that  move,  the  agency  shifted  its  pri¬ 
orities  toward  security  and  away  from  safety  on 
waterways. 

The  Models  of  Government  Reorganization 

Thinking  about  how  to  implement  government 
reorganization  leads  us  to  making  choices  about 
centralization  and  decentralization,  big  or  small 
government,  strong  or  weak  leader,  pyramid  or  flat 
structures,  the  extent  of  citizen  participation,  the 
power  relations  between  superiors  and  subordi¬ 
nates,  the  appropriate  values  and  culture,  and  the 
nature  of  environment  one  is  facing.  All  these  vari¬ 
ables  are  important  for  defining  how  government 
will  function  and  the  type  of  choices  that  the  orga¬ 
nizations  have  on  which  to  base  their  decisions. 

B.  Guy  Peters  (1992)  proposes  a  theoretical 
analysis  of  “government  reorganization.”  Accord¬ 
ing  to  him,  reform  and  reorganization  is  one  of  the 
most  common  activities  in  government.  He  pointed 
out  three  sets  of  approaches  to  reform:  purposive 
model  of  reform,  environmental  dependency,  and 
institutional  models  of  organization  change.  First, 
purposive-action  approaches  assume  that  one  or 
more  actors  in  the  reorganization  process  have  a 
particular  end  state  in  mind  for  the  public  sector 
when  they  propose  the  reform  or  reorganization. 


The  need  to  reorganize  government  may  come  from 
perceived  inadequacies  of  existing  arrangements,  a 
sense  of  “overload”  and  “ungovernability”  in  pub¬ 
lic  organizations,  and  the  desire  of  the  rational 
individual  bureaucrat  to  maximize  his  or  her  per¬ 
sonal  utility.  Second,  the  environmental-determinism 
approach  assumes  that  changes  occur  as  a  func¬ 
tion  of  the  relationship  of  the  administrative  struc¬ 
tures  to  their  environment.  The  need  to  reorganize 
comes  from  the  reaction  of  government  to  innova¬ 
tions  in  their  environment  that  provide  them  with 
the  opportunity  to  exploit  the  new  situation. 
Another  explanation  is  from  contingency  theories, 
which  argued  that  “the  internal  structuring  of 
organizations  will,  over  time,  come  to  reflect  the 
characteristics  of  the  task  environment”  (Peters, 
1992,  p.  206).  The  population  ecology  models 
assume  that  the  best  way  to  understand  govern¬ 
ment  reorganization  is  to  look  at  the  evolution  of 
entire  populations  of  organizations.  Third,  institu¬ 
tionalism  approaches  assume  that  reform  requires 
to  some  extent  altering  the  internal  values  of  the 
organization,  which  includes  the  modification  of 
the  operative  values  of  organizational  members. 
Institutional  models  in  political  science  “stress  the 
durability  of  organizational  values,  or  cultures,  and 
therefore  the  extreme  difficulty  that  may  be 
encountered  in  attempting  to  produce  any  rapid  or 
significant  changes  in  performance  through  reorga¬ 
nization”  (Peters,  1992,  p.  211). 

The  traditional  model  of  government  organiza¬ 
tion,  and  perhaps  reorganization,  often  follows  the 
classic  ideal-type  bureaucracy  of  Max  Weber.  The 
characteristics  of  bureaucracy  are  as  follows: 

1.  There  is  the  principle  of  fixed  and  official 
jurisdictional  areas,  which  are  generally  ordered 
by  rules — that  is,  by  laws  or  administrative 
regulations. 

2.  The  principles  of  office  hierarchy  and  of  levels 
of  graded  authority  mean  a  firmly  ordered 
system  of  super-  and  subordination  in  which 
there  is  a  supervision  of  the  lower  offices  by  the 
higher  ones. 

3.  The  management  of  the  modern  office  is  based 
on  written  documents  (the  files),  which  are 
preserved  in  their  original  or  draft  form. 

4.  Office  management,  at  least  all  specialized  office 
management — and  such  management  is 
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distinctly  modern — usually  presupposes 
thorough  and  expert  training. 

5.  When  the  office  is  fully  developed,  official 
activity  demands  the  full  working  capacity  of 
the  official,  irrespective  of  the  fact  that  his 
obligatory  time  in  the  bureau  may  be  firmly 
delimited. 

6.  The  management  of  the  office  follows  general 
rules,  which  are  more  or  less  stable,  more  or 
less  exhaustive  and  which  can  be  learned. 

The  literature  in  public  administration  is  marked 
by  efforts  to  find  a  more  suitable  way  to  reorganize 
government  than  Weberian  bureaucracy.  Peters 
(1996)  proposes  four  alternative  models:  the  mar¬ 
ket  model,  the  participatory  state,  the  flexible 
government,  and  the  deregulated  government. 
Michael  Barzelay  and  Barbak  Armajani  (1992) 
observed  the  shifting  from  the  bureaucratic  para¬ 
digm  to  the  postbureaucratic  paradigm  as  a  move¬ 
ment  away  (a)  from  the  public  interest  to  the 
benefits  for  individual  citizens,  (b)  from  efficiency 
to  quality  and  value,  (c)  from  control  to  winning 
adherence  to  norms,  (d)  from  enforcing  responsi¬ 
bility  to  building  accountability,  and  (e)  from  justi¬ 
fying  costs  to  delivering  value. 

In  many  parts  of  Europe  and  the  Antipodes,  the 
reform  movements  are  loosely  called  the  NPM. 
Christopher  Hood  (1991)  summarized  the  doc¬ 
trines  of  NPM  as  consisting  of  (a)  hands-on  profes¬ 
sional  management  in  the  public  sector,  (b)  explicit 
standards  and  measures  of  performance,  (c)  greater 
emphasis  on  output  controls,  (d)  shift  to  disaggre¬ 
gation  of  units  in  the  public  sector,  (e)  shift  to 
greater  competition  in  public  sector,  (f)  stress  on 
private  sector  styles  of  management  practices,  and 
(g)  stress  on  greater  discipline  and  parsimony  in 
resource  use. 

The  NPM  model  was  not,  however,  widely 
adopted  in  Southern  Europe  because  of  the  empha¬ 
sis  on  administrative  law  as  the  means  of  control¬ 
ling  within  the  public  sector. 

In  the  United  States,  the  use  of  private  sector 
styles  of  management  came  to  be  known  as  the 
reinventing  government  movement.  David  Osborne 
and  Ted  Gaebler  (1992)  attacked  the  “sluggish 
centralized  bureaucracies”  as  unfit  for  the  rapidly 
changing,  information-rich,  knowledge-intensive 
society  and  economy  of  the  1990s.  They  proposed, 


instead,  “the  entrepreneurial  government,”  con¬ 
sisting  of  10  principles: 

1.  catalytic  government  (steering  rather  than 
rowing), 

2.  community-owned  government  (empowering 
rather  than  serving), 

3.  competitive  government  (injecting  competition 
into  service  delivery), 

4.  mission-driven  government  (transforming  rule- 
driven  organizations), 

5.  result-oriented  government  (funding  outcomes 
not  inputs), 

6.  customer-driven  government  (meeting  the  needs 
of  the  customer,  not  the  bureaucracy), 

7.  enterprising  government  (earning  rather  than 
spending), 

8.  anticipatory  government  (prevention  rather 
than  cure), 

9.  decentralized  government  (from  hierarchy  to 
participation  and  teamwork),  and 

10.  market-oriented  government  (leveraging  change 
through  the  market). 

This  style  of  reform  was  more  compatible  with 
governing  in  the  United  States  because  of  the  long¬ 
standing  emphasis  on  management  within  the 
public  sector. 

Another  emerging  framework  for  reorganizing 
government  is  “governance.”  At  the  abstract 
level,  as  noted  by  Bidhya  Bowornwatana  (1996), 
governance  in  public  administration  refers  to  a 
reform  model  that  advocates  the  following  prin¬ 
ciples:  a  smaller  government  that  does  less,  a 
flexible  and  global-vision  government,  an  account¬ 
able  government,  and  a  government  that  is  fair. 
From  the  government’s  experience  in  the  United 
Kingdom,  R.  A.  W.  Rhodes  (1996)  concludes  that 
governance  is  about  managing  networks  that  are 
self-organizing  and  autonomous.  The  governance 
approach  has  involved  the  use  of  social  actors 
through  networks  as  a  means  of  providing  public 
services,  as  well  as  a  means  of  shaping  public 
policy.  These  social  actors  have  tended 
not  to  be  market  actors  but  more  often  repre¬ 
sent  a  variety  of  civil  society  organizations  and 
interests. 
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For  some,  mostly  practitioners,  government 
reorganization  meant  the  introduction  to  govern¬ 
ment  of  management  tools  such  as  the  balanced 
scorecard,  strategic  planning,  the  learning  organi¬ 
zation,  reengineering.  Many  scholars  have  criti¬ 
cized  such  practices  as  being  part  of  management 
fads  and  fashions  that  do  not  really  improve  gov¬ 
ernment  performance  and  that  the  management 
tools  applied  will  soon  go  out  of  fashion. 

The  Consequences  of  Government 
Reorganization 

The  study  of  government  reorganization  cannot  be 
complete  without  considering  the  impact  of  gov¬ 
ernment  reorganization.  What  are  the  intended 
and  unintended  consequences  of  government  reor¬ 
ganization?  Who  gains?  Who  loses?  Have  the 
power  relationships  in  a  ministry  or  department 
been  changed?  Have  public  services  delivery  been 
improved?  What  are  the  positive  and  negative 
unintended  consequences?  Several  observations 
can  be  made  about  the  results  of  reorganization. 
First,  important  consequences  of  government  reor¬ 
ganizations  take  a  long  time  to  materialize.  The 
time  span  of  such  an  impact  bypasses  the  time  in 
office  of  the  policymakers.  Second,  the  question  of 
who  is  responsible  for  any  negative  consequences 
of  reform  can  get  very  complicated,  as  during  the 
course  of  time  (e.g.,  10  years)  several  actors  may 
have  been  involved  with  the  reorganization  policy. 
New  policies  may  be  introduced  one  after  another, 
with  the  effect  that  it  is  almost  impossible  to  trace 
accountability. 

Third,  the  immediate  gainers  and  losers  of  gov¬ 
ernment  reorganizations  are  the  political  execu¬ 
tives  and  subordinate  bureaucrats.  Some  actors  are 
promoted  to  newly  established  positions.  Some 
others  are  reshuffled  to  less  important  positions. 
The  political  executive  in  the  short  run  gets  politi¬ 
cal  points  from  their  publicity  of  the  reform  plans. 
Fourth,  the  success  of  reorganization  efforts  to 
change  the  values  and  cultures  of  government  pro¬ 
grams,  and  of  the  system  more  generally,  cannot  be 
easily  proven.  They  are  not  easily  visible,  and 
change  takes  a  long  time.  As  Frederick  Mosher 
(1967)  pointed  out,  a  reorganization  effort  is  not  a 
one-time  thing  but  a  step  in  progressive  history. 
Fifth,  the  majority  of  reorganization  efforts  tend  to 
fail  in  meeting  their  proclaimed  goals.  Although 


new  positions  have  been  created  and  more  money 

spent,  efficiency  has  not  improved. 
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Representation 


Political  representation  is  a  basic  institution  of 
Western  modernity.  Put  simply,  it  exists  so  that 
those  governed  can  govern  indirectly.  It  thus  cre¬ 
ates  a  minimum  level  (considered  to  be  the  only 
realistic  one)  of  self-government.  Under  the  sys¬ 
tems  of  modern  representation,  the  governed 
choose  those  who  will  govern  through  elections 
and  authorize  them  to  do  so  in  their  name,  thus 
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granting  representatives  a  mandate  to  look  after 
the  interests  of  the  governed. 

In  its  original  Latin  form,  the  word  representa¬ 
tion  covered  a  vast  semantic  field,  and  this  has  been 
further  widened  over  time.  To  represent  means  to 
portray,  describe,  narrate,  and  duplicate — to  make 
present  something  that  is  not  and  cannot  be  so. 
Hence,  the  problems  begin  with  the  very  word, 
and  circumscribing  its  significance  by  talking 
about  “political”  representation  does  little  to 
resolve  this. 

Even  if,  officially,  they  do  not  bear  this  name, 
forms  of  representation — or,  at  least,  institutional¬ 
ized  communication  between  the  governed  and  the 
governing — can  be  seen  even  within  those  political 
regimes  not  commonly  classified  as  representative. 
Further,  theoretically,  no  collective  body  can  exist 
without  some  form  of  (even  symbolic)  representa¬ 
tion.  A  people,  nation,  political  party,  class,  interest, 
or  religious  belief  needs  to  be  represented — and 
have  its  own  spokespeople — so  that  it  can  be  rec¬ 
ognized  by  both  its  own  members  and  everyone 
else.  However,  if  representation  is  a  constituting 
element  of  a  collective  body,  this  raises  the  key 
question  of  how  the  wishes  of  that  body  can  be 
transmitted  to  its  representatives. 

Belying  modernist  rhetoric  regarding  the  indi¬ 
vidual,  a  collective  body  in  the  real  world  of  poli¬ 
tics  is  effectively  the  aggregation  of  preexisting 
collective  bodies.  A  plurality  of  collective  bodies — 
territorial  communities,  associations,  organized 
interests,  political  movements  and  parties,  and  so 
on — decides  to  constitute  a  unitary  political  body 
with  representatives  who  lead  them  and  act  as 
their  spokespeople.  It  is  thus  represented  to  the 
external  world  through  these  spokespeople,  and  a 
plurality  of  wills  is  thereby  transformed  into  a  col¬ 
lective  will.  We  might  ask,  however,  how  can  rep¬ 
resentatives  perform  this  task?  If  the  collective 
body  consists  of  heterogeneous  parts,  willing  or 
induced  to  cohabit,  but  possessing  different  and 
often  conflicting  desires,  what  will  these  represen¬ 
tatives  effectively  represent? 

Of  course,  no  form  of  representation  is  ever  a 
perfect  replication  of  the  original  concept.  To 
represent  means  to  interpret,  and  interpretations 
are  inevitably  arbitrary  and  distorting.  To  repre¬ 
sent  something — and  political  representation  is  no 
different  from  literary,  pictorial,  photographic,  car¬ 
tographic,  or  any  other  form  of  representation — can 


also  signify  improving  and  embellishing  it. 
However,  this  can  never  be  a  perfect  replication, 
especially  so  in  the  case  of  a  collective  body.  Here, 
to  represent  implies  generalizing,  abstracting,  and, 
basically,  manipulating,  especially  when  the  man¬ 
date  is  reduced  to  such  a  simplified  and  stylized 
act  as  voting — in  other  words,  a  mark  on  a  ballot 
paper.  Allegiance  is  therefore  a  constant,  theoreti¬ 
cal,  and  political  obsession  for  political  represen¬ 
tation.  What  defines  it  and  to  what  degree  this 
mirrors  the  original  is  another  fundamental  issue 
of  the  debate. 

Some  theorists  even  deny  the  idea  of  a  man¬ 
date  conferred  on  representatives  and/or  of  rep¬ 
resentation  itself.  Two  illustrious  authors — Hans 
Kelsen  and  Joseph  A.  Schumpeter — crudely 
define  representation  as  being  nothing  more  than 
a  “fiction,”  which,  according  to  Kelsen,  is  useful 
for  its  legitimizing  role  since  it  convinces  those 
governed  that  they  are  governing,  while,  in 
Schumpeter’s  view,  this  fiction  is  inevitably  dam¬ 
aging  because  electoral  mechanisms  force  those 
governing  to  be  opportunistic  rather  than  to  pur¬ 
sue  the  collective  good.  Therefore,  what  is  con¬ 
ventionally  called  a  representative  government  is 
merely  a  device  legitimizing  the  division  of  work 
between  the  governed  and  the  governing,  trans¬ 
forming  electoral  consensus — which  is  often 
fictitious  given  the  workings  of  electoral  systems — 
given  by  voters  to  a  political  party,  a  coalition,  or 
a  leader,  into  the  will  of  the  collective  body. 
Rather  than  hereditary  or  other  principles,  repre¬ 
sentation  is  the  mechanism  that  modern  societies 
have  adopted  to  validate  their  governors.  This  is 
not  without  its  complications,  beginning  with  the 
fact  that  representative  regimes  not  only  autho¬ 
rize  the  representatives  of  the  majority  party 
obtaining  electoral  success  to  govern  but  also 
give  a  voice,  again  through  representatives,  to  the 
defeated  minorities,  thus  enabling  and  encourag¬ 
ing  them  to  interact  with  the  representatives  of 
the  majority.  It  is  then  expected  that  everyday 
politics  will  establish  in  what  way  and  to  what 
degree  this  takes  place. 

However,  even  if  what  we  call  “representation”  is 
not  really  representative,  it  still  offers  a  series  of 
extremely  valuable  advantages:  Mandates  (at  least 
in  the  West)  are  limited  over  time,  there  is  the  oppor¬ 
tunity  to  replace  governments,  and  the  electoral 
procedure  allows  citizens  to  voice — individually  or 
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collectively,  peacefully  or  otherwise — their  opin¬ 
ions  and  interests.  Above  all,  representation 
creates  for  both  the  governed  and  the  governing 
the  not  inconsiderable  opportunity  of  communi¬ 
cating  fairly  regularly  with  each  other.  The  pro¬ 
cess  is  anything  but  linear,  and  the  equality  of 
citizens  is  a  myth;  however,  representation  estab¬ 
lishes  an  incentive  structure  that  enables  the 
governing  to  govern  and  to  minimize  conflict 
with  those  governed.  Of  course,  these  incentives 
do  not  always  work  because  the  governed  are  a 
heterogeneous  body  with  variegated  interests, 
and  satisfying  the  wishes  of  all  is  thus  inconceiv¬ 
able.  However,  to  date,  representation  has 
generally  worked,  albeit  with  some  notable  acci¬ 
dents  along  the  way. 

Experience  tells  us  that  representation  also  con¬ 
siderably  reduces  the  costs  of  cultural,  political, 
and  social  conflicts.  These  do  not  disappear,  but 
the  electoral  and  parliamentary  procedures  of  rep¬ 
resentation  increase  the  chances  of  their  being 
managed  peacefully.  A  represented  interest  is  an 
acknowledged  interest  and,  once  it  has  accepted 
representation  (and  is  allowed  to  be  represented), 
its  explosive  potential  is  attenuated.  It  may  not 
happen:  Some  interests  transcend  the  boundaries 
of  representation  and  maintain  a  position  of  reject¬ 
ing  it  and  refuting  the  rules.  While  there  are  many 
such  examples,  these  only  serve  to  confirm  the  fact 
that  representation  and  representative  government 
are  inevitably  somewhat  imperfect. 

Given  the  number  of  controversial  issues  and 
questions  surrounding  the  topic  of  “representa¬ 
tion,”  it  is  no  surprise  to  discover  that,  for  the  past 
3  centuries,  theoretical  work  has  attempted  to  give 
form  to  the  necessarily  provisional  device  of  repre¬ 
sentation.  The  great  challenge  facing  representa¬ 
tion  at  the  outset  is  what  could  be  defined  as  the 
challenge  of  pluralism.  This  is  a  challenge,  which, 
to  date,  aside  from  momentary  defeats,  representa¬ 
tion  has  generally  won.  However,  it  is  a  challenge 
that  is  not  yet  over.  It  has  been  (and  continues  to 
be)  very  demanding,  and  it  is  interesting  to  observe 
how  theory  has  approached  the  issue. 

What  can  be  defined  as  the  “pure”  theory 
rejects  pluralism  outside  the  political  arena,  based 
on  the  logic  that  there  is  a  fundamental  unity  of 
the  collective  body  and  its  representatives  and  that 
representation  essentially  aims  to  legitimatize  the 
representatives’  authority.  Representation  denotes 


the  population  as  a  whole,  and  any  mandate  dic¬ 
tated  by  the  represented  to  its  representatives — who 
are,  therefore,  “trustees” — is  to  be  discouraged 
because  it  would  prejudice  the  independence  and 
ability  of  representatives  to  transcend  specific  inter¬ 
ests  when  in  pursuit  of  what  the  theory  solemnly 
defines  as  the  common  good. 

Alongside  this  pure  theory,  however,  there  is  an 
“impure”  or  pragmatic  theory,  which  over  time 
has  undermined  the  former.  This  enjoys  a  more 
relaxed  relationship  with  pluralism  since  it 
acknowledges  the  composite  nature  of  the  collec¬ 
tive  body  and  comes  to  different  conclusions. 
From  this  perspective,  there  is  nothing  wrong  with 
seeing  the  representative  as  a  delegate  or  a  spokes¬ 
person  for  particular  interests,  but  it  is  to  be  hoped 
that  such  representatives  will  be  capable  of  rising 
above  these  interests  to  have  a  more  farsighted 
view.  The  synthesis  effect  is  achieved  through  the 
interaction,  whether  through  negotiation  or  con¬ 
flict,  between  different  interests  within  the  repre¬ 
sentative  body.  The  outcome  is  not  necessarily 
more  modest  than  what  was  promised  but  is  effec¬ 
tively  something  not  quite  achieved  by  the  pure 
theory. 

Obviously,  under  these  conditions,  no  govern¬ 
ment  is  sufficiently  representative  (not  least  because 
the  objectives  are  very  unclear),  and  this  insuffi¬ 
ciency  has  been  a  recurrent  issue  within  public 
debate.  There  is  always  someone  ready  to  denounce 
the  lack  of  representation  and  to  look  for  repara¬ 
tion.  There  is  always  some  political  force  that 
claims  to  be  more  representative  than  its  competi¬ 
tors.  This  is  still  happening  at  the  dawn  of  a  new 
millennium  when  representation  is  clearly  encoun¬ 
tering  considerable  difficulties. 

In  recent  decades,  the  decisions  taken  by  author¬ 
ities  not  based  on  any  electoral  mandate,  or  that 
have  minimal  representative  links  to  those  affected 
by  their  decisions,  have  multiplied.  As  an  example, 
it  is  sufficient  to  consider  the  deterritorializing  pro¬ 
cesses  of  political  authorities  provoked  by  global¬ 
ization.  Moreover,  in  large  part,  thanks  to  the 
media,  the  sophisticated  manipulative  methods 
with  which  electoral  competitions  are  now  held 
shape  (and  distort)  communications  between  the 
governed  and  the  governing.  Democratic  regimes 
have  recently  tended  to  enhance  the  role  of  the 
executive  power  to  the  detriment  of  the  elected 
assemblies.  The  executive  is  always  a  representative 
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body,  as  is  its  leader,  because  it  is  chosen  by  the 
electorate.  But  what  is  the  level  of  representation  of 
an  individual  and  the  limited  committee  working 
with  them  in  relation  to  the  complexities  of  con¬ 
temporary  democracies?  If  representation  implies 
interpretation,  will  what  contemporary  democra¬ 
cies  propose  be  all  too  simplistic  and  dismissive? 
Finally,  according  to  current  theories,  there  are  also 
those  who  want  to  remove  representation  from  the 
political  vocabulary,  acknowledging  the  possibility 
both  of  forms  of  representation  not  supported  by  a 
popular  mandate  and  of  forms  of  democracy  not 
founded  on  representation. 

Put  simply,  it  is  possible  to  divide  representation 
as  occurring  in  three  main  periods: 

f.  individual  representation,  in  which  the 

representatives  were  chosen  from  a  minority  of 
notables,  a  limited  group  based  on  wealth  and 
culture;  social  prestige  was  thus  the  decisive 
element  in  electoral  competition; 

2.  collective  representation,  organized  through 
political  parties;  and 

3.  post-political  party  collective  representation — 
that  is,  the  current  era,  which  is  fairly  uncertain 
and  complicated. 

The  outlines  of  these  three  epochs  will  be  described 
below  as  we  disentangle  the  key  (pure  and  impure) 
theories  relevant  to  the  definition  and  social  con¬ 
struction  of  representation. 

Pure  Representation  Theory 

Modern  political  representation  was  invented  in 
Great  Britain  in  the  17th  century  by  reusing  and 
renewing  long-standing  institutions.  Needing  to 
replace  a  monarch  who  ruled  by  divine  right  and 
was,  therefore,  the  representative  of  God  on  earth, 
parliament  had  the  idea  to  designate  itself  as  the 
representative  of  the  people.  The  problem  was  that 
while  God  is  abstract  and  remote,  the  people  are 
rather  closer  and  more  concrete.  Although  it  is  not 
entirely  clear  who  and  what  “the  people”  were, 
representation  had  a  precedent  in  that  of  the 
ancien  regime. 

From  the  time  that  large  territorial,  political 
entities  began  to  organize  themselves  around  a 
central  power,  the  problem  arose  for  sovereigns  of 


persuading  civic  and  religious  potentates,  collec¬ 
tive  bodies,  and  lower  level  territorial  units  to 
acknowledge  their  authority  and  cooperate  with 
them.  The  solution  adopted  was  that  of  consulta¬ 
tion  through  agreements  and  reciprocal  recogni¬ 
tion  of  powers,  privileges,  and  rights.  This  practice 
also  had  the  advantage  of  assembling  in  a  single 
place,  and  at  the  same  time,  the  entire  collective 
body,  whose  unity  was  thereby  “represented” 
while  at  the  same  time  “representing”  (since  it  was 
frequently  impossible  for  the  existing  potentates  to 
be  directly  present). 

However,  both  the  transition  to  modernity 
(which  eroded  the  old  centers  of  power  and  estab¬ 
lished  new  ones)  and  the  growing  monarchical 
ambitions  of  unifying  and  governing  the  territory 
required  a  change  in  representation  methods.  The 
dispersion  of  power  over  a  multiplicity  of  poten¬ 
tates,  which  had  been  confirmed  by  ancien  regime 
representation,  had  become  intolerable  for  large 
territorial  states.  There  were  two  main  alternatives 
available  to  such  states.  The  first  was  that  the 
monarchy  should  limit  the  competing  powers 
without  formally  annulling  them,  as  would  be  the 
case  in  the  French  model:  The  States  General 
ceased  being  convened  in  1614.  The  second  was 
that  of  unifying  the  competing  powers  in  a  single 
body  representing  the  whole  of  collective  society, 
as  was  the  case  in  England. 

The  extraordinary  theoretical  move  devised  by 
Hobbes  was  that  of  representing  the  entire  collec¬ 
tive  through  an  artificial  figure  and  imagining  it  to 
be  constituted  no  longer  by  preestablished  powers 
but  by  a  crowd  of  autonomous  individuals  who 
renounce  their  powers  through  a  social  contract. 
However,  history  ignored  his  preference  for  an  indi¬ 
vidual  as  the  representative  of  the  multitude  and  the 
owner  of  sovereignty,  and  in  the  1640s,  following 
the  conflict  with  the  King  (although  the  Malmes¬ 
bury  philosopher  had  not  excluded  this  second  pos¬ 
sibility),  England  opted  for  a  parliament,  which, 
according  to  Hobbes,  is  nonetheless  also  a  single 
artificial  figure  constituting  the  unity  of  the  people 
rather  than  the  dispersion  of  the  crowd. 

Once  it  had  liberated  itself  of  power  by  divine 
delegation,  the  elite  sitting  in  parliament  pursued 
the  following  logic:  It  was  prepared  to  represent 
the  people  but  was  little  inclined  to  hand  over  the 
power  conquered  in  the  people’s  name.  On  the 
contrary,  it  wanted  to  concentrate  this  power, 
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monopolistically,  among  its  members.  However,  if 
“the  people”  is  an  invention  of  parliament  and  if  it 
exists  only  in  so  much  as  it  is  represented,  the  con¬ 
straints  of  representation  are,  obviously,  different 
from  those  of  the  ancien  regime.  Not  only  are  the 
people  unified  by  a  single  mandate  but  the  man¬ 
date  holder  also  receives  a  mandate  that  is  no 
longer  bound  by  the  instructions  of  a  preexisting 
power.  Modern  representation  thus  requires  that 
the  representative  is  not  a  delegate  but  a  trustee 
who  looks  after  the  interests  of  the  collective  as  a 
whole. 

As  frequently  occurs,  practice  struggles  to  cor¬ 
respond  to  the  ideal  prescriptions  of  theory.  If  “the 
people”  is  an  invention,  there  nonetheless  exists  a 
scattered  population  of  aggregated  interests  that 
want  to  have  a  voice.  Further,  as  representation  is 
materially  based  on  a  formal  act  such  as  elections 
through  which  the  mandate  is  manifested,  it  is  suf¬ 
ficient  that  elections  are  minimally  competitive  so 
that,  on  the  one  hand,  the  electorate  (even  when 
carefully  selected  through  requirements  of  property 
and  wealth)  may  advance  requests  in  exchange  for 
their  vote  and,  on  the  other,  those  aspiring  to  the 
mandate  may  encourage  such  requests.  This  raises 
the  question,  however,  given  the  stiff  competition 
for  a  parliamentary  seat,  of  how  candidates  can  be 
expected  to  merely  display  their  personal  merits 
and  capacities  and  appeal  to  generic  superior  val¬ 
ues  and  interests,  thus  renouncing  the  attempt  to 
attract  voters  through  the  offer  of  some  specific 
advantage. 

The  debate  surrounding  the  nature  of  the  man¬ 
date  thus  immediately  became  entangled.  Among 
those  who  attempted  to  unravel  the  knots,  the 
name  of  Edmund  Burke  leaps  to  mind.  He  stressed 
the  idea  that,  over  and  above  what  he  considered 
the  legitimate  pluralism  of  particular  interests,  the 
role  of  parliamentarians  is  not  to  satisfy  the  desires 
of  this  or  that  representative  but  to  pursue,  with 
full  independence  of  judgment,  what  they  consider 
to  be  the  interests  of  the  nation.  The  electorate  is 
mandated  by  the  nation,  which  is  a  much  more 
vast  and  noble  entity  than  those  holding  the  vote. 
Burke’s  parliament,  in  his  famous  1774  Speech  to 
the  Electors  of  Bristol ,  was  not  a  convention  of 
ambassadors  representing  opposing  interests  but  a 
representative  assembly  of  a  single  nation  sharing 
the  same  interests  in  a  common  good.  As  a  conse¬ 
quence,  representatives  should  not  receive  any 


binding  mandate  but  should  simply  and  solely 
obey  their  own  judgments  and  conscience. 

Given  that  they  had  a  number  of  illustrious 
precedents,  Burke’s  pronouncements  were  not 
intended  to  formulate  a  new  theory  of  representa¬ 
tion.  Rather,  he  wanted  to  persuade  voters  that 
elected  representatives  would,  first,  provide  the 
best  possible  service  without  receiving  their  instruc¬ 
tions  or  having  generalized  aims  and,  second, 
would  subsequently  account  for  their  actions. 
Since  political  circumstances  are  always  the  stimu¬ 
lants  of  theory,  Burke  was  very  familiar  with  the 
Whig  ideas  that  opposed  those  of  the  Tories, 
according  to  whom  it  was  the  monarch  who  repre¬ 
sented  the  nation — while  parliament  should  con¬ 
tent  itself  with  transmitting  local  demands  and 
partial  interests. 

In  practice,  the  impact  of  Burke’s  ideas  was  lim¬ 
ited.  An  indication  of  this  was  possibly  the  fact 
that  his  speech  was  followed  by  a  resounding  elec¬ 
toral  defeat.  However,  his  arguments  had  been 
established  as  part  of  the  theory  of  representation, 
reappearing  on  the  other  side  of  the  Channel 
where  revolutionary  political  theory,  and,  above 
all,  Emmanuel  Sieyes,  laid  down  a  fairly  similar 
but  much  more  linear  and  decisive  path.  The 
French  situation  in  1789,  however,  was  different 
possibly  because  history  did  not  allow  it  to  update 
and  adapt  preexisting  institutions  but  required  it 
to  found  and  construct  new  ones  and  possibly  also 
because  there  were  different  challenges  against 
which  theory  and  practice  were  measured. 

Rejecting  Jean-Jacques  Rousseau’s  praise  for 
democracy  and  condemnation  of  representation  as 
incompatible  with  modern  society,  Sieyes  not  only 
argued  that  society  should  request  that  the  princi¬ 
ples  of  the  division  of  labor  be  applied  to  politics 
and  representative  government,  but,  with  revolu¬ 
tionary  resolution,  he  also  called  for  the  autonomy 
of  representatives.  However,  recalling  Hobbes’s 
lessons,  his  solution,  despite  initially  being  similar 
to  Burke’s,  was  rather  more  drastic.  Sieyes’s  con¬ 
cept  of  society  was  a  compact,  homogeneous,  and 
unified  space  without  any  of  the  asymmetries  of 
ancien  regime  society  against  which  the  Third 
Estate  had  arisen.  Indeed,  given  that  such  a  society 
did  not  exist  in  reality,  rather  than  being  discour¬ 
aged,  Sieyes  invented  his  own.  He  promoted  the 
nation  to  the  role  of  a  third,  virtual  actor,  standing 
between  the  people  and  its  representatives,  hence 
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averting  the  possibility  that  one  would  assume  a 
superior  role  in  relation  to  the  other  since  the 
nation  could  only  be  constituted  through  the  legis¬ 
lative  body.  The  nation  was,  therefore,  a  unitary 
and  indivisible  body  but,  equally,  so  too  was  the 
assembly  as  its  exclusive  spokesperson. 

According  to  Burke,  society  and  its  articulations 
were  not  without  political  relevance.  Worried  by 
memories  of  the  ancien  regime  and  concerned  with 
liberating  at  all  costs  the  actions  of  the  elective 
body  and,  therefore,  the  state,  from  claims  by  par¬ 
tial  interests,  Sieyes  indicated  the  nation  as  the 
only  alternative  to  the  state.  As  for  the  citizens, 
they  would  express  themselves  only  at  the  moment 
of  voting.  Pluralism  was  dissolved  in  the  represen¬ 
tative  body,  the  single  mind  and  voice  of  the 
nation,  together  with  any  inequality  or  diversity 
between  individuals.  According  to  Sieyes,  it  was 
not  even  important  whether  the  assembly  was 
divided  into  majorities  and  minorities:  The  single¬ 
ness  of  the  nation’s  intents  allowed  the  will  of  the 
majority  always  to  coincide  with  the  common 
interest. 

Among  all  the  inventions,  and  all  the  hypocri¬ 
sies,  of  modern  political  theory,  the  latter  appears 
to  be  one  of  the  most  prized  and  persistent.  It  is 
assumed  that  the  individual  choices  made  by 
majority  choice  are  based  on  a  “unanimous  will  of 
the  associates,  ”  equipped  with  the  means — such  as 
a  majority  principle — to  allow  the  collective  body 
to  function  and  pursue  its  own  ends.  Underlying 
the  majority  principle — Sieyes  was  not  lacking  in 
common  sense — there  was  also  an  indication  of 
method:  There  should  first  be  a  division  into 
majorities  and  minorities.  The  elected  had  the 
honor  of  debating  issues  together  and,  therefore, 
according  to  an  expression  that  has  recently  come 
back  into  fashion,  to  “deliberate” — to  discuss, 
negotiate,  persuade,  change  opinion,  and  make 
compromises  in  order  to  arrive  at  a  common  will. 
Further,  Sieyes  perfected  these  arguments  by  for¬ 
mulating  a  plan  of  social  and  cultural  engineering 
(which  the  French  revolutionaries  and  their  succes¬ 
sors  earnestly  cultivated)  and  establishing  a  related 
model  destined  to  be  successful.  We  refer,  of 
course,  to  the  nation,  which,  according  to  Sieyes, 
in  addition  to  its  role  as  a  virtual  political  actor, 
should  also  be  established  as  a  collective  identity  in 
which  the  material  articulations  of  society  could 
truly  be  transformed. 


Impure  Theory  and  the  Revival 
of  the  Pure  Theory 

Burke  and  Sieyes  transformed  representation. 
Members  of  a  class  of  notables,  selected  by  census 
suffrage  (a  system  in  which  votes  are  not  equal  but 
are  weighted  according  to  the  social  rank  of  the 
voter),  wanted  to  entrust  government  to  an  assem¬ 
bly  of  notables,  isolating  it  from  its  environment 
and  thereby  reducing  eventual  conflicts  to  differ¬ 
ences  of  opinion.  However,  there  was  also  an  alter¬ 
native  model.  The  fathers  of  democracy  in  the 
United  States  had  previously  come  up  with  a  more 
pragmatic  solution  in  relation  to  social  and  politi¬ 
cal  pluralism,  possibly  because  the  constituting 
process  took  place  under  different  conditions.  First 
of  all,  they  had  no  ancien  regime  to  exorcize. 
Second,  only  a  handful  of  colonies  sought  self- 
government  and  these  contractually  decided  to 
unite,  giving  themselves  a  constitution  along  with 
common  bodies  of  representation  and  government. 

The  authors  of  The  Federalist  Papers,  and 
James  Madison  in  particular,  were  well  aware  of 
the  threat  of  factions.  Nonetheless,  since  their  exis¬ 
tence  was  seen  as  inevitable,  they  also  believed  that 
any  attempt  to  eliminate  them  would  be  a  threat  to 
liberty.  Undoubtedly,  as  proposed  by  Burke,  filter¬ 
ing  them  through  an  assembly  that  brought 
together  a  wise,  enlightened,  and  patriotic  minor¬ 
ity  ready  to  sacrifice  its  particular  interests  for  the 
good  of  the  nation  would  help  reduce  any  poten¬ 
tial  threat  ( Federalist  Paper  No.  10,  1787). 
However,  it  seemed  better  to  focus  on  the  possibil¬ 
ity  of  applying  a  representative  government  to  a 
state  with  broad  dimensions,  which  contained  a 
numerous  and  diverse  population  among  whom 
interests  would  be  not  only  multiplied  but  also 
dispersed  and  balanced  against  one  another  in  rep¬ 
resentative  institutions. 

In  other  words,  representation  was  a  risk,  but  it 
would  have  been  senseless  to  pretend  to  ignore  it 
like  Sieyes  by  creating  a  virtual  people.  Given  the 
presence  of  an  established  and  peaceful  tradition 
of  self-government,  of  citizen  and  village  assem¬ 
blies,  it  was  considered  preferable  to  focus  on  the 
confrontation  of  opposing  interests  and  on  the 
capacity  of  a  representative  government  to  create 
equilibrium.  The  U.S.  doctrine,  more  realistic  than 
that  in  Europe,  was  thus  not  scandalized  by  the 
idea  of  sectoral  interests  and  constituency  service 
(harshly  stigmatized  in  Europe  as  clientelism)  since 
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it  was  felt  that  solutions  would  always  be  found 
for  large  collective  issues. 

The  United  States  was  still  a  distant  province, 
unable  to  influence  significantly  representation  in 
Europe  where,  for  a  further  century,  representative 
theory  would  continue  along  the  path  traced  by 
Sieyes,  convinced  of  the  need  to  exclude  any  par¬ 
ticular  interest  from  the  sphere  of  stateness. 
Moreover,  circumstances  were  such  that,  just  as 
the  requirements  of  wealth  and  ability  that  limited 
access  to  the  vote  were  beginning  to  weaken,  a 
new  and  more  rigorous  version  of  the  pure  theory 
was  created  by  those  who  were  not  political  but 
were,  instead,  a  new  category  of  public  law  schol¬ 
ars  formed  under  the  shadow  of  the  state.  These 
scholars  claimed  that  only  the  state  has  an  autono¬ 
mous  life  since  nothing  precedes  it  and  nothing  can 
oppose  it — neither  society  nor  the  people,  even  less 
so  the  forces  of  social  or  political  pluralism. 

This  area  of  reflection  was  inaugurated  by  a 
German  doctrine,  of  which  an  emblematic  expo¬ 
nent  was  Paul  Laband.  He  disputed  the  concept  of 
representation,  resolutely  rejecting  the  notion  that 
the  Reichstag’s  members  were  representatives  of 
the  people  or  even  that  they  represented  anyone  at 
all.  According  to  Laband,  the  electorate  did  not 
confer  any  mandate  nor  did  it  govern  through  its 
representatives.  Once  voting  had  ended,  public 
participation  definitively  ceased. 

Laband’s  sentiments  were  soon  echoed  in  other 
European  countries.  In  France,  Raymond  Carre  de 
Malberg  similarly  reduced  representation  to  a 
mere  technical  device  for  the  recruitment  of  the 
ruling  political  class.  According  to  Malberg,  the 
representative  represented  the  nation — considered 
as  a  unitary  body,  distinct  from  its  individual  and 
collective  parts  and  even  from  the  state  itself.  The 
members  of  parliament  are  hence  the  organ  of  the 
state,  while  voters  only  exercise  nominal  power 
and  do  not  confer  any  mandate. 

In  reality,  what  this  generation  of  jurists  was 
pursuing  was  another  goal  that  was  much  more 
important  than  the  independence  of  those  elected. 
In  Germany,  the  aim  was  to  reduce  any  possibility 
of  competition  between  elected  bodies,  the  mon¬ 
arch,  and  the  executive’s  power.  In  France  and 
Italy,  the  purpose  was  to  rescale  the  parliamentary 
regime  within  the  state — that  is,  the  parliament, 
the  political  parties,  and  representation  itself,  in 
addition  to  the  pronouncements  of  the  electorate. 


Nonetheless,  a  radical  criticism  of  the  representa¬ 
tive  regime  was  needed  to  address  the  enigma  of 
representation  based  on  electoral  mechanisms. 
This  was  the  task  taken  on  by  Gaetano  Mosca, 
who,  finding  in  this  a  confirmation  of  his  theory  of 
the  political  class,  contrasted  the  normative  struc¬ 
ture  of  the  pure  theory  with  empirical  observation: 
Not  only  were  candidates  and  the  elected  subject 
to  even  the  most  modest  private  interests,  but,  to 
gain  consensus,  they  flattered  the  electorate  and 
actively  encouraged  particular  interests.  Voters, 
according  to  Mosca,  prefer  representatives  who 
look  after  the  interests  of  the  voters.  However,  it  is 
not  just  the  voters  who  elect  the  representatives 
but  the  representatives  who  persuade  the  voters  to 
elect  them,  by  raising  expectations  and  making 
commitments  regarding  their  future  work. 

Further,  Mosca  believed  that  the  impending 
intervention  of  political  parties  was  destined 
to  increase  such  disadvantages.  His  key  idea, 
however — in  truth,  previously  advanced  by  The 
Federalist  Papers — was,  above  all,  that  political 
representation  would  be  unable  to  comply  with 
the  prohibition  of  an  imperative  mandate. 
Theoretical  work  had  denied  this  at  length  but 
should  have  been  aware  of  this.  Indeed,  according 
to  Mosca,  it  was  perhaps  precisely  because  theory 
had  in  reality  always  known  this  that  it  continued 
to  deny  it  with  such  vehemence.  Hence,  it  was  an 
illusion  to  deny  the  existence  of  a  pluralism  of 
interests  outside  the  state,  essentially  because 
political-electoral  competition  multiplies  the  num¬ 
ber  of  candidates,  making  them  spokespeople 
and,  at  the  same  time,  transforming  politics  from 
a  mission  reserved  for  an  elite  identified  with  the 
state  into  a  set  of  opportunities  facilitating  the 
social  ascent  of  individuals  and  groups  seeking  to 
obtain  favorable  political  measures. 

In  discussing  representation,  Mosca  thus  out¬ 
lined  an  entrepreneurial  concept  of  politics  that  was 
destined  to  last.  Max  Weber  further  developed  this 
by  identifying  the  existence  of  the  “party  enter¬ 
prise,”  which  competes  to  obtain  votes  and  public 
roles.  This  idea  was  then  perfected,  in  the  early 
1940s,  by  Joseph  A.  Schumpeter,  who  definitively 
dismantled  the  myth  of  representation  (and  democ¬ 
racy)  and  reread  politics  as  a  marketplace. 
Schumpeter’s  ideas,  inspired  by  U.S.  democracy, 
saw  the  problem  as  lying  in  how  democratic  politics 
was  constructed.  He  considered  there  to  be  a  vast 
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range  of  latent  interests  that  the  political  represen¬ 
tative  brings  to  light,  puts  into  a  form,  and  exploits 
to  his  or  her  own  advantage.  The  political  entre¬ 
preneur  thus  transforms  latent  questions  into  sig¬ 
nificant  political  interests  to  be  used  in  mobilizing 
the  electorate  to  his  or  her  own  advantage.  The 
risk  that  follows  from  this  is  that  government 
actions  may  become  a  mere  subproduct  of  elec¬ 
toral  competition.  In  other  words,  if  the  transfor¬ 
mation  by  candidates  of  the  latent  preferences  of 
voters  into  political  demands  is  a  key  strategy 
within  the  competition  for  power,  it  therefore  con¬ 
stitutes  not  only  a  resource  but  also  a  handicap  for 
political  representation,  which  is  thereby  more 
inclined  to  favor  particular  interests  to  the  detri¬ 
ment  of  general  interests. 

Representation  and  Political  Parties 

The  great  novelty  of  the  20th  century,  as  Mosca 
had  perfectly  understood,  consisted  in  the  advent 
of  mass  political  parties  whose  success  was  strictly 
linked  to  the  introduction  of  universal  suffrage 
(which  they  enthusiastically  promoted).  Mass 
political  parties  were  imposing  machines  with  mil¬ 
lions  of  members  and  solidly  organized  appara¬ 
tuses.  Never  before,  within  national  boundaries, 
had  there  been  such  a  dangerous  challenger  to  the 
authority  of  the  state.  This  produced  apocalyptic 
pronouncements  by  theorists,  of  which  Mosca’s 
was  only  the  precursor.  The  reaction  of  one  of  the 
most  sophisticated  exponents  of  antiparliamentari- 
anism  between  the  two  World  Wars,  Carl  Schmitt, 
was  exemplary  as  regards  its  catastrophic  over¬ 
tones:  He  imputed  no  less  than  the  destruction  of 
representation  and  liberal  parliamentarianism  to 
the  intrusiveness  of  political  parties  and  organized 
economic  interests.  According  to  Schmitt,  once  the 
elected  had  become  party  delegates,  parliaments 
could  do  little  other  than  ratify  decisions  made 
elsewhere. 

It  is  difficult  to  deny  the  fact  that,  with  the 
advent  of  mass  political  parties,  a  demanding 
intermediary  imposed  itself  between  the  electorate 
and  the  elected.  Compared  with  the  parliaments  of 
notables  in  which  the  elected  were  linked  to  each 
other  only  by  fragile  bonds  of  association,  the 
introduction  of  party  discipline  hardened  both 
parliamentary  confrontation  and  political  lines  of 
division.  However,  Schmitt’s  depiction  of  a  situation 


whereby  the  intervention  of  political  parties 
deprives  democratic  parliaments  of  the  possibility 
of  rational  discussion  while  unrestrained  centrifu¬ 
gal  dynamics  lacerate  society  and  irreversibly 
prejudice  the  primacy  of  the  state  and  its  capacity 
to  pursue  the  general  interest  was  dictated,  above 
all,  by  his  own  authoritarian  leanings.  In  fact,  if 
anything,  political  parties  constituted  a  formidable 
reducing  and  reordering  element  as  regards  the 
pluralism  of  ideas  and  interests. 

This  was  the  view  expressed  by  Max  Weber  in 
1918,  following  World  War  I.  Political  parties, 
seen  as  large  mass  associations,  served  to  discipline 
the  conflict  and,  due  to  their  preferred  method  of 
establishing  leadership,  were  a  precious  antidote  to 
the  “iron  cage”  of  instrumental  rationality.  It  was 
not  even  true  that  the  advent  of  political  parties 
destroyed  the  possibilities  for  discussion  and  delib¬ 
eration  but  merely  moved  their  location  to  within 
the  parties  themselves.  Schmitt’s  catastrophist  ideas 
were  challenged,  again  during  the  tempestuous 
Weimar  years,  by  another  illustrious  jurist,  Hans 
Kelsen,  who,  far  from  reprocessing  liberal  princi¬ 
ples,  attempted  to  reconcile  representation  with 
pluralism  in  order  to  recreate  a  credible  form  of 
political  synthesis  in  place  of  that  originally  offered 
by  the  liberal  state. 

According  to  the  Kelsenian  critique,  the  use  of 
the  expression  “representation”  can  even  be  con¬ 
sidered  abusive  as  it  serves  only  to  foment  the  illu¬ 
sion  that  the  people  really  govern.  However,  when 
linked  by  party  political  mediation  to  universal 
suffrage,  what  is  commonly  called  representation 
could  be  invigorated  and  given  new  meaning. 
Kelsen’s  thesis  was  that  the  combination  of  repre¬ 
sentative  institutions  and  universal  suffrage  creates 
a  renewed  structure  of  political  opportunities  that 
in  turn  encourages  old  and  new  political  entrepre¬ 
neurs — previously  concerned  only  with  cultivating 
a  limited  electorate  or  securing  entry  into  the 
political  marketplace — to  mobilize  large  numbers 
through  political  parties.  Thus,  not  only  can  indi¬ 
vidual  citizens,  in  association  with  others,  over¬ 
come  the  conditions  of  marginality  to  which  they 
would  otherwise  be  condemned,  but  even  the 
weakest  interest  groups  are  offered  the  opportunity 
to  join  together,  demonstrate,  and  influence  politi¬ 
cal  decisions.  According  to  Kelsen — an  intellectual 
with  social-democratic  leanings — representation 
may  well  be  a  fiction,  but,  thanks  to  political 
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parties,  it  at  least  allows  the  governed  to  commu¬ 
nicate  with  the  governing,  in  addition  to  balancing 
society’s  inequalities.  Refuting  the  idea  that  mass 
political  parties  prejudice  every  political  synthesis, 
Kelsen  argued  that  it  would  ultimately  be  elected 
bodies  and  the  interactions  between  parties  that 
would  recalibrate  the  pluralism  that  the  latter  have 
previously  reordered. 

Kelsen’s  view  was  revived  after  World  War  II 
when  political  parties  obtained  official  recogni¬ 
tion,  in  some  cases  in  the  newly  established 
Western  European  constitutions.  Whether  based 
on  an  illusion  or  not,  political  parties  renewed  and 
revitalized  representation,  receiving  an  exemplary 
recognition — but  obviously  not  the  only  one — 
from  another  German  jurist,  Gerhard  Leibholz, 
who  had  originally  shared  Schmitt’s  position.  At 
the  end  of  World  War  II,  Leibholz  pleaded  the  case 
for  the  official  recognition  of  political  parties  and 
also  raised  a  question  that  would  soon  backfire  on 
them.  If  political  parties  are,  he  asked,  rightfully 
incorporated  into  the  constitutional  structure,  why 
not  fully  ratify  this  inclusion  with  all  its  conse¬ 
quences?  In  other  words,  he  raised  the  extremely 
delicate  subject  of  the  internal  party  democracy. 

While  this,  of  course,  has  long  been  a  thorny 
issue,  political  parties  can  effectively  represent  it, 
among  other  things,  by  adopting  a  “descriptive” 
method  of  representation — the  driving  force 
behind  “substantive”  representation.  While  pro¬ 
moting  the  social  classes  they  represent  to  elected 
office  and  broadening  the  recruitment  pool  for 
political  leadership,  the  premise  is  that,  by  sharing 
the  same  life  experiences  of  the  represented,  repre¬ 
sentatives  will  become  more  sensitive  to  the  for¬ 
mer’s  needs.  This  understanding  of  political  parties 
was,  above  all,  symbolic.  However,  the  argument 
on  which  descriptive  representation  is  founded  is 
not  to  be  underestimated.  So  much  so  that  it  has 
recently  been  revived  through  propositions  (albeit 
often  unsuccessful)  for  the  introduction  of  quotas 
benefiting  segments  of  the  population  considered 
to  be  underrepresented. 

However,  the  fundamental  novelty  that  political 
parties  embodied  in  the  European  experience  was 
something  different.  They  undertook  the  responsi¬ 
bility  for  a  large  part  of  the  representative  relation¬ 
ship.  They  received  a  mandate  from  citizens  to 
represent  them  and  became  the  repositories  of 
their  collective  hopes  (and  therefore  their  trustees), 


while  representatives  in  elected  assemblies  were 
party  delegates.  As  ever,  of  course,  reality  is  more 
complex  and  often  belies  the  image  both  of  parlia¬ 
ment  as  the  exclusive  place  for  negotiations 
between  parties  and  of  parties  as  an  irremovable 
obstacle  between  the  elected  and  the  electorate. 
Mass  political  parties  were  more  complex,  and  less 
disciplined,  machines  than  those  described  in 
abstract  models.  They  were  the  site  of  bitter  inter¬ 
nal  conflicts  and  interacted  in  very  different  ways 
with  the  various  segments  of  the  electorate:  For 
example,  the  middle  classes  were  treated  differ¬ 
ently  than  the  working  classes,  and  confessional 
parties  were  represented  in  a  different  way  from 
socialist  parties.  Even  the  most  disciplined  parties 
allowed  space  for  direct  contact  between  the 
elected  and  their  electors — clientelism  was  fre¬ 
quently  practiced — and  party  discipline  was  also 
considerably  softened  at  times  in  elected  assem¬ 
blies.  Thus,  while  in  theory  political  parties  (espe¬ 
cially  ideologically  based  parties)  were  trustees,  in 
reality  they  were  much  more  adaptive. 

The  equilibrium  achieved  by  representation, 
thanks  to  political  parties,  was  called  into  ques¬ 
tion  by  their  transformation  due  to  exogenous 
reasons:  social  change,  political  competition,  and 
the  intrusion  of  the  media.  Within  a  generation, 
the  social  transformations  created  by  the  great 
postwar  developments,  the  erosion  of  class  cleav¬ 
ages,  the  demise  of  ideologies,  and  the  downsizing 
of  political  competition,  on  the  one  hand,  made 
parties  less  attractive  as  large  collective  associa¬ 
tions  and,  on  the  other,  encouraged  them  to  dilute 
their  policy  offer,  to  differentiate  between  elector¬ 
ates,  and  to  concentrate  mainly  on  their  leader¬ 
ship.  Already  by  the  mid-1960s,  Otto  Kirchheimer 
had  indicated  the  new  political  technology  of  the 
catch-all  party  and,  15  years  later,  when  public 
financing  of  political  parties  was  introduced,  the 
concept  of  the  cartel  party  (a  party  that  uses  the 
resources  of  the  state  to  maintain  its  position 
within  the  political  system)  served  to  highlight  the 
restrictions  imposed  on  party  competition.  As  a 
result  of  these  changes,  party  machines  have 
increasingly  specialized  in  election  campaigns  and 
the  selection  of  political  personnel  while  simulta¬ 
neously  ceding  a  consistent  part  of  their  represen¬ 
tative  functions  to  interest  groups. 

Previously,  traditional  mass  political  parties  had 
nurtured  the  grand  ambition  of  covering  the  needs 
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of  society  as  a  whole.  At  the  height  of  its  develop¬ 
ment,  sectors  within  society  would  disperse  in  all 
directions  in  the  search  for  new  methods  of  repre¬ 
sentation  and  new  instruments  to  make  themselves 
heard  by  public  authorities.  The  political  scene 
would  be  rapidly  crowded  with  actors  and  organi¬ 
zations  offering  representation  in  competition  with 
the  parties:  entrepreneurial  associations,  trade 
unions,  and  territorial  communities.  At  the  same 
time,  a  multiplicity  of  forms  would  emerge  that, 
technically,  must  be  considered  representative  but 
that  would  claim  legitimacy  on  the  basis  that  they 
offered  possibilities  for  direct  citizen  participation: 
We  refer,  of  course,  to  the  associations  and  collec¬ 
tive  movements  making  up  the  galaxy  of  organiza¬ 
tions  that  has  become  known  as  civil  society.  In 
various  ways,  therefore,  the  pluralism  of  interests 
and  opinions  has  spilled  out  over  the  sides  of  the 
“political  party  as  container,”  running  down  new 
paths.  As  a  result,  the  geometries  and  landscapes  of 
representation  known  to  date  have  been  overturned. 

Representation  After  Political  Parties 

Overcoming  liberal  prejudices  toward  them, 
political  parties  adapted  representation  to  mass  democ¬ 
racy.  However,  toward  the  end  of  the  20th  century, 
ancient  anti-political  party  prejudices 
reappeared,  presenting  themselves  as  the  defense 
not  only  of  representation  but  also  of  democracy. 
The  criticism  was  paradoxical.  Political  parties 
were  simultaneously  accused  of  being  both  an 
excessively  permissive  and  an  overly  rigid  filter  of 
society,  of  representing  too  much  and  representing 
too  little.  They  were  accused  of  allowing  them¬ 
selves  to  be  led  by  electoral  opportunism  and  of 
being  too  amenable  to  partial  interests  (with  highly 
damaging  consequences  for  government  perfor¬ 
mance).  They  were  also  charged  with  being  intru¬ 
sive  bureaucracies  concerned  with  the  interests  not 
of  those  they  were  supposed  to  represent  but  of 
their  executives  and  managers.  Parties  sought,  it 
was  said,  merely  to  optimize  vote  shares  and  pub¬ 
lic  positions,  to  the  detriment  of  technical  compe¬ 
tencies,  the  public  purse,  and  the  spontaneous 
generosity  of  civil  society.  None  of  these  of  course 
is  an  unfounded  criticism. 

As  we  know,  political  parties  are  anything  but 
defunct.  Nonetheless,  they  have  been  profoundly 
changed:  From  large  collective  associations,  they 


have  become  mainly  political  enterprises.  The  car¬ 
tel  party  (as  described  by  Robert  S.  Katz  and  Peter 
Mair)  is  a  very  different  machine  from  the  tradi¬ 
tional  mass  political  party,  having  ceded  a  large 
part  of  its  representative  capacity  to  other  institu¬ 
tions.  Parties  are  also  hotly  pursued  by  the  media, 
which,  through  its  extensive  and  obsessive  use  of 
surveys,  claims  to  represent  public  opinion.  Thus, 
the  great  concern  that  stimulated  the  pure  theory 
of  representation  has  resurfaced,  as  we  can  see  in 
the  work  of  the  U.S.  academic  Hannah  Pitkin, 
who  effectively  illustrated  in  The  Concept  of 
Representation  the  atmosphere  of  uncertainty  sur¬ 
rounding  representation  and  elaborated  an  essen¬ 
tially  normative,  but  empirically  based,  response 
that  recalled  the  lessons  of  The  Federalist  Papers. 

When  Pitkin’s  book  came  out  in  1967,  the 
demise  of  conventional  mass  political  parties  in 
Western  Europe  had  only  just  begun.  However,  in 
the  United  States  this  had  taken  place  some  time 
earlier.  As  a  consequence,  Pitkin  returned  to  the 
argument  (dear  to  those  in  the  United  States)  of 
the  representative  as  delegate,  along  with  that 
regarding  the  systemic  effects  of  representative 
government.  The  representative,  according  to 
Pitkin,  must  demonstrate  the  qualities  of  a  dele¬ 
gate  and  a  trustee  (and,  even  better,  of  a  leader). 
That  said,  a  delegate’s  role  is  far  from  easy.  What 
is  he  or  she  a  delegate  of?  Of  the  constituency 
from  which  he  or  she  was  elected?  Of  a  group?  Of 
a  particular  interest?  Or  of  the  party  that  has 
facilitated  and  sponsored  his  or  her  election? 
Appearing  at  the  dawn  of  the  era  of  collective 
movements  (and  the  rediscovery  of  civil  society), 
Pitkin’s  book  constituted  a  detailed  and  updated 
inventory  of  the  classic  dilemmas  of  representa¬ 
tion.  It  primarily  confirmed  that  the  vote  is  an 
approximate  tool  for  identifying  the  electors’  pref¬ 
erences,  as  well  as  directing  the  electorate’s  choices. 
Among  the  issues  reproposed,  or  raised,  by  Pitkin, 
in  addition  to  those  concerning  delegates  and 
trustees,  the  most  striking  are  those  about  stand¬ 
ing  for  and  acting  for  and  “descriptive”  and  “sub¬ 
stantive”  representation.  However  vital  political 
representation  may  still  be,  Pitkin  argues,  it  is  a 
problematic  tangle  from  which  it  would  be  an 
error  to  expect  too  much.  What  counts,  above  all, 
is  the  overall  performance  of  representative  gov¬ 
ernment.  Representation  requires  the  presence  of 
the  people  in  government  activities,  even  if  it  does 
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not  permit  them  to  act  directly  and/or  individu¬ 
ally.  To  this  end,  representatives  must  know  how 
to  act,  when  required,  based  on  what  they  believe 
in  good  faith  to  be  the  interests  of  the  represented. 
The  latter,  in  turn,  will  need  to  monitor  the  actions 
of  their  representatives — with  both  sides  seeking 
to  avoid  conflict  between  the  preferences  of  the 
represented  and  the  decisions  of  the  representa¬ 
tives.  Representation,  therefore,  must  be  not  only 
responsive  but  also  accountable. 

This  formula  does  not  preclude  representatives 
from  making  decisions  independently  of  prefer¬ 
ences  expressed  by  the  citizenry;  however,  this  can 
happen  only  in  those  cases  where  representatives 
provide  public,  credible  justifications  that  allow 
the  public  to  assess  and  sanction  the  choices  made 
in  their  names.  Government  ceases  to  be  represen¬ 
tative,  Pitkin  concludes,  when  the  contrast  between 
the  work  of  the  representatives,  general  interest, 
and  public  opinion  becomes  systematic.  The  plu¬ 
rality  of  the  representatives  and  the  possibility  of 
them  not  only  deliberating  together  but  also  recip¬ 
rocally  controlling  one  another  combine  to  reduce 
the  risks. 

An  heir  to  the  “impure”  theory  of  political  rep¬ 
resentation,  Pitkin  seeks  to  reconcile  democracy 
and  representation.  She  acknowledges  the  intrinsic 
difficulty  of  their  cohabitation,  admitting  that  the 
balance  between  the  two  will  always  be  somewhat 
precarious.  However,  as  she  recently  emphasized, 
it  is  not  necessarily  a  foregone  conclusion  that  the 
marriage  between  democracy  and  representation  is 
a  thing  of  the  past.  Rather,  it  could  yet  be  saved  by 
strengthening  the  role  of  citizens  and  injecting  rep¬ 
resentative  institutions  with  a  more  substantial 
dose  of  self-government  and  participation.  Indeed, 
the  latter  consideration  is  one  of  the  most  fre¬ 
quently  raised  issues  in  the  theoretical  and  political 
debates  of  the  past  decade. 

The  Last  Metamorphosis? 

The  transformation  and  repositioning  of  political 
parties  have  had  enormous  effects  on  representa¬ 
tion.  Being  unable  to  herald  a  return  to  the  liberal 
model  of  representation,  typical  of  a  society  of 
notables,  it  has  led  instead  to  the  rediscovery  of  a 
double  circuit  of  representation.  The  official  cir¬ 
cuit,  in  the  main,  connects  the  electorate,  parties, 
parliament,  and  the  executive,  where,  due  to  the 


much  discussed  “presidentialization”  of  politics, 
the  mandate  conferred  by  voters  has  been  signifi¬ 
cantly  strengthened,  making  the  head  of  govern¬ 
ment  its  direct  consignee.  Thus,  a  “lengthening”  of 
the  representation  relationship — hinged  on  the 
personalization  of  government  action — has 
occurred.  Moreover,  this  relationship  is  often 
tinged  with  strong  plebiscitary  tones. 

While  political  parties  appear  to  have  special¬ 
ized  in  the  selection  of  leadership  candidates  as 
well  as  electoral  marketing,  this  does  not  exclude 
them  from  every  representative  activity.  They 
remain  as  gatherers  of  interests,  and  great  conflicts 
of  interest  still  take  place  within  them.  However, 
the  “presidentialization”  of  democratic  regimes 
has  created  a  more  rigid  filter  than  was  the  case  in 
the  past.  The  position  and  status  of  the  representa¬ 
tive  institution  par  excellence — that  is,  parlia¬ 
ment — is  decidedly  less  certain,  however.  It  would 
be  wrong  to  say  that  parliaments  have  become 
marginal  institutions,  given  that  they  still  provide 
for  the  preparation  of  legislative  norms  and  their 
members  generally  maintain  constant  contact  with 
the  electorate.  However,  they  are  now  usually  far 
more  subordinate  to  the  executive’s  power  than 
was  previously  the  case. 

Rejected  by  political  parties,  interests  have 
managed  to  establish  a  second  unofficial  and 
nonelectoral  path  that  stands  in  competition 
with  the  official  one.  This  phenomenon  had 
already  assumed  importance  thanks  to  neo¬ 
corporate  negotiations  (involving  collaboration 
between  unions  and  the  government),  but  it  has 
become  even  more  crucial  thanks  to  the  social 
changes  brought  about  by  post-Fordism  and  glo¬ 
balization.  Representation  of  interests,  as  a 
result,  has  been  separated  from  electoral  repre¬ 
sentation:  For  example,  in  Europe,  the  phenom¬ 
enon  of  lobbies,  well-known  in  U.S.  politics,  has 
taken  root  and  appears  to  be  flourishing.  So 
much  so  that  it  has  been  fully  recognized  and 
validated  by  the  European  Union  (EU).  Further, 
the  unofficial  path  has  been  widened  to  embrace 
advocacy  groups,  nongovernmental  organiza¬ 
tions,  and  social  movements. 

What  conclusions  can  we  draw  about  these 
innovations?  Not  that  official  representation  has 
become  optional  for  democratic  regimes.  This  is 
confirmed  both  by  the  debate  about  quotas  for 
both  elected  and  nonelected  positions — to  help 
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disadvantaged  social  groups — and  the  debate  on 
political  rights  for  immigrants.  Representation  is 
thus  still  a  decisive  condition  of  existence  and  rec¬ 
ognition.  It  still  implies  that  the  people  (despite  the 
problems  surrounding  that  term)  must  be  taken 
into  consideration.  Nonetheless,  we  can  see  a 
swinging  back  of  the  pendulum  toward  the  pure 
theory,  thanks  to  the  prominence  given  to  repre¬ 
sentatives,  whether  official  or  unofficial. 

Ultimately,  it  is  no  coincidence  that  theory  has 
sought  to  replace  the  rhetoric  of  representation  as 
responsiveness  with  that  of  retrospective  judgment. 
Accountability  is  the  new  parameter  against  which 
the  quality  and  legitimacy  of  democratic  regimes 
are  measured.  The  concept  of  accountability  is  dis¬ 
cussed  elsewhere  in  this  encyclopedia;  but,  in  gen¬ 
eral  terms,  we  can  say  that  accountability  is  the  link 
created  between  the  performance  of  representatives 
and  the  expectations  placed  on  them.  Their  degree 
of  receptiveness  with  regard  to  the  preferences  of 
the  governed  is  left  to  the  judgment  of  the  govern¬ 
ing.  It  is  up  to  representatives  to  establish  whether 
it  is  opportune  or  not  to  accept  the  suggestions  of 
voters — evaluated  alongside  indications  from  opin¬ 
ion  polls,  the  media,  and  pressure  from  organized 
interests  and  civil  society.  In  return,  the  right  of 
citizens  to  assess  political  performances  is  legiti¬ 
mized  and  bolstered.  Further,  a  horizontal,  less 
asymmetric  accountability  of  government  activity 
also  takes  place,  thanks  to  both  the  separation  of 
powers  and  private  power.  It  goes  without  saying 
that  what  is  thus  offered  at  the  end  is  a  rather  dif¬ 
ferent  interpretation  of  representation  than  that 
which  previously  prevailed. 

Above  and  beyond  the  rhetoric  surrounding  it, 
representation  implied  an  impulse  from  below  (the 
represented)  to  above  (the  representatives).  The 
rhetoric  of  accountability  casts  those  governing  as 
trustees,  who,  from  their  lofty  position,  decide 
which  policies  should  be  adopted,  interpreting  the 
needs  of  the  governed  and,  above  all,  leaving  to 
the  latter  the  possibility  of  sanctions  afterwards. 
The  motives  that  necessitate  such  a  change  are  the 
dissolving  of  the  large  collective  bodies  of  refer¬ 
ence  (class  and  religious  affiliations,  etc.),  the  very 
evident  growth  of  cultural  pluralism,  the  difficul¬ 
ties  that  political  parties  have  encountered  in 
reconstructing  themselves,  the  tendency  of  the 
media  to  dramatize  issues  while  also  influencing 
the  political  agenda,  and  the  preeminence  assumed 


by  rather  sui  generis  institutions  such  as  those  gov¬ 
erning  the  EU. 

The  rules  established  by  theory  for  accountabil¬ 
ity  are  neither  hurried  nor  superficial.  It  remains  to 
be  established,  obviously,  if  and  to  what  degree 
they  are  effective,  and  there  are  solid  reasons  for 
supposing  that  if  representation  is  a  fiction,  then  so 
too  is  accountability.  Since  representation,  despite 
being  a  fiction,  has  produced  effects  that  continue 
to  be  appreciated,  the  same  reasoning  could  hold 
true  for  accountability. 

However,  the  difficulties  of  representation 
encourage  other  innovations.  Alongside  the  unof¬ 
ficial  circuit  of  interests,  a  third  one  has  been  cre¬ 
ated,  which,  to  varying  degrees,  reproposes  direct 
democracy — not  as  an  alternative  to  the  represen¬ 
tative  form  but  as  a  complement — to  foster  and 
promote  greater  citizen  engagement  in  decision 
making.  An  established  product  of  similar  attempts 
is  that  denoted  by  the  term  governance.  Located  at 
the  crossroads  of  interest  representation  and  citi¬ 
zen  participation,  governance  claims  that  policies 
are  the  product  of  partnership  negotiations  con¬ 
ducted  by  stakeholders  and  public  institutions, 
with  the  latter  no  longer  viewed  as  a  consistent  and 
hierarchical  system  pursuing  the  general  interest, 
but  as  a  varied  constellation  of  agencies  (and  other 
kinds  of  interests)  performing  particular  services. 

Combining  the  representation  of  interests  and 
participation,  governance  appears  to  be  very  insub¬ 
stantial  (particularly  as  a  form  of  direct  democ¬ 
racy).  It  is  no  surprise  therefore  that  attempts  are 
continuing,  both  in  theory  and  in  practice,  to  graft 
forms  of  direct  democracy  onto  the  trunk  of  repre¬ 
sentative  government.  Through  neighborhood 
democracy,  citizen  juries,  participative  budgets, 
and  associative  and  deliberative  democracy,  repre¬ 
sentative  democracy  continues  to  call  on  ordinary 
citizens  to  assist  it,  while  the  features  of  elected 
assemblies — discussion,  confrontation  between 
different  points  of  view,  and  reciprocal  persua¬ 
sion — appear  to  be  enjoying  a  second  youth 
beyond  the  confines  of  such  assemblies  among  the 
citizenry. 

The  effectiveness  of  these  forms  of  course  is  also 
questionable.  It  is  not  a  given  that,  as  promised, 
they  can  indeed  serve  to  transcend  particular  inter¬ 
ests  and  bring  ordinary  citizens  closer  to  politics. 
One  could  argue  that  deliberation  appears  limited 
to  the  periphery  of  the  system  and  that  citizens  are 
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only  periodically  given  a  voice,  with  grassroots 
representatives,  probably  carefully  selected  by 
public  authorities,  taking  the  floor.  However,  as 
often  is  the  case  with  innovations,  the  process  is 
open-ended.  By  way  of  conclusion,  we  might  finish 
with  another  question:  Thanks  to  this  double 
movement  of  political  representation — on  top 
toward  personalized  leadership  and  below  toward 
direct  democracy — is  it  the  case  that,  after  almost 
4  centuries  of  resurrections,  the  phoenix  of  repre¬ 
sentative  government  is  rising  from  the  ashes,  or 
are  these  innovations  simply  diversions  accompa¬ 
nying  the  definitive  burial  of  representation? 
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Representative  Bureaucracy 


The  central  tenet  of  the  theory  of  representative 
bureaucracy  is  that  passive  representation,  or  the 
extent  to  which  a  bureaucracy  reflects  or  mirrors 
the  demographic  characteristics  of  the  population, 
leads  to  active  representation,  or  the  pursuit  of 
policies  reflecting  the  interests  and  desires  of  repre¬ 
sented  groups.  The  theory  is  premised  on  the  belief 
that  such  attributes  or  characteristics  lead  to  cer¬ 
tain  early  socialization  experiences  that  in  turn  give 
rise  to  attitudes,  values,  and  beliefs  that  ultimately 
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help  shape  the  behavior  and  decisions  of  individ¬ 
ual  bureaucrats.  The  theory  of  representative 
bureaucracy  is  based  on  the  idea  that  the  demo¬ 
graphic  composition  of  bureaucracy  should  reflect 
the  demographic  composition  of  society.  Supporters 
believe  that  a  bureaucracy  that  reflects  the  diver¬ 
sity  of  the  general  population  implies  a  symbolic 
commitment  to  equal  access  to  power  and  that 
shared  personal  characteristics  of  distinctive  group 
members  will  offer  a  shared  voice  in  the  policy 
process.  When  members  of  identifiable  groups, 
such  as  a  specific  racial  or  ethnic  group,  become 
public  officials,  they  become  legitimate  actors  in 
the  political  process  with  the  ability  to  shape  pub¬ 
lic  policy  and  implementation.  A  representative 
bureaucracy  provides  a  means  of  fostering  equity 
or  at  least  the  perception  of  equity  in  the  policy 
process.  If  the  bureaucracy  reflects  the  demo¬ 
graphic  origins  of  society,  the  theory  implies  that 
government  will  be  more  responsive  to  the  public 
interest  by  better  ensuring  that  all  politically  sig¬ 
nificant  interests  and  values  are  represented  in  the 
formulation  and  implementation  of  public  policies 
and  programs.  This  entry  continues  with  a  discus¬ 
sion  of  the  existing  three  main  research  streams  on 
such  a  theory  (passive  representation,  active  repre¬ 
sentation,  and  linkage  between  passive  and  active 
representation)  and  concludes  with  remarks  on 
possible  future  developments. 

Research  on  Representative  Bureaucracy 

For  more  than  50  years,  scholars  have  been  inter¬ 
ested  in  both  the  normative  and  empirical  study  of 
representative  bureaucracy.  Three  important 
streams  of  research  have  emerged.  First,  research¬ 
ers  have  focused  on  passive  representation  or  the 
extent  to  which  public  bureaucracies  reflect  the 
demographic  composition  of  society.  Second, 
scholars  have  explored  whether  passive  represen¬ 
tation  affects  active  representation.  Third,  scholars 
have  explored  factors  that  moderate  the  linkage 
between  passive  and  active  representation. 

Passive  Representation 

Studies  dating  back  to  the  1960s  examine  pas¬ 
sive  representation  or  the  extent  to  which  women 
and  racial  and  ethnic  minorities  are  employed  in 
the  public  sector.  Researchers  in  the  United  States 


first  focused  primarily  on  race  and  ethnicity  as  the 
demographic  characteristics  of  interest,  but  similar 
approaches  were  later  applied  to  examine  the  rep¬ 
resentation  of  women  in  public  organizations. 
Because  of  the  marked  social,  cultural,  and  eco¬ 
nomic  differences  among  different  minority  groups, 
research  on  representation  in  the  United  States 
moved  from  combining  minorities  into  one  cate¬ 
gory  to  splitting  minorities  into  four  groups — 
African  Americans,  Hispanics,  Native  Americans, 
and  Asian  Americans.  Researchers  have  examined 
a  number  of  other  demographic  factors,  such  as 
age  and  disabilities  and  the  interaction  between 
race/ethnicity  and  gender.  In  the  U.S.  context,  race, 
ethnicity,  and  gender  are  considered  the  most 
salient  characteristics  because  numerous  politically 
relevant  attitudes  and  values  are  defined  along 
these  two  dimensions. 

In  early  works,  scholars  examined  aggregate 
public  employment  trends  to  ascertain  the  extent 
to  which  women  and  minorities  were  broadly  rep¬ 
resented,  but  to  understand  the  phenomenon  bet¬ 
ter,  subsequent  studies  segmented  the  employment 
figures  by  organizational  level  and  occupational 
category.  While  women  and  some  minorities 
appear  to  be  reasonably  well  represented  in  terms 
of  presence  in  public  organizations  (penetration), 
they  tend  to  be  overrepresented  in  the  lower  eche¬ 
lons  of  bureaucracies  and  underrepresented  in  the 
managerial  and  executive  ranks  (stratification). 
Penetration  refers  to  how  well  minorities  and 
females  are  represented  in  total  numbers,  while 
stratification  is  concerned  with  how  well  these 
groups  are  represented  throughout  the  career  lad¬ 
der,  particularly  in  senior  executive  positions. 
Thus,  penetration  examines  the  quantity  of 
employment,  and  stratification  provides  informa¬ 
tion  about  the  quality  of  the  employment. 

Mary  Guy’s  work  shows  that  while  American 
women  have  found  great  success  entering  public 
service,  they  lag  behind  men  in  their  ability  to 
advance  to  the  highest  ranks  and  tend  to  be  dispro¬ 
portionately  located  in  certain  agency  types. 
Similarly,  African  Americans  and  Hispanics  have 
had  difficulty  advancing  to  the  highest  levels  of  pub¬ 
lic  service.  While  women  and  minorities  have  gained 
greater  access  to  positions  in  government  in  the  past 
several  decades,  a  key  challenge  facing  public  orga¬ 
nizations  is  advancing  women  and  minorities  into 
positions  of  greater  responsibility  and  power. 
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Active  Representation 

Students  of  representative  bureaucracy  also  sug¬ 
gest  that  bureaucracies  broadly  representative  of 
the  general  public  should  produce  policy  outputs 
that  meet  the  needs  of  all  citizens.  Here,  scholars 
have  examined  the  relationship  between  employ¬ 
ment  of  minorities  and  women  and  agency  outputs 
and  outcomes  affecting  these  groups.  Specifically, 
research  has  examined  the  relationship  between 
demographic  representation  and  disciplinary 
actions  and  ability  groupings  in  school  systems, 
charges  or  complaints  of  discrimination  filed  by  a 
regulatory  agency,  child  support  collection,  hous¬ 
ing  loan  eligibility  determinations  made  by  a  fed¬ 
eral  agency,  federal  procurement  decisions,  and 
student  educational  achievement. 

The  evidence  is  largely  supportive  of  the  linkage 
between  passive  and  active  representation  for 
minority  groups.  In  the  first  study,  linking  demo¬ 
graphic  representation  and  policy  outcomes, 
Kenneth  J.  Meier  and  Joseph  Stewart  found  that 
the  increased  presence  of  African  American  street- 
level  bureaucrats  (e.g.,  schoolteachers)  had  a  sig¬ 
nificant  effect  on  policy  outcomes  favoring  African 
American  students.  In  a  similar  approach,  John 
Hindera  found  that  as  the  employment  of  African 
Americans  in  the  U.S.  Equal  Employment  Oppor¬ 
tunity  Commission  increased,  discrimination 
charges  filed  on  behalf  of  that  group  also  increased. 
Sally  Coleman  Selden’s  research  demonstrated  that 
as  the  employment  of  African  Americans,  Hispan- 
ics,  and  Asians  Americans  increased  in  the  Farmers 
Home  Administration,  the  percentage  of  rural 
housing  loan  eligibility  determinations  favoring 
each  group  increased.  Craig  Smith  and  Sergio 
Fernandez  found  that  increasing  minority  repre¬ 
sentation  at  the  senior  executive  levels  resulted  in 
increased  contracting  with  small  minority-owned 
businesses. 

The  evidence  for  women  is  mixed.  Early  studies 
did  not  find  that  increased  representation  of  women 
results  in  policy  outcomes  favoring  women. 
However,  Lael  Keiser  and  her  colleagues  outlined 
several  conditions  that  moderate  the  linkage  between 
passive  and  active  representation  for  women. 
Specifically,  they  note  that  the  following  three  con¬ 
ditions  facilitate  the  linkage  between  passive  repre¬ 
sentation  and  representation  of  women:  (1)  the 
policy  decision  needs  to  directly  benefit  women, 
(2)  the  issue  needs  to  be  defined  as  gendered  or 


salient  for  women,  and  (3)  the  gender  of  the  admin¬ 
istrator  making  the  decision  fundamentally  changes 
the  relationship  or  interaction  between  the  admin¬ 
istrator  and  client.  Kenneth  J.  Meier  and  Jill 
Nicholson-Crotty  found  that  women  file  more  sex¬ 
ual  assault  reports  when  there  are  more  women 
officers  in  police  departments. 

Factors  That  Moderate  the  Linkage  Between 
Passive  and  Active  Representation 

As  noted  above,  implicit  in  the  linkage  between 
passive  and  active  representation  is  the  expectation 
that  civil  servants  will  hold  similar  attitudes  to 
citizens  with  similar  backgrounds  and  experience 
and  that  attitudes,  in  turn,  will  influence  policy 
decisions.  This  expectation  is  founded  on  the  fol¬ 
lowing  logic:  Civil  servants  from  a  particular 
demographic  group  experience  similar  socializa¬ 
tion  patterns  and  therefore  hold  similar  values  and 
beliefs  with  members  of  that  particular  demo¬ 
graphic  group.  Scholars  have  taken  several 
approaches  to  examine  whether  conditions  and 
attitude  moderate  the  link  between  passive  and 
active  representation.  Research  has  demonstrated 
that  active  representation  is  more  likely  to  occur 
when  administrators  have  considerable  discretion, 
or  at  least  the  perception  of  discretion,  in  decision 
making  and  when  policy  decisions  are  meaningful 
or  salient  to  a  particular  group.  Bureaucrats  need 
to  have  enough  discretion  to  translate  their  value 
preferences  into  actions  that  will  improve  policy  or 
program  outcomes  benefiting  represented  groups. 

Some  scholars  have  examined  attitude  congru¬ 
ence  between  minority  and  women  bureaucrats  and 
citizens.  Recently,  Mark  Bradbury  and  J.  Edward 
Kellough  found  that  African  American  administra¬ 
tors  hold  attitudes  very  similar  to  those  of  African 
American  citizens.  Moreover,  they  found  that  White 
administrators  exhibit  very  different  attitudes  from 
African  American  citizens.  There  are  a  few  studies 
that  demonstrate  that  attitudes  shape  role  percep¬ 
tions,  which  subsequently  affect  behaviors  or  pro¬ 
duce  policy  outputs  favorable,  at  least  for  minori¬ 
ties.  Selden  found  that  minority  status  is  largely 
mediated  by  the  adoption  of  a  minority  representa¬ 
tive  role.  That  is,  active  representation  is  driven  by 
the  degree  to  which  minority  bureaucrats  perceive 
of  themselves  as  minority  representatives.  Bradbury 
and  Kellough’s  study  showed  that  African  American 
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bureaucrats  are  more  likely  to  adopt  a  minority 
representative  role  than  White  bureaucrats.  Because 
the  research  examining  attitudes  is  less  clear  regard¬ 
ing  women,  Julie  Dolan  proposes  that  scholars 
measure  and  examine  the  linkage  between  gender 
consciousness  and  active  representation  of  women 
in  the  population. 

Another  factor  that  moderates  the  linkage 
between  passive  and  active  representation  is  orga¬ 
nizational  socialization.  Depending  on  the  nature 
of  how  an  organization  socializes  its  members,  it 
may  either  hinder  or  promote  active  representa¬ 
tion.  In  one  of  the  few  studies  that  examine  this 
factor  directly,  Vicky  Wilkins  and  Brian  Williams 
find  that  an  organization’s  structure  and  processes 
can  hinder  representation.  African  American  police 
officers  revealed  that  they  felt  pressure  to  conform 
to  the  norms  of  their  departments,  which  affected 
both  their  attitudes  and  their  actions. 

Future  Development 

Scholars  interested  in  representative  bureaucracy 
have  much  opportunity.  In  the  United  States,  most 
of  the  existing  research  focuses  on  African 
Americans  or  minorities,  generally.  Future  studies 
need  to  focus  on  the  observed  patterns  of  represen¬ 
tation  of  Flispanics,  Asian  Americans,  Native 
Americans,  and  women.  Internationally,  scholars 
need  to  examine  further  factors  that  affect  the  abil¬ 
ity  of  administrators  to  translate  their  values  into 
actions  and  factors  that  influence  the  adoption  of 
a  representative  role. 

Sally  C.  Selden 
Lynchburg  College 
Lynchburg,  Virginia,  United  States 
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The  word  republic  is  derived  from  the  Latin  res 
publica,  meaning  “the  common  thing”  or  “the 
public  good.”  Cicero,  among  other  Latin  writers, 
translated  the  Greek  politeia  into  res  publica,  and, 
in  55  BCE,  he  wrote  his  famous  political  treatise 
De  Re  Publica.  Although  the  term  republic  has 
been  used  in  a  variety  of  ways  and  historical  con¬ 
texts,  we  can  distinguish  two  main  meanings, 
“substantive”  and  “formal.”  In  the  substantive 
sense,  “republic”  refers  to  a  government  in  which 
the  supreme  power  resides  not  in  a  monarch  or  a 
king  but  in  a  body  of  citizens  entitled  to  vote. 
Thus,  in  a  republic,  power  is  exercised  by  elected 
officers  and  representatives  governing  according 
to  law  and  accountable  to  the  people.  In  this 
sense,  Cicero  refers  to  res  publica  as  res  populi,  a 
thing  or  good  that  belongs  to  the  people,  or  the 
public.  Referring  in  turn  to  the  Greek  city-states, 
the  Roman  republic,  the  American  republic,  the 
French  one,  or  the  Dutch  one,  various  authors 
from  Cicero  to  Niccolo  Machiavelli,  Jean-Jacques 
Rousseau,  James  Madison,  and  Hannah  Arendt 
have  understood  “republic”  as  a  political  model 
(formal).  This  model  is  based  on  the  absence  of 
the  oppressive  rule  of  a  monarch  and  the  estab¬ 
lishment  of  rule  of  law  and  mixed  government,  as 
well  as  a  concern  for  the  public  good  and  virtue. 
The  different  versions  of  the  republican  model  of 
public  virtue  and  collective  liberty  have  been 
opposed  to  liberalism,  as  being  one-sidedly  focused 
on  individual  liberty  and  private  interest.  In  addi¬ 
tion,  the  term  has  referred,  in  recent  times,  to  a 
government  having  a  chief  of  state  who  is  not  a 
monarch  but,  as  a  general  rule,  a  president.  In  this 
minimal  sense,  it  has  been  applied  at  a  global 
scale.  Sovereign  political  units  such  as  Romania, 
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Bulgaria,  or  Iran  (as  an  Islamic  republic)  have 
embraced  this  form  of  government.  A  republic  can 
also  be  a  constituent  political  and  territorial  unit, 
as  seen  in  regimes  such  as  the  Union  of  Soviet 
Socialist  Republics  or  Yugoslavia. 

The  Roman  republican  model,  with  its  respect 
for  rule  of  law  and  results  of  elections,  comple¬ 
mented  by  the  alliance  between  the  rulers  and  the 
ruled  and  sealed  in  victorious  military  combats, 
has  been  revered  from  the  Renaissance  until  the 
present  day.  This  entry  traces  the  different  mean¬ 
ings  and  realities  of  the  republic  through  medieval 
times  and  the  beginnings  of  modernity,  the 
American  and  French  revolutions,  and  into  the 
contemporary  period  when  it  has  continued  to 
capture  the  political  imagination. 

The  Middle  Ages  and  the 
Beginning  of  Modernity 

During  the  Middle  Ages,  res  publica  generally  des¬ 
ignated  the  political  community  in  opposition  to 
the  church.  It  was  composed  of  a  body  politic 
whose  head  was  the  prince  or  the  king.  In  the  late 
Middle  Ages,  new  republics  (e.g.,  the  Swiss  confed¬ 
eration)  appeared  in  Europe  when  a  number  of 
small  states  or  city-states  embraced  republican 
principles  of  government.  Generally,  these  were 
small  states  or  city-states  in  which  the  merchant 
class  had  risen  to  prominence.  In  Italy,  given  the 
absence  of  a  powerful  central  government,  promi¬ 
nent  towns  gained  considerable  independence  and 
adopted  communal  forms  of  government  (e.g.,  the 
Republic  of  Venice  and  that  of  Genoa).  In  Florence, 
the  memory  of  the  Greek  city-state,  the  Roman 
republic  as  well  as  of  the  successful  restoration  of 
the  republic  between  1494  and  1512  inspired  an 
exceptional  generation  of  Florentine  republicans. 
Machiavelli,  its  most  outstanding  representative, 
served  as  second  secretary  in  the  Chancellory  of 
the  restored  Florentine  republic  between  1498  and 
1512.  Machiavelli’s  reinterpretation  of  the  politi¬ 
cal  legacy  of  antiquity  was  based  on  the  notion  of 
virtu — common  liberty  and  love  of  the  country — 
and  inspired  antiroyalist,  republican  literature  in 
17th-century  England  (e.g.,  James  Harrington’s 
republican  tract  Oceania ).  The  republican  revolts 
in  Britain  and  the  Netherlands  were  linked  to  the 
formation  of  a  large  merchant  class  who  prospered 
from  the  trade.  The  Dutch  Revolt,  beginning  in 


1566,  saw  the  Dutch  republic  reject  the  rule  of 
Hapsburg  Spain  in  a  long  conflict  that  would  last 
until  1648.  The  Dutch  republic  resisted  the  great¬ 
est  military  power  of  the  day  and  was  the  wonder 
of  17th-century  Europe.  In  contrast  to  the  Roman 
republican  model  centered  on  military  virtue,  the 
Dutch  republic  was  centered  more  on  mild  com¬ 
mercial  virtues.  The  Dutch  commercial  republic 
brought  together  the  values  of  republicanism  with 
those  of  democracy  and  tolerance.  However,  in  a 
world  of  emerging  large  absolutist  monarchies,  the 
defense  of  the  liberty  of  small  republican  govern¬ 
ments  was  short  lived.  It  was  the  modern  revolu¬ 
tions  in  America  and  France  that  would  leave  a 
definitive  scar  on  the  “system”  of  absolutist  mon¬ 
archies.  The  actions  of  both  nations  marked  the 
beginning  of  the  Democratic  Age  and  brought 
about  a  historical  reversal  of  balance  in  favor  of 
the  republican  and  the  modern  democratic  model 
of  government. 

American  and  French  Revolutions 
and  Their  Aftermath 

The  American  and  French  Revolutions  did  not 
start  as  antimonarchical  movements,  but  they  both 
led  to  the  establishment  of  influential  republican 
regimes.  The  American  Founders  drew  inspiration 
from  the  history  of  the  republics.  Whereas  the  term 
republic  does  not  come  up  in  the  Declaration  of 
Independence,  it  appears,  for  instance,  in  Article  IV 
of  the  Constitution,  which  “guaranteejs]  to  every 
State  in  this  Union  a  Republican  form  of 
Government.”  The  American  Constitution  estab¬ 
lished  a  federative  republic,  whose  justification  is 
to  be  found  in  The  Federalist  Papers.  Using  Publius 
as  their  common  pen  name,  the  authors  of  The 
Federalist  Papers  gave  a  novel  meaning  to  the  term 
republic.  In  contrast  to  a  long-standing  tradition  of 
thinking  epitomized  by  Rousseau,  they  argued  that 
a  republican  regime  is  also  possible  in  a  large  terri¬ 
tory  and  population.  In  addition,  in  the  influential 
Federalist  Paper  No.  10,  James  Madison  opposes 
“republic”  to  direct  democracy,  but  in  the  name  of 
the  democratic-universalistic  principles  of  liberty 
and  equality.  The  term  referred  to  a  government 
indirectly  controlled  by  the  people — in  other 
words,  what  we  call  representative  democracy.  The 
United  States  developed  into  a  federative  republic 
through  a  system  of  indirect  representation  and  the 
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dispersal  of  powers  in  the  republic  out  of  the  con¬ 
cern,  famously  voiced  by  Madison,  to  avoid  cen¬ 
tralized  despotism. 

In  contrast  to  the  American  case,  the  French 
Revolution  resulted  in  a  state  of  permanent  instabil¬ 
ity  and  the  gradual  creation  of  a  centralized  republic. 
The  model  of  the  French  republic,  while  central¬ 
ized,  was  based  on  the  democratic-universalistic 
principles  of  liberte,  egalite,  et  fraternite  (liberty, 
equality,  and  fraternity).  Surely,  Enlightenment 
thinkers  such  as  Voltaire  were  initially  more  inter¬ 
ested  in  the  notion  of  constitutional  monarchy 
than  that  of  republic,  arguing  that  republics  tended 
to  fall  into  anarchy  or  tyranny.  Such  thinkers 
quoted  Montesquieu  who  had  claimed  that  a  city- 
state  should  ideally  be  a  republic  and  had  main¬ 
tained  that  a  limited  monarchy  would  be  better 
suited  to  a  large  nation.  In  fact,  the  1789 
Declaration  des  droits  de  I’homme  et  du  citoyen 
(Declaration  of  the  Rights  of  Man  and  of  the 
Citizen)  and  the  new  constitution  promulgated  in 
1791  were  not  antimonarchical  in  principle,  as  the 
Revolution  looked  initially  for  a  redefinition  of  the 
terms  of  monarchy — for  instance,  the  1791  consti¬ 
tution  acknowledges  the  king  to  be  the  representa¬ 
tive  of  God  and  of  the  nation,  the  instrument 
“from  which  all  powers  emanate.”  The  French 
Revolution  turned  “republican”  only  after  Louis 
XVI’s  “flight  to  Varennes,”  which  resulted  in  his 
condemnation  as  a  traitor.  The  subsequent  radical- 
ization  of  the  political  climate  led  to  the  King’s 
eventual  decapitation,  the  declaration  of  France  as 
a  republic,  and  the  infamous  Reign  of  Terror. 
Radicalization  led  France  to  a  state  of  war  both  at 
home  and  abroad.  Between  1792  and  1815,  there 
was  a  permanent  war  between  France  and  the 
European  monarchies.  The  success  of  France  in 
the  Revolution  pitted  republican  France  against  the 
European  monarchies  and  saw  republics  spread  by 
force  of  arms  across  much  of  Europe  as  a  series  of 
client  republics  were  set  up  across  the  continent. 
The  rise  of  Napoleon  marked  the  end  of  the  First 
French  Republic,  and  his  eventual  defeat  allowed 
the  victorious  monarchies  to  put  an  end  to  many  of 
the  oldest  republics  on  the  continent,  including 
Venice,  Genoa,  and  the  Netherlands.  Napoleon 
aimed  to  build  a  French  continental  Europe  but 
stumbled  especially  in  front  of  the  British  monar¬ 
chy.  The  consequence  was  the  strengthening  of  the 
grip  of  the  European  monarchies  that  crushed  the 


Revolution  of  1848  and  the  awakening  of  nation¬ 
alist  feelings.  Outside  Europe,  the  Napoleonic 
Wars  allowed  some  of  the  states  of  Latin  America 
to  gain  their  independence  (e.g.,  Venezuela  under 
the  leadership  of  Simon  Bolivar).  These  move¬ 
ments,  despite  their  ambiguities,  carried  on  the 
echo  of  emancipation  and  universal  message  of 
the  democratic  revolutions,  gradually  leading  to 
the  association  between  the  concept  of  “republic” 
and  the  modern  democratic  principles  of  equality 
and  liberty. 

The  20th  Century  and  Beyond 

At  the  beginning  of  the  20th  century,  republics  in 
Europe  were  but  a  few.  This  situation  would  radi¬ 
cally  change  with  the  two  World  Wars.  In  both 
wars,  political  mobilization  was,  by  and  large,  not 
driven  by  the  monarchical  principle.  The  after- 
math  of  World  War  I  was  marked  by  the  disman¬ 
tling  of  empires:  The  Austro-Hungarian  Empire, 
the  German  Empire,  the  Russian  Empire,  and  the 
Ottoman  Empire  were  turned  one  by  one  into 
republics.  The  most  violent  conversion  occurred 
during  the  Russian  Revolution  of  1917,  in  which 
the  czar  and  his  family  were  brutally  executed.  In 
Eastern  Europe,  Asia,  and  Cuba,  the  communist 
regimes  brought  to  power — often  by  means  of 
force — meant  the  abolition  of  the  existing  monar¬ 
chies.  The  resulting  communist  republics  (e.g., 
China,  Romania,  and  Albania)  proved  in  practice 
to  be  authoritarian  or  totalitarian  regimes. 

The  formation  of  new  republics  was  also  linked 
to  the  process  of  decolonization.  The  years  after 
World  War  II  saw  most  of  the  remaining  European 
colonies  gain  their  independence,  and  many  of 
them  became  republics.  France,  for  example, 
encouraged  the  establishment  of  republics  in  its 
former  colonies.  However,  in  the  Middle  East, 
Britain  installed  local  monarchies  in  several  colo¬ 
nies  and  mandates,  including  in  Iraq,  Jordan, 
Kuwait,  Oman,  Yemen,  and  Libya.  In  the  follow¬ 
ing  decades,  revolutions  and  coups  overthrew  a 
number  of  monarchs  and  installed  republics.  For 
instance,  in  Iran,  the  Iranian  Revolution  overthrew 
the  monarchy  and  created  an  Islamic  republic. 
Several  monarchies  remain,  and  the  Middle  East  is 
the  only  part  of  the  world  where  several  large 
states  are  ruled  by  monarchs  (e.g.,  Saudi  Arabia, 
Jordan,  and  Morocco). 
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The  fall  of  the  Berlin  Wall  did  not  mean  a  return 
to  the  monarchical  form  of  government.  Today, 
republic  is  one  of  the  most  widespread  terms  des¬ 
ignating  existing  states.  But  the  popularity  of  the 
term  came  at  the  cost  of  its  losing  its  explicative 
power.  It  has  been  applied  to  different  and  even 
opposing  regimes,  from  liberal  democracies  to 
authoritarian  and  even  totalitarian  governments. 
Overall,  the  opposition  between  republic  and 
monarchy  has  become  less  salient.  Especially  in  the 
West,  the  monarchy  has  mainly  a  symbolic  value 
(e.g.,  in  Great  Britain,  Spain,  and  Norway).  Some 
formal  republics  (Syria,  North  Korea)  act  like 
absolutist  monarchies,  totalitarian  regimes,  or  a 
combination  of  both.  In  general,  the  tension  that 
existed  between  the  republicans  and  monarchists 
has  lost  its  impact  in  many  places  in  the  West,  as 
well  as  in  Latin  America  or  Asia,  even  if  it  remains 
a  divisive  issue  in  some  Islamic  countries.  The 
decreased  relevance  of  the  opposition  between 
monarchical  and  republican  regimes  is,  however, 
only  a  part  of  the  story.  In  the  past  decade,  there 
has  been  a  revival  of  the  interest  in  the  substantial 
meaning  of  republic ,  in  relation  to  the  Roman, 
Dutch,  American,  or  French  historical  experiences. 
The  republican  emphasis  on  common  good,  rule  of 
law,  and  public  virtue  has  become  even  more  rele¬ 
vant  in  a  time  of  consumer  individualism  and  dis¬ 
affected  democracies. 
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Republicanism 


In  the  realm  of  political  concepts,  republicanism 
appears  as  the  doctrine  favoring  the  prevalence  of 
the  republican  regime,  whether  this  prevalence  is 
the  outcome  of  some  activism  (eager  to  see  the 
achievement  of  the  republic  considered  as  the  best 
regime)  or  of  intellectual  considerations  (the 
republic  encapsulates  valuable  political  ideas  and 
ideals).  A  more  anecdotic  subset  of  acceptations  is 
linked  with  the  existence  (as  is  the  case  in  the 
United  States)  of  “republican”  parties:  Repub¬ 
licanism  is  no  more  than  the  fact  of  belonging  to 
them.  Republicanism  is  anything  but  a  recent 
lexical  creation:  According  to  English  dictionaries, 
the  word  made  its  grand  debut  in  the  language  in 
the  wake  of  civil  troubles  in  17th-century  England. 
Its  conceptual  and  semantic  roots  go  back  even 
further  than  its  3-centuries-old  existence.  Coined 
from  the  word  republic  (and  the  adjective  derived 
from  it,  republican),  the  idea  of  “republicanism” 
encapsulates  one  of  those  long  intellectual  histo¬ 
ries  that  are  the  privilege  of  only  a  handful  of 
modern  political  concepts.  It  refers  to  a  concept 
inherited  from  antiquity:  the  Roman  res  publica 
(which  resulted  in  the  modern  republic).  As  such, 
it  has  this  rare  supplementary  specificity  among 
political  concepts  of  having  an  authentic  Roman 
origin:  Classical  Greece,  which  has  influenced  in 
so  many  ways  the  political  history  of  the  West,  has 
only  indirect  equivalents  for  the  Latin  res  publica. 
The  success  of  the  concept  is  all  the  more  remark¬ 
able  because,  and  is  sufficiently  measured  by  the 
fact  that,  in  the  2  millennia  of  its  history,  the  word 
has  hardly  changed  in  many  European  languages. 
The  Latin  res  publica  is  still  easy  to  retrieve  from 
the  French  republique  (from  which  the  English 
republic  was  imported),  the  German  republik,  or 
the  Italian  repubblica. 

The  meaning  of  the  word  has  probably  known 
more  significant  changes  than  its  phonetics.  Res 
publica  means  literally  in  Latin  “the  public 
thing.”  Public  in  its  turn  (publicus  in  Latin) 
derives  from  pop  ulus,  which  means  “people,” 
and  thus,  as  Marcus  Tullius  Cicero  (106-43  BCE) 
puts  it  in  his  De  Republica,  “The  public  thing  is 
the  thing  of  the  people”:  “res  publica,  res  populi.” 
This  is  why  the  traditional  translations  of  res 
publica  as  “commonwealth”  or  even  “state” 
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(when  it  is  not  just  “republic”)  are  not  totally 
satisfactory  as  they  do  not  spell  out  what  was 
evident  for  the  Romans  when  they  talked  about 
the  republic:  the  dimension  of  the  collectivity,  the 
dimension  of  the  people. 

This  meaning  is  equally  lost  in  contemporary 
acceptations  of  the  word  republic.  It  has  eventually 
specialized  in  characterizing  a  type  of  political 
regime,  where,  basically,  the  head  of  the  state  is 
not  a  monarch.  Roman  conceptions  do  include  this 
aspect:  The  end  of  the  monarchy  and  the  instaura- 
tion  of  republican  institutions  by  Marcus  Junius 
Brutus  after  the  destitution  of  Tarquin  the  Proud  is 
an  important  part  of  the  political  mythology  of 
Rome.  But  it  is  a  secondary  aspect  compared  with 
the  role  of  the  people,  which  receives  the  real 
emphasis  in  the  Latin  expression.  Being  a  notion 
defined  by  what  it  is  not,  the  modern  conception 
of  the  republic  is  even  vaguer.  It  is  not  surprising 
that  it  applies  to  a  large  variety  of  situations,  from 
the  proudly  secular  French  one  (with  a  history  of 
no  fewer  than  five  different  “republics”  in  the  span 
of  nearly  230  years)  to  the  “Islamic  Republic  of 
Iran,”  created  in  1979,  where  there  is  no  separa¬ 
tion  between  religion  and  the  state. 

“Republicanism,”  as  the  derivative  of  the 
already  protean  notion  of  the  republic,  reflects  its 
many  understandings.  In  English  as  well  as  in 
many  other  European  languages,  the  suffix  ism 
develops  the  root  of  the  word  to  mean  a  doctrine 
or  a  norm,  and  most  modern  English  dictionaries 
follow  these  lines  in  their  definition  of  republican¬ 
ism.  The  two  first  (and  main)  acceptations  screened 
by  the  Oxford  English  Dictionary  are  “the  spirit 
characteristic  of  a  republic  or  republican  ideas; 
support  for  or  adherence  to  republican  principles; 
republican  government  or  institutions”  and  “the 
principles  and  policies  of  the  Republican  Party. 
Also:  support  for  or  membership  of  this  party.” 
However,  as  a  concept  of  political  theory  or 
thought,  “republicanism”  has  retained  many  of 
the  original  traits  of  the  Latin  res  publica.  It  refers 
to  the  crucial  importance  of  the  part  of  social  life 
that  is  appropriated  by  the  form  of  collectivity 
called  a  “people.  ”  This  sector  is  not  to  be  understood 
as  a  material  one  (the  republic  is  not  principally 
the  public  real  estate,  so  to  say)  but  as  some¬ 
thing  more  immaterial:  It  is  the  common  interest 
of  the  people  in  participating  in  the  decisions  that 
affect  their  lives.  Republicanism,  as  the  sets  of 


values  involved  in  this  conception,  largely  draws 
on  a  conception  of  the  people  as  citizens  and  of 
liberty  as  the  supreme  political  value. 

History 

In  its  most  neutral  conception,  republicanism 
refers  to  the  doctrine  of  those  who,  in  established 
monarchies,  want  to  suppress  the  monarchic  sys¬ 
tem.  It  is  the  case  of  the  “republicans”  in  England 
or  Luxembourg,  for  instance:  They  want  to  put  to 
an  end  to  the  monarchy  in  the  United  Kingdom  or 
the  Grand  Duchy  and  have  them  replaced  with 
republics.  By  this  token,  republicanism  would  not 
really  exist  in  republics,  where  the  goal  that 
“republicanism”  ascribes  to  politics  has  already 
been  reached.  But  other  acceptations  prove  to  be 
richer,  when  they  qualify  something  specifically 
ideological:  the  idea  that  the  republic  is  not  a  fact 
but  an  ideal  of  governance  that  has  to  be  deepened 
or  protected  even  in  existing  republics.  In  this  con¬ 
ception,  republicanism  is  less  a  doctrine  than  a 
doctrinal  agenda  for  pursuing  the  attainment  of 
political  goods  considered  as  crucial  in  a  political 
collectivity  defined  by  the  preeminence  of  the 
people.  It  ultimately  defines  the  shaping  of  a  spe¬ 
cific  political  togetherness  far  beyond  the  mere 
question  of  the  republican  form  of  the  regime. 

Such  a  conception  of  the  republic  can  be  called 
normative  as  it  entails  the  consideration  of  norms 
(i.e.,  representations  of  the  world  from  the  stand¬ 
point  of  ideal,  regulating  behaviors).  It  can  trace  its 
origin  back  to  classical  Rome.  It  is  true  that  the 
Romans  did  not  know  the  word  republicanism  (or 
rather  did  not  have  any  equivalent  to  it).  But  they 
linked  their  idea  of  the  republic  with  specific  val¬ 
ues  and  virtues;  they  were  convinced  that  politics 
was  about  the  norms  and  collective  ideals  that 
republican  forms  of  collective  life  reflected  and 
helped  achieve.  To  them,  the  republic  did  not  cor¬ 
respond  solely  to  a  form  of  governance  (such  as 
democracy,  aristocracy,  or  monarchy)  but  to  the 
hope  for  better  governance.  They  thought  of  the 
republic  as  a  specific  regime,  an  inherently  supe¬ 
rior  one,  a  desirable  one;  in  this  sense,  the  Romans 
could  not  separate  the  republic  from  what  we  now 
call  “republicanism.”  This  conception  obviously 
lost  its  preeminence  with  the  rise  of  the  Empire, 
which  followed  the  fall  of  the  Republic.  The 
expression  res  publica  gained  the  more  neutral 
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meaning  of  a  political  collectivity  without  precise 
reference  to  the  values  attached  to  it.  It  is  the 
acceptation  retained  in  medieval  Latin,  and  the 
evolution  of  Roman  languages,  that  even  led  to 
legitimate  talk  of  a  monarchy  as  a  “republic.” 

It  was  only  at  the  dawn  of  modernity,  mostly  at 
the  beginning  of  the  16th  century,  that  the  concept 
of  the  republic  regained  some  of  it  original  specific¬ 
ity.  Since  then,  authors  have  considered  more  and 
more  often  that  even  if  the  term  means  something 
broader  that  what  the  Romans  had  in  mind,  it  could 
not  refer  to  all  types  of  governance.  For  them,  des¬ 
potism  is  mostly  the  kind  of  governance  that  cannot 
be  considered  as  a  republican  one.  We  see  it  ruled 
out  by  Etienne  de  La  Boetie  (1530-1563),  the 
French  author,  in  his  Discourse  About  Voluntary 
Servitude  (1549):  He  insists  that  one  could  hardly 
call  an  absolute  and  tyrannical  political  organiza¬ 
tion  a  “republic.”  Two  centuries  later,  Jean-Jacques 
Rousseau  is  still  undecided  in  his  Social  Contract 
(1762/1997).  When  in  the  second  book  he  notes 
that  “any  legitimate  government  is  a  republican 
one”  (II  6),  he  seems  to  consider  that  republics 
include  monarchies  that  comply  with  his  standards 
of  legitimacy.  But  in  the  third  book,  he  places  the 
“monarchic  government”  “under  the  republican 
one”  (III  6),  thus  indicating  that,  in  his  opinion,  a 
monarchy  should  eventually  not  be  considered  as  a 
republic.  In  1788,  when  James  Madison  writes  in 
the  39th  release  of  The  Federalist  Papers  about  the 
“  Republican  Principles  ”  of  the  American  Revolution, 
it  is  done  with  hesitation.  That  diverse  regimes  have 
been  called  a  republic  is  only  an  “impropriety”  and 
only  shows  “the  extreme  inaccuracy  with  which  the 
term  has  been  used  in  political  disquisitions.  ” 

The  reticence  in  completely  identifying  the  word 
republic  with  any  kind  of  political  collectivity,  and 
the  subsequent  specialization  of  the  word  at  the 
end  of  the  18th  century  in  different  languages, 
illustrates  the  progressive  rediscovery  of  classical 
republican  ideals  and  the  rebirth  of  republicanism. 
Since  Hans  Baron’s  The  Crisis  of  the  Early  Italian 
Renaissance:  Civic  Humanism  and  Republican 
Liberty  in  an  Age  of  Classicism  and  Tyranny 
(1966),  it  is  often  considered  that  the  Italian 
Renaissance  was  the  time  and  place  of  the  redis¬ 
covery  of  these  republican  ideals.  The  “humanist” 
movement  of  the  time  would  have  seen  a  growing 
interest  in  civic  action  and  participation  in  public 
life.  Baron  and  his  followers,  such  as  John  G.  A. 


Pocock,  in  his  Machiavellian  Moment  (197 5/2003), 
insist  on  the  reestablishment  of  the  republican  tra¬ 
dition  inherited  from  antiquity  under  the  form  of  a 
“civic  humanism.”  In  this  intellectual  tradition  of 
the  Renaissance,  the  value  of  the  individual,  his 
(political  notions  were  still  strongly  gendered) 
enlightened  opinion,  and  his  culture,  nourished  by 
the  reading  of  the  great  ancients,  are  reflected  in  a 
civic  engagement  based  on  the  commitment  to 
defend  one’s  collectivity  and  its  rights  and  liberties. 
This  is  the  basis  for  what  Pocock  calls  the  “Atlantic 
republican  tradition,  ”  which  drags  on  at  least  until 
the  18th  century  in  America.  His  work,  focusing 
on  the  English-speaking  world  after  the  Italian 
Renaissance,  neglects  in  quite  a  spectacular  fash¬ 
ion  the  prominent  French  republicanism  of  the 
18th  and  19th  centuries  as  well  as  the  continental 
republican  ideologies  of  the  19th  and  20th  centu¬ 
ries.  But  it  provides  a  historiography  that,  though 
contended,  has  the  merit  to  indicate  how — and 
when — the  influence  of  classical  Rome  found  its 
way  back  into  modern  politics. 

The  People 

The  reengagement  with  the  model  of  republican 
Rome  during  the  Renaissance  period  is  crucial  for 
modern  republicanism.  Even  if  the  details  of  the 
“Machiavellian  moment”  in  Western  history  might 
be  challenged,  and  the  role  of  the  Florentine  ideol¬ 
ogy  of  the  Renaissance  is  nuanced,  it  is  difficult  to 
deny  that  with  the  Renaissance,  and  the  role 
played  in  it  by  Italy,  a  shift  in  the  conception  of  the 
relation  between  humanity  and  the  world  is  per¬ 
ceptible.  This  change  influences  the  conception  of 
legitimate  politics,  laying  the  ground  for  modern 
conceptions  of  republicanism. 

As  Baron,  and  then  Pocock,  underline  it,  the 
Renaissance  was  characterized  by  a  new  conviction 
that  it  is  necessary  to  be  involved  in  the  life  of  the 
city.  According  to  these  authors,  this  contrasts 
sharply  with  the  tendency  in  the  Middle  Ages  to 
emphasize  among  clerical  elites  the  superiority  of 
contemplation.  The  substance  of  what  Baron  calls 
“civic  humanism,”  which  has  become  an  iconic 
expression  for  studies  on  republicanism,  is  precisely 
the  alliance  between  a  new  vision  of  humanity  pro¬ 
moted  by  intellectual  elites  and  the  sense  that  the 
natural  outcome  of  this  vision  is  involvement  in 
state  affairs.  Such  an  involvement  is  obviously  at 
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odds  with  traditional  monarchic  ideologies,  where 
political  decision  making  and  actions  are  concen¬ 
trated  at  the  level  of  the  reigning  monarch.  The 
Renaissance  transformed  this  core  conviction  of  the 
Middle  Ages.  In  a  way,  it  reinforced  it  strongly  with 
the  rise  of  absolutism  and  the  progressive  debase¬ 
ment  of  traditional  aristocracy,  which  gave  the  fig¬ 
ure  of  the  monarch  an  increasingly  central  role.  But 
at  the  very  same  time,  the  dawning  of  capitalism,  a 
new  critical  culture  influenced  by  the  reading  of  the 
ancients  and  the  taste  for  new  ideas,  challenged  the 
authoritarian  grounds  of  absolutism.  Modern 
republicanism  is  an  inherent  part  of  these  changes. 

The  pivotal  concept  in  this  reinvention  of  repub¬ 
licanism  is  the  rediscovery  of  the  “people”  consid¬ 
ered  as  a  “public.”  Indeed,  the  “people”  played  its 
role  in  medieval  conceptions  of  politics,  as  the 
monarch  was  supposed  to  govern  for  the  common 
good.  However,  this  role  was  mostly  the  passive  one 
of  a  recipient  of  good  politics  consenting  to  the 
action  of  the  leader.  On  the  contrary,  in  the  repub¬ 
lican  tradition  inherited  from  classical  Rome,  the 
“people”  is  an  active  subject  of  politics,  and  the  role 
of  the  leader  (the  prince)  is  significantly  trans¬ 
formed.  If,  as  republican  ideologies  claim,  politics  is 
“public,”  then  it  is  not  a  “private”  thing,  owned  by 
private  tenants  such  as  kings,  who  would  “have” 
the  collectivity  as  one  of  their  belongings.  The  con¬ 
trast  was  already  familiar  to  Roman  thinkers. 
Tacitus,  for  instance,  opens  his  Annals,  which 
retrace  the  beginning  of  the  empire  after  the  fall  of 
the  republic,  with  a  remembrance  of  the  early 
Roman  kings,  evoking  this  period  when  “Rome 
was  ruled  by  kings,”  as  the  English  translation 
stands.  In  fact,  Tacitus’s  Latin  is  more  precise.  He 
writes  literally  that  “kings  had  the  City  of  Rome.” 
What  he  has  in  mind  is  the  contrast  between  the 
king,  who  “has”  the  power,  and  the  republican 
magistrate,  who  “exerts”  power — exerts  it  in  the 
name  of  the  Roman  people.  This  distinction  between 
the  public  exercise  of  power,  which  should  include 
the  popular  will,  and  monarchy  as  a  form  of  priva¬ 
tization  of  politics  is  one  of  the  strongest  elements 
of  differentiation  in  the  Roman  experience  of  poli¬ 
tics.  In  the  late  history  of  Rome,  the  most  powerful 
emperors  carefully  avoided  taking  the  title  of  king, 
only  using  the  title  of  prince  for  this  reason. 

Modern  republicanism  did  not  have,  in  its  early 
stages,  a  notion  of  the  people  as  thoroughly  formed 
as  was  the  case  in  classical  Rome  (after  all, 


Machiavelli,  one  of  the  heroes  of  Pocock’s  “repub¬ 
lican  tradition”  wrote  a  book  about  the  prince  and 
not  about  the  people).  But  it  still  proved  to 
be  averse  to  absolute  monarchy:  The  English 
republic  in  the  17th  century — when  the  word 
republicanism  entered  the  English  vocabulary — 
was  the  first  illustration  of  the  fact,  with  the  execu¬ 
tion  of  Charles  I.  In  the  18th  century,  the  “people” 
gained  a  still  stronger  presence  in  political  affairs 
and  started  to  play  a  role  as  a  prime  actor  of 
political  life  in  an  evolution  that  finds  its  ultimate 
formulation  in  the  American  and  French  revolu¬ 
tions.  The  American  constitution,  which  grants  in 
its  Article  IV,  Section  4,  a  “Republican  Form  of 
Government,”  opens  with  the  legendary  formula: 
“We  the  people.  ...”  The  first  French  Republic 
(1792),  in  its  turn,  states  in  its  1793  constitution 
that  “there  can  be  no  constitution  but  [that] 
accepted  by  the  people.” 

The  sense  that  the  republic  is  public  in  that  it 
involves  the  people  is  one  of  the  most  stable  fea¬ 
tures  of  republicanism.  This  is  how  Madison 
(1788) — 18  centuries  after  Cicero — still  defines  the 
republic: 

We  may  define  a  republic  to  be,  or  at  least  may 
bestow  that  name  on,  a  government  which  derives 
all  its  powers  directly  or  indirectly  from  the  great 
body  of  the  people,  and  is  administered  by  persons 
holding  their  offices  during  pleasure,  for  a  limited 
period,  or  during  good  behavior.  ( The  Federalist 
Papers,  No.  39) 

This  definition  undoubtedly  adds  to  the  classical 
idea  a  democratic  sentiment  that  was  not  always 
present  in  ancient  conceptions.  Rome,  for  instance, 
was  not  a  democracy,  no  more  than  the  “Florentine” 
republics  were,  which  serve  as  an  early  model  for 
modern  republicanism.  Politicians  and  theoreti¬ 
cians  were  very  aware  of  this  fact,  and  in  Rome, 
they  even  took  pride  in  not  being  democrats. 
Cicero,  in  the  De  Republica,  insists  on  a  conception 
of  the  republic  that  illustrates  Aristotle’s  theory  of 
the  “mixed”  government:  According  to  Cicero,  the 
republic  combines  a  regal  element,  an  aristocratic 
one,  and  a  democratic  one,  seeing  in  them  the 
qualities  of  “affection,”  “talent,”  and  “liberty.” 

The  difference  between  a  republic  and  a  democ¬ 
racy  is  probably  one  of  the  oldest  debated  issues  as 
regards  republican  matters,  and  to  a  certain  extent, 
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it  has  remained  the  same:  In  a  democracy,  the 
people  that  rule  are  in  fact  a  majority,  whereas  in 
a  republic,  it  is  considered  as  a  whole.  This  could 
be  the  basic  difference  between  a  public  instance  of 
politics  and  a  popular  one.  In  the  public  concep¬ 
tion,  decisions  are  not  necessarily  unanimous.  But 
there  is  the  idea  that  they  should  be  made  in  the 
interest  of  the  people  as  a  global  entity.  This  con¬ 
ception  of  the  republic  is  akin  to  the  concept  of 
“general  will”  that  Rousseau  designs  to  explain 
how  a  public  decision  (about  law)  integrates  all  the 
differences  expressed  by  individuals.  In  such  a 
model,  each  individual  has  a  voice,  but  everybody 
should  recognize  his  (and  later  in  the  history  of 
republics,  her)  best  interest  in  the  final  decision 
even  if  it  is  not  the  one  initially  favored  by  the 
individual.  In  this  case,  Rousseau  adds,  it  is  just 
that  the  individual  was  wrong  about  what  he 
thought  was  best  for  himself. 

Rousseau’s  outlook  cannot  be  considered  as 
encompassing  all  sorts  of  republicanism.  Moreover, 
his  thought  is  not  solely  dedicated  to  spelling  out 
republican  ideologies  but  also  tries  to  embrace  the 
source  of  political  legitimacy.  However,  both  under¬ 
takings  are  eventually  the  same  in  his  mind,  as  “a 
legitimate  government  is  a  republican  one” 
(Contract,  II  6),  and  this  is  how  Rousseau’s  para¬ 
digm  can  serve  as  a  general  model  for  the  republican 
conception  of  the  people.  It  allows  understanding  of 
what  is  at  stake  in  the  republican  idea:  the  attempt 
to  articulate  individual  wills  and  rights  in  a  collec¬ 
tivity  that  would  not  be  dependent  on  individual 
behaviors  but  would  aggregate  them. 

This  model  is  a  powerful  one  when  it  comes  to 
understanding  how  the  republican  and  democratic 
conceptions  of  the  people  might  diverge.  This  is 
not  always  the  case,  however.  Many  prominent 
figures  of  modern  republicanism  of  all  periods 
have  not  differentiated  between  the  two.  The  poli¬ 
tician-thinker  Thomas  Jefferson  (1743-1826)  in 
the  United  States,  where  he  was  recognized  infor¬ 
mally  as  the  leader  of  a  “democrat-republican 
party,”  and  the  republican  intellectual  Alain 
(Emile-Auguste  Chartier,  1868-1951)  in  France  do 
not  really  imagine  that  republican  ideals  can  limit 
the  role  of  the  people  in  politics.  However,  other 
authors  lean  toward  qualifying  republicanism  as  a 
specific  way  to  consider  democracy  or  even,  like 
Auguste  Comte  (1798-1857),  as  a  way  to  avoid 
democracy.  This  conception  of  the  republic  as 


separate  from  a  democracy  promotes  a  vision  of 
the  demos  (“the  people”  in  Greek)  as  standing 
united  by  showing  a  superior  capacity  to  silence  its 
internal  divisions.  This  tendency  is  often  critical  of 
“liberal”  conceptions  of  democracy,  for  which  the 
division  of  interests  is  legitimate.  It  is  even  some¬ 
times  dangerously  at  odds  with  a  proper  liberal 
state:  the  first  French  republic  (1792-1804)  slid 
into  “the  Terror”  precisely  when  its  leaders,  fol¬ 
lowing  Maximilien  Robespierre  (1758-1794), 
considered  that  any  kind  of  opposition  was  the 
seed  of  a  division  that  had  to  be  suppressed. 
Because  republican  ideals  emphasize  the  “agree¬ 
ment  about  justice”  that  Cicero  talked  about,  they 
can  end  up,  in  radical  practices  of  the  republic, 
legitimizing  the  elimination  of  dissenters  within. 

The  Citizen 

The  radicalization  of  terror  is  neither  an  accident  in 
the  history  of  republicanism  nor  its  fate.  It  is  not  an 
accident  in  the  sense  that  the  promotion  of  the 
unity  of  the  people  in  republican  ideologies  has 
often  been  linked  with  the  necessity  of  defending 
the  preeminence  of  the  country.  This  is  why  nation¬ 
alism,  or  at  least  a  jealous  affirmation  of  one’s 
sovereignty,  is  a  frequent  trait  of  republicanism  in 
its  modern  versions.  It  might  favor  a  strong  com¬ 
munitarian  and  possibly  violent  reaction  against 
dissent.  One  can  see  it  in  the  history  of  the  Roman 
republic,  which  has  constantly  suffered  from  coups, 
exiles,  and  brutal  evictions.  The  Renaissance  repub¬ 
licanism,  which  serves  as  a  paradigm  for  Baron  and 
his  successors,  also  corresponds  to  the  dark  history 
of  the  Italian  republics,  and  Machiavelli’s  indiffer¬ 
ence  to  political  violence  is  a  symptom  of  this 
rather  rough  republican  history. 

The  privilege  given  to  the  collectivity  also 
encourages  bellicose  behavior  toward  outsiders  for 
the  sake  of  defending  the  integrity  of  “the  people.” 
Since  classical  Rome,  war  is  often  considered  as  a 
sign  of  the  strength  of  true  republics.  The  rationale 
for  such  an  attitude  is  easily  understandable.  If  the 
unity  of  the  people  is  the  supreme  reference  for 
republican  politics,  whatever  threatens  this  unity  is 
regarded  as  the  most  lethal  menace,  whether  the 
threat  comes  from  the  inside  or  from  the  outside. 

At  the  same  time,  it  would  be  wrong  to  assume 
that  republicanism  is  limited  to  an  authoritarian 
affirmation  of  the  people  without  tolerance  for 
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diversity.  On  the  contrary,  if  republicanism  has 
been  rediscovered  in  the  early  stages  of  modernity, 
it  is  because  it  makes  room  for  another  value  of 
political  modernity:  the  individual. 

The  question  was  already  raised  by  Cicero,  who 
noted  that  a  people  is  not  “any  junction  of  men 
united  in  any  fashion”  but  “the  junction  of  a  mul¬ 
titude,  which  associates  on  the  grounds  of  an 
agreement  about  justice  and  a  commonality  of 
what  is  useful”  (I  25).  The  collectivity  is  not  a  col¬ 
lection  of  persons,  and  the  people  is  not  a  multi¬ 
tude  but  its  “junction”  due  to  shared  principles 
and  interests.  Hence,  a  way  of  considering  the 
people  as  a  public  consists  in  paying  attention  to 
its  composition.  If  the  people  is  a  collective  entity, 
the  public  is  also  made  of  individuals  who  belong 
to  the  common  sphere  of  deliberation — what 
Cicero  calls  the  “agreement  about  justice.”  For 
this  reason,  republicanism  cannot  be  unilaterally 
associated  with  a  communitarian  approach  of 
politics.  Cicero,  and  with  him  more  largely  the 
Roman  conception  of  the  republic,  was  keen  on 
differentiating  between  the  people  in  the  sense  of 
the  republic  and  the  people  in  the  sense  of  a  mass 
that  would  dissolve  individualities.  The  distinction 
between  what  is  public  and  what  is  “popular” 
parallels  the  distinction  between  a  collectivity  of 
individuals  reasoning  about  their  common  interest 
and  an  indistinct  crowd  often  suspected  to  be  prey 
to  momentary  passions,  leading  to  anarchy  and 
disorder.  It  also  parallels  the  opposition  between 
politics  as  the  outcome  of  rational  decisions  made 
by  individuals  and  politics  as  influenced  by  the 
spread  of  passions  through  the  mob.  This  value  of 
republicanism  has  been  particularly  important  for 
the  reflection  on  French  republicanism.  Authors 
such  as  Jules  Barni  (1818-1878),  Edgar  Quinet 
(1803-1875),  and  Alain  insist  on  the  value  of  indi¬ 
vidual  reasoning  to  achieve  a  more  efficient  repub¬ 
lic.  The  people  is  worth  what  each  part  of  it  is. 

This  individual  that  republicanism  celebrates 
as  the  vivid  part  of  the  people  is  not  the  one  lost 
in  his  (and  later,  her)  special  interests;  it  is  the 
individual  who  is  able  to  understand  the  com¬ 
mon  good  and  reflect  on  it.  This  individual  is  the 
citizen.  For  this  reason,  the  republican  promi¬ 
nence  of  the  individual  does  not  solely  reflect  the 
rise  of  modern  individualism,  though  it  is  obvi¬ 
ously  compatible  with  it.  Rome  also  promoted 
the  individual  citizen  as  an  important  aspect  of 


its  “republicanism.”  But  in  modern  times,  the 
figure  of  the  citizen  is  a  practical  solution  that 
republicanism  offers  to  solve  the  delicate  equa¬ 
tion  of  contemporary  politics,  for  which  the 
necessity  to  govern,  the  voice  of  the  governed  in 
public  matters,  and  the  place  of  the  individual  in 
a  social  life  that  is  by  necessity  a  collective  one 
are  as  imperative  as  they  are  contradictory. 

The  major  device  in  this  configuration  is  the 
role  of  the  law.  The  republican  tradition  considers 
that  law  is  the  product  of  the  collective  reasoning 
of  the  citizens,  the  citizen  being  defined  since 
Aristotle  (384-322  BCE)  as  the  one  who  has  the 
right  to  participate  in  the  life  of  the  city.  Hence, 
law  is  a  testimony  to  the  appropriation  of  politics 
by  the  people,  who  can  recognize  themselves  in  the 
ruling  of  their  country,  and  hence  ensures  that 
the  republic  is  truly  public — that  is,  a  product  of 
the  people’s  will.  This  is  why  the  citizen,  as  we 
have  seen  with  Rousseau,  does  not  decide  from  a 
personal  point  of  view.  He  makes  his  decision  in 
view  of  what  he  thinks  best  for  his  country.  This 
kind  of  abnegation  is  both  integrative  and  exclu¬ 
sive:  By  deciding  the  rules  that  are  applied  to  the 
whole  political  body  (or,  in  a  representative  gov¬ 
ernment,  by  choosing  those  who  make  or  apply 
the  law),  the  citizen  extends  his  real  sphere  of 
influence  to  the  coalescence  of  his  vote  with  those 
of  his  peers  in  a  ballot.  This  is  a  particularly  strong 
way  of  integrating  each  individual  into  a  collectiv¬ 
ity  that  he  can  grasp.  At  the  same  time,  this  very 
integration  has  been  the  reason  why  republicanism 
has  often  supported  the  exclusion  of  those  who 
were  considered  unable  to  participate  in  the  ele¬ 
vated  sense  of  the  public  sphere  because  they  were 
too  tied  to  private  interests — slaves  in  antiquity 
and,  until  the  20th  century,  women,  whom  many 
republics,  such  as  the  French  or  the  Helvetic 
republics,  have  adamantly  excluded  (until  1944  in 
France  and  until  1971,  even  until  1990  at  the  local 
level,  in  Switzerland). 

This  elevated  sense  of  the  public  good  is  often 
linked,  in  the  republican  literature,  with  the  theme 
of  a  specific  virtue  on  which  republicanism  depends. 
The  virtue  of  the  citizen  is  not  solely  moral.  It  is 
also  an  important  political  device,  as  this  virtue 
consists  in  being  able  to  see  beyond  one’s  immedi¬ 
ate  benefit  and  to  favor  the  collectivity  over 
oneself.  This  attitude  regularly  leans  toward  a  sacri¬ 
ficial  rhetoric,  in  which  the  individual  must  be 
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ready  to  forgo  his  personal  interests  for  the  sake  of 
the  public  good.  The  Romans  had  a  particularly  rich 
mythology  of  citizens  sacrificing  their  life  or  their 
well-being  for  Rome.  This  bold  conception  of  virtu¬ 
ous  devotion  resurfaced  in  the  Italian  Renaissance 
and  continued  to  feed  republicanism  during  the 
American  and  French  revolutions  in  the  18th  cen¬ 
tury.  During  World  War  I,  the  theme  of  the  sacrifice 
of  the  soldier  for  the  homeland  is  constant  in  the 
French  republican  propaganda,  for  instance. 

There  is  another  important  aspect  of  the  repub¬ 
lican  conception  of  the  law:  the  equality  of  citizens 
before  the  law.  This  allows  differentiating  between 
the  individual  engaged  as  an  equal  to  any  other 
individual  in  the  public  sphere  and  the  same  indi¬ 
vidual  in  the  private  sphere,  where  inequalities 
dominate  and  where  the  individual  is  limited  to 
local  interests  and  constraints.  The  law  is  thus 
placed  between  the  citizen  and  the  wider  public 
collectivity  in  which  he  still  recognizes  himself. 
Republicanism  thus  promotes  a  universal  concep¬ 
tion  of  politics,  in  the  sense  that  everybody  is  sup¬ 
posed  to  be  integrated  in  this  equal  sense  of  the 
common  good  that  translates  into  laws. 

Liberty 

From  a  republican  standpoint,  the  rule  of  law  is 
thus  not  only  a  way  to  govern  the  people.  It  is  also 
a  way  to  preserve  and  reinforce  a  central  political 
value:  liberty. 

While  liberty  is  not  solely  a  republican  concept, 
it  is  not  possible  to  imagine,  whichever  period  we 
consider,  a  republican  ideology  that  would  not 
favor  liberty  as  the  most  important  aspect  of  its 
public  values.  This,  once  again,  is  linked  with  the 
centrality  of  the  people  in  the  conception  of  poli¬ 
tics  that  republicanism  develops.  Maintaining  the 
integrity  of  the  people  means  that  it  must  not  be 
subject  to  any  foreign  law.  To  that  extent,  liberty 
concentrates  all  the  functions  of  politics  from  a 
republican  standpoint. 

Freedom  is  first  linked  with  the  absence  of  tyr¬ 
anny.  This  aspect  explains  how  suspicious  monar¬ 
chical  regimes  look  in  a  republican  conception; 
there  is  the  deep  fear  that  the  personal  sovereign 
might  turn  into  a  despot.  But  republican  freedom 
is  more  than  just  protection  from  the  evils  of  per¬ 
sonal  subjection;  it  is  also  perceived  as  a  function 
of  the  law.  As  the  French  republican  Jules  Barni 


(1872)  underlines  it  in  his  Manuel  republicain ,  law 
in  a  republic  has  as 

its  object  to  regulate  the  reciprocal  relations  of 
citizens,  in  order  to  insure  both  the  respect 
of  their  individual  rights  and  the  well  being  of 
society  as  a  whole.  .  .  .  Instead  of  hindering 
individual  freedom,  laws  must .  .  .  protect  its 
legitimate  usage.  (II  6,  36) 

Barni’s  analysis  is  a  significant  illustration  of 
republican  principles;  he  sums  up  many  of  the  val¬ 
ues  of  late-modern  republicanism,  for  which  law  is 
a  means  to  protect  freedom  and  not  to  “hinder”  it. 
If  the  people  is  the  foundation  of  the  republic,  the 
law  made  by  this  people  is  its  own  expression  of 
freedom,  and  the  republican  tradition  as  a  whole 
could  illustrate  the  proud  aphorism  of  Rousseau 
(1762/1997):  “Obedience  to  a  law  which  we  pre¬ 
scribe  to  ourselves  is  liberty”  (Social  Contract ,  18). 

However,  this  conception  of  the  law  is  at  odds 
with  a  more  pessimistic  outlook  that  states  that 
any  form  of  law  is  a  limitation  of  somebody’s  lib¬ 
erty.  This  analysis,  developed  by  Isaiah  Berlin 
(1909-1997)  in  his  famous  article  “Two  Concepts 
of  Liberty,”  leads  to  the  idea  of  “negative  free¬ 
dom,”  which  is  based  on  noninterference:  One  is 
free  inasmuch  as  nobody  interferes  with  one.  This 
conception  of  liberty  is  often  labeled  as  the  “lib¬ 
eral”  one.  Berlin  opposes  this  conception  and  puts 
forth  a  “positive”  understanding  of  liberty,  which 
claims  that  freedom  is  the  participation  of  the 
people  in  the  making  of  the  law,  as  one  cannot  be 
self-oppressed.  These  categories,  originally  crafted 
after  Constant’s  remarks  (1767-1830)  in  his  1819 
lecture  “On  the  Liberty  of  the  Ancients  Compared 
With  That  of  the  Moderns,”  provide  an  easy  con¬ 
trast  between  a  liberal  conception  of  freedom 
based  on  noninterference  and  a  republican  concep¬ 
tion  that  takes  its  roots  in  collective  participation 
in  the  public  sphere.  Contemporary  republicanism 
contends  that  the  former  allows  the  retreat  of  citi¬ 
zens  from  the  public  sphere  to  enjoy  private  free¬ 
dom  and  insists  on  the  necessity  for  citizens  to 
engage  in  the  public  sphere  in  order  to  preserve 
their  liberty. 

The  two  positions  are  not  incompatible.  Constant, 
and  then  Berlin,  emphasize  that  political  freedom  is 
a  combination  of  private  enjoyment  and  political 
mobilization  to  promote  and  defend  freedom.  This 
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does  not  entirely  satisfy  the  requisites  of  republican¬ 
ism,  and  the  contemporary  republican  thinker 
Philip  Pettit  insists  that  the  aim  is  less  a  “combina¬ 
tion”  of  positive  and  negative  liberties  than  “a  third 
alternative.”  This  alternative  would  be  “the  concep¬ 
tion  of  freedom  as  non-domination.”  This,  Philip 
Pettit  (1997)  writes,  “requires  that  no  one  is  able  to 
interfere  on  an  arbitrary  basis — at  their  pleasure — 
in  the  choices  of  the  free  person.  This  is  the  concep¬ 
tion  espoused  in  the  long  republican  tradition” 
(chap.  1).  Such  a  conception  is  for  him  the  path  to 
a  renovated  republicanism  that  can  accommodate 
both  the  republican  tradition  of  an  engaged  people 
developing  a  strong  sense  of  the  autonomy  of  its 
community  and  a  modern  sense  of  freedom  that 
makes  room  for  differences  in  lifestyles  and  values 
the  private  sphere,  without  forgetting  that  the  pur¬ 
suit  of  happiness  is  thoroughly  hampered  in  a  world 
devoid  of  true  political  freedom. 

Conclusion 

Republicanism,  beyond  its  venerable  antique  ori¬ 
gins,  has  undoubtedly  proven  to  be  a  dynamic  way 
of  addressing  the  questions  of  political  modernity. 
By  providing  political  theory  with  an  integrative 
conception  of  the  law  and  by  highlighting  a  con¬ 
ception  of  freedom  that  respects  individual  rights 
and  the  necessity  to  build  a  strong  collectivity, 
republicanism  is  able  to  frame  a  particularly  effi¬ 
cient  conception  of  modern  democracy.  Its  ambi¬ 
tion  is  also  to  avoid  the  less  favorable  outcomes  of 
the  latter  while  retaining  its  most  valuable  inspira¬ 
tion.  The  balance  is  neither  always  easy  to  maintain 
nor  always  favorable  to  republican  conceptions. 
Republicanism  also  contains,  not  solely  as  a  theory 
but  also  as  a  political  reality,  the  seeds  of  authori¬ 
tarian  behavior  and  the  temptation  of  conserva¬ 
tism  and  of  elitism.  Liberal  democracy  might  be 
criticized,  and  efficiently  so  from  a  republican 
standpoint,  for  its  propensity  to  comply  with  the 
rule  of  a  changing,  superficial  majority.  But  repub¬ 
licanism  might  not  give  a  fully  satisfying  response 
to  these  evils  when  it  pretends  that  majorities  rule 
legitimately  not  on  the  grounds  of  their  numerical 
superiority  but  because  they  are  or  should  be  the 
voice  of  the  people  as  a  whole.  If,  as  republicanism 
contends,  liberalism  is  never  completely  immune 
from  the  temptation  of  relativism  and  indiffer¬ 
ence,  republicanism  is  also  never  fully  above  the 


suspicion  of  desiring,  as  Rousseau  put  it,  to  force 

people  to  be  free. 
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Responsibility 


Responsibility  is  one  of  those  big  political  words 
that  are  commonly  used  but  whose  precise  mean¬ 
ing  is  often  left  obscure.  Responsibility  is  a  con¬ 
cept  that  implies  a  value  judgment,  but  the  values 
it  embodies  can  differ  according  to  the  context, 
the  discourse,  and  the  views  of  the  speaker. 
Responsibility  is  a  state  of  affairs — more  precisely, 
as  the  Oxford  and  Webster’s  dictionaries  define  it: 
the  state,  quality,  or  fact  of  being  responsible. 
However,  one  can  be  responsible  in  a  variety  of 
ways,  which  signify  various  usages  of  the  concept 
of  responsibility. 

Usages  of  the  Concept  of  Responsibility 
Responsibility  as  Agency,  Cause,  or  Capacity 

The  concept  of  responsibility  can  be  used  to 
indicate  that  someone  or  something  has  had  the 
power  to  cause  a  particular  event  to  happen.  In  a 
mere  descriptive  sense,  responsibility,  in  this  sense 
of  agency,  coincides  with  causality.  “Responsible 
for”  can  be  replaced  by  “caused,”  or  any  other 
expression  that  suggests  a  causal  connection.  Not 
only  human  agents  but  also  things,  situations,  or 
circumstances,  can  be  said  to  be  responsible — for 
example,  “a  virus  was  responsible  for  the  out¬ 
break  of  the  epidemic.”  In  a  moral  sense,  how¬ 
ever,  responsibility  can  be  ascribed  only  to  human 
agents,  who  moreover  must  have  the  capacity  to 
act  responsibly.  This  implies  that  they  must  have 
had  the  possibility  to  act  otherwise — that  is,  they 
must  not  have  been  compelled  by  external  forces 
and  they  must  have  possessed  sufficient  mental 
capacities  to  have  realized  the  consequences  of 
their  actions  and  to  direct  their  will.  This  is  a 
major  issue  in  criminal  law  and  moral  philosophy 
and  has  sparked  legal  and  philosophical  debates 
about  mens  rea  and  about  determinism  versus 
free  will. 

Responsibility  as  Task,  Duty,  or  Obligation 

The  concept  of  responsibility  can  also  be  used  to 
indicate  the  tasks,  duties,  and  obligations  that  flow 
from  a  social  role,  position,  or  office.  For  example, 
“The  president  is  responsible  for  appointing 
the  heads  of  federal  agencies,”  or  “It  was  his 


responsibility  to  coordinate  the  crisis  response.” 
Often  the  plural  “responsibilities”  is  used,  as  in  “She 
has  many  responsibilities.”  Responsibility  in  this 
sense  is  often  followed  by  the  preposition  “for.” 
This  meaning  of  responsibility  is  often  at  stake  in 
administrative  law  and  is  relevant  in  determining 
the  width  and  extent  of  political  accountability. 

Responsibility  as  Accountability, 

Answerability,  and  Liability 

Very  often  “being  responsible”  is  used  to  indi¬ 
cate  moral,  political,  or  legal  accountability  for  a 
sorry  state  of  affairs.  In  the  first  instance,  this  may 
amount  to  simply  having  to  provide  an  account  to 
a  forum,  such  as  parliament,  a  congressional  com¬ 
mittee,  or  a  legal  court,  and  to  provide  information 
about  one’s  conduct.  Second,  this  can  involve 
answerability .  The  forum  may  ask  questions  and 
the  actor  will  be  asked  to  provide  justifications  or 
excuses.  Finally,  this  can  involve  liabilities.  The 
forum  may  pass  judgment  on  the  conduct  of  the 
actor — it  may  approve  of  an  annual  account, 
denounce  a  policy,  or  publicly  condemn  the  behav¬ 
ior  of  an  official  or  an  agency.  Judgment  also 
implies  the  imposition  of  formal  or  informal  sanc¬ 
tions  on  the  actor  in  case  of  malperformance  or, 
for  that  matter,  of  rewards  in  case  of  adequate 
performance.  Responsibility  as  accountability  is 
often  followed  by  the  preposition  “to.”  This  sense 
is  highly  relevant  in  politics,  for  example,  with 
regard  to  the  doctrine  of  ministerial  responsibility 
in  parliamentary  systems. 

Responsibility  as  Virtue 

“Being  responsible”  can  also  be  used  to  indicate 
a  positive  moral  attitude,  a  desirable  character  trait, 
or  praiseworthy  conduct.  We  can  say  of  someone 
that  she  or  he  is  a  “responsible  administrator,” 
“behaves  responsibly,”  in  short,  that  she  or  he  is  a 
person  with  a  “sense  of  responsibility.”  What  these 
various  usages  have  in  common  is  that  they  see 
responsibility  as  a  desirable  quality  in  officials  and 
organizations.  Responsibility  is  used  as  a  normative 
concept,  as  a  set  of  standards  for  the  behavior  of 
actors,  or  as  a  desirable  state  of  affairs.  Often,  in 
this  type  of  discourse,  the  adjective  “responsible”  or 
the  adverb  “responsibly”  are  used,  as  in  “responsi¬ 
ble  government,”  “the  responsible  administrator,” 


2300  Responsibility 


or  “we  want  public  officials  to  behave  responsi¬ 
bly.”  In  these  usages  of  the  concept,  responsibility 
is  seen  as  a  virtue,  as  a  positive  feature  of  institu¬ 
tions  or  officials.  It  is  used  to  positively  qualify  a 
state  of  affairs  or  the  performance  of  an  actor.  It 
comes  close  to  “responsiveness,”  considerateness, 
and  a  willingness  to  act  in  a  transparent,  fair,  and 
equitable  way.  This  sense  is  often  used  in  discus¬ 
sions  about  political  morality  and  administrative 
ethics. 

Historical  Roots 

The  noun  responsibility  is  relatively  modern. 
According  to  the  Oxford  English  Dictionary,  the 
first  references  date  back  to  the  end  of  the  19th 
century.  It  appears,  for  example,  in  The  Federalist 
Papers  and  in  the  work  of  Edmund  Burke,  where 
it  refers  to  both  capacity  and  obligation.  The 
adjective  “responsible”  is  found  much  earlier,  for 
example,  in  the  work  of  both  Thomas  Hobbes  and 
John  Locke,  as  being  liable  and  virtuous,  respec¬ 
tively.  Responsibility  used  in  the  sense  of  virtue  can 
be  found  in  modern  Christian  ethics  from  the  19th 
century  onward.  Both  the  English  “responsibility” 
and  the  French  responsabilite  can  presumably  be 
traced  to  the  word  responsabilis.  This  term,  mean¬ 
ing  having  to  answer,  does  not  occur  in  the  classic 
Latin  vocabularies  and  probably  dates  from  the 
legal  jargon  of  the  late  Middle  Ages. 

Active  and  Passive  Responsibility 

In  the  realm  of  politics,  with  its  multiple  and  often 
evocative  uses  of  the  word  responsibility,  it  can  be 
helpful  to  distinguish  between  active  and  passive 
responsibility.  This  distinction  more  or  less 
amounts  to  the  distinction  between  responsibility 
as  a  virtue  on  the  one  hand  and  as  accountability 
or  a  liability  on  the  other. 

When  responsibility  as  a  virtue  is  at  stake,  the 
focus  is  on  action  in  the  present,  on  the  prevention 
of  undesirable  situations  and  events.  Active  respon¬ 
sibility  is  about  taking  responsibility  ex  ante, 
about  behaving  responsibly.  The  central  question 
is  “What  should  be  done?”  An  example  of  such  a 
proactive  use  of  the  concept  of  responsibility  in  the 
realm  of  politics  is  Max  Weber’s  famous  distinc¬ 
tion  between  an  ethics  of  principle  and  an  ethics  of 
responsibility.  According  to  Weber,  a  strictly 


deontological  approach  to  politics,  in  which  all 
that  matters  is  adherence  to  political  principles,  is 
highly  irresponsible.  The  truly  responsible  politi¬ 
cian  takes  the  foreseeable  consequences  of  his  own 
decisions  into  account  when  deciding  on  a  course 
of  action. 

In  the  case  of  passive  responsibility,  the  focus  is 
on  determining  accountabilities  and  liabilities  after 
the  fact.  Passive  responsibility  is  retrospective  in 
nature,  it  is  about  responsibility  ex  post,  about 
holding  someone  accountable  for  a  given  state  of 
affairs.  The  central  question  is  “Who  is  to 
blame?”  Passive  responsibility  has  both  a  practical 
dimension — who  is  to  be  held  accountable? — and 
a  normative  dimension — who  is  to  bear  the  blame 
and  suffer  the  consequences?  In  the  realm  of  poli¬ 
tics,  a  major  issue  in  this  regard  is  the  problem  of 
many  hands,  which  was  first  coined  by  Dennis 
Thompson.  Because  many  different  officials  con¬ 
tribute  in  many  ways  to  decisions  and  policies  of 
government,  it  is  difficult,  both  practically  and  in 
principle,  to  identify  who  is  responsible  for  politi¬ 
cal  outcomes. 

Political  Responsibility 

In  most  Western  democracies,  the  problem  of 
many  hands  has  led  to  introducing  a  kind  of  vicar¬ 
ious  liability  for  ministers,  heads  of  departments, 
and  other  political  officials.  They  are,  at  least  in 
principle,  accountable  to  parliament  or  congress 
for  anything  that  goes  wrong  under  their  guidance, 
whether  or  not  they  were  actually  in  a  position  to 
do  anything  about  it.  In  political  practice,  how¬ 
ever,  political  leaders  often  accept  complete  respon¬ 
sibility  only  in  retoricis,  without  accepting  further 
consequences.  Political  responsibility  is  often  lim¬ 
ited  to  giving  an  account  and  to  answering  ques¬ 
tions  from  journalists  or  political  representatives 
and  does  not  extend  to  resignation. 

In  Westminster  parliamentary  systems,  this 
vicarious  liability  is  known  as  the  doctrine  of  min¬ 
isterial  responsibility.  Ministers  are  accountable  to 
parliament  not  just  for  their  own  conduct  but  also 
for  any  acts  or  omissions  of  their  civil  servants.  The 
traditional  view  of  ministerial  responsibility  holds 
that  in  the  case  of  serious  mistakes  of  civil  servants 
a  minister  should  not  only  give  an  account  to  par¬ 
liament  but  should  also  take  the  blame  and  resign, 
regardless  of  whether  he  himself  or  she  herself  was 
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personally  to  blame.  This  is  also  known  as  the 
Carrington  doctrine — after  Lord  Carrington,  the 
British  minister  of  Foreign  Affairs,  who  resigned 
because  his  intelligence  service  had  presumably  not 
foreseen  the  Argentinean  invasion  of  the  Falkland 
Islands.  However,  the  practice  of  ministerial 
responsibility  does  not  extend  nearly  as  far  as  this 
broadly  conceived  doctrine.  If  ministers  resign  at 
all,  they  usually  do  so  only  because  they  have 
failed  in  personally  supervising  their  civil  servants 
or  in  their  conduct  toward  parliament.  (Even  the 
resignation  of  Lord  Carrington  came  to  be  seen  by 
many  as  less  noble  than  it  seemed,  because  years 
later  it  turned  out  that  he  himself  had  decided  not 
to  pass  on  the  information  about  the  pending  inva¬ 
sion  of  the  Falklands.) 

An  important  advantage  of  this  rather  narrow 
view  of  ministerial  responsibility  is  that  cabinets 
will  not  lose  very  capable  ministers  as  a  result  of 
unfortunate  incidents.  A  major  disadvantage  is 
that  this  weakens  the  political  control  of  the  civil 
service,  as  there  is  no  direct  political  accountability 
of  civil  servants  in  most  Westminster  systems.  This 
is  different  in  the  United  States,  where  senior  pub¬ 
lic  officials  can  be  asked  to  give  an  account  before 
congressional  committees. 
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Responsiveness 


The  most  general  connotation  of  the  notion  of 
responsiveness  pertains — as  a  noun — to  reactivity 
or  sensitivity.  Descriptively,  it  denotes  the  quality 
of  being  responsive  to  stimulation:  as  a  dynamic 
process  of  reacting  quickly.  The  adjective  respon¬ 
sive  refers  to  “answering,  replying,  and  respond¬ 
ing”  as  well  as  being  “susceptible  to  the  feelings  of 
others.”  In  technology,  the  term  is  used  to  describe 
how  quickly  the  system  responds  to  user  input. 
This  entry  analyzes  in  greater  depth  the  meaning 
of  the  notion,  points  out  the  components  of 
responsiveness  and  its  foci,  and  discusses  what  its 
“causal  links”  are. 

Meaning 

The  political  meaning  of  responsiveness  is  rather 
faithful  to  all  the  foregoing  meanings.  Responsive¬ 
ness  in  political  science  is  closely  related  to  con¬ 
cepts  such  as  “representation,”  “accountability,” 
“delegation,”  and  “representative  government.” 
Many  scholars  consider  the  stable  responsiveness 
of  a  government  to  the  preferences  of  its  citizens  to 
be  a  fundamental  feature  of  a  representative 
democracy. 

Before  disentangling  the  concept  of  responsive¬ 
ness ,  let  us  concentrate  on  the  other  aforementioned 
notions  and  phenomena.  Accountability — reads 
one  of  the  political  science  dictionaries — has  two 
major  meanings,  which  partly  overlap.  First,  the 
more  common  and  standard  one  is  that  those  who 
hold  power  are,  in  a  broad  sense,  stewards  and 
must  be  able  to  manifest  the  fact  that  they  exercise 
power  and  fulfill  their  duties  properly.  (It  is  pre¬ 
cisely  the  issue  of  fulfilling  their  duties  that  is  at  the 
crux  of  the  concept  of  responsiveness.)  Second, 
accountability  refers  to  arrangements  that  secure 
conformity  between  the  values  of  a  delegating  body 
and  the  person  to  whom  the  powers  and  responsi¬ 
bilities  are  delegated.  In  contemporary  democracies, 
governments  are  accountable  to  the  people  through 
the  process  of  election.  This  means  that  political 


2302  Responsiveness 


accountability  confines  the  use  of  power  and,  if 
need  be,  sanctions  its  abuse.  Institutionalizing 
accountability  means  de  facto  binding  the  use  of 
political  power  to  specific,  clearly  defined,  and 
publicly  accepted  standards. 

The  hidden  dimension  of  these  relationships 
pertains  to  the  phenomenon  (and  concept)  of  del¬ 
egation.  Governing  and  political  decision  making 
in  large  entities,  such  as  contemporary  democratic 
states,  entails  the  delegation  of  authority.  The  right 
to  make  decisions  is  designated — in  agency  rela¬ 
tionships — to  an  agent  by  a  principal.  However, 
this  task  is  conditional;  it  continues  only  if  the 
principal  is  satisfied  with  the  deeds  of  the  agent 
and  the  former  is  satisfied  when  the  agent  per¬ 
forms  in  accordance  with  the  principal’s  wishes 
and  interests.  It  may  be  withdrawn,  however,  if  the 
principal’s  evaluation  is  poor.  This  very  condition¬ 
ality,  coupled  with  the  possibility  of  withdrawal,  is 
the  essence  of  accountability  mechanisms.  This 
threat  of  withdrawal  need  not  materialize;  as  a 
confidence  procedure,  accountability  works  even 
when  the  principal  is  not  trying  to  replace  the 
agent  with  an  alternative.  Yet  she  or  he  has  the 
potential  and  means  to  do  so.  Consequently,  “gov¬ 
ernments  are  accountable ” — as  the  classic  volume 
on  the  topic  puts  it — “if  voters  can  discern  whether 
governments  are  acting  in  their  interests  and  sanc¬ 
tion  them  appropriately,  so  that  incumbents  who 
act  in  the  best  interests  of  citizens  win  reelection 
and  those  who  do  not  lose  them”  (Adam 
Przeworski,  Susan  C.  Stokes,  &  Bernard  Manin, 
1999,  p.  40).  The  single  macromechanism  behind 
this,  approximating  the  phenomenon  of  respon¬ 
siveness,  is  the  one  that  forces  the  incumbents  to 
choose  policies  that  enhance  their  chances  of 
reelection.  In  the  same  volume,  one  finds  yet 
another  proposal  of  a  linkage  between  account¬ 
ability  and  responsiveness:  “Accountability  is  ...  a 
property  of  institutional  structures,  whereas 
responsiveness  is  a  consequence  of  interaction 
within  such  structures.  .  .  .  Responsiveness  is  a 
measure  of  how  much  accountability  an  institu¬ 
tional  infrastructure  permits”  (Przeworski  et  ah, 
1999,  p.  131). 

The  notions  introduced  here  might  be  better 
understood  if  approached  from  a  policy  process 
perspective.  A  simplified  model  of  policy  process 
assumes  that  it  starts  with  the  commonsense  prem¬ 
ise  that  what  drives  politics  are  people’s  interests 


and  values,  which  are  translated  into  preferences; 
these  preferences  are  then  aggregated  and  signaled 
(via  elections,  demonstrations,  and  opinion  polls) 
to  respective  politicians.  In  contemporary  repre¬ 
sentative  democracies,  “mandates”  are  the  essen¬ 
tial  form  of  such  “signals”  sent  out  at  elections, 
constituting,  in  fact,  a  choice  between  the  different 
political  programs  offered  by  competing  parties. 
Once  elected,  politicians  implement  policies,  which 
result  in  certain  political  outcomes.  It  is  these  out¬ 
comes  that  are  evaluated — one  criterion  of  which 
is  their  congruency  with  voters’  preferences;  if  the 
evaluation  is  positive,  it  is  very  likely  that  the 
incumbent  will  remain  in  office;  if  it  is  negative, 
the  likelihood  of  replacing  one  camp  of  politicians 
with  another  is  high.  In  short,  representation  is  a 
relation  between  interests  and  outcomes;  the 
essence  of  responsiveness  is  in  the  relationship 
between  “signals”  and  “policies”  as  well  as 
between  “mandates”  and  “policies.” 

A  responsive  government  is  one  that  takes  into 
account  the  preferences  of  the  citizens  and  acts 
according  to  these  preferences.  The  key  purpose  of 
the  mechanism  of  accountability  is  guaranteeing 
that  agents  do  not  depart  from  the  expressed  or 
tacit,  though  objective,  interests  of  their  principals. 
In  a  way,  accountability  is  a  means  to  a  political 
end,  which  is  responsiveness.  Put  differently,  one 
may  state  that  accountability  is  the  property  of  an 
institutionalized  relationship  in  which  the  exercise 
of  power  by  one  actor  or  set  of  actors  is  constrained 
and  subject  to  some  requirement  of  responsiveness 
to  those  over  whom  the  power  is  exercised. 

Defined  in  such  a  way,  accountability  can  be 
further  divided  into  two  functional  dimensions: 
Transparency  in  the  exercise  of  power  and  enforce¬ 
ability  of  the  principle  of  responsiveness  via  the 
establishment  of  some  means  by  which  sanctions 
can  be  imposed  on  representatives  in  case  they 
misrepresent  constituents  or  violate  commonly 
accepted  standards. 

Unfortunately,  citizens  at  large  cannot  partici¬ 
pate  in  all  decisions  that  affect  them;  however, 
accountability  mechanisms  assume  that  representa¬ 
tive  responsiveness  requires  a  certain  level  of  citizen 
competence  and  participation  in  controlling  repre¬ 
sentatives.  Obviously,  political  systems  differ;  one 
of  the  fundamental  domains  of  empirical  research 
concerns  exactly  this — what  kind  of  political  insti¬ 
tutions  provide  incentives  for  representatives  to  be 
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responsive.  The  main  distinction  in  this  respect  lies 
either  between  presidential  versus  parliamentary 
systems  or  between  two  visions  of  democracy,  one 
coined  the  “majority  control  vision”  and  the  other 
the  “proportionate  influence  vision.”  The  major 
difference  between  the  latter  two  is  whether  the 
government  ought  to  be  responsive  to  the  majority 
of  people,  as  the  first  Westminster  majoritarian 
design  assumes,  or  to  as  many  people  as  possible, 
which  is  the  underlying  political  logic  of  propor¬ 
tional,  consensus-type  democracies.  Many  empiri¬ 
cal  studies  confirm  theoretical  expectations  and 
indicate  that  different  visions  of  democracy,  mani¬ 
fested  by  the  relevant  institutional  infrastructures, 
have  an  impact  on  the  type  and  quality  of  respon¬ 
siveness  in  an  anticipated  direction.  At  the  same 
time,  the  intricacies  within  responsiveness  are  sen¬ 
sitive  to  representational  role  orientations  (i.e., 
party  role,  pressure  group  role,  and  constituency- 
representational  role). 

General  politics  is  divided  into  issue  packages  or 
policy  areas.  Responsiveness  is  not  an  unmitigated 
virtue.  In  some  areas,  we  expect  politicians  to  be 
responsive;  in  others,  more  complex  and  special¬ 
ized,  we  believe  that  politicians  and  their  advisers 
should  decide  on  our  behalf  without  great  atten¬ 
tion  to  our  preferences  (e.g.,  monetary  policy, 
defense  spending).  As  a  result,  how  much  account¬ 
ability  there  should  be  is  a  difficult  issue  to  resolve 
and  can  be  decided  only  in  view  of  to  what  extent 
popular  responsiveness  is  desired. 

Components  of  Responsiveness 

Responsiveness  ought  to  be  viewed  as  a  complex, 
multidimensional  phenomenon  composed  of  a  vari¬ 
ety  of  targets  in  the  relationship  between  the  repre¬ 
sentatives  and  the  represented.  Most  commonly, 
responsiveness  pertains  to  policy  congruence. 
However,  there  are  at  least  three  other  components, 
which  result  in  service  responsiveness,  allocation 
responsiveness,  and  symbolic  responsiveness.  It  is 
only  when  all  four  components  are  taken  into 
account  simultaneously  that  one  can  truly  cover  the 
representation  phenomenon. 

Policy  responsiveness  refers  to  the  relationship 
between  a  district’s  or  electorate’s  stance  on  a  pol¬ 
icy  issue,  determined  by  some  measure  of  central 
tendency,  and  the  policy  orientation  of  the  repre¬ 
sentative  and,  consequently,  to  his  or  her  decisions 


concerning  its  implementation.  This  concerns  the 
congruence  or  fit  between  the  two  sides  of  the  rep¬ 
resentational  bond;  one  expects  a  meaningful  link 
between  the  policy  preferences  of  the  electorate  and 
the  deeds  of  their  representatives,  be  they  attitudinal 
or  behavioral.  To  claim  that  policy  responsiveness 
exists,  one  must  ensure  that  both  the  representative 
and  his  or  her  constituents  agree  on  a  particular 
policy.  Many  problems  arise:  (a)  some  scholars 
point  to  the  alleged  incompetence  of  the  constitu¬ 
ents,  (b)  others  indicate  that  in  many  policy  areas 
they  usually  prove  incompetent,  (c)  another  prob¬ 
lem  derives  from  responsiveness  to  the  long-term 
interests  of  the  constituents  versus  their  immediate 
wishes,  and  (d)  still  another  problematic  issue  con¬ 
cerns  biased  information  that  the  representative 
has  about  either  the  wishes  or  the  interests  of  the 
constituents. 

Service  responsiveness  covers  a  wide  range  of 
activities  performed  by  the  representative,  often 
dubbed  “pork  barrel  politics”  or  “case  work.” 
This  concerns  the  benefits  a  representative  can 
secure  for  the  constituents  outside  his  or  her  legis¬ 
lative  work.  At  the  individual  level,  this  ranges 
from  responding  to  letters  and  inquiries  to  inter¬ 
ventions  on  behalf  of  citizens  in  state  or  local  insti¬ 
tutions.  On  a  more  collective,  macrolevel,  the 
representative  might  embark  on  the  role  of  an 
advocate  or  lobbyist  for  local  interests. 

Allocation  responsiveness  is  relatively  similar  to 
the  “service”  type,  except  that  in  this  case  the  ini¬ 
tiative  is  rather  on  the  side  of  the  representative 
and  the  scope  of  activity  is  broader.  Allocation 
often  concerns  what  is  usually  called  “public 
goods”  and  is  related  to  the  development  of  the 
welfare  state  and  expansion  of  the  state’s  role  in 
society  and  the  economy.  Usually,  such  allocations 
benefit  entire  regions  as  a  whole  and,  in  reality,  this 
very  much  depends  on  the  representative’s  future 
orientation  and  his  or  her  ability  to  anticipate  the 
prospective  needs  and  developments  of  a  given 
area.  Seen  from  another  angle,  this  means  she  or  he 
invigorates  local  entrepreneurship,  animates  local 
communities,  and  stimulates  people’s  needs. 

Symbolic  responsiveness  is  less  behavioral  and 
more  psychological.  It  is  also  more  vague  and 
diffused  in  its  manifestations.  Symbolic  respon¬ 
siveness  pertains  more  to  diffused  trust  and  sup¬ 
port  caused  by  how  the  representative’s  role  is 
performed  and  is  less  related  to  the  particular, 
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instrumental  satisfaction  derived  from  acquiring 
some  material  goods  or  profit.  It  more  often  con¬ 
cerns  fundamental  values,  confidence  in  political 
procedures,  and  the  way  a  politician  behaves;  it 
concerns  his  or  her  integrity.  What  matters  most 
in  symbolic  responsiveness  is  that  voters  feel  rep¬ 
resented,  which  is  at  the  crux  of  the  legitimacy  of 
the  general  political  system. 

Consequently,  because  of  the  relative  indepen¬ 
dence  of  these  four  forms  of  responsiveness,  there 
is  no  reason  to  assume  that  good,  effective  respon¬ 
siveness  in  one  domain  ought  to  be  accompanied 
by  equally  effective  and  sound  responsiveness  in 
another  domain.  Particular  voters  and  social 
groups  might  be  unsatisfied  with  the  performance 
of  their  representative  in  some  policy  area,  but  at 
the  same  time,  they  may  be  relatively  satisfied  by 
the  way  he  or  she  delivers  services. 

There  are,  however,  certain  patterned  relation¬ 
ships  regarding  the  frequency  of  the  occurrence  of 
particular  forms  of  responsiveness.  If  politicians — 
for  whatever  reason — are  oriented  toward  local 
interest,  what  follows  is  the  high  priority  given  to 
“service”  and  “allocation”  responsiveness.  If, 
however,  they  are  focused  on  the  party  role,  they 
are  more  likely  to  give  precedence  to  “policy”  or 
“symbolic”  responsiveness. 

A  frequently  debated  issue  within  the  political 
representation  domain  is  the  extent  to  which,  on 
the  one  hand,  the  “will  to  represent”  and,  on  the 
other,  the  problem  of  the  district’s  or  electorate’s 
heterogeneity  are  important  for  the  quality  of  rep¬ 
resentation.  The  former — the  will  to  represent — is 
at  the  crux  of  the  responsiveness  phenomenon, 
although  in  the  classical  formulation  of  the  “stand¬ 
ing  for”  descriptive  representation,  the  will  is  not 
necessary.  The  representative  might  simply  look 
responsive  because  he  or  she  is  from  the  constitu¬ 
ents’  geographical  district.  Moreover,  even  if  the 
representative  has  a  distaste  for  representing  the 
will  of  the  people,  he  or  she  might  nevertheless 
deliver  high-quality  representation — that  is,  look 
responsive — because  his  or  her  values,  attitudes, 
and  preferences  are  similar  to  the  constituents’, 
notwithstanding  a  knowledge  or  willingness  to  fol¬ 
low  their  expectation  of  principals.  In  a  nutshell,  it 
is  important  to  know  where  such  unintentional 
and  accidental  responsiveness  occurs  and  what  are 
its  long-term  effects — for  instance,  for  the  reelec¬ 
tion  of  the  representative.  It  is  worth  noting  that 


the  mainstream  classical  approach  to  political  rep¬ 
resentation  and  responsiveness  assumes  the  will  to 
represent  as  an  important  point  of  departure  for 
deciding  to  whom  exactly,  what  precisely,  and  by 
what  means  the  representative  is  intentionally 
attempting  to  be  responsive. 

The  latter — the  issue  of  the  district’s  or  elector¬ 
ate’s  heterogeneity — is  even  more  acute  for  the  qual¬ 
ity  of  responsiveness.  In  an  ideal  world,  which  is 
rarely  the  case,  the  existence  of  a  perfectly  homoge¬ 
neous  constituency  would  allow  the  representative 
to — at  least  potentially — become  an  ideal  represen¬ 
tative,  achieving  perfect  responsiveness  vis-a-vis  a 
constituent’s  preferences.  In  fact,  the  extent  to  which 
his  or  her  representational  merits  approximate  per¬ 
fection  depends  completely  on  him  or  her,  assuming 
that  the  representative  has  access  to  reliable  infor¬ 
mation  about  the  district’s  sentiments  and  the  will 
to  familiarize  himself  or  herself  with  them.  The 
world  is,  nevertheless,  much  more  complicated, 
and  the  usual  state  comprises  complex  heteroge¬ 
neous  constituencies  coupled  with  a  limited  ability 
of  the  latter  to  effectively  communicate  its  prefer¬ 
ences.  In  such  an  instance,  what  a  “willing  to  be 
responsive”  representative  can  do  is  offer  issue 
stances  and  enact  policies  that  minimize  the  unsolv- 
able  problem  of  representing  conflicting  interests. 
The  best  the  representative  can  do  is  to  follow  the 
majority,  more  precisely;  learn  about  some  central 
tendency  in  his  or  her  constituency;  and  follow  the 
preferences  of  such  a  majority.  If,  however,  there  is 
no  clear  majority,  or — worse — the  district  prefer¬ 
ences  are  highly  polarized,  the  representative  can 
either  decide  to  be  responsive  to  one  end  of  the 
spectrum  or  offer  some  “central”  policies  that  will 
rarely  represent  the  sentiments  of  anybody  but  will 
at  least  minimize  the  discrepancy  between  the  rep¬ 
resentative  and  the  constituency  at  large.  One 
should  note,  however,  that  such  a  far-from-unique 
situation  regarding  significant  district  heterogeneity 
is  a  sound  excuse  for  the  representative  to  follow 
his  or  her  own  considered  judgments  or  conscience 
when  having  to  make  ultimate  legislative  decisions, 
which  are  often  supported  by  details  of  competent 
deliberation  among  peers. 

Focus  of  Responsiveness 

Another  method  of  depicting  the  multidimension¬ 
ality  of  responsiveness  is  through  the  foci  of  the 
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representation  relationship.  Briefly  stated,  the  foci 
can  be  organized  into  three  baskets.  The  first  is  a 
geographical  focus;  both  representatives  and  con¬ 
stituents  may  think  that  the  crucial  representative 
bond  is  between  the  representative  and  the  con¬ 
stituency  defined  in  terms  of  region,  state,  city,  or 
district.  Second,  the  representation  focus  can  be 
viewed  by  means  of  a  classical  functional  approach, 
conceiving  what  is  to  be  represented  in  terms  of 
class,  professional  group,  ethnic  or  religious  group, 
or  ideological  camp.  Third,  the  focus  of  responsive¬ 
ness  can  be  individuals,  be  they  socially  important 
and  influential  local  celebrities,  or  rank-and-file 
citizens,  or  unknown  clients  in  need  of  assistance. 

The  Causal  Link 

Some  scholars  mistakenly  assume  that  responsive¬ 
ness  by  its  very  nature  is  always  a  dependent  vari¬ 
able.  The  representation  relationship  is  far  more 
complex:  Transactions  between  the  represented 
and  representative  are  dynamic,  top  down  and 
bottom  up,  and  their  very  essence  depends  to  a 
large  degree  on  both  the  exact  component  and  the 
focus  of  responsiveness.  Responsiveness  and 
response  are  not  the  same.  In  fact  a  “purely  reac¬ 
tive”  representative  is  a  far  cry  from  the  ideal  role 
model  assumed  in  contemporary  democracies. 

This  is  a  relevant  place  to  check  the  classics. 
Hanna  Pitkin,  in  her  now  classic  1967  work  on 
representation,  does  not  define  the  concept  of 
responsiveness.  The  responsiveness  phenomenon 
features,  however,  in  the  chapter  on  the  mandate- 
independence  controversy.  Following  Burke’s 
famous  opposition,  Pitkin  also  asks  many  detailed 
questions  related  to  the  issue  of  whether  a  repre¬ 
sentative  ought  to  do  what  the  constituents  want, 
and  be  bound  by  mandates  or  direct  instructions 
from  them,  or  be  free  to  act  as  he  or  she  chooses 
based  on  his  or  her  skills,  knowledge,  and  compe¬ 
tence  to  guarantee  their  welfare.  To  precisely  grasp 
and  conceptualize  the  responsiveness  phenome¬ 
non,  it  is  extremely  important  to  decide  at  which 
point  we  lie  between  the  two  poles  of  the  ideal 
model  of  the  mandate-independence  controversy. 
On  the  one  hand,  one  may  assume  that  “good 
representation”  occurs  only  when  the  representa¬ 
tive  acts  on  direct  and  explicit  instructions  from 
his  constituents,  and  any  deviation  from  them  is 
considered  unacceptable.  At  the  other  end  of  the 


scale,  good  representation  means  complete  inde¬ 
pendence  and  full  discretion;  consequently, 
constituents  have  no  right  even  to  assess  the  fulfill¬ 
ment  of  campaign  promises;  in  this  stance,  once 
the  representative  is  elected,  he  or  she  must  be 
completely  free  to  use  his  or  her  own  judgment 
and  competence.  Of  course,  there  are  solutions 
that  fall  in  between:  One  might  expect  a  “good 
representative”  to  act  as  he  or  she  thinks  the  con¬ 
stituents  would  want,  unless  he  or  she  receives 
instructions  from  them,  which  then  must  be 
obeyed.  Whatever  the  precise  understanding  of 
good  representation  is  assumed  to  be,  it  has  direct 
consequences  on  the  way  we  conceive  the  notion 
of  responsiveness. 

An  additional  requirement  for  clarifying  this 
paradox  is  that  it  is  expected  that  the  represented 
must  reveal  the  ability  to  be  active.  Pitkin  high¬ 
lights  throughout  the  book  that  “the  represented 
must  be  both  present  and  not  present”  to  talk 
logically  of  the  very  sense  of  representation.  The 
representative  must,  of  course,  act  and  be  indepen¬ 
dent;  however,  the  represented  must  be  in  some 
sense  acting  through  him  or  her.  Moreover,  the 
relationship  between  the  two  becomes  meaningful 
only  if  the  represented  is  capable  of  action,  and  has 
and  is  ready  to  present  a  will,  judgment,  or  a  pref¬ 
erence  on  his  or  her  own.  If  this  capability  and 
readiness  to  act  is  missing,  the  whole  idea  of  repre¬ 
sentation  as  substantive  activity  is  questionable. 
Acting  on  behalf  of  someone  who  is  completely 
helpless,  has  no  idea  of  what  his  or  her  preferences 
are,  or  is  totally  incompetent  is  not  representing. 
As  a  consequence,  the  process  of  representing  con¬ 
sists  of  supporting  and  achieving  the  interests  of 
the  represented,  assuming  that  the  represented  is 
capable  of  judgment  and  action,  but  at  the  same 
time,  it  does  not  act  and  does  not  object  to  what  is 
done  on  his  or  her  behalf  by  the  representative. 

What  [italics  added]  the  representative  does  must 
be  in  his  principal’s  interest,  but  the  way  [italics 
added]  he  does  it  must  be  responsive  to  the 
principal’s  wishes.  .  .  .  Responsiveness  seems  to 
have  a  kind  of  negative  criterion:  conflict  must  be 
possible  and  yet  nevertheless  not  occur.  (Pitkin, 
1967/1985,  p.  155) 

The  reasons  for  either  following  the  will  of  the 
people  or  promoting  their  interest  is  crucial  from 
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yet  another  perspective — the  cognitive  ability  and 
mobilization  of  the  citizenry  at  large.  Numerous 
methodologically  sound  empirical  endeavors  show 
that  a  very  small  proportion  of  the  public  can  actu¬ 
ally  articulate  their  interests;  moreover,  even  fewer 
people  can  manifest  coherent  belief  systems  that 
allow  their  values  and  preferences  to  be  translated 
into  comprehensive  rational  policies.  In  such  a  situ¬ 
ation,  the  dilemma  of  the  representative,  whether 
and  to  what  exactly  he  has  to  be  responsive,  is 
extremely  complicated.  If  no  conflict  between  citi¬ 
zens’  wishes  and  their  interests  is  visible,  the  role  of 
the  representative  is  relatively  simple,  but  on  rare 
occasions,  this  does  occur.  Obviously,  the  represen¬ 
tative  need  not  follow  the  wishes  of  the  represented, 
but  he  or  she  must  consider  them,  especially  when 
they  are  in  conflict  with  what  he  sees  as  the  con¬ 
stituent’s  interests,  because  the  reason  for  this  dis¬ 
crepancy  must  be  found  and  clarified.  This  stipula¬ 
tion  is  at  least  true  for  democracies.  In  other 
words,  if  a  representative  behaves  and  acts  con¬ 
trary  to  the  publicly  known  wishes  of  his  elector¬ 
ate,  some  explicit  rationale  is  required.  Such 
behavior  calls  for  explanation  and  justification.  It 
is  precisely  at  this  point  that  the  issue  of  respon¬ 
siveness  comes  into  play.  The  representative  has  to 
explain  why  he  or  she  is  unresponsive;  and  not  any 
explanation  will  do,  and  only  certain  clarifications 
will  be  legitimized  by  the  people.  On  the  other 
hand,  accepted  justifications  boost  the  chances  of 
the  representative’s  reelection. 

Throughout  this  classic  volume,  one  finds  a  con¬ 
vincing  picture  of  political  representation  that  is 
impersonal  and  institutional.  Whenever  we  refer  to 
a  system  or  government  as  “representative”  we  are 
in  fact  describing  an  institutional  arrangement  and 
its  mechanisms.  The  representation,  and  its  quality, 
emerges  not  from  an  individual  will  to  represent, 
but  from  the  logic  of  the  structure  and  functions  of 
the  system.  Political  representation  comes  about 
when  the  system  is  effectively  looking  “after  the 
public  interest  and  is  responsive  to  public  opinion 
except  insofar  as  nonresponsiveness  can  be  justified 
in  terms  of  public  interest”  (Pitkin,  1967/1985, 
p.  224).  These  traits  of  representation — its  “pub¬ 
lic”  and  “institutional”  character — contrary  to 
many  assumptions  about  the  individualistic,  dyadic 
nature  of  responsiveness,  can  emerge  indepen¬ 
dently  from  any  deliberate  interactions  between 
voters  and  representatives.  It  is  because  of  the  very 


mechanisms  of  free  media,  separation  of  powers, 
regular  fair  elections,  and  other  inventions  of  lib¬ 
eral  democracies  that  responsiveness  is  likely  to 
emerge  even  from  a  political  system  in  which  par¬ 
ticular  individuals,  voters,  and  representatives  are 
pursuing  quite  other  goals. 

This  problem — to  what  extent  responsiveness  is 
a  systemic  property  and  to  what  extent  it  should 
be  treated  as  emerging  from  individual  deliberate 
interactions — remains  a  question  open  to  empiri¬ 
cal  inquiry. 

Radoslaw  Markoivski 
Polish  Academy  of  Sciences  and  Warsaw  School 
of  Social  Sciences  and  Humanities 
Warsaw,  Poland 
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Responsiveness  of  Bureaucracy 


Responsiveness  is  one  member  of  a  family  of 
related  and  positively  valued  concepts  such  as 
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responsibility  and  answerability  that  have  to  do 
with  the  accountability  of  bureaucrats.  There  is,  at 
first  glance,  generally,  little  controversy  about  the 
proposition  that  bureaucrats  should  be  responsive 
to  the  (elected)  politicians  or  their  appointees  at 
the  top  levels  of  governments.  This  seems  obvious, 
or  it  would  be  impossible  to  hold  politicians 
accountable  for  the  actions  of  the  governments 
they  lead.  However,  beneath  these  simple  state¬ 
ments  lurks  a  minefield  of  complications. 

Responsive  to  Whom  or  to  What? 

How  responsive  do  we  want  bureaucrats  to  be  to 
those  at  the  top  levels  of  government?  Might  we 
also  want  them  to  answer  to  those  they  serve,  their 
“customers,”  in  the  language  favored  by  recent 
reformers?  Or  to  the  norms  and  traditions  of  the 
organizations  they  serve?  Or  to  external  expecta¬ 
tions  that  come  from  their  professions — to  the 
standards  and  norms  of  their  roles  as  scientists, 
engineers,  social  workers,  or  related  professionals? 
Or  to  some  moral  standards  that  are  widely  shared 
in  the  society?  Or  to  the  desires  of  those  in  the 
legislative  branch  of  government,  especially  if  they 
work  in  a  separation-of-powers  system,  such  as 
that  of  the  United  States? 

When  there  is  a  clash  between  the  demands  of 
the  many  forces  that  affect  bureaucrats  and  the 
orders  of  those  at  the  top  of  governments,  the  situ¬ 
ation  becomes  complex.  The  most  extreme  and,  in 
a  sense,  the  most  straightforward  to  deal  with  is 
when  government  leaders  demand  actions  that 
clash  with  common  morality — mass  killing,  tor¬ 
ture,  depriving  people  of  basic  rights,  and  the  like. 
There  bureaucrats  are  looked  on,  at  least  after  the 
fact,  as  heroes  when  they  are  not  responsive. 

Most  situations  are  not  so  clear,  even  in  retro¬ 
spect.  Bureaucrats  in  agencies  responsible  for 
protecting  the  environment  may  be  pressured  by 
scientific  evidence  on  the  one  side  that  suggests  the 
need  for  strict  regulation  of  pesticides  or  of  factory 
emissions  and  on  the  other  by  political  leaders  who 
oppose  such  regulation  and  interpret  the  laws  in 
these  areas  in  a  way  that  preclude  strict  regulation. 
Customers  or  constituents  of  organizations  may 
have  preferences  that  differ  from  those  at  the  top 
of  governments,  and  they  may  be  in  a  position  to 
make  life  very  uncomfortable,  now  or  in  the 
future,  for  those  who  do  not  respond  to  them. 


Bureaucrats  who  have  been  trained  as  scientists, 
engineers,  or  social  workers  may  face  demands 
from  political  leaders  who  would  rather  please  an 
important  constituency  than  follow  a  course  rec¬ 
ommended  by  scientific  findings  or  the  standards 
of  a  given  profession.  Legislators,  especially  in 
systems  with  strong  legislative  committees,  may 
have  different  perspectives  than  executive  leaders 
on  what  should  be  done,  and  these  legislators  may 
be  quite  influential  because  of  longevity  in  office  or 
the  related  possibility  that  they  will  be  there  long 
after  executive  leaders  have  departed. 

Responsiveness  Problems 

Bureaucrats,  especially  those  at  or  near  the  top 
levels  of  their  organizations,  have  numerous 
resources  that  they  bring  to  the  relationship  with 
politicians.  They  have  the  expertise  gained  through 
long  service  in  a  given  area,  supplemented  often  by 
specialized  training  prior  to  entering  the  civil  ser¬ 
vice  and  training  on  the  job  and  in  programs 
designed  to  increase  their  knowledge.  Max  Weber, 
perhaps  the  most  famous  theorist  of  bureaucracy, 
put  a  great  deal  of  emphasis  on  the  power  this 
expertise  brought,  and  though  those  who  study 
bureaucracy  may  debate  the  level  of  bureaucratic 
influence  under  various  circumstances,  few  would 
deny  it.  Senior  career  civil  servants  have  a  great 
deal  of  knowledge  about  the  history  of  their  pro¬ 
grams,  the  proposals  that  have  been  made  in  the 
past  to  change  their  programs,  the  constellation  of 
forces  that  stand  in  support  or  in  opposition  to 
their  programs,  and  what  has  worked  or  failed 
to  work  in  the  past.  This  is  useful  information  to 
politicians,  and  they  ignore  it  at  their  peril,  but  it 
also  permits  bureaucrats  to  help  shape  the  agenda 
and  to  exert  control  over  what  is  or  is  not  done.  In 
a  real  sense,  bureaucrats  responding  to  requests  for 
information  may  make  their  nominal  masters 
responsive  to  them  rather  than  the  other  way 
around.  This  is  a  consistent  dilemma  for  executive 
politicians  seeking  change. 

A  related  problem  is  that  leaders  may  be  reluc¬ 
tant  to  give  clear  signals  to  their  subordinates 
about  policy  changes  because  too  much  clarity  can 
be  a  source  of  political  opposition  or  embarrass¬ 
ment  after  proposals  have  been  compromised  to 
secure  adoption.  This  ambiguity  gives  civil  ser¬ 
vants  opportunities  to  shape  their  responses  in 
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ways  that  insert  their  own  preferences  or  those  of 
supporters  of  their  programs  into  the  mix  without 
seeming  to  act  in  an  unresponsive  manner. 

Civil  servants  can  also  act  in  a  passive-aggressive 
manner,  withholding  information  that  would  be 
very  useful  to  their  superiors.  Political  leaders 
should  want  to  avoid  this,  but  if  they  distrust  or 
ignore  civil  servants,  they  may  inadvertently 
encourage  such  behavior. 

On  the  other  side  of  this  problem  is  the  dilemma 
bureaucrats  may  face  when  they  deliver  advice  or 
analyses  to  politicians  based  on  what  they  believe 
in  good  conscience  are  the  facts  or  are  reasonable 
conclusions  based  on  experience.  Politicians,  moti¬ 
vated  by  their  ideologies — and  research  shows  that 
they  are  generally  more  motivated  by  ideologies 
than  bureaucrats — may  react  harshly  to  such  infor¬ 
mation  if  it  does  not  fit  their  predilections  and  may 
blame  the  messenger  for  what  they  are  told.  The 
easy  way  out  for  bureaucrats  is  to  avoid  providing 
such  information  or  advice,  or  even  to  distort  what 
is  communicated  to  fit  the  predilections  of  political 
superiors,  though  one  can  make  a  strong  argument 
that  such  a  course  is  not  the  responsible  way  for  a 
professional  to  be  responsive. 

Finally,  as  mentioned  above,  in  real  political 
systems,  especially  in  separation  of  powers  sys¬ 
tems,  there  are  multiple  authorities  with  some  say 
over  what  civil  servants  do.  How  to  deal  with 
conflicting  demands  from  these  authorities  can  be 
a  big  problem  for  those  who  see  it  as  their  role  to 
be  responsive  and  an  opportunity  for  those  with 
their  own  agendas. 

Exit,  Voice,  or  Loyalty? 

The  most  famous  book  relevant  to  this  general 
subject  is  Exit,  Voice,  and  Loyalty,  by  Albert 
Hirschman.  Hirschman  argues  that  an  organiza¬ 
tion’s  customers  or  employees  have  three  basic 
things  they  can  do  when  they  are  dissatisfied  with 
the  policies  or  actions  of  those  who  head  the  orga¬ 
nization.  One  option  is  exit.  Civil  servants,  the 
focus  of  this  essay,  can  resign  in  protest.  This  may 
be  a  futile  gesture,  but  it  may  also — if  handled 
astutely — result  in  much  negative  publicity  for  the 
organization  or  policy  that  is  the  target  of  the  res¬ 
ignation.  A  problem  for  a  career  civil  servant  is 
that  he  or  she  may  not  have  the  resources  to  cope 
effectively  with  the  aftermath  of  exit.  In  addition, 


the  gesture  may  be  futile.  The  organization  may  be 
able  to  simply  shrug  off  the  resignation  and  con¬ 
tinue  doing  what  it  did  before  with  relatively  little 
problem.  On  the  other  hand,  a  dramatic  resigna¬ 
tion  may  cause  the  organization  a  good  bit  of 
trouble;  indeed,  even  the  threat  of  resignation  may 
be  adequate  from  the  standpoint  of  the  civil  ser¬ 
vant.  Whatever  the  ultimate  outcome,  exit  is  a  way 
to  avoid  responsiveness. 

A  second  option  is  voice.  Here,  the  civil  servant 
stays  in  the  organization  and  gives  voice  to  his  or 
her  concerns.  At  one  level,  this  is  an  expression  of 
loyalty  because  the  civil  servant  stays  and  expresses 
what  he  or  she  believes  should  be  done  (or  what  the 
available  evidence  suggests  should  be  done),  but  it 
may  be  greeted  with  hostility  by  those  at  the  top  of 
the  organization.  The  civil  servant  who  gives  voice 
may  be  relegated  to  an  unimportant  job,  not 
receive  promotions,  or  be  the  target  of  other  acts  of 
recrimination.  Evidence  indicates  that  there  is  a 
cultural  dimension  to  voice — that  is,  how  appropri¬ 
ate  people  think  it  is  as  a  means  to  express  dissent. 
For  example,  in  a  comparative  study  done  in  the 
mid-1980s,  82%  of  West  German  civil  servants  felt 
that  they  should  support  their  government’s  poli¬ 
cies  even  if  they  personally  disagreed  and  found 
them  “very  undesirable  or  ill  considered,”  whereas 
only  42%  of  U.S.  civil  servants  felt  the  same  way. 

A  more  aggressive  form  of  voice,  especially 
attractive  if  exiting  is  not  a  readily  available 
choice,  is  sabotage.  Here  the  civil  servant  remains 
in  the  organization  but  gives  voice  by  passing 
information  to  those  in  a  position  to  use  it  against 
the  organization.  Ironically,  those  who  feel 
extremely  attached  to  the  organization  may  find 
this  the  best  course  of  action.  So-called  whistle¬ 
blowing  has  become  more  and  more  institutional¬ 
ized  in  U.S.  organizations,  with  formal  protections 
in  place  to  protect  the  whistle-blower,  but  the 
simple  leaking  of  information  that  can  compro¬ 
mise  the  direction  of  an  organization  by  the  cur¬ 
rent  leadership  is  a  time-honored  practice  in 
bureaucracies.  Research  findings  in  the  United 
States  indicate  that  civil  servants  would  prefer  to 
give  voice  within  the  confines  of  the  government, 
resigning  if  persuasion  does  not  work  and  the  dis¬ 
agreement  is  severe.  Leaking,  then,  may  be  thought 
of  as  something  between  voice  and  exit. 

The  final  option  is  loyalty.  Here  the  civil  ser¬ 
vant  sticks  with  the  organization  and  follows  its 
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directives  no  matter  what  he  or  she  might  think  of 
what  is  asked.  This  is  responsiveness  that  may  be 
attractive  to  organizational  leadership  because 
what  it  wants  will  be  implemented  no  matter 
what  the  people  in  the  organization’s  bureaucracy 
might  think  or  prefer.  But  there  is  a  cost,  and  the 
cost  can  be  significant  if  the  leadership’s  goals  are 
unattainable,  extremely  costly,  extremely  unwise, 
or  some  combination  of  the  above.  Bureaucratic 
voice,  at  a  minimum,  is  often  in  the  interest  of  the 
leadership;  abject  submissiveness  may  be  much 
regretted  after  the  fact. 

Conclusion 

Bureaucratic  responsiveness  is  important  in  a  gov¬ 
erning  system  where  top  decision  makers  are  ulti¬ 
mately  held  accountable  for  the  policies  and  actions 
of  the  governments  they  lead.  We  would  not  want 
politicians  to  say  credibly  that  while  the  law  per¬ 
mitted  alternate  actions,  the  career  bureaucracy 
was  unresponsive  and  there  was  no  way  because  of 
that  to  change  programs  or  policies.  On  the  other 
hand,  we  also  should  be  wary  of  a  totally  submis¬ 
sive  bureaucracy.  Bureaucrats  carry  with  them  the 
expertise  and  experience  to  help  make  policy  deci¬ 
sions  viable  and  policy  implementation  effective. 
They  know  a  great  deal  about  what  has  worked  in 
the  past,  what  has  failed  for  technical  or  political 
reasons,  and  what  course  might  be  best  for  accom¬ 
plishing  what  political  leaders  would  like,  though 
this  may  require  changes  and  compromises  that 
political  leaders  would  prefer  to  avoid.  Ideally, 
bureaucrats  also  adhere  to  a  code  of  personal  and 
professional  conduct  that  makes  them  valuable 
servants  of  the  nation,  leading  them  to  resist  the 
worst  impulses  that  might  seize  national  leaders. 
An  “unresponsive”  civil  servant  who  refused  to 
send  victims  to  concentration  camps  or  to  help 
organize  torture  sessions  would  deserve  the  thanks 
of  his  or  her  nation.  It  is  hoped  that  cases  where 
this  is  necessary  will  be  extraordinarily  rare.  In 
between  that  extreme  and  abject  obedience  is  found 
a  complicated  world  where  bureaucratic  respon¬ 
siveness  is  much  sought  after  but,  for  the  most  part, 
not  fully  attainable  or  even  fully  desirable. 
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Revisionism 


As  a  term  of  art,  revisionism  surfaced  in  the  late 
1800s  and  has  since  steadily  gained  in  scope.  Its 
meaning  may  be  either  critical  or  assertive,  but  its 
purpose  is  always  to  probe  the  intellectual  validity 
and  the  political  legitimacy  of  a  dominant  ortho¬ 
doxy.  The  notion  typically  implies  an  ongoing 
attempt  to  revise,  revisit,  or  reconsider  an  official, 
mainstream,  well-established,  or  at  least  widely 
accepted  tenet,  brought  forth  by  the  history  of 
political  thought  and  events  that  claims  ethical 
authenticity  and  prescribes  normative  conduct  in 
public  affairs.  For  regular  criticism  to  qualify  as 
revisionism,  it  should  challenge  canons  of  inter¬ 
pretation  of  facts,  processes,  and/or  ideas  that  are 
conducive  to  the  configuration  (or  are  the  intended 
outcome)  of  a  system  of  power  relations  in  the 
international  arena,  national  politics,  or  aca¬ 
demia.  Revisionism  deserves  its  name  to  the  extent 
that  it  remains  a  failed  orthodoxy  and  its  propo¬ 
nents  are  singled  out  as  deviationists  not  only 
from  a  prevailing  construal  of  truth  but  also  from 
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the  social  administration  of  it.  Hence,  revisionism 
and  revisionists  are  often  subject  to  coercive  inter¬ 
ventions  from  the  incumbent  orthodoxy:  sanc¬ 
tions  in  international  relations,  purges  and  trials 
in  nondemocratic  national  settings,  and  marginal¬ 
ization  and  exclusion  in  the  academic  sphere. 
Revisionism  itself,  especially  when  manifested — as 
is  frequently  the  case — in  the  realm  of  arts  and 
letters,  is  sometimes  inclined  to  prize  such  an 
imposed  peripheral  position  in  the  production  of 
values  emphasizing  negation  and  evasion  as  cre¬ 
ative  hermeneutical  forces.  At  any  rate,  revision¬ 
ism  could  be  the  name  of  a  contentious  politics  of 
practical  knowledge.  This  entry  is  especially 
focused  on  the  variety  of  revisionisms  and  the 
ambiguities  of  the  concept. 

Variety  of  Revisionisms 

Three  authoritative  interpretations  of  intellectual 
and  political  history,  and  the  political  arrangements 
that  accompany  them,  have  engendered  the  main 
versions  of  revisionism.  They  are  related  to 
Marxism,  the  Great  War,  and  the  Holocaust.  As  a 
strong  and  recurrent  political  label,  revisionism 
was  used  aggressively  in  1908  by  Vladimir  Lenin 
with  the  explicit  intention  of  defining  a  canonical 
brand  of  Marxism  against  what  he  deemed  to  be 
the  nonrevolutionary  reformist  trend  gaining  sway 
over  German  and  Austrian  social  democracy.  The 
evolution  of  Marxism  as  the  official  ideology  of 
state  socialism  could  be  read  from  then  on  as  a  nar¬ 
rative  of  orthodoxy  enforcing  its  speculative  outlook 
and  political  command  by  permanently  exposing 
and  overcoming  present  or  suspected  revisionisms. 
In  the  history  of  international  relations,  revision¬ 
ism  is  a  descriptor  of  the  foreign  policies  of  the 
former  Central  Powers  that  stood  in  defiance  of  the 
status  quo  established  in  1919  by  the  Treaty  of 
Versailles.  The  classic  example  would  be  Hungary, 
a  polity  traumatized  by  severe  territorial  losses  and 
insulated  in  a  culture  of  denial  of  its  new  identity  as 
a  diminished  European  actor.  The  normative  elu¬ 
siveness  of  the  international  order  presided  over  by 
the  Society  of  Nations  allowed  Japan,  albeit  a  vic¬ 
tor  of  the  Great  War,  to  adopt  a  revisionist  lan¬ 
guage,  critical  of  Western  supremacy,  and  to 
embark  on  revisionist  operations  in  East  Asia  and 
the  Pacific;  in  the  process,  the  democratic-oriented 
Japanese  governments  of  the  1920s  were  replaced 


by  a  military-driven  direction  of  national  politics  in 
the  1930s.  German  revisionism  was  rooted  in  the 
19th-century  volkiscb  (ethnic)  movement  and 
related  to  previous  forms  of  national  revanchism  (a 
term  used  since  the  1870s  to  describe  a  political 
manifestation  of  the  will  to  reverse  territorial  losses 
incurred  by  a  country,  often  following  a  war)  and 
irredentism  (any  position  advocating  annexation  of 
territories  administered  by  another  state  on  the 
grounds  of  common  ethnicity  or  prior  historical 
possession,  actual  or  alleged).  The  initially  progres¬ 
sive,  then  turned  “Old  Right”  American  scholar 
Harry  Elmer  Barnes  (1889-1968),  perhaps  the 
most  prominent  and  prolific  revisionist  historian, 
echoed  the  German  style  in  his  work.  In  his  book 
The  Genesis  of  the  World  War  (1929),  Barnes 
argued  against  what  he  described  as  “court  histori¬ 
ans,”  that  the  Great  War  (World  War  I)  was  by  no 
proper  historical  account  a  just  one,  and  that  all  the 
powers  concerned  share  responsibility  for  its  causes 
and  course  of  events.  He  extended  his  argument 
after  World  War  II,  theorizing  against  the  “black¬ 
out”  of  mainstream  historiography,  which  he 
denounced  as  a  myth  factory  working  for  political 
purposes  and  under  political  command.  Expanding 
his  prior  analyses,  Barnes  maintained  that  Nazi 
German  war  crimes  and  the  extermination  of  the 
European  Jews  were  to  be  considered  at  best 
unproved  facts,  anyway  matched  by  the  treatment 
imposed  by  the  Allies  to  the  vanquished  Germans. 

The  legacy  of  Barnes  is  twofold.  On  the  one 
hand,  he  remains  a  source  of  inspiration  for  many 
libertarians  connected  to  the  Mises  Institute,  among 
whom  Murray  N.  Rothbard  (1926-1995)  stands 
out.  According  to  them,  revisionism  should  be  con¬ 
strued  as  a  teacher  of  freedom  of  speech  and 
thought,  of  truth  and  rationality  in  a  world  sub¬ 
dued  by  propaganda,  and  of  indoctrination  and 
spectacular  mythologies  taking  advantage  of  the 
gullibility  of  a  misinformed  general  public.  On 
the  other  hand,  Barnes’s  work  is  used  as  a  corner¬ 
stone  by  the  massive  body  of  literature  displaying  a 
diverse  array  of  conspiracy  theories,  efforts  of  revi¬ 
sion  of  national  canonized  histories,  the  explicit  or 
implicit  denial  of  the  Holocaust,  and  other  varieties 
of  negationism.  This  last  form  of  revisionism  aims 
not  only  at  questioning  and  raising  doubts  as  to  the 
veracity  and  the  scale  of  the  Hitlerian  genocide  but 
also,  and  most  important,  at  leveling  the  guilt  of  all 
parties,  civilians  included,  involved  in  the  two 
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conflagrations  of  the  20th  century  and  their  respec¬ 
tive  aftermaths.  Had  abjection  been  evenly  distrib¬ 
uted,  there  would  ultimately  be  no  irrational 
aggressor  and  no  defender  of  just  causes,  no  victim 
to  be  remembered,  and  no  executioner  to  be  held 
responsible.  In  the  negationist  view,  the  politics  of 
retribution  and  memory  are  but  tools  of  control  of 
the  public  mind  employed  by  conspiring  minorities 
ruling  over  markets,  the  media,  and  some  govern¬ 
ments.  This  literature,  networking  across  Western 
cultures  and  the  Middle  East,  manifests  itself  as  an 
alternative  type  of  historical  writing  and  habitually 
addresses  marginal  but  committed  audiences  asso¬ 
ciated  with  extreme  Right  and  fundamentalist 
political  persuasions.  Fostered  by  the  language  of 
history  and  doctrine,  revisionism  is  always  plead¬ 
ing,  if  only  by  implication,  for  a  transfiguration  of 
a  domestic  political  regime  or  the  international 
order. 

Ambiguities  of  the  Concept 

However  miscellaneous  its  major  historical  shape, 
revisionism  emerged  in  normative  terms  as  a  word 
of  no  certain  meaning.  Its  first  uses  point  to  this 
built-in  ambiguity.  The  earliest  revisionists  appear 
to  have  been,  rather  technically,  the  advocates  of 
the  judicial  review  of  the  notorious  Dreyfus  affair 
that  stirred  political  passions  in  1894-1895  France. 
After  the  exculpation  of  Alfred  Dreyfus,  the  notion 
itself  was  inverted  by  conservative  voices  denounc¬ 
ing  not  a  mere  reversal  of  justice  but  a  revision  of 
history.  The  case  may  provide  a  pattern  for  under¬ 
standing  the  very  fabric  of  revisionism:  It  is  almost 
never  an  intellectual  controversy  about  how  rea¬ 
son  could  restore  facts  to  their  pristine  historical 
state  but  rather  a  political  contention  on  how  a 
society  should  form  its  beliefs  about  what  facts 
ought  to  stand  for  and  who  is  entitled  to  expound 
their  position.  This  indistinct  word  turned  into  a 
concept  at  about  the  same  time  in  Marxist  German 
thought.  Eduard  Bernstein  (1850-1932)  authored 
between  1896  and  1898  a  series  of  articles  devoted 
to  The  Problems  of  Socialism,  instantly  arousing  a 
long-lasting  controversy  as  to  how  Marx’s  entwined 
theories  on  capitalism  and  proletarian  revolution, 
coined  in  the  mid-1800s,  should  be  empirically 
tested  for  the  political  use  of  the  working-class 
movement.  Bernstein’s  critique  of  the  inaccuracy 
of  the  Marxian  predictions  about  the  inevitable 


collapse  of  capitalist  economy,  the  accelerated 
impoverishment  of  the  working  class,  the  evapora¬ 
tion  of  the  middle  class,  as  well  as  his  reshuf¬ 
fling  of  the  labor  theory  of  value  and  the  dismissal 
of  class  struggle  as  the  privileged  venue  of  social 
change  were  initially  rejected  by  his  fellow  social¬ 
ists  (Karl  Kautsky,  August  Bebel,  and  Rosa 
Fuxemburg)  only  to  lay  a  few  years  later  the  foun¬ 
dations  of  the  enduring  orthodoxy  guiding  the  poli¬ 
cies  of  the  Western  and  Central  European  Socialist 
and  Social  Democratic  parties.  What  emerged  as 
orthodoxy  in  Germany,  France,  and  later  Britain 
was  dubbed  revisionist  by  the  Bolsheviks  even 
before  they  seized  power  in  Russia.  Lenin  wrote  it 
clearly:  To  amend  Marx  is  blatant  proof  of  hostil¬ 
ity  toward  Marxism  (even  if  a  Fabian  anecdote  has 
it  that  the  elderly  Marx  did  not  consider  himself  a 
Marxist).  Hence,  revisionism  grows  to  be  the  elab¬ 
orate  but  standard  crime  of  betrayal  of  the  commu¬ 
nist  project.  After  October  1917,  Lenin’s  trium¬ 
phant  party  imposed  itself  as  nothing  less  than  revo¬ 
lutionary  (i.e.,  genuine)  Marxism  incarnated  in  one 
country  and  organized  globally  in  the  Communist 
International.  By  way  of  consequence,  to  tell  apart 
orthodoxy  from  reformism  was  no  longer  the  task 
of  a  “professorial  science”  of  Marxism  (to  quote 
Lenin)  but  the  public  policy  of  a  political  regime 
that  raised  “scientific  socialism”  to  the  dignity  of  a 
practical  philosophy  able  to  have  recourse  not  only 
to  means  of  persuasion  but  also  to  instruments  of 
coercion.  Plain  intellectual  criticism  (expressed, 
suspected,  or  attributed)  deserved  to  be  qualified  as 
revisionism  as  soon  as  a  sociological  dimension 
was  tactically  attached  to  it:  Prone  by  definition  to 
embourgeoisement,  revisionists  were  “objectively” 
doomed  to  the  condition  of  “social  fascists,”  pro¬ 
moters  of  state  capitalism,  elitists,  and  agents  of 
imperialism.  Henceforth  a  matter  of  bureaucratic 
decision  and  penal  sanction,  revisionism  multiplied 
its  facets  and  materializations  as  the  Soviet  leader¬ 
ship  moved  to  provide  a  Marxist/Leninist-certified 
interpretation  of  international  and  domestic  politi¬ 
cal  events  and  social  processes.  As  long  as  he  was 
in  command,  Stalin’s  thinking  and  actions  embod¬ 
ied  orthodoxy  for  all  seasons:  Yesterday’s  ortho¬ 
dox  might  have  easily  been  unmasked  as  today’s 
revisionist,  as  in  the  case  of  Trotsky  and  many 
others.  Or  else,  an  orthodox  at  home  could  look 
from  abroad  like  a  revisionist.  For  instance,  in 
1957,  the  Polish  leader  Wladislaw  Gomulka,  while 
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formally  charging  Leszek  Kolakowski  with  revi¬ 
sionism  (alongside  other  Marxist  philosophers  who 
were  experiencing  the  Khrushchev  report  as  an 
intellectual  liberation),  was  himself  accused  of  revi¬ 
sionism  by  his  Romanian  communist  counterparts. 
In  the  early  1960s,  Nikita  Khrushchev  was  in  turn 
denounced  as  a  “modern  revisionist”  by  Mao 
Zedong.  Two  decades  later,  Mao  himself  was 
labeled  as  one  by  the  Albanian  communists,  proud 
to  proclaim  themselves  antirevisionists.  The  term 
was  briefly  revived  in  1989  by  the  Romanian  hard¬ 
liners  as  a  causal  explanation  for  the  roundtable 
talks  through  which  Hungarian  or  Polish  ruling 
parties  were  contemplating  free  and  fair  elections 
and  eventually  a  transition  to  political  pluralism 
and  unplanned  economy.  After  the  demise  of  com¬ 
munism,  revisionism  (self-styled,  imaginary,  or 
indisputable)  still  lends  itself  to  a  variety  of  uses  in 
international  and  domestic  politics  or  in  academia 
but  seems  to  have  exhausted  its  career  as  a  concept. 
As  a  moral  and  political  language,  it  still  can  map 
the  hermeneutical  and  ethical  encounters  of  politics 
and  history. 

Daniel  Barbu 
University  of  Bucharest 
Bucharest,  Romania 
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Revolution 


Social  scientists  have  provided  two  alternative 
definitions  of  revolution.  Some  theorists  articulate 


a  political  definition  of  revolution  characterized 
by  the  forcible  transfer  of  state  power.  Charles 
Tilly  defines  revolution  as  forcible  transfer  of 
power  in  the  course  of  a  struggle  involving  at  least 
two  distinct  blocs  of  contenders  that  make  incom¬ 
patible  claims  to  control  the  state  with  some  sig¬ 
nificant  segments  of  the  population  supporting  the 
claims  of  the  rival  contenders.  Others  developed 
criteria  that  incorporated  significant  social  out¬ 
comes.  Theda  Skocpol  is  interested  in  social 
revolutions — that  is,  alterations  in  both  the  politi¬ 
cal  and  economic  structures  of  society.  Skocpol’s 
definition  requires  a  rapid,  basic  transformation 
of  the  state  and  class  structures  that  are  carried 
out  in  part  through  class-based  revolts  from 
below.  Social  revolutions,  though  rare  phenomena, 
have  produced  the  most  fundamental  changes  in 
the  modern  world  and  social  life.  For  this  reason, 
this  entry  will  focus  on  the  causes,  processes,  and 
outcome  of  social  revolutions  and  will  review 
alternative  theories,  namely,  class  analysis  theories, 
state-centered  theories,  political-conflicts  theories, 
and  theories  that  stress  the  role  of  ideological 
transformation  in  accounting  for  revolutions. 

Revolutions  have  played  a  significant  role  in  the 
rise  of  the  modern  world  and,  over  the  past  few 
centuries,  have  transformed  the  political  and  eco¬ 
nomic  systems  of  social  life  and  at  least  partially 
altered  the  nature  of  social  relations.  In  the  West, 
modern  revolutions  have  been  produced  by  con¬ 
flicts  over  the  distribution  of  material  resources, 
the  allocation  and  extent  of  political  power,  and 
the  essence  of  social  relations.  These  revolutions, 
in  turn,  eventually  gave  rise  to  modern  capitalism, 
democracy,  and  various  social  movements,  includ¬ 
ing  the  abolition  of  slavery,  socialist  and  labor 
movements,  civil  rights,  women’s  equality,  and  gay 
and  lesbian  rights.  In  the  developing  world, 
national  liberation  movements  and  revolutions 
have  produced  independence,  nationalist  regimes, 
and,  in  a  few  cases,  socialist  transformations. 

Revolutionary  movements  and  transformations 
have  also  stimulated  scholarly  debates  about  the 
causes,  processes,  and  outcome  of  revolutions  and 
large-scale  social  conflicts.  Indeed,  18th-  and  19th- 
century  European  revolutions  and  revolutionary 
movements  influenced  Karl  Marx’s  analysis  of 
capitalism,  class  conflict,  and  revolution,  which  in 
turn  influenced  generations  of  20th-century  social 
scientists. 
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Causes  of  Revolutions 
Modernization  and  Political  Instability 

Modernization  theorists  claim  that  rapid  eco¬ 
nomic  development  and  social  change  in  develop¬ 
ing  countries  can  be  destabilizing  as  large-scale 
social  transformations  such  as  commercialization, 
industrialization,  and  urbanization  dislocate  and 
uproot  large  segments  of  the  population.  These 
changes,  in  turn,  may  generate  some  measure  of 
confusion  and  anomie,  leading  to  antisocial  activi¬ 
ties  and  destructive  violent  behavior.  In  the  absence 
of  integrative  forces  within  the  existing  order,  the 
uprooted  may  join  revolutionary  movements  to 
integrate  into  the  new  social  system.  Other  ver¬ 
sions  of  the  theory  emphasize  the  political  dimen¬ 
sion  of  rapid  modernization,  which  may  extend 
political  consciousness,  expand  political  mobiliza¬ 
tion,  and  instigate  demands  for  political  participa¬ 
tion.  These  developments,  in  turn,  may  undermine 
traditional  political  authority  and  complicate  the 
creation  of  new  political  institutions,  which  may 
result  in  disorder  and  political  instability. 

Class  Conflict  and  Revolutions 

Class  analysts  of  revolution  claim  that  the  key 
to  understanding  revolutions  lies  in  the  conflict 
over  the  distribution  of  material  resources.  These 
theorists  often  point  to  economic  polarization, 
concentration  of  wealth  and  income,  and  rising 
disparities  in  promoting  conflicts  and  revolutions. 
Under  highly  polarized  economic  conditions,  the 
dominant  and  lower  classes  may  not  find  common 
ground  for  compromise  and  peaceful  coexistence 
but  instead  may  adopt  extremist  political  positions 
leading  to  conflict  and  revolution. 

Marx  was  a  leading  proponent  of  the  class  con¬ 
flict  model.  He  viewed  the  contradiction  between 
the  forces  and  relations  of  production  within  a 
single  mode  of  production  to  be  pivotal  in  generat¬ 
ing  class  conflict  and  revolutions.  He  claimed  that 
class  exploitation  and  domination  within  the  eco¬ 
nomic  sphere  would  generate  shared  interests 
within  the  working  class,  which  would  lead  to 
class  solidarity,  consciousness,  and  organizations. 
Market  competition  and  endemic  economic  crises 
under  capitalism  continuously  undermine  the  posi¬ 
tion  of  the  petty  bourgeoisie  and  force  them  into 
the  ranks  of  the  proletariat,  the  industrial  working 


class.  According  to  Marx,  these  transformations 
would  expand  the  proportion  of  the  working  class 
in  advanced  industrial  societies  and  convert  them 
into  a  giant  force  capable  of  disrupting  the  social 
structure.  Marx  expected  that  class  conflict  would 
inevitably  find  expression  in  the  political  sphere.  In 
times  of  economic  crisis,  when  the  capitalist  sys¬ 
tem  is  most  vulnerable,  industrial  workers  armed 
with  revolutionary  ideology  would  target  the  state 
whose  policies  generally  support  the  interests  of 
their  class  enemy,  the  capitalist  class.  Given  the 
ever-expanding  size  of  the  working  class  and  recur¬ 
rent  capitalist  crises,  the  working  class  would  ulti¬ 
mately  be  able  to  overthrow  capitalism  and  establish 
a  socialist  system. 

Although  Marx’s  predictions  on  revolutions  in 
advanced  capitalist  countries  never  materialized, 
his  class  analysis  inspired  a  number  of  scholars  to 
undertake  studies  of  revolutions  and  large-scale 
social  conflicts  in  a  wide  variety  of  countries.  In  a 
seminal  work,  Barrington  Moore,  Jr.  demonstrated 
the  ways  in  which  alternative  coalitions  of  urban 
and  agrarian  classes  produced  a  constellation  of 
20th-century  revolutions  that  determined  the 
nature  of  the  modern  political  systems  of  capitalist 
democracy,  fascism,  and  communism.  Moore  also 
employed  class  analysis  to  explain  conditions  that 
produced  peasant  uprisings  and  revolutions.  Other 
scholars,  such  as  Jeffery  Paige  and  Eric  Wolf,  also 
used  class  analysis  in  different  ways  to  explain 
20th-century  peasant  revolts  and  agrarian  revolu¬ 
tions  in  developing  countries. 

State-Centered  and  Structural 
Theories  of  Revolution 

State-centered  theories  of  revolution  shift  the 
focus  of  analysis  away  from  class  conflict  and 
toward  the  power  structure,  notably  the  state, 
which  is  regarded  as  relatively  autonomous.  State- 
centered  theorists  locate  the  state  within  the  inter¬ 
national  political  system  and  examine  the  political 
and  military  pressures  that  may  render  it  vulnera¬ 
ble.  According  to  Skocpol,  a  leading  proponent  of 
this  model,  military  pressures  and  defeat  in  wars 
may  undermine  the  state’s  coercive  apparatus  and 
lead  to  revolution.  In  agrarian  bureaucracies, 
external  pressures  combined  with  state  efforts  to 
promote  industrial  development  may  generate 
schisms  between  the  state  and  the  dominant  class, 
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with  each  sector  competing  to  extract  greater 
resources  from  the  peasantry.  In  such  conflicting 
situations,  dominant  classes  may  be  able  to  block 
state  policies  and  frustrate  government  attempts  at 
economic  transformation.  Such  conflicts  between 
the  state  and  the  dominant  class  may  result  in  state 
breakdown,  paving  the  way  for  revolutionary 
upheavals.  Peasants  may  be  able  to  revolt  and 
bring  about  a  revolution  if  they  have  communal 
solidarity  and  strength. 

In  another  view  of  state-centered  revolution  in 
developing  countries,  Misagh  Parsa  points  out  that 
state  autonomy  does  not  prevent  states  in  the 
developing  world  from  serving  the  interests  of  the 
privileged.  Interested  in  promoting  development, 
state  policies  often  serve  the  economic  interests  of 
large  capital,  often  adversely  affecting  the  interests 
of  the  rest  of  the  population  and  providing  favor¬ 
able  conditions  for  coalition  formation. 

In  particular,  highly  centralized,  exclusive,  and 
hyperactive  states  may  be  vulnerable  to  attack  in 
times  of  crisis.  Such  states  contract  the  scope  of  the 
polity,  use  repression  to  block  access  to  the  state, 
and  thus  undermine  support  for  the  regime, 
increasing  the  likelihood  of  polarization  and  revo¬ 
lution.  In  Parsa’s  analysis,  the  relationship  between 
the  state  and  the  economy  has  significant  implica¬ 
tions  for  the  nature  of  social  conflicts.  The  level  of 
state  intervention  in  capital  accumulation  can 
affect  the  nature  and  outcome  of  social  conflicts. 
In  regulative  states  such  as  market  economies,  the 
low  level  of  state  intervention  in  capital  accumula¬ 
tion  reduces  the  probability  that  the  state  will 
become  the  direct  target  of  collective  action  and 
thus  diminishes  the  likelihood  of  revolutionary 
conflicts.  Capital  allocation  and  accumulation  are 
determined  by  an  abstract,  decentralized,  depoliti- 
cized,  and  “self-regulating”  market  system,  which 
tends  to  defuse  and  privatize  conflicts,  confining 
them  to  the  civil  society.  Because  markets  are 
abstract,  decentralized,  and  depoliticized,  they 
cannot  be  attacked  or  overthrown.  As  a  result,  the 
regulative  state  is  unlikely  to  attract  direct  attacks 
or  challenges  because  class  conflict  remains  con¬ 
fined  within  the  economic  sphere  and  the  civil 
society.  Should  such  conflicts  escalate,  aggrieved 
groups  may  clamor  for  the  regulative  state  to  inter¬ 
vene  on  their  behalf  rather  than  attacking  it, 
increasing  the  likelihood  of  a  reformist,  nonrevolu¬ 
tionary  outcome.  Where  state  intervention  is  low, 


the  regulative  state  may  be  perceived  as  an  autono¬ 
mous  entity  that  serves  general,  societal  interests. 
In  such  cases,  the  regulative  state  may  actually 
become  an  integrative  rather  than  a  divisive  force. 
Alternatively,  low  state  intervention  in  capital 
accumulation  may  increase  the  likelihood  and 
intensity  of  class  conflict,  which  can  remove  the 
regulative  state  as  the  principal  target  of  attack 
and  thereby  reduce  the  likelihood  of  revolution. 

Hyperactive  states,  in  contrast,  intervene  greatly 
in  the  economy  and,  as  a  consequence,  can  become 
vulnerable  to  challenge.  Hyperactive  states  tend  to 
be  major  economic  actors,  control  abundant  eco¬ 
nomic  resources,  and  intervene  extensively  in  capi¬ 
tal  allocation  and  accumulation.  High  levels  of 
intervention  entail  significant  political  conse¬ 
quences  for  these  states.  As  they  become  direct 
producers  and  financiers,  the  abstract,  decentral¬ 
ized,  and  depoliticized  market  mechanism  is 
replaced  by  a  visible,  concrete  social  entity,  which 
can  be  attacked  during  conflict  or  crisis.  High  state 
intervention  expands  the  extent  of  political  con¬ 
flicts  in  times  of  economic  crisis,  as  economic  and 
political  conflicts  converge  in  the  political  arena, 
when  the  hyperactive  state  rather  than  market 
forces  are  held  accountable  for  failure  and  mis¬ 
management.  Finally,  hyperactive  states  that 
employ  sizable  segments  of  the  workforce  tend  to 
become  the  target  of  workers’  economic  conflicts. 
Workers’  attacks  against  the  state  may  reduce  the 
intensity  of  class  conflict  and,  in  turn,  increase  the 
likelihood  of  coalition  formation.  Broad  coalitions 
and  disruption  of  the  social  order  are  very  impor¬ 
tant  in  Parsa’s  theory  of  revolution.  In  the  absence 
of  broad  coalitions,  revolutionary  challengers 
might  be  repressed  and  rendered  ineffective. 

Jeff  Goodwin  articulates  another  version  of 
state-centered  revolutionary  theory  in  contempo¬ 
rary  developing  countries.  Although  class  interests 
and  economic  grievances  are  important,  Goodwin 
argues  that  the  roots  of  revolutionary  movements 
are  to  be  found  in  the  political  context  in  which 
class  relationships  and  economic  institutions  are 
embedded.  Revolutionary  movements  in  the 
periphery  have  been  inadvertently  facilitated  and 
encouraged  by  the  formation  of  violent,  exclusion¬ 
ary,  authoritarian  governments  that  are  also  orga¬ 
nizationally  incoherent.  Despotic  regimes  provide 
a  visible  focus  of  opposition  for  groups  and  classes 
that  may  have  different  grievances  but  come 
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together  in  a  broad  coalition.  Exclusionary,  repres¬ 
sive  states  render  political  moderates  and  reform¬ 
ists  relatively  impotent  and  inconsequential.  Thus, 
revolutions  often  take  place  in  countries  that  reject 
peaceful  attempts  to  redress  social  and  political 
grievances  through  elections  and  organizing  and 
instead  resort  to  blatant  fraud  and  violent  repres¬ 
sion.  Revolutionary  movements  may  attract  broad 
popular  support  where  states  sponsor  or  promote 
unpopular  economic  or  social  arrangements, 
repress  or  exclude  mobilized  social  groups,  and 
alienate  the  elite  through  corruption  and  arbitrary 
personalistic  rule.  Even  moderate  and  predomi¬ 
nantly  middle-class  political  parties  and  voluntary 
associations  can  join  in  broad  alliances  that  effec¬ 
tively  support  radical  movements  waging  armed 
revolutionary  struggles. 

Tilly  argues  that  a  natural  history  of  revolutions 
that  could  specify  necessary  and  sufficient  condi¬ 
tions  for  revolutions  is  not  possible  because  as 
regimes  vary  and  change  historically,  they  also 
alter  conditions  for  the  violent  seizure  of  state 
power.  Nevertheless,  Tilly  does  provide  a  state- 
centered  explanation  of  revolution,  analyzing  a 
state’s  vulnerability  to  revolution  as  determined  by 
its  capacity  to  control  resources  and  the  activities 
of  its  population  and  the  extent  to  which  the  pub¬ 
lic  enjoys  equal  rights  and  is  able  to  influence  gov¬ 
ernmental  affairs  and  policies.  He  reasons  that 
high-capacity  democratic  regimes  are  not  vulnera¬ 
ble  to  revolutions  because  they  are  popular,  and 
high-capacity  nondemocratic  states  are  unlikely 
targets  for  revolution  because  their  rulers  are  able 
to  impose  strict  controls  on  the  people.  Low- 
capacity  nondemocratic  regimes  likewise  do  not 
produce  revolutionary  outcomes  because  once  the 
revolutionaries  seize  power,  the  state  apparatus 
tends  to  fragment  and  create  difficulties  for  form¬ 
ing  a  new  state.  Thus,  in  Tilly’s  model,  medium- 
capacity  nondemocratic  regimes  are  the  most 
likely  to  foster  revolutionary  outcomes. 

Ideology  and  Revolutions 

Some  theorists  attribute  independent  power  and 
dynamics  to  ideology  in  the  analysis  of  revolutions. 
Maintaining  that  ideology  has  a  central,  even  inde¬ 
pendent  role  in  any  existing  social  structure  and  in 
its  transformations,  they  argue  that  class,  eco¬ 
nomic,  and  political  conflicts  by  themselves  could 


not  lead  to  revolutions  and  that  only  revolutionary 
ideologies  can  convert  social  discontent  into  revolu¬ 
tionary  crisis.  Using  the  example  of  the  Iranian 
revolution,  some  theorists  emphasize  the  role  of  the 
Shi’a  culture  of  martyrdom  that  inspired  devout 
Iranians  to  oppose  the  Shah  in  the  face  of  repres¬ 
sion  and  death.  Proponents  of  this  perspective 
maintain  that  without  a  revolutionary  ideology  and 
culture  of  resistance,  revolutions  are  highly  unlikely. 

Theorists  who  stress  the  role  of  ideology  in 
revolutions  are  correct  in  one  important  sense: 
Ideologically  driven  challengers  have  been  in  the 
forefront  of  revolutionary  struggles  in  developing 
countries.  In  the  20th  century,  intellectuals  and 
students  adopted  revolutionary  ideologies  and 
struggled  to  transform  their  societies.  Ideological 
shifts  among  students  and  intellectuals  not  only 
fostered  solidarity  and  cohesion  among  these  van¬ 
guards  of  revolutionary  struggles  but  also  provided 
an  analysis  of  the  causes  of  the  conflicts  and  pro¬ 
vided  a  formula  out  of  those  conditions.  Although 
these  revolutionaries  fought  for  their  ideology,  they 
were  also  aware  of  the  need  to  form  broad  coali¬ 
tions  to  bring  down  the  ruling  powers.  To  mini¬ 
mize  ideological  conflicts,  revolutionary  challeng¬ 
ers  often  adopted  alternative  public  approaches  to 
ensure  coalition  formation.  To  attract  those  who 
did  not  share  their  ideologies,  revolutionary  lead¬ 
ers  were  sometimes  circumspect  about  their  ideolo¬ 
gies.  In  Iran,  for  example,  Ayatollah  Khomeini 
never  told  the  Iranian  people  about  his  theocratic 
goals,  but  instead,  he  borrowed  significant  aspects 
of  the  nationalist  ideology  of  the  1950s  and 
attacked  the  Shah’s  regime  for  failing  those  aspira¬ 
tions.  In  the  case  of  Nicaragua,  the  Sandinistas 
compromised  and  modified  their  program  to 
ensure  a  broad  coalition  that  was  able  to  over¬ 
throw  the  government  in  1979. 

Revolutionary  Processes  and  Outcome 

Modern  revolutionary  conflicts  often  begin  with 
the  rise  of  a  revolutionary  challenger  or  coalition 
of  challengers  that  attempt  to  gain  exclusive  con¬ 
trol  over  state  power.  These  challengers  are  often 
ideologically  driven  intellectuals  and  students 
inspired  in  part  by  the  success  of  revolutionaries 
elsewhere  and  antagonized  by  domestic  economic 
structures  and  political  systems.  Revolutionary 
challengers  may  emerge  in  many  societies,  but 
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most  fail  due  to  government  repression  and/or  lack 
of  popular  backing.  Those  revolutionary  challeng¬ 
ers  that  do  survive  are  successful  in  gaining  the 
support  of  dissident  intellectuals  and  students  who 
contribute  resources  and  new  recruits.  Revolut¬ 
ionary  challengers  become  serious  contenders  for 
power  when  they  acquire  adherents  among  major 
classes  or  collectivities  such  as  workers,  peasants, 
or  segments  of  the  middle  class. 

Revolutionary  situations  emerge  when  popular 
backing  for  revolutionary  challengers  becomes 
substantial  and  rulers  are  incapable  or  unwilling  to 
suppress  the  challengers.  It  is  important  to  note 
that  revolutionary  situations  often  arise  as  a  result 
of  certain  societal  characteristics  and  major  eco¬ 
nomic  and  political  crises,  not  because  of  the 
actions  of  revolutionary  challengers,  who  often 
constitute  a  very  small  segment  of  the  population. 
Specific  characteristics  render  certain  societies 
more  vulnerable  to  revolutionary  situations. 
Countries  marked  by  high  levels  of  inequality  in 
the  distribution  of  wealth  and  income  are  vulner¬ 
able  to  economic  and  political  crises.  Further, 
countries  that  are  highly  dependent  economically 
on  other  countries  tend  to  be  more  susceptible  to 
economic  crises.  Vulnerability  is  especially  acute 
for  developing  countries  that  rely  heavily  on 
exports  of  one  or  a  few  raw  materials  and  primary 
commodities.  Fluctuations  in  the  world  market 
and  falling  prices  for  their  goods,  combined  with 
an  economic  downturn,  can  be  devastating.  The 
impact  may  threaten  broad  segments  of  the  popu¬ 
lation,  contributing  to  a  decline  in  support  for  the 
government  and  a  rise  in  popular  backing  for  revo¬ 
lutionaries.  In  revolutionary  situations,  states  lose 
popular  support,  and  rulers  may  become  isolated 
and  be  abandoned  even  by  the  dominant  classes 
who  are  not  generally  given  to  supporting  revolu¬ 
tion  or  radical  social  change. 

Revolutionary  outcomes  result  when  members 
of  the  polity  defect  from  the  regime  or  when  chal¬ 
lengers  successfully  neutralize  or  defeat  the  armed 
forces  and  seize  control  of  the  state  apparatus. 
Where  the  core  of  a  regime  refuses  to  relinquish 
power  in  a  revolutionary  situation,  challengers 
may  have  to  resort  to  large-scale  disruptive  tactics 
and  armed  struggle  to  overthrow  the  government. 
Certain  states  are  especially  vulnerable  to  revolu¬ 
tionary  outcomes.  States  that  are  highly  exclusive, 
repressive,  and  corrupt  often  have  a  narrow  base 


of  support  and  may  be  quickly  abandoned  by 
members  of  the  polity,  facilitating  a  revolutionary 
outcome.  States  that  depend  on  external  support 
are  also  vulnerable  to  a  revolutionary  outcome. 
Such  dependent  states  may  lose  their  external  sup¬ 
port  for  various  reasons,  including  an  interest  in 
preventing  deteriorating  conditions  and  further 
radicalization  of  politics.  These  shifts  in  interna¬ 
tional  alignment  and  the  withdrawal  of  external 
support  may  expedite  the  overthrow  of  the  govern¬ 
ment,  producing  a  revolutionary  outcome. 

Complex  processes  affect  the  immediate  politi¬ 
cal  outcome  of  revolutions.  In  most  revolutionary 
situations,  alternative  contenders  such  as  radical 
and  moderate  challengers  compete  against  each 
other  for  state  power.  Generally,  in  economically 
polarized  situations  of  the  developing  world, 
where  income  inequalities  are  high  and  growing, 
liberal  and  moderate  political  organizations  do  not 
gain  a  great  deal  of  strength  and  popular  support. 
But  the  political  ideology  and  organizational 
strength  of  challengers  do  not  always  affect  their 
likelihood  of  seizing  power.  In  fact,  the  immediate 
political  outcome  of  revolutions  is  not  easily  pre¬ 
dictable  at  the  outset  of  revolutionary  struggles. 
The  mobilization  options  that  are  available  are 
central  in  determining  which  challengers  will  be 
victorious.  For  example,  in  Nicaragua,  state  poli¬ 
cies  under  the  ruling  Somoza  family  marginalized 
the  moderate  opposition  for  decades.  When  the 
government  refused  to  hold  free  elections  in  1978, 
any  chance  for  peaceful  change  was  blocked,  and 
the  opposition  became  radicalized,  thereby  nar¬ 
rowing  the  mobilization  options.  This  improved 
the  position  of  the  Sandinista  National  Liberation 
Front,  a  small,  militant  guerrilla  organization  who 
had  already  launched  an  armed  struggle  to  over¬ 
throw  the  regime  and  was  eventually  able  to  seize 
power.  In  Iran,  the  majority  of  the  opposition 
fought  for  independence,  freedom,  and  social  and 
economic  justice,  but  the  Shah’s  policies  had 
severely  repressed  the  secular  political  organiza¬ 
tions  and  closed  all  mobilization  options.  As  a 
result,  mosques  remained  the  only  public  spaces 
that  enjoyed  some  measure  of  autonomy  from  the 
state  and  provided  relatively  safe  places  for  the 
people  to  gather  and  mobilize.  Mobilization 
through  the  mosque  enabled  Khomeini  and  a  small 
segment  of  his  clerical  supporters  to  lead  the 
struggles  in  the  final  stages  of  the  conflicts  and  seize 
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power.  In  the  Philippines,  radical  leftist  revolution¬ 
aries,  the  most  powerful  political  organization  in 
the  1980s,  failed  to  seize  power  in  part  because 
their  radical  ideology  threatened  the  privileged 
classes  and  prevented  the  formation  of  a  broad 
coalition.  When  the  radicals  boycotted  the  fateful 
election  in  1986,  moderates  were  able  to  take  the 
lead  in  the  struggles  against  Marcos,  and  in  the 
end,  the  reformist  elite  was  able  to  take  power. 
Thus,  it  is  reasonable  to  say  that  the  revolutionary 
process  itself  exerts  a  great  deal  of  influence  on  the 
eventual  outcome  of  the  revolutionary  struggle. 

Once  the  state  is  overthrown,  revolutionary 
coalitions  generally  break  down.  Although  revolu¬ 
tionary  challengers  need  broad  coalitions  to  seize 
state  power,  these  coalitions  become  superfluous 
once  the  revolutionaries  control  the  instruments  of 
coercion.  The  new  revolutionary  regime  is  often 
more  centralized  and  sometimes  more  repressive 
than  the  old  one  and  may  not  fulfill  all  its  prerevo¬ 
lutionary  promises.  The  long-term  structural  out¬ 
comes  of  revolutions  are  often  determined  by 
complex  factors  and  not  easily  predictable.  The 
interest  and  ideology  of  the  revolutionary  regime 
only  partly  determine  the  structural  changes  pur¬ 
sued  by  the  new  government.  Other  internal 
forces,  from  both  within  and  outside  the  polity, 
can  mobilize  to  realize  their  interests,  thus  affect¬ 
ing  the  structural  outcome  of  revolutions.  The 
overthrow  of  the  state’s  repressive  apparatus  often 
leads  to  reduced  repression  and  provides  an  oppor¬ 
tunity  for  various  classes  and  collectivities  to 
mobilize  and  demand  change.  Additionally,  inter¬ 
national  political  and  ideological  conditions  may 
also  influence  the  internal  development  of  post¬ 
revolutionary  societies,  particularly  where  the 
country’s  economy  depends  on  external  sources 
and  the  state  requires  outside  support. 

Future  analyses  of  social  revolutions  would  ben¬ 
efit  from  perspectives  that  synthesize  and  analyze 
large-scale  social  transformations,  class  analysis, 
states,  ideologies,  and  role  of  global  forces.  It  is 
important  to  understand  the  large-scale  social 
changes  that  often  precede  social  revolutions.  They 
can  affect  major  interests  and  the  capabilities  of 
various  actors  and  set  the  stage  for  social  conflicts. 
Changes  in  the  class  structure  are  also  critical  in 
understanding  social  revolutions.  While  class  con¬ 
flict  may  prevent  and  delay  revolutions,  class  coali¬ 
tions  seem  to  have  played  a  very  important  role  in 


revolutions  in  developing  countries  by  disrupting 
the  social  structure  and  facilitating  the  overthrow 
of  the  government.  State-centered  theories  of  revo¬ 
lution  can  contribute  much  by  specifying  state 
vulnerabilities  and  conditions  that  may  result  in 
the  isolation  of  the  state  and  its  abandonment  by 
broad  segments  of  the  population.  Finally,  analy¬ 
ses  of  ideology  can  contribute  immensely  to  expla¬ 
nations  of  social  revolutions  by  analyzing  the 
causes  of  the  rise  of  revolutionary  challengers  and 
explaining  why  large  segments  of  the  population 
shift  their  support  to  such  challengers. 
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Rights 


Modern  political  theory  locates  the  notion  of 
“rights”  at  the  center  of  its  debates.  The  liberal 
state  and  constitutional  theory  have  emphasized 
rights  as  fundamental  building  blocks  of  the  social 
order.  In  political  thought,  after  a  long  lull,  rights 
are  once  again  a  renewed  focus  of  interest.  Political 
theorists  now  commonly  opt  to  articulate  social 
demands  in  terms  of  rights  rather  than  (for 
instance)  as  a  general  utility  or  integrity  of  the 
body  politic.  In  this  vein,  several  theorists — above 
all,  those  with  liberal  roots — have  adopted  the 
Lockean  vision  of  the  just  political  order  whose 
primary  obligation  is  to  respect  the  moral  rights  of 
its  citizens. 

However,  there  is  a  broad  spectrum  of  defini¬ 
tion  for  the  concept  of  rights.  In  its  older,  objec¬ 
tive  usage,  a  right  means  “what  is  just”  or  “what 
is  fair.”  Aristotle,  for  instance,  used  dikaion  to 
indicate  that  a  society  is  rightly  ordered.  However, 
this  “objective”  sense  of  right  is  not  equivalent  to 
our  modern,  subjective  notion  of  rights,  which 
originates,  according  to  historians,  in  the  thought 
of  John  Locke  in  the  18th  century — or  perhaps 
even  further  back,  in  late-medieval  European 
thinking.  In  this  subjective  sense,  rights  are  some¬ 
times  defined  as  “normative  attributes”  that  per¬ 
tain  to  persons.  Other  approaches  see  rights  as 
entitlements  to  choose,  entitlements  (not)  to  per¬ 
form  certain  actions,  or  entitlements  to  expect 
that  others  will  (not)  perform  certain  actions. 

This  entry  defines  a  right  as  a  legal  or  moral 
recognition  of  choices  or  interests  to  which  a 
particular  weight  is  attached.  It  proceeds  in 
three  steps.  The  first  section  elucidates  the  dif¬ 
ferent  types  of  rights  and  the  second  section  the 
forms  and  functions  of  rights.  The  final  section 
delivers  a  brief  overview  of  the  major  theories 
of  rights. 

Types  of  Rights 

Any  study  of  rights  should  begin  with  an  elucida¬ 
tion  of  the  multiple  categories  of  rights.  In  the 
realm  of  politics,  one  can  distinguish  between  legal 
and  moral  rights,  between  rights  as  such  and 
“human  rights,”  and  between  civil,  political,  social, 
and  cultural  rights. 


Legal  and  Moral  Rights 

Legal  rights  describe  a  type  of  institutional 
arrangement  in  which  interests  are  guaranteed 
legal  protection,  choices  are  guaranteed  legal  effect 
or  goods,  and  opportunities  are  provided  to  indi¬ 
viduals  on  a  guaranteed  basis.  Assertions  that  X 
has  a  legal  right  to  Y  are  tested  according  to 
whether  the  law  does  in  fact  recognize  and  imple¬ 
ment  X’s  right  to  Y. 

Moral  rights  express  the  justified  demand  that 
such  institutional  arrangements  should  be  imple¬ 
mented,  maintained  in  the  name  of  a  fundamental 
principle  that  accords  importance  to  certain  basic 
individual  values  such  as  autonomy  or  moral 
agency.  The  assertion  that  X  has  a  moral  right  in 
the  absence  of  any  legal  recognition  of  this  right 
may  take  the  form  of  a  demand  for  the  law  to  be 
changed. 

Natural  rights  can  be  considered  as  a  subclass 
of  moral  rights.  These  are  fundamental  rights, 
derived  from  nature  or  divine  authority,  and 
enjoyed  by  all  men  and  women  regardless  of  their 
beliefs  and  position  in  society.  They  pertain  to 
human  beings  in  the  state  of  nature,  prior  to  the 
institution  of  society.  Natural  rights  are  thus  con¬ 
sidered  to  precede  civic  rights  held  in  the  political 
order.  They  are  therefore  universal,  inalienable, 
and  indefeasible:  They  cannot  be  contested  by 
political  authority,  and  indeed,  it  is  the  responsibil¬ 
ity  of  those  who  govern  to  protect  them. 
Consequently,  the  concept  was  used  to  justify  the 
revolutions  of  the  18th  century  on  the  grounds 
that  the  existing  law  infringed  on  individuals’ 
natural  rights.  In  the  United  States,  for  instance, 
the  Declaration  of  Independence  (1776)  against 
the  British  colonial  state  was  based  on  an  appeal  to 
the  natural  rights  of  all  Americans.  The  Declaration 
des  droits  de  I’homme  et  du  citoyen  (Declaration 
of  the  Rights  of  Man  and  of  the  Citizen,  1789)  also 
refers  to  certain  “natural  rights”  of  mankind. 

Rights  and  Human  Rights 

This  distinction  between  moral  or  natural  rights 
and  legal  rights  leads  to  another  difference:  the  dif¬ 
ference  between  rights  and  human  rights.  One 
agrees  with  Michael  Freeden  (1991)  that  human 
rights  are  the  most  basic  prerogatives  pertaining  to 
what  is  essentially  human,  while  other  categories 
of  rights  are  more  specific,  limited,  and  normally 
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derivative.  Freeden  defines  human  rights  as  a 
“conceptual  device,  expressed  in  linguistic  form, 
that  assigns  priority  to  certain  human  or  social 
attributes  regarded  as  essential  to  the  adequate 
functioning  of  all  human  beings”  (p.  7). 

The  promulgation  of  the  French  Declaration  in 
1789  gave  an  impetus  to  the  debate  about  the 
rights  of  man.  In  his  Reflections  on  the  Revolution 
in  France  (1790),  Edmund  Burke  attacked  the 
abstract,  metaphysical,  and  simplistic  character  of 
human  rights,  which  he  contrasted  with  the  “real 
rights” — related  to  their  particular  history  and 
circumstances — of  “Englishmen.”  In  reaction, 
Thomas  Paine  published  Rights  of  Man  (1791)  in 
which  he  argued  that  having  been  created  rational 
and  equal,  man  had  natural  rights  including  those 
to  liberty,  property,  and  security,  as  well  as  the 
right  of  resistance  to  oppression.  Two  other  major 
critics  of  human  rights  were  Jeremy  Bentham  and 
Karl  Marx.  For  the  former,  the  term  right  should 
be  confined  to  its  legal  sense,  since  the  demand  for 
a  right  is  no  more  a  right  in  itself  than  a  hungry 
man’s  plea  for  bread.  Marx  claimed  in  On  the 
Jewish  Question  (1843)  that  there  was  an  inherent 
contradiction  in  the  French  Declaration.  Whereas 
in  the  “political  community”  man  regards  himself 
as  a  “communal  being,”  in  “civil  society”  he  acts 
as  a  private  individual  who  regards  other  men  as  a 
means.  Marx  therefore  concluded  that  “not  one  of 
the  so-called  rights  of  man  goes  beyond  egoistic 
man,  man  as  a  member  of  civil  society,  namely,  an 
individual  withdrawn  into  himself,  his  private 
interest,  and  his  private  desire  and  separated  from 
the  community.” 

Civil,  Political,  Social,  and  Cultural  Rights 

In  a  seminal  essay  published  in  1950,  Thomas 
Marshall  divided  the  development  of  citizenship  in 
England  into  three  parts:  civil,  political,  and  social. 
The  civil  element  is  composed  of  the  rights  neces¬ 
sary  for  individual  freedom/liberty  of  the  person: 
freedom  of  speech,  thought,  and  faith;  the  right  to 
own  property  and  to  conclude  valid  contracts;  and 
the  right  to  justice.  These  so-called  first-generation 
rights  were  most  prominently  proclaimed  during 
the  revolutions  of  the  1 8th  century,  particularly  in 
the  French  Declaration  of  1789  and  the  1791  Bill 
of  Rights  in  the  United  States.  The  institutions  most 
directly  associated  with  civil  rights  are  courts  of 


justice.  In  theory,  civil  rights  exist  independently  of 
state  action,  protecting  the  freedom  of  the  individ¬ 
ual  against  potential  encroachments  by  state  power. 
They  are  often  called  “liberty  rights”  since  they  are 
based  on  a  “negative”  conception  of  freedom. 

The  political  element  means  the  right  to  partici¬ 
pate  in  the  exercise  of  political  power,  as  a  member 
of  a  body  invested  with  political  authority  or  as  an 
elector  of  the  members  of  such  a  body.  These 
so-called  second-generation  rights  developed  prin¬ 
cipally  over  the  course  of  the  19th  century,  and 
their  corresponding  institutions  are  parliaments 
and  local  government  councils. 

The  social  element  ranges  from  the  right  to  a 
modicum  of  social  welfare  and  security,  to  the 
right  to  share  fully  in  the  social  heritage,  and  to 
live  the  life  of  a  civilized  human  being  according  to 
the  standard  prevailing  in  society.  These  rights, 
which  represent  a  continuation  of  several  claims 
made  by  19th-century  socialist  movements,  came 
to  fruition  in  the  20th  century  in  the  framework  of 
the  welfare  state.  The  institutions  most  closely 
associated  with  social  rights  are  the  educational 
system  and  social  services.  These  rights  are  also 
known  as  “claim  rights”  on  the  grounds  that  they 
represent  “positive”  freedoms — that  is,  concrete 
rights  that  individuals  and  groups  can  expect  to 
enjoy  a  decent  standard  of  living.  They  rely  on 
state  intervention  to  guarantee  societal  solidarity 
and  equality.  Codified  in  the  Universal  Declaration 
of  Human  Rights  (1948),  social  rights  have  also 
been  formalized  in  the  International  Covenant  on 
Economic,  Social,  and  Cultural  Rights  (United 
Nations,  1966). 

The  International  Covenant  of  1966  also  recog¬ 
nizes  “cultural  rights,”  which  can  be  defined  as  the 
body  of  rights  guaranteeing  respect,  protection,  and 
promotion  of  cultural  references  and  expression — 
that  is,  the  ensemble  of  values,  convictions,  beliefs, 
language,  and  knowledge — through  which  indi¬ 
viduals  identify  themselves,  communicate  with 
each  other,  forge  a  shared  sense  of  belonging,  and 
consider  themselves  to  be  recognized  with  dignity. 
Sometimes  known  as  “fourth-generation,”  cul¬ 
tural  rights  were  recognized  in  the  second  half  of 
the  20th  century,  particularly  in  North  America. 
“Multiculturalist”  authors  interpret  cultural  rights 
as  being  clearly  individual  and  not  collective  since 
their  ultimate  purpose  is  the  protection  of  the  indi¬ 
vidual.  Indeed,  cultural  rights  such  as  the  freedom 
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to  speak  one’s  native  language  are  intimately 
linked  to  other  individual  rights  such  as  dignity  or 
freedom  of  expression. 

Forms  and  Functions  of  Rights 

An  analysis  of  rights  can  be  divided  into  two  parts: 
a  description  of  the  internal  structure  of  rights  and 
a  description  of  what  rights  do  for  those  who  hold 
them. 

Forms  of  Rights 

The  most  famous  legal  analysis  of  the  form  of  a 
right  is  that  of  Wesley  Hohfeld  (1879-1918). 
Hohfeld  argued  that  the  proposition  “X  has  a 
right  to  do  R”  could  mean  four  things: 

1.  First,  it  could  mean  that  X  has  a  privilege  to  do 
R.  In  this  case,  Y  (or  anyone  else)  has  “no 
rights”  toward  X,  and  X  has  no  duty  to 
perform  that  action.  Singing  in  your  bath  or 
painting  your  bedroom  blue  would  be  examples. 

2.  X  has  a  claim  (or  a  right)  against  Y. 
Consequently,  Y  has  a  duty  toward  X  to  do  R. 
For  instance,  an  employer  has  the  duty  to  pay 
wages  to  employees  or  a  parent  has  a  duty  to 
provide  food  to  his  or  her  child. 

3.  X  has  a  power  to  do  R,  which  alters  the  legal 
rights  and  duty  of  Y.  For  instance,  a  police 
officer  has  the  power  to  ask  to  see  the  license  of 
a  speeding  driver. 

4.  X  has  an  immunity  with  regard  to  R.  In  this 
case,  Y  (or  anyone  else)  lacks  the  power  to 
bring  about  a  certain  consequence  for  X.  For 
instance,  witnesses  in  court  have  a  right  not  to 
be  obliged  to  incriminate  themselves. 

However,  Hohfeld’s  categorization  makes  no 
attempt  to  clarify  what  the  function  of  a  right  is. 
There  are  two  broad  accounts  of  the  special  rela¬ 
tion  between  duties  and  right  bearers  that  have 
been  proposed:  the  choice  theory  of  right  and  the 
interest  (or  benefit)  theory  of  rights. 

Functions  of  Rights 

The  “choice  theory”  (or  will  theory)  singles  out 
the  rights  bearer.  It  holds  that  when  I  have  a  right 


to  do  something,  what  is  effectively  protected  is 
my  choice  of  whether  or  not  to  do  it.  It  accentuates 
my  freedom  and  my  self-fulfillment.  In  a  famous 
article,  Herbert  L.  Hart  argued  that  if  there  was 
one  natural  right,  it  was  the  equal  right  of  all  men 
to  be  free.  In  addition  to  this  right  of  freedom, 
Alan  Gerwith  establishes  a  right  to  well-being. 
John  Rawls,  too,  assumes  in  his  theory  of  justice 
that  rights  pertain  to  primary  goods  that  rational 
and  moral  individuals  would  regard  as  necessary 
to  the  functioning  of  a  just  constitutional  frame¬ 
work.  In  all  cases,  the  end  is  to  ensure  the  condi¬ 
tions  without  which  autonomy  is  impossible  and 
without  which  human  agency  cannot  be  realized. 

The  “benefit  theory”  maintains  that  the  func¬ 
tion  of  a  right  is  to  further  the  interests  of  the 
individual  who  enjoys  it.  An  owner  has  a  right  not 
because  owners  have  choices  but  because  owner¬ 
ship  makes  the  owner  better  off.  To  have  a  right  is, 
then,  to  be  the  intended  beneficiary  of  someone 
else’s  duty.  In  other  words,  rights  and  duties  are 
correlative.  A  less  strict  version  (the  “interest  the¬ 
ory”)  maintains  that  individuals  have  rights  when¬ 
ever  an  interest  that  they  hold  is  regarded  as 
sufficiently  important  in  itself  to  justify  obliging 
others  to  promote  that  interest  in  some  way.  In  this 
interpretation,  rights  thus  generate  duties. 

Theories  of  Rights 

Richard  Tuck  has  correctly  pointed  out  that  the 
meaning  of  a  term  such  as  right  is  “theory  depen¬ 
dent.”  In  other  words,  the  elucidation  of  a  com¬ 
plex  notion  such  as  rights  requires  an  account  of 
the  possible  theories  about  politics  that  employ  the 
concept. 

The  Natural  Rights  Paradigm 

According  to  Freeden,  the  theory  of  natural 
rights  includes  four  elements.  First,  it  argues  simply 
that  human  beings  are  born  with  such  rights,  that 
they  are  part  of  our  initial  equipment  in  the  same 
ways  as  our  bodies  are.  A  common  core  of  human 
nature  is  thus  defined  by  encircling  it  with  a  suc¬ 
cinct  list  of  natural  rights.  They  are  innate,  inalien¬ 
able,  and  indefeasible.  Second,  natural  rights  are 
presocial.  They  are  not  the  product  of  any  social 
artifice;  rather,  political  societies  are  created  for  the 
very  purpose  of  ensuring  the  recognition  and 
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implementation  of  natural  rights.  Third,  natural 
rights  are  absolute;  they  prevail  over  any  other 
consideration  that  might  deny  their  validity,  appli¬ 
cability,  or  range.  Fourth,  such  rights  are  universal; 
all  human  beings,  irrespective  of  time  and  space, 
enjoy  them.  Ffistorically,  appeal  was  made  to  natu¬ 
ral  law  or  the  law  of  God.  The  primary  exposition 
of  natural  law  is  to  be  found  in  the  writings  of  the 
Dominican  St.  Thomas  Aquinas  (1225-1274)  who 
defined  natural  law  as  the  participation  of  divine 
law — discoverable  by  reason — in  God’s  rational 
creatures.  For  John  Locke  (1632-1704),  since  we 
are  all  creatures  of  God,  we  have  inalienable  rights. 
The  modern  age,  however,  is  more  skeptical  about 
revelation  as  a  basis  for  political  morality.  Recent 
theories  have  attempted  instead  to  identify  the 
deep  moral  values  and  principles  that  underlie  the 
idea  of  rights.  John  Finnis  (1940-  ),  for  example, 
has  revived  some  of  the  central  tenets  of  natural 
law:  If  the  overriding  rationale  of  natural  law  is  to 
establish  what  is  truly  good  for  human  individuals, 
it  follows  that  one  needs  some  conception  of 
human  good — of  individual  fulfillment  in  a  form 
(or  in  a  range  of  forms)  of  communal  life  that  fos¬ 
ters  rather  than  hinders  such  fulfillment.  This 
might  include,  among  other  things,  the  right  not  to 
be  tortured,  not  to  be  deprived  of  one’s  capacity  to 
procreate,  the  right  not  to  be  lied  to,  and  so  on. 
Finnis  insists  that  the  first  principles  of  natural  law 
are  not  deductively  inferred  from  any  speculative 
principles  or  metaphysical  propositions  about 
human  nature.  To  discover  what  is  naturally  right 
is  to  ask  not  what  is  in  accordance  with  human 
nature  but  what  is  in  accordance  with  reason. 

Legal  Positivism 

This  central  claim  of  natural  law  is  rejected  by 
legal  positivists  who  deny  that  the  legal  validity  of 
a  norm  depends  on  its  substantive  moral  quality. 
Bentham  (1748-1832)  attacked  natural  rights  in 
his  Anarchical  Fallacies  as  “rhetorical  nonsense — 
nonsense  upon  stilts,”  calling  rights  in  general  “fic¬ 
tions,”  before  distinguishing  between  the  “bad” 
fiction  of  natural  rights  and  the  “good”  fiction  of 
legal  rights.  He  analyzed  rights  as  a  rational 
human  contrivance  necessary  for  political  and 
social  life.  Rights  could  exist  only  within  this 
framework  and  were  not  anterior  to  law:  “From 
real  laws  come  real  rights,  but  from  imaginary 


laws,  from  laws  of  nature  .  .  .  come  imaginary 
rights”  ( Principles  of  Legislation ,  Chapter  XIII). 
Together  with  his  disciple  John  Austin  (1790- 
1859),  Bentham  laid  the  foundations  of  legal  posi¬ 
tivism,  which  rejects  the  conception  that  rights 
exist  independent  from  human  enactment.  The 
validity  of  any  right  can  always  be  traced  to  an 
objectively  identifiable  source — be  it  the  command 
of  a  sovereign  for  Bentham  and  Austin,  the  rule  of 
recognition  for  Hart,  or  a  basic  norm  that  validates 
the  constitution  for  Hans  Kelsen.  Consequently, 
legal  positivism  establishes  a  clear  distinction 
between  “ought”  (what  is  morally  desirable)  and 
“is”  (what  actually  exists).  It  does  not  follow,  how¬ 
ever,  that  they  subscribe  to  the  proposition  that  an 
unjust  law  must  be  obeyed.  Both  Austin  and 
Bentham,  for  instance,  acknowledged  that  disre¬ 
gard  for  unjust  laws  is  justified  if  it  promotes 
change  for  the  good.  According  to  Hart,  rights  are 
distinguished  from  other  moral  considerations  by 
the  fact  that  they  protect  and  promote  the  specifi¬ 
cally  human  interest  in  freedom. 

Rights  as  Trumps 

In  the  1970s,  the  foundations  of  legal  philoso¬ 
phy  were  shaken  by  the  vision  of  law  advanced  by 
American  jurist  Ronald  Dworkin  (1931-  ). 
Dworkin’s  attack  on  legal  positivism  and  utili¬ 
tarianism  is  founded  on  his  concern  that  the  law 
ought  to  “take  rights  seriously.”  Rights  are  best 
understood  as  “trumps”  over  some  background 
justification  for  political  decisions  that  state  a 
goal  for  the  community  as  a  whole.  For  instance, 
if  someone  has  a  right  to  publish  pornography, 
this  means  that  it  is  wrong  to  act  in  violation  of 
that  right  even  if  there  are  some  reasons  to  believe 
that  the  community  would  be  better  off  without 
pornography.  Dworkin’s  thesis  is  grounded  in  the 
conviction  that  no  government  may  impose  any 
constraint  on  a  person  that  this  person  could  not 
accept  without  abandoning  her  or  his  sense  of 
equal  worth.  Utilitarianism — which  identifies  the 
notion  of  average  or  collective  welfare  as  the  goal 
of  human  action — should  be  replaced  by  a  rights- 
based  theory  centering  on  individual  indepen¬ 
dence.  However,  this  liberal  conception  of  rights 
differs  from  a  libertarian  conception  such  as  that 
advocated  by  Robert  Nozick.  While  the  latter  is 
concerned  with  preserving  individual  choice 
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(limited  only  by  the  rights  of  others  to  the  same 
good),  the  former  insists  on  an  equal  treatment  of 
all  members  of  society.  Dworkin’s  conception  of 
political  morality  includes  three  elements:  (1)  jus¬ 
tice,  which  incorporates  both  individual  rights 
and  collective  goals  for  treating  citizens  with 
equal  concern  and  respect;  (2)  fairness,  which 
designates  the  procedures  that  give  all  citizens 
equal  influence  in  decisions  that  affect  them;  and 
(3)  procedural  due  process,  which  relates  to  the 
correct  procedures  for  determining  whether  a 
citizen  has  violated  the  law. 

This  metaphor  of  rights  as  “trumps”  raises  the 
question  of  whether  there  is  any  absolute  right. 
According  to  Gerwith,  a  right  is  absolute  when  it 
cannot  be  overridden  by  any  circumstances,  can 
thus  never  be  justifiably  infringed,  and  must  be 
fulfilled  without  any  exceptions.  Gerwith  asserts 
that  there  is  at  least  one  such  absolute  right:  All 
innocent  persons  have  an  absolute  right  not  to  be 
the  intended  victims  of  a  homicidal  project. 
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Risk  and  Public  Policy 


The  relationship  between  risk  and  public  policy 
goes  back  to  the  17th-century  Prussian  origins  of 
scientific  enquiry  into  policy  and  state  science,  as 
part  of  cameralist  Policeywissenscbaft  (German 
for  “public  policy”).  The  systematic  study  of  pol¬ 
icy  was  to  ensure  the  well-being  of  the  local  state 
and  its  subjects.  There  has  been  little  change  to 
that  understanding  since.  Risks  include  not  only 
threats  to  health  through  environmental,  eco¬ 
nomic,  or  social  processes  but  also  risks  to  actors 
arising  from  political  processes  themselves.  In 
terms  of  definition,  the  literature  usually  follows 
the  economist  Frank  Knight  in  suggesting  that 
risks  are  associated  with  events  whose  probabilities 
are  known.  Consequently,  three  sets  of  linkages 
between  risk  and  public  policy  can  be  distin¬ 
guished:  (1)  related  to  the  substantive  analysis  of 
risks,  (2)  the  context  of  policy  making  related  to 
risk,  and  (3)  the  policy  instruments  employed  in 
addressing  risk. 

Substantive  Analysis  of  Risks 

The  relationship  between  risk  and  public  policy 
can  be  defined  in  a  substantive  sense.  Accordingly, 
the  interest  here  is  how  public  policies  deal  with 
particular  types  of  risk.  Within  this  discussion  of 
substantive  risks,  three  separate  but  overlapping 
concerns  can  be  identified. 

Risks  to  Individual  and  Collective  Health 

One  key  consideration  over  the  past  3  decades 
has  been  the  interest  in  risk  as  a  result  of  environ¬ 
mental  and  technological  developments,  namely, 
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the  risks  to  individual  and  collective  health  that 
public  policies  are  supposed  to  protect  against. 
The  field  of  science  and  technology  studies  has 
considered  how  organizations  deal  with  uncertain¬ 
ties  (potential  events  to  which  probabilities  cannot 
be  assigned)  and  risks  (potential  events  to  which 
probabilities  can  be  assigned).  Significant  attention 
has  been  paid  to  the  socially  constructed  nature  of 
expert  decision  making,  such  as  regarding  the 
specifications  of  nuclear  reactors  or  civilian  air¬ 
craft.  Further,  there  has  been  significant  interest  in 
organizational  processes,  whether  relating  to  the 
“normalization  of  deviance”  as  part  of  (interorga- 
nizational)  processes;  the  inevitability,  as  discussed 
by  Charles  Perrow  (1999),  of  things  going  wrong; 
or  the  avoidance  of  such  errors  in  so-called  high- 
reliability  organizations.  This  interest  has  moved 
into  the  field  of  environmental  and  civil  contin¬ 
gency-related  risks,  especially  given  the  concern 
over  the  impact  of  climate  change  (e.g.,  flooding) 
and  security.  A  related  category  of  interest  has  con¬ 
cerned  itself  with  how  changing  values  among  the 
population  have  led  to  a  growing  distrust  of  large 
technical  systems,  especially  those  associated  with 
risks  of  catastrophic  consequence,  for  example, 
nuclear  reactors.  But  attention  has  not  merely  been 
paid  to  high-tech  or  catastrophic  large-scale  risks. 
In  a  comparative  study  of  nine  risks  (e.g.,  air  pol¬ 
lution,  dangerous  dogs,  and  pesticides),  Christopher 
Hood,  Henry  Rothstein,  and  Robert  Baldwin 
(2001)  found  significant  differences  in  risk  regula¬ 
tion  regimes  that  could  not  be  explained  by  a 
purely  functional  “risk  profile”  explanation. 
Related  to  this,  Christopher  Hood  and  Martin 
Lodge  (2005)  found  institutional  similarities  in 
regulatory  responses  to  one  particular  risk,  namely, 
dog  bites  in  public,  across  significantly  different 
national  institutional  contexts.  Regardless  of  the 
political  system,  regulatory  responses  to  media 
pressure  in  the  wake  of  fatal  dog-bite  attacks  was 
remarkably  similar. 

Risk  and  the  Political  Process 

A  second  key  consideration  relates  to  how 
actors  deal  with  risk  affecting  them  as  part  of  the 
political  process.  Such  substantive  issues  of  risk  to 
actors  are  at  the  heart  of  the  public  policy  litera¬ 
ture  on  regulation,  in  particular  regarding  the 
regulation  of  infrastructures.  Here,  the  key  interest 


is  in  understanding  the  institutional  conditions 
that  accentuate  so-called  political  risks.  In  con¬ 
texts  in  which  developmental  outcomes  (such  as 
investment  in  infrastructure  modernization  and 
expansion)  depend  on  private  investors,  as  govern¬ 
ments  lack  budgetary  resources,  private  investors 
will  adjust  the  levels  of  their  investment  according 
to  the  risk  of  their  being  expropriated  through 
administrative-regulatory  means.  The  risk  of  such 
expropriation  is  particularly  high  following  the 
initial  investment  in  fixed  assets  (such  as  infra¬ 
structure).  As  a  result,  private  investors  seek  regu¬ 
latory  devices  that  reduce  political  and  regulatory 
discretion.  Therefore,  policies  need  to  be  devised 
that  reduce  administrative  discretion  in  the  con¬ 
text  of  specific  political  institutional  constella¬ 
tions.  At  the  same  time,  the  literature  has  also  been 
interested  in  reducing  the  risk  of  “capture”  and 
“drift” — that  is,  the  risk  that  private  parties  will 
not  fulfill  their  mandate  or  rather  redirect  the 
goals  of  regulation  away  from  oversight  toward 
advocacy  of  their  own  interests.  Following  a 
period  in  which  the  literature  assumed  a  near 
inevitability  that  “agents”  would  escape  oversight, 
more  recent  literature  in  institutional  analysis  has 
pointed  to  devices  to  hardwire  regulatory  regimes 
in  order  to  reduce  problems  of  political  and  “pri¬ 
vate”  risks. 

Risk  and  Responsibility  When  Things  Go  Wrong 

The  third  area  of  concern  is  the  risk  of  being 
blamed  when  things  go  wrong.  This  literature  links 
to  the  long-established  interests  in  explaining  why 
politicians  delegate  certain  tasks  to  agencies  rather 
than  maintain  decision-making  authority  (the 
“blame-shift”  hypothesis  that  has  not  received 
much  empirical  support)  as  well  as  other  strategies 
by  which  politicians  seek  to  avert  or  divert  the 
risks  of  being  blamed,  such  as  strategies  of  presen¬ 
tation  (relying  on  spin  and  other  communication 
devices)  or  on  policy  strategies.  The  latter  can  be 
either  explicit  by  abandoning  particular  public 
activities  or  less  explicit  in  the  sense  of  frontline 
abandonment.  For  example,  street-level  bureau¬ 
crats,  such  as  police  forces,  withdraw  from  par¬ 
ticular  activities  that  have  caused  public  backlash. 
It  has  been  suggested  that  an  increasingly  adver¬ 
sarial  and  legalistic  environment  has  accentuated 
the  trend  toward  frontline  abandonment. 
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The  Context  of  Policy  Making 

Looking  at  the  contextual  setting  of  risk  and  pub¬ 
lic  policy  indicates  the  ways  in  which  either  the 
context  of  policy  making  has  shifted  or  the  specific 
conditions  under  which  policies  associated  with 
particular  risks  arise.  Again,  three  literatures  can 
be  distinguished. 

In  the  first  understanding  of  a  changing  context, 
the  key  environmental  change  is  associated  with 
Ulrich  Beck’s  “risk  society.”  According  to  Beck, 
societies  are  not  just  organized  by  risk  but  are  also 
confronted  with  risks  that  are  generated  through 
processes  of  modernization  itself.  Whereas  in  the 
past,  risks  were  associated  with  exogenous  (or 
external)  events  (rain,  asteroids),  contemporary 
society  is  faced  by  endogenous  (or  manufactured) 
risks,  such  as  industrial  and  automotive  emissions. 
Importantly,  the  nature  of  these  endogenous  risks 
leads  to  a  redistribution  of  positions  within  society 
in  which  wealth  no  longer  directly  insures  against 
risk  exposure.  Instead,  position  is  defined  by 
knowledge  and  access  to  information,  as  risks  are 
largely  “invisible.” 

Further,  the  risk  society  is  also  characterized  by 
increased  skepticism,  if  not  hostility,  toward  tradi¬ 
tional  sources  of  authority  and  expertise.  Moreover, 
as  noted  by  Mary  Douglas  and  Aaron  Wildavsky 
(1982),  accounts  informed  by  grid  group  cultural 
theory  highlight  value  conflicts  and  shifts  within 
society  and  therefore  stress  the  importance  of  risk 
perceptions,  thus  challenging  the  supposedly  neu¬ 
tral  technologies  of  risk  assessment. 

A  further  literature  considers  how  risks  provoke 
“moral  panics.”  Heightened  public  attention  trans¬ 
lates  into  pressure  (transmitted  by  the  media)  for 
something  to  be  done  to  which  politicians  seek  to 
respond.  Their  knee-jerk  reactions  are  regarded  as 
suboptional,  not  just  because  decision  making 
under  pressure  is  said  to  reduce  information  flows 
but  also  because  uncertainty  regarding  cause  and 
effect  is  prevalent.  Consequently,  inappropriately 
designed  responses  emerge  that  are  made  even  more 
problematic  by  agencies  seeking  to  regulate  for  the 
“last  10%,”  thereby  incurring  substantial  costs. 

Risk  and  Policy  Instruments 

A  third  type  of  linkage  between  risk  and  public 
policy  is  particularly  interested  in  instruments. 
Indicating  a  fundamental  divide  between  normative 


assumptions,  the  literature  is  divided  into  two 
key  areas.  One  focuses  in  particular  on  the 
importance  of  deliberative  and  proceduralized 
solutions  to  questions  of  risk,  pointing  in  partic¬ 
ular  to  the  societal  context,  as  described  by  Beck. 
In  light  of  inherent  value  conflicts  within  society, 
it  is  argued  that  deliberative  processes  are  likely 
to  narrow  rather  than  widen  differences  and  that 
therefore  procedural  devices  need  to  be  estab¬ 
lished  to  allow  such  deliberation  to  take  place.  A 
key  example  of  this  literature  has  been  the  rise  of 
the  precautionary  principle,  a  policy  approach 
that  has  gained  considerable  attention,  with 
accusations  that  it  invites  “capture”  and  “popu¬ 
lism”  as  it  suggests  (depending  on  exact  defini¬ 
tion)  that  the  burden  of  proof  can  be  reversed  if 
considerable  doubts  regarding  safety  exist. 
Particular  activities  or  products  are  no  longer 
innocent  until  proven  guilty;  instead,  they  are 
assumed  to  be  guilty  unless  proven  otherwise. 

In  contrast  to  the  deliberation-oriented  litera¬ 
ture,  a  separate  strain  argues  the  case  for  improved 
“risk  analysis”  instruments — that  is,  attempts  at 
rationalizing  decision  making  by  enhancing  the 
quality  of  information  used.  Such  instruments  and 
technologies  have  garnered  substantial  interest, 
especially  since  the  1970s  and  the  rise  of  the  “cost- 
benefit  state”  in  the  United  States.  Subsequently, 
such  instruments  have  gained  increased  currency  in 
other  jurisdictions,  for  example,  in  the  form  of 
regulatory  impact  assessments  or  the  advocacy  of 
diverse  “risk  management”  tools.  These  tools  sup¬ 
posedly  improve  decision  making  and  reduce  “irra¬ 
tionality”  among  politicians  and  civil  servants 
alike.  Whereas  the  attraction  of  these  instruments  is 
partly  to  reassert  authority  in  decision  making  and 
prevent  “hasty”  decision  making  (and  thus  “over¬ 
regulation”),  it  is  less  evident  that  these  instruments 
have  actually  reduced  the  impact  of  interest  groups 
and  heightened  public  attention  on  public  policy.  In 
contrast,  they  appear  to  have  themselves  become 
part  of  contested  policy  making,  given  that  their 
performance  in  terms  of  controlling  risks  has 
proven  to  be  less  than  satisfactory.  Indeed,  Michael 
Power  has  suggested  that  such  attempts  at  rational¬ 
izing  risk  in  policy  making  is  likely  to  accentuate 
rather  than  reduce  risks.  Not  unrelated  to  contribu¬ 
tions  that  seek  to  “rationalize”  decision  making 
regarding  risk,  there  have  been  debates  about 
instruments  to  insure  against  actualized  risks  and 
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their  potentially  redistributive  consequences.  This 
relates,  for  example,  to  debates  regarding  flood  or 
crop  insurance  schemes,  their  funding,  their  private 
or  public  status,  and  the  degree  to  which  such  sys¬ 
tems  are  mandatory  or  not. 

The  Field  of  Risk  and  Public  Policy 

Risk  and  public  policy  is  a  field  populated  by  con¬ 
tributions  from  across  the  social  sciences.  The 
transdisciplinary  nature  of  these  contributions  has 
only  increased  in  recent  years  as  the  rise  of  interest 
in  civil  contingencies,  both  before  and  after  crises 
(as  noted  by  Arjen  Boin,  Allan  McConnell,  &  Paul 
t’Hart,  2008),  as  well  as  the  growing  importance  of 
climate  and  demographic  change  demand  such 
cross-disciplinary  approaches.  While  the  focus  on 
substantive  risks  remains  critical,  other  research 
agendas  are  developing  that  focus  particularly  on 
the  failure  of  instruments  to  identify  risks  and  the 
need  to  understand  and  explain  variations  between 
jurisdictions  and  domains,  especially  in  terms  of 
variations  in  enthusiastic  endorsement  of  risk  pol¬ 
icy  instruments  or  perceptions  of  risk  across  differ¬ 
ent  contexts. 
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Rituals 


Rituals  are  a  challenge  for  political  science.  They 
can  be  found  in  a  great  variety  of  forms  and  are 
multifunctional  to  such  an  extent  that  science  has 
been  unable  to  find  an  unambiguous  definition  up 
to  now.  Already  the  term  ritual  was  used  in  antiq¬ 
uity  in  a  double  sense — with  a  primarily  religious 
meaning  (religious  cult)  or  modally  (the  type  and 
manner  of  carrying  out  the  ceremonies  of  the 
cult).  In  Roman  religion,  ritual  denotes  an  ordered 
ceremonial  activity.  Theology  and  religious  studies 
adhere  to  this  day  to  the  concept  of  ritual  as  a  col¬ 
lective  term  for  religious  ceremonies  as  well  as  for 
individual  sequences  therein.  It  includes  the  cults 
and  worshipping  customs  of  Jews  and  Christians 
as  well  as  those  of  various  peoples  in  different 
times  and  places,  from  the  Egyptian  Isis  cult  or  the 
Roman  priestly  authority  of  the  Vestal  Virgins  to 
the  religious  forms  of  contemporary  Confucianism, 
Buddhism,  Taoism,  and  Islam.  Early  on,  Confucian 
philosophers  investigated  the  legitimate  role  of 
ritual  (in  Chinese,  li)  in  the  guidance  of  a  society. 
This  entry  will  discuss  the  impact  of  such  tradi¬ 
tions  on  recent  developments  in  the  social  sciences 
and  the  renewed  importance  of  the  study  of  ritu¬ 
als. 

From  the  perspective  of  religious  studies,  ritual 
is  a  matter  of  cubic  actions  that  follow  fixed  rules 
for  the  purpose  of  worshipping  God,  the  gods,  or 
figures  considered  to  be  holy.  Religious  rites 
express  explicit  wishes  for  propitiation,  solace, 
recognition,  and  order.  They  refer  to  a  transcen¬ 
dental  power.  Their  rules,  texts,  and  notes  are 
gathered  in  Christian  churches  into  official  liturgi¬ 
cal  books.  For  instance,  in  the  Roman  Catholic 
Church  they  are  described  in  the  Rituale  Romanum 
dating  from  1614  (Pope  Paul  V),  which  in  1918 
became  obligatory  for  all  dioceses.  Anthropology, 
ethnology,  anthropological  religious  research, 
sociology,  and  psychology  attach  great  signifi¬ 
cance  to  the  conceptual-theoretical  recording,  col¬ 
lection,  and  classification  of  religious  and  profane 
ceremonies. 

The  founder  of  anthropological  religious 
research,  Edward  Bernett  Tylor  (1873),  distin¬ 
guished  religious  rites  as  an  expressive-symbologi- 
cal  gestural  language  of  theology  from  those  that 
serve  for  communicating  with  and  influencing 
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spirits.  With  his  investigation  of  the  religion  of  the 
Semites,  William  Robertson  Smith  created  an  ini¬ 
tial  foundation  for  the  systematic  investigation  of 
rituals.  He  developed  the  thesis  that  religions 
essentially  consist  of  beliefs  and  rites,  with  rites 
and  practical  customs  being  dominant.  Arnold  van 
Gennep’s  Les  Rites  de  Passage  recognizes  the 
rhythm-creating  function  of  rituals  as  a  reflection 
of  natural  rhythms.  A  phase  pattern  serves  for  clas¬ 
sifying  rites  or  individual  sequences.  Transitional 
rites  are  intended  to  allow  individuals  and  groups 
to  transcend  spatial,  temporal,  and  social  borders. 
For  Gennep,  rites  are  “compelling  actions.”  Thus, 
the  acceptance  of  a  gift  has  an  obligatory  effect  on 
the  recipient.  Gennep’s  conceptualization  gained 
lasting  recognition  for  the  anthropological  investi¬ 
gation  of  initiatory  rites. 

The  social  sciences  speak  of  rituals  as  a  collec¬ 
tive  designation  with  which,  however,  individual 
scientists  relate  different  categories  and  theories. 
Ritual  and  rite  are  frequently  used  as  synonymous 
terms.  Emile  Durkheim  and  his  students  Henri 
Hubert  and  Marcel  Mauss  developed,  with  regard 
to  the  sociology  of  religion,  a  theory  later  taken  up 
by  social  anthropology  and  French  structuralism. 
Durkheim  defined  religion  as  a  system  of  beliefs 
and  practices  that  are  united  in  a  single  moral  com¬ 
munity.  For  the  Durkheim  school,  rituals  bring 
about  social  integration;  they  are  mechanisms  that 
produce  social  conformity  and  solidarity.  Whenever 
people  come  together,  there  is,  according  to 
Durkheim,  a  natural  tendency  to  coordinate,  stan¬ 
dardize,  and  repeat  their  actions.  Durkheim  desig¬ 
nated  the  feeling  of  participating  through  group 
activity  in  something  transcendent  as  “the  sacred,” 
which  is  represented  in  symbols. 

With  Max  Weber,  too,  rituals  can  have  the  func¬ 
tion  of  creating  identity,  based  on  common  inter¬ 
ests  and  the  “ritual  qualification”  of  membership, 
as  he  demonstrated  with  early  religions  or  the  cul- 
tic  communities  of  the  polis.  Subsequently,  in 
Chinese  Confucianism  and  Taoism,  those  well 
versed  in  writing  and  literature  (ritually  trained 
scholar-politicians)  occupied  high  political  offices. 
According  to  Weber,  rituals  can  mark  the  social 
differentiation  of  a  society.  Thus,  caste  societies  are 
based  on  the  ritual  separation  of  occupations  and 
professions.  Social  scientists  at  the  turn  of  the  20th 
century  focused  mainly  on  the  rituals  of  tradi¬ 
tional,  preindustrial  societies.  They  linked  them 


closely  to  religion  and  magic  and  excluded  other 
(profane)  manners  of  behavior.  Social  rationaliza¬ 
tion,  however,  caused  religions  and  rituals  to 
diverge  more  strongly.  The  tendencies  of  Western 
societies  toward  rationalization,  which  were  ana¬ 
lyzed  by  Weber,  are  capable,  through  a  “disen¬ 
chantment”  of  the  world,  of  relegating  religious 
rituals  to  the  margins  of  public  and  private  life,  or 
of  situating  them  in  the  political  and  social  domains. 

Political  Rituals 

Terminologically,  the  concept  of  ritual  cannot 
be  limited  to  religious  and  magical  rituals,  for  the 
conceptual  core  consists  of  ritual  acts.  These  are 
omnipresent  in  modern  and  postmodern  societies, 
additionally  and  especially  in  political  life:  from 
ceremonies  in  parliaments,  inaugurations  of  presi¬ 
dents,  jurisprudence,  diplomatic  protocol,  and 
electoral  campaigns  all  the  way  to  debates  in  the 
mass  media.  Political  leaders  endeavor  by  means 
of  rituals  to  shape  a  political  reality  for  their  fol¬ 
lowers.  Revolutionaries  seek  new  paths  for  the 
legitimation  of  a  political  regime.  Through  their 
participation  in  rituals,  citizens  identify  themselves 
with  political  parties  and  social  movements  that 
are  comprehensible  only  in  symbolic  form.  The 
political  elite  live  amid  rituals  that  regulate  the 
interaction  among  its  members  and  with  the  public 
at  large. 

Edward  Shils  found  that  ritual  still  had  a  bad 
reputation  among  utilitarian-thinking  intellectuals 
in  the  1960s;  the  advance  of  rationalization  and 
secularization  had  caused  it  to  be  considered  as  no 
more  than  a  relic.  There  are,  however,  good  reasons 
to  treat  it  as  a  contemporary  phenomenon  sui 
generis.  Political  rituals  have  been  noted,  but  their 
empirical  investigation  is  only  now  getting  under 
way.  Impulses  for  research  came  from  three  direc¬ 
tions:  (1)  from  the  debate  concerning  symbolic 
policy  and  politics  of  symbols  (as  discussed  by 
Murray  Edelman,  1971),  (2)  from  analyses  of  fas¬ 
cist  and  communist  dictatorships  (such  as  that  of 
Charles  Lane,  1981),  and  (3)  from  political  culture 
research  such  as  that  of  Gabriel  A.  Almond  and 
Sidney  Verba.  Attention  shifted  from  religious  to 
symbolic  rituals.  Inasmuch  as  symbols  imbue  actions 
with  meaning,  rituals  allow  persons  and  groups  to 
develop  their  own  worldviews.  Politics  is  then  the 
art  of  understanding  and  further  developing  (in 
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accordance  with  one’s  self-interest)  the  predomi¬ 
nant  symbols  of  an  era. 

Symbolic  politics  are  capable  of  simply  feigning 
political  activity  before  a  mass  audience  and  of 
providing  benefits  for  an  elite.  But  as  a  rule,  they 
are  structured  so  as  to  include  the  audience  in 
political  activities.  They  refer  to  the  symbolic  and 
cultural  capital  of  individuals  (as  noted  by  Pierre 
Bourdieu),  which  determines  social  prestige 
through  the  use  of  language,  education,  and  cul¬ 
ture.  Symbolic  politics  are  engaged  in  an  unswerv¬ 
ing  struggle  over  the  designations,  concepts,  and 
interpretations  of  the  agenda  of  public  life.  When 
successfully  used,  they  become  a  communicative 
power  for  the  purpose  of  promoting  interests.  In 
repressive  regimes,  rituals  can  be  used  to  stifle  con¬ 
tradiction  and  to  destroy  regime  opponents,  as,  for 
example,  in  the  Moscow  purges  of  the  1930s. 

In  addition  to  the  interest  in  the  ritual  activity  of 
integrated  groups  in  the  tradition  of  Smith  and 
Durkheim,  questions  arise  as  to  the  function  of 
rituals  in  conflicts  within  and  between  groups  and 
societies  or  with  regard  to  the  establishment 
of  social  and  political  borders.  Civil  wars,  ritualis¬ 
tic  terrorism,  and  failed  states  demonstrate  the 
political  relevance  of  ritual.  They  help  represent 
political  power  openly  inasmuch  as,  for  example, 
constructed  traditions  and  myths  are  ritualized.  In 
the  heterogeneous  global  society,  nationally  or  cul¬ 
turally  based  rituals  may  appear  to  be  dangerous — 
for  instance,  in  the  Western  world,  the  rituals  of 
honor  and  disgrace.  But  rituals  can  also  have  a 
pacifying  effect,  such  as  the  ritual  of  recurring 
world  conferences,  or  they  can  symbolize  the  con¬ 
cept  of  peace,  as  in  the  sit-ins  and  human  chains  of 
the  peace  movement  during  the  1960s  and  1970s. 
Consideration  is  also  given  to  the  biological  bases 
of  social  behavior  that  influence  the  fundamental 
forms  of  politics:  hierarchy,  peer  groups,  authority, 
fear,  and  competition.  The  ethologist  Julian  Huxley 
introduced  the  term  ritualization  for  the  stylized, 
repetitive  gestures  and  postures  of  animals.  Pro¬ 
ceeding  from  this  perspective,  ethnological  sociol¬ 
ogy  understands  this  term  as  a  mechanism  for 
regulating  social  behavior,  with  which  intimacy, 
formality,  territoriality,  hierarchy,  or  difference  in 
age  can  be  expressed,  and  which  simultaneously 
fosters  self-identification.  Rituals  thus  are  multi¬ 
functional,  open  processes.  The  anthropological 
and  literary-critical  mainstream  regards  ritual 


phenomena  as  symbolic,  expressive  activity. 
Anthropologists  such  as  Victor  Turner  (1969) 
applied  the  concept  of  ritual  to  industrial  societies, 
and  political  scientists  such  as  David  I.  Kertzer 
(1988)  argued  for  an  application  to  modern  poli¬ 
tics.  It  is  basically  a  matter  of  inquiring  into  how 
symbolic  processes  enter  into  politics  and  why  they 
are  important. 

For  a  long  time,  political  scientists  tended, 
through  their  belief  in  ongoing  secularization,  to 
overlook  the  phenomena  of  ritual  and  concomi¬ 
tantly  of  the  body  as  carriers  of  communication. 
Later,  in  the  1980s,  in  the  United  States,  a  cross- 
disciplinary  investigative  approach  was  estab¬ 
lished  around  the  Journal  of  Ritual  Studies. 
Sociologists,  political  scientists,  jurists,  media 
and  communications  scholars,  art  theoreticians, 
dramaturgists  as  well  as  literary  and  cultural 
scholars  united  in  the  investigation  of  ritual 
aspects  in  the  subsystems  of  postmodern  societ¬ 
ies.  The  political,  social,  cultural,  and  religious 
dynamic  since  the  end  of  the  East-West  conflict 
has  sharpened  the  awareness  of  ritual  forms  in 
the  political  sphere.  In  parallel,  religion  has  been 
rediscovered  as  a  crucial  social-political  factor. 
Samuel  P.  Huntington’s  1996  study  The  Clash  of 
Civilizations  and  the  Remaking  of  World  Order 
became  an  important  point  of  reference.  Initial 
analytical  steps  brought  to  the  fore,  out  of  exist¬ 
ing  microstudies  and  theoretical  outlines,  a  series 
of  attributes  or  attributive  dimensions.  Jan 
Platvoet  (1955)  suggests  distinguishing  between 
13  characteristics  or  functions  with  which  rituals 
may  be  identified.  He  argues  that  rituals  are 
interactive,  collective,  a  habit,  a  traditionalizing 
innovation,  expressive,  communicative,  sym¬ 
bolic,  multimedial,  a  performance,  performative, 
aesthetic,  strategic,  and  integrative.  Proceeding 
from  the  experiences  of  participants  in  various 
ritual  acts,  Ronald  L.  Grimes  (1995)  formulated 
a  typology  that  endeavors,  under  the  generic 
term  of  ritual,  to  classify  the  entire  area  into  six 
types:  ritualization,  rules  of  propriety,  ceremony, 
magic,  liturgy,  and  celebration. 

Communication,  Performance, 

Political  Performance 

The  three  key  concepts  of  communication,  perfor¬ 
mance,  and  political  performance  are  characteristic 
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of  various  social-scientific  approaches  toward 
explaining  ritual  processes.  The  widespread  con¬ 
ception  of  rituals  as  communicative  activities  is 
related,  on  the  one  hand,  to  the  theory  of  action 
that  goes  back  to  Max  Weber  and  Talcott  Parsons 
and,  on  the  other  hand,  to  the  shift  toward  lan¬ 
guage  as  activity  accomplished  in  the  20th  century 
by  philosophy  and  by  the  theory  of  science. 
According  to  the  philosopher  Ludwig  Wittgenstein, 
speech  is  an  activity  guided  by  rules.  The  ritual  as 
text,  as  script,  determines  sequences  of  actions  and 
those  engaged  therein.  Its  symbols  refer  to  certain 
systems  of  belief  or  knowledge.  Rituals  are  thus  a 
special  genre  of  communication  within  the  com¬ 
municative  repertoire  of  a  social  unit. 

Alongside  stands  an  application  of  philological 
methods  to  the  study  of  human  activities  that  is 
derived  from  structuralism  (e.g.,  the  work  of 
Leonard  Bloomfield,  Claude  Levi-Strauss,  etc.).  The 
speech  acts  of  a  ritual  or  of  other  forms  of  activity 
are  segmented  into  repetitive  parts  whose  distribu¬ 
tion  is  classified  with  the  goal  of  comprehending  the 
synchronous  coaction  as  a  structured  semiotic  sys¬ 
tem.  The  analysis  is  text  related  and  without  con- 
textualizations,  for  example,  of  a  social-historical, 
geographical,  or  economic  type.  The  primary  object 
of  investigation  consists  of  the  frequently  uncon¬ 
scious  ritualizations  of  everyday  speaking  (perfor¬ 
mance).  There  are  significant  investigations  into 
rituals  of  transition  (e.g.,  by  van  Gennep)  or  rituals 
of  accessibility  (e.g.,  by  Erving  Goffman)  on 
entrance  into  a  new  situation,  a  new  office,  an 
assembly,  or  a  segment  of  life  and  into  the  changes 
in  status  and  roles  that  are  contained  therein.  The 
accent  of  these  investigations  is  on  social  borders. 

The  terminology  of  performance  that  has  spread 
from  the  artistic  program  of  “action  art”  to  the 
analysis  of  ritual  should  draw  attention  to  the  fact 
that  activities  embody  and  construct  meaning. 
Clifford  Geertz  and  others  have  included  this  real¬ 
ization  in  the  term  cultural  performance ;  Turner 
speaks  of  social  drama.  The  analysis  thus  reacts  to 
social  upheavals  during  the  1970s  that  were  trig¬ 
gered  by  new  social  and  political  movements,  by  a 
sensitization  to  spiritual  ideas  and  foreign  tradi¬ 
tions  (New  Age),  and  by  the  liturgical  renewal 
(Second  Vatican  Council)  of  the  Roman  Catholic 
Church.  Inasmuch  as  the  processes  and  transfor¬ 
mative  power  of  theatrical  dramaturgy  and  ritual 
prove  to  be  interrelated,  it  is  expected  that  rituals 


will  be  able  to  help  channel  social  crises  into  the 
reconstruction  and  renewal  of  societies. 

Above  all,  the  mass  media  have  taken  over  the 
role  of  staging  reality  for  a  worldwide  audience 
through  the  ritual  formats  of  the  news  broadcast, 
talk  shows,  and  political  events  such  as  inaugura¬ 
tions,  parliamentary  hearings,  conferences,  and 
wartime  reports.  The  holders  of  political  office 
have  at  all  times  made  use  of  a  broad  repertoire  of 
symbols  to  create  solidarity  among  people  and  to 
impart  to  them — in  Durkheim’s  sense — a  group  or 
community  identity:  flags;  hymns;  insignia  of  power 
such  as  coats  of  arms,  the  throne,  the  crown,  cere¬ 
monial  clothes,  and  reserved  colors;  political  sym¬ 
bols  of  above  and  below,  good  and  evil,  or  left  and 
right;  and  certain  animals  (e.g.,  lions,  eagles),  key 
words,  sounds,  and  images,  as  well  as  buildings. 

Ralf  Rytlewski 
Freie  Universitat  Berlin 
Berlin,  Germany 
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Robust  Statistics 


Traditional  methods  for  estimating  regression 
models  can  be  unduly  influenced  by  a  small  subset 
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of  the  data.  For  example,  using  ordinary  least 
squares  (OLS)  regression  to  model  economic 
growth  in  15  industrialized  democracies,  Peter 
Lange  and  Geoffrey  Garrett  find  that  the  interac¬ 
tion  between  Left  governments  and  organized 
labor  is  positively  and  significantly  correlated  with 
economic  performance.  In  his  analysis  of  the  data, 
however,  Bruce  Western  illustrates  that  these  find¬ 
ings  are  largely  determined  by  the  Norwegian 
case.  Using  robust  estimation  techniques  to 
account  for  this  observation  reveals  much  greater 
uncertainty  about  the  influence  of  the  interaction 
effect  on  economic  performance.  OLS  estimation 
minimizes  large  residuals  at  the  expense  of  degrad¬ 
ing  the  fit  of  the  remaining  observations.  The  coef¬ 
ficient  estimates  it  generates  can  thus  be  strongly 
influenced  by  even  a  single  large  residual,  as  was 
the  case  in  the  model  estimated  by  Lange  and 
Garret.  Like  OLS  estimation,  unusual  points  may 
also  influence  traditional  maximum  likelihood 
methods.  The  likelihood  often  depends  on  means 
and  variances,  whose  estimated  values  can  be 
largely  determined  by  points  lying  outside  the 
majority  of  the  data.  Thus,  when  implementing 
any  regression  method  based  on  the  mean  and 
variance,  the  analyst  must  be  wary  of  unusual 
observations.  In  this  entry,  various  ways  of  dealing 
with  this  problem  are  discussed. 

Observations  can  be  unusual  in  two  important 
ways:  First,  an  observation  may  be  an  outlier  if  the 
value  of  the  explanatory  variable  is  typical  of  the 
sample  but  the  value  of  the  outcome  variable  is 
atypically  large  or  small;  alternatively,  an  observa¬ 
tion  may  be  a  leverage  point  if  the  value  of  the 
explanatory  variable(s)  is  demonstrably  different 
from  the  rest  of  the  data.  An  influential  observa¬ 
tion  refers  to  either  an  outlier  or  leverage  point 
whose  inclusion  in  the  analysis  substantially  alters 
the  estimates  of  the  statistics  of  interest,  including 
parameter  estimates  and  predictions,  the  estimated 
variance  of  these  values,  and  the  goodness-of-fit 
statistics.  These  influential  observations  can  result 
from  “bad”  data  (such  as  data  that  have  been 
recorded  incorrectly),  improper  modeling  of  an 
outcome  variable  with  a  heavy-tailed  distribution, 
or  models  that  fail  to  describe  the  data  well  for 
certain  values  of  the  predictor. 

Even  if  only  a  small  fraction  of  the  data — or 
even  a  single  observation — is  influential,  estima¬ 
tion  strategies  that  assume  all  data  are  modeled 


correctly  may  produce  erroneous  results.  To  avoid 
producing  incorrect  estimates,  regression  analyses 
must  account  for  influential  observations.  A  popu¬ 
lar  strategy  for  dealing  with  these  observations  is 
the  diagnosis  and  removal  of  these  points  before 
the  estimation  of  a  regression  model  to  the  remain¬ 
ing  “good”  data.  This  technique  is  acceptable  if 
the  observation  has  been  recorded  incorrectly  and 
the  true  value  cannot  be  recovered,  if  the  observa¬ 
tion  arises  from  a  different  population  than  the 
other  data,  or  if  there  is  a  theoretical  justification 
for  excluding  the  observation  from  the  analysis.  It 
is  problematic,  in  contrast,  if  the  influential  point 
emerges  because  of  heavy-tailed  distributions  or 
inadequate  models.  In  these  cases,  an  observation 
cannot  be  discarded  simply  because  it  fails  to  fit 
the  model,  as  removing  this  data  point  provides 
unwarranted  support  to  the  incorrect  model. 
Instead,  robust  regression  provides  a  compromise 
between  deleting  influential  observations  and 
allowing  them  to  violate  the  assumptions  of  tradi¬ 
tional  regression  estimators.  When  data  conform 
to  conventional  assumptions,  robust  and  nonro- 
bust  methods  provide  similar  estimates  of  statistics 
of  interest.  Robust  estimators,  however,  are  resis¬ 
tant  to  the  effects  of  influential  observations  and 
retain  their  efficiency  when  data  are  nonnormal. 

Diagnosing  Influential  Points 

To  determine  if  a  robust  regression  approach  is 
necessary,  the  analyst  must  establish  whether  there 
are  influential  points  in  the  data  set.  Before  fitting 
a  regression  model,  the  investigator  often  conducts 
exploratory  data  analysis  such  as  visualizing  the 
variables  in  scatterplots  and  box  plots.  While  a 
visual  examination  of  the  data  can  sometimes  be 
used  to  detect  outliers  and  leverage  points,  it  can¬ 
not  determine  whether  these  points  are  influential, 
nor  does  it  capture  all  potential  influence  points. 
Another  popular  strategy  for  detecting  outliers  is  a 
post  hoc  examination  of  the  residuals  from  the 
regression  analysis.  Though  a  large  residual  indi¬ 
cates  that  an  observation  is  an  outlier,  this  outlier 
may  not  be  an  influential  point.  Moreover,  leverage 
points  can  exhibit  small  residuals  while  altering  the 
estimates  of  the  statistics  of  interest.  Given  the 
limitations  of  these  techniques,  it  is  necessary  to 
examine  influence  statistics  before  assuming  that 
there  are  no  influential  observations  in  the  data  set. 
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The  three  most  popular  measures  of  influence 
rely  on  regression  deletion  diagnostics  that  exam¬ 
ine  the  effect  of  removing  a  given  observation  on 
the  estimated  quantities  of  interest.  The  most  com¬ 
monly  used  diagnostic  for  influential  cases  is 
Cook’s  Distance  (Cook’s  D),  which  measures  the 
impact  of  an  unusual  observation  on  the  slope 
coefficients.  Unless  the  observation  is  an  influential 
point,  the  coefficient  estimates  calculated  when 
observation  z  is  removed  from  the  analysis  should 
be  similar  to  the  estimated  values  when  this  case  is 
included.  The  Cook’s  D  is  usually  assessed  by  com¬ 
paring  its  value  with  an  F  distribution,  and  while 
there  is  no  universally  acknowledged  cutoff,  a  per¬ 
centile  value  greater  than  50%  demonstrates  that 
the  zth  observation  has  a  significant  effect  on  the 
estimated  slope  coefficients. 

While  Cook’s  D  measures  the  influence  of  the 
zth  observation  on  all  slope  coefficients,  the  DFFIT 
provides  the  actual  changes  in  the  prediction  esti¬ 
mates  caused  by  the  deletion  of  each  case  through 
a  measure  of  the  influence  of  the  zth  observation  on 
the  fitted  value  for  y,.  The  resulting  statistic  repre¬ 
sents  the  number  of  estimated  standard  deviations 
that  the  fitted  value  increases  or  decreases  with  the 
inclusion  of  the  zth  observation  in  the  model.  Any 
observation  with  \DFFIT\  >  2  \J (p  +  1  )/zz,  where 
p  is  the  number  of  linear  predictors  and  n  is  the 
number  of  observations,  should  be  investigated. 

Finally,  the  DFBETA  is  an  analogous  statistic 
for  examining  an  observation’s  influence  on  a  par¬ 
ticular  coefficient.  It  is  obtained  by  calculating  the 
difference  between  the  coefficient  estimated  with 
and  without  the  zth  observation.  The  resulting  sta¬ 
tistic  measures  the  change  in  the  number  of  stan¬ 
dard  deviations  for  these  two  estimates  of  the 
coefficient.  A  large  DFBETA  value  may  indicate 
that  the  zth  observation  has  a  large  influence  on 
this  regression  coefficient.  The  cutoff  value  for 
DFBETA  statistics  is  2/ \fn. 

These  three  deletion  diagnostics  are  used  to 
uncover  individual  observations  that  are  influen¬ 
tial.  It  can  be  possible,  however,  for  a  case  that  is 
not  individually  influential  to  become  highly  influ¬ 
ential  when  grouped  with  other  observations.  This 
is  known  as  the  masking  effect ,  because  the  influ¬ 
ence  of  the  observation  is  masked  by  the  presence 
of  a  neighboring  case.  Single-case  deletion  diagnos¬ 
tics  can  be  extended  to  assess  the  changes  caused 
by  the  deletion  of  more  than  one  observation  at  a 


time.  Although  this  strategy  may  be  effective  for 
small  subsets  of  the  data,  if  all  possible  subsets  of 
cases  are  considered  multiple-case  deletions,  diag¬ 
nostics  become  computationally  burdensome. 
Thus,  measures  of  influence  employing  cluster 
analysis  may  have  to  be  implemented  in  order  to 
detect  subsets  of  influence  points. 

Robust  Regression 

After  diagnosing  the  presence  of  influential  points 
in  the  data,  an  appropriate  regression  method 
must  be  selected  to  mitigate  the  effect  of  these 
points.  In  selecting  a  modeling  strategy,  an  impor¬ 
tant  factor  to  consider  is  the  breakdown  point  of 
the  method.  The  breakdown  point  is  the  propor¬ 
tion  of  incorrect  observations  that  an  estimator 
can  support  before  giving  an  arbitrarily  erroneous 
result.  A  breakdown  point  cannot  exceed  50%, 
because  if  more  than  half  of  the  observations  are 
problematic,  it  is  impossible  to  differentiate  the 
underlying  distribution  from  the  contaminating 
distribution.  In  the  context  of  estimating  the  cen¬ 
ter  of  a  distribution,  the  mean  has  a  breakdown 
point  of  0,  as  it  can  be  made  arbitrarily  large  by 
changing  a  single  observation.  The  median,  in 
contrast,  exhibits  the  maximum  breakdown 
point. 

Resistant  statistics  are  those  with  breakdown 
points  near  50%.  There  are  three  main  robust  regres¬ 
sion  techniques  that  generate  resistant  statistics: 
M-estimators,  least  median  estimators,  and  likeli¬ 
hood  modifications.  Each  of  these  strategies 
approachesrobust  regression  differently.  M-estimators 
can  be  used  to  weight  observations  to  reduce  the 
effect  of  influential  points.  Least  median  estimators, 
in  contrast,  are  naturally  resilient  to  influential  points 
due  to  a  high  breakdown  point  of  the  median. 
Finally,  likelihood  modifications  allow  the  investiga¬ 
tor  to  model  the  distribution  of  the  influential  points. 
While  all  three  techniques  are  superior  to  regres¬ 
sion  models  that  do  not  account  for  influential 
observations,  each  method  has  theoretical  and  com¬ 
putational  strengths  and  weaknesses  that  must  be 
considered  when  choosing  a  modeling  strategy. 

M -Estimators 

Most  fitting  criteria  find  parameter  estimates 
that  minimize  (or  maximize)  an  objective  function 
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of  the  observed  data  and  the  parameters. 
Minimizing  the  sum  of  squared  residuals  is  an 
example  of  a  broader  class  of  estimators,  known 
as  M-estimators,  which  provide  statistics  that 
minimize  an  objective  function  of  the  form 
where  r,  is  the  residual  of  the  z'th 
observation.  The  function  p  represents  the  type 
of  error  that  the  investigator  wishes  to  mini¬ 
mize,  such  as  the  squared  error,  the  absolute 
value  of  the  error,  and  so  on.  Many  robust 
regression  methods  are  based  on  iteratively 
reweighted  M-estimators,  where  weights  {iVj) 
are  used  to  diminish  the  effect  of  influential 
points.  Because  these  weighted  M-estimators 
have  a  higher  breakdown  point  than  traditional 
regression  estimators,  they  are  more  resistant  to 
influential  observations. 

The  estimates  of  the  parameters  ((3)  are  easy  to 
calculate  (though  the  standard  errors  can  be 
obtained  only  through  the  use  of  bootstrap  meth¬ 
ods).  Iteratively  reweighted  M-estimators  are  esti¬ 
mated  using  the  following  algorithm: 

Step  0:  Assign  weight  1  to  each  data  point. 

Step  1:  Find  the  value  of  p  that  minimizes  yr 

Step  2:  Compute  the  new  residuals  (r,). 

Step  3:  Redefine  the  weights  Wj  using  the  residuals  r, 
by  the  prescribed  weighting  scheme. 

Step  4:  Return  to  Step  1,  substituting  the  new 
residuals  and  weights  into  the  objective  function. 

Step  5:  Repeat  the  iterations  until  the  values  of  3 
have  converged. 

When  using  M-estimators,  the  most  important 
consideration  is  prescribing  a  weighting  scheme 
that  down-weights  influential  points  in  a  reason¬ 
able  way.  Since  the  weights  depend  on  the  values 
of  the  residuals,  plotting  w,  as  a  function  of  r, 
shows  how  large  residuals  are  treated  by  the 
weighting  scheme.  Deletion  methods  have  a 
weighting  function  that  is  1  on  an  interval  until  the 
residual  exceeds  the  cutoff  point  determined  by  the 
method,  and  the  weighting  function  immediately 
drops  to  0.  A  continuous  weighting  function,  in 
contrast,  gradually  decreases  the  weights  as  the 
residuals  increase. 

A  popular  continuous  weighting  scheme  is 
Tukey’s  bi  weight  method.  This  iteratively 


reweighted  least  squares  M-estimator  assigns 
weights  between  0  and  1  to  all  observations, 
decreasing  from  1  to  0  as  the  residual  becomes 
larger.  This  method  results  in  the  removal  of  obser¬ 
vations  whose  residuals  are  too  large,  but  in  con¬ 
trast  to  deletion  methods,  a  residual  that  is  close  to 
the  cutoff  point  receives  a  weight  that  is  close  to  0, 
rather  than  a  weight  of  1 .  Like  other  down-weight¬ 
ing  M-estimators,  Tukey’s  bi-weight  method  ade¬ 
quately  accounts  for  influential  outliers.  Under 
certain  circumstances,  however,  both  Tukey’s 
method  and  the  broader  class  of  M-estimators  can 
be  susceptible  to  high-leverage  observations. 

Least  Median  Estimators 

In  contrast  to  the  reweighting  methods,  it  is 
also  possible  to  define  objective  functions  that  are 
already  robust  in  some  sense.  In  place  of  ^hp(r,), 
functions  of  the  form  median^i)  give  rise  to 
least  median  regression.  The  intuition  behind  this 
method  is  straightforward.  Since  the  median  is  a 
statistic  that  is  naturally  robust  to  extreme  values, 
minimizing  the  objective  function  allows  one  (or 
several)  influential  points  to  take  a  large  value  of 
p(r)  while  the  remaining  observations  have  a  value 
of  p(r)  that  is  small.  The  few  large  values  have  little 
or  no  effect  on  the  estimate  of  the  median,  and  the 
values  of  3  that  are  obtained  best  fit  the  majority  of 
the  data. 

One  theoretically  appealing  choice  is  least 
median  squares  (LMS)  regression,  which  arises 
from  choosing  p(r)  =  r1 .  The  breakdown  point  of 
LMS  regression  for  p  >  1  is 

n  —  2p  +  4 

~ : in  ’ 

which  is  50%  in  large  data  sets.  Although  both 
this  method  and  M-estimation  techniques  handle 
outliers  in  a  satisfactory  manner,  LMS  regression 
is  more  robust  to  leverage  points.  For  this  reason, 
it  is  preferable  to  other  robust  regression  tech¬ 
niques.  However,  an  exact  solution  to  a  least 
median  squares  estimation  problem  necessitates 
searching  over  a  space  of  size  (p  +  i)  and  inverting 
a  (p  +  1)  x  (p  +  1)  matrix  at  each  step,  which  is 
infeasible  for  large  n  and  moderate  p.  Random 
subsampling  must  therefore  be  used  to  achieve  a 
good  estimator  with  high  probability. 
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Likelihood  Adjustments 

In  contrast  to  minimization  techniques,  the 
effect  of  influential  points  can  be  reduced  using 
likelihood  methods.  OLS  regression  is  equivalent 
to  maximum  likelihood  estimation  when  the 
errors  are  assumed  to  be  independently  and  identi¬ 
cally  distributed  from  the  same  mean  0  Gaussian 
distribution.  To  account  for  influential  points,  the 
sampling  distribution  of  the  data  can  be  changed 
from  a  Gaussian  to  a  heavy-tailed  distribution. 
This  allows  for  more  large  residuals  to  be  observed 
without  greatly  modifying  the  estimated  mean 
structure  of  the  regression.  A  natural  choice  of 
heavy-tailed  distributions  is  the  Student’s  t  family. 
These  distributions  have  polynomial  decay,  which 
is  slower  than  the  exponential  decay  of  the 
Gaussian,  and  as  v  — >  °°,  the  Student’s  t  with  v 
degrees  of  freedom  converges  to  a  normal  distri¬ 
bution.  In  contrast  to  the  other  techniques,  the 
confidence  bounds  on  parameter  estimates,  and 
predictions  can  be  easily  obtained  from  the  likeli¬ 
hood  function.  However,  while  this  likelihood 
method  accounts  for  influential  outliers,  like  the 
M-estimator,  it  remains  susceptible  to  leverage 
points.  To  account  for  leverage  points  in  the  likeli¬ 
hood  framework,  it  may  be  necessary  to  fit  a  mix¬ 
ture  model.  This  approach  acknowledges  that  the 
observations  may  arise  from  multiple  data  gener¬ 
ating  processes,  each  of  which  must  be  modeled. 
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Rousseau,  Jean-Jacques 

(1712-1778) 


Jean-Jacques  Rousseau  is  a  unique  figure  in  the 
history  of  political  thought  whose  legacy  spans 
literature  and  political  philosophy  with  equally 
outstanding  creativity.  As  a  political  thinker, 
Rousseau  helped  formulate  with  the  greatest  rigor 
the  normative  standards  of  modern  Western  poli¬ 
tics.  Many  ideas  and  convictions  now  considered 
fundamental  to  modern  democracy  originated  with 
Rousseau  or  were  revisited  by  him — sovereignty  of 
the  people,  the  role  of  the  individual,  the  requi¬ 
sites  of  fair  governance  based  on  respect  for  the 
voice  of  the  collective,  and  the  revolutionary 
dimension  of  these  aspirations.  All  are  to  be  found 
in  Rousseau’s  works.  Although  not  literally  a 
democrat  himself  (he  notes  in  his  most  important 
treatise  of  political  philosophy,  On  the  Social 
Contract,  that  only  a  “people  of  gods”  could  gov¬ 
ern  itself  under  such  a  regime),  Rousseau  sketched 
many  of  the  concepts  on  which  modern  democ¬ 
racy  rests.  In  the  18th  century,  when  philosophy 
contributed  new  ideas  to  radically  reshape  the 
norms  of  what  is  acceptable  and  desirable  in  poli¬ 
tics  and  when  new  classes  came  to  power,  Rousseau 
was  seen  by  many  as  the  soul  of  his  era  and  a 
major  inspiration  for  the  future.  This  entry  reviews 
first  his  life,  then  his  career  as  a  philosopher,  and 
finally  his  works. 

Rousseau’s  prominence  as  a  political  philosopher 
should  not  eclipse  his  eminent  position  as  a  writer. 
A  gifted  stylist,  Rousseau  rightly  ranks  as  one  of  the 
most  revered  authors  in  the  French  literary  tradi¬ 
tion.  His  literary  influence  has  been  of  no  less 
importance  than  his  political  heritage.  Rousseau’s 
daring  Confessions  (published  after  his  death  in 
1782)  made  him  the  inventor  of  the  major  modern 
autobiographical  genre.  In  the  Confessions,  as  well 
as  in  other  personal  narratives,  Rousseau  does  not 
gloss  over  the  minutiae  of  his  character  or  its  flaws, 
thus  reflecting  a  new  interest  in  the  depiction  of  the 
self  without  any  self-aggrandizing  descriptions  of 
one’s  virtue.  In  texts  such  as  his  Reveries  of  the 
Solitary  Walker,  Rousseau  also  set  the  tone  for  the 
early  French  romanticism — something  like  half  a 
century  ahead  of  its  time — where  admiration  for 
nature,  recognition  of  the  power  of  feelings  and 
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emotions  on  moral  behavior,  and  the  taste  for  soli¬ 
tude  and  the  sentiment  of  isolation  predominate. 

Early  Life 

Born  in  1712  in  Geneva,  Switzerland,  to  a  mod¬ 
est  watchmaker  who  had  married  the  daughter 
of  a  pastor,  Rousseau  belonged  neither  to  the 
traditional  aristocracy  nor  to  the  rising  bourgeoi¬ 
sie,  from  which  many  of  the  intellectual  elite 
arose.  Nothing  in  his  early  years  or  his  family 
background  seems  to  have  predestined  him  to 
produce  the  most  influential  works  in  political 
philosophy. 

Rousseau’s  childhood  is  like  that  of  many  oth¬ 
ers  of  his  time,  a  story  of  multiple  separations 
and  loose  family  links:  His  mother  died  soon 
after  his  birth,  he  hardly  knew  his  only  brother, 
and  his  father  had  to  flee  Geneva  when  Jean- 
Jacques  was  10.  Rousseau  was  left  in  the  custody 
of  a  pastor,  who  oversaw  his  formal  education. 
Despite  these  separations,  Rousseau  considered 
that  he  had  a  happy  childhood  until  age  13, 
when  he  became  an  engraver’s  apprentice,  suffer¬ 
ing  the  violence  of  a  particularly  brutal  master. 
Three  years  later,  this  caused  him,  for  fear  of 
being  beaten  once  more,  to  leave  Geneva,  after  he 
had  found  the  doors  of  the  city  closed  because  of 
a  curfew.  Rousseau  began  a  vagabond  lifestyle, 
traveling  to  France  and  Italy,  returning  at  some 
point  to  Switzerland,  all  mostly  by  foot.  For  all 
his  life,  Rousseau  loved  walking,  eventually  giv¬ 
ing  his  last  (unfinished)  book  the  title  Reveries  of 
a  Solitary  Walker. 

In  a  few  years,  Rousseau  had  travelled  across 
many  countries,  eventually  settling  down  in  France. 
He  became  a  Catholic  (converting  back  to  Protes¬ 
tantism  in  1754).  He  discovered  the  minor  provin¬ 
cial  nobility  with  his  first  patroness,  Mme  de 
Warens,  who  oversaw  his  conversion  to  Cathol¬ 
icism.  Until  his  mid-30s,  he  was  employed  in  a 
series  of  positions  of  domestic  service,  from  lackey 
to  embassy  secretary  in  Italy  and  from  steward  (as 
well  as  lover)  to  Mme  de  Warens  to  singer  and 
music  master  or  private  tutor.  All  his  life,  Rousseau 
resented  his  dependence  on  powerful  protectors 
and  made  sure  that  he  could  make  an  independent 
living  with  a  modest  employment  as  music  copyist. 
In  1744,  he  met  Therese  Levasseur,  a  seamstress 
and  chambermaid,  with  whom  he  had  at  least 


three  children,  whom  he  would  abandon.  He  mar¬ 
ried  her  secretly  late  in  his  life. 

Literary  Career 

Rousseau’s  literary  career  started  in  the  1740s, 
when  he  met  Denis  Diderot  and  the  team  of  the 
Encyclopedia  through  one  of  his  patronesses, 
Mme  d’Epinay.  The  Encyclopedia  was  a  collec¬ 
tive  endeavor  to  present  systematically  the  sum 
of  knowledge  of  the  time.  Rousseau  contributed 
several  articles  on  music  and  also  an  important 
text  on  political  economy.  He  was  acquainted 
with  some  of  the  brightest  minds  of  the  time,  the 
philosopher  Etienne  de  Condillac  and  the  math¬ 
ematician  and  philosopher  Jean  le  Rond 
d’Alembert,  and  was  in  communication  with 
Voltaire.  Fame  came  to  him  following  the  publi¬ 
cation  in  1750  of  his  Discourse  on  the  Arts  and 
Sciences ,  which  he  had  submitted  to  the  Academy 
of  Dijon  in  competition  for  a  prize.  His  work 
received  the  award  despite  an  audacious  central 
claim  that  the  development  of  sciences  and  arts 
(in  the  parlance  of  the  time,  technique)  contrib¬ 
uted  not  to  the  development  of  man  but  to  his 
regression.  His  well-crafted  paradox  satisfied 
not  only  the  religious  conservatives,  who  were 
critical  of  the  idea  of  progress  founded  on 
the  development  of  reason  and  science,  but  also 
the  modernists,  who  could  see  in  his  text  a  cri¬ 
tique  of  the  building  of  prejudices  through  mis¬ 
guided  reasoning  and  a  praise  of  natural  reason. 
It  also  attracted  enough  polemics  to  draw  flatter¬ 
ing  attention  to  itself. 

The  First  Discourse,  although  certainly  not 
Rousseau’s  strongest  work,  nevertheless  inaugu¬ 
rated  his  systematic  reflections  on  politics.  The 
second  discourse — What  Is  the  Origin  of  Inequal¬ 
ity  Among  Men,  and  Is  It  Authorized  by  Natural 
Laud — written  in  1754,  clarifies  his  position.  A  radi¬ 
cal  manifesto  in  favor  of  equality  among  men  (and, 
to  a  certain  extent,  women),  the  Second  Discourse 
purports  to  show  that  inequality  is  not  a  social  fact 
but  the  sign  of  corruption  in  society.  Rousseau’s 
position  is  situated,  in  the  wake  of  the  contractar¬ 
ian  theories  that  had  dominated  political  thought 
for  over  a  century,  in  time  with  Samuel  von 
Pufendorf,  Thomas  Hobbes,  and  John  Locke: 
Social  bonds  are  not  natural  but  constructed,  and 
any  rational  explanation  of  politics  has  to  justify 
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precisely  why  it  has  been  built.  But  Rousseau 
added  one  consideration  that  radically  changed  the 
meaning  of  the  social  contract:  Why  should  human 
beings  accept  a  contract  that  is  obviously  to  the 
detriment  of  those  being  oppressed? 

To  answer  this  mystery,  Rousseau  sketches  a  new 
anthropology  of  the  individual,  insisting  on  the  rup¬ 
ture  between  the  state  of  nature  and  civil  society. 
Rousseau  notes  the  inconsistency  of  the  contractar¬ 
ian  theories  before  him,  which  suppose  that  society 
starts  with  individuals  but  assume,  like  Locke,  that 
they  are  tied  by  the  rules  of  natural  law.  He  is 
equally  critical  of  Hobbes,  who  reduces  the  natural 
links  between  men  to  the  “battle  of  all  against  all”: 
Conflict  is  still  a  social  relation.  Only  people  who 
can  relate  to  one  another  can  fear  each  other.  The 
informal  cooperation  imagined  by  Locke,  just  like 
the  perpetual  strife  pictured  by  Hobbes,  already 
presupposes  society  and  so  cannot  provide  a  true 
outlook  on  the  natural  state  of  man  and,  thus,  any 
explanation  of  the  beginnings  of  society. 

Maturity  and  Isolation 

The  Second  Discourse  opens  up  many  major 
themes  for  modern  political  thought.  The  first  is 
the  economic  dimension  of  social  links:  Only  the 
invention  of  property — which  for  Rousseau  is 
not  natural — and  the  necessity  to  protect  it  make 
crucial  the  guarantees  that  society  offers.  Second, 
the  book  is  characterized  by  a  fundamental  pes¬ 
simism  about  the  origins  of  society,  which  could 
be  described  as  a  soft-power  coup  of  the  wealthy 
against  the  poor:  Society  was  established  when 
the  wealthiest  convinced  the  poorest  that  they 
both  had  a  similar  interest  in  becoming  engaged 
in  the  bonds  of  society,  thus  making  the  strongest 
stronger  with  the  help  of  the  weaker.  But  this 
pessimism  is  balanced  with  a  sort  of  optimism 
about  the  very  nature  of  society.  Since  men 
engaged  themselves  in  the  bonds  of  society  freely 
and  equally  to  seek  protection,  any  legitimate 
social  organization  should  respect  this  equality 
and  this  freedom.  Equality  and  liberty  are  not 
solely  natural  rights;  with  Rousseau,  they  have 
also  become  legitimate  political  claims. 

With  the  publication  of  two  major  works 
in  1762,  On  Social  Contract  and  Emile,  Rousseau 
develops  his  thought,  insisting  that  people’s 
sovereignty  derives  from  the  necessity  to  grant 


freedom  and  equality  and  thus  denying  any  rule  by 
divine  law.  In  the  Second  Discourse,  he  directly 
opposes  the  notion  of  original  sin,  or  rather  the 
idea  that  it  was  a  justification  for  inequalities.  In 
On  Social  Contract,  he  refuses  the  justification  of 
domination  in  the  name  of  religion.  Hence,  his 
praise  of  religious  tolerance  and,  in  a  famous  sec¬ 
tion  of  Emile,  the  idea  of  a  natural  religion  that 
would  be  the  heart’s  call  for  faith  versus  the  artifi¬ 
ciality  of  established  religions. 

With  such  claims,  Rousseau  would  confront 
again  the  religious  conservatives  (be  they 
Protestants  or  Catholics).  But  he  also  opposed  the 
“philosophers”  who,  in  a  liberal  tradition,  consid¬ 
ered  after  Montesquieu  that  the  people  could  not 
be  the  direct  source  of  law.  For  Rousseau,  on  the 
contrary,  law  was  solely  the  translation  of  the  will 
of  the  people — the  “general  will.”  There  was  to  be 
a  direct  link  between  the  existence  of  a  people  and 
its  actual  organization  as  a  political  body  express¬ 
ing  itself  in  laws.  Otherwise,  it  was  all  too  clear  to 
Rousseau  that  the  Age  of  Enlightenment  was  not  a 
grand  solution  if  it  meant  continuing  dispossession 
of  the  people  from  their  rights. 

Within  a  few  years,  Rousseau  came  to  be  iso¬ 
lated  (his  illicit  relationship  with  Levasseur  was 
publicly  condemned,  and  he  was  accused  of  aban¬ 
doning  his  children),  while  his  support  in  the 
intellectual  community  grew  thin  as  former  friends 
like  Diderot  turned  into  enemies — the  most  ven¬ 
omous  being  the  powerful  Voltaire — and  hostility 
turned  to  persecution  (Rousseau’s  views  on  reli¬ 
gion  led  to  his  condemnation  by  both  Catholics 
and  Calvinists).  Rousseau  was  threatened  with 
imprisonment  and  had  to  flee  to  England. 
Although  he  returned  to  France  during  his  last 
years,  he  was  a  deeply  unsettled  man  whose  mood 
was  darkened  by  what  he  considered  to  be  plots 
against  him. 

Conclusion 

The  richness  of  Rousseau’s  thought  makes  its  inter¬ 
pretation  difficult.  Countless  disputes  have  arisen 
about  meaning.  For  the  longest  time,  Rousseau 
appeared  as  a  radical  individualist,  belonging  to  a 
certain  tradition  of  liberalism  in  the  context  of  con¬ 
tractarianism.  But  he  also  has  been  reproached  for 
his  insistence  on  the  collective  power  of  the  people, 
to  the  point  of  being  portrayed  as  a  “proto-socialist.” 
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His  influence  on  the  French  Revolution  also  has  been 
discussed  heatedly.  Rousseau,  perhaps  more  than 
any  other  thinker,  encapsulated  in  many  aspects  of 
his  work  the  contradictory  meanings  of  political 
modernity.  For  this  reason,  he  is  probably  best 
understood  as  a  political  philosopher  at  the  cross¬ 
road  of  ideologies  that  would  diverge  only  after  him, 
for  they  hardly  existed  before. 

Thierry  Leterre 
Miami  University  John  E.  Dolibois  Center 

Luxembourg 

See  also  Hobbes,  Thomas;  Individualism;  Locke,  John; 
Natural  Law;  Political  Philosophy;  Republic 

Further  Readings 

Cottret,  M.  (2005).  Jean-Jacques  Rousseau  en  son  temps 
[Jean-Jacques  Rousseau  and  his  times].  Paris:  Perrin. 
Derathe,  R.  (1970).  Jean-Jacques  Rousseau  et  la  science 
politique  de  son  temps  [Jean-Jacques  Rousseau  and 
the  political  science  of  his  times].  Paris:  J.  Vrin. 
Rousseau,  J.  -J.  (1997).  The  social  contract  and  other 
later  political  writings  (V.  Gourevitch,  Ed.  &  Trans.). 
Cambridge,  UK:  Cambridge  University  Press. 

Swenson,  J.  (2000).  On  Jean-Jacques  Rousseau: 
Considered  as  one  of  the  first  authors  of  the 
revolution.  Stanford,  CA:  Stanford  University  Press. 
Wingrove,  E.  R.  (2000).  Rousseau’s  republican  romance. 
Princeton,  NJ:  Princeton  University  Press. 


Rule  of  Law 


The  phrase  rule  of  law  is  commonly  understood  to 
mean  the  method  of  government  of  a  sociopoliti¬ 
cal  system  featuring  the  following  properties: 

1.  Conflicts  among  actors — individual  and 
collective — are  governed  by  means  of  rules  that 
are  impartial  and  equal  for  everyone. 

2.  Rules  are  enacted  by  means  of  a  “limited 
government”  that  wields  its  own  functions 
within  the  confines  of  the  same  rules 
(supremacy  of  rules  over  the  rule  of  men). 

3.  Compliance  with  such  rules  is  rooted  in  a  legal 
and  political  culture  (nomos  as  a  sociolegal 
order). 


This  entry  first  explores  the  meaning  of  the  con¬ 
cept  and  how  it  has  evolved  historically,  as  a  basis 
for  a  comprehensive  definition  of  the  concept.  It 
then  considers  the  relationship  between  the  rule  of 
law,  sovereignty,  and  democracy  and  distinguishes 
between  the  formal  and  substantive/normative 
aspects  of  the  rule  of  law.  The  various  dimensions 
of  the  rule  of  law  as  it  is  realized  in  particular  cul¬ 
tural  contexts  are  then  discussed.  The  entry  then 
examines  empirical  research  on  the  rule  of  law  in 
relation  to  political  action  and  the  implementation 
of  the  rule  of  law,  and  it  also  describes  recent 
research  agendas,  touching  on  the  way  the  rule  of 
law  contributes  to  the  quality  of  democratic  pro¬ 
cesses.  The  entry  concludes  with  a  look  at  direc¬ 
tions  for  future  research,  including  the  way  the 
rule  of  law  is  transformed  by  the  transnationaliza¬ 
tion  of  law  and  politics. 

Historical  Roots  of  the  Concept 

Prima  facie,  the  institutional  ideal  of  the  rule  of 
law  pivots  on  the  meaning  of  impartial  and 
abstract  rules,  on  the  foundations  of  the  nomos 
(order).  The  origin  of  the  concept  of  rule  of  law 
dates  back  to  classical  Greece.  By  emphasizing  the 
importance  of  relations  among  equals,  Plato’s 
Laws  and  Aristotle’s  Politics,  Book  III  elaborate  a 
vision  of  nomos  whose  rules  transcend  the  subjec¬ 
tive  will  and  men’s  protean  proclivities.  A  nomos 
is  the  rule  of  laws,  which  is  in  contrast  with  the 
rule  of  men.  The  relevance  of  cogent  laws  for  the 
entire  political  community  became  one  of  the  pil¬ 
lars  of  Roman  civilization,  within  which  the  law 
( ius )  split  into  ius  gentium  (the  Roman  law  that  is 
in  force  for  other  populations)  and  ius  civilis  (the 
ensemble  of  laws  deriving  from  legal  experts, 
assemblies,  and  emperors’  rule-making  activities). 
Part  of  the  ius  civilis  was  constituted  by  laws  in  the 
strict  sense  of  the  word  or,  more  precisely,  by  those 
laws  adopted  by  assemblies  (lex).  As  Charles 
Mcllwain  (1940)  stressed,  the  difference  between 
the  concepts  of  ius  and  lex  can  be  summarized 
against  this  background  in  classicism,  from  which 
the  term  rule  of  law  inherited  a  semantic  ambigu¬ 
ity,  being  understood  as  both  rule  by  ius  and  rule 
by  lex. 

The  Middle  Ages  grafted  on  this  first  form  of 
ambiguity  (law  vs.  enacted  law)  the  seed  of  a  sec¬ 
ond  element  of  criticality.  In  the  Middle  Ages,  the 
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political  order  was  legitimate  to  the  extent  that  it 
reflected  the  rationale  [ratio)  of  a  transcendent 
order,  the  divine  one.  From  this  viewpoint,  formal 
legitimacy  and  substantive  legitimacy  could  only 
converge.  As  noted  by  James  Blythe  (1992),  a 
political  order,  such  as  the  Holy  Roman  Empire, 
that  emanated  from  divine  investiture  (and  from 
obeying  the  behavioral  precepts  mandated  by  the 
religious  tradition)  had  to  derive  from  that  investi¬ 
ture  the  pledge  of  its  own  “fairness.”  The  appear¬ 
ance  of  modernity  on  the  institutional  scene,  and 
with  it  the  separation  of  the  moral  system  from  the 
political  and  legal  system,  created  an  irreparable 
split  between  the  formal  and  the  normative  under¬ 
standings  of  the  rule  of  law. 

How  could  the  convergence  between  formal 
legitimacy  and  substantive  legitimacy  be  guaran¬ 
teed  within  the  political  order?  A  possible  response 
comes  from  the  Age  of  Enlightenment,  which 
accepted  that  the  rules  of  the  political  order  were 
legitimized  by  their  relationship  with  the  natural 
order  (natural  law).  Nevertheless,  since  the  link 
between  natural  laws  and  government  rules  is  cre¬ 
ated  by  human  activity,  either  by  jurisprudential 
law  or  by  statutory  law,  the  passage  from  the  first 
to  the  second  is  influenced  by  a  human  motiva¬ 
tional  and  value-laden  predisposition.  As  noted  by 
Herbert  Hart  (1961),  the  rule  of  men,  with  their 
different  moral  and  value-laden  positions,  seems  to 
go  back  therefore  to  the  rule  of  rules. 

If  these  questions  seem  to  be  imbued  with  exqui¬ 
sitely  philosophical  considerations,  no  one  can 
deny  their  bearing  on  empirical  research.  What  do 
we  measure  when  we  use  the  concept  of  rule  of 
law?  The  form  of  government?  The  content  of  gov¬ 
erning  rules?  To  what  level  of  reality  does  the  unit 
of  analysis  belong?  To  what  extent  do  recent  insti¬ 
tutional  changes  affecting  both  law  and  politics 
exert  an  influence  on  the  semantics  of  the  concept 
of  rule  of  law?  The  remainder  of  this  entry  presents 
a  critical  overview  of  the  knots  that  were  left  hang¬ 
ing  by  the  secular  debate  around  the  concept  of 
rule  of  law  and  offers  an  overview  of  the  main  find¬ 
ings  from  empirical  research  conducted  in  political 
science.  Historical  and  analytical  arguments  are 
presented  to  delimit  the  meaning  of  the  concept  of 
rule  of  law.  Finally,  a  multidimensional  conception 
of  the  rule  of  law  is  analyzed  in  relation  to  two 
main  approaches  adopted  by  political  scientists:  the 
first  focusing  on  rule  of  law  as  a  precondition  for 


unfolding  political  processes  and  the  second  on 
rule  of  law  as  the  outcome  of  unfolding  political 
processes. 

Given  its  polysemy,  the  definition  of  the  con¬ 
cept  of  rule  of  law  needs  to  be  based  on  its 
historical  foundation  and  on  both  the  consider¬ 
ation  of  organizational  forms  exemplifying  the 
rule  of  law  in  the  history  of  Western  political 
institutions  and  the  philosophical  meanings  that 
this  concept  has  acquired  in  Western  legal  and 
political  thought. 

The  first  meaning  of  rule  of  law  has  its  histori¬ 
cal  roots  in  the  Anglo-Saxon  experience  of  com¬ 
mon  law:  the  ensemble  of  legal  rules  created 
through  the  jurisprudential  elaboration  of  ordi¬ 
nary  courts  within  the  territory  of  the  Kingdom  of 
England  (Albert  Dicey,  1915).  With  regard  to  the 
original  nucleus,  which  remains  bound  up  with  the 
legal  tradition  of  common  law,  as  suggested  by 
Gianfranco  Poggi  (1978),  European  institutional 
experiences  have  induced  a  modification  and 
extension  of  the  meaning  of  the  concept  in  light  of 
the  emergence  of  the  modern  state  during  the  1 6th 
and  17th  centuries. 

With  the  emergence  of  the  state  as  a  center  of 
power,  the  concept  of  rule  of  law  deviated  from 
the  initial  conception,  which  had  been  grounded  in 
common  law.  The  first  deviation  arose  out  of  the 
new  vision  of  the  Recbtsstaat — the  state  as  a  ratio¬ 
nal  organization  of  society  originating  from  the 
foundational  act  of  a  sovereign  entity,  to  which  the 
legal  system  should  conform.  The  second  deviation 
draws  attention  to  the  correspondence  between 
legal  rules  and  the  orientation  of  the  popular  will 
as  expressed  by  the  legislative  body — the  parlia¬ 
ment — first  pointed  out  by  Emmanuel  Sieyes  in 
1789.  The  postrevolutionary  French  experience 
advanced  the  idea  of  etat  de  droit  (rule  of  law)  and 
the  belief  that  “popular  sovereignty”  is  the  source 
of  the  legitimization  of  the  rule  of  rules. 

Only  vaguely  does  this  look  like  the  United 
States’  formulation  of  constitutionalism.  Across  the 
Atlantic,  the  creation  of  a  republican  form  of  gov¬ 
ernment  centered  on  political  and  institutional  lib¬ 
eral-constitutional  principles,  introducing  the  idea 
of  a  “rule  of  rules”  based  on  the  supremacy  of 
“constitutional  rule,”  as  pointed  out  by  John  Ely 
(1980).  This  is  conceived  in  contraposition  to  the 
rule  of  men,  as  stated  in  The  Federalist  Papers , 
where  “Publius”  (the  public)  justifies  the  need  for 
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limited  government  by  referring  to  the  intrinsic  fal¬ 
libility  of  human  nature: 

If  men  were  angels,  no  government  would  be 
necessary.  If  angels  were  to  govern  men,  neither 
external  nor  internal  controls  on  government 
would  be  necessary.  In  framing  a  government 
which  is  to  be  administered  by  men  over  men,  the 
great  difficulty  lies  in  this:  you  must  first  enable 
the  government  to  control  the  governed;  and  in 
the  next  place  oblige  it  to  control  itself.  A 
dependence  on  the  people  is,  no  doubt,  the 
primary  control  on  the  government;  but 
experience  has  taught  mankind  the  necessity  of 
auxiliary  precautions.  (Hamilton,  Jay,  & 
Madison,  1787-1788,  No.  51) 

If  in  postrevolutionary  France  the  supremacy  of 
the  rule  of  rules  is  to  be  understood  as  supremacy 
of  the  legislative  rule  (of  the  law),  American  consti¬ 
tutionalism  pivots  on  the  supremacy  of  the  consti¬ 
tutional  rule  and  the  Bill  of  Rights.  Alexis  de 
Tocqueville  saw  this  difference  as  rooted  in  the  fact 
that  the  two  constitutional  orders  have  different 
historical  origins.  Only  the  French  constitution 
was  born  out  of  a  breaking  point  with  the  ancien 
regime,  which  implies  the  search  for  a  foundational 
act — namely,  a  constitutive  moment  of  the  state. 
This  is  the  fundamental  distinction  for  understand¬ 
ing  the  difference  between  the  ways  in  which  the 
French  and  American  democracies  work. 

During  the  20th  century,  the  historical  experi¬ 
ences  of  European  totalitarian  regimes  highlight  a 
critical  aspect  of  the  continental  version  of  the  rule 
of  rules.  When  by  “rule  of  rules”  one  means  rule 
of  laws — that  is,  of  legislative  rules — one  can  run 
into  the  following  perverse  effect.  Laws  can  com¬ 
ply  with  all  formal  criteria  of  legality  and  proce¬ 
dural  correctness  and  yet  can  undermine  individual 
rights  on  the  substantive  plane.  In  extreme  cases, 
they  can  (as  it  happened  in  totalitarian  systems) 
transform  themselves  into  instruments  of  domina¬ 
tion  and  violence.  The  debate  on  Gustav  Radbruch’s 
dilemma  of  unfair  law  is  the  apogee  of  a  cultural 
and  legal  process  that  puts  emphasis  on  the  sub¬ 
stantive  aspects  of  the  rule  of  rules,  especially  if 
they  exemplify  themselves  as  the  etat  de  droit  and 
the  Rechtsstaat. 

As  Ronald  Dworkin  (1978)  points  out,  this  is 
the  start  of  the  constitutionalism  of  rights,  of  the 


constitutional  charters  and  catalogs  of  fundamen¬ 
tal  rights  built  into  constitutions.  The  constitution, 
and  to  a  larger  degree  the  activity  of  constitutional 
courts  in  charge  of  interpreting  the  constitution 
and  controlling  the  constitutionality  of  laws, 
become  instruments  used  to  anchor  democratic 
political  regimes  and  ensure  the  legitimacy  of 
legislative  norms  (valued  not  only  in  terms  of  pro¬ 
cedural  correctness  but  also  for  substantive  coher¬ 
ence  with  a  superior  normative  order,  that  of 
fundamental  rights).  The  introduction  of  the  require¬ 
ment  of  a  qualified  majority  or  referendum  to 
modify  the  constitution  formally  ratifies  the  assur¬ 
ance  or  the  persistence  of  such  an  anchorage. 

Studies  conducted  on  the  different  forms  that 
the  democratization  process  has  taken  in  areas  of 
Latin  America  and  Central  and  Eastern  Europe 
emphasize  the  role  of  the  rule  of  law  as  capable  of 
granting  stability  for  democratic  transition.  Juan  J. 
Linz  and  Alfred  Stepan  (1996)  maintain  that  the 
rule  of  law  is  a  necessary  condition  for  the  good 
functioning  of  those  dimensions  or  “arenas”  whose 
development  is  required  to  construct  a  democratic 
regime.  In  particular,  noncompliance  with  the  rule 
of  law  forecloses  any  possibility  of  developing  an 
independent  civil  society  and  a  pluralist  and  com¬ 
petitive  political  system. 

Since  the  1990s,  the  contribution  of  political 
scientists  has  been  enriched  by  experiences  and 
reflections  drawn  from  policies  intended  to  pro¬ 
mote  the  rule  of  law.  As  Daniela  Piana  (2010)  has 
noted,  it  is  precisely  because  of  their  orientation 
toward  crafting  interventions  for  countries  in  the 
democratization  stage  that  such  policies  have  con¬ 
tributed  to  identifying  the  institutions  and  capaci¬ 
ties  necessary  for  the  emergence  and  maintenance 
of  the  rule  of  law.  In  many  cases,  the  same  mean¬ 
ing  of  the  concept  has  been  operationalized  to 
comprise  the  ensemble  of  institutions  to  which  the 
promotional  politics  of  the  rule  of  law  point. 

In  general,  these  contributions  draw  attention 
to  the  political-institutional  dimension  of  the  rule 
of  law.  In  a  different  way,  Russell  Hardin  (1999) 
interprets  the  rule  of  law  as  a  rule  of  rules  that 
spontaneously  emerges  to  solve  problems  of  social 
coordination  where  conditions  of  pluralism  of  val¬ 
ues  and  preferences  exist.  According  to  Hardin, 
the  concept  of  rule  of  law  denotes  those  systems  of 
collective  action  where  power  asymmetry  exists. 
He  thus  puts  emphasis  on  the  importance  of  the 
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social  fabric  that  underlies  institutions,  on  a  par 
with  what  political  science  studies  on  social  capital 
have  pointed  out. 

History  teaches  us  that  the  institutional  configu¬ 
rations  adopted  to  implement  the  principle  of  the 
rule  of  rules  can  vary  according  to  the  cultural 
context,  although  they  all  tend  to  vouch  for  the 
legality  and  legitimacy  of  political  power.  For  this 
reason,  it  is  possible  to  identify  a  common  thread 
among  the  different  exemplifications  of  the  con¬ 
cept  of  rule  of  law,  referring  to  the  way  in  which 
the  form  of  government  ensures  that  rules  (laws) 
do  not  become  a  mere  instrument  of  domination. 
It  is  in  this  vein  that,  as  noted  by  Gianluigi 
Palombella  (2010),  the  history  of  liberal  constitu¬ 
tionalism  gives  flesh  to  an  institutional  ideal,  the 
rule  of  law,  whose  meaning  is  determined  in  rela¬ 
tion  to  the  nature  (source,  form,  and  content)  of 
government  rules.  The  organizational  form  of 
power — the  form  of  government  that  substantiates 
an  architecture  where  powers  are  separate  or  are 
mutually  accountable — cannot  but  operate  by  vir¬ 
tue  of  the  institutional  ideal  of  the  rule  of  law, 
never  being  the  rule  of  law  itself. 

Rule  of  Law,  the  Sovereign 
State,  and  Democracy 

As  the  concept  of  the  rule  of  law  has  evolved,  it  has 
become  a  polysemic  one,  with  multiple  meanings 
that  allow  its  application  in  a  variety  of  contexts. 
However,  this  polysemy  also  makes  it  challenging 
to  conduct  empirical  research  on  the  rule  of  law. 
Despite  the  internal  variability  of  the  concept,  it  is 
useful  in  such  empirical  research  to  maintain  a 
distinction  between  the  rule  of  law  and  other 
aspects  of  modern  liberal  political  systems,  such  as 
state  and  democracy.  Although  there  is  a  strict 
relation  between  these  two  concepts  and  the  rule 
of  law,  the  relation  cannot  be  reduced  to  a  relation 
of  either  causal  necessity  or  synonymy. 

Rule  of  Law  and  the  Sovereign  State 

With  the  concept  of  the  sovereign  state,  the  rule 
of  law  entertains  a  relation  of  semantic  inclusivity. 
In  other  words,  the  rule  of  rules,  under  specific 
historical  and  cultural  conditions,  has  become,  as 
Joseph  Raz  (1996)  notes,  the  rule  of  statutory  laws 
and  regulations,  both  expressions  of  the  modern 


state.  Conceived  as  the  state  of  law,  the  rule  of  law 
prompts  empirical  research  focused  on  the  formal 
correctness  with  which  the  law  is  applied.  The 
state  of  law  contributes  to  legitimize  the  use  of 
power  because  it  ensures  the  following: 

•  Formal  legitimacy:  Actions  are  predicated  on 
formalized  rules  that  are  clear,  ratified,  and 
neutral  in  terms  of  their  values. 

•  Consistency  over  time:  The  actions  of  centralized 
decision-making  bodies  conform  to  such  rules, 
making  possible  the  orderly  operation  of  the 
state. 

In  this  respect,  the  presence  of  the  state  of  law, 
understood  as  a  form  of  government  pivoting  on 
the  logic  of  synoptic  rationality,  based  on  written 
law,  and  codified  in  a  systematic  and  coherent 
fashion  is  a  condition  of  democracy  as  we  conceive 
it  today.  However,  this  does  not  exhaust  the  mean¬ 
ings  of  the  concept  of  rule  of  law.  If  the  rule  of  law 
were  to  be  defined  as  the  rule  of  rules,  not  of  men, 
then  the  presence  of  the  state  conceived  as  a  mod¬ 
ern  state — much  less  as  a  nation-state — is  not  a 
necessary  dimension  of  the  rule  of  rules.  It  becomes, 
though,  a  necessary  part  of  it  if  the  concept  of 
“rules”  is  intended  to  refer  to  “statutory  laws.” 

This  point  is  important  for  understanding  the 
process  of  adoption  and  implementation  of  the 
principles  of  the  rule  of  law  in  cultural  contexts 
where  the  formation  of  the  state  has  not  taken 
place.  As  noted  by  Leonardo  Morlino  (2009), 
hybrid  regimes  exist  in  which  some  embryonic 
forms  of  rule  of  law  mingle  with  a  territorial 
administration  that  lacks  some  fundamental  fea¬ 
tures  of  the  state  of  law,  the  proceduralization  of 
the  application  of  the  law.  In  this  regard,  it  is 
appropriate  that  the  two  concepts  of  rule  of  law 
and  of  the  state  are  kept  separate,  since  it  is  pos¬ 
sible  that  there  is  a  state  without  rule  of  law,  and 
in  some  case  even  vice  versa,  or  that  the  second 
appears  as  a  mechanism  capable  of  applying  laws 
in  a  formally  correct  way  without,  however,  ensur¬ 
ing  that  such  laws  comply  with  citizens’  funda¬ 
mental  rights. 

To  conduct  empirical  research,  one  needs  to 
observe  how  the  machinery  of  the  state  embodies 
legality  in  its  legislative  and  regulative  law-making 
processes.  As  stressed  by  Poggi  (1978),  since  the 
state  is  a  bureaucratic  legal  order,  founded  on  laws 
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that  define  and  ratify — in  a  constitutive  fashion — 
the  spheres  of  action  of  all  the  areas  of  the  public 
machine,  from  the  individual  (public  sector 
employees)  to  the  collective  (the  branches  and 
departments  of  the  public  administration),  it  is  in 
addition  appropriate  to  study  how  far  the  obedi¬ 
ence  of  legality  is  not  only  an  objective  property  of 
rules  (laws)  but  also  a  cognitive  feature  of  public 
sector  actors  (how  far  the  obedience  of  legality  is  a 
guiding  principle  of  public  action). 

Rule  of  Law  and  Democracy 

The  rule  of  law  does  not  necessarily  imply  a 
democratic  form  of  government.  This  theoretical 
premise  has  influenced  a  large  part  of  the  Western 
literature  on  the  rule  of  law  and  on  democracy.  As 
noted  earlier,  the  meaning  of  the  rule  of  law  does 
not  emerge  historically  in  relation  to  the  construc¬ 
tion  of  a  democratic  regime.  To  be  sure,  the  rule  of 
law  requires  merely  that  government  functioning 
obey  some  fundamental  constraints,  represented 
by  rules  that,  even  in  a  monarchy,  first  the  sover¬ 
eign  and  next  the  executive  must  follow.  The  intro¬ 
duction  of  democracy  in  sociopolitical  systems 
where  the  rule  of  law  is  in  force  happens  in  the 
second  instance,  after  the  consolidation  of  the 
modern  state  and  the  party  function  of  political 
representation. 

The  union  between  the  rule  of  law  and  democ¬ 
racy  brings  with  it  an  intrinsic  tension.  If,  as  a 
matter  of  fact,  the  rule  of  law  provides  for  rules 
that  constrain  political  power,  democracy  makes  it 
possible  that  through  lawmaking  one  can  wield 
political  power,  the  latter  made  legitimate  thanks 
to  its  stemming  from  democratic  consensus. 
Therefore,  between  the  two  forces,  one  that  limits 
government  via  rules  and  the  other  empowering  a 
democratic  government  by  means  of  rules,  it  is 
necessary  to  find  a  point  of  equilibrium.  The 
balancing  of  the  democratic  principle  and  the  con¬ 
stitutional  principle  takes  on  different  forms  accord¬ 
ing  to  which  countries  one  takes  into  account. 
Broadly  speaking,  Robert  Dahl’s  (1971)  position 
should  apply.  Starting  from  a  minimum  concept  of 
democracy,  Dahl  speaks  of  obedience  to  the  rule  of 
law,  provided  that  polyarchies  are  characterized  by 
conditions  of  equal  access  to  public  offices,  respect 
for  individual  freedom,  and  valid  rules  erga  omnes 
(for  all).  Yet  this  condition  is  necessary  but  not 


sufficient  to  ensure  a  fair  union  between  the  rule  of 
law  and  democracy.  Indeed,  Dahl  is  concerned 
with  the  rule  of  law  in  its  formal  meaning.  The 
existence  of  formal  legality,  as  Dahl  describes  it, 
does  not  guarantee  the  implementation  of  the  rule 
of  law.  Formal  legality  leaves  open  the  relationship 
between  the  value  system  of  legislative  majorities 
and  the  value  system  of  a  normative  order  regard¬ 
less  of  the  majority’s  will.  The  rule  of  law  in  a 
system  that  is  formally  a  democracy  does  not 
determine  whether  such  a  system  recognizes  and 
protects  the  fundamental  rights  of  citizens;  thus, 
we  need  to  distinguish  between  the  broad  and  nar¬ 
row  senses  of  the  rule  of  law — the  substantive 
versus  the  formal  or  procedural. 

Adam  Przeworski  (1991),  who  does  not  dismiss 
a  purely  formal  vision,  claims  that  the  rule  of  law 
clears  up  two  sets  of  problems:  the  “Madisonian” 
dilemma,  which  is  the  need  to  set  limits  to  the 
potential  capacity  of  the  state  to  overwhelm  citi¬ 
zens’  rights,  and  the  “Hobbesian”  dilemma, 
namely,  the  need  to  protect  individual  rights  from 
the  potentially  predatory  actions  of  other  private 
actors.  He  identifies  two  dimensions  of  the  rule  of 
law:  One  pertaining  to  freedom,  which  has  to  do 
with  the  capacity  of  the  state  to  protect  citizens’ 
rights,  and  the  other  pertaining  to  safety,  which 
has  to  do  with  the  capacity  of  the  state  to  protect 
citizens  from  other  citizens.  While  to  perform  the 
first  function  one  needs  mechanisms  that  constrain 
the  public  hand,  the  execution  of  the  second 
requires  mechanisms  that  make  the  public  hand 
sufficiently  powerful  and  effective  to  maintain 
social  control.  Such  conditions  define  the  mini¬ 
mum  requirements  to  say  that  a  democracy  is  sub¬ 
ject  to  the  rule  of  law. 

If,  however,  one  wants  to  broaden  these  condi¬ 
tions  and  integrate  the  substantive  dimensions  of 
the  rule  of  law,  the  systems  of  democratic  gover¬ 
nance  should  give  rise  to  an  implementation  of 
rules  so  that  the  fundamental  rights  can  be  fulfilled 
in  a  comprehensive  and  enduring  fashion.  In  this 
regard,  rule  of  law  and  democratic  principles — 
that  is,  the  legitimacy  of  rules  that  constrain  the 
power  and  legitimacy  of  the  objectives  pursued 
through  the  power  of  the  majority — must  there¬ 
fore  be  made  compatible  through  a  vision  of  the 
constitution  that  is  not  an  ex  ante  given  (guaran¬ 
teeing  the  formal  conditions  of  the  democratic 
game)  but  that  stems  from  the  same  democratic 
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game  ex  post.  Concretely,  each  Western  liberal 
democracy  has  configured  itself  along  an  axis 
where  the  different  forms  with  which  history  has 
changed  the  principle  of  rule  of  law  and  the 
democratic  principle  have  been  located.  Yet  since 
the  beginning  of  the  20th  century,  the  majority  of 
contemporary  democracies  have  included  such 
principles  in  their  constitutional  charters.  T ogether 
with  the  rules  that  define  the  democratic  institu¬ 
tional  system,  these  constitutions  ratify  even  lists 
of  fundamental  rights  unavailable  to  political 
majorities.  Such  constitutional  principles  entail 
that  parliamentary  will,  even  when  pursuing  col¬ 
lective  objectives  legitimized  by  majority  consen¬ 
sus,  cannot  compromise  the  fundamental  rights 
of  citizens.  The  constitution  defines  at  the  start  of 
political  processes  those  areas  where  ordinary 
politics  can  be  legitimately  practiced  according  to 
democratic  practices  and  in  compliance  with  fun¬ 
damental  rights.  These  are  established  and 
arranged  once  and  for  all  in  the  constitutive  rules 
of  political  order.  In  this  regard,  a  constitutional 
democracy  sets  forth  the  form  the  government 
must  take,  the  rights  guaranteed  to  citizens,  and 
the  procedures  by  which  such  rights  are  imple¬ 
mented. 

In  addition,  written  constitutions  provide  for 
institutional  devices  with  which  it  is  possible  to 
protect  the  constitution  from  both  attacks  by  par¬ 
liamentary  majorities  and  possible  drifts  of  execu¬ 
tive  power.  Among  these,  the  most  common,  as 
noted  by  Mauro  Cappelletti  (1989),  is  the  inspec¬ 
tion  of  constitutionality  that  constitutional  courts 
can  perform  against  primary  and  secondary  legis¬ 
lation  and  against  the  norms  of  international  law. 
The  growth  of  politics  aimed  at  safeguarding  one’s 
rights  is  an  additional  feature  of  the  Western 
democracies  that  emerged  in  the  second  part  of  the 
20th  century  as  a  reaction  to  the  totalitarian  and 
authoritarian  drifts  that  prevailed  over  the  demo¬ 
cratic  majorities  at  the  beginning  of  the  century. 
The  history  of  the  20th  century  revealed  that  the 
constitution  is  not  only  an  initial  given  of  the 
political  process.  Rather,  it  should  be  conceived  as 
the  aggregate  effect  of  the  interaction  between  the 
demand  for  and  the  supply  of  justice,  such  interac¬ 
tion  taking  place  not  only  through  the  dialogue 
between  the  judiciary  and  the  legislative  but  also 
through  the  dialogue  between  jurisdictions  and 
individuals  or  social  groups. 


Dimensions,  Units,  and  Levels 
of  the  Rule  of  Law 

Beyond  the  different  forms  of  instantiation  of  the 
rule  of  law,  the  concept  of  rule  of  law  can  also 
refer — at  a  more  abstract  level — to  a  system  where 
the  rule  of  rules  is  in  force,  intended  as  an  institu¬ 
tional  ideal  available  in  different  organizational 
forms  (administrative-legal  structure  and  govern¬ 
ment  standard  procedures)  according  to  the  source, 
form,  and  content  that  rules  gain  (jurisprudential, 
legislative,  and  administrative-regulative).  Concept 
formation  should  then  follow  the  path  of  all  mul¬ 
tidimensional  concepts,  whose  advantage  stems 
from  being  able  to  incorporate  into  their  semantics 
different  aspects  related  by  juxtaposition — 
conjunction — rather  than  by  disjunction.  The  rule 
of  law  is  many  things  together,  all  of  them  com¬ 
bined  into  a  real  instantiation  in  different  patterns. 

The  extension  of  the  rule  of  law  is,  thus,  com¬ 
posed  of  a  combination  of  five  dimensions: 

1.  Civil  order:  a  focus  on  the  right  to  life,  freedom 
from  fear  and  torture,  personal  security,  and 
right  to  own  property,  guaranteed  and  protected 
throughout  the  country;  low  rates  of  crime  and 
social  conflict;  a  persistent  and  durable  culture 
of  legality 

2.  Independent  judiciary  and  a  modern  justice 
system:  a  focus  on  mechanisms  establishing  an 
independent,  professional,  and  efficient  judiciary 
system  that  allows  equal  access  to  justice,  free 
of  undue  pressures  and  with  enforcement  of 
decisions 

3.  Capacity  of  the  sociopolitical  to  formulate, 
implement,  and  enforce  the  law:  a  focus  on  a 
governance  system  (legislative  and  executive 
branches)  capable  of  ensuring  the  production  of 
high-quality  legislation  and  its  implementation 
throughout  the  country;  a  transparent  policy¬ 
making  process  allowing  for  the  participation  of 
the  civil  society;  and  the  presence  of  a 
professional,  neutral,  accountable,  and  efficient 
state  bureaucracy 

4.  Low  if  not  nonexistent  rate  of  corruption, 
illegality,  and  abuse  of  power  by  state  agencies: 
a  focus  on  the  internalization  of  the  law  by 
public  officials  and  social  actors;  the  existence 
of  self-constraint  mechanisms  of  public  ethics 
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and,  in  case  these  mechanisms  are  ineffective, 
effective  policies  for  fighting  corruption  and 
abuses  of  power 

5.  Security  forces  that  are  respectful  of  citizens’ 
rights  and  are  under  civilian  control:  a  focus  on 
the  mechanisms  of  civilian  control  over  security 
forces  as  well  as  on  efficient,  uncorrupted,  and 
disciplined  police  forces  respectful  of  human 
and  political  rights 

This  strategy  enables  research  to  encompass  dif¬ 
ferent  ideal  types  or  forms  of  government,  pro¬ 
vided  that  the  nature  of  rules  whose  supremacy  is 
suggested  by  the  phrase  rule  of  law  is  specified. 
The  concept  of  rule  of  law  here  takes  on  a  precise 
meaning  and  can  be  operationalized  once  the  ori¬ 
gin  of  jurisprudential,  legal,  and  administrative- 
regulative  rules  has  been  determined.  The  term 
rules  is  meant  in  a  broad  sense,  not  restricted  to 
positive  law.  Rules  consist  of  jurisprudence,  doc¬ 
trine,  and  the  interpretative  principles  that  judges 
of  different  rank  and  jurisdiction  refer  to,  as  well 
as  the  social  rules  that  stem  from  institutional  tra¬ 
dition  and  are  implicit  in  the  very  fabric  of  political 
community.  This  is  a  fundamental  point.  It  emerges 
as  a  “thick”  formulation  of  the  rule  of  law  that 
redefines  what  has  been  stated  by  scholars  of 
democracy  and  in  particular  by  authors  such  as 
Guillermo  O’Donnell  (2004),  who  have  high¬ 
lighted  that  the  legality  and  formal  correctness  of 
rules  (the  “thin”  vision)  are  not  sufficient  to  ensure 
that  a  country  is  subjected  to  the  rule  of  rules. 

If  one  narrows  the  field  of  observation  to 
democracies,  the  rule  of  law  is  not  only  a  necessary 
condition  for  achieving  a  high  degree  of  quality  of 
democratic  governance  but  also  the  effect  of  the 
interaction  among  different  factors,  including  both 
social  and  political  aspects.  This  distinction  does  not 
entirely  cover  the  previously  mentioned  dichotomy 
between  formal  and  substantial  rule  of  law — 
which  belongs  more  properly  to  the  realm  of  nor¬ 
mative  theories — nor  can  it  be  considered  as  a  way 
to  reword  the  relationship  between  rule  of  law  as 
an  end  and  rule  of  law  as  a  means  to  an  end.  Both 
these  second  distinctions  belong  as  a  matter  of  fact 
to  a  normative,  prescriptive  discourse,  whereas  the 
distinction  between  rule  of  law  as  a  precondition 
and  rule  of  law  as  an  outcome  is  phrased  in  terms 
of  the  empirical  analysis  of  a  democracy. 


This  double  vision  finds  its  own  reference  even 
within  the  contemporary  debate  developed  by  the 
supporters  of  the  rule  of  law,  namely,  law  or  admin¬ 
istrative  science  experts,  who  together  with  policy¬ 
makers,  diplomats,  and  bureaucrats  are  committed 
to  the  diffusion  of  the  principles  of  the  rule  of  law 
in  developing  countries  or  those  countries  in  transi¬ 
tion  toward  democracy.  According  to  Rachel  Belton 
(2005),  all  these  options  fall  within  two  broad  con¬ 
cepts,  which  are  not  necessarily  in  opposition  to 
each  other:  the  first  focused  on  the  rule  of  law  as  an 
end  of  the  political  order  and  the  second  on  the  rule 
of  law  as  an  instrument  of  the  political  order.  The 
rule  of  law  as  an  end  in  itself  means  limited  govern¬ 
ment  and  a  guarantee  of  fundamental  rights.  This  is 
the  definition  of  constitutionalism.  According  to 
Morlino  (2009),  the  rule  of  law  as  an  instrument 
means  an  independent  legal  system,  a  transparent 
and  efficient  public  administration,  and  an  ensem¬ 
ble  of  police  forces  and  law  enforcement  agencies 
subject  to  civil  control.  However,  the  distinction 
between  rule  of  law  as  a  means  and  rule  of  law  as 
an  end  does  not  provide  any  profitable  way  to  con¬ 
duct  empirical  research.  Indeed,  it  presupposes  by 
petitio  principii  that  actors  who  comply  with  the 
rules  that  underlie  the  very  basis  of  the  institutions 
to  which  the  five  dimensions  listed  above  refer — 
civil  rights,  the  judicial  system,  the  administrative 
system,  military  force,  and  the  police — are  pursuing 
an  end  or  are  using  their  compliance  as  a  means  to 
a  different  end.  One  may  argue  instead  against  such 
a  teleological  view  of  the  rule  of  law.  Rule  of  law 
has  been  unveiled  by  empirical  research  as  a  com¬ 
plex  process  imbued  within  political  institutions, 
whose  direction  and  whose  main  rationale  cannot 
be  grasped  simply  on  the  basis  of  the  means-end 
dichotomy.  Rather,  it  may  be  of  some  interest  to 
distinguish  between  strategies  of  research  and 
empirical  analysis,  being  fully  aware  that  reality  is 
ontologically  unified  and  this  distinction  serves  only 
human  cognitive  goals. 

Empirical  Research  on  the  Rule  of  Law 

Rule  of  Law  as  a  Source  of  Opportunities 
for  Political  Action 

This  scholarship  is  framed  in  a  neo-institutionalist 
approach  and  ultimately  restored  to  the  social  sci¬ 
ences  the  primacy  of  agency  as  the  source  of  social 
processes.  Law  as  a  condition  of  action  basically 
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entails  that  research  should  focus  on  the  way 
actors  interact  under  the  conditions  set  down 
by  the  law  and  on  the  way  actors  take  advantage 
of  the  existence  of  laws  and  institutions  believed 
to  instantiate  the  principle  of  the  rule  of  law — as, 
for  instance,  the  judicial  branch.  Two  types  of 
actors  have  been  taken  into  account:  (1)  institu¬ 
tional  actors,  in  particular  with  regard  to  the  use 
of  the  constitutional  courts  as  clearinghouses  for 
institutional  conflicts,  and  (2)  social  actors,  in 
particular  with  regard  to  legal  mobilization. 

When  law  is  thought  of  as  an  opportunity  to 
apply  pressure,  the  organizational  and  cognitive 
resources  handled  by  social  and  political  actors 
become  of  utmost  importance  in  transforming  an 
opportunity  for  political  pressure  into  an  effective 
instrument  for  the  exercise  of  power.  Non¬ 
governmental  organizations  and  citizens  associa¬ 
tions  that  can  take  cases  before  the  courts  and 
make  their  actions  visible  by  any  means  may  bring 
about  a  major  change  in  the  legal  and  the  constitu¬ 
tional  system  of  a  country,  achieved  through  pres¬ 
sure  from  below. 

A  large  body  of  scholarship  has  been  developed 
over  the  past  decade  in  the  field  of  European  studies 
by  devoting  energies  to  cast  light  on  the  use  of 
European  Community  law  to  make  policies  or  to 
enact  new  ways  of  enforcing  traditional  rights.  The 
so-called  supranational  rule  of  law  that  has  been 
identified  in  European  legal  norms  and  judicial  insti¬ 
tutions — for  instance,  the  European  court  of  justice 
and  the  European  court  of  human  rights — has  been 
thought  to  be  a  promising  arena  that  actors  endowed 
with  resources  and  capacities  can  enter  to  push  for¬ 
ward  their  own  interests  and  values.  European 
scholars  are  vigorously  debating  the  type  of  rule  of 
law  used  by  the  European  Union  (EU)  (in  the  con¬ 
text  of  the  constitutional  debate  developed  over  the 
past  years).  As  a  matter  of  fact,  the  EU  seems  to  be 
more  intelligible  as  a  rule  of  law  in  process — rule  of 
law  as  an  outcome — than  as  a  rule  of  law  as  a  pre¬ 
condition  for  democratic  processes.  To  phrase  it 
differently,  European  constitutionalism,  thought  of 
as  a  complex  and  multilayered  system  of  rights 
enforcement,  is  not  given  ex  ante  but  instead  is  made 
up  as  a  political,  social,  and  cultural  construction. 

Rule  of  Law  as  Outcome  of  Political  Processes 

The  rule  of  law  may  be  thought  as  a  social  sys¬ 
tem  in  which  interactions,  both  social  and  political, 


are  ruled  and  shaped  according  to  fundamental, 
impartial,  stable,  and  predictable  rules,  some  of 
them  of  longue  duree  (long  term).  Rule  of  law  as 
outcome  is  highlighted  by  nonformalist  and  non¬ 
positivist  scholars  such  as  Hardin  (1999)  and 
Morlino  (2009),  who  have  devoted  time  and 
energy  to  restore  to  the  field  of  research  the  study 
of  how  rules  are  implemented. 

Some  research  has  been  developed  to  bring 
political  processes  to  light,  both  at  the  macrolevel 
and  at  the  mesolevel.  A  major  point  related  to  the 
study  of  the  rule  of  law  as  an  outcome  impinges  on 
the  capacity  of  a  society  for  settling  disputes  on  the 
basis  of  the  law.  An  extensive  scholarship  has 
revealed  the  importance  of  cultural  and  cognitive 
conditions,  which  exist  alongside,  and  sometimes 
even  in  opposition  to,  structural  conditions. 

These  studies  have  observed  that  the  legitimacy 
of  legal  rules  and  political  institutions  is  key  to 
maintaining,  in  the  long  run,  the  rule  of  law.  Civic 
engagement  and  a  civic  culture  have  been  pointed 
out  by  scholars  working  on  political  culture  and 
related  matters  as  a  cornerstone  of  a  well-functioning 
democracy  and  ipso  facto  of  a  system  based  on  the 
rule  of  law.  The  argument  put  forth  about  the  posi¬ 
tive  correlation  that  should  exist  between  civic  cul¬ 
ture  and  the  rule  of  law  is  as  follows:  Civic  culture 
helps  maintain  a  low  level  of  social  conflict,  and  as 
a  consequence,  it  reduces  indirectly  the  need  to  use 
rules  to  foster  social  behaviors  based  on  common 
principles  of  peaceful  social  life. 

Empirical  research  has  recently  pointed  out  the 
role  played  by  factors  that  stand  outside  the  politi¬ 
cal  system,  for  instance,  the  existence  of  the  supra¬ 
national  networks  in  which  standards  of  rule  of 
law  are  set  down,  promoted,  and  disseminated 
internationally.  In  a  way,  this  fairly  new  phenom¬ 
enon  is  associated  with  the  increasing  importance 
of  the  interplay  of  domestic  and  supranational  fac¬ 
tors  in  making  the  rule  of  law  a  multilayered  con¬ 
figuration  of  rules  and  principles  rather  than  the 
epiphenomenon  of  a  hierarchical  system  of  rules 
enacted  exclusively  by  domestic  actors.  Interna¬ 
tional  scholars  and  comparative  political  scholars 
have  joined  in  exploring  the  mechanisms  that  work 
in  the  interstices  of  the  two-level  games  played  by 
domestic  and  supranational  institutions  nowadays. 
The  result  has  been  a  spectacular  growth  of  nor¬ 
mative  production,  pursuing  the  implementation 
of  the  rule  of  law  that  transcends  the  usual  tradi¬ 
tional,  domestic,  judicial,  and  legislative  arenas. 
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Empirical  Research  on  Democratic  Quality 

Complexity  and  polysemy  can  be  handled  effec¬ 
tively  if  the  rule  of  law  is  meant  to  be  a  multidi¬ 
mensional,  conceptual  tool  covering  different 
aspects  of  sociopolitical  systems.  This  approach 
appears  promising  when  framed  in  a  broader  con¬ 
text,  as  the  empirical  analysis  of  the  quality  of 
democracy.  Democratic-quality  assessment  appears 
today  to  be  the  contemporary  way  of  phrasing  an 
old  and  perennial  question  cutting  across  all 
Western  political  experiences,  namely,  the  search 
for  good  government.  Democratic  assets  address 
the  issue  of  expanding  the  spectrum  of  political 
participation,  injecting  the  principle  of  political 
competition  under  conditions  of  formal  equality 
and  given  the  primacy  of  the  law.  However, 
although  formal  designs  may  perfectly  satisfy 
abstract  criteria,  in  practice  democratic  institutions 
work  differently  in  different  countries  and  may 
deliver  different  types  of  services  and  goods,  result¬ 
ing  in  differing  levels  of  citizen  satisfaction  despite 
being  designed  similarly.  In  brief,  what  matters  in 
reality  is  how  democracy  works  and  what  puts 
into  motion  the  formal  institutional  designs.  This 
point  is  fully  accepted  by  the  comprehensive  and 
empirically  oriented  approach  to  democratic  qual¬ 
ity  that  has  recently  been  developed  by  Morlino 
(2004).  Following  this  approach,  one  refers  to  the 
qualities  of  a  democracy.  These  qualities  are 
grouped  into  three  types: 

1.  qualities  associated  with  the  procedures  of  the 
political  system, 

2.  qualities  associated  with  the  content  of  the 
policies,  and 

3.  qualities  associated  with  the  products  of  the 
political  processes. 

The  first  comprises  rule  of  law,  interinstitutional 
accountability,  electoral  accountability,  competi¬ 
tion,  and  participation.  The  second  relates  to  the 
responsiveness  of  the  democratic  processes.  The 
third  concerns  the  equality  and  solidarity  that  are 
in  reality  offered  to  citizens. 

Accordingly,  the  rule  of  law  is  not  only  a  proce¬ 
dural  dimension  of  the  democratic  regime — one  of 
the  qualities  it  is  expected  to  feature.  It  is  also  one 
of  the  crucial  dimensions  on  which  many  other 
aspects — or  qualities — of  democratic  governance 
depend.  As  a  matter  of  fact,  the  rule  of  law  enters 


into  patterns  of  regular  interaction  with  other  pro¬ 
cedural  dimensions.  First  among  these  is  interinsti¬ 
tutional  accountability.  In  those  cases  in  which 
legislative  oversight  of  the  policies  enacted  by 
government  proves  to  be  weak  and  ineffective, 
judicial  institutions — for  instance,  the  constitu¬ 
tional  courts — can  perform  a  subsidiary  function 
and  put  into  motion  an  alternative  mechanism  of 
checks  and  balances.  In  a  different  setting,  the  rule 
of  law  may  specifically  interact  with  the  partici¬ 
pants  and  thereby  contribute  to  the  enforcement  of 
the  rules  that  govern  a  society.  One  way  for  this  to 
happen  is  through  legal  mobilization.  Civil  society 
organizations  and  rights  advocacy  coalitions  may 
help  enforce  or  deepen  the  fundamental  freedoms 
of  citizens,  thereby  bringing  about  a  change  in  the 
political  system.  Judicial  arenas  may  become  in 
this  view  an  opportunity  to  act,  of  which  social 
actors  may  take  full  advantage  if  they  have  the 
resources  to  mobilize  and  bring  a  case  before  the 
courts. 

A  further  point  that  deserves  to  be  emphasized  is 
that  the  empirical  approach  to  democratic-quality 
assessment  illustrates  and  reveals  the  several  differ¬ 
ent  ways  by  which  the  law  enters  into  a  combina¬ 
tion  and  intertwines  with  the  extralegal  norms  of  a 
society.  In  a  way,  the  wide  range  of  policies  and 
programs  put  into  motion  to  promote  the  rule  of 
law  in  nascent  democracies  has  provided  scholars 
with  a  formidable  experiment  of  institution  build¬ 
ing,  something  that  in  old  and  consolidated  democ¬ 
racies  would  have  never  been  possible.  Such  an 
experiment  creates  quasi-optimal  conditions  to 
detect  and  disentangle  the  relative  weight  of  prele- 
gal  dimensions  incorporated  into  the  daily  life  of 
social  communities  and  to  assess  the  role  such 
dimensions  play  in  transforming  formal  legal  rules 
into  effective  social  practices.  Drawing  lessons 
from  these  experiences  of  social  and  legal  engineer¬ 
ing,  a  comprehensive  democratic-quality  assess¬ 
ment  exercise  considers  the  interaction  between 
the  procedural  aspect  of  the  rule  of  law  and  the 
degree  to  which  legal  norms  and  jurisprudence 
match  the  diffuse  legal  culture  (an  aspect  referred 
to  by  the  notion  of  responsiveness).  By  focusing  on 
the  prelegal  dimension  of  the  rule  of  law,  a  scholar 
may  be  accused  of  naive  optimism.  If  on  the  one 
hand  rule  of  law  proves  to  be  a  fairly  vague  con¬ 
cept,  on  the  other  hand  prelegal  aspects  show 
resistance  to  empirical  investigation.  Indeed,  for 
social  scientists,  this  exploration  is  forced  to  deal 
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with  a  never-ending  puzzle,  namely,  the  intangible 
nature  of  unobservable  forces,  whose  effects  we 
can  measure  and  whose  intrinsic  ontology  we  can 
only  guess.  However,  by  denying  the  importance  of 
the  prelegal  aspects  of  rule  of  law,  one  does  not  get 
very  far  either  in  understanding  the  way  society 
functions  or  in  providing  policymakers  with  sound 
frameworks  to  set  their  agenda  for  legal  and  judi¬ 
cial  reforms.  According  to  Leonardo  Morlino  and 
Wojciech  Sadurski  (2010),  a  way  out  would  be  to 
refer  to  the  democratic  rule  of  law  in  order  to 
incorporate  those  aspects  that  refer  directly  to  the 
prelegal  dimensions  of  social  enterprise.  In  this 
way,  as  Palombella  (2010)  has  observed,  rule  of 
law  would  mean  simply  the  primacy  of  rules, 
whereas  the  determinacy  of  what  these  rules  are, 
where  they  come  from,  and  where  they  are  going 
remains  an  open  question,  to  be  addressed  empiri¬ 
cally.  As  a  matter  of  fact,  rules  used  in  a  society 
to  deal  with  an  inescapable  part  of  social  life — 
disagreement — might  have  different  sources  and 
different  formats.  By  endorsing  this  view,  scholars 
are  able  to  appreciate  the  delicate  intertwinement 
that  exists  among  democratic  qualities  and,  more 
specifically,  between  the  rule  of  law  and  the  differ¬ 
ent  components  that  characterize  a  democratic 
society. 

A  valuable  aspect  of  the  way  the  democratic- 
quality  assessment  addresses  the  rule  of  law 
comes  from  its  pluralist  theoretical  stance.  As 
mentioned  before,  the  rule  of  law  is  a  “thick” 
concept,  in  which  traditionally  normative  and 
descriptive  aspects  are  mingled  in  a  complex 
interlacing  of  mutual  influences.  Democratic- 
quality  assessment  accepts  this  plurality  of  views 
and  interprets  them  as  a  variety  of  configurations 
in  which  the  rule  of  law  as  an  institutional  prin¬ 
ciple  can  be  instantiated  into  a  democratic  regime. 
In  countries  in  which  the  rule  of  law  is  grounded 
in  a  vibrant  civil  society  and  the  extension  of  the 
public  sector  is  limited  because  of  historical  rea¬ 
sons,  democratic-quality  assessment  does  not 
endorse  an  evaluative  position;  that  is,  it  does  not 
assess  this  specific  configuration  as  a  better  or  a 
worse  one  than  others  equally  represented  by 
Western  institutional  experiences.  It  simply  recog¬ 
nizes  that  each  configuration  of  a  democracy  is 
characterized  by  the  specific  way  in  which  each 
quality  participates  to  determine  the  whole  of 
democratic  governance. 


Beyond  the  Rule  of  Law? 

A  further  development  witnessed  by  contemporary 
scholarship  concerns  the  way  legal  and  political 
reality  challenges  traditional  definitions  and  con¬ 
ceptions  of  rule  of  law.  Despite  its  being  a  concept 
with  a  long  history,  the  rule  of  law  does  not  cease 
to  be  subjected  to  a  steady  and  permanent  con¬ 
frontation  with  reality  and,  thereby,  to  restlessly 
readapt  its  meaning  and  its  normative  value.  In 
fact,  this  complex  and  polysemic  concept  mirrors 
the  multiplicity  of  the  institutional  and  cultural 
experiences  that  are  observed  by  the  numerous  and 
different  models  of  organization  of  power — both 
in  the  Western  tradition  and  nowadays  outside 
it — and  by  the  many  different  models  through 
which  individuals  have  managed  to  accommodate 
selfishly  oriented  behavior  in  a  collective  setting  of 
predictable  and  stable  rules  of  interaction.  In  a 
way,  this  is  a  sign  of  the  state  of  good  health  of  the 
concept,  because  it  bespeaks  its  capacity  to  accom¬ 
modate  different  instances  of  the  same  general 
principle,  namely,  the  idea  of  constraining  power 
through  rules,  legal  and  extralegal. 

However,  in  recent  decades,  new  institutional 
phenomena  and  new  ways  of  thinking  and  practic¬ 
ing  law  making  and  law  enforcement  have  come  to 
light.  The  diffusion  of  democratic  institutions  and 
the  emergence  of  regimes  that  combine  nondemo- 
cratic  features  with  elements  of  the  rule  of  law 
have  added  richness  and  complexity  to  an  already 
large  spectrum  of  empirical  cases  to  which  the  con¬ 
cept  of  rule  of  law  have  been  applied  both  in  the 
presence  and  in  the  absence  of  the  democratic  con¬ 
ditions  in  which  politics  takes  place. 

One  can  mention  at  least  three  new  develop¬ 
ments  undergone  by  political  institutions  that  call 
for  a  rethinking  and  a  reconsideration  of  the  mean¬ 
ing  of  the  rule  of  law  and,  consequently,  the  way 
the  concept  is  operationalized  in  empirical  research 
design. 

1.  The  unpredicted  growth  of  the  production  of 
legal  norms,  which  is  currently  being  experienced 
in  many  advanced  democracies.  The  idea  of 
constraining  power  through  rules  that  are  “law” 
seems  to  encounter  some  difficulties,  due  to 
overproduction  of  legal  norms.  In  this  respect,  too 
much  regulation  shadows  the  magic  glamour  of 
the  rule  of  law  as  a  limit  to  the  rule  of  men.  When 
the  legal  norms  are  persistently  changed,  the 
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intervention  of  the  will  of  men  becomes  dominant 
and  even  overrules  the  “rule  of  rules.” 

2.  The  uncontrollable  process  of  transnation¬ 
alization  of  the  rule  of  law.  This  is  related  to  the 
multiplication  and  fragmentation  of  the  sources  of 
law.  Laws,  quasi  laws,  and  so-called  soft  laws  are 
produced  at  the  subnational,  national,  and 
supranational  levels.  In  practice,  they  mix  together. 
In  reality,  they  offer  to  social  and  political  actors 
the  possibility  of  moving  between  different 
normative  systems,  in  a  way  that  owes  much  more 
to  a  contingent  and  strategic  logic  than  to  a  stable, 
predictable,  and  valid  erga  omnes  logic. 

3.  The  increasing  hybridization  of  legal,  judicial, 
and  institutional  practices  and  procedures.  Social 
and  political  systems  have  deployed  extensive 
processes  of  mutual  imitation,  which  have 
transferred  practices  and  procedures  from  other 
systems  where  they  had  once  been  adopted  or 
invented.  This  occurs  at  all  levels  of  the  organization 
of  the  state,  in  new  and  advanced  democracies  as 
well  as  in  nondemocratic  and  hybrid  regimes. 

These  comprehensive  changes  entail  a  number 
of  important  consequences  for  the  political  science 
research  agenda.  When  the  rule  of  law  is  studied  as 
an  outcome — as  rule  implementation — it  should 
be  considered  a  radial  concept,  composed  of 
semantic  building  blocks  radiating  outward  from  a 
center.  Not  all  the  blocks  have  empirical  meaning 
in  all  instantiations  of  the  rule  of  law.  They  can 
even  have  different  relative  weights  in  different 
empirical  contexts.  This  may  allow  researchers  to 
accommodate  the  large  variety  of  different  forms 
of  rule  of  law,  without  stretching  the  concept  to 
the  point  of  emptying  it  of  value. 

It  is  of  paramount  importance  that  typological 
studies  are  used.  When  the  rule  of  law  is  studied  as 
a  precondition,  it  is  extremely  useful  in  analytical 
terms  to  start  from  ideal  types,  which  feature  a 
specific  way  of  combining  law,  politics,  and  soci¬ 
ety.  The  rules  that  constrain  power  are  produced 
not  only  by  making  laws  but  also,  over  the  long 
run,  by  social  processes.  History  has  provided  us 
with  experiences  that  took  different  paths  through 
law,  politics,  and  society.  Once  developed  as  an 
abstract  typology,  the  research  then  can  go  in 
depth  and  see  to  what  extent  each  real  case  instan¬ 
tiates  the  features  presented  in  the  ideal  types. 


Generally  the  rule  of  law  seems  to  carry  an 
intrinsic  dilemma,  which  is  mirrored  in  the  way  it 
has  been  used  as  a  concept  in  empirical  research 
projects.  Basically  and  substantially,  it  has  been 
thought  of  as  a  way  to  steer  and  control  social 
processes  on  the  basis  of  general,  abstract  rules 
that  are  able  to  remove  the  unavoidable  discretion¬ 
ary  power  from  the  process  of  resolving  social  and 
political  conflicts.  However,  these  general  rules 
must  be  implemented  by  the  actions  of  individuals, 
and  thus,  the  rule  of  law  does  not  eliminate  the 
inherent  tension  between  the  rules  and  those  who 
make  them. 

It  is  important  that  dialogue  continue  to  develop 
between  those  whose  chief  concern  is  concept 
building  and  observation  of  reality  and  those  con¬ 
cerned  with  broadening  the  theoretical  base  of  the 
rule  of  law,  thereby  accounting  more  fully  for  the 
different  aspects  of  democracy  with  which  the  rule 
of  law  interacts.  For  these  reasons,  empirical 
research  in  all  the  social  sciences — and  particularly 
empirical  research  using  a  comparative  method¬ 
ological  approach — deserves  more  than  ever  to  be 
elaborated  on  and  intensified  by  future  generations 
of  social  scientists. 
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Sampling,  Random 
and  Nonrandom 


In  its  most  general  form,  a  sample  is  simply  a  sub¬ 
set  of  a  larger  population  or  group.  In  scientific 
applications,  the  purpose  of  sampling  is  to  be  able 
to  draw  conclusions  about  a  larger  population. 
There  are  many  reasons  to  sample  rather  than 
attempt  to  collect  data  on  the  full  population. 
They  include  the  following: 

•  Cost:  ft  is  often  cost  prohibitive  to  collect  data 
on  the  entire  population. 

•  Speed:  To  collect  data  on  the  entire  population 
would  take  an  excessive  amount  of  time. 

•  Feasibility:  Often  the  complete  set  of  population 
elements  is  unknown.  Most  populations  do  not 
have  centralized  lists  of  their  members.  Even 
when  such  lists  exist,  it  is  usually  impossible  to 
collect  data  on  every  single  member. 

This  entry  discusses  the  major  types  and  problems 
of  sampling  procedures. 

The  sampling  procedure  can  be  described  using 
the  following  terminology. 

Population  Unit.  The  population  unit  is  the  unit  at 
which  the  data  are  collected.  In  survey  research, 
population  units  are  typically  people,  although 
they  can  also  be  at  other  levels,  such  as  households. 
Other  types  of  research  designs  may  look  at  larger 
units  of  analysis,  such  as  nations,  cities,  organiza¬ 
tions,  or  institutions. 


Target  Population.  The  target  population  is  the 
group  of  elements  that  the  sampling  procedure 
aims  to  represent.  For  example,  a  public  opinion 
survey  may  attempt  to  represent  all  adult  Americans 
or  may  limit  the  population  to  individuals  with 
certain  qualifying  characteristics,  such  as  parents 
of  school-age  children,  voters,  people  with  heart 
disease,  or  residents  of  South  Carolina. 

Sampled  Population.  In  contrast  to  the  target  pop¬ 
ulation,  the  sampled  population  refers  to  the  por¬ 
tion  of  the  population  from  which  the  sample  is 
actually  drawn.  In  an  ideal  world,  the  sampled 
population  and  the  target  population  are  nearly  the 
same  thing,  but  for  a  number  of  reasons  drawing  a 
sample  from  the  target  population  is  usually  not 
feasible.  For  example,  survey  data  collected  using 
random  digit  dialing  is  unable  to  include  respon¬ 
dents  who  do  not  possess  telephones.  Low-incidence 
target  populations  exacerbate  these  issues.  For 
instance,  if  a  researcher  wants  to  do  a  representa¬ 
tive  study  of  people  who  run  for  local  office,  using 
a  random  digit  dialing  technique  would  be  enor¬ 
mously  expensive,  as  many  contacts  would  need  to 
be  made  to  get  a  single  eligible  respondent.  For  this 
reason,  studies  of  low-incidence  populations  are 
often  conducted  on  prescreened  individuals  (e.g., 
those  from  membership  lists  or  clinics).  In  all  cases, 
it  is  important  to  consider  any  systematic  differ¬ 
ences  between  the  target  population  and  the  sam¬ 
pled  population. 

Sampling  Unit  and  Sampling  Frame.  To  select  the 
sample  from  the  population,  the  population  must 
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first  be  divided  into  sampling  units,  or  elements 
that  are  subject  to  being  selected.  The  collection  of 
the  list  of  sampling  units  is  referred  to  as  the  sam¬ 
pling  frame.  In  a  simple  random  sample  design,  the 
sampling  frame  may  consist  of  a  list  of  individuals 
who  belong  to  an  organization,  with  the  sampling 
units  being  each  individual.  Cluster  designs  may 
have  a  more  complex  relationship  between  the 
sampling  frame  and  the  sampling  unit,  as  described 
in  the  next  section. 

Types  of  Sampling 

Sample  selection  can  be  broadly  defined  as  being 
random  or  nonrandom — that  is,  as  either  proba¬ 
bility  or  nonprobability  sampling.  The  primary 
difference  between  these  two  approaches  is  that 
nonprobability  samples  require  some  judgment  on 
the  part  of  the  researcher  as  to  which  cases  should 
be  included,  while  probability  samples  are  selected 
such  that  any  member  of  the  population  may  be 
included  (although  not  necessarily  with  equal 
probability).  Additionally,  self-selected  samples 
have  no  sampling  frame  in  the  traditional  sense. 

There  are  numerous  approaches  to  random  and 
nonrandom  sampling.  The  next  section  describes 
some  of  the  most  commonly  used  designs. 

Random  Sampling 

•  Simple  random  sample  (SRS):  In  a  simple  random 
sample  of  size  n  in  a  population  of  size  N,  every 
possible  subset  of  size  n  has  an  equal  probability 
of  selection,  and  the  probability  that  an 
individual  unit  is  selected  into  the  sample  is  n/N. 

•  Systematic  sampling:  In  a  systematic  sample  of 
size  n  of  a  given  population  of  size  N,  the  first 
step  is  to  pick  a  random  starting  point  R 
between  1  and  k,  where  k  is  the  next  integer 
after  N/n.  Next,  R  and  every  &th  element  after  R 
in  the  population  are  selected  into  the  sample. 
One  risk  of  using  systematic  sampling  is  that  if 
there  is  any  periodicity  in  the  list,  the  sample 
variance  may  be  large. 

•  Stratified  sampling:  The  stratified  sampling 
procedure  first  divides  the  population  into 
subgroups,  called  strata,  and  then  selects  an  SRS 
from  each  stratum.  One  potential  downside  of 
using  an  SRS  design  is  that  simply  due  to  chance, 
certain  groups  in  the  population  may  be 


underrepresented,  especially  when  smaller 
sample  sizes  are  used.  One  way  to  guard  against 
this  is  to  use  stratified  sampling  on  measures  of 
interest.  For  instance,  when  conducting  a  survey 
of  political  attitudes  in  a  U.S.  state,  one  might 
want  to  ensure  that  urban,  suburban,  and  rural 
areas  are  adequately  represented.  To  achieve 
this,  the  researcher  can  stratify  the  state  into 
levels  of  urbanicity  and  then  sample  within  each 
urbanicity  level.  Stratified  sampling  improves 
efficiency  if  within-strata  units  are  relatively 
homogeneous  and  most  of  the  variance  in  the 
survey  measurements  is  between  strata. 

•  Cluster  sampling:  Often,  SRS  designs  are  not 
feasible  because  the  total  list  of  elements  is  not 
known.  For  example,  if  one  wanted  to  conduct  a 
study  of  restaurant  workers  in  a  city,  there  is  no 
centralized  list  of  these  employees.  However,  it 
would  be  relatively  straightforward  to  find  a  list 
of  restaurants  in  a  city.  In  a  cluster  sampling 
design,  the  first  step  would  be  to  take  an  SRS  of 
the  restaurants  in  the  city  and  then  take  a 
subsample  of  the  employees  in  each  restaurant. 

In  this  design,  restaurants  are  the  primary 
sampling  units,  and  employees  are  the  secondary 
sampling  units.  Cluster  samples  are  more 
efficient  if  there  is  little  variation  between 
clusters  and  more  variation  within  clusters. 

•  Multistage  cluster  sampling:  A  variation  on  the 
previous  design  is  multistage  cluster  sampling, 
which  effectively  is  a  series  of  nested  clusters. 
Taking  the  previous  example,  suppose  the  same 
researcher  wants  to  interview  a  representative 
sample  of  restaurant  workers  at  the  state  level. 
While  one  could  sample  from  the  list  of 
restaurants  in  the  state,  to  do  so  may  be  an 
inefficient  use  of  limited  resources  and  time.  For 
that  reason,  it  makes  sense  to  first  take  a  random 
sample  of  the  regions  in  the  state  (e.g.,  counties), 
with  the  probability  of  selection  proportional  to 
the  population  in  the  region;  then  a  random 
sample  of  cities  within  the  selected  counties; 
followed  by  a  sample  of  several  restaurants  in 
each  city;  and  finally  of  employees  in  the  selected 
restaurants. 

Nonrandom  Sampling 

Nonrandom  sampling  comprises  a  broad  cate¬ 
gory  of  approaches  that  are  similar  primarily  in 
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that  subjective  criteria  are  used  in  case  selection 
rather  than  randomization.  While  nonrandom 
samples  lack  the  statistical  properties  that  random 
samples  have,  there  are  often  compelling  reasons 
to  use  them,  ranging  from  feasibility  to  conve¬ 
nience  to  specific  criteria  tailored  to  the  research 
design.  Some  examples  of  nonrandom  sampling 
are  as  follows: 

•  Quota  sampling:  Quota  sampling  resembles 
stratified  sampling  in  that  strata  are  created  with 
target  numbers  for  each  group.  Where  quota 
sampling  differs  from  stratified  sampling  is  that 
the  elements  within  each  strata  are  not  chosen  at 
random.  Quotas  can  be  based  on  marginal 
distributions  of  individual  variables  or  by  the 
joint  distribution  of  a  number  of  variables.  The 
main  risk  in  using  quota  sampling  is  that  the 
selected  elements  are  subject  to  bias,  including 
the  following: 

Judgment:  The  respondents  are  selected  by 
the  interviewer,  which  introduces  the  interview¬ 
er’s  preferences  as  to  whom  to  interview  within 
the  quota  groups. 

Self-selection:  The  respondents  opt  into  the 
study.  While  these  respondents  may  fill  the 
quota  groups,  they  may  be  qualitatively  different 
from  members  of  the  group  who  do  not  opt  in. 

Nonignorable  bias:  The  factors  defining  the 
quota  groups  are  insufficient  to  explain  the 
differences  between  the  respondents  in  the  quota 
groups  and  the  population  in  those  groups. 

•  Convenience  sampling:  A  convenience  sample 
contains  a  group  of  participants  who  select 
themselves  into  a  study.  Clinical  trials  for  medical 
treatments  frequently  use  this  type  of  design. 
Because  it  is  impossible  to  sample  patients  at 
random,  these  studies  rely  on  the  willingness  of 
patients  to  volunteer  for  a  clinical  trial.  Once  in 
the  clinical  trial,  patients  are  assigned  at  random 
to  receive  the  drug  being  tested  or  a  placebo. 

•  Purposeful  sampling:  Other  types  of  nonrandom 
sampling  rely  on  case  selection  by  the  researcher. 
In  these  situations,  cases  are  selected  based  on  a 
cluster  of  characteristics  that  allow  the  researcher 
to  test  a  theory.  Generally,  these  designs  are  used 
when  a  relatively  small  number  of  cases  are  to  be 
examined  in  depth.  There  are  many  types  of 


purposeful  sampling.  Some  examples  are  as 
follows: 

Typical  cases:  Cases  are  selected  by  virtue  of 
not  being  unusual  or  exceptional  with  respect  to 
the  larger  population. 

Most  similar/most  different:  Cases  are 
selected  based  on  having  very  similar  or  very 
different  characteristics.  For  example,  two 
nations  with  a  similar  set  of  policies  but  very 
different  outcomes  may  be  examined. 

Critical  cases:  In  critical  case  sampling,  cases 
are  selected  that  are  considered  to  be  particu¬ 
larly  important  to  understanding  the  logic 
behind  a  causal  phenomenon.  Critical  cases  are 
often  used  to  make  the  argument  that  if  an  out¬ 
come  is  true  for  that  case,  then  it  is  likely  true 
for  all  cases. 

•  Snowball  sampling:  Snowball  sampling  builds  a 
sample  in  stages  by  starting  with  a  (generally 
small)  group  of  subjects  who  are  considered  to 
be  good  sources  of  information,  who  then  recruit 
additional  subjects  from  their  own  social 
networks.  The  new  recruits  then  find  more 
recruits,  and  so  on,  until  a  “snowball”  of 
recruits  is  built  up.  While  the  resultant  sample 
may  not  be  representative  of  the  population  at 
large,  snowball  sampling  is  especially  useful  for 
collecting  data  on  populations  that  are  hard  to 
reach  (e.g.,  people  engaging  in  illegal  activities). 

Sampling  and  Nonsampling  Errors 

Samples  are  often  used  to  obtain  estimates  of 
population  parameters  of  interest.  A  population 
estimate  obtained  by  a  sample  estimator  may  con¬ 
tain  errors  that  can  arise  from  many  sources.  The 
total  error  can  be  decomposed  into  two  pieces: 
sampling  error  and  nonsampling  error. 

Evaluating  Sampling  Measurements  and  Errors 

Sampling  error  is  the  random  error  that  arises  as 
a  result  of  selecting  a  sample  of  a  population.  In  a 
random  sample  with  no  bias  present,  an  individual 
sample  may  differ  from  the  true  population  param¬ 
eter,  but  many  repeated  estimates  from  a  sample 
will  on  average  cover  the  true  population  parame¬ 
ter.  The  degree  of  uncertainty  that  is  introduced  by 
sampling  error  is  quantified  by  the  standard  error 
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of  the  estimate  or  the  margin  of  error  of  a  sam¬ 
pling  procedure.  This  section  discusses  the  charac¬ 
teristics  of  sample  estimators  in  general  and  the 
characteristics  of  estimators  derived  from  simple 
random  sampling  in  particular. 

Sample  Measurements 

Given  a  sample  of  i  —  1 ,  ,n  units  from  a 
population  of  N  units,  measured  with  respect  to 
characteristic  x,  die  sample  mean  of  an  attribute  is 
given  by  the  sum  of  the  values  of  x  over  all  the 
units  in  the  sample,  divided  by  the  sample  size: 

lv 

y  = 

n 

The  sample  mean  is  considered  to  be  an  estimate 
for  the  population  mean. 

The  precision  of  this  estimate  can  be  described 
using  the  sampling  distribution.  The  sample  vari¬ 
ance  is  a  measure  of  the  spread  for  a  given  attri¬ 
bute  x,  calculated  as  the  average  squared  deviation 
from  the  sample  mean: 

s2  =  - V  (yj  —  y)2 . 

y  n-  1  y y> 

The  sample  standard  deviation  is  the  square  root 
of  the  sample  variance: 


The  statistic  that  is  typically  reported  to  quan¬ 
tify  the  amount  of  sampling  error  is  the  standard 
error  (SE),  or  the  standard  deviation  of  the  sam¬ 
pling  distribution: 


The  standard  error  can  be  used  to  generate  the 
confidence  interval  of  the  sample  mean.  Because 
the  distribution  of  all  possible  sample  means  is 
approximately  normal,  the  95%  confidence  inter¬ 
val  of  the  sample  mean  is 

y  ±  1 .96  X  SE9. 


Nonsampling  Errors 

Nonsampling  error  refers  to  any  error  intro¬ 
duced  into  the  sample  estimate  that  is  not  attrib¬ 
utable  to  sampling  variation.  If  these  errors  are 
systematic,  then  they  introduce  bias  into  the 
sample  estimate.  In  probability  sampling  where 
the  selection  probabilities  are  known,  unbiased 
estimates  of  the  population  parameters  and  their 
standard  errors  can  be  calculated.  For  nonprob¬ 
ability  samples,  or  probability  samples  where  the 
selection  probabilities  are  not  known,  this  is  not 
always  the  case,  and  at  a  minimum,  a  model  of 
sample  inclusion  is  necessary.  For  both  random 
and  nonrandom  sampling  designs,  however,  non- 
sampling  errors  are  often  present. 

Nonsampling  errors  can  systematically  bias 
parameter  estimates  to  a  greater  extent  than  the 
deviation  caused  by  random  sampling  errors.  These 
errors  can  be  due  to  selection  bias,  where  some  set 
of  the  target  population  is  systematically  not 
included  in  the  sample.  For  instance,  coverage  error 
results  when  units  in  the  target  population  are  not 
included  in  the  sampling  frame  (undercoverage)  or 
when  some  units  in  the  sample  do  not  originate 
from  the  target  population  (overcoverage). 

Additionally,  errors  in  the  survey  instrument 
can  lead  to  bias  from  measurement  error.  Measure¬ 
ment  error  can  be  introduced  into  a  survey  instru¬ 
ment  in  several  ways.  Not  all  are  preventable,  but 
a  carefully  constructed  survey  design  can  minimize 
these  errors. 

•  Question  wording,  order,  and  context  can 
introduce  bias  into  the  survey  instrument.  The 
respondent  may  be  “primed”  to  provide  a 
particular  response,  or  a  question  may  not  be 
neutrally  worded. 

•  An  imprecise  measurement  device  may  prevent 
respondents  from  expressing  their  actual 
opinions. 

•  Respondents  may  provide  inaccurate  answers  to 
survey  questions  unintentionally,  because  of 
poor  questionnaire  design  or  simple  mistakes. 

•  With  sensitive  material  and  when  sampling 
human  subjects,  the  researcher  also  has  to 
consider  the  possibility  that  respondents  are 
deliberately  misrepresenting  information  in  the 
survey  responses.  For  example,  respondents  may 
lie  when  asked  about  their  yearly  income  or 
when  asked  whether  they  voted  in  the  previous 
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general  election.  Respondents  may  also  forget 
details  when  answering  questions  about  activities 
in  the  past  and  may  provide  guesses  in  response. 

•  Human  errors  can  also  be  introduced  into  the 
survey  when  coding  the  responses  for  analysis. 

Nonresponse  error  can  create  significant  bias  in 
samples.  Often,  the  characteristics  of  non-responders 
are  systematically  different  from  the  characteristis  of 
those  who  do  respond.  This  problem  is  especially 
significant  if  the  measurement  of  interest  is  affected 
by  nonresponse.  For  example,  in  a  study  of  political 
participation,  if  people  who  participate  are  more 
likely  to  respond  than  people  who  do  not,  then  esti¬ 
mates  of  participation  rates  will  almost  certainly  be 
affected.  In  cases  such  as  these,  nonresponse  is  non- 
ignorable.  There  are  two  basic  types  of  nonresponse. 

•  Unit  nonresponse  occurs  when  an  entire 
sampling  unit  is  missing  from  the  sample.  A 
respondent  who  refuses  to  answer  any  of  the 
questions  in  a  survey  is  an  example. 

•  Item  nonresponse  occurs  when  a  single  or 
multiple  survey  items  are  missing,  but  not  all  of 
them,  for  a  single  unit.  For  example,  a 
respondent  who  skips  a  question  about  income 
level  but  responds  to  the  rest  of  the  survey 
exhibits  item  nonresponse. 

In  cases  where  the  probability  of  response  is 
affected  only  by  observed  variables  rather  than 
survey  measurements,  the  data  are  considered 
missing  at  random,  and  the  effects  of  nonresponse 
can  be  considered  ignorable,  as  a  model  can  adjust 
for  any  effects  of  nonresponse.  Using  a  different 
participation  example,  suppose  women  were  more 
likely  to  respond  to  the  survey  than  men  were.  If 
the  participation  rate  among  the  women  surveyed 
is  the  same  as  it  is  for  the  population  of  women 
and,  similarly,  the  participation  rate  among  the 
men  surveyed  is  the  same  as  it  is  for  the  population 
of  men,  then  survey  weights  can  correct  for  the 
effects  of  unit  nonresponse. 
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Sanctions 


Originally  conceived  by  Woodrow  Wilson  as  a 
key  component  of  the  post-World  War  I  order 
within  the  League  of  Nations,  the  effective  appli¬ 
cation  of  economic  and  other  sanctions  did  not 
develop  unilaterally  until  after  World  War  II  and 
developed  multilaterally  only  after  the  post-Cold 
War  era.  Despite  some  examples  of  punitive  sanc¬ 
tions  taking  an  economic  toll  when  imposed  uni¬ 
laterally,  such  as  the  United  States  against  Cuba 
and  Iran  and  the  former  Soviet  Union  against 
Armenia,  multilateral  sanctions  have  become  the 
norm  as  they  have  both  the  greatest  legitimacy 
and  chance  of  success.  Thus,  they  constitute  the 
main  focus  of  this  entry. 

Since  1990,  the  United  Nations  Security  Council 
(UNSC)  has  imposed  sanctions  against  19  nations 
and  entities,  while  regional  organizations,  most 
notably  the  European  Union  and  the  British 
Commonwealth,  have  increasingly  resorted  to 
sanctions  as  well.  The  legal  authority  for  United 
Nations  (UN)  sanctions  appears  in  Chapter  VII, 
Article  41,  of  the  UN  Charter,  which  provides  that 
the  Security  Council  may  call  on  states  to  impose 
nonmilitary  measures,  such  as  interruptions  of  eco¬ 
nomic  and  diplomatic  relations,  to  protect  interna¬ 
tional  peace  and  security.  In  its  first  45  years 
of  operation,  the  Council  imposed  sanctions 
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measures  in  just  two  cases:  in  Southern  Rhodesia 
in  1966  (S/RES/232)  and  in  South  Africa  in  1977 
(S/RES/418).  The  end  of  the  Cold  War  removed 
the  political  roadblocks  preventing  collective 
action,  while  the  dynamics  of  economic  globaliza¬ 
tion  convinced  nations  that  they  derived  little  ben¬ 
efit  from  cheating  on  the  enforcement  of  sanctions. 
Instead,  nongovernmental  actors  and  criminal 
networks  have  tended  to  be  the  major  sanctions 
busters. 

UN  multilateral  economic  sanctions  have  been 
imposed  for  a  diverse  set  of  reasons:  to  promote 
democracy  and  human  rights,  to  enforce  interna¬ 
tional  law  and  resolutions  of  the  UNSC,  to  prevent 
military  aggression  and  armed  conflict,  to  encour¬ 
age  military  demobilization  and  postconflict  recon¬ 
struction,  to  counter  terrorism,  and  to  prevent  the 
proliferation  of  nuclear  weapons.  The  list  of  the 
sanctions  imposed  by  the  UNSC  from  1990  to 
2009  reveals  a  substantial  set  of  cases  where  sanc¬ 
tions  are  not  imposed  against  a  government  but 
rather  against  nonstate  actors  or  militia  forces 
operating  within  a  particular  country.  Examples  of 
this  trend  include  the  cases  of  UN  sanctions  against 
actors  in  Somalia,  Angola,  Rwanda,  Sierra  Leone, 
Democratic  Republic  of  Congo,  Sudan,  and  Cote 
d’Ivoire.  In  the  cases  of  Afghanistan  and  Iraq, 
sanctions  initially  were  imposed  against  govern¬ 
ments,  but  after  the  U.S.-led  overthrow  of  these 
governments,  the  Security  Council  measures  were 
redirected  against  individuals  and  entities  associ¬ 
ated  with  the  former  regime  and/or  insurgent/ 
terrorist  groups.  Sanctions  imposed  in  Iraq, 
Somalia,  Liberia,  and  Afghanistan  have  changed 
significantly  over  the  years,  in  line  with  changing 
political  conditions  within  the  targeted  regime. 

Of  all  UNSC  sanctions,  only  those  imposed 
against  Southern  Rhodesia  (1966),  South  Africa 
(1977),  Iraq  (1990),  Yugoslavia  (1991),  and  Haiti 
(1993)  included  comprehensive  trade  sanctions  as 
part  of  a  larger  set  of  coercive  techniques. 
Controversies  regarding  the  devastating  humani¬ 
tarian  impact  of  such  sanctions  in  Iraq  led  to  a 
reevaluation  of  general  trade  embargoes  and 
prompted  the  adoption  after  1994  of  more  selec¬ 
tive  and  targeted  measures — the  so-called  smart 
sanctions.  Smart  sanctions  are  aimed  exclusively 
against  wrongdoers  rather  than  entire  economies. 
These  include  individuals  holding  specific  govern¬ 
ment  positions,  economic  elites,  and  entities  who 


provide  significant  support  to — or  benefit  from — 
the  sanctioned  government  or  who  have  violated 
international  law  or  Security  Council  resolutions 
on  their  own  accord.  Smart  sanctions  seek  to  con¬ 
trol,  capture,  or  restrict  the  movement  and  use  of 
financial  assets,  specific  products — such  as  luxury 
goods,  arms,  or  particular  commodities — that 
often  are  critically  necessary  for  the  supply  and 
financing  of  armed  conflict  or  that  are  being 
moved  illegally  to  aid  an  illegal  action  or  actor. 
Smart  sanctions  include 

•  financial  sanctions,  which  freeze  the  assets 
(including  property)  of  and  block  financial 
transactions  with  designated  individuals  and 
entities,  including  restrictions  on  dealing  with 
specific  banks; 

•  arms  embargoes,  which  ban  the  supply  of 
weapons,  military-related  technology,  and  other 
forms  of  military  assistance; 

•  travel  sanctions,  which  deny  visas  and  ban  the 
travel  of  designated  individuals  or  prohibit  travel 
on  designated  airlines  or  to  the  airspace  of 
targeted  regimes; 

•  commodity  sanctions,  which  prohibit  imports  or 
exports  of  specific  materials  or  goods,  such  as 
diamonds,  oil,  timber,  and  selected  stones  and 
metals;  and 

•  diplomatic  sanctions,  which  deny  national 
participation  in  international  events  or 
organizations  or  withdraw  the  diplomatic 
privileges  of  designated  individuals  or  regimes. 

To  implement  these  focused,  targeted  financial 
measures,  the  UN  has  relied  on  state  implementa¬ 
tion  and  enforcement  through  banks  and  major 
financial  institutions.  Most  international  hard- 
currency  transactions  are  screened  through  name 
detection  software,  which  provides  an  effective 
means  of  interdicting  illicit  transfers.  Many  nations 
also  have  financial  intelligence  units  that  assist  law 
enforcement  officials  in  detecting  and  thwarting 
financial  crime  and  sanctions  violations. 

The  Security  Council  has  also  created  sanctions 
committees  and  authorized  them  to  establish  lists 
of  designated  individuals  and  entities  whose  assets 
are  to  be  frozen.  In  most  cases,  those  listed  are  also 
subject  to  travel  and  visa  restrictions  as  well.  The 
A1  Qaeda  and  Taliban  Sanctions  Committee 
(UNSCR  [UNSC  Resolution]  1267)  has  the  largest 
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and  most  contested  list,  with  approximately  500 
names.  Designation  lists  also  exist  for  the  sanc¬ 
tions  in  Liberia,  Cote  d’Ivoire,  Democratic  Republic 
of  Congo,  and  Sudan  and  for  the  sanctions  against 
insurgents  and  supporters  of  the  former  regime  in 
Iraq.  As  of  2009,  the  total  number  of  individuals 
and  entities  on  UN  sanctions  committees  lists  was 
more  than  900.  The  procedures  for  designating 
and  removing  names  from  these  lists  have  proven 
to  be  highly  controversial,  especially  for  the 
Security  Council’s  consolidated  list  of  alleged  sup¬ 
porters  of  A1  Qaeda  and  the  Taliban. 

Arms  embargoes  are  the  most  frequently 
employed  forms  of  sanctions,  but  in  many  ways, 
they  are  the  least  successful.  The  enforcement  of 
arms  embargoes  requires  a  high  degree  of  interna¬ 
tional  cooperation.  Arms  smuggling  networks  are 
highly  sophisticated  and  pervasive  and  are  often 
able  to  circumvent  interdiction  that  is  poorly 
enforced  in  some  borders.  Only  in  the  sanctions 
against  Iraq  has  international  cooperation  been 
sufficiently  strong  to  make  Security  Council  restric¬ 
tions  on  the  supply  of  arms  and  the  advanced 
weapons  technologies  relatively  effective. 

Commodity  sanctions  are  a  means  of  prevent¬ 
ing  militia  and  criminal  actors  from  exploiting 
natural  resources  to  finance  and  sustain  armed 
conflict.  Commodity  sanctions  have  also  helped 
facilitate  more  effective  governance  by  newly 
emerging  postconflict  regimes  in  the  wake  of  the 
UN-approved  peace  settlements.  The  Security 
Council  imposed  oil  embargoes  as  part  of  the  sanc¬ 
tions  against  Iraq,  former  Yugoslavia,  and  the 
other  countries  in  the  1990s.  An  embargo  on 
the  export  of  logs  was  imposed  in  Liberia  in  2003. 
The  most  significant  and  successful  commodity 
sanctions  came  against  the  so-called  blood  dia¬ 
monds.  Beginning  in  1998  with  the  case  of  Angola, 
and  continuing  with  the  sanctions  in  Sierra  Leone 
and  Liberia,  the  Security  Council  prohibited  the 
import  of  diamonds  from  territory  controlled  by 
rebel  groups.  This  effort  spurred  the  creation  of 
the  Kimberley  certification  system  that,  in  turn, 
has  helped  shrink  the  financial  base  used  by  crimi¬ 
nal  groups  and  militias  in  armed  conflict. 

UN  smart  sanctions  have  also  been  used  as  a 
means  of  controlling  nuclear  proliferation.  The  end 
of  these  efforts  will  depend  on  the  intelligent  inte¬ 
gration  of  incentives,  sanctions,  and  security  assur¬ 
ances  as  part  of  a  diplomatic  bargaining  process. 


Sanctions  can  be  imposed  and  implemented  by  the 
UNSC,  but  security  assurances  and  inducements  are 
the  purview  of  individual  nations,  particularly  the 
United  States  and  other  major  powers,  acting  on 
their  own  or  in  concert  beyond  the  UN  framework. 

Scholars  have  engaged  in  contentious  debate 
regarding  how  to  evaluate  the  effectiveness  of 
sanctions  generally  and  UN  sanctions  in  particu¬ 
lar.  The  most  authoritative  empirical  study  is  that 
by  Gary  Hufbauer  and  colleagues,  which  examines 
204  cases  from  1914  to  2006.  Their  data  set 
includes  a  large  number  of  cases  of  unilateral  sanc¬ 
tions,  mostly  imposed  by  the  United  States.  Their 
conclusion  is  that  sanctions  contributed  to  policy 
change  in  70  of  the  cases  studied,  for  an  overall 
success  rate  of  34%.  Other  scholars  have  found 
that  sanctions  are  most  effective  when  economic 
costs  are  high  for  the  target  but  low  for  the  sender, 
when  the  gross  domestic  product  of  the  sender  is 
much  larger  than  that  of  the  target,  and  when  the 
target  and  the  sender  have  extensive  trade  rela¬ 
tions.  To  these  results,  Daniel  Drezner  has  added 
the  “sanctions  paradox,”  noting  that  sanctions  are 
most  likely  to  be  effective  when  the  sender  and  the 
target  are  interdependent  economically  and  have 
cooperative  political  relations,  yet  coercive  mea¬ 
sures  are  rarely  necessary  under  such  conditions 
and  in  fact  are  often  imposed  when  political  rela¬ 
tions  and  interdependence  are  low.  David  Cortright 
and  George  Lopez  found  the  1  in  3  success  rate 
noted  by  the  Hufbauer  study  replicated  for  the  UN 
sanctions  as  well,  especially  when  the  Council 
maximized  the  use  of  expert  panels,  monitoring 
groups,  and  aggressive  sanctions  committees. 

Many  analysts  would  consider  the  current  state 
of  sanctions  a  mixed  success.  On  the  one  hand,  the 
diversity  of  global  economic  relations  has  made  it 
possible  for  the  UNSC  to  institute  many  refine¬ 
ments  in  the  design  and  implementation  of  tar¬ 
geted  economic  sanctions  that  have  some  success 
in  curtailing  violent  conflict  and  enforcing  inter¬ 
national  norms  of  counterterrorism  and  counterp¬ 
roliferation.  The  record  of  success  for  multilateral 
sanctions  is  mixed  but  no  less  so  than  for  the  use 
of  military  force. 

On  the  other  hand,  concerns  about  the  lack  of 
due  process  rights  in  the  listing  and  delisting  pro¬ 
cedures  for  smart,  targeted  sanctions  have  created 
a  political  backlash  against  UN  sanctions.  A  num¬ 
ber  of  European  states,  longtime  supporters  of 
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sanctions,  have  both  criticized  the  Security  Council 
procedures  and  redrawn  some  of  the  boundaries  of 
sanctions  imposition  as  a  result  of  court  rulings 
and  administrative  decisions. 

George  A.  Lopez 
University  of  Notre  Dame 
Notre  Dame,  Indiana,  United  States 
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Scaling 


Scaling  methods  used  in  the  social  sciences  typi¬ 
cally  are  statistical  procedures  that  take  observed 
data  and  extract  latent  (i.e.,  abstract)  dimensions 


on  which  the  objects  or  subjects  are  ordered.  In 
political  science,  the  primary  focus  has  been  on 
issue/public  policy  and/or  ideological  scales  using 
individuals’  judgments  or  observed  voting  behav¬ 
ior.  One  important  example  is  the  estimation  of 
ideological  dimensions  for  all  of  congressional 
history  by  Keith  Poole  and  Howard  Rosenthal. 
Below,  the  development  of  these  techniques  and 
their  applications  in  political  science  are  discussed. 

Scaling  methods  used  in  political  science  for  the 
most  part  have  their  origins  in  psychology.  The 
original  work  on  scaling  in  psychology  done  at  the 
turn  of  the  20th  century  was  aimed  at  measuring 
general  ability  or  intelligence.  The  pioneers  were 
Karl  Pearson  and  Charles  Spearman.  Spearman  in 
1904  used  factor  analysis  to  analyze  a  correlation 
matrix  between  test  scores  of  22  English  high  school 
boys  for  Classics,  French,  English,  Math,  Pitch,  and 
Music.  Spearman  computed  a  form  of  rank-order 
correlation  between  each  pair  of  skills  across  the  22 
school  boys  and  then  extracted  a  common  or  gen¬ 
eral  factor  (g  factor)  from  the  matrix.  His  method 
of  computing  pairwise  correlations  and  calculating 
the  g  factor  was  quickly  supplanted  by  later  work. 

In  particular,  Pearson  invented  the  product- 
moment  correlation  coefficient  that  is  universally 
denoted  as  r,  and  he  should  also  be  credited  with 
the  invention  of  principal  components  analysis 
(what  we  now  would  think  of  as  straightforward 
eigenvalue/eigenvector  decomposition).  Pearson  in 
1901  called  it  “the  method  of  principal  axes”  and 
stated  the  problem  quite  succinctly:  “In  many 
physical,  statistical,  and  biological  investigations  it 
is  desirable  to  represent  a  system  of  points  in 
plane,  three,  or  higher  dimensioned  space  by  the 
‘best-fitting’  straight  line  or  plane”  (p.  559). 
Remarkably,  this  also  describes  the  essence  of  the 
famous  Eckart-Young  theorem  (1936),  which  is 
the  foundation  of  general  least  squares  problems. 

Lewis  Leon  Thurstone  thought  that  Spearman’s 
one-factor  theory  of  intelligence  was  wrong,  and  he 
succeeded  in  developing  a  method  for  extracting 
multiple  factors  from  a  correlation  matrix. 
Thurstone’s  theory  of  intelligence  postulated  seven 
rather  than  one  primary  mental  ability,  and  he  con¬ 
structed  tests  specific  to  the  seven  abilities:  verbal 
comprehension,  word  fluency,  number  facility,  spa¬ 
tial  visualization,  associative  memory,  perceptual 
speed,  and  reasoning.  Thurstone  also  developed  the 
law  of  comparative  judgment.  Thurstone’s  law  is 
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more  accurately  described  as  a  measurement  model 
for  a  unidimensional  subjective  continuum. 
Subjects  are  asked  to  make  a  series  of  n(n  —  1)12 
pairwise  comparisons  of  n  stimuli.  It  is  assumed 
that  a  subject’s  response  reflects  the  momentary 
subjective  value  associated  with  the  stimulus  and 
that  the  probability  distribution  of  these  momen¬ 
tary  values  is  normally  distributed.  It  is  then  pos¬ 
sible  to  recover  the  underlying  continuum  or  scale 
by  essentially  averaging  across  a  group  of  subjects. 
If  the  variances  of  the  stimuli  (the  discriminal  dis¬ 
persions)  on  the  underlying  scale  are  the  same 
(Case  5  of  the  model),  the  requirement  of  the  par¬ 
allel-item  characteristic  curves  in  the  Rasch  model 
is  satisfied.  Case  5  of  Thurstone’s  method  should 
yield  essentially  the  same  results  as  the  Rasch 
model  for  dichotomous  data. 

In  the  decade  after  World  War  II,  Thurstone’s 
scaling  work  inspired  the  development  of  what 
became  known  as  Guttman  scaling  and  multidi¬ 
mensional  scaling  (MDS).  These  methods  represent 
measurements  of  similarity  between  pairs  of  stimuli 
as  distances  between  points  in  a  low-dimensional 
(usually  Euclidean)  space.  The  methods  locate  the 
points  in  such  a  way  that  points  corresponding  to 
very  similar  stimuli  are  located  close  together,  while 
those  corresponding  to  very  dissimilar  stimuli  are 
located  farther  apart.  Warren  Torgerson  showed  a 
simple  method  of  MDS  based  on  the  Eckart-Young 
theorem.  However,  Torgerson’s  method  assumed 
that  the  similarities  were  measured  on  a  ratio  scale, 
which  is  very  unlikely  in  most  cases. 

Roger  Shepard  argued  that  the  relationship 
between  the  true  distance  between  a  pair  of  stimuli 
and  the  observed  distance  was  exponential.  This 
led  him  to  develop  nonmetric  multidimensional 
scaling  in  which  distances  are  estimated  that  repro¬ 
duce  a  weak  monotone  transformation  (or  rank 
ordering)  of  the  observed  dissimilarities.  Shepard’s 
program  worked,  but  the  key  breakthrough  was 
Joseph  Kruskal’s  idea  of  monotone  regression  that 
led  to  the  development  of  a  powerful  and  practical 
nonmetric  MDS  program.  By  the  early  1970s,  this 
was  known  by  the  acronym  KYST  (Kruskal, 
Young,  Shepard,  and  Torgeson)  and  is  still  in  use 
today. 

MDS  methods  can  be  seen  as  evolving  from  fac¬ 
tor  analysis  and  Thurstone’s  unidimensional  scal¬ 
ing  method  with  the  key  difference  being  that 
MDS  methods  are  applied  to  relational  data — that 


is,  data  such  as  similarities  and  preferential  choice 
data  that  can  be  regarded  as  distances.  At  the  same 
time  that  MDS  methods  were  evolving,  Louis 
Guttman  developed  scalogram  analysis  or  what  is 
more  commonly  known  as  Guttman  scaling.  A 
Guttman  scale  is  the  basis  of  all  modern  skills- 
based  tests.  It  is  a  set  of  items  (questions,  prob¬ 
lems,  etc.)  that  are  ranked  in  order  of  difficulty  so 
that  those  who  answer  correctly  (agree)  on  a  more 
difficult  (or  extreme)  item  will  also  answer  cor¬ 
rectly  (agree)  all  the  less  difficult  (extreme)  items 
that  preceded  it.  Rasch  analysis  (more  broadly, 
item  response  theory  [IRTJ )  is  essentially  a  sophis¬ 
ticated  form  of  Guttman  scalogram  analysis.  These 
are  techniques  for  examining  whether  a  set  of 
items  is  consistent  in  the  sense  that  they  all  mea¬ 
sure  increasing/decreasing  levels  of  some  unidi¬ 
mensional  attribute  (e.g.,  mathematical  ability, 
racial  prejudice). 

At  the  same  time  when  Torgerson  was  develop¬ 
ing  classical  scaling  and  Guttman  was  developing 
scalogram  analysis,  Clyde  Coombs  developed 
unfolding  analysis.  Coombs  came  up  with  the  idea 
of  an  ideal  point  and  a  single-peaked  preference 
function  to  account  for  the  observed  rank  order¬ 
ings.  The  idea  was  to  arrange  the  individuals’  ideal 
points  and  points  representing  the  stimuli  along  a 
scale  so  that  the  distances  between  the  ideal  points 
and  the  stimuli  points  reproduced  the  observed 
rank  orderings.  Coombs  called  this  an  unfolding 
analysis  because  the  researcher  must  take  the  indi¬ 
viduals’  rank  orderings  and  “unfold”  them. 

Unfolding  analysis  deals  with  relational  data  and 
is  therefore  an  MDS  method.  Both  unfolding  analy¬ 
sis  and  scalogram  analysis  deal  with  individuals’ 
responses  to  a  set  of  stimuli.  But  Guttman’s  model 
is  very  different  from  the  unfolding  model.  In  terms 
of  utility  theory,  unfolding  analysis  assumes  a  sin¬ 
gle-peaked  (usually  symmetric)  utility  function. 
That  is,  utility  (the  degree  of  preference)  declines 
with  distance  from  the  individual’s  ideal  point.  In 
contrast,  Guttman  scaling  is  based  on  a  utility  func¬ 
tion  that  is  always  monotonically  increasing  or 
decreasing  over  the  relevant  dimension  or  space. 
Above  some  threshold,  the  individual  always 
responds  with  a  Yes/Correct,  and  below  the  thresh¬ 
old,  the  individual  always  responds  with  a  No/ 
Incorrect.  The  counterpart  to  an  ideal  point  is  the 
position  on  the  scale  where  the  individual’s 
responses  switch  from  Yes/Correct  to  No/Incorrect. 
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Interestingly,  these  two  very  different  models 
are  observationally  equivalent  in  the  context  of 
parliamentary  voting.  In  the  unfolding  model, 
there  are  two  outcomes  for  every  parliamentary 
motion — one  corresponding  to  “Yea”  and  one 
corresponding  to  “Nay.”  Legislators  vote  for  the 
option  closest  to  their  ideal  points.  In  one  dimen¬ 
sion,  this  forms  a  perfect  scalogram.  Hence, 
Guttman  scaling  methods  and  their  IRT  paramet¬ 
ric  descendants  can  be  used  to  analyze  parliamen¬ 
tary  (binary  choice)  data. 

In  the  1970s  and  1980s,  political  scientists 
began  to  combine  techniques  from  econometrics 
and  statistics  with  approaches  developed  by  psy¬ 
chometricians.  John  Aldrich  and  Richard  McKelvey 
in  1977  developed  a  unidimensional  scaling 
method  for  7-point  scales.  This  method  uses  the 
respondents’  perceived  locations  of  the  political 
stimuli  to  recover  an  underlying  latent  configura¬ 
tion.  It  also  will  recover  the  respondents’  self¬ 
placements  on  the  same  latent  scale.  Henry  Brady 
made  contributions  to  the  statistical  foundations 
of  nonmetric  MDS  as  well  as  methods  for  and 
problems  with  the  analysis  of  preferences.  Poole 
and  Rosenthal  combined  the  random  utility  model 
developed  by  economists,  the  spatial  model  of  vot¬ 
ing,  and  alternating  estimation  methods  developed 
in  psychometrics  to  develop  NOMINATE,  an 
unfolding  method  for  parliamentary  roll  call  data. 

In  the  1990s  and  the  early  2000s,  the  availability 
of  inexpensive  and  fast  computers  made  simulation 
methods  for  the  estimation  of  complex  multivariate 
models  practical  for  the  first  time,  and  these  meth¬ 
ods  were  fused  with  long-standing  psychometric 
methods  by  political  scientists.  Specifically,  Markov 
chain  Monte  Carlo  (MCMC)  simulation  within  a 
Bayesian  framework  coupled  with  the  IRT  model 
can  be  used  to  perform  an  unfolding  analysis  of 
parliamentary  roll  call  data.  Political  scientists  have 
successfully  blended  methods  from  statistics  and 
econometrics  with  psychometrics  to  produce 
unique  scaling  applications.  The  coming  decades 
should  see  even  greater  advances  due  to  the  avail¬ 
ability  of  ever-faster  parallel  computing  methods 
and  the  increasing  sophistication  of  political  sci¬ 
ence  methodological  training. 

Keith  T.  Poole 
University  of  California,  San  Diego 
La  Jolla,  California,  United  States 
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Scaling  methods  germinated  in  the  pioneering 
work  of  Karl  Pearson  and  Charles  Spearman, 
who  in  the  first  decade  of  the  20th  century  devel¬ 
oped  methods  to  describe  relationships  between 
variables.  Pearson  created  the  correlation  coeffi¬ 
cient  for  metric  data,  while  Spearman  introduced 
the  rank  order  correlation  and  an  algorithm 
designed  to  document  the  existence  of  a  general 
factor  in  intelligence  research.  Since  then,  these 
ideas  have  been  extended  to  permit  an  (in  theory) 
unlimited  number  of  factors  obtained  from  data 
ranging  from  crude  categorical  to  refined  metric 
and  permitting  the  imposition  of  a  variety  of  con¬ 
straints  on  the  variables.  What  all  scaling  meth¬ 
ods  have  in  common,  however,  is  that  they  can  be 
used  to  transform  a  set  of  observed  variables  to  a 
smaller  number  of  latent  variables  or  factors 
(dimensions). 
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Scaling  methods  differ  with  respect  to  the  type 
of  input  data  for  which  they  are  most  appropriate. 
Principal  component  analysis  (PCA)  and  factor 
analysis  (FA)  make  the  assumption  that  the  data 
are  metrically  scaled.  If  the  data  are  ordered  cate¬ 
gories,  such  as  Likert-type  responses,  categorical 
(or  nonlinear)  principal  component  analysis 
(CATPCA)  is  probably  the  best  choice.  Multiple 
correspondence  analysis  (MCA)  is  particularly 
suited  for  the  analysis  of  unordered  categorical  (or 
nominal)  data.  For  paired-comparison  data,  the 
proper  choice  would  be  multidimensional  scaling. 
Rasch  scaling  or  the  unfolding  model  are  proper 
choices  when  the  response  options  are  dichoto¬ 
mous  (Yes/No)  with  an  implicit  item  difficulty 
order,  which  usually  means  that  they  have  a  one¬ 
dimensional  solution.  The  following  documents 
the  similarities  and  differences  of  the  most  popular 
scaling  methods  in  the  political  and  social  sciences, 
namely,  PCA,  CATPCA,  and  MCA. 

To  demonstrate  these  methods,  we  chose  seven 
variables  on  national  pride  from  the  U.S.  portion 
of  the  International  Social  Survey  Program  (ISSP) 
2003.  To  avoid  a  lengthy  discussion  on  missing 
data,  possible  imputation  techniques,  and  their 
impact  on  the  solution,  we  use  listwise  deletion, 
which  reduces  the  sample  from  1,216  cases  to 
1,148  cases.  The  seven  variables  (identified  with 
the  abbreviations  that  will  be  used  in  subsequent 
figures)  are  as  follows: 

A.  I  would  rather  be  a  citizen  of  [Country]  than  of 
any  other  country  in  the  world. 

B.  There  are  some  things  about  [Country]  today 
that  make  me  feel  ashamed  of  [Country], 

C.  The  world  would  be  a  better  place  if  people 
from  other  countries  were  more  like  the 
[Country  Nationality]. 

D.  Generally  speaking,  [Country]  is  a  better 
country  than  most  other  countries. 

E.  People  should  support  their  country  even  if  the 
country  is  wrong. 

F.  When  my  country  does  well  in  international 
sports,  it  makes  me  proud  to  be  [Country 
Nationality]. 

G.  I  am  often  less  proud  of  [Country]  than  I  would 
like  to  be. 


Five  response  options  ranging  from  agree 
strongly  (1),  to  neither  agree  nor  disagree  (3),  to 
disagree  strongly  (5)  were  made  available  for  all 
items.  Five  of  the  seven  variables  have  a  positive 
formulation,  such  that  agree  strongly  implies  a 
favorable  attitude  toward  the  country.  The  remain¬ 
ing  two  items  (2  and  7)  have  a  negative  formula¬ 
tion,  requiring  respondents  to  (strongly)  disagree 
to  express  a  favorable  attitude. 

Principal  Component  Analysis 

Assume  that  there  are  m  observed  variables  with  n 
respondents  each.  In  PCA,  these  m  observed  vari¬ 
ables  are  reexpressed  without  any  loss  of  informa¬ 
tion  by  m  latent  variables;  that  is,  when  the  number 
of  factors  equals  the  number  of  variables,  the 
original  observed  scores  can  be  recaptured.  How¬ 
ever,  the  usual  aim  is  to  reduce  the  dimensionality 
of  the  data  by  using  a  substantially  smaller  number 
of  m  *  factors.  Specifically,  each  observed  variable 
(z ;.)  can  be  expressed  through  its  association  (ajk) 
with  each  latent  factor  (ffe),  resulting  in 

Zj  =  Cay!  +  f2a,2  +  •  •  •  +  fm*a;m*  +  •  •  •  + 

Here,  z.  is  a  vector  that  contains  the  ^-transformed 
categories  of  the  jth  variable  for  all  n  respondents, 
ik  contains  the  categories  from  the  &th  latent  vari¬ 
able  of  all  n  respondents,  and  ajk  is  the  correlation 
between  the  ;th  observed  and  the  kth  latent 
variable. 

In  matrix  notation  and  for  the  full  solution  (i.e., 
where  the  number  of  observed  variables  is  equal  to 
the  number  of  factors),  Z  =  FAT,  with  I  /VzF1  F  =  I . 
I  is  an  identity  matrix,  which  by  definition  has  ones 
in  the  main  diagonal  and  zeros  in  the  off-diagonal 
elements;  in  other  words,  the  factors  are  orthogo¬ 
nal  (uncorrelated)  with  each  other.  Because 
Zu  —  (X{  ~  oc)/sx  and  z2;  =  (y,  ~  y)/sy ,  it  follows 
that  ri2  =  (l/«)z]z2,  and  in  matrix  notation, 
R  =  {l/n)ZrZ  =  (1/«)AFtFAt  -  AAT,  which  is 
known  as  the  fundamental  theorem  in  FA.  Since 
the  idea  is  to  reduce  the  set  of  m  observed  variables 
to  m*  factors,  with  ra*  <  m,  it  follows  that 
Z  «  F,„*At  and  R  «  Am*A\. 

Applying  canonical  decomposition  produces 
R  =  BABT,  where  B  is  the  matrix  of  the  eigenvec¬ 
tors  and  A  is  the  diagonal  matrix  of  the  eigenval¬ 
ues,  with  >  k2  >  •  •  •  >  Xm*  >  •  •  •  >  \m.  From 
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R  =  AAT,  it  follows  that  the  correlations  between 
any  two  variables  (/,  /)  can  be  recomputed  from 
the  corresponding  factor  loadings:  Y!k=iaikCLkl  =  ri‘- 
The  factor  loadings  (a^),  ranging  between  —1  and 
+  1,  can  be  interpreted  as  correlations  between  the 
latent  and  observed  variables.  The  communality 
j  describes  the  proportion  of  the  variance  in 
variable  j  explained  by  those  m *  factors  that  were 
used  to  construct  the  reduced  space. 

Several  possibilities  exist  for  deciding  on  the 
number  of  dimensions  to  be  included  in  an  inter¬ 
pretation.  The  eigenvalue  (or  Kaiser)  criterion, 
which  is  the  default  in  many  statistical  packages,  is 
most  often  applied  in  the  social  and  political  sci¬ 
ences.  The  rationale  for  this  criterion  is  that  a 
latent  variable  should  explain  more  of  the  varia¬ 
tion  in  the  data  than  an  observed  variable.  Since 
each  observed  variable  has  an  explained  variance 
of  1.0  (the  correlation  of  a  variable  with  itself),  the 
eigenvalues  of  the  factors  to  be  considered  in  the 
solution  have  to  be  greater  than  one — that  is, 
\m,  >  1.  This  is  an  arbitrary  criterion  that  never¬ 
theless  is  sometimes  useful.  An  alternative  criterion 
is  the  so-called  scree  test:  The  eigenvalues  are  plot¬ 
ted  in  successive  order;  the  factor  that  forms  an 
elbow  (i.e.,  a  substantial  change  in  magnitude  in 
the  increasing/decreasing  list  of  eigenvalues)  will 
be  the  first  one  that  is  not  considered  in  the  solu¬ 
tion.  Finally,  it  is  sometimes  preferable  to  consider 
just  those  factors  for  which  one  can  provide  a 
substantive  interpretation. 

After  choosing  the  number  of  factors  to  be 
retained  in  a  solution,  scholars  often  rotate  the 
solution  to  obtain  a  better  interpretation;  varimax 
rotation  is  most  commonly  applied.  In  an  iterative 
procedure,  the  coordinates  of  variables  and  cases 
in  the  m  "  -dimensional  latent  space  are  changed  in 
a  way  that  maximizes  the  sum  of  the  variances  of 
the  squared  loadings  on  each  factor.  Using  this 
technique,  median  loadings  become  either  smaller 
or  larger  and  are  therefore  more  clearly  associated 
with  a  single  factor.  While  varimax  rotation  keeps 
the  orthogonality  of  the  factors  (i.e.,  the  factors 
remain  uncorrelated),  other  rotation  methods  such 
as  oblimin  allow  the  factors  to  be  correlated  after 
rotation. 

The  literature  sometimes  confuses  PCA  and  FA. 
The  difference  between  these  methods  is  that  PCA 
does  not  incorporate  any  error  term,  since  Z  =  FAT 
and  R  =  AAT.  In  contrast,  in  FA,  it  is  assumed  that 


each  variable  has  an  item-specific  error  term  (v-); 
from  this,  it  follows  that  Z  =  FAT  +  V  and  R  = 
AAT  —  VTV.  If  the  errors  are  uncorrelated,  only  the 
main  diagonal  elements  of  the  correlation  matrix  R 
(which  are  by  definition  one)  have  to  be  reduced 
iteratively  to  correct  for  the  item-specific  error. 
The  manner  of  estimating  these  error  terms  differ¬ 
entiates  the  various  methods  of  FA.  The  aim  of 
structural  equation  modeling  (SEM)  is  to  allow  for 
correlations  between  the  error  terms  and  for  cor¬ 
relations  between  the  latent  factors  themselves  and 
to  classify  observed  and  latent  variables  as  depen¬ 
dent  and  independent  ones.  While  PCA  is  used  to 
explore  the  structure  of  the  data,  SEM  is  used  to 
test  whether  the  data  are  consistent  with  the  mod¬ 
els  developed  on  the  basis  of  the  literature. 

Input  for  PCA  and  FA  is  usually  the  matrix  of 
product-moment  correlation  coefficients,  which 
assume  metric  data.  Hence,  the  PCA  of  categorical 
variables  such  as  the  5 -point  Likert  items  in  our 
example  is  problematic.  Nevertheless,  such  vari¬ 
ables  have  often  been  treated  as  if  they  were  met¬ 
ric.  In  the  following,  we  apply  PCA  to  the  ISSP 
data  described  above  to  show  the  solution  that  one 
would  typically  find  in  the  literature.  Using  the 
seven  variables  on  national  pride  as  input  data, 
Table  1  shows  that  the  first  and  second  eigenvalues 
are  2.631  and  1.086,  respectively;  together,  they 
explain  53.1%  of  the  variation.  After  varimax 
rotation,  the  two  eigenvalues  are  2.227  and  1.490 
(Table  1). 

According  to  the  eigenvalue  criterion,  the 
solution  is  two-dimensional  since  only  the  first 
two  eigenvalues  are  greater  than  one.  However, 
the  difference  between  the  first  and  the  second 
eigenvalue  is  large,  while  there  is  only  a  small 
difference  between  the  second  and  the  third 
eigenvalues.  By  the  scree  test  criterion,  the  solu¬ 
tion  would  be  one-dimensional.  With  respect  to 
the  factor  loadings  (Table  2),  all  variables  are 
associated  with  Dimension  1:  Positive  loadings 
are  found  for  the  positively  formulated  items  and 
negative  loadings  for  those  of  reversed  polarity. 
Substantively,  this  factor  appears  to  reflect  the 
extent  to  which  respondents  report  feeling 
national  pride.  On  the  second  dimension,  all 
variables  have  positive  loadings,  regardless  of  the 
content  or  polarity  of  the  item.  This  latent  vari¬ 
able  might  be  interpretable  as  a  methodological 
artifact  known  as  acquiescence  (the  tendency  to 
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Table  I  Principal  Component  Analysis,  Eigenvalues,  and  Explained  Variances 


Initial  Eigenvalues 

Extraction  Sums  o 
Loadings 

f  Squared 

Rotation  Sums  of  Squared 
Loadings 

Dimension 

Total 

%  Variance 

Cumulative 

% 

Total 

%  Variance 

Cumulative 

% 

Total 

%  Variance 

Cumulative 

% 

1 

2.631 

37.586 

37.586 

2.631 

37.586 

37.586 

2.227 

31.818 

31.818 

2 

1.086 

15.519 

53.105 

1.086 

15.519 

53.105 

1.490 

21.288 

53.105 

3 

0.805 

11.505 

64.610 

4 

0.771 

11.018 

75.628 

5 

0.666 

9.511 

85.139 

6 

0.543 

7.760 

92.900 

7 

0.497 

7.100 

100.000 

Source:  Data  from  International  Social  Survey  Programme  (ISSP),  National  Identity  II  (2003).  Distributor:  GESIS  Cologne 
Germany,  ZA  No.  3910. 


agree  with  statements  regardless  of  their  content). 
By  the  scree  test  criterion,  as  well  as  that  of  sub¬ 
stantive  meaning,  the  solution  is  one-dimensional. 
However,  since  most  scholars  use  the  eigenvalue 
criterion,  we  tentatively  follow  this  procedure, 
keeping  the  two-dimensional  solution  and  per¬ 
forming  a  varimax  rotation. 

After  varimax  rotation,  the  loadings  of  the 
positively  formulated  items  remain  almost 
unchanged  on  the  first  dimension,  while  the  load¬ 
ings  of  the  negatively  formulated  items  move  to  the 
second  dimension  to  define  their  own  factor. 
Therefore,  the  rotated  solution  suggests  a  two- 
dimensional  structure  mirroring  just  the  polarity 
of  the  items:  Positive-polarity  items  form  Dimen¬ 
sion  1,  and  negative-polarity  items  define  Dimen¬ 
sion  2.  Findings  such  as  these  have  sparked  a  lively 
debate  about  whether  the  underlying  structure 
consists  of  two  unipolar  concepts  or  one  bipolar 
concept.  Regardless  of  which  position  is  taken,  it  is 
insufficient  to  base  it  just  on  the  rotated  solution 
and  the  eigenvalue  criterion. 

A  more  appropriate  technique  for  analyzing 
ordered  categorical  variables  is  CATPCA,  in 
which  the  categories  are  replaced  by  optimal 
scores.  The  optimal  scoring  process  allows 
order  constraints  to  be  imposed  so  that  ordered 
categorical  variables  get  increasing,  or  at  least 
nondecreasing,  scores  as  the  category  levels 
become  increasingly  severe.  Responses  inconsis¬ 
tent  with  the  implied  ordering  manifest  them¬ 
selves  in  tied  optimal  scores  for  two  or  more 


successive  categories.  In  contrast  to  PCA,  the 
number  of  dimensions  m  *  to  be  used  for  the  fit 
must  be  specified  in  advance,  and  the  solutions 
for  m  *  and  m  *  +  1  dimensions  are  not  nested. 
Once  the  optimal  scores  have  been  calculated, 
they  replace  the  category  codes,  and  the  remain¬ 
der  of  the  analysis  can  be  regarded  as  (classical) 
PCA.  In  short,  CATPCA  is  an  appropriate  tech¬ 
nique  to  display  relationships  between  cases 
associated  with  a  set  of  ordered  categorical 
variables.  Like  PCA,  the  method  produces  eigen¬ 
values  and  explained  variances,  factor  loadings, 
and  factor  scores  with  mean  zero  and  unit  stan¬ 
dard  deviation.  Essentially,  CATPCA  can  be 
regarded  as  PCA  applied  to  ordered  categorical 
data.  An  additional  advantage  of  CATPCA  is 
that  the  solutions  can  be  visualized  with  the 
help  of  biplots.  It  is  possible  to  impose  linearity 
on  the  optimal  scores,  in  which  case  the  biplots 
can  be  expressed  as  vectors.  John  Gower  and 
David  Hand  (1996)  simplified  this  approach  by 
showing  that  biplot  axes  can  be  interpreted  just 
like  other  coordinate  axes — that  is,  by  projec¬ 
tion  and  by  reading  a  value  off  a  scale.  The 
closer  two  biplot  axes  are  to  each  other,  the 
stronger  the  association  of  the  corresponding 
variables  in  the  m  "'-dimensional  space. 

Running  CATPCA  on  the  ISSP  data  described 
above  and  restricting  the  solution  to  two  dimen¬ 
sions,  the  first  factor  explains  38.2%  of  the  varia¬ 
tion  and  the  second  an  additional  15.6%. 
Comparing  the  CATPCA  with  the  PCA  solution 
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Table  2  PCA  and  CATPCA  Factor  Loadings,  Before  and  After  Varimax  Rotation 


PCA,  Unrotated  Solution  PCA,  Rotated  Solution  CATPCA 


Dimension  1 

Dimension  2 

Dimension  1 

Dimension  2 

Dimension  1 

Dimension  2 

Item  A 

.647 

.090 

.602 

-.253 

.651 

.068 

Item  B 

-.556 

.612 

-.165 

.810 

-.558 

.605 

Item  C 

.646 

.346 

.732 

-.033 

.643 

.363 

Item  D 

.719 

.287 

.764 

-.121 

.724 

.289 

Item  E 

.554 

.222 

.590 

-.093 

.563 

.236 

Item  F 

.601 

.148 

.592 

-.180 

.614 

.132 

Item  G 

-.549 

.656 

-.136 

.845 

-.551 

.659 

Source:  Data  from  International  Social  Survey  Programme  (ISSP),  National  Identity  II  (2003).  Distributor:  GESIS  Cologne 
Germany,  ZA  No.  3910. 

Note:  CATPCA  =  categorical  principal  component  analysis;  PCA  =  principal  component  analysis. 


shows  a  small  increase  in  the  explained  variation. 
An  increase  in  explained  variation  is  always  the 
case  since  CATPCA  does  not  restrict  the  items  to 
have  constant  distances  between  successive  catego¬ 
ries  as  does  PCA.  The  differences  in  the  explained 
variances  can  be  interpreted  as  one  indication  of 
the  quality  of  the  data:  The  smaller  the  difference 
between  the  solutions,  the  better  the  data. 
Comparing  the  factor  loadings  of  the  unrotated 
PCA  with  the  CATPCA  solution  (Table  2)  reveals 
only  trivial  differences.  Dimension  1  reflects  the 
difference  between  positive  and  negative  formula¬ 
tions,  while  all  items  have  positive  loadings  on 
Dimension  2. 

Figure  1  shows  the  biplot  axes  of  the  seven  vari¬ 
ables,  with  tick  marks  to  symbolize  the  category- 
level  points  and  with  labels  affixed  to  the  agree 
strongly  end  point.  The  factorial  axes  are  excluded 
since  they  are  not  useful  in  the  biplot  presentation. 
Two  sets  of  items  are  distinguished,  corresponding 
to  the  positive  and  negative  formulations.  As 
numerically  shown  for  PCA  via  varimax  rotation, 
the  two  sets  of  items  could  be  described  as  separate 
factors.  Inspecting  the  distances  between  all  suc¬ 
cessive  categories  on  each  item  shows  that  there 
are  no  ties — that  is,  there  are  no  two  single  catego¬ 
ries  measuring  the  same  level  of  agreement/ 
disagreement  to  a  certain  item.  However,  the  dis¬ 
tances  between  the  categories  are  quite  unequal. 
Take  Item  A  as  an  example:  The  first  category 
[agree  strongly )  is  relatively  far  from  the  second 


category  [agree)-,  in  fact,  the  first  category  is 
located  in  the  negative  parts  of  Dimensions  1  and 
2,  while  the  second  category  is  already  located  in 
the  positive  parts  of  those  dimensions  (i.e.,  on  the 
other  side  of  the  mean,  which  is  the  point  where 
the  lines  for  all  items  cross).  In  other  words,  the 
midpoint  of  the  latent  concept  behind  Item  A  is  not 
“3”;  rather,  it  is  a  point  between  “1”  and  “2.” 
Furthermore,  the  categories  neither  agree  nor  dis¬ 
agree  and  disagree  are  relatively  close  to  each 
other,  indicating  that  these  two  responses  are 
rather  similar  for  the  average  respondent. 


Figure  I  Biplot  Presentation  of  National  Identity 
Data 

Source:  Data  from  International  Social  Survey  Programme 
(ISSP),  National  Identity  II  (2003).  Distributor:  GESIS 
Cologne  Germany,  ZA  No.  3910. 
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In  the  final  step,  we  perform  MCA  on  the  same  dimension  shows  that  all  items  retain  the  ordi- 

data.  While  PCA  restricts  the  data  to  be  metric  (i.e.,  nality  of  their  categories.  Dimension  1  reflects 

all  distances  between  two  successive  categories  are  the  substantive  meaning,  which  is  the  level  of 

equal),  and  while  CATPCA  keeps  the  ordinality  of  national  pride,  with  the  negatively  formulated 

the  successive  categories,  MCA  makes  no  assump-  items  running  in  a  direction  opposite  to  the  posi¬ 
tions  about  the  measurement  level  of  the  variables.  tive  ones.  The  second  dimension  shows  the 

In  sum,  MCA  can  be  regarded  as  PCA  with  (unor-  horseshoe  (or  the  Guttman  effect),  with  extreme 

dered)  categorical  variables.  In  contrast  to  PCA  and  values  in  the  negative  part  and  moderate  values 

CATPCA,  the  visualization  of  the  data  is  usually  of  in  the  positive  part.  This  effect  is  methodologi- 

primary  interest  in  MCA.  cally  induced;  for  the  given  example,  it  has 

Figure  2  presents  the  visualization  of  the  seven  already  been  suggested  that  these  are  due  mainly 

variables  with  its  total  of  35  categories.  In  this  to  acquiescence.  It  is  precisely  to  exclude  meth- 

figure,  the  successive  categories  of  the  individual  odologically  induced  variations  of  such  kinds  in 

variables  are  connected  by  trajectories.  Projecting  ordinal  data  that  Jan  de  Leeuw  (2006)  proposed 

the  categories  from  all  variables  onto  the  first  the  use  of  CATPCA. 


Figure  2  Multiple  Correspondence  Analysis  (MCA)  Presentation  of  National  Identity  Data 

Source :  Data  from  International  Social  Survey  Programme  (ISSP),  National  Identity  II  (2003).  Distributor:  GESIS  Cologne 
Germany,  ZA  No.  3910. 
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It  should  be  noted  that  CATPCA  also  allows  a 
combination  of  metric  and  categorical  (ordered  and 
unordered)  data  within  a  single  analysis.  In  such 
situations,  the  metric  variables  are  restricted  to  a 
metric  scale  with  equal  distances  and  the  ordered 
categorical  variables  to  their  implicit  successive 
order  of  categories,  while  no  order  is  imposed  on  the 
categories  of  the  unordered  categorical  variables. 
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Schema 


A  schema  (plural  schemata)  in  the  field  of  social 
science  represents  a  person’s  mental  structure 


used  to  organize  current  knowledge  about  the 
world  and  to  guide  cognitive  processes  and 
behavior.  In  this  way,  we  use  schemata  to  cate¬ 
gorize  objects  and  events,  based  on  their 
elements  and  characteristics,  to  interpret  and 
predict  the  world.  While  evaluating  the  world, 
new  information  is  being  processed  according 
to  how  it  fits  into  the  mental  structures,  or 
rules.  In  the  field  of  social  science,  and  particu¬ 
larly  in  cognitive  science,  we  retrieve  knowledge 
from  various  areas,  such  as  artificial  intelli¬ 
gence,  with  the  main  goal  of  developing  simpli¬ 
fied  mental  structures  about  our  knowledge  to 
draw  conclusions  about  missing  or  noneviden- 
tial  information,  such  as  during  decision  mak¬ 
ing  or  political  evaluation.  Examples  of  sche¬ 
mata  include  rubrics,  social  roles,  stereotypes, 
and  worldviews.  This  entry  discusses  meanings 
and  the  use  of  schemata,  as  well  as  their  appli¬ 
cation  to  social  science. 

The  Concept  and  Its  Application 

Schema  as  a  concept  was  first  introduced  into 
psychology  by  the  British  psychologist  Frederic 
Bartlett  (1886-1969)  through  his  learning  theory. 
Bartlett  perceived  organized  knowledge  as  an 
elaborate  network  of  abstract  mental  structures 
that  represent  our  understanding  of  the  world. 
Bartlett’s  main  studies  concentrate  on  the  impact 
of  our  cultural  background  in  rephrasing  and 
memorizing  certain  events.  For  example,  in  one  of 
his  most  well-known  studies,  Bartlett  examined 
whether  subjects  could  recall  events  that  strongly 
deviate  from  their  own  environmental  back¬ 
ground,  and  he  showed  that  the  more  culturally 
deviated  one’s  own  background  was  from  that  of 
the  presented  story,  the  less  likely  it  was  that  par¬ 
ticipants  could  remember  the  story.  Bartlett  con¬ 
cluded  that  the  participants  distort  the  presented 
story  in  favor  of  their  own  cultural  stereotypes, 
and  details  that  were  difficult  to  interpret  were 
omitted  because  they  did  not  fit  in  with  the  par¬ 
ticipants’  own  schemata. 

In  general,  the  learner  in  schema  theory  actively 
builds  schemata  and  revises  them  in  light  of  new 
information.  Here,  it  is  important  to  mention 
that  each  schema  is  unique  and  depends  on  an 
individual’s  experiences  and  cognitive  processes. 
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David  Ausubel  (1968),  in  his  meaningful  learning 
theory,  argues  that  there  exists  a  hierarchical  orga¬ 
nization  of  knowledge  where  new  information  will 
be  added  to  the  already  existing  hierarchy.  In  con¬ 
trast,  Jean  Piaget  shows  that  there  is  more  than  one 
body  of  knowledge  available  to  learners.  Piaget 
claims  that  there  exists  a  network  of  context- 
specific  bodies  of  knowledge  and  according  to 
specific  situations  we  apply  a  specific  body  of 
knowledge.  Piaget’s  definition  of  schema  is  useful 
for  interpreting  information  in  which  situation- 
specific  schemata  are  useful  in  distinguishing 
between  two  types  of  categories  of  interpretation 
of  knowledge:  the  expert  and  the  novice.  Experts 
are  more  complex  and  developed  schemata  that 
function  better  in  any  given  domain,  and  the  nov¬ 
ice  has  no  schema  or  inadequate  schema  to  help 
interpret  new  information.  Since  schemata  are 
perceived  as  context  specific,  they  depend  on  the 
individual’s  experience  with  the  subject. 

Beside  interpreting  information,  the  literature 
also  suggests  that  schemata  are  crucial  for  decod¬ 
ing  how  that  information  is  presented  to  oneself. 
One  possibility  is  reflection  in  text  structure. 
This  means  that  readers  use  their  systematic  rep¬ 
resentation  of  text  to  help  them  interpret  the 
text.  Also,  as  Robert  Kaplan  (1966)  points  out,  it 
is  important  to  note  that  an  essay  style  is  cultur¬ 
ally  determined. 

Moreover,  schemata  have  two  types  of  hierar¬ 
chies  that  concentrate  on  the  relationship  between 
the  part  and  the  whole.  It  shows  that  on  the  basis 
of  partial  information  structures,  we  are  able  to 
perceive  the  whole  picture  of  an  event.  This  refer¬ 
ence  is  possible  because  each  schema  has  a  main 
category,  a  so-called  slot  that  connects  different 
semantic  networks  and  provides  us  with  a  whole 
picture  of  an  event  or  object.  For  example,  in  the 
main  slot  “house,”  we  store  the  information 
“wall,”  “roof,”  and  “floor,”  among  others.  Within 
the  context  of  part-whole  relationships,  we  can 
therefore  infer  that  a  house  has  a  wall,  a  roof,  and 
a  floor.  Moreover,  each  schema  is  developed  in  a 
way  that  helps  us  simplify  drawing  conclusions  of 
a  represented  concept.  For  example,  if  we  know 
that  an  object  is  a  door,  then  according  to  the 
definition  of  a  schema  “door,”  we  can  assume  that 
it  has  a  lock,  a  handle,  and  hinges,  which  shows 
that  we  apply  all  the  attributes  of  a  door. 


Moreover,  William  Brewer  and  James  Treyens 
(1981)  studied  the  effects  of  schemata  in  human 
memory.  They  conducted  a  study  in  which  30  sub¬ 
jects  were  brought  into  the  office  of  the  principal 
investigator  and  were  told  to  wait  there.  After 
35  seconds,  the  subjects  were  asked  to  leave  the 
room  and  to  list  everything  that  they  could  recall 
being  in  the  room.  Brewer  and  Treyens  showed 
that  the  subjects  could  recall  all  those  objects  that 
fit  into  their  schema  of  “office  room,”  and  they 
had  a  much  more  deviated  memory  of  those  items 
that  were  not  a  part  of  their  schema.  For  example, 
29  of  the  30  subjects  recalled  that  the  office  had  a 
chair,  a  desk,  and  walls;  however,  only  8  could 
recall  the  anatomic  skull  or  a  writing  pad. 
Interestingly,  9  subjects  mentioned  that  they  had 
seen  books,  when,  in  fact,  there  were  no  books  in 
the  office.  Being  able  to  recall  books  when  books 
were  not  among  those  objects  present  shows  that 
our  memory  of  the  characteristics  of  certain  loca¬ 
tions  depends  on  schemata  that  we  associate  with 
those  types  of  locations. 

Schemata  can  also  contain  so-called  event  con¬ 
cepts,  which  can  be  defined  as  a  conceptual  struc¬ 
ture.  Roger  Schank  and  Robert  Abelson  (1977) 
developed  one  version  of  an  event  concept  that 
they  defined  as  “scripts.”  If  we  define  an  event,  for 
example,  as  a  “restaurant  visit,”  then  we  are  able 
to  determine  stereotypical  sequences  of  action  that 
define  a  restaurant  visit.  A  study  shows  that  differ¬ 
ent  subjects  recalled  different  sequences  from  the 
same  event.  Among  the  total  number  of  32  sub¬ 
jects  in  their  study,  there  was  no  consistent  recall 
of  the  sequences  of  actions.  In  fact,  in  an  overlap 
of  73%  of  all  subjects,  Schank  and  Abelson  found 
that  only  a  stereotypical  sequence  of  “taking  a 
seat,”  “ordering  food,”  “eating,”  and  “paying” 
was  the  same. 

Applying  Schema  Theory 

Applying  a  schema  to  social  science  represents  our 
common  understanding  about  an  event  or  a  con¬ 
cept.  Hence,  each  individual  has  his  or  her  own 
understanding  of  this  event,  which  depends  on  the 
characteristics  that  he  or  she  associates  with  the 
object,  content,  or  location,  which,  in  turn,  depends 
on  his  or  her  cultural  and  political  background. 
Several  misrepresentations  of  an  actual  event 
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described  in  social  science  arise  from  our  own  par¬ 
ticular  cultural-political  background.  Many  areas 
in  political  science  apply  schemata  when  decision 
making  occurs — for  example,  during  peace  nego¬ 
tiations  or  United  Nations  summit  meetings,  where 
a  vast  variety  of  cultural-political  backgrounds  are 
represented.  It  can  be  inferred,  therefore,  that  the 
more  detailed  the  description  of  an  event,  the  more 
likely  are  larger  proportions  of  members  to  reach 
agreement. 

The  literature  on  schemata  concentrates  primarily 
on  strategies  that  focus  on  how  different  representa¬ 
tives  with  a  defined  set  of  schemata  communicate 
and  interpret  the  events.  Here,  abstract  theories  and 
exemplary  theories  apply  to  how  schemata  are 
actually  used  by  distinct  learners.  Abstract  theory 
works  through  the  usage  of  a  prototype  that  repre¬ 
sents  a  certain  category,  and  exemplary  theories 
work  through  the  individual  comparison  with  the 
prototype.  In  the  exemplary  theory,  we  do  not  store 
concepts  in  our  schema  but  individual  examples, 
and  the  schema  functions  through  comparisons 
rather  than  representation,  as  in  abstract  theory. 

Certain  strategies  of  simplifying  schemata 
include  stereotypes  and  archetypes  that  drive  our 
decision-making  process.  Here,  the  literature  sug¬ 
gests  that  numerous  strategies  follow  from  schema 
theory.  Sharon  Widmayer  (2007)  acknowledges 
the  role  of  prior  knowledge  in  cognitive  process¬ 
ing  and  shows  that  already  existing  schemata 
related  to  the  new  information  need  to  be  acti¬ 
vated.  This  is  described  in  the  literature  as  “stimu¬ 
lating  recall  of  prior  knowledge.”  Teachers,  for 
example,  activate  student’s  prior  knowledge 
through  reading  the  heading  and  the  title  before 
starting  a  new  subject  related  to  it.  Another  teach¬ 
ing  strategy  is  using  analogies  and  comparisons  to 
activate  the  learner’s  existing  schema  in  particular 
to  help  learners  draw  connections  among  already 
existing  schemata.  Here,  it  is  crucial  that  teachers 
use  the  most  realistic  familiar  scenarios  instead  of 
conventional  abstract  methods.  With  regard  to 
the  familiarity  of  already  existing  schemata,  the 
literature  points  out  that  with  highly  familiar  sce¬ 
narios,  the  learner  will  activate  the  already  exist¬ 
ing  schemata  more  successfully,  and  the  strongest 
connections  among  schemata  will  thus  be 
achieved.  To  support  this  conclusion,  Edward 
Price  and  Marcy  Driscoll  showed  in  their  1997 
study  that  around  11%  of  the  subjects  could  solve 


a  particular  problem  in  an  unfamiliar  context, 
while  around  58%  of  those  participating  in  the 
study  could  answer  a  very  similar  problem  in  a 
familiar  context. 

In  conclusion,  schemata  help  us  represent  our 
knowledge  of  how  characteristics  of  certain  events 
are  recalled.  However,  there  remain  limitations 
such  as  the  applicability  of  schemata  across 
knowledge  domains  that  are  relevant.  Here,  one’s 
self-knowledge  as  well  as  one’s  cultural-political 
background  are  crucial  determinants. 

Katja  Michalak 
American  University  in  Bulgaria 
Blagoevgrad,  Bulgaria 
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Script 


The  notion  of  cognitive  scripts,  closely  related  to 
schema  theory,  became  especially  important  in  the 
1970s,  when  cognitive  psychologists  (e.g.,  Robert 
Abelson,  1981;  Roger  Schank  &  Robert  Abelson, 
1977) — and,  later,  scholars  of  international 
relations — began  to  explore  the  role  of  cognitive 
shortcuts  in  information  processing  and  decision 


Script  2365 


making.  According  to  Abelson,  one  of  the  leading 
advocates  of  this  approach,  scripts  are  “conceptual 
representations  of  stereotyped  event  sequences.” 
Put  more  simply,  a  script  may  be  thought  of  as  a 
particular  kind  of  schema  or  “mental  box”  that 
provides  the  typical  default  values  for  an  event  of 
some  kind  or  an  act  that  we  are  accustomed  to 
performing,  such  as  watching  a  movie  or  eating 
out  at  a  restaurant.  We  usually  experience  little 
difficulty  dining  at  a  restaurant  we  have  never  vis¬ 
ited  before,  for  instance,  since  we  simply  rely  on 
the  default  values  stored  in  our  memory  to  guide 
our  behavior;  we  wait  to  be  seated  by  the  host  or 
hostess,  a  server  presents  us  with  menus,  we  pick 
what  we  want  to  eat,  we  eat  it  when  it  arrives,  and 
so  on.  By  the  same  token,  if  a  friend  informs  you 
that  she  went  to  see  a  movie  last  night,  you  can 
easily  use  the  default  values  you  keep  in  your  head 
for  typical  visits  to  the  cinema  to  guess  how  her 
evening  probably  went.  Scripts  can  help  decision 
makers  assess  the  nature  of  a  situation  quickly  but 
can  sometimes  be  a  source  of  cognitive  error,  a  fact 
that  has  not  been  lost  on  students  of  foreign  policy 
decision  making.  The  notion  of  scripts  has  been 
broadly  accepted  within  international  relations, 
but  it  has  also  been  subjected  to  various  criticisms 
(as  discussed  below). 

The  concept  of  a  script  clearly  draws  itself  on 
the  cognitive  image  of  a  movie  or  theatrical  script 
in  which  events  are  played  out  one  after  the  other, 
and  it  can  also  be  compared  to  a  cartoon  strip. 
This  idea  is  consistent  with  the  more  general 
notion  that  human  beings  are  “cognitive  misers,” 
and  the  approach  may  be  viewed  as  part  of  the 
bounded  rationality  tradition  within  political  sci¬ 
ence,  which  stresses  the  ways  in  which  individuals 
depart  from  “pure”  or  comprehensive  rationality. 
Rather  than  considering  everything  we  experience 
sui  generis,  according  to  this  view,  we  are  usually 
far  more  economical  in  our  information  process¬ 
ing.  We  commonly  fit  new  sensory  data  into  estab¬ 
lished  mental  categories,  both  because  this  requires 
little  effort  and  because  it  allows  us  to  make  sense 
of  the  outside  world  quickly  and  expeditiously. 
This  is  particularly  the  case  under  conditions  of 
high  uncertainty  and  ambiguity,  when  the  indi¬ 
vidual  is  being  bombarded  with  too  much  infor¬ 
mation,  or  when  he  or  she  possesses  too  little.  As 
well  as  helping  us  make  sense  of  what  has  hap¬ 
pened  in  the  past  or  is  happening  now,  scripts 


often  play  a  strong  predictive  role  in  decision  mak¬ 
ing,  allowing  us  to  ascertain  in  advance  with  a 
reasonable  degree  of  confidence  what  is  likely  to 
occur  in  the  future  (or  at  least  what  we  think  is 
likely  to  occur). 

Scripts  can  also  be  a  major  source  of  cognitive 
error,  however.  Because  they  contain  information 
about  things  that  are  only  typically  true,  at  least  in 
that  individual’s  experience,  there  is  always  the 
potential  for  oversimplification  of  a  novel  stimulus. 
We  may  make  assumptions  based  on  the  typical  or 
prototypical  behaviors  that  may  be  entirely  mis¬ 
leading  or  false.  Scripts  can  also  compete  with  one 
another  in  our  minds  in  situations  where  it  is 
unclear  what  is  actually  happening.  Two  or  more 
scripts  may  seem  relevant  to  the  case  in  hand,  and 
the  decision  maker  may  grapple  with  uncertainty  as 
to  which  best  suits  the  issue  in  question. 

Scholars  of  foreign  policy  decision  making  such 
as  Deborah  Larson  (1985)  and  Michael  Shapiro 
and  G.  Matthew  Bonham  (1973)  were  quick  to 
pick  up  on  the  political  relevance  of  scripts  in 
international  relations.  “Balkanization”  and  “the 
Trojan  Horse,”  for  instance,  are  two  scripts  often 
used  in  international  relations,  and  as  noted  by 
Yaacov  Vertzberger  (1990),  scripts  appear  to  play 
an  especially  prominent  role  in  the  analysis  of  stra¬ 
tegic  threats.  In  American  foreign  policy,  the 
“Munich  script”  and  the  “Vietnam  script”  have 
frequently  influenced  the  deliberations  of  decision 
makers,  both  during  the  Cold  War  and  since.  This 
provides  a  useful  political  example  of  the  ways  in 
which  rival  scripts  can  compete  for  a  decision 
maker’s  attention  and  of  the  manner  in  which 
scripts  are  used  to  predict  future  events.  The  gen¬ 
eration  of  policymakers  who  had  experienced 
World  War  II  were  particularly  attuned  to  the 
memory  of  British  Prime  Minister  Neville  Cham¬ 
berlain  after  the  Munich  conference  of  1938, 
where  he  famously  waved  a  piece  of  paper  on 
which  Adolf  Hitler  had  agreed  not  to  invade 
Poland  and  Western  Europe  in  exchange  for  part 
of  what  was  then  Czechoslovakia.  Chamberlain 
famously  emerged  from  the  conference  promising 
“peace  in  our  time.”  When  this  attempt  at  accom¬ 
modation  failed  and  was  followed  by  World  War 
II,  many  policymakers  (especially  in  the  United 
States)  drew  the  wider  lesson  that  any  effort  to 
“appease”  a  dictator  was  bound  to  lead  to  disas¬ 
ter.  The  widespread  use  of  the  domino  theory  by 
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U.S.  decision  makers  during  the  Cold  War,  for 
instance,  was  clearly  based  on  the  cognitive  appeal 
of  this  script.  From  this  perspective,  attempts 
to  appease  an  adversary  inevitably  lead  to  well- 
intentioned  but  empty  negotiations;  verbal  or  writ¬ 
ten  promises  are  followed  by  betrayal,  causing  a 
nation  to  fight  a  war  against  an  enemy  it  should 
have  confronted  much  earlier  using  its  military 
might. 

During  the  Cuban  missile  crisis,  John  F.  Kennedy 
appears  to  have  had  the  World  War  I  script  in  mind, 
urging  his  advisers  to  read  Barbara  Tuchman’s  book 
The  Guns  of  August-,  he  was  determined  not  to 
allow  events  to  spiral  out  of  control  in  1962,  as  they 
had  in  1914  prior  to  the  onset  of  World  War  I.  The 
Munich  script  was  evoked  on  far  more  numerous 
occasions  during  the  Cold  War,  however,  and  it  was 
perhaps  most  famously  used  by  President  George 
H.  W.  Bush  after  Saddam  Hussein’s  troops  invaded 
Kuwait  in  1990.  Bush  argued  that  if  Hussein’s 
aggression  was  not  confronted  early  on — if  the  Iraqi 
leader  was  appeased,  in  other  words — events  simi¬ 
lar  to  those  of  the  1930s  would  play  out,  albeit  in 
the  Middle  East  rather  than  Europe  and  with 
Hussein  as  the  antagonist  rather  than  Hitler.  This 
cognitive  image  remains  highly  potent  in  American 
political  discourse  today,  despite  the  passage  of  (to 
date)  more  than  70  years;  during  the  2008  presiden¬ 
tial  election  in  the  United  States,  for  instance,  the 
then  candidate  Barack  Obama  was  accused  by 
Republicans  of  appeasement  for  advocating  direct 
talks  with  Iran  over  the  issues  of  nuclear  weapons 
and  terrorism.  The  Vietnam  script — which  remains 
equally  potent — stresses,  on  the  other  hand,  the 
dangers  of  confrontation  rather  than  accommoda¬ 
tion.  Recalling  the  military  and  political  errors  that 
cost  the  United  States  so  much  blood  and  treasure, 
one  popular  interpretation  of  Vietnam  suggests  that 
a  nation  should  exercise  extreme  caution  when  con¬ 
templating  the  use  of  military  force.  Images  of 
“body  bags,”  “getting  bogged  down”  in  enemy  ter¬ 
rain,  and  mass  protests  are  all  evoked  by  the 
Vietnam  script,  as  well  as  the  considerable  human 
and  political  costs  of  a  military  intervention  that 
effectively  destroyed  Lyndon  Johnson’s  presidency. 

The  prominence  of  the  Munich  and  Vietnam 
scripts  in  the  United  States  illustrates  the  perils  of 
“overlearning”  the  lessons  of  a  single  event.  The 
former  illustrates  the  dangers  of  overgeneralizing 
one  set  of  historical  events  into  a  script  that 


purports  to  predict  what  will  usually  happen  when 
a  ruthless  or  expansionist  leader  is  appeased  (or, 
taken  to  its  extreme,  when  any  form  of  diplomacy 
is  used).  Equally,  one  can  argue  that  the  Vietnam 
script  has  often  led  American  leaders  to  be  overly 
cautious  in  responding  to  genuine  threats,  creating 
a  sometimes  inappropriate  reluctance  to  use  mili¬ 
tary  power.  As  a  young  antiwar  activist,  Bill 
Clinton  had  been  strongly  influenced  by  the  costs 
of  Lyndon  Johnson’s  decision  to  escalate  American 
involvement  in  the  Vietnam  War.  As  president,  he 
was  consequently  often  slow  to  use  military  force, 
most  notably  when  the  former  Yugoslavia  began 
to  break  up  in  the  early  1990s.  Eventually,  U.S. 
firepower  was  used  against  the  Serbs  in  1995, 
helping  to  bring  the  parties  together  at  a  peace 
conference  in  Dayton,  Ohio,  but  not  before  hun¬ 
dreds  of  thousands  had  perished  in  the  conflict. 
Similarly,  Clinton  may  also  have  been  reluctant  to 
intervene  in  Rwanda  when  major  acts  of  genocide 
erupted  there  in  1994,  because  he  was  relying  on 
what  might  be  termed  the  Somalia  or  “Black 
Hawk  Down”  script;  the  latter  suggested  that 
military  interventions  and/or  attempts  to  “nation 
build”  in  Africa  are  fraught  with  danger,  drawing 
on  the  cognitive  image  of  the  disastrous  1993 
attempt  by  U.S.  forces  to  capture  the  Somali  war¬ 
lord  Mohammed  Farah  Aidid  during  which  18 
American  soldiers  died  and  73  were  wounded  (the 
death  toll  among  Somalis  was  much  higher). 
Although  the  similarities  between  the  Somali  and 
Rwandan  cases  were  arguably  superficial,  Clinton 
administration  officials  later  admitted  that  the 
dangers  of  committing  troops  to  intervene  in  the 
slaughter  of  Tutsis  by  Hutus  in  Rwanda  domi¬ 
nated  their  thoughts  as  a  result  of  what  had  hap¬ 
pened  the  previous  year  in  Somalia. 

While  the  notion  that  leaders  use  cognitive 
scripts  has  been  generally  (if  perhaps  grudgingly) 
accepted  by  most  scholars  within  international  rela¬ 
tions,  a  number  of  potential  problems  within  this 
area  should  also  be  highlighted.  The  first  is  simply 
that  there  has  been  relatively  little  development  of 
the  script  idea  within  international  relations  or 
political  science  since  the  1980s.  For  good  or  bad, 
the  rational  choice  or  game-theoretical  tradition — 
which  makes  the  “as  if”  assumption  that  decision 
makers  possess  pure  rationality — has  in  general 
proven  more  attractive  to  many  political  scientists 
for  modeling  purposes  than  the  admittedly  messy 
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notion  that  policymakers  employ  a  variety  of  cogni¬ 
tive  shortcuts  (not  the  least  because  the  latter 
assumption  renders  decision  makers  somewhat 
idiosyncratic).  Indeed,  one  of  the  scholars  who 
originally  helped  popularize  the  idea  of  scripts  and 
schemas  within  political  science — Robert  Axelrod — 
joined  the  rational-choice  tradition  some  years  ago. 
Second,  many  scholars  of  foreign  policy  decision 
making  have  preferred  to  use  the  competing  but 
obviously  related  notion  of  analogical  reasoning  in 
their  work,  a  now  well-established  tradition  that 
covers  the  same  ground  as  early  work  on  scripts  but 
has  generally  proven  itself  to  be  somewhat  more 
popular  among  scholars  of  foreign  policy  analysis. 
Third — and  in  a  related  vein — there  has  been  a  gen¬ 
eral  failure  in  the  literature  to  disentangle  the  notion 
of  scripts  from  strongly  related  concepts  such  as 
schemas  and  analogies.  If  scripts  are  simply  “event 
schemas,”  do  we  really  need  the  former  label?  Are 
scripts  merely  the  same  thing  as  historical  analogies, 
or  are  they  something  analytically  distinct?  Last, 
there  has  also  been  an  overemphasis,  perhaps,  on 
the  “cold”  information-processing  tradition  from 
which  the  focus  on  scripts  derived,  and  some  politi¬ 
cal  psychologists  are  themselves  turning  away  from 
the  computational  analogy  on  which  much  of  the 
original  literature  of  schemas  and  scripts  was  based. 

Nevertheless,  whatever  terminology  we  use, 
scripts  will  remain  highly  relevant  to  the  study  of 
politics  because  both  elite  decision  makers  and 
ordinary  voters  must  often  make  decisions  with 
only  incomplete  information  about  the  situation 
at  hand.  Political  actors  can  and  do  make  incor¬ 
rect  inferences  by  fitting  individuals  or  events 
into  the  wrong  categories  or  scripts,  based  on 
their  observable  but  all  too  frequently  superficial 
similarities. 
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Secession 


Secession  is  the  formal  withdrawal  from  an  estab¬ 
lished,  internationally  recognized  state  by  a  con¬ 
stituent  unit  to  create  a  new  sovereign  state.  This 
entry  analyzes  the  meaning  of  the  notion,  its  empir¬ 
ical  dynamics,  and  its  justification  and  finally 
addresses  the  key  question  on  the  reasons  why  there 
are  not  a  higher  number  of  secessions  in  the  world. 

The  decision  to  secede  represents  an  instance  of 
political  disintegration,  when  the  citizens  of  a  sub¬ 
system  withdraw  their  political  activities  from  the 
central  government  to  focus  on  their  own  center. 
To  the  observer,  secession  may  appear  irrational  as 
it  often  entails  the  sacrifice  of  economic  opportuni¬ 
ties  and  the  endurance  of  social  upheaval.  Because 
of  the  state’s  opposition  and  monopoly  of  coercive 
force,  secessionist  struggles  frequently  become  vio¬ 
lent  and  protracted.  Thus,  secession  is  disintegra¬ 
tive  in  the  most  fundamental  sense:  It  involves  not 
the  overthrow  of  the  existing  government  but 
rather  its  territorial  dismemberment. 

The  breakup  of  the  Soviet  Union  and  Ethiopia 
and  the  fragmentation  of  Yugoslavia  and  Czecho¬ 
slovakia  have  created  some  20  new  states  through 
secession,  which  has  grudgingly  received  interna¬ 
tional  recognition  and  legitimation.  Indeed,  from 
1776  onward,  secession  has  been  one  of  the  most 
frequent  ways  of  creating  new  states.  With  the  col¬ 
lapse  of  the  Austro-Hungarian,  Ottoman,  and 
Russian  empires  after  World  War  I,  numerous  new 
states  were  created  through  secession.  Given  the 
countless  unresolved  cultural  and  territorial  dis¬ 
putes  and  the  many  unsatisfied  aspirations  of 


2368  Secession 


nations  in  Africa,  Asia,  and  even  in  the  West — for 
example,  Quebec,  Catalonia,  the  Basques,  Flanders, 
Scotland,  and  Wales — secession  continues  to  be  a 
force  to  be  reckoned  with  in  international  politics. 

Dynamics 

The  dynamics  of  secession  rest  on  four  precondi¬ 
tions:  nation,  territory,  leaders,  and  discontent. 
Secession  demands  must  be  presented  by  an  identi¬ 
fiable  community  that  is  smaller  than  the  state  and 
that  threatens  to  withdraw  if  not  satisfied.  Questions 
of  identity  underpin  this  community  of  people,  or 
nation,  who  perceive  the  characteristics  that  distin¬ 
guish  their  members  from  other  groups,  who  feel  a 
commitment  to  each  other,  and  who  then  under¬ 
take  the  challenge  of  changing  their  circumstances. 
This  community  must  be  associated  with  the  terri¬ 
tory  on  which  it  would  establish  its  newly  indepen¬ 
dent  state.  Without  effective  leadership  to  translate 
community  needs  into  demands,  threats  to  the 
nation  might  merely  degenerate  into  social  unrest 
as  pent-up  frustrations  are  vented.  Discontent  with 
the  current  circumstances  motivates  the  communi¬ 
ty’s  demands  for  change.  Often  a  nation  is  bound 
together,  and  perhaps  even  defined,  by  common 
claims  of  discrimination,  neglect,  exploitation,  or 
repression  in  economic,  political,  cultural,  linguis¬ 
tic,  or  religious  terms.  The  U.S.  Declaration  of 
Independence  points  to  the  “unbearable  tyranny  of 
the  state”  as  both  the  reason,  in  the  sense  of  provid¬ 
ing  the  motivating  force,  and  the  moral  justification 
for  secession. 

Embedded  in  any  secession  lies  the  perceived 
justice  of  the  community’s  cause.  The  debate  sur¬ 
rounding  the  “right  of  secession”  and  its  close 
relative,  “the  right  of  national  self-determination,” 
revolves  around  (a)  the  argument  that  secession 
may  be  justifiable  either  in  circumstances  where 
state  rule  over  a  nation  is  particularly  oppressive 
and  tyrannical  or  when  a  majority  of  a  territorially 
concentrated  community  desires  secession  and 
(b)  the  argument  that  secession  may  be  desirable 
due  to  the  benefits  it  provides  for  nations  to  orga¬ 
nize  themselves  according  to  their  own  values. 

Justification 

Those  seeking  to  justify  secession  extend  the  argu¬ 
ment  that  if  a  society  may  overthrow  an  unbearable 


government,  then  a  segment  of  the  population  may 
also  remove  itself  from  a  particularly  objectionable 
government.  John  Stuart  Mill  acknowledged  that 
freedom  and  liberty  may  not  be  possible  when  the 
state  is  an  artificial  amalgamation  of  two  or  more 
distinct  nations  with  one  dominating  the  govern¬ 
ment  and,  thus,  conceded  that  secession  may  be  a 
necessary  alternative  to  promote  liberty.  One  need 
only  think  of  the  experiences  of  the  Armenians,  Jews, 
Kurds,  and  people  from  many  other  nations  facing 
government-sponsored  slaughter  to  understand  how 
nations  would  believe  that  they  could  guarantee  their 
safety  and  even  survival  through  possessing  their 
own  state.  For  revolution  or  secession,  thus,  the 
underlying  principle  of  protest  is  the  same. 

President  Woodrow  Wilson  extended  the  argu¬ 
ment  from  when  secession  may  be  justifiable  to 
when  it  may  be  desirable.  Above  all  else,  Wilson 
sought  peace  based  on  justice.  He  believed  that  the 
subjugation  of  one  nation  by  another  was  unjust 
and  would  lead  inevitably  to  a  threat  to  peace. 
Implicit  in  Wilson’s  approach  lies  the  belief  that 
secession  was  a  desirable  way  to  create  states 
coterminous  with  nations.  Every  historic  nation 
should  possess  the  same  right  to  organize  its  com¬ 
munal  affairs  according  to  its  own  values,  not 
simply  those  who  through  foresight,  geography,  or 
luck  gained  their  own  independent  states.  Some 
have  even  argued  that  a  more  peaceful  world  may 
not  be  possible  without  the  completion  of  the 
international  system  of  states  through  continued 
secession  based  on  national  aspirations. 

Low  Incidence  of  Secession 

Given  these  arguments,  one  might  reasonably 
question  why  there  have  not  been  a  larger  number 
of  successful  secessions.  According  to  one  account, 
the  world  possesses  approximately  8,000  different 
languages  and  cultures  but  fewer  than  200  inde¬ 
pendent  states.  The  forces  arrayed  against  seces¬ 
sion — both  domestic  and  international — go  far 
beyond  simply  the  philosophical  difficulties  sur¬ 
rounding  the  “right  of  secession.” 

Secession,  by  its  very  nature,  presents  the  inter¬ 
national  territorial  system  with  instability  and 
chaos.  The  potential  conclusion  for  the  logic  of 
secession  would  be  the  infinite  division  of  existing 
political  entities.  More  often  than  not,  given  their 
legitimate  monopoly  of  force  within  a  territory, 
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most  states  have  effectively  opposed  secession 
attempts.  And  even  if  the  secessionist  community 
were  to  win  on  the  battlefield,  it  stands  to  lose  the 
diplomatic  contest.  The  principles  of  state  sover¬ 
eignty  and  territorial  integrity  function  as  funda¬ 
mental  norms  of  post-1945  international  relations, 
thereby  effectively  limiting  secession  as  a  means  of 
altering  existing  borders.  Article  2(4)  of  the  United 
Nations  Charter  states, 

The  Organization  and  its  Members  .  .  .  shall  act 
in  accordance  with  the  following  Principles:  .  .  . 

(4)  all  Members  shall  refrain  in  their  international 
relations  from  the  threat  or  use  of  force  against 
the  territorial  integrity  or  political  independence 
of  any  state. 

As  a  consequence,  the  international  community 
has  consistently  withheld  diplomatic  recognition 
from  secessionist  entities. 

The  comparative  study  of  secessions  and  sepa¬ 
ratist  movements  from  Western  liberal  democra¬ 
cies,  former  communist  states,  and  developing 
countries  reveals  that  a  nation’s  decision  to  secede 
can  be  thought  of  as  a  function  of  its  appraisal  of 
its  circumstances.  Whether  or  not  to  attempt  seces¬ 
sion  depends  on  the  costs  and  benefits  associated 
with  the  nation’s  political  alternatives:  continued 
membership  within  the  existing  state  or  secession. 
This  appraisal  is  dynamic  and  continuous  and  can 
be  affected  both  by  policies  of  the  central  govern¬ 
ment  and  by  developments  in  the  international 
system.  Indeed,  although  the  road  to  independence 
is  frequently  a  long  one,  the  actual  decision  to 
secede  only  arises  when  government  action  or  inter¬ 
national  developments  change  the  community’s 
view  of  the  balance  among  these  costs  and  benefits. 
A  nation’s  decision  to  seek  independence  can 
change  with  circumstances;  in  some  instances,  sen¬ 
sitive  government  policy  can  convince  a  secessionist 
community  to  accept  greater  autonomy  within  the 
existing  state,  while  in  other  cases,  evolution  in  the 
prevailing  international  climate  can  outweigh 
domestic  factors  in  the  dynamic  of  secession. 
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Secret  Services 


From  an  academic  point  of  view,  there  is  no  canon¬ 
ical  definition  of  secret  services.  The  word  secret 
qualifies  the  most  apparent  characteristic,  even  if 
hidden,  of  these  services’  modus  operandi. 
Obviously,  unlike  other  administrations,  the  secret 
services  are  discreet  and  their  activities  are  hidden, 
but  the  term  secret  gives  no  indication  of  their 
exact  role.  The  secret  services’  function  is  first  to 
gather  information.  The  two  main  areas  in  which 
they  specialize  are  intelligence  and  counterespio¬ 
nage.  They  are  also  responsible  for  a  third  mission, 
referred  to  as  “covert  actions.”  In  most  countries, 
these  three  activities  are  implemented  by  several 
intelligence  agencies.  However,  this  kind  of  defini¬ 
tion  only  insists  on  a  functional  dimension. 

Nature  and  Function  of  Secret  Services 

In  1966,  the  U.S.  Central  Intelligence  Agency 
analyst  and  academic  historian  Sherman  Kent 
wrote  a  short,  seminal  book  in  which  he  defined 
the  three  dimensions  of  intelligence  services: 
(1)  these  services  are  organizations ,  (2)  they 
undertake  activities ,  and  (3)  they  produce  knowl¬ 
edge.  This  approach  is  very  helpful  for  scholars 
insofar  as  it  shows  that  intelligence  agencies  are 
not  merely  the  tools  of  executive  authorities.  The 
very  nature  of  secret  services  means  that  they  try 
to  escape  from  all  kinds  of  external  scrutiny  and 
especially  from  academic  analysis.  As  a  result, 
current  literature  concerning  the  secret  services  is 
influenced,  on  the  one  hand,  by  journalistic  bias 
(the  need  for  scoops  and  permanent  suspicion 
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toward  what  is  hidden)  and,  on  the  other,  by  con¬ 
spiracy  theories.  Today,  this  lack  of  academic 
literature  is  problematic  especially  since  the 
assessment  of  “global”  threats  and  the  so-called 
war  against  terror  have  increased  the  role  played 
by  intelligence  organizations.  During  the  Cold 
War,  most  countries,  apart  from  the  former  USSR 
and  the  United  States,  had  only  limited  means.  In 
recent  years,  most  intelligence  services  have  seen 
their  size,  budget,  and  means  increase  consider¬ 
ably.  They  are  no  longer  an  insignificant  part  of 
the  state,  a  weak  side  of  public  policies.  That  is 
one  of  the  reasons  why  governments  in  liberal 
democracies  have  tried  to  improve  accountability 
and  oversight. 

Most  intelligence  agencies  now  publish  some  of 
their  reports  and  documents  and  edit  their  own 
official  histories  (usually  written  by  academics), 
and  the  parliamentary  committees  in  charge  of 
overseeing  their  activities  edit  the  annual  reports  on 
the  agencies.  It  is  surprising,  then,  to  note  the  pau¬ 
city  of  academic  literature  considering  the  ever- 
increasing  available  material  (the  current  published 
official  documents  and  declassified  archives).  For  a 
long  time,  intelligence  has  been  described  as  the 
“missing  dimension”  in  the  study  of  international 
relations,  but  the  same  could  also  be  said  of  domes¬ 
tic  policy  studies.  Despite  the  criticism  of  intelli¬ 
gence  activities  by  its  detractors,  the  first  academic 
works  were  published  in  the  1970s  by  “practitio¬ 
ner-scholars”  like  Sherman  Kent.  Academic  schol¬ 
ars  began  to  work  on  the  topic  in  the  1980s.  Today, 
intelligence  studies  is  an  authentic  scientific  research 
field  with  its  own  research  centers,  departments  in 
college  and  universities,  and  journals  and  series. 
Different  stakeholders  meet  regularly  at  confer¬ 
ences  and  contribute  to  journals.  The  majority  of 
academic  literature  comes  from  English-speaking 
political  scientists  specializing  in  international  rela¬ 
tions  and  from  historians.  This  would  indicate  that 
intelligence  is  not  yet  entirely  recognized  as  a  bona 
fide  subject  by  all  social  sciences:  The  same  can  be 
said  of  sociologists  and  scholars  working  on  poli¬ 
tics  while  not  taking  intelligence  into  account.  The 
situation  is  worse  in  continental  Europe,  where  the 
first  works  only  appeared  at  the  very  beginning  of 
the  21st  century. 

Despite  this  doubtful  state  of  affairs,  it  is  possi¬ 
ble  to  pinpoint  some  truths  in  the  field.  The  special¬ 
ization  of  a  part  of  the  state  in  the  gathering  of 


information  illustrates  the  existence  of  a  specific 
function  in  the  contemporary  state.  Secrets  func¬ 
tion  like  a  veil:  They  hide  the  fact  that  intelligence 
services  are  organizations — that  is,  one  part  of  the 
bureaucracies  in  our  liberal  democracies.  Indeed, 
these  secret  bureaus  have  to  cope  with  special 
rules — essentially  concerning  information  and 
law — yet  they  are  nevertheless  a  growing  part  of 
states.  The  activities  of  the  secret  services,  whose 
aim  is  to  provide  governments  with  intelligence, 
sometimes  of  a  strategic  nature,  make  them  power¬ 
ful.  Despite  the  progress  of  civil  liberties  and  trans¬ 
parency,  this  core  of  the  state  remains  hidden  for 
the  most  part.  In  this  secret  area,  the  state  finds 
more  autonomy  and,  theoretically,  more  power. 
Intelligence  agencies  are  an  important  part  of  what 
could  be  called  the  secret  state.  In  fact,  the  study  of 
secret  services  leads  us  away  from  the  view  that  the 
state  has  become  more  transparent  and  has  less 
influence  on  citizens  in  the  long  term.  Nowadays, 
the  existence  of  secret  states  in  liberal  democracies 
could  restimulate  studies  on  the  current  nature  of 
states. 

The  Political  Dimension 

One  can  also  consider  the  existence  of  intelligence 
agencies  in  the  state  from  a  political  perspective. 
Despite  the  theory  of  the  law-abiding  state,  whereby 
the  entire  state  has  to  comply  with  the  law  and 
implement  neutral  policies,  the  existence  of  the 
secret  services  linked  to  the  political  authority 
would  indicate  that  some  part  of  the  administra¬ 
tion  is  not  a  neutral  tool  for  high-ranking  officials 
but,  conversely,  that  its  nature  is  profoundly  politi¬ 
cal.  Politics  in  intelligence  operates  at  three  differ¬ 
ent  levels. 

1.  The  secret  services  are  located  within  the 
state  but  are  very  close  to  political  authorities. 
The  sensitive  dimension  of  intelligence  often 
transforms  the  agencies  into  advisors  or  think 
tanks.  This  means  that  the  agencies  can  be  used  to 
elaborate  political  agendas  or  they  can  themselves 
influence  them. 

2.  Then,  intelligence  is,  of  course,  a  tool  for 
political  authority  when  it  implements  policies  at 
home  and  abroad.  Many  public  policies  are  not 
neutral.  Here,  scholars  face  a  problem.  The  fact 
that  the  most  important  parts  of  these  policies  are 
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not  publicized  diminishes  the  capacity  to  assess 
them,  and  at  the  same  time,  it  automatically  sug¬ 
gests  that  they  are  more  efficient  and  powerful 
than  they  perhaps  really  are.  It  is  impossible  to  go 
beyond  this  limit,  except  for  scholars  working  on 
archives  or  historical  material. 

3.  Politics  in  intelligence  is  not  only  linked  to 
the  position  of  the  agencies  within  the  state;  being 
civil  servants  or  contractors,  their  members  are 
sometimes  in  a  situation  where  they  can  express 
their  political  opinion  through  their  professional 
occupations.  The  politicization  of  agencies  is  a 
common  case. 

Finally,  intelligence  activity  is  one  part  of  pub¬ 
lic  policies  and  one  part  of  politics.  This  implies 
that  the  very  essence  of  the  state  remains  close  to 
politics.  It  means  that  the  Weberian  illusion  of  a 
neutral  state  preoccupied  only  with  efficiency 
must  be  left  aside  when  studying  the  secret  state. 

Recent  Developments  and  Future  Directions 

Most  agencies  first  appeared  in  the  20th  century. 
At  this  time,  their  staff  was  composed  of  civil  ser¬ 
vants  and  military  officers.  The  fact  that  the  agen¬ 
cies  served  the  state  was  indubitable.  However,  the 
nature  of  the  intelligence  services  has  considerably 
evolved  in  recent  decades  because  of  a  significant 
transformation  in  the  conception  of  their  work 
and  their  management.  Now,  more  and  more 
agencies  have  recourse  to  contractors  or  to  people 
coming  from  the  private  sector  for  short  missions. 
Furthermore,  retired  officials  from  agencies  often 
work  for  private  intelligence  firms  that  become 
contractors  for  the  agencies.  The  classical  borders 
between  private  sector  and  public  administration 
are  vanishing  slowly.  This  has  an  effect  on  the  way 
we  define  the  secret  state,  which  remains  secret  in 
its  process  but  has  become  less  public.  Finally,  the 
transparency/secrecy  tandem  seems  to  be  more 
prevalent  than  the  private/public  one. 

If  a  sociological  study  of  the  state  could  help 
demarcate  the  nature  of  the  secret  state  and  outline 
its  real  borders,  empirical  works  would  be  neces¬ 
sary  to  delineate  the  limits  of  the  notion  of  intelli¬ 
gence  community  that  is  often  used  by  scholars 
and  also  by  officials.  The  fact  that  people  working 
in  or  for  the  agencies  are  subjected  to  clearance 
processes  and  use  sensitive  information  isolates 


them  from  other  professionals.  This  does  not  mean 
that  intelligence  careers  avoid  the  traditional  fea¬ 
tures  of  other  professions.  The  notion  of  an  intel¬ 
ligence  community  produces  a  false  feeling  of 
cohesiveness  that  is  reinforced  by  the  secret  dimen¬ 
sion  of  its  activity.  A  sociological  study  of  the  dif¬ 
ferent  stakeholders  and  professions  inside  the 
agencies  could  enable  us  to  study  them  from  a 
more  impartial  point  of  view  in  order  to  under¬ 
stand  the  relationship  between  the  different  agen¬ 
cies  and  within  each  agency  in  each  country. 

If  this  could  be  helpful  to  understand  what  intel¬ 
ligence  is  today,  its  study  is  also  useful  to  approach 
the  features  of  secrecy  in  contemporary  democra¬ 
cies.  Indeed,  the  secrecy  of  agencies  is  not  only 
something  that  is  not  submitted  to  publicity; 
throughout  the  20th  century,  the  state  enacted  spe¬ 
cial  regulations  to  protect  what  it  judged  to  be  most 
sensitive.  These  regulations  became  the  borders  of 
the  emerging  contemporary  secret  state,  and  the 
first  written  judicial  rules  were  in  fact  the  first  rules 
to  hamper  the  agencies  in  their  being  a  part  of  the 
secret  state.  The  creation  of  official  secrecy  through 
regulations  and  classification  processes  was  a 
major  step  in  the  secret  state-building  process,  but 
it  also  refers  to  its  features  in  the  shadows  of  liberal 
democracy.  This  shows  that  democracy  accepts  and 
adjusts  to  a  special  area  where  transparency  and 
liberalism  are  ineffective.  The  fact  that  a  Freedom 
of  Information  Act  (FOIA)  was  passed  in  the 
United  States  and  in  the  United  Kingdom  (UK)  is  a 
landmark  in  this  evolution.  Today,  the  FOIA  is  a 
lawful  weapon  against  public  secrecy,  even  if  not 
all-powerful. 

Finally,  it  is  important  to  understand  that  the 
comprehension  of  intelligence  needs  both  compari¬ 
son  and  a  historical  approach.  There  is  no  single 
model  of  a  secret  state  or  of  an  intelligence  agency. 
The  building  of  the  agencies  is  the  result  of  a  long 
process  that  began  in  the  19th  century.  Wars  and 
international  crises  have  played  a  major  role  in  this 
evolution.  However,  despite  these  common  features 
and  the  current  expansion  of  agencies  after  the  end 
of  the  Cold  War,  the  essence  of  intelligence  agencies 
is  very  different  depending  on  the  country.  If  we 
can  talk  of  a  U.S.-U.K.  model  in  so  far  as  most 
agencies  aspire  to  have  the  same  position  in  the 
state  as  in  these  countries,  the  reality  is  very  differ¬ 
ent.  The  means  accorded  to  the  agencies,  the  role  of 
the  law,  the  kind  of  states  into  which  the  agencies 
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fit,  and  their  influence  on  public  policies  are  not  the 
same.  This  emphasizes  once  again  the  need  for 
more  social  science  studies  to  understand  what 
intelligence  really  is. 

Sebastien  Laurent 
University  of  Bordeaux 
Bordeaux,  France 
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Secularism 


The  term  secularism  has  different  senses.  It  is  used 
to  refer  to  secular  humanism  and  atheism;  the 
social  process  of  secularization;  and  political, 
state-driven  projects.  This  entry  is  concerned 
solely  with  the  third  sense,  which  is  most  fre¬ 
quently  used  in  political  science. 

What  Is  Political  Secularism? 

Broadly  speaking,  secularism,  anywhere  in  the 
world,  means  the  advocacy  of  a  separation  of  orga¬ 
nized  religion  from  organized  political  power  (the 
state)  that  is  inspired  by  a  specific  set  of  values.  In 


this  general  sense,  secularism  is  a  normative  doc¬ 
trine  that  pretends  to  be  universal,  although  it  is 
applied  in  different  cultures  that  are  more  or  less 
compatible  with  it.  Secularism  can  be  understood 
in  its  strict  meaning  of  separateness  between  polit¬ 
ical  power  and  religion  or,  more  broadly,  as  bear¬ 
ing  several  values  such  as  toleration,  equal  religious 
liberty,  and  freedom  of  religion.  Sometimes  these 
two  definitions  may  be  in  contradiction,  as  illus¬ 
trated  in  France  over  the  issue  of  Muslim  women 
wearing  the  veil.  However,  its  constitutive  ele¬ 
ments  can  be  differently  interpreted,  giving  rise  to 
multiple  forms  of  secularism. 

Secular  states  are  disconnected  from  religion  at 
three  distinct  levels:  (1)  ends,  (2)  institutions  and 
personnel,  and  (3)  law  and  public  policy.  This  dis¬ 
tinguishes  them  from  theocracies  and  states  that 
establish  single  or  multiple  religions.  In  a  theoc¬ 
racy,  a  deep  connection  between  state  and  religion 
exists  at  all  three  levels.  A  priestly  order  directly 
administers  the  state  by  reference  to  what  it 
believes  are  ends  inscribed  in  divine  laws  (e.g.,  the 
Islamic  Republic  of  Iran  as  Ayatollah  Khomeini 
aspired  to  run  it).  In  states  with  established  reli¬ 
gions,  in  which  one  religion  is  accepted  as  the 
official  one,  a  priestly  order  does  not  govern 
directly,  and  a  large  measure  of  institutional  and 
personnel  differentiation  exists.  This  disconnec¬ 
tion,  at  Level  2,  also  referred  to  in  some  contexts 
as  church-state  separation,  goes  hand  in  hand  with 
an  overall  ideological  connection.  Religion  and 
state  share  common  ends.  The  state  is  subordinate 
to  religious  ends  even  though  it  has  its  own  func¬ 
tion,  power  structure,  and  internal  norms.  Thus,  in 
states  with  an  established  religion,  because  of  this 
primary  connection  at  Level  1,  there  is  an  auto¬ 
matic  connection  at  Level  3.  For  example,  the 
revenue  collected  by  the  state  is  available  for  reli¬ 
gious  purposes. 

A  secular  state  is  nontheocratic  and  has  no 
established  religion.  Disconnection  from  religion 
at  this  level  distinguishes  secular  states  from  both 
theocracies  and  states  with  established  religion.  A 
secular  state  has  its  own  secular  ends.  The  second- 
order  disconnection,  church-state  separation, 
demarcates  it  only  from  a  theocracy. 

Given  that  ethical  reasoning  is  best  when  it  is 
contextual  and  comparative,  what  are  the  posi¬ 
tive  and  negative  aspects  of  secular  and  nonsecu¬ 
lar  states?  Historically,  nonsecular  states  have 
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recognized  a  particular  version  of  the  religion 
enunciated  by  the  dominant  church  as  the  official 
religion,  compelled  individuals  to  participate  in 
only  one  church,  punished  them  for  failing  to 
profess  a  particular  set  of  religious  beliefs,  and 
levied  taxes  in  support  of  one  particular  church. 
Nonsecular  states  embody  a  regime  of  inequality 
between  religions  (e.g.,  between  Christians  and 
Jews)  and  also  among  the  churches  of  the  same 
religion.  Societies  with  such  states  were  either 
wracked  by  inter-/intrareligious  wars  or  have  per¬ 
secuted  minority  religious  groups.  States  with 
substantive  establishments  have  not  changed  with 
time:  such  as  Saudi  Arabia,  Pakistan,  and  Israel. 
Opponents  of  theocratic  states  and  of  those  with 
an  established  religion  argue  that  such  states  per¬ 
petuate  inter-  and  intrareligious  domination  and 
are  incompatible  with  the  values  of  freedom  and 
equality. 

Secular  States  and  Religion-Centered  States 

Are  all  secular  states  better  than  religion-centered 
states?  This  does  not  follow.  As  mentioned  above, 
a  secular  state  must  have  secular  ends.  However, 
these  ends  themselves  may  be  of  two  kinds.  The 
first  are  amoral.  Amoral  secular  states  are  so  called 
when  their  entire  purpose  is  to  maximize  power, 
wealth,  or  both.  But  just  as  without  separation 
there  is  no  true  secularism,  a  value-less  separation 
does  not  add  up  to  secularism.  A  state  may  separate 
itself  from  the  ethics  and  morality  of  religions  for 
wealth  and  power,  as  we  can  observe  in  plutocra¬ 
cies  (government  based  on  wealth)  or  in  totalitarian 
regimes  that  are  based  on  material  ideologies.  Such 
a  state  is  secular  but  does  not  instantiate  the  nor¬ 
mative  perspective  of  secularism  and  in  fact  is 
incompatible  with  it. 

Value-based  secular  states  are  distinct  from 
amoral  states.  Several  values  are  constitutive  of 
secularism:  prevention  of  cruelty,  toleration,  equal 
religious  liberty,  the  freedom  to  exit  from  religion 
and  to  profess  no  religion,  and  equality  of  both 
passive  and  active  citizenship  rights  (equal  distri¬ 
bution  of  benefits  such  as  protection  of  life,  avail¬ 
ability  of  health  care,  emergency  services,  welfare 
benefits,  and  education  and  nondiscrimination  in 
the  distribution  of  the  right  to  vote,  to  stand  for 
public  office,  and  to  deliberate  on  the  public 
good).  Furthermore,  some  of  these  values  can  be 


interpreted  both  individualistically  and  nonindivid- 
ualistically  (the  right  to  worship  or  education  can 
be  given  to  both  individuals  and  communities — that 
is,  to  maintain  their  religious  practices  or  to  set  up 
their  own  schools). 

These  values-based  secular  states  differ  from 
one  another  in  the  selection  and  combination  of 
values  and  the  weight  assigned  to  each,  but  most 
important  in  how  they  unpack  the  metaphor  of 
separation  at  a  third  level,  concerning  laws  and 
public  policy.  One  type  of  secular  state  conceives 
of  disconnection  at  the  third  level  in  a  wholly  one¬ 
sided  manner.  To  disconnect  is  then  to  exclude 
religion  from  its  affairs  but  without  setting  any 
limits  on  its  own  interventionist  powers  in  the 
affairs  of  religion.  Such  states  (e.g.,  the  former 
communist  states,  Kemalist  Turkey,  and  to  some 
extent  France)  exclude  religion  to  regulate,  con¬ 
trol,  or  sometimes  destroy  it  in  the  name  of  a  single 
value  such  as  equality.  This  may  help  states  deal 
with  aspects  of  intrareligious  domination  (e.g., 
anticlericalism  in  France)  but  not  to  address  inter¬ 
religious  domination — that  is,  when  members  of 
one  religious  community  discriminate  against, 
marginalize,  or  even  oppress  members  of  another 
religious  community. 

The  explanation  of  this  contradiction  is  to  be 
found  in  history:  Issues  of  radical  individual  free¬ 
dom  and  citizenship  equality  arose  in  European 
societies  after  religious  homogenization.  The  birth 
of  confessional  states  was  accompanied  by  the  mas¬ 
sive  expulsion  of  subject-communities  whose  faith 
differed  from  the  religion  of  the  ruler.  Such  states 
found  a  place  for  some  toleration  in  their  moral 
landscape,  but  this  was  consistent  with  the  deep 
inequalities  and  the  virtually  invisible,  marginalized 
existence  of  members  of  religious  minorities.  The 
liberal  democratization  and  the  consequent  secular¬ 
ization  of  many  European  states  have  helped  citi¬ 
zens  with  non-Christian  faiths  acquire  most  formal 
rights.  But  critics  of  secularization  and  advocates  of 
pluralism  and  consociative  order  may  argue  that 
such  a  scheme  of  rights  neither  embodies  a  regime 
of  interreligious  equality  nor  effectively  prevents 
religion-based  discrimination  and  exclusion. 
Indeed,  it  masks  majoritarian,  ethno-religious 
biases.  This  is  evident  in  the  different  kinds  of  dif¬ 
ficulties  faced  by  Muslims.  For  example,  in  Britain, 
a  third  of  all  primary  school  children  are  educated 
by  religious  communities.  Yet  applications  for  state 
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funding  by  Muslims  have  been  frequently  turned 
down.  This  is  also  manifest  in  demands  by  the 
Muslims  that  they  be  allowed  to  build  mosques  (as 
in  Germany,  Italy,  and  Switzerland)  or  have  proper 
Muslim  burial  grounds  of  their  own  (as  in 
Denmark).  Another  ongoing  area  of  controversy, 
especially  in  Western  European  states  such  as 
France,  concerns  policies  that  restrict  the  wearing 
of  the  hijab  (headscarf),  niqab  (face  veil),  or  burka. 

A  second  type  of  values-based  secular  state  con¬ 
ceives  of  this  third-level  disconnection  as  mutual 
exclusion.  Such  a  state  maintains  a  policy  of  strict 
or  absolute  separation  in  which  religion  is  excluded 
from  the  affairs  of  the  state  but  the  state  too  is 
excluded  from  the  affairs  of  religion.  The  state  has 
neither  a  positive  relationship  with  religion — for 
example,  there  is  no  policy  of  granting  aid  to  reli¬ 
gious  institutions — nor  a  negative  relationship  with 
it;  it  is  not  within  the  scope  of  state  activity  to  inter¬ 
fere  in  religious  matters,  even  when  the  values  pro¬ 
fessed  by  the  state  are  violated.  This  noninterference 
is  justified  on  the  ground  that  religion  is  a  private 
matter,  and  if  something  is  amiss  within  the  private 
domain,  it  can  be  mended  only  by  those  who  have 
a  right  to  do  so  within  that  sphere.  This,  according 
to  proponents  of  this  view,  is  what  religious  free¬ 
dom  means.  When  a  state  is  disconnected  from 
religion  at  all  three  levels  in  this  particular  way,  then 
we  may  say  that  a  wall  of  separation  has  been 
erected  between  the  two.  American  political  secular¬ 
ism  could  exemplify  some  aspects  of  this  model. 

It  is  now  increasingly  clear  that  both  forms  of 
Western  secularism  have  persistent  difficulties  cop¬ 
ing  with  community-oriented  religions  such  as 
Roman  Catholicism,  Islam,  and  Sikhism,  which 
demand  a  greater  public  presence  and  even  official 
recognition  for  themselves — particularly  when 
they  live  in  the  same  society.  Moreover,  they  were 
not  designed  for  societies  with  deep  religious  diver¬ 
sity.  Both  of  these  versions  developed  in  the  con¬ 
text  of  a  single-religion  society  and  to  solve  the 
problems  of  one  religion,  namely,  Christianity. 
Both  understand  separation  as  exclusion  and  make 
individualistically  conceived  values — individual 
liberty  or  equality  between  individuals  or  both — as 
the  ground  for  separation. 

India  provides  another  model  of  secularism, 
with  multiple  religions  being  a  part  of  its  founda¬ 
tion.  Indian  secularism  is  inextricably  tied  to  deep 
religious  diversity.  It  has  a  commitment  to  multiple 


values — liberty  and/or  equality — not  conceived  of 
narrowly  as  pertaining  to  individuals  but  inter¬ 
preted  broadly  to  cover  the  relative  autonomy  of 
religious  communities.  While  there  are  boundaries 
between  the  state  and  religion,  these  boundaries 
are  porous.  Thus,  the  state  may  intervene  to  inhibit 
some  practices,  just  as  it  respects  and  lends  support 
to  other  practices,  of  a  religious  community. 
India’s  commitment  to  multiple  values  and  princi¬ 
pled  distance  means  that  the  state  tries  to  balance 
different,  ambiguous,  but  equally  important,  val¬ 
ues.  Thus,  its  secular  ideal  resembles  a  contextual, 
ethically  sensitive,  politically  negotiated  arrange¬ 
ment  that  evolved  to  deal  with  tensions  generated 
continuously  by  deep  religious  diversity. 

Rajeev  Bhargava 
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Security  and  Defense  Policy 


The  term  security  policy  denotes  the  aims  and 
means  that  are  needed  to  protect  a  state  from  both 
external  and  internal  threats.  Defense  policy  has  a 
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narrower  extension,  usually  confined  to  the  aims 
and  means  against  external  military  threats. 

Security  and  defense  policies  have  always  been 
at  the  center  of  international  politics,  but  their 
nature  has  changed  due  to  the  advent  of  nuclear 
weapons  and  their  proliferation,  economic  inter¬ 
dependence,  the  end  of  the  Cold  War,  environ¬ 
mental  problems,  technological  advancements 
and  vulnerabilities,  as  well  as  other  material  and 
cultural  developments  typically  linked  to  global¬ 
ization.  These  phenomena  have  also  brought 
about  conceptual  changes  and  competing  termi¬ 
nological  usages. 

Traditional  and  Contemporary  Views 

It  is  widely  assumed  that  every  state  must  have  a 
security  and  defense  policy  if  it  wants  to  survive  in 
the  anarchical  international  system.  Security  is  seen 
as  the  key  national  interest  of  states,  and  that  is 
why  defense  is  needed.  Security  and  defense  policy 
is  thus  seen  as  belonging  to  high  politics,  which 
implies  that  it  occupies  a  superior  hierarchical  posi¬ 
tion  among  state  activities  and  ranks  higher  in 
importance  than  other  fields  of  foreign  policy. 

These  traditional  views  of  security  and  defense 
policy  are,  however,  in  many  ways  challenged  in 
today’s  world  politics.  Both  terms,  security  and 
defense,  have  been  widened  and  are  broadly 
applied  in  various  ways.  Also,  the  assumption  that 
security  and  defense  must  be  given  priority  (a  nor¬ 
mative  claim)  or  are  given  priority  (an  empirical 
claim)  among  various  policy  issues  is  seen  as  ques¬ 
tionable.  Alternative  views  of  security  have  been 
put  forward  by  critical  security  theorists,  environ¬ 
mentalists,  and  proponents  of  “human  security.” 

Security  is  a  difficult  concept,  because  it  can  refer 
both  to  the  objective  absence  of  threats  and  to  a  sub¬ 
jective  feeling  of  security.  It  is  also  debatable  whether 
security  means  an  absolute  absence  of  threats  or 
whether  security  refers  to  a  state  of  affairs  where  the 
likelihood  or  “risk”  of  major  threats  is  low. 

The  traditional  view  links  defense  and  security  to 
military  affairs.  It  starts  from  the  assumption  that 
there  is  a  security  dilemma  in  world  politics  because 
of  the  anarchical  nature  of  the  international  system. 
Because  there  is  no  central  authority,  states  need  to 
be  prepared  for  defending  themselves.  International 
security  has  been  associated  with  balance  of  power 
and  stability. 


Against  this  background  of  traditional  security 
thinking,  it  can  be  suggested  that  “defense”  and 
“security”  are  actually  euphemisms  for  war  and 
military.  States  used  to  have  ministries  for  war; 
after  World  War  II,  they  were  renamed  ministries 
for  defense.  Military  officers  are  “security  profes¬ 
sionals,”  and  “private  security  companies”  con¬ 
duct  war  operations.  “Security  studies”  is  often 
another  name  for  strategic  or  war  studies.  If  secu¬ 
rity  and  defense  policy  are  understood  in  this  way, 
the  term  denotes  all  military  activities  regardless  of 
whether  they  have  to  do  with  security  as  a  goal 
and  whether  they  are  defensive  or  offensive.  This 
definition,  however,  reflects  quite  a  common  usage 
that  emerged  after  World  War  II. 

Security  policy  is  often  seen  as  a  combination  of 
foreign  and  defense  policy.  The  administrative  util¬ 
ity  of  adopting  the  concept  of  security  was  in  tight¬ 
ening  the  link  between  defense  and  foreign  offices, 
for  example,  in  the  United  States  through  the 
establishment  of  a  National  Security  Council.  In 
some  countries,  the  concept  of  security  policy 
implies  that  foreign  policy  goes  (or  should  go) 
ahead  of  defense  policy.  According  to  this  view, 
defense  policy  is  a  subset  of  security  policy, 
whereas  parts  of  foreign  policy,  such  as  develop¬ 
ment  cooperation,  can,  but  do  not  necessarily,  fall 
outside  the  realm  of  security  policy. 

There  is  no  agreed  definition  of  security  policy 
partly  because  there  is  no  agreement  on  the  notion 
of  security.  The  narrow  notion  of  security  empha¬ 
sizes  military  threats,  but  a  broader  notion  of  secu¬ 
rity,  comprehensive  security,  includes  all  kinds  of 
“new”  threats,  such  as  financial  instability,  climate 
change,  environmental  risks,  natural  catastrophes, 
energy  shortage,  organized  crime,  piracy,  drugs, 
technological  hazards,  international  migration, 
and  epidemic  diseases.  Moreover,  a  security  policy 
can  also  address  internal  threats,  such  as  political 
opposition,  social  unrest,  famine,  aging,  or  street 
violence.  These  days,  one  of  the  meanings  most 
often  used  for  “security  policy”  deals  with  the 
security  of  information  systems. 

The  problem  with  the  wider  understanding  of 
security  is  that  the  concept  lacks  focus.  For  exam¬ 
ple,  one  important  border  to  uphold  is  the  differ¬ 
ence  between  intentional  and  unintentional  threats. 
Moreover,  although  a  wider  concept  of  security 
has  become  widely  accepted  also  by  many  state 
authorities,  this  has  not  necessarily  led  to  notable 
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changes  in  practices.  The  role  of  the  military  in 
understanding  security  and  defense  has  not  disap¬ 
peared;  it  has  just  been  framed  within  a  larger  view 
of  security. 

In  particular,  after  the  terrorist  attacks  of  2001 
in  the  United  States,  the  internal  aspects  of  security 
policy  have  become  more  salient.  Administratively, 
therefore,  ministries  of  interior  and  other  agencies 
such  as  ministries  of  justice  have  become  much 
more  active  agents  of  security  policy  than  before. 
At  the  same  time,  technical  methods  of  producing 
security  through  control  and  surveillance  have 
dominated  the  discussion  about  how  to  manage 
various  security  problems. 

The  concept  of  defense  policy  can  also  be 
related  to  actions  other  than  military.  Like  “secu¬ 
rity,”  “defense,”  too,  has  a  larger  broader  defini¬ 
tion  that  includes  all  measures  that  are  taken  to 
defend  a  country  or  an  organization  against  all 
sorts  of  threats.  It  can  thus  denote  the  economic, 
civil,  and  psychological  aspects  of  defense  in  addi¬ 
tion  to  the  military  aspect  and  covers  activities 
during  both  war  and  peace.  Such  a  concept  of 
“total  defense”  has  been  adopted  in  particular  in 
small  countries  such  as  Austria,  Finland,  Sweden, 
Switzerland,  and  Singapore.  Peace  movements 
have  also  proposed  other  alternative  understand¬ 
ings  of  defense,  laying  emphasis  on  civilian  resis¬ 
tance  and  nonviolent  means. 

Among  the  alternative  paradigms  to  the  tradi¬ 
tional  notion  of  security,  proponents  of  human 
security  suggest  that  human  needs  should  be  at  the 
center  of  security  concerns  and  that  this  would  put 
emphasis  on  development  as  well  as  on  health 
issues  and  the  economic,  social,  and  political  rights 
of  individuals.  Advocates  of  critical  security  stud¬ 
ies  are  generally  dissatisfied  with  the  traditional, 
realist  view  of  security.  They  also  want  to  broaden 
the  understanding  of  security  and  emphasize  the 
potential  of  overcoming  security  problems  through 
emancipation.  Postmodern  approaches  to  security 
regard  security  discourses  as  essential  in  creating 
state  identities  in  the  first  place. 

The  theory  of  securitization  developed  by  the 
so-called  Copenhagen  school  of  security  also  sub¬ 
scribes  to  a  broader  view  of  security  consisting  of 
various  sectors,  such  as  military,  political,  eco¬ 
nomic,  societal,  and  environmental  sectors.  It  also 
stresses  that  labeling  something  as  a  security  threat 
is  a  speech  act,  which  means  that  there  are  no 


security  threats  before  they  are  made  into  such  in 
the  political  discourse.  Because  securitization  means 
lifting  security  issues  above  normal  politics,  often 
justifying  emergency  measures  and  exceptions,  it 
could  hamper  open  political  debate  and  have  nega¬ 
tive  effects  on  the  policy-making  process. 

Characteristics  and  Implementation 
of  Security  and  Defense  Policies 

States  are  usually  seen  as  principal  actors  that 
adopt  and  implement  security  and  defense  policies, 
partly  because  they  are  seen  as  legitimate  holders 
of  means  of  violence.  However,  in  particular,  if  the 
military  aspect  of  security  and  defense  is  down¬ 
played  also,  international  organizations,  regional 
actors,  international  companies,  and  other  human 
groupings  can  have  a  security  or  defense  policy. 

The  object  of  security  or  defense  can  also  vary. 
It  can  be  the  state,  its  government,  critical  infra¬ 
structure  or  territory,  society,  population  or  indi¬ 
viduals,  or  principally  anything  that  is  seen  as 
worth  securing  and  defending.  It  is  also  possible  to 
talk  about  global,  common,  or  international  secu¬ 
rity  and  to  defend  the  existing  world  order  or  the 
ecosystem. 

Many  countries  and  institutions  have  their  own 
specific  definitions  for  the  terms  security  and 
defense.  After  the  terrorist  attacks  of  2001,  the 
United  States  adopted  the  concept  of  “homeland 
security”  to  deal  with  military,  terrorist,  and  other 
threats  inside  the  territory  of  the  country.  For 
example,  when  launching  the  Common  Foreign 
and  Security  Policy  (CFSP)  and  the  European 
Security  and  Defense  Policy  (ESDP),  the  European 
Union  adopted  its  own  conceptual  scheme,  accord¬ 
ing  to  which  security  and  defense  policy  deals  with 
crisis  management  and  not  with  common  defense. 

Security  and  defense  policies  have  traditionally 
been  surrounded  by  a  realm  of  secrecy,  informa¬ 
tion  gathering  through  intelligence,  and  a  lack  of 
democratic  control  by  parliaments.  Many  states, 
however,  lay  out  their  doctrine  on  security  and 
defense  in  the  form  of  a  “white  book”  or  a  public 
statement  on  security  or  defense.  Such  governmen¬ 
tal  documents  create  transparency,  promote  public 
discussion  and  legitimacy,  and  enable  long-term 
strategic  planning. 

A  nation’s  defense  can  take  various  forms  even 
when  restricted  to  the  traditional  military  meaning 
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of  the  term.  One  basic  choice,  for  example,  deals 
with  offensive  and  defensive  defense  strategies. 
Another  choice  is  whether  to  acquire  nuclear  or 
other  nonconventional  weapons.  A  state  relying  on 
conventional  forces  has  to  find  the  right  balance 
between  air,  naval,  and  land  forces.  States  can  also 
choose  whether  to  emphasize  territorial  defense  or 
intervention  forces.  Furthermore,  states  can  choose 
between  conscription  and  a  professional  army  or  a 
mixed  system  and  can  decide  whether  to  use  pri¬ 
vate  security  companies.  Finally,  defense  policy 
also  covers  various  other  political  and  social  issues 
such  as  civil-military  relations  and  democratic 
control  of  the  military,  defense  economics,  and 
gender  and  sexual  equality  within  the  military. 

States  also  face  security  policy  choices  and  can 
develop  different  security  strategies.  Besides  choos¬ 
ing  between  a  narrow  and  broader  concept  of 
security  for  the  basis  of  their  security  policy,  states 
can  choose  whether  or  not  to  securitize  certain 
threats.  Furthermore,  states  can  try  to  seek  security 
within  an  alliance,  in  a  multilateral  framework  of 
collective  security,  or  alone,  for  example,  on  the 
basis  of  a  policy  of  neutrality.  The  classic  security 
strategies  in  world  politics  are  balancing,  band- 
wagoning,  and  buck-passing.  Balancing  means 
gathering  strength  against  the  opponent  either  by 
domestic  mobilization  or  with  external  alliances; 
bandwagoning  leads  one  to  accommodate  and  join 
the  opponent;  and  buck-passing  refers  to  shifting 
the  responsibility  for  resisting  the  opponent  to 
other  actors. 
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Security  Apparatus 


The  concept  of  security  apparatus  builds  on  dis¬ 
tinct  yet  functional  elements  with  dynamic  and 
structural  value.  This  entry  examines  security  as 
the  state  of  being — or  at  least  feeling — secure  and 
considers,  in  particular,  the  meaning  of  security 
defined  as  the  safety  of  a  state  or  an  organization 
against  criminal  activities  such  as  terrorism  or 
espionage  and  other  potential  dangers.  The  secu¬ 
rity  apparatus  is  considered  as  a  complex  struc¬ 
ture,  usually  a  segment  of  a  larger  organization,  as 
well  as  the  equipment  required  for  a  particular 
purpose.  The  structures  responsible  for  providing 
security  to  a  society  and  its  population  reflect  the 
contemporary  values  held  by  a  country  or  any 
other  human  organization.  As  such,  the  security 
apparatus  also  provides  the  state  of  affairs  of  the 
underlying  cultural,  social,  and  political  elements 
and,  in  addition  to  its  primary  function,  allows  a 
“health  check”  of  a  society  that  can  help  assess  its 
status  and  predict  its  future.  This  entry  first  con¬ 
siders  various  ways  in  which  security  and  security 
threats  have  been  defined  and  then  describes  vari¬ 
ous  types  of  threats  to  security  at  the  national, 
regional,  and  global  levels.  A  centralized  national 
approach  to  security  (e.g.,  the  U.S.  Office  of 
Homeland  Security)  is  contrasted  with  the  decen¬ 
tralized  European  approach,  in  which  responsibil¬ 
ity  is  distributed  among  various  agencies. 
Descriptions  of  the  security  apparatus  in  Italy  and 
France  illustrate  two  ways  in  which  nations  may 
implement  security  policy.  Last,  the  role  of  inter¬ 
national  security  organizations  is  considered. 

Evolving  Definitions  of  Security 

The  geopolitical,  social,  and  economic  develop¬ 
ments  that  have  occurred  since  the  last  decade  of 
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the  20th  century  have  made  the  distinction  between 
internal  and  external  security  increasingly  blurred, 
thereby  influencing  both  the  concept  and  the  struc¬ 
ture  of  security  apparatuses  throughout  the  world. 
The  rise  in  independent  states  and  in  worldwide 
travel  and  exchange,  for  example,  and  a  more 
open  communication  society  have  greatly  influ¬ 
enced  the  strategic  thinking  underpinning  national 
and  international  security  affairs. 

Whether  geared  toward  facing  risks,  threats, 
or  actual  dangers,  the  security  apparatus  will 
require  several  basic  elements  to  meet  its  chal¬ 
lenges,  as  it  would  normally  be  triggered  by  an 
asymmetric  scenario  where  the  attacker  would 
have  at  its  disposal  a  range  of  targets  to  strike 
through  different  tactics.  Hence,  a  security  strat¬ 
egy  will  represent  the  basic  framework  of  any 
security  apparatus,  one  encompassing  all  the  ele¬ 
ments  that  may  need  protection  as  well  as  the 
tools  and  measures  available  to  provide  such 
protection.  The  key  to  any  strategy  is  accurate 
information  and  skilled  users  who  are  able  to 
receive  it  and  take  appropriate  action.  Therefore, 
information  gathering  and  processing  also  need 
to  be  assessed  in  view  of  their  value  and  meaning 
for  a  security  apparatus. 

A  list  of  what  must  be  protected  and  the  cor¬ 
responding  security  measures  may  be  only  tenta¬ 
tive  due  to  the  dynamic  character  of  the  security 
environment;  nonetheless,  the  broad  categories  of 
internal  and  external  security  may  generally  be 
broken  down  into  elements  such  as  resources, 
infrastructures,  and  civil  protection;  natural  and 
man-made  disasters;  policing,  intelligence,  mili¬ 
tary,  and  diplomatic  sectors;  nongovernmental 
and  intergovernmental  cooperation;  and  low- 
intensity  conflict  and  state  of  war.  An  open-ended 
list  may  be  further  developed  through  an  exami¬ 
nation  of  issues  of  concern  to  national  security, 
including  “megaprotection”  in  conventional, 
strategic,  and  nuclear  deterrence;  serious  and 
organized  crime,  including  illicit  arms  and  drugs 
trafficking,  and  terrorism;  commercial,  industrial, 
financial,  and  economic  competitiveness;  environ¬ 
mental  decay;  and  cultural  and  ethical  values. 

The  range  of  security  concerns  that  may  come 
to  the  forefront  of  public  concern  is  matched  by 
the  depth  of  the  apparatus  called  on  to  ensure  the 
security,  safety,  and  well-being  of  individual  soci¬ 
eties.  The  traditional  divide  between  external  and 


internal  security  has  normally  reflected  the  distinc¬ 
tion  between  conventional  and  emerging  threats, 
which  largely  fell  within  the  mandate  of  the  mili¬ 
tary  and  the  civil  infrastructures,  respectively.  The 
evolution  toward  a  multipolar  and  multidimen¬ 
sional  environment  contributes  to  giving  a  new 
shape  to  the  security  concept  and  relevant  appara¬ 
tus,  particularly  as  it  concerns  domestic  security. 

Threats  to  Security 

The  structure  of  the  security  apparatus  is  influenced 
by  the  threats  affecting  security,  whether  at  the 
domestic,  regional,  or  global  level.  Since  World  War 
II,  the  Western  Hemisphere,  particularly  the  Western 
European  countries,  has  known  an  unprecedented 
period  of  peace  and  stability  mostly  due  to  their 
cooperation  through  common  institutions  such  as 
the  Council  of  Europe,  the  European  Communities, 
and  the  North  Atlantic  Treaty  Organization 
(NATO).  Still,  Europe,  just  like  other  major  interna¬ 
tional  actors,  including  the  United  States,  Russia, 
China,  India,  or  Japan,  faces  security  threats  and 
challenges  that  originate  both  within  and  beyond 
the  regional  borders.  As  a  matter  of  fact,  the  2003 
European  Security  Strategy  was  premised  on  the 
consideration  that,  while  large-scale  aggression 
against  any  member  of  the  European  Union  was 
unlikely,  Europe  faced  new  threats  of  a  more 
diverse,  less  visible,  and  less  predictable  fashion. 

Terrorism  and  its  complex  social,  cultural,  reli¬ 
gious,  and  developmental  causes  constituted  the 
first  security  threat  on  the  European  agenda, 
because  of  the  then  still  fresh  memory  of  the 
September  11,  2001,  attacks  on  the  United  States; 
the  bombings  in  Madrid,  Spain,  on  March  11, 
2004;  and  the  bombings  in  London,  United 
Kingdom,  on  July  7,  2005.  European  countries 
have  been  and  continue  to  be  both  a  target  and  a 
base  for  terrorism,  whether  politically  motivated 
or  linked  to  religious  extremism.  This  continuing 
threat  raises  questions  about  the  integration  of 
foreign  communities;  it  also  draws  attention  to  the 
stark  contrast  between  modernization  and  secular¬ 
ism,  on  the  one  hand,  and  strict  observance  and 
religious  zeal,  on  the  other. 

Proliferation  of  weapons  of  mass  destruction 
was  assessed  as  potentially  the  greatest  threat  to 
European  security  given  the  risks  implied  in  an 
arms  race  in  bordering  regions,  especially  in  the 
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Middle  East,  and  the  increased  potency  of  weap¬ 
ons  facilitated  by  advances  in  the  biological  sci¬ 
ences,  which  could  be  supported  by  the  spread  of 
missile  technology.  The  deadly  scenario  of  a 
terrorist  group  armed  with  weapons  of  mass 
destruction  and  able  to  inflict  damages  previously 
available  only  to  state  actors  demonstrates  the 
equation  produced  by  the  combination  of  conven¬ 
tion  and  emerging  threats,  which  requires  a  uni¬ 
form  reaction. 

Regional  conflicts  were  also  listed  as  top  secu¬ 
rity  concerns  for  they  threaten  stability,  destroy 
human  lives  and  social  and  physical  infrastruc¬ 
tures,  and  challenge  fundamental  freedoms  and 
human  rights.  Furthermore,  regional  conflicts  may 
lead  to  extremism,  which  facilitates  and  provides 
opportunities  for  two  other  strategic  security 
issues:  state  failure  and  organized  crime. 

The  collapse  of  the  state  not  only  entails  domes¬ 
tic  threats  such  as  bad  governance,  corruption, 
weak  institutions,  and  a  lack  of  accountability  but 
also  has  international  resonance.  Failed  states  also 
risk  providing  a  home  for  terrorist  and  organized 
criminal  groups,  thereby  undermining  regional 
stability  and  global  governance. 

Finally,  organized  crime  represents  an  internal 
security  threat  with  an  important  external  dimen¬ 
sion  through  its  activities,  particularly  cross-border 
illicit  trafficking  and  its  possible  links  with  terror¬ 
ist  activities. 

Most  recently,  the  international  economic  and 
financial  crisis  affecting  all  national  systems  has 
added  to  the  list  of  security  threats  in  view  of  its 
implication  for,  among  other  things,  the  growth  of 
extremist  and  antisystem  movements  that  can  desta¬ 
bilize  weak  societies  and  the  increased  geopolitical 
tensions  between  countries  that  are  struggling  to 
defend  critical  national  assets  such  as  the  banking 
or  the  industrial  sector.  The  economic  crisis  may 
affect  peace  and  stability  in  countries  and  entire 
regions  that  have  emerged  from  or  are  still  strug¬ 
gling  with  conflict  and  tension,  thereby  affecting 
their  own  security  as  well  as  the  security  of  their 
neighbors  with  increasing  risks  for  the  national 
financial  systems,  including  the  security  of  sovereign 
financial  viability.  Security  risks  may  become 
opportunities  for  political  movements  or  criminal 
organizations  seeking  to  exploit  weak  or  unstable 
governments  through  social  unrest,  predatory  finan¬ 
cial  practices,  or  common  criminal  activities. 


The  security  apparatus  builds  on  prevention 
and  law  enforcement  and  intelligence  efforts  to 
ensure  domestic  security,  combating  terrorism  and 
protecting  the  public,  infrastructure,  and  historical 
and  cultural  sites.  Recent  terrorist  attacks  have  led 
to  suggestions  for  strengthening  legislation  and 
other  measures  dealing  with  border  control,  illegal 
immigration,  and  related  criminal  activities  such  as 
trafficking  in  human  beings,  transport  security, 
and  terrorism  financing;  however,  countries  with  a 
multiagency  approach  normally  assign  comple¬ 
mentary  tasks  to  the  different  agencies.  The  Italian 
and  the  German  security  apparatuses  are  exam¬ 
ined  as  they  typify  the  European  system  for  secu¬ 
rity  protection,  while  each  represents  a  specific 
subsystem,  namely,  the  central  approach  and  the 
federal  approach,  respectively. 

Organization  of  the  Security  Apparatus: 

Contrasting  Approaches 

Two  clear  approaches  emerge  among  those  coun¬ 
tries  that  have  been  confronted  with  serious  natu¬ 
ral  and  man-made  disasters  over  the  past  decade. 
On  the  one  hand,  the  United  States,  following 
deadly  terrorist  attacks  on  its  soil,  attacks  against 
its  interests  abroad,  and  natural  disasters,  under¬ 
went  a  thorough  reorganization  of  its  national 
security  and  security  apparatus,  which  led  to  the 
establishment  of  the  Department  of  Homeland 
Security.  At  the  other  end  of  the  spectrum,  coun¬ 
tries  such  as  France,  Germany,  Italy,  Spain,  and 
the  United  Kingdom  have  elected  to  preserve  the 
existing  organization  of  their  security  apparatuses 
where  responsibilities  are  allocated  through  differ¬ 
ent  public  agencies  and  levels.  The  different  phi¬ 
losophies  behind  the  choice  between  a  unified, 
central  department  and  a  system  of  central  and 
local  agencies  reflect  historical,  cultural,  and  polit¬ 
ical  considerations,  chiefly  the  debate  over  the 
protection  of  civil  liberties  and  the  need  to  ensure 
protection  against  deadly  dangers,  which  involves 
a  proper  balance  between  increased  security  and 
the  adequate  safeguarding  of  personal  freedoms. 

While  the  U.S.  Department  of  Homeland 
Security  builds  on  a  concept  of  security  that 
includes  border  protection,  transport  security, 
emergency  preparedness,  and  the  protection  of 
critical  infrastructures,  European  countries  have 
mainly  invested  in  law  enforcement  efforts  and 
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have  sought  to  integrate  counterterrorism  and  pre¬ 
paredness  programs  into  emergency  management 
efforts  with  a  view  to  greater  flexibility  in  respond¬ 
ing  to  security  challenges  with  often  limited  per¬ 
sonnel  and  resources.  Regardless  of  the  security 
apparatus  elected  to  ensure  the  safety  and  well¬ 
being  of  the  nation,  a  consistent  pattern  has 
recently  emerged  whereby  increased  investments 
are  made  in  intelligence  and  law  enforcement  ini¬ 
tiatives,  and  often  a  combination  thereof,  to  fight 
against  organized  crime  and  terrorism. 

At  a  time  when  the  traditional  distinction 
between  foreign  and  domestic  threats  is  blurring, 
the  ability  to  plan  and  carry  out  devastating 
attacks — particularly  through  cyber  warfare  or 
chemical,  bacteriological,  radiological,  and  nuclear 
agents — enables  relatively  small  groups  of  terror¬ 
ists  or  criminals  to  challenge  state  actors  and  their 
defenses.  Even  those  countries  with  experience  in 
terrorist  attacks  and  counterterrorism  policies 
need  to  improve  their  security  apparatus  in  a  way 
that  allows  for  ready  information  exchange  among 
disparate  government  agencies  that,  by  their  very 
nature,  professional  culture,  or  legal  requirements, 
have  been  used  to  create  organizational  barriers, 
with  the  consequent  segregation  of  duties  and 
knowledge. 

The  vexing  question  concerning  the  exchange  of 
information  among  law  enforcement  and  intelli¬ 
gence  agencies  brings  with  it,  as  related  issues,  the 
balance  between  strategic  and  tactical  intelligence; 
data  collection,  its  exploitation,  and  dissemination 
in  view  of  the  intrinsic  need  to  maintain  secrets, 
methods,  and  sources;  a  review  of  the  classification 
methods  that  would  allow  sharing  information 
with  other  agencies  nationally  and  abroad;  the 
exploitation  of  intelligence  data  in  the  framework 
of  law  enforcement  operations  intended  to  investi¬ 
gate  crimes  for  prosecution  and  adjudication;  and 
the  need  to  improve  security  while  respecting  civil 
liberties. 

Two  European  Models 

Italy 

In  Italy,  the  government  (Council  of  Ministers), 
comprising  the  head  of  the  government  and  the 
ministers,  is  the  collective  decision-making  body, 
and  it  coordinates  the  efforts  made  by  governmental 
ministries  and  agencies  to  protect  domestic  security. 


The  Ministry  of  Interior  is  the  lead  government 
ministry  on  public  order  and  security,  immigration 
and  border  controls,  and  civil  protection.  The 
Public  Security  Department  manages  the  national 
police  force  and  is  responsible  for  implementing  the 
public  order  and  security  policy;  the  Civil  Liberty 
and  Immigration  Department  sets  immigration  and 
asylum  policies;  and  the  Fire  Brigade,  Public  Aid, 
and  Civil  Defense  Department  leads  efforts  in 
emergency  preparedness  and  response  policies. 

The  activities  carried  out  by  the  Ministry  of 
Interior  are  normally  supported  by  several  other 
government  ministries  with  responsibility  over 
particular  aspects  of  national  security,  including 
the  Defense  Ministry,  the  Health  Ministry,  and  the 
Infrastructure  and  Transport  Ministry.  Customs 
and  financial  policies  are  under  the  responsibility 
of  the  Economy  and  Finance  Ministry. 

To  coordinate  the  various  efforts,  interministe- 
rial  bodies  are  usually  set  up  to  bring  together 
department  ministers  and  officials.  The  Public 
Order  and  Security  Committee  examines  issues 
related  to  the  protection  of  public  order  and  secu¬ 
rity  and  to  the  organization  of  the  police  forces. 
The  committee  is  chaired  by  the  interior  minister 
and  comprises  the  chief  of  police/director  general 
of  public  security  and  the  undersecretary  entrusted 
by  the  prime  minister  with  special  responsibility 
for  the  intelligence  services.  The  heads  of  the  other 
police  forces  also  take  part  in  the  committee,  and 
additional  ministers  may  be  involved  depending 
on  the  issue  at  hand.  The  Interministerial  Com¬ 
mittee  for  Intelligence  and  Security,  which  pro¬ 
vides  advice  and  makes  proposals  on  the  general 
direction  and  fundamental  objectives  for  intelli¬ 
gence  and  security  policy,  is  chaired  by  the  prime 
minister  and  comprises  the  ministers  for  the 
Interior,  Defense,  Foreign  Affairs,  Justice,  and 
Economy  and  Finance  departments,  while  the 
heads  of  the  intelligence  services  and  other  gov¬ 
ernment  officials  may  also  be  invited  to  partici¬ 
pate.  Finally,  the  security  apparatus  includes  the 
Politico-Military  Unit  and  the  Department  of  Civil 
Protection  of  the  Prime  Minister’s  Office,  which 
have,  each  in  its  own  respect,  responsibilities  for 
drawing  up  and  implementing  emergency  plans  to 
deal  with  any  chemical,  biological,  radiological, 
and  nuclear  incidents;  transport  safety  and  bioter¬ 
rorism;  and  civil  and  military  preemptive  mea¬ 
sures  against  major  incidents  or  attacks.  The 
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Department  of  Civil  Protection  coordinates  and 
works  with  a  tight  network  of  central,  regional, 
provincial,  and  local  administrations  as  well  as 
other  public  agencies  and  voluntary  organiza¬ 
tions,  thereby  forming  a  “national  service”  oper¬ 
ating  on  the  basis  of  the  principle  of  subsidiarity. 

The  Interior  Ministry  has  responsibility  over  the 
State  Police  ( Polizia  di  Stato)  through  the  Public 
Security  Department,  which  coordinates  police 
tasks  and  activities  at  the  central  level  and  hosts  the 
Anti-Terrorism  Strategic  Analysis  Committee 
(Comitato  per  I’analisi  strategica  del  terrorismo, 
CASA).  At  the  provincial  level,  the  top  public  secu¬ 
rity  authorities  are  the  prefect,  who  is  appointed  by 
and  answerable  to  the  central  government,  and  the 
questore ,  the  State  Police’s  senior  provincial  officer 
with  operational  control  and  jurisdiction  in  the 
field  of  public  order,  security,  and  criminal  and 
intelligence  matters.  The  State  Police  includes  dif¬ 
ferent  specialist  units  that  may  play  a  role  in  pro¬ 
tecting  various  aspects  of  domestic  security:  The 
antiterrorism  police  have  primary  responsibility  for 
investigations  aimed  at  preventing  and  fighting  ter¬ 
rorism,  including  information  collection  and  analy¬ 
sis;  the  traffic  police  patrol  roads  and  highways;  the 
railway  police  ensure  the  security  of  travelers  and 
their  belongings,  the  security  of  railway  stations, 
and  the  control  of  dangerous  goods  transported  by 
rail;  the  immigration  and  border  police  are  respon¬ 
sible  for  the  entry  and  stay  of  foreign  nationals  and 
immigrants  in  Italy,  as  well  as  the  prevention  and 
control  of  illegal  immigrants;  and  the  postal  and 
communications  police  seek  to  prevent  and  tackle 
the  illegal  use  of  communication  technologies. 
Other  specialist  units  may  be  deployed  to  perform 
high-risk  interventions,  for  example,  in  the  case  of 
a  hostage  incident,  or  to  provide  rescue  services  in 
areas  affected  by  natural  disasters. 

The  Italian  security  apparatus  also  has  a  military 
corps  ( Carabinieri )  that  carries  out  police  duties 
among  the  civilian  population,  not  unlike  units  in 
other  European  countries,  such  as  the  French 
Gendarmerie  or  the  Spanish  Guardia  Civil.  The 
Carabinieri,  which  are  empowered  to  gather  intel¬ 
ligence,  investigate  terrorist  and  organized  crime 
organizations,  and  respond  to  high-risk  situations, 
are  institutionally  accountable  to  the  Ministry  of 
Defense  and  the  Ministry  of  Interior.  A  third  police 
force  with  responsibilities  over  financial,  tax,  and 
customs  issues  is  the  Guardia  di  Finanza,  directly 


attached  to  the  Minister  of  Economy  and  Finance, 
with  a  structure  resembling  the  Carabinieri, 
although  its  specialized  tasks  make  it  a  prime  actor 
in  the  prevention,  investigation,  and  disruption  of 
money  laundering,  fraud,  tax  evasion,  and  illicit 
financing. 

In  addition  to  the  law  enforcement  agencies,  the 
Italian  security  apparatus  also  includes  intelligence 
and  security  services  with  distinct  responsibilities 
for  the  defense  of  the  state’s  independence  and 
integrity  against  any  danger  on  the  military  front 
with  counterespionage  and  counterintelligence 
duties  (Agenzia  informazioni  e  sicurezza  esterna , 
AISE)  and  for  the  defense  of  the  democratic  state 
and  its  institutions  against  all  forms  of  subversion 
(. Agenzia  informazioni  e  sicurezza  interna,  AISI). 
The  AISE  and  AISI  work  under  the  authority  of 
the  prime  minister,  or  an  undersecretary  with  due 
delegation  of  authority,  who  exercises  the  relevant 
functions  through  the  Department  of  Information 
for  Security. 

Germany 

Germany,  like  Italy,  has  no  department  or  min¬ 
istry  for  “homeland  security.”  Its  long-established 
organization  for  protecting  security  and  respond¬ 
ing  to  emergencies  within  the  national  borders  is 
defined  by  the  1949  Basic  Law  (Grundgesetz,  i.e., 
the  Constitution),  which  involves  ministries  and 
agencies  at  the  federal,  state  (16  Lander ),  and  local 
levels.  The  German  approach  to  dealing  with  inci¬ 
dents,  whether  natural  disasters  or  terrorist  acts,  is 
bottom-up,  beginning  at  the  local  level,  involving 
state  authorities  if  necessary,  and  relying  on  federal 
agencies  if  needed. 

Since  September  11,  2001,  the  German  govern¬ 
ment  has  assessed  terrorism  as  the  most  immediate 
threat  to  national  security,  although  Germany’s 
experience  with  terrorism  dates  back  to  the  1970s, 
when  both  domestic  and  international  terrorism 
were  a  cause  of  serious  concern.  At  the  federal 
level,  security  functions  are  the  responsibility  of 
several  ministries,  including  the  Interior,  Justice, 
Defense,  Foreign  Affairs,  and  Finance  departments, 
as  well  as  other  departments  and  agencies.  The 
German  security  apparatus  relies  on  intelligence, 
law  enforcement,  and  judicial  prosecution  to  pre¬ 
vent  terrorist  acts  and  to  identify  and  neutralize 
potential  terrorists. 
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The  most  important  intelligence  authorities  are 
the  Federal  Intelligence  Service  (Bundesnachrichten- 
dienst,  BND),  which  is  responsible  for  gathering 
intelligence  abroad  under  the  Federal  Chancellery; 
the  Federal  Bureau  for  the  Protection  of  the 
Constitution  (Bundesamt  fur  Verfassungsschutz , 
BfV),  which  gathers  domestic  intelligence  and  is 
placed  under  the  Ministry  of  the  Interior;  and  the 
Military  Counterintelligence  Service  (Militarischer 
Abscbirmdienst,  MAD),  which  is  under  the  Ministry 
of  Defense.  The  federal  law  enforcement  agencies 
include  the  Federal  Bureau  of  Criminal  Investiga¬ 
tions  (Bundeskriminalamt,  BKA);  the  Federal  Police 
(Bundespolizei  [BPOLJ,  formerly  Bundesgrenz- 
schutz),  which  has  responsibility  over  border  pro¬ 
tection  and  aviation  security  and  carries  out  the 
screening  of  border  traffic  (BKA  and  BPOL  are 
placed  under  the  Ministry  of  the  Interior);  and  the 
Federal  Public  Prosecutor  General  (General- 
bundesanwalt,  GBA). 

Following  the  September  11,  2001,  attacks,  the 
BKA  has  been  given  authority  to  lead  its  own  inves¬ 
tigations  without  a  formal  request  from  the  BfV,  as 
had  been  previously  required.  Furthermore,  since 
2004,  the  federal  and  state  ministers  of  the  Interior 
have  introduced  measures  to  improve  coordina¬ 
tion,  which  include  a  central  database  regarding 
religious  fundamentalists  suspected  of  terrorism 
and  a  coordination  center  under  the  Ministry  of 
Interior  comprising  BKA,  BND,  BfV,  and  MAD,  as 
well  as  state  and  local  agencies.  These  coordination 
initiatives  prompted  the  debate  over  the  need  for 
greater  centralization  of  security  activities,  which, 
however,  faces  opposition  from  the  state  and  local 
authorities  and  finds  obstacles  in  the  constitutional 
laws  and  arrangements.  The  German  structure  for 
civil  protection  and  emergency  response  is  also  a 
joint  responsibility  of  both  the  Lander  and  the  fed¬ 
eral  republic  and  their  local  jurisdictions  and  the 
Federal  Office  for  Civil  Protection  and  Disaster 
Response  ( Bundesamt  fur  Beuolkerungsscbutz  und 
Katastropbenbilfe,  BBK),  which  was  established  in 
2004  under  the  Ministry  of  the  Interior,  is  called  to 
support  state  and  local  authorities  in  the  event  of 
significant  disaster  and  damage  situations. 

Germany’s  history,  geographical  location,  and 
its  industrial  and  economic  prominence  have 
greatly  affected  the  shape  of  its  security  apparatus. 
On  the  one  hand,  strong  emphasis  has  been  placed 
on  the  protection  of  the  civil  rights  and  liberties  of 


all  those  residing  in  Germany,  including  nonciti¬ 
zens;  on  the  other  hand,  its  position  in  the  heart  of 
Europe  and  its  economic  strength  have  made 
Germany  attractive  to  millions  of  immigrants, 
thereby  calling  for  the  need  to  protect  its  borders 
and  introduce  robust  security  controls. 

International  Security  Organizations 

The  external  dimension  of  security  issues  has 
triggered  the  establishment  and  development  of 
institutions  forming  an  embryonic  international 
security  apparatus  supporting  domestic  activities. 
Two  organizations  deserve  a  closer  scrutiny  as 
they  facilitate  cross-border  police  cooperation 
and  support  national  services  and  authorities 
whose  missions  is  to  prevent  and  combat  crime. 

The  International  Criminal  Police  Organization 
(INTERPOL),  established  in  1923,  is  the  world’s 
largest  international  police  organization  and  cur¬ 
rently  has  188  member  countries.  INTERPOL 
operates  through  its  General  Secretariat  (Lyon, 
France)  and  a  network  of  National  Central  Bureaus 
located  in  each  member  country.  Its  core  functions 
cover  secure  global  police  communication  services 
through  a  system  (1-24/7)  that  enables  national 
police  services  to  request,  submit,  and  access  police 
data  instantly  and  the  management  of  operational 
data  services  and  databases  with  information  on 
known  criminals,  wanted  persons,  fingerprints, 
DNA  profiles,  stolen  or  lost  travel  documents,  sto¬ 
len  motor  vehicles,  child  sex  abuse  images,  and 
stolen  works  of  art.  Support  to  police  services  is 
provided  in  six  priority  crime  areas:  corruption, 
drugs  and  organized  crime,  financial  and  high-tech 
crime,  fugitives,  public  safety  and  terrorism,  and 
trafficking  in  human  beings.  While  mostly  devoted 
to  operational  police  assistance  and  support, 
INTERPOL’s  mandate  may  also  extend  to  civil 
protection  activities  such  as  disaster  victim  identi¬ 
fication  in  case  of  a  terrorist  attack  or  a  natural 
disaster;  following  the  tsunami  that  hit  Thailand 
and  Sri  Lanka  in  December  2004,  INTERPOL 
took  an  active  role  in  the  international  effort  to 
identify  the  victims  of  the  disaster. 

A  notable  example  of  what  may  be  considered 
a  regional  security  apparatus  is  the  European 
Police  Office  (EUROPOL),  established  by  the 
member  states  of  the  European  Union  with  the 
mandate  to  handle  criminal  intelligence  and 
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improve  cooperation  in  preventing  and  combating 
serious  international  organized  crime  and  terror¬ 
ism.  EUROPOL  supports  the  member  states’  com¬ 
petent  authorities  by  facilitating  the  exchange  of 
information  with  and  among  Europol  liaison  offi¬ 
cers,  seconded  as  the  representatives  of  the  national 
competent  services  to  the  headquarters  (at  The 
Hague,  the  Netherlands),  and  by  providing  exper¬ 
tise,  technical  support,  and  criminal  intelligence 
analysis  in  support  of  operations.  EUROPOL  also 
produces  strategic  reports,  notably  the  yearly 
Organized  Crime  Threat  Assessment  and  the 
Terrorism  Situation  and  Trend  Report,  as  well  as 
crime  analysis  on  the  basis  of  information  and 
intelligence  provided  by  the  member  states  and 
third  parties  and  assessed  through  different  infor¬ 
mation,  index,  and  analysis  systems. 

Conclusion 

The  structures  responsible  for  providing  security 
to  a  society  and  its  population  reflect  the  contem¬ 
porary  values  held  by  a  country  or  other  human 
organizations.  As  such,  the  security  apparatus  also 
provides  the  state  of  affairs  of  the  underlying  cul¬ 
tural,  social,  and  political  elements  and,  in  addi¬ 
tion  to  its  primary  function,  allows  a  “health 
check”  of  a  society  that  can  help  assess  its  status 
and  predict  its  future. 
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This  entry  deals  with  security  cooperation  in  a  clas¬ 
sical  sense:  It  focuses  on  state  security  and  coopera¬ 
tion.  This  does  not  imply  that  other  actors  are  not 
subjects  of  security  or  do  not  participate  in  related 
cooperation;  it  pays  just  tribute  to  the  fact  that 
states  are  still  the  main  actors  in  both  domestic  and 
external  security  activities,  including  cooperation.  It 
is  worth  noting  that  international  and  nongovern¬ 
mental  organizations,  substate  actors,  companies, 
and  others  are  involved  in  these  activities  as  well. 

International  security  cooperation  is  counterin¬ 
tuitive  to  the  notion  of  “international  anarchy.”  To 
entrust  a  part  of  the  responsibility  for  state  survival 
to  other  actors  thus  sounds  far-fetched;  neorealism 
has  expressed  this  skepticism  distinctly.  Yet  in 
everyday  international  life,  security  cooperation  is 
found  in  ever  more  varied  forms:  It  is  an  ubiquitous 
pattern  in  world  and  in  regional  politics. 

Cooperative  strategies  in  security  policy  accept 
the  notion  of  anarchy  but  view  it  in  a  more  relaxed 
way  than  neorealists  do.  These  strategies  see  states 
as  security  seekers.  Security  interests  have  become 
interdependent  owing  to  the  penetrability  of 
national  borders  and  the  impossibility  of  perfect 
defense,  and  national  interests  in  general  have 
become  interdependent  through  globalization. 
Competing  interests  are  mitigated  by  common 
interests.  States  can  signal  effectively  benign  inten¬ 
tions.  “Predators”  and  sincere  partners  can  be 
distinguished.  Common  security  institutions  sup¬ 
ply  reliable  information  about  present  and  future 
capabilities  and  intentions,  and  common  norms 
channel  the  goals,  interests,  and  actions  of  states  in 
a  mutually  agreeable  direction. 

This  entry  provides  a  typology  of  security 
cooperation  in  today’s  world.  It  distinguishes 
between  exclusive  security  cooperation,  where 
entry  rights  are  limited  and  controlled  and  which 
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coordinates  policies  toward  the  nonmembership, 
and  inclusive  security  cooperation,  which  is  open 
and  which  regulates  the  policies  of  the  members 
toward  each  other.  Exclusive  institutions  contain 
a  strong  element  of  realism,  though  they  are  by  no 
means  exclusively  rooted  in  realist  theory.  Inclusive 
institutions  have  stronger  relations  to  liberal- 
institutionalist,  constructivist,  and  normative- 
idealist  theories.  The  following  sections  describe 
the  basic  structure,  advantages  and  disadvantages, 
and  theoretical  underpinnings  of  each  type. 

Exclusive  Security  Cooperation 
Coalitions  of  the  Willing 

Today’s  “coalition  of  the  willing” — a  short¬ 
term  association  of  states  that  ally  for  a  single 
strategic  purpose  at  a  certain  point  in  time  and 
space — was  the  standard  type  of  alliance  in  the 
classical  Westphalian  world.  In  this  type  of  coali¬ 
tion,  the  alliance  does  not  define  the  mission,  but 
the  mission  defines  the  alliance.  Coalition  partners 
are  selected  for  their  particular  capabilities  to 
achieve  the  specific  purpose,  not  by  more  intrinsic 
attributes  and  considerations. 

Coalitions  of  the  willing  contain  no  legal  obliga¬ 
tions,  develop  no  institutional  structure,  have  no 
common  staff,  entertain  no  common  ideology  or 
culture,  elaborate  no  grand  strategy,  and  refrain 
from  integrating  military  forces.  Their  members 
have  only  a  short-term  time  horizon.  Since  the 
strongest  member  or  a  few  core  members  select  the 
partners,  they  are  exclusive. 

Because  of  its  short  time  horizon,  the  coalition 
of  the  willing  cannot  mitigate  the  security 
dilemma:  No  long  shadow  of  the  future  reduces 
the  remaining  distrust  among  its  members;  no 
expectations  are  created  over  how  relations  might 
evolve  after  the  short-term  objective  has  been 
achieved.  As  an  example,  the  “coalition  against 
terror”  includes  even  long-term  enemies  (e.g., 
India  and  Pakistan).  However,  with  regard  to 
nonmembers,  high  uncertainties  emerge  over 
whether  such  coalitions  might  turn  against  other 
states  in  the  future.  Being  exclusive  and  unilater¬ 
alist,  coalitions  of  the  willing  might  stimulate 
“coalitions  of  the  unwilling,”  or  even  “counter¬ 
coalitions  of  the  threatened.” 

For  the  leading  great  powers,  several  advantages 
emerge.  In  the  absence  of  institutional  structures, 


customary,  political,  or  legal  rules  exist;  symme¬ 
tries  translate  directly  into  influence;  and  the  com¬ 
mand  of  the  strongest  is  unchallenged.  Burdensome 
consultation  procedures  are  absent  (e.g.,  Kosovo 
1999  vs.  Afghanistan  or  Iraq  2003).  Because  the 
coalitions  are  shaped  for  a  single  purpose,  unity  of 
political  will  is  granted — no  opposition  force  within 
the  coalition  must  be  expected.  Also,  transaction 
costs  are  very  low,  no  future  obligations  ensue,  and 
freedom  of  action  remains  unconstrained.  For 
smaller  states,  coalitions  of  the  willing  are  a  cheap 
way  of  bandwagoning,  enhancing  their  security, 
and  sharing  the  eventual  spoils  without  the  burden 
of  enduring  dependence. 

On  the  downside,  smaller  coalition  members 
have  little  influence  on  the  coalition’s  politics, 
which  are  dominated  by  their  powerful  partners. 
They  bear  the  consequences  of  actions  decided  and 
conducted  unilaterally  by  others.  These,  in  turn, 
are  helpless  against  any  moves  of  the  smaller  allies 
to  desert  the  coalition.  Coalition  performance  is 
thus  uncertain  because  of  the  lack  of  predictability 
within  the  coalition. 

Hence,  coalitions  of  the  willing  appear  fit  to 
achieve  short-term,  straightforward  objectives.  If 
circumstances  are  complex  and  problems  endur¬ 
ing,  these  advantages  fade  in  comparison  with 
more  stable  and  reliable  structures. 

Alliances 

Today,  the  notion  of  “alliance”  has  been  shaped 
by  the  model  of  NATO  (North  Atlantic  Treaty 
Organization),  even  though  its  longevity  and  institu¬ 
tional  complexity  constitute  a  historical  singularity. 

Alliances  are  a  corollary  of  international  anarchy. 
Each  ally’s  security  is  enhanced  against  potential 
enemies  by  the  pooling  of  resources.  To  rely  on  the 
promises  of  somebody  else  goes  beyond  a  common 
sense  notion  of  “self-help.”  Within  the  realm  of 
neorealist  theory,  the  puzzle  is  all  the  larger  as  the 
problem  of  “relative  gains”  affects  allies  as  well. 
Because  of  this  problem,  distrust  is  not  uncommon 
in  alliances,  and  a  lot  of  symbolic  politics  is 
deployed  to  deal  with  this  destructive  factor. 

Realists  dispute  which  variable  triggers  an  alli¬ 
ance.  According  to  the  classical  view,  states  ally 
against  the  biggest  power  in  their  environment. 
Another  perspective  has  gained  increasing  support 
in  the  past  2  decades:  States  ally  against  the  biggest 
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perceived  threat  in  their  environment.  This  amend¬ 
ment  modifies  basic  neorealist  assumptions. 
“Threat”  resides  in  the  behavior  of  the  threatening 
actor  and  is  a  unit-level  attribute.  Assessment  of 
threat  depends  on  subjective  (or  collective-cultural) 
factors.  Finally,  according  to  neoclassical  realism, 
in  a  system  without  “predators,”  alliances  will 
evaporate,  as  predators  are  the  only  threats  that 
can  emerge  and  without  them,  allying  would  be  a 
waste  of  resources. 

Alliances,  once  formed,  face  several  dilemmas. 
Allies  do  not  necessarily  hold  identical  security 
interests.  As  a  consequence,  they  must  expect  two 
opposite  types  of  behavior  from  each  other.  First, 
one  ally’s  security  guarantee  for  the  other  ally  may 
embolden  that  ally  to  engage  in  brinkmanship, 
provoking  an  armed  conflict  one  does  not  want 
(entrapment  risk).  Otherwise  that  ally  might  switch 
sides  and  desert  to  the  erstwhile  enemy  (risk  of 
abandonment).  By  reassuring  one’s  ally  of  one’s 
own  loyalty,  one  is  reducing  the  danger  of  aban¬ 
donment,  but  only  to  enhance  the  risk  of  entrap¬ 
ment  as  the  ally  will  take  one’s  loyalty  for  granted 
no  matter  what.  If  one  keeps  a  distance  from  one’s 
ally,  the  ally  might  be  less  emboldened  to  practice 
brinkmanship,  but  disappointment  might  drive 
that  ally  toward  the  enemy. 

An  alliance  between  a  great  power  (as  protec¬ 
tor)  and  a  couple  of  smaller  powers  (as  protected) 
contains  a  specific  dilemma.  The  protector  worries 
about  the  risk  of  the  guarantee  it  is  giving  to  the 
small  allies.  Individually,  they  may  not  amount  to 
much,  but  together,  they  may  be  geopolitically 
important.  But  is  the  value  of  each  single  one 
worth  marching  into  a  war  with  another  great 
power?  For  the  smaller  allies,  the  reliability  of  the 
alliance  guarantee  can  also  be  a  reason  to  be  ner¬ 
vous:  Will  the  protector  risk  a  war  against  the 
threat  I  am  facing? 

Vast  distances  exacerbate  the  alliance  dilemma: 
Geopolitics  is  not  the  same  for  all  allies;  vital  inter¬ 
ests  differ.  Politically,  the  allies  are  at  pains  to 
agree  on  the  threat  and  on  the  trigger  that  triggers 
the  “alliance  case.”  Physically,  the  allies  struggle 
hard  to  devise  a  strategy,  a  posture,  and  logistics 
that  make  the  alliance’s  promise  credible.  The 
enhanced  risks  of  the  nuclear  age  have  exacerbated 
this  dilemma. 

Asymmetric  alliances  quarrel  about  burden 
sharing.  The  biggest  member  makes  the  decisive 


contribution.  Assets  of  the  smallest  allies  add  vir¬ 
tually  nothing  at  the  margin.  Smaller  allies  have 
thus  an  incentive  to  contribute  less  than  their 
“arithmetic”  share — they  are  up  for  a  free  ride, 
while  the  larger  member  produces  most  of  the 
public  good  alliance  security. 

Many  alliances  in  history  were  weakly  institu¬ 
tionalized.  With  the  growing  speed  of,  and  the 
stakes  involved  in,  modern  warfare,  alliances  try  to 
achieve  greater  effectiveness  and  to  mitigate  the 
defection  problem  by  higher  institutionalization. 
NATO  has  been  the  first  fully  institutionalized  alli¬ 
ance  with  a  headquarters,  a  bureaucracy,  a  general 
staff,  a  council,  an  agreed  strategy,  and  a  far- 
reaching  military  integration.  Institutionalization 
enhances  alliance  efficiency,  trust,  and  stability. 
“Sunk  investment  costs”  motivate  members  to 
keep  the  alliance  under  changed  circumstances: 
Enduring  collaboration  creates  social  bonds.  As  a 
consequence,  NATO  has  survived  the  loss  of  its 
enemy. 

Clubs 

International  club  governance  is  on  the  rise. 
Clubs  are  less  institutionalized  than  alliances  but 
more  so  than  coalitions  of  the  willing.  They  are 
distinct  in  that  they  are  not  necessarily  confined  to 
rendering  security  services  to  their  members  alone 
but  deliver  “security  products”  to  a  broader  com¬ 
munity,  including  nonmembers. 

“Clubs”  in  the  security  field  are,  for  example, 
the  export  control  regimes  (e.g.,  Nuclear  Suppliers 
Group  [NSG]),  the  Proliferation  Security  Initiative, 
and  the  G8  [Group  of  Eight]).  They  do  not  rest  on 
treaties  and  have  an  exclusive  membership  that  is 
enhanced  by  consensual  co-optation.  They  are 
meant  to  be  permanent.  All  have  a  flat  institutional 
structure:  some  agreed  rules  and  norms,  even  in 
written  form  (e.g.,  the  export  guidelines  of  the 
NSG).  And  they  all  have  established  decision¬ 
making  structures  (usually  consensus  rule). 

The  form  of  “club”  permits  members  to  escape 
from  the  constraints  of  legally  regulated  coopera¬ 
tion.  It  allows  for  an  opaque  style  of  proceeding 
that  might  be  helpful  to  achieve  consensus  on 
politically  difficult  issues  (e.g.,  export  control) 
while  creating  mutual  confidence  that  the  partners 
are  sincere.  This  might  be  a  condition  to  establish 
cooperation  in  controlling  the  export  of  weapons 
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of  mass  destruction  (WMD)  between  potential 
political  rivals.  It  might  also  facilitate  substantial 
coordination  in  areas  of  high-security  salience, 
such  as  G8  antiterrorism  collaboration.  And  it 
allows  for  greater  flexibility  than  formal  treaties. 
The  guidelines  for  clubs  are  easy  to  change  and 
amend,  namely,  by  simple  agreement  of  the  execu¬ 
tives  and  without  the  often  cumbersome  ratifica¬ 
tion  procedures.  The  downside  is  a  lack  of 
accountability. 

Concerts 

Concert  is  a  mechanism  to  coordinate  great 
power  policies.  The  historical  model,  the  Concert 
of  Europe,  was  established  at  the  end  of  the 
Napoleonic  wars,  and  it  kept  the  peace  between 
the  European  great  powers  from  the  Vienna 
Congress  (1815)  up  to  the  Crimean  War  (1853), 
contained  the  ensuing  wars  of  Italian  and  German 
unification,  and  provided  an  arena  for  conflict 
management  afterward  until  the  breakdown  in 
1914.  It  was  a  tool  to  provide  security  for  its  mem¬ 
bers  against  each  other,  to  prevent  third  parties 
from  disturbing  stability,  to  offer  the  public  good 
of  broad  conflict  management,  and  to  mitigate  the 
consequences  of  inevitable  political  change. 

The  classical  concert  relied  on  a  combination  of 
behavioral  and  procedural  norms.  Great  powers 
guaranteed  each  other’s  existence  and  territorial 
integrity  and  recognized  each  other’s  vital  inter¬ 
ests.  They  committed  not  to  change  the  status  quo 
by  force.  They  accepted  restraint  on  their  right  to 
intervene  unilaterally  in  other  states’  internal 
affairs.  Crisis  management  was  to  be  conducted 
jointly  (with  the  option  to  opt  out  when  vital  inter¬ 
ests  were  not  at  stake).  A  rough  balance  of  forces 
was  maintained.  The  overarching  norm  was  to 
avoid  a  devastating  major  war  in  Europe.  The  con¬ 
cert  maintained  an  elaborated  system  of  confer¬ 
ences,  meetings  of  ambassadors,  and  occasional 
encounters  between  monarchs.  Consultations  were 
intensified  when  an  international  crisis  arose. 

The  strength  of  the  concert  was  based  on  the 
common  experience  of  large-scale,  enduring  war 
after  the  French  Revolution  and  on  a  similar  social 
basis  of  largely  aristocratic  or  great  bourgeois 
leaderships  and  diplomatic  corps.  These  common¬ 
alities  permitted  collaboration  despite  value  differ¬ 
ences  between  the  more  liberal  Britain  and  France, 


on  the  one  hand,  and  the  much  more  conservative 
Russia,  Prussia,  and  Austria,  on  the  other.  When 
the  memory  of  the  wars  faded,  extended  suffrage 
in  Britain  and  France  brought  new  social  forces  to 
the  seats  of  power,  and  nationalism  led  to  higher 
political  mobilization  everywhere,  it  became  harder 
to  achieve  a  consensus.  The  concert  suffered  also 
from  the  shortcomings  that  a  Russian  attempt 
failed  to  place  arms  control  on  its  agenda  and  that 
extra-European  issues  remained  largely  outside  its 
concerns,  but  it  influenced  the  balance  of  power 
and  mutual  perceptions  of  the  members.  Interests 
of  small  states  or  national  movements  were  not 
generally  ignored — the  concert  helped  with  the 
independence  of  Belgium,  Greece,  and  Romania 
and  managed  the  transfer  of  the  province  of 
Neuenburg  from  Prussian  possession  to  Swiss 
sovereignty — but  smaller  states  had  a  voice  only  if 
the  concert  granted  it  occasionally. 

The  concert  realized  security  cooperation  among 
major  powers  of  different  ideological  orientation 
in  a  phase  of  great  change,  and  it  worked  for  an 
impressively  long  period  of  time.  It  combined  real¬ 
ist  (balance  of  power),  normative,  and  institution¬ 
alist  features;  this  might  be  a  hint  that  successful 
real-world  security  cooperation  has  to  rely  on 
paradigmatic  synthesis  rather  than  purism. 

Inclusive  Security  Cooperation 
Collective  Security 

The  concept  of  collective  security  has  never 
been  realized  in  pure  form  and  with  resounding 
success.  It  aims  to  establish  a  system  of  generalized 
deterrence  and  reassurance,  which  relaxes  the  need 
for  self-help  efforts.  An  operating  system  of  collec¬ 
tive  security  would  reduce  the  security  dilemma  for 
everybody  tremendously. 

The  basic  principle  of  collective  security  is  “all 
for  one,  one  for  all”:  Members  commit  to  assist  an 
attacked  partner.  Collective  security  is  an  inclusive 
provisional  alliance  without  a  predefined  enemy. 
The  aggressor  is  supposed  to  be  one  of  the  partners 
who,  by  attacking,  defines  himself  as  the  enemy  of 
the  rest.  The  basic  feature  is  thus  the  marshaling  of 
overwhelming  defense  in  the  moment  of  aggres¬ 
sion  (which  assumes  that  no  member  has  more 
power  than  the  rest  together). 

The  difficulties  start  with  the  question  of  who 
the  aggressor  is.  Escalatory  processes  leading  to 
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war  can  be  interactive  and  incremental,  making  it 
difficult  to  identify  an  aggressor  unambiguously, 
as  at  the  outset  of  World  War  I.  Countries  may  feel 
compelled  to  engage  in  preemptive  defense,  as 
Israel  claimed  in  1967,  in  which  case  the  one  who 
fires  the  first  shot  does  so  for  defensive  purposes. 
Those  who  are  assessing  an  aggression  claim  may 
suffer  from  biases  as  they  feel  friendlier  to  one 
party  than  to  the  other.  The  next  hurdle  is  an 
assessment  of  which  measures  are  to  be  taken  and 
which  forces  should  intervene  on  the  basis  of  what 
strategy  and  under  whose  command.  Once  a  war 
for  collective  security  is  under  way,  when  and  on 
what  conditions  it  should  be  terminated  and  what 
the  postwar  settlement  should  look  like  are  divi¬ 
sive  issues.  Collective  security  can  fail  on  each  of 
these  choices. 

A  public  good  problem  arises  because  the  utility 
of  collective  security  is  enjoyed  by  all,  while  the 
burdens  in  the  hour  of  truth  are  more  likely  to  fall 
only  on  a  part  of  the  membership.  Geostrategic 
interests  differ,  as  do  military  resources.  Few  coun¬ 
tries  can  project  and  maintain  power  over  long 
distances.  The  more  powerful  an  aggressor  is,  the 
riskier  a  collective  military  counteroperation  will 
be.  A  further  complicating  factor  is  the  enormous 
risk  of  escalation  into  gross  destruction  should  the 
aggressor  be  a  nuclear  weapons  state. 

For  these  reasons,  realists  have  shrugged  off  col¬ 
lective  security  as  inferior  to  self-help  security  sys¬ 
tems.  Nevertheless,  the  disruptions  emerging  from 
unfettered  self-help  in  an  increasingly  interdepen¬ 
dent  world  have  led  analysts  repeatedly  to  recon¬ 
sider  collective  security.  It  is  at  least  possible  to 
define  some  conditions  that  would  enhance  the 
chances  that  it  might  work:  (a)  if  the  great  powers 
are  not  in  existential  conflict  and  share  an  interest  in 
stability,  (b)  if  nuclear  weapons  are  not  in  the  hands 
of  would-be  aggressors  (or  could  be  reasonably 
defended  against),  and  (c)  if  efficient  joint  assess¬ 
ment  capabilities  and  procedures  are  available  to 
steer  parties  through  the  difficult  choices  to  be  made. 
In  other  words,  collective  security  assumes  the  exis¬ 
tence  of  concert-like  features  as  its  basic  condition. 

Security  Regimes 

Security  regimes  offer  a  more  modest  form  of 
cooperation.  They  aim  at  regulating  policy  sub¬ 
fields  rather  than  security  at  large.  Regimes  exist, 


for  example,  in  subfields  of  arms  control  and  non¬ 
proliferation,  in  curbing  terrorist  financing,  or  in 
the  United  Nations  (UN)  routine  for  peacekeeping 
operations.  As  other  international  regimes,  they 
consist  of  four  different  levels:  (1)  principles  state 
what  the  objective  of  the  regime  is  and  how  cause- 
and-effect  relations  in  the  specific  field  are  seen  by 
regime  members;  (2)  norms  contain  general  guide¬ 
lines  for  the  behavior  of  members;  (3)  rules  consist 
of  specific  prescriptions  and  proscriptions,  specify¬ 
ing  selectively  the  meaning  of  the  norms;  and 
(4)  procedures  establish  how  decisions  are  taken. 

Regimes  might  be  treaty  based  or  informal. 
They  might  be  supported  organizations  that 
administer  the  regime  and  perform  services  for 
members,  such  as  verification  or  technical  assis¬ 
tance  (e.g.,  the  Organization  for  the  Prohibition  of 
Chemical  Weapons  [OPCW]  in  the  chemical  weap¬ 
ons  regime). 

Regimes  perform  valuable  functions  in  the  secu¬ 
rity  sector.  Specific  rules  for  behavior  create  pre¬ 
cise  expectations  for  the  future,  reducing  the 
uncertainty  that  is  at  the  root  of  the  security 
dilemma.  Transparency,  verification,  and  proce¬ 
dures  for  the  clarification  of  ambiguous  informa¬ 
tion  give  insight  into  the  faithfulness  of  regime 
members.  Moreover,  arms  control  regimes  that 
curb  seriously  the  opportunities  for  surprise  attacks 
and  comprehensive  offensives  suggest  the  benevo¬ 
lent  intentions  of  all  partners,  in  addition  to  pro¬ 
viding  information  about  their  capabilities.  A 
dense  network  of  such  regimes  might  help  trans¬ 
form  relations  of  mutual  distrust  into  mutual  con¬ 
fidence  and  help  overcome  the  security  dilemma: 
In  the  end  phase  of  the  Cold  War,  progress  in  arms 
control  and  political  change  reinforced  each  other. 

This  effect,  however,  is  not  determined.  The 
overall  interstate  relationships  and  the  opportuni¬ 
ties  for  progress  in  arms  control  are  connected.  If 
a  conflict  is  harsh  and  mutual  distrust  high,  field- 
specific  regimes  might  be  too  piecemeal  to  help. 
Either  party  in  the  conflict  might  request  such 
fundamental  changes  in  the  position  of  the  other 
side  as  a  precondition  for  negotiations  that  the 
whole  cooperation  process  falters.  The  failure  of 
arms  control  and  regional  security  talks  between 
Israel  and  its  Arab  neighbors  during  the  1990s  is  a 
telling  example.  Likewise,  arms  control  is  vulner¬ 
able  to  domino  effects  if  something  else  in  the 
relationship  goes  wrong;  For  example,  U.S.  concern 
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about  a  Soviet  army  brigade  on  Cuba  sealed  the 
fate  of  the  Strategic  Arms  Limitation  Talks  (SALT 
II)  ratification  in  the  Senate  in  1979,  even  though 
the  two  issues  were  not  substantially  related. 

Regional  Organizations 

Regional  organizations  are  increasingly  impor¬ 
tant  in  security  cooperation.  Only  a  few  regions  are 
lacking  in  related  structures  (the  Middle  East  and 
South  Asia).  A  great  number  of  war-driving  factors 
are  most  likely  to  occur  in  regional  settings:  territo¬ 
rial  conflict,  enduring  rivalry,  contiguity,  and  over¬ 
lapping  transborder  ethnicities.  Neighbors  are  most 
prone  to  face  the  risk  of  war,  while,  on  the  other 
hand,  they  might  be  more  similar  in  terms  of  his¬ 
torical  experience  or  cultural  traits  than  countries 
far  away  and  may  also  harbor  a  considerable 
potential  for  mutually  beneficial  cooperation.  Thus, 
while  the  security  problem  is  particularly  virulent, 
remedies  are  also  more  easily  available. 

International  organizations,  regional  ones  in 
particular,  have  potential  effects  that  make  them 
fit  for  conflict  management  and  security  produc¬ 
tion.  They  create  permanent  contact  among  offi¬ 
cials,  leading  to  greater  familiarity  and,  eventually, 
more  mutual  confidence.  They  provide  informa¬ 
tion  and  enhance  transparency,  reducing  the  uncer¬ 
tainty  problem  behind  the  security  dilemma.  They 
create  the  expectation  of  further  cooperation, 
lengthening  the  “shadow  of  the  future.”  In  these 
functions,  they  are  similar  to  international  regimes. 
But  they  can  address  interstate  conflicts  among 
their  members  more  comprehensively.  They  offer 
communication  channels  that  facilitate  talks  about 
conflict  solutions.  And  embedding  rivals  in  the 
same  organization  offers  other  actors  opportuni¬ 
ties  for  mediation. 

Some  regional  organizations  have  been  created 
with  an  ostensible  focus  on  nonsecurity  issues  but 
perform  important  roles  for  regional  security  or 
have  evolved  their  agenda  to  include  security 
explicitly  (e.g.,  the  European  Union  [EUJ, 
Association  of  Southeast  Asian  Nations,  or  the 
Economic  Community  of  West  African  States). 
Others  have  been  created  for  security  collabora¬ 
tion,  such  as  the  Organization  for  Security  and 
Cooperation  in  Europe  (OSCE)  or  the  Shanghai 
Cooperation  Organisation.  It  might  also  be  that  a 
setting  is  created  ad  hoc  with  the  specific  mission 


to  manage  a  particular  project  but  evolves  into  a 
broader  multilateral  institution.  This  might  be  the 
future  of  the  six-party  talks  in  East  Asia,  which 
came  into  being  to  contain  the  regional  risks 
emerging  from  North  Korea’s  nuclear  activities. 

The  success  of  these  organizations  appears  to 
depend  more  on  specific  regional  conditions  and 
on  leadership  and  practice  and  less  on  prefixed 
structural  conditions.  It  requires  fundamental 
political  willingness  and  truly  shared  objectives. 
Where  these  conditions  are  absent,  regional  coop¬ 
eration  does  not  take  off  at  all  (e.g.,  in  the  Middle 
East)  or  with  limited  success  (e.g.,  the  OSCE). 

Security  Communities 

In  security  communities,  cooperation  reaches 
such  a  level  that  the  security  dilemma  evaporates 
altogether:  The  specter  of  military  conflict  is 
replaced  by  the  expectation  of  peaceful  change. 
While  security  communities  may  start  from  utility 
motivations  for  working  together,  in  their  mature 
state,  they  are  transformed.  The  key  change  is  the 
development  of  a  shared  identity.  This  identity 
makes  the  idea  of  warring  “against  one’s  own  kin” 
out  of  the  question.  Military  contingency  plans 
against  each  other  and  related  preparations  do  not 
exist.  The  EU  is  the  most  outstanding  example. 

Mature  security  communities  contain  a  tight 
interrelationship  between  material  (utilitarian) 
and  ideational  factors.  Close  relations  provide  col¬ 
lateral  utilities:  enlarged  freedom  of  movement  for 
citizens,  economic  benefits,  scientific  collabora¬ 
tion,  and  so  on.  Common  values  and  the  observa¬ 
tion  of  established  behavioral  norms,  instantiated 
in  everyday  practices,  keep  the  community  tightly 
linked.  Close-knit  elites  serve  as  “community 
entrepreneurs”;  they  may  compensate  for  a  less 
determined  popular  commitment.  Communities 
tend  to  develop  more  elaborate  organizations  and 
communication  channels.  Military  collaboration 
might  grow  into  partial  integration.  This  evolu¬ 
tionary  perspective  on  security  communities 
should  not  be  confused  with  a  eufunctional  bias: 
If  environmental  circumstances  change  drastically 
or  if  domestic  upheaval  produces  new  attitudes, 
communities  may  decay. 

While  the  main  thrust  of  security  communities  is 
the  pacification  of  intracommunity  relationships, 
they  may  also  create  outside  effects.  They  might 
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erect  a  joint  defense  against  a  common  threat,  like 
an  alliance,  and  offer  security  services  for  third 
parties  (e.g.,  capacity  building  for  export  controls, 
the  transfer  of  counterterrorism-relevant  informa¬ 
tion,  and  providing  troops  for  peacekeeping). 
While  the  “alliance”  function  can  have  destabiliz¬ 
ing  effects  for  excluded  parties,  security  services 
might  transfer  the  pacifying  effects. 

Security  communities  are  ambiguous  as  to 
inclusiveness/exclusiveness:  Inclusiveness  applies 
only  to  countries  that  share  criteria  of  member¬ 
ship;  the  procedure  for  entry  is  co-optation.  On  the 
other  hand,  security  communities  bind  themselves 
to  welcome  everybody  fulfilling  these  criteria;  in 
that  sense,  they  are  inclusive.  Uncertainties  about 
EU  boundaries  illustrate  this  ambiguity. 
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Security  Dilemma 


A  security  dilemma  is  a  situation  in  which  the 
actions  taken  by  a  state  to  increase  its  own  security 
cause  reactions  from  other  states,  which  leads  to  a 
decrease  rather  than  an  increase  in  the  state’s  secu¬ 
rity.  Some  scholars  of  international  relations  find 
that  the  security  dilemma  is  the  most  important 
source  of  conflict  in  international  relations.  They 
argue  that  in  the  international  realm,  there  is  no 
legitimate  monopoly  of  violence — that  is,  there  is 
no  world  government — and  as  a  consequence, 
each  state  must  take  care  of  its  own  security  and 
survival.  For  this  reason,  the  primary  goal  of  states 
is  to  maximize  their  own  security.  Even  if  states 
focus  solely  on  this  goal  and  have  no  intention  of 
harming  others,  many  of  the  actions  taken  by 
states  to  increase  their  own  security — such  as 
weapons  procurement  and  the  development  of 
new  military  technologies — will  decrease  the  secu¬ 
rity  of  others.  Decreasing  the  security  of  others 
does  not  automatically  place  the  state  in  a  dilemma, 
but  because  of  the  anarchic  structure,  other  states 
will  follow  suit  if  one  state  arms.  They  cannot 
know  whether  the  arming  state  will  use  its  increased 
military  capabilities  for  attack  in  the  future.  For 
this  reason,  they  will  either  choose  to  increase  their 
own  military  capabilities  in  order  to  reestablish  the 
balance  of  power  or  they  will  launch  a  preemptive 
attack  to  prevent  the  arming  state  from  upsetting 
the  balance  in  the  first  place.  If  they  choose  the 
first  option,  the  result  may  be  a  security  spiral.  A 
security  spiral  is  an  action-reaction  process,  where 
two  states  are  tied  in  an  armaments  race  with  each 
state  responding  to  increases  in  weapons  procure¬ 
ment  and  defense  expenditure  by  the  other  state, 
leading  them  both  to  arm  themselves  more  and 
more  heavily.  This  may  lead  to  war  in  the  long  run. 
If  they  choose  the  last  option,  military  conflict  will 
be  imminent. 

The  logic  of  the  security  dilemma  was  first 
described  by  Herbert  Butterfield  in  1949.  The  term 
itself  was  coined  by  John  Herz  in  1950.  Although 
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the  logic  seems  to  fit  particularly  well  with  the 
security  competition  between  the  United  States 
and  the  Soviet  Union  during  the  Cold  War,  propo¬ 
nents  of  the  term  do  not  see  it  as  tied  to  a  specific 
historic  era.  Rather,  it  reflects  the  fundamentally 
tragic  nature  of  international  life:  State  actors 
strive  for  peace  and  stability  but  end  up  in  military 
conflict.  Thus,  even  if  all  states  are  status  quo  pow¬ 
ers  wishing  only  peace  and  security,  war  may 
occur  because  of  the  fear  and  insecurity  following 
from  the  anarchic  structure  of  the  international 
system.  This  focus  on  the  effect  of  international 
anarchy  on  the  behavior  of  states  is  typical  of  so- 
called  structural  realism — sometimes  termed 
neorealism — which  posits  that  the  international 
system  is  a  self-help  system,  where  states  must 
focus  on  their  own  interests  to  maximize  their 
chance  of  security  and  survival.  In  particular,  the 
security  dilemma  logic  is  central  to  so-called  defen¬ 
sive  realism.  Proponents  of  defensive  realism  argue 
that  states  seek  to  maximize  their  chance  of  secu¬ 
rity  and  survival  by  maintaining  their  position  in 
the  international  system,  not  by  expansion.  Still, 
scholars  working  within  alternative  theoretical 
traditions  have  discussed  how  the  security  dilemma 
logic  looks  from  their  perspective  and  have  used 
their  own  theoretical  insights  to  suggest  how  we 
might  move  beyond  the  security  dilemma  logic  and 
create  a  more  peaceful  world. 

A  large  number  of  international  relations  schol¬ 
ars  have  applied  and  developed  the  logic  of  the 
security  dilemma.  Some  of  the  most  prominent 
security  dilemma  scholars  are  Robert  Jervis,  Barry 
Posen,  and  Charles  Glaser.  From  a  more  critical 
stance,  the  security  dilemma  logic  has  been 
explored  by,  among  others,  Ken  Booth  and 
Nicholas  Wheeler.  Recent  research  on  the  security 
dilemma  has  applied  the  logic  to  most  regions  of 
the  world,  and  it  has  resulted  in  a  proliferation  of 
more  fine-grained  distinctions  between  different 
types  of  security  dilemmas. 

Some  scholars  of  international  relations  argue 
that  all  states  face  a  security  dilemma  all  the  time, 
but  most  of  them  agree  that  the  intensity  of  security 
dilemmas  tends  to  vary  over  time  and  space.  The 
literature  on  international  relations  points  to  differ¬ 
ent  sources  of  variation  in  the  security  dilemma. 
Security  scholars  such  as  Stephen  Van  Evera  argue 
that  the  intensity  of  the  security  dilemma  depends 
on  the  ease  of  conquest.  If  conquest  is  easy,  states 


will  typically  face  an  intense  security  dilemma, 
because  the  risk  of  military  defeat  is  raised  every 
time  a  competing  state  adds  to  its  military  capabili¬ 
ties.  Conversely,  if  conquest  is  difficult,  the  security 
dilemma  is  ameliorated,  because  other  states  may 
add  to  their  military  capabilities  without  posing  a 
direct  offensive  threat.  If  we  are  able  to  tell  the  dif¬ 
ference  between  offensive  and  defensive  weapons, 
states  may  even  signal  their  benign  intentions  by 
deploying  defensive  weapons.  Other  states  will 
know  that  they  have  acquired  weapons  not  to 
attack  but  to  defend.  Offense  and  defense  domi¬ 
nance  vary  over  time  and  space  depending  on  a 
number  of  factors,  including  geography,  military 
technology,  and  military  doctrine. 

Also,  regime  type  may  affect  the  intensity  of  the 
security  dilemma.  Although  democracies  often  go 
to  war,  they  rarely  go  to  war  against  other  democ¬ 
racies.  When  two  autocracies  face  each  other  or 
when  a  democracy  and  an  autocracy  face  each 
other,  security  spirals  sometimes  spin  out  of  con¬ 
trol,  because  each  side  interprets  the  move  by  the 
other  side  as  potentially  threatening.  But  this  is 
rarely  the  case  when  two  democracies  face  each 
other.  Two  characteristics  of  modern,  stable 
democracies  explain  why.  First,  the  policy  pro¬ 
cesses  in  liberal  democracies  is  fairly  transparent, 
even  when  viewed  from  outside  the  country. 
Parliamentary  debates  are  usually  open  to  the 
public — sometimes  even  televised — and  political 
parties  outside  government,  mass  media,  and 
interest  groups  ensure  that  few  government  deci¬ 
sions  of  any  importance  are  taken  without 
scrutiny  and  public  debate.  Second,  democracies 
usually  have  rules  and  regulations  preventing 
them  from  rushing  into  war.  Some  policies  are 
made  extraordinarily  difficult,  thereby  binding  the 
policymakers  and  signaling  to  the  outside  world 
that  decisions  cannot  be  taken  without  prior 
warning.  Both  of  these  characteristics  of  modern, 
liberal  democracies  reduce  uncertainty  and  thereby 
ameliorate  the  security  dilemma.  As  summed  up 
by  Charles  Lipson  (2003),  “Because  democracies 
have  more  accurate  perceptions  of  each  other,  they 
are  better  able  to  cooperate,  build  trust  and  avoid 
war”  (p.  72). 

Other  scholars  argue  that  the  security  dilemma 
is  largely  irrelevant,  because  international  conflict 
is  not  the  result  of  status  quo  powers  seeking 
to  maximize  security  but  of  revisionist  powers 
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seeking  to  maximize  power.  If  all  states  are  status 
quo  powers,  these  critics  argue,  then  military  con¬ 
flict  would  be  extremely  rare,  because  the  world 
would  consist  of  status  quo  powers  eager  to  signal 
their  benign  intentions.  But  this  is  not  the  case: 
States  wishing  to  expand  their  power  at  the 
expense  of  others  are  the  most  important  source  of 
military  conflict  in  international  relations,  not  the 
uncertainty  and  insecurity  of  status  quo  powers. 

Today,  military  conflict  between  states  is  less 
frequent  than  throughout  most  of  the  world’s  his¬ 
tory.  At  the  same  time,  conflict  related  to  weak 
and  failed  states  is  now  a  major  source  of  instabil¬ 
ity  in  many  parts  of  the  world.  Some  scholars 
argue  that  even  though  this  development  seems  to 
fit  uneasily  with  the  idea  of  unitary  state  actors 
stuck  in  a  security  dilemma  in  an  anarchic  interna¬ 
tional  system,  the  basic  logic  of  the  security 
dilemma  may  still  be  applied.  For  instance,  Barry 
Posen,  analyzing  ethnic  violence  between  ethnic 
groups  in  a  collapsing  Yugoslavia  in  the  early 
1990s,  found  that  the  Serbs  and  Croats  experi¬ 
enced  a  security  dilemma  in  an  emerging  anarchy 
where  each  group  had  to  take  care  of  its  own 
security.  Brian  Job  extends  the  applicability  of  the 
security  dilemma  even  further  by  discussing  how  a 
similar  logic  applies  to  weak  Third  World  states 
unable  to  provide  security  for  their  own  citizens 
and  sometimes  even  constituting  a  threat  to  parts 
of  their  own  population. 
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Selection  Bias 


Selection  bias  refers  to  the  inaccurate  estimates  that 
can  occur  when  a  sample  of  data  is  nonrandomly 
selected  for  study.  Samples  may  be  purposefully 
nonrandomly  selected  by  researchers.  More  often, 
however,  nonrandom  (also  called  “selective”)  sam¬ 
ples  result  from  choices  made  by  the  actors  being 
studied  or  by  others  whose  choices  affect  them. 
This  entry  discusses  the  importance  of  such  biases 
and  the  ways  to  avoid  them. 

The  samples  available  for  studying  political  pro¬ 
cesses  often  are  selective.  For  example,  to  learn 
about  the  escalation  or  resolution  of  international 
disputes,  one  must  study  a  sample  of  states 
involved  in  disputes.  These  countries  have  made  or 
been  the  recipients  of  demands  or  threats.  To  learn 
what  leads  citizens  to  vote  for  particular  parties  or 
candidates,  one  must  study  a  sample  of  persons 
who  have  registered  to  vote  and  decided  to  go  to 
the  polls.  To  learn  what  makes  economic  sanctions 
or  peace-building  missions  effective,  one  must 
study  a  sample  of  cases  in  which  sanctions  have 
been  applied  or  civil  wars  have  ended,  respectively. 
To  learn  about  public  opinion,  one  must  study 
persons  who  have  consented  to  respond  to  surveys 
and  to  do  so  informatively.  In  each  of  these  cases, 
the  sample  available  for  study  is  not  a  random 
sample  from  the  population  of  interest;  instead,  it 
is  a  sample  that  has  been  nonrandomly  selected  or 
self-selected. 

The  economist  James  Heckman  was  awarded 
the  Nobel  Prize  in  economics  in  2000  for  his 
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work  on  methods  for  analyzing  selective  sam¬ 
ples  without  bias.  Heckman  popularized  these 
methods  and  applied  them  extensively  to  the 
study  of  labor  economics.  Christopher  Achen 
was  the  first  political  scientist  to  develop  meth¬ 
ods  for  analyzing  selective  samples.  Achen 
explained  how  nonrandom  selection  often 
affects  analysts’  attempts  to  evaluate  public 
policy.  For  example,  an  analyst  might  wish  to 
evaluate  a  pretrial  release  system:  Are  appropri¬ 
ate  criteria  used  to  decide  which  defendants  will 
and  will  not  be  released?  One  criterion  that 
judges  use  in  determining  whether  or  not  to 
release  a  defendant  is  the  seriousness  of  the 
accusation;  those  accused  of  serious  crimes  are 
less  likely  to  be  released.  The  analyst  might 
therefore  wish  to  know  whether  those  accused 
of  serious  crimes  are  more  likely  to  flee  or  com¬ 
mit  additional  crimes  if  released;  if  not,  perhaps 
judges  would  want  to  release  more  defendants 
accused  of  serious  crimes.  This  question  is  dif¬ 
ficult  to  answer,  because  the  sample  of  released 
defendants  is  selective.  When  judges  do  release 
those  accused  of  serious  crimes,  it  is  because 
they  have  some  additional  information  (e.g., 
exemplary  courtroom  behavior)  indicating  that 
these  defendants  are  unlikely  to  be  rearrested. 
Thus,  in  the  sample  of  released  defendants, 
everyone  is  unlikely  to  commit  additional 
crimes — in  some  cases  because  those  accused  of 
more  minor  crimes  are  unlikely  to  do  so  and  in 
some  because  those  who  are  released  after  being 
accused  of  serious  crimes  are  exceptional.  For 
this  reason,  a  study  of  released  prisoners  will 
show  little  or  no  relationship  between  the  seri¬ 
ousness  of  the  accusation  and  the  chance  of 
rearrest,  even  if  such  a  relationship  truly  exists. 
However,  methods  that  correct  for  selection 
bias  show  that  those  accused  of  serious  crimes 
are  more  likely  to  flee  or  be  rearrested  if  they 
are  released  without  regard  to  courtroom  behav¬ 
ior.  Failure  to  correct  for  selection  bias  might 
lead  judges  to  release  more  defendants  accused 
of  serious  crimes  and  to  regret  this  decision 
when  those  defendants  go  on  to  have  a  higher 
rearrest  rate. 

In  statistical  terms,  selection  bias  can  lead  to 
biased  or  inconsistent  estimates,  depending  on  the 
model.  Stated  more  formally,  selection  bias  refers 
to  the  biased  and/or  inconsistent  estimates  that  may 


occur  as  a  result  of  nonrandom  selection.  To  see 
the  problem,  consider  a  researcher’s  attempt  to 
test  the  hypothesis  that  pairs  of  states  are  more 
likely  to  go  to  war  when  their  military  strength  is 
nearly  equal  (they  have  power  parity);  one  ratio¬ 
nale  behind  this  hypothesis  is  that  leaders  are  less 
likely  to  know  which  side  would  win  a  war  when 
states  are  more  evenly  matched,  and  so  the  ulti¬ 
mate  loser  is  willing  to  fight.  Assume,  for  exam¬ 
ple,  that  whether  or  not  two  countries  are 
roughly  equal  in  power  determines  their  likeli¬ 
hood  of  being  in  a  military  dispute  (selecting  into 
the  sample)  as  well  as  their  likelihood  of  escalat¬ 
ing  a  dispute  to  war  (the  outcome  of  interest). 
Assume  further  that  an  unobserved  variable,  the 
interests  at  stake  in  the  dispute,  also  affects  both 
initiation  and  escalation.  Pairs  of  countries  enter 
the  sample  of  militarized  disputes  in  one  of  three 
ways.  Some  pairs  are  involved  in  disputes  because 
they  are  roughly  equal  in  power,  some  because 
one  or  both  of  the  states  has  strong  interests  at 
stake,  and  some  for  both  reasons.  Informally,  the 
problem  is  that  in  the  sample  of  disputing  dyads, 
pairs  of  countries  that  do  not  have  equal  power 
contain  at  least  one  country  with  particularly 
strong  interests  at  stake,  while  pairs  with  equal 
power  may  or  may  not  contain  a  country  with 
strong  interests  at  stake.  For  this  reason,  dyads  in 
the  sample  with  equal  power  are  likely  to  go  to 
war,  and  dyads  with  unequal  power  also  are 
likely  to  go  to  war.  When  we  use  the  countries 
involved  in  disputes  to  estimate  the  effect  of 
power  parity  on  whether  or  not  states  go  to  war, 
parity  appears  to  have  little  effect — even  though 
it  has  a  strong  one.  In  other  words,  the  estimate 
of  the  effect  is  inaccurate. 

This  problem  can  be  represented  formally.  In 
the  equations  that  follow,  the  importance  of  the 
issues  is  one  component  of  the  error  term  in  the 
selection  equation  (u  u  in  Equation  1)  and  also  one 
component  of  the  error  term  in  the  outcome  equa¬ 
tion  (u2j  in  Equation  2),  so  that  the  two  error  terms 
are  correlated: 

ff select,-  =  oq  =  7i  parity,  +  y2X2i  +  uu,  ^ 

UWar,  =  oq  =  (3  j  parity,  +  ult.  ^) 

In  this  example,  the  observed  dependent  vari¬ 
ables  are  dichotomous;  the  country  has  some 
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unobserved  utility  of  being  in  a  dispute,  and  we 
observe  that  Y1(.  =  1  (the  observation  is  in  the 
sample)  if  Useiect,  >  0  and  Y u  —  0  (the  observation 
is  not  in  the  sample)  otherwise;  similarly,  the  coun¬ 
try  has  some  unobserved  utility  of  going  to  war, 
and  we  observe  that  Y2i  —  1  (the  case  goes  to  war) 
if  UWar ,  >  0  and  Y2i  —  0  otherwise.  The  variable  X2 
is  another  variable  that  influences  selection  but  not 
the  outcome  of  interest;  it  is  called  an  exclusion 
restriction  (see  below).  Since  by  assumption  power 
parity  has  a  positive  effect  on  selection  and  on 
escalation  to  war  in  the  example,  y,  and  fSj  are 
positive.  If  the  states  have  roughly  equal  power 
(parity  has  a  value  of  1),  the  observation  is  more 
likely  to  have  a  high  value  of  Use iect,  and  select  into 
the  sample,  even  if  the  error  term,  uu,  is  small  or 
negative;  but  if  the  states  have  unequal  power  (par¬ 
ity  has  a  value  of  zero),  the  observation  will  only 
select  into  the  sample  if  the  other  independent  vari¬ 
able  (X2)  leads  it  to  do  so  or  if  the  pair  has  a  large 
value  of  the  error  term  in  the  selection  equation, 
ulr  Thus,  observations  with  parity  that  get  involved 
in  disputes  can  have  any  value  of  uu,  and  observa¬ 
tions  without  parity  that  get  involved  in  disputes 
more  often  have  large  values  of  uu. 

The  selection  process  and  the  correlated  error 
terms  in  Equations  1  and  2  lead  power  parity,  the 
independent  variable  of  interest  in  the  outcome 
equation,  to  be  correlated  with  the  error  term  in 
that  equation.  Because  of  the  correlation  between 
uXi  and  u2i ,  the  sample  of  dyads  available  to  esti¬ 
mate  the  impact  of  power  parity  on  whether  coun¬ 
tries  escalate  their  disputes  to  war  includes 
(a)  pairs  of  countries  with  equal  power  (X1(.  =  1) 
that  may  have  large  or  small  error  terms  in  the 
outcome  equation  (values  of  u2i)  and  (b)  states 
with  unequal  power  (X1;  =  0)  that  have  large  error 
terms  in  the  outcome  equation  (values  of  u2j). 
Thus,  power  parity  is  negatively  correlated  with 
the  error  term  in  the  equation  of  interest,  and  the 
researcher’s  estimate  of  its  effect  ((3J  using  stan¬ 
dard  techniques  (probit  or  logit)  is  inconsistent. 

The  problem  of  selection  bias  often  is  given 
short  shrift  by  researchers,  who  ignore  selection 
because  they  believe  that  the  estimates  are  accu¬ 
rate  for  the  sample  in  which  they  are  interested. 
However,  without  modeling  the  selection  process, 
the  researcher  cannot  obtain  an  accurate  estimate 
of  the  causal  effect  of  the  independent  variables  of 
interest  on  the  dependent  variable  in  the  equation 


of  interest,  even  for  the  sample  under  consider¬ 
ation.  The  reason  is  that  the  independent  vari¬ 
ables  of  interest  are  correlated  with  the  error  term 
in  the  selected  sample  whether  or  not  they  were 
correlated  with  the  error  term  in  the  selection 
equation;  the  selection  process  creates  the  correla¬ 
tion.  In  the  power  parity  example,  power  parity  is 
not  associated  with  a  greater  probability  of  war  in 
the  sample  of  disputes.  However,  power  parity 
does  have  a  causal  effect  on  war  for  states 
involved  in  disputes;  this  causal  effect  is  hidden 
by  the  fact  that  among  disputing  dyads,  those 
with  uneven  power  are  more  likely  to  have  serious 
disputes  due  to  the  selection  process. 

Fortunately,  the  problem  of  selection  bias  can  be 
surmounted,  given  appropriate  data.  Heckman 
showed  that  when  the  dependent  variable  in  the 
selection  equation  is  continuous,  the  problem 
described  here  leads  to  what  is  known  as  omitted- 
variable  bias  in  the  outcome  equation.  He  further 
devised  an  easy,  two-step  estimation  technique  that 
avoids  the  bias.  Other  researchers  have  developed 
techniques  for  avoiding  selection  bias  with  different 
types  of  data.  Techniques  to  avoid  selection  bias 
almost  always  require  a  valid  exclusion  restriction 
(X2i  in  Equation  1  on  previous  page),  which  is  a 
measured  variable  that  influences  whether  or  not 
the  observation  is  selected  into  the  sample  but  does 
not  affect  the  outcome  of  interest.  Critics  of  selection 
methods  have  argued  that  appropriate  exclusion 
restrictions  often  are  difficult  to  obtain;  without  a 
valid  exclusion  restriction,  evidence  from  computer 
simulations  suggests  that  selection  methods  may  be 
more  misleading  than  estimates  obtained  with  stan¬ 
dard  techniques. 

Anne  E.  Sartori 
Northwestern  University 
Evanston,  Illinois,  United  States 
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Self-Determination 


Self-determination  is  a  highly  contentious  concept 
that  encompasses  a  variety  of  meanings  and  political 
claims.  Each  of  these  claims  is  based  on  the  theory 
that  particular  population  groups  possess  an  inher¬ 
ent  right  to  control  their  own  political  institutions. 
The  term  has  been  used  in  at  least  three  different 
ways.  First,  self-determination  can  refer  to  the  col¬ 
lective  right  of  a  defined  ethnic,  linguistic,  cultural, 
or  religious  community  to  create  and  administer 
their  own  state.  This  is  often  the  foundation  for  an 
argument  in  favor  of  secession  or  irredentism,  the 
doctrine  that  populations  should  live  under  the 
sovereignty  of  the  country  to  which  they  are  ethni¬ 
cally  or  culturally  related  regardless  of  existing 
juridical  borders.  Second,  it  can  refer  to  the  right  of 
a  population  to  decide  how  they  will  be  governed 
and  who  will  represent  them  in  government,  the 
bedrock  of  democratic  theory.  Finally,  the  concept 
of  self-determination  also  represents  the  claim  that 
all  states  and  societies  have  the  right  to  determine 
their  own  political,  economic,  and  social  institu¬ 
tions.  In  this  sense,  it  is  synonymous  with  the  prin¬ 
ciple  of  sovereignty  and  nonintervention,  reflecting 
one  of  the  most  important  concepts  in  diplomatic 
practice  and  international  law.  More  specifically, 
the  principle  of  self-determination  has  been 
advanced  by  populations  challenging  the  role  of 
foreign  powers  in  influencing  their  governments 
and  its  political  structures.  This  claim  has  long  been 
associated  with  anticolonial  resistance  movements 
and,  following  decolonization,  with  efforts  by 
popular  movements  to  challenge  intervention  by 
foreign  powers  in  their  internal  affairs. 

Although  all  of  these  approaches  are  viewed  as 
paths  to  liberation,  in  practice  they  are  often  com¬ 
peting  doctrines.  Because  there  are  often  competing 
claims  over  who  the  “self”  is  in  self-determination, 
the  concept  has  been  the  source  of  both  civil  and 
international  conflict.  This  occurs  on  a  number  of 
levels. 

First,  the  liberal  principle  of  popular  sovereignty 
de-emphasizes  social  distinctions  within  the  popu¬ 
lation  as  the  political  foundation  of  the  state.  Since 
all  citizens  are  juridically  equal,  the  principle  of 
democratic  decision  making  gives  priority  to  the 
will  of  the  majority.  On  the  other  hand,  national 
sovereignty  is  based  on  the  will  of  the  nation  and 


gives  priority  to  the  needs  of  the  national  commu¬ 
nity.  It  has  been  common  over  the  past  century  to 
have  competing  claims  of  self-determination  based 
on  these  differences.  Whereas  a  national  commu¬ 
nity  may  claim  the  right  to  secede  or  amalgamate 
to  create  a  new  state  out  of  at  least  some  of  the 
territory  of  existing  states,  the  principle  of  popular 
sovereignty  holds  that  the  entire  population  should 
decide  whether  to  break  up  the  country.  Thus, 
although  a  plebiscite  or  referendum  would  reflect 
the  popular  will  and  likely  result  in  a  vote  against 
creating  a  new  sovereignty,  the  principle  of  national 
autonomy  would  demand  that  each  community  be 
given  the  right  to  decide  for  itself  with  whom  it 
wishes  to  associate.  Moreover,  the  principle  of  ter¬ 
ritorial  integrity  as  the  foundation  of  the  modern 
nation-state  system  is  inconsistent  with  secession 
except  in  the  most  extreme  circumstances. 

Second,  what  constitutes  the  national  community 
that  can  legitimately  evoke  the  principle  of  self- 
determination?  There  is  no  objective  definition  of 
nationality  that  can  be  applied  in  all  circumstances. 
Nationalists  have  alternatively  defined  themselves  in 
terms  of  culture,  ethnicity,  language,  religion,  and 
political  association.  Thus,  since  nations  are  usually 
self-defined,  their  composition  is  subject  to  contesta¬ 
tion  and  even  redefinition.  It  is  therefore  not  always 
clear  who  gets  to  define  what  constitutes  the  com¬ 
munity.  In  the  face  of  competing  claims,  at  least  one 
group’s  assertion  of  self-determination  will  inevita¬ 
bly  be  denied.  Moreover,  there  is  not  even  agreement 
around  the  foundation  of  nationhood.  Some,  such  as 
the  German  romanticists  and  their  followers,  argue 
that  nationality  is  primordial — that  is,  it  reflects  the 
long-standing  natural  divisions  inherent  in  human 
societies.  Individuals  are  born  into  their  national 
communities,  and  therefore,  one’s  national  identity 
is  not  a  matter  of  personal  choice.  Constructivists 
like  Benedict  Anderson,  on  the  other  hand,  argue 
that  nations  are  imagined  communities  that  reflect 
mental  images  of  an  affinity  that  its  members  create 
rather  than  any  natural  grouping  based  on  actual 
historical  or  physical  properties.  This  construction 
of  nationality,  Ernest  Gellner  adds,  is  primarily  a 
product  of  industrialism  and  the  alienation  from  the 
community  that  it  produces. 

Third,  the  liberal  notion  of  popular  sovereignty 
is  built  on  the  assumption  of  a  common  citizenry, 
and  therefore,  ethnicity  and  religion  had  little  to 
do  with  the  definition  of  either  the  state  or  the 
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nation.  John  Stuart  Mill,  for  example,  argued  that 
democratic  institutions  are  next  to  impossible  in  a 
country  made  up  of  different  nationalities.  Without 
a  fellow  feeling,  a  unified  public  opinion,  which 
he  argues  is  necessary  for  the  working  of  a  repre¬ 
sentative  government,  cannot  exist.  If  this  is  the 
case,  then  the  only  way  to  resolve  the  issues  raised 
above  is  through  a  policy  of  ethnic  cleansing. 
Thus,  as  Eric  Hobsbawm  holds,  the  ideal  of  the 
national  state  advocated  by  U.S.  President  Wood- 
row  Wilson  in  1919  is  one  that  could  only  be 
attained  by  barbarians. 

Self-Determination  in  Practice 

The  revolt  of  the  American  colonies  against  British 
rule  in  the  late  1 8th  century  was  the  first  successful 
assertion  of  the  right  to  self-determination  against 
an  external  ruler,  although  the  frequent  but  unsuc¬ 
cessful  Irish  revolts  against  English  rule  predated 
the  American  Revolution.  However,  though  the 
American  War  of  Independence  was  waged  against 
a  colonial  ruler,  the  cry  for  independence  by  the 
colonists  was  not  based  on  claims  of  nationalism, 
nor  did  the  revolutionaries  specifically  challenge 
colonialism  per  se.  Rather,  the  Declaration  of  Inde¬ 
pendence  was  based  on  the  democratic  principle 
that  the  consent  of  the  governed  was  the  only 
legitimate  foundation  for  government.  The  chal¬ 
lenge  to  British  taxation,  which  provided  the  initial 
spark  for  the  revolution,  centered  on  the  arguments 
that  the  colonists  were  not  represented  in  the 
British  Parliament  and,  moreover,  that  British  rule 
itself  did  not  reflect  the  will  of  the  colonial  popula¬ 
tion.  For  revolutionary  leaders  like  Thomas  Jeffer¬ 
son,  self-determination  meant  self-government  and 
popular  sovereignty.  In  this  sense,  the  right  to  self- 
determination  was  held  individually  by  each  citi¬ 
zen.  This  idea  of  popular  sovereignty  was  also  the 
foundation  of  the  1789  French  Revolution  and  its 
radical  conception  of  citizenship. 

During  the  19th  century,  the  concept  of  self- 
determination  shifted  to  collectivist  claims 
advanced  by  European  nationalist  movements 
seeking  to  create  national  states  out  of  the  large 
multiethnic  empires  that  ruled  Central  and 
Eastern  Europe.  Such  movements  argued  that  all 
national  groups  were  entitled  to  a  state  of  their 
own  and  that  national  self-determination  was 
the  only  legitimate  criteria  for  statehood.  This 


philosophy  provided  the  foundation  for  the  cre¬ 
ation  of  Italy  and  Germany,  as  well  as  for  the 
claims  advanced  by  dozens  of  ethnic  groups  liv¬ 
ing  within  the  Hapsburg  and  Ottoman  empires. 
This  conception  of  self-determination  saw  the 
right  as  one  that  was  held  collectively  by  a  self- 
defined  national,  ethnic,  linguistic,  or  religious 
community.  Sovereignty,  then,  was  held  not  by 
the  people  per  se  but  by  the  nation  as  a  whole. 
Self-government,  defined  as  one  that  reflected  the 
individual  wills  of  the  population,  was  not  a 
requirement  for  the  fulfillment  of  this  ideal. 

These  movements  reached  their  peak  around 
the  time  of  World  War  I.  Both  U.S.  President 
Woodrow  Wilson  and  the  Bolshevik  leader  Leon 
Trotsky  voiced  strong  support  for  the  concept  of 
self-determination  as  the  foundation  for  a  just 
peace.  Wilson  original  concept  of  self-determination 
was  synonymous  with  self-government.  However, 
he  later  adopted  the  nationalist  perspective,  advo¬ 
cating  a  reorganization  of  the  Hapsburg,  German, 
and  Ottoman  territories  along  national  lines  after 
the  collapse  of  the  three  empires  during  the  war. 
He  was  particularly  influenced  by  the  political 
pressures  created  by  the  publication  of  the  Bolshe¬ 
viks’  peace  program  and  by  the  agitation  for 
national  independence  that  swept  the  Czech, 
Hungarian,  Polish,  and  South  Slav  territories.  The 
core  of  the  Bolshevik  platform  for  the  settlement 
of  the  war  was  the  realization  of  national  self- 
determination  for  peoples  throughout  Europe,  a 
proposal  that  drew  particular  favor  with  the 
nationalist  movements  challenging  the  Hapsburg 
monarchy.  The  allies  knew  that  they  needed  to 
develop  a  program  of  their  own  if  they  hoped  to 
compete  for  the  hearts  and  minds  of  the  public  and 
forestall  the  spread  of  Bolshevism  among  the  disaf¬ 
fected  populations  of  Eastern  and  Central  Europe. 

Wilson  responded  to  these  events  by  reframing 
the  war  as  one  for  national  self-determination. 
There  could  be  neither  justice  nor  order,  Wilson 
argued,  so  long  as  national  (i.e.,  ethnic)  communi¬ 
ties  were  denied  the  right  to  choose  their  own 
political  affiliation.  His  solution  would  be  a  read¬ 
justment  of  borders  and  a  redistribution  of  territory 
based  on  the  ethnic  composition  of  the  population. 
He  articulated  this  political  vision  for  the  new 
European  order  in  a  series  of  speeches  known  as  the 
Fourteen  Points  Address,  the  Four  Principles,  the 
Four  Ends,  and  the  Five  Particulars. 
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At  the  same  time,  although  the  reorganization 
of  Central  and  Eastern  Europe  provided  for  the 
creation  of  a  dozen  new  states,  the  postwar  settle¬ 
ment  did  not  abolish  the  practice  of  colonialism, 
nor  did  it  provide  for  political  independence  for 
those  living  under  the  former  Ottoman  Empire.  It 
was  not  until  the  aftermath  of  World  War  II  that 
millions  of  people  living  in  Africa,  Southeast  Asia, 
and  the  Middle  East  were  granted  the  right  to  form 
their  own  states.  Following  World  War  II,  the  insti¬ 
tution  of  colonialism  quickly  collapsed.  In  some 
cases,  the  colonial  powers  determined  that  colo¬ 
nialism  was  no  longer  either  feasible  or  sustainable 
and  granted  independence  to  their  colonies.  This 
was  the  case  in  much  of  Africa.  In  other  cases, 
however,  colonial  powers  such  as  France  and 
Portugal  refused  to  do  so,  sparking  anticolonial 
rebellions  and  wars  of  national  self-determination. 
These  challenges  were  particularly  acute  in  Vietnam, 
Algeria,  Angola,  and  Rhodesia  (now  Zimbabwe). 

Yet  even  voluntary  decolonization  produced 
problems  of  its  own  with  regard  to  the  ideal  of  self- 
determination.  First,  the  European  powers  did  not 
facilitate  decolonization  in  Africa  and  the  Middle 
East  by  creating  national  states  based  on  the  com¬ 
position  of  the  ethnic,  religious,  linguistic,  or  tribal 
populations  living  there.  Rather,  they  built  new 
states  from  the  existing  colonial  boundaries  that 
had  been  established  almost  a  century  earlier  at  the 
Congress  of  Berlin.  The  self-determination  princi¬ 
ple  would  be  based  on  political  independence,  not 
on  national  or  cultural  autonomy.  While  this  was 
probably  the  most  practical  method  for  transfer¬ 
ring  sovereignty  to  the  native  populations,  it  vio¬ 
lated  the  principle  of  national  self-determination  by 
creating  states  that  failed  to  coincide  with  the  dis¬ 
tribution  of  the  traditional  political  communities 
on  the  continent.  This  not  only  forced  the  popula¬ 
tions  to  reconceptualize  their  identities  from  tribal 
or  clan-based  affinities  to  “citizens”  of  a  new 
abstract  state  that  was  alien  to  their  cultures  and 
histories,  it  also  created  significant  internal  con¬ 
flicts  among  groups  over  who  would  dominate  the 
new  states.  Thus,  conflicts  over  self-determination 
erupted  between  various  communities  within  many 
of  the  new  states,  sparked  by  resentment  from 
those  peoples  who  felt  that  they  were  being 
oppressed  by  a  dominant  tribal  or  ethnic  group. 

In  addition,  both  the  former  colonial  powers  and 
the  emerging  U.S.  superpower  began  to  exercise  a 


significant  amount  of  influence  in  the  economic  and 
political  development  of  the  new  states.  As  newly 
formed  weak  states,  the  new  formations  were  highly 
susceptible  to  external  pressure,  and  the  great  pow¬ 
ers  emerged  as  dominant  forces  within  these  states. 
The  governments  of  such  states  tended  to  be  author¬ 
itarian  and  often  worked  closely  with  these  great 
powers  in  defiance  of  the  popular  will.  This  led  to 
new  types  of  movements  for  self-determination, 
popularly  known  as  “anti-imperialist,”  often  esca¬ 
lating  into  decades-old  civil  wars.  The  resolution  of 
these  conflicts  is  still  partial  at  best. 

The  revolutions  in  Eastern  Europe  in  the  early 
1990s  did  not  only  bring  down  communist  govern¬ 
ments  allied  with  the  Soviet  Union.  They  also 
reopened  questions  of  national  identity  that  had 
been  settled  for  almost  half  a  century.  Lacking  the 
secular  identity  that  the  communist  governments 
had  provided  to  the  population,  the  emerging 
authorities  viewed  liberation  as  synonymous  with 
ethnic  nationalization.  As  such,  they  asserted  that 
their  states  would  reflect  the  will  of  their  dominant 
nations,  often  in  the  face  of  large,  newly  oppressed 
national  minorities.  Using  the  popular  sovereignty 
principle  of  majority  rule,  many  imposed  the 
dominant  nationality  on  the  entire  population, 
often  resulting  in  ethnic  conflict  and  civil  war.  This 
was  most  pronounced  in  the  breakup  of  Yugoslavia, 
a  political  entity  that  had  been  formed  after  World 
War  I  on  the  principle  of  Pan-Slavism.  Following 
the  fall  of  the  Berlin  Wall,  however,  parochial 
nationalism  emerged  as  the  dominant  political 
principle,  encouraging  the  Serbian,  Bosnian,  and 
Croatian  authorities  to  assert  their  autonomy  and 
reject  a  common  Slavic  culture.  This  resulted  in  the 
worst  political  violence  and  gross  human  rights 
violations  since  the  1930s. 
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Self-Regulation 


Self-regulation  describes  the  way  a  specific  group 
controls  and  supervises  any  or  all  aspects  of  its 
own  activities  in  terms  of  public  interest.  It  is  an 
extraordinarily  fluid  concept.  The  term  bleeds  into 
neighboring  ideas,  such  as  voluntary  regulation, 
self-policing,  self-governance,  corporate  social 
responsibility  or  coregulation.  Self-regulation 
applies  to  institutions  ranging  from  local  sports 
clubs,  to  professions,  to  global  industrial  net¬ 
works,  leaving  out  little  of  what  lies  in  between. 
Self-regulation  covers  myriad  activities  and  comes 
in  various  organizational  guises.  Consequently, 
the  idea  is  the  subject  of  some  controversy.  This 
entry  defines  the  notion,  shows  the  past  traditions, 
discusses  professional  self-regulation  with  its 
issues  and  criticisms,  and  finally  presents  its  recent 
evolution. 

Definition  and  Background 

The  term  self-regulation  implies  two  dimensions,  as 
emphasized  by  Anthony  Ogus  (1999).  First,  regula¬ 
tory  regimes  differ  in  the  amount  of  control  they 
have  over  the  supervision  they  give  the  group,  that 
is,  the  extent  of  se//-regulation.  Some  forms  of 
self-regulation  are  handled  entirely  by  the  group  in 
question,  such  as  self-regulation  of  the  medical 
profession.  In  other  instances,  notably  industry  tech¬ 
nical  standards,  self-regulation  may  rely  on  third- 
party  oversight,  such  as  the  International  Organization 
for  Standardization  (ISO).  Second,  regulatory 
approaches  may  differ  in  the  capacity  to  enforce 
regulation,  that  is,  the  ability  to  self -regulate.  Some 
self-regulatory  regimes  have  legal  or  quasi-legal 


status,  for  example,  for  law  or  accountancy. 
Others  are  little  more  than  a  set  of  normative  aspi¬ 
rations,  such  as  the  nonbinding  industry  codes  of 
conduct  in,  for  example,  the  tobacco  industry. 

Historical  antecedents  of  self-regulation  go  as 
far  back  as  ancient  Rome.  In  medieval  and 
Renaissance  Europe,  largely  self-governing  guilds 
regulated  trades  and  crafts.  According  to  Philippe 
Dollinger  (1998),  the  Hanseatic  League,  a  politi¬ 
cally  autonomous  alliance  of  trading  cities  and 
their  guilds,  was  probably  the  culmination  of 
trades-based  self-governance  in  premodern  Europe. 
In  Asia,  there  are  records  of  associations  of  crafts¬ 
men  and  trades,  called  shreni  in  India  or  hanghui 
in  China.  In  general,  premodern  forms  of  self¬ 
regulation  contributed  to  the  broader  goal  of  self- 
governance.  With  the  rise  of  the  modern  state  and 
its  monopoly  on  political  power,  self-regulation 
evolved  into  an  alternative  to  direct  state  over¬ 
sight,  albeit  an  alternative  that  was  to  complement 
rather  than  replace  the  general  regulatory  legiti¬ 
macy  of  the  state.  Particularly  in  the  English- 
speaking  world,  self-regulation  of  this  kind  is  most 
commonly  associated  with  the  oversight  of  profes¬ 
sions,  most  prominently  of  medicine  and  law.  In 
Europe  and  the  United  States,  as  noted  by  Donald 
Irvine,  much  of  the  legal  basis  for  professional  self¬ 
regulation  was  established  in  the  19th  and  20th 
centuries.  Symbolizing  the  status  of  an  indepen¬ 
dent  profession,  professional  self-regulation  has 
become  the  gold  standard  for  aspiring  vocations. 

Professional  Self-Regulation 

Proponents  of  professional  self-regulation  contend 
that  it  solves  the  problem  of  information  asymme¬ 
try  between  the  state  and  the  professions. 
Professions,  not  only  medicine  and  law  but  also 
accountancy,  have  traditionally  argued  that  effec¬ 
tive  regulation  requires  a  level  of  expert  knowledge 
found  only  within  the  profession  itself.  Regulation 
by  peers,  so  the  argument  goes,  creates  the  trust 
and  reciprocity  crucial  to  supervising  a  highly 
complex,  knowledge-intensive,  and  dynamic  field 
of  activity.  Since  expertise,  trust,  and  collegiality 
lead  to  fewer  regulatory  errors,  proponents  point 
out,  the  costs  of  self-regulation  to  the  taxpayer  are 
lower  than  in  the  case  of  external  regulation.  For 
this  reason,  professional  regulation  typically  is 
independent  and  equipped  with  considerable 
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enforcement  capacities,  for  example,  the  barring 
and  disbarring  of  legal  practitioners. 

In  the  past,  social  scientists  have  viewed  profes¬ 
sional  self-regulation  with  suspicion.  For  schools  of 
thought  committed  to  methodological  individual¬ 
ism,  such  as  Public  Choice,  self-regulation  puts  the 
lunatics  in  charge  of  the  asylum.  This  distorts  mar¬ 
ket  mechanisms  in  a  number  of  ways.  First,  by 
issuing  licenses  to  practice,  professionals  tend  to 
keep  the  supply  low  to  inflate  prices — a  claim  that 
resonates  well  with  the  fact  that,  for  example,  U.S. 
health  care  costs  have  consistently  made  up  about 
14%  of  the  gross  domestic  product.  Second, 
professional  trust  and  collegiality — articulated 
in  jargon  and  reproduced  through  impenetrable 
hierarchies — blur  the  lines  of  accountability 
between  individuals  within  professions  and  between 
the  professions  and  society.  In  this  way,  professional 
self-regulation  becomes  a  system  of  private  law  (or, 
literally,  privilege)  that,  in  the  view  of  critics,  eludes 
democratic  accountability  and  legitimates  the  viola¬ 
tion  of  consumer  rights  (Ogus,  1999).  Third,  self¬ 
regulating  institutions,  such  as  the  American 
Automobile  Association  (AAA)  in  the  United  States 
or  the  Law  Society  in  the  United  Kingdom,  occupy  a 
privileged  position  in  the  policy  process.  Equipped 
with  requisite  independence  and  legal  clout,  thinkers 
have  argued,  institutions  of  self-regulation  invari¬ 
ably  bring  about  self-serving  policy.  In  sum,  econo¬ 
mists  have  concluded,  the  practice  of  professional 
self-regulation  ensures  that  whenever  professional 
self-interest  is  incongruent  with  the  public  interest, 
the  interests  of  the  professions  prevail.  Suspending 
market  competition,  so  the  argument  goes,  creates 
opportunities  for  rent  seeking  that  no  rational  indi¬ 
vidual  could,  or  indeed  should,  let  pass.  Typically, 
economists  such  as  Milton  Friedman  have  suggested 
that  the  market  be  allowed  to  allocate  professional 
resources  and  services.  Market  competition  would 
then  regulate  all  else. 

In  the  past  3  decades  or  so,  self-regulation  has 
departed  from  its  traditional  home  within  the  pro¬ 
fessional  services.  Partly  in  response  to  shifting 
beliefs  about  the  role  of  the  state  in  regulation  and 
partly  in  response  to  emergent  global  trends,  pro¬ 
ductive  and  extractive  industries  the  world  over 
have  experimented  with  self-regulation.  Here,  the 
justification  has  been  cast  in  terms  of  efficiency  by 
pointing  out,  as  Richard  Andrews  (1998)  has  done, 
that  so-called  command-and-control  regulation  has 


proven  unwieldy,  ineffective,  and  expensive.  What 
is  more,  particularly  for  environmental  regulation, 
as  Andrews  notes,  new,  more  diffuse  forms  of 
environmental  degradation — climate  change  or 
stratospheric  ozone  depletion — elude  conventional 
end-of-pipe  regulation.  Self-regulation  of  this  kind 
is  believed  to  work  with  the  grain  of  the  market  by 
allowing  competition  to  determine  and  diffuse  best 
industrial  practices.  Unlike  traditional  professional 
self-regulation,  industry  self-regulation  takes  place 
in  widely  varying  degrees  of  independence. 
Significantly,  as  both  Richard  Andrews  and 
Virginia  Haufler  (2001)  have  observed,  very  few 
self-regulatory  regimes  for  industry  can  demon¬ 
strate  the  enforcement  capacities  of  the  traditional 
professions 

This  too  has  been  the  object  of  criticism.  For 
these  commentators,  including  Denis  Smith  and 
Steve  Tombs  (1995)  as  well  as  Andrews  (1998), 
industry  self-regulation,  however  dressed  up,  is 
almost  synonymous  with  no  regulation  and  is 
unable  to  protect  the  public  interest.  First,  critics 
argue  that  effective  market-based  self-regulation 
often  requires  rather  specific  industry  or  socio¬ 
political  conditions.  For  example,  best  practice 
models  are  important  in  industries  that  rely  on 
reputation.  In  markets  driven  predominantly  by 
price,  such  as  the  garment  industry,  best  practice 
usually  equates  to  cost  minimization.  Similarly, 
political  and  socioinstitutional  cultures  based  on 
consensus  and  cooperation,  such  as  the  Nordic 
countries,  favor  effective  industry  self-regulation 
more  than  countries  where  industrial  relations  are 
characterized  by  animosity  and  competition.  In 
industries  where  these  conditions  do  not  hold,  clas¬ 
sical  collective-action  problems,  such  as  free  riding, 
undermine  self-regulatory  regimes.  Second,  even 
under  ideal  conditions,  so  the  argument  goes,  self¬ 
regulation  by  itself  is  unlikely  to  address  the  social, 
environmental,  or  health-related  external  costs  of 
industry  activities.  Yet  it  is  precisely  these  costs  to 
the  environment,  society,  and  human  health  that 
legitimate  regulation  in  the  first  place  (Haufler, 
2001).  Third,  state  regulation  has  in  the  past  been  a 
reaction  to  the  lack  of  enthusiasm  on  part  of  indus¬ 
try  to  voluntarily  rein  in  harmful  activities  (Andrews, 
1998;  Smith  &  Tombs,  1995).  Critical  observers 
argue  that  empirical  evidence  offers  no  reason  to 
believe  that  this  has  changed.  Similarly,  proponents 
claim,  evidence  suggests  that  lack  of  institutional 
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distance  between  regulator  and  industry  often  leads 
to  agency  capture  and,  as  a  consequence,  rent- 
seeking  behavior.  The  upshot  of  the  argument  is 
that  industry  self-regulation  amounts  to  a  carte 
blanche  for  industry  to  pursue  profits  with  little  or 
no  regard  to  social,  environmental,  and  health- 
related  costs.  The  pressures  of  highly  competitive 
global  markets,  so  the  argument  goes,  ensure  that 
even  well-intentioned  firms  are  forced  into  the 
“race  to  the  bottom.  ”  The  solution  here  is  to  install 
external,  preferably  public,  instances  of  oversight. 

Despite  coming  to  opposite  conclusions,  both 
critiques  consider  self-regulation  primarily  as  a 
strategy  for  pursuing  individual  self-interest 
through  public  institutions.  Recently,  however, 
researchers  have  started  thinking  about  self¬ 
regulation  as  a  process  of  institutional  evolution. 
Ongoing  social,  political,  and  environmental 
changes  around  the  globe,  so  the  argument  goes, 
have  profoundly  transformed  the  environment  for 
regulation  in  ways  that  are  not  always  straightfor¬ 
ward.  On  the  one  hand,  globalization  blurs  orga¬ 
nizational  and  ideological  boundaries,  say,  between 
nation-states  or  the  political  Left  and  Right.  On 
the  other  hand,  it  (re-)creates  powerful  new  divi¬ 
sions  at  global  level,  for  example,  between  the  rich 
and  the  poor  or  between  the  advocates  and  skep¬ 
tics  of  climate  change  policy.  New  opportunities, 
argue  these  thinkers,  go  hand  in  hand  with  new 
problems.  For  example,  firms  today  can  operate  in 
“governance  voids”  where  regulation  is  light  and 
costs  are  low.  On  the  other  hand,  these  very  same 
firms  face  highly  discerning  consumer  citizens  who 
demand  ethically,  environmentally,  and  socially 
sound  business  practices,  all  this  at  a  time  when 
fragmentation  of  national  polities  into  highly  spe¬ 
cialized  policy  subsystems — often  spilling  over 
national  boundaries — erodes  public  sector  regula¬ 
tory  capacities. 

The  Future  of  Self-Regulation 

For  institutionalists,  the  variety  of  self-regulatory 
practices  suggests  an  open-ended  process  of  insti¬ 
tutional  evolution.  Thus,  they  argue,  it  would  be 
a  mistake  to  dismiss  out  of  hand  either  the  poten¬ 
tial  or  the  risks  of  self-regulation.  Instead,  self- 
regulatory  regimes  need  to  be  assessed  as  social 
systems  that  define  norms,  create  organizational 
structures,  and  generate  patterns  of  (recursive) 


practices.  The  complexity  and  uncertainty  of  con¬ 
temporary  regulatory  challenges  rule  out  a  priori 
assessments  of  regulatory  effectiveness.  Today, 
effective  self-regulation  has  become  “responsive 
regulation”  that  evolves  according  to  the  require¬ 
ments  of  the  specific  policy  subsystems.  Conse¬ 
quently,  institutionalists  argue  that  setting  up 
regulatory  choices  in  terms  of  polar  opposites — 
market  versus  state  versus  civil  society — is  likely 
to  yield  no  benefit.  Instead,  regulation  requires  the 
tailor-made  interplay  of  skills  and  knowledge 
found  throughout  society.  Effective  self-regulation, 
then,  is  about  a  finding  a  robust  public/private/ 
civil  society  mix.  The  precise  ingredients  of  this 
mix — the  relative  roles  and  functions  of  contend¬ 
ing  policy  actors — will  depend  on  the  institutional 
circumstances  in  which  regulation  takes  place. 

Therefore,  institutionalists  contend,  self-regulation 
requires  openness:  of  institutions,  of  networks, 
and,  not  the  least,  of  minds.  While  traditional  self¬ 
regulation  was  about  closing  off  professions  to 
external  scrutiny,  “responsive”  regulation  entails 
encouraging  experimentation  and  debate.  On  this 
view,  self-regulation  is  an  integral  part  of  a  larger 
process  of  “coregulation”  in  which  all  social  sec¬ 
tors,  including  the  state,  play  a  vital,  if  constantly 
changing,  role. 
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Bremen,  Germany 
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Semipresidentialism 


Semipresidentialism  is  a  constitutional  type  that 
combines  a  popularly  elected  head  of  state  with  a 
head  of  government  who  is  responsible  to  a  popu¬ 
larly  elected  legislature.  Semipresidentialism  is 
therefore  a  mixed  regime  form  that  combines  fea¬ 
tures  of  parliamentarism,  which  prescribes  a 
fusion  of  powers  and  mutual  dependence  between 
executive  and  legislative  powers,  and  presidential- 
ism,  which  is,  in  contrast,  characterized  by  a  sepa¬ 
ration  of  powers  and  mutual  independence  of 
executive  and  legislative  powers.  Semipresidential 
systems  thus  maintain  a  dual  power  structure  in 
which  the  head  of  state  shares  executive  power 
with  a  prime  minister  and  his  or  her  cabinet. 

Semipresidentialism  as  a  Mixed  Regime 

The  concept  of  semipresidential  government  was 
coined  during  the  1970s  by  the  French  political 
scientist  Maurice  Duverger  in  an  attempt  to  inves¬ 
tigate  and  appraise  the  constitutional  uniqueness 
of  the  French  Fifth  Republic.  In  1980,  in  what  is 
by  now  a  classic  article,  Duverger  introduced  his 
model  to  the  greater  international  public  and 
defined  semipresidentialism  as  a  combination  of 
three  elements:  (1)  the  president  is  elected  by  uni¬ 
versal  suffrage,  (2)  the  president  possesses  consid¬ 
erable  powers,  and  (3)  the  president  has  opposite 
him  or  her  a  prime  minister  and  ministers  who 
possess  executive  and  governmental  power  and 
can  stay  in  office  only  if  the  parliament  is  not  in 
opposition  to  them.  Applying  this  definition, 
Duverger  identified  seven  semipresidential  regimes, 
in  Austria,  Finland,  France,  Iceland,  Ireland, 
Portugal,  and  the  Weimar  Republic  in  Germany 
(1919-1933).  More  recent  identifications  come  up 
with  much  larger  numbers,  and  a  2008  survey  by 


Robert  Elgie  lists  no  less  than  54  semipresidential 
regimes  in  existence.  To  mention  just  a  handful  of 
cases  in  addition  to  those  identified  by  Duverger, 
the  list  includes  countries  such  as  Angola,  Bulgaria, 
Cameroon,  Egypt,  Guinea-Bissau,  Mongolia,  Sao 
Tome  and  Principe,  Senegal,  Tajikistan,  and 
Timor-Leste.  Several  of  these  more  than  50  coun¬ 
tries  have  nondemocratic  regimes  being  run  under 
the  cloak  of  semipresidentialism,  but  a  good  30  or 
so  are  partial  or  full  democracies. 

The  remarkable  increase  in  the  number  of  such 
regimes  reflects  two  partly  intertwined  circum¬ 
stances.  First,  this  type  of  regime  has  gained  much 
ground  in  recent  years.  For  instance,  the  dualistic 
logic  of  semipresidentialism  having  much  in  com¬ 
mon  with  the  previous  communist  system  of  gov¬ 
ernment,  varieties  of  semipresidentialism  have 
frequently  been  adopted  in  constitutional  develop¬ 
ments  following  the  collapse  of  communism  in 
Central  and  Eastern  Europe  and  the  former  USSR. 
Second,  the  increase  also  reflects  deviations  by 
authors  and  analysts  from  the  criteria  laid  down  by 
Duverger,  which  are,  admittedly,  contestable  and 
anything  but  transparent.  For  example,  the  crite¬ 
rion  that  the  president  is  elected  by  universal  suf¬ 
frage  is  interpreted  by  some  to  mean  a  direct  elec¬ 
tion  and  by  others  to  mean  election  by  a  popular 
vote,  either  direct  or  indirect.  While  the  first  inter¬ 
pretation  excludes  elections  in  an  electoral  college, 
the  second  does  not.  To  give  another  example, 
when  and  if  the  criterion  is  that  the  president  must 
possess  considerable  powers,  authors  may  differ  in 
their  view  of  what  set  of  powers  is  to  be  regarded 
sufficient  to  satisfy  this  criterion.  Also,  how  the 
powers  are  divided  between  president  and  prime 
minister  vary  greatly  between  countries,  and  this 
variation  carries  problems  of  classification  and 
interpretation  in  its  wake. 

Furthermore,  while  some  classifications  rely  on 
institutional  criteria  only,  others  employ  in  addi¬ 
tion  criteria  that  relate  to  actual  or  political  power. 
Clearly,  this  adds  to  the  confusion,  as  constitutions 
that  are  relatively  similar  may  still  be  applied  dif¬ 
ferently.  Duverger  was  certainly  very  much  aware 
of  this,  as  he  maintained  that  the  constitutions  of 
Austria,  Ireland,  and  Iceland  are  semipresidential, 
while  political  practice  in  these  countries  is  parlia¬ 
mentary.  In  fact,  according  to  Duverger,  the  very 
purpose  of  the  concept  of  semipresidential  govern¬ 
ment  was  to  explain  how  it  was  the  case  that  a 
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similarity  of  rules  may  still  result  in  a  diversity  of 
games.  The  differing  views  and  interpretations  of 
semipresidentialism  have  even  come  to  concern 
the  very  status  of  the  concept.  According  to  one 
school  of  thought,  the  alternation  between  presi¬ 
dential  and  parliamentary  power  that  is  inherent 
in  this  form  of  regime  implies  that  semipresiden¬ 
tialism  is  a  hybrid  form  or  a  synthesis  between 
parliamentarism  proper  and  presidentialism 
proper.  Others  maintain,  however,  that  semipresi¬ 
dentialism  is  rather  a  specific  and  separate  regime 
form.  Having  like  presidentialism  two  sources  of 
popular  legitimacy,  namely,  presidential  elections 
on  the  one  hand  and  legislative  elections  on  the 
other,  semipresidentialism,  the  argument  goes,  is  a 
regime  form  closer  to  presidentialism  than  to  par¬ 
liamentarism. 

Given  that  the  concept  of  semipresidentialism 
has  been  and  still  is  a  source  of  confusion,  it  is 
only  natural  that  most  listings  of  semipresidential 
regimes  tend  to  come  up  with  divergent  findings, 
at  times  including  cases  that  may  only  with  hesita¬ 
tion  be  referred  to  as  semipresidentialism.  For 
instance,  the  small  African  island  state  Sao  Tome 
and  Principe  often  figures  in  lists  of  semipresiden¬ 
tial  regimes;  in  this  country,  however,  the  directly 
elected  president  is  charged  with  dismissing  the 
prime  minister.  In  fact,  according  to  the  constitu¬ 
tion,  the  government  is  politically  responsible 
before  the  president  as  well  as  the  assembly. 
Clearly,  this  constitutional  order  is  some  distance 
away  from  the  basic  idea  of  semipresidentialism. 
Finland,  to  mention  another  example,  was  earlier 
regarded  by  many  as  a  model  case  of  semipresi¬ 
dentialism.  However,  presidential  powers  were 
much  reduced  in  the  new  Form  of  Government 
Act  from  the  year  2000,  and  they  now  boil  down 
to  a  statement  in  the  Finnish  Constitution  that 
foreign  policy  is  led  by  the  president  in  coopera¬ 
tion  with  the  cabinet.  Given  that  trends  toward 
internationalization  and  globalization  have  much 
blurred  the  distinction  between  foreign  and  domes¬ 
tic  policy,  the  correct  interpretation  of  this  vague 
constitutional  statement  has  become  a  source  of 
much  political  dispute  in  the  country.  The  point  to 
be  made  here,  however,  is  that  Finland  nowadays 
represents  a  much  weaker  variant  of  semipresiden¬ 
tialism  than  before.  Indeed,  future  inclusion  of 
Finland  in  the  semipresidential  camp  may  even  be 
seriously  questioned. 


Empirical  Findings  on  Semipresidentialism 

From  the  difficulties  of  definition  it  follows  that 
the  empirical  findings  concerning  the  politics  and 
impact  of  semipresidentialism  are  in  like  manner 
indistinct  and  even  contradictory.  In  practice, 
however,  semipresidential  systems  tend  to  oscillate 
between  two  forms.  On  the  one  hand,  the  presi¬ 
dent  gains  in  power  if  the  legislative  majority  is  of 
the  same  party  as  the  president.  On  the  other  hand, 
if  the  legislative  majority  is  of  divergent  parties,  it 
will  strengthen  the  position  of  the  prime  minister. 
This  second  form  is  named  cohabitation  and  may 
result  in  a  formally  or  informally  agreed-on  division 
of  labor  between  the  two  heads  of  government. 
For  instance,  in  France  during  the  cohabitation 
period  of  1986  to  1988,  President  Franqois 
Mitterrand  retained  responsibility  for  foreign 
affairs,  while  Jacques  Chirac  as  prime  minister  was 
responsible  for  domestic  affairs.  Such  inoffensive 
solutions  are  not  always  obtainable,  and  findings 
suggest  that  cohabitation  creates  political  tension 
in  times  of  crisis  and  may  in  some  cases  even  be 
responsible  for  the  collapse  of  democracy.  Of 
course,  if  authority  is  fragmented  in  the  legislature, 
with  no  party  or  coalition  enjoying  a  substantive 
majority,  the  outcome  may  be  that  the  president 
resorts  to  executive  decrees  and  emergency  rule. 
Further  findings  from  the  study  of  the  new  democ¬ 
racies  that  have  resorted  to  semipresidentialism  are 
that  the  dual  executive  is  in  general  likely  to  gener¬ 
ate  conflict;  however,  systems  that  accentuate  the 
position  of  the  prime  minister  are  likely  to  main¬ 
tain  better  democratic  performance  than  systems 
that  emphasize  the  position  of  the  president.  If 
both  the  president  and  the  prime  minister  have 
significant  powers,  the  situation  becomes  complex 
and  perhaps  even  risky  to  newly  democratizing 
regimes. 

The  main  lesson  to  be  learned  from  the  confu¬ 
sion  of  definitions  and  from  the  less  than  consis¬ 
tent  research  findings  concerning  semipresidential 
politics  is  that  there  is  not  one  kind  of  semipresi¬ 
dentialism  but  many.  In  consequence,  as  the  mani¬ 
foldness  brings  forward  operationalizations  that 
are  muddled  and  easily  contested,  the  concept  of 
semipresidentialism  should  not  really  be  used  as 
such  as  an  independent  or  dependent  variable  in 
attempts  to  chart  the  consequences  or  the  determi¬ 
nants  of  regime  choice.  True,  the  fact  that  semi¬ 
presidential  regimes  operate  in  different  ways  may 
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be  accounted  for  by  reference  to  historical  and 
contextual  factors  and  to  factors  that  relate  to  the 
operating  of  electoral  and  party  systems,  but  the 
differences  also  follow  from  the  simple  fact  that 
the  regimes  in  question  are  not  all  alike. 
Reformulations  of  the  term  are  therefore  called 
for,  and  in  recognition  of  the  analytical  difficulties, 
attempts  have  been  made  to  establish  more  finely 
differentiated  categories.  A  volume  from  the  early 
1990s  by  Matthew  Shugart  and  John  Carey  (1992) 
has  emerged  as  a  landmark  publication  in  this 
respect,  as  it  replaces  the  semipresidential  category 
with  a  distinction  between  premier-presidential 
systems  and  president-parliamentary  systems.  The 
former  type  comprises  cases  in  which  the  prime 
minister  exerts  greater  executive  power;  the  latter 
is  about  cases  in  which  the  president  wields  greater 
authority.  Attempts  have  also  been  made  at  a  fur¬ 
ther  division  of  premier-presidential  systems;  these 
attempts  build  on  the  significance  of  the  nonlegis¬ 
lative  powers  vested  in  the  presidency. 

Future  Directions 

Efforts  at  promoting  the  future  use  of  the  semi¬ 
presidential  category  should  probably  advance 
further  along  these  lines.  Semipresidentialism  has 
often  been  conceptualized  as  a  sort  of  umbrella 
model,  intended  to  cover  somehow  the  terrain 
between  parliamentarism  and  presidentialism. 
When  strictly  and  narrowly  defined  and  applied, 
the  concept  of  semipresidentialism  no  doubt 
brings  analytical  rigor  and  capacity  to  the  study  of 
political  regimes;  at  the  same  time,  however,  pre¬ 
cisely  because  it  is  strictly  and  narrowly  defined, 
the  concept  fails  to  cover  all  the  varieties  and  the 
diversified  arrangements  of  checks  and  balances 
that  have  been  introduced  in  many  contemporary 
constitutional  frameworks.  As  these  varieties  have 
been  incorporated  into  the  semipresidential  frame, 
the  frame  has  become  fuzzy  and  difficult  to  fit  into 
the  practices  of  research.  Obviously,  future  efforts 
need  to  depart  from  the  ambition  to  preserve  con¬ 
ceptual  clarity  while  advancing  the  capacity  of  the 
concept  to  cover  and  manage  an  expanded  field  of 
empirical  observations.  In  these  efforts,  one  is  well 
advised  to  consider  that  as  there  are  subtypes  of 
semipresidentialism,  there  are  subtypes  of  parlia¬ 
mentarism  and  subtypes  of  presidentialism  as 
well.  At  some  point  these  latter  subtypes  merge 


with  the  subtypes  of  semipresidentialism. 
Recognizing  this,  constitutional  political  science 
needs  to  work  out  and  adopt  in  future  research  a 
full-fledged  typology  that  describes  by  including 
also  semipresidential  subtypes  the  relative  levels  of 
parliamentarism  and  presidentialism  in  political 
regimes. 
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Separation  of  Powers 


Governmental  powers  are  separated  for  several 
reasons.  This  entry  analyzes  the  notion  and  its 
rationale,  discusses  the  concept  of  mixed  govern¬ 
ment,  assesses  the  influence  of  Montesquieu, 
reviews  the  American  experience,  and  shows  how 
the  separation  of  powers  is  actually  implemented. 

Nations  may  decide  that  it  is  necessary  to  sepa¬ 
rate  functions  that  are  considered  unique  and 
exclusive  in  nature,  such  as  the  legislative,  execu¬ 
tive,  and  judicial  powers.  Another  motivation  is  to 
divide  the  functions  of  government  to  ensure  that 
no  one  body  can  accumulate  sufficient  power  to 
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violate  the  individual  rights  and  liberties  of  citi¬ 
zens.  The  first  reason  is  analytical  and  based 
largely  on  experience  and  practicality.  The  second 
is  more  theoretical  and  value  laden,  especially 
because  it  involves  making  judgments  about  the 
human  tendency  to  abuse  power.  To  check  that 
inclination,  powers  are  kept  separate.  The  U.S. 
system  of  separation  of  powers  draws  from  both 
sources. 

The  Notion  in  the  Classic  Literature 

The  concept  of  separation  of  powers  is  found  in 
the  works  of  political  analysts  from  the  time  of  the 
Greek  philosophers  Plato  and  Aristotle.  Mis¬ 
interpretations  and  misconceptions  about  separa¬ 
tion  of  powers  are  perennial  problems.  If  there  is 
no  separation  of  powers,  one  branch  of  govern¬ 
ment  might  acquire  too  much  influence  and 
threaten  other  branches  and  the  people.  With  too 
much  separation,  government  does  not  function 
properly,  creating  conditions  that  invite  a  dictator 
to  appear  in  order  to  restore  order  and  efficiency. 

The  constitutions  of  1791  and  1848  in  France 
represented  concerted  efforts  to  establish  a  pure 
separation  of  powers.  The  first  experiment  resulted 
in  the  Committee  of  Public  Safety,  the  Directory, 
and  the  reign  of  Napoleon  Bonaparte.  The  second 
led  to  Louis  Napoleon,  reaction,  and  the  Second 
Empire.  Professor  M.  J.  C.  Vile  (1967)  observed 
that  “this  last  flirtation  with  the  pure  doctrine 
ended  in  the  same  way  as  others  had  ended  in 
France — in  absolutism”  (p.  207).  The  framers  of 
the  American  Constitution  attempted  to  avoid 
those  violent  swings.  Justice  Joseph  Story  (1905) 
said  that  the  framers  accepted  a  separation  of  pow¬ 
ers  but  “endeavored  to  prove  that  a  rigid  adher¬ 
ence  to  it  in  all  cases  would  be  subversive  of  the 
efficiency  of  the  government,  and  result  in  the 
destruction  of  the  public  liberties”  (p.  396). 

The  Concept  of  Mixed  Government 

The  framers  of  the  American  Constitution  were 
familiar  with  efforts  over  the  centuries  to  combine 
different  political  interests  to  provide  for  balanced 
government.  The  principle  of  mixed  government, 
as  a  method  of  stabilizing  a  political  order,  can  be 
traced  back  to  Greek  philosophy.  In  Book  III  of  his 
Laws ,  Plato  combined  monarchy  and  democracy 


to  form  a  well-governed  city.  Aristotle,  in  Book  IV 
of  his  Politics,  mixed  different  types  of  oligarchies 
and  democracies  to  create  a  stable  state.  He  also 
divided  government  into  separate  departments:  the 
deliberative,  executive,  and  judicial. 

In  an  effort  to  ensure  balance  and  stability  in  a 
political  order,  Polybius,  a  Roman  historian  of  the 
2nd  century  BCE,  studied  different  types  of  consti¬ 
tutions  to  prevent  ideal  forms  from  degenerating 
into  perverted  ones.  In  Book  VI  of  his  Histories,  he 
settled  on  a  mix  of  kingship,  aristocracy,  and 
democracy:  The  force  of  each  being  neutralized  by 
that  of  the  others,  neither  of  them  should  prevail 
and  outbalance  another,  but  the  constitution 
should  remain  for  long  in  a  state  of  equilibrium 
like  a  well-trimmed  boat.  Niccolo  Machiavelli,  in 
his  Discourses,  adopted  a  similar  method  of  avoid¬ 
ing  the  risks  of  a  pure  form  of  government.  It  was 
better  to  pull  out  portions  from  each  form,  so  that 
there  is  combined  under  the  same  constitution  a 
prince,  a  nobility,  and  the  power  of  the  people — 
then  these  three  powers  will  watch  and  keep  each 
other  reciprocally  in  check. 

John  Locke  studied  the  bloody  civil  wars  in 
17th-century  England,  which  pitted  the  monarchy 
against  Parliament  and  brought  ruin  to  the  people. 
What  system  of  government  could  check  against 
those  misfortunes?  He  concluded  that  what  was 
needed  was  not  a  balance  among  political  classes 
but  rather  a  division  among  political  institutions: 
legislative,  executive,  and  federative,  with  the  latter 
representing  external  affairs.  As  he  explained  in 
Book  II  of  his  Two  Treatises  of  Civil  Government 
(1690),  men  discovered  methods  of  restraining  any 
exorbitances  of  those  to  whom  they  had  given 
authority  over  them  and  of  balancing  the  power  of 
government  by  placing  several  parts  of  it  in  differ¬ 
ent  hands.  It  was  a  mistake  to  entrust  government 
functions  to  a  single  department:  It  may  be  too 
great  a  temptation  to  grasp  at  power,  for  the  same 
persons  who  have  the  power  of  making  laws  to 
have  also  in  their  hands  the  power  to  execute  them. 

In  dividing  the  departments  of  government, 
Locke  did  not  attempt  a  pure  separation.  The  leg¬ 
islative  branch  could  punish  the  executive  for  mal¬ 
administration  of  the  laws,  and  the  executive  was 
empowered  to  assemble  and  dismiss  the  legisla¬ 
ture.  Locke  did  not  provide  for  an  independent 
judiciary.  Under  his  system,  the  judicial  function 
was  placed  largely  under  the  executive.  In  1701,  a 
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decade  after  the  appearance  of  his  political  trea¬ 
tise,  the  Act  of  Settlement  in  England  helped  secure 
judicial  independence  by  guaranteeing  tenure  to 
judges  as  a  reward  for  their  good  behavior. 

From  these  studies,  one  can  appreciate  how  the 
doctrine  of  separated  power  evolved.  For  early 
writers  on  the  mixed  state,  stability  came  from 
dividing  the  sovereign  power  among  three  social 
classes:  (1)  the  king,  (2)  the  nobility,  and  (3)  the 
common  people.  In  later  developments  of  political 
thought,  authority  is  allocated  by  assigning  power 
to  three  different  governmental  bodies  and  expect¬ 
ing  them  to  check  one  another. 

Montesquieu’s  Influence 

There  have  been  frequent  claims  that  the  French 
political  theorist  Baron  de  Montesquieu  had  a 
profound  impact  on  the  American  framers  and 
their  reliance  on  separation  of  powers.  Woodrow 
Wilson  (1908)  wrote  that  the  framers  of  the 
Constitution  “followed  the  scheme  as  they  found 
it  expounded  in  Montesquieu,  followed  it  with 
genuine  scientific  enthusiasm”  (p.  56).  James 
Bryce  concluded  that  the  U.S.  Constitution  was 
created  “de  novo,  on  the  most  slender  basis  of 
pre-existing  national  institutions,”  relying  in 
large  part  on  Montesquieu’s  seminal  work,  The 
Spirit  of  the  Laws  (1748).  According  to  Bryce 
(1891),  “no  general  principle  of  politics  laid 
such  hold  on  the  constitution-makers  and  states¬ 
men  of  America  as  the  dogma  that  the  separation 
of  these  three  functions  is  essential  to  freedom” 
(PP-  20,  26). 

The  American  framers  did  indeed  refer  to 
Montesquieu  favorably  and  with  great  respect.  In 
Federalist  No.  47,  James  Madison  spoke  of  “the 
celebrated  Montesquieu”  as  the  “oracle”  who 
was  also  cited  on  the  separation  doctrine 
(Alexander  Hamilton,  James  Madison,  &  John 
Jay,  2002,  p.  337).  There  are  similarities  between 
Montesquieu’s  theory  and  the  U.S.  Constitution, 
but  the  evidence  is  persuasive  that  the  American 
framers  relied  on  their  practical  experiences  in  the 
colonies  and  state  government  after  their  break 
with  England.  Also,  Montesquieu  anticipated  a 
far  greater  separation  between  the  branches  than 
exists  in  the  U.S.  Constitution,  and  he  misper- 
ceived  the  British  form  of  government  at  the  time 
of  his  study. 


In  providing  for  the  legislative,  executive,  and 
judicial  branches,  Montesquieu  selected  the  British 
Constitution  as  his  model  and  visited  Fondon  from 
1729  to  1731  to  study  its  distribution  of  powers. 
As  one  of  several  critiques,  Frederick  Maitland 
wrote  that  it  was  “curious  that  some  political  theo¬ 
rists  should  have  seen  their  favourite  ideal,  a  com¬ 
plete  separation  of  administration  from  judicative, 
realised  in  England;  in  England  of  all  places  in  the 
world,  where  the  two  have  for  ages  been  inextrica¬ 
bly  blended”  (Kenneth  Davis,  1951,  p.  28).  To 
Walter  Bagehot  (1895),  the  English  system  repre¬ 
sented  the  “close  union,  the  nearly  complete  fusion, 
of  the  executive  and  legislative  powers”  (p.  78). 
Montesquieu’s  model  was  highly  theoretical  and 
failed  to  pick  up  political  developments  in  England. 
Power  began  to  rest  in  the  hands  of  the  Cabinet 
and  leading  ministers  such  as  Robert  Walpole  from 
1721  to  1742. 

The  concept  of  an  independent  judiciary, 
coequal  with  other  branches,  owes  little  to 
Montesquieu.  He  did  not  view  the  judiciary  as  a 
permanent  branch.  It  was  to  be  drawn  from  the 
body  of  the  people  at  different  points  of  the  year, 
as  had  been  the  custom  in  Athens  during  the  time 
of  Socrates.  Compared  with  the  legislative  and 
executive  branches,  Montesquieu  called  the  judi¬ 
ciary  to  some  extent  “next  to  nothing.”  This 
framework  for  the  judiciary  had  little  application 
to  the  U.S.  Constitution,  which  creates  a  Supreme 
Court  with  permanent  jurisdiction  and  grants  its 
members  life  tenure. 

Montesquieu  captured  well  the  tension  and 
checks  among  the  three  branches.  To  form  a  mod¬ 
erate  government,  it  was  necessary  to  combine  the 
several  powers;  to  regulate,  temper,  and  set  them  in 
motion;  to  give,  as  it  were,  ballast  to  one,  in  order 
to  enable  it  to  counterpoise  the  other.  Again  he 
stressed  how  the  experience  shows  us  that  every 
person  invested  with  power  is  apt  to  abuse  it  and 
to  carry  his  or  her  authority  as  far  as  it  will  go. 

The  American  Experience 

Much  of  what  the  American  framers  discovered 
about  the  structure  of  government  came  from 
what  they  learned  during  the  colonial  period,  their 
break  with  England,  and  efforts  under  the 
Continental  Congress  to  develop  a  form  of  govern¬ 
ment  that  would  be  both  effective  administratively 


Separation  of  Powers  2405 


and  protective  of  individual  rights.  Even  during  the 
colonial  period,  with  political  power  theoretically 
centered  in  London,  Americans  created  legislative 
bodies  and  assemblies  that  exercised  the  rudiments 
of  representative  control.  When  London  decided 
on  a  stamp  tax  in  1765  to  help  pay  for  the  costs  of 
the  Lrench  and  Indian  wars,  colonial  opposition 
was  so  intense  and  bitter  that  the  British  govern¬ 
ment  decided  to  repeal  the  statute  within  a  year. 
America  had  its  first  taste  of  political  indepen¬ 
dence  and  democratic  rule. 

After  declaring  their  independence  from  England 
in  1776,  many  of  the  American  states  wrote  into 
their  constitutions  explicit  provisions  for  a  separa¬ 
tion  between  the  branches  of  government.  New 
Hampshire,  the  last  of  the  13  states  to  form  a  con¬ 
stitution,  recognized  that  separation  could  not  be 
so  extreme  to  prevent  effective  government.  The 
three  departments  were  to  be  kept 

as  separate  from  the  independence  of  each  other, 
as  the  nature  of  a  free  government  will  admit  or 
as  is  consistent  with  that  chain  of  connection  that 
binds  the  whole  fabric  of  the  constitution  in  one 
indissoluble  bond  of  union  and  amity.  (Francis 
Thorpe,  1909,  p.  2457) 

After  the  break  with  England,  America  operated 
under  a  single  branch  of  government  at  the 
national  level:  the  Continental  Congress.  Delegates 
gathered  to  pass  laws,  sat  in  committees  to  handle 
administrative  duties,  and  sat  on  other  committees 
to  settle  adjudicatory  disputes,  especially  those 
involving  questions  of  admiralty.  By  1781,  the 
burdens  were  so  great  that  the  Congress  decided  to 
create  a  single  executive  to  carry  out  administra¬ 
tive  matters  and  set  up  a  court  of  admiralty.  Those 
bodies  were  not  independent.  They  were  creatures 
of  the  Congress  and  subject  to  legislative  control. 
But  more  from  practicality  than  theory,  the 
Congress  had  evolved  separate  bodies  to  carry  out 
the  functions  of  government  (Louis  Fisher,  1972). 

By  the  time  of  the  constitutional  convention  in 
Philadelphia  in  1787,  there  was  broad  agreement 
among  the  arriving  delegates  that  the  new  charter 
of  government  must  have  three  branches  of  gov¬ 
ernment.  The  structure  and  operation  of  the 
Continental  Congress  had  been  thoroughly  dis¬ 
credited.  It  was  well  said  by  the  historian  Francis 
Wharton  (1889)  that  the  Constitution  “did  not 


make  this  distribution  of  power.  It  would  be  more 
proper  to  say  that  this  distribution  of  power  made 
the  Constitution  of  the  United  States”  (p.  663). 

After  the  convention  had  drafted  the  Constitution 
and  adjourned,  James  Madison  confided  to 
Thomas  Jefferson  that  the  boundaries  between  the 
three  powers  “consist  in  many  instances  of  mere 
shades  of  difference.”  In  Federalist  No.  47, 
Madison  upheld  the  maxim  that  tyranny  resulted 
whenever  three  branches  were  concentrated  in  the 
same  hands.  However,  he  charged  that  the  princi¬ 
ple  had  been  “totally  misconceived  and  misap¬ 
plied”  (Hamilton  et  ah,  2002,  pp.  336-337). 
Neither  in  Montesquieu  nor  the  British  govern¬ 
ment  were  the  three  powers  separated  in  any  pure 
sense.  In  turning  to  the  state  constitutions,  the 
same  pattern  of  overlapping  powers  emerged.  The 
intent  of  Montesquieu,  Madison  said,  could  be  no 
more  than  that  in  cases  where  the  ivhole  power  of 
one  department  is  exercised  by  the  same  hands 
that  possess  the  ivhole  power  of  another  depart¬ 
ment,  the  fundamental  principles  of  a  free  consti¬ 
tution  are  subverted. 

When  the  draft  constitution  was  released  for 
public  discussion  and  ratification  by  the  states,  crit¬ 
ics  condemned  the  intermixture  of  the  branches.  By 
1788,  however,  the  doctrine  of  separated  powers  in 
any  pure  sense  had  lost  ground  to  the  concept  of 
checks  and  balances.  One  contemporary  pamphle¬ 
teer  called  the  separation  theory  a  “hackneyed 
principle”  and  a  “trite  maxim”  (Vile,  1967,  p.  153). 
Madison  and  Hamilton  had  to  contend  with  those 
who  objected  that  the  branches  of  government  were 
not  separate  but  blended.  The  impeachment  pro¬ 
cess,  for  example,  was  attacked  for  combining  leg¬ 
islative  and  judicial  powers  in  the  same  department. 
In  Federalist  No.  66,  Hamilton  argued  that  the  true 
meaning  of  the  separation  maxim  was  “entirely 
compatible  with  a  partial  intermixture”  and  that 
overlapping  was  “not  only  proper  but  necessary  to 
the  mutual  defence  of  the  several  members  of  the 
government  against  each  other”  (Hamilton  et  ah, 
2002,  p.  431).  For  those  who  objected  that  the 
treaty  process  mixed  the  executive  with  the  Senate, 
Hamilton  responded  wearily  in  Federalist  No.  75  to 
“the  trite  topic  of  the  intermixture  of  powers” 
(Hamilton  et  ah,  2002,  p.  475). 

During  the  ratification  debates,  three  states 
(North  Carolina,  Pennsylvania,  and  Virginia) 
insisted  that  a  separation  clause  be  added  to  the 
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national  bill  of  rights.  Different  proposals  were 
advanced  to  keep  the  branches  separate  and  dis¬ 
tinct.  Among  the  other  items  to  be  included  in  a 
bill  of  rights,  the  House  and  the  Senate  debated  the 
merits  of  a  separation  clause.  It  was  finally  omit¬ 
ted.  Had  it  been  adopted,  it  would  not  have 
affected  the  blending  of  departments  and  powers 
already  sanctioned  by  the  Constitution  (Fisher, 
1972). 

Separation  in  Practice 

The  framers  of  the  U.S.  Constitution  offered  gen¬ 
eral  guidance  to  the  structure  and  operation  of 
government.  Much  of  their  efforts  could  not  have 
anticipated  how  the  three  branches  would  exercise 
their  powers  or  how  the  national  government 
would  grow  in  size  and  dominance.  An  early  dis¬ 
pute  concerned  the  president’s  power  to  remove 
executive  officials.  In  1789,  when  the  first  three 
executive  departments  were  being  created,  Madison 
argued  successfully  that  the  president  possessed  an 
implied  removal  power  over  department  heads 
who  prevented  him  from  fulfilling  his  constitu¬ 
tional  duty  to  see  that  the  laws  were  faithfully 
exercised.  At  the  same  time,  also  in  1789,  Madison 
insisted  that  certain  officials  who  carried  out  what 
he  considered  duties  of  a  “judiciary”  nature  should 
not  serve  at  the  pleasure  of  the  president,  especially 
when  those  officials  were  responsible  for  determin¬ 
ing  that  federal  funds  were  being  spent  in  accor¬ 
dance  with  law. 

Another  restriction  on  presidential  control  over 
the  executive  branch  concerned  the  distinction 
between  two  duties.  Some  executive  officials  owed 
allegiance  to  the  president.  Those  were  political 
duties.  Other  responsibilities  were  to  the  statutory 
direction  from  Congress.  Those  were  legal  duties, 
referred  to  as  “ministerial  acts.”  Over  time,  courts 
understood  that  they  could  not  order  officials  to 
fulfill  political  duties  but  could  compel  the  dis¬ 
charge  of  legal  duties.  Through  this  development, 
both  legislative  and  judicial  branches  became 
involved  in  directing  the  administration  of  law 
within  the  executive  branch. 

The  Constitution  does  not  expressly  provide  for 
the  power  of  judicial  review,  allowing  courts  to 
strike  down  as  invalid  the  actions  of  the  states, 
Congress,  and  the  president.  Judicial  review 
evolved  as  another  implied  power,  invoked  in  the 


1803  case  of  Marbury  u.  Madison.  A  sentence  in 
that  decision  later  came  to  be  interpreted  as  “judi¬ 
cial  supremacy,”  meaning  that  a  U.S.  Supreme 
Court  ruling  had  finality  and  could  not  be  chal¬ 
lenged  or  overridden  by  the  other  branches.  Yet 
nothing  in  the  sentence  implies  any  level  of  finality: 
“It  is  emphatically  the  province  and  duty  of  the 
judicial  department  to  say  what  the  law  is.”  This 
means  that  courts  decide  cases,  not  that  what  they 
say  is  the  “last  word.” 

In  studying  other  governments  and  constitu¬ 
tions,  the  framers  concluded  that  single  executives 
had  involved  their  countries  in  wars  not  for  the 
national  interest  but  for  reasons  of  personal  fame 
and  ambition.  In  Federalist  No.  4,  John  Jay 
warned, 

It  is  too  true,  however  disgraceful  it  may  be  to 
human  nature,  that  nations  in  general  will  make 
war  whenever  they  have  a  prospect  of  getting  any 
thing  by  it;  nay,  absolute  monarchs  will  often 
make  war  when  their  nations  are  to  get  nothing 
by  it,  but  for  purposes  and  objects  merely 
personal,  such  as  a  thirst  for  military  glory. 

Those  motivations  led  a  ruler  “to  engage  in  wars 
not  sanctified  by  justice  or  the  voice  and  interests 
of  his  people”  (Hamilton  et  ah,  2002,  p.  101). 

The  framers  understood  that  British  theorists 
such  as  John  Locke  and  William  Blackstone  vested 
the  responsibility  for  foreign  affairs  and  war  with 
the  executive.  The  U.S.  Constitution  assigns  none 
of  those  powers  to  the  president  (foreign  com¬ 
merce,  making  treaties,  appointing  ambassadors, 
declaring  war,  or  issuing  letters  of  marque  and 
reprisal).  They  are  either  assigned  exclusively  to 
Congress  or  shared  between  the  president  and  the 
Senate  (treaties  and  appointments).  From  1789  to 
1950,  all  major  wars  were  either  declared  or 
authorized  by  Congress.  The  president  had  no 
independent  authority  to  take  the  country  from  a 
state  of  peace  to  a  state  of  war.  However,  President 
Harry  Truman  in  1950  ordered  U.S.  troops  to 
Korea  without  ever  coming  to  Congress  for  statu¬ 
tory  authority.  Since  that  time,  other  presidents 
have  also  claimed  the  right  to  engage  the  country 
in  war  without  legislative  authority,  seeking 
“authority”  instead  from  the  United  Nations 
Security  Council  or  from  countries  of  the  North 
Atlantic  Treaty  Organization  (NATO). 
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Truman  was  also  the  first  president  to  use  not 
only  express,  implied,  and  emergency  power  but 
what  has  been  called  “inherent”  authority — powers 
that  somehow  inhere  in  the  nature  of  an  office 
and  supposedly  are  not  subject  to  checks  and  bal¬ 
ances  from  other  branches.  In  claiming  that 
power  in  1952  when  seizing  steel  mills  to  prose¬ 
cute  the  war  in  Korea,  Truman  was  rebuffed  by 
the  federal  judiciary.  Other  presidents,  including 
Richard  Nixon  and  George  W.  Bush,  cited  inher¬ 
ent  authority  to  justify  their  actions  but  typically 
encountered  defeat  in  both  the  courts  and 
Congress. 

Over  the  past  6  decades,  Congress  has  often 
failed  to  defend  its  constitutional  powers  over  war 
and  spending.  It  has  repeatedly  deferred  to  presiden¬ 
tial  initiatives.  In  this  manner  and  others,  the  three 
branches  have  not  had  what  the  framers  assumed 
they  would:  the  incentive  to  fight  off  encroachments 
and  protect  their  institutional  powers.  When  checks 
and  balances  are  weak,  political  institutions  gain 
powers  that  exceed  constitutional  limits  and  threaten 
individual  rights  and  liberties.  To  the  framers,  indi¬ 
vidual  freedoms  depended  mainly  not  on  courts 
acting  as  guardians  or  the  president  acting  in  the 
“national  interest”  but  on  the  structural  checks  sup¬ 
plied  by  the  separation  of  powers. 
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Ordinary  least  squares  (OLS),  along  with  its  cous¬ 
ins  such  as  probit,  is  the  workhorse  method  of 
empirical  political  science.  Starting  with,  as  usual, 
the  linear  model, 

y  —  a  +  (3x  +  0.2  +  e,  (1) 

OLS  is  a  fine  way  of  estimating  (3  and  0  as  long  as 
x  and  z  are  exogenous.  An  explanatory  variable  is 
exogenous  if  whatever  statistical  process  deter¬ 
mines  it  does  not  depend  on  the  statistical  process 
that  determines  either  the  dependent  variable  or 
the  error  term.  If  an  explanatory  variable  is  not 
exogenous  (and  so  is  said  to  be  endogenous),  then 
OLS  has  severe  problems.  These  problems  are 
typically  worse  than  for  other  forms  of  misspecifi- 
cation  and  do  not  disappear  as  the  sample  size 
grows.  Endogeneity  issues  cannot  be  fixed  by  the 
usual  tweaks  to  OLS  (generalized  least  squares); 
different  estimation  methods  are  required. 

Predetermined  socioeconomic  characteristics 
are,  by  definition,  exogenous.  Beyond  a  few  simple 
variables  (physical  characteristics  of  people  or 
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countries),  the  argument  that  a  variable  is  exoge¬ 
nous  must  be  made  theoretically  and  must  be  made 
relative  to  the  dependent  variable  of  interest.  Thus, 
for  example,  if  we  believe  that  democracy  is  exog¬ 
enous  to  economic  development,  a  regression  of 
economic  development  on  democracy  yields  mean¬ 
ingful  results;  but  we  need  some  theory  to  know 
that  democracy  is  exogenous — that  is,  that  it  is  not 
the  case  that  increased  wealth  leads  to  more 
democracy.  Empirical  results  are  conditional  on 
this  assumption;  the  assumption  of  exogeneity 
cannot  be  tested  empirically. 

Suppose,  for  example,  that  we  are  interested  in 
explaining  attitudes  toward  political  candidates. 
Let  y  be  a  measure  of  how  much  one  likes  a  polit¬ 
ical  candidate  and  x  be  a  measure  of  how  close  the 
candidate  is  to  the  voter  on  various  issues.  It  may 
be  the  case  that  voters  who  like  a  candidate  more 
are  more  likely  to  perceive  the  candidate  as  being 
close  to  them  on  issues,  so  x  may  not  be  exoge¬ 
nous.  If  x  is  endogenous  to  y,  it  is  well  known  that 
OLS  can  be  badly  biased,  and  these  problems  per¬ 
sist  even  with  huge  sample  sizes  (i.e.,  OLS  is  not 
even  consistent).  The  solution,  worked  out  in  the 
1940s  by  various  Nobel  Prize-winning  econome¬ 
tricians  associated  with  the  Cowles  Loundation,  is 
to  estimate  a  series  of  simultaneous  equations  for 
both  x  and  y. 

Thus,  for  two  equations,  if  x  determines  y  but  y 
also  determines  x,  we  can  write 

y  =  Px  +  0z  +  e 

x  =  yy  +  kw  +  £,  (3) 

where  z  and  w  are  exogenous  and  s  and  £  are  error 
terms  that  may  well  be  correlated  with  each  other 
(but  separately  satisfy  the  usual  Gauss-Markov 
conditions).  One  issue  is  what  method  is  best  for 
estimating  the  various  model  parameters;  but  a 
more  fundamental  issue  is  whether  it  is  even  pos¬ 
sible  to  estimate  these  parameters.  The  latter  is 
called  the  identification  problem. 

Identification 

The  identification  issue  is  critical,  and  dealing  with 
it  comes  before  any  estimation  issue.  The  basic 
question  is  whether  more  than  one  set  of  parameter 
estimates  is  equally  consistent  with  the  data;  almost 


always,  if  more  than  one  set  of  estimates  is  consis¬ 
tent  with  the  data,  an  infinite  number  of  estimates 
are  consistent  with  the  data.  In  such  cases,  the 
model  is  not  identified.  In  this  situation,  even  if  we 
were  sure  we  had  ideal  estimates  of  the  parameters, 
someone  else  could  have  equally  ideal  but  different 
parameter  estimates.  This  problem  does  not  arise 
with  models  typically  estimated  via  OLS  (but  only 
because  we  assume  away  the  problem  by  assuming 
that  all  explanatory  variables  are  exogenous)  but  is 
always  a  potential  issue  with  simultaneous  equa¬ 
tion  models.  A  simple  example  shows  the  problem. 

Suppose  we  have  an  infinite  amount  of  data,  so 
there  is  no  statistical  uncertainty.  Consider  the 
model  where  x  determines  y  and  vice  versa  but 
nothing  else.  We  run  our  favorite  computer  pro¬ 
gram  and  obtain,  say, 

y  —  2x  +  3  +  8  (4) 

x  =  -y  +  2  +  £.  (5) 

Nothing  can  prevent  us  from  adding  the  two  equa¬ 
tions  together,  giving,  after  simplification, 

2  2  L  y  (6) 

Since  the  compound  error  in  brackets  looks  like  a 
perfectly  good  error  term  (remember  that  the 
errors  in  the  equations  may  be  correlated),  if  we 
are  sure  that  p  =  2,  we  are  just  as  sure  that  p  = 
1/2,  and  similarly  for  the  second  equation.  This  is 
the  unidentified  situation,  and  no  amount  of  fancy 
estimation  will  help. 

If  both  equations  contain  the  same  additional 
exogenous  variables,  there  is  nothing  to  prevent 
adding  of  the  two  equations  together  to  mimic  the 
original  equation  but  with  different  coefficients. 
But  suppose  that  theory  tells  us  that  there  is  some 
exogenous  z  that  belongs  in  the  first  equation  but 
not  in  the  second  equation,  and  the  opposite  for  a 
different  exogenous  tv.  Then,  we  cannot  add  the 
equations  together  without  making  either  y  depend 
on  w  or  x  depend  on  z.  These  conditions  must  be 
given  by  theory,  since  if  they  are  not  correct,  the 
system  is  not  identified  and  so  estimation  cannot 
proceed. 

Lor  our  example,  we  might  have  liking  for  a 
candidate  depend  on  whether  one  shares  one’s 
ethnicity  with  that  candidate  (but  where  theory 
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tells  us  that  ethnicity  does  not  affect  how  close  the 
candidate  is  to  the  voter  on  the  issues,  a  very 
strong  assumption)  and  closeness  on  the  issues 
affected  by  the  voter’s  occupation  (which  would 
not  be  expected  to  directly  affect  how  much  the 
voter  likes  a  candidate  but  might  affect  issue  posi¬ 
tions).  These  are  strong  assumptions,  but  without 
these  or  similar  assumptions  the  system  is  not  iden¬ 
tified  and  empirical  research  cannot  proceed. 
Sometimes  the  correct  conclusion  is  that  theory  is 
inadequate  for  identification;  other  times  theory 
allows  further  empirical  work.  (Resorting  to  OLS 
does  not  solve  the  problem,  since  this  would  imply, 
incorrectly,  that  x  is  exogenous.) 

We  can  check  to  see  if  a  system  is  identified  by 
counting  the  number  of  exogenous  and  endogenous 
variables  in  each  equation.  An  exogenous  variable  is 
excluded  from  an  equation  if  it  is  used  in  the  system 
but  does  not  appear  in  that  equation.  An  equation 
is  identified  if  it  has  at  least  as  many  excluded  exog¬ 
enous  variables  as  it  has  included  endogenous 
variables.  If  this  condition  is  met  exactly,  the  system 
is  said  to  be  exactly  identified;  if  there  are  more 
excluded  exogenous  variables  than  included  endog¬ 
enous  variables,  the  system  is  said  to  be  overidenti¬ 
fied.  It  is  possible  that  some  equations  are  identified, 
some  overidentified,  and  others  not  identified. 
Obviously,  we  can  say  nothing  about  the  parameter 
estimates  of  the  unidentified  equations.  For  the 
others,  we  can  proceed  to  estimation. 

It  is  possible  to  test  where  the  restrictions  that 
overidentify  a  system  are  valid,  but  the  restrictions 
to  test  where  a  system  is  just  identified  must  be 
justified  purely  theoretically.  It  is  possible  for  a 
system  to  be  weakly  identified,  where  the  excluded 
exogenous  variables  in  an  equation  have  only  a 
weak  effect  in  equations  where  they  are  theoreti¬ 
cally  included.  This  leads  to  problems  similar  to 
those  of  underidentification.  Weak  theory  is  inad¬ 
equate  for  identification  purposes. 

Two-Stage  Least  Squares 

There  are  many  good  ways  to  estimate  a  system  of 
simultaneous  equations,  but  by  and  far  the  most 
common  is  two-stage  least  squares  (2SLS).  2SLS 
works  because  we  can  always  solve  the  system  of 
equations  to  get  each  endogenous  variable  to  be  a 
linear  function  of  all  (included  and  excluded) 
exogenous  variables.  This  is  called  the  reduced 


form  of  the  system.  Each  of  the  equations  of  the 
reduced  form  can  be  well  estimated  by  OLS,  since 
all  the  explanatory  variables  in  each  equation  of 
the  reduced  form  are  exogenous. 

The  first  stage  of  2SLS  estimates  the  reduced 
form.  Each  of  these  OLS  estimations  (one  for  each 
endogenous  variables)  is  used  to  predict  the  values 
of  the  endogenous  variables  as  a  function  of  the 
exogenous  variables  in  the  usual  way.  These  pre¬ 
dicted  values  then  replace  the  endogenous  vari¬ 
ables  in  a  second  OLS  regression  (equation  by 
equation).  This  provides  good  estimates  of  the 
coefficients  of  the  system  of  equations.  If  we  just 
do  two  OLS  regressions,  the  estimated  standard 
errors  will  be  wrong,  but  any  computer  package 
will  automate  the  process  and  provide  the  correct 
standard  errors.  There  are  other  ways  of  estimat¬ 
ing  simultaneous  equation  models,  but  none  are 
great  improvements  over  2SLS. 

Simultaneous  Equations  Today 

After  the  initial  burst  of  enthusiasm,  the  estimation 
of  systems  of  simultaneous  equations  in  political 
science  (and  in  social  science,  in  general)  is  today 
rarely  done.  While  there  are  a  number  of  reasons 
for  this  (including  the  rise  of  other  research  meth¬ 
odologies  for  assessing  complex  causal  situations), 
probably  the  main  one  is  that  it  is  hard  to  find 
strong  theory  to  allow  for  the  identification  of  the 
entire  system. 

Over  the  past  decade  or  so,  there  has  been  a 
huge  increase  in  the  use  of  instrumental  variables. 
This  focuses  attention  on  a  single  equation  of 
interest  where  one  or  more  explanatory  variables 
of  interest  are  endogenous.  Researchers  then  look 
for  instruments  that  are  simply  exogenous  vari¬ 
ables  that  explain  the  endogenous  explanatory 
variables  but  do  not  explain  the  dependent  vari¬ 
able  directly.  This  method  is  identical  to  2SLS, 
except  for  the  focus  on  only  one  outcome  variable. 
Often  it  is  easier  to  find  variables  that  we  believe 
explain  one  or  more  endogenous  explanatory  vari¬ 
ables  but  do  not  explain  the  prime  dependent  vari¬ 
able  of  interest.  Thus,  2SLS  has  again  become  a 
critical  tool  but  one  that  lives  on  under  another 
name.  Note  that  instrumental  variables  have 
exactly  the  same  identification  issues  as  do  simul¬ 
taneous  equations  (except  that  we  are  interested  in 
only  one  equation,  not  the  whole  system),  and  the 
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estimation  method  is  exactly  the  same  as  2SLS. 
The  various  issues  discussed  above  thus  continue 
to  be  critical. 
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Social  Capital 


The  central  premise  of  social  capital  is  that  social 
ties  have  value.  The  term  social  capital  refers  to 
the  collective  value  of  all  social  networks  and  the 
inclinations  that  arise  from  these  networks  to  do 
things  for  each  other.  In  other  words,  social  capi¬ 
tal  refers  to  the  people  we  know  and  the  norms  of 
reciprocity  that  exist  between  us.  The  notion  of 
social  capital,  therefore,  emphasizes  a  wide  variety 
of  specific  benefits  that  flow  from  the  trust,  reci¬ 
procity,  information,  and  cooperation  associated 
with  social  networks.  In  this  way,  value  is  created 
for  people  who  are  connected  and,  at  least  some¬ 
times,  for  bystanders  as  well.  This  entry  reviews 
the  concept  of  social  capital  by  unpacking  its  per¬ 
spectives  and  components  as  well  as  assessing  its 
implications  in  social,  economic,  and  political 
arenas. 

This  simple  definition  of  social  capital  serves  a 
number  of  purposes.  First,  rather  than  focusing  on 
the  consequences  of  social  capital,  it  concentrates 
on  the  sources  thereof  and  recognizes  that  the  fea¬ 
tures  of  social  capital,  namely,  trust  and  reciproc¬ 
ity,  are  developed  in  an  iterative  process.  Second, 
this  definition  includes  the  different  dimensions  of 
social  capital  and  acknowledges  that  communities 
can  have  varied  access  to  each  dimension.  Third,  it 


identifies  community  as  the  primary  unit  of  analy¬ 
sis,  while  conceding  that  individuals  and  house¬ 
holds  can  appropriate  social  capital.  As  a  result,  the 
notion  of  social  capital  has  become  an  influential 
theoretical  concept  in  the  analysis  of  contemporary 
social  and  economic  development,  and  change  and 
cohesion  in  various  societies,  communities,  and 
groups.  In  addition,  social  capital  is  related  to  the 
description  and  explanation  of  social  action.  While 
it  features  in  much  academic  and  interdisciplinary 
discourse,  including  political  science,  economics, 
sociology,  and  social  anthropology,  it  also  has  par¬ 
allels  with  political  agendas,  social  and  economic 
development  and,  more  recently,  social  policy 
development.  This  has  to  a  large  extent  become 
visible  in  the  influence  of  international  bodies  such 
as  the  World  Bank  and  International  Monetary 
Fund  in  defining  research  areas  as  well  as  structur¬ 
ing  the  debate  surrounding  social  issues  related  to 
the  analysis  of  social  capital. 

Social  Capital:  Different  Types 
and  Perspectives 

The  wide  variation  of  interest  in  the  concept  of 
social  capital  is  partly  evident  in  early  works  by 
Pierre  Bourdieu  and  James  Coleman  and  was  popu¬ 
larized  by  the  most  widely  recognized  proponent  of 
social  capital,  Robert  Putnam.  The  focus  of  social 
capital  for  each  of  these  authors  lies  on  networks 
and  relationships  as  a  resource;  however,  they  have 
done  so  in  different  ways.  This  conceptualization  of 
the  role  of  social  relationships  in  development 
marks  a  significant  departure  from  earlier  theoreti¬ 
cal  approaches  and  as  such  has  direct  implications 
for  contemporary  development  research  and  policy. 
It  is,  therefore,  necessary  to  distinguish  between  the 
four  perspectives  on  social  capital,  which  identify 
the  types  of  social  capital  in  different  ways. 

The  Communitarian  View 

The  communitarian  perspective  equates  social 
capital  with  mostly  informal  and  local  horizontal 
associations,  such  as  clubs  and  civic  groups.  The 
focus  of  this  perspective  lies  in  the  number  and 
density  of  these  local  organizations  and  associa¬ 
tions  within  a  community.  The  basic  premise  for 
communitarians  is  that  social  capital  is  essentially 
good  and  the  more  social  capital  that  is  present  in 


Social  Capital  2411 


a  given  community,  the  more  positive  its  effect  on 
that  community’s  welfare.  Critics  of  this  view, 
however,  have  pointed  to  perverse  social  capital, 
which  exists  in  communities  that  are  isolated  or 
parochial.  Furthermore,  this  perspective  implicitly 
assumes  that  communities  are  homogeneous  and 
strive  to  achieve  the  society’s  collective  interest; 
however,  racial  and  ethnic  exclusion,  inequality, 
and  gender  discrimination  are  only  a  few  examples 
that  suggest  otherwise.  This  downside  of  social 
capital  has  the  potential  to  hinder  development. 

The  Networks  View 

The  networks  view  is  a  more  nuanced  perspec¬ 
tive  on  social  capital  as  it  stresses  the  importance 
of  vertical  and  hierarchical  as  well  as  horizontal 
associations  between  people  and  also  attempts  to 
account  for  relations  within  and  among  organiza¬ 
tional  entities  such  as  community  groups  and 
firms.  This  view  also  distinguishes  between  two 
types  of  social  capital,  namely,  bonding  and  bridg¬ 
ing  social  capital.  The  former  refers  to  strong 
intracommunity  ties  that  provide  families  and 
communities  with  a  common  purpose  and  a  sense 
of  identity,  while  the  latter  refers  to  weak  inter¬ 
community  ties,  namely,  those  that  cross  social 
and  cultural  ties  such  as  gender,  class,  race,  ethnic¬ 
ity,  religion,  and  socioeconomic  status,  which  can 
form  the  basis  for  sectarian  interests. 

The  networks  view  has  two  key  propositions. 
The  first  proposition  is  that,  although  social  capital 
can  provide  a  range  of  services  for  the  members  of 
a  community,  there  are  potential  negative  eco¬ 
nomic  consequences  resulting  from  the  costs  that 
these  ties  can  place  on  the  noneconomic  claims  of  a 
member’s  commitment.  The  second  proposition  is 
that  a  distinction  needs  to  be  made  between  the 
sources  of  social  capital  and  the  consequences 
derived  from  them.  Thus,  to  make  claims  with 
regard  to  the  efficacy  of  social  capital,  the  network 
view  acknowledges  the  necessity  of  both  strong 
intercommunity  ties  and  weak  extracommunity 
networks  and  that  the  range  of  outcomes  associ¬ 
ated  with  social  capital  are  attributable  to  the  dif¬ 
ferent  combinations  of  these  two  dimensions.  The 
strength  of  this  view  lies  in  its  ability  to  engage  in 
policy  discussion  based  on  empirical  evidence  and 
assessments  of  contending  explanations.  A  clear 
challenge,  however,  lies  in  retaining  the  integrity  of 


the  many  positive  aspects  of  bonding  social  capital 
while  trying  to  ensure  increased  access  to  formal 
institutions  and  facilitating  a  diverse  stock  of  bridg¬ 
ing  social  capital. 

The  Institutional  View 

The  third  perspective  on  social  capital  builds 
from  the  first  two  perspectives,  adding  formalized 
national  structures  to  the  mix.  The  basic  premise 
of  this  view  is  that  the  legal,  institutional,  and 
political  environment  facilitates  the  strength  of 
civil  society  and  community  networks  and  that 
social  capital  is  regarded  as  a  dependent  variable — 
in  contrast  to  the  communitarian  and  network 
views,  where  social  capital  is  regarded  as  an  inde¬ 
pendent  variable.  The  institutional  approach 
stresses  two  points:  first,  that  social  groups’  capac¬ 
ity  to  act  in  accordance  with  collective  interests 
depends  on  the  quality  of  their  formal  institutions 
and,  second,  that  the  performance  and  effective¬ 
ness  of  the  state  and  firms  are  dependent  on  their 
internal  credibility  and  competence  as  well  as  their 
external  accountability  to  civil  society. 

There  are  two  primary  approaches  to  research 
from  this  view.  The  first  approach  employs  case 
studies  that  are  based  on  comparative  history  and 
postulates  that  civil  society  is  only  able  to  thrive 
to  the  extent  to  which  the  state  encourages  it.  The 
second  approach  adopts  quantitative  cross¬ 
national  techniques  to  measure  the  impact  of 
social  divisions  and  government  performance  on 
economic  performance.  The  ability  of  this  view  to 
address  macroeconomic  policy  concerns  is  its 
greatest  strength  as  well  as  its  weakness  in  that  it 
fails  at  a  microeconomic  level. 

The  Synergy  View 

The  forth  and  most  recent  perspective  on  social 
capital,  the  synergy  view,  encompasses  an  integra¬ 
tion  of  the  work  from  the  networks  and  institutional 
views  and  focuses  primarily  on  conditions  that  may 
foster  developmental  synergies.  This  approach  iden¬ 
tifies  complementarity — mutually  supportive  rela¬ 
tions  between  various  private  and  public  actors  that 
are  embodied  in  legal  frameworks — and  embedded¬ 
ness — the  nature  of  and  the  extent  to  which  ties 
connect  citizens  and  public  officials — as  the  basis  for 
synergy  between  government  and  citizen  action.  In 
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other  words,  this  approach  can  only  work  when  the 
actions  of  public  officials  are  embodied  by  perfor¬ 
mance-oriented  organizational  environments  that 
are  credible  and  competent. 

The  synergy  view  can  be  used  for  three  funda¬ 
mental  functions:  (1)  to  identify  the  nature  of  a 
community’s  social  relationships  and  formal  insti¬ 
tutions  as  well  as  the  extent  of  the  interaction 
between  them;  (2)  to  develop  institutional  strate¬ 
gies  on  the  basis  of  these  social  relations,  with 
particular  reference  to  the  two  types  of  social 
capital — namely,  bonding  and  bridging  social  cap¬ 
ital;  and  (3)  to  determine  the  ways  in  which  the 
positive  manifestations  of  social  capital  can  coun¬ 
teract  isolationism,  sectarianism,  and  corruption. 

The  four  perspectives  on  social  capital  thus  dif¬ 
fer  in  terms  of  the  unit  of  analysis  that  each  focuses 
on;  their  treatment  of  social  capital  as  a  dependent, 
independent,  or  mediating  variable;  and  the  extent 
to  which  they  integrate  state  theory.  However,  the 
four  perspectives  on  social  capital  also  have  a 
number  of  common  features:  They  all  link  the 
social,  economic,  and  political  spheres;  they  all 
recognize  that  social  relationships  can  improve 
development  outcomes  and  that  these  outcomes 
are  dependent  on  the  horizontal  versus  hierarchi¬ 
cal  relationships  and  broader  political  and  legal 
context;  they  all  recognize  how  the  formal  and 
informal  organization  of  the  relationships  between 
economic  agents  can  improve  economic  activities; 
and  all  infer  that  desirable  social  relationships  and 
formal  institutions  have  positive  externalities. 

Components  of  Social  Capital: 

Networks,  Norms,  and  Trust 

Definitions  of  social  capital  vary  considerably,  and 
this  becomes  problematic  when  trying  to  establish 
a  set  of  indicators  for  measuring  social  capital  or  in 
reaching  a  consensus  on  the  best  ways  of  measur¬ 
ing  the  concept.  To  date,  census  data  of  groups  and 
associational  memberships  in  a  given  society  and 
survey  data  measuring  levels  of  trust  and  civic 
engagement  have  been  the  two  broadest  approaches 
taken.  Producing  anything  similar  to  that  of  a  cen¬ 
sus  of  a  society’s  stock  of  social  capital  could  be 
described  as  practically  and  logistically  impossible 
because  it  would  rely  on  the  multiplication  of  num¬ 
bers  that  are  nonexistent  or  subjectively  estimated. 
Surveys  on  trust  and  civic  engagement  are  the 


other  source  of  data  that  has  been  widely  used  as 
a  proxy  for  social  capital. 

Social  capital  exists  in  two  methodological 
dimensions:  one  qualitative  and  one  quantitative. 
The  quantitative  dimension  is  simply  the  number 
of  social  contacts  that  people  within  a  society 
have.  The  more  the  social  networks  to  which 
people  have  access,  and  the  more  extensive  those 
social  networks  are,  the  more  social  capital  a  soci¬ 
ety  has.  Extensive  and  varied  networks  are  an  asset 
in  almost  all  situations,  and  such  social  contacts 
need  not  be  strong  or  deep  on  the  personal  level. 
As  noted  previously,  there  is  also  strength  in  orga¬ 
nizations  and  societies  distinguished  by  extensive 
weak  ties.  However,  this  quantitative  dimension 
alone  is  inadequate  for  the  analysis  of  social  capi¬ 
tal  and  its  components.  In  addition  to  the  sheer 
number  of  social  contacts,  we  need  to  add  the 
quality  of  those  contacts.  The  reason  for  this  is 
obvious:  It  cannot  be  an  asset  to  know  a  lot  of 
people  whom  you  cannot  trust  or  tolerate. 
Networks  alone  thus  do  not  have  value.  Instead, 
the  value  of  contacts  and  networks  depends  on  the 
quality  of  relations — trust  and  tolerance — within 
these  networks.  The  concept  of  social  capital  can 
be  measured  by  a  wide  range  of  indicators.  Most 
empirical  researchers  have  made  contributions 
with  regard  to  three  of  the  core  elements  or  com¬ 
ponents  of  social  capital,  namely,  the  number  of 
voluntary  organizations  (networks)  that  individu¬ 
als  are  actively  engaged  in,  tolerance,  and  trust. 

Networks 

Two  approaches  have  been  used  to  measure 
social  networks,  the  most  common  of  which  has 
been  a  structural  analysis  whereby  the  focus  rests 
on  formal  associational  membership  as  opposed  to 
informal  social  ties.  Most  empirical  research  has 
relied  on  data  from  longitudinal  surveys  tracking 
the  trends  in  associational  membership;  however, 
these  are  often  limited  to  a  small  number  of  sec¬ 
tors,  such  as  religious  organizations  (church,  etc.) 
and  labor  unions.  Subsequently,  the  official  records 
of  membership  in  voluntary  organizations  such  as 
sports  and  social  clubs  and  humanitarian  societies 
have  been  the  focus  of  most  historical  institutional 
studies. 

The  importance  of  associational  membership, 
for  some  authors,  rests  with  its  capacity  to  act  as  a 
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proxy  indicator  for  both  the  structural  features 
and  the  cultural  norms  of  social  capital — that  is, 
social  networks  as  well  as  trust  and  cooperation. 
Macrolevel  trends,  or  trends  at  the  societal  level, 
are  often  investigated  among  a  number  of  volun¬ 
tary  organizations  such  as  sports  or  recreation 
clubs,  arts  and  cultural  societies,  and  professional 
associations;  however,  the  challenge  has  been  to 
measure  the  extent  to  which  associational  mem¬ 
bership  in  these  organizations  is  able  to  generate 
and  facilitate  tolerance,  reciprocal  cooperation, 
and  social  trust,  which  are  the  components  of 
social  capital. 

Although  earlier  works  regarded  formal  associ¬ 
ational  membership  as  proxy  indicators  of  social 
networks,  the  significance  and  value  of  informal 
social  ties  as  a  means  to  generate  and  facilitate  the 
norms  of  reciprocity,  mutual  trust,  and  tolerance, 
as  opposed  to  card-carrying  membership,  are 
widely  recognized  and  gaining  momentum.  In 
other  words,  although  formal  associational  mem¬ 
bership  is  regarded  as  a  strong  indicator  of  societal 
networking,  it  is  only  one  such  indicator  and  not 
necessarily  the  most  important. 

Tolerance 

Tolerance  is  consistently  related  to  perceptions 
of  threat  and  one’s  willingness  to  put  up  with  oth¬ 
ers  who  are  perceived  as  the  least  liked.  To  tolerate 
is  to  allow  something  from  which  one  recoils, 
which  one  considers  unacceptable  and  would  pre¬ 
fer  not  to  identify  with,  and  is  a  disposition 
directed  at  an  object  of  aversion  or  dislike. 
Tolerance  entails  the  forbearance  to  admit,  to 
accommodate,  to  put  up  with,  and  to  overcome 
the  hostility  generated  by  dislike.  Intolerance 
emerges  when  revulsion  finds  expression  in  preju¬ 
dice  and  hostility. 

In  a  democracy,  tolerance  requires  one  to  accept 
that  one’s  political  opponents,  even  one’s  enemies, 
have  the  right  to  participate  in  a  contest  for  power 
under  the  principled  assumption  that  they  might 
win  the  contest  and  assume  power,  albeit  limited 
and  constrained  by  the  rule  of  law.  Within  a  multi¬ 
cultural  context,  it  is  to  accept  as  fellow  and  equal 
citizens  those  people  who  hold  cultural  beliefs  and 
practices  rooted  in  values  that  may  be  unfamiliar, 
and  even  incompatible  with,  and  offensive  to  one’s 
own  cultural  beliefs.  Such  allowance  may  take  the 


form  of  negative  tolerance,  which  amounts  to  for¬ 
bearance  by  way  of  ignoring,  avoiding,  and  disen¬ 
gaging  with  the  object  of  tolerance.  In  this  case, 
tolerance  entails  having  to  grit  your  teeth  and  to 
“grin  and  bear  it.”  Positive  tolerance,  by  contrast, 
is  based  on  the  principled  assumption  of  being  able 
“to  agree  to  disagree,”  which  allows  opponents  to 
engage  with  rather  than  shun  one  another. 

Tolerance  is  considered  an  essential  requisite 
for  democracy  as  it  is  required  for  the  effective 
functioning  of  the  competitive  processes  of  repre¬ 
sentative  democracy,  as  well  as  the  effective  main¬ 
tenance  and  deepening  of  a  doctrine  and  practice 
of  human  rights.  Representation  is  achieved 
through  elections  by  organized  political  parties, 
who  compete  for  the  popular  vote.  Competition 
occurs  as  parties  present  different  campaign  prom¬ 
ises  to  the  voters  and  vie  for  the  voters’  preference 
by  showing  their  policy  positions  to  be  superior  to 
those  of  others.  This  is  a  process  of  comparative 
evaluation  that  can  be  stressful,  as  personalities 
and  the  personal  virtues  and  weaknesses  of  candi¬ 
dates  become  easily  enmeshed  with  ostensibly  dis¬ 
passionate  policy  differences. 

Policy  differences  themselves  can  also  become 
matters  of  intense  contestation  when  high  interest- 
based  stakes  are  involved  in  the  election  outcomes. 
Tolerance  is  needed  in  such  intense  contests,  as  the 
liberal  model  of  democracy  prescribes  that  all  such 
actions  be  conducted  within  the  boundaries  of  law 
and  the  rule  of  law.  Opponents  are  to  be  defeated 
at  the  ballot  box,  not  eliminated  from  the  political 
arena.  Winners  must  bear  with  losers,  and  both 
need  to  be  prepared  to  face  up  to  one  another 
again  and  again  in  subsequent  contests.  And  the 
arena  of  contestation  itself,  with  its  constraining 
laws,  has  to  be  maintained. 

The  functioning  of  human  rights  doctrines  also 
requires  tolerance  from  citizens.  Rights,  as  special 
claims  against  the  state  or  other  civic  bodies  or 
other  citizens,  function  alongside  one  another,  and 
unless  a  clear  hierarchy  of  rights  is  established, 
such  rights  hold  equal  claims  against  one  another. 
The  advancement  of  any  right  by  a  citizen  then 
becomes  contingent  on  its  not  being  infringed  on 
by  the  claims  advanced  by  other  rights  holders. 
Constant  litigation  becomes  one  way  of  dealing 
with  rival  claims,  but  for  the  effective  functioning 
of  society  on  a  regular  basis,  tolerance  becomes 
essential  as  an  operating  norm. 
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Trust 

Trust  in  any  person  emerges  from  a  prior  con¬ 
viction  that  that  particular  person  holds  no  malev¬ 
olent  intentions  toward  one  and  is  not  intent  on 
willful  exploitation  of  or  doing  harm  or  damage  to 
oneself.  Trust  is  a  disposition  toward  others  that 
follows  from  their  reputation  for  reliability,  hon¬ 
esty,  and  integrity.  A  person  is  trusted  by  others  to 
the  extent  that  these  others  have  confidence  in  the 
commitments  made  by  the  said  person — that  is, 
when  predicted,  expected,  and  promised  outcomes 
do  in  fact  materialize  accordingly.  Such  trust  can 
be  directed  or  dispersed.  If  dispersed,  it  is  referred 
to  as  generalized  trust  and  is  indicative  of  an  indi¬ 
vidual’s  overall  worldview  of  other  people.  If 
directed,  trust  is  a  disposition  that  addresses 
clearly  demarcated  categories  of  people. 

Democracy  requires  both  political  trust  and 
interpersonal  trust.  Political  trust  is  directed  at 
institutions  of  power.  In  a  liberal  democracy  these 
are,  in  the  first  instance,  the  institutions  of  political 
representation,  which  elected  representatives 
occupy  after  having  contested  free  and  fair  elec¬ 
tions.  In  electoral  contests,  promises  are  made 
thick  and  fast,  and  trusting  voters  cast  their  ballots 
on  the  expectation  that  such  promises  will  be  kept 
and  duly  executed  by  the  newly  elected  rulers.  But 
liberalism  limits  the  kinds  of  promises  politicians 
can  offer  to  the  electorate.  For  example,  human 
rights  doctrines  proscribe  promises  to  exterminate 
one’s  opponents  after  the  election  or  even  to  limit 
their  human  rights.  On  the  contrary,  an  inbuilt, 
implicit  promise  of  each  and  every  politician  has  to 
be  that  the  rule  of  law,  the  doctrine  and  practice  of 
liberal  rights,  and  the  procedural  as  well  as  sub¬ 
stantive  commitments  entailed  therein  are  going  to 
be  upheld.  Without  a  certain  threshold  level  of 
trust  in  politicians  that  such  system  maintenance 
will  take  place,  the  legitimacy  of  the  entire  liberal 
democratic  regime  is  open  to  question. 

Interpersonal  trust  is  also  of  vital  importance  to 
the  maintenance  of  democratic  regimes.  As  is  the 
case  with  political  trust,  interpersonal  trust  entails 
the  confidence  that  certain  expectations  about  the 
behavior  of  others  will  actually  materialize.  In 
interpersonal  relations  such  a  reputation  for  reli¬ 
ability  is  usually  vested  in  the  knowledge  that 
people  have  of  one  another  and  of  the  personal 
traits  that  are  attributed  to  others  on  the  basis  of 
such  knowledge  or  are  explicitly  or  implicitly 


derived  from  it.  It  is,  therefore,  not  a  generalized 
but  a  directed  form  of  trust.  People  who  know  one 
another  personally  and  meet  on  a  daily  basis  have 
the  best  opportunity  to  gain  the  knowledge  neces¬ 
sary  for  establishing  trustworthiness. 

Family,  friends,  and  community  relations  form 
this  inner  core  of  interpersonal  relations,  from 
which  reputations  for  reliability  can  be  estab¬ 
lished  and  from  which  so-called  thick  trust  can  be 
established.  Slightly  less  intimate  relations  in  the 
form  of  shared  values  and  cultural  practices  (such 
as  religion)  as  well  as  presumed  kinship  relations 
that  together  form  the  basis  of  ethnic  identities 
provide  another  source  of  knowledge  for  thick 
trust.  Thin  trust,  by  contrast,  is  established 
between  strangers  and  may  even  be  typified  by  the 
very  absence  of  overt  socially  shared  denomina¬ 
tors  such  as  language,  religion,  or  heritage.  Such 
trust  is  established  on  the  basis  of  ongoing  joint 
engagement  in  projects  and/or  more  formally 
shared  identities,  such  as  corporate  membership 
or,  at  the  national  level,  citizenship.  Both  these 
kinds  of  trust  are  required  for  the  maintenance  of 
a  liberal  society.  Thick  trust  cements  families  and 
households,  the  core  social  units  from  which 
overall  social  cohesion  is  constructed  and  the  pri¬ 
mary  units  of  general  and  political  socialization. 
Thin  trust  is  needed  to  bridge  larger  social  divi¬ 
sions  in  society  and  is  required  for  very  large 
projects  of  collaborative  endeavor  such  as  socio¬ 
economic  development,  which  typify  modernizing 
societies. 

Both  kinds  of  interpersonal  trust,  as  well  as 
political  trust,  are  ultimately  established  under  con¬ 
ditions  of  imperfect  information,  albeit  at  varying 
levels,  and  require  an  element  of  risk  taking.  The 
trusting  person  is  always  vulnerable  to  exploitation. 
The  classic  liberal  conviction  argues  that  such  risk 
taking  is  warranted  because  of  the  presumed  rea¬ 
sonableness  of  people  and  of  their  ability  to  sense 
that  they  can  pursue  their  own  interests  in  collabo¬ 
ration  with  others  rather  than  in  perpetual  opposi¬ 
tion  to  them.  And  liberals  assume  reciprocity — that 
is,  that  other  citizens  will  calculate  likewise.  One 
way  of  minimizing  such  risk  in  emergent  liberal 
societies,  and  the  attendant  vulnerability  of  trust¬ 
ing  citizens,  has  been  to  establish  an  extensive  set 
of  explicit  rules  within  which  to  construct  the  lib¬ 
eral  “open  society.”  The  fundamental  rules  are 
those  found  in  human  rights  charters,  usually 
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embodied  in  a  rigid  formal  constitution  and 
backed  up  by  an  independent  judiciary  within  a 
constitutional  state  to  oversee  its  maintenance  and 
implementation. 

Conclusion 

In  essence,  social  capital  is  a  multidisciplinary  con¬ 
cept  that  has  a  growing  influence  over  ideas  in 
health  science,  urban  studies,  regional  studies, 
social  policy,  criminology,  business  studies,  and 
social  and  economic  geography  and  history.  It  is  a 
concept  that  has  captured  the  imagination  of  poli¬ 
cymakers  and  professionals  in  many  fields  and  has 
influenced  research  and  theory  right  across  the 
social  sciences. 
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Social  Choice  Theory 


The  theory  of  social  choice  aims  to  explain  collec¬ 
tive  outcomes  as  derived  from  individual  decisions 
and  institutional  rules.  The  founding  contribu¬ 
tions  in  the  1950s  and  1960s  eroded  a  previously 
unquestioned  confidence  in  the  capability  of  cer¬ 
tain  common  aggregative  procedures,  including 
majority  rule,  to  guarantee  consistent  social 
choices  satisfying  individuals’  preferences. 
According  to  several  “impossibility”  theorems,  no 
decision  procedure  can  guarantee  social  choices 
fulfilling  some  apparently  simple  requirements  of 
fairness,  but  all  of  them  are  vulnerable  to  strategic 
manipulation  and  may  produce  arbitrary  or  unsta¬ 
ble  results.  This  entry  discusses  the  history,  prob¬ 
lems,  and  current  applications  of  social  choice 
theory. 

The  problem  of  aggregation  of  individual  pref¬ 
erences  into  a  social  welfare  function  or  a  collec¬ 
tive  decision  by  voting  led  to  revisions  of  certain 
traditional  utilitarian  assumptions  in  welfare  eco¬ 
nomics  and  democratic  theory.  The  study  of  insti¬ 
tutions,  decisions,  and  voting  rules  became  of 
paramount  interest  to  explain  inefficiencies  and 
undesirable  collective  results.  In  more  recent  devel¬ 
opments,  social  choice  theorists  have  also  contrib¬ 
uted  to  distributive  justice  theory  by  approaching 
classical  topics  such  as  auctions,  negotiations,  fair 
division,  and  other  resource  allocation  procedures. 
The  explanation  of  unsatisfactory  social  choices 
provides  room  for  normative  concerns  of  better 
institutional  and  mechanism  design. 

In  a  foundational  theorem,  Kenneth  Arrow  pro¬ 
poses  a  set  of  normative  conditions  to  make  a  social 
choice  acceptable.  They  include,  for  individual 
preferences,  (a)  transitivity  or  internal  consistency 
(if  somebody  prefers  A  to  B  and  B  to  C,  he  must 
prefer  A  to  C)  and  (b)  universal  domain,  or  admis¬ 
sion  of  all  preferences  without  previous  restrictions. 
For  social  choices,  his  conditions  are  (a)  monoto¬ 
nicity,  or  the  requirement  of  a  consistent  relation 
between  individuals’  preferences  and  the  social 
choice  (which  contains  the  Pareto  criterion  favoring 
unanimous  decisions);  (b)  independence  of  the 
social  choice  from  individual  preferences  regarding 
irrelevant  alternatives  that  cannot  win;  and 
(c)  “no  dictatorship,”  or  the  nonexistence  of  an 
automatically  decisive  actor.  According  to  Arrow’s 
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theorem,  no  decision  rule  can  fulfill  all  these  condi¬ 
tions  all  the  time. 

A  typical  example  of  inconsistency  and  instabil¬ 
ity  is  the  so-called  Condorcet  cycle.  Assume  that 
three  voters,  A,  B,  and  C,  have  different  prefer¬ 
ences  over  three  alternatives,  X,  Y,  and  Z,  as 
shown  in  these  profiles: 

A:  X  >  Y  >  Z 

B:  Y  >  Z  >  X 

C:  Z  >  X  >  Y, 

where  “>”  means  “is  preferred  to.” 

If  we  apply  majority  rule  to  comparisons  of  the 
three  alternatives  by  pairs,  we  find  that  X  is 
socially  preferred  by  majority  to  Y  (because  a 
majority  formed  by  A  and  C  prefers  X  to  Y),  Y  is 
preferred  to  Z  (because  A  and  B  prefer  Y  to  Z), 
and  Z  is  preferred  to  X  (because  B  and  C  prefer  Z 
to  X).  Thus,  we  have  a  cycle  of  successive  social 
choices:  X  >  Y  >  Z  >  X.  Even  if  the  individual 
preferences  are  complete  and  transitive,  the  social 
choice  by  majority  rule  is  not  transitive  but  is 
unpredictable — in  practice,  any  choice  would  be 
arbitrary  and  potentially  unstable. 

However,  this  and  other  paradoxes  do  not 
mean  that  all  social  choices  made  by  majority  or 
any  other  rule  are  never  fair  or  always  arbitrary 
and,  thus,  vulnerable  to  instability.  They  mean 
that  no  decision  rule  can  guarantee  that,  under  any 
distribution  of  individuals’  preferences,  the  social 
choice  will  always  fulfill  a  set  of  normative  condi¬ 
tions.  In  particular,  the  distribution  of  preferences 
in  the  above  example  is  prone  to  haphazard  results 
because  it  implies  that  the  alternatives  are  located 
on  two  “dimensions”  (since  some  of  the  prefer¬ 
ences  are  and  some  are  not  consistent  with  the 
alphabetical  order).  Generally,  it  is  not  the  case 
that  social  choices  are  always  unstable  but  that 
they  can  be  unstable.  In  further  developments,  cer¬ 
tain  conditions  regarding  individuals’  preferences 
have  been  identified  that  guarantee  consistent  and 
stable  social  choices.  Many  exercises  to  test  insta¬ 
bility  have  used  simulations  in  which  an  “impartial 
culture”  or  “random  society”  involving  an  equal 
probability  of  each  conceivable  individual  order  of 
preferences  is  assumed.  This  assumption,  however, 
maximizes  the  probability  of  inconsistent  and 
unstable  social  choices. 


An  alternative  is  to  relax  the  founding  theorems’ 
prescription  that  no  restrictions  should  be  imposed 
on  individuals’  preferences  ( or  “  universal  domain  ” ) . 
Specifically,  a  sufficient  (although  not  necessary) 
condition  for  social  choice  stability  is  the  single¬ 
peakedness  of  individual  preference  curves,  which 
is  formally  equivalent  to  the  condition  that  all 
individuals’  preferences  can  be  ordered  along  a 
single  linear  dimension  (e.g.,  the  left-right  axis). 
More  in  tune,  it  has  been  found  that  arbitrary  and 
potentially  unstable  social  choices  are  more  likely 
the  higher  the  dispersion  of  individuals’  prefer¬ 
ences  (the  proportion  of  multipeaked  individual 
curves  or  the  number  of  issue  dimensions)  and  the 
dispersion  of  alternatives  (the  number  of  alterna¬ 
tives  and  the  distance  between  them). 

These  contributions  suggest  the  advantages  of 
relatively  harmonious  societies  in  producing  con¬ 
sistent  and  stable  social  choices,  even  with  poten¬ 
tially  manipulable  procedures.  Yet  the  restriction 
of  relevant  preferences  may  also  result  from  the 
decision  process  itself,  ultimately  depending  on  the 
institutional  rules  of  the  game. 

Institutional  Rules 

A  subsequent  line  of  research  promoted  by  social 
choice  theory  attempts  to  evaluate  the  relative  per¬ 
formance  of  different  institutional  and  decision 
rules  in  satisfying  individuals’  preferences  and  pro¬ 
ducing  acceptable  social  choices.  The  impossibility 
theorems  tell  us  that  it  is  impossible  to  guarantee 
fair  and  stable  social  choices  with  any  rule.  But 
certain  rules  tend  to  produce  inconsistent  choices 
more  frequently  than  others.  In  a  world  of  uncer¬ 
tainty,  the  likelihood  of  consistency  and  stability 
may  be  a  useful  guide  to  institutional  evaluation 
and  design. 

With  these  lenses,  some  decision  procedures 
based  on  the  majority  principle  can  be  reviewed. 
The  spatial  theory  of  voting  shows  that  if  there  are 
only  two  alternatives  along  a  single  issue  or  value 
dimension  (e.g.,  the  left-right  axis),  majority  rule 
tends  to  make  the  alternative  closer  to  the  median 
voter’s  preference  the  social  choice.  By  definition, 
the  median  voter — that  is,  the  voter  whose  prefer¬ 
ence  is  located  in  an  intermediate  position  with  less 
than  half  of  the  voters  on  each  of  the  two  sides — is 
always  necessary  to  form  a  consistent  majority  on 
a  single  dimension.  Since  the  median  position  has 
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the  property  of  minimizing  the  sum  of  the  dis¬ 
tances  from  all  other  positions,  the  winner  by 
majority  rule  in  a  two-alternative  contest  can  not 
only  be  stable  but  also  minimizes  aggregate  dis¬ 
tance  from  all  individual  preferences  and  thus 
maximizes  social  utility. 

However,  this  model  relies  on  two  strong 
assumptions:  only  two  alternatives  and  a  single¬ 
dimensional  issue  space.  If  the  set  of  alternatives 
submitted  to  majority  decision  is  not  bound,  inter¬ 
ested  actors  can  manipulate  the  social  choice  by  the 
introduction  of  new  alternatives — such  as  a  party 
or  candidate’s  platform  that  includes  a  “package” 
of  proposals  on  several  issues — or  new  issue  or 
value  dimensions.  If  there  are  more  than  two  alter¬ 
natives,  even  in  a  single-dimensional  space,  no 
alternative  may  obtain  more  than  half  the  votes, 
thus  making  majority  rule  indecisive  and  unable  to 
produce  a  social  choice.  In  a  multidimensional 
space,  even  with  only  two  alternatives,  the  major¬ 
ity  winner  can  be  unpredictable,  depending  on 
which  issue  or  value  takes  higher  salience  in  voter 
choice;  in  the  long  term,  there  can  be  a  series  of 
successive  winners  relying  on  different  salient 
issues  and  values,  with  no  foreseeable  “trajectory.” 

Several  procedures  loosely  related  to  the  major¬ 
ity  principle  can  then  be  adopted.  With  plurality  or 
relative  majority  rule,  the  winner  is  the  alternative 
that  obtains  a  higher  number  of  votes  than  any 
other  alternative  while  not  requiring  a  particular 
proportion  of  votes.  It  guarantees  a  winner  (except 
for  a  tie),  but  it  may  imply  minority  support.  For 
example,  the  winner  by  simple  plurality  rule 
among  three  alternatives  may  be  the  one  most 
rejected  by  an  absolute  majority  of  voters  having 
split  their  votes  among  the  other  two  alternatives. 
From  a  social  choice  perspective,  simple  plurality  is 
usually  considered  the  least  desirable  decision  rule. 

The  procedure  of  majority  runoff  requires  an 
absolute  majority  of  votes  in  the  first  round  of  vot¬ 
ing,  while  in  a  second  round  the  choice  can  be 
reduced  to  the  two  alternatives  receiving  the  highest 
number  of  votes  in  the  first  round,  so  as  to  secure 
majority  support  for  the  winner.  Majority- 
preferential  or  instant-runoff  voting  also  requires  an 
absolute  majority  of  voters’  first  preferences,  while 
successive  counts  of  further  preferences  are  made  to 
find  an  alternative  with  majority  support.  In  con¬ 
trast  to  what  can  happen  with  plurality  rule,  these 
procedures  prevent  the  most  rejected  alternative 


from  winning,  since  if  it  has  not  been  eliminated  at 
the  first  round,  it  will  be  defeated  at  the  second 
round. 

However,  with  both  plurality  and  majority- 
runoff  rules,  the  median  voter’s  preference  can  be 
defeated  or  eliminated  in  the  first  or  successive 
rounds.  This  implies  that  the  nonmedian  winner 
by  any  of  these  procedures  might  be  defeated  by 
another  alternative  by  absolute  majority  if  the 
choice  between  the  two  were  available.  These  pro¬ 
cedures  are,  thus,  dependent  on  irrelevant  alterna¬ 
tives;  they  encourage  strategies  aimed  at  altering 
the  number  of  alternatives,  such  as  “divide  and 
win”  and  “merge  and  win,”  as  well  as  nonsincere 
or  strategic  votes  in  favor  of  a  less  preferred  but 
more  likely  to  win  alternative. 

Additional  voting  procedures  based  on  the 
majority  principle  were  invented  by  several 
Christian  philosophers  and  enlightened  academics. 
The  13th-century  Catalan  philosopher  Ramon 
Llull  and  the  18th-century  French  Marquis  de 
Condorcet  almost  concurred  in  proposing  variants 
of  pairwise  comparisons.  By  these  procedures,  an 
election  requires  multiple  rounds  of  voting  between 
all  possible  pairs  of  alternatives.  In  Llull’s  version, 
the  winner  is  the  alternative  having  won  the  great¬ 
est  number  of  pairwise  comparisons,  while  in 
Condorcet’s  version,  an  alternative  is  required  to 
win  all  pairwise  comparisons — that  is,  the  major¬ 
ity  winner  is  the  alternative  able  to  win  by  majority 
against  every  other  alternative.  When  the  alterna¬ 
tives  are  perceived  by  the  voters  as  ordered  along  a 
single  linear  dimension,  the  winner  by  Condorcet 
procedure  is  always  the  one  preferred  by  the 
median  voter.  The  Condorcet  winner  can  be  con¬ 
sidered  highly  satisfactory  for  the  electorate  and 
can  be  used  as  a  positive  reference  for  comparison 
with  the  results  obtained  with  other  procedures. 
However,  in  multidimensional  spaces,  such  an 
alternative  may  not  exist,  as  shown  with  the  exam¬ 
ple  at  the  beginning  of  this  entry. 

Another  sophisticated  procedure,  known  as 
rank  order  count,  was  devised  independently  by 
both  the  15th-century  German  Cardinal  Nicholas 
of  Cusa  and  the  18th-century  French  academic 
Jean-Charles  de  Borda.  This  procedure  requires 
that  the  voter  order  all  the  alternatives  and  award 
them  1,  2,  3,  and  more  points,  ranking  them  from 
the  least  to  the  most  preferred.  The  winner  is  the 
alternative  that  has  collected  the  highest  sum  of 
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points.  The  “Borda  winner”  can  also  be  consid¬ 
ered  highly  satisfactory  for  the  electorate  and  can 
be  used  as  a  normative  reference.  Yet  this  voting 
procedure  can  be  manipulated  because  some  vot¬ 
ers  can  award  lower  points  to  rival  alternatives 
than  would  actually  correspond  to  their  sincere 
preferences  in  order  to  prevent  their  victory.  For 
this  reason,  Cusanus  warned  that  electors  should 
“act  according  to  conscience,”  and  Borda  remarked 
that  his  procedure  was  conceived  “only  for  honest 
men.”  Nevertheless,  the  likelihood  of  making  a 
sincere  loser  a  strategic  winner  is  lower  the  higher 
the  differences  in  votes  and  ranks  between  the  two 
alternatives,  as  had  already  been  shown  by  the 
Spanish  mathematician  Joseph  Isidore  Morales  in 
Seville  in  the  late  1 8th  century. 

Approval  voting  has  also  remote  origins  in 
medieval  times.  It  allows  the  voters  to  vote  for 
those  alternatives  that  they  consider  acceptable, 
from  a  minimum  of  one  to  a  maximum  of  all 
minus  one,  and  the  alternative  with  a  plurality  of 
votes  becomes  the  winner.  Approval  voting  tends 
to  produce  broadly  consensual  social  choices.  Yet 
it  is  still  vulnerable  to  strategic  manipulation  since 
voters  with  information  regarding  others’  prefer¬ 
ences  and  whose  preferred  alternatives  have  wide 
support  can  concentrate  their  votes  on  one  or  a 
few  alternatives  and  present  their  intermediate 
preferences  as  unacceptable. 

Finally,  according  to  classical  utilitarian  assump¬ 
tions  about  cardinal  utilities,  as  presented  by  the 
English  philosopher  Jeremy  Bentham  in  the  early 
19th  century,  range  voting  gives  voters  the  oppor¬ 
tunity  to  give  different  numbers  of  votes  (or  scores) 
to  alternatives.  As  voters  have  both  the  incentive 
and  the  opportunity  to  provide  detailed  informa¬ 
tion  about  their  preferences,  this  procedure  may 
produce  the  most  satisfactory  outcome  for  the 
greatest  number  of  voters.  All  in  all,  the  aforemen¬ 
tioned  procedures  tend  to  produce  results  that  are 
more  consistent  with  individuals’  preferences  than 
plurality  rule. 


Certain  formal  findings  in  voting  theory  can 
also  enlighten  performances  of  institutional  formu¬ 
las  implying  different  degrees  of  fusion  or  division 
of  powers.  According  to  spatial  theory,  the  “single¬ 
package”  social  choice  in  a  policy  “space”  formed 
by  multiple  issues  and  dimensions  can  be  highly 
unpredictable,  as  mentioned.  This  may  correspond 
to  single-party  parliamentary  regimes,  as  well  as 
presidential  regimes  with  a  president’s  party  major¬ 
ity  in  the  assembly,  where  a  single  election  may 
become  decisive  for  all  the  multiple  policy  issues 
that  may  enter  the  government’s  agenda.  These 
institutional  frameworks  tend  to  produce  relatively 
changing  and  unstable  policies. 

By  contrast,  in  multiparty  elections  producing 
coalition  cabinets,  different  issues,  roughly  corre¬ 
sponding  to  different  portfolios,  can  be  dealt  with 
separately  on  single-issue  spaces.  Also,  in  regimes 
of  separation  of  powers,  each  separate  election  for 
a  different  office  can  focus  on  one  or  a  few  issues 
and  favor  the  consistency  and  stability  of  social 
choices.  In  these  institutional  frameworks,  each 
issue  can  be  the  subject  of  a  broad  multiparty  or 
interinstitutional  agreement  around  the  median 
voter’s  position,  which  can  preclude  drastic 
changes  and  induce  policy  stability  in  the  mid-  or 
long  term. 

A  Model  of  Social  Choice 

A  simple  geometrical  model  can  illustrate  the  rel¬ 
evance  of  social  choice  models  for  the  analysis  of 
institutions  and  their  performance.  Let  us  use  the 
simplest  case  of  an  electorate  composed  of  three 
voters  with  different  preferences  (or  three  voters’ 
groups  with  the  same  preferences  and  a  similar 
number  of  members).  If  the  voters’  preferences  can 
be  located  on  a  single-issue  dimension,  such  as  A, 
B,  and  C  on  the  horizontal  axis  in  Figure  1,  the 
median  voters’  preference,  B,  may  win  any  elec¬ 
tion  by  majority  rule  against  each  of  the  other 
alternatives. 


Median  voters’ 
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Figure  I  A  Single-Dimensional  Electorate 
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If,  alternatively,  there  are  two  issue  dimensions, 
the  three  voters,  A,  B,  and  C,  can  hold  different 
preferences  on  Issue  1  (a15  b15  and  c2)  and  on  Issue 
2  (a„  b2,  and  c,),  as  presented  in  Figure  2.  Then 
different  institutional  formulas  involving  separate 
and  joint  elections  can  produce  different  social 
choices.  Let  us  assume,  first,  that  there  are  two 
separate  elections  for  different  offices,  such  as  con¬ 
gress  and  presidency  or  two  chambers  in  parlia¬ 
ment,  dealing  with  different  sets  of  issues  or  issue 
dimensions.  On  each  separate  election,  the  inter¬ 
mediate  alternative  close  to  the  median  voter  is 
advantaged  and  may  win.  In  Figure  2,  bj  wins  in 
the  election  on  Issue  1,  and  a2  wins  in  the  election 
on  Issue  2.  The  social  choice  is  represented  by  the 
intersection  point  of  the  winning  positions  on  each 
issue,  b,-a2.  As  can  be  seen,  the  social  choice  of 
separate  elections  on  different  issues  under  the 
above  assumptions  is  a  somewhat  centrist  point 
located  inside  the  minimal  set  containing  all  vot¬ 
ers’  preferences,  or  the  Pareto  set  (the  triangle  ABC 
in  the  figure). 

Now,  let  us  assume,  alternatively,  that  the  social 
choice  on  all  the  issues  is  made  in  a  single  election, 
as  would  correspond  to  a  simple  institutional 


framework,  such  as  a  unicameral  parliamentary 
regime  by  plurality  rule.  The  institutional  setting 
forces  the  voters  to  choose,  not  between  alterna¬ 
tives  on  separate  issues  but  between  packages  of 
alternatives  on  all  the  issues  at  the  same  time.  The 
set  of  possible  winners,  or  win  set,  in  such  a  joint 
election  depends  on  the  status  quo.  Let  us  adopt 
the  hypothesis  that  the  status  quo  is  the  social 
choice  previously  produced  by  two  separate  elec¬ 
tions,  the  point  bj-a,  in  Figure  2.  The  set  of  possi¬ 
ble  winners  in  a  single,  two-dimensional  election 
from  this  point  is  represented  by  the  multipetal 
shaded  area  in  the  figure.  This  is  formed  by  circu¬ 
lar  indifference  curves  around  the  voters’  prefer¬ 
ences  and  crossing  the  status  quo.  It  is  assumed 
that  every  voter  prefers  the  alternatives  that  are 
closer  to  the  voter’s  preference  and  in  particular 
prefers  those  inside  the  indifference  curve  to  those 
outside.  Accordingly,  the  set  of  possible  majority 
winners  in  a  joint  election  is  formed  by  all  the 
points  at  which  a  majority  of  voters  (any  majority 
of  two  voters  out  of  three  in  the  example)  is  more 
satisfied  than  they  would  be  in  the  status  quo;  that 
is,  the  win  set  is  formed  by  the  intersections  of 
pairs  of  indifference  curves. 


Issue  2 


Figure  2  Joint  and  Separate  Elections 
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As  can  be  seen,  the  set  of  possible  winners  in  a 
joint  election  is  relatively  large,  which  makes  the 
prediction  of  results  difficult.  A  number  of  possible 
social  choices  are  located  outside  the  Pareto  set 
and  even  beyond  the  rank  of  voters’  preferences 
(e.g.,  the  set  includes  some  points  located  beyond 
the  extreme  preference,  b2,  on  Issue  2).  The  possi¬ 
bility  that  many  different  alternatives  can  win  may 
generate  instability  in  a  series  of  successive  social 
choices,  since  any  winning  point  can  be  further 
beaten  by  some  other  point  in  the  corresponding 
win  set.  This  analysis  allows  us  to  state  that  a  joint 
election  on  a  multidimensional  set  of  issues,  as  a 
model  for  the  typical  single  election  in  a  simple 
regime,  can  be  more  uncertain  and  unstable  over 
time  than  separate  elections  on  different  issues, 
such  as  elections  for  different  offices  in  regimes 
with  division  of  powers. 

How  to  Choose 

The  analysis  of  the  relative  successes  and  pitfalls  of 
different  institutional  rules  has  moved  social  choice 
theorists  to  study  how  institutions  are  chosen  and 
how  they  ought  to  be  chosen  in  order  to  favor  fair 
and  stable  developments — or  the  problem  of 
choosing  how  to  choose.  Game  theoretical  models 
have  also  been  used  for  the  analysis  of  strategic 
behavior  in  such  a  field.  In  consistency  with  the 
assumptions  regarding  human  behavior  within 
institutional  constraints,  it  can  be  assumed  that 
people  may  prefer,  choose,  and  support  those  insti¬ 
tutional  formulas  producing  satisfactory  results  for 
themselves  and  reject  those  making  them  perma¬ 
nently  excluded  and  defeated.  As  a  consequence, 
institutions  producing  widely  distributed  satisfac¬ 
tory  social  choices  should  be  better  able  to  develop 
endogenous  support  and  endure.  In  general,  widely 
representative  social  choices  should  feed  support 
for  the  corresponding  institutions,  while  exclusion¬ 
ary,  biased,  or  arbitrary  outcomes  might  foster 
rejection  of  the  institutions  producing  such  results. 
These  findings  have  led  to  the  establishment  of  a 
positive  relation  between  institutional  pluralism, 
social  efficiency,  and  democratic  stability. 
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Social  Cohesion 


In  the  work  of  the  founders  of  sociology,  society 
was  conceived  of  as  an  organism  that  needed  to 
respond  to  the  main  challenge  posed  by  modernity: 
How  do  societies  marked  by  social  differentiation 
and  distinct  social  groups  create  mechanisms  to 
live  together?  The  concept  of  social  cohesion  fea¬ 
tures  prominently  in  the  work  of  its  originator, 
Emile  Durkheim  (1858-1917),  for  example,  and 
was  later  revisited,  though  not  always  under  the 
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same  name,  in  the  functionalist  tradition.  However, 
Durkheim,  in  his  major  work  on  the  subject 
(1893),  used  the  notion  of  solidarity  to  describe 
ways  in  which  social  integration  is  achieved. 
Solidarity  is  a  “force”  that  binds  the  different 
parts  of  a  whole,  creating  bonds  and  relations  of 
reciprocity  between  individuals. 

Progressively,  and  more  so  in  recent  times,  and 
far  removed  from  the  initial  analytical  use  that 
Durkheim  gave  to  the  concept  of  solidarity,  social 
cohesion  has  become  increasingly  intermingled 
with  normative  and  policy-making  purposes. 
Therefore,  the  current  literature  reflects  two  differ¬ 
ent  visions  of  social  cohesion.  The  first  perspective 
is  policy  oriented  and  part  of  the  political  dis¬ 
course;  it  strives  to  put  in  place  a  battery  of  objec¬ 
tive  indicators  for  assessing  the  degree  of  social 
cohesion  observable  in  a  society.  The  second,  by 
contrast,  a  sociological  perspective,  interrogates 
the  nature  of  social  cohesion,  focusing  on  social 
dynamics  and  the  cultural  and  political  mecha¬ 
nisms  that  bind  society.  The  following  presenta¬ 
tion  focuses  on  both  visions. 

The  Policy-Making  Concept 
of  Social  Cohesion 

The  concept  of  social  cohesion  adopted  by  the 
European  Union  (EU)  is  an  excellent  example  of  the 
first  strategy.  It  is  essentially  a  normative  reference 
associated  with  operational  criteria  revolving  around 
indicators  (e.g.,  employment,  health)  that  are 
selected  by  public  debate,  politicians,  and  techno¬ 
crats.  This  predominant  concept  of  social  cohesion 
in  the  current  international  debate  was  developed  in 
the  1990s  by  the  EU  as  part  of  a  political  discourse 
imbued  with  an  essentially  normative-evocative 
meaning  that  seeks  to  define  a  desirable  horizon  for 
society.  The  concept  of  social  cohesion  is  defined  as 

the  capacity  of  a  society  to  ensure  the  welfare  of 
all  its  members,  minimising  disparities  and 
avoiding  polarisation.  A  cohesive  society  is  a 
mutually  supportive  community  of  free  individuals 
pursuing  these  common  goals  by  democratic 
means.  (European  Committee  for  Social  Cohesion, 
2004,  p. 3) 

The  notion  of  social  cohesion  is  believed  to  rep¬ 
resent  the  central  values  of  solidarity  and  equality 


that  are  the  original  features  of  the  “European 
model.”  It  is  contrasted  sharply  with  the  so-called 
Anglo-Saxon  (English-speaking  countries)  model, 
which  is  regarded  as  steeped  in  more  individualis¬ 
tic  values  and  less  concerned  with  distributive 
aspects  or  the  state’s  role  as  guarantor  of  the  com¬ 
mon  good  (European  Commission,  2007). 

According  to  Gosta  Esping-Andersen,  Duncan 
Gallie,  Anton  Hemerijk,  and  John  Myers  (2002), 
explicitly  underlying  the  EU’s  concerns  with  social 
cohesion  are  the  changes  in  its  economy  and 
demography  in  recent  decades,  particularly  its 
inclusion  in  the  globalization  process  and  the  sub¬ 
sequent  impact  on  job  creation/unemployment  and 
distribution  of  wealth,  and  the  resultant  challenges 
to  its  welfare  state.  These  changes  provoke  social 
tensions,  which,  in  turn,  jeopardize  social  cohesion. 
The  EU  concept  of  social  cohesion  has  to  do  with 
an  effort  to  maintain  the  (idealized)  standard  of  the 
immediate  past  that  it  would  like  to  preserve. 

As  the  concept  of  social  cohesion  became 
increasingly  central  to  the  EU’s  discourse,  criteria 
and  indicators  were  developed  to  measure  it. 
Known  as  the  Laeken  indicators,  they  deal  mainly 
with  distributive  variables  (employment,  income, 
access  to  public  services).  In  this  way,  the  notion  of 
social  cohesion,  a  normative  framework,  became 
operational  and,  therefore,  a  target  for  public 
policies  intended  to  influence  the  indicators.  In 
sum,  the  concept  of  social  cohesion  is  associated 
with  a  specific  political  context  that  evokes  what  is 
regarded  as  a  desirable  state  of  affairs  relative  to 
the  status  quo  ante. 

This  use  of  the  concept  of  social  cohesion  has 
tended  to  be  disseminated  in  recent  years  among 
international  agencies.  To  do  so,  they  have  sepa¬ 
rated  the  concept  from  the  concrete  European 
context  in  which  it  originated,  producing  an 
abstract  and  ahistorical  concept  of  social  cohe¬ 
sion.  For  example,  according  to  the  Economic 
Commission  for  Latin  America  and  the  Caribbean 
(ECLAC),  social  cohesion  is  “the  dialectic  [rela¬ 
tionship]  between  instituted  social  inclusion  and 
exclusion  mechanisms  and  the  responses,  percep¬ 
tions  and  attitudes  of  citizens  towards  the  way 
these  mechanisms  operate”  (2007,  p.  18)  and 
implies  a  previous  theory  and  an  empirical  analy¬ 
sis  of  what  is  meant  by  “citizens”  and  “inclusion/ 
exclusion  mechanisms”  in  the  Latin  American 
context. 
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The  included-excluded  dichotomy  produces  a 
preconceived  vision  of  society  that  fails  to  take 
into  account  the  different  and  complex  processes 
differentiating  society  that  are  far  broader  than  a 
single  cleavage.  Of  course,  limited  access  to  social 
services,  income,  and  opportunities  in  the  labor 
market  is  frequently  the  cornerstone  in  the  con¬ 
struction  of  a  sense  of  social  exclusion.  But  con¬ 
crete  situations  are  mostly  in  the  gray  areas,  and 
most  citizens  feel  simultaneously  included  and 
excluded. 

Analysis  of  social  cohesion  can  neither  be  dis¬ 
sociated  from  the  category  of  relative  deprivation 
(e.g.,  the  expectations  and  feelings  of  inclusion  or 
exclusion  of  a  recent  migrant  originally  from  a 
rural  setting  are  not  the  same  as  those  of  a  genera¬ 
tion  born  and  raised  in  the  city),  nor  can  it  over¬ 
look  the  fact  that  feelings  of  exclusion,  frustration, 
and  social  anomie  may  be  even  stronger  among 
sectors  with  a  better  state  of  social  well-being.  In 
general,  analyses  of  social  cohesion  guided  by  the 
included-excluded  dichotomy  tend  to  consider 
economic  mechanisms  (usually  employment  and 
social  policy)  as  the  main,  or  even  the  only,  factor 
of  integration,  against  which  individual  leanings 
are  juxtaposed.  This  vision  does  not  consider  other 
social  processes  of  solidarity  formation  that  are 
extremely  important  to  preserve  social  cohesion, 
such  as  religion  or  the  mass  media. 

Societies  whose  cohesion  revolves  around  egali¬ 
tarian  values  can  intensify  feelings  of  exclusion 
among  individuals  and  groups  that  would  be 
considered  acceptable  in  other  contexts.  In  certain 
cases,  greater  economic  inclusion  can  increase  feel¬ 
ings  of  symbolic  and  political  exclusion,  and 
conversely,  greater  symbolic  inclusion  can  inten¬ 
sify  feelings  of  economic  exclusion.  In  sum,  the 
objective  and  subjective  dimensions  of  inclusion/ 
exclusion  are  complex  and  require  theoretical  and 
empirical  analyses  sensitive  to  the  historical  forma¬ 
tion  of  values  systems  in  each  society. 

The  Analytical  Concept  of  Social  Cohesion 

As  social  theory  teaches  us,  all  societies  generate 
some  form  of  cohesion.  Otherwise,  they  would  not 
exist.  But  the  mechanisms  for  maintaining  social 
cohesion  vary  according  to  the  historical  forma¬ 
tion  of  each  society.  In  complex  societies,  this  is 
reflected  in  the  existence  of  universes  of  beliefs  and 


values  shared,  to  varying  degrees,  by  all  members 
of  society  and  by  systems  of  authority  rooted  in 
consensual  norms  and  coercive  systems  that  ensure 
the  functioning  of  the  established  order.  Mechan¬ 
isms  of  social  disintegration  are  also  many  and 
varied.  They  may  be  the  product  of  exclusion, 
anomic  violence,  or  authoritarian  ideologies.  Social 
cohesion  in  modern  times  cannot  be  dissociated 
from  social  change  and  from  social  conflict. 
Modern  societies  are  in  constant  mutation,  which 
means  that  some  forms  of  sociability  are  always 
disintegrating  and  giving  way  to  new  mechanisms 
for  integration  in  which  citizen  participation  and 
demands  play  a  critical  role. 

If  every  society  has,  by  definition,  some  form  of 
social  cohesion,  then  what  is  at  stake  from  an  ana¬ 
lytical  standpoint  is  the  understanding  of  the 
mechanisms  by  which  social  cohesion  is  produced 
and  absorbs  the  processes  of  change  and  social 
conflict,  as  well  as  the  mechanisms  through  which 
these  are  expressed  and  resolved.  This  means 
expanding  the  horizon  of  analytical  and  normative 
social  cohesion  beyond  (but  inclusive  of)  public 
policy  to  encompass  the  functioning  of  political 
and  cultural  systems.  Social  cohesion  has  not  only 
to  do  with  the  most  adequate  or  effective  public 
policies,  but  it  also  requires  us  to  ask  ourselves 
about  the  mechanisms  for  the  symbolic  and  politi¬ 
cal  mobilization  of  citizens,  which  are  a  prerequi¬ 
site  for  the  possibility  (or  impossibility)  of  public 
policies  and  state  reforms. 

In  this  view,  the  concept  of  social  cohesion 
requires  more  sensitivity  to,  and  the  effective  inclu¬ 
sion  of,  social,  political,  and  cultural  issues,  an 
interdisciplinary  dialogue  that  incorporates  the 
required  contributions  of  the  various  disciplines  in 
the  social  sciences.  This  means  bringing  together 
economists,  sociologists,  political  scientists,  anthro¬ 
pologists,  and  historians  in  an  effort  to  discover  the 
many  nuances  evoked  by  the  concept  of  social 
cohesion  (societies  that  value  democracy  and  equity 
and  transmit  a  sense  of  belonging  and  dignity  to 
their  citizens).  Broadly  speaking,  cohesion  is  not  so 
much  a  matter  of  developing  a  theory  of  social 
cohesion  as  of  placing  this  concept  at  the  service  of 
a  multidisciplinary  examination  of  the  social  pro¬ 
cesses  under  way  in  contemporary  societies. 

The  challenge,  in  particular,  is  to  make  sure  that 
sociocultural  dynamics  are  not  overlooked  in  the 
reports  of  governmental  and  international  agencies. 
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One  of  the  reasons  for  this  absence  is  that  when 
handled  in  an  intellectually  responsible  manner, 
sociocultural  dynamics  require  a  sensitivity  to, 
and  acknowledgment  of,  the  diversity  of  national 
values  and  histories  in  which  values  acquire 
specific  symbolic  meanings  that  are  hard  to  quan¬ 
tify  and  generalize.  This  ultimately  conspires 
against  the  analyses  developed  by  international 
organizations  whose  vocation  is  to  come  up  with 
generalizable  and  quantifiable  solutions,  some¬ 
times  at  the  expense  of  the  complex  webs  and  the 
particularities  of  national  histories.  This  also  means 
accepting  social  conflict  as  a  legitimate  and  critical 
component  of  the  construction/transformation 
of  mechanisms  for  social  cohesion  in  democratic 
societies. 
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Social  Democracy 


Social  democracy  (sometimes  used  synonymously 
with  democratic  socialism )  refers  to  a  political  ten¬ 
dency  resting  on  three  fundamental  features: 
(1)  democracy  (e.g.,  equal  rights  to  vote  and  form 
parties),  (2)  an  economy  partly  regulated  by  the  state 
(e.g.,  through  Keynesianism),  and  (3)  a  welfare  state 
offering  social  support  to  those  in  need  (e.g.,  equal 
rights  to  education,  health  service,  employment,  and 
pensions).  Very  often,  socialist  democratic  parties 
are  members  of  the  “Socialist  International,”  an 
organization  of  social-democratic,  socialist,  and 
labor  parties  that  pursues  mutual  cooperation.  This 
entry  discusses  social  democracy’s  roots  and  its  fun¬ 
damental  characteristics  and  modern  development. 

Origins 

Inspired  by  the  19th-century  German  economist 
and  sociologist  Karl  Marx  (1818-1883),  social 
democrats  traditionally  deemed  total  equality  the 
final  goal  of  efforts  to  remedy  the  plight  of  the 
working  class.  They  regarded  the  capitalist  eco¬ 
nomic  system  as  the  cause  behind  society’s  ills. 
With  close  ties  to  the  then  budding  labor  move¬ 
ment,  social-democratic  political  parties  began, 
toward  the  end  of  the  19th  century,  to  form  and 
organize  themselves.  A  key  demand  was  public 
ownership  of  the  means  of  production  as  a  way  to 
transform  society  into  a  socialist  economic  system, 
thus  ending  the  exploitation  of  the  working  class 
by  the  bourgeoisie  and  creating  true  democracy. 
Very  soon,  different  groups  began  to  debate  the 
means  by  which  to  reach  the  end.  Could  and 
should  what  was  regarded  as  capitalism’s  inner 
contradictions  be  passively  allowed  to  develop,  or 
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should  the  final  collapse  be  speeded  up  by  resort¬ 
ing  to  violent  revolution?  Others  instead  suggested 
“revisionist,”  peaceful,  and  gradual  reforms  of  the 
system  launched  through  liberal  democracy  via 
political  parties  in  parliament,  including  coopera¬ 
tion  between  social  classes.  Soon,  the  nature  of  the 
end  itself  became  a  bone  of  contention.  Could  and 
should  the  entire  capitalist  economic  system  really 
be  thrown  overboard?  Or  rather,  were  reforms  of 
the  economic  system  preferable,  so  as  to  sustain  its 
ability  to  create  wealth?  Were  not  welfare  state 
measures  sufficient  to  alleviate  the  system’s  per¬ 
ceived  unfair  wealth  distribution  effects?  Led  by 
the  German  socialist  Eduard  Bernstein  (1850— 
1932),  the  revisionist  camp  evolved  into  the  social- 
democratic  movement.  The  other  camp  opted  for 
more  revolutionary,  far-reaching  solutions  includ¬ 
ing  “dictatorship  by  the  proletariat,”  culminating 
in  the  Russian  Revolution  of  1917  and  the  first 
communist  state.  Subsequently,  most  social- 
democratic  parties  began  reorienting  themselves 
from  being  solely  the  political  branch  of  the  work¬ 
ing  class  to  becoming  “catchall”  parties. 

The  issues  of  how  extensive  the  transformation 
of  society  should  be  and  by  what  means  it  should 
be  achieved  did  not  end  with  the  social  democrats’ 
break  with  the  Communists.  Applying  the  com¬ 
monly  used  “left-right”  ideological  continuum  to 
two  analytical  aspects,  “public  ownership”  and 
“social  policy,”  many  researchers  would  certainly 
place  social  democracy  slightly  left  of  center  regard¬ 
ing  public  ownership  and  perhaps  a  little  further  to 
the  left  on  social  policy.  However,  consensus  is  less 
solid  when  “left”  and  “right”  are  to  be  defined. 
Normally,  left  would  signify  more  public  enter¬ 
prises  and  more  dirigiste  state  involvement  in  the 
economy,  but  the  actual  mix  of  socialist  and  capi¬ 
talist  policies  differs  considerably  between  parties 
and  between  different  time  periods.  Actual  owner¬ 
ship  of  the  means  of  production  has  seldom  formed 
a  vital  part  of  policies.  As  to  the  social  policy 
dimension,  the  left  would  stand  for  a  more  exten¬ 
sive  welfare  state  in  terms  of  covering  more  people 
with  more  generous  support  at  higher  cost  and 
with  fewer  if  any  market-emulating  characteristics. 

Postwar  Developments  and  the  “Third  Way” 

Social  democracy  evolved  after  World  War  II  into 
a  political  ideology  that  focused  on  working  inside 


the  overall  framework  of  a  mixed-market  economy 
while  trying  to  protect  and  make  life  decent  for  all 
those  who  were  economically  the  most  vulnerable. 
Although  often  critical  of  aspects  of  the  market 
economy,  social  democracy  accepted  its  fundamen¬ 
tals.  The  exact  policies  were  always  a  balancing  act 
between  “planning,”  “steering,”  and  “control”  on 
the  one  hand  and  “efficiency,”  “profit,”  and 
“competitiveness”  on  the  other.  Rather  than  put¬ 
ting  the  economy  in  the  hands  of  politicians,  most 
parties  chose  the  welfare  state  as  the  primary  step 
to  “decommodifying”  society,  although  few  would 
argue  for  complete  decommodification.  The  status 
of  equality  is  a  moot  point.  Originally,  most  social 
democrats  defined  equality  as  a  situation  in  which 
most  people  enjoyed  roughly  the  same  standard  of 
living.  To  attain  this  condition,  redistribution  from 
the  haves  to  the  have-nots  would  be  necessary. 
Later,  the  meaning  underwent  a  shift  to  signify 
equality  of  opportunity  rather  than  of  outcome, 
thus  lessening  the  emphasis  on  redistribution 
through  transferring  resources  from  one  group  to 
another.  Although  the  focus  was  no  longer  specifi¬ 
cally  on  retribution,  measures  were  advocated  to 
ensure  that  no  one  fell  below  a  decent  or  minimally 
acceptable  standard  of  living. 

Once  there,  the  welfare  state  had  to  be  pro¬ 
tected.  Generating  fresh  resources  for  the  welfare 
state  became  imperative.  The  parties  increasingly 
came  to  rely  on  the  efficiency  of  the  market  econ¬ 
omy  to  create  those  resources.  Gradually,  the  idea 
of  abolishing  the  market  economy  was  pushed  into 
the  background.  Indeed  the  mixed-market  econ¬ 
omy  became  core  social-democratic  ideology — 
with  public  policies,  for  example,  through  so-called 
Keynesian  countercyclical  policies  and  through 
active  labor  market  policies  and  sometimes  through 
corporatist  solutions,  to  help  the  economy  function 
even  better  than  if  left  on  its  own. 

However,  occasionally,  parts  of  social  democ¬ 
racy  have  contemplated  further  steps  toward  a 
more  socialist  society.  Yet  any  concrete  plans 
about  real  planning  institutions  have  only  rarely 
emerged.  A  peak  of  ideological  revival  occurred 
from  around  1970  to  1983,  when  some  groups  in 
some  parties  began  questioning  whether  the  wel¬ 
fare  state  alone  would  suffice  to  create  the  desired 
society.  The  ownership  issue  reemerged  with  force 
as  an  internal  issue  in  many  parties.  Some — for 
example,  certain  groups  in  the  British  Labour 
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Party — advocated  the  return  to  older  policies  in 
which  state  nationalizations  of  major,  if  not  all, 
businesses  would  take  center  stage.  Others — for 
example,  in  the  German  and  the  Swedish  Social 
Democratic  Parties — suggested  extended  union 
influence  over  business  decision  making  as  the 
way  forward.  Perhaps  the  most  innovative  and 
controversial  of  the  new  policies  were  plans  for 
“economic  democracy”  through  so-called  wage 
earner  funds  put  forward  by  sections  of  the 
Swedish  social  democracy.  In  their  original,  blue- 
collar  union  draft  proposals,  the  funds  were 
designed  to  tax  major  parts  of  businesses  in  order 
to  create  funds  that  would  soon  be  able  to  take 
economic  control  of  private  companies.  However, 
the  social-democratic  leadership  was  far  less 
enthusiastic  about  any  takeover  plans  and  consid¬ 
erably  watered  down  the  final  version  that  was 
adopted  in  1983.  A  number  of  nationwide  funds 
were  set  up  much  like  pension  funds.  A  special 
business  tax  tunneled  off  resources  to  the  funds 
for  buying  shares  on  the  regular — private — stock 
market  with  an  overall  ownership  cap  set  at  about 
10%.  When  a  nonsocialist  government  was 
formed  in  1991,  the  funds  were  abolished  and 
have  not  since  reappeared  in  social-democratic 
policy  making. 

No  doubt  social  democracy  has  been  a  very  suc¬ 
cessful  ideology.  Some  would  even  stress  that  after 
World  War  II  many  Western  societies  became  more 
or  less  permeated  by  social-democratic  ideas. 
Although  support  varies,  social  democracy  has 
enjoyed  remarkably  stable  parliamentary  strength 
over  the  decades.  This  is  a  movement  optimistic 
about  the  possibilities  of  modernizing  society 
through  technological  advances  and  prepared  to 
use  an  array  of  policies  with  which  to  improve 
people’s  lives  by  correcting  the  market  economy’s 
inequitable  effects.  However,  in  practice,  several 
other  political  tendencies,  such  as  Christian  democ¬ 
racy  and  social  liberalism,  also  propose  far-reaching 
public  sector  measures  including  welfare  states  and 
other  regulations  designed  to  correct  the  adverse 
effects  of  an  unregulated  market  economy.  These 
parties  strive  to  establish  socially  beneficial  societies 
too.  Still,  even  though  similarities  with  Christian 
democracy  and  social  liberalism  are  strong,  many 
researchers  would  agree  that  social  democracy  is 
more  ambitious  in  its  quest  to  eradicate  unfairness 
rather  than  just  alleviating  the  effects  of  social 


injustices,  although  the  differences  may  not  always 
be  easy  to  pinpoint. 

Toward  the  turn  of  the  millennium,  many 
social-democratic  parties  began  to  feel  concern 
about  some  perceived  tendencies,  all  of  which  basi¬ 
cally  had  to  do  with  the  financial  burden  of  the 
welfare  state.  It  was  always  evident  that  the  expan¬ 
sion  and  even  maintenance  of  welfare  states  would 
be  expensive.  Therefore,  public  incomes  were  of 
the  essence.  Taxes  were  essential  not  only  as  a 
means  to  narrowing  down  real-income  gaps 
between  the  different  strata  in  society  but  also  as  a 
means  to  create  budget  incomes.  A  growing  tax 
base  was  deemed  equally  important  to  support 
public  sector  budgets.  As  the  welfare  states  grew, 
further  expansions  would  entail  not  only  higher 
taxes  but  also  heavier  reliance  on  ever-growing 
economies.  Even  though  many  voters  have  accepted 
high  taxes  in  countries  where  welfare  state  mea¬ 
sures  have  been  universally  enjoyed,  some  resent¬ 
ment  toward  high  marginal  taxes  was  beginning  to 
be  felt  during  the  1980s  and  1990s.  Many  parties 
began  debating  whether  taxes  had  perhaps  reached 
their  ceiling.  If  so,  welfare  state  programs  would 
have  to  rely  even  more  on  a  growing  tax  base  and, 
thus,  on  an  ever  more  efficient  market  economy. 
Whether  real  or  not,  the  perceived  effects  of  eco¬ 
nomic  globalization  and  the  ensuing  constant 
structural  changes  at  the  same  time  caused  many 
parties  to  reconsider  how  far  the  market  economy 
could  in  fact  be  regulated  without  negative  effi¬ 
ciency  consequences.  Simultaneously,  mounting 
criticism  of  welfare  state  bureaucracy,  planning 
inertia,  and  lack  of  citizenship  choice  led  many 
parties  to  try  new  policies,  including  lower  taxes, 
deregulation  of  financial  markets,  and  various 
public-private  combinations  including  elements  of 
competition  inside  the  public  sector. 

Soon,  social  democracy  was  criticized  for  being 
“neoliberal,”  unacceptably  diluting  true  social- 
democratic  ideals,  and  finally  closing  the  door  to 
more  genuine  reforms  of  capitalism.  Others  fer¬ 
vently  defended  new  policies  as  a  “Third  Way” 
between  socialism  and  capitalism.  It  has  been  sug¬ 
gested  that  “equality”  was  replaced  by  “fairness,” 
“collective  rights”  by  “individual  rights,”  “redistri¬ 
bution”  by  “individual  chances,”  and  “the  state  as 
a  provider”  by  “the  state  as  enabler.”  How  drastic 
the  changes  have  actually  been  is  a  recurring  aca¬ 
demic  theme.  No  doubt  a  change  of  rhetoric  has 
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taken  place,  but  there  is  a  case  to  be  made  for 
substantial  policy  continuity  with  the  past.  Aiding 
the  market  and  enabling  citizens  to  overcome 
changes  were  vital  parts  of  the  golden  age  (roughly 
1945-1980)  of  social  democracy.  Socialism 
“today”  was  never  on  the  agenda.  Efficiency,  full 
employment,  and  an  element  of  equality  took  cen¬ 
ter  stage.  Sometimes  policies  such  as  these  were 
described  as  corresponding  to  a  “planned”  society. 
However,  they  closely  conformed  to  a  mixed- 
market  economy  but — and  this  is  the  key  to  social 
democracy — where  the  economy  is  regarded  as  the 
means  to  a  higher  end,  a  decent  society. 

Jonas  Hinnfors 
University  of  Goteburg 
Goteburg,  Sweden 
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Social  Dominance  Orientation 


Social  dominance  orientation  (SDO)  is  a  concept 
developed  in  social  dominance  theory  (SDT)  to 
describe  an  individual-level  value  structure  that 
prefers  inequality  and  stratified  categorization 
among  social  groups.  Individuals  with  high  SDO 
desire  a  society  marked  by  stratification  and  are 
thus  inclined  to  accept  such  social  structures  as 
warranted.  They  differentiate  between  groups  on 
the  basis  of  superiority  and  subordination,  which, 
in  political  practice,  often  leads  to  attitudes  and 
behavior  that  entail  discrimination  and  segregation. 


Insight  into  the  existence  of  SDO  emerged  from 
growing  interest  in  the  group-based  aspects  of 
human  life.  SDT,  of  which  orientation  forms  an 
integral  and  crucial  part,  was  first  developed  by 
Felicia  Pratto  and  Jim  Sidanius  in  the  early  1990s 
and  since  then  has  been  frequently  used  in  psychol¬ 
ogy  and  social  psychology  research,  mostly  in 
journals  focusing  on  personality  or  experimental 
psychology.  As  issues  of  multiculturalism,  integra¬ 
tion,  and  assimilation  due  to  increasing  migration 
and  societal  diversity  have  risen  on  the  political 
science  agenda,  the  SDO  concept  has  began  to 
make  its  way  into  parts  of  the  discipline.  This  entry 
defines  SDO  in  greater  detail,  including  how  it  is 
measured  and  what  is  known  of  its  distributional 
pattern.  The  important  matter  of  how  SDO  relates 
to  authoritarianism  will  also  be  touched  on. 
Finally,  some  political  and  practical  implications 
of  SDO  are  introduced. 

SDO  is  defined  as  a  personality  trait  that  pre¬ 
dicts  certain  attitudes,  a  basic  structure  that  in 
turn  translates  into  several  more  concrete  percep¬ 
tions.  In  research  into  values  and  culture,  “orien¬ 
tation”  refers  to  basic  cognitive  structures  and 
beliefs,  of  which  we  as  individuals  possess  rela¬ 
tively  few.  Orientations  tend  to  be  stable,  formed 
in  a  process  of  early-childhood,  adolescent,  and 
young-adulthood  socialization  and  experience. 
Attitudes,  in  contrast,  are  more  numerous  and  mal¬ 
leable  and  emanate  from  orientations.  Individuals 
with  SDO  are  characterized  by  strong  perceptions 
of  their  own  ingroups  as  hierarchically  and  justifi¬ 
ably  superior  to  other  groups,  most  prominently  so 
in  relation  to  immigrants,  people  of  other  ethnici¬ 
ties,  and  the  opposite  sex.  SDO  has  a  number  of 
attitudinal  consequences,  all  of  them  favoring  hier¬ 
archy  over  equality.  Concrete  expressions  of  SDO 
include  ethnic  prejudice,  nationalism,  cultural  elit¬ 
ism,  sexism,  and  a  belief  in  meritocracy  (i.e.,  people 
with  superior  competence  or  skill  should  occupy 
favored  positions  in  society).  SDO  has  been  claimed 
to  be  a  personality  variable  on  par  with  what  are 
known  in  personality  psychology  as  the  “Big  Five” 
traits:  extraversion,  agreeableness,  openness,  neu- 
roticism,  and  conscientiousness. 

The  SDO  concept  has  developed  in  the  interface 
between  personality  and  social  psychology,  and 
experimental  studies  have  demonstrated  that  it  can 
successfully  predict  prejudice.  Through  measuring 
SDO,  one  can  differentiate  individuals  based  on 
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their  level  of  SDO  inclination.  The  South  African 
nationalists,  who  introduced  apartheid  and  openly 
defended  it  with  arguments  of  hierarchy  and  supe¬ 
riority,  exemplify  SDO.  Anti-Black  racism  in  the 
United  States  is  yet  another  well-known  example 
of  the  orientation,  as  is  India’s  caste  system,  as  it  is 
a  social  order  based  on  human  hierarchy. 

Authoritarianism  has  been  related  to  socializing 
experiences  in  childhood — for  example,  dominat¬ 
ing  parents — which  in  adulthood  are  translated 
into  an  individual  desire  to  dominate  others.  SDO 
is  not  believed  to  originate  from  formative  experi¬ 
ences  but  from  a  combination  of  socialization  and 
personal  temperament. 

SDO  is  measured  using  a  scale  developed  by 
Felicia  Pratto,  Jim  Sidanius,  Lisa  Stallworth,  and 
Bertram  Malle  (1994).  The  various  items  consti¬ 
tuting  the  scale  all  concern  perceptions  of  hierar¬ 
chy  (e.g.,  “Some  people  are  just  more  worthy  than 
others”  or  “It  is  not  a  problem  if  some  people  have 
more  of  a  chance  in  life  than  others”)  as  opposed 
to  equality  (e.g.,  “If  people  were  treated  more 
equally,  we  would  have  fewer  problems  in  this 
country”).  The  original  seven-step  scale  runs  from 
“very  negative”  to  “very  positive.” 

SDO  and  Right-Wing 
Authoritarianism  (RWA) 

SDO  is  often  studied  and  analyzed  in  relation  to 
another  neighboring,  yet  different,  individual  value 
structure,  right-wing  authoritarianism  (RWA). 
Both  SDO  and  RWA  predict  prejudiced  action  that 
is  discriminatory  and  could  be  perceived  as  intoler¬ 
ant,  for  example,  supporting  the  persecution  of 
immigrants  and  other  “outgroups.”  However,  the 
RWA  scale,  which  grew  out  of  the  research  of 
Theodor  Adorno,  Else  Frenkel-Brunswik,  Daniel 
Levinson,  and  Nevitt  Sanford  (1950)  into  the 
authoritarian  personality  (measured  using  a  scale 
originally  developed  by  Robert  Altemeyer),  is 
driven  by  conformity  bias  (as  opposed  to  auton¬ 
omy),  submissiveness  to  authority  figures,  ethno- 
centricity,  and  aggressiveness  toward  outgroups. 
SDO,  in  contrast,  is  motivated  by  concern  for 
social  hierarchy  between  groups  (as  opposed  to 
egalitarianism).  RWA  has  been  demonstrated  to  be 
preoccupied  with  relationships  between  the  indi¬ 
vidual  and  the  group  (i.e.,  intragroup  relation¬ 
ships),  whereas  SDO  is  concerned  mostly  with 


relationships  between  groups  (i.e.,  intergroup  rela¬ 
tionships).  Individuals  guided  by  RWA  support 
assimilationist  practices  for  immigrants  and  mar¬ 
ginal  groups,  which  lessens  these  outgroups’  poten¬ 
tial  to  introduce  change.  For  people  with  high 
SDO,  assimilationist  practices  are  instead  viewed 
as  threatening,  because  they  conflict  with  and  even 
blur  social  boundaries.  There  is  a  distinct  difference 
between  the  more  individual-oriented  RWA  and 
SDO,  which  focuses  on  the  collective,  group  level. 
SDO  and  RWA  are  thus  complementary  but  only 
partly  overlapping  in  predicting  prejudice.  RWA  is 
related  to  uncertainty  reduction  in  various  respects: 
High-RWA  people  tend  to  be  religious  and  to 
desire  order,  structure,  security,  and  tradition — 
desires  that  generally  do  not  characterize  those 
with  high  SDO.  It  has  been  argued  that  SDO  rep¬ 
resents  a  more  modern  kind  of  prejudice  than  the 
conformist  and  traditionalist  RWA.  Differentiating 
between  SDO  and  RWA,  as  done,  for  example,  by 
Sam  McFarland  and  Robert  Altemeyer,  has 
advanced  and  fine-tuned  our  understanding  of 
individual  preference  and  value  structures.  Because 
of  their  similarity  at  an  aggregate  behavioral  level, 
as  evident,  for  example,  in  anti-immigration  rheto¬ 
ric  and  prejudiced  perceptions  of  groups  such  as 
Jews,  homosexuals,  African  Americans,  and 
women,  it  is  easy  to  miss  their  different  bases  of 
such  sentiments  and  policy  preferences. 

Gender  and  SDO 

SDO  is  claimed  to  have  evolutionary  roots.  Most 
research  has  found  systematic  variation  between 
the  sexes  in  the  level  of  SDO  and  that  men,  almost 
regardless  of  cultural  context,  have  higher  SDO 
than  women.  A  few  studies  of  homosexual  men 
have  concluded  that  they  have  lower  SDO  than 
heterosexual  men  or  women.  Although  absolute 
levels  of  SDO  vary  among  cultural  settings,  due  to 
differences  in  distributional  policy  and  economic 
system  between,  for  example,  the  United  States, 
India,  Sweden,  or  Ukraine,  men  always  have  rela¬ 
tively  higher  SDO  than  do  women.  In  SDT,  this  is 
called  the  “invariance”  hypothesis,  explained  by 
the  different  reproductive  patterns  of  the  sexes, 
which  leads  women  to  be  more  selective  in  choos¬ 
ing  mating  partners  to  ensure  reproductive  success 
(defined  as  raising  a  sexually  mature  second  gen¬ 
eration).  Consequently,  females  choose  higher 
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status  men,  making  men  competitively  involved  in 
a  status  struggle  for  the  more  selective  women,  a 
struggle,  a  desire,  and  even  a  need  to  dominate 
other  men  and  groups  of  men  that  could  threaten 
their  access  to  fertile  women.  As  a  behavioral  con¬ 
sequence  of  their  higher  SDO,  in  professional  con¬ 
texts,  men  are  more  often  found  in  hierarchical 
social  milieus,  such  as  the  military,  and  politically 
are  more  inclined  to  support  intolerant  political 
parties.  The  cross-cultural  similarities  in  SDO  dis¬ 
tribution  between  the  sexes  have  thus  been 
explained  by  biological  predispositions  in  mating 
selection.  This  gender  bias  in  SDO  allows  an 
analysis  in  which  worldwide  male  dominance  in 
society  as  well  as  in  politics  and  economics  could 
have  some  evolutionary  explanations. 

A  second  major  hypothesis  in  SDT  is  called  the 
“interaction”  hypothesis,  predicting  that  among 
“arbitrary  set”  groups  (e.g.,  class  and  ethnicity), 
SDO  levels  will  vary  between  the  groups  depend¬ 
ing  on  the  hierarchical  structure  of  a  society.  In 
low-hierarchy  cultures,  the  level  of  SDO  will  be 
more  similar  throughout  a  range  of  groups,  while 
the  differences  will  be  much  greater  in  highly  hier¬ 
archical  societies.  Evidence  has  been  found  to  sup¬ 
port  both  the  interaction  and  the  invariance 
hypotheses. 

Implications 

Research  has  identified  important  behavioral 
implications  of  SDO.  These  concern  the  roles  of 
both  the  private  person,  who  may  be  highly  preju¬ 
diced  and  aggressive  toward  outgroups,  and  the 
citizen,  who  may  vote  for  anti-immigration,  xeno¬ 
phobic,  and  intolerant  parties.  Individuals  with 
high  SDO  are  inclined  to  move  into  jobs  and  insti¬ 
tutions  that  maintain  or  increase  social  hierarchy, 
and  they  prefer  policies  that  support  social  separa¬ 
tion  rather  than  integration  or  assimilation. 

Are  there  ways  to  modify  SDO  at  the  aggregate 
or  individual  level?  Knowing  that  societies  with 
more  equal  relationships  display  more  similar  lev¬ 
els  of  SDO  between  high-  and  low-status  groups, 
we  could  single  out  redistributive  welfare  policies 
as  one  such  way.  Since  SDO  is  focused  on  group 
relationships,  relaxing  group  rhetoric  and  senti¬ 
ment  and  instead  emphasizing  individual  aspects 
has  been  proven  in  experimental  research  to  have 
some  beneficial  effects.  Similarly,  empathizing 


with  members  of  the  outgroup  could  reduce  high 
levels  of  SDO  too. 

Li  Bennicb-Bjorkman 
Uppsala  University 
Uppsala,  Sweden 
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Social  Exclusion 


Distinct  from,  and  typically  broader  than,  the  tra¬ 
ditional  concept  of  poverty,  social  exclusion  is  a 
contested,  broad  concept  that  can  take  very  differ¬ 
ent  meanings  from  one  policy  setting  to  another. 
A  central  aspect  of  most  definitions  of  social 
exclusion  available  in  the  literature  is  the  lack  of 
adequate  work  opportunities,  which  can  lead  to 
unemployment  or  underemployment.  But  reduc¬ 
ing  social  exclusion  to  employment-related  issues 
would  be  a  mistake,  as  the  concept  refers  to  issues 
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ranging  from  gender  and  ethnic  discrimination  to 
housing  problems  and  limited  access  to  health  and 
social  care,  among  other  things.  Another  central 
aspect  of  social  exclusion  is  its  complexity. 
According  to  Hilary  Silver  and  S.  M.  Miller 
(2003),  for  example,  social  exclusion  appears  as 

(1)  multi-dimensional  [italics  added]  (...); 

(2)  dynamic,  [italics  added]  denoting  a  process, 
movement  or  trajectory  from  full  integration  to  a 
condition  of  multiple  exclusions;  (3)  relational, 
[italics  added]  in  that  exclusion  entails  social 
distance  or  isolation,  rejection,  humiliation,  lack  of 
social  support  networks  and  denial  of  participation; 

(4)  active,  [italics  added]  in  that  people  and 
processes  do  the  excluding;  and  (5)  relative  [italics 
added]  to  context,  (p.  8) 

Although  analytical  remarks  such  as  this  have 
done  much  to  clarify  the  meaning  of  this  concept, 
social  exclusion  remains  contested,  and  some 
social  policy  scholars  reject  it  altogether,  claiming 
that  it  is  overly  vague  or,  even  worse,  misleading. 
Consequently,  there  is  no  consensus  in  the  social 
science  literature  about  the  analytical  usefulness  of 
this  concept.  This  entry  elaborates  the  history  and 
meaning  of  this  concept  and  examines  its  role  as  a 
central  idea  in  contemporary  social  policy. 

History  and  Meaning  of  the  Concept 

The  French  writer  Jean  Klanfer  published  a  book 
(L’Exclusion  Sociale:  Etude  de  la  Marginalite  dans 
les  Societes  Occidentales  [Social  Exclusion:  The 
Study  of  Marginality  in  Western  Societies])  in  1965 
that  defined  social  exclusion  in  terms  of  irrespon¬ 
sible  behavior  and  economic  marginalization.  In 
the  mid-1970s,  another  French  writer,  Rene  Lenoir, 
helped  popularize  the  concept  of  social  exclusion 
in  his  country.  According  to  him,  social  exclusion 
was  the  product  of  factors  such  as  poverty  and  dis¬ 
ability,  which  prevented  some  citizens  from  fully 
participating  in  the  French  economy  and  society. 
However,  in  France,  it  is  only  during  the  1980s 
and  early  1990s  that  social  exclusion  became  a 
major  policy  idea.  A  number  of  issues  contributed 
to  this  situation.  First,  long-term  unemployment 
became  a  permanent  feature  of  the  French  eco¬ 
nomic  landscape.  Second,  the  debate  on  immigra¬ 
tion  and  the  status  of  ethnic  minorities  increased 


interest  in  the  concept  of  social  exclusion,  which 
can  refer  to  the  negative  effects  of  racism  and 
discrimination.  From  a  political  standpoint,  how¬ 
ever,  social  exclusion  became  a  major  term  in 
French  public  discourse  partly  because  centrist 
politicians  referred  to  it  to  reinforce  their  reputa¬ 
tion  as  compassionate  actors  who  care  about  the 
fate  of  the  nation’s  less  privileged  citizens.  In 
general,  fighting  social  exclusion  is  a  source  of 
legitimacy  for  the  state  and  for  political  actors 
who  seek  to  fight  what  is  widely  perceived  as  a 
social  evil. 

Beyond  France,  the  concept  of  social  exclusion 
has  become  a  key  policy  idea,  even  if  the  emer¬ 
gence  of  this  concept  at  the  forefront  of  interna¬ 
tional  policy  debates  is  relatively  recent,  as  it  was 
only  during  the  1990s  and  2000s  that  it  became  a 
permanent  feature  of  international  debates  about 
social  inequality  and  the  future  of  the  welfare 
state. 

Since  the  1990s,  the  concept  of  social  exclusion 
has  become  increasingly  popular  across  Europe. 
For  instance,  in  the  United  Kingdom  (UK),  after 
taking  office  in  1997,  New  Labour  Prime  Minister 
Tony  Blair  depicted  himself  as  a  crusader  against 
social  exclusion.  A  major  symbol  of  this  emphasis 
on  social  exclusion  was  the  creation  of  a  state  unit 
devoted  to  this  issue  in  late  1997.  As  in  France, 
although  a  powerful  rhetoric  surrounded  social 
exclusion,  critics  noted  that  the  resources  allocated 
to  fight  this  set  of  social  ills  remained  quite  modest 
in  scope.  From  a  political  standpoint,  the  rhetoric 
about  social  exclusion  helped  the  Blair  government 
reinforce  its  social  policy  credentials  at  the  same 
time  as  it  was  pursuing  a  broad  liberal  economic 
agenda.  Importantly,  however,  significant  resources 
have  been  allocated  to  the  struggle  against  social 
exclusion,  which  has  become  a  legitimate  public 
policy  concept,  both  in  government  and  in  aca¬ 
demia.  In  comparative  terms,  the  political  dis¬ 
course  around  social  exclusion  in  the  UK  was 
based  more  on  moral  and  behavioral  issues  than  in 
France;  however,  in  both  cases,  social  exclusion 
has  become  an  enduring  aspect  of  policy  debates  in 
a  way  that  makes  it  difficult  to  reduce  it  to  a  short¬ 
term  political  fad. 

Beyond  France  and  the  UK,  social  exclusion  has 
become  increasingly  influential  in  many  other 
European  countries,  largely  through  the  platforms 
and  initiatives  of  the  European  Union  (EU),  which 
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has  explicitly  embraced  this  concept  as  part  of  an 
attempt  to  increase  its  policy  legitimacy.  Although 
the  EU  discourse  about  social  exclusion  can  be 
traced  back  to  the  late  1980s  and  1990s,  the  con¬ 
cept  only  emerged  as  a  central  EU  policy  concept 
in  2000,  as  the  main  legacy  of  the  European 
Council  conference  held  in  Lisbon.  At  that  meet¬ 
ing,  member  states  agreed  to  transform  the  fight 
against  poverty  and  social  exclusion  into  a  core 
aspect  of  the  EU’s  employment  and  social  policy 
strategy.  Articulated  around  the  idea  of  a 
“European  social  model,”  the  EU  approach  to 
social  exclusion  is  based  on  the  “open  method  of 
coordination,”  which  is  based  on  the  interaction 
between  EU-defined  goals  and  national  policy  ini¬ 
tiatives.  Substantively,  the  EU  stressed  the  central 
role  of  employment  policy  in  the  struggle  against 
social  exclusion.  From  this  perspective,  social 
exclusion  is  closely  related  to  the  quest  for  activa¬ 
tion  and  the  European  Employment  Strategy.  As 
stated  in  a  European  Commission  document  titled 
Fight  Against  Poverty  and  Social  Exclusion  (2000), 
“Employment  is  the  best  safeguard  against  social 
exclusion.  In  order  to  promote  quality  employ¬ 
ment  it  is  necessary  to  develop  employability,  in 
particular  through  policies  that  promote  the  acqui¬ 
sition  of  skills  and  life-long  learning”  (p.  3). 
Additionally,  income  maintenance  programs  such 
as  social  assistance  schemes  and  pension  programs 
are  depicted  as  major  tools  in  the  fight  against 
social  exclusion. 

A  Central  Idea  in  Contemporary 
Social  Policy  in  Europe  and  Beyond 

Since  2000,  social  exclusion  has  remained  a  promi¬ 
nent  aspect  of  EU  policy  debates.  According  to 
Even  Nilssen,  the  popularity  of  this  concept  within 
the  EU  is  related  to  fact  that  it  has  been  defined 
mainly  in  economic  rather  than  in  moral  terms.  By 
focusing  mainly  on  employment  and  economic 
issues,  the  EU  has  largely  reduced  social  policy  to 
labor  market  regulation.  From  this  perspective,  the 
EU  push  to  transform  social  exclusion  into  a  major 
policy  issue  reinforces  its  economic  focus  while  cre¬ 
ating  the  sense  that  the  “social  question”  has 
become  a  political  priority,  which  may  not  effec¬ 
tively  be  the  case.  Overall,  although  social  exclusion 
has  become  a  major  component  of  EU  discourse 
and  policy  guidelines  since  2000,  there  is  no 


consensus  about  the  true  impact  and  the  long-term 
policy  consequences  of  the  various  EU  initiatives 
launched  in  the  name  of  the  struggle  against  social 
exclusion. 

Outside  Europe,  since  the  1990s,  the  concept  of 
social  exclusion  has  been  increasingly  debated  in 
other  parts  of  the  advanced  industrial  world.  This 
is  the  case  in  countries  such  as  Australia  and 
Canada,  for  instance.  Although  social  exclusion  is 
perhaps  not  as  central  a  policy  concept  in  these 
two  countries  as  it  has  become  in  Europe,  it  is  now 
part  of  mainstream  international  policy  debates. 
Considering  this  remark,  one  country  within  the 
advanced  industrial  world  where  social  exclusion 
remains  a  relatively  marginal  policy  idea  is  the 
United  States.  As  critics  have  pointed  out,  this  is 
puzzling  considering  the  comparatively  high  levels 
of  poverty  and  social  inequality  in  U.S.  society. 
Clearly,  the  phenomena  that  are  described  as 
social  exclusion  in  Europe  and  other  parts  of  the 
world  are  widespread  in  the  United  States;  how¬ 
ever,  terms  such  as  inequality  and  poverty  remain 
largely  used  due  to  the  fact  that  social  exclusion 
and  ways  to  deal  with  it  are  explicitly  associated 
with  social  policy  and  redistribution  in  some  sense. 
Thus,  in  the  United  States,  the  concept  of  social 
exclusion  is  best  known  among  students  of  com¬ 
parative  social  policy  and  largely  remains  a  termi¬ 
nology  for  academic  discourse.  A  more  likely 
explanation  for  the  absence  of  a  widespread  social 
exclusion  discourse  in  the  United  States  is  the  fact 
that,  in  contrast  to  the  situation  prevailing  in  the 
EU  and  in  other  parts  of  the  world,  prominent  U.S. 
political  actors  have  rarely  embraced  this  concept 
as  part  of  a  broader  electoral  and/or  policy  strat¬ 
egy.  Only  time  will  tell  if  this  concept  has  a  politi¬ 
cal  future  in  the  United  States. 

Finally,  the  concept  of  social  exclusion  as  for¬ 
mulated  in  Europe  and  other  parts  of  the  advanced 
industrial  world  is  known  and  debated  in  develop¬ 
ing  societies.  Yet,  as  Ruhi  Saith  notes,  the  concept 
of  social  exclusion  cannot  be  transposed  from  the 
wealthiest  to  the  poorest  regions  of  the  planet  with¬ 
out  substantial  analytical  adjustments  being  made. 
For  instance,  because  the  concept  was  first  adopted 
in  a  European  society  characterized  by  a  formal 
labor  market  and  large  welfare  state  programs,  it 
takes  a  different  meaning  when  used  in  the  context 
of  much  poorer  countries,  where  the  informal  sec¬ 
tor  can  be  larger  than  the  formal  one  and  where 
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public  social  programs  do  not  adequately  cover 
very  large  segments  of  the  population.  This  situa¬ 
tion  has  led  scholars,  policy  experts,  and  nongov¬ 
ernmental  organizations  (NGOs)  to  adapt  this 
concept  to  the  realities  of  the  developing  world. 
Thus,  in  the  developing  world,  concerns  over  social 
exclusion  have  taken  the  form  of  poverty  reduction 
programs  sponsored  mainly  by  international  finan¬ 
cial  institutions  and  designed  to  address  the  nega¬ 
tive  social  impacts  of  previous  macroeconomic 
adjustments  as  well  as  to  provide  some  basic  pro¬ 
tection  for  the  most  vulnerable  members  of  society. 
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Social  Movements 


In  a  comprehensive  sense,  die  concept  of  social 
movement  refers  to  (a)  mostly  informal  networks 
of  interaction,  based  on  (b)  shared  beliefs  and 
solidarity,  mobilized  around  (c)  contentious 
themes,  through  (d)  the  frequent  use  of  various 
forms  of  protest.  This  entry  begins  with  a  discus¬ 
sion  of  the  elements  of  this  definition.  Next,  spe¬ 
cific  forms  of  (nonconventional)  participation, 
mobilization,  and  organization  of  social  move¬ 
ments  are  identified.  Their  impact  in  recent 
decades  on  political  systems,  democratization, 
public  policies,  and  even  the  international  sphere 
is  then  highlighted.  Finally,  the  consequences  for 
normative  political  theory  and  deliberative  or 
direct  forms  of  democracy  are  discussed. 

Social  movements  are  constituted  by  networks 
of  informal  relations  between  a  plurality  of  indi¬ 
viduals  and  groups,  which  are  more  or  less  struc¬ 
tured  from  an  organizational  point  of  view.  While 
political  parties  and  pressure  groups  have  rela¬ 
tively  well-defined  organizational  boundaries,  such 
as  card-carrying  members  of  specific  organiza¬ 
tions,  social  movements  are  composed  of  scattered 
and  weakly  connected  networks  of  individuals 
who  feel  they  are  part  of  a  collective  effort.  While 
organizations  that  refer  to  movements  exist,  move¬ 
ments  are  not  organizations  but  rather  networks  of 
relations  between  diverse  actors  that  often  also 
include  organizations  with  formal  structures. 

These  networks  of  relations  are  considered  as 
constituting  a  social  movement  when  their  members 
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share  a  system  of  beliefs  that  nourish  solidarity  and 
collective  awareness.  In  fact,  one  characteristic  of 
movements  is  the  elaboration  of  alternative  world 
visions  and  value  systems  that  differ  from  those 
dominating  at  that  time.  For  this  reason,  move¬ 
ments  have  been  considered  as  protagonists  of 
social  change. 

These  emerging  values  form  the  basis  of  the 
definition  of  conflicts  around  which  actors  mobi¬ 
lize.  In  particular,  from  the  1970s  onward,  new 
social  movements  were  defined  as  actors  in  new 
conflicts.  While  Marxist  analyses  traditionally  pos¬ 
ited  the  centrality  of  capital-labor  conflicts,  the 
transformations  that  followed  World  War  II 
stressed  the  relevance  of  social  criteria — such  as 
gender  or  generation — that  were  not  based  on 
class  position.  Scholars  of  the  new  movements, 
such  as  the  French  sociologist  Alain  Touraine, 
agree  in  highlighting  the  declining  relevance  of  the 
conflict  between  capital  and  labor.  As  Alberto 
Melucci  has  observed,  in  complex  societies,  which 
require  increasing  integration  and  extend  their 
control  even  over  the  motivations  for  action,  new 
social  movements  (e.g.,  ecological,  feminist)  try  to 
oppose  the  penetration  of  the  state  and  the  market 
into  social  life,  reclaiming  individual  identity  and 
the  right  to  shield  one’s  private  and  romantic  life 
from  the  omniscient  manipulation  of  the  system. 

Finally,  social  movements  are  characterized  by 
their  adoption  of  “unusual”  forms  of  political 
behavior.  Many  scholars  pinpoint  the  fundamental 
distinction  between  movements  and  other  political 
actors  in  the  use  by  the  former  of  protest  as  a  form 
of  exerting  political  pressure.  Protest  is  defined  as  a 
nonconventional  form  of  action  that  interrupts  daily 
routine.  Protestors  generally  address  their  actions  to 
public  opinion  over  elected  representatives  or  public 
bureaucracies.  In  fact,  movements  are  characterized 
by  their  fundamental  critique  of  representative 
democracy,  and  they  challenge  institutional  assump¬ 
tions  about  conventional  ways  of  doing  politics  in 
the  name  of  participative  democracy. 

Over  the  past  few  years,  the  analysis  of  social 
movements  has  been  approached  through  the  con¬ 
cept  of  contentious  politics,  defined  as  episodic, 
public,  collective  interaction  among  claims  makers 
and  their  targets  (when  at  least  one  government  is 
a  claimant,  an  object  of  claims,  or  a  party  to  the 
claim  and  the  claims  would,  if  realized,  affect  the 
interests  of  at  least  one  of  the  claimants).  Doug 


McAdam,  Sidney  Tarrow,  and  Charles  Tilly  devel¬ 
oped  a  broad  research  project  that  addressed  the 
similar  mechanisms  at  play  in  social  movements, 
revolutions,  democratization  processes,  and  ethnic 
conflicts.  Advocating  a  dynamic  use  of  concepts, 
the  scholars  involved  in  this  project  have  tried  to 
single  out  general  mechanisms  of  contention. 
Although  social  movement  studies  first  developed 
in  the  field  of  sociology,  they  have  also  contrib¬ 
uted,  as  will  be  discussed  below,  to  the  main  disci¬ 
plinary  subfields  of  political  science. 

Nonconventional  Participation 

The  first  pieces  of  research  on  social  movements  in 
the  field  of  political  science  were  carried  out  within 
the  solid  research  tradition  on  political  participa¬ 
tion.  While  this  field  of  research  was  initially 
focused  on  conventional  forms  of  participation, 
underlining  the  unequal  participation  of  various 
social  groups,  from  the  1970s  onward,  scholars 
began  to  observe  a  rapid  growth  of  new,  noncon¬ 
ventional  forms  of  political  participation,  such  as 
petitions,  sit-ins,  boycotts,  the  occupation  of  build¬ 
ings,  and  the  blocking  of  traffic.  This  led  them  to 
investigate  the  different  styles  of  participation  of 
different  social  groups,  generations,  or  nations  and 
the  conditions  that  would  provoke  the  develop¬ 
ment  of  new  forms  of  participation. 

In  an  important  piece  of  comparative  research 
carried  out  in  the  1970s  on  different  Western 
democracies,  Samuel  Barnes  and  Max  Kaase 
observed  that,  with  respect  to  laws  and  decisions 
seen  as  unjust  or  illegitimate,  ever- wider  groups  of 
citizens  are  ready  to  resort  to  forms  of  action  char¬ 
acterized  by  their  nonconventionality.  Yet  we  can¬ 
not  speak  of  a  real  split  between  those  who  use 
“orthodox”  participatory  tactics  and  those  who 
use  so-called  direct-action  tactics.  In  fact,  conven¬ 
tional  participation  is  often  accompanied  by  non¬ 
conventional  participation,  indicating  that  those 
interested  in  politics  and  competent  in  the  field 
tend  to  use  a  range  of  the  various  available  instru¬ 
ments  for  exerting  pressure  on  governments.  If 
there  are  individuals  who  prefer  one  or  the  other 
type  of  strategy,  there  are  nonetheless  many  who 
combine  the  two  types.  The  conclusion  is  that  the 
increase  in  nonconventional  participation  is  not  an 
indicator  of  the  decline  of  the  legitimacy  of  liberal 
democracies,  where,  on  the  contrary,  a  growth  in 
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political  competence  has  been  observed,  in  particu¬ 
lar  among  young  people.  It  is  instead  an  expression 
of  a  lasting  enlargement  of  the  strategies  available 
to  citizens. 

Later  pieces  of  comparative  research  (among 
them  work  by  Russell  Dalton  and  Pippa  Norris) 
have  revealed  that  while  levels  of  traditional  types 
of  political  participation  have  remained  stable  or 
(in  some  forms)  declined  in  the  1990s  and  the  new 
millennium,  levels  of  noninstitutional  participation 
have  increased  enormously.  This  growth  con¬ 
cerned  all  of  the  countries  analyzed,  and  inside  the 
single  countries  the  differences  in  levels  of  partici¬ 
pation  linked  to  gender,  age,  and  educational  level 
had  lessened,  leading  some  to  speak  of  a  “partici¬ 
patory  revolution.” 

Research  on  social  movements  has  made  a  par¬ 
ticularly  rich  contribution  to  the  analysis  of  politi¬ 
cal  participation  in  work  on  repertoires  of  protest. 
Charles  Tilly  has  defined  a  modern  repertoire  of 
collective  action  as  formed  by  the  set  of  means  a 
group  has  at  its  disposal  for  making  collective 
claims.  Tilly  has,  in  particular,  identified  the  differ¬ 
ences  in  the  types  of  contentious  action  in  particu¬ 
lar  historical  periods.  Protest  was  certainly  not 
unheard  of  prior  to  the  formation  of  the  nation¬ 
state:  Peasants  burned  down  mills  in  protest 
against  increases  in  the  price  of  bread;  subjects 
dressed  up  to  mock  their  superiors;  and  funerals 
could  be  turned  into  occasions  for  denouncing 
injustices.  The  tactics  adopted  by  protestors  varied 
from  the  use  of  irreverent  symbols  and  music  (as  in 
the  custom  of  charivari  or  shivaree)  to  field  inva¬ 
sions  and  grain  seizures.  However,  they  all  were 
parochial  in  scope,  addressing  local  actors  or  the 
local  representatives  of  national  actors;  moreover, 
they  relied  on  patronage,  appealing  to  local  power 
holders  to  convey  grievances  or  settle  disputes.  In 
the  19th  century,  however,  forms  of  collective 
action  began  to  change,  as  the  old  parochial  and 
patronage-dependent  repertoire  was  replaced  by 
one  that  was  national  and  autonomous,  involving 
actions  such  as  strikes,  electoral  rallies,  public 
meetings,  petitions,  marches,  insurrections,  and 
the  invasion  of  legislative  bodies.  This  transforma¬ 
tion  in  the  form  of  protest  was  the  result  of  the 
creation  of  the  nation-state,  the  development  of 
capitalism,  and  the  emergence  of  modern  means  of 
communication.  The  new  repertoire,  therefore, 
responded  to  a  new  situation  in  which  politics  was 


increasingly  national  in  character,  the  role  of  com¬ 
munities  diminished,  and  organized  associations 
spread,  particularly  among  the  working  class. 

Resource  Mobilization  and 

Organizational  Approaches 

Another  significant  contribution  from  scholars  of 
social  movements  is  found  in  organizational  the¬ 
ory,  from  research  on  organizational  populations 
to  the  most  recent  neo-institutionalist  approaches. 
In  particular,  placing  itself  within  the  open-system 
approach  (which  maintained  the  relevance  of 
interactions  between  organizations  and  their  envi¬ 
ronments),  research  on  social  movement  organiza¬ 
tions,  carried  out  by  scholars  such  as  Mayer  Zald 
and  John  McCarthy,  has  underlined  the  role  of 
organizational  entrepreneurs  in  the  mobilization 
of  resources  in  the  organizational  fields  of  single 
groups. 

Until  the  1960s,  studies  on  social  movements 
were  dominated  by  a  functionalist  approach  that 
interpreted  them  as  reactions  to  systemic  malfunc¬ 
tions,  thereby  reducing  them  to  purely  reactionary, 
pathological  phenomena.  Criticizing  this  represen¬ 
tation,  throughout  the  1970s,  the  resource  mobili¬ 
zation  approach  considered  social  movements  as  a 
normal  part  of  the  political  process,  focusing 
analyses  on  the  mobilization  of  those  resources 
necessary  to  collective  action.  According  to  the 
resource  mobilization  approach,  social  movements 
act  in  a  rational,  prognostic,  and  organized  man¬ 
ner.  Protest  actions  derive  from  a  calculation  of 
costs  and  benefits,  influenced  by  the  presence  not 
only  of  conflicts  but  also  of  the  resources  that  are 
required  to  be  mobilized  around  those  conflicts.  In 
a  historical  situation  in  which  unease,  contrasts, 
conflicts  of  interests,  and  ideological  clashes  appear 
ever  present,  the  rise  of  collective  action  cannot  be 
explained  on  the  basis  of  these  elements  alone, 
begging  the  study  of  the  conditions  that  permit  the 
transformation  of  discontent  into  action.  The  type 
and  extent  of  available  resources  explain  the  tacti¬ 
cal  choices  of  movements  and  the  consequences  of 
collective  action  for  the  social  and  political  system. 

In  this  sense,  the  mobilization  capacities  of 
groups  vary  enormously,  and  this  reinforces  the 
inequalities  present  in  society.  As  far  as  the  so- 
called  weak  interests — of  those  poor  in  resources — 
are  concerned,  their  organization  often  depends  on 
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the  mobilization  of  influential  patrons  and/or 
strong  organizational  structures.  Organization, 
aided  by  solidarity  links,  can  therefore  compensate 
for  the  absence  of  material  resources. 

Unlike  the  “mass  society”  theories,  which  had 
presented  some  forms  of  political  aggregation  as 
effects  of  the  uprooting  of  individuals  from  primary 
groups  and  of  the  social  disaggregation  stemming 
from  modernization,  the  resource  mobilization 
approach  instead  explains  mobilization  through 
both  the  moral  gratification  intrinsic  to  the  pursuit 
of  a  collective  good  and  the  existence  of  horizontal 
(i.e.,  internal  to  the  collective)  and  vertical  (i.e., 
between  different  collectives)  solidarity  links.  The 
density  of  relations  between  people  who  share  some 
cultural  characteristics  tends  to  facilitate  the  con¬ 
struction  of  solidarity.  Social  movement  activists 
are  in  fact  well  integrated  within  various  types  of 
social  networks,  both  formal  and  informal.  It  is 
above  all  due  to  these  links  that  the  potential  activ¬ 
ist  develops  a  particular  vision  of  the  world  and 
acquires  the  information  and  skills  necessary  for 
collective  action.  Given  that  social  environments 
(and  groups)  are  differentiated  on  the  basis  of  the 
density  of  the  networks  that  characterize  them,  this 
gives  rise  to  different  capacities  for  participation. 
Charles  Tilly  has  argued  that  the  mobilization  of 
groups  is  influenced  by  their  level  of  catnet-ness ,  a 
synthesis  of  characteristics  linked  to  social  catego¬ 
ries  and  the  density  of  social  networks.  Indeed,  the 
passage  from  a  category  (an  aggregation  of  indi¬ 
viduals  who  share  determined  characteristics)  to  a 
social  group  (a  community  capable  of  collective 
action)  is  facilitated  by  the  simultaneous  presence  of 
specific  categorical  traits  and  networks  of  relations 
that  link  the  subjects  shared  among  those  traits. 

That  opportunities  for  participation  grow  for 
those  social  groups  characterized  by  similar  struc¬ 
tures  and  intense  social  relations  is  indicated  by 
numerous  studies  on  the  labor  movement.  This 
was  linked  in  particular  to  the  presence  of  large 
masses  of  workers  who  carried  out  similar  tasks 
and  tended  to  spend  not  only  their  working  hours 
but  also  their  leisure  time  together,  living  in 
socially  homogeneous  areas  found  close  to  the  fac¬ 
tories.  The  presence  of  socially  homogeneous  net¬ 
works  characterized  by  intense  social  relations 
favored  the  choice  to  cooperate.  Collective  action 
then  reinforced  the  awareness  of  holding  common 
interests — causing  the  growth  of  what  Karl  Marx 


called  class  consciousness.  Socialist  ideology  would 
then  politicize  many  social  struggles,  proposing  a 
wider  vision  of  the  world. 

Modernization  has  on  the  whole  increased  orga¬ 
nizational  capacities.  Above  all,  technological 
developments  have  increased  the  quantity  of 
resources  available  in  the  environment.  In  an 
imposing  piece  of  research  spanning  the  past  4  cen¬ 
turies,  Charles  Tilly  has  uncovered  an  evolution 
from  decentralized  and  informally  coordinated 
movements  to  centralized  and  formal  movements — 
that  is,  from  solidarity  or  community  groups  to 
proactive  and  prolonged  actions,  organized  on  a 
grand  scale  by  associations  constituted  with  the 
aim  of  gaining  specific  ends.  This  development  was 
rendered  possible  by  the  expansion  of  means  of 
communication  (today,  in  particular,  of  communi¬ 
cation  via  computer).  Economic  progress  too  had  a 
generally  positive  effect  on  the  associative  capaci¬ 
ties  of  individuals,  increasing  the  quantity  of 
resources  available  for  collective  action.  In  particu¬ 
lar,  the  diffusion  of  education  and  the  proportional 
upward  shift  in  socioeconomic  class  heightened 
propensities  to  associate,  since  more  people  are 
able  to  subscribe  and  give  money.  In  addition, 
together  with  education,  faith  in  one’s  own  ability 
to  influence  the  outside  world  tends  to  increase, 
and  with  it  the  motivation  for  collective  action. 

Identities  and  History 

Research  on  social  movements  has  also  contrib¬ 
uted  to  the  debate,  an  increasingly  relevant  one  in 
political  science,  on  the  development  of  collective 
identity.  According  to  Alessandro  Pizzorno,  a 
characteristic  of  politics  is  its  reference  to  solidar¬ 
ity  systems  that  form  the  basis  of  the  definition  of 
interests.  Those  who  are  mobilized  defend  certain 
interests,  which  are  recognizable  only  with  refer¬ 
ence  to  a  certain  value  system.  Interest  in  one’s 
own  material  welfare  is  not,  for  example,  absolute 
or  innate  but  linked  to  a  certain  conception  of  the 
world.  Value  options  lead  us  to  identify  with  larger 
groups,  to  which  we  feel  a  sense  of  belonging  and 
in  the  interests  of  which  we  are  disposed  to  act. 
The  process  of  participation  thus  requires  the  con¬ 
struction  of  sympathetic  collectives  inside  which 
individuals  consider  one  another  as  equals. 

Identity  as  the  awareness  of  belonging  to  a  col¬ 
lective  “we,”  or  to  a  class,  thus  facilitates  political 
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participation.  If  the  construction  of  an  identity  is  a 
precondition  for  collective  action,  it  is  at  the  same 
time  a  product  of  it.  In  fact,  participation  trans¬ 
forms  the  identity  of  individuals,  reinforcing  the 
feeling  of  belonging  to  some  groups  and  weaken¬ 
ing  identification  with  other  roles.  In  the  evolution 
of  collective  action,  identity  is  produced  and  mul¬ 
tiplies.  The  barricades  of  the  revolutionary  move¬ 
ment,  the  strikes  of  the  labor  movement,  and  the 
occupations  of  the  student  movement  are  all  forms 
of  action  oriented  toward  influencing  public  deci¬ 
sions.  But  they  also  have  an  internal  effect,  creat¬ 
ing  solidarity  between  participants,  making  them 
feel  part  of  a  collective  effort.  It  is  the  same 
action — participation — that  then  reinforces  the 
sense  of  belonging  in  a  sort  of  virtuous  circle. 

For  collective  action  to  occur,  it  is  necessary 
that  those  who  act  elaborate  a  definition  of  them¬ 
selves,  of  other  social  actors,  and  of  the  content  of 
the  relations  that  connect  them.  They  must  identify 
not  only  a  “we”  with  whom  to  sympathize  but 
also  a  “they”  to  attribute  blame  to  for  the  condi¬ 
tions  they  wish  to  change.  The  construction  of 
identity  requires  not  only  a  positive  definition  of 
who  is  part  of  a  certain  group  but  also  necessarily 
a  negative  definition  of  who  is  excluded.  For  inter¬ 
action  to  occur  it  is  necessary,  on  the  one  hand, 
that  the  identities  of  diverse  actors  be  recognized 
by  external  actors,  so  that  a  part  of  the  mobiliza¬ 
tion  is  oriented  toward  this  search  for  recognition, 
inseparable  from  identity  itself.  On  the  other  hand, 
collective  action  itself  contributes  to  building  and 
consolidating  identity  through  the  definition  of 
boundaries  between  actors  involved  in  a  conflict. 

Political  Opportunities  and 
Comparative  Political  Systems 

The  analysis  of  social  movements  has  been  particu¬ 
larly  lively  in  the  field  of  comparative  politics. 
Although  concentrating  mostly  on  liberal  democ¬ 
racies,  studies  of  social  movements  have  system¬ 
atized  their  observations  on  the  relations  between 
institutional  political  actors  and  protest.  As  chal¬ 
lengers  to  a  given  political  order,  social  movements 
interact  with  the  actors  that  enjoy  a  consolidated 
position  in  that  order.  The  characteristics  of  these 
interactions  affect  both  the  form  that  collective 
action  takes  and  its  probabilities  of  success.  In  the 
study  of  social  movements,  the  most  widely  used 


concept  for  defining  the  characteristics  of  the 
external  environment  relevant  to  the  development 
of  social  movements  is  the  political  opportunity 
structure ,  developed  in  particular  by  the  American 
political  scientist  Sidney  Tarrow.  Integrating  vari¬ 
ous  empirical  observations  into  a  theoretical 
framework  for  his  study  of  protest  cycles  in  Italy, 
Tarrow  considered  the  degree  of  openness  or 
closure  of  formal  political  access,  the  degree  of 
stability  or  instability  of  political  alignments,  the 
availability  and  strategic  postures  of  potential 
allies,  and  the  political  conflicts  between  and 
within  elites  as  the  main  categories  of  political 
opportunities  for  social  movements. 

With  the  increase  in  interest  in  social  movements 
by  political  scientists,  European  scholars,  including 
Hanspeter  Kriesi,  Dieter  Rucht,  and  Donatella 
della  Porta,  started  to  use  the  concept  of  political 
opportunities  in  cross-country  research  projects. 
Alexis  de  Tocqueville’s  famous  contrast  between  a 
“weak”  American  government  and  a  “strong” 
French  one  is  usually  the  implicit  or  explicit  start¬ 
ing  point  for  analyses  linking  institutional  factors 
with  social  movement  development.  Some  criticism 
notwithstanding,  the  idea  that  a  state’s  strength  or 
weakness  influences  social  movement  strategies 
remains  central  to  the  literature  on  collective  action 
in  general  and  on  revolutions  in  particular.  In 
Europe  especially,  this  approach  resonated  with  a 
focus  on  the  cross-national  comparison  of  different 
types  of  European  democracies,  based  on  different 
institutional  assets  and  cultural  traditions.  In  gen¬ 
eral,  an  institutional  system  has  been  considered 
more  open  (and  less  repressive)  the  more  political 
decisions  are  dispersed  (through  functional  differ¬ 
entiation  of  power,  territorial  decentralization,  and 
direct  democracy).  The  prevalent  belief  is  that  the 
greater  the  number  of  actors  who  share  political 
power  (the  greater  the  checks  and  balances),  the 
greater  the  chance  that  social  movements  will  gain 
access  to  the  system.  However,  while  a  weak  execu¬ 
tive  may  ease  access  to  the  decision-making  pro¬ 
cess,  it  will  have  little  hope  of  implementing  policies 
to  meet  the  demands  of  social  movements.  Formal 
institutions  do,  however,  interact  with  informal 
strategies  to  deal  with  opponents,  with  either  inclu¬ 
sive  or  exclusive  historical  traditions  that  tend  to 
reproduce  themselves. 

Beyond  the  comparison  of  different  institutions, 
the  political  process  approach  also  stressed  the  role 
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of  institutional  allies  for  social  movements.  A  more 
dynamic  set  of  variables — susceptible  to  change  in 
the  short  term  and  the  object  of  pressure  from 
social  movements — included  aspects  such  as  elec¬ 
toral  instability  and  elite  divisions.  Attention  to 
allies  such  as  trade  unions  and  political  parties  also 
resonated  with  the  relevance  assigned  to  these 
actors. 

One  of  the  reasons  for  the  spread  of  the  political 
opportunity  approach  in  Europe  may  have  been 
the  interest,  well  developed  in  European  political 
science  and  sociology,  in  cross-European  compari¬ 
sons.  In  the  1990s,  in  particular,  this  interest  pro¬ 
duced  large  comparative  research  projects,  singling 
out  and  exploiting  different  dimensions  of  com¬ 
parison  among  European  countries,  such  as  citi¬ 
zens’  regimes  in  research  on  mobilization,  migrant 
rights,  welfare  regimes,  or  issues  of  unemploy¬ 
ment.  Historical  comparisons,  looking  at  long¬ 
term  evolution,  singled  out  a  trend  toward  an 
increasing  diffusion  of  protest  repertoires  and 
social  movements. 

The  Effects  of  Social  Movements 
on  Public  Policies 

Another  area  to  which  research  on  social  move¬ 
ments  contributed  is  the  analysis  of  public  policies. 
A  growing  number  of  studies  have  addressed  the 
effects  of  protest  on  the  policy  process.  In  one  of 
the  first  and  most  influential  studies  on  social 
movement  effects,  William  Gamson  identified  the 
factors  contributing  to  success  as  a  minimalist 
strategy  (“thinking  small”),  the  adoption  of  direct 
action  and  a  centralized  and  bureaucratic  organi¬ 
zation.  Other  scholars  of  collective  action  have, 
however,  challenged  this  vision.  In  particular,  it 
has  been  noted  not  only  that  violence  can  some¬ 
times  appear  as  a  winning  strategy  but  also  that 
when  organizations,  including  social  movements, 
become  bureaucratized,  the  desire  for  organiza¬ 
tional  survival  comes  to  prevail  over  declared  col¬ 
lective  objectives.  According  to  Francis  Fox  Piven 
and  Richard  Cloward,  the  effort  to  build  organiza¬ 
tions  could  be  not  only  futile  but  also  damaging. 
The  search  for  material  resources  to  ensure  organi¬ 
zational  survival  leads  inexorably  toward  coopta¬ 
tion  and  the  taming  of  protest.  However,  it  has 
been  remarked  that  no  particular  strategic  element 
can  be  evaluated  in  isolation  without  taking  into 


account  the  conditions  within  which  social  move¬ 
ments  must  operate  as  well  as  the  presence  of  alli¬ 
ances  or  opponents  in  power.  Most  important, 
movements  are  never  the  sole  actors  to  intervene 
on  an  issue;  rather,  they  are  part  of  alliances 
including  political  parties  and  sometimes  even 
public  agencies. 

The  attribution  of  credit  for  obtaining  substan¬ 
tive  successes  is  in  fact  complex.  The  presence  of  a 
plurality  of  actors  makes  it  difficult  to  attribute 
success  or  failure  to  any  one  particular  strategy, 
and  whether  the  results  of  protest  should  be  judged 
in  the  short  or  the  long  term  represents  a  further 
problem.  While  social  movements  demand  long¬ 
term  changes,  protest  cycles  instead  tend  to  stimu¬ 
late  immediate  “incremental”  reforms.  Policy 
development  is  characterized  by  steps  forward  and 
backward,  moments  in  which  public  policy 
approaches  the  demands  made  by  social  move¬ 
ments  and  others  in  which  the  situation  deterio¬ 
rates.  Factors  peculiar  to  social  movements,  such 
as  their  distance  from  the  levers  of  power,  the 
heterogeneous  definition  of  their  objectives,  and 
their  organizational  instability,  further  complicate 
matters. 

These  difficulties  notwithstanding,  social  move¬ 
ments  have  had  relevant  effects  on  different  stages 
of  public  decision-making  processes.  Generally, 
social  movements  are  formed  to  express  dissatisfac¬ 
tion  with  an  existing  policy  in  a  given  area. 
Although  it  is  usual  to  make  a  distinction  between 
political  and  cultural  movements,  the  first  following 
a  more  instrumental  logic,  the  second  more  sym¬ 
bolic,  all  movements  tend  to  make  demands  on  the 
political  system:  Environmentalist  groups  have 
demanded  interventions  to  protect  the  environ¬ 
ment;  pacifists  have  opposed  the  culture  of  war; 
students  have  criticized  selection  and  authoritarian¬ 
ism  in  education;  the  feminist  movement  has  fought 
discrimination  against  women;  and  the  world  social 
forums  have  criticized  neoliberal  globalization. 

The  definition  of  specific  policy  claims  is  often 
relevant  for  the  very  self-definition  of  a  social 
movement.  Frequently,  a  particular  demand 
becomes  nonnegotiable ,  being  the  basis  for  a 
movement’s  identity.  In  many  countries,  for  exam¬ 
ple,  the  feminist  movement  has  been  constructed 
around  the  nonnegotiable  right  of  women  to 
“choose”  concerning  childbirth;  the  halting  of  the 
installation  of  nuclear  missiles  in  the  countries  of 
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the  North  Atlantic  Treaty  Organization  (NATO) 
fulfilled  a  similar  role  for  the  peace  movement.  In 
the  first  case,  mobilization  was  proactive,  seeking 
to  gain  something  new,  the  right  to  free  abortion; 
in  the  second,  it  was  reactive,  seeking  to  block  a 
decision  (to  install  Cruise  missiles)  that  had  already 
been  taken.  One  of  the  founding  organizations  of 
the  World  Social  Forum  in  Porto  Alegre,  the 
Association  for  the  Taxation  of  Financial  Trans¬ 
actions  and  for  Civic  Action  (Association  pour  la 
Taxation  des  Transactions  financieres  et  pour 
l’Action  Citoyenne,  ATTAC),  emerged  around 
demands  for  a  tax  on  transnational  transactions; 
also  present  in  Porto  Alegre,  the  debt  relief  cam¬ 
paign  asked  for  the  cancellation  of  foreign  debt  for 
poor  countries.  Demands  whose  symbolic  value  is 
very  high,  such  as  the  Equal  Rights  Amendment  in 
the  case  of  the  American  feminist  movement, 
remain  central  for  a  movement  even  when  their 
potential  effectiveness  is  questionable.  The  impor¬ 
tance  of  such  nonnegotiable  objectives  is  con¬ 
firmed  by  the  fact  that,  although  activists  may  be 
willing  to  negotiate  on  other  demands,  even  partial 
victories  on  these  issues,  such  as  a  woman’s  right 
to  voluntarily  interrupt  pregnancy,  are  considered 
as  defeats.  While  nonnegotiable  demands  are  par¬ 
ticularly  important  in  the  construction  of  collective 
identities,  social  movements  rarely  limit  themselves 
to  these.  In  fact,  they  tend  to  articulate  specific 
requests  for  reforms. 

The  changes  brought  about  by  social  move¬ 
ments  may  be  evaluated  by  looking  at  the  various 
phases  of  the  decision-making  process:  the  emer¬ 
gence  of  new  issues,  the  writing  and  applying  of 
new  legislation,  and  the  analysis  of  the  effects 
of  public  policies  in  alleviating  the  conditions  of 
those  mobilized  by  collective  action.  Various  levels 
of  responsiveness  to  collective  demands  within  the 
political  system  can  be  distinguished:  from  the 
availability  of  the  authorities  to  listen  to  the  pro¬ 
testors,  to  their  willingness  to  put  an  issue  on  the 
agenda,  and,  then,  to  adopt  and  implement  specific 
policies.  Research  on  social  movements  initially 
concentrated  on  the  production  of  legislation,  with 
quantitative  and  qualitative  analyses  of  the 
responses  of  parliaments  and  governments. 

More  and  more,  however,  research  has  devel¬ 
oped  also  on  the  implementation  of  decisions 
demanded  by  social  movements  as  well  as  the  cul¬ 
tural  transformations  they  produce.  Although 


movements  tend  to  request  legislative  change,  it  is 
also  true  that  this  is  neither  their  only  nor  perhaps 
even  their  primary  objective,  as  movements  are  in 
fact  carriers  of  symbolic  messages  by  influencing 
bystanders  and  spreading  their  own  conception  of 
the  world  and  alternative  values.  Typically,  new 
ideas  emerge  within  critical  communities  and  are 
then  spread  via  social  movements.  While  the 
capacity  of  social  movements  for  reaching  their 
general  aims  has  been  considered  low,  their  capac¬ 
ity  for  thematization,  the  introduction  of  new 
issues  into  public  debate,  and  their  long-term  cul¬ 
tural  influence  have  been  considered  high. 

Local  Protest  and  Urban  Policies 

From  the  1990s  onward,  research  on  social  move¬ 
ments  and  public  policies  has  also  focused  on  pro¬ 
test  campaigns  against  large-scale  public  works  in 
urban  and  semirural  contexts.  If  the  presence  of 
territorial  conflicts  is  certainly  not  a  new  phenom¬ 
enon,  the  1990s  saw  a  growing  analytical  focus  on 
new  forms  of  protest,  which  have  been  portrayed 
as  limited  and  localized.  It  has  been  observed  that 
those  seeking  to  defend  the  quality  of  life  in  a  lim¬ 
ited  territory  tend  to  oppose  public  works  that  are 
seen  as  compromising  either  the  ecological  equilib¬ 
rium  (e.g.,  refuse  incinerators)  or  public  security 
(e.g.,  the  insertion  of  unwanted  social  groups  in 
their  territory).  The  presence  of  localized  conflicts 
has  met  with  great  concern,  given  the  multiplica¬ 
tion  in  both  these  types  of  protest  against  locally 
unwanted  land  uses  (LULUs)  in  recent  years. 

Local  conflicts  are  usually  seen  in  the  social 
science  literature  (and  more  generally)  as  being 
motivated  by  a  “not  in  my  backyard”  (NIMBY) 
syndrome,  associated  with  conservative  behavior 
and  an  egotistical  resistance  to  social  change.  In 
public  policy  analysis,  local  opposition  is  often 
associated  with  “free  riderism” — that  is,  the  refusal 
to  pay  the  necessary  costs  to  attain  public  goods. 
This  interpretation  of  the  NIMBY  syndrome  has, 
however,  been  disputed.  In  fact,  empirical  research 
on  local  oppositions  has  indicated  a  complex  real¬ 
ity,  with  committees  characterized  by  a  diverse 
capacity  or  will  to  present  their  particular  claims 
within  a  more  comprehensive  framework. 

In  contrast  to  a  NIMBY  reading  of  the  situa¬ 
tion,  sociological  research  has  often  interpreted 
LULU  conflicts  as  an  expression  of  different  types 
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of  social  movements.  In  the  discourse  of  those  who 
protest,  the  defense  of  local  natural  resources  is 
often  accompanied  by  a  more  general  defense  of 
natural  resources.  From  this  point  of  view,  local 
conflicts  can  be  interpreted  as  an  evolution  in  the 
ecological  movement,  characterized  not  only  by 
the  growth  (in  numbers,  resources,  and  legitimacy) 
of  environmental  organizations  but  also  by  the  dif¬ 
fusion  of  environmental  awareness  into  public 
opinion  as  well  as  an  extension  of  collective  action 
beyond  traditional  organizational  networks  or 
environmental  discourses.  In  contrast  to  the  1970s 
and  1980s,  the  main  role  in  the  mobilization  of  the 
protest  is  now  played  by  citizens’  committees — 
that  is,  organized  but  weakly  structured  groups  of 
citizens  that  unite  on  a  territorial  basis  and  use 
forms  of  protest  to  oppose  interventions  that  they 
claim  to  be  damaging  for  the  quality  of  life  in  their 
territory.  Additionally,  although  previously  framed 
mainly  as  a  “postmaterialist”  concern,  shared  by  a 
“new  bourgeoisie”  with  high  levels  of  education, 
environmental  conflicts  now  tend  to  involve  more 
and  more  underprivileged  groups,  above  all  in 
degraded  areas,  where  projects  with  high  environ¬ 
mental  impacts  (e.g.,  incinerators)  are  often 
executed.  In  these  conflicts  environmental  associa¬ 
tions  are  usually  present,  but  they  often  cooperate 
and  conflict  with  other  organizations  based  in  the 
territory. 

Local  conflicts  are  also  discussed  in  the  light  of 
earlier  research  on  urban  “growth  machines,” 
defined  by  formal  and  informal  networks  between 
public  and  private  actors  oriented  toward  increas¬ 
ing  investment  for  economic  growth.  Against  these 
networks,  social  movements,  groups,  and  associa¬ 
tions  opposed  to  the  reduction  of  welfare  resources 
destined  for  marginal  social  groups,  as  well  as 
residents’  associations  who  see  themselves  as  dam¬ 
aged  by  projects  for  urban  transformation,  pro¬ 
pose  a  different  model  of  development. 

To  overcome  the  NIMBY  label,  social  move¬ 
ments  mobilized  in  these  local  conflicts  tend  to 
bridge  the  opposition  to  specific  land  use  with 
frames  of  social  justice.  Faced  with  those  who 
accuse  them  of  protesting  on  behalf  of  the  indi¬ 
vidual  interest  (rather  than  the  common  good), 
they  develop  a  “not  on  planet  Earth”  (NOPE) 
discourse.  Moreover,  they  often  define  their  pro¬ 
test  through  a  procedural  rhetoric  that  defends 
their  action  as  opposition  to  the  abuse  of  power 


and  lack  of  transparency  in  public  decision  mak¬ 
ing,  as  well  as  the  collusive  alliance  between  gov¬ 
ernment  and  entrepreneurial  interests. 

Social  Movements  and  Democratization 

Reflections  on  social  movements  are  also  relevant 
for  the  analysis  of  democratization  processes.  The 
role  of  social  movements  for  democracy  has  been 
discussed  with  very  different  expectations.  Initially, 
social  movements  were  considered  either  as  mar¬ 
ginal  (vis-a-vis  political  parties)  to  democratic  devel¬ 
opment  or  as  dangerous  for  democracy,  promoting 
excessive  levels  of  societal  mobilization.  Later  on, 
however,  these  assumptions  have  been  contested, 
and  more  and  more  attention  has  been  paid  to  the 
development  of  civil  society  organizations. 

It  is  true  that  some  movements  have  refused 
democracy  altogether  (e.g.,  fascist  and  neo-fascist 
movements),  while  others  have  the  unintended 
effect  of  producing  a  backlash  against  democratic 
rights.  Additionally,  identity  politics,  often  seen  in 
ethnic  conflicts,  have  sometimes  ended  in  religious 
war  and  racial  violence.  However,  since  the  1970s 
in  particular,  the  assessment  not  only  that  social 
movements  flourish  in  democracies  but  also  that 
most,  albeit  not  all,  social  movements  support 
democracy  has  become  widespread.  Pushing  for 
wider  suffrage  or  the  recognition  of  associational 
rights,  social  movements  contributed  to  the  first 
waves  of  democratization  and  the  development  of 
democratic  public  spheres.  Later  on,  the  labor 
movement  has  contributed  to  the  enlargement  of 
citizens’  rights,  and  social  movements  have  played 
a  fundamental  role  in  the  struggles  for  democrati¬ 
zation  as  well  as  in  democratic  consolidation. 

The  social  science  literature  on  regime  transfor¬ 
mation  has  stressed  that  social  movements  contrib¬ 
ute  to  democratization  under  certain  conditions.  In 
particular,  only  those  movements  that  explicitly 
demand  increased  equality  and  protection  for  minor¬ 
ities  actually  promote  democratic  development.  In 
fact,  looking  at  the  process  of  democratization,  it 
may  be  observed  that  collective  mobilization  has 
frequently  created  the  conditions  for  the  destabili¬ 
zation  of  authoritarian  regimes  and  can  also  lead 
to  the  intensification  of  repression  or  the  col¬ 
lapse  of  weak  democratic  regimes,  particularly 
where  social  movements  do  not  stick  to  democratic 
conceptions.  While  labor,  student,  and  ethnic 
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movements  brought  about  a  crisis  in  the  Franco 
regime  in  Spain  in  the  1960s  and  1970s,  the 
worker  and  peasant  movements  and  the  fascist 
countermovements  contributed  to  the  failure  of 
the  process  of  democratization  in  Italy  in  the 
1920s  and  1930s.  Moreover,  social  movements 
have  often  formed  transnational  alliances  to  over¬ 
throw  an  authoritarian  regime.  In  Latin  America, 
as  well  as  in  Eastern  Europe,  social  movements 
have  asked  for  (different  forms  of)  democratiza¬ 
tion,  producing  a  final  breakdown  of  neo-fascist  as 
well  as  real-socialist  authoritarian  governments. 

Research  on  various  transition  processes  has 
stressed  that  the  first  steps  toward  democratization 
include  a  demobilization  of  civil  society  and  the 
development  of  more  institutionalized  political 
actors  following  the  opening  of  institutional  oppor¬ 
tunities,  or  the  “pacted”  strategy  of  the  elites’  set¬ 
tlement,  as  in  Spain  after  Franco  and  toward  the 
end  of  the  Pinochet  regime  in  Chile.  In  recent 
democratization  processes,  the  availability  of  pub¬ 
lic  and  private  funds  in  the  Third  Sector  contrib¬ 
uted  to  an  early  institutionalization  of  movement 
organizations.  This  does  not,  however,  necessarily 
seem  to  be  the  fate  of  social  movements  in  phases 
of  democratic  consolidation:  The  presence  of  a 
tradition  of  mobilization,  as  well  as  movements 
that  are  independent  from  political  parties,  can 
facilitate  the  maintenance  of  high  levels  of  protest 
during  transition  and  consolidation — as  the  shan¬ 
tytown  dwellers’  movement  in  Chile,  the  urban 
movement  in  Brazil,  or  the  environmental  move¬ 
ments  in  Eastern  Europe  illustrate. 

In  turn,  consolidation  fuels  the  development  of 
social  movements.  Resources  for  collective  action 
tend  in  fact  to  increase  over  time,  as  movements 
become  institutionalized,  construct  subcultural 
networks,  create  channels  of  access  to  policymak¬ 
ers,  and  form  alliances.  This  organizational  conti¬ 
nuity  means  that  the  experiences  of  “early-riser” 
movements  represent  both  resources  and  con¬ 
straints  for  those  that  follow.  Processes  of  imita¬ 
tion  and  differentiation,  and  enforced  repetition 
and  learning  take  place  simultaneously.  Movement 
activists  inherit  structures  and  models  from  their 
predecessors.  The  social  movement  sector  grows 
with  the  diffusion,  during  each  wave  of  mobiliza¬ 
tion,  of  the  capacities  required  for  collective  action. 
In  fact,  mobilization  is  facilitated  by  the  presence 
of  networks  of  activists  willing  to  mobilize  around 


new  issues,  where  these  are  seen  as  compatible 
with  their  original  identities.  Moreover,  the  sub¬ 
stantive  gains  made  by  one  movement  can  have 
beneficial  consequences  for  the  demands  of  other 
movements,  and  their  success  can  encourage  fur¬ 
ther  mobilizations.  It  can  be  concluded,  therefore, 
that  the  importance  of  social  movements  tends  to 
grow  insofar  as  there  is  an  ever-increasing  amount 
of  resources  (both  technical  and  structural)  avail¬ 
able  for  collective  action. 

International  Relations  and 

Transnational  Movements 

Social  movement  studies,  like  other  areas  of  the 
social  sciences,  have  been  late  to  address  the  phe¬ 
nomena  of  transnationalization  and  are  still  in 
search  of  adequate  methods,  concepts,  and  theo¬ 
ries  to  explain  them.  There  are  several  reasons  for 
this.  First,  most  scholarship  has  time  and  again 
confirmed  the  relevant  role  that  national  political 
opportunities  play  in  influencing  social  movement 
mobilization,  its  dimensions,  duration,  and  forms. 
The  modern  repertoire  of  protest  emerged  with  the 
creation  of  the  nation-state,  and  social  movements 
have  played  an  important  role  in  the  development 
of  (national)  citizenship  rights.  So  it  is  at  the 
national  level  that  they  fought  for  access,  suffered 
state  repression,  and  found  allies.  Second,  social 
movement  studies  focused  on  Western  (and 
Northern)  democracies.  Furthermore,  research  in 
international  relations  has  long  considered  the 
states  as  the  only  relevant  actors. 

In  the  field  of  social  movements  as  in  others, 
phenomena  of  transnationalization  were  first,  and 
not  by  chance,  addressed  within  the  fields  of  inter¬ 
national  relations  and  international  sociology. 
Bringing  “transnational  relations  back  in,”  Thomas 
Risse,  Kathryn  Sikkink,  and  others  have  pointed 
out  the  role  of  transnational  environmental  and 
human  rights  campaigns  in  developing  interna¬ 
tional  normative  regimes.  In  doing  this,  they  took 
into  account  the  role  of  nonstate  actors,  as  well  as 
emphasizing  cooperation  over  competition. 
Research  on  human  rights  regimes  or  peace  and 
war  also  stressed  the  emergence  of  international 
norms  that  challenged  the  vision  of  international 
politics  as  an  anarchic  system  of  states. 

At  the  same  time,  research  emerged  on  the  devel¬ 
opment  of  international  governmental  organizations 
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(IGOs)  and,  in  parallel  and  related  to  these,  on  a 
population  of  international  nongovernmental  orga¬ 
nizations  (INGOs),  often  taking  the  form  of  transna¬ 
tional  social  movement  organizations.  Over  the  past 
2  decades,  these  nongovernmental  actors  have 
grown  enormously  in  terms  of  numbers,  member¬ 
ship,  material  resources,  public  resonance,  and  insti¬ 
tutional  access.  This  research  also  recognizes  the 
interplay  of  actors  at  different  geographical  levels, 
going  beyond  disciplinary  borders  between  internal 
and  international  politics.  In  human  rights  cam¬ 
paigns,  national  actors,  suffering  repression  in 
authoritarian  regimes,  found  allies  abroad  in  epis- 
temic  communities,  involving  IGOs,  national  gov¬ 
ernments,  experts,  and  INGOs. 

Focusing  on  the  interactions  between  social 
movements  and  IGOs,  in  the  first  studies  from  a 
social  movement  perspective,  Hanspeter  Kriesi, 
Dieter  Rucht,  Sidney  Tarrow,  and  Donatella  della 
Porta  emphasized  the  capacity  of  transnational 
social  movements  to  adapt  to  IGOs’  rules  of  the 
game,  with  a  diplomatic  search  for  agreement  over 
democratic  accountability,  discretion  over  trans¬ 
parency,  and  persuasion  over  mobilization  in  the 
street.  In  this,  they  found  some  resonance  in  the 
more  normatively  oriented  literature  on  civil  soci¬ 
ety  seen  as  the  beginning  of  a  global  civil  society. 

Research  on  transnational  processes  and  social 
movements  first  developed  around  reflections  on 
multilevel  opportunity  structures.  In  this  sense, 
two  main  paths  of  transnationalization  were  sin¬ 
gled  out:  (1)  social  movements  with  domestic 
political  concerns  (especially  in  authoritarian 
regimes)  searching  for  external,  international  allies 
and  (2)  social  movements  addressing  their  own 
governments  to  influence  international  political 
decisions. 

Attention  was  also  paid  to  the  resource 
exchanges  between  transnational  social  movement 
organizations  and  IGOs.  Exchanges  of  knowledge 
as  well  as  a  potential  reciprocal  legitimization  were 
singled  out  along  with  the  capacity  of  transna¬ 
tional  social  movements  to  sensitize  public  opinion 
to  global  problems.  At  the  same  time,  the  difficul¬ 
ties  in  protesting  beyond  national  borders  were 
stressed  by  looking  at  different  data  on  protest 
events,  which  generally  seemed  to  remain  anchored 
at  the  national  level.  Some  studies  also  pointed  out 
the  cognitive  effects  of  globalization,  for  example, 
in  the  intensification  of  relations  beyond  borders 


in  terms  of  the  cross-national  diffusion  of  move¬ 
ment  frames  and  strategies  for  maintenance  of 
public  order.  Here  as  well,  attention  is  paid  not 
only  to  the  potential  for  increasing  exchanges  but 
also  to  their  limits. 

With  the  development  of  campaigns  addressing 
various  and  diverse  IGOs,  more  reflections  were 
developed  concerning  the  different  opportunities 
that  different  international  assets  offered  to  differ¬ 
ent  social  movements.  Opportunities  such  as  a 
consensual  culture  and  a  reciprocal  search  for  rec¬ 
ognition  were  available,  for  example,  for  the 
United  Nations  (UN)  but  not  for  (the  much  more 
closed  and  hostile)  international  financial  institu¬ 
tions  such  as  the  World  Trade  Organization 
(WTO),  the  International  Monetary  Fund  (IMF), 
and  the  World  Bank,  which  became  the  targets  of 
lively  protests.  In  fact,  more  recent  research  has 
indicated  that  there  are  as  many  types  of  institu¬ 
tions  at  the  transnational  level  as  there  are  national 
ones.  The  social  movements  that  target  them, 
therefore,  have  to  find  specific  leverage,  for  exam¬ 
ple,  in  the  unanimity  rules  of  the  WTO,  which 
make  alliances  with  some  states  particularly  rele¬ 
vant,  while  international  experts  and  formal  chan¬ 
nels  of  consultation  are  exploited  in  interactions 
with  the  IFO.  Additionally,  in  recent  research, 
IGOs  have  emerged  as  complex  and  fragmented 
institutions,  made  of  different  bodies  that  provide 
external  actors  with  differentiated  opportunities, 
booking  at  the  European  Union,  the  European 
Council,  Commission,  Parliament,  and  courts  are 
all  targeted  by  social  movements,  but  protest  strat¬ 
egies  vary  with  the  different  characteristics  of  these 
specific  bodies.  Furthermore,  different  movements 
can  have  a  more  difficult  or  an  easier  life  in  terms 
of  gaining  access  to  specific  (sympathetic) 
Directorates  General  in  the  European  Commission 
or  opposing  powerful  ones. 

Different  opportunities  require  different  strate¬ 
gies.  If  previous  research  had  stressed  the  modera¬ 
tion  of  social  movement  tactics  when  addressing 
IGOs,  with  a  move  from  the  street  to  the  lobbies, 
recent  studies  have  rediscovered  protest.  In  fact, 
going  beyond  the  specific  experiences  of  the  paral¬ 
lel  summits  organized  by  the  UN  on  environmental 
or  women’s  issues,  many  interactions  between 
social  movements  and  IGOs  have  involved  protest. 
With  time,  the  frustration  over  the  results  of  more 
moderate  forms  of  interaction  brought  about  the 
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development  of  broader  coalitions,  including  reli¬ 
gious  groups,  unions,  and  social  movements, 
which  combined  pressure  with  petitions,  marches, 
and  even  direct  action.  Since  the  1999  protests 
during  the  WTO  meeting  in  Seattle,  the  2000s 
have  seen  an  escalation  of  interactions  between 
protestors  and  the  police  during  the  contestation 
of  international  summits.  Even  though  transna¬ 
tional  protests  remain  a  rare  occurrence,  the  few 
transnational  protests  that  have  taken  place 
(whether  in  the  form  of  global  days  of  action, 
countersummits,  or  social  forums)  emerge  as  par¬ 
ticularly  “eventful”  in  their  capacity  to  produce 
relational,  cognitive,  and  affective  effects  on  social 
movement  activists  and  social  movement  organiza¬ 
tions  themselves. 

Finally,  at  the  cultural  level,  a  global  language 
developed  together  with  intense  interactions  dur¬ 
ing  the  transnational  events  mentioned  and  indeed 
a  growing  acknowledgment  of  the  roles  and 
responsibilities  of  IGOs.  Social  movement  activists 
started  to  present  their  actions  as  part  of  a  global 
justice  movement,  calling  for  global  justice  and 
global  democracy.  Although  still  deep-rooted  in 
national  political  systems  and  movement  families, 
cosmopolitan  activists  tend  to  bridge  the  local  with 
the  global  and  vice  versa.  In  doing  this,  they  are 
contributing  to  the  development  of  a  transnational 
political  system,  as  well  as  transnational  identities. 

Social  Movements,  Democracy, 
and  Normative  Theory 

Although  mainly  addressed  within  the  empirical 
subfields  of  political  science,  social  movement 
studies  have  recently  begun  to  interact  with  politi¬ 
cal  theory,  more  specifically  the  branch  reflecting 
on  participatory  and  deliberative  conceptions  of 
democracy. 

Social  movements  often  do  not  limit  their  inter¬ 
ventions  to  single  policies  but  challenge  the  ways 
in  which  political  institutions  work.  Movements 
demand,  and  often  obtain,  the  decentralization  of 
political  power,  the  consultation  of  interested  citi¬ 
zens  on  particular  decisions,  or  appeals  procedures 
against  decisions  by  the  public  administration. 
They  also  interact  increasingly  with  public  admin¬ 
istrations:  They  ask  to  be  allowed  to  testify  before 
representative  institutions  and  the  judiciary,  to  be 
listened  to  as  counterexperts,  and  to  receive  legal 


recognition  and  material  incentives.  In  fact,  social 
movements  increase  the  possibilities  for  access  to 
the  political  system,  both  through  ad  hoc  channels 
relating  to  specific  issues  and  through  institutions 
that  are  open  to  all  noninstitutional  actors.  In  the 
late  20th  and  early  21st  centuries,  social  move¬ 
ments  have  indeed  been  able  to  introduce  changes 
that  move  toward  greater  grassroots  control.  They 
have  produced  a  change  in  political  culture,  in  the 
whole  set  of  norms  and  reference  schemes  that 
define  the  issues  and  means  of  action  that  are 
politically  legitimate.  Repertoires  of  collective 
action,  which  were  once  condemned  and  dealt 
with  simply  as  public  order  problems,  have  slowly 
become  acceptable,  and  new  policy  arenas  have 
been  created  on  issues  of  major  concern  for  social 
movement  activists  (see,  e.g.,  environmental  or 
gender  rights  agencies).  Not  only  were  some  public 
bureaucracies  established  under  the  pressure  of 
movement  mobilizations  and  movement  activists 
regarded  as  potential  allies,  but  also  movement 
activists  have  been  co-opted  into  specific  public 
bodies  as  members  of  their  staff.  The  public 
administrators  working  in  these  institutions  medi¬ 
ate  particular  social  movement  demands  through 
both  formal  and  informal  channels  and  frequently 
ally  themselves  with  movement  representatives  to 
increase  the  amount  of  public  resources  available 
in  the  policy  areas  over  which  they  have  authority. 
They  tend  to  have  frequent  contacts  with  represen¬ 
tatives  of  the  social  movements  involved  in  their 
areas,  with  the  movements  taking  on  a  consultancy 
role  in  many  instances.  In  addition,  they  some¬ 
times  develop  common  interests. 

Social  movement  activists  also  maintain  direct 
contacts  with  decision  makers — participating  in 
epistemic  communities,  made  up  of  representatives 
of  governments,  parties,  and  interest  groups  of 
various  types  and  persuasion.  Nongovernmental 
organizations  critical  of  neoliberalist  globalization 
have,  in  particular,  resorted  to  bringing  pressure  to 
bear  at  both  the  national  and  the  international 
levels,  cultivating  specific  expertise.  From  human 
rights  groups  to  environmentalists,  advocacy  net¬ 
works — composed  of  activists,  bureaucrats  belong¬ 
ing  to  international  organizations,  and  politicians 
from  many  countries — have  won  significant  gains 
in  a  number  of  areas,  such  as  the  protection  of  the 
environment  or  human  rights  violations.  Some 
social  movement  organizations  have,  indeed,  been 
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highlighted  as  having  not  only  increased  in  number 
but  also  strengthened  their  influence  on  the  vari¬ 
ous  stages  of  international  policy  making. 

Most  important,  so-called  participatory  and 
deliberative  institutional  experiments  have  devel¬ 
oped  over  the  past  2  decades,  especially  at  the  local 
level.  They  are  based  on  the  principle  of  participa¬ 
tion  by  “normal  citizens”  in  public  arenas  of 
debate,  empowered  by  information  and  rules  for 
high-quality  communication.  Actors  associated 
with  social  movements  have  intervened  in  the 
development  of  some  of  these  experiments,  some¬ 
times  as  critical  participants,  sometimes  as  external 
opponents. 

More  generally,  social  movements  have  devel¬ 
oped  a  fundamental  critique  of  conventional  poli¬ 
tics,  affirming  the  legitimacy  (if  not  the  primacy) 
of  alternatives  to  parliamentary,  liberal  democ¬ 
racy.  Especially  since  the  1960s,  many  social 
movements  have  supported  the  direct  participation 
of  citizens  in  public  decision  making,  criticizing 
the  delegation  of  decision  making  to  representa¬ 
tives  who  can  be  controlled  only  at  the  moment  of 
elections.  Moreover,  they  seek  to  switch  decision 
making  to  more  transparent  and  controllable 
sites.  In  the  social  movement  conception  of 
democracy,  the  people  themselves  (who  are  natu¬ 
rally  interested  in  politics)  must  assume  direct 
responsibility  for  intervening  in  the  political 
decision-making  process.  More  recently,  the  nor¬ 
mative  debate  on  deliberative  democracy  has 
found  resonance  in  some  social  movement  organi¬ 
zations  that  have  developed  norms  and  practices 
of  consensual  decision  making,  stressing  the  need 
to  improve  the  quality  of  communication.  Some 
attention  on  the  part  of  political  theorists  has  also 
been  focused  on  the  role  of  social  movements  in 
the  creation  of  alternative  public  spheres,  free 
from  state  intervention. 

In  the  global  justice  movement,  in  particular, 
several  organizations  have  experimented  with  par¬ 
ticipatory,  discursive  models  of  democracy  both  in 
their  internal  decision  making  and  in  their  interac¬ 
tions  with  political  institutions.  Internally,  social 
movements  have — with  a  greater  or  lesser  degree 
of  success — attempted  to  develop  an  organiza¬ 
tional  structure  based  on  participation  (rather  than 
delegation),  consensus  building  (rather  than  major¬ 
ity  votes),  and  horizontal  networks  (rather  than 
centralized  hierarchies). 


As  far  as  the  social  movement  critique  of  exist¬ 
ing  democracy  is  concerned,  their  search  for  an 
alternative  cannot  be  considered  as  over.  Not  all 
students  of  social  movement  organizations  agree 
that  they  have  overcome  the  risk  of  producing  oli¬ 
garchies  and  personalistic  leadership,  the  very 
problems  at  the  center  of  their  critique  of  tradi¬ 
tional  politics.  Although  it  maximizes  responsive¬ 
ness,  the  direct  democracy  model  has  weaknesses 
as  far  as  representation  and  efficiency  are  con¬ 
cerned.  Problems  of  efficiency  affect  the  success  of 
movement  organizations  themselves;  problems  of 
representation  concern  the  legitimization  of  new 
forms  of  democracy.  The  refusal  by  social  move¬ 
ments  to  accept  the  principles  of  representative 
democracy  can  undermine  their  image  as  demo¬ 
cratic  actors,  particularly  when  they  begin  to  take 
on  official  and  semiofficial  functions  within  repre¬ 
sentative  institutions,  assuming  the  form  of  parties 
or  public  interest  groups.  Social  forums,  by  bring¬ 
ing  together  heterogeneous  actors,  pay  great  atten¬ 
tion  to  the  quality  of  internal  communication  but 
with  unequal  results.  These  limitations  notwith¬ 
standing,  social  movements  have  helped  open  new 
channels  of  access  to  the  political  system,  contrib¬ 
uting  to  the  identification,  if  not  the  solution,  of  a 
number  of  democratic  problems. 
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Social  Stratification 


Social  stratification  refers  to  the  positions  held  by 
individuals  and  groups  in  the  structures  of  inequal¬ 
ity  existing  in  a  society.  Specifically,  it  denotes  the 
classification  of  individuals  and  groups  into  differ¬ 
ent  categories  on  the  basis  of  the  amount  of  one  or 
more  privileges  enjoyed  by  the  members  of  each 
category  and/or  the  intensity  of  power  that  they 


are  able  to  exert  over  other  people.  In  contempo¬ 
rary  advanced  societies,  based  on  democratic 
political  regimes  and  market  economies,  these 
categories  are  usually  referred  to  as  strata  or 
classes,  depending  on  the  criteria  chosen  to  iden¬ 
tify  them.  Strata  and  classes  are  groups  based  on 
factual  inequalities — that  is,  disparities  produced 
by  the  workings  of  societies  with  legal  systems 
stipulating  the  equality  of  all  citizens  before  the 
law.  Hence,  strata  and  classes  are  open  groups 
that  individuals  can  enter  or  leave  according  to 
the  acquisition  or  the  loss,  during  their  lives,  of 
the  characteristics  defining  membership  of  a  spe¬ 
cific  class  or  stratum.  By  contrast,  in  most  tradi¬ 
tional  societies,  social  inequalities  were  based  on 
legal  and/or  religious  rules  that  led  to  the  forma¬ 
tion  of  closed  groups — such  as  castes,  orders,  or 
estates — to  which  people  belonged  from  birth  and 
for  their  entire  lifetimes,  with  no  chance  of  escap¬ 
ing  from  their  initial  condition.  In  this  entry,  the 
major  factors  determining  social  positions  and 
various  ways  to  group  them  into  broader  catego¬ 
ries  of  strata  and  classes  are  discussed. 

Social  Stratification  Systems 
of  Advanced  Societies 

From  an  analytical  point  of  view,  the  advanced 
societies  comprise  a  plurality  of  institutional  orders 
characterized  by  distinct  systems  of  social  stratifi¬ 
cation.  For  instance,  within  the  political  sphere, 
heads  of  states,  prime  ministers,  and  ministers  of 
central  governments  perform  more  crucial  roles 
and  hold  superior  positions  compared  with  those 
performed  and  held  by  members  of  parliaments, 
mayors,  members  of  city  councils,  and  the  like. 
The  latter,  in  their  turn,  are  politically  more  influ¬ 
ential  than  simple  citizens.  Indeed,  even  mayors 
and  members  of  city  councils  can  (a)  take  decisions 
regarding  the  needs  and  interests  of  different 
people  and  the  whole  community  and  (b)  frame 
these  decisions  in  legal  rules.  No  simple  citizen  has 
this  authority.  However,  some  simple  citizens  can 
hold  commanding  positions  in  the  economic  sphere, 
such  as  chief  executive  officers  of  big  corporations 
or  proprietors  of  medium-  and  small-sized  firms. 
Chief  executive  officers  and  entrepreneurs  can 
determine  the  goals  of  their  companies  and  firms, 
their  organizational  features,  and  the  tasks  under¬ 
taken  by  their  managers  and  professionals.  In  their 
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turn,  managers  and  professionals  are  responsible 
for  organizing  the  work  of  routine  nonmanual 
employees,  foremen,  and  skilled  and  unskilled 
manual  workers.  Obviously  no  white-  or  blue- 
collar  worker  can  take  any  decision  regarding  the 
firm’s  economic  strategies  and  organizational 
arrangements.  As  a  consequence,  they  are  placed 
at  the  bottom  of  the  stratification  system  of  the 
economic  realm.  Yet  even  national  politicians  have 
no  direct  role  to  perform  in  this  sphere,  and  in  no 
sense  can  they  be  considered  as  holding  top  posi¬ 
tions  in  the  relevant  stratification  system.  Similar 
situations  can  be  observed  in  the  cultural  and  edu¬ 
cational  sphere,  in  the  religious  realm,  and  so  on. 

Though  largely  independent,  the  main  institu¬ 
tional  orders  of  contemporary  advanced  societies 
are  not  reciprocally  disconnected.  On  the  contrary, 
they  are  functionally  interdependent.  Educational 
systems  and  universities  are  required  to  produce 
not  only  intellectuals  but  also  skilled  workers  for 
the  economy.  Moreover,  a  high  level  of  schooling 
can  be  a  useful  asset  in  the  political  and  economic 
arenas.  In  their  turn,  the  workings  of  the  economy, 
besides  being  dependent  on  the  availability  of  suit¬ 
ably  skilled  labor,  depend  closely  on  infrastruc¬ 
tural  interventions,  trade  and  tax  policies,  labor 
market  regulations,  and  welfare  and  educational 
measures  decided  by  central  and  local  govern¬ 
ments.  Politics  can  take  even  more  incisive  action  in 
the  economic  realm  by  determining  specific  finan¬ 
cial  support  in  favor  of  individual  sectors  or  firms 
to  prevent  unemployment  episodes  or  to  guarantee 
the  survival  of  economic  activities  considered  as 
crucial  national  or  local  assets.  In  parallel,  the 
economy,  through  taxation,  supplies  politics  with 
the  financial  resources  needed  to  develop  public 
policies  and  to  pay  the  costs  of  political  assemblies, 
related  bureaucratic  bodies  and  administrative 
staffs,  the  army,  and  so  on.  Furthermore,  compa¬ 
nies,  firms,  and  actors  with  higher  positions  in  the 
economic  sphere  can  influence  the  workings  of  the 
political  realm  by  selectively  funding  parties  and 
politicians. 

Owing  to  the  functional  interdependence  among 
institutional  orders,  those  who  perform  superior 
roles  and  hold  higher  positions  in  one  such  order 
can  influence  both  their  counterparts  in  other 
orders  and,  even  more  so,  those  occupying  subor¬ 
dinate  roles  in  them.  As  a  consequence,  the  incum¬ 
bents  of  higher  positions  in  the  political  system,  for 


instance,  are  usually  able  to  secure  advantages  for 
themselves,  civil  servants,  and  related  social  groups. 
In  a  similar  way,  people  in  higher  positions  within 
the  cultural  sphere  are  frequently  able  to  obtain 
sizeable  material  and  symbolic  privileges.  More¬ 
over,  they  are  quite  often  able  to  use  these  advan¬ 
tages  and  their  cultural  capital  to  achieve  desirable 
standings  in  the  political  or  economic  sphere. 
Obviously,  the  same  holds  for  the  incumbents  of 
higher  roles  in  the  economic  or  other  institutional 
realms.  These  people  may  also  use  their  power  and 
privileges  to  obtain  social  acknowledgments  and 
honors  and  to  pursue  political  careers.  By  contrast, 
people  occupying  lower  positions  within  an  insti¬ 
tutional  order  can  only  with  difficulty  compensate 
for  their  disadvantages  by  exerting  some  kind  of 
influence  in  other  social  realms.  It  is  decidedly 
more  likely  that  they  will  play  subordinate  roles  in 
the  latter  as  well. 

It  is  precisely  because  advantages  and  power 
linked  to  one  specific  structure  of  inequality  can  be 
transformed  into  advantages  and  power  linked  to 
other  forms  of  social  disparity  that  most  social 
scientists  maintain  that  there  exists  a  general  sys¬ 
tem  of  stratification  in  contemporary  advanced 
societies.  This  general  system  is  thought  to  be 
much  more  important  than  those  existing  within 
each  individual  institutional  order:  first,  because  it 
involves  all  members  of  a  society,  and  second, 
because  it  concerns  the  overall  living  conditions  of 
persons  and  groups.  The  roles  underlying  the  posi¬ 
tions  held  in  specific  systems  of  stratification  are 
usually  made  up  of  occupations.  This  is  all  the 
more  so  in  the  case  of  a  general  stratification  sys¬ 
tem.  Obviously,  occupation  is  not  the  sole  feature 
conditioning  the  allocation  of  persons  and  groups 
in  specific  and  general  systems  of  stratification. 
Gender,  generation,  race,  ethnicity,  educational 
credentials,  level  of  technical  skills,  type  of  social 
competencies,  and  so  on  can  operate  as  factors 
generating  several  specific  forms  of  inequality  and 
hence  influence  the  positions  of  individuals  in  the 
relevant  stratification  systems.  Yet  the  vast  major¬ 
ity  of  social  scientists  maintain  that,  in  advanced 
societies,  several  inequalities — and  hence  the  over¬ 
all  social  positions  of  persons — mainly  depend  on 
the  occupation  that  they  (or  the  members  of  their 
household)  perform.  Indeed,  crucial  aspects  of  the 
disparities  observed  in  people’s  living  conditions — 
such  as  income,  consumption,  prestige,  health, 
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psychological  well-being — are  deeply  affected  by 
occupations.  As  a  consequence,  all  the  schemes 
developed  by  social  scientists  during  the  past 
50  years  to  represent  the  main  features  of  general 
stratification  systems  have  used  occupations  as 
their  observation  units. 

Despite  this  basic  similarity,  the  stratification 
schemes  and  measures  developed  by  social  scien¬ 
tists  differ  substantially.  A  fundamental  distinction 
can  be  made  depending  on  which  aspect  of  social 
inequality — distributional  or  relational — is  the 
main  concern.  Distributional  disparities  refer  to 
the  amount  of  a  privilege,  or  a  set  of  privileges, 
enjoyed  by  different  groups  of  people.  Relational 
inequalities  concern  power  disparities  occurring 
between  these  groups  and  their  capacity  both  to 
condition  the  conduct  of  other  groups  and  to  suc¬ 
cessfully  fulfill  their  interests  and  choices.  These 
two  aspects  of  inequality  are  connected.  Power  is  a 
means  to  achieve  social  advantages,  while  advan¬ 
tages  can  be  used  to  achieve  power.  However, 
precisely  because  they  are  conceptually  distin¬ 
guished,  the  distributional  and  relational  aspects 
of  social  inequalities  can  be  studied  separately. 

Social  Strata  and  Occupational 
Stratification  Scales 

Some  scholars  focus  on  the  distributional  aspect, 
arguing  that  it  is  more  directly  linked  to  the  dis¬ 
parities  among  the  living  conditions  of  individuals 
and  groups.  As  a  consequence,  they  maintain  that 
general  stratification  systems  are  made  up  of 
strata.  Indeed,  a  social  stratum  can  be  defined  as  a 
set  of  individuals  and  families  who  share  similar 
living  conditions  because  they  enjoy  similar 
amounts  of  one  or  more  advantages.  Strata  form  a 
linear  hierarchy  in  which  each  of  them  (except  the 
highest  and  the  lowest)  is  adjacent  to  two  other 
strata:  one  standing  above  and  one  standing  below. 
The  reason  why  strata  form  a  completely  ordered 
hierarchy  is  that  privileges  are  gradational  proper¬ 
ties.  For  instance,  it  may  happen  that  the  poorest 
family  in  a  country  does  not  possess  anything.  But 
it  is  not  true  that  the  wealthiest  family  possesses 
the  entire  national  wealth.  It  possesses  only  a  part 
of  it.  Moreover,  between  the  poorest  and  the 
wealthiest  families  lie  numerous  other  families  who 
own  intermediate  amounts  of  wealth.  In  principle, 
one  may  say  that  each  individual  person  or  family 


possesses  a  different  amount  of  wealth  and  hence 
that  the  number  of  strata  identifiable  on  the  basis 
of  the  distribution  of  this  characteristic  is  virtually 
infinite.  Yet  social  scientists  tend  to  identify  a  dis¬ 
crete  and  reasonably  small  number  of  positions  by 
grouping  together  persons  and  families  with  simi¬ 
lar  amounts  of  privilege(s)  and  hence  rather  similar 
living  standards. 

Usually,  individuals  and  families  are  grouped  in 
each  social  stratum  by  means  of  statistical  proce¬ 
dures  that  lead  to  the  specification  of  stratification 
scales,  where  each  stratum  is  given  a  specific  score 
(and  hence  a  rank).  In  their  turn,  the  scores  are 
intended  to  measure  the  entire  range  of  inequality 
underlying  the  distribution  of  one  or  more  privi¬ 
leges,  to  define  the  distances  (in  terms  of  strength 
of  inequality)  between  the  various  strata,  and  to 
express  the  specific  position  occupied  by  a  given 
stratum  in  a  stratification  system.  Social  scientists 
have  developed  stratification  scales  of  various 
kinds.  A  first  distinction  differentiates  between 
analytic  and  synthetic  scales.  The  former  refer  to 
just  one  privilege,  whereas  the  latter  pay  attention 
to  (more  or  less  formally  specified)  combinations 
of  several  privileges.  Analytic  scales  mainly  regard 
income  and  prestige.  Income  scales  are  intended  to 
identify  the  economic  disparities  among  socio- 
occupational  strata,  while  those  based  on  prestige 
(in  the  strict  meaning  of  the  term)  are  intended  to 
identify  status  groups  and  the  amount  of  honor, 
deference,  respect,  and  social  consideration  they 
receive.  Synthetic  scales  intend  to  measure  the 
overall  social  standing  of  socio-occupational  strata 
by  combining  two  or  more  privileges — for  instance, 
income  level,  amount  of  social  prestige,  intensity 
of  psychological  gratification,  degree  of  autonomy 
in  performing  job  tasks — typically  associated  with 
the  occupations  belonging  to  a  stratum. 

Besides  their  synthetic  or  analytic  character, 
socio-occupational  scales  can  be  distinguished  on 
the  basis  of  the  information  used  in  their  construc¬ 
tion.  Objective  scales  rely  on  data  sets  recording 
factual  properties  of  occupations  (e.g.,  the  amount 
of  salary  or  wages,  the  intensity  of  unemployment 
risks)  or  their  incumbents  (e.g.,  education  level, 
technical  skills,  the  occupations  of  friends  and 
spouses).  Subjective  or  reputational  scales  are 
based  on  surveys  that  collect  popular  evaluations 
of  the  overall  social  standing  of  occupations.  It  is  to 
be  noted  that,  despite  their  subjective  foundations 
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and  the  considerable  changes  that  the  occupational 
structures  of  advanced  societies  have  undergone, 
these  scales  prove  to  be  rather  stable  over  time  and 
across  countries.  This  indicates  that  the  mechanisms 
underlying  the  generation  of  socio-occupational 
strata  are  similar  in  most  countries  with  a  market 
economy  and  a  democratic  political  regime  and  that 
the  new  occupations  produced  by  technological 
progress  do  not  alter  the  basic  features  of  individual 
strata  and  the  related  stratification  system. 

Social  Classes  and  Class  Schemes 

Class  schemes  furnish  representations  of  social 
stratification  alternative  to  those  based  on  socio- 
occupational  strata  and  scales.  A  class  can  be 
defined  as  a  social  category  made  up  of  individuals 
and  families  who  possess  the  same  power  assets 
and  hence  hold  the  same  positions  in  the  overall 
system  of  social  relations  of  domination  and  sub¬ 
ordination  existing  in  a  society.  The  reason  why 
some  scholars  prefer  to  represent  social  stratifica¬ 
tion  in  terms  of  class  schemes  is  quite  simple.  They 
argue  that  distributive  inequalities  ultimately 
depend  on  relational  disparities. 

Most  class  schemes  envisage  stratification  struc¬ 
tures  based  on  a  rather  small  number  (5-15)  of 
different  positions — in  other  words,  different 
classes.  This  is  so  because  power  resources  (or 
effective  combinations  of  some  of  them)  are  rather 
scarce  and  the  control  exerted  over  each  of  them 
by  individual  classes  is  either  (almost)  complete  or 
(almost)  nonexistent.  Indeed,  if  a  power  resource 
were  gradually  distributed  between  classes,  no 
class  could  dominate  another.  As  classes  are  rooted 
in  relational  inequalities,  they  do  not  necessarily 
form  a  linear  hierarchical  order.  It  may  happen 
that  two  or  more  classes,  controlling  different 
power  resources,  stand  in  a  position  of  reciprocal 
equilibrium.  Nonetheless,  classes  give  rise  to  a  par¬ 
tially  ordered  social  hierarchy  in  which  higher 
classes  dominate  the  remaining  ones  and  in  which 
middle  classes  are  dominated  by  higher  ones  but  in 
their  turn  dominate  lower  classes. 

Because  the  amount  of  the  power  assets  con¬ 
trolled  by  members  of  a  given  class  can  vary,  as 
well  as  their  ability  and  opportunities  to  exploit  it, 
classes  are  internally  stratified.  However,  accord¬ 
ing  to  scholars  who  adopt  a  class  perspective,  the 
inequalities  in  living  standards  that  may  occur 


within  classes  are  markedly  smaller  than  those 
observed  between  classes.  Despite  the  unavoidable 
internal  heterogeneity  of  classes,  the  boundaries 
among  them  are  far  less  conventional  than  those 
drawn  among  strata.  Indeed,  it  is  the  control,  or 
the  lack  of  control,  over  power  resources,  rather 
than  the  decisions  of  scholars,  that  automatically 
define  the  class  positions  and  the  class  member¬ 
ships  of  individuals  and  groups. 

Most  class  analysts  maintain  that,  in  advanced 
societies,  the  process  of  class  formation  is  mainly 
influenced  by  power  relations  in  the  economic 
sphere.  In  their  turn,  these  power  relations  depend 
on  the  power  assets  controlled  by  the  members  of 
individual  classes.  Four  types  of  power  assets  are 
usually  identified: 

1.  means  of  production, 

2.  educational  qualifications  and  technical  skills, 

3.  labor  power,  and 

4.  control  over  organizations  exerted  by  top  and 
middle  managers  and  civil  servants. 

Power  resources,  however,  are  not  equally  effec¬ 
tive.  Control  over  the  means  of  production  and 
organizations  furnishes  greater  power  than  that 
afforded  by  the  possession  of  educational  creden¬ 
tials  or  technical  expertise,  and  all  of  these  guaran¬ 
tee  greater  influence  than  that  furnished  by  simple 
labor  power.  It  is  essentially  this  inequality  of 
effectiveness  that  engenders  the  specific  relations 
between  classes.  Power  assets  can  be  combined, 
and  the  social  position  of  some  classes  is  defined 
precisely  by  their  combinations.  This  is  typically 
the  case  of  self-employed  workers  owning  very 
small  firms  and  frequently  labeled  the  petty  bour¬ 
geoisie  (shopkeepers,  plumbers,  electricians, 
mechanics,  stockbreeders,  vinedressers,  etc.). 
Indeed,  they  possess  their  own  means  of  produc¬ 
tion,  have  specific  technical  expertise,  and  directly 
contribute  with  their  own  labor  to  the  operation  of 
their  small  firms.  A  similar  case  can  be  made  for 
self-employed  professionals  (lawyers,  financial 
consultants,  architects,  and  medical  doctors).  Their 
class  position  is  based  on  the  possession  of  both 
high  educational  credentials  and  means  of  produc¬ 
tion.  Obviously,  numerous  people  are  equipped 
with  high  educational  credentials  or  technical 
skills,  and  everybody  is  endowed  with  his  or  her 
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own  labor.  But  in  many  cases,  these  assets  are 
intangible  and  can  be  ignored.  The  class  positions 
of  entrepreneurs  and  chief  executives  entirely 
depend  on  their  control  of  means  of  production. 
Indeed,  an  entrepreneur  remains  an  entrepreneur 
even  though  she  or  he  may  be  in  poor  health  and 
does  not  possess  any  educational  qualification. 

As  in  the  case  of  socio-occupational  stratifica¬ 
tion  scales,  social  scientists  have  developed  several 
definitions  of  classes  and  numerous  class  schemes. 
However,  those  linked  to  the  Marxist  and 
Weberian  traditions  are  still  the  most  influential. 
Neo-Marxist  scholars  maintain  that  classes  are 
rooted  in  the  social  relations  of  production  and  the 
related  processes  of  exploitation.  However,  they 
acknowledge  that  the  social  stratification  of 
advanced  societies  is  much  more  complex  than 
envisaged  by  Marx  in  the  mid-  19th  century.  They 
state  that,  besides  the  bourgeoisie  and  the  prole¬ 
tariat,  there  are  classes  whose  members  are  simul¬ 
taneously  exploited  by  the  owners  of  means  of 
production  and  are  exploiters  of  the  working 
class.  These  middle  classes  are  distinguished 
according  to  the  amount  of  educational  and  pro¬ 
fessional  credentials  possessed  by  their  members 
and  the  control  that  they  can  exert  over  the  orga¬ 
nizational  arrangements  of  firms  and  public 
bureaucracies.  Moreover,  neo-Marxist  authors 
maintain  that  the  owners  of  means  of  production 
do  not  form  an  entirely  homogeneous  class,  and 
they  split  them  among  the  bourgeoisie  (i.e.,  entre¬ 
preneurs  and  chief  executive  officers  of  large-  and 
medium-sized  firms),  small  entrepreneurs  (owners 
of  firms  with  a  small  number  of  employees),  and 
the  petty  bourgeoisie  (i.e.,  the  self-employed  with 
at  most  one  or  two  employees). 

Authors  who  adopt  the  neo-Weberian  perspec¬ 
tive  draw  up  their  class  schemes  by  considering 
both  the  work  and  the  market  situation  of  indi¬ 
viduals  and  groups.  In  their  opinion,  the  process  of 
class  formation  does  not  revolve  solely  around  the 
social  division  of  labor;  it  also  revolves  around  the 
economic  life  chances  of  individuals  and  groups.  In 
their  more  recent  developments,  neo-Weberian 
representations  of  class  structure  first  separate 
employers  from  the  self-employed  and  employees 
and  then  group  employers  into  three  classes:  large 
employers,  small  employers  not  in  agriculture,  and 
small  employers  in  agriculture.  The  same  sectoral 
distinction  is  performed  for  the  self-employed. 


Finally,  employees  are  allocated  to  different  classes 
mainly  according  to  the  employment  relations 
between  them  and  their  employers.  Two  main 
dimensions  underlie  the  forms  of  employment  con¬ 
tracts  usually  available  in  the  labor  markets  of 
advanced  societies:  the  (high  or  low)  specificity  of 
the  human  assets  possessed  by  employees — that  is, 
their  level  and  kind  of  technical  and  social  abili¬ 
ties — and  the  (high  or  low)  difficulty  of  monitoring 
their  work  by  employers.  The  combination  of 
these  two  dimensions  produces  three  basic  types  of 
contract:  (1)  service  relationship,  (2)  labor  con¬ 
tract,  and  (3)  mixed  forms  of  service  relation  and 
labor  contract.  These  contractual  categories  are 
then  internally  subdivided  according  to  the  usual 
organizational  roles  of  the  relevant  employees  and/ 
or  the  economic  sector  (industry  and  services  vs. 
agriculture)  in  which  they  work. 

Recent  Developments 

In  recent  years,  the  two  standard  approaches  to 
the  study  of  social  stratification  have  progres¬ 
sively  converged,  and  currently,  several  lines  of 
empirical  inquiry  on  social  inequalities  are  carried 
out  using  class  schemes  or  socio-occupational 
scales  indifferently.  This  convergence  has  been 
produced  by  the  increasingly  detailed  information 
about  interviewees’  occupations  collected  by 
socioeconomic  surveys,  the  progress  achieved  in 
the  statistical  techniques  used  for  the  relevant 
analyses,  and  the  conviction  that  both  ways  to 
express  social  positions  of  individuals  and  groups 
can  be  useful  in  clarifying  the  effects  of  these  posi¬ 
tions  on  specific  inequalities  and  the  mechanisms 
underlying  them. 

The  convergence  of  the  class  and  stratum  per¬ 
spectives  has  not  increased  their  popularity  among 
social  scientists,  however.  On  the  contrary,  both 
have  been  challenged  by  authors  who  maintain  that 
strata  and  classes  are  disappearing,  or  have  already 
disappeared,  from  contemporary  advanced  societ¬ 
ies.  In  the  opinion  of  such  authors,  social  inequali¬ 
ties  are  becoming  increasingly  fragmented  on  an 
individual  basis.  Two  main  arguments  are  put  for¬ 
ward  to  support  this  thesis.  First,  advanced  societies 
have  undergone  a  long  process  of  institutionaliza¬ 
tion  of  individualism:  that  is,  a  process  that  places 
personal  rights  and  personal  independence  at  the 
center  of  cultural,  political,  economic,  and  juridical 
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arrangements.  As  a  consequence,  collective  entities, 
such  as  professional  associations,  local  communi¬ 
ties,  classes,  strata,  churches,  and  even  families,  are 
increasingly  less  able  to  shape  the  life  trajectories 
and  destinations  of  individuals.  Second,  the  global¬ 
ization  of  the  economy  exposes  everybody,  no 
matter  how  privileged  in  their  current  social  posi¬ 
tions,  to  increasing  risks  of  suddenly  lapsing  into 
unemployment,  financial  hardship,  poverty,  mul¬ 
tiple  deprivation,  and  similar  distressful  situations. 

Social  stratification  scholars  react  to  the  thesis 
of  the  individualization  of  social  inequalities  by 
maintaining  that  it  has  not  yet  received  convincing 
empirical  support.  These  scholars  admit  that 
advanced  societies  are  experiencing  a  secular  trend 
toward  emancipation  of  individuals  from  the  strict 
social  control  exerted  in  the  past  by  different  com¬ 
munities.  But  they  also  point  out  that  a  large  body 
of  empirical  analysis  has  shown  that  several  differ¬ 
ent  expressions  of  crucial  inequalities — such  as 
those  regarding  educational  opportunities,  inter- 
generational  mobility  and  career  chances,  risks  of 
unemployment,  level  of  income  and  wealth,  and 
health  conditions — and  even  the  mating  selection 
process  are  still  linked  to  the  class  and  stratum  of 
origin  and  current  belonging.  Authors  supporting 
the  thesis  of  the  crystallization  of  inequalities 
around  classes  and  strata  recognize  that  the  latter 
are  less  socially  visible  than  they  were  at  the  begin¬ 
ning  of  the  industrialization  process.  Moreover, 
they  acknowledge  that  contemporary  advanced 
societies  are  becoming  somewhat  more  socially 
fluid  and  open.  But  they  stress  that  this  movement 
toward  greater  social  fairness  is  feeble  and  slow,  so 
that  most  of  these  societies  still  exhibit  highly 
effective  processes  of  social  closure  structured 
around  strata  and  classes. 

Indeed,  as  shown  by  the  experience  of  the 
Nordic  countries,  only  systematic  and  long-lasting 
policies  aimed  at  increasing  levels  of  social  equality 
can  guarantee  a  stable  reduction  of  social  dispari¬ 
ties.  In  the  past  10  years,  however,  these  policies 
have  been  weakened  everywhere  by  both  the 
increasing  popularity  of  market-oriented  economic 
thought  and  the  more  recent  negative  effects  of 
economic  recession. 
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Socialism 


By  the  final  third  of  the  19th  century,  socialism 
had  emerged  in  Europe,  both  as  a  critique  of  lib¬ 
eralism  and  as  the  only  comprehensive  alternative 
to  it.  In  this  sense,  socialism  was  an  intellectual 
reckoning  with  the  world  created  by  the  British 
industrial  revolution  and  the  French  Revolution  of 
1789,  and  it  was  initially  confined  to  dissident 
groups  within  these  two  countries.  But  like  its 
main  ideological  rival,  socialism  came  in  many 
competing,  dynamic  varieties.  Uniquely,  socialism 
in  all  of  these  varieties  was  concerned  with  under¬ 
standing  the  world  both  as  it  is  and  as  it  might  be 
or  would  become.  As  such,  it  represented  an  alter¬ 
native  socioeconomic  system  to  capitalism  based 
on  principles  of  egalitarianism  and  collectivism. 
The  main  currents  of  socialism  also  became  cham¬ 
pions  of  democracy. 

Origins  and  Early  Development 

Socialists  and  liberals  drew  on  a  common  stock  of 
Enlightenment  assumptions.  In  this  view,  human¬ 
kind  had  struggled  successfully  to  control  nature 
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by  the  application  of  reason.  History  was  an 
ascent,  though  one  in  which  irrationalist  forces 
had  obstructed  the  march  of  progress  and  were  yet 
to  be  swept  away  completely.  Reason  applied  to 
human  affairs  could  in  principle  produce  the  same 
advances  as  science  had  already  made  possible  in 
the  effort  to  understand  and  subdue  nature.  The 
social  world  was  the  work  of  humanity,  and 
human  beings  could  understand  its  evolution  and 
promote  their  self-improvement  in  the  process. 

Radical  and  dissident  opinion  within  the 
Enlightenment  pointed  beyond  mainstream  liberal¬ 
ism  and  inspired  future  socialists.  Thomas  Paine, 
for  example,  had  shown  the  imagination  and 
audacity  to  advocate  old  age  pensions,  land  nation¬ 
alization,  more  liberal  marriage  and  divorce  laws, 
family  allowances,  free  education,  and  a  host  of 
other  reforms  that  became  part  of  the  social- 
democratic  program  in  the  20th  century.  Jean- 
Jacques  Rousseau  contradicted  his  Enlightenment 
peers  in  the  mid-18th  century  by  identifying  civil 
society  (and  the  division  of  labor)  as  the  source  of 
inequality,  which  the  political  superstructure 
merely  legitimated  and  protected.  He  also  stressed 
the  iniquities  of  private  property,  the  multiplicity 
of  artificial  wants  and  the  dependencies  they  gen¬ 
erated,  and  the  growth  of  invidious  distinction, 
base  rivalry,  and  competition  that  emanated  from 
the  same  quarter.  Where  mainstream  liberalism 
saw  only  progress,  Rousseau  saw  also  “an  assem¬ 
bly  of  artificial  men  and  factitious  passions.” 

Rousseau  blamed  these  discontents  on  the 
whole  course  of  civilization,  whereas  the  socialists 
accepted  that  industrialism,  its  most  recent  phase, 
was  here  to  stay.  It  was  capitalism  that  would  have 
to  be  reformed  or  overthrown.  For  large  sections 
of  the  population,  the  first  industrialization  (in 
Britain)  was  experienced  as  a  social  and  economic 
catastrophe — the  destroyer  of  community,  family, 
environment,  and  the  quality  of  life  in  general.  The 
motivating  force  behind  this  “great  transforma¬ 
tion”  was  individual  profit.  The  socialists  did  not 
deny  that  wealth  and  technological  advances  were 
associated  with  this  change,  but  they  were  also 
concerned  with  understanding  and  overcoming  its 
destructive  and  divisive  features.  A  new  language 
was  developed.  It  gave  us  terms  such  as  working 
class,  industrialism,  capitalism,  and  middle  class. 
In  November  1827,  the  word  socialism  appeared 
in  print  for  the  first  time  in  the  Co-operative 


Magazine  in  Britain.  The  argument  was  now  set 
forth  that  the  problems  of  the  new  society  were  not 
attributable  to  industry  as  such  but  to  capitalism  in 
particular. 

Robert  Owen,  himself  a  factory  owner,  had  for 
10  years  previously  argued  that  the  source  of  prob¬ 
lems  such  as  poverty  lay  in  unrestrained  competi¬ 
tion.  The  answer  lay  in  the  principle  of  cooperation. 
Owen  was  a  firm  believer  in  social  engineering; 
human  beings  would  improve  as  their  environments 
were  altered  to  induce  desired  changes  in  behavior, 
health,  morals,  educational  attainment,  and  so  on. 
He  thus  believed  that  the  suffering  created  by  capi¬ 
talism  was  avoidable.  He  changed  his  factory 
regime  at  New  Lanark  in  Scotland  to  demonstrate 
in  practice  that  pauperism,  squalid  housing,  chronic 
ill  health,  and  unemployment  were  unnecessary 
evils.  In  1813,  he  published  an  account  of  these 
changes  in  A  New  View  of  Society.  At  first,  Owen 
argued  that  profits  would  improve  rather  than  be 
damaged  by  such  enlightened  reforms.  Later,  as  his 
ideas  developed,  he  denounced  property,  religion, 
and  the  family — forfeiting  his  respectability  in  elite 
circles  in  the  process.  He  turned  to  funding  egalitar¬ 
ian  and  communitarian  experiments  in  the  United 
States,  such  as  the  one  at  New  Harmony  in  Indiana, 
begun  in  1826.  After  the  Great  Reform  Act  of  1832 
failed  to  enfranchise  the  working  class  in  Britain, 
Owen,  who  had  returned  to  London,  associated 
himself  with  both  the  cooperative  movement  and 
the  creation  of  the  Grand  National  Consolidated 
Trade  Union.  His  followers  in  Britain  strengthened 
their  critique  of  capitalist  exploitation  by  develop¬ 
ing  the  argument  that  labor  was  the  source  of  all 
wealth — a  doctrine  inspired  by  the  labor  theory  of 
value  expounded  by  David  Ricardo  in  Principles  of 
Political  Economy  and  Taxation  (1817). 

Meanwhile,  in  France,  the  radical  reconstruc¬ 
tion  of  society  was  an  ambition  promoted  and 
legitimated  by  the  revolution  of  1789.  The  princi¬ 
ples  of  this  revolution — expressed  in  the  name  of 
“the  people” — included  the  “rights  of  man”  and 
“liberty,  equality,  and  fraternity.”  These  ideas  had 
international  resonance  and  were  opposed  interna¬ 
tionally  by  the  counterrevolution  of  the  aristocracy 
and  monarchy.  France  was  involved  in  a  virtually 
continuous  war  between  1792  and  1815.  In  1793, 
the  most  radical  wing  of  the  revolutionary  leader¬ 
ship,  the  Jacobins,  seized  power  in  Paris  in  circum¬ 
stances  of  French  military  reversals.  The  Jacobin 
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dictatorship  itself  was  overthrown  a  year  later,  and 
a  process  was  begun  that  promoted  the  army  and 
the  person  of  Napoleon  Bonaparte.  The  most 
extreme  radicals  in  the  Jacobin  milieu  began  to 
conclude  that  the  revolutionary  dynamic  had  been 
stopped  at  the  point  where  it  benefited  only  the 
bourgeoisie.  They  now  formulated  demands  of 
economic  as  well  as  political  equality.  This  Society 
of  Equals,  a  Parisian  political  club  led  by  Gracchus 
Babeuf,  was  suppressed  by  Bonaparte  himself.  But 
before  Babeuf  and  the  leaders  were  executed  in 
May  1797,  the  Society  of  Equals  had  laid  the  foun¬ 
dations  of  a  communist  political  current  stressing 
abolition  of  private  property  in  land,  the  need  for 
social  planning,  the  provision  of  state-supplied 
basic  necessities  to  ensure  a  national  minimum 
standard  of  life,  communal  living  arrangements, 
the  universal  obligation  to  work,  and  the  abolition 
of  divisions  of  rich  and  poor. 

After  Napoleon’s  final  defeat,  the  principal 
object  of  the  victors  at  the  Congress  of  Vienna  in 
1815  was  to  suppress  “the  Revolution.”  But  the 
Jacobin  and  Babouvist  traditions  were  kept  alive  in 
France  by  men  such  as  Filippo  Buonarroti  (1761— 
1837),  Etienne  Cabet  (1788-1856),  and  Louis- 
Auguste  Blanqui  (1805-1881),  who  stressed  the 
need  for  a  revolutionary  dictatorship  to  complete 
the  unfinished  business  of  1789  in  the  creation  of 
a  new  social  order  based  on  a  radical  egalitarian¬ 
ism.  These  doctrines  commanded  support  within 
the  urban  working  class,  as  the  insurrections  of 
1830  to  1848  in  cities  such  as  Lyons  and  Paris 
showed.  The  secret  societies  of  the  French  com¬ 
munist  tradition,  though  they  envisaged  a  world 
without  class  distinctions,  were  less  successful  in 
elaborating  theoretical  alternatives  to  bourgeois 
society.  This  was  better  accomplished  by  socialist 
reformers  such  as  Charles  Fourier  (1772-1837) 
and  the  Comte  de  Saint-Simon  (1760-1825),  who 
separately  elaborated  detailed  schemes — many  of 
them  frankly  fantastic — and  seminal  ideas  that 
others  (including  Karl  Marx  and  Auguste  Comte) 
would  take  forward  in  different  ways. 

Saint-Simon  and  his  followers  believed  that 
they  were  developing  a  science  of  society  that  dis¬ 
closed  the  centrality  of  production  and  consump¬ 
tion.  They  argued  that  scientists  and  producers 
must  be  at  the  top  of  any  rational  hierarchy  of 
merit  and  should  play  a  major  role  in  the  govern¬ 
ment  of  the  future,  committed  as  it  would  be  to 


the  scientifically  planned  improvement  of  human¬ 
ity.  Saint-Simon’s  vision  of  progress  imagined  such 
elites — including  the  propertied  elite — exercising 
an  enlightened  self-interest  by  solving  conflicts  of 
interest  with  the  improvement  of  the  poor  in 
mind.  This  was  the  way  to  avoid  violence  and 
increase  the  security  and  prosperity  of  the  proper¬ 
tied,  as  well  as  to  promote  the  good  of  society  as 
a  whole.  In  elaborating  his  doctrines,  Saint-Simon 
parted  company  with  liberal  individualism, 
embraced  feminism  and  pacifism,  and  imagined 
pan-European  structures  of  cooperation.  He  also 
inaugurated  forms  of  Christian  socialism  (in  his 
Nouveau  Christianism) .  His  ideas  spread  across 
Europe  in  the  years  up  to  1848,  thanks  to  the 
work  of  followers  such  as  Barthelemy  Enfantin, 
Amand  Bazard,  and  Pierre  Leroux,  who  system¬ 
atized  the  chaotic  original. 

Charles  Fourier’s  (1772-1837)  thoughts  were 
even  more  detailed  and  confused  than  Saint- 
Simon’s  but  no  less  original  and  provocative. 
Fourier  championed  feminism  and  cooperation  as 
keys  to,  and  measures  of,  social  progress.  He  pro¬ 
posed  and  set  up  model  communities,  which  he 
called  phalanstery,  based  on  assigning  individuals 
to  jobs  for  which  they  were  suited.  In  describing 
these  communities,  Fourier  ignored  no  detail  of 
social  organization.  But  his  big  ideas  centered  on 
the  relationship  between  environment,  education, 
and  character.  From  a  competitive,  selfish,  exploit¬ 
ative  system  could  come  only  greedy,  dishonest 
people.  The  influence  of  Rousseau  is  evident,  as 
indeed  it  is  in  Saint-Simon’s  thinking.  Fourier’s 
best  known  follower,  Victor  Considerant  ( 1808— 
1893),  also  championed  the  “right  to  work”  and 
was  prominent  in  the  1848  revolution  in  Paris  as 
well  as  the  First  International  Working  Men’s 
Association,  established  in  1864. 

Saint-Simon,  Fourier,  and  Owen  were  known  to 
later  socialists  by  the  collective  name  given  to  them 
by  Marx  (1818-1883)  and  Friedrich  Engels — the 
Utopian  Socialists — largely  because  their  schemes 
depended  on  methods  of  persuasion,  education, 
and  example,  which  the  founders  of  “scientific 
socialism”  deemed  impractical.  Marx  and  Engels 
were  nevertheless  converted  to  socialism  through 
the  work  of  these  pioneers  and  their  German  fol¬ 
lowers  (e.g.,  Wilhelm  Weitling  and  Moses  Hess).  In 
1842,  an  observer  such  as  Lorenz  von  Stein  could 
associate  socialism  in  France  with  peaceful  reform 
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and  detailed  thinking  about  the  nature  of  reform, 
whereas  communism  signified,  according  to  the 
same  author,  a  revolutionary  rejection  of  existing 
society,  with  proletarian  roots,  stressing  the  need 
for  practical  actions  to  overthrow  the  status  quo. 
The  French  Revolution  had  bequeathed  both  the 
Jacobin  model  of  the  dictatorial,  insurrectionist 
vanguard — kept  alive  by  the  secret  societies — and 
the  ideal  of  a  republican  democracy.  Louis  Blanc 
(1811-1882)  became  the  best  known  exponent  of 
the  latter,  arguing  that  any  social  change  to  social¬ 
ism  had  to  be  democratic.  But  the  insurrectionary 
tradition  survived  not  the  least  because  the  urban 
working  class  was  everywhere  a  small  minority 
except  in  Britain,  and  it  was  far  from  clear  that  a 
“democratic  revolution,”  as  suggested  by  Blanc 
and  the  German  emigre  Moses  Hess,  was  possible. 
Even  in  Britain,  the  remarkable  campaign  of  the 
Chartists  (1837-1848)  for  male  suffrage  came  to 
nothing,  and  socialism  itself  disappeared  from  view 
in  that  country  until  its  revival  in  the  1880s.  From 
the  perspective  of  Europe  in  the  1840s — a  conti¬ 
nent  still  consisting  predominantly  of  peasants  and 
autocracy — socialism  as  a  reforming  movement  of 
the  working  class  was  a  nonstarter.  Yet  the  work  of 
Marx  and  Engels  made  this  very  connection. 

Marx  and  His  Legacy 

Marx  came  to  socialism  via  German  philosophy, 
especially  the  work  of  G.  W.  F.  Hegel  (1770-1831) 
and  the  critiques  of  Hegel  associated  with  Ludwig 
Feuerbach  and  the  Left  Hegelians.  He  shared 
Hegel’s  idealization  of  the  Greek  polis  and  the 
conviction  that  the  modern  world  was  scarred  by 
fragmentation  and  alienation.  He  also  adopted 
Hegel’s  dialectical  method  while  giving  it  a  materi¬ 
alist  foundation  not  found  in  Hegel’s  own  philoso¬ 
phy.  In  Paris  from  1843,  Marx  encountered  the 
proletariat  for  the  first  time  and  studied  the  works 
of  the  mutualist  socialist  Pierre-Joseph  Proudhon 
as  well  as  those  of  the  Utopians.  He  learned  more 
about  the  working  class  through  Engels’s  The 
Condition  of  the  Working  Class  in  England  in 
1844,  the  product  of  the  author’s  investigations  in 
Manchester  since  1842  and  his  knowledge  of 
Owenite,  Chartist,  and  trade  union  circles. 
Together,  Marx  and  Engels  were  active  in  German 
communist  emigre  circles  in  Brussels  between 
1845  and  1848,  and  it  was  here  that  they  were 


commissioned  to  write  a  manifesto  for  the  clandes¬ 
tine  Communist  League.  Their  involvement  in  the 
revolutions  that  swept  across  Europe  from  Paris  in 
1848  led  Marx  to  permanent  exile  in  London  a 
year  later.  He  devoted  his  studies  from  this  time  to 
the  critique  of  capitalist  political  economy,  culmi¬ 
nating  in  the  publication  of  the  first  volume  of  Das 
Kapital  in  1867. 

Marx’s  intellectual  revolution  effected  the  single 
most  important  development  of  socialist  thought. 
He  argued  that  the  economic  basis  of  society — its 
mode  of  production — determined  its  social  and 
political  structures.  On  this  foundation,  social  classes 
arise,  and  it  is  conflict  between  classes  that  generates 
historical  change.  In  capitalist  society,  the  basic 
classes  in  conflict  were  those  of  the  bourgeoisie — the 
owners  of  capital — and  the  proletariat.  Marx 
regarded  his  greatest  insight  as  the  theory  of  surplus 
value,  which  showed  the  way  in  which  capitalists 
exploited  their  workers.  Such  exploitation  was  an 
integral  structural  feature  of  capitalism  and  would 
persist  for  as  long  as  capitalism  existed.  But  the 
development  of  capitalism  would  lead  to  class 
polarization,  periodic  crises  of  overproduction,  and 
opportunities  for  the  emergence  of  proletarian  class 
consciousness.  Eventually,  when  objective  condi¬ 
tions  and  subjective  class  consciousness  were 
aligned,  the  proletariat — the  vast  majority  in  a 
mature  capitalist  economy — would  overthrow  the 
system  of  exploitation  and  the  political  institutions 
that  served  to  legitimate  and  defend  it.  This  is  how 
socialism  would  emerge  as  the  successor  to  capital¬ 
ism  in  the  march  of  human  progress. 

Most  of  what  Marx  and  Engels  wrote  remained 
unpublished  or  unread  outside  the  small  groups  of 
British  and  German  socialists  with  which  they  had 
contact  in  the  1840s,  1850s,  and  1860s.  During 
this  period,  their  thinking  moved  emphatically 
against  the  mentality  of  the  secret  societies  in  favor 
of  revolutionary  change  as  the  product  of  open, 
mass  struggles  arising  from  conditions  generated 
by  capitalism  itself.  When  the  First  International 
was  formed  in  1864,  with  Marx  as  a  foundation 
member,  it  brought  British  trade  unionists,  French 
Proudhonists,  German  Communists,  and  anarchist 
followers  of  Michael  Bakunin  together.  Marx 
failed  to  convert  the  trade  unionists  to  his  vision 
but  succeeded  in  marginalizing  the  anarchists, 
Blanquists,  and  Proudhonists,  even  at  the  cost  of 
relocating  the  International  to  New  York  City  in 


2452  Socialism 


1872  and  thereby  effectively  terminating  it.  But 
there  were  other,  more  important  indications  that 
Europe  was  moving  in  the  direction  Marx  expected, 
such  as  the  spread  of  industrial  capitalism,  the 
growth  of  trade  unionism  and  class  conflict,  and 
the  creation  of  nation-states  in  Italy  and  Germany, 
which  Marx  believed  would  pave  the  way  to  the 
formation  of  united  workers’  movements.  The 
German  Workers’  Association  had  already  been 
set  up  by  Ferdinand  Lassalle  in  1863  to  press  for 
universal  suffrage  and  socialism  through  the  exist¬ 
ing  state  machinery.  Marx’s  own  followers  were 
involved  with  those  of  Lassalle  in  creating  the 
Social  Democratic  Party  (SPD)  in  1875.  Throughout 
this  period  and  up  to  Engels’s  death  in  1895,  the 
“Marxists”  in  Germany  were  guided  by  the  doc¬ 
trine’s  founders,  such  that  the  SPD  as  a  whole 
could  be  called  a  Marxist  party  by  the  1880s.  After 
Marx’s  death  in  1883,  Engels  did  what  he  could  to 
systematize  and  codify  the  doctrine  for  the  widest 
dissemination,  establishing  contacts  throughout 
Europe  as  he  did  so,  watching  as  the  SPD  moved 
from  strength  to  strength  and  new  Marxist  parties 
were  created  across  the  continent,  coming  together 
in  a  new  federation  of  national  parties  in  1889  as 
the  Second  International. 

Socialism  nevertheless  lost  none  of  its  capacity 
for  disputation  and  dissent.  In  part,  this  was 
because  it  continued  to  display  a  dual  temporal 
existence,  being  both  an  account  of  the  world  as  it 
was  as  well  as  an  account  of  the  world  as  it  might 
be.  On  both  counts,  there  was  enormous  scope  for 
disagreement.  There  was  the  question  of  how  the 
transition  would  come.  Would  it  be  revolutionary 
or  by  incremental  reform,  would  the  transition  be 
peaceful  or  violent,  and  would  it  be  dependent  on 
skilled  leadership  or  the  result  of  blind  economic 
forces  or  some  combination  of  the  two?  One 
important  variable  in  providing  answers  to  these 
questions  was  national.  The  Marxist  parties  them¬ 
selves  were  noticeably  more  committed  to  an 
insurrectionist  reading  of  Marx  the  more  one  trav¬ 
elled  east  across  Europe  and  into  the  zones  of 
autocracy,  censorship,  and  political  intolerance.  In 
Russia,  Marxists  argued  among  themselves  about 
the  need  for  a  prior  bourgeois-democratic  revolu¬ 
tion  and  their  role  in  relation  to  it.  But  there  was 
no  argument  that  they  had  to  organize  clandes¬ 
tinely  and  that  any  political  work  that  they  did  was 
necessarily  illegal.  By  contrast,  socialism  in 


England,  when  it  came  back  to  life  in  the  1880s, 
developed  in  circumstances  of  emerging  parlia¬ 
mentary  democracy  and  a  strong  liberal  tradition 
that  accommodated  currents  sympathetic  to  state- 
sponsored  social  reform.  In  Germany,  the  most 
successful  socialist  party  of  them  all,  the  SPD,  was 
practically  divided  and  ideologically  challenged 
from  within  by  “revisionism”  as  the  century  came 
to  an  end.  Meanwhile,  in  the  United  States  of 
America,  it  was  noticed  in  the  first  decade  of  the 
20th  century  that  the  most  powerful  industrial 
nation  on  earth  had  failed  to  produce  a  socialist 
movement  of  corresponding  size  and  significance. 
Thus  began  enquiries  into  “American  exceptional- 
ism,  ”  but  even  in  the  heartland  of  socialism,  there 
were  well-developed  doubts  about  the  Marxist 
analysis  of  capitalism  and  its  inevitable  demise. 

In  theoretical  terms,  it  was  Eduard  Bernstein,  a 
protege  of  Engels  himself,  who  questioned  whether 
the  evidence  supported  Marx’s  vision  of  class 
polarization,  working  class  immiseration,  capital¬ 
ist  breakdown,  and  the  necessity  of  a  revolutionary 
rupture.  Instead,  he  pointed  to  the  persistence, 
even  growth,  of  multiple  class  strata,  the  need  for 
class  allies,  the  evidence  of  improvements  in  living 
standards,  and  the  prospect  of  further  gradual 
change  via  the  ballot  box.  In  practice,  sections  of 
the  SPD — notably  the  Reichstag  Fraktion  and  the 
trade  unions — were  already  home  to  strong  reform¬ 
ist  currents.  Though  the  “revisionism”  debate 
(1898-1903)  was  concluded  with  apparent  victory 
to  the  revolutionaries,  events  soon  showed  how 
misleading  this  was.  In  August  1914,  the  major 
powers  of  Europe  went  to  war,  and  the  SPD  ini¬ 
tially  supported  Germany’s  war  effort  with  as 
much  enthusiasm  as  the  avowedly  reformist 
Labour  Party  supported  Britain’s.  The  immense 
strains  engendered  by  the  war  in  all  the  belligerent 
countries  eventually  brought  revolutionary 
Marxists  to  power  in  Russia  in  November  1917. 
Lenin’s  Bolsheviks  moved  quickly  to  establish  a 
rival  Third  (or  Communist)  International  in  1919, 
and  by  1921,  communist  parties  had  been  formed 
in  most  European  countries. 

The  Russian  Revolution  and  Its  Aftermath 

Ideologically,  the  consequences  of  World  War  I 
and  the  Bolshevik  seizure  of  power  in  Russia  were 
immense.  Lenin  was  able  to  argue  persuasively 
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that  the  war  signaled  the  beginning  of  “an  epoch 
of  wars,  civil  wars  and  revolutions”  in  which  bour¬ 
geois  democracy  would  wither  and  the  prospects 
for  meaningful  reforms  recede.  The  empirical  evi¬ 
dence  of  the  interwar  years — economic  disloca¬ 
tion,  the  world  crisis  sparked  by  the  Wall  Street 
Crash,  and  the  rise  of  fascism — did  not  contradict 
this  reasoning.  Nor  did  growing  conflict  in  the 
colonial  world,  for  which  the  Bolsheviks  claimed  a 
special  relationship  as  the  first  revolutionaries  to 
embrace  anti-imperialism  as  a  foundation  value 
and  permanent  project.  In  these  circumstances, 
Communists  were  persuaded  to  model  themselves 
on  Lenin’s  Bolsheviks.  A  party  once  justified  as  a 
necessary  adaptation  to  Tsarist  persecution  was 
now  the  model  for  all  Communists,  whether  in 
England  or  China.  The  Jacobin  tradition  was 
restored,  not  the  least  in  the  dictatorial  way  in 
which  the  Bolsheviks  purported  to  represent  the 
workers  and  peasants  of  the  old  Tsarist  Empire. 

World  interest  in  the  Union  of  Soviet  Socialist 
Republics  (USSR),  as  it  was  known  from  1922, 
massively  increased  after  1930  when  the  global 
capitalist  economic  crisis  coincided  with  the  appar¬ 
ently  successful  inauguration  of  Soviet  Five-Year 
Plans.  Socialism  became  increasingly  associated 
with  the  idea  of  state-centered  planning.  Soviet 
prestige  was  also  enhanced  by  its  role — real  and 
imagined — in  the  struggle  against  fascism  and 
Nazism,  culminating  in  the  expansion  of  the  Soviet 
sphere  as  far  west  as  central  Europe  in  1945. 
Thereafter,  the  Soviet  Union  was  perceived  as  a 
model  of  how  conscious  economic  planning  could 
transform  a  backward,  peasant  society  into  a  mod¬ 
ern,  industrialized,  technologically  advanced  state, 
free  of  foreign  domination.  Throughout  the  Third 
World,  socialism  and  the  Soviet  Union  exercised 
this  appeal  almost  until  the  final  dissolution  of  the 
USSR  in  1991.  Socialism  was  equated  with  mod¬ 
ernization,  social  progress,  and  various  forms  of 
unity  in  the  first  decades  after  World  War  II,  giving 
rise  to  claims  for  specific  forms  of  African,  Arab, 
Islamic,  and  Third  World  socialism.  In  postcolo¬ 
nial  Africa,  in  countries  marked  by  “poverty,  igno¬ 
rance,  disease  and  illiteracy,”  as  Kwame  Nkrumah 
itemized  the  legacy  of  imperialism,  leaders  arose 
such  as  Julius  Nyerere  of  Tanzania,  who  pointed 
to  precolonial  forms  of  collectivism  and  egalitari¬ 
anism  on  which  an  indigenous  socialism  could  be 
built.  There  was  no  disguising,  however,  that 


socialism  appealed  because  the  high  road  to  pros¬ 
perity  and  real  independence  was  thought  to  lie 
with  industrialization  and  rapid  modernization 
under  the  direction  of  the  Plan. 

In  the  advanced  capitalist  countries  of  Western 
Europe,  the  reformist  parties  that  regrouped  as  the 
Labour  and  Socialist  International  in  1923  were  led 
by  men  who  emphatically  denied  the  socialist  cre¬ 
dentials  of  the  Bolsheviks  precisely  because  of 
Lenin’s  violent  and  undemocratic  methods.  The 
dictatorship  of  Lenin’s  successor  Stalin  provided 
gruesome  support  for  their  insight.  But  it  was  not 
until  the  long  period  of  full  employment  and  pros¬ 
perity  following  World  War  II  that  they  could  boast 
a  coherent  alternative  to  Soviet  socialism.  This 
“Keynesian”  alternative  depended  on  the  belief  that 
democratically  elected  governments  could  steer 
economies  to  ensure  annual  economic  growth.  By 
this  means,  the  goals  of  socialism  could  be  obtained: 
full  employment;  an  egalitarian  distribution  of 
income,  wealth,  and  opportunity;  the  elimination  of 
poverty;  and  the  end  of  economic  crises  and  their 
destructive  consequences.  Social-democratic 
Sweden  was  the  shining  model  of  this  approach  as 
late  as  the  1970s,  though  the  left  wing  of  the  social- 
democratic  parties  continued  to  insist  on  the  need 
to  further  reduce  private  economic  power. 

The  Crisis  of  Socialism 

In  the  years  1970  to  2000,  socialism  in  all  its  vari¬ 
eties  entered  a  profound  crisis.  Economic  stagna¬ 
tion  and  authoritarianism  in  the  communist  world 
led  eventually  to  the  collapse  of  these  regimes  in 
Eastern  Europe  in  1989  and  finally  the  Soviet 
Union  itself  as  the  communist  leaders  failed  in 
their  attempts  to  reform  the  system  from  within. 
Socialism  as  an  alternative  system  lost  its  powers 
of  attraction  because  of  this  failure,  coming  as  it 
did  on  top  of  the  moral  calamity  of  communism, 
with  its  death  toll  of  millions,  which  some  social¬ 
ists  had  forecast  shortly  after  the  Bolshevik  seizure 
of  power  but  which  others  had  ignored  for  as  long 
as  they  could.  Meanwhile,  social  democracy 
entered  a  period  of  ideological  retreat  as  the 
Keynesian  approach  broke  down  in  the  1970s,  and 
the  future  of  the  welfare  state  was  called  into  ques¬ 
tion  by  long-term  changes  in  the  structure  of  soci¬ 
eties  and  markets.  The  term  globalization  was 
often  invoked  as  a  shorthand  for  these  subversions 
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of  the  old  social-democratic  paradigm.  Though 
radical  ideas  of  left-wing  provenance  continued  to 
develop  after  1970,  socialism  had  ceased  for  the 
foreseeable  future  to  provide  them  with  a  coherent 
vision  of  an  alternative  social  system. 
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Socialist  Parties 


Socialist  and  social-democratic  parties  are  the 
main  and  the  most  electorally  successful  party 
family  of  the  Left.  The  first  socialist  parties 
appeared  in  Europe  during  the  last  quarter  of  the 
19th  century  as  a  result  of  the  politicization  of  the 
class  conflicts  generated  by  the  Industrial 
Revolution.  The  formation  of  socialist  parties  was 
one  of  the  outcomes  of  the  cleavage  between 
capital  and  labor  and  of  the  political  mobilization 
of  the  working  class.  This  entry  discusses  the  ori¬ 
gins,  programs,  and  political  role  of  these  parties. 

Prior  to  World  War  I,  there  were  socialist  or 
social-democratic  parties  or  candidates  in  almost 
every  European  country  and  in  some  states  in 
other  regions  of  the  world  (Australia,  New 


Zealand,  the  United  States,  and  some  Latin 
American  countries).  In  some  cases,  they  were  part 
of  a  subculture  milieu  in  which  a  socialist  mass- 
party  organization  was  linked  to  a  trade  union  and 
to  a  network  of  ancillary  groups.  In  its  early  years, 
the  European  socialist  or  social-democratic  parties 
emphasized  some  core  ideas  concerning  the  trans¬ 
formation  of  capitalist  societies  into  more  egalitar¬ 
ian  ones  built  on  solidarity,  cooperation,  and 
social  justice.  These  parties,  many  of  them  Marxist, 
postulated  the  end  of  human  exploitation,  class 
divisions,  and  economic  inequalities.  The  1940s 
witnessed  the  expansion  of  socialist  parties  to 
places  distant  from  their  European  origins,  as 
some  anticolonial  organizations  adopted  socialism 
as  their  ideology  in  Asia  and  Africa.  However,  this 
phenomenon  resulted  in  the  formation  of  parties 
that — from  the  ideological  point  of  view — could 
only  partially  be  considered  socialist  given  their 
combination  of  a  socialist  ideology  with  anticolo¬ 
nial  and  nationalist  ideas. 

The  socialist  movement  experienced,  almost 
since  its  birth,  divisions  over  its  program  and  poli¬ 
cies  for  the  transformation  of  capitalism  (e.g., 
concerning  the  role  of  the  state  in  the  management 
of  the  economy)  and  over  the  best  strategies  to 
achieve  it  (e.g.,  divisions  between  reformists  and 
revolutionaries).  These  debates  among  parties  and 
leaders  were  already  present  in  the  First  (1864) 
and  Second  (1889)  International  Working  Men’s 
Associations.  In  his  analysis  of  electoral  socialism, 
Adam  Przeworski  argues  that  the  early  socialists 
confronted  three  choices:  (1)  whether  to  achieve 
socialist  goals  from  within  or  from  outside  the 
political  institutions  of  capitalism,  (2)  whether  to 
trust  exclusively  in  the  working  class  as  the  agent 
of  socialist  change  or  whether  to  appeal  to  other 
social  classes  as  well,  and  (3)  whether  to  transform 
society  through  partial  reforms  or  to  devote  all 
efforts  to  the  final  abolition  of  capitalism.  The 
responses  to  these  three  choices  caused  internal 
strife,  shaped  the  evolution  of  the  socialist  parties, 
and  contributed  to  differentiate  them  from  other 
left-wing  organizations. 

The  socialists  decided  to  participate  in  elections. 
In  doing  so,  socialist  parties  tried  to  take  advantage 
of  the  political  rights  that  capitalist  societies  accorded 
to  the  workers  as  a  means  to  achieve  improvements 
in  their  life  conditions.  This  option  clearly  differenti¬ 
ated  socialist  parties  from  “abstentionist”  streams 
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of  the  workers’  movement,  such  as  the  anarchists. 
Despite  the  acceptance  of  the  electoral  process  to 
win  political  power,  within  the  socialist  parties, 
there  was  a  certain  ambivalence  about  the  need  to 
eventually  use  revolutionary  means  to  conquer 
political  power  or  to  defend  it  once  attained  by 
peaceful  means.  However,  democracy  eventually 
became  not  just  the  main  path  to  socialism  but  the 
parties’  very  goal.  In  a  certain  sense,  the  ideal 
defended  by  socialist  parties  consisted  in  the  exten¬ 
sion  of  democratic  political  principles  to  the  eco¬ 
nomic  and  social  domains.  The  abandonment  of 
revolutionary  roads  to  gain  power  differentiated 
socialist  and  social-democratic  parties  from  other 
groups  of  socialist  and  Marxist  origin,  such  as  the 
Communists. 

Socialist  parties  originally  opted  for  the  elec¬ 
toral  route  to  obtaining  improvements  in  the  living 
conditions  of  their  electorate  and  to  extend  its 
influence  among  working  class  voters,  but  they 
were  also  convinced  that  this  path  would  give 
them  political  power  relatively  easily.  They  believed 
that  they  would  win  elections  in  the  short  to 
medium  term,  given  the  unstoppable  growth  of  the 
working  class  inherent  in  capitalism.  However,  the 
working  class  did  not  grow  to  become  the  social 
majority  in  most  capitalist  societies,  and  in  any 
case,  it  never  voted  as  a  single  cohesive  electoral 
bloc  as  socialist  leaders  and  intellectuals  had 
hoped  for.  Under  these  circumstances,  the  socialist 
parties  were  confronted  with  the  decision  either  to 
appeal  to  other  social  groups,  if  they  wanted  to 
maximize  their  votes  and  their  possibilities  to  form 
governments  in  Western  countries,  or  to  remain  a 
minority  party  appealing  to  an  exclusively  work¬ 
ing  class  constituency.  They  decided  to  build  a 
multiclass  electoral  coalition  that  reflected  the  evo¬ 
lution  and  differentiation  of  the  social  structure  of 
capitalist  societies.  This  wider  electoral  coalition — 
at  least  in  class  terms — blurred  many  socialist 
goals.  In  any  case,  the  socialist  project,  regardless 
of  the  class  orientation  of  its  appeals,  hardly 
attracted  the  support  of  a  clear  majority  of  the 
electorate  in  many  countries  during  the  first  decades 
of  its  development.  Hence,  socialist  parties  rarely 
had  the  opportunity  to  implement  their  program 
without  incorporating  other  political  allies. 

The  socialist  program  was  characterized  by  its 
reformist  nature.  This  feature  also  differentiated 
socialist  parties  from  other  revolutionary  and  radical 


groups.  In  its  original  implementation,  these  reform¬ 
ist  policies  were  understood  as  steps  in  the  path 
toward  the  complete  and  future  transformation  of 
capitalist  society,  following  the  socialist  ideal. 
Therefore,  socialist  parties  originally  believed  that 
the  accumulation  of  economic  and  social  reforms 
would  eventually  produce  a  radical  transformation 
of  the  social  order  in  a  socialist  way.  But  what  was 
far  from  clear  was  what  these  reformist  economic 
socialist  policies  should  exactly  be.  The  socialist 
parties  were  formally  favorable  to  the  nationaliza¬ 
tion  or  socialization  of  the  means  of  production. 
However,  once  the  socialist  parties  were  in  govern¬ 
ment  during  the  end  of  the  1 9th  and  first  third  of 
the  20th  centuries,  hardly  any  overarching  policy  of 
nationalization  took  place  in  Western  Europe. 
Despite  their  electoral  successes,  socialist  parties  did 
not  gain  enough  political  and  electoral  support  to 
promote  their  own  programs.  Furthermore,  they 
did  not  want  to  risk  their  political  prestige  in  a 
policy  they  were  unsure  how  to  implement  and  that 
could  find  strong  resistance.  In  the  absence  of  a 
well-defined  and  differentiated  socialist  economic 
policy,  socialist  parties  promoted  public  policies 
directed  toward  the  expansion  of  social  welfare 
through  different  plans  on  housing,  health,  salaries, 
unemployment,  and  so  on. 

In  the  1930s  and  1940s,  some  West  European 
socialist  parties  began  to  defend  and  to  implement 
Keynesian  policies  to  regulate  the  economic  crisis 
and  to  reduce  unemployment.  The  socialist  parties 
defended  an  active  role  of  the  state  in  the  econ¬ 
omy,  anticyclical  policies,  public  investment,  and 
public  debt,  aiming  to  maintain  jobs  and  to  rein¬ 
vigorate  the  economy.  After  World  War  II,  social¬ 
ist  parties  generally  adopted  Keynesian  policies  to 
regulate  the  economy;  the  projects  of  nationaliza¬ 
tion  or  socialization  were  almost  forgotten,  nation¬ 
alizations  were  rarely  implemented  (although 
some  took  place  during  postwar  economic  recon¬ 
struction),  and  the  parties  assumed  a  non-Marxist 
program  for  the  increase  of  equality  and  social 
justice  within  the  limits  of  capitalism.  Consequently, 
the  possibilities  of  an  advancement  of  equality  were 
linked  to  the  expansion,  inclusiveness,  and  redistrib¬ 
utive  efforts  of  state  intervention  and  of  social  wel¬ 
fare  programs.  However,  in  the  view  of  the  socialist 
parties,  the  state  should  limit  itself  to  the  provision  of 
public  goods,  compensating  through  these  policies 
the  inequalities  generated  by  the  market.  Given  that 
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this  program  should  be  developed  within  a  capital¬ 
ist  economy,  social  welfare  policies  were  very 
dependent  on  the  evolution  of  the  free  market 
economy,  of  its  efficient  management,  and  of  its 
growth  rate.  The  socialist  economic  and  social 
policies,  welfare  state  programs,  state  investment, 
and  state  fiscal  revenues  through  direct  taxation 
that  would  make  possible  all  these  policies  were 
subject  to  sustaining  a  growing  and  full  employ¬ 
ment  economy  that  ensures  private  sector  returns. 

This  bundle  of  socialist  policies  encountered  a 
benign  economic  environment  for  its  implementa¬ 
tion  from  the  end  of  World  War  II  until  the  1970s. 
This  period  has  been  sometimes  described  as  the 
golden  age  of  West  European  social-democratic  par¬ 
ties  and  policies.  In  any  case,  in  spite  of  the  influence 
of  non-social-democratic  parties  and  governments 
in  the  creation  of  West  European  welfare  states, 
some  elements  of  the  social-democratic  program 
became  a  crucial  component  of  the  West  European 
policy  status  quo  after  the  postwar  period. 

However,  what  could  be  termed  as  the  social- 
democratic  settlement  faltered  in  the  1970s,  when 
the  economic  context  worsened  notably.  The 
Keynesian  and  economic  redistribution  policies 
did  not  seem  to  be  feasible  under  the  new  eco¬ 
nomic  circumstances.  The  principles  that  guided 
the  policies  of  the  socialist  parties  were  challenged: 
The  role  of  the  state  in  the  management  of  the 
national  economies,  its  role  in  the  regulation  of  the 
market  dynamics,  and  the  welfare  state  policies 
suffered  from  economic  and  financial  crises, 
decreasing  growth  rates,  rising  unemployment, 
and  growing  internationalization  of  the  economy. 
Later  on,  the  social-democratic  orthodoxy — that 
rising  social  expenditure,  together  with  state  inter¬ 
vention  and  regulation,  ensured  better  functioning 
of  the  market — was  severely  challenged  by  eco¬ 
nomic  globalization. 

In  the  1980s  and  1990s,  not  only  did  socialist 
parties  confront  important  challenges  coming  from 
social,  cultural,  economic,  and  institutional  changes 
that  made  their  platforms  less  appealing  than  the 
neoliberal  or  neoconservative  ones,  but  they  also 
confronted  new  competition  from  New  Left  par¬ 
ties.  In  this  context,  the  socialist  parties  began  a  yet 
unfinished  search  for  a  new  program  that  could 
provide  them  with  a  new  identity  adapted  to  the 
changes  in  the  socioeconomic  environment  and  in 
the  patterns  of  electoral  competition.  During  this 


period,  social  democracy  underwent  a  substantial 
programmatic  change  that  in  many  cases  has 
meant  an  abandonment  of  the  previously  tradi¬ 
tional  social-democratic  concerns,  moving  toward 
more  centrist  and  promarket  policy  positions, 
sometimes  called  the  third  way. 

In  the  long  term,  socialist  parties  have  succes¬ 
sively  abandoned  their  original  anticapitalist  and 
Marxist  positions  of  the  late  19th  century,  have 
adopted  Keynesian  policies  compatible  with  the 
efficacy  of  macroeconomic  management  in  the  con¬ 
text  of  a  capitalist  society,  and  finally  have  moder¬ 
ated  their  reformist  impulses  to  match  the  strongly 
market-oriented  political  environment  that  has 
existed  since  the  1980s.  Although  they  still  have 
more  favorable  policy  positions  than  other  party 
families  toward  social  justice,  income  redistribu¬ 
tion,  and  state  social  expenditure,  a  policy  conver¬ 
gence  between  socialist  parties  and  right-of-center 
party  families  has  taken  place  in  Western  Europe. 

Luis  Ramiro 
University  of  Murcia 
Murcia,  Spain 
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Socialist  Systems 


Socialist  systems  are  those  regimes  based  on  the 
economic  and  political  theory  of  socialism,  which 
advocates  public  ownership  and  cooperative 
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management  of  the  means  of  production  and  allo¬ 
cation  of  resources.  These  include  states  described 
as  socialist,  communist,  or  Marxist,  all  of  which 
claim  to  be  based  on  efforts  to  achieve  social  jus¬ 
tice  and  equality.  The  best  term  for  these  regimes 
may  be  state  socialism,  which  has  gained  wide  cur¬ 
rency  in  the  political  science  literature.  In  response 
to  the  crisis  and  collapse  of  state  socialism  at  the 
end  of  the  1980s,  many  analysts  have  emphasized 
that  the  universalistic  goals  of  socialism  were  not 
thereby  invalidated,  since  many  other  currents  of 
socialism,  for  instance,  social  democracy,  have 
been  vigorously  developing  around  the  world. 

According  to  the  classical  Marxist  approach, 
socialism  is  only  the  first  stage  of  the  postcapitalist 
society,  followed  by  communism  as  a  second  stage 
in  which  people  receive  “according  to  their  needs.” 
Marxist  theory  was  elaborated  for,  and  based  on, 
the  most  developed  countries  of  the  world.  Although 
the  state  socialist  project  originated  from  Marxist 
theory,  it  was,  however,  a  deviation  from  the  origi¬ 
nal  theory  of  Karl  Marx.  The  application  of  this 
theory  in  backward  countries,  starting  with  Lenin’s 
Russia,  can  be  considered  as  turning  it  to  the  other 
extreme — that  is,  to  a  revolutionary  theory  for  the 
poorest  countries  of  the  world.  This  trend  reached 
its  peak  under  Joseph  Stalin  and  Mao  Zedong  and, 
later,  was  embraced  by  Fidel  Castro  of  Cuba  and 
by  other  local  leaders  in  the  developing  countries. 
What  is  more,  there  have  been  two  basically  differ¬ 
ent  paths  to  state  socialism.  Along  the  above- 
mentioned  lines  was  a  series  of  internal  revolutions 
and  subsequent  transformations,  as  in  Russia, 
China,  and  Cuba,  leading  to  state  socialist  regimes 
on  different  continents.  In  the  second  case,  there 
were  also  some  repressive  impositions  of  commu¬ 
nist  rule  on  some  countries  by  conquest,  as  for 
instance  in  Poland  and  Hungary.  This  second  path 
to  state  socialism  has  usually  been  distinguished 
from  the  first,  since  these  state  socialist  regimes 
were  to  a  great  extent  alien  in  terms  of  the  histori¬ 
cal  course  of  the  countries  concerned. 

The  following  sections  briefly  describe  the  his¬ 
tory  and  main  features  of  state  socialist  systems, 
the  political  science  models  that  account  for  their 
emergence,  and  their  historical  trajectories  from 
their  origins,  through  phases  of  industrialization 
and  social  transformation  to  their  abrupt  collapse 
in  response  to  the  global  economic  and  political 
crises  of  the  1970s  and  1980s. 


Characteristics  of  Socialist  Systems 

The  main  features  of  state  socialist  systems  are  the 
following: 

One-Party  System.  Political  rule  is  based  on  the 
unified  power  of  partocracy  or  on  the  concentra¬ 
tion  of  power  in  the  Communist  Party  as  a  super¬ 
state.  In  state  socialism,  there  is  no  division  of 
power  because  the  party  represents  the  dictator¬ 
ship  of  the  proletariat  and  builds  a  bureaucratic 
system  of  political  control.  Democratic  centralism, 
first  in  the  party  and  then  in  all  political  institu¬ 
tions,  became  the  general  organizational  principle 
for  state  socialism.  In  the  constitutions  of  the  coun¬ 
tries  concerned,  the  leading  role  of  the  party  in 
state  and  society  is  stipulated. 

Command  Economy.  This  is  an  economic  system 
with  central  planning  (Five-Year  Plans)  in  the  state- 
administered  economy.  There  can  be  no  private 
ownership  of  the  means  of  production,  since  these 
are  nationalized  and  state  owned.  Directive  plan¬ 
ning  as  instructions  from  above  is  the  method  of 
economic  management  in  an  economy  based  on 
state  property,  in  which  the  State  Planning  Agency 
(Gosplan  in  Russia)  acts  as  a  superministry.  But 
planning  was  a  bargaining  process  between/among 
various  actors  and  organizations,  with  centrally 
determined  prices,  heavily  subsidized  products,  and 
huge  subsidies  for  the  heavy  industries.  Command 
economy  is  accompanied  by  the  coercive  collectiv¬ 
ization  of  agriculture,  which  produced  a  general 
scarcity  and  low  quality  of  agricultural  goods. 

Personality  Cult  of  the  Leader.  A  cultural  system 
based  on  the  ideological  monopoly  of  “Marxism- 
Leninism”  and  on  the  “charisma”  of  the  number 
one  leader  was  followed.  This  ideological  monop¬ 
oly  means  that  the  official  ideology  penetrated  all 
spheres  of  cultural  life,  demanding  ideological 
conformity  and  uniformity.  State  socialist  systems 
had  a  future-oriented  legitimacy;  that  is,  they  tried 
to  legitimate  themselves  in  these  backward  coun¬ 
tries  as  the  means  for  catching  up  with  the  most 
developed  Western  states.  However,  this  legitimat¬ 
ing  device  was  more  and  more  eroded  in  the  1980s 
in  the  European  “socialist”  countries  by  the  slow¬ 
ing  down  of  economic  growth  and  by  the  growing 
attractiveness  of  the  Western  world  with  its  abun¬ 
dance  of  goods  and  services. 
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The  chief  actor  in  state  socialist  countries  was 
the  nomenklatura.  It  meant  that  the  Communist 
Party  was  involved  in  the  appointment  of  people  to 
all  senior  posts  in  society.  The  nomenklatura  prin¬ 
ciple  was  the  main  instrument  for  the  exercise  of 
democratic  centralism.  It  was  applied  not  merely 
within  the  party  itself  but  also  in  the  state  admin¬ 
istration,  the  police,  the  military,  the  judiciary, 
education,  the  economy,  and  social  organizations 
such  as  the  trade  unions  and  the  people’s  fronts — 
in  short,  it  involved  total  control  over  appoint¬ 
ments  in  all  major  organizations.  The  party  struc¬ 
ture  was  highly  hierarchical;  the  real  power  was  in 
the  hands  of  a  larger  central  committee  and  a 
smaller  politburo  controlled  by  the  almighty 
leader.  The  party  congress  and/or  the  party  confer¬ 
ence  were  only  legitimizing  devices  for  democratic 
centralism  and  the  official  communication  plat¬ 
forms  for  the  economic  and  political  decisions  of 
the  party  leadership.  The  Communist  Party  was 
also  a  mobilizing  instrument  since  there  was  a 
rather  large  party  membership — between  6%  and 
10%  of  the  adult  population — as  a  “transmission 
belt”  to  the  population  as  a  whole  to  implement 
party  decisions. 

The  emergence  of  state  socialist  systems  was 
one  of  the  most  important  historical  developments 
of  the  20th  century.  In  the  1980s,  at  its  peak,  40% 
of  the  world’s  population  lived  in  the  socialist 
world  (i.e.,  in  countries  run  by  communist  parties), 
and  one  third  of  the  global  industrial  output  was 
produced  by  the  various  state  socialist  systems. 
The  core  state  socialist  countries  were  military 
allies  and  economic  partners  of  the  Soviet  Union  in 
two  major  organizations:  the  Council  for  Mutual 
Economic  Assistance  (CMEA,  or  Comecon,  1949- 
1991)  and  the  Warsaw  Treaty  Organization 
(1955-1991).  These  organizations  were  instru¬ 
ments  of  the  great  power  efforts  of  the  Soviet 
Union  for  global  dominance.  The  Warsaw  Pact 
and  Comecon  were  the  two  pillars  of  Soviet  global 
power  as  the  basic  military  alliance  and  economic 
organization  in  the  bipolar  world  on  the  side  of  the 
Soviet-dominated  state  socialist  countries.  Come¬ 
con  was  organized  in  January  1949  as  a  reaction 
to  the  Marshall  Plan  (1947),  and  it  was  extended 
beyond  Europe  with  the  entry  of  Mongolia  (1962), 
Cuba  (1972),  and  Vietnam  (1978).  The  Warsaw 
Pact  as  a  reaction  to  the  North  Atlantic  Treaty 
Organization  (NATO)  was  also  an  instrument  of 


the  coercive  control  of  the  Soviet  Union  over  its 
allies.  The  major  principle  of  the  Warsaw  Pact  was 
formulated  in  the  Brezhnev  doctrine  of  limited 
sovereignty  (December  1968)  after  the  Soviet 
Union’s  crushing  of  Czechoslovakian  reform 
efforts  in  the  “Prague  Spring.”  The  state  socialist 
countries  in  general  had  a  network  of  multilateral 
treaties,  but  there  were  also  a  series  of  bilateral 
treaties  between  the  core  state  socialist  countries 
modeled  after  their  contacts  with  the  Soviet  Union. 

Varieties  of  State  Socialism 

The  socialist  world  system  embraced  four  conti¬ 
nents  with  many  countries  for  shorter  or  longer 
periods,  reaching  its  largest  territorial  extension  in 
the  mid-1980s.  However,  these  countries  claiming 
to  be  “socialist”  were  not  homogeneous  at  all,  nor 
did  they  form  a  real  “world  system,”  since  they 
were  split  by  serious  tensions  and  deep  cleavages, 
first  of  all  by  the  adversarial  competition  for  dom¬ 
inance  between  the  Soviet  and  Chinese  models. 
The  state  socialist  countries  were  usually  allies  of 
one  of  these  two  big  powers  that  tried  to  extend 
their  respective  spheres  of  influence,  even  against 
each  other.  Given  their  diverging  and  conflicting 
approaches  to  the  one-party  state,  command 
economy,  and  personality  cult  of  the  leader,  the 
state  socialist  countries  were  sometimes  fierce 
enemies  of  each  other  in  different  historical  peri¬ 
ods.  First,  after  World  War  II,  a  conflict  emerged 
between  the  Soviet  Union  with  its  allies  and 
Yugoslavia  and,  parallel  with  it,  between 
Yugoslavia  and  Albania;  then  in  the  1960s  between 
the  Soviet  Union  and  China  (and  their  allies);  and 
finally  between  China  and  Vietnam.  China  and 
Vietnam  fought  a  real  war  in  1979,  but  the  Soviet 
Union  and  China  also  had  a  series  of  border 
clashes.  As  state  socialism  spread  in  the  world,  so 
the  controversies  became  deeper  between  its  lead¬ 
ing  powers;  therefore,  in  fact,  no  unified  socialist 
world  system  ever  existed. 

One  can  distinguish  the  following  territorial 
groupings  of  state  socialist  countries  in  the 
1980s: 

European  group:  Albania,  Bulgaria, 

Czechoslovakia,  German  Democratic  Republic, 

Hungary,  Poland,  Romania,  Soviet  Union,  and 

Yugoslavia 
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Asian  group:  Afghanistan,  Burma,  Cambodia, 
China,  Laos,  Mongolia,  North  Korea,  and  Vietnam 

Latin  American  group:  Cuba,  Guyana,  Nicaragua, 
and  Surinam 

Arab  socialist  group:  Algeria,  Iraq,  Libya,  Syria, 
and  Yemen 

Black  African  group  (usually  called  socialist 
orientation):  Angola,  Benin,  Burkina  Faso, 

Ethiopia,  Ghana,  Guinea,  Guinea  Bissau,  Congo, 
Madagascar,  Mozambique,  Somalia,  Tanzania,  and 
Zimbabwe  (These  countries  were  fragile, 
nondurable  systems,  and  it  was  very  difficult  to 
define  their  “socialist”  character  beyond  official 
declarations.) 

Three  models  have  been  offered  in  political  sci¬ 
ence  for  the  explanation  of  the  emergence  and 
transitory  survival  of  state  socialist  countries: 

1.  Totalitarianism:  This  involved  the  state  pen¬ 
etration  of  society  for  its  total  control.  This  theory 
was  very  fashionable  after  World  War  II  and  also 
after  the  collapse  of  the  Soviet-dominated  socialist 
world  system.  Totalitarianism  is  an  ideological 
label  for  modern  tyranny.  It  is  simplistic  but  is  still 
very  popular  as  a  journalistic  term.  However,  this 
theory  fails  to  answer  two  basic  questions:  first, 
how  totalitarian  control  over  the  society  can  work 
at  all,  and  if  it  is  so,  then  second,  how  political 
change  can  be  explained  according  to  this  static 
and  rigid  model.  No  doubt,  this  theory  makes 
some  contributions  to  the  partial  understanding  of 
the  worst  and  most  repressive  period  of  Stalinism, 
between  1949  and  1953,  or  the  current  history  of 
North  Korea,  for  instance.  But  it  is  unable  to 
explain  the  state  socialist  project  as  a  whole  in  its 
historical  dynamism  from  the  October  Revolution 
in  1917  to  the  collapse  of  the  Soviet  Union  in 
1991,  with  its  internal  fights,  oppositions  and 
counterelites,  and  increasing  social  resistance.  The 
totalitarian  form  of  state  socialism  is  a  unique  phe¬ 
nomenon  in  world  history  as  a  worst-case  scenario 
of  historical  blind  alleys. 

2.  Radical  revolution:  This  is  the  “Bolshevik”- 
type  seizure  of  power  by  the  working  class  or  the 
poor  peasantry  through  a  long  internal  war  or 
“permanent  revolution.”  These  transitory  societies 
have  been  in  between,  or  have  combined  elements 


of,  capitalism  and  socialism.  The  internal  revolu¬ 
tionary  potential  of  these  poor  countries  has 
played  a  dominant  role  in  the  fight  against  depen¬ 
dency  on  world  capitalism  and  for  national  inde¬ 
pendence,  so  “socialism”  and  “nationalism”  have 
also  been  combined.  According  to  this  model,  the 
fate  of  these  revolutions  supposedly  remains  open. 
They  may  develop  toward  a  “real”  socialism, 
toward  communism,  or  toward  the  capture  of 
power  by  the  state  bourgeoisie  as  nomenklatura, 
leading  to  the  bureaucratic  deformation  of  state 
socialism.  For  the  theory  of  radical  revolution, 
state  socialism  is  a  historical  hybrid,  a  combina¬ 
tion  of  “genuine”  socialism  or  anticapitalism  and 
dependent  capitalism  in  the  guise  of  revolutionary 
nationalism,  that  has  survived  nowadays  mostly  or 
exclusively  in  the  Third  World. 

3.  Modernization:  This  refers  to  the  catching-up 
exercise  of  a  modernizing  elite  organized  in  the 
form  of  a  communist  party  to  establish  an  indus¬ 
trial  society  belatedly.  The  repressive  state  appara¬ 
tus  is  in  fact  acting  as  an  instrument  of  state 
capitalism  to  carry  out  the  process  of  capital  accu¬ 
mulation  through  the  forcible  extraction  of  surplus 
from  the  working  class  and  the  peasantry.  In  the 
last  analysis,  this  modernization  exercise  leads  to 
the  convergence  with  world  historical  develop¬ 
ment,  if  the  catching-up  effort  proves  to  be  suc¬ 
cessful.  Thus,  in  the  final  analysis,  state  socialism 
is  just  one  of  the  more  peculiar  ways  of  reaching 
the  stage  of  modern  capitalist  society.  For  modern¬ 
ization  theory,  state  socialism  is  not  an  aberration 
or  “hell  on  the  earth”  but  only  one  of  several  tran¬ 
sitory  and  controversial  routes  that  early  industrial 
societies  take  to  converge  finally  with  mainstream 
global  development. 

Historical  Trajectory  of  State  Socialism 

The  usual  descriptions  follow  only  the  historical 
trajectory  of  communist  ideologies  from  the  theo¬ 
retical  foundations  by  Marx  and  Engels  through 
Lenin  and  Stalin  to  Mao  Zedong  and  Castro. 
Lenin  indeed  elaborated  the  revolutionary  theory 
for  a  backward  Russia  with  the  basic  principles  of 
the  Bolshevik  party  as  its  democratic  centralism. 
He  emphasized  the  revolutionary  potential  of 
Russia  as  the  weakest  part  of  the  imperialist  world 
system.  After  the  victory  of  the  October  Revolution 
and  the  failure  of  the  “world  revolution,”  Stalin 
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introduced  the  principle  of  “socialism  in  one  coun¬ 
try.”  To  consolidate  the  state  socialist  system, 
Stalin  exercised  a  high  level  of  political  repression 
to  overcome  social  resistance  and  to  complete  the 
state-building  process  in  the  early  stage  of  industri¬ 
alization.  He  completed  the  building  of  the  repres¬ 
sive  one-party  state  and  a  command  economy  with 
the  personality  cult,  reducing  the  Russian  populace 
to  infantile  dependence  on  the  state.  The  vigorous 
development  of  industrial  society  led  to  the  growth 
of  Russia  as  a  great  power  after  the  victory  in 
World  War  II.  Mao  Zedong  and  later  Castro 
extended  the  state  socialist  ideology  more  and 
more  to  the  peasantry  as  the  main  revolutionary 
force.  These  ideological  forms  have  been  in  fact  a 
combination  of  the  class  struggle  theory  with  the 
rise  of  nationalism  in  the  (post)colonial  world, 
where  large  peasant  masses  were  dislocated. 

Behind  these  ideologies,  the  “existing  social¬ 
ism”  had  to  cope  with  real,  not  only  ideological, 
obstacles  and  controversies.  State  socialism 
emerged  after  World  War  I  as  an  alternative  to 
world  capitalism,  and  it  vanished  in  the  early 
1990s  as  a  meaningful  alternative.  It  came  to  an 
end  as  an  important  social  formation,  even  if  some 
of  its  hybrid  forms  have  survived  in  China  and 
Vietnam;  in  some  other  archaic  forms  with  tradi¬ 
tional  authoritarianism,  as  in  North  Korea;  and 
with  some  vague  postcolonial  variants  elsewhere. 
There  were  some  basic  internal  and  external  built- 
in  obstacles  that  were  responsible  for  the  vanishing 
of  state  socialism  or  the  “really  existing”  socialism 
as  a  major  type  of  development  by  the  end  of  the 
20th  century.  The  previously  noted  catching-up 
exercise  from  inside  as  well  as  underdevelopment 
and  external  dependence  were  the  domestic  and 
international  reasons  that  emerged,  but  a  combi¬ 
nation  of  internal  and  external  built-in  obstacles 
caused  its  abrupt  demise  at  the  end  of  the  20th 
century.  If,  and  when,  state  socialism  generated 
development,  it  also  produced  some  economic  fac¬ 
tors  and  sociopolitical  forces  that  turned  against  it. 
As  extensive  industrialization  came  to  an  end,  the 
problems  of  economic  management  were  aggra¬ 
vated,  and  the  command  economy  proved  to  be 
unable  to  achieve  economic  integration  and  steer 
the  structures  created  by  intensive  industrializa¬ 
tion.  Therefore,  the  economic  crisis  cycles  were 
integral  parts  of  this  system  of  recurring  imbal¬ 
ances.  The  historical  trajectory  presented  different 


models  from  war  communism  to  market  socialism, 
but  the  policy  of  perestroika  failed,  and  the  reforms 
instituted  by  Premier  Mikhail  Gorbachev  were  the 
last  attempt  at  a  revival.  At  the  same  time,  the 
external  obstacle  of  world  capitalism  proved  more 
and  more  that  the  economies  of  state  socialist 
countries  were  not  competitive  in  the  world  mar¬ 
ket.  They  were  suitable  only  for  the  first  and  most 
drastic  phase  of  industrialization,  followed  in  the 
1980s  by  economic  decline  in  the  core  state  social¬ 
ist  countries  during  their  second  phase  of  industri¬ 
alization,  although  at  the  same  time,  state  socialism 
had  spread  to  the  world’s  poorest  countries,  where 
early  industrialization  was  still  taking  place.  State 
socialism  lost  the  economic  competition  with  the 
West  in  the  1980s,  when  the  Soviet  Union  had 
become  the  world’s  largest  industrial  society  of  the 
late-1 9th -century  type,  with  an  obsolete  socioeco¬ 
nomic  structure  and  a  high  level  of  militarization 
and  also  with  an  economy  of  shortage  that  would 
lead  ultimately  to  mass  discontent  and  full  delegiti¬ 
mation  of  the  socioeconomic  system. 

Although  all  state  socialist  countries  had  a  leg¬ 
acy  of  authoritarianism  from  their  previous  his¬ 
tory,  the  new  type  of  repressive  state  soon  became 
itself  an  internal  obstacle  to  their  socioeconomic 
development.  Political  reforms  were  always  high 
on  the  agenda  because  it  was  clear  that  the  political 
system  could  not  work  properly,  since  it  continu¬ 
ally  created  new  obstacles  to  its  own  working — the 
new  industrial  and  educated  classes  that  had  arisen 
as  a  result  of  socioeconomic  development  could 
not  fit  into  the  political  system.  When  the  urban¬ 
ized  strata,  including  the  new  working  class,  began 
to  demand  more  human  rights  and  freedoms,  they 
were  confronted  with  the  implacable  tenets  of  state 
socialism  as  a  one-party  state.  Thus,  they  turned 
against  the  ideological  monopoly  of  the  party  and 
demanded  an  open  society. 

In  the  core  state  socialist  countries,  a  series  of 
grave  political  crises  in  1956  (Hungary),  1968 
(Czechoslovakia),  and  1988  (Poland)  finally  cul¬ 
minated  in  a  deep  crisis  of  political  legitimacy  at 
the  end  of  the  1980s.  Two  factors — economic  and 
political — produced  a  combined  internal  pressure 
at  the  end  of  the  1980s,  when  the  democratic  ero¬ 
sion  was  strengthened  by  the  external  pressure  of 
globalization.  As  global  telecommunication  pene¬ 
trated  to  these  countries,  the  advantages  of  the 
Western  economic  order  and  world  market  were 
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displayed  in  much  more  detail  than  before  and 
resulted  in  the  spread  of  democratic  ideas  through 
the  new  types  of  media.  All  in  all,  the  year  1975 
may  be  considered  as  a  turning  point  for  both 
internal  models  and  external  competition: 

1.  Until  the  mid-1970s,  the  core  state  socialist 
countries  developed  much  more  quickly  than  the 
Western  states;  thus,  the  future-oriented  legitimacy 
of  the  catching-up  model  seemed  to  work.  But  the 
global  crisis  in  the  1970s  caused  the  economies  of 
the  state  socialist  countries  to  slow  down,  and 
owing  to  this  deceleration  of  socioeconomic  devel¬ 
opment,  state  socialism  in  the  most  developed  core 
countries  lost  its  legitimacy  step  by  step.  The  devel¬ 
opment  model  of  state  socialist  countries  was 
shifted  to  the  poorest  countries,  so  as  a  result  of 
this  qualitative  failure,  state  socialism  was  signifi¬ 
cantly  weakened  in  the  core  countries,  but  para¬ 
doxically,  it  became  attractive  in  the  poorest 
countries,  and  it  began  its  largest  quantitative  ter¬ 
ritorial  extension  in  the  Third  World. 

2.  During  their  industrialization  process,  the 
state  socialist  countries  looked  externally  competi¬ 
tive  for  a  long  time;  therefore,  even  the  Western 
modernization  models  suggested  that  they  were 
successful  industrial  societies.  But  they  could  not 
adapt  to  the  structural  changes  in  the  world  econ¬ 
omy  during  the  1970s;  their  obsolete  structures 
remained  and,  increasingly,  were  lagging  behind  in 
the  global  competition.  At  the  peak  of  detente  in 
1975,  the  Conference  on  Security  and  Cooperation 
in  Europe  (CSCE,  later  the  Organization  of  Security 
and  Co-operation  in  Europe;  OSCE) — with  the 
participation  of  the  United  States  and  Canada — 
became  instrumental  in  the  erosion  of  state  social¬ 
ism  through  the  permanent  campaign  for  human 
rights  (Helsinki  Watchdog  groups  for  human 
rights  in  Central  and  Eastern  Europe). 

3.  There  was  also  an  imperial  overstretching  of 
Russia’s  resources,  both  externally  and  internally. 
To  sustain  such  an  empire  and  its  full  apparatus  of 
oppression  demanded  vast  energy  and  expendi¬ 
ture,  creating  an  internal  obstacle  for  Russian 
society  from  the  combined  effects  of  full  militariza¬ 
tion,  the  economic  burden  of  producing  military 
hardware,  and  the  social-ideological  burden  of 
suppressing  human  freedom.  In  the  end,  the  Soviet 
Union  was  too  weak  economically  to  maintain 


such  a  world  empire,  which  was  very  costly  and 
diverted  resources  from  Russia’s  own  socioeco¬ 
nomic  development  (around  one  third  of  its  gross 
domestic  product).  Consequently,  the  system’s 
eventual  collapse  was  not  only  a  political  failure 
but  also  an  economic  necessity. 
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Soft  Law 


Although  law  is  often  thought  of  as  binding  on 
those  to  whom  it  applies,  and  sanctions  are  associ¬ 
ated  with  breaking  the  law,  some  legal  rules  are 
voluntary  and  not  coercive.  Particularly  in  interna¬ 
tional  law,  legal  rules  are  not  legally  binding,  and 
legal  sanctions  cannot  be  used  in  cases  of  noncom¬ 
pliance.  Soft  law  refers  to  those  legal  rules  that  are 
not  legally  binding  and  for  which  legal  sanctions 
cannot  be  used  in  cases  of  noncompliance.  These 
rules  can  be  guidelines,  codes  of  conducts,  standards, 
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and  other  voluntary  rules.  In  contrast  to  hard  law, 
soft  law  can  be  decided  by  a  variety  of  actors,  pub¬ 
lic  and  private.  Making  hard  law  is  the  prerogative 
of  legislative  assemblies,  national  parliaments,  and 
the  European  parliament,  with  a  few  exceptions. 
Soft  law  can  be  created  by  governments,  interna¬ 
tional  organizations,  nongovernmental  organiza¬ 
tions,  and  the  business  community.  It  should  be 
noted  that  the  distinction  between  hard  and  soft 
law  is  hard  to  make  in  practice.  Legally  binding 
law  often  contains  references  to  standards  and 
other  non-legally  binding  rules,  and  soft-law  rules 
often  contain  references  to  legally  binding  legisla¬ 
tive  rules.  The  two  types  of  rules  are  therefore  ideal 
types  that  cannot  easily  be  found  in  their  pure  form 
empirically.  This  entry  defines  soft  law,  contrasts  it 
with  other  forms  of  public  sector  intervention,  and 
discusses  its  applications  in  the  European  Union 
(EU)  and  elsewhere. 

Definition 

Ulrika  Morth  defines  soft  law  as  legitimate  rules 
that  may  have  political  and  legal  effects,  although 
the  rules  are  not  legally  binding.  This  means  that 
not  every  voluntary  rule  can  be  regarded  as  soft 
law.  In  cases  of  noncompliance,  the  sanctions  are 
social  and  political  rather  than  legal.  A  good  exam¬ 
ple  of  how  these  social  sanctions  might  look  is  the 
peer  pressure  mechanism  in  the  international  body 
the  Organisation  for  Economic  Co-operation  and 
Development  (OECD).  OECD  lacks  competence  to 
issue  legally  binding  laws.  However,  it  is  often 
argued  that  the  organization  has  had  important 
influence  on  the  policies  of  member  states,  as  these 
concern,  for  instance,  reforms  in  the  public  sector, 
often  labeled  as  the  new  public  management  reform. 
Social  and  political  pressures  are  put  on  the  mem¬ 
ber  states  by  “naming  and  shaming”  countries  that 
do  not  follow  the  OECD  recommendations.  The 
soft  law-making  processes  in  the  OECD  are  also 
characterized  as  social  processes  in  which  there  is 
confidence  building,  exchanges  of  experiences, 
identity  building,  and  socialization  among  the 
national  officials;  this  can  explain  why  the  member 
states  implement  the  OECD  recommendations. 

A  classic  and  often  used  definition  of  soft  law  is 
that  by  Francis  Snyder,  in  which  he  states  that  they 
are  rules  that  have  no  binding  force  but  that  they 
may  in  any  case  “have  practical  effects”  (Snyder, 


1993,  p.  198).  A  more  thorough  definition  is  pre¬ 
sented  by  Karel  Wellens  and  Gustaaf  Borchardt 
(1989).  Soft  law  is 

the  rules  of  conduct  that  find  themselves  on  the 
legally  non-binding  level  (in  the  sense  of  enforce¬ 
able  and  sanctionable  through  international 
responsibility)  but  which  according  to  the  inten¬ 
tion  of  its  authors  indeed  do  possess  legal  scope, 
which  has  to  be  further  defined  in  each  case.  Such 
rules  do  not  have  in  common  a  uniform  standard 
of  intensity  as  far  as  their  legal  scope  is  con¬ 
cerned,  but  they  do  have  in  common  that  they  are 
directed  at  (intention  of  the  authors)  and  do  have 
as  effect  (through  international  law),  that  the 
conduct  of  States,  international  organizations 
and  individuals  is  influenced  by  these  rules,  how¬ 
ever  without  containing  international  legal  rights 
and  obligations,  (p.  274) 

The  essence  of  these  three  definitions  is  that  soft 
law  is  defined  in  a  procedural  way,  that  it  consists 
of  non-legally  binding  rules,  and  that  it  comes  in 
many  varieties.  What  Morth’s  definition  adds  to 
the  traditional  ones  is  thus  the  aspect  of  legitimacy. 

Forms  of  Public  Sector  Intervention 

The  phenomenon  of  soft  law  has  for  a  long  time 
been  used  in  global  organizations,  in  the  EU,  and 
in  nation-states,  but  it  was  not  until  the  1970s  that 
the  very  concept  of  soft  law  was  established.  One 
explanation  for  why  the  concept  started  to  be  used 
by  scholars  and  others  was  the  fundamental 
changes  in  international  politics.  The  oil  crises  in 
the  1970s  changed  how  students  of  international 
politics  defined  international  power.  They  changed 
their  power  perspective  from  a  state-centric  one  to 
a  more  pluralist  view  on  power  in  a  globalized 
world.  The  increased  awareness  in  the  1970s  of 
globalization  has  since  given  rise  to  a  vivid  litera¬ 
ture  on  the  need  for  global  rules  in  a  globalized 
world.  Studies  of  international  regimes,  as  a  way 
of  analyzing  the  new  international  landscape,  have 
argued  that  international  cooperation  consists  of 
private  and  public  actors  and  that  non-legally 
binding  rules  are  sometimes  the  only  ones  possible. 
Indeed,  many  international  organizations  use  soft 
law  because  they  have  no  other  alternative.  The 
absence  of  a  world  government  that  can  decide  on 
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supranational  and  legally  binding  rules  makes  vol¬ 
untary  rules  the  only  viable  alternative.  Even  coer¬ 
cive  rules,  which  can  be  decided  by  the  United 
Nations  Security  Council,  are  sometimes  viewed  as 
soft  because  of  the  lack  of  credible  legal  sanctions 
in  cases  of  noncompliance.  There  is  an  illusory 
expectation  that  legally  binding  rules  are  more 
efficient  when  it  comes  to  compliance  in  a  global¬ 
ized  world,  which  lacks  the  authority  of  and  demo¬ 
cratic  system  found  within  nation-states. 

Applications  in  the  European  Union 

The  use  of  soft  law  has  increased  in  the  EU,  which 
seems  to  contradict  the  explanation  mentioned 
above  that  soft  law  is  often  the  only  alternative  for 
international  organizations.  Indeed,  the  EU  has  the 
legal  competence  to  decide  on  hard  law.  One  expla¬ 
nation  of  why  soft  law  is  used  more  is  the  increased 
need  for  flexibility  when  there  is  a  concern  about 
the  possibility  of  noncompliance.  Soft  law  is  prefer¬ 
able  when  you  are  dealing  with  problems  that  are 
complex  and  diverse  and  characterized  by  uncer¬ 
tainty.  Thus,  rule  making  concerning  highly  contro¬ 
versial  issues  is  politically  the  only  way  of  moving 
forward  in  the  European  integration  process  because 
the  member  states  want  to  retain  national  sover¬ 
eignty  in  these  areas.  These  controversial  questions 
often  go  beyond  the  issues  on  the  domestic  market 
and  other  issues,  such  as  economic  policy,  taxes, 
pensions,  and  other  welfare  topics,  that  are  decided 
within  the  EU  member  states.  The  alternative  to 
soft  law  is  in  practice  never  hard  law.  This  is  so 
because  of  the  unwillingness  of  EU  member  states 
to  delegate  supranational  decision-making  powers 
to  the  EU  with  respect  to  politically  controversial 
welfare  state  issues.  By  deciding  on  soft  law,  the 
national  governments  can  argue  to  their  EU-skeptical 
citizens  that  as  these  rules  are  voluntary,  they  have 
not  delegated  national  sovereignty  to  the  EU. 

Nonstate  Actors 

So  far,  we  have  only  discussed  the  use  of  soft  law 
by  international  organizations.  Legitimate  rule 
making  is  not  only  a  prerogative  of  intergovern¬ 
mental  organizations  or  of  states.  Private  actors, 
both  for-profit  and  nonprofit  actors  (nongovern¬ 
mental  organizations),  are  sometimes  also  seen  as 
legitimate  rule  makers.  Soft  law  is  often  linked  to 


nonstate  actors  and  to  their  participation  in 
national,  European,  and  global  rule  making.  The 
private  actors  are  not  formally  part  of  the  hard- 
law  legislative  process  that  one  finds  in  nation¬ 
states  or  in  the  EU.  If  they  want  to  set  up  rules  that 
regulate  their  activities,  they  have  to  use  soft  law. 
Voluntary  rules  are  often  the  preferred  alternative 
for  the  business  community,  which  wants  to  pre¬ 
empt  any  state  interference.  One  telling  example  is 
the  emergence  of  corporate  social  responsibility, 
which  consists  of  various  codes  of  conduct  that 
regulate  how  a  socially  responsible  firm  should 
behave.  Whether  these  soft  rules  are  complied  with 
or  not  is  often  monitored  by  various  nongovern¬ 
mental  organizations  and  not  by  states  or  other 
public  actors.  What  is  interesting  with  the  emer¬ 
gence  of  corporate  social  responsibility  is  that  it 
seems  to  support  the  idea  that  we  are  living  in  a 
time  when  soft  law  is  the  most  readily  available 
regulatory  mode  not  only  for  private  actors  but 
also  for  states.  States  are  less  prone  to  use  their 
hierarchical  authority  and  so  become  advisory 
rather  than  directing.  This  change  from  hierarchi¬ 
cal  authority  to  a  more  horizontal  steering  is  some¬ 
times  characterized  as  a  move  from  government  to 
governance.  The  steering  mode  of  governance  has 
empowered  private  actors  and  given  them  an 
authority  that  traditionally  belonged  to  the  state 
and  the  public  sector.  The  spread  of  codes  of  con¬ 
duct,  standardization,  and  other  voluntary  rules 
indicates  that  a  range  of  actors  other  than  public 
ones  strive  to  establish  regulation  through  innova¬ 
tive  mechanisms.  The  recent  trends  in  making  pri¬ 
vate  firms  more  socially  responsible  suggest  that 
we  are  dealing  with  a  cross-sectional  phenomenon. 
It  is  a  reflection  of  transformed  boundaries  and 
relations  across  societal  sectors — business,  state, 
and  civil  society.  These  transformed  boundaries 
can  partly  be  explained  by  new  demands  on  the 
three  sectors  and  spheres.  Modern  society  places 
many  demands  on  private  companies  and  on  pub¬ 
lic  actors.  States  are  placing  more  emphasis  on 
regulation  instead  of  on  public  ownership  and 
directly  provided  services.  The  state  has  a  regula¬ 
tory  role  and  a  contracting  role  in  agreeing  on 
rules  between  the  public  and  the  private  spheres. 
These  rules  are  increasingly  soft  or  a  complex  com¬ 
bination  of  hard  and  soft  law  in  which  one  may 
also  find  legally  binding  rules  that  are  decided  not 
by  legislators  but  by  the  contracting  partners.  This 
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mixture  of  actors  and  rules  seems  to  blur  the  bor¬ 
der  between  private  and  public  actors.  A  new  pub¬ 
lic  domain  emerges  in  which  there  are  actors  and 
rules  from  the  state,  from  business,  and  from  civil 
society.  This  public  domain  is  transnational  rather 
than  national  or  international.  Soft  law  makes 
boundaries  less  relevant  and  important.  From  a 
democratic  point  of  view,  this  state  of  no  borders 
between  the  public/private  and  the  national/inter¬ 
national  could  be  a  problem.  Democracy  is  often 
practiced  in  terms  of  representative  democracy, 
which  means  that  elected  politicians  are  the  hard 
law  legislators.  They  are  accountable  for  the  laws 
and  for  the  decisions  that  they  make.  The  use  of 
soft  law  broadens  the  possibilities  for  who  the 
legitimate  rule  makers  may  be  and  extends  rule 
making  to  actors  other  than  elected  politicians. 
This  means  that  the  realm  of  the  rule-making 
sphere  expands  but  the  democratic  realm  only  con¬ 
cerns  some  of  the  actors.  This,  in  turn,  could  lead 
to  a  situation  in  which  those  who  have  power  are 
not  democratically  accountable  for  the  decisions 
they  make  and  that  those  who  can  be  democrati¬ 
cally  accountable  have  no  power. 

Ulrika  Mortb 
Stockholm  University 
Stockholm,  Sweden 
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Solidarity 


Solidarity  is  a  feeling  of  being  connected  to  and 
responsible  for  others.  This  experience  of  connec¬ 
tion  and  obligation  is  a  key  aspect  of  the  life  of  any 
collectivity  and  its  members.  Solidarity  has  there¬ 
fore  received  much  attention  in  political  thought 
concerning  both  premodern  and  modern  politics. 
In  the  context  of  democratic  culture,  it  has  been 
one  of  the  celebrated  ideas  and  ideals,  although 
perhaps  not  as  much  as  liberty  and  equality, 
among  others.  This  has  to  do  with  the  central,  yet 
difficult,  nature  and  role  of  solidarity.  Hence,  the 
two  questions  that  this  entry  strives  to  answer  are 
“What  are  the  general  characteristics  of  solidar¬ 
ity?”  and  “What  are  the  challenges  it  entails?” 

Characteristics  of  Solidarity 

Solidarity  refers  to  the  type  of  relation  in  which 
people  feel  that  they  have  an  obligation  to  help 
others,  for  instance,  because  they  experience  a  con¬ 
nection  and  “we-ness”  with  them  based  on  a  com¬ 
munity  of  interest  and  values — to  help  others  so 
that  they  are  empowered  and  can  stand  on  their 
own  feet.  Obviously,  the  modalities  of  this  feeling 
of  solidarity  and  of  obligation,  including  its  level 
of  intensity,  vary  with  the  configuration  of  the 
environment  in  which  it  takes  place.  In  this  per¬ 
spective,  solidarity  can  be  organized  and  analyzed 
according  to  three  categories.  Although  these  cat¬ 
egories  can  overlap  in  reality,  we  present  them 
separately  for  the  sake  of  clarity.  They  concern  the 
structural  character  of  solidarity,  its  scope,  and  its 
subjective  dimension.  From  this,  we  can  infer  that 
solidarity  has  a  strong  connection  with  responsi¬ 
bility  and  justice. 

Structural  Solidarity 

Solidarity,  understood  and  defined  as  struc¬ 
tural,  alludes  to  the  social  role  that  it  plays  in  a 
collective  entity,  whatever  this  collectivity  is 
(group,  community  or  society,  public  or  private, 
etc.).  This  role  suggests  that  a  collectivity  cannot 
progress  and  develop  if  its  members  are  not  work¬ 
ing  together  to  achieve  the  common  goal(s)  with 
which  it  identifies  or  if  they  do  not  feel  and  act  on 
a  sense  of  solidarity  toward  one  another  and  the 
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collectivity.  This  structural  aspect  of  solidarity  is 
why  solidarity  tends  to  be  seen  as  the  expression, 
instrument,  and  result  of  a  commitment  to  the 
group,  its  members,  and  their  well-being.  Even  the 
emergence  and  sustainability  of  cooperation,  so 
essential  socially  because  it  epitomizes  mutually 
beneficial  interactions  and  the  collective  good, 
would  be  quite  impossible  if  people  did  not  think 
that  they  can  count  on  the  help  of  others — on  their 
sense  of  solidarity — when  needed. 

Because  solidarity  as  a  structural  phenomenon  is 
about  the  possibility  of  collective  life,  it  implies  that 
each  individual  existence  is  social;  that  is,  it  relies 
in  crucial  ways  on  the  sense  of  solidarity  with  oth¬ 
ers.  This  comes  down  to  the  fact  that  people  cannot 
be  and  do  much  on  their  own.  To  a  large  extent,  all 
people — no  matter  how  capable — depend  on  the 
care  of  others  to  grow  and  flourish.  Hence,  at  most 
junctures  of  their  life,  mundane  or  fundamental, 
individuals  are  the  recipients  of  multiple  actions  of 
solidarity.  The  final  stage,  when  the  end  has  come, 
is  no  exception.  It  is  a  deep  sense  of  human  and 
spiritual  solidarity  that  allows  us  to  be  buried  with 
dignity  and  remembered  after  we  are  gone. 

The  structural  character  of  solidarity  and  the 
modalities  of  the  social  dimension  of  each  indi¬ 
vidual  existence  are  not  fixed  forever.  They  differ 
according  to  the  type  of  social  group  in  which  they 
unfold.  Regardless  of  whether  his  analysis  and 
classification  of  solidarity  is  correct,  Emile 
Durkheim  had  this  in  mind  when  he  distinguished 
mechanical  solidarity  from  organic  solidarity.  For 
him,  mechanical  solidarity  was  characteristic  of 
the  relatively  undifferentiated  social  structures  of 
premodern  societies,  where,  in  his  view,  members 
are  bound  together  by  shared  values,  beliefs,  and 
customs  and  are  generally  engaged  in  the  same 
activities.  In  contrast,  he  saw  organic  solidarity  as 
more  typical  of  modern,  industrial  societies,  where 
people  are  bound  together  by  a  high  degree  of 
functional  interdependence  based  on  a  complex 
division  of  labor. 

The  Scope  of  Solidarity 

A  second  way  of  analyzing  solidarity  is  related 
to  its  scope:  In  other  words,  to  whom  does  it 
apply,  and  how  deeply  does  it  apply?  Or  to  what 
extent  is  the  realm  of  solidarity  and  of  those  who 
are  in  it  wide  or  narrow,  closed  or  open,  and  to 


what  extent  are  the  benefits  it  brings  about  thick 
or  thin?  To  illustrate  this  point,  it  is  helpful  to 
recall  the  distinctions  that  have  often  been  drawn 
between  traditional  and  modern  solidarity. 

In  traditional  forms  of  social  organizations, 
solidarity  connotes  a  tight  bond  among  people, 
frequently  based  on  kinship.  The  sense  of  solidar¬ 
ity  runs  deep  and  permeates  the  collectivity’s  inter¬ 
nal  relations.  The  flipside  of  this  “thick”  inward 
solidarity  tends  to  be  its  exclusive  character.  The 
translation  of  the  “us”  versus  “them”  divide  into 
the  “in”  versus  “out”  difference  has  a  heavy  bear¬ 
ing  on  who  is  a  recipient  of  solidarity  and  who  is 
not.  In  comparison,  modern  solidarity  in  its  vari¬ 
ous  forms,  springing  from  democratic  values, 
tends  to  be  wider  than  the  ties  of  family.  The 
norms  of  universality  and  equality  at  their  core 
introduce  and  call  for  a  connectedness  among 
people.  This  connectedness  goes  much  beyond  the 
boundaries  of  the  immediate  communities  in 
which  individuals  live.  As  a  result,  these  norms 
have  historically  worked  in  favor  of  a  widening 
and  deepening  of  solidarity  at  the  national  and, 
subsequently,  international  levels. 

This  is  not  to  say  that  the  disparities  between 
traditional  and  modern  solidarity  necessarily 
amount  to  absolute  differences  of  scope.  For 
example,  although  premodern  solidarity  has  the 
tendency  to  be  exclusive,  it  contains  significant 
cosmopolitan  orientations  fueled  by  philosophical 
(e.g.,  the  Stoics)  and  religious  (as  represented  by 
Saint  Paul  and  Christian  universalism)  beliefs.  Not 
surprisingly,  modern  humanism  and  democratic 
culture  built  on  these  to  develop  their  own  brand(s) 
of  solidarity.  As  for  modern  solidarity,  although  it 
was  meant  to  be  wide,  it  is  also  inclined  to  be  thin. 
As  universality  and  equality  expand  the  circle  of 
human  solidarity,  relating  to  people  and  offering 
real  solidarity  becomes  abstract,  fragile,  and  atten¬ 
uated.  Moreover,  democratic  considerations  do 
not  eliminate  the  existence  of  priority  and  hierar¬ 
chy  in  any  social  setting  and  its  selective  and  limit¬ 
ing  impact  on  the  projection  of  solidarity.  How 
could  it  be  otherwise  considering  that  priorities 
and  hierarchies  are  a  part  and  parcel  of  human  life, 
because  without  them,  there  are  no  particular 
guidelines  to  allocate  the  always  limited  resources 
at  hand?  Furthermore,  when  solidarity  takes  place 
within  the  framework  of  nationalism,  a  product 
and  expression  of  modern  democratic  culture,  it 
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can  conceive  of  and  implement  inclusiveness  in  an 
exclusive  manner.  This  may  happen  both  among 
countries  where  nationalist  solidarity  may  be  pur¬ 
sued  without  regard  for  the  welfare  of  other  states 
and  within  countries  where  dominant  groups  may 
deny  the  legitimacy  of  minorities  and  their  right  to 
observe  their  own  traditions. 

Subjective  Solidarity 

The  third  characteristic  of  solidarity,  subjective 
solidarity,  refers  to  the  requirement  that  individu¬ 
als  carry  out  actions  that  create  the  structure 
needed  for  solidarity.  They  depend  on  the  collabo¬ 
ration  of  people  to  function  and,  therefore,  need  to 
be  activated  by  people.  The  negative  effects  accom¬ 
panying  actors’  disengagement  best  illustrate  this. 
Once  people  feel  that  there  is  no  point  in  uphold¬ 
ing  a  sense  of  solidarity,  particularly  due  to  a  lack 
of  expected  reciprocation,  structural  solidarity  is 
in  danger  of  disappearing.  Exit  strategies  develop, 
each  of  them  amounting  to  a  “desolidarization” 
from  others  and  from  the  collectivity  with  which 
the  individual  had  previously  identified. 

But  subjective  solidarity  is  not  limited  to  its  con¬ 
tribution  to  structural  solidarity.  It  can  be  at  work 
in  the  absence  of  structural  solidarity  or  against  it. 
For  instance,  cosmopolitanism  and  the  sense  of 
solidarity  it  extends  to  all  individuals  certainly 
exceed  the  realms  that  solidarity  has  socially 
shaped,  be  it  within  the  primary  communities  (e.g., 
the  family)  or  at  the  national  level.  This  is  where 
solidarity  is  more — or  less,  depending  on  how  we 
see  it — than  organized  social  reality  and  can 
become  a  product  of  individual  initiative  and,  as 
such,  dependent  on  subjective  morality  and  solidar¬ 
ity.  The  individual  decides  that  solidarity  is  owed 
to  someone  regardless  of  the  existing  collectivities 
of  allegiance  and  the  discriminations  of  solidarity 
they  entail.  In  his  own  way,  Emmanuel  Levinas 
defends  a  radical  version  of  this  form  of  solidarity 
when  he  argues  that  our  fundamental  moral  situa¬ 
tion  as  people  is  not  one  of  autonomy  but  of  heter- 
onomy,  or  obedience  to  the  voice  of  the  other. 

Solidarity,  Responsibility,  and  Justice 

From  these  elements  of  solidarity,  we  can  infer 
that  it  is  closely  connected  to  the  notions  of 
responsibility  and  justice.  Concerning  solidarity 


and  responsibility,  one  of  the  highlights  is  that  to 
experience  solidarity  is  also  to  experience  respon¬ 
sibility.  In  addition,  since  responsibility  calls  for 
action,  if  not  leadership,  solidarity  as  responsibil¬ 
ity  is  neither  simply  theoretical  nor  an  end  in  itself. 
It  is  geared  toward  action,  toward  helping  others 
in  a  concrete  manner  and,  if  the  gravity  of  the  situ¬ 
ation  demands,  with  the  commitment  of  an  activ¬ 
ist.  Short  of  this,  professing  solidarity  and  doing 
next  to  nothing  is  destined  to  be  no  more  than 
hypocrisy. 

Such  a  letdown  is  all  the  more  damaging  since, 
ultimately,  solidarity  is  a  requisite  of  justice,  in 
four  principal  ways.  First,  solidarity  serves  as  a 
trigger  mechanism  vis-a-vis  the  concerns  of  justice. 
If  there  is  no  identification  and  connection  and, 
consequently,  no  solidarity  with  others,  experienc¬ 
ing  and  calling  for  justice  is  not  likely  to  happen. 
Second,  the  fact  that  solidarity  toward  others  pre¬ 
supposes  that  they  have  rights  and  that  these  have 
to  be  fulfilled  is  another  illustration  of  the  strong 
link  between  justice  and  solidarity.  Third,  as  soli¬ 
darity  stresses  the  social  conditions  of  possibility 
of  a  collectivity  and  the  social  nature  of  each  indi¬ 
vidual’s  existence  within  it,  it  necessarily  asks  for 
the  interests  of  each  to  be  taken  into  account  and 
looked  after.  Hence,  the  distributive  characteristic 
at  the  core  of  justice  is  at  the  center  of  solidarity  as 
well.  Fourth,  while  solidarity  underlines  what  is 
owed  to  others  in  general,  similar  to  justice,  it  pays 
special  attention  to  the  less  fortunate  and  to  the 
one  who  has  been  victimized. 

Challenges  of  Solidarity 

The  challenges  that  solidarity  has  to  confront  are 
not  to  be  ignored.  Its  importance  makes  them  all 
the  more  daunting.  Among  them  is  the  pressure 
that  solidarity  can  put  on  human  agency.  There  is 
also  the  difficulty  of  handling  the  conflicts  among 
solidarity  traditions.  Finally,  there  is  the  problem¬ 
atic  future  of  solidarity. 

Solidarity  and  the  Question  of  Agency 

Although  solidarity  is  essentially  viewed  as  a 
positive  phenomenon,  what  makes  it  an  asset  for 
the  collectivity  and  its  members  can  make  it  a  lia¬ 
bility.  This  Janus-faced  nature  applies  to  premod¬ 
ern  and  modern  solidarity. 
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As  noted  earlier,  solidarity  tends  to  be  about 
both  inclusion  and  exclusion.  Another  problem  is 
related  to  this  perhaps  unavoidable  ambiguity  and 
limitation  of  solidarity — the  fact  that  solidarity  has 
had  difficulty  finding  a  balance  between  too  much 
and  not  enough  and,  in  the  process,  endangering 
the  sense  of  human  agency — a  somewhat  para¬ 
doxical  turn  of  events,  since  solidarity’s  goal  is 
empowerment. 

Regarding  the  existence  of  too  much  solidarity 
and  not  enough  agency,  the  constraining  aspects  of 
primitive  forms  of  solidarity  have  at  times  led 
anthropology,  sociology,  and  political  science,  for 
example,  to  stress  that  in  these  primitive  forms, 
people  have  no  room  for  a  life  of  their  own.  To  a 
lesser  degree,  a  similar  assessment  has  been  put 
forward  concerning  how  long-established  tradi¬ 
tions  fix  collective  and  individual  existences  in 
more  or  less  unquestioned  and  difficult-to-change 
patterns.  From  the  18th  century  onward,  criticisms 
of  the  social  role  of  religion  have  frequently  taken 
place  in  this  context. 

A  classic  charge  against  modernity  has  been 
that  it  leads  to  a  very  thin  and  perhaps  inadequate 
solidarity,  with  debilitating  effects  on  human 
agency.  Two  kinds  of  shortcomings  have  been 
underlined  in  particular.  Connected  and  standing 
at  the  intersection  of  democracy  and  capitalism, 
they  point  to  the  fact  that  modern  culture  strength¬ 
ens  individual  and  collective  agency  in  some  ways 
but  weakens  it  as  well. 

The  first  alleged  shortcoming  of  modernity 
arises  from  the  individualism  of  modern  demo¬ 
cratic  solidarity.  Although  its  values  originate 
from  a  commitment  to  universality  and  equality, 
which  engineers  empowerment,  at  the  same  time, 
it  encourages  individual  and  anomic  tendencies. 
From  this  perspective,  many  have  argued  that  the 
liberal  branch  of  democratic  culture,  especially  in 
its  celebration  of  individual  liberty,  is  more  prone 
to  this  than  the  republican  one,  which  stresses  the 
dynamics  of  rights  and  duties  among  individuals. 
In  any  case,  in  the  worst  circumstances,  individuals 
can  end  up  being  loyal  to  and  in  solidarity  with  no 
one.  As  such,  they  can  be  deprived  of  the  social 
anchoring  with,  and  belonging  to,  the  world  and 
others  that  is  so  important  to  feeling  and  being  at 
home  in  one’s  life. 

A  second  shortcoming  arises  when  those  in 
charge  politically  and  economically  are  eager  to 


minimize  the  sense  of  obligation  associated  with 
democratic  values.  What  is  interesting  is  that  the 
principle  of  responsibility  and  its  individual  and 
social  virtues  are  often  put  forward  both  as  a  jus¬ 
tification  for  a  decreased  sense  of  responsibility 
and  as  an  alternative  to  redress  the  ills  that  this 
may  produce.  An  example  of  this  shortcoming  of 
modernity  can  be  seen  at  the  national  level  in  the 
United  States.  There,  a  cultural  tendency  toward 
self-reliance  is  at  times  instrumentalized  by  self- 
serving  economic  forces  to  discredit  the  idea  that 
institutions  have  a  duty  to  make  possible  and 
underwrite  social  solidarity.  The  debates  on  health 
care,  in  2009  and  later,  provide  a  case  in  point.  But 
the  reduction  of  solidarity  to  a  bare  minimum  is 
taking  place  internationally  as  well,  as  the  state  of 
international  human  rights  indicates.  To  be  sure, 
conventions  calling  for  their  defense  in  their  vari¬ 
ous  dimensions  create  a  framework  of  interna¬ 
tional  solidarity.  However,  this  sense  of  solidarity 
is  not  imperative.  Despite  the  costs  incurred  by 
people  beyond  their  borders,  countries  never  fail  to 
give  preference  to  their  national  interests  and  those 
of  their  citizens.  Their  reluctance  to  act  decisively 
to  address  the  humanitarian  crisis  of  the  1990s  and 
the  insufficient  policy  efforts  at  the  global  level  to 
overcome  widespread  poverty,  to  say  the  least,  are 
part  of  this  story.  The  absence  of  international 
mechanisms  of  accountability  to  tackle  these  limi¬ 
tations  only  expresses  and  reinforces  the  lack  of 
commitment. 

Conflicts  Among  Traditions  of  Solidarity 

The  difficulty  of  accommodating  and  reconcil¬ 
ing  the  variety  of  solidarity  traditions  is  another 
major  challenge.  Conflicts  between  these  compet¬ 
ing  traditions  concern  access  to  and  distribution  of 
resources,  on  the  one  hand,  and  redefinition  of 
identities,  collective  and  individual,  on  the  other. 

The  different  forms  of  solidarity  of  the  past  and 
the  present  and  those  constituting  the  diverse  lay¬ 
ers  of  social  life  show  that  solidarity  comes  in 
many  shapes  and  sizes.  In  addition,  as  each  of 
them  orients  people  and  collectivities  in  set  ways, 
it  is  not  easy  to  find  a  constructive  middle  ground 
between  status  quo  and  change. 

Already  in  the  West,  it  has  been  difficult.  Most 
of  the  economic,  political,  social,  and  cultural  bat¬ 
tles  related  to  the  evolution  of  Western  countries 
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have  taken  place  in  connection  with  tensions 
among  visions  of  solidarity.  The  conflicts  sur¬ 
rounding  the  various  welfare  state  models  of  the 
European  continent  and  even  more  so  between 
them  and  the  American  model  are  the  latest  illus¬ 
tration  of  this  situation.  That  being  said,  no  mat¬ 
ter  how  formidable  these  struggles  and  the  pain  of 
accommodation  have  been,  they  appear  relatively 
benign  compared  with  the  ones  generated  by  the 
clash  of  Western  and  non-Western  traditions  of 
solidarity  since  the  West  started  its  worldwide 
expansion  at  the  end  of  the  15th  century.  Over 
time,  to  formulate  and  advocate  their  claims  of 
individual  and  collective  rights  in  support  of  their 
emancipation,  non-Western  countries  have  bor¬ 
rowed  extensively  from  the  values  and  structures 
of  modern  democratic  solidarity.  Therefore,  not 
all  is  bad.  Still,  the  non-West  has  paid  a  heavy 
price  for  the  hegemonic  domination  of  the  West. 

National  alliances  with  the  West  have  often  led 
to  the  elimination  of  local  forms  of  solidarity,  with 
no  replacement  by  other  forms.  Furthermore, 
although  countries  may  have  gained  nominal  inde¬ 
pendence,  in  fact,  the  majority  of  citizens  have 
often  been  relegated  to  second-class  status.  In  the 
worst  case  scenario,  with  the  local  elite  playing  the 
double  game  of  being  Westernized  and  yet  fixated 
on  preserving  their  ancestral  privileges  and  regular 
people  left  with  no  place  to  go,  the  types  of  solidar¬ 
ity  that  had  glued  people  together  can  disintegrate. 
In  the  process,  it  can  open  the  gates  of  radicaliza- 
tion  and  violence  within  and  across  national 
boundaries.  While  gradually,  non-Western  com¬ 
munities  have  had,  nolens  volens,  to  forge  hybrid 
and  workable  new  identities,  the  trauma  and 
resentment  generated  by  cultural  dislocation  and 
alienation  have  been  deep,  with  subsequent  success 
in  development,  when  it  took  place,  easing  but  not 
erasing  these. 

The  West  has  been  equally  effective  in  impos¬ 
ing  its  vision  of  solidarity  internationally.  At  this 
level,  too,  non-Western  countries  have  made  the 
best  of  the  Western  culture  of  solidarity  expressed 
in  the  norms  of  human  rights,  self-determination, 
and  sovereign  equality  of  states,  in  particular  to 
liberate  themselves  from  the  shackles  of  colonial¬ 
ism.  Nevertheless,  beyond  this,  it  has  been  an 
uphill  battle.  Although  Western  powers  have  been 
to  some  extent  willing  to  bring  on  board  non- 
Western  traditions  and  concerns  of  solidarity  in 


the  negotiation,  conceptualization,  and  implemen¬ 
tation  of  international  norms  of  regulation,  their 
worldview  and  interests  have  prevailed  and  con¬ 
tinue  to  prevail.  This  dominance  could  be  seen  in 
the  drafting  of  the  Universal  Declaration  of 
Human  Rights  when  the  United  Nations  was  cre¬ 
ated.  Today,  the  battles  concerning  development 
and  climate  change  indicate  how  developed  coun¬ 
tries,  essentially  the  West,  continue  to  concede 
very  little  to  the  developing  countries — by  and 
large,  the  non-West. 

The  consequence  of  this  somewhat  one-sided 
story  of  solidarity  is  not  simply  that  it  undermines 
or  even  expels  non-Western  traditions  of  solidarity 
but  that  the  message  of  solidarity,  and  of  the 
democratic  solidarity  projected  by  the  West,  also 
becomes  a  contradictory  one.  It  is  made  of  and 
oscillates  between  predatory  and  humanistic  orien¬ 
tations.  This  is  not  new,  since  this  has  been  the 
trademark  of  Western  relations  with  the  rest  of  the 
world  during  more  or  less  the  past  5  centuries. 
Critics  of  Western  solidarity  call  attention  to  sev¬ 
eral  issues.  First,  they  ask,  what  kind  of  solidarity, 
especially  one  that  claims  to  be  democratic,  offers 
the  spectacle  of  people  being  consistently  inclined 
to  overlook  the  interests  and  rights  of  others? 
Second,  if  the  solidarity  is  geared  toward  monop¬ 
oly  and  exclusiveness,  the  risk  is  high  that  this  will 
result  in  a  world  without  options  and  reflexivity.  If 
only  for  this,  it  is  at  odds  with  the  commitment  to 
pluralism  at  the  core  of  modern  democratic  soli¬ 
darity.  Finally,  these  critics  argue  that  at  a  time 
when  societies  are  increasingly  not  self-sustainable 
because  they  are  increasingly  not  self-contained, 
these  are  counterproductive  ways  to  conceive  and 
implement  solidarity.  It  engenders  more  disintegra¬ 
tion  than  integration. 

The  Future  of  Solidarity 

Part  of  the  solution  for  this  problem,  and  the 
future  of  solidarity,  lies  in  establishing  relations 
and  systems  of  dependencies  among  individuals 
and  collective  actors  that  are  more  positive  and 
inclusive  than  negative  and  exclusive,  as  is  pres¬ 
ently  the  case.  This  will  not  eliminate  all  the  dilem¬ 
mas  of  solidarity,  particularly  when  these  have  to 
be  addressed  in  the  context  of  the  democratic 
demands  of  pluralism,  universality,  equality,  and 
liberty  now  serving  as  benchmarks  of  legitimacy, 
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nationally  and  internationally.  But  at  least  it 
shows  a  path  for  winning  not  at  the  expense  of 
others  but  with  and  through  taking  account  of 
others.  From  this  perspective,  inequalities  should 
never  be  so  profound  and  entrenched  that  they 
deny  countries  and  individuals  who  suffer  from 
them  the  possibility  of  overturning  them  and 
achieving  at  some  point  a  threshold  of  decent 
empowerment. 

Solidarity,  one  of  the  oldest  notions  of  social  life 
and  political  thought,  far  from  being  a  thing  of  the 
past,  is  at  the  center  of  the  future  of  individuals 
and  collectivities.  Its  relevance  endures. 

Jean-Marc  Coicaud 
United  Nations  University 
New  York  City,  New  York,  United  States 
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Sovereignty 


In  practice,  sovereignty  is  one  of  the  most  con¬ 
tested  concepts  in  political  science,  although  it  is 
conventionally  used  to  connote  supreme  authority 
within  a  given  polity.  The  meanings  of  sovereignty 
vary  widely  depending  on  the  context,  as  well  as 
on  its  inferential  connections  to  other  concepts. 
Although  long  crucial  to  modern  political  science 
and  international  relations,  the  concept  of  sover¬ 
eignty  is  notoriously  ambiguous  and  thus  hard  to 
define  with  any  precision.  During  the  past  decades, 
it  has  been  argued  that  the  concept  of  sovereignty 
has  lost  some  of  its  analytical  purchase  due  to  the 
intensified  exchange  and  increased  interdepen¬ 
dence  between  political  communities.  This  entry 
examines  issues  related  to  the  definition  of  sover¬ 
eignty,  traces  its  historical  evolution  since  the 
Middle  Ages,  and  considers  its  relevance  to  politi¬ 
cal  science  today. 

Issues  of  Definition 

The  proper  source,  locus,  and  scope  of  political 
authority  have  been  subjected  to  constant  debate 
in  the  history  of  political  thought.  From  where 
does  such  authority  derive  its  legitimacy?  With 
whom  should  it  reside?  What  are  the  proper  limits 
of  its  exercise?  While  these  are  some  of  the  ques¬ 
tions  that  any  account  of  political  authority  must 
confront,  most  modern  definitions  of  sovereignty 
imply  that  such  authority — quite  irrespective  of 
the  sources  of  its  legitimacy  and  its  exact  locus — 
must  be  both  indivisible  and  absolute  to  qualify  as 
properly  sovereign  and  nothing  else.  While  being 
indivisible  implies  that  it  must — at  least  in  the¬ 
ory — reside  with  a  single  agent  in  a  given  polity, 
being  absolute  entails  that  this  agent  either  is  sov¬ 
ereign  or  not  but  nothing  in  between.  These  formal 
characteristics  distinguish  sovereignty  from  related 
concepts  such  as  autonomy  and  independence, 
both  of  which  are  thought  to  be  matters  of  degree. 
Insofar  as  sovereignty  is  defined  in  terms  of 
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authority,  it  is  also  thereby  distinguished  from 
mere  power,  since  the  latter  concept  does  not  nor¬ 
mally  carry  the  same  connotations  of  acceptance, 
obligation,  or  obedience  as  does  authority.  Taken 
together,  these  defining  characteristics  have  raised 
the  question  whether  sovereignty  is  best  under¬ 
stood  as  a  rule  or  as  an  empirical  fact.  In  analogy 
with  grammatical  rules,  we  could  say  that  the  con¬ 
cept  of  sovereignty  contains  both  descriptive  and 
prescriptive  elements.  Sovereignty  thus  not  only 
refers  to  a  state  of  affairs  but  also  implies  that  this 
state  of  affairs  constitutes  a  norm  to  which  politi¬ 
cal  life  ought  to  conform  in  order  to  be  intelligible. 
Finally,  there  is  a  performative  dimension  to  the 
concept  of  sovereignty,  insofar  as  it  is  frequently 
used  by  various  agents  to  claim  supreme  authority 
in  different  contexts. 

Modern  theories  of  sovereignty  frequently  dis¬ 
tinguish  between  the  internal  and  external  aspects 
of  sovereignty.  Internal  sovereignty  is  attributed  to 
the  governmental  institutions  of  a  state  by  virtue  of 
fulfilling  some  or  all  of  the  above  criteria,  while 
external  sovereignty  is  attributed  to  the  state  as  a 
whole  by  virtue  of  being  recognized  as  such  by 
other  states.  Since  the  terms  of  recognition  are 
dependent  on  the  internal  attributes  of  sovereignty, 
the  requirements  of  external  sovereignty  have  var¬ 
ied  considerably  over  time.  Recognition  has  been 
granted  on  different  grounds,  ranging  from  prin¬ 
ciples  of  dynastic  succession,  via  national  self- 
determination  and  territorial  integrity,  to  the  more 
recent  requirement  that  states  should  be  governed 
according  to  democratic  principles  to  merit  inter¬ 
national  recognition. 

Historical  Evolution 

Although  quintessentially  a  modern  concept,  the 
history  of  sovereignty  can  be  traced  back  at  least 
to  the  political  theology  and  feudal  practices  of 
the  late  Middle  Ages.  At  this  point  in  time, 
supreme  authority  was  vested  in  the  person  of  the 
prince,  who  derived  his  authority  both  from  theo¬ 
logical  sources  and  from  his  relative  ability  to 
protect  his  subjects  from  internal  and  external 
enemies.  The  subsequent  articulation  of  modern 
conceptions  of  sovereignty  followed  a  series  of 
cumulative  steps. 

The  first  step  toward  the  articulation  of  a  recog¬ 
nizably  modern  conception  of  sovereignty  was 


taken  when  claims  to  supreme  authority  were  ter¬ 
ritorialized.  This  took  place  in  sharp  contrast  to 
prevailing  views  of  imperial  authority,  which 
emphasized  its  universal  and  boundless  character. 
While  the  idea  of  imperial  authority  implied  no 
restriction  on  the  scope  of  temporal  power,  rulers 
who  contested  such  imperial  claims  during  the 
13th  century  did  so  by  asserting  that  individual 
kings  ought  to  enjoy  the  same  political  and  legal 
status  within  their  domains  as  does  the  emperor 
within  the  empire.  A  decisive  break  with  the  impe¬ 
rial  order  occurred  when  the  rulers  of  the  medieval 
kingdoms  of  France  and  Naples  refused  to  recog¬ 
nize  the  authority  of  the  emperor  Henry  VII  over 
matters  considered  internal.  In  support  of  their 
claims,  they  could  draw  on  the  doctrines  of  jurists 
such  as  Andreas  de  Isernia  and  Bartolus  of 
Sassoferrato,  who  argued  that  temporal  authority 
ought  to  be  territorially  delimited. 

A  second,  and  largely  simultaneous,  step  toward 
a  modern  conception  of  sovereignty  was  taken 
with  the  gradual  depersonalization  of  political 
authority.  During  the  Middle  Ages,  authority  had 
often  been  conferred  on  the  physical  person  of  the 
prince  through  anointment  and  other  acts  of  lit¬ 
urgy  that  established  his  divine  right  to  rule. 
Depersonalization  meant  that  supreme  authority 
was  relocated  to  the  fictitious  person  of  the  crown 
and  then  transferred  to  the  royal  domain  or  to  the 
body  politic  as  a  whole.  This  transition  culminated 
in  the  creation  of  an  abstract  state  concept  during 
the  17th  century,  to  which  sovereignty  could  be 
attributed  independently  of  the  physical  person  of 
the  prince. 

The  third  step  toward  a  modern  conception  of 
sovereignty  was  taken  during  the  16th  and  17th 
centuries,  when  sovereignty  was  defined  in  terms 
of  indivisibility  and  absoluteness.  Legal  theorists 
such  as  Jean  Bodin,  Hugo  Grotius,  and  Thomas 
Hobbes  argued  that  sovereign  authority  could  not 
be  divided  within  one  and  the  same  polity  and  that 
it  is  either  attributable  to  a  given  agent  or  not.  The 
consequence  of  these  conceptual  mutations  was 
that  the  modern  concept  of  sovereignty  came  to 
connote  supreme,  indivisible,  and  absolute  author¬ 
ity  within  a  given  territory,  while  the  questions  of 
its  proper  locus  and  scope  within  each  state  was 
left  to  succeeding  generations  to  handle.  This  shift 
implied  that  the  very  identity  of  the  early-modern 
state,  along  with  the  emergent  international  society 
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of  states  in  Europe,  became  dependent  on  the  exis¬ 
tence  of  sovereign  authority  and  its  recognition  by 
others. 

The  fourth  and  final  step  toward  a  fully  modern 
conception  of  sovereignty  was  taken  when  sover¬ 
eignty  was  popularized  by  being  relocated  from 
kings  to  the  people.  From  Rousseau  onward,  the 
will  of  the  people  was  regarded  as  the  source  of 
sovereignty.  Yet  the  rise  of  popular  sovereignty  did 
little  to  challenge  the  underlying  requirements  of 
territoriality  and  indivisibility,  which  left  modern 
political  theory  with  the  problem  of  accounting 
how  the  sovereignty  of  the  people  could  be  justi¬ 
fied  in  democratic  terms.  The  final  outcome  of  this 
transition  was  the  modern  nation-state,  governed 
according  to  the  principles  of  popular  sovereignty. 
Toward  the  end  of  the  19th  century,  this  entity 
became  the  main  object  of  inquiry  for  modern  his¬ 
toriography  and  political  science. 

During  the  20th  century,  the  concept  of  sover¬ 
eignty  became  increasingly  controversial  within 
political  science.  Whereas  many  pluralists  argued 
that  the  concept  of  sovereignty  was  redundant  to 
understand  how  modern  democratic  societies  are 
governed,  Marxists  argued  that  sovereignty  was 
but  a  fiction  designed  to  conceal  the  uneven  distri¬ 
bution  of  power  and  wealth  between  classes  in 
capitalist  societies.  Still  others  argued  that  the  con¬ 
cept  of  sovereignty  should  be  banished  from  the 
vocabulary  of  political  science  because  of  its  opac¬ 
ity  and  ambiguity. 

The  Contemporary  Debate 

More  recently,  the  usefulness  of  the  concept  of 
sovereignty  has  been  contested  on  other  grounds, 
however.  At  the  heart  of  this  debate,  we  find  the 
question  whether  sovereignty  is  a  permanent  con¬ 
dition  of  political  order  or  whether  it  is  likely  to  be 
replaced  by  other  forms  of  political  authority  in 
the  near  future. 

Many  political  scientists  today  dispute  whether 
it  is  meaningful  to  speak  of  political  authority  as 
being  absolute  and  indivisible  since  the  processes 
of  European  integration  and  globalization  have 
brought  relocations  of  political  authority  to  levels 
below  as  well  as  above  that  of  the  state.  Today, 
political  authority  appears  to  be  both  relative  and 
divisible  in  character.  Consequently,  many  politi¬ 
cal  scientists  argue  that  the  concept  of  sovereignty 


has  to  be  either  redefined  to  fit  present  conditions 
better  or,  more  radically,  abandoned  altogether  in 
favor  of  other  concepts  such  as  autonomy.  Other 
critics  have  contested  the  usefulness  of  the  concept 
of  sovereignty  on  more  philosophical  grounds, 
questioning  the  ontological  status  of  this  concept 
and  its  ideological  functions  in  justifying  abuses  of 
power  and  international  warfare.  Many  of  these 
critics  have  argued  that  sovereignty  has  been  rei¬ 
fied  by  modern  political  science  and  international 
relations  theory.  Rather  than  simply  referring  to  a 
legal  norm  or  an  empirical  fact,  constructivists 
and  poststructuralists  have  argued  that  the  con¬ 
cept  of  sovereignty  has  been  constitutive  of  the 
modern  political  order  through  its  usage  in  politi¬ 
cal  discourse.  By  implication,  sovereignty  is 
believed  to  be  historically  contingent  and  hence 
mutable,  rather  than  a  necessary  condition  of 
political  order. 

Such  a  diagnosis  of  course  begs  the  question  of 
available  alternatives.  Like  many  earlier  attempts 
to  contest  the  concept  of  sovereignty,  the  recent 
critique  of  sovereignty  presupposes  what  it  sets  out 
to  criticize,  namely,  that  there  is,  or  at  least  has 
been,  something  like  indivisible  and  absolute 
political  authority.  The  concept  of  sovereignty, 
along  with  all  its  traditional  connotations  of 
supremacy,  territoriality,  indivisibility,  and  abso¬ 
luteness,  nevertheless  continues  to  exercise  a  pow¬ 
erful  influence  on  the  contemporary  political 
imagination,  to  the  point  that  alternatives  either 
seem  out  of  reach  or  carry  features  similar  to  that 
of  the  good  old  sovereign  state. 

This  should  be  evident  from  contemporary 
attempts  to  justify  a  relocation  of  sovereignty  to 
the  global  level,  which  despite  their  aspirations 
often  remain  premised  on  the  notion  that  political 
authority  needs  to  be  both  bounded  and  central¬ 
ized  for  a  political  community  to  be  viable.  Efence, 
symptomatically,  the  present  contestation  of  sover¬ 
eignty  seems  to  be  what  keeps  this  alive  within 
political  science  and  international  relations  theory. 
By  focusing  on  sovereignty  and  the  question  of  its 
endurance,  recent  critics  have  left  some  more  basic 
questions  of  political  authority — questions  that  the 
concept  of  sovereignty  once  was  invented  to 
address — unanswered.  To  answer  these  questions 
without  presupposing  that  political  authority  needs 
to  be  indivisible,  or  that  political  communities  have 
to  be  bounded,  remains  one  of  the  main  challenges 
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to  political  science  and  academic  international 
relations  today. 
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Spatial  Models  of  Politics 


Spatial  models  are  widely  used  by  political  scien¬ 
tists  to  analyze  decision  making  in  a  wide  range  of 
settings,  from  elections  and  legislatures,  to  gov¬ 
ernment  formation,  to  international  organiza¬ 
tions,  to  the  U.S.  Supreme  Court,  and  many  other 
things.  Spatial  models  provide  what  has  proved  to 
be  a  fruitful  way  to  think  rigorously  about  poli¬ 
tics,  grounded  in  a  simple  intuition  with  a  reach 
far  beyond  the  narrow  confines  of  professional 
political  science.  At  least  since  the  days  of  the 
French  Revolution,  indeed,  references  to  “left” 
and  “right”  and  to  changing  policy  “positions” 
have  been  a  part  of  day-to-day  political  discourse. 
The  general  public  generally  understands  what  it 
means  to  talk  about  politics  using  “spatial”  lan¬ 
guage,  as  such  language  is  not  just  a  conceit  of 
political  scientists.  In  this  entry,  mathematical  and 
empirical  approaches  to  spatial  modeling  and 


their  applications  to  various  forms  of  political 
decision  making  are  discussed. 

The  words  left  and  right  refer  in  this  context  to 
directions  along  a  “dimension”  of  political  prefer¬ 
ence.  But  one  dimension  is  rarely  enough  for  a  good 
description  of  how  real  people  think  in  a  given  set¬ 
ting.  It  is  easy  to  imagine  two  people  who  are  both 
on  the  economic  right,  with  similar  views  about 
government  intervention  in  the  economy,  who 
nonetheless  have  very  different  views  about  govern¬ 
ment  intervention  in  matters  of  personal  morality. 
One  can  think  of  views  on  matters  such  as  abortion 
and  same-sex  marriage,  for  example,  in  terms  of  a 
liberal-conservative  dimension  that  is  quite  inde¬ 
pendent  from — in  spatial  language  orthogonal 
to — the  left-right  dimension.  If  economic  and 
social  issues  are  all  that  concern  political  scientists, 
then  they  can  consider  political  preferences  to  be 
well  described  using  a  two-dimensional  space, 
spanned  by  a  left-right  economic  policy  dimension 
and  a  liberal-conservative  social  policy  dimension. 
Building  systematically  on  this  spatial  metaphor  for 
describing  political  preferences,  a  metaphor  that 
does  seem  to  be  deeply  rooted  in  real  politics, 
political  scientists  tend  to  take  one  of  two  distinct 
but  related  approaches  when  they  analyze  political 
decision  making.  These  approaches  can  be  thought 
of  as  mathematical  and  empirical. 

Mathematical  and  Empirical  Approaches 

One  can,  as  a  mathematical  construct,  think  of  an 
abstract  space  of  possible  outcomes.  An  integral 
part  of  the  formal  definition  of  individual  ratio¬ 
nality  is  having  preferences  that  rank  such  out¬ 
comes  in  a  transitive  way:  Preferring  outcome  A 
to  B  and  preferring  B  to  C  implies  preferring  A  to 
C.  Many  cost-benefit  calculations  involve  specify¬ 
ing  “how  much”  A  is  preferred  to  B,  relative  to 
how  much  B  is  preferred  to  C.  Information  or 
assumptions  about  such  matters  can  be  expressed 
as  perceived  “distances”  between  A,  B,  and  C  in 
an  assumed  “metric”  space  comprising  a  set  of 
possible  outcomes  and  a  measure  of  the  distance 
between  these. 

One  key  distinction  between  mathematical  and 
empirical  approaches  to  spatial  modeling  arises  as 
soon  as  the  metric  used  to  measure  distances 
between  outcomes  is  considered.  A  typical 
approach  by  mathematical  modelers  is  to  assume  a 
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distance  measure  that  is  analytically  tractable.  By 
far,  the  most  common  assumption  is  the  Euclidean 
metric,  familiar  to  all  who  know  elementary  geom¬ 
etry.  The  Euclidean  metric  uses  Pythagoras’s  theo¬ 
rem  to  measure  the  distance  between  two  points  in 
a  multidimensional  space  as  the  square  root  of  the 
sum  of  the  squares  of  interpoint  distances  on  each 
dimension.  If  individual  i,  with  ideal  point  xid  on 
dimension  d,  evaluates  outcome  /,  perceived  as 
having  a  position  xjd  on  dimension  d,  then  /’ s  per¬ 
ceived  Euclidean  distance  from  /,  D|,  is 

Dfi  =  ^Hd{X‘d  ~  Xjd)2 ■ 

While  Euclidean  distances  are  often  implicitly 
treated  as  being  “natural,”  they  make  a  key  empir¬ 
ical  assumption  about  how  real  people  trade  off 
distance  from  their  ideal  points  on  one  dimension 
against  distance  on  other  dimensions.  For  exam¬ 
ple,  if  important  issues  to  be  decided  concern  abor¬ 
tion  and  public  spending  and  if  some  voter  is 
evaluating  two  policy  packages,  the  Euclidean 
assumption  is  that  this  evaluation  is  not  the  aggre¬ 
gate  distance  between  packages  on  abortion  and 
public  spending  but  rather  the  square  root  of  the 
sum  of  the  squares  of  these  distances.  This  is  an 
empirical  assumption  about  how  real  people  think. 
There  is,  furthermore,  consistent  psychological 
evidence  that  if  two  dimensions  are  separable — 
that  is,  if  a  person’s  evaluations  of  outcomes  on 
one  dimension  can  be  described  without  knowing 
his  or  her  evaluations  on  the  other  dimension — 
then  decision  making  by  real  humans  is  more  real¬ 
istically  modeled  using  the  city  block  metric.  This 
assumes  that  the  perceived  distance  between  two 
outcomes  is  the  simple  sum  of  their  distances  on 
each  dimension: 

dT  =  -  xid  I 

Notwithstanding  the  evidence  that  a  city  block 
metric  may  generate  a  more  realistic  empirical  rep¬ 
resentation  of  the  preferences  of  real  humans,  it  is 
far  less  tractable  analytically.  Thus,  most  pub¬ 
lished  formal  analyses  of  spatial  models  in  a  mul¬ 
tidimensional  setting  assume  a  Euclidean  metric  or 
some  generalization  of  this  that  is  convenient  for 
manipulation  using  calculus.  This  is  an  aspect  of  a 


more  general  distinction  between  mathematical 
and  empirical  approaches  to  spatial  modeling. 

The  empirical  approach  uses  a  space  as  a  way  to 
describe  the  real  preferences  of  real  people  in  real 
settings,  not  as  a  formal  scheme  for  characterizing 
the  possible  preferences  of  possible  agents  in  possible 
settings.  In  the  empirical  approach,  in  stark  contrast 
to  the  mathematical  approach,  dimensions  of  differ¬ 
ence  in  the  outcome  space  have  substantive  meaning. 
As  opposed  to  a  horizontal  or  x  dimension,  left  and 
right  may  be  described,  for  example,  in  terms  of 
economic  policies;  similarly,  a  second  dimension 
may  be  described  in  terms  of  specific  social  or  moral 
issues.  Much  more  important  than  the  words  used  to 
describe  dimensions,  however,  the  structure  of  the 
space  is  defined  in  terms  of  structures  of  preferences 
in  real  populations.  For  example,  political  scientists 
may  be  able  to  predict  attitudes  on  same-sex  mar¬ 
riage  in  a  given  population  from  attitudes  on  abor¬ 
tion,  gun  control,  and  capital  punishment;  indeed, 
they  may  be  able  to  predict  attitudes  on  any  one 
issue  in  this  bundle  from  attitudes  on  the  other 
issues.  In  this  event,  they  can  think  of  attitudes  on 
same-sex  marriage,  abortion,  gun  control,  and  capi¬ 
tal  punishment  as  if,  in  this  population,  all  correlate 
with  an  unobservable  “latent”  dimension.  And  they 
can  give  this  latent  dimension  a  name,  such  as 
liberalism  versus  conservatism.  This  is  an  empiri¬ 
cal  description  of  how  real  people  think  in  this 
population — not  an  abstract  mathematical  charac¬ 
terization  of  how  possible  people  might  possibly 
think.  Thus,  while  it  is  theoretically  possible  to 
envisage  vegetarians  who  favor  capital  punishment 
for  humans,  if,  indeed,  most  real  vegetarians  oppose 
capital  punishment,  then  we  think  of  vegetarianism 
and  capital  punishment  as  if  they  are  both  predicted 
by  a  single  underlying  dimension.  The  same  set  of 
issue  areas  may  thus  generate  different  empirical 
spaces  for  different  human  populations  if  attitudes 
correlate  in  different  ways. 

This  focuses  attention  on  the  dimensionality  of 
policy  spaces,  which  are  fixed  by  assumption  in 
mathematical  spatial  models  but  are  the  primary 
focus  of  interest  for  empirical  models.  Empirical 
policy  spaces  are  estimated  from  data  on  real  atti¬ 
tudes  (or  assumed  from  assumed  data  on  these). 
One  way  or  another,  this  estimation  involves  data 
reduction;  information  on  a  large  number  of  spe¬ 
cific  matters  on  which  real  people  do,  or  might, 
have  preferences  is  summarized  in  terms  of  their 
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positions  on  a  much  smaller  number  of  latent 
dimensions  with  substantive  interpretations. 

There  are  many  ways  to  do  this.  One  way  is  to 
analyze  manifest  behavior,  such  as  voting  in  legis¬ 
latures.  A  political  scientist  might  think,  for  exam¬ 
ple,  that  legislators’  voting  records  reflect  their  real 
preferences  and  that  people  who  typically  vote  on 
the  same  side  of  a  set  of  motions  in  a  legislature 
have  more  similar  preferences  than  people  who 
typically  vote  on  different  sides  of  these  motions. 
Observed  behavior  on  legislative  voting  can  be 
used  to  derive  a  matrix  of  interlegislator  differ¬ 
ences;  the  political  scientist  can  then  seek  a  low¬ 
dimensional  space  that  represents  these  differences 
reasonably  well.  Alternatively,  he  or  she  can  use 
surveys  to  ask  politicians  directly  about  their  pref¬ 
erences  on  a  range  of  issues.  Expert  surveys  can  use 
country  specialists  as  surrogates,  in  essence  asking 
them  how  politicians  would  answer  a  set  of  ques¬ 
tions  on  the  issues  if  they  were  asked  and  if  they 
were  to  answer  honestly.  Speeches  and  texts  gener¬ 
ated  by  real  politicians  can  be  analyzed  for  content 
about  the  authors’  preferences  by  human  interpre¬ 
tative  coders  or  by  automated  computational  algo¬ 
rithms.  There  are  many  potential  sources  of  raw 
data  on  the  preferences  of  real  political  actors. 

Having  collected  raw  data  on  preferences,  a  high¬ 
dimensional  data  space  can  be  reduced  using  some 
scaling  technique  to  a  low-dimensional  summary 
space  that  is  taken  as  the  most  efficient  description 
of  preferences  of  many  different  people  on  many 
different  matters.  The  most  detailed  possible  descrip¬ 
tion  of  preferences  is  always  high-dimensional — 
everyone  is  different  from  everyone  else  in  some 
way.  For  a  given  substantive  problem,  therefore, 
there  is  always  a  trade-off.  Analysts  ultimately  seek 
a  spatial  description  of  preferences  that  is  a  parsimo¬ 
nious  (low-dimensional)  summary  of  relevant  simi¬ 
larities  and  differences  between  people  yet  does  not 
miss  something  that  is  both  substantively  important 
and  critical  to  theoretical  conclusions.  This  means 
that  there  is  no  such  thing  as  the  abstract  dimension¬ 
ality  of  the  structure  of  preferences  in  a  given  set¬ 
ting.  In  this  sense,  there  is  no  “one  true  space”  to 
be  discovered  if  only  political  scientists  could  find 
it.  For  any  substantive  problem,  there  are  high¬ 
dimensional  background  data  that  can  be  collected 
on  “raw”  attitudes  in  the  real  population  under 
investigation.  There  are  also  “metadata,”  which 
cannot  be  collected,  on  potential  attitudes  on 


matters  that  have  yet  to  arise  and  cannot  yet  be  con¬ 
ceived.  Given  the  data  on  attitudes,  there  is  a  best 
estimate,  d,  of  the  dimensionality  of  the  relevant 
policy  space  (with  d  lying  somewhere  between  zero 
and  infinity)  that  is  the  most  useful  low-dimensional 
summary  of  preferences  for  the  problem  at  hand. 
Once  the  problem  is  specified,  scaling  techniques 
that  reduce  high-dimensional  data  spaces  to  low¬ 
dimensional  summaries  typically  have  measures  of 
“fit”  that  show  how  much  variation  in  the  raw  data 
is  captured  by  a  given  low-dimensional  description. 
Such  measures  help  the  analyst  make  substantive 
judgments,  for  example,  that  the  best  spatial  repre¬ 
sentation  in  a  given  setting  is  two-dimensional — in 
the  sense  that  a  two-dimensional  representation  fits 
the  data  much  better  than  a  one-dimensional  repre¬ 
sentation  but  adding  a  third  dimension  does  not 
yield  much  improvement  in  fit  in  relation  to  a  given 
problem.  Notwithstanding  this,  the  ultimate  deci¬ 
sion  on  the  dimensionality  of  an  empirical  space  is  a 
judgment  call  for  the  analyst,  who  in  a  sense  is 
always  an  activist  in  imposing  some  low-dimensional 
representation  on  high-dimensional  data  about 
preferences. 

Implications 

The  dimensionality  of  a  given  policy  space,  a  fun¬ 
damentally  empirical  matter,  has  profound  impli¬ 
cations  for  abstract  mathematical  models.  This  is 
because  logical  inferences  derived  from  formal 
spatial  models  of  political  decision  making,  and 
especially  of  voting,  depend  critically  on  the 
dimensionality  of  the  assumed  outcome  space  as 
well  as  on  other  detailed  assumptions  about  the 
preferences  of  real  humans.  One  well-known  rea¬ 
son  for  this  arises  from  the  possibility  of  voting 
cycles  over  outcomes  in  multidimensional  spaces, 
whereby  A  beats  B,  B  beats  C,  and  C  beats  A  in 
majority  votes.  While  the  famous  formal  proofs  of 
this  are  known  as  the  chaos  results ,  the  substantive 
implication  in  such  settings  is  not  “chaos”  but  that 
“something  else,”  over  and  above  unconstrained 
majority  voting,  is  needed  before  models  can  make 
unambiguous  predictions  about  outcomes. 
Mathematical  models  of  voting  in  multidimen¬ 
sional  outcome  spaces,  therefore,  tend  to  be  very 
sensitive  to  precise  substantive  assumptions  about 
the  local  institutional  setting  under  investigation — 
assumptions  that  may  drastically  narrow  both 
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their  formal  generality  and  their  empirical  applica¬ 
bility — and  hence  their  susceptibility  to  scientific 
testing. 

Spatial  representations  of  preferences  over  out¬ 
comes,  whether  assumed  arbitrarily  or  estimated 
empirically,  are  not  of  course  models  in  themselves; 
they  are  key  inputs  to  such  models.  Thus,  there  is 
no  such  thing  as  “the”  spatial  model  of  voting,  of 
legislative  behavior,  or  of  government  formation. 
There  are  many  different  spatial  models  of  these 
important  political  phenomena,  each  model  build¬ 
ing  on  some  spatial  characterization  of  preferences 
by  making  precise  behavioral  assumptions  about 
decision-making  agents  and  the  institutional  setting 
in  which  their  decisions  are  manifested.  What 
makes  all  these  very  different  models  spatial  is  that 
the  utility  functions  of  key  agents,  which  may  of 
course  include  many  different  things,  all  include 
terms  that  compare  expected  utilities  deriving  from 
different  points  in  the  outcome  space.  Relative 
utilities  of  pairs  of  outcomes  are  described  in  terms 
of  perceived  distances  between  these  points  and  an 
additional  assumption  about  how  the  agent  feels 
about  “policy  loss.”  Let  LL  be  the  utility  derived 
by  agent  i  from  outcome  j  and  A;/  be  some  vector 
of  other  sources  of  agent  utility,  perhaps  the  cha¬ 
risma  of  some  political  candidate  or  a  bribe  paid, 
that  have  nothing  to  do  with  policy  preferences. 
It  is  conventional  to  assume  that  agent  utilities 
decline  as  a  function  of  increasing  policy  dis¬ 
tance  between  an  ideal  point  and  the  outcome 
being  evaluated.  This  function  is  often  assumed 
to  be  linear  (U,j  =  A;;  —  D;/)  or  quadratic 
( U,,  —  Aij  —  Djj).  As  with  the  choice  of  metrics,  the 
choice  of  loss  functions  is  often  made  on  the  basis 
of  analytical  convenience  rather  than  empirical 
evidence  about  how  real  people  actually  think.  The 
assumption  of  quadratic  loss  is  the  more  common 
and  is  occasionally  justified  as  an  empirical  assump¬ 
tion  that  real  humans  are  risk-averse,  though  hard 
evidence  about  risk  aversion  in  the  real  word  typi¬ 
cally  concerns  economic  decision  making,  and 
there  is  very  little  evidence  one  way  or  the  other 
concerning  whether  real  people  tend  to  be  risk- 
averse  when  making  political  decisions.  It  seems 
equally  plausible  to  argue  that  voters,  for  example, 
are  more  sensitive  to  policy  differences  between 
two  parties  whose  policies  are  close  to  their  own 
ideal  points  than  they  are  to  the  differences 
between  two  parties  whose  policy  positions  are 


very  remote,  yet  this  runs  counter  to  the  assump¬ 
tion  of  quadratic  loss.  For  this  reason,  a  number  of 
spatial  models  of  political  decision  making  assume 
linear  rather  than  quadratic  policy  loss  in  the  agent 
utility  functions  that  they  specify,  although  justifi¬ 
cations  for  this,  when  explicit,  tend  to  be  casual. 

Model  Types 

Within  the  general  spatial  modeling  framework  set 
out  above,  there  is  huge  variation  in  the  type  of 
model  one  encounters.  One  famous  application, 
associated  with  the  name  of  Anthony  Downs, 
though  predating  his  work,  concerns  voting  in  elec¬ 
tions.  A  spatial  model  of  voting  in  elections  might 
assume  an  exogenously  fixed  institutional  setting 
of  an  election  to  one  single-seat  district  with  a 
simple  plurality  formula  for  mapping  votes  cast 
into  seats  won.  It  might  assume  two  species  of 
agents — politicians  and  voters — with  an  exoge¬ 
nously  given  set  of  politicians  assumed  to  prefer 
winning  the  support  of  as  many  voters  as  possible 
and  with  voters  assumed  to  prefer  voting  for  the 
politician  who  promotes  the  policy  package  that 
seems  to  them  to  be  closest  to  their  own  ideal  policy. 
These  assumptions  define  premises  for  one  of  many 
possible  spatial  models  of  electoral  competition — in 
this  case,  one  possible  model  of  presidential  elec¬ 
tions  under  plurality  rule.  If  voters’  preferences  on 
all  relevant  issues  can  be  represented  using  a  single 
latent  dimension  and  if  politicians  have  good  infor¬ 
mation  about  these  preferences,  then  such  a  model 
generates  the  well-known  implication  that,  in  the 
absence  of  some  other  assumption,  the  policy  posi¬ 
tions  of  both  candidates  will  tend  to  converge  on  the 
ideal  point  of  the  median  voter,  who  in  this  sense 
will  be  pivotal  in  deciding  the  election  result.  The 
reader  will  have  noticed  a  lot  of  ifs  in  this  informal 
statement  of  the  model,  however,  hinting  at  the 
many  possible  spatial  models  of  this  one  single 
political  process. 

Quite  different  spatial  models  may  be  generated 
by  different  institutional  settings.  Such  models 
might  allow  the  strategic  “entry”  of  new  candidates 
into  the  competition.  They  might  assume  a  diverse 
set  of  single-seat  districts,  in  which  candidates  from 
the  same  party  must  all  set  the  same  policy  posi¬ 
tion.  Given  some  assumed  structure  of  political 
communication,  they  might  assume  that  politicians 
can  set  different  policy  positions  in  different 
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districts.  They  might  also  assume  the  following:  a 
diverse  set  of  multiseat  districts,  with  exogenous  or 
maybe  endogenous  boundaries,  and  various,  maybe 
endogenous,  electoral  formulae  used  to  translate 
votes  cast  into  seats  won.  Different  models  may 
make  different  behavioral  assumptions  about  poli¬ 
ticians — for  example,  that  politicians  are  motivated 
to  enact  policies  rather  than  maximize  votes  or 
have  some  specified  trade-off  between  these  objec¬ 
tives.  Also,  it  may  be  assumed  that  voters  are  moti¬ 
vated  to  have  an  impact  on  the  composition  of  the 
eventual  government  rather  than  supporting  their 
closest  party,  or  some  trade-off  may  be  assumed 
between  these  objectives.  There  is  thus  no  limit  to 
the  number  of  possible  spatial  models  of  voting  in 
elections.  A  particular  model  is  identified  by  its 
explicit  (and  often  implicit)  assumptions  about 
agent  behavior  and  the  institutions  structuring  this. 

Moving  beyond  voting  in  elections,  character¬ 
ized  by  elite-mass  interactions  between  a  small  set 
of  candidates  and  a  large  set  of  voters,  spatial  mod¬ 
els  have  also  been  applied  to  elite-elite  interactions 
such  as  voting  in  committees  and  legislatures  and  in 
the  U.S.  Supreme  Court  as  well  as  bargaining 
between  party  leaders  over  government  formation 
in  parliamentary  democracies.  Such  high-stakes 
settings  with  interactions  between  small  numbers 
of  sophisticated  agents  may  well  imply  different 
empirical  assumptions  about  the  following:  how 
much  each  agent  knows  about  the  preferences  of 
all  others,  whether  all  agents  perform  the  same  type 
of  complex  strategic  calculation  about  matters  that 
may  or  may  not  be  important  to  them,  and  how 
much  agents  focus  on  optimizing  decisions  on  the 
problem  under  investigation,  such  as  voting  on 
some  issue,  when  they  must  in  reality  make  many 
decisions  on  many  different  matters.  Voting  in  elec¬ 
tions  may  be  important,  but  not  that  important,  for 
voters,  whereas  bargaining  between  seasoned  poli¬ 
ticians  over  who  becomes  the  prime  minister  is 
likely  the  most  important  deal  they  ever  do. 

In  each  of  these  cases,  there  remains  a  sharp 
analytical  distinction  between  spatial  models  that 
make  the  assumption  that  real-world  preferences 
on  all  important  matters  can  be  well  represented 
by  a  single  latent  dimension  and  those  that  assume 
that  more  than  one  dimension  is  needed  for  a  real¬ 
istic  representation  of  preferences.  One-dimensional 
spatial  models  tend  to  be  analytically  tractable, 
even  if  they  can  sometimes  be  difficult  to  resolve. 


They  are  thus  appealing  to  formal  theorists,  who 
would  be  the  first  to  acknowledge  that  the  assump¬ 
tion  of  a  one-dimensional  preference  structure  is 
substantively  unrealistic.  Many  published  spatial 
models  are,  for  this  reason,  one-dimensional,  and 
the  claim  is  typically  that,  even  when  patently 
unrealistic,  they  are  in  some  sense  aids  to  intuition. 
Multidimensional  models,  while  of  their  essence 
offering  greater  potential  realism  with  regard  to 
descriptions  of  how  real  people  think,  tend  to  be 
analytically  intractable.  One  response  to  this  is  to 
constrain  the  model  with  further  strong  institu¬ 
tional  assumptions  that  make  it  tractable  while  at 
the  same  time  reducing  its  generality.  The  other 
solution  is  to  investigate  the  model  using  carefully 
designed  suites  of  computer  simulations. 

Conclusion 

Spatial  modeling  is  a  general  approach  to  analyz¬ 
ing  political  decision  making.  The  more  abstract 
and  formal  the  approach,  the  more  the  resulting 
models  are  about  decision  making  in  general.  The 
more  empirical  the  approach,  the  more  they  are 
about  decision  making  by  human  beings  in  settings 
where  data  could  possibly  be  collected.  Within  this 
general  approach,  there  is  no  single  spatial  model 
of  decision  making,  by  either  abstract  agents  or 
real  humans,  in  any  specified  setting.  The  consider¬ 
able  intellectual  attraction  of  the  spatial  modeling 
approach  is  that  it  offers  a  conceptual  language 
that  allows  different  modeling  assumptions  to  be 
compared  in  relatively  explicit  ways.  The  spatial 
modeling  approach  thus  provides  a  set  of  building 
blocks  for  different  models  of  important  political 
processes  but  leaves  all  important  substantive 
assumptions  firmly  in  the  hands  of  the  modeler. 
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Stability 


Political  stability  is  an  elusive  concept,  one  that  is 
difficult  to  define  and  operationalize.  In  simplest 
terms  (and,  as  will  be  seen,  the  concept  simplest 
terms  can  itself  be  misleading),  stable  political 
systems  are  those  in  which  governance  and  the 
transfer  of  power  occur  in  regular,  predictable 
ways.  When  we  think  of  stable  political  systems, 
therefore,  we  think  of  those  that  hold  regular  elec¬ 
tions  and  incumbents  who  vacate  offices  and 
allow  opponents  to  take  over  those  offices.  We 
also  envision  systems  in  which  protests  are  not 
widespread,  especially  violent,  or  challenging  to 
the  basic  system  of  governance.  As  is  evident  from 
this  definition,  however,  stable  systems  that  are 
not  democratic  would  seem  to  be  excluded  from 
it.  Yet  dictatorships  can  last  for  decades  and,  thus, 
in  this  sense,  are  stable.  Consequently,  we  must 
acknowledge  that  the  proposed  definition  is  prob¬ 
lematic,  and  so  we  need  to  think  more  deeply 
about  the  topic.  This  entry  is  consequently  focused 
on  the  three  questions  that  should  be  addressed 
when  dealing  with  this  topic:  how  to  define,  how 
to  achieve,  and  how  to  measure  stability. 

Defining  Stability 

Two  primary  difficulties  confront  us  when  we 
think  more  deeply  about  the  meaning  of  political 
stability.  First,  while  most  journalistic  and  some 
academic  accounts  treat  stability  as  a  dichotomous 
variable,  meaning  that  a  given  political  system  is 
either  “stable”  or  “unstable,”  on-the-ground  real¬ 
ity  is  quite  different.  Only  at  the  most  basic  level  is 
such  a  dichotomy  justifiable.  For  analytical  pur¬ 
poses,  however,  there  is  widespread  recognition 
among  experts  that  political  stability  in  the  real 
world  covers  a  very  broad  continuum.  There  are 
some  political  systems  that  are  chronically  unsta¬ 
ble,  with  elections  occurring  irregularly  (if  at  all), 
and  in  which  the  outcome  of  elections  is  often 
contested  and  likely  to  lead  to  mass  protests,  elite 


subversion,  or  both.  On  the  other  hand,  there  are 
systems  in  which  regular,  predictable  political  pro¬ 
cesses  are  almost  always  the  norm.  Finally,  there  is 
a  wide  range  between  these  two  extremes  and 
cases  of  nations  that  fluctuate  between  stable, 
unstable,  and  everything  in  between. 

Political  science  has  recognized  the  continuous 
nature  of  political  stability  and  as  a  result  has 
accepted  the  wide  variation  in  degrees  of  stability 
as  an  important  variable  to  be  studied  in  compara¬ 
tive  politics  and  international  relations.  Indeed, 
there  often  exist  regions  within  a  given  country 
that  exhibit  more  stability  than  others.  Far  less 
consensus  emerges  on  the  normative  benefits  of 
stability,  which  leads  to  the  second  and  more  vex¬ 
ing  challenge  in  coming  to  grips  with  the  term 
stability. 

Is  stability  a  good  thing,  and  if  so,  is  more  of  it 
always  better?  Consider  the  argument  in  favor  of  a 
state  of  continual  instability.  This  position  was  per¬ 
haps  most  strongly  and  dramatically  argued  by 
China’s  Mao  Zedong.  From  Mao’s  perspective, 
China  (and  indeed  all  Marxist  regimes)  benefited 
from  a  state  of  constant  revolution;  in  the  absence  of 
such  revolution,  bourgeois  forces  could  threaten  to 
reinstate  capitalism.  As  a  result,  Mao  plunged  China 
into  the  Cultural  Revolution  in  1966,  producing 
social  unrest,  violence,  and  purges.  From  Mao’s 
point  of  view,  however,  and  from  that  of  the  intel¬ 
lectual  leadership  of  the  Red  Guards  who  held  sway 
during  this  period,  instability  was  to  be  institutional¬ 
ized  as  the  new  “normal”  in  order  to  prevent  the 
communist  revolution  from  being  extinguished. 
Only  with  the  death  of  Mao  and  the  arrest  of  the 
so-called  Gang  of  Four  in  1976  did  this  protracted, 
nonstop  “revolution”  come  to  an  end. 

Whereas  for  Maoists  political  instability  was  a 
cherished  ideal,  for  other  regimes,  whether  on  the 
right  or  the  left,  instability  of  any  kind  has  been 
seen  as  an  existential  threat.  Cases  on  the  left 
include  Cuba  under  Fidel  Castro,  a  country  ruled 
for  nearly  50  years  by  one  man  until  his  health 
failed,  and  the  case  of  North  Korea  under  the  rule 
of  Kim  II-sung,  who  ran  his  country  from  1948 
until  1994.  At  the  opposite  extreme,  are  the  cases  of 
Alfredo  Stroessner,  who  ruled  Paraguay  from  1954 
until  1989,  and  the  Somoza  family  dynasty,  which 
controlled  Nicaragua  from  the  1930s  until  their  fall 
in  the  Sandinista  Revolution  in  1979.  These 
and  similar  regimes  see  any  form  of  instability  as 
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threatening  and  have  routinely  imprisoned  and/or 
killed  anyone  who  has  challenged  them.  In  these 
cases,  then,  extreme  stability  is  clearly  not  a  posi¬ 
tive  state  from  most  normative  perspectives. 

Even  when  we  move  away  from  such  cases  of 
extreme  stability  or  extreme  instability,  it  is  unclear 
how  much  stability  is  normatively  ideal.  From  an 
instrumental  point  of  view,  however,  it  does  seem 
clear  that  the  extremes  are  highly  dysfunctional  in 
a  variety  of  ways.  At  the  most  basic  level,  human 
life  is  often  threatened  when  instability  is  high.  The 
great  famine  that  affected  China  from  1958  to 
1961,  in  which  at  least  15  million  people  are  said 
to  have  died,  is  an  illustration  of  the  negative 
impact  of  such  instability.  Beyond  death  from  fam¬ 
ine,  instability  also  affects  health  more  generally, 
as  well  as  education,  since  schools  and  universities 
are  often  shuttered  in  conditions  of  permanent 
revolution.  On  the  economic  side,  financial  invest¬ 
ment  is  almost  certainly  attenuated  in  countries 
that  exhibit  high  levels  of  instability,  as  the  risk 
premium  is  too  great  for  all  but  the  most  risk- 
tolerant  investors.  In  addition,  political  careers  are 
hard  to  shape  and  plan  for  when  instability  is  high, 
since  the  rules  of  the  game  are  constantly  in  ques¬ 
tion  and  subject  to  change.  Young  people  who 
aspire  to  a  political  career  can  find  themselves  with 
their  political  parties  banned  or  the  political  office 
to  which  they  aspire  eliminated. 

Extreme  political  stability,  on  the  other  hand, 
produces  other  costs.  Dictatorships  discourage 
investors  because,  although  the  regime  itself  may 
be  stable,  its  decisions  are  often  unpredictable, 
since  they  depend  on  the  whim  of  a  single  indi¬ 
vidual  or  clique  of  noncompetitively  selected 
rulers.  Investors  can  find  themselves  with  their  fac¬ 
tories,  farms,  or  banks  expropriated  virtually  over¬ 
night.  Recourse  to  the  courts  is  futile,  as  the  rule  of 
law  is  no  more  than  what  the  dictator  declares  it  to 
be.  Innovation  is  stifled  in  such  regimes,  since  there 
are  few  incentives  for  individuals  to  be  creative. 
Indeed,  dictatorships  inculcate  fear  in  those  who 
wish  to  accomplish  any  sort  of  change,  since  such 
change  will  be  seen  as  a  challenge  to  the  status  quo. 
Political  careers  are  often  truncated  by  purges,  as 
parties  or  families  fall  out  of  favor. 

In  sum,  it  would  appear  that  as  one  moves  to  the 
extremes  of  stability  or  instability,  the  costs  grow 
exponentially,  suggesting  that  the  gains  for  stability 
may  conform  to  a  quadratic  form  with  the  highest 


returns  of  all  kinds  (human,  economic,  and  politi¬ 
cal)  lying  at  the  top  of  an  inverted  U  curve.  In 
effect,  it  would  appear  that  the  ancient  Greek  phi¬ 
losophers  had  got  it  right — Aristotle  criticized 
Sparta  in  the  Politics  for  its  exclusive  focus  on  mak¬ 
ing  war  and  its  lack  of  attention  to  making  peace. 

Achieving  Stability 

The  key  question  in  the  literature  of  contemporary 
political  science  is  how  to  achieve  stability.  Much  of 
that  literature  has  focused  one  way  or  the  other  on 
the  link  between  political  legitimacy  and  political 
stability.  Most  modern  theorizing  goes  back  to  Max 
Weber  in  his  classic  work  Politics  as  a  Vocation. 
Weber  saw  three  types  of  legitimacy:  charismatic, 
traditional,  and  rational-legal,  only  the  last  being  a 
stable  form.  In  Political  Man ,  Seymour  Lipset  drew 
on  Weber  and  hypothesized  that  stable  democracies 
are  achieved  when  political  systems  are  seen  by  their 
citizens  as  being  effective  at  delivering  public  ser¬ 
vices  and  are  held  accountable  for  their  actions. 
Lipset  as  well  as  David  Easton  realized  that  regimes 
that  achieved  these  goals  would  produce  citizens 
uninterested  in  violent  or  radical  change.  Moreover, 
such  systems  would  be  better  equipped  than  others 
to  weather  the  storm  of  periods  of  especially  poor 
performance.  Hence,  legitimate  systems  can  over¬ 
come  sharp  economic  downturns  because  they  have 
built  up  a  reservoir  of  political  support,  and  citizens 
are  prepared  to  grant  them  significant  degrees  of 
freedom  to  “fix  up  the  mess.” 

Beyond  legitimacy,  stability  also  seems  to  depend 
on  reasonable  equality  in  the  distribution  of  wealth. 
Considerable  literature,  beginning  with  Aristotle 
and  continuing  into  modern  political  science,  has 
linked  instability  to  inequality.  Academic  disputes 
have  emerged,  however,  as  to  how  to  measure 
inequality  and  whether  scholarly  attention  should 
be  focused  on  income  inequality,  wealth  inequality, 
or  land  inequality  (see,  e.g.,  the  works  of  Manus 
Midlarsky,  and  Edward  Muller  and  Mitchell 
Seligson  listed  in  the  Further  Readings). 

Measuring  Stability 

Stability  has  been  measured  extensively  by  data¬ 
bases  in  political  science  that  have  counted  the 
number  of  coups,  revolutions,  insurrections,  vio¬ 
lent  protests,  and  so  on.  Debates  center  on  the 
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significance  and  weighting  of  each  of  these  events: 
Do  10  violent  protests  add  up  to  the  equivalent  of 
one  attempt  at  insurrection,  and  how  can  one  mea¬ 
sure  these  events?  The  empirical  work  on  factors 
related  to  legitimacy,  which  in  turn  would  lead  to 
stability,  has  been  extensive,  but  much  of  it  has  suf¬ 
fered  from  a  narrow  empirical  perspective.  Here,  as 
Arthur  Miller  notes,  the  “trust  in  government” 
paradigm  has  held  sway,  in  which  a  small  number 
of  items  in  surveys  measure  the  extent  to  which  citi¬ 
zens  trust  their  government  “to  do  the  right  thing.” 

More  recently,  the  conceptualization  of  legiti¬ 
macy  has  been  broadened  considerably.  One 
approach,  as  noted  by  Bruce  Gilley,  includes  mea¬ 
sures  based  on  public  opinion  (from  the  World 
Values  survey)  as  well  as  objective  indicators,  such 
as  tallies  of  protests.  Another  approach  argues  that 
legitimacy  is  fundamentally  an  attitudinal  con¬ 
struct  and  looks  at  it  as  the  extent  to  which  citizens 
express  various  degrees  of  their  support  for  their 
political  systems.  Surveys  are  ideal  for  measuring 
this  notion  of  legitimacy.  A  six-dimensional  system 
developed  and  tested  by  John  Booth  and  Mitchell 
Seligson  (2009)  in  a  number  of  countries  was 
found  to  powerfully  predict  political  stability.  This 
approach,  rather  than  making  stability  part  of  the 
legitimacy  phenomenon  itself,  makes  legitimacy  a 
predictor  of  it. 

Important  research  frontiers  lie  ahead.  We  do 
not  know  if  there  are  important  worldwide  trends 
in  stability.  The  widespread  movement  toward 
democracy  with  the  end  of  the  Cold  War  has  pro¬ 
duced  some  stable  regimes,  but  it  has  also  been 
associated  with  expanding  conflicts  in  others.  It 
remains  to  be  seen  if  the  world  as  a  whole  is  mov¬ 
ing  toward  more  stable  or  less  stable  polities. 

Mitchell  A.  Seligson 
Vanderbilt  University 
Nashville,  Tennessee,  United  States 

See  also  Democracy,  Theories  of;  Equality;  Legitimacy; 
Political  Systems,  Types;  Support,  Political;  Weber,  Max 

Further  Readings 

Booth,  J.  A.,  &  Seligson,  M.  A.  (2009).  The  legitimacy 
puzzle  in  Latin  America:  Democracy  and  political 
support  in  eight  nations.  Cambridge,  UK:  Cambridge 
University  Press. 


Gilley,  B.  (2009).  The  right  to  rule:  How  states  win  and 
lose  legitimacy.  New  York:  Columbia  University  Press. 

Midlarsky,  M.  (1988).  Rulers  and  the  ruled:  Patterned 
inequality  and  the  onset  of  mass  political  violence. 
American  Political  Science  Review,  82,  491-510. 

Miller,  A.  H.  (1974).  Political  issues  and  trust  in 

government.  American  Political  Science  Review,  68, 
951-972. 

Muller,  E.  N.,  &  Seligson,  M.  A.  (1987).  Insurgency  and 
inequality.  American  Political  Science  Review,  81, 
425-451. 


Stages  Model  of 
Policy  Making 


A  common  method  of  examining  the  operation  of 
a  policy  process  is  to  break  it  down  into  several 
stages  and  substages.  Each  of  these  can  then  be 
analyzed  separately,  and  commonalities  across 
stages  can  also  be  identified.  Over  the  course  of 
the  history  of  the  policy  sciences,  several  distinct 
models  of  the  policy  process  were  put  forward 
before  the  current  five-stage  model  of  agenda  set¬ 
ting,  policy  formulation,  decision  making,  policy 
implementation,  and  policy  evaluation  was  settled 
on.  The  evolution  of  these  models  and  their 
advantages  and  disadvantages  as  analytical  and 
empirical  constructs  are  set  out  below. 

One  of  the  most  popular  means  for  simplifying 
public  policy  making  has  been  to  disaggregate  the 
process  into  a  series  of  discrete  stages  and  sub¬ 
stages.  The  resulting  sequence  of  stages  is  referred 
to  as  the  policy  cycle  or  stages  model  of  policy. 
This  simplification  has  its  origins  in  the  earliest 
works  on  public  policy  analysis  but  has  received 
somewhat  different  treatment  in  the  hands  of  dif¬ 
ferent  authors. 

The  idea  of  simplifying  the  complexity  of  public 
policy  making  by  breaking  the  policy-making  pro¬ 
cess  down  into  a  number  of  discrete  stages  was  first 
broached  in  the  early  work  of  Harold  Lasswell,  one 
of  the  founders  of  the  policy  sciences.  Lasswell 
divided  the  policy  process  into  seven  stages:  (1)  intel¬ 
ligence,  (2)  promotion,  (3)  prescription,  (4)  invoca¬ 
tion,  (5)  application,  (6)  termination,  and  (7)  appraisal. 
In  Lasswell’s  view,  the  seven  stages  described  not 
only  how  public  policies  were  actually  made  but  also 
how  they  should  be  made.  The  policy  process  began 
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with  intelligence  gathering — that  is,  the  collection, 
processing,  and  dissemination  of  information  for 
those  who  participate  in  the  decision  process.  It 
then  moved  to  the  promotion  of  particular  options 
by  those  involved  in  making  the  decision.  In  the 
third  stage,  the  decision  makers  actually  pre¬ 
scribed  a  course  of  action.  In  the  fourth  stage,  the 
prescribed  course  of  action  was  invoked;  a  set  of 
sanctions  was  developed  to  penalize  those  who 
failed  to  comply  with  the  prescriptions  of  the  deci¬ 
sion  makers.  The  policy  was  then  applied  by  the 
courts  and  the  bureaucracy  and  ran  its  course 
until  it  was  terminated  or  cancelled.  Finally,  the 
results  of  the  policy  were  appraised  or  evaluated 
against  the  aims  and  goals  of  the  original  decision 
makers. 

Lasswell’s  analysis  of  the  policy-making  process 
focused  on  the  decision-making  process  within 
government  and  had  little  to  say  about  external  or 
environmental  influences  on  government  behavior. 
It  simply  assumed  that  decision  making  was  lim¬ 
ited  or  restricted  to  a  presumably  small  number  of 
participants  staffing  official  positions  in  govern¬ 
ment.  Another  shortcoming  of  this  model  was  its 
lack  of  internal  logic,  especially  with  reference  to 
placing  appraisal  or  evaluation  after  termination, 
since  policies  are  usually  evaluated  prior  to  being 
wound  down  rather  than  afterward.  Nevertheless, 
this  model  was  highly  influential  in  the  develop¬ 
ment  of  a  policy  science.  Although  not  entirely 
accurate,  it  did  reduce  the  complexity  of  studying 
public  policy  by  allowing  each  stage  to  be  isolated 
and  examined  before  putting  the  whole  picture  of 
the  process  back  together. 

Lasswell’s  formulation  formed  the  basis  for  a 
model  developed  by  Gary  Brewer  in  the  early 
1970s.  According  to  Brewer,  the  policy  process  was 
composed  of  six  stages:  (1)  invention/initiation, 
(2)  estimation,  (3)  selection,  (4)  implementation, 
(5)  evaluation,  and  (6)  termination.  In  Brewer’s 
view,  invention  or  initiation  referred  to  the  earliest 
stage  in  the  sequence  when  a  problem  would  be 
initially  sensed.  This  stage,  he  argued,  would  be 
characterized  by  an  ill-conceived  definition  of  the 
problem  and  suggested  solutions  to  it.  The  second 
stage  of  estimation  concerned  calculation  of  the 
risks,  costs,  and  benefits  associated  with  each  of  the 
various  solutions  raised  in  the  earlier  stage.  This 
would  involve  both  technical  evaluation  and  nor¬ 
mative  choices.  The  objective  of  this  stage  is  to 


narrow  the  range  of  plausible  choices  by  excluding 
the  unfeasible  ones  and  to  somehow  range  the 
remaining  options  in  terms  of  desirability.  The 
third  stage  consists  of  adopting  one,  none,  or  some 
combination  of  the  solutions  remaining  at  the  end 
of  the  estimation  stage.  The  remaining  three  stages 
consist  of  implementing  the  selected  option,  evalu¬ 
ating  the  results  of  the  entire  process,  and  termi¬ 
nating  the  policy  according  to  the  conclusions 
reached  by  its  evaluation. 

Brewer’s  version  of  the  policy  process  improved 
on  Lasswell’s  pioneering  work.  It  expanded  the 
policy  process  beyond  the  confines  of  government 
in  discussing  the  process  of  problem  recognition 
and  clarified  the  terminology  in  use  to  describe  the 
various  stages  of  the  process.  Moreover,  it  intro¬ 
duced  the  notion  of  the  policy  process  as  an  ongo¬ 
ing  cycle.  It  recognized  that  most  policies  did  not 
have  a  definite  life  cycle — moving  from  birth  to 
death — but  rather  seemed  to  recur,  in  slightly  dif¬ 
ferent  guises,  as  one  policy  succeeded  another  with 
only  minor  or  major  modifications.  Brewer’s 
insights  inspired  several  other  versions  of  the  pol¬ 
icy  cycle  to  be  developed  in  the  1970s  and  1980s, 
the  most  well-known  of  which  were  set  out  in 
popular  textbooks  by  Charles  Jones  and  James 
Anderson. 

To  avoid  a  plethora  of  models  of  policy  stages 
and  their  variants,  it  is  necessary  to  clarify  the  logic 
behind  the  cycle  model.  In  the  works  of  Brewer, 
Jones,  and  others,  the  operative  principle  behind 
the  notion  is  the  logic  of  applied  problem  solving, 
The  stages  in  applied  problem  solving  and  its  cor¬ 
responding  stages  in  the  policy  process  are  depicted 
in  Table  1. 

In  this  model,  agenda  setting  refers  to  the  pro¬ 
cess  by  which  problems  come  to  the  attention  of 


Table  I  Five  Stages  of  the  Policy  Cycle  and  Their 
Relationship  to  Applied  Problem  Solving 


Phases  of  Applied 
Problem  Solving 

Stages  in  Policy  Cycle 

Problem  recognition 

Agenda  setting 

Proposal  of  solution 

Policy  formulation 

Choice  of  solution 

Decision  making 

Putting  solution  into 
effect 

Policy  implementation 

Monitoring  results 

Policy  evaluation 
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governments;  policy  formulation  refers  to  the  pro¬ 
cess  by  which  policy  options  are  formulated  within 
government;  decision  making  refers  to  the  process 
by  which  governments  adopt  a  particular  course  of 
action  or  nonaction;  policy  implementation  refers 
to  the  process  by  which  governments  put  policies 
into  effect;  policy  evaluation  refers  to  the  processes 
by  which  the  results  of  policies  are  monitored  by 
both  state  and  societal  actors,  the  result  of  which 
may  be  reconceptualization  of  policy  problems 
and  solutions. 

The  most  important  advantage  of  the  policy 
cycle  model  as  set  out  above  is  that  it  facilitates  the 
understanding  of  public  policy  making  by  break¬ 
ing  the  complexity  of  the  process  into  any  number 
of  stages  and  substages,  each  of  which  can  be 
investigated  alone  or  in  terms  of  its  relationship  to 
any  or  all  the  other  stages  of  the  cycle.  This  aids 
theory  building  by  allowing  numerous  case  studies 
and  comparative  studies  of  different  stages  to  be 
undertaken.  Another  advantage  of  the  model  is 
that  its  breadth  permits  examination  of  the  role  of 
all  actors  and  institutions  dealing  with  a  policy, 
not  just  those  governmental  agencies  formally 
charged  with  the  task. 

The  principal  disadvantage  of  the  stages  model 
is  that  it  can  be  misinterpreted  as  suggesting  that 
policymakers  go  about  solving  public  problems  in 
a  very  systematic  and  more  or  less  linear  fashion. 
This  is  not  the  case  in  reality,  as  the  identification 
of  problems  and  the  development  and  implementa¬ 
tion  of  solutions  are  often  an  ad  hoc  and  idiosyn¬ 
cratic  process.  Decision  makers  often  simply  react 
to  circumstances  and  do  so  in  terms  of  their  inter¬ 
ests  and  preset  ideological  dispositions.  Another 
problem  with  the  model  is  that  while  the  logic  of  the 
policy  cycle  may  be  fine  in  the  abstract,  in  practice, 
the  stages  are  often  compressed  or  skipped,  or  they 
are  followed  in  an  order  unlike  that  specified  by 
the  logic  of  applied  problem  solving.  Thus,  the 
cycle  may  not  be  a  single  iterative  loop  but  rather 
a  series  of  smaller  loops  in  which,  for  example,  the 
results  of  past  implementation  decisions  have  a 
major  impact  on  future  policy  formulation,  regard¬ 
less  of  the  specifics  of  the  agenda-setting  process  in 
the  case  concerned.  In  short,  there  is  often  no  lin¬ 
ear  progression  of  a  policy  as  conceived  by  the 
model.  Third,  and  perhaps  most  important,  the 
model  lacks  any  notion  of  causation.  It  offers  no 
pointers  as  to  what  or  who  drives  a  policy  from 


one  stage  to  another,  a  matter  of  crucial  interest 

to  scholars  working  on  the  subject. 

Michael  Howlett 
Simon  Fraser  University 
Burnaby,  British  Columbia,  Canada 
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Stalinism 


The  term  Stalinism  in  its  proper  sense  denotes  the 
essence  of  a  specific  social  system  that  emerged  in 
the  Soviet  Union  in  the  late  1920s  under  the  lead¬ 
ership  of  Premier  Joseph  Stalin.  The  term  is  also 
used  to  refer  to  (a)  a  specific  way  of  behavior  and 
thinking  dominant  in  Soviet  politics  from  the  late 
1920s  to  the  mid-1950s  or  influenced  by  and 
similar  to  it,  (b)  a  corresponding  ideology  or  class 
of  ideologies  justifying  Stalin’s  policies,  or  (c)  con¬ 
stitutional  arrangements  and  institutional  designs 
typical  of  or  similar  to  those  developed  under 
Stalin’s  leadership. 

One  of  the  first,  if  not  the  first,  documented 
usage  of  the  word  Stalinism  appeared  in  an  article 
by  Karl  Radek  published  in  December  1934  in 
Pravda,  where  he  introduced  the  expression 
Marxism-Leninism-Stalinism.  This  initiative  was 
short-lived.  Neither  Stalin  himself  nor  party  ideol¬ 
ogists  used  the  word,  preferring  Marxism-Leninism 
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as  the  title  of  their  ideology  and  socialism  as  a 
typical  description  for  the  social  order  developed 
in  the  Soviet  Union.  Since  the  mid-1950s,  Stalinism 
essentially  has  become  a  derogatory  term.  Even  its 
champions — with  very  few  exceptions  (see 
Sergeev’s  1999  apology  for  “mystical  Stalinism”) — 
avoid  using  the  word.  Furthermore,  a  word 
Stalischina  emerged  in  Russian  with  essentially  the 
same  meaning  as  Stalinism  but  with  much  more 
offensive  connotations. 

Traditionally,  interpretations  of  Stalinism  vary 
between  a  limited  focus  on  highly  specific  charac¬ 
teristics  of  Soviet  politics,  often  attributed  to 
Stalin’s  weird  personal  impact,  and  a  shallow 
explanation  of  related  phenomena  as  mere 
instances  of  totalitarianism.  A  more  balanced 
view  requires  a  combination  of  a  still  broader 
perspective  with  adequate  reference  to  precise  his¬ 
torical  particulars  and  conditions  of  political 
developments. 

Origins  and  Characteristics 

By  the  beginning  of  the  20th  century,  the  sponta¬ 
neous  development  of  European  countries  and 
their  overseas  colonies  extended  over  great 
expanses  and  held  sway  over  civilizations  and 
cultures  hardly  ready  to  adjust  to  the  modes  and 
models  of  modern  behavior.  The  world  became 
integrated  into  a  single  interstate  system  of  ter¬ 
ritorial  rule  that  was  internally  divided  by  impe¬ 
rialist  rivalry  and  structural  disproportions. 
Political  leaders  had  no  institutional  and  intel¬ 
lectual  resources  to  manage  such  development 
even  on  national  levels,  to  say  nothing  of  an 
international  one.  The  worldwide  expansion  of 
modernization  coupled  with  the  imperialist 
enclosure  of  the  entire  surface  of  the  globe  into  a 
single  interstate  system  made  the  issue  of  control 
of  development  critical.  The  inability  to  ade¬ 
quately  respond  to  that  great  challenge  provoked 
World  War  I. 

Postwar  reconstruction  implied  a  grand  project 
to  meet  the  challenge.  The  Entente  winners  relied 
on  national  self-determination  and  the  creation  of 
a  universal  quasi  polity  in  the  form  of  the  League 
of  Nations.  This  project  proved  too  simplistic  and 
straightforward,  however,  to  be  adequate.  It  imme¬ 
diately  provoked  an  alternative  one.  During  World 
War  I,  radical  Marxists  had  called  for  transforming 


an  imperialist  war  into  a  world  revolution  through 
a  series  of  civil  wars.  The  October  Revolution  and 
the  civil  war  that  followed  seemed  to  prove  the 
underlying  logic.  Thus,  Soviet  power  and  the 
Comintern  emerged  as  an  alternative  grand  proj¬ 
ect.  This  was  the  historical  context  for  the  first 
wave  of  democratization  and  the  Soviet  experi¬ 
ment  that  evolved  into  Stalinism,  as  well  as  for  a 
number  of  national  alternative  projects  that  pro¬ 
duced  a  variety  of  totalitarian  and  authoritarian 
regimes. 

Soviet  power  was  highly  ambivalent.  On  the 
one  hand,  it  relied  on  mass  participation  and  thus 
had  a  clear  democratic  calling  coupled  with  an 
institutional  form  of  direct  democracy  of  Soviets — 
Councils  of  Workers  and  Peasants  Deputies.  On 
the  other  hand,  the  emergent  system  could  only  be 
run  by  a  highly  integrated  and  disciplined  new- 
type  vanguard  party.  Lenin,  in  his  seminal  book  of 
1920,  “Left-Wing”  Communism:  An  Infantile 
Disorder,  clearly  fixed  a  hierarchy  of  power:  leaders 
— >  party  — >  working  class  ->  masses. 

Further  development  reshaped  the  initial  combi¬ 
nation  of  democratic  conceptual  and  institutional 
insights  with  authoritarian  and  repressive  trends 
into  a  highly  contradictory  pattern  of  rule  known 
as  Stalinism.  Its  core  was  the  “totalitarian  democ¬ 
racy”  (Jacob  Talmon,  1952)  that  prevailed  during 
the  civil  war  (1918-1920)  and  successive  postwar 
periods  (1920-1922,  1924-1927).  This  nascent 
system  was  gradually  made  more  and  more  effec¬ 
tive  by  institutional  and  ideological  innovations. 
Some  of  those  innovations  could  be  attributed  to 
Stalin  personally,  but  many  were  developed  by  his 
counterparts  and  would-be  political  rivals.  The 
first  decade  after  the  October  Revolution  saw  a 
successive  accumulation  of  traits  that  could  be 
considered  attributes  of  Stalinism,  particularly 
through  an  increasing  authoritarianism. 
Correspondingly,  the  years  and  decades  after 
Stalin’s  death  may  be  seen  as  a  gradual  de- 
Stalinization  or  a  series  of  structural  and  substan¬ 
tive  changes  that  had  a  very  profound  effect 
despite  the  claims  of  radical  anti-Soviet  critics  that 
“the  system  is  untransformable.” 

The  dating  of  Stalinism  and  its  specific  periods  is 
problematic.  Widespread  attribution  of  Stalinism  to 
the  period  between  the  respective  deaths  of  Lenin 
(1924)  and  Stalin  (1953)  seems  to  be  inaccurate, 
greatly  exaggerating  agency  factors  at  the  expense 
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of  structural  ones.  Rather,  it  is  the  congresses  of  the 
new-type  party  that  should  be  considered  critical 
junctures.  Thus,  all  the  years  of  profound  internal 
political  struggle  within  the  party  up  to  the  15th 
Congress  in  December  1927  could  best  be  seen  as 
a  period  of  nascent  Stalinism. 

The  age  of  Stalinism  proper  could  be  divided 
into  a  number  of  periods.  They  include  emergent 
Stalinism,  up  to  the  16th  Congress  of  July  1930; 
early  or  initial  Stalinism,  up  to  the  17th  Congress 
of  January  to  February  1934  and  the  subsequent 
death  of  Sergey  Kirov  in  December  1934;  the  com¬ 
plete  or  full-fledged  Stalinism  of  1934  to  1941;  the 
Word  War  II  interlude;  the  advanced  Stalinism  of 
1945  to  1952,  the  year  of  the  19th  Congress;  and 
the  late  Stalinism  of  1952  to  1956,  concluding 
with  the  20th  Congress. 

Some  essential  characteristics  of  Stalinism,  how¬ 
ever,  had  started  to  be  elaborated  much  earlier. 

The  core  of  the  new  political  system  was  a  new 
type  of  party.  Initially,  it  was  a  highly  disciplined 
and  motivated  clandestine  vanguard  party  orga¬ 
nized  and  molded  by  Lenin  on  the  basis  of  the 
principle  of  democratic  centralism.  After  the 
October  Revolution,  in  a  number  of  realignments, 
Communists  managed  to  keep  dominant  positions 
and  after  the  Civil  War  emerged  as  the  only  com¬ 
manding  power.  At  the  10th  Party  Congress  of 
March  1921,  internal  party  factions  were  banned, 
but  policy  options  could  be  discussed  on  the  basis 
of  alternative  platforms.  Soon  after,  in  the  first 
purge  (micmm,  meaning  “cleansing”),  about  a 
third  of  the  membership  was  expelled  from  the 
party.  Launching  campaigns  against  oppositions 
of  various  kinds  and  using  purges,  Stalin  managed 
to  consolidate  the  party  by  its  15th  Congress.  Only 
then  was  it  possible  to  start  its  transformation 
from  a  party  proper  into  the  core  of  a  party-state 
totalitarian  system. 

The  conception  of  building  socialism  in  a  single 
country  was  another  cornerstone  of  the  Soviet 
experiment.  Already  in  his  1915  article  “On  the 
Slogan  for  a  United  States  of  Europe,”  Lenin  had 
declared  that  uneven  economic  and  political  devel¬ 
opment  made  the  victory  of  socialism  possible, 
first  in  a  few  countries  or  even  in  a  single  country 
alone.  Lenin’s  criticism  of  left  Communists  in 
1920  and  his  subsequent  playing  down  of  the  idea 
of  permanent  revolution,  as  advanced  by  Leon 
Trotsky,  marked  a  significant  departure  from  the 


orthodox  Marxist  vision  of  world  revolution.  By 
the  end  of  1924,  Stalin  recognized  that  “the  prole¬ 
tariat  can  and  must  build  a  socialist  society  in  one 
country”  in  the  second  edition  of  his  book 
Foundations  of  Leninism — the  statement  was 
missing  in  the  first  edition  earlier  in  the  same  year. 
Soon  after,  Nikolai  Bukharin  elaborated  on  the 
issue.  He  argued  that  the  existing  economic  base  of 
the  Soviet  Union  was  sufficient  for  socialist  con¬ 
struction  in  case  its  security  could  be  militarily 
provided.  By  the  time  of  the  15th  Congress,  the 
idea  was  a  fully  accepted  cornerstone  of  official 
policy. 

After  coming  to  power,  the  Communist  Party 
made  consistent  attempts  to  control  appointments 
to  most  important  posts.  Stalin  himself  did  a  lot  to 
establish  special  departments  in  regional  party 
committees  that  would  oversee  the  most  important 
appointments.  As  Secretary-General,  beginning  in 
1922,  he  personally  handled  similar  control  in  the 
center.  That  practice  foreshadowed  the  future  sys¬ 
tem  of  nomenklatura  (described  below  in  the  dis¬ 
cussion  on  complete  Stalinism). 

Historical  Development 

Emergent  Stalinism  (1927-1930)  was  primarily 
linked  with  the  beginning  of  industrialization  and 
collectivization  as  well  as  the  launching  of  the  so- 
called  5-year  plans.  With  all  the  economic  impor¬ 
tance  of  those  policies,  their  social  aspects  were 
far  more  crucial.  A  highly  heterogeneous  econ¬ 
omy  and  society  were  radically  transformed.  The 
economy  became  state  controlled  and  planned. 
Society  was  first  atomized  and  then  reshaped 
with  imposed  homogenization.  All  those  steps 
seemed  highly  modern,  but  in  fact,  such  radical 
and  one-sided  modernization  would  turn  out  to 
be  dysfunctional. 

The  second  general  party  purge  of  1929  to  1930 
was  a  major  event  of  the  period.  About  100,000 
Communists,  or  some  10%  of  the  membership, 
were  purged.  At  the  same  time,  a  significant  num¬ 
ber  of  new  members — industrial  workers — joined 
the  Party. 

Early  or  initial  Stalinism  (1930-1934)  was 
marked  by  further  consolidation  of  party  control. 
During  that  period,  a  theory  of  aggravation  of  the 
class  struggle  in  the  process  of  building  socialism 
was  advanced.  In  his  speech  on  the  results  of  the 
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first  5-year  plan  in  January  1933,  Stalin  (1951) 
began  as  follows: 

Eradication  of  classes  is  accomplished  not  by 
cooling  down  the  class  struggle  but  by  its  heating 
up.  Withering  of  the  state  would  be  achieved  not 
by  weakening  of  state  authority  but  by  its  maximal 
strengthening,  necessary  to  crush  the  remnants  of 
the  dying  classes  and  to  organize  a  defense  against 
capitalist  encirclement,  which  is  not  yet  destroyed 
and  would  not  be  destroyed  soon.  (p.  211) 

This  argument  supplied  a  theoretical  basis  for  the 
claim  that  ongoing  repression  of  political  oppo¬ 
nents  is  necessary. 

The  third  general  party  purge  was  performed 
during  1933.  This  time,  during  the  period  of  the 
purge,  new  memberships  were  temporarily  sus¬ 
pended.  About  400,000  Communists,  or  some 
18%  of  the  membership,  were  purged. 

Complete  Stalinism  (1934-1941)  was  marked 
by  the  growth  of  the  personality  cult  of  Stalin  and 
the  fastening  down  of  a  full-fledged  nomenklatura 
system.  The  term  nomenklatura  (from  the  Latin 
nomenclatura ,  or  “list  of  names”)  denotes  a  prac¬ 
tice  of  advance  nomination  of  candidates  to  various 
important  positions  by  respective  party  commit¬ 
tees.  For  example,  important  positions  at  the  union 
level  were  staffed  by  the  Central  Committee  of  the 
Party,  at  the  republican  level  by  each  republic’s 
Central  Committee,  at  the  regional  level  by  regional 
committees,  at  the  local  level  by  local  committees, 
and  so  on.  Nomination  implied  that  a  person’s 
name  appeared  in  the  list  of  names,  or  nomenkla¬ 
tura.  The  person  was  controlled,  replaced,  pro¬ 
moted,  or  disgraced  within  that  system.  One  could 
have  access  to  significant  positions  only  by  virtue  of 
joining  the  nomenklatura. 

Nomenklatura  is  often  interpreted  as  a  new 
class  (e.g.,  by  Milovan  Djilas  and  Michael 
Voslenski).  Strictly  speaking,  it  was  not  so. 
Nomenklatura  itself  was  highly  stratified.  It  was 
more  a  system  of  control  and  rationalization  of 
government  by  and  through  the  party  than  a  social 
grouping,  much  less  a  class.  In  fact,  it  was  nothing 
but  a  central  link  between  the  party  as  a  core  of  the 
Soviet  totalitarian  system  and  administrative  infra¬ 
structure  integrated  into  the  system.  In  fact,  its 
development  was  coupled  with  the  reduction  of 
the  state  to  an  administrative  arrangement. 


From  1936  onward,  purge  campaigns  were 
replaced  by  the  “struggle  with  traitors  of  the  peo¬ 
ple.”  This  struggle  culminated  in  what  came  to  be 
known  as  Yezbovsbcbina.  The  worst  of  the  repres¬ 
sions  occurred  while  the  People’s  Ministry  of 
Internal  Affairs  (Narodnyy  komissariat  vnutren- 
nich  del ,  NKVD)  was  headed  by  Nikolai  Yezhov 
from  September  1936  to  August  1938.  Stalin  made 
him  responsible  for  the  repressions.  Yezhov  was 
subsequently  arrested  and  executed.  The  very  term 
Yezbovsbcbina  was  coined  to  integrate  the  cam¬ 
paign  of  criticism. 

The  World  War  II  period  created  a  new  recon¬ 
figuration  of  world  politics  that  influenced  many 
aspects  of  Stalinism.  The  idea  of  building  socialism 
in  a  single  country  was  reinterpreted,  putting  the 
Soviet  Union  at  the  center  of  capitalist  states,  rang¬ 
ing  from  its  most  reactionary  fascist  foes  all  the 
way  to  its  allies.  Intensification  of  the  class  struggle 
was  linked  to  this  picture.  While  the  totalitarian 
party-state  formation  proved  effective  and  instru¬ 
mental,  it  was  supplemented  by  the  traditional 
great-power  apparatus  of  the  Russian  imperial 
autocracy;  thus,  Stalin’s  personality  cult  was  greatly 
boosted  and  became  more  deeply  entrenched. 

Advanced  Stalinism  (1945-1952)  saw  a  further 
transformation  of  the  theory  of  building  socialism 
in  a  single  country.  People’s  democracies  in  the 
countries  that  fell  into  the  sphere  of  Soviet  geopo¬ 
litical  control  formed  the  so-called  socialist  camp. 
Consolidation  of  the  new  system  gave  impetus  to 
the  idea  of  intensification  of  the  class  struggle. 
Stalinism  made  claims  to  global  domination.  Its 
conceptual  basis  of  Marxist  scientific  teaching  was 
extended  to  integrate  all  science.  The  natural  sci¬ 
ences  were  supposed  to  elaborate  dialectical  mate¬ 
rialism,  while  social  scholarship  was  to  be  informed 
by  historical  materialism.  Several  public  campaigns 
on  specific  sciences  were  launched,  with  Stalin 
himself  publishing  a  book  on  linguistics. 

All  in  all,  the  system  of  advanced  totalitarianism 
became  too  complex  and  complicated  to  be  run  by 
simple  principles  of  total  control.  The  system  was 
bound  to  change. 

Late  Stalinism  (1952-1956)  saw  a  series  of 
reforms  begun  by  Stalin  himself,  finishing  with  the 
thaw  initiated  during  the  premiership  of  his  succes¬ 
sor  Nikita  Khrushchev.  The  19th  Congress,  orga¬ 
nized  after  a  13 -year  interval,  signaled  the  beginning 
of  the  process.  It  was  Georgy  Malenkov,  not 
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Stalin,  who  made  the  main  speech  (report  on  party 
activities).  The  speech  highlighted  the  need  for 
criticism  and  self-criticism.  The  theme  was  devel¬ 
oped  by  Stalin  himself  and  most  of  the  speakers. 
Many  historians  interpret  this  as  preparation  for  a 
new  wave  of  purges.  This  was  probably  true,  but 
it  also  signaled  the  coming  reforms.  They  fol¬ 
lowed,  coupled  with  a  power  struggle  after  Stalin’s 
death  in  March  1953.  Several  projects  of  de- 
Stalinization  emerged,  ranging  from  strengthening 
state  structures  and  emancipating  them  from  the 
party  (Lavrentii  Beria,  Minister  of  the  Interior)  to 
revivalism  of  Leninist  principles,  including  totali¬ 
tarian  democracy  (Khrushchev). 

Mikhail  Ilyin 
MGIMO  University 
Moscow,  Russian  Federation 
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The  three  sections  of  this  entry  examine  the  very 
complex  and  controversial  political  phenomenon 
commonly  referred  to  as  the  state.  The  first  section 
presents  a  conceptual  statement  that  takes  as  its 
point  of  departure  the  ideal-typical  treatment  of  the 
state  presented  at  the  beginning  of  the  20th  century 
by  the  German  scholar  Max  Weber  (1864-1920). 
This  choice  reflects  the  significant  influence  that 
treatment  exercised,  over  the  remainder  of  that 
century,  within  the  disciplines  of  law,  political  sci¬ 
ence,  and  sociology.  (Note  that  Weber’s  arguments, 
while  markedly  original  in  parts,  echoed  and  elabo¬ 
rated  theoretical  views  widely  shared  within  the 


juridical  and  social  science  literature  of  his  time, 
both  in  Germany  itself  and  elsewhere.  However, 
neither  at  the  time  nor  later  were  they  universally 
accepted  as  valid.).  Included  here  are  comments 
that  support,  qualify,  and  modify  Weber’s  own 
treatment.  The  next  section  of  the  entry  summa¬ 
rizes  the  major  historical  developments  that  in  the 
course  of  Western  modernization  imparted  to  the 
political  institutions  of  certain  European  countries 
the  distinctive  features  emphasized  in  Weber’s  state 
concept.  They  turned  the  state  itself  into  the  model 
arrangement  for  the  generation  and  management 
of  large-scale  political  power,  first  in  other  European 
countries  and  then  in  other  parts  of  the  world.  The 
third  section  of  the  entry  considers  very  briefly  the 
major  events  that,  since  Weber  wrote,  have  mark¬ 
edly  affected  the  institutional  physiognomy  of  the 
state  itself  and  its  position  within  the  larger  society 
and  highlights  some  aspects  of  the  contemporary 
discussion  about  “the  state  of  the  state.” 

A  Conceptual  Portrait  of  the  State 

Weber’s  understanding  of  the  state  is  most  often 
recalled  in  a  definition  he  proposed  in  various 
texts:  “The  state  is  a  human  community  that  (suc¬ 
cessfully)  claims  the  monopoly  of  the  legitimate 
use  of  physical  violence  [italics  added]  within  a 
given  territory”  (Gerth  &  Mills,  1946,  p.  78).  But 
a  more  elaborate  definition  may  provide  a  better 
start  to  our  own  discussion: 

The  primary  formal  characteristics  of  the  modern 
state  are  as  follows:  It  possesses  an  administrative 
and  legal  order  subject  to  change  by  legislation, 
to  which  the  organized  activities  of  the 
administrative  staff,  which  are  also  controlled  by 
regulations,  are  oriented.  This  system  of  order 
claims  binding  authority,  not  only  over  the 
members  of  the  state,  the  citizens,  most  of  whom 
have  gained  membership  by  birth,  but  also  to  a 
very  large  extent  over  all  action  taking  place  in 
the  area  of  its  jurisdiction.  It  is  thus  a  compulsory 
organization  with  a  territorial  basis.  Furthermore, 
today,  the  use  of  force  is  regarded  as  legitimate 
only  insofar  as  it  is  either  permitted  by  the  state 
or  prescribed  by  it.  (Weber,  1978,  p.  56) 

Although  Weber  here  as  elsewhere  uses  the 
expression  “the  modern  state,”  in  his  own  view, 
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the  qualifier  modern  is  largely  superfluous.  While 
his  best  known  work,  The  Protestant  Ethic  and 
the  Spirit  of  Capitalism  (1905/2002),  investigates 
(famously  and  controversially)  a  religious  aspect 
of  the  economic  component  of  Western  modern¬ 
ization,  Weber  views  state  formation  as  the  most 
significant  political  component  of  that  same, 
broad-ranging,  universally  significant  historical 
phenomenon.  By  the  same  token,  the  state  consti¬ 
tutes  a  distinctive,  relatively  recent  moment 
within  the  historical  career  of  a  universal  but 
highly  varied  phenomenon — that  of  Herrschaft 
(an  expression  often  translated  as  “authority”  or 
“domination,”  which  below  we  generally  refer  to 
as  “rule”). 

Conceptually  speaking,  rule  itself  is  a  variant  of 
an  even  broader  phenomenon — that  of  political 
power.  In  the  Weberian  construction  of  it,  political 
power  exists  in  a  situation  where  given  two  parties 
(individual  or  collective)  pursuing  incompatible 
interests,  one  can  overcome  the  resistance  of  the 
other  to  its  own  pursuits  by  engaging  or  threaten¬ 
ing  to  engage  in  violent  action  and  thus  putting  at 
stake  the  other’s  physical  vulnerability.  The  result¬ 
ing  inequality  between  the  two  parties  can  be  more 
or  less  marked,  more  or  less  durable,  and  wide- 
ranging.  It  can  become  sharper  and  more  stable, 
involve  greater  numbers,  and  affect  a  broader 
spectrum  of  social  affairs  when  the  superior  party 
no  longer  needs  to  exercise  or  threaten  violence  in 
order  to  get  its  way  but  establishes  and  manages  a 
relation  where  its  own  command  routinely  engen¬ 
ders  the  other  party’s  obedience.  To  the  extent  that 
this  happens,  we  can  say  that  political  power  has 
become  institutionalized  into  rule. 

Weber’s  greatest  contribution  to  the  study  of 
the  institutionalization  of  political  power  lies  in  his 
emphasis  on  its  legitimation;  indeed,  he  views  rule 
as  legitimate  political  power.  This  is  the  case  to  the 
extent  that  the  inferior  party  is  not  subjectively 
induced  to  obey  by  its  sheer,  unquestioning  habitu¬ 
ation  to  its  own  position  or  by  a  calculation  of  the 
material  costs  and  rewards  it  can  expect  respec¬ 
tively  from  granting  or  denying  obedience.  Rather, 
its  obedience  expresses  the  belief  it  entertains  in 
the  superior  party’s  entitlement  to  issue-binding 
commands  and  in  its  own  corresponding  moral 
duty  to  obey  them.  This  is  of  course  an  ancient 
insight,  but  Weber  develops  it  most  creatively. 
First,  he  argues  that  the  beliefs  in  question,  for  all 


the  historical  variety  in  their  content  and  in  the 
situations  that  engender  them  and  that  they  sus¬ 
tain,  can  all  be  subsumed  under  three  basic,  ideal 
types  of  legitimacy:  traditional,  charismatic,  legal- 
rational.  Then,  he  shows  that  this  tripartition  is 
correlated  with  other  significant  distinctions  con¬ 
cerning  diverse  aspects  of  systems  of  rule. 

Building  to  an  extent  on  this  and  other  contri¬ 
butions  of  Weber’s,  the  contemporary  German 
sociologist  Heinrich  Popitz  (1925-2002)  devel¬ 
oped  a  broader  theory  of  the  institutionalization  of 
political  power  and  of  its  development  into  rule. 
There  are  three  components  to  this  process: 

First,  power  relations  become  de-personalized. 
Power  is  no  longer  undistinguishable  from  the 
particular  individual  who  at  a  given  time  is  in  a 
markedly  advantageous  position,  but  increasingly 
appears  to  be  connected  with  functions  and 
positions  which  transcend  the  individual. 
Furthermore,  a  formalization  of  power  takes 
place,  in  that  its  exercise  is  more  and  more 
oriented  to  rules,  procedures,  and  ritual  practices. 
(This  does  not  exclude  its  arbitrary  use;  but  one 
can  talk  about  this,  or  about  undue  favor,  only 
when  arbitrary  or  favourable  acts  are  contrasted 
with  what  happens  in  most  situations.)  The  third 
aspect  of  the  institutionalization  of  power  is  the 
growing  integration  of  power  relations  within  a 
broader  order.  Power  gears  itself  into  the  pre¬ 
existent  situation.  It  becomes  part  of  and  is 
encompassed  within  the  social  edifice,  which  it 
supports  and  by  which  it  is  in  turn  supported. 
(Popitz,  1992,  pp.  38-39) 

The  French  social  theorist  Jean-Yves  Calvez  sug¬ 
gests  that  we  characterize  the  state  as  a  polity  where 
institutionalization  is  particularly  thorough  and 
advanced.  This  proposal  is  significant  but  considers 
expressly  only  the  extent  to  which  institutionaliza¬ 
tion  is  realized  in  state  building  and  disregards  the 
question  of  how  it  is  institutionalized.  In  some  sys¬ 
tems  of  rule — for  instance,  in  the  classical  Chinese 
empire — a  very  extensive  and  penetrating  institu¬ 
tionalization  process  was  carried  out  largely  by 
means  of  what  Popitz  calls  ritual  practices,  both 
those  of  an  expressly  religious  nature  (the  emperor 
was  supposed  to  have  a  mandate  from  Heaven)  and 
others  that  we  might  call  ceremonial  and  etiquette. 
Here,  political  and  administrative  practices  were 
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standardized  primarily  by  reference  to  values  and 
norms  perceived  as  sacred,  stylized  in  ways  chosen 
for  their  symbolic  appropriateness,  conveying  and 
instilling  a  respect  for  tradition  and  a  commitment 
to  its  protection. 

Although  in  the  West  ritual  and  similar  prac¬ 
tices  have  also  been  used  in  constructing  and  man¬ 
aging  political  power,  already  in  ancient  Greece 
and  Rome,  it  was  institutionalized  and  managed 
by  means  of  rather  different  practices,  both  origi¬ 
nally  unique  to  the  West.  On  the  one  hand,  sig¬ 
nificant  political  and  administrative  decisions  came 
to  depend,  more  or  less,  on  express,  legitimate, 
public  deliberation  over  alternative  views  of  what 
policies  to  adopt.  On  the  other,  those  polities 
structured  themselves  by  means  of  law — that  is,  of 
general  commands  that  articulated  the  polity  itself 
into  organs  and  offices  with  distinctive  faculties 
and  responsibilities. 

Furthermore,  the  formation  of  policy  itself  and 
(as  far  as  possible)  its  subsequent  implementation 
were  also  regulated  by  law,  which  the  Greeks  and 
Romans  increasingly  understood  as  a  body  of  prin¬ 
ciples  and  rules  of  secular  nature,  capable  of  being 
taught,  learned,  and  rationally  interpreted  and 
applied.  Its  bearing  on  a  huge  variety  of  conten¬ 
tious  matters  could  be  determined  by  means  of  a 
distinctive  form  of  legal  discourse — highly  literate, 
sophisticated,  intellectually  prestigious  (especially 
in  Rome) — which,  in  principle,  could  produce 
choices  demanding  rational  assent  between  alter¬ 
native  solutions  to  a  given  practical  problem. 

Thus  understood,  law  is  much  better  than  ritual 
at  confronting  open-ended  situations  and  at 
handling  and  generating  contingency.  Now,  contin¬ 
gency  itself — the  understanding  that  much  (perhaps 
all)  that  is  the  case  in  the  world  is  not  so  of  neces¬ 
sity  but  merely  because  it  is  the  case,  often  (not 
always)  as  a  result  of  human  agency — is  a  central 
aspect  of  modernity.  On  this  account  (among  oth¬ 
ers),  one  may  understand  why  law  played  a  large 
role  in  the  building  of  the  typical  polity  of  Western 
modernity  and  in  the  formation  and  implementa¬ 
tion  of  its  policies.  For  while  legal  decisions  are 
occasioned  by  the  possibility  of  alternative  choices, 
they  settle  for  one  or  the  other  via  a  work  of  inter¬ 
pretation  intrinsically  open  to  contestation  and 
reconsideration.  This  is  all  the  more  so  when  they 
draw  on  principles  and  modalities  of  argument  not 
perceived  as  divine  or  supernatural  in  nature  and, 


thus,  not  endowed  with  absolute  and  utterly 
unchallengeable  validity. 

Later  Western  polities,  particularly  during  the 
Middle  Ages,  maintained  a  close  relationship 
between  law  and  rule.  But  here,  law  itself  was  seen 
chiefly  as  the  product  of  long-lasting,  spontaneous 
social  and  cultural  processes,  mainly  embodied  in 
locally  evolved  customs;  these  expressed  widely 
shared  moral  convictions,  often  interpreted  and 
inculcated  by  priests  as  constituting  religious  com¬ 
mandments.  Rulers  related  to  the  law,  so  under¬ 
stood,  chiefly  as  its  (more  or  less  effective  and 
reliable)  custodians  and  enforcers. 

A  rather  drastic  change  in  the  relation  between 
rule  and  law  associated  with  (indeed,  constitutive 
of)  the  advance  of  modernity  (though  oddly 
enough,  it  finds  a  significant  precedent  in  the 
Gregorian  reform  of  the  Church  in  the  11th  cen¬ 
tury)  was  associated  with  the  emergence  and  devel¬ 
opment  of  so-called  positive  law.  Here,  lawmaking 
comes  to  be  seen  as  a  distinctive  and  exclusive 
prerogative  of  rule,  turning  law  from  a  framework 
of  policy  into  an  instrument  for  it.  It  has  become 
possible  to  make  law(s)  in  order  to  confront  the 
situation  and  change  it  by  selecting  and  mandating 
new  policies. 

This  “positivization”  of  law,  by  making  it 
depend  on  express,  deliberate,  time  and  place- 
bound  human  decisions  (rather  than  on  its  pre¬ 
sumed  agreement  with  the  supreme  religious  and 
moral  dictates  of  the  divinity  or  of  nature  itself,  or 
with  the  folk’s  jural  traditions),  makes  it  relevant 
to  a  distinctive  feature  of  the  ideal-typical, 
Weberian  state — its  rationality.  The  system  of  rule 
can  treat  law  as  one  instrument  for  optimizing  the 
relation  between  the  costs  and  benefits  of  its  own 
activities.  This,  in  principle,  is  not  possible  where 
the  system’s  prevalent  mode  of  institutionalization 
relies  on  ritual,  where  the  connection  between 
activities  and  intended  results  is  chiefly  symbolic, 
not  intrinsic. 

But  turning  law  into  a  political  instrument 
raises  a  threatening  problem:  how  to  secure  the 
third  aspect  of  the  institutionalization  of  political 
power,  its  integration  into  the  larger  society — a 
balance  between  what  a  society  is  induced  (or 
indeed  compelled)  to  do  for  the  state  and,  vice 
versa,  what  the  latter  does  for  the  former.  States 
generally  deal  with  this  problem  by  establishing  a 
hierarchical  distinction  between  different  sources 
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of  law — between  a  state’s  constitution  and  its  stat¬ 
utes  or  between  these  and  the  authoritative  acts 
applying  them  (judicial  sentences,  ministerial 
decrees).  Legal  commands  at  lower  levels  are  valid 
only  if,  while  in  principle  their  content  is  open- 
ended,  they  can  be  discursively  shown  to  be  consis¬ 
tent  with  higher  level  commands.  These  come  to 
represent,  so  to  speak,  a  more  deep  and  abiding 
commitment  of  the  polity  to  the  protection  and 
fostering  of  some  societal  interests.  But  the  forma¬ 
tion  and  execution  of  commands  at  each  level  can 
be  expected  to  reflect,  once  more,  a  preoccupation 
with  rationality,  a  sustained  effort  to  attain  opti¬ 
mal  results  in  the  face  of  changing  circumstances 
while  minimizing  costs  and  risks.  Thus  under¬ 
stood,  legal  commands  can  both  confront  and 
generate  a  great  deal  of  contingency. 

When  it  is  thus  “positivized” — that  is,  rendered 
contingent  on  express,  deliberate,  and  time  and 
place-bound  human  decisions — the  making  and 
application  of  law  empowers  the  state.  Instead  of 
closely  monitoring  multiple,  diverse,  and  changing 
circumstances  to  issue  commands  that  activate 
specific  responses  from  specific  individuals,  the 
state  can  address  abstractly  formulated,  general 
commands  to  an  open-ended  plurality  of  individu¬ 
als,  both  those  actively  involved  in  its  own  opera¬ 
tions  and  those  at  the  receiving  end  of  them;  it  can 
program  and  make  predictable  their  response  to 
those  commands.  It  can  evoke  in  them  not  just  ad 
hoc  responses  but  lasting  and  broad-ranging  dis¬ 
positions  to  act,  frame  their  activities  within  an 
environment  of  sanctioned  expectations,  and  struc¬ 
ture  both  their  private  relations  to  one  another  and 
those  they  entertain  with  the  state  itself.  And  it  can 
do  this  not  only  by  ordering,  authorizing,  or  for¬ 
bidding  specific  activities  of  individuals  but  also  by 
enabling  them  to  produce  valid,  actionable  legal 
effects  in  their  own  open-ended,  selective  pursuit 
of  their  interests. 

Insofar  as  it  does  this,  the  state  recognizes  as 
differentiated  from,  complementary  to,  and  auton¬ 
omous  of  itself  (though  to  some  extent  controlled 
and  empowered  by  its  own  activities)  a  whole 
realm  of  social  and  cultural  processes  that  we  can 
label  civil  society.  This  exists  insofar  as  individuals 
and  collectivities  subject  to  the  state’s  law  also 
hold  under  it  rights  of  their  own,  that  is  legally 
recognized  and  protected  claims — private  rights, 
which  they  exercise  in  their  dealings  with  one 


another,  and  public  ones,  the  exercise  of  which 
concerns  their  relations  to  the  state  itself  and 
asserts  some  entitlements  vis-a-vis  it.  The  comple¬ 
mentarity  between  the  state  and  civil  society  is  the 
state’s  characteristic  mode  of  integration — the 
third  aspect  of  the  institutionalization  of  rule  men¬ 
tioned  by  Popitz. 

All  this  presupposes  the  peculiar  and  histori¬ 
cally  rare  juridical  development  whereby  the  cen¬ 
ter  of  political  power  to  some  extent,  in  some 
fashion,  binds  itself  to  the  observance  of  rules  of  its 
own  making.  To  quote  Popitz  (1992)  again, 

Only  rarely,  in  the  history  of  society,  has  one 
found  it  possible  even  just  to  pose  systematically 
and  consequentially  the  question,  how  to  lay 
boundaries  on  institutionalized  violence.  Essen¬ 
tially,  one  has  accomplished  this  only  in  the 
Greek  polis,  in  republican  Rome,  in  a  few  other 
city-states,  and  in  the  history  of  the  modern  con¬ 
stitutional  state.  The  answers  to  that  question 
have  remained  strikingly  similar:  the  postulate  of 
the  supremacy  of  the  law  and  of  the  equality  of 
all  before  it  (isonomia),  the  idea  of  delimiting  in 
principle  all  legislation  (basic  rights),  procedural 
norms  (decisions  taken  by  organs,  publicity, 
appeals  to  higher  authorities),  norms  on  the  attri¬ 
bution  of  offices  (turn  taking,  selection),  and 
norms  of  the  public  sphere  (freedom  of  thought 
and  of  assembly),  (p.  65) 

Nothing  less  than  that  is  at  stake  in  the  state-law 
relation — but  also,  nothing  more.  An  excessive 
emphasis  on  that  relation  may  obscure  the  sheer 
factuality  of  other  aspects  of  the  state’s  operations — 
not  just  its  ultimate  grounding  on  violence  but  also 
the  question  of  whether  it  does  or  does  not  at  a 
given  moment  possess  the  material  resources 
required  for  those  operations,  be  these  once  more 
of  a  violent  kind  or  of  a  peaceable  kind.  (The  say¬ 
ing  C’est  I’argent  qui  fait  la  guerre — it  is  money 
that  makes  war — suggests  that  the  two  kinds  are 
closely  connected.)  Much  of  Weber’s  political  the¬ 
orizing  has  to  do  with  the  arrangements  whereby 
under  different  kinds  of  rule  those  resources  are 
gathered,  managed,  and  committed,  thus,  with 
administration.  The  administrative  arrangements 
typical  of  the  state  are  bureaucratic;  this  means  (in 
Weber’s  own  famous  ideal  type  of  bureaucracy) 
that  administration  is  carried  out  on  the  basis  of 
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various  forms  of  knowledge — juridical,  military, 
fiscal,  managerial,  statistical,  geographical,  all  of 
which  are  perceived  as  different  from  (and  superior 
to)  purely  factual  information  and  practical  know¬ 
how.  When  deployed  in  the  course  of  express 
deliberation,  they  are  expected  to  yield  decisions 
that  command  assent  and  that  activate  and  orient 
the  conduct  on  the  ground  of  merely  executive 
activities.  The  state’s  capacity  to  apply  such  forms 
of  knowledge  via  its  bureaucratic  apparatus  fur¬ 
ther  qualifies  it  as  a  distinctively  rational  kind  of 
political  enterprise,  purposefully  seeking  both 
effectiveness  (the  maximal  achievement  of  its  own 
ends)  and  efficiency  (the  optimal  relation  between 
means  and  ends).  It  can  use  its  bureaucratic  staff  to 
monitor  changing  circumstances;  identify  and  fore¬ 
cast  dangers  and  opportunities;  maintain,  suspend, 
or  modify  existing  arrangements;  and  make  appro¬ 
priate  dispositions  to  confront  (and  to  generate) 
contingent  situations. 

The  diversity  in  the  contents  of  the  knowledge 
that  the  state  can  mobilize  is  paralleled  by  the 
diversity  in  other  resources  at  its  disposal — guns 
and  gallows,  sophisticated  financial  instruments, 
buildings,  various  means  of  communication,  spe¬ 
cific  juridical  faculties,  moneys.  Both  diversities 
(and  the  consequent  one  in  the  vocational  identi¬ 
ties  within  the  staff)  reflect  themselves  in  the  varied 
nature  of  the  tasks  the  state  can  take  on  itself  in 
managing  a  society’s  political  affairs.  In  turn,  the 
diversity  in  the  savoirs,  the  materials,  and  the  per¬ 
sonnel  available  to  the  state  finds  visible  expres¬ 
sion  in  the  organizational  design  of  the  state  itself. 
It  is  in  the  first  instance  articulated  functionally, 
that  is,  into  units  that  are  each  normally  engaged 
in  different  activities,  reflecting  the  varied  nature 
of  the  tasks  those  activities  must  accomplish  and  of 
the  relative  bodies  of  knowledge  and  of  relevant 
material  resources.  Again,  in  matching  as  far  as 
possible  its  structures  to  its  functions,  the  state 
expresses  its  rationality.  (The  familiar  image  of  the 
state  as  a  machine  best  conveys  this  principle.) 

The  second  mode  of  articulation  is  instead  hier¬ 
archical.  The  performance  of  the  various  state 
activities  by  diverse  personnel  on  the  ground — 
soldiers,  teachers,  diplomats,  policemen,  judges,  tax 
collectors,  and  social  workers — is  controlled  by 
personnel  higher  up,  who  activate  the  personnel 
under  them  and  instruct  them  on  how  to  operate, 
monitor,  and  if  necessary  sanction  those  operations. 


Here,  the  telling  image  is  that  of  the  pyramid,  where 
the  base  is  provided  by  the  personnel  on  the  ground 
and  the  apex  by  the  controlling  bureaucratic  elite. 
In  this  way,  the  state  secures  its  unity  and  the  uni¬ 
formity  or,  at  any  rate,  the  consistency  in  the  way 
its  activities  are  carried  out  over  time  and  in  various 
locales.  Furthermore,  the  higher  levels  not  only  can 
mandate  and  thus  constrain  the  local,  time-bound 
exercise  of  the  same  activities  at  lower  levels,  but 
they  can  also  merge  and  coordinate  diverse  activi¬ 
ties  and  effect  the  convergence  of  different  bodies  of 
knowledge  and  of  personnel. 

Both  in  their  functional  and  in  their  hierarchical 
arrangements,  bureaucratic  systems  embody  the 
first  two  dimensions  of  institutionalization  accord¬ 
ing  to  Popitz: 

1.  Depersonalization:  The  basic  components  of 
bureaucratic  units  are  offices,  ensembles  of 
resources,  entitlements,  and  responsibilities  to 
which  the  physical  individuals  appointed  to 
each  office  are  expected  to  orient  their  activities. 
To  guarantee  this  expectation,  officials  do  not 
own  the  facilities  they  use,  and  in  using  them, 
the  officials  are  expected  to  serve  the  interests 
of  the  office,  not  those  private  to  the  occupants 
themselves. 

2.  Formalization:  The  activities  of  the  occupants 
are  imputed  to  the  offices  themselves  and 
rendered  valid  only  insofar  as  they  implement 
certain  rules,  both  of  a  technical  and  of  a  legal 
nature,  both  substantive  and  procedural. 

The  state  can  produce  such  rules  directly,  by 
means  of  positive  law  that  at  successive  levels — a 
statute  turned  into  bylaws,  a  bylaw  into  concrete 
ordinances — can  activate  the  discretionary  capac¬ 
ity  of  the  administrative  personnel.  It  can  also 
produce  such  rules  indirectly,  by  authorizing 
activities  that  mobilize  the  bodies  of  nonlegal, 
professional  knowledge,  which  have  qualified 
individuals  for  entering  that  profession.  In  both 
ways,  it  can  program  the  conduct  of  the  office¬ 
holders,  on  the  one  hand,  making  it  predictable 
and,  on  the  other,  making  it  respond  to  the 
particularities  and  contingencies  of  the  hie  et  nunc 
that  the  officials  confront. 

A  system  of  administration  that  mobilizes  very 
diverse  resources  and  articulates  itself  into  a  vari¬ 
ety  of  units  permits  the  state  to  exercise  rule  in  a 
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continuous  and  intensive  manner.  It  can  thus 
match  the  demands  and  opportunities  presented 
by  the  increasingly  complex  and  changing  societies 
of  modernity  and  acknowledge  their  growing  pref¬ 
erence  for  a  rational  and  secular  orientation  to 
action,  individual  and  collective,  and  the  particular 
significance  they  attach  to  economic  affairs. 

This  conceptual  portrait  of  the  state  has 
expressly  considered  only  the  ways  in  which  rule  is 
structured  and  carried  out  within  it.  But  the  mod¬ 
ern  political  universe  is  constituted  by  a  plurality 
of  states  and  characterized  also  by  the  relations 
that  typically  exist  among  them,  for  the  monopoly 
of  legitimate  violence  constitutive  of  each  state 
refers  to  a  specific  portion  of  the  globe,  at  the 
boundaries  of  which  it  encounters  the  monopoly 
vested  in  other  states.  Together,  they  make  up  the 
states’  system,  the  critical  feature  of  which  is  that 
the  units  making  it  up  are  not  empowered  and 
controlled  by  an  overarching  power  center.  They 
are  equal  to  one  another  in  some  respects:  Each  is 
territorial,  being  bounded  by  the  portion  of  the 
globe  over  which  it  exercises  the  monopoly  of 
legitimate  violence;  each  is  sovereign,  in  the  sense 
of  constituting  the  overriding  fount  of  all  legiti¬ 
mate  political  authority;  and  each  is  expected  and 
entitled  autonomously  to  define  its  own  interests 
and  to  pursue  them  by  committing  its  own 
resources  at  its  own  risk. 

This  elementary  view  of  international  affairs, 
and  the  decisive  role  unavoidably  played  in  them 
by  sheer  relations  of  might  between  the  participat¬ 
ing  states,  is  grounded  on  the  principles  laid  down 
in  the  Treaties  of  Westphalia  (1648).  The  extent  to 
which  these  still  hold  is  a  key  issue  within  the  dis¬ 
cipline  of  international  relations,  given  the  number 
and  magnitude  of  events  that  have  occurred  since 
that  time  in  the  relations  between  states.  Weber 
himself  seems  to  have  held  a  Westphalian  view  of 
these  matters,  perhaps  because  many  of  the  afore¬ 
mentioned  events  (such  as  the  development  of 
certain  kinds  of  international  organizations  or  the 
advent  of  nuclear  weapons)  occurred  after  his 
death.  In  fact,  his  theoretical  writings  are  largely 
silent  on  these  matters,  but  some  of  his  political 
ones,  especially  from  the  times  of  World  War  I  and 
immediately  afterward,  throw  some  light  on  them. 

There  is  thus  a  marked  contrast  between  the 
import  of  state  building  within  the  state  itself  and 
its  import  between  the  states.  At  the  first  level, 


state  building  is  typically  associated  with  pacifica¬ 
tion,  since  the  state  monopolizes  legitimate  vio¬ 
lence  and  commits  it  to  sanction  the  activities  of  an 
extensive,  differentiated  but  at  the  same  time  uni¬ 
tary  system  of  law  and  administration.  All  this,  it 
might  be  said,  has  an  ordering  effect,  reducing  the 
contingency  in  the  relation  between  the  parts.  At 
the  second  level,  contingency  is  instead  increased, 
since  the  states  take  their  own  initiatives  vis-a-vis 
one  another  (including  the  recourse  to  violence)  in 
light  of  their  own  autonomous  definition  of  their 
own  interests.  For  each,  chief  among  these  is  its 
own  security,  but  a  state  of  affairs  that  secures  one 
state  may  desecure  another.  Under  these  condi¬ 
tions,  for  each,  the  pursuit  of  security  must  take  the 
form  of  increasing  as  far  as  possible  its  own  might. 

Major  Phases  in  State  Development 

An  overview  of  “the  state’s  story”  is  possible  only 
by  means  of  considerable  oversimplification.  This 
discussion  reduces  it  to  three  phases,  each  compris¬ 
ing  institutional  developments  that  varied  consid¬ 
erably  from  one  context  to  another  in  their  timing 
and  in  their  concrete  manifestations. 

Consolidation  of  Rule 

The  first  phase  can  be  labeled  consolidation  of 
rule-,  it  took  place  in  Europe  between  the  12th  and 
the  17th  centuries.  During  this  phase,  a  number  of 
locally  based  rulers  sought  to  extend  the  territorial 
reach  of  their  political  control.  Naturally,  this 
engenders  between  them  a  competitive  process 
whose  outcome  favors  a  few  rulers  and  compels  a 
considerably  larger  number  of  others  to  remain  in 
the  business  of  rule  (if  at  all)  only  in  a  subordinate 
position,  their  own  territories  being  subsumed 
under  the  larger  ones  now  controlled  by  the  suc¬ 
cessful  competitors.  As  a  result,  the  political  map 
of  the  continent  becomes  simpler  and  simpler, 
comprising  fewer  distinct  areas,  now  mostly  geo¬ 
graphically  more  continuous  and  historically  more 
stable  ones — unless,  of  course,  they  become  them¬ 
selves  objects  of  further  processes  of  consolidation. 

Sometimes,  these  are  peaceful.  For  instance,  the 
scions  of  two  dynasties  ruling  over  different  parts 
of  Europe  marry,  and  the  territorial  holdings  of 
one  spouse  become  soldered  with  those  of  the 
other.  Or  a  political  center  temporarily  hands  over 
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to  another  a  part  of  its  own  territory  as  security  for 
a  loan  but  then  defaults  and  surrenders  that  security. 
But  mostly,  consolidation  is  the  more  or  less  direct 
outcome  of  open  conflicts  between  two  centers  of 
rule  over  which  one  will  control  which  territory. 
Such  conflicts  are  mostly  settled  by  war,  which  leads 
to  conquest  and  forcible  annexation  of  all  or  a  part 
of  the  loser’s  territory  to  the  advantage  of  the  win¬ 
ner.  “States  make  war,”  as  Tilly  (1992)  has  memo¬ 
rably  put  it,  “and  wars  make  states”  (p.  42). 

Thus,  a  decisive  role  in  the  consolidation  of  rule 
is  played  by  military  resources.  But  these  in  turn 
require  the  “sinews  of  war,”  that  is,  the  financial 
capacity  to  muster  those  resources — troops,  offi¬ 
cers,  and  material — and  deploy  them  against 
opponents  with  the  intent  of  prevailing  over  them 
in  the  clash  of  arms.  Very  often,  military  innova¬ 
tion  confers  an  advantage  to  armies  and  fleets  that 
are  larger,  for  these  can  wage  war  on  more  than 
one  front  and  become  internally  differentiated  into 
“services,”  which  can  effectively  perform  distinct, 
complementary  tasks  in  warfare.  But  only  rulers 
who  marshal  larger  resources  can  afford  such 
armies  and  fleets,  and  to  this  end,  they  need  to 
raise  troops  from  larger  populations,  tapping  the 
wealth  produced  by  larger  territories.  This  pre¬ 
mium  on  bigness  is  a  strong  inducement  to  what 
we  call  consolidation. 

When  weapons  are  silent,  however  temporarily, 
resources  of  a  different  nature  also  come  into  play. 
Often,  political  centers  intent  on  consolidating  rule 
do  this  in  response  to  an  appeal  to  peace  that  recurs 
most  frequently  in  European  history,  being  often 
voiced  by  religious  leaders.  Each  center  seeks  to 
prove  that  by  establishing  its  control  over  a  larger 
territory,  it  can  put  an  end  to  tensions  between 
rivalrous  lesser  powers,  which  would  otherwise 
occasion  war  or  other  violent  disorder.  This  does 
not  always  involve  prevailing  over  those  powers  in 
battle.  Diplomatic  action,  the  game  of  alliances  and 
coalitions,  the  ability  to  isolate  opponents  or  to 
make  them  accept  a  degree  of  subordination,  and 
sometimes  the  recourse  to  arbitration  by  the  empire 
or  the  papacy  also  play  a  role. 

In  addition,  military  activity  itself  requires  and 
produces  rules  of  its  own,  the  very  core  of  an 
emerging  body  of  law  seeking,  more  or  less  effec¬ 
tively,  to  regulate  aspects  of  the  relations  between 
states.  Another  significant  part  of  such  law  makes 
conflict  over  territory  less  likely  by  laying  down 


clear  principles  for  succession  into  vacant  seats  of 
power,  mostly  by  making  legitimate  descent  the 
exclusive  entitlement  to  rule.  Other  developments 
contribute  to  the  same  “pacification”  effect.  For 
instance,  thanks  to  the  remarkable  advances  in 
geography,  in  the  measurement  of  terrain,  and  in 
cartography,  the  physical  reach  of  each  center  of 
rule  comes  to  be  clearly  delimited  by  geographical 
borders,  in  turn  often  determined  by  features  of 
the  terrain.  It  remains  true,  as  Thomas  Hobbes 
puts  it,  that  states  adopt  toward  one  another,  even 
when  they  are  not  fighting,  “a  posture  of  warre 
[sic].”  But  they  partition  the  continent  of  Europe, 
and  later  other  continents,  in  a  clear  and  poten¬ 
tially  stable  manner.  (There  are  also  rules  concern¬ 
ing  the  seas.) 

Rationalization  of  Rule 

Consolidation  of  rule  produced  larger,  more 
visible  and  stable  territorial  containers  for  the  next 
phase  in  the  “state’s  story,”  which  can  be  labeled 
rationalization  of  rule.  Postmedieval  Western 
modernization  entailed  a  thorough-going  institu¬ 
tional  differentiation  between  diverse  spheres  of 
social  existence  and  cultural  practice.  This  allowed 
the  rationalization  of  each  sphere,  for  it  allowed 
the  efficiency  and  effectiveness  of  the  workings  of 
each  to  be  verified  with  reference  to  distinctive 
criteria  of  success  and  failure.  This  also  happened, 
in  highly  varied  ways,  as  concerns  the  relation 
between  the  political  and  the  other  societal  spheres. 
Rulers  increasingly  sought  to  place  themselves  in 
charge  of  all  political  concerns  of  society  and  only 
those.  One  expression  of  this  was  the  successful 
(though  contested)  enthronement  of  the  idea  of 
state  reason  as  the  distinctive,  imperious  standard 
and  polestar  of  their  action.  Much  more  significant 
was  the  secularization  of  the  state — the  protracted, 
controversial  process  whereby,  so  to  speak,  the 
state  got  out  of  the  business  of  sending  souls  to 
heaven,  in  which  previously  it  had  been  much 
involved. 

Less  visible,  but  in  the  long  run  at  least  as  sig¬ 
nificant,  was  a  process  that  found  early  expression 
in  a  formula  adopted  by  absolutist  rulers.  They 
claimed  for  themselves  all  the  imperium — that  is,  all 
faculties  and  facilities  related  to  the  political  man¬ 
agement  of  society — but  left  to  the  “particulars” 
(which  at  the  time  meant  chiefly  the  landowning 
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estates  and  various  city-based  corporate  bodies) 
the  dominium,  that  is,  the  property  of  most  eco¬ 
nomic  resources.  In  due  course,  this  led  to  an 
increasing  institutional  autonomy  of  the  economic 
sphere.  The  progressive  marketization  of  the  pro¬ 
duction  and  distribution  of  wealth  would  progres¬ 
sively  limit  some  of  the  economic  policies  pursued 
by  the  absolutist  rulers  themselves  and  replace 
them  as  far  as  possible  with  legal  frameworks 
within  which  the  private  initiatives  of  individual 
market  actors  could  display  their  peculiar,  power¬ 
ful  dynamics. 

This  process  of  differentiation  and  mutual 
autonomization  of  parts  (which  can  be  character¬ 
ized  as  the  progressive  matching  of  structures  to 
distinct  societal  functions)  was  carried  out  also 
within  the  political  sphere  itself  through  what 
came  to  be  called  the  separation  of  powers  (legisla¬ 
tive,  jurisdictional,  and  executive)  and,  as  noted 
above,  through  the  increasingly  complex  structure 
of  the  state’s  administrative  apparatus.  Here,  the 
rationalization  of  rule  finds  further  expression  in 
the  emergence  of  a  professional  body  of  adminis¬ 
trators.  These,  in  the  name  of  their  competences 
(assumed  to  be  proved  by  the  possession  of  degrees 
and  by  success  in  open  competitions  for  public 
appointment),  displace  and  replace  (without 
depriving  them  of  their  economic  advantages  and 
status  privileges)  the  members  of  traditional  elites 
who  had  previously  exercised  various  political, 
administrative,  judicial,  and  military  functions  as  a 
matter  of  right.  Increasingly,  those  functions  come 
to  be  exercised  instead  as  a  matter  of  duty  by 
expressly  selected,  appointed,  and  promoted  indi¬ 
viduals.  The  consequent  rationality  gain  is  obvi¬ 
ous,  for  instance,  in  the  ease  with  which  their 
activities  can  be  programmed,  coordinated,  moni¬ 
tored,  and  sanctioned  from  the  political  center. 

Expansion  of  Rule 

Having  strengthened  and  rationalized  their 
administrative  apparatus,  states  can  embark  on  a 
third  phase  of  their  development,  which  can  be 
called  the  expansion  of  rule.  This  phenomenon  has 
that  apparatus  as  its  protagonist  (and,  in  some 
interpretations,  its  main  beneficiary).  But  it  has 
roots  also  in  other  institutional  processes,  includ¬ 
ing  some  pertaining  to  the  relation  between  state 
and  society  itself. 


The  latter  increasingly  constitutes  a  complex 
and  changing  environment,  set  in  motion  by  mod¬ 
ernizing  processes  such  as  increasing  literacy,  the 
first  printed  media,  incipient  processes  of  urban¬ 
ization  and  industrialization,  and  the  formation  of 
new  social  groupings  and  new  elites. 

In  certain  political  contexts,  the  rationalized 
administration  seeks  to  take  charge  of  those  pro¬ 
cesses  on  behalf  of  the  ruler.  Consider  a  statement 
on  Polizey  from  18th-century  Prussia,  a  state  with 
a  marked  tendency  to  authoritarian  rule  and  a 
particularly  advanced  administrative  system: 

Police,  in  the  broad  understanding  of  the  term, 
refers  to  all  those  measures  in  the  internal  affairs 
of  a  nation  through  which  the  wealth  of  the  state 
may  be  more  permanently  established  and 
maintained.  .  .  .  Police,  in  the  narrow  sense  of  the 
term,  refers  to  all  that  which  is  required  for  the 
proper  condition  of  civil  life,  and  in  particular 
for  the  maintenance  of  good  order  and  discipline 
among  subjects.  (Charles  Maier,  1997,  p.  110) 

In  other  contexts,  however,  the  developments 
mentioned  above  generate  new  forms  of  social, 
cultural,  and  economic  power  in  the  hands  of 
groups  that  resist  and  sometimes  openly  oppose 
the  political  practices  of  the  ancien  regime,  such  as 
the  privileged  roles  it  reserved  for  members  of  the 
aristocracy;  the  arcana  imperii,  which  excluded  the 
processes  of  policy  formation  from  the  observa¬ 
tions  and  the  inputs  of  an  increasingly  informed 
and  sophisticated  public;  or  extensive  censorship 
of  printed  materials.  Where  such  groups  are  suc¬ 
cessful,  the  result  is  the  emergence  of  the  liberal 
public  sphere.  Here,  private  individuals  and  spon¬ 
taneously  formed  groupings  can  address  each  other 
regarding  matters  of  common  interest,  formulate 
claims  and  critiques  concerning  their  handling  by 
the  authorities,  generate  bodies  of  opinion,  and 
eventually  form  expressly  political  alignments, 
which  compete  for  electoral  success  within  increas¬ 
ingly  broad  and  active  constituencies  and  thus  for 
the  formation  of  governments. 

By  establishing  a  liberal  public  sphere,  a  state 
can  project  itself  as  to  some  extent  mandated  and 
empowered  by  its  population,  committed  to  serv¬ 
ing  its  needs.  The  involvement  in  the  public  sphere 
of  broader  and  broader  strata  of  the  population 
and  the  fact  that  normally  the  outcomes  of  public 
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decisions  are  accepted  also  by  those  whose  propos¬ 
als  are  not  successful  increasingly  legitimize  and 
strengthen  the  state. 

However,  since  a  liberal  public  sphere  operates 
as  a  hinge  between  civil  society  and  the  state  and 
transmits  to  the  latter  impulses  from  the  former,  it 
unavoidably  also  reveals  the  divided  nature  of  the 
society  itself,  the  cleavages  (political,  socioeco¬ 
nomic,  cultural,  and  regional)  that  traverse  it,  and 
the  contrast  between  the  interests  activating  its  dif¬ 
ferent  components.  The  mechanisms  of  political 
representation  increasingly  map  these  antagonisms 
onto  the  legislative  organs,  all  the  more  so  as  the 
suffrage  is  broadened.  Policy  formation  becomes 
largely  adversarial;  the  policy  proposals  and  cri¬ 
tiques  of  government  and  opposition  advocate 
contrasting  demands;  and  the  elites  guiding  the  par¬ 
ties  commit  themselves,  if  put  in  control  of  govern¬ 
ment,  to  preserve  or  modify  the  current  distribution 
of  economic  advantage  within  the  population. 

Adversarial  policy  formation  confirms  and 
emphasizes  a  characteristic  of  the  state  we  have 
already  mentioned:  the  contingent,  open-ended 
nature  of  political  initiatives.  By  the  same  token, 
however,  it  threatens  the  state’s  abiding  commit¬ 
ment  to  its  own  unity,  its  claim  to  represent  and 
pursue  the  general  interest  and  to  override  those  of 
a  sectional  nature  that  divide  civil  society  and, 
within  the  public  sphere,  articulate  intrinsically 
controversial  issues. 

There  are  two  basic  responses  to  this  threat, 
both  of  which  in  different  ways  promote  the 
expansion  of  rule.  The  first  is  the  promotion  of 
citizenship.  Here,  only  the  most  antagonistic  and 
subversive  claims  conveyed  in  the  public  sphere  are 
(more  or  less  openly)  suppressed,  while  others  are 
allowed  expression  and  a  measure  of  realization, 
thus  institutionalizing  the  related  conflicts  and 
making  them  less  divisive.  In  this  way,  contrasting 
proposals  can  be  moderated  and  made  the  object 
of  compromise.  Policies  of  this  nature  are  generally 
associated  with  progressive  parties  but  were  some¬ 
times  undertaken  by  conservative  governments — 
for  instance,  Bismarck  initiated  in  Germany  some 
of  the  early  provisions  characteristic  of  what 
would  later  be  called  the  “welfare  state.” 

The  other  response  confronts  in  a  different  fash¬ 
ion  the  threat  to  state  unity  generated  by  some 
aspects  of  political  modernization.  Oriented  to  the 
promotion  and  the  fostering  of  nationhood,  it 


consists  in  evoking  within  the  population  an  emo¬ 
tionally  compelling  sense  that  it  shares  politically 
significant  commonalities,  expressing  a  single 
political  identity  and  a  single  political  destiny. 
These  commonalities  supposedly  transcend  and 
relativize  all  existent  cultural  and  socioeconomic 
divisions  within  the  society  and  are  focused  on  the 
state  itself,  investing  its  unity  and  its  might  with 
value  in  everybody’s  mind.  The  promotion  of 
nationhood,  in  a  sense,  attributes  to  the  early- 
modern  notion  of  reason  of  state — originally  con¬ 
ceived  as  the  exclusive  concern  of  a  very  narrow 
political  elite,  the  object  of  deliberations  and 
machinations  covered  by  the  arcana  imperii — a 
broader  and  deeper  resonance  and  makes  it  attain 
significance  in  the  hearts  and  minds  of  the  masses. 

At  any  rate,  in  the  prevailing  political  rhetoric, 
the  notion  of  national  interest  becomes  the  lode¬ 
star  of  the  most  important  forms  of  policy  making. 
The  state  itself  acquires  a  new  justification  as  the 
political  instrument  of  the  nation  itself,  conceived 
as  a  social  entity  grounded  on  prepolitical  com¬ 
monalities,  which  generate  in  its  components  a 
shared  belonging.  Among  other  things,  nation¬ 
hood  entails  that  the  population  has  a  close,  abid¬ 
ing,  intensely  meaningful  relation  with  the  state’s 
territory  and  construes  it  as  the  very  body  of  the 
nation.  When  this  relation  cannot  be  attained  or 
maintained,  their  frustration  feeds  nationalist  com¬ 
plaints  and  aspirations. 

There  are  significant  differences  between  these 
two  positions,  revealed  among  other  things  by  the 
style  and  content  of  the  respective  flows  of  politi¬ 
cal  communication.  Citizenship,  as  indicated,  aims 
to  moderate  and,  to  an  extent,  reconcile  differ¬ 
ences  revealed  by  the  public  sphere;  nationhood 
aims  to  transcend  them.  The  latter  does  so  by 
focusing  on  the  fact  that,  as  already  emphasized, 
each  state  exists  next  to  others  and  together  they 
constitute  an  unstable  and  threatening  environ¬ 
ment,  whereas  the  promotion  of  citizenship 
addresses  in  the  first  place  a  state’s  internal  condi¬ 
tions.  The  politics  of  nationhood  are  high  politics, 
focused  on  the  pursuit  of  collective  destiny  chiefly 
through  might;  the  politics  of  citizenship  are  some¬ 
times  labeled  low  politics,  for  typically,  they 
address  humdrum  issues  of  distribution  of  the 
domestic  product  by  increasing  or  abridging  the 
rightful  claims  of  different  sections  of  the  popula¬ 
tion.  What  we  may  call  the  discourse  of  citizenship 
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tends  to  be  conducted  in  a  relatively  factual,  cogni¬ 
tively  oriented  manner,  largely  to  establish  what 
contribution  various  groups  make  to  the  economy 
and  whether  they  are  properly  rewarded  for  it.  The 
discourse  of  nationhood  is  largely  intended  to  evoke 
and  celebrate  shared  sentiments;  its  content  is  highly 
symbolic  and  is  often  the  product  at  least  as  much 
of  creative  intellectuals  and  literati  (musicians, 
poets,  and  novelists)  as  of  experts  and  specialists. 
(Characteristically,  the  book  that  in  late-  19th-  and 
early-20th-century  Italy  did  the  most  to  generate 
nationalist  feelings  among  schoolchildren  bore  the 
title  Cuore  [Heart].) 

For  all  this,  the  two  responses  do  not  differ 
radically,  and  there  are  affinities  between  them.  For 
instance,  the  expansion  of  suffrage  has  been  his¬ 
torically  associated  with  the  increasing  rate  of  mili¬ 
tary  participation  required  by  the  modernization  of 
warfare;  other  rights  of  citizenship  have  sometimes 
expanded  to  reward  subaltern  groups  of  the  popu¬ 
lation  for  efforts  made  and  sacrifices  borne  in  times 
of  war  on  behalf  of  national  interests.  Policies  to 
remedy  some  extremes  of  socioeconomic  inequality 
within  the  population,  particularly  those  associated 
with  regional  differences,  have  often  been  consid¬ 
ered  necessary  for  the  promotion  of  nationhood. 
More  generally,  both  citizenship  and  nationhood 
project,  in  somewhat  different  ways,  the  same  view 
of  the  population  as  the  constituency  of  the  state, 
and  both  may  be  considered  as  different  but  over¬ 
lapping  aspects,  on  the  one  hand,  of  progressive 
democratization  of  the  state  and,  on  the  other,  of 
the  expansion  of  rule. 

Vicissitudes  of  the  20th-Century  State 

Both  the  previous  sections  have  had  as  their  focus 
(the  first  in  conceptual  terms,  the  second  via  a 
schematic  narrative)  the  “state  story”  as  it  devel¬ 
oped  up  to  Weber’s  times.  Only  some  of  the  most 
significant  later  developments  can  be  treated  here, 
and  that  too  only  briefly,  with  an  emphasis  on  the 
more  problematical  ones. 

First,  the  late  19th  and  20th  century  witnessed 
what  could  be  called  the  state’s  success  as  the  uni¬ 
versally  recognized  institutional  arrangement  for 
constructing  and  managing  large-scale  polities 
with  a  territorial  base.  By  the  end  of  the  20th  cen¬ 
tury,  almost  every  part  of  the  world  had  fallen 
under  the  jurisdiction  of  one  state  or  the  other. 


(The  major  exception,  not  discussed  here,  is  that  of 
extraterritorial  waters. )  This  drastic  extension  of  a 
model  that  originally  had  a  regional  basis  in 
Western  Europe  took  place  over  a  few  centuries 
but  was  accelerated  by  phenomena  such  as  nation¬ 
alism  (which  in  a  sense  challenged  and  inverted  the 
sequence  proposed  earlier  “first  state,  then 
nation”),  decolonization,  the  position  of  unique 
visibility  and  legitimacy  gained  after  World  War  II 
by  the  United  Nations  and  a  few  other  interna¬ 
tional  organizations  (mostly  with  states  as  their 
constituents),  and  the  breakdown  of  empire-like 
polities  such  as  the  former  USSR  and  Yugoslavia 
and  the  establishment  of  numerous  new  states  on 
parts  of  their  territory.  However,  the  state  model 
can  be  interpreted  and  implemented  in  various 
ways,  some  of  which  hardly  qualify  as  states  in  the 
meaning  of  the  expression  we  derived  from  Weber. 
There  are  in  any  case  strong  states  and  weak  states, 
and  the  current  expressions  failed  states  and  rogue 
states  point  to  further  dimensions  of  variability. 

Furthermore,  arguably,  no  contemporary  state 
entirely  possesses  a  key  characteristic  of  the  classical 
state:  territorial  sovereignty.  The  territorial  compo¬ 
nent  has  been  weakened  by  some  aspects  of  eco¬ 
nomic  and  cultural  globalization;  by  ecological 
phenomena  that  ignore  territorial  boundaries;  by 
massive,  uncontainable  migration  flows;  and  by 
forms  of  warfare  the  protagonists  of  which  have  no 
distinctive  territorial  bases.  The  sovereignty  of 
many  states  has  been  compromised  by  their  more  or 
less  open  surrender  of  various  jurisdictional  facul¬ 
ties  and  resources  to  international  organizations  of 
various  kinds.  The  development  of  nuclear  weapons 
has  called  into  question  a  basic  component  of 
“Westphalian”  territorial  sovereignty — the  right  of 
states  to  assert  interests  they  deem  critical  by  wag¬ 
ing  war — because  the  “ascent  to  the  extremes” 
characteristic  of  warfare  has  acquired  the  potential 
to  visit  utter  and  final  doom  on  both  parties  and, 
possibly,  the  rest  of  humankind  as  well. 

The  domestic  dimension  of  the  state’s  institu¬ 
tional  mission — its  relationship  to  civil  society — 
also  has  seen  massive  developments  in  the  20th 
century.  The  most  significant  of  these  can  be  seen  in 
the  development  of  totalitarianism.  This  happens  in 
some  states  when  unbounded  political  power  has 
not  just  modified  that  relationship  to  the  state’s 
advantage  but  in  a  sense  abolished  it  by  denying 
any  autonomy  to  the  institutions  and  processes  that 


State  2495 


structure  civil  society  itself:  the  family,  organized 
religion,  the  market,  public  opinion,  education  and 
science,  and  social  and  cultural  movements.  If 
some  complementarity  with  civil  society  is  concep¬ 
tually  essential  to  the  state  itself,  totalitarianism 
can  be  seen,  paradoxically,  as  abolishing  the  state 
itself.  In  fact,  it  has  been  suggested  that  in  totalitar¬ 
ian  systems,  ultimate  political  power  lies  not  with 
state  institutions  proper  but  with  a  single  party 
and  its  supreme  leadership — sometimes  a  single 
individual.  Only  these  can  activate  and  direct  the 
powers  vested  in  state  institutions. 

Even  in  states  committed  to  liberal  democracy, 
the  expansion  of  rule  has  continued  powerfully 
into  the  20th  century,  generally  shifting  the  bound¬ 
aries  between  state  and  society  by  allowing  the 
former  greater  leverage  over  the  latter.  This  is  an 
“overdetermined”  phenomenon,  produced  by  a 
number  of  diverse  and  often  not  otherwise  related 
causal  influences.  Schematically,  these  can  be  seen 
to  arise  either  from  the  state  or  from  the  society 
side  of  the  boundaries. 

On  the  state  side,  expansion  has  been  produced 
in  the  first  place  by  the  states  pursuing  those  dis¬ 
tinctively  political  interests  related  to  each  state’s 
position  in  the  international  order.  This  holds  par¬ 
ticularly  for  the  great  increase  in  state  involvement 
in  societal  management  occasioned  by  the  two 
world  wars.  More  sustained  and  equally  signifi¬ 
cant  in  the  long  run  is  the  impulse  that  involve¬ 
ment  received  as  a  result  of  the  fact  that,  under 
liberal  democratic  rules,  adversary  politics  is  the 
main  matrix  of  policy.  However,  it  is  not  the  only 
matrix,  for  a  further  impulse  has  come  from  the 
state  administrative  apparatus’s  own  interest  in 
gaining  a  greater  and  greater  capacity  for  authori¬ 
tative  oversight  and  regulation  of  social  affairs, 
with  the  result  that  there  has  been  an  increase  in  its 
own  size  and  complexity,  the  entity  of  the  resources 
it  extracts  and  manages,  and  the  related  degree  of 
discretion. 

A  look  at  society’s  side,  however,  suggests  that 
the  often  evoked  imagery  of  an  increasingly  inva¬ 
sive,  interfering,  and  greedy  state  has  serious  limi¬ 
tations.  In  the  first  place,  societies  may  witness 
structural  developments  (e.g.,  new  forms  of  com¬ 
munication  or  financial  innovations)  or  cultural 
developments  (e.g.,  those  inspiring  “identity  poli¬ 
tics”)  that  displace  existing  forms  of  legal  and 
administrative  regulations.  Often,  the  resultant 


vacuum  can  only  be  filled  by  the  state  modifying 
previous  regulations  and  adding  new  ones;  if  this 
does  not  occur,  large-scale  social  and  economic 
disaster  may  take  place.  Furthermore,  state-ruled 
societies  are  all  characterized  by  persistent  and 
significant  divisions,  some  of  which  can  only  be 
tempered  or  accommodated  by  the  state’s  distribu¬ 
tive  policies  and  the  related  forms  of  administra¬ 
tive  intervention.  The  social  demand  for  this  is 
generally  expressed  through  adversarial  politics 
and/or  by  alignment  with  the  state’s  administrative 
apparatus  to  establish  new  units  and  acquire  new 
faculties  and  resources. 

The  most  visible  product  of  this  phenomenon  is 
the  great  20th-century  expansion  of  the  welfare 
state.  But  seeing  this  as  the  product  chiefly  of  “the 
politics  of  envy”  is  facile  and  ignores  multiple 
demands  for  state  intervention  and  spending  that 
originate  instead  from  powerfully  entrenched  eco¬ 
nomic  groups,  seeking  public  support  for  their 
financial  needs,  their  technological  innovations, 
and  their  search  for  secure  markets  at  home  and 
abroad.  In  other  terms,  there  is  a  welfare  state  also 
for  the  rich,  not  just  the  poor. 

The  continuing  and  intensified  expansion  of 
state  rule  in  the  20th  century  engenders,  as  we 
have  suggested,  the  tendency  of  the  administrative 
apparatus  to  grow  in  size  and  complexity.  In  turn, 
this  makes  it  more  and  more  difficult  for  the  state’s 
political  summit  to  coordinate  the  activities  of  that 
apparatus  across  its  multiple,  diverse,  and  often¬ 
times  competing  units  to  ensure  the  effectiveness 
and  efficiency  of  their  operations. 

The  widely  used  notion  of  state  overload 
explains  these  problems  but  should  be  comple¬ 
mented  by  other  considerations.  To  some  extent, 
the  bureaucratic  mode  of  organization  of  the  state 
itself,  as  Weber  presented  it  by  way  of  the  ideal- 
typical  method,  is  rendered  problematical  by  the 
increasingly  diverse,  changing,  dynamic,  open 
social  and  cultural  environments  characteristic  of 
contemporary  advanced  societies.  One  senses  that, 
in  such  a  context,  that  model  can  no  longer  deliver 
on  its  promise  of  continuing  rationalization  of 
existence.  Although  the  state  is  by  no  means  the 
only  sphere  in  which  the  bureaucratic  model 
asserted  itself  in  the  course  of  modernization,  that 
model  appears  to  have  been  successfully  revised, 
improved,  or  to  an  extent  supplanted  in  other 
spheres,  especially  the  economy.  In  the  political 
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sphere,  it  seems  to  suffer  from  an  additional  degree 
of  rigidity,  of  a  denial  of  its  own  difficulties  as  it 
were,  although  here  too  there  are  attempts  at  tran¬ 
scending  those  limitations:  Consider  the  late-20th- 
century  slogan  “reinventing  government.”  But  by 
and  large,  it  is  not  unfair  to  call  attention  to  the 
contradiction  between,  on  the  one  hand,  the  state’s 
increasing  attempts  at  intervention  and  regulation 
of  societal  affairs  and,  on  the  other,  the  intrinsic 
clumsiness  and  costliness  of  its  modus  operandi. 

But  this  critique  should  not  be  taken  entirely  at 
face  value.  In  most  of  its  formulations,  it  is  an 
ideological  aspect  of  a  phenomenon  that  warrants 
some  suspicion — the  deliberate,  self-interested 
offensive  against  the  state  of  large  accumulations 
of  economic  power  characteristic  of  the  end  of  the 
20th  century  and  the  beginning  of  the  21st. 
Essentially,  the  dominant  units  of  the  contempo¬ 
rary  economy  have  been  seeking  to  take  over  more 
and  more  aspects  of  general  societal  management; 
and  while  doing  so  in  the  pursuit  of  their  own 
interests,  they  have  appealed  to  what  they  claim  to 
be  the  undeniable,  intrinsic,  across-the-board  supe¬ 
riority  of  “the  market”  over  “politics/the  state.” 

The  most  visible  aspect  of  this  offensive  is  con¬ 
stituted  by  so-called  economic  globalization.  But 
this,  in  the  words  of  Vincenzo  Roppo  (2001),  an 
Italian  legal  scholar,  is  in  essence  “a  gigantic  pro¬ 
cess  whereby  state  functions  are  transferred  to 
markets”  (p.  531).  The  arguments  for  the  legiti¬ 
macy  and  necessity  of  the  process  are  the  core  of 
neoliberal  ideology,  and  their  pars  destruens  con¬ 
sists  largely  in  denouncing  the  intrinsic,  unavoid¬ 
able  weakness  and  wastefulness  of  rule  itself  as  the 
only  alternative  mode  of  societal  management. 

The  vast  resonance  imparted  to  such  arguments 
by  media  of  all  kinds  (including  academic  litera¬ 
ture)  has  put  the  state  (and  politics)  on  the  defen¬ 
sive,  under  charge,  especially,  of  inefficiency. 
There  are  serious  grounds  for  the  charge,  but  the 
arguments  present  them  as  uniquely  valid,  whereas 
the  criterion  of  efficiency  itself  should  be  recog¬ 
nized  as  not  being  unmistakably  and  universally 
valid  and  paramount  but  as  constituting  an  argu¬ 
able  value  preference  and  thus,  unavoidably,  as 
itself  entailing  a  political  choice. 
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A  virtual  state  is  a  country  unit  that,  like  a  virtual 
corporation,  has  largely  transferred  its  production 
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overseas.  This  occurs  for  two  reasons.  First,  the 
industrialization  process  itself  stresses  manufac¬ 
turing  production  but  ultimately  locates  greater 
value  in  the  output  of  high-level  services.  Thus, 
every  industrial  economy  goes  through  this  shift 
from  an  emphasis  on  the  products  of  land,  to 
manufacturing,  and  then  to  services.  Second,  even 
if  manufacturing  output  is  still  critical  for  the  cor¬ 
poration,  it  can  best  be  produced  in  countries  with 
low-wage  but  technically  competent  labor  sup¬ 
plies.  The  virtual  corporation,  in  this  way,  abets 
the  development  of  the  virtual  state.  This  entry 
reviews  small  states,  large  states,  and  the  impact 
of  “virtuality”  on  peace. 

Small  and  Large  States 

Initially,  small-island  or  littoral  countries  such  as 
Hong  Kong,  Taiwan,  Singapore,  and  Holland  pro¬ 
duce  goods  at  home.  Then,  they  discover  that  these 
goods  may  be  produced  more  economically  on  the 
Chinese  mainland,  Indonesia,  Malaysia,  Vietnam, 
or  Bangladesh.  Shares  of  gross  domestic  product 
(GDP)  in  the  various  virtual  states  shift  increas¬ 
ingly  toward  services,  with  Hong  Kong  ultimately 
having  little  or  no  home  production  at  all.  Foreign 
direct  investment  rises  as  a  consequence,  and  pro¬ 
duction  takes  place  in  areas  that  the  home  country 
does  not  control.  Guaranteed  economic  access  to 
that  region  then  becomes  the  necessary  substitute 
for  imperial  governance. 

The  process  is  not  confined,  however,  to  small 
or  island  nations.  Even  major  and  large  states 
experience  “virtuality”  as  they  shift  away  from 
home  manufacturing  to  technical  services  such  as 
research  and  development.  Japan  still  retains  30% 
of  its  GDP  in  manufacturing,  but  in  the  United 
States,  the  figure  is  less  than  20%.  European  coun¬ 
tries  are  moving  in  the  same  direction,  and  India,  a 
developing  country,  has  grown  particularly  because 
of  the  value  of  its  service  industries.  It  follows,  of 
course,  that  some  nations  must  do  the  manufactur¬ 
ing  that  is  outsourced  to  them  from  overseas.  Brazil, 
China,  Mexico,  Poland,  Hungary,  and  the  East 
European  states  have  benefited  from  this  shift.  In  this 
sense,  there  is  a  distinction  between  “head  nations” — 
countries  that  decide  what  shall  be  produced  and  do 
the  design,  financing,  and  marketing — and  “body 
nations,”  which  perform  the  manufacturing  tasks. 
This  is  not  a  hard-and-fast  distinction;  however, 


many  erstwhile  body  nations  such  as  South  Korea 
and  China  are  now  in  the  process  of  becoming 
head  nations.  Nor  do  head  nations  fail  to  do 
manufacturing  for  others.  Taiwan  has  devolved 
many  manufacturing  functions  on  China,  but  it 
also  acts  as  a  “foundry”  to  produce  goods  to  order 
for  client  head  countries.  In  this  way,  even  the 
United  States  has  acted  as  a  body  nation  for 
Japanese,  German,  French,  and  even  Chinese  pro¬ 
duction  located  within  its  borders.  Countries  that 
produce  abroad  may  seek  to  have  their  production 
sold  in  those  markets,  and  they  thereby  escape  the 
tariff  walls  between  them  and  their  market.  As 
Robert  Mundell  has  shown,  the  movement  of  fac¬ 
tors  of  production  abroad  can  compensate  for 
limits  on  export  trade.  In  the  1930s,  however,  this 
process  did  not  occur,  and  nations  seeking  raw 
materials  and  markets  believed  that  they  had  to 
conquer  territory  to  retain  access  to  both.  The 
military  conflict  that  followed  was  in  part  the 
result  of  economic  restrictions. 

In  theory,  the  virtual  state,  a  model  to  which 
many  states  are  tending,  carries  with  it  a  new  sys¬ 
tem  of  international  politics.  In  the  past,  when 
military  conflict  and  the  irrepressible  desire  for 
territory  determined  relations  among  states,  the 
main  flow  between  countries  consisted  of  armies. 
If  the  virtual  model  holds,  future  flows  will  be 
largely  economic  as  capital,  technology,  labor 
power,  and  information  move  rapidly  among 
states.  In  the  long  term,  national  access  to  mobile 
factors  of  production  (capital,  technology,  and 
labor)  can  replace  the  need  to  control  or  own  addi¬ 
tional  land. 

Transferring  the  bulk  of  their  home  production 
overseas  and  shifting  most  of  their  economy  to 
high-level  services,  virtual  states  reshape  both  pro¬ 
ductive  and  international  relationships.  They  inau¬ 
gurate  a  world  based  on  mastery  of  flows  of 
production  and  purchasing  power  rather  than  on 
stocks  of  goods.  They  emancipate  labor  from  rou¬ 
tine  mechanical  tasks  and  offer  new  employment  in 
technical  or  creative  services.  They  usher  in  a  world 
based  on  education  and  human  capital  rather  than 
on  machines  and  physical  capital.  They  offer 
nations  the  opportunity  to  forge  international  links 
of  production  that  are  difficult  if  not  impossible  to 
break.  (Some  believe  that  the  relation  between  the 
United  States  and  China  in  both  political  and  eco¬ 
nomic  terms  is  reaching  an  important  level  of 
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“unbreakability.”)  Like  the  headquarters  of  a  vir¬ 
tual  corporation,  the  virtual  state  determines  over¬ 
all  strategy  and  invests  in  its  people  rather  than 
amassing  expensive  production  capability.  It  con¬ 
tracts  out  other  functions  to  states  that  specialize  in 
or  need  them.  Imperial  Great  Britain  may  have  been 
the  model  for  19th-century  practice,  but  Singapore 
may  be  the  model  for  the  21st  century.  The  state  no 
longer  commands  resources  as  it  did  in  mercantilist 
yesteryear;  rather,  it  negotiates  with  foreign  and 
domestic  capital  and  labor  to  lure  them  into  its  eco¬ 
nomic  sphere  and  to  stimulate  its  growth. 

The  virtual  state,  however,  is  not  equal  to  the 
small  state.  Even  large  states  undergo  the  eco¬ 
nomic  transition  from  land,  to  manufacturing,  to 
services.  Like  larger  states,  small  states  can  shift 
their  production  overseas  only  if  they  are  protected 
in  doing  so.  They  therefore  depend  on  guarantees 
of  freedom  of  the  seas  and  open  trade  routes  to 
carry  on  their  business.  Some  larger  power  or  pow¬ 
ers  have  to  provide  these  protections,  and  it  is  not 
surprising,  therefore,  that,  to  continue  to  exist, 
small  virtual  states  need  to  have  either  alliances 
with,  or  de  facto  support  from,  much  larger  and 
stronger  powers.  The  small  virtual  state  depends 
on  a  single  Great  Power  protector  or  on  a  balance 
of  power  internationally.  In  addition,  if  tariffs  rise 
or  even  if  they  fail  to  fall,  virtual  states  are  in  jeop¬ 
ardy.  This  is  because  capital  moves  much  more 
quickly  than  goods.  An  indebted  country  cannot 
quickly  send  new  cascades  of  exports  abroad  and 
hope  to  finance  a  looming  deficit.  The  first  moves 
slowly:  the  second  instantaneously. 

In  recent  years,  small  virtual  states  such  as 
Indonesia,  Malaysia,  Thailand,  and  South  Korea 
have  needed  “spare  tires.”  They  have  required 
access  to  “spare  finance”  to  tide  them  over  periods 
of  economic  crisis.  In  1997-1998,  Thailand  could 
not  meet  payments  on  its  international  obligations, 
and  speculators  withdrew  their  money.  The  Thai 
currency — the  baht — had  been  overvalued  for 
some  time.  It  began  to  lose  parity  with  the  dollar 
and  the  yen.  Since  Thai  borrowings  were  in  foreign 
currency,  Bangkok  needed  more  local  currency  to 
compensate  for  it.  When  this  was  printed,  it  inevi¬ 
tably  caused  inflation.  When  Thailand  devalued,  it 
did  not  have  enough  foreign  currency  to  cover  its 
obligations.  It  needed  money  from  the  International 
Monetary  Fund  (IMF)  or  other  lenders  to  cover  its 
losses,  but  these  sources  would  only  help  if 


Thailand  put  on  high  interest  rates  and  threw  its 
economy  into  recession,  cutting  imports.  After 
Thailand  devalued,  speculators  attacked  the  cur¬ 
rencies  of  Indonesia,  Malaysia,  and  Korea,  all  of 
which  were  (in  relation)  overvalued.  All  three  were 
thrown  into  recession  to  meet  the  demands  of 
international  investors.  Small  virtual  states,  thus, 
needed  access  to  much  larger  stores  of  capital.  They 
were  too  small  financially  to  stand  on  their  own 
feet.  At  the  same  time,  Japan  and  China,  much  big¬ 
ger  economies — with  greater  hoards  of  foreign 
capital — breezed  through  the  1998  crisis  without  a 
tremor.  Economic  size  began  to  tell  the  tale. 

As  larger  states  become  virtual,  they  establish 
production  chains  that  save  on  labor  costs  and  that 
link  them  to  distant  markets.  Few  links  in  this 
chain  remain  within  the  home  country.  The  Hong 
Kong  firm  Li  &  Fung,  experts  in  fashion  textiles, 
illustrates  this  point: 

Li  and  Fung  work  with  10,000  suppliers  in  48 
countries  to  source  materials  and  makers  for 
clients.  So  a  fabric  from  India  that  gets  dyed  in 
China  will  go  to  Thailand  to  be  embroidered  (with 
sequins  made  in  Korea  and  rhinestones  from 
Brazil)  and  then  return  to  China  to  be  cut  into 
garments.  The  firm  adheres  to  a  “30/30”  principle; 
it  guarantees  that  it  will  purchase  at  least  30%  of 
the  business  from  each  supplier,  but  it  will  not 
exceed  70%.  . .  .  And  it  requires  that  suppliers 
have  to  go  outside  the  network  to  survive — and 
thus  be  in  a  position  to  bring  in  new  ideas  from 
the  outside.  The  results  are  impressive.  In  terms  of 
asset  productivity,  Li  &  Fung  earns  30-50% 
return  on  equity.  Regarding  personnel  productivity, 
it  earned  $1  million  per  employee  per  year.  And  it 
scales  elegantly.  The  firm  took  in  $11  billion  in 
2004.  (Kenneth  Cukier,  2007,  pp.  22-23) 

The  Impact  of  “Virtuality”  on  Peace 

The  United  States,  Germany,  France,  and  Japan 
became  almost  equally  dependent  on  overseas  pro¬ 
duction  chains,  units  of  which,  however,  did  not 
work  for  one  corporation  or  even  one  country. 
They  were  available  for  many  headquarters  firms 
in  different  parts  of  the  world. 

Production  chains  make  warfare  among  partici¬ 
pating  units  difficult,  expensive,  and  perhaps  ulti¬ 
mately  self-defeating.  No  such  chains  existed  in 
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1914  or  1939.  Aggressive  states  will  have  to 
choose,  geographically,  between  trying  to  capture 
the  headquarters  of  the  virtual  state  or  its  produc¬ 
tion  links  with  other  countries  and  firms.  In  an  era 
of  capital  and  labor  mobility,  the  first  suggests  a 
loss  of  capital  and  headquarter  talent  overseas. 
Whether  the  in-house  experts  will  work  with  for¬ 
eign  occupiers  is  doubtful,  and  international  sanc¬ 
tions  will  in  any  event  likely  occur.  If  an  aggressor 
seeks  to  capture  one  or  more  links  in  an  overseas 
production  chain,  it  may  also  be  disappointed. 
Will  car  chassis  separated  from  engines,  transmis¬ 
sions,  wheels,  tires,  and  shocks  serve  much  of  an 
instrumental  or  even  financial  purpose?  Units  can, 
of  course,  be  sold  off  but  at  diminishing  prices, 
because  the  links  with  headquarters  and  research 
and  development  have  been  severed.  To  guard 
against  nationalization,  recent  research  shows  that 
multinational  corporations  investing  in  developing 
countries  have  tried  to  avoid  placing  full  assembly 
or  component  manufacture  in  one  place.  The  effect 
is  equally  marked  in  deterring  foreign  seizure.  The 
effect  is  redoubled  for  corporations  in  states  that 
enjoy  economies  of  scale  in  production. 

Software,  finance,  autos,  long-haul  civil  air¬ 
craft,  insurance,  microprocessors,  and  pharmaceu¬ 
ticals  are  monopolized  in  a  small  number  of  states. 
Aggression  that  strives  to  engross  such  industries 
will  fail  unless  Europe,  Japan,  and  the  United 
States  are  captured  simultaneously,  a  very  difficult 
task.  And  even  then,  in  all  three  cases,  the  econo¬ 
mies  are  gained  by  major  production  chains  over¬ 
seas  that  would  not  be  seized  coincidentally  with 
headquarters  nations. 

The  greatest  long-term  advantage  of  virtual 
states  is  not  just  economic;  it  is  also  pacific.  By  link¬ 
ing  with  countries  and  economic  capabilities  over¬ 
seas,  such  states  achieve  their  competence  and 
power  through  a  form  of  hard  interdependence 
with  others.  These  essential  ties  did  not  exist  in 
1914  or  1939,  but  they  are  becoming  characteristic 
in  recent  years.  Not  only  that:  As  “virtuality”  comes 
to  attach  itself  to  large  states  and  Great  Powers — 
such  as  China,  the  United  States,  Japan,  and 
Europe — it  provides  an  insurance  against  major  war 
that  did  not  exist  before  in  human  history. 
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State  Collapse 


State  collapse  represents  a  rare  and  an  extreme  level 
on  a  continuum  of  state  decay,  when  a  weak  state 
becomes  unable  and/or  unwilling  to  fulfill  its  task 
as  the  provider  of  public  goods.  Although  there  is 
no  consensus  on  a  precise  definition  of  the  term, 
still  there  are  several  traits  that  collapsed  states 
share.  The  first  is  the  disappearance  of  state  author¬ 
ity  as  grantor  of  security  and  welfare  of  its  citizens. 
The  second  is  the  appearance  of  subnational  lead¬ 
ers,  mainly  in  the  form  of  strong  warlords,  who 
control  several  hundred  armed  militias  that  rob, 
rape,  and  pillage  to  finance  their  war  efforts.  Third, 
the  national  identity  is  replaced  with  subnational 
identities,  including  those  of  ethnic  group,  religion, 
and  race.  Finally,  the  territorial  integrity  of  the  col¬ 
lapsed  state  becomes  compromised  when  all  other 
state  and  nonstate  actors,  especially  neighboring 
states,  intervene.  This  entry  analyzes  the  causes  and 
consequences  of  state  collapse  and  the  possible 
remedies  for  avoiding  it. 

The  phenomenon  of  state  collapse  is  a  recent 
addition  to  the  political  science  parlance  that  came 
with  the  end  of  the  Cold  War  in  the  1990s.  In  this 
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period,  the  number  of  internal  wars  around  the 
globe  began  to  rise,  with  deadly  consequences  in 
places  like  Rwanda  and  Bosnia.  Many  of  these 
conflicts  have  been  described  as  “ethnic  conflicts”; 
nonetheless,  ethnicity  is  only  one  aspect  of  political 
violence.  While  violent  conflicts  must  be  consid¬ 
ered  as  a  failure  of  the  state  to  perform  some  of  its 
fundamental  tasks,  one  must  also  pay  attention  to 
the  formation  of  the  state,  especially  in  most  of  the 
developing  world  such  as  Africa.  The  foundation 
of  these  states  is  colonialism,  which  made  use  of 
clan-based  divide-and-conquer  tactics  in  addition 
to  their  military  superiority.  Furthermore,  state 
formation  in  places  such  as  Africa  did  not  begin  on 
a  contractual  basis;  rather,  predatory  powers  con¬ 
quered  the  land,  created  state  institutions,  and 
imposed  taxes  and  forced  labor  intended  only  to 
facilitate  the  extraction  of  wealth,  mainly  in  the 
form  of  raw  materials,  in  these  territories. 

The  postcolonial  state  in  areas  like  Africa  began 
with  divided  societies  and  economies  that  rely 
heavily  on  rent  from  the  sale  of  commodities.  Since 
the  1960s,  however,  many  economies  in  the  region 
have  witnessed  a  decline  in  their  livelihood  as  a 
result  of  unstable  political  regimes  and  diminished 
commodity  prices  that  shattered  the  ability  to 
adequately  provide  public  services.  Many  of  these 
states  went  through  an  initial  phase  of  ethnic 
group  federation  as  a  substitute  for  the  lack  of 
essential  services  to  all  their  citizens.  Often,  a  just 
system  of  redistribution  within  and  among  groups 
is  the  key  to  creating  solidarity  among  citizens,  and 
its  breakdown  often  triggers  deadly  political  vio¬ 
lence.  Thus,  in  most  of  the  conflict-ridden  coun¬ 
tries,  nepotism  and  favoritism  were  practiced 
rather  than  a  fair  and  just  redistribution  system. 

With  the  end  of  the  Cold  War,  two  reforms, 
heavily  promoted  by  the  West,  further  undermined 
the  already  weak  states  in  many  parts  of  the  devel¬ 
oping  world.  The  first  one  involved  structural 
adjustment  programs  grounded  in  neoclassical 
economics,  which  promoted  privatization,  dis¬ 
couraged  subsidies,  and  devalued  currencies  and 
which  were  complemented  by  the  reduction  of  the 
amount  of  aid;  together,  these  measures  led  to 
heightened  inflation  and  reduced  many  govern¬ 
ment  services.  The  second  is  the  imposition  of 
political  reform  based  on  the  idealization  of  repre¬ 
sentative  democracy,  which  had  been  promoted  as 
the  only  ingredient  for  stability  and  development. 


However,  as  has  been  proven,  a  sudden  demo¬ 
cratic  transition  in  the  midst  of  economic  devasta¬ 
tion  facilitated  the  widespread  internal  wars  and 
cases  of  subsequent  state  failure  and  collapse  of  the 
1990s.  During  this  period,  state  after  state  in 
Eastern  Europe  and  Africa  experienced  deadly 
internal  wars  that  claimed  (and  still  claim  in  places 
like  Somalia  and  the  Democratic  Republic  of 
Congo)  many  lives,  created  thousands  of  refugees 
and  internally  displaced  populations,  and  eventu¬ 
ally  redrew  the  map  of  Eastern  Europe.  Moreover, 
with  the  absence  of  superpower  involvement, 
states  began  to  intervene  in  neighboring  conflicts, 
which  led  to  the  widespread  availability  of  arms 
and  mercenaries,  especially  child  soldiers. 

Nevertheless,  since  the  beginning  of  the  2fst 
century,  the  overwhelming  majority  of  collapsed 
states  are  in  the  continent  of  Africa.  According  to 
Fund  for  Peace,  7  out  of  the  top  10  failed  states  are 
in  the  African  continent.  The  remainder  of  this 
entry  examines  the  factors  behind  the  collapse  and 
what  actions,  if  any,  can  be  taken  to  reverse  their 
status. 

The  Process  of  Collapse 

A  strong  state  can  be  described  as  a  state  in  control 
of  its  territory  and  able  to  deliver  a  full  range  of 
political  goods  to  its  citizens.  Conversely,  weak 
states  are  incapable  of  controlling  their  territories 
and  unable  to  adequately  provide  public  goods. 
Such  states  become  weak  either  as  a  result  of  inher¬ 
ent  factors,  including  geographical,  physical,  and 
fundamental  economic  constraints,  or  because  of 
internal  antagonisms,  greed,  or  despotism.  Weak 
states  typically  harbor  ethnic,  religious,  linguistic, 
or  other  tensions,  which  may  be  transformed  into 
all-out  conflict  between  contending  groups.  Their 
ability  to  provide  adequate  political  goods  becomes 
diminished.  Gross  domestic  product  (GDP)  per 
capita  and  similar  indicators  fall,  corruption 
becomes  widespread,  and  the  rule  of  law  is  hardly 
practiced. 

In  some  instances,  weak  states  turn  into  failed 
states  as  they  are  no  longer  able  to  provide  essen¬ 
tial  political  goods.  They  progressively  forfeit 
their  role  as  the  preferred  national  suppliers  of 
political  goods,  which  in  turn  impels  warlords  and 
other  nonstate  actors  to  fill  the  vacuum.  Its  insti¬ 
tutions  are  flawed  where  both  legislature  and 
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judiciary,  if  they  exist  at  all,  ratify  the  decisions  of 
a  strong  executive  rather  than  being  independent. 
The  bureaucracy  loses  its  sense  of  professional 
responsibility.  Failed  states  exhibit  deteriorating 
infrastructures  where  telephones  fail,  water  sup¬ 
plies  dry  up,  and  power  falters.  Educational  and 
medical  facilities  crumble,  resulting  in  increased 
illiteracy,  infant  mortality,  poverty,  food  short¬ 
ages,  and  hunger.  Failed  states  are  insecure  and 
cannot  project  power  much  beyond  the  capital 
city.  Flowever,  failed  states  offer  unparalleled  eco¬ 
nomic  opportunity  for  a  privileged  few,  mainly 
those  within  the  ruling  class.  These  faltering  state 
failures  become  obvious  even  before,  or  as,  rebel 
groups  and  other  contenders  arm  themselves, 
threaten  the  residents  of  central  cities,  and  over¬ 
whelm  demoralized  government  contingents,  as  in 
the  cases  of  Liberia,  Nepal,  Somalia,  Rwanda, 
Sierra  Leone,  Democratic  Republic  of  Congo,  and 
Cote  d’Ivoire. 

Nevertheless,  many  states  recover  as  they  open 
the  political  space  to  other  concerned  parties  and 
move  toward  inclusive  and  stable  polity. 
Unfortunately,  some  states  progress  toward  more 
decay  and  eventually  collapse.  As  argued,  col¬ 
lapsed  states  are  rare  and  extreme  versions  of  a 
failed  state.  They  exhibit  a  vacuum  of  authority 
and  mere  geographical  existence,  where  the  forces 
of  entropy  have  overwhelmed  the  state  and  its  abil¬ 
ity  to  provide  some  semblance  of  order.  Political 
goods  are  obtained  through  private  means.  Security 
becomes  the  rule  of  the  strong,  where  warlords 
take  over.  However,  parts  of  the  collapsed  state 
may  exist  and  function,  albeit  unrecognized  and  in 
a  disorderly  manner. 

The  collapse  of  states  is  a  combination  of  two 
processes  of  decay  that  simultaneously  work  from 
inside  and  outside  the  country,  weakening  and 
eventually  destroying  the  state.  The  primary  force 
of  the  inside-out  process  is  the  prevalence  of  bad 
leadership.  These  leaders  over  the  years  weaken 
their  states  through  kleptocracy,  neopatrimonial- 
ism,  corruption,  and  politicization  of  elements 
such  as  ethnicity,  religion,  and  race.  Often,  state 
property  becomes  the  private  property  of  the  ruler, 
thus  making  the  control  of  the  state  zero  sum.  The 
collapsed  state  is  not  only  unable  to  develop  a 
stable  political  system  but  lacks  the  administrative 
capacity  to  govern  the  territory  effectively  and 
ignite  sustainable  economic  development.  The 


weakness  of  the  state  is  most  apparent  in  the 
periphery,  while  all  services  and  facilities  are  con¬ 
centrated  in  urban  areas.  Thus,  while  the  core  of 
the  state  weakens  through  mismanagement,  pres¬ 
sure  from  the  periphery  becomes  emboldened. 
Finally,  competition  for  the  control  of  state  func¬ 
tions  ensues  between  different  forces  in  the  coun¬ 
try,  and  in  the  end  all  of  them  lose. 

Nonetheless,  the  decay  that  leads  to  state  col¬ 
lapse  also  simultaneously  occurs  from  outside  in. 
As  noted,  it  is  in  Africa  that,  with  devastating 
impact,  the  largest  concentration  of  collapsed 
states  is  found.  The  external  influence  began  with 
slavery,  which  caused  millions  of  casualties,  dehu¬ 
manization,  and  massive  depopulation,  followed 
by  colonialism,  which  resulted  in  artificial  borders 
and  economies  that  currently  depend  on  rent  from 
commodity  and  foreign  aid.  Finally,  the  conflicts 
in  many  collapsed  states  are  fueled  by  the  latest 
weaponry,  and  the  warlords  receive  support  and 
legitimization,  which  further  delays  any  hope  for 
ending  the  political  violence. 

Possible  Remedies  for  State  Collapse 

The  revival  of  a  collapsed  state  is  like  putting 
together  a  broken  pot.  Unfortunately,  systematic 
analyses  and  comparative  studies  in  the  field  of 
peaceful  conflict  settlement  are  largely  underre¬ 
searched.  Nevertheless,  the  reconstruction  of  a 
collapsed  state  initially  requires  reliable  conflict 
mediation  and  facilitation.  The  term  mediation 
entails  activities  requiring  a  dynamic  influence  by 
reputable  individuals  and  groups  as  go-betweens. 
Facilitation  involves  activities  such  as  inviting  the 
warring  parties  to  talk  to  each  other  and  provid¬ 
ing  conditions  for  talks.  The  focus  of  facilitation 
and  mediation  should  be  on  long-term  prevention 
of  violent  conflict  rather  than  on  producing  short¬ 
term,  “bandage”  solutions.  Therefore,  it  is  imper¬ 
ative  that  efforts  be  made  to  enhance  the  capacity 
of  the  middle  and  local  levels  rather  than  concen¬ 
trate  on  the  warlords.  Additionally,  the  warlords 
must  at  the  same  time  be  disarmed,  isolated,  and 
eventually  punished  for  their  crimes  against 
humanity. 

In  addition  to  the  facilitation  and  mediation 
efforts,  there  is  also  a  need  for  genuine  efforts  to 
rebuild  state  institutions.  While  there  has  been  suc¬ 
cess  in  the  rebuilding  of  many  collapsed  states, 
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mainly  in  Eastern  Europe,  others  remain  in  a  col¬ 
lapsed  condition,  especially  in  Africa.  Many  states 
in  Africa  continue  to  face  great  challenges  in  the 
reconstruction  of  their  state  institutions  and  estab¬ 
lishment  of  rule  of  law  in  their  territories.  Collapsed 
states,  as  argued,  require  strong  support  to  rebuild 
legitimate  institutions,  which  in  turn  will  bring  law 
and  order,  create  trust  among  the  combating 
groups,  and  revive  the  destroyed  economic  infra¬ 
structure. 

In  the  past  few  years,  international  efforts  have 
been  mainly  focused  on  achieving  cessation  of  hos¬ 
tilities  and  signing  of  peace  agreements  while 
neglecting  other,  more  pressing  factors,  especially 
creating  favorable  political  and  economic  condi¬ 
tions.  As  a  result,  the  conflicts,  especially  in  sub- 
Saharan  Africa,  tend  to  be  longer  and  deadlier, 
mainly  due  to  lack  of  international  finance  and 
capital  to  curb  humanitarian  disasters  and,  more 
important,  help  in  reconstruction  after  peace 
settlements.  The  donor  countries  continue  their 
abandonment,  and  the  critical  help  needed  to 
reconstruct  the  political  and  economic  institutions 
is  usually  not  provided.  Finally,  collapsed  states 
can  only  backtrack  along  the  path  from  collapsed 
to  failed  to  weak  status,  which  is  only  possible  if 
sufficient  security  is  restored  to  rebuild  the  institu¬ 
tions  and  strengthen  the  legitimacy  of  the  revived 
state.  This  backtracking  can  only  be  realized 
through  a  strong  and  genuine  intervention  force, 
as  in  the  case  of  Lebanon  with  Syrian  security  and 
Sierra  Leone  with  British  intervention.  Finally,  the 
citizens  of  collapsed  states  must  be  educated  and 
empowered  to  demand  their  rights  and  become 
active  in  rebuilding  their  broken  state. 
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State  Failure 


State  failure  is  composed  of  two  elements:  the  fail¬ 
ure  and  the  state.  This  entry  first  examines  the 
diverse  notions  of  failure,  including  their  defini¬ 
tion,  their  components  and  processes,  some  exam¬ 
ples,  and  rehabilitation.  It  then  addresses  divergent 
conceptions  of  the  state,  with  particular  attention 
to  ideas  about  the  non-Western  state  that  impinge 
on  an  understanding  of  failure  and  associated  pro¬ 
cesses.  State  failure  is  the  basis  of  a  large  array  of 
concepts,  starting  with  a  rare  and  narrow  concern, 
already  containing  certain  ambiguities  of  defini¬ 
tion,  measurement,  and  remedy,  and  expanding 
into  nearly  the  entire  field  of  comparative  politics, 
democracy,  development,  and  interstate  policy. 

Failure 

Failure  refers  to  several  overlapping  concepts, 
unfortunately  often  confused  with  each  other:  col¬ 
lapse,  failed,  failing,  fragile,  and  weak. 

The  most  narrowly  defined  category  is  that  of 
collapsed  states.  Collapse,  the  most  extreme  form 
of  damage,  occurs  when  states  can  no  longer  per¬ 
form  their  basic  functions,  defined  as  enforcing 
internal  and  external  security,  extracting  and  allo¬ 
cating  resources,  and  providing  social  services,  for 
some  time  or  where  the  structure,  authority  (legiti¬ 
macy),  law,  and  political  order  have  fallen  apart. 
There  are  few  unambiguous  modern  cases  of  the 
disappearance  of  a  state  (Ghana,  1979-1981; 
Chad,  1979-1985;  Lebanon,  1983-1987;  Haiti, 
1986-1991;  Sierra  Leone  in  the  mid-1990s; 
Afghanistan,  1992-2002;  and  Liberia,  1989- 
1997 — although  none  have  done  so  as  lengthily 
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and  conclusively  as  Somalia  since  1990).  State  col¬ 
lapse  can  occur  because  of  but  also  without  civil 
war,  although  the  collapse  of  law,  order,  and 
legitimate  authority  does  create  a  security  vacuum 
that  opens  society  to  lawlessness.  I.  William 
Zartman  (1995)  has  identified  some  signs  of  the 
process  of  collapse:  conflict  at  the  political  center, 
decision  failure,  a  narrow  central  clique,  defensive 
policies,  and  privatization  of  state  agents.  But 
there  is  no  notion  of  any  sequence,  of  necessary 
and  sufficient  components,  or  of  the  nature  of  the 
process.  These  conceptual  weaknesses  are  present 
in  all  other  types  of  state  failure  analysis. 

Equally  uncertain  is  the  process  of  rehabilita¬ 
tion  after  collapse.  Unsolved  issues  include  the 
need  to  rehabilitate  the  state  versus  a  state,  the 
priorities  of  the  process,  the  relation  between  secu¬ 
rity  and  institutionalization,  the  role  of  welfare, 
the  role  of  the  international  community,  the  rela¬ 
tion  between  state  building  and  nation  building 
(see  below,  under  “States”),  and  the  need  for  a 
Man  on  a  White  Horse  or  a  strongman.  Again, 
these  conundrums  will  appear  with  regard  to  other 
types  of  state  failure,  although,  there,  they  are 
more  specifically  addressed. 

Failed  states  constitute  the  next  larger  circle.  The 
immediate  problem  with  the  concept  is  that  it  is  not 
clear  whether  it  refers  to  states  that  have  failed 
as  states — hence  being  closer  to  the  collapsed 
category — or  to  states  that  have  failed  in  one  or 
more  of  their  functions  while  still  holding  onto  their 
state  status.  In  the  latter  case,  how  many  functions 
must  have  failed  before  the  state  itself  can  be  termed 
a  failure?  Furthermore,  some  cases  of  state  failure 
often  cited,  such  as  Yugoslavia  or  old  Pakistan, 
refer  to  constituted  units  (states)  discarding  an  over¬ 
arching  federal  framework  (state)  to  take  over  geo¬ 
graphic  pieces  of  the  formerly  larger  unit  and  are 
scarcely  cases  where  some  institutional  framework 
was  ever  completely  absent.  Another  problem  with 
the  concept  of  failure  is  its  geographic  reference: 
Frequently,  states  continue  to  be  effective,  often 
strongly  effective,  in  a  given  part  of  their  territory, 
leaving  the  rest  beyond  their  control  to  a  more  or 
less  effective  opposition;  thus,  they  fail  in  their 
duties  as  rulers  of  their  assigned  territory  but  suc¬ 
ceed  in  ruling  a  part  of  it  while  leaving  the  rest  as  a 
proto-  or  counterstate.  Examples  include  Colombia 
and  the  FARC  (Revolutionary  Armed  Forces  of 
Colombia),  Sudan  and  the  SPFA/M  (Sudan  Peoples 


Fiberation  Army/Movement),  and  Sri  Fanka  and 
the  FTTE  (Fiberation  Tigers  of  Tamil  Eelam). 

These  problems  do  not  invalidate  the  concept 
but  rather  indicate  its  complexity  and  the  difficulty 
of  applying  it  to  a  multifaceted  reality.  In  most 
works  on  failed  states,  it  is  the  components  of  fail¬ 
ure  that  drive  the  analysis,  so  that  analysis  can 
pinpoint  at  what  the  state  has  failed,  if  not  at 
“everything.”  These  functions  can  be  classified  in 
many  ways,  but  according  to  Stuart  Eizenstat  and 
colleagues  (2004),  an  effective  categorization  is  in 
terms  of  security,  capacity,  and  legitimacy.  The 
striking  fact  is  that  there  is  little  overlap  between 
the  three  types  of  failure:  conflicted  states,  neglect¬ 
ing  states,  and  illegitimate  states.  One  measure 
gave  two  overlaps  on  all  three  (Congo-Brazzaville 
and  Afghanistan),  with  25%  appearing  on  two 
columns;  another  gave  none,  with  10%  on  the  list¬ 
ings  appearing  in  two  columns.  Fistings  vary  sub¬ 
stantially  over  a  span  of  several  years.  Immunization 
rates  were  used  as  indicators  (proxies)  for  capac¬ 
ity,  battle  deaths  for  security,  and  a  composite 
measure  of  “political  freedom”  for  legitimacy,  but 
other  measures  can  give  very  different  results. 

When  analysis  turns  to  individual  cases,  not 
only  the  component  concepts  and  indicators  but 
also  the  paths  to  failure  can  be  seen  to  vary  enor¬ 
mously,  indicating  that  the  causes  of  failure  only 
coincide  at  a  high  level  of  abstraction  that  is  often 
circular.  Rapacious  concentration  of  power,  and 
hence  resources,  in  a  small,  often  ethnically  and/or 
geographically  concentrated  clique,  to  the  neglect 
of  the  general  population — the  “privatization”  of 
states — is  a  frequent  characteristic.  Thus,  for 
example,  behind  the  capacity  gap  lies  a  political 
rather  than  a  resource  failure;  states  do  not  fail 
because  they  are  poor  but  are  poor  because  they 
fail,  although  poverty  admittedly  makes  good  gov¬ 
ernance  more  difficult. 

Failing  states  is  the  next  largest  circle,  both 
broader  and  softer.  It  refers  to  states  that  show 
signs  of  doing  badly  in  some  crucial  sector  or  func¬ 
tion  but  that  have  not  necessarily  reached  an  end 
point  of  failure  in  that  function.  Again,  definition 
and  measurement  problems  arise:  It  is  not  clear 
how  failing  a  state  has  to  be  on  how  many  sectors 
to  qualify  for  the  general  category,  particularly 
since  most  states  can  be  said  to  show  weakness  in 
many  policy  areas.  Most  treatments  have  handled 
this  problem  by  avoiding  it  with  a  sliding  scale  and 
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a  complex  set  of  indicators,  taking  the  analysis 
closer  to  bumpy  reality  and  further  from  implica¬ 
tions  of  clear-cut  categorization.  An  understanding 
of  the  paths  of  sectoral  failures  and  then  their  rela¬ 
tion  to  each  other  both  causally  and  in  creating  a 
whole  picture  of  “failing-ness”  is  still  underdevel¬ 
oped,  although  some  work,  such  as  that  of  Robert 
Rotberg  (2003),  has  been  undertaken  to  disregard 
the  causational  problem  and  address  sectorally  the 
rehabilitation  problem. 

The  name  of  the  condition  measured  also  var¬ 
ied.  One  index,  that  of  the  Fund  for  Peace,  mea¬ 
sures  instability  using  12  indicators:  demographic 
pressures,  refugees  and  internally  displaced  per¬ 
sons,  group  grievance,  population  flight,  income 
inequality,  economy,  legitimacy,  welf are/ services, 
human  rights,  security  apparatus,  elite  factions, 
and  foreign  interventions.  Another  index,  by  Susan 
Rice  and  Stewart  Patrick  (2008),  seeks  indications 
of  weakness  with  a  more  complex  set  (only  par¬ 
tially  overlapping  with  the  previous  set)  of  mea¬ 
sures,  divided  into  four  baskets  (as  opposed  to  the 
previously  mentioned  three  gaps),  with  five  subin¬ 
dicators  in  each:  (1)  economic  (per  capita  gross 
national  income,  GDP  growth,  income  inequality, 
inflation,  and  regulatory  quality),  (2)  political 
(state  effectiveness,  rule  of  law,  accountability, 
corruption,  and  freedom),  (3)  security  related  (con¬ 
flict,  human  rights,  conflict  territory,  coups,  stabil¬ 
ity,  and  absence  of  violence),  and  (4)  welfare 
related  (child  mortality,  water  and  sanitation, 
undernourishment,  primary  schooling,  and  life 
expectancy).  Only  15  of  the  top  20  states  were  on 
both  lists,  in  very  different  rankings.  The  instability 
study  puts  a  causal  emphasis  on  poverty,  particu¬ 
larly  in  the  more  failing  states,  although  Paul 
Collier  et  al.  (2003)  found  structural  weakness  to 
correlate  only  distantly  and  indirectly  to  cause 
political  instability.  The  study  also  indicates  that 
remedies  vary  according  to  levels  of  instability: 
Failed  states  need  greater  attention  to  security  first, 
whereas  states  of  middle-range  failures  need  to  give 
targeted  attention  to  specific  sectoral  weaknesses. 

It  is  not  clear  how  much  significance  lies  in  the 
difference  between  instability  and  weakness. 
However,  another  study  (for  the  Central  Intelligence 
Agency),  which  started  out  as  a  task  force  on  state 
failure  and  used  rigorous  statistical  analysis,  was 
led  by  its  data  and  correlations  to  focus  instead  on 
political  instability.  The  report  found  failure  and 


collapse  to  be  new  labels  for  a  kind  of  severe 
political  crisis  and  reduced  their  cause  to  three 
basic  variables:  child  mortality,  trade  restrictions, 
and  lack  of  democracy — variables  so  distant  from 
the  effect  as  to  suggest  a  complex  causal  chain. 

Other  concepts  such  as  fragile ,  vulnerable,  and 
weak,  even  broader  than  failing,  constitute  the 
focus  of  the  broadest  research,  appearing  in  the  end 
as  a  variable  characteristic  of  all  140  to  160  devel¬ 
oping  or  non-Western  countries.  Studies  focus  on 
the  economic  functions  of  the  state,  with  more  than 
100  indicators;  on  weakness  in  authority,  capacity, 
and  legitimacy,  using  74  indicators;  on  fragility, 
measured  through  effectiveness  and  legitimacy  in 
the  economic,  social,  political,  and  security  spheres, 
using  33  indicators,  as  does  the  USAID  Fragile 
States  Strategy  of  2006;  or,  as  in  Paul  Collier  and 
colleagues’  2003  study,  simply  on  conflict  prone¬ 
ness,  among  others.  Different  indicators  of  differ¬ 
ent  dimensions  lead  to  strongly  different  diagnoses. 
However,  even  though  the  terrain  is  uneven  and 
slippery,  they  do  permit  a  debate  over  causes,  rela¬ 
tions  among  causes,  and  remedies,  which  is  useful 
both  for  understanding  state  failure  and  for  policy 
making.  On  the  other  hand,  the  analysis  has  moved 
away  from  a  single  identifiable  concept  that  can  be 
explained  and  also  away  from  consensual  under¬ 
standings  of  even  a  broad  condition  that  can  form 
the  basis  of  comparative  analysis. 

As  a  result,  rehabilitation — measures  to  over¬ 
come  or,  in  a  preventive  manner,  to  avoid  the  sub¬ 
ject  condition,  that  is,  state  failure — has  also  been 
the  subject,  or  at  least  the  end  purpose,  of  some 
attention.  Most  work  on  state  failure,  in  all  its 
forms,  ends  with  prescriptions  for  rehabilitation 
and  specifically  for  third-state — usually  Western — 
interventions.  The  compass,  which  once  headed 
toward  institutionalization,  now  seems  to  be  turn¬ 
ing  to  antipoverty  measures,  reviving  an  old  leitmo¬ 
tif  of  the  World  Bank  and  also  a  constant  interest  of 
the  Organisation  for  Economic  Co-operation  and 
Development,  the  United  States  Agency  for 
International  Development,  and  the  UK  Department 
for  International  Development.  A  third  compass 
point,  which  continually  reappears,  is  security,  often 
seen  as  the  precondition  of  poverty  reduction  and 
institutionalization.  In  the  end,  these  three  foci 
correspond,  of  course,  to  the  three  disciplinary 
baskets  of  economy,  polity,  and  security,  totally 
interdependent  but  fighting  for  precedence.  When 
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states  fail,  everything  must  be  corrected  at  once, 
and  waiting  for  one  sector  to  be  completed  before 
starting  another  means  that  failure  will  not  be 
overcome.  That  said,  aid  missions  and  state  build¬ 
ers  will  agree  that  security  comes  first,  even  though 
it  may  well  end  last. 

States 

Beyond  this  analytical  and  remedial  disagreement 
lies  uncertainty  about  the  nature  of  the  state.  The 
state,  as  we  know,  is  “a  human  community  that 
(successfully)  claims  the  monopoly  of  the  legitimate 
use  of  physical  force  within  a  given  territory,”  in 
Max  Weber’s  restrictive  terms.  A  similar  definition 
with  force  only  subsumed  terms  a  state  “the 
authoritative  political  institution  that  is  sovereign 
over  a  recognized  territory”  (Zartman,  1995,  p.  5). 
Weber’s  name  is  also  associated  with  the  idea  of  a 
Weberian  state,  distinguished  by  the  bureaucratic, 
institutional  nature  of  its  routinized,  performance- 
based  legitimacy.  By  this  definitional  complex, 
therefore,  the  state’s  functions  include  internal  and 
external  security,  internal  and  international  legiti¬ 
macy,  territorial  control,  institutionalization,  and 
by  some  extension  services,  possibly  welfare- 
oriented  programs.  To  some,  a  state  is  a  state,  so 
defined;  the  definition  is  highly  abstract  and  can  be 
used  to  judge  performance  or  other  characteristics 
such  as  failure  or  collapse.  It  is  not  a  phenomenon 
of  all  times  and  places;  although  the  state  as  defined 
took  some  centuries  to  emerge  in  Europe  and  then 
was  transferred  to  the  Western  Hemisphere,  it  has 
been  slower  to  take  root  across  other  parts  of  the 
world,  despite  earlier  isolated  instances.  Probably 
the  most  widespread  alternative  to  the  state,  as 
defined,  is  the  nonterritorial  polity,  which  has  exer¬ 
cised  many  of  the  functions  of  the  territorial  state 
while  also  extending  sovereignty  over  a  people, 
wherever  they  may  be  at  the  moment.  Notions  such 
as  the  Muslim  umma  ([mother]  community), 
expressed  in  the  diverse  ideas  of  Muammar  al- 
Qadhafi  of  Libya  and  Osama  bin  Laden  of  A1 
Qaeda,  are  contemporary  examples,  constrained  in 
practice  by  the  existence  of  territorial  states. 

To  some,  however,  it  is  inappropriate  to  talk  of 
a  state  in  some  parts  of  the  contemporary  world, 
notably  in  Africa.  There,  states  are  not  in  a  condi¬ 
tion  of  shortfall  with  regard  to  a  claimed  or  aspired 
condition  (statehood)  but  are  simply  not  there 


because  they  are  not  native  to  the  land  and  its 
people.  Hence,  attempts  to  analyze  the  causes  of  a 
shortfall  would  be  pointless,  although  it  is  also 
sometimes  suggested  that  new  entities  should  be 
established  (by  someone)  that  would  be  more  nat¬ 
ural  to  the  terrain.  To  these  arguments,  the  notion 
of  state  failure  or  failings  provides  a  useful  alterna¬ 
tive,  indicating  how  states  are  doing  on  various 
measures  and  functions  on  the  ladder  to  effective 
statehood. 

This  evaluation  is  based  on  the  working  assump¬ 
tion  that  a  broad  category  such  as  state  is  applicable, 
even  if  not  fully  attained,  around  the  contemporary 
world.  While  historical,  geographical,  and  cultural 
peculiarities  affect  the  ability  of  the  polity  to  climb 
the  ladder,  it  is  considered  a  ladder  that  applies  to 
functions  that  are  universally  required  and  desired. 

One  element  not  explicitly  covered  in  either  of 
the  definitions  of  state  (although  perhaps  implicit  in 
Weber’s  reference  to  a  community)  is  the  notion  of 
a  nation ,  the  identity  community  contained  within 
the  state.  Since  the  end  of  the  19th  century,  custom¬ 
arily  associated  with  the  Lrench  Revolution,  state 
and  nation  have  been  assumed  to  be  coincident, 
although  not  synonymous,  as  the  nation-state.  The 
term  state  has  two  meanings,  both  relevant  to  state 
failure.  One  is  that  sovereignty  lies  in  the  identify¬ 
ing  population,  rather  than  in  an  elevated  figure 
such  as  a  monarch  (or  a  deity).  The  other  is  that  the 
population  of  the  state  coheres  in  a  shared  identity. 
However,  again  there  are  ambiguities  in  reality. 
The  first  meaning  is  assumed  to  be  symbolically 
true,  but  in  practice,  the  ruler  can  often  take  deci¬ 
sions  on  his  or  her  own  without  any  reference  to 
popular  will.  The  matter  is  usually  covered  in 
accountability  measures  in  many  failing-state  indi¬ 
ces.  This  element  edges  state  failure  or  nonfailure 
toward  considerations  of  democracy,  a  concept 
that  is  not  directly  associated  with  most  aspects  of 
failure  but  that  slips  in  measures  of  legitimacy. 
Confounding  the  two  notions  opens  up  a  great 
debate.  The  second  meaning  also  carries  a  signifi¬ 
cant  assumption:  Many  developing  states  and  even 
some  developed  states  can  be  far  from  failure,  even 
though  there  is  no  effective  sense  of  national  iden¬ 
tity  coherence.  Even  before  the  notions  of  failure 
became  current,  new  states  looked  to  the  needs  of 
national  coherence  and  sought  to  create  a  nation  to 
fit  the  state,  or  state-nation.  The  imposition  of  a 
new  identity  drew  critical  reactions  from  traditional 
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or  ethnic  nations  and  in  turn  contributed  to  state 
failure.  “Nation-ness”  is  not  covered  in  most 
evaluations  of  failure,  although  it  is  a  critical 
underlying  variable  in  matters  such  as  the  mea¬ 
surement  of  internal  conflict. 

Whatever  the  evaluation  of  a  particular  state’s 
position  on  the  ladder,  state  building  becomes  the 
prescription  for  improvement.  The  term  is  not  to  be 
confused  with  nation  building,  which  is  often  mis¬ 
takenly  used  as  a  synonym;  nation  building  refers 
to  the  policy  of  building  the  identity  unit  and  can 
only  be  conducted  indigenously  within  the  state, 
whereas  state  building  can  be  contributed  to  by 
external  assistance,  at  least  to  a  limited  extent.  In  a 
specific  sense,  state  building  refers  to  institutional¬ 
ization  and  good  governance,  terms  particularly 
important  to  international  financial  institutions 
and  their  policies  toward  low-income  countries 
under  stress  or  fragile  and  conflict-affected  coun¬ 
tries.  In  policy  terms,  whether  state  building  is  an 
internal  or  external  exercise  is  an  important  aspect 
of  the  debate,  basic  to  decisions  on  specific  policies 
and  measures.  The  answer  that  it  is  both  does  not 
solve  the  kernel  of  the  debate  over  proportions. 
Too  much  international  attention  removes  the  ele¬ 
ment  of  ownership  and  responsibility  from  the 
failing  state,  an  element  that  is  at  the  core  of  the 
failure  question:  A  state  must  overcome  failure 
itself;  by  definition,  it  cannot  be  “unfailed”  from 
the  outside.  However,  states  in  failure,  like  those  in 
conflict,  need  help;  their  very  condition  removes 
elements  that  are  needed  for  its  correction.  State 
building  is  a  topic  of  particular  focus  specific  to  the 
post-World  War  II  era  but  especially  to  the  post- 
Cold  War  world.  During  both  periods,  the  domi¬ 
nant  focus  was  on  the  limitation  of  overpowering, 
totalitarian  states,  whereas  the  contemporary  focus 
is  on  building  state  capacity;  inclusion  of  the 
notion  of  good  governance,  which  contains  a 
notion  of  restricted  effectiveness,  is  a  corrective  to 
the  danger  of  overwhelming  state  power. 

Finally,  one  of  the  most  important  questions 
with  regard  to  state  failure,  which  is  still  unre¬ 
solved,  is  the  relation  between  state  failure  and 
civil  society  failure.  While  it  has  been  suggested 
that  state  collapse  is  made  possible  by  the  con¬ 
comitant  collapse  of  civil  society,  which  is  unable 
to  promote  candidates  to  fill  the  gap,  other  work 
indicates  that  civil  society  expands  to  fill  the  func¬ 
tional  gap.  Like  other  questions  with  regard  to  the 


topic,  the  relationship  is  fraught  with  conceptual 
ambiguities.  A  useful  definition,  but  by  no  means 
the  only  one,  considers  civil  society  to  cover  civil 
organizations  independent  of  government.  The 
functions  civil  society  can  fill  the  range  between 
none  and  all,  either  extreme  constituting  a  contribu¬ 
tion  to  state  failure.  The  more  functions  civil  society 
can  perform,  the  less  the  state  has  to  perform, 
beyond  some  (undetermined)  degree  of  regulation 
of  civil  society  performance.  Yet  some  functions, 
such  as  security  (policing  and  military  defense)  and 
national  adjudication,  are  uniquely  state  functions 
and  relegation  to  civil  society  (as  to  external  con¬ 
tractors)  weakens  the  state  rather  than  strengthen¬ 
ing  its  functional  performance. 

The  Challenge  of  State  Failure 

In  sum,  state  failure  is  a  serious  problem  in  con¬ 
temporary  politics.  The  topic  of  state  failure  faces 
a  double  challenge.  On  the  one  hand,  as  a  problem, 
it  calls  for  correction  of  conditions  of  state  weak¬ 
ness  to  varying  degrees.  On  the  other,  the  topic 
itself  and  its  component  concepts  are  all  ambigu¬ 
ous  and  open-ended.  The  first  runs  up  against  the 
limited  capabilities  of  external  support  to  help  deal 
with  the  situation  and  to  do  so  without  the  further 
weakening  of  effective  foreign  involvement.  The 
second  calls  for  more  detailed  conceptual  measure¬ 
ments  and  indicators,  involving  further  concepts 
related  to  more  specific  aspects  and  unrealistic 
precision.  Yet  both  challenges  are  inherent  in  the 
paths  to  greater  effectiveness  and  understanding. 
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State  formation  refers  to  the  processes  leading  to 
the  centralization  of  political  power  within  a  well- 
defined  territory.  These  processes  are  not  histori¬ 
cally  and  geographically  uniform,  and  there  is  no 
single  explanation  for  them.  There  have  been  a 
variety  of  paths  to  statehood.  This  entry  deals 
with  the  most  general  question  of  state  formation 
concerning  the  historical  origin  of  states  as  such  as 
well  as  with  more  specific  questions  connected  to 
the  development  of  the  modern  states  system  and 
to  state  formation  in  the  contemporary  world. 
State  formation  in  early  agrarian  societies,  in  the 
early  capitalist  West,  and  in  the  era  of  decoloniza¬ 
tion  and  imperial  dissolution  display  different 
processes  and  institutional  features  and  will  also 
be  analyzed. 

There  are  at  least  three  major  theoretical  per¬ 
spectives  on  the  state  and  state  formation.  The 
pluralist  view  describes  the  state  as  a  set  of  various 
interest  groups,  an  arena  for  contending  actors. 
The  state  developed,  accordingly,  through  bar¬ 
gaining  processes  between  interest  groups.  The 


Marxist  view  is  instrumentalist,  with  the  state  as  a 
committee  for  managing  the  common  interests  of 
the  ruling  class.  Here,  the  state  was  established  as 
an  instrument  of  domination.  The  statist  perspec¬ 
tive  implies  that  states  have  emerged  as  distinctive 
structures  with  institutions  and  modes  of  opera¬ 
tion  that  cannot  be  derived  from  interest  group 
maneuvering  or  class  structure.  The  latter  perspec¬ 
tive  is  particularly  associated  with  Max  Weber  and 
even  with  neo-Marxist  critiques  of  orthodox 
Marxism.  In  empirical  accounts  of  state  forma¬ 
tion,  the  major  theories  are  rarely  employed  one- 
sidedly  or  spelled  out  explicitly,  but  they  are  often 
combined  into  a  more  compound  analysis. 

The  Origin  of  the  State 

State  formation  requires  cooperation  and  rule 
within  a  unit  far  greater  than  a  kinship  group  or 
a  community  based  on  face-to-face  recognition.  A 
possible  explanation  for  cooperation  on  a  grand 
scale  is  population  pressure  and  scarce  resources, 
particularly  the  shortage  of  soil.  Increasing  popu¬ 
lation  density  and  scarce  resources  stimulate 
more  intensive  food  production.  Thereby,  social 
complexity  increases  because  agriculture  requires 
division  of  labor  and  specialization.  Early  state 
formation  emerged  after  the  establishment  of 
agricultural  societies. 

The  basic  idea  here  is  that  societies  organized  as 
states  will  be  more  efficient  externally  and  inter¬ 
nally.  Externally,  they  are  stronger,  better  orga¬ 
nized,  and  more  competitive  in  relation  to  less 
well-organized  societies.  Internally,  big  units  with 
intensive  production  may  serve  the  population 
with  services  and  conditions  of  life  that  are 
unavailable  for  small  groups  or  individually. 
Conflict  and  common  defense  abroad  may  also 
support  social  integration  at  home.  The  inhabit¬ 
ants  of  a  state  are  in  the  same  boat,  sharing  threats 
from  the  outside. 

State  formation  has  not  been  a  linear  historical 
process.  States  and  empires  have  disintegrated,  col¬ 
lapsed,  and  disappeared,  while  others  have  thrived 
and  enlarged  over  long  periods.  If  the  authorities 
are  unable  to  supply  the  population  with  a  mini¬ 
mum  of  services  and  security,  legitimacy  and  sup¬ 
port  will  erode.  This  has  happened  in  failed  states 
as  well  as  in  many  former  empires  and  colonial 
territories. 
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States  and  empires  may  also  disintegrate  due  to 
a  principal-agent  problem.  The  central  authority 
delegates  power  and  responsibility  to  the  local  and 
regional  representatives.  The  representatives  in  the 
periphery  may  then  exploit  the  derived  resources  in 
a  quest  for  autonomy.  Imperial  erosion  and  state 
failure  have  various  elements  of  these  mechanisms. 

Population  pressure  as  a  state-building  factor  is 
accompanied  by  division  of  labor  and  social 
inequality.  Agriculture  and  other  modes  of  intensi¬ 
fied  food  production  allow  for  the  emergence  of  a 
rent-collecting  upper  class.  In  a  state,  there  is  a 
vertical  division  of  labor  into  social  classes  with 
differentiated  access  to  land  and  other  productive 
assets.  In  contrast  to  the  roaming  gangs  and  ban¬ 
dits  in  an  anarchy,  the  stationary  rulers  in  a  territo¬ 
rial  state  will  benefit  from  more  limited  surplus 
extraction  whereby  the  productive  capacity  of  the 
people — and  thereby  the  tax  basis — is  upheld  in  the 
long  run.  Here  is  a  source  of  support  from  below, 
whether  or  not  government  is  representative. 

From  another  perspective,  the  increased  pro¬ 
ductivity  in  agricultural  societies  could  hardly  be 
handled  by  relatively  egalitarian  tribal  communi¬ 
ties.  With  increasing  productivity,  the  available 
surplus  was  distributed  on  the  basis  of  power. 
State  organization  was  an  organization  of  com¬ 
mand  whereby  inequality  and  legitimacy  could  be 
combined  in  new  ways. 

More  specific  accounts  of  the  processes  of  state 
formation  have  introduced  religious,  technical, 
and  geographical  factors.  Religion  is  a  mode  for 
the  legitimation  of  social  stratification  and  com¬ 
mand,  conducive  to  state  formation  at  the  early 
stages.  The  evolution  of  script  made  possible  the 
centralization  of  information  and  the  spread  of 
precise,  long-distance  messages.  Variation  in  geog¬ 
raphy,  soil,  and  climate,  certainly,  had  crucial 
consequences  for  the  ways  and  means  of  state  for¬ 
mation.  Island  states  and  states  surrounded  by 
high  mountains  were  relatively  easy  to  defend, 
compared  with  states  with  borders  in  open,  conti¬ 
nental  areas. 

One  specific  hypothesis  of  early  state  formation 
is  the  irrigation  hypothesis ,  which  states  that  the 
organization  of  states  on  a  large  scale  started  with 
big  hydraulic  constructions  in  the  drylands  of  the 
Middle  East,  Asia,  and  Meso-America.  The  theory 
of  a  hydraulic  society,  made  famous  by  Karl  Witt- 
fogel,  has  been  opposed  by  references  to  numerous 


early  state  formations  prior  to  large-scale  irriga¬ 
tion,  to  societies  where  irrigation  was  organized 
locally,  and  to  state  formation  in  areas  of  the 
world  where  complex  hydraulic  systems  were  out 
of  the  question. 

Numerous  states  have  been  born  by  conquest. 
When  agricultural  communities  are  conquered  by 
nomadic  tribes  or  nomadic  tribes  establish  control 
over  large  territories,  state  organization  is  the  sta¬ 
bilization  of  command.  The  productive  capacity  of 
an  agrarian  society  is  fused  with  the  political  inter¬ 
ests  of  nomadic  conquerors.  Generally,  however, 
the  expansive  Eurasian  nomadism  is  younger  than 
the  first  high  cultures  with  organized  state  power. 
States  have  been  established  and  enlarged  by  con¬ 
quest,  but  this  is  not  the  prime  mover  in  historical 
state  formation.  Morton  Fried  differentiated 
between  the  endogenous  factors  that  were  preva¬ 
lent  in  primary  state  formation  in  the  Middle  East, 
Meso-America,  and  the  Far  East  and  the  exoge¬ 
nous  factors — such  as  conquest  and  pressure  from 
the  outside — in  secondary  state  formation. 

Externally,  war  supported  the  internal  mecha¬ 
nisms  in  early  state  formation.  In  the  rivalry 
between  states,  the  more  efficient  and  powerful 
polities  survived  either  by  their  own  mobilization 
of  force  or  by  protection  in  powerful  alliances. 

The  Modern  States  System 

The  modern  states  system  arose  in  Europe  with  the 
development  of  international  law.  The  turning 
point  in  legal  terms  was  the  Treaty  of  Westphalia 
in  1648,  after  30  years  of  unruly  war  between  prin¬ 
cipalities  and  religious  authorities.  Roman  Catholic 
emperors  fought  against  Protestant  kings  and 
Roman  Catholic  kings  against  Protestant  princes. 
The  treaty  stated  that  the  individual  principalities 
could  choose  their  religion  freely,  independent 
from  imperial  or  papal  control — cuius  regio  eius 
religio.  This  was  a  long  step  toward  the  idea  of 
autonomous  territorial  states,  sovereign  and  for¬ 
mally  equal.  The  idea  implied  that  the  system  of 
governance,  public  religion,  and  economic  princi¬ 
ples  could  be  shaped  without  external  interference. 
This  idea  of  state  sovereignty  was  elaborated  in  the 
theory  of  international  law  by  1 8th-century  writers 
like  Christian  Wolf  and  Emmerich  de  Vattel;  it  has 
been  confirmed  in  a  series  of  international  treaties 
right  up  to  the  still  valid  United  Nations  (UN) 
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convention  of  1945.  The  long-term  effect  of  the 
Treaty  of  Westphalia  was  the  contemporary  delin¬ 
eation  of  the  globe  into  constitutionally  sovereign 
states. 

The  legal  formalities  of  European  state  forma¬ 
tion  was  substantiated  by  technological  and 
socioeconomic  transformations.  The  building  and 
centralization  of  the  states  was  served  by  techno¬ 
logical  innovation.  The  development  of  firearms 
and  cannons  in  the  early-modern  epoch  made 
local  fortifications  inefficient.  Political  centraliza¬ 
tion  was  a  prerequisite  for  the  economic  extrac¬ 
tion  necessary  for  the  finance  of  standing  armies. 

The  modern  state  has  been  characterized  by 
Max  Weber  as  a  monopoly  of  legitimate  force  and 
by  Joseph  Schumpeter  as  a  monopoly  of  taxation. 
Modern  state  building  may  thereby  be  described  as 
increasing  territorial  control  by  force — army, 
police,  and  bureaucracy — on  the  one  hand  and 
effective  taxation  on  the  other.  New  resources  are 
then  controlled  by  the  central  authority.  A  special¬ 
ized  state  bureaucracy  is  a  prerequisite  for  this 
kind  of  control,  and  it  is  in  turn  reinforced  by  it. 

Weber’s  idea  of  the  modern  state  combines  the 
monopoly  of  physical  violence  with  legitimacy  and 
territoriality.  Without  these  characteristics,  anar¬ 
chy  would  ensue.  The  fundamental  ambition  of  the 
state  authority  is  to  legitimate  the  structure  of 
domination.  In  this  context,  Weber  developed  the 
categories  of  traditional,  charismatic ,  and  legal 
types  of  legitimation.  In  specific  instances  of  state 
formation  and  state  power,  there  is  a  variable  mix 
of  these  pure  types. 

Warfare  was  revolutionized  with  the  develop¬ 
ment  of  standing  armies,  field  weaponry,  and 
heavy  artillery.  The  central  state  power  was 
strengthened,  while  the  earlier  feudal  warrior  aris¬ 
tocracy  lost  their  military  advantages.  Armored 
cavalry  became  obsolete,  local  fortifications  were 
no  longer  defensible,  and  hired  troops  and  enlisted 
armies  were  beyond  the  financial  means  of  the 
local  nobility.  The  war  technology  undermined  the 
position  of  the  feudal  aristocracy  to  the  advantage 
of  monarchic  centralization. 

Financial  resources  are  the  nerves  of  govern¬ 
ment,  as  Jean  Bodin  noted  toward  the  end  of  the 
16th  century.  Taxation  upheld  the  preindustrial 
states,  but  public  financial  means  were  chronically 
in  short  supply  with  increasing  expenditures.  Low 
productivity,  inefficient  markets,  and  arbitrary  tax 


burdens  resulted  in  tax  riots  and  political  setbacks 
as  the  fiscal  pressure  increased.  On  the  other  hand, 
the  crises  of  legitimacy  brought  about  by  increas¬ 
ing  taxation  led  to  administrative  buildup  and 
reform  to  cope  with  tax  resistance. 

The  monarchy  acquired  a  tax  monopoly  and  a 
control  with  central  banking.  Schumpeter  argued 
that  the  concentration  of  state  power  was  linked  to 
the  development  of  early  capitalism.  The  market 
economy  led  to  increased  economic  growth,  while 
the  central  authorities  acquired  an  increasing  part 
of  the  economic  surplus  through  taxation  and  state 
enterprise.  Capitalist  growth  could  not  proceed 
without  a  rational  bureaucracy  for  public  adminis¬ 
tration  and  a  rational  system  of  law  for  stable 
transaction  rules  and  predictable  frameworks  for 
entrepreneurship.  In  Max  Weber’s  view,  these 
bureaucratic  and  legal  conditions  were  unique  to 
the  European  trajectory,  in  contrast  to  the  charis¬ 
matic,  patrimonial,  or  traditional  modes  of  rule 
elsewhere. 

With  political  centralization  and  economic  expan¬ 
sion,  the  capacity  for  warfare  increased  accordingly. 
The  perspectives  of  Weber  and  Schumpeter  are 
echoed  in  Charles  Tilly’s  famous  dictum  that  war 
made  the  state,  and  the  state  made  war. 

An  absence  of  government  would  have  been 
disastrous  for  economic  growth.  Anarchic  condi¬ 
tions  would  breed  disorder,  localism,  and  infrastruc¬ 
tural  defects.  The  development  of  communications, 
certified  knowledge  and  expertise,  legal  frameworks 
for  market  transactions,  justice,  and  order  would  all 
suffer.  State  formation  was  favorable  for  the  devel¬ 
opment  of  trade  and  industry. 

On  the  other  hand,  an  overdeveloped  state — a 
self-serving  power  structure  alienated  from  civil 
society — would  also  be  hostile  to  economic  devel¬ 
opment.  The  state  bureaucracy  as  a  new  ruling 
class  could  easily  degenerate  to  kleptocratic  and 
corrupt  practices.  Predatory  governments  are  sys¬ 
tematically  exploiting  society  rather  than  being 
responsive  to  it.  Arbitrary  and  predatory  rule  drain 
the  surplus  away  from  long-term  investment  and 
undermine  the  stable  conditions  of  law  and  predict¬ 
ability.  This  is  the  liberal  account  of  state  power. 

The  liberal  perspective  on  early-modern  state 
formation  has  been  tied  up  with  the  advantages  of 
political  fragmentation  in  the  early-modern  epoch. 
Whereas  the  empires  of  antiquity  were  character¬ 
ized  by  administered  trade  and  a  coalescence  of 
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economic  system  and  territorial  control,  capitalist 
expansion  and  industrial  growth  thrived  with  an 
economic  market  system  beyond  the  borders  of 
state  polities.  The  economic  division  of  labor 
gained  vitality  from  a  more  open  space  than  a 
single  political  unit.  The  economic  system  was 
wider  than  any  single  state  before  the  emergence  of 
the  modern  states  system.  The  plurality  of  states 
stimulated  economic  growth  and  provided  their 
inhabitants  with  an  escape  system  that  served  as  a 
barrier  to  arbitrary  government.  The  importance 
of  the  multipolarity  in  Europe  was  considered 
already  during  the  Enlightenment  and  developed 
at  length  by  Edward  Gibbon  in  his  monumental 
study  of  the  decline  and  fall  of  the  Roman  Empire. 

At  closer  range,  there  is  a  variety  of  reasons  for 
the  specifics  of  modern  state  formation  in  different 
parts  of  Europe.  While  Britain  emerged  as  a  cen¬ 
tralized  polity  in  the  early-medieval  epoch,  Italian 
and  German  lands  were  not  unified  until  the  sec¬ 
ond  half  of  the  19th  century.  Above  the  specifics  of 
individual  states,  Perry  Anderson  has  drawn  a 
major  line  of  division  between  the  East  and  West 
in  Europe.  The  absolutist  state  in  the  West  emerged 
on  endogenous  lines  of  development,  through 
antiquity  and  feudalism,  while  the  Eastern  variety 
of  statehood  was  a  response  to  external  challenges. 
Western  absolutism  was  erected  above  an  emerg¬ 
ing  urban  network  and  a  relatively  free  peasantry, 
while  Eastern  absolutism  built  on  small,  subordi¬ 
nate  urban  societies  and  rural  serfdom.  Thus,  the 
socioeconomic  forms  of  the  West  were  dynamic 
and  adapted  to  a  modernized  state,  while  the 
Eastern  institutions  remained  economically  stag¬ 
nant  with  the  state  as  a  response  to  exogenous 
military  pressure. 

Historically,  state  formation  in  Africa,  Asia,  and 
the  Americas  has  taken  a  variety  of  forms.  Precolonial 
chiefdoms  in  Africa  were  centralized  polities  with 
large  territories  and  often  with  hundreds  of  thou¬ 
sands  of  inhabitants,  some  of  them  intact  right  up 
to  the  onslaught  of  European  colonization.  The 
Asian  empires  evolved  in  a  continuous  rivalry 
between  nomadic  and  sedentary  societies;  even  the 
long  history  of  Chinese  dynasties  was  highly  depen¬ 
dent  on  the  border  fortifications  and  challenges 
from  the  outside.  The  indigenous  realms  in  Meso- 
America  and  South  America  were  unified  in  defense 
against  hostile  tribes  until  the  technologies  and  dis¬ 
eases  of  European  conquerors  brought  them  down. 


These  instances  of  early  state  formation  did  not 
lead  continuously  into  the  contemporary  world. 
The  political  trajectories  were  broken  by  revolu¬ 
tion,  as  in  China,  or  by  colonial  occupation  and 
control,  as  in  Africa  and — earlier — in  the  Americas. 
Postcolonial  state  formation  is  a  complex  mix  of 
endogenous  and  exogenous  change,  with  new 
international  conditions  as  a  crucial  factor. 

Recent  and  Contemporary  State  Formation 

Modern  statehood  has  expanded  throughout  the 
globe  in  four  major  waves  during  the  past  200 
years:  first  the  formation  of  independent  republics 
in  the  Americas  from  the  late  18th  century,  then 
the  dissolution  of  the  Ottoman  and  Habsburg 
empires  during  World  War  I,  followed  by  the  pro¬ 
cess  of  decolonization — with  the  dissolution  of  the 
European  empires  overseas — from  about  1960, 
and  finally  the  emergence  of  new  states  in  the  wake 
of  the  collapse  of  the  communist  regimes  and  the 
dissolution  of  the  Soviet  Union  from  1991. 

These  waves  of  state  formation  were  orches¬ 
trated  by  the  victory  of  rebellious  movements,  the 
collapse  from  above,  and  the  withdrawal  of  exter¬ 
nal  control.  The  key  to  state  formation  here  was 
different  from  the  processes  in  the  neolithic  revolu¬ 
tion  or  the  emergence  of  postfeudal  absolutism  in 
the  West.  Still,  the  European  postfeudal  and  post- 
Renaissance  state  provided  a  formal  model  toward 
which  recently  acquired  states  gravitated. 

In  terms  of  international  law,  as  a  formalization 
and  codification  of  the  modern  states  system,  there 
were  approximately  two  dozen  entities  within  this 
system  at  the  end  of  the  19th  century;  they  included 
the  European  states,  the  former  American  colonies, 
the  Ottoman  empire,  and  other  entities  recognized 
by  treaty  with  European  powers — Persia,  Siam, 
China,  and  Japan.  In  the  19th  century,  however,  it 
was  unclear  what  state  formation  and  statehood 
really  were.  There  have  been  shifting  criteria  for 
the  definition  of  a  sovereign  state.  International 
power  relations  have  been  crucial  in  modern  state 
formation,  as  illustrated  by  the  attempt  by  the 
Western  powers  to  establish  Kurdistan  and 
Armenia  as  independent  states  in  1920,  withdrawn 
3  years  later  under  Turkish  pressure. 

The  states  system  of  the  1 9th  century  was  based 
on  a  constitutive  theory  of  state  formation.  The 
status  as  an  independent  state  was  derived  from 
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diplomatic  practice — through  treaties  and  the 
exchange  of  ambassadors — without  explicit  crite¬ 
ria  or  formal  registration.  The  states  system  had  no 
clear  borders,  and  the  exact  number  of  states  was 
uncertain. 

This  lack  of  formal  criteria  became  increasingly 
awkward  with  international  conferences  and  trea¬ 
ties,  including  the  League  of  Nations  after  World 
War  I.  Which  political  entities  were  qualified  for 
admission?  General  recognition  required  standard¬ 
ized  and  universally  accepted  criteria  for  state¬ 
hood.  The  League  of  Nations  Treaty  circumvented 
the  problem  by  stating  that  any  fully  self-governed 
state,  dominion,  or  colony  could  be  voted  in  as  a 
member,  but  this  left  the  field  open  for  diplomatic 
maneuvering  and  power  politics. 

The  problem  was  addressed  at  an  inter- 
American  conference  in  Montevideo,  Uruguay,  in 
December  1933.  The  final  treaty  defined  a  state 
according  to  objective  criteria  in  a  way  that  is  still 
the  current  view  according  to  international  law. 
To  be  a  state,  a  political  entity  has  to  have  a  per¬ 
manent  population,  well-defined  borders,  a  gov¬ 
ernment,  and  a  capacity  to  honor  international 
obligations.  This  was  called  a  declaratory  theory 
of  statehood,  whereby  an  effective  government 
was  a  prerequisite  for  international  status  and 
recognition.  As  statehood  no  longer  hinged  on 
recognition  as  such,  it  was  not  abolished  by  the 
withdrawal  of  recognition.  The  fact  of  statehood 
was  moved  from  recognition  to  the  empirical  pre¬ 
requisites  for  recognition. 

Furthermore,  the  League  of  Nations  soon  codi¬ 
fied  the  prevalence  of  the  territorial  integrity  of 
established  states  above  the  right  of  secession, 
whatever  the  composition  of  the  population  or  the 
public  opinion  in  the  contested  areas  was.  How 
was  it  possible  to  adapt  post-World  War  II  decolo¬ 
nization  to  the  idea  of  territorial  integrity?  There 
were  51  original  members  of  the  UN  in  1945, 
increasing  to  127  member  states  in  1970,  157  in 
1981,  and  191  in  2004.  This  is  a  fast  and  radical 
process  of  state  formation ,  within  an  international 
states  system  where  the  established  polities  were 
eager  to  protect  their  territorial  integrity  and  avoid 
secessional  dismemberment. 

The  eventual  solution  was  pragmatic  and  finite, 
with  a  doctrine  of  decolonization  that  limited  the 
new  states  to  colonies  overseas  and  prohibited  fur¬ 
ther  secession  from  these  new  sovereign  states. 


According  to  the  doctrine,  decolonization  should 
proceed  in  one  step  across  salt  water,  without  fur¬ 
ther  legitimate  challenges  to  the  integrity  of  old 
and  new  states.  This  principle  was  accepted  by 
governments  throughout  the  Third  World,  in 
defense  of  the  new  state  order.  When  the  former 
USSR  and  the  federation  of  Yugoslavia  dissolved 
after  the  Cold  War,  sovereign  states  were  estab¬ 
lished  within  the  borders  of  the  former  republics, 
without  further  secessions  or  border  revisions.  The 
contested  independence  of  Kosovo  in  2008  is  the 
only  exception  here. 

The  postcolonial  states  and  their  borders  are  the 
result  of  exogenous  processes.  State  formation  is 
based,  first,  on  arbitrary  colonial  delineations  and, 
second,  on  diplomatic  principles.  Due  to  the  com¬ 
bination  of  a  colonial  past  and  a  postcolonial  lack 
of  national  integration  and  legitimacy,  many  post¬ 
colonial  states  have  been  bureaucratically  overde¬ 
veloped,  predatory,  and  inefficient.  Robert  Jackson 
has  called  them  quasi  states ,  lacking  the  institu¬ 
tional  qualities  of  the  Weberian  state.  The  modern 
state  proper,  after  a  long  historical  evolution, 
developed  institutions  by  sailing  between  the 
Scylla  of  anarchy  and  the  Charybdis  of  predatory 
rule.  The  quasi  state,  on  the  other  hand,  was  fully 
recognized  abroad,  but  it  was  predatory  or  par¬ 
tially  anarchic  internally. 

In  Max  Weber’s  terms,  many  postcolonial  states 
are  neopatrimonial  in  the  sense  that  the  public 
bureaucracy  is  mixed  up  with  clientelism,  kinship 
relations,  and  personal  dependency.  In  the  neopat¬ 
rimonial  state,  political  support  is  either  for  sale  or 
based  on  kinship  obligations.  Moreover,  neopatri¬ 
monial  features  are  a  matter  of  degree,  on  a  scale 
from  the  Weberian  ideal  type  of  formal  rationality 
to  the  failed  state. 

War  may  have  been  a  prime  mover  in  early- 
modern  European  state  formation,  but  this  expla¬ 
nation  is  not  generally  valid  in  the  non-European 
world.  War  and  waves  of  conquest  contributed  to 
state  formation  in  some  parts  of  Asia,  such  as 
Vietnam,  but  were  hardly  relevant  in  the  African 
context.  In  many  cases,  formal  independence  was 
followed  by  territorial  disintegration,  ethnic  strife, 
and  civil  war. 

Postcolonial  state  formation  has  been  a  process 
against  heavy  odds.  Early-modern  war  in  Europe 
unified  the  states  against  external  threats,  while 
modern  wars  in  the  non-European  world  have 
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been  mostly  domestic.  External  powers  may  con¬ 
tinue  to  exploit  the  economic  weaknesses  of  post¬ 
colonial  states,  in  contrast  to  the  centralization  of 
surplus  extraction  in  the  early-modern  epoch. 
Weak  institutions  and  deficient  leadership  contrib¬ 
ute  to  the  failure  of  state  formation  in  many  post¬ 
colonial  areas. 

State  Formation  as  a  Nonuniform  Process 

State  formation  takes  place  in  a  historical  context. 
There  is  only  a  superficial  similarity  between  the 
origin  of  the  state  in  neolithic  times,  the  evolution 
of  the  early-modern  state  in  Europe,  and  the  vari¬ 
ety  of  geopolitical  and  socioeconomic  conditions 
behind  state  formation  in  the  modern  era.  It  is  the 
outcome  of  a  certain  class  of  phenomena — the 
centralized  political  structures  within  relatively 
large  territories — that  bind  these  processes 
together.  State  formation  is  a  generic  term.  The 
state  and  the  mechanisms  behind  it  cover  a  wide 
range  of  political  forms  and  experiences. 

No  single  explanation  can  explain  state  forma¬ 
tion  in  general — during  the  neolithic,  the  early- 
modern,  and  the  contemporary  epochs.  Nor  can  a 
single  explanation  adequately  deal  with  modern 
state  making  in,  for  instance,  Britain,  France, 
Germany,  or  the  United  States.  We  have  to  com¬ 
bine  the  more  general  preconditions  with  those 
mechanisms  that  are  specific  to  the  emergence  of 
the  individual  state. 
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Statism 


The  English  word  statism  and  its  equivalents  in 
other  languages  denote  an  idea  of  the  supremacy 
of  the  state  and  the  corresponding  principles,  ide¬ 
ologies,  policies,  institutions,  and  even  specific 
instances  of  state  intervention  in  personal,  social, 
or  economic  matters. 

Origins 

As  a  vague  and  contested  term,  I’etatisme  emerged 
in  France  and  then  spread  throughout  Europe  in 
the  late  19th  or  early  20th  century,  either  as  a 
transliteration  (German  Etatismus,  Polish  etatyzm, 
and  Swedish  etatism )  or  in  the  form  of  a  caique 
(Italian  statalismo,  Spanish  estatismo,  Portuguese 
estadismo ,  Turkish  Devletgilik ,  and  Finnish 
Valtiojohtoisuu),  or  as  both  transliteration  and 
caique  (Russian  3TaTH3M,  rocy.aapciBCHnuHCCTBo; 
English  etatism,  statism;  and  Dutch  etatisme ,  stat- 
isme).  Later,  there  emerged  transliterations  of  the 
English  term  (e.g.,  Lithuanian  statizmas). 

The  idea  of  the  supremacy  of  state  emerged 
much  earlier  then  the  term  I’etatisme.  Soon  after 
Niccolo  Machiavelli  and  Francesco  Guicciardini 
coined  the  word  stato  in  early  16th  century,  signo- 
ral  or  kingly  supremacy  over  a  territorial  polity 
was  gradually  depersonalized  and  conceptualized 
as  a  core  notion  of  political  order.  In  the  mid-17th 
century,  Thomas  Hobbes  identified  the  state  as 
“ Mortal  God,  to  which  wee  owe  under  the 
Immortal  God,  our  peace  and  defence”  ( Leviathan , 
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chap.  17)  or  as  the  ultimate  and  supreme  authority. 
He  claimed  that  there  was  no  power  on  earth  to  be 
compared  with  the  Leviathan — “ Non  est  potestas 
Super  Terrain  quae  Comparetur  ei”  (There  is  no 
power  on  earth  that  can  be  compared  with  him). 
Accordingly,  the  state  was  to  be  a  departure  point 
for  all  political  thinking  and  practice. 

Historical  Evolution 

Further  development  of  statism  in  the  17th  and 
1 8th  centuries  was  challenged  both  by  its  reperson¬ 
alization  by  absolute  monarchs  (as  in  Louis  XIV  of 
France’s  declaration  “L’Etat  c’est  moi”)  and  by 
attempts  to  usurp  state  prerogatives  by  political 
factions  of  all  kinds.  At  the  same  time,  the  viability 
of  the  idea  of  state  supremacy  was  confirmed  by 
“public  policy  science”  ( Polizeiwissenschaft )  and 
constitutionalism.  They  highlighted  the  transition 
from  “state  of  estates”  ( Standestaat )  to  “good- 
policy  state”  ( Polizeistaat ),  and  then  to  various 
forms  of  limited  monarchy,  and  finally  to  constitu¬ 
tional  ones. 

A  major  problem  was  posed  by  a  lack  of  clarity 
about  the  source  or  sources  of  state  authority.  A 
conflict  of  alternative  viewpoints  ranging  from 
divine  to  popular  mandate  made  the  idea  of  state 
supremacy  highly  contestable.  It  was  Georg 
Wilhelm  Friedrich  Hegel  who  tried  to  develop  a 
decidedly  abstract  perspective  that  would  justify 
philosophically  the  supremacy  of  the  state.  In  his 
Elements  of  the  Philosophy  of  Right  (Grundlinien 
der  Philosophie  des  Rechts ),  he  further  developed 
the  Hobbesian  idea  of  the  state  as  Mortal  God: 

The  state  is  the  march  of  God  through  the  world. 

Its  foundation  is  the  power  of  reason  that 
implements  itself  as  will.  To  pursue  the  idea  of 
state  one  has  to  consider  not  specific  states  but 
the  idea  for  itself  or  the  actual  God.  (sec.  258 
addendum) 

In  other  words,  the  idea  of  the  state  could  not  be 
reduced  to  specific  instances  but  should  be 
enhanced  to  the  higher  abstraction  of  transhistori- 
cal  development.  The  state  was  made  a  part  of  the 
divine  strategy,  not  a  mere  product  of  human 
endeavor.  To  this  day,  Hegel’s  claim  remains  the 
purest  expression  of  the  philosophical  justification 
of  statism. 


In  the  postrevolutionary  era  of  the  19th  century, 
statism  was  challenged  by  its  many  rivals.  It  was 
opposed  by  individualism,  anarchism,  socialism, 
communism,  and  radical  brands  of  liberalism,  to 
mention  only  the  most  important  ones.  Marxism 
advanced  a  “scientific”  theory  of  universal  devel¬ 
opment  according  to  which  the  state  was  supposed 
to  wither  away.  It  was  the  most  radical  and  clear 
expression  of  antistatism  at  that  time. 

It  was  in  the  19th  century  that  the  word  statism 
acquired  its  pejorative  or  derogatory  sense,  which 
is  still  widespread  in  ideologically  motivated  dis¬ 
courses.  But  it  was  also  the  moderate  liberalism 
and  particularly  the  constitutionalism  of  the  19th 
century  that  helped  clarify  many  aspects  of  the 
idea  of  the  supremacy  of  state  as  the  crucial  frame¬ 
work  for  international  and  national  political  order. 

The  Second  Empire  periods  in  France  (1852— 
1870)  and  in  Germany  (1871-1919)  were  the 
main  vehicles  of  statism.  The  term  welfare  state 
( Etat  providence)  was  originally  coined  by  Emile 
Ollivier,  a  deputy  in  the  National  Assembly,  in 
1864.  He  used  it  to  identify  a  new  system  of  social 
solidarity  (systeme  de  solidarity  national). 

In  Germany,  statism  was  not  only  related  to 
strengthening  the  traditional  aspects  of  state  power 
(military  might,  administrative  capabilities,  etc.) 
but  also  marked  by  the  intrusion  of  the  new  impe¬ 
rial  state  into  areas  that  previously  were  the 
domains  of  nonstate  agencies.  The  policy  of  culture 
struggle  (Kulturkampf)  allowed  the  German  state 
to  promote  effective  secularization  and  to  play 
down  the  authority  of  the  Catholic  Church.  As  a 
result,  the  state  gained  control  over  certain  func¬ 
tions  that  hitherto  had  belonged  exclusively  to  the 
church,  such  as  education,  marriage  ceremonies, 
and  so  on.  The  church  was  forbidden  from  partici¬ 
pating  directly  in  political  affairs.  The  state  assumed 
responsibility  for  the  training  and  appointment  of 
clergy.  Under  Chancellor  Otto  von  Bismarck’s 
May  Laws  of  1873  and  1874,  the  state  assumed 
responsibility  for  the  training  and  appointment  of 
clergy.  (In  1886,  however,  this  policy  was  relaxed, 
returning  education  of  clergy  to  the  seminaries.) 

Bismarck’s  policy  of  social  insurance  (soziale 
Fiirsorge )  meant  a  more  direct  and  decisive  intru¬ 
sion  of  the  state  into  the  private  sphere.  It  helped 
introduce  health  insurance,  accident  insurance, 
disability  insurance,  and  old-age  retirement  pen¬ 
sion  in  the  1880s.  The  policy  helped  further 
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develop  the  German  tradition  of  the  welfare  state 
( Wohlfahrtsstaat )  that  dated  back  to  the  “good- 
policy  state”  ( Polizeistaat )  of  the  18th  century  and 
paved  the  way  for  the  “social  state”  (Sozialstaat) 
of  the  20th  century.  In  fact,  Bismarck  himself  iden¬ 
tified  his  policy  first  openly  as  practical  Christian¬ 
ity  and  then  informally  as  “state  socialism” 
(Staatssozialismus).  Both  identifications  were  cor¬ 
rect  since  the  traditional  church  domain  of  care¬ 
taking  for  the  poor  and  needy  was  taken  over  by 
the  state  to  forestall  socialist  claims  on  the  issues 
of  well-being  of  working  people. 

At  the  turn  of  the  20th  century,  statism  was 
provided  with  sound  intellectual  justification  by  a 
prominent  Swedish  political  scientist,  Johan  Rudolf 
Kjellen.  He  advocated  that  only  states  as  main 
political  units  made  domestic  and  foreign  politics 
possible.  States  collectively  provided  the  arena  of 
international  politics;  each  individual  state  com¬ 
prised  a  framework  for  national  politics.  He  iden¬ 
tified  states  as  a  “life-form”  ( lifvsform ),  stressing 
their  organic  unity  with  the  life  stories  of  respec¬ 
tive  peoples  in  their  natural  environment.  Thus, 
Kjellen  secularized  the  Hobbesian  “Mortal  Gods” 
into  “life-forms”  subject  to  scientific  investigation. 

World  War  I  led  to  a  major  breakthrough  of 
statism.  All  the  belligerents  were  forced  to  develop 
state-controlled  wartime  economies.  Both  political 
thinking  and  institution  building  were  greatly 
influenced  by  this  development.  At  the  same  time, 
this  also  provoked  a  reaction,  boosting  antistatism 
of  various  sorts,  ranging  from  serene  pacifism  to 
revolutionary  communism.  The  clash  of  statist  and 
antistatist  pressures  further  complicated  the  politi¬ 
cal  developments.  In  the  setting  of  postwar  recon¬ 
struction,  state  agencies,  corporations,  trade 
unions,  cooperatives,  and  social  movements  were 
making  overlapping  efforts. 

The  October  Revolution  posed  a  major  chal¬ 
lenge  to  statism,  launching  an  alternative  form  of 
direct  “workers’  and  peasants’  democracy  of 
Soviets.  ”  A  supposedly  antistatist  drive  soon  led  to 
the  adoption  of  the  Stalinist  policy  of  a  “socialist 
state  in  a  single  country”  and  the  creation  of  a 
totalitarian  “party-state”  ( napmm-eocydapcmeo ). 

Policies  of  self-determination  swept  many  coun¬ 
tries  in  the  wake  of  the  World  War  I.  To  a  great 
extent,  this  trend  depended  on  state  efforts  and 
promoted  various  kinds  of  statism.  For  example, 
in  Turkey,  statism,  or  Devletgilik,  became  one  of 


the  six  “arrows”  or  principles  of  Kemalism  (the 
political  philosophy  of  Mustafa  Kemal,  founder  of 
the  modern  Turkish  Republic).  According  to  that 
principle  and  policy,  the  state  was  to  direct  the 
country’s  economic  development.  It  was  also  to 
engage  in  areas  where  private  enterprise  had 
proven  to  be  inadequate,  or  where  national  inter¬ 
est  required  direct  state  involvement.  In  promoting 
Devletgilik ,  the  Turkish  state  emerged  as  the 
owner  of  the  major  industries  of  the  country. 

Statism  was  also  one  of  the  cornerstones  of 
Italian  fascism.  At  a  meeting  in  October  1925, 
Benito  Mussolini  formulated  the  guiding  motto  of 
his  regime:  “Everything  in  the  State,  nothing  out¬ 
side  the  State,  nothing  against  the  State”  ( Tutto 
nello  Stato,  niente  al  di  fuori  dello  Stato,  nullo 
contro  lo  Stato-,  Karl  Bracher,  1973).  Such  verbiage 
incited  resentment.  Soon  the  fascist  brand  of  stat¬ 
ism  was  labeled  statolatry  (statol 'atria).  In  his 
encyclical  Non  Abbiamo  Bisogno  of  June  29, 1931, 
Pope  Pius  IX  denounced  fascist  attempts  to  monop¬ 
olize  completely  the  young,  from  their  most  tender 
years  up  to  manhood  and  womanhood,  for  the 
exclusive  advantage  of  a  party  and  of  a  regime 
based  on  an  ideology  that  clearly  resolves  itself  into 
a  true  and  genuine  pagan  worship  of  the  state  (“in 
una  vera  e  propria  statolatria  pagana ”) — statolatry. 
Later,  the  term  statolatry  was  widely  used  by  a 
staunch  critic  of  statism,  Ludwig  von  Mises. 

Such  claims  to  total  control  were  beyond  the 
capabilities  of  fascist  Italy  to  sustain.  Instead,  it 
developed  a  state  structure  known  as  the  corporate 
state  with  the  ruling  party  acting  as  a  mediator 
between  “corporations”  making  up  the  body  of 
the  nation.  Similar  designs  were  quite  popular  else¬ 
where  in  the  1930s.  The  most  prominent  examples 
were  Estado  Novo  in  Portugal  (1932-1968)  and 
Brazil  (1937-1945),  the  Austrian  Standestaat 
(1933-1938),  and  authoritarian  experiments  in 
Estonia,  Romania,  and  some  other  countries  of 
East  and  East-Central  Europe. 

Much  more  consistent  attempts  to  develop 
totalitarian  versions  of  statism  were  undertaken  in 
the  German  Third  Reich  and  the  former  USSR. 
Still,  the  “success”  was  fairly  ambiguous  and  self- 
defeating  since  the  emergent  models  actually  inte¬ 
grated  state  structures  into  a  more  inclusive  system 
of  total  control,  in  the  form  of  the  Hitlerite  “move¬ 
ment-state”  (Bewegungsstaat)  and  the  Stalinist 
“party-state”  ( napmm-zocydapcmeo ).  Thus,  the 
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state  in  the  proper  sense  of  the  term  was  actually 
eradicated. 

Liberal  versions  of  corporate  statism  of  the 
1930s  developed  into  welfare  states.  The  most 
notable  examples  are  the  United  States, 
Netherlands,  Australia,  New  Zealand,  Uruguay, 
Sweden,  and  other  Nordic  countries. 

The  advancement  of  the  American  welfare  state 
was  highlighted  by  the  New  Deal,  a  series  of  eco¬ 
nomic  programs  President  Franklin  D.  Roosevelt’s 
administration  initiated  between  1933  and  1936. 
The  First  New  Deal,  of  1933,  was  aimed  at  short¬ 
term  recovery  programs  to  combat  the  effects  of 
the  Great  Depression.  The  Second  New  Deal 
(1935-1936)  included  labor  union  support,  the 
Social  Security  Act,  and  programs  to  aid  farmers 
and  migrant  workers. 

In  Sweden,  a  policy  of  taking  over  the  system  of 
parish  welfare  by  the  state  and  the  development  of 
additional  state-controlled  social  services  was 
introduced  in  early  1930s.  It  is  known  as  people’s 
home  (Folkhemmet),  a  very  telling  term  intro¬ 
duced  already  in  1928  by  the  Social  Democratic 
Party  leader  Per  Albin  Hansson,  with  inspiration 
presumably  coming  from  Kjellen. 

Recent  Developments 

In  the  later  part  of  the  20th  century,  the  welfare 
state  and  statism  had  their  ups  and  down. 
Emergence  of  new  states,  economic  crises,  social 
unrest,  and  revolutions  were  typically  accompa¬ 
nied  by  an  upsurge  of  statist  policies  and  thinking. 
Periods  of  relatively  steady  development  tended  to 
incite  withdrawal  of  the  state  from  economic  and 
social  spheres.  The  most  prominent  examples  are 
Thatcherism  and  Reagonomics. 

Statism’s  influences  on  political  science  are 
mainly  found  in  the  traditions  of  normative  and 
legal  studies  of  states.  Far  more  outspoken  are 
antistatism  traditions  related  to  liberal  and  Marxist 
sources  of  inspiration.  In  a  more  general  context, 
both  the  behavioral  approach  and  structural  func¬ 
tionalism  helped  play  down  elements  of  statism 
from  the  1950s  to  the  1970s,  introducing  concepts 
of  political  systems  and  political  culture.  Reaction 
followed  and  “brought  the  state  back  in”  (see 
Theda  Skocpol,  Peter  Evans,  &  Dietrich  Ruesche- 
meyer,  1985).  The  globalization  debate  reintro¬ 
duced  expectations  of  the  state  “withering  away.” 


Currently,  a  more  balanced  approach  seems  to 
prevail.  It  is  generally  recognized  that  the  system  of 
states  is  indispensable  as  a  kind  of  reference  grid 
for  world  politics  and  the  respective  territorial 
units  are  vital  institutional  frameworks  for  internal 
policies. 

Mikhail  Ilyin 
MGIMO  University 
Moscow,  Russian  Federation 
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Statistical  Inference,  Classical 
and  Bayesian 


Statistical  inference  is  a  form  of  induction  and  can 
be  broadly  defined  as  “learning  from  data.”  The 
two  dominant  forms  of  statistical  inference  are 
“classical”  (or  “frequentist”)  inference  and  Bayes¬ 
ian  inference. 

Briefly,  classical  inference  assesses  the 
plausibility  of  a  hypothesis  by  asking  bow 
frequently  we  would  see  results  like  the  one 
actually  obtained  in  repeated  applications  of  the 
data  generation  mechanism,  assuming  the 
hypothesis  to  be  true.  If  a  statistic  0  computed 
with  the  observed  data  is  judged  to  be  sufficiently 
unusual  relative  to  its  expected  value  under  the 
hypothesis,  then  the  hypothesis  is  considered 
falsified.  The  assumed  hypothesis  is  often  a  “null” 
or  “no  effects”  hypothesis;  if  this  hypothesis  is 
rejected  (in  the  sense  given  above),  then  we  usually 
say  that  we  have  a  “statistically  significant” 
finding.  The  assumptions  here  are  that  statistics 
vary  randomly  across  repeated  applications  of  the 
data  generation  mechanism  (e.g.,  random 
sampling,  say  in  the  case  of  the  analysis  of  survey 
data),  while  the  objects  of  interest — population 
parameters  0 — are  constants.  Repeated  appli¬ 
cations  of  the  sampling  process,  if  undertaken, 
would  yield  different  y  and  different  0.  The 
distribution  of  values  of  0  that  would  result  from 
repeated  applications  of  the  sampling  process  is 
called  the  sampling  distribution  of  0;  the  standard 
deviation  of  this  distribution  is  the  standard  error 
of  0.  For  many  statistics,  asymptotic  theory  gives 
the  form  of  the  statistic’s  large-sample  sampling 
distribution  (e.g.,  normal,  x2)-  The  sampling 
variance  of  a  statistic  is  often  also  easy  to  estimate; 
for  instance,  if  0  is  the  maximum  likelihood 
estimate,  then  L(0)  is  often  estimated  with  the 
inverse  of  the  information  matrix  (minus  the 
second  derivatives  of  the  log  of  the  likelihood 
function  with  respect  to  0,  usually  evaluated  either 
at  0  or  at  a  hypothesized  value  0*).  This  approach 
is  by  and  far  the  most  frequently  taught  and 
frequently  deployed  framework  for  statistical 
inference  in  the  social  sciences. 

By  contrast,  Bayesian  inference  uses  Bayes  rule 
(we  will  drop  the  apostrophe  in  “Bayes’  rule”)  to 
compute  the  conditional  probability  of  hypotheses 


given  the  data  at  hand,  without  any  explicit 
reference  to  what  might  happen  over  repeated 
applications  of  the  data  generation  mechanism. 
Bayes  rule  states  that  if  A  and  B  are  events  then 

P(A|B)=P<™,  (I* 

where  P(A\B)  is  the  conditional  or  posterior  prob¬ 
ability  of  A  given  that  event  B  has  occurred,  P(A)  is 
the  prior  probability  of  A,  and  P(B)  is  the  marginal 
probability  of  B.  This  proposition — an  uncontro- 
versial  result  given  the  conventional  definition  of 
conditional  probability — can  be  restated  more  pro¬ 
vocatively  as 


(2) 

where  P(H )  is  the  prior  probability  of  a  hypothesis 
and  P(E\H)  is  the  likelihood  of  “evidence”  (or 
data)  E  under  hypothesis  El.  This  form  of  Bayes 
rule  underscores  its  relevance  as  a  tool  for  statisti¬ 
cal  inference.  In  the  case  of  a  finite  set  of  compet¬ 
ing  hypotheses  H  —  {H, ,  .  .  .  ,  Hf,  the  law  of 

total  probability  implies  that  P(E)  =  X  ;eH  PlElHf 
P(Hf.  Note  that  the  resulting  posterior  probabili¬ 
ties  constitute  a  proper  probability  mass  function 

over  the  set  El ;  that  is,  X  j€H  P(El]  IE)  =  1.  For  the 
case  of  a  continuous  parameter  0  6  ©  C  H  and 
data  y  ~  p(y\9),  Bayes  rule  becomes 


P(%)  - 


p(y|6)p(9) 

/0p(y|e)p(0)^0 


°cp(y|0)p(0), 


(3) 


or  (in  words)  the  posterior  density  for  0  is  propor¬ 
tional  to  the  prior  density  for  0,  p(0),  times  the 
likelihood  for  the  data  given  0,  ply  I  0).  The  integral 
in  the  denominator  in  Equation  3  ensures  that  the 
posterior  density  integrates  to  one  and  thus  is  a 
proper  probability  density. 

Note  that  Bayesian  inferences  do  not  rely  on 
how  0  might  vary  over  repeated  applications  of 
random  sampling.  Instead,  Bayesian  procedures 
are  geared  toward  answering  a  simple  question: 
What  should  I  believe  about  0  in  light  of  the  data 
available  for  analysis,  y?  The  quantity  0  (y)  has  no 
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special,  intrinsic  status  in  the  Bayesian  approach, 
although  a  statistic  such  as  a  least  squares  or 
maximum  likelihood  estimate  0  is  a  feature  of  the 
data  that  may  be  helpful  in  computing  the  poste¬ 
rior  distribution  for  0.  In  the  Bayesian  approach, 
the  roles  of  0  and  0  are  reversed  relative  to  their 
roles  in  classical,  frequentist  inference:  0  is  ran¬ 
dom,  in  the  sense  that  the  researcher  is  uncertain 
about  its  value,  while  0  is  fixed,  a  feature  of  the 
data  at  hand.  An  example  will  help  make  the  dis¬ 
tinction  clear. 

Example:  Attitudes  Toward  Abortion.  In  the  1994 
General  Social  Survey  (GSS),  1,934  respondents 
were  asked,  “Please  tell  me  whether  or  not  you 
think  it  should  be  possible  for  a  pregnant  woman 
to  obtain  a  legal  abortion  if  the  woman  wants  it  for 
any  reason.” 

Of  the  respondents,  895  reported  “yes,”  and 
1,039  said  “no.”  Let  0  be  the  unknown  population 
proportion  of  respondents  who  agreed  with  the 
proposition  in  the  survey  item — that  a  pregnant 
woman  should  be  able  to  obtain  an  abortion  if  the 
woman  wants  it  for  any  reason.  The  question  of 
interest  is  whether  a  majority  of  the  population 
supports  the  proposition  in  the  survey  item. 

Under  the  assumption  that  the  survey  responses 
are  independent  (not  an  unreasonable  assumption 
for  a  survey  such  as  the  GSS),  the  binary  survey 
responses  can  be  modeled  as  binomial  process; 
that  is,  the  likelihood  has  the  binomial  form 

(4> 

Frequentist  Approach.  The  maximum  likelihood 
estimate  of  0  is  0  =  895/1934  ~  .46,  the  approxi¬ 
mation  coming  via  rounding  to  two  significant 
digits.  With  this  large  sample,  the  normal  is  an 
excellent  approximation  to  the  sampling  distribu¬ 
tion  of  0.  Interest  focuses  on  whether  the  unknown 
population  proportion,  0,  is  equal  to  .5.  A  typical 
classical  approach  to  this  question  is  to  test  the  null 
hypothesis  H0:  0  =  .5  against  all  other  alternatives 
HA  :  0  #  .5  or,  say,  a  one-sided  alternative  HB  :  0  > 
.5.  We  would  then  ask  how  often  we  would  see  the 
value  of  0  actually  obtained,  or  an  even  more 
extreme  value  if  H0  were  true,  by  centering  the 
sampling  distribution  of  0  at  the  hypothesized 


value.  The  variance  of  0  under  H0  is  given  in  the 
usual  way,  computed  as  minus  the  inverse  of  the 
second  derivatives  of  the  log-likelihood  function 
evaluated  at  the  hypothesized  value  of  0.  In  turn, 
the  standard  deviation  of  the  normal  sampling  dis¬ 
tribution  (the  standard  error  of  0)  under  H0  is 


se(0|Ho)  =  6ho) 


.50  x  (1  -  .50) 


1934 


.011. 


The  realized  value  of  0  is  (.5  —  .46)/.011  ~  3.64 
standard  errors  away  from  the  hypothesized  value. 
Under  a  normal  distribution,  this  is  an  extremely 
rare  event.  Over  repeated  applications  of  random 
sampling,  only  a  small  proportion  of  estimates  of 
0  will  lie  3.64  or  more  standard  errors  away  from 
the  hypothesized  mean  of  the  sampling  distribu¬ 
tion.  This  proportion  is 


oo 

2  X  /  d \>(z)dz  -  2  X  [1  -  $(3.64)]  «  .00028, 

3.64 

where  4>(*)  and  $(•)  are  the  normal  PDFs  and 
CDFs,  respectively.  Given  this  result,  most  (fre¬ 
quentist)  analysts  would  reject  the  null  hypothesis 
in  favor  of  either  alternative  hypothesis,  reporting 
the  p  values  for  H0  against  HA  as  .00028  and  for  H0 
against  HB  as  .00014. 

Bayesian  Approach.  The  unknown  parameter  is 
06  0  =  [0,  1J.  Suppose  that  we  bring  little  or  no 
prior  information  to  the  analysis,  assigning  0  a 
uniform  prior  over  0.  Applying  Bayes  rule,  we 
seek  the  posterior  density  p(Q\y)  x  p(Q)p(y  10).  The 
uniform  prior  is  constant  with  respect  to  0  and  is 
absorbed  into  the  constant  of  proportionality. 
With  a  binomial  likelihood  and  a  uniform  prior  on 
0,  the  posterior  density  of  0  is  a  Beta  density, 
which  is  well  approximated  by  a  normal  density 
for  samples  of  even  moderate  size.  That  is,  our 
working  approximation  to  the  posterior  density 
for  0  is  a  normal  density  centered  on  the  maximum 
likelihood  estimate  of  .46  with  standard  deviation 
.011;  that  is,  p(0ly)  ~  N(.46,  .Oil2).  Note  that 
most  of  the  posterior  probability  mass  lies  below 
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.5,  suggesting  that  the  hypothesis  0  >  .5  is  not 
well  supported  by  the  data.  In  fact,  the  posterior 
probability  of  this  hypothesis  is 

oo 

Pr(0  >  ,5|y)  -  J  p(9\y)d8 

.5 


00 


.5 


Note  an  apparent  symmetry  between  the  fre- 
quentist  and  Bayesian  answers.  In  both  instances, 
the  “answer”  involved  computing  the  same  tail 
area  probability  of  a  normal  distribution,  with  the 
probability  of  H0  under  the  Bayesian  posterior 


distribution  corresponding  with  the  p  value  in  the 
frequentisi  test  of  H0  against  the  one-sided  alterna¬ 
tive  Hb  (see  Figure  1).  But  this  similarity  really  is  only 
superficial.  The  Bayesian  probability  is  a  statement 
about  the  researcher’s  beliefs  about  0  obtained  via 
application  of  Bayes  rule,  and  is  Pr  (H0\y),  obtained 
by  computing  the  appropriate  integral  of  the  poste¬ 
rior  distribution  p(Q ly).  The  frequentist  p  value  is 
obtained  via  a  slightly  more  complex  route  and  has 
a  quite  different  interpretation  from  the  Bayesian 
posterior  probability  since  it  conditions  on  the  null 
hypothesis;  that  is,  the  sampling  distribution  is 
/~(0| /-/())■  and  the  p  value  for  H0  against  the  one-sided 
alternative  is  the  proportion  of  0  less  than  .46  we 
would  see  under  repeated  sampling  with  the  sam¬ 
pling  distribution  assumed  under  the  null  hypothesis. 


Posterior  density  of  G,  p(G/y)  Posterior  density  of  6 ,  p(G/y), 


Sampling  distribution  of  G  under  H0  :  G  =  .5  Sampling  distribution  of  6  under  H0  :  6  =  .5, 

zoomed  in  to  neighborhood  around  6  =  .46 


Figure  I  Attitudes  Toward  Abortion:  Posterior  Density  of  0  Contrasted  With  Sampling  Distribution  of  0  Under 

Ho:0  =  .5 

Source:  Jackman,  S.  (2009).  Bayesian  analysis  for  the  social  sciences  (Figure  1.10,  p.  35).  Hoboken,  NJ:  Wiley. 

Note:  The  top  right  panel  shows  the  posterior  density  in  the  neighborhood  of  u  5  .5,  with  the  shaded  region  corresponding 


to  the  posterior  probability  p(9  >  -5|y)  —  j  p(0|y)d0  =  .00014  .  The  lower  right  panel  shows  the  sampling  distribution 

in  the  neighborhood  of  0  =  .46  ,  with  the  shaded  region  corresponding  to  the  proportion  of  times  one  would 
observe  0#  .46  if  H0  :  u  5  .5  were  true,  corresponding  to  .00014  of  the  area  under  the  sampling  distribution. 
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Relevance  of  the  Classical/Bayesian 
Distinction  to  Social  Science 

The  distinction  between  the  classical/frequentist 
and  Bayesian  approaches  to  statistical  inference  is 
important  in  at  least  three  ways:  (1)  although  the 
frequentist  approach  is  ubiquitous  in  social  science 
training  and  practice,  it  is  unclear  whether  its 
premises  apply  in  many  social  science  contexts; 
(2)  many  social  scientists  often  give  Bayesian  inter¬ 
pretations  to  the  outcome  of  classical  procedures 
and  (3)  since  parameters  are  considered  random 
variables  in  the  Bayesian  approach,  extensions 
such  as  multilevel  or  hierarchical  modeling  are 
straightforward,  as  is  inference  for  latent  variables, 
missing  data,  and  so  on. 

Inference  Without  Random  Sampling 

Consider  the  analysis  of  cross-national  data  in 
economics,  political  science,  or  sociology,  say 
using  national  accounts  data  from  the  OECD 
(Organisation  for  Economic  Co-operation  and 
Development).  Alternatively,  consider  researchers 
in  American  politics  analyzing  all  trade  votes  in 
the  U.S.  Senate  in  a  given  period  or  an  analysis  of 
the  determinants  of  civil  war,  using  an  authorita¬ 
tive,  comprehensive  listing  of  all  civil  wars  in  the 
world  since  World  War  II. 

In  what  sense  do  data  such  as  these  comprise  a 
sample  from  a  population?  Repeating  the  data  col¬ 
lection  exercise  would  not  yield  a  new  sample  from 
the  population,  but  rather,  save  for  coding  error  or 
errors  in  transmission  or  transcription  of  the  data, 
“repeated  sampling”  would  yield  exactly  the  same 
data  set  and  statistics,  0 .  In  cases  such  as  these,  there 
is  no  uncertainty  due  to  variation  in  repeated  sam¬ 
pling  from  a  population.  Nonetheless,  one  can  feed 
data  of  this  sort  to  a  computer  program  and  have 
standard  errors  and  significance  tests  reported  for 
various  statistics  (e.g.,  means,  correlations,  regres¬ 
sion  coefficients)  as  usual.  But  what  do  those  stan¬ 
dard  errors  mean  in  this  context?  What  do  we  mean 
when  we  say  a  result  is  “statistically  significant”  in 
this  setting? 

Several  possibilities  come  to  mind.  Coding  error 
might  see  the  numbers  changing  over  repetitions  of 
the  data  collection  process,  say,  if  the  data  were 
being  manually  transcribed  from  archival  sources 
into  a  database.  But  this  form  of  measurement  error 
may  have  quite  different  operational  characteristics 
from  classical  sampling  error:  For  example,  what  is 


the  population  of  possible  data  sets  that  might  be 
generated  via  transcription  error,  and  how  do  we 
(randomly)  sample  from  that  population?  Are  those 
transcription  errors  distributed  randomly  and,  if  so, 
via  what  distribution?  In  short,  does  a  central  limit 
theorem  apply  to  the  resulting  estimator  and  stan¬ 
dard  errors?  Almost  surely  not:  As  n  — >  co,  frequen¬ 
tist  sampling  error  vanishes  (and,  for  independent 
data,  at  a  rate  of  \[n ) ,  but  this  hardly  seems  likely 
for  coding  errors. 

Nor  is  it  satisfying  or  particularly  helpful  to  say, 
“The  data  are  just  one  of  many  possible  data  sets 
that  could  have  been  generated  if  history  were  to  be 
replayed  many  times  over.”  What  is  the  sampling 
mechanism  that  selected  the  history  we  happened 
to  observe?  No  one  knows,  or  can  know.  Adhering 
to  a  frequentist  conception  of  probability  in  the 
face  of  nonrepeatable  data  seems  dubious. 

Moreover,  it  is  not  clear  how  to  attach  frequen¬ 
tist  probabilities  to  past  or  nonrepeatable  events 
about  which  one  may  be  uncertain  (e.g.,  the  proba¬ 
bility  that  an  asteroid  impact  caused  the  K-T  mass 
extinction)  or  to  future  events  (e.g.,  the  probability 
that  Democrats  will  hold  a  majority  of  seats  in  the 
U.S.  House  of  Representatives  after  the  next  elec¬ 
tion).  That  is,  there  is  a  large  class  of  events  of  inter¬ 
est  to  social  scientists  for  which  the  frequentist 
notion  of  probability  is  inappropriate.  In  these 
situations — quite  common  in  the  social  sciences — 
the  Bayesian  approach  provides  a  coherent  basis  for 
inference. 

What  Is  a  Confidence  Interval? 

The  stark  contrast  between  classical  and  Bayesian 
inference  is  also  particularly  evident  when  we  con¬ 
sider  a  staple  of  statistical  practice,  the  confidence 
interval.  To  illustrate  this,  consider  the  abortion 
attitudes  example  again.  Using  a  large-sample, 
normal  approximation,  a  95%  confidence  interval 
for  0  is  0  ±  2  X  se(0)  =  .46  ±  2  X  .011  = 
[.438,  .482],  But  what  exactly  is  the  interpretation 
of  this  confidence  interval?  Many  social  scientists 
in  this  situation  would  say,  “There  is  a  .95  prob¬ 
ability  that  0  lies  between  .438  and  .482.” 

But  this  is  not  the  correct  classical  interpretation. 
In  the  classical  approach,  parameters  are  fixed  char¬ 
acteristics  of  populations,  so  either  0  lies  in  the 
interval,  or  it  doesn’t.  The  correct  interpretation  of 
a  classical  or  frequentist  confidence  interval  depends 
on  the  repeated  sampling  characteristics  of  a  sample 
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statistic.  In  the  case  of  a  95%  confidence  interval, 
the  correct  frequentist  interpretation  is  that  95  %  of 
the  95%  confidence  intervals  one  would  draw  in 
repeated  samples  will  include  0.  This  leads  us  to 
wonder  if  the  95%  confidence  interval  that  one 
constructs  from  the  data  set  at  hand  is  one  of  the 
lucky  95%  that  actually  contain  0.  No  one  knows. 

This  example  highlights  how  the  repeated  sam¬ 
pling,  frequentist  notion  of  probability  is  simply  not 
about  how  many  researchers  use  and  understand  the 
term  probability.  From  the  frequentist  perspective, 
the  statement  “There  is  a  .95  probability  that  0  lies 
between  .438  and  .482”  is  valid,  since  for  frequen- 
tists  “probability”  is,  at  least  tacitly,  understood  to 
mean  “relative  frequency  in  repeated  sampling.”  But 
this  is  not  how  most  practioners  understand  confi¬ 
dence  intervals.  Rather,  subjective  statements  of  the 
sort  “I  am  95%  sure  that  0  E  [.438,  .482J”  are  quite 
typical.  The  correct  frequentist  interpretation  is  the 
less  helpful  statement  about  the  performance  of 
the  95%  confidence  interval  in  repeated  sampling. 
On  the  other  hand,  the  statement  “Having  looked 
at  the  data,  I  am  95%  sure  that  0  E  [.438,  .482J” 
is  produced  by  a  Bayesian  analysis,  a  characteriza¬ 
tion  of  the  researcher’s  beliefs  about  a  parameter 
in  formal,  probabilistic  terms  rather  than  a  state¬ 
ment  about  the  repeated  sampling  properties  of  a 
statistical  procedure. 

Hierarchical  Modeling 

The  prior  density  p(Q)  plays  a  key  role  in  Bayesian 
inference.  Some  critics  of  Bayesian  inference  point 
to  the  arbitrary,  subjective  nature  of  the  prior  den¬ 
sity  as  a  weakness  of  the  Bayesian  approach.  But 
the  prior  density  also  provides  a  way  for  model 
expansion  when  working  with  data  pooled  over 
multiple  units  and/or  time  periods.  Data  of  this  sort 
abound  in  the  social  sciences.  Individuals  live  in 
different  locations,  with  factors  that  are  constant 
for  anyone  within  that  location  but  that  vary  across 
locations;  students  attend  different  schools;  voters 
live  in  different  constituencies;  firms  operate  in  dif¬ 
ferent  types  of  markets;  politicians  compete  under 
different  sets  of  electoral  rules,  and  so  on.  Under¬ 
standing  the  relative  weight  of  microlevel  variables 
and  macrolevel  or  “contextual”  variables — and 
critically,  the  interaction  between  them — is  central 
to  many  research  programs  in  the  social  sciences.  A 
key  question  in  research  of  this  type  is  how  the 


causal  structure  that  operates  at  one  level  of  analy¬ 
sis  (e.g.,  individuals)  varies  across  a  “higher”  level 
of  analysis  (e.g.,  localities  or  time  periods). 

The  Bayesian  approach  to  statistical  inference  is 
extremely  well  suited  to  answering  this  question. 
Recall  that  in  the  Bayesian  approach  parameters 
are  random  variables.  Thus,  it  is  not  difficult  to 
replace  the  prior  p(Q)  with  a  richer  model  to  allow 
for  the  way  the  parameters  0  might  vary  across 
groups  j  =  1, ...  J,  perhaps  as  a  function  of  the 
characteristics  of  the  groups:  For  example,  0;  ~ 
f(zp  y),  where  y  is  a  set  of  unknown  hyperparam¬ 
eters.  That  is,  the  model  is  now  composed  of  a 
nested  hierarchy  of  stochastic  relations:  The  data 
from  unit  /,  y,  are  modeled  as  a  function  of  covari¬ 
ates  and  parameters  0;,  while  cross-unit  heteroge¬ 
neity  in  the  0-  is  modeled  as  a  function  of 
unit-specific  covariates  z;  and  hyperparameters  y. 

Models  of  this  sort  are  known  to  Bayesians  as 
hierarchical  models  but  go  by  many  different 
names  in  different  areas  of  the  social  sciences 
depending  on  the  specific  form  of  the  model  and 
the  estimation  strategy  being  used  (e.g.,  models 
with  “random”  or  “varying”  coefficients  models, 
“multilevel”  or  “mixed”  models).  It  is  no  exaggera¬ 
tion  to  say  that  hierarchical  modeling  is  one  of  the 
great  successes  of  the  Bayesian  revival  of  the  past 
20  years.  Simple  models  of  causal  heterogeneity — 
in  which  parameters  are  assumed  to  vary  in  a  non- 
deterministic  way  across  units — are  substantively 
attractive  and  yet  nontrivial  enough  to  estimate  in 
a  classical  setting.  A  suite  of  models  known  as 
Markov  Chain  Monte  Carlo  (MCMC)  made  Bayes¬ 
ian  computation  for  these  models  rather  simple. 

Feasibility  of  Bayesian  Inference  via  MCMC 

As  recently  as  the  1980s,  applications  of  Bayesian 
inference  were  limited  to  a  relatively  small  set  of 
problems  in  which  the  posterior  density  for  0  can  be 
expressed  in  closed  form;  even  a  problem  as  simple 
as  logistic  or  probit  regression  models  for  binary 
data  was  mathematically  intractable,  and  so  Bayes¬ 
ian  inference  was  more  a  philosophical  position  than 
a  practical  strategy  for  data  analysis.  But  throughout 
the  1990s,  there  was  something  of  an  explosion  of 
interest  in  Bayesian  approaches,  almost  entirely 
driven  by  the  fact  that  cheap  computing  power  made 
it  feasible  to  do  simulation-based,  Bayesian  statisti¬ 
cal  data  analysis.  The  MCMC  made  the  Bayesian 
approach  not  just  a  theoretical  curiosity  but  a 
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practical  reality  for  applied  researchers.  Briefly, 
MCMC  works  by  using  Monte  Carlo  methods  to 
generate  simulation-based  characterizations  of  pos¬ 
terior  densities;  the  results  from  Markov  Chain  the¬ 
ory  show  that  the  stochastic  explorations  of  posterior 
densities  typically  generated  by  MCMC  algorithms 
nonetheless  yield  “simulation-consistent”  summa¬ 
ries  of  the  posterior  density. 

MCMC  algorithms  have  proven  themselves 
amazingly  powerful  and  flexible  and  have  brought 
wide  classes  of  models  and  data  sets  out  of  the 
“too  hard”  basket.  We  have  already  mentioned 
hierarchical  models.  But  other  examples  include 
data  sets  with  a  lot  of  missing  data,  models  with 
many  parameters,  models  with  many  latent  vari¬ 
ables,  mixture  models,  and  flexible  semi-  and  non- 
parametric  models.  Thus,  whatever  one’s  view  of 
the  intellectual  or  philosophical  virtues  of  Bayesian 
inference,  by  adopting  a  Bayesian  approach  to 
inference,  researchers  can  avail  themselves  of 
MCMC  algorithms,  an  extraordinarily  powerful 
set  of  techniques  for  estimation  and  inference. 

A  Historical  Note 

The  result  that  we  now  refer  to  as  Bayes  theorem 
appeared  in  an  essay  attributed  to  the  Reverend 
Thomas  Bayes  and  communicated  to  the  Royal 
Society  after  Bayes’s  death  in  1761.  This  famous 
essay  has  been  republished  many  times.  Several 
authors  have  noted  that  there  is  some  doubt 
whether  Bayes  actually  discovered  the  theorem 
that  was  named  after  him.  Nor  is  it  clear  that 
Bayes  himself  was  a  “Bayesian”  in  the  sense  that 
we  use  the  term  today. 

Apparently  unaware  of  Bayes’s  work,  Pierre- 
Simon  Laplace  stated  Bayes  theorem  in  a  more 
general  form.  Perhaps  because  of  Laplace’s  work 
on  the  subject,  Bayes’s  essay  went  unnoticed  until 
after  1780  and  played  no  important  role  in  scien¬ 
tific  debate  until  the  20th  century.  This  said, 
Bayesian  inference  was  the  dominant  mode  of  sta¬ 
tistical  inference  until  the  early  20th  century, 
although  it  was  known  at  that  time  as  the  “method 
of  inverse  probability.” 

The  adjective  Bayesian  did  not  enter  the  statisti¬ 
cal  vernacular  until  the  20th  century.  The  first  use 
of  Bayesian  as  an  adjective  was  by  Ronald  Fisher, 
in  his  introduction  to  a  paper  originally  written  in 
1921.  Not  surprisingly,  Fisher’s  use  of  the  term 
was  not  flattering,  since  he  was  at  pains  to  contrast 


his  approach  to  statistical  inference  from  the  sub¬ 
jectivism  he  disliked  in  the  Bayesian  approach. 

Simon  Jackman 
Stanford  University 
Stanford,  California,  United  States 
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Statistical  Significance 


Political  scientists  use  the  concept  of  statistical  sig¬ 
nificance  to  validate  theoretical  assertions  about 
statistical  relationships.  In  the  most  basic  sense,  sta¬ 
tistical  significance  means  that  the  hypothesized 
statistical  relationship  is  probably  true.  That  is,  the 
hypothesized  relationship  is  not  due  to  random 
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chance.  When  a  result  is  highly  significant,  it  is  very 
unlikely  to  be  due  to  chance.  But  this  does  not  mean 
that  the  hypothesized  relationship  is  strong.  Statisti¬ 
cal  significance  is  not  to  be  confused  with  substan¬ 
tive  importance.  A  research  finding  may  be  strongly 
validated  but  not  very  important.  Moreover,  sig¬ 
nificance  tests  in  political  science,  and  social  science 
in  general,  do  not  directly  test  the  significance  of  the 
researcher’s  belief  about  the  statistical  relationship 
between  the  variables  of  interest.  Instead,  statistical 
significance  relates  to  how  likely  the  findings  are, 
given  the  assumption  that  there  are  no  differences  in 
the  subgroupings  of  the  population.  This  entry  is 
presented  in  four  parts:  (1)  it  discusses  the  relation¬ 
ship  between  the  significance  test  and  the  research 
hypothesis,  (2)  it  presents  the  elements  of  the  sig¬ 
nificance  test,  (3)  it  explains  the  relationship  between 
statistical  significance  and  the  power  of  the  signifi¬ 
cance  test,  and  (4)  it  briefly  discusses  the  relation¬ 
ship  between  statistical  significance  and  sample  size. 

The  Significance  Test  and  Hypotheses 

A  hypothesis  is  a  statement  about  an  underlying 
population  that  may  be  true  or  false.  Relationships 
in  political  science  are  generally  probabilistic  rather 
than  deterministic,  and  this  fact  has  major  implica¬ 
tions  in  the  design  of  attempts  to  prove  the  exis¬ 
tence  of  hypothesized  relationships  between 
variables.  In  political  analysis,  researchers  are  often 
interested  in  establishing  a  causal  relationship  be¬ 
tween  variables.  Establishing  this  causal  relationship 
involves  meeting  three  conditions:  (1)  establishing 
association  between  variables,  (2)  establishing  an 
appropriate  time  order  between  variables,  and 
(3)  eliminating  alternative  explanations.  Statistical 
significance  is  most  related  to  establishing  associa¬ 
tion  between  variables  and  is  only  tangentially 
related  to  the  other  two  conditions. 

Establishing  statistical  significance  involves  test¬ 
ing  a  hypothesized  statistical  relationship.  Ronald 
A.  Fisher  is  credited  most  with  the  development  of 
the  statistical  significance  concept.  Fisher  proposed 
a  procedure  that  produces  significance  levels  from 
the  data  about  a  single  hypothesis  with  a  known 
distribution  and  a  specified  test  statistic  (such  as  z, 
t,  F,  and  x2)-  Meanwhile,  Jerzy  Neyman  and  Egon 
Pearson  proposed  a  procedure  that  tests  the  prob¬ 
ability  of  a  null  or  restricted  hypothesis  against  a 
research  (alternative)  hypothesis.  As  political  sci¬ 
ence  has  matured  as  a  discipline,  political  scientists 


rely  more  and  more  on  quantitative  data  and  sta¬ 
tistical  methods  to  test  hypothesized  relationships. 
Over  time,  the  fusion  of  Fisher’s  significance  test 
procedure  and  Neyman-Pearson’s  hypothesis  test 
procedure  produced  the  null  hypothesis  statistical 
test,  which  is  now  the  dominant  procedure  for 
determining  statistical  significance  in  political  sci¬ 
ence  and  in  social  science  in  general. 

An  important  aspect  of  significance  tests  is  deter¬ 
mining  whether  to  use  a  one-tailed  or  two-tailed  test 
of  significance,  and  this  decision  is  related  directly 
to  the  form  of  the  paired  hypotheses.  The  null 
hypothesis  (H0)  is  the  hypothesis  to  be  tested  and  is 
usually  a  statement  of  a  parameter  value,  say  0,  that 
corresponds  to  a  parameter  value  of  no  effect,  say 
0*.  In  turn,  the  null  hypothesis  takes  the  form  H0:  0 
=  0*.  In  many  political  and  social  research  applica¬ 
tions  0 is  zero.  Meanwhile,  the  form  of  the  alterna¬ 
tive  hypothesis  determines  whether  the  significance 
test  is  one-tailed  or  two-tailed.  The  two-tailed  test 
takes  the  form  Ha:  0  T  0  *.  This  means  the  alterna¬ 
tive  hypothesis  includes  values  falling  below  and 
above  the  value  of  0 listed  in  H0.  Conversely,  one- 
tailed  alternative  hypotheses  have  the  directional 
forms  of  Ha:  0  <  0"  or  Ha:  0  >  0  *.  The  alternative 
hypothesis  Ha:  0  <  0  *  refers  to  detecting  whether  0 
is  smaller  than  the  particular  number  0  *.  The  alter¬ 
native  hypothesis  Ha:  0  >  0 *  refers  to  detecting 
whether  0  is  larger  than  0*.  In  short,  the  researcher 
predicts  deviation  in  H0  in  a  particular  direction.  It 
is  generally  safest  to  use  a  two-tailed  test,  but  there 
are  some  situations  in  which  the  one-tailed  test  is 
more  appropriate. 

Elements  of  Significance  Tests 

All  significance  tests  have  five  elements:  assump¬ 
tions,  hypotheses,  test  statistic,  p  value,  and  conclu¬ 
sion.  The  concept  of  significance  relates  most  directly 
to  the  p  value,  which  contains  information  concern¬ 
ing  the  probability  of  observing  a  result  like  the 
researcher  observes  if  there  is  no  difference  in  the 
corresponding  groups  in  the  population  from  which 
the  sample  is  drawn.  A  significance  test  involves 
comparing  a  test  value  that  the  researcher  has  calcu¬ 
lated  with  some  critical  value  for  the  statistic. 

Assumptions 

Significance  tests  require  certain  assumptions  for 
test  validity.  These  include  the  type  and  scale  of  the 
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data,  the  form  of  the  population  distribution,  the 
method  of  sampling,  and  the  sample  size.  Data  may 
take  quantitative  or  qualitative  forms.  Additionally, 
data  may  be  collected  from  different  research 
sources:  attitude  surveys,  governmental  documents, 
historical  records,  newspaper  files,  voting  statistics, 
or  other  stored  data.  These  types  of  data  collected 
from  various  sources  suggest  different  distribu¬ 
tional  forms.  Typical  parametric  distributional 
forms  include  the  normal,  Student’s  t,  gamma, 
binomial,  and  chi-square.  Additionally,  some  data 
may  preclude  the  use  of  parametric  testing  proce¬ 
dures  (e.g.,  count  or  rank  data).  Assumptions  must 
also  be  made  concerning  the  type  of  sampling  that 
is  required  for  the  conclusions  drawn  from  the 
sample  to  be  applicable  to  the  specified  population. 
Many  significance  tests  require  a  minimum  sample 
size,  and  the  validity  of  most  tests  improves  as  the 
sample  size  increases.  Sample  size  is  an  important 
factor  in  statistical  significance,  which  is  discussed 
in  greater  detail  as  follows. 

Hypotheses 

Significance  tests  generally  examine  two  hypoth¬ 
eses  about  the  value  of  a  parameter:  the  null 
hypothesis  and  the  alternative  or  research  hypoth¬ 
esis.  The  investigator  designs  the  research  to  deter¬ 
mine  the  amount  of  evidence  that  exists  to  support 
the  research  hypothesis.  That  is,  evidence  against 
the  null  hypothesis  is  evidence  in  support  of  the 
research  hypothesis. 

Test  Statistic 

Test  statistics  generally  follow  the  normal, 
Student’s  t,  F,  or  chi-square  distributions.  The  for¬ 
mation  of  the  test  statistic  usually  involves  a  point 
estimate  of  the  parameter  to  which  the  hypotheses 
refer.  Commonly  used  test  statistics  include  the  z 
score  for  large  sample  means  (y)  and  proportions 
(tt),  the  t  statistic  for  small-sample  means  and  pro¬ 
portions  and  individual  coefficients  in  linear  regres¬ 
sion  models,  the  F  statistic  to  test  the  significance 
of  multiple  linear  regression  models,  the  chi-square 
statistic  (x2)  to  test  independence  of  variables 
and  goodness  of  fit  for  generalized  linear 
models  (GLMs),  the  likelihood  ratio  test  statistic 
(—2  LOG  L)  to  test  nested  GLM  models,  and  the 
Wald  statistic  ( z 2)  to  test  the  significance  of  param¬ 
eters  in  GLM  models. 


p  Value 

The  p  value  provides  a  means  to  interpret  the 
evidence  provided  by  the  test  statistic  on  a  probabil¬ 
ity  scale.  The  p  value  is  the  probability  of  obtaining 
a  test  statistic  value  as  large  as  the  one  obtained, 
conditioned  on  the  null  hypothesis  being  true.  The 
smaller  the  p  value,  the  more  strongly  the  data  con¬ 
tradict  the  null  hypothesis.  Conversely,  a  moderate 
to  large  p  value  indicates  that  the  data  are  consistent 
with  the  null  hypothesis.  The  p  value  is  the  summary 
of  the  evidence  in  the  data  about  the  null  hypothesis. 

Conclusion 

Significance  tests  are  designed  to  minimize  Type  I 
error,  which  is  the  error  of  rejecting  the  null  hypoth¬ 
esis  when  the  null  hypothesis  is  true.  To  establish  the 
minimization  of  Type  I  error,  the  investigator  sets 
the  predetermined  and  small  a  level  at  which  H0  is 
rejected.  The  a  level  of  .05  has  become  the  conven¬ 
tional  level  of  statistical  significance  in  social 
research.  The  .05  level  provides  that  only  5  times  out 
of  100  the  null  hypothesis  will  be  rejected  when  the 
null  is  true.  The  researcher  is  interested  in  the  range 
of  values  of  the  test  statistic  for  which  the  null 
hypothesis  will  be  rejected.  If  the  statistic  produces  a 
p  value  that  is  smaller  than  the  critical  alpha  level, 
the  researcher  has  fairly  strong  evidence  against  the 
null  hypothesis  (H0).  In  the  case  of  a  two-tailed  test, 
a  is  divided  by  2  so  that  half  of  the  .05  probability 
is  in  the  left  tail  of  the  distribution  and  half  of  the 
probability  is  in  the  right  tail  of  the  distribution. 

Significance  and  Power  of  the  Test 

The  Neyman-Pearson  hypothesis  test  approach 
yields  the  idea  of  the  power  of  the  test.  Type  I  error 
can  only  be  made  when  the  null  hypothesis  is  true. 
Conversely,  Type  II  error  can  only  be  made  when 
the  null  hypothesis  is  false  and  is  defined  as  the 
probability  of  failing  to  reject  the  null  hypothesis 
when  the  null  hypothesis  is  false.  Type  II  error  is 
usually  referred  to  as  beta  ((3).  Power  (1  —  (3)  is 
related  to  statistical  significance  in  that  it  is  the 
probability  of  rejecting  the  null  hypothesis  condi¬ 
tioned  on  the  fact  that  the  null  hypothesis  is  false. 
In  other  words,  power  is  the  probability  of  detect¬ 
ing  an  association  (effect)  given  that  there  is 
indeed  an  association  (effect).  Under  the  power 
rubric,  the  researcher  seeks  to  develop  and  employ 
the  test  that  has  the  highest  power  for  a  given 
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predetermined  a  level  through  sample  size,  desired 
significance  level,  and  the  research  hypothesis. 

Statistical  Significance  and  Sample  Size 

There  are  two  misunderstandings  related  to  the 
relationship  between  statistical  significance  and 
sample  size:  (1)  statistical  significance  in  large 
samples  translates  into  substantive  importance  and 
(2)  p  values  associated  with  larger  samples  are 
more  reliable  than  identical  p  values  in  smaller 
samples.  In  the  first  misunderstanding,  as  sample 
size  increases,  the  researcher  is  increasingly  likely 
to  detect  a  significant  effect.  Additionally,  as  sam¬ 
ple  size  increases,  it  is  more  likely  that  the  investi¬ 
gator  will  detect  smaller  population  effect  sizes 
progressively.  This  means  that  large  sample  sizes 
are  more  likely  to  reveal  statistical  significant  rela¬ 
tionships,  but  these  statistically  significant  effects 
may  relate  to  small  substantive  effects.  In  the  sec¬ 
ond  misunderstanding,  a  p  value  derived  from  a 
larger  sample  is  not  more  reliable  than  an  identical 
p  value  produced  by  a  smaller  sample.  Two  studies 
that  reject  the  null  hypothesis  with  the  same  p 
value  are  equally  likely  to  make  Type  I  errors.  This 
statement  holds  true  even  if  the  sample  sizes  are 
dramatically  different. 
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Empirical  work  is  about  estimating  the  relationships 
among  variables  and  then,  based  on  the  values  of 
and  confidence  in  those  estimates,  discussing  their 
substantive  implications.  These  relationships  are 
expressed  as  a  model  relating  some  outcome  or  set 
of  outcomes,  the  endogenous  variable  or  variables, 
denoted  by  Y,  to  a  set  of  explanatory  variables,  con¬ 
ventionally  denoted  by  X  and  referred  to  as  the 
exogenous  or  causal  variables.  The  models  may  be 
explicitly  referred  to  as  causal  models,  but  the  term 
causal  is  frequently  treated  as  implicit. 

The  current  commonly  used  definition  of  cau¬ 
sation,  referred  to  as  the  Neyman-Holland-Rubin 
(NHR)  model,  is  based  on  how  one  might  expect 
Y  to  be  different  if  there  were  a  different  value  for 
X  in  an  otherwise  identical  world.  A  second  view 
is  that  causation  is  a  process  of  interactions  that 
connect  values  of  X  to  values  of  Y  and  that  this 
process  depends  on  the  context  being  studied.  For 
example,  being  female  might  be  the  “cause”  of 
women’s  wages  being  lower  than  wages  for  men 
with  the  same  education  and  experience  because 
of  the  chain  of  actions  that  occur  in  the  labor 
market,  not  because  an  additional  X  chromosome 
affects  wages.  An  important  research  objective  is 
elaborating  and  examining  empirically  the  com¬ 
ponents  of  this  chain  in  their  sequential  order.  All 
views  of  causation  imply  a  temporal  sequence, 
where  a  change  in  X  must  occur,  or  have  a  very 
strong  likelihood  of  occurring,  prior  to  the  change 
in  Y  even  if  the  time  lag  is  smaller  than  the  inter¬ 
vals  at  which  one  observes  X  and  Y.  An  important 
requirement  for  models  and  empirical  work  is 
that  they  improve  our  understanding  of  this 
causal  process. 

Causal  models  and  thinking  are  a  central  part  of 
our  personal  and  professional  lives.  If  you  hit  your 
thumb  with  a  hammer,  it  will  hurt  and  you  will  at 
a  minimum  get  a  bad  bruise.  More  abstractly, 
social  scientists  talk  regularly  in  causal  terms:  Will 
mailings  increase  voter  turnout?  Will  reducing 
class  size  improve  student  learning?  Will  more  on- 
the-job  training  raise  wages?  Will  economic  sanc¬ 
tions  bring  a  nation  to  the  bargaining  table?  These 
examples  involve  a  chain  of  reasoning  underlying 
the  causal  statement  connecting  the  change  in  one 
variable  to  the  change  in  the  other.  The  hammer 
hitting  a  thumb  does  not  directly  cause  the  bruise, 
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pain,  and  swelling.  Post  cards,  smaller  classes,  job 
training,  and  sanctions  per  se  do  not  cause  voters 
to  turn  out,  students  to  get  higher  test  scores, 
wages  to  increase,  or  countries  to  bargain,  but  we 
proceed  as  if  we  believe  they  do. 

Causal  statements  vary  in  the  confidence  one  has 
in  the  causal  logic  or  in  the  chain  of  events  linking  Y 
to  X  and  therefore  in  the  prediction  of  how  Y  is 
likely  to  be  different  if  X  is  different.  For  example, 
one  can  confidently  predict  that  your  thumb  will 
hurt,  swell,  and  turn  purple  if  you  hit  it  with  a  ham¬ 
mer.  There  is  less  confidence  in  the  prediction  that 
turnout  will  increase  if  all  potential  voters  receive  a 
mailing  or  a  phone  call  before  election  day.  And 
there  is  far,  far  less  confidence  in  the  prediction  that 
economic  sanctions  will  lead  a  rogue  state  to  negoti¬ 
ate.  This  entry  provides  a  brief  introduction  to  the 
types  of  data  and  statistical  methods  used  in  the 
social  sciences  to  try  to  reduce  this  uncertainty  about 
causal  arguments  and  to  estimate  the  magnitude  of 
the  relationship  between  Y  and  the  X. 

The  entry  begins  with  a  very  brief  discussion 
of  the  common  models  representing  causal  state¬ 
ments.  This  is  followed  by  a  discussion  of  the 
different  strategies  for  designing  empirical  stud¬ 
ies,  of  the  types  of  data  that  are  frequently  avail¬ 
able  in  these  studies,  and  of  some  of  the  common 
statistical  methods  used  to  evaluate  and  estimate 
the  causal  model.  The  entry  concludes  with 
some  cautions  about  the  types  of  conclusions 
that  can  be  reached  in  these  different  circum¬ 
stances  and  with  a  plea  for  modesty  in  any 
empirical  study. 

Models,  Empirical  Analysis, 
and  Causal  Inferences 

Models  facilitate  predictions  of  how  an  outcome  is 
likely  to  differ  if  some  other  variable  is  changed, 
represented  as  AY  =  F  (AX).  Since  the  social  sci¬ 
ences  discuss  outcomes  probabilistically,  the 
expression  should  refer  to  the  expected  change  in 
Y,  or  if  Y  is  a  discrete  variable,  the  change  in  the 
probability  that  Y  changes  its  state — that  is,  E  (AY) 
=  F  (AX  )  or  E  [Pr(AY  )J  =  F  (AX).  For  the  pur¬ 
poses  of  this  discussion,  the  origin  of  the  model,  be 
it  mathematical  deduction,  previous  empirical 
results,  or  conventional  wisdom,  is  not  important. 
What  is  important  is  that  there  be  a  set  of  logical 
and  consistent  statements  about  how  or  why 
changes  in  X  cause  changes  in  Y. 


In  some  instances  the  model  may  be  extremely 
simple.  Consider  Harold  Gosnell’s  pioneering  field 
study  of  whether  pre-election  mailings  increase 
voter  turnout.  Here  the  outcome  variable  Vindicates 
is  the  probability  that  a  subject  votes.  The  variable 
X  indicates  whether  the  person  received  the  mail¬ 
ing  and  is  a  binary  variable — the  person  did  or  did 
not  receive  the  mailing.  The  model  is 

Y  =  B0  +  B1X1,  (1) 

where  B0  is  the  probability  of  voting  among  those 
who  did  not  receive  the  mailing,  X4  =  0,  and  B ,  is 
the  increase  in  that  probability  among  those  who 
received  the  mailing,  X1  —  1.  In  the  NHR  model, 
B1  is  the  magnitude  of  the  counterfactual  of  what 
would  have  been  the  outcome  if  a  person  who  did 
not  receive  the  mailing  had  received  it. 

The  purpose  of  the  study  was  to  estimate  the 
magnitude  of  Br  It  was  not  just  enough  to  know 
that  the  mailing  increased  turnout,  it  was  also 
important  to  estimate  by  how  much,  as  that  would 
determine  whether  it  was  worth  the  costs  in  money 
and  time  to  use  mailings  in  future  elections.  Although 
Gosnell  does  not  refer  to  this  as  an  explicit  model, 
there  very  definitely  is  a  causal  model  in  his  study. 
Other  models  often  relate  continuous  variables  in 
the  same  manner.  Studies  of  educational  achieve¬ 
ment  use  Equation  1  to  relate  the  expected  perfor¬ 
mance  on  standardized  test  scores,  Y,  to  class  size, 
X15  both  of  which  are  continuous  variables.  Here,  B1 
indicates  how  test  scores  are  expected  to  change 
with  each  reduction  in  class  size,  with  the  expecta¬ 
tion  that  B  j  is  negative.  Again,  the  magnitude  of  Bl 
is  important  as  it  is  central  to  any  benefit-cost  analy¬ 
sis  for  hiring  more  teachers.  Though  models  rapidly 
became  more  complex,  this  simple  model  is  the  basis 
for  much  empirical  work. 

Models  often  include  multiple  causal  variables 
because  of  what  is  being  modeled  and  because  of 
the  model’s  purpose.  Most  outcomes  of  interest  to 
social  scientists  are  not  the  result  of  single  causal 
factors,  so  it  is  important  to  understand  and  to 
compare  the  influence  of  the  various  factors.  Voter 
turnout,  for  example,  can  be  stimulated  by  per¬ 
sonal  contacts,  X,;  phone  calls,  X3;  media  promo¬ 
tions,  X4;  and  same-day  registration,  X5,  as  well  as 
mailings,  Xr  Decisions  are  often  based  on  the  rela¬ 
tive  influence,  cost,  and  side  effects  of  these  fac¬ 
tors:  Are  personal  contacts  more  cost-effective  at 
stimulating  turnout  than  mailings,  than  phone 
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calls,  than  same-day  registration?  These  interests 
lead  to  multivariate  models — for  example, 

Y  —  B0  Y  BjXj  +  B2X2  +  B3X3 

+  B4X4  +  B5X5.  (2) 

One  may  conduct  an  empirical  study  where  only 
one  X  is  varied  to  isolate  the  effect  of  changes  in 
that  X  on  Y,  but  ultimately  the  analyst  will  want 
to  know  the  magnitude  of  all  the  coefficients. 

Increasingly,  models  are  featuring  nonlinear 
specifications,  which  add  to  their  richness  as  well  as 
to  their  complexity.  The  simplest  nonlinearities  are 
introduced  with  algebraic  transformations,  such  as 
logs,  powers,  roots,  and  trig  functions,  of  continu¬ 
ous  variables.  Such  transformations  make  the  rela¬ 
tionships  between  Y  and  the  Xs  nonlinear  and 
nonseparable.  Many  studies  of  educational  achieve¬ 
ment,  for  example,  model  the  log  of  test  scores  as  a 
function  of  the  logs  of  the  various  explanatory  vari¬ 
ables.  This  specification  allows  for  increasing  or 
decreasing  as  well  as  constant  returns  to  each  input 
and  indicates  that  the  marginal  effect  of  any  one 
input  depends  on  the  levels  of  the  other  inputs.  This 
property  contrasts  with  that  of  the  linear  and  sepa¬ 
rable  model,  where  the  marginal  effect  is  constant 
and  independent  of  the  values  of  the  other  variables. 

Models  may  also  be  nonlinear  in  the  values  of 
the  parameters,  which  may  vary  randomly  or  sys¬ 
tematically  with  time  or  as  functions  of  other 
explanatory  variables.  This  general  form  is 

Yj  =  F(Xj,B).  (3) 

Note  that  Equation  2,  which  is  linear  in  the 
coefficients,  is  a  special  case  of  Equation  3.  The 
nonlinearities  add  greatly  to  the  ability  of  the  mod¬ 
els  to  represent  complex  phenomena,  including  the 
possibility  that  some  coefficients  may  vary  system¬ 
atically  among  potential  observational  units.  In  the 
linear  model  in  Equation  2  this  could  mean  that 
the  B  coefficients  are  denoted  with  an  i  subscript, 
indicating  that  they  vary  across  cases.  While  being 
more  general  in  form,  the  nonlinear  models  pres¬ 
ent  estimation  challenges. 

Models  may  also  encompass  multiple  outcomes, 
which  are  both  the  consequence  and  the  cause  of 
other  variables.  An  early  form  of  such  a  model  is 
Peter  Blau  and  Otis  Dudley  Duncan’s  hierarchical 
model  of  U.S.  occupational  status  attainment.  An 
individual’s  occupation  at  time  t  is  modeled  as  a 


direct  function  of  his  or  her  father’s  occupation,  the 
person’s  education  level,  and  the  first  job;  while  the 
latter  two  variables  are  themselves  related,  both  are 
related  to  father’s  occupation.  In  this  model,  status 
attainment  entails  the  interrelationship  among  sev¬ 
eral  endogenous  variables.  A  more  complex  example 
is  Patrick  Brandt  and  John  Freeman’s  model  of  the 
U.S.  macropolitical  economy,  which  uses  nine  inter¬ 
acting  variables,  each  with  its  own  equation.  As 
discussed  in  the  section  on  multi-equation  models, 
models  with  multiple  endogenous,  or  outcome,  vari¬ 
ables  present  challenging  data  and  estimation  prob¬ 
lems.  These  are  a  consequence  of  the  model  and  the 
phenomena  being  represented,  and  a  single  endoge¬ 
nous  variable  and  equation  would  be  inadequate. 

The  discussion  of  models  and  their  uses  is  an 
important  introduction  to  empirical  estimation.  The 
empiricist’s  task  is  twofold.  One  objective  is  to  pro¬ 
vide  evidence  about  the  model’s  credibility — how 
consistent  the  model  is  with  empirical  evidence,  in 
how  broad  a  set  of  contexts,  and  when  compared 
with  other  possible  explanations.  Arthur  Stinch- 
combe,  in  1968,  made  the  still  relevant  point  that 
most  empirical  studies  do  not  refute  theoretical 
propositions  but  demonstrate  their  consistency  with 
observable  data.  The  more  frequent  and  broader  the 
contexts  in  which  this  consistency  is  observed,  the 
more  credible  the  model.  This  increased  credibility 
then  raises  one’s  confidence  in  the  model’s  predic¬ 
tions  about  what  change  in  Y  to  expect  from  a 
specified  change  in  X.  The  second  empirical  task  is 
to  estimate  the  magnitude  of  the  model’s  parame¬ 
ters.  This  enables  one  to  say  what  change  in  Y  is 
expected  from  a  change  in  X,  or  in  different  Xs,  and 
something  about  the  uncertainty  in  this  expectation. 

A  second  reason  for  this  brief  summary  of  mod¬ 
els  is  that  empirical  researchers  must  have  a  clear 
understanding  of  the  model  prior  to  any  study.  A 
model’s  content  and  structure  determine  the  most 
appropriate  empirical  strategy,  including  the  design 
of  the  study,  the  types  of  data,  and  the  statistical 
methodology.  The  linear  model  in  Equation  1  or  2 
can  be  estimated  with  a  range  of  methods  and  data 
that  would  be  inappropriate  or  unavailable  for 
models  with  nonlinear  coefficients  or  extensive 
interactions  among  the  explanatory  variables. 
Models  that  emphasize  dynamic  adjustments  may 
require  time-series  data  with  stringent  stability  and 
equilibrium  properties.  Many  important  phenom¬ 
ena  are  not  amenable  to  explicit  manipulation  as  is 
done  in  experimental  studies,  so  the  models  must  be 
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examined  with  observational  data.  And  the  list  goes 
on.  It  is  critical  for  a  credible  empirical  study  that 
the  design,  data,  and  methods  match  the  properties 
of  the  model,  a  requirement  that  must  be  met  before 
any  data  are  collected  and  any  numbers  crunched. 

The  Experimental  Method 

The  textbook  lab  or  field  experiment  is  often  cited 
as  the  gold  standard  for  empirical  work.  The  model 
in  this  case  is  Equation  1.  In  a  well-designed  and 
conducted  experiment,  the  values  of  X  are  deter¬ 
mined  by  the  researcher  and  randomly  assigned  to 
subjects,  and  neither  the  subject  nor  the  individuals 
measuring  the  outcomes  know  the  values  of  X  for 
each  subject.  We  refer  to  these  conditions  as  exoge¬ 
neity,  randomization,  and  double-blindedness, 
respectively.  If  these  conditions  are  met,  a  simple 
linear  regression  will  give  unbiased  estimates  for  B1 
with  both  binary  and  continuous  variables.  And  as 
the  number  of  subjects  becomes  larger,  the  balance 
between  the  control  and  treatment  groups  with 
respect  to  unobserved  variables  becomes  better, 
meaning  that  estimated  results  become  more  accu¬ 
rate  estimates  of  the  true  causal  relationship.  An 
experiment  may  also  be  conducted  under  very  con¬ 
trolled  conditions  so  that  other  factors  that  might 
influence  Y  are  constant.  It  is  worth  examining 
these  conditions  and  their  contribution  to  accurate 
estimates  as  these  conditions  become  important  in 
developing  other  methods. 

The  fact  that  the  researcher  determines  the  val¬ 
ues  for  X  precludes  the  possibility  that  these  varia¬ 
tions  are  the  result  of  changes  in  Y  or  of  some 
aspect  of  the  data  collection  process,  both  of 
which  will  prevent  accurate  estimates  of  Bv  This 
can  be  a  major  problem  with  observational  data. 
Consider  the  positive  correlation  between  media 
attention  and  turnout.  Does  hearing  more  political 
information  lead  one  to  vote,  or  does  an  intention 
to  vote  cause  one  to  pay  more  attention  to  media 
coverage  of  the  campaign?  If  the  researcher  deter¬ 
mines  each  person’s  media  exposure  within  the 
experiment,  it  rules  out  the  latter  explanation. 

Randomization  is  critical  in  any  experiment. 
Subjects  will  not  be  identical  on  all  factors  other 
than  the  variable  of  interest.  Gosnell’s  mailing 
would  reach  people  of  different  ages,  education 
levels,  genders,  political  interests,  and  so  on.  All 
these,  and  many  more,  factors  will  be  associated 
with  differences  in  turnout.  These  excluded  factors 


are  represented  in  the  model  by  adding  a  stochastic 
term  denoted  by  U  to  Equation  1: 

Y  -  B0  +  B1X1  +  U,  (4) 

where  U  represents  all  these  factors  and  how  they 
affect  the  observed  value  of  Y.  Now  suppose  that 
the  mailings  happened  to  be  sent  to,  or  read  by, 
those  more  disposed  to  vote.  Then  comparisons  of 
turnout  between  those  receiving  and  those  not 
receiving  the  mailing  would  badly  bias  the  estimate 
of  the  effect  of  the  mailing.  Formally,  the  values  of 
17,  which  include  the  effects  of  predispositions, 
would  be  correlated  with  X1 

Randomization,  if  done  properly  and  if  the 
sample  is  large  enough,  ensures  that  the  composi¬ 
tion  of  the  group  receiving  the  mailing  is  identical 
to,  or  balanced  with,  the  composition  of  the  group 
that  did  not  receive  the  mailing.  Turnout  will  be 
higher  among  those  predisposed  to  vote,  but 
because  each  group  has  an  equal  proportion  of 
such  people,  differences  in  turnout  between  the  two 
groups  would  not  be  the  result  of  one  group  having 
more  committed  voters  than  the  other.  More  for¬ 
mally,  X  and  U  are  uncorrelated.  Randomization, 
when  done  properly,  makes  the  reliability  of  exper¬ 
iments  far  greater  than  that  of  observational  studies 
and  greatly  simplifies  the  statistical  analysis,  as  one 
only  needs  to  estimate  the  coefficients  in  Equation 
4,  which  can  be  done  with  ordinary  regression  or 
comparisons  of  means  if  X  is  binary. 

Any  factor  that  does  not  vary  in  the  sample  can 
be  ruled  out  as  an  alternative  explanation  for  the 
observed  variations  in  Y,  lending  greater  credibility 
to  the  variations  in  X  as  the  cause.  If  Gosnell  had 
sent  mailings  only  to  people  under  25  who  had  not 
voted  before  and  compared  their  turnout  with  that 
of  25-year-old  nonvoters  who  did  not  get  the  mail¬ 
ing,  then  age  and  past  voting  behavior  would  not 
explain  any  differences  in  turnout  between  the  two 
groups,  increasing  one’s  confidence  in  the  conclu¬ 
sion  that  turnout  differences  are  related  to  receiving 
the  mailing.  In  some  research,  it  is  possible  to  design 
or  select  observations  such  that  possible  confound¬ 
ing  factors  are  constant  within  the  sample,  greatly 
reducing  the  possibility  of  bias. 

If  Equation  1  accurately  models  the  substantive 
question  and  an  experiment  is  feasible,  then  a  well- 
designed  and  executed  experiment  will  provide 
very  credible  estimates  of  the  effects  of  X  on  Y. 
Much  of  political  methodology,  however,  addresses 
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situations  where  the  model  of  interest  does  not 
conform  to  Equation  1  or  where  an  experiment 
was  not  or  could  not  be  conducted  according  to 
the  requisite  standards,  though  there  has  been  a 
dramatic  increase  in  the  development  of  creative 
experimental  designs  in  the  social  sciences.  The 
remainder  of  the  entry  discusses  the  procedures 
analysts  use  to  deal  with  these  shortcomings.  The 
detailed  discussion  of  the  experimental  method  is 
important  in  understanding  these  procedures  as 
many  of  them  are  designed  to  reproduce  what  can 
be  accomplished  with  a  proper  experiment.  It  is 
also  important  to  see  the  experimental  approach  as 
being  consistent  with  one,  but  not  all,  models. 

Observational  Studies 

Many  substantive  areas  of  interest  in  political  sci¬ 
ence  are  not  amenable  to  experiments,  despite  recent 
expansions  in  the  use  of  experiments.  In  some 
instances,  the  manipulations  required  by  the  model 
are  technically  possible  but  are  prohibited  on  ethical 
grounds.  More  often,  X  cannot  be  controlled  and 
varied  by  the  researcher.  Astronomers  cannot  create 
black  holes  or  alter  galaxies,  medical  researchers 
cannot  force  people  to  engage  in  risky  behaviors, 
and  political  scientists  cannot  create  economic 
depressions,  randomly  assign  representatives  to  dis¬ 
tricts,  start  or  stop  wars,  and  so  on.  All  rely  on 
observational  studies,  meaning  data  that  are  gener¬ 
ated  by  the  world  outside  researchers’  control. 
Observational  studies  question  the  randomization 
and  exogeneity  conditions.  Considerable  attention  is 
devoted  to  addressing  these  questions.  The  types  of 
data  generated  by  observational  studies  also  increase 
the  uncertainty  of  the  causal  interpretations  one  can 
draw  from  the  analysis  of  these  data. 

Nonrandomization 

The  classical  response  to  the  lack  of  random¬ 
ization  in  observational  studies  is  the  multivariate 
regression  shown  in  Equation  2.  The  problem 
with  estimating  the  model  in  Equation  4  is  that 
the  stochastic  term,  17,  contains  factors  other  than 
Xj  that  are  related  to  Y  and  that  are  correlated 
with  X,.  (Random  assignment  in  an  experiment, 
when  achieved,  means  that  they  should  be  uncor¬ 
related.)  If  one  can  specify  and  construct  vari¬ 
ables  for  these  factors,  they  can  be  included  in  the 


multivariate  regression  shown  in  Equation  2, 
expanded  to  include  the  stochastic  term  and  as 
many  explanatory  variables  as  necessary: 

Y  —  Bq  +  f>iXi  +  BiX  2  +  -B3X3 

+  •  •  •  +  BkXk  +  U.  (5) 

The  additional  variables  are  included  not  to  assess 
their  relative  influence  on  Y,  though  this  may  be  an 
important  result,  but  to  include  potentially  con¬ 
founding  factors  in  the  analysis.  The  interpretation 
of  Equation  5  is  that  B1  indicates  the  change  in  Y 
expected  for  a  unit  change  in  X,;  holding  X2 
through  XK  constant.  If  one  has  been  successful  in 
including  all  those  factors  that  are  causally  related 
to  Y,  meaning  their  true  coefficient  is  different 
from  zero,  and  that  are  correlated  with  Xj  in  the 
sample,  then  U  is  uncorrelated  with  X,  and  the 
estimate  for  B1  obtained  with  an  ordinary  least 
squares  (OLS)  regression  is  unbiased,  as  it  was  in 
the  experimental  setting. 

In  some  applications,  OLS  is  not  the  most  appro¬ 
priate  estimator  because  of  the  limited  nature  of  the 
outcome  variable.  Examples  are  when  outcomes  are 
observed  as  dichotomies,  such  as  turnout;  discrete 
categories,  such  as  votes  in  a  multiparty  election; 
event  counts,  such  as  the  numbers  of  conflicts  in 
given  time  periods;  or  durations,  such  as  the  likely 
length  of  governments  formed  under  different  condi¬ 
tions.  The  latter  example  is  also  directly  related  to 
hazard  models,  which  estimate  the  probability  of 
failure  at  some  time  t,  given  survival  up  to  that  time, 
and  event  history  models,  which  examine  the  length 
of  time  between  events.  These  outcome  measures, 
which  are  not  continuous,  unbounded,  interval  vari¬ 
ables,  violate  the  conditions  assumed  in  OLS  regres¬ 
sions.  The  form  of  the  relationship  between  the 
observed  variables  is  unlikely  to  be  linear,  and  it  is 
even  unlikely  to  be  linear  in  the  coefficients,  preclud¬ 
ing  the  conventional  transformations  that  might 
create  such  a  linear  model.  Furthermore,  the  form 
and  distribution  of  the  stochastic  term,  U,  in 
Equaiton  5  will  not  meet  the  requirements  of  the 
OLS  estimator.  Taken  together,  these  difficulties  are 
likely  to  lead  to  biased  estimates  of  the  coefficients  if 
OLS  is  used  with  the  observed  data. 

These  problems  have  stimulated  the  develop¬ 
ment  of  a  vast  array  of  estimation  methods  tailored 
to  the  specific  distributional  properties  of  the 
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resulting  stochastic  term.  These  have  the  popular 
names  of  probit  and  logit,  with  their  various  modi¬ 
fiers;  Poisson  and  negative  binomial;  Weibull  and 
Cox  hazard  models,  and  so  on.  They  have  a  com¬ 
mon  approach,  which  is  to  specify  a  distribution 
for  the  stochastic  term  and  to  use  this  distribution 
to  derive  an  expression  for  the  likelihood  of  observ¬ 
ing  the  data.  This  description  of  the  data  is  referred 
to  as  the  data-generating  process,  and  the  expres¬ 
sion  is  referred  to  as  the  likelihood  function,  which 
measures  the  likelihood  of  the  data  for  specific 
values  of  the  parameters.  (The  value  of  the  likeli¬ 
hood  function  is  directly  proportional  to  the  prob¬ 
ability  of  observing  the  data  given  the  model  and 
parameters.)  Estimation  proceeds  by  finding  the 
values  for  these  parameters  that  maximize  this  like¬ 
lihood  function — or,  in  most  instances,  the  log  of 
the  likelihood  function — hence  the  name  maximum 
likelihood  estimation  (MLE).  If,  for  example,  the 
stochastic  term  in  Equation  5  is  normally  distrib¬ 
uted,  the  maximum  likelihood  estimator  is  OLS.  If 
the  data-generating  process  is  described  properly, 
the  MLE-derived  estimates  will  have  desirable  sta¬ 
tistical  properties  if  the  sample  size  is  large,  and  in 
most  applications,  they  are  easily  computed.  These 
methods’  statistical  properties  only  hold  in  large 
samples  and  usually  require  stricter  distributional 
assumptions  about  the  Us  than  the  more  straight¬ 
forward  unncorrelatedness  of  X  and  U  required  in 
OLS  estimation. 

Unfortunately,  what  is  straightforward  in  theory 
is  quite  difficult  in  practice.  There  are  no  statistical 
tests  to  determine  if  U  is  uncorrelated  with  each  X 
or  if  one  has  the  correct  data-generating  process. 
The  Us  are  not  observed  and  for  individual  observa¬ 
tions  are  not  well  approximated  by  the  residuals  in 
the  regression.  Furthermore,  by  construction  the 
correlation  between  these  residuals  and  each  X  is 
zero,  so  that  result  cannot  be  used  to  argue  that  U 
and  the  Xs  are  uncorrelated.  One  must  use  the 
available  theories  and  detailed  knowledge  of  the 
context  in  which  the  data  were  collected  to  argue 
that  the  uncorrelatedness  condition  is  met.  Simply 
adding  more  variables  in  the  hope  that  U  will  even¬ 
tually  become  uncorrelated  with  the  Xs  is  not  a 
remedy  as  this  process  introduces  its  own  problems 
and  errors.  Such  “kitchen  sink”  regressions  may  not 
reduce  bias  and  certainly  run  the  risk  of  increasing 
the  coefficients’  standard  errors  and  thereby  increas¬ 
ing  the  coefficients’  mean  squared  errors. 


Mimicking  Experimental  Methods 
With  Observational  Data 

Deep  skepticism  about  the  ability  to  meet  the 
conditions  for  unbiasedness  using  linear  regression 
models  has  led  to  an  extensive,  growing,  and 
sophisticated  literature  on  alternatives  for  situa¬ 
tions  where  only  observational  data  are  available. 
This  section  briefly  reviews  three  of  these  alterna¬ 
tives.  Readers  wanting  more  detail  are  referred  to 
other  survey  articles. 

Natural  Experiments.  Though  this  would  seem  to 
be  an  oxymoron,  there  are  instances  where 
“nature”  randomly  assigns  treatments  to  one  set  of 
observations  and  not  others,  creating  balanced 
treatment  and  control  groups.  If  these  assignments 
are  truly  random,  then  analysis  of  the  resulting 
data  using  comparisons  of  means  and  simple  linear 
models  can  proceed  as  if  the  data  were  generated 
by  an  experiment.  An  example  of  research  based 
on  a  natural  experiment  are  studies  assessing  the 
effect  of  additional  income  on  a  range  of  behav¬ 
iors,  such  as  political  attitudes  or  consumption,  by 
studying  lottery  winners.  The  assumption  is  that 
lotteries  are  random  events,  so  that  the  amount  of 
money  won  is  equivalent  to  random  assignment 
and  thus  is  uncorrelated  with  other  factors  that  are 
related  to  the  behavior  being  studied. 

Natural  experiments  are  also  useful  in  creating 
continuous  variables  that  are  correlated  with  out¬ 
comes  of  interest  but  uncorrelated  with  other  mea¬ 
sures  of  behavior  because  of  nature’s  randomness. 
The  value  of  such  variables  is  discussed  in  a  subse¬ 
quent  section  on  endogeneity  and  instrumental 
variables  (IVs).  Studies  of  the  ability  of  U.S.  federal 
disaster  relief  aid  to  buy  support  for  incumbent 
administrations  and  of  the  effects  of  past  vote  pat¬ 
terns  on  its  distribution  rely  on  the  randomness  of 
hurricane  paths  in  Florida.  The  destruction  caused 
by  a  hurricane  can  eventually  cover  a  wide  area, 
but  its  exact  path  over  local  areas  is  random.  This 
randomness  means  that  the  local  variations  in  a 
hurricane’s  severity  are  unlikely  to  be  correlated 
with  the  community’s  characteristics,  such  as 
income  or  support  for  the  current  government  in 
previous  elections.  A  variable  measuring  this  sever¬ 
ity  can  then  be  used  to  predict  what  aid  levels 
would  be  independent  of  economic  and  political 
considerations. 
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Regression  Discontinuity.  This  strategy,  which  is  a 
more  elaborate  version  of  the  natural  experiment, 
takes  advantage  of  the  natural  random  assignment 
of  observations  to  treatment  and  control  groups. 
The  term  discontinuity  arises  because  these  natural 
experiments  occur  around  some  exogenously  given 
threshold.  The  outcomes  for  the  two  groups  can 
then  be  compared  to  assess  the  effects  of  the  natural 
treatment.  For  example,  a  study  of  the  electoral 
advantage  of  incumbency  uses  this  method  to  com¬ 
pare  the  vote  share  of  incumbents  who  won  their 
previous  election  with  less  than  52.5%  with  that  of 
the  freshman  representatives  who  replaced  incum¬ 
bents  who  lost  with  more  than  47.5%  of  the  two- 
party  vote.  The  proposition  is  that  election  outcomes 
are  sufficiently  random  within  this  band  (around 
50%)  so  that  incumbency  is  effectively  a  random 
treatment.  This  randomization  is  used  to  justify  the 
comparison  of  the  vote  shares  of  these  two  groups  as 
a  measure  of  the  incumbency  advantage.  Regression 
discontinuity  designs  have  a  relatively  long  history, 
coming  into  and  out  of  favor  in  different  disciplines. 
They  are  currently  generating  considerable  interest 
in  economics  and,  more  recently,  in  political  science. 

Matching.  This  method  stems  from  the  work  of 
Donald  Rubin  and  Paul  Rosenbaum,  separately  and 
jointly  with  others.  It  does  not  try  to  create  random 
assignment  but  compares  observations  in  the  treat¬ 
ment  group  with  observations  selected  from  the 
control  group  that  have  similar  characteristics.  A 
logical  difficulty  with  the  NHR  causal  model  is  that 
it  is  impossible  to  answer  the  counterfactual  of  what 
an  untreated  subject  would  have  done  if  she  or  he 
had  been  treated  because  both  states  cannot  be 
observed  for  the  same  individual.  The  ideal  next  step 
is  to  have  identical  cases  of  those  receiving  treatment 
and  those  who  do  not.  The  statistical  exercise  called 
matching  finds  those  appropriate  matches,  meaning 
the  most  similar  untreated  and  treated  observations, 
among  the  observed  data.  This  process  gets  compli¬ 
cated  when  one  has  to  match  on  a  multidimensional 
set  of  variables,  some  of  which  may  be  continuous, 
such  as  age.  There  are  a  number  of  procedures  for 
measuring  similarity  and  for  deciding  which  obser¬ 
vations  to  compare. 

If  certain  conditions  are  met,  then  matching 
methods  have  very  desirable  statistical  properties, 
including  reduced  bias  and  smaller  mean  squared 
errors  relative  to  other  multivariate  methods. 


Comparisons  of  results  from  experiments  with 
randomization  with  results  from  analysis  of  non- 
randomly  assigned  groups  using  matching  report 
comparable  results  with  both  methods.  This  gives 
further  credence  to  the  efficacy  of  the  matching 
strategy,  when  done  properly. 

The  design  in  these  methods  is  to  re-create  statisti¬ 
cally  the  advantages  of  a  true  experiment  by  either 
controlling  potential  confounding  variables  through 
the  matching  process  or  randomizing  them  through 
the  natural  setting  or  the  discontinuity.  The  causal 
model  in  each  instance  is  still  the  one  represented  in 
Equation  1,  in  which  the  effect  of  the  treatment  is 
separable  from  and  additive  to  any  other  factors. 

Observational  Data  and  Causal  Interpretations 

Observational  data  limit  in  a  number  of  ways 
one’s  confidence  in  the  causal  interpretations  given 
the  model  and  the  empirical  results.  Causal  argu¬ 
ments  have  an  implied  time  frame,  with  changes  in 
X  preceding  changes  in  Y.  In  an  experimental  set¬ 
ting,  this  condition  is  met  as  the  researcher  controls 
for  each  case  the  change  in  X  prior  to  observing 
whether  Y  changes.  Often  with  observational  data, 
the  information  on  X  and  Y  is  not  observed  within 
the  necessary  time  frame,  making  it  hard  to  deter¬ 
mine  if  a  change  in  X  preceded  a  change  in  Y  and 
by  how  much  Y  changed  as  a  result. 

Observational  data  come  in  several  forms.  Some 
data  may  be  from  a  cross  section  of  the  popula¬ 
tion,  meaning  the  cases  are  observed  only  once; 
from  a  panel,  meaning  the  same  units  are  observed 
multiple  times;  from  a  time  series,  where  units  are 
observed  at  regular  intervals;  and  from  pooled 
time  series  and  cross  sections,  where  multiple  units 
are  observed  at  regular  intervals.  The  distinction 
between  panel  data  and  a  time  series  is  the  number 
and  frequency  of  the  temporal  observations.  Event 
history  data,  which  are  commonly  used  in  some 
subfields  in  sociology  and  political  science,  are  a 
form  of  time  series  that  provides  data  for  an  indi¬ 
vidual’s,  an  organization’s,  or  a  country’s  history. 

Causal  inferences  with  cross-sectional  data  and 
the  model  in  Equation  5  require  several  conditions. 
Assume  that  the  adjustment  process  for  any  cross- 
sectional  observation  i  from  Y0  =  Xj0B  +  Ui0  to  Y, 
for  a  new  value  of  Xn  —  X*  is 

Y,i  =  pY,0  +  (1  -  p)X;S  +  Un 

=  x;b  -  P(x;  -  x,0)B  +  Pui0  +  un. 
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If  X*  is  stable  for  at  least  t  periods,  the  equation  for 
Yu  is 

Yit  =  pYif.  i  +  (1  -  P )X*B  +  Uu 

=  X*B  -  p\X*  -  X,0 )B  +  t  P SUi,t-s-  (6) 

s=0 

If  each  Uit  is  independent  of  X*  for  all  units,  the 
randomization  condition,  and  if  t  is  large  and/or 
p  is  small  so  that  the  term  in  Equation  6  involving 
pf  becomes  ignorable,  the  regression  estimates  for 
B  will  become  unbiased.  The  estimates  will  also 
be  unbiased  if  the  change  in  X  is  uncorrelated 
with  X*,  but  this  is  a  more  sample  specific  and 
stricter  condition.  Accurate  estimation  of 
Equation  6  also  requires  that  the  values  of  B  for 
each  observation  be  constant  over  the  period 
encompassed  by  t.  Subsequent  sections  discuss 
appropriate  methods  to  use  if  the  values  of  B  vary 
among  the  cross-seetional  units  and  conditions 
for  accurate  estimation  if  the  observed  units  are 
not  independent. 

Panel  data  offer  substantial  advantages  over 
cross-sectional  data  in  that  one  can  observe  AY 
and  AXt  for  the  same  unit  for  the  interval  between 
waves,  permitting  a  more  direct  estimate  of  Bk. 
Results  from  the  analysis  of  panel  data  increase 
confidence  in  the  original  model,  but  limitations 
remain.  Consider  Equation  6  with  Yit  denoting  the 
observations  of  Y  at  the  second  wave,  with  t  indi¬ 
cating  the  number  of  time  periods  between  the 
second  wave  of  interviews  and  when  X  first 
equaled  Xt.  Subtracting  Yi0  —  Xi0B  +  Ui0  from  both 
sides  of  this  equation  gives 

AY,  =  AX,B(1  -  p')  +  (Uit  -  U, 0) 

+  Zp  sUj,t-s-  (7) 

s=  1 

Unless  p  is  small  and/or  enough  time  has  elapsed 
between  when  X  changed  and  the  second  wave,  as 
measured  by  t,  regressions  of  AY  on  AX  will 
underestimate  B.  Again  the  relationship  between 
Y,  and  X„  the  estimate  B,  must  be  constant  over 
the  interval  of  the  panel.  Panel  data  have  a  distinct 
advantage  if  there  are  fixed  individual  effects,  as 
part  of  the  stochastic  term,  the  first  difference  in 
the  Us,  will  eliminate  most  of  their  effects. 
Multiple-wave  panels  provide  even  greater  advan¬ 
tages  in  estimating  these  relationships  and  in 
assessing  their  stability. 


The  informationally  richest  data  that  enable  the 
most  reliable  estimate  of  the  time  sequences  relating 
changes  in  Y  to  changes  in  X  are  those  with  a  long 
time  series,  including  event  history  data,  and  particu¬ 
larly  pooled  time-series  and  cross-sectional  data.  As 
discussed  in  subsequent  sections,  time-series  data 
permit  more  extensive  modeling  of  whether  and 
how  Y  reaches  a  new  equilibrium  following  changes 
in  X.  This  modeling  can  also  include  heterogeneity 
in  equilibrium  and  in  how  fast  units  might  reach  that 
equilibrium.  If  one’s  substantive  interests  and  the 
model  developed  to  represent  those  interests  include 
temporal  dynamics  and  heterogeneity,  then  time- 
series  data  are  required  to  estimate  and  evaluate  the 
model. 

A  Model-Based  Approach 
to  Empirical  Analysis 

A  limitation  of  the  work  described  so  far  lies  not 
just  in  whether  the  data  and/or  analysis  consti¬ 
tutes  or  mimics  a  well-designed  experiment  by 
meeting  the  randomization  condition.  The  limi¬ 
tation  is  in  the  structure  of  the  model  being  esti¬ 
mated.  This  model  will  be  an  appropriate 
representation  for  the  phenomena  being  ana¬ 
lyzed  in  some  circumstances,  particularly  those 
that  resemble  medical  trials,  where  these  tech¬ 
niques  have  been  used  extensively.  In  many 
instances,  however,  the  separable  and  additive 
expression  in  Equation  4,  or  even  its  multivari¬ 
ate  form  in  Equation  5,  will  be  an  inadequate 
representation  of  the  causal  structure.  This 
brings  us  back  to  some  form  of  Equation  3. 
Some  examples  of  such  models  and  possible  esti¬ 
mation  strategies  will  be  described  in  the  follow¬ 
ing  sections. 

Models  With  Variable  Relationships 

The  simplest  form  for  Equation  3  are  models 
where  the  relationship  between  Y  and  X  is  condi¬ 
tional  on  the  value  of  the  other  variables.  These 
propositions  are  increasingly  common  in  political 
science,  where  many  theories  are  about  the  magni¬ 
tude  of  the  effect  of  an  X  on  Y.  These  propositions 
lead  to  models  with  interaction  terms  of  various 
forms  and  various  degrees  of  complexity. 

Interaction  Terms.  The  simplest  model  with  varying 
relationships  is  a  straightforward  interaction  term, 
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the  product  of  two  variables.  Consider  Ted  Brader’s 
experimental  study  of  the  effects  of  TV  ads.  A  major 
proposition  is  that  ads  that  cue  enthusiasm  reinforce 
prior  decisions  while  ads  that  cue  fear  reduce  this 
relationship  and  increase  the  reliance  on  current 
evaluations.  A  simple  but  accurate  expression  of 
Brader’s  model  is  Choice,  —  b0  +  aChoiceM  +  Eval,(3 
+  Ut,  where  t  —  1  and  t  refer  to  the  pre-  and  post¬ 
treatment  periods,  respectively:  Choice,  is  a  partici¬ 
pant’s  candidate  choice  at  t;  and  Eval,  is  a  set  of  issue 
and  candidate  trait  evaluations.  Brader’s  proposi¬ 
tions  about  the  effects  of  different  TV  ads  are  that 
they  alter  the  values  of  a  and  (3,  not  that  they  have 
an  additive  effect  on  Choice: 

Choicet  =  b0  +  (a,  +  a2*Te  -  a3  *  7})  *  Choice, _i 

+  Eval,  *  (Pi  —  P2  *  T  +  P3  *  Tf)  +  Ut 
=  b0  +  ajChoice,-!  +  a 2(Te  *  Choice, _,) 

-  a3(Tf*  Choice,- j)  +  Eval^-I-  (Te  *  Eval,)  (3, 

-  (7)*Eval,)p3  +  U„  (8) 

where  T,  and  T,  indicate  the  type  of  treatment,  that 
is,  an  ad  that  cues  enthusiasm  or  fear,  respectively. 
Brader  uses  a  well-designed  and  executed  experi¬ 
ment  to  collect  the  information  to  estimate  this 
model.  The  estimation  is  done  with  a  probit  version 
of  the  MLE  estimator  because  of  the  dichotomous 
nature  of  the  Choice  variable.  The  important  point 
here  is  not  one  of  experimental  or  observational 
data.  It  is  the  need  to  be  clear  about  the  model, 
about  what  evidence  can  be  used  to  evaluate  or 
enhance  the  credibility  of  the  model,  and  about  the 
most  appropriate  statistical  procedure  given  the 
model  and  the  evidence. 

Equation  8  is  a  straightforward  example  of  the 
broader  group  of  models  that  feature  propositions 
about  systematic  variations  in  the  relationships 
between  Y  and  X,  which  are  represented  by  the 
interaction  terms.  Many  central  propositions  in 
political  science  relate  to  systematic  variations  in 
the  influence  that  one  or  more  variables  have  on 
some  outcome  of  interest.  For  example,  central 
bank  autonomy  affects  the  influence  central  gov¬ 
ernments  have  on  inflation.  The  bank’s  autonomy 
does  not  contribute  directly  to  inflation  but  alters 
the  ability  of  incumbent  governments  to  manage 
the  macroeconomy  in  accord  with  its  preferences. 
William  R.  Clark  suggests  that  most  studies  of  the 
effects  of  institutions,  of  cultural  differences,  and 
of  applications  of  game-theoretic  models  require 


models  with  interaction  terms.  The  need  to  model 
systematic  variations  in  coefficients  is  evident  in 
sociology  as  well.  Parameters  describing  important 
aspects  of  the  intergenerational  transmission  of 
occupational  status  are  expected  to  vary  nationally 
and  temporally. 

Estimation  of  the  simplest  models  with  interac¬ 
tion  terms  have  been  the  subject  of  considerable 
and  long-standing  discussion  in  political  science. 
In  models  such  as  Brader’s,  the  interaction  terms 
constitute  another  explanatory  variable,  and  the 
model  remains  linear  in  the  parameters,  which 
justifies  the  use  of  a  linear  model  such  as  OLS  or 
probit  in  Brader’s  study.  Statistical  inference  about 
the  marginal  effects  on  Y  of  changes  in  an  X 
require  more  complicated  calculations  than  in 
models  without  interaction  terms  because  these 
marginal  effects  change  with  the  values  of  the 
exogenous  variables. 

An  important  variation  in  this  structure  with 
temporal  models  occurs  when  the  coefficients  vary 
with  time.  There  are  a  number  of  different  proposi¬ 
tions  one  might  make  about  the  pattern  of  these 
variations.  One  can  depict  such  models  as  Yt  = 
Xfi(t)  +  Ut,  where  B ( t)  represents  some  function  of 
time.  The  simplest  variation  is  when  there  is  a 
sharp  break  in  the  value  at  a  critical  point  but  the 
coefficients  are  constant  on  either  side  of  that 
break.  In  this  structure,  Bt  —  (B1  +  B2Dt ),  where  Dt 
equals  zero  for  intervals  before  the  critical  period 
and  one  for  the  periods  following.  This  creates 
interaction  variables  that  consist  of  the  explanatory 
variable  of  interest  multiplied  by  Dt.  This  shifts  the 
relationship  between  Y,  and  X,  up  or  down  for  the 
second  period  but  leaves  the  relationship  constant 
within  these  periods.  More  complicated  variations 
in  the  relationship  between  Y,  and  X,  can  be  mod¬ 
eled  through  a  variety  of  functions,  such  as  polyno¬ 
mials:  for  example,  B(t)  —  a0  +  ajf  +  a 2t2  +  •  •  •  + 
a mtm.  The  more  complex  models  necessitate  more 
sophisticated  estimation  procedures  than  the  linear 
models  shown  so  far,  but  the  main  point  is  that 
these  models,  which  are  required  for  studying 
dynamic  processes,  as  with  studies  of  the  federal 
open  market  committee  and  monetary  policy,  have 
a  far  more  complex  structure  than  Equation  4  or  5. 

Random  Coefficients.  Models  with  randomly  vary¬ 
ing  coefficients  are  used  in  a  variety  of  contexts. 
Here,  the  relationships  between  Y  and  the  Xs  are 
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not  identical  for  all  observations,  as  in  Equation  5, 
nor  do  they  vary  in  some  systematic  way  with  time 
or  with  other  variables  but  are  themselves  random 
variables.  The  model  with  random  coefficients  can 
be  represented  by  Y  =  =  X;(£>  +  e(.)  +  Ut, 

where  a,  denotes  the  variations  in  the  relationship 
between  Y  and  X  among  observations.  If  E( s;)  =  0 
for  all  observations,  meaning  that  B(.  is  uncorrelated 
with  X  then  B  denotes  the  mean  effect  of  X  on  Y. 

The  random  coefficients  model  has  been  used  to 
critique  the  use  of  regression  adjustments  to  exper¬ 
imental  data.  Some  statisticians  consider  the  out¬ 
come  for  individual  i  to  be  fixed  or  random  for 
that  individual  conditional  on  whether  the  person 
is  in  the  treatment  or  control  group,  respectively. 
These  constant  individual  effects  then  vary  among 
individuals,  so  that  Y  =  XT  +  (1  —  X()C(,  where 
X(  equals  one  if  the  subject  is  in  the  treatment 
group  and  T  and  C,  are  the  outcome  for  person  i 
if  she  or  he  is  in  the  treatment  or  control  group, 
respectively.  The  average  treatment  effect  is 
T  -  C  —  (1  +  (1  which  corre¬ 

sponds  to  the  mean  of  the  coefficients  on  X.  and 
(1  —  X(),  for  the  treatment  and  control  groups, 
respectively. 

Estimation  of  the  model  with  random  coeffi¬ 
cients  requires  one  important  condition  and  some 
specialized  methods.  The  condition  is  that 
X“=i(X*  —  X/,  =  0  for  all  K  explanatory  vari¬ 

ables,  meaning  that  the  coefficient  variations  are 
independent  of  each  explanatory  variable.  If  the 
condition  is  met,  then  OLS  estimates  are  consistent 
but  inefficient.  More  elaborate  estimators  are 
required  for  efficiency  and  to  get  the  standard 
errors  correct  as  the  stochastic  term  created  by 
inclusion  of  the  coefficient  variations  does  not  meet 
the  homoskedastic  conditions  for  OLS  estimation. 
If  this  independence  condition  is  not  met,  the  result 
is  biased  estimates  for  B.  The  correction  if  X  and  IT 
are  correlated  involves  modeling  the  systematic 
variations  in  Bi  as  a  function  of  some  exogenous 
variables,  for  example,  Bik  —  ZAk  +  sik,  so  that  zlk 
is  now  uncorrelated  with  the  explanatory 
variables.  Including  this  expression  for  Bik  in 
Equation  5  gives  the  model  with  interaction  terms 
between  Xk  and  Z. 

Hierarchical  Models.  Often  the  context  for 
studying  outcomes  of  interest  consists  of  multi¬ 
ple  levels.  Individuals  in  voting  studies  reside 


within  a  layer  of  election  districts.  In  the  United 
States,  this  hierarchy  consists  of  municipalities, 
counties,  congressional  districts,  and  states.  In 
the  European  Union,  individuals  are  nested 
within  districts,  provinces,  and  countries.  It  is 
likely  that  the  factors  affecting  individual  behav¬ 
ior  vary  among  these  different  higher  level  units 
but  are  constant  for  all  individuals  within  a 
unit.  Party  organizations,  resources,  strategies, 
and  competitiveness,  which  affect  individuals’ 
turnout  and  vote  decisions,  vary  substantially 
among  local  areas  and  states.  In  democratic 
countries,  there  is  substantial  institutional  vari¬ 
ation,  which  affects  individual  decisions.  Models 
that  adequately  represent  these  hierarchical 
relationships  are  more  complex  than  Equation  5 
and  are  an  application  of  the  previous  models 
with  interaction  terms  and  random  coefficients. 

Models  with  these  multiple  levels,  frequently 
referred  to  as  hierarchical  or  mixed  models,  com¬ 
bine  explicit  interaction  terms  and  random  coeffi¬ 
cients  to  represent  these  multilevel  effects.  Consider 
the  following  two-level  model,  where  Yi;  refers  to 
the  outcome  for  person  i  residing  in  Level  2  of  unit 
j,  such  as  a  state  or  a  country;  X-  refers  to  if  indi¬ 
vidual-level  explanatory  variables;  and  Z;  refers  to 
M  variables  describing  state  or  country  character¬ 
istics  for  unit  /.  The  full  model  is 

Y,y  =  Boj  +  Yj  XijkBkj  +  Uy, 

k=i 

M 

h oj  d  no  T"  ZjmA()m  Y  , 

m= 1 
M 

-^kj  A0k  ^  Zjm-Akm  £&/'• 

m  =  1 

Combining  these  equations  gives 

M  K 

^ ij  -^00  -^-ijk-^-Ok 

m= 1  k=l 

K  M 

■X.ijkZjjn-Aktn  bJ ij 

k=l  m= 1 

K 

“i”  8  Qj  ^  Xjjk  8  kj  •  ( 9 ) 

k=l 

The  A0m  terms  estimate  the  Level  2  (e.g.,  state 
or  country)  effects  on  Level  1  (e.g.,  individual) 
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behavior  by  shifting  the  intercept  for  all  individu¬ 
als  in  the  same  Level  2  unit.  The  coefficients  Aok 
estimate  the  effects  of  individual-level  variables  on 
individual  behavior.  These  are  the  terms  in  the 
conventional  individual  model  possibly  repre¬ 
sented  by  Equation  5.  Finally,  the  coefficients  Akm 
indicate  how  the  Level  2  factors,  Z-,  alter  the  rela¬ 
tionship  between  the  individual  variables  and  the 
individual  outcome.  For  example,  if  the  competi¬ 
tiveness  of  the  two  U.S.  parties  within  a  state 
affects  both  voter  turnout  in  a  state  and  the  rela¬ 
tionship  between  education  and  turnout,  then  the 
model  needs  to  include  an  interaction  term  that  is 
the  respondent’s  education,  Xjjk,  times  the  two 
parties’  competitiveness  within  state  j,  Zjm.  The  e0/ 
and  zkj  terms  represent  heterogeneity  in  the  higher 
unit’s  intercepts  and  coefficients,  respectively,  and 
are  terms  in  the  random  coefficients  model  just 
discussed. 

Summary.  There  is  a  major  distinction  between 
the  models  described  in  this  section  and  the  one 
represented  by  Equation  4  or  5.  Both  structures 
are  linear  and  separable  in  the  parameters,  mean¬ 
ing  that  the  basic  model  is  described  by  the 
expression  Y,  =  ,  Xlk  B  +  Un  but  the  models  in 

this  section  are  not  linear  and  separable  in  the 
relations  between  Y  and  X.  In  Equations  4  and  5, 
8Y/dXk  —  Bk  regardless  of  the  value  of  Xk  or  of 
any  other  X.  Transformations  of  Y  and  X  such 
that  Y::'  =  f(Y)  and  X*  =  f(X)  still  leave  the 
expression  3Y*/dXj=  Bk.  In  the  models  with  inter¬ 
action  terms,  the  expected  marginal  change  in  Y 
for  a  marginal  change  in  Xk  is  not  the  constant  Bk. 
Consider  the  simple  model  with  one  interaction 
term: 

Y,  =  Bj  +  B2X,  +  B3Zt  +  B4X,Z,  +  Ur  (10) 

This  produces  the  marginal  relationship  3Y(/dX, 
—  B2  +  B4Z:.  This  changes  the  marginal  rela¬ 
tionship  between  Y  and  X,  even  to  the  point 
that  for  certain  values  of  Z  there  may  be  no 
change  in  Y  for  a  change  in  X,  depending  on  the 
values  of  B,  and  Br  Any  effort  to  use  or  mimic 
experimental  methods  to  estimate  8Y/dX  must 
take  the  context,  meaning  the  value  of  Z,  into 
account.  More  important,  if  the  interest  is  in 
estimating  how  the  relationship  between  Y  and 
X  varies  with  the  value  of  Z,  as  it  is  in  many 


studies,  one  must  have  observations  in  which 
both  X  and  Z  vary.  Estimation  of  both  B2  and 
B4  may  not  require  observational  data,  as 
Brader  demonstrates,  but  it  requires  an  estima¬ 
tion  method  that  matches  the  structure  of  the 
model  in  Equation  10  and  data  with  systematic 
variations  in  both  X  and  Z. 

Models  With  Interactions  Among  Observations 

Some  substantively  important  topics  require 
models  with  explicit  interactions  among  the  obser¬ 
vations,  whether  temporal  or  spatial.  Such  models 
are  necessary  to  examine  the  temporal  dynamics  of 
social  processes  and  their  possible  equilibrium  prop¬ 
erties.  The  extensive  work  on  the  interaction  of 
macroeconomic  and  political  conditions  exemplifies 
such  models.  In  these  studies,  the  effects  of  condi¬ 
tions  in  one  time  period  on  conditions  in  later  peri¬ 
ods  are  an  explicit  part  of  the  model,  the  data,  and 
the  estimation  procedures.  There  is  increasing  inter¬ 
est  in  models  that  include  explicit  interactions 
among  individuals  or  geographically  defined  units. 
Communication  of  political  attitudes  and  informa¬ 
tion  among  individuals  and  the  diffusion  of  policy 
innovations  among  states  are  examples  of  these 
substantive  interests.  The  substantive  questions 
focus,  at  least  in  part,  on  the  interdependence  of 
observational  units,  and  this  interaction  is  more 
than  a  statistical  nuisance.  Models  that  treat  the 
interdependence  of  either  temporal  or  cross- 
sectional  units  as  an  integral  part  of  the  structure 
are  more  complex  than  those  in  Equations  4  and  5 
and  require  sophisticated  statistical  methods  to  esti¬ 
mate  these  interactions. 

Interdependencies  among  observations  imply  a 
different  model  and  their  estimation  requires  differ¬ 
ent  statistical  methods  from  those  derived  from  the 
experimental  paradigm.  A  central  necessary  condi¬ 
tion  of  the  experimental  method  and  of  the  tech¬ 
niques  developed  to  mimic  it  is  that  each  observation 
is  observed  and  is  behaving  independently  of  all  the 
other  units  observed.  This  is  referred  to  as  the 
SUTVA  (stable  unit  treatment  value  assumption) 
condition.  In  the  context  of  Gosnell’s  and  other 
voter  turnout  field  experiments,  it  means  that  there 
is  no  interaction  between  someone  receiving  the 
“treatment,”  for  example,  a  mailing  or  a  phone  call, 
and  an  individual  not  receiving  the  treatment.  For 
subjects  living  in  the  same  neighborhood  and 


Statistics:  Overview  2535 


possibly  attending  meetings  of  the  same  local  organi¬ 
zations,  such  as  a  church  or  PTA,  this  condition 
might  be  problematic.  The  basic  premise  for  SUTVA 
is  obviously  violated  for  studies  of  how  units  interact, 
be  they  time  periods,  individuals,  or  political  units. 

Temporal  Interdependence.  For  studies  of  dynamic 
processes,  the  interactions  among  observational 
units  are  a  primary  focus.  Even  when  an  examina¬ 
tion  of  the  temporal  interactions  is  not  the  primary 
interest  but  one  must  use  observational  time-series 
data,  the  possible  interdependence  cannot  be 
ignored  but  must  be  included  in  the  model.  At  one 
point,  these  interactions  were  treated  as  nuisance 
terms  that  had  to  be  diagnosed  and  then  dealt  with 
statistically  in  order  to  obtain  efficient  estimators 
and  accurately  estimated  standard  errors.  The 
extensive  use  of  the  Durbin-Watson  statistic  and  of 
models  with  first-order  autocorrelated  error  terms 
exemplifies  this  thinking. 

Increasingly,  this  autocorrelation  is  interpreted 
as  being  substantively  interesting,  and  efforts 
being  are  made  to  incorporate  temporal  interde¬ 
pendencies  into  the  model.  Consider  the  com¬ 
monly  used  error  correction  model  (ECM),  which 
is  an  elaborated  version  of  Equation  6: 

AY,  -  A XtBr  +  (1  -  p)(Y;_1  ~Yt-1)  +  Ut,  (H) 

where  Y*_  ,  =  (1  —  p)  1(B0  +  X,  iB',)  is  the  long- 
run  equilibrium  value  of  Y  at  time  t  —  1  given  Xt_1 
and  p  <  1.  Changes  in  Y  from  t  —  1  to  t  depend  on 
its  responsiveness  to  exogenous  shocks,  defined  by 
AX(B , ;  and  how  quickly  Y  responds  to  being  out 
of  equilibrium,  given  by  (1  —  p)  ( Y*_ ,  -  Y(_x). 
With  appropriate  manipulation  of  terms,  this 
model  is  estimated  as 

Y,  =  B0  +  X,B,  +  X(_x(B*  -Bt)  +  PY,_j  +  Ut 
-  B0  +  X;B|  +  X,_jB2  +  pY,_x  +  Ut.  (12) 

In  time-series  models  of  the  macropartisanship 
of  the  U.S.  electorate,  for  example,  the  interest  is  in 
whether  the  electorate’s  partisanship  responds  to 
short-term  economic  shocks,  AXf,  and  how  quickly 
it  returns  to  equilibrium  after  these  shocks.  The 
ECM  also  predicts  a  new  equilibrium,  given  by 
Y*  =  (1  —  p)-1[B0+  X*(B1+B1)],  if  there  is  a 
permanent  shift  in  the  exogenous  variables  from 


X(.j  to  X*  and  estimates  how  long  it  takes  to 
approximate  that  equilibrium.  In  this  framework, 
macropartisanship  might  shift  to  a  new  equilib¬ 
rium  if  the  electorate  permanently  shifts  its  prefer¬ 
ences  on  a  major  issue,  if  the  parties  adopt  new 
platforms,  and/or  if  the  electorate  sees  a  perma¬ 
nent  shift  in  the  parties’  ability  to  manage  eco¬ 
nomic  or  foreign  policy  issues. 

Spatial  and  Individual  Interdependence.  The 
treatment  of  cross-sectional  interdependence  is 
experiencing  a  comparable  and  extensive  trans¬ 
formation.  One  approach  treats  the  stochastic 
terms  as  spatially  interdependent  and  develops 
tests  and  methods  for  incorporating  this  interde¬ 
pendence.  There  is  no  causal  interdependence, 
whereby  the  actions  of  one  unit  influence  the 
actions  or  outcomes  in  the  other  units.  An  exam¬ 
ple  of  this  stochastic  interdependence  would  be  a 
model  of  voter  turnout  in  election  districts  where 
bad  weather  reduces  turnout  and  weather  pat¬ 
terns  are  not  confined  to  individual  districts  but 
spill  across  boundaries,  reducing  turnout  in  the 
adjacent  districts.  More  substantively  interesting 
models  incorporate  interdependence  whereby  the 
actions  or  outcomes  in  one  unit  directly  influence 
the  actions  of  the  other  units.  This  type  of  inter¬ 
dependence  has  a  long  history  in  political  science 
research.  Propositions  about  how  political  atti¬ 
tudes  are  communicated  among  individuals  and 
about  the  diffusion  of  policy  innovations  among 
states  within  the  United  States  are  examples. 

Here  is  the  formal  distinction  between  the  two 
types  of  interdependence: 

Stochastic  interdependence: 

T  =  Y  X*Bt  + 

k=i 

N 

e,  =  k  Y  wa£t  +  U, 

7=1 

gives 

Y;  =  Y  XikBk  +  k  Y  w‘iei  +  U<-  (13) 

k=i  i=i 

Structural  interdependence: 

Y,  =  p  Y  wHYi  +  Z  XikBk  +  £,•  (14) 

;= 1  k=i 
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The  terms  wq  indicate  the  pattern  of  interdepen¬ 
dence  among  observations,  so  that  wtj  =  0  if  units 
i  and  /  are  independent  and  wj/  ^  0  otherwise. 
There  are  various  ways  to  measure  wtj  for  the  non- 
independent  case,  depending  on  how  dense  one 
expects  the  interactions  to  be  and  whether  one 
standardizes  each  unit’s  values  for  w  based  on  the 
number  of  connections.  The  terms  \  and  p  indicate 
the  magnitude  of  the  stochastic  and  structural 
interdependence,  respectively. 

In  Equation  13,  the  expected  effect  on  Y,  of  a 
change  in  Xik  remains  fixed  and  is  given  by  Bk. 
Random  shocks  in  one  observation  are  shared 
among  the  other  observations  as  their  effects 
are  propagated  through  the  other  stochastic 
terms.  This  indicates  that  the  stochastic  terms  in 
the  equation  to  be  estimated  are  no  longer  inde¬ 
pendent,  which  means  that  the  OLS  estimates 
are  unbiased  but  inefficient  and  the  computed 
standard  errors  are  incorrect.  In  Equation  14, 
by  contrast,  any  change  in  Xik  leads  to  direct 
and  indirect  changes  in  Y;  as  its  effects  are 
propagated  through  the  other  observations.  The 
total  magnitude  of  these  direct  and  indirect 
effects  depends  on  the  structure  of  interdepen¬ 
dence,  the  w(/s,  and  the  magnitude  of  the  inter¬ 
dependence,  p.  [These  effects  are  shown  better 
in  matrix  notation  as  Y  =  (I  —  pW)-1(XB  +  s), 
where  W  is  an  N  X  N  matrix  showing  the  full 
pattern  of  interdependencies.]  This  latter  model 
is  by  far  the  more  substantively  interesting  as  it 
proposes  some  form  of  strategic  or  learning 
behavior  among  the  units.  This  would  be  the 
appropriate  model  for  discussing  the  diffusion 
of  innovations  among  states  or  the  communica¬ 
tion  of  political  information.  Common  pro¬ 
grams  such  as  Stata  describe  and  include  various 
tests  and  estimation  procedures  for  each  type  of 
interdependence. 

Summary.  The  substantive  questions  that  moti¬ 
vate  Equations  11  and  14  are  different  from  the 
ones  embodied  in  Equation  4  or  5.  The  param¬ 
eter  p,  which  measures  the  interdependence 
among  units,  does  not  have  an  analog  in 
Equation  5.  So  it  is  not  surprising  that  the  struc¬ 
ture  of  the  models  is  different,  even  to  the  point 
that  Equation  14  is  not  even  linear  in  the  coef¬ 
ficients.  Estimation  of  p,  and  therefore  of  the 
whole  model,  requires  different  types  of  data 


and  methods.  Furthermore,  the  observational 
interdependence  embodied  in  p  is  not  a  nuisance 
parameter  or  a  property  of  the  data  to  be 
excised,  if  possible.  It  represents  a  substantive 
part  of  the  model. 

Nonlinear  Models 

The  most  general  version  of  the  model  in 
Equation  3  is  nonlinear  in  the  coefficients.  This 
form  is  useful  for  modeling  a  large  set  of  complex 
processes.  The  model  in  Equation  14  is  an  example 
of  a  model  that  is  nonlinear  in  its  coefficients. 
Some  models  with  interaction  terms  can  become 
nonlinear  in  the  parameters.  For  example,  a  model 
of  central  bank  and  governmental  control  of  mon¬ 
etary  and  inflation  policy  posits  interaction  terms 
that  are  functions  of  unobserved  variables,  which 
are  modeled  as  functions  of  observables.  When  the 
functions  for  the  latent  variables  are  introduced  as 
interaction  terms,  it  produces  a  model  that  is  non¬ 
linear  in  the  coefficients  because  of  the  constrained 
multiplicative  terms. 

Nonlinear  dynamic  systems  are  likely  to  include 
nonlinear  functions  of  parameters.  Processes  that 
might  be  path  dependent,  for  example,  have 
received  considerable  attention  in  some  areas  of 
political  science.  Path  dependence  in  a  temporal 
process  requires  a  time-varying  transition  param¬ 
eter.  Incorporating  this  requirement  in  a  dynamic 
model  similar  to  Equation  6  requires  a  pt  that  is  a 
function  of  time-varying  variables  in  the  system, 
denoted  by  Zt,  giving  pt  —  p(Z;).  This  variation  in 
and  extension  to  the  temporal  models  discussed 
above  gives 

Y,  =  p,Y,_j  +  (1  -  p t)XtB  +  Ut 

=  P {Zt)Yt-y  +  [1  -  P {Zt)]XtB  +  Ut.  (15) 

There  is  an  important  difference  between  this 
model  and  the  ECM  model  in  Equation  11, 
although  Y  has  the  same  long-run  equilibrium  in 
both  if  p(  <  1  for  all  t.  In  the  ECM  model  p  is 
constant,  while  here  p  varies  with  time  as  a  func¬ 
tion  of  the  values  of  Zt.  The  constraints  on  a 
dynamic  system  are  that  0<P,<1,  which  implies 
that  it  is  a  nonlinear  function  of2Zr  An  example 
of  such  a  function  is  p  =  a]e  “ lZ‘ ,  which  ensures 
that  0  <  p ,<av  The  resulting  model  has  important 
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dynamic  properties  that  are  vastly  different  from 
those  of  the  linear  model  in  Equation  5,  and  pos¬ 
sibly  even  Equations  6  and  11.  For  example,  if  a1 
—  1,  meaning  that  p,  <  1  when  Z]  ^  0  and  pt  =  1 
when  Zf  —  0,  if  the  variance  of  Ut goes  to  zero  as 
Zt  goes  to  zero,  it  produces  a  path-dependent 
sequence  for  Yt,  whose  equilibrium  value  depends 
both  on  the  values  for  Xt,  as  in  Equations  6  and 
11,  and  on  the  sequence  of  values  for  Xt  and  for 
Ut  prior  to  the  point  where  Zt  —  0.  If  the  variane 
e  of  Ut  does  not  go  to  zero  when  p(  =1,  the  pro¬ 
cess  becomes  what  is  called  a  random  walk  after 

Z,  =  0. 

If  the  stochastic  term  is  separable  and  additive, 
as  it  is  in  Equation  15,  a  method  called  nonlinear 
least  squares  can  be  used  to  estimate  the  model’s 
coefficients.  Derivation  of  the  nonlinear  least 
squares  estimator  is  straightforward.  As  in 
Equation  3  let  Yt  =  F(Xt,  B)  and  Y,  =  F(Xt,  b) ,  and 
let  there  be  K  coefficients  Bk  to  be  estimated.  As 
with  OLS,  the  nonlinear  method  chooses  the  values 
for  each  bk  that  minimize  the  expression  Y*Aet  = 

-  Y')2.  =  -  P(X*M2:  The  central 

term  in  this  minimization  is  the  matrix  of  the  par¬ 
tial  derivative  of  each  F(Xt,b)  with  respect  to  each 
coefficient  bk,  f  (Xt,  bk)  =  dF(Xt,bk)/  dbk.  There  are 
K  such  derivatives  for  each  observation,  which 
when  done  for  all  T  observations  gives  a  T  X  K 
matrix.  The  non-linear  least  squares  estimator  is 
the  solution  to  the  set  of  K  nonlinear  simultaneous 
equations: 

bk)et=  if(Xt,bk)(Yt-Yt) 

t=  1  t=  1 

=  tf(X„bk)Yt 

t=  1 

-  Yjf(Xt,bk)[F(Xt,b)\  =  0.  (16) 

t=  1 

With  the  linear  model,  Yt  =  and  the 

partial  derivative  of  Yt  with  respect  to  bk  is  sim¬ 
ply  Xtk,  so  the  jth  equation  in  Equation  16  then 
becomes 

X  x,,et  =  f  Y,  -  X  WM  =  x  X/Y,  -  X  bk  X  x<ix *  =  0, 

t= i  t= i  V  k=i  J  t= i  k=i  t= i 


which  is  the  familiar  expression  for  the  OLS  esti¬ 
mator.  (In  matrix  form,  this  expression  is  XY  — 
(XX)b  —  0,  which  may  be  more  familiar.)  Finding 
the  solution  to  the  equations  in  Equation  16  is 
much  more  complicated  than  for  the  OLS  estima¬ 
tor,  but  with  the  power  of  current  computers  and 
numerical  methods,  most  estimations  can  be 
accomplished  relatively  easily.  The  resulting  esti¬ 
mates  are  consistent  and  asymptotically  normally 
distributed. 

As  in  many  models  of  nonlinear  dynamic  sys¬ 
tems  and  other  applications  of  nonlinear  least 
squares,  the  substantive  process  is  best,  or  possibly 
only,  modeled  by  a  structure  that  is  nonlinear  in 
the  coefficients.  Also,  in  many  of  these  applica¬ 
tions,  accurate  parameter  estimates  are  very  impor¬ 
tant  as  the  models  may  have  vastly  different 
equilibrium  and  dynamic  properties  for  different 
coefficient  values,  as  is  the  case  with  the  model  of 
path  dependence  above.  The  properties  of  these 
models  render  the  linear  model  inappropriate,  giv¬ 
ing  rise  to  the  need  for  a  nonlinear  estimator. 
Fortunately,  as  various  simulation  studies  illus¬ 
trate,  the  nonlinear  least  squares  estimator  per¬ 
forms  quite  well,  even  for  potentially  difficult 
parameter  values. 

Multi-Equation  Models 

Some  important  processes  can  only  be  repre¬ 
sented  by  multiple  equations,  as  there  are  several 
interacting  endogenous  variables,  and  these  inter¬ 
actions  are  the  central  concern.  The  Blau  and 
Duncan  model  of  occupational  status  attainment 
features  two  predetermined  but  not  necessarily 
exogenous  variables,  father’s  occupation  and  edu¬ 
cation,  and  three  outcome  measures  for  the  respon¬ 
dent,  education,  first  job,  and  current  job  status. 
These  five  variables  are  organized  in  a  hierarchical 
system  in  which  the  variables  expressed  earlier  in 
the  system  are  not  functions  of  the  variables 
included  later.  By  then  treating  all  the  stochastic 
terms  in  the  separate  equations  as  uncorrelated  (a 
strong  assumption),  the  system  is  recursive  and  can 
be  estimated  with  the  same  methods  used  to  esti¬ 
mate  Equation  5.  This  early  multi-equation  model 
stimulated  an  extensive  amount  of  work  on  mod¬ 
els  with  multiple  outcomes.  This  subsequent  work 
and  earlier  but  related  work  in  economics  removed 
both  the  hierarchical  structure  and  the  condition 
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of  uncorrelated  stochastic  terms,  leaving  a  fully 
endogenous  system. 

A  much  more  complex  example  is  Brandt’s  and 
Freeman’s  recent  model  of  the  U.S.  macropolitical 
economy.  This  model  uses  nine  equations  to  model 
the  contemporaneous  interaction  of  nine  economic 
and  political  variables,  emphasizing  the  dynamic 
interactions  within  the  political  economy.  Each  of 
these  variables  is  the  left-hand-side  variable  in  one 
equation  and  a  right-hand-side  variable  in  the  oth¬ 
ers.  Multi-equation  systems  with  joint  endogenous 
variables,  whether  hierarchical  or  not,  present  dif¬ 
ficult  estimation  problems  as  the  endogeneity  of 
the  variables  included  in  each  equation  violates  the 
basic  principle  of  independence  between  the 
explanatory  variables  and  the  stochastic  term, 
referred  to  earlier  as  the  exogeneity  condition. 

As  with  some  of  the  previous  models,  the  early 
treatment  of  multi-equation  models  considered  the 
presence  of  endogenous  variables  in  an  equation  to 
be  a  necessary  nuisance  resulting  from  the  obser¬ 
vational  nature  of  the  data.  There  is  an  extensive 
literature  in  econometrics  related  to  dealing  with 
these  nuisances,  usually  under  headings  such  as 
endogenous  regressor,  instrumental  variables, 
two-stage  least  squares,  or  simultaneous  equa¬ 
tions.  These  concerns  are  real,  and  endogeneity 
poses  serious  estimation  problems,  even  in  the  best 
of  conditions. 

The  thumbnail  description  of  the  IV  estimator 
is  as  follows.  The  equation  being  estimated  is 

K  M 

yt  =  Y  XtkBk  +  Y  YmCm  +  Ut,  (17) 

k=l  m= 1 

where  the  Xks  are  exogenous  variables  and  the  Ym s 
are  endogenous  variables.  The  Yms  are  considered 
endogenous  because  they  are  correlated  with  U.  IV 
estimation  requires  one  or  more  exogenous  vari¬ 
ables,  denoted  by  Z„  that  are  correlated  with  Y m 
but  uncorrelated  with  U.  There  must  be  at  least  as 
many  instruments  as  there  are  included  endoge¬ 
nous  variables,  that  is,  L  >  M.  Formally,  these 
requirements  are  that  E^'^Z,,  Y/;„j  =  tr/m  /  0 

and  that  E^J=1ZtiUt'\  —  (tu,  —  0.  (These  condi¬ 
tions  are  usually  stated  as  probability  limits,  but 
the  covariance  form  is  easier  to  picture.) 
Informally,  this  means  that  y  is  related  to  each 
instrument  variable  Z/3  but  only  because  of  Z/s 


association  with  Ym;  that  is,  Z,  has  no  direct  asso¬ 
ciation  with  y. 

The  natural  experiments  discussed  earlier  can 
be  a  good  source  of  instruments.  If  the  conse¬ 
quences  of  some  natural  event,  such  as  the  local 
severity  of  a  hurricane’s  winds,  are  a  random 
event,  then  measures  of  this  event  can  be  good 
instruments.  The  research  estimating  the  ability  of 
federal  aid  to  build  support  for  an  incumbent 
administration  had  to  treat  the  amount  of  aid  dis¬ 
tributed  to  communities  as  endogenous  as  it  was 
also  proposed  that  aid  distribution  was  influenced 
by  votes  for  the  incumbent  in  previous  elections. 
The  hurricane’s  intensity  should  be  related  to  the 
amount  of  aid,  but  because  of  the  randomness  of 
hurricane  paths,  wind  intensity  should  be  uncor¬ 
related  with  previous  vote  patterns  over  the  larger 
area  covered  by  the  hurricane. 

The  selection  of  instruments  is  often  aided  by 
the  model  itself.  Many  applications  contain  equa¬ 
tions  modeling  each  of  the  endogenous  variables 
in  the  system,  which  means  that  there  are  separate 
equations  for  each  Ym  in  addition  to  the  equation 
for  y.  This  set  of  equations  is  referred  to  as  a  struc¬ 
tural  model,  and  each  equation  will  usually  con¬ 
tain  both  exogenous  and  endogenous  explanatory 
variables.  In  estimating  the  equation  for  y,  the 
exogenous  variables  not  included  in  this  equation 
but  that  are  included  in  the  equations  for  the  other 
endogenous  variables  become  the  natural  instru¬ 
ments.  Denote  by  X*  the  K*  exogenous  variables 
that  are  included  in  the  equations  for  Ym  but  are 
not  included  in  the  equation  for  y.  The  variables  in 
X*  become  the  instruments,  that  is,  the  Zs.  The 
propositions  that  argue  for  their  inclusion  in  the 
other  equations  imply  that  they  are  correlated  with 
Ym.  The  logic  for  excluding  them  from  the  equation 
for  y  is  that  they  are  uncorrelated  with  U.  Together 
these  define  the  properties  of  an  instrument. 

A  necessary  condition  for  estimating  the  equa¬ 
tion  for  y  is  that  the  number  of  included  endoge¬ 
nous  variables  be  less  than  or  equal  to  the  number 
of  excluded  exogenous  variables,  M  <  X*.  This, 
however,  is  only  a  necessary  condition.  The  suffi¬ 
cient  conditions,  referred  to  as  identification, 
require  a  more  detailed  knowledge  of  which  vari¬ 
ables  in  Xs'  are  related  to  which  included  endoge¬ 
nous  variables.  If  there  is  only  one  included 
endogenous  variable — for  example,  the  structural 
model  contains  only  two  equations — the  necessary 
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condition  is  also  the  sufficient  condition. 
Determining  if  the  equation  for  y  is  identified  and 
can  be  estimated  if  it  contains  more  than  one 
endogenous  variable  is  a  more  exacting  process, 
for  which  there  are  no  simple,  uniform  rules.  A 
more  detailed  discussion  of  identification  and  how 
to  determine  if  an  equation  is  identified  can  be 
found  in  any  econometrics  textbook. 

IV  estimation  is  a  general  estimation  strategy 
for  equations  with  endogenous  variables.  Its 
application  to  multi-equation  systems  is  a  special 
case.  Regardless  of  the  application,  the  require¬ 
ments  for  IV  estimation  are  the  reason  why  one 
should  approach  IV  estimation  with  caution, 
because  they  may  be  difficult  to  meet.  The  require¬ 
ment  that  the  Z,s,  or  the  Xs" s,  and  Ym  be  correlated 
can  be  tested  as  these  variables  are  observed.  The 
tests  for  this  correlation  require  more  than  a  sim¬ 
ple  correlation.  First,  what  is  important  is  the  par¬ 
tial  correlation  between  Ym  and  the  Z;s,  controlling 
for  the  included  exogenous  variables,  the  Xks. 
Second,  if  there  is  more  than  one  included  endog¬ 
enous  variable,  each  Ym  must  be  correlated  with  a 
different  instrument  or  set  of  instruments.  This  is 
another  statement  of  the  identification  condition, 
but  it  means  that  it  must  hold  for  the  data  at  hand, 
not  just  as  a  theoretical  statement. 

The  condition  that  the  Z;s  and  U  are  uncorre¬ 
lated  is  not  testable,  as  U  is  not  observed.  In  cases 
where  there  are  more  instruments  than  endoge¬ 
nous  variables,  that  is,  K *  —  L  >  M,  referred  to 
as  the  over  identified  case,  there  are  some  weak 
tests  for  the  adequacy  of  the  instruments.  These 
tests  are  based  on  the  fit  between  the  residuals 
from  the  estimated  equation  for  y  and  the  instru¬ 
ments.  The  better  this  fit  is,  the  less  likely  the 
instruments  are  to  be  independent  of  U.  But  these 
are  relatively  low-power  tests,  meaning  that  they 
are  not  highly  reliable  for  deciding  whether  to 
accept  the  null  hypothesis  of  independence. 

There  are  other,  and  very  different,  estimation 
strategies  for  accommodating  endogeneity  as  illus¬ 
trated  by  the  model  of  the  U.S.  macropolitical 
economy  mentioned  earlier.  But  they  are  beyond 
the  scope  of  this  entry.  These  estimation  methods 
rely  on  different  constraints  and  assumptions 
from  those  in  the  IV  estimator.  They  also  nicely 
illustrate  the  development,  specification,  and  use 
of  multi-equation  models  to  provide  important 
insights  into  the  intricate  interactions  among  the 


components  of  complex  social  phenomena. 
Substantive  use  of  these  models  extends  far 
beyond  models  of  political  economy.  The  chal¬ 
lenge  in  these  models  is  to  continue  to  develop 
estimation  methods  that  are  tailored  to  the  models 
and  to  the  types  of  data  available  to  estimate  these 
models.  The  estimation  task  posed  by  these  mod¬ 
els  is  not  simply  the  reliance  on  observational  data 
due  to  the  difficulty,  maybe  impossibility,  of  per¬ 
forming  experiments  that  carefully  and  exoge¬ 
nously  manipulate  one  variable  at  a  time.  The 
central  feature  of  these  models  is  how  to  represent 
and  then  to  estimate  the  complex  contemporane¬ 
ous  and  lagged  interactions  among  a  set  of  vari¬ 
ables,  which  are  endogenous  because  of  these 
interactions. 

Concluding  Comments 

The  focus  of  this  entry  is  on  models  that  represent 
causal  statements  about  social  processes  and  on 
some  of  the  data  and  estimation  methods  that  are 
consistent  with  these  models.  The  models  dis¬ 
cussed  here  (from  Equations  4  through  17,  regard¬ 
less  of  whether  the  data  are  derived  from 
experiments  or  observations,  share  one  common 
feature.  The  stochastic  component  is  separable  and 
additive  to  the  systematic  part.  This  condition  sim¬ 
plifies  the  estimation  process  and  facilitates  deriva¬ 
tion  of  the  estimator’s  statistical  properties.  There 
are  methods  beyond  the  scope  of  this  discussion 
that  rely  on  other  specifications,  assumptions,  and 
conditions  that  need  to  be  considered  in  any 
empirical  study. 

There  are  a  variety  of  strategies  one  might  use 
to  estimate  and  evaluate  the  various  models.  There 
is  no  single  “best”  method  for  all  studies.  The 
choice  of  estimation  method  should  follow  from 
the  substantive  question,  from  the  model  used  to 
study  that  question,  and  from  the  available  data. 
The  limitations  of  OLS  regression  in  accurately 
estimating  Equation  5  with  observational  data  are 
well  documented,  if  not  always  acknowledged  in 
practice.  Lab  experiments  are  often  considered  the 
“gold  standard,”  and  they  have  very  desirable 
properties  in  many  contexts.  They  also  need  to  be 
considered  carefully.  The  experimental  setting, 
stimulii,  and  subjects  must  closely  approximate  the 
actual  setting,  stimulii,  and  individuals  on  which 
the  results  will  be  applied.  Artificial  experimental 
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settings  may  generate  artificial  findings.  Field 
experiments  will  overcome  the  validity  concerns 
but  may  raise  other  problems.  Randomization  and 
double-blindedness  may  be  difficult  to  achieve 
when  working  with  real  organizations  with  their 
own  agendas  and  procedures. 

Natural  experiments  and  regression  discontinu¬ 
ity  designs  and  matching  methods  all  use  observa¬ 
tional  data.  This  gives  them  the  advantage  of 
analyzing  “real”  data  from  “real”  subjects  in  their 
natural  setting  and  in  ways  that  can  overcome 
some  of  the  problems  associated  with  regression 
analysis  of  observational  data.  One  must  be  sure, 
however,  that  there  are  no  subtle  selection  pro¬ 
cesses  that  prevent  true  randomization.  For  exam¬ 
ple,  in  the  studies  based  on  lottery  winnings,  can 
one  be  sure  that  those  with  large  winnings  were 
not  playing  different  games  from  those  with  small 
earnings  and  thus  might  have  different  personal 
characteristics  that  could  be  correlated  with  how 
the  money  was  spent  or  with  political  attitudes?  In 
the  regression  discontinuity  study  of  incumbency 
mentioned  earlier,  the  electoral  margins  were 
taken  as  50%  ±  2.5%.  Is  assignment  to  incum¬ 
bency  or  nonincumbency  still  random  within  that 
range?  And  what  about  a  margin  of  ±1%  or 
±5%?  With  matching  methods,  one  must  be  sure 
there  are  no  unobservables  omitted  from  the  pro¬ 
pensity  score  that  might  be  correlated  with  the 
treatment.  Matching  procedures  can  also  be  com¬ 
promised  by  subtle  biases  in  the  selection  pro¬ 
cesses  and  the  failure  of  those  in  the  treatment 
group  to  actually  receive  the  treatment.  Subjects 
reachable  by  phone  may  have  different  character¬ 
istics  from  those  who  are  not,  and  receiving  a 
mailing  is  not  the  same  thing  as  receiving  and 
reading  a  mailing. 

There  is  an  important  aspect  to  the  decision 
about  what  method  to  employ  that  is  not  captured 
in  the  debates  about  the  limitations  of  observa¬ 
tional  data,  regressions,  experiments,  and  the 
methods  that  mimic  them.  This  is  the  match,  or 
mismatch,  between  the  proper  causal  model,  the 
available  data,  and  the  empirical  method.  For 
models  with  additive  and  separable  variables,  such 
as  Equations  4  and  5,  experiments,  when  possible, 
and  the  methods  that  mimic  them  with  observa¬ 
tional  data  are  indeed  powerful  tools  that  should 
be  and  are  used  regularly.  But,  as  discussed  here, 
there  are  many  models  of  important  substantive 


phenomena  in  the  social  sciences  and  many  types 
of  data  for  which  these  methods  are  inadequate. 
As  always  in  the  social  sciences,  and  maybe  other 
sciences  as  well,  this  situation  leads  to  some  less 
than  desirable  choices.  One  can  choose  to  study 
phenomena  that  are  suited  to  particular  methods 
because  of  their  desirable  properties,  for  example, 
sticking  with  phenomena  that  can  be  studied 
experimentally  or  with  matching  techniques.  But 
this  means  choosing  substantive  topics  that  are 
well  modeled  by  the  additive  and  separable  equa¬ 
tions  and  the  SUTVA  condition.  One  can  choose 
to  use  methods  that  are  powerful  in  one  setting  to 
examine  phenomena  that  are  better  represented 
by  more  complex  models,  in  the  hope  that  the 
power  of  the  method  trumps  the  cost  of  having  an 
inadequate  model.  Or  one  can  examine  substan¬ 
tive  questions  of  interest  with  the  appropriate 
models  and  the  best  available  methods  and  data, 
recognizing  the  limitations  of  each.  Good  science 
requires  a  combination  of  the  three,  with  a  con¬ 
scious  and  modest  admission  of  their  limitations. 
It  is  also  the  case  that  important  methodological 
advances  have  come  from  efforts  to  advance  the 
third  strategy. 

Particularly  in  the  case  of  the  third  strategy, 
where  the  substantive  questions  require  complex 
models,  where  the  available  and  observed  data 
may  be  far  from  perfect,  and  where  the  “best” 
methods  are  weak,  the  process  of  building  confi¬ 
dence  in  causal  statements  and  applications  will 
be  slow  and  highly  contested.  Scientists  would  do 
well  to  follow  the  description  of  this  process 
offered  by  Arthur  Stinchcombe  quite  a  while  ago, 
which  every  empirical  researcher  should  read.  He 
is  very  emphatic  that  not  only  can  empirical 
research  not  prove  theories,  it  seldom  rejects  theo¬ 
ries.  What  it  does  do  is  lend  credibility  to  theories. 
This  credibility  is  enhanced  with  a  cumulation  of 
studies  in  multiple  settings  and  with  multiple 
types  of  data  and  methods.  For  some  questions 
and  evidence,  this  credibility  may  quite  high.  But 
for  many  others,  because  of  the  complexity  of  the 
causal  process  that  connects  X  to  Y  of  the  avail¬ 
able  data,  or  because  of  the  limitations  of  even  the 
most  appropriate  best  methods,  this  credibility 
may  remain  low  and  contestable.  In  this  context, 
proposals  for  another  study,  for  a  new  design,  or 
for  the  use  of  a  new  method  are  a  critical  part  of 
normal  science.  As  is  some  honest  modesty  about 
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what  is  and  can  be  learned  from  each  study  and 

with  each  method. 
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Within  the  discipline  of  political  science,  the  study 
of  security  was,  until  the  1990s,  dominated  by 
strategic  studies.  The  end  of  the  Cold  War  dealt  a 
significant  blow  to  the  legitimacy  of  strategic  stud¬ 
ies  as  the  centerpiece  of  the  study  of  security, 
resulting  in  the  transformation  of  the  field  into 
what  we  now  know  as  security  studies.  This  entry 
traces  the  history  of  that  transformation,  begin¬ 
ning  with  the  evolution  of  strategic  studies,  its 
challengers,  and  the  emergence  of  security  studies 
as  the  new  moniker  for  the  field.  It  begins  with  the 
so-called  golden  age  of  strategic  studies,  in  which 
the  central  concepts  of  Cold  War  security  and 
defense  were  elaborated,  and  in  particular,  intel¬ 
lectual  work  was  performed  to  integrate  nuclear 
weapons  into  the  discussion  on  national  defense. 
It  then  considers  the  other  two  periods  of  Cold 
War  strategic  studies:  the  first  characterized  by  the 
operationalization  of  arms  control  in  the  period  of 
detente  and  the  second  coincident  with  the  period 
known  as  the  Second  Cold  War.  As  the  Second 
Cold  War  swiftly  gave  way  to  the  rapid  transfor¬ 
mation  of  Europe  and  the  Cold  War’s  end,  the 
criticisms  of  strategic  studies  that  had  been  voiced 
throughout  its  history  gained  greater  purchase 
within  the  discipline.  The  entry  traces  first  the 
emergence  of  security  studies  in  the  1990s  as  a 
reformed  study  of  security  after  the  Cold  War’s 
end  and  then  the  changes  wrought  to  the  field  by 
the  events  of  September  11,  2001.  The  final  sec¬ 
tion  in  the  entry  considers  the  state  of  the  field 
today. 

The  close  connection  of  the  periodization  of 
strategic  and  security  studies  with  the  major  events 
of  the  postwar  world  is  no  accident.  Security  stud¬ 
ies  is  a  policy  science  and  was  intimately  connected 
to  the  creation  of  the  national  security  state  in  the 
United  States  following  World  War  II.  The  close 
connection  of  academic  disciplines  and  the  security 
state  is  by  no  means  unique  to  strategic  studies  and 
has  been  particularly  noted  in  the  creation  of  area 
studies  within  political  science,  but  the  ties  were 
particularly  overt  in  the  case  of  strategic  studies. 
While  strategic  studies  developed  as  a  recognized 
academic  discipline  within  the  universities,  it  was 
always  closely  tied  to  military  institutions,  as  in 
the  case  of  the  RAND  Corporation,  formed  to 
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conduct  research  for  the  U.S.  Air  Force.  As  a  pol¬ 
icy  science  with  a  military  focus,  strategic  studies 
was  underpinned  theoretically  by  the  political  real¬ 
ism  that  emerged  in  the  1950s  as  the  dominant 
approach  to  the  study  of  international  relations 
(IR)  in  the  English-speaking  world.  Even  as  the 
marginal  variant  of  realism,  liberal  institutional¬ 
ism,  grew  in  importance  in  the  1970s  and  1980s, 
strategic  studies  remained  largely  immune,  and 
was,  indeed,  at  the  heart  of  the  reassertion  of  a 
parsimonious  “neorealism”  in  the  period.  With  the 
development  of  security  studies,  theoretical  plural¬ 
ism  has  developed  in  the  field,  with  the  various 
theoretical  traditions  of  IR  reflected  in  security 
studies.  This  pluralism  has  included  the  develop¬ 
ment  of  what  has  become  termed  critical  security 
studies ,  drawing  the  traditions  of  critical  social 
theory  into  the  study  of  security — which  has,  in 
turn,  led  to  the  suggestion  that  security  itself  be 
abandoned. 

Strategic  Studies 

In  the  aftermath  of  World  War  II,  the  United  States 
undertook  one  of  the  largest  reorganizations  of  its 
state  institutions  in  its  history — a  reorganization 
exceeded  only  by  that  following  the  events  of 
September  11,  2001.  The  centerpiece  of  this  reor¬ 
ganization  was  the  National  Security  Act  of  1947, 
but  that  was  only  the  most  prominent  piece  of 
legislation  among  a  range  of  legislative  and  execu¬ 
tive  actions  that  collectively  created  the  U.S. 
national  security  state.  Explaining  the  nature  and 
meaning  of  this  transformation  of  the  U.S.  state 
depends  on  the  theoretical  position  taken  to  look  at 
it.  The  overt  explanation,  accepted  by  most  in  the 
mainstream  of  IR  and  reflected  in  strategic  studies, 
is  that  the  United  States  was  responding  to  the 
emergence  of  a  global  threat  posed  by  its  erstwhile 
ally,  the  Soviet  Union.  Based  on  this  reading,  the 
National  Security  Act  (1947)  and  its  attendant 
reorganization  of  the  U.S.  state  were  strategic 
responses  to  the  security  threat  of  the  former  USSR, 
designed  to  achieve  “national  security”  for  the 
United  States.  One  problem  with  this  explanation 
is  that  the  very  idea  of  “national  security”  was  a 
product  of  the  same  processes  that  gave  rise  to  the 
national  security  state  in  this  period.  Scholars 
informed  by  critical  theories  have  therefore  argued 
that  the  national  security  state  and  the  threat  to 


which  it  purports  to  respond  were  both  a  creation 
of  the  same  practices,  designed  to  maintain  an 
increasingly  globalized  American  liberal  capitalism. 

On  either  account  of  the  creation  of  the  national 
security  state,  the  development  of  strategic  studies 
paralleled  the  state’s  transformation  and  sup¬ 
ported  the  goals  of  U.S.  national  security.  The 
nature  of  the  enterprise,  however,  is  seen  rather 
differently  in  each  account.  The  mainstream  view 
is  that  strategic  studies  developed  as  a  policy¬ 
relevant  science  in  response  to  the  same  problem 
that  gave  rise  to  the  transformation  of  the  U.S. 
state:  the  growing  animosity  between  the  United 
States  and  the  former  USSR  and  the  addition  of 
nuclear  weapons  to  the  traditional  great  power 
conflict.  The  goal  of  strategic  studies,  based  on  this 
view,  was  to  build  on  previous  work  on  great 
power  relations  and  the  nature  of  warfare  in  order 
to  understand  the  dynamics  of  the  emergent  Cold 
War  and  to  develop  concepts  for  understanding 
the  application  of  force  and  the  provision  of 
defense  in  the  nuclear  era.  Critically  informed 
scholars  reject  this  reactive  view  of  the  strategic 
studies  field  and  see  rather  that  the  conceptual 
development  performed  by  strategic  studies  was 
part  of  the  production  of  a  particular  kind  of 
United  States,  a  particular  kind  of  Soviet  Union  as 
its  enemy,  and  of  the  specific  form  the  nuclear¬ 
armed  Cold  War  took.  This  entry  will  provide  an 
account  largely  consonant  with  the  critical  view  of 
the  place  of  strategic  studies,  both  because  it  is 
more  compelling  and  because  the  mainstream 
account  is  easily  read  through  the  critical  one. 

Strategic  Studies’  Golden  Age 

The  “golden  age”  of  strategic  studies  was  dom¬ 
inated  by  the  question  of  nuclear  weapons.  Indeed, 
nuclear  weapons  provided  the  raison  d’etre  for 
strategic  studies  as  a  separate,  civilian,  academic 
enterprise  in  the  context  of  the  early  Cold  War  and 
the  production  of  the  national  security  state.  The 
problem  strategic  studies  set  out  to  solve  was  how 
to  normalize  nuclear  weapons — that  is,  how  to 
take  a  technology  whose  destructive  power  was 
orders  of  magnitude  greater  than  any  that  had 
preceded  it  and  make  of  it  a  usable  military  and 
political  instrument.  The  scale  of  the  problem  was 
magnified  soon  after  the  field’s  founding  when  the 
United  States  successfully  tested  a  thermonuclear 
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device  in  1954,  making  the  explosive  potential  of 
nuclear  arms  all  but  unlimited.  The  explosives  that 
destroyed  Hiroshima  and  Nagasaki  were  roughly 
the  equivalent  of  25,000  tons  of  high  explosive;  the 
thermonuclear  devices  that  were  developed  after 
that  initial  test  in  1954  are  measured  in  the  mil¬ 
lions  of  tons  of  equivalent  trinitrotoluene  (TNT). 
Plotting  how  to  make  something  this  destructive 
appear  meaningfully  as  a  “weapon”  was  truly,  in 
Herman  Kahn’s  evocative  phrase,  “thinking  about 
the  unthinkable.” 

Deterrence 

The  primary  solution  that  was  developed  by 
strategic  studies  to  the  problem  of  nuclear  weap¬ 
ons  and  national  security  was  the  theory  of  deter¬ 
rence.  The  basic  concept  of  deterrence  is  quite 
simple,  as  it  involves  issuing  a  threat  of  some  kind 
in  an  attempt  to  prevent  another  from  doing  some¬ 
thing  that  they  would  otherwise  do.  Even  as  a 
military  concept,  deterrence  much  predates  the 
nuclear  age,  as  the  issuing  of  threats  is  a  common 
military  tactic.  Nuclear  deterrence  is  different  in 
two  ways,  however.  First  of  all,  strategic  studies 
reformulated  deterrence  as  a  strategy  rather  than  a 
tactic,  and  second,  it  developed  deterrence  to  be 
mutual. 

The  elevation  of  deterrence  from  tactic  to  strat¬ 
egy  solved  the  central  problem  of  nuclear  weapons 
for  the  national  security  state.  A  strategy  of  deter¬ 
rence  placed  nuclear  devices  at  the  heart  of  the 
military  (and  political)  organization  but  structured 
the  system  of  management  in  such  a  way  that  the 
devices  did  not  need  to  be  detonated  to  be  useful. 
Deterrence  produced  nuclear  devices  as  “weap¬ 
ons”  in  name  only,  as  they  were  intended  only  to 
be  used  as  a  threat;  the  actual  use  of  nuclear  weap¬ 
ons  would  mark  the  failure  of  deterrence. 
Deterrence  is  therefore  an  elegant  solution  to  the 
problem  of  vast  destructive  capability  being  framed 
as  “weapons,”  because  it  produces  a  strategy  that 
never  actually  calls  on  the  explosive  potential  of 
the  devices  to  be  realized. 

As  elegant  as  the  deterrence  solution  is  to  the 
nuclear  problem,  it  raises  a  number  of  problems  of 
its  own,  not  the  least  of  which  is  that  it  frees  nuclear 
weapons  from  any  rational  constraints.  A  “nor¬ 
mal”  explosive  weapon,  the  purpose  of  which  is  to 
be  detonated  in  the  course  of  a  military  campaign 


to  achieve  certain  ends,  is  limited  by  its  tactical 
uses  and  the  ends  to  be  sought.  Those  uses  and 
ends  will  largely  dictate  both  the  size  of  the  explo¬ 
sive  needed  and  the  number  of  explosives  in  a 
usable  arsenal.  Together  with  some  assumptions 
about  the  likely  scale  and  duration  of  any  conflict, 
military  planners  will  have  a  fair  sense  of  the  nec¬ 
essary  development  of  any  given  weapon.  By  con¬ 
trast,  deterrence  produces  nuclear  weapons  simply 
as  the  materialization  of  threat,  rather  than  as 
explosives  to  be  deployed,  and  so  more  and  bigger 
are  likely  to  be  better  as  they  are  even  more  threat¬ 
ening  than  smaller  and  fewer. 

Strategic  studies  provided  one  solution  to  the 
problem  of  constraint  in  developing  nuclear  deter¬ 
rence  as  mutual.  Without  nuclear  weapons,  deter¬ 
rence  is  generally  a  unilateral  tactic:  The  relatively 
larger,  stronger,  more  capable  potential  combatant 
can  threaten  the  smaller  and  weaker  to  shape  the 
latter’s  actions,  but  the  smaller,  weaker  combatant 
is  unlikely  to  be  able  to  threaten  the  stronger  suf¬ 
ficiently  to  deter.  Nuclear  weapons  were  seen  to 
change  this  simple  equation,  because  if  both  sides 
in  a  conflict  had  them,  they  would  be  able  to  inflict 
tremendous  damage  on  each  other  without  one 
having  first  to  “win”  on  the  battlefield.  This  obser¬ 
vation  led  to  one  of  the  most  memorable  products 
of  golden  age  strategic  studies:  mutually  assured 
destruction,  or  MAD.  MAD  was  not  considered  a 
strategy  but  rather  a  condition.  With  enough 
weapons  and  delivery  systems,  both  the  United 
States  and  the  Soviet  Union  would  be  able  to  inflict 
untold  damage  on  each  other,  at  the  same  time, 
regardless  of  anything  either  tried  to  do  to  defend 
itself. 

Most  of  the  work  of  golden  age  strategic  studies 
was  devoted  to  operationalizing  a  strategy  of 
deterrence  in  conditions  of  MAD.  Central  to  this 
effort  was  the  concept  of  “strategic  balance,” 
which  provided  an  answer  to  the  problem  of  lim¬ 
its.  Deterrence,  strategic  studies  argued,  would  be 
workable  in  conditions  of  MAD  if  the  two  sides 
were  roughly  equivalent.  It  is  worth  noting  that 
this  conclusion  does  not  derive  from  the  logic  of 
nuclear  deterrence  under  MAD,  where  all  that  is 
necessary  is  to  retain  the  capacity  to  devastate  an 
opponent  under  any  conditions,  and  so  the  relative 
size  of  the  arsenals  is  irrelevant.  However,  a  strate¬ 
gic  imbalance  in  which  the  former  USSR  held  more 
weapons  than  the  United  States  was  politically 
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unacceptable  and  so  was  never  seriously  counte¬ 
nanced  in  strategic  studies,  an  indication  of  the 
mutually  constitutive  relationship  between  the 
study  of  strategy  and  the  production  and  mainte¬ 
nance  of  the  national  security  state. 

While  strategic  stability  provided  a  relative 
solution  to  the  problem  of  limits,  it  provided  no 
guidance  in  absolute  terms.  As  long  as  there  was 
rough  parity  and  the  two  arsenals  were  configured 
in  such  a  way  that  MAD  obtained  (i.e.,  each 
retained  a  so-called  survivable  second-strike  capa¬ 
bility),  the  absolute  numbers  of  arms  could 
continue  to  climb.  The  solution  to  the  arms  race 
problem  was  the  second  major  output  of  strategic 
studies’  golden  age:  the  theory  of  arms  control. 

Arms  Control 

While  deterrence  theory  provided  no  upper  limit 
on  the  level  at  which  strategic  stability  could  be 
achieved,  it  did  have  consequences  for  lower  limits. 
The  most  important  of  these  was  that  zero  was  not 
an  option:  Eliminating  nuclear  weapons  does  not 
allow  for  strategic  stability  in  the  terms  developed 
by  strategic  studies.  Before  the  emergence  of  strate¬ 
gic  studies,  attempts  to  limit  arms  were  generally 
conceived  in  terms  of  disarmament — the  United 
States  had  even  attempted  to  entice  the  former 
USSR  into  mutual  nuclear  disarmament  under  the 
United  Nations  (UN)  control  with  the  Baruch  Plan 
of  1946.  With  the  elaboration  of  nuclear  deter¬ 
rence  theory  and  the  demonstrated  ability  of  the 
former  USSR  to  develop  and  build  nuclear  weap¬ 
ons,  seemingly  at  the  same  pace  as  the  United 
States,  strategic  studies  took  up  the  problem  of 
devising  limits  to  their  development. 

Through  the  latter  half  of  the  1950s,  following 
the  test  explosion  of  a  Soviet  thermonuclear  device 
in  1955,  strategic  studies  turned  its  collective 
attention  to  the  question  of  controlling  the  devel¬ 
opment  of  nuclear  weapons.  This  effort  culmi¬ 
nated  in  the  special  issue  of  the  journal  Daedalus 
in  1960,  which  was  described  by  the  issue’s  editor 
as  a  “handbook  on  the  problems  of  arms  control 
and  national  policy.” 

The  two  outputs  of  the  golden  age  of  strategic 
studies  were  intimately  interconnected,  as  arms 
control  was  developed  as  a  means  of  achieving  the 
balance  required  of  “strategic  stability,”  which  was 
in  turn  the  necessary  condition  for  the  successful 


functioning  of  deterrence.  The  handbook  that  stra¬ 
tegic  studies  devised  was  then  put  into  operation 
by  the  U.S.  state  throughout  the  1960s,  producing 
a  series  of  bilateral  and  multilateral  arms  control 
agreements,  culminating  with  the  Nuclear  Non- 
Proliferation  Treaty  (NPT,  1968)  on  the  multilat¬ 
eral  side  and  the  Strategic  Arms  Limitation  Treaty 
(SALT,  1972)  on  the  bilateral  one. 

While  both  the  theory  of  deterrence  and  that  of 
arms  control  were  seen  to  be  scholarly  answers  to 
objective  problems,  they  were  rather  more  political 
programs  that  required  the  mutual  consent  of 
those  operating  them  to  function.  What  this  meant 
in  particular  was  that  the  managers  of  Soviet  mili¬ 
tary  and  foreign  policies  needed  to  accept  the  basic 
tenets  of  deterrence  for  both  it  and  arms  control  to 
function,  and  initially  they  did  not.  Deterrence 
theory  is  more  formally  termed  rational-deterrence 
theory,  and  the  modifier  “rational”  is  important. 
To  develop  its  impressive  edifice  of  theory,  strate¬ 
gic  studies  assumed  that  decisions  were  taken  by 
strategically  rational  men — that  is,  by  applying  a 
simple  cost-benefit  analysis  to  the  situation  in 
which  the  decision  makers  find  themselves — and  it 
further  assumed  that  the  costs  and  benefits  associ¬ 
ated  with  nuclear  war  were  fairly  clear.  Those 
responsible  for  Soviet  policy,  however,  had  been 
trained  in  a  tradition  quite  different  from  that  of 
liberal  economics,  which  gave  rise  to  the  parsimo¬ 
nious  assumptions  of  strategic  rationality  and  to 
the  game  theories  that  were  derived  from  these 
assumptions,  which,  in  turn,  informed  the  strategy 
of  deterrence. 

Lor  deterrence  to  function,  therefore,  the  United 
States  needed  not  only  to  build  weapons  and  issue 
the  appropriate  threats  but  also  to  educate  the 
Soviet  Union  in  the  intricacies  of  strategic  rational¬ 
ity  and  the  theory  of  deterrence  that  flowed  from 
it.  The  arms  control  negotiations  that  followed 
strategic  studies’  development  of  an  arms  control 
theory  in  the  1950s  therefore  served  two  purposes 
in  facilitating  the  functioning  of  deterrence.  The 
first,  overt  function  was  to  reach  agreements  on 
limits  to  the  size  and  structure  of  the  two  sides’ 
nuclear  arsenals,  creating  the  much  sought  strate¬ 
gic  stability.  However,  at  least  as  important,  if  not 
more  so,  was  the  educative  function  of  the  negotia¬ 
tions,  as  the  U.S.  delegations  socialized  their  Soviet 
counterparts  into  the  nature  and  functioning  of 
deterrence  theory.  This  was,  perhaps,  strategic 
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studies’  greatest  contribution  to  the  successful 
functioning  of  the  U.S.  national  security  state — 
that  is,  the  development  of  a  body  of  concepts  that 
could  be  successfully  communicated  to  the  Soviet 
Union  to  construct  it  as  the  particular  enemy  called 
for  in  the  security  doctrines  of  the  United  States.  It 
was  truly  a  singular  achievement. 

Detente  and  a  New  Cold  War 

The  successful  SALT  negotiations  ushered  in  a 
period  of  detente  between  the  Soviet  Union  and 
United  States  and  seemingly  reduced  the  urgency 
of  the  problems  that  defined  strategic  studies  as  a 
field  of  inquiry.  In  the  broader  discipline  of  IR,  this 
period  was  marked  by  the  growth  of  liberal  insti¬ 
tutionalism  as  a  challenge  to  political  realism  and 
a  reorientation  of  the  discipline  toward  issues  of 
political  economy  and  international  cooperation 
rather  than  those  of  security  and  conflict.  Interest 
in  strategic  studies  waned  through  the  1970s, 
although  the  institutional  basis  in  both  universities 
and  think  tanks  was  not  seriously  eroded,  and  with 
the  return  of  the  Cold  War  following  the  election 
of  Ronald  Reagan  to  the  U.S.  presidency  in  1980, 
strategic  studies  experienced  what  Stephen  Walt 
has  termed  a  renaissance. 

As  with  the  golden  age,  the  renaissance  in  stra¬ 
tegic  studies  was  driven  by  the  complicity  of  the 
academic  field  with  the  security  policies  of  the 
United  States.  The  Reagan  administration  ushered 
in  what  is  generally  termed  the  Second  Cold  War, 
a  period  of  heightened  rhetorical  tension  between 
the  United  States  and  the  former  USSR  and  of  stag¬ 
gering  growth  in  U.S.  defense  spending.  In  particu¬ 
lar,  two  elements  of  the  Reagan  foreign  policy 
required  the  intellectual  talents  of  strategic  studies: 
(1)  dissatisfaction  with  a  deterrence  theory  that 
pronounced  any  actual  use  of  nuclear  weapons  as 
a  failure  and  (2)  a  closely  related  desire  to  escape 
from  nuclear  vulnerability  by  technical  means. 

There  are  a  number  of  terms  used  to  denote  the 
first  of  these  changes  in  U.S.  state  attitude.  The 
most  official  is  the  term  countervailing  strategy, 
but  its  opponents  attempted  to  offset  the  poor  pub¬ 
lic  relations  of  MAD  by  terming  it  the  Nuclear 
Utilization  Targeting  Strategy,  or  NUTS.  Whatever 
term  is  used,  it  denotes  a  change  in  U.S.  military 
thinking  about  the  way  in  which  nuclear  weapons 
would  be  used  in  times  of  war.  The  golden  age 


answer  to  this  question  had  been  that  nuclear 
weapons  should  be  held  as  a  threat  and  should  only 
be  used  in  the  case  of  deterrence  failure  to  punish 
the  Soviet  Union  for  not  heeding  those  threats.  The 
countervailing  strategy,  or  NUTS,  sought  to 
develop  a  tactical  doctrine  for  nuclear  weapons, 
giving  them  a  real  military  mission,  with  the  ulti¬ 
mate  goal  not  just  of  deterring  the  Soviet  Union  but 
of  doing  so  by  planning  to  win  a  nuclear  war. 

The  shift  to  a  countervailing  strategy,  which 
triggered  the  renaissance  in  strategic  studies,  is 
associated  with  the  bellicose  Reagan  administra¬ 
tion,  but  in  fact,  it  was  put  into  motion  during  the 
Carter  administration.  It  represents,  therefore,  not 
a  partisan  shift  in  policy  as  much  as  a  reinvigora- 
tion  of  the  national  security  state,  and,  as  with  the 
foundation  of  that  state  in  the  1940s  and  1950s, 
strategic  studies  undertook  much  of  the  necessary 
intellectual  work.  Strategic  studies  through  the 
1980s  was  dominated  by  literature  working 
through  the  problems  posed  by  the  countervailing 
strategy,  testing  them  through  the  rigors  of  aca¬ 
demic  contestation  by  those  still  committed  to 
deterrence  strategies  that  were  not  so  overtly  war¬ 
fighting  strategies. 

The  second  prompt  to  the  renaissance  of  strate¬ 
gic  studies  was  rather  more  partisan  in  its  origins, 
though  it  reflected  the  flip  side  of  the  move  to  a 
war-fighting  strategy  for  nuclear  deterrence: 
Reagan’s  Strategic  Defense  Initiative  (SDI).  In  the 
often-quoted  words  of  the  presidential  speech  that 
launched  SDI  in  March  1983,  the  initiative  was 
designed  to  render  nuclear  weapons  “impotent 
and  obsolete”  by  providing  a  defensive  shield 
against  incoming  Soviet  nuclear  missiles.  The  stra¬ 
tegic  implications  of  a  defense  of  this  kind  would 
have  been  extraordinary,  though  nothing  com¬ 
pared  with  the  technical  achievements  required  to 
make  it  happen.  The  twin  challenges  of  technology 
and  strategy  drove  a  second  constituent  of  the  stra¬ 
tegic  studies  renaissance  of  the  1980s,  as  analysts 
grappled  with  how  to  realize  Reagan’s  dream  and 
also  questioned  whether  or  not  it  was  a  dream  that 
should  ultimately  be  pursued. 

The  renaissance  of  strategic  studies  also  marks 
the  apogee  of  the  field  as  the  heart  of  the  study  of 
security.  Critics  of  strategic  studies  in  the  1980s 
had  increasing  grist  for  their  mills,  as  the  patently 
absurd  “winnable  nuclear  war”  was  taken  up  as  a 
serious  issue  for  scholarly  debate,  and  similarly, 
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the  science  fiction  dream  of  an  impenetrable  mis¬ 
sile  shield  was  actively  pursued.  While  these  two 
intellectual  pursuits  undermined  the  legitimacy  of 
strategic  studies,  the  ground  on  which  it  had  been 
built  was  unceremoniously  removed  from  under¬ 
neath  the  field  by  the  end  of  the  Cold  War. 

The  Critics  of  Strategic  Studies 

Given  the  close  connection  of  strategic  studies 
to  the  national  security  system  in  the  United  States, 
it  is  not  surprising  that  much  of  the  criticism  of  the 
field  came  from  outside  the  United  States.  There 
were  certainly  criticisms  of  strategic  studies  raised 
within  the  United  States  in  the  1980s,  particularly 
from  outside  the  discipline,  but  many  of  those  with 
the  deepest  resonance  came  from  scholars  based  in 
the  United  Kingdom  (UK)  and  continental  Europe. 

One  of  the  most  extensive  critiques  of  strategic 
studies  was  developed  institutionally  around  peace 
studies.  Peace  studies,  or  peace  research,  as  an 
alternative  discipline  to  strategic  studies  was  largely 
concentrated  in  Scandinavia  (notably  at  the 
Stockholm  Peace  Research  Institute  and  the  Peace 
Research  Institute  of  Oslo)  and  in  the  UK.  Peace 
studies  grew  alongside  strategic  studies  in  the  post¬ 
war  period,  and  where  strategic  studies  were 
largely  the  product  of  political  science,  peace  stud¬ 
ies  attracted  substantial  involvement  from  natural 
scientists,  particularly  nuclear  scientists,  who  were 
concerned  with  the  destructive  potential  their 
work  had  produced.  It  stressed  pacifism  in  its 
approach  to  foreign  policy  and  rejected  the  ratio¬ 
nal  and  instrumental  approach  to  nuclear  weapons 
adopted  by  strategic  studies.  In  particular,  peace 
studies  argued  that  the  potential  devastation  of 
nuclear  conflict — including  the  potential  extinc¬ 
tion  of  the  human  species,  what  Jonathan  Schell  in 
an  influential  text  called  “second  death” — meant 
that  rational  cost-benefit  analysis  was  inappropri¬ 
ate  and,  particularly,  that  nuclear  explosives  had 
to  be  conceived  as  something  other  than  military 
weapons.  The  peace  studies’  critique  did  not  gain 
much  purchase  within  political  science  until  the 
1980s,  when  the  MAD  versus  NUTS  debate  made 
the  peace  studies’  position  considerably  more  gen¬ 
erally  comprehensible. 

A  second  important  line  of  critical  argument 
was  advanced  by  Ken  Booth  in  the  UK  in  an  impor¬ 
tant  book,  Strategy  and  Ethno centrism.  Booth  also 


took  aim  at  the  instrumental  rationality  of  strategic 
studies,  arguing  that  it  represented  a  construct  of  a 
particular  Western  philosophical  tradition.  Decision 
makers  outside  that  tradition,  he  argued,  would 
not  necessarily  accept  the  arguments  underpinning 
rational  deterrence  theory,  nor,  therefore,  would 
they  act  in  the  ways  predicted  by  strategic  studies. 
The  need  to  socialize  the  Soviet  Union  into  the 
logic  of  deterrence  is  a  clear  example  of  the  sort  of 
problem  Booth’s  work  raised. 

A  third  line  of  criticism  that  has  informed  much 
of  what  followed  in  security  studies  after  the  end 
of  the  Cold  War  took  issue  with  the  narrowly 
military  focus  of  strategic  studies.  The  most  prom¬ 
inent  exponent  of  this  criticism  was  Barry  Buzan  in 
his  noted  work  People,  States  &  Fear.  Buzan 
argued  that  while  military  defense  was,  of  course, 
important  to  the  security  of  states,  there  were  a 
number  of  other  “sectors”  in  which  states  faced 
threats  from  which  they  needed  to  be  secured. 
These  sectors — political,  economic,  societal,  and 
environmental,  in  addition  to  the  military — call  for 
security  strategies  that  are  not  necessarily  (indeed, 
not  usually)  military  and  thus  for  a  study  of  secu¬ 
rity  that  is  potentially  quite  different  from  that 
found  in  strategic  studies. 

Taken  together,  these  and  related  criticisms  of 
strategic  studies  informed  a  significant  transforma¬ 
tion  in  the  study  of  security  within  political  science 
following  the  end  of  the  Cold  War.  Strategic  stud¬ 
ies  was  a  creature  of  the  Cold  War  in  a  number  of 
quite  fundamental  ways.  The  national  security 
state  that  it  serviced  was  built  for  the  Cold  War, 
and  the  principal  research  focus  of  strategic  studies 
was  the  Cold  War  nuclear  contest  between  the 
East  and  the  West.  Even  more  significant  than  this, 
however,  was  the  evident  failure  of  political  real¬ 
ism  to  anticipate  or  even  explain  the  end  of  the 
Cold  War.  Realism  was  the  theoretical  foundation 
of  strategic  studies,  and  as  the  end  of  the  Cold  War 
undermined  realism’s  claims,  so  too  did  it  under¬ 
mine  the  legitimacy  and  supremacy  of  strategic 
studies  as  an  approach  to  security. 

Security  Studies 

The  end  of  the  Cold  War  appeared  to  transform 
the  global  security  environment  fundamentally. 
The  contest  between  the  Soviet  Union  and  the 
United  States  had  defined  the  international  security 
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environment  since  the  end  of  World  War  II.  Its 
passage  seemed  to  herald  an  era  in  which  coopera¬ 
tion  rather  than  conflict  might  define  global  secu¬ 
rity,  in  which  direct  military  threats  to  the  future 
of  the  state  might  recede  in  importance,  and  in 
which  security  was  divisible — that  is,  not  every 
security  issue  worldwide  was  connected  to  a  cen¬ 
tral  great  power  rivalry.  Furthermore,  the  appar¬ 
ent  changes  to  a  more  peaceful  global  order  led 
many  in  both  the  West  and  the  East  to  seek  a 
“peace  dividend,”  an  opportunity  to  redirect 
resources  away  from  military  security  and  toward 
other  social  goals. 

Although  the  end  of  the  Cold  War  was  clearly  a 
tremendous  opportunity,  it  also  posed  a  difficult 
problem  for  the  management  of  international  secu¬ 
rity.  The  Cold  War  had  guided  policy  practitioners 
in  the  practice  of  security  policy — they  knew  what 
the  risks  were,  and  they  knew  how  to  “go  on”  in 
conditions  of  Cold  War.  Indeed,  providing  the 
stock  body  of  answers  to  the  nature  of  the  risks 
and  the  best  means  of  response  was  precisely  what 
strategic  studies  provided  to  the  national  security 
state  through  the  latter  half  of  the  20th  century. 
Without  the  Cold  War  context,  neither  the  risks 
nor  the  strategies  were  accepted  as  understood, 
and  what  is  more,  the  discipline  that  had  been 
providing  the  answers  was  undermined  by  its  own 
failure  to  comprehend  this  extraordinarily  signifi¬ 
cant  change  in  its  own  object  of  study. 

In  this  context  of  global  political  change,  pol¬ 
icy  uncertainly,  and  intellectual  fluidity,  there 
grew  a  series  of  movements  to  redefine  security, 
to  understand  it  in  some  new  way,  and  to  provide 
a  reconfigured  study  of  security  for  the  new  era. 
These  movements  have  given  rise  to  what  is  now 
called,  in  political  science,  security  studies,  which 
is  a  much  broader,  theoretically  plural,  and  inter¬ 
disciplinary  activity  compared  with  strategic  stud¬ 
ies,  which  both  preceded  it  and  continues  within 
it.  The  fragmentation  and  flowering  of  security 
studies  began  with  a  simple  question:  What  is 
security? 

What  Is  Security  ? 

Booth  (1991)  characterized  the  study  of  security 
during  the  Cold  War  as  “looking  at  world  politics 
through  a  missile-tube  and  gun-sight”  and  said 
that  in  such  a  study  “weapons  provided  most  of 


the  questions,  and  they  provided  most  of  the 
answers — whatever  the  weapon,  whatever  the  con¬ 
text,  and  whatever  the  cost”  (pp.  315-316). 
Booth’s  caustic  comment  points  to  the  assumptions 
of  strategic  studies  in  the  Cold  War:  that  security, 
first  and  foremost,  meant  the  defense  of  the  state 
against  external  threats,  specifically  those  that 
threaten  the  continued  existence  of  the  state  itself. 
Such  existential  threats  against  the  state  are  mili¬ 
tary  threats,  particularly  nuclear  threats,  and  the 
defense  against  them  is  primarily  a  military  matter. 
In  Walt’s  (1991)  terms,  security  concerned  the 
“threat,  use  and  control  of  military  force”  (p.  212). 

Once  the  Cold  War’s  end  undermined  the  mili¬ 
tary  conception  of  both  security  and  the  study  of 
security  that  followed  from  it,  scholars  took  to 
asking  the  deceptively  simple  question  “What  is 
security?”  The  debates  that  followed  from  this 
question  rapidly  identified  two  axes  along  which 
answers  could  be  organized.  These  axes  followed 
from  challenges  to  the  two  key  elements  of  the 
traditional  answer  to  the  question  of  security  by 
strategic  studies:  that  it  was  the  protection  of  states 
from  military  threats.  The  first  axis  emerged  from 
considering  the  question  of  what  security  is  by  ask¬ 
ing,  “What  if  there  are  security  threats  to  the  state 
other  than  military  threats?”  This  refinement  on 
the  question  led  to  a  discussion  of  the  broadening 
of  security.  The  second  axis  developed  from  chal¬ 
lenging  the  first  element  of  the  traditional  answer — 
the  focus  on  states.  This  second  line  of  argument 
began  by  asking  what  happens  if  we  are  concerned 
with  securing  objects  other  than  the  state,  leading 
to  a  debate  on  deepening  security.  Post-Cold  War 
security  studies  is,  therefore,  born  from  a  debate 
characterized  by  the  broadening  and/or  deepening 
of  security. 

Broadening  Security:  Multiple 

Threats/Multiple  Responses 

The  arguments  suggesting  the  broadening  of 
security  followed  from,  or  largely  reproduced, 
Buzan’s  critique  of  strategic  studies  from  People, 
States  &  Fear.  Buzan  had  argued  that  the  state  was 
threatened  in  a  range  of  ways,  which  could  be 
organized  into  five  sectors,  of  which  the  military 
was  only  one.  The  broadening  argument  looked  at 
the  other  sectors  Buzan  outlined — the  political, 
economic,  societal,  and  environmental — to  suggest 
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that  a  post-Cold  War  security  needed  to  be 
concerned  with  the  threats  faced  in  these  areas,  at 
least  as  much  as  those  in  the  military  realm. 

There  is  an  important  consequence  of  any  move 
to  broaden  the  security  agenda  by  considering  non¬ 
military  threats  to  the  security  of  the  state:  The 
military  will  not  necessarily  be  the  most  useful  tool 
in  responding  to  those  threats.  Consider,  for  exam¬ 
ple,  the  threat  supposedly  posed  to  Western  Europe 
by  “communism”  during  the  early  Cold  War.  This 
is  an  evident  example  of  a  threat  in  the  “political” 
sector,  according  to  Buzan’s  schema:  a  threat  aimed 
at  the  organizational  stability  of  the  (in  this  case, 
liberal-capitalist)  state.  While  there  may  have  been 
a  connection  between  the  European  Communists 
and  the  Soviet  Union,  the  military  could  not  pro¬ 
vide  any  meaningful  protection  against  that  exter¬ 
nal  influence,  and  it  was  entirely  irrelevant  to  the 
ideological  threat  posed  by  the  appeal  of  commu¬ 
nist  parties  internally.  The  same  is  true  of  societal, 
economic,  and,  particularly,  environmental  threats. 

The  final  of  Buzan’s  sectors — the  environmental — 
attracted  considerable  attention  in  the  debate  over 
the  broadening  of  security  in  the  early  1990s.  A 
variety  of  issues  that  later  became  gathered  under 
the  label  of  “climate  change” — global  warming, 
rising  sea  levels,  desertification,  and  environmental 
pollution  such  as  “acid  rain” — were  gathering  sig¬ 
nificant  political  attention.  Several  of  these  were 
seen  to  pose  important  risks  to  the  state,  and  a 
literature  on  “environmental  security”  developed 
as  part  of  the  argument  for  broadening  security 
studies.  Not  only  was  the  environmental  security 
debate  important  in  its  own  right,  but  it  also  raised 
a  number  of  the  issues  that  have  defined  much  of 
the  rest  of  security  studies  since  the  early  1990s. 

The  first  question  of  the  environmental  security 
debate  was  what  sort  of  security  issue  was  posed 
to  the  state  by  environmental  change.  One  line  of 
argument  was  that  environmental  change  posed  a 
direct  threat  to  the  state,  in  much  the  same  way  as 
external  armies  did.  Under  extreme  conditions, 
climatic  change  could  threaten  the  very  existence 
of  the  state.  However,  the  conditions  under  which 
this  was  possible  seemed  extremely  rare  in  most 
instances:  The  most  common  example  raised  was 
that  of  Pacific  Island  nations  that  could  be  inun¬ 
dated  by  rising  sea  levels.  For  the  Western  states 
generally  serviced  by  security  studies,  however,  it 
was  difficult  to  imagine  such  a  direct  threat  to  their 


continued  existence.  A  second  line  of  argument 
suggested  that  the  environmental  threat  was  not 
direct  but  rather  that  climatic  change  could  pro¬ 
duce  effects  that  were  generally  recognized  to  be 
issues  of  security,  most  commonly  the  mass  move¬ 
ment  of  people  and  conflict  over  increasingly 
scarce  resources. 

The  second  line  of  environmental  security  argu¬ 
ment,  that  climatic  change  would  produce  conflict, 
did  not  fundamentally  challenge  the  traditional 
notion  of  security  as  a  military  affair  but  rather 
argued  that  military  threats  could  emerge  from 
nontraditional  locations.  This  rather  conservative 
consequence  bred  a  third  argument  within  the 
environmental  security  debate,  which  focused  on 
the  contradiction  between  military  means  and 
environmental  security.  Essentially,  the  argument 
was  that  the  military,  in  both  preparation  for  and 
prosecution  of  wars,  was  tremendously  destructive 
to  the  environment.  Environmental  security,  this 
argument  suggested,  was  therefore  best  seen  as 
protecting  the  environment  from  the  military!  By 
making  such  an  argument,  however,  its  proponents 
posed  a  fundamental  challenge  to  traditional  secu¬ 
rity  studies,  as  it  was  no  longer  the  state  that  was 
to  be  secured  but  rather,  in  this  case,  the  environ¬ 
ment  itself.  Such  a  conclusion  tied  the  “broaden¬ 
ing”  security  arguments  to  the  second  stream  of  the 
security  studies  debate  in  the  early  post-Cold  War 
period,  that  of  considering  “deepening”  security. 

Deepening  Security:  Protecting  Other  Referents 

The  “deepening”  security  discussion  began  from 
the  question  of  whether  what  should  be  secured 
(i.e.,  the  referent  object  of  security)  should  be  some¬ 
thing  other  than  the  state.  The  environmental  secu¬ 
rity  debate  led  to  the  suggestion  that  it  was  the 
environment  itself  that  needed  protection  from, 
among  other  things,  the  very  practices  that  are  gen¬ 
erally  considered  to  provide  security.  However,  a 
number  of  other  answers  to  this  question  were 
advanced  in  the  context  of  the  discussion  of  what 
security  meant.  Some  argued  that  the  state  was  a 
representative  of  a  community — usually  termed  the 
nation — and  that  what  should  be  secured  was  not 
the  state  as  such  but  rather  the  nation.  Others 
rejected  the  narrowness  of  even  thinking  of  states  as 
“nations”  and  suggested  that  security  studies  should 
be  concerned  with  the  security  of  communities, 
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regardless  of  the  terms  under  which  they  were  con¬ 
stituted.  Any  of  these  forms  of  communal  argument 
were  particularly  useful  in  drawing  attention  to 
those  circumstances  in  which  the  state,  rather  than 
providing  security  to  its  population,  was  an  active 
threat  to  at  least  some  part  of  that  population. 

The  most  influential  argument  for  broadening 
security  was  drawn  into  security  studies  from 
international  development,  specifically  the  United 
Nations  Development  Programme’s  (UNDP)  1994 
annual  report,  New  Dimensions  of  Human 
Security.  The  basic  notion  of  human  security  was 
that  individuals,  rather  than  states,  should  be  the 
referent  object  of  security.  As  the  UNDP  formu¬ 
lated  it,  human  security  was  a  very  broad  notion, 
encompassing  all  threats  to  the  lives  and  liveli¬ 
hoods  of  people.  In  many  ways,  the  UNDP’s 
conception  of  human  security  was  an  attempt  to 
reformulate  the  notion  of  development,  particu¬ 
larly  human  development,  in  terms  of  “security,” 
driven  in  part  by  a  desire  to  capture  some  of  the 
expected  peace  dividends  from  the  end  of  the  Cold 
War  for  the  global  development  agenda. 

Coming  as  it  did,  just  as  states,  and  the  security 
studies  that  serviced  them,  were  searching  for  a  way 
to  think  about  security  in  a  post-Cold  War  world, 
human  security  has  had  a  significant  impact  on  both 
policy  and  scholarship.  It  has  been  picked  up  par¬ 
ticularly  by  midranked  states  in  the  international 
system,  who  have  seen  in  it  a  means  of  forging  for 
themselves  a  significant  role  in  the  global  security 
agenda,  often  otherwise  dominated  by  the  larger 
states.  As  the  states  and  security  studies  have  turned 
to  human  security,  the  breadth  of  the  original  UNDP 
conception  has  been  lost.  Indeed,  two  competing 
conceptions  of  human  security  have  emerged  from 
this  debate,  with  Canada  championing  a  conception 
of  human  security  as  “freedom  from  fear”  and 
Japan  championing  one  of  “freedom  from  want,” 
both  of  which  are  contained  within  the  UNDP’s 
formulation.  Canada’s  formulation  has  had,  per¬ 
haps,  the  most  obvious  policy  impact,  as  it  led  to 
actions  to  ban  antipersonnel  land  mines,  a  global 
plan  of  action  on  small  arms  and  light  weapons,  and 
some  consideration  of  action  on  child  soldiers. 

Securitization 

The  attempt  by  the  UNDP  to  recast  human 
development  in  terms  of  human  security  can  be 


usefully  described  using  one  of  the  most  important 
concepts  to  be  developed  in  post-Cold  War  secu¬ 
rity  studies:  that  of  “securitization.”  In  these 
terms,  it  would  be  said  that  the  UNDP  attempted 
to  “securitize”  the  development  agenda,  or  at  least 
some  significant  part  of  it,  though  it  would  then  be 
a  question  of  whether  or  not  that  “securitizing 
move”  was  successful.  The  term  securitization  was 
introduced  into  security  studies  by  Ole  Wasver, 
who  then  later  teamed  with  Buzan  to  marry  the 
latter’s  sectoral  approach  to  securitization  in  order 
to  form  what  has  been  termed  the  Copenhagen 
School  of  security  studies.  There  are  some  deep 
tensions  between  securitization  and  Buzan’s  secu¬ 
rity  sectors,  and  so  in  addition  to  the  Copenhagen 
School,  the  idea  of  securitization  has  spawned  a 
wide  range  of  scholarship,  often  simply  called 
“securitization  studies.” 

The  idea  of  “securitization”  is  at  once  simple 
and  radical.  W sever  counsels  that  we  treat  security 
as  a  “speech  act,”  by  which  he  means  an  utterance 
that  produces  something  in  the  world  by  virtue  of 
its  being  said.  Promising  is  one  of  the  more  com¬ 
mon  examples  of  a  speech  act:  A  promise  only 
exists  by  virtue  of  someone  saying,  “I  promise.” 
W sever  suggests  that  security  only  exists  by  virtue 
of  someone  authoritatively  saying  “such  and  such 
is  an  issue  of  security”  and  (importantly)  having 
that  utterance  broadly  accepted  by  its  target  audi¬ 
ence.  The  effect  of  a  successful  securitization,  he 
argues,  is  to  remove  an  issue  from  the  realm  of 
normal  politics  and  so  make  it  possible  for  extraor¬ 
dinary  means  (often,  but  not  necessarily,  military 
means)  to  be  applied  in  response. 

The  radical  implications  of  securitization  are 
also  quite  easily  seen.  Securitization  fundamentally 
undermines  the  claims  of  political  realists,  and  the 
forms  of  security  study  that  follow  them,  that  the 
world  is  a  dangerous  place,  and  security  involves 
identifying  the  threats  “out  there”  and  responding 
appropriately.  By  contrast,  securitization  argues 
that  there  are  no  objective  security  threats;  rather, 
an  issue  becomes  a  matter  of  security  if  and  only 
if  it  is  successfully  securitized.  Security  is  socially 
produced — socially  constructed  to  use  language 
common  in  the  field — and  securitization  provides 
a  broad  conception  of  the  means  of  that  construc¬ 
tion.  Securitization  also,  therefore,  makes  security 
intensely  political.  While  successful  securitiza¬ 
tion  removes  an  issue  from  normal  politics  (it  is 
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worth  thinking  here  of  the  range  of  extraordinary 
measures  taken  in  response  to  “terrorism”  as  a 
security  threat  following  9/11),  the  act  of  securitiza¬ 
tion  itself  is  a  potential  site  of  political  contestation. 
For  strategic  studies,  by  contrast,  security  is  “too 
important”  for  politics.  What  securitization  illus¬ 
trates  is  that  this  assertion  by  the  academic  study  of 
security  is  part  of  the  securitizing  apparatus  of  the 
state,  designed  to  insulate  the  most  extreme  forms 
of  violent  state  action  from  political  opposition. 

Critical  Security  Studies 

The  treatment  of  security  as  a  social  construct 
and,  therefore  as  a  site  of  politics,  is  a  feature  of  a 
number  of  approaches  to  security  that  have  grown 
up  in  the  period  following  the  end  of  the  Cold  War, 
in  addition  to  securitization  studies.  These 
approaches  are  loosely  termed  critical  security  stud¬ 
ies,  after  a  1997  volume  of  the  same  name.  The 
volume,  and  the  conference  from  which  it  came, 
drew  a  range  of  scholars  dissatisfied  with  strategic 
studies  as  the  heart  of  the  study  of  security  and 
sought  theoretical  pluralism  among  the  disaffected. 
Theoretical  pluralism,  however,  does  not  sit  well 
with  the  professional  practices  of  academics,  and  so 
the  inclusive  home  for  opponents  to  the  mainstream 
has  divided  into  rather  more  exclusive  and  exclu¬ 
sionary  literatures,  largely  along  theoretical  lines. 

The  first  theoretical  tradition  that  gathered 
under  the  critical  security  studies  name  was  social 
constructivism,  which  had  emerged  in  the  United 
States  in  the  late  1980s.  Constructivism  was  con¬ 
cerned  with  the  role  of  ideas  in  international  poli¬ 
tics,  particularly  the  ways  identities  and  interests 
were  constructed  in  social  interaction.  Borrowing 
from  Anthony  Giddens’s  ideas  of  structuration,  IR 
constructivism  sought  to  overcome  the  debate 
between  agency  and  structure  in  IR  through  the 
reflexive  production  of  identities  and  social  institu¬ 
tions  in  world  politics.  The  constructivist  position, 
however,  did  not  accept  a  complete  break  with  the 
epistemological  realism  of  mainstream  IR,  which 
closely  allied  it  to  the  Copenhagen  School  and, 
particularly,  securitization  studies.  The  same  is  not 
true  of  the  other  two  broad  streams  of  critical  secu¬ 
rity  studies,  which  drew  on  continental  European 
critical  social  theory  for  theoretical  guidance:  The 
first  laid  claim  to  the  critical  security  studies  label 
by  virtue  of  its  drawing  its  primary  theoretical 
inspiration  from  the  critical  theory  of  the  Frankfurt 


School  and  the  related  post-Marxist  German  social 
theory;  the  second  is  a  looser  gathering  of  scholars 
informed  by  poststructural  French  social  theory. 

Critical  Theory,  with  capitals,  is  a  product  of 
the  Frankfurt  School,  which  is  both  an  institution 
and  an  intellectual  tradition,  and  is  associated 
with  a  series  of  important  social  theorists,  includ¬ 
ing  Max  Horkheimer,  Theodor  Adorno,  Herbert 
Marcuse,  and  Walter  Benjamin.  The  most  prom¬ 
inent  contemporary  social  thinker  associated 
with  the  Frankfurt  School  is  Jurgen  Habermas. 
The  ideas  of  these  thinkers  have  been  drawn  into 
the  study  of  security,  largely  by  Booth  and  his 
colleagues  at  the  University  of  Wales, 
Aberystwyth,  and  so  have  spawned  what  is 
sometimes  termed  the  Welsh  School  of  critical 
security  studies.  The  Welsh  approach  to  security 
sees  emancipatory  politics  at  the  heart  of  security 
studies — that  is,  security  studies  that  in  reality 
works  for  those  presently  oppressed  by  the  global 
capitalist  security  system,  rather  than  being  an 
academic  exercise  undertaken  by  the  national 
security  state,  which  is  the  principal  institution  of 
that  oppression. 

Progressive  political  change  is  also  a  hallmark  of 
the  French  tradition  of  social  critique.  French  social 
theory  is  not  so  clearly  organized  into  a  school  as  is 
its  German  counterpart,  but  it  is  generally  associ¬ 
ated  with  the  work  of  a  number  of  influential 
French  thinkers — Jacques  Derrida,  Michel  Foucault, 
and  Jean  Baudrillard,  among  the  most  commonly 
cited  in  security  studies — as  well  as  a  number  of 
others  with  similar  philosophical  starting  points: 
Slavoj  Zizek,  Judith  Butler,  Gilles  Deleuze,  Felix 
Guattari,  and  Giorgio  Agamben.  What  marks  out 
the  poststructural  contribution  to  security  studies  is 
an  acceptance  of  a  radical  epistemological  break 
with  the  modern  tradition,  an  epistemological 
break  that  is  resisted  by  mainstream  security  stud¬ 
ies,  constructivism,  and  even  the  Welsh  School. 
This  break,  often  termed  antifoundationalism, 
leads,  for  example,  to  an  acceptance  of  contingency 
and  contextuality  in  understanding  social  forma¬ 
tions,  including  security  institutions  and  practices. 
In  relation  to  security,  the  contributions  of  a  post- 
structural  approach  can  be  found  most  prominently 
in  the  work  of  David  Campbell,  Simon  Dalby,  and 
James  Der  Derian,  among  others.  Poststructural 
security  studies  has  also  flourished  in  response  to 
the  dramatic  reexpansion  of  the  national  security 
state  following  the  9/11  attacks. 


Strategic  (Security)  Studies  2551 


Feminism  and  Security 

Most  accounts  of  security  studies,  even  critical 
security  studies,  exclude  feminist  scholarship,  and 
yet  feminist  work  on  security  has  flourished  in  the 
same  period,  and  much  of  it  draws  on  a  similar 
range  of  mostly  critical  social  theory,  as  do  other 
parts  of  the  field.  The  range  of  feminist  scholarship 
is  particularly  important,  because  there  is  not  one 
single  feminist  approach  to  security  but  rather 
feminist  analysis  begins  from  the  full  range  of 
social  theory — including  liberal,  post-Marxist, 
poststructural,  and  postcolonial — and  in  no  way 
seeks  to  distill  these  theoretical  traditions  into  a 
single  “feminist”  theory.  What  does  unite  feminist 
scholarship  is  a  concern  with  gender  in  social  life 
and,  for  those  interested  in  security,  the  relation¬ 
ship  of  gender  to  security.  Gender  refers  to  the 
constitution  of  sex-related  social  differences,  of 
masculinities  and  femininities,  and  so  feminists  are 
concerned  with  the  ways  in  which  these  social  cat¬ 
egories  are  (re)produced  and  with  its  effects. 

Feminist  security  scholarship  has  explored  the 
ways  in  which  masculinities  and  femininities  have 
been  produced  in  and  through  security  institutions 
and  practices,  as  well  as  the  ways  in  which  those 
institutions  differentially  affect  the  lives  of  men  and 
women.  The  military  is  perhaps  the  quintessential 
such  security  institution,  and  much  feminist  scholar¬ 
ship  has  explored  the  ways  in  which  the  military 
produces  and  relies  on  a  particular  kind  of  mascu¬ 
linity  (often  termed  hypermasculinity),  which  is  also 
(re)produced  in  wider  society.  Feminist  scholars 
have  also  demonstrated  the  ways  in  which,  unsur¬ 
prisingly,  the  hypermasculine  and  necessarily  violent 
military  has  particular,  and  highly  gendered,  effects 
when  it  acts.  These  effects  are  seen  most  obviously 
in  rape  as  a  tactic  of  war  and  in  the  persistent  use  of 
prostitution  to  service  soldiers  in  the  field,  both  of 
which  have  been  shown  to  be  long-standing  features 
of  militaries.  Indeed,  Sandra  Whitworth  has  turned 
the  feminist  gaze  on  UN  Peacekeepers,  and  she 
found  that  donning  a  blue  beret  does  not  alter  these 
violent,  hypermasculine  practices. 

More  broadly,  much  feminist  scholarship  can 
be  said  to  show  the  way  in  which  the  national 
security  state  as  a  whole  is  a  product  and  producer 
of  patriarchy.  In  other  words,  the  state  and  the 
form  of  “national  security”  it  produces  and  prac¬ 
tices,  (re)produce  a  broad  social  structure  that  dif¬ 
ferentially  benefits  men  at  the  expense  of  women. 
For  an  account  of  security  studies,  this  finding  is 


particularly  important,  because  of  the  role  that 
security  studies  has  played  in  providing  intellectual 
service  to  the  national  security  state.  The  inescap¬ 
able  corollary  of  the  feminist  critique  of  that  state 
is  that  security  studies  is  not  only  an  enabler  of  the 
national  security  state  but  is  a  product  and  pro¬ 
ducer  of  gender  inequality  too.  Security  studies,  in 
other  words,  is  in  itself  a  highly  gendered  practice. 

The  End  of  Security? 

The  post-Cold  War  era  has  seen  a  burgeoning  of 
security.  The  peace  dividend  anticipated  from  the 
end  of  the  superpower  confrontation  was  never  real¬ 
ized  as  the  decade  following  the  transformation  of 
Europe  saw  the  national  security  state  reassert  itself 
in  novel  ways.  In  its  search  for  continued  relevance, 
the  national  security  state  and  its  academic  support, 
security  studies,  looked  to  the  world  outside  the 
Central  European  front  and  found  it  dangerously 
unstable.  The  national  security  state  and  security 
studies,  therefore,  retooled  to  bring  security  to  the 
violent  hinterland — particularly  when  that  hinter¬ 
land  bordered  on  the  core,  as  it  did  in  the  wars  of 
the  former  Yugoslavia.  Security  studies  provided  the 
intellectual  resources  for  this  retooling,  providing  a 
wealth  of  literature  exploring  the  dynamics  of  what, 
most  notably,  Mary  Kaldor  has  termed  the  New 
Wars.  The  fact  that  these  new  wars  tend  to  be 
fought  in  some  of  the  least  advantaged  parts  of  the 
world — Afghanistan,  Somalia,  the  Democratic 
Republic  of  Congo,  and  Sierra  Leone,  for  example — 
has  led  to  a  growing  merger  between  security  and 
development,  although  ironically  not  in  the  fashion 
the  UNDP  would  have  hoped  in  launching  its 
notion  of  human  security.  Rather  than  capturing  the 
power  and  resources  of  “security”  for  the  hard 
work  of  human  development,  the  merger  of  security 
and  development  has  harnessed  the  global  develop¬ 
ment  institutions  to  the  national  security  state’s 
violent  export  of  its  liberal-democratic  self. 

Having  survived  the  1990s,  security  received  its 
largest  single  boost  since  the  end  of  World  War  II 
with  the  terrorist  attacks  on  the  United  States  in 
2001.  Suddenly,  the  homeland  appeared  once  more 
to  be  under  threat,  and  this  time,  unlike  during  the 
Cold  War,  even  the  United  States  had  been  success¬ 
fully  attacked.  Security  spread  like  a  virus  through¬ 
out  Western  societies  in  response:  Antiterrorism 
legislation  was  passed,  amended,  updated,  and 
passed  again;  borders  became  sites  of  ever-greater 
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scrutiny  and  potential  violence;  state  surveillance 
spread  rapidly;  and  a  wide  range  of  legal  protec¬ 
tions  in  Western  societies,  which  had  been  won 
through  centuries  of  struggle,  were  summarily  cast 
off.  The  prison  at  Guantanamo  Bay,  Cuba,  stood 
as  the  symbol  of  this  renewed  security,  but  security 
was  far  from  contained  in  an  island  limbo.  The 
United  States  undertook  the  largest  reorganization 
of  its  state  structure  since  the  National  Security  Act 
of  1947  with  the  creation  of  the  Department  of 
Homeland  Security,  which  in  turn  has  served  as  the 
model  for  the  reinvigoration  of  the  national  secu¬ 
rity  state  among  the  allies  of  the  United  States. 
Against  this  backdrop,  it  is  not  surprising  that  secu¬ 
rity  studies  has  likewise  flourished,  since  the  trans¬ 
formed  national  security  state  has  just  as  much 
need  for  intellectual  support  work  as  it  did  at  its 
postwar  founding. 

Much  of  the  work  in  security  studies  as  it  emerged 
from  strategic  studies,  however,  leads  to  troubling 
questions  about  the  future  practice  of  security  stud¬ 
ies  itself.  The  original  securitization  work,  for 
example,  argued  that  the  best  security  politics  was 
“desecuritization,”  an  attempt  to  remove  as  much  of 
it  as  possible  from  the  ambit  of  security  and  into  the 
realm  of  normal  politics.  The  feminist  work  on  the 
gendered  character  of  the  national  security  state 
similarly  suggests  that  there  is  something  deeply 
troubling  about  the  services  provided  to  that  state  by 
security  scholars.  Mark  Neocleous  gave  a  strong  and 
clear  voice  to  these  questions  in  his  2008  book, 
Critique  of  Security ,  in  which  he  suggested  that 
security  was  the  central  logic  of  the  liberal  state,  and 
so  speaking  in  terms  of  security,  even  to  oppose  the 
extravagant  security  practices  of  the  state,  is  to 
reproduce  the  national  security  state  and  insulate  it 
from  the  possibility  of  political  transformation.  Just 
as  security  studies  is  maturing  in  its  new,  poststrate- 
gic  studies  guise,  therefore,  we  are  led  to  ask 
whether  we  have  now  reached  the  end  of  security. 

David  Mutimer 
York  University 
Toronto,  Ontario,  Canada 
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Structural  Equation 
Modeling 


Structural  equation  modeling  (SEM)  is  a  very  gen¬ 
eral  statistical  approach  for  modeling  and  estimat¬ 
ing  data.  It  is  seen  as  a  combination  of  factor 
analysis  and  regression  or  path  analysis.  These 
procedures  are  regarded  as  special  cases  of  SEM. 
It  contains,  in  addition,  classical  multivariate 
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techniques  such  as  analysis  of  variance,  analysis  of 
covariance,  dummy  regression,  and  canonical  cor¬ 
relation  as  special  cases.  A  structural  equation  model 
contains  latent  variables  that  should  correspond  to 
theoretical  constructs  from  substantive  theory  and 
their  reflective  indicators  or  items  that  form  the  mea¬ 
surement  model.  The  relationships  between  the 
latent  variables  (constructs  and  factors)  and  their 
indicators  (observed  variables)  are  quantified  by  the 
corresponding  factor  loadings.  The  regression  coef¬ 
ficients  between  the  latent  variables  (structural  rela¬ 
tions)  take  random  and  nonrandom  measurement 
error  into  account  and  are,  therefore,  not  biased. 
This  part  of  the  model  is  called  “structural  model” 
and  represents  the  underlying  theory.  This  entry 
presents  some  of  the  basic  features  of  SEM  using  an 
example  from  the  European  Social  Survey  (ESS). 

SEM  models  can  be  visualized  by  a  graphical 
path  diagram,  which  represents  the  relationships 
between  the  latent  variables  (structural  model)  and 
the  relationships  between  latent  and  observed  vari¬ 
ables  (measurement  model).  The  latent  variables 
are  symbolized  by  circles,  the  observed  variables 
by  rectangles,  and  the  postulated  direction  of  influ¬ 
ences  by  directed  arrows  (see  Figure  1).  The  path 
diagram  can  be  translated  into  a  set  of  linear  or 
matrix  equations.  Figure  1  displays  a  diagram  that 
graphically  represents  a  structural  equation  model 
with  the  measurement  and  the  structural  part.  It 
contains  one  exogenous  (independent)  construct 
and  one  endogenous  (dependent)  construct.  Each 
construct  in  Figure  1  is  measured  by  three  indica¬ 
tors  to  control  for  all  forms  of  random  and  non- 
random  measurement  errors. 

In  Figure  1,  is  an  exogenous  latent  construct, 
measured  by  three  indicators  xv  x2,  and  x3.  Their 


measurement  errors  are  designated  as  81}  82,  and 
S3.  Tjj  is  an  endogenous  latent  construct,  measured 
by  three  indicators  yv  y2,  and  yr  Their  measure¬ 
ment  errors  are  designated  as  e15  e2,  and  e3.  A.  is 
the  symbol  for  the  unstandardized  factor  loading. 
The  residual  of  the  latent  endogenous  variable  is 
(h-  It  represents  the  unexplained  variance  of  the 
latent  endogenous  variable.  The  regression  coeffi¬ 
cient  between  the  exogenous  and  the  endogenous 
latent  variable  is  yir  The  first  subscript  refers  to 
the  dependent  variable,  the  second  to  the  indepen¬ 
dent  variable. 

The  corresponding  equation  system  for  the 
model  in  Figure  1  is  as  follows: 

Structural  model: 

Til  =  Tn  €i  +  it  (1-1) 

Measurement  model: 


Xi  : 

=  KZi 

+ 

8i 

(2.1) 

X2  = 

=  KZi 

+ 

§2 

(2.2) 

x3  = 

= 

+ 

§3 

(2.3) 

yi  = 

= 

+ 

Si 

(2.4) 

yz  = 

= 

+ 

£2 

(2.5) 

ys  = 

r 

+ 

£3 

(2.6) 

The  intercepts  of  the  structural  and  the  measure¬ 
ment  model  have  been  omitted. 


Figure  I  The  General  Model  for  Two  Latent  Variables 
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The  items  are  conceptualized  as  reflective  indica¬ 
tors,  as  in  confirmatory  factor  analysis,  which 
means  that  the  researcher  postulates  a  direction  of 
influence  from  the  latent  to  the  observed  variable. 
Both  the  measurement  model  and  the  structural 
model  can  be  generalized  to  take  into  account  n 
indicators  and  m  constructs.  One  can  now  differen¬ 
tiate  between  three  types  of  parameters:  (1)  free 
parameters  to  be  estimated  from  the  data,  such  as  in 
classical  multivariate  analysis;  (2)  fixed  parameters 
that  are  set  a  priori  to  a  certain  value  such  as  0  or  1; 
and  (3)  constrained  parameters  that  are  set  equal  to 
another  parameter.  For  estimating  the  coefficients, 
several  estimation  methods  are  available.  The  stan¬ 
dard  method  is  maximum  likelihood  estimation.  By 
using  this  method,  all  the  free  parameters  are  esti¬ 
mated  simultaneously  taking  into  account  both  the 
fixed  and  the  constrained  parameters  in  the  minimi¬ 
zation  of  the  fitting  function.  Other  estimation 
methods  take  nonnormal  distributions  into  account 
such  as  robustified  maximum  likelihood,  asymp¬ 
totic  distribution  free  estimator  ( ADF),  and  weighted 
least  squares  (WLS).  In  addition,  Bayesian  estima¬ 
tion  is  possible,  which  allows  the  testing  of  a 
broader  class  of  hypotheses  and  which  may  be 
more  robust  in  smaller  samples.  The  model  testing 
is  mostly  done  in  at  least  two  steps,  because  other¬ 
wise  the  necessary  model  modifications  are  too 
complex. 

The  measurement  model  is  first  tested  via  con¬ 
firmatory  factor  analysis.  After  one  has  found  a 
well-fitting  measurement  model,  in  the  second 
step,  the  full  structural  equation  model,  which 
includes  the  fitted  measurement  model,  is  tested. 
There  are  three  approaches  for  testing  SEM  mod¬ 
els:  (1)  strictly  confirmatory,  (2)  the  use  of  alterna¬ 
tive  models,  and  (3)  model  generating. 

1.  In  a  strictly  confirmatory  situation  the 

researcher  has  formulated  one  single  model  and 
has  obtained  empirical  data  to  test  it.  The 
model  should  be  either  accepted  or  rejected. 


2.  The  researcher  has  specified  several  alternative 
models  (or  competing  models),  and  on  the  basis 
of  an  analysis  of  a  single  set  of  empirical  data, 
one  of  the  models  should  be  selected. 

3.  In  the  model-generating  approach,  the 
researcher  has  specified  a  tentative  initial  model. 
If  the  initial  model  does  not  fit  the  given  data, 
the  model  should  be  modified  and  tested  again 
using  the  same  data.  The  respecification  of  each 
model  may  be  theory  driven  or  data  driven,  (cf. 
Joreskog,  1993) 

As  with  classical  multivariate  analyses,  the 
estimation  procedure  (in  the  standard  case  this  is 
maximum  likelihood  estimation)  provides  unstan¬ 
dardized  and  standardized  coefficients  and  stan¬ 
dard  errors  and  z  values  for  the  unstandardized 
coefficients  from  the  estimation  procedure.  In  addi¬ 
tion,  global-  and  detailed-fit  measures  for  the 
specified  model  are  provided.  The  global-fit  mea¬ 
sures  are  all  some  function  of  the  chi-square,  the 
degrees  of  freedom,  and,  in  some  cases,  the  sample 
size.  There  are  descriptive  measures  such  as  the 
comparative  fit  index  (CFI),  probability-based  ones 
such  as  the  chi-square  test  and  the  root  mean 
square  error  of  approximation  (RMSEA),  and 
those  based  on  information  theory  (e.g.,  Akaike’s 
information  criterion  [AIC] ).  Cutoff  values  for 
these  measures  have  been  proposed,  although  there 
is  an  ongoing  discussion  about  which  ones  should 
be  used  in  which  way. 

Generally,  the  process  of  model  building  in 
SEM  can  be  described  in  the  steps  visualized  in 
Figure  2.  It  is  important  to  understand  that  most 
specified  models  are  in  the  first  step  falsified,  and 
one  has  to  modify  the  model  according  to  detailed 
fit  measures  such  as  the  modification  index  and  the 
expected  parameter  change  that  together  with  sub¬ 
stantive  theoretical  knowledge  should  guide  the 
respecification  of  model  modifications.  This  pro¬ 
cess  corresponds  to  model  generating  (the  third 


Model  does  not  fit 


Figure  2  The  Process  of  Causal  Modeling 
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approach)  rather  than  model  testing  (the  first  or 
second  approach). 

The  following  example  uses  data  from  the  British 
sample  of  the  ESS,  Round  1  (2002).  Results  of  a 
model  to  explain  attitude  toward  immigration  by 
the  two  latent  variables  “education”  and  “tradition/ 
conformity”  value  are  reported.  These  relations 
represent  the  structural  model.  This  model  is  speci¬ 
fied  as  a  partial  mediation  model,  as  education 
influences  both  values  and  attitude  toward  immi¬ 
gration,  and  the  effect  of  education  is  only  partially 
mediated  by  value.  As  a  consequence,  the  structural 
model  is  composed  of  two  equations  for  explaining 
the  two  latent  endogenous  variables  value  and 
immigration.  “Immigration”  is  measured  by  three 
reflective  indicators,  “tradition/conformity”  by  four 
observed  variables,  and  “education”  by  two.  The 


question  wordings  and  the  unstandardized  factor 
loadings  and  their  standard  errors  are  reported  in 
Table  1;  the  standardized  solution  is  reported  in 
Figure  3.  All  factor  loadings  are  significant  (p  < 
.001).  The  fit  of  the  model  is  very  good  according  to 
the  standard  criteria  (x2  =  32.82  with  24  degrees  of 
freedom;  CFI  =  .999;  RMSEA  -  .013). 

Figure  3  shows  that  the  loadings  of  the  three 
items  to  measure  attitude  toward  immigration  are 
all  very  high,  indicating  a  high  formal  validity.  The 
same  is  the  case  for  the  two  indicators  of  “educa¬ 
tion.”  The  standardized  factor  loadings  of  the  value 
construct  “tradition/conformity”  are  generally 
lower  than  those  of  the  other  constructs,  and  they 
range  from  0.44  to  0.72.  The  regression  coefficient 
between  “education”  (EDU)  and  “tradition/ 
conformity”  (TRCO)  is  significant,  but  weak  (0.24); 


Table  I  Question  Wordings  and  Unstandardized  Factor  Loadings 


Education 

(EDU) 

Tradition/ 

Conformity 

(TR/CO) 

Immigration 

(IMM) 

edup  Highest  level  of  education 

X 

0.47 

0 

0 

SE 

0.02 

0 

0 

edu2:  Years  of  full-time  education 

\ 

1.00 

0 

0 

SE 

a 

0 

0 

tr/cop  Important  to  follow  traditions  and  customs 

\ 

0 

0.75 

0 

SE 

0 

0.05 

0 

tr/co2:  Important  to  do  what  is  told  and  follow  rules 

\ 

0 

1.05 

0 

SE 

0 

0.06 

0 

tr/co3:  Important  to  behave  properly 

\ 

0 

1.00 

0 

SE 

0 

a 

0 

tr/co4:  Important  to  be  humble  and  modest 

\ 

0 

0.63 

0 

SE 

0 

0.04 

0 

immp  Allow  many/few  immigrants  of  different  race/ 

\ 

0 

0 

1.31 

ethnic  group  from  majority 

SE 

0 

0 

0.03 

immp  Allow  many/few  immigrants  of  same  race/ 

\ 

0 

0 

1.00 

ethnic  group  from  majority 

SE 

0 

0 

a 

imm3:  Allow  many/few  immigrants  from  poorer 

\ 

0 

0 

1.18 

countries 

SE 

0 

0 

0.03 

Source:  ESS  Round  1:  European  Social  Survey  Round  1  Data  (2002).  Data  file  edition  6.1.  Norwegian  Social  Science  Data 
Services,  Norway — data  archive  and  distributor  of  ESS  data.  Results  computed  by  the  authors. 

Notes:  \  =  unstandardized  factor  loadings;  SE  =  standard  error  of  \. 

a.  No  SE  is  estimated  because  1.0  is  a  fixed  parameter,  which  is  necessary  for  identification  of  the  model. 
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Figure  3  Results  From  a  Structural  Equation  Modeling  (Standardized  Solution) 

Source :  ESS  Round  1:  European  Social  Survey  Round  1  Data  (2002).  Data  file  edition  6.1.  Norwegian  Social  Science  Data 
Services,  Norway — data  archive  and  distributor  of  ESS  data.  Results  computed  by  the  authors. 


more  education  is  associated  with  less  agreement 
on  the  TRCO  scale.  The  effect  of  education  on 
“immigration”  (IMM)  (—0.34)  is  higher  than  that 
“TRCO”  on  “immigration”  (—0.15).  The  regres¬ 
sion  coefficients  on  “immigrants”  are  partialized 
coefficients.  Substantively,  more  education  and 
more  disagreement  on  the  TRCO  scale  are  associated 
with  less  negative  attitudes  toward  immigration. 
One  can  see  in  the  path  diagram  that  “education” 
has  not  only  a  direct  effect  on  “immigration”  but 
also  an  indirect  effect  mediated  by  “TRCO.” 
Direct  (—0.34)  and  indirect  effects  (0.24  multi¬ 
plied  by  —0.15)  are  added  as  total  effects  (  —  0.38) 
of  one  variable  (education)  on  another  (immi¬ 
grants).  The  asymptotic  standard  errors  and  boot¬ 
strapped  standard  errors  for  indirect  effects  and 
total  effects  and  their  significance  levels  are  addi¬ 
tionally  calculated  in  most  SEM  software. 

In  Figure  3,  one  can  see  for  the  latent  variables 
that  the  explained  variance  is  0.06  for  “TRCO” 
and  0.17  for  “immigrants.”  These  low  numbers 
indicate  that  especially  for  the  explanation  of 
“TRCO,”  but  also  for  the  attitude  toward 


immigration,  the  relevant  explanatory  variables 
are  missing. 

The  model  can  be  extended  as  a  generalized 
latent  variable  model  to  multiple  groups  or  mod¬ 
els,  estimating  latent  means  and  intercepts;  autore¬ 
gressive,  cross-lagged,  latent  growth  curves;  latent 
curve  models;  and  stochastic  differential  equations 
for  longitudinal/panel  data,  testing  mediation  and 
moderation,  nonlinear  models,  multilevel  models, 
and  mixture  models.  Furthermore,  it  can  be 
extended  to  ordinal  and  categorical  data,  where 
other  estimation  techniques  are  used.  The  latent 
variable  approach  can  be  embedded  easily  in  a 
Bayesian  framework. 
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Subject  Culture 


Among  scholars  of  political  culture,  the  term  sub¬ 
ject  culture  signifies  a  specific  subtype  whose 
members  are  not  actively  participating  in  the  cen¬ 
tral  political  system.  The  members  of  all  political 
systems  at  all  times  exhibit  significant  differences 
in  their  orientations,  attitudes,  and  beliefs  about 
politics.  Many  of  the  differences  depend  on  the 
inequalities  in  access  to  the  political  opportunities 
existing  in  these  political  systems.  Especially, 
gender  differences,  with  women  disadvantaged 
vis-a-vis  men,  are  clearly  the  product  of  solidly 
structured  biases  against  women’s  participation  in 
politics.  Many  differences  are  also  the  conse¬ 
quence  of  some  aspects  of  the  processes  of  social¬ 
ization.  Finally,  some  of  the  differences  simply 
derive  from  the  preferences  of  the  individuals, 
from  their  willingness  to  devote  more  or  less  of 
their  time  and  energy  to  learning  about  politics 
and  to  taking  part  in  political  activities.  Some 
political  cultures  encourage  and  reward  political 
knowledge,  involvement,  and  participation;  oth¬ 
ers  do  not.  However,  differences  among  individu¬ 
als  appear  in  all  political  systems.  These  have  been 
the  object  of  quite  a  number  of  studies,  mainly 
carried  out  by  political  anthropologists,  but  in  the 


1960s,  the  study  of  political  culture  became  an 
important  component  of  the  field  of  political  sci¬ 
ence  and,  more  precisely,  in  the  work  of  scholars 
attempting  to  explain  modernization  and  political 
development. 

The  first  major  contribution  to  the  field  of 
political  culture  was  made  by  the  survey  of  citi¬ 
zens’  attitudes  in  five  democracies  performed  by 
Gabriel  Almond  and  Sidney  Verba.  In  their  semi¬ 
nal  book  The  Civic  Culture ,  they  explored  the 
dense  web  of  relationships  between  political  atti¬ 
tudes  and  the  survival/stability  of  Germany,  Italy, 
Mexico,  the  United  Kingdom  (UK),  and  the  United 
States.  The  underlying  hypothesis  was,  under¬ 
standably,  that  there  should  exist  significant  differ¬ 
ences  in  the  political  cultures  of  citizens  to  explain 
the  differences  in  the  various  political  systems,  two 
of  them  (the  United  States  and  the  UK)  stable  and 
effective  democracies,  two  of  them  (Italy  and 
Germany)  democracies  that  had  previously  failed, 
and  one  (Mexico)  a  single-party  hegemonic  regime. 
To  capture  the  nature  and  quality  of  the  respective 
civic  cultures,  the  authors  identified  three  different 
orientations:  cognitive,  affective,  and  evaluational. 
Cognitive  orientations  refer  to  the  knowledge  of 
and  the  beliefs  about  the  political  system,  its  roles 
and  the  incumbents  of  these  roles,  its  inputs  and  its 
outputs.  Affective  orientations  are  the  feelings 
about  the  political  system  and  its  roles,  personnel, 
and  performance.  Evaluational  orientations  con¬ 
cern  the  judgments  and  opinions  about  political 
objects  that  typically  involve  the  combination  of 
value  standards  and  criteria  with  information  and 
feelings.  Combining  these  orientations  with  aspects 
of  the  political  system  and  its  inputs,  outputs,  and 
the  role  of  the  individual  in  the  system,  Almond 
and  Verba  suggested  the  existence  of  three  types  of 
political  cultures.  More  precisely,  in  a  parochial 
political  culture,  individuals  have  no  knowledge 
concerning  the  political  system — that  is,  neither  its 
inputs  nor  its  outputs.  By  contrast,  in  a  participant 
political  culture,  individuals  know  about  the  polit¬ 
ical  system  and  its  inputs  and  outputs  and  are  also 
aware  that  they  can  play  an  active  role.  Subjects 
have  come  to  know  about  the  political  system  and 
its  actions,  the  outputs,  but  they  are  not  familiar 
with  how  they  themselves  can  affect  the  inputs  and 
how  to  play  a  role  in  the  political  system. 

All  political  systems  contain  varying  combina¬ 
tions  of  individuals  with  parochial,  subject,  or 
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participant  orientations.  It  makes  a  great  difference 
for  any  political  system,  of  course,  whether  any 
one  of  these  three  types  of  political  culture  is  or  has 
become  dominant.  Almond  and  Verba  argue  that 
there  must  exist  and  develop  some  congruence 
between  political  cultures  and  political  structures. 
They  suggest  that  a  parochial  culture  is  most 
congruent  with  a  traditional  political  structure,  a 
subject  culture  with  a  centralized  authoritarian 
structure,  and  a  participant  culture  with  a  demo¬ 
cratic  political  structure.  One  could  also  remark 
that,  for  instance,  a  centralized  authoritarian  struc¬ 
ture  may  devote  some  resources  to  the  transforma¬ 
tion  of  parochials  into  subjects,  but  it  will  certainly 
also  pursue  the  goal  of  preventing  the  subjects 
from  becoming  participants.  Unfortunately,  the 
analysis  by  Almond  and  Verba  did  not  aim  at  pro¬ 
viding  sharp  profiles  of  the  types  of  political  cul¬ 
tures  prevailing  in  the  five  political  systems  they 
had  taken  into  consideration.  Instead,  they  shifted 
their  attention  to  the  relationship  between  cogni¬ 
tion,  affect,  and  evaluation  and  three  additional 
dimensions  of  political  culture:  allegiance,  apathy, 
and  alienation.  However,  they  did  not  even  explore 
whether  the  parochials,  the  subjects,  and  the  par¬ 
ticipants  were  more  likely  to  become  allegiant, 
apathetic,  or  alienated.  Nor  did  they  give  enough 
attention  to  the  presumably  differentiated  impact 
of  diverse  combinations  of  parochials,  subjects, 
and  participants  in  the  political  culture  of  a  democ¬ 
racy.  Moreover,  as  Verba  has  recognized  in  The 
Civic  Culture  Revisited  (Almond  &  Verba  1980), 
a  dynamic  view  and  a  careful  interpretation  of  how 
these  processes  and  situations  were  bound  to  be 
affected  by  time  were  sorely  lacking.  Finally,  prac¬ 
tically  none  of  the  studies  that  followed  their  path¬ 
breaking  empirical  research  (Lucian  Pye  &  Sidney 
Verba,  1965),  not  even  the  book  reappraising  their 
original  contribution,  has  devoted  specific  atten¬ 
tion  to  the  category  of  subject  political  culture. 

According  to  Almond  and  Verba,  two  variables 
have  a  significant  impact  on  all  types  of  political 
culture:  the  amount  of  information  and  the  level  of 
education.  In  slightly  different  ways,  of  course, 
both  the  parochials  and  the  subjects  have  a  rather 
low  level  of  education  and  are  exposed  to  a  limited 
amount  of  political  information.  Therefore,  they 
are  easily  manipulated  by  the  rulers.  In  a  way,  one 
can  say  that  both  information  and  education, 
increasing  their  political  knowledge,  will  encourage 


their  eventual  transformation  into  participants. 
Indeed,  this  was  the  most  important  message  con¬ 
veyed  by  Daniel  Lerner  in  his  1958  book,  The 
Passing  of  Traditional  Society.  The  revolution  in 
communications  will  produce  significant  conse¬ 
quences  for  the  political  cultures  of  traditional 
societies.  The  most  important  among  these  conse¬ 
quences  were  thought  to  be  the  almost  complete 
disappearance  of  the  parochials  because,  inevita¬ 
bly,  all  individuals  living  in  the  same  territory, 
under  the  same  rulers,  and  affected  by  similar 
processes  would  become  increasingly  cognizant  of 
the  political  system  and  its  inputs  and  outputs  and 
would  probably  also  come  to  realize  that  they 
might  have  a  role,  though  subordinate,  to  play. 
Even  more  so  for  the  subjects,  their  growing 
knowledge  of  the  political  system  and  its  outputs 
might  motivate  them  to  learn  more  about  the  way 
to  produce  inputs  (demands  and  support)  and  to 
become  personally  active  in  politics. 

Scholars  studying  political  development  have 
often  been  accused  of  having  a  teleological  and 
ethnocentric  bias  in  their  approach — that  is,  of 
being  oriented  to  foresee  and  predict  for  develop¬ 
ing  countries  the  political  future  already  reached 
by  Western  democracies.  Curiously,  though,  when 
it  came  to  the  likelihood  of  the  transformation  of 
political  cultures,  most  of  the  same  scholars  were 
significantly  more  cautious.  There  was  no  explicit 
prediction  of  any  possibility  of  a  swift  appearance 
of  a  widespread  participant  culture.  On  the  con¬ 
trary,  most  scholars  believed  that  for  some  indefi¬ 
nite  period  of  time,  the  majority  of  developing 
countries  would  suffer  exactly  because  of  the  dif¬ 
ficulty  of  transforming  parochials  and  subjects 
into  participants.  In  any  case,  in  the  1960s  and 
early  1970s,  the  scholarly  and  policy-making 
emphasis  was  placed  on  the  need  to  build  states 
and  to  strengthen  their  structures  (the  bureaucracy 
and  the  armed  forces  and,  to  a  minor  extent,  rep¬ 
resentative  assemblies)  more  than  on  the  creation 
of  nations,  an  idea  not  easy  to  export  into  develop¬ 
ing  countries.  The  analysis  of  the  agencies  and 
processes  of  socialization — families  and  schools — 
so  important  for  the  transformation  of  political 
cultures,  remained  of  limited  interest  and  was 
rarely  carried  out  with  specific  commitment.  That 
social  and  political  movements,  for  instance,  those 
active  in  the  national  liberation  struggles,  could 
exert  some  influence  in  accelerating  cultural 
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changes,  especially  among  those  “subjects”  who 
might  become  involved  in  their  activities,  was  not 
a  common  theme  worthy  of  in-depth  studies. 

The  concept  of  a  subject  culture  disappeared 
rather  quickly  from  the  language  of  scholars  inter¬ 
ested  in  modernization  and  political  development. 
It  had  never  made  significant  inroads  among  schol¬ 
ars  studying  political  participation.  The  attention 
shifted  to  more  precise  and  better  manageable  and 
researchable  directions.  The  following  three  ele¬ 
ments,  interest,  information,  and  efficacy,  have 
been  explored  extensively  by  subsequent  scholars 
who  were  specializing  in  the  study  of  political  par¬ 
ticipation.  Nevertheless,  one  can  still  stretch  the 
concept  of  subject  culture  to  indicate  the  type  of 
political  culture  shown  by  those  individuals  who 
have  some  interest  in  politics,  not  much  informa¬ 
tion,  and  a  very  low  sense  of  political  efficacy. 
Their  interest  in  the  working  of  the  political  system 
may  be  stimulated  only  by  their  awareness  that  it 
affects  their  lives.  They  collect  the  amount  of 
political  information  that  seems  sufficient  to  cope 
with  the  outputs  of  the  political  system.  They  are 
more  or  less  painfully  conscious  of  their  inability 
to  play  a  successful  role  in  the  working  of  the 
political  system.  With  reference  to  the  wealth  of 
existing  data  on  the  quantity  and  quality  of  politi¬ 
cal  participation  in  contemporary  political  sys¬ 
tems,  it  is  possible  to  state  that,  contrary  to  the 
views  formulated  by  Almond  and  Verba  in  the 
1960s,  there  remain  significant  numbers  of  indi¬ 
viduals,  not  only  in  centralized  authoritarian 
regimes  but  in  all  contemporary  democratic 
regimes  as  well,  whose  political  culture  shows  the 
features  of  a  subject  culture. 
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International  relations  as  a  field  of  study  has  tra¬ 
ditionally  been  defined  as  a  science  whose  primary 
purpose  has  been  to  explain  the  behavior  of  states 
operating  in  an  anarchic  world  where  power — 
usually  measured  in  terms  of  economic  resources, 
military  strength,  and  moral  influence — has 
always  been  unevenly  spread  among  its  various 
units.  How  to  measure  power,  the  uses  to  which 
power  is  put,  how  states  acquire  and  lose  it,  and 
what  impact  its  unequally  distributed  character 
has  on  relations  between  states  in  the  wider  inter¬ 
national  system  have  been  central  issues  discussed 
by  scholars  of  international  relations  ever  since 
Thucydides  wrote  his  classic  History  of  the 
Peloponnesian  War  in  the  4th  century  BCE. 

Measuring  the  power  of  states  has,  of  necessity, 
created  its  own  taxonomy.  Thus,  different  states 
may  be  defined  as  great,  weak,  imperial,  regional, 
global,  or  even — more  recently — failed,  largely  in 
terms  of  three  attributes:  (1)  their  own  set  of 
unique  capabilities,  (2)  the  distribution  of  capa¬ 
bilities  in  the  wider  international  system,  and 
(3)  their  capacity  to  exercise  control  over  both 
their  own  affairs  and  those  of  others.  Inevitably, 
most  attention  has  been  paid  to  the  actions  of  great 
powers  insofar  as  they  have  exercised  most  influ¬ 
ence  on  the  international  system.  Thus,  when  the 
Napoleonic  wars  ended  with  France’s  defeat  in 
1814,  it  was  naturally  assumed  that  the  future  of 
Europe  would  be  shaped  by  at  least  five  “great” 
powers:  Great  Britain,  the  Austrian  Empire,  Russia, 
Prussia,  and  France.  A  century  later,  it  was  taken 
for  granted  that  the  outcome  of  World  War  I 
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would,  in  the  end,  be  determined — once  again — by 
the  policies  and  capabilities  of  the  various  great 
powers. 

World  Wars  I  and  II 

The  term  superpoiver  only  emerged  in  the  period 
following  the  “Great  War”  (World  War  I)  of  1914 
to  1918.  Hitherto,  writers  had  hardly  used  the 
term,  preferring  instead  to  speak  of  certain  states 
possessing  vast  powers  that  extended  well  beyond 
their  own  territory  as  empires.  The  term  super¬ 
power  was  thus  very  much  a  modern  creation  and 
was  first  used  in  the  early  1 920s  with  special  refer¬ 
ence  to  the  greatest  of  all  modern  empires — the 
British  Empire,  with  one  in  four  people  around  the 
world  living  under  its  flag.  It  then  became  more 
commonly  employed  during  World  War  II.  Indeed, 
the  idea  that  the  war  would  conclude  with  an 
enormous  concentration  of  power  in  the  hands  of 
only  three  states — the  British  Empire,  the  United 
States,  and  the  former  USSR — was  initially  mooted 
by  the  Dutch  American  geostrategist  Nicholas  J. 
Spykman,  first  in  his  1942  study  America’s  Strategy 
in  World  Politics  and  then  2  years  later  in  his 
short,  but  highly  influential,  book  The  Geography 
of  the  Peace.  Spykman  was  a  power  theorist  par 
excellence.  But  he  made  a  clear  distinction  between 
different  kinds  of  power  and  concluded  (against 
the  then  influential  arguments  advanced  by  the 
British  geographer  Halford  Mackinder)  that  the 
dominant  global  actor  after  the  war  would  not  be 
the  Soviet  Union,  which  now  controlled  the 
Eurasian  land  mass — what  Mackinder  termed  the 
heartland — but  rather  the  United  States,  with  its 
unmatched  maritime  global  supremacy. 

The  belief  that  the  defeat  of  the  Axis  powers 
would  lead  to  the  creation  of  a  new  kind  of  world 
order  dominated  by  “super”  powers  was  also 
advanced  by  William  T.  R.  Fox,  a  Chicago-trained 
academic  who  after  forming  part  of  the  interna¬ 
tional  staff  at  the  United  Nations  conference  in  San 
Francisco  went  on  to  influence  many  up-and- 
coming  international  relations  scholars  in  the 
United  States.  In  his  justly  famous  1944  book,  The 
Super-Powers:  The  United  States,  Britain  and  the 
Soviet  Union — Their  Responsibility  for  Peace,  Fox 
effectively  helped  popularize  the  term  superpower. 
Interestingly,  like  Spykman,  he  identified  three  (not 
two)  states  that  could  lay  claim  to  this  particular 


status  as  the  war  came  to  an  end.  Two  of  these — 
the  United  States  and  the  Soviet  Union — had 
acquired  their  position  in  the  process  of  waging 
war  against  Germany  and  Japan;  the  third  (the 
United  Kingdom  [UK] )  had  for  some  time  been  the 
most  extensive  empire  in  world  history.  Fox,  how¬ 
ever,  drew  different  conclusions  from  those  of 
Spykman.  Spykman  never  believed  that  the  three 
would  easily  work  together  to  ensure  the  peace 
after  the  war.  In  his  opinion,  they  were  doomed  to 
compete.  Fox,  on  the  other  hand,  was  more  opti¬ 
mistic  and  initially  hoped  that  the  three  powers 
could  work  in  concert  to  secure  a  more  stable 
world.  Indeed,  as  the  subtitle  of  his  book  sug¬ 
gested,  they  had  a  responsibility  to  do  so. 

The  Cold  War  Period 

The  period  between  1944  and  1950  saw  a  signifi¬ 
cant  change  in  the  application,  if  not  the  meaning, 
of  the  term  superpoiver.  This  led  to  the  deletion  of 
the  British  Empire  from  the  original  trio.  Weakened 
by  war  and  increasingly  dependent  on  the  United 
States  in  a  new  world  order  that  was  fast  decolo¬ 
nizing  (India  declared  independence  in  1947),  it 
became  more  and  more  difficult  to  think  of  the  UK 
as  a  superpower.  This  of  course  left  only  two 
superpowers  in  play — the  United  States  and  the 
Soviet  Union.  They  in  turn  were  said  to  be  “super” 
for  a  variety  of  different  reasons:  their  sheer  geo¬ 
graphical  size,  their  respective  hard  power  capa¬ 
bilities,  their  global  reach,  their  pursuit  of  defined 
missions  worldwide,  and  their  ability  to  shape  the 
policies  and  choices  of  other  increasingly  depen¬ 
dent  small-  and  medium-sized  powers. 

The  idea  that  there  were  now  only  two  super¬ 
powers  also  gave  rise  to  another  connected  idea: 
bipolarity.  This  notion  was  popularized  by  the 
most  widely  read  international  relations  scholar  of 
the  immediate  postwar  years:  the  German-born 
American  scholar  Hans  J.  Morgenthau,  who  in 
1948  spoke  of  a  radically  new  world  organized 
around  two  dominant  “poles”  of  power  and 
attraction.  This  system  could  quite  easily  conclude 
in  war,  he  feared.  But  as  time  passed,  Morgenthau 
began  to  wonder  whether  this  bipolar  interna¬ 
tional  system  might  even  contain  within  itself  the 
basis  of  a  more  stable  international  order.  Eater, 
another  international  relations  scholar,  Kenneth 
Waltz,  developed  this  argument  even  further.  Tike 
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Morgenthau,  Waltz  noted  the  apparent  stability  of 
the  postwar  order  and  the  success  of  the  super¬ 
powers  in  defying  earlier  predictions  that  the  Cold 
War  would  sooner  or  later  turn  hot.  This,  he 
argued,  happened  in  part  because  of  nuclear  weap¬ 
ons.  But  it  was  also  a  function  of  the  bipolar  dis¬ 
tribution  of  power.  This,  he  insisted,  was  less  war 
prone  than  a  system  where  there  were  many  power 
sources  (a  multipolar  order).  He  believed  that 
error  was  more  common  in  the  latter  system 
because  of  the  difficulty  of  estimating  accurately 
the  power  and  cohesion  of  shifting  and  often 
unstable  coalitions.  In  a  bipolar  world,  on  the 
other  hand,  where  the  two  superpowers  relied  on 
their  own  vastly  superior  power  for  their  security, 
coalitions  were  less  important  and  so  stability 
more  likely. 

As  a  simplifying  device,  the  idea  of  a  bipolar 
world  composed  of  two  superpowers  to  describe 
the  international  system  after  1947  was  useful 
enough.  But  it  was  not  without  its  limits.  Four  at 
least  deserve  mention  here. 

One  was  that  it  could  not  really  explain  the 
Cold  War  itself.  To  do  this,  one  required  to  look 
at  more  than  just  the  uneven  distribution  and  con¬ 
centration  of  power  in  the  world.  This  only  pro¬ 
vided  one  kind  of  measure.  It  did  not  necessarily 
explain  the  deeper  reasons  for,  and  unique  charac¬ 
ter  of,  the  conflict.  To  do  this,  it  was  necessary  to 
examine  the  very  different  socioeconomic  founda¬ 
tions  of  the  two  states,  their  opposing  worldviews, 
and  their  quite  different  histories. 

Second,  the  idea  of  a  “superpower”  implied  a 
rough-and-ready  equality  between  the  two  states. 
But  as  we  know,  from  the  outset — and  for  the 
duration  of  the  Cold  War — the  United  States  was 
always  in  a  superior  position.  This  was  partly  for 
the  geographical  reasons  originally  suggested  by 
Spykman.  But  it  was  also  because  its  economy  was 
larger  and  more  productive  and  its  allies  richer  and 
generally  more  willing  to  follow  its  lead  than  say 
Poland,  Czechoslovakia,  and  Hungary  were  ever 
prepared  to  follow  that  provided  by  the  former 
USSR. 

Third,  the  notion  of  there  being  just  two  super¬ 
powers  having  the  capacity  to  project  power  any¬ 
where  in  the  world  does  not  explain  significant 
alterations  in  the  balance  of  power  in  the  world 
after  1947.  In  1950,  for  example,  there  were  only 
two  nuclear  weapons  states:  the  United  States  and 


the  former  USSR.  By  the  late  1960s,  however, 
there  were  five,  including  China,  the  UK,  and 
France.  In  1947,  the  United  States  was  “super”  in 
nearly  every  conceivable  way.  Twenty-five  years 
later,  it  faced  quite  a  different  set  of  circumstances 
brought  about  by  the  emergence  of  new  economic 
players  such  as  Europe  and  Japan.  Nor  could  the 
United  States  and  the  former  USSR  control  events 
quite  so  easily  as  the  notion  of  a  superpower  might 
have  implied.  Thus,  both  lost  significant  allies  in 
the  1960s  when  China  broke  from  the  former 
USSR  and  France  left  the  military  command  struc¬ 
ture  of  the  North  Atlantic  Treaty  Organization 
(NATO).  Then,  in  the  1970s,  the  United  States  was 
unceremoniously  ejected  from  two  key  countries — 
Vietnam  and  Iran — while  the  former  USSR  was 
thrown  out  of  Egypt.  And  during  the  next  decade, 
the  Soviet  Union  fought  a  long  and  unsuccessful 
war  on  its  southern  border  in  Afghanistan. 

Finally,  the  notion  of  superpower  really  had  no 
predictive  capacity.  Indeed,  precisely  because  many 
scholars  viewed  the  former  USSR  as  a  superpower, 
it  may  well  have  misled  them  into  assuming  it  was 
far  stronger  than  it  turned  out  to  be.  Certainly,  one 
of  the  more  important  reasons  why  so  many  stu¬ 
dents  of  international  relations  failed  to  anticipate 
the  sudden  and  unexpected  end  to  the  Cold  War  in 
1989,  followed  2  years  later  by  the  collapse  of  the 
former  USSR  itself,  was  because  of  an  attachment 
to  a  concept  that  ultimately  proved  to  be  more  an 
obstacle  than  an  asset  in  understanding  how  the 
Soviet  Union  actually  worked. 

1991  and  After 

If  a  superpower  is  a  state  traditionally  considered 
to  be  one  step  higher  than  a  great  power,  it  remains 
an  open  question  as  to  how  we  should  characterize 
the  distribution  of  power  following  the  collapse  of 
the  former  USSR  in  1991.  Clearly  the  world  was 
no  longer  bipolar.  But  what  kind  of  world  was  it? 
For  a  while,  some  predicted  the  rise  of  a  new  mul¬ 
tipolar  world  order  consisting  of  Japan,  the  newly 
empowered  European  Union,  and  the  United 
States.  It  soon  became  clear,  however,  that  this 
was  not  to  be.  Japan  failed  financially.  Europe 
proved  to  be  weak  in  its  efforts  to  end  the  civil  war 
in  (former)  Yugoslavia.  Meanwhile,  the  United 
States  experienced  one  success  after  another  from 
military  victory  in  Iraq  in  1991  through  to  Kosovo 
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just  a  few  years  later.  At  the  same  time,  the  U.S. 
economy  boomed,  the  Russian  economy  declined, 
and  the  capabilities  gap  between  the  United  States 
and  the  rest  of  the  world  grew  and  grew. 

In  a  world  without  rival  (or  indeed  without 
balance)  in  which  American  values  seemed  to  be 
driving  all  before  them,  it  was  hardly  surprising 
that  scholars  and  policymakers  alike  were  drawn 
to  the  distinctly  radical  conclusion  that  we  were 
now  living  in  an  entirely,  almost  unique,  world 
order  whose  single  most  obvious  characteristic 
was  that  there  was  only  one  single  pole  in  it.  This 
in  turn  raised  the  intriguing  question  of  how  one 
should  then  characterize  the  United  States  itself: 
“Superpower”  seemed  too  weak  and  anyway 
was  too  much  associated  with  a  period  now  long 
since  passed — namely,  the  Cold  War.  Analysts 
thus  began  to  look  around  for  new  ways  of 
defining  the  United  States.  Some,  like  the  French 
Foreign  Minister,  Flubert  Vedrine,  preferred  the 
idea  of  a  “hyperpower,”  suggesting  that  the 
United  States  now  had  unprecedented  power  and 
capacity.  Others  were  attracted  to  the  notion  of 
hegemony,  implying  that  the  United  States  had 
not  only  enormous  power  but  also  a  capacity  to 
lead  and  inspire  others.  A  few  writers,  however, 
began  to  define  the  United  States  as  a  new  form 
of  liberal  Empire — a  new  Rome  on  the  Potomac, 
as  one  observer  put  it — which,  even  if  it  did  not 
seek  to  acquire  other  people’s  territory,  did  what 
all  empires  have  done  in  the  past:  It  shaped  the 
choices  of  others  and  defined  the  rules  of  the 
game  while  punishing  those  who  did  not  play  by 
them. 

By  the  beginning  of  the  21st  century,  there¬ 
fore,  the  term  superpower  was  no  longer  much 
used.  Indeed,  according  to  many  thinkers,  the 
very  term  had  little  value  in  trying  to  explain 
international  politics  in  an  increasingly  interde¬ 
pendent  world  composed  of  several  actors  often 
facing  challenges  that  did  not  necessarily  arise 
from  the  possession  of  vast  concentrations  of 
power.  Some  even  suggested  that  the  distribution 
of  power  had  changed  so  much  in  the  first  10 
years  of  the  new  millennium  that  it  would  be 
quite  meaningless  to  think  of  the  world  in  terms 
of  any  kind  of  superpower  determination.  That 
age,  it  was  agued,  was  now  past. 

Yet  the  idea  did  not  die  away  entirely.  On  the 
contrary,  as  America’s  fortunes  began  to  fade 


and  those  of  China  started  to  rise,  many  began 
to  view  the  emerging  international  order  in 
increasingly  superpower,  possibly  even  bipolar, 
terms.  China  may  have  been  no  Soviet  Union. 
Still,  its  potential  was  huge  and  its  ambitions 
global.  Nor  was  it  the  only  potential  superpower 
on  the  scene  seeking  to  compete  with  the  United 
States.  Europe,  and  perhaps  India  in  time,  could 
also  lay  some  claim  to  being,  or  over  time 
becoming,  superpowers  of  a  sort.  Policymakers 
certainly  seemed  to  think  so  and,  while  talking 
the  cooperative  language  of  globalization  and 
global  governance,  continued  to  act  in  ways  that 
would  have  been  easily  recognizable  to  policy¬ 
makers  from  a  bygone  age.  The  Cold  War  with 
all  its  ideological  baggage  might  have  been  over. 
The  market  may  have  also  triumphed.  But  very 
powerful  states,  or  groupings  of  states,  with  seri¬ 
ous  capabilities,  international  ambitions,  and  a 
desire  to  mould  the  future,  still  sought  to  deter¬ 
mine  the  way  the  world  looked.  Even  if  one  kind 
of  superpower  rivalry  had  concluded  by  the 
beginning  of  the  1990s,  nearly  20  years  later,  it 
very  much  looked  as  if  another  was  just  starting 
to  take  shape. 

Michael  Cox 
London  School  of  Economics 
London,  United  Kingdom 
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The  concept  of  political  support  was  initially  devised 
by  David  Easton  in  the  context  of  his  theory  of 
political  systems.  According  to  Easton  (1965),  polit¬ 
ical  systems  have  inputs  and  outputs,  with  inputs 
taking  the  form  of  demands  and  support.  Political 
support  occurs  when  “A  acts  on  behalf  of  B  or  when 
he  orients  himself  favorably  toward  B”  (p.  159).  A 
more  precise  definition  was  provided  by  this  author 
in  a  later  work  where  he  defines  political  support  as 
the  “way  in  which  a  person  evaluative  orients  him¬ 
self  to  some  object  through  either  his  attitudes  or  his 
behaviour”  (1975,  p.  436).  Easton  develops  a  multi¬ 
dimensional  conceptualization  of  political  support  in 
which  he  distinguishes  both  between  the  objects  of 
support  and  the  types  of  support.  Within  the  first,  he 
differentiates  from  each  other  support  for  three  dis¬ 
tinct  objects:  the  political  community,  the  regime, 
and  the  authorities.  Within  types,  he  distinguishes 
between  two:  specific  and  diffuse. 

As  important  as  it  has  been  for  the  study  of 
political  attitudes,  Easton’s  conceptualization  of 
political  support  has  been  seen  by  some  as  unclear 


and  has  led  to  much  research  that  has  been  criti¬ 
cized  as  ambiguous,  confusing,  and  noncumula- 
tive.  Different  understandings  of  Easton’s  original 
arguments  have  led  to  a  variety  of  schemes, 
approaches,  and  indicators  that  have  aimed  to  cap¬ 
ture  the  different  dimensions  of  political  support. 
There  is  no  consensus  on  how  exactly  Easton’s 
ideas  should  be  empirically  assessed.  Long-standing 
debates  among  scholars  have  concentrated  not  only 
on  which  are  the  best  ways  to  measure  the  different 
dimensions  empirically  but  also  on  what  the  evi¬ 
dence  gathered  thus  far  has  meant. 

Objects  of  Political  Support 

An  initial  distinction  put  forward  by  Easton  deals 
with  the  classification  of  the  different  objects 
toward  which  political  support  might  be  extended. 
According  to  him,  support  is  fed  into  the  political 
system  in  relation  to  three  basic  objects,  which  he 
orders  in  a  scale  of  abstractness.  At  one  extreme 
lies  the  most  intangible  or  “abstract”  object:  the 
political  community,  or  nation.  At  the  other 
extreme  lies  the  most  immediate,  concrete  object: 
the  political  authorities.  Between  the  poles  lies  a 
third  object  of  support:  the  regime,  or  the  “rules  of 
the  game,”  that  allow  a  political  system  to  be  main¬ 
tained.  Easton  warns  that  the  fact  that  support  for 
a  political  system  can  be  broken  down  into  three 
elements  does  not  mean  that  support  for  each  one 
of  these  objects  is  independent  of  the  others.  Many 
times  the  three  kinds  of  support  are  closely  inter¬ 
related,  so  that  the  presence  of  one  is  a  function  of 
the  presence  of  one  or  both  of  the  other  kinds. 

Authors  such  as  Pippa  Norris  and  Russell 
Dalton  have  followed  Easton’s  line  in  differentiat¬ 
ing  certain  objects  of  support  that  are  more  general¬ 
ized  from  others  that  are  more  concrete.  Both 
authors  offer  refined  versions  of  Easton’s  classifica¬ 
tion  of  objects  of  political  support,  in  which  they 
argue  about  the  need  to  differentiate  between  three 
different  components  of  the  political  regime:  regime 
principles,  performance,  and  institutions.  Along 
these  lines,  they  suggest  and  empirically  confirm  a 
five-dimensional  structure  that  includes  as  objects 
the  political  community,  regime  principles,  regime 
performance,  regime  institutions,  and  political 
authorities.  John  Booth  and  Mitchell  Seligson  have 
confirmed  through  factor  analysis  that  a  similar 
structure  is  operating  in  several  Latin  American 
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democracies,  with  the  addition  of  a  sixth  dimension 
that  reflects  attitudes  toward  local  government. 

Diffuse  and  Specific  Political  Support 

Besides  classifying  the  objects  of  political  support, 
Easton  (1975)  distinguishes  between  two  kinds  of 
support,  specific  and  diffuse,  which  are  of  different 
natures  and  have  different  implications  for  the 
political  system.  On  the  one  hand,  specific  support 
is  described  as  the  type  related  to  the  “satisfactions 
that  members  of  a  system  feel  they  obtain  from  the 
perceived  outputs  and  performance  of  the  political 
authorities”  (p.  437).  On  the  other  hand,  diffuse 
support  refers  to  “evaluations  of  what  an  object  is 
or  represents — to  the  general  meaning  it  has  for  a 
person — not  of  what  it  does”  (p.  444).  Thus,  spe¬ 
cific  support  is  the  temporary  and  relatively 
ephemeral  acceptance  or  approval  that  citizens 
express  for  a  political  object  as  a  result  of  its  satis¬ 
faction  of  their  specific  demand;  whereas  diffuse 
support  is  conceived  of  as  a  deeper,  more  endur¬ 
ing,  and  more  generalized  political  loyalty  result¬ 
ing  from  early-life  political  socialization  and, 
therefore,  is  more  immune  to  short-term  induce¬ 
ments,  rewards,  or  performance  evaluations. 

This  distinction  is  an  important  contribution. 
Specific  support  can  be  object  specific  in  two  ways: 
In  the  first  place,  people  are  assumed  to  be  capable 
of  being  aware  of  the  political  authorities  working 
on  behalf  of  the  system;  in  the  second  place,  it 
takes  into  account  the  perceived  decisions,  poli¬ 
cies,  actions,  utterances,  and  style  of  the  authori¬ 
ties.  In  this  sense,  members  of  a  political  system 
can  perform  a  rational  calculation  of  whether  the 
authorities’  actions  address  their  needs  and 
demands.  Under  such  conditions,  specific  support, 
which  can  only  exist  in  societies  whose  institutions 
allow  authorities  to  be  held  accountable  for  their 
actions  and  the  resulting  consequences,  will  fluctu¬ 
ate  according  to  people’s  perceived  benefits  and 
satisfactions.  Diffuse  support  instead  hinges  on  the 
general  meaning  given  to  political  objects  and  is 
defined  as  a  “reservoir  of  favorable  attitudes”  that 
members  of  a  given  political  system  possess,  which 
allow  them  to  overlook  outputs  that  do  not  benefit 
their  wants.  It  is  more  durable  than  specific  sup¬ 
port  and  more  resistant  to  or  even  independent  of 
sudden  epiphenomena  such  as  policy  outputs  and 
performance.  This  support  also  remains,  despite 


the  ups  and  down  in  outputs  and  beneficial  perfor¬ 
mances.  While  diffuse  support  may  change,  it  is 
difficult  to  weaken  when  strong  and  to  strengthen 
when  weak.  Rooted  in  early  socialization  and 
experience,  diffuse  support  might  be  generated  and 
fostered  independent  of  the  fulfillment  of  particu¬ 
lar  needs  and  demands  by  the  authorities. 

The  Problem  of  the  Multidimensionality 
of  Political  Support 

Most  research  has  interpreted  Easton’s  conceptual¬ 
ization  as  a  unidimensional  axis  that  ranges  from 
specific  to  diffuse  support,  along  which  the  differ¬ 
ent  political  objects  are  arrayed  (Figure  1)  (Norris, 
1999).  In  this  conception,  support  for  each  political 
object  is  given  a  level  of  specificity/diffuseness, 
implying  that  specific  and  diffuse  support  do  not 
represent  distinct  kinds  that  could  be  extended 
toward  every  political  object.  This  approach  has 
been  criticized  by  other  readers  of  Easton  such  as 
Mariano  Torcal  and  Jose  R.  Montero  (2006)  for  its 
simplicity,  which  has  given  birth  to  an  excessively 
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Figure  I  Unidimensional  Conception  of  Political 
Support 
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broad  concept  of  support,  which  “empirically 
works  as  a  sort  of  umbrella  under  which  different 
dimensions  are  not  differentiated”  (p.  10). 

The  latest  refinements  to  the  multidimensional 
understanding  of  political  support  have  been 
developed  by  Dalton  (2004)  and  by  Torcal  and 
Montero  (2006).  These  authors  have  argued  that 
the  correct  interpretation  of  Easton’s  theory  implies 
that  every  political  object  is  subject  to  both  specific 
and  diffuse  support.  Thus,  a  multidimensional 
structure  of  political  support  is  devised,  where 
objects  may  be  subject  to  both  diffuse  and  specific 
support  at  the  same  time  (Figure  2).  This  view  does 
not  imply  that  support  for  a  political  object,  in 
every  context,  will  be  a  combination  of  both  spe¬ 
cific  and  diffuse  support.  What  it  means  is  that 
each  political  object  should  be  “addressed  alter¬ 
nately  or  jointly  by  both  diffuse  and  specific  sup¬ 
port”  (Torcal  &  Montero,  2006,  p.  9). 

Measurement  and  Hypothesis  Testing  Problems 

The  disagreements  over  the  multidimensional 
nature  of  political  support  have  been  extended  to 
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debates  regarding  the  appropriate  ways  of  measur¬ 
ing  it.  No  consensus  has  been  reached  among 
scholars  on  which  are  the  best  indicators  for  cap¬ 
turing  the  different  components  of  political  sup¬ 
port.  Furthermore,  discussions  regarding  what  the 
indicators  aiming  to  measure  specific  and  diffuse 
support  have  meant  date  back  to  the  1970s,  when 
scholars  debated  whether  the  increasing  levels  of 
discontent  shown  in  surveys  reflected  attitudes 
toward  the  incumbents  or  toward  the  regime. 
Almost  40  years  later,  the  debate  is  still  open. 
Indicators  that  have  been  used  to  measure  political 
support  have  been  criticized  severely  and  recur¬ 
rently  for  their  problems  of  validity.  For  instance, 
it  is  still  common  to  measure  regime  support  (both 
specific  and  diffuse)  with  satisfaction  with  demo¬ 
cratic  functioning,  an  indicator  highly  contami¬ 
nated  by  incumbent  support,  which  in  a  way  is 
reproducing  the  1970s  debate  between  Jack  Citrin 
and  Arthur  Miller,  reaching  again  the  same  incon¬ 
clusive  results. 

Dalton  (2004)  has  warned  about  the  difficulties 
on  drawing  fine  theoretical  distinctions  when  aim¬ 
ing  to  measure  the  different  components  of  politi¬ 
cal  support.  In  fact,  there  are  doubts  about  the 
efficiency  of  empirical  research  when  trying  to 
isolate  the  different  types  of  support,  as  public 
sentiments  can  blend  adjacent  orientations  and 
indicators  of  public  support  frequently  overlap 
between  levels.  Despite  the  obstacles  in  measuring 
the  different  components  of  political  support, 
Dalton  argues  that  it  is  important  to  depart  from  a 
multidimensional  framework,  as  “public  orienta¬ 
tions  towards  different  objects  of  support  carry 
different  political  implications”  (p.  7).  Therefore, 
he  proposes  different  indicators  to  measure  spe¬ 
cific  and  diffuse  support  for  each  different  political 
object,  leaving  aside  the  frequent  assumption,  dis¬ 
cussed  before,  that  each  political  object  contains 
by  definition  a  level  of  specificness/diffuseness. 
However,  this  approach,  although  not  too  critical 
of  Easton’s  theoretical  proposal,  does  contain  a 
problematic  assumption:  It  assigns  an  a  priori 
meaning  of  specificity  or  diffuseness  to  different 
survey  indicators  for  each  political  object.  Some 
critics  or  researchers  argue,  instead,  that  the  level 
of  conditional  (specific)  and  unconditional  support 
(diffuse)  can  only  be  estimated  by  assessing  empir¬ 
ically  the  role  of  different  variables  in  explaining 
political  support  for  each  political  object.  Support 
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is  specific,  and  therefore  conditional,  when  it  can  be 
explained  by  variables  such  as  ideological  prefer¬ 
ences,  party  preferences,  underlying  values,  or  sat¬ 
isfaction  with  the  economic  or  social  performance. 
Support  becomes  diffuse,  and  therefore  uncondi¬ 
tional,  when  neither  of  these  factors  influences  the 
level  of  individual  support  in  a  significant  way. 
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Survey  Research 


Survey  research  is  a  method  of  measurement  based 
on  the  systematic  collection  of  information  from  a 
sample  of  members  of  a  population.  This  sample  is 
only  a  fraction  of  the  population  of  interest.  It  rep¬ 
resents  the  whole  population  by  the  way  it  is 
selected.  Conclusions  about  the  total  population  are 
reached  through  a  process  of  statistical  inference. 


Samples  can  be  drawn  of  anything  whose  properties 
can  be  defined  and,  therefore,  can  be  made  of  many 
types  of  populations,  such  as  persons,  products, 
institutions,  organizations,  or  events.  The  most 
familiar  surveys  are  those  taken  from  persons  about 
their  opinions  using  a  structured  questionnaire. 
This  entry  is  mostly  about  survey  research  of 
human  populations.  In  the  United  States  and  other 
English-speaking  countries,  surveys  are  also  called 
polls.  There  is  no  precise  distinction  between  these 
terms  and  there  are  no  significant  methodological 
differences  between  polls  and  surveys.  The  term 
survey  is  frequently  used  when  speaking  of  polls 
done  by  academic  institutions,  and  polls  are  surveys 
reported  in  the  media.  The  first  part  of  this  entry 
describes  the  general  scientific  bases  of  survey 
research  and  offers  an  overview  of  the  principal 
aspects  involved  in  the  design  of  surveys.  The  sec¬ 
ond  part  gives  an  account  of  the  key  moments  in  the 
evolution  of  survey  research.  The  third  part  describes 
the  principal  contributions  of  survey  research  to  the 
social  sciences  along  with  some  criticisms. 

Scientific  Bases  of  Survey  Research 

Survey  research  is  based  on  sampling  techniques 
founded  on  probability  theory.  People  attribute 
different  meanings  to  the  concept  of  probability. 
Most  often  it  is  interpreted  as  the  relative  fre¬ 
quency  of  events,  such  as  the  number  of  red  or 
black  cards  in  a  deck.  Its  precise  meaning  is  cen¬ 
tered  on  the  concept  of  a  “normal  distribution.” 
The  most  important  characteristic  of  the  normal 
distribution  is  that  even  when  the  actual  values  of 
a  variable  do  not  have  a  normal  distribution,  the 
mean  of  that  variable  estimated  from  a  large  sam¬ 
ple  can  be  regarded  as  having  come  from  a  normal 
distribution  of  sample  means.  This  is  what  the 
central  limit  theorem  states.  If  researchers  draw  a 
large  number  of  independent  random  samples  of 
any  variable,  the  mean  value  of  the  means  obtained 
in  each  of  the  random  samples  will  be  close  to  the 
actual  mean  of  the  population.  Moreover,  they 
will  be  able  to  calculate  the  standard  error  of  the 
mean.  Hence,  researchers  can  draw  samples  of 
populations  and  infer  that  the  mean  of  the  vari¬ 
ables  observed  will  be  close  to  the  mean  of  the 
population  with  a  known  margin  of  error. 

Good,  reliable  surveys  follow  basic  scientifically 
founded  rules  to  select  the  sample  of  the  population 
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to  be  studied.  First,  the  researcher  has  to  precisely 
define  the  population  he  or  she  wants  to  study.  The 
sample  of  individuals  to  be  interviewed  is  selected 
in  such  a  way  that  every  member  of  the  total  popu¬ 
lation  has  a  not  necessarily  equal,  but  a  known, 
chance  of  selection.  If  all  members  of  the  total 
population  have  the  same  chance  to  be  selected, 
every  individual  in  the  sample  represents  the  same 
number  of  persons  in  the  population.  These  are 
self-weighted  samples.  If  the  selected  individuals 
have  different  chances  of  being  selected,  weights  are 
used  to  estimate  the  number  of  persons  in  the 
population  they  represent. 

Any  bias  in  the  selection  of  the  sample  would 
produce  distorted  measurements.  Errors  of  selec¬ 
tion  come  from  many  sources,  but  selection  errors 
are  frequently  produced  when  the  researcher  has 
some  preconceived  ideas  about  the  individuals  he 
or  she  wants  to  select — for  instance,  when  the 
researcher  looks  for  typical  or  relevant  individuals 
of  a  population,  when  the  poll  depends  on  volun¬ 
teer  or  easy-to-contact  subjects,  or  when  inter¬ 
viewers  are  asked  to  look  in  the  field  for  some 
specific  type  of  subjects.  A  random  selection  of  a 
sample,  such  as  choosing  a  specific  number  of  per¬ 
sons  from  a  list  using  random  numbers  or  selecting 
every  /zth  number  of  persons  from  the  list,  mini¬ 
mizes  selection  biases.  The  designer  of  a  survey  can 
improve  the  accessibility  of  the  sample-selecting 
groups  of  individuals  that  can  be  found  in  the 
same  location  or  area,  in  clusters  of  individuals, 
such  as  electoral  precincts,  counties,  or  neighbor¬ 
hoods.  This  technique  is  known  as  area  probability 
sampling.  The  survey  designer  can  also  subdivide 
the  population  into  more  homogeneous  subparts, 
known  as  strata,  to  improve  the  quality  and  acces¬ 
sibility  of  the  samples.  This  technique  is  known  as 
stratification.  For  instance,  in  an  electoral  study, 
the  researcher  can  select  in  the  first  stage  a  sample 
of  precincts  from  a  total  list  of  polling  areas,  a  list 
that  was  previously  divided  according  to  turnout 
and  voting  behavior.  In  the  second  stage,  the 
researcher  selects  households  within  each  of  the 
precincts  selected  following  a  systematic  method, 
such  as  every  other  household  starting  where  the 
voting  site  is  installed.  In  the  third  stage,  the  inter¬ 
viewer  selects  the  persons  to  be  interviewed  fol¬ 
lowing  some  almost  random  method  of  selection, 
such  as  asking  for  the  person  with  the  most  recent 
birthday. 


When  a  sample  is  selected  following  these  basic 
rules — that  is,  if  it  is  truly  probabilistic — it  will  be 
unbiased  and  will  therefore  reproduce  the  charac¬ 
teristics  of  the  total  population  within  a  known 
margin  of  error.  In  the  electoral  example,  the 
aggregate  vote  of  the  sampled  precincts  should  be 
very  close  to  the  actual  vote  in  previous  elections, 
and  the  sociodemographic  characteristics  of  the 
population  interviewed  would  be  very  similar  to 
the  total  population.  In  other  words,  once  aggre¬ 
gated,  the  statistics  obtained  from  the  sample  will 
reproduce  the  actual  parameters  of  the  population 
with  a  known  margin  of  error. 

The  adequate  size  of  a  sample  does  not  depend 
on  the  extent  of  the  total  population.  The  deci¬ 
sion  on  the  size  of  a  survey  is  always  a  delicate 
balance  between  the  value  of  the  additional  infor¬ 
mation  and  the  cost  to  obtain  it,  but  ultimately, 
the  designer  of  the  study  has  to  determine  the 
lowest  number  of  cases  on  which  he  or  she  is  will¬ 
ing  to  generalize  from  the  sample  to  the  total 
population  and  the  size  of  the  error  he  or  she  is 
willing  to  accept.  In  any  case,  as  the  size  of  the 
sample  is  increased,  its  quality  improves,  but  at 
some  point,  any  increase  in  size  produces  very 
limited  gains  in  quality.  Common  sample  sizes  for 
national  or  regional  studies  usually  vary  between 
1,200  and  2,500  cases,  but  if  the  designer  of  the 
study  wants  to  generalize  about  something  of 
very  low  occurrence,  say  some  rare  illness,  he  or 
she  would  get  very  few  cases  with  these  sizes.  For 
instance,  imagine  that  only  3%  of  the  population 
has  been  infected  by  the  H1N1  virus  at  some 
moment.  In  a  well-designed  survey  of  1,000 
cases,  the  researcher  would  generalize  about  the 
infected  population  based  on  only  30  cases,  a 
very  small  number. 

Surveys  are  interviews  with  the  individuals  of 
the  sampled  population  using  a  structured  ques¬ 
tionnaire.  The  interview  with  the  sampled  indi¬ 
viduals  can  be  done  by  an  interviewer  or  can  be 
self-administered.  Individuals  can  be  interviewed 
in  person  or  by  telephone.  Wherever  the  extent  of 
telephone  lines  is  not  significantly  biased  toward 
the  upper-level  economic  strata,  the  telephone  is 
the  preferred  way  to  interview.  The  new  informa¬ 
tion  technologies  offer  new  forms  of  interviews. 
Self-administered  interviews  are  questionnaires  to 
be  filled  out  that  were  given  to  the  sampled  popu¬ 
lation  directly  or  sent  via  e-mail  or  the  Internet.  In 
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some  cases,  surveys  can  be  done  using  more  than 
one  mode  of  interview,  but  the  sample  has  to  be 
very  carefully  designed,  and  the  researcher  should 
be  aware  that  the  mode  of  interview  can  have 
some  effects  on  the  results  that  the  researcher  must 
take  into  account. 

The  design  of  the  questionnaire  is  another  fun¬ 
damental  aspect.  Questions  should  produce  good, 
unbiased,  and  objective  information.  The  ques¬ 
tions  in  a  survey  should  answer  what  the  designer 
of  the  study  wants  to  know  and  should  be  equally 
accessible  to  all  respondents.  For  some  topics,  the 
researcher  knows  in  advance  that  the  questions 
asked  could  produce  something  called  social  desir¬ 
ability  biases,  simply  because  they  are  embarrass¬ 
ing,  or  because  the  respondent  might  be  afraid  to 
reveal  what  he  or  she  assumes  to  be  “bad”  behav¬ 
ior.  Some  questions  might  be  simply  deliberately 
biased  because  they  include  approval  of  means  and 
of  results,  what  is  known  as  double-barreled  ques¬ 
tions.  The  order  of  the  questions  in  a  questionnaire 
should  be  also  carefully  designed,  because  some 
questions  might  lead  to  the  answers  of  the  follow¬ 
ing  questions. 

The  biggest  challenge  to  survey  research  is  the 
problem  of  nonresponse.  Even  if  the  sample  and 
the  questionnaire  are  carefully  designed,  and  the 
field  execution  of  the  actual  process  of  data  collec¬ 
tion  is  done  by  well-trained  professional  interview¬ 
ers  and  close  supervision,  people  can  refuse  to 
participate  or  not  answer  the  questions  asked  in 
the  survey.  In  some  environments,  the  intense  use 
of  polls  by  the  media  has  resulted  in  an  increasing 
number  of  refusals  to  participate.  Survey  research 
is  based  on  the  information  provided  by  those  who 
participated  in  the  survey.  Based  on  the  random 
selection  of  the  sample,  in  survey  research,  it  is 
assumed  that  those  who  refused  to  participate  or 
did  not  answer  some  of  the  questions  are  not  sub¬ 
stantially  different  from  those  who  participated, 
except  for  their  unwillingness  to  participate.  This 
may  be  a  sound  assumption,  but  ultimately  there  is 
no  way  to  be  sure  about  it,  since  the  researcher 
does  not  have  actual  information  from  the  nonre¬ 
spondents.  Consistent  trends  in  public  opinion 
data  or  the  rationality  of  actual  changes  in  these 
trends  and  other  indicators  show  that  the  assump¬ 
tion  about  nonresponse  is  sound,  but  surveys  with 
very  large  numbers  of  nonresponse  should  be  read 
with  some  degree  of  skepticism. 


Information  Obtained  in  Surveys 

Surveys  can  produce  factual  or  perceptual  infor¬ 
mation.  The  objective  information  can  be  actual 
facts  of  different  matters  such  as  how  many  indi¬ 
viduals  in  the  sampled  population  are  employed  or 
unemployed,  the  extent  of  the  audience  of  a  TV 
program,  the  number  of  years  a  person  attended 
school,  or  the  types  of  goods  the  family  owns  or 
uses.  They  can  also  measure  past  or  present  behav¬ 
ioral  experiences,  such  as  cigarette  consumption, 
crime  victimization,  or  use  of  public  transporta¬ 
tion.  Likely  future  behavior  can  also  be  measured, 
but  there  is  a  general  consensus  that  polls  are  not 
a  very  good  instrument  to  do  this.  The  typical  elec¬ 
toral  forecasts  are  based  on  questions  about  the 
vote  preference  of  the  respondent  at  the  moment  of 
the  interview,  and  a  typical  error  in  predicting  the 
election  results  usually  comes  from  late  changes  in 
preferences  that  surveys  might  fail  to  register. 

Perceptual  information  can  measure  attitudes  or 
opinions,  such  as  the  evaluation  of  the  job  perfor¬ 
mance  of  different  authorities,  the  state  of  the 
economy,  or  opinions  about  some  general  topics 
or  issues.  Attitudes  are  positive  or  negative  orien¬ 
tations  toward  objects,  persons,  governments, 
institutions,  public  policies,  or  party  programs. 
Surveys  can  measure  not  only  the  prevalence  of  the 
attitudes  explored  but  also  their  intensity. 
Preferences  based  on  comparisons  of  attitudes  are 
also  measured  in  surveys.  Comparisons  of  candi¬ 
dates,  parties,  specific  public  policies,  or  general 
orientations  toward  them  are  frequently  obtained 
from  the  public  in  surveys  of  all  kinds.  People  have 
opinions  or  perceptions  about  the  objective  state 
of  the  world.  For  the  researcher  who  measures 
these  beliefs,  it  is  not  relevant  if  the  belief  is  true  or 
not,  the  relevant  finding  is  that  a  population  thinks 
that  the  state  of  affairs  at  a  given  moment  is  as 
they  describe  it.  Many  times,  perceptions  about  the 
state  of  the  economy  or  crime  are  not  in  correspon¬ 
dence  with  actual  reality.  As  in  the  case  of  atti¬ 
tudes,  surveys  can  measure  how  important  these 
beliefs  are  for  the  public. 

A  survey  taken  at  some  particular  moment  pro¬ 
duces  a  cross-sectional  measure  of  the  value  of 
variables  at  the  moment  of  the  interview.  Many 
variables  are  constantly  measured,  resulting  in 
extremely  useful  longitudinal  studies  that  show 
changes  and  continuities.  If  these  continuous  mea¬ 
surements  are  obtained  from  differently  sampled 


Survey  Research  2569 


individuals,  they  can  show  changes  over  time  in 
the  mean  values  of  the  variables  studied,  but  con¬ 
clusions  about  the  behavior  of  the  individuals 
would  be  unfounded.  The  researcher  would  have 
to  interview  the  same  individuals  over  time  to 
come  to  sound  conclusions  about  the  individual’s 
changes  of  behavior.  These  are  panel  studies  with 
specific  methodologies. 

Development  of  Survey  Research 

Survey  research  evolved  gradually  from  various 
social  needs.  A  survey  is  any  comprehensive  view 
or  appraisal  of  any  population.  In  this  broad  sense, 
surveys  have  probably  been  done  since  the  first  tax 
collectors  valued  the  extent  and  resources  of  their 
desired  contributors.  Actually,  the  first  evidences 
of  written  language  are  records  of  commercial 
transactions  and  of  tax  or  tribute  payers.  In  the 
Middle  Ages,  poll  began  to  mean  “head”  and 
acquired  the  contemporary  meaning  of  taxing.  In 
the  16th  century,  every  Spanish  conquistador  was 
always  accompanied  by  an  accountant  in  charge  of 
the  survey  of  the  conquered  territories.  These  tax 
collectors  produced  hundreds  of  invaluable  sur¬ 
veys  of  the  “New  World,”  first  recovering  the  tax 
records  of  the  Indian  authorities  where  available 
and  later  carrying  out  their  own  appraisals.  The 
Industrial  Revolution  created  a  complex  set  of  new 
social  problems  and  the  need  to  understand  them. 
There  are  records  of  some  surveys  that  were  done 
on  different  matters  such  as  poverty,  education, 
and  health  in  many  of  the  countries  that  are  now 
called  industrialized  or  developed.  In  the  newly 
independent  national  states,  the  need  to  create  new 
tax  rolls  kept  tax  collectors  busy. 

These  were  enterprises  of  data  collection  that 
attempted  to  survey  the  entire  population  of  inter¬ 
est.  The  evolution  of  surveys  based  on  fractions  of 
the  populations  took  longer  to  become  the  com¬ 
mon  practice  of  today.  The  desire  to  predict  the 
results  of  upcoming  elections  has  been  a  constant 
incentive  in  the  development  of  survey  research. 
Besides  the  value  that  the  news  media  attribute  to 
these  forecasts,  particularly  in  presidential  systems, 
electoral  predictions  based  on  surveys  offer  the 
rare  if  not  unique  opportunity  to  evaluate  ex  post 
the  accuracy  of  the  polls  on  which  they  are  based. 
The  first  of  these  surveys  were  the  so-called  straw 
polls  that  some  newspapers  in  the  United  States 


conducted  to  predict  the  winner  of  upcoming  elec¬ 
tions.  It  is  generally  agreed  that  the  first  such  poll 
was  taken  in  the  summer  of  1824  by  the  Harrisburg 
Pennsylvanian  in  the  election  between  Andrew 
Jackson  and  John  Quincy  Adams.  Other  straw 
polls  were  done  in  the  19th  century,  and  at  least 
nine  newspapers  and  magazines  published  the 
results  of  straw  polls  for  different  state  and  presi¬ 
dential  elections  in  the  first  quarter  of  the  20th 
century.  The  newspapers  obtained  the  information 
to  do  their  prediction  by  various  methods.  In  some 
cases,  the  newspaper  printed  a  ballot  for  some 
period  of  time  and  asked  its  readers  to  cut  it  and 
mail  it  back  to  the  journal.  Others  polled  citizens 
attending  political  meetings  or  in  selected  areas. 

From  1916  through  1932,  the  magazine  Literary 
Digest  predicted  the  outcomes  of  the  U.S.  presi¬ 
dential  elections  correctly  but  missed  in  1936  with 
data  obtained  with  its  well-known  straw  poll.  At 
every  presidential  election,  the  Literary  Digest 
mailed  millions  of  ballots  some  time  before  the 
elections  to  a  mail  list  taken  from  the  telephone 
directories  of  state  car  registrations.  In  1936,  the 
magazine  announced  that  the  Republican  candi¬ 
date  Allan  Landon  was  expected  to  obtain  57%  of 
the  national  vote.  He  obtained  38.5%.  This  fiasco 
marked  the  demise  of  the  straw  polls.  Regardless 
of  the  occasional  accuracy  of  some  of  the  predic¬ 
tions  estimated  with  the  data  obtained  with  this 
form  of  data  collection,  the  ballots  were  tabulated 
ignoring  the  implicit  biases  involved  in  the  way  the 
respondents  were  selected.  The  mail  list  used  by 
the  Literary  Digest  magazine  was  clearly  biased  to 
upper-income  households,  and  the  magazine 
received  information  only  from  those  who  were 
willing  to  respond  to  the  poll.  They  assumed  in 
aggregating  the  results  that  car  owners  willing  to 
respond  represented  the  entire  population.  In  other 
words,  the  researcher  assumed  that  the  partici¬ 
pants  in  the  poll  had  the  same  chance  of  selection 
as  those  who  did  not  participate  in  the  survey, 
either  because  they  were  not  contacted  or  because 
they  refused  to  answer.  These  nonresponse  biases 
favored  the  most  committed  Republicans,  result¬ 
ing  in  the  mistaken  estimate.  These  were  typically 
what  we  call  now  nonprobabilistic  samples. 

By  the  1930s,  vast  changes  in  the  production 
and  in  the  marketing  of  goods  and  services  started 
to  develop.  New  information  about  the  markets 
and  the  demands  of  consumers  was  needed,  and 
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market  research  appeared  to  satisfy  it.  A  long  tradi¬ 
tion  of  social  statistics  supplied  in  Western  Europe 
the  needs  of  this  new  market  research,  while  in  the 
United  States,  data  collection  based  on  surveys 
became  a  common  practice,  and  survey  research 
received  a  substantial  impulse  that  has  pervaded 
until  today.  By  the  end  of  World  War  I,  many  large 
manufacturers  and  advertisement  media  had  mar¬ 
ket  research  groups,  and  new  firms  began  to  offer 
market  and  consumer  information  to  their  clients; 
among  them  were  those  founded  by  Elmo  Roper 
and  George  Gallup.  They  adopted  the  methods  of 
market  research  to  measure  public  attitudes  and,  in 
fact,  created  today’s  public  opinion  polls.  In  July 
1935,  Fortune  magazine  published  the  first  Fortune 
poll  produced  by  Roper,  and  in  October,  Gallup 
published  the  first  Gallup  poll  as  a  syndicated  ser¬ 
vice  to  35  newspapers  and  magazines. 

The  methodologies  used  by  this  new  industry 
were  closer  to  the  scientific  standards  of  the  meth¬ 
ods  in  use  today  but  insufficient  to  produce  unbi¬ 
ased  results.  Usually,  respondents  were  contacted 
and  interviewed  either  in  clusters  of  neighborhoods 
or  in  places  of  great  attendance  of  visitors  such  as 
commercial  plazas.  To  try  to  prevent  biases  in  the 
data  collected,  researchers  selected  very  diverse 
points  of  interview,  and  interviewers  were  asked  to 
complete  predefined  sociodemographic  quotas  in 
such  a  way  that  the  aggregate  would  resemble  the 
strata  defined  in  the  population  census. 

Another  big  fiasco  marked  again  the  change  in 
methodologies  and  the  real  incorporation  of  the 
probability  principles  to  the  selection  of  the  sam¬ 
pled  population.  Polls  had  predicted  with  great 
accuracy  the  U.S.  presidential  elections  of  1940 
and  1944,  but  in  1948,  the  three  major  producers 
of  electoral  forecasts,  George  Gallup,  Elmo  Roper, 
and  Archibald  Crossley  predicted  a  substantial  vic¬ 
tory  for  Thomas  E.  Dewey  over  Harry  S.  Truman 
who  was  the  actual  winner  by  a  comfortable  mar¬ 
gin  of  more  than  four  points.  Hearings  were  called 
by  the  House  of  Representatives  to  discuss  the  big 
mistake,  and  the  Social  Science  Research  Council 
produced  a  study  of  the  major  polls.  The  study 
pinpointed  various  reasons  for  the  error,  but  the 
most  important  in  terms  of  the  adoption  of  better 
sampling  methods  pointed  to  the  persistence  of 
selection  biases  associated  with  the  quota  sampling 
in  use.  Instead  of  following  loose  quota  samples,  the 
industry  began  to  assign  interviewers  to  specific 


areas  previously  selected  through  a  random  process. 
As  the  installation  of  telephone  lines  dramatically 
expanded,  interviewing  respondents  over  the  phone 
became  the  most  common  form  of  contact  and 
interview.  Respondents  were  selected  through  a 
process  known  as  random  digital  dialing  where  the 
researchers  do  not  need  lists  or  directories  since  the 
lines  are  selected  automatically  following  random 
processes.  Survey  research  expanded  all  over 
through  market  research  and  government  produc¬ 
tion  of  social  studies  and  economic  data.  As  large 
producers  or  marketing  companies  entered  new  ter¬ 
ritories,  local  market  research  began  to  be  demanded, 
and  local  industries  emerged.  Gallup  opened  local 
offices  in  many  places,  and  by  the  outbreak  of 
World  War  II,  public  opinion  polling  was  a  com¬ 
mon  practice  in  most  democratic  societies. 

Perspectives  of  Survey  Research 

Public  acceptance  of  surveys  varies  constantly.  Big 
flaws,  such  as  those  that  still  occur  in  electoral  fore¬ 
casts,  produce  widespread  criticism  and  disappoint¬ 
ment.  During  political  campaigns,  when  polls  tend 
to  be  more  present  in  the  media,  criticism  increases, 
and  the  idea  that  polls  influence  voters  is  frequently 
expressed.  The  use  of  polls  as  a  source  of  informa¬ 
tion  for  politicians  to  craft  their  messages  has 
become  a  constant  source  of  criticism  of  survey 
research.  The  media,  particularly  broadcast  media, 
play  an  increasingly  important  role  in  politics 
today.  Image  and  constructed  messages  prevail  in 
the  public  arena.  A  new  breed  of  political  consul¬ 
tants  has  emerged  to  polish  the  image  of  politicians 
and  to  construct  their  messages.  These  consultants 
struggle  to  tune  their  designs  to  public  opinion 
based  on  polls  and  surveys.  Critics  claim  that  this 
use  of  survey  research  leads  to  the  trivialization  of 
democracy  and  that  it  manipulates  the  desires  and 
choices  of  the  public. 

These  are  serious  criticisms,  but  survey  research 
has  produced  many  positive  things  as  well.  The 
electronic  and  printed  media  routinely  report  pub¬ 
lic  sentiments  on  a  wide  variety  of  issues  and 
closely  follow  electoral  campaigns.  This  common 
practice  in  established  democracies  set  in  later  in 
younger  democracies,  but  in  the  long  run,  the  use 
of  survey  research  to  monitor  governments  and 
elections  developed  together  with  the  consolida¬ 
tion  of  democratic  practices.  This  intensive  use  of 
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public  opinion  polling  to  produce  political  data 
such  as  election  forecasts  or  presidential  approval 
is  the  most  visible  evidence  of  survey  research  in 
the  daily  life  of  citizens. 

Survey  research  has  also  been  widely  used  for  a 
long  time  to  produce  a  broad  range  of  data  about 
markets  and  various  economically  relevant  vari¬ 
ables.  Surveys  are  routinely  taken  to  estimate  market 
opportunities  for  old  or  new  products  or  to  follow 
public  sentiment  toward  brands  and  products. 
Many  of  the  most  familiar  economic  data,  such  as 
employment,  income,  and  expenditure  patterns 
among  households  or  consumer  sentiments,  are 
obtained  using  surveys.  Policymakers  use  survey 
research  to  assess  social  needs  or  to  evaluate  the 
impact  on  the  public  of  the  policies  implemented. 

But  the  most  significant  effect  of  survey  research 
has  been  in  the  production  of  knowledge  over  a  vast 
number  of  disciplines.  The  impact  of  survey  research 
in  pure  and  applied  social  research  has  been  of  great 
consequence.  Studying  scientifically  selected  sam¬ 
ples,  researchers  can  soundly  infer  conclusions  of 
total  populations  without  the  almost  impossible  task 
of  studying  every  individual.  The  repeated  measure¬ 
ment  of  many  and  very  diverse  variables  has  allowed 
the  understanding  of  change.  Information  obtained 
through  direct  interviews  to  actual  persons,  and  sci¬ 
entifically  based  statistical  inferences  to  the  entire 
population,  opened  the  door  to  the  study  of  indi¬ 
vidual  motivations.  Before,  individual  motivations 
could  be  studied  only  through  experimentation  with 
very  small  samples  or  improperly  inferred  from  the 
study  of  aggregate  data.  Survey  research  has  pro¬ 
duced  abundant  findings  in  many  areas  of  econom¬ 
ics,  sociology,  social  and  political  psychology,  and 
political  science  among  others. 

In  many  countries,  huge  collections  of  surveys 
have  been  collected,  and  various  international 
projects  are  booming,  opening  the  doors  to  a  new 
comparative  understanding  of  many  aspects  of 
societies  and  politics. 

Ulises  Beltran 

Centro  de  Investigacion  y  Docencia  Economicas 

Mexico  City,  Mexico 
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Systematic  surveys  of  the  opinions,  attitudes,  and 
behavior  of  human  populations  are  one  of  the 
major  inventions  of  20th-century  social  science 
research.  Although  survey  research  has  its  origins 
in  the  pre-World  War  II  era,  in  the  late  1940s, 
social  scientists  began  to  conduct  high-quality  sur¬ 
vey  research  on  a  regular  basis.  Since  that  time, 
surveys  have  become  a  staple  of  social  science 
research,  and  data  generated  by  such  studies  are 
used  by  thousands  of  scholars  around  the  world. 
These  data  are  gathered  in  a  variety  of  ways,  which 
methodologists  call  survey  modes.  In  the  early  21st 
century,  major  survey  modes  are  in-person  inter¬ 
views,  telephone  interviews,  self-completion  ques¬ 
tionnaires,  and  Internet  questionnaires.  Below, 
these  modes  and  their  respective  advantages  and 
disadvantages  are  discussed  in  greater  detail. 

In-Person  Interviews 

The  in-person  or  face-to-face  interview  occupies  a 
preeminent  position  in  scientific  survey  research. 
Such  interviews  typically  are  carried  out  in  respon¬ 
dents’  homes,  although,  historically,  survey  firms 
(“houses”)  have  used  other  venues  as  well. 
In-person  interviews  have  been  a  major  data 
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collection  device  since  the  inception  of  survey 
research  and — when  used  in  conjunction  with 
probability  samples  of  populations  of  interest — 
still  are  considered  by  many  scholars  to  be  the  gold 
standard  for  valid  and  reliable  measurement.  The 
claim  is  that  experienced,  well-trained  interviewers 
doing  in-person  surveys  can  gather  higher  quality 
data  than  would  be  possible  using  any  other  data- 
gathering  mode.  Working  in  the  face-to-face  mode, 
interviewers  can  build  rapport  and  trust  with 
respondents,  thereby  minimizing  social  desirability 
biases  that  threaten  data  quality.  Also,  although 
in-person  surveys  have  a  long  history,  they  are  not 
technically  primitive.  Using  computer-assisted  per¬ 
sonal  interview  techniques,  sophisticated  experi¬ 
ments  can  be  conducted  without  recourse  to  a 
researcher’s  lab.  Data  can  be  e-mailed  back  imme¬ 
diately  from  the  field  to  project  headquarters, 
ensuring  the  close  and  continuous  monitoring  of  a 
survey’s  progress. 

Despite  its  virtues,  the  face-to-face  mode  has 
significant  drawbacks  that  have  prompted  some 
researchers  to  consider  alternative  data  collection 
modes.  One  very  serious  problem  is  the  high  cost. 
National  surveys  with  large  probability  samples 
have  become  extremely  expensive,  and  their  cost  is 
escalating  rapidly.  Another  problem  is  declining 
response  rates.  Although  some  major  in-person 
surveys  continue  to  have  very  high  response  rates, 
others  have  experienced  a  sizable  erosion  in  the 
percentage  of  people  willing  to  participate. 

Telephone  Interviews 

Starting  in  the  1980s,  social  scientists  and  com¬ 
mercial  pollsters  reacted  to  these  emerging  prob¬ 
lems  by  employing  computer-assisted  telephone 
interview  methods,  with  probability  samples 
chosen  by  random  digit  dialing  procedures.  In 
addition  to  their  relatively  low  cost,  these  surveys 
can  be  conducted  quickly,  with  fieldwork 
completed  in  a  fraction  of  the  time  needed  for  a 
face-to-face  study.  This  is  a  very  important  con¬ 
sideration  for  researchers  interested  in  monitor¬ 
ing  the  dynamics  of  public  opinion.  Using  tele¬ 
phone  interviewing  techniques,  researchers  can 
conduct  daily  interviews  with  random  subsets 
(replicates)  of  larger  samples  to  produce  fine¬ 
grained  portraits  of  the  evolving  public  mood 
during  national  election  campaigns.  Yet  another 


desirable  feature  is  that  telephone  interviewing 
facilitates  survey  experiments. 

These  pluses  notwithstanding,  the  attractiveness 
of  the  telephone  mode  has  faded  in  recent  years. 
Response  rates  have  fallen  substantially,  with 
implications  that  are  not  yet  well  understood. 
Although  some  studies  suggest  that  the  impact  of 
low  response  rates  on  data  quality  is  not  particu¬ 
larly  pernicious,  deleterious  effects  may  vary  by 
subject  matter.  Coverage  problems  are  also  increas¬ 
ing  as  people  abandon  landline  telephones  for  cell 
phones  that  are  not  reached  using  traditional  tech¬ 
niques.  And  there  is  emerging  evidence  of  an 
enhanced  social  desirability  bias  relative  to  other 
modes. 

Self-Completion  Questionnaires 

Another  survey  mode  is  the  self-completion,  mail 
back  questionnaire.  A  cover  letter,  a  printed  ques¬ 
tionnaire,  and  a  self-addressed  return  envelope  are 
mailed  to  a  list  of  potential  respondents.  If  the 
names  and  addresses  of  a  population  of  interest  are 
available,  a  researcher  can  draw  a  probability 
sample  and  deploy  the  standard  machinery  of  sta¬ 
tistical  inference.  The  self-completion  mail  back 
questionnaire  also  can  be  used  in  conjunction  with 
an  in-person  survey,  with  the  questionnaire  being 
given  to  respondents  after  they  have  completed  a 
face-to-face  interview.  The  interviewer  can  wait 
while  the  respondent  fills  out  the  questionnaire  or 
request  that  it  be  returned  later. 

Self-completion  questionnaires  have  enormous 
cost  advantages  relative  to  in-person  and  tele¬ 
phone  interviews.  In  Australia  and  New  Zealand, 
national  election  studies  have  been  conducted 
quite  effectively  using  mail  back  questionnaires. 
The  British  Election  Study  and  the  British  Social 
Attitudes  surveys  cut  costs  and  expanded  their 
data  collections  by  having  interviewers  deliver 
supplementary  self-completion  questionnaires  to 
respondents  when  face-to-face  surveys  were  con¬ 
ducted.  The  response  rates  for  these  surveys  have 
been  very  good.  However,  the  overall  report  card 
for  self-completion  surveys  is  not  stellar. 
Researchers  using  this  mode  typically  find  that 
response  rates  are  mediocre  to  dismal  unless  care¬ 
ful  attention  is  paid  to  the  incentives  needed  to 
secure  participation  by  members  of  particular 
study  populations. 
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Internet  Surveys 

Over  the  past  2  decades,  a  new  mode,  the  Internet 
survey,  has  become  increasingly  popular.  Internet 
surveys  have  several  attractive  features.  Total  cost 
is  typically  at  least  an  order  of  magnitude  less  than 
that  for  a  comparable  in-person  survey,  and  the 
marginal  cost  of  increasing  the  sample  size  is 
extremely  low.  These  cost  advantages  make  it  pos¬ 
sible  to  do  extremely  large-N  surveys  and  to 
employ  multiple  waves  of  data  collection  to  track 
individual-level  dynamics.  Low  costs  also  mean 
that  multiple  surveys  on  the  same  topic  can  be 
fielded  simultaneously  or  in  rapid  succession, 
thereby  making  replication  a  realistic  possibility. 
There  is  more.  Speed  is  spectacular,  and  the  reach 
is  global;  Internet  surveys  have  the  ability  to  enter 
and  exit  from  the  field  in  countries  around  the 
world  in  a  matter  of  hours.  In  addition,  with  the 
spread  of  high-speed  broadband  connections, 
sophisticated  experiments  involving  a  variety  of 
audio  and  visual  stimuli  are  realistic  options. 

These  exciting  features  notwithstanding, 
Internet  surveys  are  controversial.  A  major  stick¬ 
ing  point  is  the  failure  of  the  vast  majority  of 
Internet  surveys  to  employ  probability  samples. 
The  reason  is  a  lack  of  Internet  sampling  frames 
for  populations  of  interest.  In  pioneering  work  in 
the  1980s,  the  Dutch  social  scientist  Willem  Saris 
addressed  this  problem  by  recruiting  a  panel  of 
Internet  respondents  via  initial  telephone  contact. 
The  Internet  coverage  problem — that  is,  that  some 
potential  respondents  lacked  a  computer — was 
handled  by  giving  them  one.  Saris’s  methodology 
was  subsequently  adopted  by  the  American  firm 
Knowledge  Networks.  Although  the  Saris  method¬ 
ology  initially  seemed  to  be  a  sound  method  for 
obtaining  a  probability  sample  of  Internet  respon¬ 
dents,  it  was  only  as  good  as  the  obtained  tele¬ 
phone  sample.  With  telephone  surveys  suffering 
from  declining  response  rates  and  increasing  cov¬ 
erage  problems,  the  quality  of  an  Internet  panel 
recruited  by  telephone  cannot  be  assumed.  A 
related  problem  may  be  long-term  panel  retention. 

Other  Internet  survey  researchers  do  not  attempt 
to  recruit  a  traditional  probability  sample.  For 
example,  the  British  survey  house  YouGov  draws 
stratified  random  samples  from  a  huge  list  of  people 
that  it  has  recruited  to  its  master  Internet  panel 
by  a  variety  of  non-probability-based  methods. 
Poststratification  weights  based  on  demographic 


characteristics,  media  consumption  patterns,  and 
political  partisanship  are  used  to  improve  sample 
representativeness.  YouGov’s  U.S.  partner,  Poli- 
metrix,  employs  an  innovative  respondent  selec¬ 
tion  technique  in  which  people  chosen  at  random 
from  large-scale  surveys  using  high-quality  proba¬ 
bility  samples  are  matched  with  participants  in  its 
enormous  PollingPoint  panel  on  several  demo¬ 
graphic  and  political  characteristics.  The  panelist 
with  the  closest  match  to  the  person  from  the 
probability  sample  is  then  sent  an  Internet  survey. 
Polimetrix  argues  that  its  matched  sample  esti¬ 
mates  outperform  poststratification  weighting  pro¬ 
cedures  and  have  a  sampling  distribution  similar  to 
that  used  for  simple  random  samples.  Yet  another 
innovative  approach  has  been  developed  by  George 
Terhanian  of  Harris  International,  who  uses  a  pro¬ 
pensity  score  matching  technique  to  select  respon¬ 
dents  for  Harris’s  Internet  surveys. 

Some  analyses  of  the  quality  of  the  data  pro¬ 
duced  by  these  non-probability-based  Internet 
surveys  indicate  that  they  can  produce  results  com¬ 
parable  with  high-quality  probability  surveys.  For 
example,  the  marginal  distributions  for  key  vari¬ 
ables  in  the  Internet  component  of  the  2005  British 
Election  Study  are  very  similar  to  those  produced 
by  the  traditional  in-person  survey.  Perhaps,  more 
important,  the  vast  majority  of  parameters  for 
models  of  electoral  choice  estimated  using  the  two 
data  sets  are  statistically  indistinguishable.  YouGov 
has  experienced  great  success  in  predicting  vote 
shares  in  recent  British  elections;  there  are  simi¬ 
larly  encouraging  results  for  U.S.  survey  data  gath¬ 
ered  as  part  of  the  2006  Cooperative  Congressional 
Election  Study.  However,  other  studies  conducted 
using  U.S.  data,  particularly  those  by  Jon  Krosnick 
and  his  colleagues,  report  significant  differences 
between  results  of  analyses  of  Internet  data  gath¬ 
ered  using  panels  of  respondents  recruited  by  non¬ 
probability  means  and  data  gathered  using  either 
in-person  interviews  or  Internet  questionnaires 
administered  to  national  probability  samples. 

These  discrepant  findings  are  part  of  the  larger, 
very  lively  debate  concerning  the  advantages  and 
disadvantages  of  various  modes  of  survey  research. 
As  this  debate  unfolds,  considerations  of  cost- 
effectiveness,  representativeness,  sample  size,  speed 
of  data  collection,  and  possibilities  for  sophisti¬ 
cated  experimentation  and  replication  will  be  key 
factors  that  analysts  weigh  when  deciding  which 
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survey  mode  is  best  able  to  accomplish  their  par¬ 
ticular  research  goals. 
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Sustainable  Development 


Sustainable  development  was  famously  defined  in 
the  1987  Report  of  the  World  Commission  on 
Environment  and  Development  (The  Brundtland 
Report)  as  “development  that  meets  the  needs  of 
the  present  without  compromising  the  ability  of 
future  generations  to  meet  their  own  needs.”  Long 
accepted  as  a  guiding  notion  aimed  at  reconciling 
developmental  and  environmental  concerns,  the 
concept  has  been  criticized  as  being  based  on  the 
anthropocentric  notion  of  nature  as  a  resource 
requiring  expert  human  management.  Others 
point  out  that  this  is  too  closely  linked  with 
notions  of  capital-intensive  economic  development 
to  be  of  any  use  in  the  formulation  of  an  alterna¬ 
tive  environmental  vision  that  can  restore  the  bal¬ 
ance  in  human  interaction  with  the  ecosphere. 


Proponents  and  detractors  of  the  concept  agree 
that  it  is  a  vague  notion  that  can  be  interpreted  in 
divergent  ways,  depending  on  whether  the  empha¬ 
sis  is  placed  on  development  or  on  resource  sustain¬ 
ability,  on  economic  development  only,  or  on 
broader  notions  of  human  development.  Despite 
the  appropriateness  of  these  and  other  criticisms, 
there  is  no  evidence  that  the  concept  is  soon  to  lose 
its  privileged  position  in  the  national  and  interna¬ 
tional  politics  of  development  and  the  environment. 

Development  refers  to  the  process  of  increasing 
people’s  welfare.  The  welfare  of  a  group  ulti¬ 
mately  depends  on  the  extent  and  quality  of  stock 
of  all  natural  and  human  assets  in  their  economy — 
commonly  referred  to  as  wealth.  The  notion  of 
sustainable  development  reminds  us  that  current 
changes  in  wealth  must  have  consequences  for 
welfare  in  the  future  of  both  the  current  genera¬ 
tion  and  generations  to  come.  Attempts  to  increase 
people’s  welfare  must  therefore  be  accompanied 
by  genuine  savings  in  wealth:  the  offsetting  of 
resource  depletion  and  environmental  degrada¬ 
tion,  limiting  exhaustible  resource  exports,  pro¬ 
tecting  critical  natural  capital,  and  maintaining 
and  increasing  investment  in  human  resources. 

Intergenerational  and  Global  Inequality 

Sustainable  development’s  biggest  contribution  is 
to  broaden  the  scope  of  ethical  thinking  to  include 
future  generations  among  those  to  whom  we  owe 
obligations.  It  also  holds  important  consequences 
for  the  practices  of  national  accounting  and  for 
issues  of  global  governance.  Appreciation  of  the 
importance  of  genuine  savings  implies  that  we  have 
to  look  anew  at  national  accounts  and  indicators 
of  development.  Analysts  have  long  agreed  that 
national  production  and  income  figures  provide 
little  evidence  of  the  fundamentals  of  development, 
as  they  indicate  only  how  successfully  the  current 
generation  is  exploiting  resources.  With  its  mea¬ 
sure  of  human  development  the  United  Nations 
(UN)  Development  Programme  and  its  annual 
Human  Development  Reports  have  attempted  to 
introduce  a  broader  and  intergenerational  measure 
of  development  by  combining  data  about  educa¬ 
tion  and  health  with  national  production/income 
figures.  However,  the  literature  on  sustainable 
development  prescribes  that  we  also  consider  how 
countries  manage  their  natural  and  human 
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resources  with  an  eye  to  the  future.  Income  ratios 
between  the  rich  and  poor  have  converged  in 
recent  decades  (according  to  some  measures  and  in 
some  parts  of  the  world)  and  infant  mortality  is 
declining  across  the  majority  of  countries.  There  is 
also  a  perceptible  narrowing  of  average  human 
development  index  scores  over  time  in  the  cross- 
sectional  data.  Some  analysts  take  this  as  indica¬ 
tive  of  a  narrowing  of  inequality  between  countries 
(which  is  only  one  but  a  very  important  dimension 
of  the  phenomenon  of  global  inequality).  However, 
when  we  measure  in  what  shape  the  production 
processes  and  social  policies  of  the  industrial  era 
have  left  the  human  and  natural  resource  base  of 
many  parts  of  the  world,  it  is  clear  that  a  devastat¬ 
ing  global  divide  is  widening  between  those  parts 
of  the  world  where  genuine  savings  have  kept  pace 
with  population  growth  and  those  where  fertility 
increasingly  outstrips  genuine  savings.  The  evolu¬ 
tion  of  this  genuine  savings  divide  is  a  tale  of  the 
misuse  of  power  and  the  generation  of  privilege 
and  of  marginalization,  and  it  places  humankind 
before  one  of  its  most  daunting  policy  and  moral 
challenges:  how  to  reduce  global  poverty  and 
inequality  now  without  exacerbating  both  in  the 
future.  Unfortunately,  the  international  politics  of 
sustainable  development  makes  it  unlikely  that  we 
will  be  able  to  meet  that  challenge  anytime  soon. 

Sustainable  Development 

and  International  Politics 

From  its  inception,  the  term  sustainable  develop¬ 
ment  has  had  a  bearing  on  international  politics.  In 
the  late  1960s  and  early  1970s,  the  newly  indepen¬ 
dent  industrializing  countries  of  the  South  started 
to  use  UN  resources  to  frame  their  needs  for  eco¬ 
nomic  modernization  in  terms  of  the  structural 
inequities  of  the  global  economy.  The  emerging 
green  political  revolution  in  the  industrialized 
states,  precipitated  by  reports  of  imminent  resource 
scarcity,  compelled  leaders  of  the  rich  North  to 
focus  on  possible  causal  links  between  economic 
modernity  and  environmental  degradation.  In  con¬ 
trast,  John  Vogler  argues,  the  leaders  of  the  South 
coalition  in  global  affairs  saw  their  own  rapid  eco¬ 
nomic  modernization,  supported  by  generous  offi¬ 
cial  development  assistance  from  the  industrialized 
North,  as  the  solution  for  the  interrelated  problems 
of  poverty  and  environmental  degradation. 


These  diverging  conceptions  of  the  relationship 
between  development  and  protection  of  the  envi¬ 
ronment  were  carried  into  the  1972  UN  Conference 
on  the  Human  Environment  held  in  Stockholm  but 
were  largely  left  unresolved.  Nevertheless,  Stock¬ 
holm  led  to  the  mainstreaming  of  environmental 
concerns  both  internationally  and  nationally  and 
precipitated,  among  others,  the  formation  of  the 
UN  Environment  Programme  and  the  creation  of 
departments  of  environmental  affairs  in  many 
countries.  It  was  only  with  the  publication  of  the 
Brundtland  Report  in  1987  and  the  UN  Conference 
on  the  Environment  and  Development  (Rio  “Earth 
Summit”)  of  1992  that  it  became  possible  diplo¬ 
matically  to  strike  a  tenuous  balance  between  the 
interests  of  the  North  and  the  South.  The  concept 
of  “sustainable  development” — first  popularized  in 
the  Brundtland  Report  and  then  institutionalized  at 
Rio  with  the  creation  of  the  Commission  for 
Sustainable  Development — persuaded  the  leaders 
of  the  South  that  after  the  end  of  the  Cold  War 
development  issues  were  to  be  placed  at  the  center 
of  world  attention.  When  the  Rio  Declaration  and 
the  UN  Framework  Convention  on  Climate  Change 
(UNFCCC) — two  of  the  major  outcomes  of  Rio — 
formulated  the  principle  of  “common  but  differen¬ 
tiated  responsibilities,”  the  long-standing  resource 
concerns  of  the  South  also  seemed  to  be  alleviated. 
On  the  other  hand,  the  term  sustainable  develop¬ 
ment  and  its  operationalization  at  Rio  also  came  as 
a  bonus  for  the  leaders  of  the  North,  as  it  allowed 
them  to  satisfy  the  demands  of  their  increasingly 
environmentally  conscious  electorates  to  main¬ 
stream  environmental  issues;  to  build  up  domestic 
modi  vivendi  between  agriculture,  business,  and 
environmental  activists;  and  to  attempt  to  add  envi¬ 
ronmental  standards  as  a  new  weapon  in  their 
protectionist  armor. 

Problems  and  Future  Prospects 

While  the  normative  framework  of  sustainable 
development  continues  to  be  the  focal  point  on 
which  global  environmental  and  developmental 
regimes  converge,  it  appears  as  if  the  underlying 
bargain  that  was  struck  at  Rio  is  eroding.  Bitterly 
disappointed  by  the  failure  of  the  North  to  live  up 
to  its  resource-sharing  promises,  and  after  the 
indecisiveness  of  the  Johannesburg  World  Summit 
on  Sustainable  Development  in  2002,  emerging 
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centers  of  power  in  the  South  are  falling  back  on  a 
distributive  bargaining  strategy  in  global  negotia¬ 
tions  on  development,  the  environment,  and  inter¬ 
national  economic  interaction.  This  is  evident  in 
their  vocal  resistance  to  the  inclusion  of  environ¬ 
mental  issues  in  trade  negotiations  and  in  their 
resistance  to  accept  any  climate  change  mitigation 
deal  that  will  undermine  their  economic  modern¬ 
ization.  Having  been  conned  before  by  sustainable 
development,  the  major  countries  of  the  South  are 
adamant  not  to  let  their  “right  to  develop”  be 
compromised  by  a  deal  on  the  mitigation  of  green¬ 
house  gas  emissions,  no  matter  how  persuasive  the 
case  for  mitigation  is.  With  growing  concerns 
about  fuel  and  food  shortages  becoming  more 
common  in  the  first  decade  of  the  new  millennium, 
the  emphasis  for  the  leaders  of  the  South  again 
falls  on  how  to  make  development  sustainable. 

Both  conceptually  and  politically,  the  notion  of 
sustainable  development  thus  seems  less  sustainable 
than  it  was  in  1992.  However,  there  is  no  concep¬ 
tual  or  normative  challenger  in  sight,  and  it  remains 
embedded  in  numerous  dimensions  of  global  and 
national  governance.  Successive  secretaries  general 
of  the  UN  have  elevated  the  notion  of  sustainable 
development  to  an  overarching  norm  in  multilateral 
attempts  to  secure  global  public  goods.  These 
efforts  no  longer  include  only  government  leaders, 
but  they  also  include  transnational  nongovernmen¬ 
tal  organizations  and  private  corporations.  The 
very  definition  of  a  global  public  good  favored  in 
the  UN  system  is  based  on  the  idea  that  a  public 
good  can  claim  to  be  global  only  if  it  allows  for 
nonexcludability  and  nonrivalry  across  generations. 

Its  institutionalization  at  the  highest  governance 
levels  has  allowed  the  notion  to  contribute  a  range 
of  principles  and  managerial  tools  to  the  gover¬ 
nance  of  the  environment,  both  nationally  and 
internationally.  Among  the  lasting  principles  that  it 
has  contributed  are  the  so-called  precautionary 
principle  that  is  now  deeply  ingrained  in  multilat¬ 
eral  negotiation  practice,  the  notion  of  common  but 
differentiated  responsibilities,  the  importance  of 
taking  an  ecosystem  approach  in  dealing  with  chal¬ 
lenges,  the  “polluter  pays”  principle,  and  the  prin¬ 
ciple  of  environment  policy  integration.  In  recent 
years,  emphasis  has  also  shifted  to  the  political 
conditions  of  sustainability:  subsidiarity  (i.e.,  that 
decisions  should  be  taken  at  the  appropriate  level  of 
governance),  participation  by  and  responsiveness 


to  the  people  directly  affected  by  decisions,  and 
gender  equity. 

The  concept  of  sustainable  development  has 
also  presided  over  the  emergence  of  international 
environmental  law  dealing  with  the  protection  of 
the  ozone  layer,  the  protection  of  biological  diver¬ 
sity,  biosafety,  the  protection  of  natural  forests, 
the  combating  of  desertification,  and  the  stabiliza¬ 
tion  of  climate  change.  It  has  also  had  a  profound 
effect  on  corporate  governance,  where  it  joins 
profitability  and  social  responsibility  as  one  of  the 
three  “bottom  lines.” 

Political  scientists  have  good  reasons  to  be  inter¬ 
ested  in  sustainable  development.  First,  the  con¬ 
cept  prompts  a  broader,  more  inclusive  basis  for 
ethical  deliberation  by  introducing  intertemporal 
considerations  (i.e.,  an  analysis  that  takes  into 
account  a  large  span  of  time).  Second,  sustainable 
development  has  become  an  integral  part  of  the 
contemporary  state’s  managerial  apparatus.  Third, 
it  progressively  has  become  the  normative  focus 
around  which  the  global  governance  of  the  envi¬ 
ronment  converges.  Fourth,  it  adds  a  significant 
new  dimension  to  distributive  battles  between  the 
industrialized  North  and  the  industrializing  South 
and  to  analytical  attempts  to  measure  inequality 
between  states.  Finally,  the  concept  itself  continues 
to  be  politically  highly  contested. 
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Political  symbols  are  signs  (images,  flags,  slogans, 
etc.)  that  represent  political  reality.  Political  sym¬ 
bols  constitute  the  foundation  for  the  cognitive, 
emotional,  and  psychomotor  processes  with  which 
individuals  evaluate  the  political  sphere.  Political 
symbols  can  be  used  strategically  in  a  quest  for 
political  power;  they  may,  however,  also  have 
unintended  consequences.  This  happens  because 
the  interpretation  of  political  symbols  is  not  only 
informed  by  predefined  notions  but  is  the  social, 
political,  and  cultural  context  that  adds  another 
dimension  of  meaning  to  a  given  set  of  political 
symbols. 

Political  symbols  reduce  the  complexity  of  poli¬ 
tics  by  making  references  and  increasing  the 
amount  of  information  in  the  communication  pro¬ 
cess.  While  the  term  political  symbol  describes  the 
objects  of  political  communication,  the  term  sym¬ 
bolic  politics  subsumes  the  occurrence  of  a  specific 
process  of  political  communication.  In  habitual 
language,  symbolic  politics  means  a  publicly  dis¬ 
played  deception  or  surrogate  action  that  is  used  to 
detract  from  actual  political  reality.  In  this  sense, 
symbolic  politics  is  considered  to  be  a  surrogate  for 
politics.  Symbolic  politics  differs  from  substantial 
policy.  As  a  policy  of  signs  (terms  and  slogans, 
badges,  banners  and  pictures,  gestures,  ritual  acts, 
and  political  staging),  symbolic  politics  evolves  in  a 
semantic  field.  Substantial  policy,  by  contrast,  con¬ 
sists  of  a  revisable  succession  of  political  decisions 
(e.g.,  legislation,  contracts,  and  taxes).  Symbolic 
politics  and  substantial  policy  can  be  related  to 
each  other.  On  the  one  hand,  symbolic  politics  can 
have  an  impact  on  substantial  policy.  On  the  other 
hand,  substantial  policy  can  be  communicated, 


implemented,  or  averted  by  symbolic  politics.  In 
the  following  sections,  the  relationships  between 
political  symbols  and  symbolic  politics  and  their 
various  functions  are  discussed. 

The  Use  of  Signs  and  the 
Nature  of  Symbolic  Politics 

Political  symbols/symbolic  politics  constitute  a 
specific  dimension  of  how  we  construct,  represent, 
and  perceive  political  reality.  As  is  always  the  case, 
when  we  deal  with  the  reality  of  social  phenom¬ 
ena,  this  happens  against  a  backdrop  of  questions 
concerning  the  difference  between  substance  and 
appearance.  The  negative  connotation  of  symbolic 
politics,  which  is  often  expressed  in  public  opinion 
polls,  does  not  correspond  to  the  more  sophisti¬ 
cated  view  of  symbolic  politics  in  cultural  studies 
and  the  social  sciences,  for  symbolic  politics  in  a 
broader  sense  also  means  the  strategic  use  of  signs 
to  meet  society’s  requirements  of  political  orienta¬ 
tion.  Attention  may  be  attracted;  willingness  to 
engage  in  political  action,  to  express  loyalty,  or  to 
demonstrate  protest  may  be  shown  by  symbolic 
politics.  In  doing  so,  the  symbolic  value  of  signs  is 
converted  into  political  power. 

This  process  is  not  a  common  cultural,  philo¬ 
sophical,  or  epistemological  phenomenon  of  nom¬ 
ination  or  interpretation.  The  use  of  symbols  in 
politics  is  the  expression  of  a  struggle  for  a  certain 
view  of  life.  Thus,  questions  of  dominance  or  sub¬ 
ordination  are  involved,  resulting  in  the  tendency 
to  unilaterally  implement  or  achieve  by  mutual 
agreement  what  should  be  generally  binding.  The 
strategic  use  of  symbolic  instruments  in  politics 
does  not  simply  aim  at  achieving  interaction. 
Political  symbols  are  neither  mere  decorative 
attachments  nor  ideological  ballast,  in  contrast  to 
“real”  politics.  Symbolic  politics  was,  and  is,  still 
an  integral  part  of  political  communications  in  the 
media  society  and  thereby  of  politics  in  general. 
Thus,  symbolic  politics  is  not  restricted  to  acts  of 
state  with  ostentation,  national  commemoration 
ceremonies,  or  mass  rallies.  Precisely  because  there 
is  no  pure  form  of  politics — it  is  more  or  less  always 
a  dual  reality  of  occurrence  and  interpretation — 
there  is  a  need  for  the  use  of  symbolic  means. 

Symbols  have  a  constitutive  meaning  for  social 
reality  in  an  all-embracing  cultural  and  philosoph¬ 
ical  sense.  They  are  neither  simply  an  image  of 
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reality  nor  an  exponent  of  its  objects  but,  as  the 
philosopher  Ernst  Cassirer  (1985)  points  out,  are 
a  medium  for  the  perception  of  reality  and  thus 
part  of  the  aesthetic  construction  of  the  world. 
They  create  a  transcendental  precondition  for  the 
construction  of  human  reality.  From  a  sociological 
point  of  view,  social  reality  is  not  perceived  by 
firsthand  experience  but  by  a  symbolic  world  of 
meaning.  This  symbolic  world  can  overlap  or 
assimilate  antithetical  parts  of  everyday  life. 

An  important  impulse  for  the  international 
debate  about  symbolic  politics  was  provided  by 
Murray  Edelman  in  his  works  The  Symbolic  Uses 
of  Politics  (1964)  and  Politics  as  Symbolic  Actions: 
Mass  Arousal  and  Quiescence  (1971).  He  argues 
in  favor  of  an  analysis  of  political  power  based  on 
substantiated  social  theory.  Edelman  ties  in  with 
the  basic  assumptions  of  the  symbolic  interaction- 
ist  and  the  interpretative  school  of  sociology,  as 
exemplified  in  the  work  of  George  Herbert  Mead 
(1863-1931),  which  propagates  the  extension  of 
normative  and  institutional  political  concepts  by 
affective  and  evaluative  aspects  of  politics.  He 
understands  political  action  as  playing  a  certain  role, 
which  finds  its  expression  in  dramatic  or  symbolized 
form.  Edelman  connects  subjective  and  intersubjec- 
tive  forms  of  analysis — microphenomena — with 
the  production  of  general  binding  decisions — 
macrophenomena.  Referring  to  Walter  Lippmann’s 
(1922/1997)  classic  study  of  public  opinion,  he 
draws  the  conclusion  that  politics  is  too  complex  to 
be  accessible  to  the  public.  Instead  of  improving  real 
conditions,  a  dramatic  symbolic  scene  of  abstrac¬ 
tions  appears.  A  panoptic  view  is  built  by  images, 
staging,  and  symbols.  This  surrogate  world  may 
refer  to  objective  reality  (“referential  symbols”).  But 
then  there  are  also  “condensation  symbols,”  which 
deceive  about  reality.  Permanently  installed  demo¬ 
cratic  rituals  encourage  belief  in  political  participa¬ 
tion  and  rational  acts  of  state,  even  though  they  are 
instruments  of  political  “quiescence.”  Publicly  com¬ 
municated  policy  is  becoming  ritual,  a  political  sport 
for  spectators.  According  to  Edelman,  political  sym¬ 
bols  and  symbolic  politics  are  becoming  instruments 
of  manipulation  for  political  elites. 

To  a  large  extent,  there  has  been  no  result  in  the 
critical  debate  initiated  by  Edelman  to  correlate 
the  assumed  real  politics  and  their  legitimation  by 
symbolic  politics.  Furthermore,  symbolic  politics 
has  to  deal  with  collective  actors  and  institutions 


that  are  symbolic  regimes  themselves,  being  central 
elements  in  forming,  stabilizing,  and  transforming 
social  relationships.  Political  symbols/symbolic 
politics  are  successful  when  they  make  room  for 
interpretations  that  are  compatible  with  the  way  a 
culture  endows  its  politics  with  meaning.  As  Pierre 
Bourdieu  (2000)  observed,  symbolic  politics  is  suc¬ 
cessful  if  it  refreshes  a  specific  repertoire  of  inter¬ 
pretation  that  can  be  affiliated  to  the  culture  of 
political  interpretation  or  “symbolic  space.” 

So  symbolic  politics,  in  the  sense  of  pseudopoli- 
tics,  is  just  a  negative  special  case  of  a  fundamental 
aspect  of  political  acting  that  always  has  instru¬ 
mental  and  expressive  parts.  Four  fundamental 
functions  of  symbolic  politics  can  be  defined: 

•  Political  symbols  as  well  as  symbolic  politics 
have  a  signal  function:  They  attract  attention, 
breach  routines  for  providing  or  placing 
information,  organize  the  perception  of  politics, 
and  contribute  to  the  reliability  of  behavior. 

•  By  decreasing  complexity,  they  provide  a 
regulator  to  cope  with  the  mass  of  information. 

•  The  use  of  political  symbols  not  only  aims  to 
denominate  political  circumstances  but  is  also  a 
part  of  the  political  fight  for  the  disposal  of 
denomination. 

•  By  perceiving  symbols  in  a  mode  of  suggestive 
immediacy,  symbolic  politics  not  only  addresses 
rationality  but  also  activates  emotions. 

Conclusion 

Symbolic  politics  is  policy  and  as  such  neither  prin¬ 
cipally  good  nor  bad.  It  is  an  essential  component 
of  political  communications.  Symbolic  politics  is 
not  an  invention  of  a  media  society.  However,  the 
permanent  presence  of  politics  in  the  media  and  a 
high-level  display  of  modern  media  systems  bring 
about  responses  concerning  mediation  and  the 
impact  of  “symbolic  power”  (as  portrayed  by 
Bourdieu)  to  an  extent  that  has  never  occurred  in 
history  before.  The  mere  existence  of  symbolic 
politics  is  not  itself  a  political  problem.  A  striking 
question  concerns  its  quality.  An  analytical,  differ¬ 
entiated  evaluation  may  result  from  varying  stress 
ratios:  between  retrogressive  or  reflexive,  with¬ 
drawn  to  privacy  or  active  citizenship,  relating  to 
persons  or  relating  to  factual  issues,  entertaining  or 
informative,  causing  emotions  or  solving  problems, 
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concerning  actors  or  concerning  affected  persons, 
being  hermetic  or  proceeding  from  an  idea  to  logi¬ 
cally  following  an  idea,  and  so  on.  By  using  such  an 
analytical  concept,  an  incorrect  dualistic  under¬ 
standing  can  be  avoided,  one  that  regards  symbolic 
politics  as  being  a  phenomenon  of  political  acting 
on  stage,  while  “real”  politics  are  seen  as  taking 
place  backstage.  Politics,  including  symbolic  poli¬ 
tics,  are  played  on  many  different  stages  at  the  same 
time.  In  this  action,  politicians  can  only  succeed  if 
they  know  how  to  play  different  roles  and  are  able 
to  use  symbolic  politics. 

Research  concerning  symbolic  politics  shows 
serious  gaps.  In  cultural  studies,  the  dominant 
interpretive  of  symbolic  politics  should  be  rein¬ 
forced  with  an  analysis  of  social  sciences,  espe¬ 
cially  with  the  results  of  evaluation  research. 
Researching  symbolic  politics  should  not  be 
restricted  to  a  view  on  the  interior  logic  of  systems 
of  symbols.  It  should  contain  the  structure  of 
material  and  pragmatic  problems,  as  is  the  case  in 
various  cultural-historical  evaluations.  An  analysis 
of  symbolic  politics  has  to  evaluate  the  relation¬ 
ship  of  symbolic  and  pragmatic  means  in  struc¬ 
tures  of  political  interpretation  and  construction. 
By  detecting  coherences  and  antitheses  between 
the  logic  of  fact  and  the  logic  of  presentation,  such 
an  analysis  can  contribute  toward  understanding 
the  logic  of  the  function  of  the  political  dimension 
as  well  as  the  rating  of  political  communications 
according  to  the  quality  of  democracy. 
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Systems  Theory 


Systems  theory  is  a  science  that  has  the  compara¬ 
tive  study  of  systems  as  its  object.  There  are  differ¬ 
ent  types  of  systems:  organisms  (animals,  humans, 
and  particularly  cognitive  mechanisms  in  organ¬ 
isms),  machines  (particularly  computers),  physico¬ 
chemical  systems,  psychic  systems,  and  social 
systems.  Such  a  comparative  research  program  for 
heterogeneous  types  of  systems  presupposes  a 
highly  general  concept  of  systems,  for  which 
numerous  features  have  been  proposed:  the  inter¬ 
dependency  of  the  parts  of  a  system,  the  reference 
of  any  structure  and  process  in  a  system  to  the 
environments  of  the  system,  equilibrium  and 
adaptedness  and  continuous  readaptations  to 
environmental  demands  as  core  elements  of  the 
understanding  of  a  system,  self-organization  of  a 
system  as  the  principal  way  it  responds  to  external 
intervention,  and  complexity  as  a  trigger  mecha¬ 
nism  for  system  formation  and  as  the  form  that 
describes  the  internal  network  structures  of  con¬ 
nectedness  among  system  elements.  After  examin¬ 
ing  the  historical  roots  of  general  system  theory, 
this  entry  focuses  on  the  main  two  theories  that 
have  been  elaborated  in  social  sciences  from  this 
perspective. 

General  System  Theory,  Information 
Theory,  and  Cybernetics 

Systems  theory  is  an  understanding  related  to 
those  definitions  developed  in  the  years  after  1 940 
on  the  basis  of  suggestions  from  biology  (the  “gen¬ 
eral  system  theory”  of  Ludwig  von  Bertalanffy), 
physiology  (Walter  B.  Cannon,  Walter  Pitts,  and 
Warren  McCulloch),  and  information  theory  and 


2580  Systems  Theory 


cybernetics  (Claude  Shannon,  Norbert  Wiener, 
and  William  Ross  Ashby).  Particularly,  the  idea  by 
Shannon  and  Wiener  of  defining  information  as  a 
selection  among  alternative  possibilities  turned  out 
to  be  a  generalization  transcending  heterogeneous 
systems  and  pointing  to  systems  theory  as  a  kind 
of  general  selection  theory.  This  was  connected  to 
the  strictly  binary  way  of  operation  Pitts  and 
McCulloch  postulated  in  a  paper  on  nerve  cells 
published  in  1943.  This  idea  that  at  any  branching 
of  nerve  cells  there  are  only  two  alternative  states 
available  proved  to  be  the  simplest  suggestion  of 
how  to  make  use  of  a  network  of  cells  for  long 
chains  of  numerical  operations.  From  this  came 
the  computer  and,  at  the  same  time,  came  more 
general  ideas  regarding  the  operational  realities  of 
any  observing  system  whatsoever. 

Since  its  beginnings,  the  social  sciences  were  an 
important  part  of  the  establishment  of  systems 
theory.  Jurgen  Ruesch  and  Gregory  Bateson  were 
in  1951  the  first  who  tried  to  base  a  social  science 
discipline  on  an  information  and  communication 
theory  coming  from  cybernetics  ( Communication : 
The  Social  Matrix  of  Psychiatry).  But  the  two  most 
influential  suggestions  were  the  comprehensive 
sociological  versions  of  systems  theory  that  were 
proposed  by  Talcott  Parsons  and  Niklas  Luhmann 
beginning  in  the  1950s  and  the  1970s,  respectively. 

Talcott  Parsons 

Parsons  (1902-1979)  had  been  influenced  by  equi¬ 
librium  ideas  from  physiology  (Cannon),  the 
system-environment  thinking  of  the  Harvard  phys¬ 
iologist  Lawrence  Henderson,  and  the  duality  of 
information  and  energy  that  Wiener  had  proposed. 
From  these  materials,  he  developed  a  sociological 
systems  theory.  Social  systems  are  related  either  to 
the  internal  environment  of  other  social  systems  or 
to  external  nonsocial  environments  (psychic,  bio¬ 
logical,  and  cultural  environments).  Furthermore, 
they  differ  in  the  way  they  refer  to  time:  They  are 
either  oriented  toward  realizations  in  the  future  or 
to  satisfactions  of  needs  in  the  present  ( instrumen¬ 
tal  or  consummatory ).  From  these  two  distinctions, 
internal/external  and  instrumental/consummatory, 
Parsons  derived  four  possibilities  for  the  formation 
of  systems:  first,  there  are  adaptive  systems  (com¬ 
bining  external  reference  and  future  orientation, 
e.g.,  the  economy);  second,  systems  that  are 


specialized  on  goal  attainment  (internal  reference 
and  future  orientation,  e.g.,  the  polity);  third,  sys¬ 
tems  focused  on  integration  of  system  elements 
(internal  reference  and  present  orientation,  e.g., 
the  society  conceived  as  a  community);  and 
fourth,  systems  that  are  responsible  for  the  main¬ 
tenance  of  long-term  patterns  (external  reference 
and  present  orientation,  e.g.,  cultural  institutions 
in  society). 

There  is  one  further  aspect  that  Parsons  adds  to 
this  elementary  distinction  of  the  four  types  of  sys¬ 
tems.  Among  these  four  system  types,  he  distin¬ 
guishes  those  in  which  a  primacy  of  the  transfer  of 
information  obtains  all  cultural  institutions  and 
systems  from  those  that  are  focused  on  transfers  of 
energy  (e.g.,  the  adaptive  economic  system). 
Information-rich  systems  control  energetical  sys¬ 
tems.  These,  on  the  other  hand,  are  thought  of  as 
conditioning  factors  that  limit  the  scope  of  infor¬ 
mation-rich  systems.  This  argument  was  taken 
from  Wiener,  and  Parsons  derived  from  it  a  bidi¬ 
rectional  hierarchy  of  conditions  and  control  that 
interrelated  all  types  of  systems. 

On  the  basis  of  these  elementary  distinctions, 
Parsons  worked  for  a  further  3  decades  on  a  social 
theory  that  identified  in  any  concrete  social  system 
these  four  universal  functional  aspects  (adaptation, 
goal  attainment,  integration,  and  pattern  mainte¬ 
nance),  which  often  constitute  autonomous  sub¬ 
systems  of  the  respective  system.  In  an  analogy  to 
economics,  he  then  added  input-output  analysis. 
Systems  and  subsystems  are  interrelated  via  the 
input  and  output  of  resources  that  are  either  the 
result  or  the  precondition  of  ongoing  system  pro¬ 
cesses.  Among  these  resources  are  the  cognitive 
and  motivational  resources  of  participants  and  the 
rights  and  values  that  are  attributed  to  them.  These 
different  types  of  resources  are  transferred  in 
exchange  processes  between  systems.  For  analyz¬ 
ing  these  exchange  processes  going  on  between 
systems,  without  which  systems  would  never  be 
able  to  procure  the  resources  they  need  for  their 
functioning,  Parsons  created  a  theory  about  media 
of  exchange. 

Parsons  started  again  with  an  analogy  to  eco¬ 
nomics  in  theorizing  about  media  of  exchange.  He 
postulated  that  there  is  first  of  all  money  in  its 
economic  function  as  a  medium  of  exchange,  well- 
known  to  economists.  Then,  he  added  power  and 
argued  that  it  is  best  understood  when  analyzed  as 
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analogous  to  money,  as  an  exchange  medium  that 
mediates  the  transfer  of  resources  (decisions,  sup¬ 
port,  responsibility,  etc.)  important  in  political 
processes.  And  after  having  written  theories  for 
power  and  money,  Parsons  added  further  media  of 
exchange  for  input-output  processes  between  sys¬ 
tems,  among  which  influence  and  value  commit¬ 
ments  play  an  especially  prominent  role  on  the 
level  of  societal  exchanges. 

In  continuing  his  decades-long  work  on  media 
of  exchange  between  systems,  Parsons  affirmed 
once  more  the  cognitive  starting  point  of  systems 
theory  in  the  1940s:  Systems  theory  as  an  interdis¬ 
ciplinary  endeavor  making  use  of  intellectual 
resources,  from  the  sciences  as  well  as  from  the 
humanities,  always  focused  on  strategies  for  com¬ 
paring  heterogeneous  systems  and  diverse  system 
processes. 

Niklas  Luhmann 

Luhmann’s  (1927-1998)  writings  always  presup¬ 
posed  what  Parsons  had  done.  But  it  is  also  true 
that  he  started  systems  theory  anew.  For  him,  the 
system-environment  distinction  as  inspired  by 
Ludwig  von  Bertalanffy’s  theory  of  open  systems 
was  a  much  more  important  starting  point  than  it 
was  for  Parsons.  Whereas  for  Parsons  the  environ¬ 
ment  of  a  social  system  always  consists  of  other 
systems,  for  Luhmann,  a  phenomenological  under¬ 
standing  of  environment  is  far  more  prominent — 
one  that  looks  at  the  difference  between  system 
and  environment,  environments  being  structured 
in  a  completely  different  way  than  is  the  case  in 
systems.  Order  from  noise ,  the  formula  of  Henri 
Atlan,  which  Luhmann  later  on  cited  very  often, 
gives  a  good  idea  of  concepts  of  environment  that 
look  for  contrasts  and  for  differences  and  not  for 
a  simple  plurality  of  other  systems. 

From  the  start,  complexity  was  another  central 
term  in  Luhmann’s  writings.  Systems  process  com¬ 
plexity;  they  arise  by  establishing  and  stabilizing 
the  difference  in  complexity  with  respect  to  their 
environments.  As  is  the  case  in  the  work  of  Wiener 
and  Bateson,  systems  for  Luhmann  are  those 
originating  in  communications,  and  as  such  they 
are  based  on  a  way  of  processing  information  that 
Luhmann  calls  meaning.  Meaning  is  formally 
similar  to  information  as  it  is  based  on  something 
being  a  selection  among  plural  alternatives.  But 


what  is  characteristic  of  meaning  and  thereby  con¬ 
stitutive  for  social  and  psychic  systems,  as  the  two 
types  of  systems  making  use  of  meaning,  is  that  the 
alternatives  not  chosen  are  still  remembered.  One 
can  come  back  to  them,  one  can  criticize  selections 
by  pointing  to  the  alternatives  that  were  available, 
and  one  can  write  history  on  the  basis  of  this  dual 
structure  of  meaning. 

For  Luhmann,  too,  systems  have  a  functional 
orientation.  They  specialize  in  certain  problem 
solutions  characteristic  and  constitutive  of  them. 
But  he  completely  refrains  from  making  a  finite 
catalog  of  the  basic  functions  that  have  to  be 
dealt  with  everywhere.  Instead,  every  system  is 
conceived  to  be  singular  in  fulfilling  the  func¬ 
tional  need  that  somehow  was  the  catalyst 
around  which  the  process  of  system  formation 
came  about  as  a  historical  and  a  contingent  pro¬ 
cess.  Sports,  for  example,  is  a  global  function 
system  in  present-day  world  society.  But  the 
formation  of  this  system  is  contingent  on  the 
improbable  synthesis  of  very  heterogeneous  tra¬ 
ditions  (the  hunting  and  riding  traditions  of  the 
European  nobility,  boxing  and  wrestling  as 
popular  amusements  in  early-modern  Europe, 
ball  games  in  English  public  schools,  the  gym¬ 
nastics  of  Northern  Europe,  etc.).  On  the  basis 
of  such  examples,  one  can  understand  that  for 
Luhmann,  modern  society  consists  of  huge  and 
global  function  systems  for  economic  relations, 
science,  religion,  law,  and  intimate  relations — 
and  for  a  number  of  other  functional  problems 
in  communication.  Functional  differentiation  is 
the  guiding  principle  of  differentiation  in  con¬ 
temporary  society. 

Further  elaboration  of  the  theory  then  was  done 
by  work  on  three  theories  that  were  added  to  the 
theory  of  social  systems.  There  is  a  theory  of  socio¬ 
cultural  evolution,  conceived  as  a  neo-Darwinist 
theory,  which  analyzes  how  structure  formation  is 
possible  on  the  basis  of  chance  events.  Second, 
Luhmann  reformulates  the  Parsonian  theory  of 
media  of  exchange  that  mediate  input-output  pro¬ 
cesses  between  systems  as  a  theory  of  communica¬ 
tion  media,  which  are  conceived  as  being  internal 
to  functional  systems.  These  communication  media 
are  effect  mechanisms.  They  are  based  on  symbols 
that  are  thought  to  be  effective  in  communication — 
such  as  symbols  of  money,  power,  truth,  or  love — 
and  as  such  effective  symbols,  motivate  other 
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social  actors  to  do  something  they  would  not  oth¬ 
erwise  have  done.  In  this  version  of  the  theory 
there  is  no  exchange  implied,  as  communication  is 
not  understood  as  an  exchange  process.  Third, 
Luhmann  works  out  a  differentiation  theory  that 
embeds  the  empirical  core  diagnosis  of  functional 
differentiation  into  a  more  general  theory  of  forms 
of  system  differentiation,  including  segmentation 
and  stratification.  The  guiding  idea  is  once  more  to 
have  an  instrument  for  doing  comparative  research 
on  social  systems.  Different  historical  formations 
can  be  compared  by  looking  at  the  forms  of  system 
differentiation  that  are  dominant  in  them. 

A  further  shift  in  the  foundations  of  systems 
theory  arose  in  the  late  1970s  as  the  result  of  a  new 
interdisciplinary  import.  Luhmann  adopted  the 
theory  of  autopoiesis  proposed  by  Humberto 
Maturana  and  Francisco  Varela,  which  differs  from 
the  Bertalanffy  tradition  of  open  systems  in  looking 
at  systems  (e.g.,  cells)  as  being  completely  closed  on 
the  basis  of  their  own  production  processes. 
Whatever  they  consist  of — elements,  structures, 
processes,  or  boundaries — systems  are  conceived 
to  produce  all  their  elementary  constituents  by 
their  own  production  processes.  Luhmann  con¬ 
nected  this  hypothesis  to  communication  theory. 
He  then  described  social  systems  as  autopoietic 
communication  systems,  which  always  produce 
and  reproduce  a  specific  type  of  communication 
(e.g.,  payments  in  the  economy  or  published  truth 
claims  in  the  social  system  of  science)  and  which 
do  this  only  on  the  basis  of  processes  internal  to 
the  system.  At  the  same  time,  he  held  on  to  the 
primacy  of  the  system-environment  distinction. 
Regarding  autopoietic  systems,  this  means  that  for 
them,  too,  it  is  true  that  they  only  can  continue 
their  processes  of  production  and  reproduction  of 
their  components  if  they  incessantly  observe  their 
relevant  environments  and  generate  information 
instructive  for  their  production  processes  on  the 
basis  of  these  observations. 

Systems  Theory  Today 

Systems  theory  today  continues  in  its  two  variant 
forms:  first,  as  general  systems  theory,  primarily 
influential  among  some  biologists,  chemists,  physi¬ 
cists,  and  mathematicians  and  holding  an  impor¬ 
tant  institutional  place  in  the  Santa  Fe  Institute 


and,  second,  as  a  paradigm  of  sociological  theoriz¬ 
ing  and  research,  linked  to  the  writings  of  Parsons 
and  Luhmann.  As  a  sociological  paradigm,  it  is 
attractive  because  of  its  universalism,  conceiving  a 
multifaceted  approach  to  the  analysis  of  social 
systems  that,  in  the  plurality  of  theoretical 
approaches  it  brought  about,  promises  to  be  appli¬ 
cable  to  a  whole  range  of  problems  relevant  for 
sociology.  In  political  science,  this  tradition  was 
deepened  by  David  Easton  (1953),  who  opened  the 
way  to  a  more  empirical  use  of  the  concept  of 
“political  system”  that  was  clearly  compatible 
with  the  behavioralist  approach  to  politics  and 
challenged  the  classical  use  of  the  concept  of  the 
state.  Strongly  present  during  the  1960s,  this  tradi¬ 
tion  faded  by  the  end  of  the  1970s. 
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Tax  Policy 


Taxes  are  monetary  charges  imposed  by  govern¬ 
ments  on  natural  or  legal  persons.  They  are  com¬ 
pulsory  and  unrequited:  Taxpayers  are  legally 
obliged  to  pay  taxes  and  cannot  expect  to  receive 
any  specific  benefit  in  return,  such  as  a  piece  of 
public  property  or  a  particular  health  care  treat¬ 
ment  in  a  public  hospital.  Taxes  are  not  fees. 
While  taxes  are  ostensibly  collected  for  the  sake  of 
the  public  good,  the  liability  of  the  individual  tax¬ 
payer  is  independent  of  the  personal  utility  he  or 
she  derives  from  that  good.  Taxes  are  levied  in 
virtually  all  countries  of  the  world.  They  consti¬ 
tute  the  major  source  of  revenue  of  the  modern 
state. 

This  entry  deals  with  tax  policy  generally.  It 
considers  the  major  types  and  purposes  of  taxa¬ 
tion,  reviews  common  trends  and  national  varia¬ 
tions  in  the  tax  policy  of  advanced  Organisation 
for  Economic  Co-operation  and  Development 
(OECD)  countries,  reflects  on  the  causes  and  con¬ 
straints  of  tax  policy  making,  discusses  the  effects 
of  taxation  on  growth  and  inequality,  and  high¬ 
lights  specific  features  of  taxation  in  developing 
countries. 

Types  of  Taxes 

It  is  conventional  to  distinguish  between  direct  and 
indirect  taxes.  A  direct  tax  is  paid  directly  to  the 
government  by  the  (natural  or  legal)  person  on 
whom  it  is  imposed.  Direct  taxes  are  usually  based 


on  the  taxpayer’s  ability  to  pay.  The  most  impor¬ 
tant  direct  tax  is  the  individual  income  tax  levied 
on  the  labor,  capital,  or  other  income  of  a  person 
or  household.  Other  important  direct  taxes  include 
the  corporate  income  tax  on  business  profits,  the 
property  tax  on  real  estate  holdings,  the  wealth  tax 
on  net  worth,  the  inheritance  tax  on  bequests,  and 
social  security  contributions.  An  indirect  tax  is 
paid  to  the  government  by  a  person  different  from 
the  person  on  whom  it  is  imposed.  The  most 
important  examples  are  general  consumption 
taxes,  such  as  the  value-added  tax  (VAT)  or  the 
sales  tax,  and  specific  consumption  taxes,  such  as 
excises  on  fuel,  tobacco,  or  alcohol.  These  taxes 
are  indirect  in  the  sense  that  they  are  imposed  on 
the  consumers  of  goods  and  services  but  paid  to 
the  government  by  the  seller  of  the  goods  or  ser¬ 
vices.  In  contrast  to  direct  taxes,  indirect  taxes 
cannot  take  into  account  taxpayers’  ability  to  pay. 
While  poor  people  usually  pay  less  income  tax, 
absolutely  and  relatively,  than  rich  people,  they 
always  pay  the  same  rate  of  VAT  or  excise. 

Another  typology  sorts  taxes  according  to  their 
presumed  distributive  effect.  The  individual  income 
tax  is  usually  considered  a  progressive  tax  because 
it  typically  combines  a  progressive  statutory  rate 
schedule  with  a  personal  exemption  for  a  mini¬ 
mum  level  of  income.  Both  features  ensure  that 
income  earners  pay  progressively  more  of  their 
income  in  tax  the  higher  their  earnings.  Corporate 
income,  property,  and  wealth  taxes  are  also  often 
considered  to  be  progressive.  Social  security  con¬ 
tributions,  by  contrast,  can  be  considered  propor¬ 
tional  to  the  extent  that  they  are  charged  at  flat 
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rates  and  with  little  or  no  personal  exemption. 
Often  they  are  charged  only  up  to  a  maximum 
level  of  income.  This  tends  to  make  their  distribu¬ 
tive  effect  regressive  because  taxpayers  with 
incomes  above  the  income  cap  owe  an  ever  smaller 
share  of  their  income  to  social  security  contribu¬ 
tions  the  more  they  earn  above  the  income  cap 
than  taxpayers  with  income  below  the  cap.  General 
consumption  taxes  and  excises  are  commonly  con¬ 
sidered  to  be  regressive  because  the  share  of  income 
spent  on  consumption  tends  to  decrease  as  the  level 
of  personal  income  increases.  The  theoretical  basis 
for  distinguishing  taxes  according  to  their  distribu¬ 
tive  impact  is  shaky,  however.  The  link  between 
corporate  income  taxation  and  tax  progressivity, 
for  example,  is  by  no  means  clear  and  crucially 
depends  on  assumptions  about  incidence.  There  is 
also  empirical  evidence  to  suggest  that  common 
intuitions  about  the  distributive  effects  of  taxes  are 
often  inaccurate.  For  example,  income  and  prop¬ 
erty  taxes  are  not  invariably  progressive  in  end 
effect  and  may  sometimes  even  be  regressive. 

Purposes  of  Taxation 

According  to  the  American  economist  Richard 
Musgrave  (1959),  taxation  serves  three  broad  pur¬ 
poses,  which  are  discussed  below. 

Resource  Allocation.  Taxes  allocate  resources  to  the 
public  sector.  This  is  the  obvious  fiscal  function  of 
taxation.  During  the  19th  and  20th  centuries,  alter¬ 
native  sources  of  revenue — ownership  of  productive 
resources,  sales  of  public  office  or  monopoly  power, 
coercive  confiscation  of  domestic  or  foreign  wealth, 
monetary  debasement — decreased  in  significance. 
For  the  modern  state,  taxation  is  the  single  most 
important  source  of  government  revenue.  Taxes  can 
also  improve  resource  allocation  in  the  private  sec¬ 
tor,  for  example,  by  forcing  actors  to  internalize 
externalities,  as  in  environmental  taxation  or 
tobacco  and  alcohol  taxation.  Hence,  taxation  may 
also  serve  a  regulatory  function  for  private  resource 
allocation. 

Redistribution.  Taxation  is  one  major  instrument 
to  reduce  inequalities  in  the  distribution  of  income 
and  wealth  either  directly  through  tax  progressiv¬ 
ity  or  indirectly  through  funding  redistributive 
expenditure  programs. 


Stabilization.  Taxation,  together  with  public 
expenditure,  monetary  policy,  and  public  debt,  can 
help  stabilize  the  business  cycle  and  ensure  high 
levels  of  employment  and  growth. 

Common  Trends  and  National  Variations 

Tax  policy  in  advanced  OECD  countries  has  fol¬ 
lowed  broadly  similar  trends  over  the  past  50  years 
in  terms  of  tax  level,  tax  mix,  and  tax  design,  but 
significant  national  variations  remain. 

The  level  of  total  taxation  has  risen  in  all 
advanced  OECD  countries.  Taxation  as  a  share  of 
GDP  grew  from  roughly  25%  on  average  in  1955 
to  38%  on  average  in  2005.  The  level  of  taxation 
is  highest  in  Nordic  countries,  such  as  Sweden 
(49%  of  GDP  in  2005)  and  Denmark  (51%).  It 
tends  to  be  below  average  in  Anglo-Saxon  coun¬ 
tries,  such  as  Australia  (31%).  IT  is  also  below 
average  in  the  United  States  (28%).  Tax  levels  in 
continental  Europe  and  in  Mediterranean  states  lie 
between  these  extremes. 

The  tax  mix  has  grown  more  uniform.  All 
advanced  OECD  countries  derive  the  bulk  of  their 
tax  revenues  from  just  three  taxes:  individual 
income  tax,  social  security  contributions,  and  a 
general  consumption  tax,  usually  a  VAT.  While 
these  taxes  accounted  for  only  about  45%  of  total 
tax  revenues  on  average  in  1955  and  for  as  little  as 
21%  and  23%  of  total  revenues  in  Ireland  and 
Portugal,  respectively,  by  2005,  they  accounted  for 
an  average  of  71%  of  total  tax  revenues.  Other 
taxes,  the  most  important  of  which  are  excises  on 
specific  goods  and  services,  have  lost  in  fiscal 
importance.  Significant  national  variations  still 
remain  in  the  relative  reliance  on  the  three  major 
taxes  though.  Anglo-Saxon  countries  tend  to  rely 
more  heavily  on  the  individual  income  tax  (and 
also  the  corporate  income  tax)  than  other  OECD 
countries  but  less  on  general  consumption  taxation 
and  social  security  contributions.  Australia  and 
New  Zealand  collect  no  revenue  from  social  secu¬ 
rity  contributions  at  all.  Continental  European  and 
Mediterranean  countries,  such  as  Belgium,  the 
Netherlands,  Germany,  France,  Portugal,  and 
Spain,  tend  to  rely  disproportionately  on  social 
security  contributions,  while  the  revenue  raised 
through  individual  income  taxes  is  often  below 
average.  Finally,  Nordic  countries  rely  heavily  on 
income  tax  (but  not  corporate  income  tax)  and 
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VAT  but  less  on  social  security  contributions. 
Denmark  is  the  extreme  case,  raising  almost  50% 
of  total  tax  revenue  through  income  tax  and  20% 
through  VAT  but  hardly  anything  through  social 
security  contributions  in  2005. 

In  terms  of  tax  design,  the  most  prominent 
recent  development  in  income  taxation  has  been 
the  flattening  of  the  statutory  rate  schedule.  The 
top  individual  income  tax  rate  fell  from  an  average 
of  almost  70%  in  advanced  OECD  countries  and 
more  than  90%  in  the  United  Kingdom  and  Japan 
in  1975  to  an  average  of  slightly  below  48%  by 
2005.  In  some  states,  rate  cuts  were  accompanied 
by  a  move  away  from  a  global  income  tax,  under 
which  income  from  whatever  source  is  aggregated 
and  subjected  to  a  single  rate  of  tax,  toward  a  so- 
called  schedular  income  tax,  under  which  income 
from  different  sources  is  taxed  at  different  rates. 
Thus,  interest  income  is  increasingly  subjected  to  a 
separate  final  withholding  tax  outside  the  individ¬ 
ual  income  tax  in  OECD  countries.  The  so-called 
dual  income  tax,  introduced  in  Norway,  Sweden, 
and  Finland  in  the  1990s,  separates  the  taxation  of 
capital  and  labor  income  completely,  subjecting 
only  the  latter  but  not  the  former  to  taxation  at  high 
progressive  rates.  Corporate  income  taxation  has 
been  characterized  by  a  general  move  toward  lower 
statutory  tax  rates  and  broad  tax  bases  since  the 
1980s.  The  dominant  trend  in  consumption  taxa¬ 
tion  has  been  the  spread  of  VAT  since  the  1970s.  In 
2010,  all  OECD  countries  except  the  United  States 
had  a  VAT.  At  the  same  time,  the  number  of  excises 
has  generally  fallen,  and  some  new  environmental 
taxes  have  been  introduced. 

Determinants  of  Tax  Policy 

Various  factors  shape  the  choice  of  tax  level,  tax 
mix,  and  tax  design,  including  public  spending, 
administrative  capacity,  domestic  politics,  and 
international  influences. 

Public  spending  has  increased  even  faster  than 
total  taxation  over  the  past  50  years.  It  went  up 
from  27%  of  GDP  on  average  in  1950  to  45%  on 
average  in  2005.  The  main  driver  behind  the 
growth  in  spending  was  social  expenditure.  As  the 
welfare  state  expanded,  the  outlays  for  social  pur¬ 
poses  went  up.  Social  expenditure  now  accounts  for 
more  than  50%,  on  average,  of  public  spending  in 
advanced  OECD  countries  and  explains  virtually 


all  the  variance  in  national  tax  levels.  The  major 
difference  between  high-tax  and  low-tax  countries 
is  simply  that  the  former  have  a  large  welfare  state 
while  the  latter  do  not.  While  the  close  association 
between  public  spending  and  taxation  is  obvious, 
the  direction  of  causation  is  more  contested.  Some, 
such  as  Junko  Kato  (2003),  argue  that  the  establish¬ 
ment  of  high-performance  tax  systems  preceded 
and  enabled  the  growth  of  large  welfare  states, 
while  others,  such  as  Peter  Lindert  (2004),  maintain 
that  high  taxation  is  a  consequence  of  high  levels  of 
social  expenditure. 

According  to  B.  Guy  Peters  (1991),  administra¬ 
tive  capacity  has  a  major  influence  on  the  level  and 
shape  of  taxation.  Early  taxes,  such  as  tolls,  tariffs, 
excises,  and  property  taxes,  targeted  the  easy-to- 
monitor  bottlenecks  of  economic  activity.  This 
helped  economize  on  scarce  administrative 
resources  but  was  economically  distorting  and  fis¬ 
cally  ineffective.  As  administrative  capacities  grew 
during  the  20th  century,  these  bottlenecks  were 
relieved,  and  the  tax  burden  was  spread  more 
widely  and  efficiently.  Tolls,  customs  duties,  and 
excises  were  replaced  by  general  sales  taxes  and, 
since  the  1960s,  by  the  technically  superior  but 
administratively  complex  VAT.  In  direct  taxation, 
property  and  wealth  taxes  were  supplanted  by 
income  tax.  The  spread  of  the  withholding  method 
of  income  tax  collection  since  the  1940s,  whereby 
the  tax  due  is  retained  (“withheld”)  by  the  income’s 
paying  agent  (the  employer  or  bank)  rather  than 
the  income  earners  themselves,  facilitated  the  con¬ 
version  of  an  income  tax  originally  restricted  to  the 
upper  class  into  a  mass  tax  and  increased  the  time¬ 
liness  and  buoyancy  of  tax  collections.  Problems  of 
assessment,  monitoring,  and  enforcement  also 
explain  why,  for  example,  capital  (net  wealth, 
capital  gains,  owner-occupied  housing)  or  fringe 
benefits  are  usually  taxed  more  leniently  than 
labor  income  or  why  financial  services  are  often 
exempt  from  VAT  even  though  they  constitute  an 
important  part  of  economic  activity. 

Taxation  is  a  major  issue  in  domestic  politics.  It 
is  commonly  assumed  that  left  parties  favor  high 
and  progressive  taxation  because  their  core  con¬ 
stituency  comes  primarily  from  the  lower  half  of 
the  income  distribution,  while  right  parties  prefer 
moderate  levels  of  proportional  or  regressive  taxa¬ 
tion  because  they  represent  the  upper  half  of 
income  earners  All  else  being  equal,  high  levels  of 
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inequality  should  move  tax  policy  to  the  left,  low 
levels  to  the  right.  However,  various  contextual 
factors  modify  or  even  reverse  this  simple  logic. 
First,  electoral  competition  is  usually  multidimen¬ 
sional,  and  this  tends  to  moderate  the  parties’ 
policy  positions  on  any  one  dimension,  including 
taxation.  Second,  left-party  governments  may  be 
deterred  from  high,  progressive  taxation  by  the 
possibility  of  negative  macroeconomic  conse¬ 
quences.  Actually,  left  governments  in  Sweden  and 
elsewhere  have  often  opted  for  regressive  con¬ 
sumption  taxes  and  social  security  contributions 
rather  than  progressive  income  taxes,  to  fund 
social  spending.  Finally,  institutional  factors  shape 
the  translation  of  party  preferences  into  government 
policies.  Consensus  democracies,  characterized  by 
multiparty  systems,  proportional  representation 
electoral  institutions,  broad  coalition  governments, 
interest  group  corporatism,  and  strong  judicial 
review,  decrease  the  responsiveness  of  tax  policy 
making  to  the  policy  preferences  of  the  median 
voter,  while  majoritarian  systems  with  two-party 
systems,  majority  elections,  single-party  govern¬ 
ments,  and  interest  group  pluralism  tend  to  increase 
it.  Some  argue  that  this  difference  explains  why 
majoritarian  Anglo-Saxon  democracies  rely  more 
heavily  on  progressive  income  taxes  and  corporate 
taxation  than  the  more  consensus-oriented  conti¬ 
nental  and  Nordic  European  countries:  In  these 
countries,  as  noted  by  Jude  Hays  (2009),  the 
majoritarian  impulse  to  impose  heavy  taxes  on  the 
rich  has  been  kept  in  check  by  multiple  veto  play¬ 
ers,  while  in  majoritarian  Anglo-Saxon  democra¬ 
cies,  it  dominates  tax  policy  making  during  periods 
of  leftist  government. 

Various  international  influences  shape  national 
tax  policy  making.  One  is  the  diffusion  of  tax 
policy  examples  and  ideas  across  national  borders. 
It  has  been  argued,  for  example,  that  the  highly 
visible  “market-conforming”  1986  U.S.  Tax 
Reform  Act  triggered  a  wave  of  reform  in  other 
OECD  countries,  which  emulated  the  U.S.  exam¬ 
ple  and  established  a  new,  “neoliberal”  tax  policy 
orthodoxy.  This  may  account  for  the  general  trend 
toward  low  statutory  tax  rates  and  broad  tax  bases 
in  corporate  and  individual  income  taxation. 
International  organizations  can  also  influence 
national  tax  policy.  The  European  Union  (EU) 
severely  constrains  the  freedom  of  its  member 
states  in  matters  of  tax  design  and  administration 


by  detailed  tax  harmonization  in  indirect  taxation 
and  through  the  case  law  of  the  European  Court  of 
Justice  in  direct  taxation.  The  influence  of  other 
international  organizations  is  more  selective  and 
less  obvious.  The  World  Trade  Organization  stipu¬ 
lates  international  norms  for  the  tax  treatment  of 
international  trade.  The  so-called  OECD  Model 
Tax  Convention,  which  serves  as  an  almost  univer¬ 
sally  accepted  blueprint  for  bilateral  double-tax- 
avoidance  treaties,  exerts  some  soft  harmonizing 
pressure  on  national  income  taxation  by  defining 
basic  tax  concepts  that  have  become  established 
features  of  national  taxation,  such  as  the  distinc¬ 
tion  between  active  and  passive  income  or  between 
subsidiaries  and  permanent  establishments.  Also, 
OECD  revenue  statistics,  by  defining  a  common 
classificatory  frame  for  the  comparison  of  national 
taxes,  exerts  subtle  pressure  to  design  national 
taxes  such  that  they  can  be  fitted  into  this  frame. 
Finally,  national  tax  policy  is  constrained  by  inter¬ 
national  tax  competition.  Tax  competition  has 
been  a  major  driver  behind  the  general  fall  in  cor¬ 
porate  tax  rates  since  the  1980s.  Corporate  tax 
revenues  remained  largely  unaffected  partly 
because  of  the  offsetting  effects  of  broadening  the 
tax  base.  But  the  corporate  tax  burden  shifted 
from  high-profit  firms,  which  benefit  most  from 
low  rates,  to  low-profit  firms,  which  suffer  most 
from  base  broadening.  Arguably,  individual 
income  tax  also  suffered  because  low  corporate 
tax  rates  increase  the  incentives  for  rich  individuals 
to  shift  income  into  the  corporate  sector  in  order 
to  protect  it  from  taxation  at  the  top  individual 
income  tax  rate.  Generally,  tax  competition  is 
more  constraining  for  small  states  than  for  large 
states:  They  suffer  more  in  terms  of  tax  base  flight 
if  they  try  to  maintain  high  corporate  tax  rates. 
However,  they  also  potentially  benefit  more  from 
undercutting  foreign  tax  rates.  Ireland  has  set 
aggressive  corporate  tax  rates  to  speed  up  national 
economic  development,  and  many  new  EU  mem¬ 
ber  states  in  Eastern  Europe  have  tried  to  emulate 
Ireland’s  apparent  success. 

Taxation  and  Economic  Growth 

Perhaps  surprisingly,  there  is  little  evidence  for  a 
negative  correlation  of  economic  growth  and  taxa¬ 
tion.  High-tax  states  do  not  systematically  grow 
more  slowly  than  low-tax  ones  in  the  OECD  area. 
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Most  observers  agree  that  a  crucial  key  to  explain¬ 
ing  what  Lindert  (2004)  called  the  “free-lunch 
puzzle”  is  that  high-tax,  high-spending  welfare 
states  in  Scandinavia  and  continental  Europe  have 
a  more  growth-friendly  tax  mix  than  the  relatively 
low-tax  Anglo-Saxon  states.  Some  highlight  the 
fact  that  the  former  group  of  states  relies  more 
heavily  on  regressive  taxes  on  labor  and  consump¬ 
tion  (social  security  contributions,  VAT,  and 
excises  on  alcohol,  tobacco,  and  fuel)  and  less  on 
progressive  income  taxes  than  the  latter.  As  Lindert 
notes,  a  more  regressive  tax  mix  is  said  to  promote 
growth  by  reducing  the  disincentive  effects  of  taxa¬ 
tion.  Progressive  taxes  purportedly  discourage 
extra  effort  by  subjecting  extra  income  to  a  higher 
tax  burden;  regressive  taxes  encourage  effort  by 
rewarding  extra  income  with  lower  effective  tax 
rates.  Others  note  that  progressive  taxes  may  also 
stimulate  growth.  A  progressive  tax  system  is 
praised  for  putting  money  into  the  hands  of  those 
most  likely  to  spend  it — namely,  the  poor. 
According  to  Achim  Kemmerling  (2009),  a  pro¬ 
gressive  tax  system  invigorates  the  labor  market  by 
facilitating  the  take-up  of  low-wage  work  and  by 
discouraging  wage  drift,  and  it  smoothes  dispos¬ 
able  income  over  the  business  cycle  and,  thus, 
helps  stabilize  the  macroeconomy.  From  this  per¬ 
spective,  continental  European  and  Scandinavian 
tax  mixes  appear  more  growth  friendly  than  the 
Anglo-Saxon  tax  mix,  not  because  they  are  more 
regressive  but  because  they  burden  capital  less. 
Steffen  Ganghof  (2006)  suggested  that  focusing 
taxation  on  labor  and  consumption  stimulates  sav¬ 
ings  and  investment  and,  therefore,  innovation  and 
employment. 

Taxation  and  Inequality 

Taxation  can  reduce  income  inequality  either 
directly,  by  a  progressive  rate  schedule  that  imposes 
a  higher  tax  burden — relatively  and  absolutely — 
on  the  rich  than  on  the  poor,  or  indirectly,  by  fund¬ 
ing  redistributive  social  expenditure.  According  to 
Lane  Kenworthy  (2008),  while  advanced  OECD 
countries  rely  on  both  strategies,  they  generally 
achieve  more  inequality  reduction  by  social  expen¬ 
diture  than  by  progressive  taxation.  The  redistribu¬ 
tive  effects  of  both  strategies  are  not  completely 
equivalent  though.  While  poverty  alleviation  calls 
for  social  expenditure  because  the  poor  have  little 


or  no  taxable  income  to  begin  with,  the  reduction 
of  high-end  income  inequality  calls  for  progressive 
taxation  because  the  rich  do  not  depend  on  social 
transfers  and  public  services.  The  combination  of 
progressive  taxation  and  redistributive  expendi¬ 
ture  is  made  difficult  by  economic  and  political 
trade-offs.  Redistribution  by  progressive  taxation 
requires  a  high  tax  burden  on  the  rich  and  also  a 
high  tax  burden  on  capital  because  the  rich  are 
likely  to  derive  a  large  share  of  their  income  from 
capital.  This  tends  to  increase  the  efficiency  costs 
of  taxation  in  terms  of  reduced  savings,  invest¬ 
ment,  and  work  effort,  and  to  alienate  the  rich  and 
parts  of  the  middle  class  from  the  welfare  state. 
This  may  explain  why  countries  relying  strongly 
on  redistribution  by  progressive  taxation,  such  as 
the  Elnited  States,  usually  allocate  less  funds  to 
social  expenditure.  It  may  also  explain  why  coun¬ 
tries  with  high  levels  of  social  expenditure,  such  as 
Sweden  and  Germany,  rely  strongly  on  regressive 
consumption  taxes  and/or  social  security  contribu¬ 
tions  and  have  a  low  income  tax  burden  on  capi¬ 
tal.  Harold  Wilensky  (2002)  suggested  that  these 
features  reduce  the  efficiency  costs  of  taxation  and 
keep  the  well-off  committed  to  the  welfare  state 
thus  allowing  for  higher  levels  of  taxation  and 
spending. 

Taxation  in  Less  Developed  Countries 

Developing  countries  rely  less  on  taxation  and  more 
on  nontax  sources  of  revenue — debt,  seigniorage 
(the  “inflation  tax”),  and  foreign  transfers — than 
advanced  OECD  countries.  This  is  reflected  in  a 
significantly  lower  level  of  total  taxation.  According 
to  some  estimates,  the  tax  take  of  an  average  devel¬ 
oping  country  has  stagnated  at  less  than  20%  since 
the  late  1960s.  The  tax  mix  of  developing  coun¬ 
tries  is  typically  geared  toward  indirect  taxation, 
especially  excises  and  taxes  on  international  trade, 
while  direct  taxes  on  income,  property,  wealth, 
corporate  profits,  or  capital  gains  typically  contrib¬ 
ute  little  to  total  revenues.  Various  potential 
explanatory  factors  have  been  discussed  for  the 
specific  tax  features  of  developing  countries.  Low 
levels  of  legitimacy  and  trust  in  state  institutions 
are  said  to  increase  tax  resistance,  constraining 
revenue-raising  potential.  High  levels  of  corruption 
purportedly  have  a  similar  effect.  There  is  empiri¬ 
cal  evidence  of  a  substitution  effect  as  people  do 
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not  like  to  pay  for  public  services  twice,  first  via 
taxes  and  then  again  via  bribes.  The  economic 
structure  can  also  have  a  constraining  effect.  High 
levels  of  agricultural  employment  and  a  large 
informal  sector  tend  to  be  negatively  associated 
with  tax  revenue,  whereas  industrialization  and  a 
concentration  of  economic  activity  in  resource 
extraction  tend  to  increase  tax  revenue.  Also, 
developing  countries  are  more  vulnerable  to  the 
pressures  of  tax  competition  and  have  purportedly 
suffered  revenue  losses  because  of  it.  Finally, 
administrative  problems  help  explain  the  strong 
focus  on  easy-to-monitor  tax  handles,  such  as  spe¬ 
cific  transactions  (excises  and  trade  taxes)  or  large 
corporations  (the  corporate  income  tax). 

Philipp  Genschel 
Jacobs  University  Bremen 
Bremen,  Germany 
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Territory 


Several  dimensions  of  politics  come  together  in  the 
word  territory:  land,  a  functional  factor  such  as 
communication  infrastructure,  and  a  symbolic  fac¬ 
tor  such  as  national  identity.  Typically  associated 
with  a  polity,  particularly  a  nation-state,  the  term 
can  also  be  applied  to  any  portion  of  space  referred 
to  otherwise  as  a  region,  locality,  or  place.  Sometimes, 
a  territory  is  an  area  awaiting  formal  incorporation 
into  an  adjacent  state,  as  in  the  case  of  Alaska  before 
it  became  one  of  the  U.S.  states.  In  general,  however, 
territory  is  particularly,  if  not  exclusively,  associated 
with  the  spatial  organization  of  the  modern  state 
with  its  claim  to  absolute  control  or  sovereignty 
over  a  population  within  carefully  defined  external 
borders.  Indeed,  until  Robert  D.  Sack  extended  the 
understanding  of  human  territoriality  as  a  strategy 
to  individuals  and  organizations  in  general,  use  of 
the  term  territory  was  largely  confined  to  the  spatial 
organization  of  states.  In  the  social  sciences,  such  as 
sociology  and  political  science,  this  is  still  the  case, 
such  that  the  challenge  posed  to  territory  by  net¬ 
worked  forms  of  organization  (typically  associated 
with  globalization)  is  invariably  characterized  in 
totalistic  terms  as  “the  end  of  geography.”  This  sig¬ 
nifies  the  extent  to  which  territory  has  become  the 
dominant  geographical  term  (and  imagination)  in 
the  social  sciences.  It  is  then  closely  allied  to  state 
sovereignty.  As  John  Agnew  (1994)  pointed  out, 
because  sovereignty  is  seen  to  “erode”  or  “unbun¬ 
dle,”  territory  is  also  assumed  to  behave  likewise. 
From  this  viewpoint,  territory  takes  on  an  epistemo¬ 
logical  centrality,  in  that  it  is  understood  as  abso¬ 
lutely  fundamental  to  modernity. 

The  territorial  nation-state  is  a  highly  specific 
historical  entity.  It  first  arose  in  Europe  in  the  1 6th 
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and  17th  centuries.  Since  that  time,  political  power 
has  often  been  seen  as  inherently  territorial. 
Politics  take  place  only  within  “the  institutions 
and  the  spatial  envelope  of  the  state  as  the  exclu¬ 
sive  governor  of  a  definite  territory.  We  also  iden¬ 
tify  political  territory  with  social  space,  perceiving 
countries  as  ‘state-societies’”  (Paul  Hirst,  2005, 
p.  27).  Much  interstate  conflict  is  about  competing 
territorial  claims.  The  process  of  state  formation 
has  always  had  two  crucial  attributes.  One  is 
exclusivity.  All  the  political  entities  (the  Roman 
Catholic  Church,  city-states,  etc.)  that  could  not 
achieve  a  reasonable  semblance  of  sovereignty 
over  a  contiguous  territory  have  been  steadily  dele- 
gitimized  as  major  political  actors.  The  second  is 
mutual  recognition.  The  power  of  states  has  rested 
to  a  considerable  extent  on  the  recognition  each 
state  receives  from  the  others  in  the  form  of  nonin¬ 
terference  in  its  “internal”  affairs.  Together,  these 
attributes  have  created  a  world  in  which  there  can 
be  no  territory  without  a  state.  In  this  way,  terri¬ 
tory  has  come  to  underpin  both  nationalism  and 
representative  democracy,  both  of  which  depend 
critically  on  restricting  political  membership  by 
homeland  and  address,  respectively. 

In  political  theory,  control  over  a  relatively 
modest  territory  has  long  been  seen  as  the  primary 
solution  to  the  security  dilemma:  to  offer  protec¬ 
tion  to  populations  from  the  threats  of  anarchy 
(disorder),  on  the  one  hand,  and  hierarchy  (distant 
rule  and  subordination),  on  the  other.  The  problem 
has  been  to  define  what  is  meant  by  “modest”  size. 
To  Baron  de  Montesquieu  (1949),  the  Enlighten¬ 
ment  philosopher,  different  size  territories  inevita¬ 
bly  have  different  political  forms:  “It  is,  therefore, 
the  natural  property  of  small  states  to  be  governed 
as  a  republic,  of  middling  ones  to  be  subject  to  a 
monarch,  and  of  large  empires  to  be  swayed  by  a 
despotic  prince”  (p.  122).  Early  modern  Europe 
offered  propitious  circumstances  for  the  emergence 
of  a  fragmented  political  system,  primarily  because 
of  its  topographical  divisions.  Montesquieu  further 
notes,  however,  that  popular  representation  allows 
for  the  territorial  extension  of  republican  govern¬ 
ment.  The  founders  of  the  United  States  added  to 
this  by  trying  to  balance  between  centralizing  cer¬ 
tain  security  functions,  on  the  one  hand,  and 
retaining  local  controls  over  many  other  functions, 
on  the  other.  The  recent  history  of  the  European 
Union  can  be  thought  of  in  similar  terms. 


Human  activities  in  the  world,  however,  have 
never  conformed  entirely  to  spaces  defined  by 
proximity  as  provided  by  territory.  Indeed,  and 
increasingly,  as  physical  distance  proves  less  of  a 
barrier  to  movement,  spatial  interaction  between 
separated  nodes  across  networks  is  an  important 
mechanism  of  geographical  sorting  and  differenti¬ 
ation.  Sometimes  posed  today  in  terms  of  a  world 
of  flows  versus  a  world  of  territories,  this  is  better 
thought  of  in  terms  of  territories  and/or  networks 
of  flows  rather  than  as  one  versus  the  other. 
Territories  and  networks  exist  relationally  rather 
than  mutually  exclusively.  If  territorial  regulation 
is  all  about  tying  flows  to  places,  territories  have 
never  been  zero-sum  entities  in  which  the  sharing 
of  power  or  the  existence  of  external  linkages 
totally  undermines  their  capacity  to  regulate.  If  at 
one  time  territorial  states  did  severely  limit  the 
local  powers  of  transterritorial  and  local  agencies, 
the  fact  that  this  is  no  longer  the  case  does  not 
signify  that  the  states  have  lost  all  of  their  powers: 

Territory  still  matters.  States  remain  the  most 
effective  governors  of  populations.  .  .  .  The 
powers  to  exclude,  to  tax,  and  to  define  political 
rights  are  those  over  which  states  acquired  a 
monopoly  in  the  seventeenth  century.  They 
remain  the  essentials  of  state  power  and  explain 
why  state  sovereignty  survives  today  and  why  it 
is  indispensable  to  the  international  order.  (Hirst, 
2005,  p.  45) 

Territoriality  is  the  strategic  use  of  territory  in 
either  the  organization  and  exercise  of  power, 
legitimate  or  otherwise,  over  blocs  of  space  or  the 
organization  of  people  and  things  into  discrete 
areas  through  the  use  of  boundaries.  In  studies  of 
animal  behavior,  spatial  division  into  territories  is 
seen  as  an  evolutionary  principle,  a  way  of  foster¬ 
ing  competition  so  that  those  best  matched  to  their 
territory  will  have  more  surviving  offspring.  With 
human  territoriality,  however,  spatial  division  is 
more  typically  thought  of  as  a  strategy  used  by 
organizations  and  groups  to  organize  social,  eco¬ 
nomic,  and  political  activities.  From  this  view¬ 
point,  space  is  partitioned  into  territorial  cells  or 
units  that  can  be  relatively  autonomous  (as  with 
the  division  of  global  space  into  territorial  nation¬ 
states)  or  arranged  hierarchically  from  basic  units 
in  which  work,  administration,  or  surveillance  is 
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carried  out  through  intermediate  levels  at  which 
managerial  or  supervisory  functions  are  located  to 
the  topmost  level  at  which  central  control  is  concen¬ 
trated.  Alternative  spatial  organizations  of  political 
and  economic  institutions,  particularly  hierarchical 
networks  (as  in  the  world-city  network)  or  reticular 
networks  (as  with  the  Internet),  can  challenge  or 
supplement  the  use  of  territoriality. 

Theoretically,  territory  can  be  seen  as  having  a 
number  of  different  socioeconomic  origins.  As 
Sack  notes,  these  include  the  following: 

f .  as  a  result  of  explicit  territorial  strategizing  or 
territoriality  to  devolve  administrative  functions 
but  maintain  central  control, 

2.  as  a  secondary  result  of  resolving  the  dilemmas 
facing  social  groups  in  delivering  public  goods 
(as  in  Michael  Mann’s  1984  sociology  of 
territory), 

3.  as  an  expedient  facilitating  coordination 
between  capitalists  who  are  otherwise  in 
competition  with  one  another  (as  in  Marxist 
theories  of  the  state), 

4.  as  the  focus  of  one  strategy  among  several  of 
governmentality  (as  in  Michel  Foucault’s 
writings),  and 

5.  as  a  result  of  defining  boundaries  between  social 
groups  to  identify  and  maintain  group  cohesion 
(as  in  the  writings  of  Georg  Simmel  and  Fredrik 
Barth  and  in  more  recent  sociological  theories 
of  political  identity  such  as  that  of  John 
Agnew). 

Whatever  its  origins,  territoriality  is  put  into  practice 
in  a  number  of  different,  if  often  complementary, 
ways:  by  popular  acceptance  of  classifications  of 
space  (e.g.,  “ours”  vs.  “yours”),  through  commu¬ 
nication  of  a  sense  of  place  (where  territorial 
markers  and  boundaries  evoke  meanings),  and  by 
enforcing  control  over  space  (by  surveillance, 
policing,  and  legitimation).  When  combined,  these 
practices  constitute  a  political  hegemony  exercised 
territorially. 

In  a  distinctive  formulation,  that  of  Gilles 
Deleuze  and  Felix  Guattari,  the  term  deterritorial- 
ization  has  been  used  to  refer  to  the  breakdown  of 
territorial  organization.  Epistemologically  juxta¬ 
posing  “state  philosophy”  with  “nomad  thought,” 


Deleuze  and  Guattari  associate  territorialization 
with  the  former  and  deterritorialization  with  the 
latter.  Ontologically,  however,  there  could  be 
quite  different  territorial  systems  at  play  over  time 
as,  for  example,  with  pre-  and  postcolonial  contact 
between  colonizers  and  natives  leading  to  the 
breakdown  of  one  system  followed  by  a  period  of 
deterritorialization  before  the  imposition  of  a  new 
territorial  organization.  Crucial  to  the  concept  of 
deterritorialization  of  Deleuze  and  Guattari  is  the 
claim  that  “processes  are  becomings,  and  aren’t  to 
be  judged  by  some  final  result  but  by  the  way  they 
proceed  and  their  power  to  continue,  as  with  ani¬ 
mal  becomings,  or  nonsubjective  individuations” 
(Deleuze,  1995,  p.  146). 

More  typically,  however,  two  other  approaches 
tend  to  dominate  most  thinking  about  the  contem¬ 
porary  erosion  of  territorial  organization.  In  the 
first  case,  territory  is  posed  as  a  physical  base  for 
economic  activities,  and  deterritorialization  is 
viewed  as  either  the  lessening  importance  of  local 
constraints  or  the  weakening  of  the  impact  of 
physical  distance  on  economic  transactions.  Such 
ideas  are  part  and  parcel  of  much  discussion  of 
economic  globalization.  In  the  second  case,  terri¬ 
tory  is  perceived  as  a  declining  spatial  assemblage 
of  power  relations  and  identity  strategies.  From 
this  perspective,  ideas  such  as  the  “end  of  terri¬ 
tory”  and  the  rise  of  network  space  are  linked  to 
the  recent  onset  of  a  worldwide  deterritorializa- 
tion.  This  camp  tends  to  see  deterritorialization  in 
terms  of  the  overall  weakening  of  territorial  identi¬ 
ties  in  the  face  of  globalization  and  the  prolifera¬ 
tion  of  “nonplaces”  such  as  airports  and  fast-food 
joints. 

In  summary,  there  have  been  important  histori¬ 
cal  dimensions  to  and  philosophical  disputes  about 
territory.  It  takes  various  forms,  and  they  have 
waxed  and  waned  in  relative  significance.  Overall, 
churches  and  polities  (states,  empires,  federations, 
etc.)  have  been  the  most  important  users  of  terri¬ 
tory.  Some  churches  (such  as  the  Roman  Catholic 
Church)  and  some  states  (such  as  the  United 
States)  have  more  complex  and  formally  hierarchi¬ 
cal  territories  than  do  others.  Today,  transnational 
and  global  businesses  often  erect  territorial  hierar¬ 
chies  that  cut  across  existing  state-based  ones.  So 
even  as  some  uses  of  territory  fade  away,  others 
emerge.  State-based  territory  may  undergo  a 
renaissance  in  the  face  of  recent  challenges  to  the 
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sustainability  of  financial  globalization.  Therefore, 
though  varying  in  precise  form  and  complexity, 
territory  seems  still  to  be  very  much  with  us  in  one 
form  or  another. 

John  Agnew 
University  of  California,  Los  Angeles 
Los  Angeles,  California,  United  States 
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“International  terrorism”  is  a  contested  term.  In 
popular  usage,  it  is  typically  taken  to  refer  to 
groups  that  use  “terrorist”  tactics,  such  as  target¬ 
ing  civilians,  across  national  borders.  A  contem¬ 
porary  example  of  “international  terrorism”  is  the 
transnational  A1  Qaeda  network.  But  the  term  has 
been  used  to  describe  phenomena  as  diverse  as  the 
19th-century  anarchist  movement,  the  Palestine 
Liberation  Organization  in  the  1970s  and  1980s, 
and  the  various  Marxist  groups  operating  in 
Europe  and  elsewhere  during  the  1970s,  such  as 
the  Italian  Red  Brigades. 


Both  parts  of  the  term  are  contested.  The  term 
“terrorism”  continues  to  elicit  passionate  debate, 
both  in  terms  of  how  it  should  be  defined  (and 
whether  it  constitutes  a  separate  category  of  vio¬ 
lence)  and  how  it  is  used.  Definitions  range  from 
those  encompassing  most  forms  of  political  vio¬ 
lence  (rendering  it  indistinguishable  from  other 
forms  of  warfare)  to  those  encompassing  most 
oppositional  activism  (rendering  it  indistinguish¬ 
able  from  nonviolent  activism).  A  more  precise  set 
of  definitions  defines  it  with  reference  to  the  type 
of  target  selected  (usually  noncombatants)  and  to 
the  way  in  which  violence  is  used  to  inculcate  fear 
in  a  wider  audience  than  those  actually  targeted,  to 
achieve  political  change.  Disagreements  persist, 
though,  over  who  can  be  considered  a  “noncom¬ 
batant”  and  whether  one  can  generalize  such  a 
definition,  given  that  the  very  notion  of  “noncom¬ 
batant”  is  intertwined  with  the  development  of  the 
modern  state  and  warfare  in  Europe,  while  bound¬ 
aries  between  combatants  and  civilians  are  blurred 
by  practices  such  as  conscription  and  aerial  and 
asymmetric  warfare. 

Similarly,  controversy  continues  to  rage  over 
how  the  term  is  applied.  Acts  of  violence  against 
military  personnel,  for  instance,  are  regularly 
labeled  “terrorist,”  while  state  actors  engaging  in 
“terrorist”  tactics  (as  defined  above)  are  typically 
not  labeled  thus — unless  they  are  considered 
“rogue”  states.  Definitions  are  routinely  used 
beyond  their  analytical  reach  to  describe  any  acts 
by  actors  who  have  been  labeled  “terrorist”  by 
state  elites.  What  or  who  is  considered  “terrorist” 
is  thus  profoundly  shaped  by  political  context. 
Before  terrorism  became  a  widely  accepted  term, 
acts  of  violence  in  1970s  Italy,  for  instance,  were 
described  as  bombings  or  kidnappings.  Only  in 
retrospect,  when  the  term  had  gained  international 
credence  in  the  context  of  the  Cold  War,  were 
these  acts  labeled  “terrorist.”  The  term  is,  more¬ 
over,  used  by  governments  and  nonstate  actors 
alike  to  delegitimize  opponents  and  justify  extraor¬ 
dinary  measures  against  them.  This  has  led  some, 
particularly  critical  terrorism  scholars,  to  argue 
that  the  term  is  a  political-normative  tool,  rather 
than  a  useful  analytical  concept. 

The  “international”  part  of  the  term  is  also  con¬ 
tested.  Some  reserve  the  term  “international  terror¬ 
ism”  for  the  acts  of  those  whose  goals,  membership, 
and  operations  cross  national  boundaries.  Others 
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apply  it  to  the  acts  of  any  group  operating  in  more 
than  one  state,  even  if  goals  and  membership  are 
limited  to  one  state.  Yet  others  use  the  term  inter¬ 
changeably  with  “terrorism,”  particularly  in  the 
context  of  the  global  war  on  terror  where  “terror¬ 
ism”  has  become  synonymous  with  opposition  to 
the  international  order,  regardless  of  one’s  aims. 
The  U.S.  State  Department,  for  instance,  refers  to 
all  organizations  it  has  labeled  “terrorist”  as 
indulging  in  “global  terrorism.”  Intriguingly,  few 
use  “international  terrorism”  to  describe  acts  of 
state  terrorism  across  borders,  even  though  within 
the  discipline  of  international  politics,  the  term 
“international”  is  usually  reserved  for  relation¬ 
ships  between  states  (as  opposed  to  “transna¬ 
tional,”  which  refers  to  nonstate  actors  across 
borders).  Those  who  recognize  this  distinction 
tend  to  use  the  term  “transnational  terrorism”  to 
refer  to  nonstate  terrorism  across  boundaries. 

The  “new  terrorism”  debate  has  added  further 
fuel  to  these  definitional  controversies.  According 
to  a  number  of  scholars,  a  “new  terrorism”  has 
emerged  since  the  1990s,  which  is  transnational, 
religiously  inspired,  organizationally  decentralized, 
and  more  lethal,  radical,  and  uncompromising.  A1 
Qaeda  is  the  primary  example,  but  other  groups, 
mostly  Islamist,  are  also  cited.  Critics  of  this  trend 
have  pointed  out  that  many  aspects  of  this  so-called 
new  terrorism  can  be  found  in  examples  of  “old 
terrorism”  and  that  by  calling  this  type  of  violence 
“new,”  insights  from  the  “old  terrorism”  literature 
and  from  past  conflicts  are  ignored.  Moreover,  by 
linking  the  “new”  to  the  ostensibly  “religious” 
motivations  of  these  groups,  the  “new  terrorism” 
literature  overemphasizes  the  religious  aspect  of 
these  groups  while  downplaying  political  and 
socioeconomic  dynamics.  It  also  downplays  the 
vast  differences  between  groups  that  draw  their 
inspiration  from  religion.  Hamas,  for  instance, 
draws  its  inspiration  from  Palestinian  nationalism 
as  well  as  religion  and  has  shown  more  willingness 
to  compromise  and  practice  restraint  than  A1 
Qaeda. 

Because  there  is  little  agreement  over  what  con¬ 
stitutes  “terrorism,”  let  alone  “international  terror¬ 
ism,”  debates  over  what  drives  this  type  of  violence 
are  problematic.  Most  “terrorism  databases,”  for 
instance,  exclude  nonstate  violence  in  sub-Saharan 
Africa  because  this  is  usually  classified  as  “civil 
war” — with  significant  implications  for  analyses  of 


the  relationship  between  “terrorism”  and  “pov¬ 
erty”  or  the  availability  of  mineral  resources.  Sim¬ 
ilarly,  few  databases  include  acts  of  state  terror,  and 
most  do  not  distinguish  between  violence  against 
noncombatants  and  violence  against  combatants,  if 
the  perpetrators  are  classified  as  “terrorist.” 

Contributing  Factors 

With  this  caveat,  numerous  factors  have  been  cited 
as  facilitating  “terrorism.”  Political  exclusion  and 
repression  have  been  found  to  play  a  role  in  many 
instances  (e.g.,  Algeria,  Egypt,  Colombia),  but  so 
has  (rapid)  political  liberalization  and  an  increase 
in  political  opportunities  (e.g.,  Indonesia,  Colombia, 
Spain).  Ethnic  and  religious  cleavages  can  contrib¬ 
ute  to  violence,  especially  when  overlapping  with 
socioeconomic  cleavages  or  with  interethnic  occu¬ 
pation  or  repression.  Modernization — and  in  par¬ 
ticular  urbanization,  the  expansion  of  education, 
the  weakening  of  traditional  power  structures,  and 
unequal  development — has  been  a  factor,  but  so 
has  lack  of  development  and  recession  (especially 
following  economic  growth).  Recession  played  a 
part  in  the  slide  of  Algeria  into  civil  war  in  the 
1990s  and  in  the  outbreak  of  the  Palestinian 
Intifada  in  the  late  1980s  (both  of  which  conflicts 
included  “terrorist”  acts  as  defined  above).  The 
weakening  of  traditional  power  structures  and 
expansion  of  the  university  sector,  coupled  to 
unequal  development,  facilitated  the  development 
of,  among  others,  the  Provisional  Irish  Republican 
Army  (IRA)  in  Northern  Ireland,  Hizbollah  (also 
known  as  Hezbollah)  in  Lebanon,  and  the  Tamil 
Tigers  in  Sri  Lanka.  But  without  state  violence  and 
discriminatory  policies  these  socioeconomic  factors 
would  not  have  been  sufficient  to  trigger  a  violent 
response. 

Transnational  safe  havens  and  support  from 
third-party  states  have  played  a  part  in  a  number 
of  conflicts,  as  have — facilitated  by  the  processes 
of  globalization — the  presence  of  transnational 
diasporas  or  criminal  networks  (such  as  those 
linked  to  the  blood  diamond  trade).  The  Tamil 
Tigers,  for  instance,  were  strengthened  by  support 
from  elements  in  the  Tamil  diaspora.  The  Basque 
nationalist  ETA  benefited  from  safe  havens  across 
the  Spanish  border  in  France,  while  Lebanon’s 
Hizbollah  has  been  profoundly  shaped  by  Iran’s 
support.  A1  Qaeda’s  transnational  character  is 


Terrorism,  International  2593 


similarly  in  part  a  product  of  the  transnational 
networks  of  volunteers  arriving  in  Afghanistan 
during  the  1980s,  just  as  it  is  a  product  of  global¬ 
ization  and  its  effect  on  resources,  a  sense  of  self, 
culture,  and  territory.  But  it  is  important  not  to 
overstate  the  significance  of  transnational  factors 
because  this  can  obscure  the,  often,  very  local  rea¬ 
sons  for  political  violence.  The  Tamil  Tigers’ 
struggle  was  rooted  in  deeply  felt  grievances  over 
the  marginalization  of  Tamils  by  the  Sri  Lankan 
government,  just  as  Hizbollah’s  emergence  was  a 
response  to  Israel’s  1982  Lebanon  invasion  and 
local  Shi’i  grievances  against  the  Lebanese  state. 
Even  A1  Qaeda  has  roots  in  local  struggles  across 
the  Middle  East  and  Asia  between  secular  state 
elites,  reformist  Islamists,  and  militant  Islamists. 

Macrofactors  alone  cannot  explain  “terror¬ 
ism.”  Modernization,  political  exclusion,  and 
inequality  have  not  everywhere  given  rise  to  politi¬ 
cal  violence.  Political  liberalization  in  Morocco  did 
not  usher  in  the  level  of  violence  seen  in  neighbor¬ 
ing  Algeria.  This  underlines  the  importance  of 
complementing  macro  with  micro-  and  mesolevel 
explanations.  What  type  of  organization  emerges, 
how  many  competing  groups  materialize  within  a 
particular  social  movement  sector,  what  resources 
are  available,  how  macro  and  micro  events  are 
interpreted — all  these  factors  influence  whether  or 
not  violence  ensues. 

Research  on  what  factors  facilitate  “interna¬ 
tional  state  terrorism”  or  state  sponsorship  of 
nonstate  “terrorism”  (beyond  that  carried  out  by 
so-called  rogue  states)  is  still  in  its  infancy  even 
though  the  scale  of  violence  emanating  from  states 
far  outstrips  that  of  its  nonstate  equivalent.  There 
is  a  nascent  literature  on  the  role  of  Western  states 
in  supporting  state  repression  or  nonstate  “terror¬ 
ism  in  the  Global  South.”  But  there  is  nothing  (yet) 
resembling  the  amount  of  research  that  has  been 
done  on  nonstate  “transnational  terrorism.” 

International  Coordination  of  Responses 

Until  the  attacks  on  New  York’s  World  Trade 
Center  and  the  Pentagon  on  September  11,  2001, 
most  responses  to  “transnational  terrorism” 
remained  within  the  national  sphere,  limited 
cooperation  notwithstanding.  Since  2001  and  the 
declaration  of  a  Global  War  on  Terror,  responses 
have  become  more  coordinated  internationally, 


facilitated  by  the  broader  processes  of  globaliza¬ 
tion.  The  establishment  of  the  international 
Financial  Action  Task  Force,  which  has  coordinated 
the  freezing  of  financial  assets  of  internationally 
designated  “terrorist”  groups  and  individuals,  is  a 
case  in  point,  as  is  the  involvement  of  an  interna¬ 
tional  alliance  in  the  fight  against  A1  Qaeda  and 
the  Taliban  in  Afghanistan.  Cooperation  between 
states  has  increased  on  practices  such  as  extradi¬ 
tion,  the  sharing  of  intelligence  data,  rendition 
(transporting  suspects  to  states  that  allow  tor¬ 
ture),  and  replicating  deradicalization  models, 
such  as  the  rehabilitation  programs  for  prisoners 
accused  of  “terrorism.” 

The  notion  of  “international  terrorism”  and  the 
term’s  malleability  has  arguably  helped  state  elites 
to  forge  a  united  response  to  a  plethora  of  different 
threats,  both  local  and  international.  Critics  have 
argued  that  such  an  approach  risks  downplaying 
local  differences,  encouraging  a  homogeneous 
counterterrorism  model  that  advances  a  hegemonic 
globalization  (seen  as  part  of  the  problem)  against 
all  dissenters,  and  privileging  military  over  non¬ 
military  measures,  such  as  the  controversial  inva¬ 
sion  of  Iraq.  The  notion  of  an  all-threatening 
“international  terrorism”  has,  according  to  its  crit¬ 
ics,  served  to  increase  state  power  while  weakening 
mechanisms  of  accountability,  facilitating  the  birth 
of  a  surveillance  society,  particularly  affecting 
those  believed  to  be  “at  risk”  from  radicalization 
(so-called  suspect  communities).  Against  this,  pro¬ 
tagonists  point  to  the  success  in  foiling  “terrorist” 
plots  through  increased  international  cooperation, 
including  cooperation  with  previously  uncoopera¬ 
tive  states,  and  to  the  development  of  nonmilitary 
responses,  such  as  rehabilitation  programs,  eco¬ 
nomic  development,  and  the  strengthening  of  “at 
risk”  communities. 
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Terrorist  Groups 


Defining  a  terrorist  group  is  as  difficult  as  defining 
terrorism,  which  is  a  highly  contested  concept 
politically.  There  is  no  universally  accepted,  legally 
binding  definition  of  the  term,  and  national  legis¬ 
lations  or  international  organizations  attach  dif¬ 
ferent  meanings  to  the  concept.  As  a  consequence, 
the  United  Nations,  for  example — in  13  conven¬ 
tions  and  protocols  adopted  since  1963 — confined 
itself  to  defining  and  proscribing  specific  acts  of 
terrorism  (such  as  skyjacking,  hostage  taking,  and 
nuclear  terrorism).  A  review  of  the  literature  by 
Alex  Schmid  produced  more  than  100  different 
definitions  some  of  which  diverge  strongly. 
However,  there  are  commonly  cited  criteria,  such 
as  the  use  of  violent  means,  the  political  motiva¬ 
tion  of  the  perpetrators,  and  the  intention  to 
create  panic  and  fear  among  segments  of  the  popu¬ 
lation  or  the  population  as  a  whole.  In  the  follow¬ 
ing,  the  major  characteristics,  forms  of  action,  and 
types  of  such  groups  are  discussed  in  light  of  a 
number  of  current  explanatory  approaches. 

Against  this  background,  terrorist  groups  or 
organizations  can  be  described  as  actors  who  use 
primarily  terrorist  tactics  and  means  to  pursue 
their  political  goals.  This  does  not  exclude  the  use 
of  other  violent  or  nonviolent  means.  Most  groups 
labeled  terrorists  are,  therefore,  multifaceted:  For 
example,  they  may  also  conduct  guerrilla-style 
operations;  they  may  act  as  political  movements  or 
parties  by  instigating  a  political  wing;  they  may 
work  as  business  actors  through  the  establishment 
of  companies  and  fund-raising  organizations;  and 
they  may  serve  as  social  welfare  organizations  by 
delivering  services  to  particular  segments  of  the 
population.  Thus,  terrorist  groups  are  usually  not 


only  interested  in  pursuing  terrorist  activities  but 
in  a  whole  range  of  different  issues.  Nevertheless, 
the  frequent  or  occasional  use  of  terrorism  remains 
a  key  characteristic  of  these  groups,  which  has 
severe  implications  for  the  recruitment  and  train¬ 
ing  of  activists;  internal  organization;  operational 
planning  and  logistics;  methods  of  internal  and 
external  communication;  transfer  and  channeling 
of  money,  weapons,  and  people;  and  leadership 
and  their  opportunities  to  act  and  maneuver.  Most 
of  this  is  simply  because  terrorism  is  a  tactic  that 
attacks  people  and  institutions  from  underground. 
“Going  underground”  and  acting  from  the  hidden, 
however,  has  material  and  political  costs,  which 
affect  the  entire  organization,  their  followers,  and 
supporters. 

Distinguishing  Terrorist  Groups  From 
Other  Phenomena  of  Violence 

Terrorist  groups — or  more  precisely  terrorist  cells, 
squads,  and  commands — should  analytically  be 
distinguished  from  other  phenomena  of  violence. 
First,  terrorist  groups  are  groups  in  the  sense  that 
they  comprise  a  number  of  members  who  are 
bound  together  by  some  kind  of  organizational 
structure,  which  exists  over  a  certain  period  of 
time.  According  to  some  national  legislations  (e.g., 
that  in  Germany),  a  minimum  of  three  persons  is 
required  in  an  organization  to  speak  about  it  as  a 
“terrorist  organization.”  This  implies  that  violent 
acts  by  single  individuals  (gunmen,  killers,  snipers, 
homicide,  etc.)  do  not  fall  into  that  category.  Second, 
because  of  their  political  ambitions,  however  defined, 
terrorist  groups  differ  from  primarily  criminally  or 
economically  motivated  organizations — such  as 
criminal  gangs,  mafia  structures,  syndicates,  war¬ 
lords,  or  mercenaries.  This  distinction  does  not 
exclude  the  fact  that  individual  members  of  terror¬ 
ist  organizations  may  be  economically  motivated; 
indeed,  sometimes,  terrorist  groups  may  exploit 
the  socioeconomic  needs  of  particular  persons  or 
segments  of  the  society  for  their  own  ends. 
Moreover,  terrorist  groups  often  undertake  crimi¬ 
nal  activities  (e.g.,  drug  trafficking) — or  coop¬ 
erate  with  criminals  (e.g.,  to  get  weapons  and 
explosives) — to  finance  and  launch  terrorist  opera¬ 
tions.  Therefore,  a  terrorist-criminal  nexus  usually 
exists.  But  the  key  question  still  remains  as  to 
whether  the  economic  and  criminal  activities  are 
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ends  in  themselves  or  whether  they  are  instruments 
to  further  a  political  and  ideological  agenda. 
Third,  terrorist  groups  should  be  distinguished 
from  other  politically  motivated  actors,  in  particu¬ 
lar  from  rebel  or  guerrilla  groups,  militias,  and 
paramilitary  organizations.  Rebel  movements  in 
general  aim  at  the  conquest  of  and — if  possible — 
permanent  control  over  territory.  Terrorist  groups, 
on  the  other  hand,  might  have  territorial  ambi¬ 
tions;  however,  they  simply  lack  the  capabilities  to 
conquer  and  control  larger  territories.  In  addition, 
while  rebels  usually  employ  physical  violence  in 
the  context  of  unconventional  warfare  (insurgen¬ 
cies)  to  diminish  their  opponent’s  military  clout,  to 
defeat  the  enemy,  or  force  it  to  surrender,  terrorist 
groups  are  mainly  interested  in  the  psychological 
effects  of  violent  acts.  In  the  real  world,  it  may  still 
be  difficult  to  uphold  these  ideal-type  distinctions 
as  some  groups  may  transform  from  one  type  of 
actor  to  another  in  the  course  of  a  conflict,  while 
others  have  to  be  seen  as  hybrid  organizations, 
which  incorporate  features  of  both  rebel  or  militia 
and  terrorist  groups  (e.g.,  the  Tamil  Tigers  in  Sri 
Lanka  and  Hizbollah  [also  known  as  Hezbollah] 
in  Lebanon). 

Fourth,  terrorist  groups  are  nonstate  actors  and 
should  not  be  equated  with  state  institutions 
responsible  for  acts  of  state  terror.  Phenomena  of 
state  terror  have  variously  been  subsumed  under 
the  specific  terms  of  war  crimes ,  crimes  against 
humanity ,  torture ,  extralegal  killings,  human  rights 
abuses,  and  genocide — labels  that  are  much  more 
adequate  than  the  concept  of  terrorism.  State  ter¬ 
ror  (tyranny,  at  worst)  and  terrorist  groups  differ 
fundamentally  in  that  the  former  is  oriented 
toward  keeping  and  consolidating  the  political 
status  quo,  while  nonstate  actors  use  terrorist  tac¬ 
tics  to  change  the  situation  and,  finally,  to  over¬ 
throw  a  political  regime.  Moreover,  state  actors  or 
those  acting  on  behalf  of  a  government  do  not 
typically  operate  covertly  but  seek  visibility  to 
intimidate  either  the  population  at  large  or  partic¬ 
ular  groups  (such  as  opposition  groups  or  minori¬ 
ties).  Furthermore,  terrorist  attacks  can  be  the 
work  of  a  few  armed  individuals,  while  state  terror 
requires  a  relatively  powerful  security  and  intelli¬ 
gence  apparatus.  There  is,  however,  a  gray  area  of 
state-sponsored  terrorism,  which  implies  state  tol¬ 
eration  of,  support  for,  or  direction  of  terrorist 
groups.  Thereby,  state  actors  strategically  sponsor 


terrorist  groups  in  pursuit  of  their  own  agenda.  In 
a  few  cases,  they  may  even  actively  promote  the 
founding  of  terrorist  organizations  or  may  be 
directly  involved  in  terrorist  operations,  thus  blur¬ 
ring  the  line  between  state  and  nonstate  actors. 

Rationale  and  Characteristics 
of  Terrorist  Groups 

Despite  the  fact  that  terrorist  groups  differ  largely 
in  size,  organizational  structure,  ideology,  political 
goals,  and  support,  their  rationale  has  remained 
basically  unaltered  since  the  beginning  of  modern 
terrorism  during  the  19th  century,  even  though 
terrorist  methods  have  changed  over  time.  First  of 
all,  terrorist  groups — even  the  largest  ones — are 
relatively  “weak”  actors  compared  with  regular 
armies  or  security  forces.  Terrorist  tactics,  such  as 
car  or  truck  bombs,  suicide  bombings,  targeted 
assassinations,  kidnapping,  and  hostage  taking, 
are  therefore  used  because  of  lack  of  alternatives. 
Terrorist  groups  use  this  kind  of  violence — 
frequently  against  targets  of  high  symbolic  value — 
not  only  to  shock  and  intimidate  particular  segments 
or  the  society  at  large  but  also  to  mobilize  sym¬ 
pathizers  and  supporters  as  well  as  to  contribute 
to  the  radicalization  of  political  movements. 
They  see  themselves  as  the  vanguard  acting  on 
behalf  of  “oppressed,”  “humiliated,”  and  “mar¬ 
ginalized”  groups,  thus,  adopting  a  sense  of 
moral  superiority,  which  serves  to  justify  their 
immoral  and  illegal  acts  in  their  eyes  and  that  of 
others.  In  this  sense,  terrorism  is  a  “communica¬ 
tion  strategy”  conveying  political  messages  to 
friends  and  foes  alike. 

In  addition  to  the  operational  level,  terrorist 
groups  also  rely  on  the  spread  of  propaganda  in  the 
form  of  claims  of  responsibility  after  attacks,  the 
launch  of  warnings,  and  political  declarations, 
which  nowadays  are  usually  disseminated  on  the 
Internet  or  on  videotapes.  For  this  purpose,  some 
larger  groups  even  have  their  own  media  depart¬ 
ments  and  ways  of  transmitting  their  messages. 
Propaganda  aims  not  only  at  conveying  signals  but 
also  at  purposeful  deception  and  disinformation;  it 
provides  terrorist  groups  with  a  platform  that 
makes  them  appear  “bigger”  and  “more  power¬ 
ful” — an  effect  that  an  isolated  terrorist  attack  can 
hardly  produce.  Terrorism  would  simply  not  work 
without  publicity.  Yet  terrorist  attacks  and  terrorist 
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propaganda  in  and  of  themselves  do  not  necessar¬ 
ily  produce  the  desired  mobilization  of  supporters. 
Rather,  mobilization  is  achieved  through  the  reac¬ 
tion  that  an  attack  provokes  among  those  against 
whom  it  is  directed.  Attacks  are  aimed  at  prompt¬ 
ing  the  opponent  to  adopt  brutal  and  dispropor¬ 
tionate  measures,  which  will — as  terrorist  groups 
frame  it — “delegitimize”  and  “unmask”  him.  For 
terrorist  groups,  the  objective  is  to  convey  the 
impression  that  the  target  of  their  attacks  (e.g.,  a 
government)  is  the  actual  aggressor,  while  the 
attack  itself  is  justified  as  a  means  of  legitimate 
self-defense  and  self-assertion. 

Terrorist  groups  employ  an  action-reaction  spi¬ 
ral,  which  allows  them  to  swap  roles  and  portray 
themselves  as  the  actual  victims.  As  many  observ¬ 
ers  note,  there  is  an  asymmetrical  relation  between 
terrorist  groups  and  the  state’s  security  apparatus: 
On  the  one  hand,  terrorists  are  typically  inferior  to 
the  state’s  police  and  military  potential — logisti- 
cally  but  also  in  terms  of  manpower.  On  the  other 
hand,  they  operate  underground  and,  thus,  have 
the  element  of  surprise  on  their  side.  It  is  enough 
for  them  to  succeed  every  now  and  then  to  demon¬ 
strate  that  even  comprehensive  and  cost-intensive 
security  measures  by  the  state  cannot  offer  com¬ 
plete  security  for  the  population.  Moreover,  many 
terrorist  groups  have  a  time  horizon  that  is  differ¬ 
ent  from  that  of  their  counterparts;  they  see 
themselves  as  being  on  a  historical  mission  and, 
therefore,  think  long  term;  they  do  not  aim  at 
quick  victories  but  believe  that  “history”  or  “des¬ 
tiny”  will  be  on  their  side  and  that  in  the  long  run, 
they  will  prevail. 

These  strategic  advantages  do  not  come  without 
costs,  however.  Terrorist  groups  are  forced  to  be 
innovative — on  the  one  hand,  because  they  are 
usually  constantly  on  the  run  and,  on  the  other 
hand,  because  they  have  to  do  the  unexpected  to 
remain  successful.  They  have  to  be  unpredictable; 
they  must  avoid  adopting  behavioral  patterns  and 
must  therefore  change  their  methods  from  time  to 
time.  By  the  same  token,  this  increases  the  risk  of 
failure.  Not  every  terrorist  organization  is  able  to 
be  innovative  in  the  long  run  due  to  the  lack  of 
resources,  adequate  leadership,  or  a  favorable 
environment.  Terrorist  groups  face  a  structural 
dilemma:  On  the  one  hand,  they  have  to  imple¬ 
ment  considerable  precautionary  measures  to 
avoid  being  discovered — measures  that,  however, 


limit  their  freedom  of  movement  and  their  opera¬ 
tional  capacity.  On  the  other  hand,  their  political 
visibility  and  influence  crucially  depend  on  their 
ability  to  launch  successful  and  surprising  attacks. 
But  each  plot,  in  turn,  inevitably  provides  investi¬ 
gators  with  information  about  the  group,  their 
logistics,  and  their  operational  patterns.  A  further 
difficulty  for  terrorist  groups  lies  in  securing  their 
financial  and  logistical  base  for  a  longer  period  of 
time.  To  some  extent,  actors  are  preoccupied  with 
sustaining  themselves  and  organizing  their  sur¬ 
vival.  Doing  this  from  the  underground  and  with 
discreet  support  from  outside  does  have  a  number 
of  implications  for  the  internal  organization  and 
the  cooperation  with  others.  In  many  instances,  for 
example,  terrorists  need  to  cooperate  with  orga¬ 
nized  crime  groups  in  acquiring  weapons,  launder¬ 
ing  money,  or  trafficking  illegal  goods.  In  other 
cases,  terrorist  groups  undertake  legal  or  shadowy 
business  activities;  they  are  involved  in  fund-raising 
or  in  collecting  a  “revolution  tax.”  A  critical  junc¬ 
ture  is  when  the  means  become  an  end  in  them¬ 
selves,  and  the  self-sustaining  aspect  of  a  group 
becomes  more  important  than  the  actual  political 
ambitions.  Then,  terrorist  groups  or,  at  least,  cer¬ 
tain  segments  of  that  group,  simply  degenerate 
into  criminal  or  money-making  organizations — 
such  as  the  Revolutionary  Armed  Forces  of 
Colombia  (Fuerzas  Armadas  Revolucionarias  de 
Colombia,  FARC)  and  the  IRA — which  no  longer 
care  about  mobilizing  larger  parts  of  the  popula¬ 
tion  for  political  ends. 

Over  time,  the  conspiratorial  character  of  terror¬ 
ist  groups  often  leads  to  psychological  defects — 
mainly  due  to  group-think  as  well  as  to  strict  black- 
and-white  thinking.  The  groups  tend  to  create  their 
own  reality,  perceiving  and  interpreting  every  event 
and  all  information  according  to  their  own  world¬ 
views.  From  their  perspective,  each  setback  or  “fail¬ 
ure”  is  merely  an  “insignificant  episode”  on  the 
long-term  path  to  “victory.”  Terrorist  leaders  gen¬ 
erally  disregard  contradictory  developments  and  the 
possibility  of  pursuing  alternative  strategies.  This 
development  culminates  in  terrorists  feeling  reas¬ 
sured  in  their  behavior,  irrespective  of  what  is  hap¬ 
pening  around  them.  Progressive  loss  of  reality 
frequently  goes  hand  in  hand  with  increasing  radical- 
ization  and  brutalization  in  the  use  of  violence.  The 
latter  may  also  be  directed  against  group  members  or 
supporters.  Doubts,  dissent,  or  even  compromise 
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with  the  opponents  is  usually  not  tolerated  because 
they  jeopardize  the  group’s  internal  coherence. 

Numbers  and  Types 

Since  the  labeling  of  militant  groups  as  terrorists  is 
loaded  with  political  considerations  and  termino¬ 
logical  problems,  there  can  be  no  authoritative  list 
of  terrorist  groups.  The  most  comprehensive  set  of 
data  is  provided  by  the  Global  Terrorism  Database 
(National  Consortium  for  the  Study  of  Terrorism 
and  Reponses  to  Terrorism  [START]  at  the 
University  of  Maryland).  The  database  contains 
information  about  more  than  80,000  terrorist 
attacks  beginning  in  1970  and  conducted  by  more 
than  2,000  squads,  commands,  or  groups  with 
distinct  names,  ranging  from  very  small  fringe 
groups  to  large,  full-fledged  insurgencies.  These 
data,  however,  just  indicate  how  many  groups  may 
use  inter  alia  terrorist  tactics,  but  it  does  not  say 
anything  about  a  consistent  pattern  of  behavior, 
which  would  justify  the  term  terrorist  group  and 
which  would  enable  analysts  to  distinguish  them 
from  other  armed  nonstate  actors.  The  same  prob¬ 
lem  applies  to  the  comprehensive  list  of  groups  in 
a  2008  study  by  Seth  Jones  and  Martin  Libicki, 
which  refers  to  648  “terrorist  organizations”  that 
existed  between  1968  and  2006 — 244  are  still 
active  and  241  have  been  active  for  10  or  more 
years,  which  underscores  how  persistent  militant 
groups  can  be  once  they  have  been  established. 
This  set  of  data  also  points  to  the  fact  that  most 
terrorist  groups  are  very  small  and  count  in  the  10s 
rather  than  the  100s.  But  the  list  also  includes  a 
number  of  larger  groups,  which  are  better  under¬ 
stood  as  rebel  or  guerrilla  movements  (such  as 
FRELIMO  in  Mozambique,  the  Maoists  in  Nepal, 
or  the  Sandinistas  of  Nicaragua)  or  simply  as  orga¬ 
nized  crime  groups.  Not  least  because  of  these  dif¬ 
ficulties,  governments’  official  lists  of  terrorist 
groups  are  much  smaller.  For  example,  the  U.S. 
State  Department  has  listed  45  groups  as  “foreign 
terrorist  organizations”  (July  2009);  the  Canadian 
Security  Intelligence  Service,  40  (November  2008); 
and  the  Council  of  the  European  Union,  47 
(January  2009);  a  large  number  of  them  are  char¬ 
acterized  by  an  Islamist  ideology. 

Beyond  the  issue  of  numbers,  the  literature  on 
terrorism  mainly  deals  with  the  question  of 
typologies  since  terrorist  groups  differ  greatly 


according  to  various  aspects.  The  most  common 
typology  focuses  on  the  ideological  orientation 
and  worldviews  of  terrorist  groups.  Depending 
on  the  author,  they  are  categorized  as  follows: 

•  social  revolutionary,  Marxist,  or  left-wing 
groups  (e.g.,  the  German  Red  Army  Faction); 

•  ethnonational,  separatist  groups  (e.g.,  the  IRA  or 
the  Basque  Homeland  and  Liberty  [Euskadi  ta 
Askatasuna,  ETA]); 

•  right-wing,  neo-fascist,  or  racist  groups  (e.g.,  the 
Ku  Klux  Klan  in  the  United  States); 

•  religious  groups  (e.g.,  A1  Qaeda  and  other 
Jihadist  groups);  and 

•  apocalyptic  groups  (the  Aum  Shinrikyo  sect  in 
Japan  may  be  a  case  in  point). 

The  assumption  is  that  the  form  of  ideology  does 
explain  other  aspects  of  terrorist  groups,  such  as 
the  political  agendas,  the  links  with  particular 
communities  and  milieus,  the  level  of  public  sup¬ 
port,  the  duration  of  the  group,  or  their  destructive 
potential. 

An  alternative  typology  distinguishes  terrorist 
groups  on  the  basis  of  their  geographical  reach, 
that  is,  based  on  whether  their  actions  and  goals 
are  limited  to  a  local  area  or  whether  they  pursue 
an  overarching,  global  agenda. 

First,  national  or  domestic  terrorist  group  refers 
to  a  terrorist  group  that  operates  mainly  in  its 
home  country  to  challenge  the  domestic  political 
status  quo,  either  by  regime  change  or  by  territo¬ 
rial  separation;  perpetrators  and  victims  are  both 
residents  of  the  same  state.  Despite  their  contacts 
with  like-minded  groups  elsewhere,  the  Basque 
ETA,  the  IRA,  the  Kurdish  Workers’  Party  (Partiya 
Karker  Kurdistan,  PKK),  the  Sri  Lankan  Tamil 
Liberation  Tigers  of  Tamil  Eelam  (LTTE),  and  the 
European  left-wing  groups  of  the  1970s  and  1980s 
(e.g.,  Brigate  Rosse  in  Italy,  Action  Directe  in 
France,  and  Red  Army  Faction  in  Germany)  do  by 
and  large  fit  into  that  category. 

Second,  the  label  international  terrorism — or 
internationally  operating  terrorists — applies  to 
groups  that  aim  at  internationalizing  a  local  con¬ 
flict  to  attract  the  attention  of  a  wider  public  and 
to  put  their  issues  on  the  international  agenda. 
They  deliberately  choose  and  attack  targets  abroad; 
typically,  assassins  and  victims  are  not  residents  of 
the  same  state.  Examples  for  this  kind  of  terrorism 
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have  been  Palestinian  groups,  such  as  the  Palestine 
Front  for  the  Liberation  of  Palestine  (PFLP),  the 
PFLP-General  Command  (PFLP-GC),  and  the  Abu 
Nidal  Group,  which  in  the  late  1960s  consciously 
decided  to  undertake  spectacular  operations  abroad 
to  make  the  world  aware  of  the  Palestinian  cause. 

Third,  transnational  terrorism — sometimes  also 
called  new  terrorism — involves  groups  and  net¬ 
works  of  groups  that  aim  at  changing  the  global  or, 
at  least,  the  regional  political  order.  Their  acts  are 
directed  against  the  actual  or  assumed  dominance 
or  hegemony  of  certain  states,  ideologies,  and  ways 
of  life  (e.g.,  the  Western  culture).  These  groups  may 
be  engaged  in  local  conflicts  also,  but  they  see  these 
struggles  in  a  broader  context  of  a  global  or 
regional  fight.  Transnational  terrorist  groups  share 
a  number  of  features,  including  a  transnational 
ideology,  multinational  membership,  and  transna¬ 
tional  network  structures  designed  to  bypass  states 
and  regions.  The  paradigm  case  is  the  A1  Qaeda 
network  of  Osama  bin  Laden.  Since  1996  at  the 
latest,  bin  Laden’s  Islamist  movement  has  been 
directly  targeting  U.S.  and  Western  influence,  or 
“imperialism,”  in  the  Islamic  world,  which  in  its 
view  has  to  be  defeated  first  in  order  to  “liberate” 
Muslim  populations  from  their  “corrupt”  rulers 
and  install  a  “true”  Islamic  order  under  the  rule  of 
the  Caliphate.  In  its  ideology,  A1  Qaeda  has  linked 
local  conflicts  with  Muslim  involvement  (e.g., 
Bosnia,  Chechnya,  Mindanao,  and  Indonesia), 
with  the  mission  of  a  global  jihad  against  the 
“Jewish-Christian  crusaders.”  Moreover,  in  con¬ 
trast  to  most  conventional  terrorist  groups,  A1 
Qaeda  works  as  a  multinational  enterprise,  which — 
despite  its  Arab  origin — comprises  persons  of  dif¬ 
ferent  ethnic,  national,  and  linguistic  backgrounds 
bound  together  by  a  common  transnational  Islamist 
ideology.  Other  groups — associated  with  A1  Qaeda 
to  a  greater  or  lesser  extent — such  as  Jemaah 
Islamyyiah  in  Southeast  Asia,  the  A1  Qaeda  of  the 
Maghreb  region,  and  some  Central  Asian  as  well  as 
some  Pakistani  jihadi  groups — have  adopted  a 
similar  worldview. 

Understanding  and  Explaining 
Terrorist  Groups 

There  is  no  single  theory  that  could  explain  the 
emergence,  persistence,  transformation,  and 
decline  of  terrorist  groups  in  general — the  universe 


of  groups  is  simply  too  diverse  for  that.  However, 
it  has  been  widely  established  that  most  terrorist 
groups  are  the  radical  outgrowth  of  domestic  or 
international  conflicts.  Thus,  terrorist  groups  typi¬ 
cally  emerge  in  three  different  situations:  The  first 
is  a  latent  conflict  that  terrorists  aim  at  escalating 
by  their  acts  to  mobilize  parts  of  the  population  in 
order  to  trigger  a  mass  movement  (e.g.,  left-wing 
terrorism  in  Western  Europe  in  the  1970s).  Here, 
terrorist  groups  view  themselves  as  “catalysts”  of  a 
subversive  or  separatist  movement.  The  second  case 
of  terrorist  groups  emerges  in  the  context  of  an 
armed  struggle,  which  has  already  escalated  to  a 
greater  extent.  The  militarily  weaker  actor — feeling 
backed  into  a  corner — then  frequently  resorts  to 
terrorist  methods.  Depending  on  the  circumstances, 
terrorist  acts  may  replace  previous  strategies  alto¬ 
gether,  which  leads  to  the  actor’s  transformation 
and  the  consolidation  of  terrorist  structures.  In  the 
third  scenario,  a  ceasefire  or  peace  agreement 
would  have  been  concluded  between  the  warring 
factions,  which  will  then  be  undermined  and  sabo¬ 
taged  by  radical  fringe  groups  that  may  have  fre¬ 
quently  split  from  existing  militant  factions.  These 
groups  continue  the  “fight”  by  launching  terrorist 
attacks  to  reignite  the  conflict  (e.g.,  Real  IRA  in 
Northern  Ireland  after  the  1998  agreement). 

Despite  this  common  terrorism-conflict  nexus, 
there  are  a  number  of  factors  that  determine  the 
circumstances  under  which  terrorist  groups  emerge 
and  exist.  These  factors  can  be  analyzed  on  three 
levels  and  using  various  social  science  theories. 
First,  the  microlevel  refers  to  individual  motiva¬ 
tions  and  circumstances  to  establish  or  join  a  ter¬ 
rorist  group.  Here,  biographical  analysis,  theories 
about  deviant  behavior,  identity  theories,  psycho¬ 
logical  and  sociopsychological  theories,  and  ratio¬ 
nal  choice  theories  are  often  used  for  isolating 
explanatory  variables,  such  as,  inter  alia,  traumatic 
events,  past  or  current  experience  with  violence, 
personal  revenge,  altruism,  narcissistic  personality, 
paranoia,  peer  pressure,  frustration  and  aggression 
because  of  “relative  deprivation,”  and  cost-benefit 
calculations,  which  may  influence  the  behavior  of 
individuals.  Second,  the  mesolevel  considers  the 
group  as  a  collective  actor  together  with  its  deci¬ 
sions  and  activities.  In  this  regard,  theories  on  col¬ 
lective  action  (e.g.,  resource  mobilization  theory); 
social  learning  and  group  behavior  (e.g.,  ingroup- 
outgroup  thinking);  organizational  and  network 
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studies;  perceptions,  belief  systems,  and  mind  sets; 
political  economy;  concepts  and  theoretical 
assumptions  about  the  role  of  ideology;  and  myths 
and  symbols  as  well  as  theories  about  the  role  of 
leadership  (e.g.,  charismatic  leadership)  are  usually 
applied  in  order  to  analyze  the  establishment,  the 
internal  coherence  and  structure,  and  the  prospects 
for  persistence  of  a  group.  Third,  the  macrolevel 
addresses  the  general  political,  cultural,  and  socio¬ 
economic  environment  in  which  a  terrorist  group 
operates.  This  level  refers  to  modernization  theo¬ 
ries;  center-periphery  models  and  theories  about 
structural  imbalances;  issues  of  historical  injustice; 
approaches  of  conflict  and  peace  studies;  theories 
on  globalization,  transnationalization,  and  space; 
sociological  questions  about  the  political  system; 
and  the  behavior  of  ruling  elites  as  well  as  concepts 
of  statehood.  Based  on  these,  a  number  of  explan¬ 
atory  factors  are  mentioned  in  the  literature,  such 
as  poverty,  relative  deprivation,  political  frustra¬ 
tion  and  marginalization,  fragile  or  failed  state¬ 
hood,  lack  of  democratic  governance,  geographi¬ 
cal  aspects,  and  the  effects  of  shadow  economies 
and  global  markets.  In  any  case,  the  three  levels 
have  to  be  studied  in  combination  because  they  are 
deeply  intertwined:  For  example,  factors  at  the 
meso-  or  macrolevels  may  influence  individual 
motivations  and  the  demands,  needs,  and  identity 
of  individuals,  which,  in  turn,  have  to  be  reflected 
at  the  mesolevel  to  keep  a  group  together. 
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Theocracy 


The  term  theocracy  was  coined  by  Flavius  Josephus 
in  the  1st  century  CE  to  label  a  political  system  in 
which  political  rule  is  exercised  by  the  clergy  or 
other  religious  officials  deemed  as  representatives 
of  God  on  earth.  This  entry  discusses  the  meaning 
of  the  term  theocracy  and  its  presence  in  various 
religions,  historically  and  in  the  present. 
Theocracies  have  existed  throughout  the  world  at 
various  points  in  history,  not  only  in  the  Catholic 
and  Islamic  worlds  but  also  in  contexts  as  varied 
as  that  of  the  early  Mormons  in  the  United  States, 
under  Joseph  Smith  and  Brigham  Young,  and  in 
Tibet  from  the  13th  until  the  mid-20th  centuries. 
After  describing  the  concept  of  theocracy  in  vari¬ 
ous  religions,  this  entry  further  explores  the  mean¬ 
ing  of  theocracy  through  the  example  of  the  role 
of  the  Russian  Orthodox  Church.  It  concludes 
with  questions  about  whether  political  society  in 
general  presupposes  some  broadly  theocratic 
foundation. 

The  Concept  of  Theocracy  in 
Various  Religious  Contexts 

In  the  strict  sense  of  the  term,  theocracy  is  not 
parallel  to  the  categories  of  democracy,  aristoc¬ 
racy,  and  monarchy.  One  would  indeed  be  at  a 
loss  if  one  wanted  to  situate  God,  who  certainly  is 
not  of  this  world,  in  a  concrete  political  function 
of  any  kind. 

The  word  theocracy  has,  however,  a  real  history, 
related  to  the  justifications  given  by  both  ecclesias¬ 
tical  and  secular  authorities  for  the  imposition  of 
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their  views  in  the  political  sphere  or  in  some  aspects 
of  it.  There  is  a  theocratic  aspect  to  a  given  view¬ 
point  if  it  is  presented  as  indisputable  because  it  is 
derived  from  a  scripture  deemed  religious,  or 
sacred,  such  as  the  Bible  or  the  Koran,  or  from  an 
assertion  coming  from  a  religious  authority  not 
recognized  as  competent  in  the  political  sphere.  The 
latter  is  the  case  with  fatwas,  the  decrees  pro¬ 
claimed  by  one  form  or  other  of  Islamic  authority, 
an  ulema  or  a  college  of  ulemas,  even  if  it  is  on  the 
other  hand  affirmed  that  in  Islam  there  is  no 
authority  of  any  kind  above  the  individual  believ¬ 
er’s  reading  of  the  Koran. 

During  the  Middle  Ages,  the  removal  of  a  king 
or  an  emperor  by  a  pope  could  also  be  termed  an 
act  with  a  theocratical  basis,  since  papal  sover¬ 
eignty  gave  the  pope  indirect  power  over  the 
political  sphere.  Another  example  is  the  pope’s 
authority  to  intervene  in  the  case  of  sin  (ratione 
peccati)  when  there  is  a  question  of  morality. 

On  the  other  hand,  some  princes  and  some 
kings,  especially  those  who  were  the  objects  of 
quasi-sacramental  coronations,  seemed  to  be 
invested  with  a  divine  right  proper;  there  was 
something  theocratic  about  them,  their  power,  and 
their  commands.  One  may  wonder  whether  this 
form  of  pretension  still  exists  today  anywhere.  It 
may  have  been  the  case,  probably,  with  the 
Japanese  emperor  until  after  World  War  II.  The 
present  Iranian  regime,  in  which  the  final  authority 
lies  with  an  ayatollah  (presently  Seyed  Ali 
Khamenei),  can  in  some  real  sense  be  termed  theo¬ 
cratic,  and  the  same  applies,  with  the  differences 
specific  to  the  Sunni  world,  to  the  Wahhabi  mon¬ 
archy  of  Saudi  Arabia.  Today’s  jihadist  movement 
advocates  the  restoration  of  the  caliphate  system, 
abolished  almost  a  century  ago  by  Atatiirk  at  the 
fall  of  the  Ottoman  Empire. 

John  Calvin  is  known,  for  his  part,  to  have 
ruled  Geneva,  the  city  of  his  reformation,  in  a 
theocratic  way  and  with  a  theocratic  justification. 
Luther,  contrarywise,  reinforced  the  profane  power 
of  the  temporal  sovereigns,  freeing  them  from  any 
pretense  of  jurisdiction  of  the  Catholic  authority. 
Thus,  in  his  theory  of  two  kingdoms,  he  made 
them  dependent  on  God  alone  (Zwei  Reicbe). 

In  the  Catholic  world  of  the  17th  century, 
Jacques-Benigne  Bossuet  (1627-1704)  was  con¬ 
vinced  of  the  divine  character  of  the  absolutist 
power  of  his  king.  More  generally,  he  thought  that 


there  is  something  divine  in  the  proper  sense  of  the 
term  invested  by  God  himself  in  the  princes  as 
such.  One  had  consequently  to  show  “religion” 
proper  toward  them,  though  it  was  to  be  the  reli¬ 
gion  of  “second  majesty,”  or  second  degree,  in 
comparison  with  the  religion  of  first  majesty, 
which  we  owe  to  God  [La  politique  tiree  des  pro- 
pres  paroles  de  I’Ecriture  sainte.  III  2,  3).  All  this 
again  sounded  theocratical,  at  least  in  the  broad 
sense  of  the  term.  Perhaps  one  can  ask  whether 
there  is  any  branch  of  religion  that  has  not  known 
something  of  this  in  the  past.  But  even  today,  in 
Tibetan  Buddhism,  the  power  of  the  Dalai  Lama, 
at  the  same  time  temporal  and  spiritual,  is  invested 
with  a  similar  dual  character. 

The  Russian  Orthodox  Church 

On  the  other  hand,  the  close  association  of  a 
state  with  a  religion — even  if  it  is  short  of  a  state 
religion — produces,  similarly,  some  kind  of  a  theo¬ 
cratical  effect.  In  contemporary  Russia,  where,  in 
spite  of  the  constitutional  separation  of  church  and 
state,  there  exists  a  special  recognition  of  the 
Orthodox  Church  (Moscow  patrarchate)  in  some 
documents  of  the  state,  one  can  sense  a  special 
religious  aura  of  favor  to  the  advantage  of  the 
Church  in  many  actions  of  the  government.  In  the 
perception  of  many  citizens,  the  government  is  not 
quite  purely  profane.  Conversely,  this  has  to  do 
with  the  special  relationship  to  the  “nation”  that 
appears  in  the  statements  of  the  Orthodox  Church. 
In  the  Sotsialnaia  kontseptsia  ( Bases  of  the  Social 
Concept  of  the  Russian  Orthodox  Church), 
approved  by  the  Holy  Synod  in  the  year  2000,  one 
reads, 

The  Church  does  not  distribute  men  according  to 
nationality  or  social  origin.  .  .  .  The  universal 
character  of  the  Church  does  however  not  mean 
that  the  Christians  do  not  enjoy  a  national 
specificity,  a  national  expression.  [There  exists]  a 
Christian  national  culture.  .  .  .  Among  the  saints 
whom  the  Russian  Orthodox  Church  venerates 
many  have  made  themselves  famous  by  their  love 
and  fidelity  to  their  fatherland. 

While  it  is  true  that  no  mention  is  made  here 
of  the  1941  case  when  the  Patriarch  was  called 
by  Joseph  Stalin  to  the  rescue  of  the  nation,  the 
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insistence  is,  however,  significant.  Following 
Germany’s  attack  on  the  Soviet  Union  in  1941, 
Stalin  relaxed  his  formerly  vigorous  antireligious 
and  anticlerical  policy  and  adopted  a  more  toler¬ 
ant  stance  toward  the  Russian  Orthodox  Church. 
According  to  some  historians,  this  shift  occurred 
in  part  to  satisfy  U.S.  President  Franklin  Roosevelt’s 
conditions  for  increased  American  support  in  the 
war  against  Adolf  Hitler  and  also  because,  par¬ 
ticularly  with  clerical  approval,  many  who  might 
resist  an  appeal  to  go  to  battle  for  communism 
could  be  relied  on  to  do  so  for  the  Holy  Russia  of 
Orthodox  Christianity.  The  Russian  Church  has 
many  times  given  its  blessing  to  the  people  for 
their  participation  in  the  Liberation  wars. 

Today  other  Christian  denominations  in  Russia 
complain  about  the  special  treatment  of  the 
Orthodox  Church.  Some  Muslims  too  are  critical 
of  this  situation  of  privilege  enjoyed  by  the  Russian 
Orthodox  Church. 

Of  course,  the  Russian  Orthodox  Church,  how¬ 
ever,  is  far  from  theocratic  in  Ayatollah  Khomeini’s 
sense  of  running  a  state  as  of  a  few  years  ago  in 
Iran,  or  even  in  Khameini’s  sense  of  running  a  state 
today.  It  is  Islam  that  most  frequently  looms  on  the 
horizon  whenever  one  speaks  of  theocracy  today, 
even  if  it  be  in  a  vague  sense  in  countries  that  do  not 
kowtow  to  politicoreligious  authorities  as  such. 

The  Possibility  of  Political 
Authority  Without  Theocracy 

The  word  theocracy  may  also  come  up  in  discus¬ 
sions  on  the  question  of  whether  any  political 
authority  can  exist  without  an  aura  of  religious 
transcendence,  such  an  aura  signifying  conversely 
that  the  political  authority  bows  to  conscience  and 
stops  from  acting  whenever  conscience  or  consci¬ 
entious  objection  is  at  stake — the  latter  being,  of 
course,  of  utmost  importance.  There  is  indeed  such 
a  majesty  in  political  power  that  some  authors  do 
not  hesitate  to  see  an  element  of  theocracy  in  the 
democratic  governments  themselves.  In  March 
2007,  the  Paris  Revue  d’ethique  et  de  theologie 
(Journal  of  Ethics  and  Moral  Theology )  published 
an  article  of  strongly  paradoxical  intent  by  the 
prominent  French  scholar  Remi  Brague  under  the 
title  “Un  regime  autre  que  la  theocratie  est-il  pos¬ 
sible?”  (“Is  a  Regime  Other  Than  Theocracy 
Possible?”).  Brague,  an  expert  on  medieval  Arabic, 


Jewish,  and  Latin  philosophy  and  cultural  history, 
notes  that  democracy  particularly  is  some  kind  of 
theocracy: 

Whether  the  fundamental  idea  in  democracy  is 
the  law  or  conscience  does  not  make  much  of  a 
difference:  the  two  of  these  ideas  have  theological 
foundations.  This  is  why  our  democratical  ideals 
of  the  supreme  power  of  the  law  (Law  state)  or 
of  an  interior  tribunal  of  ethics  as  the  last 
authority  in  the  each  person’s  soul,  are 
theologically  grounded,  (p.  110) 

In  Western  political  regimes,  Brague  says,  law  is 
considered  to  be  “anchored  in  the  people’s  will,” 
but 

the  people  consists  of  free  human  beings,  apt  to 
recognize  what  they  should  do  through  listening 
to  the  voice  of  their  conscience.  In  this  case  too, 
law  is  grounded  in  the  last  instance  in  the  idea  of 
the  human  conscience.  Vox  populi,  vox  Dei  is 
the  origin  of  “one  man,  one  vote.”  (p.  Ill) 

Brague  then  discusses  in  the  same  vein  the 
nature  of  divine  law  and  Islam  and  notes  also  that 
with  respect  to  Western  democracy,  “we  Westerners 
live,  or  have  lived,  in  a  certain  sense,  in  a  theocracy 
of  conscience”  (p.  117).  Although  today  Westerners 
want  to  conceive  of  their  regime  as  a  democracy, 
Brague  asks  whether,  in  fact,  that  is  the  kind  of 
society  we  really  want.  He  argues  that,  instead,  we 
want  a  regime  of  free  citizens.  That  freedom  means 
“that  we  have  to  obey  no  other  instance  but  our 
conscience,  the  presence  of  which  in  us  expresses 
God’s  care  for  his  creature”  (p.  118).  With 
Rousseau,  Fichte,  and  Kant,  there  has  been  a  secu¬ 
larization  of  conscience.  But,  says  Brague, 

The  secularized  state  of  our  democracies  is  [itself] 
a  problem.  One  can  indeed  ask  whether  a  human 
community  can  live  in  the  long  run  otherwise 
than  a  “laos”  (people  under  a  conscience)  or 
without  some  kind  of  a  theological  basis.  ...  Is  a 
non  theocratical  regime  possible  in  the  long  run? 

(p.  120) 

If  we  take  theocratical  in  a  broad  sense  of  the  term, 
the  answer  is  indeed  clearly  no.  No  one  is  self- 
engendered.  The  social  contract,  so  praised  as  the 
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foundation  of  political  society,  does  not  produce 
those  who  themselves  make  the  contract.  So,  at  the 
least,  it  seems  that  political  life  presupposes  some 
sense  of  moral  authority  as  a  condition  for  the 
social  contract,  though  this  may  not  be  considered 
a  theocractic  concept  in  the  strict  sense,  especially 
today,  when  the  term  is  used  primarily  in  reference 
to  Islam. 

Authorities  in  the  Religious  Sphere  Itself 

Can  the  religious  authority  exercised  in  the  reli¬ 
gious  sphere  proper — that  is,  the  authority  of  the 
Christian  bishops,  and  of  the  Catholic  pope  in 
particular — be  termed  theocratic ?  It  is  not  so,  at 
least  in  the  official  documents  of  the  Catholic 
Church.  Any  such  reference  is  normally  avoided 
because,  among  other  things,  it  is  obvious  that 
such  authorities  are  under  the  authority  of  the 
Scriptures  themselves.  The  Vatican  II  Council  has 
of  late  repeatedly  stressed  that  there  is  a  term  of 
reference  beyond  any  episcopal  or  pontifical 
authority  itself. 

Such  authority  is  religious  when  it  is  derived 
from  the  faith  of  the  believers,  but  this  does  not 
mean  the  same  as  theocratic,  because  here  there 
can  be  no  pretension  to  enjoying  God’s  very 
authority,  or  some  part  of  it.  There  is  the  preten¬ 
sion  only  of  some  participation  in  the  authority  of 
Jesus  Christ  as  the  founder  of  this  body,  the 
Church,  in  which  he  is  in  some  sense  continued. 
Vatican  II  in  its  document  Lumen  gentium  (1964) 
says,  “The  social  whole  which  constitutes  the 
Church  is  at  the  service  of  the  Spirit  of  Christ  who 
gives  it  life,  for  the  growth  of  that  body”  (No.  8). 

Aftermath  of  “Political  Theology” 

In  the  realm  of  political  philosophy,  one  should 
finally  also  mention  here  the  Christian  legal  scholar 
and  philosopher  Carl  Schmitt  (1888-1985),  whose 
views  on  the  national  socialism  of  Nazi  Germany 
probably  were  drawn  from  his  adherence,  in  his 
early  writing  Politische  Theologie  (1922),  to  the 
idea  of  some  form  of  perpetuation  of  the  Holy 
Empire.  In  a  context  of  “neutralization”  and 
“depolitization,”  dominated  by  a  technicist  way  of 
thinking — the  Weimar  Republic — he  defended  a 
view  of  politics  as  not  innocent  but  instead  charac¬ 
terized  by  the  dialectics  of  friend  and  foe  and  the 


presence  of  power  (Macht),  as  well  as  a  certain 
transcendence.  This  colored  in  some  way  the  very 
positive  attitude  he  showed,  at  least  for  a  time, 
toward  national  socialism.  (For  this  reason,  it 
would  of  course  be  dangerous  to  associate  all  ideas 
of  transcendence  in  political  philosophy  with  such 
an  extreme  vision  of  the  political  realm) . 

Jean-Yves  Calvez 
Jesuit  Faculties  of  Philosophy  and  Theology 

Paris,  France 
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Thick  Description 


Thick  description  is  a  research  method  enabling  us 
to  discern  meanings  within  the  contexts  in  which 
social  actions  take  place.  It  originated  as  a  tool  for 
ethnographers  engaged  in  participant  observation 
and  was  later  adopted  by  a  wider  range  of  qualita¬ 
tive  researchers,  including  some  sociologists  and 
political  scientists.  In  his  famous  1973  essay  titled 
“Thick  Description:  Toward  an  Interpretive 
Theory  of  Culture,”  the  anthropologist  Clifford 
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Geertz  explained  the  contrast  between  “thick” 
and  “thin”  description.  Drawing  from  the  Oxford 
philosopher  Gilbert  Ryle’s  reflections  on  thinking 
as  entailing  an  account  of  the  settings,  circum¬ 
stances,  and  intentions  that  give  actions  their 
meanings,  he  showed  how  important  it  is  to  go 
beyond  mere  facts  and  appearances  when  it  comes 
to  interpreting  behavior.  The  most  well-known 
illustration  provided  is  that  of  the  contraction  of 
one  eyelid,  which  may  be  taken  as  either  an  invol¬ 
untary  twitch  or  as  a  wink  (imparting  a  particular 
message,  related  to  a  socially  established  code),  as 
a  parody  of  wink,  or  even  the  rehearsal  of  this 
parody.  Geertz  goes  on  with  other  examples  taken 
from  his  own  field  experiences.  The  core  of  the 
argument  is  that  performing  empirical  research 
involves  accurately  describing  social  actions  and 
exploring  underlying  meanings.  In  other  words, 
thick  description  leads  to  thick  interpretation. 
This  entry  discusses  the  basic  assumptions  under¬ 
lying  this  method  and  its  relevance  for  political 
science. 

Methodological  Critique  and 
Epistemological  Debate 

From  a  methodological  point  of  view,  such  an 
approach  marks  an  important  departure  from 
traditional  assumptions.  What  is  explicitly  criti¬ 
cized  is  the  empirical  “thinness”  of  some  research 
traditions — such  as  behavioralism — which  largely 
overlook  what  makes  sense  to  the  social  actors  stud¬ 
ied.  A  typical  reproach  from  the  advocates  of  thick 
description  is  that  in  many  sectors  of  the  social 
sciences,  there  is  too  much  abstract  theorization 
and  not  enough  serious  field  research.  Interpretivists 
are  notably  skeptical  about  “armchair  scholars” 
who  delegate  the  collection  of  data  to  polling  orga¬ 
nizations  or  junior  assistants  and  concentrate  on 
the  more  noble  tasks  of  calculation  and  theory 
building.  To  some  extent,  this  opposition  coincides 
with  the  endless  debates  between  researchers  who 
use  quantitative  indicators  that  are  deemed  reliable 
insofar  as  they  are  replicable  and  those  who  resort 
instead  to  direct  observation  or  long  interviews 
because  they  believe  that  only  intense  immersion 
into  particular  settings  will  yield  the  information 
required.  Within  the  first  (positivist)  tradition,  the 
norm  is  to  handle  “objective”  data  or  at  least  evi¬ 
dence  that  is  not  excessively  dependent  on  what 


might  be  seen  as  the  researcher’s  subjective  read¬ 
ing;  moreover,  all  sorts  of  highly  sophisticated 
analytical  instruments  (correlation  coefficients, 
causal  modeling  based  on  econometric  techniques, 
etc.)  are  seen  to  guarantee  “true  science.”  On  the 
other  (interpretivist)  side,  serious  doubts  are  har¬ 
bored  about  the  reliability  of  the  procedures  and 
the  outcomes  these  procedures,  which  are  often 
influenced  by  the  questions  asked. 

At  the  epistemological  level,  it  is  claimed  that 
thick  description  leads,  for  better  or  worse,  to  con- 
textualism.  Rich  information  is  a  leading  hallmark 
of  thick  descriptive  qualitative  research — not  in 
the  sense  of  pooling  superficial  comparable  data 
from  many  places  but  in  amassing  many  contextu¬ 
ally  relevant  details.  This  richness  may  be  seen  as 
commendable  in  the  context  of  monographs;  how¬ 
ever,  it  is  somewhat  problematic  when  theoretical 
ambitions  or  even  comparisons  are  involved.  Once 
again,  here,  we  are  at  the  core  of  hotly  debated 
axiomatic  beliefs.  Many  social  scientists  share  the 
view  that  it  is  incumbent  on  them  to  emulate  the 
natural  sciences  and  provide  universal  laws.  Any 
“excessive  empiricism”  is  consequently  viewed  as 
a  posture  making  it  difficult  to  develop  necessary 
generalizations.  On  the  contrary,  in  the  other 
camp,  it  is  held  that  the  chief  merit  of  in-depth 
field  studies  lies  precisely  in  making  us  understand 
that  efforts  to  explain  the  social  world  with  grand 
theories  held  to  be  ubiquitously  valid  amount  to 
denying  important  sources  of  difference  and  com¬ 
plexity.  In  political  science,  contextual  analysis — 
referring  to  historical,  cultural,  psychological,  and 
demographic  dimensions  and  others — is  some¬ 
times  taken  quite  seriously  (see  Robert  Goodin  & 
Charles  Tilly,  2008).  However,  the  question  arises 
as  to  what  extent  context  matters  and  whether  the 
main  goal  of  the  discipline  should,  or  should  not, 
remain  the  defense  of  abstract  and  homogenizing 
models,  even  at  the  price  of  glossing  over  impor¬ 
tant  contextual  dimensions. 

We  are  touching  here  on  several  key  themes,  such 
as  context  dependency  possibly  leading  to  relativ¬ 
ism,  the  sensitive  question  of  causality  and,  more 
generally,  that  of  whether  or  not  the  social  sciences 
should  seek  to  ape  the  methods  used  in  the  “hard” 
sciences.  To  return  to  Geertz,  who  clearly  defined 
himself  as  an  “anti  anti-relativist,”  the  world  is 
incorrigibly  diverse  because  human  development 
has  led  to  infinite  sociocultural  arborescence,  with 
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ever  multiplying  branches.  This  enabled  the  exis¬ 
tence  of  myriad  cultural  expressions  and  an  unlim¬ 
ited  number  of  potentialities  and  motivations,  each 
of  which  should  be  understood  in  its  particular 
context.  This  clearly  draws  attention  to  the  impor¬ 
tance  of  culture  and,  more  specifically,  to  what 
concerns  us  here — its  place  within  the  analysis  of 
political  life.  In  what  is  probably  his  most  often 
quoted  phrase,  Geertz  (1973)  explains, 

Believing  with  Max  Weber,  that  man  is  an 
animal  suspended  in  webs  of  significance  he 
himself  has  spun,  I  take  culture  to  be  those  webs, 
and  the  analysis  of  it  to  be  therefore  not  an 
experimental  science  in  search  of  law  but  an 
interpretative  one  in  search  of  meaning,  (p.  5) 

It  is  important  to  note  that  he  adds  that  culture 
must  be  understood  as  a  “deep”  context  and  not 
merely  as  “reflecting,”  “expressing,”  “correspond¬ 
ing  to,”  “emerging  from,”  or  “conditioned  by” 
interest-based  social  cleavages.  This  perspective 
obviously  defies  the  assumptions  of  several  classi¬ 
cal  schools  of  thought  (e.g.,  Marxist  visions  in 
terms  of  ideology  or  constructivist  theories  of 
social  reality),  and  it  was  to  have  a  considerable 
influence  on  several  disciplines,  leading  notably  to 
what  is  now  known  as  the  new  cultural  history  or 
cultural  sociology. 

Universalism  Versus  Relativism 
in  Political  Science 

As  far  as  political  science  is  concerned,  and  more 
specifically  if  we  look  at  the  subfield  of  compara¬ 
tive  politics,  it  might  be  argued  that  two  rival 
metatheoretical  frameworks  for  research  are  used: 
the  universalist  and  the  relativist.  Within  each  one 
of  them,  obviously,  there  are  distinct  standpoints. 
For  example,  both  structuralist  and  rational  choice 
perspectives  are  universalist.  Similarly,  there  is  a 
variety  of  cultural  approaches.  In  the  following,  an 
attempt  will  be  made  to  clarify  the  positioning  of 
interpretive  perspectives  based  on  thick  description 
vis-a-vis  other  established  traditions  in  this  disci¬ 
pline.  Within  the  universalist  paradigm,  there  is, 
first  of  all,  the  political  philosophy  tradition. 
Geertz  insists  that  cultural  categories  emerging  from 
the  past  hold  us  in  thrall.  Therefore,  one  should 
wonder  whether  ways  of  thinking  formulated  in 


one  particular  context — say  that  of  Athens  during 
the  4th  century  BCE — are  relevant  when  it  comes 
to  interpreting  politics  in  settings  far  removed 
from  those  from  which  those  views  derive. 
Likewise,  having  systematic  recourse  to  the  con¬ 
cepts  of  political  economy,  a  discipline  elaborated 
to  account  for  the  dynamics  of  the  rise  of  capital¬ 
ism  in  Europe,  might  not  always  be  relevant  to 
non-Western  polities.  There,  it  may  be  more 
appropriate  to  reason  in  terms  of  “moral  econ¬ 
omy”  or  “economy  of  affection,”  concepts  that 
have  been  applied,  respectively,  to  Southeast  Asia 
and  sub-Saharan  Africa  as  a  result  of  the  empirical 
observation  of  actual  political  practices.  In  a 
Geertzian  perspective,  one  should  think  twice 
before  using  any  apparatus  with  universalistic 
ambitions  because  they  are  usually  ethnocentric 
and  normative.  This  inevitably  raises  the  tricky 
issue  of  the  dominance  of  a  terminology  drawn 
from  the  experience  of  Europe  and  North  America, 
which  is  assumed  to  apply  to  the  entire  world. 
ITowever,  if  each  “terrain”  generates  its  own 
operational  notions,  how  can  political  scientists 
communicate  with  each  other,  and  how  can  we 
make  comparisons  possible?  We  shall  return  to 
this  crucial  point  and  suggest  how  the  conundrum 
can  be  resolved  at  the  end  of  this  entry. 

As  was  already  mentioned,  the  Geertzian 
approach  holds  that  culture  is  not  merely  to  be 
defined  as  a  by-product  of  more  fundamental  fac¬ 
tors.  This  does  not  necessarily  mean  that  we 
should  argue  in  terms  of  primacy  of  culture  but 
only  that  we  should  beware  of  approaches  that 
systematically  resort  to  the  same  structural  deter¬ 
minants  ( economic  ones,  for  instance ) .  Structuralists 
frequently  accuse  those  they  classify  as  “cultural- 
ists”  of  having  an  overly  impressionistic  and 
“essentialist”  viewpoint.  Seeking  to  identify  uni¬ 
versal  features  transcending  cultural  diversity,  they 
are  eager  to  reduce  the  range  of  observable  cases  to 
a  limited  number  of  given  types,  looking  for  func¬ 
tional  equivalents  if  need  be.  Pivotal  to  the  dispute 
between  the  two  schools  of  thought  is  the  question 
of  meaning.  In  sharp  disagreement  with  the  idea 
that  local  meanings  would  merely  be  a  negligible 
surface  phenomenon,  interpretivists  hold  that  it  is 
imperative  to  unravel  what  makes  sense  (or  not) 
to  the  actors  concerned.  Hence,  there  is  a  dialogue 
of  the  deaf  between  adherents  of  structuralist 
approaches,  who  argue  that  it  is  necessary  to  avoid 
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being  blinded  by  the  apparent  diversity  of  concrete 
situations,  and  advocates  of  thick  descriptions, 
who  are  constantly  wary  of  reductionist  approaches 
prone  to  fitting  realities  into  preconceived  molds. 

Another  influential  perspective,  within  the  uni- 
versalist  paradigm,  is  rational  choice.  We  can  all 
agree  that  human  beings  are  possessed  of  reason. 
However,  the  motivations  and  interests  that  moti¬ 
vate  groups  or  individuals  in  one  milieu  are  often 
incomprehensible  to  those  in  another.  For  inter- 
pretivists,  the  aim  is  not  to  deny  the  possible  use  of 
the  concepts  of  interest  and  rationality.  It  is  merely 
to  place  them  in  context.  Even  if  we  take  it  for 
granted  that  human  beings  always  carefully  weigh 
the  consequences  of  their  actions,  we  need  to 
accept  that  the  realm  of  possible  choices  is  con¬ 
strained  by  the  prevalent  universe  of  meaning 
within  which  they  live.  This  is  where  thick  descrip¬ 
tion  is  viewed  as  fundamental.  We  would  all  like 
to  believe  that  it  is  easy  to  transcend  the  limitations 
of  our  own  culture  and  to  understand  those  of  oth¬ 
ers.  In  reality,  this  is  an  exercise  that  requires 
lengthy  training  and  is  never  fully  completed.  To 
take  just  one  example,  Scandinavian  diplomats 
may  deplore  the  fact  that  in  international  summits 
“the  poorer  the  country,  the  more  ostentatious  its 
representatives,”  but  reciprocally,  the  conspicu¬ 
ously  modest  behavior  of  the  members  of  some 
Nordic  nongovernmental  organizations  appears  to 
be  incomprehensible  in  the  eyes  of  some  sub- 
Saharan  elites.  The  role  of  the  interpretivist  is  to  go 
beyond  such  ethnocentric  views  and  to  make  sense 
of  the  respective  attitudes  and  cross-perceptions. 

The  developmentalist  schools  of  thought  may  be 
placed  at  the  intersection  of  the  universalist  and  the 
relativist  paradigms.  The  singularity  of  these 
approaches  is  that  their  adherents  are  usually  rather 
conscious  of  important  disparities  between  societies 
or  polities,  but  they  also  share  the  assumption  that 
it  is  quite  possible  to  move  toward  a  desirable 
direction.  Their  frameworks  are  affected  by  the  fact 
that  they  take  their  own  model  of  reference  as  the 
end  point  toward  which  all  others  should  be  con¬ 
verging.  The  teleological  goals  might  be  different, 
and  indeed  radically  divergent  (suffice  it  to  think  of 
the  Cold  War  era  and  the  two  systems  in  competi¬ 
tion),  and  largely  biased  by  ideological  convictions. 
However,  evolution  toward  the  supposedly  best 
system  would  only  be  a  matter  of  time,  willpower, 
and  application  of  the  right  recipes.  Even  if  some 


macromodels  of  deep  sociopolitical  mutations 
deserve  to  be  admired  for  their  sophistication,  the 
question  is  whether  they  are  generally  applicable. 
Adherents  of  cultural  approaches  are  often  accused 
of  promoting  a  fairly  static  and  immutable  vision  of 
societies.  It  is  true  that  they  are  particularly  attuned 
to  the  longue  duree  and  thus  to  the  resilience  of 
cultural  traits  or  some  logics  of  adoption/adapta¬ 
tion.  However,  they  think  it  useful  to  reason  not 
only  in  an  evolutionist  way  (when  appropriate)  but 
also  in  terms  of  involutions — that  is,  to  pay  full 
attention  to  the  possible  specific  dynamics  of  some 
societies.  This  heuristic  concept,  widely  used  by 
Geertz  in  his  studies  of  Indonesia,  makes  it  possible 
to  account  for  cultural  patterns  that  are  no  longer 
traditional  but  not  evolving  into  a  recognizable 
modernity  either. 

Relativism,  Postmodernism,  Interpretivism 

Let  us  now  consider  the  relativist  paradigm.  As  is 
well-known,  some  confusion  is  engendered  by  the 
fact  that  there  are  so  many  conceptions  of  what  is 
exactly  meant  by  culture.  Space  limitations  pre¬ 
clude  the  full  enumeration  and  discussion  of  all  the 
schools  of  thought  that  take  cultural  dimensions 
seriously,  but  it  is  important  to  explain,  at  least,  in 
which  ways  an  interpretivist  approach  diverges 
from  perspectives  in  terms  of  values  and  from 
postmodernist  views.  In  political  science,  a  large 
number  of  scholars  equate  culture  with  systems  of 
beliefs  and  customs.  At  the  heart  of  all  these  defini¬ 
tions  lies  the  notion  of  values ,  which  refers  to  that 
“conception  of  the  desirable”  commonly  shared 
within  a  given  social  grouping.  If  it  is  arguable  that 
such  an  approach  is  useful  as  concerns  small,  tra¬ 
ditional  communities,  it  is  less  likely  that  it  can 
offer  insights  regarding  modern  polities,  character¬ 
ized  by  a  high  degree  of  social  differentiation,  and 
even  less  regarding  postmodern  societies  in  which 
normative  conventions  have  been  shattered.  As 
Bertrand  Badie  has  noted,  the  great  advantage  of 
an  approach  in  terms  of  meaning  is  that  once  we 
accept  that  culture  is  not  a  concatenation  of  nor¬ 
mative  standpoints  but  the  language  that  makes 
understanding  possible,  it  becomes  easier  to  explain 
why  individuals  who  live  within  the  same  cultural 
setting  can  hold  antagonistic  convictions  based  on 
different  values  and  still  belong  to  the  same  cul¬ 
ture.  For  similar  reasons,  interpretivists  distance 
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themselves  from  conceptions  in  terms  of  political 
culture.  If  they  accept  that  the  notion  of  culture 
refers  to  a  wide  semiotic  framework,  the  idea  of  a 
political  culture  autonomous  from  the  rest  of  a 
society’s  cultural  codes  presupposes  the  existence 
of  a  political  sphere  strongly  differentiated  from 
other  spheres  (religious,  economic,  etc.) — which  is 
far  from  being  generally  the  case,  even  in  the  con¬ 
temporary  world.  Beyond  the  antagonistic  visions 
of  the  ideal  society  held  by  political  parties,  what  is 
of  interest  to  an  interpretivist  approach  is  that  they 
all  operate  within  shared  systems  of  meaning. 

It  is  sometimes  assumed  that  Geertzian  anti¬ 
positivist  writings  have  paved  the  way  for  post¬ 
modernist  skepticism  (see,  e.g.,  Goodin  &  Tilly, 
2008,  p.  8).  Admittedly,  some  postmodernists 
praise  interpretivists  for  having  recontextualized 
“Western  knowledge”  and  also  for  having  empha¬ 
sized  the  need  to  study  other  systems  of  meaning 
in  their  own  terms.  Nevertheless,  they  typically 
criticize  interpretivists  for  not  following  them  all 
the  way  when  it  comes  to  challenging  dominant 
cultures  and  glorifying  those  of  the  so-called 
oppressed  minorities.  There  is  obviously  a  huge 
gap  between  militant  postures  eager  to  propagate 
multiculturalism  and  a  scientific  one  merely  aim¬ 
ing  to  underline  the  magnitude  of  cultural  cleav¬ 
ages.  An  interpretivist  approach  does  not  exalt 
differences;  it  just  seeks  to  analyze  them  seriously. 
Likewise,  if  interpretivists  are  fairly  critical  of 
much  of  the  apparatus  of  the  natural  sciences, 
including  hypothesis  testing  or  the  formulation  of 
dependent  and  independent  variables,  it  does  not 
mean  that  they  readily  adhere  to  deconstructivist 
perspectives  considering  that  reality  is  pure  illusion 
and  that  everything  is  intertextual.  The  intention 
certainly  is  to  challenge  the  premises  on  which 
earlier  approaches  have  been  constructed  but  with 
the  aim  of  providing  a  new  form  of  science  gener¬ 
ating  rigorous  theorizing  driven  by  data  collected 
through  serious  fieldwork. 

This  has  several  important  consequences.  First, 
thick  description  implies  thinking  semiotically.  It 
should  be  underlined  that  the  semiotic  approach 
espoused  here  does  not  refer  to  semiology  and 
assorted  structuralist  theories  of  signs  derived  from 
linguistics  but  entails  being  particularly  attentive  to 
the  worlds  of  meaning  of  others.  A  key  aspect  that 
should  be  noted  is  that,  whereas  thin  descriptions 


use  experience-distant  concepts,  thick  descriptions 
resort  more  willingly  to  experience-near  concepts. 
In  other  words,  this  tradition  of  studies  is  not  very 
comfortable  with  ubiquitous  abstract  concepts  that 
confer  an  illusion  of  analytical  familiarity  to  all 
places  and  situations.  As  already  alluded  to,  a 
major  problem  is  our  need  to  operate  at  a  certain 
level  of  generalization  and  to  find  ways  of  translat¬ 
ing  the  unfamiliar  logics  of  other  cultures  into  an 
idiom  that  all  political  scientists  can  understand 
without  falling  prey  to  excessive  universalism.  This 
is  not  necessarily  an  impossible  mission.  Following 
Bertrand  Badie  and  Guy  Hermet  (1990),  one  pos¬ 
sible  approach  to  resolving  this  dilemma  is  to  make 
a  distinction  between  concepts  that  are  transcul- 
tural  and  those  that  are  monocultural.  For  instance, 
according  to  these  two  authors,  the  concept  of 
norm  falls  into  the  first  category  but  that  of  state 
would  fall  into  the  second — insofar  as,  thanks  to 
the  insight  provided  by  historical  sociology,  we 
understand  that  the  state  is  not  the  natural  frame¬ 
work  of  political  activity  but  merely  one  specific 
mode  of  political  organization,  which  unpremedi- 
tatedly  emerged  in  some  European  countries  at  the 
end  of  the  Middle  Ages.  Moreover,  we  know  that 
its  transplantation  to  other  contexts  has  often 
proved  problematic,  largely  because  of  cultural  fac¬ 
tors.  What  is  required  is  not  the  evaluation  of  states 
according  to  an  ideal-type  model  or  according  to  a 
gradualist  approach  that  would  put  Scandinavian 
countries  at  one  extreme  of  a  continuum  and 
Somalia  at  the  other.  It  is  rather  to  study  all  sorts 
of  polities  without  necessarily  placing  the  concept 
of  the  state  at  the  heart  of  our  research  and,  instead, 
to  pay  full  attention  to  the  thick  description  of  the 
nature  and  working  of  social  and  political  rela¬ 
tions.  Consequently,  comparative  studies  should 
question  the  explanatory  value  of  concepts  and 
analytical  models  across  cases. 

Second,  a  thick  description  perspective  involves 
thinking  inductively  rather  than  deductively.  This 
means  beginning  with  an  examination  of  empiri¬ 
cally  observed  realities  and  only  at  a  later  stage 
mobilizing  the  most  appropriate  instruments.  The 
chief  merit  of  in-depth  field  studies  is  precisely  in 
avoiding  the  temptation  of  a  priori  generalization 
on  the  basis  of  grand  theories  held  to  be  universally 
valid.  Although  many  apparatuses  are  worthy  of 
note,  it  is  often  relatively  simple  to  supply  empirical 
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findings  that  do  not  readily  fit  with  their  respective 
predictions.  The  existence  of  such  discrepancies 
necessitates  not  a  further  elaboration  of  alternative 
theories  with  global  explanatory  ambitions  but 
limited  ones  succeeding  in  making  sense  of  nonneg- 
ligible  variations  across  time  or  space.  The  scien¬ 
tific  reasoning  suggested  here  is  not  one  that  attains 
the  highest  level  of  abstract  generalization  but  one 
that  takes  into  account  various  logics  of  meaning. 
Placing  the  issue  of  meaning  center  stage  leads  us  to 
rethink  the  merits  and  demerits  of  the  classical 
approaches  and  challenges  the  supposed  universal¬ 
ity  of  some  mechanisms  (see  the  many  illustrations 
in  Patrick  Chabal  &  Jean-Pascal  Daloz,  2006). 
Such  an  interpretivist  standpoint  leads  to  theoreti¬ 
cal  eclecticism.  Needless  to  say,  its  intention  is  not 
to  juxtapose  largely  incompatible  theories  but 
rather  to  move  toward  an  awareness  of  the  limited 
validity  of  exclusivist  conceptual  frameworks.  The 
result  is  certainly  not  sterile  empiricism  or  subjec¬ 
tivism  but  a  nondogmatic  perspective  seeking  to 
illuminate  a  great  diversity  of  scenarios.  At  the 
paradigmatic  level,  in  contrast,  ecumenism  (see, 
e.g.,  Mark  Lichbach  &  Alan  Zuckerman,  1997, 
which  attempts  to  marry  rationalist,  cultural,  and 
structuralist  approaches  in  the  field  of  comparative 
politics)  can  only  be  considered  with  skepticism. 
There  is  obviously  little  possibility  of  analytical 
dialogue  or  theoretical  convergence  between  the 
proponents  of  universalist  and  relativist  paradigms. 

Jean-Pascal  Daloz 
St.  Anthony’s  College,  University  of  Oxford 
Oxford,  United  Kingdom 
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Think  Tanks 


The  term  think  tank  has  been  used  to  describe 
both  private  and  public  organizations  that  provide 
advice  to  governments.  A  generally  accepted  defi¬ 
nition  would  describe  think  tanks  as  organizations 
that  are  distinct  from  government  and  whose 
objective  is  to  provide  advice  on  a  diverse  range  of 
policy  issues  through  the  use  of  specialized  knowl¬ 
edge  and  the  activation  of  networks.  What  follows 
in  this  entry  is,  first,  a  discussion  of  the  historical 
and  geographical  development  of  the  concept;  sec¬ 
ond,  a  typology  of  think  tanks;  third,  a  discussion 
of  characteristics  and  myths  related  to  think  tanks; 
and  finally,  some  thoughts  about  the  impact  of 
globalization  on  think  tanks. 

Origins 

The  term  think  tank  was  first  used  in  military  jar¬ 
gon  during  World  War  II  to  describe  a  safe  place 
where  plans  and  strategies  could  be  discussed,  but 
its  meaning  started  changing  during  the  1960s 
when  it  was  used  in  the  United  States  to  describe 
private,  nonprofit  policy  research  organizations.  It 
has  been  proposed  that  in  reality  the  first  example 
of  a  think  tank  was  the  Fabian  Society,  which  was 
aiming  to  influence  public  policy  in  Britain  at  the 
end  of  the  19th  century.  For  many  years,  the  major¬ 
ity  of  scholars  studying  think  tanks  considered 
them  as  a  uniquely  American  phenomenon  that 
boomed  in  the  United  States  because  of  the  excep¬ 
tionality  of  its  political  system  and  its  rich  tradition 
of  private  funding  available  for  think  tanks.  From 
a  global  perspective,  however,  the  argument  that 
think  tanks  are  a  uniquely  American  phenomenon 
is  not  convincing.  Think  tanks  have  also  flourished 
in  other  industrial  Anglo-American  countries,  such 
as  Canada,  the  United  Kingdom  (UK),  and  Aus¬ 
tralia,  where  normally,  they  are  more  modest  than 
in  the  United  States.  European  think  tanks  vary 
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considerably.  In  Germany,  for  example,  large,  influ¬ 
ential  think  tanks  are  encountered,  but  they  are 
often  state  funded  and  associated  with  political 
parties  or  universities.  In  France,  what  could  be 
described  as  think  tanks  are  organizations  gathered 
around  the  government  in  Paris  that  operate  in  a 
conflictual  but  subordinate  relationship  with  politi¬ 
cal  parties.  In  southern  Europe,  think  tanks  are  a 
more  recent  phenomenon  as  they  began  to  appear 
in  the  1970s  after  the  establishment  of  democratic 
governments.  Research  on  think  tanks  outside  the 
Western  world  shows  that  it  is  likely  that  an  even 
greater  variety  of  organizations  exists  globally. 

Typology 

The  diversity  of  organizations  that  fall  under  the 
term  think  tank  has  led  to  the  creation  of  typolo¬ 
gies.  At  least  four  types  of  think  tanks  can  be 
observed.  The  first  is  the  ideological  tank,  which 
refers  to  organizations  that  have  a  clearly  specified 
political  or,  more  broadly,  ideological  philosophy 
and  are  clearly  related  to  the  category  of  advocacy 
tanks.  Examples  include  the  New  Right  think  tanks 
in  the  UK  and  the  think  tanks  that  are  affiliated  to 
political  parties  in  Germany.  This  type  is  the  closest 
to  the  stereotypical  idea  of  what  a  think  tank  is. 
The  next  type  is  the  specialist  tank,  which  includes 
institutes  that  have  a  thematic  focus.  The  most 
common  subjects  are  foreign  and  public  policy,  but 
think  tanks  also  specialize  in  other  issues,  such  as 
the  environment.  The  third  category  includes  insti¬ 
tutes  that  do  not  work  at  the  national  level.  They 
could  either  work  at  the  regional  level,  such  as  the 
American  state  tanks,  or  at  the  supranational  level, 
such  as  the  think  tanks  that  are  based  in  Brussels 
and  are  aiming  at  the  European  Union  (EU).  The 
final  category  is  the  think-and-do  tanks  that  relate 
to  organizations  that  apart  from  their  traditional 
research  activities,  are  starting  to  be  active  at  a 
more  practical  level,  such  as  in  the  funding  of  char¬ 
ity  projects.  This  type  of  think  tank  is  closer  to 
nongovernmental  organizations  (NGOs). 

Following  the  above  typology,  it  can  be  observed 
that  think  tanks  are  similar,  but  they  can  be  distin¬ 
guished  from  other  organizations  that  are  involved 
in  the  political  arena.  A  think  tank  is  different 
from  a  university  unit  that  offers  courses  but  also 
conducts  research.  It  is  different  from  philan¬ 
thropic  organizations  that  have  as  one  of  their 
lower  priorities  the  funding  of  research  and  prefer 


the  funding  of  actions  directed  to  society  in  a  more 
straightforward  way.  They  are  different  from  gov¬ 
ernment  advisory  organizations  because  they  play 
a  distinctive  and  unique  role  by  providing  more 
independent  intellectual  support  to,  or  new  alterna¬ 
tives  for,  public  policy.  Nevertheless,  there  have 
been  government  research  institutes,  for  example, 
in  France,  that  are  often  described  as  think  tanks. 
Finally,  think  tanks  are  also  different  from  pressure 
groups  and  interest  groups.  This  division  has 
become  less  obvious  because  pressure  groups 
increasingly  develop  in-house,  well-researched  cri¬ 
tiques  of  existing  policy.  One  of  the  most  impor¬ 
tant  differences  is  that  pressure  groups  have  a 
membership  of  individuals  as  one  of  their  central 
characteristics.  When  they  do  get  involved  in 
research,  they  do  it  to  support  their  campaigns,  and 
it  does  not  constitute  their  preliminary  interest. 

Common  Characteristics  of  Think  Tanks 

If  these  are  the  types  of  think  tanks,  what  are  the 
common  characteristics  of  these  organizations? 
First,  it  is  their  policy  focus,  which  means  that  their 
objective  is  to  bring  knowledge  and  policy  making 
together  by  informing  and,  if  possible,  influencing 
the  policy  process.  Think  tanks  conduct  and  recycle 
research  that  aims  to  solve  policy  problems  and  not 
solely  to  advance  the  theoretical  debate.  The  sec¬ 
ond  characteristic  is  public  purpose,  which  refers  to 
the  reason  for  the  existence  of  think  tanks.  Most  of 
the  think  tanks  claim  that  they  conduct  research  to 
inform  the  public  and  the  government  on  how  to 
improve  public  policy.  Their  rhetoric  often  claims 
that  their  work  is  for  the  common  good  and  to 
educate  the  public.  Third,  the  expertise  and  pro¬ 
fessionalism  of  their  research  staff  is  the  key  intel¬ 
lectual  resource  of  think  tanks  and  a  way  of 
legitimizing  their  findings.  Finally,  the  key  activities 
of  think  tanks  are  usually  research  analysis  and 
advice,  which  come  in  the  form  of  publications, 
conferences,  seminars,  and  workshops. 

Myths  Concerning  Think  Tanks 

It  has  been  argued  that  think  tanks  do  not 
always  possess  the  characteristics  described  above 
and  that  some  of  these  features  are  better  described 
as  myths.  The  first  myth  concerns  the  policy  focus 
of  think  tanks  and  their  role  as  bridges  between 
knowledge  and  government  policies.  As  has  already 
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been  discussed,  no  one  type  of  think  tanks  exists. 
Furthermore,  the  borders  between  think  tanks  and 
organizations  such  as  interest  groups,  professional 
associations,  consultants,  and  university  institutes 
are  blurred.  The  result  is  that  it  is  not  always  easy 
to  identify  an  organization  as  a  think  tank  and  that 
bridging  knowledge  and  policies  can  happen  in  a 
variety  of  ways,  if  indeed  it  is  a  priority  of  the 
organization.  Another  obstacle  for  think  tanks 
that  wish  to  play  the  role  of  a  bridge  between 
knowledge  and  government  is  that  in  many  coun¬ 
tries,  the  directors  and  experts  of  think  tanks  are 
closely  related  to  politicians  and  bureaucrats.  In 
reality,  they  belong  to  the  same  elite;  they  have 
similar  worldviews,  and  they  often  move  between 
governmental  organizations  and  NGOs.  Thus, 
think  tanks  cannot  fulfill  their  role  as  bridges. 

The  second  myth  is  that  think  tanks  serve  the 
public  purpose.  Although  the  public  purpose  is 
always  in  their  rhetoric,  think  tanks  are  themselves 
organizations  that  have  private  interests,  and  they 
are  dependent  on  their  sources  of  funding.  Often, 
their  concern  about  their  image  and  reputation 
limits  the  spectrum  of  their  policy  proposals.  It  is 
even  doubtful  to  what  extent  think  tanks  can  deter¬ 
mine  their  own  research  agendas  because  this  is 
often  dependent  on  contracts  and  funding  opportu¬ 
nities.  This  myth  is  closely  related  to  the  alleged 
independence  of  think  tanks,  which  as  has  been 
shown  is  only  partial  given  their  needs  for  funding 
and  publicity.  The  third  myth  concerns  the  knowl¬ 
edge  resources  of  think  tanks.  Although  think 
tanks  do  normally  recruit  experts  and  provide 
policy  analysis,  they  often  recycle  rather  than  pro¬ 
duce  academic  knowledge.  Their  aim  is  to  make 
academic  findings  more  palatable  for  busy  politi¬ 
cians  and  policymakers.  This  means  that  think 
tanks  play  an  important  role  in  setting  the  research 
and  policy  agenda  and  in  prioritizing  some  subjects 
over  others. 

International  Think  Tanks 

Globalization  has  affected  think  tanks,  especially 
by  increasing  their  appetite  and  capacity  for  inter¬ 
national  networking.  International  think  tanks  and 
global  networks  have  emerged  since  the  1990s. 
International  think  tanks,  although  they  are  based 
in  one  country,  claim  not  to  have  any  specific 
national  links.  An  example  is  the  European  Policy 
Centre  in  Belgium,  which  claims  that  it  is  interested 


in  the  EU  and  not  in  a  particular  member  state. 
Additionally,  there  is  a  tendency  for  transnational 
communication  between  think  tanks  to  occur 
through  the  creation  of  regional  and  international 
fora,  such  as  the  Global  Development  Network. 
International  organizations,  such  as  the  World 
Bank,  are  encouraging  this  kind  of  activity  through 
the  organization  of  a  number  of  regional  and  inter¬ 
national  conferences  for  the  promotion  of  think 
tanks  and  their  work.  On  the  other  hand,  think 
tanks  are  increasingly  offering  their  services  to 
international  organizations,  such  as  the  World 
Bank  and  the  EU.  All  this  international  activity  has 
translated  into  a  proliferation  of  the  number  of 
such  organizations  globally. 

Future  Challenges 

The  challenges,  but  also  the  opportunities,  that 
think  tanks  are  facing  are  many.  First,  the  nature 
of  funding  has  changed,  and  at  times  of  economic 
recession,  obtaining  it  has  become  difficult. 
Second,  the  proliferation  of  think  tanks  and  other 
NGOs  has  meant  increased  competition  but  also 
increased  possibilities  for  synergies,  better  produc¬ 
tivity,  and  broader  audiences.  Third,  the  rise  of  the 
Internet  has  multiplied  their  communication  pos¬ 
sibilities,  but  it  has  also  signified  the  spread  of 
information  that  is  not  always  accurate.  Fourth, 
the  emergence  of  specialist  think  tanks  in  new 
areas,  such  as  biotechnology  and  genetics  has 
facilitated  the  provision  of  more  focused  analysis. 
Finally,  globalization  has  meant  an  increased 
demand  for  policy  advice  and  a  new  role  for  think 
tanks  that  are  forced  to  study  policy  alternatives 
from  around  the  globe  and  then  adapt  them  to 
their  local  context. 

Stella  Ladi 
Panteion  University 
Athens,  Greece 
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Time-series  analysis  is  a  type  of  statistical  analysis 
of  observational  data  that  vary  over  regular  units  of 
time.  In  political  science,  the  data  used  are  typically 
aggregate  descriptors,  such  as  unemployment  rates, 
levels  of  presidential  approval,  the  incidence  of  ter¬ 
rorist  attacks,  or  the  percentages  of  Democratic  and 
Republican  Party  identifiers  in  the  electorate.  The 
time  intervals  over  which  these  data  are  gathered 
are  usually  years,  quarters,  or  months,  but  data 
gathered  over  much  shorter  periods  also  may  be 
employed.  For  example,  political  economists  study¬ 
ing  variations  in  currency  exchange  rates  or  volatil¬ 
ity  in  stock  markets  may  use  data  gathered  every 
day,  every  minute,  or  even  every  second.  Although 
there  are  exceptions,  time-series  analysts  normally 
do  not  generate  their  data  de  novo  but  rather  rely 
on  data  gathered  by  government  statistical  agencies 
or  private  entities,  such  as  public  opinion  polling 
agencies.  In  the  following,  the  basic  features  and 
recent  advances  in  this  field  are  presented. 

Historically,  time-series  analysis  was  closely  in¬ 
tegrated  with  other  aspects  of  econometrics.  Re¬ 
searchers  typically  began  by  specifying  a  model 
where  the  dependent  (Yt)  and  independent  (Xkt) 
variables  as  well  as  the  stochastic  error  term  (sz), 
were  subscripted  with  t  to  denote  variation  over 
time.  Equation  1  is  an  example  with  a  single  predic¬ 
tor  variable  (Xz).  The  coefficients  (30  and  [3,  in  this 
equation  were  assumed  to  be  time  invariant  and 
were  estimated  using  ordinary  least  squares  (OLS) 
regression: 

Yt  =  Po  +  +  Er  (1) 

A  standard  set  of  postestimation  diagnostics  was 
performed,  with  a  Durbin-Watson  test  used  to 


detect  the  presence  of  first-order  autocorrelation  in 
the  residuals.  Autocorrelated  residuals  prompted 
analysts  to  infer  the  existence  of  autocorrelation  in 
the  error  process,  that  is,  et  =  pet_}  +  vt,  where  vt  is 
a  well-behaved  Gaussian  error  term  that  is  ~N( 0, 
(T2)  by  assumption.  In  turn,  autocorrelated  errors 
were  treated  as  a  nuisance,  rather  than  as  an  indica¬ 
tor  of  model  misspecification.  This  nuisance  could 
be  eliminated  by  a  variant  of  generalized  least 
squares  (the  Cochrane-Orcutt  transformation), 
with  the  parameter  p  being  estimated  from  the  data. 

The  resulting  model  (Equation  2)  involves  first- 
order  partial  differences  of  all  left-  and  right-hand- 
side  variables,  including  the  error  process: 

Yt  ~  Py,  i  =  Po  “  PPo  +  PlXt 

—  P1pXz_1  +  Ez  —  pS;-!-  (2) 

The  model  thus  contains  a  common  factor  restric¬ 
tion  (1  —  pL),  where  L  is  a  backshift  operator.  Pol¬ 
itical  scientists  using  this  technique  often  appeared 
to  be  unaware  that  the  model  had  dynamic  proper¬ 
ties  with  effects  of  all  predictor  variables  being  dis¬ 
tributed  over  time  by  the  presence  of  a  lagged 
endogenous  variable  Yz_,,  with  the  rate  of  decay  of 
these  effects  being  governed  by  the  parameter  p. 

Starting  in  the  1970s,  analysts  became  increas¬ 
ingly  aware  of  the  restrictive  nature  of  the 
Cochrane-Orcutt  transformation,  and  many  began 
to  address  the  “nuisance”  of  autocorrelated  errors 
by  inserting  a  lagged  endogenous  variable  on  the 
right-hand  side  of  their  model  (Equation  3). 

Yt=  Po  +  XYt-i  +  PAt  +  et-  (3) 

This  specification  avoided  the  common  factor 
restriction  associated  with  the  Cochrane-Orcutt 
transformation,  but  it  still  imposed  a  possibly 
theoretically  unattractive  uniform  dynamic  on  the 
effects  of  all  predictor  variables,  with  the  rate  of 
decay  of  those  effects  being  governed  by  the 
parameter  A.  associated  with  the  lagged  endoge¬ 
nous  variable.  In  addition,  the  lagged  endogenous 
variable  heightened  the  possibility  that  parameter 
estimates  would  be  biased  and  inconsistent. 
Analysts  recognizing  this  possibility  typically  tested 
model  residuals  for  first-order  correlation  using 
statistics  such  as  Durbin’s  H  (the  Durbin-Watson 
test  being  inapplicable  for  models  with  lagged 
endogenous  variables). 
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Another  important  development  in  the  1970s 
involved  threats  to  inference  posed  by  trending 
(nonstationary)  data.  The  stationarity  condition 
requires  that  variables  have  constant  means,  con¬ 
stant  variances,  and  constant  autocovariances  for 
any  lag  k  regardless  of  the  value  of  t.  Well  before 
World  War  II,  prominent  statisticians  and  econo¬ 
mists  (e.g.,  George  Udny  Yule  and  John  Maynard 
Keynes)  worried  about  the  fact  that  relationships 
between  variables  as  suggested  by  statistically  sig¬ 
nificant  coefficients  estimated  in  time-series  regres¬ 
sions  might  be  artifacts  of  nonstationarity.  The 
reality  of  this  spurious  regression  threat  was  docu¬ 
mented  by  Clive  Granger  and  Paul  Newbold  in 
1974.  Using  simulated  nonstationary  variables 
that  were  independent  by  construction,  Granger 
and  Newbold  found  that  they  falsely  rejected  the 
null  hypothesis  for  regression  coefficients  at  alarm¬ 
ingly  high  rates.  Type  I  errors  (rejecting  a  true 
hypothesis)  were  a  serious  problem  when  data 
were  nonstationary. 

Traditionally,  researchers  concerned  about  this 
threat  had  “detrended”  their  data  by  regressing 
them  on  a  time  counter  corresponding  to  the  units 
of  temporal  aggregation  of  their  data.  However, 
many  analysts  believe  that  such  deterministic 
trends  are  rare  in  the  data-generating  processes  for 
variables  of  interest  to  social  scientists.  In  this 
regard,  Granger  and  Newbold  had  assumed  that 
nonstationarity  was  a  product  of  a  local-level  pro¬ 
cess,  such  as  a  random  walk  (see  Equation  4). 

Y,  -  1.0  *  Yf_1  +  Bf.  (4) 

As  Equation  4  shows,  a  random  walk  “remem¬ 
bers  everything”:  that  is,  its  value  at  time  t  equals 
its  value  at  t  —  1  plus  a  contemporaneous  random 
shock,  e(.  Since  shocks  are  not  discounted  over 
time,  the  value  of  a  random  walk  at  any  time  t  is 
the  sum  of  all  shocks  to  that  point  in  time,  plus  the 
initial  value  of  the  series.  The  variance  of  such  a 
process  increases  without  bound  as  t  goes  to  infinity. 
If  the  random  walk  model  also  includes  a  constant 
on  the  right-hand  side,  the  resulting  model — a 
random  walk  with  drift — generates  a  deterministic 
trend.  Random  walks,  with  and  without  drift,  are 
clearly  nonstationary. 

Given  the  seriousness  of  the  spurious  regressions 
threat,  it  is  important  to  diagnose  the  presence  of 
nonstationarity  in  time-series  variables.  Historically, 
the  simplest  of  these  diagnostics  has  been  an  ocular 


inspection  of  the  graph  of  successive  values  of  a 
time-series  variable.  If  the  variable  appears  to  move 
predominantly  upward  or  downward  over  the 
period  for  which  data  are  available,  the  analyst 
concludes  that  the  process  “trends,”  that  is,  it  is 
nonstationary.  A  second  diagnostic  was  suggested 
by  George  Box  and  Gwilym  Jenkins,  who  demon¬ 
strated  that  “stochastic  trending”  variables,  such  as 
random  walks,  have  characteristic  autocorrelation 
functions.  An  autocorrelation  function  is  the  cor¬ 
relation  of  a  time-series  variable  with  itself  at  suc¬ 
cessively  longer  lags.  The  autocorrelation  function 
for  a  random  walk  will  have  very  large  values  at 
low  lags,  and  the  value  of  those  correlations  will 
decay  only  very  slowly  as  the  lag  length  increases. 
The  autocorrelation  functions  of  stationary  vari¬ 
ables  are  very  different,  either  decaying  at  geomet¬ 
ric  rates  in  the  case  of  an  autoregressive  process  or 
having  one  or  more  significant  “spikes”  (correla¬ 
tions)  in  the  case  of  a  moving  average  process. 

Inspection  of  graphs  and  autocorrelation 
functions  involves  judgment  calls  rather  than 
formal  statistical  tests.  However,  over  the  past 
three  decades,  econometricians  have  developed 
a  large  battery  of  formal  tests  for  nonstationar¬ 
ity.  The  most  widely  used  of  these  unit-root  tests 
was  proposed  by  David  Dickey  and  Richard 
Fuller  (DF): 

(1  -  L)Yt  =  P0  +  P,T  +  XY, +  Ef,  (5) 

where  T  represents  time. 

The  DF  test  involves  regressing  the  first  differ¬ 
ence  of  a  variable  on  its  lag  in  levels  plus  possibly 
a  constant  and  a  deterministic  trend  (as  in  Equation 
5),  depending  on  hypotheses  about  the  data- 
generating  process.  The  key  parameter  is  X,  with  a 
significance  test  (t  test)  for  X  determining  whether 
the  variable  is  nonstationary.  For  the  DF  and  many 
other  (but  not  all)  of  these  tests,  the  null  hypothesis 
is  nonstationarity.  The  t  distribution  for  the  DF 
test  is  nonstandard,  and  special  tables  of  critical 
values  must  be  used  for  significance  tests.  If  one  is 
concerned  that  test  results  may  be  biased  because 
the  error  process  of  the  DF  regression  is  not  “white 
noise,”  lags  of  the  dependent  variable  may  be 
added  to  the  right-hand  side.  The  result  is  an  aug¬ 
mented  Dickey-Fuller  test,  and  again,  special  t 
distributions  must  be  used.  Fike  other  unit-root 
tests,  the  DF  has  been  criticized  for  having  low 
power,  thus,  raising  the  possibility  of  incorrectly 
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failing  to  reject  the  null  hypothesis  of  nonstation- 
arity  (a  Type  II  error). 

Since  the  1980s,  it  has  become  standard  prac¬ 
tice  for  analysts  who  conclude  that  variables  of 
interest  are  nonstationary  to  first-difference  these 
variables  prior  to  doing  regression  analyses,  that 
is,  to  compute  Zt  —  Yt  —  Yt_v  If  subsequent  diag¬ 
nostics  suggest  that  the  differenced  series  remains 
nonstationary,  a  second  difference  is  taken,  that  is, 
Wt  =  Zt  —  Zt_v  If  we  assume  that  the  data-generating 
process  is  characterized  by  a  stochastic  trend,  such 
differencing  will  engender  stationarity. 

Analyses  of  models  using  differenced  variables 
will  not  be  subject  to  the  spurious  regressions 
threat.  However,  by  virtue  of  being  specified  with 
differenced  variables,  these  models  address  only 
the  short-term  effects  of  various  predictor  vari¬ 
ables.  Any  long-run  relationships  will  be  ignored. 
Since  such  relationships  can  be  of  considerable 
theoretical  interest,  a  methodology  for  incorporat¬ 
ing  them  in  the  analysis  of  nonstationary  data  is 
very  useful.  Such  a  methodology  was  provided  in 
1987  by  Robert  Engle  and  Clive  Granger,  who 
developed  the  concept  of  co-integration  and  dem¬ 
onstrated  its  relationship  with  error  correction 
models  in  the  context  of  applied  time-series  analy¬ 
sis.  (Engle  and  Granger  were  awarded  the  Nobel 
Prize  in  Economics  for  this  work  in  2003.)  They 
showed  that  it  is  possible  for  a  linear  combination 
of  two  nonstationary  variables  to  be  stationary.  If 
this  is  the  case,  then,  the  variables  are  said  to  be 
co-integrated.  Engle  and  Granger’s  proposed  test 
for  co-integration  is  simple — regress  Yt  on  Xt  in 
their  original-level  form,  and  then,  test  the  residu¬ 
als  for  nonstationarity.  (Again  the  requisite  t  distri¬ 
bution  for  a  DF  test  on  these  residuals  is  nonstan¬ 
dard.)  If  the  residuals  are  stationary,  one  concludes 
that  the  two  series  are  co-integrated.  If  Y  and  X  are 
co-integrated,  Engle  and  Granger  show  that  it  is 
possible  to  model  both  the  short-  and  long-run 
effects  of  X  on  Y  via  an  error  correction  specifica¬ 
tion.  This  error  correction  model  (an  autoregres¬ 
sive  distributed  lag  model  with  restrictions)  is 

(1  -  L)Yt  =  (30  +  (3,(1  -  L)Xt 

+  a(Y  ~  C,X)H  +  E„  (6) 

where  C,  is  the  correction  term. 

In  Equation  6,  the  short-run  effect  of  X  operat¬ 
ing  at  time  t  on  Yt  is  captured  by  the  (3,  coefficient. 


The  long-run  effect  is  captured  by  a,  with 
(Y  —  C,X)  operating  with  a  one-period  lag  consti¬ 
tuting  the  error  correction  mechanism.  If  such  a 
mechanism  is  operating,  the  expectation  is  that  a 
will  have  an  absolute  value  less  than  1.0  and  carry 
a  negative  sign.  For  example,  an  estimated  a  of 
—  .5  implies  that,  ceteris  paribus,  any  shock  to  the 
system  that  disturbs  the  long-run  dynamic  equilib¬ 
rium  between  Y  and  X  will  be  eroded  at  a  rate  of 
50%  in  all  subsequent  time  periods.  Engle  and 
Granger  suggested  that  (Y  —  C,X)  could  be  mea¬ 
sured  as  the  residuals  from  the  co-integrating 
regression  of  Y  on  X.  Then,  Equation  6  could  be 
estimated  using  OLS.  However,  this  two-step 
approach  to  estimation  is  not  necessary,  and  equa¬ 
tions  such  as  Equation  6  can  be  estimated  in  one 
step  using  nonlinear  least  squares  or  maximum 
likelihood  procedures.  If  one  is  not  interested  in  a 
standard  error  for  C„  OLS  can  be  used. 

In  contrast  to  the  methods  described  above,  Box 
and  Jenkins  in  1976  offered  a  class  of  ARIMA 
(autoregressive,  integrated,  moving  average)  mod¬ 
els  designed  to  simplify  and  improve  forecasting. 
Reacting  to  the  perceived  theoretical  inadequacies 
and  empirical  failures  of  the  traditional  Cowles 
Commission  approach  to  forecasting,  which 
involved  the  specification  and  estimation  of  large 
multi-equation  models,  Box  and  Jenkins  proposed 
that  the  future  values  of  a  time  series  could  be 
forecast  using  only  its  own  history.  As  illustrated 
in  Equation  7,  the  resulting  ARIMA  models  had 
autoregressive  and  moving-average  components, 
with  the  variable  of  interest  ( Yt)  being  differenced 
one  or  more  times  if  necessary  to  achieve  stationar¬ 
ity.  A  variable  that  becomes  stationary  with  one 
difference  is  said  to  be  integrated  of  order  1  (i.e., 
7(1)).  Using  Box  and  Jenkins’s  (p,  d,  q)  notation, 
the  model  in  Equation  7  is  a  (1,  1,  1)  model;  it  has 
one  difference  to  produce  stationarity,  one  autore¬ 
gressive  term  [(1  —  cpLjY^,],  and  one  moving- 
average  term  (Oe^,).  The  parameters  9  and  0  are 
estimated  from  the  data: 

(1  -  L)Yt  =  9(1  -  T)Yf_, 

+  et  +  08f-i-  C7) 

Since  univariate  ARIMA  models  are  theoreti¬ 
cal  forecasting  tools,  the  data  are  used  as  a  guide 
to  model  identification  (specification).  In  this 
regard,  Box  and  Jenkins  showed  analytically  that 
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autoregressive  and  moving-average  models  have 
characteristic  patterns  in  their  autocorrelation  and 
partial  autocorrelation  functions  that  could  be 
used  as  specification  guides.  In  addition,  they  sug¬ 
gested  diagnostic  procedures  for  assessing  the 
quality  of  ARIMA  models.  These  include  signifi¬ 
cance  tests  for  model  parameters,  parameter  values 
consistent  with  the  assumption  that  the  (differ¬ 
enced)  series  is  stationary,  and  the  presence  of 
white-noise  residuals  from  a  series  filtered  with  an 
estimated  ARIMA  model;  the  latter  condition  was 
typically  assessed  via  Ljung-Box  or  Box-Pierce 
portmanteau  tests.  Ex  post  forecasting  perfor¬ 
mance  using  a  portion  of  currently  available  data 
is  another  important  diagnostic. 

ARIMA  models  may  be  extended  to  include 
explanatory  variables.  These  may  be  a  mixture  of 
continuous  variables  (Xs;  e.g.,  inflation  and  unem¬ 
ployment  rates)  and  interventions  (Is;  e.g.,  a 
national  election,  a  terrorist  attack),  with  the  latter 
typically  being  measured  using  dummy  variables  0 
and  1  (see  Equation  8).  Depending  on  how  these 
dummy  variables  are  coded,  interventions  may  be 
specified  as  having  permanent  or  temporary  effects. 
The  impact  of  any  explanatory  variable  may  begin 
contemporaneously  or  with  a  delay  of  1  through  k 
periods.  Also,  unlike  garden-variety  time-series 
regression  models  with  lagged  endogenous  vari¬ 
ables,  multivariate  ARIMA  models  may  specify 
different  dynamics  for  the  effects  of  different  pre¬ 
dictors.  For  example,  in  Equation  8,  the  immediate 
(time  t)  impact  of  I  is  captured  by  aq,  and  the 
lagged  effects  (growth  or  decay)  are  captured  by 
8r  Lagged  effects  of  X  are  estimated  with  8,, 
which  need  not  have  the  same  value  as  8,. 
Depending  on  the  available  degrees  of  freedom, 
several  8s  may  be  specified  in  a  single  multivariate 
ARIMA  model: 

(1  -  L)Yt  =  cp(l  -  L)Yt_1  +  Ml/(1  -  SjLKl  -  L) 
It-k  +  co/(l  -  82L)(1  -  L)Xt_k 

+  £t  +  0fh-i.  (8) 

Recent  research  on  long-memoried  processes 
has  produced  a  class  of  autoregressive,  fractionally 
integrated,  moving  average  (ARFIMA)  models  that 
generalize  the  integration  concept  in  traditional 
ARIMA  models.  As  Janet  Box-Steffensmeier  and 
Renee  Smith  argue,  time-series  variables  are  not 
simply  stationary  or  nonstationary  in  a  knife-edged 


sense;  rather,  they  can  be  thought  of  as  having 
degrees  of  integration.  Thus,  the  d  term  in  the  Box- 
Jenkins  p,  d,  q  models  is  no  longer  an  integer  with 
values  0  if  a  variable  is  stationary  or  1  if  it  needs  to 
be  differenced  once  to  make  it  stationary.  In 
ARFIMA  models,  the  values  of  d  lie  along  a  con¬ 
tinuum  from  —0.5  to  1.0,  with  values  equal  to  or 
greater  than  0.5  and  less  than  1.0,  indicating  that 
the  variable  is  nonstationary  but  ultimately  mean 
reverting. 

The  d  parameter  and  its  associated  standard 
error,  together  with  any  specified  autoregressive 
and  moving-average  parameters,  can  be  estimated 
from  the  data  using  maximum  likelihood  or  other 
procedures.  An  example  of  such  an  ARFIMA 
model  is  Equation  9.  Lacking  any  autoregressive 
or  moving-average  terms,  this  model  is  called 
“pure  fractional  noise.”  As  with  conventional 
ARIMA  models,  explanatory  variables  can  be 
included  in  ARFIMA  specifications.  A  recent 
extension  of  these  models  involves  the  develop¬ 
ment  of  the  concepts  of  fractional  co-integration 
and  fractional  error  correction: 

(1  -  L)%  =  zt.  (9) 

The  desire  for  simple,  easily  estimated,  accurate 
forecasting  models  did  not  abate  with  the  develop¬ 
ment  of  ARIMA  models.  In  1980,  the  econometri¬ 
cian  Christopher  Sims  offered  a  penetrating 
critique  of  conventional  multi-equation  forecasting 
models  and  suggested  a  radical  alternative.  Sims 
argued  that  many  of  the  parameter  restrictions 
required  to  achieve  identification  in  multi-equation 
models  (with  endogenous  regressors)  were  simply 
“incredible.”  Researchers  imposed  these  restric¬ 
tions  not  because  they  were  justified  by  theory  or 
evidence  but  rather  because  they  were  necessary  to 
avoid  simultaneity  biases.  Given  this  ad  hoc 
nature,  the  poor  forecasting  performance  of  such 
models  was  hardly  surprising. 

Sims  did  not  claim  that  econometricians  were 
radically  ignorant  of  their  subject  matter.  Rather, 
he  asserted  that  researchers  could  use  theory  and 
experience  to  assemble  sets  of  variables  that  were 
dynamically  interrelated  in  possibly  highly  com¬ 
plex  ways.  Researchers  wishing  to  forecast  could 
“round  up  the  usual  suspects,”  even  if  that  com¬ 
plexity  kept  them  from  developing  credible  identi¬ 
fying  restrictions  needed  for  the  estimation  of 
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structural  parameters.  But  all  was  not  lost.  Sims 
proposed  that  forecasters  use  these  sets  of  vari¬ 
ables  to  develop  simple  multi-equation  autoregres¬ 
sive  models  comprising  unrestricted  reduced  forms. 
Each  variable  of  interest  was  a  function  of  one  or 
more  lagged  values  of  itself  and  one  or  more 
lagged  values  of  all  other  variables  in  the  system. 
Additional  variables  capturing  important  exoge¬ 
nous  shocks  could  be  added  if  desired.  The  resulting 
set  of  equations  is  called  a  vector  autoregression  or 
VAR.  In  Sims’s  original  formulation,  parameters  in 
VARs  could  be  estimated  simply — via  OLS  or  SUR 
techniques.  Equations  10a  and  10b  are  an  example 
of  a  simple  two-variable  VAR  with  a  single  lag  on 
both  variables  in  the  system: 

Y(  =  A0  +  AxYt_x  +  BlXt_1  +  e,  (10a) 
xt  =  c0  +  ClXt_1  +  DxYt_x  +  ut.  (10b) 

Although  the  simplicity  of  these  VARs  was 
attractive,  researchers  soon  attempted  to  improve 
their  forecasting  performance  by  imposing  restric¬ 
tions  on  various  parameters  in  a  system.  Later, 
analysts  attracted  by  the  ideas  of  co-integration 
and  error  correction  developed  vector  error  correc¬ 
tion  models  that  incorporated  hypothesized  long- 
run  relationships  explicitly  into  a  VAR  setup. 
Others,  wishing  to  extract  explanatory  power  from 
VARs,  respecified  them  as  moving-average  repre¬ 
sentations  of  a  system  of  interest.  By  so  doing,  they 
could  trace  the  dynamic  effects  of  shocks  to  a  vari¬ 
able  on  other  variables  in  that  system.  These  analy¬ 
ses  are  not  entirely  theory-free;  rather,  general 
hypotheses  about  the  flow  of  causality  are  required 
to  order  the  variables  in  a  system  and,  hence,  how 
shocks  are  transmitted  through  it.  Still  others  have 
leveraged  recent  developments  in  applied  Bayesian 
statistics  to  specify  and  estimate  VARs. 

Another  important  methodological  advance 
enables  researchers  to  study  the  evolution  of  the 
variance  of  a  time-series  variable.  These  autore¬ 
gressive  conditional  heteroskedasticity  (ARCH) 
models  were  pioneered  by  Engle  in  1982,  with  a 
subsequent  generalization  by  producing  the  gener¬ 
alized  ARCH  (GARCH)  model  that  closely  resem¬ 
bles  traditional  ARIMA  models.  Equations  11a 
and  lib  are  an  example.  Here,  the  error  process 
(e()  for  Yt  in  Equation  11a  has  a  hetereoskedastic 
variance  that  evolves  according  to  Equation  lib. 


The  scaling  parameter  for  this  conditional  vari¬ 
ance,  ht,  is  driven  by  its  own  past  value  as  well  as 
the  size  of  the  variance  at  t  —  1,  that  is,  e2(_x: 

Yf  =  P0  +  +  et,  (11a) 

ht  =  \ o  +  Xiht-i  +  MVr  (Hb) 

Econometricians  have  extended  GARCH  mod¬ 
els  to  incorporate  ideas  such  as  asymmetry,  nonlin¬ 
earity,  integration,  and  fractional  integration. 
GARCH  models  also  may  be  specified  such  that 
the  conditional  variance  influences  the  evolution 
of  the  mean  of  a  process,  and  explanatory  vari¬ 
ables  may  be  included  in  the  equation  for  ht  to  help 
account  for  how  it  evolves. 

To  date,  the  vast  majority  of  applications  of 
GARCH  models  have  been  in  the  field  of  finance, 
where  investigators  are  keenly  interested  in  mea¬ 
suring  risks  associated  with  holding  an  asset  at  any 
time  t.  Conditional  heteroskedasticity,  tapping 
overtime  variation  in  the  size  of  shocks  to  a  system 
and,  hence,  volatility,  is  a  natural  way  of  calibrat¬ 
ing  such  risks  in  empirical  analysis.  Other  applica¬ 
tions  are  possible.  For  example,  in  2009,  Harold 
Clarke,  David  Sanders,  Marianne  Stewart,  and 
Paul  Whiteley  used  GARCH  models  to  study  the 
development  of  consensus  in  public  opinion  regard¬ 
ing  the  performance  of  British  Prime  Minister 
Tony  Blair.  A  key  explanatory  variable  in  the 
model  of  the  GARCH  process  for  the  conditional 
variance  of  Blair’s  performance  evaluations  is  a 
measure  of  civilian  casualties  in  the  Iraq  War.  This 
application  of  GARCH  modeling  illustrates  how 
advanced  time-series  methods  can  be  used  in 
empirical  analyses  of  phenomena  of  long-standing 
interest  to  political  scientists. 

Harold  D.  Clarke 
University  of  Texas  at  Dallas 
Richardson,  Texas,  United  States 
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Time-Series  Cross-Section 
Data  and  Methods 


The  use  of  pooled  time-series  cross-section  (PTSCS) 
data  in  quantitative  political  analysis  has  increased 
immensely  over  the  last  two  decades.  Pooled  data 
analysis  has  become  the  standard  especially  in 
subdisciplines  such  as  international  relations, 
comparative  politics,  and  comparative  political 
economy.  However,  fields  that  use  microdata, 
such  as  political  behavior  or  American  politics, 
also  increasingly  use  PTSCS  data  due  to  the  greater 
availability  of  survey  data  over  time.  Panel  data 
pool  cross-sectional  information  (number  of  units 
N)  with  information  over  time  (number  of  time 
points  T),  for  example,  data  on  individuals  or 
firms  at  different  points  in  time,  information  on 
countries  and  regions  over  time,  and  so  on.  Thus, 
panel  data  consist  of  repeated  observations  on  a 
number  of  units.  We  can  distinguish  between 
cross-sectional  dominant  data  (cross-section  time- 
series,  CSTS),  time-series  dominant  data  (time- 
series  cross  section,  TSCS),  and  pooled  data  with  a 
fixed  number  of  units  and  time  points.  The  data 
structure  has  implications  for  the  model  choice 


since  asymptotic  properties  of  estimators  for  pooled 
data  are  either  derived  for  N  — >  °°  or  T  ->  °°.  In 
addition,  violations  of  full  ideal  conditions  and 
specification  issues  have  more  or  less  severe  effects 
for  bias  and  efficiency  depending  on  whether  the 
number  of  units  exceeds  the  number  of  observa¬ 
tions  over  time  or  vice  versa.  In  what  follows,  we 
discuss  the  respective  strengths  and  weaknesses  of 
this  method  and  various  ways  by  which  we  can 
cope  with  some  of  the  inherent  problems. 

Some  have  argued  that  TSCS  and  CSTS  data 
consist  of  observations  at  different  points  in  time 
for  fixed  units  of  theoretical  interest,  such  as  coun¬ 
tries  or  dyads,  whereas  in  panel  data,  the  units, 
mostly  individuals  in  surveys,  are  of  no  specific 
interest  and  are  randomly  sampled  from  an  under¬ 
lying  population  with  all  inferences  dedicated  to 
uncovering  the  relationships  in  the  population. 
Textbooks  and  articles,  however,  use  these  terms 
quite  loosely.  This  entry  follows  this  trend  and  dis¬ 
cusses  general  estimation  procedures  and  specifica¬ 
tion  issues  with  respect  to  different  kinds  of  data 
pooling  cross-sectional  and  time-series  information. 

Advantages  and  Disadvantages 
of  PTSCS  Data  Analysis 

Panel  data  pool  observations  for  units  (?)  and  time 
periods  (t).  The  typical  data-generating  process 
can  be  characterized  as 

K 

y„  =  OL  +  £  fikxkil  +  zu,  i  =  1, . . .  ,N; 

k=\ 

t=l,...,T,  (1) 

with  k  independent  variables  x,  which  have  obser¬ 
vations  for  N  units  (?)  and  T  periods  (t).  The 
dependent  variable  y  is  continuous  (though  in 
principle,  it  can  be  limited  dependent,  which 
requires  nonlinear  estimation  procedures)  and  also 
observed  for  i  and  t.  zit  describes  the  error  term  for 
observations  ?  and  t  and  we  can  assume  an  NT  X 
NT  variance-covariance  matrix  Q  of  the  error 
term  with  the  typical  element  E(eit,ejS).  In  case  all 
Gauss-Markov  assumptions  are  met  (the  error 
term  is  iid ),  this  model  can  be  straightforwardly 
estimated  by  ordinary  least  squares  (OLS).  Because 
PTSCS  data  combine  time-series  and  cross-section 
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information,  this  is  rarely  the  case.  However,  the 
analysis  of  PTSCS  data  offers  significant  advan¬ 
tages  over  the  analysis  of  pure  time-series  or  pure 
cross-sectional  data.  First,  using  pooled  data 
increases  the  number  of  observations  and,  there¬ 
fore,  the  degrees  of  freedom,  which  allow  us  to  test 
more  complex  arguments  by  employing  more  com¬ 
plex  estimation  procedures.  More  important,  most 
theories  in  the  social  sciences  generate  predictions 
over  space  and  time,  and  it  seems,  therefore,  indis¬ 
pensable  to  test  these  hypotheses  by  using  data 
providing  repeated  information  for  theoretically 
interesting  units.  PTSCS  data  analysis  allows  the 
modeling  of  dynamics,  which  is  impossible  when 
pure  cross  sections  are  examined,  which  may  lead  to 
spurious  regression  results.  Finally,  analyzing  pooled 
data  allows  controlling  for  unit  heterogeneity 
beyond  the  inclusion  of  additional  right-hand-side 
(RHS)  variables.  Accordingly,  pooled  data  can  be 
used  to  get  rid  of  some  kinds  of  omitted-variable 
bias,  make  the  best  of  the  available  information,  test 
theories  that  predict  changes,  and  test  theories  that 
predict  parameter  heterogeneity. 

The  most  prominent  disadvantage  of  panel  data 
analysis  lies  in  the  fact  that  an  econometrically 
sound  model  specification  is  typically  hard  to  find 
since  the  data  structure  not  only  combines  all  the 
problems  of  cross-sectional  and  time-series  data, 
but  also  these  problems  typically  arise  simultane¬ 
ously.  Specification  problems  in  pooled  data  anal¬ 
ysis  can  be  summarized  as  follows: 

•  The  residuals  are  typically  serially  correlated  and 
not  independent  of  each  other. 

•  The  residuals  have  different  variances  for 
different  units  (panel  heteroskedasticity). 

•  The  residuals  of  different  units  are 
contemporaneously  correlated. 

•  The  residual  of  unit  i  covaries  with  residuals  of 
unit  j  for  different  points  in  time. 

•  The  expected  mean  of  the  error  term  deviates 
from  zero  for  different  units. 

While  each  single  violation  of  the  underlying 
model  assumptions  is  often  straightforwardly 
accounted  for  by  existing  econometric  measures, 
combinations  of  problems  might  not  be  solved 
simultaneously  in  a  satisfying  manner.  Econometric 
solutions  are  often  incompatible  with  theories. 
Sometimes,  it  is  hard  to  find  models  that  at  the 


very  same  time  are  econometrically  sound  (unbi¬ 
ased,  efficient)  and  provide  an  appropriate  test  of 
the  theory.  Weighing  the  advantages  and  disad¬ 
vantages  of  pooled  data  analysis,  the  positive 
aspects  certainly  prevail,  especially  because  the 
analysis  of  pooled  data  allows  the  testing  of  com¬ 
plex  arguments  over  space  and  time,  which  are 
characteristic  for  the  social  sciences.  From  this 
perspective,  the  steep  increase  in  the  popularity  of 
panel  data  analysis  does  not  seem  to  be  surprising. 

Heteroskedasticity  and  Contemporaneous 
Error  Correlation  in  Panel  Data 

Heteroskedasticity  in  pooled  data  presents  a  more 
complex  problem  than  in  pure  cross  sections  since 
(a)  the  error  term  can  have  unit-specific  variances 
(panel  heteroskedasticity),  (b)  the  error  term  can  be 
contemporaneously  correlated,  that  is,  the  error 
term  of  unit  i  is  correlated  to  that  of  unit  j  in  the 
same  year,  and  (c)  the  error  term  of  one  unit  i  can 
be  correlated  with  the  error  term  of  unit  j  at  differ¬ 
ent  points  in  time.  In  addition,  the  error  term  can 
have  time-dependent  error  variances  (autoregressive 
conditional  heteroskedasticity).  Panel  heteroske¬ 
dasticity  mainly  occurs  if  the  model  specification 
fits  different  units  differently  well.  Correlations  of 
the  errors  across  units  are  determined  by  unob¬ 
served  features  of  one  unit  that  are  linked  to 
another  unit.  Both  features  violate  Gauss-Markov 
assumptions:  While  they  leave  the  simple  estima¬ 
tors  consistent,  such  estimators  are  now  inefficient, 
and  standard  errors  may  be  incorrect.  More  impor¬ 
tant,  both  heteroskedasticity  and  error  correlation 
often  signal  omitted-variable  bias  since  in  both 
cases  something  that  should  have  been  included 
into  the  structural  part  of  the  equation  was  left  out. 

This  problem  can  be  solved  in  a  substantive  way 
by  identifying  the  causes  of  the  omitted-variable 
bias  and  including  these  variables  into  the  RHS  of 
the  models.  Often,  this  approach  is  not  feasible 
since  the  sources  for  heteroskedasticity  are 
unknown,  or  excluded  factors  cannot  be  mea¬ 
sured.  In  this  case,  several  econometric  solutions 
have  been  proposed. 

A  first  approach  brought  forward  by  R.  W. 
Parks  and  Jan  Kmenta  is  a  feasible  generalized  least 
squares  (FGLS)  estimation,  which  is  characterized 
by  an  NT  X  NT  block  diagonal  matrix  with  an 
N  X  N  matrix  X  that  contains  contemporaneous 
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covariances  along  the  block  diagonal.  Parks  and 
Kmenta  also  suggest  an  ft  matrix  with  a  panel- 
specific  first-order  autoregressive,  AR(1),  error 
structure  and  contemporaneously  correlated  errors, 
but  in  principle,  FGLS  can  handle  different  correla¬ 
tion  structures. 

Because  the  true  structure  of  X  and  ft  are 
unknown,  this  procedure  requires  the  estimation 
of  a  very  large  number  of  parameters  to  obtain  the 
error  covariances,  which  in  turn  leads  to  very  inef¬ 
ficient  and  therefore  unreliable  results.  Nathaniel 
Beck  and  Jonathan  Katz  show  that  the  Parks 
method  highly  underestimates  standard  errors  and 
therefore  induces  overconfidence  in  estimation 
results.  As  a  result,  this  estimation  procedure  is 
very  rarely  used  in  recent  work  using  pooled  data. 
Beck  and  Katz  suggest  a  different  way  of  dealing 
with  panel  heteroskedasticity.  They  argue  that 
coefficient  estimates  of  OLS  are  consistent  but 
inefficient  in  pooled  data  and  that  the  degree  of 
inefficiency  depends  on  the  data  and  the  exact 
error  process.  They  suggest  using  OLS  and  correct¬ 
ing  the  estimated  standard  errors  by  taking  the 
specific  panel  structure  of  the  data  into  account: 

Var[p]  -  (X'X)“1X'ftX(X'X)“1,  (2) 

with 

ft  =  (E'E/T)  <8>  IT.  (3) 

This  is  called  the  panel-corrected  standard  errors 
method.  Other  violations  of  Gauss-Markov 
assumptions,  such  as  serial  correlation  of  the  error 
term,  have  to  be  treated  beforehand.  Since  this 
approach  manipulates  only  the  standard  errors  of 
an  OLS  model,  the  coefficients  are  biased  when¬ 
ever  OLS  is  biased. 

Dynamics  and  Serial  Correlation 

As  pooled  data  combine  information  across  units 
and  over  time,  another  problem  arises  if  the  error 
term  is  serially  correlated.  The  error  term  in  t  is 
dependent  on  the  error  term  in  t  —  1: 

s  it  =  PT,f-i  +  in-  (4) 

Again,  violating  the  independence  assumption 
econometrically  only  influences  the  efficiency  of 


the  estimation.  Yet  since  the  residual  of  a  regres¬ 
sion  model  picks  up  the  influences  of  those  vari¬ 
ables  that  have  not  been  included,  persistence  in 
excluded  variables  is  the  most  frequent  cause  of 
serial  correlation.  Several  remedies  for  serial  cor¬ 
relation  are  available,  all  of  which  have  different 
consequences  for  the  model  specification  and 
interpretation  of  the  estimation  results. 

A  substantive  solution  to  the  problem  of  serial 
correlation  is  the  inclusion  of  a  lagged  dependent 
variable  (LDV)  yit-1  to  the  RHS  of  the  regression 
equation.  In  many  cases,  this  is  enough  to  eliminate 
serially  correlated  error  terms.  However,  there  are 
also  many  perils  of  adding  an  LDV  to  the  list  of 
regressors.  One  of  the  main  problems  arises  because 
the  inclusion  of  an  LDV  makes  it  difficult  to  inter¬ 
pret  effects  of  the  substantial  RHS  variables  directly 
and  correctly  since  the  conditional  effect  of  x  on  y 
is  dynamic  and  aggregated  over  all  periods.  It  can 
be  described  by  the  following  polynomial: 

y(x)h^tp  =  Pi xit  +  X  (Po^Pi**)-  (5) 

p= i 

Unfortunately,  the  standard  errors  of  the  function 
in  Equation  5  cannot  be  easily  calculated. 

Since  including  an  LDV  resembles  a  shortened 
distributed  lag  model,  we  implicitly  assume  that  all 
variables  exert  an  equally  strong  one-period  lagged 
impact  on  the  dependent  variable.  Therefore,  find¬ 
ing  a  nonsignificant  coefficient  of  a  theoretically 
interesting  explanatory  variable  in  an  LDV  model 
does  not  necessarily  mean  that  this  variable  has  no 
effect;  it  only  tells  us  that  this  variable  does  not 
affect  the  dependent  variable  contemporane¬ 
ously — it  might  still  have  a  lagged  effect.  From 
this,  it  follows  that  the  coefficient  of  the  LDV  esti¬ 
mates  at  best  the  average  dynamic  effect  of  all 
substantive  RHS  variables  rather  than  the  actual 
dynamic  effect  of  each  explanatory  variable. 

Another  (lesser)  problem  occurs  when  combin¬ 
ing  an  LDV  with  the  estimation  of  unit-specific 
effects  by  a  fixed-effects  (FE)  specification  or  a 
least  squares  dummy  variable  (LSDV)  model  (see 
the  next  section  for  a  more  detailed  description). 
This  leads  to  biased  estimates  since  the  LDV  cova¬ 
ries  with  the  time-invariant  part  of  the  error  term. 
This  problem  is  called  Nickell  bias.  The  best 
known  suggestions  tackling  the  problem  of  Nickell 
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bias  are  the  instrumental  variable  approach  by 
Theodore  Anderson  and  Cheng  Hsiao  (AH),  the 
differenced  GMM  (generalized  methods  of 
moments)  model  by  Manuel  Arellano  and  Stephen 
Bond  (AB),  and  the  Kiviet  correction  (by  Jan 
Kiviet),  which  proposes  a  corrected  within  estima¬ 
tor  that  subtracts  a  consistent  estimate  of  the  bias 
from  the  original  FE  estimator.  The  first  two 
approaches  solve  the  bias  problem  by  first  differ¬ 
encing  both  sides  of  the  regression  equation  and 
instrumentation  of  the  LDV  with  higher  order  lags 
of  the  LDV.  Thereby,  AH  only  uses  the  two-period 
lagged  LDV  as  an  instrument  while  AB  allows  the 
use  of  all  possible  lags  of  the  LDV  and  all  exoge¬ 
nous  variables  in  the  model.  Both  approaches  gen¬ 
erate  asymptotically  consistent  estimation  results, 
whereby  AB  produces  more  efficient  estimates  due 
to  the  exploitation  of  all  moment  conditions.  In 
finite  samples,  however,  both  estimators  are  prob¬ 
lematic  with  regard  to  efficiency  as  Monte  Carlo 
experiments  examining  the  finite  sample  properties 
reveal.  Higher  lags  of  the  LDV  provide  good  instru¬ 
ments  only  in  the  case  when  y  is  highly  persistent 
over  time.  Unfortunately,  in  such  a  case,  the  prob¬ 
ability  that  the  instruments  also  covary  with  the 
error  term  remains  high.  Lrom  this  perspective, 
both  estimators  cannot  solve  the  problem  of  Nickell 
bias  if  y  is  highly  persistent,  or  they  solve  the  prob¬ 
lem  very  inefficiently  in  the  case  of  low  persistence. 

A  Prais-Winsten  (PW)  transformation  of  the 
model  offers  another  solution  to  serial  correlation. 
The  advantage  of  the  PW  approach  consists  in  the 
transformation  of  both  the  LHS  and  the  RHS  of 
the  equation,  which  allows  a  direct  interpretation  of 
the  regression  coefficients.  PW  is  estimated  by  GLS 
and  is  derived  from  the  AR(1)  model  for  the  error 
term.  Lirst,  a  standard  linear  regression  is  estimated: 

Jit  =  Wb  +  £,(•  (6) 

An  estimate  of  the  correlation  in  the  residuals  is 
then  obtained  by  the  following  auxiliary  regression: 

e«t  =  PCf-i  +  ir  ( 7 ) 

A  Cochrane-Orcutt  transformation  is  applied  for 
observations  t  =  2,  .  .  .  ,  n: 

yu  ~  py,t-\  =  P(*,t  p vy— i )  +  s a-  (8) 


And  the  transformation  for  t  =  1  is  as  follows: 

]/l-  P2U  =  pfV1  -  P2M^  +  ]/l  -  p2Si.  (9) 

Equation  9  shows  that  another  advantage  of  the 
PW  transformation  is  the  preservation  of  the  first 
period.  The  differences  between  a  PW  and  an  LDV 
model  might  be  substantial  depending  on  p  and  the 
serial  correlation  in  both  y  and  x. 

Heterogeneity 

One  of  the  advantages  of  analyzing  pooled  data  is 
the  possibility  of  controlling  for  heterogeneity 
across  units.  When  examining  cross-sectional  data, 
it  is  impossible  to  tell  whether  the  estimated  effects 
are  contingent  on  unobserved  effects  that  are  spe¬ 
cific  to  each  unit  and,  therefore,  biased.  PTSCS 
data  analysis  rests  on  the  assumption  that  units  are 
similar  enough  to  be  pooled  together.  If  that  were 
not  the  case,  we  could  still  find  appropriate  speci¬ 
fications  that  allow  accounting  for  differences 
across  units  that  might  influence  the  estimation 
results.  Textbooks  discuss  this  problem  usually 
under  the  header  “unit-heterogeneity”  and  offer 
remedies  such  as  LE  or  random-effects  (RE)  mod¬ 
els.  However,  these  models  only  deal  with  time- 
invariant  unit-specific  effects,  but  units  can  also  be 
heterogeneous  with  respect  to  slope  parameters, 
dynamics,  or  lag  structures.  The  next  sections  dis¬ 
cuss  different  versions  of  unit  heterogeneity. 

Unit  Heterogeneity 

When  units  have  specific  characteristics  that 
cannot  be  measured  and  are  time  invariant,  they 
offer  different  initial  conditions  that  might  influ¬ 
ence  the  regression  model.  Especially,  if  these  time- 
invariant  unit-specific  effects  are  correlated  with 
any  of  the  RHS  variables,  coefficient  estimates  are 
distorted  by  omitted-variable  bias.  If  that  is  the 
case  and  we  do  not  control  for  unit-specific  effects, 
the  Gauss-Markov  assumption  of  x  being  deter¬ 
ministic  is  violated: 

K  M 

Jit  =  a  +  £  PkXkit  +  Z  y mZmi  +  U;  +  £*,  (10) 

k=l  m= 1 
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where  u,  denotes  the  unit-specific  effects  and  z  the 
other  explanatory  variables  that  are  time  invariant 
but  can  be  measured  and  are  of  theoretical  interest. 
If  u,  is  excluded  from  the  estimation,  it  becomes 
part  of  the  overall  error  term  and  will  make  the 
model  less  efficient  in  the  case  when  it  does  not 
covary  with  any  of  the  x  or  z  but  will  induce  bias 
if  it  is  correlated  with  any  of  the  regressors. 
Econometrically,  we  can  solve  for  correlated  unit- 
specific  effects  by  including  a  dummy  variable  for 
each  unit  into  the  RHS  of  the  model,  which  gener¬ 
ates  unit-specific  intercepts.  This  estimation  proce¬ 
dure  is  called  the  LSDV  model: 

K 

y,t  =  a  +  Pi  X  xku  +  y„-iDj  +  e,f.  (11) 
k=i 

The  unit-specific  dummy  variables  (D,)  are  multi- 
collinear  to  any  time-invariant  variable  z;  the  coef¬ 
ficients  for  z  are  therefore  not  identified.  We  can 
also  employ  a  so-called  FE  specification,  which  is 
econometrically  equivalent  to  an  LSDV  model. 
The  FE  model  first  de-means  all  variables  in  the 
model  and  then  estimates  the  transformed  equa¬ 
tion  by  OLS: 

K 

y,t  ~  y,  =  Pi  X  (***  -  xki)  +  eit  -  e,  +  Uj  -  Uj 

‘-1  (12) 

K 

=  y,i  =  IV  X  %ku  +  Ft- 

k=l 

The  FE  transformation  eliminates  not  only  the 
unit-specific  effects  but  also  the  time-invariant 
variables  that  might  be  of  theoretical  interest.  FE 
can  become  highly  inefficient  because  it  only  uses 
the  within  information  of  all  variables.  Yet  not 
controlling  for  unit-specific  effects  leads  to  biased 
estimates  if  unit  effects  exist  and  are  correlated 
with  any  of  the  regressors. 

If  unit-specific  effects  do  exist  but  do  not  covary 
with  any  of  the  RHS  variables,  not  controlling  for 
unit  effects  does  not  bias  the  estimates  but  increases 
the  sampling  variation  of  the  OLS  estimator  and 
therefore  generates  less  efficient  results.  A  straightfor¬ 
ward  remedy  is  an  RE  specification,  which  treats  the 
Uj  as  a  random  unit  specific  part  of  the  error  term. 
The  RE  model  only  quasi-de-means  the  variables: 
Rather  than  removing  the  time  average  from  the 
explanatory  and  dependent  variables  at  each  t,  RE 


removes  a  fraction  of  the  time  average.  The  RE 
estimator  generates  more  efficient  results  than  the 
FE  estimator,  but  the  RE  model  produces  biased 
estimates  if  the  RHS  variables  covary  with  the 
unobserved  unit-specific  effects.  RE  resembles  a 
feasible  GLS  estimator  where  the  Q  matrix  (VC 
[variance-covariance  of  the  errors])  matrix  of  the 
error  term)  has  a  specific  RE  structure  that  only 
depends  on  two  parameters:  cT  and  ay .  RE  and  FE 
estimates  tend  to  be  close  if  T  gets  large  or  the  vari¬ 
ance  of  the  estimated  unit  effects  increases  as  com¬ 
pared  to  the  error  variance. 

Since  the  RE  estimator  is  more  efficient  than  its 
FE  counterpart  in  the  case  when  the  unit  effects  are 
uncorrelated  with  the  regressors,  it  is  useful  to 
determine  which  of  the  two  specifications  should 
be  used.  Textbooks  typically  suggest  employing  the 
Hausman  test.  The  Hausman  test  is  based  on  the 
following  logic:  Since  the  RE  estimator  is  biased  if 
unit-specific  effects  are  correlated,  differences 
between  FE  and  RE  estimates  are  interpreted  as 
evidence  against  the  RE  assumption  of  zero  covari¬ 
ance  between  x  and  ut.  Econometricians  confirm 
that  the  Hausman  test  has  good  asymptotic  prop¬ 
erties.  Nevertheless,  in  finite  samples,  the  test 
results  are  influenced  by  the  trade-off  between  bias 
and  efficiency.  The  Hausman  test  is  only  powerful 
in  the  limit:  Since  FE  is  consistent,  the  difference 
between  RE  and  FE  estimates  can  only  be  caused 
by  biased  RE  estimates.  In  finite  samples,  however, 
the  differences  can  result  from  two  sources:  biased 
RE  estimates  and  unreliable  FE  point  estimates  due 
to  inefficient  estimation  of  variables  with  low 
within  variation.  The  Hausman  test  actually  mir¬ 
rors  this  trade-off  since  it  divides  the  difference  of 
RE  and  FE  estimates  by  the  difference  in  the 
asymptotic  variances  of  the  RE  and  FE  estimates. 
From  this,  it  follows  that  the  test  results  are  espe¬ 
cially  unreliable  if  the  estimation  equation  contains 
regressors  that  are  both  correlated  with  the  unit- 
specific  effects  and  rarely  change  over  time. 

Highly  problematic  in  a  FE  specification  is 
the  estimation  of  time-invariant  or  nearly  time- 
invariant  variables.  While  the  problem  of  including 
completely  time-invariant  variables  is  apparent, 
the  estimation  of  rarely  changing  variables  does 
not  seem  as  problematic  since  FE  specifications 
generate  an  estimate.  However,  this  estimate 
might  be  very  inefficient  since  FE  specifications 
eliminate  all  cross-sectional  variation,  and  only 
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the  variance  over  time  is  used  to  compute  the 
coefficient.  If  this  within-unit  variation  is  very 
small,  the  sampling  variation  of  FE  estimates 
increases  drastically,  which  leads  not  only  to 
large  standard  errors  but  also  to  very  unreliable 
point  estimates.  In  the  case  of  time-invariant 
variables,  applied  researchers  often  resort  to  a 
simple  pooled  OLS  or  an  RE  model,  which 
allow  the  estimation  of  coefficients  for  time- 
invariant  variables  that  are  biased  in  case  the 
unit-specific  effects  covary  with  the  regressors. 
Hausman  and  Taylor  suggest  an  estimator  that 
uses  the  uncorrelated  RHS  variables  as  instru¬ 
ments  for  the  correlated  regressors.  If  the  instru¬ 
ments  are  poor,  the  Hausman-Taylor  estimator 
produces  highly  inefficient  parameter  estimates. 
Plumper  and  Troeger  developed  an  estimation 
procedure  combining  the  favorable  characteris¬ 
tics  of  FE  with  the  possibility  of  efficiently  esti¬ 
mating  time-invariant  and  nearly  time-invariant 
variables.  This  procedure  is  called  a  fixed- 
effects  vector  decomposition  (FEVD).  FEVD  is  a 
three-stage  estimation  procedure  that  decom¬ 
poses  the  unit  fixed  effects  into  a  part  that  can 
be  explained  by  the  time-invariant  and  rarely 
changing  variables  and  an  unexplained  part. 
FEVD  generates  estimates  with  smaller  root- 
mean-squared  errors  than  competing  estimators 
(Hausman-Taylor,  FE,  RE,  pooled  OLS)  under 
a  wide  range  of  conditions. 

Parameter  Heterogeneity 

As  compared  with  unobserved  time-invariant 
unit  heterogeneity,  we  observe  parameter  hetero¬ 
geneity  if  the  coefficient  of  an  explanatory  variable 
differs  significantly  across  units  or  over  time.  If 
parameters  change  across  time  or  units,  we  are 
likely  to  deal  with  unobserved  and  therefore 
excluded  interaction  effects,  or  it  could  be  that  we 
have  assumed  the  wrong  functional  form  for  the 
statistical  relationship.  If  the  source  of  parameter 
heterogeneity  is  known  or  our  theoretical  model 
even  predicts  differences  in  parameters  across  units 
or  time  periods,  we  can  straightforwardly  specify 
the  correct  model  by  including  interaction  terms 
between  time  periods  or  groups  of  units  and  the 
specific  RHS  variables. 

In  cases  where  the  source  of  parameter  heteroge¬ 
neity  is  unknown,  seemingly  unrelated  regressions 


(SURs)  or  random-coefficients  models  (RCMs) 
offer  an  econometric  solution  to  the  problem.  SUR 
models  estimate  a  single  regression  for  every  unit 
but  exploiting  the  panel  structure  of  the  data  by 
assuming  a  joint  error  process  for  all  units.  This 
increases  the  efficiency  of  estimation  by  “borrow¬ 
ing  strength.”  SUR  models  only  generate  accept¬ 
able  parameter  estimates  for  long  time  series,  that 
is,  when  T  is  much  larger  than  N.  SUR  models 
employ  a  GLS-type  estimator  for  the  VC  matrix, 
which  weights  the  standard  errors  by  the  cross  sec¬ 
tion-specific  mean-squared  errors. 

The  random-coefficients  estimator  (e.g.,  Beck 
and  Katz)  provides  a  compromise  between  esti¬ 
mating  the  fully  pooled  model  and  a  fully  unpooled 
estimate  (separate  OLS  for  each  unit).  Pooled  OLS 
depends  on  the  stark  assumption  of  unit  homoge¬ 
neity,  whereas  separate  OLS  estimation  for  each 
unit  produces  inefficient  results.  The  RCM  bor¬ 
rows  strength  by  shrinking  each  of  the  individual 
unit  OLS  estimates  back  to  the  overall  (pooled) 
estimate.  It  is,  therefore,  also  a  good  test  for 
“poolability”  of  the  data.  The  RCM  generalizes 
the  RE  estimator  from  the  intercept  to  all  param¬ 
eters  of  interest 

K  M 

y  it  Oi  T  [3 ;  %kit  T  ^  Uy  mil  T  z„ . 

*=1  »= 1  (13) 

(3;  ~  N  ((3,  ct;)  . 

The  RCM  can  be  made  more  useful  by  allowing  the 
(3,  to  be  functions  of  other  unit-specific  variables. 

Heterogeneity  of  Dynamics  and  Lag  Structures 

In  pooled  data,  not  only  do  coefficient  estimates 
vary  across  units  but  also  dynamic  effects.  In  addi¬ 
tion,  different  RHS  variables  might  exert  a  differ¬ 
ently  lagged  impact  on  the  dependent  variable,  and 
the  lag  length  can  differ  across  units.  Different 
dynamics  can  be  straightforwardly  incorporated 
into  an  RCM  or  SUR  models  by  including  an  LDV 
with  unit-specific  coefficients.  PW  specifications 
also  allow  for  unit-specific  autoregressive  pro¬ 
cesses  in  the  error  term. 

Since  statistical  tests  for  heterogeneous  dynamics 
or  lag  structures  are  not  readily  available,  defining 
specific  dynamics  should  be  based  on  theoretical 
grounds.  Unit-specific  dynamics  can  then  be  more 
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directly  modeled  by  making  the  relevant  regressors 
or  the  LDV  interact  with  dummies  for  specific 
groups  of  units.  Since  different  explanatory  vari¬ 
ables  can  have  differently  lagged  effects  across 
units  as  well,  it  is  not  plausible  to  just  vary  the 
estimates  of  the  LDV  because  the  marginal  effect 
of  an  RHS  variable  at  time  t  >  1  partially  depends 
on  the  estimate  for  the  LDV. 

In  summary,  different  kinds  of  unit  heterogene¬ 
ity  do  not  prevent  pooling  of  information  over 
time  and  across  units.  Theoretically,  unit  hetero¬ 
geneity  leads  to  interesting  research  questions  that 
can  be  empirically  analyzed  with  the  appropriate 
model  specification.  The  possibility  of  controlling 
for  unit  heterogeneity  renders  pooled  data  analy¬ 
sis  more  attractive  than  pure  cross-section  or  time- 
series  analysis. 

Conclusion 

This  entry  provides  a  short  overview  of  basic  esti¬ 
mation  procedures  and  specification  issues  in 
PTSCS  data.  Due  to  space  constraints,  other 
important  topics,  such  as  spatial  effects  in  pooled 
data,  nonstationarity,  and  the  usefulness  of  error 
correction  models  as  well  as  estimation  procedures 
and  specification  issues  in  limited  dependent- 
variable  models  for  pooled  data,  could  not  be  dis¬ 
cussed.  While  estimators  and  statistical  tests  are 
discussed  in  most  econometric  textbooks,  a  thor¬ 
ough  discussion  of  specification  problems  in  pooled 
and  panel  data  remains  important.  This  also  holds 
true  for  the  question  of  how  asymptotic  properties 
of  estimators  translate  to  finite-sample  analysis. 
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Tocqueville,  Alexis  de 

(1805-1859) 


French  political  thinker,  author,  historian,  and 
political  philosopher  Alexis  de  Tocqueville,  known 
primarily  for  his  Democracy  in  America  (two 
volumes,  1835-1840),  has  constantly  gained  read¬ 
ership  after  a  hiatus  from  1880  to  1945.  The 
development  of  liberal  democracies,  demands  for 
pluralism,  and  the  demise  of  the  communist  model 
seem  to  explain  this  worldwide  renewal  of  interest. 
It  is  important  not  to  succumb  to  the  allure  of  a 
visionary  who  saw  it  all  coming,  however.  Instead, 
let  us  gauge  the  work’s  originality  with  respect  to 
its  era  and  the  tradition  of  political  philosophy. 
Three  points  in  particular  come  to  the  fore:  the 
relationship  between  freedom  and  equality,  the 
concept  of  democracy,  and  the  vision  of  despotism. 

Freedom  and  Equality 

Compared  with  a  predecessor,  such  as  Benjamin 
Constant,  or  a  contemporary,  such  as  Franqois 
Guizot,  Tocqueville  pondered  the  consequences  of 
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the  principle  of  equality  promoted  by  the  French 
Revolution  in  1789  in  far  greater  depth.  He 
believed  that  the  aspiration  to  equality  is  the  driv¬ 
ing  force  in  history  that  would  inevitably  trans¬ 
form  not  only  class  relations,  law,  customs,  and 
language  but  also  all  of  modern  humanity’s  self¬ 
representations.  The  “gradual  and  progressive 
development  of  equality,”  as  the  introduction  to 
Democracy  in  America  states,  upsets  hierarchies 
and  shapes  minds  in  such  a  way  that  Tocqueville 
wondered  if  they  are  compatible  with  freedom — 
the  other  great  modern  value.  It  was  because  he 
was  of  the  liberal  school  but  also  of  aristocratic 
stock  (old  nobility  by  his  father,  noblesse  de  robe 
by  his  mother)  that  Tocqueville  pursued  this 
inquiry.  He  viewed  the  aristocracy,  due  to  interme¬ 
diate  bodies  and  the  decentralization  of  power 
under  the  feudal  system,  as  the  very  source  of  the 
spirit  of  liberty  in  Europe,  together  with  the 
Christian  idea  of  human  freedom. 

For  Tocqueville,  the  conflict  between  freedom 
and  equality  largely  corresponds  to  the  opposition 
between  aristocratic  society  and  democratic  soci¬ 
ety.  This  gives  rise  to  the  second  originality,  less 
with  respect  to  his  era  than  our  own:  modern 
democracy  does  not  necessarily  promote  a  society 
of  freedom,  contrary  to  what  today  has  become  a 
widespread  idea.  Guizot  shared  this  opinion.  He 
preferred  the  representative  system,  in  which  pay¬ 
ment  of  the  poll  tax  determined  eligibility  to  vote, 
over  democracy,  which  he  believed,  like  Sieyes, 
was  a  primitive  and  oppressive  form  of  societies  in 
their  infancy.  Guizot  considered  that  the  model 
for  free  societies  was  found  in  the  English  parlia¬ 
mentary  system  and  not  in  the  young  republic  of 
the  United  States  of  America.  For  Tocqueville, 
France  had  certainly  experienced  a  period  of  des¬ 
potic  democracy  in  the  form  of  the  Napoleonic 
Empire  and  even  a  bloody  democracy  during  the 
Terror  practiced  by  the  revolutionary  government 
of  1793.  But  America  offered  food  for  thought  as 
it  had  found  the  means  to  curb  the  dangers  of 
popular  sovereignty  by  dispersing  the  locus  of 
power  and  allowing  civil  society  to  organize, 
thereby,  diminishing  the  threats  that  the  spirit  of 
equality  posed  to  freedom  via  associations,  free¬ 
dom  of  the  press,  systematic  recourse  to  justice, 
religious  freedom,  the  separation  of  church  and 
state,  local  community  life,  and  more  generally 
speaking,  federalism. 


Traveling  to  investigate  President  Jackson’s 
America  at  the  age  of  26  (1831),  Tocqueville  thus 
reproduced  Baron  de  Montesquieu’s  accomplish¬ 
ment  as  regards  the  English  Constitution:  to  draw 
overarching  principles  from  a  particular  case.  The 
liberal  tradition  had  taken  the  British  monarchy  as 
a  model.  Tocqueville  took  this  same  comparative 
stance,  in  line  with  Montesquieu’s  aspiration,  that 
is,  aristocratic  liberalism,  but  this  time,  he  aimed  to 
draw  lessons  from  the  American  republic.  This 
young  writer,  with  his  aristocratic  background,  did 
not  seek  to  salvage  the  values  of  his  milieu  by  tak¬ 
ing  the  supposedly  right  model  or  right  version  of 
society:  England,  with  its  aristocracy,  a  service 
class,  and  open  to  merit,  which  was  a  stark  contrast 
to  the  social  sterility  of  the  French  nobility  steeped 
in  its  privileges  of  birth.  He  instead  aimed  to 
reform,  elevate,  and  spiritualize  a  lifestyle  that  he 
had  little  sympathy  for:  a  society  of  commerce  and 
jealous  competition  among  equals,  calculated  self- 
interest,  utilitarianism,  and  the  lack  of  any  official 
hierarchy.  The  typical  example  is  America,  but  it 
was  bound  to  gain  favor  in  Europe.  He  hoped  nev¬ 
ertheless  to  breathe  certain  aristocratic  values,  such 
as  the  spirit  of  freedom,  public  service,  and  protec¬ 
tion  of  the  weak,  into  this  social  form  that  was 
making  advances  in  history  with  the  expansion  of 
the  bourgeoisie. 

Democracy 

Actually,  Tocqueville’s  new  concept  of  democracy 
consisted  of  two  very  different  components:  the 
political  system  on  one  hand  (elections,  constitu¬ 
tion,  political  debate  as  a  public  scene)  and  the 
social  state  (etat  social)  on  the  other — that  is,  civil 
society,  but  including  in  it  customs,  beliefs  and 
public  opinion,  forms  of  self-representation,  and 
relations  between  individuals. 

What  is  there  in  common  between  these  two 
constitutive  spheres  of  democracy?  Majority  rule 
triumphs  in  both  civil  society  and  government, 
but — it  is  remarkable  to  note — political  despotism 
in  government  can  very  well  gain  acceptance  by 
democracy  as  a  social  state.  Tocqueville  himself 
witnessed  with  sorrow  and  helplessness  the  rise  to 
power  and  the  coup  d’etat  of  Napoleon  III  and 
then  his  lasting  triumph.  The  Second  Empire 
regime  illustrated,  unfortunately,  what  he  had 
described  in  Democracy  in  America  some  20  years 
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earlier.  In  the  foreword  to  the  Ancien  Regime  and 
the  Revolution  (1856),  he  alluded  to  a  few  pages 
of  the  introduction  to  Democracy  in  America 
(1835)  pertaining  to  the  immoderate  thirst  for 
wealth,  lack  of  interest  in  politics,  cynicism  in 
social  relations,  respect  for  brute  force  as  being  the 
very  passions  aroused  by  democracy  that  can  lead 
to  despotism.  “Democratic  societies  that  are  not 
free  can  be  wealthy,  refined,  even  splendid.  ”  It  is  in 
that  sense  that  a  democratic  society  can  be  not 
free,  despite  the  opinions  we  hold  today. 

Despotism 

From  this  derives  the  treatment  of  despotism  as  a 
theoretical  problem  to  which  Tocqueville,  building 
on  the  analyses  of  Locke,  Montesquieu,  and 
Rousseau,  takes  an  entirely  new  approach.  The 
originality  is  to  consider  that  despotism  can  arise 
out  of  the  very  logics  of  democracy  (considered  as 
civil  society),  as  we  have  already  seen.  But  actually, 
two  possibilities  exist,  which  are  sometimes  com¬ 
bined  and  sometimes  separate. 

Apathetic,  depoliticized  citizens  devoted  to  the 
constant  search  for  “material  gratifications”  (to 
use  Tocqueville’s  expression)  will  in  the  long  run 
unburden  themselves  of  the  task  of  governing 
themselves  if  a  power  (and  a  leader)  offers  its  ser¬ 
vices,  promising  to  satisfy  their  thirst  for  wealth 
and  take  charge  of  all  aspects  of  their  everyday  life. 
In  this  sense,  despotism,  “soft  and  tutelary”  (as 
Tocqueville  calls  it),  occurs  when  a  centralized  and 
bureaucratic  state  extends  its  intervention,  replac¬ 
ing  citizen  freedom  with  egalitarian  paternalism, 
even  if  it  means  harshly  repressing  unruliness.  The 
picture  of  this  new  brand  of  despotism  is  found  in 
the  now  famous  pages  at  the  end  of  Volume  2  of 
Democracy  in  America. 

Further,  given  both  the  numerical  and  moral 
weaknesses  of  an  individual  faced  with  the  mass  of 
his  equals  and  fellow  men,  despotism  is  realized  in 
the  form  of  this  conformism  that  always  plagues 
democracies.  The  “tyranny  of  the  majority”  that  is 
given  free  rein  in  a  parliamentary  regime  if  minor¬ 
ity  rights  are  not  preserved  by  appropriate  institu¬ 
tions  finds  its  counterpart  in  civil  society  as  well. 
The  democratic  paradox  is  thus  as  follows:  Proud 
of  his  individual  freedom  of  judgment,  the  indi¬ 
vidual  feels  intimidated  before  the  majority  judg¬ 
ment  every  time  he  is  told  that  public  opinion  has 


spoken.  Can  one  be  right  alone  against  thousands 
of  people  when  one  makes  the  equality  of  minds 
the  cornerstone  of  the  regime  and  life  in  society?  In 
a  democracy,  it  may  seem  that  enlightenment,  wis¬ 
dom,  and  truth  lie  in  numbers,  which  make  up  the 
public  or  public  opinion:  Tocqueville  believed  that 
he  had  observed  in  America  the  paradoxical  union 
of  freedom  of  thought,  proclaimed  as  a  founding 
axiom,  with  the  stifling  of  this  selfsame  freedom — 
in  certain  periods — under  the  Empire  of  the  Public: 
“In  the  United  States,  the  majority  takes  it  upon 
itself  to  provide  individuals  with  a  range  of  ready¬ 
made  opinions  and  thus  relieves  them  of  the  obli¬ 
gation  to  form  their  own.”  (p.  491). 

Modern  democracy  would  thus  be  a  confirma¬ 
tion  of  the  fact  that  democracy  embodies  just  as 
much  the  potential  for  freedom  as  it  does  a  new 
brand  of  despotism  and  that  it  is  in  general  pro¬ 
foundly  ambivalent  as  is  its  chief  principle,  the 
principle  of  equality.  Equality  even  embodies  a 
new  religion  that  democracy  eagerly  builds  on: 

Regardless  of  what  political  laws  men  are  subject 
to  in  ages  of  equality,  we  may  anticipate  that 
faith  in  common  opinion  will  become  a  sort  of 
religion,  with  the  majority  as  its  prophet,  (p.  492) 

The  Public  is  democracy’s  new  god;  it  worships 
itself  through  the  practice  of  consulting  public 
opinion.  No  one  more  than  Tocqueville  has 
arrived  at  such  a  richness  of  expression  on  this 
question,  however  much  it  was  debated  during  the 
18th  century:  What  is  the  new  power  of  public 
opinion?  What  is  government  by  opinion? 

We  will  note  that  Montesquieu,  after  Plato,  had 
described  the  disadvantages  of  total  equality  {The 
Spirit  of  the  Laws,  VIII  3,  “Of  the  Spirit  of  Extreme 
Equality”).  Tocqueville  makes  use  of  this  analysis 
but  enriches  it  as  well  with  his  readings  of  the 
French  moralists  and  especially  the  Jansenists,  such 
as  Franqois  de  la  Rochefoucauld,  Blaise  Pascal,  and 
Pierre  Nicole:  He  analyzes  the  endless  pursuit  of 
material  gratifications  as  an  equivalent  to  what 
Pascal  had  called  “distractions”  (in  which  man 
hides  his  own  mortality  from  himself  by  the  pursuit 
of  satisfactions  that  occupy  him  entirely);  he  even 
says  explicitly  that  perfect  and  definitive  equality 
flees  on  an  “eternal  flight,”  according  to  an  expres¬ 
sion  Pascal  had  used  and  that  Tocqueville  cites.  As 
regards  the  situation  of  the  French  after  the  1789 
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revolution,  he  adopts  a  tone  that  is  both  patrician 
and  one  of  a  Jansenist-influenced  author,  so  to 
speak:  “Having  destroyed  the  obstructing  privi¬ 
leges  enjoyed  by  some  of  their  fellow  men,  they  run 
up  against  universal  competition”  (p.  627).  Such  is 
“democracy”  according  to  Tocqueville — at  once  a 
set  of  political  institutions,  mentalities,  and  social 
behaviors.  It  is  a  system  that  can  be  improved  on, 
with  which  one  must  learn  to  live,  he  explains  to 
his  family  and  his  friends  in  the  legitimist  party: 
local  freedoms,  the  power  of  public  opinion,  the 
search  for  material  well-being  are  all  fields  of 
action  offered  to  democracy  and  that  should  draw 
the  attention  of  reformers,  theoreticians,  and  polit¬ 
ical  professionals  alike. 

Influence  on  U.S.  Political  Science 

Tocqueville  has  inspired  many  works  of  political 
science,  especially  in  America.  We  might  mention 
Louis  Hartz,  for  whom  liberalism  in  America 
developed  in  the  absence  of  a  feudal  past  to  be 
fought  against  (The  Liberal  Tradition  in  America, 
1955);  Robert  Bellah  expounds  on  a  typically 
Tocquevillian  theme  in  Habits  of  the  Heart: 
Individualism  and  Commitment  in  American  Life 
(1985),  as  does  Robert  D.  Putnam  in  his  theory  of 
“social  capital”  more  recently  (Bowling  Alone, 
2000):  the  communitarian  stream  takes  Tocqueville 
as  one  of  its  reference  points. 

Lucien  Jaume 
Sciences  Po 
Paris,  France 

Translated  from  the  French  by  Cynthia  Schoch 

See  also  Democracy,  Theories  of;  Liberalism;  Political 
Philosophy;  Political  Theory 
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Tolerance 


Tolerance  is  derived  from  the  word  tolerate, 
which  is  broadly  defined  as  permitting  or  endur¬ 
ing  what  is  objectionable.  This  entry  discusses  the 
historical  and  philosophical  background  of  toler¬ 
ance  with  an  emphasis  on  liberal  doctrines  and 
examines  its  connection  to  human  rights  as  well  as 
the  limits  of  tolerance. 

Social  tolerance  involves  a  permissive  attitude 
toward  those  whose  race,  religion,  ethnicity, 
nationality,  sexual  orientation,  opinion,  or  habits 
differ  from  one’s  own.  In  politics,  tolerance  can  be 
a  quality  of  an  individual,  group,  corporate  entity, 
or  state.  Although  the  meaning  of  the  concept  has 
changed  over  time,  tolerance  for  different  opinions 
and,  thus,  allowing  for  freedom  of  expression  and 
association  is  considered  the  hallmark  of  a  liberal 
democratic  state.  Although  it  started  as  an  exclu¬ 
sive  political  system,  some  commitment  to  reli¬ 
gious  pluralism  and  the  gradual  expansion  of 
political  rights  have  contributed  to  its  identifica¬ 
tion  with  social  tolerance  as  well. 

Historical  and  Philosophical  Background 

Eurocentric  historiographies  of  tolerance  attribute 
its  origin  to  the  Protestant  movement  or  the  age  of 
Enlightenment,  which  respectively,  invited  reli¬ 
gious  pluralism  and  humanism.  Arguments  raised 
by  natural  law  and  social  contract  theorists,  such 
as  John  Locke  (1632-1704)  and  Jean-Jacques 
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Rousseau  (1712-1778),  against  the  “divine  rights 
of  kings”  or  aristocratic  privileges  of  the  ancien 
regime  and  in  favor  of  religious  freedom  or  secu¬ 
larism  constitute  the  core  of  liberal  theories  of 
tolerance.  John  Stuart  Mill’s  (1806-1873)  empha¬ 
sis  on  liberty  and  utilitarian  endorsement  of  non¬ 
conformity,  deemed  necessary  for  the  creative 
mind  to  flourish  and  help  advance  the  society, 
have  been  very  influential  as  well. 

However,  the  philosophy  and  practice  of  toler¬ 
ance  existed  in  both  European  and  non-European 
societies  since  ancient  times,  and  the  latter  ones 
were  usually  more  advanced  and  informed  the 
former.  The  Edicts  of  Ashoka,  issued  by  the  3rd 
century  BCE  ruler  of  the  Maurya  Empire  with  the 
purpose  of  spreading  his  recently  adopted  religion 
of  Buddhism,  also  included  provisions  of  ethnic 
and  religious  tolerance.  Islamic  empires,  in  gen¬ 
eral,  were  known  for  respecting  other  major  reli¬ 
gions  and  protecting  their  followers.  These  pro¬ 
tected  people  (the  dhimmi)  originally  included 
“the  people  of  the  book”  (the  Abrahamic  texts: 
the  Old  and  New  Testaments  and  the  Koran),  but 
the  definition  was  expanded  to  include  Hindus 
and  Buddhists  when  the  Mughal  Empire  expanded 
Islamic  rule  into  the  territories  of  some  older 
Asian  civilizations. 

Andalusian  Spain,  ruled  by  the  Moorish 
Muslims  between  the  early  eighth  and  late  fif¬ 
teenth  centuries,  was  a  hotbed  of  multiculturalism 
and  collaboration  of  intellectuals  of  all  creeds  and 
cultures.  Muslim  philosopher  Ibn  Rushd  (Averroes, 
1126-1198)  and  Rabbi  Moses  Maimonides  (1135- 
1204)  argued  not  only  for  tolerance  but  also  pro¬ 
moted  the  Stoic  notions  that  there  was  an  eternal 
and  universal  natural  law  that  governed  the  entire 
universe,  that  all  human  beings  were  born  with  the 
ability  to  reason,  and  that  human  conduct  there¬ 
fore  needed  to  be  harmonized  with  this  universal 
law.  For  Christian  Europeans,  Andalusia  served  as 
a  door  to  new  ideas;  in  addition  to  the  Moorish 
philosophers’  writings,  their  ancient  Greek  sources 
entered  the  European  worldview,  especially  after 
the  completion  of  the  Christian  conquest  of  Spain 
in  1492.  However,  this  exposure  had  no  immedi¬ 
ate  impact  on  discriminatory  practices.  In  fact,  the 
Inquisition  and  other  mechanisms  were  employed 
more  vehemently  than  before,  and  the  persecuted 
“heretic”  Christians  and  Jews  sought  refuge  in  the 
multicultural  Islamic  Empire  of  the  Ottomans. 


The  recognition  of  the  legitimacy  of  diversity 
and  allowing  different  cultural  groups  to  maintain 
their  customs  and  identity,  however,  did  not  make 
these  empires  immune  to  discrimination.  In  fact, 
the  noninterference  form  of  tolerance,  advanced  or 
practiced  in  highly  authoritarian  or  hierarchal 
empires,  involved  considerable  discrimination.  For 
example,  the  Ottoman  state,  often  praised  for 
allowing  all  nationalities  to  freely  practice  their 
customs,  language,  and  religion,  also  imposed 
heavier  taxes  on  its  non-Muslim  subjects. 

In  the  West,  the  natural  rights  philosophers  usu¬ 
ally  fell  short  of  upholding  the  universalism  of  the 
Stoic  teachings  and  denied  equality  to  certain  seg¬ 
ments  of  humanity.  John  Locke,  for  example, 
claiming  that  the  inalienable  rights  of  life,  liberty, 
and  estate  were  held  by  land-owning  men,  trusted 
only  that  small  group  of  people  with  governance 
and  political  rights,  disenfranchised  other  men  and 
all  women,  and  justified  slavery.  Though  more 
egalitarian,  John  Stuart  Mill  denied  equal  treat¬ 
ment  and  rights  to  “those  backward  states  of  soci¬ 
ety  in  which  the  race  itself  may  be  considered  as  in 
its  nonage”  (Chapter  1,  On  Liberty).  The  blatant 
sexism  of  St.  Thomas  Aquinas  (1225-1274),  who 
wondered  why  God  would  even  create  woman,  a 
“misbegotten  male,”  in  the  first  production  of 
things,  has  been  expanded  on  by  others  who  ques¬ 
tioned  if  women  had  souls  (if  they  were  fully 
human).  In  the  early  modern  era,  Western  imperi¬ 
alist  expansion  instigated  similar  queries  about 
whether  aboriginals  had  souls.  Colonialism,  often 
justified  as  an  enlightening  and  civilizing  mission, 
involved  repression,  assimilation,  and  the  ultimate 
devaluation  of  the  colonized  and  their  cultures. 

Liberal  Doctrines 

The  democratization  of  liberal  philosophies  and 
states  drew  them  closer  to  universalism.  However, 
while  the  contemporary  liberal  democratic  stance 
against  discrimination  constitutes  an  improvement 
over  the  discriminatory  tolerance  displayed  in 
ancient  and  medieval  empires,  it  maintains  a  simi¬ 
lar  negative  conceptualization  of  tolerance.  Both 
the  liberal  democratic  and  imperial  models  of  tol¬ 
erance  allow  different  people  or  peoples  “to  exist” 
by  employing  a  “let  them  be”  approach,  which  is 
considered  a  major  flaw  by  their  contemporary 
critics. 
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Feminists,  postcolonial  theorists,  and  postmod¬ 
ernist  detractors  have  scrutinized  the  egalitarian 
claims  of  the  liberal  tradition  and  democratic  sys¬ 
tems.  They  find  that  the  liberal  model’s  demand  of 
and  reliance  on  noninterference  translates  to  a  lack 
of  full  recognition,  thus,  permitting  the  continua¬ 
tion  of  inequalities  and  discriminatory  practices.  In 
fact,  they  consider  the  act  of  toleration  as  hierar- 
chal  and  patronizing,  because  by  tolerating  the 
“objectionable,”  the  person  (or  the  dominant 
group  in  a  society)  passes  a  judgment  and  creates 
an  “other”  that  is  considered  less  valuable,  yet 
harmless  enough  to  be  allowed  existence.  Thus, 
instead  of  promoting  tolerance,  which  would 
involve  a  vertical  recognition,  they  call  for  hori¬ 
zontal  recognition,  which  would  mean  recognizing 
that  other  individuals  and  cultures  are  equal  in 
value. 

In  Western  societies,  these  debates  on  the 
accommodation  of  differences  have  included  ques¬ 
tions  about  the  limitation  of  legal  equality  and 
freedoms.  Arguments  about  the  repressive  impact 
of  nonrecognition  of  the  achievements  and  contri¬ 
butions  of  minority  cultures  and  the  notion  that 
indifference  to  one’s  identity  or  culture  can  in  fact 
harm  the  person  ushered  in  the  advocacy  of  multi- 
culturalism,  which  is  also  referred  to  as  identity 
politics  or  politics  of  recognition.  (The  main  and 
overt  battleground  of  multiculturalism  has  been 
educational  systems,  where  the  debates  ranged 
from  the  means  and  merits  of  preserving  minority 
languages,  e.g.,  French  in  Quebec,  to  revising  the 
curricula  to  include  the  historical,  literary,  and 
other  contributions  of  non-White,  non-Western 
populations,  e.g.,  Africans  and  African  Americans 
in  the  United  States.)  While  the  movement  has 
promoted  equality  in  dignity  and  recognition,  the 
means  of  evaluation  of  cultures  and  parameters  of 
recognition  continue  to  be  contentious  issues. 

Connection  to  Human  Rights 

Arguments  in  favor  of  tolerance  have  been  always 
closely  related  to  arguments  on  human  rights, 
which  are  rights  claimed  against  the  state  and  soci¬ 
ety  for  the  sole  reason  of  being  a  human.  While 
human  rights  can  be  traced  back  to  ancient  times, 
a  leap  toward  a  common  definition  of  human 
rights  and  their  universal  recognition  was  made 
with  the  establishment  of  the  United  Nations 


(UN),  the  Charter  (1945)  of  which  included  the 
promotion  of  human  rights  as  one  among  the 
goals  of  the  organization.  The  Universal  Declaration 
of  Human  Rights  (UDHR),  adopted  by  the  UN 
General  Assembly  in  1948,  provided  a  list  of 
human  rights,  and  the  subsequent  conventions  and 
treaties  issued  by  the  organization  created  a  body 
of  international  human  rights  law. 

The  principles  of  antidiscrimination  and  equal¬ 
ity  in  dignity  lie  at  the  core  of  the  normative  frame¬ 
work  of  the  international  human  rights  law.  First, 
the  UDHR  spells  out  that  human  rights  are  put 
under  protection  because  “human  beings  are  born 
free  and  equal  in  dignity  and  rights”  (Article  1).  In 
addition  to  the  UN  Charter,  which  specifies  that 
human  rights  apply  to  all  without  distinction  as  to 
race,  sex,  language,  or  religion,  Article  2  of  the 
UDHR  expands  the  list  to  include  “race,  colour, 
sex,  language,  religion,  political  or  other  opinion, 
national  or  social  origin,  property,  birth  or  other 
status”  and  objects  to  discrimination  based  on 
“political,  jurisdictional  or  international  status  of 
the  country  or  territory  to  which  a  person  belongs.” 
The  same  extensive  nondiscrimination  list  is  repeated 
in  various  articles  of  the  subsequent  human  rights 
treaties  but  without  taking  a  position  on  the  value 
of  the  specified  categories  of  differences — they  do 
not  call  for  their  elimination  or  protection.  The 
International  Covenant  of  Civil  and  Political 
Rights  (ICCPR,  which  was  adopted  in  1966  and 
came  into  force  in  1976),  however,  brings  ethnic¬ 
ity,  religion,  and  language  under  protection  as 
minority  rights  (Article  27). 

In  addition  to  nondiscrimination,  the  interna¬ 
tional  human  rights  law  promotes  respect  for  dif¬ 
ferences.  For  example,  the  International  Covenant 
on  Economic,  Social  and  Cultural  Rights  (adopted 
and  entered  into  force  in  1966  and  1976,  respec¬ 
tively)  not  only  recognizes  everyone’s  right  to  edu¬ 
cation  but  also  stipulates  that  “education  shall  be 
directed  to  the  full  development  of  the  human 
personality  and  sense  of  dignity”  and  “enable  all 
persons  to  participate  in  a  free  society,  promote 
understanding,  tolerance  and  friendship  among  all 
nations  and  all  racial,  ethnic  or  religious  groups” 
(Article  13). 

The  emphasis  placed  on  equality  in  dignity  and 
respect  for  differences  provide  for  a  “nonnega¬ 
tive”  conceptualization  of  tolerance  in  the  interna¬ 
tional  human  rights  law.  It  promotes  positive  and 


Totalitarian  Regimes  2627 


horizontal  tolerance  by  requiring  the  acceptance 
and  full  recognition  of  differences,  and  the  unique 
identity  of  individuals  or  groups  without  discrimi¬ 
nation.  This  approach  has  been  displayed  in  other 
UN  documents  and  activities  on  tolerance,  includ¬ 
ing  the  Declaration  on  the  Elimination  of  All 
Forms  of  Intolerance  and  of  Discrimination  Based 
on  Religion  or  Belief  (1981),  the  designation  of 
1995  as  the  “Year  for  Tolerance,”  and  holding  a 
World  Conference  against  Racism,  Racial 
Discrimination,  Xenophobia,  and  Related 
Intolerance  (2001). 

The  Limits  of  Tolerance 

A  question  that  applies  both  to  the  liberal  notion 
of  “negative”  tolerance  and  to  the  broader  concep¬ 
tualization  of  it  as  recognition  is  whether  tolerance 
warrants  tolerating  the  intolerant.  In  other  words, 
what  are  the  limits  of  tolerance?  While  the  liberal 
philosopher  Karl  Raimund  Popper  (1902-1994) 
considered  the  intolerant  (especially  the  advocates 
of  totalitarian  ideologies)  as  the  enemies  of  open 
society  and  thus  objected  to  the  idea  of  tolerating 
them,  another  liberal  philosopher,  John  Rawls, 
argued  that  tolerance  for  all  was  necessary  to 
maintain  a  just  (tolerant)  society,  but  that  could  be 
allowed  only  if  the  intolerant  ones  do  not  endanger 
the  tolerant  essence  of  the  society.  In  a  similar  vein, 
stating  that  “any  advocacy  of  national,  racial  or 
religious  hatred  that  constitutes  incitement  to 
discrimination,  hostility  or  violence  shall  be  pro¬ 
hibited  by  law,”  the  ICCPR  sets  boundaries  of  tol¬ 
erance  in  the  international  human  rights  law 
(Article  20).  Although  paradoxical,  these  proposi¬ 
tions  address  some  concerns  about  hate  speech, 
violence  stemming  from  bigotry,  or  authoritarian 
takeover  of  democratic  systems.  However,  calls  for 
cultural  preservation  and  recognition,  prevalent  in 
the  discourse  of  politics  of  recognition  and  multi- 
culturalism,  make  women  and  others,  whose  sub¬ 
ordination  is  dictated  or  enforced  by  cultural  norms 
and  values,  uneasy.  It  is  not  only  gender  equality 
and  other  emancipatory  aspects  of  the  interna¬ 
tional  human  rights  law  that  can  be  jeopardized  in 
the  name  of  religious  freedom  or  celebration  of 
multiculturalism,  cultural  relativism,  but  also  the 
right  to  self-determination  in  primarily  undemo¬ 
cratic  contexts,  where  members  are  not  allowed  to 
interpret  the  cultural  sources  and  determine  their 


own  lives,  may  help  sustain  discrimination  and 
serve  the  privileged. 
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Totalitarian  Regimes 


The  most  common  definition  of  totalitarian 
regimes,  categorized  generally  under  the  concept 
of  “totalitarianism,”  refers  to  modern  antiliberal 
and  antidemocratic  regimes  based  on  the  monop¬ 
olization  of  power  by  a  revolutionary  single  party 
led  by  a  charismatic  leader  and  dominating  over 
the  state  and  society  by  means  of  terror  and  pro¬ 
paganda.  Totalitarian  regimes  carry  out  their 
policy  using  a  variety  of  institutions  controlled  by 
the  single  party:  a  highly  centralized  bureaucracy, 
an  official  and  secret  police,  a  party  militia,  the 
manipulation  of  culture  and  public  opinion 
through  the  monopoly  of  mass  media,  and  a  net¬ 
work  of  organizations  set  up  all  over  the  country 
under  the  control  of  the  single  party,  whose  tasks 
are  to  regiment,  indoctrinate,  and  mobilize  perma¬ 
nently  the  individual  and  the  masses. 

More  specifically,  some  scholars  consider  mass 
extermination  as  a  necessary  feature  whose  pres¬ 
ence  makes  a  regime  truly  totalitarian;  other  schol¬ 
ars  argue  that  there  can  be  totalitarianism  without 
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terror,  or  they  emphasize  the  role  of  the  charismatic 
leader  or  the  mobilization  of  the  masses  as  an  essen¬ 
tial  characteristic  of  totalitarian  regimes.  However, 
all  scholars,  whatever  their  definitions  of  totalitar¬ 
ian  regimes,  agree  that  these  are  new  types  of  anti¬ 
democratic  political  systems  that  differ  substantially 
from  traditional  dictatorships  and  civil  or  military 
modern  authoritarian  states,  whose  concentration 
of  power  is  based  mainly  on  repression  but  does 
not  require  a  revolutionary  single  party,  an  all- 
encompassing  ideology,  or  a  network  of  collective 
organizations  for  the  mobilization  of  the  masses. 

The  definition  of  totalitarian  regimes  has  been  a 
matter  of  endless  debate,  with  highly  ideological 
and  political  implications.  Hence,  any  definition  of 
totalitarian  regimes  necessarily  implies  a  survey  of 
the  origins  and  development  of  the  concept  of 
totalitarianism.  This  will  be  done  in  the  following 
sections  of  the  entry,  complemented  by  an  analysis 
of  this  type  of  regime  in  the  Cold  War.  Next,  the 
entry  examines  the  eclipse  of  the  concept  of  totali¬ 
tarianism  in  political  science  from  the  middle  of 
the  20th  century  through  the  1970s.  The  entry 
concludes  by  examining  the  renewed  interest  in 
this  concept  that  began  in  the  1980s  and  has  led  to 
new  interpretations  of  it. 

The  Name  and  the  Thing 

Contrary  to  widespread  opinion,  the  terms  totali¬ 
tarian  and  totalitarianism  were  not  invented  by 
Mussolini,  even  if  the  Fascist  regime  was  the  only 
one  to  label  itself  as  totalitarian.  Both  words  were 
originally  coined  by  Italian  antifascists  a  few 
months  after  Mussolini  was  appointed  by  the  King 
as  Prime  Minister  in  October  29,  1922.  In 
November  1923,  the  liberal  antifascist  Giovanni 
Amendola,  probably  the  inventor  of  the  word 
totalitarian,  described  fascism  as  imbued  with  a 
totalitarian  spirit,  using  violence  to  impose  its  ide¬ 
ology  as  a  form  of  faith  obligatory  for  all  Italians. 
In  the  same  period,  Catholic  antifascists  used  this 
new  adjective  to  define  fascism  as  a  political  reli¬ 
gion  sacralizing  the  nation  and  the  state,  thus 
being  anti-Christian  in  essence.  Since  June  1924, 
when  the  socialist  deputy  Giacomo  Matteotti  was 
assassinated  by  a  Fascist  gang,  the  word  totalitar¬ 
ian  was  also  applied  to  the  Fascist  extremists  push¬ 
ing  Mussolini  hard  to  establish  a  dictatorship.  In 
December  1924,  the  Marxist  antifascist  Lelio 


Basso  used  the  substantive  totalitarianism  for  the 
first  time  to  describe  the  new  Fascist  state  in  the 
making  by  transforming  traditional  institutions 
such  as  the  monarchy,  the  parliament,  and  the 
magistracy,  as  well  as  the  armed  forces,  into  the 
instruments  of  a  single  party. 

We  must  bear  in  mind  that  totalitarian  and 
totalitarianism  were  coined  by  anti-Fascists  before 
the  establishment  of  the  single-party  regime  in 
Italy.  Therefore,  they  did  not  refer  to  Fascist  ideol¬ 
ogy  or  to  Fascist  rhetoric  but  to  the  reality  of  the 
Fascist  party’s  practice  of  ruling  over  the  country 
using  an  armed  militia  to  terrorize,  persecute,  and 
kill  political  opponents.  It  was  the  first  time  that 
an  armed  militia  party,  identifying  itself  with  the 
nation  and  attacking  all  political  opponents  as 
internal  enemies  of  the  state,  took  power  in  a 
Western  parliamentarian  democracy  to  destroy  it. 
The  novelty  of  the  phenomenon  required  a  new 
concept  to  define  it. 

The  concept  of  totalitarianism  was  first  devel¬ 
oped  by  the  Italian  priest  Luigi  Sturzo,  the  founder 
of  the  Popular  Party,  who  was  forced  into  exile  by 
the  Fascists  in  1924.  In  his  book  Italy  and  Fascism, 
published  in  English  in  1926,  Sturzo  compared  the 
reality  of  the  two  regimes  fascism  and  bolshevism 
and  defined  both  as  totalitarian  because  of  their 
points  of  resemblance,  in  spite  of  their  differences 
in  historical  background,  social  support,  ideolo¬ 
gies,  and  method  of  achieving  power.  The  points 
of  resemblance  were  the  rejection  of  the  principles 
and  method  of  democracy,  the  monopoly  of  power 
and  politics  by  a  single  party,  the  violent  suppres¬ 
sion  of  all  opponents,  and  the  regimentation  of 
society  under  party  control.  Sturzo’s  book  was 
soon  translated  into  German,  French,  and  Spanish, 
thus  spreading  the  concept  of  totalitarianism  in 
Europe  and  America  from  the  late  1920s. 

The  Totalitarian  State 

After  Adolf  Hitler’s  seizure  of  power  in  1933  and 
the  establishment  of  a  single-party  regime  in 
Germany,  the  expression  totalitarian  state  was 
being  commonly  used  by  intellectuals  and  politi¬ 
cians  to  define  the  new  antidemocratic  regimes  in 
Russia,  Italy,  and  Germany  in  order  to  distinguish 
them  from  ancient  despotism  and  the  modern  dic¬ 
tatorships  mushrooming  all  over  Europe  during 
the  interwar  period.  Most  of  these  antidemocratic 
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regimes  believed  in  the  concentration  of  power  in 
the  hands  of  a  king,  an  army  official,  or  a  “strong 
man”  representing  the  old  establishment.  During 
the  1930s,  an  increasing  number  of  scholars  in 
Europe  and  the  United  States,  such  as  Raymond 
Aron,  Franz  Borkenau,  Alfred  Cobban,  Michael 
Florinsky,  Hans  Kohn,  Fritz  Morestein  Marx,  Elie 
Halevy,  Carlton  J.  H.  Heyes,  Emil  Federer,  and 
Jose  Ortega  y  Gasset,  put  forward  various  inter¬ 
pretations  of  the  totalitarian  regimes,  emphasizing 
their  novelty. 

According  to  these  scholars,  the  basic  feature  of 
totalitarianism  was  a  single-party  regime  created 
by  an  antidemocratic  mass  movement  that  used 
paramilitary  violence  and  terror  combined  with 
electoral  consensus,  as  in  the  case  of  fascism  and 
Nazism,  to  conquer  and  maintain  power,  render¬ 
ing  permanent  its  own  dictatorship  as  a  police 
state  based  on  the  supremacy  of  a  charismatic 
leader  and  the  oligarchy  of  the  single  party,  which 
comprised  an  elite  group  of  “new  men”  emerging 
from  anonymity.  Other  peculiar  features  of  a 
totalitarian  state  were  the  cult  of  the  supreme 
leader  of  the  party,  who  was  exalted  and  revered 
like  a  demigod  by  the  followers  and  by  the  masses; 
absolute  state  centralization  in  the  administrative 
and  political  fields;  state  control  of  the  economy; 
the  militarization  of  society  through  the  perma¬ 
nent  and  capillary  rule  of  the  single  party,  whose 
main  goal  was  to  remold  and  transform  the  indi¬ 
vidual  and  the  masses  in  the  image  of  its  ideology; 
the  fanaticism  and  missionary  spirit  of  the  single¬ 
party  regime;  a  rigorous  monopoly  of  education 
militarizing  the  youth;  the  use  of  methods  of  tech¬ 
nological  and  organizational  modernization  for 
the  mobilization  of  the  masses;  and  the  develop¬ 
ment  of  an  intense  and  constant  activity  of  collec¬ 
tive  indoctrination  through  mass  organizations, 
festivals,  parades,  pageants,  plebiscites,  and  sports. 

Here  again,  we  have  to  bear  in  mind  that  it  was 
the  reality  of  these  regimes,  their  structure  and 
functioning,  and  their  concrete  policy  toward  the 
masses,  and  not  their  ideologies,  intentions,  or 
goals,  that  were  at  the  origins  of  the  definition  of 
totalitarianism. 

Interpretations  of  Totalitarianism 

Theories  of  the  totalitarian  regime  were  also  devel¬ 
oped  by  the  intellectuals  mentioned  above  to 


explain  its  origins,  workings,  and  foreseeable  evo¬ 
lution.  The  theories  focused  mainly  on  modern 
phenomena  and  events  that  contributed  to  the 
emergence  of  totalitarian  regimes  in  different 
countries  such  as  Russia,  Italy,  and  Germany. 
Since  all  these  regimes  were  established  after 
World  War  I  and  were  strictly  connected  to  the 
war,  although  in  different  ways,  the  war  was  con¬ 
sidered  as  the  seedbed  for  totalitarianism. 
Conditions  for  the  emergence  of  totalitarian 
regimes  were  the  radicalization  of  politics  and 
class  struggles;  the  incapacity  of  the  parliamentary 
system  to  provide  an  effective  solution  to  social 
and  economic  conflicts;  the  impoverishment  and 
disorientation  of  social  classes  hit  by  fierce  eco¬ 
nomic  crises,  moving  toward  a  revolution  or  a 
counterrevolution  to  survive;  and  the  rebellion  of 
the  masses,  looking  for  simple  and  reassuring  solu¬ 
tions  by  subordinating  themselves  to  the  command 
of  a  “strong  man,”  who  promised  to  resolve  the 
crisis  with  brutal  yet  efficient  methods.  From  the 
war  experience,  the  revolutionary  movements  that 
built  up  a  totalitarian  regime  derived  the  militari¬ 
zation  of  politics,  the  use  of  violence,  total  state 
control  over  society,  and  brainwashing  propa¬ 
ganda  as  basic  methods  to  maintain  the  monopoly 
of  power.  A  feeble  or  short-lived  democratic  tradi¬ 
tion,  widespread  political  fragmentation,  and  a 
sequence  of  weak  governments  were  also  reckoned 
as  conditions  for  the  emergence  of  totalitarian 
regimes. 

In  a  broader  historical  perspective,  the  roots  of 
the  totalitarian  state  were  linked  to  the  mounting 
tendency  toward  centralization  of  the  modern 
state,  expanding  its  control  over  public  and  private 
life.  Some  scholars  pointed  to  the  advent  of  the 
mass  society,  the  atomization  of  individuals,  the 
political  mobilization  of  amorphous  masses,  and 
the  use  of  modern  techniques  of  propaganda  by 
ruthless  and  cynical  demagogues  as  the  main  fac¬ 
tors  for  the  making  of  a  totalitarian  regime.  The 
totalitarian  state  was  also  seen  as  a  perverted  expe¬ 
rience  of  mass  democracy  in  the  age  of  industrial¬ 
ization.  The  Jacobin  dictatorship  during  the  French 
Revolution  as  well  as  the  regimes  of  Napoleon  I 
and  Napoleon  III  were  numbered  among  the  har¬ 
bingers  of  totalitarianism. 

More  specifically,  scholars  of  religious  orienta¬ 
tion,  such  as  Adolf  Keller,  Waldemar  Gurian, 
Jacques  Maritain,  and  Eric  Voegelin,  attributed  the 
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origins  of  the  totalitarian  state  to  the  process  of 
secularization:  They  defined  totalitarianism  as  a 
political  religion  deifying  the  state,  race,  or  class 
and  enforcing  state  control  over  the  body  and 
mind  of  the  people  to  mold  them  into  obedient 
believers  and  servants.  Hence,  totalitarianism  was 
anti-Christian  in  essence,  even  if  fascism  and 
Nazism  did  not  preach  atheism  as  bolshevism  did 
but  pretended  to  be  respectful  of  the  churches. 

Theories  of  totalitarianism  had  their  heyday  in 
f  939,  when  many  writings  on  totalitarian  regimes 
were  published.  The  nonaggression  treaty  between 
Nazi  Germany  and  Soviet  Union  in  August  1939 
was  regarded  as  a  confirmation  of  the  affinity 
between  the  two  totalitarian  regimes.  In  his  book 
The  Totalitarian  Enemy,  published  in  1940,  dis¬ 
cussing  the  conflict  between  the  democratic  and 
totalitarian  types  of  regime,  its  origins,  and  its 
results,  Franz  Borkenau  called  Nazism  “Brown 
bolshevism”  and  affirmed  that  Russian  commu¬ 
nism  was  inherently  the  most  expanding  and 
aggressive  form  of  totalitarianism. 

After  1941,  however,  when  the  Soviet  Union 
was  attacked  by  Nazi  Germany  and  hence  joined 
the  Allied  forces  in  World  War  II,  the  tendency 
prevailed  among  anti-Fascists  not  to  talk  about 
Joseph  Stalin’s  regime  as  totalitarian,  one  excep¬ 
tion  being  Sigmund  Neumann’s  book  Permanent 
Revolution:  Totalitarianism  in  the  Ages  of  Inter¬ 
national  Civil  War,  published  in  1942.  Neumann 
put  forward  the  most  systematic  and  in-depth 
comparative  analysis  of  the  totalitarian  regimes  of 
the  time.  Taking  into  account  earlier  interpreta¬ 
tions  of  the  roots,  features,  and  structure  of  totali¬ 
tarianism — the  legacy  of  the  Great  War,  mass 
democracy,  single-party  rule,  the  new  elite,  the  cult 
of  the  leader,  state  control  of  the  economy,  the 
militarization  of  society,  the  mobilization  of  the 
masses,  political  religion — Neumann  systematized 
them  into  a  new  comprehensive  theoretical  frame¬ 
work,  adding  new  concepts  to  define  totalitarian¬ 
ism:  the  revolutionary  dictatorial  party,  the  per¬ 
manent  revolution,  and  the  international  civil 
war.  The  revolutionary  dictatorial  party  was 
antidemocratic  in  essence,  imbued  with  a  quasi¬ 
religious  missionary  spirit  and  claiming  an  exclu¬ 
sive  identification  with  a  collectivity,  the  nation, 
the  proletariat,  or  the  race.  It  was  driven  by  its 
own  ideological  dynamism  to  acquire  monopoly 
of  power,  to  build  a  one-party  state,  and  to  organize, 


regiment,  and  mobilize  the  masses  in  order  to 
regenerate  them  and  create  “a  new  man,”  aiming 
also  at  extending  its  revolution  through  territorial 
and  ideological  expansionism,  therefore  promot¬ 
ing  an  international  civil  war  to  achieve  its  goals. 
While  emphasizing  significant  structural  similari¬ 
ties,  Neumann  also  stressed  that  communism,  fas¬ 
cism,  and  Nazism  must  be  differentiated  through 
their  historical  background,  ideological  orienta¬ 
tion,  social  mobilization,  and  ultimate  goals. 
Hence,  he  concluded,  a  full  definition  of  the  totali¬ 
tarian  regime  must  include  this  diversity,  avoiding 
therefore  any  sweeping  formula. 

Totalitarianism  in  the  Cold  War 

Following  the  fall  of  fascism  and  Nazism  in  1945, 
the  only  surviving  totalitarian  regime  was  the 
Soviet  Union,  which  never  accepted  being  labeled 
as  such.  After  the  breakdown  of  the  international 
antifascist  alliance  and  the  start  of  the  Cold  War, 
there  was  a  new  wave  of  interest  in  totalitarian¬ 
ism,  this  time  with  a  strong  anticommunist  orien¬ 
tation.  This  time,  because  of  the  Holocaust  and 
the  mass  killings  under  Stalin’s  regime,  terror  and 
mass  extermination  were  added  as  necessary  fea¬ 
tures  in  the  development  of  the  concept  of  totali¬ 
tarianism.  At  the  beginning  of  the  Cold  War,  the 
concept  was  widely  used  in  anticommunist  propa¬ 
ganda  to  identify  Stalin’s  regime  as  a  political 
“evil”  similar  to  Hitler’s  regime.  Two  major 
works  in  the  1950s  triggered  off  new  debates  on 
totalitarianism:  Hannah  Arendt’s  The  Origins  of 
Totalitarianism,  published  in  1951,  and  Carl  J. 
Friedrich  and  Zbigniew  K.  Brzezinski’s  Totalitarian 
Dictatorship  and  Autocracy,  published  in  1956, 
after  a  conference  on  totalitarianism  held  in 
March  1953  by  the  American  Academy  of  Arts 
and  Sciences,  whose  proceedings  were  published  1 
year  later. 

In  her  interpretation  of  totalitarianism,  Arendt 
echoed  many  concepts  developed  by  earlier 
scholars — the  atomization  of  society,  the  amor¬ 
phous  masses,  the  ruthless  elite,  and  the  perma¬ 
nent  revolution — while  stressing  the  role  of  an 
all-encompassing  ideology  and  the  subordination 
of  the  state  to  the  single  party  as  key  features  of 
the  totalitarian  regime.  Arendt  also  theorized  that 
the  totalitarian  regime  was  the  result  of  a  totali¬ 
tarian  movement,  but  she  argued  that  actually 
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only  the  Nazi  party  was  a  totalitarian  movement, 
while  denying  this  characterization  to  the  Bolshevik 
party.  She  considered  only  Germany  under  the 
Nazis  and  the  Soviet  Union  under  Stalin’s  rule  as 
examples  of  “true  totalitarianism,”  asserting  that 
Lenin’s  Bolshevik  party  and  the  Soviet  Union  were 
not  totalitarian  until  Stalin’s  rule.  Even  Mussolini’s 
regime,  according  to  Arendt,  was  not  totalitarian 
until  1938.  Nevertheless,  she  also  asserted  that 
only  if  Nazi  Germany  had  won  World  War  II 
would  Hitler  have  been  able  to  establish  a  fully 
developed  totalitarian  regime.  Arendt  considered 
that  the  essence  of  totalitarianism  was  mass  terror 
and  mass  extermination,  as  the  Holocaust  under 
the  Nazis  and  the  purges  under  Stalin’s  regime. 
Therefore,  she  defined  totalitarianism  in  its  final 
stages  as  an  absolute  evil.  Arendt’s  approach  to 
the  question  of  totalitarianism  was  more  philo¬ 
sophical  than  theoretical.  Notwithstanding  serious 
criticism  of  her  interpretation  of  totalitarianism 
based  on  what  were  claimed  to  be  its  many  short¬ 
comings  and  contradictions,  Arendt’s  work  had  a 
major  impact  on  the  debate  on  totalitarianism  in 
the  following  decades. 

More  theoretically  oriented,  Friedrich  and 
Brzezinski  analyzed  the  structure  of  the  totalitarian 
regimes  in  Russia,  Italy,  and  Germany,  classifying 
their  similarities  in  a  syndrome  of  six  basic  features: 

1.  an  official  ideology  that  covers  all  vital  aspects 
of  man’s  existence  and  is  projected  toward  a 
perfect  final  state  of  mankind; 

2.  a  single  party  led  by  one  man,  unquestionably 
dedicated  to  its  ideology,  hierarchically  and 
oligarchically  organized,  and  either  superior  to 
or  completely  intertwined  with  the  bureaucratic 
government  organization; 

3.  a  system  of  terroristic  police  control; 

4.  technologically  conditioned,  near -complete 
monopoly  control  by  the  party  of  all  means  of 
mass  communication; 

5.  similar  technologically  conditioned,  near- 
complete  monopoly  and  control  of  all  means  of 
effective  armed  combat;  and 

6.  central  control  and  direction  of  the  entire 
economy  through  bureaucratic  coordination  of 
its  formerly  independent  entities. 


Modernization  and  industrialization  were  also 
emphasized  by  Friedrich  and  Brzezinski  as  neces¬ 
sary  conditions  for  the  establishment  of  the  totali¬ 
tarian  regime. 

Friedrich  and  Brzezinski  were  criticized  because 
their  totalitarian  syndrome  was  considered  a 
descriptive  concept  instead  of  an  explanatory  the¬ 
ory,  while  historians  who  were  skeptical  about  the 
validity  of  the  concept  of  totalitarianism  put  into 
question  the  applicability  of  their  static  model  of 
similarities  to  the  complex  reality  of  the  so-called 
totalitarian  regimes,  obscuring  their  substantial 
differences  and  their  contradictory  reality. 

The  Eclipse  of  the  Concept  of  Totalitarianism 

Debates  on  totalitarianism  flourished  during  the 
Cold  War.  Among  the  totalitarian  regimes  then 
included  were  the  communist  states  established  in 
Eastern  Europe,  China,  and  North  Korea  after  the 
end  of  World  War  II,  as  well  as  Castro’s  regime  in 
Cuba  after  1959.  However,  after  Stalin’s  death 
and  the  condemnation  of  Stalin’s  cult  of  personal¬ 
ity  and  his  crimes  by  Soviet  Premier  Nikita 
Khrushchev  in  1956,  there  was  mounting  criticism 
of  the  concept  of  totalitarianism.  It  was  rejected  by 
communist  and  left-wing  scholars  as  a  mere  propa¬ 
ganda  slogan  to  denigrate  the  Soviet  Union  and  the 
communist  regimes  by  comparing  them  to  the 
Nazi  regime. 

The  use  of  the  concept  of  totalitarianism  came 
under  fierce  attacks  through  the  1960s.  It  is  sig¬ 
nificant  to  note  that  in  the  1967  edition  of  her 
book  on  totalitarianism,  Arendt  mentioned  the 
spread  of  totalitarian  governments  in  Eastern 
Europe  after  World  War  II  because  of  the  imposi¬ 
tion  of  bolshevism  by  Stalin,  but  she  also  asserted 
that  Stalin’s  death  was  followed  by  a  process  of 
detotalitarization.  She  recognized  also  that  the 
Chinese  Communist  Party’s  totalitarian  traits  have 
been  manifest  from  the  beginning  and  that  Mao 
Zedong’s  regime  was  characterized  by  mass  terror 
and  extermination,  but  she  denied  that  Communist 
China  was  totalitarian  because  she  felt  that  Mao’s 
was  not  a  criminal  mind  like  Stalin  and  Hitler. 

Attacks  on  the  concept  of  totalitarianism  went 
so  far  as  to  propose  that  the  name  itself  should  be 
banned  from  encyclopedia  entries  and  political  sci¬ 
ence  itself,  because  there  has  never  been  such  a 
thing  as  a  totalitarian  regime.  In  the  wake  of 
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Arendt’s  interpretation,  some  historians  asserted 
that  fascism  was  not  totalitarian;  other  historians 
made  similar  claims  about  the  Nazi  regime  and  the 
Soviet  Union,  denying  any  validity  to  the  concept 
of  totalitarianism.  The  more  traditional  and  generic 
term  authoritarianism  was  preferred  to  describe  all 
types  of  modern  dictatorships  and  one-party  states. 

In  the  following  decade,  there  was  actually  a 
sort  of  “eclipse  of  totalitarianism.”  One  scholar 
proposed  seriously  in  1976  that  we  should  cease  to 
speak  of  totalitarianism.  Ten  years  later,  another 
scholar  asked  ironically  if  there  has  ever  been  such 
a  thing  as  totalitarianism.  The  rejection  of  the  con¬ 
cept  meant  actually  the  rejection  of  the  thing,  thus 
producing  a  sort  of  conceptual  void  in  the  analysis 
of  the  variety  of  single-party  regimes  in  the  20th 
century,  though  new  experiences  of  domination, 
such  as  the  Khmer  Rouge  regime  established  in 
Cambodia  in  1976  to  1979,  did  replicate  the  fun¬ 
damental  traits  of  the  totalitarian  regime  in  its 
most  extreme  form — mass  extermination — to 
transform  human  beings  and  create  a  “new  man.” 
Many  features  of  totalitarianism  were  also  present 
in  many  secular  nationalist  one-party  states  in  the 
Islamic  world  and  the  Third  World:  the  cult  of  the 
leader,  an  armed  militia,  mass  mobilization,  a 
political  religion,  and  the  myth  of  regeneration. 

Going  Back  to  Reality:  New  Interpretations 

However,  since  the  late  1980s,  mainly  due  to  the 
fall  of  the  Soviet  Union,  there  has  been  a  revival  of 
interest  in  totalitarianism,  resulting  in  new 
approaches  to  the  question  of  totalitarianism  and 
new  interpretations  of  it.  Pioneers  in  the  revival  of 
the  concept,  even  during  its  eclipse,  were  scholars 
such  as  Hans  Buchheim,  Karl  D.  Bracher,  and 
above  all  Juan  J.  Linz.  They  dealt  with  the  defini¬ 
tion  of  totalitarianism  going  back  to  the  historical 
reality  and  revising  previous  theories  according  to 
the  result  of  new  research.  They  argued  that  the 
nature  of  the  totalitarian  regime  is  not  static  but 
dynamic  and  that  such  regimes  are  monistic  but 
not  monolithic.  Hence,  a  more  nuanced  interpreta¬ 
tion  of  the  relationship  between  the  leader  and  the 
party  and  between  the  party  and  the  state  arose 
from  their  revision  of  the  concept  of  totalitarian¬ 
ism.  While  stressing  the  prominence  of  police  con¬ 
trol  and  repression,  they  denied  that  terror  was  the 
essence  of  totalitarianism  and  argued  that  it  was 


not  sufficient  to  define  it.  Finally,  they  did  not 
underestimate  the  role  of  ideology,  mass  mobiliza¬ 
tion,  and  even  popular  enthusiasm  in  totalitarian 
politics  but  recognized  also  that  the  totalitarian 
claim  to  total  control  over  society  had  never  been 
achieved. 

Linz’s  definition  of  what  characterizes  a  regime 
as  totalitarian  has  been  fundamental  in  the  revi¬ 
sion  of  the  concept  of  totalitarian  regimes  and  its 
revival.  In  an  essay  published  first  in  1974  and 
then  in  2000  with  an  up-to-date  introduction,  Linz 
defines  a  regime  as  totalitarian  when  the  following 
characteristics  apply  and  only  when  they  occur 
together:  an  exclusive  ideology,  a  single  mass  party 
and  other  mobilization  organizations,  and  power 
concentrated  in  an  individual  and  his  or  her  col¬ 
laborators  that  is  not  accountable  to  any  large 
constituency  and  cannot  be  removed  by  legal, 
peaceful  means.  When  one  or  more  of  these  char¬ 
acteristics  are  missing,  the  regime  could  be  defined 
as  pretotalitarian  or  posttotalitarian. 

Since  the  early  1990s,  there  has  been  worldwide 
a  flourishing  of  new  discussion  about  totalitarian¬ 
ism.  New  historical  works,  mainly  those  on  Italian 
fascism,  have  also  contributed  to  renewing  sub¬ 
stantially  the  interpretation  of  totalitarianism.  The 
result  of  this  new  research  and  interpretation  can 
be  summarized  as  follows. 

The  totalitarian  regime  is  a  modern  experiment 
in  political  domination  undertaken  by  a  revolu¬ 
tionary  movement,  with  an  integralist  conception 
of  politics,  that  first  secures  total  power  monopoly, 
whether  by  legal  or  illegal  means,  and  then 
destroys  or  transforms  the  previous  regime  and 
constructs  a  new  state.  This  new  state  is  based  on 
a  single-party  regime  under  the  rule  of  a  charis¬ 
matic  leader  and  has  as  its  chief  objective  the 
conquest  of  society — that  is,  the  subordination, 
integration,  and  homogenization  of  the  ruled  by 
means  of  the  integral  politicization  of  existence, 
whether  collective  or  individual.  The  new  reality  is 
interpreted  according  to  the  categories,  myths,  and 
values  of  an  exclusive  and  all-encompassing  ideol¬ 
ogy,  institutionalized  in  the  form  of  a  political 
religion,  that  aims  to  shape  the  individual  and  the 
masses  through  an  anthropological  revolution, 
thus  regenerating  the  human  being  and  creating 
the  new  man,  dedicated  in  body  and  soul  to  the 
realization  of  the  revolutionary  and  imperialistic 
policies  of  the  totalitarian  party.  The  ultimate  goal 
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of  this  regime  is  to  create  a  new  civilization  along 
expansionist  and  supranational  lines. 

Defining  the  totalitarian  regime  as  an  experi¬ 
ment  is  intended  to  emphasize  that  totalitarianism 
is  a  continual  process  or  a  permanent  revolution-,  it 
is  a  method  of  rule  to  achieve  a  goal,  not  a  goal  in 
itself. 

Emilio  Gentile 
Sapienza-Universita  di  Roma 
Rome,  Italy 
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T  OT  ALIT  ARIANISM 


All  political  concepts  are  essentially  contested,  but 
some  are  more  contested  than  others.  This  is  cer¬ 
tainly  the  case  for  totalitarianism.  It  is  almost 
impossible  to  reach  a  consensus  based  on  an 
operational  definition  of  this  term.  Many  of  its 
users  apply  it  to  any  dictatorship,  autocracy,  or 
fanatical  belief  or  behavior.  Many  of  its  critics  see 
it  as  an  invention  of  Cold  War  propaganda,  aim¬ 
ing  to  identify  Nazism  and  communism  in  order 
to  delegitimize  the  latter  or  excuse  the  former. 
While  the  scientific  character  and  the  present  rel¬ 
evance  of  the  concept  of  totalitarianism  can  be 


challenged  legitimately,  the  discussion  must  first 
get  rid  of  the  two  temptations  mentioned  above.  If 
the  use  of  the  term  totalitarianism  is  to  be  justi¬ 
fied,  it  makes  sense  only  if  it  is  applied  to  a  regime, 
a  movement,  a  mentality,  and  an  aspiration  that 
goes  beyond  the  limits  of  politics  to  encompass 
the  whole  of  society,  culture,  and  human  life  by 
negating  the  autonomy  of  domains  such  as  sci¬ 
ence,  religion,  or  art.  Also,  whatever  the  propa¬ 
gandists  instrumentalization  during  the  Cold 
War,  it  is  simply  historically  wrong  to  see  in  it  the 
origins  of  the  concept.  In  this  entry,  its  origins, 
different  uses,  and  contemporary  relevance  are 
discussed. 

Origins  and  Definitions 

The  first  known  use  of  the  term  totalitarianism 
occurred  in  1922  in  Italy.  It  is  due  to  liberal  oppo¬ 
nents  of  Mussolini,  like  Piero  Gobetti  and  Giovanni 
Amendola,  who  were  accusing  him  of  wanting  to 
dominate  not  only  government  but  the  whole  of 
social  and  institutional  life — an  accusation  that 
Mussolini  proudly  accepted  by  proclaiming  his 
“ferocious  totalitarian  will.”  While  Italian  fascism 
was  in  fact  less  totalitarian  than  German  Nazism, 
it  made  abundant  use  of  the  term  the  total  state, 
while  the  German  Marshal  Ludendorff  was  pro¬ 
moting  the  idea  of  total  war.  As  for  totalitarianism 
as  such,  it  was  mainly  used  by  left-wing  German 
emigres  like  Emil  Lederer  in  his  “State  of  the 
Masses”  or  Sigmund  Neumann,  trying  to  pinpoint 
the  novelty  of  Nazism.  They  converged  with 
Catholic  authors  like  Waldemar  Gurian  and  Eric 
Voegelin  who  were  putting  forth  the  notion  of 
“political  religion”  to  indicate  the  ambition  of 
both  fascism  and  communism  to  bring  salvation 
on  earth  and  their  appeal  to  total  faith  and  total 
commitment,  similar  to  that  demanded  from  the 
members  of  a  sect. 

The  “totalitarian  syndrome”  continued  to  take 
shape  in  the  1930s,  centered  on  fascism  but,  more 
and  more,  involving  an  implicit  or  explicit  com¬ 
parison  with  communism.  It  included  the  three 
elements  of  an  ideology,  a  movement  or  a  party, 
and  a  regime.  The  ideology  characteristically  com¬ 
bined  an  interpretation  of  the  world  and  of  history, 
claiming  to  be  based  on  science  (a  racist  biology  in 
one  case,  scientific  materialism  in  the  other)  and  a 
quasi-religious  appeal.  The  movement  or  party 
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was  a  totally  devoted  elite.  The  regime  was  based 
on  a  complete  monopoly  of  power  and  a  claim  of 
total  unity,  typically  incarnated  in  one  leader,  and 
total  war  against  the  enemies  of  the  faith  and 
against  the  groups  (racial  in  one  case,  social  in  the 
other)  who  were  supposed  to  be  the  sources  of  all 
evil  and  who  had  to  be  eliminated  in  order  to  bring 
about  the  triumph  of  justice. 

For  some  authors,  like  the  French  philosopher 
Claude  Lefort,  the  essence  of  totalitarianism  is  the 
effort  to  bring  back  societies  based  on  modern 
individualism  and  differentiation  to  a  premodern 
unit  with  a  total  primacy  of  the  community,  albeit 
through  modern  means.  Others  emphasize  the 
other  side  of  the  coin:  the  attempt  at  totally  elimi¬ 
nating  an  enemy — the  inversion  of  the  Clausewitzian 
formula,  “War  is  the  continuation  of  politics  by 
other  means”  into  “Politics  is  the  continuation  of 
war  by  other  means.”  The  perfect  formulation 
would  be  that  of  Carl  Schmitt,  which  he,  ironi¬ 
cally,  applied  to  the  enemies  of  Nazism  rather  than 
to  Nazism  itself:  “Total  enemy,  total  war,  total 
state,”  the  total  enemy  being  the  key. 

Two  works  provide  the  central  formulation  of 
this  classical  notion  of  totalitarianism:  that  of 
George  Orwell  ( Animal  Farm  and  1984)  and 
Hannah  Arendt  (The  Origins  of  Totalitarianism). 
Orwell’s  lasting  contribution  is  above  all  the 
notion  of  total  surveillance  (“Big  Brother  is  watch¬ 
ing  you”),  the  manipulation  of  feelings  (with  the 
conclusion  of  1984:  “He  loved  Big  Brother”),  and 
even  more,  the  “newspeak,”  the  creation  of  a  new 
language  where  love  can  mean  hatred  and  peace 
means  war.  Hannah  Arendt’s  emphasis  is  on  the 
dynamics  of  ideology  and  terror  (the  latter  literally 
eliminating  the  groups  condemned  to  historical 
extinction  by  the  former),  which  become  an  irre¬ 
pressible  process  and  a  force  by  themselves,  beyond 
their  original  content  and  target.  Here,  too,  a 
“supersense”  is  created,  a  kind  of  “mad  logic” 
through  which  Hitler  diverts  precious  vehicles 
from  the  war  effort  to  the  extermination  of  the 
Jews  and  Stalin  purges  the  Soviet  elite,  including 
the  Red  Army  leadership,  on  the  eve  of  the  German 
invasion. 

A  third,  less  illustrious  but  more  academic  work 
has  nevertheless  become  a  classic  of  totalitarianism 
studies.  It  is  Carl  Friedrich  and  Zbigniew  Brzezin- 
ski’s  Totalitarian  Dictatorship  and  Autocracy.  It 
provides  a  number  of  criteria  that  distinguish  a 


totalitarian  regime  from  an  ordinary  tyranny. 
These  refer  to  the  existence  of  a  single  party;  a 
monopoly  on  ideological,  political,  cultural,  and 
economic  affairs;  and  the  use  of  modern  technol¬ 
ogy  for  a  degree  and  pervasiveness  of  control  and 
indoctrination  that  were  not  at  the  disposal  of 
earlier  autocracies.  Raymond  Aron  provides  an 
analogous  list  of  criteria,  but  he  also  makes  a  point 
that  is  more  important — that  the  totalitarian 
dynamic,  powerful  and  fearsome  as  it  is,  is  not  the 
only  one  at  work  even  in  societies  in  which  it  rules. 
Other  dynamics,  like  that  of  economic  develop¬ 
ment,  the  arms  race,  or  of  external  contacts,  which 
cannot  be  entirely  suppressed,  may  compete  with 
it  or  deviate  from  it. 

Earlier,  the  great  social  historian  Barrington 
Moore  had  distinguished  three  forces  in  Soviet 
evolution:  power  (leading  to  totalitarianism), 
rationality  (leading  to  bureaucracy  or  to  democ¬ 
racy,  or  to  both),  and  tradition  (leading  to  the 
reassertion  of  old  Russian  cultural  and  imperial 
patterns).  Similarly,  Alex  Inkeles  proposed  in  1966 
that  the  logic  of  totalitarianism  be  combined  with 
that  of  development  and  that  of  industrial  society. 
This  more  complex  and,  in  a  way,  more  modest 
approach,  is  in  any  case  more  able  to  respond  to 
the  barrage  of  criticism  aimed  at  the  key  notion  of 
totalitarianism,  particularly  toward  the  end  of  the 
Cold  War,  by  a  number  of  historians  and  social 
scientists. 

The  attacks  revolve  mostly  around  two  themes. 
First,  the  notion  of  totalitarianism  is  supposed  to 
gloss  over  the  fundamental  difference  between 
Nazism  (or  fascism)  and  Stalinism  (or  commu¬ 
nism).  It  seems  to  rely  on  an  essentialist  and  fixed 
view  of  politics,  negating  the  changes  that  have 
affected,  in  particular,  Soviet  communism  over  the 
years.  Clearly,  communism  and  fascism  come  from 
two  different  traditions.  As  the  Polish  philosopher 
Leszek  Kolakowski  put  it,  one  is  the  monstrous 
child  of  the  enlightenment,  the  other  the  mon¬ 
strous  child  of  romanticism.  While  both  react 
against  bourgeois  democracy,  the  one  does  it  in  the 
name  of  a  radical  equality,  the  other  in  the  name  of 
the  superiority  of  an  esthetic,  aristocratic,  or 
heroic  elite.  Both  were  propelled  by  World  War  I, 
but  one  by  a  pacifist  reaction  and  the  other  by  a 
cult  or  nostalgia  for  war.  Above  all,  one  was 
bound  to  alienate  the  entire  world  since  it  pro¬ 
claimed  the  national  and  racial  supremacy  of  one 
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people,  while  the  other  stood  for  universalism  as 
well  as  for  equality.  But  this  can  be  seen  as  one 
more  argument  in  favor  of  the  problem  raised  by 
totalitarianism:  How  can  a  universalistic,  egalitar¬ 
ian,  and  libertarian  doctrine,  whose  official  song 
proclaimed  the  refusal  of  any  supreme  savior,  be  it 
God  or  Caesar,  end  by  adopting  the  divinization  of 
one  man,  the  absolute  power  of  the  rulers,  and  a 
permanent  war  against  supposed  enemies,  all  prac¬ 
tices  that  seem  to  go  much  more  naturally  with 
Nazism? 

It  is  precisely  this  strange  convergence  that  is  the 
strong  point  of  Hannah  Arendt,  whose  analysis  is 
essentially  based  on  a  parallel  between  the  two 
extreme  periods  of  the  two  regimes — 1936  to  1939 
for  one  and  1941  to  1945  for  the  other.  Even 
there,  however,  there  are  enormous  differences. 
Nazi  totalitarianism  was  more  directly  murderous 
and  targeted  a  number  of  categories  (Jews,  Roma, 
homosexuals,  communists),  which  had  to  be 
destroyed,  whereas  normal  apolitical  Germans 
were  left  in  peace.  In  Russia,  totalitarianism  was 
extended  to  every  sphere  of  life  and  literally  any 
citizen  could  be  its  victim. 

Of  course,  the  end  was  very  dissimilar:  violent 
overthrow  from  outside  through  lost  war  in  one 
case  and  the  decay  of  ideology  and  terror  and  the 
considerable  reduction  of  its  role  over  time  in  the 
other.  This  leads  us  to  the  other  group  of  objec¬ 
tions:  about  the  possibility  of  change.  Of  course, 
nobody  knows  whether  de-Hitlerization  or  de- 
Polpotization  would  have  occurred  in  Germany  or 
in  Cambodia  after  the  death  of  their  respective 
leaders.  The  loss  of  self-confidence  in  their  power 
and  legitimacy,  which  obviously  occurred  in  the 
European  communist  elite,  or  the  attempt  to  com¬ 
bine  their  power  with  an  adoption  of  the  most 
extreme  form  of  capitalism,  as  in  the  Chinese  case, 
are  not  a  unique  evolution,  yet  who  can  say  that 
they  were  inevitable  in  each  case?  Two  pure  cases 
of  totalitarianism,  the  Maoist  and  the  Cambodian 
ones,  appeared  or  were  flourishing  at  a  time  when 
a  process  of  de-Stalinization  was  already  well 
under  way  in  the  Soviet  Union.  And  an  almost 
pure  totalitarian  fossil-like  North  Korea  survives 
to  this  day. 

At  any  rate,  the  end  of  the  Soviet  Union  sur¬ 
prised  the  critics  of  the  totalitarian  model  as  much 
as  its  advocates.  Most  of  the  former  believed  that 
the  Soviet  Union  would  reform  and  evolve  toward 


a  kind  of  social  democracy.  Most  of  the  latter 
thought  that  totalitarianism  could  never  change.  In 
fact,  it  did.  But  due  precisely  to  the  nature  of 
totalitarianism,  the  attempt  to  reform  it  led  to  its 
collapse. 

Contemporary  Relevance 

Where,  then,  are  we  now?  Is  totalitarianism  dead, 
or  alive  and  well?  The  most  likely  answer  is  that  it 
has  taken  a  postmodern  turn;  in  other  words,  its 
various  components  are  dispersed  and  carry  on 
independently  or  in  various  new  combinations.  It 
does  not  seem  likely  that  the  combination  of  a 
general  doctrine  and  a  revolutionary  practice 
claiming  to  be  scientific  but  appealing  to  religious 
feelings  and  commitment  can  rule  a  country  or  an 
empire  in  the  21st  century  as  in  the  20th.  Theocratic 
fundamentalism  seems  to  be  on  the  rise  rather  than 
“secular  religions,”  and  while  less  exposed  to  veri¬ 
fication  and  disappointment  than  the  latter,  it 
seems  even  less  able  to  adapt  to  modern  society. 
Totalitarian  regimes  in  their  pure  form — like 
North  Korea — do  survive  for  the  time  being  but 
ever  more  precariously.  Genocidal  regimes  like 
that  of  the  Khmer  Rouge,  collective  genocides  like 
in  Rwanda,  mad  dictators,  or  oppressive  military 
regimes  are  likely  to  appear  and  disappear  with 
catastrophic  results,  but  they  are  more  likely  to  be 
based  on  nationalism,  ethnicity,  religion,  or  old- 
fashioned  personal  tyranny  than  to  be  or  remain 
specifically  totalitarian,  unless  the  progress  of  biol¬ 
ogy  enables  a  government  to  fulfill  one  old  totali¬ 
tarian  dream:  that  of  changing  human  nature  and 
creating  a  posthuman  society  that  would  be  totally 
controlled.  But  for  the  time  being,  this  possibility 
still  belongs  to  science  fiction  rather  than  political 
analysis. 

The  same  cannot  be  said,  with  the  same  proba¬ 
bility,  of  totalitarian  movements.  The  combination 
of  fanaticism  and  technology,  to  use  George  W. 
Bush’s  formulation,  in  movements  that  attempt  to 
use  science  and  technology  to  destroy  the  existing 
order,  or  even  the  planet  itself  in  a  grand  apoca¬ 
lypse,  is  certainly  a  possibility.  The  Aum  Shinrikyo 
sect  in  Japan,  with  a  membership  that  includes 
scientists  and  engineers,  trying  to  destroy  Tokyo  in 
order  to  hasten  or  forestall  the  end  of  the  world, 
may  be  a  harbinger  of  things  to  come.  But  should 
one  call  it  totalitarian? 
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More  broadly,  should  one  call  global  terrorism 
or  a  movement  like  A1  Qaeda  totalitarian?  Or 
should  we  reserve  the  term  for  the  historical  phe¬ 
nomenon  born  in  the  20th  century  and  assuming  a 
specifically  modern  character?  Both  choices  are 
legitimate,  and  depend  on  whether  one  adopts  a 
narrow  and  precise  or  a  broad  and  approximate 
definition  of  totalitarianism.  What  seems  certain, 
however,  is  the  permanence  or  the  periodic  resur¬ 
gence  of  a  third  element — totalitarian  aspirations 
and  passions.  The  wish  to  be  God;  the  wish  to 
eliminate  one’s  enemies;  the  wish  for  absolute 
security  or  for  “escaping  from  freedom”;  passions 
such  as  compassion  or  indignation,  which  can  lead 
to  hatred  and  revenge;  and,  above  all,  the  search 
for  sacrifice  and  for  identifying  with  a  great  cause 
or  conversely  the  search  for  stability  and  for  defin¬ 
itive  answers  in  an  ever-changing  and  complex 
world — all  these  are  permanent  possibilities  of 
human  nature.  But  they  are  particularly  likely 
reactions  to  a  regime  like  liberal  democracy  and  an 
era  like  that  of  globalization,  both  of  which  gener¬ 
ate  anxiety  because  they  deprive  us  of  the  old  cer¬ 
tainties  about  the  fate  of  the  individual  and  of  the 
world. 

Pierre  Hassner 
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Paris,  France 
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Trade  Liberalization 


Trade  liberalization  refers  to  the  opening  up  of 
markets  to  foreign  imports  and  entails  the  abolish¬ 
ment  or  reduction  by  governments  of  tariff  and 
nontariff  barriers  (quotas,  standards,  etc.)  that 
limit  trade  in  goods  or  services  across  countries. 
Governments  can  liberalize  trade  unilaterally  or 
on  a  reciprocal  basis  in  negotiations  with  other 
countries  and  can  apply  any  cuts  in  a  preferential 
or  a  nonpreferential  manner.  Preferential  liberal¬ 
ization  removes  barriers  to  imports  only  from 
selected  countries  and  can  lead  to  the  establish¬ 
ment  of  free  trade  areas  and  customs  unions, 
whereas  nonpreferential  liberalization  applies  to 
all  countries  that  are  covered  by  a  most  favored 
nation  clause.  The  modern  international  trading 
system  built  around  the  World  Trade  Organization 
is  based  on  the  nonpreferential  liberalization  of 
trade  and  thus  limits  member  countries’  discretion 
to  liberalize  trade  preferentially.  A  country  that 
abolishes  all  its  trade  barriers  engages  in  free 
trade.  This  entry  first  presents  a  brief  history  of 
major  moves  toward  international  free  trade  and 
then  discusses  the  possible  causes  and  conse¬ 
quences  of  trade  liberalization. 

Most  major  European  countries  engaged  in 
trade  liberalization  from  the  mid- 19th  century 
until  the  start  of  World  War  I,  producing  the  rapid 
growth  in  international  trade  that  was  a  key  fea¬ 
ture  of  the  first  phase  of  globalization.  By  contrast, 
nearly  all  countries  increased  barriers  to  trade  in 
the  interwar  years,  leading  to  a  collapse  of  interna¬ 
tional  trade.  The  process  of  trade  liberalization 
was  restarted  by  the  United  States  in  1934,  when 
Congress  passed  the  Reciprocal  Trade  Agreements 
Act  that  conferred  authority  on  the  president  to 
agree  on  reciprocal  tariff  cuts  with  other  countries, 
which  entered  into  force  without  requiring  further 
congressional  approval.  After  the  end  of  World 
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War  II,  other  developed  countries  joined  the  pro¬ 
cess  of  trade  liberalization  and  agreed  to  repeated 
negotiations  in  the  framework  of  the  General 
Agreement  on  Tariffs  and  Trade  (1947-1994). 

While  initially  these  trade  rounds  yielded  only 
minor  tariff  cuts,  the  Kennedy  Round  (1964- 
1967),  which  was  launched  as  a  response  to  the 
creation  of  the  European  Economic  Community 
(1958),  led  to  a  reduction  of  average  tariffs  by 
about  one  third.  Overall,  the  multilateral  negotia¬ 
tions  caused  a  drop  in  the  average  tariffs  on  indus¬ 
trial  goods  applied  by  developed  countries  from 
more  than  40%  at  the  end  of  World  War  II  to  less 
than  4%  after  the  implementation  of  the  cuts 
agreed  on  in  the  Uruguay  Round  (1986-1994). 
For  some  time,  these  tariff  reductions  were  partly 
offset  by  the  imposition  of  nontariff  barriers  to 
trade  and  the  use  of  administrative  trade  instru¬ 
ments,  such  as  antidumping  duties.  Since  the 
1960s,  however,  a  series  of  international  agree¬ 
ments  have  also  imposed  limits  on  the  use  of  these 
instruments  for  protectionist  purposes. 

Throughout  most  of  the  postwar  period,  devel¬ 
oping  countries  hardly  participated  in  the  process 
of  trade  liberalization,  not  least  because  they 
received  one-sided  preferences  from  developed 
countries,  which  gave  them  little  incentive  to 
reduce  their  own  barriers.  In  the  1980s  and  1990s, 
however,  many  of  them  undertook  partly  substan¬ 
tial  unilateral  tariff  cuts  when  moving  away  from 
import-substituting  industrialization  policies. 
Developing  countries  have  also  become  eager  par¬ 
ticipants  in  the  proliferation  of  preferential  trade 
agreements  that  characterize  the  current  interna¬ 
tional  trading  system.  There  has  been  little  progress 
in  multilateral  trade  liberalization  since  the  mid- 
1990s,  and  therefore,  it  seems  safe  to  conclude  that 
international  trade  is  currently  as  free  as  it  has  ever 
been,  which  also  explains  the  rapid  growth  in  inter¬ 
national  trade  over  the  last  few  decades.  Variation 
in  trade  barriers  across  countries  and  sectors  obvi¬ 
ously  still  persists:  on  average,  developed  countries 
have  lower  trade  barriers  than  developing  coun¬ 
tries,  and  industrial  goods  and  services  face  lower 
trade  barriers  than  agricultural  goods. 

Causes  of  Trade  Liberalization 

While  the  economic  case  for  free  trade  is  very 
strong,  trade  liberalization  is  puzzling  from  a 


political  science  perspective  given  that  politicians 
have  short-term  incentives  to  impose  barriers  to 
trade.  These  incentives  derive  from  the  fact  that 
although  overall  a  country  is  expected  to  gain  from 
freer  trade,  trade  liberalization  imposes  costs  on 
some  sectors  of  the  society — generally,  the  scarce 
factor  of  production  and  producers  of  goods  and 
providers  of  services  that  face  competition  from 
imports.  A  political  economy  model  of  trade  pol¬ 
icy  making  expects  these  losers  to  defend  their 
interests — namely,  averting  trade  liberalization — in 
the  policy-making  process,  for  example,  by  lobby¬ 
ing  and  voting  for  parties  or  politicians  that  prom¬ 
ise  protectionist  policies.  By  contrast,  consumers, 
who  benefit  from  the  lower  prices  on  imported 
goods  and  services  resulting  from  trade  liberaliza¬ 
tion,  are  unlikely  to  become  politically  active  to 
defend  their  interests  because  of  the  diffuse  nature 
of  these  benefits. 

The  puzzle  of  trade  liberalization  in  the  face  of 
short-term  political  incentives  to  provide  protec¬ 
tion  has  motivated  a  large  political  science  litera¬ 
ture  that  stresses  factors  such  as  hegemony  and 
geopolitics,  societal  demands,  political  institutions, 
and  ideas  and  learning.  One  of  the  earliest  expla¬ 
nations  provided  for  trade  liberalization  empha¬ 
sized  the  existence  of  a  hegemonic  power  in  the 
international  system  that  may  either  have  an  inter¬ 
est  in  an  open  trading  system  or  provide  openness 
as  a  public  good.  A  hegemonic  power’s  interest  in 
free  trade  may  stem  from  its  ability  to  reap  the 
traditional  gains  resulting  from  trade  while  at  the 
same  time  avoiding  the  associated  costs  because 
the  large  internal  market  cushions  the  distribu¬ 
tional  effects  of  trade.  A  hegemonic  power,  more¬ 
over,  may  open  its  market  in  pursuit  of  geopolitical 
objectives,  such  as  strengthening  allies,  creating 
relations  of  asymmetric  dependence,  and  fostering 
security  alliances.  A  prominent  explanation  for  the 
liberalization  of  trade  after  World  War  II,  conse¬ 
quently,  highlights  the  hegemonic  position  of  the 
United  States  at  that  time  and  its  geopolitical 
rivalry  with  the  Soviet  Union. 

Another  school  of  thought  looks  for  a  change 
in  the  balance  of  societal  demands  when  trying  to 
explain  trade  liberalization.  Following  this  rea¬ 
soning,  the  rise  of  multinational  companies, 
which  benefit  from  low  trade  barriers  when  ship¬ 
ping  inputs  across  borders  and  selling  finished 
goods  abroad,  may  have  given  impetus  to  the 
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liberalization  of  trade  after  World  War  II.  Moreover, 
an  increase  in  the  number  of  firms  that  can  take 
advantage  of  economies  of  scale  should  foster  trade 
liberalization  since  these  firms  are  likely  to  be  com¬ 
petitive  on  world  markets.  As  the  United  States  is 
the  home  country  to  both  many  multinational 
companies  and  firms  that  benefit  from  economies 
of  scale,  these  arguments  may  explain  why  that 
country  was  at  the  forefront  of  trade  liberalization 
after  World  War  II.  At  the  same  time,  these  argu¬ 
ments  have  relatively  little  to  say  about  trade  liber¬ 
alization  in  the  second  half  of  the  19th  century, 
when  neither  multinationals  nor  economies  of  scale 
played  a  major  role.  Equally  based  on  societal 
demands  is  the  argument  that  trade  liberalization 
may  be  a  reaction  to  the  trade  policies  chosen  by 
other  countries.  If  the  trade  policies  of  a  country 
reduce  foreign  exporters’  market  access,  these 
exporters  are  likely  to  react  and  lobby  their  govern¬ 
ment  to  engage  in  negotiations  with  the  foreign 
country  with  the  aim  of  eliminating  the  obstacles 
they  face.  These  negotiations,  in  turn,  may  lead  to 
a  liberalization  of  trade.  Finally,  an  initial  step 
toward  trade  liberalization  may  have  feedback 
effects  that  strengthen  free  trade  forces.  By  bolster¬ 
ing  exporters  and  weakening  import  competitors,  it 
should  change  the  balance  of  societal  demands  in  a 
way  that  favors  further  liberalization. 

Trade  liberalization  may  also  be  a  result  of 
changes  in  political  institutions.  At  the  domestic 
level,  an  increase  in  the  size  of  electoral  districts,  for 
example,  should  undermine  the  influence  of  protec¬ 
tionist  forces  by  exacerbating  their  collective  action 
problems  and  thus  pave  the  way  for  freer  trade 
policies.  Moreover,  democratization  may  lead  to 
more  liberal  trade  policies  in  countries  in  which 
labor  is  abundant.  The  reasoning  behind  this  argu¬ 
ment  is  that  in  labor-abundant  countries,  workers 
are  expected  to  benefit  from  more  trade;  therefore, 
inclusion  of  their  preferences  in  the  political  process 
should  make  that  country’s  trade  policy  more  lib¬ 
eral.  At  the  international  level,  the  existence  of  an 
international  regime  that  facilitates  trade  coopera¬ 
tion  among  countries  by  reducing  the  transaction 
costs  associated  with  trade  negotiations  and  pro¬ 
viding  a  commitment  device  may  also  lead  to  trade 
liberalization.  The  international  trading  regime 
after  World  War  II  may  not  only  have  resolved 
intergovernmental  bargaining  problems  but  also 
reduced  domestic  opposition  to  trade  liberalization 


by  enabling  the  coexistence  of  relatively  free  trade 
and  domestic  compensation — a  system  that  has 
become  known  as  embedded  liberalism. 

Ideas  and  learning  provide  for  final  potential 
causes  of  trade  liberalization.  In  the  second  half  of 
the  19th  century,  the  spread  of  the  idea  that  free 
trade  is  beneficial  for  all  countries  may  have  moti¬ 
vated  the  abolition  of  trade  barriers,  first  in  the 
United  Kingdom  and  later  in  the  rest  of  Europe.  In 
the  second  half  of  the  20th  century,  decision  mak¬ 
ers  may  have  been  influenced  by  their  perception 
that  the  protectionist  policies  in  the  interwar  years 
had  contributed  to,  and  had  made  worse  the  con¬ 
sequences  of,  the  Great  Depression.  Most  recently, 
developing  countries  may  have  reacted  to  the  per¬ 
ceived  failure  of  import  substitution  industrializa¬ 
tion  policies  when  enacting  relatively  liberal  trade 
policies  from  the  1980s  onward.  In  this  last  case, 
economic  ideas  may  also  have  become  causally 
relevant  because  they  were  carried  by  international 
organizations,  such  as  the  International  Monetary 
Fund,  which  applied  them  when  arranging  for  con¬ 
ditionality  agreements  with  debtor  countries. 

Consequences  of  Trade  Liberalization 

Whatever  the  causes  of  trade  liberalization,  a  step 
toward  free  trade  is  likely  to  have  consequences  for 
the  distribution  of  wealth  in  a  country  and  the 
political  organization  of  that  country.  With  respect 
to  the  distribution  of  wealth,  trade  liberalization 
produces  winners  and  losers  in  a  society.  Unless  a 
government  redistributes  income,  this  will  lead  to 
greater  inequality — an  effect  that  is  confirmed  by  a 
series  of  empirical  studies.  What  is  more,  some 
authors  have  suggested  that  free  trade  will  under¬ 
mine  a  country’s  capacity  to  redistribute  income, 
since  an  open  trading  system  makes  it  easier  for 
companies  to  relocate  to  countries  with  lower 
taxes  and  less  welfare  spending.  The  evidence  con¬ 
cerning  this  expectation,  however,  is  at  best  mixed. 

Trade  liberalization  may  also  have  an  impact  on 
the  political  organization  of  countries,  especially  if 
an  initial  move  toward  freer  trade  makes  them 
dependent  on  continued  openness.  A  country  that 
becomes  dependent  on  relatively  free  trade  may 
require  changes  in  the  political  structure  that  lock 
in  liberal  policies.  This  may  be  achieved,  for  exam¬ 
ple,  by  adopting  corporatist  practices,  with  strong 
domestic  compensation  mechanisms.  Alternatively, 
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countries  may  insulate  decision  makers  from  pro¬ 
tectionist  pressures  by  increasing  the  size  of  elec¬ 
toral  districts  and  moving  toward  proportional 
representation.  Trade  liberalization  may  even  fos¬ 
ter  political  disintegration.  By  allowing  small 
countries  to  benefit  from  a  large  market,  an 
increasingly  open  international  trading  system 
removes  one  of  the  incentives  for  their  continued 
stay  within  a  larger  federation.  This  reasoning  may 
provide  a  rationale  for  the  recent  increase  in  the 
number  of  sovereign  countries  in  the  world. 
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Traditional  Rule 


As  indicated  by  F.  Nyamnjoh  (2003)  in  the  1950s 
and  1960s,  modernization  theorists  predicted  that 
traditional  rule  would  soon  become  outdated  and 
would  be  replaced  by  “modern”  bureaucratic 


offices  and  institutions.  Nyamnjoh  further  con¬ 
tends  that  even  underdevelopment  and  depen¬ 
dency  theorists  did  not  seem  to  give  chiefs  and 
chieftaincy  much  of  a  chance;  they  saw  chieftaincy 
as  lacking  in  mobilizational  ability  for  social  and 
political  change.  However,  political  scientists, 
sociologists,  historians,  and  educated  traditional 
leaders  in  Africa  acknowledge  the  resilience  of 
chieftaincy  institutions  (cf.  Nyamnjoh,  2003,  pp. 
233-250;  Chief  Linchwe  II,  1989,  pp.  99-102). 
Today,  new  scholarship  in  Southern  Africa  such 
as  that  by  Lungisile  Ntsebeza  (2005)  is  paying 
attention  to  chieftaincy,  intrigued  by  the  manner 
in  which  the  institution  remains  part  of  the 
cultural  and  political  landscape  constantly  negoti¬ 
ating  and  renegotiating  its  position  within  the 
modern  institutions  of  countries  such  as  Botswana. 
This  entry  examines  the  place  of  traditional  rule  in 
Africa,  using  Botswana  as  an  example. 

Botswana  enjoys  a  good  reputation  for  its 
record  of  good  government  and  democracy.  Over 
decades,  Botswana  has  demonstrated  to  the  world 
an  immense  socioeconomic  strength  driven  by  a 
combination  of  modern  and  conservative  tradi¬ 
tional  leadership  despite  diversification  of  socio¬ 
economic  bases.  The  country’s  leadership  has  been 
able  to  face  and  overcome  challenges  of  great  mag¬ 
nitude,  such  as  the  scourge  of  HIV/AIDS.  Several 
scholars  have  attributed  the  country’s  pillars  of 
modern  democracy  and  ability  to  withstand  21st- 
century  socioeconomic  challenges  to  its  traditional 
government  system  (cf.  G.  Somolekae  &  M.  H. 
Lekorwe,  1998).  In  Botswana,  there  are  intangible 
but  identifiable  group  sensibilities  toward  demo¬ 
cratic  discourse,  a  discourse  that  is  situated  within 
culture,  leadership,  and  development.  In  this  entry, 
traditional  rule  in  Botswana  and  its  contemporary 
methods  for  avoiding  and  resolving  conflicts  in  the 
past  and  the  present  are  examined.  The  role  of 
chieftaincy  (bogosi)  and  the  kgotla  system  is 
emphasized.  Bogosi  and  the  kgotla  institution  are 
deeply  ingrained  in  Botswana  society,  and  the 
chiefs  ( dikgosi )  are  a  critical  component  of  chief¬ 
taincy.  In  fact,  one  cannot  conceive  of  chieftaincy 
without  a  chief  who  is  placed  at  society’s  helm.  A 
few  relevant  examples  will  be  cited  as  illustration 
for  Botswana’s  historic  and  contemporary  tools  for 
dialogue,  consultation,  and  mediation.  However, 
the  entry  does  recognize  limitations  that  still  exist 
within  the  bogosi  institution. 
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On  several  occasions,  the  kgotla  institution  has 
been  cited  in  contemporary  studies  as  an  institu¬ 
tion  that  encourages  mutual  knowledge,  apprecia¬ 
tion,  respect,  and  tolerance  (cf.  Somolekae  & 
Lekorwe,  1998). 

Bogosi  and  the  Kgotla:  Agents  of 

Sustainable  Democracy  and  Development 

One  important  feature  of  the  precolonial  Tswana 
traditional  rule  was  the  use  of  tribal  assemblies  or 
kgotlas  and  the  conduct  of  public  affairs,  including 
the  tradition  of  peacemaking,  that  took  place 
through  the  kgotla.  Tswana  institutions  of  gover¬ 
nance,  such  as  bogosi  (chieftainship)  and  the 
kgotla  system,  have  continued  to  be  relevant  to  the 
lives  of  Batswana. 

Earlier  studies  suggested  that  the  kgotla  had 
been  a  central  element  in  the  traditional  machinery 
of  Tswana  government.  Kofi  Darkwah  (1996) 
argues  that  before  1966,  the  kgotla  may  be  said  to 
have  been  not  just  a  public  meeting  place  but  also 
the  seat  of  the  government  or  administration  of  the 
community.  As  Isaac  Schapera  (1994)  notes,  the 
central  kgotla  of  the  kgosi  was  the  public  assembly 
where  he  not  only  delivered  judgments  and  laws 
but  also  listened  to  the  people  when  matters  of 
public  interest  were  discussed.  Chiefs  (dikgosi) 
presided  in  large  kgotla  meetings  (pitso  or 
phuthego),  where  after  lengthy  discussion  in  which 
“all”  could  participate,  the  Kgosi  had  the  right  to 
make  final  decisions  on  all  matters.  The  chief  exer¬ 
cised  authority  over  a  large  area  (a  district  today) 
through  a  hierarchy  of  relatives  and  officials, 
including  close  advisers  and  ward  headmen  (P.  M. 
Mgadla  &  A.  Campbell,  1989,  p.  49).  During  the 
19th  century,  BaNgwato  in  what  is  today  the  cen¬ 
tral  district  and  BaTawana  of  Ngamiland  (north¬ 
west  district)  extended  their  political  control  over 
other  groups  considered  of  lower  status  than  the 
Tswana  dominant  groups.  The  so-called  minority 
groups  included  San/Basarwa,  BaYei,  Bambukushu, 
BaKalanga,  BaBirwa,  and  BaTswapong.  BaNgwato 
and  BaTawana  were  among  those  who  considered 
themselves  superior  to  others  because  of  their 
strong  political  and  administrative  system,  which 
included  monarchy,  as  well  as  courts  of  laws  and 
an  economy  based  on  large-scale  animal  hus¬ 
bandry.  In  both  the  Ngwato  and  Tawana  hierar¬ 
chical  system  of  government,  the  whole  area  under 


their  thraldom  was  divided  into  “counties”  with 
senior  Tswana  each  in  charge  as  chief’s  representa¬ 
tives  (baemedi  ba  kgosi).  The  chief’s  representa¬ 
tives  would  supervise  their  areas  and  ensure  that 
the  paramount  chief’s  (kgosi  e  kgolo’s)  property, 
such  as  cattle,  ivory,  and  furs,  was  properly 
guarded.  Representatives  of  the  constituent  wards 
expressed  their  opinions  at  the  main  kgotla  on 
behalf  of  their  charges.  The  constituent  ward  far 
from  the  administrative  center  of  the  traditional 
rule  also  had  its  own  small  kgotla,  which  gave 
certain  semiautonomy  from  the  paramount  chief 
and  invested  the  group  with  the  authority  to  regu¬ 
late  some  of  its  affairs  and  to  address  the  chief. 

Although  the  subordinate  kgotla  did  not  have 
the  power  to  legislate,  they  formed  part  of  the  tra¬ 
ditional  government  machinery  in  the  sense  that 
they  were  part  of  the  administrative  and  judicial 
system  of  the  community.  Early  accounts  of  the 
Tswana  chief  portray  him  as  relying  a  great  deal  on 
consultation  with  leading  men,  both  agnates  and 
commoners,  hence  the  Tswana  aphorism  “ Kgosi 
ke  Kgosi  ka  batbo ”  [The  chief  is  the  chief  by  the 
grace  of  the  people].  Omulf  Gulbrandsen  (1995, 
pp.  415-444)  points  out  that  on  the  one  hand,  the 
Tswana  strongly  felt  the  need  for  an  effective  ruler, 
while  on  the  other  hand,  they  were  strongly  aware 
of  the  need  for  a  check  on  the  powers  entrusted  to 
him  as  expressed  in  the  above  proverb.  Pauline 
Peters  (1994,  p.  34)  argues  that  the  kgotla  embod¬ 
ied  the  consultative  aspect  of  ruling  authority; 
thus,  it  played  the  role  of  counterweight  to  central 
power.  The  great  kgotla  was  the  arena  where  the 
chief  promulgated  laws,  which  had  been  discussed 
in  detail  at  prior  meetings,  and  where  the  many  not 
included  in  the  select  circle  around  the  chief  were 
able  to  express  their  opinions  on  the  proposal.  In 
theory,  all  men  who  attended  the  kgotla  had  the 
right  to  air  their  views  without  fear  of  reprimand. 
The  dictum,  “ mmualebe  o  bua  la  gagwe”  or 
“ mahoko  a  kgotla  a  mantle  otlbe”  meant  that  no 
one  could  be  debarred  from  voicing  his  or  her 
opinion,  no  matter  how  unpopular. 

In  defining  the  contemporary  kgotla,  during  an 
interview,  Isaac  Tudor  (August  23,  2001)  of 
Kgosing  ward  in  Maun  (Ngamiland  district) 
describes  it  as  a 

democratic  institution,  you  are  free  to  say 

anything  from  your  mind,  it’s  your  freedom  of 
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speech,  and  nobody  will  send  you  to  court  for 
talking  nonsense.  Everyone  deserves  to  be  heard. 
You  are  bound  to  have  endless  Kgotla  meetings, 
depending  on  the  seriousness  of  the  subject 
discussed. 

It  is  common  that  each  male  speaker  randomly 
stands  up  to  speak.  In  what  is  considered  Tswana 
custom  and  tradition,  respectable  adult  men  wore 
hats  (which  they  had  to  remove  at  the  kgotla)-,  by 
contrast,  female  adults  had  to  cover  their  heads 
with  scarves  when  attending  the  kgotla. 

The  result  was  a  government  with  a  certain 
openness,  and  the  whole  idea  of  debate  was  recog¬ 
nized  as  an  important  part  of  policy  making.  While 
all  major  decisions  and  pronouncements  appear  to 
have  been  subjected  to  public  appraisal,  it  also 
seems  clear  that  once  a  decision  was  made,  it  was 
expected  to  be  observed  by  all.  Peters  (1994,  p.  35) 
has  described  the  kgotla,  therefore,  as  an  open 
forum  where  consensus  politics  decided  the  out¬ 
comes  of  debate.  Although  the  powers  of  the 
kgotla  were  advisory  only,  the  chief  and  his  offi¬ 
cials  could  use  that  forum  to  generate  a  consensus 
before  taking  action  and  to  sense  opposition  to 
particular  proposals. 

There  were  exceptions  to  this  openness.  L.  D. 
Ngcongco  (1989,  pp.  46,  58-74)  has  cited  the 
example  of  exclusions  in  Kweneng  where 
BaKgalagadi  were  not  as  a  rule  expected  to  speak 
at  a  kgotla,  even  though  they  were  free  to  attend. 
In  the  case  of  Ngamiland,  likewise,  subordinate 
groups  such  as  the  BaSarwa  (San)  felt  intimidated 
and  thus  found  it  difficult  to  stand  up  and  speak  at 
the  Maun  Kgotla.  Tswana  societies,  like  other 
African  societies,  did  not  consider  public  affairs  a 
domain  for  women  (Somolekae  &  Lekorwe,  1998, 
p.  187).  Like  other  subordinates,  women  were 
treated  as  children.  Thus,  as  a  rule,  they  did  not 
participate  in  kgotla  debates  unless  they  were 
invited  to  give  evidence  during  the  settlement  of 
disputes.  The  aggregate  functions  of  the  kgotla 
rendered  it  a  space  that  was  historically  a  male 
domain.  Only  during  the  colonial  period  were 
Western-educated  women,  such  as  Pulane  Moremi 
of  BaTawana  (1947-1964)  and  Gaogangwe 
Gaseitsiwe  Ntebogang  Ratshosa  of  BaNgwaketse 
(1924-1928),  made  regents.  Gradually,  after 
World  War  II,  nonroyal  women  found  their  place 
in  the  kgotla  and  could  speak  in  meetings. 


Chieftaincy  has  presented  the  strongest  chal¬ 
lenge  to  women’s  struggles  against  their  minor 
status  than  any  other  factor.  Arguments  have  been 
advanced  against  chieftaincy  in  Africa  with  the 
assumption  that  this  is  a  predominantly  male  insti¬ 
tution.  Gender  and  human  rights  activists  have 
called  for  reform  within  the  traditional  rule  struc¬ 
ture,  which  is  often  viewed  as  an  institution  of  an 
undemocratic  nature.  Although  chieftaincy  has 
been  dominated  by  men  in  the  past,  the  approval 
of  the  appointment  of  female  paramount  chiefs  by 
the  Botswana  Minister  of  Local  Government  in  the 
past  few  years  is  an  indication  that  traditional  rule 
has  reformed  and  that  the  institution  accommo¬ 
dates  women  as  capable  leaders.  The  ascension  of 
Kgosi  Mosadi  Seboko  to  head  a  kgotla  in  2002  is 
a  good  example  of  the  few  women  who  have  bro¬ 
ken  new  ground.  Her  official  installation  as  a 
chief,  draped  in  a  leopard  skin  like  other  male 
traditional  leaders,  marked  her  authority  among 
her  tribal  group,  the  Balete,  30  km  south  of  the 
capital,  Gaborone.  Seboko  became  the  first  woman 
to  serve  in  the  Ntlo  Ya  Dikgosi  mandated  to  advise 
both  the  government  and  parliament  on  issues 
relating  to  custom  and  tradition,  including  tribal 
property.  The  transformation  of  bogosi  was  noted 
by  several  speakers  during  this  historic  moment  in 
Botswana.  During  Kgosi  Mosadi  Seboko’s  inaugu¬ 
ration,  Athaliah  Molokomme,  the  first  Botswana 
female  Attorney  General  remarked, 

The  institution  of  chieftainship  and  the  installation 
of  Botswana’s  first  woman  paramount  chief 
cannot  be  seen  in  isolation  from  the  tremendous 
socioeconomic  changes  that  have  taken  place  in 
our  country  and  the  world.  (Mmegi,  2003) 

On  her  side,  Seboko  thanked  her  people  for 
support.  “You  were  able  to  transcend  the  gender 
imbalance  that  many  are  still  grappling  with,  and 
installed  me  not  because  I  am  a  woman,  but  rather 
on  the  basis  of  birthright  equity”  (Mmegi,  2003). 
Since  her  installation  as  paramount  chief,  her 
major  concern  has  been  the  increase  in  the  num¬ 
ber  of  disintegrating  families,  crime,  domestic 
abuse  and  violence,  unemployment,  and  HIV/ 
AIDS.  Nyamnjoh  (2003)  observes  that  these  new 
developments  in  Botswana  point  to  an  institution 
that  is  adaptable  and  negotiating  with  changing 
political  and  social  realities  in  the  country.  Since 
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Seboko’s  installation,  several  women,  such  as 
Keleatile  Moremi  of  Batawana,  are  now  claiming 
full  citizenry  as  provided  for  in  Botswana’s  consti¬ 
tution.  The  Bill  of  Rights  in  the  Constitution  sug¬ 
gests  that  actually  women  have  equal  rights  as 
men.  On  August  26,  2009,  Mosadi  Seboko  was 
recognized  at  Riverwalk  cinema  in  Gaborone, 
with  a  few  other  women,  through  a  Botswana 
home-produced  film  documentary  not  only  con¬ 
tributing  to  the  socioeconomic  and  political  devel¬ 
opment  of  Botswana  but  also  for  serving  as  a 
mentor  to  the  younger  generation. 

Another  visible  transformation  within  tradi¬ 
tional  rule  in  the  country  is  the  installation  of 
young  and  educated  chiefs  to  strengthen  the  voice 
in  the  Ntlo  ya  Dikgosi,  negotiating  their  way  into 
the  democratic  discourse.  In  the  past,  educated 
chiefs,  such  as  Kgosi  Seepapitso  IV  of  BaNgwaketse 
and  Kgosi  Linchwe  of  BaKgatla  II,  not  only  served 
as  traditional  authorities  within  the  confines  of  the 
then  House  of  Chiefs  (an  advisory  body  to  Parlia¬ 
ment  and  government  on  matters  pertaining  to 
Botswana  tradition  and  culture)  but  were  at  some 
point  assigned  by  the  government  to  serve  as  dip¬ 
lomats  in  countries  such  as  the  United  States  and 
China.  The  number  of  young,  educated,  and  elite 
paramount  chiefs  has  been  increasing  since  the 
installation  of  a  number  of  Tawana  II  of  BaTawana 
in  the  late  1990s.  Among  these  young  cadre  of 
chiefs  are  Lotlamoreng  II  of  BaRolong  (2002); 
Kgari  Sechele  II  of  BaKwena  (2002),  who  was 
enthroned  chief  of  Puso  Gaborone  of  BaTlokwa 
(2006);  the  new  chairman  of  Ntlo  ya  Dikgosi;  and 
more  recently  Kgafela  II  of  BaKgatla  (2008). 

In  2009,  Kgosi  Tawana  II,  who  had  resigned 
from  chieftainship  to  stand  for  public  office,  was 
elected  to  Parliament.  As  men  such  as  Tawana 
move  from  traditional  authority  into  modern  poli¬ 
tics,  they  remain  royalty  and  carry  the  prestige  of 
being  Kgosi  into  Botswana’s  modern  politics.  Like 
the  first  president  of  Botswana,  Seretse  Khama, 
and  his  son  and  current  president,  Seretse  Ian 
Khama,  their  relative  Tawana  is  likely  to  enjoy  a 
legitimacy  drawn  from  his  position  as  former  chief 
of  the  dominant  BaTawana  tribe.  Ian  Taylor  has 
observed  that  the  electorate  of  Botswana  is  still 
inclined  toward  a  traditionalist  culture  of  respect 
for  authority,  and  this  may  explain  why  Kgosi 
Bathoen  II  of  BaNgwaketse  in  1969  played  a 
dominant  role  in  local  politics  and  won  support 


for  the  opposition  Botswana  National  Front  Kanye 
South  constituency.  The  well-educated  but  non¬ 
royal  Quett  Ketumile  Masire,  a  Botswana  Demo¬ 
cratic  Party  candidate,  lost  the  constituency  to 
Bathoen  II.  The  Kanye  South  constituency  was  in 
the  hands  of  the  opposition  for  a  very  long  time.  In 
his  recent  biography,  Masire  (2006,  p.  103)  con¬ 
tends  that  in  the  mid-1950s,  tradition  or  authority 
was  often  used  to  undermine  the  emerging  non¬ 
royal  elites.  However,  it  is  worth  noting  that  when 
the  Chieftainship  Act  was  introduced  in  1965,  it 
meant  that  power  was  granted  to  the  president  to 
recognize  or  not  recognize  a  traditional  ruler,  mak¬ 
ing  all  chiefs  subordinate  to  central  government. 
Ntlo  ya  Dikgosi  has  no  legislative  powers,  and  the 
positions  of  chiefs  are  dependent  on  recognition  by 
the  state. 

Transformation  in  chieftaincy  in  Botswana 
should  also  be  understood  in  the  context  of  ethnic 
minority  representation  in  the  Ntlo  ya  Dikgosi.  In 
2006,  the  President  of  Botswana  Festus  Mogae 
toured  the  country  to  address  and  consult  Batswana 
on  issues  of  bogosi  and  the  reconstitution  of  the 
House  of  Chiefs  to  become  Ntlo  ya  Dikgosi.  The 
president’s  tour  was  a  follow-up  to  the  Balopi 
Commission,  which  had  been  assigned  earlier  to 
undertake  a  national  consultation  exercise  to  come 
up  with  proposals  for  change  that  would  have  a 
profound  impact  on  national  unity  (Republic  of 
Botswana,  2000,  pp.  93-110).  After  independence 
(1966),  some  sections  of  Botswana  society  repeat¬ 
edly  voiced  concern  at  the  inadequacy  of  the  consti¬ 
tution  in  addressing  the  aspirations  of  some  sections 
of  the  nation.  According  to  President  Mogae,  “The 
House  of  Chiefs,  as  it  was  then  known,  was  singled 
out  as  a  classic  symbol  of  inequality  as  it  was  per¬ 
ceived  to  be  an  exclusive  club  of  some  tribes”  (Piet, 
2007).  He  observed  that  this  did  not  go  well  with 
some  citizens  of  Botswana  and  viewed  this  as  a 
threat  to  national  unity  and  lack  of  political  will  for 
democratization  and  political  development.  The 
inequality  that  existed  then  had  to  be  addressed  by 
amending  sections  77,  78,  and  79  of  the  Consti¬ 
tution  of  Botswana  in  order  to  render  them  tribally 
neutral.  On  February  1,  2007,  Botswana’s  historic 
opening  of  the  first  meeting  of  the  Ntlo  ya  Dikgosi 
was  held  at  which  20  new  members,  including  a 
few  women,  were  welcomed  to  the  house.  The 
forum  of  communication  for  these  presidential 
deliberations  was  the  kgotla. 
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Conclusion 

Bogosi  and  the  kgotla  have  gradually  evolved  as 
institutional  agents  of  democratization  and  develop¬ 
ment.  Since  independence,  chiefs  have  become  active 
agents  of  government.  However,  traditional  rule  has 
displayed  remarkable  dynamism  and  adaptiveness 
to  new  socioeconomic  and  political  developments  in 
the  country.  Traditional  instituitional  functionaries 
have  over  time  learned  to  adapt  themselves  in 
accessing  and  using  resources  such  as  political 
power  and  education  in  building  sustainable  change 
in  governance.  It  is  worth  emphasizing  that  to  pro¬ 
mote  interaction  leadership  and  a  culture  of  peace 
and  to  support  actions  that  foster  understanding, 
tolerance,  and  solidarity  in  Botswana,  traditional 
rule  still  requires  further  transformation  to  address 
gender  equality  and  minority  representation  in  the 
Ntlo  ya  Dikgosi.  Leadership  skills,  such  as  strategic 
planning,  communication  and  presentation,  team 
building,  creative  problem  solving,  consultation,  and 
conflict  resolution,  are  very  relevant  to  bogosi  and 
the  kgotla  for  fostering  democratic  participation. 

Maitseo  Bolaane 
University  of  Botswana 
Gaborone,  Botswana 
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Tragedy  of  the  Commons 


The  tragedy  of  the  commons  is  a  concept  that 
denotes  a  specific  type  of  social  dilemma  in  which 
interdependent  individuals  face  incentives  to 
choose  independent  actions  that  maximize  their 
individual  benefit  but  generate  a  suboptimal  aggre¬ 
gate  outcome.  Garrett  Hardin  is  credited  with  first 
introducing  the  concept  to  describe  the  potential 
conflict  between  individual  rationality  and  collec¬ 
tive  efficiency  inherent  in  the  management  of  a 
common-pool  resource  (CPR).  A  CPR  is  a  natural 
or  man-made  resource  characterized  by  a  low 
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degree  of  excludability  and  a  high  degree  of  sub- 
tractability.  Excludability  refers  to  the  feasibility 
and  hence  costliness  of  excluding  potential  users 
from  using  a  resource;  subtractability,  on  the  other 
hand,  concerns  whether  a  user’s  appropriation  of 
units  from  the  resource  would  reduce  the  amount 
of  the  units  available  to  others.  The  two  physical 
attributes  generate  what  are  known  as  provision 
problems  facing  individuals  managing  a  CPR. 
Once  a  CPR  is  provided,  it  is  difficult  to  exclude 
potential  users  from  using  the  resource,  regardless 
of  whether  or  not  they  have  contributed  to  the 
resource’s  provision.  A  rational  individual  who 
considers  the  costs  and  benefits  of  alternative 
courses  of  action  would  find  no  incentive  to  con¬ 
tribute  and  instead  would  wait  for  others  to  pro¬ 
vide  the  resource.  Collective  inaction  is  often  the 
outcome. 

In  his  famous  1968  article  in  Science,  Hardin 
depicted  a  hypothetical  world  of  a  pasture  open  to 
all.  Every  herder  faced  the  incentive  that  by  bring¬ 
ing  in  an  additional  animal  to  graze  in  the  pasture, 
he  would  reap  all  the  benefit  generated  by  the  addi¬ 
tional  animal  but  bear  only  a  small  fraction  of  the 
cost  of  overgrazing.  A  rational  herder  facing  such  a 
cost-benefit  calculus  would  likely  choose  to  bring 
in  as  many  animals  to  the  pasture  as  possible  to 
maximize  his  short-term  personal  gain.  When 
every  herder  adopted  the  same  calculus  and  acted 
on  it,  however,  the  collective  outcome  would  be  a 
complete  depletion  of  the  pasture — an  outcome 
that  would  harm  every  herder.  As  Hardin  pointed 
out,  what  makes  the  situation  a  tragedy  is  not  the 
unhappy  ending  per  se  but  the  remorselessness 
involved.  Although  every  herder  would  see  the  pre¬ 
dicament  coming,  they  would  find  themselves 
trapped  in  a  dilemma  from  which  they  could  not 
escape. 

The  tragedy  of  the  commons  is  in  fact  one  type 
of  social  dilemma  in  which  interdependent  indi¬ 
viduals  face  the  incentive  to  make  independent 
choices  that,  while  maximizing  short-term  indi¬ 
vidual  benefit,  collectively  generate  an  inefficient 
aggregate  outcome.  The  outcome  is  inefficient 
because  there  is  at  least  one  alternative  that  can 
generate  a  higher  level  of  collective  gain  benefiting 
all  the  individuals  involved.  While  the  individuals 
are  fully  cognizant  of  their  contributing  to  the  sub- 
optimal  situation  by  failing  to  exercise  temperance 
and  restraint,  they  find  themselves  left  with  no 


choice  but  to  continue  to  pursue  short-term  inter¬ 
est  so  as  to  avoid  falling  prey  to  others.  In  a  social 
dilemma,  aggregate  inefficiency  is  not  necessarily  a 
result  of  evil  acts  by  some  malicious  culprits  who 
means  to  harm  others.  Even  for  a  group  of  benign 
and  innocent  individuals,  as  long  as  they  have  to 
choose  independent  actions  in  an  interdependent 
situation  and  are  unable  to  read  others’  minds,  the 
possibility  of  a  social  dilemma  always  exists. 

The  “tragedy  of  the  commons”  provides  policy 
analysts  with  a  powerful  metaphor  for  describing 
and  deciphering  public  problems  that  are  rooted  in 
human  interdependence.  Moving  from  the  more 
traditional  forms  of  CPRs,  such  as  irrigation,  fish¬ 
ery,  forestry,  groundwater,  transport  systems,  and 
budgets,  studies  of  the  commons  have  extended  to 
cover  nontraditional  sectors,  including  the  intel¬ 
lectual  domain  and  the  Internet.  Recent  studies  of 
the  new  commons — forms  of  shared  resources 
formerly  not  recognized  as  CPRs — have  been  a 
result  of  changing  understanding  of  resource  man¬ 
agement  and  organization  in  new  policy  areas  or 
sometimes  an  improved  appreciation  for  the  for¬ 
merly  unrealized  extent  and  complexity  of  long- 
existing  problems.  Starting  with  high-seas  fisheries 
and  other  more  traditional  forms  of  cross-border 
commons,  the  field  of  “global  commons”  now 
includes  abstract  common  properties — biodiver¬ 
sity,  genetic  commons,  atmosphere,  social  diversity 
and  interconnectedness,  equality,  and  knowledge. 

Governing  the  Commons  as  a 
Search  for  Policy  Panaceas 

Hardin’s  grim  prediction  about  the  tragedy  of  the 
commons  prompted  generations  of  social  scientists 
and  policy  analysts  to  engage  in  a  search  for  effec¬ 
tive  ways  to  govern  the  commons.  Hardin  himself 
sees  a  change  in  human  values  and  ideas  of  moral¬ 
ity  as  the  only  way  to  escape  from  the  tragedy;  the 
crux  of  the  problem  lies  with  the  question  of  how 
the  morality  of  temperance  can  be  put  in  place. 
Hardin  is  pessimistic  about  the  possibility  that  a 
morality  of  temperance  could  emerge  out  of  con¬ 
science;  the  perverse  incentives  embedded  in  a 
commons  are  simply  too  strong  for  the  develop¬ 
ment  of  credible  mutual  commitment,  which  is  the 
very  foundation  for  conscience.  To  Hardin,  the 
second  best  alternative  is  to  create  mutual  coercion 
that  is  mutually  agreed  on.  Although  Hardin  is 
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well  aware  that  coercion  can  be  generated  by  vari¬ 
ous  mechanisms,  such  as  shame,  he  argues  that  for 
the  commons,  which  are  on  a  large  scale  and 
involve  a  large  number  of  individuals,  restrictive 
laws  passed  by  a  government  are  the  more  feasible 
way  to  generate  mutual  coercion. 

A  caveat  is  warranted  here.  Hardin  did  not 
naively  argue  that  coercion  by  government  is  a 
foolproof  and  straightforward  solution.  In  particu¬ 
lar,  he  cautioned  that  the  morality  of  temperance 
is  highly  context  sensitive,  and  hence,  it  is  almost 
impossible  to  stipulate  a  law  that  could  take  all 
conditions  into  account.  Hardin  also  recognized 
that  a  major  challenge  of  legislating  temperance  is 
to  make  sure  that  those  who  make  and  monitor 
the  rules  are  themselves  properly  monitored  and 
controlled. 

For  many  policy  analysts  who  favor  simple  and 
clear-cut  policy  advice,  however,  the  tragedy  of  the 
commons  metaphor  provides  not  only  a  diagnosis 
of  the  problem  of  the  commons  but,  more  impor¬ 
tant,  also  a  neat  policy  panacea.  Given  that  the 
users  of  a  commons  are  trapped  in  a  social 
dilemma  and  are  presumably  unable  to  get  out  of 
it  by  themselves,  direct  government  intervention 
seems  to  be  the  logical  solution  to  help  them  out. 
In  irrigation  management  in  the  1970s  and  1980s, 
for  example,  many  international  donors  and 
governments  in  much  of  the  developing  world  con¬ 
sidered  direct  government  intervention  a  policy 
panacea.  When  put  into  practice,  the  panacea  took 
the  form  of  outright  nationalization  of  the  existing 
irrigation  systems,  large-scale  infrastructure  invest¬ 
ment,  and  the  establishment  of  a  strong,  central¬ 
ized  irrigation  bureaucracy.  As  policy  analysts 
embraced  the  policy  panacea,  Hardin’s  caution 
about  the  context  specificity  of  the  morality  of 
temperance  and  the  need  to  monitor  the  monitors 
fell  largely  on  deaf  ears  and  was  often  considered 
a  trivial  matter  involving  implementation  details. 

At  the  time  when  direct  government  intervention 
was  hailed  as  a  policy  panacea  for  coping  with  the 
problems  of  the  commons,  policy  analysts  trained 
in  economics,  in  particular  property  rights  econom¬ 
ics,  advocated  the  imposition  of  private  property 
rights  as  an  alternative  policy  panacea.  The  logic  of 
their  argument  is  straightforward:  By  putting  in 
place  private  property  rights  and  hence  some  sort 
of  (pseudo)market  exchange,  individual  owners  of 
a  CPR  would  be  able  and  willing  to  internalize  all 


the  costs  and  benefits  of  their  actions  in  relation  to 
the  provision  and  use  of  the  resource.  Without 
externalities,  the  maximizing  behavior  of  the  indi¬ 
viduals  will  aggregate  into  collective  efficiency. 

Although  the  privatization  panacea  seems  to 
offer  a  neat  analytic  solution  to  the  problems  of 
the  commons,  attempts  to  apply  the  panacea  in  the 
real  world  have  been  subject  to  the  thorny  ques¬ 
tions  of  whether  it  is  practically  possible  and,  if  so, 
how  to  divide  up  a  commons  and  how  to  keep  the 
transaction  costs  involved  in  the  exchange  at  a 
reasonably  low  level.  In  irrigation  management, 
for  example,  ample  evidence  has  shown  that  an 
effective  water  market  is  possible  only  if  there 
exists  both  a  strong  physical  infrastructure  that 
allows  for  efficient  transfers  of  resource  units  and 
a  solid  social  infrastructure  that  enables  the  users 
to  engage  in  exchange  activities  in  an  orderly  man¬ 
ner.  Unfortunately,  in  the  real  world,  both  types  of 
infrastructure  are  often  in  short  supply;  this  is 
particularly  the  case  in  many  developing  countries 
where  the  livelihood  of  the  poor  often  directly 
hinges  on  the  effective  management  of  various 
CPRs,  such  as  irrigation  systems  and  other  rural 
infrastructure. 

Empirical  Research  on  the  Commons 

In  the  early  1980s,  as  direct  government  interven¬ 
tion  in  CPR  management,  particularly  in  the  devel¬ 
oping  world,  had  produced  rather  mixed  outcomes, 
scholars  and  policy  analysts  began  to  question  the 
viability  of  the  panacea  of  centralized  control  and 
regulation.  Instead  of  taking  Hardin’s  pessimistic 
prediction  for  granted,  researchers  started  to  seri¬ 
ously  take  stock  of  existing  ethnographic  evi¬ 
dence  of  the  governance  of  the  commons  and 
conduct  systematic  empirical  research.  If  there 
were  major  lessons  that  the  decades  of  empirical 
research  on  the  commons  had  to  offer,  three 
would  stand  out. 

First,  a  commons  is  not  destined  to  be  a  tragedy. 
While  there  are  instances  of  failure,  there  are  many 
cases  of  success  in  various  policy  domains  ranging 
from  irrigation,  fishery,  groundwater,  and  forestry, 
to  grazing  lands.  In  most  of  these  successful  cases, 
resource  users  and  managers  are  able  to  develop  an 
appropriate  governance  structure  to  manage  and 
regulate  the  use  of  the  resources  so  as  to  maintain 
a  certain  level  of  sustainability. 
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Second,  the  users  of  a  CPR  are  not  doomed  to 
be  helpless  individuals  awaiting  help  from  an 
external  authority.  Empirical  evidence  has  shown 
that  around  the  world  many  communities  of 
resource  users  are  able  to  organize  effective 
collective  action  to  govern  the  commons.  In  fact, 
self-governance  is  not  only  a  possibility  but  a 
commonplace.  Moreover,  contrary  to  what  con¬ 
ventional  theories  have  suggested,  successful  self- 
governance  is  not  confined  to  CPRs  of  a  small  scale. 
In  irrigation  management,  for  example,  there  are 
cases  in  which  local  communities  have  successfully 
governed  and  managed  irrigation  systems  that  serve 
a  service  area  of  thousands  of  hectares  over  long 
periods  of  time.  What  is  intriguing  is  that  self- 
organized  community-based  irrigation  systems  have 
often  significantly  outperformed  government- 
managed  irrigation  systems,  which  are  often  charac¬ 
terized  by  sophisticated  physical  infrastructure  and 
management  by  professional  engineers. 

Third,  contrary  to  what  Hardin  presumed,  a 
CPR  is  not  necessarily  an  open-access  resource  that 
is  available  to  all  at  any  time.  In  a  majority  of  the 
successful  cases,  the  users  of  the  commons  are  able 
to  define  membership  in  relation  to  their  resource 
through  a  diversity  of  institutional  arrangements. 
The  existence  of  considerable  diversity  in  the  design 
of  CPR  institutions  raises  a  fundamental  question 
as  to  whether  the  tripartite  choice  of  the  state,  mar¬ 
kets,  and  open  access  can  fully  capture  the  empiri¬ 
cal  diversity  and  provide  an  adequate  analytical 
framework  for  the  study  of  the  commons. 

Governing  the  Commons  as  a  Search 
for  Effective  Design  of  Institutions 

Hardin’s  analysis  of  the  commons  and  the  conven¬ 
tional  CPR  theories  are  anchored  on  two  assump¬ 
tions.  The  first  assumption  is  that  those  involved 
in  the  commons  dilemma  resemble  the  neoclassi¬ 
cal  rational  man,  who  is  portrayed  as  having  clear 
and  definite  preferences  and  excellent  informa¬ 
tion-processing  capabilities.  While  the  neoclassical 
rational  man  may  not  make  the  right  choice  every 
time,  he  commands  good  knowledge  of  the  prob¬ 
ability  distribution  of  what  might  happen  for 
alternative  courses  of  action.  Theoretical  and 
policy  analyses,  then,  focus  on  examining  the  logic 
of  action  of  the  neoclassical  rational  man  in  a 
social  dilemma  and  explicating  possible  ways  to 


circumvent  the  perverse  incentives  embedded  in 
the  dilemma. 

The  problem  of  the  neoclassical  assumption  of 
rationality  is  not  so  much  that  it  oversimplifies  but 
that  it  has  been  overgeneralized,  as  if  it  were  the 
only  viable  assumption  of  rationality.  Recent 
research  has  shown  that  the  neoclassical  model 
works  relatively  well  only  in  stable  situations 
where  the  information  required  for  decision  mak¬ 
ing  is  more  or  less  complete,  such  as  in  a  perfectly 
competitive  market.  For  the  analysis  of  individual 
choice  making  in  a  commons  where  a  high  level  of 
uncertainty  is  involved,  and  hence  learning  is 
essential,  the  more  complex  model  of  bounded 
rationality,  which  takes  into  account  human 
beings’  cognitive  limitations  as  well  as  their  ability 
to  learn  and  engage  with  one  another,  would  be 
more  appropriate.  An  implication  of  the  model  is 
that  fallible  individuals,  while  capable  of  making 
mistakes,  can  learn  from  the  mistakes,  improve 
their  understanding  of  action  situations,  and  design 
new  institutional  arrangements  and  strategies  to 
cope  with  the  social  dilemma.  The  model  points  to 
the  need  to  understand  problem  solving  as  a  pro¬ 
cess  of  trial  and  error  in  a  search  for  both  means 
and  ends  (common  understanding)  and  to  the  rel¬ 
evance  of  choices  at  multiple  levels. 

The  second  assumption  underlying  the  conven¬ 
tional  analysis  of  the  commons  is  that  individuals 
make  choices  in  an  institutional  vacuum.  The  con¬ 
ventional  theories  not  only  fail  to  acknowledge  the 
process  of  socialization  as  central  to  the  creation  of 
the  individual  but  also  decontextualize  the  analysis 
of  actors’  behavior.  As  argued  by  many  researchers, 
individuals  in  a  commons  are  in  fact  making  situated 
choices;  their  decisions  are  shaped  and  conditioned 
by  the  greater  contexts  of  culture  and  social  rules. 

As  a  result  of  a  better  appreciation  of  human  fal¬ 
libility  and  the  importance  of  institutions,  begin¬ 
ning  in  the  1980s,  the  focus  of  CPR  research  has 
shifted  toward  examining  how  the  boundedly  ratio¬ 
nal  individuals  struggle  to  cope  with  social  dilem¬ 
mas  by  crafting  institutions  and  identifying  design 
principles  for  institutions  that  can  bring  about 
effective  outcome.  Among  the  contending  defini¬ 
tions  of  institutions  in  CPR  research,  the  one  devel¬ 
oped  by  Elinor  O  strom  and  her  colleagues  in  a 
tradition  of  institutional  analysis  and  development 
(IAD)  is  a  particularly  useful  analytical  construct. 
In  the  IAD  framework,  institutions  are  defined  as 
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“rules-in-use”  adopted  by  a  community  of  indi¬ 
viduals  to  regulate  their  repetitive  interactions 
pertaining  to  the  physical-biological  world.  The 
rules-in-use  are  linguistic  prescriptions  that  require, 
prohibit,  or  permit  a  range  of  action  for  individu¬ 
als  under  particular  situations.  When  individuals 
ponder  their  actions  and  strategies  in  relation  to 
one  another,  they  take  the  rules  into  their  calculus. 

Based  on  careful  theoretical  analysis  and  meta¬ 
analysis  of  a  large  number  of  empirical  case  stud¬ 
ies,  Ostrom  and  her  colleagues  have  identified 
eight  principles  of  institutional  design  for  robust 
governance  of  CPRs.  These  now  well-known  prin¬ 
ciples  are 

1.  clearly  defined  boundaries, 

2.  congruence  between  rules  and  physical 
conditions, 

3.  collective-choice  arrangements, 

4.  monitoring, 

5.  graduated  sanctions, 

6.  conflict  resolution  mechanisms, 

7.  minimal  recognition  of  rights  to  organize,  and 

8.  nested  enterprises. 

Unlike  conventional  theories  that  focus  on  how 
users  choose  strategies  of  action  in  response  to  a 
given  incentive  structure,  the  IAD  framework 
looks  at  the  governance  of  the  commons  as  an 
ongoing  process  that  requires  continual  contribu¬ 
tion  from  the  users  and  recognizes  the  possibility 
of  the  users  developing  and  modifying  institutions 
to  govern  the  resource.  Empirical  evidence  has 
shown  that  institutions  that  are  characterized  by 
some  or  all  of  these  design  principles  are  better 
able  to  provide  incentives  for  users  of  a  commons 
to  engage  in  collective  action  and,  more  important, 
to  craft  and  continuously  recraft  institutions  (rules- 
in-use)  to  govern  the  commons. 

First,  individuals  in  a  CPR  situation  would  be 
more  likely  to  participate  in  the  provision  and  man¬ 
agement  of  the  resource  if  they  perceived  a  possibil¬ 
ity  that  they  would  enjoy  the  benefits  generated  by 
their  collective  effort.  Rules  that  stipulate  clearly 
how  consequences  of  collective  endeavors  would 
impact  on  whom  are  of  major  significance  in  provid¬ 
ing  incentive  for  individuals  to  engage  in  collective 


action.  The  existence  of  well-defined  boundary  rules 
and  user  rights  is  a  particularly  important  factor 
accounting  for  system  performance. 

Second,  individuals  in  an  interdependent  situa¬ 
tion  are  not  always  able  to  see  their  interdepen¬ 
dence  and  its  implications  for  aggregate  efficiency. 
Only  when  the  users  of  a  CPR  are  aware  of  and 
understand  how  their  interests  relate  to  those  of 
one  another,  how  much  potential  benefits  could  be 
generated  by  working  with  one  another,  and  how 
fragile  one  could  be  when  acting  alone  in  an  inter¬ 
dependent  situation  can  they  begin  to  see  the 
potential  of  and  need  for  arriving  at  a  broader 
conception  of  preference  in  which  self-interest  is 
conditioned  by  recognizing  the  need  to  take  the 
interests  of  others  into  consideration.  When  insti¬ 
tutions  provide  opportunities  and  incentives  for 
the  users  to  communicate,  reason,  and  share  infor¬ 
mation  with  one  another  about  not  only  the 
physical  conditions  of  the  commons  but  also  their 
concerns,  the  users  are  more  ready  to  work  with 
others  in  managing  the  resource. 

Third,  research  has  found  that  the  users  of  a  CPR 
are  better  able  and  willing  to  cooperate  with  one 
another  if  they  have  a  better  understanding  of  the 
physical  conditions  of  the  resource  and  are  able  to 
monitor  and  keep  track  of  the  change  in  the  physical 
environment.  Institutions  can  serve  an  information- 
inducing  function  if  they  embody  useful  information, 
encourage  the  gathering  and  use  of  local  informa¬ 
tion  at  various  settings,  and  allow  the  users  to  use 
time-specific  and  place-specific  information  in  their 
daily  exigencies.  Moreover,  by  ruling  in  certain 
actions  and  ruling  out  some  others,  institutions  also 
provide  information  not  only  about  the  structure  of 
the  action  situation  faced  by  individuals  but  also 
about  the  choices  of  others.  Such  information 
allows  them  to  form  expectations,  which  in  turn 
enable  the  individuals  to  make  choices  by  taking  the 
possible  choices  of  others  into  consideration. 

Fourth,  strategies  available  to  resource  users 
that  are  ruled  in  by  institutions  are  usually  the  ones 
that  maximize  collective  outcomes  but  not  individ¬ 
ual  payoffs.  Hence,  a  collectively  optimal  outcome 
usually  hinges  on  quasi-voluntary  compliance, 
where  the  compliance  of  a  user  is  contingent  on 
the  compliance  of  others.  Quasi  compliance  can  be 
sustained  only  if  the  users  expect  long-term  gains 
from  complying  with  the  rules  and  perceive  some 
prospect  of  being  sanctioned  if  any  one  user 
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among  them  breaks  the  rules.  For  monitoring  and 
sanctioning  to  be  effective,  they  have  to  be  designed 
in  such  a  way  that  the  activities  of  monitoring  and 
sanctioning  involve  only  low  costs,  are  diffused  in 
scope,  and  are  perceived  by  users  in  the  commu¬ 
nity  as  fair  and  reasonable. 
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Transaction  Costs 


Transaction  costs  figure  prominently  in  formal 
political  decision  making  as  well  as  in  informal 


processes  of  political  negotiation.  Transaction 
cost  considerations  are  highly  relevant  in  a  num¬ 
ber  of  domains,  such  as  social  choice  theory,  pub¬ 
lic  sector  reforms  aiming  at  contracting  in  or  out, 
and  the  design  of  global  governance. 

Costs  linked  with  transacting  among  humans 
arise  when  there  is  cooperation  in  one  form  or 
another.  Cooperation  resulting  in  agreements 
requires  a  number  of  measures  for  it  to  be  effec¬ 
tive,  such  as  negotiation  of  the  terms  of  agreement, 
specifying  a  contract  or  writing  down  the  agree¬ 
ment,  and  implementation  of  the  agreement,  pos¬ 
sibly  through  court  action. 

As  these  costs  before  and  after  an  agreement 
tend  to  be  measured  chiefly  in  time,  effort,  and 
frustration  rather  than  in  money,  they  are  distin¬ 
guished  from  ordinary  production  costs  in  econom¬ 
ics.  Although  transaction  costs  were  first  theorized 
about  in  economic  theory,  they  have  figured 
prominently  in  politics,  less  often  as  the  costs  of 
contracting  and  more  often  as  so-called  decision 
costs.  One  may  here  recall  the  much  debated  insti¬ 
tutions  of  liberum  veto  and  the  filibuster,  both  of 
which  tend  to  maximize  transactions  costs.  Today 
transaction  costs  are  a  major  consideration  in 
public  sector  reform,  especially  in  new  public  man¬ 
agement,  where  government  seeks  to  replace  long¬ 
term  contracting  (bureaucracy)  with  short-term 
contracting  (tendering/bidding). 

Foundations  of  Transaction  Cost  Economics 

Transaction  cost  economics  started  with  the  1937 
article  by  Ronald  Coase  on  the  firm  and  continued 
with  his  1960  article  on  social  costs.  Economic  life 
is  replete  with  different  forms  of  contracts,  whose 
shape  transaction  cost  theory  helps  explain.  Thus, 
in  the  Coasean  tradition,  internal  firm  organiza¬ 
tion  with  long-term  contracts  presents  a  mecha¬ 
nism  to  reduce  transaction  costs  when  relying  on 
massive  short-term  contracting  in  the  market.  This 
1937  insight  received  a  much  needed  elaboration 
by  Oliver  E.  Williamson,  underlining  the  implica¬ 
tions  of  transaction  costs  for  both  hierarchies  and 
markets.  In  the  Williamson  tradition,  minimizing 
transaction  costs  is  of  paramount  importance  in 
economic  life  where  strong  forces  like  opportunism 
and  asymmetric  information  with  economic  agents, 
especially  when  possessing  asset-specific  knowl¬ 
edge,  are  conducive  to  economic  inefficiency,  such 
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as  monopoly.  The  1960  Coase  article  broadened 
transaction  cost  economics  to  the  domain  of  public 
policy  and  externalities,  arguing  that  with  zero 
transaction  costs,  external  effects  and  market  fail¬ 
ures  could  be  handled  by  means  of  bargaining  and 
compensation.  This  claim  about  the  ability  of  mar¬ 
ket  forces  to  allocate  responsibility  for  external 
costs  is  known  as  the  Coase  theorem.  It  anticipated 
recent  debates  about  extending  the  “polluter  pays” 
idea  to  a  global  ecology  policy  to  reduce  carbon 
emissions.  In  general,  institutional  economics  (new 
or  neo-institutional)  combine  the  insights  from 
both  Coase  and  Williamson,  emphasizing  that  eco¬ 
nomic  evolution  aims  at  minimizing  transaction 
costs  (Oliver  Williamson  &  Scott  Masten,  1997). 
No  similar  teleology  concerning  a  fundamental 
drive  to  develop  institutions  that  minimize  transac¬ 
tions  is  to  be  found  in  politics,  however. 

Transaction  Costs  in  the  Political  Sphere 

In  politics,  transaction  cost  arguments  are  primar¬ 
ily  found  in  the  theory  of  voting  and  the  public 
choice  school.  Transaction  costs  arise  in  any 
political  group  taking  decisions  by  some  mecha¬ 
nism  of  voting,  whether  the  group  be  a  domestic 
one  like  the  national  assembly  or  an  international 
organization  like  the  World  Bank,  the  United 
Nations,  or  the  International  Monetary  Fund.  In  a 
group  of  choice  participants — political  parties, 
legislators,  state  representatives,  faculty  members, 
and  club  members — any  decision  or  collective 
action  will  result  from  the  summation  of  the  pref¬ 
erences  of  the  participants  according  to  some 
mechanism  of  aggregation.  Alternative  social 
choice  mechanisms — majoritarian  techniques,  pro¬ 
portional  methods,  utilitarian  scoring,  positional 
techniques — result  in  varying  transaction  costs  for 
the  group  as  a  whole. 

Given  the  assumption  that  no  participant  in  the 
choice  group  should  be  given  a  separate  status,  the 
decision  method  that  appears  to  be  the  just  one  is 
the  Wicksell  unanimity  principle.  It  epitomizes  the 
idea  of  deliberative  democracy — namely,  that  a 
group  decision  must  take  into  account  the  views  of 
all  participants.  By  deliberating  the  relevant  issues, 
the  “correct”  group  decision  will  emerge.  In  1896, 
Swedish  economist  Knut  Wicksell  had  already 
identified  this  kind  of  efficiency  justice  with  the 
veto  principle,  meaning  that  all  choice  participants 


have  not  only  a  say  but  in  effect  a  no  capacity.  By 
favoring  the  consensus  method,  Wicksell  wanted 
to  avoid  the  situation  of  a  minority  getting  pushed 
over,  having  to  bear  alone  the  heavy  costs  from  a 
collective  decision.  Thus,  unanimity  would  secure 
Pareto  optimality  in  politics.  However,  Wicksell 
did  not  recognize  the  full  scale  of  the  transaction 
costs  that  attend  the  unanimity  principle  when 
employed  in  practice.  Consensus  is  the  most  trans¬ 
action  cost-burdensome  decision  rule  for  a  choice 
group.  Thus,  alternatives  must  be  employed,  at 
least  in  domestic  politics,  to  avoid  decision  inertia. 
As  noted  by  James  Buchanan  and  Gordon  Tullock 
(1962),  Wicksell’s  insights  into  decision  costs — for 
the  loser  (external  costs)  and  for  the  group  as  a 
whole  (transaction  costs) — form  the  backbone  of 
the  public  choice  school. 

It  is  interesting  to  note  that  in  politics,  minimiz¬ 
ing  transaction  costs  is  not  an  overall  manifest 
objective  or  latent  function,  as  it  were.  One  may 
distinguish  between  domestic  politics  and  interna¬ 
tional  relations.  Only  for  the  first  domain  can  one 
speak  of  a  search  for  institutions  that  minimize 
transaction  costs,  such  as  the  simple  majority 
decision  procedure  and  majoritarian  election 
techniques.  In  contrast,  choice  participants  in  inter¬ 
national  institutions — the  governments  of  member 
states — are  reluctant  to  give  up  the  Wicksell  una¬ 
nimity  principle.  The  principle  of  state  sovereignty 
entails  that  governments  can  protect  their  vital 
interests  in  international  organizations  with  a  firm 
veto.  Similarly,  in  regional  organizations,  with  the 
exception  of  the  EU,  the  veto  rule  is  upheld, 
although  it  maximizes  transaction  costs. 

Transaction  costs  surface  in  decision  making  in 
political  groups  whether  domestic,  regional,  or 
international.  They  are  inversely  related  to  the 
decisiveness  of  the  group,  which  is  the  ratio  of  the 
number  of  winning  coalitions  to  the  total  number 
of  coalitions  among  the  choice  participants.  When 
decisiveness  is  low,  then,  a  stalemate  is  probable 
because  of  the  ensuing  strategies  of  delaying  and 
holding  out.  Voting  under  a  simple  majority  or 
qualified  majority  increases  decisiveness  and 
reduces  transaction  costs.  The  existence  of  a  dicta¬ 
tor  in  the  group  minimizes  transaction  costs  but 
also  increases  the  majority’s  risk  of  losing  out  on 
its  preferred  positions.  A  simple  majority  mini¬ 
mizes  transactions  costs  in  social  choice  while  at 
the  same  time  letting  the  majority  prevail. 


2650  Transaction  Costs 


The  fact  that  regional  and  international  organi¬ 
zations  seldom  employ  simple  majority  entails  that 
minimizing  transaction  costs  is  not  the  sole  preoc¬ 
cupation  of  these  choice  participants.  They  may 
favor  unanimity  or  qualified  majority  because 
these  regimes  increase  the  choice  participants’ 
power  to  block,  which  may  be  a  more  important 
concern  than  the  decisiveness  of  the  group.  Thus, 
transaction  costs  in  politics  may  be  supported 
because  they  follow  necessarily  from  the  highly 
valued  blocking  power  of  each  participant. 

These  insights  into  the  inverse  relationship 
between  blocking  power  and  group  decisiveness 
stemming  from  the  logic  of  transaction  costs  in 
political  decision  making  have  been  elaborated  on 
in  constitutional  economics  and  N-person  game 
theory.  Recognizing  the  positive  effect  of  increas¬ 
ing  transaction  costs,  many  constitutions  provide 
for  a  qualified  majority  on  sensitive  matters, 
increasing  the  individual  power  to  block  and 
decreasing  group  decisiveness.  Constitutional 
decision  making  is  especially  surrounded  by  iner¬ 
tia  that  leads  to  more  substantial  transaction 
costs.  Realizing  that  transaction  costs  may  become 
staggering  as  a  result  of  delays  and  holding  out, 
international  organizations  have  deviated  from 
unanimity  and  introduced  qualified  majorities  of 
one  or  another  kind.  Yet  procedural  matters, 
being  less  sensitive,  are  often  decided  by  simple 
majority. 

Measuring  and  Managing  Transaction  Costs 

In  economics,  several  attempts  have  been  made  to 
measure  transactions,  but  they  are  far  less  identifi¬ 
able  than  standard  production  costs  like  labor  and 
capital.  More  specifically,  the  costs  of  establishing 
a  patent  in  various  countries  have  been  measured 
in  the  form  of  the  length  of  time  it  takes  from  the 
launching  of  an  application.  Some  scholars  esti¬ 
mate  that  transaction  costs  may  amount  to  a  stag¬ 
gering  one  third  of  the  GDP,  including  both  financial 
operations  and  enforcement  costs.  Yet  it  must  be 
pointed  out  that  these  measures  of  transaction 
costs  in  the  economic  literature  are  somewhat 
imprecise.  It  has  even  been  argued  by  Eirik  Furu- 
botn  and  Rudolf  Richter  that  transaction  costs 
include  the  costs  of  maintaining  or  changing 
the  political  institutions  that  legislate  about  or 
administer  contracts.  More  convincing  are  the 


attempts,  such  as  that  by  Yoram  Barzel,  to  esti¬ 
mate  the  reduction  in  transaction  costs  involved  in 
introducing  more  efficient  institutions  like  the 
Aktiengesellschaft,  private  property  to  land  and 
partnership  schemes. 

In  political  science,  transaction  costs  can  be 
identified  as  being  related  to  the  inverse  of  the 
decisiveness  of  an  assembly  or  a  decision-making 
board.  In  domestic  affairs,  one  tends  to  opt  for  a 
transaction  costs  minimizing  mechanism  like  sim¬ 
ple  majority,  whereas  in  regional  and  interna¬ 
tional  coordination,  unanimity  or  a  qualified 
majority  is  favored,  resulting  in  sometimes  heavy 
transaction  costs.  Transaction  costs  in  political 
assemblies  are  minimized  when  the  individual 
blocking  power  is  not  substantial.  N-person  game 
theory,  especially  the  power  index  approach, 
offers  a  convenient  method  for  estimating  the 
decisiveness  of  a  choice  group  under  alternative 
institutional  arrangement,  as,  for  instance,  in  the 
European  parliament,  the  European  Council,  and 
various  international  bodies. 

In  public  administration  or  public  management, 
the  seminal  move  toward  contracting  out  or 
externalization  has  raised  a  concern  over  rising 
transaction  costs.  Relying  on  a  multitude  of  deliv¬ 
ery  agents — public,  private,  and  third  sector — 
governments  have  become  more  anxious  about  get¬ 
ting  the  contracts  correct  as  disputes  under  the 
new  public  management  will  be  settled  in  court.  It 
is  an  open  question  as  to  whether  the  reduction  in 
production  costs  that  come  with  short-term  con¬ 
tracting  is  enough  to  compensate  for  the  sharp  rise 
in  transaction  costs  with  tendering/bidding  schemes, 
especially  litigation  costs.  When  giving  up  long¬ 
term  contracting  in  standard  employment  rela¬ 
tions,  the  government  also  loses  its  general  authority 
to  direct  the  work  effort  and  to  train  its  employees 
continuously — sometimes  described  as  the  hollow¬ 
ing  out  of  the  state. 

Transaction  Costs  in  Global  Governance 

Today,  transaction  costs  surface  strongly  in  global 
governance  as  the  governments  of  the  countries  of 
the  world  confront  a  number  of  issues  arising  from 
growing  interdependences.  Groups  like  the  G8  or 
G20  are  basically  unanimity  groups  and  are,  there¬ 
fore,  associated  with  decision  inertia  and  delays. 
Since  the  decisions  of  international  coordination 
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bodies  are  not  only  time-consuming  in  the  making 
but  can  also  be  reneged  on  ex  post,  transaction 
costs  loom  large  in  several  domains  of  global  gov¬ 
ernance.  To  reduce  these  transaction  costs,  inter¬ 
national  organizations  need  to  reconsider  their 
structure,  avoiding  unanimity  but  allowing  for 
quantitative  voting  in  order  to  recognize  the 
immense  differences  in  scale  between  various 
countries.  They  also  need  to  examine  the  enforce¬ 
ment  problem  in  global  governance  as  few  of  them 
have  explicit  dispute  settlement  mechanisms,  at 
least  not  those  that  operate  speedily. 

Transaction  costs  have  also  been  taken  into 
account  in  global  policy  making  in  the  form  of  the 
carbon  emissions  scheme — the  Kyoto  agreement. 
If  a  price  on  carbon  abatement  could  be  set  and 
emission  rights  be  allocated  to  all  countries  glob¬ 
ally,  then,  a  global  emissions  policy  imitating  the 
market  would  be  conducive  to  an  efficient  reduc¬ 
tion  in  carbon  emissions,  following  the  Coase 
theorem.  However,  arriving  at  such  a  comprehen¬ 
sive  global  emissions  policy  has  not  yet  been  suc¬ 
cessful  due  to  transaction  costs  both  in  negotiating 
the  emission  rights  (ex  ante)  and  in  enforcing  the 
agreement  against  reneging  (ex  post). 

Jan-Erik  Lane 
University  of  Freiburg 
Freiburg,  Germany 
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Transatlantic  Relations 


Like  many  of  the  most  widely  used  concepts  in  the 
political  science  subfield  of  international  relations, 
the  term  transatlantic  relations  defies  precise  defi¬ 
nition  and  has  been  used  to  refer  to  a  variety  of 
historical,  political,  cultural,  social,  and  economic 
interactions  between  polities,  societies,  institu¬ 
tions,  groups,  and  individuals  in  Europe  and  North 
America.  As  such,  the  phenomenon,  if  not  the  term 
itself,  stretches  from  the  period  of  European  explo¬ 
ration  and  colonization  of  the  Americas,  especially 
in  the  16th  and  17th  centuries,  through  the  ensuing 
political  upheavals  of  the  late  18th  century — in 
particular  those  associated  with  the  United  States 
War  of  Independence — the  consolidation  and 
extension  of  U.S.  power  and  influence  across  the 
New  World  in  the  19th  century — primarily  at  the 
expense  of  Spain  and  France — to  the  projection  of 
U.S.  power  and  influence  into  Europe  itself.  The 
latter  process  was  most  clearly  reflected  in  the 
United  States’  crucial  contribution  to  the  outcomes 
of  the  two  world  wars  of  the  last  century  and  the 
establishment  of  a  permanent  military  presence  in 
Europe,  which,  although  it  outlived  its  origins  in 
the  Cold  War,  continues  today.  Because  of  their 
origins  in  the  efforts  of  European  powers  to  colo¬ 
nize  the  New  World,  however,  transatlantic  rela¬ 
tions  have  never  been  limited  to  merely  political  or 
security  affairs.  They  have  always  been  influenced 
by  economic  and  cultural  interactions,  with  trade 
and  immigration  constituting  the  most  important 
components  of  transatlantic  relations  in  the  18th 
and  19th  centuries. 

The  rich  history  of  transatlantic  relations  and 
the  membership  of  Canada  in  important  transat¬ 
lantic  institutions,  such  as  North  Atlantic  Treaty 
Organization  (NATO)  and  Organization  for 
Security  and  Co-operation  in  Europe  notwith¬ 
standing,  the  term  today  is  most  often  used  with 
reference  to  post-World  War  II  relations  between 
the  United  States  and  Europe.  In  security  affairs,  it 
primarily  comprises  relations  between  the  United 
States  and  its  European  NATO  partners.  In  eco¬ 
nomic  affairs,  the  frame  of  reference  is  the  United 
States  and  the  members  of  the  European  Union 
(EU),  although  relations  between  the  EU  and  the 
North  American  Free  Trade  Area  are  sometimes 
covered  by  the  term.  Generally  excluded  from  the 
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concept  are  relations  between  Europe  and  Latin 
America  as  well  as  relations  between  the  United 
States  and  European  states  that  are  not  members 
of  either  the  EU  or  NATO. 

Following  a  brief  summary  of  the  major  con¬ 
tours  of  transatlantic  relations  in  the  post-World 
War  II  period,  this  entry  addresses  the  develop¬ 
ment  of  transatlantic  relations  after  the  demise  of 
the  Soviet  Union  and  the  Warsaw  Pact.  The  discus¬ 
sion  then  turns  to  a  survey  of  the  major  theoretical 
approaches  to  the  analysis  and  explanation  of  the 
history  and  future  development  of  transatlantic 
relations. 

Transatlantic  Relations 
in  the  Postwar  Period 

Although  there  were  some  efforts  to  systematically 
analyze  transatlantic  relations  in  the  interwar 
period,  they  first  emerged  as  a  central  topic  of 
concern  in  international  relations  theory  and  prac¬ 
tice  in  the  period  following  World  War  II.  In  par¬ 
ticular,  the  breakdown  of  the  wartime  alliance 
between  the  Western  powers  and  the  Soviet 
Union,  the  extension  and  consolidation  of  Soviet 
control  over  the  states  of  Eastern  and  Southeastern 
Europe,  and  the  perceived  threat  posed  by  the 
Soviet  Union  to  the  war-ravaged  states  of  Western 
Europe  implied  the  need  for  close  economic  and 
military  cooperation  between  the  states  of  Western 
Europe  and  the  United  States  and  led  to  the 
Marshall  Plan,  the  development  of  NATO,  and 
the  strategy  of  containment.  For  the  first  three 
decades  of  the  Cold  War,  transatlantic  relations 
were  characterized  by  a  high  degree  of  asymmetry. 
In  economic  and  military  affairs,  the  United  States 
was  the  provider  and  Europe  the  consumer.  In 
both  fields,  the  United  States  was  a  global  super¬ 
power.  Europe,  by  contrast,  was  at  most  a  regional 
junior  partner. 

Both  sides  of  the  Atlantic  drew  large  benefits 
from  this  asymmetrical  relationship;  however, 
transatlantic  relations  experienced  frequent  and 
serious  controversies.  Whereas  the  states  of  Europe 
were  quite  happy  to  accept  American  economic 
assistance  and  security  guarantees  on  the  conti¬ 
nent,  the  former  colonial  powers  found  it  difficult 
to  cede  to  the  United  States  those  prerogatives  that 
had  accompanied  their  previous  status  as  great 
powers.  A  general  American  aversion  to  Europe’s 


continued  colonial  pretensions  resulted  in  strained 
relations  throughout  the  1950s  and  erupted  into  a 
serious  crisis  when  the  United  States  demanded  the 
withdrawal  of  British  and  French  forces  from 
Egypt,  which  together  with  Israel  they  had  attacked 
in  October  1956  in  response  to  President  Nasser’s 
decision  to  nationalize  the  Suez  Canal.  For  their 
part,  Europeans  often  resented  the  fact  and  exer¬ 
cise  of  American  power.  In  the  wake  of  Suez, 
French  president  Charles  de  Gaulle  set  out  to 
enhance  France’s  military  and  political  indepen¬ 
dence  by  developing  nuclear  weapons,  successfully 
testing  an  atomic  bomb  in  1960.  In  early  1966,  he 
announced  that  France  would  alter  the  terms  of 
her  participation  in  the  Atlantic  Alliance.  As  a 
result,  France  withdrew  its  ground  and  air  forces 
as  well  as  headquarters  personnel  from  NATO 
commands,  demanded  the  removal  of  U.S.  and 
Canadian  forces  from  French  territory,  and  forced 
the  relocation  of  NATO  headquarters  and  installa¬ 
tions  to  neighboring  countries. 

The  1970s  and  1980s  were  similarly  marked  by 
periodic  strains  in  transatlantic  relations,  in  par¬ 
ticular  over  nuclear  strategy  and  arms  control,  as 
well  as  relations  with  the  Soviet  Union  and  its  East 
European  satellites,  but  the  debates  were  generally 
conducted  within  a  larger  consensus  over  basic 
interests,  goals,  and  values.  It  is  perhaps  a  paradox 
of  transatlantic  relations  that  their  very  success  has 
tended  to  engender  subsequent  tensions  between 
Europe  and  North  America.  Hence,  the  American 
decision  to  condition  receipt  of  Marshall  Plan 
assistance  on  the  development  of  a  common 
European  economic  policy  contributed  to  the 
development  of  the  European  Communities  and 
eventually  the  EU.  Premised  on  transatlantic  coop¬ 
eration,  Europe’s  economic  recovery  nonetheless 
led  to  conflict  with  the  United  States  over  ques¬ 
tions  of  economic  policy  especially  in  the  fields  of 
trade  and  investment. 

By  the  beginning  of  the  21st  century,  the  eco¬ 
nomic  asymmetry  between  the  United  States  and 
its  European  trading  partners  was  all  but  elimi¬ 
nated.  Ongoing  efforts  to  widen  and  deepen 
European  integration  in  the  fields  of  foreign  and 
security  policy  by  means  of  a  European  Constitution 
hold  the  prospect  of  eliminating  the  political  asym¬ 
metries  between  the  United  States  and  Europe, 
although  Europe’s  capacity  for  power  projection 
lags  far  behind  that  of  its  American  ally. 
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Transatlantic  Relations  After  the  Cold  War 

The  rapid  and  unexpected  end  of  the  Cold  War 
and  the  subsequent  collapse  of  the  Soviet  Union 
and  communist  governments  across  Eastern 
Europe  in  1989  to  1990  dramatically  changed  the 
context  within  which  transatlantic  relations  had 
developed  for  almost  a  half  century,  leading  many 
to  question  their  future  course.  If  transatlantic 
relations  had  been  premised  on  meeting  the  threat 
posed  by  the  Soviet  Union  in  a  global  anticommu¬ 
nist  struggle  and  the  reinforcing  imperative  of 
economic  recovery  and  if  the  early  asymmetry  in 
capabilities  between  the  two  sides  of  the  Atlantic 
was  disappearing,  could  a  basic  consensus  on  both 
sides  of  the  Atlantic  over  fundamental  interests, 
goals,  and  values  in  international  and  domestic 
affairs  suffice  to  sustain  the  history  of  close  coop¬ 
eration?  Since  1990,  the  question  has  preoccupied 
policymakers  and  scholars  alike. 

Scholars  of  international  security  and  defense 
policymakers  wondered  whether  NATO  could 
survive  without  an  identifiable  enemy.  Prominent 
political  scientists  maintained  that  NATO  was 
indeed  disappearing,  while  influential  politicians 
argued  that  NAT O  would  have  to  go  out  of  area  if 
it  were  to  retain  relevance.  Meanwhile,  many 
observers  of  transatlantic  economic  relations 
argued  that  with  the  opening  of  the  economies  of 
Eastern  Europe  augmenting  ongoing  economic 
liberalization  in  China,  the  end  of  the  Cold  War 
would  accelerate  the  ongoing  processes  of  global¬ 
ization.  As  a  result,  Europe  and  the  United  States 
were  expected  to  engage  in  fierce  competition  for 
mobile  capital  and  a  declining  number  of  high- 
paying  jobs.  Moreover,  as  the  relative  importance 
of  other  regions  of  the  world  for  both  Europe  and 
the  United  States  increased,  some  reduction  in  the 
level  of  importance  that  was  accorded  to  transat¬ 
lantic  relations  seemed  unavoidable. 

After  an  American-led  coalition  defeated  Iraq 
and  restored  the  Kuwaiti  Emir  to  power  in  1991, 
the  transatlantic  agenda  of  the  1990s  was  largely 
dominated  by  two  broad  strategic  issues.  The  first 
involved  the  expansion  of  NATO  and  the  EU  to 
the  newly  independent  states  of  Eastern  Europe; 
the  second  issue  was  how  to  respond  to  the  cam¬ 
paign  of  mass  murder  and  ethnic  cleansing  that 
emerged  after  the  breakup  of  Yugoslavia  and 
resulted  in  the  1992-1995  war  in  Bosnia.  With 
respect  to  the  latter  issue,  NATO  proved  unwilling 


or  unable  to  develop  and  execute  a  coherent  strat¬ 
egy  for  the  first  three  and  a  half  years  of  the  con¬ 
flict.  In  light  of  that  failure,  the  logic  of  NATO 
expansion  was  less  than  entirely  persuasive. 
Meanwhile,  the  EU  pursued  a  strategy  of  sanc¬ 
tions,  United  Nations  resolutions,  and  threats  of 
diplomatic  isolation  in  an  effort  to  coerce  Serbia 
and  the  Bosnian  Serbs  to  stop  the  ethnic  cleansing 
but  to  no  avail.  Following  a  series  of  transatlantic 
recriminations  and  a  change  of  government  in 
both  the  United  States  and  France,  NAT O  eventu¬ 
ally  intervened  militarily  and  forced  the  parties  to 
negotiate  a  resolution  to  the  crisis.  The  lessons  of 
the  Bosnian  War  still  fresh  in  their  minds,  United 
States  and  European  leaders  responded  in  a  swift 
and  effective  manner  to  the  1999  crisis  over 
Kosovo. 

The  question  of  the  future  course  of  transatlan¬ 
tic  relations  again  became  acute  in  the  aftermath  of 
the  September  11,  2001,  terrorist  attacks  on  New 
York  and  Washington,  D.C.,  and  the  ensuing 
transatlantic  debate  over  the  course  and  scope  of 
the  global  war  on  terror  and  how  to  deal  with  the 
perceived  threat  posed  by  weapons  of  mass  destruc¬ 
tion  (WMD)  in  the  hands  of  so-called  rogue  states. 
The  debate  was  focused  primarily  on  Iraq,  which 
the  U.S.  and  European  intelligence  agencies 
believed  possessed  WMD  and  which  was  invaded 
by  U.S.  and  British  forces  in  March  2003.  The 
ensuing  dispute  pointed  to  divergent  perceptions 
of  core  interests  between  the  United  States  and 
important  European  allies — France  and  Germany 
in  particular — and  led  to  a  breakdown  of  allied 
cooperation  in  the  United  Nations  and  a  wave  of 
anti-American  sentiment  across  Europe.  Many 
regard  the  crisis  over  Iraq  to  have  been  the  most 
serious  of  the  post-World  War  II  era. 

Transatlantic  relations  appeared  to  reach  a 
nadir  during  the  two  administrations  of  U.S. 
President  George  W.  Bush  (2001-2009).  For  exam¬ 
ple,  in  Germany,  Chancellor  Gerhard  Schroeder 
not  only  opposed  the  United  States  on  a  funda¬ 
mental  question  of  security  but  premised  his  2002 
reelection  campaign  on  opposition  to  American 
foreign  policy.  But  the  transatlantic  divide  was  not 
limited  to  debates  over  single  issues  or  the  rhetoric 
of  individual  politicians.  A  series  of  differences 
over  issues  as  diverse  as  the  role  of  religion  in 
politics,  the  establishment  of  an  International 
Criminal  Court,  climate  change,  and  the  death 
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penalty,  many  of  which  predated  the  Bush  presi¬ 
dency,  led  substantial  numbers  of  citizens  on  both 
sides  of  the  Atlantic  to  conclude  that  the  postwar 
consensus  over  fundamental  interests,  goals,  and 
values  had  dissipated. 

Theoretical  Approaches  to  the  Study 
of  Transatlantic  Relations 

Given  that  the  field  of  international  relations  is 
characterized  by  a  number  of  competing  para¬ 
digms,  theoretical  traditions,  or  schools  of  thought, 
the  absence  of  a  scholarly  consensus  over  the  basic 
causes  and  consequences  of  transatlantic  relations 
should  come  as  no  surprise.  Owing  to  the  interde¬ 
pendence  of  facts  and  theory,  disputes  begin  at  the 
level  of  description — that  is,  over  the  question  of 
what  precisely  needs  to  be  explained.  Thus,  many 
state-centric  approaches  find  it  difficult  to  describe, 
let  alone  explain,  what  constructivist  or  culturally 
oriented  approaches  regard  to  be  a  major  develop¬ 
ment  in  transatlantic  relations  over  the  course  of 
the  second  half  of  the  20th  century:  the  emergence 
of  a  pluralistic  security  community  in  the  North 
Atlantic  area.  Although  the  relevant  literature  is 
vast  and  includes  Marxist  and  other  critical  per¬ 
spectives,  mainstream  scholarship  has  come  to 
cluster  around  three  broad  approaches. 

Realism 

Although  no  longer  as  dominant  as  it  was  during 
the  Cold  War,  realism  as  an  approach  to  the  study 
of  international  relations  remains  influential,  par¬ 
ticularly  in  the  subfield  of  international  security. 
Realism  does  not  offer  a  single  theory  of  interna¬ 
tional  relations,  but  realist  theories  tend  to  share  a 
set  of  common  assumptions  about  the  fundamental 
causes  of  international  outcomes.  These  include  the 
following:  that  international  relations  are  charac¬ 
terized  by  the  absence  of  legitimate  authority  above 
the  sovereign  state,  which  is  to  say  that  the  ordering 
principle  is  anarchy;  that  sovereign  states  are  the 
primary  actors  in  the  international  system;  and  that 
states  at  a  minimum  seek  to  survive  but  may  strive 
to  dominate  the  system  by  maximizing  their  power 
relative  to  other  states.  Because  of  anarchy  and  the 
possibility  that  some  states  may  try  to  dominate 
others,  the  use  of  force  is  always  possible  and  hence 
states  are  fundamentally  insecure.  Realism  assumes 


that  the  insecurity  produced  by  anarchy  leads  states 
to  jealously  guard  their  relative  position  in  the 
international  system.  When  considering  opportuni¬ 
ties  for  economic  cooperation,  states  will  worry 
that  the  eventual  distribution  of  benefits  might 
leave  others  better  off.  Moreover,  they  will  fear  that 
economic  gains  might  be  converted  to  military 
capability.  Hence,  realist  theory  expects  relatively 
little  cooperation  in  international  politics. 

Among  realist  theories,  the  balance-of-power 
theory  is  the  most  developed  and  relevant  for  the 
analysis  of  transatlantic  relations.  Because  realism 
in  general  is  skeptical  regarding  the  prospects  for 
durable  cooperation  in  international  politics,  the 
long  history  of  cooperative  transatlantic  relations 
appears  puzzling.  At  first  glance,  balance-of-power 
theory  appears  to  offer  an  explanation.  When  con¬ 
fronting  a  potential  threat  to  their  security,  states 
are  expected  to  try  to  balance  it  through  the  acqui¬ 
sition  of  arms  or  allies.  Hence,  the  history  of  post¬ 
war  transatlantic  cooperation  is  explained  as  a 
response  to  the  commonly  perceived  Soviet  threat. 
Although  rivalry  between  the  United  States  and  its 
European  allies  was  never  completely  absent,  the 
need  to  maintain  a  united  front  against  the  Soviet 
bloc  kept  conflict  within  manageable  bounds. 

From  the  perspective  of  balance-of-power  the¬ 
ory,  however,  because  the  collapse  of  the  Soviet 
Union  and  the  breakup  of  the  Warsaw  Pact  elimi¬ 
nated  the  overriding  common  interest  that  sus¬ 
tained  close  transatlantic  relations,  it  should  have 
led  to  a  significant  weakening  of  transatlantic 
cooperation  and  the  collapse  of  common  institu¬ 
tions,  including  NATO.  Indeed,  balance-of-power 
theory  would  appear  unable  to  explain  not  only 
NATO’s  persistence  but  also  its  expansion  to 
Eastern  Europe  in  the  1990s  and  the  early  years  of 
the  21st  century  as  well  as  the  decision  of  French 
President  Nicolas  Sarkozy  in  2009  to  reverse 
President  de  Gaulle’s  decision  of  1966  and  to  reen¬ 
ter  the  alliance’s  integrated  military  command. 
With  regard  to  economic  relations,  even  the  largest 
economic  crisis  since  the  Great  Depression  during 
2008-2009  did  not  lead  to  a  repeat  of  the  “beggar 
thy  neighbor”  policies  that  plagued  transatlantic 
economic  relations  during  the  1930s.  Rather, 
transatlantic  relations  were  characterized  by 
intense  efforts  to  coordinate  American  and 
European  macroeconomic  policies  in  an  effort  to 
stimulate  the  global  economy,  an  outcome  clearly 
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at  odds  with  the  predictions  of  realist-inspired 
analyses. 

Liberalism 

Like  realism,  liberal  theories  tend  to  assume 
that  states  are  the  most  important  actors  in  inter¬ 
national  politics  and  generally  regard  their  identi¬ 
ties  and  interests  as  fixed  and  given.  Nonetheless, 
liberal  theories  of  international  relations  are  more 
sanguine  with  regard  to  the  future  of  transatlantic 
cooperation  after  the  Cold  War.  In  particular, 
institutional  theories  maintain  that  by  reducing 
structural  impediments  to  cooperation — high 
transaction  costs,  informational  asymmetries,  and 
insufficient  communication  principal  among 
these — international  institutions  can  produce 
durable  patterns  of  cooperation  even  in  the  absence 
of  world  government  by  creating  a  common  set  of 
rules  and  norms  to  govern  the  interactions  of 
states.  Because  the  long-term  benefits  of  coopera¬ 
tion  are  held  to  outweigh  the  short-term  gains  that 
states  might  achieve  from  unilateral  defection,  they 
are  expected  to  place  a  high  value  on  maintaining 
institutions  even  if  the  conditions  under  which 
they  were  established  no  longer  obtain. 

Hence,  in  the  wake  of  the  Cold  War,  liberal 
theorists  would  expect  Europe  and  the  United  States 
to  maintain  a  high  degree  of  cooperation  even  in  the 
absence  of  a  common  enemy.  From  this  perspective, 
NATO’s  persistence  is  explained  in  terms  of  its  abil¬ 
ity  to  adapt  to  the  challenges  of  a  new  era  and  its 
potential  for  managing  security  relations  among  its 
members.  Moreover,  although  there  is  a  surprising 
lack  of  formal  institutions  specifically  directed  at 
governing  transatlantic  economic  relations,  some 
have  argued  that  a  dense  transnational  network  of 
like-minded  bureaucrats,  regulators,  and  lawyers 
constitutes  an  informal  institution  that  has  helped 
Europe  and  the  United  States  avoid  a  rupture  in 
their  economic  relationship. 

Constructivist  and  Cultural  Approaches 

Whereas  both  realist  and  liberal  theories  of 
transatlantic  relations  begin  from  the  standpoint 
of  states  with  fixed  preferences  or  interests,  con¬ 
structivist  theories  stress  the  embeddedness  of 
states  in  social  structures  of  shared  norms  and 
beliefs — that  is,  culture — which  evolve  over  time. 


Not  only  are  these  social  structures  shaped  by  state 
behavior,  but  they  serve  to  shape  and  reshape  the 
conceptions  of  the  identity  and  interests  of  states. 
Constructivists  stress  the  transformational  poten¬ 
tial  in  international  politics.  When  states  come  to 
share  the  values,  norms,  and  symbols  that  provide 
their  social  identity,  their  relations  may  come  to 
resemble  those  of  individuals  in  a  society  or  even  a 
community  in  which  the  use  of  force  by  one  mem¬ 
ber  of  the  community  against  another  is  consid¬ 
ered  unthinkable. 

Although  security  communities  are  not  unprec¬ 
edented  in  the  history  of  international  relations, 
the  postwar  international  system  appears  unique. 
In  modern  times,  there  has  never  been  a  period  in 
which  war  among  the  European  great  powers  or 
between  a  European  power  and  the  United  States 
was  considered  unthinkable.  Yet  with  the  end  of 
the  Cold  War,  this  appears  to  be  the  case.  Given 
the  scale  and  frequency  of  great  power  wars  in  the 
past,  Robert  Jervis  has  argued  that  the  emergence 
of  a  security  community  among  the  great  powers 
constitutes  the  single  most  striking  discontinuity  in 
the  modern  history  of  international  relations. 

If  true,  the  claim  suggests  serious  limitations  to 
many  realist  and  liberal  theories  and  their  applica¬ 
tion  to  the  subject  of  transatlantic  relations,  which 
might  be  better  understood  from  the  perspective  of 
actors  enjoying  shared  identities  and  values.  But 
although  they  point  to  the  possibility  of  transfor¬ 
mational  change,  constructivist  theories  have  not 
done  a  good  job  of  establishing  the  conditions 
under  which  such  changes  are  likely  to  take  place. 
Understanding  the  role  of  agency  in  path-dependent 
processes,  thus,  poses  key  challenges  for  the  fur¬ 
ther  development  of  constructivist  theory. 
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Transformation,  Economic 


The  term  economic  transformation  is  often  used 
in  economics  vocabulary,  though  it  is  not  defined 
in  the  discipline’s  textbooks  and  dictionaries.  In 
general,  the  term  refers  to  structural  changes  in 
the  economic,  monetary,  and  financial  systems  of 
states,  which  are  generated  by  passing  from  one 
mode  of  production  to  another  or  by  the  transi¬ 
tion  of  one  model  of  development  to  another. 

Since  the  beginning  of  the  modern  era,  several 
economic  theories  have  contributed  to  these  struc¬ 
tural  changes  by  advocating  for  the  replacement 


of  one  mode  of  production  by  another,  or  of  one 
model  of  development  by  another,  that  is  more 
able  to  ensure  the  prosperity  of  nations  and  indi¬ 
viduals.  Over  time,  those  that  have  had  the  largest 
influence  are  mercantilism,  classical  liberalism, 
Marxism-Leninism,  the  heterodox  liberal  theory, 
and  neoliberal  theory. 

This  entry  retraces  the  history  of  economic 
transformations  since  the  17th  century  by  showing 
the  major  impacts  of  the  evolution  of  economic 
ideas  with  regard  to  such  changes.  It  also  examines 
several  approaches  in  political  science  that  have 
analyzed  the  impacts  of  economic  transformations 
on  the  political  regimes  of  states  and  in  interna¬ 
tional  relations.  It  is  indeed  difficult  to  devote 
more  attention  to  the  work  of  political  scientists  as 
most  have  ignored  economic  transformations  and 
those  who  have  noted  them  have  not  carried  out 
any  additional  analyses  of  such  changes.  They 
have  instead  focused  on  the  legal,  social,  cultural, 
and  political  impacts  of  economic  changes. 
Although  enormously  interesting,  the  aforemen¬ 
tioned  literature  is  not  pertinent  to  this  entry. 

Mercantilism 

Mercantilism  appeared  in  the  context  of  the  forma¬ 
tion  of  the  first  European  nation-states  during  the 
16th  and  17th  centuries.  The  central  idea  of  this 
theory,  as  it  was  applied  in  France  and  England,  was 
that  the  state  had  to  support  the  development  of 
trade  and  industry  in  order  to  expand  its  economic 
power,  which  was  the  principal  support  of  its  mili¬ 
tary  and  political  strength.  To  this  end,  the  state  had 
to  promote  the  growth  of  production  by  subsidies, 
privileges,  monopoly  concessions,  and  maintaining 
the  lowest  wages  possible.  The  state  also  had  to 
encourage  the  conquest  of  colonies,  which  provided 
a  source  of  primary  goods  and  an  opportunity  for 
manufacturing;  this  involved  the  construction  of  a 
powerful  navy  to  protect  trade  between  colonies 
and  the  homeland  and  to  block  out  rival  nations. 
While  this  economic  model  enabled  all  the  European 
colonial  empires  to  expand,  only  in  countries  such 
as  the  Netherlands  and  England,  which  practiced  a 
form  of  commercial  and  financial  mercantilism,  was 
the  development  of  a  capitalist  economy  dominated 
by  a  rich  bourgeoisie  in  the  manufacturing,  trading, 
and  banking  sectors  favored.  Over  time,  this  bour¬ 
geoisie  became  more  and  more  hostile  toward  the 
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constraints  of  mercantilism,  such  as  laws  that  for¬ 
bade  the  concentration  of  businesses,  limitations  on 
the  importation  of  primary  goods,  and  the  iron  law 
of  wages,  which  hindered  consumption,  provoked 
overproduction  crises,  and  lowered  investments  and 
profits.  This  conflict  was  the  root  cause  of  the  revo¬ 
lution  of  1688,  which  allowed  the  English  bourgeoi¬ 
sie  to  implement  a  new  economic  model  inspired  by 
classical  liberal  economics. 

Liberalism 

According  to  classical  liberalism,  conceptualized 
notably  by  John  Locke,  Adam  Smith,  and  David 
Ricardo,  all  individuals  are  rational  and  naturally 
seek  to  improve  their  economic  well-being  at  the 
least  possible  cost.  It  is  from  this  common  desire 
that  the  market  is  created,  a  place  where  producers 
attempt  to  sell  the  largest  number  of  goods  at  the 
most  advantageous  price  and  where  consumers 
seek  to  purchase  the  goods  that  they  need  at  the 
lowest  possible  price.  The  free  market  naturally 
tends  toward  equilibrium  between  supply  and 
demand  and  between  the  growth  of  production 
and  income.  It  is  the  state’s  interference  in  the  mar¬ 
ket  that  causes  the  misallocation  of  resources 
(unemployment  or  a  lack  of  manpower),  imbal¬ 
ances  between  supply  and  demand  (overproduc¬ 
tion  or  a  shortage  of  goods),  and  excessive  rises 
and  falls  in  prices  (inflation  or  deflation).  According 
to  this  theory,  free  trade — not  only  within  but 
between  nations — is  the  source  of  wealth.  Since 
nations  do  not  possess  all  the  three  main  requisites 
of  capitalism  (raw  materials,  capital,  and  man¬ 
power),  they  must  specialize  their  economies 
according  to  their  comparative  advantages  to  reap 
the  maximum  benefits  of  free  trade.  Liberal  theory 
became  the  dominant  economic  model  between 
1750  and  1920  because  of  the  British  Empire’s 
supremacy  and  the  adoption  of  the  theory  by  other 
European  colonial  powers  following  the  over¬ 
throw  of  absolutist  regimes  by  revolutionary 
movements  led  by  national  bourgeoisies. 

Neo-Mercantilism 

However,  in  the  19th  century,  liberalism  was 
rejected  by  Germany  and  the  United  States,  both  of 
which  adopted  a  model  based  on  British  mercantil¬ 
ism  in  order  to  strengthen  their  industries  behind 


protectionist  barriers.  According  to  the  found¬ 
ers  of  neo-mercantilism  (Alexander  Hamilton, 
Friedrich  List,  Wilhelm  Roscher,  and  Gustav 
Schmoller),  it  is  to  the  state’s  advantage  to  be 
protectionist  when  they  are  weak  or  in  economic 
decline  and  to  favor  free  trade  when  they  are 
strong  or  in  a  period  of  economic  growth.  The 
application  of  this  doctrine  permitted  both 
countries  to  become  great  economic,  political, 
and  military  powers  in  less  than  a  century. 
However,  if  the  United  States  practiced  a  peace¬ 
ful  and  isolationist  neo-mercantilism  because  of 
their  vast  domestic  market,  Germany,  deprived 
of  both  such  a  market  and  colonies,  opted  for  an 
imperialist  neo-mercantilism  that  was  one  of  the 
causes  of  the  Franco-Prussian  War  of  1870  and 
World  War  I. 

Marxism 

In  the  19th  century,  the  most  radical  critics  of  lib¬ 
eral  English  political  economy  were  Karl  Marx 
and  Friedrich  Engels.  Their  theory  argued  that 
within  the  framework  of  capitalism,  value  was  not 
found  in  labor,  as  Adam  Smith  and  David  Ricardo 
claimed,  but  rather  in  surplus  value  or  in  unpaid 
wages  to  workers.  To  outlast  the  competition, 
businesses  would  have  to  constantly  raise  their 
growth  or  profit  rates  by  replacing  workers  by 
machines  and  by  creating  monopolies,  which 
would  bring  about  overproduction  crises,  a  rise  in 
unemployment,  bankruptcy  of  small  businesses, 
and  the  impoverishment  of  the  middle  class.  These 
barriers  to  growth  in  supply  and  demand  would 
inevitably  bring  about  a  relocation  of  businesses  to 
periphery  markets.  Such  actions,  far  from  countering 
the  falling  rate  of  profit,  would  generate  conse¬ 
quences  of  the  contradictions  and  evils  of  capital¬ 
ism,  which  would  create  conditions  favorable  for  a 
global  socialist  revolution.  Thus  was  born  the 
famous  slogan  of  the  Communist  Manifesto: 
“Workers  of  the  world,  unite!”  Vladimir  Ilyich 
Ulyanov  Lenin,  founder  of  the  Russian  Bolshevik 
party,  took  the  theory  of  Marx  and  Engels  further. 
According  to  Lenin,  over  the  course  of  the  last 
third  of  the  19th  century,  capitalism  had  reached 
the  “highest  stage  of  imperialism.”  Capital  was 
now  more  concentrated  in  the  hands  of  financial 
oligopolies,  which  stemmed  from  the  merger  of 
monopolistic  commercial,  industrial,  and  banking 
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firms.  These  oligopolies  ferociously  struggled 
among  each  other  for  the  possession  of  resources 
and  already  conquered  or  available  territories, 
which  led  to  imperialist  wars  that  victimized  the 
oppressed  lower  classes.  Over  the  course  of  World 
War  I,  the  Bolshevik  party  called  on  Russian  work¬ 
ers  and  farmers  to  mobilize  and  take  up  arms 
against  the  Tsar,  which  led  to  the  revolution  of 
1917  and  the  creation  of  the  Union  of  Soviet 
Socialist  Republics  (USSR).  Confronted  with  the 
construction  of  socialism,  a  subject  largely  ignored 
by  Marx  and  Engels,  Lenin  developed  a  theory  that 
would  largely  serve  as  a  model  for  those  communist 
regimes  that  came  into  being  after  1945.  According 
to  this  theory,  the  first  step  in  the  construction  of 
socialism  is  the  dictatorship  of  the  proletariat,  in 
which  the  state  assumes  control  of  all  means  of 
production  and  moves  toward  a  progressive  col¬ 
lectivization  of  agriculture  in  order  to  finance  the 
development  of  heavy  industry,  both  civil  and 
military — a  key  sector  of  the  economy  and  national 
defense.  The  development  of  consumer  goods 
industries  is  thus  pushed  back  to  a  later  stage. 

From  1920  to  the  Present: 

Heterodox  Liberalism 

The  troubled  interwar  period,  characterized  by 
several  failed  attempts  at  a  socialist  revolution,  the 
establishment  of  fascist  regimes  in  a  number  of 
countries,  the  economic  depression  of  the  1930s, 
and  the  abandonment  of  liberalism  in  favor  of  pro¬ 
tectionism,  led  several  liberal  economists,  such  as 
John  Maynard  Keynes,  James  Meade,  Joan 
Robinson,  and  Gunnar  Myrdal,  to  develop  a  new 
economic  theory — heterodox  liberalism.  This  the¬ 
ory  argues  that  neither  liberalism,  which  led  to  the 
worst  economic  depression  in  history;  nor  nation¬ 
alism,  which  was  the  source  of  several  wars;  nor 
Marxism,  which  attempts  to  mitigate  socioeco¬ 
nomic  inequalities  at  the  expense  of  political  liber¬ 
ties;  is  capable  of  guaranteeing  sustained  economic 
growth,  greater  social  justice,  democracy,  and 
peace.  Following  World  War  II,  most  advanced 
capitalist  countries  adopted  this  theory.  It  would 
serve  as  the  basis  for  the  developmentalist  theory  of 
Raul  Prebisch,  Oswaldo  Sunkel,  Jose  Serra,  and 
Fernando  Henrique  Cardoso,  which  was  used  by  a 
large  number  of  developing  nations.  It  was  at 
the  origin  of  the  Bretton  Woods  compromise,  the 


foundation  of  the  new  world  economic  order  of  the 
postwar  period.  Due  to  the  economic  and  social 
progress  that  it  brought  about,  it  contributed  to 
limiting  the  expansion  of  communism.  Nevertheless, 
this  theory  was  adopted,  under  the  influence  of  the 
USSR,  by  roughly  40  countries  in  Europe,  Asia, 
and  Africa  in  the  years  following  1945. 

According  to  Keynesian  heterodoxy,  it  is  not  the 
rise  in  supply  but  the  increase  in  household  and 
business  demand  that  serves  as  the  lifeblood  of 
economic  growth.  Governments  thus  sought  to 
promote  full  employment  by  investing  in  infra¬ 
structure,  protecting  labor-intensive  industries  that 
produce  consumer  goods,  and  encouraging  union¬ 
ization.  They  also  had  to  support  the  income  of 
those  excluded  from  the  labor  market  (e.g.,  unem¬ 
ployed,  dependents,  pensioners)  by  social  pro¬ 
grams  and  liberalize  consumer  credit,  all  while 
requiring  central  banks  to  maintain  low  interest 
rates.  To  counter  the  imbalances  of  an  unregulated 
international  market,  governments  were  left  to 
establish  multilateral  institutions  that  provide  the 
necessary  capital  to  rebuild  the  European  and 
Japanese  economies  and  to  aid  the  industrializa¬ 
tion  of  developing  countries,  ensure  the  stability  of 
currencies  and  the  balance  of  payments,  and  pro¬ 
mote  a  gradual  and  sectored  liberalization  of  trade 
adapted  to  the  unequal  development  of  nations. 
These  precepts  served  as  the  basis  for  the  World 
Bank,  the  International  Monetary  Fund  (IMF),  and 
the  General  Agreement  on  Tariffs  and  Trade 
(GATT).  They  would  eventually  inspire  the  cre¬ 
ation  of  the  European  Economic  Community  and 
several  other  regional  integration  agreements.  They 
also  allowed  for  the  adoption  of  agreements  facili¬ 
tating  the  exportation  of  manufactured  goods  from 
newly  industrialized  countries  (NICs)  to  industrial¬ 
ized  countries  (ICs),  such  as  the  Multi  Fibre 
Arrangement  and  the  Preferential  Trade  System. 

Theories  Inspired  by  Heterodox  Liberalism 

For  30  years,  the  heterodox  liberal  model  generated 
the  longest  growth  cycle  in  history;  a  decrease  in 
socioeconomic  inequalities,  especially  in  ICs;  a  rein¬ 
forcement  of  state  interdependence  and  cooperation; 
and  mitigation  of  North-South  disparities  from  the 
emergence  of  South  European,  East  Asian,  and  Latin 
American  NICs.  These  transformations  were  the 
origin  of  several  new  theories  in  political  science. 
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The  most  notable  of  these  is  the  modernization  the¬ 
ory  developed  by  Seymour  Martin  Lipset,  which 
argued  that  the  progress  of  economic  and  social 
development  is  the  main  determinant  in  the  emer¬ 
gence  and  stability  of  democracies.  During  the  1960s 
and  1970s,  the  validity  of  this  theory  was  challenged 
by  the  domination  of  authoritarian  regimes  in  most 
NICs.  However,  the  democratization  of  most  NICs 
between  1974  and  1995  and  the  instability  of 
democracies  established  in  developing  nations  after 
the  Cold  War  would  confirm  the  legitimacy  of  the 
modernization  theory.  David  Mitrany’s  functionalist 
theory  argued  that  the  modernization  of  societies 
generates  myriad  nonpolitical  problems  whose  solu¬ 
tions  require  the  collaboration  of  experts  from 
several  countries.  One  of  the  benefits  of  such  col¬ 
laboration  is  that  it  encourages  states  to  strengthen 
cooperation  by  concluding  economic  integration 
agreements  that  contribute  to  the  development  of 
their  political  integration.  This  analysis,  confirmed 
by  the  early  stages  of  European  construction,  was 
completed  by  Ernst  Haas’s  neo-functionalist  theory. 
According  to  this  theory,  the  creation  of  suprana¬ 
tional  institutions  is  not  uniquely  determined  by  the 
benefits  of  economic  integration.  It  follows  when 
national  leaders  are  convinced  that  it  is  in  their  inter¬ 
est  to  transfer  their  loyalty  and  their  ability  to 
respond  to  institutions  that  have  supranational  juris¬ 
diction.  The  neo-institutionalist  theory  advanced  by 
Joseph  Nye,  Robert  Keohane,  and  Stephen  Krasner 
attempted  to  demonstrate  that  the  economic  interde¬ 
pendence  of  states  reinforces  their  common  interest 
to  cooperate  within  international  regimes;  as  such, 
regimes  allow  them  to  reduce  uncertainty  and  their 
transaction  costs.  Unlike  neo-realists,  neo¬ 
institutionalists  argue  that  international  cooperation 
does  not  necessarily  stem  from  the  leadership  of  a 
hegemonic  power.  The  results  and  actions  of  inter¬ 
national  organizations  affirm  both  theories.  Inter¬ 
national  cooperation  without  a  hegemon  is  possible 
in  areas  of  low  politics  (economic,  environmental, 
social,  and  cultural  issues)  but  not  in  areas  of  high 
politics  (political  and  military  issues). 

Transition  to  Neoliberalism 

At  the  start  of  the  1970s,  several  problems 
brought  about  the  decline  of  the  heterodox  liberal 
model  and  the  ensuing  crisis.  The  transnational- 
ization  of  firms  and  capital  gave  rise  to  an  increase 


in  the  volume  of  Eurodollars  in  circulation  and  a 
depletion  of  American  gold  reserves,  in  such  a 
way  that  in  1971,  the  United  States  had  to  aban¬ 
don  the  gold  exchange  standard,  which  was  fun¬ 
damental  to  the  system  of  fixed  exchange  rates 
administered  by  the  IMF.  The  latter  was  replaced 
by  a  floating-exchange-rate  system  entirely  at  the 
mercy  of  the  monetary  market’s  supply  and 
demand.  For  30  years,  the  rise  in  public  spending 
and  in  consumption  was  largely  financed  by  pub¬ 
lic  and  private  debt,  which  took  advantage  of 
surplus  capital  and  low  interest  rates.  However, 
this  dynamic  led  to  a  general  rise  in  prices  and 
wages.  This  inflationist  context  prompted  firms  in 
developed  countries  to  lower  their  investments 
and  accelerate  the  relocation  of  their  production 
to  NICs.  This  movement  created  a  situation  of 
stagflation  in  ICs,  which  was  aggravated  by  the  oil 
crises  of  1973  and  1979  and  the  subsequent  reces¬ 
sions  of  1974-f975  and  1980-1983.  Pressure 
from  financial  institutions,  worried  over  the  sol¬ 
vency  of  governments  and  firms,  convinced  cen¬ 
tral  banks  to  sharply  raise  interest  rates  in  1983. 
This  decision  brought  about  a  severe  crisis  in  the 
public  finances  of  states,  which  was  the  determin¬ 
ing  factor  in  the  transition  toward  a  new  eco¬ 
nomic  model  based  on  neoliberalism. 

Neoliberal  theory  is  the  synthesis  of  new 
American  economic  theories  derived  from  classical 
and  neoclassical  liberal  economics,  which  were 
developed  in  opposition  to  Keynesianism  during 
the  1960s  and  1970s.  Milton  Friedman’s  monetar¬ 
ist  theory  argues  that  inflation  is  caused  by  main¬ 
taining  low  interest  rates  through  government 
control  of  central  banks.  Governments  should 
allow  monetary  authorities  to  adjust  interest  rates 
as  a  function  of  the  market,  which  would  raise  the 
level  of  money  and  combat  inflation,  the  principal 
cause  of  slowdowns  in  growth.  The  supply-side 
theory,  advanced  by  Arthur  Laffer  and  Jan  P.  Sey¬ 
mour,  maintains  that  excessively  high  taxes  on 
incomes  and  profits  discourage  initiative,  savings, 
investment,  and  productive  effort,  all  while  encour¬ 
aging  tax  evasion  and  leading  to  reductions  in 
government  revenues.  The  theory  calls  for  lower 
taxes  for  the  rich,  who  spend  more  than  the  poor, 
and  for  a  transfer  of  social  spending  to  the  private 
sector.  Gary  Becker  and  Jacob  Mincer’s  human 
capital  theory  maintains  that  all  human  activities 
are  determined  by  cost-benefit  calculations.  This 
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logic  must  prevail  in  public  and  para-public  insti¬ 
tutions  to  raise  their  productivity  and  efficiency 
and  to  lower  deficits  and  government  debt. 

The  precepts  of  neoliberalism  were  upheld  by 
the  IMF  and  the  World  Bank  during  the  1980s  and 
1990s.  They  eventually  led  to  the  creation  of  struc¬ 
tural  adjustment  programs,  which  would  become 
conditions  for  aid  to  ICs,  NICs,  and  developing 
countries.  Progressively,  governments  of  capitalist 
countries  the  world  over  began  to  embrace  this 
philosophy.  However,  very  few  countries  fully 
applied  the  recommendations  of  neoliberal  theo¬ 
ries.  Most  adopted  mixed  economic  policies, 
which  combined  the  ideas  of  heterodox  liberalism 
and  neoliberalism.  Contrary  to  the  fears  of  the 
Left,  the  transition  toward  neoliberalism  did  not 
lead  to  a  dismantling  of  the  state  or  to  the  destruc¬ 
tion  of  the  economic  and  social  achievements  of 
Keynesianism.  Nevertheless,  the  neoliberal  model 
was  at  the  origin  of  an  unprecedented  liberaliza¬ 
tion  of  international  trade.  The  Uruguay  Round  of 
the  GATT  (1986-1993),  the  adoption  of  the 
North  American  Free  Trade  Agreement  in  1994, 
the  transformation  of  the  European  Economic 
Community  into  a  common  market  (following  the 
Single  European  Act  of  1986)  and  then  into  an 
economic  and  monetary  union  (following  the 
Maastricht  Treaty  of  1993),  and  the  creation  of 
the  Mercosur  customs  union  in  1994  have  all  not 
only  abolished  trade  barriers  for  manufactured 
goods  but  also  liberalized  the  flow  of  services  and 
capital.  The  opening  of  markets  has  reinforced  the 
power  of  multinational  firms  and  banks  and  weak¬ 
ened  the  control  of  states  over  their  economies.  In 
2008,  Peter  Willets  counted  77,200  nonfinancial 
multinational  firms  in  138  countries.  The  few  hun¬ 
dred  with  the  most  important  assets  are  Western 
European,  Japanese,  or  American,  but  a  large 
number  are  also  found  in  other  ICs  (Canada, 
Sweden,  and  Australia)  or  in  emerging  countries 
(China,  South  Korea,  Malaysia,  Singapore,  India, 
Brazil,  and  Russia).  Nonfinancial  multinational 
firms  have  enormous  power  over  governments  as 
they  can  relocate  their  production  to  other  coun¬ 
tries  that  offer  better  advantages.  Financial  multi¬ 
national  firms  not  only  largely  determine  the  value 
of  currencies  and  the  interest  rates  for  states  but 
also  make  investment  decisions  that  determine  the 
profitability  of  their  banks,  insurance  companies, 
pension  funds,  and  other  financial  institutions. 


The  Crisis  of  Communism 

The  crisis  of  the  heterodox  liberal  model  coincided 
with  the  crisis  of  the  communist  model.  In  the 
USSR,  it  was  caused  by  the  concentration  of  gov¬ 
ernment  spending  in  the  military  industry  and 
the  decline  of  productivity  in  other  sectors  of  the 
economy  caused  by  a  disillusionment  with  the 
communist  ideology,  unable  to  fulfill  its  promises 
“of  a  better  tomorrow.”  In  China,  it  was  brought 
about  by  the  disastrous  cumulative  economic 
effects  of  “The  Great  Leap  Forward”  and  by  the 
“Great  Cultural  Revolution”  envisioned  and  led 
by  Mao  Zedong.  Though  the  outcome  of  these 
crises  was  a  transition  toward  capitalism,  they  ran 
very  different  courses  in  each  country.  In  the 
USSR,  the  combination  of  economic  liberalization 
(; perestroika )  and  the  liberalization  of  the  political 
system  ( glasnost )  led  to  the  disappearance  of  the 
Communist  Party,  the  breakup  of  the  USSR,  and 
the  collapse  of  communist  regimes  in  central  and 
eastern  Europe  in  less  than  5  years  (1987-1991). 
Russia  subsequently  adopted  an  orthodox  neolib¬ 
eral  model  that  had  some  disastrous  effects  while 
mortgaging  the  democratization  of  the  political 
regime.  Conversely,  the  countries  of  Central  and 
Eastern  Europe  succeeded,  between  1991  and 
2004-2007,  in  establishing  functional  market 
economies  and  stable  democracies,  largely  due  to 
the  European  Union’s  accession  process,  closely 
supervised  by  Brussels.  In  China,  the  transition 
toward  a  market  economy  has  been  very  gradual 
and  controlled  by  the  Communist  Party.  It  has 
been  accompanied  by  a  progressive  yet  limited 
liberalization  of  the  political  system,  with  the 
authorities  seeking  to  preserve  the  country’s  unity 
and  stability.  This  stability  has  encouraged  the 
inflow  of  capital  and  investment  by  foreign  firms, 
and  has  been  the  principal  cause  of  the  very  strong 
economic  growth  that  China  has  seen  in  the  years 
since  1980. 


Conclusion 

History  demonstrates  that  over  the  past  four  cen¬ 
turies,  the  capitalist  mode  of  production  has  been 
the  source  of  economic  transformation  in  a  major¬ 
ity  of  nations.  The  communist  mode  of  production 
appeared  in  about  40  countries  during  a  brief 
period  of  less  than  50  years.  Today  capitalism 
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spans  the  globe,  with  the  exception  of  certain 
regions  in  central  Asia,  Africa,  and  Oceania,  where 
precapitalist  modes  of  production  exist.  The  indus¬ 
trialization  and  modernization  of  developing 
countries  are  the  lifeblood  of  growth  and  of  the 
globalization  of  capitalism.  Given  that  over  time 
these  transformations  are  more  and  more  rapid — 
the  industrialization  of  European  countries  hap¬ 
pened  in  200  years,  while  the  process  in  NICs  in 
the  20th  century  lasted  4  or  5  decades — it  is  highly 
probable  that  the  possibilities  of  an  expansion  of 
capitalism  will  dry  up  before  the  end  of  the  21st 
century.  What  will  happen  then?  Will  we  witness 
the  onset  of  imperial  wars,  as  Lenin  predicted;  or 
the  establishment  of  a  world  order  founded  on 
specialization  and  the  economic  cooperation  of 
states,  as  liberals  claim;  or  rather,  the  institution  of 
a  postcapitalist  model,  characterized  by  the  protec¬ 
tion  of  natural  resources,  a  decrease  in  population 
and  consumption,  and  orientation  of  economic 
activity  toward  human  needs  rather  than  material 
ones,  as  ecologists  desire?  No  one  knows  for  sure. 
One  thing  is  certain:  No  mode  of  production  or 
model  of  development  lasts  forever. 

Diane  Etbier 
University  of  Montreal 
Montreal,  Quebec,  Canada 
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Transition 


The  term  transition  is  one  of  several  used  in 
research  on  the  transformation  of  political  regimes. 
Other  competing  terms  include  transformation, 
regime  change,  system  change,  and  democratiza¬ 
tion.  These  terms  not  only  are  closely  related  to 
each  other  but  also  overlap  in  meaning;  however, 
they  are  not  identical  in  their  meaning.  Moreover, 
they  all  have  specific  connotations  and  are  often 
linked  to  particular  theoretical  concepts  or  even 
reflect  ideological  biases. 

The  term  transition  was  first  used  in  political 
science  by  Dankwart  Rustow  in  his  1970  article 
“Transitions  to  Democracy.”  However,  it  was  not 
until  1986  that  the  term  was  transformed  into  a 
more  coherent  theoretical  concept  by  the  seminal 
1986  work  of  Guillermo  O’Donnell,  Philippe  C. 
Schmitter,  and  Laurence  Whitehead.  They  defined 
transition  as  “the  interval  between  one  political 
regime  and  another”  (p.  6).  Transitions  are  limited 
in  time:  on  the  one  hand  by  the  dissolution  of  an 
authoritarian  regime  and,  on  the  other,  “by  the 
installation  of  some  form  of  democracy,  the  return 
to  some  form  of  authoritarian  rule,  or  the  emer¬ 
gence  of  a  revolutionary  alternative”  (p.  6).  The 
two  authors  deliberately  introduced  contingency 
as  an  important  variable  with  respect  to  the  regime 
outcome.  They  do  not  deterministically  speak 
about  “transition  to  democracy,”  but  “transition 
from  authoritarian  rule”  to  an  “uncertain  some¬ 
thing  else”  (italics  added;  p.  3).  However,  the 
study  triggered  and  inspired  an  avalanche  of 
empirical  research  in  democratization,  first  on 
Southern  Europe  and  Latin  America  and  later  on 
in  the  1990s  on  Asia,  Africa,  and  Eastern  Europe. 
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Transitologists  conceive  the  (not  necessarily)  suc¬ 
cessful  transition  from  authoritarian  to  democratic 
rule  typically  in  three  subsequent  phases:  first  liber¬ 
alization,  then  democratization,  and  finally  con¬ 
solidation.  Liberalization  marks  the  beginning  of 
transition.  It  is  the  process  of  granting  some  of  those 
“liberal”  rights  to  the  citizen  that  are  traditionally 
associated  with  habeas  corpus,  (partial)  freedom  of 
movement,  free  speech,  freedom  of  association,  the 
sanctity  of  the  home,  and  so  forth.  Authoritarian 
rulers  grant  these  rights  typically  in  situations  when 
the  regime  suffers  from  a  crisis  of  legitimacy  and  the 
so-called  softliners  among  the  ruling  elites  prevail 
over  the  hardliners  by  arguing  that  a  limited  liberal¬ 
ization  and  controlled  broadening  of  the  ruling  base 
can  ultimately  save  the  political  power  of  the 
authoritarian  regime.  However,  Adam  Przeworski 
has  shown  that  this  often  turns  out  to  be  a  misper¬ 
ception.  The  changing  power  relations  and  the 
dynamics  of  regime  change  are  barely  controllable 
and  often  trigger  a  chain  of  events  and  decisions 
that  finally  lead  to  democratization  by  default. 
Gorbachev’s  glasnost  and  perestroika,  the  collapse 
of  the  Soviet  Union,  and  the  beginning  of  democra¬ 
tization  constitute  only  one  of  many  examples. 

Democratization  by  default  occurs  when  the 
newly  conditioned  and  nonguaranteed  civil  and 
political  rights  open  a  new  space  for  an  emerging 
civil  society.  The  more  civil  society  grows  beyond 
the  control  and  repression  of  the  authoritarian 
regime,  the  more  the  risk  of  punishment  decreases 
for  protest  movements  and  prodemocratic  demon¬ 
strations.  The  increasing  number  of  protesters 
lowers  the  costs  for  individual  and  collective 
actions  against  the  authoritarian  regime.  The 
granting  of  some  liberal  rights  by  authoritarian 
rulers  is  often  motivated  by  their  calculation  of 
avoiding  protests  and  broadening  the  base  of 
legitimacy  of  the  old  regime.  However,  the  con¬ 
trary  has  often  been  the  case:  Liberalization  has 
triggered  more  protests,  leading  to  the  emergence 
of  a  stronger  civil  society,  which  then  led  to  the 
beginning  of  democratization. 

The  phase  of  democratization  is  the  proper  core 
of  each  transition.  It  is  the  time  when  the  old  insti¬ 
tutions  and  the  repressive  apparatus  of  the  author¬ 
itarian  regime  have  lost  their  power,  and  the  new 
rules  of  democracy  are  not  yet  fully  established. 
The  lack  of  effective  institutions  opens  a  large 
space  for  new  political  actors.  In  these  situations, 


where  the  precise  power  relations  are  not  transpar¬ 
ent,  the  relevant  actors  of  the  remaining  old  regime 
and  the  new  opposition  decide  to  negotiate  form 
and  content  of  the  next  steps  of  democratization. 
The  core  of  these  negotiations  involves  crucial 
democratic  institutions,  such  as  electoral  laws,  the 
form  of  government,  its  relation  to  the  parliament, 
the  debate  surrounding  the  decision  to  have  a 
presidential  or  parliamentary  system,  majoritarian 
or  consensus  forms  of  democracy,  and  a  central¬ 
ized  or  federalist  state.  Although  the  social  and 
historical  context  of  a  country  and  society  matters, 
the  final  configuration  of  the  new  institutions  is 
above  all  a  logical  outcome  of  the  power  relations 
and  the  power  struggle  between  the  relevant  actors 
of  the  old  authoritarian  regime  and  the  democratic 
opposition  and  their  strategic  decisions. 

The  role  of  actors  is  decisive  in  transition 
research.  Philippe  C.  Schmitter  and  colleagues  and 
Adam  Przeworski  ascribe  to  them  the  crucial  role 
within  any  transition  from  authoritarian  rule. 
Structural  and  socioeconomic  constraints  or  oppor¬ 
tunities  that  are  at  the  core  of  modernization  the¬ 
ory  are  neglected  or  deliberately  excluded  from  the 
analysis  by  an  implicit  ceteris  paribus  clause.  It 
depends  above  all  on  the  constellation  of  actors 
whether  a  transition  leads  to  democracy  or  falls 
back  to  dictatorial  rule.  Inspired  by  the  transicion 
pactada  in  post-Franco  Spain  (1975-1977)  and 
confirmed  by  the  negotiated  transitions  in  Brazil, 
Uruguay,  and  Chile  in  the  1980s  or  the  roundtables 
in  Eastern  Europe  after  1989,  pacted  transitions 
became  the  paradigmatic  key  of  successful  transi¬ 
tions.  Particularly  when  the  reformers  (softliners) 
of  the  old  regime  and  the  moderates  of  the  demo¬ 
cratic  opposition  dominate  their  respective  camps 
and  agree  in  negotiations  or  even  formalized  pacts 
on  the  next  steps  of  political  reforms,  democracy  is 
the  most  probable  outcome  of  the  transition.  Elite 
settlements,  where  the  relevant  actors  agree  on  the 
fundamental  rules  of  the  democratic  game,  are 
considered  to  be  a  solid  base  for  further  democra¬ 
tization.  Elites  play  the  decisive  role  in  the  move¬ 
ments,  whereas  the  actions  of  the  masses  are 
mostly  understood  as  a  transient  phenomenon  at 
the  beginning  of  the  political  transformation. 

Since  it  is  basically  an  analytical  distinction,  it  is 
not  precisely  clear  in  reality  when  the  phase  of 
liberalization  ends  and  the  transition  begins.  The 
same  is  true  for  the  borderline  between  transition 


Transitional  Justice  2663 


to,  and  consolidation  of,  democracy.  Both  lines  are 
blurred,  and  the  different  phases  can  best  be 
understood  and  researched  as  overlapping  pro¬ 
cesses.  However,  the  mainstream  of  transitology 
argues  that  transition  has  finished  when  the  first 
democratic  elections,  the  founding  elections  of  the 
new  democratic  regime,  have  taken  place.  Others, 
such  as  Wolfgang  Merkel,  believe  that  the  passing 
of  a  revised  or  new  democratic  constitution  is  the 
proper  end  of  the  transition.  Only  when  the  trans¬ 
fer  of  political  power  from  one  person  or  a  group 
of  persons  to  a  set  of  institutionalized  rules  has 
taken  place,  which  are  equally  valid  for  both  the 
ruling  and  the  ruled,  can  the  formal  rules  of 
democracy  be  said  to  be  established. 

What  is  the  theoretical  contribution  of  transi¬ 
tion  to  democracy  literature  to  our  understanding 
of  regime  changes  or  democratization  of  political 
regimes?  Theoretically,  the  transition  paradigm  can 
be  considered,  according  to  Barrington  Moore,  as 
an  answer  to  historical  structuralism  and,  accord¬ 
ing  to  Talcott  Parsons,  Seymour  Martin  Lipset,  and 
others,  to  modernization  theory.  While  structural¬ 
ists  focus  particularly  on  class  relations  and  the  link 
of  social  classes  to  the  state,  modernization  theory 
could  never  get  rid  of  a  certain  deterministic  bias. 
It  is  above  all  socioeconomic  development  and  the 
emergence  of  the  middle  classes  that  lead  sooner  or 
later  to  democracy;  actors  are  not  taken  seriously 
into  account.  One  of  the  strengths  of  the  transition 
paradigm  is  that  it  can  shed  light  into  the  black  box 
of  political  actors,  actor  constellations,  and  finally 
political  action.  Transition  theory  brought  political 
actors  back  into  the  limelight.  However,  the  transi- 
tologists  constructed  their  own  black  box:  They  did 
not  sufficiently  integrate  in  their  analyses  the  socio¬ 
economic  opportunity  structure  for  political  action. 
For  some,  such  as  Giuseppe  Di  Palma,  democracy 
can  be  crafted  only  if  the  actors  agree  regardless  of 
the  socioeconomic  and  structural  constraints. 
Despite  all  its  merits,  the  transition  paradigm  could 
never  overcome  the  theoretical  burden  of  a  consid¬ 
erable  degree  of  voluntarism.  The  disregard  of 
structural  and  historical  factors  was  not  the  least 
of  the  issues  that  contributed  to  the  exhaustion  of 
the  transition  paradigm  in  the  late  1990s.  Since 
then,  historical  institutionalism  has  taken  over  the 
paradigmatic  lead  in  regime  research  and  democra¬ 
tization  studies  linking  actors  and  actions  to  the 
structural  and  historical  context.  However,  the 


theoretical  merit  of  the  transition  paradigm  lasts: 
Political  actors  and  agency  are  back  in  research  on 
the  transformation  of  political  regimes. 
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One  of  the  effects  of  the  third  wave  of  democrati¬ 
zation  that  took  place  in  the  last  quarter  of  the 
20th  century  has  been  to  confront  newly  democra¬ 
tizing  countries  with  the  need  to  come  to  terms 
with  their  “evil  past” — the  human  rights  violations 
committed  under  authoritarian  rule.  Transitional 
justice,  one  of  the  fastest  growing  areas  within  the 
study  of  democratization,  deals  with  the  manner  in 
which  the  claims  arising  from  such  wrongdoings, 
often  undertaken  by  the  state  and  its  agents, 
should  be  handled  fairly  and  equitably. 
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The  study  of  transitional  justice  is  by  no  means 
confined  to  lawyers  or  to  the  discipline  of  the  law. 
It  is  an  eminently  interdisciplinary  field  in  which 
law,  political  science,  history,  sociology,  anthro¬ 
pology,  theology,  and  other  disciplines  converge. 
A  number  of  institutes  in  several  continents  have 
been  established  in  the  first  decade  of  the  21st  cen¬ 
tury  to  study  the  many  dimensions  of  transitional 
justice,  to  analyze  the  different  policy  tools  used  to 
achieve  its  objectives,  and  to  disseminate  best  prac¬ 
tices  in  the  field.  The  handling  of  transitional  jus¬ 
tice  issues  and  their  consequences  affects  the 
dynamics  of  the  political  transition  and  the  subse¬ 
quent  democratic  consolidation. 

Much  like  the  latter  two  processes,  transitional 
justice  itself  is  also  marked  by  contingency  and 
paradox.  Much  depends  on  the  historical  context 
and  the  legacies  of  any  given  political  trajectory. 
Transitions  are,  by  definition,  highly  fluid,  and 
both  political  action  and  the  uses  of  the  law  find 
themselves  under  a  different  set  of  rules  from  those 
obtaining  in  periods  of  stability  and  calm.  To  iden¬ 
tify  the  transformative  opportunities  presented  by 
the  conjuncture,  which  will  allow  change  to  pro¬ 
ceed  toward  the  new  order,  becomes  an  essential 
task  both  for  political  leaders  and  for  those 
responsible  for  the  transitional  justice  tools. 

Historical  Context 

Transitional  justice  falls  squarely  within  the 
broader  issue  of  human  rights.  The  latter  emerged 
as  an  international  issue  in  the  aftermath  of  the 
Holocaust,  and  was  formalized  as  such  in  the  1948 
United  Nations  (UN)  Human  Rights  Declaration. 
However,  it  was  not  until  the  early  1970s,  in  the 
light  of  human  rights  violations  committed  in  the 
Southern  Cone  of  Latin  America,  that  the  issue  of 
transitional  justice  came  to  the  forefront  of  inter¬ 
national  relations  and  foreign  policy  agendas. 

In  its  original  incarnation,  transitional  justice 
was  seen  as  mainly  concerned  with  transitions  from 
authoritarian  to  democratic  rule.  It  first  drew  heav¬ 
ily  on  the  experiences  of  Southern  Europe  and  Latin 
America  in  the  1970s  and  1980s  as  well  as  on  those 
in  Central  and  Eastern  Europe  after  1989  and  in 
Southern  Africa  in  the  mid-1990s.  It  shared  some  of 
the  same  value  orientation  of  democratization  stud¬ 
ies  more  generally — what  Abraham  F.  Lowenthal 
has  referred  to  as  the  “thoughtful  wishing”  school 


of  scholarship.  Over  time,  however,  transitional 
justice  has  expanded  to  other  instances,  encompass¬ 
ing  a  much  broader  scope,  to  include  postconflict 
societies  more  generally,  arising  both  from  interna¬ 
tional  and  civil  wars.  The  field  has  thus  become 
somewhat  less  normative  and  more  neutral,  with  a 
stronger  emphasis  on  empirical  work.  After  the 
many  internal  conflicts  that  erupted  in  the  post- 
Cold  War  world — mainly  in  Southeastern  Europe, 
Africa,  and  Asia — transitional  justice  has  emerged 
as  an  important  tool  within  the  wider  panoply  of 
nation-  and  peace-building  instruments  to  be 
deployed  after  wars  come  to  an  end. 

In  all  these  cases,  however,  the  critical  question 
is  the  same:  What  to  do  “the  morning  after” — be 
it  after  the  fall  of  the  dictator  or  the  signing  of  the 
peace.  And  here  come  to  the  fore  the  many  dilem¬ 
mas  and  paradoxes  that  are  such  a  hallmark  of 
transitional  justice.  On  the  one  hand,  there  is  so 
much  to  do  on  so  many  fronts  that  the  last  thing 
the  new  leaders  may  want  is  to  “frontload”  their 
own  political  agenda — a  problem  often  com¬ 
pounded  by  the  possibility  of  authoritarian  regres¬ 
sion.  On  the  other  hand,  the  very  legitimacy  of  the 
incoming  coalition  will  often  be  based  on  their 
alleged  moral  superiority  over  the  outgoing  elites. 
It  was  the  new  order’s  denunciation  of  those 
human  rights  violations  that  contributed  to  deplet¬ 
ing  the  political  capital  of  the  ancien  regime.  To  do 
nothing  now,  safely  ensconced  in  office,  would 
seem  the  height  of  cynicism.  There  will  thus  be 
strong  domestic  and  international  pressures  against 
a  policy  of  “do  nothing.” 

What  to  do,  then,  about  this  evil  past  and  about 
the  evildoers?  Though  some  countries  (Brazil  and 
Spain  come  to  mind)  effectively  decided  to  do 
nothing — although  this  does  not  mean  that  the 
issue  itself  disappeared — many  new  governments 
have  felt  it  necessary  to  take  some  sort  of  action. 
The  reasons  for  this  go  beyond  the  routine  pressures 
the  rulers  experience  “to  do  something”  about  a 
given  matter.  After  dictatorship  or  after  war,  there  is 
a  deep-felt  need  to  draw  a  clear  line  between  the  old 
regime  and  the  new  dispensation,  between  yester¬ 
day’s  order  and  that  of  today.  In  many  cases — that 
of  apartheid  in  South  Africa  is  one  of  them — the 
suffering  inflicted  on  significant  portions  of  the 
population  has  been  such  that  the  very  notion  of 
pretending  that  nothing  happened  or  that  it  is  best 
to  simply  forgive  and  forget  is  abhorrent  to  many. 
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Elements  of  Transitional  Justice 

Transitional  justice,  then,  as  Mark  Freeman  has 
pointed  out,  encompasses  four  distinct  sets  of 
activities: 

1.  trials  in  their  various  manifestations,  be  they 
criminal  or  civil,  national  or  international; 

2.  fact-finding  bodies,  of  which  the  most 
prominent  are  truth  commissions  (TCs); 

3.  reparations  to  the  victims  of  human  rights 
violations  and/or  their  relatives;  and 

4.  criminal  justice  reforms  designed  to  avoid  the 
repetition  of  mass  abuses. 

Trials  and  Amnesties 

The  question  is  how  to  go  about  this  highly 
sensitive  task.  The  classic  response  to  this  after 
wars  has  been  the  successor  trials — to  put  the  high 
command  of  the  outgoing  regime  in  the  dock.  The 
best  known  example  in  the  contemporary  period  is 
the  trial  of  Saddam  Hussein  in  Iraq,  ending  in  his 
conviction  and  execution.  And  although  there  are 
many  precedents,  the  most  relevant  one  for  these 
purposes  is  that  of  the  Nuremberg  and  the  Tokyo 
trials  after  World  War  II,  which  in  many  ways  set 
the  template  for  the  form.  Nuremberg  was  largely 
a  response  to  the  Holocaust,  which  caused  such  a 
shock  to  humanity  that  it  triggered  a  large  number 
of  legal  responses  and  innovations. 

Something  similar  could  be  said  about  the  post- 
Cold  War  era.  The  large  number  of  internal  con¬ 
flicts  that  have  erupted  in  the  absence  of  the 
constraints  imposed  by  the  bipolar  international 
dynamic  have  led  to  the  growth  of  international 
criminal  justice  institutions.  The  International 
Criminal  Tribunal  for  Former  Yugoslavia  (ICTY), 
the  International  Criminal  Tribunal  for  Rwanda, 
and  hybrid  (i.e.,  courts  that  combine  national  and 
international  elements)  criminal  tribunals  in 
Bosnia-Herzegovina,  Cambodia,  Kosovo,  Sierra 
Leone,  and  Timor-Leste  are  among  them.  The 
most  significant  addition  to  this  roster  of  tribunals 
is,  of  course,  the  International  Criminal  Court 
(ICC),  established  in  the  Rome  Treaty  signed  in 
1998,  and  which  came  into  being  in  2002  in  The 
Hague.  Despite  the  unwillingness  of  some  large 
powers  to  ratify  the  Rome  Treaty  and  become 


party  to  it  (most  prominently  the  United  States, 
which  went  so  far  as  to  formally  unsign  the  treaty), 
as  of  2009,  108  UN  member  states  had  ratified  it. 

The  ICC  has  come  to  fill  a  long-standing  void 
in  international  law,  making  it  possible  to  bring 
individuals  to  justice,  something  for  which  the 
International  Court  of  Justice,  also  based  in  The 
Hague,  has  no  jurisdiction  (only  states  have 
standing). 

Hand  in  hand  with  this  development  has  been 
the  expansion  of  universal  jurisdiction.  By  this  is 
meant  the  ability  of  judges  to  investigate,  indict, 
and  ultimately,  arrest  individuals  for  war  crimes, 
crimes  against  humanity,  and  massive  human 
rights  abuses  committed  anywhere  in  the  world. 
Although,  under  normal  circumstances,  the  juris¬ 
diction  of  criminal  courts  is  fixed  territorially  (i.e., 
by  the  boundaries  of  the  state  in  which  the  crime 
was  committed),  globalization  and  a  higher  aware¬ 
ness  of  the  need  to  stand  up  for  human  rights  have 
started  to  change  this,  and  a  more  expansive 
notion  of  jurisdiction  has  taken  hold.  Spain,  where 
a  law  allows  for  universal  jurisdiction  in  cases  of 
human  rights  violations  involving  Spanish  citizens 
or  Spanish  residents,  has  been  a  country  whose 
judges  have  been  especially  active  in  this  regard. 

In  1998,  General  Augusto  Pinochet  became  the 
first  former  head  of  state  arrested  abroad  (in 
London)  for  human  rights  violations  committed  at 
home.  Shortly  thereafter,  Serbian  President 
Slobodan  Milosevic  became  the  first  sitting  head  of 
state  to  be  indicted  for  such  crimes  by  the  ICTY, 
being  eventually  arrested  and  taken  to  stand  trial 
in  The  Hague,  where  he  died  in  prison.  Former 
Liberian  strongman  Charles  Taylor  suffered  a 
similar  fate,  awaiting  trial  in  The  Hague.  In  2008, 
Sudanese  President  Omar  al-Bashir  became  the 
first  head  of  state  to  be  indicted  by  the  ICC  special 
prosecutor  for  genocide,  crimes  against  humanity, 
and  war  crimes  in  Darfur.  In  2009,  the  ICC  issued 
arrest  warrants  against  Bashir  on  the  last  two 
charges  but  not  the  first. 

Domestically,  however,  as  Ruti  Teitel  has 
observed,  much  as  successor  trials  are  seen  as  crucial 
for  the  reestablishment  of  the  rule  of  law,  consid¬ 
ered  to  have  been  compromised  by  the  ancien 
regime,  they  also,  paradoxically,  may  endanger  the 
legal  foundations  of  the  new  order.  A  basic  principle 
of  criminal  law  is  that  of  nonretroactivity  of  the 
penal  code:  Nulla  poena  sine  lege.  Yet  the  crimes  for 
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which  the  authorities  and  functionaries  of  the  old 
regime  are  to  be  held  accountable  for  are  often  not 
typified  as  such  in  the  statutes  of  the  preexisting 
order.  The  risk,  thus,  is  to  give  the  appearance  of 
highly  politicized,  quasi  “show  trials”  that  make  a 
mockery  of  the  moral  and  juridical  superiority 
claims  of  the  new  leaders.  Some  have  argued  that 
that  is  exactly  what  happened  with  the  trial  of 
Saddam  Hussein.  After  the  return  of  democracy, 
both  Greece  (in  the  1970s)  and  Argentina  (in  the 
1980s)  prosecuted  the  members  of  the  military  junta 
and  had  them  stand  trial  in  special  tribunals  set  up 
for  that  purpose. 

At  the  other  end  of  the  spectrum  of  responses 
are  amnesties.  The  enactment  of  amnesty  takes 
cognizance  of  the  fact  that  evil  deeds  were  commit¬ 
ted  and  that  they  need  to  be  addressed.  However, 
it  is  estimated  that  the  needs  of  national  peace  and 
reconciliation  take  precedence  and  that,  for  their 
sake,  it  is  best  to  let  sleeping  dogs  lie.  After  its  own 
transition  to  democracy  in  1985,  Uruguay  enacted 
such  a  law.  A  variant  of  this  is  self-amnesty.  In 
1978,  Chile’s  military  regime,  led  by  General 
Augusto  Pinochet,  approved  such  an  amnesty  law, 
covering  human  rights  violations  committed  dur¬ 
ing  the  1973  to  1978  period. 

According  to  some  legal  scholars,  self-amnesties 
are  null  and  void  as  they  violate  a  basic  legal  prin¬ 
ciple:  No  one  should  be  entitled  to  enact  legislation 
for  his  or  her  own  benefit.  Yet  when  President 
Patricio  Aylwin  (1990-1994)  gave  consideration 
to  derogating  Pinochet’s  amnesty  law,  one  coun¬ 
terargument  he  faced  was  that  once  the  penal 
responsibility  for  a  given  crime  has  been  extin¬ 
guished  by  an  amnesty  law,  it  cannot  be  recreated 
simply  by  derogating  the  amnesty  law. 

The  limitations  and,  in  some  cases,  the  unsatis¬ 
factory  results  of  both  successor  trials  and  blanket 
amnesties  have  contributed  to  the  rise  of  a  third 
option  in  the  tool  kit  of  transitional  justice — that 
of  TCs. 

Truth  Commissions 

TCs  are  officially  appointed  (although  usually 
independent)  bodies  tasked  with  investigating 
human  rights  violations  during  a  specific  period 
(often  that  of  the  previous  authoritarian  regime  or 
the  one  on  the  losing  end  of  a  war).  Their  powers 
vary,  but  they  are  generally  made  up  of  respected 


personalities  who  are  supposed  to  produce  within 
a  specified  time  period  (ideally  not  more  than  6 
months  to  2  years)  a  report  that  documents  those 
violations  for  the  record  and  establishes  a  factual 
truth  about  them.  They  serve  as  group  exercises  in 
collective  expiation  and  memorialization.  They  are 
not  tribunals,  and  they  may  be  formed  by  nation¬ 
als  (the  general  rule),  foreign  citizens  (in  cases 
where  nationals  dare  not  tread,  as  happened  in  the 
one  in  El  Salvador),  or  both. 

TCs  are  not  necessarily  incompatible  with  pre¬ 
existing  amnesty  provisions,  nor  do  they  preclude 
subsequent  prosecutions  by  the  courts.  They  have 
increasingly  become  the  policy  tool  of  choice  of 
new  democracies  as  they  come  to  terms  with  their 
evil  past.  Several  dozen  of  them  have  been  estab¬ 
lished  in  the  course  of  the  past  decades.  Two  par¬ 
ticularly  significant  ones,  associated  with  two 
emblematic  transitions  of  the  1990s,  were  the 
Chilean  (1990-1991)  and  the  South  African  (1995- 
1998)  Truth  and  Reconciliation  Commissions 
(TRCs).  If  the  quality  of  the  report  of  the  former 
was  considered  to  have  set  a  standard  of  sorts,  it 
was  the  latter,  led  by  Archbishop  Desmond  Tutu, 
that  most  enhanced  the  profile  of  TCs,  particularly 
because  of  its  innovative  use  of  public  hearings. 
Since  then,  countries  such  as  Peru,  Panama,  Timor- 
Leste,  Ghana,  Burundi,  Nigeria,  and  Sierra  Leone 
have  relied  on  them,  with  varying  results.  As  a  rule, 
TCs  have  largely  not  been  used  in  Eastern  and 
Central  Europe,  where  the  preferred  system  of  deal¬ 
ing  with  the  past  has  been  that  of  lustration, 
whereby  former  party  officials  and  high-ranking 
functionaries  are  disqualified  from  government 
service  in  the  new  dispensation. 

The  reasons  for  this  veritable  explosion  of  truth 
commissioning  are  many.  On  the  one  hand,  they 
provide  a  reasonable  compromise  between  the 
extremes  of  outright  prosecution  through  special 
courts  and  full  amnesty.  On  the  other,  through  their 
fact  finding,  they  can  lay  the  foundations  for  much 
of  the  work  that  needs  to  be  done  to  heal  the  body 
politic  of  the  blows  inflicted  on  it  by  dictatorship 
and/or  war.  The  truth  must  be  established:  Violations 
need  to  be  put  on  the  public  record.  A  measure  of 
justice  must  be  meted  out:  The  culprits  deserve 
some  punishment,  even  if  only  symbolic.  With  the 
facts  on  the  open,  reparations  can  proceed.  Not 
bound  by  the  exacting  evidentiary  standards  of  judi¬ 
cial  procedures,  they  have  more  flexibility  to  call  on 
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witnesses  and  to  gather  relevant  data.  The  estab¬ 
lished  judiciary,  compromised  by  its  acquiescence  to 
dictatorship,  is  often  not  in  a  position  to  undertake 
anything  comparable  to  this.  Still,  it  may  well  be 
able  to  follow  up  on  the  work  of  the  TC,  using  the 
initial  evidence  thus  gathered  to  prosecute  the  cul¬ 
prits  of  human  rights  violations. 

TCs  are  most  appropriate  after  a  significant 
regime  change,  in  which  major  human  rights  viola¬ 
tions  were  committed  primarily  by  one  side,  and 
when  there  is  still  a  certain  balance  between  the 
democratizing  forces  and  those  of  the  outgoing 
coalition.  Under  such  circumstances  and  when 
there  is  often  some  dispute  about  what  actually 
happened  under  the  previous  regime,  the  need  to 
have  a  common  national  narrative  about  those 
facts  is  especially  acute  to  overcome  past  divisions 
and  forge  a  common  future. 

The  parallel  has  been  made  with  the  needs  of 
patients  suffering  from  posttraumatic  stress  disor¬ 
der.  People  who  have  undergone  major  traumas  in 
their  lives  often  find  themselves  disoriented  and  in 
acute  need  of  telling  their  story  to  someone  who 
will  take  it  seriously.  It  should  not  be  surprising  to 
realize  that  individuals  who  lost  their  loved  ones 
and/or  were  themselves  submitted  to  torture  and 
were  then  publicly  denounced  as  subversives 
should  often  suffer  from  all  sorts  of  psychological 
difficulties  and  should  need  help. 

Beyond  individual  grief  and  the  imperative  to 
overcome  it,  looms  a  larger  issue,  what  Jose 
Zalaquett  has  referred  to  as  the  cathartic  function 
of  TCs.  After  the  collective  trauma  of  repressive 
dictatorship  or  war  and  all  it  entails,  nations  need 
a  moment  of  rebirth  or  regeneration  of  their  sense 
of  identity  and  being,  one  that  provides  a  clean 
break  with  that  oppressive  past  and  thus  gives 
them  the  necessary  impetus  to  forge  ahead  and 
build  a  better  future.  If  conducted  properly,  TCs 
can  perform  this  function.  The  experience  of  the 
South  African  TRC,  which  in  its  2.5  years  of  exis¬ 
tence  engaged  in  one  of  the  most  extraordinary 
exercises  in  national  soul-searching  ever  under¬ 
taken,  remains  a  prime  example  of  that.  The  pub¬ 
lic  hearings  held  during  those  years,  often  in 
churches,  with  strong  religious  undertones  and 
amply  covered  by  the  media,  made  the  country 
come  to  terms  with  the  legacy  of  apartheid  in  a 
way  that  perhaps  no  other  mechanism  could 
have. 


Until  the  South  African  TRC,  no  TC  had  ever 
held  public  hearings.  Since  then,  many  countries, 
including  Nigeria,  Grenada,  Timor-Leste,  Ghana, 
Peru,  Morocco,  and  Paraguay  have  done  so. 
Public  hearings  have  distinct  advantages  over 
those  held  in  camera.  In  our  media-driven  societ¬ 
ies,  they  generate  enormous  public  attention, 
increase  the  awareness  of  past  abuses,  and  are 
more  likely  to  trigger  public  debate  than  the  mere 
publication  of  a  report.  However,  they  are  time- 
consuming,  expensive  to  set  up,  and  demand 
enormous  amounts  of  staff  time.  They  are  also  a 
high-risk  endeavor.  The  need  for  procedural  fairness 
in  the  proceedings  of  TCs  is  especially  apparent  in 
the  case  of  public  hearings,  given  the  lack  of  ordi¬ 
nary  judicial  rules  like  witness  cross-examination. 

Another  critical  issue  is  that  of  “naming 
names.”  A  key  feature  of  TCs  is  their  victim- 
oriented  approach,  so  different  from  the  regular 
adversarial  penal  proceedings  prevailing  in  the 
West,  mainly  directed  toward  establishing  guilt 
rather  than  toward  highlighting  the  victim’s  plight. 
In  that  context,  the  issue  of  whether  the  report  of 
TCs  should  actually  name  the  likely  culprits  of 
human  rights  violations  is  highly  contested.  For 
some,  given  the  lack  of  exacting  evidentiary  stan¬ 
dards  in  the  TC’s  hearings  and  procedures,  this 
would  be  an  inherently  unfair  and  even  dangerous 
exercise.  For  others,  it  would  be  the  equivalent  of 
the  journalistic  mention  of  the  suspects  of  a  crime, 
no  more  and  no  less.  Yet  given  the  official  weight 
that  the  TC  reports  carry,  the  latter  is  a  complex, 
if  not  downright  dubious,  proposition.  Passing  on 
the  relevant  information  to  the  courts  for  further 
action  rather  than  making  such  names  public 
would  seem  to  be  the  more  prudent  course  of 
action. 

Reparations 

Inextricably  intertwined  with  TCs,  though 
analytically  separate  from  them,  is  the  issue  of 
reparations  for  human  rights  violations.  Lost  lives 
cannot  be  resurrected,  and  cutting  off  limbs 
inflicts  permanent  damage  to  its  victims.  Yet  some 
degree  of  reparation  is  both  of  material  and  sym¬ 
bolic  significance.  Through  reparations,  the  state 
effectively  acknowledges  its  responsibility  for  past 
human  rights  violations.  Who  qualifies?  For  how 
much?  How  fair  is  it  to  impose  what  may  be  a 
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heavy  budgetary  burden  for  reparations  on  the 
governments  of  developing  societies  that  have 
great  difficulty  in  balancing  competing  social  and 
economic  priorities?  What  about  intergenera- 
tional  justice?  Why  should  present  and  future 
generations  pay  for  the  sins  of  past  ones?  These 
are  some  of  the  questions  raised  by  the  vexed  issue 
of  reparations. 

The  report  of  TCs  provides  an  important  initial 
data  set  that  makes  it  possible  to  identify,  at  least 
on  a  preliminary  basis,  the  likely  beneficiaries  of 
such  reparations.  There  is  consensus  in  the  litera¬ 
ture,  however,  that  the  formal  handling  of  repara¬ 
tions  claims  should  be  done  by  a  separate  body, 
established  after  the  TC  has  discharged  its  man¬ 
date.  This  is  likely  to  be  a  long  and  cumbersome 
process,  which  may  take  several  years  to  be 
completed. 

Reparations  are  by  no  means  limited  to  cash 
payments.  Lump  sum  payments  (especially  involv¬ 
ing  high  sums  to  large  numbers  of  people)  can  be 
particularly  onerous  on  the  public  purse,  which  is 
why  pensions  and  scholarships  are  often  preferred. 
As  Pablo  de  Greiff  has  pointed  out,  reparations 
include  apologies,  educational  and  health  services, 
pensions,  property  and  job  restitutions,  business 
loans,  and  other  types  of  assistance.  A  particularly 
sensitive  issue  is  how  to  determine  the  reparations 
due  to  torture  victims,  as  evidentiary  questions 
often  arise. 

Some  TCs,  though  by  no  means  all,  have  added 
reconciliation  to  their  name.  This  has  also  become 
a  contested  term.  Prima  facie,  there  is  nothing 
wrong  with  it.  Presumably,  the  very  purpose  of 
any  TC  is  to  bring  the  country  together,  to  recon¬ 
cile  it  with  itself,  and  to  avoid  the  repetition  of 
another  cycle  of  human  rights  abuses.  Yet  some, 
including  Chilean  president  Michelle  Bachelet, 
herself  the  victim  of  human  rights  violations,  have 
argued  that  reconciliation  is  a  very  personal  pro¬ 
cess  that  cannot  be  expected  to  occur  in  a  collec¬ 
tive,  such  as  a  nation.  Others  have  stated  that  it  is 
a  concept  with  religious  overtones,  which  has  no 
place  in  contexts  of  separation  of  church  and 
state. 

Setting  the  record  straight  in  terms  of  the  vic¬ 
tims  and  what  happened  to  them  (the  “Truth”); 
aiming  for  a  measure  of  limited,  even  if  only  sym¬ 
bolic,  sanction  of  the  culprits  (“some  Justice”); 
and  setting  up  follow-up  mechanisms  that  through 


reparations  to  the  victims  or  their  relatives  and 
some  institutional  changes  (human  rights  educa¬ 
tion  and  reform  of  the  judiciary  and  of  the  mili¬ 
tary)  prevent  such  future  wrongdoings  may,  in  the 
end,  be  the  best  that  TCs  can  aim  for. 

Jorge  Heine 

Centre  for  International  Governance  Innovation 
Waterloo,  Ontario,  Canada 
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Triangulation 


Triangulation  is  a  research  strategy  in  which  one 
brings  multiple  forms  of  evidence  to  bear  on  a 
single  research  question.  The  term  triangulation  is 
often  modified  by  an  adjective  (e.g.,  within- 
method  triangulation)  to  reflect  how  the  eviden¬ 
tiary  variation  is  introduced.  In  the  years  since  the 
term  was  introduced  into  the  research  methods 
lexicon,  two  primary  accounts  of  the  value  of  tri¬ 
angulation  have  emerged.  One  account  views  tri¬ 
angulation  as  valuable  for  what  it  reveals  about 
the  validity  of  a  descriptive  or  causal  inference. 
The  second  views  triangulation  as  valuable  for 
how  it  enriches  the  perspective  one  gains  on  the 
question  under  investigation.  The  endorsement  of 
triangulation  for  one  or  both  of  these  reasons  is 
widespread  within  the  social  sciences,  but  skepti¬ 
cal  and  cautious  voices  are  heard  as  well.  This 
entry  discusses  varying  perspectives  of  triangula¬ 
tion  and  points  to  new  developments. 

Early  references  to  triangulation  focused  on 
measuring  concepts  with  data  gathered  from  mul¬ 
tiple  methods.  The  classic  reference  is  to  the  1966 
volume  Unobtrusive  Measures  by  Eugene  J.  Webb 
and  colleagues.  The  advice  was  threefold:  (1)  sup¬ 
plement  measures  of  social  and  political  behavior 
drawn  from  interviews  and  questionnaires  with 
(unobtrusive)  measures  drawn  from  physical  trace 
evidence,  documents,  and  observation;  (2)  test  any 
given  hypothesis  repeatedly  using  the  different 
measures  of  key  concepts;  and  (3)  treat  the  hypoth¬ 
esis  as  more  credible  if  all  the  tests  converge  in 
support.  The  reasoning  is  that  measures  built  from 
different  data  sources  will  each  be  flawed  in  ways 
that  the  researcher  should  be  able  to  anticipate,  if 
not  correct,  in  advance.  But  the  particular  flaws 
will  vary  across  the  measures.  A  survey-based  mea¬ 
sure  of  alcohol  consumption,  for  example,  would 
likely  suffer  from  social  desirability  bias,  which 
would  not  threaten  the  validity  of  a  measure  based 
on  (covert)  observation  of  how  many  liquor  bot¬ 
tles  were  found  in  a  person’s  trash,  even  though 
the  latter  will  have  its  own  limitations.  If,  there¬ 
fore,  one  finds  similar  results  when  testing  a 
hypothesis  with  each  measure,  one  can  draw  con¬ 
clusions  with  greater  confidence.  It  becomes 
implausible  to  view  the  findings  as  an  artifact 
driven  by  biases  unique  to  each  measure. 


Note  that  the  major  concern  here  is  systematic 
error  in  the  measures  (a  question  of  measurement 
validity)  not  random  error  (a  question  of  measure¬ 
ment  reliability).  Hence  the  advice  is  to  repeat 
one’s  analysis  using  each  measure,  not  to  form 
indices  from  the  set  as  if  they  were  simply  parallel 
measures.  This  strategy  loses  value,  however,  if  it 
is  plausible  to  view  the  bias  in  each  measure  as 
having  concordant  effects;  even  if  the  hypothesis 
were  false,  flaws  in  the  measures  could  produce 
converging  results.  Hence,  it  is  important  to  select 
measures  where  the  biases  are  expected  to  produce 
discordant  effects  on  the  results. 

Influenced  by  Norman  Denzin’s  more  expansive 
treatment  of  the  topic  in  1970,  social  science  under¬ 
standing  of  triangulation  quickly  extended  beyond 
the  measurement  context.  To  represent  the  varying 
ways  in  which  different  forms  of  evidence  can  be 
combined,  triangulation  is  now  routinely  differenti¬ 
ated  into  types.  Between-method  triangulation 
involves  multiple  methods  of  data  collection  (focus 
groups,  sample  surveys,  participant  observation  in 
field  settings,  content  analysis  of  documents,  labo¬ 
ratory  experiments,  and  so  on).  In  within-method 
triangulation,  the  data  collection  method  is  held 
constant  but  the  design  (e.g.,  analyze  a  panel  and  a 
cross-sectional  survey)  or  measurement  technique 
(e.g.,  work  with  different  survey  questions  or  ques¬ 
tion  types)  may  vary.  Data  triangulation  refers  to 
the  use  of  data  on  multiple  samples,  obtained  at 
different  times  and/or  in  different  contexts.  With 
investigator  triangulation,  multiple  investigators 
work  at  least  semi-independently  on  a  joint  ven¬ 
ture.  In  analysis  triangulation,  the  same  data  are 
analyzed  using  multiple  techniques  (e.g.,  apply  nar¬ 
rative  and  conversational  analysis  to  in-depth  inter¬ 
view  texts  or  analyze  quantitative  data  with  both 
Bayesian  and  non-Bayesian  statistical  techniques). 

Two  accounts  of  triangulation’s  purpose  and 
value  emerged  alongside  this  expansion  of  its 
forms.  One  follows  the  same  logic  of  corroboration 
laid  out  in  the  earlier  work  on  measurement: 
Because  each  piece  of  evidence — generated  from 
research  that  varies  in  method,  design,  measure¬ 
ment  technique,  sample,  investigator,  and/or  mode 
of  analysis — has  weaknesses  not  shared  with  the 
other(s),  working  with  multiple  forms  of  evidence 
should  help  the  researcher  reach  conclusions  that 
are  more  valid  than  if  any  one  evidentiary  source 
had  been  used  alone.  Whether  the  goal  is  descriptive 
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or  causal  inference,  triangulation  should  enhance 
the  researcher’s  ability  to  rule  out  rival  explanations 
for  the  findings  and/or  establish  their  generality. 

Of  course,  just  what  is  gained  in  any  study 
depends  on  the  research  question  and  form(s)  of 
triangulation  employed.  Working  with  multiple 
samples,  as  with  data  triangulation,  will  speak 
most  directly  to  questions  of  generalization. 
Combining  lab  and  field  experiments  will  offer 
evidence  both  on  the  validity  of  a  causal  hypothe¬ 
sis  and  on  whether  a  result  found  in  the  lab  setting 
generalizes  to  the  natural  setting.  Corroboration  of 
a  result  when  using  two  or  more  analysis  tech¬ 
niques  demonstrates  that  the  finding  is  not  an 
artifact  of  the  specific  assumptions  that  underlie 
each  one.  As  these  examples  suggest,  for  triangula¬ 
tion  to  be  effective,  the  researcher  must  deliber¬ 
ately  select  forms  of  evidentiary  variation  to 
achieve  specific  objectives. 

The  second  account  of  triangulation’s  purpose 
and  value  follows  a  logic  of  complementarity 
rather  than  corroboration.  It  boils  down  to  three 
propositions: 

1.  Any  one  form  of  empirical  evidence  will 
inevitably  yield  a  partial  or  incomplete 
understanding  of  the  phenomena  under  study. 

2.  Different  forms  of  evidence  will  provide 
different  perspectives  on  and  insights  into  the 
phenomena. 

3.  Thus,  by  combining  forms  of  evidence,  a  research 
project  will  generate  a  richer  and  more  complete 
body  of  knowledge  about  the  phenomena. 

Examples  abound.  Observational  data  may  make 
evident  what  interview  data  do  not  because  of 
people’s  tendencies  to  focus  on  what  is  salient, 
novel,  or  positive.  Analysis  of  quantitative  data 
sets  might  establish  an  empirical  regularity  while 
in-depth  case  studies  provide  insights  into  causal 
mechanisms.  Interviews  with  multiple  actors 
who  work  within  or  interact  with  an  organiza¬ 
tion  will  provide  multiple,  situated  perspectives 
on  the  dynamics  of  interest.  Documents  may  be 
used  to  gain  insight  into  the  historical  and  insti¬ 
tutional  context  of  a  problem  while  interviews 
are  used  to  acquire  perspectives  on  current 
debates. 

Some  social  scientists  embrace  both  purposes  of 
triangulation.  One  version  of  this  position  simply 


conceives  of  triangulation  as  a  multipurpose 
research  strategy — one  that  may  at  times  be  used 
to  see  if  a  descriptive  or  causal  hypothesis  can  be 
corroborated  with  multiple  forms  of  evidence  and 
that  may  at  other  times  be  used  to  generate  a  more 
complete  (deeper,  broader,  multifaceted,  contextu¬ 
alized,  complex,  and/or  holistic)  understanding  of 
a  topic.  Another  version  sees  both  purposes  as 
potentially  relevant  to  any  hypothesis-testing 
study,  depending  on  the  results:  If  the  triangulated 
results  converge,  the  hypothesis  will  be  validated 
by  the  logic  of  corroboration,  but  if  the  findings 
are  inconsistent  or  contradictory,  the  researcher 
can  develop  a  more  complete  (nuanced,  contextu¬ 
alized,  deeper,  etc. )  explanation  that  reconciles  the 
divergent  results. 

Other  social  scientists  adopt  one  version  of  the 
virtues  of  triangulation  but  not  the  other.  In  some 
of  the  literature,  triangulation  is  simply  associated 
with  one  objective  (corroboration  via  convergence 
vs.  enrichment  via  multiplicity)  while  the  other  is 
ignored  or  underplayed.  Scholars  who  work  pri¬ 
marily  with  qualitative  methods  frequently  go 
further,  however,  and  reject  the  corroboration 
point  of  view  entirely,  arguing  that  it  presupposes 
an  (positivist)  epistemology  that  is  inconsistent 
with  their  own.  The  issues  here  are  complex  and 
the  viewpoints  many,  but  they  often  revolve 
around  the  interpretation  of  convergence  versus 
divergence  in  results.  Consider,  for  example,  a 
project  on  the  topic  of  “bureaucratic  conflict”  that 
finds  widely  varying  accounts  of  how  much  con¬ 
flict  there  is  between  political  appointees  and  civil 
servants  depending  on  who  is  being  queried  about 
the  matter  and  that  uncovers  discrepancies  between 
interview  reports  and  evidence  gleaned  from  pub¬ 
lic  records.  A  simple  (perhaps  simplistic)  applica¬ 
tion  of  the  convergence  logic  would  treat  that 
variability  as  due  to  bias  in  each  indicator  of  the 
true  or  objective  level  of  conflict.  To  scholars 
working  within  constructivist  or  interpretivist  tra¬ 
ditions,  this  would  make  no  sense  at  all.  Differences 
in  perspective  across  individuals  are  to  be  expected, 
public  records  have  their  own  generative  processes, 
and  all  this  variation  is  constitutive  of  the  phenom¬ 
enon  of  bureaucratic  conflict  and  central  to  the 
inquiry,  not  getting  in  the  way  of  a  clear  under¬ 
standing  of  it. 

Ideas  about  triangulation  are  still  evolving. 
New  issues  are  arising  in  the  burgeoning  literature 
on  research  strategies  that  mix  quantitative  and 
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qualitative  evidence  (often  called  mixed-method  or 
multimethod  research  strategies).  Although  one 
still  finds  the  between-methods  triangulation  label 
applied  to  any  research  project  that  has  a  qualita¬ 
tive  and  a  quantitative  component,  scholars  are 
beginning  to  depict  triangulation  as  just  one  form 
of  mixing  the  methods.  For  example,  since  in  trian¬ 
gulation  the  various  methods  are  chosen  a  priori 
for  how  they  together  provide  more  insight  into  a 
central  research  question  than  any  one  taken  alone, 
it  would  be  a  stretch  to  treat  a  project  in  which 
quantitative  and  qualitative  methods  are  used  in 
seriatim ,  with  the  results  of  the  first  used  to  deter¬ 
mine  what  is  done  (sampled,  measured,  focused  on) 
in  the  second,  as  a  case  of  triangulation.  Systematic 
attention  is  also  being  given  to  the  varying  purposes 
served  by  qualitative  and  quantitative  evidence  in 
mixed-method  designs  and  to  the  ways  in  which 
such  evidence  can  be  effectively  integrated. 

Laura  Stoker 
University  of  California,  Berkeley 
Berkeley,  California,  United  States 
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Social  trust  is  the  expectation  of  reliance  that  indi¬ 
viduals  in  a  community  have  toward  each  other  on 
the  basis  of  shared  norms,  mutual  reciprocity,  and 
cooperative  behavior.  Trusting  others  is  based  on 
the  belief  that  their  behavior  will  not  be  harmful  or 
deceitful.  Although  trust  in  family  and  close  friends 
may  be  common  in  most  societies,  the  concept  of 
generalized  social  trust  involves  trusting  people  in 
a  much  wider  scope  of  social,  political,  and  eco¬ 
nomic  relations.  Also  referred  to  as  interpersonal 
trust,  social  trust  is  regarded  as  a  cultural  trait  that 
develops  with  the  formation  and  maintenance  of 
social  networks  and  secondary  associations. 
Rationalist  approaches  also  see  social  trust  as  a 
learned  reciprocal  and  cooperative  behavior. 
Whether  it  is  cultural  or  rational,  social  trust  varies 
significantly  from  one  society  to  another,  and  this 
variation  is  thought  to  have  an  impact  on  politics 
and  economics.  This  entry  reviews  the  origins  of 
the  concept  of  social  trust,  the  ways  in  which  it  has 
been  more  commonly  measured,  the  limitations 
attributed  to  its  measurement,  the  more  general 
theoretical  and  empirical  relationships  between 
social  trust  and  economic  growth  and  democracy, 
and  the  role  of  social  trust  as  a  key  component  of 
a  broader  concept  of  social  capital. 

Origins  of  the  Concept  of  Social  Trust 

The  idea  of  trust  can  be  found  in  various  political 
thinkers  from  different  ages,  but  its  current  under¬ 
standing  as  trust  in  others  within  a  community  or 
a  set  of  social  networks  can  most  prominently  be 
traced  back  to  Alexis  de  Tocqueville’s  Democracy 
in  America.  He  wrote  that  in  19th-century  America 
vast  numbers  of  associations  “are  formed  and 
maintained  by  the  agency  of  private  individuals” 
on  the  basis  of  trust  and  shared  interests.  From  this 
perspective,  trust  is  strongly  related  to  associa- 
tional  life,  which  in  turn  facilitates  the  pursuit  of 
common  goals  that  would  be  harder  to  achieve 
through  purely  individual  efforts.  The  link  between 
trust,  secondary  associations,  and  democracy  has 
been  emphasized  by  many  authors  throughout  the 
second  half  of  the  20th  century  and  still  attracts  a 
lot  of  attention,  particularly  because  indicators 
have  shown  a  significant  decrease  in  social  trust,  a 
point  that  we  will  return  to  later. 
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In  the  late  1950s,  an  influential  culturalist 
approach  to  politics  put  the  concept  and  measure¬ 
ment  of  social  trust  at  the  center  of  the  academic 
debate.  The  Civic  Culture  study,  published  in 
1963,  established  a  theoretical  and  empirical  rela¬ 
tionship  between  trust  and  democracy.  The  authors, 
Gabriel  Almond  and  Sydney  Verba,  conceived  of 
trust  as  a  component  of  a  democratic  political  cul¬ 
ture.  In  their  view,  stable  democracy  requires  a  set 
of  democratic  values  and  beliefs  that  supports  it, 
and  trust  in  others  is  one  of  those  values.  Their 
book,  based  on  survey  data  from  five  nations,  rein¬ 
forced  the  idea  that  social  trust  is  a  condition  for 
the  formation  of  secondary  associations  and,  as  a 
consequence,  a  feature  that  fosters  the  development 
of  a  participatory  society,  a  pillar  of  democracy. 

The  relationship  between  social  trust  and 
democracy  was  very  persuasive,  and  it  had  a  great 
impact  on  subsequent  empirical  political  research. 
As  trust  was  part  of  a  more  general  democratic 
political  culture,  higher  levels  of  interpersonal  trust 
were  consequently  linked  to  higher  levels  of  democ¬ 
ratization.  Subsequent  research  has  shown  that 
social  trust  is  related  to  effective  democratic  gover¬ 
nance,  not  only  because  secondary  associations  are 
a  vital  part  of  democracy  but  also  because  trust 
enhances  the  development  of  a  loyal  opposition 
respecting  the  “rules  of  the  game,”  facilitates  self¬ 
compliance  of  political  agreements,  and  fosters 
tolerance.  Democratic  politics  is  based  on  rules  and 
procedures  that  require  the  acceptance  and  compli¬ 
ance  of  the  parts,  and  trust  is  a  positive  intervening 
factor.  It  is  important  to  note  that  the  close  rela¬ 
tionship  between  social  trust  and  democracy  as  a 
form  of  government  does  not  mean  that  trusting 
other  people  is  the  same  as  trusting  political  insti¬ 
tutions.  Social  trust  and  political  trust,  generally 
understood  as  confidence  in  political  institutions, 
may  be  related,  but  they  are  two  different  concepts, 
and  each  one  is  subject  to  different  explanations. 

Measures  of  Social  Trust 

Social  trust  has  been  measured  in  various  ways  but 
mainly  through  survey  research.  The  most  com¬ 
mon  indicator  was  used  in  The  Civic  Culture 
study,  but  it  was  only  one  of  five  original  survey 
items  designed  to  measure  “faith  in  people”  that 
its  authors  borrowed  from  previous  research.  The 
measure  of  social  trust  derives  from  the  following 


question:  “Some  people  say  that  most  people  can 
be  trusted.  Others  say  you  can’t  be  too  careful  in 
your  dealings  with  people.  How  do  you  feel  about 
it?”  The  percentage  of  survey  respondents  who 
chose  the  option  “most  people  can  be  trusted” 
became  an  empirical  representation  of  social  or 
interpersonal  trust  as  well  as  an  initial  basis  for 
cross-national  comparisons.  Today,  many  com¬ 
parative  public  opinion  surveys  include  this  item  in 
their  questionnaires,  and  some  of  them  have  kept 
record  of  it  for  several  years,  providing  a  relatively 
standard  measure  for  a  wide  array  of  societies  with 
different  levels  of  economic  development,  political 
institutions,  and  cultural  traditions. 

Although  many  surveys  use  this  indicator  as  a 
standard  measure  of  social  trust,  the  question  from 
which  it  is  derived  has  been  subject  to  criticism  and 
continual  revision.  One  of  the  main  complaints  is 
that  the  question  is  posed  as  a  simple  dichotomy  in 
which  people  in  general  can  only  be  trusted  or 
distrusted.  This  implies  that  trust  is  not  a  matter  of 
degree.  However,  this  is  questionable  because,  as 
some  argue,  some  groups  in  society  can  be  trusted 
at  a  higher  rate  than  others,  or  the  same  group  of 
people  can  be  trusted  under  some  circumstances 
but  not  so  much  under  different  ones.  This  leads  us 
to  a  second  criticism,  which  is  that  the  standard 
question  fails  to  provide  any  social  context  or 
group  reference  that  could  help  evaluate  who  can 
be  trusted  and  who  cannot.  If  a  person  says  that 
“most  people  can  be  trusted,”  who  is  included  in 
that  response  and  who  is  not?  Moreover,  can  the 
same  person  be  trusted  under  different  circumstances 
or  situations?  This  leads  to  a  third  criticism — that 
the  question  is  ambivalent  and,  therefore,  an  unre¬ 
liable  indicator  of  social  trust. 

Despite  these  problems  and  the  development  of 
alternative,  context-specific  questions  in  cross¬ 
national  survey  research,  the  standard  question  of 
generalized  social  trust  continues  to  be  widely 
used.  And  this  is  so  mainly  for  two  reasons:  First, 
many  believe  that,  despite  its  limitations,  the  stan¬ 
dard  question  does  tap  generalized  social  trust, 
and  second,  the  percentage  of  respondents  who  say 
“most  people  can  be  trusted”  has  a  high  level  of 
cross-national  variation.  This  is  shown  by  the  most 
recent  European  Values  and  World  Values  Studies, 
conducted  in  2005  to  2007.  The  data  from  those 
surveys  display  a  range  that  goes  from  a  high  of 
74%  of  respondents  in  Norway  who  think  that 
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“most  people  can  be  trusted”  to  a  low  of  4%  in 
Trinidad  and  Tobago  (see  Table  1).  This  signifi¬ 
cant  variation  in  trust  also  correlates  with  other 
variables,  such  as  the  level  of  economic  develop¬ 
ment  and  democracy.  Richer  and  more  democratic 
societies  tend  to  express  higher  levels  of  general¬ 
ized  social  trust. 

Trust  and  Economic  Development 

Cross-national  comparative  studies  as  well  as 
more  particular  case  studies  on  social  trust  have 
shown  that  some  societies  are  highly  trusting  while 
others  are  driven  by  a  culture  of  suspicion  and 
distrust.  The  economic  and  political  implications 
of  this  variation  have  been  documented  in  various 
works.  On  the  economic  side,  some  authors  argue 
that  high-trusting  societies  where  social  networks 
exist  are  far  more  efficient  and  much  more  able  to 
take  advantage  of  information  technology,  for 


example,  than  low-trusting  societies.  The  latter 
face  several  organizational  and  competitive  disad¬ 
vantages  when  compared  with  the  former.  Others 
argue  that  norms  of  trust  and  reciprocity  facilitate 
good  economic  performance  and  create  economic 
prosperity.  Part  of  the  explanation  is  that  trust 
diminishes  the  costs  of  transaction  and  makes  eco¬ 
nomic  relations  more  efficient.  Also,  data  gathered 
in  different  environments  have  shown  that  the 
rates  of  financial  investment  are  related  to  trust, 
thereby,  influencing  economic  growth.  Organiza¬ 
tional  perspectives  have  linked  trust  to  teamwork 
and  higher  levels  of  productivity.  From  any  of 
these  perspectives,  trust  is  clearly  an  economic 
asset. 

Trust,  Democracy,  and  Social  Capital 

On  the  political  side,  the  relationship  between  trust 
and  democracy  has  been  extensively  discussed. 


Table  I  Social  Trust  by  Country:  Percentage  of  Respondents  Who  Say  “Most  People  Can  Be  Trusted” 


Norway 

74 

Italy 

29 

Mexico 

16 

Sweden 

68 

Uruguay 

28 

Serbia 

15 

Finland 

59 

Ukraine 

28 

Burkina  Faso 

15 

China 

52 

Russia 

27 

Colombia 

14 

Vietnam 

52 

Ethiopia 

24 

Morocco 

13 

New  Zealand 

51 

Taiwan 

24 

Cyprus 

13 

Switzerland 

51 

India 

23 

Chile 

12 

Australia 

48 

Bulgaria 

22 

Zambia 

12 

Netherlands 

44 

Andorra 

21 

Iran 

11 

Indonesia 

43 

Romania 

20 

Brazil 

9 

Canada 

42 

Spain 

20 

Malaysia 

9 

Thailand 

42 

Poland 

19 

Ghana 

9 

Hong  Kong 

41 

France 

19 

Peru 

6 

Iraq 

41 

Egypt 

18 

Rwanda 

5 

USA 

40 

Slovenia 

18 

Turkey 

5 

Japan 

39 

Moldova 

18 

Trinidad  & 

4 

Germany 

34 

Mali 

17 

Tobago 

Jordan 

31 

South  Africa 

17 

Britain 

30 

Argentina 

17 

South  Korea 

30 

Guatemala 

16 

Source:  World  Values  Surveys,  2005-2007  (www.worldvaluessurvey.org). 


2674  Trust,  Social 


Interpersonal  trust  is  viewed  as  part  of  a  set  of  val¬ 
ues  that  are  related  to  the  development  of  demo¬ 
cratic  institutions,  although  the  causal  direction  in 
this  relationship  has  been  the  subject  of  an  ongoing 
debate.  The  role  of  trust  and  its  accompanying 
associational  life  influence  the  functioning  of 
democracy,  and  civic  engagement  is  thought  to 
have  a  positive  effect  on  democratic  governance. 
But  this  is  where  a  more  general  concept  of  social 
capital  unfolds,  and  social  trust  is  only  a  part  of  it. 
Social  capital,  generally  understood  in  terms  of 
social  networks  and  norms  of  reciprocity  and  trust¬ 
worthiness,  is  a  more  comprehensive  construct. 
However,  social  trust  can  be  seen  as  an  important 
but  autonomous  component.  Recent  research  has 
found,  for  example,  that  the  link  between  social 
capital  and  socioeconomic  development  is  mainly 
mediated  by  trust  and  not  by  an  organized  civil 
society.  Thus,  social  trust  is  a  component  of  social 
capital,  but  it  should  not  be  equated  to  it. 

Social  capital  studies  have  recorded  a  steady 
decline  in  trust  and  associational  life  during  the 
last  decades,  mainly  in  the  United  States.  This 
decline  has  several  theoretical  and  empirical  impli¬ 
cations,  and  scholars  have  asked  whether  a  decline 
in  trust  is  also  a  decline  in  democracy  or  a  decline 
in  socioeconomic  development.  Following  social 
capital  studies,  as  trust  and  social  networks  are 
strongly  associated  to  giving  and  volunteering,  to 
participatory  communities,  to  higher  levels  of  tol¬ 
erance,  and  as  Robert  Putnam  has  called  them,  to 
“many  other  forms  of  civic  virtue,”  the  decline  of 
trust  has  been  an  alarming  trend.  Some  argue  that 
this  decline  is  the  result  of  an  increase  in  economic 
inequality,  which  means  that  the  once  achieved 
horizontal  social  relations  are  returning  to  vertical 
and  hierarchical  ones.  Others  argue  that  the 
decline  in  associational  life  is  only  observed  in  tra¬ 
ditional  organizations  but  that,  increasingly, 
autonomous  individuals  driven  by  self-expression 
and  emancipative  values  are  creating  different 
ways  of  organizing,  even  through  the  use  of  new 


technologies.  Trust,  for  example,  is  seen  as  part  of 
a  more  postmodern  cultural  syndrome  of  well¬ 
being  than  a  part  of  civic  traditions. 

We  can  confidently  say  that  the  literature  on 
social  trust  takes  on  a  vibrant  topic  that  is  not 
limited  to  the  macrophenomena  of  democracy  and 
socioeconomic  development  but  that  contributes 
to  important  issues  in  organizational  research, 
where  social  trust  is  viewed  as  a  process,  even  a 
strategy,  for  successful  joint  work  between  indi¬ 
viduals,  groups,  institutions,  and  nations. 

Alejandro  Moreno 
Instituto  Tecnologico  Autonomo  de  Mexico 

Mexico  City,  Mexico 

See  also  Civic  Culture;  Civic  Participation;  Political 
Culture;  Social  Capital;  Tolerance 
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Unilateralism 


The  most  precise  and  parsimonious  definition  of 
unilateralism  is  numerical:  a  state  acting  alone. 
However,  unilateralism  has  a  wide  range  of  defini¬ 
tions,  none  of  which  has  emerged  as  predominant 
within  the  literature.  A  further  complication  is 
that  some  definitions  of  unilateralism  are  general 
and  can  be  applied  to  either  security  affairs  or  the 
global  economy,  whereas  other  definitions  are 
specific  to  only  one  of  these  realms.  This  entry 
first  reviews  four  general  definitions  of  unilateral¬ 
ism,  followed  by  two  definitions  that  pertain  spe¬ 
cifically  to  the  global  economy,  and  then  two 
definitions  concerning  security  affairs. 

Although  the  term  unilateralism  has  long 
existed,  it  rose  dramatically  in  prominence  during 
the  George  W.  Bush  administration.  The  Bush 
administration  was  widely  regarded  by  scholars 
and  pundits  as  promoting  a  strong  turn  in  U.S. 
foreign  policy  away  from  multilateralism  and 
toward  unilateralism.  The  widespread  debate  over 
the  Bush  administration’s  foreign  policy  led  to 
greatly  renewed  interest  among  scholars  about  the 
practice  and  consequences  of  unilateralism. 

Of  the  four  general  definitions  of  unilateralism, 
the  easiest  one  to  operationalize  focuses  on  the 
number  of  countries  that  coordinate:  An  action 
with  a  single  participating  state  is  regarded  as  uni¬ 
lateral,  whereas  one  that  has  three  or  more  coordi¬ 
nating  states  is  multilateral.  The  clarity  of  this 
definition  notwithstanding,  many  analysts  ques¬ 
tion  whether  the  mere  fact  that  three  or  more 


states  coordinate  is  sufficient  to  constitute  multi¬ 
lateralism.  Critics  of  this  quantitative  definition 
argue  that  the  real  question  is  not  how  many  coun¬ 
tries  coordinate  but  the  way  in  which  they  do. 
Some  critics  argue  that  for  an  action  to  be  consid¬ 
ered  multilateral,  the  burden  of  coordination  must 
be  widely  shared.  These  analysts  argue,  for  exam¬ 
ple,  that  if  a  powerful  country  contributes  the  vast 
majority  of  its  troops  for  a  military  operation,  then 
it  is  still  best  seen  as  unilateral  even  if  three  or 
more  countries  do  participate  in  the  action.  In  his 
critique  of  the  quantitative  definition,  John  Ruggie 
argues  that  what  distinguishes  multilateralism  is 
that  it  involves  the  coordination  of  policies  by 
three  or  more  states  on  the  basis  of  generalized 
principles  of  conduct.  In  this  view,  ad  hoc  coordi¬ 
nation  by  even  a  large  group  of  states  should  not 
be  considered  multilateralism.  The  fact  that  more 
than  20  countries  contributed  troops  to  the  2003 
Iraq  War,  for  example,  does  not  constitute  multi¬ 
lateralism  since  this  coordination  did  not  occur  on 
the  basis  of  generalized  principles  but  instead 
reflected  particular  interests. 

The  second  general  definition  of  unilateralism  is 
a  state  acting  on  the  basis  of  narrow  self-interest 
without  regard  for  the  interests  of  other  countries. 
Using  this  rubric,  unilateralism  is  easy  to  identify 
in  those  situations  when  a  state  acts  by  itself 
against  the  wishes  of  all  other  countries.  Conversely, 
an  action  is  clearly  not  unilateral  if  all  countries 
agree  that  it  should  be  undertaken.  Yet  it  is  rare  in 
international  politics  that  either  all  or  no  states 
agree  with  a  given  action,  and  the  literature  is 
unclear  regarding  how  many  countries  must  agree 
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with  an  action  before  it  is  no  longer  considered 
unilateral.  In  this  regard,  analysts  sometimes  apply 
the  term  unilateralism  to  actions  that  many  states 
see  as  beneficial  and  participate  in. 

A  related  general  definition  of  unilateralism  is 
when  a  state  acts  without  consulting  other  states 
beforehand.  The  literature  does  not  specify,  how¬ 
ever,  how  much  consultation  must  occur  for  a 
state  or  leader  to  avoid  being  regarded  as  under¬ 
taking  a  unilateral  foreign  policy.  In  this  regard, 
the  Bush  administration  was  generally  viewed  as 
acting  unilaterally  during  its  first  term,  in  signifi¬ 
cant  part  because  it  was  seen  as  not  consulting  very 
much  with  its  allies.  Yet  American  consultation 
with  its  allies  on  some  matters — such  as  intelli¬ 
gence  sharing  regarding  terrorists — remained 
extremely  high  under  the  Bush  administration. 
Moreover,  in  areas  where  U.S.  consultation  with 
allies  did  drop  off,  the  decline  was  not  to  zero;  for 
example,  the  United  States  continued  to  have  a 
very  close  working  relationship  on  military  opera¬ 
tions  with  states  such  as  Britain  and  Australia. 

A  final  general  definition  of  unilateralism  is  a 
state  not  complying  with  international  institutions 
and/or  withdrawing  from  them.  Actions  such  as 
the  Bush  administration’s  withdrawal  from  the 
Anti-Ballistic  Missile  Treaty,  its  failure  to  ratify 
the  Kyoto  Protocol,  and  its  efforts  to  undermine 
the  International  Criminal  Court  were  roundly 
criticized  by  the  international  community  as  uni¬ 
lateral.  As  with  the  previous  two  general  defini¬ 
tions,  the  literature  lacks  a  threshold  regarding  this 
third  definition:  It  is  unclear  exactly  how  much 
noncompliance  with  institutions  and/or  treaty 
withdrawals  must  occur  before  a  state  is  regarded 
as  pursuing  a  unilateral  foreign  policy  in  an  overall 
sense.  It  is  notable  in  this  regard  is  that  the  Bush 
administration  followed  its  predecessors  in  con¬ 
tinuing  to  comply  with  many  international  institu¬ 
tions,  particularly  in  the  economic  realm. 

Unilateralism  has  two  principal  definitions  with 
respect  to  the  global  economy,  the  first  of  which  is 
a  departure  from  the  principle  of  nondiscrimina¬ 
tion.  The  significance  of  nondiscrimination  is  typi¬ 
cally  outlined  with  respect  to  trade,  in  which  a 
defining  feature  of  the  Bretton  Woods  order  was 
the  extension  of  most-favored-nation  treatment  to 
all  members  of  the  General  Agreement  on  Tariffs 
and  Trade  (GATT).  Unilateralism  can  also  be 
applied  to  the  monetary  sphere;  in  this  regard, 


unilateralism  is  typically  equated  with  the  creation 
of  exclusive  currency  blocks,  as  occurred  during 
the  interwar  period. 

A  second  definition  of  unilateralism  pertaining 
to  the  global  economy  is  the  aggressive  use  of 
power  by  a  dominant  state  to  force  a  weaker  state 
to  accept  concessions  in  economic  negotiations. 
Analysts  who  advance  this  definition,  such  as 
Jagdish  Bhagwati,  stress  that  a  powerful  state  such 
as  the  United  States  is  at  inherent  advantage  over 
other  states  in  bilateral  negotiations.  In  this  view,  a 
departure  from  negotiations  within  the  context  of 
global  international  institutions  such  as  the  World 
Trade  Organization  constitutes  unilateralism. 

With  respect  to  security  affairs,  there  are  two 
key  definitions  of  unilateralism.  The  first  focuses 
on  operational  conduct.  According  to  this  perspec¬ 
tive,  a  military  action  is  obviously  unilateral  if  only 
one  state  participates.  Yet  it  is  rare  for  a  state  to 
act  in  the  military  realm  without  any  help  whatso¬ 
ever  from  other  states.  In  light  of  this,  Sarah  Kreps 
argues  that  an  action  by  a  coalition  of  states 
should  still  be  considered  unilateral  if  one  state 
contributes  the  vast  majority  of  the  troops  and 
financial  resources  and  also  if  the  lead  state  is  able 
to  effectively  control  the  decisions  made  by  the 
coalition.  Kreps  argues  that  from  an  operational 
standpoint,  the  2003  Iraq  War  was  clearly  unilat¬ 
eral  in  nature  since  the  United  States  contributed 
around  90%  of  the  troops,  bore  the  vast  prepon¬ 
derance  of  the  financial  costs  of  the  war,  and  was 
dominant  in  terms  of  the  planning  process  for  the 
timing  and  conduct  of  the  military  operation.  She 
argues  that,  in  comparison,  the  1991  Gulf  War 
was  multilateral  since  it  was  much  more  balanced 
on  all  three  of  these  dimensions:  The  United  States 
provided  less  than  75%  of  the  troops,  was  respon¬ 
sible  for  only  11.5%  of  the  coalition  expenditures 
for  the  war,  and  had  relatively  less  authority 
regarding  the  planning  of  the  war  since  most  of  the 
other  major  powers  in  the  world  also  participated. 

Second,  many  scholars  define  unilateralism  in 
the  security  realm  on  the  basis  of  whether  an  inter¬ 
national  institution  authorizes  or  endorses  a  mili¬ 
tary  action.  Scholars  who  make  this  argument 
typically  argue  that  a  military  action  is  unilateral  if 
it  does  not  receive  the  backing  of  the  United 
Nations  (UN).  A  key  reason  for  this  focus  on  the 
UN  is  international  law.  According  to  customary 
international  law,  the  use  of  military  force  is 
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unlawful  without  the  authorization  of  the  UN  (the 
sole  exception  is  self-defense  in  response  to  an 
armed  attack).  Other  scholars  argue  that  it  sets  too 
high  a  standard  to  say  that  any  action  should  be 
considered  unilateral  if  it  fails  to  receive  UN 
authorization;  they  stress  that  if  this  standard  were 
adopted,  then  the  1999  Kosovo  War  would  be 
considered  unilateral  even  though  this  military 
action  was  approved  by  the  North  Atlantic  Treaty 
Organization  (NATO)  and  was  prosecuted  collec¬ 
tively  by  its  members.  For  these  analysts,  the 
Kosovo  case  suggests  the  need  for  a  more  narrow 
definition  of  unilateralism:  A  military  action  should 
be  considered  multilateral  if  it  is  approved  by  any 
international  institution  and  is  unilateral  only  if  it 
completely  fails  to  receive  institutional  backing. 

Although  the  concept  of  unilateralism  is  used 
very  prominently  by  scholars  and  pundits,  analysts 
typically  employ  the  term  without  specifying  it. 
Given  the  plethora  of  different  meanings  of  the 
term,  it  is  incumbent  to  be  precise  about  the  par¬ 
ticular  definition  that  is  being  used.  It  is  only 
recently  that  scholars  have  moved  to  establish 
detailed  coding  procedures  for  clearly  distinguish¬ 
ing  unilateralism  from  multilateralism,  and  more 
work  along  these  lines  would  be  beneficial. 

Stephen  G.  Brooks 
Dartmouth  College 
Hanover,  New  Hampshire,  United  States 
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The  United  Nations  (UN)  is  an  intergovernmental 
organization  dedicated  to  the  maintenance  of 
international  peace  and  security.  It  achieved  nearly 
universal  state  membership  in  2002.  With  192 
members  at  the  beginning  of  the  21st  century,  the 
organization  rests  on  the  principle  of  collective 
security,  by  which  its  members  commit  themselves 
to  reject  the  use  of  force  in  the  settlement  of  their 
disputes  and  promise  to  act  jointly  against  any 
aggressor.  Founded  at  the  close  of  the  modern 
world’s  most  destructive  war,  the  final  provisions 
of  the  UN  Charter  were  negotiated  at  the  San 
Francisco  Conference,  which  met  from  April 
through  June  1945.  The  organization  came  into 
being  on  October  24  of  that  year  with  the  issuance 
of  sufficient  ratifications  by  member  governments. 

History  and  Philosophy 

At  the  time  of  its  creation,  the  UN  was  the  latest 
manifestation  of  a  130-year-old  search  for  mecha¬ 
nisms  of  international  cooperation  and  law  to 
supplant  the  traditional  methods  of  national  self- 
defense  and  war  that  had  been  common  to  interna¬ 
tional  relations.  Following  the  defeat  of  the  French 
Emperor  Napoleon  Bonaparte  in  1815,  the  victori¬ 
ous  allies  gathered  at  the  Congress  of  Vienna  and 
crafted  an  arrangement  dubbed  the  Concert  of 
Europe  to  maintain  a  hoped-for  peaceful  conti¬ 
nent.  Under  the  arrangement,  the  major  powers  of 
Europe  agreed  to  meet  regularly  and  to  act  collec¬ 
tively  against  any  threat  to  the  post-Napoleonic 
settlement.  Often  considered  a  conservative  settle¬ 
ment,  the  Vienna  meeting  nonetheless  ushered  in  a 
reasonably  tranquil  19th  century,  which  experi¬ 
enced  a  creative  expansion  of  international  organi¬ 
zations.  Early  river  commissions  such  as  the 
Central  Rhine  Commission  (1815)  and  the 
European  Danube  Commission  (1856)  reflected  a 
new  interest  in  “functional”  international  organi¬ 
zations  that  built  trust  among  peoples  by  provid¬ 
ing  venues  of  cooperation  to  resolve  common 
problems.  The  International  Telegraphic  Union 
(1865)  and  the  Universal  Postal  Union  (1874) 
paved  the  way  for  numerous  future  international 
agencies  dealing  with  issues  as  diverse  as  narcotic 
drugs,  agriculture,  health,  weights  and  measures, 
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railroads,  time  zones,  and  tariffs.  “Political”  inter¬ 
national  organizations  meant  to  ease  tensions  and 
resolve  formal  disputes  between  governments  also 
proliferated.  In  1899  and  1907,  the  Hague  Con¬ 
ferences  established  the  Permanent  Court  of  Arbi¬ 
tration  for  the  resolution  of  legal  controversies 
between  states. 

The  Congress  System  collapsed  in  the  carnage  of 
World  War  I.  However,  the  war  itself  convinced 
many  leaders  of  the  need  for  a  world  organization 
with  the  authority  and  political  will  to  avert 
another  major  conflict.  U.S.  President  Woodrow 
Wilson  led  the  effort  at  the  1919  Versailles  Peace 
Conference  to  create  the  League  of  Nations,  com¬ 
mitted  to  maintaining  peace  and  security,  fostering 
international  cooperation,  and  developing  compre¬ 
hensive  international  law  as  enforceable  substitutes 
for  the  use  of  force  by  any  member  of  the  league. 

The  League  came  into  being  on  January  10, 
1920,  with  its  headquarters  in  Geneva,  Switzerland. 
The  organization  consisted  of  an  Assembly  com¬ 
posed  of  all  members,  a  Council  that  included  per¬ 
manent  members  from  the  great  powers,  and  a 
Secretariat.  Both  the  Assembly  and  the  Council 
required  unanimity  on  any  decision.  The  Covenant 
also  established  a  Permanent  Court  of  International 
Justice  to  hear  disputes  between  states  and  a  man¬ 
date  system  (whereby  major  colonizing  powers 
would  prepare  colonial  areas  for  independence)  that 
foreshadowed  the  end  of  colonialism.  The  most 
controversial  provision  of  the  Covenant,  Article  10, 
called  for  collective  security  to  assure  nations  of  the 
League’s  protection  against  aggression. 

Wilson’s  League  proved  ineffective  in  resolving 
the  most  important  conflicts  of  the  interwar 
period.  In  no  small  measure,  the  U.S.  rejection  of 
the  Versailles  Treaty,  and,  therefore,  its  nonpar¬ 
ticipation  in  the  League,  undercut  the  will  and 
authority  of  the  organization.  Also,  the  Covenant 
requirement  that  decisions  in  the  League’s  Assem¬ 
bly  and  Council  secure  unanimous  approval — 
acknowledging  each  member’s  sovereignty — made 
action  to  stop  an  aggressor  nearly  impossible.  The 
League  collapsed  as  fascist  governments  ignored 
the  terms  of  the  1919  peace  settlement  and  carried 
out  aggression  in  both  Europe  and  Asia. 

Just  as  the  League  of  Nations  was  the  brainchild 
of  Woodrow  Wilson,  so  too  the  UN  had  its  origins 
in  the  deliberations  of  an  American  president  and 
his  administration. 


When  President  Franklin  Delano  Roosevelt 
entered  World  War  II  as  an  advocate  of  great- 
power  realism,  he  argued  that  the  League  had  been 
too  dependent  on  world  public  opinion  and  not 
adequately  sensitive  to  the  realities  of  power.  By 
1943  he  had  come  to  believe  that  only  a  world 
organization  that  had  none  of  the  League’s  weak¬ 
nesses  was  needed  to  secure  the  postwar  peace. 
Roosevelt  himself  would  name  the  organization 
the  “United  Nations,”  taking  the  name  from  the 
“Declaration  by  United  Nations,”  signed  by  the 
allies  in  January  1942. 

Roosevelt’s  views  shifted  in  part  because  of  the 
strong  advocacy  by  the  U.S.  State  Department  for 
some  postwar  agency  that  could  enforce  the  rule  of 
law  and  the  pacific  settlement  of  international  dis¬ 
putes.  U.S.  Secretary  of  State  Cordell  Hull  was  the 
primary  promoter  of  the  idea.  He  established  a 
departmental  committee  to  draft  plans  for  an 
international  organization.  This  committee,  largely 
under  Leo  Pasvolsky’s  leadership,  worked  until  the 
end  of  the  war  to  develop  the  proposals  that  would 
ultimately  be  the  basis  for  the  UN  Charter  itself. 
The  projected  Charter,  like  the  League’s  Covenant 
before  it,  created  a  plenary  conference,  a  secretar¬ 
iat,  agencies  for  technical  services,  and  most 
important  an  executive  committee,  in  this  case 
consisting  of  the  four  allied  powers — the  United 
States,  Great  Britain,  China,  and  the  former 
USSR — plus  other  nonpermanent  member  states. 
In  a  series  of  meetings  during  the  spring  and  sum¬ 
mer  of  1943,  Roosevelt  informally  gave  his  bless¬ 
ing  to  the  effort  to  obtain  British  and  Soviet  assent 
to  a  new  international  organization.  Following  the 
Tehran  Conference  of  wartime  leaders  in  Novem¬ 
ber  1943,  Roosevelt  approved  a  “Plan  for  the 
Establishment  of  an  International  Organization 
for  the  Maintenance  of  International  Peace  and 
Security.  ” 

The  most  serious  Charter  issue  that  Roosevelt 
had  to  solve  in  the  final  months  of  the  war  was  the 
question  of  voting;  from  his  perspective,  the  issue 
was  how  to  protect  the  traditional  sovereignty  of 
the  nation-states  that  would  be  members  of  the 
world  organization  and  yet  not  allow  the  UN  to 
fall  victim  to  the  requirement  of  unanimity  among 
the  members.  The  topic  was  addressed  at  the  1944 
Dumbarton  Oaks  Conference  and  at  the  Yalta 
Conference  in  February  1945.  Josef  Stalin,  the 
leader  of  the  Soviet  Union,  while  acceding  to 
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majority  voting  in  both  the  General  Assembly  and 
the  contemplated  Security  Council,  insisted  on  an 
absolute  veto  in  the  latter  even  over  the  discussion 
of  any  issue  that  threatened  international  peace  or 
security.  The  United  States  opposed  the  Soviet  posi¬ 
tion,  and  in  the  end,  a  compromise  emerged  under 
which  any  issue  could  be  brought  to  the  Council 
but  substantive  action  on  the  matter  required  an 
enlarged  majority  and  the  concurrence  of  the  five 
permanent  members — the  four  allied  great  powers 
plus  France — in  effect,  giving  each  a  veto. 

As  many  as  50  governments  sent  representa¬ 
tives  to  the  UN  Conference  on  International 
Organization  in  San  Francisco.  While  the  Big 
Three — the  United  States,  the  Soviet  Union,  and 
Great  Britain — already  had  agreed  on  important 
aspects  of  the  new  organization,  the  participation 
of  many  other  nations,  with  their  own  interests, 
made  the  conference  a  complex  exercise  in  multi¬ 
lateral  diplomacy. 

The  great  powers  often  were  required  to  make 
meaningful  alterations  in  the  proposed  Charter  in 
order  to  achieve  the  necessary  votes  for  passage.  In 
response  to  small-state  concerns,  the  competence 
of  the  General  Assembly  was  expanded.  It  was 
given  control  of  the  UN  budget.  The  conference 
agreed  that  the  Assembly  could  discuss  any  issue, 
including  security  concerns  and  threats  to  the 
peace,  at  least  until  the  Security  Council  was 
“seized”  with  the  issue.  Further,  the  secretary- 
general,  elected  by  the  General  Assembly,  would 
be  able  to  bring  any  matter  that  he  believed  threat¬ 
ened  international  peace  and  security  to  the  atten¬ 
tion  of  the  Council  (UN  Charter,  Article  99). 

States  from  Latin  America,  Africa,  and  Asia 
envisioned  an  institution  of  universal  membership 
with  extensive  activities  in  the  economic,  cultural, 
and  human  rights  domains.  In  particular,  they 
pressed  for  a  UN  role  in  the  achievement  of  “inde¬ 
pendence”  for  existing  colonial  possessions.  Their 
demands  led  to  a  declaration  on  self-government 
being  incorporated  in  the  final  Charter.  Under  the 
leadership  of  Latin  American  governments,  there 
was  also  an  effort  to  promote  human  rights.  The 
final  UN  Charter  draft  reflected  this  new  concern, 
becoming  the  first  international  treaty  to  use  the 
term  human  rights ,  in  this  case  citing  those  rights 
six  times  in  the  text  and  calling  for  their  protec¬ 
tion.  To  support  this  effort,  the  founding  delegates 
called  for  the  creation  of  a  Commission  on  Human 


Rights.  Once  established  in  1946,  the  Commission 
proceeded  under  the  leadership  of  Eleanor 
Roosevelt  to  draft  the  Universal  Declaration  of 
Human  Rights. 

Structure  and  Purposes  of  the  United  Nations 

The  UN  Charter  serves  as  the  constitutional  basis 
for  the  organization’s  structures  and  procedures, 
the  rights  and  responsibilities  of  UN  members,  and 
the  UN’s  authority  to  act  in  the  international  com¬ 
munity.  The  Charter  envisions  an  institution  with 
revolutionary  powers  and  authority  in  interna¬ 
tional  affairs.  The  organization  is  granted  not  only 
the  traditional  instruments  of  diplomatic  practice 
but  also  the  authority  to  impose  crushing  sanctions 
and  to  use  military  force  (Article  41),  with  nation¬ 
states  required  to  put  their  troops  at  the  disposal  of 
the  institution.  Chapters  VI  and  VII  of  the  Charter 
provide  the  legal  legitimacy  for  these  sweeping 
powers.  Chapter  VII  contains  the  primary  mecha¬ 
nisms  for  collective  security,  allowing  the  organiza¬ 
tion  to  undertake  enforcement  measures,  while 
Chapter  VI  provides  for  the  traditional  means  of 
diplomacy  to  achieve  the  “pacific  settlement”  of 
disputes. 

Chapter  VII,  Article  39,  gives  the  Security 
Council  the  authority  under  international  law  to 
“determine  the  existence  of  any  threat  to  the 
peace,  breach  of  the  peace,  or  act  of  aggression,” 
and  to  “decide  what  measures  shall  be  taken”  to 
halt  the  threat  or  punish  the  aggressor.  Article 
40  empowers  the  Council  to  take  provisional  mea¬ 
sures  to  prevent  an  escalation  or  aggravation  of  a 
dispute.  In  July  1987,  the  Soviet  Union  and  the 
United  States  for  the  first  time  joined  in  a  resolu¬ 
tion  (598)  on  the  Middle  East  and  demanded  that 
Iran  and  Iraq  “observe  an  immediate  cease-fire”  in 
their  ongoing  war.  Citing  Article  40,  the  Council 
decided  “to  consider  further  steps  to  ensure  com¬ 
pliance”  if  either  of  the  parties  refused  to  accept 
the  UN’s  demands. 

According  to  the  original  conception  of  the  UN 
as  the  guarantor  of  international  peace,  the  orga¬ 
nization  was  expected  to  command  military  forces 
under  the  authority  and  direction  of  the  Security 
Council  and  with  the  help  of  a  Military  Staff 
Committee.  All  member  states,  as  charged  by 
Articles  43  and  44,  were  obliged  to  contribute 
troops  and  equipment  to  these  UN-led  operations. 
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Despite  the  intentions  of  the  founders,  the  Military 
Staff  Committee  quickly  fell  into  disuse,  largely 
ignored  after  1946  as  the  Cold  War  intensified.  In 
place  of  direct  UN  military  action,  the  more  com¬ 
mon  practice  emerged  of  the  Council  empowering 
individual  states  to  take  such  action,  usually  in  the 
form  of  ad  hoc  coalitions.  In  this  context,  the 
Council’s  role  of  legitimation  is  critical.  Using 
Article  48,  which  requires  member  states  “or 
some  of  them,”  to  “carry  out  the  decisions  of  the 
Security  Council,”  the  body  has  regularly  autho¬ 
rized  coalitions  “to  use  all  necessary  means”  to 
restore  peace  and  stability.  Thus,  the  1991  Gulf 
War  against  Iraq  and  interventions  in  Somalia, 
Rwanda,  Bosnia  and  Herzegovina,  East  Timor, 
Albania,  Liberia,  Afghanistan,  and  the  Central 
African  Republic  were  authorized  UN-enforcement 
operations. 

On  paper,  Chapter  VII  worked  a  revolution  in 
international  politics.  The  Cold  War,  however, 
made  that  revolution  more  theory  than  practice. 
With  the  Security  Council  locked  in  a  superpower 
stalemate,  Chapter  VII  provisions  generally  could 
not  be  implemented  when  conflicts  arose.  Council 
members  and  the  General  Assembly  were  forced  to 
fall  back  on  the  more  traditional  methods  outlined 
in  Chapter  VI  in  the  hope  of  resolving  a  dispute  or 
ending  hostilities. 

Chapter  VI  describes  mechanisms  for  noncoer- 
cive  measures  to  settle  disputes  peacefully  between 
nations. 

On  its  own,  Chapter  VI  gives  the  Council  little 
beyond  the  means  of  international  public  persua¬ 
sion  to  end  disputes.  Yet  without  the  far-reaching 
remedies  of  Chapter  VII  at  their  disposal,  the 
Security  Council  and  the  General  Assembly  often 
have  turned  to  broad  interpretations  of  Chapter  VI 
to  end  a  particular  conflict.  It  is  through  this  pro¬ 
cess  of  reinterpretation  that  the  devices  of  “peace¬ 
keeping”  and  “observer”  missions  have  come 
about.  Secretary-General  Dag  Hammarskjold 
coined  the  term  Chapter  VIVi  to  designate  the 
Charter-based  authority  for  these  operations.  A 
new  “gray  area”  of  UN  action  emerged,  keeping 
UN  involvement  relevant  to  the  resolution  of  dis¬ 
putes  in  ways  that  neither  the  Charter  formally 
established  nor  earlier  practice  by  the  League  of 
Nations  contemplated. 

The  Charter  establishes  six  principal  organs  to 
guide  the  organization:  (1)  the  General  Assembly, 


which  serves  as  the  plenary  body  of  the  full  mem¬ 
bership;  (2)  the  Security  Council,  made  up  of  five 
permanent  members  and,  as  of  2009,  10  nonper¬ 
manent  members  who  are  elected  on  a  rotating 
basis  to  2-year  terms;  (3)  the  Economic  and  Social 
Council,  which  oversees  the  myriad  functional 
agencies  reporting  to  the  world  body;  (4)  the 
Trusteeship  Council,  which  is  now  defunct,  hav¬ 
ing  presided  over  the  end  of  the  colonial  system; 
(5)  the  Secretariat  headed  by  the  secretary-general, 
which  provides  the  international  civil  servants  for 
all  the  administrative  tasks  of  the  UN;  and  (6)  the 
International  Court  of  Justice  (ICJ),  successor  to 
the  League  of  Nations  Permanent  Court  of 
International  Justice  (PCIJ).  All  but  the  ICJ  are 
headquartered  in  New  York  City.  The  Court  sits 
in  the  former  chambers  of  the  PCIJ  in  The  Hague, 
the  Netherlands. 

These  six  organs  are  at  the  center  of  a  much 
larger  umbrella  organization.  The  “United  Nations 
System”  includes  a  full  array  of  UN  entities,  pro¬ 
grams,  funds,  specialized  agencies,  and  interna¬ 
tional  bodies  related  to  the  UN  and  scattered 
around  the  world.  Much  of  the  work  of  the  UN  is 
done  far  from  New  York  City,  in  other  headquar¬ 
ters  cities,  most  particularly  in  Geneva,  Switzerland, 
where  more  diplomatic  meetings  are  convened 
each  year  than  in  New  York  and  where  the  UN 
High  Commissioner  for  Human  Rights  and  the 
International  Labour  Organization  have  their 
offices;  in  Nairobi,  Kenya,  the  home  of  the  UN 
Environment  Programme;  and  in  Vienna,  Austria, 
the  seat  of  the  International  Atomic  Energy 
Commission. 

Some  entities  in  the  UN  system  act  indepen¬ 
dently,  but  as  a  rule,  intrasystem  communication  is 
directed  from  the  various  wings  to  the  six  principal 
organs  of  the  UN.  The  secretary-general  manages 
this  sprawling  system  by  way  of  the  Chief 
Executives  Board  for  Coordination  (CEB).  The 
CEB  is  made  up  of  the  heads  of  UN  bodies  and 
agencies.  The  specialized  agencies  on  the  CEB  are 
autonomous,  self-governing  entities  that,  nonethe¬ 
less,  fall  under  the  rubric  of  the  UN.  In  2009,  17 
specialized  agencies  were  affiliated  with  the  UN, 
including  well-known  entities  such  as  the  World 
Health  Organization,  International  Civil  Aviation 
Organization,  World  Bank  Group,  Food  and 
Agriculture  Organization,  and  Universal  Postal 
Union.  Among  the  UN  “Programmes  and  Funds,” 
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the  most  important  are  the  UN  Development 
Programme,  World  Food  Programme,  UN  Child¬ 
ren’s  Fund,  UN  Population  Fund,  UN  Environment 
Programme,  and  International  Fund  for  Agri¬ 
cultural  Development. 

The  public  “face”  of  this  worldwide  system  is 
the  secretary-general,  who  is  nominated  by  the 
Security  Council  and  elected  by  the  General  Assem¬ 
bly  to  a  5-year  renewable  term.  By  tradition,  the 
post  is  never  held  by  an  individual  from  a  major 
power.  As  of  2011,  there  have  been  eight  secretar- 
ies-general:  Trygve  Lie  (Norway,  1946-1952),  Dag 
Hammarskjold  (Sweden,  1953-1961),  U  Thant 
(Burma,  1961-1971),  Kurt  Waldheim  (Austria, 
1972-1981),  Javier  Perez  de  Cuellar  (Peru,  1982- 
1991),  Boutros  Boutros-Ghali  (Egypt,  1992-1996), 
Kofi  Annan  (Ghana,  1997-2006),  and  Ban  Ki-moon 
(South  Korea,  2007-  ).  Secretaries-general  must  be 
both  adroit  managers  and  world-class  diplomats. 
Often  during  their  tenure,  their  attempts  to  fulfill 
both  of  these  roles  have  led  them  into  conflict  with 
one  or  the  other  of  the  great  powers.  In  the  1960s, 
for  example,  the  Soviet  Union  attempted  to  replace 
Dag  Hammarskjold,  calling  him  a  puppet  of  U.S. 
policy.  In  the  1990s,  the  United  States  vetoed  a 
second  term  for  Boutros  Boutros-Ghali,  largely  for 
being  ineffective  in  carrying  out  administrative  and 
budgetary  reform  in  the  UN. 

The  United  Nations  and  the  Cold  War 

In  the  late  1940s,  the  expected  cooperation  among 
the  world’s  great  powers  essential  to  UN  success 
dissolved  into  a  Cold  War  contest  between  the 
United  States  and  the  Soviet  Union,  each  with  their 
respective  allies.  Superpower  competition  affected 
the  working  of  the  UN  Security  Council  most 
intensely  because  the  veto  could  deadlock  any  con¬ 
templated  UN  action.  Early  confrontations  in  Iran, 
Greece,  and  Turkey  highlighted  the  ineffectiveness 
of  the  Council  when  the  national  security  interests 
of  the  United  States  and  the  former  USSR  con¬ 
flicted.  They  also  foreshadowed  Council  inaction 
in  later  conflicts  in  Vietnam  (1947-1974),  Hungary 
(1956),  and  Czechoslovakia  (1968).  Only  in  Korea 
in  1951  was  the  Security  Council  able  to  invoke  its 
powers  under  the  Charter’s  collective  security  pro¬ 
visions  to  declare  the  North  Korean  government 
the  aggressor  in  the  conflict  and  to  order  member 
nations  to  take  all  necessary  military  measures  to 


repulse  the  attack.  This  was  only  possible,  how¬ 
ever,  because  the  Soviet  Union  was  boycotting 
Council  meetings  at  the  time  to  protest  Western 
unwillingness  to  seat  the  new  government  of  Mao 
Zedong  as  the  legitimate  representative  of  China  in 
the  body. 

When  the  Soviet  delegate  returned  to  the 
Security  Council,  the  United  States  circumvented 
the  Council  by  pushing  through  the  General 
Assembly  the  “Uniting  for  Peace  Resolution,” 
which  allowed  the  Assembly  to  make  recommen¬ 
dations  on  the  restoration  of  peace  and  security 
when  the  Council  was  deadlocked  by  the  veto.  The 
Soviet  government  protested  the  maneuver  as  an 
unconstitutional  revision  of  the  Charter.  Korea 
was  a  prime  early  indication  of  the  Cold  War’s 
impact  on  the  UN.  Fighting  ended  with  an  armi¬ 
stice  in  1953,  but  the  country  remained  into  the 
21st  century,  long  after  the  end  of  the  Cold  War, 
the  last  divided  country  dating  from  the  end  of 
World  War  II. 

The  Congo  crisis  of  the  early  1960s  further 
underscored  the  Cold  War  challenge  to  the  UN. 
When  the  Congo  became  independent  from 
Belgium  in  1960,  a  complicated  civil  war  broke 
out,  with  one  side  being  supported  by  the  Soviet 
Union,  one  side  by  Washington,  and  a  third  side 
trying  to  secede.  Secretary-General  Dag  Ham¬ 
marskjold  tried  to  insert  a  UN  presence  to  bring 
the  disorder  to  an  end.  Believing  Hammarskjold  to 
be  carrying  out  the  wishes  of  the  United  States,  the 
Soviet  Union  demanded  a  reorganization  of  the 
office  of  secretary-general,  replacing  the  single  sec¬ 
retary  with  a  “Troika,”  whereby  there  would  be  a 
three-person  executive  with  equal  representation 
from  the  Western  bloc,  the  Eastern  bloc,  and  the 
neutral  countries  in  the  UN.  The  Soviets,  along 
with  the  French,  also  refused  to  donate  their 
assessment  for  the  Congo  operation,  claiming  it  to 
be  illegal  since  it  had  not  been  approved  by  the 
Security  Council.  Their  refusal  to  pay  contributed 
to  a  serious  financial  crisis  for  the  UN  that  would 
plague  the  organization  for  the  rest  of  the  century. 

Because  of  the  deadlock  in  the  Security  Council, 
the  locus  of  UN  activity  shifted  to  the  General 
Assembly,  where,  after  1960,  states  from  the 
developing  world  gained  a  working  majority.  As  a 
result,  UN  attention  shifted  to  the  problems  of  the 
so-called  Third  World.  Economic  development  and 
the  amelioration  of  social  dislocation  in  postcolonial 
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Figure  I  The  United  Nations  System 

Source:  UN  Department  of  Public  Information  (2009).  Used  by  permission. 
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regions  became  primary  concerns  of  the  member¬ 
ship.  In  1965,  the  Assembly  established  the  UN 
Development  Programme  to  address  the  extraor¬ 
dinary  challenges  of  poverty,  underdevelopment, 
and  the  humanitarian  crisis  in  Africa,  Asia,  and 
Latin  America. 

Of  course,  General  Assembly  decisions  imposed 
no  mandatory  requirements  on  the  member  states. 
Consequently,  the  Assembly  depended  on  moving 
world  public  opinion  and  putting  in  place  new 
programs  that  could  command  near-universal  sup¬ 
port  for  their  success.  In  the  context  of  the  Cold 
War,  where  the  Soviet/Third  World  bloc,  after 
1970,  had  a  working  majority  in  the  Assembly, 
finding  such  a  consensus  proved  nearly  impossible. 
UN  debate  became  increasingly  acrimonious,  and 
UN  decisions  proved  to  be  less  and  less  command¬ 
ing  of  world  attention. 

Post-Cold  War  Cooperation 
and  Reform  at  the  UN 

The  1980s  witnessed  the  first  glimmer  of  a  new 
and  expanded  role  for  a  world  body  such  as  the 
UN.  The  rise  of  Mikhail  Gorbachev  to  power  in 
the  Soviet  Union  in  1985  and  the  Kremlin’s  rapid 
fashioning  of  a  new,  less  antagonistic  foreign 
policy  opened  the  door  to  possible  U.S.-USSR 
cooperation  under  UN  auspices  to  address  global 
conflicts.  By  the  time  of  the  Iraqi  invasion  of 
Kuwait  in  1990  and  the  Gulf  War  in  1991, 
Gorbachev’s  anticipation  of  an  effective  UN 
seemed  prescient,  as  Moscow  and  Washington 
cooperated  within  the  Security  Council  to  declare 
Iraq  the  aggressor  and  to  authorize  military  action 
under  Chapter  VII  of  the  Charter. 

Success  in  the  Gulf  War  translated  to  a  new 
optimism  that  the  UN  might  be  an  effective  instru¬ 
ment  for  resolving  global  problems.  After  1985, 
UN  peacekeeping  operations  evolved  into  nation¬ 
building  experiments,  with  some  success  in 
Namibia,  Cambodia,  and  East  Timor.  More  prob¬ 
lematic  were  UN  efforts  in  Somalia,  Bosnia,  and 
Kosovo.  The  UN,  reinforced  by  a  newly  engaged 
Security  Council,  took  on  a  plethora  of  global 
challenges:  terrorism  in  the  wake  of  the  September 
11,  2001,  attacks  in  the  United  States;  HIV/AIDS, 
particularly  in  Africa;  pandemic  disease  in  Asia; 
climate  change;  human  rights  violations,  which  it 
met  not  only  with  sanctions  against  governments 


complicit  in  human  rights  abuses  but  also  by  its 
creation  of  the  International  Criminal  Court;  and 
the  endemic  economic  and  social  problems  of  the 
developing  world,  which  it  attempted  to  meet  by 
the  implementation  of  a  set  of  Millennium 
Development  Goals,  established  in  2000. 

Under  strong  pressure  from  the  United  States, 
the  UN  also  carried  out  a  significant  reform  pro¬ 
gram.  Particularly  under  the  leadership  of 
Secretary-General  Kofi  Annan,  the  UN  apparatus 
streamlined  its  budgetary  and  administrative  pro¬ 
cesses,  cutting  the  staff  by  a  quarter  and  operating 
with  a  zero-growth  budget  for  nearly  5  years.  After 
2005,  UN  members  also  took  up  structural  reform. 
They  replaced  the  much  maligned  Commission  on 
Human  Rights  with  a  new  Human  Rights  Council 
and  established  a  Peacebuilding  Commission  to 
manage  assistance  to  postconflict  states  and 
regions.  Eased  regulations  opened  many  of  the  UN 
decision-making  bodies  to  influence  from  nongov¬ 
ernmental  organizations  and  private  actors  such  as 
corporations,  social  groups,  and  individuals.  There 
were  also  lengthy  discussions  and  several  propos¬ 
als  to  expand  the  Security  Council  to  include  states 
representative  of  new  centers  of  power  in  world 
politics.  While  several  permanent  members  of  the 
Council  blocked  these  efforts  for  the  time  being, 
there  was  a  general  recognition  that  structural 
change  at  the  UN  would  need  to  come  sooner  or 
later  to  reflect  the  changed  realities  in  the  interna¬ 
tional  system.  As  the  millennium  began,  there 
appeared  to  be  an  emergent  “new”  UN,  still  reflec¬ 
tive  of  the  basic  Charter  provisions  established  in 
1945  but  prepared  to  respond  to  the  changed  con¬ 
ditions  of  the  21st  century. 
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Utilitarianism 


Throughout  the  history  of  moral  and  political  phi¬ 
losophy,  the  idea  of  welfare  as  a  moral  good  has 
repeatedly  appeared.  Utilitarians  distinctively  take 
welfare  consequences  to  be  the  central  good  and 
the  principle  by  which  to  judge  the  morality  of 
individuals’  actions.  It  would  be  odd,  even  per¬ 
verse,  to  suppose  that  political  theory  should  not 
attend  to  the  consequences  of  actions  or  policies, 
because  consequences  are  the  very  point  of  impor¬ 
tant  policies.  Hence,  moral  and  political  philoso¬ 
phy  is  a  joint  program  in  the  view  of  utilitarians. 
Indeed,  Jeremy  Bentham’s  most  cited  statement  on 
utilitarianism  is  in  his  1789  book  An  Introduction 
to  the  Principles  of  Morals  and  Legislation.  There¬ 
after,  utilitarianism  was  more  or  less  a  dominant 


moral  theory  into  the  beginning  of  the  20th  century 
with  Henry  Sidgwick’s  The  Methods  of  Ethics 
(1907)  in  philosophy  and  down  to  this  day  in  the 
work  of  many  economists.  Herbert  Hart 
(1958/1983)  states  that  it  dominated  legal  theory 
from  Bentham  through  roughly  the  1960s,  when 
somewhat  disorganized  and  often  contradictory 
theories  of  rights  rose  to  a  brief  heyday.  Since  then, 
a  hundred  flowers  and  not  a  few  weeds  have 
grown.  This  entry  examines  both  act  utilitarian¬ 
ism,  which  focuses  on  the  consequences  of  indi¬ 
vidual  actions,  and  rule  utilitarianism,  which  states 
that  we  should  act  in  accordance  with  the  rules 
that  achieve  the  greatest  happiness  for  the  greatest 
number.  It  also  considers  the  role  that  institutional 
rules  play  in  utilitarian  assessment  of  public  policy 
and  examines  the  utilitarian  perspective  on  dis¬ 
tributive  justice. 

During  all  of  this  development,  utilitarianism 
was  often  taken  in  new  directions.  For  much  of  its 
first  century,  it  was  grounded  in  the  value  theory 
of  interpersonally  comparable  and  additive  utility 
theory.  This  was  cardinal  rather  than  ordinal  util¬ 
ity,  roughly  in  keeping  with  views  in  economics, 
most  especially  in  the  utilitarian  discussion  of 
Francis  Edgeworth.  If  utility  is  interpersonally 
comparable,  10  utiles  for  you  is  equal  to  10  utiles 
for  me  in  our  overall  evaluation  of  our  society.  If 
it  is  additive,  your  utiles  added  to  mine  sum  to  20 
utiles. 

Paul  Samuelson  notes  that  many  issues  in  eco¬ 
nomics  can  be  well  understood  only  if  we  treat 
utility  as  not  simply  additive  but  as  ordinal.  You 
can  prefer  A  to  B,  but  you  cannot  say  by  how 
much  you  prefer  it.  It  follows  that  your  utility  from 
a  policy  cannot  simply  be  added  to  mine  to  yield 
an  overall  value  of  the  policy.  Although  this  view 
has  been  criticized  because  it  is  not  part  of 
Bentham’s  statement  of  utilitarianism,  such  critics 
incorrectly  assume  that  utilitarianism  is  all  and 
only  what  Bentham  states.  In  fact,  utilitarian  the¬ 
ory  develops  in  tandem  with  the  value  theory  of 
utility  in  other  fields,  particularly,  of  course,  in 
economics  but  also  in  psychology  and  in  recent 
experimental  games.  More  typically,  John  Rawls 
(1999)  compares  his  own  theory  of  justice  with 
utilitarianism.  But  after  smartly  canvassing  David 
Hume’s  value  theory  and  implicit  utilitarianism, 
which  he  seems  to  think  are  more  credible  than 
Bentham’s,  Rawls  then  argues  for  the  superiority 
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of  his  own  theory  over  utilitarianism  as  though  the 
only  definitive  utilitarian  vision  is  that  of  Bentham 
(1789/1970). 

An  obvious  problem  with  ordinal  utilitarianism 
is  that  it  is  much  harder  to  do  aggregations.  We 
cannot  simply  add  everything  up  to  determine  how 
great  a  particular  community’s  welfare  is.  It  is 
important  to  note  that  this  difficulty  is  not  an  arti¬ 
fact  of  the  theory;  it  is  inherent  in  the  world  to 
which  the  theory  is  applied.  On  our  best  current 
understandings  of  economic  value  theory,  the  ordi¬ 
nal,  noninterpersonally  comparable  value  theory  is 
descriptively  correct.  It  is  the  underlying  reality 
that  is  difficult.  The  developments  in  utility  theory 
and  therefore  in  utilitarianism  are  clearly  driven  by 
the  wish  to  achieve  realism. 

Rawls  gives  himself  the  trivially  easy  task  of 
basically  attacking  act  utilitarianism,  which  should 
be  distinguished  from  rule  utilitarianism  and  which 
focuses  our  concern  narrowly  on  individuals’ 
actions,  many  of  which  cannot  be  motivated  by 
interest  but  only,  if  it  is  available,  by  beneficence. 
Indeed,  no  standard  classical  moral  theory  for 
individuals  can  be  the  basis  for  a  distributive  the¬ 
ory.  No  serious  utilitarians  would  disagree  with 
this  point,  nor  would  they  think  it  relevant  to  start 
from  an  account  of  individual  action  to  reach  dis¬ 
tributive  judgments.  To  dismiss  utilitarianism  in  its 
most  primitive  Benthamite  version  in  order  to 
reach  conclusions  about  distributive  justice  is  irrel¬ 
evant  both  for  any  theory  of  distributive  justice 
and  for  aptly  sophisticated  utilitarianism. 

Hume’s  general  concern  is  the  naturalist  analy¬ 
sis  of  our  motivations  and  actions.  He  virtually 
dismisses  act  utilitarianism — before  the  term 
existed — because  it  requires  too  much  beneficence, 
which  Hume  thinks  is  in  too  short  supply  to  have 
much  impact  at  the  societal  level.  He  thinks  that 
the  supposition  that  we  can  all  be  motivated  by 
beneficence  violates  human  nature.  Our  concern 
with  ourselves  largely  trumps  our  beneficence. 

Any  moral  theory  that  cannot  address  institu¬ 
tional  issues  may  be  reduced  to  a  sideline  game 
with  trumped-up  examples.  A  useful  theory  must 
be  a  joint  moral  and  political  theory  and  must 
address  real  issues  in  the  world  in  which  we  live. 
Combining  moral  and  political  theory  is  an  easy 
challenge  for  utilitarianism,  which  from  Bentham 
forward,  and  arguably  even  from  Hume  forward, 
is  inherently  a  joint  theory.  We  can  label  the 


broader  concern  as  institutional  utilitarianism. 
Such  utilitarianism  addresses  many  of  the  prob¬ 
lematic  issues  that  have  arisen  since  Bentham. 
Beginning  arguably  with  John  Stuart  Mill,  theo¬ 
rists  have  tried  to  accommodate  the  theory  to 
institutions  and  norms  with  a  theory  of  rule  utili¬ 
tarianism.  This  idea  solves  the  seeming  problems 
of  calculating  which  of  the  available  actions  would 
contribute  the  most  to  overall  welfare.  Unfortu¬ 
nately,  the  difficulties  in  determining  what  rules 
we  should  follow  might  commonly  be  as  great  as 
those  in  directly  calculating  what  action  to  take.  This 
difficulty  has  generally  been  resolved  in  practice — 
typically  without  any  reference  to  utilitarianism — 
by  creating  institutions  that  guide  us  or  even  control 
us  in  following  institutional  rules.  Daily  life  is 
full  of  such  institutional  rules.  Many  of  these 
merely  coordinate  us  on  some  good  action,  as 
argued  by  Hume  for  a  remarkable  array  of  social 
contexts.  It  could  be  true  that  many  people  see  the 
usefulness — one  of  Hume’s  favorite  terms — of 
various  institutional  arrangements,  but  if  they  fol¬ 
low  the  rules  of  the  institutions,  that  will  be 
because  the  institutions  can  give  them  incentives  to 
do  so,  not  because  they  see  and  are  motivated  by 
the  good  of  having  people  comply  in  general. 

Finally,  we  take  note  of  the  intellectual  recep¬ 
tion  of  utilitarianism.  Far  more  than  any  other 
moral  theory,  utilitarianism  is  invoked  explicitly 
or  implicitly  by  social  scientists.  This  fact  does  not 
make  the  theory  morally  correct.  Most  social  sci¬ 
entists  are  essentially  naturalists.  With  Hume,  they 
seek  to  explain  people’s  moral  views,  their  morally 
motivated  actions,  and  their  moral  failings,  as 
when  their  interests  trump  their  moral  visions. 
Utilitarians  and  social  scientists  share  a  central 
concern  with  institutions  and  their  workings. 
Institutions  have  a  minor  secondary  role  in  most  of 
moral  theory  and  no  role  at  all  in  some  of  it. 

Competing  moral  visions,  such  as  Kantianism 
and  virtue  theory,  are  taken  far  less  seriously  by 
nonphilosophers.  Strangely,  its  critics  arguably 
have  spent  more  time  on  utilitarianism  than  its 
advocates  have.  Many  of  the  criticisms  wrongly 
claim  that  utilitarianism  is  self-contradictory.  For 
example,  critics  argue  that  utilitarians  must  spend 
so  much  time  calculating  that  they  have  no  time 
left  for  living,  so  that  utilitarianism  is  crippling  in 
practice.  This  criticism  was  demolished  with  ridi¬ 
cule  by  Mill  nearly  2  centuries  ago,  and  yet  it  lives 
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on.  We  have  institutions  to  handle  many  compli¬ 
cated  issues.  Often,  we  can  rely  on  the  past  experi¬ 
ences  of  millions  of  others  who  have  gone  this  way 
before.  Individuals  are  therefore  not  left  in  the 
lurch  when  they  face  an  important  decision  for 
action.  They  can  act  and  then  get  on  with  living. 

It  is  interesting  that  Rawls  (1955)  has  given 
utilitarian  theory  one  of  the  best  defenses  against  a 
pervasive  but  misguided  attack.  Not  surprisingly, 
it  is  in  the  realm  of  justice  under  the  law.  To  the 
commonplace  objection  that  a  utilitarian  sheriff 
must  violate  the  law  and  allow  the  lynching  of  an 
innocent  prisoner  to  quell  the  impending  violence 
of  a  mob,  Rawls  argues  that  this  is  a  specifically 
institutional  problem  for  which  utilitarianism  has 
adequate  resources.  Implicitly,  he  argues  for  insti¬ 
tutional  utilitarianism.  Unfortunately,  he  forgets 
that  vision  in  his  later  theory  of  justice. 

The  main  criticism  is  that  the  focus  on  utility 
or  welfare  and  therefore  on  consequentialism  is 
misguided — that  is,  a  moral  theory  should  focus 
instead  on  evaluating  actions  and  not  their  out¬ 
comes,  as  pointed  out  by  Gertrude  Anscombe.  This 
is  to  say  that  utilitarianism  is  thoroughly  mis¬ 
guided.  There  is  no  credibly  insightful  way  to 
respond  to  such  an  attack.  What  theory  of  distribu¬ 
tive  justice,  for  example,  could  possibly  make  com¬ 
pelling  sense  without  attention  to  the  consequences? 
The  disagreement  here  has  come  to  have  an  almost 
ideological  tone.  It  is  pointless  to  adjudicate  this 
difference.  It  is  far  more  compelling  to  focus  on  the 
structures  of  utilitarianism  and  competing  theories 
rather  than  on  their  whole-cloth  justification  or 
rejection. 
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Utopianism 


Utopianism  is  the  systematic  use  of  utopias — that 
is,  forms  of  thinking  that  depict  a  desirable  but 
impractical  state  of  things.  As  with  all  other  isms, 
the  concept  may  also  have  a  pejorative  connota¬ 
tion  but  not  necessarily  so. 

The  term  utopia  derives  from  the  eponymous 
book  (1516)  by  Thomas  More,  who  coined  it  to 
designate  the  imaginary  ideal  society  that  he 
described  in  the  second  part  of  his  book.  More 
intentionally,  it  played  with  the  ambivalence  of  the 
term,  since  u-topia,  from  ancient  Greek,  can  be  a 
contraction  of  both  eu-topos  (the  “good-place”) 
and  ou-topos  (the  “no-place”).  This,  as  we  shall 
see,  would  have  enduring  consequences.  With 
time,  the  term  came  generally  to  mean  all  ideas  or 
proposals  that  are  good  but  unrealistic  or  even 
impossible.  As  Karl  Mannheim  famously  put  it,  a 
state  of  mind  is  utopian  when  it  is  incongruous 
with  the  reality  within  which  it  occurs:  Utopias 
break  the  bonds  of  existing  social  order.  This  entry 
discusses  the  concept  of  utopianism  by  recon¬ 
structing  its  forms,  its  functions,  and,  finally,  its 
prospects  in  the  contemporary  world. 
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Forms  of  Utopianism: 

“Good-Place”  or  “No-Place”? 

Generally  speaking,  we  can  distinguish  between 
two  forms  of  utopianism.  The  first  is  the  literary 
genre  that  followed  the  example  of  More’s  Utopia. 
Tommaso  Campanella’s  City  of  the  Sun  (1602), 
Francis  Bacon’s  New  Atlantis  (1627),  and  William 
Morris’s  News  From  Nowhere  (1891)  are  all 
examples  of  such  a  genre.  The  description  of  the 
good-place  often  takes  the  form  of  a  narrative  by 
a  traveler  who  discovers  the  land  of  utopia,  typi¬ 
cally  an  island  or  at  least  a  territory  separated  form 
the  others.  Soem  works  in  this  genre  are  negative 
utopias,  such  as  Orwell’s  1984  (1949),  which 
describes  the  dreadful  dream  of  a  disciplinary  and 
totalitarian  society  ruled  by  Big  Brother.  These 
works  are  also  at  times  called  “dystopias” — from 
the  Greek  dys,  which  means  abnormal,  faulty,  or 
bad. 

The  second  form  of  utopianism  is  that  of  works 
that  are  not  part  of  the  literary  utopian  genre  but 
nevertheless  enclose  significant  utopian  elements. 
Political  treatises  such  as  Jean-Jacques  Rousseau’s 
The  Social  Contract  (1762)  or  Immanuel  Kant’s 
Perpetual  Peace  (1795)  contain  important  utopian 
moments  in  that  they  depict  societies  that  are 
good-places  but  also  no-places.  The  degree  to 
which  they  are  realizable  is  still  a  matter  of  contro¬ 
versy:  Rousseau  portrayed  a  direct  democracy 
where  human  beings  stand  as  free  and  equal, 
whereas  Kant’s  Perpetual  Peace  defined  the  articles 
of  the  hypothetical  international  treaty  that  would 
put  an  end  to  the  international  condition  of  anar¬ 
chy  and  war.  In  synthesis,  a  work  contains  utopian 
elements  when  it  expresses  the  belief  that  some  (or 
all)  social  evils  can  be  eliminated  and  a  better  soci¬ 
ety  created. 

While  in  the  case  of  the  literary  genre  it  is  rela¬ 
tively  easy  to  determine  whether  a  work  is  part  of 
it  or  not,  the  degree  of  utopianism  of  works  that 
contain  utopian  moments  is  highly  controversial. 
Things  are  further  complicated  by  the  fact  that  the 
concept  of  utopianism  has  at  times  a  pejorative 
connotation.  A  significant  example  is  Karl  Marx 
and  Friedrich  Engels’s  Communist  Manifesto 
(1848).  On  the  one  hand,  perhaps,  no  other 
authors  have  been  as  influential  in  spreading  the 
belief  that  social  evils  can  be  eradicated  and  a 
good-place  developed.  On  the  other  hand,  Marx 
and  Engels’s  prospect  of  a  communist  society  does 


not  derive  from  imagination  alone  but  is  grounded 
on  a  scientific  analysis  of  the  historical  conditions 
of  the  proletariat.  In  their  Manifesto,  they  strongly 
criticized  former  socialists  such  as  Comte  de  Saint- 
Simon,  Charles  Fourier,  and  Robert  Owen  for  their 
utopianism — used  here  in  the  pejorative  sense. 
Having  theorized  about  the  emancipation  of  the 
proletariat  in  an  epoch  when  the  material  condi¬ 
tions  for  emancipation  were  not  yet  ripe,  utopian 
socialists  necessarily  failed  in  identifying  the  eco¬ 
nomic  presuppositions  for  such  a  transformation, 
so  that  their  social  criticism  remained  a  merely 
fantastic  picture  of  an  impossible  future  society.  To 
such  utopian  socialism,  in  which  personal  inven¬ 
tiveness  takes  the  place  of  historical  action,  Marx 
and  Engels  opposed  their  scientific  communism. 

Some  authors  have  criticized  utopianism  by 
arguing  that  by  depicting  inexistent  perfect  societ¬ 
ies,  it  can  generate  authoritarian  and  totalitarian 
attempts  to  endorse  them.  According  to  Karl 
Popper,  for  instance,  utopias  such  as  Plato’s  ideal¬ 
ized  republic  endorse  visions  of  a  “closed”  society 
that  anticipate  those  of  modern  thinkers  such  as 
Hegel  and  Marx  and  must  therefore  be  repudiated. 
Utopias  can,  however,  be  defended  in  many  ways 
from  their  critics.  First,  one  can  argue  that  utopias 
are  by  definition  no-places.  Indeed,  most  utopian 
thinkers  did  not  even  think  of  trying  to  enforce 
their  schemes  of  perfect  society.  Even  those  who 
actually  tried  to  transform  their  no-places  into 
something  real  most  often  favored  means  such  as 
education  and  small-scale  experiments.  Those  who 
endorsed  the  possibility  of  a  violent  revolution 
remain  a  minority. 

Second,  the  accusation  of  totalitarianism  holds 
at  best  only  for  major  utopias.  The  distinction 
between  major  and  minor  utopias  is  another  help¬ 
ful  distinction  to  group  the  different  forms  that 
utopianism  can  take.  As  Jay  Winter  observed, 
major  Utopians  are  those,  such  as  Hitler  and  Stalin, 
who  radically  aimed  at  extirpating  all  social  evils 
from  the  world  and  resorted  to  unconditional  vio¬ 
lence  to  realize  their  projects.  Their  totalitarian 
visions  and  their  commitment  to  ruthless  removal 
from  the  world  of  those  malevolent  elements 
blocking  the  path  to  a  beneficent  future,  even  at 
the  price  of  extermination,  render  them  major  Uto¬ 
pians.  Minor  utopianism  is  generated  by  imagin¬ 
ings  of  liberation  on  a  usually  smaller  scale,  which 
also  sketch  out  a  world  very  different  from  the  one 
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we  live  in  but  from  which  not  all  social  conflicts 
and  evils  are  eradicated.  Examples  of  such  minor 
Utopians  are  the  authors  of  the  Universal  Declar¬ 
ation  of  Human  Rights  (1948)  or  the  ideologists  of 
ecological  movements. 

The  Functions  of  Utopianism 

Against  the  critics  of  utopianism,  it  can  also  be 
observed  that  the  latter  performs  very  important 
functions  within  society.  In  the  first  place,  utopias 
are  the  means  to  exercising  a  critique  of  the  pres¬ 
ent.  By  depicting  “good-places,”  they  tell  us  where 
we  are  and  therefore  also  how  far  away  we  are 
from  that  ideal.  It  comes  therefore  as  no  surprise 
to  find  out  that  the  first  part  of  More’s  Utopia  was 
devoted  to  the  analysis  of  the  evils  affecting  the 
society  More  lived  in.  Even  when  Utopians  do  not 
explicitly  go  through  the  detailed  description  of 
the  evils  of  society,  a  criticism  is  implicit  in  the 
radical  alternative  to  the  existing  order  that  they 
disclose.  To  put  it  in  a  nutshell,  utopias  transcend 
reality  but  do  not  depart  from  it. 

Utopianism  has  therefore  an  important  regula¬ 
tive  function.  By  “regulative,”  we  mean  the  capac¬ 
ity  of  an  idea  to  serve  as  a  guiding  ideal  for  human 
conduct  independent  of  its  content  of  reality. 
Utopias  in  this  sense  are  not  necessarily  blueprints 
for  the  creation  of  a  radically  new  society.  They 
are  the  means  to  measure  the  good  and  the  bad 
contained  in  each  society.  The  presupposition  of 
such  a  critique  is  the  conviction  that  the  current 
state  of  affairs  is  modifiable.  This  can  be  the  result 
of  a  belief  in  the  perfectibility  of  human  nature  or 
of  the  social  world. 

Finally,  we  should  observe  that  utopias  also 
help  develop  new  ideals.  Utopias  not  only  tell  us 
where  we  stand,  they  also  uncover  the  new  direc¬ 
tions  that  we  may  take.  They  are  the  result  of  the 
work  of  the  imagination  that  discloses  alternative 
scenarios  and,  therefore,  enrich  the  sense  of  human 
possibility.  As  Victor  Hugo  once  said,  today’s  uto¬ 
pia  is  tomorrow’s  reality.  The  aphorism  is  usually 
understood  in  the  sense  that  utopias  can  be  real¬ 
ized,  at  least  to  a  certain  extent.  But  we  could  also 
reverse  it  by  saying  that  tomorrow’s  realities  are 
today’s  utopias.  Indeed,  modern  conquests  such  as 
the  abolition  of  slavery  or  universal  suffrage  have 
all  been  utopias  once.  In  a  way,  all  ideas  that  con¬ 
tribute  to  social  progress  have  to  contain  a  certain 


degree  of  utopianism  insofar  as  what  they  portray 
is  a  no-place. 

The  Prospects  of  Utopianism  Today 

We  have  seen  that  utopianism  unfolds  itself  in  the 
disclosure  of  “good-places”  that  are  also  “no¬ 
places.”  This  also  means  that  utopianism  breaks 
away  from  the  present  but  only  to  criticize  it.  One 
could  even  argue  that  utopianism  is  realistic  in  the 
sense  that  it  tells  us  where  we  do  not  stand.  Of 
course,  utopias  can  be  more  or  less  realistic  in  this 
sense.  If  the  literary  utopias  of  the  Golden  Age  or 
the  Paradise  Lost  after  the  original  sin  have  very 
little  grasp  on  contemporary  reality,  the  most  sig¬ 
nificant  utopias  of  modernity  have  all  contributed 
to  the  critique  of  existing  societies  and  the  disclos¬ 
ing  of  new  directions  for  their  development. 
Utopias  are  often  reality  whose  time  is  not  yet  ripe. 
This  clearly  emerges  if  we  think  of  major  social 
conquests  such  as  universal  suffrage  or  the  aboli¬ 
tion  of  apartheid,  which  used  to  appear  as  impos¬ 
sible,  utopian  goals.  The  societies  that  have  been 
created  after  their  achievement  are  not  perfect  but 
certainly  better  ones. 

Although  historically  less  significant,  we  should 
also  mention  the  attempts  to  realize  integral  uto¬ 
pias.  Even  if  this  remains  a  small  phenomenon, 
there  is  a  type  of  utopianism  that  has  inspired  the 
design  of  integrally  utopian  experimental  commu¬ 
nities.  These  living  utopias  range  from  religious 
communities  such  as  the  Oneida  community  in 
the  United  States,  first  formally  established  in 
1848,  to  the  socialist  phalanxes  inspired  by 
Charles  Fourier  that  were  created  in  France  and 
the  United  States  in  the  19th  century,  to  the  kib¬ 
butz,  a  form  of  Jewish  settlement  that  began  early 
in  the  20th  century. 

Despite  the  fact  that  utopianism  is  far  from  van¬ 
ishing,  many  authors  have  claimed  that  we  live  in 
an  epoch  of  the  death  of  utopia.  For  instance,  they 
observe  that  the  utopian  genre  is  increasingly  lim¬ 
ited  to  literature  and  fiction.  This  is  partly  because 
the  high  degree  of  specialization  in  the  social  sci¬ 
ences  has  rendered  global  utopias  unacceptable  to 
the  academic  community.  But  according  to  some 
interpreters,  behind  the  alleged  death  of  utopias, 
there  are  more  structural  reasons.  Herbert  Marcuse, 
for  instance,  observed  that  the  concept  of  utopia 
becomes  obsolescent  in  the  contemporary  world 
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because  today  any  transformation  of  the  technical 
and  natural  environment  is  a  real  possibility.  On 
the  opposite  front  are  those  who  consider  utopia 
dead  because  they  see  no  possibility  of  envisaging 
an  alternative  to  the  status  quo.  In  an  epoch  when 
many  celebrate  the  end  of  history,  there  seems  to 
be  no  space  in  terms  of  time,  and  also  no  need,  to 
embark  on  a  journey  to  the  land  of  utopia.  Yet  the 
societies  we  live  in  are  far  from  perfect,  and  this 
ultimately  guarantees  the  possibility  of  and  also 
the  persisting  need  for  utopias. 

Chiara  Bottici 
The  New  School  for  Social  Research 
New  York  City,  New  York,  United  States 
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Values 


Several  definitions  of  values  have  been  influential 
in  the  social  sciences.  For  the  anthropologist  Clyde 
Kluckhohn,  a  value  is  a  conception  of  the  desirable 
that  influences  the  selection  of  available  modes, 
means,  and  ends  of  action.  Central  to  this  defini¬ 
tion  is  the  notion  of  “a  conception  of  the  desir¬ 
able.  ”  A  desire  is  a  wish  or  a  preference,  while  the 
term  desirable  goes  beyond  a  wish  or  a  want  by 
bringing  in  considerations  of  moral  content. 

For  the  psychologist  Milton  Rokeach,  a  value  is 
an  enduring  belief  that  a  specific  mode  of  conduct 
or  end  state  of  existence  is  preferable  to  opposite 
or  converse  modes  of  conduct  or  end  states  of  exis¬ 
tence.  Rokeach’s  definition  includes  several  ele¬ 
ments  that  can  be  used  as  a  point  of  departure  for 
discussing  several  dimensions  of  the  value  concept. 
Rokeach  indicated  that  there  are  two  types  of  val¬ 
ues:  (1)  consummatory  (end  state  of  existence)  and 
(2)  instrumental  (mode  of  conduct)  values.  Rokeach 
also  differentiated  between  personal  and  social 
values.  People  have  values  that  they  want  to 
emphasize  in  their  own  lives  (self-centered)  and 
also  those  that  they  would  emphasize  in  their 


social  environment  (society  centered).  This  differ¬ 
entiation  can  be  expanded  to  different  domains, 
and  one  can  talk  about,  for  example,  family  val¬ 
ues,  work  values,  bureaucratic  values,  and  political 
values. 

For  Rokeach,  a  value  is  a  basic  and  relatively 
stable  element  in  a  person’s  belief  system.  A  value 
is  a  prescriptive  belief  wherein  some  means  or  end 
of  action  is  judged  to  be  desirable  or  undesirable. 
Values  are  sometimes  contrasted  with  attitudes, 
which  are  often  defined  as  a  set  of  beliefs  orga¬ 
nized  around  a  specific  object  or  situation.  A  value 
is  considered  to  be  a  basic  (prescriptive)  belief  that 
often  influences  a  specific  attitude  together  with 
other  beliefs.  Beliefs,  attitudes,  and  values  can  be 
conceived  to  lie  on  a  center-periphery  dimension 
where  values  are  the  most  central,  intensive,  and 
enduring  and  beliefs  are  the  most  peripheral. 

Building  on  Rokeach  and  others,  Shalom 
Schwartz  identifies  six  formal  characteristics  that 
are  the  defining  features  of  basic  human  values: 
(1)  Values  are  beliefs  (2)  about  desirable  end-states 
or  behaviors  (modes  of  conduct)  that  (3)  transcend 
specific  situations  or  actions,  (4)  guide  selection  or 
evaluation  of  behavior  and  events,  and  (5)  are 
ordered  by  relative  importance  to  form  a  value 
system.  (6)  The  relative  importance  of  values 
guides  attitudes  and  behavior. 

Other  researchers  such  as  Jan  van  Deth  and 
Elinor  Scarbrough  consider  the  relationship 
between  values  and  attitudes  as  a  reciprocal  one 
that,  at  the  individual  level,  provides  opportunities 
for  the  modification  and  adaptation  of  values. 
These  scholars  use  the  notion  “value  orientation” 
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for  constellations  of  attitudes  that  can  be  patterned 
empirically  and  interpreted  theoretically  in  a  mean¬ 
ingful  way.  This  implies  that  value  orientations 
can  be  studied  with  data  that  include  indicators  of 
attitudes. 

Culture  can  be  considered  as  the  rich  complex  of 
meanings,  beliefs,  symbols,  norms,  and  values 
prevalent  among  people  in  a  society.  Cultural  differ¬ 
ences  can  be  studied  along  many  dimensions.  Given 
that  values  are  the  central  elements  in  individuals’ 
belief  systems,  the  values  that  are  emphasized  in  a 
society  are  a  very  central  feature  of  culture.  A  major 
conceptual  advantage  of  an  approach  in  which 
values  are  central  is  that  values  can  be  considered  as 
important  intermediate  variables  between  several 
independent  background  variables  and  various 
variables  that  can  be  considered  to  be  consequences 
of  values. 

Sociostructural,  personal  (including  personality 
traits),  and  various  institutional  variables  can  be 
considered  independent  variables  that  can  explain 
variations  in  value  priorities.  Values  are  consid¬ 
ered  determinants  of  virtually  all  kinds  of  social 
behavior:  evaluations  and  attitudes,  moral  judg¬ 
ments,  and  justifications  of  self  and  others.  Values 
are  also  determinants  of  political  behavior,  for 
example,  various  types  of  political  participation 
and  voting  behavior. 

Given  the  central  intermediate  role  values  have 
in  the  causal  chain,  from  background  variables  to 
behavior,  values  often  play  a  central  role  in 
explaining  the  impact  of  background  variables  on 
behavior.  Some  general  value  dimensions  have 
received  considerable  attention.  First,  these  are 
presented.  Then,  the  focus  is  on  political  value 
dimensions,  political  value  change  and  compara¬ 
tive  patterns  of  value  priorities,  and  political  val¬ 
ues’  antecedents  and  consequences. 

General  Value  Dimensions 

The  Dutch  social  scientist  Geert  Hofstede’s  studies 
of  national  cultural  differences  have  received 
worldwide  attention.  Culture  consists  of  values, 
rituals,  heroes,  and  symbols,  according  to  Hofstede, 
but  values  are  seen  as  most  fundamental.  His 
famous  five  dimensions  of  national  cultures  are 
power  distance,  uncertainty  avoidance,  individual¬ 
ism  versus  collectivism,  masculinity  versus  femi¬ 
ninity,  and  long-  versus  short-term  orientations. 


Hofstede  studied  in  detail  national  differences  in 
culture  in  more  than  50  nations  along  these  dimen¬ 
sions  and  examined  the  consequences  of  the  cul¬ 
tural  dimensions  along  a  series  of  domains,  for 
example,  family  patterns,  school  and  work  organi¬ 
zations,  and  politics. 

Another  influential  theory  of  human  values  was 
formulated  by  Schwartz,  who  describes  10  types  of 
basic  human  values  that  are  distinguished  by  their 
motivational  goals.  These  values  are  power, 
achievement,  hedonism,  stimulation,  self-direction, 
universalism,  benevolence,  tradition,  conformity, 
and  security.  This  theory  formulates  a  structure  of 
dynamic  relations  among  the  values.  This  is  indi¬ 
cated  by  a  classification  of  the  10  value  types  in  a 
circular  continuum.  Adjacent  value  types  share 
some  motivational  emphases  and  are  therefore 
compatible,  while  other  values  are  more  distant 
from  each  other  and  are  considered  to  be  conflict¬ 
ing  or  even  diametrically  opposed. 

The  oppositions  can  also  be  summarized  by 
organizing  the  values  into  two  bipolar  higher  order 
dimensions.  One  dimension  contrasts  openness  to 
change  and  conservation  values.  This  dimension 
puts  self-direction  and  stimulation  values — which 
emphasize  independent  action,  thought,  and  feeling 
and  readiness  for  new  experiences — in  opposition 
to  conservation  values,  security,  conformity,  and 
tradition — which  emphasize  self-restriction,  order, 
and  resistance  to  change.  The  second  higher  order 
dimension  contrasts  self-enhancement  and  self¬ 
transcendence  value  types.  This  dimension  puts 
power  and  achievement  values — which  emphasize 
one’s  pursuit  of  success  and  dominance  of  others — 
in  opposition  to  universalism  and  benevolence 
values — which  involve  concern  for  the  welfare  and 
interests  of  others.  There  has  also  been  consider¬ 
able  research  on  the  causes  and  consequences  of 
each  of  these  value  types. 

Political  Values  and  Political  Value  Dimensions 

Consummatory  political  values  can  be  considered 
as  end  states  that  individuals  would  like  to  see 
characterize  the  society  as  a  whole  and  see  imple¬ 
mented  through  the  political  system.  Instrumental 
political  values  are  modes  of  conduct  that  are  con¬ 
sidered  legitimate  (or  illegitimate)  to  influence 
political  decisions,  for  example,  various  types  of 
political  participation  and  ways  of  influencing 
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political  decisions.  In  political  science,  the  concept 
of  values  is  at  the  core  of  David  Eastons’s  famous 
definition  of  politics — as  interactions  through 
which  values  are  authoritatively  allocated  for  a 
society. 

The  most  well-known  political  value  dimension 
is  that  of  the  materialist/postmaterialist  value  orien¬ 
tations.  These  value  orientations  were  originally 
formulated  by  Ronald  Inglehart,  who  argues  that 
“new”  postmaterialist  values  are  deeply  rooted  and 
stand  in  opposition  to  more  traditional  materialist 
values.  Materialist  values  emphasize  economic  and 
physical  security,  such  as  economic  stability  and 
growth,  law  and  order,  and  strong  defense.  Post- 
materialist  values  emphasize  self-expression,  sub¬ 
jective  well-being,  and  quality  of  life.  The  new 
postmaterialist  political  values  are  central  to  the 
“New  Politics”  because  these  values  are  associated 
with  new  political  issues  such  as  environmental  con¬ 
cerns,  lifestyle  issues,  social  and  political  participa¬ 
tion,  minority  rights  and  social  equality  and  social 
issues,  and  a  change  in  the  social  base  for  radical, 
change-oriented  politics.  Another  way  of  conceptu¬ 
alizing  the  New  Politics  is  represented  by  environ¬ 
mental  versus  economic  growth  values.  Today,  this 
conflict  is  firmly  rooted  in  the  public  consciousness, 
and  in  many  West  European  countries,  conflicts 
over  environmental  values  seem  to  be  the  most 
manifest  expression  of  the  New  Politics  conflict.  A 
clear  manifestation  of  this  is  the  emergence  of  Green 
parties,  which  have  gained  considerable  electoral 
support  in  many  Western  democracies. 

Scott  Flanagan  has  emphasized  that  Inglehart’s 
conceptualization  of  value  change  combines  two 
dimensions:  (1)  a  materialist/nonmaterialist  dimen¬ 
sion  and  (2)  a  libertarian/ authoritarian  dimension. 
Flanagan  considers  value  change  along  the  latter 
dimension  as  most  important.  The  overarching 
concept  that  integrates  libertarian  values  is  self- 
actualization,  and  the  central  value  orientations 
within  the  notion  of  libertarian  values  are  auton¬ 
omy,  openness,  and  self-improvement.  The  author¬ 
itarian  value  orientations  designate  a  broader 
cluster  of  values,  which,  along  with  concerns 
about  security  and  order,  includes  respect  for 
authority,  discipline  and  dutifulness,  patriotism, 
intolerance  toward  minorities,  conformity  to  cus¬ 
toms,  and  support  for  traditional  religious  and 
moral  values.  The  libertarian/authoritarian  value 
orientations  are  also  the  central  components  in 


Herbert  Kitschelt’s  important  work  on  changes  in 
the  party  systems  of  Western  democracies. 

Though  dominant,  research  on  materialist/ 
postmaterialist  political  values  has  gradually  been 
supplemented  by  work  on  other  political  value 
dimensions.  Political  value  conflicts  can  be  consid¬ 
ered  to  originate  in  the  most  crucial  sociopolitical 
cleavages.  For  example,  according  to  the  seminal 
work  of  Seymour  Martin  Fipset  and  Stein  Rokkan, 
the  most  important  political  cleavages  in  industrial 
society  are  the  religious  cleavage  and  the  class 
cleavage.  Both  of  these  cleavages  are  associated 
with  important  value  orientations.  Christian  values 
focus  on  the  importance  of  Christian  morals  and 
principles  in  society  and  politics  and  on  traditional 
moral  guidelines  in  school  and  society  in  general. 
Secularization  is  often  understood  as  a  process 
whereby  mundane  reality  is  less  and  less  interpreted 
from  a  supernatural  perspective,  and  secular  values 
are  based  on  more  modern  norms  of  morality  that 
people  want  to  decide  for  themselves  without  the 
guidelines  of  the  church.  Religious/secular  values 
tap  these  contrasting  orientations.  Religious  orien¬ 
tations  are  often  considered  to  comprise  two  differ¬ 
ent  aspects.  One  aspect  is  the  religious  beliefs  that 
people  hold.  This  can  be  tapped  by  concrete  ques¬ 
tions  about  which  religious  conceptions  and  dog¬ 
mas  the  respondents  believe  in,  how  important  God 
is  in  their  life,  or  what  kind  of  God — if  any — they 
believe  in.  The  other  aspect  is  church-oriented  reli¬ 
gion  or  church  religiosity.  The  essence  of  this 
dimension  is  “church  integration”:  The  more  peo¬ 
ple  participate  in  the  relevant  church’s  rites  and 
services,  the  more  church  integrated  they  are. 

The  Industrial  Revolution  gave  rise  to  economic 
interest  conflicts  that  were  anchored  in  hierarchi¬ 
cal  sociostructural  variables.  The  most  important 
political  value  orientations  that  emerged  from  the 
Industrial  Revolution  were  the  economic  (materi¬ 
alist)  left-right  values.  These  value  orientations  are 
economic  in  nature,  and  they  refer  in  particular  to 
the  role  of  government  in  creating  more  economic 
equality  in  society  versus  the  need  for  economic 
incentives  and  efficiency.  These  value  orientations 
incorporate  value  conflicts  related  to  control, 
power,  and  the  degree  of  distribution  of  resources 
in  the  production  sphere.  They  include  workers’ 
control  and  state  regulation  of  the  economy  versus 
private  enterprise,  private  property,  and  the  mar¬ 
ket  economy,  and  the  size  of  the  welfare  state  and 
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economic  and  social  equality  versus  the  need  for 
differentiated  rewards  for  stimulating  effort.  The 
moral  value  dimension  and  economic  left-right 
values  are  often  referred  to  as  “Old  Politics” 
because  they  capture  the  essence  of  the  traditional 
lines  of  conflict  in  industrial  society.  In  contrast, 
the  New  Politics  refers  to  value  conflicts  emerging 
from  postindustrial  society. 

Inglehart  has  broadened  his  study  of  value 
dimensions  by  emphasizing  that  his  materialist/ 
postmaterialist  dimension  is  only  one  component 
of  a  much  larger  value  syndrome.  In  an  alternative 
way  of  conceptualizing  and  analyzing  value  orien¬ 
tations  in  a  long-term  perspective,  he  focuses  on  a 
two-dimensional  value  structure:  one  dimension 
based  on  traditional  versus  secular-rational  values 
and  the  other  based  on  survival  versus  self-expression/ 
well-being  values.  These  two  dimensions  are  asso¬ 
ciated  with  the  structural  changes  from  preindus¬ 
trial  to  industrial  and  from  industrial  to  postindus¬ 
trial  society,  respectively.  The  most  essential  values 
in  the  first  dimension  are  religious  versus  more 
secular  values,  but  the  traditional  pole  also  includes 
family  values,  social  conformity  rather  than  indi¬ 
vidualistic  striving,  deference  to  authority,  high 
levels  of  national  pride,  and  a  nationalistic  out¬ 
look,  while  the  secular-rational  pole  has  the  oppo¬ 
site  preferences. 

The  self-expression  values  on  the  second  dimen¬ 
sion  are  values  and  attitudes  such  as  tolerance, 
trust,  emphasis  on  subjective  well-being,  and  civic 
activism  that  emerge  in  postindustrial  societies 
with  high  levels  of  existential  security  and  indi¬ 
vidual  autonomy.  At  the  opposite  pole,  survival 
values  emphasize  economic  and  physical  security, 
and  they  include  orientations  such  as  intolerance 
of  outgroups,  an  authoritarian  outlook,  insistence 
on  traditional  gender  roles,  and  resistance  to  cul¬ 
tural  change.  Materialist/postmaterialist  values  are 
central  in  the  second  dimension.  Economic  left- 
right  values  do  not  appear  to  be  central  in  their 
characterization  of  either  of  the  two  dimensions, 
but  they  could,  nevertheless,  be  primarily  located 
along  the  first  dimension. 

Value  Change  and  Cross-National 
Differences  in  Political  Value  Priorities 

Inglehart  incorporates  most  explicitly  the  issue  of 
cultural  change  in  his  work.  He  identifies  a  “silent 


revolution”  in  which  a  gradual  value  change  takes 
place  along  the  materialist/postmaterialist  dimen¬ 
sion.  As  older  and  more  materialist  generations 
die,  they  are  continuously  replaced  by  younger, 
less  materialist  generations.  Inglehart’s  theory  is 
based  on  two  hypotheses:  (1)  The  scarcity  hypoth¬ 
esis  implies  that  short-term  effects  may  induce  all 
cohorts  to  emphasize  postmaterialist  values  when 
economic  conditions  are  good  and  materialist  val¬ 
ues  when  economic  conditions  decline.  The  cohort 
differences  are  explained  by  differences  in  eco¬ 
nomic  and  physical  security  during  the  formative 
years  of  the  various  cohorts.  (2)  The  socialization 
hypothesis  predicts  a  watershed  between  the  post- 
World  War  II  and  the  prewar  cohorts  in  value 
priorities  because  they  have  such  different  experi¬ 
ences  in  the  formative  years  regarding  economic 
security  (economic  scarcity  vs.  economic  prosper¬ 
ity)  and  physical  security  (war  vs.  absence  of  war). 

The  theory  of  value  change  is  based  on  a  strong 
and  consistent  relationship  between  age  and  mate¬ 
rialist/postmaterialist  values  and  presupposes  that 
life  cycle  effects  are  not  dominant  for  explaining 
age  differences.  These  expectations  are  supported 
by  comparative  survey  data.  Inglehart  finds  also 
that  support  for  postmaterialist  values  is  strongest 
in  wealthy  countries  and  that  generation  differ¬ 
ences  are  largest  in  countries  with  high  economic 
growth.  This  is  explained  by  differences  in  eco¬ 
nomic  security  during  childhood  and  youth  (for¬ 
mative  years):  In  wealthy  countries,  a  large  portion 
of  the  population  has  grown  up  in  an  economically 
secure  environment.  In  countries  with  a  high  level 
of  economic  growth,  the  various  cohorts  have 
experienced  dramatic  changes  in  levels  of  eco¬ 
nomic  security.  Similar  patterns  and  explanations 
have  been  put  forward  for  comparative  differences 
and  generational  differences  on  the  broader  sur¬ 
vival/self-expression  value  dimension. 

Empirical  research  has  shown  a  fairly  consistent 
decline  in  religiosity  in  rich,  advanced  industrial 
countries  along  the  different  dimensions  discussed 
above.  In  the  discussion  of  secularization  in  Europe, 
some  have  argued  that  this  applies  only  to  the 
church-oriented  dimension,  not  the  belief  dimen¬ 
sion.  Comparative  longitudinal  empirical  research, 
however,  has  found  that  these  two  sides  of  reli¬ 
gious  involvement  are  highly  correlated  and  even 
that  the  distinction  between  them  is  difficult  to 
uphold  in  empirical  analysis.  Church-oriented  and 
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belief-oriented  religious  involvements  tend  to  “go 
together”  and  are  difficult  to  separate  in  empirical 
research.  There  is  even  evidence  that  the  changes  in 
the  two  measures  take  place  simultaneously,  with 
changes  in  church  integration  being  paralleled  by 
similar  changes  in  religious  beliefs.  The  most 
important  aspect  of  secularization  at  the  individual 
level  is  the  decline  in  both  religious  beliefs  and 
church  attendance.  In  other  parts  of  the  world, 
religiosity  has  been  much  more  stable.  Overall,  the 
world  is  apparently  becoming  more  religious  even 
though  the  advanced  industrial  countries  in  the 
West  have  become  considerably  more  secular. 

The  late  1970s  and  the  1980s  saw  a  revival  of 
conservative  and  neoliberal  ideas  and  movements 
and  increased  support  for  nonsocialist  political 
parties  that  challenged  the  postwar  Keynesian  con¬ 
sensus  on  the  welfare  state,  state  intervention,  and 
citizenship  rights  in  the  social  domain — economic 
leftist  values  that  had  been  implemented  in  most 
West  European  polities.  This  right-wing  ideology 
underscored  the  superiority  of  market  mechanisms 
in  promoting  both  economic  prosperity  and  the 
maximization  of  individual  freedom.  It  was  also 
adamant  that  state  intervention  did  not  work  and 
accorded  a  minimal  role  to  the  state  in  the  opera¬ 
tion  of  the  economy  and  the  social  order.  In  many 
countries,  nonsocialist  governments  with  a  neolib¬ 
eral  agenda  came  to  power.  The  value  change  that 
underpinned  these  rightist  trends  was  a  move 
toward  economic  rightist  values  that  could  be 
documented  in  many  countries.  However,  the 
1980s  saw  a  trend  toward  leftist  values,  and  in 
many  countries,  the  mass  public’s  values  were  as 
leftist  as  they  had  been  before  the  shift  toward  the 
right  around  1980.  In  the  1990s,  many  social 
democratic  governments  came  to  power  but  with 
more  centrist  programs  than  before,  and  trends  in 
economic  left-right  values  from  the  1990s  show 
stability  or  inconclusive  patterns. 

Two  different  perspectives  on  comparative  dif¬ 
ferences  in  priorities  of  economic  left-right  orienta¬ 
tions  are  found  in  the  literature.  The  first  one  is  that 
values  will  vary  according  to  the  welfare  regime  that 
exists  in  the  given  country.  The  point  of  departure 
is  the  well-known  distinction  between  the  universal 
or  social  democratic,  the  conservative  or  Christian 
democratic,  and  the  liberal  or  residual  welfare  state. 
Welfare  state  attitudes  are — according  to  this 
perspective — formed  by  the  character  of  the 


institutional  regimes  of  social  policy.  Universal  wel¬ 
fare  states  are  underpinned  by  strong  popular  sup¬ 
port  because  most  social  groups  are  included  as 
recipients.  This  is  not  the  case  with  the  liberal  wel¬ 
fare  state,  where  the  recipients  are  the  lower  social 
classes.  Support  for  these  hypotheses  is  rather 
mixed. 

The  second  perspective  predicts  that  leftist  ori¬ 
entations  will  receive  greatest  support  in  the  least 
advanced  Western  societies,  where  welfare  pro¬ 
grams  and  the  regulation  of  the  economy  have 
been  low  and  where  social  inequalities  are  fairly 
large.  People  in  these  countries  want  change  in  a 
more  egalitarian  direction,  while  those  in  more 
advanced  welfare  states  live  in  a  different  eco¬ 
nomic  environment  where  the  need  for  further 
development  of  leftist  policies  is  not  appreciated. 
In  Inglehart’s  conceptualization,  there  is  a  dimin¬ 
ishing  marginal  utility  or  return  of  classical  eco¬ 
nomic  leftist  policies  such  as  economic  regulation 
and  income  equality  in  advanced  welfare  states. 
This  view  is  supported  by  different  comparative 
studies  of  attitudes  toward  income  equality  and 
state  intervention,  as  well  as  general  economic 
left-right  orientations. 

Antecedents  of  Political  Value  Orientations 

Most  of  the  research  has  been  done  on  materialist/ 
postmaterialist  values.  According  to  the  theory  of 
the  New  Politics,  the  social  basis  of  these  values 
turns  the  old  order  upside  down:  The  spread  of 
postmaterialist  values  is  explained  by  economic 
growth  and  wealth,  generational  replacement,  the 
growth  of  the  new  middle  class,  and  the  spread  of 
higher  education.  Change-oriented  postmaterialist 
values  should  be  most  frequently  found  among  the 
higher  educated  strata,  the  new  middle  class,  and 
the  younger  cohorts.  Studies  have  found  that  sup¬ 
port  for  environmental  and  libertarian  values  in 
advanced  industrial  societies  is  strongest  among  the 
younger  age-groups,  those  with  higher  education, 
and  those  who  belong  to  the  new  middle  class. 

Regarding  religious/secular  values,  age  and  gen¬ 
der  are  the  most  important  predictors,  while  educa¬ 
tion  has  a  smaller  impact  and  other  sociostructural 
variables  such  as  social  class  are  of  less  importance. 
Men  and  the  younger  cohorts  have  more  secular 
values.  Economic  left-right  values  are  coupled  with 
the  class  cleavage.  Leftist  values  are  strongest 
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among  the  working  class,  those  with  lower  income, 
and  the  lower  educated  strata.  The  correlations 
between  these  class  variables  and  economic  left- 
right  values  have,  however,  declined  considerably 
over  time.  Age  and  gender  are  seldom  consistently 
correlated  with  economic  left-right  values  in  com¬ 
parative  research. 

Consequences  of  Value 

Personal  and  social  values  can  have  important 
consequences,  as  research  on  Hofstede’s  and 
Schwartz’s  value  dimensions  has  shown.  Most  of 
the  research  on  political  consequences  of  values, 
however,  has  focused  on  political  values.  According 
to  the  theory  of  the  New  Politics,  materialist/ 
postmaterialist  values  will  have  important  conse¬ 
quences  for  political  attitudes  and  behavior  in 
advanced  industrial  democracies.  This  is  framed 
within  a  developmental  perspective:  There  is  a 
transformation  of  cleavage  structure  from  ascrip- 
tive  variables  (e.g.,  gender  and  religious  denomi¬ 
nation)  and  achieved  or  hierarchical  (social  status) 
variables  to  postindustrial  (political  value)  vari¬ 
ables.  New  value  conflicts  are  gradually  replacing 
traditional  social  structural  variables  for  explaining 
political  behavior,  or — in  Inglehart’s  words — 
there  is  a  change  from  a  “class-based  to  a  value- 
based  pattern  of  political  polarization.” 

In  accordance  with  New  Politics  theory,  materi- 
alist/postmaterialist  values  have  several  aspects: 

1.  They  have  shown  themselves  to  be  important 
predictors  for  attitudes  toward  social  change. 
Individuals  with  postmaterialist  values  express 
stronger  support  for  radical  social  change. 

2.  They  are  increasingly  the  basis  for  identification 
with  the  ideological  labels  “Left”  and  “Right.” 
This  implies  that  the  ideological  meaning  of 
Left  and  Right  changes  from  meanings  related 
to  the  social  structure  and  values  in  industrial 
society  to  postmaterialist  values. 

3.  They  are  an  important  cause  of  the 
“participatory  revolution” — that  is,  the  increase 
in  protest  activities  or  unconventional 
participation. 

4.  They  will  increasingly  become  an  important 
determinant  for  party  choice.  They  will 
constitute  a  new  political  axis  that  will  cut 


across  the  established  social  cleavages  and 
polarities  between  the  New  Left  and  Green 
parties  and  the  New  Right  parties. 

5.  They  are  determinants  of  a  series  of  political 
attitudes  that  are  closely  related  to  the  New 
Politics,  environmental  concerns,  lifestyle  issues, 
social  and  political  participation,  minority 
rights,  and  social  equality  and  social  issues. 

However,  when  economic  left-right  values  and 
religious/secular  values  are  included  in  the  analy¬ 
ses,  these  values  sometimes  have  an  even  larger 
impact  on  attitudes  and  behavior  than  materialist/ 
postmaterialist  values. 

There  are  also  different  views  on  the  impact  of 
values  on  political  attitudes  and  behavior.  When 
Van  Deth  and  Elinor  Scarbrough  (1995)  explicitly 
examined  the  impact  of  the  three  sets  of  political 
value  orientations,  the  main  finding  was  that  the 
behavioral  consequences  of  value  orientations  are 
rather  modest:  They  are  not  uniform  across  coun¬ 
tries,  and  the  impact  of  values  on  different  phe¬ 
nomena  did  not  change  much  from  the  1960s  to 
the  1990s.  A  major  exception  to  the  modest  impact 
of  political  values  is  party  choice,  where  all  three 
sets  of  value  orientations  have  considerable  impact 
and  tend  to  cut  across  each  other  in  the  way  they 
differentiate  between  various  political  parties. 

Oddbjorn  Knutsen 
University  of  Oslo 
Oslo,  Norway 
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The  term  variable  is  used  in  a  variety  of  senses  in 
social  science  methodology,  with  the  various 
usages  unified  by  reference  to  a  score  on  an  indica¬ 
tor  that  in  some  sense  could  have  been  different. 
Variables,  in  the  broadest  sense,  are  indicators  that 
are  not  constants;  contrasting  usages  of  both  terms 
are  briefly  considered  below.  Social  scientists  and 
other  methodologists  often  make  more  fine-grained 
distinctions  among  variables  in  terms  of  their  sta¬ 
tistical  or  causal  characteristics.  In  this  entry,  these 
distinctions  and  their  applications  are  discussed. 

Variables  and  Constants 

Some  indicators  capture  traits  that  are  ontologi- 
cally  variable:  If  a  study  were  to  be  in  some  sense 


repeated,  the  score  on  this  indicator  for  a  given 
case  could  have  been  other  than  what  it  was.  An 
intuitive  example  of  a  variable  in  this  sense  is  the 
result  of  rolling  a  die.  If  the  die  were  to  be  rolled 
again,  there  is  no  particular  reason  to  believe  that 
it  would  show  the  same  number  of  dots  as  on  an 
earlier  roll.  Hence,  the  number  of  dots  shown  on 
the  die  after  a  roll  is  a  variable  in  the  ontological 
sense.  An  equivalent  example  in  social  science 
research  involves  random  experimental  assign¬ 
ment  of  a  subject  to  one  of  a  number  of  treat¬ 
ments:  If  the  process  of  assignment  were  to  be 
repeated,  one  may  reasonably  expect  that  many 
subjects  would  get  a  different  assignment  than  they 
did  in  the  initial  iteration  of  the  study.  For  this 
reason,  one  may  conclude  that  experimental 
assignment  is  a  variable,  not  a  constant. 

A  second  common  usage  of  the  distinction 
between  variables  and  constants  involves  describ¬ 
ing  the  data  actually  produced  by  a  particular 
study,  rather  than  the  ontological  characteristics  of 
the  data-generating  process.  In  this  usage,  an  indi¬ 
cator  is  said  to  be  a  variable  if  the  cases  included 
in  the  study  have  more  than  one  score  on  the  indi¬ 
cator.  Thus,  a  research  design  in  which  only 
political  activists  are  selected  for  analysis  may  be 
described  as  one  in  which  political  participation  is 
a  constant;  in  contrast,  if  some  apolitical  individu¬ 
als  are  added  to  the  research  design,  participation 
would  then  be  said  to  be  a  variable.  This  is  the 
usage  employed  in  the  common  aphorism  that  it  is 
impossible  to  explain  a  variable  with  a  constant. 

These  two  usages  are  nonequivalent.  An  indica¬ 
tor  may  be  ontologically  variable  while  still  pro¬ 
ducing  the  same  score  for  every  case  in  a  given 
study.  As  a  simple  example,  consider  an  extremely 
unfair  coin  that  shows  heads  in  99.9%  of  flips  and 
tails  in  0.1%.  The  study  involves  flipping  the  coin 
100  times,  recording  1  for  each  instance  of  heads 
and  0  for  tails.  With  this  setup,  the  probability  of 
observing  a  score  of  one  for  each  of  the  100  cases 
of  flipping  the  coin  is  about  90.5%,  so  it  is  quite 
probable  that  the  resulting  indicator  will  be  a  con¬ 
stant  in  the  second  sense.  However,  each  coin  flip 
obviously  involves  a  variable  in  the  ontological 
sense:  It  is  not  impossible  (although  obviously 
unlikely)  that  a  given  flip  will  show  tails  rather 
than  heads. 

Likewise,  an  indicator  may  take  on  different 
scores  across  the  cases  in  a  given  study  without 
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being  ontologically  variable.  Consider  a  hypotheti¬ 
cal,  badly  designed  experiment  in  which  the  stu¬ 
dents  in  a  large  lecture  course  are  assigned  to  one 
of  two  treatments  by  the  decision  rule  that  stu¬ 
dents  whose  last  names  start  with  the  letters  A  to 
M  receive  one  treatment  (recorded  as  0),  while 
those  falling  between  N  and  Z  receive  the  other 
(recorded  as  1).  Clearly,  the  indicator  recording 
the  resulting  treatment  status  does  not  have  the 
same  score  across  all  cases  and  is  thus  a  variable  in 
the  second  sense.  Nevertheless,  consider  the 
research  design.  The  set  of  experimental  subjects  is 
fixed;  if  the  study  were  to  be  repeated,  the  same 
students  in  the  same  lecture  course  would  be  the 
participants.  Likewise,  the  decision  rule  for  assign¬ 
ment  to  treatment  is  deterministic — that  is,  a 
person  would  always  get  the  same  treatment 
assignment  (unless  he  or  she  were  to  change  last 
names  during  the  course  of  the  term).  Therefore,  in 
the  ontological  sense,  treatment  assignment  in  this 
study  is  a  constant,  even  though  it  takes  on  differ¬ 
ent  scores  for  different  cases.  The  key  idea  is  that, 
for  a  given  individual  in  this  research  design,  treat¬ 
ment  assignment  could  not  have  been  other  than 
what  it  was. 

Types  of  Variables 

Probability  theory  distinguishes  variables  in  terms 
of  the  mathematical  characteristics  of  the  set  of 
values  that  the  variable  may  take  on,  as  well  as  the 
probability  that  each  specific  value  (or  sometimes 
a  value  within  a  narrow  window  of  the  specified 
point)  will  actually  turn  up  in  a  specific  realization 
of  the  variable.  The  first  of  these  criteria  produces 
the  distinction  between  discrete  and  continuous 
variables,  while  the  second  generates  the  various 
families  of  named  random  variables  discussed  in 
probability  theory.  Discrete  random  variables  are 
those  that  take  on  values  that  can  be  associated 
with  (some  subset  of)  the  integers.  Discrete  vari¬ 
ables  in  political  science  may  include  the  number  of 
times  a  survey  respondent  was  contacted  by  repre¬ 
sentatives  of  a  political  party  during  the  past  year, 
the  number  of  battle  deaths  during  an  armed  con¬ 
flict,  or  the  number  of  seats  per  voting  district  in  a 
legislative  election.  These  variables  may  or  may  not 
have  an  upper  limit;  the  number  of  battle  deaths  in 
a  conflict  is  essentially  unlimited,  whereas  the 
number  of  legislative  seats  in  a  district  probably 


cannot  exceed  several  hundred.  What  all  discrete 
variables,  whether  bounded  or  not,  share  is  that 
some  numerical  values  between  the  lowest  and 
highest  observed  scores  are  in  principle  nonsensi¬ 
cal.  For  example,  it  makes  little  sense  to  speak  of  a 
conflict  that  produces  134.52  battle  deaths  or  of  a 
survey  respondent  who  was  visited  three  and  a  half 
times  by  representatives  of  a  party. 

Continuous  variables,  in  contrast,  are  those  that 
take  on  values  that  are  too  numerous  to  be  mapped 
onto  the  integers.  A  simple  example  of  a  continu¬ 
ous  variable  is  the  angle  between  the  minute  and 
the  hour  hand  on  a  clock.  Because  the  angle  can, 
in  principle,  be  measured  with  virtually  unlimited 
precision,  the  set  of  possible  results  of  this  variable 
is  essentially  infinite.  In  political  science  practice, 
few  variables  are  measured  with  such  high  preci¬ 
sion.  However,  variables  that  are  measured  with 
sufficient  detail  to  take  on  a  very  large  number  of 
possible  scores  are  essentially  treated  as  continu¬ 
ous,  even  though  in  principle  they  are  discrete.  By 
this  reasoning,  variables  such  as  an  individual’s 
age  or  years  of  education,  a  country’s  per  capita 
gross  domestic  product,  or  the  proportion  of  the 
population  of  a  country  belonging  to  the  country’s 
largest  ethnic  group  are  effectively  continuous. 

The  distinction  between  discrete  and  continuous 
variables  has  sometimes  played  a  major  role  in  rec¬ 
ommendations  for  data  analysis,  with  researchers 
being  advised  to  choose  techniques  that  are  close  fits 
for  the  characteristics  of  the  variable  that  they  seek 
to  explain.  Hence,  unbounded  continuous  variables 
may  be  explained  using  ordinary  least  squares 
(OLS)  regression,  while  discrete  variables  should  be 
explained  using  techniques  such  as  logistic  regres¬ 
sion,  Poisson  regression,  negative  binomial  regres¬ 
sion,  and  so  on.  A  different  perspective,  which 
places  less  emphasis  on  the  distinction  between 
continuous  and  discrete  variables,  has  recently 
received  increased  attention.  In  this  perspective, 
OLS  regression  always  predicts  the  mean  of  the 
variable  to  be  explained,  conditional  on  the  values 
of  the  other  variables  included  in  the  model. 
Further,  the  mean  is  an  important  quantity  regard¬ 
less  of  whether  the  variable  of  interest  is  discrete  or 
continuous.  Therefore,  OLS  regression  may  be  a 
useful  analytic  tool  regardless  of  whether  the  vari¬ 
able  to  be  explained  is  itself  discrete  or  continuous. 

Probability  theorists  and  statisticians  have 
described  a  large  number  of  more  specific  families 
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of  discrete  or  continuous  variables.  These  families 
of  variables  are  characterized  in  terms  of  the  prob¬ 
ability  that  specific  outcomes  will  occur.  For 
example,  consider  two  different  kinds  of  continu¬ 
ous  variables.  One  can  take  on  any  value  between 
0  and  10,  and  each  of  those  values  is  equally  likely 
to  occur.  This  is  one  example  of  the  family  of  uni¬ 
formly  distributed  variables;  other  members  of  the 
family  are  generated  by  moving  the  lower  and 
higher  bounds  of  the  variable’s  range  of  possible 
values.  The  second  variable  is  unbounded,  although 
most  of  the  scores  fall  between  0  and  10.  Whereas 
with  the  uniformly  distributed  variable  all  values 
within  the  range  are  equally  likely,  with  this  sec¬ 
ond  variable,  the  probability  associated  with  a 
given  value  increases  as  the  value  gets  closer  to  5. 
If  the  distribution  of  probability  across  values  fol¬ 
lows  what  is  informally  described  as  the  bell  curve, 
then  this  is  a  normally  distributed  variable;  other¬ 
wise,  it  may  follow  one  of  many  similar  distribu¬ 
tions,  such  as  the  t  distribution. 

These  formally  characterized  families  of  vari¬ 
ables  have  highly  specific  mathematical  forms  and 
generally  tolerate  no  deviations  from  the  ideal. 
Thus,  for  example,  if  the  second  variable  described 
above  is  slightly  more  likely  to  take  on  values 
below  its  mean  than  above  its  mean,  then  the  vari¬ 
able  does  not  follow  the  normal  distribution  or 
any  of  the  other  bell-like  distributions.  Indeed, 
statisticians  routinely  argue  that  most  real-world 
variables  do  not  exactly  conform  to  any  of  the 
mathematically  described  families  of  variables. 
Instead,  they  may  typically  have  their  own  idiosyn¬ 
cratic  probability  structures.  Named  and  well- 
studied  families  of  variables  may  nonetheless  be 
useful  as  approximations  to  the  probability  distri¬ 
butions  of  real-world  variables,  although  specific 
arguments  about  why  a  given  approximation  is 
useful  in  a  particular  context  are  needed. 

Social  scientists  routinely  distinguish  among 
types  of  variables  in  terms  of  their  causal,  rather 
than  statistical,  structure.  A  hypothesis  to  be  tested, 
or  the  report  of  a  finding,  may  be  specified  in  terms 
of  a  series  of  variables  that  are  said  to  play  certain 
roles  with  respect  to  each  other.  The  most  common 
distinction  in  this  vein  is  between  dependent  and 
independent  variables,  but  scholars  also  discuss 
confounding  or  omitted  variables;  collider  vari¬ 
ables;  intervening,  mediator,  or  mechanism  vari¬ 
ables;  moderator  variables;  exogenous  variables; 


endogenous  variables;  and  instrumental  variables. 
These  categories  of  variables  largely  crosscut  the 
various  meanings  of  variable  vis-a-vis  constant 
discussed  earlier. 

A  dependent  variable  is  the  variable  to  be  caus¬ 
ally  explained;  it  is  the  outcome  of  interest.  Such  a 
variable  is  said  to  be  dependent  because,  if  the 
hypothesis  is  correct,  its  value  depends  causally  on 
the  values  of  several  other  variables  specified  by 
the  analyst.  The  variables  that  are  hypothesized  to 
cause  the  dependent  variable  are  routinely  described 
as  independent  variables.  This  usage  seems  to  draw 
on  experimental  research  contexts,  in  which  one  or 
more  key  hypothesized  causes  of  the  outcome  in 
question  are  randomly  assigned  to  subjects.  With  a 
large  enough  sample,  random  assignment  will 
make  these  hypothesized  causes  have  no  statistical 
relationship  with  each  other  or  with  any  other  vari¬ 
able  that  they  do  not  cause;  hence,  they  are  inde¬ 
pendent  of  each  other  and  of  all  other  causes  of  the 
dependent  variable.  In  nonexperimental  research, 
the  independent  variables  are  rarely  statistically 
independent  in  this  sense.  Thus,  for  nonexperimen¬ 
tal  research,  it  is  best  to  regard  the  term  indepen¬ 
dent  variable  as  a  synonym  for  hypothesized  cause. 

Confounding  or  omitted  variables  are  usually 
variables  that  cause  one  or  more  of  the  indepen¬ 
dent  variables  in  a  hypothesis  and  also  the  depen¬ 
dent  variable.  (Confounding  variables  may  also 
have  more  complex  positions  in  a  causal  chain.) 
Omitted  or  confounding  variables  are  important 
because,  if  they  are  not  dealt  with  in  some  appro¬ 
priate  way,  they  can  distort  the  relationships 
between  the  independent  variables  and  the  depen¬ 
dent  variable — making  a  causal  relationship  appear 
to  exist  when  it  does  not  or  changing  the  apparent 
direction  or  magnitude  of  a  really  existing  relation¬ 
ship.  Strategies  for  dealing  with  confounding  vari¬ 
ables  include  randomly  assigning  cases  to  different 
categories  on  the  key  independent  variable  or 
variables  to  ensure  (with  large  enough  samples) 
that  no  confounding  variables  exist;  choosing  a  set 
of  cases  in  which  a  suspected  confounding  variable 
does  not  vary,  such  that  its  effects  are  constant 
across  the  cases;  identifying  some  component  of 
the  key  independent  variable  that  is  known  to  be 
unrelated  with  the  confounding  variable  and  using 
that  component  to  test  the  hypothesis;  or  incorpo¬ 
rating  the  confounding  variable  into  a  statistical 
model  to  remove  its  interference. 
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Collider  variables,  first  discussed  by  Judea 
Pearl,  are  variables  that  meet  the  following  crite¬ 
ria.  First,  they  are  caused  by  a  key  independent 
variable  or  by  a  cause  of  that  independent  variable. 
Second,  they  are  also  caused  by  an  unmeasured 
variable  that  is  statistically  and  causally  unrelated 
to  the  key  independent  variable.  Third,  that 
unmeasured  variable  is  a  cause  of  the  dependent 
variable.  Collider  variables  typically  pose  no  prob¬ 
lems  to  causal  inference  when  they  are  left  out  of  a 
research  design;  however,  if  the  research  design 
mistakenly  identifies  a  collider  variable  as  a  con- 
founder  and  either  selects  cases  so  as  to  hold  the 
collider  constant  or  incorporates  the  collider  into 
the  statistical  model,  the  result  will  be  to  induce  a 
statistical  relationship  between  the  key  indepen¬ 
dent  variable  and  the  unmeasured  variable,  thereby 
distorting  any  causal  inference. 

Variables  that  are  said  to  be  intervening,  media¬ 
tor,  or  mechanism  variables  are  those  that  constitute 
a  step  in  the  causal  process,  linking  an  independent 
variable  to  the  dependent  variable.  These  variables 
thus  help  answer  the  question,  “How  does  the  inde¬ 
pendent  variable  cause  the  dependent  variable?” 
Researchers  often  attend  to  intervening  variables 
because  they  regard  accounting  for  the  causal  pro¬ 
cess  connecting  the  independent  and  dependent 
variables  as  central  to  their  analytic  task.  Moderator 
variables  are  variables  that  act  to  change  the  causal 
effect  of  an  independent  variable.  For  example, 
watching  a  campaign  advertisement  might  increase 
Democrats’  evaluation  of  the  candidate  in  question 
while  decreasing  the  evaluation  of  Republicans  and 
leaving  political  independents  unaffected.  In  this 
case,  if  watching  the  advertisement  is  regarded  as 
the  independent  variable,  then  a  subject’s  party 
identification  is  a  moderator  variable. 

Exogenous  variables  are  independent  variables 
that  have  no  causal  or  statistical  relationship  with 
any  confounding  variable.  Typically,  this  is  either 
because  of  random  assignment  or  because  the 
causes  of  the  exogenous  variable  or  variables  are 
not  themselves  causes  of  the  dependent  variable.  If 
an  independent  variable  is  exogenous,  then  its 
apparent  relationship  with  the  dependent  variable 
will  generally  provide  information  about  the  true 
causal  relationship  between  the  variables.  Endoge¬ 
nous  variables  are  independent  variables  that  are 
not  exogenous  (sometimes,  the  dependent  variable 
is  also  said  to  be  endogenous).  In  other  words,  an 


endogenous  variable  is  causally  or  statistically 
related  with  one  or  more  confounding  variables. 
As  a  result,  the  apparent  relationship  between  an 
endogenous  variable  and  the  dependent  variable 
provides  little  direct  information  about  causal 
relationships. 

Finally,  instrumental  variables  are  exogenous 
variables  that  cause  an  endogenous  independent 
variable  but  have  no  direct  causal  relationship 
with  the  dependent  variable.  That  is  to  say,  instru¬ 
mental  variables  are  variables  that  cause  one  of  the 
hypothesized  causes  of  the  outcome  and  that  are 
unaffected  by  confounders.  Sometimes,  an  experi¬ 
mental  assignment  can  serve  as  an  instrumental 
variable  if  the  outcome  caused  by  that  experimen¬ 
tal  assignment  is  itself  a  hypothesized  cause  of 
another  variable  of  interest  and  if  the  experimental 
assignment  has  no  direct  effect  on  that  other  vari¬ 
able.  Alternatively,  researchers  seek  instrumental 
variables  by  looking  for  causes  of  their  key  inde¬ 
pendent  variable  that  seem  to  be  in  some  sense  like 
an  experiment.  For  instance,  the  amount  of  rainfall 
in  a  given  year  may  be  usefully  like  an  experiment 
as  a  cause  of  agricultural  productivity,  which  in 
turn  may  have  effects  on  various  political  out¬ 
comes  that  are  not  themselves  directly  affected  by 
rainfall.  Instrumental  variables  are  useful  because 
they  can  identify  a  component  of  an  independent 
variable  that  is  not  affected  by  confounders. 

Jason  Seawright 
Northwestern  University 
Evanston,  Illinois,  United  States 
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Confounding  is  a  pervasive  problem  for  drawing 
inferences  about  political  causes  and  effects.  In 
brief,  some  individuals,  countries,  or  other  units 
are  exposed  to  a  “treatment”  or  “intervention,” 
while  others  are  not.  Differences  in  outcomes  may 
reflect  the  effect  of  treatment,  or  they  may  be  due 
to  confounders — that  is,  variables  associated  with 
exposure  to  treatment  and  with  the  outcome.  For 
example,  does  civil  war  inhibit  economic  growth? 
Do  political  institutions  shape  development?  How 
does  service  in  the  military  affect  wages  after  war? 
Such  questions  are  difficult  to  settle,  because  a 
range  of  unobserved  variables  are  associated  with 
the  presence  of  civil  war,  types  of  political  institu¬ 
tions,  or  service  in  the  military,  and  these  may 
shape  growth,  development,  or  individual  wages. 
Reverse  causality  can  also  be  a  problem. 

Instrumental  variables  can  be  used  to  address 
the  problem  of  confounding,  in  both  experiments 
and  observational  studies.  In  randomized  con¬ 
trolled  experiments,  a  coin  flip  determines  which 
subjects  are  assigned  to  treatment,  so  subjects 
assigned  to  receive  the  treatment  are,  on  average, 
just  like  subjects  assigned  to  control.  However, 
even  in  experiments  there  can  be  confounding,  if 
subjects  who  accept  the  treatment  are  compared 
with  those  who  refuse  it.  Analysts  should,  there¬ 
fore,  compare  subjects  randomly  assigned  to  treat¬ 
ment  with  those  randomly  assigned  to  control. 
Instrumental-variables  analysis  may  be  used  to 
estimate  the  effect  of  treatment  on  Compliers  (sub¬ 
jects  who  follow  the  treatment  regime  to  which 
they  are  assigned).  In  experiments,  treatment 
assignment  usually  satisfies  two  key  requirements 
for  an  instrumental  variable:  It  is  statistically  inde¬ 
pendent  of  unobserved  causes  of  the  dependent 


variable,  and  it  plausibly  affects  the  outcome  only 
through  its  effect  on  treatment  receipt. 

In  observational  studies  (those  in  which  assign¬ 
ment  to  treatment  is  not  under  the  control  of  the 
researcher),  the  problem  of  confounding  is  typi¬ 
cally  more  severe  because  units  self-select  into  the 
treatment  and  control  groups.  Instrumental- 
variables  analysis  can  be  used  to  recover  the  effect 
of  an  endogenous  treatment — that  is,  a  treatment 
variable  that  is  correlated  with  confounders. 
However,  strong  assumptions  are  often  required, 
and  these  can  be  only  partially  validated  from 
data.  The  use  of  instrumental  variables  in  observa¬ 
tional  studies  is  discussed  below,  after  a  bench¬ 
mark  application  to  experimental  data  is  first 
described. 

Instrumental-Variables 
Analysis  of  Experiments 

In  experiments,  subjects  often  fail  to  follow  the 
treatment  regime  to  which  they  are  assigned.  In  A. 
Gerber  and  D.  Green’s  (2000)  study  of  the  effect  of 
door-to-door  canvassing  on  turnout,  for  example, 
some  voters  who  were  assigned  to  receive  a  get- 
out-the-vote  message  did  not  answer  the  door.  It  is 
misleading  to  compare  subjects  who  answer  the 
door  with  subjects  who  do  not  because  there  may 
be  confounding.  However,  treatment  assignment 
can  serve  as  an  instrumental  variable  for  treatment 
receipt,  which  allows  an  estimation  of  the  effect  of 
treatment  on  Compliers. 

An  example  from  the  health  sciences  helps 
make  the  logic  clear.  In  the  1960s,  the  Health 
Insurance  Plan  (HIP)  clinical  trial  studied  the 
effects  of  screening  for  breast  cancer.  About 
31,000  women  between  the  ages  of  40  and 
64  years  were  invited  for  annual  clinical  visits  and 
mammographies,  which  are  X-rays  designed  to 
detect  breast  cancer.  The  group  of  women  invited 
for  screening  was  called  the  assigned-to-treatment 
group,  or  just  the  treatment  group.  In  the  control 
group,  31,000  women  received  the  status  quo 
health  care.  The  invitation  for  screening  was  issued 
at  random,  so  that  the  women  in  the  assigned-to- 
treatment  group  were  just  like  the  women  who 
were  not,  up  to  random  error. 

Table  1,  adapted  from  D.  Freedman  (2009), 
shows  death  rates  from  breast  cancer  5  years  after 
the  start  of  the  trial.  In  the  assigned-to-treatment 
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group,  20,200  women  or  about  two  thirds  of 
women  accepted  the  invitation  to  be  screened, 
while  one  third  refused.  It  might  seem  natural  to 
compare  the  women  who  received  screening  with 
those  who  refused.  Yet  women  self-select  into 
screening,  and  those  who  accept  screening  are  dif¬ 
ferent  from  those  who  refuse.  There  is  an  impor¬ 
tant  confounder:  Richer  and  better-educated 
women  tend  to  come  in  for  screening,  and  while 
such  women  are  less  vulnerable  to  other  diseases 
(see  the  final  column  of  Table  1),  they  are  more 
prone  to  breast  cancer  (probably  because  they  tend 
to  have  fewer  children  and  childbearing  is  protec¬ 
tive  against  breast  cancer). 

The  correct,  experimental  comparison  is 
between  women  randomly  invited  to  come  in  for 
screening — whether  or  not  they  were  actually 
screened — and  the  whole  control  group.  This 
intention-to-treat  analysis  shows  a  strong  effect  in 
relative  terms.  In  the  assigned-to-treatment  group, 
there  were  1.26  deaths  per  1,000  women,  while 
there  were  2.03  deaths  per  1,000  women  in  the 


control  group.  So  the  effect  of  assignment  to 
screening  is  —0.77  deaths  per  1,000.  However,  the 
intention-to-treat  analysis  likely  understates 
the  effect  of  screening — after  all,  one  third  of  the 
women  in  the  assigned-to-treatment  group  were 
not  actually  screened. 

What,  then,  was  the  effect  of  screening  on 
women  in  the  treatment  group  who  accepted 
screening?  Instrumental-variables  analysis  answers 
this  question.  To  begin,  it  is  useful  to  think  about 
the  experimental  population  as  comprising  two 
kinds  of  subjects:  Compliers  and  Never  Takers. 
Here,  Compliers  are  women  who  accept  screening 
if  they  are  assigned  to  treatment  but  are  not 
screened  if  assigned  to  control,  while  Never  Takers 
are  women  who  are  not  screened,  whether  they  are 
assigned  to  treatment  or  control.  By  looking  at  the 
control  group  alone,  we  cannot  tell  which  is 
which:  Never  Takers  look  just  like  Compliers, 
since  neither  type  of  subject  receives  the  treatment 
when  assigned  to  the  control  group.  In  the  treat¬ 
ment  group,  however,  20,200  or  about  two  thirds 


Table  I  Deaths  From  Breast  Cancer  and  Other  Causes  (Health  Insurance  Plan  Study) 


Group  Size 

Deaths  From 
Breast  Cancer 

Death  Rate 
per  1,000 
Women 

Deaths  From 
Other  Causes 

Death  Rate 
From  Other 
Causes  per 
1,000  Women 

Assigned  to  treatment 

Accepted  screening 

20,200 

23 

1.14 

428 

21.19 

Refused  screening 

10,800 

16 

1.48 

409 

37.87 

Total 

31,000 

39 

1.26 

837 

27.00 

Assigned  to  control 

Would  have  accepted 

20,200 

47 

2.33 

— 

— 

screening 

Would  have  refused 

10,800 

16 

1.48 

— 

— 

screening 

Total 

31,000 

63 

2.03 

879 

28.35 

Source:  Adapted  from  Freedman,  D.  (2009).  Statistical  models:  Theory  and  practice  (2nd  ed.,  p.  4,  Table  1).  New  York: 
Cambridge  University  Press. 
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of  the  women  accepted  screening.  Because  subjects 
are  randomly  assigned  to  treatment  and  control, 
the  mix  of  Compliers  and  Never  Takers  should  be 
about  the  same  in  both  groups.  We  can  thus  esti¬ 
mate  that  two  thirds  of  women  in  the  control 
group  are  Compliers,  just  as  in  the  treatment 
group. 

Instrumental-variables  analysis  compares  death 
rates  of  Compliers  in  the  treatment  group  with 
death  rates  of  Compliers  in  the  control  group.  It  is 
easy  to  measure  the  former  quantity  because  we 
observe  which  subjects  are  screened  in  the  treat¬ 
ment  group  and  can  track  their  death  rates.  But 
what  about  the  latter?  First,  note  that  Never 
Takers  in  the  treatment  group  and  Never  Takers 
in  the  control  group  should  have  a  similar  inci¬ 
dence  of  death  from  breast  cancer — after  all,  nei¬ 
ther  group  was  screened.  In  the  treatment  group, 
16  women  who  refused  screening — these  were 
Never  Takers — died  from  breast  cancer.  Thus, 
about  16  of  the  women  who  died  from  breast  can¬ 
cer  in  the  control  group  are  also  Never  Takers. 
(Here,  the  treatment  and  control  groups  are  the 
same  size;  if  they  differed,  we  would  use  rates 
instead  of  numbers.)  Because  63  women  died  from 
breast  cancer  in  the  control  group,  it  implies  that 
about  47  (i.e.,  63  =  16)  of  the  women  who  died 
from  breast  cancer  in  the  control  group  were 
Compliers. 

We  can  then  fill  in  the  third  column  of  Table  1 
for  the  subjects  assigned  to  control,  dividing 
deaths  from  breast  cancer  by  group  size.  The 
analysis  implies  a  death  rate  of  1.14  deaths  per 
1,000  women  among  Compliers  in  the  treatment 
group  and  2.33  deaths  per  1,000  among  Compliers 
in  the  control  group.  Thus,  the  effect  of  screening 
on  Compliers  is  —1.19  (i.e.,  1.14  —  2.33)  deaths 
per  1,000  women — a  substantially  larger  effect 
than  suggested  by  the  intention-to-treat  analysis. 
One  may  arrive  at  this  same  estimate  by  dividing 
the  estimated  intention-to-treat  parameter  by  the 
fraction  of  the  treatment  group  that  was  screened — 
that  is, 


In  the  instrumental-variables  estimator  in 
Equation  1,  we  are  implicitly  assuming  that  no 


women  in  the  control  group  were  screened.  (In 
the  1960s,  few  women  sought  out  mammography 
on  their  own.)  In  other  contexts,  subjects  who  are 
assigned  to  the  control  group  may  seek  out  the 
treatment.  With  a  double  crossover,  the  model 
for  compliance  would  be  extended  to  include 
Always  Takers — that  is,  subjects  who  receive  treat¬ 
ment  whether  assigned  to  treatment  or  control — 
as  well  as  Compliers  and  Never  Takers.  In  the 
denominator  of  the  instrumental-variables  esti¬ 
mator  analogous  to  Equation  1,  we  would  then 
need  to  subtract  the  fraction  of  the  control  group 
that  was  screened  from  the  fraction  of  the  treat¬ 
ment  group  that  was  screened.  Note  that  ran¬ 
dom  assignment  is  crucial  here,  because  it  allows 
us  to  estimate  the  counterfactual  outcomes  for 
women  in  the  control  group  who  would  have 
accepted  screening  had  they  been  assigned  to 
control. 

Some  assumptions  are  required.  For  one,  we 
must  assume  that  there  are  no  Defiers:  These  are 
subjects  who  do  the  opposite  of  what  they  are  told. 
In  the  HIP  breast  cancer  study,  Defiers  are  subjects 
who  would  take  an  exam  if  assigned  to  control  but 
would  refuse  an  exam  if  assigned  to  treatment. 
Notice  also  that  Equation  I  estimates  the  causal 
effect  of  treatment  for  a  specific  subset  of  experi¬ 
mental  subjects — namely,  Compliers.  When  the 
effects  of  treatment  are  heterogeneous  for  different 
subjects,  this  “local  average  treatment  effect”  may 
not  in  general  be  the  same  as  the  average  causal 
effect  of  treatment  for  all  subjects  in  the  experi¬ 
mental  population. 

Instrumental- Variables  Analysis 
of  Observational  Data 

In  observational  studies,  researchers  do  not  apply 
the  treatment  or  intervention;  instead,  the  subjects 
select  themselves  into  treatment  or  control  groups. 
Selection  is  usually  highly  nonrandom,  and  there  is 
typically  confounding.  However,  under  some 
conditions,  researchers  may  exploit  instrumental 
variables  to  recover  the  effect  of  an  endogenous 
treatment  variable.  Just  as  in  experiments,  a  valid 
instrumental  variable  must  be  independent  of  other 
causes  of  the  dependent  variable,  and  it  must  influ¬ 
ence  exposure  to  treatment  but  not  influence  the 
outcome,  other  than  through  its  effect  on  exposure 
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to  treatment.  The  latter  condition  is  sometimes 
called  an  exclusion  restriction,  with  reference  to 
the  exclusion  of  the  instrumental  variable  from  a 
causal  equation  governing  the  outcome. 

Joshua  D.  Angrist  (1990),  for  example,  uses 
draft  lottery  numbers  as  an  instrumental  variable 
for  military  service  during  the  Vietnam  War. 
Understanding  the  effects  of  past  military  service 
on  labor  market  earnings  is  difficult,  because 
people  who  choose  to  serve  in  the  military  may  be 
different  from  those  who  do  not,  in  ways  that 
matter  for  future  earnings.  A  key  assertion  in 
Angrist’s  study  is  that  the  draft  number  is  as  good 
as  being  randomly  assigned:  Whether  one’s  draft 
number  is  high  or  low  is,  therefore,  independent 
of  factors  that  influence  future  earnings.  With  a 
dichotomous  treatment  (military  service/no  mili¬ 
tary  service),  the  instrumental-variables  estimator 
is  analogous  to  Equation  1,  though  the  denomina¬ 
tor  should  be  adjusted  for  a  double  crossover: 
Some  people  dodge  the  draft,  while  others  serve  in 
the  military  even  if  they  are  not  drafted.  An 
important  but  reasonable  assumption  is  that  there 
are  no  Defiers — that  is,  people  who  sign  up  for  the 
military  if  not  drafted  but  emigrate  to  Canada 
when  their  number  comes  up.  Note  that,  here, 
instrumental  variables  estimate  the  effect  of  treat¬ 
ment  for  a  particular  subset  of  subjects — those 
who  serve  in  the  military  if  drafted,  but  not  other¬ 
wise.  Whether  this  effect  is  informative  about  the 
effect  of  military  service  for  other  subjects  may  be 
a  matter  of  opinion. 

A  second  example  comes  from  an  influential 
study  by  E.  Miguel,  S.  Satyanath,  and  E.  Sergenti 
(2004)  of  the  effect  of  growth  on  the  probability 
of  civil  war  in  Africa.  Confounding  poses  a  big 
problem  in  this  research  area,  since  many  difficult- 
to-measure  variables  may  affect  both  growth  and 
the  likelihood  of  civil  war.  However,  year-to-year 
variation  in  rainfall  is  plausibly  as-if  random,  and 
it  may  influence  economic  growth — that  is,  treat¬ 
ment  receipt — without  independently  affecting  the 
probability  of  civil  war  through  other  channels.  If 
so,  an  instrumental-variables  analysis  may  allow 
estimation  of  the  effect  of  economic  growth  on 
conflict  for  those  countries  whose  growth  perfor¬ 
mance  is  shaped  by  variation  in  rainfall.  This 
application  illuminates  another,  distinct  concern 
about  the  interpretation  of  instrumental-variables 


estimates:  Variation  in  rainfall  may  influence 
growth  only  in  particular  sectors,  such  as  agricul¬ 
ture,  and  growth  in  distinct  economic  sectors  may 
have  different  effects  on  the  probability  of  con¬ 
flict.  Using  rainfall  as  an  instrument  for  studying 
growth  may  capture  such  idiosyncratic  rather 
than  general  effects,  so  caution  may  be  advised 
when  extrapolating  results  or  making  policy  rec¬ 
ommendations. 

Finally,  D.  Acemoglu,  S.  Johnson,  and  J.  A. 
Robinson  (2001),  in  a  pathbreaking  study  of  the 
effects  of  institutional  arrangements  on  countries’ 
economic  performance,  use  colonial  settler  mortal¬ 
ity  rates  as  an  instrumental  variable  for  current 
institutions.  These  authors  argue  that  settler  mor¬ 
tality  rates  during  colonial  years  do  not  affect  cur¬ 
rent  economic  performance  in  former  colonies, 
except  through  their  effect  on  current  institutions; 
they  also  argue  that  settler  mortality  is  as  good  as 
randomly  assigned,  at  least  conditional  on  covari¬ 
ates.  Since  neither  assumption  is  verifiable  from 
the  data,  a  combination  of  historical  evidence  and 
a  priori  reasoning  must  be  used  to  try  to  validate, 
at  least  partially,  these  core  assumptions.  The  por¬ 
tion  of  current  institutions  that  is  “explained”  (in 
a  statistical  sense)  by  past  settler  mortality  rates 
may  also  have  idiosyncratic  effects  on  economic 
growth,  which  could  limit  the  generalizability  of 
the  findings. 

Strengths  and  Limitations  of 
Instrumental-Variables  Analysis 

As  these  examples  suggest,  instrumental  variables 
provide  an  important  tool,  because  they  help  con¬ 
front  the  problem  of  confounding — a  first-order 
issue  in  the  social  sciences.  Instrumental-variables 
regression  may  also  be  used  to  correct  for  error  in 
the  measurement  of  independent  variables,  which 
can  pose  important  inferential  obstacles  in  social 
scientific  research.  In  recent  years,  instrumental 
variables  have  been  used  to  estimate  causal  effects 
in  many  substantive  domains. 

Nonetheless,  the  use  of  instrumental  variables 
often  requires  strong  assumptions,  which  can  be 
only  partially  validated  from  data.  Some  empirical 
tests  can  be  performed  to  assess  the  central 
assumption  that  the  instrumental  variable  is  as 
good  as  randomly  assigned;  for  instance,  the 
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instrument  may  be  shown  to  be  uncorrelated  with 
pretreatment  covariates  (those  that  are  deter¬ 
mined  before  the  intervention).  A  priori  reasoning 
and  detailed  knowledge  of  the  empirical  context 
may  also  play  an  important  role.  In  observational 
studies,  however,  because  there  is  often  no  actual 
randomization,  the  validity  of  an  as-if  random 
assignment  is  often  a  matter  of  opinion;  this  asser¬ 
tion  may  be  classified  along  a  spectrum  from  “less 
plausible”  to  “more  plausible,”  but  it  is  difficult 
to  validate  the  placement  of  any  given  study  on 
such  a  spectrum. 

Additional  issues  arise  in  many  applications, 
often  in  connection  with  the  use  of  multiple  regres¬ 
sion  models.  For  instance,  concerns  about  the  endo¬ 
geneity  of  a  single  treatment  variable  will  typically 
lead  researchers  to  use  instrumental-variables 
regression.  Yet  analysts  typically  do  not  discuss  the 
possible  endogeneity  of  other  covariates  in  their 
multiple  regression  models.  (One  reason  may  be 
that  the  number  of  instruments  must  equal  or  sur¬ 
pass  the  number  of  endogenous  variables,  and  good 
instruments  are  difficult  to  find.)  Furthermore, 
instruments  that  are  truly  random  may  not  be 
strongly  related  to  an  endogenous  treatment;  in  this 
case,  a  substantial  small-sample  bias  can  arise.  One 
recommendation  for  practice  may  be  to  report 
“reduced-form”  results.  (Reduced  form  is  a  syn¬ 
onym  for  intention  to  treat;  here,  the  outcome  is 
regressed  directly  on  the  instrumental  variable.) 

Another  recommendation  may  be  to  report 
instrumental-variables  regressions  without  covari¬ 
ates;  with  one  endogenous  treatment  variable  and 
one  valid  instrument,  including  covariates  can  be 
unnecessary  and  even  be  harmful.  The  estimand 
should  be  carefully  defined,  and  difficulties  that 
may  arise  when  extrapolating  results  to  other  con¬ 
texts  and  types  of  subjects  should  be  considered.  In 
multiple-regression  models,  the  statistical  model 
itself  must  be  validated,  to  the  extent  possible; 
with  regression,  the  identification  of  causal  effects 
depends  not  just  on  the  exogeneity  of  instrumental 
variables  in  relation  to  a  posited  regression  model 
but  also  on  the  validity  of  the  underlying  model 
itself. 

Finally,  it  is  important  to  emphasize  that  nei¬ 
ther  of  the  core  criteria  for  a  valid  instrumental 
variable — that  it  is  statistically  independent  of 
unobserved  causes  of  the  dependent  variable  and 


that  it  affects  the  dependent  variable  only  through 
its  effect  on  the  endogenous  treatment — are 
directly  testable  from  data.  Analysts  using  instru¬ 
mental  variables  should  defend  these  assertions 
using  evidence  and  reasoning,  to  the  extent  possi¬ 
ble.  Yet  instrumental- variables  estimates  should 
be  interpreted  with  an  appropriate  degree  of  cau¬ 
tion,  especially  outside  the  experimental  context. 

Thad  Dunning 
Yale  University 
New  Haven,  Connecticut,  United  States 
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Veto  Player 


A  veto  player  is  an  individual  or  collective  actor 
who  possesses  veto  power  in  a  political  process  and 
behaves  strategically.  Particular  conceptions  of  the 
basic  concept  have  to  specify,  among  other  things, 
what  the  relevant  process  is,  who  the  veto  players 
are,  and  what  motivates  them.  The  most  influential 
conception  of  veto  players  has  been  developed  by 
George  Tsebelis  (2002)  in  his  veto  player  theory. 
That  model  has  been  elaborated  in  a  number  of 
different  ways  by  subsequent  scholars. 

Tsebelis  relies  on  the  concept  of  the  veto  player 
to  unify  the  comparative  analysis  of  political  sys¬ 
tems.  One  part  of  this  unification  is  conceptual. 
Tsebelis  argues  that  traditional  institutional 
dichotomies  such  as  between  unicameral  and 
bicameral  parliaments  or  between  presidential  and 
parliamentary  systems  of  government  can  to  a  sig¬ 
nificant  extent  be  replaced  by  analyzing  political 
systems  and  situations  as  particular  veto  player 
configurations. 

But  veto  player  theory  also  aims  at  unifying  our 
causal  understanding  of  politics.  The  basis  of  this 
causal  unification  is  the  concept  of  policy  stabil¬ 
ity — that  is,  the  difficulty  of  changing  the  existing 
policies  (the  status  quo)  in  a  political  system. 
Policy  stability  is  the  causal  mechanism  that  links 
veto  player  configurations  to  particular  outcomes 
of  substantive  importance.  Tsebelis  argues  that  in 
parliamentary  systems,  in  which  the  cabinet 
depends  on  the  confidence  of  the  assembly,  policy 
stability  leads  to  cabinet  instability.  If  the  cabinet 
cannot  agree  on  policy  change,  it  will  be  replaced. 
For  presidential  systems,  in  contrast,  veto  player 
theory  predicts  that  policy  stability  is  likely  to 
encourage  a  coup  or  some  other  form  of  regime 
instability.  Neither  of  these  two  arguments  is  com¬ 
pletely  new,  and  they  can  be  criticized,  but  veto 
player  theory  embeds  both  of  them,  and  many  oth¬ 
ers,  in  one  relatively  parsimonious  and  coherent 
framework. 

Tsebelis  distinguishes  two  types  of  veto  players: 
(1)  Institutional  players  are  those  established  by  a 
country’s  constitution.  For  example,  the  U.S. 
Constitution  identifies  the  president  as  a  veto 
player.  (2)  Partisan  players  are  established  by  the 
way  political  competition  plays  out  in  a  given  coun¬ 
try  at  a  given  time.  One  important  analytical  strat¬ 
egy  of  veto  player  theory  is  to  focus  on  partisan 


veto  players  whenever  possible.  For  example, 
instead  of  treating  the  Finnish  parliament  as  one 
institutional  veto  player,  the  theory  treats  cabinet 
parties  forming  a  majority  coalition  within  the 
parliament  as  partisan  veto  players.  One  advan¬ 
tage  of  “replacing”  institutional  with  partisan  veto 
players  for  analytical  purposes  is  that  we  may  be 
able  to  ascribe  particular  policy  preferences  to  the 
latter.  Estimating  actors’  preferences  is  crucial  for 
the  application  of  veto  player  theory. 

Tsebelis  relies  on  game  theory  and  a  spatial  rep¬ 
resentation  of  preferences.  Actors’  preferences  are 
represented  by  a  point — their  so-called  ideal 
point — in  some  one-dimensional  or  multidimen¬ 
sional  policy  space,  and  the  actors’  main  goal  is  to 
move  the  collective  decision  as  close  as  possible  to 
this  point.  Based  on  these  formal  tools,  Tsebelis 
derives  specific  hypotheses  about  the  relationship 
between  veto  players  and  policy  stability. 

A  superficial  reading  of  veto  player  theory  often 
misunderstands  it  as  proving  a  seemingly  trivial 
point — that  a  higher  number  of  veto  players 
increases  policy  stability.  Yet  one  crucial  insight  of 
the  formal  analysis  is  that  the  number  of  veto  play¬ 
ers  alone  tells  us  little  about  the  potential  for  policy 
change.  What  matters  is  how  this  number  interacts 
with  the  distances  between  the  ideal  points  of  veto 
players’  preferences.  Moreover,  the  location  of  the 
status  quo,  the  allocation  of  agenda-setting  power, 
and  the  internal  cohesion  of  collective  veto  players 
may  also  be  of  great  importance  for  the  outcome 
of  collective  decisions.  Veto  player  theory  analyses 
the  interplay  of  these  explanatory  factors. 

One  important  formal  result  of  veto  player 
theory  is  the  “absorption  rule.”  This  rule  states 
that  if  a  (potential)  veto  player  has  an  ideal  point 
that  is  geometrically  embedded  within  the  ideal 
points  of  other  veto  players — that  is,  if  its  ideal 
point  is  in  the  so-called  unanimity  core — then  this 
player  does  not  matter  in  the  analysis;  it  is 
absorbed.  For  example,  Tsebelis  argues  that  even 
courts  with  the  authority  to  review  and  veto  legis¬ 
lation  can  often  be  ignored  in  veto  player  analyzes 
because  they  are  selected  in  ways  that  virtually 
guarantee  their  absorption. 

Discussion  and  Critique 

Distinguishing  between  “pure”  and  “interpreted” 
veto  player  theory  is  helpful  in  discussing  the  lim¬ 
its  and  criticism  of  Tsebelis’s  theory.  The  pure 
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theory  consists  of  the  formal  models  in  which  the 
conclusions  follow  logically  from  the  assumptions. 
Yet  we  know  that  the  assumptions  are  idealizations 
or  useful  fictions.  The  combination  of  these  two 
facts — “false”  assumptions  and  valid  conclusions — 
implies  that  testing  the  “truth”  or  “falsity”  of  the 
veto  player  theory  is  not  a  meaningful  enterprise. 
The  more  important  question  is  whether  the  theory 
is  useful  for  particular  purposes. 

In  contrast  to  pure  theory,  interpreted  veto 
player  theory  includes  assumptions  about  how  the 
formal  models  relate  to  reality.  It  is  mainly  these 
assumptions  that  give  empirical  content  to  the 
theory  and  hence  make  it  testable.  Moreover, 
while  no  theorist  doubts  the  validity  of  Tsebelis’s 
deductions,  many  theorists  make  conflicting 
assumptions  about  how  these  deductions  are  to  be 
applied  to  the  real  world. 

Consider  an  example.  In  a  five-party,  multidi¬ 
mensional  legislature,  parties  A,  B,  and  C  form  a 
majority  coalition  whereas  parties  D  and  E  are  in 
opposition.  Tsebelis’s  interpreted  veto  player  theory 
assumes  that  A,  B,  and  C  are  veto  players  on  all 
policies.  A  competing  assumption,  proposed  by 
Michael  Laver  and  Kenneth  Shepsle  (1996),  is  that 
each  minister  is  a  sort  of  policy  dictator  for  his  or 
her  portfolio  so  that  one  of  the  three  parties  alone 
determines  policy  on  a  particular  issue.  A  third 
assumption,  highlighted  by  Anthony  McCann 
(2006),  is  that  legislation  may  reflect  the  preferences 
of  all  five  parties.  The  reason  is  that  one  of  the  oppo¬ 
sition  parties  may  undermine  the  existing  coalition 
by  forming  an  alternative  coalition  that  makes  some 
of  the  members  better  off.  Taking  opposition  par¬ 
ties’  preferences  into  account  may  thus  be  a  way  of 
reducing  their  incentive  to  challenge  the  existing 
coalition.  Formally  speaking,  while  Tsebelis  expects 
the  collective  decision  to  be  within  the  unanimity 
core  of  the  putative  veto  players  A,  B,  and  C, 
McGann  expects  it  to  be  within  the  so-called  uncov¬ 
ered  set  for  the  entire  five-party  parliament.  For 
Tsebelis,  it  is  sufficient  to  be  a  member  of  the  actual 
winning  coalition  in  order  to  be  a  veto  player.  For 
McGann,  a  true  veto  player  must  be  a  member  of 
every  possible  winning  coalition,  as  would  be  the 
case  if  one  party  had  a  majority  in  parliament. 

Just  as  Tsebelis’s  pure  veto  player  theory  may 
be  interpreted  differently,  theorists  may  also  use 
the  label  of  “veto  players”  to  refer,  at  least  in  part, 
to  other  pure  theories.  For  instance,  Scott  Basinger 
and  Mark  Hallerberg  (2004)  question  the 


importance  of  Tsebelis’s  absorption  rule.  They 
highlight  the  relevance  of  the  mere  number  of  veto 
players,  regardless  of  the  distances  between  their 
preferences,  based  on  the  view  that  many  veto 
players  increase  transaction  costs. 

Interpreted  veto  player  theory  is  often  tested  in 
isolation  rather  than  along  with  competing  theo¬ 
ries.  Or  it  is  simply  applied  to  some  phenomenon. 
Even  these  noncomparative  tests  or  applications 
are  not  without  problems,  however,  for  two 
related  reasons:  It  is  difficult  to  adequately  mea¬ 
sure  actors’  policy  preferences,  and  it  is  difficult  to 
unambiguously  identify  the  actual  veto  players. 

The  problem  of  identification  is  most  obvious  in 
the  case  of  minority  cabinets.  By  definition,  the 
parties  forming  a  minority  cabinet  do  not  have  suf¬ 
ficient  seats  in  parliament  to  change  the  status  quo, 
but  it  is  not  known  in  advance  which  party  or  par¬ 
ties  will  support  the  cabinet  on  a  particular  piece  of 
legislation.  Tsebelis  argues  that  minority  cabinets 
tend  to  have  a  privileged  position  in  the  policy 
space  as  well  as  strong  agenda-setting  powers,  so 
that  they  typically  do  not  have  to  make  significant 
concessions  to  support  parties.  Hence,  he  treats 
only  cabinet  parties  as  veto  players.  Other  authors, 
such  as  Steffen  Ganghof  and  Thomas  Brauninger 
(2006),  doubt  Tsebelis’s  assumptions  and  believe 
that  support  parties  can  be  quite  powerful. 
Assumptions  about  how  to  identify  veto  players  are 
part  of  the  interpretation  of  pure  veto  player  the¬ 
ory:  They  increase  the  theory’s  empirical  content 
and  hence  testability,  but  they  may  be  contested. 

The  problem  of  measuring  preferences  is  as  fol¬ 
lows:  Veto  player  theory  refers  to  actors’  final  (all 
things  considered)  preferences  over  legislative  pro¬ 
posals.  Yet  these  preferences  are  often  difficult  to 
measure,  especially  in  broad  comparative  studies. 
Empirical  estimates  of  preferences  based  on  party 
manifestos  or  expert  surveys  often  measure  par¬ 
ties’  general  ideological  orientations  or  outcome 
preferences  (e.g.,  more  growth  vs.  more  equality) 
rather  than  their  final  policy  preferences  (e.g.,  a 
high  or  a  low  business  tax  rate).  If  this  is  the  case, 
there  is  a  significant  gap  between  theory  and  mea¬ 
surement.  If  this  gap  is  ignored,  the  predictions  of 
veto  player  theory  may  be  misunderstood.  For 
example,  if  an  empirical  analysis  measures  veto 
players’  outcome  preferences,  it  is  far  from  clear 
whether  large  distances  between  these  preferences 
lead  to  policy  stability.  It  is  also  possible  that  pol¬ 
icy  preferences  are  much  more  proximate,  possibly 
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due  to  strong  economic  or  technological  con¬ 
straints  faced  by  the  players,  thus  allowing  for 
significant  policy  change. 

The  problems  of  identification  and  preference 
measurement  also  imply  that  veto  player  theory  is 
strongest  when  applied  to  cohesive  and/or  disci¬ 
plined  parties.  While  veto  player  theory  has  all  the 
formal  tools  to  deal  with  situations  of  noncohesive 
parties,  comparative  analyses  typically  lack  the 
detailed  information  on  individual  preferences  that 
would  be  needed  to  apply  these  tools  empirically. 
The  difficulty  of  dealing  with  noncohesive  parties  is 
quite  significant  because  the  likelihood  of  such  par¬ 
ties  is  much  greater  in  presidential  systems.  At  the 
same  time,  presidential  systems  are  also  more  likely 
to  have  significant  institutional  veto  points  in  addi¬ 
tion  to  (the  first  chamber  of)  parliament.  Applica¬ 
tions  of  veto  player  theory,  therefore,  often  focus 
on  parliamentary  systems  and  boil  down  to  the 
assumption  that  cabinet  parties  are  veto  players. 

Despite  these  limitations,  veto  player  theory  is  a 
significant  achievement.  It  helps  unify  our  under¬ 
standing  of  political  systems  and  political  processes. 
And  such  unification  of  knowledge  is  central  to 
scientific  progress. 

Steffen  Ganghof 
University  of  Potsdam 
Potsdam,  Germany 
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Violence 


In  addition  to  finding  food  and  shelter,  the  con¬ 
tainment  of  violence  has  always  been  a  vital  issue 
for  humankind.  Violence  denotes  the  use  of  phys¬ 
ical  force  to  inflict  injury  on  persons  or  cause 
damage  to  property  and  manifests  itself  in  forms 
as  different  as  homicide  or  state  violence  (e.g., 
repression,  coercion).  This  definition  is  indepen¬ 
dent  of  agents,  objects,  or  circumstances  of  vio¬ 
lence.  Political  violence  can  be  distinguished  from 
collective  or  civil  violence  (such  as  social  banditry) 
by  (a)  the  number  of  persons  involved,  (b)  the 
intentions  of  the  actors ,  and  (c)  the  reactions  of 
the  public. 

The  more  discontent  is  blamed  on  the  political 
system,  the  greater  the  potential  for  political  vio¬ 
lence.  Often,  there  is  no  clear  boundary  between 
a  bargaining ,  a  coercion,  and  a  terror  model  in 
the  use  of  violence.  Also,  domestic  (political)  vio¬ 
lence  is  to  be  distinguished  from  interstate  vio¬ 
lence  in  that  at  least  one  participant  is  not  a 
government. 

Typologies  of  violence  flourish  (violence  against 
persons/nonhumans,  direct/indirect,  physical/ 
psychological,  individual/collective,  organized/ 
spontaneous,  criminal/political,  and  noninstitu- 
tionalized/institutionalized  [e.g.,  the  Mafia  or  the 
period  in  Colombian  history  known  as  La 
Violencia,  or  noninstitutionalizedj).  Our  definition 
precludes  the  dichotomy  between  personal  and 
structural  violence,  the  latter  implying  no  direct 
actor  but  rather  “structural  or  indirect”  (Galtung, 
1969,  p.  170)  causes.  Such  a  definition  excludes 
very  little  and  thus  inflates  the  concept  of  violence. 

The  most  important  differentiation  is  between 
legal/illegal  and  legitimate! illegitimate  violence. 
Max  Weber  defines  the  modern  state  by  its  specific 
means,  that  of  physical  violence.  Where  the  author¬ 
ity  of  the  state  is  considered  legitimate,  stable  rule 
will  result.  In  the  opposite  case  of  illegal  and  ille¬ 
gitimate  violence,  social  disorder  prevails,  with  the 
violence  and  instability  envisioned  by  Thomas 
Hobbes.  To  a  lesser  extent,  social  order  also 
breaks  down  where  legal  state  violence  is  not 
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accepted  by  the  population  and  in  the  reverse  sce¬ 
nario  where  violence  is  illegal  but  considered 
highly  legitimate  by  the  people.  These  are  instances 
of  classical,  often  revolutionary  protest. 

The  challenge  to  the  state  monopoly  of  violence 
can  be  combined  with  challenges  to  the  national 
borders,  by  separatist  groups  or  foreign  powers, 
and  to  the  societal  consensus,  for  example,  by  revo¬ 
lutionary  groups.  This  leads  to  a  three-dimensional 
table  classifying  at  least  eight  different  forms  of  vio¬ 
lent  political  conflict. 

Note  that  with  violent  protest  and  demonstra¬ 
tions,  usually,  neither  the  state  monopoly  of  vio¬ 
lence  nor  the  societal  order  is  challenged.  As  noted 
by  Douglas  Hibbs  (1973),  the  response  of  the  state 
authorities,  to  a  large  extent,  determines  whether 
protests  escalate  into  internal  war  (civil  war) — the 
organized,  initially  often  clandestine  form  of  chal¬ 
lenging  the  state  monopoly  of  violence.  Robustly, 
these  two  dimensions  have  reoccurred  in  analyses 
of  diverse  forms  of  violent  political  conflict.  Apart 
from  genocide  and  massive  external  warfare,  inter¬ 
nal  wars  claim  the  largest  numbers  of  victims. 

There  are  many  overlaps  between  these  differ¬ 
ent  forms  of  conflict,  especially  from  a  dynamic 


perspective.  International  terrorism,  for  example, 
would  have  to  be  located  between  the  first  and 
second  box  in  the  first  row  of  Figure  f .  Together 
with  the  challenge  of  international  order,  a  fourth 
fundamental  dimension  arises  here.  Yet  there  are 
analytical  and  empirical  limitations  for  all  16  types 
to  emerge. 

Political  violence  can  be  analyzed  at  four  differ¬ 
ent  levels:  (1)  international  (wars,  militarized  inter¬ 
state  disputes,  and  external  military  interventions), 
(2)  national  (macrolevel),  (3)  subnational  groups 
(mesolevel),  and  (4)  participating  individuals. 
According  to  Ted  Robert  Gurr  (1970),  both  vari¬ 
ants  of  deprivation  (i.e.,  the  discrepancy  between 
expected  value  positions  and  the  capabilities  to 
achieve  them),  absolute  deprivation  and  relative 
deprivation,  fall  short  in  explaining  violent  politi¬ 
cal  conflicts.  There  are  many  more  such  situations 
of  deprived  masses  but  few  manifest  conflict 
events.  The  lack  of  a  unifying  ideology  and  charis¬ 
matic  leadership  that  can  overcome  the  collective- 
action  problem  (viz.,  incurring  relatively  high  costs 
and  little  rewards  at  the  beginning  of  a  protest 
action)  is  one  factor.  Without  these  factors  making 
for  mass  mobilization,  it  is  simply  too  costly  for  an 


State  monopoly  of 
violence 

Challenged  Not  challenged 


Challenged 


Societal  order 


Not  challenged 


State  boundaries 

State  boundaries 

Challenged  Not  challenged 

Challenged  Not  challenged 

International 

revolutionary 

movement 

Internal  war 
antistate 
terrorism 
revolutionary 
situation 

"Universalist" 

liberation 

ideologies 

Ideological 

extremism 

Irredenta 

Military  coup 

Movements  for 
a  supranational 
union 

Protest 

demonstrations 

Figure  I  Politics  and  Violence:  Central  Dimensions  and  Forms  of  Conflict 


2710  Violence 


individual  to  undergo  the  burden  of  violent  con¬ 
flict  in  view  of  its  limited  success. 

A  broader  and  theoretically  more  pertinent 
approach,  resource  mobilization,  stresses  the 
resources  available  to  potential  conflict  agents  (num¬ 
ber  of  participants,  money,  weapons,  organizational 
skills  and  networks  of  mobilization,  availability, 
leadership,  and  unifying  ideology).  Yet  even  such  a 
rational-choice  view  of  resources,  contrary  to  the 
approaches  of  need  deprivation,  cannot  explain 
the  point  in  time  when  conflict  potentials  material¬ 
ize.  This  usually  occurs  in  situations  of  a  widening 
political  opportunity  structure  marked  by  elite 
dissent,  fiscal  crises,  external  military  defeats,  elec¬ 
toral  realignments,  inconsistent  repression,  and 
unforeseen  events.  All  these  factors  add  up  to  mak¬ 
ing  predictions  at  the  macrolevel  difficult  if  not 
impossible.  As  Timur  Kuran  (1995)  points  out, 
only  the  occurrence  of  informational  and  emo¬ 
tional  “cascades”  among  the  protesters  made  the 
dramatic  changes  of  system  collapse  and  transfor¬ 
mation  understandable  in  the  demise  of  commu¬ 
nism  in  Eastern  Europe  in  the  early  1990s. 
Individual  and  group  behavior  is  codetermined  by 
macrofactors  at  the  national  and  international 
level,  and  only  through  group-mobilized  (violent 
or  nonviolent)  actions  and  particular  elite  responses 
can  the  changes  at  the  macrolevel  be  explained. 
There  can  be  no  plausible  explanation  by  relying 
on  macroprocesses  alone. 

At  least  five  major  explanatory  variables  have 
been  found  at  the  macrolevel,  distinguishing  more 
violent  nations/states  from  others.  (External  wars 
are  excluded  here.) 

Economic  Development  and  Growth 

Developed  states  with  at  least  some  social  security 
provisions  display  lower  amounts  of  violent  con¬ 
flicts  (measured  in  terms  of  dead  and  injured 
victims).  It  is  unclear  whether  this  is  a  consistent 
linear  trend  or  whether  a  curvilinear  relationship 
predominates,  with  mid-level  states  reaching  the 
maximum  values.  Here,  incentives  for  protest 
would  be  strongest,  combined  with  higher  demands 
and  still  limited  means  to  satisfy  them.  Highly 
developed  democracies  such  as  the  United  States, 
the  United  Kingdom,  and  many  Mediterranean 
states  frequently  exhibit  strong  patterns  of  protest 
and  demonstrations  but  usually  avoid  their  escala¬ 
tion  into  violent  internal  war.  There  is  also  a  group 


of  rather  quiet  democracies  consisting  of  Scandin¬ 
avia,  Switzerland,  and  the  democratic  outposts  of 
former  British  rule  overseas  (New  Zealand, 
Australia,  and  the  Caribbean  islands).  In  particu¬ 
lar,  semiauthoritarian  states  in  the  Third  World 
and  successor  states  to  the  Soviet  Union  in  the 
Caucasus  and  Central  Asia,  in  sub-Saharan  Africa, 
and  elsewhere  often  exhibit  high  values  on  both 
dimensions  simultaneously — protest  and  internal 
war.  Only  totalitarian  states  with  high  and  consis¬ 
tent  repression  succeed  in  avoiding  both  forms  of 
conflict.  North  Korea  is  the  remaining  prime 
example.  Once  state  repression  is  lowered,  how¬ 
ever,  events  parallel  to  the  disorders  in  Zimbabwe 
and  parts  of  the  former  Soviet  Union  often  occur. 

Surprisingly,  rapid  economic  growth  may  spur 
protests  due  to  the  uneven  distribution  of  benefits 
from  such  dramatic  changes  (e.g.,  Iran  in  the 
1970s).  Corruption  and  rigged  elections  often  are 
further  contributing  factors.  In  situations  of  mani¬ 
fest  economic  crisis,  one  often  finds  less  conflict 
due  to  obstacles  in  overcoming  the  free  riding 
problem — that  is,  incurring  particularly  high  costs 
when  being  among  the  first  protestors  and  with 
little  hope  of  achieving  goals.  Fears  of  becoming 
unemployed  dominate  and  lead  to  individualistic 
solutions  to  such  mischief.  Yet  where  the  system 
capacity  to  deal  with  these  problems  is  massively 
reduced,  conditions  furthering  mobilization  are 
more  likely  to  be  expected. 

Socioeconomic  Inequality 

Contrary  to  much  speculation,  inequality  in  land- 
ownership,  wealth,  or  income  is  not  often  associ¬ 
ated  with  more  violent  conflicts.  Ideological  and 
religious  beliefs  (e.g.,  caste  systems)  may  justify  the 
persistence  of  such  patterns.  Only  where  these 
inequalities  become  the  focus  of  mobilized  social 
groups  does  more  violent  conflict  have  to  be  reck¬ 
oned  with.  Thus,  for  example,  massive  oil  revenues 
are  passed  on  to  the  lower  classes  in  some  Latin 
American  countries,  buying  off  (temporary)  pro¬ 
test  and  conflict.  Albert  Hirschman,  with  his  “tun¬ 
nel  effect”  (i.e.,  waiting  in  a  traffic  queue  further 
back  in  a  tunnel  but  still  moving  ahead  and  thus 
having  hope  for  progress),  predicts  tolerance  for 
inequality  as  long  as  growth  trickles  down.  Again, 
deprivation  must  be  articulated  and  mobilized 
before  such  structural  factors  may  contribute 
directly  to  violent  conflicts. 
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State  Repression 

The  conflict-instigating  effect  of  political  repression 
(political  censorship,  bans  on  opposition  parties  and 
basic  political  rights,  large  internal  security  forces, 
manifest  acts  of  repression,  etc.)  is  countered  by  a 
deterrent  effect  once  the  level  of  repression  has 
passed  a  vital  threshold  where  protestors  become 
concerned  with  their  own  physical  safety.  This 
double  reasoning  makes  it  understandable  why 
state  repression  in  liberal  democracies  often  con¬ 
tributes  to  more  protest,  while  more  repression  in 
highly  authoritarian  states  can  reduce  both  forms  of 
violent  conflict.  Also,  the  turning  point  of  driving 
protest  underground  becomes  understandable — 
namely,  where  a  protestor  starts  fearing  for  his  or 
her  own  life.  Semirepressive  regimes  are  thus  most 
unstable  and  often  the  place  of  the  most  persistent 
forms  of  violent  conflicts,  due  to  injustices  from 
the  past,  fears  for  the  future,  and  inconsistent 
repressive  measures. 

Ethnocultural  Fractionalization 

Ethnocultural  fractionalization  is  the  most  likely 
and  most  persistent  causal  factor  in  predicting  the 
incidence  of  violent  conflict,  even  though  almost 
any  hypothesis  about  the  size  composition  of  the 
underlying  populations  and  violent  conflict  has 
been  tested  and  partially  rejected  due  to  other  fac¬ 
tors  and  contrary  logic.  Thus,  an  equal  division  of 
two  distinct  populations  could  lead  not  only  to 
continual  fighting  (e.g.,  Israel  and  Palestine)  but 
also  to  the  insight  that  neither  population  will  win 
permanently.  A  majority  of  four  to  one  by  one 
group  may  deter  any  challenges  from  a  minor 
group,  which,  in  effect,  often  may  be  economically 
more  successful  as,  for  example,  the  Basques  in 
Spain,  Tamils  in  Sri  Lanka,  or  the  Chinese  in 
Southeast  Asia  show.  In  contrast,  it  may  vigorously 
instigate  such  challenges  due  to  feelings  of  long¬ 
term  discrimination.  At  least  four  structural  factors 
turn  those  multicultural  relations  between  domi¬ 
nant  and  dominated  groups  into  persistent  ones: 

1.  Distinct  cultural  groups  produce  lower 
transaction  costs  for  respective  group  members  in 
everyday-life  behavior.  Also,  the  public  good  of 
security  in  the  end  rests  only  with  one’s  own  kin. 

2.  Such  groups  in  the  past  often  have  been  victims 
of  violent  conflict. 


3.  This  memory  effect  produces  a  lasting  legacy  for 
the  future. 

4.  Moreover,  state  institutions  may  possess  private 
information  withheld  from  subordinate  groups. 
Such  information  may  be  used  at  the  next 
opportunity,  to  the  detriment  of  subordinate 
groups.  Perhaps  only  supranational  supervision, 
as  supplied  in  part  by  the  European  Union  or 
the  United  Nations,  may  serve  as  a 
“permanent”  control  here. 

There  is  also  Karl  Deutsch’s  (1966)  prediction  at 
the  macrolevel  that,  with  an  underlying  heteroge¬ 
neous  population,  the  means  of  modernization  and 
social  mobilization  will  be  used  for  within-group 
strengthening,  thus  causing  larger  between-group 
differences.  Homogeneous  populations,  in  contrast, 
do  not  face  these  challenges.  At  least  one  major 
study  (Hibbs,  1973)  corroborates  this  general  effect 
of  ethnocultural  differentiation  combined  with 
social  mobilization.  Irredenta  populations  created 
by  colonial  border  drawing  or  other  historical 
effects  exacerbate  this  conflict  scenario. 

Finally,  there  are  many  linkages  between  inter¬ 
nal  and  external  conflicts,  such  as  attempts  at 
diverting  internal  tensions,  intervention  by  exter¬ 
nal  agents,  or  the  spillover  of  defeat  in  war. 

Territorial  characteristics,  shortages  of  natural 
resources,  and  disputed  boundaries  have  been  iden¬ 
tified  as  causes  of  internal  wars  and  guerilla  war¬ 
fare;  often  combined  with  terrorism,  they  have 
become  a  renewed  and  intensified  focus  of  a  group 
of  researchers,  largely  at  the  Institute  of  Peace 
Research  (Oslo)  and  the  University  of  Zurich.  Also, 
there  are  specific  accounts,  such  as  that  by  Stathis 
Kalyvas  (2006),  of  why  and  how  some  peaceful 
multicultural  settings  break  down,  while  others  per¬ 
sist.  Dynamic  analyses  at  the  communal  level  thus 
strengthen  insights  in  the  understanding  of  violence. 

Ekkart  Zimmermann 
Dresden  University  of  Technology 
Dresden,  Germany 
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Voting  rules  determine  how  voters  cast  their  ballots 
and  how  vote  shares  are  converted  to  seats  or  man¬ 
dates  in  the  elected  assembly  or  office.  Voting  rules 
are  thus  among  the  principal  building  blocks  of 
representative  democracy.  They  establish  the  most 
fundamental  theoretical  and  empirical  links  between 
public  preferences,  political  representation,  and, 
ideally,  government  formation  and  policy  making. 
Thus,  the  political  consequences  of  alternative 
voting  rules  have  always  concerned  political  scien¬ 
tists.  Both  majoritarian  and  proportional  voting 
rules  exert  profound  effects  on  alleged  political 
efficiency  (i.e.,  the  format  and  the  fragmentation 


and  polarization  of  political  party  systems  and  their 
propensity  for  sustaining  single-party  or  coalition 
governments)  and  for  different  modes  of  formal 
and  substantive  political  representation  (i.e.,  the 
distortion  of  vote  shares  and  seat  shares  and  the 
representation  of  the  median  voter  in  parliament,  in 
government,  or  actually  in  enacted  policies). 

The  study  of  voting  rules  and  their  respective 
consequences  belongs  to  the  very  core  of  compara¬ 
tive  political  research.  As  illustrated  by  William  H. 
Riker,  research  on  voting  rules  and  electoral 
behavior  has  been  organized  as  a  cumulative  sci¬ 
ence.  In  theoretical  and  conceptual  terms,  the 
arguments  have  advanced  from  the  early,  inductive 
reasoning  by  John  Stuart  Mill,  Henry  Droop,  or 
Thomas  Hare  toward  the  codification  of  Duverger’s 
law  and  hypothesis,  thereafter  toward  their  refine¬ 
ment  by  scholars  such  as  Giovanni  Sartori  or  Rein 
Taagepera,  and,  more  recently,  toward  the  sophis¬ 
ticated,  game-theoretical  treatment  by  Gary  W. 
Cox.  Further  research  extended  the  scope  of  the 
analyses  so  as  to  also  consider  the  additive  effect  of 
social  structure  or  direct  presidential  elections  and, 
most  important,  the  interactive  effects  of  institu¬ 
tional  structure  and  political  cleavages. 

Voting  Rules  and  Electoral  Fragmentation 
Inductive  Generalizations  at  the  National  Level 

Supporters  of  majoritarian  democracy  focus  on 
the  efficiency  dimension  and  thus  on  the  effects  of 
voting  rules  on  the  number  of  viable  electoral  and / 
or  parliamentary  parties.  The  fragmentation  of 
national  party  systems  is  their  most  important  and 
politically  most  consequential  feature,  which  is 
directly  related  to  the  impact  of  the  respective  vot¬ 
ing  rules.  Riker  took  what  is  now  canonized  as 
Duverger’s  sociological  laws  as  a  yardstick  to  assess 
the  truly  scientific  character  of  the  discipline — the 
cumulative  process  of  generating  knowledge  by  a 
series  of  constant  revisions  and  improvements. 
Riker’s  theoretical  point  of  departure  was  Maurice 
Duverger’s  analysis  and  lawlike  codification  of  the 
effects  of  alternative  voting  rules,  which  he  dis¬ 
criminated  into  Duverger’s  law  and  Duverger’s 
hypothesis:  (a)  the  law  states  that  plurality  rules 
tend  to  systematically  favor  two-party  systems  and 
(b)  the  hypothesis  suggests  that  majority  systems 
and  proportional  representation  (PR)  tend  to  sys¬ 
tematically  produce  multipartyisms. 
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These  processes  are  fueled  by  two  interrelated 
causal  mechanisms:  (1)  The  mechanical  effects  of 
voting  rules  operate  in  a  deterministic  way  and 
refer  to  the  formal  conversion  of  exogenous  given 
vote  shares  to  seat  shares.  (2)  The  psychological 
effects,  in  contrast,  work  in  a  probabilistic  fashion 
and  evolve  from  the  anticipation  of  these  mechan¬ 
ical  effects  by  (a)  voters  who  want  to  avoid  wast¬ 
ing  their  vote  ( strategic  voting )  and  (b)  party  elites 
who  want  to  avoid  fielding  and  investing  scarce 
resources  for  candidates  or  lists  that  are  in  fact  out 
of  the  running  ( strategic  entry). 

Majoritarian  Voting  Rules 

Majoritarian  voting  rules  require  successful 
candidates  or  lists  to  win  a — however  defined — 
majority  of  the  votes.  This  broad  category  com¬ 
prises  a  number  of  heterogeneous  subtypes  that  are 
associated  with  different  political  consequences. 

Plurality  or  “first-past-the-post”  systems  apply 
the  relative  majority  rule.  In  an  electoral  district 
that  consists  of  n  seats,  the  n  strongest  competi¬ 
tors  are  awarded  a  mandate.  In  empirical  terms, 
plurality  rules  are  most  frequently  implemented  in 
single-member  districts.  However,  this  combina¬ 
tion  does  not  necessarily  belong  to  the  core  con¬ 
cept  of  plurality  but  merely  flags  an  empirical 
regularity. 

In  contrast,  majority  rules  build  on  an  alterna¬ 
tive  concept  and  definition  of  majority.  These 
voting  systems  regularly  require  an  absolute 
majority  for  the  assignment  of  a  seat  in  the 
assembly  or  office — that  is,  candidates  or  elec¬ 
toral  lists  need  to  obtain  at  least  50%  plus  one 
vote  of  the  overall  votes  cast.  In  multiparty  sys¬ 
tems,  this  threshold  often  cannot  be  cleared  by 
any  competitor  so  that  vacant  seats  need  to  be 
assigned  by  either  a  runoff  of  the  (two)  strongest 
parties  (runoff  systems)  or  by  a  second  ballot  that 
builds  on  some  kind  of  plurality  rule  (majority 
plurality  systems).  As  before,  majority  rules  also 
tend  to  be  implemented  in  single-member  dis¬ 
tricts,  but  this  is  not  included  in  the  definition  of 
majority  voting  rules. 

Although  every  voter  still  casts  an  exclusive,  indi¬ 
vidual  vote,  the  single  nontransferable  vote  (SNTV) 
implies  the  application  of  plurality  voting  rules  in 
multimember  districts.  The  available  seats  are  filled 
by  the  n  strongest  candidates  or  electoral  lists. 


Empirical  assessments  of  voting  rules  did 
not  sustain  Duverger’s  law  as  a  deterministic 
national-level  generalization.  Systematic  analy¬ 
ses,  for  instance  by  Douglas  W.  Rae,  Arend 
Lijphart,  or  Gary  W.  Cox,  have  illustrated  that 
majoritarian  voting  rules  grosso  modo  (roughly) 
tend  to  allow  for  lower  levels  of  electoral  and 
parliamentary  fragmentation  than  proportional 
voting  rules.  However,  plurality  in  single¬ 
member  districts  does  not  unambiguously  sus¬ 
tain  two-party  systems  anywhere  outside  the 
United  States.  Instead,  the  empirical  effects  of 
classical  “first-past-the-post”  systems  happen  to 
depend  on  a  variety  of  specific  technical  and 
contextual  features,  for  instance,  district  bound¬ 
aries  that  may  be  subject  to  the  practice  of  ger¬ 
rymandering,  the  structure  of  political  competi¬ 
tion,  and  alternative  degrees  of  party  system 
nationalization. 

Proportional  Voting  Rules 

Proportional  voting  rules  require  successful  can¬ 
didates  or  lists  to  win  the  support  of  a  certain 
share  of  the  votes  that  is  either  defined  by  the  elec¬ 
toral  quota  or  determined  by  the  application  of 
some  highest-averages  method.  While  majoritarian 
rules  can  in  principle  be  combined  with  any  pos¬ 
sible  district  magnitude,  PR  logically  only  works  in 
multimember  electoral  districts. 

Quota  systems  define  an  electoral  quota,  which 
is  calculated  by  the  numbers  of  votes  and  available 
seats  in  an  electoral  district,  as  the  price  of  a  seat. 
Each  party  is  awarded  as  many  seats  as  it  has  full 
quotas,  and  eventually,  the  remaining  seats  are 
often  distributed  by  some  largest-remainder  crite¬ 
rion.  Common  quota  variants  are,  for  example, 
the  Hare  quota,  which  is  based  on  the  number  of 
votes  divided  by  the  number  of  seats;  the  Droop 
(or  Hagenbach-Bischoff)  quota,  which  divides  the 
number  of  votes  by  the  number  of  seats  plus  one; 
or  the  Imperiali  quota,  which  divides  the  number 
of  votes  by  the  number  of  seats  plus  two. 

Highest-average  rules  divide  the  votes  cast  for  a 
respective  party  list  by  a  series  of  divisors.  The 
d’Hondt  system  applies  the  divisor  sequence  {1,  2, 
3,  .  .  .  ,  n},  the  Sainte-Lague  method  implies  divi¬ 
sion  by  the  series  of  odd  numbers  {1,  3,  5,  ... ,  n), 
and  the  Imperiali  quota  builds  on  the  divisor 
sequence  {1,  1.5,  2,  2.5,  .  .  . ,  nj.  In  the  second 
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step,  the  n  seats  at  stake  are  awarded  sequentially 
by  the  size  of  the  respective  averages. 

The  single  transferable  vote  (STV)  system  realizes 
candidate-based  PR  and  aims  at  limiting  the  poten¬ 
tial  for  wasted  votes  by  a  system  of  preferential  vot¬ 
ing.  STV  initially  assigns  seats  according  to  the 
stated  first  preferences.  After  candidates  have  been 
either  elected  or  eliminated,  surplus  votes  or  wasted 
votes  are  transferred  to  the  remaining  candidates 
based  on  lower-ranking  preferences  of  the  voter. 

Empirical  analyses  revealed  ambiguous  evidence 
for  Duverger’s  hypothesis.  Grosso  modo,  propor¬ 
tional  voting  rules  do  allow  for  higher  numbers  of 
viable  parties  or  electoral  lists  than  majoritarian 
ones.  However,  depending  on  societal  polarization 
and  political  cleavage  structures,  proportional 
voting  rules  are  also  compatible  with  highly  con¬ 
centrated  two-party  systems. 

Within  the  scope  of  proportional  voting  rules, 
there  have  often  been  intense  debates  about  the 
mechanical  consequences  of  alternative  propor¬ 
tional  formulae.  However,  their  empirical  conse¬ 
quences  regularly  do  not  differ  much  and  have 
often  been  profoundly  overestimated.  Generally, 
the  highest-average  formulae,  especially  d’Hondt, 
are  expected  to  benefit  larger  parties,  while  quota 
systems  are  supposed  to  be  fairer  for  small  parties. 
Simulation  studies  have  illustrated  that  the  effects 
of  either  group  of  voting  rules  overlap  and,  unless 
they  are  reinforced  by  a  multitude  of  parallel  elec¬ 
toral  districts,  do  not  have  significant  electoral 
consequences.  When  the  mechanical  consequences 
of  alternative  proportional  allocation  methods 
tend  to  be  limited,  alternative  proportional  voting 
rules  do  not  bring  about  the  need  for  strategic 
behavior  by  voters  or  by  party  elites.  Note  that  in 
empirical  electoral  systems,  pure  proportionality  is 
normally  restricted  by  additional  regulations — for 
example,  low  district  magnitudes  as  mathematical 
and  predefined  legal  thresholds  as  formal  entry 
barriers.  Typically,  the  mechanical  and  psycho¬ 
logical  effects  of  average  district  magnitudes  and 
legal  thresholds  clearly  exceed  the  consequences  of 
alternative  PR  rules. 

Further  Refinements 

Giovanni  Sartori  has  contributed  additional 
refinements  and  reformulations  to  Duverger’s 
sociological  laws.  Instead  of  the  conceptually 


dubious  and  problematic  established  dichotomy 
of  proportional  and  majoritarian  technical  rules 
and  conceptual  ideal  types  (and  some  combina¬ 
tions  of  different  systems  in  between),  Sartori 
arranged  voting  rules  along  a  unipolar  continuum 
from  feeble  to  strong.  Voting  rules  that  are  a  pri¬ 
ori  supposed  to  exert  a  significant  modifying 
impact  on  actual  electoral  behavior  (i.e.,  majori¬ 
tarian  voting  rules)  are  considered  “strong,”  while 
those  that  are  not  expected  to  induce  these  sub¬ 
stantial  psychological  effects  (i.e.,  proportional 
voting  rules)  are  labeled  “feeble.” 

The  second  dimension  of  Sartori’s  typology 
brings  in  contextual  features.  Parallel  to  the  vot¬ 
ing  rule  dimension,  Sartori  contrasts  strong  (i.e., 
structured  and  nationalized)  and  feeble  (i.e., 
unstructured  and  heterogeneous)  party  systems. 
In  the  resulting  2X2  matrix,  Sartori  expects  a 
reduction  in  the  number  of  viable  political  par¬ 
ties  and  a  drive  toward  a  two-party  format  where 
strong  (i.e.,  plurality)  voting  rules  meet  strong 
(i.e.,  highly  structured)  party  systems.  However, 
when  strong  voting  rules  are  combined  with  fee¬ 
ble,  unstructured  party  systems,  the  reductive 
effect  is  actually  less  developed  or  limited  to  local 
bipartism  within  isolated  districts.  Any  combina¬ 
tion  of  feeble  voting  rules  with  structured  party 
systems  still  implies  restrained  political  competi¬ 
tion,  but  then,  political  competition  is  structured 
by  political  incentives  that  originate  not  from  the 
institutional  rules  of  the  game  but  solely  through 
party  channelment.  Eventually,  feeble  party  sys¬ 
tems  might  be  combined  with  feeble,  unstruc¬ 
tured  party  system  contexts,  indicating  a  “no 
effect”  situation. 

Subsequent  empirical  analyses  revealed  that 
Sartori’s  observance  of  contextual  features,  above 
all  the  party  system  context  that  fosters  or  prevents 
the  effectiveness  of  voting  rules,  adds  significant 
insight  when  the  scope  of  an  actual  analysis  is 
extended  beyond  the  long-standing  democracies  of 
the  Organisation  for  Economic  Co-operation  and 
Development  (OECD)  world.  Transition  countries 
typically  lack  structured  modes  of  party  competi¬ 
tion  and  thus  regularly  fail  to  meet  the  necessary 
conditions  for  the  effectiveness  of  constraining 
effects  that  originate  from  strong  electoral  systems. 
Problems  with  “strong”  rules  in  these  contexts  can 
be  clearly  illustrated  by  the  recent  electoral  history 
of  postcommunist  polities. 
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Searching  for  a  general  expression  that  ties 
together  Duverger’s  law  and  hypothesis,  Rein 
Taagepera  and  Matthew  Shugart  proposed  an 
inductive  inspection  of  actual  electoral  returns  that 
replaces  the  analysis  of  qualitative  voting  rules  by 
an  exclusive  focus  on  quantitative  district  magni¬ 
tudes.  They  reason  that  their  measure  of  electoral 
fragmentation — the  Effective  Number  of  Electoral 
Parties — is  on  average  linked  to  district  magnitude 
by  a  logarithmic  association: 

EfNlMJ  -  1.25  log  M  +  2.5, 

with  N  denoting  the  Effective  Number  of  Electoral 
Parties  and  M  denoting  the  district  magnitude  at 
the  national  level.  The  equation  allows  for  the  pre¬ 
diction  of  fragmentation  levels  that  will,  as  the 
authors  argue,  usually  fall  within  an  interval  of  ±  1 
unit  from  this  quantitative  generalization  of 
Duverger’s  law. 

At  the  national  level,  any  link  of  the  national 
number  of  parties  and  average  features  of  the 
respective  electoral  system  at  the  national  level  can¬ 
not  be  more  than  a  rough,  inductive  generalization 
that  lacks  the  necessary  causal  argument  or  mecha¬ 
nism.  While  the  substantive  quantity  of  interest,  the 
number  of  electoral  or  parliamentary  parties,  is 
observed  at  the  national  level,  the  conversion  of 
votes  into  seats  and  thus  the  causal  mechanisms 
that  lie  at  the  bottom  of  the  wasted  vote  argument 
and  Duverger’s  psychological  effect  all  operate  at 
the  district  level.  From  a  conceptual  point  of  view, 
there  are  additional  problems  with  Duverger’s  law 
and  subsequent  generalizations.  A  number  of 
recent  empirical  studies  sweep  under  the  carpet  an 
important  distinction  incorporated  in  the  original 
law  and  hypothesis.  Duverger  linked  plurality 
systems  to  bipartism,  while  both  majority  and  pro¬ 
portional  voting  rules  were  supposed  to  allow  for 
different  degrees  of  multipartyism.  As  a  conse¬ 
quence,  any  exclusive  focus  on  the  key  variable 
district  magnitude,  therefore,  no  longer  applies  to 
the  original  argument,  obviously  disregards  its  core 
causal  mechanisms,  and  potentially  introduces  an 
omitted  variable  bias  to  the  empirical  analysis. 

Moreover,  national-level  analyses  often  follow 
Duverger’s  flawed  idea  of  a  multiplying  effect  of 
proportional  voting  rules:  The  more  permissive 
an  electoral  system  (i.e.,  the  higher  the  district 
magnitude),  the  more  parties  will  be  viable  in  the 


electorate.  This  notion  is  established  as  a  quasi- 
deterministic  argument  that  does  not  take  into 
account  intervening  factors  such  as  social  heteroge¬ 
neity  or  collective  action  problems  partisans  face 
when  setting  up  their  platforms.  However,  even  the 
most  effective  PR  system  just  reflects  social  hetero¬ 
geneity  and  political  polarization,  which  are  the 
products  of  social  structure  and  political  cleavages; 
political  institutions  like  voting  rules  do  not  artifi¬ 
cially  produce  these  phenomena.  As  a  consequence, 
electoral  systems  need  to  be  conceptualized  as  con¬ 
stituting  an  upper  bound  to  the  number  of  viable 
candidates  and/or  political  parties  and  not  as  arti¬ 
ficially  producing  an  institutionally  defined  quan¬ 
tity  of  party  system  fragmentation  or  multiplying 
the  number  of  viable  competitors. 

Causal  Mechanisms  at  the  District  Level 

In  a  series  of  contributions,  Cox  has  explicitly 
focused  on  the  district  level  of  electoral  systems, 
strengthened  the  causal  stringency  of  Duverger’s 
original  argument  by  an  elaborate  game-theoretical 
reformulation,  and  attached  conditions  to  its  func¬ 
tionality.  His  argument  concentrates  on  the  role  of 
expectations  in  electoral  competition  and  subse¬ 
quent  strategic  coordination  attempts  by  voters 
and  by  party  elites — that  is,  it  concentrates  on 
Duverger’s  psychological  effect.  Building  on  both 
classic  statements  of  formal  theory  and  the  induc¬ 
tive  tradition  of  data  analysis  in  comparative 
political  science,  he  was  able  to  strengthen  the 
causal  argument,  to  broaden  its  applicability,  and 
attach  conditions  to  its  functionality.  In  essence, 
Cox’s  contribution  deals  with  problems  of  strategic 
cooperation  in  electoral  politics.  Formal  modeling 
suggests,  quite  in  accordance  with  the  original 
propositions  by  Maurice  Duverger,  that  in  each 
single-member  plurality  district  (M  =  1)  there  will 
be  only  two  vote-getting  candidates  in  a  game-the¬ 
oretical  equilibrium.  Next,  Cox  extends  his  argu¬ 
ments  to  multimember  districts  (M  >  1)  with 
majoritarian  SNTVs  and,  eventually,  with  propor¬ 
tional  voting  rules  (d’Hondt).  As  a  direct  general¬ 
ization  of  Duverger’s  law,  he  suggests  that  there 
may  be  no  more  than  M  +  1  viable  candidates  in 
each  district  of  the  magnitude  M.  This  proposition, 
labeled  the  M  +  1  rule,  is  the  central  building  block 
of  Cox’s  contribution  to  the  analysis  of  electoral 
systems:  EfNIMJ  <  M  +  1,  with  N  denoting  the 
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raw  number  of  electoral  parties  and  M  denoting 
the  magnitude  of  a  local  electoral  district. 

The  formal  argument  spelled  out  in  detail  by 
Cox  overcomes  some  of  the  central  flaws  of  induc¬ 
tive  macro-macro  generalizations.  In  the  first  place, 
it  explicitly  focuses  on  the  district  level,  the  wasted 
vote  argument,  and  the  causal  incentives  that  drive 
strategic  voting  and  strategic  entry.  Second,  Cox 
also  develops  an  explicit  generalization  to  double¬ 
ballot  majority  systems  when  he  argues  that  the 
limit  of  M  +  1  viable  candidates  does  not  actually 
relate  to  the  actual  district  magnitude  (M  =  1)  but 
to  the  number  of  first-round  candidates  that 
can  qualify  for  the  usually  decisive  second  round 
(either  M  =  2  for  the  classical  runoff  system  or  even 
M  >  2  for  two-round  majority  plurality  systems  like 
the  French  one).  Third,  the  M  +  1  rule  does  not 
suggest  a  specific  number  of  viable  contenders  that 
would  be  deterministically  produced  by  the  elec¬ 
toral  structure,  but  in  contrast,  it  conceptualizes 
institutional  constraints  as  erecting  an  upper  limit  to 
or  a  carrying  capacity  for  the  number  of  viable  can¬ 
didates  or  lists  in  a  game-theoretical  equilibrium. 

Next,  Cox  spells  out  a  set  of  conditions  that 
need  to  be  fulfilled  so  as  to  induce  a  restriction  to 
a  maximum  of  M  +  1  viable  candidates  or  lists  in 
a  game-theoretical  equilibrium.  In  particular,  these 
contextual  factors  involve  the  presence  of  strictly 
hierarchically  ordered,  transitive  voter  preferences 
(as  opposed  to  voters  who  do  not  distinguish 
between  two  or  more  electoral  options);  the 
absence  of  a  Condorcet  winner,  short-term  instru- 
mentally  rational  calculations  on  the  voters’  side; 
and  full,  common  information  about  the  state  of 
the  electoral  contest  with  the  clear  identifiability  of 
competitive  and  trailing  candidates  or  lists. 

However,  causal  stringency  and  clarity  do  not 
come  without  substantive  costs.  The  argument 
provided  by  Cox  is  presented  in  two  successive 
stages  of  strategic  coordination:  first  within  the 
electoral  districts  and  second  across  these  districts. 
Therefore,  the  M  +  1  rule  does  not  directly  address 
the  politically  meaningful  number  of  parties  at  the 
national  level  but  the  quite  politically,  somewhat 
inconsequential  fragmentation  of  local  electorates 
within  the  respective  electoral  districts.  The  subse¬ 
quent  step  evolves  into  another,  second-stage  coor¬ 
dination  game  when  the  locally  viable  competitors 
decide  to  link  up  or  to  abstain  from  cross-district 
linkages,  to  coordinate  (or  not  to  coordinate) 


across  the  various  districts  of  an  electoral  system 
so  as  to  form  a  nationalized  political  party,  or  to 
remain  a  regionalized  political  force.  Cox  specifies 
national  policy  goals,  the  presidency,  the  premier¬ 
ship,  and  the  existence  of  upper-tier  seats  in  the 
electoral  system  as  potential  factors  pressing 
toward  cross-district  linkages. 

Voting  Rules,  Formal  and 

Substantive  Representation 

The  theoretical  visions  of  democracy  in  which  alter¬ 
native  majoritarian  and  proportional  voting  rules 
are  rooted  also  incorporate  different  standards  for 
the  assessment  and  evaluation  of  political  represen¬ 
tation.  Supporters  of  consensus  or  consociational 
democracy  emphasize  the  alleged  legitimacy  dimen¬ 
sion  and  focus  on  the  inclusiveness  of  electoral  rules 
and  descriptive  representation.  Their  assessment  of 
the  formal  quality  of  representation  builds  on  the 
relation  of  vote  shares  and  seat  shares — that  is,  on 
disproportionality  indicators  that  have  been,  for 
instance,  suggested  by  John  Loosemore,  Victor  J. 
Hanby,  and  Michael  Gallagher.  Theoretical  models 
and  empirical  analyses  have  illustrated  that  propor¬ 
tional,  “feeble”  voting  rules  tend  to  foster  descrip¬ 
tive  representation,  while  majoritarian,  “stronger” 
voting  rules  tend  to  trade  descriptive  representation 
for  more  effective  party  system  concentration, 
probably  easier  government  formation,  and  sup¬ 
posedly  higher  government  stability. 

Supporters  of  the  majoritarian  vision  of  democ¬ 
racy  would  generally  not  accept  formal,  descrip¬ 
tive  representation  as  a  conceptually  adequate  and 
substantively  meaningful  criterion  for  evaluating 
alternative  voting  rules  but  rather  adopt  a  princi¬ 
pal-agent  perspective  on  political  representation. 
Thus,  representation  means  having  one’s  voice 
represented  either  in  the  decision-making  process 
or  in  its  outcome — actual  enacted  policies  and 
their  correspondence  to  the  preferences  of  the  elec¬ 
torate.  In  the  classical  spatial  model  of  politics  as 
devised  by  Harold  Hotelling  and  Anthony  Downs, 
two  parties  or  candidates  that  compete  under  plu¬ 
rality  rule  and  on  a  single  policy  dimension  are 
expected  to  converge  toward  the  median  voter’s 
position  so  as  to  maximize  their  vote  shares  and 
their  probabilities  of  victory  (the  famous  “median 
voter  theorem”).  When  more  than  one  mandate  is 
at  stake  and  some  proportional  voting  rule  is 
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adopted — that  is,  when  a  majority  of  the  votes  is 
no  longer  necessary  but  some  share  of  the  vote 
suffices — these  centripetal  incentives  cease  to 
apply,  and  parties  or  candidates  are  expected  to 
spread  evenly  along  the  policy  space. 

The  theoretical  argument  spelled  out  by  Cox 
and  others  posits  that  majoritarian  voting  rules 
have  a  higher  potential  to  ensure  the  correspon¬ 
dence  of  public  preferences  and  government  poli¬ 
cies.  Strong  centripetal  incentives  generated  by 
“strong”  voting  rules  foster  enacted  policy  repre¬ 
sentation;  the  absence  of  these  centripetal  incen¬ 
tives  under  proportional  voting  rules  in  contrast 
potentially  allows  for  the  fabrication  of  (coalition) 
governments  and  enacted  policies  that  are  more 
distant  from  the  median  voter’s  position.  Vice 
versa,  as  its  proponents  are  ready  to  concede,  the 
political  consequences  of  majoritarian  voting  rules 
depend  on  effective  strategic  coordination  by  voters 
and  by  party  elites,  and  thus  effective  representa¬ 
tion  of  the  median  voter  position  depends  on  a 
variety  of  contextual  parameters.  Subsequent 
empirical  analyses  by  G.  Bingham  Powell  and 
others  have,  however,  cast  serious  doubt  on  the 
capability  of  “strong”  voting  rules  to  produce  more 
adequate  policy  representation  than  weak  ones. 

Voting,  Causal  Complexity, 
and  Intervening  Factors 

Ultimately,  voting  rules — as  any  social  or  political 
institutions — do  not  originate  from  nowhere  and, 
quite  contrary  to  the  notion  of  a  sociological  law, 
institutional  pressures  and  the  virtues  of  “constitu¬ 
tional  engineering”  tend  to  be  only  one  impact 
factor  in  the  formulation  of  voting  rules,  alongside 
and  in  interaction  with  others. 

Empirical  findings  that  indicate  an  association 
of  voting  rules  and  electoral  fragmentation,  voting 
rules  and  inner-party  democracy,  or  voting  rules 
and  the  quality  of  political  representation  do  not 
“prove”  anything  about  causality  and  the  direction 
of  potential  causal  effects.  While  the  institutional 
perspective  suggests  that  voting  rules  profoundly 
affect  these  features  of  party  systems  and  political 
representation,  scholars  such  as  Harry  Eckstein  or 
John  G.  Grumm  have  first  concluded  that  voting 
rules  are  either  inconsequential  or  themselves 
a  product  (rather  than  the  reason)  of  political 
competition  and  party  systems,  but  they  are 


designed  and  negotiated  by  party  politics  to  cater 
to  partisan  interest. 

Theoretical  and  empirical  insights  reveal  that 
voting  rules  have  political  consequences,  for 
instance,  on  the  number  of  viable  political  com¬ 
petitors  or  parties  and  on  the  quality  of  political 
representation.  In  turn,  the  very  idea  that  voting 
rules  might  have  (redistributive)  effects  implies 
that  politicians  will  be  interested  in  manipulating 
these  effects  in  their  favor.  Politicians  would  thus 
not  bother  about  electoral  rules  when  these  were 
of  no  or  little  significance  for  party  competition 
and  their  respective  interests. 

Voting  rules  (or  other  institutional  features)  are 
only  one  factor  among  many  others  that  affect 
party  system  fragmentation  and  the  quality  of 
political  representation.  Empirical  analyses,  for 
example,  by  Peter  Ordeshook,  Octavio  Amorim 
Neto,  and  Gary  W.  Cox,  have  demonstrated  addi¬ 
tional  institutional  effects  such  as  direct  presiden¬ 
tial  elections  and  the  electoral  cycle.  Moreover, 
political  cleavage  structures  and  the  number  of 
societal  groups  are  consequential  for  party  system 
fragmentation.  In-depth  analyses  have  shown  that 
these  macrosociological  variables  often  do  not 
directly  affect  the  fragmentation  and  political  rep¬ 
resentation  but  interact  with  electoral  structure. 
Taking  on  the  above  notion  of  the  carrying  capac¬ 
ity  of  electoral  systems,  this  means  that  electoral 
fragmentation  does  not  originate  when  there  are 
either  many  polarized  sociological  groups  or  a  per¬ 
missive  electoral  system  (additive  causation)  but 
occur  only  when  social  and  institutional  factors 
operate  conjointly  (interactive  effects). 

These  findings  correspond  exactly  to  the  theo¬ 
retical  insights  incorporated  in  the  M  +  1  rule: 
Voting  rules  do  not  artificially  “produce”  a  spe¬ 
cific  number  of  viable  candidates  or  political  par¬ 
ties,  but  they  only,  in  combination  with  additional 
features  of  the  electoral  system,  establish  and 
define  the  certain  carrying  capacity  and  thus  an 
upper  bound  to  the  number  of  (locally)  viable  can¬ 
didates  or  electoral  lists.  However,  whether  the 
empirical  number  of  parties  approaches  these 
respective  upper  bounds  is  determined  by  socio¬ 
logical  and  organizational  aspects. 

Guido  Tiemann 
Institute  for  Advanced  Studies,  Vienna 

Vienna,  Austria 
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Legislative  voting  rules  are  the  rules  that  govern 
the  lawmaking  process,  as  opposed  to  the  voting 
rules  that  govern  the  election  of  representatives  to 
lawmaking  bodies.  There  are  two  kinds  of  rules 
that  govern  the  lawmaking  process.  First,  there 
are  rules  that  define  and  regulate  the  lawmaking 
bodies  and  the  relationships  between  them  (“con¬ 
stitutional  rules”).  These  rules  determine  the 
number  of  legislative  chambers,  their  relative 
powers,  and  their  relationship  to  the  executive, 
among  other  things.  Second,  there  are  rules  that 
govern  the  internal  working  of  the  legislative  bod¬ 
ies  (“parliamentary  rules”).  These  rules  cover 
matters  such  as  control  of  the  legislative  agenda 
and  the  committee  system  and  are  usually  made 


within  the  legislature.  In  this  entry,  we  will  discuss 
the  theory  of  voting,  constitutional  rules  and  their 
effects,  and  finally  parliamentary  rules. 

The  study  of  the  rules  of  lawmaking  can  be 
traced  back  at  least  as  far  as  Aristotle.  Although 
there  are  exceptions,  traditional  approaches  to  leg¬ 
islative  rules  have  tended  to  be  descriptive  and 
formal,  drawing  heavily  on  constitutional  law.  As 
political  science  took  a  more  behavioral  turn  from 
the  1950s,  many  scholars  put  less  emphasis  on 
formal  rules,  as  these  rules  did  not  seem  to  have 
much  effect  on  how  political  agents  actually 
behaved.  However,  approaches  based  on  social 
choice  and  rational  choice  theory  (the  “new  insti¬ 
tutionalism”)  did  give  a  central  place  to  formal 
rules  and  provided  a  link  between  these  rules  and 
behavior.  Social  choice  theorists  characterized  the 
mechanical  properties  of  voting  rules  and  found 
these  often  produced  counterintuitive  results. 
Rational  choice  theorists  considered  how  maximiz¬ 
ing  agents  would  act  within  a  framework  of  formal 
rules.  The  key  insight  was  that  small  changes  in 
seemingly  unimportant  rules  could  have  very  large 
effects  on  predicted  behavior  in  equilibrium. 

There  has  also  been  a  growth  in  comparative 
empirical  work  on  political  institutions.  Recent 
work  has  considered  the  effect  of  differences  in 
constitutional  rules  (and  to  a  lesser  extent,  parlia¬ 
mentary  rules)  on  policy  outcomes  such  as  redistri¬ 
bution,  state  size,  and  growth.  These  various 
approaches  to  legislative  voting  rules  are  likely  to 
be  complementary.  For  example,  rational  choice 
approaches  emphasize  the  importance  of  small, 
seemingly  insignificant  rules;  identifying  these 
rules  in  specific  legislatures  requires  that  someone 
do  “thick”  descriptive  work. 

The  Theory  of  Voting 

Before  considering  actual  voting  rules,  we  will 
briefly  consider  the  theoretical  analysis  of  voting 
rules.  This  gives  a  powerful  illustration  of  the  effect 
that  voting  rules  can  have.  The  voting  rules  in  vir¬ 
tually  every  legislature  in  the  world  use  some  vari¬ 
ant  of  a  binary  amendment  procedure.  Here,  a 
proposal  is  made,  and  amendments  to  this  pro¬ 
posal  are  considered  one  at  a  time.  The  final  bill, 
with  amendments,  is  then  voted  on.  This  procedure 
is  binary  because  only  two  alternatives  are  voted 
on  at  any  one  time.  The  most  common  procedure 
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for  deciding  between  two  alternatives  is  majority 
rule,  although  supermajority  rules  (a  proposal  fails 
unless  it  receives,  say,  60%  of  the  votes)  are  used 
in  some  instances.  Majority  rule  has  a  special  place 
in  democratic  theory  in  that  it  is  the  only  binary 
rule  that  treats  all  voters  and  proposals  equally. 
The  pervasiveness  of  binary  amendment  proce¬ 
dures  may  be  due  to  their  simplicity;  it  may  also  be 
due  to  the  fact  that  procedures  that  consider  more 
than  two  alternatives  at  a  time  can  be  manipulated 
by  adding  extra  (possibly  irrelevant)  proposals. 

While  binary  amendment  procedures  eliminate 
one  kind  of  manipulation,  they  are  vulnerable  to 
another.  Simplifying  greatly,  Arrow’s  theorem 
shows  that  majority  rule  or  any  other  nondictato- 
rial  binary  procedure  is  vulnerable  to  cycling  (or, 
strictly  speaking,  intransitivity).  That  is,  it  is  pos¬ 
sible  for  the  voters  to  choose  A  over  B  in  a  head- 
to-head  contest,  to  choose  B  over  C,  and  to  choose 
C  over  A.  Cycling  creates  opportunities  for  agenda 
manipulation.  Suppose  we  first  have  a  vote  between 
A  and  B  and  then  put  the  winner  up  against  C.  A 
beats  B  in  the  first  round  but  is  then  beaten  by  C, 
so  C  is  chosen.  However,  suppose  we  put  B  against 
C  first,  B  beats  C  but  loses  to  A  in  the  final,  so  A 
is  chosen.  If  we  put  A  against  C  in  the  first  round, 
B  is  the  eventual  choice.  The  outcome  thus  depends 
completely  on  the  order  in  which  we  consider  the 
alternatives.  It  should  not  surprise  us  that  those 
who  get  to  decide  the  order  in  which  alternatives 
are  considered,  such  as  the  Rules  Committee  in  the 
U.S.  House  of  Representatives,  have  considerable 
power. 

How  much  influence  the  agenda  has  over  out¬ 
comes  will  depend  on  how  often  voter  preferences 
produce  cycles.  Later  work  in  social  choice  sug¬ 
gested  that  if  more  than  one  issue  is  considered  at 
a  time,  cycles  are  almost  inevitable  and  an  agenda 
can  be  found  to  produce  absolutely  any  outcome. 
This  led  some  to  the  conclusion  that  political  insti¬ 
tutions  and  elite  agenda  manipulation,  not  voter 
preferences,  controlled  outcomes.  William  Riker’s 
Liberalism  Against  Populism  was  perhaps  the 
strongest  statement  of  this  position.  More  recent 
social  choice  results  lead  to  more  moderate  conclu¬ 
sions.  If  legislators  are  just  as  smart  as  the  agenda 
setter,  they  can  resist  agenda  manipulation  by  vot¬ 
ing  strategically.  Nicholas  Miller  showed  that  stra¬ 
tegic  voting  with  a  fixed  agenda,  an  open  agenda, 
or  a  competitive  agenda  would  produce  outcomes 


in  the  “uncovered  set,”  which  is  typically  a  rela¬ 
tively  small,  centrally  located  set  of  alternatives. 
Thus,  agenda  setting  would  influence  the  outcome, 
but  this  would  be  quite  tightly  constrained  by  the 
preferences  of  the  voters.  Under  this  scenario,  vot¬ 
ing  rules  matter,  but  we  are  not  driven  to  the 
extreme  conclusion  that  they  are  the  only  things 
that  matter. 

Constitutional  Rules 

We  have  made  a  distinction  between  constitutional 
rules  (which  define  the  lawmaking  bodies  and  their 
relations  with  each  other)  and  parliamentary  rules 
(which  govern  the  internal  workings  of  legisla¬ 
tures).  Constitutional  rules  are  often  codified  in  a 
written  constitution,  which  requires  either  the 
agreement  of  multiple  institutions  or  a  referendum 
to  amend.  Parliamentary  rules  can  usually  be 
changed  by  the  legislature  itself.  There  are  some 
exceptions  where  this  distinction  is  less  clear-cut. 
For  example,  there  are  countries  where  the  consti¬ 
tution  can  be  changed  by  statute  law  (e.g.,  the 
United  Kingdom,  Israel). 

One  important  constitutional  feature  that  is  not 
dealt  with  here  is  the  choice  of  electoral  system. 
Clearly,  the  effect  of  the  legislative  voting  rules 
may  vary  depending  on  how  the  legislature  is  con¬ 
stituted.  In  particular,  whether  the  electoral  system 
produces  a  single-party  government  (as  is  often  the 
case  with  plurality  elections)  or  multiparty  coali¬ 
tion  governments  (usually  the  case  with  propor¬ 
tional  representation)  is  important.  In  some  cases, 
electoral  laws  are  defined  or  restricted  by  the  con¬ 
stitution,  but  in  others,  they  are  simply  defined  by 
ordinary  legislation.  They  may  even  be  made  (as  in 
the  United  States)  by  subnational  legislatures. 

Constitutional  rules  differ  over  a  variety  of 
dimensions.  However,  they  can  be  considered  in 
terms  of  whether  they  concentrate  legislative 
authority  in  one  body  or  disperse  it  among  several 
bodies.  Rules  that  require  the  concurrence  of  sev¬ 
eral  bodies  to  pass  legislation  have  an  effect  similar 
to  having  a  supermajoritarian  rule  in  a  single  legis¬ 
lative  body.  Having  more  veto  points  means  that 
more  than  a  simple  majority  is  required.  For  exam¬ 
ple,  it  is  harder  to  pass  legislation  through  two 
legislative  chambers  than  through  one,  even  if  both 
chambers  use  simple  majority  rule  voting.  This  is 
particularly  the  case  when  the  two  chambers  are 
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elected  in  different  ways.  Constitutional  features 
that  may  have  an  effect  similar  to  a  supermajori- 
tarian  voting  rule  include  the  following. 

Bicameralism.  Having  two  legislative  chambers 
instead  of  one  creates  an  extra  hurdle  for  proposed 
legislation  to  clear.  How  significant  this  hurdle  is 
will  depend  on  whether  the  two  chambers  have 
equal  powers  or  whether  the  second  chamber  has 
limited  powers  to  amend/propose  legislation  or  can 
be  bypassed  by  the  first  chamber.  It  will  also 
depend  on  whether  the  two  chambers  are  elected 
the  same  way  and,  thus,  have  very  similar  policy 
preferences.  The  term  strong  bicameralism  is  some¬ 
times  applied  to  systems  with  equally  powerful 
chambers  that  are  elected  in  different  ways,  while 
weak  bicameralism  is  applied  to  bicameral  legisla¬ 
tures  without  these  characteristics. 

Presidentialism.  If  there  is  a  presidential  veto,  then 
the  president  is  essentially  an  extra  chamber  of  the 
legislature.  Thus,  a  system  like  that  of  the  United 
States  is  essentially  tricameral,  although  the  presi¬ 
dential  veto  can  be  overridden  by  a  supermajoritar- 
ian  vote  of  the  other  chambers.  Even  without  a 
legislative  veto,  presidents  can  effectively  change  or 
block  legislation  by  the  way  they  implement  it, 
while  in  a  parliamentary  system,  the  executive  is 
directly  answerable  to  the  legislature. 

Federalism.  Under  a  federal  system,  implementing 
a  policy  may  require  the  cooperation  of  both  cen¬ 
tral  and  subnational  governments.  Federal  systems 
also  often  have  strong  bicameralism  in  the  national 
legislature,  with  the  second  chamber  representing 
subnational  units. 

Judicial  Review.  Constitutional  courts  may  pro¬ 
vide  another  veto  point.  The  legislature  may  over¬ 
come  this  by  changing  the  constitution,  but  this 
often  requires  a  supermajoritarian  procedure. 

Referenda.  These  allow  the  entire  population  to 
be  treated  as  a  chamber  of  the  legislature. 
Constitutions  vary  as  to  whether  referenda  can  be 
demanded  by  the  voters  or  whether  they  can  only 
be  called  by  the  government;  and  they  also  vary  as 
to  whether  they  can  propose  new  legislation  or 
only  invalidate  legislation  passed  by  the  legisla¬ 
ture.  Referenda  can  be  used  to  break  a  legislative 


gridlock,  but  they  can  also  be  used  obstructively 
to  dismantle  a  multi-issue  legislative  program  one 
issue  at  a  time. 

The  normative  justification  given  for  constitu¬ 
tional  rules  that  disperse  legislative  power  (and  thus 
work  in  a  supermajoritarian  manner)  is  that  this 
prevents  the  abuse  of  power  by  the  government  and 
protects  minorities  from  majority  tyranny.  Critics 
of  this  position  argue  that  by  making  government 
action  difficult,  such  rules  effectively  entrench  exist¬ 
ing  injustices  and  prevent  the  remedy  of  new  ones, 
thereby  harming  vulnerable  minorities.  Regardless 
of  one’s  normative  evaluation,  we  should  expect 
constitutional  rules  that  are  effectively  supermajori¬ 
tarian  to  lead  to  slower  change  in  government  pol¬ 
icy.  Empirically,  countries  with  such  rules  seem  to 
have  less  redistributive  government  policy  and  have 
built  welfare  states  more  slowly.  This  should  not  be 
surprising,  as  the  constitutional  rules  provide  more 
opportunities  for  groups  that  lose  from  redistribu¬ 
tion  to  block  the  process. 

Parliamentary  Rules 

There  are  a  variety  of  parliamentary  rules  (rules 
that  regulate  the  internal  workings  of  legislatures) 
that  will  significantly  affect  outcomes.  As  was 
argued  in  the  first  section,  the  theory  of  voting  sug¬ 
gests  that  the  order  in  which  proposals  are  consid¬ 
ered  can  change  the  eventual  choice.  The  way 
proposals  are  bundled  together  can  have  a  similar 
effect.  Suppose,  for  example,  that  one  proposal  has 
the  enthusiastic  support  of  40%  of  the  legislature, 
while  another  proposal  has  the  support  of  a  differ¬ 
ent  40%.  If  the  two  proposals  are  bundled  into  a 
single  bill,  it  may  well  pass  with  a  majority,  even 
though  each  proposal  would  have  failed  if  consid¬ 
ered  separately.  This  process  is  known  as  logroll¬ 
ing.  In  some  legislatures,  it  is  carried  out  through 
decentralized  deals  between  legislators,  while  in 
others,  there  is  centralized  negotiation  between 
party  leaders  (the  coalition  agreements  that  follow 
elections  in  some  European  countries  can  be 
regarded  as  large,  centrally  negotiated  logrolls).  In 
either  case,  we  would  expect  whoever  has  control 
over  the  order  of  business  and  what  bills  get  to  be 
proposed  to  have  considerable  influence.  The  fol¬ 
lowing  are  among  the  institutional  features  that 
affect  this: 
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Control  Over  Proposing  Legislation.  The  legislative 
leadership  (usually  the  government  in  parliamen¬ 
tary  systems)  will  be  in  a  more  powerful  position  if 
it  has  a  monopoly  (or  near  monopoly)  of  the  right 
to  introduce  new  legislation.  It  is  notable  that  some 
legislatures  delegate  the  developing  of  some  legisla¬ 
tion  to  legislative  committees  or  independent  com¬ 
missions  (such  as  the  Royal  Commission  in  some 
Scandinavian  countries)  in  which  both  government 
and  nongovernment  parties  are  represented. 

Control  Over  Amendments.  Once  legislation  is 
introduced,  the  legislative  leadership  may  exercise 
different  levels  of  control  over  how  it  may  be 
amended.  If  there  is  a  closed  rule,  the  legislature  is 
not  allowed  to  amend  the  legislation  and  must 
either  accept  or  reject  what  has  been  proposed. 
This  increases  the  influence  of  whoever  gets  to  pro¬ 
pose  legislation.  However,  under  an  open  rule, 
where  anyone  can  propose  new  amendments,  the 
power  of  the  agenda  setter  is  far  weaker.  An  inter¬ 
mediate  position  is  a  rule  that  allows  only  certain 
kinds  of  amendments.  An  important  feature  of 
some  European  legislatures  is  that  the  government 
has  the  right  to  propose  the  last  amendment.  This 
restores  a  considerable  degree  of  agenda-setting 
power  to  the  government  because  counteramend¬ 
ments  to  this  last  amendment  are  not  possible. 

Control  Over  Legislative  Timetable.  Control  over 
what  is  debated,  when,  and  for  how  long  is  also 
important.  The  legislative  leadership  obviously 
cannot  get  its  program  passed  if  it  cannot  get  it 
debated.  An  extreme  example  of  a  lack  of  leader¬ 
ship  control  in  some  legislatures  is  the  right  to 
unlimited  debate.  This  allows  a  minority  to  filibus¬ 
ter  or  essentially  veto  legislation  by  refusing  to  stop 
debating  it  (although  there  may  be  a  procedure 
allowing  a  supermajority  vote  to  end  debate,  as  in 
the  U.S.  Senate). 

Committee  System.  Detailed  consideration  of  legis¬ 
lation  is  usually  delegated  to  legislative  committees 
before  being  presented  to  the  full  legislature.  The 
legislators  on  these  committees  may  develop  policy 
expertise  and  be  deferred  to  by  other  legislators.  In 
some  legislatures,  committees  may  act  as  veto 
points  in  that  all  legislation  on  a  particular  topic 
has  to  pass  through  them.  The  number  and  impor¬ 
tance  of  committees  varies  considerably  between 


legislatures.  In  some  legislatures,  the  governing 
party  or  coalition  controls  all  committee  chairman¬ 
ships,  while  in  others,  they  are  shared  between  all 
parties. 

Degree  of  Party  Discipline.  In  some  legislatures, 
representatives  are  more  or  less  free  agents,  who 
may  represent  the  interests  of  their  constituents  as 
much  as  their  parties.  In  other  legislatures,  parties 
are  tightly  disciplined,  and  important  decisions  are 
made  by  negotiation  among  the  party  leaders.  If 
party  discipline  is  tight,  parliamentary  rules  may 
serve  to  facilitate  agreements  between  parties  (in 
particular  the  coalition  agreement)  rather  than 
agreements  between  legislators. 

There  are,  however,  two  reasons  to  be  cautious 
about  trying  to  explain  policy  outcomes  in  terms 
of  parliamentary  rules.  The  first  is  the  problem  of 
heritability.  Parliamentary  rules  are  usually  chosen 
by  the  legislature  itself.  Further,  the  legislative 
leadership  that  implements  the  rules  is  also  chosen 
by  the  legislature.  If  the  legislature  does  not  like 
the  outcomes  the  rules  produce,  they  could  pre¬ 
sumably  change  either  the  legislative  leadership  or 
the  rules  themselves.  If  this  is  the  case,  rules  do  not 
so  much  produce  outcomes  but  are  rather  the 
means  by  which  the  ruling  coalition  in  the  legisla¬ 
ture  gets  what  it  wants.  This  would  suggest  that 
the  effect  of  constitutional  rules  may  be  more  pro¬ 
found  than  those  of  parliamentary  rules. 

The  second  problem  is  that  of  functional  equiva¬ 
lence.  It  may  be  possible  for  the  governing  coalition 
to  get  its  way  by  a  variety  of  different  means.  For 
example,  in  some  countries,  the  government  keeps 
very  tight  control  over  the  legislative  agenda.  In 
other  countries,  nongovernment  parties  have  far 
more  ability  to  amend  legislation,  but  the  govern¬ 
ment  has  the  right  of  last  amendment.  However,  in 
both  sets  of  countries,  the  government  seems  to  be 
able  to  get  most  of  the  legislation  that  was  laid  out 
in  its  coalition  agreement  passed.  Similarly,  in  some 
countries,  there  are  very  detailed  committee  hear¬ 
ings  on  legislation,  while  in  other  countries,  these 
appear  to  be  little  more  than  a  formality.  However, 
if  the  legislation  in  those  countries  was  developed 
by  a  Royal  Commission  on  which  the  legislative 
committee  members  served,  this  compliance  is  not 
surprising.  This  in  no  way  diminishes  the  impor¬ 
tance  of  parliamentary  rules  in  understanding  the 
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working  of  legislatures,  but  it  does  make  it  harder 
to  find  statistical  relationships  between  parliamen¬ 
tary  rules  and  outcomes. 

Anthony  McGann 
University  of  Essex 
Colchester,  United  Kingdom 

See  also  Agenda  Setting;  Constitutionalism;  Electoral 
Systems;  Parliaments;  Social  Choice  Theory;  Veto 
Player;  Voting  Rules,  Electoral,  Effects  of 

Further  Readings 

Arrow,  K.  (1963).  Social  choice  and  individual  values. 

New  York:  Wiley.  (Original  work  published  1951) 
Black,  D.  (1971).  The  theory  of  committees  and  elections. 
Cambridge,  UK:  Cambridge  University  Press.  (Original 
work  published  1958) 


Cox,  G.,  &  McCubbins,  M.  (2007).  Legislative  leviathan: 
Party  government  in  the  house  (2nd  ed.).  Cambridge, 
UK:  Cambridge  University  Press. 

Doring,  H.,  &  Hallerberg,  M.  (Eds.).  (2004).  Patterns  of 
parliamentary  behavior:  Passage  of  legislation  across 
Western  Europe.  Burlington,  VT:  Ashgate. 

Heller,  W.  (1999).  Making  policy  stick:  Why  the 
government  gets  what  it  wants  in  multiparty 
parliaments.  American  journal  of  Political  Science, 
45(4),  780-798. 

Lijphart,  A.  (1999).  Patterns  of  democracy:  Government 
forms  and  performance  in  thirty-six  countries.  New 
Haven,  CT:  Yale  University  Press. 

Riker,  W.  (1982).  Liberalism  against  populism: 

A  confrontation  between  the  theory  of  democracy  and 
the  theory  of  social  choice.  San  Francisco:  Freeman. 
Schofield,  N.,  &  Sened,  I.  (2006).  Multiparty  democracy: 
Elections  and  legislative  politics.  Cambridge,  UK: 
Cambridge  University  Press. 


w 


War  and  Peace 


The  21st  century  has  begun  much  as  the  20th  cen¬ 
tury  ended:  in  war  and  armed  conflict.  Conflicts  of 
many  types,  old  and  new,  are  under  way  in  war 
zones  such  as  Afghanistan,  Iraq,  Lebanon,  the 
Democratic  Republic  of  the  Congo,  Sri  Lanka, 
Colombia,  Lebanon,  Chechnya,  and  Sudan.  The 
threat  of  war  remains,  but  the  nature  of  warfare  is 
changing  in  tandem  with  often  unforeseen  geo¬ 
political,  technological,  economic,  and  ethnic 
changes.  According  to  the  studies  performed  since 
2005  by  the  Human  Security  Centre  at  the  University 
of  British  Columbia,  violence  is  in  decline:  In  the 
past  dozen  years,  there  has  been  a  significant 
decrease  in  violence  as  measured  by  the  number  of 
wars,  genocides,  and  human  rights  violations. 

Since  the  end  of  the  Cold  War,  the  number  of 
armed  conflicts  has  fallen  by  about  40%  from 
about  50  in  1991  to  about  30  in  2004.  High- 
intensity  conflicts  (those  that  cause  more  than 
1,000  battle-related  deaths  per  year)  are  down 
80%,  also  down  are  civil  wars  (80%),  genocides 
(80%),  major  and  minor  terrorist  attacks  (50%), 
the  number  of  refugees  (45  % ),  coups  and  attempted 
coups  (60%),  and  international  crises  (70%). 
More  than  100  conflicts  have  ended,  including 
70%  of  secessionist  conflicts,  and  the  average 
number  of  battle  deaths  per  armed  conflict 
decreased  by  98%,  from  38,000  in  1950  to  600  in 
2002.  (Iraq  since  2003  is  an  exception  to  the 
trend.)  There  were  20,000  battle  deaths  in  all  wars 
combined  in  2003,  compared  with  700,000  in 


1950.  In  the  1990s,  the  ratio  of  battle  deaths  to 
population  was  a  third  of  what  it  had  been  in  the 
1970s.  However,  these  positive  developments  were 
mitigated  by  continued  strife  in  some  regions,  par¬ 
ticularly  in  Africa,  where  war  is  claiming  more 
victims  than  on  all  other  continents  combined.  But 
even  in  Africa,  there  are  fewer  armed  conflicts 
today  (about  10  compared  with  15  per  year  5 
years  ago).  The  number  of  African  countries  torn 
by  armed  conflict  has  therefore  dropped  by  a  third. 
During  the  same  period,  direct  battle  deaths  have 
decreased  in  Africa  by  24%. 

At  the  dawn  of  the  third  millennium,  inhabit¬ 
ants  of  the  planet  had  an  average  risk  of  becoming 
a  casualty  of  war  of  approximately  0.4%,  com¬ 
pared  with  an  average  of  about  1%  between  1945 
and  1990.  People  are  much  more  likely  to  die  of 
disease  and  pandemics  (91%)  or  in  car  accidents 
(2%)  than  in  a  war.  However,  there  is  little  cause 
for  celebration,  for  armed  conflicts  that  could, 
theoretically,  have  been  prevented  or  controlled 
continue  to  claim  hundreds  of  thousands  of  direct 
and  indirect  victims  year  after  year.  Of  the  22  mil¬ 
lion  people  who  have  perished  in  armed  conflicts 
since  World  War  II,  5.5  million  died  between  1990 
and  1995.  The  causes  of  conflict  are  far  from  being 
resolved,  and  the  dangers  attendant  on  conflict 
have  by  no  means  been  eliminated,  although  they 
are  perhaps  being  anticipated  and  managed  more 
successfully.  In  short,  it  is  too  early  to  say  that  war 
is  being  eradicated  or  even  curbed.  Human  insecu¬ 
rity  remains  a  challenge  and  a  serious  danger. 
Many  armed  conflicts  continue  to  rage  with  deadly 
consequences  and  wars  that  seemed  to  be  coming 
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to  an  end  have  flared  up  again.  In  the  wake  of  the 
genocides  in  Rwanda  and  Sudan,  the  civil  war  in 
the  Democratic  Republic  of  the  Congo,  and  the 
new  outbreaks  of  violence  in  Ivory  Coast,  Somalia, 
and  Angola,  there  is  no  guarantee  that  we  will  not 
witness  other  unexpected  and  tragic  events  in 
Africa.  Interstate  conflict  also  remains  a  possibil¬ 
ity:  For  example,  hostilities  could  break  out 
between  North  and  South  Korea  should  the  North 
Korean  regime  decide,  despite  the  armistice,  to 
attack  the  South;  between  China  and  Taiwan  over 
the  latter’s  desire  to  secede;  between  India  and 
Pakistan  over  their  long-standing  dispute  over 
Kashmir  and  recently  because  of  terrorism;  between 
Ethiopia  and  Eritrea  over  unresolved  border  dis¬ 
putes  that  caused  70,000  deaths  between  1997  and 
2000;  in  central  and  Western  Africa,  where  inter¬ 
nal  chaos  and  quarrels  between  neighboring  coun¬ 
tries  make  for  a  dangerous  mix;  in  the  Persian 
Gulf,  where  the  Kurdish  question  has  yet  to  be 
settled  in  Iraq  and  between  the  countries  in  the 
region;  and  in  Lebanon,  as  the  Israeli  intervention 
in  the  summer  of  2006  demonstrated. 

These  disputes  are  only  a  few  examples  of  the 
continuing  impact  of  interstate  rivalries  and  domes¬ 
tic  insecurity.  There  can  be  no  doubt  that  the  end 
of  the  Cold  War  has  led  to  greater  peace  between 
states  than  at  any  previous  time  in  history  (hence 
Francis  Fukuyama’s  well-known  “end  of  history” 
thesis  of  1992).  However,  the  real  challenge  now  is 
preventing  or  resolving  intrastate  wars.  In  other 
words,  present-day  conflicts  are  of  a  new  type:  In 
the  past,  states  fought  each  other,  but  now  clans, 
ethnic  groups,  and  factions  are  challenging  the 
state  itself.  Interstate  warfare  is  therefore  being 
replaced  by  ethnopolitical  warfare.  State  actors  are 
losing  their  monopoly  on  violence  as  infrastate 
players  get  in  on  the  act.  Supranational  bodies  are 
intervening  in  matters  that  were  previously  the 
exclusive  preserve  of  states,  in  a  bid  to  manage  and 
control  crises.  It  can  be  argued  that  the  words  war 
and  conflict  have  taken  on  new  meaning. 

The  Transformation  of  Armed  Conflict 

War  between  great  powers  in  the  industrialized 
world  has  lost  its  raison  d’etre  in  view  of  techno¬ 
logical  developments  (nuclear  weapons),  demo¬ 
graphic  changes  (low  birth  rate),  economic  factors 
(trade),  and  political  factors  (liberal  democracy). 


However,  wars  continue  to  ravage  underdeveloped 
and  developing  regions  of  the  world.  The  creation 
of  a  relatively  unipolar  system  dominated  by  the 
United  States  has  not  prevented  armed  conflicts 
from  breaking  out.  The  world  is  no  longer  divided 
into  strategic  “protectorates”  as  it  was  during  the 
Cold  War,  and  the  emerging  multipolar  system  is 
aggravating  geopolitical  rivalries  in  some  regions. 

War  is  becoming  “deinstitutionalized”  and 
reverting  to  what  it  was  before  1648 — hence  the 
suggestion  that  we  are  returning  to  “premodern” 
warfare.  Classic  warfare  is  disappearing  as  war 
becomes  “privatized,”  in  the  sense  that  it  is  being 
waged  increasingly  by  private  armies  rather  than 
conscripts  or  professional  soldiers.  Since  the  begin¬ 
ning  of  the  decade,  there  have  been  growing  num¬ 
bers  of  nonstate  armed  conflicts — that  is,  fighting 
between  different  factions  within  a  state  as  opposed 
to  fighting  between  the  state  and  one  or  more  fac¬ 
tions.  For  example,  in  2003,  there  were  29  armed 
conflicts  involving  states  and  30  nonstate  conflicts. 
However,  both  types  of  conflicts  are  in  decline. 

Forty  percent  of  the  casualties  of  war  are  direct 
victims  (killed  in  battle),  and  60%  are  indirect  vic¬ 
tims  (those  who  die  as  a  result  of  the  consequences 
of  armed  conflict — displacement,  disease,  pandem¬ 
ics,  malnutrition,  and  famine).  Civilians,  who 
make  up  a  large  percentage  of  the  casualties,  are 
10  times  more  likely  to  be  indirect  victims  than 
direct  victims.  It  is  estimated  that  approximately 
one  third  of  the  civilians  who  die  in  armed  con¬ 
flicts  are  killed  by  government  forces  and  two 
thirds  by  intrastate  groups. 

The  indirect  victims  of  armed  conflict  are 
largely  women  and  children.  Many  are  sexually 
assaulted.  Of  the  male  casualties,  90%  are  direct 
victims  of  combat,  but  there  are  also  large  num¬ 
bers  of  men  among  the  indirect  victims.  Men  are 
more  likely  than  women  to  perish  in  mass  killings 
or  to  die  of  pandemics  or  malnutrition. 

More  wars  are  ending  than  there  are  new  wars 
breaking  out.  For  example,  between  1991  and 
2004,  28  armed  struggles  for  secession  began  or 
resumed,  while  43  were  curbed  or  ended.  In  2004, 
there  were  25  such  armed  conflicts  under  way,  the 
smallest  number  since  1976. 

Every  year,  “ethnic  wars”  account  for  the 
majority  of  “major”  armed  conflicts  (12  out  of  19 
in  2005).  However,  the  number  of  such  wars  has 
plunged  since  the  end  of  the  1990s,  as  has  the 
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number  of  civil  wars.  Typically,  ethnic  conflicts 
are  caused  by  the  collapse  of  states,  not  the  reverse. 
They  grow  out  of  severe  crises  of  governance  that 
cause  states  to  disintegrate.  (Half  a  dozen  states 
have  crumbled  over  the  past  20  years,  including 
Afghanistan,  the  USSR,  Yugoslavia,  and  Somalia.) 

Armed  conflicts  are  also  spurred  by  the  profit 
motive:  fierce  competition  for  control  of  resources, 
particularly  minerals,  diamonds,  oil,  and  water. 
Elites,  guerrillas,  mafia,  and  mercenaries  engage  in 
warfare  to  exact  tribute  or  to  lay  their  hands  on  a 
nation’s  wealth.  The  geography  of  a  conflict — its 
fault  lines — is  shaped  more  by  the  distribution  of 
resources  than  by  ideological  or  political  differ¬ 
ences  (as  is  clearly  seen  in  Central  and  Western 
Africa). 

Democratization  is  a  growing  source  of  vio¬ 
lence.  Approximately  one  third  of  states  are  in 
transition.  There  are  more  democracies  today  than 
ever  before  in  the  history  of  the  world,  but  the 
recent  wave  of  democratization  has  left  scars  in  its 
wake  because  of  the  failure  of  many  transitional 
states  to  protect  human  rights  or  establish  a  state 
of  law  and  because  the  democratization  process 
has  given  rise  to  instability  and  bouts  of  political 
violence  (examples  include  the  former  Yugoslavia, 
Colombia,  Afghanistan,  the  Democratic  Republic 
of  the  Congo,  Burundi,  Rwanda,  Liberia,  Sudan, 
the  Palestinian  territories,  and  Iraq). 

The  Phenomenon  of  War 

Interstate  war  is  a  recurring  phenomenon  that 
appears  to  be  part  and  parcel  of  the  Westphalian 
system;  there  were  278  such  wars  between  1648 
and  1940,  or  one  every  2  years.  In  2003,  J.  David 
Singer  calculated  that  there  had  been  412  inter¬ 
state  and  intrastate  wars  between  1816  (when 
there  were  23  states)  and  1997  (when  there  were 
181);  135  of  these  wars  occurred  after  1950.  In  all, 
there  were  179  wars  in  the  19th  century  and  233 
in  the  20th  century.  Of  the  2,340  weeks  between 
1945  and  1990,  only  3  were  entirely  free  of  war. 
According  to  Singer,  the  record  is  not  encouraging: 
The  two  decades  with  the  largest  number  of  wars 
since  1816  have  been  the  1970s  (36  wars)  and  the 
1990s  (31  wars).  In  recent  years,  between  10% 
and  15%  of  states  have  been  at  war.  The  good 
news  is  that  as  a  proportion  of  the  number  of 
states  in  the  world  and  the  world’s  population 


(both  of  which  are  growing),  the  number  of  wars 
is  decreasing.  The  ratio  of  wars  to  states  has  fallen 
steadily  over  the  decades,  from  0.74  in  the  1890s 
(the  highest  in  history)  to  0.26  during  the  1940s 
and  0.17  during  the  1990s  (one  of  the  lowest  in 
history).  However,  the  number  of  battle  deaths  has 
been  stable  since  1950,  at  an  average  of  approxi¬ 
mately  2.6  million  per  decade,  or  one  per  1,000, 
for  a  total  of  approximately  13  million  dead. 
(These  figures  do  not  include  indirect  victims  of 
war.)  Wars  caused  an  estimated  38  million  battle 
deaths  (including  11  million  soldiers)  between  the 
years  1  CE  and  1899  and  more  than  46  million 
deaths  (including  22  million  soldiers)  between 
1900  and  2000.  From  1816  to  1939,  interstate 
wars  caused  28.4  million  battle  deaths  and  civil 
wars  6.8  million.  Between  1940  and  2000,  the 
proportions  were  reversed:  3.3  million  battle  deaths 
in  interstate  wars  and  11.5  million  in  civil  wars. 
World  War  I  claimed  a  total  of  20  million  direct 
casualties,  counting  both  soldiers  and  civilians,  and 
World  War  II  killed  40  million.  Five  wars  accounted 
for  more  than  half  of  all  combat  deaths  during  the 
1946  to  2002  period:  the  civil  war  in  China,  the 
Korean  War,  the  Vietnam  War,  the  Iran-Iraq  war, 
and  the  Afghan  wars.  These  numbers  do  not 
include  government-backed  massacres  of  their  own 
people  (genocide,  politicide,  and  femicide),  which 
are  estimated  to  have  caused  more  than  175  mil¬ 
lion  deaths  between  1900  and  2000,  of  which 
approximately  125  million  were  caused  by  geno¬ 
cides  and  famines  (70  million  in  China  alone), 
24  million  by  light  arms,  17  million  by  artillery, 
and  2  million  by  aerial  or  naval  bombardments. 

Definition  of  War 

We  will  return  later  to  the  statistics  and  trends,  but 
first,  we  must  attempt  to  define  what  war  is.  There 
is  no  consensus  in  the  abundant  literature.  One  of 
the  classics,  A  Study  of  War,  written  by  Quincy 
Wright  (1942)  of  the  University  of  Chicago, 
defines  war  as  “violent  contact  of  distinct  but 
similar  entities”  (chap.  1).  States  may  enter  into 
violent  contact,  but  so  too  may  lions  or  tigers:  This 
definition  is  inadequate  since  it  ignores  the  impor¬ 
tance  of  the  political  dimension  of  war.  In  Chapter 
I  of  his  classic  On  War,  the  traditional  strategist 
Carl  von  Clausewitz  described  war  as  “an  act  of 
violence  intended  to  compel  our  opponent  to  fulfill 
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our  will”  and  then  famously  commented,  “War  is 
a  continuation  of  politics  by  other  means.” 

War  may  be  considered,  then,  to  be  an  armed 
confrontation  between  enemies  with  irreconcilable 
or  incompatible  political  goals,  which  always  has 
the  potential  to  escalate  into  an  unlimited  engage¬ 
ment  aimed  at  achieving  total  victory  and  the 
destruction  of  the  opponent.  It  is  the  ultimate 
instrument  of  policy  where  political  differences  can¬ 
not  be  resolved  except  by  the  use  of  force.  War  is 
also  an  organized  process,  a  dimension  stressed  by 
the  expert  on  war,  Gaston  Bouthoul:  “War  is  armed 
and  bloody  struggle  between  organized  groups.  ” 
From  a  realist  point  of  view,  therefore,  war 
implies  acts  of  violence  conducted  and  organized  by 
political  and  military  actors  with  antagonistic 
motives,  which  may  be  governments  or  infrana¬ 
tional  or  supranational  entities.  Similarly,  in  the 
negative,  we  can  say,  with  the  realists,  that  peace  is 
the  absence  of  organized  violence  between  groups  or 
states.  We  shall  see  that,  in  the  case  of  intrastate  and 
ethnopolitical  wars,  there  is  disagreement  between 
those  who  believe  that  war  is  inherent  in  the  security 
dilemma  ( realist  theorists)  and  those  who  argue,  on 
the  contrary,  that  war  is  constructed  through  the 
manipulation  of  identity  by  political  entrepreneurs 
and  decision  makers  ( critical  theorists ). 

War  can  take  different  forms  in  different  eras: 

International  war  is  war  between  states,  such  as  the 
Gulf  War  of  1991,  North  Atlantic  Treaty 
Organization’s  (NATO)  war  against  Serbia  in 
1999,  the  intervention  in  Afghanistan  in  2001,  and 
the  invasion  of  Iraq  in  2003.  It  may  be  a  regional 
war,  such  as  the  wars  in  the  Middle  East  or 
between  Pakistan  and  India,  or  a  world  war  if  the 
conflict  spreads,  as  in  World  Wars  I  and  II. 

Intrastate  war  has  been  the  most  common  type  of 
warfare  since  the  end  of  the  Cold  War.  The 
examples  of  the  former  USSR,  Yugoslavia,  Rwanda, 
the  Democratic  Republic  of  the  Congo,  Somalia, 
Sierra  Leone,  Sudan,  Colombia,  Afghanistan,  and 
Sri  Lanka  illustrate  the  wide  scope  of  violence 
between  governments,  factions,  ethnic  groups,  and 
other  groups  trying  to  win  or  maintain  control  over 
territory  and  political  power.  Intrastate  war  may 
also  be  associated  with  civil  strife  or  traditional 
ideological  warfare  (revolutionary  wars,  national 
liberation  wars,  guerrilla  wars). 


War  may  be  conducted  by  conventional  means 
(invasion,  bombing,  coercion)  or  unconventional 
means  (terrorism,  insurrection,  using  chemical, 
biological,  or  nuclear  arms).  The  means  used 
determine  the  nature  of  the  war. 

Wars  may  be  short,  long,  or  indeterminate.  They 
range  in  duration  from  the  Six-Day  War  in  1967  to 
decades-long  conflicts,  such  as  the  Vietnam  War, 
which  continued  from  1945  to  1975.  Civil  wars, 
such  as  the  ones  in  Northern  Ireland,  Sri  Lanka, 
and  Kurdistan,  which  long  appeared  interminable 
(or  still  do),  may  come  to  seem  like  a  permanent 
condition. 

War  may  be  total  or  limited.  In  the  first  case,  it 
knows  no  bounds:  The  annihilation  of  the  enemy, 
the  extermination  of  its  population,  and  the  end  of 
its  regime  are  the  objectives.  In  the  second  case,  the 
war  is  conducted  within  a  restricted  framework 
and  is  aimed  primarily  at  preventing  an  escalation 
to  unbridled  violence.  The  restrictions  limit  the 
geographic  scope  of  the  conflict,  the  number  of 
opponents,  the  use  of  specific  means  or  specific 
weapons,  and  the  intensity  of  the  fighting. 

Examples  of  “limited  war”  include  the  U.S. 
military  engagement  in  Korea  and  Vietnam,  and 
the  Soviet  engagement  in  Afghanistan. 

Unconventional  wars  include  psychological 
warfare,  proxy  or  indirect  wars,  “nonwar 
operations,”  and  cold  war  (the  potential  for 
violence  is  ever  present  in  all  these  types  of 
warfare);  new  wars  include  information  wars,  “star 
wars,”  the  war  against  international  terrorism,  the 
war  on  drugs,  and  wars  for  control  over  resources 
(cases  in  which  the  potential  for  the  use  of  force 
and  for  armed  conflict  is  high). 

Definition  of  Armed  Conflict 

Some  research  centers  that  compile  statistics,  such 
as  the  International  Institute  for  Strategic  Studies 
(IISS)  in  London  and  the  Stockholm  International 
Peace  Research  Institute  (SIPRI),  use  the  term 
armed  conflict  rather  than  war.  An  armed  conflict 
may  flare  up  sporadically,  without  necessarily  con¬ 
stituting  a  war  as  defined  above,  but  is  more  than 
a  mere  conflict  in  that  it  is  not  limited  to  a  political 
dispute.  There  are  generally  some  20  “major  armed 
conflicts”  in  any  given  year,  as  defined  by  SIPRI 
(“the  use  of  armed  force  between  the  military 
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forces  of  two  or  more  governments,  or  of  one  gov¬ 
ernment  and  at  least  one  organized  armed  group, 
resulting  in  the  battle-related  deaths  of  at  least 
1,000  people  in  any  single  calendar  year”).  The 
international  environment  is  shaped  to  a  large 
degree  by  these  armed  conflicts.  Most  are  intra¬ 
state,  and  the  majority  occurs  in  Asia  or  Africa.  A 
significant  proportion,  but  less  than  half,  began 
before  1989.  Civilians  make  up  the  bulk  of  the 
victims.  As  a  result,  the  number  of  refugees  and 
displaced  persons  remains  high.  Recently,  the 
Human  Security  Centre  has  begun  compiling 
deaths  resulting  from  armed  violence  between 
nonstate  actors  as  well  as  from  government-led 
violence  against  its  own  people. 

More  broadly,  the  term  conflict  can  be  used  to 
refer  to  a  situation  of  opposed  interests  that  does 
not  necessarily  lead  to  armed  confrontation  (e.g., 
the  40-year  East-West  conflict).  If  it  does  develop 
into  armed  conflict,  it  can  be  the  same  thing  as  a 
war.  Conflict  generally  implies  a  situation  of 
strong  opposition  between  a  state,  ethnic  group, 
clan,  or  other  group  and  another  of  these  entities 
due  to  incompatible  goals,  which  may  be  of  a  ter¬ 
ritorial,  political,  diplomatic,  economic,  military, 
ethnoreligious,  internal,  or  external  nature.  The 
range  of  distinctions  that  can  be  applied  to  con¬ 
flicts,  armed  or  otherwise,  indicates  their  diversity: 

A  conflict  may  be  over  control  of  the  government 
and  the  state  and  therefore  involve  deep  nationalist, 
ideological,  or  ethnic  divisions  (Afghanistan  and 
the  Democratic  Republic  of  the  Congo  are  good 
examples).  Conflicts  of  this  type  can  escalate  into 
violent  confrontation  between  groups  or  factions 
seeking  at  least  partial  control  of  the  state  (e.g.,  the 
rival  guerrilla  groups  in  Colombia). 

Territory  is  a  key  issue  in  many  conflicts.  The 
motives  may  be  ethnic  (as  in  the  former  Yugoslavia, 
Sri  Lanka,  Nagorno-Karabach,  Georgia),  economic 
(Iraq  and  Kuwait,  the  Spratly  Islands),  or  strategic 
(Israel  and  Syria  on  the  Golan  Heights,  India  and 
Pakistan  in  Kashmir,  Morocco,  Western  Sahara). 

Ideology  can  be  an  important  factor  when  it  is 
enmeshed  in  a  long-festering  dispute  (as  in  the 
China-Taiwan  and  Israel-Palestine  conflicts). 

Many  conflicts  fall  into  more  than  one  of  these 
categories.  In  some,  all  the  above  factors  come  into 


play,  intensifying  the  conflict.  Collapsed  states  often 
present  political,  territorial,  and  ideological  chal¬ 
lenges  that  can  be  difficult  to  manage  or  contain. 
Numerous  examples  can  be  seen  in  Africa  (Rwanda, 
Democratic  Republic  of  the  Congo,  Somalia),  the 
Caucasus  (Georgia,  Armenia,  Azerbaijan),  and 
Central  America  (Guatemala,  El  Salvador).  Some 
major  conflicts  are  regularly  in  the  news;  other  con¬ 
flicts  are  minor  in  scope  and  limited  in  time  (such  as 
the  fishing  disputes  between  Canada  and  Spain  or 
between  Iceland  and  Great  Britain).  Finally,  in  a 
minority  of  cases,  we  see  local  or  regional  conflicts 
that  are  confined  to  one  country  or  neighboring 
countries,  but  a  growing  proportion  of  these  have 
consequences  with  international  ramifications  (e.g., 
refugees,  fears  of  escalation,  collapsed  states,  dis¬ 
placed  persons,  humanitarian  crises). 

Trends  and  Developments  in  Armed  Conflict 

Violence,  a  prominent  characteristic  of  the  interna¬ 
tional  system,  is  changing  with  time.  What  then 
are  the  main  features  of  such  violence,  and  how 
have  they  changed?  What  are  the  major  trends  in 
war,  conflict,  and  violence  around  the  world? 

First,  “major”  wars  (between  major  powers) 
have  declined  substantially,  from  27  in  the  16th 
century  to  17  in  the  17th  century,  10  in  the 
18th  century,  5  in  the  19th  century,  and  5  in  the 
20th  century.  There  have  been  no  wars  between 
major  powers  since  1945.  If  the  trend  holds,  there 
should  be  no — or  at  the  most  very  few — major 
wars  in  the  21st  century.  On  the  other  hand,  the 
destruction  wrought  by  major  wars  has  increased 
exponentially  due  to  more  advanced  weaponry  and 
the  defense  dilemma.  The  European  wars  of  the 
16th  century  caused  slightly  more  than  a  million 
casualties,  while  60  million  soldiers  and  civilians 
died  in  the  two  world  wars  of  the  20th  century, 
more  than  the  casualties  of  all  previous  wars  com¬ 
bined.  Since  1945,  wars  have  caused  an  additional 
40  million  civilian  and  military  casualties.  (Civil 
wars  and  wars  of  independence  in  the  Third  World 
account  for  a  large  proportion  of  these.) 

Second,  95%  of  armed  conflicts  and  wars  are 
intrastate.  One  of  the  goals  of  warfare  today  is  the 
creation  of  smaller  states,  not  larger  ones  as  was 
the  case  in  the  past.  As  Kalevi  Holsti  observes,  the 
great  majority  of  wars  since  1945  have  been  waged 
within  states:  “Almost  77%  of  the  164  wars  were 
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internal,  where  armed  combat  was  not  against 
another  state  but  against  the  authorities  within  the 
state  or  between  armed  communities.”  Notwith¬ 
standing  the  oft-heard  thesis  that  this  is  a  phenom¬ 
enon  of  the  post-Cold  War  period,  it  has  in  fact 
been  a  well-established  trend  for  50  years.  In  other 
words,  states  have  been  under  threat  of  fragmenta¬ 
tion  for  some  time. 

On  the  one  hand,  interstate  wars  are  becoming 
less  frequent;  on  the  other,  intrastate  wars  are 
becoming  more  so.  While  there  were  a  dozen  civil 
wars  per  decade  in  the  first  half  of  the  20th  cen¬ 
tury,  the  average  has  increased  to  20  in  the  last 
50  years.  During  the  1990s,  the  majority  of  civil 
wars  lasted  more  than  5  years,  two  fifths  lasted 
more  than  10  years,  and  a  quarter  more  than  20 
years.  Between  1989  and  1996,  more  than  one 
third  of  United  Nations  (UN)  member  states — 
countries  with  a  combined  total  of  20  million  sol¬ 
diers  and  a  civilian  population  of  3.3  billion — were 
torn  by  civil  war.  Civil  wars  in  the  Third  World 
left  some  40  million  dead. 

Third,  territorial  factors  are  becoming  consider¬ 
ably  less  important  as  a  cause  of  war.  Between 
1648  and  1945,  about  one  half  of  wars  were  ter¬ 
ritorial  in  nature,  compared  with  30%  since  1945. 
As  noted  above,  between  1945  and  1989,  close  to 
77%  of  wars  were  internal,  of  which  half  were 
ideological  and  the  other  half  ethnic  conflicts  or 
wars  of  secession. 

Whereas  during  the  Cold  War  armed  conflicts 
were  fairly  evenly  divided  between  conflicts  of  a 
territorial,  ideological,  and  ethnopolitical  nature, 
since  1989,  the  greatest  proportion  of  conflicts  has 
fallen  into  the  last  category;  most  commonly,  these 
have  been  ethnic  and  identity-based  conflicts  in 
states  at  risk  of  collapse.  On  the  other  hand,  serious 
territorial  conflicts  have  a  greater  chance  of  escalat¬ 
ing  into  war  than  do  other  types  of  conflict.  While 
there  are  fewer  and  fewer  disputes  between  states 
over  territorial  sovereignty,  17%  of  the  world’s 
309  land  borders  are  disputed,  and  39  countries 
are  still  involved  in  jurisdictional  disputes  over 
archipelagos  or  islands.  Some  observers  argue  that 
territory  remains  an  important  issue,  particularly  in 
the  age  of  globalization,  as  some  states  fight  fiercely 
to  maintain  their  integrity  while  they  see  their  eco¬ 
nomic  independence  slipping  away. 

Fourth,  the  majority  of  wars  and  armed  conflicts 
are  being  waged  in  the  Global  South,  primarily  in 


Africa  and  Asia  but  also  in  the  Middle  East  and  the 
Caucasus.  In  Latin  America,  the  number  of  armed 
conflicts  has  declined  considerably.  Three  regions 
have  seen  no  interstate  wars  since  1945:  North 
America,  South  America,  and  Western  Europe 
(except  for  the  intervention  against  Serbia  in 
1999). 

Fifth,  fewer  soldiers  and  more  civilians  are 
involved  in  wars.  Civilians  have  become  the  leading 
victims  of  armed  conflict  by  far:  In  the  1990s,  90% 
of  the  victims  of  war  were  civilians,  compared  with 
65%  during  World  War  II  and  40%  during  World 
War  I.  There  were  nearly  5  million  military  and 
civilian  battle  deaths  in  intrastate  wars  (counting 
both  colonial  and  civil  wars)  between  1900  and 
1949,  and  more  than  10  million  between  1950  and 
2000.  Interstate  wars  claimed  27  million  military 
and  civilian  battle  lives  during  the  1900  to  1949 
period  and  3  million  between  1950  and  2000. 
These  figures  do  not  include  state  genocide  perpe¬ 
trated  against  domestic  populations  in  the  former 
USSR,  China,  Cambodia,  and  elsewhere,  which 
have  claimed  more  than  100  million  victims. 

Civil  strife  has  had  devastating  effects.  For 
example,  there  have  been  close  to  4  million  battle 
deaths  in  the  Democratic  Republic  of  the  Congo 
since  1998.  With  1,250  people  dying  daily,  the  war 
in  the  Congo  has  been  the  deadliest  since  World 
War  II.  The  use  of  rape  as  a  weapon  of  war  and 
the  spread  of  pandemics  such  as  AIDS,  which  are 
rampant  in  many  countries  and  in  the  armed  forces 
in  particular,  should  also  be  noted  (60%  of  sol¬ 
diers  are  infected  with  the  human  immunodefi¬ 
ciency  virus  [HIVJ  in  Zambia,  55%  in  Zimbabwe, 
40%  in  Angola  and  the  Democratic  Republic  of 
the  Congo,  and  10%  to  30%  in  Tanzania,  Ivory 
Coast,  and  Nigeria).  A  growing  proportion  of 
civilian  victims  are  children,  and  the  majority  of 
armed  conflicts  involve  child  soldiers  under  the  age 
of  15.  In  the  Third  World,  industrialization  and 
the  expansion  of  the  service  economy  are  creating 
an  influx  of  young  people  into  the  urban  centers 
where  more  jobs  are  available.  As  a  result,  the 
population  of  the  cities  is  swelling,  leading  to  the 
“urbanization”  of  conflicts.  Jean-Louis  Dufour 
has  predicted  that  cities  will  be  the  battlefields  of 
the  next  century. 

In  light  of  this  survey,  there  is  reason  for  both 
optimism  and  pessimism  about  the  prospects  for 
violence  in  the  future. 
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On  the  one  hand,  while  there  are  many  more 
states  today  than  there  were  200  years  ago,  there 
has  been  no  increase  in  armed  conflicts  between 
states.  In  proportionate  terms,  there  are  fewer 
armed  conflicts  today  than  there  were  at  the  begin¬ 
ning  of  the  20th  century.  On  the  other  hand,  it  is 
clear  that  the  human  tendency  toward  violence  has 
not  subsided;  in  fact,  it  has  become  more  intense 
and  less  state  driven. 

The  Future  of  Peace  Missions 

Since  the  early  1990s,  the  UN  has  played  a  central 
role  in  the  implementation  of  conflict  prevention 
and  resolution  strategies.  As  a  rule,  the  UN  is  auto¬ 
matically  involved  in  peace  missions.  The  existence 
of  an  international  organization  that  is  able  to 
deploy  soldiers  supplied  by  member  countries  in 
order  to  maintain  or  restore  an  often  fragile  peace  is 
a  new  development  of  historic  significance.  Over  the 
past  50  years,  the  UN  has  sent  hundreds  of  thou¬ 
sands  of  Blue  Helmets  (i.e.,  peace  forces)  to  carry 
out  a  variety  of  observations  and  to  monitor 
missions  designed  to  help  prevent  a  resumption  of 
hostilities.  Since  the  end  of  the  Cold  War,  the  UN 
has  adopted  two  additional  goals:  (1)  creating 
conditions  for  a  lasting  peace  settlement  and 
(2)  supporting  efforts  toward  reconciliation  and 
reconstruction  in  societies  that  have  experienced 
violent  conflicts.  This  expanded  agenda  explains  the 
significant  increase  in  peacekeeping  missions  during 
the  past  decade. 

Seventeen  UN  peace  missions  were  launched  in 
the  4  years  following  the  end  of  the  Cold  War, 
from  1989  to  1993 — as  many  as  during  the  pre¬ 
ceding  four  decades.  There  had  been  a  total  of  15 
missions  up  to  1989,  all  but  5  in  connection  with 
interstate  conflicts.  Between  1989  and  2000,  there 
were  a  total  of  38  new  missions,  all  but  5  for  intra¬ 
state  conflicts.  In  1991,  there  were  about  11,000 
peacekeepers  on  the  ground;  2  years  later  there 
were  more  than  78,000,  an  all-time  high.  In  the 
late  1990s,  a  certain  amount  of  fatigue,  coupled 
with  cost  concerns,  set  in,  and  the  number  of  Blue 
Helmets  dropped  to  30,000  in  1999;  however,  by 
the  end  of  the  first  decade  of  the  21st  century, 
there  were  again  almost  80,000  UN  soldiers 
involved  in  more  than  20  missions  (approximately 
100,000,  if  one  includes  military  observers  and 
civilian  police).  In  addition,  65,000  NATO  and 


European  Union  (EU)  soldiers  were  on  duty  in 
Afghanistan,  Kosovo,  and  Bosnia  as  part  of  robust 
operations  aimed  at  maintaining  a  fragile  peace.  The 
UN  is  therefore  more  active  than  ever.  The  number 
of  peace  missions  has  increased,  even  though  the 
number  of  conflicts  has  decreased.  Three  quarters  of 
the  total  number  of  peace  missions  in  the  history  of 
the  UN  have  been  launched  since  the  end  of  the  Cold 
War  at  an  annual  cost  of  around  $5  billion  (equal  to 
1/100  of  the  U.S.  security  budget).  According  to 
Peter  Wallensteen,  the  UN  helped  work  out  25  of 
the  39  peace  agreements  that  were  signed  between 
1989  and  2000  for  the  purpose  of  ending  armed 
conflicts.  The  UN  is  involved  in  peacemaking  and 
peacebuilding  in  no  less  than  half  of  the  civil  wars 
taking  place  today.  Between  1990  and  2002,  peace¬ 
making  initiatives  increased  fourfold;  the  imposition 
of  sanctions,  fivefold;  preventive  diplomacy  mis¬ 
sions,  sixfold;  and  mediation  mechanisms  and  truth 
and  reconciliation  commissions,  sevenfold.  The  total 
number  of  peace  operations  more  than  tripled  from 
7  in  1988  to  23  in  2008  (see  the  reports  from  the 
Human  Security  Centre  at  the  University  of  British 
Columbia  and  the  Annual  Review  of  Global  Feace 
Operations  from  the  Center  on  International 
Cooperation  at  New  York  University).  Between 
1948  and  2008,  some  2,200  Blue  Helmets  were 
killed  during  peace  missions;  more  than  half  of  those 
have  died  since  1993.  Since  intrastate  wars  have 
largely  replaced  interstate  wars,  to  which  UN  mech¬ 
anisms  appear  better  suited,  peace  missions  have 
become  more  demanding  and  more  dangerous.  In 
the  new  international  security  environment,  the  UN 
has  been  redesigning  the  mandates  and  methods  of 
peace  operations  launched  to  end  civil  wars  and 
ethnic  conflicts.  The  results,  however,  have  been 
mixed.  The  UN’s  own  self-assessment  reports  on  the 
Rwandan  genocide  of  1994  and  the  massacre  in  the 
Bosnian  village  of  Srebrenica  in  1995  uncovered 
serious  weaknesses  in  peacekeeping  mechanisms  and 
decision  making.  Since  these  events,  there  have  been 
many  analyses  of  the  shortcomings  of  the  UN’s 
peace  operations  and  international  security  efforts. 
Those  deficiencies  have  been  made  evident  again  by 
the  UN’s  inaction  in  Darfur  and  the  Democratic 
Republic  of  the  Congo. 

The  impartial  Blue  Helmets,  standing  between 
the  combatants  with  the  consent  of  both  parties, 
seem  to  be  a  thing  of  the  past.  In  today’s  peace 
missions,  UN  soldiers  (and,  ironically  enough, 
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NATO  soldiers)  are  called  on  to  maintain  and 
strengthen  the  standards  that  underpin  peace.  In  a 
country  that  has  become  ungovernable,  the  mere 
presence  of  UN  forces  can  be  enough  to  provide  a 
level  of  security,  which  will  enable  the  country  to 
get  past  the  roots  and  the  effects  of  war,  at  least 
for  a  time.  In  addition,  there  is  hope  that  as  they 
move  toward  a  liberal  conception  of  democratic 
standards  and  a  market  economy,  these  countries 
will  recover  the  peace  that  they  lost  or  never 
knew.  Together  with  other  actors,  particularly 
humanitarian  organizations  and  financial  institu¬ 
tions,  the  Blue  Helmets  are  taking  initiatives  and 
applying  principles  that  go  beyond  the  limited, 
traditional  concept  of  peacekeeping.  Their  new 
role  pursues  the  far  more  ambitious  objectives  of 
peacemaking,  peace  enforcement,  and  peacebuild¬ 
ing.  Realists  consider  this  to  be  a  praiseworthy  but 
naive  approach  that  is  based  on  false  hopes  and 
doomed  to  failure.  In  their  view,  the  UN  cannot 
abandon  the  Westphalian  conception  of  security 
on  which  it  was  founded  in  1945.  According  to 
the  realists,  if  the  UN  claims  a  right  of  interfer¬ 
ence,  which  entails  an  increased  practical  commit¬ 
ment  to  human  security,  it  will  flounder  on  the 
severe  limitations  and  enormous  obstacles  faced 
by  intrastate  peace  missions.  For  one  thing,  the 
possibility  of  artificially  reshaping  a  society  in 
the  image  of  a  Western  democracy  appears  to  the 
realists  to  be  doubtful  at  best.  A  basic  contradic¬ 
tion  immediately  presents  itself:  The  UN  and 
other  organizations  such  as  NATO  are  increas¬ 
ingly  expressing  a  desire  to  keep  the  peace  not 
only  between  states  but  also  within  states.  Whereas 
during  the  Cold  War,  most  states  wanted  the  UN 
to  be  no  more  than  an  arbiter  of  interstate  rela¬ 
tions,  and  a  weak  one  at  that,  some  now  expect 
the  UN  to  rescue  and,  if  necessary,  revive  col¬ 
lapsed  states  that  are  unable  to  govern  themselves 
or  maintain  security.  Can  the  UN  fulfill  this  role? 
Are  UN  peace  forces  able  to  resolve  intrastate 
conflicts?  This  question  can  be  expected  to  arise 
with  every  new  peace  mission  and  to  fuel  bitter 
debate  for  years  to  come. 

Definitions  of  Peace 

In  general,  a  peacekeeping  mission  refers  to  the 
deployment  by  the  UN  (or  another  intergovern¬ 
mental  organization  [IGOJ)  of  civilian  personnel, 


police,  and  Blue  Helmets  for  the  purpose  of  conflict 
prevention,  management,  and  resolution.  The  UN’s 
military  operations  include  a  full  range  of  activities 
aimed  at  curbing  and  resolving  conflicts,  from  tra¬ 
ditional  peacekeeping  to  peace  enforcement. 

Peace  has  often  been  defined  as  the  absence  of 
violence.  In  the  post-Cold  War  period,  this  classic, 
primarily  “negative,”  definition  has  come  to  be 
seen  as  inadequate,  and  the  international  commu¬ 
nity  is  working  toward  a  more  progressive  and 
“positive”  definition  of  peace.  The  negative  con¬ 
ception  of  peace  implies  that  peace  is  only  tempo¬ 
rary.  Peace  exists  when  we  succeed  in  preventing 
the  outbreak  of  conflict.  It  is  a  fragile  interval 
produced  by  the  balance  of  forces.  It  may  be  based 
on  the  hegemony  of  one  state,  the  balance  between 
major  powers  in  the  international  system  (e.g., 
where  several  countries  possess  weapons  of  mass 
destruction),  or  the  play  of  alliances.  These  are  all 
shaky  foundations  for  peace  and,  in  the  view  of  the 
realists,  have  not  prevented  the  persistence  and 
regular  resurgence  of  war,  the  natural  order  of 
things,  throughout  history. 

The  positive  conception  of  peace  is  based  on  the 
establishment  of  values,  networks,  and  multilateral 
mechanisms  that  can  ensure  the  long-term  survival 
of  a  stable  international  system.  The  threat  is  no 
longer  military  in  nature  and  cannot  be  addressed 
by  states  acting  on  their  own.  The  principle  “every 
man  for  himself”  is  replaced  by  “all  for  one”  and 
the  idea  of  power  by  the  concept  of  sharing. 
Individual  attitudes  must  change,  the  machinery  of 
war  must  be  abandoned,  and  peace  and  justice 
education  must  be  promoted. 

Approaches  to  Peace  Missions 

Approaches  to  peace  and  peace  missions  some¬ 
times  adopt  a  positive  and  sometimes  a  negative 
conception  of  peace. 

During  the  Cold  War,  the  (negative)  concept  of 
peacekeeping  was  the  only  one  in  common  use. 
With  the  proliferation  of  peace  missions  and  man¬ 
dates  since  1989,  new  (more  positive)  concepts 
have  gained  currency.  Most  notably,  the  Agenda 
for  Peace  proposed  in  1992  by  UN  Secretary 
General  Boutros  Boutros-Ghali  and  revised  in 
1995  set  out  principles  and  terms  that  gave  new 
impetus  to  research  efforts  and  introduced  a  more 
precise  classification  of  peace  missions.  From  the 
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voluminous  literature  on  peace  missions,  we  will 
discuss  four  terms  that  are  now  widely  in  use. 

Peacekeeping  means  deploying  UN  personnel — 
primarily,  military  personnel — with  the  consent  of 
the  parties  to  the  conflict  to  maintain  a  cease-fire 
and  prevent  a  resumption  of  hostilities.  By  posi¬ 
tioning  multinational  forces  between  the  parties, 
the  UN  tries  to  preserve  or  increase  the  chances  of 
peace.  The  forces  are  deployed  only  when  a  peace 
agreement  has  been  reached  and  has  taken  hold. 
The  forces  remain  impartial  and  neutral.  They  can 
fire  only  in  self-defense.  If  hostilities  break  out 
again,  they  are  immediately  withdrawn.  The  UN 
Charter  made  no  provision  for  such  forces;  they 
are  often  said  to  be  mandated  under  a  fictional 
Chapter  VI  and  a  half,  halfway  between  the  coop¬ 
erative  means  of  Chapter  VI  and  the  coercive 
means  of  Chapter  VII. 

Peacemaking  includes  all  forms  of  mediation  and 
negotiation  intended  to  bring  the  parties  closer 
together,  essentially  by  peaceful  means.  The  coop¬ 
erative  means  referred  to  in  Chapter  VI  of  the  UN 
Charter  are  used  to  help  achieve  a  settlement  of  the 
conflict.  Preventive  diplomacy,  particularly  the  pre¬ 
ventive  deployment  of  peacekeepers,  can  also  be 
useful  for  containing  the  outbreak  and  escalation  of 
violence  between  the  parties.  The  deployment  of 
forces  with  the  consent  of  the  parties  can  help 
establish  a  climate  of  trust  and  security  conducive 
to  the  resumption  of  negotiations  and  mediation. 

Peace  enforcement  refers  to  coercive  action  autho¬ 
rized  by  the  UN  Security  Council  pursuant  to 
Chapter  VII  of  the  Charter  in  response  to  “threats 
to  peace,  breaches  of  the  peace  and  acts  of  aggres¬ 
sion.”  Multinational  military  forces  under  UN 
command  must  then  enforce  the  agreements  that 
they  are  supposed  to  guarantee  and,  if  necessary, 
engage  in  armed  action.  Peace  enforcement  can 
also  be  undertaken  by  a  regional  organization 
under  Chapter  VIII  of  the  Charter  and  in  accor¬ 
dance  with  rules  stipulated  by  the  Security  Council. 

Peacebuilding  means  a  concerted  effort  by  the  UN 
and  the  international  community  as  a  whole  to 
develop  political,  economic,  and  security  infra¬ 
structures  to  achieve  long-term  suppression  or 
resolution  of  a  conflict.  Peacebuilding  attempts  to 


lay  the  foundations  for  reconciliation  and  recon¬ 
struction.  It  seeks  to  avert  any  resumption  of  vio¬ 
lence  in  a  bid  to  redraw  the  settlement.  While  it  is 
intended  primarily  for  the  postconflict  phase,  it 
can  also  be  applied  as  a  preventive  measure  before 
violence  breaks  out  or  even  during  the  conflict  to 
firm  up  an  unstable  or  precarious  peace.  It  is  based 
on  the  economic  and  social  measures  described  in 
Chapters  IX  and  X  of  the  Charter,  among  others. 

These  approaches  are  used  in  succession  as  part  of 
an  overall  strategy  for  peace. 

Development  of  Peace  Missions 

The  development  of  peace  missions  can  be  thought 
of  as  spanning  two  periods:  (1)  the  Cold  War, 
from  1948  to  1988,  during  which  traditional 
peacekeeping  operations  were  conducted  by  the 
Blue  Helmets,  and  then  (2)  the  1989  to  1993 
period,  during  which  the  deployment  of  peace¬ 
keepers  and  their  assignments  expanded  exponen¬ 
tially.  A  third  emerging  phase,  the  post-1994 
period,  is  less  distinct:  It  is  a  continuation  of  the 
preceding  period  with  some  accentuated  features. 

The  first  period  opened  with  the  UN  Observer 
Missions  monitoring  in  Palestine  in  1948  and 
Kashmir  in  1949  (the  dates  of  the  beginning  of  the 
operations).  The  goal  of  the  “Blue  Beret”  missions 
was  to  monitor  cease-fires  and  armistice  agree¬ 
ments  that  had  been  agreed  to  by  states.  The 
deployment  of  the  “Blue  Helmets”  (a  Canadian 
invention)  along  the  Suez  Canal  in  1956  was  the 
beginning  of  a  new,  more  engaged  phase.  Similar 
operations  were  subsequently  carried  out  to 
enforce,  as  far  as  possible,  other  demarcation  lines 
and  cease-fires  between  opposed  countries  or  com¬ 
munities,  such  as  Cyprus  in  1964,  the  Sinai  in 
1973,  and  the  Golan  Heights  in  1974. 

The  Blue  Helmets  were  positioned  between  the 
combatants  to  police  a  buffer  zone  and  reduce  the 
risk  of  a  resumption  of  hostilities.  The  principle 
was  clear,  and  the  success  criterion  was  the 
absence  of  war — that  is,  the  realization  of  the 
negative  conception  of  peace.  Other  missions  were 
mandated  to  verify  the  suspension  of  hostilities 
between  Iran  and  Iraq  in  1988,  to  supervise  the 
withdrawal  of  foreign  troops  from  Afghanistan  in 
1988  and  Angola  in  1989,  and  in  the  first  intra¬ 
state  conflicts  that  the  UN  has  dealt  with,  to 
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restore  order  in  Congo  in  1960  to  1964  and  in 
southern  Lebanon  in  1978.  This  period  also  saw 
multilateral  operations  outside  the  UN  frame¬ 
work,  such  as  the  one  in  Lebanon  in  1982,  in 
which  several  countries  joined  forces  to  deploy  an 
implementation  force. 

During  the  second  period,  which  extended  from 
1989  to  1993,  17  new  missions  were  conducted. 
As  the  Agenda  for  Peace  noted  in  1992,  the  UN 
deployed  tens  of  thousands  of  Blue  Helmets  to 
carry  out  expanded  missions.  They  supervised 
elections  (Namibia  and  Nicaragua  in  1989),  helped 
settle  disputes  (El  Salvador  in  1991,  Cambodia 
and  Mozambique  in  1992),  and  served  as  a  pre¬ 
vention  force  (on  the  Kuwaiti  border  in  1991  and 
Macedonia’s  border  in  1992).  They  also  moni¬ 
tored  disarmament  (Iraq  in  1991),  ensured  the 
security  of  humanitarian  operations,  protected 
refugees  and  displaced  persons  (in  northern  Iraq  in 
1991,  in  Somalia  and  Bosnia  in  1992),  and 
observed  and  monitored  the  (often  futile)  imple¬ 
mentation  of  peace  agreements  (in  Angola  in  1991, 
in  Rwanda  in  1993). 

These  examples  confirm  three  notable  changes 
in  Blue  Helmet  missions. 

1.  They  affirm  (reinforce)  the  new  right  of 
intervention  to  provide  humanitarian  assistance, 
proclaimed  by  the  Security  Council  after  the 
Gulf  War  in  1991. 

2.  Most  of  these  Blue  Helmet  missions  addressed 
intrastate  conflicts,  without  necessarily 
obtaining  prior  consent  from  the  government  or 
the  rival  factions. 

3.  The  missions  were  multidimensional;  that  is, 
they  simultaneously  comprised  peacekeeping, 
peacemaking,  peace  enforcement,  and  peace¬ 
building  components. 

Since  1994,  a  third  generation  of  missions  has 
emerged:  The  focus  is  on  peace  enforcement  to 
stabilize  collapsed  states  through  reconstruction, 
democratization,  and  development.  In  these 
missions,  peace  enforcement  and  peacebuilding 
interpenetrate. 

In  one  new  feature,  many  of  these  missions  were 
originally  conducted  by  an  actor  other  than  the 
UN  but  with  the  UN’s  approval.  This  occurred  for 
the  first  time  with  the  deployment  of  U.S.  troops  in 


Somalia  in  1992,  initially  for  strictly  humanitarian 
reasons.  The  subsequent  use  of  force  by  the  United 
States  in  Haiti  in  1994,  by  NATO  in  Bosnia  in 
1995  and  in  Kosovo  in  1999,  by  Nigeria  and 
Economic  Community  of  West  African  States 
Monitoring  Group  (ECOMOG)  in  Sierra  Leone  in 
1997,  by  Australia  in  East  Timor  in  1999,  by 
Great  Britain  in  Sierra  Leone  in  2000,  by  NATO 
again  in  Afghanistan  in  2001,  and  by  France  in 
Ivory  Coast  in  2002  revealed  the  UN’s  significant 
shortcomings  when  it  came  to  deploying  an  armed 
force  without  the  full  consent  of  the  host  country’s 
political  authorities.  Over  the  past  6  years,  UN 
peace  forces,  acting  in  some  cases  with  the  support 
of  small  contingents  from  the  African  Union  (AU), 
Organization  of  American  States  (OAS),  Economic 
Community  of  West  African  States  (ECOWAS),  or 
the  EU,  have  taken  over  such  missions  in  Sudan, 
Sierra  Leone,  Liberia,  Ivory  Coast,  Burundi,  the 
Democratic  Republic  of  the  Congo,  Haiti,  and 
East  Timor  but  only  once  the  situation  was  rela¬ 
tively  stable.  In  Somalia,  Haiti,  and  East  Timor, 
the  UN  Blue  Helmets  took  over  the  missions  only 
once  security  had  been  reestablished.  Elsewhere, 
the  Blue  Helmets  continued  to  conduct  traditional 
monitoring  and  peacekeeping  operations  lasting 
for  years  in  Cyprus,  the  Middle  East,  Western 
Sahara,  Kashmir,  and  the  Ethiopia-Eritrea  border. 
There  has  been  lively  discussion  about  the  role  of 
the  peace  forces  and  heated  polemics  about  their 
operations  against  the  background  of  UN  reform 
and  the  redefinition  of  its  mechanisms. 

Clearly,  there  are  huge  obstacles  to  giving  the 
UN  real  peace  enforcement  capabilities.  These 
include  the  lack  of  a  real  military  mechanism  at  the 
UN,  archaic  command  structures  and  doctrines, 
lack  of  equipment  and  financial  resources,  poor 
coordination,  overly  decentralized  training  and 
supervision  of  peace  forces,  inability  to  react 
swiftly,  and,  most  important,  the  UN’s  dependence 
on  the  countries  that  supply  the  Blue  Helmet 
troops. 

Hence,  it  is  not  surprising  to  see  devolution  of 
military  operations  to  defense  organizations, 
including  NATO,  and  to  major  powers,  such  as 
the  United  States,  which  have  greater  experience 
and  more  resources  in  this  area.  Another  promis¬ 
ing  approach  might  be  to  “regionalize”  peace 
enforcement  and  provide  regional  organizations 
such  as  the  AU  with  the  logistical  means,  funding, 
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and  training  in  command  and  coordination  that 
they  need  to  conduct  enforcement  and  combat 
missions  more  independently,  although  still  under 
the  aegis  of  the  UN.  Hybrid  missions  of  this  type 
would  also  better  reflect  the  contribution  of  south¬ 
ern  hemisphere  countries  to  UN  peace  missions: 
Five  nations — India,  Pakistan,  Bangladesh, 
Nigeria,  and  Ghana — are  now  supplying  three 
quarters  of  the  Blue  Helmet  troops.  At  the  end  of 
the  day,  the  UN  serves  the  cause  of  peace  more 
effectively  in  its  role  as  impartial  mediator  than  in 
a  military  capacity.  However,  it  appears  that 
armed  force  is  needed  to  establish  and  shore  up  the 
peace.  This  is  a  battle  that  the  founders  of  the  UN 
clearly  did  not  foresee  and  that  the  Blue  Helmets 
are  not  equipped  to  wage. 
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Warlords 


The  term  warlord  reemerged  on  the  political  scene 
due  to  a  number  of  bloody  intrastate  conflicts 
since  the  beginning  of  the  1990s.  In  particular,  the 
long-standing  wars  in  West  Africa,  in  Somalia,  in 
the  Democratic  Republic  of  the  Congo,  and  in 
Afghanistan  led  to  the  frequent  use  of  the  term  in 
politics,  media,  and  academia  to  describe  power¬ 
ful  men  and  their  armed  organizations,  which 
profit  economically  from  war  and  violence. 
Modern  archetypes,  such  as  Charles  Taylor 
(Liberia),  Johnny  Paul  Koroma  (Sierra  Leone), 
Mohammed  Farah  Aideed  (Somalia),  Jean-Pierre 
Bemba  (Democratic  Republic  of  the  Congo), 
Mahmud  Khudoberdyev  (Tajikistan),  and  Gul 
Agha  Shirzai,  Ismail  Khan,  and  Abdul  Rashid 
Dostum  (all  Afghanistan)  are  regarded  as  both  a 
result  and  a  key  characteristic  of  the  so-called  new 
wars.  This  entry  analyzes  the  phenomenon,  in 
general,  and  points  to  the  main  characteristics  of 
organization,  leadership,  and  other  key  aspects. 

The  Phenomenon  of  the  Warlord 

The  warlord  phenomenon  is  not  a  new  challenge 
for  governments  and  the  international  commu¬ 
nity.  On  the  contrary,  warlords  have  been  con¬ 
stant  companions  of  wars  since  the  ancient  times 
and  the  medieval  period.  In  particular,  the  situa¬ 
tion  in  Britain  in  the  early  Middle  Ages  with  a 
number  of  rival  armed  local  commanders,  dukes, 


and  lords  is  seen  as  a  historical  precedent.  But  also 
during  the  Thirty  Years’  War  of  the  17th  century, 
feudal  warlords — Max  Weber  used  the  terms 
Kriegsfiirst  and  Kriegsfiirstentum — dominated 
European  battlefields.  They  recruited  and  paid 
their  own  soldiers,  and  they  were  known  for  plun¬ 
dering  the  local  population;  perhaps  the  most 
prominent  figures  being  Duke  Albrecht  von 
Wallenstein  and  Duke  Mansfield.  Another  histori¬ 
cal  point  of  reference  is  the  Chinese  warlord  era 
after  the  collapse  of  the  Qing  Dynasty  in  1911. 
Due  to  the  fragmentation  of  the  Empire  as  well  as 
the  competition  of  local  rulers  and  army  com¬ 
manders,  about  1,300  warlords  (in  Chinese  called 
junfa),  subdivided  in  different  cliques,  were 
involved  in  more  than  140  greater  and  smaller 
wars  in  the  1920s  and  1930s.  From  a  historical 
point  of  view,  the  figure  of  the  warlord  can  thus 
be  seen  as  the  classical  antagonist  to  the  modern 
state’s  monopoly  on  the  use  of  force,  which  was 
successively  established  in  Europe  and  later  else¬ 
where  against  various  forms  of  warlordism. 

In  general,  warlords  are  portrayed  as  profit- 
driven,  economically  motivated  actors  who  have 
an  interest  in  ongoing  conflicts  and,  therefore, 
undermine  peace  and  stability.  As  Max  Weber 
(1978)  pointed  out,  “The  warlord  becomes  a  per¬ 
manent  figure  when  there  is  a  chronic  state  of 
war”  (p.  1142).  Moreover,  warlords  may  also  be 
present  in  a  fragile  postconflict  setting  or  in  situa¬ 
tions  of  state  failure.  Under  such  circumstances,  as 
demonstrated,  for  example,  in  Afghanistan  after 
2001  or  in  Bosnia  after  1995,  warlords  and  their 
private  militias  aim  at  consolidating  the  territo¬ 
rial,  financial,  or  economic  gains  that  they  have 
made  during  a  violent  conflict.  In  other  words, 
warlords  may  also  exist  outside  of  war  times  and 
may  influence  postwar  economies  and  politics,  in 
particular,  as  long  as  state  structures  are  weak  or 
failing  and  the  state’s  monopoly  of  the  use  of  force 
does  not  exist. 

In  press  reports  as  well  as  in  the  academic  lit¬ 
erature,  the  label  warlord  has  been  used  for  very 
different  persons  and  groups.  In  some  cases,  presi¬ 
dents,  heads  of  government,  leaders  of  political 
parties,  or  former  army  generals  also  have  been 
described  as  warlords.  But  mostly  the  term  refers 
to  an  armed  nonstate  actor  who  is  able  to  control 
a  particular  territory  and  its  inhabitants,  during  or 
after  the  end  of  a  violent  conflict.  Or  as  James 
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Sheridan  (1966)  put  it,  a  warlord  is  a  man  who 
exercised  effective  governmental  control  over  a 
fairly  well-defined  region  by  means  of  a  military 
organization  that  obeyed  no  higher  authority  than 
himself.  The  regions  or  territories  dominated  by 
warlords  are  often  remote,  far  away  from  the 
capital,  and  beyond  effective  control  of  the  national 
government.  In  some  cases,  territorial  control 
exceeds  even  international  borders.  For  example, 
in  1992-1993,  Charles  Taylor  controlled  not  only 
most  of  Liberia  but  also  parts  of  Sierra  Leone  and 
the  border  region  with  Ivory  Coast  (known  as 
“Taylorland”). 

Generally,  warlords  aim  at  controlling  areas 
rich  in  resources  such  as  timber,  oil,  minerals, 
diamonds,  gold,  or  other  precious  metals,  which 
are  easy  extractable.  These  resources  are  exploited 
directly  or  indirectly,  such  as  by  selling  conces¬ 
sions  to  national  or  international  companies.  The 
population  under  the  control  of  a  warlord  organi¬ 
zation  also  serves  as  an  important  resource. 
Warlords  and  their  fighters  may  regularly  plunder 
and  loot  villages  and  farms,  they  may  raise  special 
“taxes”  for  granting  physical  security,  and  they 
may  also  benefit  from  local  businesses  and  trade 
relationships  as  well  as  from  criminal  activities 
(e.g.,  drug  cultivation  and  trade,  human  traffick¬ 
ing,  smuggling  of  goods),  which  they  either  con¬ 
trol  or  at  least  allow  as  long  as  they  can  enjoy 
their  share  of  the  profits.  In  addition,  warlords — 
like  other  armed  groups — may  use  humanitarian 
or  developmental  aid,  provided  by  international 
donors,  for  their  own  purposes.  Or,  alternatively, 
they  demand  money  or  goods  for  allowing  non¬ 
governmental  organizations  (NGOs)  or  aid  agen¬ 
cies  to  supply  the  local  population  with  basic 
services  or  to  start  infrastructure  projects.  As 
emphasized  by  Mark  Duffield  (1998)  and  William 
Reno  (1998),  warlord  organizations  need  to  oper¬ 
ate  across  borders  and  try  to  get  access  to  regional 
and  global  networks  as  well  as  to  gray  and  illicit 
markets  in  order  to  persist.  Therefore,  successful 
warlords  establish  transnational  links  with  busi¬ 
nesspeople,  companies,  governments,  or  orga¬ 
nized  crime. 

The  Organization  and  Other  Key  Aspects 

These  economic  activities,  however,  are  not  only 
driven  by  selfish  profit  interests,  but  they  are  a 


necessary  requirement  to  finance  a  warlord’s 
activities,  including  acquiring  weapons  and  paying 
their  fighters  and  supporters.  Warlords  have  to  be 
able  to  recruit,  maintain,  train,  and  equip  their 
own  private  paramilitary  forces.  Sometimes  they 
also  use  child  soldiers,  but  mostly  they  will  rely  on 
experienced  comrades  and  fighters.  The  relation¬ 
ship  between  them  and  the  warlord  is  often  based 
on  personal  loyalty  and  to  a  lesser  extent  on  ethnic 
ties  or  political  ideology.  Therefore,  a  warlord 
should  be  distinguished  from  a  clan  chief  or  a  rebel 
leader.  It  may,  however,  be  the  case  that  key  mem¬ 
bers  of  a  warlord  organization  come  from  the 
same  ethnic  group,  tribe,  clan,  or  family  to  enhance 
the  coherence  of  the  group  and  to  ensure  trust 
among  its  leadership.  This  pattern  can  be  found  in 
particular  in  Somalia  and  Afghanistan  where  clan 
and  tribe  organizations  are  still  very  strong.  In 
some  instances,  the  charisma  of  the  warlord  and 
the  myths  about  his  (“glorious”)  life  play  an 
important  role  in  attracting  supporters,  in  particu¬ 
lar  young  men.  The  kind  of  organization  differs 
from  case  to  case  and  ranges  from  rather  hierarchi¬ 
cal  structures  to  a  loose  band  of  commanders  and 
subcommanders  who  pursue  their  own  interests 
and  have  a  certain  degree  of  autonomy. 

Nevertheless,  to  stay  in  power,  warlords  have  to 
rely  on  their  leadership  and  organizational  skills  as 
well  as  on  their  ability  to  sustain  coalitions  and 
networks.  In  this  respect,  many  warlords  can  make 
good  use  of  their  capabilities  and  contacts  they  have 
made  in  their  former  lives.  As  several  biographies 
show,  they  have  often  been  part  of  the  political  or 
military  establishment  of  a  country;  sometimes  they 
come  from  wealthy,  influential  families  (e.g., 
Bemba),  a  number  of  them  have  made  a  profes¬ 
sional  career  in  the  army  or  the  security  sector  (e.g., 
Khan,  Dostum,  Koroma,  Aideed,  and  Khudo- 
berdyev).  Others  had  experience  as  businessmen 
and  a  few  even  studied  abroad  such  as  Taylor  in  the 
United  States  or  Bemba  in  Belgium  (both  earned 
degrees  in  economics).  Most  of  them  first 
supported  the  existing  regime  but  later  joined  the 
opposition  and  set  up  armed  groups  (e.g.,  Taylor’s 
National  Patriotic  Front  of  Liberia  [NPFLJ)  or  used 
the  opportunities  offered  by  state  failure  and  civil 
wars.  For  example,  during  the  1980s  and  1990s, 
the  Afghan  warlords  managed  to  establish  them¬ 
selves  as  key  field  commanders  and  build  territorial 
strongholds.  Warlords,  however,  cannot  always  be 
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seen  as  antagonistic  to  state  actors.  Sometimes,  they 
collaborate  with  the  government  and  may  even 
join  forces  with  the  state’s  military  if  this  fits  their 
interests  and  as  long  as  the  government  does  not 
challenge  the  warlord’s  autonomy. 

To  sustain  their  rule,  warlords  cannot  only  rely 
on  the  (potential)  use  of  violence  to  deter  enemies 
or  intimidate  the  population;  they  often  perform  a 
political  role  and  try  to  achieve  some  form  of  gov¬ 
ernance.  As  Paul  Jackson  (2003)  pointed  out, 
established  warlord  organizations  can  be  under¬ 
stood  as  “quasi  governance”  or  “embryonic  states” 
that  aim  at  enhancing  support  and  legitimacy  of 
the  population  by  providing  basic  services  and 
maintaining  infrastructure.  “Taylorland”  serves 
again  as  an  illustration  since  it  had  its  own  cur¬ 
rency  and  banks,  media,  transport,  and  communi¬ 
cation  infrastructure  as  well  as  “external”  trade 
relationships.  Many  warlords,  therefore,  made 
attempts  to  act  as  “politicians”  in  order  to  secure 
their  power  base  as  well  as  their  resources  and 
profits.  They  establish  political  movements  and 
take  part  in  elections;  others  become  incorporated 
in  official  state  structures.  In  Afghanistan,  for 
example,  a  number  of  warlords  became — at  least 
for  a  certain  period  of  time — provincial  governors, 
ministers  in  the  central  government,  or  members  of 
parliament.  Similar  careers  can  be  studied  in 
Tajikistan,  in  the  Democratic  Republic  of  the 
Congo,  in  Somalia,  or  in  West  Africa.  The  most 
prominent  figure  has  certainly  been  Charles  Taylor 
who  was  elected  president  of  Liberia  (1997-2003). 
However,  it  can  be  stated  that  warlordism  is  not  a 
long-term  phenomenon.  Only  in  very  few  cases 
were  these  figures  able  to  keep  their  position  for 
more  than  10  years;  more  often  they  vanished 
from  the  scene,  they  became  ordinary  criminals,  or 
they  were  defeated  or  killed  by  their  enemies  or  by 
internal  rivals.  Even  in  the  case  of  Charles  Taylor, 
the  story  did  not  end  gloriously;  first  he  had  to  live 
in  exile  in  Nigeria  before  he  was  arrested  and 
transferred  to  Sierra  Leone  in  2006,  where  he  faces 
the  Special  Court  on  war  crimes. 

Ulrich  Schneckener 
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Weber,  Max  (1864-1920) 


At  the  beginning  of  the  21st  century,  the 
International  Sociological  Association  asked  its 
members  to  list  five  books  published  in  the  20th 
century  that  were  most  influential  in  their  work  as 
sociologists.  The  book  that  was  listed  most  often 
overall  was  Max  Weber’s  Economy  and  Society, 
and  The  Protestant  Ethic  and  the  Spirit  of 
Capitalism  ranked  fourth.  The  latter  was  also 
listed  among  the  New  York  Public  Library’s 
Books  of  the  Century  (1895-1995).  The  regard 
for  Weber’s  thought  thus  extends  beyond  sociol¬ 
ogy,  and  it  continues  to  inform  a  variety  of  disci¬ 
plines,  including  political  science,  today. 

The  Protestant  Ethic  and 
the  Spirit  of  Capitalism 

The  Protestant  Ethic  and  the  Spirit  of  Capitalism 
is  arguably  Weber’s  best  known  study,  but  its  pur¬ 
pose  and  main  results  have  often  been  misrepre¬ 
sented  or  misunderstood.  Both  terms  in  the  title 
represent  what  Weber  called  “ideal  types,”  that  is, 
they  are  not  directly  observable  entities  but  rather 
constitute  one-sided  conceptual  abstractions  that 
provide  the  social  scientist  with  a  tool  to  better 
understand  and  interpret  aspects  of  reality.  In  this 
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work,  the  particular  aspect  of  reality  that  Weber 
was  most  interested  in  was  a  core  constitutive  ele¬ 
ment  of  the  specifically  modern  type  of  capitalism: 
its  underlying  motivation  or  “spirit.”  For  Weber, 
modern  capitalism’s  constitutive  features  were  not 
only  the  systematic  pursuit  of  profit  and  a  rational 
organization  of  the  production  factors  of  labor 
and  capital  in  the  economy  but  also  a  distinctive 
modern  ethos  that  guided  work  and  acquisition. 

Weber  illustrated  this  ethos  by  using  Benjamin 
Franklin’s  admonitions  of  how  to  act  in  order  to 
succeed  in  business.  Franklin  stressed  methodical, 
industrious,  and  frugal  conduct,  viewing  it  as  an 
ethical  responsibility  to  increase  one’s  material 
well-being  through  honest  hard  labor  and  focusing 
thereby  on  long-term  results.  Such  ethical  respon¬ 
sibility,  or  duty,  to  pursue  one’s  vocation  or  (secu¬ 
lar)  “calling”  in  a  diligent  and  competent — thus,  in 
the  modern  sense,  professional — manner  sets  the 
“spirit”  of  modern  capitalism  and  also  sets  apart 
modern  capitalism  itself  from  other  economic 
motivations  and  forms  of  the  economy.  At  the 
most  general  level,  Weber  distinguishes  capitalism 
and  its  profit  motive  from  (a)  socialism/commu¬ 
nism,  which  does  not  aim  for  profit  making  but  for 
the  satisfaction  of  the  needs  of  a  collective;  (b)  the 
“hand-to-mouth”  existence  in  a  subsistence  econ¬ 
omy,  with  no  surplus  to  trade  on  markets;  and 
(c)  traditionalism,  an  economic  motive  according 
to  which  a  person  works  only  as  much  as  is  neces¬ 
sary  to  achieve  or  sustain  a  certain  standard  of 
living — an  economic  motive  antithetical  to  dynamic 
and  expanding  markets.  Capitalism  itself  can  be 
differentiated  into  market  capitalism  and  what 
Weber  termed  political  capitalism.  In  political 
capitalism,  a  state  uses  its  political  power  and 
resources  to  extract  a  surplus  from  its  own  citizens 
or  even  from  other  countries.  This  use  of  force 
contradicts  the  formally  peaceful  means  employed 
in  modern  capitalism.  The  “spirit”  of  modern 
capitalism  a  la  Franklin  undergirds  modern  ratio¬ 
nal  capitalism  and  sets  it  apart  from  robber  and 
booty  capitalism,  by  which  a  corporation  or  an 
individual  employs  force  and  illegal  activities  to 
gain  an  economic  advantage,  and  from  adventure 
capitalism,  which,  though  not  necessarily  illegal, 
involves  irrational  speculation  and  a  daredevil  atti¬ 
tude.  The  motives  of  adventure  and  robber  capital¬ 
ists  are  incompatible  with  the  spirit  of  capitalism 
and  its  emphasis  on  the  engagement  in  a  vocational 


calling  with  the  aim  to  make  renewed,  long-term 
profit. 

Weber  considered  the  “spirit”  of  modern  capi¬ 
talism  the  heir  of  the  “Protestant  ethic.”  The 
“Protestant  ethic,”  according  to  Weber,  was  a 
form  of  inner-worldly  asceticism  practiced  among 
ascetic  Protestant  groups  that  ultimately  derived 
from  Calvinist  theology  and  its  concept  of  a  “call¬ 
ing.”  These  groups  shared  the  view  that  individu¬ 
als  had  to  prove  themselves  in  their  entire  life 
conduct,  for  it  was  through  ascetic  conduct,  par¬ 
ticularly  in  one’s  profession  and  vocation,  that 
members  could  gain  assurance  of  their  salvation 
status.  Such  ascetic  moral  conduct,  Weber  wrote  in 
a  companion  essay,  “The  Protestant  Sects  and  the 
Spirit  of  Capitalism,”  also  allowed  ascetic 
Protestants  to  gain  and  retain  membership  in 
groups  that  provided  certification  of  moral  qualifi¬ 
cation  and  thus  business  and  social  creditworthi¬ 
ness.  In  this  context,  Weber  acknowledged  not 
only  the  sectarian  discipline  in  these  groups  but 
also  their  flat  hierarchies  and  self-governance, 
which  contributed  to  the  emergence  of  public  citi¬ 
zenship  in  civil  society,  dynamism  in  governance, 
and  the  social  responsibilities  of  the  rights-bearing 
individual.  Ultimately,  however,  while  Calvinist 
doctrine  had  led  to  the  disenchantment  of  the 
world,  as  the  use  of  magical  means  was  no  longer 
considered  legitimate,  paradoxically,  it  also  laid 
the  foundation  for  secularization  (the  decline  of 
religion’s  influence  on  other  social  spheres)  and  the 
embrace  of  materialism  (the  pursuit  of  material 
gain  instead  of  religious  goals),  which  led  Weber  to 
his  oft-quoted  characterization  of  the  modern 
world  as  an  “iron  cage”  devoid  of  meaning. 

In  spite  of  Weber’s  claim  that  his  study  was 
merely  an  illumination  of  how  ideas  can  become 
effective  in  history  and  focused  on  the  emergence 
of  the  modern  professional  (Berufsmenschentum) 
in  its  significance  as  a  component  of  the  capitalist 
spirit,  The  Protestant  Ethic  has  sometimes  been 
seen  as  an  anti-Marxist  idealist  construction  of  his¬ 
tory,  and  many  historians  have  raised  doubts 
about  the  imputed  influence  of  religious  notions 
on  economic  activities  in  the  early  modern  age. 
Weber  himself  sought  to  broaden  the  scope  of  his 
inquiry  by  addressing  the  mutual  influence  of  reli¬ 
gious  and  economic  factors  in  his  writings  on  the 
“Economic  Ethics  of  the  World  Religions,”  in 
which  he  also  widened  his  thematic  focus  beyond 
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religion  and  the  economy  to  religion  and  the  ratio¬ 
nalization  of  all  spheres  of  life. 

Economy  and  Society 

Many  social  scientists  consider  Economy  and 
Society  as  Weber’s  major  work  and  one  of  the 
most  important  contributions  to  the  discipline. 
Due  to  the  fragmented  nature  of  the  compilation, 
it  is  often  used  as  a  compendium  from  which 
scholars  draw  selectively  on  Weber’s  ideas,  fre¬ 
quently  from  the  first  four  chapters,  which  contain 
Weber’s  “Conceptual  Composition”  and  are  the 
only  parts  that  Weber  actually  had  sent  off  to  the 
printer  before  his  death.  In  contrast  to  this  newer 
part,  the  older  parts  of  the  compilation  include 
Weber’s  writings  on  communities,  in  general;  reli¬ 
gious  communities;  law;  domination;  and  the  city. 

Among  the  most  influential  perspectives  is 
Weber’s  sociology  of  domination.  Weber  distin¬ 
guished  between  power  and  authority.  Power  is  the 
chance  to  realize  intentions  even  against  the  resis¬ 
tance  of  others,  whereas  authority  is  the  probabil¬ 
ity  that  specific  commands  are  obeyed  by  a  group 
of  people.  Weber  believed  that  people  typically 
obeyed  commands  so  long  as  they  considered  them 
legitimate.  He  further  specified  three  different 
claims  to  such  legitimacy.  Traditional  legitimacy  is 
based  on  the  belief  in  the  validity  of  long-standing 
traditions  and  customs.  The  ruled  owe  a  personal 
loyalty  to  the  ruler,  who  may  rule  over  a  household 
as  a  patriarch  or  use  staff  or  other  administrative 
apparatus,  as  in  the  case  of  patrimonialism.  The 
social  and  political  order  under  traditional  rule  is 
static.  Charismatic  authority  is  based  on  the 
extraordinary  and  exceptional  qualities,  or  cha¬ 
risma  (literally,  a  “gift  of  grace”),  of  a  person,  who 
in  declaiming  a  “mission”  motivates  others  to 
become  followers  and  even  disciples.  Charismatic 
authority  is  apt  to  break  through  the  static  order  of 
traditional  rule.  It  tends  to  be  unstable,  however, 
as  it  is  tied  to  a  person;  thus,  Weber  addressed  the 
ways  by  which  the  inherently  unstable  nature  of 
charismatic  authority  is  depersonalized  and  trans¬ 
formed  into  a  more  stable  system  of  authority,  a 
process  he  called  the  routinization  of  charisma. 
One  outcome  of  the  routinization  of  charisma  is 
“office  charisma,”  in  which  an  office  is  vested  with 
authority  from  customs  (traditional  authority) 
or  legal-rational  rules  (bureaucracy).  Finally, 


rational-legal  authority  rests  on  the  formal  legal¬ 
ity  of  enacted  rules  and  the  right  of  those  in 
authority  to  enforce  rules.  The  purest  manifesta¬ 
tion  of  such  authority  was  modern  bureaucracy, 
which  in  Weber’s  view  was  the  most  formally 
efficient  and  rational  way  to  administer  complex 
modern  societies.  A  bureaucracy  relies  on  the 
knowledge  of  trained  professional  officials,  whose 
duties  and  obligations  are  expressly  defined  in 
written  regulations  and  whose  employment  and 
promotion  are  based  on  administrative  training 
and  experience. 

Weber’s  views  on  domination  had  implications 
for  his  views  on  politics.  In  modern  society,  the 
bureaucratization  of  society  could  lead  to  its  ossi¬ 
fication  into  an  “iron  cage,”  governed  by  bureau¬ 
crats,  who,  unlike  politicians,  were  typically  not 
elected  nor  subject  to  direct  public  oversight.  He 
foresaw  the  bureaucratization  of  modern  capital¬ 
ism  through  the  proliferation  of  bureaucratically 
run  corporations  and  the  bureaucratization  of 
politics  through  the  emergence  of  party  machines 
run  by  political  functionaries.  He  believed  that 
what  he  called  a  plebiscitary  leader  democracy,  in 
which  a  charismatic  leader  injects  politics  with 
dynamism  yet  remains  tied  to  approbation  by  the 
ruled  masses,  could  be  a  solution  to  this  problem 
of  the  bureaucratization  of  politics.  In  one  of  his 
most  famous  lectures,  “The  Profession  and 
Vocation  of  Politics,”  he  elaborated  on  this 
thought. 

“The  Profession  and  Vocation  of  Politics” 

Weber  noted  that  the  modern  state  is  character¬ 
ized  as  a  community  that  successfully  claims  a 
monopoly  on  legitimate  physical  force  on  a  cer¬ 
tain  territory,  using  a  bureaucratic  apparatus  and 
holding  ownership  over  administrative  means 
(including  those  that  involve  force).  Weber  not 
only  related  his  notions  about  the  state  to  his 
views  on  authority  and  power  but  also  more  pre¬ 
cisely  outlined  the  characteristics  of  a  modern 
politician,  whom  he  viewed  as  the  product  of 
larger  societal  trends  of  democratization,  bureau¬ 
cratization,  and  professionalization.  The  modern 
politician  was  a  full-time  professional  who  lived 
both  for  and  off  politics.  The  action  of  politicians, 
in  contrast  to  that  of  bureaucrats,  is  not  about 
administering  rules  and  regulations  impartially 
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but  governed  by  the  principle  of  taking  responsi¬ 
bility  in  leadership.  In  doing  so,  the  politician  can 
adopt  either  an  “ethic  of  conviction,”  which  is 
motivated  by  good  intentions  but  does  not  take 
into  account  the  outcomes  of  action,  or  an  “ethic 
of  responsibility,”  which  takes  them  into  account 
and  which  Weber  personally  preferred  over  the 
former  and  considered  more  appropriate  for  mod¬ 
ern  politics.  Leadership  by  professional  politicians 
is  part  of  the  struggle  among  parties,  which  Weber 
saw  headed  toward  governance  by  an  impersonal 
apparatus,  unless  a  politician  who  saw  politics  as 
both  profession  and  vocation  established  a  plebi¬ 
scitary  form  of  leadership  democracy.  This  notion 
has  proved  controversial,  for  some  scholars  have 
viewed  Weber’s  concept  as  an  endorsement  of 
nationalist  power  politics  that  paved  the  way  to 
fascist  rule  in  Europe  after  Weber’s  death,  while 
others  have  pointed  to  Weber’s  cosmopolitan, 
Anglophile  political  liberalism  to  reject  this  view. 

Lutz  Kaelber 
University  of  Vermont 
Burlington,  Vermont,  United  States 
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Weighted  Least  Squares 


Weighted  least  squares  (WLS)  is  a  standard  com¬ 
pensation  technique  for  nonconstant  error  variance 


(heteroscedasticity),  which  is  common  in  political 
science  data.  By  assigning  individual  weights  to 
the  observations  the  heteroscedasticity  can  be 
removed  by  design.  The  square  root  of  the  inverse 
of  the  error  variance  of  the  observation  is  typically 
used  as  a  weight.  The  key  idea  is  that  less  weight 
is  given  to  those  observations  with  a  large  error 
variance.  This  forces  the  variance  of  the  residuals 
to  be  constant.  WLS  is  an  example  of  the  broader 
class  of  generalized  least  squares  estimators.  The 
idea  was  first  presented  by  Alexander  Aitken 
(1935).  The  foundations  of  this  procedure  and  a 
concrete  example  are  presented  below. 

Motivation 

Standard  regression  modeling  assumes  that  the 
variance  of  the  residuals  is  constant  across  the 
levels  of  the  explanatory  variables.  Often  in  politi¬ 
cal  science  this  is  violated,  causing  potential  prob¬ 
lems  of  fit  and  interpretation.  George  Downs  and 
David  Rocke  (1979)  give  the  well-known  exam¬ 
ples  of  increasing  spread  around  consumption  as 
income  increases,  increasing  spread  in  the  propor¬ 
tion  of  felons  incarcerated  in  states  as  legislative 
interest  increases,  and  decreasing  spread  in  that 
proportion  as  the  average  education  level  in  the 
state  decreases.  All  of  these  are  examples  where 
one  aspect  of  the  relationship  between  the  explan¬ 
atory  variable  and  the  outcome  variable,  outcome 
uncertainty,  changes  as  the  levels  of  the  right-hand 
side  are  altered.  WLS  addresses  this  problem  by 
modeling  this  heteroscedasticity  rather  than  ignor¬ 
ing  it.  Thus,  there  is  a  trade-off  between  increased 
model  complexity  and  increased  interpretational 
complexity. 

Theory 

The  ordinary  linear  model  has  the  form  y  =  Xp  +  £, 
where  y  is  an  n  X  1  outcome  vector  with  continu¬ 
ous  measure,  X  is  an  n  X  k  nonsingular  matrix 
with  explanatory  variables  down  the  columns  and 
a  leading  column  of  ones,  P  is  a  k  X  1  parameter 
vector  to  be  estimated,  and  8  is  an  n  X  1  error  vec¬ 
tor  with  assumed  mean  zero.  The  ordinary  least 
squares  (OLS)  estimator  of  3  is  arrived  at  by 
minimizing  the  squared  error  terms  and  is  formed 
by  (X'X)_1X'y.  In  the  presence  of  heteroscedastic¬ 
ity,  the  OLS  estimator  of  P  is  not  the  best  linear 
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unbiased  estimator  (BLUE).  The  term  best  here 
means  that  the  estimator  achieves  the  minimum 
possible  variance. 

WLS  allows  one  to  reformulate  the  model  and 
generate  estimates  that  are  in  principle  the  BLUE. 
The  introduction  of  a  weight  matrix  Cl  into  the 
calculation  of  (3  removes  the  heteroscedasticity 
from  the  model.  The  LI  matrix  is  created  by  taking 
the  error  variance  of  the  zth  case  (estimated  or 
known),  v(,  and  assigning  it  to  the  z’th  diagonal,  ClH 
=  l/v(  =  o)(,  leaving  the  off-the-diagonal  elements 
as  zero.  In  this  way,  large  error  variances  are 
reduced  by  premultiplying  the  model  terms  by  this 
reciprocal.  We  can  premultiply  each  term  in  the 
standard  linear  model  setup  by  the  square  root  of 
the  il  matrix  (i.e.,  by  the  standard  deviation).  This 
“square  root”  is  actually  produced  from  a  Cholesky 
factorization:  If  A  is  a  positive  definite  symmetric 
(A'  =  A)  matrix,  then  there  must  exist  a  matrix  G 
such  that  A  =  GG'.  A  matrix  A  is  positive  definite 
if  for  any  nonzero  p  X  1  vector  x,  x'Ax  >  0.  In  our 
case,  this  decomposition  is  greatly  simplified 
because  the  Cl  matrix  has  only  diagonal  values  (all 
off-the-diagonal  values  equal  zero).  Therefore,  the 
Cholesky  factorization  is  produced  simply  from  the 
square  root  of  these  diagonal  values.  Premultiplying 
gives 

Cl112 y  -  li  2X(3  +  a1/2£. 

Instead  of  minimizing  the  squared  errors  in  the 
usual  manner,  we  now  minimize  (y  —  X0)'O(y  — 
X0),  and  the  subsequent  WLS  estimator  is  calcu¬ 
lated  as  P  =  (X'CIX)  ^'Lly.  The  WLS  estimator 
gives  theoretically  the  BLUE  of  the  coefficient  esti¬ 
mator  in  the  presence  of  heteroscedasticity. 

Weighted  Least  Squares  and 
Feasible  Weighted  Least  Squares 

In  this  setup,  it  is  required  that  the  variance  of  the 
error,  v„  be  known.  In  principle,  there  are  two  pos¬ 
sibilities:  (1)  vt  is  derived  from  the  underlying  data- 
generating  process  or  (2)  v(  is  estimated.  An 
example  of  the  first  is  the  linear  probability  model 
in  which  the  structure  of  heteroscedasticity  is 
known.  In  a  binary  model,  the  variance  is  Var 
(e(.)  =  X,P(1  —  Xjl).  This  gives  an  expression  of 
the  form  of  heteroscedasticity  and  allows  the  ana¬ 
lyst  to  estimate  a  linear  model  with  WLS.  The 


weights  are  directly  computed  by  using  the  OLS 
estimates  of  P  to  compute  Var(£,).  Because  POLS  is 
an  unbiased  estimator,  Var(£;)  is  also  unbiased.  A 
possible  obstacle  here  is  that  the  linear  probability 
model  may  produce  a  y,  that  lies  outside  the  [0,  I J 
interval  and  therefore  produces  negative  weights. 

Often  the  form  of  heteroscedasticity  is  not 
known,  and  v,  rather  than  v  is  used.  By  relying  on 
an  estimate  of  v(,  the  WLS  estimator  is  no  longer 
unbiased.  But  it  is  still  a  consistent  estimate  and 
asymptotically  more  efficient  than  the  OLS  estima¬ 
tor.  This  is  often  referred  to  as  feasible  weighted 
least  squares  (LWLS).  This  implies  a  two-step  pro¬ 
cedure.  In  the  first  step,  a  linear  model  is  estimated 
using  OLS,  and  based  on  £0ls,  one  can  derive  v  and 
therefore  Cl.  The  LWLS  estimate  is  obtained  by 
minimizing  (y  —  XP)'L1  (y  —  XP).  The  next  sec¬ 
tion  illustrates  an  LWLS  estimation. 

Example 

This  illustration  is  based  on  political  data  from 
Swiss  cantons  in  1990  (Adrian  Vatter,  Markus 
Lreitag,  Christoph  Muller,  &  Marc  Biihlmann, 
2004).  The  outcome  variable  is  the  number  of 
cantonal  employees  per  1,000  inhabitants.  The 
two  predictor  variables  are  the  degree  of  propor¬ 
tionality  (PR)  in  the  electoral  system  and  cantonal 
gross  domestic  product  (GDP,  in  1,000s).  The 
results  from  a  standard  OLS  estimation  of  the 
linear  model  do  not  allow  rejection  of  the  null 
hypothesis  for  either  of  the  explanatory  variables 
at  standard  thresholds. 

Based  on  the  results  presented  in  Table  1,  the 
conclusion  is  that  the  degree  of  PR  has  no  effect  on 
the  number  of  public  employees.  It  is  possible  to 
test  for  heteroscedasticity  by  using,  for  example, 
the  Breusch-Pagan  test  (Trevor  Breusch  &  Adrian 
Pagan,  1979;  Dennis  Cook  &  Sanford  Weisberg, 
1983).  In  this  example,  the  squared  residuals  (e^) 
are  regressed  on  the  predicted  values  of  the  out¬ 
come  (y,).  If  the  residuals  have  a  common  vari¬ 
ance,  the  explanatory  power  of  the  regression  is 
low.  The  null  hypothesis  of  the  test  states  a  con¬ 
stant  error  variance.  In  the  example  here,  the  test 
value  is  5.68  (X~  with  1  degree  of  freedom),  which 
corresponds  to  a  p  value  of  .017.  Based  on  this,  the 
null  hypothesis  can  be  rejected. 

In  Ligure  1,  the  residuals  from  the  OLS  estima¬ 
tion  are  plotted  against  the  outcome  variable. 
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Table  I  Comparison  of  Ordinary  Least  Squares 

(OLS)  and  Weighted  Least  Squares  (WLS) 


Variable 

OLS 

WLS 

GDP 

0.14 

0.07 

(0.74) 

(1.13) 

PR 

-1.95 

-1.47 

(0.46) 

(2.15) 

Intercept 

26.14 

24.46 

(2.40) 

(12.74) 

N 

26 

26 

Breusch-Pagan 

5.68 

(p  value) 

(.017) 

Source:  Based  on  data  from  Vatter,  A.,  Freitag,  M.,  Muller, 
C.,  &  Biihlmann,  M.  (2004).  Political,  social,  and  economic 
data  of  the  Swiss  cantons  1983-2002.  Bern,  Switzerland: 
University  of  Bern. 

Notes:  The  outcome  variable  is  the  number  of  public  employees 
per  1,000  habitants.  Absolute  t  values  are  in  parentheses. 
GDP  =  gross  domestic  product,  in  1 ,000s;  PR  =proportionality . 


There  is  a  strong  positive  trend,  which  visually 
confirms  the  Breusch-Pagan  test  result.  The  WLS 
approach  is  an  effective  way  to  address  the  het- 
eroscedasticity  in  such  cases.  Based  on  the  estima¬ 
tion  results  of  the  OLS  procedure,  it  is  possible  to 
derive  £1,  where  fi„  =  l/e,2  and  e  is  the  estimated 
error  of  the  OLS  procedure.  The  second  column  in 
Table  1  reports  the  results  from  the  WLS  proce- 
dure,  ft  y  =  ft  Xf5  +  fl  s.  The  GDP  of  a 
canton  still  does  not  have  a  reliable  effect  on  the 
outcome  variable.  But  in  the  WLS  procedure,  the 
effect  of  the  degree  of  PR  on  the  number  of  can¬ 
tonal  employees  is  negative  and  significant  at  con¬ 
ventional  levels. 
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Figure  I  Outcome  Variable  Versus  Ordinary  Least 
Squares  (OLS)  Errors 

Source:  Based  on  data  from  Vatter,  A.,  Freitag,  M.,  Muller, 
C.,  &  Biihlmann,  M.  (2004).  Political,  social,  and  economic 
data  of  the  Swiss  cantons  1983-2002.  Bern,  Switzerland: 
University  of  Bern. 


Huber- White  sandwich  estimator  can  be  used  to 
correct  for  heteroscedasticity.  WLS  estimation  is  a 
standard  regression  tool  for  social  scientists  and 
others,  and  is  used  in  iteratively  WLS  to  estimate 
generalized  linear  models  (Jeff  Gill,  2007). 

Lucas  Leemann 
Columbia  University 
New  York  City,  New  York,  United  States 

Jeff  Gill 

Washington  University  in  St.  Louis 
St.  Louis,  Missouri,  United  States 


Conclusion 

WLS  allows  analysts  to  estimate  linear  models  in 
the  presence  of  heteroscedasticity.  Premultiplying 
the  observations  by  a  weight  matrix,  £1,  makes  the 
error  variance  constant.  There  are  also  other  pos¬ 
sible  remedies  for  heteroscedasticity.  First,  het¬ 
eroscedasticity  may  be  the  result  of  a  misspecified 
model  and  may  require  the  researcher  to  change 
the  model.  Second,  the  OLS  estimates  may  still  be 
unbiased,  and  robust  standard  errors  such  as  the 
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Welfare  Policies 


This  entry  examines  the  role  of  the  state  and  the 
political  dimension  surrounding  the  recognition  of 
social  needs  to  improve  the  well-being  of  citizens. 
It  then  discusses  the  multiple  meanings  of  the  term 
welfare  policies  across  jurisdictions  and  analyzes 
welfare  policies  defined  as  actions  to  provide  a 
basic  level  of  subsistence  for  individuals.  The  type 
of  policy  instruments  used  to  address  basic  needs 
has  also  generated  a  rich  literature  on  the  causes 
of  poverty  and  on  the  consequences  of  welfare 
policies,  which  is  summarized  in  the  third  section. 
The  conclusion  of  the  entry  presents  two  new  rec¬ 
ognized  needs:  (1)  low  wages  obtained  by  the 
working  poor  and  (2)  social  exclusion. 

Generally,  welfare  policies  represent  various 
programs,  transfers,  and  assistance  provided  by  the 
state  or  other  organizations  aimed  at  improving  the 
well-being  of  individuals.  It  is  impossible  to  develop 
policies  to  fulfill  every  citizen’s  well-being,  as  there 
are  unlimited  wants  but  limited  resources.  As  a 
result,  governmental  authorities  must  decide  on 
welfare  priorities,  which  generate  political  debates 
usually  involving  a  discussion  of  social  needs  and 
risk.  First,  most  welfare  policies  seek  to  address 
recognized  social  needs.  As  stressed  by  the  sociolo¬ 
gist  Richard  Titmuss,  defining  what  constitutes  a 
need  is  very  trivial  and  it  is  constantly  changing. 
For  example,  Titmuss  mentioned  that  Britain’s 
needs  in  1 900  were  very  different  from  those  in  the 
1950s.  The  process  of  need  recognition  and,  subse¬ 
quently,  the  development  of  a  policy  to  address  a 
need  can  be  long  and  arduous.  Providing  income 
relief  to  the  elderly  and  facilitating  alternatives  to 


the  poorhouse  took  decades,  and  it  took  nearly  a 
century  to  raise  substantially  the  living  standards 
of  retirees.  Major  events,  such  as  wars,  can  accel¬ 
erate  both  the  identification  of  needs  and  the 
implementation  of  solutions  to  address  them.  For 
example,  a  major  extension  of  welfare  policy  in 
Japan  is  attributed  to  World  War  II  with  one  key 
issue  being  the  ill  health  of  its  citizens,  resulting  in 
high  rates  of  failed  physicals  to  join  the  army. 

There  are  other  elements  that  affect  the  recogni¬ 
tion  of  needs.  For  example,  citizens  tend  to  con¬ 
sider  the  needs  of  seniors  more  favorably  than  the 
needs  of  young  adults  because  the  former  are  con¬ 
sidered  to  be  more  worthy  and  vulnerable.  Thus, 
there  will  be  stronger  support  to  provide  social 
assistance  for  an  elderly  citizen  who  failed  to  rein¬ 
tegrate  the  labor  market  near  retirement  age  than 
a  young  citizen  who  cannot  find  employment  on 
graduation  even  though  both  conditions  can  be 
caused  by  a  weak  labor  market.  The  recognition  of 
need  can  also  be  based  on  other  factors  such  as  the 
meaning  of  citizenship.  Historically,  in  the  United 
States,  access  to  welfare  policies  often  excluded 
African  Americans,  and  race  continues  to  perme¬ 
ate  discussions  surrounding  welfare  policies.  In  a 
similar  optic,  the  gender  critique  stresses  that  rec¬ 
ognized  needs  often  fail  to  consider  women.  Also, 
immigrants  often  face  multiple  obstacles  to  obtain¬ 
ing  the  same  treatment  as  citizens  who  were  born 
in  the  country  of  residence.  In  Canada,  for  exam¬ 
ple,  immigrant  retirees  must  have  resided  in  the 
country  for  at  least  10  years  prior  to  receiving  the 
Guaranteed  Income  Supplement,  which  is  given  to 
low-income  retirees. 

The  role  of  the  state  with  regard  to  social  needs 
is  also  politically  salient.  In  multiple  jurisdictions, 
social  assistance  is  still  not  fully  endorsed  by  all 
elected  officials  as  a  need  that  must  be  covered  by 
governmental  authorities.  Worse,  a  need  can  be 
recognized  and  the  state  can  opt  to  address  it,  but 
policies  may  still  fail  to  resolve  it.  The  lack  of  uni¬ 
versal  coverage  granted  by  the  American  health 
system  has  been  acknowledged,  but  reform  efforts 
have  failed  to  resolve  this  issue.  Some  needs  also 
consist  of  policy  problems  that  cannot  be  fully 
resolved.  It  can  be  possible  to  reduce  poverty,  but 
eradicating  poverty  may  be  impossible. 

An  alternative  way  to  conceive  welfare  policies 
is  by  considering  the  ways  in  which  they  protect 
individuals  against  social  risk.  This  form  of  welfare 
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policies  is  often  termed  social  insurance  because  the 
policies  fulfill  primarily  insurance  functions.  This 
includes,  for  example,  sickness,  unemployment, 
disability,  and  retirement  insurance.  Other  pro¬ 
grams  such  as  active  labor  market  policies  also  fall 
into  this  category  since  they  seek  to  alleviate  the 
risk  of  an  individual  to  leave  the  labor  market  and 
fall  into  poverty. 

Within  this  perspective,  an  interesting  debate 
spearheaded  by  the  Danish  sociologist  Gosta 
Esping-Andersen  has  sought  to  classify  industrial¬ 
ized  nations  within  welfare  regimes.  In  a  liberal 
regime,  exemplified  by  the  United  Kingdom  (UK) 
and  the  United  States,  the  private  sector  plays  a 
dominant  role  in  providing  social  insurance  and 
the  state  is  involved  primarily  as  a  last  resort.  In  a 
corporatist  regime,  the  state  has  sought  to  preserve 
status  along  occupational  lines,  which  explains  the 
complexity  and  multiplicity  of  social  insurance 
schemes  in  countries  such  as  France  and  Germany. 
The  state  plays  an  integral  role  in  providing  gener¬ 
ous  universal  coverage  in  Scandinavian  countries, 
which  typifies  the  Social  Democrat  regime.  While 
Esping-Andersen  argues  that  there  are  three  wel¬ 
fare  regimes,  critics  have  pointed  out  that 
Australasia  and  Southern  Europe  are  distinct  and 
could  represent  welfare  regimes  in  their  own  right. 
The  feminist  critique  has  been  the  most  salient, 
however,  by  demonstrating  that  women  are  not 
attached  to  the  labor  market  in  ways  similar  to 
men,  resulting  in  a  very  different  experience  of 
social  insurance. 

In  the  past  10  years,  a  strand  of  the  welfare  lit¬ 
erature  has  focused  on  the  emergence  of  new  social 
risks  caused  by  the  socioeconomic  changes  that 
have  transpired  with  the  transition  to  a  postindus¬ 
trial  society.  This  shift  has  led  to  an  increasing 
number  of  single-parent  families  and  elderly,  to 
employment  precariousness,  and  to  the  rising 
importance  of  education  and  training  in  the  labor 
market.  This  literature  on  social  investment 
assumes  that  the  role  of  the  state  is  not  to  satisfy  a 
basic  need  but  rather  to  invest  in  individuals  in 
order  to  empower  them  to  assume  more  responsi¬ 
bilities.  As  such,  it  seeks  to  make  individuals  more 
competitive  in  the  new  economy. 

The  literature  on  both  social  needs  and  risks 
has  stressed  the  impact  of  preexisting  programs  on 
the  capacity  of  the  state  to  adapt  its  welfare  poli¬ 
cies  to  new  realities.  Once  a  welfare  program  is 


adopted  to  fulfill  specific  needs  and/or  cover  a 
social  risk,  it  fosters  a  clientele  and  support 
groups,  which  can  facilitate  its  expansion.  This 
clientele  will  likely  oppose  the  creation  of  new 
programs  or  a  redefinition  of  welfare  objectives. 
Further,  there  is  a  substantial  literature  stressing 
that  organizations  have  a  bias  toward  improving 
existing  programs,  which  occurs  at  the  expense  of 
developing  new  ones. 

These  conclusions  are,  however,  being  increas¬ 
ingly  challenged.  In  light  of  increasing  budgetary 
pressures  that  are  accentuated  by  an  aging  popula¬ 
tion  and  a  recent  financial  crisis,  multiple  reform 
initiatives  have  been  introduced.  Contrary  to  pre¬ 
vious  expectations,  political  leaders  have  not  shied 
away  from  substantial  reforms  even  if  it  leads  to  a 
noticeable  impact  on  opinion  polls.  The  literature 
has  been  quite  eclectic  in  providing  explanations 
for  this  reversal  of  fortune  for  the  welfare  state. 
While  some  contributions  have  reverted  to  tradi¬ 
tional  cleavages  such  as  class  and  party  politics  or 
a  reconsideration  of  arguments  related  to  global¬ 
ization,  others  have  targeted  the  role  of  political 
leadership  in  altering  successfully  the  discourse 
surrounding  the  goals  and  objectives  of  the  welfare 
state.  The  latter  stresses  that  the  basic  understand¬ 
ing  of  social  rights  has  been  altered.  In  many 
instances,  such  as  in  health  and  pensions,  the  state 
has  been  assuming  less  responsibility  by  facilitat¬ 
ing  the  creation  and/or  expansion  of  private  alter¬ 
natives  while  reducing  the  generosity  of  public 
benefits.  The  roles  of  international  financial  asso¬ 
ciations  have  been  considered  to  be  particularly 
important  in  prompting  privatization  in  develop¬ 
ing  countries.  Nonprofit  organizations  have  also 
seen  their  role  expand  in  many  countries  by  pro¬ 
viding  a  growing  range  of  welfare  services  to  an 
increasing  number  of  citizens. 

Multiple  Meanings  of  the 
Term  Welfare  Policies 

It  is  important  to  note  that  there  are  many  national 
differences  with  regard  to  what  welfare  policies 
entail.  For  example,  the  term  welfare  has  tradition¬ 
ally  been  used  in  the  United  States  as  a  synonym  for 
transfer  payments  to  the  poor  (e.g.,  “being  on  wel¬ 
fare”).  As  such,  it  is  a  synonym  for  social  assistance. 
In  contrast,  welfare  has  a  more  general  use  in 
Sweden,  focusing  mostly  on  social  services,  but  it 
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can  also  include  policies  such  as  taxation,  pen¬ 
sions,  employment,  health  care,  and  social  assis¬ 
tance.  In  the  latter  case,  the  term  is  closely  related 
to  welfare  state.  What  welfare  policies  include  and 
do  not  include  has  been  the  source  of  multiple 
debates  in  the  social  sciences.  The  remainder  of 
this  entry  focuses  primarily  on  welfare  policies  in 
the  context  of  providing  a  basic  level  of  subsistence 
for  individuals.  Beyond  the  enactment  of  programs 
and  services,  this  also  includes  goals  such  as  allevi¬ 
ating  poverty  and  social  exclusion. 

The  role  of  the  state  in  combating  poverty  var¬ 
ies  noticeably  across  industrialized  countries.  The 
use  of  means-tested  benefit,  where  the  state  plays 
the  role  of  last  resort,  is  predominant  in  the  UK 
and  its  former  colonies  (i.e.,  Australia,  Canada, 
New  Zealand,  and  the  United  States)  but  not  con¬ 
fined  exclusively  to  this  group  of  countries.  In 
these  cases,  the  influence  of  the  English  Poor  Laws 
remains.  Although  previous  acts  and  practices 
dealt  with  the  poor  and  the  working  poor,  the  first 
set  of  formalized  poor  laws  was  adopted  in  the 
early  17th  century  (Old  Poor  Law),  followed  by 
the  New  Poor  Law  in  1834.  Immediately  on  its 
creation,  a  sharp  distinction  was  made  to  distin¬ 
guish  between  those  considered  too  ill  or  too  old 
to  work  and  those  capable  of  working.  This  dis¬ 
tinction  is  often  stated  as  a  difference  between  the 
deserving  and  undeserving  poor.  Individuals  in 
the  latter  group  had  to  enter  workhouses  (in  the 
United  States,  these  were  referred  to  as  poor- 
houses),  whose  conditions  were  pitiful  and  where 
individuals  were  often  referred  to  as  inmates.  This 
was  done  deliberately  to  ensure  that  this  would  be 
a  solution  of  last  resort  in  order  to  receive  poor 
relief. 

The  creation  and  expansion  of  friendly  societ¬ 
ies,  the  emergence  of  social  programs  (such  as 
unemployment  insurance  and  old-age  pensions), 
and  changes  in  attitude  toward  the  poor  resulted  in 
the  abolition  of  the  workhouses  in  the  early  20th 
century.  The  causes  behind  these  welfare  improve¬ 
ments  have  been  the  object  of  multiple  debates  in 
political  science.  They  have  been  attributed  to 
protest  actions  by  the  poor,  industrialization,  the 
rise  of  socialist  parties  and  unions,  the  leadership 
role  of  the  civil  service,  state  capacity,  and  even 
war.  While  the  private  sector  continues  to  perform 
a  role  in  providing  welfare  provisions,  whose 
importance  varies  across  jurisdictions,  the  state  is 


now  the  primary  welfare  giver  in  industrialized 
countries.  However,  in  many  developing  countries, 
the  importance  of  employers  and  other  private  par¬ 
ties  remains  noticeable.  When  employers  provide 
social  benefits,  which  can  include  child  allowance, 
death  benefits,  and  health  and  pension  insurance, 
to  name  a  few,  it  is  usually  termed  occupational 
welfare  since  the  type  and  scope  of  benefits  offered 
varies  across  occupations.  Although  not  provided 
by  the  state  and  granted  exclusively  to  some  mem¬ 
bers  of  the  labor  force,  these  benefits  often  receive 
a  subsidy  by  the  state  to  provide  an  incentive  to 
both  employers  and  employees  to  establish  this 
kind  of  social  benefit. 

Types  of  Welfare  Policies 

There  are  currently  multiple  types  of  welfare  poli¬ 
cies  based  on  needs  and  instruments.  The  most 
common  form  of  welfare  policies  involves  the 
transfer  of  financial  resources  where  recipients 
receive  a  sum  of  money  or  a  tax  credit.  Lirst,  there 
are  policies  put  in  place  to  provide  a  source  of 
income  for  individuals  living  with  disabilities.  This 
can  include  physical  and  psychological  disability, 
and  eligibility  is  usually  based  on  medical  recom¬ 
mendations  and  the  extent  to  which  an  individual 
can  thrive  in  the  labor  market.  Second,  financial 
assistance  is  granted  to  poor  retirees  based  on  the 
value  of  their  retirement  income  and  assets.  Third, 
social  assistance  is  granted  to  individuals  who,  for 
various  reasons,  exit  or  fail  to  enter  the  labor  mar¬ 
ket.  Lourth,  payments  can  be  made  for  families  to 
help  raise  children.  Rather  than  giving  financial 
assistance  directly  to  the  recipient,  the  state  can 
also  subsidize  basic  needs  or  limit  the  goods  and 
services  an  individual  can  obtain  with  the  financial 
support  offered.  This  is  achieved  by  tying  benefits 
to  specific  needs  such  as  housing  (subsidy)  and 
food  (food  stamps).  Linancial  help  cannot  assist 
with  issues  such  as  sickness,  substance  abuse,  or 
being  homeless.  As  a  result,  welfare  policies  also 
involve  a  wide  range  of  services  such  as  health 
care,  shelters,  and  job  training. 

In  spite  of  strong  variance  in  the  level  of  support 
across  industrialized  countries,  survey  research 
demonstrates  that  benefits  targeting  the  old  and 
the  disabled  receive  substantially  more  support 
than  policies  assisting  other  groups.  The  schism 
between  deserving  and  undeserving  poor  remains 
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strongly  embedded  in  most  debates  surrounding 
welfare. 

Still,  today,  welfare  remains  stringent  on  fulfill¬ 
ing  social,  economic,  and  financial  conditions.  The 
demonstration  of  needs  plays  a  key  role  in  access¬ 
ing  benefits.  Means-tested  benefits  are  often  the 
norm,  which  implies  that  individuals  must  demon¬ 
strate  a  lack  of  resources  and/or  a  specific  condi¬ 
tion  to  be  eligible.  The  means  test  can  actually  take 
multiple  forms.  When  related  to  a  medical  condi¬ 
tion,  individuals  must  often  substantiate  the  claims 
that  they  are  unable  to  participate  in  the  labor 
force.  Authorities  can  also  develop  criteria  on  the 
basis  of  various  sources  of  income  (including 
wages,  dividends,  and  interest  earned)  and/or 
assets  such  as  an  automobile,  a  dwelling,  or  even 
artwork.  Moreover,  the  income  and  assets  of  a 
third  party  can  also  be  considered  for  assessing 
eligibility.  It  is  quite  common  to  consider  the  rev¬ 
enues  and/or  assets  of  a  spouse  or  partner,  but  this 
is  sometimes  extended  to  others  such  as  parents 
(which  can  occur  in  the  case  of  students  seeking 
public  financial  aid)  or  roommates  in  the  case  of 
shared  living  space. 

The  means  test  can  also  apply  to  socioeco¬ 
nomic  conditions  that  are  not  related  to  health 
and  financial  resources  such  as  age,  citizenship, 
location,  and  marital  status.  Younger  adults  face 
the  most  difficulty  in  receiving  social  assistance 
with  the  requirement  of  prior  work  experience 
being  the  most  important.  They  also  tend  to 
receive  less  generous  benefits  irrespective  of 
employment  opportunities.  Citizenship  also  mat¬ 
ters  quite  often  when  it  comes  to  means-tested 
benefits.  Recent  immigrants  are  often  excluded 
from  receiving  state  aid.  In  some  instances,  author¬ 
ities  consider  the  region  where  a  claimant  resides 
to  increase  or  restrict  the  generosity  of  social  assis¬ 
tance.  In  regions  where  unemployment  is  wide¬ 
spread,  eligibility  criteria  might  be  more  relaxed 
than  in  regions  where  labor  shortages  are  present. 
Marital  status  continues  to  play  a  role  in  the  treat¬ 
ment  of  need.  The  way  in  which  an  individual 
becomes  a  single  parent  is  sometime  considered  in 
the  application  of  the  means  test,  with  a  widower 
receiving  preferential  treatment  to  someone  who 
failed  to  declare  a  parent. 

As  a  result,  street-level  bureaucrats,  or  third  par¬ 
ties  such  as  medical  doctors,  often  play  a  key  role 
in  assessing  who  can  and  cannot  receive  assistance. 


The  stigma  attached  to  welfare  often  results  in 
admissible  individuals  not  seeking  their  benefits. 
For  example,  the  British  Treasury  still  assumes  in 
its  budget  that  25%  of  elderly,  considered  to  be 
deserving  poor,  will  not  collect  their  means-tested 
pension.  Another  issue  involves  the  process  on 
which  individuals  receive  the  benefits.  It  is  often 
onerous,  intrusive,  and  complicated.  The  applica¬ 
tion  for  the  British  Pensions  Credit  has  18  pages, 
which  can  make  reading  cumbersome  for  many 
seniors  and  make  the  benefit  less  accessible.  This  is 
in  stark  contrast  to  the  Canadian  case  where  fed¬ 
eral  authorities  consider  only  taxable  income. 
Individuals  can  access  the  benefit  easily  by  adding 
a  one-page  attachment  when  they  file  their  income 
tax. 

At  the  heart  of  welfare  policies  are  two  impor¬ 
tant  debates  on  the  causes  of  poverty  and,  ulti¬ 
mately,  reliance  on  welfare  benefits.  First,  to  what 
extent  does  welfare  cause  dependency?  This  has 
been  a  hallmark  of  recent  American  debates  sur¬ 
rounding  welfare.  Conservative  critics  have  stressed 
that  welfare  policies  create  disincentive  to  work 
and  create  a  culture  of  poverty  that  makes  a  return 
(or  entry  into)  to  the  labor  market  less  likely. 
These  concerns  are  hardly  new.  The  Royal 
Commission  of  1832  in  England,  established  to 
study  the  Poor  Law,  was  particularly  concerned 
with  the  impact  of  granting  cash  benefits  to  non¬ 
working  individuals.  Thus,  it  stressed  that  indi¬ 
viduals  in  poorhouses  should  face  conditions 
worse  than  those  of  the  poorest  working  poor.  An 
underlying  assumption  by  these  critics  is  that  indi¬ 
viduals  on  welfare  are  primarily  responsible  for 
this  outcome  and  eventually  feel  entitled  to  their 
allowance.  With  the  expansion  of  welfare  benefits 
to  a  higher  number  of  individuals,  they  became 
one  of  the  neoconservatives’  favorite  targets  for 
retrenchment.  In  the  past  25  years,  multiple  juris¬ 
dictions  have  tightened  qualifying  conditions  to 
receive  welfare  benefits  with  countries  such  as 
Canada  (particularly  the  states  of  Alberta  and 
Ontario),  the  United  States,  and  the  UK  even  intro¬ 
ducing  some  forms  of  workfare,  whereby  individu¬ 
als  must  perform  compulsory  labor  or  service  to 
obtain  their  benefit. 

Critics  have  emphasized  that  individuals  on  wel¬ 
fare  are  not  themselves  mostly  responsible  for  their 
inability  to  find  work,  as  indicated  by  the  rise  of 
welfare  cases  during  a  recession.  Moreover,  the  use 
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of  the  word  dependency  in  the  context  of  welfare 
policies  evokes  powerful  negative  images  of  specific 
groups,  such  as  single  African  American  mothers  in 
the  case  of  the  former  Aid  to  Families  with 
Dependent  Children  (AFDC)  program,  and  it  fails 
to  acknowledge  that  non-welfare  recipients  also 
receive  substantial  state  support.  For  example,  the 
“hidden”  welfare  state  in  the  United  States,  consist¬ 
ing  of  multiple  tax  credits,  costs  far  more  than  the 
totality  of  welfare  programs  targeting  the  poor. 

Second,  another  important  debate  within  the 
literature  focuses  on  the  consequences  of  having 
welfare  policies  targeted  at  specific  groups  of  indi¬ 
viduals,  as  opposed  to  being  universal.  The  more 
universal  a  benefit  is,  the  more  likely  that  it  will 
receive  strong  political  support,  which  is  in  stark 
contrast  to  targeted  benefits  such  as  social  assis¬ 
tance,  where  beneficiaries  lack  numbers  and 
resources  to  bolster  their  support.  Survey  research 
demonstrates  that  citizens  are  more  likely  to  sup¬ 
port  universal  programs  such  as  education  than 
targeted  programs  such  as  subsidized  housing  and 
social  assistance.  The  retrenchment  literature  has 
actually  stressed  that  politicians  have  been  far 
more  successful  with  the  latter. 

Targeting  the  poor  (or  any  other  group)  may 
actually  result  in  a  growing  gap  between  the  policy 
outcomes  generated  by  public  programs  and  the 
original  need  behind  their  creation.  Ironically, 
there  is  strong  evidence  suggesting  that  means- 
tested  programs  fail  to  combat  poverty  and  may 
even  accentuate  it.  The  type  of  program  or  mixes 
of  programs  needed  to  alleviate  poverty  has  been 
the  source  of  multiple  debates  in  the  literature. 

New  Focus:  The  Working 

Poor  and  Social  Exclusion 

In  recent  years,  welfare  policies  have  been  devel¬ 
oped  to  tackle  two  newly  recognized  needs: 
(1)  poverty  among  full-time  workers  and  (2)  social 
exclusion.  The  first  is  a  direct  consequence  of  stag¬ 
nant  wages  for  low-income  individuals.  In  many 
industrialized  countries,  minimum  wage  has  not 
been  indexed  properly,  resulting  in  an  actual 
decrease  in  real  income.  Various  measures,  such  as 
granting  tax  credits  and  readjusting  the  minimum 
wage,  have  been  introduced. 

The  term  social  exclusion  originated  in  France 
and  it  has  since  been  adopted  by  the  European 


Union  (EU),  which  has  made  social  inclusion  a  vital 
component  of  its  fight  against  poverty.  The  EU 
defines  social  exclusion  as  “a  process  whereby  cer¬ 
tain  individuals  are  pushed  to  the  edge  of  society 
and  prevented  from  participating  fully  by  virtue  of 
their  poverty,  or  lack  of  basic  competencies  and 
lifelong  learning  opportunities,  or  as  a  result  of 
discrimination”  (European  Commission,  2004). 
EU  countries  have  been  elaborating  and  imple¬ 
menting  national  action  plans  to  tackle  social 
exclusion  and  the  World  Bank  has  recently  included 
social  exclusion  within  its  own  analyses  of  poverty. 
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The  expression  welfare  state  denotes  a  number  of 
policies  through  which  the  government  provides 
protection  against  a  predetermined  set  of  social 
risks  and  needs.  The  term  made  its  appearance  in 
the  political  vocabulary  of  English-speaking  coun¬ 
tries  during  the  1930s  and  early  1940s.  State 
activism  in  the  social  sphere  dates  back,  however, 
to  the  second  half  of  the  19th  century  and  was 
essentially  prompted  by  the  modernization  pro¬ 
cess  and  its  accelerating  pace  throughout  Western 
Europe. 

From  an  institutional  viewpoint,  the  essence  of 
the  welfare  state  is  the  rights-based  character  of 
protection.  Through  social  policies,  the  govern¬ 
ment  defines  rules  and  standards  regarding 
resources  and  opportunities,  which  are  considered 
to  be  highly  relevant  for  individual  life  conditions 
and  thus  worthy  of  being  “guaranteed”  by  state 
authority.  In  contemporary  democracies,  such 
rules  and  standards  are  typically  incorporated  in 
the  notion  of  social  citizenship.  Being  a  citizen 
means  to  enjoy  not  only  certain  civil  and  political 
rights  but  also  specific  social  rights — that  is,  enti¬ 
tlements  to  obtain  resources  such  as  a  pension  or 
opportunities  (e.g.,  access  to  medical  or  employ¬ 
ment  services)  that  uphold  life  chances.  Social  citi¬ 
zenship  thus  contributes  to  the  concrete  realization 
of  the  great  normative  ideals  of  Western  moder¬ 
nity:  freedom,  equality,  solidarity,  and  security. 

A  welfare  state  does  not  only  limit  itself  to 
defining  citizens’  entitlements  (and,  of  course,  the 
corresponding  financial  obligations)  but  also  typi¬ 
cally  organizes  the  production  and  distribution  of 
social  protection,  for  example,  through  public 
insurance  schemes  or  health  services.  Welfare 
administrations  occupy  the  center  stage  of  contem¬ 
porary  bureaucratic  systems  (e.g.,  in  terms  of  staff 
employed),  while  social  spending  makes  up  around 
50%  of  the  public  budget  in  the  Organisation  for 
Economic  Co-operation  and  Development  (OECD) 
area.  In  the  European  Union  (EU),  social-protection 
expenditures  amount  to  about  27%  of  GDP  (gross 


domestic  product;  27  average,  mid-2000s).  State 
programs  funded  by  tax  money  are  not  the  only 
providers  of  welfare:  markets,  families,  and  inter¬ 
mediary  associations  are  other  important  spheres 
and  channels  of  provision.  Each  society  has  its 
own  composite  welfare  “mix”  or  “regime.”  In 
the  course  of  the  20th  century,  however,  the  state 
has  extended  and  strengthened  its  regulatory 
power  over  all  forms  of  welfare  provision  (e.g., 
through  family  or  labor  law)  and  has  affirmed 
itself  not  only  as  the  most  salient  but  also  as  the 
ultimate  “social  sovereign.”  By  expanding  its  sov¬ 
ereignty  in  the  social  sphere,  the  state  has  funda¬ 
mentally  transformed  its  own  structure,  function, 
and  legitimation  basis,  prompting  the  appearance 
of  novel  political  actors  and  dynamics.  The  fol¬ 
lowing  sections  of  this  entry  will  illustrate  the 
main  stages  of  development  of  the  welfare  state, 
from  its  early  origins  to  the  present  phase  of  crisis 
and  reform. 

Early  Origins 

The  historical  background  of  the  modern  welfare 
state  is  constituted  by  the  various  “poor  relief” 
measures  introduced  in  European  states  since  the 
17th  century  and  codified  in  some  of  them  (e.g., 
England)  in  organic  sets  of  provisions  (the  Poor 
Laws).  The  institutional  watershed  was,  however, 
the  establishment  of  compulsory  insurance,  resting 
on  the  new  principle  of  a  rights-based  protection 
backed  by  state  authority.  The  pioneer  country 
was  Wilhelmine  Germany  (1890-1914),  which 
introduced  sickness  insurance  in  1883,  work  acci¬ 
dent  insurance  in  1884,  and  pension  insurance  in 
1889.  By  the  end  of  the  Great  War,  virtually  all 
West  European  countries  had  legislated  at  least 
one  compulsory  insurance  scheme.  The  United 
States  and  Canada  followed  suit  with  federal 
insurance  schemes  between  the  1920s  and  1930s. 

The  relatively  close  temporal  proximity  of  the 
introduction  of  compulsory  social  insurance 
reflects  the  similar  problematic  pressure  that  all 
countries  were  suffering  in  the  wake  of  capitalist 
industrialization  and,  more  generally,  moderniza¬ 
tion.  The  specific  timing  reflects,  in  contrast,  spe¬ 
cific  national  political  constellations.  Authoritarian 
regimes  (e.g.,  Germany  and  Austria)  moved  as 
front-runners,  in  the  hope  of  taming  an  increas¬ 
ingly  active  workers’  movement  that  threatened  to 
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jeopardize  conservative  hegemony.  Despite  their 
more  advanced  level  of  socioeconomic  develop¬ 
ment,  the  early  democratizers,  such  as  Britain, 
lagged  somewhat  behind:  In  these  countries,  polit¬ 
ical  elites  were  less  worried  about  legitimation  and 
consensus,  and  the  introduction  of  social  insurance 
had  to  wait  for  the  strengthening  of  socialist  repre¬ 
sentation  within  national  parliaments. 

The  early  phase  of  “instauration”  witnessed  the 
emergence  of  two  distinct  basic  approaches  to 
social  insurance.  The  occupational  approach  (pre¬ 
vailing  in  continental  Europe)  was  based  on  labor 
market  status  and  proceeded  through  the  estab¬ 
lishment  of  several  occupational  schemes  (the  first 
typically  covering  blue-collar  industrial  workers), 
often  with  different  regulations.  The  universalist 
approach  (typical  of  English-speaking  countries 
and  Nordic  Europe),  in  contrast,  was  based  on 
need  and  later  citizenship.  In  1913,  Sweden  intro¬ 
duced,  for  example,  the  first  universal  pension 
scheme  for  all  elderly  citizens,  including  a  modest 
flat-rate  benefit  topped  by  means-tested  supple¬ 
ments.  Rooted  in  different  institutional  traditions 
and  reflecting  historical  variations  in  class  struc¬ 
tures  and  systems  of  corporate  and  political  repre¬ 
sentation,  the  early  choice  of  the  occupational 
versus  universalist  path  had  significant  implica¬ 
tions  for  subsequent  institutional  developments 
and  distributive  outcomes. 

The  Golden  Age  of  Postwar  Expansion 

In  many  countries,  World  War  II  spurred  ambi¬ 
tious  reform  projects  (e.g.,  the  Beveridge  Plan  in 
Britain)  for  strengthening  social  protection,  also 
with  a  view  to  underpin  national  cohesion.  The  3 
decades  from  1945  to  1975  witnessed  an  impres¬ 
sive  growth  of  the  welfare  state  in  both  quantita¬ 
tive  and  qualitative  terms.  At  the  beginning  of  the 
1950s,  social  expenditure  was  still  below  10%  of 
GDP  in  most  European  countries.  By  the  early 
1970s,  many  countries  (e.g.,  Belgium,  Denmark, 
France,  Germany,  Italy,  the  Netherlands,  and 
Sweden)  had  come  to  pass  the  20%  mark  and 
most  of  the  remaining  ones  had  already  surpassed 
15%.  The  vast  majority,  if  not  the  totality,  of  the 
population  had  come  to  be  included  in  social  pro¬ 
tection  schemes  for  all  the  “standard  risks”:  old 
age,  disability,  and  bereavement;  sickness,  mater¬ 
nity,  and  work  injuries;  and  unemployment  and 


family  dependents.  At  least  in  terms  of  eligibility, 
the  European  welfare  states  had  “grown  to  limits”: 
They  had  reached  or  were  about  to  reach  their 
widest  possible  domestic  boundaries,  coinciding 
with  the  whole  citizenry.  During  this  “golden  age” 
of  welfare  state  expansion,  the  more  localized  sys¬ 
tems  of  protection  were  progressively  marginalized 
in  their  financial  size  and  functional  scope. 
Sophisticated  techniques  were  invented  and 
deployed  to  improve  and  rationalize  the  extraction 
of  taxes  and  contributions,  govern  redistributive 
flows  from  the  center,  and  deliver  benefits  and 
services  to  the  various  clientele.  Finally,  alongside 
the  various  insurance  schemes  for  the  standard 
risks,  new  noncontributory  programs  of  general 
social  assistance  were  created,  as  well  as  increas¬ 
ingly  complex  health  care  systems  providing  a 
wide  array  of  medical  services. 

A  vast  literature  has  explored  cross-national 
variations  during  the  long  expansion  phase. 
According  to  the  Danish  sociologist  Gosta  Esping- 
Andersen’s  influential  analysis,  three  distinct  wel¬ 
fare  “regimes”  (i.e.,  systems  of  relationships 
between  state  policies,  labor  markets,  and  the 
family)  consolidated  themselves  between  the 
1950s  and  the  late  1970s:  a  social  democratic 
regime,  prevailing  in  the  Nordic  countries  and 
centered  on  encompassing  and  generous  schemes; 
a  conservative-corporative  regime,  prevailing  in 
the  continental  countries  and  centered  on  frag¬ 
mented  schemes  linked  to  labor  market  status; 
and  a  liberal  regime,  prevailing  in  the  English- 
speaking  countries,  centered  on  universal  but 
means-tested  schemes  offering  only  meager  bene¬ 
fits.  The  three  regimes  were  the  product  of  specific 
class  coalitions  and  their  capacity  to  mobilize 
“power  resources”  in  their  own  interest.  Each 
regime  had  also  specific  distributional  implica¬ 
tions,  especially  in  terms  of  decommodification-, 
that  is,  the  extent  to  which  workers  are  freed  from 
market  dependence  for  the  satisfaction  of  basic 
needs.  Esping-Andersen’s  pathbreaking  work  led 
to  a  vast  debate.  By  looking  in  more  depth  at  the 
experience  of  specific  areas  or  countries  (e.g., 
southern  Europe,  Australia  and  New  Zealand,  the 
United  States,  and,  more  recently,  the  postcom¬ 
munist  countries  or  East  Asia)  various  authors 
have  articulated  the  original  regime  tripartition, 
adding  new  types  and  models,  and  have  high¬ 
lighted  additional  analytical  dimensions  (e.g.,  the 
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capacity  of  regimes  to  “defamilize”  care  and  pro¬ 
mote  gender  equality)  and  theoretical  insights. 

The  Crisis  and  Permanent  Austerity 

In  the  wake  of  the  oil  shocks  of  the  1970s,  the 
developmental  trajectory  of  the  welfare  state 
started  to  invert  its  direction:  The  “golden  age  of 
expansion”  gradually  faded  away  and  was  replaced 
by  a  new  phase  of  “permanent  austerity.”  This 
shift  was  mainly  the  result  of  exogenous  pressures 
coming  from  the  environment  of  the  welfare  state: 
the  environment  external  to  the  nation-state  and 
the  internal  environment  (i.e.,  changes  in  domestic 
economies  and  social  structures).  The  following 
section  briefly  reviews  the  nature  and  impact  of  all 
these  pressures,  starting  with  the  external  ones. 

Globalization 

Since  the  1980s,  the  growing  international  inte¬ 
gration  of  markets  has  started  to  impose  new  con¬ 
straints  on  the  welfare  state,  by  restricting  the 
margins  of  maneuver  that  national  governments 
enjoyed  during  the  golden  age  in  designing,  man¬ 
aging,  and  funding  their  social-protection  systems. 
The  globalization  of  finance  has  seriously  weak¬ 
ened  governments’  control  over  national  tax  bases. 
Policy  liberalizations  have  widened  the  “exit 
options”  for  capital,  investors,  and  firms,  thus 
making  it  much  more  difficult  for  the  state  to  steer 
the  economy  and  the  labor  market  and  to  reconcile 
the  twin  goals  of  economic  competitiveness  and 
social  consensus. 

In  the  political  debates  of  the  1980s  and  early 
1990s,  globalization  was  often  portrayed  as  a  sub¬ 
versive  force,  inevitably  conducive  to  a  “race  to  the 
bottom”  through  regulatory  competition.  During 
the  past  decade,  a  number  of  serious  systematic 
comparative  analyses  have,  however,  challenged 
these  negative  and  pessimistic  views.  Far  from 
being  an  irresistible  destroyer  of  social  protection 
institutions  and  a  “flattener”  of  country- specific 
preferences  and  diversities,  globalization  is  now 
seen  as  a  challenge  that  (a)  is  filtered  through  vari¬ 
ous  elements  of  domestic  political  economy  con¬ 
figurations  and  (b)  can  be  more  or  less  effectively 
contravened  through  politico-economic  responses 
and  institutional  adaptations.  The  debate  has 
noted  that,  if  appropriately  (re)configured,  social 


policies  can  play  a  precious  role  in  upholding  the 
performance  of  domestic  political  economies  in  the 
new  globalized  environment. 

European  Integration 

When  it  was  launched  in  the  1950s,  the  project 
of  European  integration  did  not  intend  to  chal¬ 
lenge  the  institutional  foundations  of  the  nation- 
based  welfare  state.  Quite  to  the  contrary,  the 
founders  conceived  of  European  integration  as  a 
project  capable  of  creating  and  sustaining  a  virtu¬ 
ous  circle  between  open  economies  and  outward¬ 
looking  economic  policies  on  the  one  hand  and 
closed  welfare  states  and  inward-looking  social 
policy  on  the  other. 

The  international  economic  crisis  of  the  1970s 
prompted,  however,  an  ambitious  project  to 
enhance  economic  integration,  based  on  two  steps: 

(1)  completing  the  internal  market  and  then 

(2)  moving  toward  a  fully  fledged  economic  and 
monetary  union  (EMU).  The  Single  European  Act 
of  1986  unleashed  a  dynamic  of  “market  making,” 
primarily  through  measures  of  negative  integration 
(i.e.,  the  removal  of  national  barriers  to  economic 
transactions).  By  setting  fixed  macroeconomic 
requirements  and  deadlines  for  admission  into  the 
“euro,”  the  Maastricht  Treaty  of  1992  imposed 
further  constraints  on  the  autonomy  of  national 
governments  in  the  welfare  state  sphere. 

Like  globalization,  the  EMU  project  was  largely 
considered  in  the  early  debate  of  the  1990s  as  a 
subversive  factor  for  national  social  contracts,  in 
the  wake  of  increasingly  stringent  market  compat¬ 
ibility  requirements,  the  new  power  of  business, 
the  hardening  of  budgetary  constraints,  and  the 
impossibility  of  creating  some  sort  of  Social 
Europe  due  to  the  institutional  obstacles  to  “posi¬ 
tive”  integration.  Recent  discussions  have  adopted 
a  much  more  nuanced  view.  European  integration 
has  indeed  forced  several  domestic  adjustments 
but  not  necessarily  for  the  worse:  the  economic, 
fiscal,  and  monetary  discipline  enforced  by  supra¬ 
national  authorities  has  prompted  or  accelerated  a 
dynamic  of  welfare  state  recalibration,  which  was 
anyway  appropriate  and  desirable  for  coping  with 
a  host  of  endogenous  problems.  Liberalizations 
have  made  several  goods  and  services  more  afford¬ 
able  to  consumers,  enhancing  the  range  of  options 
available  to  them;  in  certain  areas,  such  as  health 
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and  safety,  market  integration  has  also  brought 
about  more  consumer  protection  and  higher  labor 
standards.  Some  regulatory  competition  has  indeed 
taken  place  but  not  necessarily  with  the  worst 
results.  As  in  the  case  of  globalization,  the  literature 
has  moreover  highlighted  a  multitude  of  factors 
that  mediate  the  impact  of  European  integration  on 
domestic  redistributive  arrangements,  as  well  as  the 
scope  and  intensity  of  their  “Europeanization.” 

Internal  Challenges:  The  Transformation  of 

Domestic  Economies  and  Social  Structures 

In  addition  to  external  pressures  and  constraints, 
welfare  state  programs  have  also  been  facing 
growing  challenges  posed  by  the  transformations 
of  their  domestic  economic  and  social  environ¬ 
ments.  The  foundation  and  the  expansion  of  such 
programs  in  states  had  taken  place  in  the  context 
of  “industrialism” — understood  as  a  general  mode 
of  organizing  the  spheres  of  economic  production 
and  social  reproduction.  The  past  3  decades  have 
witnessed  a  rapid  transition  toward  a  new  “postin¬ 
dustrial”  order,  in  the  wake  of  the  rising  impor¬ 
tance  of  services,  changes  in  household  patterns, 
and  behaviors  and  population  ageing. 

The  shift  from  an  industrial  to  a  postindustrial 
economy,  mainly  centered  on  the  service  sector,  has 
originated  serious  upheavals  in  the  occupational 
structures  of  advanced  countries.  “Fordist”  employ¬ 
ment  (stable  and  guaranteed  jobs  with  permanent 
contracts)  has  been  witnessing  a  steady  decline 
since  the  1970s,  not  fully  compensated  by  the  rise 
of  nonstandard  “atypical”  forms  of  employment 
(such  as  temporary  or  part-time  jobs).  Low  wages 
and  poor-quality  jobs  have  increased  the  numbers 
of  “working  poor,”  systematically  exposed  to  the 
risk  of  the  labor  market  and  social  exclusion. 

The  second  important  change  has  involved  fam¬ 
ily  and  gender  relations.  While  in  postwar  industrial 
societies,  traditional  families  with  a  male  breadwin¬ 
ner  and  a  housewife  predominated,  the  postindus¬ 
trial  age  is  characterized  by  a  greater  plurality  of 
household  forms:  dual-earner  families,  single¬ 
parent  households,  de  facto  unions,  and  so  on. 
Average  household  size  has  declined,  not  only 
partly  due  to  a  fall  in  fertility  rates  but  also  due  to 
the  greater  number  of  single-person  households  and 
to  a  drastic  drop  in  multigenerational  households. 
In  parallel  with  the  increase  of  separations  and 


divorces,  these  changes  point  toward  a  general 
“precarization”  of  social  relations  in  advanced 
societies.  Another  important  trend  since  the  1970s 
has  been  the  increased  participation  of  women  in 
the  labor  market — a  phenomenon  that  is  closely 
connected  to  the  rise  of  the  service  sector. 

The  third  significant  transformation  has  been 
demographic  aging,  due  to  lower  birth  rates  and 
higher  life  expectancy.  The  proportion  of  elderly 
people  in  the  advanced  societies  has  been  con¬ 
stantly  increasing  in  the  past  3  decades.  All  projec¬ 
tions  point  toward  a  dramatic  intensification  of 
this  process,  especially  in  Europe  and  Japan.  The 
OECD  estimates  that  the  over-60  age-group, 
mainly  retired  people,  will  rise  by  almost  50%  by 
2020:  For  each  elderly  person,  there  will  be  fewer 
than  two  workers.  Largely  due  to  the  full  matura¬ 
tion  of  the  generous  benefit  formulas  introduced  in 
the  past,  these  elderly  will  retire  on  average  with 
higher  pensions  than  current  retirees:  Thus,  the  real 
intergenerational  transfer  of  resources  will  grow  by 
much  more  than  50%.  A  similar  syndrome  will 
affect  the  real  expenditure  on  health  care. 

These  internal  transformations  have  reinforced 
the  pressures  and  constraints  (especially  in  finan¬ 
cial  terms)  linked  to  external  changes  and  have 
originated  delicate  policy  dilemmas.  They  have 
also  generated  a  host  of  new  social  risks  and  needs: 
from  new  forms  of  poverty  and  social  exclusion  to 
personal  dependency  and  from  skills  obsolescence 
(and  thus  unemployability)  to  situations  of  work- 
life  imbalances.  The  rebalancing  of  social  expendi¬ 
ture  toward  the  new  risks  and  the  more  vulnerable 
social  groups  has  clashed  with  the  high  “sticki¬ 
ness”  of  the  institutional  status  quo — a  defining 
feature  of  that  “new  politics  of  the  welfare  state,” 
which  has  been  the  object  of  a  lively  debate  in  the 
literature  of  the  past  decade  or  so. 

Welfare  State  Recalibration  at  a  Glance 

Despite  the  gloomy  prospects  outlined  by  the  early 
“crisis”  debate,  the  welfare  state  has  not  crumbled 
under  the  weight  of  the  contextual  changes  described 
so  far  and  has  actually  given  signs  of  at  least  some 
adaptive  capacity.  Institutional  adaptation  has  been 
a  complex  incremental  process,  which  has  pro¬ 
ceeded  with  different  speed  and  success  across  the 
various  countries  and  which  has  involved  different 
policy  areas:  from  macroeconomic  management 
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and  industrial  relations  to  tax  policy  and  labor 
market  regulation.  Limiting  ourselves  to  the  core 
elements  of  social  protection,  at  least  four  key  gen¬ 
eral  trends  of  reform  can  be  identified. 

The  first — and  the  most  significant — of  these 
trends  is  constituted  by  structural  adjustments  in 
response  to  socioeconomic  developments.  In  the 
field  of  pensions,  the  1990s  witnessed  a  substantial 
wave  of  reforms,  based  on  the  following  ingredi¬ 
ents:  increasing  the  age  of  retirement,  tightening 
qualifying  conditions,  restricting  indexation  rules, 
or  strengthening  the  link  between  contributions 
and  benefits.  In  the  field  of  health  care,  reforms 
have  been  introduced  with  a  view  to  enhancing 
efficacy  and  efficiency  in  the  allocation  of  resources 
and  in  the  provision  of  services  through  better 
incentives  at  both  the  macro-  and  the  microlevels. 
Social  services  and  family  policies  have  also  wit¬ 
nessed  some  innovation  in  both  substantive  and 
organizational  terms,  with  a  view  to  responding  to 
the  rising  needs  of  the  elderly  population,  the 
changing  gender  division  of  labor,  and  new  forms 
of  poverty  and  exclusion. 

A  second  general  trend  has  been  a  move  to  an 
active  approach  in  the  management  of  work 
incapacity  (e.g.,  disability)  and  especially  unem¬ 
ployment,  with  a  view  to  preventing  long-term 
dependency  on  income  support.  In  the  course  of 
the  1990s,  the  “job  first”  principle  has  gradually 
made  its  way  throughout  OECD  (un)employment 
protection  systems.  Access  to  benefits  has  been 
generally  made  more  restrictive  and  conditional, 
but  at  the  same  time,  new  networks  of  public  and 
private  employment  services  have  been  set  up  to 
promote  and  facilitate  the  labor  market  reintegra¬ 
tion  of  workers  without  jobs.  Activation  strategies 
have  proceeded  hand  in  hand  with  wider  exercises 
of  labor  market  reconfiguration,  pioneered  by  the 
Netherlands  in  the  1980s  under  the  banner  of 
“flexicurity.”  Reforms  in  this  field  have  received 
explicit  spurs  by  the  EU  and  in  particular  by  the 
European  Employment  Strategy  (EES),  launched  in 
1997. 

A  third  general  trend — which  cuts  across  vari¬ 
ous  social  protection  programs — has  been  greater 
“targeting”  or  selectivity  of  resources  toward  those 
most  in  need,  also  in  the  wake  of  the  policy  recom¬ 
mendation  of  influential  international  bodies  such 
as  the  OECD  or  the  World  Bank.  Different  strate¬ 
gies  have  been  experimented  with,  depending  on 


national  preferences,  constraints,  and  opportuni¬ 
ties:  greater  use  of  traditional  means  testing,  link¬ 
ing  the  amount  of  benefits  received  to  income  or 
means  testing  from  the  top,  clawing  back  transfer 
payments  from  those  less  in  need  via  the  tax  sys¬ 
tem,  and  so  on.  Alongside  these  strategies  of  “verti¬ 
cal”  targeting  based  on  economic  resources,  a  trend 
is  also  observable  toward  “horizontal”  targeting 
based  on  social  risk:  reducing  the  generosity  of 
some  core  transfer  programs  (e.g.,  old  age,  disabil¬ 
ity,  and  survivor  pensions)  while  increasing  family 
benefits,  introducing  new  subsidies  for  caregivers 
or  categories  with  special  needs,  or  expanding  pro¬ 
grams  against  social  exclusion. 

A  fourth  general  trend  has  involved  the  financial 
side  of  social  protection.  Pension  systems  have  wit¬ 
nessed  a  strengthening  of  “funding”  (i.e.,  the  accu¬ 
mulation  of  real  financial  reserves)  to  make  income 
security  at  retirement  less  vulnerable  to  demo¬ 
graphic  imbalances  and  to  shift  some  of  the  respon¬ 
sibility  for  its  provision  from  the  state  to  individual 
workers  or  the  social  partners.  Another  important 
development  on  the  financing  front  has  been  the 
attempt  at  reducing  charges  on  business  and  labor, 
particularly  those  in  the  form  of  nonwage  labor. 
This  development  has  been  primarily  motivated 
not  only  by  competitiveness  preoccupations  but 
also  by  the  wish  to  neutralize  the  vicious  circles 
generated  by  “contribution-heavy”  social  insur¬ 
ance  systems.  More  generally,  most  countries  have 
reviewed  the  incentives  of  their  tax-benefit  systems 
to  make  them  more  “employment  friendly.” 

Conclusion 

The  welfare  state  can  be  considered  as  one  of  the 
most  salient  achievements  and  legacies  of  the  20th 
century:  Its  programs  have  greatly  contributed  to 
consolidating  democratic  institutions  and  to  har¬ 
monizing  economic  growth  with  changing  social 
needs.  A  child  of  Western-style  (and  in  particular 
European-style)  modernization,  the  welfare  state 
has  been  rapidly  growing  throughout  the  develop¬ 
ing  world  as  well  during  the  past  decades.  A  num¬ 
ber  of  socioeconomic  transformations  are  posing 
today  serious  challenges  to  the  sustainability  and 
effectiveness  of  this  institution.  Although  many  of 
its  traditional  objectives  and  instruments  will  have 
to  be  reconsidered  and  redesigned  (a  process  that 
is  already  under  way  in  many  countries),  it  seems, 
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however,  plausible  to  predict  that  the  21st  century 
will  witness  a  further  strengthening  and  articula¬ 
tion  of  rights-based  social  protection,  anchored 
not  only  to  state-national  but  also  possibly  to 
supranational  and  international  institutions. 
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Westphalian  Ideal  State 


Westphalian  ideal  state  is  the  conceptualization  of 
the  modern  state  widely  assumed  to  have  derived 
from  the  Peace  of  Westphalia  treaties  of  1648. 


The  Munster  and  Osnabriick  treaties  ended  the 
Thirty  Years’  War  and  made  sovereignty  and  ter¬ 
ritorial  principles  the  cornerstones  of  the  new 
international  arena:  They  paved  the  way  to  the 
state  as  the  main  actor  in  international  relations 
(IR)  and  are  accepted  by  the  scientific  community 
as  the  symbol  of  the  “interstate  system,”  which  is 
still  preeminent  now,  even  if  it  is  more  and  more 
contested.  The  modern  state  has  been  central  to 
political  science  and  IR  since  their  consolidation  as 
academic  disciplines  around  the  turn  of  the  20th 
century;  in  fact,  both  disciplines  have  long  regarded 
it  as  axiomatic  that  the  state  is  of  primary  impor¬ 
tance,  indeed  indispensable  to  society.  Even  when, 
in  the  20th  century,  it  became  fashionable  to  pre¬ 
dict  the  obsolescence  of  the  state  as  we  know  it, 
this  in  itself  paid  tribute  to  the  state’s  significance 
and  tended  to  cement  it.  This  entry  first  defines  the 
ideal  state,  including  its  historical  context  and  key 
aspects  such  as  sovereignty,  internationally  recog¬ 
nized  territory,  and  nonintervention  of  other  states. 
It  then  critically  examines  the  theory  that  this  state 
originated  with  the  Westphalian  peace  treaties. 

Defining  the  Ideal  State 

What  is  the  state?  Prevailing  opinion  considers  it 
to  be  coeval  with  civilization;  thus,  as  far  as 
“Western”  civilization  is  concerned,  the  Sumerians 
are  usually  credited  with  its  invention  and  with 
passing  it  on  to  peoples  further  west.  Both  the 
“city-state”  and  imperial  structures  that  tran¬ 
scended  it  have,  since  the  19th  century,  been  held 
to  be  instantiations  of  “statehood,”  in  the  case  of 
the  Sumerians  as  well  as  in  the  case  of  the  pre- 
Christian  Greeks  and  Romans  or  of  the  kingdoms 
of  ancien  regime  Europe. 

Semantically,  this  attribution  of  statehood  to 
the  political  structures  of  bygone  eras  was  neces¬ 
sarily  retrospective,  since  the  process  by  which 
words  such  as  state  in  English  and  its  homologues 
in  other  European  languages  came  to  designate  the 
state  as  we  know  it  did  not  get  under  way  until  the 
15th  century  at  the  earliest  and  was  not  complete 
until  the  19th  century;  the  Latin  status ,  from 
which  those  words  are  derived,  has  never  had  this 
meaning.  Retrospective  attribution  of  statehood 
also  necessarily  presupposes  that  statehood  is  in 
fact  independent  of  the  words  used  to  designate 
it,  that  the  state  exists  independently  of  political 
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discourse — a  premise  that  constructivists,  in  par¬ 
ticular,  must  question. 

Moreover,  this  notion  of  the  timelessness  and 
ubiquity  of  the  state  has  always  sat  somewhat 
uneasily  with  the  realization  that  political  structures 
have  varied  greatly  in  history.  That  realization  has 
virtually  never,  as  yet,  led  modern  writers  to  chal¬ 
lenge  the  belief  in  the  inescapable  historic  necessity 
of  the  state  for  civilization  in  principle.  It  has,  how¬ 
ever,  encouraged  attempts  to  track  the  evolution  of 
concrete  historical  forms  that  the  state  is  thought  to 
have  taken — more  particularly,  the  evolution  of  the 
modern  state  over  the  past  few  centuries. 

Such  attempts  were  helped — indeed  made 
possible — by  the  efforts  of  social  scientists  around 
the  turn  of  the  20th  century  to  define  what  could  be 
called  the  ideal  state.  Most  important,  Georg  Jellinek 
in  1900  posited  that  the  state  needs  a  territory,  a 
people,  and  a  government.  Max  Weber,  in  his  lec¬ 
ture  Politics  as  a  Vocation  (1918),  identified  the  state 
as  an  “entity  which  claims  a  monopoly  on  the  legiti¬ 
mate  use  of  physical  force.  ”  In  practice,  this  monop¬ 
oly,  which  the  wording  chosen  by  Weber  vests  in  the 
state  as  such,  can  of  course  only  be  wielded  by  the 
government  or  those  empowered  by  it. 

Sovereignty  is  another  concept  commonly  regarded 
as  an  indispensable  ingredient  of  statehood — indeed, 
semantically,  the  two  are  commonly  yoked  together 
in  the  expression  “sovereign  statehood.”  Although 
in  modern  democracies  sovereignty  is  said  to  ema¬ 
nate  from  the  people,  it  is,  in  practice,  essentially 
wielded  by  the  government  and  closely  related  to 
the  monopoly  on  legitimate  violence  singled  out  by 
Weber,  giving  the  government  the  last  word  in 
most  domestic  matters.  Domestic  sovereignty  is 
complemented  by  external  sovereignty,  which, 
conversely,  denies  the  right  of  any  agency  outside 
the  state  to  dictate  to  it  or  any  of  its  parts  or  mem¬ 
bers.  The  “rule  of  nonintervention,”  proscribing 
interference  in  the  internal  affairs  of  another  sov¬ 
ereign  state,  is  a  corollary  of  this.  International  law 
in  what  may  be  called  its  “classic”  modern  form — 
likewise  elaborated  around  the  turn  of  the  20th 
century  by  scholars  such  as  Lassa  Oppenheim — 
also  laid  great  stress  on  sovereign  statehood,  con¬ 
sidered  the  precondition  for  being  able  to  create 
order  norms  as  a  subject  (rather  than  merely  an 
object)  of  international  law. 

It  is  quite  clear  that  many  of  the  preindustrial 
political  structures  to  which  19th-  and  20th-century 


historiography  habitually  applies  the  term  state 
fell  short  of  the  ideal  state  as  defined  by  scholars 
such  as  Jellinek,  Weber,  or  Oppenheim,  which 
their  own  time  came  at  least  close  to  making  a 
reality.  Rightly  or  wrongly,  the  general  conformity 
of  more  ancient  structures — such  as  the  “city- 
state”  of  the  Greeks  or  the  Roman  Empire — to  the 
ideal  state  is  not  normally  called  into  question. 
Concerning  Christian  Europe,  however,  the  “feu¬ 
dal  system”  is  widely  perceived  to  have  been  at 
odds  with  the  ideal  state,  that  system  being  char¬ 
acterized  by  diffusion  rather  than  centralization  of 
power,  by  competing  and  entangled,  rather  than 
territorially  exclusive,  jurisdictions  (borrowing 
from  Friedrich  Meinecke,  John  Ruggie  has,  influ¬ 
entially  in  IR,  called  them  “heteronomous”)  and 
by  a  corresponding  lack  of  any  monopoly  on 
legitimate  violence. 

Moreover,  the  collective  memory  of  Western 
civilization  correctly  holds  that  civilization  was 
informed  by  a  lingering  sense  of  the  oneness  of  the 
Christian  community  (at  least  to  the  extent  that  it 
was  traditionally  focused  on  Rome)  even  in  the  late 
pre-Reformation  phase  and  indeed,  if  probably  to 
an  ever-decreasing  degree,  for  a  century  or  two 
after  the  Reformation.  The  continuing  influence  of 
a  political  paradigm  that  emphasized  oneness  over 
diversity  is,  less  correctly,  thought  to  have  pro¬ 
vided  a  platform  for  the  Western  emperor  (the 
ruler  of  the  Holy  Roman  Empire)  to  claim  suzer¬ 
ainty  over  Western  Christendom,  this  of  course 
being  at  odds  with  the  concept  of  sovereignty. 

Critical  Issues 

A  well-known  article  by  the  international  lawyer 
Leo  Gross,  published  in  1948  to  mark  the  tercen¬ 
tenary  of  the  Peace  of  Westphalia  in  1648,  helped 
shape  the  widespread  view  of  that  settlement  as 
marking  a  watershed  in  the  consolidation  of  the 
ideal  state  as  conceived  today.  Gross  was  not  offer¬ 
ing  a  wholly  new  interpretation — thus,  the  arche¬ 
typal  IR  textbook,  Hans  J.  Morgenthau’s  Politics 
Among  Nations,  contains  brief  allusions  to  the 
1648  peace  that  are  consistent  with  the  article  by 
Gross,  and  since  Morgenthau’s  book  first  appeared 
in  the  same  year  as  that  article,  it  is  unlikely  that  it 
was  influenced  by  it.  Rather  than  create  a  new 
orthodoxy,  Gross  articulated  authoritatively  an 
orthodoxy  that  already  existed. 
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Echoing  (unwittingly  and  indirectly,  while  also 
adding  a  further  “spin”  of  its  own)  the  17th-century 
anti-Habsburg  propaganda,  this  view  presents  the 
Thirty  Years’  War  as  a  struggle  between  Habsburg 
aspirations  for  universal  monarchy  and  attempts  by 
other  actors  to  emancipate  themselves  from  such 
aspirations  in  the  name  of  sovereignty.  It  takes 
many  liberties  with  the  historical  evidence — 
beginning  with  the  text  of  the  peace  settlement 
itself,  which  Gross  suggested  need  not  be  taken 
too  literally,  which  later  exponents  of  the  same 
view  have  most  often  ignored  entirely,  and  which, 
it  should  be  stressed,  provides  no  corroboration  at 
all  of  the  interpretation  in  question.  But  this 
Grossian  view  nevertheless  commended  itself  by 
offering,  or  at  a  minimum  cementing,  what  has 
become  the  founding  myth  of  the  modern  state.  In 
this  view,  that  state  was  freed  by  the  Peace  of 
Westphalia  from  the  shackles  of  feudal  “heteron- 
omy,”  and  the  concept  of  sovereignty  and  the 
attendant  rule  of  nonintervention  were  considered 
(wrongly)  to  have  been  enshrined  by  the  text  of 
the  settlement  for  the  entire  European  system. 

It  is  true  that  writers  and  diplomats  started  a 
custom  of  declaring  the  1648  peace  as  being  of 
special  significance  within  a  few  decades  of  its 
signing.  The  grounds  for  doing  so  have  not  been 
uniform,  to  the  extent  that  they  are  actually  spelled 
out — once  established,  the  habit  probably  became 
self-perpetuating,  eliminating  the  need  to  give  a 
reason  for  it  while  at  the  same  time  opening  up  the 
possibility  of  identifying  new  grounds  for  this  spe¬ 
cial  significance.  The  view  expounded  by  Gross 
thus  put  the  established  significance  of  the  Peace  of 
Westphalia  on  a  new  foundation  in  line  with  the 
political  agenda  of  its  own  period. 

This  gave  rise  to  the  still  solid  consensus  among 
IR  scholars  that  the  “international  system”  as  we 
know  it  began  with  the  Peace  of  Westphalia — 
hence  its  designation  as  the  “Westphalian”  system 
in  much  IR  literature.  Once  that  expression  had 
become  established  usage  within  the  discipline  of 
IR,  the  epithet  Westphalian  came  to  be  applied 
also  to  the  type  of  state  that  is  the  modular  unit  of 
this  system  and  that  is  the  ideal  type  described 
earlier.  More  recently,  usage  of  the  epithet  has 
started  spreading  beyond  the  confines  of  IR. 

At  the  same  time,  criticism  of  the  epithet  has 
been  growing  as  well.  As  pointed  out,  there  is 
nothing  about  the  state  as  such  in  the  text  of  the 


Westphalian  settlement,  nor  is  that  settlement  con¬ 
cerned  in  any  way  with  issues  such  as  sovereignty, 
territorially  exclusive  jurisdictions,  or  the  rule  of 
nonintervention,  which  are  key  aspects  of  the 
modern  state.  It  has  also  been  asked  whether  the 
modern  state  did  not  in  fact  begin  to  emerge  only 
at  the  very  end  of  the  ancien  regime  and  was  con¬ 
solidated  only  in  the  course  of  the  19th  century. 
Thus,  whereas  external  sovereignty  is  today  con¬ 
sidered  to  be  vested  in  the  state  as  a  corporate 
entity,  in  1 8th-century  Europe  sovereignty  was  still 
vested,  not  only  in  fact  but  also  in  law,  in  rulers  as 
persons  rather  than  what  they  ruled  over  or  repre¬ 
sented.  Moreover,  while  those  rulers  did,  for  the 
most  part,  have  effective  external  sovereignty,  their 
domestic  sovereignty  was  still  severely  limited — for 
example,  no  central  power  in  ancien  regime 
Europe  was  capable  of  policing  its  own  territory 
effectively,  or  of  passing,  let  alone  enforcing,  gen¬ 
eral  legislation  on  a  regular  basis,  which,  in  con¬ 
trast,  became  normal  in  the  19th  century. 

Such  criticism,  on  historical  grounds,  of  the  epi¬ 
thet  Westphalian  seems  so  far  to  have  had  little 
impact.  It  may  of  course  be  argued  that  the  appel¬ 
lation  does  not  matter  unless,  beyond  the  reference 
to  a  current  political  phenomenon,  it  is  intended  to 
make  a  statement  about  history  as  well.  This,  how¬ 
ever,  is  usually  the  case,  implicitly  or  indeed  explic¬ 
itly,  with  much  of  the  textbook  literature,  for 
example,  expressly  recalling  the  presumed  histori¬ 
cal  origins  of  the  modern  state  and  the  modern 
international  system  and  thereby  perpetuating  the 
“Westphalian  myth.”  Political  scientists  were 
probably  more  in  search  of  a  name  for  this  inter¬ 
state  system  that  was  progressively  shaped;  histori¬ 
ans  generally  do  not  point  to  a  sharp  break  and 
designate  a  specific  year  for  the  emergence  of  this 
system,  as  political  scientists  did  with  1648. 
Political  scientists,  focusing  on  the  tension  that 
opposed  old  empires  to  new  states,  considered  the 
Westphalian  Peace  as  the  symbol  of  passing  from 
one  to  the  other.  Nowadays,  the  state  monopoly  is 
more  and  more  challenged  by  the  growing  influ¬ 
ence  of  nonstate  actors — as  a  result,  the  concept  of 
“post-Westphalian”  system  is  more  and  more  fre¬ 
quently  used. 
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World  Bank 


The  genesis  of  the  World  Bank  lies  in  the  creation 
of  the  International  Bank  for  Reconstruction  and 
Development  (IBRD),  following  a  conference  at 
Bretton  Woods  in  1944  (which  also  led  to  the  cre¬ 
ation  of  the  International  Monetary  Fund).  Today 
the  World  Bank  Group  comprises  in  addition  to 


the  IBRD,  the  International  Development  Asso¬ 
ciation  (IDA),  International  Finance  Corporation 
(IFC),  Multilateral  Investment  Guarantee  Agency 
(MIGA),  and  International  Centre  for  Settlement 
of  Investment  Disputes  (ICSID).  Since  their  incep¬ 
tion  through  2009,  the  institutions  comprising  the 
World  Bank  group  have  been  involved  in  lending 
nearly  three  fourths  of  a  trillion  dollars  (Table  1). 

There  are  two  key  questions  about  the  World 
Bank:  (1)  how  is  power  and  influence  brought  to 
bear  on  the  institution  and  (2)  how  does  the  insti¬ 
tution  itself  exercise  power  and  influence  on  oth¬ 
ers,  especially  its  borrowers? 

International  organizations  in  general  are 
plagued  by  severe  agency  problems,  and  the  World 
Bank  is  no  exception.  Long  chains  of  delegation 
and  multiple  stakeholders  make  this  inevitable. 
Larger  shareholders  obviously  have  greater  influ¬ 
ence,  with  the  United  States  being  the  most  domi¬ 
nant.  Nonetheless,  even  the  United  States  is  a 
distinct  minority  shareholder,  which  places  consid¬ 
erable  limits  on  how  much  it  can  directly  influence 
the  institution.  The  key  mechanisms  that  have 
shaped  the  World  Bank’s  governance  have  been 
senior  personnel  appointments  (especially  the 
President,  who  has  been  a  U.S.  national  since 
inception)  and  importantly,  the  nature  of  the  insti¬ 
tution’s  finances.  The  financial  design  of  the  Bank, 
relying  as  it  did  on  Wall  Street  initially  and  global 
financial  markets  in  later  years,  was  critical  in 
establishing  a  degree  of  political  autonomy  for  the 


Table  I  The  World  Bank  Group  Institutions 


Established 

Number  of 
Member 
Countries 

Operations 

International  Bank  for  Reconstruction  and 
Development  (IBRD) 

1944 

186 

Cumulative  lending:  $479  billion 

International  Development  Association 
(IDA) 

1960 

169 

Cumulative  commitments: 

$207  billion 

International  Finance  Corporation  (IFC) 

1956 

182 

Committed  portfolio:  $34.4  billion 

Multilateral  Investment  Guarantee  Agency 
(MIGA) 

1988 

174 

Cumulative  guarantees: 

$20.9  billion 

International  Centre  for  Settlement  of 
Investment  Disputes  (ICSID) 

1966 

143 

Total  cases  registered:  292 

Source:  Based  on  data  from  The  World  Bank  Annual  Report  2009. 
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institution,  although  this  came  at  the  expense  of 
greater  reliance  on  the  preferences  of  financial 
markets. 

This  began  to  change  with  the  advent  of  IDA  as 
the  market-based  autonomy  (from  governments) 
that  the  IBRD  built  for  itself  was  slowly  eroded  by 
the  public  resources  that  funded  IDA.  The  replen¬ 
ishment  procedures  of  IDA — its  periodicity  and 
burden-sharing  procedures — made  it  susceptible  to 
the  goodwill  of  major  shareholders.  In  any  burden¬ 
sharing  scheme,  the  largest  contributor  sets  the 
tone.  From  the  late  1960s  onward,  as  the  United 
States  began  a  long  process  of  reducing  its  financial 
share,  other  donors  began  to  link  their  contribu¬ 
tions  to  that  of  the  United  States — which  para¬ 
doxically  increased  the  bargaining  power  of  the 
United  States  even  as  its  contributions  declined. 
The  periodicity  meant  that  every  3  to  4  years,  new 
demands  could  be  made  on  the  institution.  The 
peculiarities  of  the  U.S.  budgetary  process  with 
annual  authorizations  ensured  that  the  exercise 
became  perennial  and  further  enhanced  U.S.  influ¬ 
ence.  Thus,  over  time,  the  Bank’s  overall  strategic 
direction  was  set  not  by  its  own  board  but  by  the 
IDA  deputies — the  personnel  representing  donor 
countries  who  were  charged  with  completing  the 
IDA  replenishment  process. 

The  reliance  on  governmental  monies  also 
resulted  in  a  shift  in  power  (most  acutely  in  the 
United  States)  from  the  executive  branch  to  the 
legislative  branch  and  nongovernment  actors,  a 
trend  that  accelerated  with  the  end  of  the  Cold 
War.  A  different  tack  was  taken  by  some  donors 
(particularly  the  Nordic  countries  and  Japan)  after 
the  1980s,  who  supplemented  the  institution’s 
budgetary  resources  through  “trust  funds,”  thereby 
seeking  to  shape  institutional  priorities  and  gover¬ 
nance  and  bypassing  the  Bank’s  budgetary  process. 

For  most  observers,  the  World  Bank’s  exercise  of 
power  and  influence  has  been  through  its  lending — 
who  it  lends  to,  for  what  purpose,  and  with  what 
conditions.  Although  the  Articles  of  Agreement  of 
the  World  Bank  specifically  proscribe  political  con¬ 
siderations  in  its  lending,  as  a  creature  of  nation¬ 
states,  whose  governance  has  long  been  dominated 
by  the  industrialized  countries  (and  especially  the 
United  States),  political  factors  inevitably  affected 
its  lending  priorities.  During  the  Cold  War,  foreign 
aid  was  an  instrument  of  the  West,  and  the  World 
Bank  was  firmly  embedded  in  this  architecture. 


Radical  and  left  governments  such  as  Chile  in  the 
Allende  era,  Vietnam  in  the  1970s  and  1980s,  and 
Nicaragua  and  Panama  in  the  1980s  were  denied 
Bank  lending.  With  the  end  of  the  Cold  War,  coun¬ 
tries  engaged  in  actions  deemed  hostile  to  Western 
interests  were  similarly  penalized — Iran  in  the 
1980s  and  1990s,  the  reduction  of  lending  to  India 
and  Pakistan  after  their  nuclear  tests  in  1998,  and 
the  hiatus  in  lending  to  China  in  1989  (after 
Tiananmen).  In  contrast,  the  continuation  of  lend¬ 
ing  to  countries  such  as  Egypt,  Pakistan,  and 
Russia,  despite  staff  concerns  about  their  policies 
in  different  years,  are  all  examples  of  lending  being 
swayed  by  political  pressures.  Nonlending  in  the 
1950s  and  1960s  and  later  during  the  1980s  debt 
crisis  and  the  1997-1998  Asian  financial  crisis  was 
at  least  in  part  shaped  by  pressure  from  major 
shareholders  to  press  debtor  countries  to  settle 
with  private  (mainly  Western)  creditors.  In  most 
cases,  however,  there  has  been  a  veneer  of  plausible 
justification  since  judgment  as  opposed  to  a  strict 
rule-based  approach  is  integral  to  the  Bank’s  work. 
If  the  political  biases  of  the  major  shareholders  had 
leaned  differently,  then  the  decisions  could  easily 
have  gone  the  other  way. 

The  purposes  of  World  Bank  lending  are 
another  key  mechanism  by  which  it  seeks  to 
influence  a  borrower.  Lending  focused  on  physi¬ 
cal  infrastructure  until  the  1960s,  agriculture  and 
rural  development  in  the  1970s,  policy-based 
loans  (so-called  structural  adjustment  lending)  in 
the  1980s,  and  human  capital  and  public  sector 
governance-related  loans  in  the  1990s.  More 
recently,  lending  has  focused  on  the  financial  sec¬ 
tor  and  global  public  goods,  and  is  done  in  sup¬ 
port  of  the  Millennium  Development  Goals.  As 
with  country  allocation  decisions,  lending  priori¬ 
ties  have  been  affected  by  a  specific  view  of  what 
constitutes  development  and  how  best  to  achieve 
these  goals  as  well  as  by  political  and  ideological 
considerations.  Indeed,  lending  by  the  Bank 
invariably  came  packaged  with  certain  messages 
and  conditions  on  what  to  do  and  how  to  do  it. 
Through  its  loan  conditions,  the  World  Bank  has 
acted  as  an  external  institution  of  restraint  as  it 
has  sought  to  restrain  developing  country  gov¬ 
ernments  from  engaging  in  poor  economic  poli¬ 
cies,  especially  microeconomic  policies  (the  IMF 
plays  the  same  role  with  regard  to  macroeco¬ 
nomic  policies).  The  wider  the  perceived  gap 
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between  norms  of  “sound”  economic  policies 
and  actual  policies,  the  more  important  this  role. 

In  early  decades,  this  role  of  the  Bank  was  tacit 
and  less  visible.  Later  on,  especially  in  the  1980s, 
with  the  advent  of  policy-based  structural  adjust¬ 
ment  lending,  with  specific  loan  conditions,  this  role 
became  more  overt  and  visible  (and  also  encoun¬ 
tered  much  greater  resistance).  However,  the  demise 
of  the  Soviet  Union  and  the  concomitant  shift  in 
global  ideological  currents  made  it  easier  for  the 
Bank’s  lending  to  press  for  neoliberal  economic 
policy  changes  and  equally,  to  tread  into  sectors  that 
were  politically  sensitive  and  previously  off-limits. 
Significantly,  the  Bank  began  to  stress  the  relation¬ 
ship  between  political  variables  in  borrowing  coun¬ 
tries  and  their  impact  on  the  poor.  “Governance,” 
or  the  quality  of  government  and  corruption-related 
issues,  began  to  be  debated  openly.  Rather  than 
dealing  exclusively  with  governments,  the  Bank 
began  to  work  with  civil  society  and  NGOs  in 
implementing  its  antipoverty  programs.  And 
acknowledging  that  the  fungibility  of  public 
resources  meant  that  money  channeled  to  a  govern¬ 
ment  may  or  may  not  go  to  the  official  objective,  the 
Bank  began  to  use  public  expenditure  reviews  to 
influence  the  overall  budgetary  priorities  of  borrow¬ 
ing  governments.  Lending  programs  began  focusing 
on  institutional  change  and  core  government  institu¬ 
tions  such  as  the  civil  service  and  judiciary. 

In  addition  to  lending,  the  Bank  has  always 
sought  to  set  the  development  agenda.  Its  didactic 
role,  through  its  research  and  information  interme¬ 
diary  roles,  has  arguably  been,  if  not  more  impor¬ 
tant  than  its  lending  role,  at  least  a  mechanism  that 
has  influenced  it.  The  Bank  has  always  sought  to 
portray  development  research  as  a  global  public 
good  and  used  that  to  obtain  greater  resources  for 
itself  for  this  purpose.  As  a  result,  it  emerged  as  the 
principal  producer  of  data  and  research,  giving  it  a 
legitimization  and  imprimatur  role  that  greatly 
leveraged  its  influence. 

Perhaps,  most  important,  the  Bank  has  exer¬ 
cised  influence  through  socializing  developing 
country  policy  elites.  Many  developing  country 
policy  elites  have  at  some  time  or  the  other  worked 
in  or  with  the  Bank.  By  educating,  convincing,  and 
exposing  LDC  (least  developed  country)  elites  to  a 
set  of  ideas  and  practices,  the  Bank  has  acted  to 
diffuse  global  neoliberal  economic  norms  and 
ideas.  Thus,  in  many  cases,  especially  in  countries 


with  a  reasonable  set  of  economic  institutions  and 
a  bureaucracy,  the  Bank  has  often  gone  into  coun¬ 
tries  whose  policy  doors  have  already  been 
unlocked  from  within  instead  of  being  forced  open 
through  loan  conditions. 

The  World  Bank’s  influence  has  been  a  function 
of  several  factors:  the  structural  power  of  its  major 
shareholders  in  the  world  system,  the  degree  of 
competition,  and  the  demand  for  its  money  and 
advice.  Until  the  1980s,  major  shareholders  had 
strong  influence,  but  the  imperatives  of  the  Cold 
War  also  limited  the  degree  to  which  the  Bank 
could  be  hegemonic.  With  the  end  of  the  Cold 
War,  while  the  influence  of  major  shareholders 
grew  even  more,  the  phenomenal  growth  of  inter¬ 
national  financial  markets  began  to  chip  away  the 
Bank’s  role  in  the  larger,  emerging  developing 
countries.  But  with  failures  inherent  in  financial 
markets,  the  boom-bust  cycle  in  private  financial 
flows  has  meant  that  to  the  extent  that  interna¬ 
tional  capital  flows  have  a  cyclical  component,  the 
importance  of  the  World  Bank  will  be  much  greater 
in  the  down  cycle.  Indeed,  in  times  of  crisis — 
whether  financial  or  in  the  aftermath  of  natural 
disasters — the  insurance  role  of  the  Bank  inevita¬ 
bly  gives  it  greater  influence. 

In  the  large  set  of  poor  developing  countries, 
where  private  financial  flows  have  been  limited — 
not  as  a  cyclical  but  as  a  trend  phenomenon — the 
Bank’s  soft  loan  facilities  (namely  IDA)  are  still 
important.  For  nearly  three  decades  (since  the 
early  1980s),  the  economic  travails  of  these  coun¬ 
tries  (especially  in  sub-Saharan  Africa)  made  them 
much  more  dependent  on  the  World  Bank.  More 
recently,  however,  a  revival  of  their  own  economic 
fortunes  and  growing  investments  from  China  and 
other  emerging  markets,  with  fewer  strings 
attached  and  with  much  lower  transaction  costs, 
has  given  these  countries  more  bargaining  power. 
Indeed,  major  shifts  in  global  economic  power  are 
bound  to  affect  the  governance  of  major  interna¬ 
tional  institutions  such  as  the  World  Bank  whose 
power  structures  still  reflect  the  world  as  it  was 
when  the  institution  was  established.  How  these 
shifts  influence  the  Bank’s  own  governance,  and  in 
turn,  its  roles  and  functions,  is  an  open  question. 

Devesh  Kapur 
University  of  Pennsylvania 
Philadelphia,  Pennsylvania,  United  States 
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World  Systems  Theory 


World  systems  theory  is  one  of  the  most  influen¬ 
tial  of  the  Marxist-inspired  theories  of  interna¬ 
tional  relations.  Its  impact  has  been  felt  across  the 
social  sciences  and  humanities,  influencing  not 
only  students  of  politics  but  also  historians,  geog¬ 
raphers,  and  sociologists.  The  focus  of  analysis  is 
on  a  global  economic  system,  which  is  the  source 
of  processes  of  historical  change  and  many  fea¬ 
tures  of  the  social  world.  The  key  influences  on 
world  systems  theory  include  Lenin’s  analysis  of 
imperialism,  the  work  of  dependency  school  theo¬ 
rists  such  as  Andre  Gunder  Frank,  and  the  French 
Annales  school,  in  particular  Fernand  Braudel. 
World  systems  theory  is  most  closely  associated 
with  the  work  of  Immanuel  Wallerstein,  and  his 
work  provides  the  focus  for  this  entry,  although 
the  general  approach  has  been  adapted,  devel¬ 
oped,  and  critiqued  by  a  number  of  other  writers. 

Key  Features  of  the  Theory:  An  Analysis 

Three  main  characteristics  distinguish  world  sys¬ 
tems  theory: 


1.  It  is  ontologically  and  epistemologically 
systemic. 

2.  It  provides  a  historical  explanation  of  the 
development  of  social  structures. 

3.  It  provides  an  account  of  the  spatial  division  of 
the  world. 

World  systems  theory  is  arguably  the  most 
systemic  approach  to  understanding  international 
relations.  As  such,  it  offers  a  radically  different  per¬ 
spective  on  world  politics.  Rather  than  a  world  of 
states,  diplomats,  and  international  organizations, 
the  focus  is  on  a  global  economic  system.  Waller- 
stein’s  earliest  research  was  on  the  characteristics  of 
poverty  in  Africa.  He  came  to  the  conclusion  that  it 
was  not  possible  to  understand  the  persistence  of 
poverty  by  studying  individual  countries.  Instead, 
the  poverty  of  specific  countries  could  be  under¬ 
stood  only  by  considering  their  position  within  a 
wider  economic  system  and  international  division 
of  labor.  That  a  higher  level  of  analysis  was  neces¬ 
sary  led  to  the  development  of  the  idea  of  a  world 
system.  The  “world”  in  world  systems  does  not 
necessarily  refer  to  a  global  reach.  Instead,  a  world 
system  is  the  smallest  unit  of  analysis  that  can  be 
described  as  a  stand-alone  entity,  or,  in  other 
words,  can  be  investigated  without  reference  to  ele¬ 
ments  outside  of  the  system.  Within  a  world 
system,  all  developments  can  be  analyzed  with  ref¬ 
erence  to  forces  within  the  system.  Further, 
Wallerstein  argues  that  the  character  of  a  world 
system  determines  all  features  of  the  social  world  at 
any  particular  time.  So,  for  example,  in  the  current 
world  system,  key  features  such  as  states,  markets, 
and  even  the  family  are  produced  through  the 
workings  of  the  system.  Additionally,  except  for 
periods  of  crisis,  when  a  system  is  about  to  collapse, 
there  is  very  little  potential  for  agency — in  other 
words,  there  are  few  possibilities  for  escaping  the 
constraints  of  the  system. 

A  second  key  feature  of  a  world  systems 
approach  is  that  it  provides  an  account  of  social 
change.  The  current  world  system,  which  Waller¬ 
stein  labels  the  “modern  world  system,”  is  one  of  a 
succession  of  world  systems.  There  have  been  two 
main  types  of  world  systems:  (1)  world  economies 
and  (2)  world  empires.  The  modern  world  system 
is  an  example  of  a  world  economy,  and,  according 
to  Wallerstein,  there  have  been  examples  of  world 
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systems  of  both  types.  What  differentiates  world 
systems  from  world  economies  is  the  system  of 
political  control.  In  world  empires,  there  is  one 
political  center  that  has  dominated  the  whole  sys¬ 
tem  (e.g.,  Rome  in  the  Roman  Empire),  whereas  in 
world  economies,  there  are  multiple,  competing 
centers  of  political  control.  In  the  modern  world 
system,  these  competing  systems  have  been  states. 
Historically,  world  empires  have  tended  to  be  more 
enduring  compared  with  world  economies.  The 
modern  world  system,  however,  has  been  remark¬ 
ably  stable.  A  number  of  factors  have  contributed 
to  this  stability,  one  being  the  efficiency  and  expan¬ 
sive  character  of  capitalism.  A  second  feature  has 
been  the  emergence  of  a  “geoculture,”  a  set  of  ideas 
and  norms  that  has  acted  to  provide  legitimation 
for  the  key  features  of  the  modern  world  system. 

In  addition  to  a  sequence  of  world  systems 
through  history,  each  world  system  has  its  own 
unique  history.  The  modern  world  system  is  no 
different  in  that  it  had  a  beginning,  a  middle,  and 
will  at  some  point  come  to  an  end.  The  origins  of 
the  modern  world  system  can  be  found  in  Europe 
in  the  16th  century,  where  a  combination  of  fac¬ 
tors  led  to  the  emergence  of  capitalism.  This  sys¬ 
tem  expanded  rapidly  and  ultimately  led  to  the 
incorporation,  for  the  first  time,  of  the  entire  globe 
within  a  world  system.  In  the  three  volumes  of  The 
Modern-World  System,  Wallerstein  outlines  in 
great  detail  the  early  expansion  of  this  system.  The 
developmental  path  of  the  modern  world  system 
has  been  influenced  by  rhythms  and  tendencies 
within  capitalism  itself.  Wallerstein  draws  on  the 
work  of  the  Russian  economist  Nikolai  Kondratieff, 
who  argued  that  capitalism  goes  through  succes¬ 
sive  waves  of  expansion  and  contraction  on  a  regu¬ 
lar  basis  (known  as  Kondratieff  waves).  Although 
these  waves  are  regular,  the  start  and  end  point  of 
each  wave  are  not  exactly  the  same,  meaning  that 
secular  trends  in  the  capitalist  system  can  be  detected 
(expansion  or  contraction).  Capitalism  is  also 
affected  by  contradictions,  and  Wallerstein  argues 
that  there  is  a  tendency  for  underconsumption — 
the  capitalist  system  produces  more  than  can  be 
consumed,  leading  to  a  decline  in  profitability.  For 
a  while,  the  world  system  can  absorb  contradic¬ 
tions;  however,  ultimately  they  lead  to  a  crisis, 
resulting  in  collapse  and  replacement  by  another 
world  system.  Much  of  Wallerstein’s  most  recent 
work  has  focused  on  the  end  of  the  world  system, 


as  his  analysis  indicates  that  the  modern  world 
system  is  now  in  a  state  of  terminal  decline  and 
will  be  replaced  by  some  other  form  of  system. 

The  final  defining  characteristic  of  Wallerstein’s 
world  systems  theory  is  the  division  of  space.  In 
the  modern  world  system,  there  are  three  main 
areas:  a  core,  a  periphery,  and  a  semiperiphery. 
The  core  constitutes  the  most  powerful  countries 
of  the  system.  These  have  the  highest  living  stan¬ 
dards,  the  most  effective  systems  of  social  provi¬ 
sion,  highest  education  levels,  most  profitable 
industries,  and  the  largest  coercive  forces.  The 
periphery  includes  the  most  exploited  countries  in 
the  system.  The  periphery  is  the  main  source  of 
raw  materials  for  use  in  manufacturing  in  the  core 
and  provides  a  market  for  the  core’s  manufac¬ 
tured  goods.  In  the  periphery,  states  are  typically 
weak  and  are  unable  to  provide  high  standards  of 
welfare  for  their  citizens  or  to  protect  their  popu¬ 
lations  from  the  depredations  of  core  states.  The 
countries  of  the  core  and  periphery  are  linked  by 
a  process  of  “unequal  exchange,”  the  implications 
of  which  are  that  wealth  is  systematically  extracted 
from  the  periphery  for  the  benefit  of  the  core.  This 
marks  an  important  distinguishing  feature  from 
more  traditional  Marxist  approaches.  Wallerstein 
sees  exploitation  as  deriving  from  exchange  rather 
than  being  a  result  of  the  production  process.  In 
between  the  core  and  periphery  lies  a  semiperiph¬ 
eral  zone.  The  countries  of  the  semiperiphery  are 
involved  in  some  manufacturing  but  primarily  in 
those  industries  that  are  no  longer  profitable  in 
the  core.  These  countries  are  also  involved  in  sup¬ 
plying  raw  materials  to  the  core.  The  semiperiph¬ 
ery  is  a  transitional  zone,  including  some  states 
that  were  once  in  the  core,  but  that  have  lost  their 
competitive  edge,  and  some  countries  that  have 
managed  to  escape  from  the  periphery.  The  semi¬ 
periphery,  according  to  Wallerstein,  plays  an 
important  part  in  stabilizing  the  modern  world 
system.  It  is  an  intermediate  zone  that  is  both 
exploiter  and  exploited.  In  its  intermediary  role,  it 
plays  important  functions  in  maintaining  the  sta¬ 
bility  of  the  system.  First,  by  dividing  the  opposi¬ 
tion  to  the  core  (and  sharing  some  of  the  core’s 
interests),  it  deflects  some  of  the  pressure  from  the 
periphery  for  changes  to  the  exploitative  system. 
Second,  it  also  acts  as  a  reserve  pool  of  labor, 
which  can  be  used  to  undermine  wage  militancy  in 
the  core. 
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Much  of  Wallerstein’s  recent  work  focuses  on 
the  more  immediate  developments  in  the  modern 
world  system.  The  collapse  of  the  Soviet  Union 
and  the  end  of  the  Cold  War  did  not  mean  the 
“end  of  history”  in  the  sense  that  there  was  now 
no  competitor  to  liberal  capitalism.  On  the  con¬ 
trary,  he  argues,  the  system  is  now  in  a  final  crisis. 
While  it  cannot  be  predicted  what  form  the  world 
system  (or  potential  world  systems)  will  take  that 
replaces  the  current  system,  ominously,  Wallerstein 
warns  that  it  may  be  replaced  by  an  even  less  just 
set  of  arrangements.  However,  in  this  point  of 
crisis  and  transition,  much  rests  on  personal 
choices  and  decisions.  The  determinism  of  the 
world  system  lessens,  and  there  is  much  more 
potential  for  individual  agency.  Wallerstein  advo¬ 
cates  the  study  of  “utopistics,”  a  consideration  of 
alternative  world  orders  that  could  be  more  equi¬ 
tably  organized,  as  a  means  of  pushing  the  world 
toward  a  more  just  future  system. 

Recent  Developments 

Wallerstein’s  work  has  been  taken  and  developed 
by  a  number  of  writers  within  a  world  systems 
school.  Some  approaches  have  been  more  sympa¬ 
thetic  to  Wallerstein’s  general  approach.  For 
example,  Christopher  Chase-Dunne  has  retained 
Wallerstein’s  basic  framework  and  historical  anal¬ 
ysis  but  has  focused  more  attention  on  the  devel¬ 
opment  and  roles  of  the  system  of  states  within 
the  modern  world  system.  Other  writers  have 
been  more  critical.  Janet  Abu-Lughold  argues  that 
rather  than  emerging  in  Europe,  the  European 
world  economy  was  a  peripheral  offshoot  of  an 
economic  system  that  had  its  core  in  the  Middle 
East.  Extending  the  analysis  even  further,  Andre 
Gunder  Frank  and  Barry  Gills  have  argued  that 
the  existence  of  a  world  system  can  be  traced  back 
5,000  years,  with  its  core  in  Asia.  As  such,  their 
work  offers  a  direct  challenge  to  the  notion  that 
capitalism  emerged  exceptionally  in  Europe — a 
view  held  by  most  analysts  of  capitalism  of  all 
perspectives. 

Conclusion 

The  world  systems  theory  provides  a  radically  dif¬ 
ferent  approach  to  the  understanding  of  interna¬ 
tional  relations.  Rather  than  being  a  set  of  relations 


primarily  between  states,  the  key  determinant  of 
social  relations  is  the  global  capitalist  system;  this 
system  is  the  source  of  the  main  features  of  the 
social  world,  and  the  fate  of  the  world  system  will 
have  impacts  for  all  social  relations.  While  being  a 
systemic  approach  to  international  relations,  it  also 
provides  the  basis  for  an  historical  and  geographi¬ 
cal  account  of  global  developments,  particularly 
the  division  of  the  world  into  differing  zones,  where 
the  life  chances  of  the  individual  will  be  radically 
different.  Finally,  if  Wallerstein  is  correct  that  the 
demise  of  the  world  system  is  imminent  and  that 
this  is  a  period  where  the  impacts  of  social  struc¬ 
tures  on  individual  agency  slacken,  then  it  is  also  a 
political  call  for  action  and  analysis  to  bring  about 
a  more  just  and  equitable  social  order. 

Stephen  Hobden 
University  of  East  London 
London,  United  Kingdom 
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World  Trade  Organization 

(WTO) 


The  World  Trade  Organization  (WTO)  is  the  heir 
of  the  General  Agreement  on  Trade  and  Tariffs 
(GATT).  The  GATT  treaty  was  hastily — almost  by 
default — adopted  in  1948,  and  GATT  as  an  insti¬ 
tution  was  barely  known  until  the  mid-1980s.  For 


World  Trade  Organization  (WTO)  2761 


many  years,  it  had  little  in  common  with  the 
International  Bank  for  Reconstruction  and  Devel¬ 
opment  (IBRD,  the  World  Bank)  or  the  Inter¬ 
national  Monetary  Fund  (IMF),  both  founded  in 
1944  with  great  unanimity  and  pomp.  Flowever, 
half  a  century  later,  the  WT O  is  widely  seen  as  an 
institution  that  is  as  important  as  the  two  others. 

Economics  and  Politics  of  Trade  Policy 

Such  success  is  due  largely  to  GATT’s  capacity  to 
fit  well  the  basic  economics  and  politics  of  trade 
policy,  a  crucial  heritage  for  the  WTO.  GATT  was 
largely  shaped  by  the  views  of  Cordell  FIull, 
Roosevelt’s  Secretary  of  State  (and  in  1945,  he 
won  the  Nobel  Peace  Prize  for  his  role  in  creating 
the  United  Nations).  Hull  believed  in  an  open 
world  trade  regime  based  on  no  discrimination 
among  countries  as  a  key  instrument  of  world 
prosperity  and  peace.  But  as  Hull  was  deeply 
involved  in  the  opening  of  U.S.  markets  to  foreign 
goods  after  1934,  he  was  well  aware  that  trade 
policy  required  both  economic  soundness  and 
political  realism — a  difficult  blend. 

Economic  soundness  values  trade  liberalization 
as  the  best  policy  (exceptions  are  very  limited  in 
practice).  It  is  in  the  best  interest  of  the  consum¬ 
ers,  and  consumers’  interest  is  the  widest  possible 
form  of  “general”  interest  (it  can  even  include 
future  generations,  as  in  the  case  of  environmental 
issues). 

But  consumers  are  too  numerous  to  build  strong 
coalitions  and  make  their  voices  heard.  This  cre¬ 
ates  a  deep  political  problem  because  it  leaves  gov¬ 
ernments  under  the  pressure  of  producers  alone. 
Fortunately,  producers  have  opposite  interests.  On 
the  one  hand,  domestic  producers  facing  increased 
competition  at  home  are  eager  to  maintain  or 
increase  the  level  of  domestic  protection.  They  are 
powerful  (they  are  often  close  to  the  political  estab¬ 
lishment  in  their  own  countries),  few  enough  to 
build  coalitions  easily,  and  induced  to  lobby  hard 
(they  gain  a  lot  from  keeping  things  unchanged). 
On  the  other  hand,  domestic  producers  who  are 
able  to  export  to  foreign  markets  support  the  open¬ 
ing  of  domestic  markets,  if  this  is  the  price  to  be 
paid  for  opening  the  foreign  markets  to  which  they 
could  export.  They  are  less  powerful  than  protec¬ 
tionist  interests:  They  are  often  embryonic  (they 
need  foreign  markets  to  expand),  more  diverse, 


and  more  focused  on  foreign  markets  to  conquer 
than  on  domestic  governments  to  lobby.  Their 
interests  are  close  to  those  of  the  consumers,  but 
their  influence  stops  as  soon  as  they  get  the  market 
access  they  want. 

In  short,  liberalizing  is  not  easy  and  requires 
political  will  from  governments.  Joining  efforts  in 
a  multilateral  (world)  forum  makes  such  an 
endeavor  politically  easier  while  amplifying  its 
expected  economic  benefits.  This  then  is  GATT’s, 
and  WTO’s,  role. 

GATT’s  Growing  Success  (1947-1986) 

In  1947,  the  Havana  Conference  tried  to  solve  this 
complex  blend  of  economics  and  politics  by  draft¬ 
ing  a  long  charter,  reflecting  the  diverging  views 
between  the  United  States  and  the  rest  of  the 
world,  led  by  the  United  Kingdom.  The  British 
view,  shaped  by  John  Maynard  Keynes,  did  not  see 
trade  as  a  key  instrument  to  achieving  world 
growth  and  stability;  hence,  it  was  favorable  to 
preferential  trade  agreements  and  moderately 
opposed  to  protection.  The  charter  ended  up  as  a 
stack  of  contradictory  provisions.  The  U.S.  Senate 
evidently  recognized  this  and  rejected  the  charter. 

This  decision  left  trade  negotiators  in  a  difficult 
situation.  They  had  already  agreed  on  tariff  cuts 
that  needed  to  be  consolidated  rapidly.  They  then 
took  the  charter’s  chapter  on  Commercial  Policy 
and  converted  it  into  the  GATT  text,  with  some 
limited  additions. 

The  GATT  text  suits  political  realism  by  stating 
three — only  three — key  rules: 

1.  The  “most  favored  nation”  (MFN)  rule  states 
that  a  member  lowering  a  tariff  (opening  up  a 
market)  has  to  do  so  for  the  same  goods  from 
all  the  other  GATT  members — rich  or  poor, 
weak  or  strong. 

2.  The  “national  treatment”  rule  states  that 
imported  and  locally  produced  goods  should  be 
treated  equally  after  the  foreign  goods  have 
entered  the  market. 

3.  The  “binding”  rule  states  that,  once  declared 
bound,  commitments  cannot  be  reneged  upon 
by  a  member  without  paying  compensation  to 
the  other  members  or  risking  retaliatory 
measures  from  them. 
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These  three  rules  have  little  to  do  with  liberaliza¬ 
tion.  A  protectionist-minded  country  can  abide  by 
them  as  easily  as  a  “free  trade”  country.  Their 
value  flows  from  the  trust  they  create.  GATT 
members  are  assured  that  they  will  not  be  dis¬ 
criminated  against  and  will  have  collective  ways  of 
dealing  with  breaches  of  commitments.  This  is 
GATT  as  a  “rule  maker.” 

GATT  “Rounds”  promote  economic  sound¬ 
ness.  A  Round  consists  in  GATT  members  negoti¬ 
ating  concessions  in  terms  of  tariff  cuts  or  other 
means  to  open  markets.  A  Round  is  launched 
when  enough  members  feel  ready  to  open  their 
economy  more  (the  United  States  was  mostly  in 
charge  of  testing  the  waters)  and  is  concluded 
when  members  feel  that  they  have  got  an  accept¬ 
able  deal.  The  agreements  are  achieved  by  consen¬ 
sus,  with  all  the  members  having  the  same  weight 
(there  is  no  vote).  This  is  GATT  as  a  “negotiating 
machinery.” 

Such  a  “collective”  liberalization  occurred  in 
seven  Rounds  scattered  between  1947  and  1986. 
It  generated  a  progressive — hence  politically 
manageable — liberalization  of  the  industrial  sec¬ 
tor  in  an  increasingly  large  number  of  countries. 
As  Hull  had  hoped,  trade  became  a  powerful 
engine  of  growth:  It  has  grown  almost  two  times 
faster  than  domestic  production  since  the  1970s. 

Of  course,  growth  is  not  enough.  Issues  such  as 
distribution  of  the  benefits  from  growth  are  key. 
But  economics  and  politics  converge  to  suggest 
that  such  issues  are  domestic  matters  heavily 
loaded  by  national  values.  Hence,  they  should  be 
handled  by  domestic  policies  (taxes,  subsidies, 
regulations)  that  GATT  rules  allow  in  the  vast 
majority  of  cases,  contrary  to  what  is  often  said. 

From  GATT  to  WTO: 

The  Uruguay  Round  (1986-1995) 

This  remarkable  success — nothing  comparable  hap¬ 
pened  during  the  19th  century — had  some  serious 
limits.  First,  in  1986,  GATT  was  a  de  facto  club  of 
rich  (or  rapidly  becoming  rich,  like  Japan  or  Korea) 
members.  Most  developing  countries  remained 
opposed  to  trade  liberalization.  Second,  GATT 
Rounds  covered  only  the  industrial  sector  (which 
ranges  from  20%  to  30%  of  GDP  for  most  coun¬ 
tries)  but  not  key  sectors  for  developing  countries 


(textiles).  Last  but  not  least,  agriculture  and  ser¬ 
vices  were  totally  untouched. 

The  key  objective  of  the  last  GATT  Round — the 
Uruguay  Round,  1986  to  1995 — was  precisely  to 
bring  these  untouched  sectors  under  GATT  rules, 
by  the  same  token  becoming  more  attractive  to 
developing  countries.  Agriculture  was  key  to 
changing  the  mind-set  in  countries  such  as  Brazil 
and  Argentina.  Services  played  the  same  role  for 
India  and  Pakistan.  This  new  mind-set  was  boosted 
by  China’s  rapid  growth  since  its  late  1970s 
reforms  (though  not  yet  a  GATT  member,  China 
was  rapidly  becoming  a  more  open  country  than 
most  developing  country  GATT  members). 

The  Uruguay  Round  results  have  been  mixed. 
New  market  openings  were  substantial  in  manu¬ 
facturing,  in  particular,  with  the  elimination  of 
nontariff  barriers  on  trade  in  goods  of  key  interest 
for  developing  countries  (textiles).  But  in  agricul¬ 
ture,  the  liberalization  process  was  so  badly 
designed  that,  despite  the  rhetoric,  market  open¬ 
ings  were  very  limited. 

Turning  to  a  broader  picture,  the  Round  suc¬ 
ceeded  in  expanding  the  GATT  negotiating 
machinery  to  services  and  intellectual  property 
rights.  It  also  established  a  “litigation  machinery,” 
with  a  dispute  settlement  mechanism  (DSM)  that 
was  much  more  credible  than  the  one  that  had 
existed  until  1995.  To  evaluate  some  of  these 
results  requires  caution.  Although  there  is  no 
doubt  that  the  potential  gains  from  liberalization 
in  services  are  huge,  the  case  of  intellectual  prop¬ 
erty  rights  is  more  debatable  (these  rights  consist  in 
granting  monopoly  rights,  possibly  endangering 
future  competition).  A  stronger  DSM  makes  it 
more  probable  that  WTO  members  will  litigate 
rather  than  negotiate.  Such  an  evolution  (not  yet 
observed)  would  be  worrisome  since  it  assumes  a 
drastic  change  of  mind  in  the  way  international 
relations  at  large  are  conceived. 

And  there  were  outright  failures.  The  Round 
was  unable  to  discipline  the  use  of  the  “escape 
clauses”  (safeguard,  antidumping,  and  antisub¬ 
sidy),  which  allow  members  to  reestablish  protec¬ 
tion  too  easily.  Even  more  important,  the  Round 
adopted  the  “Single  Undertaking”  approach — all 
members  shall  agree  on  all  the  results  of  a  Round 
(GATT  allowed  members  not  to  sign  some  agree¬ 
ments  at  the  end  of  a  Round).  Such  a  condition 
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deeply  rigidifies,  hence  weakens,  the  WT O  negoti¬ 
ating  process  since  it  gives  an  implicit  veto  to  any 
member. 


The  WTO:  Challenges  and  Ambitions 

While  the  Uruguay  Round  was  shaping  the  WTO, 
an  old  world  was  vanishing:  Centrally  planned 
economies  disappeared,  and  developing  countries 
opposed  to  freer  trade  had  begun  to  open  their 
domestic  markets.  These  changes  were  epitomized 
by  China’s  accession  to  the  WTO  in  2001  (today, 
one  third  of  the  Chinese  provinces  are  close  to 
becoming  as  rich  as  the  Central  European  coun¬ 
tries  that  are  members  of  the  European  Community). 

These  sweeping  changes  induced  the  WTO  to 
abandon  GATT’s  low-key  approach.  The  cautious 
preparations  of  GATT  Rounds  were  replaced  by  a 
rush  to  launch  a  new  Round  (the  European 
Community  proposed  to  launch  the  Millennium 
Round  5  years  before  the  end  of  the  enforcement 
of  the  Uruguay  Round).  The  price  was  high:  The 
TV  cameras  that  the  WTO  ministerial  meetings 
attracted  made  it  the  favorite  target  of  antiglobal- 
izers  of  all  kinds,  disturbing  WTO’s  first  years  of 
operation  and  making  cumbersome  the  launch  of 
the  Doha  Round  in  2001  (technically,  the  Round 
has  not  yet  been  officially  launched). 

However,  the  key  challenges  facing  the  WTO 
are  not  from  the  increasingly  scattered  antiglobal¬ 
ization  ranks.  Indeed,  most  Americans,  Europeans, 
Chinese,  and  Indians  realize  that  even  though 
globalization  has  its  drawbacks,  closing  their 
doors  to  foreign  products  would  be  a  much  more 
costly  solution.  Also,  the  current  economic  crisis 
has  not  so  far  witnessed  any  notable  move  toward 
reprotection. 

The  WTO’s  key  challenges  will  come  from  the 
new  world  that  is  emerging.  First,  there  is  an  ongo¬ 
ing  tectonic  shift  among  key  players.  GATT  greatly 
benefited  from  the  benevolent  role  of  the  United 
States.  The  United  States  was  always  very  careful 
to  keep  in  mind  the  interests  of  the  world  economy 
and  not  exploit  its  full  power  (if  the  United  States 
had  wanted  to  exploit  its  leverage  fully,  it  would 
have  concluded  bilateral  agreements,  not  pushed 
for  a  nondiscriminatory  world  trade  regime).  The 
rise  of  China  and  India  is  slowly  making  the 


United  States  one  key  player  among  a  few  others. 
Multipolarity  is  a  laudable  concept,  but  it  is  hard 
to  practice.  To  what  extent  the  four  to  six  largest 
economies  will  be  eager  to  play  collectively  the 
benevolent  role  that  the  United  States  played  alone 
for  the  past  50  years  is  a  crucial  open  question  for 
the  years  to  come. 

The  second  challenge  concerns  the  WTO 
negotiating  machinery.  In  trade  in  goods,  the 
WTO  will  be  doing  “GATT  business  as  usual”: 
eliminate  the  remaining  high  industrial  tariffs 
(all  the  members),  bind  more  firmly  and  system¬ 
atically  the  other  industrial  tariffs  (emerging 
and  developing  members),  and  cut  the  high  pro¬ 
tection  (tariffs  and  subsidies)  in  agriculture 
(industrial  members). 

Trade  liberalization  in  agriculture  will  be  par¬ 
ticularly  crucial.  Today,  the  issue  of  climate  change 
or  water  availability  is  widely  seen  as  antagonistic 
to  the  world  trade  regime  relying  on  GATT-WTO 
rules.  As  some  critics  have  pointed  out,  this  is  a 
major  mistake.  Climate  change  will  require  more — 
not  less — trade  in  agriculture.  The  half  dozen  mod¬ 
els  assessing  the  impact  of  climate  change  have 
only  one  common  result:  The  key  way  to  soften 
the  adjustments  required  by  climate  change  is  to 
facilitate  trade  among  countries.  In  short,  cuts  in 
barriers  to  trade  get  a  new  raison  d’etre — to  be  a 
key  tool  for  fighting  climate-driven  hunger  and 
water-driven  conflicts. 

Another  big  challenge  to  the  WTO  negotiating 
machinery  is  services.  Because  they  amount  to 
50%  to  70%  of  GDP,  opening  these  markets  will 
offer  huge  gains  to  consumers  for  many  decades 
to  come.  But  negotiating  concessions  in  services  is 
notoriously  difficult.  On  what  basis  can  opening 
a  domestic  audiovisual  sector  be  considered 
equivalent  to  the  opening  of  a  foreign  distribution 
sector?  More  crucially,  such  negotiations  require 
trust  in  trading  partners  because  services  liberal¬ 
ization  consists  in  a  dynamic  and  long  process  of 
domestic  regulatory  reforms,  which  are  often 
hard  to  anticipate  when  negotiating  the  initial 
agreements. 

Such  a  trust  cannot  be  delivered  by  the  WTO 
because  the  regulatory  reform  capacities  of  its 
members  are  too  heterogeneous.  This  makes  it 
attractive  to  negotiate  agreements  in  services  among 
a  narrower  group  of  countries  (“plurilateral” 
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agreements).  Such  agreements  could  be  negotiated 
during  (probably  more  frequent)  WTO  Rounds, 
but  they  could  also  be  initiated  outside  the  WTO 
and  then  repatriated  under  the  WTO  rules  of  non- 
discriminatory  binding. 

In  short,  the  systemic  shift  of  focus  from  goods 
to  services  suggests  that  the  role  of  the  WTO  as  a 
negotiating  body  may  become  less  prominent.  But 
the  role  of  the  WTO  as  a  rule  maker  is  as  crucial 
as  ever,  buttressed  by  its  capacity  for  litigation. 

Patrick  A.  Messerlin 
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Zionism 


The  word  Zionism  was  created  in  the  late  19th 
century  to  describe  an  ideology  and  a  political 
movement  whose  objective  was  the  rebuilding  of 
a  Jewish  nation  within  a  political  framework.  The 
goal  was  defined  as  a  Jewish  national  home  until 
1942,  when  the  establishment  of  a  state  was  offi¬ 
cially  endorsed  by  the  Zionists.  Although  nour¬ 
ished  by  the  2,000-year-old  Jewish  commitment 
to  the  land  of  Israel  (Zion),  and  by  religious  prac¬ 
tice,  Zionism  is  not  merely  a  simple  extension  of 
this  longing.  It  basically  embodies  a  nationalism, 
deeply  marked  by  the  European  nationality  move¬ 
ment  of  the  second  half  of  the  19th  century, 
whose  project  is  highly  political.  However,  from 
the  start,  this  project  was  contested  both  within 
the  Jewish  world  and,  more  forcefully  on  the 
ground,  by  the  Arab  majority  in  Palestine,  which 
saw  Zionism  as  a  settler  movement. 

Zionism  and  Its  Jewish  Critics 

The  Zionist  project  came  up  at  a  time  of  crisis  for 
the  Jewish  world,  which  faced  a  double  challenge 
during  the  19th  century.  The  first  challenge  was 
internal,  linked  with  the  weakening  of  traditional 
community  structures,  speeded  up  by  the  growing 
interaction  between  Jews  and  their  surrounding 
societies.  The  second  challenge  was  external  and 
stemmed  from  the  hostility  that  Jews  continued 
to  face  in  European  societies,  be  it  the  “old 


anti-Judaism”  based  on  religious  grounds  or  the 
“new,  racial  anti-Semitism,”  which  was  aimed  pri¬ 
marily  at  newly  assimilated  Jews.  Zionism  pre¬ 
sented  itself  as  a  way  to  overcome  this  double 
challenge  by  inventing  a  new  form  of  togetherness, 
a  national  one.  As  Theodor  Herzl,  the  founder  of 
political  Zionism,  wrote  in  his  manifesto  The 
Jewish  State  (1896),  the  “Jewish  question”  was 
first  and  last  a  national  issue  that  had  to  be 
resolved  by  the  building  of  a  state  for  the  Jewish 
people. 

Half  a  century  later,  this  objective  was  reached 
with  the  founding  of  the  state  of  Israel  in  May 
1948.  The  eventual  success  of  Zionism  should  not 
obscure  the  fact,  however,  that  among  the  Jews  the 
issue  was  passionately  debated  and  met  with 
strong,  sometimes  fierce,  opposition  from  some 
quarters  of  the  Jewish  world. 

The  critique  of  Zionism  took  three  main  forms. 
Those  who  chose  to  assimilate  had  implicitly 
rejected  the  basic  premise  of  Zionism  concerning 
the  historical  continuity  of  the  Jewish  people.  For 
the  followers  of  Marxism  or  supporters  of  political 
liberalism,  the  Jews  were  no  longer  a  people;  they 
were  just  human  beings  following  a  particular 
faith  who  had  to  take  part  either  in  the  general 
revolutionary  struggle  against  capitalism  or  in  the 
political  debates  within  the  states  of  which  they 
had  become  citizens  during  the  emancipation  pro¬ 
cess.  The  second  criticism  came  from  the  followers 
of  Jewish  ultra-orthodoxy  (who  were  numerous  in 
Eastern  Europe):  They  fully  accepted  the  idea  that 
the  Jews  were  a  people — but  a  people  apart  from 
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the  others,  with  a  strictly  religious  calling.  The 
longing  for  political  normalization,  which  was  the 
aim  of  Zionism,  was  condemned  because  it  broke 
with  this  unique  destiny.  The  third  form  of  opposi¬ 
tion  was  driven  by  other  Jewish  nationalisms, 
particularly  the  one  advocated  by  the  Bund 
(General  Jewish  Labor  Union  of  Lithuania,  Poland, 
and  Russia),  which  campaigned  for  the  recognition 
of  the  cultural  autonomy  of  Jewish  communities 
and  their  political  rights  in  the  diaspora. 

If  Zionism  at  the  beginning  was  clearly  a  minor¬ 
ity  movement,  how  did  it  manage  to  become  an 
almost  undisputed  fact  within  the  Jewish  world? 

Reasons  for  the  Historical  Success  of  Zionism 

Efforts  to  explain  why  Zionism  was  historically 
successful  have  focused  on  three  reasons:  (1)  its 
strong  ideological  coherence,  (2)  its  capacity  for 
mobilization  of  support,  and  (3)  its  political 
record.  Among  the  various  Jewish  nationalisms 
that  came  to  life  in  the  late  19th  century,  Zionism 
was  the  most  inclusive.  Unlike  its  main  rival,  the 
Bund,  which  wanted  to  be  the  representative  of  the 
Jewish  proletariat  and  thus  excluded  large  seg¬ 
ments  of  the  Jewish  people,  Zionism  had  as  a  goal 
the  creation  of  a  “home  secured  by  public  law,” 
which  was  able  to  attract  a  wide  range  of  Jews 
regardless  of  their  cultural,  political,  and  social 
differences.  Its  ideology  was  comprehensive 
enough  to  attract  both  middle-class  people  and 
workers,  religious  and  laypeople,  Ashkenazim 
(European  Jews)  and  Sephardim  (Jews  from 
Islamic  countries),  supporters  of  the  left  and  fol¬ 
lowers  of  the  right.  Of  course,  ideological  cleav¬ 
ages,  sometimes  marked,  persisted,  and  in  the 
1930s,  the  struggle  was  fierce  between  socialist 
Zionism,  the  dominant  force  in  Palestine  as  well  as 
within  the  World  Zionist  Organization,  led  by 
David  Ben  Gurion,  and  its  rival,  right-wing  revi¬ 
sionist  Zionism,  led  by  Vladimir  Jabotinsky. 
Nevertheless,  the  differences  (e.g.,  on  the  nature  of 
the  future  society  or  the  pace  of  state  building) 
were  always  subordinated  to  the  primary  objec¬ 
tive:  the  achievement  of  political  independence 
within  a  national  framework. 

Second,  Zionism  was  the  sole  Jewish  nationalism 
that  reached  the  ultimate  stage  of  development — 
namely,  mass  mobilization.  Zionism  was  defended 
by  an  intelligentsia — composed  chiefly  of  journalists, 


teachers,  and  students — that  strove  to  revive  the 
Hebrew  culture  and  provide  it  with  a  “political 
roof.”  Although  in  Western  Europe,  Zionism 
remained  a  minority  phenomenon  in  Jewish  com¬ 
munities,  where  the  goal  for  many  was  integration 
into  the  mainstream  culture,  it  became  a  genuine 
mass  movement  in  Poland,  Bulgaria,  and  Romania, 
where  historically  the  most  violent  anti-Jewish 
pogroms  had  occurred.  (In  Russia,  the  seizure  of 
power  by  the  Bolsheviks  quickly  led  to  a  ban  on 
Zionist  activity.)  It  also  had  considerable  success 
in  South  Africa,  where  most  of  the  Jewish  immi¬ 
grants  were  of  Lithuanian  origin.  Zionism  had 
more  difficulty  in  winning  over  the  Jews  in  the 
United  States  because  its  nationalist  creed  clashed 
with  the  dominant  ideology  of  Americanization  of 
the  Jews,  but  it  nonetheless  gained  influence  in  the 
1940s  with  the  gradual  discovery  of  the  genocide 
of  European  Jews.  Although  the  Holocaust  did  not 
“produce”  Zionism,  it  appeared  like  a  tragic  con¬ 
firmation  of  its  central  thesis — namely,  the  vulner¬ 
ability  of  Jews  in  the  diaspora.  From  then  on, 
Zionism — often  manifested  as  unconditional  sup¬ 
port  for  Israel — had  no  difficulty  in  rallying 
around  its  cause  the  vast  majority  of  the  Jewish 
world. 

Finally,  the  historical  rise  of  Zionism  was 
greatly  advanced  by  its  positive  political  record, 
while  its  Jewish  political  rivals  could  not  show  any 
tangible  results.  On  the  one  hand,  Zionism  was 
recognized  as  an  international  political  force  in 
1917  with  the  Balfour  Declaration,  in  which 
Britain  declared  itself  in  favor  of  the  establishment 
of  a  national  home  for  the  Jewish  people  in 
Palestine.  On  the  other  hand,  in  the  interwar 
period,  Zionism  also  asserted  itself  as  a  socio¬ 
political  reality  in  Palestine.  Gradually,  an  inde¬ 
pendent  Jewish  society,  regularly  strengthened  by 
the  arrival  of  new  immigrants,  took  shape — with 
its  own  economy,  Hebrew  culture,  political  par¬ 
ties,  and  institutions.  This  quasi-Jewish  state 
lacked  only  full  sovereignty,  which  it  finally 
gained,  first  through  endorsement  by  the  interna¬ 
tional  community  (the  United  Nations  resolution 
of  November  29,  1947),  then  by  the  force  of  arms 
in  1948,  during  the  war  against  the  Palestinians 
and  the  neighboring  Arab  countries,  which  had 
rejected  from  the  start  the  political  rationale  of 
Zionism.  Indeed,  like  Janus,  Zionism  always  had 
two  faces:  (1)  a  legitimate  national  self-determination 
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movement  for  the  Jews  and  (2)  an  enterprise  of 
gradual  dispossession  for  the  Arabs. 

Zionism  Seen  From  the  Arab  Side: 

A  Colonial  Process 

The  political  equation  Zionism  had  to  solve 
seemed  intractable.  Indeed,  in  1897,  when  the 
Zionist  Organization  was  created,  its  leaders 
claimed  for  the  Jewish  people,  99%  of  whom  were 
living  in  the  diaspora,  the  land  of  Palestine,  which 
culturally  and  demographically  was  Arab,  the  Jews 
being  only  a  small  minority  of  5%.  Zionism  could 
only  hope  to  reverse  this  unfavorable  trend  by 
building  a  strong  territorial  foothold,  despite  the 
opposition  of  the  native  population,  and  with  the 
help  of  an  imperial  outside  power,  namely,  Great 
Britain.  In  practice,  the  fulfillment  of  Zionism  had 
to  come  through  colonial  settlement,  which  seemed 
perfectly  legitimate  in  the  age  of  imperialism. 
Apart  from  a  small  minority  of  left-wing  Jewish 
intellectuals  who  defended  the  idea  of  binational- 
ism,  the  majority  of  Zionists  shared  the  then  com¬ 
mon,  dominant  Orientalist  vision,  which  viewed 
non-Europeans  as  irrelevant.  The  latter  rejected 
outright  this  ideological  depreciation  and  fought 
forcefully  against  Zionism  in  the  interwar  period. 
To  become  reality,  the  Zionist  dream  had  to  be 
imposed  on  the  natives,  who  refused  to  relinquish 
or  compromise  their  own  right  to  self-determination. 
Of  course,  unlike  other  colonial  settlers  overseas 
(e.g.,  the  United  States,  Australia),  the  Jewish 
immigrants  were  symbolically  connected  with  the 
land  in  which  they  had  settled,  but  for  the  Palestin¬ 
ian  population,  this  did  not  make  any  difference: 
“For  the  Palestinian,  Zionism  has  appeared  to  be 
an  uncompromisingly  exclusionary,  discrimina¬ 
tory  colonialist  praxis”  (Edward  W.  Said,  1980, 
p.  69).  The  return  of  the  Jews  was,  for  the  Arabs, 
akin  to  a  conquest. 

The  year  1948  was  a  turning  point:  Both  the 
local  Arabs  and  the  Arab  states  refused  the  parti¬ 
tion  plan  because  they  saw  it  as  basically  unfair. 
War  was  inevitable  and  led  to  a  massive  Arab 
depopulation  of  what  became  Israel.  The  authori¬ 
ties  of  the  new  state  chose  to  make  this  demo¬ 
graphic  upheaval  irreversible:  Palestinian  refugees 
were  immediately  barred  from  returning,  while  at 
the  same  time  the  small  Arab  minority  remaining 


in  the  state  of  Israel  was  controlled  and  margin¬ 
alized.  These  policies  could  not  but  strengthen 
the  Arab  conviction  that  Zionism  was  structur¬ 
ally  linked  with  the  subjection  of  Palestinians,  a 
conviction  that  was  restated  after  1967,  when  in 
the  newly  occupied  territories  (Golan  Heights, 
East  Jerusalem,  West  Bank,  Gaza  Strip,  Sinai) 
the  state  itself  encouraged  the  building  of  new 
settlements. 

Reconciling  the  Jewish  and  Arab  narratives  on 
Zionism  seems,  to  many,  utterly  unrealistic. 
Perhaps  the  best  hope  is  that  the  former  acknowl¬ 
edge  that  their  national  movement  has  had  tremen¬ 
dous  negative  consequences  for  the  native  society, 
which  has  retained  the  inalienable  right  to  build  a 
state  of  its  own,  while  the  latter  recognize  that, 
despite  its  initial  colonial  component,  the  Israeli 
polity  has  become  a  legitimate  reality  in  the  Middle 
East.  In  other  words,  a  two-state  solution  may  be 
the  only  credible  way  out  of  a  situation  of  perma¬ 
nent  conflict.  Many  observers  believe  that  such  a 
solution  should  be  the  objective  of  the  peace  pro¬ 
cess  launched  in  the  1990s  between  Israel  and  the 
Palestine  Liberation  Organization,  but  its  realiza¬ 
tion  remains  uncertain  as  it  is  caught  in  the  com¬ 
plexity  of  vexing  issues  such  as  the  status  of 
Jerusalem,  the  question  of  Jewish  settlements,  and 
the  plight  of  Palestinian  refugees. 
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critique  of  liberalism,  2:405,  2:410 
definition,  2:405 
as  ideology,  2:405 
individualism,  4:1176-1177 


main  currents  of  thought,  2:407-410 
as  philosophy,  2:405 

See  also  Christian  Right;  Fundamentalisms; 
Neo-conservative  idealism 
Conservative  dictatorships,  3:656 
Conservative  parties,  2:410-413 
British,  2:406,  2:411-412,  6:1801,  6:1802 
definition,  2:410 
electoral  performance,  2:412 
in  Europe,  2:412-413 
evolution,  2:411 

organization,  2:411-412,  6:1801 
political  principles,  2:410-411 
privatization  policies,  7:2133 
role  in  party  systems,  2:412 — 413,  5:1427 
social  bases,  2:412,  6:1794 
See  also  Christian  Democratic  parties 
Considerant,  Victor,  8:2450 
Consistency.  See  Attitude  consistency 
Consociational  democracy.  See  Democracy,  consociational 
Consolidation,  democratic.  See  Democratic  consolidation 
Constitution,  U.S. 

Bill  of  Rights,  2:419,  5:1430,  6:1720,  6:1721,  7:2319, 
7:2337 

Establishment  Clause,  1:236 

executive  branch,  3:864 

federalism,  3:896 

Fifth  Amendment,  7:2138 

First  Amendment,  1:236,  3:825,  6:1720,  6:1721 

goals,  6:1723 

judicial  review,  5:1373 

liberalism,  5:1440 

presidential  system,  7:2123 
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republic,  7:2289-2290 

separation  of  powers,  1:4,  1:6,  1:10,  1:17-19,  8:2403, 
8:2404,  8:2405-2407 
Twenty-Second  Amendment,  3:864 
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definition,  2:413 
duration,  2:415 
participants,  2:414-415 
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See  also  Separation  of  powers 
Constitutional  monarchy,  5:1613,  5:1614,  7:2290 
Constitutional  patriotism,  5:1649,  5:1657,  6:1833-1834 
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constraints  on  government,  6:1722-1723 
definition,  2:416 
democratic,  2:353 
goals,  2:416,  6:1722-1723 
identity  building,  5:1649 
international,  4:1243 
judiciary,  5:1383-1384 
legal,  2:416,  2:418-420,  4:1037 
legislative  voting  rules,  8:2718,  8:2719-2720 
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rule  of  law  and,  7:2336-2337,  7:2338,  7:2340 
statism  and,  8:2513 
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Construct  validity,  5:1515-1517 
Constructionism,  2:421 
Constructivism,  2:421-425 
contextual,  2:422 
definition,  2:421 
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on  ethics,  3:826-827 
frames,  2:422^123,  6:1924 
globalization,  4:976 
identity,  4:1136,  7:2113 
influence,  2:425 
international  law,  4:1241 
intersubjectivity,  2:428 
methodologies,  3:798 

mutual  constitution  of  structures  and  agents,  2:428-429 

perspective  on  crime,  2:423-424 
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policy  models,  6:1879-1881 
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in  social  sciences,  2:421-424,  6:1879 
sovereignty,  8:2471 
strict,  2:422 
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terminology,  2:421 
view  of  reality,  3:796-798,  6:1879 
See  also  Discourse  analysis 
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anarchic  world,  1:79,  4:1292 
building  blocks,  2:428-430 
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development,  2:426^128,  2:430 
domestic  politics  and,  3:694-695 
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epistemic  communities,  3:787-791 
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norms,  2:424-425 
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rise  of,  2:426 
security  studies,  8:2550 
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alternatives,  2:433 

during  Cold  War,  2:431-432,  2:433,  4:1280 
engagement  and,  2:433 
objectives,  2:431 
origins  of  concept,  2:431-432 
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Content  validity,  5:1514-1515 
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definition,  2:434 
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gender,  2:443-444 
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Catholic  concepts,  2:461 
contemporary,  2:459,  5:1670-1673 
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definition,  2:458 
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history,  2:459-461,  5:1668 
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model,  2:459 
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effects,  2:479 
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Cost-benefit  analysis  (CBA),  2:483-486 
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international  conventions,  4:1240 
international  coordination,  8:2593 
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deterrence,  2:486-487 
of  negotiated  settlements,  6:1841 
nonmajoritarian  institutions,  2:487-488 
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by  terrorist  groups,  8:2594,  8:2596 
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Crimes  against  humanity,  4:968-969,  5:1336 
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See  also  Judicial  systems;  Law 
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definition,  2:489,  2:494 
legitimacy,  3:623,  3:624 
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market  failures  and,  5:1491-1492 
natural  disasters,  3:668 
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political,  2:489^193 
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political  dimension,  2:495 
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Critical  rationalism.  See  Rationalism,  critical 
Critical  theory,  2:497-501 
communicative  action,  2:499,  4:1196 
definition,  2:497 
deliberative  democracy,  2:500 
on  ethics,  3:826-827 
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mass  communications,  5:1535-1536 
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power,  7:2099 

public  sphere,  2:499-500 
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critique  of  ideology,  2:502-503 
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influence,  2:503 

international  political  economy,  4:1264 
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Cross-case  studies,  1:194,  1:195,  1:196 
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Cross-national  surveys,  2:504-508 
archives,  6:1971-1972 
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International  Social  Survey  Programme,  2:506,  6:1778, 
7:2357 

methodological  issues,  2:506,  2:507 
social  trust  questions,  8:2672-2673 
See  also  Survey  research 

Cross-sectional  data.  See  Panel  data  analysis;  Time-series 
cross-section  data  and  methods 
Cross-tabular  analysis,  2:508-510 
correspondence  analysis,  2:468-474 
definition,  2:508 
use  of,  2:510 

Crouch,  Colin,  2:581-582 
Crowds,  psychology  of,  5:1503,  6:2000 
CSCE.  See  Conference  on  Security  and  Cooperation  in 
Europe 

CSOs.  See  Civil  society  organizations 
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Bay  of  Pigs  invasion,  4:1059,  4:1212,  4:1213 
communist  party,  2:319 
socialist  system,  2:311,  8:2457,  8:2460 
stability,  8:2477 

Cuban  Missile  Crisis,  5:1652,  7:2366 
Cultural  hegemony,  3:820,  3:821,  3:822,  7:2074 
Cultural  nationalism,  7:2072-2074 
Cultural  pluralism,  6:1869-1870,  6:1871-1872 
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Cultural  rights,  7:2319-2320 
Cultural  studies,  5:1536 
Cultural  violence,  7:2079 
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definition,  2:510-511 
neo-,  2:512-513 
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in  political  science,  2:511-512,  2:513,  7:2163 
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collective  identity,  1:278 
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effects  of  globalization,  4:980 
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organizational,  6:1749,  6:1750,  7:2216 
political  sociology,  6:2029-2031 
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national,  5:1620 
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Currency  crises.  See  Financial  crises 
Cusack,  Thomas,  6:1984 
CVM.  See  Civic  voluntarism  model 
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on  elites,  3:766,  3:768,  6:1956,  6:2025,  7:2104 
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on  opposition,  6:1745-1746 
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pluralism,  6:1869,  6:1870,  6:1871,  7:2225 
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coding,  5:1558-1559 
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imperfections,  1:212,  5:1598-1599 
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definition,  2:515 
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Data,  spatial,  2:520-525 
regression  analysis,  2:522-525 
spatial  relations,  2:520-522 
use  in  political  science,  2:520 
Data,  textual,  2:525-530 
advantages,  2:526-527 

analysis,  2:525,  2:527,  2:529-530,  5:1558-1559 
challenges,  2:526 
definition,  2:525 
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preprocessing,  2:528-529 
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